Header and Data Separation ADR

Abstract

Separating the header and data structures in Rollkit allows the sequencing scheme to expand beyond a single sequencer and enables a decentralized sequencer mode. The list of transactions can then be created by another network, while nodes still produce headers after executing that list. The change is akin to proposer-builder separation in the Ethereum protocol: the Rollkit header producer acts as the proposer, and the sequencer, which produces the list of transactions, acts as the builder.

Before Separation

flowchart LR
    CS[Single Sequencer] -->|Creates| B[Block]
    B -->|Contains| SH1[SignedHeader]
    B -->|Contains| D1[Data]

    class CS,B,SH1,D1 node

After Separation

flowchart LR
    HP[Header Producer] -->|Creates| SH2[SignedHeader]
    SEQ[Sequencer] -->|Creates| D2[Data]
    SH2 -.->|References via DataCommitment| D2

    class HP,SEQ,SH2,D2 node

Protocol/Component Description

Previously, Rollkit supported only a single sequencer, which was responsible for creating a list of transactions by reaping its mempool, executing them to produce a header, and putting both together in a block. Rollkit headers and data were encapsulated within a single block structure. The block struct looked like this:

// Block defines the structure of Rollkit block.
type Block struct {
	SignedHeader SignedHeader
	Data         Data
}

Separating header and data into distinct structures allows them to be processed independently. The SignedHeader struct now carries only the header information, while the Data struct carries the transaction data. This separation is particularly useful for based sequencing, where users submit transactions directly to the Data Availability (DA) layer, which then acts as the entity responsible for creating the list of transactions.

classDiagram
    class Block {
        SignedHeader
        Data
    }

    class SignedHeader {
        Header
        Signature
    }

    class Header {
        ParentHash
        Height
        Timestamp
        ChainID
        DataCommitment
        StateRoot
        ExtraData
    }

    class Data {
        Metadata
        Txs
    }

    Block *-- SignedHeader
    Block *-- Data
    SignedHeader *-- Header

This change also affects how full nodes sync. Previously, full nodes would apply the transactions from the Block struct and verify that the header in SignedHeader matched their locally produced header. Now, full nodes obtain the transaction data separately (directly from the DA layer in based sequencer mode, or via p2p gossip or the DA layer in single sequencer mode) and, once they have both components, verify it against the header signed by the header producer. If a full node receives the header or data via the p2p gossip layer, it should wait to see the same header and data on the DA layer before marking the corresponding block as finalized in its view.

This ensures that data integrity and consistency are maintained across the network.

// SignedHeader struct consists of the header and a signature
type SignedHeader struct {
    Header // Rollkit Header
    Signature  Signature // Signature of the header producer
    ...
}

// Header struct focusing on header information
type Header struct {
    // Hash of the previous block header.
    ParentHash Hash
    // Height represents the block height (aka block number) of a given header
    Height uint64
    // Block creation timestamp
    Timestamp uint64
    // The Chain ID
    ChainID string
    // Pointer to location of associated block data aka transactions in the DA layer
    DataCommitment Hash
    // Commitment representing the state linked to the header
    StateRoot Hash
    // Arbitrary field for additional metadata
    ExtraData []byte
}

// Data defines Rollkit block data.
type Data struct {
	*Metadata // Defines metadata for Data struct to help with p2p gossiping.
	Txs Txs // List of transactions to be executed
}

The publishBlock method in manager.go now creates the header and data structures separately. This decoupling allows the header to be submitted to the DA layer independently of the block data, which can be built by a separate network. The change supports the transition from single sequencer mode to decentralized sequencer mode and makes the system more modular.
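The decoupling described above can be sketched roughly as follows. This is a minimal illustration, not the code in manager.go: the `commit` and `buildHeader` helpers are hypothetical, the structs are trimmed versions of the ones shown earlier, and the DataCommitment is assumed here to be a SHA-256 digest over length-prefixed transactions, whereas the ADR describes it as a pointer to the Data on the DA layer.

```go
package main

import (
	"crypto/sha256"
	"encoding/binary"
	"fmt"
)

// Hash is assumed to be a 32-byte digest for this sketch.
type Hash [32]byte

// Tx and Txs follow the names used in the ADR; the concrete types are assumptions.
type Tx []byte
type Txs []Tx

// Data carries only the ordered transaction list (Metadata omitted for brevity).
type Data struct {
	Txs Txs
}

// Header is a trimmed version of the Header struct from the ADR.
type Header struct {
	ParentHash     Hash
	Height         uint64
	ChainID        string
	DataCommitment Hash
	StateRoot      Hash
}

// commit is a hypothetical commitment function: SHA-256 over each
// transaction prefixed with its length, so tx boundaries are unambiguous.
func commit(d Data) Hash {
	h := sha256.New()
	for _, tx := range d.Txs {
		var lenBuf [8]byte
		binary.BigEndian.PutUint64(lenBuf[:], uint64(len(tx)))
		h.Write(lenBuf[:])
		h.Write(tx)
	}
	var out Hash
	copy(out[:], h.Sum(nil))
	return out
}

// buildHeader (hypothetical) creates a header that references the data only
// through its commitment. The state root is supplied by whoever executed the data.
func buildHeader(parent Hash, height uint64, chainID string, d Data, stateRoot Hash) Header {
	return Header{
		ParentHash:     parent,
		Height:         height,
		ChainID:        chainID,
		DataCommitment: commit(d),
		StateRoot:      stateRoot,
	}
}

func main() {
	// The sequencer (or another network) produces the data...
	data := Data{Txs: Txs{Tx("tx-a"), Tx("tx-b")}}

	// ...and the header producer builds the header separately after execution.
	var stateRoot Hash // placeholder for the post-execution state root
	hdr := buildHeader(Hash{}, 1, "test-chain", data, stateRoot)

	// Anyone holding both pieces can check that they belong together.
	fmt.Println(hdr.DataCommitment == commit(data)) // prints: true
}
```

Because the header refers to the data only through the commitment, the two structures can be submitted and gossiped on their own and matched up later.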

Message Structure/Communication Format

Header Producer

Before the separation: Only the entire Block struct, composed of both header and data, was submitted to the DA layer. The Block and the SignedHeader were gossiped over two separate p2p layers: the Block to full nodes only, and the SignedHeader to full nodes and to future light nodes that will sync only headers (and proofs).

After the separation: The SignedHeader and Data are submitted separately to the DA layer. The Header inside the SignedHeader is linked to the Data via a DataCommitment from the DA layer. The SignedHeader and Data are gossiped over two separate p2p layers: the Data to full nodes only, and the SignedHeader to full nodes and to future light nodes that will sync only headers (and proofs).

In based sequencing mode, the header producer is equivalent to a full node.

Before Separation

flowchart LR

    CS1[Single Sequencer] -->|Submits Block| DA1[DA Layer]
    CS1 -->|Gossips Block| FN1[Full Nodes]
    CS1 -->|Gossips SignedHeader| LN1[Light Nodes]

    class CS1,DA1,FN1,LN1 node

After Separation - Single Sequencer Mode

flowchart LR

    CS2[Single Sequencer] -->|Submits Data| DA2[DA Layer]
    HP2[Header Producer] -->|Submits SignedHeader| DA2

    CS2 -->|Gossips Data| FN2[Full Nodes]
    HP2 -->|Gossips SignedHeader| FN2
    HP2 -->|Gossips SignedHeader| LN2[Light Nodes]

    class CS2,HP2,DA2,FN2,LN2 node

After Separation - Based Mode

flowchart LR

    Users -->|Submit Txs| DA3[DA Layer]
    FN3[Full Node/Header Producer] -->|Reads Data| DA3

    class Users,DA3,FN3 node

Syncing Full Node

Before the separation: Full nodes get the entire Block struct via p2p or the DA layer. They can apply the block as soon as they receive it via p2p, or wait to see it on the DA layer, depending on whether the full node opts in to the p2p layer. Gossiping the SignedHeader over p2p is primarily for light nodes to get the header.

After the separation: Full nodes get the Data struct and the SignedHeader struct separately over the p2p and DA layers. In code, these correspond to the HeaderStore and the DataStore in the block manager. A full node should wait until it has both the Data struct and the corresponding SignedHeader before applying the block data to its state machine. This lets the full node verify that the state commitment in its locally produced header, computed after applying the Data for a block, is consistent with the Header inside the SignedHeader received from the header producer. The Header should link to its associated Data via a DataCommitment, which is a pointer to the location of the Data on the DA layer.

sequenceDiagram
    participant FN as Full Node
    participant P2P as P2P Network
    participant DA as DA Layer
    participant SM as State Machine

    Note over FN,DA: After Separation - Sync Process

    P2P->>FN: Receive Data
    P2P->>FN: Receive SignedHeader
    FN->>DA: Verify Data availability
    FN->>DA: Verify SignedHeader availability
    FN->>FN: Match Data with SignedHeader via DataCommitment
    FN->>SM: Apply Data to state machine
    FN->>FN: Verify locally produced header matches received Header
    FN->>FN: Mark block as finalized
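
The sync flow above (wait for both pieces, match them via the DataCommitment, apply, then compare state roots) could look roughly like the sketch below. All names here (`Syncer`, `TryApply`, `commit`, `execute`) are hypothetical and do not mirror the HeaderStore or DataStore in the block manager. The state machine is a toy that hashes the previous root with the data commitment, and the DA availability checks are left out.

```go
package main

import (
	"crypto/sha256"
	"errors"
	"fmt"
)

type Hash [32]byte

type Data struct{ Txs [][]byte }

type Header struct {
	Height         uint64
	DataCommitment Hash
	StateRoot      Hash
}

// commit is an assumed commitment scheme: hash of the per-transaction hashes.
func commit(d Data) Hash {
	h := sha256.New()
	for _, tx := range d.Txs {
		sum := sha256.Sum256(tx)
		h.Write(sum[:])
	}
	var out Hash
	copy(out[:], h.Sum(nil))
	return out
}

// execute is a toy state machine: newRoot = SHA-256(prevRoot || commit(data)).
func execute(prev Hash, d Data) Hash {
	c := commit(d)
	return Hash(sha256.Sum256(append(prev[:], c[:]...)))
}

var ErrNotReady = errors.New("waiting for header or data")

// Syncer buffers headers and data that arrive independently, keyed by height.
type Syncer struct {
	headers   map[uint64]Header
	data      map[uint64]Data
	stateRoot Hash
	height    uint64
}

func NewSyncer() *Syncer {
	return &Syncer{headers: map[uint64]Header{}, data: map[uint64]Data{}}
}

func (s *Syncer) AddHeader(h Header)            { s.headers[h.Height] = h }
func (s *Syncer) AddData(height uint64, d Data) { s.data[height] = d }

// TryApply applies the next height only when both pieces are present and consistent.
func (s *Syncer) TryApply() error {
	next := s.height + 1
	h, haveHeader := s.headers[next]
	d, haveData := s.data[next]
	if !haveHeader || !haveData {
		return ErrNotReady
	}
	if commit(d) != h.DataCommitment {
		return errors.New("data does not match header DataCommitment")
	}
	root := execute(s.stateRoot, d)
	if root != h.StateRoot {
		return errors.New("locally computed state root differs from header")
	}
	s.stateRoot, s.height = root, next
	delete(s.headers, next)
	delete(s.data, next)
	return nil
}

func main() {
	s := NewSyncer()
	d := Data{Txs: [][]byte{[]byte("tx-a"), []byte("tx-b")}}

	s.AddData(1, d)
	fmt.Println(s.TryApply()) // header has not arrived yet

	s.AddHeader(Header{Height: 1, DataCommitment: commit(d), StateRoot: execute(Hash{}, d)})
	fmt.Println(s.TryApply()) // both present and consistent
}
```

Buffering by height keeps the arrival order of header and data irrelevant: whichever arrives second triggers the successful apply.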

In single sequencer mode, before a full node marks a block as finalized, it should verify that both the SignedHeader and the Data associated with it were made available on the DA layer, either by checking the DA layer directly or by verifying DA inclusion proofs.

In based sequencing mode, blocks can be finalized instantly, since the Data is always derived directly from the DA layer and already exists there. There is no need for a SignedHeader to exist on the DA layer.

sequenceDiagram
    participant DA as DA Layer
    participant FN as Full Node
    participant SM as State Machine

    Note over DA,FN: Based Sequencing Mode

    DA->>FN: Data already available
    FN->>FN: Read Data from DA
    FN->>FN: Execute transactions
    FN->>FN: Produce Header
    FN->>SM: Apply state changes
    FN->>FN: Finalize Block
    Note right of FN: No need to submit SignedHeader to DA
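
The two finality rules can be summarized in a small sketch. The `Mode`, `BlockStatus`, and `isFinal` names are hypothetical and exist only for illustration. The sketch assumes the node already knows, from a direct check or an inclusion proof, whether each piece is on the DA layer.

```go
package main

import "fmt"

// Mode selects the sequencing scheme (names are assumptions for this sketch).
type Mode int

const (
	SingleSequencer Mode = iota
	Based
)

// BlockStatus records what a full node knows about one block.
type BlockStatus struct {
	Applied    bool // Data executed and header verified locally
	HeaderOnDA bool // SignedHeader seen on the DA layer
	DataOnDA   bool // Data seen on the DA layer
}

// isFinal encodes the finality rule for each mode.
func isFinal(m Mode, st BlockStatus) bool {
	if !st.Applied {
		return false
	}
	switch m {
	case Based:
		// Data is read from the DA layer, so it is available by construction,
		// and no SignedHeader needs to be posted there.
		return true
	case SingleSequencer:
		// Both pieces must be available on the DA layer.
		return st.HeaderOnDA && st.DataOnDA
	default:
		return false
	}
}

func main() {
	gossiped := BlockStatus{Applied: true} // received over p2p only
	fmt.Println(isFinal(SingleSequencer, gossiped)) // prints: false
	fmt.Println(isFinal(Based, gossiped))           // prints: true
}
```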

Assumptions and Considerations

Considerations include ensuring that headers and data are correctly synchronized and validated to prevent inconsistencies.
Ensure that all components interacting with headers and data are updated to handle them as separate entities.
Security measures should be in place to prevent unauthorized access or tampering with headers and data during transmission and storage.
Performance optimizations may be necessary to handle the increased complexity of managing separate header and data structures, especially in high-throughput environments.
Testing and validation processes should be updated to account for the new structure and ensure that all components function correctly in both single and decentralized sequencer modes.

Implementation

The implementation of this separation can be found in the Rollkit repository, specifically in the changes made to the manager.go file. The publishBlock method illustrates the creation of separate header and data structures and the associated logic for handling them independently. See Rollkit PR #1789.
