Spec: Clarify record-related snapshot summaries refer to physical layout #16005

Open
dannycjones wants to merge 1 commit into apache:main from dannycjones:minor-spec-update-clarify-physical-records

Conversation

@dannycjones

This is a minor clarification that the record-related fields in snapshot summaries refer to the physical layout (i.e. records in data and delete files) rather than logical records (i.e. active records that are valid to return in queries).

While implementing a compaction algorithm, I found it unclear what these values referred to. I have reviewed the Java implementation and confirmed that these fields, at least in Java, refer to the physical layout: https://github.com/apache/iceberg/blob/main/core/src/main/java/org/apache/iceberg/SnapshotSummary.java#L322-L325.
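
To illustrate the distinction, here is a minimal sketch in Python. It is a toy model, not Iceberg code: the `DataFile` class, the `physical_summary` helper, the file names, and the row counts are all hypothetical, and the summary key names `added-records` and `deleted-records` are assumed for illustration. It shows a compaction that rewrites one data file whose rows are partly masked by deletes. Counted physically, the summary reports rows added and removed, even though the set of rows a query would return is unchanged.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataFile:
    """Hypothetical stand-in for a data file entry in a manifest."""
    name: str
    record_count: int  # rows physically stored in the file


def physical_summary(added, removed):
    """Sum physical row counts of the files added and removed by a commit.

    Rows masked by delete files are NOT subtracted: this counts what is
    stored in the files, not what a query would return.
    """
    return {
        "added-records": sum(f.record_count for f in added),
        "deleted-records": sum(f.record_count for f in removed),
    }


# Assumed scenario: the old file stores 100 rows, 10 of which are already
# masked by position deletes. Compaction rewrites it without the masked rows.
old_file = DataFile("data-old.parquet", 100)
new_file = DataFile("data-new.parquet", 90)

summary = physical_summary(added=[new_file], removed=[old_file])

# Logically, 90 rows were visible before and 90 are visible after.
logical_rows_before = old_file.record_count - 10
logical_rows_after = new_file.record_count

print(summary)
print(logical_rows_after - logical_rows_before)
```

Under these assumptions the physical counters differ (90 added, 100 removed) while the logical row count does not change, which is the ambiguity the wording change is meant to remove.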

I acknowledge this changes the specification (the format/ directory). Please let me know if there is a preferred way to proceed; I'm hoping this is a simple clarification.

@github-actions github-actions bot added the Specification label (Issues that may introduce spec changes.) on Apr 16, 2026