
[VL] Enable enhanced tests for spark 4.0 & fix failures#11868

Open
infvg wants to merge 1 commit into apache:main from infvg:icebergspark40fix

Conversation

Contributor

@infvg infvg commented Apr 2, 2026

This PR enables enhanced tests for Spark 4.0 and fixes failing SQL queries in Iceberg caused by the new metadata columns.

@infvg infvg force-pushed the icebergspark40fix branch 2 times, most recently from 2887032 to ef0d7ac on April 2, 2026 08:54
@zhouyuan zhouyuan changed the title Enable enhanced tests for spark 4.0 & fix failures [VL] Enable enhanced tests for spark 4.0 & fix failures Apr 2, 2026
auto inputRowVector = batch.getRowVector();
auto inputRowType = asRowType(inputRowVector->type());

// Filter columns to match the expected schema (rowType_)
Contributor

The metadata columns should be appended at the end or the beginning of the schema, and the number of metadata columns should be a fixed value, so could we simplify the logic?

Contributor

Also, the metadata column names are specific names, so we only need to match the pattern to decide whether a column is a metadata column. Could you show an example schema to help us understand this issue?

Contributor Author

I think we can simplify it by using field IDs. Field IDs greater than Integer.MAX_VALUE - 200 are reserved for metadata columns:
https://iceberg.apache.org/spec/#reserved-field-ids

Contributor Author

We can just slice the columns and remove any additional columns that appear at the end, so we don't have to add any loops.

Contributor

Yes, that's what I want

@infvg infvg force-pushed the icebergspark40fix branch 5 times, most recently from e6dc9d7 to c34f5b7 on April 8, 2026 18:47
// Filter out metadata columns from the Spark output schema and reorder to match Iceberg schema
// Spark 4.0 may include metadata columns in the output schema during UPDATE operations,
// but these should not be written to the Iceberg table
val schemaFieldMap = schema.fields.map(f => f.name -> f).toMap
Contributor

You could use IntelliJ to debug here and see the difference between writeSchema and schema: StructType; also use slice to take only some of the columns.

@infvg infvg force-pushed the icebergspark40fix branch 2 times, most recently from a12d8da to 73c9e38 on April 8, 2026 20:31
Member

zhouyuan commented Apr 8, 2026

@infvg Thanks for the fix. Please fix the CI.

dataSink_->appendData(batch.getRowVector());
auto inputRowVector = batch.getRowVector();

auto outputRowVector = std::make_shared<RowVector>(
Contributor

Why do you need it? Can you just set the rowType_?

Co-authored-by: Yuan <yuanzhou@apache.org>
@infvg infvg force-pushed the icebergspark40fix branch from 73c9e38 to 7704838 on April 10, 2026 17:28

3 participants