feat(experimental): Add schema change support for BigQuery#499

Merged
iambriccardo merged 198 commits into main from riccardo/feat/ddl-support-3 on Apr 20, 2026
Conversation

@iambriccardo
Contributor

@iambriccardo iambriccardo commented Dec 11, 2025

Summary

This PR adds experimental schema-change support for BigQuery pipelines.

At a high level, the pipeline can now observe Postgres schema changes, persist versioned table schemas in its own state store, and use that information to decide what schema should exist in the destination at a given point in the replication stream.

How It Works

Schema changes are captured transactionally from Postgres with a DDL event trigger. When an ALTER TABLE happens, the trigger emits a logical replication message in the same transaction, so schema changes stay ordered with the row-level changes that follow.

The pipeline stores table schemas as versioned records keyed by snapshot_id. The initial schema is stored at 0/0, and later schema versions use the LSN associated with the DDL change. This gives the system deterministic schema indexing: every DDL message identifies the schema version by the LSN carried in the stream. On startup or recovery, the pipeline loads the latest schema version whose snapshot_id is less than or equal to the current flush position.
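The LSN-keyed lookup described above can be sketched as follows (a minimal sketch; the names and types are illustrative assumptions, not the PR's actual API):

```rust
use std::collections::BTreeMap;

/// Versioned table schema, keyed externally by an LSN-backed snapshot_id.
/// The initial schema lives at snapshot_id 0; each DDL change stores a new
/// version at the LSN of its replication message.
#[derive(Debug, Clone, PartialEq)]
struct TableSchema {
    columns: Vec<String>,
}

/// Latest schema version whose snapshot_id is <= the current flush position,
/// i.e. the schema that should exist in the destination at that point.
fn schema_at(versions: &BTreeMap<u64, TableSchema>, flush_lsn: u64) -> Option<&TableSchema> {
    versions.range(..=flush_lsn).next_back().map(|(_, schema)| schema)
}
```

Because the map is ordered by snapshot_id, startup and recovery reduce to one range query: any flush position between two DDL events deterministically resolves to the older version.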

To avoid relying on the destination catalog as the source of truth, this PR replaces legacy table mappings with destination table metadata. That metadata stores the last applied schema version, the previous version, and a replication mask describing which columns are actively replicated. Destinations receive replicated schema information with events, so they can diff and apply schema changes directly instead of reloading schema ad hoc.
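The diff step a destination performs might look roughly like this (a hedged sketch with hypothetical names; a real schema diff would compare types, nullability, and ordinals, not just column names):

```rust
/// Given the previously applied column set and the column set carried with an
/// event, compute what the destination would need to add and drop. This is an
/// illustration of the diff-and-apply idea, not the PR's actual API.
fn diff_columns(old: &[String], new: &[String]) -> (Vec<String>, Vec<String>) {
    let added: Vec<String> = new.iter().filter(|c| !old.contains(*c)).cloned().collect();
    let dropped: Vec<String> = old.iter().filter(|c| !new.contains(*c)).cloned().collect();
    (added, dropped)
}
```

The point of carrying schema information with events is that this diff runs against persisted metadata, never against a reload of the destination catalog.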

Key Design Points

Consistent Schema Loading

This PR makes schema loading consistent by using the same schema-description path for both the initial load and later schema updates. That means the schema shape the pipeline stores and the schema shape it later uses for diffing come from the same source of truth, which reduces drift between bootstrap and steady-state replication.

Replication Masks

A key structure in the design is the replication mask. Stored schemas can contain the full table definition, but not every column is necessarily replicated because publications can apply column-level filtering. The replication mask tells the system which columns in a schema version are actually active for replication.

That separation is important because it lets us:

  • keep a stable, complete schema history for each table
  • apply publication-level column filtering without mutating stored schema versions
  • diff and apply destination schema changes against the columns that are actually replicated
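A minimal sketch of how a replication mask selects active columns (illustrative names; in the PR the mask lives in the destination table metadata):

```rust
/// The stored schema keeps every column; the mask marks which ones are
/// actively replicated. Publication-level column filtering thus selects a
/// subset without mutating the stored schema version itself.
fn replicated_columns(schema: &[&str], mask: &[bool]) -> Vec<String> {
    schema
        .iter()
        .zip(mask.iter())
        .filter_map(|(col, &active)| if active { Some(col.to_string()) } else { None })
        .collect()
}
```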

Migration Assumption

The storage migrations in this PR are intentionally designed as a one-way upgrade to keep the rollout simple and fast. In practice, that means multiple pipelines pointing at the same source database cannot safely run with mixed old and new state-store code.

If a source database has more than one pipeline using these ETL tables, they need to be upgraded together. Otherwise, an older pipeline can stop working after the migrated schema is used by a newer pipeline, for example on the next schema-store write, state-store write, or schema change.

Crash-Safe Properties

The crash-safety model is based on the assumption that we always know which schema the destination should have by looking at persisted metadata, not by inspecting the destination itself.

The metadata stores which schema version is currently applied, which version was applied before it, and a status that signals whether a schema transition completed. Because schema versions are indexed deterministically by LSN-backed snapshot_id, the system can recover by reloading the schema version for the current replication position and comparing it with the metadata for the destination.

Today, the status field is intentionally simple. In practice, it mainly signals whether a schema transition finished cleanly or whether the destination should be treated as corrupted. If a destination table is left in the applying state, the current interpretation is that the schema change may have failed partway through, and the destination is no longer trustworthy without cleanup or a restart.

This is a pragmatic first step rather than a full repair model. Different destinations can later implement more sophisticated repair or reconciliation flows, but for now the engine mainly uses the status to detect potentially corrupted state and stop pretending the destination is healthy.

BigQuery still does not provide fully atomic multi-statement DDL, so schema application can fail partway through. This PR makes that state explicit through metadata such as applied and applying, but the exact recovery and repair semantics should be defined more clearly in follow-up work.
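The recovery rule described in this section can be sketched as a pure decision function (all names here are illustrative assumptions, not the PR's actual types):

```rust
/// Whether the last schema transition completed cleanly.
#[derive(Debug, PartialEq)]
enum SchemaStatus {
    Applied,
    Applying,
}

#[derive(Debug, PartialEq)]
enum RecoveryAction {
    /// Metadata matches the schema version for the replication position.
    Continue,
    /// A newer schema version exists for this position; apply the diff.
    ApplySchemaChange,
    /// A transition was interrupted; the destination may be corrupted.
    TreatAsCorrupted,
}

struct TableMetadata {
    /// snapshot_id of the schema version last applied in the destination.
    applied_version: u64,
    status: SchemaStatus,
}

/// Decide what to do on startup, trusting only persisted metadata and the
/// expected schema version for the current replication position.
fn recovery_action(meta: &TableMetadata, expected_version: u64) -> RecoveryAction {
    match meta.status {
        // A table stuck in `applying` means the DDL may have failed partway
        // through; without repair the destination state cannot be trusted.
        SchemaStatus::Applying => RecoveryAction::TreatAsCorrupted,
        SchemaStatus::Applied if meta.applied_version == expected_version => {
            RecoveryAction::Continue
        }
        SchemaStatus::Applied => RecoveryAction::ApplySchemaChange,
    }
}
```

Because the decision never inspects the destination catalog, it stays deterministic across restarts.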

What Changed

  • capture ALTER TABLE changes in the replication stream with a DDL event trigger
  • store multiple schema versions per table using snapshot_id
  • replace legacy table mappings with destination table metadata, including schema state and replication masks
  • pass replicated schema information with events so destinations can diff and apply changes directly
  • add support for composite primary-key ordinals in schema storage
  • update integration tests to match the migrated etl.table_columns layout, including primary_key_ordinal_position

Validation

  • cargo test -p etl-api --test main --all-features pipelines:: -- --nocapture

Next Steps

  • clean up outdated schema snapshots once the retention condition is met; this can land in a follow-up PR because customers can start using the feature safely before cleanup exists
  • better define schema-change semantics, especially around column deletion and whether operations should behave as hard deletes, soft deletes, or other destination-specific policies
  • better define recovery semantics around applying state and partial schema application, including what cleanup or repair guarantees the engine should provide
  • extend the schema-change model to other destinations in follow-up work; this PR is intentionally focused on BigQuery first

Contributor

@farazdagi farazdagi left a comment

Absolutely amazing work. I've done a first pass and the architecture looks solid.

Most of the comments are just nitpicks, so feel free to ignore when resolving.

@iambriccardo iambriccardo enabled auto-merge (squash) April 20, 2026 06:54
@iambriccardo iambriccardo merged commit 08a59b7 into main Apr 20, 2026
12 checks passed
@iambriccardo iambriccardo deleted the riccardo/feat/ddl-support-3 branch April 20, 2026 06:57
