deps: [DO NOT MERGE] Start testing what will be DataFusion 54.0 by mbutrovich · Pull Request #3916 · apache/datafusion-comet

mbutrovich · 2026-04-08T21:40:03Z

Starting to test what will be DataFusion 54.0 to see if that helps us find issues in DataFusion by the time the RC comes around. Lately we haven't finished the Comet-side work in time to test an RC, so we only find issues after the .0 release and then have to wait for at least a .1 release before we can bump.

mbutrovich · 2026-04-16T00:58:50Z

Added a test for the Spark SQL failure on a FULL join whose filter evaluates to NULL. Testing the fix now: bumped the DataFusion dependency to test apache/datafusion#21660.
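
For readers who want the expected behavior spelled out, here is a minimal sketch of a full outer join whose filter can evaluate to NULL. It is plain Rust with no DataFusion or Spark types. The `Out` enum, `filter`, and `full_outer_join` are hypothetical names invented for illustration, the rows are made up (this is not the SPARK-43113 reproducer), and the rows are assumed to already share the same join key.

```rust
/// One output row of the join, carrying only the `b` column of each side.
#[derive(Debug, PartialEq, Clone, Copy)]
enum Out {
    Matched(Option<i32>, Option<i32>),
    LeftOnly(Option<i32>),
    RightOnly(Option<i32>),
}

/// Three-valued evaluation of `l.b < (r.b + 1)`: NULL if either input is NULL.
fn filter(l_b: Option<i32>, r_b: Option<i32>) -> Option<bool> {
    match (l_b, r_b) {
        (Some(l), Some(r)) => Some(l < r + 1),
        _ => None,
    }
}

/// Full outer join over rows that are assumed to share the same join key.
fn full_outer_join(left: &[Option<i32>], right: &[Option<i32>]) -> Vec<Out> {
    let mut right_matched = vec![false; right.len()];
    let mut out = Vec::new();
    for &l in left {
        let mut left_matched = false;
        for (j, &r) in right.iter().enumerate() {
            // A NULL filter result counts as "not satisfied".
            if filter(l, r).unwrap_or(false) {
                out.push(Out::Matched(l, r));
                left_matched = true;
                right_matched[j] = true;
            }
        }
        if !left_matched {
            out.push(Out::LeftOnly(l));
        }
    }
    // Emit right rows that never passed the filter as null-joined rows.
    for (j, &r) in right.iter().enumerate() {
        if !right_matched[j] {
            out.push(Out::RightOnly(r));
        }
    }
    out
}
```

With a left row `b = 1` and a right row `b = NULL`, the filter is NULL, so this sketch emits two unmatched rows (`LeftOnly` and `RightOnly`) rather than one matched row.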

# Conflicts: # native/Cargo.lock

…er evaluates to NULL (#21660)

## Which issue does this PR close?

- Closes #.

## Rationale for this change

While enabling `SortMergeJoinExec` with filters in [DataFusion Comet](https://github.com/apache/datafusion-comet), I hit a correctness bug in DataFusion's `SortMergeJoinExec` for full outer joins with nullable filter columns. The bug was originally surfaced via the [SPARK-43113](https://issues.apache.org/jira/browse/SPARK-43113) reproducer.

When a join filter expression evaluates to `NULL` (e.g., `l.b < (r.b + 1)` where `r.b` is `NULL`), the full outer join treats the row pair as **matched** instead of **unmatched**. Per SQL semantics, `NULL` in a boolean filter context is `false` (not satisfied), so the rows should be emitted as separate unmatched rows.

The bug has been present since filtered full outer join support was added to SMJ in #12764 / #13369. It was never caught because:

1. The join fuzz tests generate filter column data with `Int32Array::from_iter_values()`, which never produces `NULL` values.
2. No existing unit test or sqllogictest exercised a full outer join filter that evaluates to `NULL`.

## What changes are included in this PR?

**Root cause:** The full outer join code path had a special case that preserved raw `NULL` values from the filter expression result (`pre_mask`) instead of converting them to `false` via `prep_null_mask_filter` like left/right outer joins do. This caused two problems:

1. **Streamed (left) side:** In `get_corrected_filter_mask()`, `NULL` entries in `filter_mask` are treated as "pass through" (for pre-null-joined rows from `append_nulls()`). But when the filter expression itself returns `NULL`, those entries also appear as `NULL` in the mask and get incorrectly treated as matched. This produced wrong join output (matched rows instead of unmatched).
2. **Buffered (right) side:** `BooleanArray::value()` was called on `NULL` positions in `pre_mask` to update `FilterState`. At NULL positions, the values buffer contains a deterministic but semantically meaningless result (computed from the default zero-storage of NULL inputs). For some rows this value happens to be `true`, which incorrectly marks unmatched buffered rows as `SomePassed` and silently drops them from the output.

**Fix:** Remove the full outer join special case in `materializing_stream.rs`. All outer join types now uniformly use the null-corrected `mask` (where `NULL` → `false` via `prep_null_mask_filter`) for both deferred filtering metadata and `FilterState` tracking. Semi/anti/mark joins are unaffected; they use `BitwiseSortMergeJoinStream`, which already converts NULLs to `false`.

**Tests:**

- New unit test `join_full_null_filter_result` reproducing the SPARK-43113 scenario with a nullable right-side column.
- Modified `make_staggered_batches_i32` in `join_fuzz.rs` to inject ~10% `NULL` values into the filter column (`x`), so the fuzz tests exercise `NULL` filter handling across all join types.

## Are these changes tested?

Yes.

- New unit test (`join_full_null_filter_result`) directly reproduces the bug.
- Existing 57 SMJ unit tests all pass.
- All 41 join fuzz tests pass with the new nullable filter column data, including `test_full_join_1k_filtered`, which compares `HashJoinExec` vs `SortMergeJoinExec` and would have caught this bug if the fuzz data had included `NULL`s.
- Will run 100 iterations of the fuzz tests overnight to shake out any remaining nondeterministic issues.
- Testing in Comet CI (all Spark SQL tests) apache/datafusion-comet#3916

## Are there any user-facing changes?

Full outer sort-merge joins with filters involving nullable columns now produce correct results. Previously, rows where the filter evaluated to `NULL` were incorrectly included as matched; they are now correctly emitted as unmatched (null-joined) rows.
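
The buffered-side problem described above can be illustrated with a small model of a nullable boolean column. This is a sketch under stated assumptions, not the DataFusion or Arrow code: `NullableBools` and `eval_filter` are hypothetical names, and the struct only imitates the idea of a values buffer kept separately from a validity buffer, with NULL inputs stored as zero.

```rust
/// Hypothetical stand-in for a nullable boolean column:
/// a values buffer plus a separate validity buffer.
struct NullableBools {
    values: Vec<bool>,
    valid: Vec<bool>,
}

impl NullableBools {
    /// Raw read of the values buffer; ignores validity entirely.
    fn value(&self, i: usize) -> bool {
        self.values[i]
    }

    /// Null-corrected mask: every NULL position becomes `false`.
    fn null_to_false(&self) -> Vec<bool> {
        self.values
            .iter()
            .zip(&self.valid)
            .map(|(&v, &ok)| v && ok)
            .collect()
    }
}

/// Evaluates `l_b < r_b + 1` on raw storage (NULL stored as 0, an assumption)
/// and tracks validity separately.
fn eval_filter(l_b: &[Option<i32>], r_b: &[Option<i32>]) -> NullableBools {
    let values: Vec<bool> = l_b
        .iter()
        .zip(r_b)
        .map(|(l, r)| l.unwrap_or(0) < r.unwrap_or(0) + 1)
        .collect();
    let valid: Vec<bool> = l_b
        .iter()
        .zip(r_b)
        .map(|(l, r)| l.is_some() && r.is_some())
        .collect();
    NullableBools { values, valid }
}
```

For `l_b = [0, 5]` against `r_b = [NULL, NULL]`, both filter results are NULL, yet the raw `value(0)` read is `true` (computed from the zero stored for the NULL input) while `value(1)` is `false`. Using `null_to_false()` instead yields `false` at both positions, which matches the "NULL is not satisfied" rule regardless of what the values buffer happens to hold.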

mbutrovich added 2 commits April 8, 2026 17:37

Start testing what will be DataFusion 54.0.

931f2bc

fix expandexec

8d922b5

mbutrovich closed this Apr 14, 2026

mbutrovich reopened this Apr 15, 2026

mbutrovich and others added 7 commits April 15, 2026 15:10

Bump commits.

b75c321

Merge branch 'main' into df54

42813d9

Fix.

af84b73

Fix metrics aggregation for native scan.

e19aa50

enable SMJ with filter by default

e798f9a

add test for SPARK-43113

4236c27

test apache/datafusion#21660

3bff80b

mbutrovich mentioned this pull request Apr 16, 2026

fix: SortMergeJoin full outer join incorrectly matches rows when filter evaluates to NULL apache/datafusion#21660

Merged

mbutrovich added 4 commits April 15, 2026 21:13

Fix after merging in latest datafusion main in the feature branch.

ec18066

Fix after merging in latest datafusion main in the feature branch.

68f1a09

Merge branch 'main' into df54

276ad63

Fix after merging in latest datafusion main in the feature branch.

ab809d1

mbutrovich force-pushed the df54 branch from c917ac8 to ab809d1 Compare April 16, 2026 01:34

mbutrovich added 4 commits April 15, 2026 21:53

Fix after merging in latest datafusion main in the feature branch.

f716fc3

bump to latest commit on main after miri fix.

219dc9a

Merge branch 'main' into df54

9da46f4

# Conflicts: # native/Cargo.lock

fix after upmerge

f1a8041

mbutrovich added 2 commits April 16, 2026 11:28

bump to pick up SMJ with filter fix.

2b904ff

bump to pick up SMJ with filter fix.

ce476f7

mbutrovich mentioned this pull request Apr 16, 2026

ci: add breaking change detector apache/datafusion#21499

Open

mbutrovich added 2 commits April 16, 2026 14:17

Merge branch 'main' into df54

8d47993

Bump to latest commit to pick up Miri fix.

37318f3

mbutrovich wants to merge 21 commits into apache:main from mbutrovich:df54