Which issue does this PR close?
Closes # (none: prerequisite for apache/datafusion-comet#2706; follow-up to #17779 and #19274).
Rationale for this change
The existing Spark map_from_entries / map_from_arrays UDFs silently kept the last occurrence of a duplicate key, effectively hardcoding the LAST_WIN option of Spark's spark.sql.mapKeyDedupPolicy. Spark's default is EXCEPTION, and Spark 4 raises a SparkRuntimeException with error class DUPLICATED_MAP_KEY when a duplicate key is encountered. Without matching that default, downstream engines (e.g. datafusion-comet) must fall back to Spark even for the common case. A sketch of the two policies follows.
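
For illustration, a minimal Spark SQL sketch of the two dedup policies (literal values and result formatting are illustrative, not taken from the test suite):

```sql
-- With the default spark.sql.mapKeyDedupPolicy = EXCEPTION,
-- a duplicate key fails the query (Spark 4 reports DUPLICATED_MAP_KEY):
SELECT map_from_arrays(array(1, 1), array('a', 'b'));
-- SparkRuntimeException: [DUPLICATED_MAP_KEY] Duplicate map key 1 was found ...

-- With LAST_WIN, the last value for a duplicate key silently wins:
SET spark.sql.mapKeyDedupPolicy = LAST_WIN;
SELECT map_from_arrays(array(1, 1), array('a', 'b'));
-- {1 -> "b"}
```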
What changes are included in this PR?
No configuration option or policy enum is introduced at this stage; map_from_entries.rs, map_from_arrays.rs, and config.rs are left untouched.
Are these changes tested?
Yes. The duplicate-key assertions in sqllogictest/test_files/spark/map/map_from_entries.slt and sqllogictest/test_files/spark/map/map_from_arrays.slt are flipped from asserting a positive result to asserting a query error, as sketched below.
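
A rough sketch of the flip in sqllogictest form (the exact statements, array syntax, and error text in the .slt files may differ; this is illustrative only):

```
# before this PR: the duplicate key silently collapsed to the last value
# query ?
# SELECT map_from_arrays([1, 1], ['a', 'b']);
# ----
# {1: b}

# after this PR: the same statement is asserted to fail
query error DUPLICATED_MAP_KEY
SELECT map_from_arrays([1, 1], ['a', 'b']);
```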
Are there any user-facing changes?
Yes, a behavior change: duplicate keys now raise [DUPLICATED_MAP_KEY] under the default policy instead of silently collapsing to the last occurrence. This aligns with Spark's documented default. No new config keys, no API changes.