[python][ray] Optimize merge into self-merge updates on data evolution table #8141

Draft
XiaoHongbo-Hope wants to merge 2 commits into apache:master from XiaoHongbo-Hope:self_merge_optimize

@XiaoHongbo-Hope
Contributor

Purpose

Tests

@XiaoHongbo-Hope changed the title from "Self merge optimize" to "[python][ray] Optimize MERGE INTO self-merge updates on dataEvolution table" on Jun 6, 2026
@XiaoHongbo-Hope changed the title from "[python][ray] Optimize MERGE INTO self-merge updates on dataEvolution table" to "[python][ray] Optimize merge into self-merge updates on dataEvolution table" on Jun 6, 2026
When source == target with ON ['_ROW_ID'], skip the inner join and
read the table only once. Aligned with Spark's isSelfMergeOnRowId
detection logic.
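
The detection described in this commit message might look roughly like the sketch below. It is a minimal illustration and not the code from this PR: `TableRef`, `is_self_merge_on_row_id`, and `plan_reads` are hypothetical names, and comparing tables by an identifier string is an assumption. Only the `_ROW_ID` key and the "read the table only once" behaviour come from the commit message.

```python
from dataclasses import dataclass
from typing import List, Sequence

ROW_ID = "_ROW_ID"  # join key named in the commit message


@dataclass(frozen=True)
class TableRef:
    """Hypothetical stand-in for a table handle (not a real pypaimon class)."""
    identifier: str


def is_self_merge_on_row_id(source: TableRef, target: TableRef,
                            on: Sequence[str]) -> bool:
    """True when source and target are the same table and ON is exactly ['_ROW_ID']."""
    return source.identifier == target.identifier and list(on) == [ROW_ID]


def plan_reads(source: TableRef, target: TableRef,
               on: Sequence[str]) -> List[str]:
    """Return the tables to scan: one scan for a self-merge, otherwise two."""
    if is_self_merge_on_row_id(source, target, on):
        # Every row matches itself on _ROW_ID, so the inner join is redundant
        # and a single scan of the target is enough.
        return [target.identifier]
    # General case: both sides are read and then joined on the ON columns.
    return [source.identifier, target.identifier]
```

With this sketch, `plan_reads(t, t, ["_ROW_ID"])` yields a single scan, while any other key list or a different source table falls back to the two-scan join path.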
@XiaoHongbo-Hope changed the title from "[python][ray] Optimize merge into self-merge updates on dataEvolution table" to "[python][ray] Optimize merge into self-merge updates on data evolution table" on Jun 6, 2026
…artifacts

- Remove incorrect len(clauses) guards reintroduced during rebase
  in build_matched_update_ds and build_not_matched_insert_ds
- Align build_self_merge_update_ds with multi-clause fall-through
  pattern used by build_matched_update_ds
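
For readers unfamiliar with the pattern, a multi-clause fall-through can be sketched as below. This assumes "fall-through" means that a row failing one clause's condition is tested against the next clause, with the first matching clause winning, as in SQL MERGE semantics. `apply_matched_clauses` and the `(condition, update)` tuple shape are hypothetical and are not taken from `build_matched_update_ds`.

```python
from typing import Callable, Dict, List, Optional, Tuple

Row = Dict[str, object]
# Hypothetical clause shape: (optional condition, update function).
Clause = Tuple[Optional[Callable[[Row], bool]], Callable[[Row], Row]]


def apply_matched_clauses(row: Row, clauses: List[Clause]) -> Row:
    """Apply the first clause whose condition holds; otherwise leave the row as is."""
    for condition, update in clauses:
        # A clause without a condition matches every row.
        if condition is None or condition(row):
            return update(row)
        # Condition failed: fall through to the next clause.
    return row
```

Because the loop handles zero, one, or many clauses uniformly, this shape needs no separate `len(clauses)` check.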