[SPARK-19335][SPARK-38200][SQL] Add upserts for writing to JDBC by EnricoMi · Pull Request #54449 · apache/spark

EnricoMi · 2026-02-24T15:08:12Z

What changes were proposed in this pull request?

This is a follow-up on #16685 and #16692.

Implements an upsert mode for SaveMode.Append in the JDBC source for MySql, MsSql, and Postgres.

See #41611 for an alternative using the MERGE INTO command (not supported by MySql).
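For orientation, here is a minimal sketch of what dialect-specific upsert statements can look like. It uses the standard MySQL (`INSERT ... ON DUPLICATE KEY UPDATE`) and Postgres (`INSERT ... ON CONFLICT ... DO UPDATE`) syntax. The helper `build_upsert_sql` and its parameters are hypothetical and are not taken from this PR; the statements the PR actually generates may differ. MsSql is left out because this description does not say which statement is used there.

```python
def build_upsert_sql(dialect, table, columns, key_columns):
    """Return a parameterized upsert statement (hypothetical helper, not from the PR)."""
    if not key_columns:
        raise ValueError("at least one key column is required")
    missing = [k for k in key_columns if k not in columns]
    if missing:
        raise ValueError(f"key columns not in column list: {missing}")

    # Columns that get overwritten when a row with the same key already exists.
    update_columns = [c for c in columns if c not in key_columns]
    if not update_columns:
        raise ValueError("at least one non-key column is required")

    column_list = ", ".join(columns)
    placeholders = ", ".join("?" for _ in columns)
    insert = f"INSERT INTO {table} ({column_list}) VALUES ({placeholders})"

    if dialect == "postgres":
        # Postgres names the conflict target explicitly.
        keys = ", ".join(key_columns)
        assignments = ", ".join(f"{c} = EXCLUDED.{c}" for c in update_columns)
        return f"{insert} ON CONFLICT ({keys}) DO UPDATE SET {assignments}"

    if dialect == "mysql":
        # MySQL infers the conflict from the table's primary or unique indexes.
        assignments = ", ".join(f"{c} = VALUES({c})" for c in update_columns)
        return f"{insert} ON DUPLICATE KEY UPDATE {assignments}"

    raise ValueError(f"unsupported dialect in this sketch: {dialect}")
```

Calling `build_upsert_sql("postgres", "t", ["id", "v"], ["id"])` yields an `INSERT` into `t` that updates `v` whenever `id` already exists.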

Why are the changes needed?

The JDBC writer supports only truncating the existing table or inserting into it. Duplicates, i.e. rows with identical values in the primary or unique index columns, cause an exception, which prevents updating existing rows while inserting new ones.

Re-evaluating a partition after executor loss re-inserts rows that were already inserted in an earlier attempt, and the resulting duplicate-key exception kills the entire Spark job.
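The retry problem can be reproduced without Spark. The sketch below uses Python's built-in `sqlite3` as a stand-in for the real databases (an assumption made purely for illustration; SQLite is not one of the dialects in this PR). The table `target` and the helpers `write_partition` and `demo` are hypothetical. A plain insert fails when the same partition is written twice, while an upsert makes the second attempt idempotent.

```python
import sqlite3


def write_partition(conn, rows, upsert):
    """Write one partition's rows, optionally as an upsert keyed on id."""
    sql = "INSERT INTO target (id, value) VALUES (?, ?)"
    if upsert:
        # Postgres-style syntax; requires SQLite 3.24 or newer.
        sql += " ON CONFLICT (id) DO UPDATE SET value = excluded.value"
    with conn:  # commits on success, rolls back on error
        conn.executemany(sql, rows)


def demo():
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE target (id INTEGER PRIMARY KEY, value TEXT)")
    rows = [(1, "a"), (2, "b")]

    # First attempt succeeds.
    write_partition(conn, rows, upsert=False)

    # Simulated retry of the same partition with plain inserts.
    try:
        write_partition(conn, rows, upsert=False)
        plain_retry_failed = False
    except sqlite3.IntegrityError:
        plain_retry_failed = True

    # Retry as an upsert: existing keys are updated, new keys are inserted.
    write_partition(conn, [(1, "a2"), (2, "b"), (3, "c")], upsert=True)
    final_rows = conn.execute(
        "SELECT id, value FROM target ORDER BY id"
    ).fetchall()
    conn.close()
    return plain_retry_failed, final_rows
```

In this sketch the plain retry raises `sqlite3.IntegrityError`, whereas the upsert retry completes and leaves exactly one row per key.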

Does this PR introduce any user-facing change?

This adds upsert and upsertKeyColumns options for SaveMode.Append of the JDBC source.
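A hedged sketch of how these options might be passed from PySpark follows. Only the option names `upsert` and `upsertKeyColumns` come from this description. The value formats (`"true"` and a comma-separated column list) are assumptions, and `jdbc_upsert_options` is a hypothetical helper, so check the PR's documentation for the accepted values.

```python
def jdbc_upsert_options(url, table, key_columns):
    """Build JDBC writer options for an upsert (hypothetical helper)."""
    if not key_columns:
        raise ValueError("at least one upsert key column is required")
    return {
        "url": url,
        "dbtable": table,
        # ASSUMPTION: the option is enabled with the string "true".
        "upsert": "true",
        # ASSUMPTION: key columns are passed as a comma-separated list.
        "upsertKeyColumns": ",".join(key_columns),
    }


# Usage with an existing PySpark DataFrame `df` (not executed here):
#
#   opts = jdbc_upsert_options("jdbc:postgresql://host/db", "target", ["id"])
#   df.write.format("jdbc").mode("append").options(**opts).save()
```

Keeping the options in a plain dictionary makes it easy to inspect or log them before handing them to the writer.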

How was this patch tested?

Tests in JdbcSuite and integration suites.

Re-opens #49528.

EnricoMi added 15 commits April 28, 2025 06:40

Implement upsert for MySQL

49adfe3

Code cleanup

94afa86

Move upsert tests into trait

1bb0457

Implement upsert for MsSqlServer

ba196a4

Fix non-existing upsert test for Postgres

a78cc0a

Add upsert concurrency integration test

c826ce8

Add tests with varying column order

feea1dd

Add test with varying column order, sketch more tests

7b77e4a

Revert empty line removal, fix scalastyle error

f551323

Refactor tableDoesNotSupportError to reuse in tableDoesNotSupportUpsertsError

1a045c5

Fix after merge master

48fbb3b

Fix after merge master

8d5bf9f

Fix after merge master

a6f8ed8

Apply code review comments

d9b33ea

- Use SparkUnsupportedOperationException
- Remove unused string interpolation
- Fix indentation

Merge remote-tracking branch 'upstream/master' into jdbc-upsert-2

8af0281

EnricoMi wants to merge 15 commits into apache:master from G-Research:jdbc-upsert-2
