Conversation

@fmterrorf

In conjunction with commanded/commanded#569, this PR adds support for batching Ecto projections.
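
For illustration, a batch projector built on this might look roughly like the sketch below (the `project_batch` callback shape and the `Deposit` schema are assumptions on my part, not the final API):

```elixir
# Hypothetical sketch only: the project_batch signature and the Deposit schema
# are assumed for illustration and are not taken from this PR.
defmodule MyApp.DepositsProjector do
  use Commanded.Projections.Ecto,
    application: MyApp.CommandedApp,
    repo: MyApp.Repo,
    name: "deposits",
    batch_size: 100

  project_batch fn multi, events ->
    # Fold the whole batch into one Ecto.Multi so every projected row commits
    # in a single transaction along with the projection version bookkeeping.
    events
    |> Enum.with_index()
    |> Enum.reduce(multi, fn {event, index}, multi ->
      Ecto.Multi.insert(multi, {:deposit, index}, %MyApp.Deposit{
        account_id: event.data.account_id,
        amount: event.data.amount
      })
    end)
  end
end
```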

nikkocampbell and others added 6 commits September 18, 2024 15:43
Added tests and relevant fixes to batch projector

added after update callback test

Update dependencies

docs

temporary commanded version

Skip over partial seen batch

remove elixir_uuid after rebase

Return error on partially seen batch again
@fmterrorf marked this pull request as ready for review September 20, 2024 21:42
@cdegroot

Note: review can wait until Calmwave has dogfooded this for a bit.

Contributor

@drteeth left a comment

Thanks for the work on this feature. It looks great, well done.

As this PR relies on upstream changes to Commanded, it will have to wait until that work is merged. See commanded/commanded#569

Before calling the PR done, the reference to the Calmwave fork would of course need to be dropped.

@anderslemke

I'm curious what the status of this is. What has the experience at Calmwave been, @cdegroot?

Also, I have a question:

Let's say I set batch size to 100.
Now, let's assume that the projection is up to date, and a new event is added.
Will I get the call to project_batch right away, or will it wait until we have another 99 events ready for the batch?

@yordis
Contributor

yordis commented Nov 18, 2025

@anderslemke The subscription flushes on timeout (milliseconds) if fewer events are available, so batch_size: 100 won't wait for 99 more events.

Contributor

@yordis left a comment

I noticed a few issues with partial batch handling:

  1. No event filtering - When a batch contains already-seen events (e.g., events [2,3,4] when watermark is 2), the user projection receives all events instead of just [3,4]. This can cause duplicate projections.

  2. No locking - The watermark check and update aren't atomic. Two concurrent batches could both pass the check and cause race conditions.

  3. Test verification - The test at lines 74-91 should verify that "e4" was actually projected, not just that the call succeeded.

The core issue is that apply(multi_fn, [multi]) at line 152 passes the multi but not the filtered events, so the user's lambda processes the original batch from closure scope.

A robust approach would be: lock → filter unseen events → update watermark → pass only unseen events to user projection.
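
As a rough sketch of that flow (assuming a `projection_versions` table with a `last_seen_event_number` column, events that expose `event_number`, and a `multi_fn` that accepts the filtered events; these names are illustrative, not the code in this PR):

```elixir
defmodule MyApp.BatchProjectionSketch do
  import Ecto.Query

  # Illustrative only: table/column names and the two-argument multi_fn
  # are assumptions, not the actual implementation in this PR.
  def project_filtered_batch(multi, projection_name, events, multi_fn) do
    multi
    # 1. Lock the watermark row so concurrent batches serialize on it.
    |> Ecto.Multi.run(:watermark, fn repo, _changes ->
      last_seen =
        repo.one(
          from pv in "projection_versions",
            where: pv.projection_name == ^projection_name,
            lock: "FOR UPDATE",
            select: pv.last_seen_event_number
        ) || 0

      {:ok, last_seen}
    end)
    # 2. Keep only events beyond the watermark, dropping already-seen ones.
    |> Ecto.Multi.run(:unseen, fn _repo, %{watermark: last_seen} ->
      {:ok, Enum.filter(events, &(&1.event_number > last_seen))}
    end)
    # 3. Advance the watermark to the last event number in this batch.
    |> Ecto.Multi.run(:advance_watermark, fn repo, _changes ->
      last = events |> List.last() |> Map.fetch!(:event_number)

      {_count, _} =
        repo.update_all(
          from(pv in "projection_versions", where: pv.projection_name == ^projection_name),
          set: [last_seen_event_number: last]
        )

      {:ok, last}
    end)
    # 4. Hand only the unseen events to the user's projection function instead
    #    of letting it read the original batch from closure scope.
    |> Ecto.Multi.merge(fn %{unseen: unseen} -> multi_fn.(Ecto.Multi.new(), unseen) end)
  end
end
```

The important part is step 4: the user's function receives the filtered events explicitly, so it can no longer re-project events that the watermark says were already handled.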
