feat: async and multi-result set APIs WIP by zeroshade · Pull Request #3607 · apache/arrow-adbc

zeroshade · 2025-10-22T19:36:46Z

A draft for a possible update to the ADBC API to add several driver methods:

AdbcStatementExecuteQueryAsync
AdbcStatementNextResultSet
AdbcStatementNextResultSetAsync
AdbcConnectionReadPartitionAsync

This would also be a 1.2.0 revision of the ADBC API.

Actually implementing these functions would be done in a separate PR once we've reached consensus on this addition to the API.

CurtHagenlocher · 2025-10-22T19:48:31Z

c/include/arrow-adbc/adbc.h

+///   being returned successfully.  ADBC_STATUS_NOT_FOUND is returned
+///   when there are no more result sets.
+ADBC_EXPORT
+AdbcStatusCode AdbcStatementNextResultSet(struct AdbcStatement* statement,


It would be nice if this also worked for "schema-only" evaluation. (I started down this road last year and my work-in-progress was at main...CurtHagenlocher:arrow-adbc:MoreResults.)

We have AdbcStatementExecuteSchema for schema-only evaluation, so you're suggesting a NextResultSet function for that one too?
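For illustration, a caller might drain multiple result sets under the draft's ADBC_STATUS_NOT_FOUND convention roughly as follows. This is only a sketch under stated assumptions: `MockStatement`, `MockNextResultSet`, `DrainAllResults`, and the `MOCK_STATUS_*` constants are stand-ins defined here, not the proposed ADBC signature (the draft declaration is truncated above), and each "result set" is reduced to a row count.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Stand-in status codes for this sketch; real code would use the
 * ADBC_STATUS_* values from adbc.h. */
enum { MOCK_STATUS_OK = 0, MOCK_STATUS_NOT_FOUND = 1 };

/* Hypothetical statement holding a fixed list of result sets,
 * each represented only by its row count. */
struct MockStatement {
  const int64_t* row_counts;
  size_t n_results;
  size_t next;
};

/* Advance to the next result set; NOT_FOUND signals that none remain. */
int MockNextResultSet(struct MockStatement* stmt, int64_t* rows_out) {
  assert(stmt != NULL && rows_out != NULL);
  if (stmt->next >= stmt->n_results) return MOCK_STATUS_NOT_FOUND;
  *rows_out = stmt->row_counts[stmt->next++];
  return MOCK_STATUS_OK;
}

/* Drain every result set and return the total number of rows seen.
 * Returns -1 on any status other than OK or NOT_FOUND. */
int64_t DrainAllResults(struct MockStatement* stmt, size_t* n_sets_out) {
  int64_t total = 0;
  size_t n_sets = 0;
  for (;;) {
    int64_t rows = 0;
    int status = MockNextResultSet(stmt, &rows);
    if (status == MOCK_STATUS_NOT_FOUND) break; /* normal end of iteration */
    if (status != MOCK_STATUS_OK) return -1;    /* genuine failure */
    total += rows;
    n_sets++;
  }
  if (n_sets_out != NULL) *n_sets_out = n_sets;
  return total;
}
```

Note that the caller has to treat one non-OK status as "done" and every other one as an error, which is the kind of special-casing a schema-only variant would presumably need as well.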

lidavidm · 2025-10-22T23:31:49Z

Do we want async variants of things like GetObjects, Prepare, etc.?

CurtHagenlocher · 2025-10-23T00:01:56Z

From a tactical perspective, should 1.2-related work all happen on a separate branch (at least until we feel the proposed changes have been proven out)?

On an unrelated note, are there Arrow interop tests yet for the device APIs? I seem to recall that there weren't any a year ago.

lidavidm · 2025-10-23T00:25:02Z

> From a tactical perspective, should 1.2-related work all happen on a separate branch (at least until we feel the proposed changes have been proven out)?

Yeah, I think that'll have to be what happens due to the header changes.

lidavidm · 2025-10-23T06:09:38Z

c/include/arrow-adbc/adbc.h

+///
+/// A partition can be retrieved from AdbcPartitions.
+///
+/// This AdbcConnection must outlive the ArrowAsyncDeviceStreamHandler.


I think we need to comment on the lifetime of serialized_partition too - does it need to live until the call returns or until the callback finishes?
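One possible answer to the lifetime question, sketched below, is for the driver to copy the partition bytes before the call returns, so the caller's buffer only needs to stay valid for the duration of the call. This is an assumption for illustration, not what the draft specifies; `MockReadRequest` and its helper functions are hypothetical names.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical per-request state owned by the driver until the
 * asynchronous read completes. */
struct MockReadRequest {
  uint8_t* partition; /* driver-owned copy of the serialized partition */
  size_t length;
};

/* Copy the caller's bytes so the caller may free or reuse its buffer as
 * soon as this function returns. Returns 0 on success, -1 if the
 * allocation fails. */
int MockReadRequestInit(struct MockReadRequest* req,
                        const uint8_t* serialized_partition, size_t length) {
  assert(req != NULL);
  req->partition = NULL;
  req->length = 0;
  if (length == 0) return 0;
  req->partition = (uint8_t*)malloc(length);
  if (req->partition == NULL) return -1;
  memcpy(req->partition, serialized_partition, length);
  req->length = length;
  return 0;
}

/* Called by the driver once the callback chain has finished. */
void MockReadRequestRelease(struct MockReadRequest* req) {
  free(req->partition);
  req->partition = NULL;
  req->length = 0;
}
```

The alternative, requiring the caller to keep the buffer alive until the callback finishes, avoids the copy but pushes the bookkeeping onto every caller.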

lidavidm · 2025-10-23T06:10:21Z

c/include/arrow-adbc/adbc.h

+ADBC_EXPORT
+AdbcStatusCode AdbcStatementExecuteQueryAsync(
+    struct AdbcStatement* statement, struct ArrowAsyncDeviceStreamHandler* handler,
+    int64_t* rows_affected, struct AdbcError* error);


When would rows_affected be populated? Maybe it should be included in schema metadata or something instead...

I guess it could go in ArrowAsyncProducer's additional_metadata. Or, provide an AdbcAsyncProducerGetMetadata(ArrowAsyncProducer*) like how we have an AdbcErrorFromArrayStream. (Speaking of which, we need an equivalent of that for the async stream.)

Hmm, that's a good point. Since it's an asynchronous execution it wouldn't be populated before returning. I guess we can define a canonical metadata key to indicate total rows?
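As a rough sketch of the metadata idea: if the row count were carried as a key/value pair, a consumer could look it up as below. This assumes the blob uses the same binary key/value encoding as `ArrowSchema::metadata` in the Arrow C Data Interface (an int32 pair count, then for each pair an int32 key length, the key bytes, an int32 value length, and the value bytes, all native-endian). The key name `example:rows_affected` and the decimal-string value are hypothetical; no canonical key has been defined in this thread.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical key; the thread has not settled on a canonical name. */
static const char kRowsAffectedKey[] = "example:rows_affected";

/* Read one native-endian int32 and advance the cursor. */
static int32_t ReadInt32(const char** cursor) {
  int32_t v;
  memcpy(&v, *cursor, sizeof(v));
  *cursor += sizeof(v);
  return v;
}

/* Look up `key` in a metadata blob. Returns 1 and sets *value and
 * *value_len when found, 0 otherwise. Trusts the blob to be well formed. */
int MetadataFind(const char* metadata, const char* key, const char** value,
                 int32_t* value_len) {
  if (metadata == NULL) return 0;
  const size_t want = strlen(key);
  const char* cursor = metadata;
  const int32_t n_pairs = ReadInt32(&cursor);
  for (int32_t i = 0; i < n_pairs; i++) {
    const int32_t key_len = ReadInt32(&cursor);
    const char* k = cursor;
    cursor += key_len;
    const int32_t val_len = ReadInt32(&cursor);
    const char* v = cursor;
    cursor += val_len;
    if ((size_t)key_len == want && memcmp(k, key, want) == 0) {
      *value = v;
      *value_len = val_len;
      return 1;
    }
  }
  return 0;
}

/* Parse the row count stored under the hypothetical key.
 * Returns -1 if the key is absent or the value is not a decimal integer. */
int64_t RowsAffectedFromMetadata(const char* metadata) {
  const char* value = NULL;
  int32_t len = 0;
  char buf[32];
  char* end = NULL;
  if (!MetadataFind(metadata, kRowsAffectedKey, &value, &len)) return -1;
  if (len <= 0 || len >= (int32_t)sizeof(buf)) return -1;
  memcpy(buf, value, (size_t)len); /* values are not NUL-terminated */
  buf[len] = '\0';
  const long long parsed = strtoll(buf, &end, 10);
  if (end == buf || *end != '\0') return -1;
  return (int64_t)parsed;
}
```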

lidavidm · 2025-10-23T06:14:08Z

Not to expand the scope too much, but looking at #1514 I wonder if we should add this parameter to Execute. (It might be cleaner to just add a separate call for it, though.)

zeroshade · 2025-10-23T15:32:54Z

> Do we want async variants of things like GetObjects, Prepare, etc.?

I'm not sure if an async variant of Prepare makes that much sense, but it would make sense to add async variants for anything that currently returns an ArrowArrayStream such as GetObjects etc.

Would an async variant for Prepare be something like AdbcStatementPrepare(struct AdbcStatement*, void (*on_prepared)(struct AdbcStatement*, struct AdbcError*)) where it returns immediately and calls on_prepared when it finishes (with the error being non-nil if there was an error, and being nil if successful)?
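To make the callback shape concrete, here is a minimal self-contained sketch. Everything in it is hypothetical: `MockStatement`, `MockError`, and `MockStatementPrepareAsync` are stand-ins rather than ADBC declarations, the `user_data` parameter is an addition not mentioned above, and the mock invokes the callback inline, whereas a real driver would presumably return immediately and call it later, possibly from another thread.

```c
#include <assert.h>
#include <stddef.h>

/* Stand-ins for AdbcError and AdbcStatement. */
struct MockError {
  const char* message;
};

struct MockStatement {
  const char* sql;
  int prepared;
};

/* Completion callback: `error` is NULL on success, non-NULL on failure. */
typedef void (*OnPrepared)(struct MockStatement* stmt,
                           const struct MockError* error, void* user_data);

/* Hypothetical async prepare. This mock completes synchronously so the
 * example stays deterministic. */
void MockStatementPrepareAsync(struct MockStatement* stmt,
                               OnPrepared on_prepared, void* user_data) {
  assert(stmt != NULL && on_prepared != NULL);
  if (stmt->sql == NULL) {
    /* The error only needs to live for the duration of the callback. */
    struct MockError err = {"no query set"};
    on_prepared(stmt, &err, user_data);
    return;
  }
  stmt->prepared = 1;
  on_prepared(stmt, NULL, user_data);
}

/* Example callback: records whether the prepare failed. */
void RecordOutcome(struct MockStatement* stmt, const struct MockError* error,
                   void* user_data) {
  (void)stmt;
  int* failed = (int*)user_data;
  *failed = (error != NULL);
}
```

A caller would pass `RecordOutcome` plus a pointer to an `int` flag, then inspect the flag once the callback has run.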

zeroshade · 2025-10-23T16:47:06Z

Added more functions based on the comments and back-and-forth here. The current collection of new functions is:

ConnectionGetInfoAsync
ConnectionGetObjectsAsync
ConnectionGetTableSchemaAsync
ConnectionGetTableTypesAsync
ConnectionGetStatisticsAsync
ConnectionGetStatisticNamesAsync
ConnectionReadPartitionAsync
StatementExecuteSchemaAsync
StatementNextResultSetSchema
StatementNextResultSetSchemaAsync
StatementExecutePartitionsAsync
StatementNextResultSet
StatementExecuteQueryAsync
StatementNextResultSetAsync

> Not to expand the scope too much, but looking at #1514 I wonder if we should add this parameter to Execute. (It might be cleaner to just add a separate call for it, though.)

Personally, I'd prefer a separate call for this, done in a follow-up PR rather than in this one. But I'm open to it here if there's consensus.

lidavidm · 2025-10-23T23:34:01Z

Prepare may have to do I/O, hence it should have an async variant.

paleolimbot

Awesome! Probably an async wrapper around a sync array stream in nanoarrow would help fill some of these in.

It's a bummer we have to repeat so much documentation but I suppose the existing documentation has mostly stayed the same.

> Not to expand the scope too much, but looking at #1514 I wonder if we should add this parameter to Execute. (It might be cleaner to just add a separate call for it, though.)

I would love this (although I think a separate function makes sense). I can write that up as a separate PR if that would be helpful.

zeroshade · 2025-10-24T19:31:01Z

> Awesome! Probably an async wrapper around a sync array stream in nanoarrow would help fill some of these in.

Wouldn't it make more sense to just have nanoarrow implement the async stream handler from the C Data Interface?

> I can write that up as separate PR if that would be helpful.

It would! Thanks!

zeroshade · 2025-10-24T19:47:21Z

> Prepare may have to do I/O, hence it should have an async variant.

Added StatementPrepareAsync.

paleolimbot · 2025-10-27T15:35:42Z

> I can write that up as separate PR if that would be helpful.

> It would! Thanks!

#3623

lidavidm · 2025-10-27T23:11:39Z

Just a shower thought (which can be split into a separate discussion): instead of adding async and non-async versions of APIs that don't return data, I wonder whether we could just add a set of AdbcStatementWait, AdbcConnectionWait, and AdbcDatabaseWait functions that set the callback for whatever operations were just performed. (This is probably too complex compared with just adding duplicate functions, though.)

zeroshade · 2025-10-28T17:09:22Z

> instead of adding async and non-async versions of APIs that don't return data, we could just add a set of AdbcStatementWait, AdbcConnectionWait, AdbcDatabaseWait that set the callback for whatever operations were just performed (probably this is too complex vs just adding duplicate functions though)

I like that this idea would let us avoid having both async and non-async versions of the functions, but I think it might be too complex. @CurtHagenlocher @paleolimbot thoughts?

lidavidm · 2025-11-12T00:30:47Z

c/include/arrow-adbc/adbc.h

+///   being returned successfully.  ADBC_STATUS_NOT_FOUND is returned
+///   when there are no more result sets.


I don't like overloading the return code. I think it'd be more consistent to return a schema with no release callback.

I can update this to do that, then I'll attempt to implement these APIs and see how it goes
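A sketch of what the "schema with no release callback" convention could look like from the caller's side. The `ArrowSchema` layout is the one from the Arrow C Data Interface, where a NULL `release` member marks a released structure; `MockMultiStatement`, `MockNextResultSetSchema`, and `CountResultSets` are hypothetical stand-ins, not the proposed ADBC functions.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* ArrowSchema as laid out by the Arrow C Data Interface. */
struct ArrowSchema {
  const char* format;
  const char* name;
  const char* metadata;
  int64_t flags;
  int64_t n_children;
  struct ArrowSchema** children;
  struct ArrowSchema* dictionary;
  void (*release)(struct ArrowSchema*);
  void* private_data;
};

/* Hypothetical statement exposing a fixed list of result-set schemas,
 * each represented only by its format string. */
struct MockMultiStatement {
  const char* const* formats;
  size_t n_results;
  size_t next;
};

/* Nothing is heap-allocated in this mock, so releasing only marks the
 * struct as released. */
static void MockSchemaRelease(struct ArrowSchema* schema) {
  schema->release = NULL;
}

/* Returns 0 whenever the call itself succeeds. When no result sets
 * remain, *out is left in the released state (release == NULL). */
int MockNextResultSetSchema(struct MockMultiStatement* stmt,
                            struct ArrowSchema* out) {
  assert(stmt != NULL && out != NULL);
  memset(out, 0, sizeof(*out));
  out->release = NULL; /* "no more result sets" unless filled in below */
  if (stmt->next >= stmt->n_results) return 0;
  out->format = stmt->formats[stmt->next++];
  out->name = "";
  out->release = MockSchemaRelease;
  return 0;
}

/* Count the remaining result sets, releasing each schema as we go. */
size_t CountResultSets(struct MockMultiStatement* stmt) {
  size_t n = 0;
  for (;;) {
    struct ArrowSchema schema;
    if (MockNextResultSetSchema(stmt, &schema) != 0) break; /* real error */
    if (schema.release == NULL) break;                      /* end of results */
    n++;
    schema.release(&schema);
  }
  return n;
}
```

With this shape, a non-zero status always means an actual failure, and end-of-results is signalled out of band from the status code.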

Extracted from #3607, and influenced by the comments there and by main...CurtHagenlocher:arrow-adbc:MoreResults, this contains a proposal for handling multi-result-set query execution via ADBC by adding a new function for drivers, `AdbcStatementNextResultSet`. It also includes the changes needed for ADBC API Revision 1.2.0 (macro defines and so on). The comment above the function gives the full semantic definition of its behavior.

zeroshade added 2 commits October 22, 2025 15:33

feat: async and multi-result set APIs

d58d2a9

add ADBC_VERSION_1_2_0 constant

a017e4a

zeroshade requested a review from lidavidm as a code owner October 22, 2025 19:36

github-actions bot added this to the ADBC Libraries 21 milestone Oct 22, 2025

zeroshade requested review from amoeba, felipecrv and paleolimbot October 22, 2025 19:37

zeroshade modified the milestones: ADBC Libraries 21, ADBC API Specification 1.2.0 (Async) Oct 22, 2025

fix definition

11042fe

CurtHagenlocher reviewed Oct 22, 2025

lidavidm reviewed Oct 23, 2025

add more functions from comments

8793b13

ianmcook mentioned this pull request Oct 24, 2025

format: support multiple result sets #1358

Open

paleolimbot reviewed Oct 24, 2025

add async variation of prepare

b70139e

paleolimbot mentioned this pull request Oct 27, 2025

feat(c/include/arrow-adbc): Add AdbcStatementRequestSchema #3623

Draft

lidavidm reviewed Nov 12, 2025

zeroshade mentioned this pull request Jan 8, 2026

feat: Spec multi-result-set API #3871

Merged

lidavidm modified the milestones: ADBC API Specification 1.2.0, ADBC API Specification Wishlist Jan 9, 2026
