Add audio chat API support for gpt-4o-audio-preview model #397
Summary
This PR implements audio-to-audio chat completion support for the `gpt-4o-audio-preview` model, replacing the traditional STT→Chat→TTS pipeline with a single API call for 2-3x faster response times.
Features
- `audioChats()` and `audioChatsStream()` methods
- `AudioFormat` (wav, mp3, flac, opus, pcm16) and `Voice` (alloy, echo, fable, onyx, nova, shimmer)
- `AsyncThrowingStream`
Implementation Details
- `AudioChatQuery`, `AudioChatResult`, `AudioChatStreamResult`
- `OpenAIProtocol`, `OpenAIAsync`, with Combine support
Testing
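As an illustration only, a streaming method built on `AsyncThrowingStream` could be exercised against a mock along the following lines. This is a minimal sketch and not the PR's actual test code. Every `Sketch…` type, the `MockAudioChatClient`, the `collectReply` helper, and the field names are hypothetical stand-ins, since the real shapes of `AudioChatQuery` and `AudioChatStreamResult` are not shown in this description.

```swift
import Foundation

// Hypothetical stand-ins for the PR's types; the real definitions may differ.
enum SketchAudioFormat: String, Sendable { case wav, mp3, flac, opus, pcm16 }
enum SketchVoice: String, Sendable { case alloy, echo, fable, onyx, nova, shimmer }

struct SketchAudioChatQuery: Sendable {
    let model: String
    let inputAudio: Data
    let format: SketchAudioFormat
    let voice: SketchVoice
}

// One streamed piece of the reply (assumed shape: audio bytes plus a transcript fragment).
struct SketchAudioChunk: Sendable {
    let audio: Data
    let transcriptDelta: String
}

// Assumed protocol surface: a streaming call that returns an AsyncThrowingStream.
protocol SketchAudioChatClient {
    func audioChatsStream(query: SketchAudioChatQuery) -> AsyncThrowingStream<SketchAudioChunk, Error>
}

// Mock that replays canned chunks instead of calling the network.
struct MockAudioChatClient: SketchAudioChatClient {
    let chunks: [SketchAudioChunk]

    func audioChatsStream(query: SketchAudioChatQuery) -> AsyncThrowingStream<SketchAudioChunk, Error> {
        AsyncThrowingStream { continuation in
            for chunk in chunks {
                continuation.yield(chunk)
            }
            continuation.finish()
        }
    }
}

// Drains the stream, concatenating audio bytes and transcript fragments in arrival order.
func collectReply(
    from client: SketchAudioChatClient,
    query: SketchAudioChatQuery
) async throws -> (audio: Data, transcript: String) {
    var audio = Data()
    var transcript = ""
    for try await chunk in client.audioChatsStream(query: query) {
        audio.append(chunk.audio)
        transcript += chunk.transcriptDelta
    }
    return (audio, transcript)
}
```

Putting the streaming call behind a protocol keeps a test like this off the network: the mock yields fixed chunks, and the assertions check only that the consumer reassembles them in order.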
Files Changed
Documentation
Added comprehensive Audio Chat section to README.md including:
Breaking Changes
None - this is a purely additive change.
Related