allow pushing frames to VAD when agent speech is uninterruptible #4418

chenghao-mou · 2025-12-30T21:10:48Z

This should close #4413

What happened:

VAD received audio frames, changing user stage to speaking;
Uninterruptible speech created, discarding audio frames for both STT and VAD. User state is stuck in speaking.

This PR should allow VAD to operate separately. Tested with

    @function_tool
    async def get_weather(self, location: str) -> str:
        """
        Called when the user asks about the weather.

        Args:
            location: The location to get the weather for
        """
        await asyncio.sleep(5) # <- interrupt here!
        self.session.say("And tomorrow is going to be sunny too.", allow_interruptions=False)
        return f"The weather in {location} is sunny today."

longcw

lgtm! do you think if we should ignore the user silence event if the speech is uninterruptible?

chenghao-mou · 2025-12-31T08:26:43Z

lgtm! do you think if we should ignore the user silence event if the speech is uninterruptible?

Do you mean skip waiting for user silence when the uninterruptible agent speech hasn't started? I think we should keep it because to the user, the agent is not yet speaking and they are not done talking, especially now with VAD being enabled at all time.

allow pushing frames to VAD only

c611c42

chenghao-mou requested a review from a team December 30, 2025 21:10

fix type issues

91cac9a

longcw approved these changes Dec 31, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

allow pushing frames to VAD when agent speech is uninterruptible #4418

allow pushing frames to VAD when agent speech is uninterruptible #4418

chenghao-mou commented Dec 30, 2025

Uh oh!

longcw left a comment

Uh oh!

chenghao-mou commented Dec 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

allow pushing frames to VAD when agent speech is uninterruptible #4418

Are you sure you want to change the base?

allow pushing frames to VAD when agent speech is uninterruptible #4418

Conversation

chenghao-mou commented Dec 30, 2025

Uh oh!

longcw left a comment

Choose a reason for hiding this comment

Uh oh!

chenghao-mou commented Dec 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants