voice: output retries for run(output_type=...) by theomonnom · Pull Request #6080 · livekit/agents

theomonnom · 2026-06-12T17:52:03Z

A run with an output_type ends with final_output=None whenever the model finishes its turn in prose instead of calling the task's completion tool — common with chatty models, and currently surfaced as a generic RuntimeError that callers can't distinguish or recover from.

Following pydantic-ai's output-tool semantics:

New output_options on run() (an options TypedDict in the style of keyterm_options/expressiveness): when the run ends without its output_type, the session re-prompts in the same context as a per-turn system message (max_retries, default 2) before raising; retry_instructions overrides the built-in retry prompt.
```
result = await sess.run(
    user_input=...,
    output_type=SummarizeOutput,
    output_options={"max_retries": 2, "retry_instructions": "Call submit_analysis, nothing else."},
)
```
A distinct UnexpectedModelBehavior (exported from livekit.agents, same name as pydantic-ai's) replaces the generic RuntimeError once the budget is exhausted, so callers can catch the failure specifically.

Defaults convert the dominant failure (model summarizes in prose) into a recovered run. Unit tests cover recovery, the prompt override, and exhaustion via FakeLLM.

🤖 Generated with Claude Code

…Error when exhausted

devin-ai-integration

Devin Review found 3 new potential issues.

devin-ai-integration · 2026-06-12T18:06:09Z

+        output_retries: int | OutputRetryOptions = 1,
    ) -> RunResult[Run_T]:
+        """output_retries: how many times to re-prompt the model when the run
+        ends without the expected output_type before raising RunOutputError;


📝 Info: Default output_retries mismatch between RunResult constructor and AgentSession.run()

The RunResult.__init__ default for output_retries is 1 (run_result.py:82), but AgentSession.run() always passes output_options.get('max_retries', 2) (agent_session.py:623), defaulting to 2. This means direct construction of RunResult (e.g., at agent_session.py:854 for capture_run) gets 1 retry, while runs through session.run() get 2. The capture_run path at line 854 doesn't set output_type, so retries are irrelevant there, but the inconsistency could be confusing for any future code path that constructs RunResult directly with an output_type.

Was this helpful? React with 👍 or 👎 to provide feedback.

Intentional: one silent recovery is the desired out-of-box behavior (it converts the dominant failure into a recovered run, matching pydantic-ai's default of 1 output retry), and the latency cost only occurs on runs that would previously have failed outright. The exception change is called out in the PR description; output_options={"retries": 0} restores fail-fast.

chenghao-mou · 2026-06-14T19:01:37Z

        user_input: str,
        input_modality: Literal["text", "audio"] = "text",
        output_type: type[Run_T] | None = None,
+        output_options: RunOutputOptions | None = None,


should this include NOT_GIVEN so None can be used to disable the retry behavior? otherwise we have to type {"max_retries": 0} to disable it explicitly.

chenghao-mou · 2026-06-14T19:03:07Z

+        run_state = RunResult(
+            user_input=user_input,
+            output_type=output_type,
+            output_retries=output_options.get("max_retries", 2),


nitpicking: we could follow the _resolve* pattern here to have explicit default value(s).

chenghao-mou · 2026-06-14T19:06:33Z

+        user_input: str | None = None,
+        output_type: type[Run_T] | None,
+        output_retries: int = 1,
+        output_retry_instructions: str | None = None,


nitpicking: should we just pass the output options here so default values and resolution can stay in one place?

voice: retry a run that ends without its output_type, raise RunOutput…

972a8bd

…Error when exhausted

theomonnom requested a review from a team as a code owner June 12, 2026 17:52

theomonnom added 2 commits June 12, 2026 10:57

voice: overridable output retry instructions

51fe116

voice: fold retry config into output_retries

ef284b8

devin-ai-integration Bot reviewed Jun 12, 2026

View reviewed changes

theomonnom added 2 commits June 12, 2026 11:08

voice: the task owns its output retry instructions

b71f13f

voice: output retry prompt configured on the session

f006d67

This comment was marked as resolved.

Sign in to view

theomonnom added 4 commits June 12, 2026 12:01

voice: group structured-output behavior into output_options

b7389bf

voice: rename RunOutputError to UnexpectedModelBehavior

9238266

voice: output_options moves to run()

50561cc

voice: retry only the no-output case, fold retry tests

e712c0d

This comment was marked as resolved.

Sign in to view

theomonnom added 4 commits June 12, 2026 12:54

fix formatting

5038667

voice: retry via per-turn system instructions, broaden retry guard

3a9cf15

voice: rename retries to max_retries

4538b33

voice: default output max_retries to 2

d9d91c9

This comment was marked as resolved.

Sign in to view

theomonnom added 2 commits June 12, 2026 15:58

drop the retry test file

0d4fd62

remove unrelated local changes

86acdca

theomonnom force-pushed the theo/output-retries branch from c63462a to 86acdca Compare June 12, 2026 23:00

Bobronium approved these changes Jun 12, 2026

View reviewed changes

theomonnom added 3 commits June 12, 2026 16:12

voice: UnexpectedModelBehavior extends RuntimeError

a6b7fa5

voice: rename RunOutputOptions (class only)

85b21fd

sort imports

6fce0bf

chenghao-mou reviewed Jun 14, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

voice: output retries for run(output_type=...)#6080

voice: output retries for run(output_type=...)#6080
theomonnom wants to merge 18 commits into
mainfrom
theo/output-retries

theomonnom commented Jun 12, 2026 •

edited

Loading

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot Jun 12, 2026 •

edited

Loading

Uh oh!

theomonnom Jun 12, 2026

Uh oh!

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

chenghao-mou Jun 14, 2026

Uh oh!

chenghao-mou Jun 14, 2026

Uh oh!

chenghao-mou Jun 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

theomonnom commented Jun 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Jun 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

theomonnom Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

chenghao-mou Jun 14, 2026

Choose a reason for hiding this comment

Uh oh!

chenghao-mou Jun 14, 2026

Choose a reason for hiding this comment

Uh oh!

chenghao-mou Jun 14, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

theomonnom commented Jun 12, 2026 •

edited

Loading

devin-ai-integration Bot Jun 12, 2026 •

edited

Loading