server: reserve context budget for DSML tool calls by gmontana · Pull Request #105 · antirez/ds4

gmontana · 2026-05-12T18:17:55Z

Summary

Mitigates #48 for the ds4 DeepSeek V4 Flash tool-enabled chat path by reserving the last 256 decode tokens for DSML tool-call closure.

For requests with tools, ordinary text generation stops at a soft limit before the hard context limit. If generation is already inside a DSML tool call, or is at the soft limit with a partial tool-start marker at the end of the generated text, decoding can use the reserve.

Notes

This is intentionally a small server-side budget guard, not constrained decoding. Tool-enabled chats may finish with finish_reason=length up to 256 tokens earlier than the hard context limit when no tool call is in progress. Oversized tool arguments can still reach the hard limit and hit the existing unterminated tool call backstop.

The KV continued-checkpoint gate is left with its previous condition; this PR only changes decode budget decisions.

Validation

git diff --check
make ds4_test
./ds4_test --server
make ds4-server

I did not add a full model-backed reproduction test; the new coverage is a focused server-unit test for the decode budget logic and soft-limit transition.

For tool-enabled DeepSeek V4 chat requests, keep ordinary text generation out of the final 256 context tokens and allow that reserve only while a DSML tool call or partial tool-start marker is in progress. This mitigates antirez#48 without adding constrained decoding. Oversized tool arguments can still reach the hard context limit.

gmontana mentioned this pull request May 12, 2026

Context Window Exhaustion Causes Unterminated Tool Calls #48

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

server: reserve context budget for DSML tool calls#105

server: reserve context budget for DSML tool calls#105
gmontana wants to merge 1 commit into
antirez:mainfrom
gmontana:fix/tool-call-context-budget

gmontana commented May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

gmontana commented May 12, 2026

Summary

Notes

Validation

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant