
[BUG] LiteLLM: Requested token count exceeds the model's maximum context length #12279

@robinchrist

Description


Problem (one or two sentences)

Certain models don't work via our LiteLLM Proxy because requests fail with a "Requested token count exceeds the model's maximum context length" error.

Context (who is affected and when)

I see this when using Roo Code via our LiteLLM Proxy.

Reproduction steps

  1. Configure a model behind our LiteLLM Proxy (here deepseek-ai/DeepSeek-V4-Pro served via Together AI) and select it in Roo Code
  2. Start any task with that model (a request sketch reproducing the failure follows this list)
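
Roo Code talks to the LiteLLM Proxy through its OpenAI-compatible endpoint, so the failing request can be approximated outside the extension. A minimal sketch, assuming a proxy at http://localhost:4000 and a placeholder API key (both hypothetical); the model name and max_tokens value come from the error log below:

```python
from openai import OpenAI

# Point the OpenAI client at the LiteLLM Proxy (URL and key are placeholders).
client = OpenAI(base_url="http://localhost:4000/v1", api_key="sk-placeholder")

# Requesting the model's full 512000-token context window as the completion
# budget leaves no room for the input messages, which triggers the 400 below.
client.chat.completions.create(
    model="deepseek-ai/DeepSeek-V4-Pro",
    messages=[{"role": "user", "content": "any prompt"}],
    max_tokens=512000,
)
```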

Expected result

The task should be executed

Actual result

The task is not executed; the request fails with the error shown in the logs below.

Variations tried (optional)

No response

App Version

v3.53.0

API Provider (optional)

LiteLLM

Model Used (optional)

No response

Roo Code Task Links (optional)

No response

Relevant logs or errors (optional)

LiteLLM streaming error: 400
litellm.BadRequestError:
Together_aiException - Requested token count exceeds the model's maximum context length of 512000 tokens.
You requested a total of 521014 tokens: 9014 tokens from the input messages and 512000 tokens for the completion.
Please reduce the number of tokens in the input messages or the completion to fit within the limit.
No fallback model group found for original model_group=deepseek-ai/DeepSeek-V4-Pro. 
Fallbacks=[{'MiniMaxAI/MiniMax-M2.5': ['MiniMaxAI/MiniMax-M2.7']}]. Received Model Group=deepseek-ai/DeepSeek-V4-Pro
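
For reference, the numbers in the log imply that the completion budget has to be capped at the context window minus the input tokens. A minimal sketch of that clamping (the function name and margin parameter are hypothetical, not Roo Code's actual implementation):

```python
def clamp_max_tokens(context_window: int, input_tokens: int,
                     requested_max: int, margin: int = 0) -> int:
    """Cap the completion budget so input + completion fits the context window."""
    available = max(context_window - input_tokens - margin, 1)
    return min(requested_max, available)

# With the values from the log: a 512000-token window, 9014 input tokens,
# and 512000 requested -> 502986, which would fit within the limit.
print(clamp_max_tokens(512000, 9014, 512000))
```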
