
feat: Enhance LLM configuration and routing with model profile attachment #869

Merged
MODSetter merged 1 commit into main from dev on Mar 11, 2026
Conversation

MODSetter (Owner) commented Mar 11, 2026

  • Added _attach_model_profile function to attach model context metadata to ChatLiteLLM.
  • Updated create_chat_litellm_from_config and create_chat_litellm_from_agent_config to utilize the new profile attachment.
  • Improved context profile caching in llm_router_service.py to include both minimum and maximum input tokens, along with token model names for better context management.
  • Introduced new methods for token counting and context trimming based on model profiles.

Description

Motivation and Context

FIX #

Screenshots

API Changes

  • This PR includes API changes

Change Type

  • Bug fix
  • New feature
  • Performance improvement
  • Refactoring
  • Documentation
  • Dependency/Build system
  • Breaking change
  • Other (specify):

Testing Performed

  • Tested locally
  • Manual/QA verification

Checklist

  • Follows project coding standards and conventions
  • Documentation updated as needed
  • Dependencies updated as needed
  • No lint/build errors or new warnings
  • All relevant tests are passing

High-level PR Summary

This PR enhances LLM configuration and routing by introducing model profile attachment and context-aware message trimming. The changes add an _attach_model_profile function that captures model context metadata (such as max_input_tokens) from LiteLLM's model info and attaches it to ChatLiteLLM instances. The router service is updated to cache both minimum and maximum input token limits across all deployments, enabling smarter context management. Most significantly, the PR introduces context trimming logic that uses binary search to truncate large messages (especially tool responses and document context) when they exceed the model's context window. The trimmer prefers XML document boundaries for cleaner cuts and preserves system messages so that agent instructions stay intact.
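The binary-search trimming described above can be sketched as follows. This is an illustrative assumption, not the PR's implementation: a whitespace split stands in for the model-profile token counter, and `</document>` stands in for whatever XML boundary tag the real code prefers.

```python
# Hypothetical sketch of binary-search context trimming. count_tokens is a
# whitespace-split stand-in for a real tokenizer; "</document>" is an
# assumed XML boundary tag.
def count_tokens(text: str) -> int:
    return len(text.split())


def trim_to_budget(text: str, max_tokens: int) -> str:
    """Binary-search the longest prefix of `text` within the token
    budget, then back off to the last closing document tag so the cut
    lands on an XML document boundary when possible."""
    if count_tokens(text) <= max_tokens:
        return text
    lo, hi = 0, len(text)
    while lo < hi:
        mid = (lo + hi + 1) // 2
        if count_tokens(text[:mid]) <= max_tokens:
            lo = mid  # prefix fits; try a longer one
        else:
            hi = mid - 1  # prefix too large; shrink
    cut = text[:lo]
    # Prefer a clean cut at the last complete document, if one fits.
    boundary = cut.rfind("</document>")
    if boundary != -1:
        cut = cut[: boundary + len("</document>")]
    return cut


doc = ("<document> " + "word " * 50 + "</document> "
       "<document> " + "word " * 50 + "</document>")
trimmed = trim_to_budget(doc, 60)
print(trimmed.endswith("</document>"))  # True
```

Because the search runs over character offsets with a token count at each probe, it finds the largest fitting prefix in O(log n) tokenizer calls rather than re-counting after every removed chunk; system messages would simply be excluded from the trimmable text before this routine runs.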

⏱️ Estimated Review Time: 30-90 minutes

💡 Review Order Suggestion
  1. surfsense_backend/app/agents/new_chat/llm_config.py
  2. surfsense_backend/app/services/llm_router_service.py


vercel bot commented Mar 11, 2026

The latest updates on your projects:
  • Project: surf-sense-frontend
  • Deployment: Building (Preview)
  • Updated (UTC): Mar 11, 2026 1:19am


@MODSetter MODSetter merged commit 1ab5640 into main Mar 11, 2026
10 of 13 checks passed
recurseml bot left a comment

Review by RecurseML

🔍 Review performed on 5571e8a..eec4db4

✨ No bugs found, your code is sparkling clean

✅ Files analyzed, no issues (1)

surfsense_backend/app/services/llm_router_service.py
