feat: Configurable model usage#386

Open
galshubeli wants to merge 5 commits into staging from dynemic-model

Conversation

@galshubeli
Collaborator

No description provided.

@galshubeli galshubeli self-assigned this Feb 5, 2026
@overcut-ai

overcut-ai bot commented Feb 5, 2026

Completed Working on "Code Review"

✅ Workflow completed successfully.


👉 View complete log

@railway-app

railway-app bot commented Feb 5, 2026

🚅 Deployed to the QueryWeaver-pr-386 environment in queryweaver

Service | Status | Web | Updated (UTC)
QueryWeaver | ❌ Build Failed (View Logs) | Web | Feb 5, 2026 at 1:10 pm

@coderabbitai
Contributor

coderabbitai bot commented Feb 5, 2026

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.


@github-actions

github-actions bot commented Feb 5, 2026

Dependency Review

✅ No vulnerabilities, license issues, or OpenSSF Scorecard issues found.

Scanned Files

None


    except Exception as e:  # pylint: disable=broad-except
        error_msg = str(e)
        logging.warning("%s API key validation failed: %s", vendor.capitalize(), error_msg)

Check failure

Code scanning / CodeQL

Log Injection (High)

This log entry depends on a user-provided value.

Copilot Autofix

AI 23 days ago

In general, to fix log injection you should sanitize or validate any user-provided values before logging them, removing or escaping newline and other control characters that could break log structure. Ideally, you also constrain such values to an expected set (e.g., known vendor identifiers) and log a safe fallback if the input is unexpected.

For this specific case, the minimal, non-breaking fix is to sanitize vendor (or a copy used for logging) by stripping or replacing newline and carriage-return characters (and optionally other control characters) before using it in the log message. This keeps all existing behavior (functionality and log content) the same for normal, valid inputs and just normalizes malicious or malformed values into a single-line safe string.

Concretely:

  • Introduce a small helper inside validate_api_key to sanitize a string for logging, or just sanitize vendor right before logging.
  • For example, create a local variable safe_vendor_for_log from vendor that replaces \r and \n with empty strings (or spaces), and maybe trims leading/trailing whitespace.
  • Use safe_vendor_for_log.capitalize() in the logging.warning call on line 76 instead of vendor.capitalize().

All changes occur within api/routes/settings.py in the shown function. No new imports are required since we can use built-in string methods.

Suggested changeset 1
api/routes/settings.py

Autofix patch

Run the following command in your local git repository to apply this patch
cat << 'EOF' | git apply
diff --git a/api/routes/settings.py b/api/routes/settings.py
--- a/api/routes/settings.py
+++ b/api/routes/settings.py
@@ -73,7 +73,9 @@
             
     except Exception as e:  # pylint: disable=broad-except
         error_msg = str(e)
-        logging.warning("%s API key validation failed: %s", vendor.capitalize(), error_msg)
+        # Sanitize vendor before logging to prevent log injection via control characters
+        safe_vendor_for_log = vendor.replace("\r", "").replace("\n", "").strip()
+        logging.warning("%s API key validation failed: %s", safe_vendor_for_log.capitalize(), error_msg)
         
         # Check for common error messages
         if "invalid" in error_msg.lower() or "authentication" in error_msg.lower():
EOF
            )
        else:
            return JSONResponse(
                content={"valid": False, "error": f"Failed to validate API key: {error_msg}"},

Check warning

Code scanning / CodeQL

Information exposure through an exception (Medium)

Stack trace information flows to this location and may be exposed to an external user. (CodeQL reports four similar flow paths.)

Copilot Autofix

AI 23 days ago

In general, to fix this class of issue the server should never send raw exception messages or stack traces to clients. Instead, it should (a) log the full error message and stack trace on the server side for diagnostics, and (b) return a generic, user-friendly error string or at most a coarse error category.

For this specific function, the best fix without changing behavior for known cases is:

  • Keep logging the detailed error_msg (and ideally a stack trace) to the server logs.
  • Preserve the existing tailored responses for “invalid/authentication” and “quota/rate” error patterns.
  • For all other errors (the else branch), stop including error_msg in the JSON response. Replace it with a generic message like "Failed to validate API key due to an internal error" or similar, so the client no longer sees provider/internal details.
  • Optionally, enhance logging to include the full stack trace with logging.exception so that removing detail from the response does not degrade debuggability.

Concretely, in api/routes/settings.py:

  • In the except Exception as e block, replace the final JSONResponse (lines 90–92) so that the error field is a fixed generic string and does not interpolate error_msg.
  • Optionally, also change the logging call to logging.exception to record a stack trace, while not affecting the response.

No new imports are strictly necessary; we can reuse the existing logging module.

Suggested changeset 1
api/routes/settings.py

Autofix patch

Run the following command in your local git repository to apply this patch
cat << 'EOF' | git apply
diff --git a/api/routes/settings.py b/api/routes/settings.py
--- a/api/routes/settings.py
+++ b/api/routes/settings.py
@@ -73,7 +73,7 @@
             
     except Exception as e:  # pylint: disable=broad-except
         error_msg = str(e)
-        logging.warning("%s API key validation failed: %s", vendor.capitalize(), error_msg)
+        logging.exception("%s API key validation failed: %s", vendor.capitalize(), error_msg)
         
         # Check for common error messages
         if "invalid" in error_msg.lower() or "authentication" in error_msg.lower():
@@ -88,6 +88,9 @@
             )
         else:
             return JSONResponse(
-                content={"valid": False, "error": f"Failed to validate API key: {error_msg}"},
+                content={
+                    "valid": False,
+                    "error": "Failed to validate API key due to an internal server error",
+                },
                 status_code=500
             )
EOF
Contributor

Copilot AI left a comment

Pull request overview

This PR adds configurable AI model usage to QueryWeaver, allowing users to select and validate their preferred AI provider (OpenAI, Google Gemini, or Anthropic) through the settings UI. The custom API keys and models are stored in memory only and passed to all LLM agents for query processing.

Changes:

  • Added frontend settings context and UI for configuring AI vendor, API key, and model name with validation
  • Implemented backend API endpoint for validating API keys across multiple providers
  • Updated all AI agents to accept and use custom API keys and models
  • Enhanced configuration logic to support multiple AI providers with automatic fallback

Reviewed changes

Copilot reviewed 19 out of 20 changed files in this pull request and generated 4 comments.

Show a summary per file
File | Description
app/src/types/api.ts | Added optional custom API configuration fields to ChatRequest interface
app/src/services/chat.ts | Modified to include custom API settings in chat requests with vendor-specific prefixing
app/src/pages/Settings.tsx | Added comprehensive AI model configuration UI with validation and provider selection
app/src/contexts/SettingsContext.tsx | Created new context for managing AI provider settings in memory
app/src/components/modals/SettingsModal.tsx | Completely refactored to support AI model configuration instead of query rules
app/src/components/layout/Sidebar.tsx | Changed settings icon from Settings to Sliders
app/src/components/chat/ChatInterface.tsx | Integrated custom AI settings into chat query requests
app/src/App.tsx | Added SettingsProvider to application context hierarchy
api/routes/settings.py | Created API endpoint for validating AI provider API keys
api/core/text2sql.py | Updated to extract and pass custom API settings to all agents
api/config.py | Enhanced to support multiple AI providers with priority-based fallback logic
api/app_factory.py | Registered new settings router
api/agents/utils.py | Updated BaseAgent to accept custom API key and model parameters
api/agents/response_formatter_agent.py | Modified to use custom API settings when provided
api/agents/relevancy_agent.py | Modified to use custom API settings when provided
api/agents/follow_up_agent.py | Modified to use custom API settings when provided
api/agents/analysis_agent.py | Modified to use custom API settings when provided
README.md | Updated documentation to reflect multi-provider support
.env.example | Updated with comprehensive AI provider configuration examples

Files not reviewed (1)
  • app/package-lock.json: Language not supported

export const SettingsProvider: React.FC<SettingsProviderProps> = ({ children }) => {
const [vendor, setVendor] = useState<AIVendor>('openai');
const [apiKey, setApiKey] = useState<string | null>(null);
const [modelName, setModelName] = useState<string>('gpt-4o-mini');
Copilot AI Feb 5, 2026

The default model name 'gpt-4o-mini' is inconsistent with the example models shown elsewhere in the codebase (e.g., 'gpt-4.1' in AI_VENDORS). This inconsistency could confuse users about which model names are valid.

Suggested change
const [modelName, setModelName] = useState<string>('gpt-4o-mini');
const [modelName, setModelName] = useState<string>('gpt-4.1');

setRules("");
setTempVendor('openai');
setTempApiKey('');
setTempModelName('gpt-4o');
Copilot AI Feb 5, 2026

The handleClear function sets the model name to 'gpt-4o', which is inconsistent with other default model names used throughout the codebase ('gpt-4.1', 'gpt-4o-mini'). Standardize on a single valid default model name.

Suggested change
setTempModelName('gpt-4o');
setTempModelName('gpt-4.1');

Comment on lines +256 to +257
custom_api_key = chat_data.custom_api_key if hasattr(chat_data, 'custom_api_key') else None
custom_model = chat_data.custom_model if hasattr(chat_data, 'custom_model') else None
Copilot AI Feb 5, 2026

Using hasattr for Pydantic model attributes is unnecessary since ChatRequest already defines these fields with default None values. Access them directly as chat_data.custom_api_key and chat_data.custom_model for cleaner code.

Suggested change
custom_api_key = chat_data.custom_api_key if hasattr(chat_data, 'custom_api_key') else None
custom_model = chat_data.custom_model if hasattr(chat_data, 'custom_model') else None
custom_api_key = chat_data.custom_api_key
custom_model = chat_data.custom_model

Comment on lines +728 to +729
custom_api_key = confirm_data.custom_api_key if hasattr(confirm_data, 'custom_api_key') else None
custom_model = confirm_data.custom_model if hasattr(confirm_data, 'custom_model') else None
Copilot AI Feb 5, 2026

Using hasattr for Pydantic model attributes is unnecessary since ConfirmRequest defines these fields with default None values. Access them directly for cleaner code.

@overcut-ai overcut-ai bot left a comment

Findings summary:

  • Importance counts: Major (9)

Key themes:

  1. Custom model/credential handling is inconsistent—the confirm flow drops user-provided keys/models and both the settings modal/page overwrite saved model names when opened.
  2. Vendor prefix logic double-prefixes Gemini (and other) models in the UI and chat service, so validated models never reach the backend intact.
  3. The new configuration paths have unsafe defaults: Anthropic-only setups fall back to Azure embeddings without credentials, and the /api/validate-api-key endpoint is unauthenticated and unthrottled.

Next steps:

  • Restore per-request credentials end-to-end (types, ChatInterface, ChatService) and ensure UI state preserves user-entered models instead of resetting to defaults.
  • Fix vendor-prefix construction so values already containing a provider aren’t rewritten, and align the documented model format with what the backend expects.
  • Harden backend configuration: only select embeddings when credentials exist, and require authentication + rate limiting before invoking litellm for key validation.

    EMBEDDING_MODEL_NAME = "voyage/voyage-3"
else:
    # Anthropic has no native embeddings, fall back to Azure embeddings
    EMBEDDING_MODEL_NAME = "azure/text-embedding-ada-002"

[major]: When only ANTHROPIC_API_KEY is configured, the new fallback forces EMBEDDING_MODEL_NAME = "azure/text-embedding-ada-002" (lines 96‑104) even if no Azure credentials are present. That makes the documented Anthropic-only setup impossible: every embedding call will now hit Azure without a key and fail at runtime, despite the user explicitly choosing Anthropic. Before selecting the Azure embedding model, ensure the Azure env vars exist, or fall back to a provider that actually has credentials (e.g., Voyage only when VOYAGE_API_KEY is set).

use_user_rules: useRulesFromDatabase, // Backend fetches from DB when true
customApiKey: isApiKeyValid ? apiKey : undefined,
customModel: isApiKeyValid ? modelName : undefined,
customVendor: isApiKeyValid ? vendor : undefined,

[major]: The confirm flow never forwards the user’s custom credentials. streamConfirmOperation is still called without customApiKey/customModel/customVendor, and the ConfirmRequest type doesn’t have those fields, so destructive queries and confirmation retries run with the server default key/model even when the initial chat used a custom provider. To keep confirmation consistent (and avoid executing under the wrong account or failing because the vendor/model mismatch), extend ConfirmRequest, ChatInterface, and ChatService to pass through the optional custom credentials just like the initial chat request.
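One way to thread the credentials through the confirm flow is sketched below. The optional field names mirror the chat payload shown above; the `ConfirmRequest` additions and `buildConfirmRequest` helper are assumptions for illustration, not the PR's actual code.

```typescript
// Hypothetical extension of ConfirmRequest with the same optional
// credential fields the initial chat request already carries.
interface ConfirmRequest {
  confirmationId: string;
  customApiKey?: string;
  customModel?: string;
  customVendor?: string;
}

// Attach the custom credentials only when the key has been validated,
// matching the initial chat request's behavior.
function buildConfirmRequest(
  confirmationId: string,
  isApiKeyValid: boolean,
  apiKey: string,
  modelName: string,
  vendor: string,
): ConfirmRequest {
  return {
    confirmationId,
    customApiKey: isApiKeyValid ? apiKey : undefined,
    customModel: isApiKeyValid ? modelName : undefined,
    customVendor: isApiKeyValid ? vendor : undefined,
  };
}
```

ChatInterface and ChatService would then build the confirm payload the same way as the chat payload, so a destructive query confirmed later still runs under the provider the user configured.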



@settings_router.post("/validate-api-key")
async def validate_api_key(request: Request, data: ValidateKeyRequest):

[major]: /api/validate-api-key is exposed without any authentication guard, so unauthenticated traffic can trigger arbitrary LiteLLM calls and abuse your infrastructure. The rest of the settings surface requires a token/session, but this route skips token_required (or any equivalent dependency). Please enforce authentication before issuing upstream validation requests to prevent anonymous DoS and keep the endpoint consistent with the rest of the API.

)
# Note: 'gemini' is accepted as vendor (Google's LiteLLM prefix)

try:

[major]: Every POST to /api/validate-api-key results in a LiteLLM completion with user‑supplied credentials, but there’s no throttling or caching. An attacker can hammer this unauthenticated endpoint to burn your upstream quota or tie up the server doing proxy calls. Please add per-user/IP rate limiting (or reuse your existing quota middleware) and short‑circuit repeated failures so validation attempts can’t be abused for free resource exhaustion.

const vendorConfig = getVendorConfig(tempVendor);
// Auto-update model name when vendor changes
if (vendorConfig) {
setTempModelName(vendorConfig.exampleModel);

[major]: The useEffect tied to tempVendor always runs setTempModelName(vendorConfig.exampleModel) (lines 58‑62), so opening the modal or focusing the vendor select overwrites any previously saved custom model value with the default example. As a result, users can’t persist their own model names—saving after reopening will always revert to the default. Only auto-fill when the user actually changes vendors and the model field is empty, or track the previous vendor to avoid clobbering existing selections.
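The suggested guard can be isolated in a pure helper that the effect calls before overwriting state; `nextModelName` is an illustrative name and this is a sketch under the stated assumptions, not the PR's code.

```typescript
// Decide what the model input should hold after a vendor state change:
// reset to the vendor's example model only when the vendor actually
// changed or the field is empty; otherwise preserve the user's value.
function nextModelName(
  prevVendor: string | null,
  newVendor: string,
  currentModel: string,
  exampleModel: string,
): string {
  const vendorChanged = prevVendor !== null && prevVendor !== newVendor;
  if (vendorChanged || currentModel.trim() === '') {
    return exampleModel;
  }
  return currentModel;
}
```

The effect would track the previous vendor in a ref and pass it in, so reopening the modal (where the vendor is unchanged) no longer clobbers a saved model.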


try {
// Map vendor to LiteLLM prefix (google -> gemini)
const vendorPrefix = vendorType === 'google' ? 'gemini' : vendorType;

[MAJOR]: vendorPrefix rewrites Google to gemini, yet the modal already asks users to enter the full LiteLLM model name (e.g. gemini/gemini-3-pro-preview). Saving or validating with a Gemini key therefore produces vendor: "gemini" but model: "gemini/gemini-…", which the backend rejects because the provider segment is duplicated. Either stop overriding the vendor when the model already contains a prefix or split the input so you don’t double-prepend gemini/. As written, Gemini validations always fail.

useEffect(() => {
const vendorConfig = AI_VENDORS.find(v => v.value === tempVendor);
if (vendorConfig) {
setTempModelName(vendorConfig.exampleModel);

[MAJOR]: This effect resets tempModelName to the vendor’s example model whenever tempVendor changes, including when the component simply loads the persisted settings. If a user previously saved gpt-4o-mini and returns later, the state change from openai → openai still reruns the effect and overwrites the field before they click Save. The practical result is that any custom model is silently lost. Only initialize to the example when no user value is present or when the user explicitly switches vendors.

setValidationStatus(null);

try {
const vendorPrefix = vendorType === 'google' ? 'gemini' : vendorType;

[MAJOR]: The validation call forces const vendorPrefix = vendorType === 'google' ? 'gemini' : vendorType and then sends model: tempModelName. Because the UI instructs users to enter the full LiteLLM model string (already prefixed with gemini/), every Gemini validation ends up posting gemini/gemini-… to the backend, which doesn’t match the provider-derived vendor. That mismatch makes all Google validations fail even with correct keys. Either accept provider-less suffixes in the UI or detect existing prefixes and avoid double-prepending gemini/.

// Map vendor to LiteLLM prefix (google -> gemini)
const vendorPrefix = request.customVendor === 'google' ? 'gemini' : request.customVendor;
// Format model name with vendor prefix for LiteLLM
requestBody.custom_model = `${vendorPrefix}/${request.customModel}`;

[MAJOR]: custom_model is built as ${vendorPrefix}/${request.customModel} even though the Settings UI already captures the full LiteLLM identifier (e.g. openai/gpt-4o-mini). This produces openai/openai/… for OpenAI and gemini/gemini/… for Google, so the backend rejects the model and silently falls back to defaults. Please send the user-provided model verbatim when it already contains /, or change the UI to collect only the suffix so you can safely prepend the prefix once.
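Several of these findings reduce to the same bug, which a single prefix-aware helper would eliminate. A minimal sketch follows; `formatModelForLiteLLM`, `toLiteLLMPrefix`, and `VendorType` are illustrative names, not code from the PR.

```typescript
// Vendors the settings UI currently offers.
type VendorType = 'openai' | 'google' | 'anthropic';

// LiteLLM routes Google models under the "gemini/" prefix.
function toLiteLLMPrefix(vendor: VendorType): string {
  return vendor === 'google' ? 'gemini' : vendor;
}

// Prepend the provider prefix only when the user has not already
// supplied a full LiteLLM identifier (detected by the "/" separator),
// so an already-prefixed model string is sent verbatim.
function formatModelForLiteLLM(vendor: VendorType, model: string): string {
  if (model.includes('/')) {
    return model;
  }
  return `${toLiteLLMPrefix(vendor)}/${model}`;
}
```

With a helper like this, chat.ts, Settings.tsx, and SettingsModal.tsx would all call the same function instead of rebuilding the prefix locally, and double-prefixed values like gemini/gemini-… could no longer be produced.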


try {
// Map vendor to LiteLLM prefix (google -> gemini)
const vendorPrefix = vendorType === 'google' ? 'gemini' : vendorType;

[MAJOR]: vendorPrefix rewrites Google to gemini, yet the modal already asks users to enter the full LiteLLM model name (e.g. gemini/gemini-3-pro-preview). Saving or validating with a Gemini key therefore produces vendor: "gemini" but model: "gemini/gemini-…", which the backend rejects because the provider segment is duplicated. Either stop overriding the vendor when the model already contains a prefix or split the input so you don’t double-prepend gemini/. As written, Gemini validations always fail.

@galshubeli galshubeli changed the title from "Configurable model usage" to "feat: Configurable model usage" Feb 16, 2026
@galshubeli
Collaborator Author

@copilot add description to the pr

Contributor

Copilot AI commented Feb 16, 2026

@galshubeli I've opened a new pull request, #403, to work on those changes. Once the pull request is ready, I'll request review from you.

@galshubeli
Collaborator Author

Good feature idea — runtime model selection is valuable.

Must fix:

  1. Centralize vendor mapping — the google → gemini prefix mapping appears in 5 different files (SettingsContext, SettingsModal, Settings.tsx, ChatInterface, chat.ts). Extract it into a single utility. Same for the AI_VENDORS config array — it's duplicated between SettingsModal and Settings.tsx.

  2. Pick one settings UI: SettingsModal and Settings.tsx have near-identical AI configuration forms. Either extract a shared component or remove one.

  3. Default model mismatch: SettingsContext defaults to gpt-4o-mini but clearSettings() resets to gpt-4.1. These should be consistent.

Should fix:

  1. Add early validation: custom_model accepts any string and only fails at the litellm call. Validate model format and provider/key compatibility upfront in the settings route.
  2. Clarify persistence story — model choice is per-request only with no server-side persistence. Document this clearly or add session-level storage so users don't have to reconfigure on every page refresh.
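Must-fix items 1 and 3 could share one solution: a single vendors module that both settings surfaces and the context import. The module shape and model strings below are illustrative assumptions, not the PR's code.

```typescript
// Hypothetical single source of truth for vendor metadata;
// SettingsContext, SettingsModal, and Settings.tsx would all import
// from here instead of keeping private copies.
interface VendorConfig {
  value: string;
  label: string;
  exampleModel: string;
}

const AI_VENDORS: VendorConfig[] = [
  { value: 'openai', label: 'OpenAI', exampleModel: 'gpt-4.1' },
  { value: 'google', label: 'Google Gemini', exampleModel: 'gemini-1.5-pro' },
  { value: 'anthropic', label: 'Anthropic', exampleModel: 'claude-3-5-sonnet' },
];

// Both the initial context state and clearSettings() would call this,
// so the default cannot drift between gpt-4o-mini, gpt-4o, and gpt-4.1.
function defaultModelFor(vendor: string): string {
  const config = AI_VENDORS.find((v) => v.value === vendor);
  return config ? config.exampleModel : AI_VENDORS[0].exampleModel;
}
```

Keeping the table and the default in one module makes the consistency fix mechanical: every surface reads the same array, and clearing settings reuses the same lookup as initialization.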
