
feat(cache): add provider prompt cache planner #1426

Merged
zerob13 merged 3 commits into dev from codex/cache-control
Apr 2, 2026

Conversation

Collaborator

zerob13 commented Apr 2, 2026

Summary by CodeRabbit

  • New Features

    • Prompt caching across multiple LLM providers to improve repeated-request performance and reduce cost.
    • Token usage reporting now includes cache-write metrics; cost estimates account for cached reads and cache writes.
  • UI

    • Provider settings: Base URL can appear locked with an explicit “Modify”/unlock flow to safely change it.
  • Localization

    • Added translations for the provider Base URL unlock/modify flow.

Contributor

coderabbitai bot commented Apr 2, 2026

📝 Walkthrough


Adds prompt-cache detection and strategy, integrates cache keys/breakpoints across multiple LLM providers, propagates cache-read/write token accounting through streaming events, updates usage costing and DB schema, and adjusts presenter/session modelConfig to include conversationId.

Changes

Cohort / File(s) Summary
Cache Strategy & Capabilities
src/main/presenter/llmProviderPresenter/promptCacheCapabilities.ts, src/main/presenter/llmProviderPresenter/promptCacheStrategy.ts
New types and logic to resolve prompt-cache mode per provider/model and produce/apply cache plans (implicit/auto/explicit), TTLs, cache keys, and explicit breakpoint transformations.
Provider Integrations
src/main/presenter/llmProviderPresenter/providers/...
providers/anthropicProvider.ts, providers/awsBedrockProvider.ts, providers/openAICompatibleProvider.ts, providers/openAIResponsesProvider.ts, providers/zenmuxProvider.ts
Integrated prompt-cache plan application into request flows, mutated request payloads (prompt_cache_key, cache_control, explicit breakpoints), and made usage parsing cache-aware (cached_tokens, cache_write_tokens). Zenmux routing now delegates to the Anthropic or OpenAI path per model.
DeepChat Stream & Accumulation
src/main/presenter/deepchatAgentPresenter/accumulator.ts, src/main/presenter/deepchatAgentPresenter/index.ts, src/main/presenter/deepchatAgentPresenter/process.ts
Accumulator now records cacheWriteInputTokens; runStreamForMessage includes conversationId in modelConfig; usage snapshot builder forwards cache-write token field.
Usage Accounting & Costing
src/main/presenter/usageStats.ts
Added cacheWriteInputTokens to usage records and normalization; cost estimation now accounts separately for cache-write tokens with fallback pricing behavior.
DB Schema & Persistence
src/main/presenter/sqlitePresenter/tables/deepchatUsageStats.ts, src/main/presenter/newAgentPresenter/index.ts
Added cache_write_input_tokens column (schema bumped to v22), migration to add column if missing, and upsert/backfill propagation of cache-write tokens.
Types & Events
src/shared/types/agent-interface.d.ts, src/shared/types/core/llm-events.ts
Added cacheWriteInputTokens?: number to MessageMetadata and optional cache_write_tokens?: number to streamed UsageStreamEvent and its factory.
Tests
test/main/presenter/**
test/main/presenter/llmProviderPresenter/*, promptCacheStrategy.test.ts, awsBedrockProvider.test.ts, anthropicProvider.test.ts, openAI*.test.ts, zenmuxProvider.test.ts, usageStats.test.ts, sqlitePresenter.test.ts
Extensive new/updated tests validating prompt-cache plan resolution, request mutations, streaming usage normalization (cached & cache_write tokens), cost estimation, and DB migration/data preservation.
Renderer: Provider Base URL UI & i18n
src/renderer/settings/components/ProviderApiConfig.vue, src/renderer/src/i18n/*/settings.json, src/types/i18n.d.ts
Added locked-base-URL unlock flow in component and new i18n keys across locales; updated i18n typings.
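To make the cache-plan shape described above concrete, here is a minimal TypeScript sketch of what a per-provider resolver might look like. The type fields, provider rules, and TTL values are illustrative assumptions, not the actual promptCacheStrategy.ts implementation:

```typescript
// Illustrative sketch only: shapes and per-provider rules are assumptions,
// not the real promptCacheCapabilities.ts / promptCacheStrategy.ts code.
type PromptCacheMode = 'implicit' | 'auto' | 'explicit' | 'none'

interface PromptCachePlan {
  mode: PromptCacheMode
  ttl?: '5m' | '1h'          // explicit-cache TTL, where the provider supports one
  cacheKey?: string          // routing hint for implicit caches
  breakpointCount?: number   // max cache_control breakpoints to place
}

// Resolve a plan per provider/model; conversationId seeds a stable cache key
// so repeated turns in one conversation land on the same prompt cache.
function resolvePromptCachePlan(
  providerId: string,
  modelId: string,
  conversationId: string
): PromptCachePlan {
  if (providerId === 'anthropic' || modelId.toLowerCase().startsWith('anthropic/')) {
    // Anthropic-style APIs need explicit cache_control breakpoints on content blocks.
    return { mode: 'explicit', ttl: '5m', breakpointCount: 4 }
  }
  if (providerId === 'openai') {
    // OpenAI caches implicitly; a prompt_cache_key only improves cache routing.
    return { mode: 'implicit', cacheKey: `conv:${conversationId}` }
  }
  return { mode: 'none' }
}
```

A provider integration would then branch on `plan.mode` to decide whether to set `prompt_cache_key` or to inject `cache_control` breakpoints into the outgoing messages.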

Sequence Diagram

sequenceDiagram
    participant Client
    participant Presenter as DeepChat Presenter
    participant CacheStrategy as PromptCacheStrategy
    participant Provider as LLM Provider
    participant LLMAPI as LLM API
    participant Usage as Usage Tracker

    Client->>Presenter: runStreamForMessage(message, sessionId)
    Presenter->>CacheStrategy: resolvePromptCachePlan(providerId, apiType, modelId, messages, conversationId)
    CacheStrategy-->>Presenter: PromptCachePlan (mode, ttl, cacheKey?, breakpointPlan?)
    Presenter->>Provider: sendRequest(messages, modelConfig)
    Provider->>CacheStrategy: applyPromptCache(plan, messages)
    CacheStrategy-->>Provider: cachedRequest (prompt_cache_key / modified messages / cache_control)
    Provider->>LLMAPI: chat.completions.create(cachedRequest)
    LLMAPI-->>Provider: stream (deltas + usage with cached_tokens, cache_write_tokens)
    Provider->>Presenter: emitStreamEvent(usage: {prompt_tokens, completion_tokens, cached_tokens?, cache_write_tokens?})
    Presenter->>Usage: accumulate(usage)
    Usage->>Usage: normalizeUsageCounts(...) & estimateUsageCostUsd(...)
    Usage-->>Presenter: finalized usage & cost
    Presenter->>Client: deliver stream events with cache metrics
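The normalization and costing step in the walkthrough could look roughly like the following sketch. Field names, the fallback-to-input-rate behavior, and the OpenAI-style assumption that prompt tokens include cached reads are illustrative guesses, not the real usageStats.ts code:

```typescript
// Hypothetical sketch of cache-aware cost estimation; not the actual
// estimateUsageCostUsd in usageStats.ts. Assumes OpenAI-style counts,
// i.e. inputTokens already includes tokens served from cache.
interface UsageCounts {
  inputTokens: number
  outputTokens: number
  cachedInputTokens?: number
  cacheWriteInputTokens?: number
}

interface Pricing {
  inputPerMTok: number
  outputPerMTok: number
  cacheReadPerMTok?: number   // typically a fraction of the input price
  cacheWritePerMTok?: number  // typically a premium over the input price
}

function estimateUsageCostUsd(usage: UsageCounts, price: Pricing): number {
  const cachedRead = usage.cachedInputTokens ?? 0
  const cacheWrite = usage.cacheWriteInputTokens ?? 0
  // Uncached input = total input minus tokens served from cache.
  const freshInput = Math.max(usage.inputTokens - cachedRead, 0)
  // Fallback pricing: if no cache rate is known, bill cache traffic at the input rate.
  const readRate = price.cacheReadPerMTok ?? price.inputPerMTok
  const writeRate = price.cacheWritePerMTok ?? price.inputPerMTok
  return (
    (freshInput * price.inputPerMTok +
      cachedRead * readRate +
      cacheWrite * writeRate +
      usage.outputTokens * price.outputPerMTok) /
    1_000_000
  )
}
```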

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Suggested labels

codex

Poem

🐰 Hooray, a caching spree!

I tuck my prompts beneath the tree,
Read a few, write a few with glee,
Streams keep hopping, tokens tallied right —
My burrow beams in morning light 🥕

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)
  • Docstring Coverage — ⚠️ Warning: docstring coverage is 3.33%, below the required threshold of 80.00%. Resolution: write docstrings for the functions missing them.

✅ Passed checks (2 passed)
  • Description Check — ✅ Passed: check skipped because CodeRabbit's high-level summary is enabled.
  • Title Check — ✅ Passed: the title accurately reflects the main feature, adding prompt cache planning capabilities across LLM providers, and directly relates to the extensive changes implementing cache strategies, integration, and UI features throughout the codebase.



Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
src/main/presenter/deepchatAgentPresenter/accumulator.ts (1)

166-172: ⚠️ Potential issue | 🟡 Minor

Guard optional cache_write_tokens before overwriting metadata.

At Line 171, assigning the optional field directly can erase a previously captured value if a later usage event omits it.

💡 Suggested fix
     case 'usage': {
       state.metadata.inputTokens = event.usage.prompt_tokens
       state.metadata.outputTokens = event.usage.completion_tokens
       state.metadata.totalTokens = event.usage.total_tokens
       state.metadata.cachedInputTokens = event.usage.cached_tokens
-      state.metadata.cacheWriteInputTokens = event.usage.cache_write_tokens
+      if (typeof event.usage.cache_write_tokens === 'number') {
+        state.metadata.cacheWriteInputTokens = event.usage.cache_write_tokens
+      }
       break
     }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/main/presenter/deepchatAgentPresenter/accumulator.ts` around lines 166 -
172, The assignment in the case 'usage' block unconditionally overwrites
state.metadata.cacheWriteInputTokens with event.usage.cache_write_tokens which
may be omitted in later usage events; update the case 'usage' handler in
accumulator.ts so you only set state.metadata.cacheWriteInputTokens when
event.usage.cache_write_tokens is defined (e.g., check for !== undefined or
hasOwnProperty) and leave the existing metadata value unchanged otherwise,
keeping other assignments
(inputTokens/outputTokens/totalTokens/cachedInputTokens) as-is.
🧹 Nitpick comments (1)
test/main/presenter/llmProviderPresenter/awsBedrockProvider.test.ts (1)

112-132: Test uses camelCase field names for cache tokens.

The mock uses cacheReadInputTokens and cacheWriteInputTokens (camelCase), while the actual Bedrock SDK uses cache_read_input_tokens and cache_creation_input_tokens (snake_case). This works because the implementation's getBedrockUsageNumber helper accepts both naming conventions.

Consider adding a complementary test with snake_case field names to ensure the production path is also covered.
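For illustration, a helper of the kind the comment describes might look like this; the signature is a guess, not the actual getBedrockUsageNumber:

```typescript
// Hypothetical sketch: reads a numeric usage field regardless of whether the
// payload uses camelCase (mock shape) or snake_case (Bedrock SDK shape).
function getBedrockUsageNumber(
  usage: Record<string, unknown>,
  ...keys: string[]
): number | undefined {
  for (const key of keys) {
    const value = usage[key]
    if (typeof value === 'number' && Number.isFinite(value)) return value
  }
  return undefined
}

// Both naming conventions resolve to the same value:
const camel = { cacheReadInputTokens: 42 }
const snake = { cache_read_input_tokens: 42 }
const read = (u: Record<string, unknown>) =>
  getBedrockUsageNumber(u, 'cacheReadInputTokens', 'cache_read_input_tokens')
```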

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/main/presenter/llmProviderPresenter/awsBedrockProvider.test.ts` around
lines 112 - 132, Add a complementary test case that uses the Bedrock SDK's
snake_case cache token fields so the real production shape is covered: duplicate
the existing test that builds chunks via createBedrockChunk (and that exercises
getBedrockUsageNumber indirectly) but replace the usage object fields
cacheReadInputTokens/cacheWriteInputTokens with cache_read_input_tokens and
cache_creation_input_tokens, then assert the same aggregated token totals/output
as the camelCase test to ensure both naming conventions are handled.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/main/presenter/newAgentPresenter/index.ts`:
- Around line 2215-2218: The backfill is overwriting parsed metadata by setting
cacheWriteInputTokens to a hardcoded 0; update the object construction in
newAgentPresenter (the spread of metadata and the subsequent fields) to preserve
any existing value by using the parsed metadata property with a safe fallback
(e.g., cacheWriteInputTokens: metadata.cacheWriteInputTokens ?? 0) instead of 0
so historical cache-write usage is not lost; do the same pattern if
cachedInputTokens should also preserve metadata.cachedInputTokens.
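The preserve-with-fallback pattern this prompt asks for might be sketched as follows; the interface and function name are hypothetical, not the newAgentPresenter code:

```typescript
// Hypothetical backfill shape: preserve parsed metadata with a safe fallback
// instead of hardcoding 0, so historical cache usage is not lost.
interface ParsedMetadata {
  cachedInputTokens?: number
  cacheWriteInputTokens?: number
}

function buildBackfillRow(metadata: ParsedMetadata) {
  return {
    ...metadata,
    cachedInputTokens: metadata.cachedInputTokens ?? 0,
    cacheWriteInputTokens: metadata.cacheWriteInputTokens ?? 0
  }
}
```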


ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 728b1b14-806a-46f1-b84a-cb800037ccb7

📥 Commits

Reviewing files that changed from the base of the PR and between ebe12d2 and 688197d.

📒 Files selected for processing (24)
  • src/main/presenter/deepchatAgentPresenter/accumulator.ts
  • src/main/presenter/deepchatAgentPresenter/index.ts
  • src/main/presenter/deepchatAgentPresenter/process.ts
  • src/main/presenter/llmProviderPresenter/promptCacheCapabilities.ts
  • src/main/presenter/llmProviderPresenter/promptCacheStrategy.ts
  • src/main/presenter/llmProviderPresenter/providers/anthropicProvider.ts
  • src/main/presenter/llmProviderPresenter/providers/awsBedrockProvider.ts
  • src/main/presenter/llmProviderPresenter/providers/openAICompatibleProvider.ts
  • src/main/presenter/llmProviderPresenter/providers/openAIResponsesProvider.ts
  • src/main/presenter/newAgentPresenter/index.ts
  • src/main/presenter/sqlitePresenter/tables/deepchatUsageStats.ts
  • src/main/presenter/usageStats.ts
  • src/shared/types/agent-interface.d.ts
  • src/shared/types/core/llm-events.ts
  • test/main/presenter/deepchatAgentPresenter/accumulator.test.ts
  • test/main/presenter/deepchatAgentPresenter/messageStore.test.ts
  • test/main/presenter/llmProviderPresenter/anthropicProvider.test.ts
  • test/main/presenter/llmProviderPresenter/awsBedrockProvider.test.ts
  • test/main/presenter/llmProviderPresenter/openAICompatibleProvider.test.ts
  • test/main/presenter/llmProviderPresenter/openAIResponsesProvider.test.ts
  • test/main/presenter/llmProviderPresenter/promptCacheStrategy.test.ts
  • test/main/presenter/newAgentPresenter/usageDashboard.test.ts
  • test/main/presenter/sqlitePresenter.test.ts
  • test/main/presenter/usageStats.test.ts

Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 1

🧹 Nitpick comments (6)
test/main/presenter/newAgentPresenter/usageDashboard.test.ts (1)

518-524: Consider asserting cache_write_input_tokens in live update test.

The test explicitly sets cacheWriteInputTokens: 0 in the metadata (line 513) but the assertion doesn't verify this field was persisted. Adding it would strengthen test coverage for the live update path.

Proposed fix
     expect(row).toMatchObject({
       source: 'live',
       cached_input_tokens: 20,
+      cache_write_input_tokens: 0,
       input_tokens: 140,
       output_tokens: 60
     })
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/main/presenter/newAgentPresenter/usageDashboard.test.ts` around lines
518 - 524, The test for live updates should also assert that
cache_write_input_tokens was persisted: after fetching the row with
sqlitePresenter.deepchatUsageStatsTable.getByMessageId('message-1'), add an
expectation that the returned object includes cache_write_input_tokens: 0 (since
the test metadata set cacheWriteInputTokens: 0). This ensures the live update
path correctly maps the metadata field cacheWriteInputTokens to the stored
column cache_write_input_tokens.
src/main/presenter/llmProviderPresenter/providers/anthropicProvider.ts (1)

562-571: The non-streaming completion path does not propagate cached_tokens and cache_write_tokens in the response.

The buildAnthropicUsageSnapshot returns cached_tokens and cache_write_tokens when present, but resultResp.totalUsage only extracts the base fields. If downstream consumers of LLMResponse.totalUsage expect cache-related metrics for non-streaming calls, they won't receive them.

If this is intentional (e.g., only streaming paths need detailed cache stats), consider adding a brief comment clarifying the design decision. Otherwise, consider propagating:

♻️ Optional: propagate cache tokens in non-streaming response
         resultResp.totalUsage = {
           prompt_tokens: usageSnapshot?.prompt_tokens ?? 0,
           completion_tokens: usageSnapshot?.completion_tokens ?? 0,
-          total_tokens: usageSnapshot?.total_tokens ?? 0
+          total_tokens: usageSnapshot?.total_tokens ?? 0,
+          ...(usageSnapshot?.cached_tokens != null && { cached_tokens: usageSnapshot.cached_tokens }),
+          ...(usageSnapshot?.cache_write_tokens != null && { cache_write_tokens: usageSnapshot.cache_write_tokens })
         }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/main/presenter/llmProviderPresenter/providers/anthropicProvider.ts`
around lines 562 - 571, The non-streaming response populates
resultResp.totalUsage from buildAnthropicUsageSnapshot but only copies
prompt_tokens/completion_tokens/total_tokens, dropping cached_tokens and
cache_write_tokens; update the code that sets resultResp.totalUsage (the block
that assigns from usageSnapshot) to also include cached_tokens and
cache_write_tokens when they exist (e.g., set cached_tokens:
usageSnapshot?.cached_tokens ?? 0 and cache_write_tokens:
usageSnapshot?.cache_write_tokens ?? 0), or if omission was intentional add a
short clarifying comment above this assignment explaining why cache metrics are
excluded; refer to buildAnthropicUsageSnapshot and resultResp.totalUsage to
locate the change.
src/main/presenter/llmProviderPresenter/providers/zenmuxProvider.ts (1)

54-60: Unsafe type cast to access private anthropic field.

The pattern as unknown as { anthropic?: Anthropic } bypasses TypeScript's protection of private fields. If AnthropicProvider refactors the field name or visibility, this will silently break at runtime.

Consider one of these alternatives:

♻️ Option 1: Expose a protected setter in AnthropicProvider

In anthropicProvider.ts:

protected setAnthropicClient(client: Anthropic): void {
  this.anthropic = client
}

Then in ZenmuxAnthropicDelegate:

-    const self = this as unknown as { anthropic?: Anthropic }
-    self.anthropic = new Anthropic({
+    this.setAnthropicClient(new Anthropic({
       apiKey,
       baseURL: this.provider.baseUrl || ZENMUX_ANTHROPIC_BASE_URL,
       defaultHeaders: this.defaultHeaders,
       fetchOptions
-    })
+    }))
♻️ Option 2: Make the field protected in base class

Change private anthropic!: Anthropic to protected anthropic!: Anthropic in AnthropicProvider, allowing direct assignment in subclasses.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/main/presenter/llmProviderPresenter/providers/zenmuxProvider.ts` around
lines 54 - 60, The code uses an unsafe cast to set the private anthropic field
from ZenmuxAnthropicDelegate; instead, modify the AnthropicProvider base class
to either (a) add a protected setter method like setAnthropicClient(client:
Anthropic) and call that from ZenmuxAnthropicDelegate, or (b) change the field
declaration from private anthropic!: Anthropic to protected anthropic!:
Anthropic so the subclass can assign it directly; update ZenmuxAnthropicDelegate
to use the new setter or direct assignment of this.anthropic rather than the "as
unknown as" cast.
test/main/presenter/llmProviderPresenter/zenmuxProvider.test.ts (1)

167-333: Consider adding edge case tests for routing robustness.

The current tests cover the happy paths well. Consider adding tests for:

  1. Case variations in model ID: isAnthropicModel uses .toLowerCase(), but a test confirming ANTHROPIC/CLAUDE-SONNET routes correctly would document this behavior.
  2. Anthropic initialization failure: When apiKey is missing, ensureAnthropicDelegateReady should throw. A test verifying this error path would improve coverage.
💡 Example additional test cases
it('routes uppercase ANTHROPIC/* models through the Anthropic endpoint', async () => {
  const provider = new ZenmuxProvider(createProvider(), createConfigPresenter())
  await provider.generateText('hello', 'ANTHROPIC/CLAUDE-SONNET-4-5')
  expect(mockAnthropicMessagesCreate).toHaveBeenCalled()
})

it('throws when Anthropic API key is missing', async () => {
  const provider = new ZenmuxProvider(
    createProvider({ apiKey: '' }),
    createConfigPresenter()
  )
  await expect(provider.generateText('hello', 'anthropic/claude-sonnet-4-5'))
    .rejects.toThrow('Anthropic SDK not initialized')
})
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/main/presenter/llmProviderPresenter/zenmuxProvider.test.ts` around lines
167 - 333, Add two edge-case tests to zenmuxProvider.test.ts: one that calls
ZenmuxProvider.generateText with an uppercase model id like
'ANTHROPIC/CLAUDE-SONNET-4-5' to assert the request is routed through anthropic
(mockAnthropicMessagesCreate called) to validate the toLowerCase behavior of
isAnthropicModel, and another that constructs ZenmuxProvider with an empty
apiKey (createProvider({ apiKey: '' })) and asserts that generateText for an
anthropic model rejects/throws (e.g., expecting ensureAnthropicDelegateReady or
the anthropic delegate initialization to throw a clear "Anthropic SDK not
initialized" error) so the missing-API-key path is covered.
src/renderer/settings/components/ProviderApiConfig.vue (1)

211-220: Hardcoded provider allowlist may drift from backend DEFAULT_PROVIDERS.

This EDITABLE_BASE_URL_PROVIDER_IDS set is maintained separately from the canonical DEFAULT_PROVIDERS list in src/main/presenter/configPresenter/providers.ts. If new providers that should have editable base URLs are added to the backend, this frontend list will need manual synchronization.

Consider either:

  1. Adding an editableBaseUrl flag to the provider schema in the backend and passing it through to the frontend
  2. Documenting this coupling so future contributors know to update both locations
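Option 1 could look roughly like this hypothetical sketch; the schema shape, flag name, and provider entries are assumptions, not the current providers.ts:

```typescript
// Hypothetical sketch: a flag on the provider schema replaces the hardcoded
// EDITABLE_BASE_URL_PROVIDER_IDS set in the renderer.
interface ProviderDefinition {
  id: string
  baseUrl: string
  editableBaseUrl?: boolean
}

const DEFAULT_PROVIDERS: ProviderDefinition[] = [
  { id: 'openai', baseUrl: 'https://api.openai.com/v1', editableBaseUrl: true },
  { id: 'anthropic', baseUrl: 'https://api.anthropic.com' }
]

// The renderer derives editability from the data instead of a parallel list,
// so new providers cannot drift out of sync with the frontend.
function isBaseUrlEditable(provider: ProviderDefinition): boolean {
  return provider.editableBaseUrl === true
}
```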
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/renderer/settings/components/ProviderApiConfig.vue` around lines 211 -
220, The frontend uses a hardcoded EDITABLE_BASE_URL_PROVIDER_IDS set that can
drift from the backend's canonical DEFAULT_PROVIDERS; add an editableBaseUrl
boolean to the provider schema on the backend (attached to DEFAULT_PROVIDERS),
ensure the presenter passes that flag through to the renderer, and change the
frontend to compute editable providers from the incoming providers list (e.g.,
provider.editableBaseUrl) instead of the hardcoded
EDITABLE_BASE_URL_PROVIDER_IDS; alternatively, add a clear comment/documentation
near EDITABLE_BASE_URL_PROVIDER_IDS and DEFAULT_PROVIDERS explaining the
coupling and required sync steps for future contributors.
src/renderer/src/i18n/en-US/settings.json (1)

613-618: Unused i18n keys: baseUrlUnlock dialog strings are never referenced in the component.

The baseUrlUnlock.title, baseUrlUnlock.description, and baseUrlUnlock.confirm keys in the i18n file are defined but not used by ProviderApiConfig.vue. The requestBaseUrlUnlock function simply sets baseUrlUnlocked.value = true without displaying any confirmation dialog. The component directly shows an editable input field instead.

Either remove these unused i18n keys or implement the confirmation dialog to match the intended UI structure.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/renderer/src/i18n/en-US/settings.json` around lines 613 - 618, The i18n
keys under baseUrlUnlock (baseUrlUnlock.title, baseUrlUnlock.description,
baseUrlUnlock.confirm) are unused by ProviderApiConfig.vue; either delete these
keys from src/renderer/src/i18n/en-US/settings.json or implement the
confirmation dialog flow in ProviderApiConfig.vue: update requestBaseUrlUnlock
to open a modal/confirmation component that uses
$t('settings.baseUrlUnlock.title'), $t('settings.baseUrlUnlock.description') and
$t('settings.baseUrlUnlock.confirm') and only set baseUrlUnlocked.value = true
when the user confirms; ensure the modal component and its confirm handler are
wired into the existing component state and translation namespace.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/main/presenter/llmProviderPresenter/providers/zenmuxProvider.ts`:
- Around line 209-225: The current routing in getEmbeddings and getDimensions
sends anthropic/* models to the Anthropic delegate, but Anthropic doesn't
support embeddings and the inherited BaseLLMProvider methods will throw at
runtime; update both getEmbeddings(modelId, texts) and getDimensions(modelId) to
detect isAnthropicModel(modelId) and immediately throw a clear, specific error
like "Embeddings not supported for Anthropic models: <modelId>" (do not call
ensureAnthropicDelegateReady or delegate methods for Anthropic models), leaving
openaiDelegate usage unchanged for non-Anthropic models so callers fail fast
with a helpful message.
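A fail-fast guard of the kind this comment asks for might look like the following hypothetical sketch (the guard function is invented for illustration; only the error message and isAnthropicModel behavior follow the comment):

```typescript
// Hypothetical sketch of the fail-fast guard the comment describes.
const isAnthropicModel = (modelId: string): boolean =>
  modelId.toLowerCase().startsWith('anthropic/')

function assertEmbeddingsSupported(modelId: string): void {
  if (isAnthropicModel(modelId)) {
    // Anthropic has no embeddings API; fail fast with a specific message
    // instead of letting an inherited base method throw something opaque later.
    throw new Error(`Embeddings not supported for Anthropic models: ${modelId}`)
  }
}
```

getEmbeddings and getDimensions would call this guard first and otherwise fall through to the OpenAI delegate unchanged.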


ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: dd029a65-0cad-4b89-a03c-ce6ee5516282

📥 Commits

Reviewing files that changed from the base of the PR and between 688197d and 6b1e1a2.

📒 Files selected for processing (22)
  • src/main/presenter/llmProviderPresenter/promptCacheCapabilities.ts
  • src/main/presenter/llmProviderPresenter/providers/anthropicProvider.ts
  • src/main/presenter/llmProviderPresenter/providers/zenmuxProvider.ts
  • src/main/presenter/newAgentPresenter/index.ts
  • src/renderer/settings/components/ProviderApiConfig.vue
  • src/renderer/src/i18n/da-DK/settings.json
  • src/renderer/src/i18n/en-US/settings.json
  • src/renderer/src/i18n/fa-IR/settings.json
  • src/renderer/src/i18n/fr-FR/settings.json
  • src/renderer/src/i18n/he-IL/settings.json
  • src/renderer/src/i18n/ja-JP/settings.json
  • src/renderer/src/i18n/ko-KR/settings.json
  • src/renderer/src/i18n/pt-BR/settings.json
  • src/renderer/src/i18n/ru-RU/settings.json
  • src/renderer/src/i18n/zh-CN/settings.json
  • src/renderer/src/i18n/zh-HK/settings.json
  • src/renderer/src/i18n/zh-TW/settings.json
  • src/types/i18n.d.ts
  • test/main/presenter/llmProviderPresenter/promptCacheStrategy.test.ts
  • test/main/presenter/llmProviderPresenter/zenmuxProvider.test.ts
  • test/main/presenter/newAgentPresenter/usageDashboard.test.ts
  • test/renderer/components/ProviderApiConfig.test.ts
✅ Files skipped from review due to trivial changes (8)
  • src/renderer/src/i18n/fr-FR/settings.json
  • src/renderer/src/i18n/zh-HK/settings.json
  • src/renderer/src/i18n/zh-TW/settings.json
  • src/renderer/src/i18n/ja-JP/settings.json
  • src/renderer/src/i18n/ko-KR/settings.json
  • src/renderer/src/i18n/pt-BR/settings.json
  • src/renderer/src/i18n/fa-IR/settings.json
  • test/main/presenter/llmProviderPresenter/promptCacheStrategy.test.ts
🚧 Files skipped from review as they are similar to previous changes (1)
  • src/main/presenter/llmProviderPresenter/promptCacheCapabilities.ts

Contributor

@coderabbitai coderabbitai bot left a comment


🧹 Nitpick comments (3)
src/main/presenter/llmProviderPresenter/providers/zenmuxProvider.ts (2)

54-60: Type cast bypasses encapsulation of the parent class's anthropic field.

The as unknown as { anthropic?: Anthropic } pattern is a maintenance risk. If AnthropicProvider renames or restructures the field, this will silently break at runtime.

Consider exposing a protected setter in AnthropicProvider or using a different initialization pattern that doesn't require reaching into private state.

♻️ Alternative: add a protected method in AnthropicProvider

In AnthropicProvider, add:

protected setAnthropicClient(client: Anthropic): void {
  this.anthropic = client
}

Then in ZenmuxAnthropicDelegate:

-    const self = this as unknown as { anthropic?: Anthropic }
-    self.anthropic = new Anthropic({
+    this.setAnthropicClient(new Anthropic({
       apiKey,
       baseURL: this.provider.baseUrl || ZENMUX_ANTHROPIC_BASE_URL,
       defaultHeaders: this.defaultHeaders,
       fetchOptions
-    })
+    }))
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/main/presenter/llmProviderPresenter/providers/zenmuxProvider.ts` around
lines 54 - 60, Replace the unsafe cast that writes into the parent class's
private field by adding a protected setter on AnthropicProvider (e.g., protected
setAnthropicClient(client: Anthropic): void { this.anthropic = client }) and
then call that setter from ZenmuxAnthropicDelegate instead of using "this as
unknown as { anthropic?: Anthropic }"; update the code that constructs the
Anthropic instance (the new Anthropic({...}) call) to pass the instance into
setAnthropicClient so the child no longer mutates private state via type
casting.
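
The suggestion above can be sketched end to end. This is a minimal, self-contained illustration of the protected-setter pattern — class and field names here (`BaseProvider`, `FakeSdkClient`, `setClient`) are hypothetical stand-ins, not the actual DeepChat sources:

```typescript
// Illustrative sketch: the subclass configures the parent's client
// through a protected method instead of an `as unknown as` cast.
class FakeSdkClient {
  constructor(public readonly baseURL: string) {}
}

class BaseProvider {
  private client?: FakeSdkClient

  // Narrow, intentional seam for subclasses; the field stays private.
  protected setClient(client: FakeSdkClient): void {
    this.client = client
  }

  describeClient(): string {
    return this.client ? `client @ ${this.client.baseURL}` : 'uninitialized'
  }
}

class DelegateProvider extends BaseProvider {
  initialize(baseURL: string): void {
    // No `this as unknown as { client?: ... }` cast needed here.
    this.setClient(new FakeSdkClient(baseURL))
  }
}

const provider = new DelegateProvider()
provider.initialize('https://zenmux.example/anthropic')
console.log(provider.describeClient())
```

If `AnthropicProvider` later renames or restructures its client field, only the setter's body changes; the compiler then checks every caller, which is exactly the guarantee the `as unknown as` cast gives up.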

104-112: Consider a more descriptive error message when API key is missing.

When ensureClientInitialized() fails due to a missing API key, the error "Anthropic SDK not initialized" doesn't indicate the root cause. Users may not realize they need to configure an API key.

♻️ Proposed improvement
   private async ensureAnthropicDelegateReady(): Promise<ZenmuxAnthropicDelegate> {
     await this.anthropicDelegate.ensureClientInitialized()

     if (!this.anthropicDelegate.isClientInitialized()) {
-      throw new Error('Anthropic SDK not initialized')
+      throw new Error('Anthropic SDK not initialized: API key may be missing')
     }

     return this.anthropicDelegate
   }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/main/presenter/llmProviderPresenter/providers/zenmuxProvider.ts` around
lines 104 - 112, The current ensureAnthropicDelegateReady method throws a
generic "Anthropic SDK not initialized" error; update it to detect and surface
missing API key details by checking
anthropicDelegate.ensureClientInitialized()/isClientInitialized() results and,
when initialization failed due to a missing key, throw a more descriptive error
(e.g., mention missing Anthropic API key and which env var/config to set).
Modify ensureAnthropicDelegateReady to inspect anthropicDelegate for an explicit
missing-key condition (or catch the error from ensureClientInitialized()), and
include that detail in the thrown Error so callers of
ensureAnthropicDelegateReady get actionable guidance.
test/main/presenter/llmProviderPresenter/zenmuxProvider.test.ts (1)

210-221: Test relies heavily on internal implementation details.

The test directly manipulates private delegate state (clientInitialized, isInitialized, anthropic) using as any casts. This creates tight coupling between the test and internal implementation, making refactoring harder.

Consider adding a test-only initialization method or testing through the public API by ensuring proper mock setup that allows natural initialization flow.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/main/presenter/llmProviderPresenter/zenmuxProvider.test.ts` around lines
210 - 221, The test mutates private delegate state
(anthropicDelegate.clientInitialized/isInitialized/anthropic) which couples
tests to internals; instead add a test-only initialization helper on the
provider (e.g., initializeTestAnthropicClient) that sets up the delegate’s
anthropic client with mockAnthropicMessagesCreate and mockAnthropicModelsList
and marks it initialized, or drive setup through the public initialization path
by spying/mock-resolving ensureClientInitialized to return a client configured
with those mocks; update the test to call that helper or the public init so it
no longer uses (provider as any).anthropicDelegate mutation and still verifies
behavior via the public API.
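
The test-only seam proposed above can be sketched as follows. All names (`AnthropicDelegate`, `injectClientForTest`, `MessagesClient`) are hypothetical illustrations of the idea, not the real DeepChat or Anthropic SDK API — the point is that the test injects a mock through a dedicated method rather than mutating private fields via `as any`:

```typescript
// Minimal sketch of a delegate with an explicit test seam.
interface MessagesClient {
  create(params: { model: string }): Promise<{ id: string }>
}

class AnthropicDelegate {
  private client?: MessagesClient
  private initialized = false

  // Production code would construct a real SDK client here.
  async ensureClientInitialized(): Promise<void> {
    if (!this.client) throw new Error('no client configured')
    this.initialized = true
  }

  isClientInitialized(): boolean {
    return this.initialized
  }

  // Test-only seam: inject a mock instead of poking private state.
  injectClientForTest(client: MessagesClient): void {
    this.client = client
  }

  async send(model: string): Promise<{ id: string }> {
    if (!this.initialized || !this.client) throw new Error('not initialized')
    return this.client.create({ model })
  }
}

async function demo(): Promise<string> {
  const delegate = new AnthropicDelegate()
  delegate.injectClientForTest({
    create: async ({ model }) => ({ id: `mock-${model}` })
  })
  await delegate.ensureClientInitialized()
  const res = await delegate.send('claude-sonnet')
  return res.id
}

demo().then((id) => console.log(id)) // prints "mock-claude-sonnet"
```

The test now exercises the same public initialization path as production code, so renaming the private `clientInitialized` or `anthropic` fields no longer breaks the suite.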

ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 75c72e8e-4826-4081-80c0-515cbb1a2294

📥 Commits

Reviewing files that changed from the base of the PR and between 6b1e1a2 and 6071c39.

📒 Files selected for processing (2)
  • src/main/presenter/llmProviderPresenter/providers/zenmuxProvider.ts
  • test/main/presenter/llmProviderPresenter/zenmuxProvider.test.ts

@zerob13 zerob13 merged commit 850336f into dev Apr 2, 2026
3 checks passed