feat(tui): show prompt cache hit breakdown in /usage command by shuizhongyueming · Pull Request #231 · MoonshotAI/kimi-code

shuizhongyueming · 2026-05-29T16:25:38Z

Related Issue

Resolve #230

Problem

The /usage command only shows total input/output token counts. Users cannot verify prompt cache effectiveness without a breakdown of cache hits vs freshly computed tokens.

What changed

Added a cache breakdown subline below each model line in /usage:

20-segment progress bar visualizing the cache hit ratio
Percentage with one decimal when not whole
Absolute numbers: cache read tokens and non-cached (other) tokens
Model names padded to max width for clean multi-model alignment
Cache line always shown (including 0%)

Screenshot

Checklist

I have read the CONTRIBUTING document.
I have linked a related issue (feat: show prompt cache hit breakdown in /usage command #230).
I have added tests that prove my feature works.
Ran gen-changesets skill — added changeset.
Ran gen-docs skill — not needed (existing /usage docs cover the command; the breakdown is self-explanatory in-UI).

changeset-bot · 2026-05-29T16:25:43Z

🦋 Changeset detected

Latest commit: aaada08

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package

Name	Type
@moonshot-ai/kimi-code	Minor

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

shuizhongyueming · 2026-05-29T16:25:56Z

Screenshot: See the cache breakdown displayed in /usage command.

pkg-pr-new · 2026-05-29T16:26:39Z

pnpm dlx https://pkg.pr.new/@moonshot-ai/kimi-code@aaada08

npx https://pkg.pr.new/@moonshot-ai/kimi-code@aaada08

commit: aaada08

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds prompt-cache visibility to the /usage TUI report by rendering a per-model cache hit ratio bar plus read/other token breakdown, and updates tests + versioning metadata accordingly.

Changes:

Render per-model “cache hit” sublines (progress bar + read/other counts) under each model usage line.
Align model column widths across multi-model sessions (including the “total” row).
Add/adjust tests and publish a minor changeset for the new /usage output.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

File	Description
apps/kimi-code/src/tui/components/messages/usage-panel.ts	Adds aligned model rows and a cache hit ratio subline per model in the session usage section.
apps/kimi-code/test/tui/components/messages/usage-panel.test.ts	Adds test coverage for new cache sublines (single-model, zero-read, multi-model).
.changeset/usage-cache-breakdown.md	Declares a minor release for exposing cache hit stats in `/usage`.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+  // Compute max model name width for alignment (include "total" for multi-model)
+  const maxModelWidth = Math.max(
+    ...entries.map(([model]) => model.length),
+    entries.length > 1 ? 'total'.length : 0,
+  );


+    const cacheRatio = input > 0 ? usageNumber(row.inputCacheRead) / input : 0;
+    const bar = renderProgressBar(cacheRatio, 20);
+    const pct = `${(cacheRatio * 100).toFixed(1).replace(/\.0$/, '')}%`;
+    lines.push(
+      `${cacheIndent}${muted('cache')} ${bar} ${value(pct)} ${muted('hit')} ` +
+        `(${value(formatTokenCount(usageNumber(row.inputCacheRead)))} ${muted('read')} ` +
+        `· ${value(formatTokenCount(usageNumber(row.inputOther)))} ${muted('other')})`,
+    );


+    // Cache breakdown subline
+    const cacheIndent = ' '.repeat(maxModelWidth + 4); // "  model  " → 2 + maxModelWidth + 2
+    const cacheRatio = input > 0 ? usageNumber(row.inputCacheRead) / input : 0;
+    const bar = renderProgressBar(cacheRatio, 20);


chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 311f4deb66

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-05-29T16:27:53Z

+        `(${value(formatTokenCount(usageNumber(row.inputCacheRead)))} ${muted('read')} ` +
+        `· ${value(formatTokenCount(usageNumber(row.inputOther)))} ${muted('other')})`,


Include cache-creation tokens in the cache breakdown

When a provider reports inputCacheCreation, those input tokens are included in the model's input total and denominator for the hit percentage, but this new subline only prints inputCacheRead and inputOther. In sessions that create prompt-cache entries (for example inputCacheCreation > 0 and little/no inputOther), /usage can show thousands of input tokens while the breakdown says 0 read · 0 other, hiding the cache-write/miss portion users need to understand cache effectiveness. Include cache-creation as its own field or fold it into the non-hit count.

Useful? React with 👍 / 👎.

shuizhongyueming added 4 commits May 29, 2026 18:52

feat(tui): show prompt cache hit breakdown in /usage command

229fc74

fix(tui): show one decimal in /usage cache hit ratio when not whole

4685c1e

fix(tui): align /usage model names to max width for clean layout

bbb4622

chore: add changeset for /usage cache breakdown

311f4de

Copilot AI review requested due to automatic review settings May 29, 2026 16:25

Copilot AI reviewed May 29, 2026

View reviewed changes

chatgpt-codex-connector Bot reviewed May 29, 2026

View reviewed changes

fix(tui): guard empty entries and extract progress bar width constant

aaada08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(tui): show prompt cache hit breakdown in /usage command#231

feat(tui): show prompt cache hit breakdown in /usage command#231
shuizhongyueming wants to merge 5 commits into
MoonshotAI:mainfrom
shuizhongyueming:feat/usage-cache-breakdown

shuizhongyueming commented May 29, 2026 •

edited

Loading

Uh oh!

changeset-bot Bot commented May 29, 2026 •

edited

Loading

Uh oh!

shuizhongyueming commented May 29, 2026

Uh oh!

pkg-pr-new Bot commented May 29, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		`(${value(formatTokenCount(usageNumber(row.inputCacheRead)))} ${muted('read')} ` +
		`· ${value(formatTokenCount(usageNumber(row.inputOther)))} ${muted('other')})`,

Conversation

shuizhongyueming commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Related Issue

Problem

What changed

Screenshot

Checklist

Uh oh!

changeset-bot Bot commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

shuizhongyueming commented May 29, 2026

Uh oh!

pkg-pr-new Bot commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 29, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

shuizhongyueming commented May 29, 2026 •

edited

Loading

changeset-bot Bot commented May 29, 2026 •

edited

Loading

pkg-pr-new Bot commented May 29, 2026 •

edited

Loading