feat: share multimodal hash helpers by CUHKSZzxy · Pull Request #4704 · InternLM/lmdeploy

CUHKSZzxy · 2026-06-24T12:53:44Z

Summary

move multimodal content hashing into a shared VL helper
update PyTorch prefix-cache paths to use the shared helper while preserving existing cache-key behavior
populate dict-style multimodal content hashes before TurboMind conversion when prefix caching is enabled

Validation

Focused VL hasher unit tests passed
PyTorch block-trie prefix-cache unit tests passed
Real PyTorch VL server repeated-image check showed cache reuse on a cacheable repeated multimodal prompt

Assistance

Assisted with Codex + GPT-5.5 xHigh Fast, reviewed manually

Copilot

Pull request overview

This PR centralizes multimodal content hashing into a shared lmdeploy.vl helper module, then updates PyTorch prefix-cache code paths (and tests) to use the shared implementation while keeping existing cache-key behavior stable.

Changes:

Added lmdeploy/vl/hasher.py with deterministic hashing helpers for both dataclass-style and dict-style multimodal payloads.
Rewired PyTorch prefix-cache hashing call sites to use the shared VL hasher (including unit test monkeypatch targets).
Added focused unit tests covering hash stability, sensitivity to content/meta/mRoPE, and ignoring position-only keys for dict-style items.

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
`lmdeploy/vl/hasher.py`	Introduces shared deterministic multimodal hashing + “ensure content_hash” helpers for two multimodal representations.
`lmdeploy/pytorch/multimodal/data_type.py`	Removes local hashing implementation and re-exports shared hashing helpers for compatibility.
`lmdeploy/pytorch/messages.py`	Updates prefix-cache meta hashing fallback to call the shared VL hasher.
`lmdeploy/pytorch/engine/engine.py`	Ensures multimodal content hashes are populated after preprocessing when prefix caching is enabled.
`tests/test_lmdeploy/test_vl/test_hasher.py`	Adds unit tests validating hash determinism and correct inclusion/exclusion rules.
`tests/pytorch/paging/test_block_trie.py`	Adjusts monkeypatching to target the shared hasher module instead of the previous PyTorch-local symbol.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

grimoire · 2026-06-30T03:35:58Z

    DistServeInitRequest,
 )
 from lmdeploy.utils import get_logger, get_model
+from lmdeploy.vl import hasher as mm_hasher


Since we have alias make_multimodal_content_hash in pytorch/multimodal/data_type.py, the change in engine.py and messages.py might not be necessary.

lvhan028 · 2026-07-01T08:35:20Z

It does not match what I had in mind for multimodal fingerprints.

What I expect

Compute a stable content fingerprint early in the request path (before engine-specific preprocessing).
Attach it to each multimodal item and pass it through to all backends
Treat the fingerprint as general multimodal metadata, not something that only exists when enable_prefix_caching is on.

requirement mismatched

CUHKSZzxy added 2 commits June 24, 2026 20:53

feat: share multimodal hash helpers

5c65ea2

fix: skip turbomind multimodal hash hook

1632c23

CUHKSZzxy marked this pull request as ready for review June 25, 2026 09:28

Copilot AI review requested due to automatic review settings June 25, 2026 09:28

Copilot started reviewing on behalf of CUHKSZzxy June 25, 2026 09:28 View session

style: format multimodal hasher docstrings

12119c1

Copilot AI reviewed Jun 25, 2026

View reviewed changes

lvhan028 requested review from grimoire, lvhan028 and lzhangzz June 30, 2026 03:21

lvhan028 added the enhancement New feature or request label Jun 30, 2026

grimoire reviewed Jun 30, 2026

View reviewed changes

refactor: use pytorch multimodal hash aliases

4831063

lzhangzz previously approved these changes Jun 30, 2026

View reviewed changes

grimoire previously approved these changes Jun 30, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: share multimodal hash helpers#4704

feat: share multimodal hash helpers#4704
CUHKSZzxy wants to merge 4 commits into
InternLM:mainfrom
CUHKSZzxy:feat/share-vl-mm-hasher

CUHKSZzxy commented Jun 24, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

grimoire Jun 30, 2026

Uh oh!

lvhan028 commented Jul 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

CUHKSZzxy commented Jun 24, 2026

Summary

Validation

Assistance

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

grimoire Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

lvhan028 commented Jul 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants