refactor teacher server by hjh0119 · Pull Request #9457 · modelscope/ms-swift

hjh0119 · 2026-05-31T18:31:18Z

No description provided.

gemini-code-assist

Code Review

This pull request refactors the Generalized Knowledge Distillation (GKD) trainer to fetch teacher logprobs using a new inference-only client (VLLMInferClient) and a dedicated /infer/ endpoint, while also lazy-loading transformers imports across utility files to optimize startup time. The code review identified several critical issues in the new implementation: self.teacher_client is only initialized on the main process, which will cause crashes on non-zero ranks during evaluation; incorrect tuple unpacking and an undefined variable encoded_chunkbatch in _assemble_topk_for_chunk will raise ValueError and NameError; parse_prompt_logprobs lacks safety checks for None values and padding, leading to potential AttributeError and ValueError crashes; and a potential KeyError exists when popping _teacher_raw from chunks.

hjh0119 · 2026-06-01T11:29:40Z

/gemini review

gemini-code-assist

Code Review

This pull request refactors the GKD (Generalized Knowledge Distillation) pipeline to fetch teacher logprobs more efficiently using a new /infer/ endpoint and VLLMInferClient, aligning both standard and Megatron-based trainers. The review feedback highlights critical issues, including a rank mismatch in the Megatron trainer where the teacher client is initialized on the last rank but called on rank 0, missing safety checks and padding in parse_prompt_logprobs that could cause shape mismatches or AttributeErrors, and potential KeyErrors when popping keys from batch dictionaries.

refactor teacher server

06f4936

gemini-code-assist Bot reviewed May 31, 2026

View reviewed changes

Comment thread swift/rlhf_trainers/gkd_trainer.py

Comment thread swift/rlhf_trainers/gkd_trainer.py Outdated

Comment thread swift/rlhf_trainers/utils.py

Comment thread swift/rlhf_trainers/gkd_trainer.py

hjh0119 added 4 commits June 1, 2026 16:14

fix

0826a30

fix deploy

d090c3e

fix deploy

42fe54d

fix default dict

67bcca8

Jintao-Huang approved these changes Jun 1, 2026

View reviewed changes

Comment thread swift/pipelines/infer/deploy.py

Comment thread swift/pipelines/infer/infer.py Outdated

Comment thread swift/utils/env.py Outdated

Comment thread swift/infer_engine/vllm_engine.py Outdated

fix

f041ea6

gemini-code-assist Bot reviewed Jun 1, 2026

View reviewed changes

Comment thread swift/rlhf_trainers/utils.py

Comment thread swift/megatron/trainers/gkd_trainer.py

Comment thread swift/megatron/trainers/gkd_trainer.py

Comment thread swift/rlhf_trainers/gkd_trainer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor teacher server#9457

refactor teacher server#9457
hjh0119 wants to merge 6 commits into
modelscope:mainfrom
hjh0119:refactor-teacher-server

hjh0119 commented May 31, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hjh0119 commented Jun 1, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

hjh0119 commented May 31, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hjh0119 commented Jun 1, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants