Normalize OpenPangu RoPE scaling defaults #28361
Conversation
Code Review
This pull request aims to normalize the RoPE scaling defaults for OpenPangu models by reading from config.rope_scaling and backfilling any missing YaRN parameters. My main concern is that the implementation seems to be in OpenPanguMLAAttention, while the verification logs and model configuration for FreedomIntelligence/openPangu-Embedded-7B indicate that OpenPanguEmbeddedAttention is being used. This suggests the fix might be in the wrong place and would not be applied as intended. I've left a critical comment detailing this issue. Please verify the correct location for this logic.
/gemini review
Code Review
This pull request introduces a helper function _normalize_rope_scaling_defaults to centralize the logic for handling RoPE scaling parameters for OpenPangu models. This change effectively removes hardcoded default values from OpenPanguMLAAttention and ensures that both OpenPanguMLAAttention and OpenPanguEmbeddedAttention consistently apply these defaults. The implementation is clean, robust, and improves the model's configurability by allowing rope_scaling parameters to be specified in the model's configuration file. The changes are well-verified and represent a good improvement in code maintainability and consistency.
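For readers following along, a minimal sketch of what such a normalization helper could look like, assuming a plain dict-based config; the helper name and the idea of backfilling missing YaRN parameters come from the review above, but the concrete default values and extra key names below are placeholders, not taken from the PR:

```python
# Hypothetical sketch of the normalization described above.
# The default values are illustrative placeholders, not the PR's actual defaults.
_YARN_DEFAULTS = {
    "factor": 1.0,
    "original_max_position_embeddings": 4096,
    "beta_fast": 32,
    "beta_slow": 1,
}


def _normalize_rope_scaling_defaults(rope_scaling: dict | None) -> dict:
    """Copy config.rope_scaling and backfill any missing YaRN parameters."""
    normalized = dict(rope_scaling or {"rope_type": "yarn"})
    if normalized.get("rope_type", normalized.get("type")) == "yarn":
        for key, value in _YARN_DEFAULTS.items():
            normalized.setdefault(key, value)
    return normalized
```

Both attention classes would then call this once on config.rope_scaling before building their rotary embeddings, instead of each carrying its own hardcoded constants.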
💡 Codex Review
Here are some automated review suggestions for this pull request.
Signed-off-by: zitian.zhao <zitian.zhao@tencentmusic.com>
/gemini review
@codex review
Codex Review: Didn't find any major issues. Can't wait for the next one!
Code Review
This pull request refactors the RoPE scaling configuration for OpenPangu models by introducing a normalization function, _normalize_rope_scaling_defaults. This change removes hardcoded values and allows for using RoPE scaling parameters from the model's configuration, with sensible defaults. The implementation is applied to both OpenPanguMLAAttention and OpenPanguEmbeddedAttention.
My review identifies a potential logic issue in the new normalization function that could lead to inconsistent values for type and rope_type in the resulting configuration dictionary. I've provided a suggestion to refactor the logic for better clarity and correctness, ensuring that the configuration remains consistent regardless of which keys are provided by the user.
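To illustrate the kind of consistency the reviewer is asking for, here is a small sketch that resolves whichever key is present and writes the result back to both; the function name is hypothetical and the exact key handling in the PR may differ:

```python
def _resolve_rope_type(rope_scaling: dict, default: str = "yarn") -> dict:
    # Prefer "rope_type", fall back to the legacy "type" key, then the default,
    # and write the resolved value back to both keys so they never disagree.
    resolved = rope_scaling.get("rope_type") or rope_scaling.get("type") or default
    rope_scaling["rope_type"] = resolved
    rope_scaling["type"] = resolved
    return rope_scaling
```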
Summary
Verification
- CUDA_VISIBLE_DEVICES=0 vllm serve FreedomIntelligence/openPangu-Embedded-7B --port 8818 --trust_remote_code --served-model-name openPangu-Embedded-7B – logs show every OpenPanguEmbeddedAttention layer printing the resolved rope_scaling_config, confirming it was read from the config (see screenshot).
- curl http://localhost:8818/v1/chat/completions … – inference succeeds with the expected response, proving the model loads and serves normally (see screenshot).
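A roughly equivalent check can be scripted in Python against the OpenAI-compatible endpoint; the port and served model name are taken from the serve command above, while the prompt and token limit are arbitrary:

```python
# Illustrative equivalent of the curl verification above.
import requests

resp = requests.post(
    "http://localhost:8818/v1/chat/completions",
    json={
        "model": "openPangu-Embedded-7B",
        "messages": [{"role": "user", "content": "Hello, who are you?"}],
        "max_tokens": 64,
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```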