[Hotfix] final fixes for P2P Transfer #22663

Open
JD-ETH wants to merge 5 commits into sgl-project:sglang-miles from JD-ETH:patch-fix-dpsk-5-10-v4

Conversation

@JD-ETH (Contributor) commented Apr 13, 2026

  1. Cherry-pick #22486 — fix: deprecated interfaces after dump
    - Fix Qwen3 rope_parameters → use get_rope_config() helper instead of accessing config.rope_parameters dict directly
    - (model_runner.py conflict resolved — redundant import removed)
  2. fix(weight_checker): skip _weight_fp32 in weight equality check
    - _reset_tensors(): skip _weight_fp32 buffers (don't randomize them)
    - _postprocess_tensors(): add _weight_fp32 to non_persistent_buffer_patterns (don't fail on mismatch)
    - Reason: Glm4MoeGate._weight_fp32 is a FP32 cache of the bf16 gate weight. Runtime invalidation after P2P weight update is not supported yet. Same skip pattern as cos_sin_cache / inv_freq.
  3. chore: remove redundant local import of get_local_ip_auto
  4. Cherry-pick #22552 — [sglang-miles] fix fused qkv load weight from hf, which fixes a special shard-loading implementation in sglang
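The rope fix in item 1 follows a common pattern: read rope settings through a helper with fallbacks rather than indexing the config dict directly. A minimal standalone sketch — the helper name matches the description above, but the config fields and fallback order here are illustrative, not sglang's actual API:

```python
# Illustrative sketch only: resolve rope settings via a helper with
# fallbacks, instead of reading config.rope_parameters directly.
def get_rope_config(config):
    # Prefer the new field, fall back to the deprecated one, else empty.
    for attr in ("rope_parameters", "rope_scaling"):
        value = getattr(config, attr, None)
        if value is not None:
            return value
    return {}

class OldStyleConfig:
    # Hypothetical config that only carries the deprecated field.
    rope_scaling = {"rope_type": "yarn", "factor": 4.0}

print(get_rope_config(OldStyleConfig())["rope_type"])  # yarn
```

Centralizing the lookup means models stop breaking when a config field is renamed or deprecated — only the helper has to know both names.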

Validated models (all ✅ with --check-weight-update-equal + p2p)

  • Qwen3-4B (Qwen3ForCausalLM, 1 node)
  • GLM-Z1-9B-0414 (Glm4ForCausalLM, 1 node)
  • Moonlight-16B-A3B (DeepseekV2ForCausalLM, 2 nodes)
  • GLM-4.7-9B-Flash (Glm4MoeLiteForCausalLM, 2 nodes)
  • GLM-5_4layer (DeepseekV3ForCausalLM, 2 nodes)
  • Qwen3-30B-A3B (Qwen3MoeForCausalLM, 4 nodes)
  • GLM-4.5-Air (Glm4MoeForCausalLM, 8 nodes)

JD-ETH and others added 5 commits April 11, 2026 06:32
The stacked_params_mapping routes q_a_proj and kv_a_proj_with_mqa to
ReplicatedLinear.weight_loader with a shard_id, but ReplicatedLinear
does not support shard_id. Skip the stacked path for this param_name
so weights fall through to the existing cached_a_proj path in
do_load_weights(), which correctly caches both halves and
concatenates them with torch.cat before loading.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
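The commit above can be sketched as a guard in the weight-routing loop: weight names whose target loader is a ReplicatedLinear (which takes no shard_id) skip the stacked mapping and fall through to the cached path. A self-contained mock, with mapping entries and function names chosen for illustration rather than copied from sglang:

```python
# Sketch: skip the stacked_params_mapping path for params whose loader
# (ReplicatedLinear) cannot accept a shard_id, so they fall through to
# the cached_a_proj path that concatenates both halves before loading.
STACKED_PARAMS_MAPPING = [
    # (param_name, weight_name, shard_id) -- illustrative entries
    ("qkv_proj", "q_proj", "q"),
    ("fused_qkv_a_proj_with_mqa", "q_a_proj", 0),
    ("fused_qkv_a_proj_with_mqa", "kv_a_proj_with_mqa", 1),
]
# Weights handled by ReplicatedLinear, which has no shard_id support.
REPLICATED_PARAMS = {"q_a_proj", "kv_a_proj_with_mqa"}

def route_weight(name: str) -> str:
    for param_name, weight_name, shard_id in STACKED_PARAMS_MAPPING:
        if weight_name in name and weight_name not in REPLICATED_PARAMS:
            return f"stacked:{param_name}:{shard_id}"
    # Fall-through: cache both halves, torch.cat them, load once.
    return "cached_a_proj"
```

Without the guard, q_a_proj and kv_a_proj_with_mqa would be handed to a weight_loader with a shard_id argument it does not accept.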
Change lazy import from `sglang.srt.utils` to `sglang.srt.utils.network`
to match the module where `get_local_ip_auto` is actually defined.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Glm4MoeGate._weight_fp32 is a FP32 cache of the bf16 gate weight.
Runtime invalidation of this cache after weight update is not yet
supported. Skip it in both _reset_tensors and _postprocess_tensors,
same pattern as cos_sin_cache and inv_freq.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
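Why the cache mismatches after a P2P update can be shown with a toy model (pure-Python stand-in; the real Glm4MoeGate holds torch tensors):

```python
# Toy illustration: a module caches an fp32 copy of its weight at init.
# A P2P weight update rewrites the weight but never the cache, so the
# cache goes stale and a naive equality check flags it.
class Gate:
    def __init__(self, weight):
        self.weight = weight              # stands in for the bf16 weight
        self._weight_fp32 = list(weight)  # fp32 cache, taken once at init

    def update_weight(self, new_weight):
        self.weight = new_weight          # cache is NOT invalidated

gate = Gate([0.5, 0.25])
gate.update_weight([0.75, 0.125])
print(gate.weight != gate._weight_fp32)  # True -> stale cache
```

Until runtime invalidation is supported, skipping the cache in both the reset and comparison phases is the same treatment already given to cos_sin_cache and inv_freq.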
Already imported at module level (line 183).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

     def _reset_tensors(self):
         for name, param in self._model_state():
-            if "cos_sin_cache" in name or "freqs_cis" in name:
+            if "cos_sin_cache" in name or "freqs_cis" in name or "_weight_fp32" in name:
Review comment (Contributor):
nit: maybe we could maintain a list where these keys could be skipped.
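The nit above can be sketched as a single maintained tuple of patterns shared by both checks, instead of a growing `or` chain (illustrative names only; the real logic lives in sglang's weight checker):

```python
# One maintained list of buffer-name patterns that the weight checker
# should neither randomize (_reset_tensors) nor compare
# (_postprocess_tensors). Extend the tuple instead of the `or` chain.
NON_PERSISTENT_BUFFER_PATTERNS = (
    "cos_sin_cache",
    "freqs_cis",
    "_weight_fp32",  # fp32 cache of the bf16 gate weight (Glm4MoeGate)
)

def is_skipped(name: str) -> bool:
    return any(pattern in name for pattern in NON_PERSISTENT_BUFFER_PATTERNS)

print(is_skipped("model.layers.0.mlp.gate._weight_fp32"))  # True
print(is_skipped("model.layers.0.mlp.gate.weight"))        # False
```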
