support kimi 2.5/6 full param and lora #1057
nanjiangwill wants to merge 1 commit into radixark:main from
Conversation
Code Review
This pull request introduces support for Kimi VL and Kimi K2.5 models, including specialized weight conversion logic and multimodal token expansion for training. It significantly enhances the LoRA weight synchronization mechanism by implementing IPC staging buffers for faster transfers and a chunked streaming approach to handle large adapters with SGLang. Additionally, it adds support for shared-outer grouped-expert LoRA and updates various utility functions and quantization scripts to accommodate vision-tower and projector components. Feedback was provided to improve the robustness of rollout batch processing by using strict zipping.
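To make the chunked-streaming idea concrete, here is a minimal, generic sketch of pushing a large adapter through a fixed-size staging buffer one piece at a time. This is an illustration only, not the PR's actual code: `stream_in_chunks`, the `(name, offset, chunk)` triple format, and the chunk size are all hypothetical.

```python
def stream_in_chunks(named_buffers, chunk_bytes=4 << 20):
    """Yield (name, offset, chunk) triples so a large LoRA adapter can be
    sent through a fixed-size staging buffer instead of all at once.

    named_buffers: iterable of (name, bytes-like) pairs.
    chunk_bytes: staging-buffer capacity (4 MiB here, an arbitrary choice).
    """
    for name, buf in named_buffers:
        view = memoryview(buf)
        for offset in range(0, len(view), chunk_bytes):
            yield name, offset, bytes(view[offset:offset + chunk_bytes])


# Receiver side: reassemble each weight's bytes from the streamed chunks.
weights = {"lora_A": b"\x01" * 10}
received = {name: bytearray(len(buf)) for name, buf in weights.items()}
for name, offset, chunk in stream_in_chunks(weights.items(), chunk_bytes=4):
    received[name][offset:offset + len(chunk)] = chunk
assert bytes(received["lora_A"]) == weights["lora_A"]
```

The fixed-size staging buffer bounds peak memory on the receiver regardless of adapter size, which is why a chunked protocol scales to large adapters where a single monolithic transfer would not.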
```python
expanded_total_lengths = []
expanded_response_lengths = []

for i, (token_tensor, loss_mask_tensor) in enumerate(zip(tokens, loss_masks, strict=False)):
```
Using `strict=True` in `zip` is safer here: it guarantees that the number of token tensors and loss-mask tensors match exactly, which is a precondition for a valid rollout batch, and raises immediately instead of silently truncating on a mismatch.
```diff
-    for i, (token_tensor, loss_mask_tensor) in enumerate(zip(tokens, loss_masks, strict=False)):
+    for i, (token_tensor, loss_mask_tensor) in enumerate(zip(tokens, loss_masks, strict=True)):
```
megatron-bridge patch from commit 3fd3768045422d0aa5c97e90a4e6c659aea9acb9
sglang patch from commit bb9223d7c51fa66092b3bcae566ef4ecff309dc6