@EunjuYang
Contributor

Dependency of the PR

Commits to be reviewed in this PR

[CausalLM] qwen3_moe_cached is updated
  • This patch updates moe_cached so that it applies the top-k expert priority correctly.
  • This patch gives higher priority to the higher-ranked experts of the most recent token.
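
The two points above can be illustrated with a minimal sketch. It is not the code in this patch: `ExpertCache`, `update`, and `priority_order` are hypothetical names, and the policy shown is only one plausible reading of the description (experts of the most recent token outrank older ones, and within a token the higher-ranked expert outranks the lower-ranked one).

```python
from collections import OrderedDict


class ExpertCache:
    """Toy priority-ordered cache of expert ids (hypothetical, for illustration)."""

    def __init__(self, capacity):
        self.capacity = capacity
        # Ordered from lowest priority (front) to highest priority (back).
        self._slots = OrderedDict()

    def update(self, topk_experts):
        """Register one token's experts, listed from highest to lowest router rank.

        Returns the expert ids evicted to stay within capacity.
        """
        # Walk from the lowest-ranked expert to the highest-ranked one so the
        # top-1 expert is touched last and ends up with the highest priority.
        for expert_id in reversed(topk_experts):
            self._slots[expert_id] = True
            self._slots.move_to_end(expert_id)

        # Evict only after the whole token is registered, so an expert needed
        # by the current token is not dropped as long as capacity >= k.
        evicted = []
        while len(self._slots) > self.capacity:
            victim, _ = self._slots.popitem(last=False)
            evicted.append(victim)
        return evicted

    def priority_order(self):
        """Expert ids from highest to lowest priority."""
        return list(reversed(self._slots))


cache = ExpertCache(capacity=4)
cache.update([3, 7])            # token 1: expert 3 ranked above expert 7
cache.update([5, 3])            # token 2: expert 5 ranked above expert 3
evicted = cache.update([1, 2])  # token 3 pushes out the lowest-priority expert
print(cache.priority_order(), evicted)
```

In this sketch, eviction is deferred until the whole token has been registered, so an expert that the current token reuses is promoted rather than evicted and reloaded.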

Self evaluation:

  1. Build test: [X]Passed [ ]Failed [ ]Skipped
  2. Run test: [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Eunju Yang [email protected]

Summary

  • This is a draft. Additional testing with long contexts is still needed.


@EunjuYang EunjuYang force-pushed the causallm/update_qwen_caching branch from 76cc380 to 88b5647 Compare September 18, 2025 02:07
@github-actions

github-actions bot commented Oct 2, 2025

This PR is stale because it has been open 14 days with no activity. Remove stale label or comment or this will be closed in 3 days.

@github-actions github-actions bot added the Stale label Oct 2, 2025
@github-actions

github-actions bot commented Oct 5, 2025

This PR was closed because it has been stalled for 3 days with no activity.

@github-actions github-actions bot closed this Oct 5, 2025
@EunjuYang EunjuYang reopened this Oct 12, 2025
@EunjuYang EunjuYang removed the Stale label Oct 12, 2025