[CI]【Hackathon 10th Spring No.39】fused_moe_marlin_backend.py unit test by bobby-cloudforge · Pull Request #7494 · PaddlePaddle/FastDeploy

bobby-cloudforge · 2026-04-19T20:02:53Z

Motivation

No.39 功能模块 fastdeploy/model_executor/layers/moe/fused_moe_marlin_backend.py 单元测试覆盖

Modifications

添加单测文件 tests/layers/test_fused_moe_marlin_backend.py

develop 分支：覆盖率0%，Miss行数115（17-361）

当前PR：覆盖率100%，Miss行数0

注：截图来源于本测试文件单独执行 pytest --cov 的结果（含 branch coverage），上方文字为 develop 已有测试与本 PR 合并后的 statement 覆盖率预估，统计口径有差异，以 CI 合并后实际结果为准。

覆盖行数增量 115-0 = 115 → 四舍五入 100 → 预估贡献 0.1⭐

Usage or Command

pytest tests/layers/test_fused_moe_marlin_backend.py

Accuracy Tests

N/A

Checklist

Add at least a tag in the PR title.
- Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
- You can add new tags based on the PR content, but the semantics must be clear.
Format your code, run pre-commit before commit.
Add unit tests. Please write the reason in this PR if no unit tests.
Provide accuracy results.
If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

paddle-bot · 2026-04-19T20:02:59Z

Thanks for your contribution!

PaddlePaddle-bot

🤖 AI Code Review | 2026-04-20 05:01 CST\n\n## 📋 Review 摘要\n\nPR 概述：为 fused_moe_marlin_backend.py 新增单元测试，覆盖纯函数（get_scale_perms、marlin_permute_scales 等）和 MarlinWeightOnlyMoEMethod 的 create_weights/process_loaded_weights/apply 方法。\n变更范围：tests/layers/（新增测试文件）\n影响面 Tag：CI OP\n\n### 问题\n\n| 级别 | 文件 | 概述 |\n|------|------|------|\n| 🟡 建议 | test_fused_moe_marlin_backend.py:38 | _NEED_STUB 机制在 GPU 可用时，per-test mock 可能失效 |\n| 🟡 建议 | test_fused_moe_marlin_backend.py:283 | test_apply_topk 和 test_apply_noaux_tc 大量重复 mock 配置，建议抽取公共 fixture |\n\n### 总体评价\n\n测试文件整体结构清晰，_DummyLayer 准确覆盖了 MarlinWeightOnlyMoEMethod 实际访问的所有 layer 属性，纯函数测试部分（TestPureFunctions）验证了真实计算逻辑。主要改进点在于两个 apply 测试之间的 mock 配置重复度较高，以及 stub 机制在有 GPU 环境下的鲁棒性可以加强。"

PaddlePaddle-bot · 2026-04-19T21:01:45Z

+_GPU_OPS = "fastdeploy.model_executor.ops.gpu"
+_DEEP_GEMM = f"{_GPU_OPS}.deep_gemm"
+
+_NEED_STUB = _GPU_OPS not in sys.modules


🟡 建议 当 GPU ops 可用时（_NEED_STUB = False），mb 模块在顶层通过真实模块导入，内部持有的 gpu_ops 引用指向真实模块对象。但各测试方法中 patch.object(_gpu_ops_stub, "gptq_marlin_repack", ...) 修改的是 stub 对象而非真实模块，mock 不会生效，测试可能直接调用真实 GPU 算子而失败。

建议改进方案：在 _NEED_STUB = False 分支下，将 _gpu_ops_stub 指向真实模块，使 per-test 的 patch.object 始终作用于正确的目标：

if _NEED_STUB: # ... 现有逻辑 ... else: from fastdeploy.model_executor.layers.moe import fused_moe_marlin_backend as mb _gpu_ops_stub = sys.modules[_GPU_OPS] # 使 patch.object 目标一致

PaddlePaddle-bot · 2026-04-19T21:01:45Z

+            paddle.zeros([g.shape[0], k], "int64"),
+        )
+
+        with (


🟡 建议 test_apply_topk（L219-260）和 test_apply_noaux_tc（L283-320）共享 5 组几乎相同的 patch 配置（sys.modules、gptq_marlin_repack、MoeWna16MarlinGemmApi、tritonmoe_preprocess_func、swiglu），代码重复度较高。

同目录下的兄弟测试文件（test_fused_moe_cutlass_backend.py、test_fused_moe_triton_backend.py）使用 pytest monkeypatch fixture 管理 mock。建议将公共 mock 抽取为 pytest fixture 或 contextmanager 辅助函数，例如：

from contextlib import contextmanager @contextmanager def _mock_gpu_ops(extra_modules=None): modules = {_GPU_OPS: _gpu_ops_stub, _DEEP_GEMM: _deep_gemm_stub} if extra_modules: modules.update(extra_modules) with ( patch.dict(sys.modules, modules, clear=False), patch.object(_gpu_ops_stub, "gptq_marlin_repack", lambda w, p, sk, sn, nb: paddle.zeros([sk // 16, sn * (nb // 2)], dtype=w.dtype)), patch.object(mb, "MoeWna16MarlinGemmApi", lambda *_a, **kw: (paddle.zeros([kw["size_m"], kw["size_n"]], "float32"),)), patch.object(mb, "tritonmoe_preprocess_func", lambda ids, ne, bm: (paddle.zeros([4], "int32"), paddle.zeros([1], "int32"), paddle.to_tensor([4], "int32"))), patch("paddle.incubate.nn.functional.swiglu", lambda x: x[..., : x.shape[-1] // 2], create=True), ): yield

这样每个测试方法只需额外添加各自特有的 mock（如 moe_topk_select 或 _moe_stub），可读性和可维护性都会提升。

bobby-cloudforge temporarily deployed to Metax_ci April 19, 2026 20:02 — with GitHub Actions Inactive

paddle-bot bot added the contributor External developers label Apr 19, 2026

This comment was marked as outdated.

Sign in to view

[CI]【Hackathon 10th Spring No.39】fused_moe_marlin_backend.py unit test

c849133

bobby-cloudforge force-pushed the feat/h10-39-marlin-moe-test1 branch from 8de7a8d to c849133 Compare April 19, 2026 20:45

bobby-cloudforge temporarily deployed to Metax_ci April 19, 2026 20:45 — with GitHub Actions Inactive

PaddlePaddle-bot reviewed Apr 19, 2026

View reviewed changes

luotao1 mentioned this pull request Apr 20, 2026

【Hackathon 10th】开源贡献个人挑战赛 · 春节特别季 PaddlePaddle/Paddle#77429

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CI]【Hackathon 10th Spring No.39】fused_moe_marlin_backend.py unit test#7494

[CI]【Hackathon 10th Spring No.39】fused_moe_marlin_backend.py unit test#7494
bobby-cloudforge wants to merge 1 commit intoPaddlePaddle:developfrom
CloudForge-Solutions:feat/h10-39-marlin-moe-test1

bobby-cloudforge commented Apr 19, 2026

Uh oh!

paddle-bot bot commented Apr 19, 2026

Uh oh!

This comment was marked as outdated.

Uh oh!

PaddlePaddle-bot left a comment

Uh oh!

PaddlePaddle-bot Apr 19, 2026

Uh oh!

PaddlePaddle-bot Apr 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bobby-cloudforge commented Apr 19, 2026

Motivation

Modifications

Usage or Command

Accuracy Tests

Checklist

Uh oh!

paddle-bot bot commented Apr 19, 2026

Uh oh!

This comment was marked as outdated.

Uh oh!

PaddlePaddle-bot left a comment

Choose a reason for hiding this comment

Uh oh!

PaddlePaddle-bot Apr 19, 2026

Choose a reason for hiding this comment

Uh oh!

PaddlePaddle-bot Apr 19, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants