feat(benchmark/gemm): add base, dense, and deepseek GEMM benchmarks #226

Xiaoming-AMD · 2025-10-15T07:53:55Z

This PR introduces a unified GEMM Benchmark Suite in primus/tools/benchmark/, covering three benchmark types:

Benchmark Type	Description	Example Command
`base`	Raw matmul benchmark (manual M/N/K) for kernel validation or roofline testing	`primus-cli benchmark gemm --m 8192 --n 8192 --k 8192`
`dense`	Dense-model GEMM benchmark (e.g. Llama2/3, Mistral) with model-aware config	`primus-cli benchmark gemm-dense --model Llama3.1_8B`
`deepseek`	DeepSeek-style GEMM benchmark (with LoRA, Router, MoE experts)	`primus-cli benchmark gemm-deepseek --model Deepseek_V2`

All three benchmarks support standardized Markdown report output, consistent CLI usage, and extensible config handling.

GEMM Benchmark Report

Dense GEMM Benchmark Report

DeepSeek GEMM Benchmark Report

- Allow '--model' argument to be case-insensitive (e.g. 'llama3.1_8b' == 'Llama3.1_8B'). - Added lowercase mapping lookup for MODEL_CONFIGS. - Preserve canonical model name in report output.

Xiaoming-AMD added 4 commits October 14, 2025 02:02

feature(benchmark): add base gemm bench

3582cd9

feature(benchmark): add dense model gemm bench

1b3b4e7

feat(benchmark/gemm): support case-insensitive model name matching

a0a693f

- Allow '--model' argument to be case-insensitive (e.g. 'llama3.1_8b' == 'Llama3.1_8B'). - Added lowercase mapping lookup for MODEL_CONFIGS. - Preserve canonical model name in report output.

feature(benchmark): add deepseek model gemm bench

e68322c

Xiaoming-AMD requested review from limou102 and wenxie-amd as code owners October 15, 2025 07:53

Xiaoming-AMD added 2 commits October 18, 2025 17:28

Merge branch 'main' into feature/benchmark/gemm

23b40d8

Merge branch 'main' into feature/benchmark/gemm

f57054a

wenxie-amd approved these changes Oct 22, 2025

View reviewed changes

Xiaoming-AMD merged commit 48459a2 into main Oct 22, 2025
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(benchmark/gemm): add base, dense, and deepseek GEMM benchmarks #226

feat(benchmark/gemm): add base, dense, and deepseek GEMM benchmarks #226

Uh oh!

Xiaoming-AMD commented Oct 15, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat(benchmark/gemm): add base, dense, and deepseek GEMM benchmarks #226

feat(benchmark/gemm): add base, dense, and deepseek GEMM benchmarks #226

Uh oh!

Conversation

Xiaoming-AMD commented Oct 15, 2025

GEMM Benchmark Report

Dense GEMM Benchmark Report

DeepSeek GEMM Benchmark Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants