Skip to content

Conversation

@Xiaoming-AMD
Copy link
Collaborator

This PR introduces a unified GEMM Benchmark Suite in primus/tools/benchmark/, covering three benchmark types:

Benchmark Type Description Example Command
base Raw matmul benchmark (manual M/N/K) for kernel validation or roofline testing primus-cli benchmark gemm --m 8192 --n 8192 --k 8192
dense Dense-model GEMM benchmark (e.g. Llama2/3, Mistral) with model-aware config primus-cli benchmark gemm-dense --model Llama3.1_8B
deepseek DeepSeek-style GEMM benchmark (with LoRA, Router, MoE experts) primus-cli benchmark gemm-deepseek --model Deepseek_V2

All three benchmarks support standardized Markdown report output, consistent CLI usage, and extensible config handling.


GEMM Benchmark Report

image

Dense GEMM Benchmark Report

image

DeepSeek GEMM Benchmark Report

image

- Allow '--model' argument to be case-insensitive (e.g. 'llama3.1_8b' == 'Llama3.1_8B').
- Added lowercase mapping lookup for MODEL_CONFIGS.
- Preserve canonical model name in report output.
@Xiaoming-AMD Xiaoming-AMD merged commit 48459a2 into main Oct 22, 2025
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants