Test and publish latest "model of the moment" results: DeepSeek, etc #17

@mikek-mlcommons

To draw attention to the benchmark's usefulness, we should regularly test whatever the "model of the day" is (DeepSeek, Grok 3, GPT-5, etc.) and publish the results.
