Test and publish latest "model of the moment" results: DeepSeek, etc #17

Open

Assignees

mikek-mlcommons

opened

on Feb 27, 2025

To get attention to the utility of the benchmark, we should regularly test whatever the "model of the day" is (Deepseek, Grok 3, GPT-5, etc.)

Metadata

Assignees

mikek-mlcommons

Labels

No labels

No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests