Skip to content

(exp): Benchmark MFU as compute / nodes scale #7

@yair-schiff

Description

@yair-schiff

Which arch should we test?

  • Llama 3B using AR loss
  • ModernBert as the backbone

Metadata

Metadata

Assignees

Labels

experimentExperiment we want to runpriority:highHighest priority tickets

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions