Skip to content

The main branch is slower than expected #983

@wenhuach21

Description

@wenhuach21

Quantizing llama3-8B cost 15+, expected is 13

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions