Skip to content

what should I do if I want to improve the performance of hellaswag? #2154

Open
@mathCrazyy

Description

image

I want to find some dataset , for example OpenO1, KD 14B to 3B, or use lora, but I have a bad result:
image
the result of KD only reach 96.8% of the ori 3B Qwen2.5 model
what should I do? Thanks.

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions