Description
Currently, `evaluation.yaml` exists under the `configs/` directory. To start, we wanted to showcase this recipe as an example, but it is a core part of the finetuning process and therefore should mirror the pattern we've established for other configs, which reside under model-specific directories.
The change for each model directory consists of four steps:

1. Copy `evaluation.yaml` into whichever model directory you are focused on.
2. Update the defaults from `llama2` to the current model's defaults.
3. Update `_recipe_registry.py` to make sure the new YAML file can be found with the following command: `tune run eleuther_eval --config MODEL/evaluation`
4. Put up a PR with output from running the evaluation script. Here's an example for Qwen2: Add evaluation configs under qwen2 dir #1809
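Steps 1 and 2 can be sketched as a copy-and-substitute. The snippet below runs against a scratch directory so it is self-contained; the paths (`recipes/configs/...`) and component names (`torchtune.models.qwen2.qwen2_7b`) are illustrative assumptions, not the exact ones in the repo:

```shell
set -eu
mkdir -p scratch/recipes/configs/qwen2

# Stand-in for the existing top-level eval config.
cat > scratch/recipes/configs/evaluation.yaml <<'EOF'
model:
  _component_: torchtune.models.llama2.llama2_7b
EOF

# Step 1 + step 2: copy the config under the model directory and swap the
# llama2 defaults for the new model's builder in one pass.
sed 's/llama2\.llama2_7b/qwen2.qwen2_7b/' \
  scratch/recipes/configs/evaluation.yaml \
  > scratch/recipes/configs/qwen2/evaluation.yaml

cat scratch/recipes/configs/qwen2/evaluation.yaml
```

In practice you would then hand-check the copied file for any other model-specific defaults (tokenizer, checkpoint paths) rather than relying on a single substitution.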
If multiple sizes of a model exist in the directory, select the most commonly used one. This is certainly up for interpretation, but ~7B params is typically standard. We want to give a good example, but there's no need to proliferate configs for every model SIZE.
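For reference, a per-model `evaluation.yaml` might end up looking roughly like the fragment below, with its defaults pointing at that directory's ~7B builder. The field names follow the general torchtune config pattern but are assumptions, not the exact schema:

```yaml
# Illustrative shape of a per-model evaluation config (names assumed).
model:
  _component_: torchtune.models.qwen2.qwen2_7b  # the directory's ~7B builder

tokenizer:
  _component_: torchtune.models.qwen2.qwen2_tokenizer
  path: /tmp/Qwen2-7B/tokenizer.json  # placeholder path

# EleutherAI eval-harness settings
tasks: ["truthfulqa_mc2"]
limit: null
batch_size: 8
```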
Checklist:
- [ ] Llama2
- [x] Code-Llama2 (Add evaluation file for code_llama2 model #2209, thanks @ReemaAlzaid)
- [ ] Llama3
- [ ] Llama3.1
- [x] Llama3.2 (Llama3.2 3B eval #2186, thanks @ReemaAlzaid)
- [ ] Llama3.2V
- [x] Mistral (1810 Move mistral evaluation #1829, thanks @Yousof-kayal)
- [x] Phi3 (1810 Add evaluation configs under phi3 dir #1822, thanks @Harthi7)
- [x] Gemma (1810 move gemma evaluation #1819, thanks @malinjawi)
- [ ] Gemma2
- [x] Qwen2
- [x] Qwen2.5 (Add eval config for QWEN2_5 model using 0.5B variant #2230, thanks @Ankur-singh)
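Step 3 above amounts to adding the new config path to the eval recipe's entry in `_recipe_registry.py`. A minimal sketch of the pattern, using stand-in dataclasses rather than torchtune's actual definitions (which may differ in detail):

```python
from dataclasses import dataclass, field
from typing import List

# Stand-ins for torchtune's registry types; the real definitions live in
# torchtune/_recipe_registry.py.
@dataclass
class Config:
    name: str       # what users pass to --config
    file_path: str  # YAML location relative to the configs directory

@dataclass
class Recipe:
    name: str
    file_path: str
    configs: List[Config] = field(default_factory=list)

eleuther_eval = Recipe(
    name="eleuther_eval",
    file_path="eleuther_eval.py",
    configs=[
        # The new per-model entry, so that
        # `tune run eleuther_eval --config qwen2/evaluation`
        # resolves (names assumed for illustration):
        Config(name="qwen2/evaluation", file_path="qwen2/evaluation.yaml"),
    ],
)

print([c.name for c in eleuther_eval.configs])
```

The important part is that the `name` matches what users pass after `--config`, and the `file_path` matches where the YAML was copied in step 1.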
After all of these are completed, we will deprecate the `evaluation.yaml` configs in the base `configs/` directory.
Thanks, everyone, for your help! 🎉