
[experiment] self-improving evals #31

@mikecann

It would be an interesting experiment to see if a model could be used to improve the guidelines iteratively.

So something like:

  1. Model runs evals with current guidelines
  2. Model observes output to ensure all evals still pass
  3. Model makes a change to the guidelines (perhaps removing something or refining it to use fewer tokens)
  4. Model returns to step 1

It might be important to use a "reasoning" model like o1 or r1 for this "architect" step, then execute the evals again on a cheaper, lower-tier model. A rough sketch of the loop is below.
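
A minimal sketch of what this loop could look like, assuming a hypothetical eval harness. The function names (`run_evals`, `propose_guideline_change`), the `EvalReport` shape, and the accept/reject rule are illustrative assumptions, not an existing API:

```python
from dataclasses import dataclass


@dataclass
class EvalReport:
    passed: bool        # did every eval pass with these guidelines?
    token_count: int    # rough size of the guidelines prompt


def run_evals(guidelines: str) -> EvalReport:
    # Placeholder: in a real setup this would run the full eval suite against
    # a cheaper "executor" model using `guidelines` as the system prompt.
    return EvalReport(passed=True, token_count=len(guidelines.split()))


def propose_guideline_change(guidelines: str, report: EvalReport) -> str:
    # Placeholder: in a real setup this would prompt a reasoning "architect"
    # model (e.g. o1 or r1) to remove or tighten one guideline.
    return guidelines


def improve_guidelines(guidelines: str, max_iterations: int = 10) -> str:
    best = guidelines
    best_report = run_evals(best)                        # step 1: evals with current guidelines
    for _ in range(max_iterations):
        candidate = propose_guideline_change(best, best_report)  # step 3: architect edits guidelines
        report = run_evals(candidate)                    # step 4 -> step 1: re-run evals
        # step 2: only keep a change if every eval still passes and the prompt got smaller
        if report.passed and report.token_count < best_report.token_count:
            best, best_report = candidate, report
    return best
```

The key design choice here is that a candidate change is only accepted when all evals still pass, so the guidelines can only shrink or refine without regressing.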
