Add reporting and researcher-in-the-loop evaluation tools

Without evaluation data, it is easy to misuse autonima with incorrect prompting and get a misleading result.

To prevent that, let's design user friendly tools to allow for researcher-in-the-loop prompt engineering and fine-tuning of the meta-analysis config file, and standard procedures for optimizing prompts with relatively minimal manual annotation and verification.

One idea is users could review a random sample of accepted/rejected studies, and based on that revise their prompt.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add reporting and researcher-in-the-loop evaluation tools #37

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Add reporting and researcher-in-the-loop evaluation tools #37

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions