Add reporting and researcher-in-the-loop evaluation tools #37

@adelavega

Without evaluation data, it is easy to misuse autonima with incorrect prompting and get a misleading result.

To prevent that, let's design user-friendly tools for researcher-in-the-loop prompt engineering and fine-tuning of the meta-analysis config file. We should also define standard procedures for optimizing prompts that need relatively little manual annotation and verification.

One idea: users could review a random sample of accepted and rejected studies, then revise their prompt based on what they find.
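
A rough sketch of what that review loop could look like, to make the idea concrete. This is not existing autonima code: the record layout (`id` and `decision` keys) and the function names `sample_for_review` and `agreement_rate` are made up for illustration, and it assumes screening results are available as a list of dicts.

```python
import random
from collections import defaultdict


def sample_for_review(results, n_per_decision=10, seed=0):
    """Draw up to n_per_decision studies at random from each decision group.

    `results` is assumed to be a list of dicts with hypothetical keys
    "id" and "decision" (e.g. "accepted" / "rejected").
    """
    rng = random.Random(seed)  # fixed seed so the review sample is reproducible
    by_decision = defaultdict(list)
    for record in results:
        by_decision[record["decision"]].append(record)

    sample = {}
    for decision, records in by_decision.items():
        k = min(n_per_decision, len(records))
        sample[decision] = rng.sample(records, k)
    return sample


def agreement_rate(sample, manual_labels):
    """Fraction of reviewed studies where the researcher agrees with the model.

    `manual_labels` maps study id -> the researcher's own decision.
    Studies without a manual label are skipped; returns None if none were reviewed.
    """
    reviewed = [
        (record["decision"], manual_labels[record["id"]])
        for records in sample.values()
        for record in records
        if record["id"] in manual_labels
    ]
    if not reviewed:
        return None
    return sum(model == human for model, human in reviewed) / len(reviewed)
```

Sampling separately from the accepted and rejected groups means the rejected studies still get looked at even when most studies are accepted (or the other way around). A low agreement rate on the sample would be a signal to revise the prompt before trusting the full run.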
