Evaluate LLM-based screening #6

In #5 we will develop an LLM-based, PRISMA-compliant screening workflow.

We then need to evaluate its performance against neurometabench.

- First, evaluate the PubMed search itself: how many papers from the final inclusion list does the search retrieve at all? That number sets the maximum recall achievable if PubMed is the only source.
- Second, evaluate abstract-only screening separately.
- Third, evaluate the final full-text screening.
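The first step above can be sketched as a set comparison between the PubMed search results and the benchmark's final inclusion list. This is only a minimal sketch: the function name `search_recall_ceiling`, the use of PMIDs as identifiers, and the toy IDs are assumptions for illustration, not part of neurometabench or the #5 workflow.

```python
def search_recall_ceiling(retrieved_pmids, gold_included_pmids):
    """Return (recall ceiling, gold papers the search never retrieved).

    Hypothetical helper: assumes both inputs are iterables of PMID strings.
    """
    retrieved = set(retrieved_pmids)
    gold = set(gold_included_pmids)
    if not gold:
        raise ValueError("gold inclusion list is empty")
    found = gold & retrieved      # gold papers the search returned
    missing = gold - retrieved    # gold papers no later screening step can recover
    return len(found) / len(gold), sorted(missing)


# Toy data (made up): the search returns 4 records, the gold list has 3 papers.
ceiling, missing = search_recall_ceiling(
    ["101", "102", "103", "104"],
    ["102", "104", "999"],
)
print(f"recall ceiling: {ceiling:.2f}, not retrieved: {missing}")
```

Listing the missing papers, and not only the ratio, would make it easier to see whether the gap comes from the query or from papers not indexed in PubMed.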

Compute precision and recall for each step.
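A minimal sketch of the per-step metrics, assuming each step yields a set of paper IDs it kept and that every step is scored against the same final inclusion list. Both are assumptions: the benchmark may provide stage-specific labels, in which case each step should be scored against its own gold set. The names `StageMetrics`, `precision_recall`, and the stage keys are hypothetical.

```python
from typing import Iterable, NamedTuple


class StageMetrics(NamedTuple):
    precision: float
    recall: float


def precision_recall(predicted: Iterable[str], gold: Iterable[str]) -> StageMetrics:
    """Precision and recall of the kept IDs against the gold inclusion IDs."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    # Report 0.0 instead of dividing by zero when a set is empty.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return StageMetrics(precision, recall)


# Toy data (made up): IDs kept after each step, and the gold inclusion list.
gold = {"102", "104", "999"}
stages = {
    "search": {"101", "102", "103", "104"},
    "abstract": {"102", "103", "104"},
    "full_text": {"102", "104"},
}

report = {name: precision_recall(kept, gold) for name, kept in stages.items()}
for name, metrics in report.items():
    print(f"{name}: precision={metrics.precision:.2f} recall={metrics.recall:.2f}")
```

In this toy run recall stays at 2/3 across all steps because one gold paper is never retrieved by the search, while precision rises as each screening step removes irrelevant records.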
