paperorchestra-data

Benchmark data and automated evaluators for validating the PaperOrchestra pipeline proposed in arXiv 2604.05018 (Google).

Purpose

This repo hosts PaperWritingBench — reverse-engineered raw materials from 200 top-tier AI conference papers plus automated evaluators — used to benchmark the multi-agent pipeline implementation in the sister repo.

Pipeline under test

Input Processing → Literature Synthesis → Manuscript Generation → Visual Creation → Output

Scope

v0.1 — data creation only: Brain-science subset drawn from NeurIPS, ICLR, ICML, and CVPR (2020-2025). Target 200 papers with topic-diversity and venue-weighted quotas. Evaluator implementation (Citation F1, LLM-as-a-Judge, etc.) is deferred to v0.2.

Structure

papers/ — per-paper reverse-engineered raw materials (one dir per entry)
evaluators/ — automated quality evaluators (per stage + overall)
metadata/ — benchmark metadata, paper schema, selection criteria
- paper_schema.json — JSON schema for one benchmark entry
- selection_criteria.yaml — brain-science filter rules + venue×year quota
scripts/ — ingestion, parsing, raw-material generation, evaluation (TBD)
docs/ — plan, pipeline stages, benchmark protocol
- benchmark_plan.md — full execution plan (phases, risks, tooling)

Quick links

Execution plan: docs/benchmark_plan.md
Entry schema: metadata/paper_schema.json
Selection criteria: metadata/selection_criteria.yaml

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
benchmark/v0.1		benchmark/v0.1
docs		docs
evaluators		evaluators
guidelines		guidelines
metadata		metadata
scripts		scripts
templates		templates
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

paperorchestra-data

Purpose

Pipeline under test

Scope

Structure

Quick links

Related

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

paperorchestra-data

Purpose

Pipeline under test

Scope

Structure

Quick links

Related

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages