Skip to content

Transconnectome/paperorchestra-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

paperorchestra-data

Benchmark data and automated evaluators for validating the PaperOrchestra pipeline proposed in arXiv 2604.05018 (Google).

Purpose

This repo hosts PaperWritingBench — reverse-engineered raw materials from 200 top-tier AI conference papers plus automated evaluators — used to benchmark the multi-agent pipeline implementation in the sister repo.

Pipeline under test

Input Processing → Literature Synthesis → Manuscript Generation → Visual Creation → Output

Scope

v0.1 — data creation only: Brain-science subset drawn from NeurIPS, ICLR, ICML, and CVPR (2020-2025). Target 200 papers with topic-diversity and venue-weighted quotas. Evaluator implementation (Citation F1, LLM-as-a-Judge, etc.) is deferred to v0.2.

Structure

  • papers/ — per-paper reverse-engineered raw materials (one dir per entry)
  • evaluators/ — automated quality evaluators (per stage + overall)
  • metadata/ — benchmark metadata, paper schema, selection criteria
    • paper_schema.json — JSON schema for one benchmark entry
    • selection_criteria.yaml — brain-science filter rules + venue×year quota
  • scripts/ — ingestion, parsing, raw-material generation, evaluation (TBD)
  • docs/ — plan, pipeline stages, benchmark protocol
    • benchmark_plan.mdfull execution plan (phases, risks, tooling)

Quick links

Related

About

PaperWritingBench: benchmark data and evaluators for validating the PaperOrchestra pipeline (arXiv 2604.05018). Sister repo to PaperOrchestrator.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages