Open tools for the human-judgment layer of AI evaluation. Rubric authoring, scoring, judge calibration, reviewer agreement, drift detection, leakage audits, dataset documentation, and robotics review data — local, inspectable, and runnable without an AuraOne account.
The thesis behind this release: the best evals will stay private, but the standards for building trustworthy evals should not. Read it in full at resources/writing/measuring-human-judgment-layer.md.
| Component | What it is |
|---|---|
| packages/evalkit/ | auraone-evalkit Python package + evalkit CLI for rubric validation, scoring, judge calibration, reviewer agreement, drift, leakage audits, sampling, and versioning (see the agreement sketch after this table). |
| robotics-reviewkit/ | Schemas, exporters, taxonomy, and a static viewer for teleop review and failure data. Includes LeRobot and RLDS/OpenX export bridges. |
| resources/buying-toolkit/ | Templates and checklists for teams buying human-data work: SOWs, RFPs, SLAs, vendor comparison, pilot design, reviewer certification, and program playbook. |
| resources/writing/ | The thesis posts behind this release. |
| docs/PRD/ | The audit trail from source to v0.1.0. |
```bash
pip install auraone-evalkit
evalkit validate-rubric path/to/rubric.jsonl
evalkit lint-rubric path/to/rubric.jsonl
evalkit score --rubric rubric.jsonl --responses outputs.jsonl --out scores.json
```

Full CLI reference: packages/evalkit/README.md.
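The rubric schema that validate-rubric enforces is documented in packages/evalkit/README.md. As a purely hypothetical sketch of what line-oriented JSONL validation looks like in general, here is a standalone checker; the required field names (id, criterion, scale) are assumptions for illustration, not EvalKit's schema.

```python
# Hypothetical sketch of JSONL rubric validation. The REQUIRED field
# names are assumptions for illustration, NOT EvalKit's real schema.
import json
import sys

REQUIRED = {"id", "criterion", "scale"}

def check_rubric(path: str) -> int:
    """Return the number of malformed lines in a JSONL rubric file."""
    errors = 0
    with open(path, encoding="utf-8") as f:
        for lineno, line in enumerate(f, start=1):
            try:
                record = json.loads(line)
            except json.JSONDecodeError as exc:
                print(f"{path}:{lineno}: invalid JSON ({exc})")
                errors += 1
                continue
            if not isinstance(record, dict):
                print(f"{path}:{lineno}: expected a JSON object")
                errors += 1
                continue
            missing = REQUIRED - set(record)
            if missing:
                print(f"{path}:{lineno}: missing fields {sorted(missing)}")
                errors += 1
    return errors

if __name__ == "__main__":
    sys.exit(1 if check_rubric(sys.argv[1]) else 0)
```

The pattern (one record per line, file:line diagnostics, nonzero exit on failure) is what makes checks like this easy to wire into CI.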
- Not an expert-authored benchmark. The tutorial datasets included here are synthetic and exist to make the tooling runnable.
- Not real robotics data. The example episodes in robotics-reviewkit/examples/ are mock metadata.
- Not a hosted AuraOne service. EvalKit runs locally with no API key, no tenant, no database.
These limits are intentional. The release ships the standards, not the dataset.
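For a concrete picture of what "mock metadata" means, a teleop review record could look roughly like the sketch below. Every field name here is an assumption made for illustration; the authoritative schemas ship in robotics-reviewkit/.

```python
# Hypothetical teleop review record, illustrating the *shape* of the mock
# metadata in robotics-reviewkit/examples/. All field names are assumptions;
# the real schemas are defined in robotics-reviewkit/.
episode = {
    "episode_id": "ep_0001",
    "task": "pick_place_mug",
    "operator_id": "op_12",           # anonymized teleoperator
    "reviewer_verdict": "failure",    # reviewer's overall judgment
    "failure_tags": ["grasp_slip"],   # taxonomy labels, not sensor data
    "notes": "Gripper closed early; object slipped on lift.",
}
print(episode["reviewer_verdict"], episode["failure_tags"])
```

The absence of any sensor or trajectory payload is the point: the release ships review standards, not robot data.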
- AuraOne Open landing page: https://auraone.ai/open
- Resource hub: https://auraone.ai/resources
- Hosted AuraOne: https://auraone.ai
MIT — see LICENSE.