Skip to content

[evals] Add ASR and OCR-noisy text PPL evals #5097

@dlwh

Description

@dlwh

🤖 Part of #5005.

Description

Add PPL/gap eval sets for noisy text produced by ASR, OCR, screen scraping, and vision-to-text tools. Agents increasingly consume imperfect transcriptions rather than clean web text.

Initial sources:

Definition of Done

  • Add at least one ASR-noise source and one OCR-noise source.
  • Preserve noisy text and clean reference text where available.
  • Report clean-vs-noisy BPB deltas.
  • Document licensing/access constraints, especially for Common Voice.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions