Skip to content

Improve eval-results badge docs and add moderation note#2208

Draft
gary149 wants to merge 1 commit intomainfrom
improve-eval-results-docs
Draft

Improve eval-results badge docs and add moderation note#2208
gary149 wants to merge 1 commit intomainfrom
improve-eval-results-docs

Conversation

@gary149
Copy link
Contributor

@gary149 gary149 commented Feb 6, 2026

Summary

  • Clarify badge table descriptions to match actual implementation in moon-landing (e.g. verified = valid verifyToken reproduced via Inspect AI, community = open PR, leaderboard = benchmark has a leaderboard, source = external URL provided)
  • Add a tip after Community Contributions explaining how community scores can be moderated (author can close the PR to remove a disputed score)

Context

User feedback surfaced two common questions:

  1. "Who runs the evals?" — badge descriptions now make provenance clearer
  2. "Can someone submit false scores?" — new tip explains the PR-based moderation model

Test plan

  • Review rendered markdown for clarity

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Add a tip after Community Contributions explaining that community scores
are visible while the PR is open and the model author can close the PR
to remove a disputed score.
@gary149 gary149 force-pushed the improve-eval-results-docs branch from f89f93b to b8b7aed Compare February 6, 2026 15:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants