Skip to content

Popular repositories Loading

  1. autoarena autoarena Public

    Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation

    TypeScript 104 8

  2. kolena kolena Public

    Python client for Kolena's machine learning testing platform

    Python 48 5

  3. cheatsheet cheatsheet Public

    Webpage cheatsheet for ML evaluation and testing

    HTML 1

  4. financebench financebench Public

    Forked from patronus-ai/financebench

    Jupyter Notebook 1

  5. docker-sqitch docker-sqitch Public

    Forked from sqitchers/docker-sqitch

    Docker Image packaging for Sqitch

    Dockerfile

  6. cvat cvat Public

    Forked from cvat-ai/cvat

    Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

    TypeScript

Repositories

Showing 9 of 9 repositories

Sponsoring

  • @squidfunk
  • @pawamoy

Top languages

Loading…

Most used topics

Loading…