Skip to content

Add autocompare traces, UI model add, and CI tests#1

Open
abidlabs wants to merge 1 commit intomainfrom
feature/autocompare-traces-ui
Open

Add autocompare traces, UI model add, and CI tests#1
abidlabs wants to merge 1 commit intomainfrom
feature/autocompare-traces-ui

Conversation

@abidlabs
Copy link
Copy Markdown
Member

Summary

  • Adds autocompare() instrumentation for OpenAI + Anthropic (chat, responses/messages)
  • Introduces trace-first logging + dashboard trace view, model-size badges, and secondary metrics callbacks
  • Adds a + UI action to add a model and replay saved traces
  • Adds CI (ruff + pytest + Playwright) and a few unit/UI smoke tests

Test plan

  • ruff check . and ruff format --check .
  • pytest -q
  • Playwright UI smoke test included in pytest

kumquat

Made with Cursor

Implements OpenAI/Anthropic autocompare instrumentation, trace-first logging and dashboard views, monthly model presets, and secondary metrics callbacks. Adds UI + model replay flow and a minimal CI workflow with pytest + Playwright coverage. kumquat

Made-with: Cursor
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant