Old questions test cases #697

Open
jankrepl wants to merge 7 commits into main from old-questions-deepeval

@jankrepl commented Feb 6, 2026

Closes #693

For reviewers

  • There are 59 new test cases
  • By default, none of the test cases are run. To actually run them, pass --exclude-tags=""
  • I added the -n / --dry-run flag to the CLI, which only enumerates the tests that would be run
  • All the test cases I added have the benchmark and $THREAD_ID tags, plus other tags based on the content of user.md
  • Once we merge this PR, the idea is to assign different tags to different people and create new issues
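
A minimal sketch of how tag-based exclusion and a dry run could fit together, for reviewers who want a mental model. This is not the code in this PR: the function names, the comma-separated format of --exclude-tags, and the assumption that "benchmark" is the default excluded tag are all hypothetical.

```python
import argparse


def parse_exclude_tags(raw):
    """Split a comma-separated tag string; an empty string excludes nothing."""
    return [tag.strip() for tag in raw.split(",") if tag.strip()]


def select_tests(test_tags, exclude_tags):
    """Return the sorted names of tests that carry none of the excluded tags."""
    excluded = set(exclude_tags)
    return sorted(
        name for name, tags in test_tags.items() if not excluded & set(tags)
    )


def build_parser():
    parser = argparse.ArgumentParser(description="Hypothetical eval runner")
    # Assumption: benchmark-tagged cases are excluded unless the flag is overridden.
    parser.add_argument("--exclude-tags", default="benchmark")
    parser.add_argument("-n", "--dry-run", action="store_true")
    return parser


def main(argv, test_tags):
    """Select tests; in dry-run mode only list them."""
    args = build_parser().parse_args(argv)
    selected = select_tests(test_tags, parse_exclude_tags(args.exclude_tags))
    if args.dry_run:
        for name in selected:
            print(name)
    # A real runner would execute the selected tests here when not in dry-run mode.
    return selected
```

With this shape, passing --exclude-tags="" together with --dry-run lists every test case, including the benchmark ones, without running anything.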

TODO

  • Create test cases
    • PLACEHOLDER in expected_output.md
    • {} in expected_tool_calls.json
    • "benchmark", $thread_id in params.json
  • Make sure that test cases with the "benchmark" tag are not run in CI
  • Add dry-run functionality to the CLI
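
For illustration, the placeholder files listed above could be generated with a small script like the one below. It is a sketch under stated assumptions, not the script used for this PR: the one-directory-per-thread layout, the "tags" key in params.json, the exact placeholder text, and the scaffold_case helper are all hypothetical.

```python
import json
import tempfile
from pathlib import Path

# Assumption: the literal placeholder text written to expected_output.md.
PLACEHOLDER_TEXT = "PLACEHOLDER\n"


def scaffold_case(root, thread_id, extra_tags=()):
    """Create one placeholder test case directory and return its path."""
    case_dir = Path(root) / thread_id  # assumed layout: one directory per thread
    case_dir.mkdir(parents=True, exist_ok=True)

    (case_dir / "expected_output.md").write_text(PLACEHOLDER_TEXT, encoding="utf-8")
    (case_dir / "expected_tool_calls.json").write_text("{}\n", encoding="utf-8")

    # Assumption: tags live under a "tags" key in params.json.
    params = {"tags": ["benchmark", thread_id, *extra_tags]}
    (case_dir / "params.json").write_text(
        json.dumps(params, indent=2) + "\n", encoding="utf-8"
    )
    return case_dir


if __name__ == "__main__":
    # Usage example: scaffold a single case in a throwaway directory.
    with tempfile.TemporaryDirectory() as tmp:
        created = scaffold_case(tmp, "thread-123", extra_tags=["example-tag"])
        print(sorted(path.name for path in created.iterdir()))
```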

@jankrepl added the no changelog (Disables our GH action), no deepeval (Disables our GH action), and skip docs labels Feb 6, 2026
@jankrepl marked this pull request as ready for review February 6, 2026 12:28
Development

Successfully merging this pull request may close these issues.

Past questions -> Deepeval test cases
