Commit bc35289

2 parents 2aec74c + 73fc32c commit bc35289

366 files changed: +1037 -582 lines changed

.github/workflows/test.yml

Lines changed: 1 addition & 1 deletion
@@ -18,10 +18,10 @@ jobs:
       matrix:
         benchmark:
           - example_bench
+          - course_exam_bench
           # TODO: For now, we comment out other benchmarks as they have no tests
           # - arteval_bench
           # - cache_bench
-          # - course_exam_bench
           # - course_project_bench

     steps:

README.md

Lines changed: 36 additions & 29 deletions
Lines changed: 4 additions & 4 deletions

benchmarks/cache_bench/data/benchmark/cache_benchmarks.jsonl renamed to benchmarks/algo_cache_bench/data/benchmark/cache_benchmarks.jsonl

File renamed without changes.

benchmarks/cache_bench/data/benchmark/human_ref_results/argument_results.py renamed to benchmarks/algo_cache_bench/data/benchmark/human_ref_results/argument_results.py

File renamed without changes.

benchmarks/cache_bench/data/benchmark/human_ref_results/human_eval.jsonl renamed to benchmarks/algo_cache_bench/data/benchmark/human_ref_results/human_eval.jsonl

File renamed without changes.

benchmarks/cache_bench/data/benchmark/human_ref_results/human_results_summary.txt renamed to benchmarks/algo_cache_bench/data/benchmark/human_ref_results/human_results_summary.txt

File renamed without changes.

benchmarks/cache_bench/data/benchmark/human_ref_results/llm_results_eval.jsonl renamed to benchmarks/algo_cache_bench/data/benchmark/human_ref_results/llm_results_eval.jsonl

File renamed without changes.

benchmarks/cache_bench/data/benchmark/human_ref_results/llm_results_summary.txt renamed to benchmarks/algo_cache_bench/data/benchmark/human_ref_results/llm_results_summary.txt

File renamed without changes.
