I accidentally created a benchmark.
To test on your favorite LLM, copy the main .md file named three_bucket_system.md,
and ask your LLM like:
"Read the @three_bucket_system.md and find any contradiction or errors in it."
| Name | Name | Last commit date | ||
|---|---|---|---|---|