03 Eval Harness

Learn a lightweight eval loop for AI outputs.

This example grades candidate responses against simple criteria and prints a pass/fail summary.

What this example teaches

python3 run.py --cases sample_input/eval_cases.json

python3 -m unittest discover -s tests -p "test_*.py"