-
Notifications
You must be signed in to change notification settings - Fork 12
Open
Description
A few months ago, Column Tax (which sells tax preparation software for banks to integrate into their mobile apps) released a benchmark for LLM calculation of US taxes, called TaxCalcBench. The test cases were all created manually, and they intend to update it for future years.
It would probably be useful to integrate TaxCalcBench as a test suite, so that we can ensure our future years' tax calculations are correct, at least for the cases that are included in TaxCalcBench.
The inputs in TaxCalcBench are a proprietary JSON that we would have to figure out how to input to the fact graph. The output is MeF XML; we'd need to find some way to check if two MeF XML files are equivalent.
Metadata
Metadata
Assignees
Labels
No labels