Use TaxCalcBench as a test suite #16

@ethanc8

A few months ago, Column Tax (which sells tax preparation software for banks to integrate into their mobile apps) released TaxCalcBench, a benchmark for LLM calculation of US taxes. The test cases were all created manually, and Column Tax intends to update the benchmark for future years.

It would probably be useful to integrate TaxCalcBench as a test suite, so that we can ensure our future years' tax calculations are correct, at least for the cases that are included in TaxCalcBench.

The inputs in TaxCalcBench use a proprietary JSON format, so we would have to figure out how to map them into the fact graph. The expected outputs are MeF XML, so we'd also need some way to check whether two MeF XML files are equivalent.
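
For the XML comparison, here is a minimal sketch of one possible approach, using only Python's standard library. It treats two documents as equivalent if they have the same element tree after ignoring namespace prefixes, attribute order, comments, and leading/trailing whitespace. The function names `normalize` and `xml_equivalent` are hypothetical, and the element names in the usage example are made up for illustration rather than taken from the real MeF schema. It assumes child element order matters and does not treat `100` and `100.00` as equal, which may or may not be what we want for MeF.

```python
import xml.etree.ElementTree as ET


def normalize(elem):
    """Convert an element into a nested tuple that can be compared with ==.

    ElementTree expands tags to "{namespace-uri}local", so documents that
    differ only in namespace prefixes normalize to the same value.
    """
    text = (elem.text or "").strip()               # ignore indentation/padding
    attrs = tuple(sorted(elem.attrib.items()))     # ignore attribute order
    children = tuple(normalize(c) for c in elem)   # child order is preserved
    return (elem.tag, attrs, text, children)


def xml_equivalent(a: str, b: str) -> bool:
    """Return True if two XML strings have the same normalized structure."""
    return normalize(ET.fromstring(a)) == normalize(ET.fromstring(b))


if __name__ == "__main__":
    # Illustrative documents only; not real MeF element names.
    expected = '<Return xmlns="urn:example"><Amt a="1" b="2">100</Amt></Return>'
    actual = (
        '<r:Return xmlns:r="urn:example">\n'
        '  <r:Amt b="2" a="1"> 100 </r:Amt>\n'
        '</r:Return>'
    )
    print(xml_equivalent(expected, actual))
```

If the benchmark only scores specific lines of the return, comparing a chosen set of elements instead of the whole tree might be a better fit than this whole-document check.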