Unitxt 1.23.1
What's Changed
- Add more metrics for schema linking by @kurhula in #1788
- Fixed argument_value_precision by @yoavkatz in #1794
- FIx granite guardian agentic metric and align it with unitxt built in tool calling types by @elronbandel in #1786
- Allow running benchmarks and recipes in cli by @elronbandel in #1785
- Add ToRR Benchmark Readme file by @csrajmohan in #1793
- Add tool calling correctness metric by @elronbandel in #1796
- Remove IBM branding from opensource doc by @yoavkatz in #1802
- Add LoadJsonFile loader and tests by @elronbandel in #1801
- LLM judge judgebench benchmarks by @martinscooper in #1800
- Added granite tool calling system prompt by @Narayanan-V-Eswar in #1798
- Documenation updates by @yoavkatz in #1790
- Cards for the Real MM RAG datasets by @assaftibm in #1795
- Add more judges by @martinscooper in #1808
- Fixed problematic load of json with a single dictionary line. by @yoavkatz in #1806
- Add more cross provider models by @martinscooper in #1807
- Fix model name by @martinscooper in #1809
- watsonx.ai mistral small support by @LukaszCmielowski in #1810
- Fix: number of batches calculation is incorrect by @martinscooper in #1805
- Fix example dependencies installation by @elronbandel in #1812
- Update version to 1.23.1 by @elronbandel in #1818
New Contributors
- @kurhula made their first contribution in #1788
- @LukaszCmielowski made their first contribution in #1810
Full Changelog: 1.23.0...1.23.1