Skip to content

Add jfinqa: Japanese Financial Numerical Reasoning QA#1169

Open
ajtgjmdjp wants to merge 1 commit intohuggingface:mainfrom
ajtgjmdjp:add-jfinqa
Open

Add jfinqa: Japanese Financial Numerical Reasoning QA#1169
ajtgjmdjp wants to merge 1 commit intohuggingface:mainfrom
ajtgjmdjp:add-jfinqa

Conversation

@ajtgjmdjp
Copy link

Summary

Adds jfinqa (Japanese Financial Numerical Reasoning QA) as a new evaluation task.

  • 1,000 questions across 3 subtasks: numerical reasoning, consistency checking, temporal reasoning
  • 68 companies from EDINET filings (J-GAAP / IFRS / US-GAAP)
  • Custom metrics for Japanese financial formats (fullwidth digits, △ negatives, kanji multipliers)
  • Dataset: ajtgjmdjp/jfinqa

Closes #1168

Changes

  • src/lighteval/tasks/tasks/jfinqa.py: Task file with 3 configs, prompt function, and 2 custom metrics (exact match + numerical match with 1% tolerance)

Test plan

  • pre-commit run --files src/lighteval/tasks/tasks/jfinqa.py passes
  • python -c "from lighteval.tasks.tasks.jfinqa import TASKS_TABLE" loads 3 tasks
  • Run evaluation with a small model to verify end-to-end

🤖 Generated with Claude Code

Three subtasks: numerical reasoning (550), consistency checking (200),
and temporal reasoning (250) from 68 companies' EDINET filings.

Custom metrics handle Japanese financial formats: fullwidth digits,
triangle negatives, kanji multipliers, and unit suffixes.

Dataset: https://huggingface.co/datasets/ajtgjmdjp/jfinqa

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add jfinqa: Japanese Financial Numerical Reasoning QA

1 participant