Skip to content

Conversation

potamides
Copy link
Owner

Add evaluation scripts of chat-based models (GPT-4, Claude, etc) for DeTikZify and TikZero. The scripts still need some clean-up before they can be merged but they might already be useful to whoever is interested in evaluating such models.

@potamides potamides linked an issue May 28, 2025 that may be closed by this pull request
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

How do you evaluate the commercial LLMs

1 participant