Add Gemma4 26B-A4B Automations Evaluation#275
Conversation
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #275 +/- ##
=======================================
Coverage 56.88% 56.88%
=======================================
Files 51 51
Lines 2099 2099
=======================================
Hits 1194 1194
Misses 905 905 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
I guess the problem is it shows the total as 0, maybe I did something wrong running the eval |
|
Can you share the command you are using for computing the metrics? I do believe the command is different for these tests, but not sure if this is still accurate: https://github.com/allenporter/home-assistant-datasets/tree/main/datasets/automations#evaluate |
|
I ran I tried the commands from the automations readme but I got an error saying |
|
How about running |
|
That did it |
|
Very strong results, thank you. |
|
@allenporter by the way, this was with reasoning set to off, I didn't think to check if there was a way to designate that |
|
OK good call out. You could update models.yaml and with that detail just in the text description and it can then show up on the display https://github.com/allenporter/home-assistant-datasets/tree/main/reports#gemma4-26b-a4b |
Update to add Gemma4 26B-A4B automation performance