Stress-tested MiroFish with 3 live backtests, writeup inside #537
Metroxe started this conversation in Show and tell
Hey, wanted to share a real writeup of testing MiroFish on three different scenarios from the same week.
The three backtests:
57 agents total across the three sims, 4,748 agent-generated posts and comments. All three completed cleanly on a 16GB VM.
The ontology generation, agent persona system, and document-to-simulation pipeline are all genuinely novel. The political and cultural sims built rich knowledge graphs with meaningful relationships. The economic sim came out much sparser (9 nodes, 1 edge), which was the most interesting thing I found. It looks like domain-specific tuning could make a big difference there.
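For a rough sense of how sparse 9 nodes and 1 edge is, here is a minimal sketch that computes graph density (the fraction of possible edges actually present). The `graph_density` helper is hypothetical and not part of MiroFish, and since I don't know whether the knowledge graph is treated as directed or undirected, it shows both.

```python
def graph_density(num_nodes: int, num_edges: int, directed: bool = False) -> float:
    """Fraction of possible edges that are present (self-loops excluded).

    Hypothetical helper for illustration; not a MiroFish API.
    """
    if num_nodes < 2:
        return 0.0
    # Ordered pairs of distinct nodes; halve for undirected graphs.
    possible = num_nodes * (num_nodes - 1)
    if not directed:
        possible //= 2
    return num_edges / possible


# Economic sim figures from the post: 9 nodes, 1 edge.
econ_undirected = graph_density(9, 1)                 # 1 of 36 possible edges
econ_directed = graph_density(9, 1, directed=True)    # 1 of 72 possible edges

print(f"undirected density: {econ_undirected:.4f}")
print(f"directed density:   {econ_directed:.4f}")
```

Either way, under 3% of the possible relationships were extracted, so at least 7 of the 9 nodes have no connections at all.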
Full writeup with specifics on what impressed me and where it broke: https://x.com/Metroxe/status/2044496303938539744?s=20
Happy to share any of the raw simulation data or dig into specific results if anyone's curious. Excited to see where this goes.