Stress-tested MiroFish with 3 live backtests, writeup inside #537

Metroxe · 2026-04-15T20:52:08Z

Metroxe
Apr 15, 2026

Hey, wanted to share a real writeup of testing MiroFish on three different scenarios from the same week.

The three backtests:

A celebrity scandal (cultural discourse)
The Iran ceasefire (geopolitics)
The oil shock (financial markets)

57 agents total across the three sims, 4,748 agent-generated posts and comments. All three completed cleanly on a 16GB VM.

The ontology generation, agent persona system, and document-to-simulation pipeline are all genuinely novel. The political and cultural sims built rich knowledge graphs with meaningful relationships. The economic sim came out much sparser (9 nodes, 1 edge) which was the most interesting thing I found. Looks like domain-specific tuning could make a big difference there.

Full writeup with specifics on what impressed me and where it broke: https://x.com/Metroxe/status/2044496303938539744?s=20

Happy to share any of the raw simulation data or dig into specific results if anyone's curious. Excited to see where this goes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stress-tested MiroFish with 3 live backtests, writeup inside #537

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Stress-tested MiroFish with 3 live backtests, writeup inside #537

Uh oh!

Metroxe Apr 15, 2026

Replies: 0 comments

Metroxe
Apr 15, 2026