Could Pi underperform when models are tuned for native harnesses? #4121

marcusraty · 2026-05-03T04:46:47Z

marcusraty
May 3, 2026

Could Pi’s minimal read/write/edit/bash harness perform worse than provider-native harnesses like Claude Code or Codex because frontier models may be trained using RL/gyms or evaluated around their own tool shapes, prompts, patch formats, and search workflows?

jukofyork · 2026-05-11T13:37:35Z

jukofyork
May 11, 2026

There is at least one example of this for kimi-2.5 running in opencode:

anomalyco/opencode#20258

anomalyco/opencode#20259

1 reply

marcusraty May 11, 2026
Author

Thanks for your reply. Do you think those examples are related to system prompts which are specific to models vs what I was suggesting which is that frontier model companies are using different kinds of training to make their models works best with their Tool interfaces.

An example would be that Claude Opus is trained on examples of / post trained to use Claude Code harness style Tool calls Read() Edit() while GPT is trained on codex style Tool calls Bash(). That could mean these models may not work as well in pi or opencode harnesses.

aussetg · 2026-05-24T13:38:31Z

aussetg
May 24, 2026

Absolutely yes.

It is best if possible to try to "match" the harness the specific LLM was trained in. So for example if you love/use GPT it's best to have apply_patch instead of edit/write

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Could Pi underperform when models are tuned for native harnesses? #4121

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 2 comments 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Could Pi underperform when models are tuned for native harnesses? #4121

Uh oh!

Uh oh!

marcusraty May 3, 2026

Replies: 2 comments · 1 reply

Uh oh!

jukofyork May 11, 2026

Uh oh!

marcusraty May 11, 2026 Author

Uh oh!

aussetg May 24, 2026

marcusraty
May 3, 2026

Replies: 2 comments 1 reply

jukofyork
May 11, 2026

marcusraty May 11, 2026
Author

aussetg
May 24, 2026