Could Pi underperform when models are tuned for native harnesses? #4121
marcusraty
started this conversation in
General
Replies: 2 comments 1 reply
There is at least one example of this for
1 reply
Absolutely, yes. Where possible, it is best to "match" the harness the specific LLM was trained in. For example, if you mainly use GPT, it is best to expose apply_patch instead of edit/write.
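To make the idea concrete, here is a minimal sketch of picking a tool set per model family. It is not Pi's actual code: `DEFAULT_TOOLS`, `NATIVE_EDIT_TOOLS`, `select_tools`, and the prefix matching on the model id are all hypothetical names and assumptions for illustration. Only the tool names themselves (read/write/edit/bash and apply_patch) come from this thread.

```python
# Hypothetical sketch: none of these names come from Pi's real implementation.
DEFAULT_TOOLS = ("read", "write", "edit", "bash")

# Assumed mapping from a model-id prefix to the edit tool(s) that family
# is said to handle best. Only the GPT/apply_patch pairing is from the thread.
NATIVE_EDIT_TOOLS = {
    "gpt": ("apply_patch",),
}


def select_tools(model_id: str) -> tuple[str, ...]:
    """Return the tool names to expose to the given model."""
    name = model_id.lower()
    for prefix, edit_tools in NATIVE_EDIT_TOOLS.items():
        if name.startswith(prefix):
            # Drop the generic write/edit pair and slot the family's
            # native edit tool(s) in right after "read".
            kept = tuple(t for t in DEFAULT_TOOLS if t not in ("write", "edit"))
            return kept[:1] + edit_tools + kept[1:]
    # Unknown families fall back to the minimal default harness.
    return DEFAULT_TOOLS
```

With this sketch, a model id beginning with "gpt" would get read, apply_patch, and bash, while any other id keeps the default four tools. Whether this actually improves results is the open question of this thread.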
0 replies
Could Pi’s minimal read/write/edit/bash harness perform worse than provider-native harnesses like Claude Code or Codex, given that frontier models may be trained (via RL/gyms) or evaluated around their own tool shapes, prompts, patch formats, and search workflows?