-
Good question. As much as I'd like to share the extraction methods used, I fear they might be patched. However, there are some ways you can increase your trust in the prompts:

- Ask the LLM if some parts are correct, then try asking about parts which are *not* part of the claimed extracted prompt.
- Check whether the extracted prompt makes sense internally.
- Web search some parts of the extracted prompt and see if others have extracted the exact same text.

Are there any specific extractions you find sus? In general, if all these prompts were simply written by me and were fake, that would make for a pretty short, pointless and stupid venture. With experience you sus out hallucinated system prompts pretty quickly, and they are easy to rule out with simple regenerations: if an extraction is hallucinated, the outputs should differ between runs, as per the LLM's generative nature. And of course, any and all extractions here are the result of the same exact output being derived from countless generations, each with a fresh context.

At the end of the day, nobody can be sure all of these are 100% correct, but realistically they are almost surely about 99.9999% correct, as the whole internet converges on the factual system prompts behind the scenes. This is mostly a completely solved issue: no system prompt that I know of has realistically avoided getting revealed in the end (the gpt-thinking models are kind of the final boss in this sport these days, but even they have probably squealed all their secrets, though this can change by the day).
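
For what the regeneration check looks like in practice, here is a minimal sketch. It assumes the OpenAI Python client, `gpt-4o` as the target model, and a hypothetical `EXTRACTION_PROMPT` placeholder (the actual extraction prompt is not shared, for the reason above); none of these specifics come from the thread. The idea is simply: if many fresh-context runs keep producing the byte-identical string, the model is very likely reciting a fixed system prompt rather than hallucinating one.

```python
# Sketch of the fresh-context regeneration check (assumptions: OpenAI
# Python client, gpt-4o, and a placeholder extraction prompt).
from collections import Counter

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

EXTRACTION_PROMPT = "..."  # placeholder: the real extraction method is not shared
N_RUNS = 10

outputs = []
for _ in range(N_RUNS):
    # Fresh context every time: a brand-new message list per request.
    resp = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": EXTRACTION_PROMPT}],
    )
    outputs.append(resp.choices[0].message.content)

most_common, freq = Counter(outputs).most_common(1)[0]
print(f"{freq}/{N_RUNS} runs produced identical output")
# Identical output across many fresh-context runs points to a memorized
# system prompt; divergent outputs are the hallucination signature.
```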

-
Sus