Skip to content

docs: add a Troubleshooting how-to guide #18

@Abdul-Moiz31

Description

@Abdul-Moiz31

Summary

The docs currently have no troubleshooting / common-errors page. Every existing How-to Guide (Docker Compose, Kubernetes, Hybrid Gateway) walks a forward path — install, configure, expose — but nothing helps a user who hits an error along the way.

For a product whose setup spans CLI auth, OAuth, OIDC, hybrid gateways, secrets and elicitation flows, this is a real friction point.

Why this matters

A few concrete places in the existing docs where users will predictably get stuck without a page to land on:

  • reshapr login callback port. login opens the browser at http://localhost:5556 (getting-started.md L83). If that port is in use, or the browser blocks the callback, the docs don't say what the failure looks like or how to recover.
  • Linux + host.docker.internal. docker-compose.md L141-154 uses --add-host=host.docker.internal:host-gateway but never explains why — Linux users who omit the flag will see the proxy fail to reach the control plane and have no signpost to follow.
  • Default local creds. docker-compose.md L78 gives admin / password as the local defaults — easy to miss when scanning, and there's no page that consolidates "first-run gotchas."
  • Hybrid gateway registration. deploy-hybrid-gateway.md L34-49 details the registration → health-check → streaming lifecycle, but doesn't say what failure modes look like on the gateway side or in the control plane logs.
  • OAuth setup. config create-oauth (cli-commands.md L519-535) has many moving parts (oauth2ClientID, JWKS URI, scopes). The docs don't describe how to validate the setup or what a misconfigured token rejection looks like.

Proposal

Add a new page at docs/how-to-guides/troubleshooting.md, wired into sidebars.js under How-to Guides.

The structure I have in mind (open to your shaping):

  • CLI & authentication — login callback, expired tokens, wrong server URL
  • Local Docker Compose — Linux host.docker.internal, port conflicts, default credentials, container restarts
  • Hybrid gateways — registration failures, health-check timeouts, API token issues
  • Security & secrets — OAuth misconfiguration, elicitation flows, backend secret resolution
  • Each entry follows a tight pattern: SymptomLikely causeFixWhere to look (log line / command)

Where I need your input

I'd rather not seed this page with my guesses — for it to be useful, it needs to reflect the failures you actually see from real users. Could you share:

  1. The top 3–5 setup failures you see on try.reshapr.io or in support channels?
  2. Any error messages or log lines that you wish users would search for and land on a doc?
  3. Whether there's anything you'd explicitly not want documented (private endpoints, internal-only flows, etc.)?

Once I have that, I'm happy to open a PR with a structured first version that you can extend.

Out of scope for this issue

  • Detailed runbooks for production incidents (different audience — operations rather than first-time users).
  • Anything that needs server-side changes (e.g. adding a --debug flag to the CLI).

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentationenhancementNew feature or requesthelp wantedExtra attention is neededquestionFurther information is requested

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions