You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- **Proxy**: Routes via Squid at `http://172.30.0.10:3128`
315
315
316
316
### Health check
@@ -321,6 +321,46 @@ Docker healthcheck on the `/health` endpoint (port 10000):
321
321
- **Retries**: 5
322
322
- **Start period**: 2s
323
323
324
+
The `/health` endpoint returns a JSON object that includes a `models_fetch_complete` field, indicating whether the startup model-discovery pass has finished:
Use `models_fetch_complete` as a readiness gate before submitting the first inference request, ensuring model lists are warm. See the [Readiness polling](#readiness-polling) recipe below.
340
+
341
+
### Readiness polling
342
+
343
+
Poll `/health` (or `/reflect`) until `models_fetch_complete: true` before launching the agent command, so model lists are fully cached:
344
+
345
+
```bash
346
+
# Wait up to 30 seconds for model discovery to complete
The management port (10000) also exposes a `GET /reflect` endpoint for dynamic provider and model discovery. This allows agent harnesses to query which providers are configured and which models are available at runtime.
0 commit comments