fix: robust JSON extraction for mixed LLM responses#124

Open
SergioChan wants to merge 1 commit into 666ghj:main from SergioChan:fix/issue-64
Conversation

@SergioChan
## Summary

Harden backend JSON parsing for LLM responses so mixed outputs (markdown fences, pre/post text) are handled more robustly, reducing the 500 errors reported during ontology generation.

## Changes

- Updated `LLMClient.chat()` to remove `<think ...>...</think>` tags case-insensitively
- Added `LLMClient._extract_json_payload()` to normalize and extract JSON from noisy model responses
- Updated `chat_json()` to parse the extracted payload instead of the raw content
- Added unit tests in `backend/tests/test_llm_client_json_extract.py` for fenced JSON and mixed-text extraction

## Testing

- Added targeted unit tests for extraction behavior
- Could not execute the tests in this environment because the backend dev dependencies (pytest, flask) are not installed

Fixes #64
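The extraction strategy described above can be sketched as a standalone function. This is a minimal illustration, not the PR's actual implementation: the function name mirrors the described `_extract_json_payload()`, but the regexes and the balanced-brace fallback are assumptions about how such a normalizer is typically written.

```python
import json
import re

def extract_json_payload(content: str) -> str:
    """Pull a JSON payload out of a noisy LLM response.

    Handles <think ...>...</think> reasoning tags, markdown code
    fences, and prose before/after the JSON object or array.
    (Sketch only: brace counting ignores braces inside JSON strings.)
    """
    # Drop <think ...>...</think> blocks, case-insensitively.
    content = re.sub(r"<think\b[^>]*>.*?</think>", "", content,
                     flags=re.IGNORECASE | re.DOTALL)

    # Prefer the body of a fenced ```json ... ``` (or bare ```) block.
    fence = re.search(r"```(?:json)?\s*(.*?)```", content, flags=re.DOTALL)
    if fence:
        content = fence.group(1)

    # Fall back to the first balanced {...} or [...] span.
    start = min((i for i in (content.find("{"), content.find("["))
                 if i != -1), default=-1)
    if start != -1:
        opener = content[start]
        closer = "}" if opener == "{" else "]"
        depth = 0
        for i, ch in enumerate(content[start:], start):
            if ch == opener:
                depth += 1
            elif ch == closer:
                depth -= 1
                if depth == 0:
                    return content[start:i + 1]
    return content.strip()
```

A `chat_json()`-style caller would then run `json.loads()` on the extracted payload instead of the raw model output, so pre/post text and fences no longer trigger parse failures (and 500s).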

@dosubot added the `size:M` (This PR changes 30-99 lines, ignoring generated files) and `LLM API` (Any questions regarding the LLM API) labels on Mar 10, 2026