frousselet
diff --git a/‎.env.example‎
Lines changed: 4 additions & 0 deletions b/‎.env.example‎
Lines changed: 4 additions & 0 deletions
diff --git a/‎CHANGELOG.md‎
Lines changed: 2 additions & 1 deletion b/‎CHANGELOG.md‎
Lines changed: 2 additions & 1 deletion
diff --git a/‎README.md‎
Lines changed: 2 additions & 2 deletions b/‎README.md‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎assistant/providers/anthropic.py‎
Lines changed: 184 additions & 0 deletions b/‎assistant/providers/anthropic.py‎
Lines changed: 184 additions & 0 deletions
diff --git a/‎assistant/providers/base.py‎
Lines changed: 7 additions & 2 deletions b/‎assistant/providers/base.py‎
Lines changed: 7 additions & 2 deletions
diff --git a/‎assistant/tests/test_anthropic.py‎
Lines changed: 138 additions & 0 deletions b/‎assistant/tests/test_anthropic.py‎
Lines changed: 138 additions & 0 deletions
@@ -29,6 +29,10 @@ POSTGRES_PORT=5432
 # AI_ASSISTANT_API_KEY=your-openai-api-key
 # AI_ASSISTANT_MODEL=gpt-4o-mini
 # AI_ASSISTANT_BASE_URL=https://api.openai.com/v1   # default; set for a custom gateway
+# Claude (Anthropic, native Messages API; no embeddings, so no semantic search):
+# AI_ASSISTANT_PROVIDER=anthropic
+# AI_ASSISTANT_API_KEY=your-anthropic-api-key
+# AI_ASSISTANT_MODEL=claude-opus-4-8
 # Self-hosted alternative (local LLM, no data egress): point at your own Ollama:
 # AI_ASSISTANT_PROVIDER=ollama
 # AI_ASSISTANT_OLLAMA_URL=http://host.docker.internal:11434
 
@@ -9,7 +9,8 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 
 ### Added
 
-- **Ask Cairn: OpenAI and OpenAI-compatible providers**: the assistant gains an `openai` backend (`AI_ASSISTANT_PROVIDER=openai`) that targets OpenAI (ChatGPT, e.g. `gpt-4o-mini`) out of the box and, via `AI_ASSISTANT_BASE_URL`, any other endpoint implementing the OpenAI `/chat/completions` and `/embeddings` API (vLLM, LiteLLM, LocalAI, Together, Groq...). The shared request/response handling was extracted into a generic `OpenAICompatibleClient`; the existing `MistralClient` is now a thin subclass of it (Mistral already exposes an OpenAI-compatible API), so behaviour is unchanged for Mistral users. `AI_ASSISTANT_BASE_URL` now defaults to empty and each provider falls back to its own endpoint (`mistral` -> `api.mistral.ai`, `openai` -> `api.openai.com`); set it only to target a custom gateway.
+- **Ask Cairn: OpenAI and OpenAI-compatible providers**: the assistant gains an `openai` backend (`AI_ASSISTANT_PROVIDER=openai`) that targets OpenAI (ChatGPT, e.g. `gpt-4o-mini`) out of the box and, via `AI_ASSISTANT_BASE_URL`, any other endpoint implementing the OpenAI `/chat/completions` and `/embeddings` API (vLLM, LiteLLM, LocalAI, Together, Groq...). The shared request/response handling was extracted into a generic `OpenAICompatibleClient`; the existing `MistralClient` is now a thin subclass of it (Mistral already exposes an OpenAI-compatible API), so behaviour is unchanged for Mistral users. `AI_ASSISTANT_BASE_URL` now defaults to empty and each provider falls back to its own endpoint (`mistral` -> `api.mistral.ai`, `openai` -> `api.openai.com`, `anthropic` -> `api.anthropic.com`); set it only to target a custom gateway.
+- **Ask Cairn: Claude (Anthropic) provider**: a native `anthropic` backend (`AI_ASSISTANT_PROVIDER=anthropic`) talks to Claude through the Messages API (`POST /v1/messages`, `x-api-key` header, top-level `system`, `content` block list) - Claude is not OpenAI-compatible, so it has its own client. Routing uses forced tool use (a `plan` tool whose `input_schema` is the routing schema) and no `temperature`/`thinking` is sent (both 400 on the current Opus family). Set `AI_ASSISTANT_MODEL` to a Claude model id (e.g. `claude-opus-4-8`). Semantic search is not available with this provider, since Anthropic has no embeddings API.
 
 ## [0.27.1] - 2026-06-14
 
 
@@ -13,7 +13,7 @@ Manage your organisation's security posture, track compliance with regulatory fr
 - **Risks** : ISO 27005 and EBIOS RM (ANSSI v1.5, workshops 0 to 5) assessments, threat and vulnerability catalogs, treatment plans and formal risk acceptance
 - **Compliance** : frameworks, requirements, assessments, findings, action plans and inter-framework mappings, with Excel import
 - **Steering** : real-time dashboard, KPI indicators, ISO 27001 management reviews, and PDF/DOCX/PPTX report generation (SoA, audit report, risk register, meeting minutes)
-- **Ask Cairn (optional)** : natural-language questions in the command palette ("Which decisions were made at the last management review?"), answered by a pluggable LLM provider (Mistral AI by default; OpenAI / any OpenAI-compatible endpoint; self-hosted Ollama) that cites real records and enforces your permissions, with thumbs up/down feedback that admins can export to improve the assistant
+- **Ask Cairn (optional)** : natural-language questions in the command palette ("Which decisions were made at the last management review?"), answered by a pluggable LLM provider (Mistral AI by default; OpenAI / any OpenAI-compatible endpoint; Claude; self-hosted Ollama) that cites real records and enforces your permissions, with thumbs up/down feedback that admins can export to improve the assistant
 
 Everything is bilingual (English/French), audit-ready (full change history, versioning, lifecycle workflows with approvals) and access-controlled (role-based permissions, scope-based tenancy, passkey login).
 
@@ -47,7 +47,7 @@ To run the published image without cloning the repository, and for production no
 
 ## Tech stack
 
-Django 5.2 LTS, PostgreSQL 16, Django REST Framework, Django Channels + Redis (real-time), Bootstrap 5.3 + HTMX (frontend), Docker. Optional: Mistral AI, OpenAI / OpenAI-compatible endpoints, or self-hosted Ollama (Ask Cairn assistant).
+Django 5.2 LTS, PostgreSQL 16, Django REST Framework, Django Channels + Redis (real-time), Bootstrap 5.3 + HTMX (frontend), Docker. Optional: Mistral AI, OpenAI / OpenAI-compatible endpoints, Claude (Anthropic), or self-hosted Ollama (Ask Cairn assistant).
 
 ## Licence
 
 
@@ -0,0 +1,184 @@
+"""Anthropic (Claude) backend for the assistant (native Messages API).
+
+Claude is NOT OpenAI-compatible : it uses ``POST /v1/messages`` with an
+``x-api-key`` header, a top-level ``system`` parameter, and a ``content`` block
+list in the response. It therefore needs its own client rather than the shared
+``OpenAICompatibleClient``.
+
+Two operations are implemented: a chat completion constrained to a JSON Schema
+for tool routing (done with forced tool use, the reliable structured-output
+path on Claude) and a plain-text chat completion for the final summary
+sentence. Embeddings are not provided : Anthropic has no embeddings endpoint,
+so semantic search must use another provider (see ``embed``).
+
+Only the calling user's question and the compact, identifier-stripped record
+fields produced by the read-only catalog tools leave the platform; ids and
+UUIDs are scrubbed before the summary call (see ``engine._strip_identifiers``).
+"""
+
+import logging
+
+import httpx
+from django.conf import settings
+
+from assistant.providers.base import (
+    BaseClient,
+    MalformedModelOutput,
+    ModelNotAvailable,
+    ServiceUnreachable,
+)
+
+logger = logging.getLogger(__name__)
+
+
+class AnthropicClient(BaseClient):
+    PROVIDER_LABEL = "Claude"
+    # Applied when neither the constructor argument nor
+    # ``settings.AI_ASSISTANT_BASE_URL`` is set. The Messages endpoint is
+    # ``{base_url}/messages`` (so the default resolves to
+    # ``https://api.anthropic.com/v1/messages``).
+    DEFAULT_BASE_URL = "https://api.anthropic.com/v1"
+    # Pinned API version sent on every request (Anthropic requirement).
+    ANTHROPIC_VERSION = "2023-06-01"
+    # Name of the synthetic tool used to force structured routing output.
+    PLAN_TOOL_NAME = "plan"
+
+    def __init__(self, base_url=None, model=None, api_key=None):
+        self.base_url = (
+            base_url or settings.AI_ASSISTANT_BASE_URL or self.DEFAULT_BASE_URL
+        ).rstrip("/")
+        self.model = model or settings.AI_ASSISTANT_MODEL
+        self.api_key = api_key if api_key is not None else settings.AI_ASSISTANT_API_KEY
+        self.timeout = httpx.Timeout(
+            settings.AI_ASSISTANT_TIMEOUT,
+            connect=settings.AI_ASSISTANT_CONNECT_TIMEOUT,
+        )
+
+    def _headers(self):
+        if not self.api_key:
+            raise ServiceUnreachable(
+                f"{self.PROVIDER_LABEL} API key is not configured "
+                "(set AI_ASSISTANT_API_KEY)."
+            )
+        return {
+            "x-api-key": self.api_key,
+            "anthropic-version": self.ANTHROPIC_VERSION,
+            "content-type": "application/json",
+        }
+
+    @staticmethod
+    def _split_system(messages):
+        """Split OpenAI-style messages into Claude's (system, messages) shape.
+
+        Claude takes the system prompt as a top-level parameter, not as a
+        message with ``role: "system"``; user/assistant turns stay in
+        ``messages``.
+        """
+        system_parts = []
+        chat = []
+        for message in messages:
+            role = message.get("role")
+            content = message.get("content", "")
+            if role == "system":
+                if content:
+                    system_parts.append(content)
+            else:
+                chat.append({"role": role, "content": content})
+        return "\n\n".join(system_parts), chat
+
+    def _base_payload(self, messages):
+        system, chat = self._split_system(messages)
+        # No temperature / thinking: both are rejected (HTTP 400) on the
+        # current Opus family, which is the default model.
+        payload = {
+            "model": self.model,
+            "max_tokens": settings.AI_ASSISTANT_MAX_TOKENS,
+            "messages": chat,
+        }
+        if system:
+            payload["system"] = system
+        return payload
+
+    def _post(self, payload):
+        try:
+            return httpx.post(
+                f"{self.base_url}/messages",
+                json=payload,
+                headers=self._headers(),
+                timeout=self.timeout,
+            )
+        except (httpx.ConnectError, httpx.TimeoutException) as exc:
+            raise ServiceUnreachable(str(exc)) from exc
+        except httpx.HTTPError as exc:
+            raise ServiceUnreachable(str(exc)) from exc
+
+    def _raise_for_status(self, resp):
+        if resp.status_code in (401, 403):
+            # Never surface the key or auth detail to the caller.
+            logger.error(
+                "%s authentication failed (HTTP %s)",
+                self.PROVIDER_LABEL,
+                resp.status_code,
+            )
+            raise ServiceUnreachable("authentication failed")
+        if resp.status_code == 404:
+            raise ModelNotAvailable(self.model)
+        if resp.status_code >= 400:
+            raise ServiceUnreachable(f"HTTP {resp.status_code}: {resp.text[:200]}")
+
+    def _content_blocks(self, resp):
+        try:
+            blocks = resp.json()["content"]
+        except (KeyError, TypeError, ValueError) as exc:
+            raise MalformedModelOutput(resp.text[:200]) from exc
+        if not isinstance(blocks, list):
+            raise MalformedModelOutput(resp.text[:200])
+        return blocks
+
+    def chat_json(self, messages, json_schema, think=None):
+        """Chat completion constrained to ``json_schema``; returns the parsed object.
+
+        Uses forced tool use : a single ``plan`` tool whose ``input_schema`` is
+        the routing schema, with ``tool_choice`` pinned to it. The model must
+        emit a ``tool_use`` block whose ``input`` is the structured plan. The
+        plan schema keeps a free-form ``arguments`` object, which Claude tool
+        input schemas accept; server-side validation in the engine is the real
+        safety net.
+        """
+        payload = self._base_payload(messages)
+        payload["tools"] = [
+            {
+                "name": self.PLAN_TOOL_NAME,
+                "description": "Return the execution plan for the question.",
+                "input_schema": json_schema,
+            }
+        ]
+        payload["tool_choice"] = {"type": "tool", "name": self.PLAN_TOOL_NAME}
+        resp = self._post(payload)
+        self._raise_for_status(resp)
+        for block in self._content_blocks(resp):
+            if block.get("type") == "tool_use" and block.get("name") == self.PLAN_TOOL_NAME:
+                parsed = block.get("input")
+                if not isinstance(parsed, dict):
+                    raise MalformedModelOutput(str(parsed)[:200])
+                return parsed
+        raise MalformedModelOutput(resp.text[:200])
+
+    def chat_text(self, messages):
+        """Plain-text chat completion."""
+        resp = self._post(self._base_payload(messages))
+        self._raise_for_status(resp)
+        text = "".join(
+            block.get("text", "")
+            for block in self._content_blocks(resp)
+            if block.get("type") == "text"
+        )
+        return text.strip()
+
+    def embed(self, texts):
+        """Embeddings are unsupported : Anthropic has no embeddings endpoint."""
+        raise ServiceUnreachable(
+            "The Claude provider does not support embeddings (Anthropic has no "
+            "embeddings API). Disable AI_ASSISTANT_SEMANTIC_ENABLED, or set "
+            "AI_ASSISTANT_PROVIDER to a provider with embeddings for indexing."
+        )
@@ -55,8 +55,9 @@ def get_client():
     ``mistral`` (third-party API) is the default. ``openai`` covers OpenAI
     (ChatGPT) and any other OpenAI-compatible endpoint selected through
     ``AI_ASSISTANT_BASE_URL`` (vLLM, LiteLLM, LocalAI, Together, Groq...).
-    ``ollama`` (self-hosted local LLM) remains selectable for those who point
-    it at their own instance.
+    ``anthropic`` targets Claude through the native Messages API. ``ollama``
+    (self-hosted local LLM) remains selectable for those who point it at their
+    own instance.
     """
     provider = (settings.AI_ASSISTANT_PROVIDER or "mistral").lower()
     if provider == "ollama":
@@ -71,4 +72,8 @@ def get_client():
         from assistant.providers.openai_compatible import OpenAICompatibleClient
 
         return OpenAICompatibleClient()
+    if provider in ("anthropic", "claude"):
+        from assistant.providers.anthropic import AnthropicClient
+
+        return AnthropicClient()
     raise ServiceUnreachable(f"Unknown AI assistant provider: {provider!r}")
@@ -0,0 +1,138 @@
+"""Unit tests for the Anthropic (Claude) provider client (no real sockets)."""
+
+import json
+
+import httpx
+import pytest
+from django.test import override_settings
+
+from assistant.providers.anthropic import AnthropicClient
+from assistant.providers.base import (
+    MalformedModelOutput,
+    ModelNotAvailable,
+    ServiceUnreachable,
+)
+
+
+class FakeResponse:
+    def __init__(self, status_code=200, payload=None, text=""):
+        self.status_code = status_code
+        self._payload = payload if payload is not None else {}
+        self.text = text or json.dumps(self._payload)
+
+    def json(self):
+        return self._payload
+
+
+def _patch_post(monkeypatch, responses):
+    calls = []
+
+    def fake_post(url, json=None, headers=None, timeout=None):
+        calls.append({"url": url, "payload": dict(json), "headers": dict(headers or {})})
+        item = responses.pop(0)
+        if isinstance(item, Exception):
+            raise item
+        return item
+
+    monkeypatch.setattr(httpx, "post", fake_post)
+    return calls
+
+
+def _client():
+    return AnthropicClient(
+        base_url="https://api.anthropic.com/v1",
+        model="claude-opus-4-8",
+        api_key="sk-ant-test",
+    )
+
+
+def test_chat_json_uses_forced_tool_and_returns_input(monkeypatch):
+    payload = {"content": [{"type": "tool_use", "name": "plan", "input": {"steps": []}}]}
+    calls = _patch_post(monkeypatch, [FakeResponse(payload=payload)])
+    result = _client().chat_json(
+        [
+            {"role": "system", "content": "route this"},
+            {"role": "user", "content": "hi"},
+        ],
+        {"type": "object"},
+    )
+    assert result == {"steps": []}
+    call = calls[0]
+    assert call["url"].endswith("/messages")
+    # Native auth headers, not Bearer.
+    assert call["headers"]["x-api-key"] == "sk-ant-test"
+    assert call["headers"]["anthropic-version"] == "2023-06-01"
+    body = call["payload"]
+    assert body["model"] == "claude-opus-4-8"
+    assert body["max_tokens"] >= 1
+    # System prompt is hoisted to the top-level field, not a message.
+    assert body["system"] == "route this"
+    assert body["messages"] == [{"role": "user", "content": "hi"}]
+    # Sampling params / thinking must NOT be sent (400 on the Opus family).
+    assert "temperature" not in body
+    assert "thinking" not in body
+    # Structured output via forced tool use.
+    assert body["tools"][0]["name"] == "plan"
+    assert body["tools"][0]["input_schema"] == {"type": "object"}
+    assert body["tool_choice"] == {"type": "tool", "name": "plan"}
+
+
+def test_chat_json_without_tool_use_raises_malformed(monkeypatch):
+    payload = {"content": [{"type": "text", "text": "no plan here"}]}
+    _patch_post(monkeypatch, [FakeResponse(payload=payload)])
+    with pytest.raises(MalformedModelOutput):
+        _client().chat_json([{"role": "user", "content": "hi"}], {"type": "object"})
+
+
+def test_chat_text_concatenates_text_blocks(monkeypatch):
+    payload = {"content": [{"type": "text", "text": "  Two open "}, {"type": "text", "text": "risks.  "}]}
+    _patch_post(monkeypatch, [FakeResponse(payload=payload)])
+    assert _client().chat_text([{"role": "user", "content": "hi"}]) == "Two open risks."
+
+
+@override_settings(AI_ASSISTANT_BASE_URL="", AI_ASSISTANT_MODEL="claude-opus-4-8")
+def test_defaults_to_anthropic_endpoint(monkeypatch):
+    payload = {"content": [{"type": "text", "text": "ok"}]}
+    calls = _patch_post(monkeypatch, [FakeResponse(payload=payload)])
+    AnthropicClient(api_key="sk-ant-test").chat_text([{"role": "user", "content": "hi"}])
+    assert calls[0]["url"] == "https://api.anthropic.com/v1/messages"
+
+
+def test_missing_api_key_raises_clear_error(monkeypatch):
+    def boom(*a, **k):
+        raise AssertionError("must not hit the network without an API key")
+
+    monkeypatch.setattr(httpx, "post", boom)
+    client = AnthropicClient(base_url="https://api.anthropic.com/v1", model="m", api_key="")
+    with pytest.raises(ServiceUnreachable) as exc:
+        client.chat_text([{"role": "user", "content": "hi"}])
+    assert "API key" in str(exc.value)
+
+
+def test_auth_error_maps_to_unreachable_without_leaking(monkeypatch):
+    _patch_post(monkeypatch, [FakeResponse(status_code=401, text="invalid x-api-key")])
+    with pytest.raises(ServiceUnreachable) as exc:
+        _client().chat_text([{"role": "user", "content": "hi"}])
+    assert "sk-ant-test" not in str(exc.value)
+
+
+def test_unknown_model_maps_to_model_not_available(monkeypatch):
+    _patch_post(monkeypatch, [FakeResponse(status_code=404, text="model not found")])
+    with pytest.raises(ModelNotAvailable):
+        _client().chat_text([{"role": "user", "content": "hi"}])
+
+
+def test_connect_error_maps_to_unreachable(monkeypatch):
+    _patch_post(monkeypatch, [httpx.ConnectError("refused")])
+    with pytest.raises(ServiceUnreachable):
+        _client().chat_text([{"role": "user", "content": "hi"}])
+
+
+def test_embed_is_unsupported(monkeypatch):
+    def boom(*a, **k):
+        raise AssertionError("embed must not hit the network")
+
+    monkeypatch.setattr(httpx, "post", boom)
+    with pytest.raises(ServiceUnreachable) as exc:
+        _client().embed(["x"])
+    assert "embeddings" in str(exc.value).lower()