ADR 003: Detector API design

This ADR documents the design and decisions for the detector APIs published and integrated into the orchestrator. It will also serve as the basis for expanding or enhancing the detector APIs in the future.

The detector API can be found at this GitHub page.

Motivation

This orchestrator is designed to work with various detectors. In the realm of guardrails and trustworthy AI, there can be different types of detectors for different use cases. From the orchestrator's perspective, we want the ability to work with many detectors and provide coherent interfaces to users. Thus, there is a need for common API definitions that individual detectors can expose (based on their use case) and that the orchestrator can consume without many changes.

Decisions

We will have multiple detector APIs, divided primarily by input requirements and secondarily by use case. In designing the APIs, we will also use the common nomenclature for these inputs and use cases that other open-source projects use in the context of generative AI.

Based on the input requirements, the APIs will be divided into the following parts:

Text Analysis: This covers detectors that accept a single text as input; the text can come from either the user or the LLM.
Generation Analysis: This covers detectors that need to work on the input prompt and the generated text in combination to produce a single result.
Chat Analysis: This covers detectors that work on chat history.
Context Analysis: This covers detectors that require the context of a prompt, in the form of URLs or documents.

Nomenclature

/text in the endpoint indicates the modality of the input.
Content / Contents: This specifies any arbitrary input; for the /text/contents endpoint, contents denotes text input. The rationale behind this name:
1. It is generic, i.e. not too specific, so the input can be either a prompt or LLM-generated text.
2. It does not imply a correlated expectation in the output. For example, if the name were input, one could expect a field named output in the response.
3. It lines up with how text is referred to in some other open-source APIs, especially for chat.
detector_id: An identifier for a deployment, service, or model of the detector. It is a way to distinguish one detector from another.
context: The list of context items of the given context_type, in string form, which lets the user pass context on to detectors.
context_type: The type of context provided in the API. It can be one of url or doc.
detection (in response object): Name of the detection, such as EmailAddress.
detection_type: Type of the detection, such as HAP or PII.
score: Score returned by the detector. It can be a confidence, a probability, etc.
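As an illustration of how these names might fit together, here is a minimal Python sketch. The field names (detection, detection_type, score, context, context_type) and the url / doc values come from the nomenclature above; the class names Detection, ContextType, and ContextInput are hypothetical and are not defined by this ADR, and the sketch does not assume how detector_id is transmitted.

```python
from dataclasses import dataclass, asdict
from enum import Enum
from typing import List


class ContextType(str, Enum):
    """Allowed context_type values listed in the nomenclature."""
    URL = "url"
    DOC = "doc"


@dataclass
class Detection:
    """One detection in a response object (hypothetical class name)."""
    detection: str       # name of the detection, e.g. "EmailAddress"
    detection_type: str  # type of the detection, e.g. "PII" or "HAP"
    score: float         # confidence, probability, etc.


@dataclass
class ContextInput:
    """Hypothetical grouping of the two context-related fields."""
    context_type: ContextType
    context: List[str]   # context items passed in string form


example = Detection(detection="EmailAddress", detection_type="PII", score=0.98)
ctx = ContextInput(context_type=ContextType.DOC, context=["A reference document."])

print(asdict(example))
print(ctx.context_type.value, len(ctx.context))
```

The actual schema is whatever the published detector API defines; this sketch only shows the relationship between the terms.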

Endpoints

/api/v1/text/contents - Text Analysis
- Provides detection computation on contents (a list of strings).
- A list of strings is accepted instead of a single string (content) to allow batch processing by detectors, which can often be more optimized than processing inputs one at a time.
- The responses in the output of this endpoint must be in the same order as the input contents. If there are no detections for an input, the detector should return an empty list [] for that input.
/api/v1/text/generation - Generation Analysis
- Provides detection computation on both the prompt and the generated text.
/api/v1/text/context/chat - Chat Analysis
- Provides detection computation on chat history.
/api/v1/text/context/doc - Context Analysis
- Provides detection computation on content with respect to the context provided in the input.

Each of the above endpoints will also provide an "evidence" block, which will allow future integration of evidence for their responses.
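The ordering rule for /api/v1/text/contents can be sketched as follows. This is a toy, rule-based stand-in for a detector, not the orchestrator's or any real detector's implementation: the function name detect_contents and the email regular expression are assumptions made for illustration, and the response is modeled as plain dictionaries using the field names from the nomenclature.

```python
import re
from typing import Dict, List

# Toy pattern for illustration only; a real detector would typically use a
# model or a far more careful rule set.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


def detect_contents(contents: List[str]) -> List[List[Dict[str, object]]]:
    """Return one list of detections per input string, in input order."""
    results = []
    for text in contents:
        detections = [
            {
                "detection": "EmailAddress",
                "detection_type": "PII",
                "score": 1.0,  # placeholder score for a rule-based match
            }
            for _ in EMAIL_RE.finditer(text)
        ]
        # Append even when empty so positions line up with the inputs.
        results.append(detections)
    return results


batch = ["Contact me at jane@example.com", "Nothing sensitive here"]
responses = detect_contents(batch)
for text, detections in zip(batch, responses):
    print(repr(text), "->", detections)
```

Because every input gets an entry, even an empty one, a caller can pair inputs and responses by position, which is what makes batching safe.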

Consequences

To support all of these endpoints, the orchestrator will need to integrate them into the detector client.
For the contents API, the already implemented orchestrator workflow will need to be modified to use the new API.
Each detector developer will need to know about the new detector APIs and their design.

Status

Accepted
