Agent with Human-in-the-Loop (HITL) approval that pauses execution before running sensitive tools (e.g.
create_file) and waits for human review. Simple questions are answered directly without triggering the approval loop.
Built with LangGraph and LangChain.
How it works:
User Input → LLM decides tool → Is it sensitive?
├── No → Execute tool automatically → Return result
└── Yes → PAUSE (interrupt) → Human approves/rejects
├── Approved → Execute tool → Return result
└── Rejected → Return rejection message
- uv — Python package manager
- Podman or Docker — for local container builds (Option A)
- oc — for OpenShift deployment
- Helm — for deploying to Kubernetes/OpenShift
- GNU Make and a bash-compatible shell — on Windows, use WSL (recommended) or Git Bash
make init creates a .env file from .env.example. Set your environment variables in the .env file.
cd agents/langgraph/human_in_the_loop
make initNow you will remove old .venv and create new. Next dependencies will be installed.
make envTracing is optional. If MLflow tracing is required, enable it by uncommenting and setting the following environment variables in the .env file.
MLFLOW_TRACKING_URI="http://localhost:5000"
MLFLOW_EXPERIMENT_NAME="langgraph-hitl-agent"
MLFLOW_HTTP_REQUEST_TIMEOUT=2
MLFLOW_HTTP_REQUEST_MAX_RETRIES=0Then start the MLflow server in a separate terminal:
# Start the MLflow server
uv run --extra tracing mlflow server --port 5000When MLFLOW_TRACKING_URI is set, make run-app and make run-cli will automatically install the tracing dependency.
To enable tracing and logging with MLflow on your OpenShift cluster, add the following environment variables to your .env file:
MLFLOW_TRACKING_URI="https://<openshift-dashboard-url>/mlflow"
MLFLOW_TRACKING_TOKEN="<your-openshift-token>"
MLFLOW_EXPERIMENT_NAME="langgraph-hitl-agent"
MLFLOW_TRACKING_INSECURE_TLS="true"
MLFLOW_WORKSPACE="default"Notes:
-
MLFLOW_TRACKING_URI- URL of your MLflow server. For local development, usehttp://localhost:5000. If using MLflow on an OpenShift cluster, replace<openshift-dashboard-url>with your cluster's data science gateway URL. -
MLFLOW_TRACKING_TOKEN- Required for OpenShift only. Your OpenShift authentication token, obtained from the OpenShift console. -
MLFLOW_EXPERIMENT_NAME- A descriptive name for your experiment (e.g., "LangGraph HITL Demo") -
MLFLOW_TRACKING_INSECURE_TLS- Required for OpenShift only. Set to"true"if your cluster does not use trusted certificates. -
MLFLOW_WORKSPACE- Required for OpenShift only. Project name. -
Tracing is optional; if you do not set
MLFLOW_TRACKING_URI, the application will run without MLflow logging. -
If
MLFLOW_TRACKING_URIis set, the application will attempt to connect to the MLflow server at startup. If the server is unreachable, the application will log a warning and continue running without tracing. -
You can control how long the application waits for the MLflow server by setting
MLFLOW_HEALTH_CHECK_TIMEOUT(in seconds, default:5).
This will install ollama if it is not installed already. Then pull needed models for local work.
The default model is llama3.1:8b. To use a different model, pass MODEL=:
make ollama MODEL=llama3.2:3b
make ollamaKeep this terminal open – the server needs to keep running. You should see output indicating the server started on
http://localhost:8321.
make llama-serverKeep this terminal open – the app needs to keep running. You should see output indicating the app started on
http://localhost:8000.
cd agents/langgraph/human_in_the_loop
make run-app # fails if port is already in use and print steps TO-DOOpen http://localhost:8000 in your browser. A green dot in the header means the agent is connected and ready.
When the agent pauses for approval, an Approve / Reject banner appears directly in the chat.
For terminal-based testing without a browser:
cd agents/langgraph/human_in_the_loop
make run-cliThis launches an interactive prompt where you can pick predefined questions or type your own. Tool calls and results are displayed inline with colored output.
cd agents/langgraph/human_in_the_loop
make initEdit .env with your model endpoint and container image:
API_KEY = your-api-key-here
BASE_URL = https://your-model-endpoint.com/v1
MODEL_ID = llama-3.1-8b-instruct
CONTAINER_IMAGE = quay.io/your-username/langgraph-hitl-agent:latestNotes:
-
API_KEY- your API key or contact your cluster administrator -
BASE_URL- should end with/v1 -
MODEL_ID- model identifier available on your endpoint -
CONTAINER_IMAGE– full image path where the agent container will be pushed and pulled from. The image is built locally, pushed to this registry, and then deployed to OpenShift.Format:
<registry>/<namespace>/<image-name>:<tag>Examples:
- Quay.io:
quay.io/your-username/langgraph-hitl-agent:latest - Docker Hub:
docker.io/your-username/langgraph-hitl-agent:latest - GHCR:
ghcr.io/your-org/langgraph-hitl-agent:latest
Note: OpenShift must be able to pull the container image. Make the image public, or configure an image pull secret for private registries.
- Quay.io:
Login to OC
oc login -u "login" -p "password" https://super-link-to-cluster:111Login ex. Docker
docker login -u='login' -p='password' quay.ioRequires Podman (or Docker) and a registry account (e.g., Quay.io).
make build # builds the image locally
make push # pushes to the registry specified in CONTAINER_IMAGENo Podman, Docker, or registry account needed — just the oc CLI.
make build-openshiftAfter the build completes, set CONTAINER_IMAGE in your .env to the internal registry URL printed after the build.
make dry-run # preview rendered Helm manifests (secrets redacted)make deployAfter deploying, the application may take about a minute to become available while the pod starts up.
The route URL is printed after make deploy. You can also retrieve it manually:
oc get route langgraph-hitl-agent -o jsonpath='{.spec.host}'make undeploySee OpenShift Deployment for more details.
Replace http://localhost:8000 with https://<YOUR_ROUTE_URL> in the examples below.
Step 1: Ask a general question (no approval needed)
curl -X POST https://<YOUR_ROUTE_URL>/chat/completions \
-H "Content-Type: application/json" \
-d '{
"messages": [{"role": "user", "content": "What is RedHat OpenShift Cluster"}],
"stream": false,
"thread_id": "demo-1"
}'Step 2: Ask to write that info into a file (triggers approval)
curl -X POST https://<YOUR_ROUTE_URL>/chat/completions \
-H "Content-Type: application/json" \
-d '{
"messages": [{"role": "user", "content": "Write that information into a file called demo.md"}],
"stream": false,
"thread_id": "demo-1"
}'The agent will pause and return finish_reason: "pending_approval" with the create_file tool call details.
Step 3: Approve the file creation
curl -X POST https://<YOUR_ROUTE_URL>/chat/completions \
-H "Content-Type: application/json" \
-d '{
"messages": [{"role": "user", "content": ""}],
"thread_id": "demo-1",
"approval": "yes"
}'The agent resumes, executes create_file, and returns the final result.
make testNon-streaming:
curl -X POST http://localhost:8000/chat/completions \
-H "Content-Type: application/json" \
-d '{"messages": [{"role": "user", "content": "What is RedHat OpenShift?"}], "stream": false, "thread_id": "demo-1"}'Streaming:
curl -sN -X POST http://localhost:8000/chat/completions \
-H "Content-Type: application/json" \
-d '{"messages": [{"role": "user", "content": "What is RedHat OpenShift?"}], "stream": true, "thread_id": "demo-1"}'curl http://localhost:8000/healthThis agent classifies tools into two categories:
| Category | Tools | Behavior |
|---|---|---|
| Safe | general chat | Responded to directly, no approval needed |
| Sensitive | create_file |
Paused for human review before execution |
When the LLM decides to call a sensitive tool, the agent:
- Pauses execution using LangGraph's
interrupt()mechanism - Returns the pending tool call details with
finish_reason: "pending_approval" - Includes a
thread_idto identify the paused conversation - Waits for a follow-up request with the human's decision
Step 1: Send a message that triggers a sensitive tool
curl -X POST http://localhost:8000/chat/completions \
-H "Content-Type: application/json" \
-d '{
"messages": [{"role": "user", "content": "Create a file named report.md with info about LangChain"}],
"stream": false,
"thread_id": "conversation-1"
}'Response (agent paused, waiting for approval):
{
"choices": [
{
"message": {
"role": "assistant",
"content": "{\"question\": \"Do you approve the following tool call(s)?\", \"tool_calls\": [\"Tool: create_file, Args: {...}\"], \"options\": [\"yes\", \"no\"]}"
},
"finish_reason": "pending_approval"
}
],
"thread_id": "conversation-1"
}Step 2: Approve or reject the tool call
# Approve
curl -X POST http://localhost:8000/chat/completions \
-H "Content-Type: application/json" \
-d '{
"messages": [{"role": "user", "content": ""}],
"thread_id": "conversation-1",
"approval": "yes"
}'
# Or reject
curl -X POST http://localhost:8000/chat/completions \
-H "Content-Type: application/json" \
-d '{
"messages": [{"role": "user", "content": ""}],
"thread_id": "conversation-1",
"approval": "no"
}'Each conversation requires a thread_id for HITL to work:
- The
thread_ididentifies the paused graph state - You must use the same
thread_idwhen sending the approval - State is stored in-memory using LangGraph's
MemorySavercheckpointer - State does not persist across server restarts (use a database checkpointer for production)
Edit src/human_in_the_loop/agent.py to add more sensitive tools to the interrupt list:
hitl_middleware = HumanInTheLoopMiddleware(
interrupt_on={
"create_file": True,
"delete_record": True,
},
)Edit src/human_in_the_loop/tools.py to add new tools:
@tool("delete_record", parse_docstring=True)
def delete_record(record_id: str) -> str:
"""Delete a record from the database. Requires human approval."""
# Implementation hereThis agent combines three key components:
- LangGraph StateGraph: Custom workflow with conditional routing for safe vs sensitive tools
- LangGraph Interrupts:
interrupt()pauses execution;Command(resume=...)resumes it - ChatOpenAI: OpenAI-compatible LLM client (connects to Llama-stack or OpenAI)
User Input → Agent Node (LLM) → Route Decision
├── No tools → END
├── Safe tool → Tool Node → Agent Node (loop)
└── Sensitive tool → Human Approval Node
├── interrupt() → PAUSE
├── resume("yes") → Tool Node → Agent Node → END
└── resume("no") → Rejection Message → END
Error: "No user message found in messages list"
- Solution: Ensure your request includes at least one message with
"role": "user"
Approval request returns error
- Solution: Use the same
thread_idfrom the pending approval response - The graph state must exist in the checkpointer for resume to work
State lost after server restart
- The default
MemorySaveris in-memory only - For production, use
PostgresSaver(seereact_with_database_memoryagent for reference)