This repository contains two complementary applications for working with Pega AI agents:
- Pega Agent Inspector: inspect agent configuration and chat with agents interactively.
- DeepEval Pega: run repeatable quality and regression evaluations against those agents.
Together, they support a practical workflow:
- Explore and validate an agent interactively.
- Capture golden conversations.
- Run automated regression and quality checks over time.
- Install prerequisites: Python 3.11+ and Node.js 18+.
- Start the inspector app:
cd apps/pega-agent-inspector
npm install
npm start
- In another terminal, start the evaluation app:
cd apps/deepeval-pega
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
cp .env.example .env
python create_db.py
reflex run
- Open both apps in your browser:
  - Inspector: http://localhost:3002
  - DeepEval UI: http://localhost:3000
- Use Inspector to validate agent behavior, then run a regression in DeepEval with a golden dataset.
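Before opening the browser, it can help to confirm that both apps are actually listening. The following is a minimal sketch using only the Python standard library. The helper names `is_listening` and `report` are illustrative and not part of either app; the ports are the defaults quoted above.

```python
import socket

# Default ports quoted in this README; adjust if you changed them locally.
DEFAULT_APPS = {
    "Inspector": ("localhost", 3002),
    "DeepEval UI": ("localhost", 3000),
}


def is_listening(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution failures.
        return False


def report(apps: dict[str, tuple[str, int]] = DEFAULT_APPS) -> dict[str, bool]:
    """Map each app name to whether its port accepts connections."""
    return {name: is_listening(host, port) for name, (host, port) in apps.items()}
```

Calling `report()` returns a dictionary such as `{"Inspector": True, "DeepEval UI": False}`, which shows which app still needs to be started. A successful TCP connection only proves that something is bound to the port, not that it is the expected app.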
apps/
  deepeval-pega/
  pega-agent-inspector/
docs/
scenarios/
scripts/
Location: apps/deepeval-pega
DeepEval Pega is a project-agnostic evaluation framework for Pega AI agents. It uses DeepEval plus a pluggable LLM judge (Gemini, Bedrock, OpenAI, or GitHub Copilot) to score conversation quality and detect regressions.
- Reflex web app for:
  - project configuration management
  - golden dataset creation and management
  - metric selection and evaluation runs
  - REST API management
- FastAPI REST API for programmatic access to:
  - projects
  - datasets
  - evaluations
  - LLM profiles
- Golden-session capture and replay tooling
- 13 automated conversational/regression tests (quality, latency, tool usage, case lifecycle, bias/toxicity, and more)
- SQLite result persistence and an MCP server for querying QA outcomes
- Python (3.11+)
- Reflex
- FastAPI
- DeepEval
- pytest
- SQLite
cd apps/deepeval-pega
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
cp .env.example .env
python create_db.py
reflex run
Then open:
- UI: http://localhost:3000
- Reflex backend: http://localhost:8000 (default)
For API use:
python run_api.py
Open API docs at http://localhost:8100/docs.
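To see which REST operations the API exposes without opening the browser, you can read its OpenAPI schema. This sketch assumes the API keeps FastAPI's default schema route (`/openapi.json`) alongside `/docs`; if the app overrides that route, the URL below will differ. The function names are illustrative.

```python
import json
from urllib.request import urlopen

# Assumption: FastAPI's default schema route is served on the API port.
OPENAPI_URL = "http://localhost:8100/openapi.json"

HTTP_METHODS = {"get", "put", "post", "delete", "patch", "options", "head", "trace"}


def fetch_schema(url: str = OPENAPI_URL, timeout: float = 5.0) -> dict:
    """Download and parse the OpenAPI document from a running API."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


def list_operations(schema: dict) -> list[str]:
    """Return sorted 'METHOD /path' strings found in an OpenAPI schema."""
    operations = []
    for path, item in schema.get("paths", {}).items():
        for key in item:
            # Path items may also hold non-method keys such as 'parameters'.
            if key.lower() in HTTP_METHODS:
                operations.append(f"{key.upper()} {path}")
    return sorted(operations)
```

With the API running, `list_operations(fetch_schema())` returns one entry per endpoint, which is a quick way to find the project, dataset, evaluation, and LLM profile routes.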
- Full setup and usage: apps/deepeval-pega/README.md
- Project decisions: apps/deepeval-pega/Decisions.md
- Regression notes: apps/deepeval-pega/RegressionNeeds.md
Location: apps/pega-agent-inspector
Pega Agent Inspector is a developer utility for inspecting and interactively testing Pega AI agents through an Express proxy. The browser never calls Pega directly.
- Configuration panel for Pega connection + OAuth details
- Agent inspection panel showing:
  - model and prompt configuration
  - tool metadata
  - SVG dependency graph
  - request history
- Live multi-turn chat panel
- Support for two protocols:
  - Pega DX API (API mode)
  - Agent-to-Agent JSON-RPC (A2A mode)
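For readers unfamiliar with JSON-RPC, which A2A mode relies on, here is a rough sketch of the JSON-RPC 2.0 request and response envelope. It shows only the generic protocol shape. The method name and parameters are hypothetical placeholders, and the payloads the inspector actually sends are not described in this README.

```python
import itertools
import json

# Monotonic ids so each response can be matched to its request.
_request_ids = itertools.count(1)


def build_jsonrpc_request(method: str, params: dict) -> str:
    """Serialize a JSON-RPC 2.0 request with a fresh numeric id."""
    envelope = {
        "jsonrpc": "2.0",          # fixed version tag required by JSON-RPC 2.0
        "id": next(_request_ids),  # echoed back by the server in its reply
        "method": method,
        "params": params,
    }
    return json.dumps(envelope)


def parse_jsonrpc_response(raw: str) -> dict:
    """Return the 'result' member, or raise if the reply carries an 'error'."""
    reply = json.loads(raw)
    if "error" in reply:
        error = reply["error"]
        raise RuntimeError(f"JSON-RPC error {error.get('code')}: {error.get('message')}")
    return reply["result"]


# Hypothetical method name and params, for illustration only.
example_request = build_jsonrpc_request("example/send", {"text": "Hello"})
```

A JSON-RPC reply carries either `result` or `error`, never both, so the parser checks for `error` first.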
- Node.js 18+
- Express
- Plain HTML/CSS/JavaScript frontend (no bundler)
cd apps/pega-agent-inspector
npm install
npm start
Development mode:
npm run dev
Open http://localhost:3002 and configure your Pega endpoint, OAuth token URL, credentials, and agent target.
- Full setup and usage: apps/pega-agent-inspector/README.md
Recommended workflow for teams:
- Use apps/pega-agent-inspector to inspect prompts, tools, and behavior while iterating.
- Capture representative conversations and create golden datasets in apps/deepeval-pega.
- Run DeepEval regression suites after agent, prompt, or tool changes.
- Track and query quality trends using the generated reports and SQLite/MCP pipeline.
- Python 3.11+ (for DeepEval Pega)
- Node.js 18+ (for Pega Agent Inspector and Reflex frontend build)
- Access to a Pega environment with OAuth2 client credentials
- At least one LLM provider credential for DeepEval judging:
  - Google Gemini, OpenAI, AWS Bedrock, or GitHub Copilot
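If you want to verify your Pega OAuth2 client credentials outside either app, the standard client-credentials grant can be sketched as follows. This is a generic OAuth2 illustration, not code from this repository. The function name is illustrative, and the token URL is whatever your Pega environment provides. Some servers expect the client id and secret in the form body instead of a Basic header.

```python
import base64
from urllib.parse import urlencode
from urllib.request import Request


def build_token_request(token_url: str, client_id: str, client_secret: str) -> Request:
    """Build (but do not send) an OAuth2 client-credentials token request."""
    credentials = f"{client_id}:{client_secret}".encode("utf-8")
    headers = {
        # Client authentication via HTTP Basic, as permitted by RFC 6749.
        "Authorization": "Basic " + base64.b64encode(credentials).decode("ascii"),
        "Content-Type": "application/x-www-form-urlencoded",
    }
    # The grant type is the only required form field for this flow.
    body = urlencode({"grant_type": "client_credentials"}).encode("ascii")
    return Request(token_url, data=body, headers=headers, method="POST")
```

Passing the returned object to `urllib.request.urlopen` sends the request; a successful response is a JSON document whose `access_token` field holds the bearer token.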
The scripts/sync-subtrees.sh script synchronizes both app subtrees from their dedicated remotes into this monorepo, then pushes the result.
What it does:
- Pulls latest main from remote pega-agent-inspector into apps/pega-agent-inspector (squashed subtree update).
- Pulls latest main from remote deepeval-pega into apps/deepeval-pega (squashed subtree update).
- Pushes the updated branch to origin.
Run from repository root:
bash scripts/sync-subtrees.sh
Requirements:
- Git remotes named pega-agent-inspector and deepeval-pega must be configured.
- You must have push access to origin.
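The sync steps described above correspond roughly to two `git subtree pull` calls followed by a push. The sketch below reconstructs those commands in Python for illustration. It is an assumption about what the script does, based only on the description here; the actual contents of scripts/sync-subtrees.sh may differ (for example in how the push target is chosen).

```python
import subprocess

# Remote name -> subtree prefix, taken from the description above.
SUBTREES = {
    "pega-agent-inspector": "apps/pega-agent-inspector",
    "deepeval-pega": "apps/deepeval-pega",
}


def sync_commands(branch: str = "main") -> list[list[str]]:
    """Return git commands for a squashed subtree sync followed by a push."""
    commands = [
        ["git", "subtree", "pull", f"--prefix={prefix}", "--squash", remote, branch]
        for remote, prefix in SUBTREES.items()
    ]
    # Assumption: the current branch is pushed to origin under the same name.
    commands.append(["git", "push", "origin", "HEAD"])
    return commands


def run_sync(dry_run: bool = True) -> None:
    """Print each command; execute it in the current directory unless dry_run."""
    for command in sync_commands():
        print(" ".join(command))
        if not dry_run:
            subprocess.run(command, check=True)
```

Running `run_sync()` with the default `dry_run=True` only prints the commands, which is a safe way to review them before executing anything from the repository root.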
- The applications are independent but complementary.
- Each app has its own runtime, dependencies, and configuration.
- Inspector defaults to localhost:3002 and DeepEval UI defaults to localhost:3000, so both can run simultaneously without port conflicts.
- For environment variables and provider-specific setup, use each app’s README.