GitHub - SalesforceAIResearch/enterprise-deep-research: Salesforce Enterprise Deep Research

We present Enterprise Deep Research (EDR), a multi-agent system that integrate:

Master Planning Agent for adaptive query decomposition.
Four specialized search agents (General, Academic, GitHub, LinkedIn).
Extensible MCP-based tool ecosystem supporting NL2SQL, file analysis, and enterprise workflows.
Visualization Agent for data-driven insights.
Reflection mechanism that detects knowledge gaps and updates research direction with optional human-in-the-loop steering guidance.
Real-time steering commands for continuous research refinement.

Note

These components enable automated report generation, real-time streaming, and seamless enterprise deployment, as validated on internal datasets.

🎥 Demo

We present a video demo of using EDR in web application for enterprise deep data analysis.

EDR: Web Application

edr_demo.mp4

Note

Multi-provider LLM support • Slack agent • Real-time streaming • Document analysis • Citation management • Parallel processing • Specialized benchmarking • Human-in-the-loop steering

🚀 Quick Start

Requirements: Python 3.11+ • Node.js 20.9.0+

Installation & Setup

# Clone and setup
git clone https://github.com/SalesforceAIResearch/enterprise-deep-research.git
cd enterprise-deep-research

# Python environment
python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate
pip install -r requirements.txt

# Configure environment
cp .env.sample .env
# Edit .env with your API keys

# Frontend setup
cd ai-research-assistant && npm install && npm run build && cd ..

Environment Configuration

Required Variables:

TAVILY_API_KEY - Tavily search API key
One LLM provider key:
- OPENAI_API_KEY - OpenAI API key
- ANTHROPIC_API_KEY - Anthropic API key
- GROQ_API_KEY - Groq API key
- GOOGLE_CLOUD_PROJECT - Google Cloud project ID
- SAMBNOVA_API_KEY - SambaNova API key

Optional Settings:

LLM_PROVIDER - Default provider (default: openai)
LLM_MODEL - Model name (provider-specific defaults)
MAX_WEB_RESEARCH_LOOPS - Max iterations (default: 10)

Supported Models

Provider	Default Model	Available Models
OpenAI	`o4-mini`	`o4-mini`, `o4-mini-high`, `o3-mini`, `o3-mini-reasoning`, `gpt-4o`
Anthropic	`claude-sonnet-4`	`claude-sonnet-4`, `claude-sonnet-4-thinking`, `claude-3-7-sonnet`, `claude-3-7-sonnet-thinking`
Google	`gemini-2.5-pro`	`gemini-2.5-pro`, `gemini-1.5-pro-latest`, `gemini-1.5-flash-latest`
Groq	`deepseek-r1-distill-llama-70b`	`deepseek-r1-distill-llama-70b`, `llama-3.3-70b-versatile`, `llama3-70b-8192`
SambaNova	`DeepSeek-V3-0324`	`DeepSeek-V3-0324`

Running the Application

Full Stack (Recommended) - Single Command:

python -m uvicorn app:app --host 0.0.0.0 --port 8000

The application will serve both the backend API and pre-built frontend at http://localhost:8000

Backend API Documentation: http://localhost:8000/docs

💻 Usage

Command Line

python benchmarks/run_research.py "Your research question" \
  --provider openai --model o3-mini --max-loops 3

Web Interface

Navigate to http://localhost:8000 for interactive research with real-time progress tracking.

📚 Benchmarking & Development

Supported Benchmarks

DeepResearchBench: Comprehensive research evaluation
ResearchQA: Question-answering with citation verification
DeepConsult: Consulting-style analysis tasks

EDR-200 Dataset

The EDR-200 dataset contains 201 complete agentic research trajectories generated by Enterprise Deep Research—99 queries from DeepResearch Bench and 102 queries from DeepConsult. Unlike prior benchmarks that only capture final outputs, these trajectories expose the full reasoning process across search, reflection, and synthesis steps, enabling fine-grained analysis of agentic planning and decision-making dynamics.

Running Benchmarks

Refer to our detailed benchmarking guide.

Development Setup

# Testing
python -m pytest tests/
python test_agents.py

# Code quality
black src/ services/ benchmarks/
mypy src/ services/
flake8 src/ services/ benchmarks/

# Development server
python -m uvicorn app:app --reload --host 0.0.0.0 --port 8000
cd ai-research-assistant && npm run dev

📁 Project Structure

enterprise-deep-research/
├── ai-research-assistant/       # React frontend
├── benchmarks/                  # Evaluation framework
├── src/                        # Core research engine
│   ├── agent_architecture.py   # Multi-agent orchestration
│   ├── graph.py               # LangGraph workflow definitions
│   ├── state.py               # Research state management
│   ├── simple_steering.py     # Steering & task management
│   ├── steering_integration.py # Steering integration layer
│   ├── prompts.py             # Agent prompts & templates
│   ├── configuration.py       # Agent configuration
│   ├── utils.py               # Utility functions
│   ├── visualization_agent.py # Visualization generation
│   └── tools/                 # Research tools & MCP integration
├── services/                   # Backend services (research, analysis, parsing)
├── routers/                    # FastAPI endpoints
├── models/                     # Data schemas
├── app.py                      # Main FastAPI application
├── llm_clients.py              # LLM provider clients
├── session_store.py            # Session management
└── requirements.txt            # Python dependencies

Star History

📜 License & Citation

Licensed under Apache 2.0.

@article{prabhakar2025enterprisedeepresearch,
  title={Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics},
  author={Prabhakar, Akshara and Ram, Roshan and Chen, Zixiang and Savarese, Silvio and Wang, Frank and Xiong, Caiming and Wang, Huan and Yao, Weiran},
  journal={arXiv preprint arXiv:2510.17797},
  year={2025}
}

Acknowledgments: Built on LangGraph, Tavily, React, Tailwind CSS, and FastAPI.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
ai-research-assistant		ai-research-assistant
assets		assets
benchmarks		benchmarks
models		models
routers		routers
services		services
src		src
.env.sample		.env.sample
.gitignore		.gitignore
AI_ETHICS.md		AI_ETHICS.md
CODEOWNERS		CODEOWNERS
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.txt		LICENSE.txt
README.md		README.md
SECURITY.md		SECURITY.md
Tech_Report__Enterprise_Deep_Research.pdf		Tech_Report__Enterprise_Deep_Research.pdf
app.py		app.py
coding-agent.py		coding-agent.py
e2b.Dockerfile		e2b.Dockerfile
e2b.toml		e2b.toml
e2b_documentation.md		e2b_documentation.md
graph_test.py		graph_test.py
how_to_license.md		how_to_license.md
langgraph.json		langgraph.json
llm_clients.py		llm_clients.py
log-browser.yml		log-browser.yml
log-server.yml		log-server.yml
math_client.py		math_client.py
math_client_langgraph.py		math_client_langgraph.py
math_client_new.py		math_client_new.py
math_server.py		math_server.py
mcp_agent.secrets.yaml		mcp_agent.secrets.yaml
model_test.py		model_test.py
package-lock.json		package-lock.json
package.json		package.json
pyproject.toml		pyproject.toml
replit.nix		replit.nix
requirements.txt		requirements.txt
session_store.py		session_store.py
stream_test_2.html		stream_test_2.html
test_agents.py		test_agents.py
test_benchmark.py		test_benchmark.py
test_graph.py		test_graph.py
test_specialized_searches.py		test_specialized_searches.py
test_unified_query.py		test_unified_query.py
test_visualization.py		test_visualization.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🎥 Demo

EDR: Web Application

🚀 Quick Start

Installation & Setup

Environment Configuration

Supported Models

Running the Application

💻 Usage

Command Line

Web Interface

📚 Benchmarking & Development

Supported Benchmarks

EDR-200 Dataset

Running Benchmarks

Development Setup

📁 Project Structure

Star History

📜 License & Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Languages

License

SalesforceAIResearch/enterprise-deep-research

Folders and files

Latest commit

History

Repository files navigation

🎥 Demo

EDR: Web Application

🚀 Quick Start

Installation & Setup

Environment Configuration

Supported Models

Running the Application

💻 Usage

Command Line

Web Interface

📚 Benchmarking & Development

Supported Benchmarks

EDR-200 Dataset

Running Benchmarks

Development Setup

📁 Project Structure

Star History

📜 License & Citation

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Languages

Packages