🤖 Multi-Agent AI System

Intelligent Data Analysis & Research Assistant Platform

A sophisticated multi-agent AI system that seamlessly processes both structured business data and unstructured research documents through natural language queries and intelligent agent orchestration.

🎯 Overview

This system addresses the common business challenge of analyzing both structured data (CSV/Excel files) and unstructured research documents (PDFs) through a single, intuitive interface. Instead of switching between different tools, users can ask natural language questions and get intelligent responses with automatic visualizations and insights.

Key Features

🧠 Intelligent Query Routing: Automatically determines whether your question is about data or research
📊 Data Intelligence: Analyzes business data with natural language queries and auto-generated charts
📄 Research Assistant: Processes research papers with summarization, keyword extraction, and Q&A
💬 Unified Chat Interface: Single interface for all types of queries
📈 Interactive Visualizations: Real-time charts and graphs based on your questions
🚀 Multiple Deployment Options: Local, Docker, and cloud deployment ready

🏗️ System Architecture

The system follows a multi-agent architecture pattern with three specialized AI agents coordinated by an intelligent orchestrator:

┌─────────────────────────────────────────────────────────────────┐
│                    USER INTERFACE (Streamlit)                   │
│                     Natural Language Queries                    │
└─────────────────────┬───────────────────────────────────────────┘
                      │
┌─────────────────────▼───────────────────────────────────────────┐
│                  ORCHESTRATOR AGENT                            │
│              • Query Classification                             │
│              • Intent Recognition                               │
│              • Agent Routing                                    │
│              • Context Management                               │
└─────────────┬─────────────────────────────────┬─────────────────┘
              │                                 │
┌─────────────▼─────────────┐     ┌─────────────▼─────────────────┐
│   DATA INTELLIGENCE       │     │   RESEARCH ASSISTANT          │
│        AGENT               │     │         AGENT                 │
│                           │     │                               │
│ • CSV/Excel Processing    │     │ • PDF Text Extraction         │
│ • Pandas Integration      │     │ • Document Chunking           │
│ • SQL Query Generation    │     │ • Semantic Embeddings         │
│ • Chart Generation        │     │ • Vector Search (FAISS)       │
│ • Statistical Analysis    │     │ • Summarization               │
│                           │     │ • Keyword Extraction          │
└───────────────────────────┘     └───────────────────────────────┘
              │                                 │
┌─────────────▼─────────────────────────────────▼─────────────────┐
│                    BACKEND API (FastAPI)                        │
│               RESTful endpoints with OpenAPI docs               │
└─────────────────────────────────────────────────────────────────┘

Architecture Components

🎯 Orchestrator Agent

Purpose: Central coordinator that analyzes user queries and routes them to appropriate specialists
Technology: Custom classification algorithm with keyword analysis and context awareness
Intelligence: Handles query disambiguation and maintains conversation context

📊 Data Intelligence Agent

Purpose: Specializes in structured data analysis and business intelligence
Technology: Pandas for data processing, Plotly for visualizations, SQLite for storage
Capabilities: Aggregations, trend analysis, statistical computations, automatic chart generation

📄 Research Assistant Agent

Purpose: Handles unstructured document analysis and research tasks
Technology: PyMuPDF for PDF processing, Sentence Transformers for embeddings, FAISS for vector search
Capabilities: Document summarization, semantic search, keyword extraction, Q&A

🌐 Unified Interface

Frontend: Streamlit-based web application with chat interface
Backend: FastAPI REST API with automatic OpenAPI documentation
Communication: RESTful API endpoints with JSON data exchange

🚀 Quick Start

Option 1: Local Development

Clone the repository

git clone https://github.com/your-username/multi-agent-ai-system.git
cd multi-agent-ai-system

Install dependencies
```
pip install -r requirements.txt
```
Run the system
```
python run.py
```
Access the application
- Frontend: http://localhost:8501
- API Documentation: http://localhost:8000/docs

Option 2: Docker Deployment

Build and run with Docker Compose
```
docker-compose up --build
```
Access the services
- Frontend: http://localhost:8501
- Backend API: http://localhost:8000

Option 3: Streamlit Cloud

Deploy to Streamlit Cloud
- Fork this repository
- Go to share.streamlit.io
- Deploy using streamlit_app.py

📖 How to Use

1. Upload Your Data

For Data Analysis:

Upload CSV or Excel files containing your business data
Supported formats: .csv, .xlsx, .xls
Examples: sales data, customer information, financial records

For Research Analysis:

Upload PDF research papers or documents
The system will automatically extract and process the text
Creates searchable embeddings for intelligent Q&A

2. Ask Natural Language Questions

Data Analysis Examples:

"What was the total revenue in Q2?"
"Show me the top 5 customers by sales"
"Plot monthly revenue trends"
"Which product category performs best?"
"Create a bar chart of sales by region"

Research Analysis Examples:

"Summarize this research paper"
"What methodology was used in the study?"
"Extract the key findings"
"What are the main challenges discussed?"
"Find information about machine learning approaches"

3. Get Intelligent Responses

The system automatically:

Routes your query to the appropriate agent
Processes the request using specialized algorithms
Generates visualizations, summaries, or answers
Presents results in an easy-to-understand format

📊 Sample Data

The repository includes sample datasets for testing:

sales_data.csv: 40 sales transactions with product, revenue, and regional data
customer_data.csv: 30 customer profiles with demographics and purchasing behavior
sample_research_paper.txt: Research paper on computer vision and deep learning

🛠️ Technology Stack

Backend

FastAPI: Modern, fast web framework for building APIs
Python 3.9+: Core programming language
Pandas: Data manipulation and analysis
SQLite: Lightweight database for data storage
Uvicorn: ASGI server for FastAPI

AI & Machine Learning

Sentence Transformers: Text embeddings for semantic search
FAISS: Vector similarity search for document retrieval
NLTK: Natural language processing toolkit
PyMuPDF: PDF text extraction and processing

Visualization

Plotly: Interactive charts and graphs
Matplotlib: Statistical plotting library
Seaborn: Statistical data visualization

Frontend

Streamlit: Interactive web application framework
HTML/CSS: Custom styling and responsive design

Deployment

Docker: Containerization for consistent deployment
Docker Compose: Multi-container orchestration

📁 Project Structure

multi-agent-ai-system/
├── backend/                 # FastAPI backend application
│   ├── agents/             # AI agent implementations
│   │   ├── data_intelligence_agent.py
│   │   ├── research_assistant_agent.py
│   │   └── orchestrator_agent.py
│   └── main.py            # FastAPI application entry point
│
├── frontend/              # Streamlit frontend application
│   ├── app.py            # Full-featured app (with backend)
│   └── app_standalone.py # Standalone app (for cloud deployment)
│
├── docker/               # Docker configuration
│   ├── Dockerfile.backend
│   └── Dockerfile.frontend
│
├── sample_data/         # Sample datasets for testing
│   ├── sales_data.csv
│   ├── customer_data.csv
│   └── sample_research_paper.txt
│
├── .streamlit/          # Streamlit configuration
│   └── config.toml
│
├── requirements.txt     # Python dependencies
├── streamlit_requirements.txt  # Streamlit Cloud dependencies
├── docker-compose.yml   # Container orchestration
├── streamlit_app.py    # Cloud deployment entry point
└── run.py              # Local development launcher

🔧 Configuration

Environment Variables (Optional)

Create a .env file for custom configuration:

# API Configuration
API_HOST=0.0.0.0
API_PORT=8000
STREAMLIT_PORT=8501

# Development
DEBUG=False
LOG_LEVEL=INFO

Customization

Agent Behavior: Modify agent parameters in respective Python files
UI Styling: Update CSS in the Streamlit app files
Data Processing: Extend the data intelligence agent for custom analysis
Document Processing: Enhance the research assistant for specific document types

🤝 Contributing

We welcome contributions! Here's how to get started:

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

FastAPI: For the excellent modern web framework
Streamlit: For making web app development incredibly simple
Sentence Transformers: For powerful text embeddings
Plotly: For beautiful interactive visualizations
The Open Source Community: For the amazing tools and libraries

📞 Support

If you encounter any issues or have questions:

Check the existing Issues
Create a new issue with detailed information
Include steps to reproduce any problems

Built with ❤️ for intelligent data analysis and research assistance

Transform your data analysis workflow with the power of AI agents working together seamlessly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🤖 Multi-Agent AI System

🎯 Overview

Key Features

🏗️ System Architecture

Architecture Components

🚀 Quick Start

Option 1: Local Development

Option 2: Docker Deployment

Option 3: Streamlit Cloud

📖 How to Use

1. Upload Your Data

2. Ask Natural Language Questions

3. Get Intelligent Responses

📊 Sample Data

🛠️ Technology Stack

Backend

AI & Machine Learning

Visualization

Frontend

Deployment

📁 Project Structure

🔧 Configuration

Environment Variables (Optional)

Customization

🤝 Contributing

📝 License

🙏 Acknowledgments

📞 Support

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.streamlit		.streamlit
backend		backend
docker		docker
frontend		frontend
sample_data		sample_data
.dockerignore		.dockerignore
.gitignore		.gitignore
Demo Script.docx		Demo Script.docx
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
logs-dxtjain-multi-agent-ai-system-main-streamlit_app.py-2025-09-21T20_18_20.787Z.txt		logs-dxtjain-multi-agent-ai-system-main-streamlit_app.py-2025-09-21T20_18_20.787Z.txt
packages.txt		packages.txt
requirements.txt		requirements.txt
run.py		run.py
streamlit_app.py		streamlit_app.py
streamlit_requirements.txt		streamlit_requirements.txt
~$mo Script.docx		~$mo Script.docx

License

thedixitjain/Multi-Agent-AI-System

Folders and files

Latest commit

History

Repository files navigation

🤖 Multi-Agent AI System

🎯 Overview

Key Features

🏗️ System Architecture

Architecture Components

🚀 Quick Start

Option 1: Local Development

Option 2: Docker Deployment

Option 3: Streamlit Cloud

📖 How to Use

1. Upload Your Data

2. Ask Natural Language Questions

3. Get Intelligent Responses

📊 Sample Data

🛠️ Technology Stack

Backend

AI & Machine Learning

Visualization

Frontend

Deployment

📁 Project Structure

🔧 Configuration

Environment Variables (Optional)

Customization

🤝 Contributing

📝 License

🙏 Acknowledgments

📞 Support

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages