Edu Chatbot

RAG-based chatbot with a human transfer mechanism (architecture diagram)

Edu Chatbot is a customer service chatbot application, created for education enrichment businesses to auto-reply to customer inquiries. It manages customer inquiries across multiple channels including websites, WhatsApp, WeChat, Telegram, and more.

Updates (15May2025):

  • Main application "Edu Chatbot" is ready!
  • Follow the Setup Guide below to interact with it via Docker Compose.
  • ChromaDB is used as the vector database for this version, which is intended for running on local machines only.
  • Currently in the process of building a separate application for chatbot creators, with the following features:
    1. Designed to create a chatbot dynamically from the user's instructions.
    2. Weaviate as the vector database
    3. Automated deployment to the cloud

Overview & Key Features

Edu Chatbot combines AI technologies with human oversight to ensure customer satisfaction and improve sales conversion:

  • 🤖 Intelligent Interaction: Leverages Retrieval-Augmented Generation (RAG) to respond to complex customer inquiries, with customization according to business needs.

  • 📚 Knowledge Base: Stores and indexes frequently asked questions (FAQs), course details, pricing information, content crawled from the company website, and other business-critical data in a vector database for rapid, accurate retrieval.

  • 🎯 Personalized Recommendations: Gathers relevant student information such as age and interests to recommend relevant courses.

  • 🧠 Intent Classification: Identifies customer needs to provide targeted responses.

  • 😊 Sentiment Analysis: Detects customer satisfaction levels and can escalate to human staff when a pre-configured threshold is reached.

  • 🧩 Custom Conversation Simulator: Takes on the parent role, generating realistic queries and follow-up questions. Creates customizable datasets with varying personas and complexity levels for pre-deployment testing and CI/CD monitoring during the production phase.

  • 📊 Comprehensive Evaluation: A spectrum of evaluation metrics for single-turn and multi-turn conversations.

  • 👨‍💼 Human-in-the-Loop Design: Ensures quality customer service through a sophisticated handoff system that activates when:

    1. A customer explicitly requests to speak with a human representative
    2. The sentiment analysis module detects customer frustration or dissatisfaction
    3. Staff members proactively choose to intervene via the support dashboard
  • 🔄 Seamless Handoff: Enables staff to take over conversations when needed and return control to the chatbot once complex issues are resolved.

  • 📱 Dual Interface: Features a comprehensive demonstration UI with customer-facing chat (left panel) and staff support dashboard (right panel) views.

RAG-based chatbot with a human transfer mechanism

Demo

Check out Edu Chatbot in action: YouTube

To help you follow what's happening in the demo video, below is a write-up of the complete interaction flow:

The flow begins when a customer inquires about courses through a chatbot. The chatbot classifies the intent and gathers key information such as the student's age and interests. Based on the inputs, it recommends suitable courses along with details like descriptions, teacher profiles, pricing, and schedules. When the customer expresses concern about the price and requests a discount, the chatbot informs them it isn't authorized to offer one. A support staff monitoring the conversation then intervenes by clicking a "Take Over" button and offers a special discount. The customer accepts, and the staff hands the conversation back to the chatbot, which proceeds with enrollment, completing the interaction smoothly.

Setup Guide

Prerequisites

  • Python version 3.12+
  • Docker Desktop

Installation

  1. Clone the repository
git clone https://github.com/Jeanetted3v/edu_chatbot.git
cd edu_chatbot
  2. Configure environment variables
  • Key variables you need: at least one LLM API key and a MongoDB URI
  • For the rest, you can put in placeholders such as "api_key" so that the system can still run.
cp .env.example .env
# Edit the .env file with your API keys and configurations
  3. (Alternative to Docker) Install Python dependencies if you're running locally without Docker:
python -m venv venv
source venv/bin/activate  # On Windows use: venv\Scripts\activate
pip install --upgrade pip
pip install -r requirements.txt

Data Configuration - Local

  • Place your unstructured FAQ documents (PDF) and structured-data Excel files in the data/data_to_ingest folder
  • In config/data_ingest.yaml, configure the paths under "local_doc" according to the file names and Excel sheet names
  • Currently, only text content is supported.
local_doc:
  paths:
    - path: ./data/data_to_ingest/excel.xlsx
      sheet: syn_data
    - path: ./data/data_to_ingest/rag_qna.pdf
  • In config/data_ingest.yaml, configure the ChromaDB collection name accordingly; the default is "syn_data" (a usage sketch follows the snippet below)
embedder:
  similarity_metric: cosine
  persist_dir: ./data/embeddings
  collection: syn_data
  vector_store: chromadb
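
For reference, here is a minimal sketch of how the embedder settings above could map onto ChromaDB's Python client. The collection name, persist directory, and cosine metric come from the YAML snippet; the config loading and the placeholder embedding are illustrative assumptions, not the repository's actual ingestion code.

import chromadb
import yaml

# Load the embedder settings shown above
with open("config/data_ingest.yaml") as f:
    cfg = yaml.safe_load(f)["embedder"]

# Persistent local ChromaDB store in the configured directory
client = chromadb.PersistentClient(path=cfg["persist_dir"])

# Create (or reuse) the collection with cosine similarity
collection = client.get_or_create_collection(
    name=cfg["collection"],
    metadata={"hnsw:space": cfg["similarity_metric"]},  # "cosine"
)

# Add one pre-chunked document with its embedding
collection.add(
    ids=["faq-0"],
    documents=["What age groups do you teach?"],
    embeddings=[[0.1] * 1536],  # placeholder vector; real code would call an embedding model
)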

Data Configuration - Gdrive (Temporarily disabled)

  • Or configure Google Drive access
  • Generate and download Google Drive API credentials JSON file
  • Place your credentials file in a secure location
  • In config/data_ingest.yaml, configure the Google Drive settings:
gdrive:
  credentials_path: /path/to/your/credentials.json   # Path to your Google API credentials JSON file
gdrive_doc:
  - file_id: abcd123efg456                           # ID from Google Sheets URL
    file_type: sheets                                # For Google Sheets documents
  - file_id: abcd123efg456                           # ID from Google Docs URL
    file_type: docs                                  # For Google Docs documents
  # - file_id: your_drive_pdf_file_id_here
  #   file_type: pdf   # Support for PDF files (coming soon)
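
As a rough illustration only (this loader is temporarily disabled): a sketch of pulling a Google Sheet via the Drive API, assuming a service-account credential and the google-api-python-client package, and reusing the placeholder file ID from the config above. The repository's own loader may authenticate differently.

from google.oauth2 import service_account
from googleapiclient.discovery import build

SCOPES = ["https://www.googleapis.com/auth/drive.readonly"]

# Credentials JSON generated from the Google Cloud console (path is a placeholder)
creds = service_account.Credentials.from_service_account_file(
    "/path/to/your/credentials.json", scopes=SCOPES
)
drive = build("drive", "v3", credentials=creds)

# Export a Google Sheet as CSV text; file_id matches the placeholder in the config example
csv_bytes = drive.files().export(
    fileId="abcd123efg456", mimeType="text/csv"
).execute()
print(csv_bytes.decode("utf-8")[:200])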

Other configurable parameters

  • Other configurable parameters, such as the LLM model, retriever settings, and prompts, can be found in the YAML files in the config/ folder.
  • The main entry point for the chatbot's configurable parameters is config/config.yaml
  • The main entry point for the data ingestion pipeline's configurable parameters is config/data_ingest.yaml
  • Feel free to change them or just use the current default settings.

Running the app

  1. Start the application using Docker Compose
docker compose up --build
  2. Access the application
  • Open the frontend (port 3000) in a web browser to interact with the chatbot's dual interface.
  • Or, if you are familiar with Swagger UI, interact directly with the backend API (port 8000).

Technical Implementation Details

📝 RAG or Long Context?

  • In view of recent advances in LLM context windows, this chatbot passes the data directly to the LLM when it is within a certain token count; if the token count exceeds that threshold, it falls back to RAG instead (see the sketch below).
  • The token-count threshold is a configurable parameter in config/data_ingest.yaml
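
A minimal sketch of that routing decision, assuming tiktoken for token counting; the threshold name and value here are illustrative, not the actual keys in config/data_ingest.yaml.

import tiktoken

def needs_rag(knowledge_base_text: str, token_threshold: int = 30_000) -> bool:
    """Return True when the knowledge base is too large for long-context prompting."""
    enc = tiktoken.get_encoding("cl100k_base")
    return len(enc.encode(knowledge_base_text)) > token_threshold

# If the data fits under the threshold, stuff it directly into the LLM prompt;
# otherwise fall back to retrieving only the relevant chunks via the RAG pipeline.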

📂 Loading documents from Local or Google Drive

  • An education company can load data either locally or from Google Drive for both structured and unstructured data ingestion.
  • This can be configured in config/data_ingest.yaml

✂️ Chunking

  • Various chunking strategies are configurable in the config/data_ingest.yaml file.
  • Currently, LangChain is used for the RecursiveCharacter and SemanticChunker chunking strategies in the RAG pipeline (see the sketch below).
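
A minimal sketch of the recursive-character strategy via LangChain; the chunk sizes and sample text are illustrative, not the values set in config/data_ingest.yaml.

from langchain_text_splitters import RecursiveCharacterTextSplitter

splitter = RecursiveCharacterTextSplitter(
    chunk_size=1000,   # illustrative; the real values come from the config
    chunk_overlap=200,
)

faq_text = "Q: What ages do you teach? A: We run classes for ages 5 to 16 ..."
chunks = splitter.split_text(faq_text)
print(len(chunks), chunks[0][:80])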

🔍 Embedding, Vector Database & Retrieval

  • Implements ChromaDB for lightweight, high-performance vector storage.
  • Utilizes BM25 for efficient full-text keyword search, enabling robust lexical matching alongside semantic retrieval.
  • Applies CrossEncoder as a reranker to refine and boost relevance of retrieved results through deeper contextual scoring.
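
An illustrative sketch of this retrieve-then-rerank flow, assuming the rank_bm25 and sentence-transformers packages; the reranker model name and the sample documents are assumptions, and the repository's retriever may combine the candidates differently.

from rank_bm25 import BM25Okapi
from sentence_transformers import CrossEncoder

query = "How much is the coding class for a 10 year old?"
docs = [
    "Junior coding course: ages 8-12, $300 per term.",
    "Creative writing course: ages 10-14, $250 per term.",
    "Refund policy: full refund within 7 days of enrolment.",
]

# 1) Sparse lexical retrieval with BM25 (top candidates by keyword overlap)
bm25 = BM25Okapi([d.lower().split() for d in docs])
candidates = bm25.get_top_n(query.lower().split(), docs, n=2)

# 2) Semantic candidates would come from ChromaDB (collection.query(...)); omitted here.

# 3) Rerank candidates with a cross-encoder for deeper contextual scoring
reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")
scores = reranker.predict([(query, d) for d in candidates])
best_doc = max(zip(scores, candidates))[1]
print(best_doc)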

🤖 Modular Agentic RAG

  • PydanticAI is used here for its simplicity and data validation features.
  • Pydantic Logfire is also implemented for LLM tracing.
  • It provides more deterministic, structured output, for example during the intent classification process (see the sketch below).
  • For other LLM functions, the plain vanilla OpenAI API is used for simplicity and flexibility.
  • A separate Reasoning Agent elaborates on the incoming query, assesses whether it should be split into multiple intents (and thus multiple retrievals), and determines whether RAG is required.
  • A Response Agent then synthesizes the incoming query, chat history, and any retrieved documents to generate the final customer response. This modular approach optimizes latency, reduces unnecessary retrieval calls, and improves the relevance and coherence of responses.
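
A minimal sketch of structured intent classification with PydanticAI; the model name, the IntentResult fields, and the prompt are illustrative assumptions, and the structured result is passed via output_type as in recent pydantic-ai releases (older releases used result_type).

from pydantic import BaseModel
from pydantic_ai import Agent

class IntentResult(BaseModel):
    intent: str       # e.g. "course_inquiry", "pricing", "human_agent_request"
    needs_rag: bool   # whether the query requires knowledge-base retrieval

intent_agent = Agent(
    "openai:gpt-4o-mini",           # illustrative model choice
    output_type=IntentResult,       # forces a validated, structured response
    system_prompt="Classify the customer's intent for an education chatbot.",
)

result = intent_agent.run_sync("Do you have coding classes for a 10 year old?")
print(result.output.intent, result.output.needs_rag)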

💾 Saved Chat History

  • All chat histories are saved in MongoDB, which allows for tracing, further analysis, and prompt enhancement.
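
A minimal sketch of persisting and retrieving a chat turn with pymongo; the environment variable name, database and collection names, and document fields are illustrative assumptions rather than the repository's actual schema.

import os
from datetime import datetime, timezone
from pymongo import MongoClient

client = MongoClient(os.environ["MONGODB_URI"])    # URI supplied via the .env file
history = client["edu_chatbot"]["chat_history"]    # hypothetical database/collection names

history.insert_one({
    "session_id": "demo-session-1",
    "role": "customer",
    "content": "Do you have coding classes for a 10 year old?",
    "timestamp": datetime.now(timezone.utc),
})

# Retrieve the conversation in order to rebuild multi-turn context
turns = list(history.find({"session_id": "demo-session-1"}).sort("timestamp", 1))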

📊 Evaluation

  • Ragas: Metrics include answer relevancy, faithfulness, context precision, answer correctness.
  • DeepEval: Conversational metrics are also used here since it involves multi-turn conversations.
  • Evaluation results are logged for continuous improvement of the system.
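
An illustrative sketch of computing the Ragas metrics listed above, assuming the ragas 0.1-style evaluate API with a Hugging Face datasets table; the sample rows and column names follow that API, not necessarily the repository's evaluation scripts.

from datasets import Dataset
from ragas import evaluate
from ragas.metrics import (
    answer_correctness,
    answer_relevancy,
    context_precision,
    faithfulness,
)

eval_data = Dataset.from_dict({
    "question": ["What ages is the junior coding course for?"],
    "answer": ["The junior coding course is for ages 8 to 12."],
    "contexts": [["Junior coding course: ages 8-12, $300 per term."]],
    "ground_truth": ["Ages 8 to 12."],
})

scores = evaluate(
    eval_data,
    metrics=[answer_relevancy, faithfulness, context_precision, answer_correctness],
)
print(scores)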

Ideas for Future Enhancements

  1. Multi-Channel Integration
  • Implement direct integration with WhatsApp, WeChat, Telegram, and other messaging platforms
  • Develop a unified API layer for a consistent experience across all communication channels
  • Enable channel-specific customizations while maintaining core functionality
  2. Vector Database
  • Support more types of vector databases
  3. Multimodal Data
  • Support RAG over multimodal data
  4. Enhanced Evaluation
  • Add customized evaluation metrics

Project Structure

Edu_chatbot/                    # Root of project
├── assets/                     # Images and videos used in the README
├── config/                     # Configurable parameters, prompt templates
├── data/
│   ├── convo/                  # Conversations extracted from MongoDB
│   ├── crawl/                  # Data scraped from websites (raw and parsed)
│   ├── data_to_ingest/         # Raw documents uploaded by users
│   ├── embeddings/             # Chunked and embedded document vectors
│   ├── eval/                   # Evaluation results (CSV, JSON)
│   └── simulations/            # Simulated user-chatbot interactions
├── dockerfiles/                # Dockerfiles for backend and frontend
├── src/
│   ├── backend/
│   │   ├── api/                # API endpoints and websocket logic
│   │   ├── chat/               # Chat modules: query handling, sentiment analysis, etc.
│   │   ├── database/           # Database interface and helper modules
│   │   ├── dataloaders/        # Loaders for local files and Google Drive
│   │   ├── dataprocessor/      # Chunking and embedding modules
│   │   ├── evaluation/         # RAGAS, DeepEval, and simulator components
│   │   ├── main/               # Entry points to run pipelines without API
│   │   ├── models/             # Pydantic models and schema definitions
│   │   └── utils/              # Logging, config, and helper utilities
│   └── frontend/               # React and Node.js frontend code
│       └── src/
│           └── app/
│               ├── components/ # Reusable UI components
│               ├── services/   # Frontend services (e.g., API calls)
│               └── page.tsx    # Main app page
├── .dockerignore               # Files/folders to exclude from Docker builds
├── .env                        # Environment variables (to be filled before running)
├── .env.example                # Template for .env file
├── .gitignore                  # Files/folders to ignore in Git
├── conftest.py                 # Config file for running DeepEval (pytest)
├── docker-compose.yml          # Compose file to run services locally
├── LICENSE                     # License information
├── README.md                   # Project overview and setup instructions
├── requirements.in             # Unresolved requirements input for pip-compile
└── requirements.txt            # Resolved, locked dependencies for pip install

Tech Stack

🧠 OpenAI: LLM provider for natural language understanding and generation
🔍 PydanticAI: Agentic framework for data validation and structured outputs
⛓️ LangChain: Document processing and chunking
😊 VADER: Sentiment analysis
🔍 ChromaDB: Vector database for semantic search
💾 MongoDB: Chat history storage and data persistence
📁 Google Drive API: Remote data access and integration
⚡ FastAPI: Backend API framework serving both HTTP and WebSocket endpoints; WebSocket provides the real-time communication this use case requires
⚛️ Node.js/React: Frontend interface for chat interaction and the staff dashboard
📚 BM25: Traditional keyword-based retriever for efficient sparse search
🔍 CrossEncoder: Reranker used after initial retrieval to improve response relevance
🐳 Docker: Containerization and deployment
📊 RAGAS: RAG evaluation framework for measuring relevancy, faithfulness, and correctness
📊 DeepEval: Evaluation framework for conversational metrics such as role adherence, knowledge retention, conversation completeness, and relevancy

References & Thoughts

  1. Klarna Chatbot Strategy Shift: Why Companies Are Rebalancing Human and AI Customer Service: A fintech company's account of its experience and challenges with large-scale chatbot adoption in customer service, and its plans to address them. Lessons learnt:
  • Application-level measurements, as well as LLM- and RAG-level evaluations, are important.
  • Use real customer interaction data to identify and address AI shortcomings.
  • Understanding conversational paths by specific use case or customer intent gives a clear playbook for where to apply AI confidently (and may suggest other places to incorporate AI).
  2. "Long RAG" & Practical Guide for Model Selection for Real-World Use Cases: A RAG technique shared by OpenAI based on long context. Suitable for use cases that require high precision and are not particular about latency.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

MIT License
