EduRAG: Intelligent Teaching Assistant

🚀 Overview

EduRAG is a Retrieval-Augmented Generation (RAG) based AI system designed to generate accurate, context-aware answers from structured educational content.
It improves traditional question-answering by combining semantic search with LLM-based response generation.

🔥 Key Features

Retrieval-Augmented Generation (RAG) pipeline for improved answer accuracy
Custom chunking and chunk-merging strategy to enhance context quality
Embedding-based semantic search for relevant content retrieval
Multi-file JSON data processing and structuring
Context-aware response generation with reduced noise

🧠 How It Works

Raw data is preprocessed and divided into chunks
Chunks are intelligently merged to improve context
Embeddings are created for semantic understanding
Top-k relevant chunks are retrieved based on query
LLM generates a final answer using retrieved context

🛠️ Tech Stack

Python
JSON Data Processing
Embedding-based Semantic Search
Retrieval-Augmented Generation (RAG)

📂 Project Structure

. ├── merge_chunks.py ├── preprocess_json.py ├── processing_query.py ├── mp3_to_json.py ├── video_to_mp3.py ├── README.md

🎯 Key Highlights

Designed a custom chunk-merging mechanism to reduce context fragmentation
Improved answer quality by optimizing chunk size and grouping strategy
Built modular scripts for scalable data preprocessing and retrieval
Focused on enhancing LLM performance through better context handling

⚡ Usage / Workflow

Convert video content to audio using video_to_mp3.py
Transcribe audio to structured JSON using mp3_to_json.py
Preprocess raw data using preprocess_json.py
Merge chunks for improved context using merge_chunks.py
Perform query processing and retrieval using processing_query.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EduRAG: Intelligent Teaching Assistant

🚀 Overview

🔥 Key Features

🧠 How It Works

🛠️ Tech Stack

📂 Project Structure

🎯 Key Highlights

⚡ Usage / Workflow

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
Readme.md		Readme.md
merge_chunks.py		merge_chunks.py
mp3_to_json.py		mp3_to_json.py
preprocess_json.py		preprocess_json.py
processing_query.py		processing_query.py
video_to_mp3.py		video_to_mp3.py

Folders and files

Latest commit

History

Repository files navigation

EduRAG: Intelligent Teaching Assistant

🚀 Overview

🔥 Key Features

🧠 How It Works

🛠️ Tech Stack

📂 Project Structure

🎯 Key Highlights

⚡ Usage / Workflow

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages