This project implements a Causal Retrieval-Augmented Generation (RAG) system to analyze customer service conversations and produce explainable, evidence-backed answers to analytical queries.
Instead of generating free-form text, the system focuses on:
- Identifying why outcomes occur
- Extracting causal factors
- Grounding explanations in real transcript evidence
PredictX/
├── src/
│ └── main.py
├── data/
│ ├── Conversational_Transcript_Dataset.json
│ ├── queries.csv
│ └── submission_output.csv
├── requirements.txt
├── README.md
└── technical_report.pdf
- Retrieve relevant conversations using sentence embeddings
- Retrieve relevant dialogue turns
- Tag turns with causal signals (rule-based, deterministic)
- Aggregate dominant causal factors
- Produce explainable output with evidence
- Support follow-up questions using session memory
pip install -r requirements.txtpython src/main.pyThis will:
1.Read queries from data/queries.csv
2.Run the causal analysis pipeline
3.Write results to data/submission_output.csv
No external LLM calls are required
All reasoning is transparent and auditable
Designed for analytical and compliance-focused use cases