This is a RAG chatbot built with LangGraph, LangChain, and your choice of the IBM Granite model, LLaMA 3.1 via Ollama, or LLaMA 2. The chatbot answers technical questions based on the KRKN pod scenarios documentation.
Note: To ensure accurate responses based on the provided documentation, please include the keyword “krkn” or other krkn context in your questions. This helps the system retrieve relevant context from the Krkn knowledge base, rather than generating general answers from unrelated sources.
```bash
git clone https://github.com/krkn-chaos/krkn-assist.git
cd krkn-assist
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
```

If using the LLaMA 3.1 LLM (recommended), run this script:
```bash
brew install ollama
ollama run llama3
```

If using the LLaMA 2 (llama2:7b) LLM, run this script:
```bash
brew install ollama
ollama pull llama2:7b
```

Download instructions here
Ensure that Ollama is running in the background.
- open main.py and uncomment the code for the LLM you would like to use (an illustrative sketch follows this list)
- run python3 main.py (or python main.py, depending on your Python version)
- run streamlit run app.py
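As a rough illustration of the LLM-selection step, the uncommented block in main.py might look something like the sketch below. The import path, variable names, and model tags are assumptions based on the Ollama commands above; the actual code in main.py may differ.

```python
# Illustrative only -- main.py's actual structure may differ.
from langchain_ollama import ChatOllama

# Uncomment exactly one of the following:
llm = ChatOllama(model="llama3")        # LLaMA 3.1 via Ollama (recommended)
# llm = ChatOllama(model="llama2:7b")   # LLaMA 2 7B via Ollama
# llm = ...                             # IBM Granite (see main.py for details)
```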
- Document Processing: The system loads and processes documentation files from the docs/ directory under github.com/krkn-chaos/website, splitting them into manageable chunks for efficient retrieval (see the ingestion sketch after this list). Documents can be loaded as:
  - PDF (stored in a specific folder)
  - Markdown files
  - URLs
- Vector Database Creation: Document chunks are converted into embeddings using HuggingFace's sentence transformers and stored in a Chroma vector database for semantic search (also covered in the ingestion sketch after this list).
- RAG Pipeline Setup: A Retrieval-Augmented Generation (RAG) pipeline is established using LangGraph and LangChain, combining document retrieval with language model generation (see the pipeline sketch after this list).
- Model Integration: The chatbot integrates with your chosen LLM (IBM Granite, LLaMA 3.1 via Ollama, or LLaMA 2) to generate contextually relevant responses.
- Query Processing: When you ask a question, the system (see the pipeline sketch after this list):
  - Retrieves relevant document chunks from the vector database
  - Provides context to the language model
  - Generates an answer based on the retrieved KRKN documentation
  - Returns the response with source citations when available
- Interactive Chat: The terminal interface allows for continuous conversation, maintaining context throughout the session (see the chat-loop sketch after this list).
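The following is a minimal sketch of the document-processing and vector-database steps above, assuming LangChain's community loaders, a recursive character splitter, the all-MiniLM-L6-v2 sentence-transformer model, and placeholder paths and URLs. The loaders, chunk sizes, and embedding model actually used in this repo may differ.

```python
from langchain_community.document_loaders import DirectoryLoader, PyPDFLoader, TextLoader, WebBaseLoader
from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_community.vectorstores import Chroma
from langchain_text_splitters import RecursiveCharacterTextSplitter

# Load Markdown docs from a local checkout of github.com/krkn-chaos/website,
# plus any PDFs stored in a local folder and any extra URLs (paths are placeholders).
docs = DirectoryLoader("website/docs", glob="**/*.md", loader_cls=TextLoader).load()
docs += PyPDFLoader("pdfs/krkn-pod-scenarios.pdf").load()
docs += WebBaseLoader(["https://krkn-chaos.dev/docs/"]).load()

# Split documents into manageable chunks for retrieval.
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
chunks = splitter.split_documents(docs)

# Embed the chunks with a HuggingFace sentence transformer and store them in Chroma.
embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
vectordb = Chroma.from_documents(chunks, embeddings, persist_directory="chroma_db")
```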
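Next, a minimal sketch of the RAG pipeline, model integration, and query-processing steps, assuming the vectordb built in the previous sketch and LLaMA 3.1 served by Ollama. The node names, prompt wording, and citation format are illustrative assumptions, not the repo's actual code.

```python
from typing import List, TypedDict

from langchain_core.documents import Document
from langchain_ollama import ChatOllama
from langgraph.graph import END, START, StateGraph

llm = ChatOllama(model="llama3")  # or llama2:7b / IBM Granite, depending on main.py
retriever = vectordb.as_retriever(search_kwargs={"k": 4})  # vectordb from the sketch above

class RAGState(TypedDict):
    question: str
    context: List[Document]
    answer: str

def retrieve(state: RAGState) -> dict:
    # Pull the most relevant KRKN documentation chunks for the question.
    return {"context": retriever.invoke(state["question"])}

def generate(state: RAGState) -> dict:
    # Ground the answer in the retrieved chunks and append source citations when available.
    context_text = "\n\n".join(doc.page_content for doc in state["context"])
    prompt = (
        "Answer the question using only the KRKN documentation below.\n\n"
        f"{context_text}\n\nQuestion: {state['question']}"
    )
    answer = llm.invoke(prompt).content
    sources = sorted({doc.metadata.get("source", "unknown") for doc in state["context"]})
    return {"answer": f"{answer}\n\nSources: {', '.join(sources)}"}

builder = StateGraph(RAGState)
builder.add_node("retrieve", retrieve)
builder.add_node("generate", generate)
builder.add_edge(START, "retrieve")
builder.add_edge("retrieve", "generate")
builder.add_edge("generate", END)
graph = builder.compile()

result = graph.invoke({"question": "What does a krkn pod scenario do?"})
print(result["answer"])
```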
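Finally, the interactive-chat behavior could be approximated by a simple terminal loop over the compiled graph from the previous sketch. The actual loop in main.py, and how it carries conversation context between turns, may differ.

```python
# Minimal terminal chat loop; prior turns are folded into the next question
# so follow-up questions keep their context.
history: list[str] = []

while True:
    question = input("You: ").strip()
    if question.lower() in {"exit", "quit", ""}:
        break
    result = graph.invoke({"question": "\n".join(history + [question])})
    print(f"Bot: {result['answer']}\n")
    history.append(f"Q: {question}\nA: {result['answer']}")
```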
Planned enhancements can be found in the roadmap.
LLM performance improves significantly with better laptop hardware. The LLM was tested on two different laptops:
- Laptop 1: Apple M3 Pro, 36 GB RAM, 12-core CPU, 18-core GPU
- Laptop 2: Apple M1, 16 GB RAM, 8-core CPU, 12-core GPU
With the LLaMA 3.1 LLM, answers were generated in under 10 seconds on Laptop 1 and in 15-30 seconds on Laptop 2.
If you want to evaluate the performance of the LLM used to generate answers, see the user guide to the evaluation pipeline:
Note: The outputs of steps 1-3 are the files in the evaluationPipeline folder.
- open eval.py and uncomment the code for the model you are evaluating
- edit the email field on line 121 with the email address that the evaluation metrics should be sent to
- after the script runs, open the JSON file (the file name is on line 125)
- copy the entire JSON file and open the Evaluation Pipeline Endpoint (you must be connected to the VPN)
- make sure the JSON structure matches the format required by the endpoint, then paste it into these three endpoints: /evaluate_context_retrieval, /evaluate_response, and /evaluate_all (a sketch for submitting the JSON over HTTP follows this list)
- the evaluation metrics should then be emailed to you
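If you prefer to submit the JSON programmatically rather than pasting it into the endpoint UI, a sketch like the following could work. The base URL, output file name, and payload shape are placeholders and assumptions; use the real endpoint URL (VPN required) and the file name printed by eval.py.

```python
import json

import requests

BASE_URL = "https://evaluation-pipeline.example.com"  # placeholder; use the real Evaluation Pipeline Endpoint
OUTPUT_FILE = "evaluation_results.json"               # placeholder; the real name is set on line 125 of eval.py

with open(OUTPUT_FILE) as f:
    payload = json.load(f)

# Post the same payload to each of the three evaluation endpoints.
for endpoint in ("/evaluate_context_retrieval", "/evaluate_response", "/evaluate_all"):
    response = requests.post(f"{BASE_URL}{endpoint}", json=payload, timeout=120)
    response.raise_for_status()
    print(endpoint, "->", response.status_code)
```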