NewUpdate : I have also created a local-llm based rag chatbot which is available in a seperate folder; It runs on Llama3.2 which I ran locally in my system. Feel free to check it out.
Experience the future of document interaction.
DocuSense AI is a premium, high-accuracy semantic search engine that deconstructs PDF documents into a neural vector space, allowing you to query insights with conversational precision.
- 🧠 Neural Semantic Analysis: Uses
Sentence-Transformersto map document hierarchy into high-dimensional vector representations. - ⚡ FAISS Acceleration: Powered by Meta's
FAISSfor lightning-fast similarity retrieval across massive text corpora. - 🔍 Dual-Stage Re-ranking: Implements a
Cross-Encoder(ms-marco-MiniLM-L-6-v2) to re-score hits, ensuring the top result is always the most contextually relevant. - ✨ Premium Glassmorphism UI: A state-of-the-art Streamlit interface featuring splash screens, data visualization, and micro-animations for an elite user experience.
| Component | Technology |
|---|---|
| Framework | Streamlit (Python) |
| Vector Engine | Meta FAISS (Intel-Optimized) |
| Embeddings | Sentence-Transformers (all-MiniLM-L6-v2) |
| Re-ranker | Cross-Encoder (MS-Marco) |
| Processing | Regex Sanitization & Chunking |
Ensure you have Python 3.9+ installed and a virtual environment active.
# Clone the repository
git clone https://github.com/vijayrajeshr/DocuSense-AI.git
cd DocuSense-AI
# Install dependencies
pip install -r requirements.txtstreamlit run app.py- Neural Sync: Upon launch, the engine completes a 3-second system synchronization.
- Deconstruction: Upload any PDF. The engine performs lexical analysis and geometry extraction.
- Vectorization: Sentences are normalized and transformed into dense vectors.
- Querying: Input conversational questions. The engine performs a multi-stage search and provides "Verified Insights" with a confidence score.
Hi, I have also created a local-llm based rag chatbot which is available in a seperate folder; It runs on Llama3.2 which I ran locally in my system. Feel free to check it out.
graph TD
A[PDF Upload] --> B[Text Decomposition]
B --> C[Neural Chunking]
C --> D[Vector Embedding]
D --> E[FAISS Indexing]
E --> F[Semantic Search]
F --> G[Cross-Encoder Re-ranking]
G --> H[Verified Insight]
DocuSense AI processes all data locally within your current session. No document text or vector weights are transmitted to external servers, ensuring 100% data sovereignty.
Built for precision. Designed for the elite.
DocuSense AI — Where documents find their voice.
