This repository contains two LLM-powered agent systems for information retrieval and web search. These systems leverage LLMs, semantic search, and web scraping to enhance knowledge discovery and retrieval.
A retrieval-augmented generation system that utilizes ChromaDB for efficient document retrieval.
📌 Features:
- Embeds and indexes documents.
- Uses
ChromaDB
for vector-based semantic search. - Supports multiple LLM providers:
LiteLLM
,HfApi
,Transformers
. - Interactive Gradio UI for easy interaction.
📂 Find more details in the Agentic RAG directory.
A multi-agent system that performs web searches and retrieves webpage content for basde on user's query.
📌 Features:
- Uses
DuckDuckGo
search API to find relevant web pages. - Converts webpage content into Markdown format for readability.
- A manager agent oversees the web search agent.
- Supports multiple LLM providers:
LiteLLM
,HfApi
,Transformers
. - Interactive Gradio UI for seamless user interaction.
📂 Find more details in the Multi-Agent directory.
A Browser Automation Vision-Language Model (VLM) Agent that automates web browsing tasks, such as navigating web pages, searching for content, handling popups, and extracting information.
📌 Features:
- Web Automation: Navigate to web pages, click on elements, and search within pages.
- Popup Handling: Close modals and popups with ease.
- Information Extraction: Extract and process webpage content.
- Screenshot Capture: Automatically captures screenshots during interactions.
- LLM Support: Works with models from
LiteLLM
,HfApi
, andTransformers
. - Gradio UI: Interactive web interface for user interaction.
📂 Find more details in the Browser Automation directory.
- Clone the repo.
- uv is recommended for easy install and setup for all agents in this repo. You only need run each script in their directory (
uv
will create the venv and install all the dependencies for you).
cd agentic_rag
uv run agentic_rag_chromadb.py
cd multi_agent
uv run multi_agent_web_search.py
cd browser_auto_agent
uv run browser_auto_vlm_agent.py
See LICENSE
for more details.
Contributions are welcome! Feel free to open issues and submit pull requests. 🚀