AI Web Scraper

An AI-powered web scraper that uses free LLM providers to extract web data according to natural-language prompts. Users enter scraping instructions in a web UI, the application processes the web data through LLM APIs, and the results are exported as Word, PDF, Excel, or plain-text files.

Features

🌐 Intelligent Web Scraping: Uses AI to understand and extract data based on natural language prompts
🎨 Modern UI: Responsive React frontend with TypeScript and Tailwind CSS
⚡ Fast Backend: High-performance FastAPI backend with async support
📄 Multiple Output Formats: Generate results in Word, PDF, Excel, or text formats
🔐 Secure: API keys stored securely in environment variables
🧪 Well-Tested: Comprehensive test suite with high coverage
📚 Well-Documented: Complete guides for developers, testers, and users

Tech Stack

Frontend: Vite + React + TypeScript + Tailwind CSS
Backend: FastAPI + Python
LLM Integration: OpenRouter/OpenAI APIs
Output Generation: python-docx, reportlab, openpyxl
Testing: Pytest (backend), Vitest (frontend)

Quick Start

Prerequisites

Node.js 18+ and npm
Python 3.8+
Git

Setup

Clone the repository

git clone <repository-url>
cd ai-webscraper

Backend Setup

cd backend
python -m venv venv
venv\Scripts\activate        # On Windows
source venv/bin/activate     # On macOS/Linux
pip install -r requirements.txt

# Copy .env.example to .env and add your API keys
copy .env.example .env       # On Windows
cp .env.example .env         # On macOS/Linux
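
For readers curious how keys in `.env` might reach the backend, here is a minimal standard-library sketch that parses simple KEY=VALUE lines and fails loudly when a key is missing. It is an assumption about the general approach, not this project's actual loader; the helper names are hypothetical, and so is any variable name you pass in (for example `OPENROUTER_API_KEY`). Check `.env.example` for the real names.

```python
import os


def load_env_file(path=".env"):
    """Parse simple KEY=VALUE lines from a .env file into a dict."""
    values = {}
    with open(path, encoding="utf-8") as fh:
        for raw in fh:
            line = raw.strip()
            # Skip blank lines, comments, and lines without an assignment.
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            # Drop surrounding whitespace and optional quotes around the value.
            values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def require_api_key(name, env_values):
    """Return the key from the process environment or the parsed file.

    Raises RuntimeError if the key is absent or empty in both places.
    """
    key = os.environ.get(name) or env_values.get(name)
    if not key:
        raise RuntimeError(f"{name} is not set; add it to backend/.env")
    return key
```

Keeping the lookup in one place means a missing key surfaces at startup rather than on the first LLM call.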

Frontend Setup
```
cd frontend
npm install
```

Run the Application

Terminal 1 (Backend):

cd backend
uvicorn app.main:app --reload

Terminal 2 (Frontend):

cd frontend
npm run dev

Access the Application
- Frontend: http://localhost:5173
- Backend API: http://localhost:8000
- API Docs: http://localhost:8000/docs
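
FastAPI serves a machine-readable OpenAPI schema at `/openapi.json` by default, alongside the interactive docs page. Assuming this backend keeps that default, the sketch below lists the available routes once the server is running. The function names are illustrative, and no particular endpoint of this project is assumed.

```python
import json
import urllib.request

BACKEND_URL = "http://localhost:8000"  # Backend API address listed above

# Path items in an OpenAPI schema may hold non-method keys such as "parameters".
HTTP_METHODS = {"get", "post", "put", "patch", "delete", "options", "head", "trace"}


def list_routes(schema):
    """Return sorted (METHOD, path) pairs from an OpenAPI schema dict."""
    routes = []
    for path, operations in schema.get("paths", {}).items():
        for method in operations:
            if method.lower() in HTTP_METHODS:
                routes.append((method.upper(), path))
    return sorted(routes)


def fetch_schema(base_url=BACKEND_URL, timeout=5):
    """Download the OpenAPI schema; requires the backend to be running."""
    with urllib.request.urlopen(f"{base_url}/openapi.json", timeout=timeout) as resp:
        return json.load(resp)
```

With the backend started, `list_routes(fetch_schema())` returns the method and path pairs that the docs page also displays.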

Documentation

Project Structure

ai-webscraper/
├── backend/          # FastAPI backend
├── frontend/         # Vite React frontend
├── docs/            # Documentation
└── README.md        # This file

License

MIT License
