🎯 Competitive Programming Test Case Generator

Transform coding problem ideas into ready-to-test bundles with a single click

Features • Quick Start • Usage • Contributing • License

🚀 What is This?

A powerful FastAPI web application that scrapes coding problems from top competitive programming platforms, generates test cases with .in/.out files, and packages everything into downloadable bundles. Perfect for educators, competitive programmers, and coding interview preparation.

✨ Features

🌐 Multi-Platform Support: Scrapes from Codeforces, LeetCode, CodeChef, GeeksforGeeks, and AtCoder
🎨 Modern Web UI: Clean, single-page interface with real-time progress tracking
📊 Live Updates: Watch your scraping job progress with streaming logs
🎯 Smart Filtering: Select specific platforms and difficulty levels
✅ Problem Curation: Review, accept, or reject problems before downloading
📦 Instant Downloads: Get filtered ZIP bundles with organized test cases
⚡ Placeholder Mode: Generate synthetic problems instantly for testing
🔄 Job Queue: Background processing with concurrent job support

📸 Demo

Note: Screenshots and demo video coming soon! In the meantime, try the app yourself following the setup below.

💡 Example Workflow

Enter: "Give me 5 medium difficulty sorting problems"
Select platforms: Codeforces, LeetCode
Watch real-time scraping progress
Review and curate problems
Download filtered bundle with organized test cases

🏃 Quick Start

Prerequisites

Python 3.11+ (Download)
Git (Download)

Installation

# 1. Clone the repository
git clone https://github.com/AVPthegreat/codebase-problem-scrapper.git
cd codebase-problem-scrapper

# 2. Create virtual environment
python3 -m venv .venv

# 3. Activate virtual environment
# On macOS/Linux:
source .venv/bin/activate
# On Windows:
.venv\Scripts\activate

# 4. Install dependencies
pip install -e '.[dev]'

# 5. Launch the app
python scripts/run_web.py

First Run

Open your browser and navigate to http://127.0.0.1:8000

Try this example:

Prompt: "Give me 3 easy array problems"
Platforms: Select Codeforces
Difficulty: Easy
Click: "Generate Problems"

📖 Usage

Basic Workflow

Enter Your Prompt
- Describe what problems you want (e.g., "5 dynamic programming problems")
- Be specific about topics, difficulty, or quantity
Configure Options
- Platforms: Choose one or more (Codeforces, LeetCode, etc.)
- Difficulty: Easy, Medium, Hard, or Mixed
- Placeholder Mode: Enable for instant synthetic test problems
Monitor Progress
- Real-time progress bar and live logs
- See which platforms are being scraped
- Track problem discovery in real-time
Curate Results
- Review all discovered problems
- Accept ✅ or reject ❌ individual problems
- See problem descriptions and metadata
Download Bundle
- Click "Download Selected Problems"
- Get a ZIP file with organized folders
- Each problem includes .in and .out test files

Advanced Features

Placeholder Mode 🎭

Instant synthetic problems for UI testing
No network requests or rate limiting
Perfect for demos and development

Platform Filtering 🔍

Select specific platforms for targeted scraping
Combine multiple sources in one bundle
Leave all unchecked to search everywhere

Job Queue 📋

Multiple jobs can be queued
Background processing doesn't block UI
Check /recent for job history

🏗️ Project Structure

codebase-problem-scrapper/
├── src/
│   ├── app/
│   │   └── services/
│   │       ├── orchestrator.py      # Core bundle generation logic
│   │       └── scrapers/            # Platform-specific scrapers
│   │           ├── codeforces.py
│   │           ├── leetcode.py
│   │           ├── codechef.py
│   │           ├── geeksforgeeks.py
│   │           └── atcoder.py
│   └── webapp/
│       ├── main.py                   # FastAPI application
│       └── templates/                # Jinja2 HTML templates
│           ├── index.html            # Main UI
│           ├── job.html              # Job details
│           └── recent.html           # Job history
├── scripts/
│   └── run_web.py                    # Server launcher
├── tests/
│   └── test_orchestrator.py         # Unit tests
├── .github/
│   ├── ISSUE_TEMPLATE/               # Bug & feature templates
│   └── pull_request_template.md     # PR template
├── pyproject.toml                    # Dependencies & config
├── LICENSE                           # MIT License
├── CODE_OF_CONDUCT.md               # Community guidelines
├── SECURITY.md                       # Security policy
└── README.md                         # You are here!

⚙️ Configuration

Environment Variables

Currently, the app runs without external API keys. Future enhancements may include:

OpenAI integration for intelligent test case generation
OAuth for platform authentication
Custom scraping rate limits

Custom Settings

Edit scripts/run_web.py to customize:

Port: Change from default 8000
Host: Bind to 0.0.0.0 for network access (add auth first!)
Workers: Adjust concurrent scraping jobs

🧪 Testing

# Run all tests
pytest tests/

# Run with coverage
pytest tests/ --cov=src --cov-report=html

# Run specific test file
pytest tests/test_orchestrator.py -v

# Check code style
ruff check src/ tests/

🛡️ Security & Best Practices

✅ Never commit .env files or virtual environments
✅ Respect rate limits when scraping live platforms
✅ Use placeholder mode for demos and testing
✅ Add authentication before exposing beyond localhost
✅ Review the Security Policy before reporting vulnerabilities

🐛 Troubleshooting

Port 8000 already in use

# Kill process on port 8000
lsof -ti:8000 | xargs kill -9

# Or run on different port
# Edit scripts/run_web.py and change port number

Missing dependencies error

# Reinstall all dependencies
pip install -e '.[dev]'

# Or install specific package
pip install <package-name>

Virtual environment not activating

# On macOS/Linux
source .venv/bin/activate

# On Windows
.venv\Scripts\activate

# On Windows PowerShell
.venv\Scripts\Activate.ps1

Tests failing

# Ensure you're in virtual environment
source .venv/bin/activate  # or .venv\Scripts\activate

# Reinstall dependencies
pip install -e '.[dev]'

# Run tests with verbose output
pytest tests/ -v

Scraping returns no results

Check your internet connection
Some platforms may have rate limits or anti-scraping measures
Try placeholder mode for testing
Check platform availability (some may be down temporarily)

🤝 Contributing

We love contributions! Whether it's bug fixes, new features, or documentation improvements.

How to Contribute

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'feat: add amazing feature')
Push to your branch (git push origin feature/amazing-feature)
Open a Pull Request

See CONTRIBUTING.md for detailed guidelines.

Contribution Ideas

🎨 Add more platform scrapers
🚀 Improve scraping accuracy and speed
📱 Create mobile-responsive UI
🤖 Integrate AI for test case generation
🌐 Add internationalization support
📊 Add analytics and statistics

📜 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

FastAPI - Modern Python web framework
Uvicorn - Lightning-fast ASGI server
Jinja2 - Powerful templating engine
All the competitive programming platforms for providing great problems

📬 Contact & Support

Author: Anant Vardhan Pandey
GitHub: @AVPthegreat
Issues: Report a bug or request a feature

Built with ❤️ by AVPTHEGREAT

⭐ Star this repo if you find it helpful!

Report Bug • Request Feature

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🎯 Competitive Programming Test Case Generator

🚀 What is This?

✨ Features

📸 Demo

💡 Example Workflow

🏃 Quick Start

Prerequisites

Installation

First Run

📖 Usage

Basic Workflow

Advanced Features

🏗️ Project Structure

⚙️ Configuration

Environment Variables

Custom Settings

🧪 Testing

🛡️ Security & Best Practices

🐛 Troubleshooting

🤝 Contributing

How to Contribute

Contribution Ideas

📜 License

🙏 Acknowledgments

📬 Contact & Support

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

🎯 Competitive Programming Test Case Generator

🚀 What is This?

✨ Features

📸 Demo

💡 Example Workflow

🏃 Quick Start

Prerequisites

Installation

First Run

📖 Usage

Basic Workflow

Advanced Features

🏗️ Project Structure

⚙️ Configuration

Environment Variables

Custom Settings

🧪 Testing

🛡️ Security & Best Practices

🐛 Troubleshooting

🤝 Contributing

How to Contribute

Contribution Ideas

📜 License

🙏 Acknowledgments

📬 Contact & Support