A robust web scraping tool that extracts comprehensive information about Cambridge schools worldwide from the "Find a Cambridge School" directory on the official Cambridge International Education website.
- Extract school data from 160+ countries
- Collect detailed school information, including location, center, and whether private candidates are accepted
- Export data in multiple formats (CSV, JSON)
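As an illustration of the CSV and JSON export mentioned above, here is a minimal sketch using only the Python standard library. The `export_schools` function, the output file names, and the field names in `FIELDS` are assumptions made for this example; the scraper's actual export API and record layout may differ.

```python
import csv
import json
from pathlib import Path

# Hypothetical record layout; the real scraper's field names may differ.
FIELDS = ["name", "country", "location", "center", "private_candidates"]


def export_schools(schools, out_dir="."):
    """Write a list of school dicts to schools.csv and schools.json."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    csv_path = out / "schools.csv"
    json_path = out / "schools.json"

    # CSV: one row per school; keys not listed in FIELDS are ignored,
    # and missing keys are written as empty cells.
    with csv_path.open("w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS, extrasaction="ignore")
        writer.writeheader()
        writer.writerows(schools)

    # JSON: keep non-ASCII school names readable instead of escaping them.
    with json_path.open("w", encoding="utf-8") as f:
        json.dump(schools, f, ensure_ascii=False, indent=2)

    return csv_path, json_path
```

Note that CSV stores every value as text, so a boolean such as `private_candidates` comes back as the string `"True"` or `"False"` when the file is read again, while JSON preserves the type.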
- Clone the repository:

      git clone https://github.com/yourusername/cambridge_school_scraper.git
      cd cambridge_school_scraper

- Create a virtual environment (recommended):

      python -m venv venv
      source venv/bin/activate  # On Windows: venv\Scripts\activate

- Install dependencies:

      pip install -r requirements.txt

This project requires:
- Python 3.7+
- Selenium WebDriver
- Chrome/Firefox WebDriver (depending on your configuration)
- Additional dependencies listed in requirements.txt
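The requirements above can be checked before running the scraper. The following is a minimal sketch, not part of the project: `check_environment` is a hypothetical helper that verifies the Python version and that the `selenium` package is importable. It does not check for a Chrome or Firefox WebDriver binary.

```python
import sys
from importlib import util


def check_environment(min_version=(3, 7), modules=("selenium",)):
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []

    # Compare the running interpreter against the minimum required version.
    if sys.version_info < min_version:
        required = ".".join(str(part) for part in min_version)
        problems.append(f"Python {required}+ required")

    # find_spec returns None when a top-level package is not installed.
    for name in modules:
        if util.find_spec(name) is None:
            problems.append(f"missing package: {name}")

    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print(problem)
```

Returning a list rather than raising on the first failure lets all missing requirements be reported in one pass.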
    from cambridge_scraper import CambridgeSchoolScraper

    # Initialize the scraper
    scraper = CambridgeSchoolScraper()

Contributions are welcome! To contribute:
- Fork this repository
- Clone your fork
- Set up pre-commit hooks:

      pre-commit install

- Create a new branch for your feature
- Make your changes
- Submit a pull request
This project is licensed under the MIT License.