This project is a web scraper designed to collect housing listings from HousingTarget.com. It automates the process of gathering property details such as location, price, size, and description. Additionally, it estimates travel time to the ABN AMRO HQ using the NS API and optimizes performance through parallel processing. We listed here the intended features:
- Scrapes housing data from HousingTarget.com
- Filters listings based on city (Eindhoven, Amsterdam, Rotterdam)
- Calculates travel distance to ABN AMRO HQ using NS API
- Optimized with multiprocessing for faster execution
- Recommend and alert the potential housing
scraper-housing/
|-- logs # Fork the processing
|-- src # ns APIs fork
|-- scraper.py # Main scraper script
|-- config.cfg # Config file
|-- outputs # Temp storage for ETLsfunction to pull places/stations/etc... based on the query and country -> return object of all kinds? -> TODO:
- Write
get_placesfunction in ns api - Follow link instead of requesting a new html [1:00] [DONE]
- Look into how to do parallelism here [3:00]