Spider scraper for Funda, HousingTarget.com, Kamernet, etc. that fetches the latest housing listings and optimizes listing selection for personal preference.

PhatLeBao-Phat/scraper-housing


scraper-housing

This project is a web scraper that collects housing listings from HousingTarget.com. It automates gathering property details such as location, price, size, and description. It also estimates travel time to the ABN AMRO HQ using the NS API and speeds up execution through parallel processing. The intended features are:

  • Scrapes housing data from HousingTarget.com
  • Filters listings based on city (Eindhoven, Amsterdam, Rotterdam)
  • Calculates travel distance to ABN AMRO HQ using NS API
  • Optimized with multiprocessing for faster execution
  • Recommends promising listings and sends alerts for them
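
The city filter and the parallel step in the list above could be sketched roughly as follows. This is a minimal illustration and not the repository's implementation: the `Listing` fields, `filter_by_city`, and `enrich_all` are hypothetical names, and a thread pool from `concurrent.futures` stands in for the multiprocessing the README mentions, on the assumption that the per-listing work is mostly network-bound.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, Iterable, List

# Cities named in the feature list.
TARGET_CITIES = ("Eindhoven", "Amsterdam", "Rotterdam")


@dataclass(frozen=True)
class Listing:
    """Hypothetical record for one scraped listing."""
    url: str
    city: str
    price_eur: int
    size_m2: int


def filter_by_city(listings: Iterable[Listing],
                   cities: Iterable[str] = TARGET_CITIES) -> List[Listing]:
    """Keep listings whose city is one of the targets (case-insensitive)."""
    wanted = {c.strip().lower() for c in cities}
    return [item for item in listings if item.city.strip().lower() in wanted]


def enrich_all(listings: Iterable[Listing],
               worker: Callable[[Listing], object],
               max_workers: int = 4) -> list:
    """Apply `worker` to every listing concurrently, preserving input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(worker, listings))
```

Here `worker` would be whatever per-listing step is slow, for example a travel-time lookup. Swapping `ThreadPoolExecutor` for `ProcessPoolExecutor` gives process-based parallelism, provided `worker` is a picklable top-level function.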

1. Project Layout

scraper-housing/
|-- logs            # Fork the processing
|-- src             # ns APIs fork 
|-- scraper.py      # Main scraper script 
|-- config.cfg      # Config file 
|-- outputs         # Temp storage for ETLs
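
As an illustration of how scraper.py might read config.cfg, here is a small sketch using Python's standard `configparser`. The section names, keys, and the `load_settings` helper are all assumptions made for this example; the actual contents of config.cfg are not documented here.

```python
import configparser

# Hypothetical contents for config.cfg (section and key names are assumptions).
EXAMPLE_CFG = """
[scraper]
cities = Eindhoven, Amsterdam, Rotterdam
workers = 4

[ns_api]
subscription_key = YOUR_KEY_HERE
"""


def load_settings(text: str) -> dict:
    """Parse INI-style config text into a plain dict of settings."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    raw_cities = parser.get("scraper", "cities", fallback="")
    return {
        # Split the comma-separated city list and drop empty entries.
        "cities": [c.strip() for c in raw_cities.split(",") if c.strip()],
        "workers": parser.getint("scraper", "workers", fallback=1),
        "ns_key": parser.get("ns_api", "subscription_key", fallback=""),
    }
```

For a file on disk, `parser.read("config.cfg")` would replace `read_string`.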

2. Items

Open question: should one function pull places, stations, etc. for a given query and country, and return a single object covering all kinds of results? TODO:

  • Write the get_places function for the NS API
  • Follow link instead of requesting a new html [1:00] [DONE]
  • Look into how to do parallelism here [3:00]
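
One possible shape for the get_places item is sketched below. It is a guess at a design, not the planned implementation: the endpoint URL, the `q` and `country` parameter names, the subscription-key header, and the `payload` / `type` / `locations` response layout are all assumptions that should be checked against the NS API portal. The HTTP call is injected as `fetch` so the grouping logic can be exercised without network access.

```python
import json
import urllib.parse
import urllib.request
from typing import Callable, Dict, List, Optional

# Assumed endpoint; verify against the NS API portal before use.
PLACES_URL = "https://gateway.apiportal.ns.nl/places-api/v2/places"


def build_places_url(query: str, country: Optional[str] = None) -> str:
    """Build the request URL; "q" and "country" are placeholder parameter names."""
    params = {"q": query}
    if country:
        params["country"] = country
    return PLACES_URL + "?" + urllib.parse.urlencode(params)


def http_fetch(url: str, api_key: str) -> dict:
    """Perform the real request. The header name is an assumption."""
    request = urllib.request.Request(
        url, headers={"Ocp-Apim-Subscription-Key": api_key}
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.load(response)


def get_places(query: str,
               country: Optional[str],
               fetch: Callable[[str], dict]) -> Dict[str, List[dict]]:
    """Return locations grouped by kind, e.g. {"station": [...], ...}.

    Assumes a response shaped like
    {"payload": [{"type": "<kind>", "locations": [...]}, ...]}.
    """
    data = fetch(build_places_url(query, country))
    grouped: Dict[str, List[dict]] = {}
    for group in data.get("payload", []):
        kind = group.get("type", "unknown")
        grouped.setdefault(kind, []).extend(group.get("locations", []))
    return grouped
```

Returning a dict keyed by kind is one answer to the "object of all kinds" question: callers pick the kinds they care about. A real call could pass `functools.partial(http_fetch, api_key=...)` as `fetch`.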
