Skip to content

nmugurtay/scrape-interwebz

 
 

Repository files navigation

Binder

Extracting Data from the Internet in Python

This workshop will cover how to extract data from the web using Python. We'll be covering both APIs and webscraping.

Topics Covered

  • How the web works
  • Accessing databases via RESTful APIs
  • HTML / CSS
  • Manipulating a webpage with Google DevTools
  • Webscraping with Beautiful Soup
  • Scraping javascript-heavy sites and interactive sites with Selenium

Requirements

This workshop will be using the Python programming language. See the software requirements here.

We will assume a basic knowledge of Python. If you've taken the D-Lab's Python Intensive, that should be sufficient.

Please note that materials are still in development, and will be changing.

Contact

Rochelle Terman: rterman@gmail.com

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

About

Tutorial on extracting data via APIs and webscraping

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 87.6%
  • HTML 12.4%