Team: Bryan Guevara, Patrick Fenn, Thet Zaw, Harris shepard
- curses
- bs4
To run the crawler follow these instructions
- Make sure you are connected to the internet
- Navigate to the folder containing the
general-threaded-crawler.pyfile - Create a
.txtfile with the seed links for the crawler (if not already there)
- Run the command
source exec.sh <max_pages> <max_threads> <url_threshold> <path_to_seed_file>
- Example:
source exec.sh 1000 100 100 ./seeds.txt
