pip install requests
pip install beautifulsoup4
pip install pymupdfThen run main.py by invoking the following command
python main.py #script to run all codeTo run the crawler and converter scripts seperately run the following commands in succession.
python crawler.py #script to download the pdf files
python conversion.py #script to convert all the pdf files to .txt and .xmlCheck the folder for the generated files