- This project helps us to scrap out all WP Blog Posts Data in an excel sheet
- It checks whether it is a valid Wordpress Website or not with the domain we provide
- Saves the
.xlsxfile on the Desktop of Cross Platform OS - The web page is mobile friendly as well
- This is a local project, and not been hosted anywhere, however, solely for learning purpose, and to understand what
Web Scrappingis, in this case,Wordpress Post Scrappingis done
- You have python lates version installed in youtr box
- You have installed all the files imported in the
/web-scrapping-api/blog_data.py -
/web-scrapping-api/test.pyis just for testing puropse, not that important file
- You have a little bit of understanding of Python Backend API dev, if you are looking for understanding the code
- To run the project, simply go inside the folder
/web-scrapping-api/in your terminal and runpython3 blog_data.py. This will run the API file on server - You need to run the
home.htmlfile on your browser, simply drag and drop the file on the Web browser, and you are good to go
- Insert all the data asked in the required field.
Domain nameshould be withouthttps://and should be a valid wordpress website, for examplejacklyons.mePer Page Countis nothing but, required as per the Wordpress blog post per page, you can put your desired number you want.10is the best value to be put for which WP also followsTotal Pagesis the total number of pages in your posts, for instance if you look at the pagination, the last page number is the total number of the pages you have. So add it accordingly
- Python Flask => Flask
- Python Docs => Python
- Wordpress API Docs => WP API Handbook
- Web Scrapping Definition => Web Scrapping
- XLSX Python Docs => Python xlsxwriter Module