A Python tool to search and analyze Google Scholar articles by keywords, sort by citations, and extract useful information.
- Search Google Scholar for academic papers
- Sort results by citation count
- Extract keywords from abstracts
- Save results to CSV file
-
Clone this repository:
git clone https://github.com/xodn348/google-scholar-search.git cd google-scholar-search
-
Install required packages:
pip install serpapi pandas nltk python-dotenv
- Create a
.env
file with your SerpAPI key and search parameters:SERPAPI_KEY=your_serpapi_key_here
- Get a SerpAPI key from https://serpapi.com/ (free tier available)
Run the script:
python3 serp.py
The tool will:
- Search Google Scholar for your keyword
- Retrieve up to 100 papers
- Sort them by citation count
- Extract keywords from abstracts
- Save results to a CSV file (e.g.,
byzantine_fault_tolerance_papers.csv
)
The script requires these Python modules:
serpapi
: For Google Scholar API accesspandas
: For data handling and CSV exportnltk
: For natural language processing and keyword extractionpython-dotenv
: For loading environment variables
The CSV output includes:
- Paper title
- Snippet/abstract
- Link to paper
- Citation count
- Extracted keywords
The script also prints the top 10 most frequent keywords in the results.
Apache-2.0 License
Junhyuk Lee (@xodn348)