This portfolio provides a collection of Jupyter Notebooks designed to facilitate the use of open bibliometric data sources for higher education institutions. The goal is to offer tools for cleaning, analysing and improving open data for bibliometric analyses.
Using open data sources for research assessment enables more compliance with DORA (San Francisco Declaration on Research Assessment), CoARA (Coalition for Advancing Research Assessment) and POSI (The Principles of Open Scholarly Infrastructure) compared to licensed commercial databases. Open data are also aligned with FAIR principles (Findable, Accessible, Interoperable, Reusable). In comparison to established commercial databases, they usually provide:
- Unpaywalled access to the data
- More transparency and possibilities for reuse due to less restrictive licensing
- Broader inclusivity and diversity
- Community-driven improvements
The portfolio currently includes two open bibliometric data sources, OpenAlex and OpenAIRE.
This portfolio offers scripts to start using the API of open bibliometric data sources like OpenAIRE and OpenAlex by providing:
- Tutorials in form of Jupyter Notebooks.
- Tools to identify quality issues (in preparation).
- Scripts to perform customized bibliometric analyses.
This repository contains Jupyter Notebooks and dependencies for conducting initial bibliometric analyses.
requirements.txt
- List of required Python packages (install usingpip install -r requirements.txt
).open_colors.py
- Open source color scheme we use for the visualizations.openalex_first_analyses.ipynb
- Initial analyses of bibliometric data from OpenAlex.openaire_first_analyses.ipynb
- Initial analyses of bibliometric data from OpenAIRE.openalex_oa_analyses.ipynb
- Analysis of Open Access (OA) published article trends using OpenAlex.openaire_oa_analyses.ipynb
- Analysis of Open Access (OA) published article trends using OpenAIRE.
Ensure you have Python installed (preferably version 3.8+).
We recommend using a virtual environment to manage dependencies.
- Clone this repository:
git clone https://github.com/eth-library/tobi-portfolio
- Create a virtual environment and activate it:
python -m venv venv # Create a virtual environment source venv/bin/activate # Activate on macOS/Linux venv\Scripts\activate # Activate on Windows
- Install dependencies:
pip install -r requirements.txt
Launch Jupyter Notebook and open the desired .ipynb
file:
jupyter notebook
Or open the Notebook file in your IDE.
Contributions are welcome! If you find issues or want to suggest improvements, please open an issue or send us an E-Mail.
Disclaimer: The exclusive right of use and the exclusive exploitation rights of the software belong to ETH Zurich (Federal Act on the Federal Institutes of Technology, Art. 36, para. 2).
We are in the process of registering the software under an Open Source License.
Towards Open Bibliometric Indicators (TOBI) is a project co-founded by the ETH Library and swissuniversities within the Open Science Program.
More on the TOBI Website: https://eth-library.github.io/tobi/