This repository contains the code for the automated API harvesting of dataset records for the UAB Dataset Catalog. The catalog is located at https://digitalcommons.library.uab.edu/datasets/. Please note that this code is under development, and feel free to reach out to Claire at cwarner at rockefeller.edu with any questions, comments, or suggestions.
This code enables API-facilitated searches for research datasets published online, either in the Zenodo repository or on the DataCite Commons. It is currently oriented towards finding datasets affiliated with the University of Alabama at Birmingham, but is intended to be transferable to other institutions. The code then loads the metadata of the located datasets as a Pandas dataframe and manipulates it to generate a new dataframe. This dataframe is compatible with the batch upload spreadsheet for the collection in the UAB Digital Commons instance, where the dataset catalog is hosted.