Repository for the code of the hackathon to be able to identify SDG topics inside parliamentary activity documents.

politicalwatch/sdgs-hackathon

SDGs Hackathon

Here you can find a presentation explaining who we are and the problem we want to solve.

Dataset

The dataset can be found in this Google Drive folder.

Files you can find in the dataset

  • categories.json: A JSON file containing our current categorization, organized in three levels: topic, subtopic and tag. Only the first level (topics) is needed for this model.
  • initiatives.json.zip: A zipped file with our whole dataset in JSON format. Only the title and content fields of each entry need to be used to identify the SDGs. The whole dataset has been tagged with our current system, which returned results for fewer than 50% of the documents.
  • small-batch.json: A sample file with just 20 items for testing purposes.
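
As a starting point, the files above could be loaded along these lines. This is only a minimal sketch: it assumes each file holds a top-level JSON list of objects with `title` and `content` keys, and that the zip archive contains a single JSON file. The helper names `load_entries` and `entry_text` are hypothetical and not part of this repository.

```python
import json
import zipfile
from pathlib import Path


def load_entries(path):
    """Load entries from a .json file, or from a .zip archive holding one JSON file."""
    path = Path(path)
    if path.suffix == ".zip":
        with zipfile.ZipFile(path) as archive:
            # Assumption: the archive contains a single JSON file.
            json_names = [n for n in archive.namelist() if n.endswith(".json")]
            with archive.open(json_names[0]) as handle:
                return json.load(handle)
    with path.open(encoding="utf-8") as handle:
        return json.load(handle)


def entry_text(entry):
    """Join the title and content fields into one string; other fields are ignored."""
    # Assumption: missing or null fields are treated as empty text.
    title = entry.get("title") or ""
    content = entry.get("content") or ""
    return f"{title}\n{content}".strip()
```

For a quick test on the sample file, something like `texts = [entry_text(e) for e in load_entries("small-batch.json")]` should give one string per document, ready to pass to whatever model is being tried.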

Call link

Jitsi room

Dev notes

For better versioning, please install Jupytext and follow this guide.
