SDGs Hackathon

Here you can find a presentation explaining who we are and the problem we want to solve.

Dataset

The dataset can be found in this Google Drive folder.

Files you can find in the dataset

categories.json: A JSON file containing our current categorization. It has three levels: topic, subtopic and tag. Only the first level (topic) is needed for this model.
initiatives.json.zip: A zipped file containing our full dataset in JSON format. Only the title and content fields of each entry need to be used to identify the SDGs. The whole dataset has been tagged, and fewer than 50% of the documents returned results with our current system.
small-batch.json: A sample file with just 20 items for testing purposes.
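
As a starting point, the files above could be read along these lines. This is only a minimal sketch: it assumes each JSON file holds a list of objects with "title" and "content" keys, and that the zip archive contains a single JSON file. The helper names load_entries and entry_text are hypothetical and do not come from this repository.

```python
import json
import zipfile
from pathlib import Path


def load_entries(path):
    """Load entries from a .json file, or from a .zip holding one JSON file."""
    path = Path(path)
    if path.suffix == ".zip":
        with zipfile.ZipFile(path) as archive:
            # Assumption: the archive holds one JSON file; use the first .json member.
            member = next(n for n in archive.namelist() if n.endswith(".json"))
            with archive.open(member) as handle:
                return json.load(handle)
    with path.open(encoding="utf-8") as handle:
        return json.load(handle)


def entry_text(entry):
    """Join the title and content fields into one string, skipping missing values."""
    title = entry.get("title") or ""
    content = entry.get("content") or ""
    return f"{title}\n{content}".strip()
```

For a quick check, something like `texts = [entry_text(e) for e in load_entries("small-batch.json")]` should give one string per item, provided the assumed layout matches the real files.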

Call link

Jitsi room

Dev notes

For better versioning, please install Jupytext and follow this guide.

Folders and files

.ipynb_checkpoints
1_preprocess.ipynb
1_preprocess.py
2_final_features.ipynb
3_modeling.ipynb
3_modeling.py
LICENSE
README.md
get_tagger_response.py
read_data.py
remove_stop_words.py
stopwords.csv

politicalwatch/sdgs-hackathon