GitHub

In this project, we designed a scoring system that captures the overarching concept of different levels of “inflammatory” comments on Reddit, performed annotations based on detailed guidelines, and extracted leading key words that are likely causing higher levels of “inflammatory” comments. We trained an bi-direcitonal LSTM model with an attention layer on a regression task in Pytorch to extract contributing keywords of higher inflammatory comments. Please see the heatmap example of inflammatory comment with highlighted intriguing keywords.

In order to utilize attention layer notebook, you need to obtain wording embedding files from Stanford NLP group at: https://nlp.stanford.edu/projects/glove/

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.DS_Store		.DS_Store
Attention_heat_map.ipynb		Attention_heat_map.ipynb
Final Project Regression Models.ipynb		Final Project Regression Models.ipynb
Language Heat Map.png		Language Heat Map.png
NLP_259_Report.pdf		NLP_259_Report.pdf
README.md		README.md
final_labels.csv		final_labels.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases

Packages

Languages

zheng100/NLP_Reddit_Project

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages