Skip to content

Briiick/NLP-disaster-tweets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NLP with Disaster Tweets

Predict which Tweets are about real disasters and which ones are not.

Alexander Bricken


See articles here:

Submission Accuracy and Position on Leaderboard (at time of post): 84.063%, position #71 (although #52 if you subtract cheaters).

Project structure:

├── README.md                     <- The top-level README for developers using this project.
├── data
│   ├── raw                       <- The raw data
│   ├── submissions               <- The final data to be submitted
│
│
├── requirements.txt              <- Requirements for this project.
│
├── utils.py                      <- Utility functions for project.
├── tweet-scraping.py             <- Tweet scraping for more data.
│
├── notebooks                     <- Jupyter notebooks for this project.
│   ├── nlp_disaster_tweets       <- The main Jupyter notebook
│
├── data-dictionary.txt <- Data dictionaries, manuals, and all other explanatory materials.

Data

Raw data source: https://www.kaggle.com/c/nlp-getting-started/overview

Using The Project

Check in the notebooks folder to see the associated exploratory analysis.

If you want to play with it, simply type git clone https://github.com/Briiick/NLP-disaster-tweets.git in your terminal.

References

Natural Language Processing with Disaster Tweets

NLP with Disaster Tweets: EDA, cleaning and BERT

Basics of using pre-trained GloVe Vectors

Cleaning text data with Python

What is tokenization?

BERT Text Classification using Keras

Releases

No releases published

Packages

 
 
 

Contributors