NLP with Disaster Tweets

Predict which Tweets are about real disasters and which ones are not.

Alexander Bricken

See articles here:

Submission Accuracy and Position on Leaderboard (at time of post): 84.063%, position #71 (although #52 if you subtract cheaters).

Project structure:

├── README.md                     <- The top-level README for developers using this project.
├── data
│   ├── raw                       <- The raw data
│   ├── submissions               <- The final data to be submitted
│
│
├── requirements.txt              <- Requirements for this project.
│
├── utils.py                      <- Utility functions for project.
├── tweet-scraping.py             <- Tweet scraping for more data.
│
├── notebooks                     <- Jupyter notebooks for this project.
│   ├── nlp_disaster_tweets       <- The main Jupyter notebook
│
├── data-dictionary.txt <- Data dictionaries, manuals, and all other explanatory materials.

Data

Raw data source: https://www.kaggle.com/c/nlp-getting-started/overview

Using The Project

Check in the notebooks folder to see the associated exploratory analysis.

If you want to play with it, simply type git clone https://github.com/Briiick/NLP-disaster-tweets.git in your terminal.

References

Natural Language Processing with Disaster Tweets

NLP with Disaster Tweets: EDA, cleaning and BERT

Basics of using pre-trained GloVe Vectors

Cleaning text data with Python

What is tokenization?

BERT Text Classification using Keras

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.ipynb_checkpoints		.ipynb_checkpoints
data		data
notebooks		notebooks
utility		utility
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLP with Disaster Tweets

Predict which Tweets are about real disasters and which ones are not.

Alexander Bricken

Submission Accuracy and Position on Leaderboard (at time of post): 84.063%, position #71 (although #52 if you subtract cheaters).

Project structure:

Data

Using The Project

References

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NLP with Disaster Tweets

Predict which Tweets are about real disasters and which ones are not.

Alexander Bricken

Submission Accuracy and Position on Leaderboard (at time of post): 84.063%, position #71 (although #52 if you subtract cheaters).

Project structure:

Data

Using The Project

References

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages