Kaggle: Real or Not? NLP with Disaster Tweets

Overview

A public Kaggle competition to predict which Tweets are about real disasters and which ones aren't, using machine learning models.

Summary

As part of our personal development and continuing education, I joined this Kaggle competition with a group of friends to improve our knowledge and gain more experience in the NLP field.

We decided to join this competition as a team to enrich each other's experience and obtain better results through collaboration.

Steps

Data exploration
Data cleaning with Python, Pandas and Regex
Spell checking and validation of words
Tokenization
Lemmatization
Vectorization of the data and removal of stop words
Exploration of different supervised classification models with Sklearn
Hyperparameter tuning with Grid Search, and use of H2O, to improve the accuracy of the models
Preparation of the submission file
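
The core of the steps above (cleaning, vectorization with stop-word removal, an SVC model, and a grid search) can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the `clean_tweet` helper, the toy tweets, and the parameter grid are assumptions made up for the example, and spell checking and lemmatization are left out for brevity.

```python
import re

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.svm import SVC


def clean_tweet(text):
    """Hypothetical helper: lowercase a tweet and strip URLs, mentions, non-letters."""
    text = text.lower()
    text = re.sub(r"https?://\S+", " ", text)  # drop links
    text = re.sub(r"@\w+", " ", text)          # drop user mentions
    text = re.sub(r"[^a-z\s]", " ", text)      # keep letters only ('#' goes, the tag word stays)
    return re.sub(r"\s+", " ", text).strip()   # collapse repeated whitespace


# Made-up toy data (assumption); the real project would read train.csv instead.
texts = [
    "Forest fire spreading near the village, residents evacuated",
    "Massive earthquake destroys buildings downtown",
    "Flood waters rising, rescue teams deployed http://t.co/x",
    "Hurricane damage leaves thousands without power",
    "I love this sunny day at the beach",
    "This new song is fire, listening on repeat",
    "My exam was a total disaster lol",
    "Watching a movie with friends tonight @sam",
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]  # 1 = real disaster, 0 = not

# Vectorization (with stop-word removal) and the classifier in one pipeline.
pipeline = Pipeline([
    ("tfidf", TfidfVectorizer(preprocessor=clean_tweet, stop_words="english")),
    ("svc", SVC(kernel="linear")),
])

# Small illustrative grid over the SVC regularization strength.
search = GridSearchCV(
    pipeline,
    param_grid={"svc__C": [0.1, 1, 10]},
    cv=2,
    scoring="accuracy",
)
search.fit(texts, labels)

preds = search.predict([
    "Wildfire forces evacuation of the town",
    "Great coffee this morning",
])
print(search.best_params_, preds)
```

Keeping the vectorizer inside the pipeline means it is refit on each training fold during the grid search, so the cross-validated score is not inflated by vocabulary learned from the held-out fold.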

Results

After trying different ML models, we achieved an accuracy of 0.80232 with a Support Vector Machine classifier (SVC). This result could likely be improved with other methodologies and libraries.
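
For readers new to the metric, here is a small sketch of how an accuracy figure is computed and how predictions are typically laid out for a submission. It assumes the reported score is plain classification accuracy and that the submission file uses `id` and `target` columns; the labels and ids below are made up for illustration.

```python
import io

import pandas as pd
from sklearn.metrics import accuracy_score

# Made-up labels: accuracy is the share of predictions that match the truth.
y_true = [1, 0, 1, 1, 0]
y_pred = [1, 0, 0, 1, 0]
score = accuracy_score(y_true, y_pred)  # 4 of 5 predictions match

# Assumed submission layout: one row per test tweet, columns `id` and `target`.
submission = pd.DataFrame({"id": [0, 2, 3], "target": [1, 0, 1]})

# Written to an in-memory buffer here; a real run would pass a file path instead.
buffer = io.StringIO()
submission.to_csv(buffer, index=False)
csv_text = buffer.getvalue()
print(score)
print(csv_text)
```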

Next steps

Explore and implement libraries like spaCy and word embeddings, or methodologies like stemming. We could also potentially improve the accuracy further by using Google's libraries for NLP.
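
As a rough idea of what stemming would add, the sketch below uses NLTK's `PorterStemmer`, which trims word endings by rule instead of looking words up in a dictionary as lemmatization does. The token list is made up for illustration, and choosing the Porter stemmer over other stemmers is an assumption.

```python
from nltk.stem import PorterStemmer

stemmer = PorterStemmer()

# Made-up tokens, as they might come out of the tokenization step.
tokens = ["fires", "running", "flooding", "evacuated"]

# Each token is reduced to a stem, which need not be a dictionary word.
stems = [stemmer.stem(token) for token in tokens]
print(dict(zip(tokens, stems)))
```

Stems can be cruder than lemmas, so comparing both variants on a validation split would show whether the change helps this dataset.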

Tools

Python
Pandas
Regex
NLTK
Sklearn
H2O

Our Team

Esdras Campos: https://github.com/EsdrasGrau
Saúl Romero: https://github.com/sromero9485
César Campuzano: https://github.com/cesarcamp
