GitHub - rmkeeler/udacity-project-disaster-alerts: ETL, ML and web interface for a disaster alert system.

Installation

Python version used: 3.8.7

Packages used:

Pandas 1.2.3
Numpy 1.20.2
Scikit-Learn 0.24.1
NLTK 3.6.2
Pickle 0.7.5
SQLAlchemy 1.4.15
Flask 1.1.2

Usage Instructions

Running data/process_data.py and then models/train_classifier.py in that order will put all necessary files where they need to go. Input datasets can be stored anywhere, but it's recommended to store them in the data directory.

To run data/process_data.py from the project root:

python data/process_data.py -f <messageCSVPath> -l <labelsCSVPath>

Where the -f argument specifies the path to a CSV file containing the text messages from which the model's features will be derived.

And the -l argument specifies the path to a CSV file containng the classifications of the messages in the messages file.

To run models/train_classifier from the project root:

python models/train_classifier.py <database_path> <model_save_location>

Where <database_path is the location of the database you saved when you ran data/process_data.

And <model_save_location> is the path and filename where you would like to save the model you're training in this step.

At that point, you're basically done. What's left is launching the Flask app, locally. Working directory should be the "app" directory (the same directory that contains run.py):

python run.py

The app will launch on a local server at 127.0.0.1:3001, by default. You can change that location by editing app/run.py.

You should see this when you access the server location:

And this after you submit text via the "classify message" button:

app/run.py also contains the plotly objects and the pandas analysis steps necessary to produce their data. Feel free to edit, as desired.

Project Movitation

I undertook this project as part of Udacity's Data Scientist nanodegree program. The primary motivations for this project were:

Exploring natural language processing and feature extraction
Exploring ML pipelining and model optimization
Practicing data wrangling via the Extract-Transform-Load process
Understanding how to build a data science pipeline that culminates in a public web application

The secondary motivation was the project's simulated goal: Creating a public web application that allows disaster relief workers to input a text message and learn what kind (if any) relief is desired by the sender. Routing social media messages to appropriate disaster relief agencies is the broader goal of the application.

File Descriptions

data/process_data.py: Takes file references in command prompt and performs ETL to clean datasets, merge them into one and load into a local db file.
models/train_classifier.py: Extracts data from db file created above, tokenizes messages and builds a predictive model using the resulting tokens. Saves the model to models/classifier.pkl
run.py: The Flask app. Creates a local server instances at 127.0.0.1:3001 and produces the web app, there. Also performs the analyses the populate the bar charts in the app.

Licensing, Authors and Acknowledgements

Data collected and provided by Appen for the purpose of a project in Udacity's Data Scientist Nanodegree program.

process_data.py and train_classifier.py provided by Udacity. Functions within them written by me (main() in train_classifier provided by Udacity).

Flask app html and app.py provided by Udacity and altered by me.

Feel free to use the code provided in this repository at your own discretion.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
app		app
data		data
models		models
screenshots		screenshots
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Table of Contents

Installation

Usage Instructions

Project Movitation

File Descriptions

Licensing, Authors and Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Table of Contents

Installation

Usage Instructions

Project Movitation

File Descriptions

Licensing, Authors and Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages