Predicting Train Arrivals

We're setting up a data pipeline to improve CTA train arrival predictions at stations close to a terminal. The CTA only begins live tracking a train once it has left the terminal, so arrival predictions at nearby stations (such as Noyes, Central, or South Blvd) are of little use: you typically learn that a train is approaching only when it is about 2 minutes away.

However, we know that trains arriving at the terminal from the opposite direction usually wait there before starting their next trip. With enough historical data, we can identify patterns in these waits and attempt to generate expected arrival schedules for the nearby stations.
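As a rough illustration of the idea, the sketch below estimates a typical terminal wait from historical observations and uses it to project an arrival time at a nearby station. It is not the repository's method: the function names, the shape of the history records, and the fixed run time from the terminal to the station are all assumptions made for this example.

```python
from datetime import datetime, timedelta
from statistics import median


def median_layover(history):
    """Return the median wait at the terminal as a timedelta.

    `history` is a hypothetical list of (arrived_at_terminal, departed_terminal)
    datetime pairs; pairs where the departure precedes the arrival are ignored.
    """
    seconds = [(dep - arr).total_seconds() for arr, dep in history if dep >= arr]
    if not seconds:
        raise ValueError("no valid layover observations")
    return timedelta(seconds=median(seconds))


def expected_arrival(arrived_at_terminal, layover, run_time):
    """Project the arrival at a nearby station (assumed formula):
    terminal arrival + typical layover + run time to the station."""
    return arrived_at_terminal + layover + run_time


# Made-up observations with waits of 5, 7, and 9 minutes.
base = datetime(2024, 1, 1, 8, 0)
history = [
    (base, base + timedelta(minutes=5)),
    (base + timedelta(minutes=20), base + timedelta(minutes=27)),
    (base + timedelta(minutes=40), base + timedelta(minutes=49)),
]
layover = median_layover(history)
eta = expected_arrival(base + timedelta(hours=1), layover, timedelta(minutes=3))
```

A median is used here only because it is robust to the occasional very long wait; a learned model could replace it with a prediction conditioned on time of day or other features.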

The class CTATrainDataParser is responsible for safely reading the data collected from the EL Tracker application. The function get_entries_by_station returns a tuple that pairs the timestamp of the arrival request with the corresponding train arrival response. From there, we extract features to feed into a learning algorithm.
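To make the shape of that output concrete, here is a minimal standalone sketch of pairing request timestamps with arrival responses for one station. It does not reproduce CTATrainDataParser or get_entries_by_station: the JSON-lines input and the field names (requested_at, arrivals, station) are assumptions for illustration, and the real EL Tracker data format may differ.

```python
import json
from datetime import datetime


def entries_by_station(lines, station):
    """Return a list of (request_time, arrivals) tuples for one station.

    Hypothetical input: one JSON object per line with an ISO 8601
    "requested_at" string and an "arrivals" list of dicts that each carry a
    "station" key. Malformed lines are skipped rather than raising.
    """
    entries = []
    for line in lines:
        try:
            record = json.loads(line)
            requested_at = datetime.fromisoformat(record["requested_at"])
            arrivals = [a for a in record["arrivals"] if a.get("station") == station]
        except (ValueError, KeyError, TypeError, AttributeError):
            continue  # unreadable or unexpected record: skip it
        if arrivals:
            entries.append((requested_at, arrivals))
    return entries


# Made-up sample lines, including one that is not valid JSON.
sample = [
    '{"requested_at": "2024-01-01T08:00:00", "arrivals": '
    '[{"station": "Noyes", "minutes": 2}, {"station": "Central", "minutes": 4}]}',
    "not json",
    '{"requested_at": "2024-01-01T08:01:00", "arrivals": []}',
]
noyes_entries = entries_by_station(sample, "Noyes")
```

Each resulting tuple can then be turned into a feature row, for example the request time alongside the predicted minutes to arrival.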

Folders and files

eta_replay
holdout_data
matches
report
sample_data
.gitignore
README.md
data_parser.ipynb
data_parser.py
noyes_arrival_error_demo.mp4
noyes_features.ipynb
noyes_features.py
noyes_model.ipynb
noyes_model.py
pickle_noyes.py
train_stops.csv

KhachDavid/el-tracker-ml
