- Which airlines or airports have the worst delays?
Determine which destinations and arrival destinations have the most delays? Doing this using maps is actually pretty difficult, but you may choose an alternative visual to provide this information. Think about what kind of aggregates might work best to determine which airlines and airports are the best and worst in terms of delays.
- What causes delays?
Think about if you work at an airline and you want to decrease delays. What part of the flight causes the most delays? Do these causes vary by airport or time of year?
- You can also come up with your own question!
As you work with the data, come up with a question you're curious about and can be answered from the data. Build a dashboard or story to answer your question and lead viewers to that answer.
https://www.kaggle.com/usdot/flight-delays/data
Some of the columns you want to use in your project will have coded values that represent longer more readable values. For instance the cancellation_reason column in the flights data set has the values: A, B, C, D These letters are not understandable by themselves. You need to replace these letters with the full reason to make your visualizations including this data more readable.
These letters correspond with the following reasons.
A - Airline/Carrier
B - Weather
C - National Air System
D - Security
You could review the Column Metadata tab on Kaggle for each data set to find details about the data like the one outlined above.