This project will collect and combine flight data and airport location data so it can be used for future analysis and visualizations.
Right now, this is a starting point — the data will be added later.
The goal is to:
- Find open data on commercial flights and airport locations
- Combine both datasets into one file
- Use the data for analysis, charts, or dashboards in the future
| Folder | Description |
|---|---|
data/ |
This is where flight and airport data files will go |
scripts/ |
Python scripts for cleaning or merging data |
notebooks/ |
Jupyter notebooks for data exploration |
README.md |
This file that explains the project |
I will look at these open datasets:
- OpenFlights – routes and airport coordinates
- OurAirports – airport information
- OpenSky Network – live flight data (optional)
airports.csv
| Column | Description |
|---|---|
| airport_id | Unique ID (IATA or ICAO) |
| name | Airport name |
| city | City |
| country | Country |
| latitude | Latitude |
| longitude | Longitude |
flights.csv
| Column | Description |
|---|---|
| origin_id | Departure airport code |
| destination_id | Arrival airport code |
| airline | Airline name |
| distance_km | Distance between airports (optional) |
Once data is added, a basic Python script might look like this:
import pandas as pd
airports = pd.read_csv("data/airports.csv")
flights = pd.read_csv("data/flights.csv")
merged = flights.merge(
airports, left_on="origin_id", right_on="airport_id", how="left"
)
print(merged.head())