This project showcases a full data analytics workflow using Excel and real CitiBike NYC trip data. I cleaned the raw dataset, performed descriptive analysis, and created visualizations to extract insights on rider behavior and station usage patterns.
- π§Ή Cleaned over 3,500 duplicate rows and handled missing values
- π Used pivot tables to analyze trip duration, user types, and usage patterns
- π Built visual charts to support a data-driven story for stakeholders
- π― Demonstrated Excel proficiency and a sharp eye for insights
Each visualization answers a key business question about CitiBike usage:
-
π [Top 20 Pick-Up Locations]
β What are the most popular pick-up locations across NYC? -
π§ [Trip Duration by Age Group]
β How does the average trip duration vary across different age groups? -
π₯ [Bike Rentals by Age Group]
β Which age group rents the most bikes? -
π [User Type by Day of Week]
β How does bike rental differ between Subscribers and Customers across weekdays and weekends?
- π Top Start Stations: The three busiest pick-up locations account for a significant portion of total rides
- β±οΈ Trip Duration: Younger riders (18β25) tend to have longer average trip durations
- π₯ Age Group Usage: Riders aged 25β35 rent the most bikes overall
- π User Behavior:
- Subscribers use CitiBike mostly during weekdays (commuting pattern)
- Customers use it more on weekends (leisure pattern)
- Microsoft Excel
- Pivot Tables & Pivot Charts
- Basic Descriptive Statistics (mean, median, range)
This project was built as part of my self-paced journey into data analytics. It reflects not just technical skills, but also my ability to communicate findings clearly and extract meaningful insights from raw data β all using just Excel.