The purpose of this project was to review the NYC Citi Bike bikesharing data and use the learning to create visualizations and conclusions for the possibility of a bikesharing program in Des Moines, Iowa. We utilized Jupyter Notebook and used Pandas to convert the trip duration to a datetime format. We then utilized Tableau for the visualizations.
We utilized Jupyter Notebook and pandas to convert the trip duration to a datetime format. After exporting that datafram as a csv, we were ready to visualize. Utilizing Tableau, we put together a series of visualizations to help us make decisions for the project. The Tableau story is found here: Tableau Dashboard - NYC Citi Bike Data Review
Some key notes from this data:
- Checkout Time by User
This was the typical trip duration. We saw that most trip durations overall were less than an hour.
- Checkout Time by Gender
This was the trip duration, but split out by gender. From this, we can see that regardless of gender, trips typically last under an hour.
- Trips by Weekday for Each Hour
This visual showed a heatmap of our top times by weekday. This shows us that 8-9am and 5-7pm are peak hours, and Thursday is the heaviest weekday.
- Trips by Gender (Weekday per Hour)
Another heatmap, but split out by gender. We see here that regardless of gender, peak hours and days remain consistent.
- User Trips by Gender by Weekday
This was split to show a customer vs subscriber, the weekday, and gender in a heatmap. Most of the rides were accomplished by subscribers, and again we see male as the top frequent rider gender and Thursday as the peak day.
- Top Starting Location
This is a map visual with the top starting locations. From here, we can see that the most frequented areas are close together, and the rides are a combination of commuters and tourists.
- Bike Utilization
Here we can see the bike id's and their total trips. This helps us plan that some bikes will be used more than others and will require more frequent maintenance.
Overall, this data was helpful to us in realizing some interesting factors to take into account. A couple key takeaways:
- Some bikes will be used more than others and maintenance should be expected accordingly. In addition it will be important to budget that maintenance in.
- Top starting locations were grouped in similar spots. To start with the program in Des Moines, we should start with a smaller number of bikes in key areas of tourism and commute as a beta test.
- Trip durations were consistently under 1 hour for the majority of the rides. From this we can plan bike stations accordingly.
There are a couple more visualizations that would help us with our analysis:
- A map showing a select number of starting and stopping stations to provide context on typical distance needed between stations.
- A bar chart that shows the top bikes that were utilized the most. An extra mention on total number of bikes will help us determine extra maintenance needed on a percentage of the bikes.






