Project Topic

The city of Calgary, Alberta, Canada adopted a cycling strategy to evaluate ways to increase the number of Calgarians choosing cycling as a means of transportation throughout the city.

The City collects and monitors information on the number of bicycle riders, gender, age, and helmet use annually. This dataset includes counts completed from 2013-2016.

My project will use this dataset to identify trends in cycling over the years - with a specific focus on gender and age difference in cycling in the city.

Data Sources

Data is available in multiple formats (CSV, RSS, XML) from the City of Calgary Open Data website:

Location of Data: https://data.calgary.ca/Transportation-Transit/Annual-Bicycle-Counts/ybd2-54bg

License URL: https://data.calgary.ca/d/Open-Data-Terms/u45n-7awa Contains information licensed under the Open Government Licence – City of Calgary.

Data Provided by: City of Calgary - Transportation Planning, Dataset Owner: Calgary Open Data

Description of the Data

The CSV file contains 290 rows and 23 columns and includes variables such as: Year, Month, Location, Gender, Helmet Use, Latitude, Longitude, Age

Creating a “tidy” dataset involved: -removal of Null values -properly assigning type to the various columns (eg. -reassigning MONTH so it can be display chronologically rather than as text alphabetically) -splitting the “location” column into separate Lat/Long columns.

Ideas about the figures that you will create to visualize this data:

I will be using OpenStreetMap to display the location of monitoring stations on a map of Calgary

Bar charts showing number/percent of cyclists from 2013-2016 (all interactive to sort by gender/year)

Possibly heatmaps showing trends in ridership and helmet usage by gender and age over the duration of the study.

Box plots to represent statistical distribution by gender/age.