In early 2022 staff from Oregon Department of Transportation’s Region 5 were interested in estimating the delay and safety related costs to pedestrians incurred when corners and intersections are closed while wheelchair ramps (ADA ramps) are being reconstructed. The objective of quantifying these costs aims to ensure construction contractors are considering these costs explicitly when determining project demolition and construction phasing. In order to accurately assess these costs a reliable estimate of pedestrian traffic volume is required. Estimates of pedestrian traffic informs economic valuation of delay and safety so that those costs are considered by contractors for ADA ramp projects and sidewalk closures are minimized in duration and impact to the community.
This report documents the data and methods used to estimate pedestrian traffic at 54 intersections in Region 5 where ADA ramp projects are planned in 2023. The reports uses observed traffic counts and push button actuations from ODOT traffic signals as well as some of the latest pedestrian traffic estimation techniques from recent and ongoing research. The data, methods, and results are summarised below in this report.
This project uses observed short duraction traffic counts, pedestrian crosswalk push button actuations from ODOT traffic signals, and land use information from the Department of Land Conservation and Development (DLCD) Place Type framework. These data are explained in more detail below.
This section summarizes the short-duration pedestrian traffic counts (SDC) collected at 54 locations throughout spring of 2022 using video and later reduced to tabular data and entered into the Oregon Traffic Monitoring System. These SDCs were collected for 16 hours with 47 locations having 32 hours of data collected across two separate days resulting in 1,116 hours of data. The hourly and daily pedestrian traffic volumes are summarized in the table below.
Measure | Hourly | Daily |
---|---|---|
Minimum | 0.00 | 1.00 |
1st Quantile | 0.00 | 24.00 |
Median | 2.00 | 41.00 |
Mean | 4.01 | 64.09 |
3rd Quantile | 6.00 | 85.00 |
Max | 49.00 | 331.00 |
N | 1616.00 | 101.00 |
Land use classifications are used in this report to match permanent count sites to SDCs. Land use information used below to define location types comes from the Department of Land Conservation and Development’s Place Types database. Place type classification uses built environment and transportation service availability data to describe the destination accessibility, design, and diversity measures to help provide an understanding of the interaction of land use and transportation choices for a given area. The base data for Place Types comes from the Environmental Protection Agency’s Smart Location Database (SLD) with those data elements used to inform a area type and a development type that are then combined to form the Place Type classification. Place type designations include 9 classifications including Rural, Isolated City, Rural Near Major City, City Near Major Center, and MPO (low/residential/employment/Mixed use/ TOD) and are available at the Census block level.
This section describes the development of hourly adjustment factors, the cluster analysis used to group signal controllers and the day-of-year traffic expansion factor methods. All data wrangling and analysis is performed in the R open source statiscial computing platform.
This project utilizes recent research from a Utah Department of Transportation (UDOT) funded research project lead by Patrick Singelton. Singleton and Runa (2021) combined over 10,000 hours of push button actuations and observed pedestrian traffic counts to create hourly adjustment factors. These factors are created by estimating a simple model that relates the observed traffic counts to the push button actuations. Using these simple models to adjust the hourly push button actuation data, push buttons can be used to estimate hourly counts with a small amount of error, around 3 +/- pedestrians per hour (Singleton & Runa 2020). This project tested a few model forms but ultimatley concluded that the quadratic form was best.This section describes the model development using data collected at the study intersections.
Generally speaking the hourly adjustment models rely on the assumption that the push buttons recorded by the ATCs at intersections relate to the pedestrian traffic observed using the intersection. Adjustment factors are necessary because pedestrians do not always push the button to activate the crosswalk or they may be traveling in groups which means the one push of the button represents more than one actual pedestrian. The chart below compares the hourly counts and hourly push button actuations at a select intersection used in this study.
Figure 3.1: Short-Duration Counts Compared with Push Button Actuations Device: 756 Intersection Id: 52029
For each intersection where counts and pedestrian push button actuations are available a linear-quadratic model is estimated. The model uses the observed hourly counts as the dependent variable with the push button actuations and a squared transformation of the push button actuations are the independent variables as shown below:
\[\begin{aligned}PedCounts_{i}= PushButtons_{i} + PushButtons_{i}^2\end{aligned}\]
Where i is the intersection where both SDC and push button data have been collected. To assess performance of these models two measures are used including adjusted R2 and mean average error (MAE). Adjusted R2 measures
the models’ goodness-of-fit between the observed and estimated values with a value of 1.0 being a perfect model and 0 being poorly performing model. MAE is another measure of model fit that averages the absolute values of the differences between estimated counts and observed counts helping to describe how big or small the estimation error is, in real terms, compared to the observed counts.
Below is a summary chart of both the adjusted R2 and the MAE for each of the 16 signals where counts and push button data were available concurrently. Of the 16 intersections, 13 perform well with low error and high goodness-of-fit metrics but three intersections perform poorly (766, 767,& 770) as measured by low goodness-of-fit measures. For these intersections a model based on grouped sites will be utilized instead of an intersection specific model. To group sites cluster analysis is performed and explained in more detail in the next section.
Figure 3.2: Hourly Adjustment Model performance Measures by Device and Cluster
There are two objectives for using cluster anlaysis. The first objective is to group signals together with similar temporal patterns along with their SDCs so that more robust hourly adjustment models can be developed using pooled data. The second objective of the cluster analysis is to allow for the application of the estimated hourly adjustment models where no SDCs were taken. There are 15 intersections near our study intersections where push button data exist but no SDC was collected so in order to take advantage of these data we will use models from pooled data where the data was poole dusing cluster analysis.
As can be seen in Figure 3.2 above three devices (signals) produce models with unacceptablly low R2 and even though the MAE is low this level of prediction power is insufficient. The models are not performing well mostly due to very low pedestrian activity at these intersections with the three sites only seeing a few pedestrians for the entire 16 hour period of SDC collection. To address this issue we group sites using an unsupervised machine learning algorthim called cluster analysis implemented using the kmeans function from the R Core Team (2021). Cluster analsis is a form of pattern recognition that has the ability to group data based on selected data elements, in this case the hourly proportion of total weekly unadjusted push button activity across all hours and days of the week using an annual data set. In this cluster analysis exercise, different numbers of clusters were tried but eight was found to work best to ensure balanced cluster frequencies and also based on visual inspection, sensible groupings for different travel patter types. In the chart below the clusters for the temporal patterns are described for five of the clusters while the three other clusters are not used becuase they were from signals that were too far and didnt conform to the placetypes required in the SDC matching criteria explained below.
Figure 3.3: Push Button Temporal Patterns Grouped Using Cluster Analysis
Figure 3.3 above shows 4 different temporal patterns as established from the clustering algorithm. Cluster 3 has a pattern where the hourly peak is around the noon to 2:00 pm hours for most days though this peak is less pronounced for Monday and Sunday. Cluster 4 exhibits a similar pattern but starts an hour later on most days until 3:00 pm and this peak period is much more pronounced than what is observed in the other clusters. Cluster 5 shows extended mid-afternoon peaks with weekend days revealing very long shallow afternoon peaks . Cluster 8 has lower peaks relative to the other clusters with fewer device observations that deviate subtantially from the average. Model results from clusters 3,4, and 5 will be applied to signal locations where no SDCs were collected while models built on cluster 8 will be used to improve performance for the devices highlighted in Figure 3.2 to perform poorly as device specific models.
This section documents the application of the device specific models and the cluster analysis based models to 31 devices (or traffic signals) that are within 1000 meters of the study locations. There are 35 signals within 1,000 meters of the 54 study locations but unfortunatley not all of those signals were included in a cluster that also had observed pedestrian traffic counts and so no cluster based model existed for them and they are removed. The chart featured in Figure 3.4 below shows the adjusted counts after a device specific hourly adjustment model is applied as well as the raw push button actuations. The patterns of estimated traffic exhibit expected overall patterns with less daily pedestrian traffic in the winter months and higher traffic volumes in the summer. This intersection is located near the Baker County Fairgrounds and Geiser Pollman Park both of which host a number of events that result in a relatively high amount of foot traffic during the summer weekends. The day where estimated estimated counts reached over 1,700 was a day when multiple events were occuring near the intersection including Baker City Bronc and Bull Riding event, Miners Jubilee and multiple music acts. There are 16 signals where push buttons and SDCs were collected concurrently and once the models estimated above are applied to each of the 16 intersections we produce 16 permanent count sites for use in the traffic factoring described below.
Figure 3.4: Push Button and Adjusted Pedestrian Count Comparison for Device Id 781 Campbell and Cedar Street
This section describes the process used to expand the short duraction counts (SDCs) using day-of-year (DOY) factors as well as how SDCs were matched with permanent count sites to estimate an AADPT. Since the SDCs were collected for only 16 hours of the day their is also a need to apply an hourly factor which are generated from the permanent count sites and applied to the SDC based on which permanent count sites were used in the DOY factoring.
The current state of the practice for expanding short duration pedestrian traffic counts is to use a Day-of-Year factors. These factors are derived from a permanaent counter by dividing the daily count by the annual total thus creating a factor for each day of the year. The DOY factor was shown by Hankey, Lindsey, and Marshall (2014), El Esawey (2016) and Nordback et al. (2019) to minimize error compared to standard Federal Highways Administration (2018) method which create 84 factors based on day of the week and month of year. Due to the sensitivity of daily conditions like weather, traditional factors are not recommended for use in expanding pedestrian short duration traffic counts due to their inability to account for these day-to-day variations in weather. The formula describing the DOY expansion factor method is below:
\[\begin{aligned}AADPT{i,y,j} = \frac1n\sum_{n=1}SDC{i,y,j} *\frac{1}{DOY{y,i}}\end{aligned}\]
Where:
i = intersection where SDC was collected
y = year of count
j = day SDC was collected
In order to expand the SDCs to represent an annual average daily estimate of pedestrian traffic the SDC must be matched to a permanent count site. The recommendtion from past research is to use at least three permanent (Nordback et al. (2019)) count sites to minimized annual estimation error. There are 54 locations where SDCs were collected and need to be expanded but 16 of those locations are at an intersection where push button data is being collected so the adjusted push buttons will be used directly. For the other 38 SDC locations a combination of proximity and land use characteristics criteria will be used to guide which permanent count sites to use for expansion.
The process for matching permanent count sites with SDCs uses a selection algorithm that ensures the Place Type for each site (SDC and permanent counter) are the same and that the closest permanent count site is selected while also aiming to use three permanent count sites. The last rule is not always acheived however as some of the SDCs are too far away from permanent counter with a maximum distance of 150,000 meters being applied. The last exception for these algorithmic rules is if the SDC is in a place type designated ‘rural’ in which cases the matching element of the algorithm is relaxed to also include ‘Rural Near Major City’.
The entire process including the hourly adjustment modeling, application ofmodels, and factoring is described below in Figure 3.5 below.
Figure 3.5: Process Work Flow
This section details the AADPT Results after applying the above methodologies to the SDC data. The results are presented in three formats including a chart, a dynamic map and a table (with ability to download data). The chart in Figure 4.1 shows the AADPT for each site and is grouped by the city in which the intersection resides. The dynamic map allowes users to see spatially the AADPT estimates and use the tooltips (click on circles) for the intersection points to see additional information about the location. The table below documents the final AADPT results for each intersection and include some information about the number of sites and what the place type was for both the SDC and the traffic signal (permanent counter).
Figure 4.1: AADPT Results by Intersection Id & City
This map is dynamic and can be explored. For more infomration about the intersection use the mouse to click on the interseciton points to reveal more information through the tooltips.
The table below is dynamic as well and can be searched using the search bar and any data of interest can be exported in a couple different formats using the table buttons.