1 Introduction

In early 2022 staff from Oregon Department of Transportation’s Region 5 were interested in estimating the delay and safety related costs to pedestrians incurred when corners and intersections are closed while wheelchair ramps (ADA ramps) are being reconstructed. The objective of quantifying these costs aims to ensure construction contractors are considering these costs explicitly when determining project demolition and construction phasing. In order to accurately assess these costs a reliable estimate of pedestrian traffic volume is required. Estimates of pedestrian traffic informs economic valuation of delay and safety so that those costs are considered by contractors for ADA ramp projects and sidewalk closures are minimized in duration and impact to the community.

This report documents the data and methods used to estimate pedestrian traffic at 54 intersections in Region 5 where ADA ramp projects are planned in 2023. The reports uses observed traffic counts and push button actuations from ODOT traffic signals as well as some of the latest pedestrian traffic estimation techniques from recent and ongoing research. The data, methods, and results are summarised below in this report.

2 Data

This project uses observed short duraction traffic counts, pedestrian crosswalk push button actuations from ODOT traffic signals, and land use information from the Department of Land Conservation and Development (DLCD) Place Type framework. These data are explained in more detail below.

2.1 Observed Traffic Counts

This section summarizes the short-duration pedestrian traffic counts (SDC) collected at 54 locations throughout spring of 2022 using video and later reduced to tabular data and entered into the Oregon Traffic Monitoring System. These SDCs were collected for 16 hours with 47 locations having 32 hours of data collected across two separate days resulting in 1,116 hours of data. The hourly and daily pedestrian traffic volumes are summarized in the table below.

Table 2.1: Observed Pedestrian Traffic Counts Data Summary
Measure	Hourly	Daily
Minimum	0.00	1.00
1st Quantile	0.00	24.00
Median	2.00	41.00
Mean	4.01	64.09
3rd Quantile	6.00	85.00
Max	49.00	331.00
N	1616.00	101.00

2.2 Push Button Actuations from Traffic Signals

This section summarizes the push button actuations from ODOT traffic signals in the study area. These push button data are created any time a person pushes the crosswalk button at an intersection with the Advanced Traffic Controllers (ATCs). The ATC records each phase of the traffic signal in a centralized data repository making them accessible for purposes of travel monitoring. There are 35 intersections with ATC signal hardware within 1,000 meters of the 54 study intersections. These push buttons actuation data will be used, once adjusted, as permanent pedestrian traffic counters. This adjustment will occur by combining counts and push button data and creating adjustment factors. This process is explained in more detail in the following sections. Throughout this document, the 35 signals are referred to as both ‘signals’ and ‘devices’ using such terminology as ‘device id’. The table below summarizes the push button actuations used in the development of the hourly adjustment models (Model Data) as well as the application of the model to create permanent count sites (Application Data).

Table 2.2: Select Signals’ Push Button Data Summary
	Model Data		Application Data
Measure	Hourly	Daily	Hourly	Daily
Minimum	0.0	3	1.0	1.0
1st Quantile	0.8	33	2.0	28.0
Median	2.5	58	3.0	54.0
Mean	3.7	63	4.8	68.1
3rd Quantile	6.0	97	6.0	88.0
Max	20.0	164	119.0	711.0
Observations	3264.0	25	175296.0	12310.0
Signal Count	16.0	16	35.0	35.0

2.3 Land Use - Place Type

Land use classifications are used in this report to match permanent count sites to SDCs. Land use information used below to define location types comes from the Department of Land Conservation and Development’s Place Types database. Place type classification uses built environment and transportation service availability data to describe the destination accessibility, design, and diversity measures to help provide an understanding of the interaction of land use and transportation choices for a given area. The base data for Place Types comes from the Environmental Protection Agency’s Smart Location Database (SLD) with those data elements used to inform a area type and a development type that are then combined to form the Place Type classification. Place type designations include 9 classifications including Rural, Isolated City, Rural Near Major City, City Near Major Center, and MPO (low/residential/employment/Mixed use/ TOD) and are available at the Census block level.

3 Methods

This section describes the development of hourly adjustment factors, the cluster analysis used to group signal controllers and the day-of-year traffic expansion factor methods. All data wrangling and analysis is performed in the R open source statiscial computing platform.

3.1 Development and Application of Hourly Adjustment Models

This project utilizes recent research from a Utah Department of Transportation (UDOT) funded research project lead by Patrick Singelton. Singleton and Runa (2021) combined over 10,000 hours of push button actuations and observed pedestrian traffic counts to create hourly adjustment factors. These factors are created by estimating a simple model that relates the observed traffic counts to the push button actuations. Using these simple models to adjust the hourly push button actuation data, push buttons can be used to estimate hourly counts with a small amount of error, around 3 +/- pedestrians per hour (Singleton & Runa 2020). This project tested a few model forms but ultimatley concluded that the quadratic form was best.This section describes the model development using data collected at the study intersections.

Generally speaking the hourly adjustment models rely on the assumption that the push buttons recorded by the ATCs at intersections relate to the pedestrian traffic observed using the intersection. Adjustment factors are necessary because pedestrians do not always push the button to activate the crosswalk or they may be traveling in groups which means the one push of the button represents more than one actual pedestrian. The chart below compares the hourly counts and hourly push button actuations at a select intersection used in this study.

Figure 3.1: Short-Duration Counts Compared with Push Button Actuations Device: 756 Intersection Id: 52029

3.1.1 Linear Model

For each intersection where counts and pedestrian push button actuations are available a linear-quadratic model is estimated. The model uses the observed hourly counts as the dependent variable with the push button actuations and a squared transformation of the push button actuations are the independent variables as shown below:

\[\begin{aligned}PedCounts_{i}= PushButtons_{i} + PushButtons_{i}^2\end{aligned}\] Where i is the intersection where both SDC and push button data have been collected. To assess performance of these models two measures are used including adjusted R² and mean average error (MAE). Adjusted R² measures the models’ goodness-of-fit between the observed and estimated values with a value of 1.0 being a perfect model and 0 being poorly performing model. MAE is another measure of model fit that averages the absolute values of the differences between estimated counts and observed counts helping to describe how big or small the estimation error is, in real terms, compared to the observed counts.

Below is a summary chart of both the adjusted R² and the MAE for each of the 16 signals where counts and push button data were available concurrently. Of the 16 intersections, 13 perform well with low error and high goodness-of-fit metrics but three intersections perform poorly (766, 767,& 770) as measured by low goodness-of-fit measures. For these intersections a model based on grouped sites will be utilized instead of an intersection specific model. To group sites cluster analysis is performed and explained in more detail in the next section.

Figure 3.2: Hourly Adjustment Model performance Measures by Device and Cluster

3.1.2 Cluster Analysis

There are two objectives for using cluster anlaysis. The first objective is to group signals together with similar temporal patterns along with their SDCs so that more robust hourly adjustment models can be developed using pooled data. The second objective of the cluster analysis is to allow for the application of the estimated hourly adjustment models where no SDCs were taken. There are 15 intersections near our study intersections where push button data exist but no SDC was collected so in order to take advantage of these data we will use models from pooled data where the data was poole dusing cluster analysis.

As can be seen in Figure 3.2 above three devices (signals) produce models with unacceptablly low R² and even though the MAE is low this level of prediction power is insufficient. The models are not performing well mostly due to very low pedestrian activity at these intersections with the three sites only seeing a few pedestrians for the entire 16 hour period of SDC collection. To address this issue we group sites using an unsupervised machine learning algorthim called cluster analysis implemented using the kmeans function from the R Core Team (2021). Cluster analsis is a form of pattern recognition that has the ability to group data based on selected data elements, in this case the hourly proportion of total weekly unadjusted push button activity across all hours and days of the week using an annual data set. In this cluster analysis exercise, different numbers of clusters were tried but eight was found to work best to ensure balanced cluster frequencies and also based on visual inspection, sensible groupings for different travel patter types. In the chart below the clusters for the temporal patterns are described for five of the clusters while the three other clusters are not used becuase they were from signals that were too far and didnt conform to the placetypes required in the SDC matching criteria explained below.

Figure 3.3: Push Button Temporal Patterns Grouped Using Cluster Analysis

Figure 3.3 above shows 4 different temporal patterns as established from the clustering algorithm. Cluster 3 has a pattern where the hourly peak is around the noon to 2:00 pm hours for most days though this peak is less pronounced for Monday and Sunday. Cluster 4 exhibits a similar pattern but starts an hour later on most days until 3:00 pm and this peak period is much more pronounced than what is observed in the other clusters. Cluster 5 shows extended mid-afternoon peaks with weekend days revealing very long shallow afternoon peaks . Cluster 8 has lower peaks relative to the other clusters with fewer device observations that deviate subtantially from the average. Model results from clusters 3,4, and 5 will be applied to signal locations where no SDCs were collected while models built on cluster 8 will be used to improve performance for the devices highlighted in Figure 3.2 to perform poorly as device specific models.

3.2 Model Application

This section documents the application of the device specific models and the cluster analysis based models to 31 devices (or traffic signals) that are within 1000 meters of the study locations. There are 35 signals within 1,000 meters of the 54 study locations but unfortunatley not all of those signals were included in a cluster that also had observed pedestrian traffic counts and so no cluster based model existed for them and they are removed. The chart featured in Figure 3.4 below shows the adjusted counts after a device specific hourly adjustment model is applied as well as the raw push button actuations. The patterns of estimated traffic exhibit expected overall patterns with less daily pedestrian traffic in the winter months and higher traffic volumes in the summer. This intersection is located near the Baker County Fairgrounds and Geiser Pollman Park both of which host a number of events that result in a relatively high amount of foot traffic during the summer weekends. The day where estimated estimated counts reached over 1,700 was a day when multiple events were occuring near the intersection including Baker City Bronc and Bull Riding event, Miners Jubilee and multiple music acts. There are 16 signals where push buttons and SDCs were collected concurrently and once the models estimated above are applied to each of the 16 intersections we produce 16 permanent count sites for use in the traffic factoring described below.

Figure 3.4: Push Button and Adjusted Pedestrian Count Comparison for Device Id 781 Campbell and Cedar Street

3.3 Estimating Annual Average Pedestrian Daily Traffic (AADPT)

This section describes the process used to expand the short duraction counts (SDCs) using day-of-year (DOY) factors as well as how SDCs were matched with permanent count sites to estimate an AADPT. Since the SDCs were collected for only 16 hours of the day their is also a need to apply an hourly factor which are generated from the permanent count sites and applied to the SDC based on which permanent count sites were used in the DOY factoring.

3.3.1 Day-of-Year (DOY) Expansion Factors

The current state of the practice for expanding short duration pedestrian traffic counts is to use a Day-of-Year factors. These factors are derived from a permanaent counter by dividing the daily count by the annual total thus creating a factor for each day of the year. The DOY factor was shown by Hankey, Lindsey, and Marshall (2014), El Esawey (2016) and Nordback et al. (2019) to minimize error compared to standard Federal Highways Administration (2018) method which create 84 factors based on day of the week and month of year. Due to the sensitivity of daily conditions like weather, traditional factors are not recommended for use in expanding pedestrian short duration traffic counts due to their inability to account for these day-to-day variations in weather. The formula describing the DOY expansion factor method is below:

\[\begin{aligned}AADPT{i,y,j} = \frac1n\sum_{n=1}SDC{i,y,j} *\frac{1}{DOY{y,i}}\end{aligned}\]

Where:
i = intersection where SDC was collected
y = year of count
j = day SDC was collected

3.3.2 Matching SDCs to Permanent Count Sites

In order to expand the SDCs to represent an annual average daily estimate of pedestrian traffic the SDC must be matched to a permanent count site. The recommendtion from past research is to use at least three permanent (Nordback et al. (2019)) count sites to minimized annual estimation error. There are 54 locations where SDCs were collected and need to be expanded but 16 of those locations are at an intersection where push button data is being collected so the adjusted push buttons will be used directly. For the other 38 SDC locations a combination of proximity and land use characteristics criteria will be used to guide which permanent count sites to use for expansion.

The process for matching permanent count sites with SDCs uses a selection algorithm that ensures the Place Type for each site (SDC and permanent counter) are the same and that the closest permanent count site is selected while also aiming to use three permanent count sites. The last rule is not always acheived however as some of the SDCs are too far away from permanent counter with a maximum distance of 150,000 meters being applied. The last exception for these algorithmic rules is if the SDC is in a place type designated ‘rural’ in which cases the matching element of the algorithm is relaxed to also include ‘Rural Near Major City’.

The entire process including the hourly adjustment modeling, application ofmodels, and factoring is described below in Figure 3.5 below.

Figure 3.5: Process Work Flow

4 Final AADPT Results

This section details the AADPT Results after applying the above methodologies to the SDC data. The results are presented in three formats including a chart, a dynamic map and a table (with ability to download data). The chart in Figure 4.1 shows the AADPT for each site and is grouped by the city in which the intersection resides. The dynamic map allowes users to see spatially the AADPT estimates and use the tooltips (click on circles) for the intersection points to see additional information about the location. The table below documents the final AADPT results for each intersection and include some information about the number of sites and what the place type was for both the SDC and the traffic signal (permanent counter).

4.1 Chart Results

Figure 4.1: AADPT Results by Intersection Id & City

4.2 Map Results

This map is dynamic and can be explored. For more infomration about the intersection use the mouse to click on the interseciton points to reveal more information through the tooltips.

4.3 Table Results

The table below is dynamic as well and can be searched using the search bar and any data of interest can be exported in a couple different formats using the table buttons.

References

El Esawey, Mohamed. 2016. “Toward a Better Estimation of Annual Average Daily Bicycle Traffic: Comparison of Methods for Calculating Daily Adjustment Factors.” Transportation Research Record: Journal of the Transportation Research Board 2593 (1): 28–36. https://doi.org/10.3141/2593-04.

Federal Highways Administration. 2018. Traffic Data Computation Method: Pocket Guide. Federal Highways Administrationg. https://www.fhwa.dot.gov/policyinformation/pubs/pl18027_traffic_data_pocket_guide.pdf.

Hankey, Steve, Greg Lindsey, and Julian Marshall. 2014. “Day-of-Year Scaling Factors and Design Considerations for Nonmotorized Traffic Monitoring Programs.” Transportation Research Record: Journal of the Transportation Research Board 2468 (1): 64–73. https://doi.org/10.3141/2468-08.

Nordback, Krista, Sirisha Kothuri, Dylan Johnstone, Greg Lindsey, Sherry Ryan, and Jeremy Raw. 2019. “Minimizing Annual Average Daily Nonmotorized Traffic Estimation Errors: How Many Counters Are Needed Per Factor Group?” Transportation Research Record: Journal of the Transportation Research Board 2673 (10): 295–310. https://doi.org/10.1177/0361198119848699.

R Core Team. 2021. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/.

Singleton, Patrick A., and Ferdousy Runa. 2021. “Pedestrian Traffic Signal Data Accurately Estimates Pedestrian Crossing Volumes.” Transportation Research Record: Journal of the Transportation Research Board 2675 (6): 429–40. https://doi.org/10.1177/0361198121994126.

Pedestrian Traffic
Estimation for Liquidation
Costs

Author: Josh Roll
Unit: ODOT Research Unit
Email: josh.f.roll@odot.oregon.gov
Github Handle: JoshRoll

1 Introduction

2 Data

2.1 Observed Traffic Counts

2.2 Push Button Actuations from Traffic Signals

2.3 Land Use - Place Type

3 Methods

3.1 Development and Application of Hourly Adjustment Models

3.1.1 Linear Model

3.1.2 Cluster Analysis

3.2 Model Application

3.3 Estimating Annual Average Pedestrian Daily Traffic (AADPT)

3.3.1 Day-of-Year (DOY) Expansion Factors

3.3.2 Matching SDCs to Permanent Count Sites

4 Final AADPT Results

4.1 Chart Results

4.2 Map Results

4.3 Table Results

References

Pedestrian Traffic Estimation for Liquidation Costs

Author: Josh Roll Unit: ODOT Research Unit Email: josh.f.roll@odot.oregon.gov Github Handle: JoshRoll

1 Introduction

2 Data

2.1 Observed Traffic Counts

2.2 Push Button Actuations from Traffic Signals

2.3 Land Use - Place Type

3 Methods

3.1 Development and Application of Hourly Adjustment Models

3.1.1 Linear Model

3.1.2 Cluster Analysis

3.2 Model Application

3.3 Estimating Annual Average Pedestrian Daily Traffic (AADPT)

3.3.1 Day-of-Year (DOY) Expansion Factors

3.3.2 Matching SDCs to Permanent Count Sites

4 Final AADPT Results

4.1 Chart Results

4.2 Map Results

4.3 Table Results

References

Pedestrian Traffic
Estimation for Liquidation
Costs

Author: Josh Roll
Unit: ODOT Research Unit
Email: josh.f.roll@odot.oregon.gov
Github Handle: JoshRoll