Urban Analytics Final Project

Jacob McGhee

2024-12-02


Analyzing the Impact of Commercial Freight Vehicles on Traffic Safety in Metro Atlanta

Background

As an Atlanta resident, I have frequently experienced the heavy amounts of highway traffic that the metro area is well known for, and I’ve seen the many accidents that occur on our highways. Thinking deeper about what could be the causes of or possible solutiuons to this problem, I hypothesized based on anecdotable evidence that the heavy presence of freight traffic (commercial vehicles, semi trucks) within the metro area could be one of the biggest causes of traffic and accidents. Once, I was even involved in an accident on I-75 just miles from Midtown Atlanta where another driver had swerved into me after merging too late due to being stuck next to a semi truck for too long. I wondered, how often accidents like this occur?

Earlier this semester, I read that GDOT is working on a Commercial Vehicle Lane (CVL) Project 1 that would separate truck traffic into dedicated lanes alongside I-75 North between Macon and McDonough. Upon researching this concept further, I found other examples of this type of system from across the world. In New Jersey, a study 2 found that accidents were more likely overall in mixed-use lanes than in segregated car-only lanes, and trucks were involved in a disproportionally high level of accidents in the mixed-use lanes. A study from Dallas, Texas 3 used a simulation model to predict that segregated truck-only lanes could reduce travel times in all lanes. A rendered photo of GDOT’s proposed Commercial Vehicle Lanes can be seen below.

GDOT’s proposed Commercial Vehicle Lanes
GDOT’s proposed Commercial Vehicle Lanes

Here in Atlanta, regulations prohibit trucks from using the interstates and highways within the perimeter (I-285), which heavily reduces the number of trucks on those roads, and keeps truck flow very high on the I-285 bypass. However, as the metro has grown immensely in the past few decades, I-285 no longer serves as an effective “bypass” of the city. The urbanized region has extended far beyond the I-285 boundary and deep into the suburban counties surrounding the city. And areas like Cumberland & Perimeter, located near interchanges of I-285, have grown into central business districts that generate heavy amounts of commuter and consumer traffic. Therefore, I-285 is used very heavily today for local traffic, in addition to also being the only bypass option for through traffic like trucks. I suggest that given the sprawl of the city, we must think about how to better serve the needs of metro Atlanta residents (local traffic), and projects like Commercial Vehicle Lanes or truck bypasses could be a solution. In this project, though, I will look deeper into just one aspect of this larger problem: road safety.

Research Goals

In this project, I wish to better understand the impact that commercial vehicles have on the safety of drivers in the Atlanta metro, and if the implementation of truck-only lanes (as seen in GDOT’s upcoming I-75 Commercial Vehicles Lanes project) or a truck bypass of the entire metro could potentially lead to safer driving conditions for metro Atlanta residents. To do so, I will be analyzing accident data from different segments of highway around Atlanta. I would like to find out not only the impact that trucks have when they are involved in accidents, but also determine if there are differences in the frequency of and severeness of all accidents on routes with varying rates of truck traffic flow. In doing so, I am essentially trying to find out the difference in outcomes between routes like GA400 & any highway within the perimeter (I-285) where truck flow is legally limited, and routes like I-285 and the highways the extend outwards from it where truck flow is much higher. With this information, I would like to model the difference in projected accident frequency and severeness in the event that truck flow was reduced in the metro through one of the methods mentioned above.


Importing, Cleaning, Merging, & Calculating the Data


Importing the Accident Report Data

I used GDOT’s Crash Data Dashboard to download a dataset including all accidents reported in the Atlanta Metro between 2021 & 2023. In the code below, I cleaned the data for analysis.
date_time county road fatalities serious_injuries visible_injuries complaint_injuries num_vehicles latitude longitude
03/20/2023 06:43 AM Jackson Sr 60 0 0 0 1 2 34.12859 -83.71811
03/29/2023 09:14 PM Fulton Woodstock Rd 0 0 0 0 2 34.05890 -84.38351
03/24/2023 06:45 PM DeKalb Chamblee Dunwoody Rd 0 0 0 0 2 33.95150 -84.33661
03/24/2023 08:45 AM Fulton I 75 0 0 0 0 2 33.74518 -84.39013
03/25/2023 07:20 PM Hall Capitola Farm Rd 0 0 1 2 2 34.15721 -83.89668

In the following code, I spatially joined both sf dataframes to result in a final dataframe with a binary column ‘involved_truck’ that specifies whether a truck was involved or not in all of the accidents reported in the full dataset.

Next, I use a search pattern to filter the dataset to only include data points that occur on one of the metro Atlanta highways I am studying. The method I used was very effective at filtering out accidents on other roads, but it is not perfect due to the inconsistency in the naming of roads in GDOT’s data. However, the results of ther filtering are sufficient for their purpose: a map-based visualization of accidents.

date_time county road fatalities serious_injuries visible_injuries complaint_injuries num_vehicles geometry involved_truck
03/24/2023 08:45 AM Fulton I 75 0 0 0 0 2 POINT (-84.39013 33.74518) 1
03/22/2023 08:48 PM Fulton 285 0 0 0 0 2 POINT (-84.42885 33.90928) 1
03/20/2023 12:47 AM Fulton I-85 0 0 0 2 2 POINT (-84.39241 33.79218) 0
03/17/2023 09:15 AM DeKalb 285 0 0 0 0 2 POINT (-84.23135 33.73162) 1
03/10/2023 08:16 AM Fulton 285 0 0 0 0 2 POINT (-84.35884 33.9108) 0

GDOT Traffic Flow Data

I used GDOT’s Traffic Analysis & Data Application to download a dataset that includes average traffic flow (daily number of cars) and the percentage of that flow that is trucks (freight vehicles) at specified sensors alongside different highways in metro Atlanta.

Each point was used to estimate flow & truck percentage values for short segments of highway defined later in this report. To note: each point covers only one direction of the highway, and values for traffic in each direction of various highways was unavailable from GDOT. Therefore, in this report, I am assuming that flow rates are the same in each direction of traffic. In the code below, I specify the specific traffic sensors I will use in my analysis from GDOT’s traffic flow dataset and clean the data from GDOT and specify the sensors useful for the highways I am studying.

A small number of the sensors I’m studying had missing values for the percentage of the flow that was trucks. For these points, I used imputatiuon to estimate the percentage at each point based on values from the same highway but further from the city center and the difference between points near and far from the city on other highways.

id class geometry avg_flow avg_truck_per avg_num_truck
015-0276 1U : Urban Principal Arterial - Interstate POINT (-84.75219 34.22141) 83566.67 25.76667 21521
035-0127 1R : Rural Principal Arterial - Interstate POINT (-84.07545 33.22593) 91766.67 25.53333 23440
063-1192 1U : Urban Principal Arterial - Interstate POINT (-84.3896 33.59806) 196666.67 11.60000 22813
063-1201 1U : Urban Principal Arterial - Interstate POINT (-84.3764 33.64036) 145666.67 15.03333 22034
063-1207 1U : Urban Principal Arterial - Interstate POINT (-84.43306 33.61893) 182333.33 17.40000 31726

Highway Segments

I imported a shapefile of all highways in Georgia for mapping purposes. I then created a linestring spatial feature objects for each segment of highway I will be analyzing.
distance id geometry
5842.47 1 LINESTRING (-84.47792 33.62…
5015.05 2 LINESTRING (-84.37872 33.63…
8052.01 3 LINESTRING (-84.31866 33.67…
19894.57 4 LINESTRING (-84.23868 33.71…
6883.27 5 LINESTRING (-84.27076 33.90…

TomTom Traffic Speed Data

My next data source is TomTom, which measures traffic backups and speeds based on cell phone GPS data. From their website, I specified 44 segments of highway (in both directions of traffic) from across the metro area in order to study the impact of traffic speeds.

First, I read each of the 44 shapefiles from a folder and assigned id’s.

Next, I imported and cleaned a data file that included variable data for each segment.

route_id route route_description time length_m miles avg_time avg_speed differential distance geometry
1 285 Counter, 85 to 75 heavy_truck overnight 5842.47 3.63 00:03:20 65.22 1.00 5842.47 LINESTRING (-84.47792 33.62…
1 285 Counter, 85 to 75 heavy_truck am_rush 5842.47 3.63 00:03:21 64.93 1.00 5842.47 LINESTRING (-84.47792 33.62…
1 285 Counter, 85 to 75 heavy_truck midday 5842.47 3.63 00:03:27 63.20 1.03 5842.47 LINESTRING (-84.47792 33.62…
1 285 Counter, 85 to 75 heavy_truck pm_rush 5842.47 3.63 00:03:38 59.84 1.09 5842.47 LINESTRING (-84.47792 33.62…

Note: The data I gathered is from the entirety of August 2024, as my free trial only allowed for the download of one month’s data.

Note: From the TomTom data, the most useful variable I will include in my models is the speed differential value. In the dataset I downloaded, I gathered travel times (speeds) at four different time periods for each day: morning rush hour (7am-10am), midday (12pm-2pm), evening rush hour (3:30pm-7pm), and overnight (12am-2am). The differential value for each row in the dataset measures the difference in average speed between the given time of day and the base value, defined as the overnight speed. For example, a morning rush value of 1.5 would mean that the overnight speed divided by the morning speed = 1.5. A higher differential value means bigger difference in overnight vs daytime travel times, or in other words, more traffic/delays. This data will allow me to use a measure of traffic as an independent variable in modeling.


Merging the Traffic, Accident & Flow Data,

route_id route route_description time length_m miles avg_time avg_speed differential distance geometry fatalities_mi_yr accidents_mi_yr vehicles_mi_yr injuries_mi_yr avg_truck_per avg_flow fatalities_mi_yr_flow accidents_mi_yr_flow vehicles_mi_yr_flow injuries_mi_yr_flow is_285
1 285 Counter, 85 to 75 heavy_truck overnight 5842.47 3.63 00:03:20 65.22 1.00 5842.47 LINESTRING (-84.47792 33.62… 2.020202 406.9789 829.2011 11.20294 17.4 182333.3 1.11e-05 0.0022321 0.0045477 6.14e-05 1
1 285 Counter, 85 to 75 heavy_truck am_rush 5842.47 3.63 00:03:21 64.93 1.00 5842.47 LINESTRING (-84.47792 33.62… 2.020202 406.9789 829.2011 11.20294 17.4 182333.3 1.11e-05 0.0022321 0.0045477 6.14e-05 1
1 285 Counter, 85 to 75 heavy_truck midday 5842.47 3.63 00:03:27 63.20 1.03 5842.47 LINESTRING (-84.47792 33.62… 2.020202 406.9789 829.2011 11.20294 17.4 182333.3 1.11e-05 0.0022321 0.0045477 6.14e-05 1
1 285 Counter, 85 to 75 heavy_truck pm_rush 5842.47 3.63 00:03:38 59.84 1.09 5842.47 LINESTRING (-84.47792 33.62… 2.020202 406.9789 829.2011 11.20294 17.4 182333.3 1.11e-05 0.0022321 0.0045477 6.14e-05 1

Maps

Because I now have spatial data, the best way to analyze it is by mapping the data to visualize where accidents are occurring, where traffic is backed up, and which segments of highway have a lot of trucks on them!


Mapping Traffic on Atlanta Highways:

In these maps, the truck flow percentage is indicated by segment line width.



By mapping the speed differentials, I can get a better view of where traffic backups occur. I mapped differentials in the evening rush hour for traffic exiting Atlanta and in the morning rush hour for traffic entering Atlanta. I see that the morning differentials are much higher on the highways on the northside of Atlanta than the southside, possibly indicating a larger number of commuters using those roadways. I also see that differentials are far higher on the portions of highway outside of the perimeter, and lower inside the perimeter where truck traffic is limited. In the afternoon, the differentials are more evenly distributed across the region, although you see again that differentials are higher outside of the perimeter than inside. Also, the highest and lowest differentials happen to be on I-85 north of I-285 and on I-85 south of I-285, which is interesting. These two maps indicate that there could certainly be a relationship between the truck percentage on a highway and the buildup of rush hour traffic. Additionally, the fact that differentials are higher going towards the city rather than away could be a result of heavy congeestion at each of the I-285 interchanges, which is also heavily impacted by the number of trucks on I-285. Lastly, I note that speed differentials are actually highest on many portions of I-285, indicating that it is the highway most impacted by traffic delays in the metro.


Mapping Accident Frequency:

In these maps, the truck flow percentage is indicated by segment line width.



These two maps have a few interesting characteristics. In the first, you can see that each of the highways segments with a low truck percentage (<6%) also has a lower than average frequency of accidents. Of segments with a higher truck percentage, some segments have few accidents and other have many; no clear conclusion can be made spatially. The second map shows that highway segments with the highest averade daily traffic flow rates (I-75 in Cobb, I-85 in Gwinnett, I-285 North of I-20) tend to have some of the highest frequencies of accidents as well. From both maps, I can gather that the portion of I-285 between GA400 & I-85 has a strikingly high rate of accidents, which may be explained by the fact that the interchanges at each endpoint of this segment are known as some of the worst bottlenecks in the country (source). Additionally, GA400 has a very low accident rate, which may be explained by the fact that it has much fewer freight vehicles given that it is a state highway, not an interstate. The portions of I-85 & I-75 south of I-285 have much lower accident frequencies than the metro average.


Mapping Injury Frequency:

In this map, the truck flow percentage is indicated by segment line width.

From this map, it is very evident that the frequency of injuries is much higher on I-285 than on other highways, and the southern portion of I-285 has a particularly high frequency of injuries. Additionally, GA400 (both within and outside of the permiter), which has the lowest truck percentage in the metro, has the lowest rate of injuries out of any metro Atlanta highway. Injury frequency is high on most highway segments with a particularly high truck percentage, with I-75 south of the I-675 intersection being an exception; this could possibly be explained by the fact that this segment begins far outside of I-285 and extends further from the city center than any other segment. Another interesting characteristic is that while accidents are more frequent on the eastern half of I-285 than the western half, the opposite is true for injuries.


Mapping Fatality Frequency:

In this map, the truck flow percentage is indicated by segment line width.

The results of the third map are very interesting. Again, you see a very low rate of fatalities on GA400. I also see a low rate of fatalities on the western portion of I-285 despite it having a high rate of injuries. South of the I-675 interchange, I-75 does not have a particularly low rate of fatalities despite having a low rate of accidents & injuries; this highway segment has the highest truck percentage in the metro. I also see that, similarly to accident rates, the southern section of I-285 (which has a very high truck percentage) also has a very high frequency of fatalities. Inside of the perimeter, where truck traffic is limited, fatality rates tend to be lower, with the exception of the portion of I-85 between GA400 & I-285.


Analysis


Are Trucks More Likely to be Involved in Accidents?

segment involved_truck total_accidents total_fatalities total_injuries total_vehicles geometry avg_truck_per avg_flow
1 0 6336 20 204 12640 MULTIPOINT ((-84.47491 33.6… 17.40000 182333.3
1 1 2528 24 40 5420 MULTIPOINT ((-84.47493 33.6… 17.40000 182333.3
10 0 11532 24 208 23640 MULTIPOINT ((-84.46976 33.8… 14.58333 168000.0
10 1 5812 20 92 12272 MULTIPOINT ((-84.46967 33.8… 14.58333 168000.0
11 0 7756 16 120 16300 MULTIPOINT ((-84.41739 33.9… 10.00000 211666.7
11 1 2268 8 36 4928 MULTIPOINT ((-84.41746 33.9… 10.00000 211666.7
##                  segment            avg_truck_per  accidents_percent_truck 
##                 22.50000                 11.02997                 22.08409 
##   injuries_percent_truck fatalities_percent_truck   vehicles_percent_truck 
##                 15.84416                 21.52520                 22.77156

This result shows that while the average segment of highway in the study has a truck flow percentage of only 11%,

  • 22% of all accidents involved a truck
  • 16% of all injuries were from accidents involving a truck
  • 22% of all fatalities were from accidents involving a truck
  • 23% of all vehicles involved in an accident were in an accident involving a truck

Below, I will show these values on a chart:

These results show that on metro Atlanta highways, trucks were involved a disproportionately high rate of accidents compared to their traffic exposure, and they were also disproportionately involved in more serious accidents (those with injuries, fatalities, and multiple vehicles).


Do Accidents Involving Trucks Have More Serious Outcomes than Accidents Not Involving Trucks on Metro Atlanta Highways?

To answer this question, I will use the Wilcoxon Signed-Rank Test to identify if the number of vehicles involved in an accident, the number of serious injuries in an accident, or the number of fatalities in an accident differs between the two groups: accidents involving and not involving trucks.


Number of Vehicles per Accident:
## 
##  Wilcoxon rank sum test with continuity correction
## 
## data:  num_vehicles by involved_truck
## W = 1733508958, p-value < 2.2e-16
## alternative hypothesis: true location shift is not equal to 0
Truck Involved? Number of Vehicles geometry
No 2.021606 MULTIPOINT ((-83.72 34.4512…
Yes 2.087505 MULTIPOINT ((-83.64458 34.4…

There is an average of 2.02 vehicles involved in accidents without trucks, and 2.09 vehicles involved in accidents with trucks. Although this difference in means is small, I have a very large number of data points, and therefore the Wilcox test results in a p-value much lower than 0.05, which allows us to conclude with strong statistical significance that the average number of vehicles involved in each crash involving trucks is higher than the average number of vehicles involved in each crash not involving trucks.


Number of Serious Injuries:
## 
##  Wilcoxon rank sum test with continuity correction
## 
## data:  serious_injuries by involved_truck
## W = 1839658436, p-value = 3.822e-09
## alternative hypothesis: true location shift is not equal to 0
Truck Involved? Number of Serious Injuries geometry
No 0.0165838 MULTIPOINT ((-83.72 34.4512…
Yes 0.0110714 MULTIPOINT ((-83.64458 34.4…

There is an average of 0.011 serious injuries involved in accidents with trucks, and 0.017 vehicles involved in accidents without trucks. This would indicate that serious injuries are 50% more likely in accidents not involving trucks. I have a very large number of data points, and therefore the Wilcox test results in a p-value much lower than 0.05, which allows us to conclude with strong statistical significance that the average number of injuries involved in each crash involving trucks is lower than the average number of injuries involved in each crash not involving trucks.


Number of Fatalities:
## 
##  Wilcoxon rank sum test with continuity correction
## 
## data:  fatalities by involved_truck
## W = 1.832e+09, p-value = 0.9066
## alternative hypothesis: true location shift is not equal to 0
Truck Involved? Number of Fatalities geometry
No 0.0038224 MULTIPOINT ((-83.72 34.4512…
Yes 0.0037010 MULTIPOINT ((-83.64458 34.4…

The average number of fatalities involved in accidents with or without trucks is almost identical. The Wilcox test shows no statistically significant difference between the groups. The results of this may be skewed by the fact that despite having a very large dataset, the number presence of fatal accidents is uncommon.


Linear Regression:

I created four linear regression models to measure the impact of speed differentials, total flow rate, and the truck percentage on the frequency of accidents, serious injuries, fatalities, and the number of vehicles involved in accidents. Due to the fact that I’m analyzing only 44 sections of roadway, I’m considering p-values below 0.10 to be statistically significant.


Accident Frequency:
Linear Regression Coefficients
Variable Estimate Std. Error t value p-value
(Intercept) -0.00041 0.00029 -1.41123 0.16060
Truck Percentage 0.00002 0.00001 1.86501 0.06447
Average Daily Traffic Flow 0.00000 0.00000 4.58641 0.00001
Speed Differential 0.00071 0.00019 3.73550 0.00028
R-Squared 0.27255 NA NA NA

The first regression model identifies which independent variables impact the frequency of accidents. The results show a statistically significant positive correlation between all three independent factors and the frequency of accidents. This means that highways with more severe daytime delays, a higher number of vehicles on the road in general, or a higher truck percentage are more likely to see car accidents. The R-Squared value of 0.27 is not all that strong, which suggests that there could be other factors not included in the model that may also have an effect. It is important to note, though, that the R-Squared value may be lower due to the fact that I am studying only 44 highway segments. The following scatterplot also displays the relationship between the truck percentage and the frequency of accidents:


Injury Frequency:
Linear Regression Coefficients
Variable Estimate Std. Error t value p-value
(Intercept) 0.00002 1e-05 2.97756 0.00348
Truck Percentage 0.00000 0e+00 3.63478 0.00040
Average Daily Traffic Flow 0.00000 0e+00 0.85529 0.39399
Speed Differential 0.00000 0e+00 -0.86087 0.39092
R-Squared 0.09631 NA NA NA

The second regression model identifies which independent variables impact the frequency of serious injuries. The results show a statistically significant positive correlation between the truck percentage and the frequency of serious injuries. This means that serious injuries occur more frequently as the percentage of trucks on a highway increases. The R-Squared value of 0.09 is weak, which suggests that there could be other factors not included in the model that may also have an effect. It is important to note, though, that the R-Squared value may be lower due to the fact that I am studying only 44 highway segments. The following scatterplot displays the relationship between the truck percentage and the frequency of serious injuries:


Fatality Frequency:
Linear Regression Coefficients
Variable Estimate Std. Error t value p-value
(Intercept) 0.00001 0 2.70891 0.00767
Truck Percentage 0.00000 0 2.49192 0.01398
Average Daily Traffic Flow 0.00000 0 1.07960 0.28235
Speed Differential 0.00000 0 -1.54470 0.12489
R-Squared 0.06059 NA NA NA

The third regression model identifies which independent variables impact the frequency of fatalities. The results show a statistically significant positive correlation between the truck percentage and the frequency of fatalities. This means that serious injuries occur more frequently as the percentage of trucks on a highway increases. The R-Squared value of 0.06 is weak, which suggests that there could be other factors not included in the model that may also have an effect. It is important to note, though, that the R-Squared value may be lower due to the fact that I am studying only 44 highway segments. The following scatterplot displays the relationship between the truck percentage and the frequency of fatalities:


Number of Vehicles:
Linear Regression Coefficients
Variable Estimate Std. Error t value p-value
(Intercept) -0.00107 0.00062 -1.71539 0.08869
Truck Percentage 0.00004 0.00002 1.81322 0.07214
Average Daily Traffic Flow 0.00000 0.00000 4.81894 0.00000
Speed Differential 0.00152 0.00040 3.76203 0.00026
R-Squared 0.28360 NA NA NA

The fourth regression model identifies which independent variables impact the number of cars involved in accidents. The results show a statistically significant positive correlation between all three independent factors and the number of cars involved in accidents. This means that highways with more severe daytime delays, a higher number of vehicles on the road in general, or a higher truck percentage are more likely to see car accidents involving a large number of vehicles. The R-Squared value of 0.28 is not very strong, which suggests that there could be other factors not included in our model that may also have an effect. It is important to note, though, that the R-Squared value may be lower due to the fact that I am studying only 44 highway segments.


Predictions Using the Linear Models

For this section of the project, I focused on estimating changes in the frequency of accidents, serious injuries, and fatalities on I-285 if the number of trucks on the road was reduced by 75 percent. I chose the 75% figure as it would reduce the percentage of trucks as a share of total traffic flow on the road to about 3-4 percent, which would be inline with the current rates found on highways within the perimeter, where there are regulations reducing truck usage. I will note that these predictions are not perfect, as they only predict changes based on a reduction in the percentage trucks based on the linear models. The model assumes that the total traffic flow and speed differentials will remain the same, when in reality, I would expect these values to be slightly lower if trucks were diverted from the road. Due to this, the following results are likely slight underestimations.


Predicting Annual Accidents:
## [1] 0.2075943
## [1] -7863.456

The model output finds that in the event that trucks were to be diverted off of I-285, there would be a 21% reduction in the frequency of accidents, which would amount to roughly 7,863 fewer accidents annually. The following bar chart shows an estimate of the current number of accidents occurring annually and the predicted value from the linear regression model for each segment of I-285.


Predicting Annual Injuries:
## [1] 0.3467532
## [1] -194.8694

The model output finds that in the event that trucks were to be diverted off of I-285, there would be a 35% reduction in the number of serious injuries, which would amount to roughly 195 fewer serious injuries annually. The following bar chart shows an estimate of the current number of serious injuries occurring annually and the predicted value from the linear regression model for each segment of I-285.


Predicting Annual Fatalities:
## [1] 0.1848804
## [1] -33.36767

The model output finds that in the event that trucks were to be diverted off of I-285, there would be an 18% reduction in the number of fatalities, which would amount to roughly 33 lives saved every year. The following bar chart shows an estimate of the current number of fatalities occurring annually and the predicted value from the linear regression model for each segment of I-285.

Note: In this portion, I selected all segments of I-285, covering the entire perimeter. However, when drawing the segments, I did not include the many interchanges along I-285, as this would certainly alter the data. Because those short (but certainly dangerous) sections of highway are not present, the reductions in accidents, injuries, and fatalities that I calculated are actually underestimates.


Conclusions

From the analyses I conducted, I learned a significant amount about the impact of trucks on metro Atlanta highways.

  • After mapping the data, I saw that speed differential values showed a large difference between routes inside and outside the perimeter, especially on the northside of Atlanta and during the morning rush hour. On routes like I-75 & I-85 outside of the perimeter, speeds slowed down significantly during rush hour, however, the slowdown was much less pronounced on those same roads within the perimeter. This suggests that, amongst other factors, the presence of a higher truck flow outside of the perimeter could be making morning rush hour traffic much worse for commuters.
  • Analysis of the mapped data also found that I-285 as a whole seems to be one of the most dangerous highways in the metro, and the southern portion (which has a very high truck percentage) has a notably high frequency of fatalities. I also see that in all measures, GA400 (a main highway that sees heavy usage amongst local traffic but very limited truck usage) seems to have much lower rates of accidents, severe outcomes, and traffic slowdowns compared to the interstate highways in the region.
  • When analyzing the frequency & severity of accidents on different segments of highway, I saw that trucks were disproportionally more likely to be in accidents, accidents with multiple vehicles, and accidents that caused injuries or fatalities, in comparison to their share of traffic flow.
  • The linear regression models I ran show that the frequency of accidents and the number of vehicles involved in those accidents on a given highway segment are both positively influenced by the total traffic flow, the percentage of that flow that is trucks, and the average speed differential. However, only the truck percentage was found to be positively correlated with the number of fatalities and serious injuries. These results show that although other factors are likely at play, there is strong evidence that a large number of trucks on a given highway segment is associated with a higher rate of accidents and severe outcomes.
  • Making predictions with those linear models, I predicted how traffic safety would differ on I-285 if truck flow was reduced by 75% to be in line with the levels seen within the perimeter. The models predict a very significant 21% drop in accidents, 35% drop in serious injuries, and 18% drop in fatalities. This gives more weight to the hypothesis that the presence of a large number of trucks on metro Atlanta highways leads to more accidents and severe outcomes.

Limitations & Future Research

From this research, I gathered significant evidence that the presence of commercial vehicles on metro Atlanta highways is linked to higher accident rates and severe outcomes. However, the R-Squared values from the linear models also show that there are likely many other variables at play as well. In this study, I did not fully address the impact of the total daily traffic flow, the number of lanes on a road, the frequency of exits in a stretch of road, whether a road is in need of repaving or not, the presence of HOV or express lanes, the speed limit, or other possible variables on accident frequency and severity. Additionally, while I believe the speed differential data was a very useful addition to the linear models, I did not have an exact measure of how often or how severe traffic congestion is on different highways, or whether accidents are more frequent during times of congestion or when traffic is freely flowing.

In the future, it would be interesting to identify the direct impact that freight traffic has on causing traffic congestion and delays, and from that, determining if congestion leads to more frequent or severe accidents. I also would like to extend my research into the impacts that trucks within the metro area have on other aspects of quality of life. For example, I would love to compare Air Quality Index data in areas alongside high-truck-use routes like interstates and areas alongside limited-truck-use routes like GA400, and potentially model improvements to air quality in communities near the interstate if truck traffic was diverted outside of the metro. Additionally, I could study whether air quality would improve if Commercial Vehicle Lanes werre implemented, by modeling the reduction in fuel consumption in the case of lesser congestion and reduced travel times. This useful air quality data could be mapped alongside census data to provide the ability to determine the characteristics (income, race, etc) of communities most impacted by air pollution currently caused by trucks driving through urban Atlanta.


References

  1. GDOT’s Commercial Vehicle Lane Project. https://0014203-gdot.hub.arcgis.com
  2. New Jersey Turnpike Study: Does Separating Trucks from Other Traffic Improve Overall Safety? https://journals.sagepub.com/doi/10.1177/0361198105192200120
  3. Dallas, Texas Study: Analyzing the Impact of Strategies for Separating Trucks and Cars: IH 20 Truck-Only Lane Case Study. https://journals.sagepub.com/doi/10.1177/03611981231166386