Instructions:
library(readxl)
Road= read.csv("C:\\Users\\etfie\\OneDrive\\Data Visualization\\Road construction bids.csv")
There are 235 bids in this data set.
library(dplyr)
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
## filter, lag
## The following objects are masked from 'package:base':
##
## intersect, setdiff, setequal, union
newdata<- summarize(Road, mean_ActualCost = mean(Actual_Cost, na.rm = TRUE),mean_DoTEstimate = mean(DoT_Estimate, na.rm = TRUE))
newdata
## mean_ActualCost mean_DoTEstimate
## 1 1268.715 1347.077
library(dplyr)
newdata <- summarize(Road,
mean_ActualCost = mean(Actual_Cost, na.rm = TRUE),
mean_DoTEstimate = mean(DoT_Estimate, na.rm = TRUE))
newdata$difference <- newdata$mean_DoTEstimate / newdata$mean_ActualCost
newdata<- summarize(Road,
mean_ActualCost = mean(Actual_Cost, na.rm = TRUE))
newdata
## mean_ActualCost
## 1 1268.715
library(dplyr)
newdata<- summarize(Road,
mean_DoTEstimate = mean(DoT_Estimate, na.rm = TRUE))
newdata
## mean_DoTEstimate
## 1 1347.077
difference
percent_difference = difference/cost
What is the mean difference between large and small projects? What does this tell us?
What is the mean percent difference between large and small projects? What does this tell us?
Plot a graph of the difference (between estimate and actual cost) vs. actual costs. Make your graph as descriptive as possible, adding smoothing, color grouping, scales, and labels to best present the data.
Plot a graph of the percent difference (between estimate and actual cost) vs. actual costs. Make your graph as descriptive as possible, adding color, grouping, scales, and labels to best present the data.
Summarize what these graphs show us.