1. Introduction

I propose the following 10 questions based on my own understanding:

-1. what is the average net price(NetPrice) for all schools -2.what is the median admission rate (AdmitRate) of colleges across various regions -3.how does the completion (CompRate) vary across public and private schools -4.what is the distribution of average SAT scores(AvgSAT) among college -5.what is the correlation between the average cost and the average debt of students -6.what is the median faculty salary in urban versus rural college settings -7. what is the average percent of female student across different regions -8. how does in-state tuition compared to out-of-state tuition for colleges -9.what is the standard deviation of undegraduate enrollement across all colleges -10.what is the correlation between median ACT score and admission rate

2.Analysis

we will explore the questions in detail

college = read.csv("https://www.lock5stat.com/datasets3e/CollegeScores4yr.csv")
head(college)
##                                  Name State     ID Main
## 1            Alabama A & M University    AL 100654    1
## 2 University of Alabama at Birmingham    AL 100663    1
## 3                  Amridge University    AL 100690    1
## 4 University of Alabama in Huntsville    AL 100706    1
## 5            Alabama State University    AL 100724    1
## 6           The University of Alabama    AL 100751    1
##                                                                Accred
## 1 Southern Association of Colleges and Schools Commission on Colleges
## 2 Southern Association of Colleges and Schools Commission on Colleges
## 3 Southern Association of Colleges and Schools Commission on Colleges
## 4 Southern Association of Colleges and Schools Commission on Colleges
## 5 Southern Association of Colleges and Schools Commission on Colleges
## 6 Southern Association of Colleges and Schools Commission on Colleges
##   MainDegree HighDegree Control    Region Locale Latitude Longitude AdmitRate
## 1          3          4  Public Southeast   City 34.78337 -86.56850    0.9027
## 2          3          4  Public Southeast   City 33.50570 -86.79935    0.9181
## 3          3          4 Private Southeast   City 32.36261 -86.17401        NA
## 4          3          4  Public Southeast   City 34.72456 -86.64045    0.8123
## 5          3          4  Public Southeast   City 32.36432 -86.29568    0.9787
## 6          3          4  Public Southeast   City 33.21187 -87.54598    0.5330
##   MidACT AvgSAT Online Enrollment White Black Hispanic Asian Other PartTime
## 1     18    929      0       4824   2.5  90.7      0.9   0.2   5.6      6.6
## 2     25   1195      0      12866  57.8  25.9      3.3   5.9   7.1     25.2
## 3     NA     NA      1        322   7.1  14.3      0.6   0.3  77.6     54.4
## 4     28   1322      0       6917  74.2  10.7      4.6   4.0   6.5     15.0
## 5     18    935      0       4189   1.5  93.8      1.0   0.3   3.5      7.7
## 6     28   1278      0      32387  78.5  10.1      4.7   1.2   5.6      7.9
##   NetPrice  Cost TuitionIn TuitonOut TuitionFTE InstructFTE FacSalary
## 1    15184 22886      9857     18236       9227        7298      6983
## 2    17535 24129      8328     19032      11612       17235     10640
## 3     9649 15080      6900      6900      14738        5265      3866
## 4    19986 22108     10280     21480       8727        9748      9391
## 5    12874 19413     11068     19396       9003        7983      7399
## 6    21973 28836     10780     28100      13574       10894     10016
##   FullTimeFac Pell CompRate Debt Female FirstGen MedIncome
## 1        71.3 71.0    23.96 1068   56.4     36.6      23.6
## 2        89.9 35.3    52.92 3755   63.9     34.1      34.5
## 3       100.0 74.2    18.18  109   64.9     51.3      15.0
## 4        64.6 27.7    48.62 1347   47.6     31.0      44.8
## 5        54.2 73.8    27.69 1294   61.3     34.3      22.1
## 6        74.0 18.0    67.87 6430   61.5     22.6      66.7

Q1: what is the average net price(NetPrice) for all schools

mean(college$NetPrice, na.rm = TRUE)
## [1] 19886.82

Q2: what is the median admission rate (AdmitRate)

median(college$AdmitRate, na.rm = TRUE)
## [1] 0.69505

Q3: how does the completion (CompRate) vary across public and private schools

tapply(college$CompRate,college$Control,mean, na.rm = TRUE)
##  Private   Profit   Public 
## 55.68127 29.41672 50.20110

Q4: what is the distribution of average SAT scores(AvgSAT) among college

hist(college$AvgSAT, main = " histogram of SAT scores", xlab = "SAT scores", col = "green")

Q5:what is the correlation between the average cost(cost) and the average debt (Debt) of students

cor(college$Cost,college$Debt, use = "complete.obs")
## [1] -0.2144525

Q6:what is the median faculty salary in urban versus rural college settings

tapply(college$FacSalary,college$Locale,median, na.rm = TRUE)
##   City  Rural Suburb   Town 
## 7506.5 6058.0 7518.0 6768.0

Q7: what is the average percent of female student across different regions

tapply(college$Female,college$Region , mean, na.rm= TRUE )
##   Midwest Northeast Southeast Territory      West 
##  58.73613  59.44461  59.47673  56.61702  59.86353

Q8:how does in-state tuition compared to out-of-state tuition for colleges

summary(college$TuitionIn)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##     480    9617   17662   21949   31958   88000      94
summary(college$TuitonOut)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##     480   16221   23606   25337   33331   88000      94

Q9:what is the standard deviation of undegraduate enrollement(Enrollement) across all colleges

sd(college$Enrollment, na.rm = TRUE)
## [1] 7473.072

Q10:what is the correlation between average combined SAT score and admission rate

cor(college$AvgSAT,college$AdmitRate, use = "complete.obs")
## [1] -0.4221255

summary