1. Introduction

We use the data from…

I propose the following 10 questions based on my own understanding of the data.

  1. What is the average SAT score across all schools?
  2. What percentage of students are first-generation (FirstGen) across schools?
  3. What is the average latitude of schools in each region?
  4. What is the standard deviation of admission rates by region?
  5. What is the distribution of schools across accreditation agencies?
  6. How do admission rates vary by control type (Private, Public, Profit)?
  7. What is the correlation between admission rate and latitude?
  8. What proportion of schools offer graduate degrees?
  9. What is the distribution of average SAT scores (AvgSAT) among colleges?
  10. What is the correlation between median ACT score and admission rate?

Analysis

We will explore the questions in detail.

College <- read.csv("https://www.lock5stat.com/datasets3e/CollegeScores4yr.csv")
head(College)
##                                  Name State     ID Main
## 1            Alabama A & M University    AL 100654    1
## 2 University of Alabama at Birmingham    AL 100663    1
## 3                  Amridge University    AL 100690    1
## 4 University of Alabama in Huntsville    AL 100706    1
## 5            Alabama State University    AL 100724    1
## 6           The University of Alabama    AL 100751    1
##                                                                Accred
## 1 Southern Association of Colleges and Schools Commission on Colleges
## 2 Southern Association of Colleges and Schools Commission on Colleges
## 3 Southern Association of Colleges and Schools Commission on Colleges
## 4 Southern Association of Colleges and Schools Commission on Colleges
## 5 Southern Association of Colleges and Schools Commission on Colleges
## 6 Southern Association of Colleges and Schools Commission on Colleges
##   MainDegree HighDegree Control    Region Locale Latitude Longitude AdmitRate
## 1          3          4  Public Southeast   City 34.78337 -86.56850    0.9027
## 2          3          4  Public Southeast   City 33.50570 -86.79935    0.9181
## 3          3          4 Private Southeast   City 32.36261 -86.17401        NA
## 4          3          4  Public Southeast   City 34.72456 -86.64045    0.8123
## 5          3          4  Public Southeast   City 32.36432 -86.29568    0.9787
## 6          3          4  Public Southeast   City 33.21187 -87.54598    0.5330
##   MidACT AvgSAT Online Enrollment White Black Hispanic Asian Other PartTime
## 1     18    929      0       4824   2.5  90.7      0.9   0.2   5.6      6.6
## 2     25   1195      0      12866  57.8  25.9      3.3   5.9   7.1     25.2
## 3     NA     NA      1        322   7.1  14.3      0.6   0.3  77.6     54.4
## 4     28   1322      0       6917  74.2  10.7      4.6   4.0   6.5     15.0
## 5     18    935      0       4189   1.5  93.8      1.0   0.3   3.5      7.7
## 6     28   1278      0      32387  78.5  10.1      4.7   1.2   5.6      7.9
##   NetPrice  Cost TuitionIn TuitonOut TuitionFTE InstructFTE FacSalary
## 1    15184 22886      9857     18236       9227        7298      6983
## 2    17535 24129      8328     19032      11612       17235     10640
## 3     9649 15080      6900      6900      14738        5265      3866
## 4    19986 22108     10280     21480       8727        9748      9391
## 5    12874 19413     11068     19396       9003        7983      7399
## 6    21973 28836     10780     28100      13574       10894     10016
##   FullTimeFac Pell CompRate Debt Female FirstGen MedIncome
## 1        71.3 71.0    23.96 1068   56.4     36.6      23.6
## 2        89.9 35.3    52.92 3755   63.9     34.1      34.5
## 3       100.0 74.2    18.18  109   64.9     51.3      15.0
## 4        64.6 27.7    48.62 1347   47.6     31.0      44.8
## 5        54.2 73.8    27.69 1294   61.3     34.3      22.1
## 6        74.0 18.0    67.87 6430   61.5     22.6      66.7

Q1: What is the average SAT score across all schools?

mean(College$AvgSAT, na.rm = TRUE)

The mean AvgSAT among schools that report one is about 1135, the same Mean that summary(College$AvgSAT) reports under Q9.
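The call above relies on na.rm = TRUE, so it helps to see what that argument does and how many values the mean is actually based on. The following is a minimal sketch on a made-up vector (the values are hypothetical and not taken from College):

```r
# Hypothetical toy scores (made-up values); NA marks a school with no reported value
x <- c(1000, 1200, NA, 1100, NA)

mean(x)                  # NA: a single missing value makes the plain mean missing
mean(x, na.rm = TRUE)    # mean of the reported values only
sum(!is.na(x))           # how many values the mean is based on
mean(is.na(x))           # share of values that are missing
```

Reporting the count of non-missing values next to the mean makes clear how much of the data the figure describes.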

Q2: What percentage of students are first-generation (FirstGen) across schools?

mean(College$FirstGen, na.rm = TRUE)

FirstGen appears to record the percentage of first-generation students at each school, so this is the average of the school-level percentages.
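Since FirstGen is recorded per school, a plain average over schools counts a small college as much as a large one. One way to approximate the share among all students is to weight each school by its enrollment. The sketch below assumes Enrollment is a suitable weight, uses made-up values, and defines a hypothetical helper weighted_pct:

```r
# Hypothetical toy data with the same column names as College (values are made up)
toy <- data.frame(
  FirstGen   = c(40, 20, NA, 30),
  Enrollment = c(1000, 3000, 500, NA)
)

# Enrollment-weighted mean of a per-school percentage,
# using only rows where both the percentage and the weight are present
weighted_pct <- function(pct, weight) {
  ok <- complete.cases(pct, weight)
  weighted.mean(pct[ok], weight[ok])
}

unweighted <- mean(toy$FirstGen, na.rm = TRUE)            # every school counts equally
weighted   <- weighted_pct(toy$FirstGen, toy$Enrollment)  # larger schools count more
```

On the real data the same helper could be called as weighted_pct(College$FirstGen, College$Enrollment).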

Q3: What is the average latitude of schools in each region?

tapply(College$Latitude, College$Region, mean, na.rm = TRUE)
##   Midwest Northeast Southeast Territory      West 
##  41.45213  41.31965  34.01651  18.11692  36.58587

Q4: What is the standard deviation of admission rates by region?

tapply(College$AdmitRate, College$Region, sd, na.rm = TRUE)
##   Midwest Northeast Southeast Territory      West 
## 0.1758712 0.2284142 0.1995188 0.2168663 0.2225495
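A standard deviation per group is easier to interpret next to the number of schools it is based on, since a small group can give an unstable or undefined value. The following is a minimal sketch on made-up data (the toy values are hypothetical, only the column names match College):

```r
# Hypothetical toy data (made-up values) with the same column names as College
toy <- data.frame(
  Region    = c("West", "West", "West", "Midwest", "Midwest", "Territory"),
  AdmitRate = c(0.50, 0.70, NA, 0.60, 0.80, 0.90)
)

# Standard deviation per region, ignoring missing admission rates
sd_by_region <- tapply(toy$AdmitRate, toy$Region, sd, na.rm = TRUE)

# Number of schools per region that actually report an admission rate
n_by_region <- tapply(!is.na(toy$AdmitRate), toy$Region, sum)

# Side-by-side view; a region with n = 1 has an undefined (NA) standard deviation
cbind(n = n_by_region, sd = sd_by_region)
```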

Q5: What is the distribution of schools across accreditation agencies?

table(College$Accred)
## 
##                                        Accrediting Bureau of Health Education Schools 
##                                                                                     4 
##                          Accrediting Commission for Acupuncture and Oriental Medicine 
##                                                                                     2 
##                                 Accrediting Commission of Career Schools and Colleges 
##                                                                                    20 
##                              Accrediting Council for Independent Colleges and Schools 
##                                                                                     8 
##                                              Association for Bibical Higher Educaiton 
##                                                                                    38 
##                               Association of Advanced Rabbinical and Talmudic Schools 
##                                                                                    56 
##                                         Association of Institutions of Jewish Studies 
##                                                                                     6 
##                                                     Council on Occupational Education 
##                                                                                     1 
##                                             Distance Education Accrediting Commission 
##                                                                                     9 
##                                                                                EXEMPT 
##                                                                                     1 
##                                                            Higher Learning Commission 
##                                                                                   625 
##                                          Middle States Commission on Higher Education 
##                                                                                   379 
##                                             Midwifery Education Accreditation Council 
##                                                                                     1 
##                                    National Association of Schools of Arts and Design 
##                                                                                     4 
##                                              National Association of Schools of Music 
##                                                                                     2 
##                                            New England Commission on Higher Education 
##                                                                                   157 
##                                     Northwest Commission on Colleges and Universities 
##                                                                                    84 
##                                                                                  NULL 
##                                                                                     4 
##                   Southern Association of Colleges and Schools Commission on Colleges 
##                                                                                   452 
##                           Transnational Association of Christian Colleges and Schools 
##                                                                                    29 
## Western Association of Schools and Colleges Senior Colleges and University Commission 
##                                                                                   130
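A long alphabetical table like this is easier to read when sorted by frequency and expressed as percentages. The sketch below uses a made-up vector in place of College$Accred, and it assumes the literal string "NULL" stands for a missing agency, which the source does not confirm:

```r
# Hypothetical toy vector standing in for College$Accred (values are made up)
accred <- c("Agency A", "Agency B", "Agency A", "NULL", "Agency A", "Agency C")

# Treat the literal string "NULL" as missing (an assumption about its meaning)
accred[accred == "NULL"] <- NA

# Counts sorted from most to least common, then converted to percentages
counts <- sort(table(accred), decreasing = TRUE)
shares <- round(100 * prop.table(counts), 1)
shares
```

table() leaves out NA values by default, so the percentages are taken over schools with a named agency.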

Q6: How do admission rates vary by control type (Private, Public, Profit)?

tapply(College$AdmitRate, College$Control, sd, na.rm = TRUE)
##   Private    Profit    Public 
## 0.2150574 0.2424042 0.1837239

Q7: What is the correlation between admission rate and latitude?

cor(College$AdmitRate, College$Latitude, use = "complete.obs")
## [1] 0.08186073

Q8: What proportion of schools offer graduate degrees?

# Assumption: HighDegree == 4 means the highest degree awarded is a graduate degree
mean(College$HighDegree == 4, na.rm = TRUE)

Q9: What is the distribution of average SAT scores (AvgSAT) among colleges?

summary(College$AvgSAT)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.    NA's 
##     564    1045    1121    1135    1198    1558     735
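The five-number summary gives the center and spread, while binned counts show the shape of the distribution. The following is a minimal sketch with made-up scores standing in for College$AvgSAT; the 200-point bin width is an arbitrary choice for illustration:

```r
# Hypothetical toy scores standing in for College$AvgSAT (values are made up)
sat <- c(950, 1010, 1080, 1120, 1150, 1190, 1240, 1400, NA)

# Bin the scores in 200-point intervals; hist() drops NA values itself
# plot = FALSE returns the bin counts without drawing; remove it to see the plot
h <- hist(sat, breaks = seq(800, 1600, by = 200), plot = FALSE)

# One row per bin: the bin's lower edge and how many scores fall in it
data.frame(lower = head(h$breaks, -1), count = h$counts)
```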

Q10: What is the correlation between median ACT score and admission rate?

cor(College$MidACT, College$AdmitRate, use = "complete.obs")
## [1] -0.4227796
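A correlation coefficient on its own does not say how precisely it is estimated. The base R function cor.test() reports a confidence interval and a p-value alongside the estimate. The sketch below runs it on made-up data (the toy values are hypothetical, only the column names match College):

```r
# Hypothetical toy data (made-up values) with the same column names as College
toy <- data.frame(
  MidACT    = c(18, 20, 22, 24, 26, 28, 30, NA),
  AdmitRate = c(0.90, 0.85, 0.80, 0.70, 0.75, 0.50, 0.30, 0.60)
)

# Pearson correlation with a t-test of the null hypothesis of zero correlation;
# cor.test() drops incomplete pairs on its own
res <- cor.test(toy$MidACT, toy$AdmitRate)

res$estimate   # sample correlation r
res$conf.int   # 95% confidence interval for the correlation
res$p.value    # p-value for H0: correlation = 0
```

The same call could be applied to College$MidACT and College$AdmitRate, or to the admission rate and latitude pair from Q7.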

Summary