Dataset Description

About the dataset

'data.frame':   5402 obs. of  13 variables:
 $ id                 : int  316 316 316 710 710 710 710 710 710 774 ...
 $ survey_year        : int  2015 2015 2015 2015 2015 2015 2015 2015 2015 2015 ...
 $ name               : chr  "Marathwada Agricultural University, Parbhani" "Marathwada Agricultural University, Parbhani" "Marathwada Agricultural University, Parbhani" "NALANDA UNIVERSITY" ...
 $ country_name       : chr  "AFGHANISTAN" "FIJI" "AFGHANISTAN" "JAPAN" ...
 $ levell             : chr  "Under Graduate" "Post Graduate" "Post Graduate" "Post Graduate" ...
 $ discipline_group_id: int  1 1 1 179 179 179 107 107 107 26 ...
 $ discipline_group   : chr  "Agriculture" "Agriculture" "Agriculture" "Cultural Studies" ...
 $ programme_id       : int  8 111 111 124 124 124 111 111 111 36 ...
 $ programme          : chr  "B.Agri.-Bachelor of Agriculture" "M.Sc.-Master of Science" "M.Sc.-Master of Science" "M.A.-Master of Arts" ...
 $ discipline         : chr  "Agriculture" "Agriculture" "Agriculture" "History" ...
 $ total              : int  2 1 1 1 1 1 2 1 2 1 ...
 $ girls              : int  0 0 0 0 0 0 1 0 1 0 ...
 $ course_mode        : chr  "Regular" "Regular" "Regular" "Regular" ...

Summary of the dataset

       id         survey_year       name           country_name      
 Min.   :  1.0   Min.   :2015   Length:5402        Length:5402       
 1st Qu.:173.0   1st Qu.:2015   Class :character   Class :character  
 Median :379.0   Median :2015   Mode  :character   Mode  :character  
 Mean   :343.9   Mean   :2015                                        
 3rd Qu.:490.0   3rd Qu.:2015                                        
 Max.   :815.0   Max.   :2015                                        
    levell          discipline_group_id discipline_group    programme_id   
 Length:5402        Min.   :  1.00      Length:5402        Min.   :  1.00  
 Class :character   1st Qu.: 20.00      Class :character   1st Qu.: 36.00  
 Mode  :character   Median : 68.00      Mode  :character   Median : 51.00  
                    Mean   : 79.48                         Mean   : 79.82  
                    3rd Qu.:133.00                         3rd Qu.:124.00  
                    Max.   :188.00                         Max.   :201.00  
  programme          discipline            total             girls        
 Length:5402        Length:5402        Min.   :  0.000   Min.   :  0.000  
 Class :character   Class :character   1st Qu.:  1.000   1st Qu.:  0.000  
 Mode  :character   Mode  :character   Median :  1.000   Median :  0.000  
                                       Mean   :  3.056   Mean   :  0.975  
                                       3rd Qu.:  2.000   3rd Qu.:  1.000  
                                       Max.   :297.000   Max.   :266.000  
 course_mode       
 Length:5402       
 Class :character  
 Mode  :character  
                   
                   
                   

Univariate Analysis

Histogram analysis for total students

Histogram analysis for female students

Histogram for Female Students in the M.A. program

Bivariate analysis

Boxplot for Acadamic levels

Multivariate Analysis

Scatterplot for Total Students VS Female Students

---
title: "Assignment"
output: 
  flexdashboard::flex_dashboard:
    orientation: columns
    vertical_layout: scroll
    source_code: embed
    social: menu
---

```{r setup, include=FALSE}
library(flexdashboard)
library(dplyr)
library(ggplot2)
library(MASS)
library(lattice)
library(DT)
```

## Dataset Description {.tabset}
```{r}
Enroll<-read.csv("/cloud/project/datasets/Foreign_Student_Enrollment-2015.csv")
```

### About the dataset
```{r}
str(Enroll)
```

### Summary of the dataset
```{r}
summary(Enroll)
```
```{r}
new_attribute<-Enroll %>% mutate(Percentage_of_girls=girls/total*100)
```

## Univariate Analysis {.tabset}
### Histogram analysis for total students
```{r}
ggplot(Enroll, aes(x = total)) + geom_histogram(binwidth = 10, fill = 'lightgreen', color = 'black') +
  labs(title = 'Histogram: Total Students Distribution', 
       x = 'Total Students', 
       y = 'Frequency') + theme_minimal()
```

### Histogram analysis for female students
```{r}
ggplot(Enroll, aes(x = girls)) + geom_histogram(binwidth = 15, fill = 'yellow', color = 'black') +
  labs(title = 'Histogram: Female Students Distribution', 
       x = 'Female Students', 
       y = 'Frequency') + theme_minimal()
```

```{r}
ma_enroll <- subset(Enroll, programme == "M.A.-Master of Arts")
```

### Histogram for Female Students in the M.A. program
```{r}
ggplot(ma_enroll, aes(x = girls)) + geom_histogram(binwidth = 3, fill = 'lightpink', color = 'black') +
  labs(title = 'Histogram: Female Students in M.A. Program', 
       x = 'Female Students', 
       y = 'Frequency') + theme_minimal()
```

## Bivariate analysis 
### Boxplot for Acadamic levels
```{r}
ggplot(Enroll, aes(x = levell, y = total)) + geom_boxplot(fill = 'lightblue', color = 'black') +
  labs(title = 'Boxplot: Total Students by Academic Level', 
       x = 'Academic Level', 
       y = 'Total Students') + theme_minimal() +
  theme(axis.text.x = element_text(angle = 90, hjust=1))
```

## Multivariate Analysis
### Scatterplot for Total Students VS Female Students
```{r}
ggplot(Enroll , aes(x = total, y = girls)) + geom_point(color = 'red', alpha = 0.6) +
  labs(title = 'Scatter Plot: Total Students vs Female Students', 
       x = 'Total Students', 
       y = 'Female Students') + theme_minimal()
```