'data.frame': 423 obs. of 9 variables:
$ Property.ID : num 1e+12 1e+12 1e+12 1e+12 1e+12 ...
$ Education.Institution.Category : chr "School" "School" "School" "School" ...
$ Education.Institution.Maintained.by: chr "Private" "Private" "Corporation" "Corporation" ...
$ Education.Institution.Name : chr "AMRITA FUN SCHOOL" "SINDHI VIDYALAYA SCHOOL" "BABY SCHOOL" "CORPORATION SCHOOL" ...
$ Ward.Number : int 14 14 14 15 15 15 15 15 22 22 ...
$ Address : chr "DR AMBEDKAR ROAD" "DR AMBEDKAR ROAD" "BHARATHIYAR STREET" "MARUTHAMALAI ROAD" ...
$ Length..mtr. : num 92 352.3 134.1 321.9 77.1 ...
$ Area..mtr. : num 522 5276 1008 4766 368 ...
$ Education.Institution.ID : num 1e+12 1e+12 1e+12 1e+12 1e+12 ...
Property.ID Education.Institution.Category
Min. :1.000e+12 Length:423
1st Qu.:1.000e+12 Class :character
Median :1.001e+12 Mode :character
Mean :1.001e+12
3rd Qu.:1.001e+12
Max. :1.001e+12
Education.Institution.Maintained.by Education.Institution.Name Ward.Number
Length:423 Length:423 Min. :10.00
Class :character Class :character 1st Qu.:40.00
Mode :character Mode :character Median :58.00
Mean :54.91
3rd Qu.:73.00
Max. :86.00
Address Length..mtr. Area..mtr.
Length:423 Min. : 22.86 Min. : 32.7
Class :character 1st Qu.: 93.17 1st Qu.: 459.3
Mode :character Median : 179.65 Median : 1672.4
Mean : 313.77 Mean : 11018.1
3rd Qu.: 367.97 3rd Qu.: 6918.1
Max. :4039.80 Max. :530419.9
Education.Institution.ID
Min. :1.000e+12
1st Qu.:1.000e+12
Median :1.001e+12
Mean :1.001e+12
3rd Qu.:1.001e+12
Max. :1.001e+12
Importance of components:
PC1 PC2
Standard deviation 1.3765 0.32461
Proportion of Variance 0.9473 0.05269
Cumulative Proportion 0.9473 1.00000
Length Class Mode
cluster 423 -none- numeric
centers 6 -none- numeric
totss 1 -none- numeric
withinss 3 -none- numeric
tot.withinss 1 -none- numeric
betweenss 1 -none- numeric
size 3 -none- numeric
iter 1 -none- numeric
ifault 1 -none- numeric
Call:
lm(formula = Area..mtr. ~ Length..mtr. + Education.Institution.Category,
data = Education)
Coefficients:
(Intercept)
-12504.40
Length..mtr.
85.67
Education.Institution.CategoryOthers
2018.95
Education.Institution.CategorySchool
-3499.73
Education.Institution.CategoryUniversity
-14979.92
---
title: "Assignment_Flexdashboard"
output:
flexdashboard::flex_dashboard:
orientation: rows
vertical_layout: scroll
theme: paper
social: ["facebook","whatsapp"]
source_code: embed
---
```{r setup, include=TRUE}
library(flexdashboard)
library(tidyverse)
library(dplyr)
library(ggplot2)
library(RColorBrewer)
library(crosstalk)
library(DT)
Education <- read.csv("Educational_Institution_n1.csv")
```
# View at Dataset
```{r}
datatable(Education,extensions = 'Buttons',options = list(dom = 'Bfrtip',Buttons = c('print','pdf')))
```
## Dataset Description {.tabset}
### Structure of the Dataset
```{r}
str(Education)
```
### Summary of the Dataset
```{r}
summary(Education)
```
## Univariate Analysis
### Histogram for numerical columns
```{r}
Education %>%
select_if(is.numeric) %>%
gather() %>%
ggplot(aes(value)) +
geom_histogram(bins = 30) +
facet_wrap(~ key, scales = 'free_x')
```
## Bivariate Analysis {.tabset}
### Scatter Plot
```{r}
ggplot(Education, aes(x = Length..mtr., y = Area..mtr., color = Education.Institution.Category)) +
geom_point() +
labs(title = "Scatter Plot of Length vs Area", x = "Length (mtr)", y = "Area (mtr)")
```
### Box Plot
```{r}
ggplot(Education, aes(x = Education.Institution.Category, y = Area..mtr.)) +
geom_boxplot() +
labs(title = "Box Plot of Area by Education Institution Category", x = "Institution Category", y = "Area (m²)")
```
## Multivariate Analysis {.tabset}
### PCA
```{r}
pca <- prcomp(Education[, c("Length..mtr.", "Area..mtr.")], scale. = TRUE)
summary(pca)
```
### Cluster Analysis
```{r}
kmeans <- kmeans(Education[, c("Length..mtr.", "Area..mtr.")], centers = 3)
summary(kmeans)
```
### Linear Regression
```{r}
lm(Area..mtr. ~ Length..mtr. + Education.Institution.Category, data = Education)
```