tibble [9 × 13] (S3: tbl_df/tbl/data.frame)
$ District_Name : chr [1:9] "Bagalkote" "Bagalkote" "Bellary" "Chikkaballapur" ...
$ ULB Name : chr [1:9] "Bagalkote" "Rabkavi Banahatti" "Ballari" "Chikkaballapur" ...
$ No of Tap Connections : num [1:9] 1154 1100 2388 647 1581 ...
$ Tap Connections - No of applications received: num [1:9] 1158 1106 2389 674 1647 ...
$ Tap Connections - No of applications approved: num [1:9] 1154 1100 2388 647 1581 ...
$ Tap Connections - No of applications pending : num [1:9] 4 6 1 27 66 12 46 10 2
$ Tap Connections - No of applications rejected: num [1:9] 4 41 446 9 37 29 215 41 50
$ No of UGD connections : num [1:9] 563 368 376 104 88 ...
$ UGD connections - No of applications received: num [1:9] 566 370 377 109 110 ...
$ UGD connections - No of applications approved: num [1:9] 563 368 376 104 88 ...
$ UGD connections - No of applications pending : num [1:9] 3 2 1 5 22 1 11 15 1
$ UGD connections - No of applications rejected: num [1:9] 7 11 54 3 20 2 1 12 41
$ last_update : chr [1:9] "28-07-2022" "28-07-2022" "28-07-2022" "28-07-2022" ...
- attr(*, "na.action")= 'omit' Named int [1:252] 2 3 5 6 7 8 9 10 11 12 ...
..- attr(*, "names")= chr [1:252] "2" "3" "5" "6" ...
District_Name ULB Name No of Tap Connections
Length:9 Length:9 Min. : 369
Class :character Class :character 1st Qu.: 647
Mode :character Mode :character Median :1154
Mean :1349
3rd Qu.:2067
Max. :2388
Tap Connections - No of applications received
Min. : 381
1st Qu.: 674
Median :1158
Mean :1369
3rd Qu.:2113
Max. :2389
Tap Connections - No of applications approved
Min. : 369
1st Qu.: 647
Median :1154
Mean :1349
3rd Qu.:2067
Max. :2388
Tap Connections - No of applications pending
Min. : 1.00
1st Qu.: 4.00
Median :10.00
Mean :19.33
3rd Qu.:27.00
Max. :66.00
Tap Connections - No of applications rejected No of UGD connections
Min. : 4.00 Min. : 1.0
1st Qu.: 29.00 1st Qu.: 88.0
Median : 41.00 Median : 368.0
Mean : 96.89 Mean : 503.2
3rd Qu.: 50.00 3rd Qu.: 416.0
Max. :446.00 Max. :2572.0
UGD connections - No of applications received
Min. : 12
1st Qu.: 109
Median : 370
Mean : 510
3rd Qu.: 417
Max. :2587
UGD connections - No of applications approved
Min. : 1.0
1st Qu.: 88.0
Median : 368.0
Mean : 503.2
3rd Qu.: 416.0
Max. :2572.0
UGD connections - No of applications pending
Min. : 1.000
1st Qu.: 1.000
Median : 3.000
Mean : 6.778
3rd Qu.:11.000
Max. :22.000
UGD connections - No of applications rejected last_update
Min. : 1.00 Length:9
1st Qu.: 3.00 Class :character
Median :11.00 Mode :character
Mean :16.78
3rd Qu.:20.00
Max. :54.00
# A tibble: 6 × 13
District_Name `ULB Name` No of Tap Connection…¹ Tap Connections - No…²
<chr> <chr> <dbl> <dbl>
1 Bagalkote Bagalkote 1154 1158
2 Bagalkote Rabkavi Banah… 1100 1106
3 Bellary Ballari 2388 2389
4 Chikkaballapur Chikkaballapur 647 674
5 Dhakshina Kannada Mangaluru 1581 1647
6 Mysore K.R.Nagar 369 381
# ℹ abbreviated names: ¹`No of Tap Connections`,
# ²`Tap Connections - No of applications received`
# ℹ 9 more variables: `Tap Connections - No of applications approved` <dbl>,
# `Tap Connections - No of applications pending` <dbl>,
# `Tap Connections - No of applications rejected` <dbl>,
# `No of UGD connections` <dbl>,
# `UGD connections - No of applications received` <dbl>, …
---
title: "EDA_23CSEA50"
output:
flexdashboard::flex_dashboard:
orientation: rows
vertical_layout: scroll
theme: flatly
social: menu
source_code: embed
navbar:
- { title: "Dataset Description", href: "#dataset-description" }
- { title: "Univariate Analysis", href: "#univariate-analysis" }
- { title: "Bivariate Analysis", href: "#bivariate-analysis" }
- { title: "Multivariate Analysis", href: "#multivariate-analysis" }
---
```{r setup, include=FALSE}
library(flexdashboard)
library(ggplot2)
library(readr)
library(dplyr)
library(DT)
# Load the dataset
data <- read_csv("JalanidhiDistrict.csv")
# Handling missing values by removing NA
data <- na.omit(data)
```
## Dataset Description {.tabset .active} {#dataset-description}
### View of dataset
```{r}
datatable(data, extensions = 'Buttons', options = list(dom='Bfrtip', buttons=c('copy','csv','print','pdf')))
```
### About the Dataset
```{r}
str(data)
```
### Summary of the Dataset
```{r}
summary(data)
```
### Head of the Dataset
```{r}
head(data)
```
## Univariate Analysis {.tabset} {#univariate-analysis}
### Histogram of Tap connections - No of Applications Received
```{r}
# Histogram of 'Tap Connections - No of applications received'
ggplot(data, aes(x = `Tap Connections - No of applications received`)) +
geom_histogram(binwidth = 50, fill = "green", color = "black") +
labs(title = "Histogram of Applications Received for Tap Connections",
x = "No. of Applications Received",
y = "Frequency") +
theme_minimal()
```
### Histogram of 'Tap Connections - No of applications approved'
```{r}
# Histogram of 'Tap Connections - No of applications approved'
ggplot(data, aes(x = `Tap Connections - No of applications approved`)) +
geom_histogram(binwidth = 50, fill = "orange", color = "black") +
labs(title = "Histogram of Applications Approved for Tap Connections",
x = "No. of Applications Approved",
y = "Frequency") +
theme_minimal()
```
## Bivariate Analysis {.tabset} {#bivariate-analysis}
### Box Plot for Tap Connections - No of applications approved
```{r}
# Boxplot of 'Tap Connections - No of applications approved'
ggplot(data, aes(y = `Tap Connections - No of applications approved`)) +
geom_boxplot(fill = "purple", color = "black") +
labs(title = "Boxplot of Applications Approved for Tap Connections",
y = "No. of Applications Approved") +
theme_minimal()
```
### Boxplot of Tap Connections - No of applications pending
```{r}
# Boxplot of 'Tap Connections - No of applications pending'
ggplot(data, aes(y = `Tap Connections - No of applications pending`)) +
geom_boxplot(fill = "red", color = "black") +
labs(title = "Boxplot of Applications Pending for Tap Connections",
y = "No. of Applications Pending") +
theme_minimal()
```
## Multivariate Analysis {.tabset} {#multivariate-analysis}
### Scatter Plot for Tap Connections - No of applications approved vs No of Tap Connections
```{r}
# Scatterplot of 'Tap Connections - No of applications approved' vs 'No of Tap Connections'
ggplot(data, aes(x = `Tap Connections - No of applications approved`,
y = `No of Tap Connections`)) +
geom_point(color = "purple") +
labs(title = "Scatterplot of Applications Approved vs No. of Tap Connections",
x = "No. of Applications Approved",
y = "No. of Tap Connections") +
theme_minimal()
```
### Scatterplot of Tap Connections - No of applications pending vs No of Tap Connections
```{r}
# Scatterplot of 'Tap Connections - No of applications pending' vs 'No of Tap Connections'
ggplot(data, aes(x = `Tap Connections - No of applications pending`,
y = `No of Tap Connections`)) +
geom_point(color = "red") +
labs(title = "Scatterplot of Applications Pending vs No. of Tap Connections",
x = "No. of Applications Pending",
y = "No. of Tap Connections") +
theme_minimal()
```