[1] "School.DBN" "Date" "Enrolled" "Absent" "Present"
[6] "Released"
[1] "School.DBN" "Date" "Enrolled" "Absent" "Present"
[6] "Released"
The Date column is often stored as text, which makes it difficult to perform analysis. Converting it into a proper date format ensures accurate data processing. It allows operations like sorting, filtering, and time-based analysis. Standard date formats also help avoid errors due to inconsistent representations. Therefore, date conversion is an essential step in data preprocessing.
Extracting the week number from the Date column helps in grouping data on a weekly basis. It allows analysis of attendance trends across different weeks.
This is useful for identifying patterns like peak or low attendance periods.
Most tools provide functions to extract week numbers from dates.
This step improves the accuracy of weekly data analysis
School.DBN Date Enrolled Absent Present Released week
1 01M015 57223-06-11 172 19 153 0 24
2 01M015 57223-06-12 171 17 154 0 24
3 01M015 57223-06-13 172 14 158 0 24
4 01M015 57223-06-18 173 7 166 0 25
5 01M015 57223-06-19 173 9 164 0 25
6 01M015 57223-06-20 173 11 162 0 25
'data.frame': 277153 obs. of 7 variables:
$ School.DBN: chr "01M015" "01M015" "01M015" "01M015" ...
$ Date : Date, format: "57223-06-11" "57223-06-12" ...
$ Enrolled : int 172 171 172 173 173 173 173 174 174 174 ...
$ Absent : int 19 17 14 7 9 11 10 7 7 8 ...
$ Present : int 153 154 158 166 164 162 163 167 167 166 ...
$ Released : int 0 0 0 0 0 0 0 0 0 0 ...
$ week : num 24 24 24 25 25 25 25 25 26 26 ...
School.DBN Date Enrolled Absent
Length:277153 Min. :57223-06-10 Min. : 1 Min. : 0.0
Class :character 1st Qu.:57224-01-05 1st Qu.: 329 1st Qu.: 23.0
Mode :character Median :57248-09-11 Median : 476 Median : 38.0
Mean :57239-03-03 Mean : 597 Mean : 50.5
3rd Qu.:57249-06-19 3rd Qu.: 684 3rd Qu.: 59.0
Max. :57250-01-21 Max. :5955 Max. :2151.0
Present Released week
Min. : 1.0 Min. : 0.000 Min. : 1.00
1st Qu.: 291.0 1st Qu.: 0.000 1st Qu.:13.00
Median : 430.0 Median : 0.000 Median :27.00
Mean : 544.5 Mean : 1.983 Mean :27.22
3rd Qu.: 640.0 3rd Qu.: 0.000 3rd Qu.:40.00
Max. :5847.0 Max. :5904.000 Max. :53.00
The heatmap shows student attendance patterns across different weeks. Darker colors (red) indicate higher attendance, while lighter colors represent lower attendance. The visualization helps identify trends and fluctuations in attendance across schools.
The line graph shows how attendance changes over time. It helps in understanding overall attendance trends during the academic year.
A line graph is used to visualize attendance trends over time.
The x-axis represents time (days/weeks), and the y-axis shows attendance values.
Data points are plotted and connected with lines to show changes over time.
A bar chart is used to visualize the average attendance for each week.
First, attendance data is grouped week-wise and the average is calculated.
Each bar represents one week and its corresponding average attendance.
The analysis of student attendance data using visualizations such as heatmap, line graph, and bar chart provides clear insights into attendance trends. The heatmap highlights variations across weeks and schools, while the line graph shows overall trends over time.