Introduction

This practice analysis utilized data on lifestyle factors and student performance found on Kaggle.1

This dataset provides a detailed view of student lifestyle patterns and their correlation with academic performance, represented by GPA. It contains 2,000 records of students’ daily habits across study, extracurriculars, sleep, socializing, and physical activities. Each student’s stress level is derived based on study and sleep hours, offering insights into how lifestyle factors may impact academic outcomes.

This dataset, contains data from 2,000 students of university collected via a Google Form survey. It includes information on study hours, extracurricular activities, sleep, socializing, physical activity, stress levels, and CGPA. The data covers an academic year from August 2023 to May 2024 and reflects student of Lisboa. This dataset can help analyze the impact of daily habits on academic performance and student well-being.

The purpose of this analysis was to practice statistical methods through the investigation of potential associations between academic performance (grades) and lifestyle factors. This analysis was financially and resource constrained and was, thus, uninformed by subject matter expertise. Findings were statistical in nature only. Any discussion was purely speculative and should not be taken into consideration without consulting appropriate experts in the field.

Methods

Results

The dataset includes data on lifestyle factors (extracurricular activity, physical activity, sleep, social activity, study time and stress levels), gender and academic performance. Extracurricular activity, physical activity, sleep, social activity and study time were continuous and measured in hours per day. Lifestyle factors measured in hours per day will be referred to collectively as time-based lifestyle activities. Stress levels were indicated by low, medium and high. Gender was recorded as female and male. Grades ranged from 5.6 to 10. The investigator assumed that grades could be as low as zero and as high as 10.

There were 2000 students in the dataset. Of these, 984 (49%) and 1016 (51%) were female and male, respectively. Median grades of all female and male students were 7.78 (IQR: 7.25-8.32), 7.75 (IQR: 7.22-8.3) and 7.8 (IQR: 7.25-8.32), respectively. No significant differences (p = 0.54) were found between the grades of female and male students.

The amount of time spent by students in each time-based lifestyle activity is shown below. Students tended to spend the most time in sleep (median: 7.5; IQR: 6.2-8.8) and study (median: 7.4; IQR: 6.3-8.7), followed by physical (median: 4.1; IQR: 2.4-6.1), social (median: 2.6; IQR: 1.2-4.1) and extracurricular activities (median: 2; IQR: 1-3).

Over half of all students (51.45%; n = 1029) reported experiencing high stress, while 33.7% (n = 674), and 14.85% (n = 297) reported experiencing moderate and low stress, respectively.

Hours spent studying (p < 0.0001) and stress levels (p < 0.0001) were associated with higher grades. For every hour spent studying, grades increased 0.385 points. The median grade for low, moderate and high level stress groups were 7.05 (IQR: 6.7-7.38), 7.55 (IQR: 7.18-7.95) and 8.18 (IQR: 7.72-8.65), respectively. Significant differences in grades between low and moderate (p < 0.0001), moderate and high (p < 0.0001) and low and high (p < 0.0001) stress levels were detected. Time spent in social (p = 0.0001) and physical activities (p < 0.0001) were found to negatively affect grades. For every hour spent in social and physical activities, there was a 0.038 and 0.101 decrease in grades, respectively. Time spent in extracurricular activities (p = 0.1516) and sleep (p = 0.8491) had no effect on grades. The median grade of female and male students were 7.75 (IQR: 7.22-8.3) and 7.8 (IQR: 7.25-8.32), respectively. No associations existed between grades and gender (p = 0.542).

Median hours spent studying among low, moderate and high stress groups were 5.5 (IQR: 5.2-5.7), moderate 7 (IQR: 6.4-7.4) and 8.7 (IQR: 8-9.3). Median hours spent in extracurricular activities were 2 (IQR: 0.9-3), 2 (IQR: 1.1-3) and 2 (IQR: 1-3), respectively. Median hours spent sleeping were 8 (IQR: 7.1-9.1), 7.9 (IQR: 6.9-8.98) and 6.8 (IQR: 5.6-8.4), respectively. Median hours spent in social activity were 3 (IQR: 1.4-4.2), 2.6 (IQR: 1.3-4.2) and 2.4 (IQR: 1.2-4), respectively. Median hours spent in physical activity were 5.3 (IQR: 4-7.4), 4.25 (IQR: 2.6-6) and 3.6 (IQR: 1.8-5.7), respectively. Hours spent studying increased with stress levels (p < 0.0001). Significant differences between study hours were detected among all stress groups (p < 0.0001). Hours in sleep (p < 0.0001), social activity (p = 0.0489) and physical activity (p < 0.0001) decreased with higher stress levels. No differences in extracurricular hours were detected among stress groups (p = 0.898). Significant differences in sleep hours among all stress levels was detected (p < 0.0001). Significant differences between hours spent in social activities were detected between high and low stress groups (p = 0.0472), but not between moderate and low (p = 0.402) and high and moderate (p = 0.374) groups. Significant differences in physical activity hours were detected between moderate and low (p < 0.0001), high and low (p < 0.0001) and high and moderate (p = 0.00574) stress groups.

Discussion

Limitations

Assumptions and disclosures of pre-study biasing factors

The investigator lacked the necessary background on educational practices in Lisboa to understand grade calculation and notation. Specifically, grades in the dataset ranged from 5.6 to 10. The investigator, insular in view, was unfamiliar with this type of grade notation. In order to proceed with the analysis, for the sake of practice, the investigator assumed that grades were on a scale between zero and 10, with zero being the lowest and 10 the highest.

No other assumptions about the data were knowingly made by the investigator. However, the investigator acknowledges that there were likely assumptions that were unknowingly in effect.

Supplemental figures


  1. https://www.kaggle.com/datasets/charlottebennett1234/lifestyle-factors-and-their-impact-on-students/data↩︎