This project analyzes a health insurance dataset to explore how various personal and lifestyle factors influence medical insurance charges.
The dataset includes variables such as age, sex, BMI, smoking status, number of children, region, and charges.
Our main goals are to:
- Understand trends through exploratory data analysis (EDA).
- Visualize key relationships using ggplot2 and Plotly.
- Apply statistical modeling (linear regression, t-tests, ANOVA) to identify the most impactful predictors.
This analysis highlights how behaviors and demographics—especially smoking—can significantly affect insurance costs.