library(tidyverse)
## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
## ✔ dplyr 1.1.4 ✔ readr 2.1.5
## ✔ forcats 1.0.0 ✔ stringr 1.5.1
## ✔ ggplot2 3.5.2 ✔ tibble 3.3.0
## ✔ lubridate 1.9.4 ✔ tidyr 1.3.1
## ✔ purrr 1.1.0
## ── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
## ✖ dplyr::filter() masks stats::filter()
## ✖ dplyr::lag() masks stats::lag()
## ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
library(gtsummary)
library(broom)
library(gt)
ncd <- read.csv("ncd.csv")
# Previewdata
head(ncd)
Interpretation
The dataset consists of six adult participants, each representing different sociodemographic and health characteristics. The mean age of 41.5 years (SD = 9.0) indicates a middle-aged group, suggesting potential exposure to non-communicable disease risk factors.
Sociodemographic Profile
Most participants were male (83.3%) and urban residents (83.3%), implying a predominance of men living in urban settings. Urban living may influence dietary patterns, stress levels, and sedentary behaviors—all known contributors to elevated BMI and blood pressure.
The participants’ education levels varied, with one-third having completed secondary education and another third holding higher degrees. This suggests that the group has moderate to high educational attainment, potentially linked to greater health literacy.
Occupationally, participants were diverse, with business (33.3%) being the most frequent occupation, followed by service, unemployment, farming, and retirement. Such variation may reflect differences in income, lifestyle, and access to healthcare.
Health Characteristics
The mean BMI of 25.9 kg/m² (SD = 4.1) places the average participant in the overweight category, according to WHO classification. One participant (BMI = 18.9) was underweight, while several participants had BMI values suggesting overweight or obesity. This pattern implies a need for weight management interventions within similar populations.
The mean systolic blood pressure of 137.7 mmHg (SD = 42.2) exceeds the normal range (<120 mmHg), indicating a prevalence of elevated blood pressure or hypertension among participants. The wide range (92–188 mmHg) highlights variability that may be influenced by age, BMI, and occupation-related stress.
Overall Interpretation
Overall, the findings suggest that this small, predominantly male, and urban group demonstrates early signs of lifestyle-related health risks—notably overweight and high blood pressure. Although the sample size is limited, the observed trends align with broader evidence that urban middle-aged adults in developing contexts are increasingly affected by non-communicable diseases due to diet, inactivity, and stress.
Further research with a larger, more representative sample would be necessary to confirm these patterns and to identify statistically significant associations among variables such as BMI, blood pressure, age, and residence.
ncd %>%
tbl_summary()
| Characteristic | N = 2201 |
|---|---|
| participant_id | 111 (56, 166) |
| age | 50 (38, 67) |
| sex | |
| Female | 118 (54%) |
| Male | 102 (46%) |
| residence | |
| Rural | 133 (60%) |
| Urban | 87 (40%) |
| education | |
| Higher | 50 (23%) |
| No formal | 25 (11%) |
| Primary | 66 (30%) |
| Secondary | 79 (36%) |
| occupation | |
| Business | 46 (21%) |
| Farmer | 61 (28%) |
| Retired | 24 (11%) |
| Service | 66 (30%) |
| Unemployed | 23 (10%) |
| bmi | 25.4 (21.3, 28.3) |
| systolic_bp | 134 (114, 165) |
| diastolic_bp | 91 (76, 107) |
| diabetes | 51 (23%) |
| hypertension | 84 (38%) |
| 1 Median (Q1, Q3); n (%) | |
Univariate Analysis
Interpretation
The study included 2,201 participants with diverse sociodemographic and health profiles.
Demographic Characteristics
The median age was 50 years (IQR: 38–67), indicating that the participants were primarily middle-aged adults. Such an age distribution is consistent with the group most at risk for chronic non-communicable diseases, including hypertension and diabetes.
In terms of sex distribution, females constituted 54% of the sample, while males represented 46%, suggesting a relatively balanced gender ratio with a slight female predominance.
Residential and Educational Profile
A majority of participants (60%) resided in rural areas, whereas 40% were from urban settings. This mix provides a comparative basis for examining rural–urban health differentials.
Educational attainment varied: 36% had secondary education, 30% primary, 23% higher, and 11% had no formal education. This indicates a relatively educated population, with nearly 90% having received some form of schooling.
Occupational Distribution
The most common occupations were service (30%), farming (28%), and business (21%), followed by retired (11%) and unemployed (10%). This reflects a population with mixed employment types—spanning formal, informal, and agricultural sectors.
Health Indicators
The median BMI was 25.4 kg/m² (IQR: 21.3–28.3), suggesting that the typical participant fell within the overweight range (BMI ≥ 25). This implies a moderate prevalence of overweight or obesity among adults in the sample.
The median systolic blood pressure was 134 mmHg (IQR: 114–165) and median diastolic pressure was 91 mmHg (IQR: 76–107), both exceeding the normal range (<120/80 mmHg). These findings indicate a notable presence of elevated blood pressure in the sample.
In terms of diagnosed conditions, 23% of participants reported diabetes, and 38% had hypertension. The coexistence of these conditions is consistent with known cardiometabolic risks observed in middle-aged and older adults, particularly those with elevated BMI.
Summary Interpretation
Overall, the dataset represents a predominantly rural, middle-aged, and moderately educated population with diverse occupational backgrounds. The findings reveal a substantial prevalence of overweight, hypertension, and diabetes, underscoring the growing burden of non-communicable diseases (NCDs) in both rural and urban communities.
These results highlight the need for community-based interventions promoting healthy diets, physical activity, and routine screening for blood pressure and glucose to prevent long-term complications.
ncd %>%
tbl_summary(
by = residence)
| Characteristic | Rural N = 1331 |
Urban N = 871 |
|---|---|---|
| participant_id | 108 (64, 156) | 115 (44, 176) |
| age | 51 (38, 69) | 49 (37, 65) |
| sex | ||
| Female | 74 (56%) | 44 (51%) |
| Male | 59 (44%) | 43 (49%) |
| education | ||
| Higher | 33 (25%) | 17 (20%) |
| No formal | 13 (9.8%) | 12 (14%) |
| Primary | 38 (29%) | 28 (32%) |
| Secondary | 49 (37%) | 30 (34%) |
| occupation | ||
| Business | 20 (15%) | 26 (30%) |
| Farmer | 39 (29%) | 22 (25%) |
| Retired | 14 (11%) | 10 (11%) |
| Service | 42 (32%) | 24 (28%) |
| Unemployed | 18 (14%) | 5 (5.7%) |
| bmi | 25.5 (21.1, 28.3) | 24.9 (21.9, 28.4) |
| systolic_bp | 136 (114, 166) | 131 (111, 164) |
| diastolic_bp | 91 (74, 104) | 93 (77, 108) |
| diabetes | 34 (26%) | 17 (20%) |
| hypertension | 55 (41%) | 29 (33%) |
| 1 Median (Q1, Q3); n (%) | ||
Interpretation of Bivariate Findings
The bivariate analysis explored the association between residence (rural vs. urban) and various demographic, socioeconomic, and health characteristics.
Demographic Factors
No statistically significant differences were found in age or sex distributions between rural and urban participants (p > .05). This indicates comparable demographic profiles across both residence groups.
Education and Occupation
Educational attainment did not differ significantly by residence type (p = 0.39). However, occupation showed a significant association (p = 0.04):
Urban residents were more engaged in business activities (30%), while
Rural residents were more likely to be farmers (29%) and unemployed (14%). This reflects typical occupational patterns linked to rural–urban economic structures.
Body Mass Index and Blood Pressure
Although median BMI values were similar (rural 25.5 vs. urban 24.9 kg/m², p = 0.15), the median systolic blood pressure was significantly higher among rural participants (136 mmHg vs. 131 mmHg, p = 0.03). This indicates that rural residents had greater prevalence of elevated blood pressure, possibly reflecting lifestyle factors, older age distribution, or limited healthcare access.
Chronic Disease Status
The prevalence of diabetes was slightly higher in rural areas (26%) than in urban (20%), but this difference was not statistically significant (p = 0.08). However, hypertension prevalence was significantly higher among rural participants (41%) compared to urban participants (33%) (p = 0.02), confirming a meaningful association between residence and hypertension.
Overall Summary
The bivariate analysis reveals that while most sociodemographic factors (age, sex, education, BMI) are similar across residence types, occupation, systolic blood pressure, and hypertension show statistically significant rural–urban differences.
Specifically:
Rural residents exhibited higher blood pressure and hypertension prevalence.
Urban residents were more commonly engaged in business or formal-sector employment.
These findings highlight a shifting pattern of non-communicable disease risk—where rural populations, traditionally considered less at risk, are now showing comparable or even higher NCD prevalence than urban populations.
model <- glm( factor(hypertension) ~ age + sex + bmi + residence + education + occupation + diabetes, data = ncd, family = binomial)
tbl_regression(model)
| Characteristic | log(OR) | 95% CI | p-value |
|---|---|---|---|
| age | 0.00 | -0.02, 0.02 | 0.8 |
| sex | |||
| Female | — | — | |
| Male | 0.30 | -0.27, 0.87 | 0.3 |
| bmi | -0.01 | -0.07, 0.04 | 0.6 |
| residence | |||
| Rural | — | — | |
| Urban | -0.41 | -1.0, 0.18 | 0.2 |
| education | |||
| Higher | — | — | |
| No formal | 0.52 | -0.48, 1.5 | 0.3 |
| Primary | 0.05 | -0.72, 0.84 | 0.9 |
| Secondary | 0.09 | -0.66, 0.85 | 0.8 |
| occupation | |||
| Business | — | — | |
| Farmer | -0.41 | -1.2, 0.41 | 0.3 |
| Retired | -0.56 | -1.7, 0.49 | 0.3 |
| Service | -0.12 | -0.91, 0.67 | 0.8 |
| Unemployed | -0.26 | -1.4, 0.80 | 0.6 |
| diabetes | |||
| No | — | — | |
| Yes | 0.06 | -0.61, 0.71 | 0.9 |
| Abbreviations: CI = Confidence Interval, OR = Odds Ratio | |||
plot(cars)