This website has the results from the http://Lynda.com course on Healthcare Analytics Regression in R with instructor Monica Wahi. The data comes from BRFSS.
The dependent variable is sleep time. The indepdent variable is frequency of alcohol use. Covariates include gender and age.
Eventhough the variable for Sleep Time does meet the assumptions for linear regression, it is transformed to illustrate the option to log transform. Other options for continous variables that are not normally distributed is to use categories, indexes, quartiles, or ranking. The other options are not illustrated here.
Three models were tested. The Final model was selected with a forward stepwise approach and review of the literature. The final model included 21 variables.
| Model | # covar | # Significant covar | Adjusted R square |
|---|---|---|---|
| 1 | 2 | 1 | 0.0002 |
| 2 | 8 | 4 | 0.0521 |
| Final | 18 | 14 | 0.3060 |
The final regression model had 18 covariates. Fourteen of the covariates had positive associations. In other words the higher the value for the covariate the higher the estimate. After controllig for confounding variables compared to those who drink alcohol, those drinking alcohol weekly had on average 0.02 fewer hours of sleep per night. However the relationship was not statistically significant.