A warning message regarding “Coefficients” was returned, that one set of coefficients (in this case, “Phys”) was not defined due to singularities. Closer examination of the data reveals that this might be a case of collinearity, where one value can be perfectly predicted by one or more variables already in the data set.
To check whether this is the case, we create a new column, the sum of all hours that were categorized in this survey.
## [1] "24"
This confirms that one of the columns is redundant for our model, since the total hours entered will always add up to 24.
This also prompts the question, whether the distribution of hours spent by students with a similar GPA follows a pattern. We classified the students by their GPA and found the distribution of hours spent.