C. Donovan
12 Feb 2018
What about categorical \( y \)?
\[ \begin{array}{|l|c c|}\hline & y=0 (-ve) & y=1 (+ve) \\\hline \hat{y}=0 (-ve)& a & b\\ \hat{y}=1 (+ve) & c & d\\\hline \end{array} \]
_Ref: Section 3.3 THF (2nd Ed.). Section 3.4 for more advanced approaches (not examinable, interested parties only). McLeod & Xu (bestglm) give an overview and R examples
Columns can be:
In any event, the model is linear in the parameters
The aim (for this example) is to find parameter values that minimise RSS (or some loss function)
Basically statistics' - no notable inference here/yet.
We're going to select simpler models. Various reasons:
Often simply referred to as parsimony - which is deemed a good thing
Occam's razor: after William of Ockham “Entities must not be multiplied beyond necessity” (Non sunt multiplicanda entia sine necessitate) - although that's John Punch's (1639).
Use a brute-force algorithmic approach to search through all possible models - obviously a difficult combinatoric problem. COnsider:
The same rationale can be applied but from a very basic starting model, increasing in complexity.