- Logistic regression (LR) models a function of the probability of \(K\) classes with a linear combination of features \(x=(x_1,\dots,x_p)^T\). Unlike linear regression, LR ensures that probabilities are bounded between \([0,1]\).
- A multinomial logistic regression model has the form, \(\beta_i^T=(\beta_{i1},\dots,\beta_{ip})\):
\[ \begin{align*} \log\frac{P(G=1|X=x)}{P(G=K|X=x)} &= \beta_{10} + \beta_1^T x \\ \vdots \\ \log\frac{P(G=K-1|X=x)}{P(G=K|X=x)} &= \beta_{(K-1)0} + \beta_{K-1}^T x \\ \end{align*} \]