The formula for multiple linear regression is:
\[
Y = \beta_0 + \beta_1X_1 + \beta_2X_2 + ... + \beta_nX_n + \epsilon
\]
- \(Y\): Dependent variable (what we want to predict)
- \(\beta_0\): Intercept (constant term in the linear equation)
- \(\beta_1, \beta_2, ..., \beta_n\): Coefficients of independent variables \(X_1, X_2, ..., X_n\)
- \(X_1, X_2, ..., X_n\): Independent variables (predictors)
- \(\epsilon\): Error term (variation in \(Y\) not explained by \(X\))
This formula allows us to model the relationship between a dependent variable \(Y\) and multiple independent variables \(X_1, X_2, \ldots, X_n\). The \(\beta\) coefficients are estimated from the data and represent the influence of each independent variable on the dependent variable.