Think of linear regression as drawing the best possible straight line through a cloud of data points. We’re trying to understand how two things relate to each other:
- Y (dependent variable): What we’re trying to predict or explain
- X (independent variable): What we think affects Y
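The whole point? Find the line that fits the data as well as possible. In practice, "fits best" usually means the line that minimizes the total squared vertical distance between the points and the line.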
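
To make that concrete, here is a minimal sketch in plain Python. It assumes "best fit" means ordinary least squares, the most common choice. The function name `fit_line` and the `hours`/`scores` numbers are made up for illustration and are not taken from any real dataset.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    # Slope: how X and Y vary together, divided by how much X varies on its own.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx

    # Intercept: chosen so the line passes through the point (mean_x, mean_y).
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: X = hours studied, Y = exam score.
hours = [1, 2, 3, 4, 5]
scores = [52, 55, 61, 64, 68]

slope, intercept = fit_line(hours, scores)

# Use the fitted line to predict Y for a new X (6 hours of study).
predicted = slope * 6 + intercept
print(f"score = {slope:.2f} * hours + {intercept:.2f}")
print(f"predicted score for 6 hours: {predicted:.1f}")
```

The slope says how much Y changes for each one-unit increase in X, and the intercept is the predicted Y when X is zero. Libraries such as NumPy or scikit-learn do the same job with less code; the hand-written version is only meant to show what they compute.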