(Intercept) Living_Area Bathrooms House_Age
5775.299 60.614 30089.928 -235.721
Introduction to Linear Transformations
2025-12-02
Today’s plan 📋
Review of Multiple Linear Regression (SLR) Concepts from Lecture 26
Interpreting Regression Model Output
Understanding the Hypotheses
Drawing conclusions
Answering estimation questions
Introduction to Transformation
Linear Regression Model Assumptions
How Transformations Help
Log (LN) Transformation of X or Y
Course Evaluations
I will step out for five minutes at the end of class.
HW 8 is due on Thursday, 12/4 (One day grace period)
HW 9 is posted and is due, Wednesday, 12/10.
In-person Final Exam is on 12/12/24 at 5:15 PM
On Thursday, 12/4, we will review the material covered after Quiz 2 using the posted practice questions.
On Tuesday, 12/9, there will be an in-class Q&A Review of all material from the whole semester.
In this course we will use R and RStudio to understand statistical concepts.
You will access R and RStudio through Posit Cloud.
I will post R/RStudio files on Posit Cloud that you can access in provided links.
I will also provide demo videos that show how to access files and complete exercises.
NOTE: The free Posit Cloud account is limited to 25 hours per month.
For those who want to go further with R/RStudio:
If you are interested in downloading R and RStudio to your own computer, I can guide you through the process.
The software is completely free but it does have to be updated a couple times each year.
Poll Everywhere - My User Name: penelopepoolereisenbies685
Below is the final model we arrived at in Lecture 26.
What is the estimated price of a house that is 3000 square feet, has 4 bathrooms, and is 30 years old?
Round your answer to a whole dollar amount.
Poll Everywhere - My User Name: penelopepoolereisenbies685
What is the CHANGE in price we can expect for a house that has aged 10 years and has two bathrooms and 1500 feet added during a renovation?
Round your answer to a whole dollar amount.
Model: \[ Est. Selling Price = 5775.299 + 60.614\times Living Area + 30089.928 \times Bathrooms - 235.721\times House Age \]
Interpretation:
If number of bathrooms and age of the house remain unchanged, each additional square foot is estimated to raise the selling price by about 61 dollars.
If living area and age of the house remain unchanged, each additional bathroom will raise the estimated selling price by about 30 THOUSAND dollars.
If living area and number of bathrooms remain unchanged, each additional year will LOWER the estimated selling price by about 236 dollars.
There are TWO primary assumptions of SLR:
There is a linear (straight line) relationship between the dependent variable (Y) and the independent variable (X).
At each value of X, the POPULATION of Y values are normally distributed
What do we do if data do not meet linear assumption?
Data can be transformed.
Many many many transformation options.
We will discuss just a couple of transformations
MAS 261 students are NOT expected to know what transformation is needed.
In this case, LN(X) works well
log command is LN (natural log)Correct Model:
Poll Everywhere - My User Name: penelopepoolereisenbies685
Poll Everywhere - My User Name: penelopepoolereisenbies685
If LN of years to maturity, LN(X), results in the best model, how do we interpret the intercept, \(b_0\)?
HINTS:
\(b_0\) is the value of Y, when our NEW X, LN of years to maturity equals 0.
LN(1) = 0, so when years to maturity = 1 (X = 1), then LN(X) = 0
A. \(b_0\) has no real world interpretation.
B. \(b_0\) is the yield (Y) when the bond is first issued (X = 0)
C. \(b_0\) is the yield when the bond matures in one year (X = 1)
D. \(b_0\) is the change in yield that happens in one year.
Suppose you are a manager of a motorcycle store
You want to predict the selling price of motorcycles based on ‘wheelbase’ (in inches).
For this purpose, you collect data from 86 motorcycle models.
Non-linear relationship between X and Y is apparent.
Linear regression between X and Y will not work on raw data.
Transformation of X and/or Y may linearize relationship.
For concave up non-linearity where Y > 0 for all values, we use LN(Y)
Model:
Poll Everywhere - My User Name: penelopepoolereisenbies685
The wheelbase (X) of a motorcycle is 50 inches.
The estimated regression equation is:
\[LN(\hat{Y}) = 3.8361 + 0.086\times X\]
What is the selling price (Y) of the motorcycle?
Round your answer to closest whole dollar.
NOTE: Use the exp command to back-transform estimate, \(LN(\hat{Y})\) to find the selling price in dollars, \(\hat{Y}\).
Two Essential Assumptions for Simple Linear Regression (SLR) and Multiple Linear Regression (MLR):
There is a straight line relationship between Y and each X in the model.
At each value of X, Y is normally distributed.
In MAS 261, we focus on Assumption 1 for SLR
Evaluating relationship visually
Linearizing relationship using LN(X), log(x), or LN(Y), log(y).
There are many transformation options, but we cover only these two which are most common for data with values greater than 0.
To submit an Engagement Question or Comment about material from Lecture 27: Submit it by midnight today (day of lecture).