Alexis Mekueko
11/4/2020
Github link: https://github.com/asmozo24/DATA606_Homework_Presentation
Web linl: https://rpubs.com/amekueko/684788
Linear regression is one of the statistical techniques used to determine whether or not an observation has a linear relationship with the observed response. This technique consists of : fitting a line, residuals, and correlation and Least squares regression. Let’s use linear regression to the following problem.
Answer:
As the number of tourists was increasing, their spending in Turkish was going up too. So, there is likely a linear relationship between the number of tourists and their spending by year. y = ax + b
Answer:
The response variables = tourist spending by year, explanatory = Number of tourists by year
Answer:
This linear relationship between the number of tourists and their spending carries some error and we might want to fit a regression line to measure the corelation between the two variables. In order words, we want to predict the values of the dependent/response variable (number of tourists) using the values from explanatory/independant variable. Can the number of tourists being a predictor for spending?
Answer: yes.
Based on the number of tourists/spending scatterplot , the data shows no there outliers and we observe a linear (positive/uphill) trend line and the residual plot shows a no particular pattern.
Based on the histogram plot, the data shows a normal distribution unimodal bell-shaped.
Based on the residuals plot, the variability of the number of tourists appears to be around the line residuals = 0.
Based on the problem statement, the sample is likely independent since the data collected from the number of tourists does not influence the spending. The problem did not let’s think otherwise.