Homework Assignment 2: Applied Regression Analysis

Objective: Perform regression analysis on two datasets (Income-Consumption and Olympic 100m) to calculate key statistics, test hypotheses, and compare models. Submit one Excel file with two sheets and a PDF report with interpretations.


Part 1: Income-Consumption Dataset (Manual Calculations)

Dataset: Income vs. Consumption.

Tasks (Excel Sheet 1)

  1. Calculate R-squared (R²)

    • Compute \(\bar{y}\) (mean consumption).
    • For each observation:
      • \((e_i)^2 = (y_i - \hat{y}_i)^2\)
      • \((y_i - \bar{y})^2\)
    • Sum to get \(SSR = \sum (e_i)^2\) and \(SST = \sum (y_i - \bar{y})^2\).
    • Calculate:
      \[ R^2 = 1 - \frac{SSR}{SST} \]
  2. Estimate Standard Error (s)
    \[ s = \sqrt{\frac{SSR}{n-2}} \]

  3. Standard Error of Slope (\(s_{\hat{\beta}_2}\))
    \[ s_{\hat{\beta}_2} = \sqrt{\frac{s^2}{\sum (x_i - \bar{x})^2}} \]

  4. Hypothesis Test for Slope

    • \(t = \frac{\hat{\beta}_2}{s_{\hat{\beta}_2}}\)
    • Reject \(H_0\) if \(|t| > 2\).
  5. 95% CI for Slope
    \[ \hat{\beta}_2 \pm 2 \cdot s_{\hat{\beta}_2} \]

  6. Prediction Interval
    For \(x_0 = \$10,000\):
    \[ \hat{y}_0 \pm 2 \cdot s \]

Interpretation (PDF Report)


Part 2: Olympic 100m Dataset (Linear vs. Exponential Models)

Dataset: Winning times (s) for men/women (1948–2004).

Tasks (Excel Sheet 2)

  1. Linear Model
    • Fit: \(\text{Time} = \beta_1 + \beta_2 \cdot \text{Year}\).
    • Predict equal-performance year by solving:
      \[ \beta_1^{men} + \beta_2^{men} \cdot \text{Year} = \beta_1^{women} + \beta_2^{women} \cdot \text{Year} \]
  2. Exponential Model
    • Transform: \(\ln(\text{Time}) = \ln(\beta_1) + \beta_2 \cdot \text{Year}\).
    • Use LINEST() to estimate \(\ln(\beta_1)\) and \(\beta_2\).
    • Convert back: \(\text{Time} = e^{\ln(\beta_1) + \beta_2 \cdot \text{Year}}\).
    • Predict equal-performance year.

Interpretation (PDF Report)
- Linear vs. Exponential**:
- Limitations:


Submission Guidelines

  1. Excel File:
    • Sheet 1: Income-Consumption manual calculations.
    • Sheet 2: Olympic 100m LINEST() outputs and predictions.
  2. PDF Report:
    • 1–2 pages.
    • Interpret R², slope test, CI, and model comparisons.

Key Notes for Students