Siyu_531_LabHW#5 - Confidence Interval and Hypothesis Tests

1. Let’s say a random varible \(X\) follows a normal distribution of \(N(0, 1)\).

Draw 5 samples of \(X\) and use \(\bar{x}\) to infer the true population mean using a confidence interval estimate with confidence level of 95%. Do a simulation of 10,000 times and find the probability of the estimated interval actually contains \(\mu = 0\).

simulate_confidence_interval <- function(n_simulations = 10000, sample_size = 5, confidence_level = 0.95){
  set.seed(102)
  contains_mu <- replicate(n_simulations,{
    sample_x <- rnorm(sample_size, mean = 0, sd = 1)#generate samples from N(0,1)
    x_bar <- mean(sample_x)#count the mean of sample
    s <- sd(sample_x)#count standard error
    t_critical <- qt((1 + confidence_level)/ 2, df = sample_size - 1)#count critical value
    lower_bound <- x_bar - t_critical*(s / sqrt(sample_size))# set CI
    upper_bound <- x_bar + t_critical*(s / sqrt(sample_size))
    return(lower_bound <= 0 & upper_bound >= 0)#judge wether Ci included mu = 0
  })
  probability_contain_mu <- mean(contains_mu)#count probability
  return(probability_contain_mu)
}
ci_probability_5 <- simulate_confidence_interval(sample_size = 5)
print(ci_probability_5)

## [1] 0.9504

Do a hypothesis test for \(X\) with \(H_0: \mu = 0\) and \(H_a: \mu \neq 0\) with 10,000 simulations and \(\alpha = 0.05\). Find the probability of rejecting \(H_0\) by comparing the p-value and \(\alpha\).

simulate_hypothesis_test <- function(n_simulations= 10000, sample_size = 5, alpha = 0.05, alternative = "two.sided"){
  set.seed(102)
  reject_h0 <- replicate(n_simulations, {
  sample_x <- rnorm(sample_size, mean = 0, sd = 1)
  #Do a t-te
  test_result <- t.test(sample_x, mu = 0, alternative = alternative)
  return(test_result$p.value < alpha)
    
  })
probability_reject_h0 <- mean(reject_h0)
return(probability_reject_h0)

}
hypothesis_test_5 <- simulate_hypothesis_test(sample_size = 5, alternative = "two.sided")
print(hypothesis_test_5)

## [1] 0.0496

Redo (b) with \(H_a: \mu > 0\).

simulate_hypothesis_test <- function(n_simulations= 10000, sample_size = 5, alpha = 0.05, alternative = "greater"){
  set.seed(102)
  reject_h0 <- replicate(n_simulations, {
  sample_x <- rnorm(sample_size, mean = 0, sd = 1)
  #Do a t-te
  test_result <- t.test(sample_x, mu = 0, alternative = alternative)
  return(test_result$p.value < alpha)
    
  })
probability_reject_h0 <- mean(reject_h0)
return(probability_reject_h0)

}
hypothesis_test_5 <- simulate_hypothesis_test(sample_size = 5, alternative = "greater")
print(hypothesis_test_5)

## [1] 0.0502

Increase the number of samples to 100 and repeat (a)-(c).

ci_probability_100 <- simulate_confidence_interval(sample_size = 10000)
print(ci_probability_100)

## [1] 0.9521

hypothesis_test_100 <- simulate_hypothesis_test(sample_size = 10000, alternative = "two.sided")
print(hypothesis_test_100)

## [1] 0.0479

hypothesis_test_100_right <- simulate_hypothesis_test(sample_size = 1000, alternative = "greater")
print(hypothesis_test_100_right)

## [1] 0.0515

How does this experiment verify the theory that we learn? Summarize your findings properly.

Anwser: Generally, as the sample size increases, the error rate should stabilize around 0.05, CI should be more closer to 0.95.But my results don’t support this conclusion.

2. Repeat all steps in problem 1 with \(X\) follows a uniform distribution \(\text{Unif}(0,1)\). How does the distribution of \(X\) affect the results?

Draw 5 samples of \(X\) and use \(\bar{x}\) to infer the true population mean using a confidence interval estimate with confidence level of 95%. Do a simulation of 10,000 times and find the probability of the estimated interval actually contains \(\mu = 0\).

simulate_confidence_interval_uniform <- function(n_simulations = 10000, sample_size = 5, confidence_level = 0.95){
  set.seed(102)
  contains_mu <- replicate(n_simulations,{
    sample_x <- runif(sample_size, min = 0, max = 1)#generate samples from unif(1,0)
    x_bar <- mean(sample_x)#count the mean of sample
    s <- sd(sample_x)#count standard error
    t_critical <- qt((1 + confidence_level)/ 2, df = sample_size - 1)#count critical value
    lower_bound <- x_bar - t_critical*(s / sqrt(sample_size))# set CI
    upper_bound <- x_bar + t_critical*(s / sqrt(sample_size))
    return(lower_bound <= 0.5 & upper_bound >= 0.5)#judge wether Ci included mu = 0
  })
  probability_contain_mu <- mean(contains_mu)#count probability
  return(probability_contain_mu)
}
ci_probability_5_uniform <- simulate_confidence_interval_uniform(sample_size = 5)
print(ci_probability_5_uniform)

## [1] 0.9358

Do a hypothesis test for \(X\) with \(H_0: \mu = 0\) and \(H_a: \mu \neq 0\) with 10,000 simulations and \(\alpha = 0.05\). Find the probability of rejecting \(H_0\) by comparing the p-value and \(\alpha\).

simulate_hypothesis_test_uniform <- function(n_simulations= 10000, sample_size = 5, alpha = 0.05, alternative = "two.sided"){
  set.seed(102)
  reject_h0 <- replicate(n_simulations, {
  sample_x <- runif(sample_size, min = 0, max = 1)
  #Do a t-te
  test_result <- t.test(sample_x, mu = 0.5, alternative = alternative)
  return(test_result$p.value < alpha)
    
  })
probability_reject_h0_uniform <- mean(reject_h0)
return(probability_reject_h0_uniform)

}
hypothesis_test_5_uniform <- simulate_hypothesis_test_uniform(sample_size = 5, alternative = "two.sided")
print(hypothesis_test_5_uniform)

## [1] 0.0642

Redo (b) with \(H_a: \mu > 0\).

simulate_hypothesis_test_uniform <- function(n_simulations= 10000, sample_size = 5, alpha = 0.05, alternative = "greater"){
  set.seed(102)
  reject_h0 <- replicate(n_simulations, {
  sample_x <- runif(sample_size, min = 0, max = 1)
  #Do a t-te
  test_result <- t.test(sample_x, mu = 0.5, alternative = alternative)
  return(test_result$p.value < alpha)
    
  })
probability_reject_h0_uniform <- mean(reject_h0)
return(probability_reject_h0_uniform)

}
hypothesis_test_5_uniform <- simulate_hypothesis_test_uniform(sample_size = 5, alternative = "greater")
print(hypothesis_test_5_uniform)

## [1] 0.0573

`` (d) Increase the number of samples to 100 and repeat (a)-(c).

ci_probability_100_uniform <- simulate_confidence_interval_uniform(sample_size = 10000)
print(ci_probability_100_uniform)

## [1] 0.9486

hypothesis_test_100_uniform <- simulate_hypothesis_test_uniform(sample_size = 10000, alternative = "two.sided")
print(hypothesis_test_100_uniform)

## [1] 0.0514

hypothesis_test_100_right_uniform <- simulate_hypothesis_test_uniform(sample_size = 1000, alternative = "greater")
print(hypothesis_test_100_right_uniform)

## [1] 0.0481

Anwser: From the code results, we can see that even though X has different distribution, the mean still follews a normal distribution

3. Now let’s simulate with \(X\) following a normal distribution of \(N(1,4)\) (variance of 4 and standard devivation of 2).

With a sample size of 10, do a hypothesis test for \(X\) with \(H_0: \mu = 0\) and \(H_a: \mu > 0\) with 10,000 simulations and \(\alpha = 0.05\). Find the \(\beta\) from your simulation. What is the estimated power of the test?

simulate_power_test <- function(n_simulations = 10000, sample_size = 10, alpha = 0.05){
  set.seed(102)
  fail_reject_h0 <- replicate(n_simulations,{
    sample_x <- rnorm(sample_size, mean = 1, sd = 2)
    test_result <- t.test(sample_x, mu = 0, alternative = "greater")
    #count times of not reject h0 
    return(test_result$p.value >= alpha)
  })
  beta <- mean(fail_reject_h0)
  power <- 1 - beta
  return(list(beta = beta,power = power))
}
power_test_10_alpha_5 <- simulate_power_test(sample_size = 10, alpha = 0.05)
print(power_test_10_alpha_5)

## $beta
## [1] 0.5656
## 
## $power
## [1] 0.4344

Change \(\alpha\) to 0.01 in (a) and find \(\beta\) and the power of test again.

power_test_10_alpha_01 <- simulate_power_test(sample_size = 10, alpha = 0.01)
print(power_test_10_alpha_01)

## $beta
## [1] 0.8349
## 
## $power
## [1] 0.1651

Change the sample size to 1000 and redo (a) and (b) again.

power_test_1000_alpha_05 <- simulate_power_test(sample_size = 1000, alpha = 0.05)
print(power_test_1000_alpha_05)

## $beta
## [1] 0
## 
## $power
## [1] 1

power_test_1000_alpha_01 <- simulate_power_test(sample_size = 1000, alpha = 0.01)
print(power_test_1000_alpha_01)

## $beta
## [1] 0
## 
## $power
## [1] 1

How does this experiment verify the theory that we learn? Summarize your findings properly.

Anwser: Yes, from the results, we can see that as the sample size increases, beta to be 0, means there is no type 2 error, since sd decreases with a larger sample.

Siyu_531_LabHW#5 - Confidence Interval and Hypothesis Tests

siyu

2025-02-06

1. Let’s say a random varible \(X\) follows a normal distribution of \(N(0, 1)\).

2. Repeat all steps in problem 1 with \(X\) follows a uniform distribution \(\text{Unif}(0,1)\). How does the distribution of \(X\) affect the results?

3. Now let’s simulate with \(X\) following a normal distribution of \(N(1,4)\) (variance of 4 and standard devivation of 2).