The dataset clouds.csv contains information from a cloud seeding experiment conducted during the summer of 1975. The purpose of the experiment was to investigate whether cloud seeding using silver iodide could increase rainfall. The dataset includes information about seeding status, cloud characteristics, and measured rainfall.
The goal of this analysis is to:
Compare rainfall between seeded and non-seeded clouds.
Build a multiple linear regression model to evaluate factors affecting rainfall.
Examine the relationship between sne and rainfall separately for seeded and non-seeded clouds.
X seeding time sne cloudcover
Min. : 1.00 no :12 Min. : 0.00 Min. :1.300 Min. : 2.200
1st Qu.: 6.75 yes:12 1st Qu.:15.75 1st Qu.:2.612 1st Qu.: 3.750
Median :12.50 Median :32.50 Median :3.250 Median : 5.250
Mean :12.50 Mean :35.33 Mean :3.169 Mean : 7.246
3rd Qu.:18.25 3rd Qu.:55.25 3rd Qu.:3.962 3rd Qu.: 7.175
Max. :24.00 Max. :83.00 Max. :4.650 Max. :37.900
prewetness echomotion rainfall
Min. :0.0180 moving :19 Min. : 0.280
1st Qu.:0.1405 stationary: 5 1st Qu.: 2.342
Median :0.2220 Median : 4.335
Mean :0.3271 Mean : 4.403
3rd Qu.:0.3297 3rd Qu.: 5.575
Max. :1.2670 Max. :12.850
The average rainfall for seeded clouds was approximately 4.63, while the average rainfall for non-seeded clouds was approximately 4.17. The standard deviation was slightly lower in the seeded group.
The boxplot shows substantial overlap between the two groups, suggesting that any difference in rainfall may be small.
t-test
t.test(rainfall ~ seeding, data = clouds)
Welch Two Sample t-test
data: rainfall by seeding
t = -0.3574, df = 20.871, p-value = 0.7244
alternative hypothesis: true difference in means between group no and group yes is not equal to 0
95 percent confidence interval:
-3.154691 2.229691
sample estimates:
mean in group no mean in group yes
4.171667 4.634167
Interpretation
The t-test produced a p-value of approximately 0.724.
Because the p-value is much larger than 0.05, there is no statistically significant evidence that cloud seeding increased rainfall in this dataset.
hist(residuals(model1),xlab ="Residuals",main ="Distribution of Residuals")
Interpretation
The ANOVA results indicate that echomotion and sne have the strongest relationship with rainfall, as they explain rhe largest proportion of variation in rainfall. However, neither variable is statistically significant at the 0.05 level, although both are close to significance (p < 0.10).
Relationship Between sne and Rainfall
Because sne appears to be related to rainfall, separate linear models were fit for seeded and non-seeded clouds.
Both groups show a negative relationship between sne and rainfall.
However, the seeded group has a steeper negative slope than the non-seeded group. This suggests that rainfall decreases more rapidly as sne increases in seeded clouds, indicating that the relationship between sne and rainfall differs depending on whether seeding occurred.
Conclusion
This analysis examined the effect of cloud seeding on rainfall.
The comparison of seeded and non-seeded clouds showed that the seeded group had slightly higher average rainfall. However, the t-test indicated that this difference was not statistically significant.
The multiple linear regression model suggested that sne and echomotion are the variables most strongly associated with rainfall.
Finally, separate models of rainfall versus sne showed that the seeded group had a steeper negative slope than the non-seeded group, suggesting that the effect of sne on rainfall may depend on seeding status.