Introduction
This notebook analyzes the built-in UK Lung Deaths dataset. The
dataset consists of monthly deaths from bronchitis, emphysema, and
asthma in the UK, categorized by sex (male and female).
Loading the Data
# Load the data
data("ldeaths")
Warning: data set ‘ldeaths’ not found
data("mdeaths")
Warning: data set ‘mdeaths’ not found
data("fdeaths")
Warning: data set ‘fdeaths’ not found
# Create a data frame
ldeaths_df <- data.frame(
date = time(ldeaths),
total_deaths = as.numeric(ldeaths),
male_deaths = as.numeric(mdeaths),
female_deaths = as.numeric(fdeaths)
)
# Display the first few rows of the dataset
head(ldeaths_df)
Basic Exploration
# Summary statistics
summary(ldeaths_df)
date total_deaths male_deaths female_deaths
Min. :1974 Min. :1300 Min. : 940 Min. : 330.0
1st Qu.:1975 1st Qu.:1552 1st Qu.:1138 1st Qu.: 411.0
Median :1977 Median :1870 Median :1344 Median : 512.0
Mean :1977 Mean :2057 Mean :1496 Mean : 560.7
3rd Qu.:1978 3rd Qu.:2552 3rd Qu.:1846 3rd Qu.: 681.5
Max. :1980 Max. :3891 Max. :2750 Max. :1141.0
# Check for missing values
sum(is.na(ldeaths_df))
[1] 0
Visualization
Total Deaths Over Time
ggplot(ldeaths_df, aes(x = date, y = total_deaths)) +
geom_line(color = "blue") +
labs(title = "Total Lung Deaths Over Time", x = "Year", y = "Total Deaths")

Male and Female Deaths Over Time
ggplot(ldeaths_df) +
geom_line(aes(x = date, y = male_deaths, color = "Male Deaths")) +
geom_line(aes(x = date, y = female_deaths, color = "Female Deaths")) +
labs(title = "Male and Female Lung Deaths Over Time", x = "Year", y = "Deaths") +
scale_color_manual("", breaks = c("Male Deaths", "Female Deaths"), values = c("blue", "red"))

Seasonal Decomposition
# Decompose the time series
ldeaths_decomp <- decompose(ts(ldeaths, frequency = 12))
mdeaths_decomp <- decompose(ts(mdeaths, frequency = 12))
fdeaths_decomp <- decompose(ts(fdeaths, frequency = 12))
# Plot the decompositions
plot(ldeaths_decomp)

plot(mdeaths_decomp)

plot(fdeaths_decomp)

Statistical Analysis
Correlation Between Male and Female Deaths
# Calculate correlation
correlation <- cor(ldeaths_df$male_deaths, ldeaths_df$female_deaths)
correlation
[1] 0.9762413
Seasonal Autoregressive Integrated Moving Average (SARIMA)
Model
# Fit SARIMA model for total deaths
sarima_model <- auto.arima(ldeaths)
summary(sarima_model)
Series: ldeaths
ARIMA(0,0,2)(2,1,0)[12] with drift
Coefficients:
ma1 ma2 sar1 sar2 drift
0.1844 -0.2477 -0.9533 -0.5024 -5.5270
s.e. 0.1487 0.1469 0.1325 0.1305 1.1477
sigma^2 = 56263: log likelihood = -417.34
AIC=846.67 AICc=848.26 BIC=859.24
Training set error measures:
ME RMSE MAE MPE MAPE
Training set 10.67715 207.3123 126.0795 0.0532634 6.068906
MASE ACF1
Training set 0.5372706 0.01905913
# Forecast next 12 months
forecast_sarima <- forecast(sarima_model, h = 12)
plot(forecast_sarima, main = "SARIMA Forecast for Total Deaths")

Conclusion
This analysis provided an overview of the UK Lung Deaths dataset,
including visualizations and a simple SARIMA model for forecasting
future deaths. The dataset shows clear seasonal patterns and a strong
correlation between male and female deaths due to lung diseases. ```
This corrected version should now run without issues. You can save
this content back into your R Markdown file and run it again.
LS0tCnRpdGxlOiAiVUsgTHVuZyBEZWF0aHMgRGF0YSBBbmFseXNpcyIKb3V0cHV0OiBodG1sX25vdGVib29rCi0tLQoKYGBge3Igc2V0dXAsIGluY2x1ZGU9RkFMU0V9CiMgTG9hZCBuZWNlc3NhcnkgbGlicmFyaWVzCmxpYnJhcnkoZ2dwbG90MikKbGlicmFyeShmb3JlY2FzdCkKbGlicmFyeShkcGx5cikKYGBgCgojIyBJbnRyb2R1Y3Rpb24KClRoaXMgbm90ZWJvb2sgYW5hbHl6ZXMgdGhlIGJ1aWx0LWluIFVLIEx1bmcgRGVhdGhzIGRhdGFzZXQuIFRoZSBkYXRhc2V0IGNvbnNpc3RzIG9mIG1vbnRobHkgZGVhdGhzIGZyb20gYnJvbmNoaXRpcywgZW1waHlzZW1hLCBhbmQgYXN0aG1hIGluIHRoZSBVSywgY2F0ZWdvcml6ZWQgYnkgc2V4IChtYWxlIGFuZCBmZW1hbGUpLgoKIyMgTG9hZGluZyB0aGUgRGF0YQoKYGBge3IgbG9hZC1kYXRhfQojIExvYWQgdGhlIGRhdGEKZGF0YSgibGRlYXRocyIpCmRhdGEoIm1kZWF0aHMiKQpkYXRhKCJmZGVhdGhzIikKCiMgQ3JlYXRlIGEgZGF0YSBmcmFtZQpsZGVhdGhzX2RmIDwtIGRhdGEuZnJhbWUoCiAgZGF0ZSA9IHRpbWUobGRlYXRocyksCiAgdG90YWxfZGVhdGhzID0gYXMubnVtZXJpYyhsZGVhdGhzKSwKICBtYWxlX2RlYXRocyA9IGFzLm51bWVyaWMobWRlYXRocyksCiAgZmVtYWxlX2RlYXRocyA9IGFzLm51bWVyaWMoZmRlYXRocykKKQoKIyBEaXNwbGF5IHRoZSBmaXJzdCBmZXcgcm93cyBvZiB0aGUgZGF0YXNldApoZWFkKGxkZWF0aHNfZGYpCmBgYAoKIyMgQmFzaWMgRXhwbG9yYXRpb24KCmBgYHtyIGJhc2ljLWV4cGxvcmF0aW9ufQojIFN1bW1hcnkgc3RhdGlzdGljcwpzdW1tYXJ5KGxkZWF0aHNfZGYpCgojIENoZWNrIGZvciBtaXNzaW5nIHZhbHVlcwpzdW0oaXMubmEobGRlYXRoc19kZikpCmBgYAoKIyMgVmlzdWFsaXphdGlvbgoKIyMjIFRvdGFsIERlYXRocyBPdmVyIFRpbWUKCmBgYHtyIHRvdGFsLWRlYXRocy1wbG90fQpnZ3Bsb3QobGRlYXRoc19kZiwgYWVzKHggPSBkYXRlLCB5ID0gdG90YWxfZGVhdGhzKSkgKwogIGdlb21fbGluZShjb2xvciA9ICJibHVlIikgKwogIGxhYnModGl0bGUgPSAiVG90YWwgTHVuZyBEZWF0aHMgT3ZlciBUaW1lIiwgeCA9ICJZZWFyIiwgeSA9ICJUb3RhbCBEZWF0aHMiKQpgYGAKCiMjIyBNYWxlIGFuZCBGZW1hbGUgRGVhdGhzIE92ZXIgVGltZQoKYGBge3IgbWFsZS1mZW1hbGUtZGVhdGhzLXBsb3R9CmdncGxvdChsZGVhdGhzX2RmKSArCiAgZ2VvbV9saW5lKGFlcyh4ID0gZGF0ZSwgeSA9IG1hbGVfZGVhdGhzLCBjb2xvciA9ICJNYWxlIERlYXRocyIpKSArCiAgZ2VvbV9saW5lKGFlcyh4ID0gZGF0ZSwgeSA9IGZlbWFsZV9kZWF0aHMsIGNvbG9yID0gIkZlbWFsZSBEZWF0aHMiKSkgKwogIGxhYnModGl0bGUgPSAiTWFsZSBhbmQgRmVtYWxlIEx1bmcgRGVhdGhzIE92ZXIgVGltZSIsIHggPSAiWWVhciIsIHkgPSAiRGVhdGhzIikgKwogIHNjYWxlX2NvbG9yX21hbnVhbCgiIiwgYnJlYWtzID0gYygiTWFsZSBEZWF0aHMiLCAiRmVtYWxlIERlYXRocyIpLCB2YWx1ZXMgPSBjKCJibHVlIiwgInJlZCIpKQpgYGAKCiMjIyBTZWFzb25hbCBEZWNvbXBvc2l0aW9uCgpgYGB7ciBzZWFzb25hbC1kZWNvbXBvc2l0aW9ufQojIERlY29tcG9zZSB0aGUgdGltZSBzZXJpZXMKbGRlYXRoc19kZWNvbXAgPC0gZGVjb21wb3NlKHRzKGxkZWF0aHMsIGZyZXF1ZW5jeSA9IDEyKSkKbWRlYXRoc19kZWNvbXAgPC0gZGVjb21wb3NlKHRzKG1kZWF0aHMsIGZyZXF1ZW5jeSA9IDEyKSkKZmRlYXRoc19kZWNvbXAgPC0gZGVjb21wb3NlKHRzKGZkZWF0aHMsIGZyZXF1ZW5jeSA9IDEyKSkKCiMgUGxvdCB0aGUgZGVjb21wb3NpdGlvbnMKcGxvdChsZGVhdGhzX2RlY29tcCkKcGxvdChtZGVhdGhzX2RlY29tcCkKcGxvdChmZGVhdGhzX2RlY29tcCkKYGBgCgojIyBTdGF0aXN0aWNhbCBBbmFseXNpcwoKIyMjIENvcnJlbGF0aW9uIEJldHdlZW4gTWFsZSBhbmQgRmVtYWxlIERlYXRocwoKYGBge3IgY29ycmVsYXRpb24tYW5hbHlzaXN9CiMgQ2FsY3VsYXRlIGNvcnJlbGF0aW9uCmNvcnJlbGF0aW9uIDwtIGNvcihsZGVhdGhzX2RmJG1hbGVfZGVhdGhzLCBsZGVhdGhzX2RmJGZlbWFsZV9kZWF0aHMpCmNvcnJlbGF0aW9uCmBgYAoKIyMjIFNlYXNvbmFsIEF1dG9yZWdyZXNzaXZlIEludGVncmF0ZWQgTW92aW5nIEF2ZXJhZ2UgKFNBUklNQSkgTW9kZWwKCmBgYHtyIHNhcmltYS1tb2RlbH0KIyBGaXQgU0FSSU1BIG1vZGVsIGZvciB0b3RhbCBkZWF0aHMKc2FyaW1hX21vZGVsIDwtIGF1dG8uYXJpbWEobGRlYXRocykKc3VtbWFyeShzYXJpbWFfbW9kZWwpCgojIEZvcmVjYXN0IG5leHQgMTIgbW9udGhzCmZvcmVjYXN0X3NhcmltYSA8LSBmb3JlY2FzdChzYXJpbWFfbW9kZWwsIGggPSAxMikKcGxvdChmb3JlY2FzdF9zYXJpbWEsIG1haW4gPSAiU0FSSU1BIEZvcmVjYXN0IGZvciBUb3RhbCBEZWF0aHMiKQpgYGAKCiMjIENvbmNsdXNpb24KClRoaXMgYW5hbHlzaXMgcHJvdmlkZWQgYW4gb3ZlcnZpZXcgb2YgdGhlIFVLIEx1bmcgRGVhdGhzIGRhdGFzZXQsIGluY2x1ZGluZyB2aXN1YWxpemF0aW9ucyBhbmQgYSBzaW1wbGUgU0FSSU1BIG1vZGVsIGZvciBmb3JlY2FzdGluZyBmdXR1cmUgZGVhdGhzLiBUaGUgZGF0YXNldCBzaG93cyBjbGVhciBzZWFzb25hbCBwYXR0ZXJucyBhbmQgYSBzdHJvbmcgY29ycmVsYXRpb24gYmV0d2VlbiBtYWxlIGFuZCBmZW1hbGUgZGVhdGhzIGR1ZSB0byBsdW5nIGRpc2Vhc2VzLgpgYGAKClRoaXMgY29ycmVjdGVkIHZlcnNpb24gc2hvdWxkIG5vdyBydW4gd2l0aG91dCBpc3N1ZXMuIFlvdSBjYW4gc2F2ZSB0aGlzIGNvbnRlbnQgYmFjayBpbnRvIHlvdXIgUiBNYXJrZG93biBmaWxlIGFuZCBydW4gaXQgYWdhaW4u