Author: Lakshmi Prathyusha Mandadi
Course: Business Forecasting, ECON-6635-07
Published: December 13, 2024
Introduction
A diffusion index is an analytical tool for evaluating the depth and
scope of trends across financial markets or economic sectors.
Using key economic data, this study develops and evaluates a
diffusion index for the US that offers a broad view of economic
activity. Such an index helps analysts gauge the strength or weakness
of economic cycles and market trends. It measures the proportion of
individual components in a dataset that show positive changes relative
to the total number of components, which makes it useful for trend
analysis and decision making.
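As a minimal sketch of this definition, the base R snippet below computes the share of components that rose from one period to the next. The component names and values are made up for illustration and are not taken from FRED.

```r
# Hypothetical values for five components in two consecutive periods (illustrative only)
previous <- c(a = 100, b = 50, c = 7.0, d = 20, e = 3.2)
current  <- c(a = 103, b = 49, c = 7.4, d = 22, e = 3.2)

# A component counts as "rising" only if it strictly increased
rising <- current > previous

# Diffusion index as the proportion of rising components
diffusion_share <- mean(rising)
print(diffusion_share)
```

Here three of the five components rise, so the share is 0.6; an unchanged component (`e`) is not counted as rising.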
I have chosen three economic indicators to build the diffusion index:
Retail Sales (RSAFSNA), the Unemployment Rate (UNRATE), and Inflation
(CPIAUCSL, the Consumer Price Index for All Urban Consumers).
Retail Sales: The health of consumer spending is
indicated by the Census Bureau’s monthly report on retail and food
services sales. Retail sales in a variety of industries, including
department stores, furniture stores, and home furnishings stores, are
shown in this report.
Unemployment Rate: The unemployment rate is one of
the main indicators of labor market health and overall economic activity.
A lower unemployment rate typically accompanies economic expansion and
higher consumer spending, whereas a rising rate often signals a weakening
economy.
Inflation: The overall increase in prices for goods
and services within an economy is known as inflation. While extremely
low inflation may be a sign of a coming recession, excessive inflation
may indicate that the economy is overheating.
Data Collection
The first step in creating the diffusion index is data collection. We
obtain the data from the Federal Reserve Economic Data (FRED)
database.
# Load quantmod, which provides getSymbols()
library(quantmod)
# Obtain data for the three economic indicators from FRED
getSymbols(c("RSAFSNA", "UNRATE", "CPIAUCSL"),
           freq = "monthly",
           src = "FRED", return.class = 'xts',
           index.class = 'Date',
           from = "2010-01-01",
           to = Sys.Date(),
           auto.assign = TRUE)
[1] "RSAFSNA" "UNRATE" "CPIAUCSL"
# Create a single time-series dataset from the data
data <- merge(RSAFSNA, UNRATE, CPIAUCSL)
colnames(data) <- c("Retail_Sales", "Unemployment_Rate", "Inflation_Rate")
data$date <- index(data)
Create Custom Diffusion Index
First, for each of the three indicators, determine whether it is
improving or declining compared to the previous period. Then calculate
the diffusion index as the average of these improvement and decline
scores. Aggregating the indicators in this way gives a broader picture
of general economic trends.
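library(dplyr)
# Standardize the three indicators as z-scores (assumed step: scale_data is not defined in the code shown)
indicator_cols <- c("Retail_Sales", "Unemployment_Rate", "Inflation_Rate")
scale_data <- scale(coredata(data[, indicator_cols]))
dates <- index(data)  # keep the dates to rebuild the xts object below
data <- as.data.frame(scale_data)
# Score each variable as improving (1) or not improving (-1) relative to the previous period.
# The first row is compared with itself, so it always scores -1.
data <- data %>%
  mutate(
    unemployment_diff = ifelse(Unemployment_Rate < dplyr::lag(Unemployment_Rate, default = dplyr::first(Unemployment_Rate)), 1, -1),
    sales_diff = ifelse(Retail_Sales > dplyr::lag(Retail_Sales, default = dplyr::first(Retail_Sales)), 1, -1),
    inflation_diff = ifelse(Inflation_Rate > dplyr::lag(Inflation_Rate, default = dplyr::first(Inflation_Rate)), 1, -1)
  )
# Construct diffusion index: average using rowMeans
data$diffusion_index <- rowMeans(data[, c("unemployment_diff", "sales_diff", "inflation_diff")])
data <- xts(data, order.by = dates)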
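The scoring rule above compares the first observation with itself and treats an unchanged value as a decline, so the first row always scores -1. A hedged alternative, sketched below on made-up numbers, uses `sign(diff(x))` so that ties score 0 and the first period is left as NA. The helper name `direction_score` is hypothetical and is not part of the original analysis; the direction assigned to each indicator follows the scoring used above.

```r
# Hypothetical helper: +1 for a rise, -1 for a fall, 0 for no change, NA for the first period
direction_score <- function(x, higher_is_better = TRUE) {
  s <- c(NA, sign(diff(x)))
  if (higher_is_better) s else -s
}

# Made-up values for illustration only
toy <- data.frame(
  Retail_Sales      = c(100, 102, 101, 101),
  Unemployment_Rate = c(5.0, 4.9, 4.9, 5.1),
  Inflation_Rate    = c(200, 201, 202, 202)
)

scores <- data.frame(
  sales_diff        = direction_score(toy$Retail_Sales),
  unemployment_diff = direction_score(toy$Unemployment_Rate, higher_is_better = FALSE),
  inflation_diff    = direction_score(toy$Inflation_Rate)
)

# Average of the three scores; the first row stays NA because nothing precedes it
toy_index <- rowMeans(scores)
print(round(toy_index, 3))
```

Whether ties should count as neutral is a design choice; this variant simply makes that choice explicit rather than folding ties into declines.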
Plotting Diffusion Index
The following plot visualizes the diffusion index over time, with a
loess smoother added to highlight the overall trend.
The index summarizes how many of the selected economic variables
(unemployment rate, inflation rate, and retail sales) are moving in
the direction scored as positive: retail sales and the price index
rising, and the unemployment rate falling.
library(ggplot2)
head(data)
Retail_Sales Unemployment_Rate Inflation_Rate unemployment_diff sales_diff
2010-01-01 -1.55 1.78 -1.29 -1 -1
2010-02-01 -1.58 1.78 -1.30 -1 -1
2010-03-01 -1.17 1.83 -1.29 -1 1
2010-04-01 -1.20 1.83 -1.29 -1 -1
2010-05-01 -1.13 1.69 -1.30 1 1
2010-06-01 -1.19 1.60 -1.30 1 -1
inflation_diff diffusion_index
2010-01-01 -1 -1.000
2010-02-01 -1 -1.000
2010-03-01 1 0.333
2010-04-01 1 -0.333
2010-05-01 -1 0.333
2010-06-01 -1 -0.333
# Plot Diffusion Index (data is an xts object, so the dates come from its index)
ggplot(data, aes(x = index(data), y = diffusion_index)) +
  geom_line(color = "purple") +
  geom_smooth(method = "loess", color = "orange", se = TRUE) +
  labs(title = "Diffusion Index Over Time", x = "Date", y = "Diffusion Index") +
  theme_minimal() +
  theme(axis.text.x = element_text(angle = 45, hjust = 1))

A value close to 1 indicates that most variables are moving in the
direction scored as positive (+1), suggesting strong economic activity.
A value close to -1 indicates that most variables are moving in the
direction scored as negative (-1), suggesting weak economic activity.
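Because the index averages three scores of +1 or -1, it can only take the values -1, -1/3, 1/3, and 1. The short sketch below enumerates every combination to confirm this and relates the index to the share of indicators scoring +1 through the identity index = 2 * share - 1. The column names are illustrative.

```r
# All 2^3 combinations of +1 / -1 scores for three indicators
combos <- expand.grid(unemployment = c(-1, 1), sales = c(-1, 1), inflation = c(-1, 1))
score_cols <- c("unemployment", "sales", "inflation")

# Index as defined in the text: the row-wise average of the scores
combos$index <- rowMeans(combos[, score_cols])

# Share of indicators scoring +1 in each combination
combos$share_improving <- rowMeans(combos[, score_cols] == 1)

# Distinct values the index can take
possible_values <- sort(unique(round(combos$index, 3)))
print(possible_values)
```

With only four possible values, month-to-month movements in the raw index are necessarily coarse, which is one reason a smoother is helpful when reading the plot.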
The diffusion index highlights key shifts in the economy. After 2020,
the shaded confidence interval widens slightly, indicating greater
uncertainty or fluctuation over this period. The flattening of the
trend after 2020 is consistent with broader economic difficulties such
as the COVID-19 pandemic and the instability that followed the
recovery. The purple line captures periods of short-term volatility,
while the orange trend line can reveal economic turning points; over
the long run it shows a smoother upward trend.
Comparison of Chicago Fed National Activity Index (CFNAI) with
Custom Diffusion Index
The Chicago Fed National Activity Index (CFNAI) is a measure of
national economic activity. By comparing our diffusion index with the
CFNAI Diffusion Index (CFNAIDIFF), we can evaluate how well our chosen
indicators reflect general economic trends. We compare the two by
calculating the correlation coefficient and plotting both series on
the same chart.
# Compare CFNAIDIFF Index with Diffusion Index
getSymbols("CFNAIDIFF", src = "FRED", return.class = 'xts', index.class = 'Date', from = "2010-01-01", to = Sys.Date(), auto.assign = TRUE)
[1] "CFNAIDIFF"
CFNAIDIFF$date <- index(CFNAIDIFF)
# Merge both indexes
comparison_data <- merge(data$diffusion_index, CFNAIDIFF$CFNAIDIFF, join = "inner")
colnames(comparison_data) <- c("Diffusion_Index", "CFNAIDIFF_Index")
head(comparison_data)
Diffusion_Index CFNAIDIFF_Index
2010-01-01 -1.000 0.05
2010-02-01 -1.000 -0.09
2010-03-01 0.333 0.09
2010-04-01 -0.333 0.25
2010-05-01 0.333 0.44
2010-06-01 -0.333 0.25
Correlation Analysis
The correlation coefficient measures the linear relationship between
our diffusion index and the CFNAI Diffusion Index. A high positive
correlation would indicate that the two indexes move together and
reflect similar economic trends.
# Calculate correlation
corr <- cor(comparison_data$Diffusion_Index, comparison_data$CFNAIDIFF_Index, use = "complete.obs")
print(paste("Correlation Coefficient: ", round(corr, 3)))
[1] "Correlation Coefficient: 0.195"
The correlation coefficient is 0.195, which is weak. Although the
positive sign points to some relationship, it is not strong enough to
conclude that the two indexes are in sync or consistently reflect the
same economic trends.
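One way to probe a weak correlation like this is to test whether it differs from zero and to check whether smoothing changes the picture, since FRED's series notes describe CFNAIDIFF as based on a three-month moving average while the custom index is a raw monthly value. The sketch below uses simulated series, not the FRED data, so its numbers say nothing about the actual indexes; `custom` and `benchmark` are hypothetical names.

```r
set.seed(42)
n <- 120

# Simulated stand-ins for the two indexes (not real data)
common    <- cumsum(rnorm(n, sd = 0.1))   # shared slow-moving component
custom    <- common + rnorm(n, sd = 0.6)  # noisy monthly index
benchmark <- common + rnorm(n, sd = 0.2)  # smoother comparison index

# Pearson correlation with a test of the null hypothesis of zero correlation
raw_test <- cor.test(custom, benchmark)

# Trailing three-month moving average of the noisy index (first two values are NA).
# stats::filter is called explicitly because dplyr masks filter() when loaded.
custom_ma3 <- as.numeric(stats::filter(custom, rep(1 / 3, 3), sides = 1))
smooth_cor <- cor(custom_ma3, benchmark, use = "complete.obs")

print(round(c(raw = unname(raw_test$estimate), smoothed = smooth_cor), 3))
print(raw_test$p.value)
```

Applied to the real series, the same two steps would show whether 0.195 is distinguishable from zero and whether part of the gap comes from comparing an unsmoothed index with a smoothed one.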
ggplot of both Indexes
Below, we plot both indexes side by side to visualize their
similarities or differences over time.
# plot Diffusion Index and Chicago Fed Index
ggplot(comparison_data) +
geom_line(aes(x = index(comparison_data), y = Diffusion_Index), color = "cyan", alpha = 0.6) +
geom_line(aes(x = index(comparison_data), y = CFNAIDIFF_Index), color = "blue", alpha = 0.6) +
labs(title = "Comparison of Diffusion Index and CFNAIDIFF", x = "Date", y = "Index Value") +
theme_minimal() +
theme(axis.text.x = element_text(angle = 45, hjust = 1)) # Rotate x-axis labels if needed

When the two indexes move together in the comparison plot, the
economic trends they represent are closely aligned. Divergences
between them may reflect differences in how each index reacts to
particular economic events or crises. Like the custom diffusion index,
the CFNAI Diffusion Index reflects the share of its underlying
indicators that are improving.
Table of Key Metrics:
Metric | Custom Diffusion Index | CFNAIDIFF
Time Period | Jan 2010 – Sep 2024 | Jan 2010 – Sep 2024
Correlation (Custom vs CFNAI) | 0.195 | 0.195
Average Value | ~0.02 | ~-0.10
Standard Deviation | 0.21 | 0.45
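Averages and standard deviations of this kind can be computed directly from the merged series. The sketch below shows one way to do so, using only the six rows printed by `head(comparison_data)` earlier, so its results will not match the full-sample figures above. The object names `sample_rows` and `key_metrics` are illustrative.

```r
# The six rows shown in the head(comparison_data) output
sample_rows <- data.frame(
  Diffusion_Index = c(-1.000, -1.000, 0.333, -0.333, 0.333, -0.333),
  CFNAIDIFF_Index = c(0.05, -0.09, 0.09, 0.25, 0.44, 0.25)
)

# Mean and standard deviation per column, ignoring missing values
key_metrics <- sapply(sample_rows, function(x) {
  c(mean = mean(x, na.rm = TRUE), sd = sd(x, na.rm = TRUE))
})

print(round(key_metrics, 3))
```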
Insights
1. Unpredictable Nature of Economy:
Compared to the custom diffusion index, the CFNAIDIFF appears more
sensitive to underlying data inputs or economic shocks, as evidenced by
its strong fluctuations.
2. Indicator of Financial Crisis:
As of the most recent data in September 2024, the fall in both
indicators may indicate slowing economic momentum.
3. Custom Index Stability:
The custom index’s smoother trend might suggest a broader and less
shock-sensitive view of the state of the economy.
Although there are noticeable differences in frequency and magnitude,
both indices show comparable periodic oscillations. In contrast to the
custom index, which moves more smoothly, the CFNAIDIFF is more volatile,
exhibiting sharp spikes and falls. As the series draws to a close
(September 2024 for CFNAIDIFF), both indexes show a declining tendency.
This implies that current economic activity may be slowing down or
facing difficulties. In certain economic cycles, there are parallel
movements despite the weak correlation (0.195), suggesting partial
alignment.
Conclusion:
According to the comparison, the CFNAIDIFF records more recent
developments, whereas the custom diffusion index offers a more
comprehensive picture of economic stability. Both indices point to a
possible slowdown in the economy as of September 2024; additional
observation of upcoming data is necessary to validate these
indicators.
Lakshmi Prathyusha Mandadi, Guidance by Prof Dr A.E. Rodriguez,
Pompea College of Business, University of New Haven
lmand6@unh.newhaven.edu
