Diffusion Index and graph in via a ggplot
Three US economic variables were used to produce the Diffusion Index
for the US economy: the unemployment rate (UNRATE), the average hourly
earnings of all employees (CES0500000003), and • CPIAUCSL, or the
Consumer Price Index for All Urban Consumers
To enable straightforward interpretation, the data is then condensed
for the years 2010–2024 at monthly intervals.
To gauge the changes over time, the initial differences were computed
for every series. After that, the variations were transformed into up
(1), down (-1), or no change (0). This conversion aids in trend analysis
and Diffusion Index computation.
The following formula was used to determine the Diffusion Index:
Diffusion Index=(Positives/Total−Negatives/Total) In order to improve
interpretability, we scaled the time series of monthly Diffusion Index
values from -1 to 1 to -100 to 100.

The graph displays multiple economic performance cycles.
For example, there are times when the index stays positive for a long
time, which denotes steady economic expansion.
Long stretches below zero, on the other hand, indicate protracted
economic downturns.
Significant changes in policy or the economy may be reflected in
large index spikes or dips.
2010–2012: The years after the 2008 financial crisis, probably
indicating a recovery.
2013–2015: Stable economic expansion periods.
2020: A notable decline that may have been brought on by the COVID-19
pandemic.
2021–2023: Post-pandemic economic rebound.
Diffusion Index and graph in via a ggplot with a built in smoother or
you create the smoother.

The Smoother represents the time frame.
2010–2012: During this time, values are rising and remaining
positive, indicating a recovery phase from the 2008 financial
crisis.
2013–2015: Stable economic conditions are shown by relative
stability, with less variation around the zero line.
2020: A steep drop that most likely reflects the COVID-19 pandemic’s
economic effects.
2021–2023: A period of recovery during which the index resumes its
upward trajectory.
Here, the Smoother Line averages out short-term volatility to help
visualize the underlying trend.
Additionally, the confidence interval is indicated by the gray shaded
area, which displays the range that the actual trend most likely lies
within. Greater confidence in the trend estimate is indicated by a
narrower band.
Chicago Fed National Activity Index: Diffusion Index (CFNAIDIFF)

After that, I added a smoother to the data for the Chicago Fed
National Activity Index: Diffusion Index over the same time frame
(2010–2024) as my US Economic Diffusion Index. It is simple to
comprehend by saying that the index generally goes lower between 2010
and 2015, indicating times of slower economic activity. The indicator
exhibits greater volatility between 2015 and 2020, but it also shows a
minor upward tendency, suggesting a moderate improvement in the economy.
As a result of the COVID-19 pandemic, there is a noticeable peak around
2020 and a sharp fall after that year.
- calculating the correlation coefficient
unemp avghr cpi cfnaid
unemp 1.0000000 -0.59759267 -0.62428677 0.19477706
avghr -0.5975927 1.00000000 0.98786874 -0.04384157
cpi -0.6242868 0.98786874 1.00000000 -0.07225501
cfnaid 0.1947771 -0.04384157 -0.07225501 1.00000000
In general, there is little correlation between my economic
indicators and the Chicago Fed National Activity Index: Diffusion Index.
This suggests that although there are a few small correlations, other
factors probably have a greater impact on the Chicago Fed National
Activity Index: Diffusion Index.
In particular:
There is a small but marginally positive link between the
unemployment rate and CFNAIDIFF (0.19477706).
The CPI (-0.07225501) and average hourly wages (-0.04384157) have
very little effect on CFNAIDIFF.
- a ggplot of the two series side by side.

This graph compares two important economic indicators: the Chicago
Fed National Activity Diffusion Index (CFNAIDIFF) from 2010 to 2024 and
the US Economy Diffusion Index, which I developed. Both indices track
shifts in the state of the economy over time.
The blue line represents the US Economy Diffusion Index, which
fluctuates sharply and frequently. This suggests a very unstable
economic climate where things might shift drastically very quickly. A
more steady pattern of economic activity is suggested by the CFNAIDIFF,
which is represented by the orange line and has smoother and fewer
dramatic variations.
The US Economy Diffusion Index shows more noticeable ups and downs
between 2010 and 2015, indicating faster economic shifts. The CFNAIDIFF,
meantime, continues its more steady, slow trend. The COVID-19 pandemic’s
economic effects are probably reflected in the CFNAIDIFF’s notable peak
and subsequent drop around 2020. Significant activity is also shown in
the blue line throughout this time, suggesting notable changes in the
economy. Both indicators’ graphs exhibit ups and downs following the
2020 era, although the US Economic Diffusion primarily remains in the
positive region with just one dip to a negative region. In contrast to
my US Economic Diffusion Index, which has some positive development
towards the end, the Chicago Fed National Activity Diffusion Index
(CFNAIDIFF) frequently slips to the negative region and displays a
deteriorating tendency.
Conclusion:
All things considered, this comparison shows how the two indices
differ in terms of volatility and stability. The Chicago Fed National
Activity Diffusion Index (CFNAIDIFF) offers a more stable view of
economic patterns, whereas the US Economy Diffusion Index indicates more
abrupt and significant changes in the economy. We can better grasp the
subtleties of how many economic indicators represent the general state
and activity of the economy with the aid of this research.
---
title: "Fall24 Final Diffusion Confusion Econ-6635-07"
output: html_notebook
---

```{r message=FALSE, warning=FALSE, include=FALSE}
suppressWarnings({
  suppressPackageStartupMessages({
library(markovchain)
library(tidyverse)
library(quantmod)
library(tsbox)
library(vars)
library(tidyverse)
library(pdfetch)
library(lubridate)  
library(ggplot2)
library(forecast)
library(tsbox)
library(tsibble)
library(timetk)
library(TSstudio)
library(rio)
library(readxl)
library(tidyr)
library(stringr)
library(dygraphs)
library(quantmod) 
library(ggthemes)
library(gridExtra)
library(corrplot)})
})

getSymbols(c("UNRATE", "CES0500000003", "CPIAUCSL", "CFNAIDIFF"), 
                     freq = "monthly", 
                     src = "FRED", return.class = 'xts',
                     index.class  = 'Date',
                     from = "2010-01-01",
                     to = Sys.Date(),
                     periodicity = "monthly")

```

Diffusion Index and graph in via a ggplot 

```{r message=FALSE, warning=FALSE, include=FALSE}


              ts_info(UNRATE)
              ts_info(CES0500000003)
              ts_info(CPIAUCSL)
unemp = UNRATE
avghr = CES0500000003
cpi = CPIAUCSL

              ts_info(unemp)
              ts_info(avghr)
              ts_info(cpi)        
        
              #covert to ts
        unemp <- unemp["2010-01-01/2024-10-01"] |> ts_ts() 
        avghr <- avghr["2010-01-01/2024-10-01"] |> ts_ts() 
        cpi <- cpi["2010-01-01/2024-10-01"] |> ts_ts() 
        #
            
            #assemble it
            mydata = cbind.data.frame(unemp,avghr,cpi)
            head(mydata,3)
            

                  #' Obtain first differences
                  mydf = mydata %>% 
                    mutate(unempD1 = tsibble::difference(unemp, differences = 1),
                           avghrD1 = tsibble::difference(avghr, differences = 1),
                          cpiD1 = tsibble::difference(cpi, differences = 1)) %>% 
                    dplyr::select(c(unempD1, avghrD1, cpiD1)) |> na.omit()

                  colSums(is.na(mydf))

                # Convert to either up or down
                head(mydf,3)
                mydf_df = ifelse(mydf > 0, 1, -1 )
                mydf_df
                    table(mydf_df)
                    
      #convert to up,down, or no change
      mydf_mat = apply(mydf, 2, sign) 
          table(mydf_mat)
      mydf_mat
      

            pos = apply(mydf_mat, 1,  function(row) sum(row>0) ) # counts the positive
            neg = apply(mydf_mat, 1,  function(row) sum(row<0) ) # counts the negatives
            
                #pos = apply(mydf, 1,  function(row) sum(row>0) ) # counts the positive
                #neg = apply(mydf, 1,  function(row) sum(row<0) ) # counts the negatives
            
            tot = pos + neg
            (index = (pos/tot - neg/tot))
                table(index)
                str(index)
                
Date = seq.Date(from = as.Date("2010-05-1"), length.out = 177, by = "month")
            length(Date)
            
                index_df <- data.frame(index = index, time = Date) 

                cbind(mydf_mat, pos, neg, tot, index)

ma_index = zoo::rollmean(index, 7, align = "right")
    length(index)
```
Three US economic variables were used to produce the Diffusion Index for the US economy: the unemployment rate (UNRATE), the average hourly earnings of all employees (CES0500000003), and
• CPIAUCSL, or the Consumer Price Index for All Urban Consumers

To enable straightforward interpretation, the data is then condensed for the years 2010–2024 at monthly intervals.

To gauge the changes over time, the initial differences were computed for every series. After that, the variations were transformed into up (1), down (-1), or no change (0). This conversion aids in trend analysis and Diffusion Index computation.

The following formula was used to determine the Diffusion Index: Diffusion Index=(Positives/Total−Negatives/Total)
In order to improve interpretability, we scaled the time series of monthly Diffusion Index values from -1 to 1 to -100 to 100.



```{r echo=FALSE, message=FALSE, warning=FALSE}

ggplot(index_df, aes(x = time, y=index) ) + 
                  geom_line(color = "green") +    
                  geom_hline(yintercept = 0, color = "darkred") + 
                  labs(title = "US Economy Diffusion Index", x = "Observation", y = "Index Value") + 
                  theme_minimal()
```

The graph displays multiple economic performance cycles.

For example, there are times when the index stays positive for a long time, which denotes steady economic expansion.

Long stretches below zero, on the other hand, indicate protracted economic downturns.

Significant changes in policy or the economy may be reflected in large index spikes or dips.


2010–2012: The years after the 2008 financial crisis, probably indicating a recovery.

2013–2015: Stable economic expansion periods.

2020: A notable decline that may have been brought on by the COVID-19 pandemic.

2021–2023: Post-pandemic economic rebound.


Diffusion Index and graph in via a ggplot with a built in smoother or you create the smoother.

```{r echo=FALSE, message=FALSE, warning=FALSE}

ggplot(index_df, aes(x = time, y = index)) +
          geom_line(color = "darkgreen")+
          geom_hline(yintercept = 0, color = "red")+
          geom_smooth(colour = "green") +
          labs( title = "US Economy Diffusion Index with Smoother") +
          xlab("Months") +
          ylab("Change")+
          theme(axis.line.x = element_line(size= 0.75, colour = "black"), 
                axis.line.y = element_line(size= 0.75, colour = "black"), 
                legend.position = "bottom", 
                legend.direction = "horizontal", element_blank()) +
          theme_tufte()

```

The Smoother represents the time frame. 

2010–2012: During this time, values are rising and remaining positive, indicating a recovery phase from the 2008 financial crisis.

2013–2015: Stable economic conditions are shown by relative stability, with less variation around the zero line.

2020: A steep drop that most likely reflects the COVID-19 pandemic's economic effects.

2021–2023: A period of recovery during which the index resumes its upward trajectory.

Here, the Smoother Line averages out short-term volatility to help visualize the underlying trend.

Additionally, the confidence interval is indicated by the gray shaded area, which displays the range that the actual trend most likely lies within. Greater confidence in the trend estimate is indicated by a narrower band.


Chicago Fed National Activity Index: Diffusion Index (CFNAIDIFF)

```{r message=FALSE, warning=FALSE, include=FALSE}
cfnaid = CFNAIDIFF

ts_info(CFNAIDIFF)

#convert to ts
cfnaid <- cfnaid["2010-01-01/2024-10-01"] |> ts_ts()

length(cfnaid)

# Convert cfnaid to a data frame 
cfnaid_df <- data.frame(Date = Date, cfnaid = as.numeric(cfnaid[-c(1)]))
```


```{r echo=FALSE, message=FALSE, warning=FALSE}
ggplot(cfnaid_df, aes(x = Date, y = cfnaid)) +
          geom_line(color = "brown")+
          geom_hline(yintercept = 0, color = "red") +
          geom_smooth(colour = "violet") +
          labs( title = "Chicago Fed National Activity Index: Diffusion Index with smoother") +
          xlab("Months") +
          ylab("Change")+
          theme(axis.line.x = element_line(size= 0.75, colour = "black"), 
                axis.line.y = element_line(size= 0.75, colour = "black"), 
                legend.position = "bottom", 
                legend.direction = "horizontal", element_blank()) +
          theme_tufte()

```

After that, I added a smoother to the data for the Chicago Fed National Activity Index: Diffusion Index over the same time frame (2010–2024) as my US Economic Diffusion Index. It is simple to comprehend by saying that the index generally goes lower between 2010 and 2015, indicating times of slower economic activity. The indicator exhibits greater volatility between 2015 and 2020, but it also shows a minor upward tendency, suggesting a moderate improvement in the economy. As a result of the COVID-19 pandemic, there is a noticeable peak around 2020 and a sharp fall after that year.



(i) calculating the correlation coefficient

```{r echo=FALSE, message=FALSE, warning=FALSE}
correlation_df = cbind.data.frame(mydata,cfnaid)

correlation_coefficient = cor(correlation_df)
```


```{r echo=FALSE, message=FALSE, warning=FALSE}
correlation_coefficient
```

In general, there is little correlation between my economic indicators and the Chicago Fed National Activity Index: Diffusion Index. This suggests that although there are a few small correlations, other factors probably have a greater impact on the Chicago Fed National Activity Index: Diffusion Index. 

In particular:

There is a small but marginally positive link between the unemployment rate and CFNAIDIFF (0.19477706).

The CPI (-0.07225501) and average hourly wages (-0.04384157) have very little effect on CFNAIDIFF.


(ii) a ggplot of the two series side by side.

```{r message=FALSE, warning=FALSE, include=FALSE}
combined_data <- data.frame(Date = Date, 
                            `US Economy Diffusion Index` = index, 
                            `Chicago Fed National Activity Diffusion Index` = cfnaid[-1]) 

```


```{r echo=FALSE, message=FALSE, warning=FALSE}
ggplot(combined_data) + 
  geom_line(aes(x = Date, y = US.Economy.Diffusion.Index, 
                color = "US Economy Diffusion Index")) + 
  geom_line(aes(x = Date, y = Chicago.Fed.National.Activity.Diffusion.Index, 
                color = "Chicago Fed National Activity Diffusion Index")) + 
  geom_hline(yintercept = 0, color = "red") + 
  labs(title = "Comparison of US Economy Diffusion Index and Chicago Fed National Activity Diffusion Index", 
       x = "Months", y = "Change", color =  "Index Type") + 
  scale_color_manual(values = c("US Economy Diffusion Index" = "green", 
                                "Chicago Fed National Activity Diffusion Index" = "brown")) + 
  theme(axis.line.x = element_line(size = 0.75, colour = "black"), 
        axis.line.y = element_line(size = 0.75, colour = "black")) +
  theme_tufte()+
  theme(legend.position = "top", 
        plot.title = element_text(size = 10))
```

This graph compares two important economic indicators: the Chicago Fed National Activity Diffusion Index (CFNAIDIFF) from 2010 to 2024 and the US Economy Diffusion Index, which I developed. Both indices track shifts in the state of the economy over time.

The blue line represents the US Economy Diffusion Index, which fluctuates sharply and frequently. This suggests a very unstable economic climate where things might shift drastically very quickly. A more steady pattern of economic activity is suggested by the CFNAIDIFF, which is represented by the orange line and has smoother and fewer dramatic variations.

The US Economy Diffusion Index shows more noticeable ups and downs between 2010 and 2015, indicating faster economic shifts. The CFNAIDIFF, meantime, continues its more steady, slow trend. The COVID-19 pandemic's economic effects are probably reflected in the CFNAIDIFF's notable peak and subsequent drop around 2020. Significant activity is also shown in the blue line throughout this time, suggesting notable changes in the economy. 
Both indicators' graphs exhibit ups and downs following the 2020 era, although the US Economic Diffusion primarily remains in the positive region with just one dip to a negative region. In contrast to my US Economic Diffusion Index, which has some positive development towards the end, the Chicago Fed National Activity Diffusion Index (CFNAIDIFF) frequently slips to the negative region and displays a deteriorating tendency.



## Conclusion:
All things considered, this comparison shows how the two indices differ in terms of volatility and stability. The Chicago Fed National Activity Diffusion Index (CFNAIDIFF) offers a more stable view of economic patterns, whereas the US Economy Diffusion Index indicates more abrupt and significant changes in the economy. 
We can better grasp the subtleties of how many economic indicators represent the general state and activity of the economy with the aid of this research.


## Refrences:

https://fred.stlouisfed.org/ 

https://arods-docs.site44.com/CTScorecard.html 









