Abstract

This paper examines the key determinants of U.S. Gross Domestic Product (GDP) growth using a combination of Keynesian economic theory and statistical modeling. By leveraging ideal keynesian variables along with statistical algorithms (i.e. Lasso-Ridge regression), this study aims to provide a better understanding of the factors influencing GDP growth. In particular, we focus on variables such as consumption, investment, government spending, and net exports, central to Keynesian economic theory, while incorporating more modern variables like population, housing price index, and corporate profits. The results suggest that Unemployment Rate, Federal Funds Effective Rate, and Total Credit to Private Non-Financial Sector play significant roles in driving GDP growth, while Government Expenditures, Business Sentiment, and Median Weekly Income were removed from the model.

Introduction

Economic growth is one of the most fundamental objectives for policymakers, economists, and institutions around the world. To enhance the effectiveness monetary policies, it is critical to understand the underlying factors that drive economic growth, more specifically: GDP Growth. For over a century, the Keynesian economic framework has provided a solid foundation for understanding aggregate demand and its components, notably consumption, investment, government spending, and net exports. In this study, I drew inspiration from the Keynesian model to create a modern approach and investigate the relationship between GDP growth and these key determinants, utilizing a combination of theoretical insight and empirical modeling techniques.

Given the complexities of economic systems, I decided to use an elastic net model to model the relationships between the variables:

Theoretical Framework

Keynesian Economic Theory

Keynesian economic theory emphasizes the role of aggregate demand in determining the overall level of economic activity. GDP is the sum of consumption (C), investment (I), government spending (G), and net exports (X-M). Mathematically:

Where:

  • Consumption (C): Driven by consumer sentiment and household income, consumption is often the largest component of GDP.

  • Investment (I): Reflects business sentiment and the amount of credit available to firms, with potential lag effects on the economy.

  • Government Spending (G): Comprising public expenditures aimed at stimulating demand through fiscal policies, such as infrastructure investments.

  • Net Exports (X-M): Representing the trade balance, this component reflects external demand for domestic goods and services.

Keynesian Dynamics

In practice, these components interact within a dynamic system affected by external forces, such as changes in government policy, interest rates, or global economic conditions. The role of these exogenous factors has often been a subject of debate, especially when considering how external factors influence these domestic economic components.

  1. Government Expenditures (G) can directly inject demand into the economy through stimulus packages or public services.

  2. Interest Rates can influence Investment (I) by altering borrowing costs.

  3. Sentiment Indices (such as Consumer Sentiment and Business Sentiment) provide information on the willingness of households and firms to spend or invest, thereby influencing C and I.

  4. Net Exports (X-M) are impacted by exchange rates, trade policies, and global demand for domestic goods and services.

While the basic Keynesian model treats these relationships as linear and static, modern data-driven methods allow for more flexibility in modeling potentially complex dynamics.

Data Description

This study uses quarterly U.S. data from 1985 to 2024, gathered from Federal Reserve Bank of St. Louis. The data set consists of:

  1. GDP (In Billions): The target variable representing the U.S. Gross Domestic Product. The total monetary or market value of all goods and services produced within a country’s economy, measured in billions. It serves as a broad indicator of a nation’s economic health.

  2. Consumer Sentiment: A statistical measurement that indicates the overall attitude of consumers toward the economy. It reflects consumer confidence, considering factors like their views on personal finances, business conditions, and employment expectations.

  3. Federal Funds Effective Rate: The interest rate at which institutions lend reserve balances to other depository institutions overnight on an uncollateralized basis.

  4. Total Credit to Private Non-Financial Sector: Refers to the total credit provided to the private sector, excluding financial institutions, by banks and other lenders.

  5. Capacity Utilization: The percentage of potential economic output that is actually being produced. Higher levels of utilization indicate the economy is closer to full capacity and economic activity is high.

  6. Government Expenditures: The total amount of money spent by the government on goods and services, including public services, infrastructure, defense, healthcare, education, and social security programs.

  7. Net Exports: The difference between a country’s exports and imports of goods and services. A positive value (exports greater than imports) signifies a trade surplus, and a negative value signifies a trade deficit.

  8. Government Balance: The difference between government revenues (mostly from taxes) and expenditures. A positive balance indicates a surplus, while a negative balance indicates a deficit.

  9. Population: The total number of people living in a particular region or country. Key variable for understanding economic factors like consumer demand and labor force size.

  10. Housing Price Index: An index that measures the price changes of residential properties over time. It indicates how housing market values are changing and affects economic stability and wealth perception.

  11. Disposable Income: The amount of income left for an individual or household after taxes, typically available for spending or saving. It serves as an indicator of consumer financial health.

  12. Unemployment Rate: The percentage of the labor force that is unemployed and actively seeking employment. It is a key indicator of economic stability and job market conditions.

  13. Corporate Profits: The total profits of businesses within a given economy, reflecting business health and the economy’s overall financial performance. High corporate profits often suggest good economic conditions and potential for reinvestment.

The time series data are structured on a quarterly basis, ensuring consistency for time series modeling. All data were cleaned, transformed to a uniform time frequency, and merged into a single data set.

Methodology

Elastic Net Model

Elastic Net Regression was employed to model the relationship between the response variable and a set of predictors. This technique is particularly suited for data sets with a large number of predictors, potential multicollinearity among variables, or when variable selection is desired. The Elastic Net combines the penalties of Lasso (L1) and Ridge (L2) regression, enabling both feature selection and coefficient regularization.

Key Aspects of Elastic Net Regression:

  1. Model Specification: The Elastic Net minimizes a loss function comprising the residual sum of squares and a penalty term that includes a mixture of L1 and L2 norms. This penalty term is controlled by two hyperparameters:

    • Lambda: Controls the overall regularization strength, with larger values increasing shrinkage.

    • Alpha: Governs the trade-off between L1 (Lasso) and L2 (Ridge) penalties. alpha=1 results in Lasso regression, while alpha=0 corresponds to Ridge regression. A value of 0 < alpha < 1 blends the two.

  2. Data Preparation:

    • Predictors were standardized to have a mean of zero and a standard deviation of one, ensuring that variables with different scales were penalized equally.
  3. Regularization:

    • The L1 penalty encourages sparsity in the coefficients, potentially setting some coefficients to zero and facilitating variable selection.

    • The L2 penalty shrinks the magnitude of coefficients, addressing multicollinearity by distributing weight across correlated predictors.

  4. Hyperparameter Tuning:

    • Cross-validation was utilized to identify the optimal combination of lambda and alpha. This ensures that the model generalizes well to unseen data and avoids overfitting.

Improvements

ARIMA Model

An ARIMA model was applied to analyze GDP growth as a time series. The steps included transforming the data to achieve stationarity by differencing it. This was followed by analyzing the ACF (Autocorrelation Function) and PACF (Partial Autocorrelation Function) plots to determine the optimal p, d, and q parameters. I further validated the model using auto.arima to confirm the best-fitting ARIMA model.

  1. ARIMA (0, 2, 1): Two differencings were done to ensure stationarity of the data. One significant spike in the ACF graph indicates a q value of 1, for the moving average parameter.

  2. Root Mean Squared Error (RMSE): 308.11, lower than the RMSE value founded in this model. However, the ARIMA model fails to capture variables affecting GDP Growth.

  3. Mean Absolute Percentage Error (MAPE): 0.719%, a relatively low MAPE, but ARIMA does not capture the influence of variables (e.g., unemployment, CPI growth) that might affect GDP growth trends, which may have contributed to missing extreme shifts in economic behavior (e.g., unexpected events).

While the ARIMA model works well in forecasting GDP growth trends, it fails to incorporate exogenous factors or anomalies in the data, such as shocks or external policy changes, which are significant in explaining GDP fluctuations.

Stepwise Linear Regression

A stepwise linear regression model was applied in my second version of this paper to analyze GDP Growth as a linear model. I only used variables related to the keynesian methodology (Consumption, Investment, Government Spending, Net Exports). Stepwise regression involves fitting a full model and removing variables until the lowest AIC value is calculated. Stepwise allows insignificant variables to be included in the model as long as the AIC value is minimized.

  1. Root Mean Square Error: 956.54, identifying the typical deviation from the actual GDP values. This is significantly higher than the Elastic Net model which may be caused from the specific variables used in the model. Our improved model includes certain variables from the original model and new variables were introduced.

  2. Variable Importance: Following stepwise selection, variables such as Federal Funds Effective Rate, Government Expenditures, and Net Exports remain significant drivers of GDP growth, whereas Consumer Sentiment, Median Weekly Income and Capacity Utilization showed no significant effect and was removed from the model.

Elastic Net Model Results

The Elastic Net regression model was employed to analyze the relationship between various predictors and the target variable, GDP (in billions). The results of the model are summarized as follows:

  1. Optimal Lambda Value:
    The model selected the optimal regularization parameter, lambda, as 13.02394. This parameter strikes a balance between the L1 (lasso) and L2 (ridge) penalties, allowing for both feature selection and reduced collinearity among predictors.

  2. Root Mean Squared Error (RMSE):
    The model’s RMSE on the full data set was 317.31, indicating the average deviation of the predicted GDP values from the actual values. Given the scale of the GDP (in billions), this error suggests a high degree of accuracy.

  3. Mean Absolute Percentage Error (MAPE):

    The model’s MAE on the full data set was 2.42%, representing the average magnitude of errors between the predicted and actual GDP values. Given the GDP scale (in billions), this error indicates a high level of accuracy in the model’s predictions.

Residual Analysis: The residuals analysis confirms that the Elastic Net model effectively captured the relationship between predictors and GDP. The residuals exhibited a random scatter around zero, were approximately normally distributed, and displayed homoscedasticity. This demonstrates that the model’s assumptions are reasonably met, further validating its reliability and accuracy in predicting GDP.

  1. Adequate Model Fit: The Residuals vs. Fitted Values Plot demonstrates that the Elastic Net model has captured the relationship between the predictors and GDP fairly well, though there appears to be slight clustering at certain fitted values..

  2. No Heteroscedasticity: The spread of residuals is consistent, supporting the assumption of homoscedasticity.

  3. No Outliers: There are no significant outliers or influential residuals that might skew the model’s performance.

Conclusion

This study successfully identifies the key determinants of U.S. GDP growth, showing that Unemployment Rate, Federal Funds Effective Rate, and Total Credit to Private Non-Financial Sector play crucial roles in driving economic output. Both traditional and linear regression approaches support these findings, with an Elastic Net model offering superior predictive capability. These results provide valuable insights for policymakers, indicating that focusing on improving government spending and efficient utilization of capacity can significantly impact GDP growth. Future research may benefit from exploring other factors, such as political policies, and further exploration with other algorthms through hyperparameter optimization and research.

Sources

Gross Domestic Product (GDP)
Federal Reserve Bank of St. Louis. (2024). Gross Domestic Product (GDP) [Dataset]. FRED. Retrieved from https://fred.stlouisfed.org/series/GDP

Consumer Sentiment
University of Michigan. (2024). Index of Consumer Sentiment [Dataset]. University of Michigan Surveys of Consumers. Retrieved from https://fred.stlouisfed.org/series/UMCSENT

Federal Funds Effective Rate
Federal Reserve Bank of St. Louis. (2024). Federal Funds Effective Rate [Dataset]. FRED. Retrieved from https://fred.stlouisfed.org/series/FEDFUNDS

Total Credit to Private Non-Financial Sector
Bank for International Settlements. (2024). Total Credit to Private Non-Financial Sector [Dataset]. BIS Statistics Explorer. Retrieved from https://fred.stlouisfed.org/series/QUSPAM770A

Government Expenditures
Federal Reserve Bank of St. Louis. (2024). Government Consumption Expenditures and Gross Investment [Dataset]. FRED. Retrieved from https://fred.stlouisfed.org/series/GEXPND

Government Balance
International Monetary Fund. (2024). Government Net Lending/Borrowing [Dataset]. IMF DataMapper. Retrieved from https://fred.stlouisfed.org/series/FGLBAFQ027S

Median Weekly Income
U.S. Bureau of Labor Statistics. (2024). Median Weekly Earnings of Full-Time Wage and Salary Workers [Dataset]. Retrieved from https://fred.stlouisfed.org/series/LES1252881600Q

Population
U.S. Census Bureau. (2024). Total U.S. Population [Dataset]. Retrieved from https://fred.stlouisfed.org/series/POPTHM

Housing Price Index
Federal Housing Finance Agency. (2024). FHFA House Price Index [Dataset]. Retrieved from https://fred.stlouisfed.org/series/USSTHPI

Disposable Income
Federal Reserve Bank of St. Louis. (2024). Real Disposable Personal Income [Dataset]. FRED. Retrieved from https://fred.stlouisfed.org/series/DPI

Unemployment Rate
U.S. Bureau of Labor Statistics. (2024). Unemployment Rate [Dataset]. Retrieved from https://fred.stlouisfed.org/series/UNRATE

Net Exports
Federal Reserve Bank of St. Louis. (2024). Net Exports of Goods and Services [Dataset]. FRED. Retrieved from https://fred.stlouisfed.org/series/NETEXP

Capacity Utilization
Federal Reserve Bank of St. Louis. (2024). Capacity Utilization: Total Industry [Dataset]. FRED. Retrieved from https://fred.stlouisfed.org/series/TCU

Corporate Profits
U.S. Bureau of Economic Analysis. (2024). Corporate Profits After Tax with Inventory Valuation and Capital Consumption Adjustments [Dataset]. BEA. Retrieved from https://fred.stlouisfed.org/series/CP