\[ \definecolor{gray}{RGB}{192,192,192} \def\vect#1{\boldsymbol #1} \def\bigO#1{\mathcal{O}(#1)} \def\Cond#1#2{\left(#1 \mid #2\right)} \def\diff{{\mathop{}\!\mathrm{d}}} \]
E.g. a bank that provides loans to firms.
Important for both regulators and holders of the portfolio.
A component in bottom-up models.
The portfolio consists of \(n_t\) loans to firms \(R_t = \{i_{1t}, i_{2t}, \dots, i_{n_{t}t}\}\).
E.g. the current amount outstanding.
The default indicator \(Y_{it}\in \{0,1\}\) is 1 if firm \(i\) defaults in interval \(t\).
That is, \(G_{it}E_{it}\) is lost.
The loss in interval \(t\) is
\(E_{it}\in(0,\infty)\): exposure, \(G_{it}\in [0,1]\) loss-given-default, and \(Y_{it}\in \{0,1\}\) is the default indicator.
The loss in interval \(t\) is
\(E_{it}\in(0,\infty)\): exposure, \(G_{it}\in [0,1]\) loss-given-default, and \(Y_{it}\in \{0,1\}\) is the default indicator.
Focus on default indicators.
Particularly to model the tail of \(L_t\).
Given firm variables \(\vec x_{it}\) and macro variables \(\vec m_t\) one models the probability of default as
\[g(P(Y_{it} = 1 \mid \vec x_{it}, \vec m_t)) = \vec \beta^\top \vec x_{it} + \vec\gamma^\top\vec m_t \]
e.g. logit function. See Shumway (2001), Chava and Jarrow (2004), Duffie, Saita, and Wang (2007), and Campbell and Szilagyi (2008).
I.e. firms are only correlated through observable firm variables and macro variables.
E.g. an omitted macro variable.
Thus, the model is extended to
See Duffie et al. (2009), P. Chen and Wu (2014), and Qi, Zhang, and Zhao (2014). Koopman, Lucas, and Schwaab (2011), Koopman, Lucas, and Schwaab (2012), and Schwaab, Koopman, and Lucas (2017) use a similar model with group specific \(\alpha_t\)s or where groups load differently on \(\alpha_t\).
Though, we may expect a monotone partial effect.
\[g(P(Y_{it} = 1 \mid \vec x_{it}, \vec m_t,\alpha_t)) = \underbrace{\vec \beta^\top \vec f(\vec x_{it})}_? + \vec\gamma^\top\vec m_t + \alpha_t\]
Min and Lee (2005), Berg (2007), Alfaro et al. (2008), Kim and Kang (2010), and Jones, Johnstone, and Wilson (2017).
Relax assumptions and focus on the firm-level and aggregate performance.
See Lando et al. (2013), Filipe, Grammatikos, and Michala (2016), and Jensen, Lando, and Medhat (2017).
\[g(P(Y_{it} = 1 \mid \vec x_{it}, \vec m_t, \alpha_t)) = \underbrace{\vec \beta^\top_t}_? \vec f(\vec x_{it}) + \vec\gamma^\top\vec m_t + \alpha_t\]
Relax assumptions with a model that can be directly used for forecasting.
Implementations of fast approximation methods to estimate discrete time survival models.
Focus on non-linear effects with tall and narrow sample of limited liability companies.
Focus on time-varying effects with short and wide sample of US public companies.
Covers the particle based methods in the dynamichazard package
which are used in chapter 3.
The Oumuamua package is almost finished.
Covers implementations of fast approximation methods.
Covers the implemented particle based methods.
Interested in
\[P(Y_{it} = 1 \mid \vec x_{it}, Y_{i1} = \cdots = Y_{it-1} = 0) = h_t(\vec x_{it})\]
Can use non-parametric method.
Use parametric model for \(h_t\).
\[\begin{aligned} g(P(Y_{it} = 1 \mid \vec x_{it}, \vec m_t, \vec u_{it}, \vec\alpha_t)) &= \vec \beta^\top \vec x_{it} + \vec\gamma^\top\vec m_t + \vec\alpha_t^\top \vec u_{it} \\ \vec\alpha_t &= F \vec\alpha_{t-1} + \vec\epsilon_t \\ \vec\epsilon_t & \sim N(\vec 0, \Sigma) \end{aligned}\]
Yields intractable integral.
The former is suggested by Fahrmeir (1992) and Fahrmeir (1994) and for the latter see Julierm and Uhlmann (1997) and Wan and Merwe (2000).
The approximations are commonly used in engineering.
The implementations are in C++, very fast, and support computation in parallel.
The package also support
The mssm contains a new implementation of some of the above methods.
Danmarks Nationalbank
Danmarks Nationalbank
Recent papers suggest that “machine learning” models perform better on the firm-level.
Suggests that previous model may be misspecified.
E.g. due to co-movements in covariates.
Find better firm-level performance with more complex models.
Smaller difference in firm-level performance.
Suggest a model to easily account non-linear effects and with time-varying effects.
Danish limited liability companies.
Tall sample: 198 929 firms.
Limits ability to model changes through time.
Assumptions: linearity, additivity, and conditional independence.
Assumptions: conditional independence.
Assumptions: conditional independence.
Assumptions: conditional independence given the random effect.
Estimated partial effect in the generalized additive model of log of the size and net profit to a size measure of the firm.
◆: Generalized linear model, ▲: Generalized additive model, and ●: Gradient boosted tree model.
◆: Generalized linear model, ▲: Generalized additive model, ●: Gradient boosted tree model, ■: realized rate, and ○: Generalized linear mixed model. Bars are 90 pct. prediction intervals.
Danmarks Nationalbank
Due to evidence provided by Lando and Nielsen (2010).
Relax linearity and additivity.
Due to evidence provided by e.g. Lando et al. (2013).
the idiosyncratic stock volatility of the firm, the net income over total assets, and log market value over total liabilities. Two of them are related to distance-to-default.
with additional variables and non-linear effects.
Time-varying relative market size effect.
US public firms.
Short sample: 3 020 firms.
Wide sample: 1980 to 2016 with monthly time intervals.
The figures in the parentheses are Wald \(\chi^2\) statistics. \(\mathcal{M_1}\): model similar to Duffie, Saita, and Wang (2007), \(\mathcal{M_2}\): model with additional variables, and \(\mathcal{M_3}\): model with non-linear effects and an interaction.
Large difference in the log-likelihood.
Similar to evidence by Lando and Nielsen (2010) and Bharath and Shumway (2008).
\[\begin{aligned} \vec z_{it} &= (\vec x_{it}^\top, \vec m_t^\top, u_{it}, \alpha_t, b_t)^\top \\ g(P(Y_{it} = 1 \mid \vec z_{it})) &= \vec \beta^\top \vec x_{it} + \vec\gamma^\top\vec m_t + \alpha_t + b_tu_{it} \\ \begin{pmatrix}\alpha_t \\ b_t \end{pmatrix} &= \begin{pmatrix}\theta_1 & 0 \\ 0 & \theta_2 \end{pmatrix} \begin{pmatrix}\alpha_{t-1} \\ b_{t-1} \end{pmatrix} + \vec\epsilon_t \\ \vec\epsilon_t & \sim N\left(\vec 0, \begin{pmatrix} \sigma_1^2 & \rho\sigma_1\sigma_2 \\ \rho\sigma_1\sigma_2 & \sigma_2^2 \end{pmatrix}\right) \end{aligned}\]
\[\begin{aligned} \color{gray}{\vec z_{it}} & \color{gray}= \color{gray}{(\vec x_{it}^\top, \vec m_t^\top, u_{it}, \alpha_t, b_t)^\top} \\ \color{gray}{g(P(Y_{it} = 1 \mid \vec z_{it}))} & \color{gray}= \color{gray}{\vec \beta^\top \vec x_{it} + \vec\gamma^\top\vec m_t} + \alpha_t + b_tu_{it} \\ \begin{pmatrix}\alpha_t \\ b_t \end{pmatrix} &= \begin{pmatrix}\theta_1 & 0 \\ 0 & \theta_2 \end{pmatrix} \begin{pmatrix}\alpha_{t-1} \\ b_{t-1} \end{pmatrix} + \vec\epsilon_t \\ \vec\epsilon_t & \sim N\left(\vec 0, \begin{pmatrix} \sigma_1^2 & \rho\sigma_1\sigma_2 \\ \rho\sigma_1\sigma_2 & \sigma_2^2 \end{pmatrix}\right) \end{aligned}\]
The figures in the parentheses are Wald \(\chi^2\) statistics. \(\mathcal{M_4}\): model with non-linear effects, an interaction, and a random intercept and \(\mathcal{M_5}\): same as \(\mathcal{M_4}\) with a random relative market size slope.
The figures in the parentheses are Wald \(\chi^2\) statistics. \(\mathcal{M_4}\): model with non-linear effects, an interaction, and a random intercept and \(\mathcal{M_5}\): same as \(\mathcal{M_4}\) with a random relative market size slope.
The figures in the parentheses are Wald \(\chi^2\) statistics. \(\mathcal{M_4}\): model with non-linear effects, an interaction, and a random intercept and \(\mathcal{M_5}\): same as \(\mathcal{M_4}\) with a random relative market size slope.
Blue: lowest, black: highest. ◇: model as in Duffie, Saita, and Wang (2007), ▽: + covariates and non-linear effects, ▲: + random intercept, and ◆: + random size slope.
Blue: lowest, black: highest. ◇: model as in Duffie, Saita, and Wang (2007), ▽: + covariates and non-linear effects, ▲: + random intercept, and ◆: + random size slope.
Bars: 90% prediction interval. ○: realized rate, ◇: model as in Duffie, Saita, and Wang (2007), ▽: + covariates and non-linear effects, ▲: + random intercept, and ◆: + random size slope.
Bars: 90% prediction interval. ○: realized rate, ◇: model as in Duffie, Saita, and Wang (2007), ▽: + covariates and non-linear effects, ▲: + random intercept, and ◆: + random size slope.
Predicted partial effect in the final model.
Estimated marginal effect from a generalized additive model.
Slides are at rpubs.com/boennecd/PhD-defence.
Markdown is at github.com/boennecd/Talks.
Most of the code is at github.com/boennecd.
References on next slide.
Alfaro, Esteban, Noelia García, Matías Gámez, and David Elizondo. 2008. “Bankruptcy Forecasting: An Empirical Comparison of Adaboost and Neural Networks.” Decision Support Systems 45 (1): 110–22.
Berg, Daniel. 2007. “Bankruptcy Prediction by Generalized Additive Models.” Applied Stochastic Models in Business and Industry 23 (2). John Wiley & Sons, Ltd.: 129–43.
Bharath, Sreedhar T., and Tyler Shumway. 2008. “Forecasting Default with the Merton Distance to Default Model.” The Review of Financial Studies 21 (3): 1339–69. doi:10.1093/rfs/hhn044.
Briers, Mark, Arnaud Doucet, and Simon Maskell. 2009. “Smoothing Algorithms for State–Space Models.” Annals of the Institute of Statistical Mathematics 62 (1): 61. doi:10.1007/s10463-009-0236-2.
Campbell, Jens, John Y. And Hilscher, and Jan Szilagyi. 2008. “In Search of Distress Risk.” The Journal of Finance 63 (6). Blackwell Publishing Inc: 2899–2939.
Cappe, O., and E. Moulines. 2005. “Recursive Computation of the Score and Observed Information Matrix in Hidden Markov Models.” In IEEE/Sp 13th Workshop on Statistical Signal Processing, 2005, 703–8. doi:10.1109/SSP.2005.1628685.
Chava, Sudheer, and Robert A. Jarrow. 2004. “Bankruptcy Prediction with Industry Effects.” Review of Finance 8 (4): 537–69.
Chen, Peimin, and Chunchi Wu. 2014. “Default Prediction with Dynamic Sectoral and Macroeconomic Frailties.” Journal of Banking & Finance 40: 211–26. doi:https://doi.org/10.1016/j.jbankfin.2013.11.036.
Duffie, Darrell, Andreas Eckner, Guillaume Horel, and Leandro Saita. 2009. “Frailty Correlated Default.” The Journal of Finance 64 (5). Blackwell Publishing Inc: 2089–2123.
Duffie, Darrell, Leandro Saita, and Ke Wang. 2007. “Multi-Period Corporate Default Prediction with Stochastic Covariates.” Journal of Financial Economics 83 (3): 635–65. doi:https://doi.org/10.1016/j.jfineco.2005.10.011.
Fahrmeir, L. 1992. “Posterior Mode Estimation by Extended Kalman Filtering for Multivariate Dynamic Generalized Linear Models.” Journal of the American Statistical Association 87 (418): 501–9. doi:10.1080/01621459.1992.10475232.
———. 1994. “Dynamic Modelling and Penalized Likelihood Estimation for Discrete Time Survival Data.” Biometrika 81 (2). [Oxford University Press, Biometrika Trust]: 317–30. http://www.jstor.org/stable/2336962.
Fearnhead, Paul, David Wyncoll, and Jonathan Tawn. 2010. “A sequential smoothing algorithm with linear computational cost.” Biometrika 97 (2): 447–64. doi:10.1093/biomet/asq013.
Filipe, Sara Ferreira, Theoharry Grammatikos, and Dimitra Michala. 2016. “Forecasting Distress in European Sme Portfolios.” Journal of Banking & Finance 64: 112–35. doi:https://doi.org/10.1016/j.jbankfin.2015.12.007.
Jensen, Thais Lærkholm, David Lando, and Mamdouh Medhat. 2017. “Cyclicality and Firm Size in Private Firm Defaults.” International Journal of Central Banking 13 (4). Association of the International Journal of Central Banking: 97–145.
Jones, Stewart, David Johnstone, and Roy Wilson. 2017. “Predicting Corporate Bankruptcy: An Evaluation of Alternative Statistical Frameworks.” Journal of Business Finance & Accounting 44 (1-2): 3–34.
Julierm, Simon J., and Jeffrey K. Uhlmann. 1997. New Extension of the Kalman Filter to Nonlinear Systems. Vol. 3068. Orlando, FL, United States. doi:10.1117/12.280797.
Kim, Myoung-Jong, and Dae-Ki Kang. 2010. “Ensemble with Neural Networks for Bankruptcy Prediction.” Expert Systems with Applications 37 (4): 3373–9.
Koopman, Siem Jan, André Lucas, and Bernd Schwaab. 2011. “Modeling Frailty-Correlated Defaults Using Many Macroeconomic Covariates.” Journal of Econometrics 162 (2): 312–25.
———. 2012. “Dynamic Factor Models with Macro, Frailty, and Industry Effects for U.s. Default Counts: The Credit Crisis of 2008.” Journal of Business & Economic Statistics 30 (4). Taylor & Francis: 521–32. doi:10.1080/07350015.2012.700859.
Lando, David, and Mads Stenbo Nielsen. 2010. “Correlation in Corporate Defaults: Contagion or Conditional Independence?” Journal of Financial Intermediation 19 (3): 355–72. doi:https://doi.org/10.1016/j.jfi.2010.03.002.
Lando, David, Mamdouh Medhat, Mads Stenbo Nielsen, and Søren Feodor Nielsen. 2013. “Additive Intensity Regression Models in Corporate Default Analysis.” Journal of Financial Econometrics 11 (3): 443–85.
Lin, Ming T., Junni L. Zhang, Qiansheng Cheng, and Rong Chen. 2005. “Independent Particle Filters.” Journal of the American Statistical Association 100 (472). [American Statistical Association, Taylor & Francis, Ltd.]: 1412–21. http://www.jstor.org/stable/27590681.
Min, Jae H., and Young-Chan Lee. 2005. “Bankruptcy Prediction Using Support Vector Machine with Optimal Choice of Kernel Function Parameters.” Expert Systems with Applications 28 (4): 603–14.
Pitt, Michael K., and Neil Shephard. 1999. “Filtering via Simulation: Auxiliary Particle Filters.” Journal of the American Statistical Association 94 (446). [American Statistical Association, Taylor & Francis, Ltd.]: 590–99. http://www.jstor.org/stable/2670179.
Poyiadjis, George, Arnaud Doucet, and Sumeetpal S. Singh. 2011. “Particle Approximations of the Score and Observed Information Matrix in State Space Models with Application to Parameter Estimation.” Biometrika 98 (1). Biometrika Trust: 65–80. http://www.jstor.org/stable/29777165.
Qi, Min, Xiaofei Zhang, and Xinlei Zhao. 2014. “Unobserved Systematic Risk Factor and Default Prediction.” Journal of Banking & Finance 49: 216–27. doi:https://doi.org/10.1016/j.jbankfin.2014.09.009.
Schwaab, Bernd, Siem Jan Koopman, and André Lucas. 2017. “Global Credit Risk: World, Country and Industry Factors.” Journal of Applied Econometrics 32 (2): 296–317. doi:10.1002/jae.2521.
Shumway, Tyler. 2001. “Forecasting Bankruptcy More Accurately: A Simple Hazard Model.” The Journal of Business 74 (1). The University of Chicago Press: 101–24.
Wan, E. A., and R. Van Der Merwe. 2000. “The Unscented Kalman Filter for Nonlinear Estimation.” In Proceedings of the Ieee 2000 Adaptive Systems for Signal Processing, Communications, and Control Symposium (Cat. No.00ex373), 153–58. doi:10.1109/ASSPCC.2000.882463.