References
Adi, B. (2026). Business analytics for managers: A practical guide to data-driven decision making (3rd ed.). Lagos Business School Press.
Bates, D., & Maechler, M. (2023). cluster: Cluster analysis basics and extensions (R package version 2.1.4). CRAN. https://CRAN.R-project.org/package=cluster
Box, G. E. P., & Jenkins, G. M. (1970). Time series analysis: Forecasting and control. Holden-Day.
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794. https://doi.org/10.1145/2939672.2939785
Greenwell, B., & Boehmke, B. (2020). Variable importance plots — an introduction to the vip package. The R Journal, 12(1), 343–366. https://doi.org/10.32614/RJ-2020-013
Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 24(6), 417–441. https://doi.org/10.1037/h0071325
Hyndman, R. J., & Khandakar, Y. (2008). Automatic time series forecasting: The forecast package for R. Journal of Statistical Software, 27(3), 1–22. https://doi.org/10.18637/jss.v027.i03
Hyndman, R. J., & Athanasopoulos, G. (2021). Forecasting: Principles and practice (3rd ed.). OTexts. https://otexts.com/fpp3/
Kassambara, A., & Mundt, F. (2020). factoextra: Extract and visualise the results of multivariate data analyses (R package version 1.0.7). CRAN. https://CRAN.R-project.org/package=factoextra
Kuhn, M., & Wickham, H. (2020). Tidymodels: A collection of packages for modelling and machine learning using tidyverse principles. https://www.tidymodels.org
Le, S., Josse, J., & Husson, F. (2008). FactoMineR: An R package for multivariate analysis. Journal of Statistical Software, 25(1), 1–18. https://doi.org/10.18637/jss.v025.i01
Lundberg, S. M., & Lee, S.-I. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 30, 4765–4774.
Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B., Katz, R., Himmelfarb, J., Bansal, N., & Lee, S.-I. (2020). From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence, 2(1), 56–67. https://doi.org/10.1038/s42256-019-0138-9
MacQueen, J. (1967). Some methods for classification and analysis of multivariate observations. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1, 281–297.
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, E. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830.
R Core Team. (2024). R: A language and environment for statistical computing (Version 4.4.0). R Foundation for Statistical Computing. https://www.R-project.org/
Robinson, D., & Hayes, A. (2024). broom: Convert statistical objects into tidy tibbles (R package version 1.0.5). CRAN. https://CRAN.R-project.org/package=broom
Smith, A. (2023). pmdarima: ARIMA estimators for Python (Version 2.0.4). https://alkaline-ml.com/pmdarima
Seabold, S., & Perktold, J. (2010). Statsmodels: Econometric and statistical modeling with Python. Proceedings of the 9th Python in Science Conference, 92–96. https://doi.org/10.25080/Majora-92bf1922-011
Smith, A. (2023). pmdarima: ARIMA estimators for Python (Version 2.0.4). https://alkaline-ml.com/pmdarima
Waskom, M. L. (2021). Seaborn: Statistical data visualization. Journal of Open Source Software, 6(60), 3021. https://doi.org/10.21105/joss.03021
Wickham, H. (2016). ggplot2: Elegant graphics for data analysis. Springer-Verlag. https://ggplot2.tidyverse.org
Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L. D., François, R., Grolemund, G., Hayes, A., Henry, L., Hester, J., Kuhn, M., Pedersen, T. L., Miller, E., Bache, S. M., Müller, K., Ooms, J., Robinson, D., Seidel, D. P., Spinu, V., … Yutani, H. (2019). Welcome to the tidyverse. Journal of Open Source Software, 4(43), 1686. https://doi.org/10.21105/joss.01686
Wright, M. N., & Ziegler, A. (2017). ranger: A fast implementation of random forests for high dimensional data in C++ and R. Journal of Statistical Software, 80(1), 1–17. https://doi.org/10.18637/jss.v080.i01
Zhu, H. (2021). kableExtra: Construct complex table with kable and pipe syntax (R package version 1.3.4). CRAN. https://CRAN.R-project.org/package=kableExtra
Dataset:
[Organisation Name redacted for anonymity]. (2025). Moneytor V10 investor account extract — classification and time-series datasets [Internal organisational records]. Asset Management Division, [Organisation Name redacted]. Data extracted December 2025 for academic research under internal data governance policy.