Forecasting COVID-19 Confirmed Cases Using Time Series Analysis

Authors

  • Akram Mohammed Radwan University College of Applied Sciences

Keywords:

COVID-19, predictive analytics, machine learning, regression, time series

Abstract

The novel coronavirus (COVID-19) pandemic is a major global health threat that is spreading very fast around the world. In the current study, we present a new forecasting model to estimate the number of confirmed cases of COVID-19 in the next two weeks based on the previously confirmed cases recorded for 62 countries around the world. The cumulative cases of these countries represent about 95% of the total global up to the date of data gathering. Seven regression models have been used for three rounds of predictions based on the data collected between February 21, 2020 and December 29, 2020. A number of different time series features have generated using feature-engineering methods to convert a time series forecast into a supervised learning problem and then build regression models. The performance of the models was evaluated using root mean squared log error, root mean squared error, mean absolute error, mean absolute percentage error, coefficient of determination and running time. The findings show a good performance and can reduce the error about 72% with a high coefficient of R2 = 0.990. In particular, XGB and Random Forest models have demonstrated their efficiency over other models.

Author Biography

Akram Mohammed Radwan, University College of Applied Sciences

Assistant Professor/ University College of Applied Sciences/ Palestine

References

Oliveira, T. P., Moral, R. A., "Global Short-Term Forecasting of Covid-19 Cases". Scientific Reports, vol. 11, pp. 1–22, 2021.

Ahmad, A., et al. "The Number of Confirmed Cases of Covid-19 by using Machine Learning: Methods and Challenges". Archives of Computational Methods in Engineering, vol 28, pp. 2645–2653, 2020.

Vytla, V., et al. “Mathematical models for predicting COVID-19 pandemic: a review”. Journal of Physics: Conference Series, vol. 1797, 2021.

Rath S., Tripathy A., Tripathy A.R., "Prediction of new active cases of coronavirus disease (COVID-19) pandemic using multiple linear regression model". Diabetes Metab Syndr Clin Res Rev. vol. 14, no. 5, pp. 1467–1474, 2020.

Hernandez-Matamoros, A., Fujita, H., Hayashi, T., Perez-Meana, H. "Forecasting of COVID19 per regions using ARIMA models and polynomial functions". Applied soft computing. vol. 96, 2020.

Chowdhury, A.A., Hasan, K.T., Hoque, K.K.S., "Analysis and Prediction of COVID-19 Pandemic in Bangladesh by Using ANFIS and LSTM Network". Cognitive Computation, vol. 13, pp. 761–770, 2021.

Feng S., et al. "Prediction of the COVID-19 epidemic trends based on SEIR and AI models". PLoSONE, vol. 16, no. 1, 2021

Hassanat, A.B., et al. "A Simulation Model for Forecasting COVID-19 Pandemic Spread: Analytical Results Based on the Current Saudi COVID-19 Data". Sustainability, vol. 13, 4888, 2021.

Al-qaness, M.A.A. et al. "Optimization Method for Forecasting Confirmed Cases of COVID-19 in China". Journal of Clinical Medicine, vol. 9, no. 3, 674, 2020.

Samson, T. K., Ogunlaran, O. M., Raimi, O. M., "A Predictive Model for Confirmed Cases of COVID-19 in Nigeria". European Journal of Applied Sciences, vol. 8, no. 4, pp. 1–10, 2020.

Ribeiro, M.H., et al. "Short-term forecasting COVID-19 cumulative confirmed cases: Perspectives for Brazil". Chaos, Solitons and Fractals, vol. 135, pp. 1-10, 2020.

Breiman L., "Random forests". Machine Learning, vol. 45, no. 1, pp. 5-32, 2001.

Gumaei, A., et al. "Prediction of COVID-19 confirmed cases using gradient boosting regression method". Computers, Materials and Continua, vol. 66, pp. 315-329, 2021.

Chen, T., Guestrin, C. "Xgboost: A scalable tree boosting system". In Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16, New York, NY, USA. ACM, pp. 785–794, 2016.

Ke G., et al. "LightGBM: A Highly Efficient Gradient Boosting Decision Tree". Advances in Neural Information Processing Systems, vol. 30, pp. 3149–3157, 2017.

Drucker H , Burges CJC , Kaufman L. and Smola AJ , Vapnik V., "Support Vector Regression Machines". In Mozer MC, Jordan MI, Petsche T, editors. Advances in neural information processing systems 9. MIT Press, pp.155–61, 1997.

Ribeiro, M.H.D.M., Coelho, L.S., "Ensemble approach based on bagging, boosting and stacking for short-term prediction in agribusiness time series". Applied soft computing, vol. 86, 2020.

Downloads

Published

2023-01-16

How to Cite

Radwan, A. M. (2023). Forecasting COVID-19 Confirmed Cases Using Time Series Analysis. Palestinian Journal of Technology and Applied Sciences (PJTAS), (6). Retrieved from https://journals.qou.edu/index.php/PJTAS/article/view/4002

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.