Forecasting COVID-19 Confirmed Cases Using Time Series Analysis
Keywords:
COVID-19, predictive analytics, machine learning, regression, time seriesAbstract
The novel coronavirus (COVID-19) pandemic is a major global health threat that is spreading very fast around the world. In the current study, we present a new forecasting model to estimate the number of confirmed cases of COVID-19 in the next two weeks based on the previously confirmed cases recorded for 62 countries around the world. The cumulative cases of these countries represent about 95% of the total global up to the date of data gathering. Seven regression models have been used for three rounds of predictions based on the data collected between February 21, 2020 and December 29, 2020. A number of different time series features have generated using feature-engineering methods to convert a time series forecast into a supervised learning problem and then build regression models. The performance of the models was evaluated using root mean squared log error, root mean squared error, mean absolute error, mean absolute percentage error, coefficient of determination and running time. The findings show a good performance and can reduce the error about 72% with a high coefficient of R2 = 0.990. In particular, XGB and Random Forest models have demonstrated their efficiency over other models.
References
Oliveira, T. P., Moral, R. A., "Global Short-Term Forecasting of Covid-19 Cases". Scientific Reports, vol. 11, pp. 1–22, 2021.
Ahmad, A., et al. "The Number of Confirmed Cases of Covid-19 by using Machine Learning: Methods and Challenges". Archives of Computational Methods in Engineering, vol 28, pp. 2645–2653, 2020.
Vytla, V., et al. “Mathematical models for predicting COVID-19 pandemic: a review”. Journal of Physics: Conference Series, vol. 1797, 2021.
Rath S., Tripathy A., Tripathy A.R., "Prediction of new active cases of coronavirus disease (COVID-19) pandemic using multiple linear regression model". Diabetes Metab Syndr Clin Res Rev. vol. 14, no. 5, pp. 1467–1474, 2020.
Hernandez-Matamoros, A., Fujita, H., Hayashi, T., Perez-Meana, H. "Forecasting of COVID19 per regions using ARIMA models and polynomial functions". Applied soft computing. vol. 96, 2020.
Chowdhury, A.A., Hasan, K.T., Hoque, K.K.S., "Analysis and Prediction of COVID-19 Pandemic in Bangladesh by Using ANFIS and LSTM Network". Cognitive Computation, vol. 13, pp. 761–770, 2021.
Feng S., et al. "Prediction of the COVID-19 epidemic trends based on SEIR and AI models". PLoSONE, vol. 16, no. 1, 2021
Hassanat, A.B., et al. "A Simulation Model for Forecasting COVID-19 Pandemic Spread: Analytical Results Based on the Current Saudi COVID-19 Data". Sustainability, vol. 13, 4888, 2021.
Al-qaness, M.A.A. et al. "Optimization Method for Forecasting Confirmed Cases of COVID-19 in China". Journal of Clinical Medicine, vol. 9, no. 3, 674, 2020.
Samson, T. K., Ogunlaran, O. M., Raimi, O. M., "A Predictive Model for Confirmed Cases of COVID-19 in Nigeria". European Journal of Applied Sciences, vol. 8, no. 4, pp. 1–10, 2020.
Ribeiro, M.H., et al. "Short-term forecasting COVID-19 cumulative confirmed cases: Perspectives for Brazil". Chaos, Solitons and Fractals, vol. 135, pp. 1-10, 2020.
Breiman L., "Random forests". Machine Learning, vol. 45, no. 1, pp. 5-32, 2001.
Gumaei, A., et al. "Prediction of COVID-19 confirmed cases using gradient boosting regression method". Computers, Materials and Continua, vol. 66, pp. 315-329, 2021.
Chen, T., Guestrin, C. "Xgboost: A scalable tree boosting system". In Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16, New York, NY, USA. ACM, pp. 785–794, 2016.
Ke G., et al. "LightGBM: A Highly Efficient Gradient Boosting Decision Tree". Advances in Neural Information Processing Systems, vol. 30, pp. 3149–3157, 2017.
Drucker H , Burges CJC , Kaufman L. and Smola AJ , Vapnik V., "Support Vector Regression Machines". In Mozer MC, Jordan MI, Petsche T, editors. Advances in neural information processing systems 9. MIT Press, pp.155–61, 1997.
Ribeiro, M.H.D.M., Coelho, L.S., "Ensemble approach based on bagging, boosting and stacking for short-term prediction in agribusiness time series". Applied soft computing, vol. 86, 2020.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Palestinian Journal of Technology and Applied Sciences (PJTAS)
This work is licensed under a Creative Commons Attribution 4.0 International License.
- The editorial board confirms its commitment to the intellectual property rights
- Researchers also have to commit to the intellectual property rights.
- The research copyrights and publication are owned by the Journal once the researcher is notified about the approval of the paper. The scientific materials published or approved for publishing in the Journal should not be republished unless a written acknowledgment is obtained by the Deanship of Scientific Research.
- Research papers should not be published or republished unless a written acknowledgement is obtained from the Deanship of Scientific Research.
- The researcher has the right to accredit the research to himself, and to place his name on all the copies, editions and volumes published.
- The author has the right to request the accreditation of the published papers to himself.