Intelligent Model for Data Analytical Study of Coronavirus COVID-19 Databases

Autor: Doaa Sami Khafaga, Faten Khalid Karim, Mohamed M. Dessouky, Mohamed A. El-Rashidy
Jazyk: angličtina
Rok vydání: 2022
Předmět:
Zdroj: Electronics; Volume 11; Issue 13; Pages: 1975
ISSN: 2079-9292
DOI: 10.3390/electronics11131975
Popis: The pandemic coronavirus COVID-19 spread around the world with deaths exceeding that of SARS. COVID-19 is believed to have been transmitted from animals, especially from bats, and the virus is transmitted from person to person over time. This paper will help countries to make decisions that encourage access to corrected values and get some indication as to whether there are other factors that affect the spread of COVID-19, via methods such as by increasing the daily test rate. This paper presents an intelligent model for analyzing data collected from the countries affected by the COVID-19 virus. It considers the total number of tests that each country has undergone, the number of international tourist arrivals in each country, the percentage of employment, the life expectancy at birth, the median age, the population density, the number of people aged 65 years or older in millions, and the sex ratio. The proposed model is based on machine learning approaches using k-Means as a clustering approach, Support Vector Machine (SVM) as a classifier, and wrapper as a feature extraction approach. It consists of three phases of pre-processing the data collected, the discovery of outlier cases, the selection of the most effective features for each of the total infected, deaths, critical and recovery cases, and the construction of prediction models. Experimental results show that the extracted features of the wrapper technique have shown that it is more capable of fitting and predicting data than the Correlation-Based Feature Selection, Correlation Attribute Evaluation, Information Gain, and Relief Attribute Evaluation techniques. The SVM classifier also achieved the highest accuracy compared to other classification algorithms for predicting total infected, fatal, critical, and recovery cases.
Databáze: OpenAIRE