Please use this identifier to cite or link to this item:
https://biore.bio.bg.ac.rs/handle/123456789/6761
Title: | A large-scale machine learning study of sociodemographic factors contributing to COVID-19 severity | Authors: | Tumbas Z. Marko Marković, Sofija Salom, Igor Đorđević, Marko |
Keywords: | Random Forest;SARS-CoV-2;XGBoost;feature selection;mRMR;sociodemographic factors | Issue Date: | 2023 | Journal: | Frontiers in big data | Volume: | 6 | Start page: | 1038283 | Abstract: | Understanding sociodemographic factors behind COVID-19 severity relates to significant methodological difficulties, such as differences in testing policies and epidemics phase, as well as a large number of predictors that can potentially contribute to severity. To account for these difficulties, we assemble 115 predictors for more than 3,000 US counties and employ a well-defined COVID-19 severity measure derived from epidemiological dynamics modeling. We then use a number of advanced feature selection techniques from machine learning to determine which of these predictors significantly impact the disease severity. We obtain a surprisingly simple result, where only two variables are clearly and robustly selected-population density and proportion of African Americans. Possible causes behind this result are discussed. We argue that the approach may be useful whenever significant determinants of disease progression over diverse geographic regions should be selected from a large number of potentially important factors. |
URI: | https://biore.bio.bg.ac.rs/handle/123456789/6761 | ISSN: | 2624-909X | DOI: | 10.3389/fdata.2023.1038283 |
Appears in Collections: | Journal Article |
Show full item record
SCOPUSTM
Citations
3
checked on Nov 21, 2024
Page view(s)
1
checked on Nov 21, 2024
Google ScholarTM
Check
Altmetric
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.