A critical analysis of COVID-19 research literature: Text mining approach
Autor: | Ferhat D. Zengul, James H. Willig, Michael J. Mugavero, James J. Cimino, Dursun Delen, Kierstin Cates Kennedy, Nurettin Oner, Bunyamin Ozaydin, Ayse G. Zengul |
---|---|
Přispěvatelé: | Delen, Dursun |
Jazyk: | angličtina |
Rok vydání: | 2021 |
Předmět: |
Topic model
medicine.medical_specialty Medical education Text mining Coronavirus disease 2019 (COVID-19) Mechanism (biology) business.industry Text Mining Topic Modeling Natural language processing Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) Public health SARS COV-2 COVID-19 Article Topic modeling Categorization medicine Portfolio Psychology business Natural Language Processing |
Zdroj: | Intelligence-Based Medicine |
Popis: | Objective Among the stakeholders of COVID-19 research, clinicians particularly experience difficulty keeping up with the deluge of SARS-CoV-2 literature while performing their much needed clinical duties. By revealing major topics, this study proposes a text-mining approach as an alternative to navigating large volumes of COVID-19 literature. Materials and methods We obtained 85,268 references from the NIH COVID-19 Portfolio as of November 21. After the exclusion based on inadequate abstracts, 65,262 articles remained in the final corpus. We utilized natural language processing to curate and generate the term list. We applied topic modeling analyses and multiple correspondence analyses to reveal the major topics and the associations among topics, journal countries, and publication sources. Results In our text mining analyses of NIH's COVID-19 Portfolio, we discovered two sets of eleven major research topics by analyzing abstracts and titles of the articles separately. The eleven major areas of COVID-19 research based on abstracts included the following topics: 1) Public Health, 2) Patient Care & Outcomes, 3) Epidemiologic Modeling, 4) Diagnosis and Complications, 5) Mechanism of Disease, 6) Health System Response, 7) Pandemic Control, 8) Protection/Prevention, 9) Mental/Behavioral Health, 10) Detection/Testing, 11) Treatment Options. Further analyses revealed that five (2,3,4,5, and 9) of the eleven abstract-based topics showed a significant correlation (ranked from moderate to weak) with title-based topics. Conclusion By offering up the more dynamic, scalable, and responsive categorization of published literature, our study provides valuable insights to the stakeholders of COVID-19 research, particularly clinicians. Graphical abstract Image 1 |
Databáze: | OpenAIRE |
Externí odkaz: |