The Efficiency of Different Distance Metrics for Keyword-Based Search in Medical Documents: A Short Case Study.

Autor: VATHY-FOGARASSY, Ágnes, SZEKÉR, Szabolcs, SZOLÁR, Balázs, FOGARASSY, György
Zdroj: Studies in Health Technology & Informatics; 2020, Vol. 271, p232-239, 8p, 1 Diagram, 2 Charts, 1 Graph
Abstrakt: Background: Processing of free text written medical texts involves many difficulties arising from typographical errors, synonyms, and abbreviations occurring in the texts. Methods: In this study, the applicability of the most common string similarity measures were analyzed and compared for the keyword-based medical text search. Results: The usefulness of the similarity measures was studied in a set of medical documents containing more than 20,000 echocardiography reports. Experimental results showed that the Jaro-Winkler dissimilarity measure is the most capable measure to explore the content of the medical texts. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index