Automatic Generation of Regular Expressions for Extracting Attribute Values of Medical Products
Author: | Tomasz Łukaszuk, Mariusz Ferenc |
---|---|
Publication year: | 2018 |
Subject: |
Computer science
business.industry; 05 social sciences; Pattern recognition; 06 humanities and the arts; 0603 philosophy ethics and religion; 050105 experimental psychology; Philosophy; AZ20-999; 060302 philosophy; History of scholarship and learning. The humanities; 0501 psychology and cognitive sciences; Regular expression; Artificial intelligence; business |
Source: | Studies in Logic, Grammar and Rhetoric, Vol 56, Iss 1, Pp 193-204 (2018) |
ISSN: | 2199-6059 0860-150X |
DOI: | 10.2478/slgr-2018-0049 |
Description: | Professional companies operating on the medical services market hold data from a huge number of transactional documents. This allows them to collect and process, among other things, information about medical products. Organized data is more valuable. This paper considers how to support the process of organizing that information, with the goal of extracting the values of medical product attributes from brief descriptions in transactional documents. This helps to build a structured product specification and makes it possible to compare products. For this purpose, an approach based on regular expressions and their generation with a genetic algorithm is proposed. The results presented in the paper show the great potential of the method. |
Database: | OpenAIRE |
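
The description above names the technique (regular expressions generated by a genetic algorithm) but gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' method. The example descriptions, the token set, the similarity-based fitness function, and all function names are assumptions made for illustration.

```python
import difflib
import random
import re

# Hypothetical training data: (brief product description, expected attribute value).
EXAMPLES = [
    ("Paracetamol tabl. 500 mg x 20", "500 mg"),
    ("Ibuprofen caps 200 mg 10 pcs", "200 mg"),
    ("Aspirin 75 mg tabl. 30", "75 mg"),
]

# Building blocks of a candidate pattern (an assumption of this sketch).
# Any concatenation of these tokens is a syntactically valid regex.
TOKENS = [r"\d", r"\d+", r"\s", r"\s?", "mg", "m", "g", r"[a-z]+", r"\."]
MAX_GENES = 6


def to_pattern(genes):
    """Join a list of tokens (the genome) into one regex string."""
    return "".join(genes)


def extract(pattern, text):
    """Return the first match of pattern in text, or None."""
    try:
        match = re.search(pattern, text)
    except re.error:
        return None
    return match.group(0) if match else None


def fitness(genes, examples=EXAMPLES):
    """Sum of string similarities between extracted and expected values."""
    pattern = to_pattern(genes)
    score = 0.0
    for text, expected in examples:
        got = extract(pattern, text)
        if got is not None:
            score += difflib.SequenceMatcher(None, got, expected).ratio()
    return score


def mutate(genes, rng):
    """Replace, insert, or delete one token at a random position."""
    genes = list(genes)
    op = rng.choice(["replace", "insert", "delete"])
    i = rng.randrange(len(genes))
    if op == "replace":
        genes[i] = rng.choice(TOKENS)
    elif op == "insert" and len(genes) < MAX_GENES:
        genes.insert(i, rng.choice(TOKENS))
    elif op == "delete" and len(genes) > 1:
        del genes[i]
    return genes


def crossover(a, b, rng):
    """Combine a prefix of one parent with a suffix of the other."""
    child = a[: rng.randrange(len(a) + 1)] + b[rng.randrange(len(b) + 1):]
    return child[:MAX_GENES] or [rng.choice(TOKENS)]


def evolve(generations=200, pop_size=40, seed=0):
    """Evolve a pattern; the best quarter of each generation survives."""
    rng = random.Random(seed)
    population = [
        [rng.choice(TOKENS) for _ in range(rng.randint(1, 4))]
        for _ in range(pop_size)
    ]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        if fitness(population[0]) == len(EXAMPLES):
            break  # every example is extracted exactly
        elite = population[: pop_size // 4]
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            children.append(mutate(crossover(a, b, rng), rng))
        population = elite + children
    return to_pattern(max(population, key=fitness))
```

Calling `evolve()` returns the best pattern found for the hypothetical examples, which can then be applied to unseen descriptions with `extract`. The graded fitness is a design choice of this sketch: scoring partial matches gives the search a gradient to follow, whereas counting only exact extractions would reduce it to near-random search.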