Terminology/Keyphrase Extraction for Creation of Book Indexes in Polish
Autor: | Malgorzata Marciniak, Agnieszka Mykowiecka, Piotr Rychlik |
---|---|
Rok vydání: | 2021 |
Předmět: |
Identification (information)
business.industry Computer science InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL Subject (documents) Artificial intelligence computer.software_genre business GeneralLiterature_REFERENCE(e.g. dictionaries encyclopedias glossaries) computer Natural language processing Terminology |
Zdroj: | Linking Theory and Practice of Digital Libraries ISBN: 9783030863234 TPDL |
Popis: | The paper addresses the problem of automatic identification of phrases to be included in back-of-book indexes. We analyzed books in Polish and English published with subject indexes compiled by their authors. We checked what kinds of phrases are placed in those indexes and how often they actually occur in the corresponding books. In the experiments, we use existing terminology and keyphrase extraction tools. For Polish, the first tool is better than the second one, but for English texts, the results are inconclusive. |
Databáze: | OpenAIRE |
Externí odkaz: |