Word Sense Induction Using Word Sketches
Autor: | Pavel Rychlý, Vojtěch Kovář, Miloš Jakubíček, Ondřej Herman |
---|---|
Rok vydání: | 2019 |
Předmět: |
Word embedding
Collocation Computer science business.industry 010501 environmental sciences computer.software_genre 01 natural sciences 010104 statistics & probability strategies tools standards for lexicographic resources (objective 3) WP4 Word-sense induction Word sketch Artificial intelligence 0101 mathematics Cluster analysis business computer Word (computer architecture) Natural language processing 0105 earth and related environmental sciences |
Zdroj: | Statistical Language and Speech Processing ISBN: 9783030313715 SLSP |
Popis: | We present three methods for word sense induction based on Word Sketches. The methods are being developed a part of an semiautomatic dictionary creation system, providing annotators with the summarized semantic behavior of a word. Two of the methods are based on the assumption of a word having a single sense per collocation. We cluster the Word Sketch based collocations by their co-occurrence behavior in the first method. The second method clusters the collocations using word embedding model. The last method is based on clustering of Word Sketch thesauri. We evaluate the methods and demonstrate their behavior on representative words. |
Databáze: | OpenAIRE |
Externí odkaz: |