ConSenses: Disambiguating content word groups based on knowledge base and definition embedding
Autor: | Kai-Wen Tuan, Li-Kuang Chen, Yi-Chien Lin, Kuan-Lin Lee, Jason S. Chang |
---|---|
Rok vydání: | 2020 |
Předmět: |
Computer science
business.industry WordNet 02 engineering and technology Content word computer.software_genre Knowledge-based systems 020204 information systems Noun Similarity (psychology) 0202 electrical engineering electronic engineering information engineering Artificial intelligence Cluster analysis Set (psychology) business computer Word (computer architecture) Natural language processing |
Zdroj: | TAAI |
DOI: | 10.1109/taai51410.2020.00055 |
Popis: | We introduce a method for disambiguating a given group of semantically related words with respect to a certain sense inventory, such as WordNet or Cambridge English Dictionary. In our approach, every member word is converted into a set of senses to be disambiguated. The method involves clustering relevant senses and filtering out irrelevant senses, and determining the intended senses for each word in the group based on pairwise sense similarity. A preliminary evaluation of our method on several datasets shows that the method extends and outperforms the previous work that only deals with noun groups [1]. Our method is more generally applicable, allowing nouns, verbs, and adjectives groups, and can be used to aligning two anthologies to combine knowledge resources, as well as to generate training data for word sense disambiguation tasks. |
Databáze: | OpenAIRE |
Externí odkaz: |