Sentence Similarity Computation based on WordNet and VerbNet
Autor: | Wafa Wali, Bilel Gargouri, Abdelmajid Ben Hamadou |
---|---|
Rok vydání: | 2018 |
Předmět: |
General Computer Science
Computer science business.industry Lexical similarity WordNet 02 engineering and technology computer.software_genre Paraphrase Semantic similarity Similarity (network science) 020204 information systems Taxonomy (general) 0202 electrical engineering electronic engineering information engineering Question answering 020201 artificial intelligence & image processing Artificial intelligence VerbNet business computer Natural language processing |
Zdroj: | Computación y Sistemas. 21 |
ISSN: | 2007-9737 1405-5546 |
DOI: | 10.13053/cys-21-4-2853 |
Popis: | Sentence similarity computing is increasingly growing in several applications, such as question answering, machine-translation, information retrieval and automatic abstracting systems. This paper firstly sums up several methods to calculate similarity between sentences which consider semantic and syntactic knowledge. Second, it presents a new method for the sentence similarity measure that aggregates, in a linear function, three components: the Lexical similarity Lexsim including the common words, the semantic similarity SemSim using the synonymy words and the syntactico-semantic similarity SynSemSim based on common semantic arguments, notably, thematic role and semantic class. Concerning the word-based semantic similarity, a measure is computed to estimate the semantic degree between words by exploiting the WordNet ”is a” taxonomy. Moreover, the semantic argument determination is based on the VerbNet database. The proposed method yielded competitive results compared to previously proposed measures and with regard to the Li’s benchmark, which shown a high correlation with human ratings. Furthermore, experiments performed on the Microsoft Paraphrase Corpus showed the best F-measure values compared to other measures for high similarity thresholds. |
Databáze: | OpenAIRE |
Externí odkaz: |