Similarity Computation of Chinese Question Based on Chunk
Autor: | Zheng-Tao Yu, Shi-ping Tang, Li Huang, Lei Hu, Jing-hui Deng |
---|---|
Rok vydání: | 2006 |
Předmět: |
Parsing
business.industry Computer science computer.software_genre Support vector machine Text mining Semantic similarity Similarity (network science) Rule-based machine translation Data_FILES Artificial intelligence Computational linguistics business computer Natural language Natural language processing Sentence |
Zdroj: | 2006 International Conference on Machine Learning and Cybernetics. |
DOI: | 10.1109/icmlc.2006.258809 |
Popis: | The currently similarity computation methods of Chinese sentence and their shortcomings are analyzed at first. According to the characteristic of the Chinese question sentence, Chinese question general chunk and special chunk are defined, and then a similarity computation method of Chinese question based on chunk is proposed. In this method, the semantic similarity of words is computed on the basis of HowNet. General chunk is recognized by chunk parsing theory and HMM learning method, and special chunk is retrieved with some heuristic rule or SVM learning methods, then the similarity of each chunk in the two question sentences is computed separately, then the similarity computation of question sentence is realized, which is based on chunk similarity. Finally, the experiment result of question similarity computation method shows that the method proposed in the paper gets a better performance than the others. |
Databáze: | OpenAIRE |
Externí odkaz: |