Discovering Chinese Concept-In-Corpus

Autor: Jian-Chao Chen, Qi-Lun Zheng, Zhao Li
Rok vydání: 2008
Předmět:
Zdroj: 2008 International Conference on Machine Learning and Cybernetics.
Popis: Concept is the basic of knowledge. A concept consists of a connotation and an extension. The paper comes up with a concept of concept-in-corpus which is a special kind of formal concept, and presents a discovering algorithm called FCWFT (filtering concept-word based on feature-tree) which automatically mine the connotation and the extension for a Chinese concept-in-corpus from corpus in Chinese. Our work is the first one attempting to mine formal concepts from free texts in the area of natural language processing. We test the algorithm with a large scale corpus. The result is encouraging.
Databáze: OpenAIRE