TCBPLK: A New Method of Text Categorization

Author: Jian-Suo Xu
Year of publication: 2007
Subject:
Source: 2007 International Conference on Machine Learning and Cybernetics.
Description: This paper presents a new text categorization method, called TCBPLK, that is based on P-L theory and the Kohonen network. When a Kohonen network is applied to text categorization, it suffers from slow training. The defect is more pronounced for high-dimensional text vectors, to the point that a categorization result may not be obtained at all. The new method builds a vector space model of term weights using P-L theory, which strengthens the contribution of words according to their effect on categorization and reduces the vector dimension by eliminating redundant features. Experimental results confirm that the TCBPLK method reduces the vector dimension and improves the generalization and precision of text categorization.
Database: OpenAIRE