Cluster Based Term Weighting Model for Web Document Clustering
Autor: | M. Hanumanthappa, M. Mamatha, B. R. Prakash |
---|---|
Rok vydání: | 2014 |
Předmět: | |
Zdroj: | Advances in Intelligent Systems and Computing ISBN: 9788132217671 SocProS (2) |
Popis: | The term weight is based on the frequency with which the term appears in that document. The term weighting scheme measures the importance of a term with respect to a document and a collection. A term with higher weight is more important than a term with lower weight. A document ranking model uses these term weights to find the rank of a document in a collection. We propose a cluster-based term weighting models based on the TF-IDF model. This term weighting model update the inter-cluster and intra-cluster frequency components uses the generated clusters as a reference in improving the retrieved relevant documents. These inter cluster and intra-cluster frequency components are used for weighting the importance of a term in addition to the term and document frequency components. |
Databáze: | OpenAIRE |
Externí odkaz: |