The Automatic Clustering of Domain-Specific Chinese Documents

Autor: Shih-Chi Liu, 劉世琪
Rok vydání: 2007
Druh dokumentu: 學位論文 ; thesis
Popis: 95
In the domain of the knowledge management, enterprises are at the beginning of building and constructing document management system, the documents authors offer are not classified very effectively. This fact let user unable searching and using in effect under a large number of documents. A lot of research reveals keywords can help users to decide whether the document is useful. And gather together piles and piles of documents in accordance with its similarity, can offer a more efficient way of searching documents to users. For this reason the experiments using the Electronic Theses and Dissertations System searches the photonics documents about color filter or Liquid Crystal Display-LCD domain. We improve Kea, an algorithm for automatically extracting keyphrases from Chinese texts. Besides by analyzing the results of using Hierarchical Clustering Algorithms can assist administrators to assess the suitable ways of the categorized documents.
Databáze: Networked Digital Library of Theses & Dissertations