Use of the Normalized Word Vector Approach in Document Classification for an LKMC

Authors: Philip S. Nitse, Albert S. M. Tay, Robert Williams, Kevin R. Parker
Publication year: 2008
Subject:
Source: Issues in Informing Science and Information Technology. 5:513-524
ISSN: 1547-5867
1547-5840
DOI: 10.28945/1025
Description: To realize the objective of expanding library services to provide knowledge management support for small businesses, a series of requirements must be met. This particular phase of a larger research project focuses on one of those requirements: the need for a document classification system that can rapidly determine the content of digital documents. Document classification techniques are examined to assess the available alternatives for the realization of Library Knowledge Management Centers (LKMCs). After evaluating prominent techniques, the authors opted to investigate a less well-known method, the Normalized Word Vector (NWV) approach, which has been used successfully in classifying highly unstructured documents, i.e., student essays. The authors propose utilizing the NWV approach for LKMC automatic document classification, with the goal of developing a system whereby unfamiliar documents can be quickly classified into existing topic categories. This conceptual paper outlines an approach to test NWV’s suitability in this area.
Database: OpenAIRE