Contextualized Latent Semantic Indexing: A New Approach to Automated Chinese Essay Scoring

Autor: Xu Yanyan, Ke Dengfeng, Su Kaile
Jazyk: angličtina
Rok vydání: 2017
Předmět:
Zdroj: Journal of Intelligent Systems, Vol 26, Iss 2, Pp 263-285 (2017)
Druh dokumentu: article
ISSN: 0334-1860
2191-026X
66678986
DOI: 10.1515/jisys-2015-0048
Popis: The writing part in Chinese language tests is badly in need of a mature automated essay scoring system. In this paper, we propose a new approach applied to automated Chinese essay scoring (ACES), called contextualized latent semantic indexing (CLSI), of which Genuine CLSI and Modified CLSI are two versions. The n-gram language model and the weighted finite-state transducer (WFST), two critical components, are used to extract context information in our ACES system. Not only does CLSI improve conventional latent semantic indexing (LSI), but bridges the gap between latent semantics and their context information, which is absent in LSI. Moreover, CLSI can score essays from the perspectives of language fluency and contents, and address the local overrating and underrating problems caused by LSI. Experimental results show that CLSI outperforms LSI, Regularized LSI, and latent Dirichlet allocation in many aspects, and thus, proves to be an effective approach.
Databáze: Directory of Open Access Journals