Enhanced Latent Semantic Analysis by considering mistyped words in automated essay scoring

Autor: Almodad Biduk Asmani, Martin Sendra, Rudy Sutrisno, Josep Harianata, Derwin Suhartono
Rok vydání: 2016
Předmět:
Zdroj: 2016 International Conference on Informatics and Computing (ICIC).
DOI: 10.1109/iac.2016.7905734
Popis: In this paper, we present an approach to consider the mistyped words which frequently occur in an essay. Not all the mistyped words are totally wrong. If they are caused by human errors, we should look further to handle it. Thus, enhanced version of Latent Semantic Analysis (LSA) is proposed. LSA is enhanced by calculating the value of mistyped words before constructing the term document matrix. Essays which are scored consists of 119 English essays of student's writing assignment while several essays as gold standard are made by experts. All of the essays have been scored manually by the human grader. Enhanced LSA gives the closeness level of 0.242 to human judgment, while LSA indicates the closeness level of 0.244 to human judgment. Unfortunately, Enhanced LSA cannot outperform GLSA which is one of current methods. The experiment result indicates that Enhanced LSA by considering mistyped words has a better closeness value to human judgment compared with LSA
Databáze: OpenAIRE