EVALD – a Pioneer Application for Automated Essay Scoring in Czech
Author: | Kateřina Rysová, Eva Hajičová, Magdaléna Rysová, Michal Novák, Jiří Mírovský |
---|---|
Publication year: | 2019 |
Subject: |
Czech; 050101 languages & linguistics; 05 social sciences; Computational linguistics. Natural language processing; 0202 electrical engineering, electronic engineering, information engineering; Mathematics education; language; 020201 artificial intelligence & image processing; 0501 psychology and cognitive sciences; 02 engineering and technology; P98-98.5; Automated essay scoring; language.human_language |
Source: | Prague Bulletin of Mathematical Linguistics, Vol 113, Iss 1, Pp 9-30 (2019) |
ISSN: | 1804-0462 |
DOI: | 10.2478/pralin-2019-0004 |
Description: | In this paper, we present the EVALD applications (Evaluator of Discourse) for automated essay scoring. EVALD is the first tool of this type for Czech. It evaluates texts written by both native and non-native speakers of Czech. We first describe the history and current state of automated essay scoring, illustrated by examples of systems for other languages, mainly English. We then focus on the methodology of creating the EVALD applications and describe the datasets used for testing as well as for the supervised training that EVALD builds on. Furthermore, we analyze in detail a sample of newly acquired language data (texts written by non-native speakers reaching the threshold level of Czech language acquisition required, e.g., for permanent residence in the Czech Republic) and focus on the linguistic differences between the available text levels. We present the feature set used by EVALD and, based on this analysis, extend it with new spelling features. Finally, we evaluate the overall performance of several variants of EVALD and analyze the collected results. |
Database: | OpenAIRE |