Automatic Exam Correction Framework (AECF) for the MCQs, Essays, and Equations Matching

Autor:	Hossam Magdy Balaha, Mahmoud M. Saafan
Rok vydání:	2021
Předmět:	General Computer Science Matching (graph theory) document embedding Computer science 02 engineering and technology Semantics computer.software_genre MCQ matching Semantic similarity Similarity (network science) expression trees 0202 electrical engineering electronic engineering information engineering General Materials Science Word2vec Binary expression tree computer.programming_language Automatic exam correction business.industry General Engineering 020206 networking & telecommunications Python (programming language) word embedding Tokenization (data security) 020201 artificial intelligence & image processing lcsh:Electrical engineering. Electronics. Nuclear engineering Artificial intelligence business lcsh:TK1-9971 computer Sentence Natural language processing
Zdroj:	IEEE Access, Vol 9, Pp 32368-32389 (2021)
ISSN:	2169-3536
DOI:	10.1109/access.2021.3060940
Popis:	Automatic grading requires the adaption of the latest technologies. It has become essential especially when most of the courses became online courses (MOOCs). The objectives of the current work are (1) Reviewing the literature on the text semantic similarity and automatic exam correction systems, (2) Proposing an automatic exam correction framework (HMB-AECF) for MCQs, essays, and equations that is abstracted into five layers, (3) Suggesting equations similarity checker algorithm named “HMB-MMS-EMA”, (4) Presenting an expression matching dataset named “HMB-EMD-v1”, (5) Comparing the different approaches to convert textual data into numerical data (Word2Vec, FastText, Glove, and Universal Sentence Encoder (USE)) using three well-known Python packages (Gensim, SpaCy, and NLTK), and (6) Comparing the proposed equations similarity checker algorithm (HMB-MMS-EMA) with a Python package (SymPy) on the proposed dataset (HMB-EMD-v1). Eight experiments were performed on the Quora Questions Pairs and the UNT Computer Science Short Answer datasets. The best-achieved highest accuracy in the first four experiments was 77.95% without fine-tuning the pre-trained models by the USE. The best-achieved lowest root mean square error (RMSE) in the second four experiments was 1.09 without fine-tuning the used pre-trained models by the USE. The proposed equations similarity checker algorithm (HMB-MMS-EMA) reported 100% accuracy over the SymPy Python package which reported 71.33% only on “HMB-EMD-v1”.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e54abfbd5be3e846c12dde2348a2cc8d https://doi.org/10.1109/access.2021.3060940 Zobrazit plný text záznamu