Predicting at-Risk Students at Different Percentages of Course Length for Early Intervention Using Machine Learning Models

Authors:	Muhammad Adnan, Asad Habib, Jawad Ashraf, Shafaq Mussadiq, Arsalan Ali Raza, Muhammad Abid, Maryam Bashir, Sana Ullah Khan
Language:	English
Publication year:	2021
Subject:	Predictive model; earliest possible prediction; at-risk students; machine learning; feed-forward neural network; random forest; Electrical engineering. Electronics. Nuclear engineering (TK1-9971)
Source:	IEEE Access, Vol 9, Pp 7519-7539 (2021)
Document type:	article
ISSN:	2169-3536
DOI:	10.1109/ACCESS.2021.3049446
Description:	Online learning platforms such as Massive Open Online Courses (MOOCs), Virtual Learning Environments (VLEs), and Learning Management Systems (LMSs) enable thousands or even millions of students to learn according to their interests without spatial and temporal constraints. Despite their many advantages, online learning platforms face several challenges, such as students' lack of interest, high dropout rates, low engagement, students' self-regulated behavior, and the need to compel students to take responsibility for setting their own goals. In this study, we propose a predictive model that analyzes the problems faced by at-risk students so that instructors can intervene in time to persuade students to increase their study engagement and improve their study performance. The predictive model is trained and tested using various machine learning (ML) and deep learning (DL) algorithms to characterize the learning behavior of students according to their study variables. The performance of the various ML algorithms is compared using accuracy, precision, support, and F-score. The ML algorithm that gives the best results in terms of accuracy, precision, recall, support, and F-score is ultimately selected for creating the predictive model at different percentages of course length. The predictive model can help instructors identify at-risk students early in the course for timely intervention, thus avoiding student dropouts. Our results showed that students' assessment scores, engagement intensity (i.e., clickstream data), and time-dependent variables are important factors in online learning.
The experimental results revealed that the predictive model trained using Random Forest (RF) gives the best results, with averaged precision = 0.60, 0.79, 0.84, 0.88, 0.90, 0.92; averaged recall = 0.59, 0.79, 0.84, 0.88, 0.90, 0.91; averaged F-score = 0.59, 0.79, 0.84, 0.88, 0.90, 0.91; and average accuracy = 0.59, 0.79, 0.84, 0.88, 0.90, 0.91 at 0%, 20%, 40%, 60%, 80%, and 100% of course length, respectively.
Database:	Directory of Open Access Journals
External link:	https://doaj.org/article/6ec59d4cf0114dd9ad1958c425a71553
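
The description reports averaged precision, recall, and F-score alongside accuracy, but does not say how the averaging is done. The following is a minimal sketch of one common reading, macro-averaging over classes, written in plain Python. The function name `macro_metrics`, the class labels `at_risk` and `ok`, and the toy predictions are all hypothetical and are not taken from the article or its dataset.

```python
from collections import Counter


def macro_metrics(y_true, y_pred):
    """Return (precision, recall, f_score, accuracy).

    Precision, recall, and F-score are macro-averaged: computed per class,
    then averaged with equal weight per class. This averaging scheme is an
    assumption; the record does not state which one the authors used.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1      # correct prediction for this class
        else:
            fp[pred] += 1       # predicted class was wrong
            fn[truth] += 1      # true class was missed

    labels = sorted(set(y_true) | set(y_pred))
    precisions, recalls, f_scores = [], [], []
    for label in labels:
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / actual if actual else 0.0
        denom = precision + recall
        f_score = 2 * precision * recall / denom if denom else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f_scores.append(f_score)

    n = len(labels)
    accuracy = sum(tp.values()) / len(y_true)
    return sum(precisions) / n, sum(recalls) / n, sum(f_scores) / n, accuracy


# Hypothetical toy labels, purely for illustration (not data from the study).
y_true = ["at_risk", "at_risk", "ok", "ok"]
y_pred = ["at_risk", "ok", "ok", "ok"]
p, r, f, a = macro_metrics(y_true, y_pred)
print(f"precision={p:.2f} recall={r:.2f} f_score={f:.2f} accuracy={a:.2f}")
```

Under this reading, the same function would be applied to the model's predictions at each checkpoint of course length (0%, 20%, and so on) to produce one row of metrics per checkpoint. A weighted average by class support would give different numbers when classes are imbalanced.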