A Multiple Linear Regression-Based Approach to Predict Student Performance

Autor: Ouafae El Aissaoui, Lahcen Oughdir, Yasser El Alami El Madani, Ahmed Dakkak, Youssouf El Allioui
Rok vydání: 2020
Předmět:
Zdroj: Advances in Intelligent Systems and Computing ISBN: 9783030366520
DOI: 10.1007/978-3-030-36653-7_2
Popis: Predicting students’ academic outcome is useful for any educational institution that aims to ameliorate students’ performance. Based on the resulted predictions, educators can provide support to students at risk of failure. Data mining and machine learning techniques were widely used to predict students’ performance. This process called Educational data mining. In this work, we have proposed a methodology to build a student’ performance prediction model using a supervised machine learning technique which is the multiple linear regression (MLR). Our methodology consists of three major steps, the first step aims to analyze and preprocess the students’ attributes/variables using a set of statistical analysis methods, and then the second step consists in selecting the most important variables using different methods. The third step aims to construct different MLR models based on the selected variables and compare their performance using the k-fold cross-validation technique. The obtained results show that the model built using the variables selected from the Multivariate Adaptive Regression Splines method (MARS), outperforms the other constructed models.
Databáze: OpenAIRE