Popis: |
MOOC aims at delivering online courses to tens of thousands to millions of heterogeneous learners at the same time, with minimal or no charge. It provides an alternate way to disseminate quality education to the section of people who cannot reach premier institutions. It has great potential to overcome the barriers of traditional learning systems. However, there are several challenges in MOOCs such as huge drop-out rates, improper automated assessments, varied student engagement, and attention etc. Learning Analytics helps us to contain such issues. Learning analytics, with the help of Big Data Technologies, helps us to interpret humongous MOOCs data to assess progress, predict performance and identify problems. To perform analytics, we developed a workflow using Apache Spark, a scalable inmemory computing framework. The data from edX platform has been used for experiments. It contains the information of more than 2 Lakh students from 39 courses. Initially, detailed statistical analysis has been carried out to understand the learning patterns and the behavior of online learners. Later, we have developed drop-out prediction models using various machine learning algorithms such as Random Forest, Gradient Boost, and Logistic Regression. A stacked ensemble model is developed and performance comparison with baseline models is carried out. It outperformed all other models with an accuracy of 91.2%. |