Popis: |
The work of most information systems involves the processing of data, its accumulation during operation and subsequent analysis. However, the analysis of such a large amount of information by a person is impossible without its preliminary automatic processing. For this purpose, Data Mining is used, which includes descriptive and predictive modeling. The statistical classification is one of the most understandable data analysis technologies for humans and relates to predictive modeling. This task consists in dividing the set of observations into classes based on their formal description. One of the methods for solving the classification problem is logistic regression, while scoring is a common area of application. This article discusses the application of scoring to the problem of assessing the probability of students' expulsion from the University based on data on their attendance and academic performance. The solution of this problem will allow curators of groups, directions and other interested parties to identify the tendency to expulsion in time, identify a risk group among students and take early measures to prevent the event predicted by the built model from becoming a fact. The built scoring model is subject to publication as a web service for further use in the software package for supporting the work of a University teacher. In this case, the model input receives aggregated characteristics obtained from accumulated data on student performance and attendance by the software package, which results in an integrated indicator of the probability of an event, namely, deductions. As a result of building a scoring model, a subsequent assessment of its quality is performed. |