A Prediction Approach Based on Self-Training and Deep Learning for Biological Data

Autor: Mohamed Lamine Berkane, Mahmoud Boufaida, Mohamed Nadjib Boufenara
Rok vydání: 2020
Předmět:
Zdroj: International Journal of Organizational and Collective Intelligence. 10:50-64
ISSN: 1947-9352
1947-9344
DOI: 10.4018/ijoci.2020100104
Popis: With the exponential growth of biological data, labeling this kind of data becomes difficult and costly. Although unlabeled data are comparatively more plentiful than labeled ones, most supervised learning methods are not designed to use unlabeled data. Semi-supervised learning methods are motivated by the availability of large unlabeled datasets rather than a small amount of labeled examples. However, incorporating unlabeled data into learning does not guarantee an improvement in classification performance. This paper introduces an approach based on a model of semi-supervised learning, which is the self-training with a deep learning algorithm to predict missing classes from labeled and unlabeled data. In order to assess the performance of the proposed approach, two datasets are used with four performance measures: precision, recall, F-measure, and area under the ROC curve (AUC).
Databáze: OpenAIRE