Addressing class-imbalance in multi-label learning via two-stage multi-label hypernetwork
Autor: | Chong Ho Lee, Kai Wei Sun |
---|---|
Rok vydání: | 2017 |
Předmět: |
Computer science
business.industry Process (engineering) Active learning (machine learning) Cognitive Neuroscience Feature vector Stability (learning theory) 02 engineering and technology Semi-supervised learning Machine learning computer.software_genre Computer Science Applications Data set Range (mathematics) ComputingMethodologies_PATTERNRECOGNITION Empirical research Artificial Intelligence 020204 information systems 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Artificial intelligence business computer |
Zdroj: | Neurocomputing. 266:375-389 |
ISSN: | 0925-2312 |
DOI: | 10.1016/j.neucom.2017.05.049 |
Popis: | Multi-label learning is concerned with learning from data examples that are represented by a single feature vector while associated with multiple labels simultaneously. Existing multi-label learning approaches mainly focus on exploiting label correlations to facilitate the learning process. However, an intrinsic characteristic of multi-label learning, i.e., class-imbalance has not been well studied. In this paper, we propose a two-stage multi-label hypernetwork (TSMLHN) to exploit label correlations and to make use of them to address the class-imbalance problem in multi-label learning. In TSMLHN, labels of a multi-label data set are divided into two groups, i.e., imbalanced labels and common labels based on their imbalance ratios. In the first stage of TSMLHN, we train a multi-label hypernetwork (MLHN) which generates basic predictions for all labels. In the second stage of TSMLHN, we train TSMLHN based on the predictions obtained from MLHN and utilize the correlations between common labels and imbalanced labels to improve the learning performance of imbalanced labels. Our proposed TSMLHN is conceptually simple, yet it is effective in addressing the class-imbalance problem in multi-label learning. Empirical studies on a broad range of multi-label data sets demonstrate that TSMLHN achieves competitive performance against state-of-the-art multi-label learning algorithms. |
Databáze: | OpenAIRE |
Externí odkaz: |