An active learning method for data streams with concept drift
Autor: | Cheong Hee Park, Youngsoon Kang |
---|---|
Rok vydání: | 2016 |
Předmět: |
Concept drift
Computer science Data stream mining Active learning (machine learning) business.industry Decision tree 02 engineering and technology Semi-supervised learning computer.software_genre Machine learning Data modeling Support vector machine Data set 020204 information systems Incremental learning Active learning 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Data mining Artificial intelligence business Classifier (UML) computer |
Zdroj: | IEEE BigData |
DOI: | 10.1109/bigdata.2016.7840667 |
Popis: | In analyzing streaming data in which the underlying data distribution may change or the concept of interest may drift over time, the ability of a classifier to adapt to drifted concepts is very important to maintaining the prediction performance. However, the true class labels of data samples are often available only after some period of time or they are obtained by experts' efforts. In this paper, we develop an effective method for active learning on data streams with concept drift. The proposed method combines active learning and adaptive incremental learning. For unlabeled data samples, the degree of concept drift is estimated and used for both data selection for labeling and adaptive incremental learning of the current classifier. Experimental results on five artificial data sets and two real data sets demonstrate a competent performance of the proposed method. |
Databáze: | OpenAIRE |
Externí odkaz: |