Novelty detection with application to data streams
Autor: | João Gama, André Ponce de Leon F. de Carvalho, Eduardo J. Spinosa |
---|---|
Rok vydání: | 2009 |
Předmět: |
Data stream mining
Computer science business.industry k-means clustering Novelty Machine learning computer.software_genre Novelty detection Theoretical Computer Science Artificial Intelligence Outlier Unsupervised learning Computer Vision and Pattern Recognition Artificial intelligence Data mining Cluster analysis Representation (mathematics) business computer |
Zdroj: | Intelligent Data Analysis. 13:405-422 |
ISSN: | 1571-4128 1088-467X |
DOI: | 10.3233/ida-2009-0373 |
Popis: | This paper presents and evaluates an approach to novelty detection that addresses it as the problem of identifying novel concepts in a continuous learning scenario, as an extension to a single-class classification problem. OLINDDA, an OnLIne Novelty and Drift Detection Algorithm that implements this approach, uses efficient standard clustering algorithms to continuously generate candidate clusters among examples that were not explained by the current known concepts. Clusters complying with a validation criterion that takes cohesiveness and representativeness into account are initially identified as concepts. By merging similar concepts, OLINDDA may enhance the representation of some concepts as it advances toward its final goal of describing novel emerging concepts in an unsupervised way. The proposed approach is experimentally evaluated by the use of several measures taken throughout the learning process. Results show that it is capable of identifying novel concepts that are pure and correspond to real classes, disregarding unrepresentative clusters and outliers. |
Databáze: | OpenAIRE |
Externí odkaz: |