Partial Multi-View Clustering using Graph Regularized NMF
Autor: | Nishant Rai, Om D. Deshmukh, Santanu Chaudhury, Sumit Negi |
---|---|
Rok vydání: | 2016 |
Předmět: |
0209 industrial biotechnology
Fuzzy clustering Correlation clustering Constrained clustering 02 engineering and technology computer.software_genre 020901 industrial engineering & automation Data stream clustering CURE data clustering algorithm 0202 electrical engineering electronic engineering information engineering Canopy clustering algorithm Graph (abstract data type) 020201 artificial intelligence & image processing Data mining Cluster analysis computer Mathematics |
Zdroj: | ICPR |
DOI: | 10.1109/icpr.2016.7899961 |
Popis: | Real-world datasets consist of data representations (views) from different sources which often provide information complementary to each other. Multi-view learning algorithms aim at exploiting the complementary information present in different views for clustering and classification tasks. Several multi-view clustering methods that aim at partitioning objects into clusters based on multiple representations of the object have been proposed. Almost all of the proposed methods assume that each example appears in all views or at least there is one view containing all examples. In real-world settings this assumption might be too restrictive. Recent work on Partial View Clustering addresses this limitation by proposing a Non-negative Matrix Factorization based approach called PVC. Our work extends the PVC work in two directions. First, the current PVC algorithm is designed specifically for two-view datasets. We extend this algorithm for the k partial-view scenario. Second, we extend our k partial-view algorithm to include view specific graph laplacian regularization. This enables the proposed algorithm to exploit the intrinsic geometry of the data distribution in each view. The proposed method, which is referred to as GPMVC (Graph Regularized Partial Multi-View Clustering), is compared against 7 baseline methods (including PVC) on 5 publicly available text and image datasets. In all settings the proposed GPMVC method outperforms all baselines. For the purpose of reproducibility, we provide access to our code. |
Databáze: | OpenAIRE |
Externí odkaz: |