DSCD: A Novel Deep Subspace Clustering Denoise Network for Single-Cell Clustering
Autor: | Tao Zhou, Rui-Yi Li, Zhiye Wang, Yiwen Lu, Chang Yu, Siyun Hou |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2020 |
Předmět: |
spectral clustering
General Computer Science Computer science business.industry Dimensionality reduction Big data auto-encoder General Engineering Pattern recognition Tracing Autoencoder Synthetic data Identification (information) sparse self-express General Materials Science Artificial intelligence lcsh:Electrical engineering. Electronics. Nuclear engineering Cluster analysis Representation (mathematics) business Single cell RNA-seq data lcsh:TK1-9971 |
Zdroj: | IEEE Access, Vol 8, Pp 109857-109865 (2020) |
ISSN: | 2169-3536 |
Popis: | Single-cell RNA sequencing(scRNA-seq) technology has boomed in the past decade which makes it possible to study biological problems at the resolution of cellular-level. Currently, the research mainly focuses on exploring the cellular heterogeneity, involving studies about identifying cell type identification, cell lineage tracing, spatial model reconstruction of complex organizations, etc. Clustering analysis is always the most effective way in grouping single cells in previous studies. However, existing scRNA-seq clustering methods separate pre-processing and clustering tasks that complicated the problem. In addition, the emergence of big data further limits the traditional clustering algorithms' application on scRNA-seq data. Therefore, developing novel clustering methods and improving clustering accuracy for growing scRNA-seq data is a continuous task. In this paper, we propose a highly integrated Deep Subspace Clustering Denoise Network named DSCD, which integrates denoise, dimension reduction and clustering in a unified framework. Based on the neural network architecture of autoencoder, DSCD discovers the low dimensional latent structure within scRNA-seq data from the compressed representation. Furthermore, we add a novel self-expressive denoise layer to learning the global relationships between single cells, which is the main innovation of DSCD. Experimental results on the synthetic data demonstrate the effectiveness of the novel denoise layer. From the clustering results on 5 real scRNA-seq datasets, we find that DSCD outperforms the related subspace clustering algorithms and state of the art methods. In conclusion, DSCD responds well to the rapidly increasing scRNA-seq data scale, greatly reduces human interference in dimension reduction and handles the noisy scRNA-seq data in proper way thus obtain a higher clustering accuracy. |
Databáze: | OpenAIRE |
Externí odkaz: |