pyDRMetrics - A Python toolkit for dimensionality reduction quality assessment

Autor: Yinsheng Zhang, Qian Shang, Guoming Zhang
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Zdroj: Heliyon, Vol 7, Iss 2, Pp e06199- (2021)
Druh dokumentu: article
ISSN: 2405-8440
DOI: 10.1016/j.heliyon.2021.e06199
Popis: High-dimensional data are pervasive in this bigdata era. To avoid the curse of the dimensionality problem, various dimensionality reduction (DR) algorithms have been proposed. To facilitate systematic DR quality comparison and assessment, this paper reviews related metrics and develops an open-source Python package pyDRMetrics. Supported metrics include reconstruction error, distance matrix, residual variance, ranking matrix, co-ranking matrix, trustworthiness, continuity, co-k-nearest neighbor size, LCMC (local continuity meta criterion), and rank-based local/global properties. pyDRMetrics provides a native Python class and a web-oriented API. A case study of mass spectra is conducted to demonstrate the package functions. A web GUI wrapper is also published to support user-friendly B/S applications.
Databáze: Directory of Open Access Journals