Sparse statistical health monitoring: A novel variable selection approach to diagnosis and follow-up of individual patients

Autor:	Lutgarde M. C. Buydens, Udo F. H. Engelke, Lionel Blanchet, Ron A. Wevers, Jasper Engel
Přispěvatelé:	RS: NUTRIM - R4 - Gene-environment interaction, RS: NUTRIM - R3 - Respiratory & Age-related Health
Rok vydání:	2017
Předmět:	0301 basic medicine Computer science Feature selection Machine learning computer.software_genre Fault detection and isolation DISEASE Analytical Chemistry 03 medical and health sciences ROAD FAULT ISOLATION Health care Metabolomics CLASS-MODELING TECHNIQUES Spectroscopy Human disease diagnosis Mahalanobis distance business.industry Process Chemistry and Technology Dimensionality reduction Precision medicine Disorders of movement Donders Center for Medical Neuroscience [Radboudumc 3] DISCRIMINANT Variable smearing Computer Science Applications Identification (information) Multivariate statistical process monitoring 030104 developmental biology Sparse Mahalanobis distance Principal component analysis Metric (unit) Artificial intelligence Data mining business computer Software
Zdroj:	Chemometrics and Intelligent Laboratory Systems, 164, pp. 83-93 Chemometrics and Intelligent Laboratory Systems, 164, 83-93. Elsevier Science Chemometrics and Intelligent Laboratory Systems, 164, 83-93
ISSN:	0169-7439
Popis:	The -omics technologies are becoming increasingly important in health care and are expected to contribute to personalized health care. In a typical experiment, cases and controls are compared as a two-class classification problem. This approach is often unsuitable, for example, because the classes are not well defined due to associated populations being biologically too heterogeneous. Recently, statistical health monitoring (SHM) was introduced as a complementary approach to allow for predictions at the individual level. This approach could be of use in all sorts of applications such as diagnosis of rare diseases, analysis of individual patterns in disease manifestation, disease monitoring, or personalized therapy.SHM uses the framework of statistical process monitoring (SPM) in a clinical setting. The method essentially combines estimation of Mahalanobis distances (MD) with principal component analysis (PCA) to evaluate the difference in the -omics data of an individual subject to a normal reference range (normal operating conditions). It is well known from SPM, however, that reliable identification of the variables primarily responsible for this difference is hampered by the smearing effect, which is a result of the PCA step. To avoid this problem, we propose to combine estimation of the MD with variable selection via an 11-norm penalty instead of using dimension reduction. This way a sparse MD metric is obtained.The effectiveness of this method is illustrated by several simulation studies and its application to urine H-1-NMR metabolomics data for diagnosis of multiple inborn errors of metabolism.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4e51bd05aa1e5173369039f2cead756f https://hdl.handle.net/2066/177956 Zobrazit plný text záznamu Full Text from ScienceDirect