Using machine learning on cardiorespiratory fitness data for predicting hypertension: The Henry Ford ExercIse Testing (FIT) Project

Autor: Clinton A. Brawner, Mouaz H. Al-Mallah, Waqas Qureshi, Amjad M. Ahmed, Steven J. Keteyian, Sherif Sakr, Michael J. Blaha, Radwa Elshawi
Jazyk: angličtina
Rok vydání: 2018
Předmět:
Male
Support Vector Machine
Databases
Factual

Computer science
lcsh:Medicine
Blood Pressure
02 engineering and technology
Forests
030204 cardiovascular system & hematology
computer.software_genre
Stress testing (software)
Vascular Medicine
Machine Learning
Bayes' theorem
Mathematical and Statistical Techniques
0302 clinical medicine
Medicine and Health Sciences
0202 electrical engineering
electronic engineering
information engineering

lcsh:Science
Aged
80 and over

Multidisciplinary
Ecology
Artificial neural network
Applied Mathematics
Simulation and Modeling
Middle Aged
Terrestrial Environments
Cardiorespiratory Fitness
Area Under Curve
Hypertension
Physical Sciences
Female
020201 artificial intelligence & image processing
LogitBoost
Algorithms
Statistics (Mathematics)
Research Article
Adult
Computer and Information Sciences
Adolescent
Research and Analysis Methods
Machine learning
Ecosystems
Machine Learning Algorithms
Young Adult
03 medical and health sciences
Naive Bayes classifier
Artificial Intelligence
Support Vector Machines
Humans
Statistical Methods
Artificial Neural Networks
Aged
Computational Neuroscience
business.industry
Ecology and Environmental Sciences
lcsh:R
Biology and Life Sciences
Computational Biology
Bayes Theorem
Cardiorespiratory fitness
Support vector machine
Exercise Test
lcsh:Q
Neural Networks
Computer

Artificial intelligence
business
computer
Mathematics
Neuroscience
Forecasting
Zdroj: PLoS ONE, Vol 13, Iss 4, p e0195344 (2018)
PLoS ONE
ISSN: 1932-6203
Popis: This study evaluates and compares the performance of different machine learning techniques on predicting the individuals at risk of developing hypertension, and who are likely to benefit most from interventions, using the cardiorespiratory fitness data. The dataset of this study contains information of 23,095 patients who underwent clinician- referred exercise treadmill stress testing at Henry Ford Health Systems between 1991 and 2009 and had a complete 10-year follow-up. The variables of the dataset include information on vital signs, diagnosis and clinical laboratory measurements. Six machine learning techniques were investigated: LogitBoost (LB), Bayesian Network classifier (BN), Locally Weighted Naive Bayes (LWB), Artificial Neural Network (ANN), Support Vector Machine (SVM) and Random Tree Forest (RTF). Using different validation methods, the RTF model has shown the best performance (AUC = 0.93) and outperformed all other machine learning techniques examined in this study. The results have also shown that it is critical to carefully explore and evaluate the performance of the machine learning models using various model evaluation methods as the prediction accuracy can significantly differ.
Databáze: OpenAIRE
Nepřihlášeným uživatelům se plný text nezobrazuje