Autor: |
Wiesmüller F; AIT Austrian Institute of Technology, Graz, Austria.; FH Joanneum, Graz, Austria.; Ludwig Boltzmann Institute for Digital Health and Prevention, Salzburg, Austria., Hayn D; AIT Austrian Institute of Technology, Graz, Austria.; Ludwig Boltzmann Institute for Digital Health and Prevention, Salzburg, Austria., Hoffmann F; Versicherungsanstalt öffentlich Bediensteter, Eisenbahnen und Bergbau, Vienna, Austria., Hanke S; FH Joanneum, Graz, Austria., Kastner P; telbiomed Medizintechnik und IT Service GmbH, Graz, Austria., Falgenhauer M; AIT Austrian Institute of Technology, Graz, Austria., Schreier G; AIT Austrian Institute of Technology, Graz, Austria. |
Abstrakt: |
Telehealth services are becoming more and more popular, leading to an increasing amount of data to be monitored by health professionals. Machine learning can support them in managing these data. Therefore, the right machine learning algorithms need to be applied to the right data. We have implemented and validated different algorithms for selecting optimal time instances from time series data derived from a diabetes telehealth service. Intrinsic, supervised, and unsupervised instance selection algorithms were analysed. Instance selection had a huge impact on the accuracy of our random forest model for dropout prediction. The best results were achieved with a One Class Support Vector Machine, which improved the area under the receiver operating curve of the original algorithm from 69.91 to 75.88 %. We conclude that, although hardly mentioned in telehealth literature so far, instance selection has the potential to significantly improve the accuracy of machine learning algorithms. |