TripletCough: Cougher Identification and Verification From Contact-Free Smartphone-Based Audio Recordings Using Metric Learning
Autor: | Stefan Jokic, David Cleres, Frank Rassouli, Claudia Steurer-Stey, Milo A. Puhan, Martin Brutsche, Elgar Fleisch, Filipe Barata |
---|---|
Přispěvatelé: | University of Zurich, Barata, Filipe |
Jazyk: | němčina |
Rok vydání: | 2022 |
Předmět: |
2208 Electrical and Electronic Engineering
health sciences 610 Medicine & health Health Informatics computer science 10060 Epidemiology Biostatistics and Prevention Institute (EBPI) Respiration Disorders Computer Science Applications Cough Health Information Management Cough monitoring metric learning mobile sensing remote patient monitoring speaker identification speaker verification triplet network 3605 Health Information Management 1706 Computer Science Applications Humans Neural Networks Computer Smartphone Electrical and Electronic Engineering Algorithms 2718 Health Informatics |
Zdroj: | IEEE Journal of Biomedical and Health Informatics, 26 (6) |
ISSN: | 2168-2194 2168-2208 |
Popis: | Cough, a symptom associated with many prevalent respiratory diseases, can serve as a potential biomarker for diagnosis and disease progression. Consequently, the development of cough monitoring systems and, in particular, automatic cough detection algorithms have been studied since the early 2000s. Recently, there has been an increased focus on the efficiency of such algorithms, as implementation on consumer-centric devices such as smartphones would provide a scalable and affordable solution for monitoring cough with contact-free sensors. Current algorithms, however, are incapable of discerning between coughs of different individuals and, thus, cannot function reliably in situations where potentially multiple individuals have to be monitored in shared environments. Therefore, we propose a weakly supervised metric learning approach for cougher recognition based on smartphone audio recordings of coughs. Our approach involves a triplet network architecture, which employs convolutional neural networks (CNNs). The CNNs of the triplet network learn an embedding function, which maps Mel spectrograms of cough recordings to an embedding space where they are more easily distinguishable. Using audio recordings of nocturnal coughs from asthmatic patients captured with a smartphone, our approach achieved a mean accuracy of 88% (10% SD) on two-way identification tests with 12 enrollment samples and accuracy of 80% and an equal error rate (EER) of 20% on verification tests. Furthermore, our approach outperformed human raters with regard to verification tests on average by 8% in accuracy, 4% in false acceptance rate (FAR), and 12% in false rejection rate (FRR). Our code and models are publicly available. IEEE Journal of Biomedical and Health Informatics, 26 (6) ISSN:2168-2194 ISSN:2168-2208 |
Databáze: | OpenAIRE |
Externí odkaz: |