A Voice-Input Voice-Output Communication Aid for People With Severe Speech Impairment
Autor: | P. O'Neill, Stuart Cunningham, Pam Enderby, Siddharth Sehgal, Phil D. Green, Rebecca Palmer, Mark S. Hawley |
---|---|
Rok vydání: | 2013 |
Předmět: |
Adult
Male Direct voice input Audio mining Sound Spectrography Computer science Speech recognition Biomedical Engineering Speech synthesis computer.software_genre Speech Disorders Speech Recognition Software Communication Aids for Disabled Dysarthria Speech Production Measurement Artificial Intelligence Internal Medicine medicine Humans Aged Aged 80 and over Voice activity detection General Neuroscience Rehabilitation Equipment Design Middle Aged Speech processing Equipment Failure Analysis Treatment Outcome Augmentative and alternative communication Vocabulary Controlled Female medicine.symptom computer |
Zdroj: | IEEE Transactions on Neural Systems and Rehabilitation Engineering. 21:23-31 |
ISSN: | 1558-0210 1534-4320 |
DOI: | 10.1109/tnsre.2012.2209678 |
Popis: | A new form of augmentative and alternative communication (AAC) device for people with severe speech impairment-the voice-input voice-output communication aid (VIVOCA)-is described. The VIVOCA recognizes the disordered speech of the user and builds messages, which are converted into synthetic speech. System development was carried out employing user-centered design and development methods, which identified and refined key requirements for the device. A novel methodology for building small vocabulary, speaker-dependent automatic speech recognizers with reduced amounts of training data, was applied. Experiments showed that this method is successful in generating good recognition performance (mean accuracy 96%) on highly disordered speech, even when recognition perplexity is increased. The selected message-building technique traded off various factors including speed of message construction and range of available message outputs. The VIVOCA was evaluated in a field trial by individuals with moderate to severe dysarthria and confirmed that they can make use of the device to produce intelligible speech output from disordered speech input. The trial highlighted some issues which limit the performance and usability of the device when applied in real usage situations, with mean recognition accuracy of 67% in these circumstances. These limitations will be addressed in future work. |
Databáze: | OpenAIRE |
Externí odkaz: |