Phoneme discrimination using connectionist networks
Author: | Raymond L. Watrous |
---|---|
Year of publication: | 1990 |
Subject: | Auditory Pathways; Sound Spectrography; Acoustics and Ultrasonics; Computer science; Speech recognition; Models, Neurological; Speech Acoustics; Nonlinear programming; Arts and Humanities (miscellaneous); Connectionism; Phonetics; Humans; Computer Simulation; Nervous System Physiological Phenomena; Set (psychology); Neurons; Artificial neural network; Signal Processing, Computer-Assisted; Cognition; Degree (music); Linguistics; ComputingMethodologies_PATTERNRECOGNITION; Speech Perception; Nerve Net; Gradient descent; Algorithms |
Source: | The Journal of the Acoustical Society of America. 87:1753-1772 |
ISSN: | 0001-4966 |
DOI: | 10.1121/1.399424 |
Description: | The application of connectionist networks to speech recognition is assessed using a set of eight representative phonetic discrimination problems chosen with respect to a theory of phonetics. A connectionist network model called the temporal flow model (TFM) is defined which represents temporal relationships using delay links and permits general patterns of connectivity. It is argued that the model has properties appropriate for time-varying signals such as speech. Networks are trained using gradient descent methods of iterative nonlinear optimization to reduce the mean-squared error between the actual and the desired response of the output units. Separate network solutions are demonstrated for all eight phonetic discrimination problems for one male speaker. The network solutions are analyzed carefully and are shown in every case to make use of known acoustic phonetic cues. The network solutions vary in the degree to which they make use of context-dependent cues to achieve phoneme recognition. The network solutions were tested on data not used for training and achieved an average accuracy of 99.5%. It is concluded that acoustic phonetic speech recognition can be accomplished using connectionist networks. |
Database: | OpenAIRE |
External link: |
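
The description mentions delay links and gradient descent on the mean-squared output error. The following is only a minimal illustrative sketch of those two ideas, not the temporal flow model itself: it assumes a single sigmoid output unit, three fixed delay links, and two invented toy signals standing in for two sound classes. All names (`RISING`, `FALLING`, `DELAYS`, `net_input`, `forward`, `train`) and the learning-rate and epoch values are hypothetical and do not come from the paper.

```python
import math

# Toy stand-ins for two sound classes (hypothetical data, not from the paper).
RISING = [0.0, 0.2, 0.4, 0.6, 0.8, 1.0]   # desired response 1
FALLING = [1.0, 0.8, 0.6, 0.4, 0.2, 0.0]  # desired response 0
DELAYS = [0, 1, 2]                         # one weight per delay link


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def net_input(weights, bias, signal, t, delays):
    """Weighted sum of the input seen through each delay link at time t."""
    return bias + sum(w * signal[t - d] for w, d in zip(weights, delays))


def forward(weights, bias, signal, delays):
    """Output-unit response at every time step where all delays are filled."""
    return [sigmoid(net_input(weights, bias, signal, t, delays))
            for t in range(max(delays), len(signal))]


def train(examples, delays, lr=0.5, epochs=2000):
    """Batch gradient descent on the mean-squared output error."""
    weights = [0.0] * len(delays)
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * len(delays)
        grad_b = 0.0
        count = 0
        for signal, target in examples:
            for t in range(max(delays), len(signal)):
                y = sigmoid(net_input(weights, bias, signal, t, delays))
                # Gradient of 0.5 * (y - target)**2 with respect to the net input.
                delta = (y - target) * y * (1.0 - y)
                for i, d in enumerate(delays):
                    grad_w[i] += delta * signal[t - d]
                grad_b += delta
                count += 1
        # Step against the averaged gradient.
        weights = [w - lr * g / count for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / count
    return weights, bias


if __name__ == "__main__":
    w, b = train([(RISING, 1.0), (FALLING, 0.0)], DELAYS)
    print("rising :", [round(y, 2) for y in forward(w, b, RISING, DELAYS)])
    print("falling:", [round(y, 2) for y in forward(w, b, FALLING, DELAYS)])
```

Because the unit sees the current sample together with delayed samples, it can respond to the direction of change in the signal rather than to its instantaneous value, which is the kind of temporal relationship the delay links are meant to capture.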