Attention based Residual-Time Delay Neural Network for Indian Language Identification
Autor: | Tirusha Mandava, Anil Kumar Vuppala |
---|---|
Rok vydání: | 2019 |
Předmět: |
Indian English
Artificial neural network Language identification Computer science Time delay neural network Speech recognition 020208 electrical & electronic engineering Word error rate 02 engineering and technology language.human_language Set (abstract data type) 030507 speech-language pathology & audiology 03 medical and health sciences Identification (information) ComputingMethodologies_PATTERNRECOGNITION Cepstrum 0202 electrical engineering electronic engineering information engineering language 0305 other medical science |
Zdroj: | IC3 |
Popis: | India is a multilingual society having more than 1600 languages. Most of these languages are having an overlapping set of phonemes. This makes developing language identification (LID) framework difficult for Indian languages. In this paper, the above challenge is addressed using phonetic features. To model the temporal variations in phonetic features, attention based residual-time delay neural network (RES-TDNN) is proposed. This network effectively captures long-range temporal dependencies through TDNN and attention mechanism. The proposed network has been evaluated on IIITH-ILSC database using phonetic and acoustic features. The database consists of 22 official Indian languages and Indian English. Attention based RES-TDNN outperformed the other state-of-the-art networks such as deep neural network, long short-term memory network and produced an equal error rate of 9.46%. Further, the fusion of shifted delta cepstral and phonetic features have improved the performance. |
Databáze: | OpenAIRE |
Externí odkaz: |