Clustered inter-phrase or word context-dependent models for continuously read Japanese
Autor: | Yu-Hung Kao, Kazuhiro Kondo, Barbara J. Wheatley |
---|---|
Rok vydání: | 1995 |
Předmět: |
Consonant
Vocabulary Phrase Acoustics and Ultrasonics Computer science business.industry Speech recognition media_common.quotation_subject Context (language use) computer.software_genre Variation (linguistics) Vowel Artificial intelligence business computer Coarticulation Word (computer architecture) Natural language processing media_common |
Zdroj: | Journal of the Acoustical Society of Japan (E). 16:299-310 |
ISSN: | 2185-3509 0388-2861 |
DOI: | 10.1250/ast.16.299 |
Popis: | This paper investigates methods to model inter-phrase or word context for continuous Japanese speech recognition. It is well known that in continuous speech, coarticulation between words or phrases induces allophonic variation of the beginning and ending phones in words or phrases. It was found that by compiling a network of contextdependent phonetic models which models these inter-word or inter-phrase context, recognition error reduction by 32 % can be achieved compared to models which do not account for inter-word context with task-dependent training, i.e. models that were trained with the same vocabulary as the test set. A more dramatic error reduction of up to 43% was possible with task-independent training. However, this will significantly increase the number of phonetic models required to model the vocabulary. With digit models, the increase in the number of models is 4 to 5 fold. To overcome this increase, we clustered the inter-word/phrase context into a few phonetic classes. Using one class for consonant inter-word context and two classes for vowel context, the recognition accuracy on digit string recognition was found to be virtually equal to the accuracy with unclustered models, while the number of phonetic models required was reduced by more than 50%. |
Databáze: | OpenAIRE |
Externí odkaz: |