Phonetic transcription in automatic speech recognition

Autor: Péter Tatai, Péter Mihajlik, Tibor Révész
Rok vydání: 2002
Předmět:
Zdroj: Acta Linguistica Hungarica. 49:407-425
ISSN: 1588-2624
1216-8076
DOI: 10.1556/aling.49.2002.3-4.9
Popis: This paper discusses automatic phonetic transcription to be applied in Hungarian speech recognition. It first deals with the basic technologies of automatic speech recognition (ASR) for the sake of readers not familiar with this scientific field, then it discusses the place of (automatic) phonetic transcription in ASR. After that, our method developed for transcribing Hungarian texts automatically is introduced. This technique is an extension of the traditional linear transcription approach; its output is called 'optioned' because it contains pronunciation options in parallel arcs. We present our experiences with promising improvements in recogniser training efficiency. The achievements are due to the application of deeper linguistic (phonological) knowledge. With the training technique developed not only the quality of the acoustic models can be enhanced, but also, at the same time, the amount of the required manual work can effectively be decreased.
Databáze: OpenAIRE