FST-Based Pronunciation Lexicon Compression for Speech Engines

Autor: Žiga Golob, Jerneja Žganec Gros, Mario Žganec, Boštjan Vesnicer, Simon Dobrišek
Jazyk: angličtina
Rok vydání: 2012
Předmět:
Zdroj: International Journal of Advanced Robotic Systems, Vol 9 (2012)
Druh dokumentu: article
ISSN: 1729-8814
DOI: 10.5772/52795
Popis: Finite-state transducers are frequently used for pronunciation lexicon representations in speech engines, in which memory and processing resources are scarce. This paper proposes two possibilities for further reducing the memory footprint of finite-state transducers representing pronunciation lexicons. First, different alignments of grapheme and allophone transcriptions are studied and a reduction in the number of states of up to 30% is reported. Second, a combination of grapheme-to-allophone rules with a finite-state transducer is proposed, which yields a 65% smaller finite-state transducer than conventional approaches.
Databáze: Directory of Open Access Journals