An expert system for the production of phoneme strings from unmarked English text using machine-induced rules

Autor:	Alberto M. Segre, Bruce Arne Sherwood, Wayne B. Dickerson
Rok vydání:	1983
Předmět:	business.industry Computer science Speech recognition String (computer science) Speech synthesis Pronunciation Diphone computer.software_genre Expert system MBROLA Artificial intelligence Transcription (software) business computer Natural language processing Utterance
Zdroj:	EACL
DOI:	10.3115/980092.980098
Popis:	The speech synthesis group at the Computer-Based Education Research Laboratory (CERL) of the University of Illinois at Urbana-Champaign is developing a diphone speech synthesis system based on pitch-adaptive short-time Fourier transforms. This system accepts the phonemic specification of an utterance along with pitch, time, and amplitude warping functions in order to produce high quality speech output from stored diphone templates.This paper describes the operation of a program which operates as a front end for the diphone speech synthesis system. The UTTER (for "Unmarked Text Transcription by Expert Rule") system maps English text onto a phoneme string, which is then used as an input to the diphone speech synthesis system. The program is a two-tiered Expert System which operates first on the word level and then on the (vowel or consonant) cluster level. The system's knowledge about pronunciation is organized in two decision trees automatically generated by an induction algorithm on a dynamically specified "training set" of examples.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::a0b9e8e7fb8f82cdf7fa6244bb824dd3 https://doi.org/10.3115/980092.980098 Zobrazit plný text záznamu