Bi-modal sentence structure for language modeling

Autor: G. Zavaliagkos, Kristine W. Ma, Marie Meteer
Rok vydání: 2000
Předmět:
Zdroj: Speech Communication. 31:51-67
ISSN: 0167-6393
DOI: 10.1016/s0167-6393(99)00060-6
Popis: According to discourse theories in linguistics, conversational utterances possess an informational structure. That is, each sentence consists of two components: the given and the new . The given refers to information that has previously been conveyed in the conversation such as that in That's interesting . The new section of a sentence introduces additional information that is new to the conversation such as the word interesting in the previous example. In this work, we take advantage of this inherent structure for the purpose of automatic conversational speech recognition by building sub-sentence discourse language models (LMs) to represent the bi-modal nature of each conversational sentence. The internal sentence structure is captured with a statistical sentence model regardless of whether the input sentences are linguistically or acoustically segmented. The proposed model is verified on the Switchboard corpus. The resulting model contributes to a reduction in both LM perplexity and word recognition error rate.
Databáze: OpenAIRE