Text segmentation of spoken meeting transcripts.

Authors: Sharp, Bernadette; Chibelushi, Caroline
Subject:
Source: International Journal of Speech Technology; Dec 2008, Vol. 11 Issue 3/4, p157-165, 9p, 3 Diagrams, 3 Charts, 3 Graphs
Abstract: Text segmentation has played an important role in information retrieval as well as natural language processing. Current segmentation methods are well suited to written and structured texts, since they make use of the distinctive macro-level structures of such texts; however, text segmentation of transcribed multi-party conversation presents a different challenge, given its ill-formed sentences and lack of macro-level text units. This paper describes an algorithm for segmenting spoken meeting transcripts that combines semantically complex lexical relations with speech cue phrases to build lexical chains for determining topic boundaries. [ABSTRACT FROM AUTHOR]
Database: Complementary Index
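
Illustrative note: the abstract names lexical chains and speech cue phrases as the signals used to find topic boundaries. The sketch below is not the authors' algorithm; it is a toy illustration of the general idea under strong assumptions. Chains are built from exact word repetition only (the paper refers to semantically complex lexical relations, which are not modeled here), and the cue phrase list, stopword list, scoring rule, and threshold are all hypothetical choices made for this example.

```python
import re
from collections import defaultdict

# Hypothetical cue phrases and stopwords, chosen for illustration only.
CUE_PHRASES = ("okay", "so", "moving on", "next", "right")
STOPWORDS = {
    "the", "a", "an", "we", "to", "of", "and", "is", "it", "i", "that",
    "on", "for", "in", "so", "okay", "next", "right", "moving",
}

def tokenize(utterance):
    """Lowercase an utterance and drop stopwords."""
    words = re.findall(r"[a-z']+", utterance.lower())
    return [w for w in words if w not in STOPWORDS]

def build_chains(utterances):
    """Map each repeated word to the (first, last) utterance index it spans.

    Assumption: a 'chain' here is plain word repetition, a simplification.
    """
    positions = defaultdict(list)
    for i, utt in enumerate(utterances):
        for word in set(tokenize(utt)):
            positions[word].append(i)
    return {w: (idx[0], idx[-1]) for w, idx in positions.items() if len(idx) > 1}

def starts_with_cue(utterance):
    """True if the utterance opens with one of the hypothetical cue phrases."""
    text = utterance.lower().lstrip()
    return any(re.match(rf"{re.escape(cue)}\b", text) for cue in CUE_PHRASES)

def segment(utterances, threshold=2):
    """Return indices of utterances that are guessed to open a new topic."""
    chains = build_chains(utterances)
    boundaries = []
    for gap in range(1, len(utterances)):
        # Chains that end right before the gap suggest a topic is closing.
        score = sum(1 for _, last in chains.values() if last == gap - 1)
        # Chains that start right after the gap suggest a topic is opening.
        score += sum(1 for first, _ in chains.values() if first == gap)
        # A cue phrase at the start of the next utterance adds evidence.
        if starts_with_cue(utterances[gap]):
            score += 1
        if score >= threshold:
            boundaries.append(gap)
    return boundaries

example = [
    "The budget needs review this quarter",
    "Yes the budget review is overdue",
    "Okay, moving on, the website launch date",
    "The website launch is in March",
]
print(segment(example))  # prints [2]
```

In this example the chains for "budget" and "review" end at the second utterance, the chains for "website" and "launch" start at the third, and the third utterance opens with a cue phrase, so the only boundary reported is before the third utterance.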