A System for Information Retrieval from Large Records of Czech Spoken Data.

Autor: Sojka, Petr, Kopeček, Ivan, Pala, Karel, Nouza, Jan, Žďánský, Jindřich, Červa, Petr, Kolorenč, Jan
Zdroj: Text, Speech & Dialogue (9783540390909); 2006, p485-492, 8p
Abstrakt: In the paper we describe a complex multi-level system that serves for automatic search in large records of Czech spoken data. It includes modules for audio signal segmentation, speaker identification and adaptation, speech recognition and full-text search. The search can focus both on key-words and key-speakers. The transcription accuracy is about 79 % (for broadcast programs), search accuracy about 90 %. Due to its distributed platform, the system can operate in almost real-time. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index