Albayzin Evaluation: The PRHLT-UPV Audio Segmentation System
Autor: | Silvestre Cerdà, Joan Albert, Giménez Pastor, Adrián, Andrés Ferrer, Jesús, Civera Saiz, Jorge, Juan Císcar, Alfonso |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2012 |
Předmět: | |
Zdroj: | RiuNet. Repositorio Institucional de la Universitat Politécnica de Valéncia instname |
Popis: | This paper describes the audio segmentation system developed by the PRHLT research group at the UPV for the Albayzin Audio Segmentation Evaluation 2012. The PRHLT-UPV audio segmentation system is based on a conventional GMM-HMM speech recognition approach in which the vocabulary set is defined by the power set of segment classes. MFCC features were extracted to represent the acoustic signal and the AK toolkit was used for both, training acoustic models and performing audio segmentation. Experimental results reveals that our system provides an excellent performance on speech detection, so it could be successfully employed to provide speech segments to a diarization or speech recognition system. The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 287755. Funding was also provided by the Spanish Government (iTrans2 project, TIN2009-14511; FPU scholarship AP2010-4349). |
Databáze: | OpenAIRE |
Externí odkaz: |