Phoneme Similarity Matrices to Improve Long Audio Alignment for Automatic Subtitling

Autor:	Pablo Ruiz Fabo, Álvarez, Aitor, Arzelus, Haritz
Přispěvatelé:	Lattice - Langues, Textes, Traitements informatiques, Cognition - UMR 8094 (Lattice), Département Littératures et langage - ENS Paris (LILA), École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Centre National de la Recherche Scientifique (CNRS)-Université Sorbonne Paris Cité (USPC)-Université Sorbonne Nouvelle - Paris 3, VicomTech, Département Littératures et langage (LILA), Ruiz Fabo, Pablo
Jazyk:	angličtina
Rok vydání:	2014
Předmět:	phoneme similarity matrices automatic subtitling [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing [INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL] long audio alignment [SHS.LANGUE]Humanities and Social Sciences/Linguistics [SHS.LANGUE] Humanities and Social Sciences/Linguistics [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] [SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing
Zdroj:	LREC, Ninth International Conference on Language Resources and Evaluation LREC, Ninth International Conference on Language Resources and Evaluation, May 2014, Reykjavik, Iceland HAL Pablo Ruiz Fabo Scopus-Elsevier BASE-Bielefeld Academic Search Engine
Popis:	International audience; Long audio alignment systems for Spanish and English are presented, within an automatic subtitling application. Language-specific phone decoders automatically recognize audio contents at phoneme level. At the same time, language-dependent grapheme-to-phoneme modules perform a transcription of the script for the audio. A dynamic programming algorithm (Hirschberg's algorithm) finds matches between the phonemes automatically recognized by the phone decoder and the phonemes in the script's transcription. Alignment accuracy is evaluated when scoring alignment operations with a baseline binary matrix, and when scoring alignment operations with several continuous-score matrices, based on phoneme similarity as assessed through comparing multivalued phonological features. Alignment accuracy results are reported at phoneme, word and subtitle level. Alignment accuracy when using the continuous scoring matrices based on phonological similarity was clearly higher than when using the baseline binary matrix.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::2966745b1837a891cf31e5744f388dde https://hal.archives-ouvertes.fr/hal-01099239/file/387_Paper.pdf Zobrazit plný text záznamu