Aligning movies with scripts by exploiting temporal ordering constraints

Autor: Iftekhar Naim, Daniel Gildea, Henry Kautz, Abdullah Al Mamun, Young Chol Song, Jiebo Luo
Rok vydání: 2016
Předmět:
Zdroj: ICPR
DOI: 10.1109/icpr.2016.7899895
Popis: Scripts provide rich textual annotation of movies, including dialogs, character names, and other situational descriptions. Exploiting such rich annotations requires aligning the sentences in the scripts with the corresponding video frames. Previous work on aligning movies with scripts predominantly relies on time-aligned closed-captions or subtitles, which are not always available. In this paper, we focus on automatically aligning faces in movies with their corresponding character names in scripts without requiring closed-captions/subtitles. We utilize the intuition that faces in a movie generally appear in the same sequential order as their names are mentioned in the script. We first apply standard techniques for face detection and tracking, and cluster similar face tracks together. Next, we apply a generative Hidden Markov Model (HMM) and a discriminative Latent Conditional Random Field (LCRF) to align the clusters of face tracks with the corresponding character names. Our alignment models (especially LCRF) significantly outperform the previous state-of-the-art on two different movie datasets and for a wide range of face clustering algorithms.
Databáze: OpenAIRE