Entity linking across vision and language

Autor:	Marie-Francine Moens, Tinne Tuytelaars, Aparna Nurani Venkitasubramanian
Rok vydání:	2017
Předmět:	Markov random field Computer Networks and Communications business.industry Computer science 020207 software engineering 02 engineering and technology PSI_VISICS computer.software_genre Entity linking Hardware and Architecture 0202 electrical engineering electronic engineering information engineering Media Technology Key (cryptography) 020201 artificial intelligence & image processing Artificial intelligence business computer Software Natural language processing
Zdroj:	Multimedia Tools and Applications. 76:22599-22622
ISSN:	1573-7721 1380-7501
Popis:	We propose a novel weakly supervised framework that jointly tackles entity analysis tasks in vision and language. Given a video with subtitles, we jointly address the questions: a) What do the textual entity mentions refer to? and b) What/ who are in the video key frames? We use a Markov Random Field (MRF) to encode the dependencies within and across the two modalities. This MRF model incorporates beliefs using independent methods for the textual and visual entities. These beliefs are propagated across the modalities to jointly derive the entity labels. We apply the framework to a challenging dataset of wildlife documentaries with subtitles and show that this integrated modelling yields significantly better performance over text-based and vision-based approaches. We show that textual mentions that cannot be resolved using text-only methods are resolved correctly using our method. The approaches described here bring us closer to automated multimedia indexing. Nurani Venkitasubramanian A., Tuytelaars T., Moens M.-F., ''Entity linking across vision and language'', Multimedia tools and applications, vol. 76, no. 21, pp. 22599-22622, 24 pp., November 2017. ispartof: Multimedia Tools and Applications vol:76 issue:21 pages:22599-22622 status: published
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7abd7f8e646752008dcf46fe91dc8cbb https://doi.org/10.1007/s11042-017-4732-8 Zobrazit plný text záznamu Full text from SpringerLink