Scaling Up Sign Spotting Through Sign Language Dictionaries
Author: Varol, G., Momeni, L., Albanie, S., Afouras, T., Zisserman, A.
Contributors: Varol, G. [0000-0002-8438-6152], Apollo - University of Cambridge Repository
Publication year: 2022
Subject:
Description: The focus of this work is *sign spotting*: given a video of an isolated sign, our task is to identify *whether* and *where* it has been signed in a continuous, co-articulated sign language video. To achieve this sign spotting task, we train a model using multiple types of available supervision by: (1) *watching* existing footage which is sparsely labelled using mouthing cues; (2) *reading* associated subtitles (readily available translations of the signed content), which provide additional weak supervision; (3) *looking up* words (for which no co-articulated labelled examples are available) in visual sign language dictionaries to enable novel sign spotting. These three tasks are integrated into a unified learning framework using the principles of Noise Contrastive Estimation and Multiple Instance Learning (a sketch of such a loss follows this record). We validate the effectiveness of our approach on low-shot sign spotting benchmarks. In addition, we contribute BSLDict, a machine-readable British Sign Language (BSL) dictionary dataset of isolated signs, to facilitate study of this task. The dataset, models and code are available at our project page. Comment: Appears in: 2022 International Journal of Computer Vision (IJCV). 25 pages. arXiv admin note: substantial text overlap with arXiv:2010.04002
Database: OpenAIRE
External link:
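The abstract names Noise Contrastive Estimation and Multiple Instance Learning as the principles unifying the three supervision sources, but the record gives no formula. Below is a minimal PyTorch-style sketch of an MIL-NCE-flavoured objective consistent with that description; the function name `mil_nce_loss`, the tensor shapes, and the temperature value are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def mil_nce_loss(video_emb, dict_emb, pos_mask, temperature=0.07):
    """Hypothetical MIL-NCE objective for sign spotting (a sketch, not the paper's code).

    video_emb: (B, D) embeddings of continuous-video windows
    dict_emb:  (N, D) embeddings of isolated dictionary sign clips
    pos_mask:  (B, N) bool; True where clip n is a candidate (weakly
               labelled) positive for window b. Each row needs at
               least one True entry, or the loss diverges.
    """
    video_emb = F.normalize(video_emb, dim=-1)
    dict_emb = F.normalize(dict_emb, dim=-1)
    sim = video_emb @ dict_emb.t() / temperature  # (B, N) cosine logits

    # MIL: the weak labels only tell us a bag of dictionary clips that
    # may match each window, so pool the bag with logsumexp instead of
    # committing to a single positive.
    pos = torch.logsumexp(sim.masked_fill(~pos_mask, float("-inf")), dim=1)

    # NCE: contrast the positive bag against all dictionary clips.
    return (torch.logsumexp(sim, dim=1) - pos).mean()

# Toy usage: 8 video windows, 100 dictionary clips, one candidate positive each.
if __name__ == "__main__":
    mask = torch.zeros(8, 100, dtype=torch.bool)
    mask.scatter_(1, torch.randint(0, 100, (8, 1)), True)
    print(mil_nce_loss(torch.randn(8, 256), torch.randn(100, 256), mask))
```

The logsumexp pooling is what makes this a Multiple Instance Learning objective: subtitles and mouthing cues localise a sign only to a bag of candidate windows or clips, and the loss rewards any member of the bag matching, which fits the weak supervision the abstract describes.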