Probing Cross-Modal Representations in Multi-Step Relational Reasoning

Autor:	Parfenova, I., Elliott, D., Fernández, R., Pezzelle, S., Rogers, A., Calixto, I., Vulić, I., Saphra, N., Kassner, N., Camburu, O.-M., Bansal, T., Shwartz, V.
Přispěvatelé:	Language and Computation (ILLC, FNWI/FGw), ILLC (FNWI)
Jazyk:	angličtina
Rok vydání:	2021
Předmět:	Modal business.industry Computer science Construct (python library) Artificial intelligence Language model business Relational reasoning computer.software_genre computer Natural language processing Transformer (machine learning model) Task (project management)
Zdroj:	The 6th Workshop on Representation Learning for NLP: RepL4NLP 2021 : proceedings of the workshop : August 6, 2021, Bangkok, Thailand The 6th Workshop on Representation Learning for NLP Proceedings of the 6th Workshop on Representation Learning for NLP (RepL4NLP-2021) Parfenova, I, Elliott, D, Fernández, R & Pezzelle, S 2021, Probing Cross-Modal Representations in Multi-Step Relational Reasoning . in Proceedings of the 6th Workshop on Representation Learning for NLP (RepL4NLP-2021) . Association for Computational Linguistics, pp. 152-162, 6th Workshop on Representation Learning for NLP (RepL4NLP-2021), Online, 01/08/2021 . https://doi.org/10.18653/v1/2021.repl4nlp-1.16
DOI:	10.18653/v1/2021.repl4nlp-1.16
Popis:	We investigate the representations learned by vision and language models in tasks that require relational reasoning. Focusing on the problem of assessing the relative size of objects in abstract visual contexts, we analyse both one-step and two-step reasoning. For the latter, we construct a new dataset of three-image scenes and define a task that requires reasoning at the level of the individual images and across images in a scene. We probe the learned model representations using diagnostic classifiers. Our experiments show that pretrained multimodal transformer-based architectures can perform higher-level relational reasoning, and are able to learn representations for novel tasks and data that are very different from what was seen in pretraining.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::427ac3bd9ed628a9f7f9fd48337f5e85 https://dare.uva.nl/personal/pure/en/publications/probing-crossmodal-representations-in-multistep-relational-reasoning(8d880715-ff48-4d50-832e-62b6111929a9).html Zobrazit plný text záznamu