Výsledky vyhledávání

Report

Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs

Autor: Mehrafarin, Houman, Eshghi, Arash, Konstas, Ioannis

Evaluating Large Language Models (LLMs) on reasoning benchmarks demonstrates their ability to solve compositional questions. However, little is known of whether these models engage in genuine logical reasoning or simply rely on implicit cues to gener

Externí odkaz: http://arxiv.org/abs/2410.20200

Zobrazit plný text záznamu

Report

CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts

Autor: Nikandrou, Malvina, Pantazopoulos, Georgios, Vitsakis, Nikolas, Konstas, Ioannis, Suglia, Alessandro

As Vision and Language models (VLMs) become accessible across the globe, it is important that they demonstrate cultural knowledge. In this paper, we introduce CROPE, a visual question answering benchmark designed to probe the knowledge of culture-spe

Externí odkaz: http://arxiv.org/abs/2410.15453

Zobrazit plný text záznamu

Report

Re-examining Sexism and Misogyny Classification with Annotator Attitudes

Autor: Jiang, Aiqi, Vitsakis, Nikolas, Dinkar, Tanvi, Abercrombie, Gavin, Konstas, Ioannis

Gender-Based Violence (GBV) is an increasing problem online, but existing datasets fail to capture the plurality of possible annotator perspectives or ensure the representation of affected groups. We revisit two important stages in the moderation pip

Externí odkaz: http://arxiv.org/abs/2410.03543

Zobrazit plný text záznamu

Report

Voices in a Crowd: Searching for Clusters of Unique Perspectives

Autor: Vitsakis, Nikolas, Parekh, Amit, Konstas, Ioannis

Language models have been shown to reproduce underlying biases existing in their training data, which is the majority perspective by default. Proposed solutions aim to capture minority perspectives by either modelling annotator disagreements or group

Externí odkaz: http://arxiv.org/abs/2407.14259

Zobrazit plný text záznamu

Report

Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks

Autor: Parekh, Amit, Vitsakis, Nikolas, Suglia, Alessandro, Konstas, Ioannis

Evaluating the generalisation capabilities of multimodal models based solely on their performance on out-of-distribution data fails to capture their true robustness. This work introduces a comprehensive evaluation framework that systematically examin

Externí odkaz: http://arxiv.org/abs/2407.03967

Zobrazit plný text záznamu

Report

Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation

Autor: Nikandrou, Malvina, Pantazopoulos, Georgios, Konstas, Ioannis, Suglia, Alessandro

Continual learning focuses on incrementally training a model on a sequence of tasks with the aim of learning new tasks while minimizing performance drop on previous tasks. Existing approaches at the intersection of Continual Learning and Visual Quest

Externí odkaz: http://arxiv.org/abs/2406.19297

Zobrazit plný text záznamu

Report

AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding

Autor: Suglia, Alessandro, Greco, Claudio, Baker, Katie, Part, Jose L., Papaioannou, Ioannis, Eshghi, Arash, Konstas, Ioannis, Lemon, Oliver

AI personal assistants deployed via robots or wearables require embodied understanding to collaborate with humans effectively. However, current Vision-Language Models (VLMs) primarily focus on third-person view videos, neglecting the richness of egoc

Externí odkaz: http://arxiv.org/abs/2406.13807

Zobrazit plný text záznamu

Report

Visually Grounded Language Learning: a review of language games, datasets, tasks, and models

Autor: Suglia, Alessandro, Konstas, Ioannis, Lemon, Oliver

In recent years, several machine learning models have been proposed. They are trained with a language modelling objective on large-scale text-only data. With such pretraining, they can achieve impressive results on many Natural Language Understanding

Externí odkaz: http://arxiv.org/abs/2312.02431

Zobrazit plný text záznamu

Report

Multitask Multimodal Prompted Training for Interactive Embodied Task Completion

Autor: Pantazopoulos, Georgios, Nikandrou, Malvina, Parekh, Amit, Hemanthage, Bhathiya, Eshghi, Arash, Konstas, Ioannis, Rieser, Verena, Lemon, Oliver, Suglia, Alessandro

Interactive and embodied tasks pose at least two fundamental challenges to existing Vision & Language (VL) models, including 1) grounding language in trajectories of actions and observations, and 2) referential disambiguation. To tackle these challen

Externí odkaz: http://arxiv.org/abs/2311.04067

Zobrazit plný text záznamu

Report

No that's not what I meant: Handling Third Position Repair in Conversational Question Answering

Autor: Balaraman, Vevake, Eshghi, Arash, Konstas, Ioannis, Papaioannou, Ioannis

The ability to handle miscommunication is crucial to robust and faithful conversational AI. People usually deal with miscommunication immediately as they detect it, using highly systematic interactional mechanisms called repair. One important type of

Externí odkaz: http://arxiv.org/abs/2307.16689

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání