Výsledky vyhledávání - "Kreiss, Elisa"

Report

Updating CLIP to Prefer Descriptions Over Captions

Autor: Zur, Amir, Kreiss, Elisa, D'Oosterlinck, Karel, Potts, Christopher, Geiger, Atticus

Although CLIPScore is a powerful generic metric that captures the similarity between a text and an image, it fails to distinguish between a caption that is meant to complement the information in an image and a description that is meant to replace an

Externí odkaz: http://arxiv.org/abs/2406.09458

Zobrazit plný text záznamu

Report

CommVQA: Situating Visual Question Answering in Communicative Contexts

Autor: Naik, Nandita Shankar, Potts, Christopher, Kreiss, Elisa

Current visual question answering (VQA) models tend to be trained and evaluated on image-question pairs in isolation. However, the questions people ask are dependent on their informational needs and prior knowledge about the image content. To evaluat

Externí odkaz: http://arxiv.org/abs/2402.15002

Zobrazit plný text záznamu

Report

ContextRef: Evaluating Referenceless Metrics For Image Description Generation

Autor: Kreiss, Elisa, Zelikman, Eric, Potts, Christopher, Haber, Nick

Referenceless metrics (e.g., CLIPScore) use pretrained vision--language models to assess image descriptions directly without costly ground-truth reference texts. Such methods can facilitate rapid progress, but only if they truly align with human pref

Externí odkaz: http://arxiv.org/abs/2309.11710

Zobrazit plný text záznamu

Report

Context-VQA: Towards Context-Aware and Purposeful Visual Question Answering

Autor: Naik, Nandita, Potts, Christopher, Kreiss, Elisa

Visual question answering (VQA) has the potential to make the Internet more accessible in an interactive way, allowing people who cannot see images to ask questions about them. However, multiple studies have shown that people who are blind or have lo

Externí odkaz: http://arxiv.org/abs/2307.15745

Zobrazit plný text záznamu

Report

Characterizing Image Accessibility on Wikipedia across Languages

Autor: Kreiss, Elisa, Srinivasan, Krishna, Piccardi, Tiziano, Hermosillo, Jesus Adolfo, Bennett, Cynthia, Bernstein, Michael S., Morris, Meredith Ringel, Potts, Christopher

We make a first attempt to characterize image accessibility on Wikipedia across languages, present new experimental results that can inform efforts to assess description quality, and offer some strategies to improve Wikipedia's image accessibility.

Externí odkaz: http://arxiv.org/abs/2305.09038

Zobrazit plný text záznamu

Report

Context Matters for Image Descriptions for Accessibility: Challenges for Referenceless Evaluation Metrics

Autor: Kreiss, Elisa, Bennett, Cynthia, Hooshmand, Shayan, Zelikman, Eric, Morris, Meredith Ringel, Potts, Christopher

Few images on the Web receive alt-text descriptions that would make them accessible to blind and low vision (BLV) users. Image-based NLG systems have progressed to the point where they can begin to address this persistent societal problem, but these

Externí odkaz: http://arxiv.org/abs/2205.10646

Zobrazit plný text záznamu

Report

Color Overmodification Emerges from Data-Driven Learning and Pragmatic Reasoning

Autor: Fang, Fei, Sinha, Kunal, Goodman, Noah D., Potts, Christopher, Kreiss, Elisa

Speakers' referential expressions often depart from communicative ideals in ways that help illuminate the nature of pragmatic language use. Patterns of overmodification, in which a speaker uses a modifier that is redundant given their communicative g

Externí odkaz: http://arxiv.org/abs/2205.09172

Zobrazit plný text záznamu

Report

Causal Distillation for Language Models

Autor: Wu, Zhengxuan, Geiger, Atticus, Rozner, Josh, Kreiss, Elisa, Lu, Hanson, Icard, Thomas, Potts, Christopher, Goodman, Noah D.

Publikováno v: NAACL 2022

Distillation efforts have led to language models that are more compact and efficient without serious drops in performance. The standard approach to distillation trains a student model against two objectives: a task-specific objective (e.g., language

Externí odkaz: http://arxiv.org/abs/2112.02505

Zobrazit plný text záznamu

Report

Inducing Causal Structure for Interpretable Neural Networks

Autor: Geiger, Atticus, Wu, Zhengxuan, Lu, Hanson, Rozner, Josh, Kreiss, Elisa, Icard, Thomas, Goodman, Noah D., Potts, Christopher

In many areas, we have well-founded insights about causal structure that would be useful to bring into our trained models while still allowing them to learn in a data-driven fashion. To achieve this, we present the new method of interchange intervent

Externí odkaz: http://arxiv.org/abs/2112.00826

Zobrazit plný text záznamu

Report

ReaSCAN: Compositional Reasoning in Language Grounding

Autor: Wu, Zhengxuan, Kreiss, Elisa, Ong, Desmond C., Potts, Christopher

Publikováno v: NeurIPS 2021 (Dataset and Benchmarks Track)

The ability to compositionally map language to referents, relations, and actions is an essential component of language understanding. The recent gSCAN dataset (Ruis et al. 2020, NeurIPS) is an inspiring attempt to assess the capacity of models to lea

Externí odkaz: http://arxiv.org/abs/2109.08994

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání