Výsledky vyhledávání - "Anna Rohrbach"

Report

The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval

Autor: Boris, Meinardus, Anil, Batra, Anna, Rohrbach, Marcus, Rohrbach

Recent studies have shown promising results in utilizing multimodal large language models (MLLMs) for computer vision tasks such as object detection and semantic segmentation. However, many challenging video tasks remain under-explored. Video-languag

Externí odkaz: http://arxiv.org/abs/2406.18113

Zobrazit plný text záznamu

Akademický článek

Toward explainable and advisable model for self‐driving cars

Autor: Jinkyu Kim, Anna Rohrbach, Zeynep Akata, Suhong Moon, Teruhisa Misu, Yi‐Ting Chen, Trevor Darrell, John Canny

Publikováno v: Applied AI Letters, Vol 2, Iss 4, Pp n/a-n/a (2021)

Abstract Humans learn to drive through both practice and theory, for example, by studying the rules, while most self‐driving systems are limited to the former. Being able to incorporate human knowledge of typical causal driving behavior should bene

Externí odkaz: https://doaj.org/article/0b03332c2c2646a1a21cfb885107a291

Zobrazit plný text záznamu

Akademický článek

Generating visual explanations with natural language

Autor: Lisa Anne Hendricks, Anna Rohrbach, Bernt Schiele, Trevor Darrell, Zeynep Akata

Publikováno v: Applied AI Letters, Vol 2, Iss 4, Pp n/a-n/a (2021)

Abstract We generate natural language explanations for a fine‐grained visual recognition task. Our explanations fulfill two criteria. First, explanations are class discriminative, meaning they mention attributes in an image which are important to i

Externí odkaz: https://doaj.org/article/69ed6439dccc42589defb07bc3c6e376

Zobrazit plný text záznamu

The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning

Autor: Jack Hessel, Jena D. Hwang, Jae Sung Park, Rowan Zellers, Chandra Bhagavatula, Anna Rohrbach, Kate Saenko, Yejin Choi

Publikováno v: Lecture Notes in Computer Science ISBN: 9783031200588

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::f039a20cbc68ecea512b1e7d14928f04
https://doi.org/10.1007/978-3-031-20059-5_32

Zobrazit plný text záznamu

Exposing the Limits of Video-Text Models through Contrast Sets

Autor: Jae Sung Park, Sheng Shen, Ali Farhadi, Trevor Darrell, Yejin Choi, Anna Rohrbach

Publikováno v: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::acd392749a15ceb3c94acaeed8395f6c
https://doi.org/10.18653/v1/2022.naacl-main.261

Zobrazit plný text záznamu

Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly

Autor: Spencer Whitehead, Suzanne Petryk, Vedaad Shakib, Joseph Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach

Publikováno v: Lecture Notes in Computer Science ISBN: 9783031200588

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::57bb904fbd759b6707b9ae2f34566683
https://doi.org/10.1007/978-3-031-20059-5_9

Zobrazit plný text záznamu

Generating visual explanations with natural language

Autor: Anna Rohrbach, Zeynep Akata, Bernt Schiele, Trevor Darrell, Lisa Anne Hendricks

Publikováno v: Applied AI Letters, Vol 2, Iss 4, Pp n/a-n/a (2021)

We generate natural language explanations for a fine‐grained visual recognition task. Our explanations fulfill two criteria. First, explanations are class discriminative, meaning they mention attributes in an image which are important to identify a

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f1457001fdb7f9907703ff7d489092b5
https://doaj.org/article/69ed6439dccc42589defb07bc3c6e376

Zobrazit plný text záznamu

Toward explainable and advisable model for self‐driving cars

Autor: John Canny, Teruhisa Misu, Chen Yi-Ting, Trevor Darrell, Zeynep Akata, Anna Rohrbach, Jinkyu Kim, Suhong Moon

Publikováno v: Applied AI Letters, Vol 2, Iss 4, Pp n/a-n/a (2021)

Humans learn to drive through both practice and theory, for example, by studying the rules, while most self‐driving systems are limited to the former. Being able to incorporate human knowledge of typical causal driving behavior should benefit auton

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c54e3c2bd04f0827bd84b9b1e2ddf83d
https://doi.org/10.1002/ail2.56

Zobrazit plný text záznamu

Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal Misinformation

Autor: Giscard Biamby, Grace Luo, Trevor Darrell, Anna Rohrbach

Detecting out-of-context media, such as "mis-captioned" images on Twitter, is a relevant problem, especially in domains of high public significance. In this work we aim to develop defenses against such misinformation for the topics of Climate Change,

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7d84700e5194833787fca86ba0c9eec8

Zobrazit plný text záznamu

Object-Region Video Transformers

Autor: Roei Herzig, Elad Ben-Avraham, Karttikeya Mangalam, Amir Bar, Gal Chechik, Anna Rohrbach, Trevor Darrell, Amir Globerson

Recently, video transformers have shown great success in video understanding, exceeding CNN performance; yet existing video transformer models do not explicitly model objects, although objects can be essential for recognizing actions. In this work, w

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::457ada6210428d4e8a3a3f6f8b715301

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání