Search results

Report

Principles of Visual Tokens for Efficient Video Understanding

Authors: Hao, Xinyue; Li, Gen; Gowda, Shreyank N; Fisher, Robert B; Huang, Jonathan; Arnab, Anurag; Sevilla-Lara, Laura

Video understanding has made huge strides in recent years, relying largely on the power of the transformer architecture. As this architecture is notoriously expensive and video is highly redundant, research into improving efficiency has become partic

External link: http://arxiv.org/abs/2411.13626

Report

Continual Learning Improves Zero-Shot Action Recognition

Authors: Gowda, Shreyank N; Moltisanti, Davide; Sevilla-Lara, Laura

Zero-shot action recognition requires a strong ability to generalize from pre-training and seen classes to novel unseen classes. Similarly, continual learning aims to develop models that can generalize effectively and learn new tasks without forgetti

External link: http://arxiv.org/abs/2410.10497

Report

Learning Precise Affordances from Egocentric Videos for Robotic Manipulation

Authors: Li, Gen; Tsagkas, Nikolaos; Song, Jifei; Mon-Williams, Ruaridh; Vijayakumar, Sethu; Shao, Kun; Sevilla-Lara, Laura

Affordance, defined as the potential actions that an object offers, is crucial for robotic manipulation tasks. A deep understanding of affordance can lead to more intelligent AI systems. For example, such knowledge directs an agent to grasp a knife b

External link: http://arxiv.org/abs/2408.10123

Report

Coarse or Fine? Recognising Action End States without Labels

Authors: Moltisanti, Davide; Bilen, Hakan; Sevilla-Lara, Laura; Keller, Frank

We focus on the problem of recognising the end state of an action in an image, which is critical for understanding what action is performed and in which manner. We study this focusing on the task of predicting the coarseness of a cut, i.e., deciding

External link: http://arxiv.org/abs/2405.07723

Report

Efficient Pre-training for Localized Instruction Generation of Videos

Authors: Batra, Anil; Moltisanti, Davide; Sevilla-Lara, Laura; Rohrbach, Marcus; Keller, Frank

Procedural videos, exemplified by recipe demonstrations, are instrumental in conveying step-by-step instructions. However, understanding such videos is challenging as it involves the precise localization of steps and the generation of textual instruc

External link: http://arxiv.org/abs/2311.15964

Report

Watt For What: Rethinking Deep Learning's Energy-Performance Relationship

Authors: Gowda, Shreyank N; Hao, Xinyue; Li, Gen; Gowda, Shashank Narayana; Jin, Xiaobo; Sevilla-Lara, Laura

Deep learning models have revolutionized various fields, from image recognition to natural language processing, by achieving unprecedented levels of accuracy. However, their increasing energy consumption has raised concerns about their environmental

External link: http://arxiv.org/abs/2310.06522

Report

Telling Stories for Common Sense Zero-Shot Action Recognition

Authors: Gowda, Shreyank N; Sevilla-Lara, Laura

Video understanding has long suffered from reliance on large labeled datasets, motivating research into zero-shot learning. Recent progress in language modeling presents opportunities to advance zero-shot video analysis, but constructing an effective

External link: http://arxiv.org/abs/2309.17327

Book

Técnicas de recepción y comunicación. ADGG0208. [electronic resource]

Author: Viera Lara, Laura

External link: KNAV e-book collection (Registered users: full text online for 5 minutes, further access on request.)

Book

Guía para la redacción de un proyecto de investigación. [electronic resource]

Author: Lara, Laura

External link: KNAV e-book collection (Registered users: full text online for 5 minutes, further access on request.)

Report

Learning Action Changes by Measuring Verb-Adverb Textual Relationships

Authors: Moltisanti, Davide; Keller, Frank; Bilen, Hakan; Sevilla-Lara, Laura

The goal of this work is to understand the way actions are performed in videos. That is, given a video, we aim to predict an adverb indicating a modification applied to the action (e.g. cut "finely"). We cast this problem as a regression task. We mea

External link: http://arxiv.org/abs/2303.15086
