Výsledky vyhledávání

Report

Moving Off-the-Grid: Scene-Grounded Video Representations

Autor: van Steenkiste, Sjoerd, Zoran, Daniel, Yang, Yi, Rubanova, Yulia, Kabra, Rishabh, Doersch, Carl, Gokay, Dilara, Heyward, Joseph, Pot, Etienne, Greff, Klaus, Hudson, Drew A., Keck, Thomas Albert, Carreira, Joao, Dosovitskiy, Alexey, Sajjadi, Mehdi S. M., Kipf, Thomas

Current vision models typically maintain a fixed correspondence between their representation structure and image space. Each layer comprises a set of tokens arranged "on-the-grid," which biases patches or tokens to encode information at a specific sp

Externí odkaz: http://arxiv.org/abs/2411.05927

Zobrazit plný text záznamu

Report

AudienceView: AI-Assisted Interpretation of Audience Feedback in Journalism

Autor: Brannon, William, Beeferman, Doug, Jiang, Hang, Heyward, Andrew, Roy, Deb

Publikováno v: Proc. CSCW (2024) 65-68

Understanding and making use of audience feedback is important but difficult for journalists, who now face an impractically large volume of audience comments online. We introduce AudienceView, an online tool to help journalists categorize and interpr

Externí odkaz: http://arxiv.org/abs/2407.12613

Zobrazit plný text záznamu

Report

Bridging Dictionary: AI-Generated Dictionary of Partisan Language Use

Autor: Jiang, Hang, Beeferman, Doug, Brannon, William, Heyward, Andrew, Roy, Deb

Publikováno v: Proc. CSCW (2024) 79-82

Words often carry different meanings for people from diverse backgrounds. Today's era of social polarization demands that we choose words carefully to prevent miscommunication, especially in political communication and journalism. To address this iss

Externí odkaz: http://arxiv.org/abs/2407.09661

Zobrazit plný text záznamu

Report

TAPVid-3D: A Benchmark for Tracking Any Point in 3D

Autor: Koppula, Skanda, Rocco, Ignacio, Yang, Yi, Heyward, Joe, Carreira, João, Zisserman, Andrew, Brostow, Gabriel, Doersch, Carl

We introduce a new benchmark, TAPVid-3D, for evaluating the task of long-range Tracking Any Point in 3D (TAP-3D). While point tracking in two dimensions (TAP) has many benchmarks measuring performance on real-world videos, such as TAPVid-DAVIS, three

Externí odkaz: http://arxiv.org/abs/2407.05921

Zobrazit plný text záznamu

Akademický článek

On the Boundaries : Reflecting Fifty Years Along

Autor: Heyward, Carter

Publikováno v: Anglican and Episcopal History, 2024 Jun 01. 93(2), 257-279.

Externí odkaz: https://www.jstor.org/stable/27316612

Zobrazit plný text záznamu

Report

BootsTAP: Bootstrapped Training for Tracking-Any-Point

Autor: Doersch, Carl, Luc, Pauline, Yang, Yi, Gokay, Dilara, Koppula, Skanda, Gupta, Ankush, Heyward, Joseph, Rocco, Ignacio, Goroshin, Ross, Carreira, João, Zisserman, Andrew

To endow models with greater understanding of physics and motion, it is useful to enable them to perceive how solid surfaces move and deform in real scenes. This can be formalized as Tracking-Any-Point (TAP), which requires the algorithm to track any

Externí odkaz: http://arxiv.org/abs/2402.00847

Zobrazit plný text záznamu

Report

Perception Test 2023: A Summary of the First Challenge And Outcome

Autor: Heyward, Joseph, Carreira, João, Damen, Dima, Zisserman, Andrew, Pătrăucean, Viorica

The First Perception Test challenge was held as a half-day workshop alongside the IEEE/CVF International Conference on Computer Vision (ICCV) 2023, with the goal of benchmarking state-of-the-art video models on the recently proposed Perception Test b

Externí odkaz: http://arxiv.org/abs/2312.13090

Zobrazit plný text záznamu

Report

A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames

Autor: Papalampidi, Pinelopi, Koppula, Skanda, Pathak, Shreya, Chiu, Justin, Heyward, Joe, Patraucean, Viorica, Shen, Jiajun, Miech, Antoine, Zisserman, Andrew, Nematzdeh, Aida

Understanding long, real-world videos requires modeling of long-range visual dependencies. To this end, we explore video-first architectures, building on the common paradigm of transferring large-scale, image--text models to video via shallow tempora

Externí odkaz: http://arxiv.org/abs/2312.07395

Zobrazit plný text záznamu

Report

Learning from One Continuous Video Stream

Autor: Carreira, João, King, Michael, Pătrăucean, Viorica, Gokay, Dilara, Ionescu, Cătălin, Yang, Yi, Zoran, Daniel, Heyward, Joseph, Doersch, Carl, Aytar, Yusuf, Damen, Dima, Zisserman, Andrew

We introduce a framework for online learning from a single continuous video stream -- the way people and animals learn, without mini-batches, data augmentation or shuffling. This poses great challenges given the high correlation between consecutive v

Externí odkaz: http://arxiv.org/abs/2312.00598

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání