Zobrazeno 1 - 10
of 6 779
pro vyhledávání: '"P, Carreira"'
Following the successful 2023 edition, we organised the Second Perception Test challenge as a half-day workshop alongside the IEEE/CVF European Conference on Computer Vision (ECCV) 2024, with the goal of benchmarking state-of-the-art video models and
Externí odkaz:
http://arxiv.org/abs/2411.19941
Autor:
van Steenkiste, Sjoerd, Zoran, Daniel, Yang, Yi, Rubanova, Yulia, Kabra, Rishabh, Doersch, Carl, Gokay, Dilara, Heyward, Joseph, Pot, Etienne, Greff, Klaus, Hudson, Drew A., Keck, Thomas Albert, Carreira, Joao, Dosovitskiy, Alexey, Sajjadi, Mehdi S. M., Kipf, Thomas
Current vision models typically maintain a fixed correspondence between their representation structure and image space. Each layer comprises a set of tokens arranged "on-the-grid," which biases patches or tokens to encode information at a specific sp
Externí odkaz:
http://arxiv.org/abs/2411.05927
Autor:
Sundaram, Jothi Prasanna Shanmuga, Zharmagambetov, Arman, Gabidolla, Magzhan, Carreira-Perpinan, Miguel A., Cerpa, Alberto
IoT is rapidly growing from small-scale apps to large-scale apps. Small-scale apps employ short-range radios like Zigbee,BLE while large-scale apps employ long-range radios like LoRa,NB-IoT. The other upcoming category of apps like P2P energy-trade i
Externí odkaz:
http://arxiv.org/abs/2409.18043
Autor:
Koppula, Skanda, Rocco, Ignacio, Yang, Yi, Heyward, Joe, Carreira, João, Zisserman, Andrew, Brostow, Gabriel, Doersch, Carl
We introduce a new benchmark, TAPVid-3D, for evaluating the task of long-range Tracking Any Point in 3D (TAP-3D). While point tracking in two dimensions (TAP) has many benchmarks measuring performance on real-world videos, such as TAPVid-DAVIS, three
Externí odkaz:
http://arxiv.org/abs/2407.05921
Automatic static cost analysis infers information about the resources used by programs without actually running them with concrete data, and presents such information as functions of input data sizes. Most of the analysis tools for logic programs (an
Externí odkaz:
http://arxiv.org/abs/2405.06972
Autor:
Doersch, Carl, Luc, Pauline, Yang, Yi, Gokay, Dilara, Koppula, Skanda, Gupta, Ankush, Heyward, Joseph, Rocco, Ignacio, Goroshin, Ross, Carreira, João, Zisserman, Andrew
To endow models with greater understanding of physics and motion, it is useful to enable them to perceive how solid surfaces move and deform in real scenes. This can be formalized as Tracking-Any-Point (TAP), which requires the algorithm to track any
Externí odkaz:
http://arxiv.org/abs/2402.00847
The First Perception Test challenge was held as a half-day workshop alongside the IEEE/CVF International Conference on Computer Vision (ICCV) 2023, with the goal of benchmarking state-of-the-art video models on the recently proposed Perception Test b
Externí odkaz:
http://arxiv.org/abs/2312.13090
Autor:
Robertson, Brant, Johnson, Benjamin D., Tacchella, Sandro, Eisenstein, Daniel J., Hainline, Kevin, Arribas, Santiago, Baker, William M., Bunker, Andrew J., Carniani, Stefano, Carreira, Courtney, Cargile, Phillip A., Charlot, Stéphane, Chevallard, Jacopo, Curti, Mirko, Curtis-Lake, Emma, D'Eugenio, Francesco, Egami, Eiichi, Hausen, Ryan, Helton, Jakob M., Jakobsen, Peter, Ji, Zhiyuan, Jones, Gareth C., Maiolino, Roberto, Maseda, Michael V., Nelson, Erica, Pérez-González, Pablo G., Puskás, Dávid, Rieke, Marcia, Smit, Renske, Sun, Fengwu, Übler, Hannah, Whitler, Lily, Williams, Christina C., Willmer, Christopher N. A., Willott, Chris, Witstok, Joris
We characterize the earliest galaxy population in the JADES Origins Field (JOF), the deepest imaging field observed with JWST. We make use of the ancillary Hubble optical images (5 filters spanning $0.4-0.9\mu\mathrm{m}$) and novel JWST images with 1
Externí odkaz:
http://arxiv.org/abs/2312.10033
Autor:
Carreira, João, King, Michael, Pătrăucean, Viorica, Gokay, Dilara, Ionescu, Cătălin, Yang, Yi, Zoran, Daniel, Heyward, Joseph, Doersch, Carl, Aytar, Yusuf, Damen, Dima, Zisserman, Andrew
We introduce a framework for online learning from a single continuous video stream -- the way people and animals learn, without mini-batches, data augmentation or shuffling. This poses great challenges given the high correlation between consecutive v
Externí odkaz:
http://arxiv.org/abs/2312.00598
Autor:
Venkataramanan, Shashanka, Rizve, Mamshad Nayeem, Carreira, João, Asano, Yuki M., Avrithis, Yannis
Self-supervised learning has unlocked the potential of scaling up pretraining to billions of images, since annotation is unnecessary. But are we making the best use of data? How more economical can we be? In this work, we attempt to answer this quest
Externí odkaz:
http://arxiv.org/abs/2310.08584