Výsledky vyhledávání

Report

ReferEverything: Towards Segmenting Everything We Can Speak of in Videos

Autor: Bagchi, Anurag, Bao, Zhipeng, Wang, Yu-Xiong, Tokmakov, Pavel, Hebert, Martial

We present REM, a framework for segmenting a wide range of concepts in video that can be described through natural language. Our method capitalizes on visual-language representations learned by video diffusion models on Internet-scale datasets. A key

Externí odkaz: http://arxiv.org/abs/2410.23287

Zobrazit plný text záznamu

Report

GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion

Autor: Guizilini, Vitor, Tokmakov, Pavel, Dave, Achal, Ambrus, Rares

3D reconstruction from a single image is a long-standing problem in computer vision. Learning-based methods address its inherent scale ambiguity by leveraging increasingly large labeled and unlabeled datasets, to produce geometric priors capable of g

Externí odkaz: http://arxiv.org/abs/2409.09896

Zobrazit plný text záznamu

Report

Dreamitate: Real-World Visuomotor Policy Learning via Video Generation

Autor: Liang, Junbang, Liu, Ruoshi, Ozguroglu, Ege, Sudhakar, Sruthi, Dave, Achal, Tokmakov, Pavel, Song, Shuran, Vondrick, Carl

A key challenge in manipulation is learning a policy that can robustly generalize to diverse visual environments. A promising mechanism for learning robust policies is to leverage video generative models, which are pretrained on large-scale datasets

Externí odkaz: http://arxiv.org/abs/2406.16862

Zobrazit plný text záznamu

Report

Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis

Autor: Van Hoorick, Basile, Wu, Rundi, Ozguroglu, Ege, Sargent, Kyle, Liu, Ruoshi, Tokmakov, Pavel, Dave, Achal, Zheng, Changxi, Vondrick, Carl

Accurate reconstruction of complex dynamic scenes from just a single viewpoint continues to be a challenging task in computer vision. Current dynamic novel view synthesis methods typically require videos from many different camera viewpoints, necessi

Externí odkaz: http://arxiv.org/abs/2405.14868

Zobrazit plný text záznamu

Report

pix2gestalt: Amodal Segmentation by Synthesizing Wholes

Autor: Ozguroglu, Ege, Liu, Ruoshi, Surís, Dídac, Chen, Dian, Dave, Achal, Tokmakov, Pavel, Vondrick, Carl

We introduce pix2gestalt, a framework for zero-shot amodal segmentation, which learns to estimate the shape and appearance of whole objects that are only partially visible behind occlusions. By capitalizing on large-scale diffusion models and transfe

Externí odkaz: http://arxiv.org/abs/2401.14398

Zobrazit plný text záznamu

Report

Understanding Video Transformers via Universal Concept Discovery

Autor: Kowal, Matthew, Dave, Achal, Ambrus, Rares, Gaidon, Adrien, Derpanis, Konstantinos G., Tokmakov, Pavel

This paper studies the problem of concept-based interpretability of transformer representations for videos. Concretely, we seek to explain the decision-making process of video transformers based on high-level, spatiotemporal concepts that are automat

Externí odkaz: http://arxiv.org/abs/2401.10831

Zobrazit plný text záznamu

Report

Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models

Autor: Chu, Wen-Hsuan, Harley, Adam W., Tokmakov, Pavel, Dave, Achal, Guibas, Leonidas, Fragkiadaki, Katerina

Object tracking is central to robot perception and scene understanding. Tracking-by-detection has long been a dominant paradigm for object tracking of specific object categories. Recently, large-scale pre-trained models have shown promising advances

Externí odkaz: http://arxiv.org/abs/2310.06992

Zobrazit plný text záznamu

Akademický článek

Endovascular surgery in patients with coronary artery disease in combination with cancer

Autor: B. G. Alekyan, A. A. Gritskevich, N. G. Karapetyan, D. V. Ruchkin, A. A. Pechetov, P. V. Markov, B. N. Gurmikov, N. L. Irodova, L. G. Gyoletsyan, E. V. Tokmakov, A. V. Galstyan, A. Sh. Revishvili

Publikováno v: Южно-Российский онкологический журнал, Vol 5, Iss 3, Pp 39-49 (2024)

Purpose of the study. To analyze the long-term results from various strategies of endovascular treatment for coronary artery disease (CAD) in patients concomitant with cancer.Patients and methods. 74 patients with both CAD disease and cancer were tre

Externí odkaz: https://doaj.org/article/dba7c59e02574537a2503de4fed4f09e

Zobrazit plný text záznamu

Akademický článek

Activation of T7 RNA polymerase in Xenopus oocytes and cell-free extracts A A Tokmakov & Y Fukami T7 RNAP activation in Xenopus oocytes.

Autor: Tokmakov, Alexander A.¹ tokmak@phoenix.kobe-u.ac.jp, Fukami, Yasuo¹

Publikováno v: Genes to Cells. Nov2010, Vol. 15 Issue 11, p1136-1144. 9p. 1 Diagram, 8 Graphs.

Zobrazit plný text záznamu

Report

Tracking through Containers and Occluders in the Wild

Autor: Van Hoorick, Basile, Tokmakov, Pavel, Stent, Simon, Li, Jie, Vondrick, Carl

Tracking objects with persistence in cluttered and dynamic environments remains a difficult challenge for computer vision systems. In this paper, we introduce $\textbf{TCOW}$, a new benchmark and model for visual tracking through heavy occlusion and

Externí odkaz: http://arxiv.org/abs/2305.03052

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání