Výsledky vyhledávání - "Birchfield, Stan"

Audio-Visual Segmentation

Autor: Zhou, Jinxing, Wang, Jianyuan, Zhang, Jiayi, Sun, Weixuan, Zhang, Jing, Birchfield, Stan, Guo, Dan, Kong, Lingpeng, Wang, Meng, Zhong, Yiran

We propose to explore a new problem called audio-visual segmentation (AVS), in which the goal is to output a pixel-level map of the object(s) that produce sound at the time of the image frame. To facilitate this research, we construct the first audio

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::22c4f1730299dfd7d96cc98e931c918d

Zobrazit plný text záznamu

Vicinity Vision Transformer

Autor: Sun, Weixuan, Qin, Zhen, Deng, Hui, Wang, Jianyuan, Zhang, Yi, Zhang, Kaihao, Barnes, Nick, Birchfield, Stan, Kong, Lingpeng, Zhong, Yiran

Vision transformers have shown great success on numerous computer vision tasks. However, its central component, softmax attention, prohibits vision transformers from scaling up to high-resolution images, due to both the computational complexity and m

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::cd1a7d3d33d80637a41295fdf2ecb518

Zobrazit plný text záznamu

RTMV: A Ray-Traced Multi-View Synthetic Dataset for Novel View Synthesis

Autor: Tremblay, Jonathan, Meshry, Moustafa, Evans, Alex, Kautz, Jan, Keller, Alexander, Khamis, Sameh, Müller, Thomas, Loop, Charles, Morrical, Nathan, Nagano, Koki, Takikawa, Towaki, Birchfield, Stan

We present a large-scale synthetic dataset for novel view synthesis consisting of ~300k images rendered from nearly 2000 complex scenes using high-quality ray tracing at high resolution (1600 x 1600 pixels). The dataset is orders of magnitude larger

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ddacb0a7787851d3b1068da4515c2f11

Zobrazit plný text záznamu

Keypoint-Based Category-Level Object Pose Tracking from an RGB Sequence with Uncertainty Estimation

Autor: Lin, Yunzhi, Tremblay, Jonathan, Tyree, Stephen, Vela, Patricio A., Birchfield, Stan

We propose a single-stage, category-level 6-DoF pose estimation algorithm that simultaneously detects and tracks instances of objects within a known category. Our method takes as input the previous and current frame from a monocular RGB video, as wel

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c6d034654800d8d9630bb5e354e542bd

Zobrazit plný text záznamu

Multi-view Fusion for Multi-level Robotic Scene Understanding

Autor: Lin, Yunzhi, Tremblay, Jonathan, Tyree, Stephen, Vela, Patricio A., Birchfield, Stan

Publikováno v: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

We present a system for multi-level scene awareness for robotic manipulation. Given a sequence of camera-in-hand RGB images, the system calculates three types of information: 1) a point cloud representation of all the surfaces in the scene, for the p

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7b071116793011f2f763d3ee68a66419
https://doi.org/10.1109/iros51168.2021.9635994

Zobrazit plný text záznamu

Single-Stage Keypoint-Based Category-Level Object Pose Estimation from an RGB Image

Autor: Lin, Yunzhi, Tremblay, Jonathan, Tyree, Stephen, Vela, Patricio A., Birchfield, Stan

Prior work on 6-DoF object pose estimation has largely focused on instance-level processing, in which a textured CAD model is available for each object being detected. Category-level 6-DoF pose estimation represents an important step toward developin

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d419824d4dced6eb9a97b8fcf450b324
http://arxiv.org/abs/2109.06161

Zobrazit plný text záznamu

NViSII: A Scriptable Tool for Photorealistic Image Generation

Autor: Morrical, Nathan, Tremblay, Jonathan, Lin, Yunzhi, Tyree, Stephen, Birchfield, Stan, Pascucci, Valerio, Wald, Ingo

We present a Python-based renderer built on NVIDIA's OptiX ray tracing engine and the OptiX AI denoiser, designed to generate high-quality synthetic images for research in computer vision and deep learning. Our tool enables the description and manipu

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::fadf1fd7dd7ae083c77369b3bf83e799

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání