Zobrazeno 1 - 7
of 7
pro vyhledávání: '"Birchfield, Stan"'
Autor:
Zhou, Jinxing, Wang, Jianyuan, Zhang, Jiayi, Sun, Weixuan, Zhang, Jing, Birchfield, Stan, Guo, Dan, Kong, Lingpeng, Wang, Meng, Zhong, Yiran
We propose to explore a new problem called audio-visual segmentation (AVS), in which the goal is to output a pixel-level map of the object(s) that produce sound at the time of the image frame. To facilitate this research, we construct the first audio
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::22c4f1730299dfd7d96cc98e931c918d
Autor:
Sun, Weixuan, Qin, Zhen, Deng, Hui, Wang, Jianyuan, Zhang, Yi, Zhang, Kaihao, Barnes, Nick, Birchfield, Stan, Kong, Lingpeng, Zhong, Yiran
Vision transformers have shown great success on numerous computer vision tasks. However, its central component, softmax attention, prohibits vision transformers from scaling up to high-resolution images, due to both the computational complexity and m
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::cd1a7d3d33d80637a41295fdf2ecb518
Autor:
Tremblay, Jonathan, Meshry, Moustafa, Evans, Alex, Kautz, Jan, Keller, Alexander, Khamis, Sameh, Müller, Thomas, Loop, Charles, Morrical, Nathan, Nagano, Koki, Takikawa, Towaki, Birchfield, Stan
We present a large-scale synthetic dataset for novel view synthesis consisting of ~300k images rendered from nearly 2000 complex scenes using high-quality ray tracing at high resolution (1600 x 1600 pixels). The dataset is orders of magnitude larger
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ddacb0a7787851d3b1068da4515c2f11
We propose a single-stage, category-level 6-DoF pose estimation algorithm that simultaneously detects and tracks instances of objects within a known category. Our method takes as input the previous and current frame from a monocular RGB video, as wel
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c6d034654800d8d9630bb5e354e542bd
Publikováno v:
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
We present a system for multi-level scene awareness for robotic manipulation. Given a sequence of camera-in-hand RGB images, the system calculates three types of information: 1) a point cloud representation of all the surfaces in the scene, for the p
Prior work on 6-DoF object pose estimation has largely focused on instance-level processing, in which a textured CAD model is available for each object being detected. Category-level 6-DoF pose estimation represents an important step toward developin
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d419824d4dced6eb9a97b8fcf450b324
http://arxiv.org/abs/2109.06161
http://arxiv.org/abs/2109.06161
Autor:
Morrical, Nathan, Tremblay, Jonathan, Lin, Yunzhi, Tyree, Stephen, Birchfield, Stan, Pascucci, Valerio, Wald, Ingo
We present a Python-based renderer built on NVIDIA's OptiX ray tracing engine and the OptiX AI denoiser, designed to generate high-quality synthetic images for research in computer vision and deep learning. Our tool enables the description and manipu
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::fadf1fd7dd7ae083c77369b3bf83e799