Výsledky vyhledávání - "Liu, Jia‐Wei"

Report

Skinned Motion Retargeting with Dense Geometric Interaction Perception

Autor: Ye, Zijie, Liu, Jia-Wei, Jia, Jia, Sun, Shikun, Shou, Mike Zheng

Capturing and maintaining geometric interactions among different body parts is crucial for successful motion retargeting in skinned characters. Existing approaches often overlook body geometries or add a geometry correction stage after skeletal motio

Externí odkaz: http://arxiv.org/abs/2410.20986

Zobrazit plný text záznamu

Report

VideoLLM-online: Online Video Large Language Model for Streaming Video

Autor: Chen, Joya, Lv, Zhaoyang, Wu, Shiwei, Lin, Kevin Qinghong, Song, Chenan, Gao, Difei, Liu, Jia-Wei, Gao, Ziteng, Mao, Dongxing, Shou, Mike Zheng

Recent Large Language Models have been enhanced with vision capabilities, enabling them to comprehend images, videos, and interleaved vision-language content. However, the learning methods of these large multimodal models typically treat videos as pr

Externí odkaz: http://arxiv.org/abs/2406.11816

Zobrazit plný text záznamu

Report

ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors

Autor: Mao, Weijia, Cao, Yan-Pei, Liu, Jia-Wei, Xu, Zhongcong, Shou, Mike Zheng

We introduce ShowRoom3D, a three-stage approach for generating high-quality 3D room-scale scenes from texts. Previous methods using 2D diffusion priors to optimize neural radiance fields for generating room-scale scenes have shown unsatisfactory qual

Externí odkaz: http://arxiv.org/abs/2312.13324

Zobrazit plný text záznamu

Report

X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

Autor: Ran, Lingmin, Cun, Xiaodong, Liu, Jia-Wei, Zhao, Rui, Zijie, Song, Wang, Xintao, Keppo, Jussi, Shou, Mike Zheng

We introduce X-Adapter, a universal upgrader to enable the pretrained plug-and-play modules (e.g., ControlNet, LoRA) to work directly with the upgraded text-to-image diffusion model (e.g., SDXL) without further retraining. We achieve this goal by tra

Externí odkaz: http://arxiv.org/abs/2312.02238

Zobrazit plný text záznamu

Report

VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

Autor: Gu, Yuchao, Zhou, Yipin, Wu, Bichen, Yu, Licheng, Liu, Jia-Wei, Zhao, Rui, Wu, Jay Zhangjie, Zhang, David Junhao, Shou, Mike Zheng, Tang, Kevin

Current diffusion-based video editing primarily focuses on structure-preserved editing by utilizing various dense correspondences to ensure temporal consistency and motion alignment. However, these approaches are often ineffective when the target edi

Externí odkaz: http://arxiv.org/abs/2312.02087

Zobrazit plný text záznamu

Report

ColonNeRF: High-Fidelity Neural Reconstruction of Long Colonoscopy

Autor: Shi, Yufei, Lu, Beijia, Liu, Jia-Wei, Li, Ming, Shou, Mike Zheng

Colonoscopy reconstruction is pivotal for diagnosing colorectal cancer. However, accurate long-sequence colonoscopy reconstruction faces three major challenges: (1) dissimilarity among segments of the colon due to its meandering and convoluted shape;

Externí odkaz: http://arxiv.org/abs/2312.02015

Zobrazit plný text záznamu

Report

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Autor: Grauman, Kristen, Westbury, Andrew, Torresani, Lorenzo, Kitani, Kris, Malik, Jitendra, Afouras, Triantafyllos, Ashutosh, Kumar, Baiyya, Vijay, Bansal, Siddhant, Boote, Bikram, Byrne, Eugene, Chavis, Zach, Chen, Joya, Cheng, Feng, Chu, Fu-Jen, Crane, Sean, Dasgupta, Avijit, Dong, Jing, Escobar, Maria, Forigua, Cristhian, Gebreselasie, Abrham, Haresh, Sanjay, Huang, Jing, Islam, Md Mohaiminul, Jain, Suyog, Khirodkar, Rawal, Kukreja, Devansh, Liang, Kevin J, Liu, Jia-Wei, Majumder, Sagnik, Mao, Yongsen, Martin, Miguel, Mavroudi, Effrosyni, Nagarajan, Tushar, Ragusa, Francesco, Ramakrishnan, Santhosh Kumar, Seminara, Luigi, Somayazulu, Arjun, Song, Yale, Su, Shan, Xue, Zihui, Zhang, Edward, Zhang, Jinxu, Castillo, Angela, Chen, Changan, Fu, Xinzhu, Furuta, Ryosuke, Gonzalez, Cristina, Gupta, Prince, Hu, Jiabo, Huang, Yifei, Huang, Yiming, Khoo, Weslie, Kumar, Anush, Kuo, Robert, Lakhavani, Sach, Liu, Miao, Luo, Mi, Luo, Zhengyi, Meredith, Brighid, Miller, Austin, Oguntola, Oluwatumininu, Pan, Xiaqing, Peng, Penny, Pramanick, Shraman, Ramazanova, Merey, Ryan, Fiona, Shan, Wei, Somasundaram, Kiran, Song, Chenan, Southerland, Audrey, Tateno, Masatoshi, Wang, Huiyu, Wang, Yuchen, Yagi, Takuma, Yan, Mingfei, Yang, Xitong, Yu, Zecheng, Zha, Shengxin Cindy, Zhao, Chen, Zhao, Ziwei, Zhu, Zhifan, Zhuo, Jeff, Arbelaez, Pablo, Bertasius, Gedas, Crandall, David, Damen, Dima, Engel, Jakob, Farinella, Giovanni Maria, Furnari, Antonino, Ghanem, Bernard, Hoffman, Judy, Jawahar, C. V., Newcombe, Richard, Park, Hyun Soo, Rehg, James M., Sato, Yoichi, Savva, Manolis, Shi, Jianbo, Shou, Mike Zheng, Wray, Michael

We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike re

Externí odkaz: http://arxiv.org/abs/2311.18259

Zobrazit plný text záznamu

Report

DeformGS: Scene Flow in Highly Deformable Scenes for Deformable Object Manipulation

Autor: Duisterhof, Bardienus P., Mandi, Zhao, Yao, Yunchao, Liu, Jia-Wei, Seidenschwarz, Jenny, Shou, Mike Zheng, Ramanan, Deva, Song, Shuran, Birchfield, Stan, Wen, Bowen, Ichnowski, Jeffrey

Teaching robots to fold, drape, or reposition deformable objects such as cloth will unlock a variety of automation applications. While remarkable progress has been made for rigid object manipulation, manipulating deformable objects poses unique chall

Externí odkaz: http://arxiv.org/abs/2312.00583

Zobrazit plný text záznamu

Report

Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer

Autor: Zhang, Wenqiao, Lv, Zheqi, Zhou, Hao, Liu, Jia-Wei, Li, Juncheng, Li, Mengze, Tang, Siliang, Zhuang, Yueting

Active Domain Adaptation (ADA) aims to maximally boost model adaptation in a new target domain by actively selecting a limited number of target data to annotate.This setting neglects the more practical scenario where training data are collected from

Externí odkaz: http://arxiv.org/abs/2311.12905

Zobrazit plný text záznamu

Report

Instant3D: Instant Text-to-3D Generation

Autor: Li, Ming, Zhou, Pan, Liu, Jia-Wei, Keppo, Jussi, Lin, Min, Yan, Shuicheng, Xu, Xiangyu

Text-to-3D generation has attracted much attention from the computer vision community. Existing methods mainly optimize a neural field from scratch for each text prompt, relying on heavy and repetitive training cost which impedes their practical depl

Externí odkaz: http://arxiv.org/abs/2311.08403

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání