Showing 1 - 10 of 1,285 for search: '"WANG Kaixuan"'
Author:
Shen, Xinke, Gan, Runmin, Wang, Kaixuan, Yang, Shuyi, Zhang, Qingzhu, Liu, Quanying, Zhang, Dan, Song, Sen
Electroencephalogram (EEG)-based emotion decoding can objectively quantify people's emotional state and has broad application prospects in human-computer interaction and early detection of emotional disorders. Recently emerging deep learning architec
External link:
http://arxiv.org/abs/2411.04568
Published in:
Photonics 2023, 10(5), 548
Predicting accurate normal maps of objects from two-dimensional images in regions of complex structure and spatial material variations is challenging using photometric stereo methods due to the influence of surface reflection properties caused by var
External link:
http://arxiv.org/abs/2404.07766
Author:
Hu, Mu, Yin, Wei, Zhang, Chi, Cai, Zhipeng, Long, Xiaoxiao, Chen, Hao, Wang, Kaixuan, Yu, Gang, Shen, Chunhua, Shen, Shaojie
We introduce Metric3D v2, a geometric foundation model for zero-shot metric depth and surface normal estimation from a single image, which is crucial for metric 3D recovery. While depth and normal are geometrically related and highly complementary, t
External link:
http://arxiv.org/abs/2404.15506
Author:
Fu, Xiao, Yin, Wei, Hu, Mu, Wang, Kaixuan, Ma, Yuexin, Tan, Ping, Shen, Shaojie, Lin, Dahua, Long, Xiaoxiao
We introduce GeoWizard, a new generative foundation model designed for estimating geometric attributes, e.g., depth and normals, from single images. While significant research has already been conducted in this area, the progress has been substantial
External link:
http://arxiv.org/abs/2403.12013
Multi-view depth estimation has achieved impressive performance over various benchmarks. However, almost all current multi-view systems rely on given ideal camera poses, which are unavailable in many real-world scenarios, such as autonomous driving.
External link:
http://arxiv.org/abs/2403.07535
Author:
Shen, Xuelun, Cai, Zhipeng, Yin, Wei, Müller, Matthias, Li, Zijun, Wang, Kaixuan, Chen, Xiaozhi, Wang, Cheng
Image matching is a fundamental computer vision problem. While learning-based methods achieve state-of-the-art performance on existing benchmarks, they generalize poorly to in-the-wild images. Such methods typically need to train separate models for
External link:
http://arxiv.org/abs/2402.11095
Author:
Guo, Zhongliang, Dong, Junhao, Qian, Yifei, Wang, Kaixuan, Li, Weiye, Guo, Ziheng, Wang, Yuheng, Li, Yanli, Arandjelović, Ognjen, Fang, Lei
Neural style transfer (NST) generates new images by combining the style of one image with the content of another. However, unauthorized NST can exploit artwork, raising concerns about artists' rights and motivating the development of proactive protec
External link:
http://arxiv.org/abs/2401.09673
Author:
Cheng, Kai, Long, Xiaoxiao, Yin, Wei, Wang, Jin, Wu, Zhiqiang, Ma, Yuexin, Wang, Kaixuan, Chen, Xiaozhi, Chen, Xuejin
Multi-camera setups find widespread use across various applications, such as autonomous driving, as they greatly expand sensing capabilities. Despite the fast development of neural radiance field (NeRF) techniques and their wide applications in both
External link:
http://arxiv.org/abs/2311.16945
Author:
Yin, Wei, Zhang, Chi, Chen, Hao, Cai, Zhipeng, Yu, Gang, Wang, Kaixuan, Chen, Xiaozhi, Shen, Chunhua
Reconstructing accurate 3D scenes from images is a long-standing vision task. Due to the ill-posedness of the single-image reconstruction problem, most well-established methods are built upon multi-view geometry. State-of-the-art (SOTA) monocular met
External link:
http://arxiv.org/abs/2307.10984
Author:
Guo, Chunxi, Tian, Zhiliang, Tang, Jintao, Li, Shasha, Wen, Zhihua, Wang, Kaixuan, Wang, Ting
Text-to-SQL aims at generating SQL queries for the given natural language questions and thus helping users to query databases. Prompt learning with large language models (LLMs) has emerged as a recent approach, which designs prompts to lead LLMs to u
External link:
http://arxiv.org/abs/2307.05074