Výsledky vyhledávání - "WANG Kaixuan"

Report

Dynamic-Attention-based EEG State Transition Modeling for Emotion Recognition

Autor: Shen, Xinke, Gan, Runmin, Wang, Kaixuan, Yang, Shuyi, Zhang, Qingzhu, Liu, Quanying, Zhang, Dan, Song, Sen

Electroencephalogram (EEG)-based emotion decoding can objectively quantify people's emotional state and has broad application prospects in human-computer interaction and early detection of emotional disorders. Recently emerging deep learning architec

Externí odkaz: http://arxiv.org/abs/2411.04568

Zobrazit plný text záznamu

Report

RMAFF-PSN: A Residual Multi-Scale Attention Feature Fusion Photometric Stereo Network

Autor: Luo, Kai, Ju, Yakun, Qi, Lin, Wang, Kaixuan, Dong, Junyu

Publikováno v: Photonics 2023,10(5),548

Predicting accurate normal maps of objects from two-dimensional images in regions of complex structure and spatial material variations is challenging using photometric stereo methods due to the influence of surface reflection properties caused by var

Externí odkaz: http://arxiv.org/abs/2404.07766

Zobrazit plný text záznamu

Report

Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

Autor: Hu, Mu, Yin, Wei, Zhang, Chi, Cai, Zhipeng, Long, Xiaoxiao, Chen, Hao, Wang, Kaixuan, Yu, Gang, Shen, Chunhua, Shen, Shaojie

We introduce Metric3D v2, a geometric foundation model for zero-shot metric depth and surface normal estimation from a single image, which is crucial for metric 3D recovery. While depth and normal are geometrically related and highly complimentary, t

Externí odkaz: http://arxiv.org/abs/2404.15506

Zobrazit plný text záznamu

Report

GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Autor: Fu, Xiao, Yin, Wei, Hu, Mu, Wang, Kaixuan, Ma, Yuexin, Tan, Ping, Shen, Shaojie, Lin, Dahua, Long, Xiaoxiao

We introduce GeoWizard, a new generative foundation model designed for estimating geometric attributes, e.g., depth and normals, from single images. While significant research has already been conducted in this area, the progress has been substantial

Externí odkaz: http://arxiv.org/abs/2403.12013

Zobrazit plný text záznamu

Report

Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving

Autor: Cheng, JunDa, Yin, Wei, Wang, Kaixuan, Chen, Xiaozhi, Wang, Shijie, Yang, Xin

Multi-view depth estimation has achieved impressive performance over various benchmarks. However, almost all current multi-view systems rely on given ideal camera poses, which are unavailable in many real-world scenarios, such as autonomous driving.

Externí odkaz: http://arxiv.org/abs/2403.07535

Zobrazit plný text záznamu

Report

GIM: Learning Generalizable Image Matcher From Internet Videos

Autor: Shen, Xuelun, Cai, Zhipeng, Yin, Wei, Müller, Matthias, Li, Zijun, Wang, Kaixuan, Chen, Xiaozhi, Wang, Cheng

Image matching is a fundamental computer vision problem. While learning-based methods achieve state-of-the-art performance on existing benchmarks, they generalize poorly to in-the-wild images. Such methods typically need to train separate models for

Externí odkaz: http://arxiv.org/abs/2402.11095

Zobrazit plný text záznamu

Report

Artwork Protection Against Neural Style Transfer Using Locally Adaptive Adversarial Color Attack

Autor: Guo, Zhongliang, Dong, Junhao, Qian, Yifei, Wang, Kaixuan, Li, Weiye, Guo, Ziheng, Wang, Yuheng, Li, Yanli, Arandjelović, Ognjen, Fang, Lei

Neural style transfer (NST) generates new images by combining the style of one image with the content of another. However, unauthorized NST can exploit artwork, raising concerns about artists' rights and motivating the development of proactive protec

Externí odkaz: http://arxiv.org/abs/2401.09673

Zobrazit plný text záznamu

Report

UC-NeRF: Neural Radiance Field for Under-Calibrated Multi-view Cameras in Autonomous Driving

Autor: Cheng, Kai, Long, Xiaoxiao, Yin, Wei, Wang, Jin, Wu, Zhiqiang, Ma, Yuexin, Wang, Kaixuan, Chen, Xiaozhi, Chen, Xuejin

Multi-camera setups find widespread use across various applications, such as autonomous driving, as they greatly expand sensing capabilities. Despite the fast development of Neural radiance field (NeRF) techniques and their wide applications in both

Externí odkaz: http://arxiv.org/abs/2311.16945

Zobrazit plný text záznamu

Report

Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image

Autor: Yin, Wei, Zhang, Chi, Chen, Hao, Cai, Zhipeng, Yu, Gang, Wang, Kaixuan, Chen, Xiaozhi, Shen, Chunhua

Reconstructing accurate 3D scenes from images is a long-standing vision task. Due to the ill-posedness of the single-image reconstruction problem, most well-established methods are built upon multi-view geometry. State-of-the-art (SOTA) monocular met

Externí odkaz: http://arxiv.org/abs/2307.10984

Zobrazit plný text záznamu

Report

Retrieval-augmented GPT-3.5-based Text-to-SQL Framework with Sample-aware Prompting and Dynamic Revision Chain

Autor: Guo, Chunxi, Tian, Zhiliang, Tang, Jintao, Li, Shasha, Wen, Zhihua, Wang, Kaixuan, Wang, Ting

Text-to-SQL aims at generating SQL queries for the given natural language questions and thus helping users to query databases. Prompt learning with large language models (LLMs) has emerged as a recent approach, which designs prompts to lead LLMs to u

Externí odkaz: http://arxiv.org/abs/2307.05074

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání