Showing 1 - 10 of 764 for search: '"Li, Hangyu"'
Existing facial expression recognition (FER) methods typically fine-tune a pre-trained visual encoder using discrete labels. However, this form of supervision limits the ability to specify the emotional concept of different facial expressions. In this paper, we
External link:
http://arxiv.org/abs/2409.08598
Event cameras offer significant advantages for low-light video enhancement, primarily due to their high dynamic range. Current research, however, is severely limited by the absence of large-scale, real-world, and spatio-temporally aligned event-video
External link:
http://arxiv.org/abs/2408.16254
Recent advances in text-to-3D generation have made significant progress. In particular, with the pretrained diffusion models, existing methods predominantly use Score Distillation Sampling (SDS) to train 3D models such as Neural Ra
External link:
http://arxiv.org/abs/2408.05008
Published in:
ACM MM 2024
Text-to-3D content creation has recently received much attention, especially with the prevalence of 3D Gaussian Splatting. In general, GS-based methods comprise two key stages: initialization and rendering optimization. To achieve initialization, ex
External link:
http://arxiv.org/abs/2408.01269
Event cameras harness advantages such as low latency, high temporal resolution, and high dynamic range (HDR), compared to standard cameras. Due to the distinct imaging paradigm shift, a dominant line of research focuses on event-to-video (E2V) recons
External link:
http://arxiv.org/abs/2407.05547
Authors:
Tao, Zhengwei, Lin, Ting-En, Chen, Xiancai, Li, Hangyu, Wu, Yuchuan, Li, Yongbin, Jin, Zhi, Huang, Fei, Tao, Dacheng, Zhou, Jingren
Large language models (LLMs) have significantly advanced in various fields and intelligent agent applications. However, current LLMs that learn from human or external model supervision are costly and may face performance ceilings as task complexity a
External link:
http://arxiv.org/abs/2404.14387
Event cameras have recently received much attention for low-light image enhancement (LIE) thanks to their distinct advantages, such as high dynamic range. However, current research is prohibitively restricted by the lack of large-scale, real-world, and
External link:
http://arxiv.org/abs/2404.00834
This short paper describes our method for the image compression track. To achieve better perceptual quality, we use an adversarial loss to generate realistic textures, use a region of interest (ROI) mask to guide the bit allocation for different re
External link:
http://arxiv.org/abs/2401.08154
Task-oriented dialogue (TOD) systems facilitate users in executing various activities via multi-turn dialogues, but Large Language Models (LLMs) often struggle to comprehend these intricate contexts. In this study, we propose a novel "Self-Explanatio
External link:
http://arxiv.org/abs/2309.12940
Authors:
Li, Hangyu, Sun, Xiaotong
Recent transportation research suggests that autonomous vehicles (AVs) have the potential to improve traffic flow efficiency as they are able to maintain smaller car-following distances. Nevertheless, being a unique class of ground robots, AVs are su
External link:
http://arxiv.org/abs/2309.12611