Výsledky vyhledávání

Report

Knowledge-Enhanced Facial Expression Recognition with Emotional-to-Neutral Transformation

Autor: Li, Hangyu, Xu, Yihan, Yao, Jiangchao, Wang, Nannan, Gao, Xinbo, Han, Bo

Existing facial expression recognition (FER) methods typically fine-tune a pre-trained visual encoder using discrete labels. However, this form of supervision limits to specify the emotional concept of different facial expressions. In this paper, we

Externí odkaz: http://arxiv.org/abs/2409.08598

Zobrazit plný text záznamu

Report

EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More

Autor: Chen, Kanghao, Liang, Guoqiang, Li, Hangyu, Lu, Yunfan, Wang, Lin

Event cameras offer significant advantages for low-light video enhancement, primarily due to their high dynamic range. Current research, however, is severely limited by the absence of large-scale, real-world, and spatio-temporally aligned event-video

Externí odkaz: http://arxiv.org/abs/2408.16254

Zobrazit plný text záznamu

Report

FlowDreamer: Exploring High Fidelity Text-to-3D Generation via Rectified Flow

Autor: Li, Hangyu, Chu, Xiangxiang, Shi, Dingyuan, Lin, Wang

Recent advances in text-to-3D generation have made significant progress. In particular, with the pretrained diffusion models, existing methods predominantly use Score Distillation Sampling (SDS) to train 3D models such as Neural RaRecent advances in

Externí odkaz: http://arxiv.org/abs/2408.05008

Zobrazit plný text záznamu

Report

A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness

Autor: Jiang, Lutao, Li, Hangyu, Wang, Lin

Publikováno v: ACM MM 2024

Text-to-3D content creation has recently received much attention, especially with the prevalence of 3D Gaussians Splatting. In general, GS-based methods comprise two key stages: initialization and rendering optimization. To achieve initialization, ex

Externí odkaz: http://arxiv.org/abs/2408.01269

Zobrazit plný text záznamu

Report

LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction

Autor: Chen, Kanghao, Li, Hangyu, Zhou, JiaZhou, Wang, Zeyu, Wang, Lin

Event cameras harness advantages such as low latency, high temporal resolution, and high dynamic range (HDR), compared to standard cameras. Due to the distinct imaging paradigm shift, a dominant line of research focuses on event-to-video (E2V) recons

Externí odkaz: http://arxiv.org/abs/2407.05547

Zobrazit plný text záznamu

Report

A Survey on Self-Evolution of Large Language Models

Autor: Tao, Zhengwei, Lin, Ting-En, Chen, Xiancai, Li, Hangyu, Wu, Yuchuan, Li, Yongbin, Jin, Zhi, Huang, Fei, Tao, Dacheng, Zhou, Jingren

Large language models (LLMs) have significantly advanced in various fields and intelligent agent applications. However, current LLMs that learn from human or external model supervision are costly and may face performance ceilings as task complexity a

Externí odkaz: http://arxiv.org/abs/2404.14387

Zobrazit plný text záznamu

Report

Towards Robust Event-guided Low-Light Image Enhancement: A Large-Scale Real-World Event-Image Dataset and Novel Approach

Autor: Liang, Guoqiang, Chen, Kanghao, Li, Hangyu, Lu, Yunfan, Wang, Lin

Event camera has recently received much attention for low-light image enhancement (LIE) thanks to their distinct advantages, such as high dynamic range. However, current research is prohibitively restricted by the lack of large-scale, real-world, and

Externí odkaz: http://arxiv.org/abs/2404.00834

Zobrazit plný text záznamu

Report

TLIC: Learned Image Compression with ROI-Weighted Distortion and Bit Allocation

Autor: Jiang, Wei, Zhai, Yongqi, Li, Hangyu, Wang, Ronggang

This short paper describes our method for the track of image compression. To achieve better perceptual quality, we use the adversarial loss to generate realistic textures, use region of interest (ROI) mask to guide the bit allocation for different re

Externí odkaz: http://arxiv.org/abs/2401.08154

Zobrazit plný text záznamu

Report

Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models

Autor: Gao, Haoyu, Lin, Ting-En, Li, Hangyu, Yang, Min, Wu, Yuchuan, Ma, Wentao, Li, Yongbin

Task-oriented dialogue (TOD) systems facilitate users in executing various activities via multi-turn dialogues, but Large Language Models (LLMs) often struggle to comprehend these intricate contexts. In this study, we propose a novel "Self-Explanatio

Externí odkaz: http://arxiv.org/abs/2309.12940

Zobrazit plný text záznamu

Report

On the Robotic Uncertainty of Fully Autonomous Traffic

Autor: Li, Hangyu, Sun, Xiaotong

Recent transportation research suggests that autonomous vehicles (AVs) have the potential to improve traffic flow efficiency as they are able to maintain smaller car-following distances. Nevertheless, being a unique class of ground robots, AVs are su

Externí odkaz: http://arxiv.org/abs/2309.12611

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání