Showing 1 - 10 of 13,444 results for search: '"ZHANG, Huan"'
Outdoor images often suffer from severe degradation due to rain, haze, and noise, impairing image quality and challenging high-level tasks. Current image restoration methods struggle to handle complex degradation while maintaining efficiency. This pa
External link:
http://arxiv.org/abs/2411.07893
The rapid advancements in Vision-Language Models (VLMs) have shown great potential in tackling mathematical reasoning tasks that involve visual context. Unlike humans who can reliably apply solution steps to similar problems with minor modifications,
External link:
http://arxiv.org/abs/2411.00836
Open-source Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language understanding and generation, leading to widespread adoption across various domains. However, their increasing model sizes render local de
External link:
http://arxiv.org/abs/2410.22307
Authors:
Guo, Xingang, Keivan, Darioush, Syed, Usman, Qin, Lianhui, Zhang, Huan, Dullerud, Geir, Seiler, Peter, Hu, Bin
Control system design is a crucial aspect of modern engineering with far-reaching applications across diverse sectors including aerospace, automotive systems, power grids, and robotics. Despite advances made by Large Language Models (LLMs) in various
External link:
http://arxiv.org/abs/2410.19811
This paper provides a detailed analysis of the NeuroPiano dataset, which comprises 104 audio recordings of student piano performances accompanied by 2255 textual feedback and ratings given by professional pianists. We offer a statistical overview of
External link:
http://arxiv.org/abs/2410.03139
Research in music understanding has extensively explored composition-level attributes such as key, genre, and instrumentation through advanced representations, leading to cross-modal applications using large language models. However, aspects of music
External link:
http://arxiv.org/abs/2409.08795
Music is inherently made up of complex structures, and representing them as graphs helps to capture multiple levels of relationships. While music generation has been explored using various deep generation techniques, research on graph-related music g
External link:
http://arxiv.org/abs/2409.08155
In the domain of human-computer interaction, accurately recognizing and interpreting human emotions is crucial yet challenging due to the complexity and subtlety of emotional expressions. This study explores the potential for detecting a rich and fle
External link:
http://arxiv.org/abs/2409.07901
Rapid advancements in artificial intelligence have significantly enhanced generative tasks involving music and images, employing both unimodal and multimodal approaches. This research develops a model capable of generating music that resonates with t
External link:
http://arxiv.org/abs/2409.07827
Large language models (LLMs) have achieved impressive performance on code generation. Although prior studies enhanced LLMs with prompting techniques and code refinement, they still struggle with complex programming problems due to rigid solution plan
External link:
http://arxiv.org/abs/2409.05001