Zobrazeno 1 - 10
of 97 481
pro vyhledávání: '"An Lijuan"'
In reading garden-path sentences, people must resolve competing interpretations, though initial misinterpretations can linger despite reanalysis. This study examines the role of inhibitory control (IC) in managing these misinterpretations among Chine
Externí odkaz:
http://arxiv.org/abs/2412.10006
Autor:
Liu, Weihua, Boumaraf, Said, Li, Jianwu, Lin, Chaochao, Liu, Xiabi, Niu, Lijuan, Werghi, Naoufel
Natural gradient descent (NGD) is a powerful optimization technique for machine learning, but the computational complexity of the inverse Fisher information matrix limits its application in training deep neural networks. To overcome this challenge, w
Externí odkaz:
http://arxiv.org/abs/2412.07441
Autor:
Wang, Xiyao, Yang, Zhengyuan, Li, Linjie, Lu, Hongjin, Xu, Yuancheng, Lin, Chung-Ching, Lin, Kevin, Huang, Furong, Wang, Lijuan
Despite significant advancements in vision-language models (VLMs), there lacks effective approaches to enhance response quality by scaling inference-time computation. This capability is known to be a core step towards the self-improving models in rec
Externí odkaz:
http://arxiv.org/abs/2412.03704
Autor:
Lin, Kevin Qinghong, Li, Linjie, Gao, Difei, Yang, Zhengyuan, Wu, Shiwei, Bai, Zechen, Lei, Weixian, Wang, Lijuan, Shou, Mike Zheng
Building Graphical User Interface (GUI) assistants holds significant promise for enhancing human workflow productivity. While most agents are language-based, relying on closed-source API with text-rich meta-information (e.g., HTML or accessibility tr
Externí odkaz:
http://arxiv.org/abs/2411.17465
Text-to-image diffusion models have demonstrated tremendous success in synthesizing visually stunning images given textual instructions. Despite remarkable progress in creating high-fidelity visuals, text-to-image models can still struggle with preci
Externí odkaz:
http://arxiv.org/abs/2411.16713
Autor:
Sun, Wenkui, Fan, Xiaoya, Jia, Lijuan, Chu, Tinyi, Yau, Shing-Tung, Wu, Rongling, Wang, Zhong
Differential equations offer a foundational yet powerful framework for modeling interactions within complex dynamic systems and are widely applied across numerous scientific fields. One common challenge in this area is estimating the unknown paramete
Externí odkaz:
http://arxiv.org/abs/2411.08651
Autor:
Liu, Qin, Wang, Jianfeng, Yang, Zhengyuan, Li, Linjie, Lin, Kevin, Niethammer, Marc, Wang, Lijuan
Semi-supervised video object segmentation (VOS) has been largely driven by space-time memory (STM) networks, which store past frame features in a spatiotemporal memory to segment the current frame via softmax attention. However, STM networks face mem
Externí odkaz:
http://arxiv.org/abs/2411.02818
Autor:
Zhao, Yuyang, Lin, Chung-Ching, Lin, Kevin, Yan, Zhiwen, Li, Linjie, Yang, Zhengyuan, Wang, Jianfeng, Lee, Gim Hee, Wang, Lijuan
Recent developments in 2D visual generation have been remarkably successful. However, 3D and 4D generation remain challenging in real-world applications due to the lack of large-scale 4D data and effective model design. In this paper, we propose to j
Externí odkaz:
http://arxiv.org/abs/2411.02319
Autor:
Hong, Yining, Liu, Beide, Wu, Maxine, Zhai, Yuanhao, Chang, Kai-Wei, Li, Linjie, Lin, Kevin, Lin, Chung-Ching, Wang, Jianfeng, Yang, Zhengyuan, Wu, Yingnian, Wang, Lijuan
Human beings are endowed with a complementary learning system, which bridges the slow learning of general world dynamics with fast storage of episodic memory from a new experience. Previous video generation models, however, primarily focus on slow le
Externí odkaz:
http://arxiv.org/abs/2410.23277
Autor:
Xia, Peng, Han, Siwei, Qiu, Shi, Zhou, Yiyang, Wang, Zhaoyang, Zheng, Wenhao, Chen, Zhaorun, Cui, Chenhang, Ding, Mingyu, Li, Linjie, Wang, Lijuan, Yao, Huaxiu
Interleaved multimodal comprehension and generation, enabling models to produce and interpret both images and text in arbitrary sequences, have become a pivotal area in multimodal learning. Despite significant advancements, the evaluation of this cap
Externí odkaz:
http://arxiv.org/abs/2410.10139