Showing 1 - 10 of 320 for search: '"Ding, Yuxin"'
In this paper, we propose a new setting for generating product descriptions from images, augmented by marketing keywords. It leverages the combined power of visual and textual information to create descriptions that are more tailored to the unique fe
External link:
http://arxiv.org/abs/2402.13587
Author:
Zhou, Lillian, Ding, Yuxin, Chen, Mingqing, Zhang, Harry, Prabhavalkar, Rohit, Guliani, Dhruv, Motta, Giovanni, Mathews, Rajiv
Automatic speech recognition (ASR) models are typically trained on large datasets of transcribed speech. As language evolves and new terms come into use, these models can become outdated and stale. In the context of models trained on the server but d
External link:
http://arxiv.org/abs/2310.00141
A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues
Conditional inference on joint textual and visual clues is a multi-modal reasoning task in which textual clues provide prior permutation or external knowledge, which are complementary with visual content and pivotal to deducing the correct option. Previo
External link:
http://arxiv.org/abs/2305.04530
A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text
Pretrained Vision-Language Models (VLMs) have achieved remarkable performance in image retrieval from text. However, their performance drops drastically when confronted with linguistically complex texts that they struggle to comprehend. Inspired by t
External link:
http://arxiv.org/abs/2305.02265
Author:
Zhao, Yu, Li, Yunxin, Wu, Yuxiang, Hu, Baotian, Chen, Qingcai, Wang, Xiaolong, Ding, Yuxin, Zhang, Min
Medical dialogue generation is an important yet challenging task. Most previous works rely on the attention mechanism and large-scale pretrained language models. However, these methods often fail to acquire pivotal information from the long dialogue
External link:
http://arxiv.org/abs/2206.08611
Published in:
In Food Chemistry: X 30 June 2024 22
Author:
Pelaz, Sara G., Flores-Hernández, Raquel, Vujic, Tatjana, Schvartz, Domitille, Álvarez-Vázquez, Andrea, Ding, Yuxin, García-Vicente, Laura, Belloso, Aitana, Talaverón, Rocío, Sánchez, Jean-Charles, Tabernero, Arantxa
Published in:
In Translational Research October 2024 272:95-110
Single online handwritten Chinese character recognition (single OLHCCR) has achieved prominent performance. However, in real application scenarios, users always write multiple Chinese characters to form one complete sentence and the contextual inform
External link:
http://arxiv.org/abs/2108.02561
Author:
Li, Yunxin, Zhao, Yu, Hu, Baotian, Chen, Qingcai, Xiang, Yang, Wang, Xiaolong, Ding, Yuxin, Ma, Lin
Previous works indicate that the glyph of Chinese characters contains rich semantic information and has the potential to enhance the representation of Chinese characters. The typical method to utilize the glyph features is by incorporating them into
External link:
http://arxiv.org/abs/2107.00395
Published in:
In Heliyon 15 April 2024 10(7)