Zobrazeno 1 - 10
of 27
pro vyhledávání: '"Zhao, Zhongzhou"'
Existing sign language translation methods follow a two-stage pipeline: first converting the sign language video to a gloss sequence (i.e. Sign2Gloss) and then translating the generated gloss sequence into a spoken language sentence (i.e. Gloss2Text)
Externí odkaz:
http://arxiv.org/abs/2312.10210
Existing fine-grained intensity regulation methods rely on explicit control through predicted emotion probabilities. However, these high-level semantic probabilities are often inaccurate and unsmooth at the phoneme level, leading to bias in learning.
Externí odkaz:
http://arxiv.org/abs/2307.00020
Existing data-to-text generation efforts mainly focus on generating a coherent text from non-linguistic input data, such as tables and attribute-value pairs, but overlook that different application scenarios may require texts of different styles. Ins
Externí odkaz:
http://arxiv.org/abs/2305.03256
Existing neural methods have shown great potentials towards generating informative text from structured tabular data as well as maintaining high content fidelity. However, few of them shed light on generating personalized expressions, which often req
Externí odkaz:
http://arxiv.org/abs/2304.08911
Digital human recommendation system has been developed to help customers find their favorite products and is playing an active role in various recommendation contexts. How to timely catch and learn the dynamics of the preferences of the customers, wh
Externí odkaz:
http://arxiv.org/abs/2210.10638
Recommender systems can automatically recommend users with items that they probably like. The goal of them is to model the user-item interaction by effectively representing the users and items. Existing methods have primarily learned the user's prefe
Externí odkaz:
http://arxiv.org/abs/2201.09490
Autor:
Xu, Guohai, Chen, Hehong, Li, Feng-Lin, Sun, Fu, Shi, Yunzhou, Zeng, Zhixiong, Zhou, Wei, Zhao, Zhongzhou, Zhang, Ji
Live streaming is becoming an increasingly popular trend of sales in E-commerce. The core of live-streaming sales is to encourage customers to purchase in an online broadcasting room. To enable customers to better understand a product without jumping
Externí odkaz:
http://arxiv.org/abs/2109.07411
Existing data-driven methods can well handle short text generation. However, when applied to the long-text generation scenarios such as story generation or advertising text generation in the commercial scenario, these methods may generate illogical a
Externí odkaz:
http://arxiv.org/abs/2108.07998
Many generation tasks follow a one-to-many mapping relationship: each input could be associated with multiple outputs. Existing methods like Conditional Variational AutoEncoder(CVAE) employ a latent variable to model this one-to-many relationship. Ho
Externí odkaz:
http://arxiv.org/abs/2108.07535
Vision-and-language pretraining (VLP) aims to learn generic multimodal representations from massive image-text pairs. While various successful attempts have been proposed, learning fine-grained semantic alignments between image-text pairs plays a key
Externí odkaz:
http://arxiv.org/abs/2108.07073