Zobrazeno 1 - 10
of 6 299
pro vyhledávání: '"CHENG, Bo"'
Autor:
Cheng, Bo, Ma, Yuhang, Wu, Liebucha, Liu, Shanyuan, Ma, Ao, Wu, Xiaoyu, Leng, Dawei, Yin, Yuhui
The task of layout-to-image generation involves synthesizing images based on the captions of objects and their spatial positions. Existing methods still struggle in complex layout generation, where common bad cases include object missing, inconsisten
Externí odkaz:
http://arxiv.org/abs/2410.14324
Synthesizing motion-rich and temporally consistent videos remains a challenge in artificial intelligence, especially when dealing with extended durations. Existing text-to-video (T2V) models commonly employ spatial cross-attention for text control, e
Externí odkaz:
http://arxiv.org/abs/2408.08189
Different languages have distinct phonetic systems and vary in their prosodic features making it challenging to develop a Text-to-Speech (TTS) model that can effectively synthesise speech in multilingual settings. Furthermore, TTS architecture needs
Externí odkaz:
http://arxiv.org/abs/2406.17257
Neural speech synthesis, or text-to-speech (TTS), aims to transform a signal from the text domain to the speech domain. While developing TTS architectures that train and test on the same set of speakers has seen significant improvements, out-of-domai
Externí odkaz:
http://arxiv.org/abs/2404.04645
Dissertation/ Thesis
Autor:
Cheng, Bo
This dissertation provides a comprehensive study of the phenomenon of transnational M&A of Chinese chip companies using a combination of literature review, mathematical statistics, and logical analysis. Sixteen representative cases are selected, and
Externí odkaz:
http://hdl.handle.net/20.500.12613/9569
Neural Text-to-Speech (TTS) systems find broad applications in voice assistants, e-learning, and audiobook creation. The pursuit of modern models, like Diffusion Models (DMs), holds promise for achieving high-fidelity, real-time speech synthesis. Yet
Externí odkaz:
http://arxiv.org/abs/2404.00569
Autor:
Habas, Bryan, Cheng, Bo
Inverted landing is a routine behavior among a number of animal fliers. However, mastering this feat poses a considerable challenge for robotic fliers, especially to perform dynamic perching with rapid body rotations (or flips) and landing against gr
Externí odkaz:
http://arxiv.org/abs/2403.00128
Continual Few-shot Relation Extraction (CFRE) is a practical problem that requires the model to continuously learn novel relations while avoiding forgetting old ones with few labeled training data. The primary challenges are catastrophic forgetting a
Externí odkaz:
http://arxiv.org/abs/2402.15713
In this paper, we propose an algorithm that allows joint refinement of camera pose and scene geometry represented by decomposed low-rank tensor, using only 2D images as supervision. First, we conduct a pilot study based on a 1D signal and relate our
Externí odkaz:
http://arxiv.org/abs/2402.13252
Query-based methods have garnered significant attention in object detection since the advent of DETR, the pioneering query-based detector. However, these methods face challenges like slow convergence and suboptimal performance. Notably, self-attentio
Externí odkaz:
http://arxiv.org/abs/2310.06470