Zobrazeno 1 - 10
of 11 163
pro vyhledávání: '"Zhu, Yi-An"'
Audio deepfake detection (ADD) is crucial to combat the misuse of speech synthesized from generative AI models. Existing ADD models suffer from generalization issues, with a large performance discrepancy between in-domain and out-of-domain data. More
Externí odkaz:
http://arxiv.org/abs/2407.18517
Autor:
Zhu, Yi, Falk, Tiago
Speech is known to carry health-related attributes, which has emerged as a novel venue for remote and long-term health monitoring. However, existing models are usually tailored for a specific type of disease, and have been shown to lack generalizabil
Externí odkaz:
http://arxiv.org/abs/2406.18731
Trajectory prediction forecasts nearby agents' moves based on their historical trajectories. Accurate trajectory prediction is crucial for autonomous vehicles. Existing attacks compromise the prediction model of a victim AV by directly manipulating t
Externí odkaz:
http://arxiv.org/abs/2406.11707
Autor:
Abdollahi, Mahsa, Zhu, Yi, Guimarães, Heitor R., Coallier, Nico, Maucourt, Ségolène, Giovenazzo, Pierre, Falk, Tiago H.
In this paper, we present a multimodal dataset obtained from a honey bee colony in Montr\'eal, Quebec, Canada, spanning the years of 2021 to 2022. This apiary comprised 10 beehives, with microphones recording more than 2000 hours of high quality raw
Externí odkaz:
http://arxiv.org/abs/2406.03657
Autor:
Lin, Bingqian, Nie, Yunshuang, Wei, Ziming, Zhu, Yi, Xu, Hang, Ma, Shikui, Liu, Jianzhuang, Liang, Xiaodan
Vision-Language Navigation (VLN) requires the agent to follow language instructions to reach a target position. A key factor for successful navigation is to align the landmarks implied in the instruction with diverse visual observations. However, pre
Externí odkaz:
http://arxiv.org/abs/2405.18721
Navigating efficiently across vortical flow fields presents a significant challenge in various robotic applications. The dynamic and unsteady nature of vortical flows often disturbs the control of underwater robots, complicating their operation in hy
Externí odkaz:
http://arxiv.org/abs/2405.14251
Autor:
Sun, Yutao, Dong, Li, Zhu, Yi, Huang, Shaohan, Wang, Wenhui, Ma, Shuming, Zhang, Quanlu, Wang, Jianyong, Wei, Furu
We introduce a decoder-decoder architecture, YOCO, for large language models, which only caches key-value pairs once. It consists of two components, i.e., a cross-decoder stacked upon a self-decoder. The self-decoder efficiently encodes global key-va
Externí odkaz:
http://arxiv.org/abs/2405.05254
This paper focuses on multi-agent stochastic differential games for jump-diffusion systems. On one hand, we study the multi-agent game for optimal investment in a jump-diffusion market. We derive constant Nash equilibria and provide sufficient condit
Externí odkaz:
http://arxiv.org/abs/2404.11967
Clickbaits are surprising social posts or deceptive news headlines that attempt to lure users for more clicks, which have posted at unprecedented rates for more profit or commercial revenue. The spread of clickbait has significant negative impacts on
Externí odkaz:
http://arxiv.org/abs/2404.11206
This work shows details of an evaluation of an observational system comprising a CMOS detector, 60-cm telescope, and filter complement. The system's photometric precision and differential photometric precision, and extinction coefficients were assess
Externí odkaz:
http://arxiv.org/abs/2403.12435