Zobrazeno 1 - 10
of 9 210
pro vyhledávání: '"JANEK, A"'
The rapid growth of camera-based IoT devices demands the need for efficient video compression, particularly for edge applications where devices face hardware constraints, often with only 1 or 2 MB of RAM and unstable internet connections. Traditional
Externí odkaz:
http://arxiv.org/abs/2411.19442
Autor:
Karner, Clemens, Gröhl, Janek, Selby, Ian, Babar, Judith, Beckford, Jake, Else, Thomas R, Sadler, Timothy J, Shahipasand, Shahab, Thavakumar, Arthikkaa, Roberts, Michael, Rudd, James H. F., Schönlieb, Carola-Bibiane, Weir-McCall, Jonathan R, Breger, Anna
When developing machine learning models, image quality assessment (IQA) measures are a crucial component for evaluation. However, commonly used IQA measures have been primarily developed and optimized for natural images. In many specialized settings,
Externí odkaz:
http://arxiv.org/abs/2410.24098
Several attempts have been made to handle multiple source separation tasks such as speech enhancement, speech separation, sound event separation, music source separation (MSS), or cinematic audio source separation (CASS) with a single model. These mo
Externí odkaz:
http://arxiv.org/abs/2410.23987
The architecture of Vision Transformers (ViTs), particularly the Multi-head Attention (MHA) mechanism, imposes substantial hardware demands. Deploying ViTs on devices with varying constraints, such as mobile phones, requires multiple models of differ
Externí odkaz:
http://arxiv.org/abs/2409.17978
Autor:
Saijo, Kohei, Ebbers, Janek, Germain, François G., Khurana, Sameer, Wichern, Gordon, Roux, Jonathan Le
The goal of text-queried target sound extraction (TSE) is to extract from a mixture a sound source specified with a natural-language caption. While it is preferable to have access to large-scale text-audio pairs to address a variety of text prompts,
Externí odkaz:
http://arxiv.org/abs/2409.13152
In this paper, we contribute to the mathematical foundations of the recently established theory of Spectrum Broadcast Structures (SBS). These are multipartite quantum states, encoding an operational notion of objectivity and exhibiting a more advance
Externí odkaz:
http://arxiv.org/abs/2409.12372
Detecting flying animals (e.g., birds, bats, and insects) using weather radar helps gain insights into animal movement and migration patterns, aids in management efforts (such as biosecurity) and enhances our understanding of the ecosystem.The conven
Externí odkaz:
http://arxiv.org/abs/2408.04424
Research on token-level reference-free hallucination detection has predominantly focused on English, primarily due to the scarcity of robust datasets in other languages. This has hindered systematic investigations into the effectiveness of cross-ling
Externí odkaz:
http://arxiv.org/abs/2407.13702
Autor:
Wang, Xing, Dabrowski, Joel Janek, Pinskier, Josh, Liow, Lois, Viswanathan, Vinoth, Scalzo, Richard, Howard, David
Modelling complex deformation for soft robotics provides a guideline to understand their behaviour, leading to safe interaction with the environment. However, building a surrogate model with high accuracy and fast inference speed can be challenging f
Externí odkaz:
http://arxiv.org/abs/2407.08222
Autor:
Cornell, Samuele, Ebbers, Janek, Douwes, Constance, Martín-Morató, Irene, Harju, Manu, Mesaros, Annamaria, Serizel, Romain
The Detection and Classification of Acoustic Scenes and Events Challenge Task 4 aims to advance sound event detection (SED) systems in domestic environments by leveraging training data with different supervision uncertainty. Participants are challeng
Externí odkaz:
http://arxiv.org/abs/2406.08056