Zobrazeno 1 - 10
of 35 752
pro vyhledávání: '"A. Yamagishi"'
In this work, the spatiotemporal pressure field of MHz-focused ultrasound is measured using a background-oriented schlieren technique combined with fast checkerboard demodulation and vector tomography (VT-BOS). Hydrophones have been commonly employed
Externí odkaz:
http://arxiv.org/abs/2410.23652
Autor:
Yamagishi, Yosuke, Hanaoka, Shouhei
In this work, we present our solution for the MICCAI 2024 CXR-LT challenge, achieving 4th place in Subtask 2 and 5th in Subtask 1. We leveraged an ensemble of ConvNeXt V2 and MaxViT models, pretrained on an external chest X-ray dataset, to address th
Externí odkaz:
http://arxiv.org/abs/2410.10710
The First VoicePrivacy Attacker Challenge is a new kind of challenge organized as part of the VoicePrivacy initiative and supported by ICASSP 2025 as the SP Grand Challenge It focuses on developing attacker systems against voice anonymization, which
Externí odkaz:
http://arxiv.org/abs/2410.07428
Target speaker extraction (TSE) aims to isolate individual speaker voices from complex speech environments. The effectiveness of TSE systems is often compromised when the speaker characteristics are similar to each other. Recent research has introduc
Externí odkaz:
http://arxiv.org/abs/2410.00811
In this work, we present AfriHuBERT, an extension of mHuBERT-147, a state-of-the-art (SOTA) and compact self-supervised learning (SSL) model, originally pretrained on 147 languages. While mHuBERT-147 was pretrained on 16 African languages, we expand
Externí odkaz:
http://arxiv.org/abs/2409.20201
Monod's law is a widely accepted phenomenology for bacterial growth. Since it has the same functional form as the Michaelis--Menten equation for enzyme kinetics, cell growth is often considered to be locally constrained by a single reaction. In contr
Externí odkaz:
http://arxiv.org/abs/2409.12482
Autor:
Sequeira, Ian, Barabas, Andrew Z., Barajas-Aguilar, Aaron H, Bacani, Michaela G, Nakatsuji, Naoto, Koshino, Mikito, Taniguichi, Takashi, Watanabe, Kenji, Sanchez-Yamagishi, Javier D.
Van der Waals (vdW) moires offer tunable superlattices that can strongly manipulate electronic properties. We demonstrate the in-situ manipulation of moire superlattices via heterostrain control in a vdW device. By straining a graphene layer relative
Externí odkaz:
http://arxiv.org/abs/2409.07427
Autor:
Huang, Wen-Chin, Fu, Szu-Wei, Cooper, Erica, Zezario, Ryandhimas E., Toda, Tomoki, Wang, Hsin-Min, Yamagishi, Junichi, Tsao, Yu
We present the third edition of the VoiceMOS Challenge, a scientific initiative designed to advance research into automatic prediction of human speech ratings. There were three tracks. The first track was on predicting the quality of ``zoomed-in'' hi
Externí odkaz:
http://arxiv.org/abs/2409.07001
In real-world applications, it is challenging to build a speaker verification system that is simultaneously robust against common threats, including spoofing attacks, channel mismatch, and domain mismatch. Traditional automatic speaker verification (
Externí odkaz:
http://arxiv.org/abs/2409.06327
Autor:
Chen, Zhengyang, Wang, Shuai, Zhang, Mingyang, Liu, Xuechen, Yamagishi, Junichi, Qian, Yanmin
Voice conversion (VC) aims to modify the speaker's timbre while retaining speech content. Previous approaches have tokenized the outputs from self-supervised into semantic tokens, facilitating disentanglement of speech content information. Recently,
Externí odkaz:
http://arxiv.org/abs/2409.05004