Showing 1 - 10 of 54,797 results for search: '"A. Toda"'
Neural vocoders often struggle with aliasing in latent feature spaces, caused by time-domain nonlinear operations and resampling layers. Aliasing folds high-frequency components into the low-frequency range, making aliased and original frequency comp
External link:
http://arxiv.org/abs/2411.06807
Subjective speech quality assessment (SSQA) is critical for evaluating speech samples as perceived by human listeners. While model-based SSQA has enjoyed great success thanks to the development of deep neural networks (DNNs), generalization remains a
External link:
http://arxiv.org/abs/2411.03715
Authors:
Toda, Yo; Seto, Osamu
We discuss the neutrino mass and Hubble tension solutions and examine their effects on the Redshift-Space Distortion (RSD) observations. An analysis with RSD data indicates smaller amplitude of perturbation. Including RSD data results in a slightly w
External link:
http://arxiv.org/abs/2410.21925
Physics-Informed Neural Networks (PINNs) are used to study vortex solutions in the 2+1 dimensional nonlinear partial differential equations. These solutions include the regularized long-wave (RLW) equation and the Zakharov-Kuznetsov (ZK) equation, wh
External link:
http://arxiv.org/abs/2410.19014
Authors:
Hirano, Tomohiro; Toda, Alexis Akira
A rational bubble is a situation in which the asset price exceeds its fundamental value defined by the present value of dividends in a rational equilibrium model. We discuss the recent development of the theory of rational bubbles attached to real as
External link:
http://arxiv.org/abs/2410.17425
Automatic music transcription (AMT), aiming to convert musical signals into musical notation, is one of the important tasks in music information retrieval. Recently, previous works have applied high-resolution labels, i.e., the continuous onset and o
External link:
http://arxiv.org/abs/2409.19614
Developing a robust speech emotion recognition (SER) system in noisy conditions faces challenges posed by different noise properties. Most previous studies have not considered the impact of human speech noise, thus limiting the application scope of S
External link:
http://arxiv.org/abs/2409.19585
Authors:
Pădurariu, Tudor; Toda, Yukinobu
In a previous paper, we introduced quasi-BPS categories for moduli stacks of semistable Higgs bundles. Under a certain condition on the rank, Euler characteristic, and weight, the quasi-BPS categories (called BPS in this case) are non-commutative ana
External link:
http://arxiv.org/abs/2409.10800
In anomalous sound detection, the discriminative method has demonstrated superior performance. This approach constructs a discriminative feature space through the classification of the meta-information labels for normal sounds. This feature space ref
External link:
http://arxiv.org/abs/2409.09332
Authors:
Huang, Wen-Chin; Fu, Szu-Wei; Cooper, Erica; Zezario, Ryandhimas E.; Toda, Tomoki; Wang, Hsin-Min; Yamagishi, Junichi; Tsao, Yu
We present the third edition of the VoiceMOS Challenge, a scientific initiative designed to advance research into automatic prediction of human speech ratings. There were three tracks. The first track was on predicting the quality of "zoomed-in" hi
External link:
http://arxiv.org/abs/2409.07001