Search results

Report

Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation

Authors: Yoneyama, Reo; Miyashita, Atsushi; Yamamoto, Ryuichi; Toda, Tomoki

Neural vocoders often struggle with aliasing in latent feature spaces, caused by time-domain nonlinear operations and resampling layers. Aliasing folds high-frequency components into the low-frequency range, making aliased and original frequency comp

External link: http://arxiv.org/abs/2411.06807

Report

MOS-Bench: Benchmarking Generalization Abilities of Subjective Speech Quality Assessment Models

Authors: Huang, Wen-Chin; Cooper, Erica; Toda, Tomoki

Subjective speech quality assessment (SSQA) is critical for evaluating speech samples as perceived by human listeners. While model-based SSQA has enjoyed great success thanks to the development of deep neural networks (DNNs), generalization remains a

External link: http://arxiv.org/abs/2411.03715

Report

Redshift-Space Distortion constraints on neutrino mass and models to alleviate the Hubble tension

Authors: Toda, Yo; Seto, Osamu

We discuss the neutrino mass and Hubble tension solutions and examine their effects on the Redshift-Space Distortion (RSD) observations. An analysis with RSD data indicates smaller amplitude of perturbation. Including RSD data results in a slightly w

External link: http://arxiv.org/abs/2410.21925

Report

Discovery of Quasi-Integrable Equations from traveling-wave data using the Physics-Informed Neural Networks

Authors: Nakamula, A.; Sawado, N.; Shimasaki, K.; Shimazaki, Y.; Suzuki, Y.; Toda, K.

Physics-Informed Neural Networks (PINNs) are used to study vortex solutions in the 2+1 dimensional nonlinear partial differential equations. These solutions include the regularized long-wave (RLW) equation and the Zakharov-Kuznetsov (ZK) equation, wh

External link: http://arxiv.org/abs/2410.19014

Report

Note on Bubbles Attached to Real Assets

Authors: Hirano, Tomohiro; Toda, Alexis Akira

A rational bubble is a situation in which the asset price exceeds its fundamental value defined by the present value of dividends in a rational equilibrium model. We discuss the recent development of the theory of rational bubbles attached to real as

External link: http://arxiv.org/abs/2410.17425

Report

Improved Architecture for High-resolution Piano Transcription to Efficiently Capture Acoustic Characteristics of Music Signals

Authors: Mi, Jinyi; Kim, Sehun; Toda, Tomoki

Automatic music transcription (AMT), aiming to convert musical signals into musical notation, is one of the important tasks in music information retrieval. Recently, previous works have applied high-resolution labels, i.e., the continuous onset and o

External link: http://arxiv.org/abs/2409.19614

Report

Two-stage Framework for Robust Speech Emotion Recognition Using Target Speaker Extraction in Human Speech Noise Conditions

Authors: Mi, Jinyi; Shi, Xiaohan; Ma, Ding; He, Jiajun; Fujimura, Takuya; Toda, Tomoki

Developing a robust speech emotion recognition (SER) system in noisy conditions faces challenges posed by different noise properties. Most previous studies have not considered the impact of human speech noise, thus limiting the application scope of S

External link: http://arxiv.org/abs/2409.19585

Report

Topological K-theory of quasi-BPS categories for Higgs bundles

Authors: Pădurariu, Tudor; Toda, Yukinobu

In a previous paper, we introduced quasi-BPS categories for moduli stacks of semistable Higgs bundles. Under a certain condition on the rank, Euler characteristic, and weight, the quasi-BPS categories (called BPS in this case) are non-commutative ana

External link: http://arxiv.org/abs/2409.10800

Report

Improvements of Discriminative Feature Space Training for Anomalous Sound Detection in Unlabeled Conditions

Authors: Fujimura, Takuya; Kuroyanagi, Ibuki; Toda, Tomoki

In anomalous sound detection, the discriminative method has demonstrated superior performance. This approach constructs a discriminative feature space through the classification of the meta-information labels for normal sounds. This feature space ref

External link: http://arxiv.org/abs/2409.09332

Report

The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction

Authors: Huang, Wen-Chin; Fu, Szu-Wei; Cooper, Erica; Zezario, Ryandhimas E.; Toda, Tomoki; Wang, Hsin-Min; Yamagishi, Junichi; Tsao, Yu

We present the third edition of the VoiceMOS Challenge, a scientific initiative designed to advance research into automatic prediction of human speech ratings. There were three tracks. The first track was on predicting the quality of "zoomed-in" hi

External link: http://arxiv.org/abs/2409.07001
