Showing 1 - 10 of 107 for search: '"Minematsu, Nobuaki"'
Utterances by L2 speakers can be unintelligible due to mispronunciation and improper prosody. In computer-aided language learning systems, textual feedback is often provided using a speech recognition engine. However, an ideal form of feedback for L2
External link:
http://arxiv.org/abs/2410.02239
Evaluating speech intelligibility is a critical task in computer-aided language learning systems. Traditional methods often rely on word error rates (WER) provided by automatic speech recognition (ASR) as intelligibility scores. However, this approac
External link:
http://arxiv.org/abs/2409.11742
We propose a method of simulating the human process of foreign accentuation using Generative Spoken Language Model (GSLM) only with native speech corpora. When one listens to spoken words of a foreign language and repeats them, the repeated speech is
External link:
http://arxiv.org/abs/2407.11370
With the growing amount of musical data available, automatic instrument recognition, one of the essential problems in Music Information Retrieval (MIR), is drawing more and more attention. While automatic recognition of single instruments has been we
External link:
http://arxiv.org/abs/2306.08850
Authors:
Liu, Qianying, Gong, Zhuo, Yang, Zhengdong, Yang, Yuhang, Li, Sheng, Ding, Chenchen, Minematsu, Nobuaki, Huang, Hao, Cheng, Fei, Chu, Chenhui, Kurohashi, Sadao
Low-resource speech recognition has long suffered from insufficient training data. In this paper, we propose an approach that leverages neighboring languages to improve low-resource scenario performance, founded on the hypothesis that similar l
External link:
http://arxiv.org/abs/2204.03855
Authors:
Zhao, Yi, Takaki, Shinji, Luong, Hieu-Thi, Yamagishi, Junichi, Saito, Daisuke, Minematsu, Nobuaki
Recent neural networks such as WaveNet and sampleRNN that learn directly from speech waveform samples have achieved very high-quality synthetic speech in terms of both naturalness and speaker similarity even in multi-speaker text-to-speech synthesis
External link:
http://arxiv.org/abs/1807.11679
Published in:
In Speech Communication October 2015 73:47-63
Published in:
In Speech Communication September 2015 72:208-217