Zobrazeno 1 - 10
of 67
pro vyhledávání: '"Akinobu, Lee"'
Publikováno v:
Transactions of the Society of Automotive Engineers of Japan; Mar2024, Vol. 55 Issue 2, p361-366, 6p
Publikováno v:
Computer Speech & Language. 62:101070
End-to-end neural-based dialogue systems can potentially generate tailored and coherent responses for user inputs. However, most of existing systems produce universal and non-informative responses, and they have not gone beyond chitchat yet. To tackl
Publikováno v:
2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2013). :8382-8385
Vancouver, BC, Canada, 26-31 May 2013
Autor:
Steve Renals, Ichi Takumi, Yoshihiko Nankaku, Keiichi Tokuda, Kei Hashimoto, Shuhei Tsutsumi, Daisuke Yamamoto, Junichi Yamagishi, Akinobu Lee, Keiichiro Oura, Takahiro Uchiya
Publikováno v:
Human-Harmonized Information Technology, Volume 2 ISBN: 9784431565338
Human-Harmonized Information Technology (2)
Human-Harmonized Information Technology (2)
This chapter introduces the idea of user-generated dialogue content and describes our experimental exploration aimed at clarifying the mechanism and conditions that makes it workable in practice. One of the attractive points of a speech interface is
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::ed0a571ca93ef6739a76841e667e8132
https://doi.org/10.1007/978-4-431-56535-2_3
https://doi.org/10.1007/978-4-431-56535-2_3
Publikováno v:
IEICE transactions on information and systems. (3):668-678
This paper proposes Bayesian context clustering using cross validation for hidden Markov model (HMM) based speech recognition. The Bayesian approach is a statistical technique for estimating reliable predictive distributions by treating model paramet
Publikováno v:
Acoustical Science and Technology. 32:236-243
We propose a speech recognition technique using multiple model structures. In the use of context-dependent models, decision-tree-based context clustering is applied to find an appropriate parameter tying structure. However, context clustering is usua
Publikováno v:
IEICE Transactions on Information and Systems. :595-601
SUMMARY A technique for reducing the footprints of HMM-based speech synthesis systems by tying all covariance matrices of state distributions is described. HMM-based speech synthesis systems usually leave smaller footprints than unit-selection synthe
Publikováno v:
INTERSPEECH
This paper proposes a speaker adaptation technique using a nonlinear spectral transform based on GMMs. One of the most popular forms of speaker adaptation is based on linear transforms, e.g., MLLR. Although MLLR uses multiple transforms according to
Publikováno v:
IEICE Transactions on Information and Systems. :2693-2700
In a hidden Markov model (HMM), state duration probabilities decrease exponentially with time, which fails to adequately represent the temporal structure of speech. One of the solutions to this problem is integrating state duration probability distri
Publikováno v:
IEEE Transactions on Audio, Speech and Language Processing. 14:666-678
We propose a new algorithm for blind source separation (BSS), in which independent component analysis (ICA) and beamforming are combined to resolve the slow-convergence problem through optimization in ICA. The proposed method consists of the followin