Zobrazeno 1 - 10
of 16
pro vyhledávání: '"Michael Chinen"'
Publikováno v:
IET Signal Processing, Vol 16, Iss 9, Pp 1050-1070 (2022)
Abstract Speech coding has been shown to achieve good speech quality using either waveform matching or parametric reconstruction. For very low bit rate streams, recently developed generative speech models can reconstruct high‐quality wideband speec
Externí odkaz:
https://doaj.org/article/0515bfaa105a481994f5b7bc1386aa30
Autor:
Michael Chinen
Publikováno v:
IEEE Access, Vol 9, Pp 127320-127334 (2021)
Speech quality is often measured via subjective testing, or with objective estimators of mean opinion score (MOS) such as ViSQOL or POLQA. Typical MOS-estimation frameworks use signal level features but do not use language features that have been sho
Externí odkaz:
https://doaj.org/article/a5c84308d2254bdb94a997a6f1493406
Publikováno v:
Applied Sciences, Vol 10, Iss 9, p 3188 (2020)
Spatial audio is essential for creating a sense of immersion in virtual environments. Efficient encoding methods are required to deliver spatial audio over networks without compromising Quality of Service (QoS). Streaming service providers such as Yo
Externí odkaz:
https://doaj.org/article/552d6f3f67a14cc7ae6987e397c6a82b
Publikováno v:
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Speech coding facilitates the transmission of speech over low-bandwidth networks with minimal distortion. Neural-network based speech codecs have recently demonstrated significant improvements in quality over traditional approaches. While this new ge
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0a43855363016a979af0ae020b96a3c0
http://arxiv.org/abs/2207.02262
http://arxiv.org/abs/2207.02262
Publikováno v:
The Journal of the Acoustical Society of America. 149(6)
Intrusive subjective speech quality estimation of mean opinion score (MOS) often involves mapping a raw similarity score extracted from differences between the clean and degraded utterance onto MOS with a fitted mapping function. More recent models s
Publikováno v:
ICASSP
Good speech quality has been achieved using waveform matching and parametric reconstruction coders. Recently developed very low bit rate generative codecs can reconstruct high quality wideband speech with bit streams less than 3 kb/s. These codecs us
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::102c4952e8d52c947235d85c3aad1613
Autor:
Feargus O'Gorman, Felicia S. C. Lim, Michael Chinen, Nikita Gureev, Andrew Hines, Jan Skoglund
Publikováno v:
QoMEX
Estimation of perceptual quality in audio and speech is possible using a variety of methods. The combined v3 release of ViSQOL and ViSQOLAudio (for speech and audio, respectively,) provides improvements upon previous versions, in terms of both design
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::dc416fbe010c6057d5f596dedba5be99
http://arxiv.org/abs/2004.09584
http://arxiv.org/abs/2004.09584
Publikováno v:
QoMEX
This study compares the performances of different algorithms for coding speech at low bit rates. In addition to widely deployed traditional vocoders, a selection of recently developed generative-model-based coders at different bit rates are contraste
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6febe502aa31344195616cf9b4d20947
Publikováno v:
WASPAA
We propose to implement speech enhancement by the regeneration of clean speech from a salient representation extracted from the noisy signal. The network that extracts salient features is trained using a set of weight-sharing clones of the extractor
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4de1ea4c8663da5586590fc484f09732