Výsledky vyhledávání - "Michael Chinen"

Akademický článek

Speech quality assessment with WARP‐Q: From similarity to subsequence dynamic time warp cost

Autor: Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines

Publikováno v: IET Signal Processing, Vol 16, Iss 9, Pp 1050-1070 (2022)

Abstract Speech coding has been shown to achieve good speech quality using either waveform matching or parametric reconstruction. For very low bit rate streams, recently developed generative speech models can reconstruct high‐quality wideband speec

Externí odkaz: https://doaj.org/article/0515bfaa105a481994f5b7bc1386aa30

Zobrazit plný text záznamu

Akademický článek

Marginal Effects of Language and Individual Raters on Speech Quality Models

Autor: Michael Chinen

Publikováno v: IEEE Access, Vol 9, Pp 127320-127334 (2021)

Speech quality is often measured via subjective testing, or with objective estimators of mean opinion score (MOS) such as ViSQOL or POLQA. Typical MOS-estimation frameworks use signal level features but do not use language features that have been sho

Externí odkaz: https://doaj.org/article/a5c84308d2254bdb94a997a6f1493406

Zobrazit plný text záznamu

Akademický článek

AMBIQUAL: Towards a Quality Metric for Headphone Rendered Compressed Ambisonic Spatial Audio

Autor: Miroslaw Narbutt, Jan Skoglund, Andrew Allen, Michael Chinen, Dan Barry, Andrew Hines

Publikováno v: Applied Sciences, Vol 10, Iss 9, p 3188 (2020)

Spatial audio is essential for creating a sense of immersion in virtual environments. Efficient encoding methods are required to deliver spatial audio over networks without compromising Quality of Service (QoS). Streaming service providers such as Yo

Externí odkaz: https://doaj.org/article/552d6f3f67a14cc7ae6987e397c6a82b

Zobrazit plný text záznamu

Multi-Channel Audio Signal Generation

Autor: W. Bastiaan Kleijn, Michael Chinen, Felicia S. C. Lim, Jan Skoglund

Publikováno v: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::0dc3c260e2f691be875bb0da4f4d8196
https://doi.org/10.1109/icassp49357.2023.10094853

Zobrazit plný text záznamu

Ultra-Low-Bitrate Speech Coding with Pretrained Transformers

Autor: Ali Siahkoohi, Michael Chinen, Tom Denton, W. Bastiaan Kleijn, Jan Skoglund

Speech coding facilitates the transmission of speech over low-bandwidth networks with minimal distortion. Neural-network based speech codecs have recently demonstrated significant improvements in quality over traditional approaches. While this new ge

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0a43855363016a979af0ae020b96a3c0
http://arxiv.org/abs/2207.02262

Zobrazit plný text záznamu

Speech quality estimation with deep lattice networks

Autor: Andrew Hines, Michael Chinen, Jan Skoglund

Publikováno v: The Journal of the Acoustical Society of America. 149(6)

Intrusive subjective speech quality estimation of mean opinion score (MOS) often involves mapping a raw similarity score extracted from differences between the clean and degraded utterance onto MOS with a fitted mapping function. More recent models s

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6ce812f84de77faf5c480203218efd97
https://pubmed.ncbi.nlm.nih.gov/34241460

Zobrazit plný text záznamu

WARP-Q: Quality Prediction For Generative Neural Speech Codecs

Autor: Wissam A. Jassim, Jan Skoglund, Michael Chinen, Andrew Hines

Publikováno v: ICASSP

Good speech quality has been achieved using waveform matching and parametric reconstruction coders. Recently developed very low bit rate generative codecs can reconstruct high quality wideband speech with bit streams less than 3 kb/s. These codecs us

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::102c4952e8d52c947235d85c3aad1613

Zobrazit plný text záznamu

ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric

Autor: Feargus O'Gorman, Felicia S. C. Lim, Michael Chinen, Nikita Gureev, Andrew Hines, Jan Skoglund

Publikováno v: QoMEX

Estimation of perceptual quality in audio and speech is possible using a variety of methods. The combined v3 release of ViSQOL and ViSQOLAudio (for speech and audio, respectively,) provides improvements upon previous versions, in terms of both design

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::dc416fbe010c6057d5f596dedba5be99
http://arxiv.org/abs/2004.09584

Zobrazit plný text záznamu

Speech Quality Factors for Traditional and Neural-Based Low Bit Rate Vocoders

Autor: Michael Chinen, Andrew Hines, Wissam A. Jassim, Jan Skoglund

Publikováno v: QoMEX

This study compares the performances of different algorithms for coding speech at low bit rates. In addition to widely deployed traditional vocoders, a selection of recently developed generative-model-based coders at different bit rates are contraste

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6febe502aa31344195616cf9b4d20947

Zobrazit plný text záznamu

Generative Speech Enhancement Based on Cloned Networks

Autor: Jan Skoglund, W. Bastiaan Kleijn, Michael Chinen, Felicia S. C. Lim

Publikováno v: WASPAA

We propose to implement speech enhancement by the regeneration of clean speech from a salient representation extracted from the noisy signal. The network that extracts salient features is trained using a set of weight-sharing clones of the extractor

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4de1ea4c8663da5586590fc484f09732

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání