Showing 1 - 10 of 1,420 results for search: '"Lee, Tan"'
Psychotherapy or counseling is typically conducted through spoken conversation between a therapist and a client. Analyzing the speech characteristics of psychotherapeutic interactions can help understand the factors associated with effective psychoth
External link:
http://arxiv.org/abs/2409.02466
This paper presents a user-driven approach for synthesizing specific target voices based on user feedback rather than reference recordings, which is particularly beneficial for speech-impaired individuals who want to recreate their lost voices but la
External link:
http://arxiv.org/abs/2408.17068
Representing speech as discretized units has numerous benefits in supporting downstream spoken language processing tasks. However, the approach has been less explored in speech synthesis of tonal languages like Mandarin Chinese. Our preliminary exper
External link:
http://arxiv.org/abs/2406.08989
Covering all languages with a multilingual speech recognition model (MASR) is very difficult. Performing language extension on top of an existing MASR is a desirable choice. In this study, the MASR continual learning problem is probabilistically deco
External link:
http://arxiv.org/abs/2406.06329
Toward high-performance multilingual automatic speech recognition (ASR), various types of linguistic information and model design have demonstrated their effectiveness independently. They include language identity (LID), phoneme information, language
External link:
http://arxiv.org/abs/2401.03689
This research is about the creation of personalized synthetic voices for head and neck cancer survivors. It is focused particularly on tongue cancer patients whose speech might exhibit severe articulation impairment. Our goal is to restore normal art
External link:
http://arxiv.org/abs/2401.03816
Counseling is carried out as spoken conversation between a therapist and a client. The empathy level expressed by the therapist is considered an important index of the quality of counseling and often assessed by an observer or the client. This resear
External link:
http://arxiv.org/abs/2310.14181
Counseling is usually conducted through spoken conversation between a therapist and a client. The empathy level of the therapist is a key indicator of outcomes. Presuming that the therapist's empathy expression is shaped by their past behavior and their perc
External link:
http://arxiv.org/abs/2310.14178
Author:
Li, Jingyu; Lee, Tan
The development of deep neural networks (DNN) has significantly enhanced the performance of speaker verification (SV) systems in recent years. However, a critical issue that persists when applying DNN-based SV systems in practical applications is dom
External link:
http://arxiv.org/abs/2309.13605
Transformer-based speech recognition (ASR) models with deep layers have exhibited significant performance improvement. However, these models are inefficient for deployment on resource-constrained devices. Layer pruning (LP) is a commonly used compression metho
External link:
http://arxiv.org/abs/2309.11768