Výsledky vyhledávání - "Williamson, Donald A"

Report

Building Trust Through Voice: How Vocal Tone Impacts User Perception of Attractiveness of Voice Assistants

Autor: Pias, Sabid Bin Habib, Freel, Alicia, Huang, Ran, Williamson, Donald, Kim, Minjeong, Kapadia, Apu

Voice Assistants (VAs) are popular for simple tasks, but users are often hesitant to use them for complex activities like online shopping. We explored whether the vocal characteristics like the VA's vocal tone, can make VAs perceived as more attracti

Externí odkaz: http://arxiv.org/abs/2409.18941

Zobrazit plný text záznamu

Report

The Impact of Perceived Tone, Age, and Gender on Voice Assistant Persuasiveness in the Context of Product Recommendations

Autor: Pias, Sabid Bin Habib, Huang, Ran, Williamson, Donald, Kim, Minjeong, Kapadia, Apu

Voice Assistants (VAs) can assist users in various everyday tasks, but many users are reluctant to rely on VAs for intricate tasks like online shopping. This study aims to examine whether the vocal characteristics of VAs can serve as an effective too

Externí odkaz: http://arxiv.org/abs/2405.04791

Zobrazit plný text záznamu

Report

The Drawback of Insight: Detailed Explanations Can Reduce Agreement with XAI

Autor: Pias, Sabid Bin Habib, Freel, Alicia, Trammel, Timothy, Akter, Taslima, Williamson, Donald, Kapadia, Apu

With the emergence of Artificial Intelligence (AI)-based decision-making, explanations help increase new technology adoption through enhanced trust and reliability. However, our experimental study challenges the notion that every user universally val

Externí odkaz: http://arxiv.org/abs/2404.19629

Zobrazit plný text záznamu

Report

CORN: Co-Trained Full- And No-Reference Speech Quality Assessment

Autor: Manocha, Pranay, Williamson, Donald, Finkelstein, Adam

Perceptual evaluation constitutes a crucial aspect of various audio-processing tasks. Full reference (FR) or similarity-based metrics rely on high-quality reference recordings, to which lower-quality or corrupted versions of the recording may be comp

Externí odkaz: http://arxiv.org/abs/2310.09388

Zobrazit plný text záznamu

Report

Privacy-preserving and Privacy-attacking Approaches for Speech and Audio -- A Survey

Autor: Liu, Yuchen, Kapadia, Apu, Williamson, Donald

In contemporary society, voice-controlled devices, such as smartphones and home assistants, have become pervasive due to their advanced capabilities and functionality. The always-on nature of their microphones offers users the convenience of readily

Externí odkaz: http://arxiv.org/abs/2309.15087

Zobrazit plný text záznamu

Report

MMViT: Multiscale Multiview Vision Transformers

Autor: Liu, Yuchen, Ong, Natasha, Peng, Kaiyan, Xiong, Bo, Wang, Qifan, Hou, Rui, Khabsa, Madian, Yang, Kaiyue, Liu, David, Williamson, Donald S., Yu, Hanchao

We present Multiscale Multiview Vision Transformers (MMViT), which introduces multiscale feature maps and multiview encodings to transformer models. Our model encodes different views of the input signal and builds several channel-resolution feature s

Externí odkaz: http://arxiv.org/abs/2305.00104

Zobrazit plný text záznamu

Report

Attention-based Speech Enhancement Using Human Quality Perception Modelling

Autor: Nayem, Khandokar Md., Williamson, Donald S.

Perceptually-inspired objective functions such as the perceptual evaluation of speech quality (PESQ), signal-to-distortion ratio (SDR), and short-time objective intelligibility (STOI), have recently been used to optimize performance of deep-learning-

Externí odkaz: http://arxiv.org/abs/2303.13685

Zobrazit plný text záznamu

Report

A Composite T60 Regression and Classification Approach for Speech Dereverberation

Autor: Li, Yuying, Liu, Yuchen, Williamson, Donald S.

Dereverberation is often performed directly on the reverberant audio signal, without knowledge of the acoustic environment. Reverberation time, T60, however, is an essential acoustic factor that reflects how reverberation may impact a signal. In this

Externí odkaz: http://arxiv.org/abs/2302.04932

Zobrazit plný text záznamu

Report

ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications

Autor: Yi, Gaoxiong, Xiao, Wei, Xiao, Yiming, Naderi, Babak, Möller, Sebastian, Wardah, Wafaa, Mittag, Gabriel, Cutler, Ross, Zhang, Zhuohuang, Williamson, Donald S., Chen, Fei, Yang, Fuzheng, Shang, Shidong

With the advances in speech communication systems such as online conferencing applications, we can seamlessly work with people regardless of where they are. However, during online meetings, speech quality can be significantly affected by background n

Externí odkaz: http://arxiv.org/abs/2203.16032

Zobrazit plný text záznamu

Report

Multi-channel Multi-frame ADL-MVDR for Target Speech Separation

Autor: Zhang, Zhuohuang, Xu, Yong, Yu, Meng, Zhang, Shi-Xiong, Chen, Lianwu, Williamson, Donald S., Yu, Dong

Many purely neural network based speech separation approaches have been proposed to improve objective assessment scores, but they often introduce nonlinear distortions that are harmful to modern automatic speech recognition (ASR) systems. Minimum var

Externí odkaz: http://arxiv.org/abs/2012.13442

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání