Výsledky vyhledávání - "Kowtha, Vasudha"

Report

Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations

Autor: Kowtha, Vasudha, Marques, Miquel Espi, Huang, Jonathan, Zhang, Yichi, Avendano, Carlos

This work investigates pretrained audio representations for few shot Sound Event Detection. We specifically address the task of few shot detection of novel acoustic sequences, or sound events with semantically meaningful temporal structure, without a

Externí odkaz: http://arxiv.org/abs/2305.02382

Zobrazit plný text záznamu

Report

Pre-trained Model Representations and their Robustness against Noise for Speech Emotion Analysis

Autor: Mitra, Vikramjit, Kowtha, Vasudha, Chien, Hsiang-Yun Sherry, Azemi, Erdrin, Avendano, Carlos

Pre-trained model representations have demonstrated state-of-the-art performance in speech recognition, natural language processing, and other applications. Speech models, such as Bidirectional Encoder Representations from Transformers (BERT) and Hid

Externí odkaz: http://arxiv.org/abs/2303.03177

Zobrazit plný text záznamu

Report

Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation

Autor: Mitra, Vikramjit, Chien, Hsiang-Yun Sherry, Kowtha, Vasudha, Cheng, Joseph Yitan, Azemi, Erdrin

Estimating dimensional emotions, such as activation, valence and dominance, from acoustic speech signals has been widely explored over the past few years. While accurate estimation of activation and dominance from speech seem to be possible, the same

Externí odkaz: http://arxiv.org/abs/2207.03334

Zobrazit plný text záznamu

Report

Detecting Emotion Primitives from Speech and their use in discerning Categorical Emotions

Autor: Kowtha, Vasudha, Mitra, Vikramjit, Bartels, Chris, Marchi, Erik, Booker, Sue, Caruso, William, Kajarekar, Sachin, Naik, Devang

Emotion plays an essential role in human-to-human communication, enabling us to convey feelings such as happiness, frustration, and sincerity. While modern speech technologies rely heavily on speech recognition and natural language understanding for

Externí odkaz: http://arxiv.org/abs/2002.01323

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání