Showing 1 - 10 of 26 for search: '"Soheil Khorram"'
Author:
Soheil Khorram, Anshuman Tripathi, Jaeyoung Kim, Han Lu, Qian Zhang, Rohit Prabhavalkar, Hasim Sak
Published in:
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Published in:
ASRU
Despite significant efforts over the last few years to build a robust automatic speech recognition (ASR) system for different acoustic settings, the performance of the current state-of-the-art technologies significantly degrades in noisy reverberant …
Published in:
ASRU
Training acoustic models with sequentially incoming data -- while both leveraging new data and avoiding the forgetting effect -- is an essential obstacle to achieving human intelligence level in speech recognition. An obvious approach to leverage data …
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c3103a3af9fc2e6e3020b02cb8fb0257
http://arxiv.org/abs/1910.00565
Published in:
ICASSP
Emotions modulate speech acoustics as well as language. The latter influences the sequences of phonemes that are produced, which in turn further modulate the acoustics. Therefore, phonemes impact emotion recognition in two ways: (1) they introduce an …
Published in:
ICASSP
DTW calculates the similarity or alignment between two signals, subject to temporal warping. However, its computational complexity grows exponentially with the number of time-series. Although there have been algorithms developed that are linear in the …
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b97b867efea87499684a3025f4ccbb4a
http://arxiv.org/abs/1903.09245
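For context, the abstract above concerns multi-sequence DTW; as a minimal sketch (the classic pairwise case only, not the paper's method), the O(nm) dynamic-programming recurrence for two 1-D sequences is:

```python
import numpy as np

def dtw(x, y):
    """Pairwise DTW distance between two 1-D sequences via
    the standard O(len(x) * len(y)) dynamic program."""
    n, m = len(x), len(y)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(x[i - 1] - y[j - 1])
            D[i, j] = cost + min(D[i - 1, j],      # insertion
                                 D[i, j - 1],      # deletion
                                 D[i - 1, j - 1])  # match
    return D[n, m]
```

Aligning k sequences jointly generalizes this table to k dimensions, which is the exponential blow-up the abstract refers to.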
Published in:
IEEE Trans Affect Comput
Time-continuous dimensional descriptions of emotions (e.g., arousal, valence) allow researchers to characterize short-time changes and to capture long-term trends in emotion expression. However, continuous emotion labels are generally not synchronized …
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::79db627137c584f22624360631032371
Published in:
INTERSPEECH
Single-microphone, speaker-independent speech separation is normally performed through two steps: (i) separating the specific speech sources, and (ii) determining the best output-label assignment to find the separation error. The second step is the …
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::405491cb8d73a775770fd9680fbd3aab
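The output-label assignment step described above is the permutation problem in speaker-independent separation. A minimal sketch of a permutation-invariant error (an illustration under assumed `estimates`/`references` array inputs, not the paper's proposed method):

```python
import itertools
import numpy as np

def pit_mse(estimates, references):
    """Permutation-invariant MSE: score every output-to-speaker
    assignment and keep the lowest error. Enumeration is factorial
    in the number of sources, so this suits 2-3 speakers."""
    S = len(references)
    best = float("inf")
    for perm in itertools.permutations(range(S)):
        err = np.mean([np.mean((estimates[p] - references[s]) ** 2)
                       for s, p in enumerate(perm)])
        best = min(best, err)
    return best
```

Because the minimum is taken over assignments, the loss is unaffected by which output slot each speaker lands in.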
Published in:
INTERSPEECH
Bipolar Disorder is a chronic psychiatric illness characterized by pathological mood swings associated with severe disruptions in emotion regulation. Clinical monitoring of mood is key to the care of these dynamic and incapacitating mood states. …
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8228f3b12bccd26d593b7c570813d8c9
http://arxiv.org/abs/1806.10658
Published in:
ICMI
In this paper, we present an analysis of different multimodal fusion approaches in the context of deep learning, focusing on pooling intermediate representations learned for the acoustic and lexical modalities. Traditional approaches to multimodal …
Author:
Emily Mower Provost, Melvin G. McInnis, Soheil Khorram, Zakaria Aldeneh, Dimitrios Dimitriadis
Published in:
INTERSPEECH
The goal of continuous emotion recognition is to assign an emotion value to every frame in a sequence of acoustic features. We show that incorporating long-term temporal dependencies is critical for continuous emotion recognition tasks. To this end, …
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ef276b80e11933d3060cbf588d1b30a5
http://arxiv.org/abs/1708.07050
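One common way to widen a model's temporal context over a frame sequence is dilated causal convolution, where output frame t depends on inputs t, t-d, t-2d, and so on. The sketch below is an illustrative NumPy implementation of that idea, not necessarily the architecture this paper proposes:

```python
import numpy as np

def dilated_causal_conv(x, w, dilation):
    """1-D causal convolution with dilation: output t sees inputs
    t, t-d, t-2d, ... Stacking layers with growing dilation covers
    long temporal contexts with few parameters."""
    k, T = len(w), len(x)
    # left-pad with zeros so the output keeps length T and never
    # looks at future frames
    pad = (k - 1) * dilation
    xp = np.concatenate([np.zeros(pad), np.asarray(x, dtype=float)])
    y = np.zeros(T)
    for t in range(T):
        for i in range(k):
            y[t] += w[i] * xp[pad + t - i * dilation]
    return y
```

With L layers and dilations 1, 2, 4, ..., the receptive field grows exponentially in L while each layer stays cheap, which is why this construction is popular for frame-level sequence labeling.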