Showing 1 - 10 of 31 for search: '"Kevin El Haddad"'
Published in:
Informatics, Vol 8, Iss 4, p 84 (2021)
In this paper, we study the controllability of an expressive TTS system trained on a dataset for continuous control. The dataset is the Blizzard 2013 dataset, based on audiobooks read by a female speaker and containing a great variability in styles and …
External link:
https://doaj.org/article/ce613f2836204276a2564a149c22b163
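The record above is only a truncated snippet, so the following is a minimal, hedged sketch of the general idea behind continuous style control in a TTS encoder, not the authors' system: a text encoding is conditioned on a continuous control vector that can be varied at synthesis time. The module names, dimensions, and the use of PyTorch are all assumptions made for illustration.

# Illustrative sketch only: continuous style control in an expressive TTS
# encoder. This is NOT the system from the record above; names and
# hyper-parameters are invented for the example.
import torch
import torch.nn as nn


class ControllableTextEncoder(nn.Module):
    """Encodes a symbol sequence and injects a continuous
    style-control vector into every encoder timestep."""

    def __init__(self, vocab_size=64, emb_dim=128, hidden_dim=128, control_dim=1):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        # The control vector is concatenated to each embedded symbol.
        self.rnn = nn.GRU(emb_dim + control_dim, hidden_dim, batch_first=True)

    def forward(self, tokens, control):
        # tokens:  (batch, time) integer symbol ids
        # control: (batch, control_dim) continuous style value(s), e.g. arousal
        emb = self.embed(tokens)                                  # (B, T, E)
        ctrl = control.unsqueeze(1).expand(-1, emb.size(1), -1)   # (B, T, C)
        out, _ = self.rnn(torch.cat([emb, ctrl], dim=-1))         # (B, T, H)
        return out


if __name__ == "__main__":
    enc = ControllableTextEncoder()
    tokens = torch.randint(0, 64, (2, 10))
    # Sweeping the scalar between 0.0 and 1.0 would vary the synthesized style.
    styles = torch.tensor([[0.1], [0.9]])
    print(enc(tokens, styles).shape)  # torch.Size([2, 10, 128])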
Author:
Kevin El Haddad, Yara Rizk, Louise Heron, Nadine Hajj, Yong Zhao, Jaebok Kim, Trung Ngô Trọng, Minha Lee, Marwan Doumit, Payton Lin, Yelin Kim, Hüseyin Çakmak
Published in:
Journal of Science and Technology of the Arts, Vol 10, Iss 2 (2018)
In this work, we established the foundations of a framework with the goal of building an end-to-end naturalistic expressive listening agent. The project was split into modules for recognition of the user's paralinguistic and nonverbal expressions, pre…
External link:
https://doaj.org/article/fa1ac0b904974532b6abf0ac58fa1261
Published in:
2022 10th International Conference on Affective Computing and Intelligent Interaction (ACII).
Published in:
INTERSPEECH
Despite the growing interest in expressive speech synthesis, synthesis of nonverbal expressions is an under-explored area. In this paper, we propose an audio laughter synthesis system based on a sequence-to-sequence TTS synthesis system. We leverage …
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f610e83ed9ceef71f532145a205faeae
http://arxiv.org/abs/2008.09483
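The snippet above ends before describing the system, so below is only a hedged sketch of the generic sequence-to-sequence idea it refers to: an encoder reads a sequence of laughter-unit tokens and an autoregressive decoder with attention predicts mel-spectrogram frames. The token set, dimensions, and PyTorch code are assumptions for illustration, not the authors' implementation.

# Hedged sketch of a generic sequence-to-sequence acoustic model, in the spirit
# of the record above (laughter-unit tokens -> mel-spectrogram frames). It is
# not the authors' system; all hyper-parameters are placeholders.
import torch
import torch.nn as nn
import torch.nn.functional as F


class Seq2SeqLaughSketch(nn.Module):
    """Toy encoder/attention/decoder mapping laughter-unit tokens to mel frames."""

    def __init__(self, n_tokens=16, hid=128, n_mels=80):
        super().__init__()
        self.embed = nn.Embedding(n_tokens, hid)
        self.encoder = nn.GRU(hid, hid, batch_first=True)
        self.decoder_cell = nn.GRUCell(n_mels + hid, hid)
        self.attn_query = nn.Linear(hid, hid)
        self.to_mel = nn.Linear(hid, n_mels)

    def forward(self, tokens, n_frames):
        # tokens: (B, T) ids of laughter units (e.g. inhalation, burst, ...)
        memory, _ = self.encoder(self.embed(tokens))            # (B, T, H)
        B = tokens.size(0)
        state = memory.new_zeros(B, memory.size(-1))            # decoder hidden state
        frame = memory.new_zeros(B, self.to_mel.out_features)   # previous mel frame
        outputs = []
        for _ in range(n_frames):
            # Dot-product attention over the encoder memory.
            scores = torch.bmm(memory, self.attn_query(state).unsqueeze(-1))  # (B, T, 1)
            context = (F.softmax(scores, dim=1) * memory).sum(dim=1)          # (B, H)
            state = self.decoder_cell(torch.cat([frame, context], dim=-1), state)
            frame = self.to_mel(state)                                         # (B, n_mels)
            outputs.append(frame)
        return torch.stack(outputs, dim=1)                      # (B, n_frames, n_mels)


if __name__ == "__main__":
    model = Seq2SeqLaughSketch()
    laugh_tokens = torch.randint(0, 16, (2, 6))
    print(model(laugh_tokens, n_frames=20).shape)  # torch.Size([2, 20, 80])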
Published in:
HRI (Companion)
The state of the art in speech synthesis has considerably reduced the perceptual gap between synthetic and human speech. However, the impact of speech style control on perception is not well known. In this paper, we propose a method to analyze …
Published in:
Computers & Electrical Engineering. 62:588-600
In this paper, we present our work on the analysis and classification of smiled vowels, chuckling (or shaking) vowels, and laughter syllables. This work is part of a larger framework that aims at assessing the level of amusement in speech using the audio …
As part of the Human-Computer Interaction field, expressive speech synthesis is a very rich domain, as it requires knowledge in areas such as machine learning, signal processing, sociology, and psychology. In this chapter, we will focus mostly on the …
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::dc8bfd02cc65328c1a59047e1dbe764f
http://www.intechopen.com/articles/show/title/the-theory-behind-controllable-expressive-speech-synthesis-a-cross-disciplinary-approach
Published in:
ICMI
Smiles and laughs have been the subject of many studies over the past decades, due to their frequent occurrence in interactions as well as their social and emotional functions in dyadic conversations. In this paper, we push forward previous work by p…
Published in:
INTERSPEECH
The field of Text-to-Speech has seen huge improvements in recent years, benefiting from deep learning techniques. Producing realistic speech is now possible. As a consequence, research on the control of expressiveness, allowing one to generate …
Published in:
ACII Workshops
In this work, we present an open-source avatar project intended for use in Human-Agent Interaction systems. We demonstrate here a system that is portable, works in real time, and is implemented in a modular way. This is an attempt to respond to the need …