Using a Biomechanical Model and Articulatory Data for the Numerical Production of Vowels

Autor: Pierre Badin, Oriol Guasch, Saeed Dabbaghchian, Marc Arnela, Ian Stavness, Olov Engwall
Přispěvatelé: Royal Institute of Technology [Stockholm] (KTH ), GTM Grup de recerca en Tecnologies Mèdia, Universitat Ramon Llull [Barcelona] (URL), Department of Computer Science [Saskatoon, Saskatchewan] (U of S), University of Saskatchewan [Saskatoon] (U of S), GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing (GIPSA-CRISSP), Département Parole et Cognition (GIPSA-DPC), Grenoble Images Parole Signal Automatique (GIPSA-lab ), Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Institut Polytechnique de Grenoble - Grenoble Institute of Technology-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Institut Polytechnique de Grenoble - Grenoble Institute of Technology-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Grenoble Images Parole Signal Automatique (GIPSA-lab ), Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Institut Polytechnique de Grenoble - Grenoble Institute of Technology-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Institut Polytechnique de Grenoble - Grenoble Institute of Technology-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019]), European Project: 308874,EC:FP7:ICT,FP7-ICT-2011-C,EUNISON(2013)
Rok vydání: 2016
Předmět:
Zdroj: INTERSPEECH
Interspeech 2016-17th Annual Conference of the International Speech Communication Association
Interspeech 2016-17th Annual Conference of the International Speech Communication Association, Sep 2016, San Francisco, United States. pp.3569-3573, ⟨10.21437/Interspeech.2016-1500⟩
DOI: 10.21437/interspeech.2016-1500
Popis: We introduce a framework to study speech production using a biomechanical model of the human vocal tract, ArtiSynth. Electromagnetic articulography data was used as input to an inverse tracking simulation that estimates muscle activations to generate 3D jaw and tongue postures corresponding to the target articulator positions. For acoustic simulations, the vocal tract geometry is needed, but since the vocal tract is a cavity rather than a physical object, its geometry does not explicitly exist in a biomechanical model. A fully-automatic method to extract the 3D geometry (surface mesh) of the vocal tract by blending geometries of the relevant articulators has therefore been developed. This automatic extraction procedure is essential, since a method with manual intervention is not feasible for large numbers of simulations or for generation of dynamic sounds, such as diphthongs. We then simulated the vocal tract acoustics by using the Finite Element Method (FEM). This requires a high quality vocal tract mesh without irregular geometry or self-intersections. We demonstrate that the framework is applicable to acoustic FEM simulations of a wide range of vocal tract deformations. In particular we present results for cardinal vowel production, with muscle activations, vocal tract geometry, and acoustic simulations. QC 20160920 EUNISON
Databáze: OpenAIRE