Using a Biomechanical Model and Articulatory Data for the Numerical Production of Vowels
Autor: | Pierre Badin, Oriol Guasch, Saeed Dabbaghchian, Marc Arnela, Ian Stavness, Olov Engwall |
---|---|
Přispěvatelé: | Royal Institute of Technology [Stockholm] (KTH ), GTM Grup de recerca en Tecnologies Mèdia, Universitat Ramon Llull [Barcelona] (URL), Department of Computer Science [Saskatoon, Saskatchewan] (U of S), University of Saskatchewan [Saskatoon] (U of S), GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing (GIPSA-CRISSP), Département Parole et Cognition (GIPSA-DPC), Grenoble Images Parole Signal Automatique (GIPSA-lab ), Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Institut Polytechnique de Grenoble - Grenoble Institute of Technology-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Institut Polytechnique de Grenoble - Grenoble Institute of Technology-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Grenoble Images Parole Signal Automatique (GIPSA-lab ), Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Institut Polytechnique de Grenoble - Grenoble Institute of Technology-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Institut Polytechnique de Grenoble - Grenoble Institute of Technology-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019]), European Project: 308874,EC:FP7:ICT,FP7-ICT-2011-C,EUNISON(2013) |
Rok vydání: | 2016 |
Předmět: |
Speech production
speech production Computer science [SHS.INFO]Humanities and Social Sciences/Library and information sciences Speech recognition 0206 medical engineering Strömningsmekanik och akustik 02 engineering and technology Tracking (particle physics) 01 natural sciences 0103 physical sciences vocal tract geometry 010301 acoustics Fluid Mechanics and Acoustics Computer Sciences 020601 biomedical engineering Electromagnetic articulography Datavetenskap (datalogi) biomechanical articulatory model vocal tract acoustics Computer Science::Sound Finite Element Method Production (computer science) Biomechanical model Vocal tract |
Zdroj: | INTERSPEECH Interspeech 2016-17th Annual Conference of the International Speech Communication Association Interspeech 2016-17th Annual Conference of the International Speech Communication Association, Sep 2016, San Francisco, United States. pp.3569-3573, ⟨10.21437/Interspeech.2016-1500⟩ |
DOI: | 10.21437/interspeech.2016-1500 |
Popis: | We introduce a framework to study speech production using a biomechanical model of the human vocal tract, ArtiSynth. Electromagnetic articulography data was used as input to an inverse tracking simulation that estimates muscle activations to generate 3D jaw and tongue postures corresponding to the target articulator positions. For acoustic simulations, the vocal tract geometry is needed, but since the vocal tract is a cavity rather than a physical object, its geometry does not explicitly exist in a biomechanical model. A fully-automatic method to extract the 3D geometry (surface mesh) of the vocal tract by blending geometries of the relevant articulators has therefore been developed. This automatic extraction procedure is essential, since a method with manual intervention is not feasible for large numbers of simulations or for generation of dynamic sounds, such as diphthongs. We then simulated the vocal tract acoustics by using the Finite Element Method (FEM). This requires a high quality vocal tract mesh without irregular geometry or self-intersections. We demonstrate that the framework is applicable to acoustic FEM simulations of a wide range of vocal tract deformations. In particular we present results for cardinal vowel production, with muscle activations, vocal tract geometry, and acoustic simulations. QC 20160920 EUNISON |
Databáze: | OpenAIRE |
Externí odkaz: |