A Fast Robust 1D Flow Model for a Self-Oscillating Coupled 2D FEM Vocal Fold Simulation

Arvind Vasudevan; Victor Zappi; Peter Anderson; Sidney Fels

2017 INTERSPEECH INTERSPEECH 2017

A Fast Robust 1D Flow Model for a Self-Oscillating Coupled 2D FEM Vocal Fold Simulation

Abstract

A balance between the simplicity and speed of lumped-element vocal fold models and the completeness and complexity of continuum-models is required to achieve fast high-quality articulatory speech synthesis. We develop and implement a novel self-oscillating vocal-fold model, composed of a 1D unsteady fluid model loosely coupled with a 2D FEM structural model. The flow model is capable of robustly handling irregular geometries, different boundary conditions, closure of the glottis and unsteady flow states. A method for a fast decoupled solution of the flow equations that does not require the computation of the Jacobian is provided. The model is coupled with a 2D real-time finite-difference wave-solver for simulating vocal tract acoustics and a 1D wave-reflection analog representation of the trachea. The simulation results are shown to agree with existing data in literature, and give realistic pressure-velocity distributions, glottal width and glottal flow values. In addition, the model is more than an order of magnitude faster to run than comparable 2D Navier-Stokes fluid solvers, while better capturing transitional flow than simple Bernoulli-based flow models. The vocal fold model provides an alternative to simple lumped-element models for faster higher-quality articulatory speech synthesis.

🧭 Keyword Pioneer — articulatory speech synthesis

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Arvind Vasudevan , Victor Zappi , Peter Anderson , Sidney Fels

Topics

Speech & Audio > Synthesis > Text-to-Speech

Keywords

fluid dynamics finite element method vocal fold articulatory speech synthesis glottal flow unsteady flow

Download PDF

Related papers

Description of the Munich-Passau Snore Sound Corpus (MPSSC) 2017

A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification 2017

Binaural Reverberant Speech Separation Based on Deep Neural Networks 2017

Building Audio-Visual Phonetically Annotated Arabic Corpus for Expressive Text to Speech 2017

A Comparison of Danish Listeners’ Processing Cost in Judging the Truth Value of Norwegian, Swedish, and English Sentences 2017