Estimating vocal tract geometry from acoustic impedance using deep neural network.

Journal: JASA express letters
Published Date:

Abstract

A data-driven approach using artificial neural networks is proposed to address the classic inverse area function problem, i.e., to determine the vocal tract geometry (modelled as a tube of nonuniform cylindrical cross-sections) from the vocal tract acoustic impedance spectrum. The predicted cylindrical radii and the actual radii were found to have high correlation in the three- and four-cylinder model (Pearson coefficient (ρ) and Lin concordance coefficient (ρ) exceeded 95%); however, for the six-cylinder model, the correlation was low (ρ around 75% and ρ around 69%). Upon standardizing the impedance value, the correlation improved significantly for all cases (ρ and ρ exceeded 90%).

Authors

  • Balamurali B T
    Singapore University of Technology and Design, Singapore balamurali_bt@sutd.edu.sg, saumitrakapoor@gmail.com, jerming_chen@sutd.edu.sg.
  • Saumitra Kapoor
    Singapore University of Technology and Design, Singapore balamurali_bt@sutd.edu.sg, saumitrakapoor@gmail.com, jerming_chen@sutd.edu.sg.
  • Jer-Ming Chen
    Singapore University of Technology and Design, Singapore balamurali_bt@sutd.edu.sg, saumitrakapoor@gmail.com, jerming_chen@sutd.edu.sg.