Open Access. Powered by Scholars. Published by Universities.®

Communication Sciences and Disorders Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 2 of 2

Full-Text Articles in Communication Sciences and Disorders

Speaker-Specific Adaptation Of Maeda Synthesis Parameters For Auditory Feedback, Joseph Vonderhaar Apr 2017

Speaker-Specific Adaptation Of Maeda Synthesis Parameters For Auditory Feedback, Joseph Vonderhaar

Master's Theses (2009 -)

The Real-time Articulatory Speech Synthesizer (RASS) is a research tool in the Marquette Speech and Swallowing lab that simultaneously collects acoustic and articulatory data from human participants. The system is used to study acoustic-to-articulatory inversion, articulatory-to-acoustic synthesis mapping, and the effects of real-time acoustic feedback. Electromagnetic Articulography (EMA) is utilized to collect position data via sensors placed in a subject’s mouth. These kinematic data are then converted into a set of synthesis parameters that controls an articulatory speech synthesizer, which in turn generates an acoustic waveform matching the associated kinematics. Independently from RASS, the synthesized acoustic waveform can be further …


The Electromagnetic Articulography Mandarin Accented English (Ema-Mae) Corpus Of Acoustic And 3d Articulatory Kinematic Data, Jeffrey J. Berry, An Ji, Michael T. Johnson May 2014

The Electromagnetic Articulography Mandarin Accented English (Ema-Mae) Corpus Of Acoustic And 3d Articulatory Kinematic Data, Jeffrey J. Berry, An Ji, Michael T. Johnson

Speech Pathology and Audiology Faculty Research and Publications

There is a significant need for more comprehensive electromagnetic articulography (EMA) datasets that can provide matched acoustics and articulatory kinematic data with good spatial and temporal resolution. The Marquette University Electromagnetic Articulography Mandarin Accented English (EMA-MAE) corpus provides kinematic and acoustic data from 40 gender and dialect balanced speakers representing 20 Midwestern standard American English L1 speakers and 20 Mandarin Accented English (MAE) L2 speakers, half Beijing region dialect and half are Shanghai region dialect. Three dimensional EMA data were collected at a 400 Hz sampling rate using the NDI Wave system, with articulatory sensors on the midsagittal lips, lower …