Open Access. Powered by Scholars. Published by Universities.®

Social and Behavioral Sciences Commons

Phonetics and Phonology

Yale Day of Data

Conference

Full-Text Articles in Social and Behavioral Sciences

Text-Speech Alignment: A Robin Hood Approach For Endangered Languages, Claire Bowern, Rikker Dockum, Sarah Babinski, Hunter Craft, Anelisa Fergus, Dolly Goldenberg, Jan 2019

Yale Day of Data

Forced alignment automatically aligns audio recordings of spoken language with transcripts at the level of individual sounds, greatly reducing the time required to prepare data for linguistic analysis. However, existing algorithms are mostly trained on a few well-documented languages. We test the performance of three algorithms against manually aligned data from a highly endangered language. For at least some tasks, unsupervised alignment (either based on English or trained from a small corpus) is sufficiently reliable to be used on legacy data for low-resource languages. Descriptive phonetic work on vowel inventories and prosody can be accurately captured by …