Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences Commons

Artificial Intelligence and Robotics

Natural Language Processing Faculty Publications

2022

Cross-modal

Full-Text Articles in Computer Sciences

Unsupervised Automatic Speech Recognition: A Review, Hanan Aldarmaki, Asad Ullah, Sreepratha Ram, Nazar Zaki (Apr 2022)

Natural Language Processing Faculty Publications

Automatic Speech Recognition (ASR) systems can be trained to achieve remarkable performance given large amounts of manually transcribed speech, but large labeled data sets can be difficult or expensive to acquire for all languages of interest. In this paper, we review the research literature to identify models and ideas that could lead to fully unsupervised ASR, including unsupervised sub-word and word modeling, unsupervised segmentation of the speech signal, and unsupervised mapping from speech segments to text. The objective of the study is to identify the limitations of what can be learned from speech data alone and to understand the minimum …