Open Access. Powered by Scholars. Published by Universities.®

Arts and Humanities Commons

Full-Text Articles in Arts and Humanities

The Accuracy Of Automatic And Human Live Captions In English, Pablo Romero-Fresco, Nazaret Fresno Dec 2023

Writing and Language Studies Faculty Publications and Presentations

Closed captions play a vital role in making live broadcasts accessible to many viewers. Traditionally, stenographers and respeakers have been in charge of their production, but this scenario is changing due to the steady improvements that automatic speech recognition has undergone in recent years. This technology is being used to create intralingual live captions without human assistance and broadcasters have begun to explore its use. As a result, human and automatic captions co-exist now on television and, while some research has focused on the accuracy of human live captions, comprehensive assessments of the accuracy and quality of automatic captions are …


CAI Tool-Supported SI Of Numbers: A Theoretical And Methodological Contribution, Francesca Maria Frittella Jul 2022

International Journal of Interpreter Education

Numbers are an area of interpreting that is particularly prone to human error. Thanks to recent advancements in automatic speech recognition (ASR) and artificial intelligence (AI) technology, computer-assisted interpreting (CAI) tools may soon be used to enhance delivery accuracy for numbers during simultaneous interpreting (SI).

Given the novelty of the topic, the impact of in-booth CAI tool support on the SI of numbers is still largely under-researched. First, only a few studies have addressed the topic. Second, due to a number of methodological limitations, their findings yield only a partial understanding of the issue. The present work aims to make …


Multilingual Phoneme Models For Rapid Speech Processing System Development, Eric G. Hansen Sep 2006

Theses and Dissertations

Current speech recognition systems tend to be developed only for commercially viable languages. The resources needed for a typical speech recognition system include hundreds of hours of transcribed speech for acoustic models and 10 to 100 million words of text for language models; both of these requirements can be costly in time and money. The goal of this research is to facilitate rapid development of speech systems to new languages by using multilingual phoneme models to alleviate requirements for large amounts of transcribed speech. The Global Phone database, which contains transcribed speech from 15 languages, is used as source data …


Speech Recognition Using The Mellin Transform, Jesse R. Hornback Mar 2006

Theses and Dissertations

The purpose of this research was to improve performance in speech recognition. Specifically, a new approach was investigated by applying an integral transform known as the Mellin transform (MT) on the output of an auditory model to improve the recognition rate of phonemes through the scale-invariance property of the Mellin transform. Scale-invariance means that as a time-domain signal is subjected to dilations, the distribution of the signal in the MT domain remains unaffected. An auditory model was used to transform speech waveforms into images representing how the brain "sees" a sound. The MT was applied and features were extracted. The …
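
The scale-invariance property described in this abstract can be illustrated with a small numerical sketch. This is not the thesis's method (which applies the MT to auditory-model images); it is a one-dimensional toy under stated assumptions: the Mellin transform is evaluated on the imaginary axis s = jω, the test signal `pulse` and the helper `mellin_magnitude` are hypothetical names chosen for illustration, and the integral is approximated by a Riemann sum after the substitution t = exp(u). Under that substitution a dilation of the signal becomes a shift in u, which changes only the phase of the transform, so the magnitude should stay the same.

```python
import numpy as np

def mellin_magnitude(f, omegas, u_min=-12.0, u_max=12.0, n=8192):
    """Approximate |M(j*omega)| for M(s) = integral of f(t) * t**(s-1) dt.

    With t = exp(u) and s = j*omega this becomes the Fourier-type integral
    of f(exp(u)) * exp(j*omega*u) du, evaluated here as a Riemann sum.
    """
    u = np.linspace(u_min, u_max, n)
    du = u[1] - u[0]
    samples = f(np.exp(u))                      # signal on a log-time grid
    kernel = np.exp(1j * np.outer(omegas, u))   # one row per omega
    return np.abs(kernel @ samples) * du

def pulse(t):
    # Hypothetical test signal: a bump that is Gaussian in log-time,
    # so it decays quickly toward both t -> 0 and t -> infinity.
    return np.exp(-0.5 * np.log(t) ** 2)

omegas = np.linspace(0.0, 3.0, 7)
original = mellin_magnitude(pulse, omegas)
dilated = mellin_magnitude(lambda t: pulse(2.0 * t), omegas)  # time axis dilated by 2

# The two magnitude profiles agree up to numerical error.
print(np.allclose(original, dilated))
```

The integration limits and grid size are assumptions that suit this particular test signal; a signal with slower decay in log-time would need a wider range.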