Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Electrical and Electronics

2010

Automatic speech recognition

Articles 1 - 2 of 2

Full-Text Articles in Engineering

Phoneme-Based Video Indexing Using Phonetic Disparity Search, Carlos Leon Barth Jan 2010

Electronic Theses and Dissertations

This dissertation presents and evaluates an approach to the video indexing problem: a categorization method that transcribes audio content through Automatic Speech Recognition (ASR) combined with Dynamic Contextualization (DC), Phonetic Disparity Search (PDS), and Metaphone indexation. The suggested approach applies genome pattern matching algorithms with computational summarization to build a database infrastructure that provides an indexed summary of the original audio content. PDS complements the contextual phoneme indexing approach by optimizing topic seek performance and accuracy in large video content structures. A prototype was established to translate news broadcast video into text and phonemes automatically by using ASR …


Robust Dialog Management Through A Context-Centric Architecture, Victor C. Hung Jan 2010

Electronic Theses and Dissertations

This dissertation presents and evaluates a method of managing spoken dialog interactions with a robust attention to fulfilling the human user’s goals in the presence of speech recognition limitations. Assistive speech-based embodied conversation agents are computer-based entities that interact with humans to help accomplish a certain task or communicate information via spoken input and output. A challenging aspect of this task involves open dialog, where the user is free to converse in an unstructured manner. With this style of input, the machine’s ability to communicate may be hindered by poor reception of utterances, caused by a user’s inadequate command of …