Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Signal Processing

Master's Theses

2019

Computer Vision

Articles 1 - 1 of 1

Full-Text Articles in Engineering

Visual Speech Recognition Using A 3d Convolutional Neural Network, Matthew Rochford Dec 2019

Visual Speech Recognition Using A 3d Convolutional Neural Network, Matthew Rochford

Master's Theses

Main stream automatic speech recognition (ASR) makes use of audio data to identify spoken words, however visual speech recognition (VSR) has recently been of increased interest to researchers. VSR is used when audio data is corrupted or missing entirely and also to further enhance the accuracy of audio-based ASR systems. In this research, we present both a framework for building 3D feature cubes of lip data from videos and a 3D convolutional neural network (CNN) architecture for performing classification on a dataset of 100 spoken words, recorded in an uncontrolled envi- ronment. Our 3D-CNN architecture achieves a testing accuracy of …