Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 6 of 6

Full-Text Articles in Engineering

Visual Speech Recognition Using A 3d Convolutional Neural Network, Matthew Rochford Dec 2019

Visual Speech Recognition Using A 3d Convolutional Neural Network, Matthew Rochford

Master's Theses

Main stream automatic speech recognition (ASR) makes use of audio data to identify spoken words, however visual speech recognition (VSR) has recently been of increased interest to researchers. VSR is used when audio data is corrupted or missing entirely and also to further enhance the accuracy of audio-based ASR systems. In this research, we present both a framework for building 3D feature cubes of lip data from videos and a 3D convolutional neural network (CNN) architecture for performing classification on a dataset of 100 spoken words, recorded in an uncontrolled envi- ronment. Our 3D-CNN architecture achieves a testing accuracy of …


Adaptation Of A Deep Learning Algorithm For Traffic Sign Detection, Jose Luis Masache Narvaez Jul 2019

Adaptation Of A Deep Learning Algorithm For Traffic Sign Detection, Jose Luis Masache Narvaez

Electronic Thesis and Dissertation Repository

Traffic signs detection is becoming increasingly important as various approaches for automation using computer vision are becoming widely used in the industry. Typical applications include autonomous driving systems, mapping and cataloging traffic signs by municipalities. Convolutional neural networks (CNNs) have shown state of the art performances in classification tasks, and as a result, object detection algorithms based on CNNs have become popular in computer vision tasks. Two-stage detection algorithms like region proposal methods (R-CNN and Faster R-CNN) have better performance in terms of localization and recognition accuracy. However, these methods require high computational power for training and inference that make …


Strawberry Detection Under Various Harvestation Stages, Yavisht Fitter Mar 2019

Strawberry Detection Under Various Harvestation Stages, Yavisht Fitter

Master's Theses

This paper analyzes three techniques attempting to detect strawberries at various stages in its growth cycle. Histogram of Oriented Gradients (HOG), Local Binary Patterns (LBP) and Convolutional Neural Networks (CNN) were implemented on a limited custom-built dataset. The methodologies were compared in terms of accuracy and computational efficiency. Computational efficiency is defined in terms of image resolution as testing on a smaller dimensional image is much quicker than larger dimensions. The CNN based implementation obtained the best results with an 88% accuracy at the highest level of efficiency as well (600x800). LBP generated moderate results with a 74% detection accuracy …


Autonomous And Real Time Rock Image Classification Using Convolutional Neural Networks, Alexis David Pascual Feb 2019

Autonomous And Real Time Rock Image Classification Using Convolutional Neural Networks, Alexis David Pascual

Electronic Thesis and Dissertation Repository

Autonomous image recognition has numerous potential applications in the field of planetary science and geology. For instance, having the ability to classify images of rocks would allow geologists to have immediate feedback without having to bring back samples to the laboratory. Also, planetary rovers could classify rocks in remote places and even in other planets without needing human intervention. In 2017, Shu et. al. used a Support Vector Machine (SVM) classification algorithm to classify 9 different types of rock images using a with the image features extracted autonomously. Through this method, they achieved a test accuracy of 96.71%. Within the …


Estimation And Prediction Of The Human Gait Dynamics For The Control Of An Ankle-Foot Prosthesis, Guilherme Aramizo Ribeiro Jan 2019

Estimation And Prediction Of The Human Gait Dynamics For The Control Of An Ankle-Foot Prosthesis, Guilherme Aramizo Ribeiro

Dissertations, Master's Theses and Master's Reports

With the growing population of amputees, powered prostheses can be a solution to improve the quality of life for many people. Powered ankle-foot prostheses can be made to behave similar to the lost limb via controllers that emulate the mechanical impedance of the human ankle. Therefore, the understanding of human ankle dynamics is of major significance. First, this work reports the modulation of the mechanical impedance via two mechanisms: the co-contraction of the calf muscles and a change of mean ankle torque and angle. Then, the mechanical impedance of the ankle was determined, for the first time, as a multivariable …


Optimal Compression Of Point Clouds, Benjamin Robert Smith Jan 2019

Optimal Compression Of Point Clouds, Benjamin Robert Smith

Graduate Theses, Dissertations, and Problem Reports

Image-based localization is a crucial step in many 3D computer vision applications, e.g., self-driving cars, robotics, and augmented reality among others. Unfortunately, many image-based-localization applications require the storage of large scenes, and many camera pose estimators struggle to scale when the scene representation is large. To alleviate the aforementioned problems, many applications compress a scene representation by reducing the number of 3D points of a point cloud. The state-of-the-art compresses a scene representation by using a K-cover-based algorithm. While the state-of-the-art selects a subset of 3D points that maximizes the probability of accurately estimating the camera pose of a new …