Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Computer Sciences (15)
- Artificial Intelligence and Robotics (7)
- Engineering (5)
- Data Science (4)
- Computer Engineering (2)
-
- Electrical and Computer Engineering (2)
- Systems Architecture (2)
- Atmospheric Sciences (1)
- Bioimaging and Biomedical Optics (1)
- Biomedical Engineering and Bioengineering (1)
- Civil Engineering (1)
- Civil and Environmental Engineering (1)
- Climate (1)
- Computer and Systems Architecture (1)
- Computer-Aided Engineering and Design (1)
- Databases and Information Systems (1)
- Digital Communications and Networking (1)
- Earth Sciences (1)
- Heat Transfer, Combustion (1)
- Mechanical Engineering (1)
- Oceanography and Atmospheric Sciences and Meteorology (1)
- Other Computer Sciences (1)
- Other Earth Sciences (1)
- Transportation Engineering (1)
- Institution
Articles 1 - 16 of 16
Full-Text Articles in Physical Sciences and Mathematics
Task Classification During Visual Search Using Classic Machine Learning And Deep Learning, Devangi Vilas Chinchankar
Task Classification During Visual Search Using Classic Machine Learning And Deep Learning, Devangi Vilas Chinchankar
Master's Projects
In an average human life, the eyes not only passively scan visual scenes, but most times end up actively performing tasks including, but not limited to, searching, comparing, and counting. As a result of the advances in technology, we are observing a boost in the average screen time. Humans are now looking at an increasing number of screens and in turn images and videos. Understanding what scene a user is looking at and what type of visual task is being performed can be useful in developing intelligent user interfaces, and in virtual reality and augmented reality devices. In this research, …
Analysis Of Camera Trap Footage Through Subject Recognition, Nirnayak Bhardwaj
Analysis Of Camera Trap Footage Through Subject Recognition, Nirnayak Bhardwaj
Master's Projects
Motion-sensitive cameras, otherwise known as camera traps, have become increasingly popular amongst ecologists for studying wildlife. These cameras allow scientists to remotely observe animals through an inexpensive and non-invasive approach. Due to the lenient nature of motion cameras, studies involving them often generate excessive amounts of footage with many photographs not containing any animal subjects. Thus, there is a need for a system that is capable of analyzing camera trap footage to determine if a picture holds value for researchers. While research into automated image recognition is well documented, it has had limited applications in the field of ecology. This …
Deep Learning Strategies For Pool Boiling Heat Flux Prediction Using Image Sequences, Connor Heo
Deep Learning Strategies For Pool Boiling Heat Flux Prediction Using Image Sequences, Connor Heo
Graduate Theses and Dissertations
The understanding of bubble dynamics during boiling is critical to the design of advanced heater surfaces to improve the boiling heat transfer. The stochastic bubble nucleation, growth, and coalescence processes have made it challenging to obtain mechanistic models that can predict boiling heat flux based on the bubble dynamics. Traditional boiling image analysis relies on the extraction of the dominant physical quantities from the images and is thus limited to the existing knowledge of these quantities. Recently, machine-learning-aided analysis has shown success in boiling crisis detection, heat flux prediction, real-time image analysis, etc., whereas most of the existing studies are …
Exploring The Latent Space Of Image Captioning Networks, Mikian J. Musser
Exploring The Latent Space Of Image Captioning Networks, Mikian J. Musser
UNLV Theses, Dissertations, Professional Papers, and Capstones
State-of-the-art image captioning models can successfully produce a diverse set of accurate captions. Previous research has focused on improving caption diversity while maintaining a high level of fidelity. We shift the focus from accuracy and diversity to controllability. We use a modified version of the traditional encoder-decoder network that allows the model to produce a meaningful and structured latent space. We then explore the latent space using several latent cartographic methods: lerp, slerp, analogy completion, attribute vector rotation, and interpolation graphs. Additionally, we discuss different categories of latent space and provide modifications for each of the cartographic methods. Finally, we …
Learning To Interpret Fluid Type Phenomena Via Images, Simron Thapa
Learning To Interpret Fluid Type Phenomena Via Images, Simron Thapa
LSU Doctoral Dissertations
Learning to interpret fluid-type phenomena via images is a long-standing challenging problem in computer vision. The problem becomes even more challenging when the fluid medium is highly dynamic and refractive due to its transparent nature. Here, we consider imaging through such refractive fluid media like water and air. For water, we design novel supervised learning-based algorithms to recover its 3D surface as well as the highly distorted underground patterns. For air, we design a state-of-the-art unsupervised learning algorithm to predict the distortion-free image given a short sequence of turbulent images. Specifically, we design a deep neural network that estimates the …
Reasoning About Scene And Image Structure For Computer Vision, Zhihao Xia
Reasoning About Scene And Image Structure For Computer Vision, Zhihao Xia
McKelvey School of Engineering Theses & Dissertations
The wide availability of cheap consumer cameras has democratized photography for novices and experts alike, with more than a trillion photographs taken each year. While many of these cameras---especially those on mobile phones---have inexpensive optics and make imperfect measurements, the use of modern computational techniques can allow the recovery of high-quality photographs as well as of scene attributes.
In this dissertation, we explore algorithms to infer a wide variety of physical and visual properties of the world, including color, geometry, reflectance etc., from images taken by casual photographers in unconstrained settings. We specifically focus on neural network-based methods, while incorporating …
Exploring Behaviors Of Software Developers And Their Code Through Computational And Statistical Methods, Elia Eiroa Lledo
Exploring Behaviors Of Software Developers And Their Code Through Computational And Statistical Methods, Elia Eiroa Lledo
Computational and Data Sciences (PhD) Dissertations
As Artificial Intelligence (AI) increasingly penetrates all aspects of society, many obstacles emerge. This thesis identifies and discusses the issues facing Computer Vision and significant deficiencies in the Software Development Life-cycle that need to be resolved to facilitate the evolution toward true artificial intelligence. We explicitly review the concepts behind Convolutional Neural Network (CNN) models, the benchmark for computer vision. Chapter 2 highlights the mechanisms that have popularized CNNs while also specifying significant gaps that could garner the model inadequate for future use in safety-critical systems. We put forward two main limitations. Namely, CNNs do not use lack of information …
Forecasting Pedestrian Trajectory Using Deep Learning, Arsal Syed
Forecasting Pedestrian Trajectory Using Deep Learning, Arsal Syed
UNLV Theses, Dissertations, Professional Papers, and Capstones
In this dissertation we develop different methods for forecasting pedestrian trajectories. Complete understanding of pedestrian motion is essential for autonomous agents and social robots to make realistic and safe decisions. Current trajectory prediction methods rely on incorporating historic motion, scene features and social interaction to model pedestrian behaviors. Our focus is to accurately understand scene semantics to better forecast trajectories. In order to do so, we leverage semantic segmentation to encode static scene features such as walkable paths, entry/exits, static obstacles etc. We further evaluate the effectiveness of using semantic maps on different datasets and compare its performance with already …
Take The Lead: Toward A Virtual Video Dance Partner, Ty Farris
Take The Lead: Toward A Virtual Video Dance Partner, Ty Farris
Master's Theses
My work focuses on taking a single person as input and predicting the intentional movement of one dance partner based on the other dance partner's movement. Human pose estimation has been applied to dance and computer vision, but many existing applications focus on a single individual or multiple individuals performing. Currently there are very few works that focus specifically on dance couples combined with pose prediction. This thesis is applicable to the entertainment and gaming industry by training people to dance with a virtual dance partner.
Many existing interactive or virtual dance partners require a motion capture system, multiple cameras …
Multi-Modal Data Fusion, Image Segmentation, And Object Identification Using Unsupervised Machine Learning: Conception, Validation, Applications, And A Basis For Multi-Modal Object Detection And Tracking, Nicholas Lahaye
Computational and Data Sciences (PhD) Dissertations
Remote sensing and instrumentation is constantly improving and increasing in capability. Included within this, is the increase in amount of different instrument types, with various combinations of spatial and spectral resolutions, pointing angles, and various other instrument-specific qualities. While the increase in instruments, and therefore datasets, is a boon for those aiming to study the complexities of the various Earth systems, it can also present a large number of new challenges. With this information in mind, our group has set our aims on combining datasets with different spatial and spectral resolutions in an effective and as-general-as-possible way, with as little …
Design, Deployment, And Validation Of Computer Vision Techniques For Societal Scale Applications, Arup Kanti Dey
Design, Deployment, And Validation Of Computer Vision Techniques For Societal Scale Applications, Arup Kanti Dey
USF Tampa Graduate Theses and Dissertations
Artificial Intelligence techniques have ensued a significant impact on our daily lives. Numerous applications in so many diverse fields have been made possible by AI algorithms today, and there are many more yet to come. In this dissertation, we design, deploy and validate computer vision algorithms for innovative and high-impact societal scale applications.We specifically focus on two applications in this dissertation: Detection of distracted driving and Detection of breeding habitats of mosquito vectors.
Distracted driving on roads is a major problem around the world. Distracted driving is the case where a driver diverts his/her focus from the road and engages …
Signal Processing And Data Analysis For Real-Time Intermodal Freight Classification Through A Multimodal Sensor System., Enrique J. Sanchez Headley
Signal Processing And Data Analysis For Real-Time Intermodal Freight Classification Through A Multimodal Sensor System., Enrique J. Sanchez Headley
Graduate Theses and Dissertations
Identifying freight patterns in transit is a common need among commercial and municipal entities. For example, the allocation of resources among Departments of Transportation is often predicated on an understanding of freight patterns along major highways. There exist multiple sensor systems to detect and count vehicles at areas of interest. Many of these sensors are limited in their ability to detect more specific features of vehicles in traffic or are unable to perform well in adverse weather conditions. Despite this limitation, to date there is little comparative analysis among Laser Imaging and Detection and Ranging (LIDAR) sensors for freight detection …
Machine Learning For Terminal Procedure Chart Change Detection, Anthony M. Marchiafava
Machine Learning For Terminal Procedure Chart Change Detection, Anthony M. Marchiafava
University of New Orleans Theses and Dissertations
Terminal Procedure Charts are a constantly updated and necessary tool for aircraft personnel to approach and take off from airport runways safely. Detecting changes within these charts is a time-consuming and laborious process. Here machine learning techniques were used to predict regions of change in charts based on detecting the charts image regions and comparing features extracted from those regions. Outlined are methodologies to detect differences between two separate charts to produce images with changed regions clearly indicated. Both more conventional computer vision and machine learning techniques were applied. For images with minor shifts, the proposed model is able to …
An Intelligent Framework To Assess Embodied Cognition From Physical Activities In Children, Ashwin Ramesh Babu
An Intelligent Framework To Assess Embodied Cognition From Physical Activities In Children, Ashwin Ramesh Babu
Computer Science and Engineering Dissertations
Cognition refers to "The mental actions or process of acquiring knowledge and understanding through thought, experience, and the senses". It encompasses many aspects of intellectual functions and processes such as attention, working memory, response inhibition, motor functions, and more. Humans start to develop these cognitive skills right from their childhood and become fully developed through their adulthood. Impairments in these cognitive functions, specifically in Executive Functions (Higher-order cognitive functions), disrupt their everyday life leading to a troubled childhood and lifelong difficulties in family, employment, and community functioning leading to socio-economic repercussions. Identifying such impairments at the right age (early childhood) …
Cascaded Deep Learning Network For Postearthquake Bridge Serviceability Assessment, Youjeong Jang
Cascaded Deep Learning Network For Postearthquake Bridge Serviceability Assessment, Youjeong Jang
Electronic Theses and Dissertations
Damages assessment of bridges is important to derive immediate response after severe events to decide serviceability. Especially, past earthquakes have proven the vulnerability of bridges with insufficient detailing. Due to lack of a national and unified post-earthquake inspection procedure for bridges, conventional damage assessments are performed by sending professional personnel to the onsite, detecting visually and measuring the damage state. To get accurate and fast damage result of bridge condition is important to save not only lives but also costs.
There have been studies using image processing techniques to assess damage of bridge column without sending individual to onsite. Convolutional …
Perceptually Improved Medical Image Translations Using Conditional Generative Adversarial Networks, Anurag Vaidya
Perceptually Improved Medical Image Translations Using Conditional Generative Adversarial Networks, Anurag Vaidya
Honors Theses
Magnetic resonance imaging (MRI) can help visualize various brain regions. Typical MRI sequences consist of T1-weighted sequence (favorable for observing large brain structures), T2-weighted sequence (useful for pathology), and T2-FLAIR scan (useful for pathology with suppression of signal from water). While these different scans provide complementary information, acquiring them leads to acquisition times of ~1 hour and an average cost of $2,600, presenting significant barriers. To reduce these costs associated with brain MRIs, we present pTransGAN, a generative adversarial network capable of translating both healthy and unhealthy T1 scans into T2 scans. We show that the addition of non-adversarial …