Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 26 of 26

Full-Text Articles in Entire DC Network

Semi-Automatic Hand Pose Estimation Using A Single Depth Camera, Giffy Jerald Chris Dec 2020

Semi-Automatic Hand Pose Estimation Using A Single Depth Camera, Giffy Jerald Chris

Computer Science and Engineering Theses

This paper addresses the problem of 3D hand pose annotations using a single depth camera. Although hand pose estimation methods rely critically on accurate 3D training data, creating such reliable training data is challenging and labor intensive. We propose a semi-automatic method for efficiently and accurately labeling the 3D hand key-points in a hand depth video. The process starts by selecting a subset of frames that are representative of all the frames in the dataset and the annotator only provides an estimate of the 2D hand key-points in these selected frames. We use this information to infer the 3D location …


New Methods For Deep Learning Based Real-Valued Inter-Residue Distance Prediction, Jacob Barger Nov 2020

New Methods For Deep Learning Based Real-Valued Inter-Residue Distance Prediction, Jacob Barger

Theses

Background: Much of the recent success in protein structure prediction has been a result of accurate protein contact prediction--a binary classification problem. Dozens of methods, built from various types of machine learning and deep learning algorithms, have been published over the last two decades for predicting contacts. Recently, many groups, including Google DeepMind, have demonstrated that reformulating the problem as a multi-class classification problem is a more promising direction to pursue. As an alternative approach, we recently proposed real-valued distance predictions, formulating the problem as a regression problem. The nuances of protein 3D structures make this formulation appropriate, allowing predictions …


Hybrid Deep Neural Networks For Mining Heterogeneous Data, Xiurui Hou Aug 2020

Hybrid Deep Neural Networks For Mining Heterogeneous Data, Xiurui Hou

Dissertations

In the era of big data, the rapidly growing flood of data represents an immense opportunity. New computational methods are desired to fully leverage the potential that exists within massive structured and unstructured data. However, decision-makers are often confronted with multiple diverse heterogeneous data sources. The heterogeneity includes different data types, different granularities, and different dimensions, posing a fundamental challenge in many applications. This dissertation focuses on designing hybrid deep neural networks for modeling various kinds of data heterogeneity.

The first part of this dissertation concerns modeling diverse data types, the first kind of data heterogeneity. Specifically, image data and …


Enrichment Of Ontologies Using Machine Learning And Summarization, Hao Liu Aug 2020

Enrichment Of Ontologies Using Machine Learning And Summarization, Hao Liu

Dissertations

Biomedical ontologies are structured knowledge systems in biomedicine. They play a major role in enabling precise communications in support of healthcare applications, e.g., Electronic Healthcare Records (EHR) systems. Biomedical ontologies are used in many different contexts to facilitate information and knowledge management. The most widely used clinical ontology is the SNOMED CT. Placing a new concept into its proper position in an ontology is a fundamental task in its lifecycle of curation and enrichment.

A large biomedical ontology, which typically consists of many tens of thousands of concepts and relationships, can be viewed as a complex network with concepts as …


Mind Maps And Machine Learning: An Automation Framework For Qualitative Research In Entrepreneurship Education, Yasser Farha Aug 2020

Mind Maps And Machine Learning: An Automation Framework For Qualitative Research In Entrepreneurship Education, Yasser Farha

Dissertations

Entrepreneurship Education researchers often measure entrepreneurial motivation of college students. It is important for stakeholders, such as policymakers and educators, to assert if entrepreneurship education can encourage students to become entrepreneurs, as well as to understand factors that influence entrepreneurial motivation. For that purpose, researchers have used different methods and instruments to measure students' entrepreneurial motivation. Most of these methods are quantitative, e.g., closed-ended surveys, whereas qualitative methods, e.g., open-ended surveys, are rarely used.

Mind maps are an attractive qualitative survey tool because they capture the individual's reflections, thoughts, and experiences. For Entrepreneurship Education, mind maps can be utilized to …


Information Retrieval Of Opioid Dependence Medications Reviews From Health-Related Social Media, Seyedeh Samaneh Omranian Aug 2020

Information Retrieval Of Opioid Dependence Medications Reviews From Health-Related Social Media, Seyedeh Samaneh Omranian

Theses and Dissertations

Social media provides a convenient platform for patients to share their drug usage experience with others; consequently, health researchers can leverage this potential data to gain valuable information about users’ drug satisfaction. Since the 1990s, opioid drug abuse has become a national crisis. In order to reduce the dependency of opioids, several drugs have been presented to the market, but little is known about patient satisfaction with these treatments. Sentiment analysis is a method to measure and interpret patients’ satisfaction. In the first phase of this study, we aimed to utilize social media posts to predict patients’ sentiment towards opioid …


Bayesian Topological Machine Learning, Christopher A. Oballe Aug 2020

Bayesian Topological Machine Learning, Christopher A. Oballe

Doctoral Dissertations

Topological data analysis encompasses a broad set of ideas and techniques that address 1) how to rigorously define and summarize the shape of data, and 2) use these constructs for inference. This dissertation addresses the second problem by developing new inferential tools for topological data analysis and applying them to solve real-world data problems. First, a Bayesian framework to approximate probability distributions of persistence diagrams is established. The key insight underpinning this framework is that persistence diagrams may be viewed as Poisson point processes with prior intensities. With this assumption in hand, one may compute posterior intensities by adopting techniques …


In Situ Sensor Calibration Using Noise Consistency, Shriiesh Var Sharma Aug 2020

In Situ Sensor Calibration Using Noise Consistency, Shriiesh Var Sharma

Computer Science and Engineering Theses

Robots rely on sensors to map their surroundings. As a result, the accuracy of the map depends heavily on the sensor noise and in particular on accurate knowledge of it. The common way to minimize the impact of sensor noise is to use filtering algorithms. Accuracy of these filtering algorithms (like the Kalman filter) relies on the accuracy of the user supplied measurement noise model. Inaccurate noise models lead to higher residual noise in state estimates and errors in the estimate of the precision of the state estimate. It is therefore important to have precise noise models and thus accurately …


Classification Of Factual And Non-Factual Statements Using Adversarially Trained Lstm Networks, Daniel Obembe Aug 2020

Classification Of Factual And Non-Factual Statements Using Adversarially Trained Lstm Networks, Daniel Obembe

Computer Science and Engineering Theses

Being able to determine which statements are factual and therefore likely candidates for further verification is a key value-add in any automated fact-checking system. For this task, it has been shown that LSTMs outperform regular machine learning models, such as SVMs. However, the complexity of LSTMs can also result in over fitting (Gal and Ghahramani,1997), leading to poorer performance as models fail to generalize. To resolve this issue, we set out to utilize adversarial training as away to improve the performance of LSTMs for the task of classifying statements as factual or non-factual. In our experiment, we implement the adversarial …


Applications Of Artificial Intelligence And Graphy Theory To Cyberbullying, Jesse D. Simpson Aug 2020

Applications Of Artificial Intelligence And Graphy Theory To Cyberbullying, Jesse D. Simpson

MSU Graduate Theses

Cyberbullying is an ongoing and devastating issue in today's online social media. Abusive users engage in cyber-harassment by utilizing social media to send posts, private messages, tweets, or pictures to innocent social media users. Detecting and preventing cases of cyberbullying is crucial. In this work, I analyze multiple machine learning, deep learning, and graph analysis algorithms and explore their applicability and performance in pursuit of a robust system for detecting cyberbullying. First, I evaluate the performance of the machine learning algorithms Support Vector Machine, Naïve Bayes, Random Forest, Decision Tree, and Logistic Regression. This yielded positive results and obtained upwards …


Transfer Learning: Bridging The Gap Between Deep Learning And Domain-Specific Text Mining, Chaoran Cheng May 2020

Transfer Learning: Bridging The Gap Between Deep Learning And Domain-Specific Text Mining, Chaoran Cheng

Dissertations

Inspired by the success of deep learning techniques in Natural Language Processing (NLP), this dissertation tackles the domain-specific text mining problems for which the generic deep learning approaches would fail. More specifically, the domain-specific problems are: (1) success prediction in crowdfunding, (2) variants identification in biomedical literature, and (3) text data augmentation for domains with low-resources.

In the first part, transfer learning in a multimodal perspective is utilized to facilitate solving the project success prediction on the crowdfunding application. Even though the information in a project profile can be of different modalities such as text, images, and metadata, most existing …


Yoga Pose Classification Using Deep Learning, Shruti Kothari May 2020

Yoga Pose Classification Using Deep Learning, Shruti Kothari

Master's Projects

Human pose estimation is a deep-rooted problem in computer vision that has exposed many challenges in the past. Analyzing human activities is beneficial in many fields like video- surveillance, biometrics, assisted living, at-home health monitoring etc. With our fast-paced lives these days, people usually prefer exercising at home but feel the need of an instructor to evaluate their exercise form. As these resources are not always available, human pose recognition can be used to build a self-instruction exercise system that allows people to learn and practice exercises correctly by themselves. This project lays the foundation for building such a system …


Emerging Technologies In Healthcare: Analysis Of Unos Data Through Machine Learning, Reyhan Merekar May 2020

Emerging Technologies In Healthcare: Analysis Of Unos Data Through Machine Learning, Reyhan Merekar

Student Theses and Dissertations

The healthcare industry is primed for a massive transformation in the coming decades due to emerging technologies such as Artificial Intelligence (AI) and Machine Learning. With a practical application to the UNOS (United Network of Organ Sharing) database, this Thesis seeks to investigate how Machine Learning and analytic methods may be used to predict one-year heart transplantation outcomes. This study also sought to improve on predictive performances from prior studies by analyzing both Donor and Recipient data. Models built with algorithms such as Stacking and Tree Boosting gave the highest performance, with AUC’s of 0.6810 and 0.6804, respectively. In this …


Using Color Thresholding And Contouring To Understand Coral Reef Biodiversity, Scott Vuong Tran May 2020

Using Color Thresholding And Contouring To Understand Coral Reef Biodiversity, Scott Vuong Tran

Master's Projects

This paper presents research outcomes of understanding coral reef biodiversity through the usage of various computer vision applications and techniques. It aims to help further analyze and understand the coral reef biodiversity through the usage of color thresholding and contouring onto images of the ARMS plates to extract groups of microorganisms based on color. The results are comparable to the manual markup tool developed to do the same tasks and shows that the manual process can be sped up using computer vision. The paper presents an automated way to extract groups of microorganisms based on color without the use of …


Automated Digit Recognition On Sound Pressure Level Meters Based On Deep Learning, Che-Wei Tung May 2020

Automated Digit Recognition On Sound Pressure Level Meters Based On Deep Learning, Che-Wei Tung

Theses and Dissertations

Sound pressure level (SPL) meter is one of the useful devices used for measuring the sound level pressure. The measurement device displays the SPL value in decibels (dB) on a standard LCD screen (no backlight). We could base on the digit number shown on the LCD screen to do some adjustments or evaluations. Thus, SPL has been widely used in several fields to quantify different noise, such as industrial, environmental, and aircraft noise. However, in my basic knowledge, there is no previous study used machine learning to auto-recognize the digit on the SPL meter. This thesis presents a novel system …


Investigating Machine Learning Techniques For Gesture Recognition With Low-Cost Capacitive Sensing Arrays, Michael Fahr Jr. May 2020

Investigating Machine Learning Techniques For Gesture Recognition With Low-Cost Capacitive Sensing Arrays, Michael Fahr Jr.

Computer Science and Computer Engineering Undergraduate Honors Theses

Machine learning has proven to be an effective tool for forming models to make predictions based on sample data. Supervised learning, a subset of machine learning, can be used to map input data to output labels based on pre-existing paired data. Datasets for machine learning can be created from many different sources and vary in complexity, with popular datasets including the MNIST handwritten dataset and CIFAR10 image dataset. The focus of this thesis is to test and validate multiple machine learning models for accurately classifying gestures performed on a low-cost capacitive sensing array. Multiple neural networks are trained using gesture …


Using Logical Specifications For Multi-Objective Reinforcement Learning, Kolby Nottingham Mar 2020

Using Logical Specifications For Multi-Objective Reinforcement Learning, Kolby Nottingham

Undergraduate Honors Theses

In the multi-objective reinforcement learning (MORL) paradigm, the relative importance of environment objectives is often unknown prior to training, so agents must learn to specialize their behavior to optimize different combinations of environment objectives that are specified post-training. These are typically linear combinations, so the agent is effectively parameterized by a weight vector that describes how to balance competing environment objectives. However, we show that behaviors can be successfully specified and learned by much more expressive non-linear logical specifications. We test our agent in several environments with various objectives and show that it can generalize to many never-before-seen specifications.


Machine Learning For Effective Parkinson's Disease Diagnosis, Brennon Brimhall Mar 2020

Machine Learning For Effective Parkinson's Disease Diagnosis, Brennon Brimhall

Undergraduate Honors Theses

Parkinson’s Disease is a degenerative neurological condition that affects approximately 10 million people globally. Because there is currently no cure, there is a strong motivation for research into improved and automated diagnostic procedures. Using Random Forests, a computer can effectively learn to diagnose Parkinson’s disease in a patient with high accuracy (94%), precision (95%), and recall (91%) across the data of over 2800 patients. Using similar techniques, I further determine that the most predictive medical tests relate to tremors observed in patients.


Carbon Footprint Of Machine Learning Algorithms, Gigi Hsueh Jan 2020

Carbon Footprint Of Machine Learning Algorithms, Gigi Hsueh

Senior Projects Spring 2020

With the rapid development of machine learning, deep learning has demonstrated superior performance over other types of learning. Research made possible by big data and high-end GPU's enabled those research advances at the expense of computation and environmental costs. This will not only slow down the advancement of deep learning research because not all researchers have access to such expensive hardware, but it also accelerates climate change with increasing carbon emissions. It is essential for machine learning research to obtain high levels of accuracy and efficiency without contributing to global warming. This paper discusses some of current approaches in estimating …


Computer Vision Gesture Recognition For Rock Paper Scissors, Nicholas Hunter Jan 2020

Computer Vision Gesture Recognition For Rock Paper Scissors, Nicholas Hunter

Senior Independent Study Theses

This project implements a human versus computer game of rock-paper-scissors using machine learning and computer vision. Player’s hand gestures are detected using single images with the YOLOv3 object detection system. This provides a generalized detection method which can recognize player moves without the need for a special background or lighting setup. Additionally, past moves are examined in context to predict the most probable next move of the system’s opponent. In this way, the system achieves higher win rates against human opponents than by using a purely random strategy.


Predicting Drug Misuse Status Using Machine Learning On Electronic Health Records, Robert Arnold Kania Jan 2020

Predicting Drug Misuse Status Using Machine Learning On Electronic Health Records, Robert Arnold Kania

Master's Theses

Substance misuse is a major problem in the world. in 2014, as many as 52,404 deaths in the US were caused by drug overdoses. in 2001, the monetary cost of drug misuse has been estimated to be 414 billion dollars. in this work, we explore the use of different machine learning algorithms in the prediction of cocaine misuse using structured and unstructured data found in electronic health records. These records contain various attributes that can help with this prediction, including but not limited to chart text data, previous diagnoses of certain diseases and information about the area the patient lives …


Development Of Machine Learning Models To Predict The Online Impact Of Research, Mohammed Murtuza Shahzad Syed Jan 2020

Development Of Machine Learning Models To Predict The Online Impact Of Research, Mohammed Murtuza Shahzad Syed

Graduate Research Theses & Dissertations

Scientific research is being increasingly shared online in a way such that there is a need to develop methodologies to measure the impact of specific papers in ways that go beyond traditional indicators of scholarly citations and beyond the scholarly community. In this thesis, new machine learning models are developed to measure and predict the impact ofresearch in the online context. The extent to which research papers are mentioned on social media platforms, i.e., their online sustainability, indicates the public's interest in and perhaps even the level of understanding of scientific topics. A research paper having a long lifespan, i.e., …


A Deep Learning Approach To Mapping Irrigation: U-Net Irrmapper, Thomas Henry Colligan Iv Jan 2020

A Deep Learning Approach To Mapping Irrigation: U-Net Irrmapper, Thomas Henry Colligan Iv

Graduate Student Theses, Dissertations, & Professional Papers

Accurate maps of irrigation are essential for understanding and managing water resources in light of a warming climate. We present a new method for mapping irrigation and apply it to the state of Montana over the years 2000-2019. The method is based on an ensemble of convolutional neural networks that only rely on raw Landsat surface reflectance data. The ensemble of networks method learns to mask clouds and ignore Landsat 7 scan-line failures without supervision, reducing the need for preprocessing data or feature engineering. Unlike other approaches to mapping irrigation, the method doesn't use other mapping products like the Cropland …


Machine Learning? In My Election? It's More Likely Than You Think: Voting Rules Via Neural Networks, Daniel Firebanks-Quevedo Jan 2020

Machine Learning? In My Election? It's More Likely Than You Think: Voting Rules Via Neural Networks, Daniel Firebanks-Quevedo

Honors Papers

Impossibility theorems in social choice have represented a barrier in the creation of universal, non-dictatorial, and non-manipulable voting rules, highlighting a key trade-off between social welfare and strategy-proofness. However, a social planner may be concerned with only a particular preference distribution and wonder whether it is possible to better optimize this trade-off. To address this problem, we propose an end-to-end, machine learning-based framework that creates voting rules according to a social planner's constraints, for any type of preference distribution. After experimenting with rank-based social choice rules, we find that automatically-designed rules are less susceptible to manipulation than most existing rules, …


Searching For Needles In The Cosmic Haystack, Thomas Ryan Devine Jan 2020

Searching For Needles In The Cosmic Haystack, Thomas Ryan Devine

Graduate Theses, Dissertations, and Problem Reports

Searching for pulsar signals in radio astronomy data sets is a difficult task. The data sets are extremely large, approaching the petabyte scale, and are growing larger as instruments become more advanced. Big Data brings with it big challenges. Processing the data to identify candidate pulsar signals is computationally expensive and must utilize parallelism to be scalable. Labeling benchmarks for supervised classification is costly. To compound the problem, pulsar signals are very rare, e.g., only 0.05% of the instances in one data set represent pulsars. Furthermore, there are many different approaches to candidate classification with no consensus on a best …


Text Mining Methods For Analyzing Online Health Information And Communication, Sifei Han Jan 2020

Text Mining Methods For Analyzing Online Health Information And Communication, Sifei Han

Theses and Dissertations--Computer Science

The Internet provides an alternative way to share health information. Specifically, social network systems such as Twitter, Facebook, Reddit, and disease specific online support forums are increasingly being used to share information on health related topics. This could be in the form of personal health information disclosure to seek suggestions or answering other patients' questions based on their history. This social media uptake gives a new angle to improve the current health communication landscape with consumer generated content from social platforms. With these online modes of communication, health providers can offer more immediate support to the people seeking advice. Non-profit …