Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 61

Full-Text Articles in Entire DC Network

Insights Into Cellular Evolution: Temporal Deep Learning Models And Analysis For Cell Image Classification, Xinran Zhao Mar 2024

Insights Into Cellular Evolution: Temporal Deep Learning Models And Analysis For Cell Image Classification, Xinran Zhao

Master's Theses

Understanding the temporal evolution of cells poses a significant challenge in developmental biology. This study embarks on a comparative analysis of various machine-learning techniques to classify cell colony images across different timestamps, thereby aiming to capture dynamic transitions of cellular states. By performing Transfer Learning with state-of-the-art classification networks, we achieve high accuracy in categorizing single-timestamp images. Furthermore, this research introduces the integration of temporal models, notably LSTM (Long Short Term Memory Network), R-Transformer (Recurrent Neural Network enhanced Transformer) and ViViT (Video Vision Transformer), to undertake this classification task to verify the effectiveness of incorporating temporal features into the classification …


Enhancing Telecom Churn Prediction: Adaboost With Oversampling And Recursive Feature Elimination Approach, Long Dinh Tran Jun 2023

Enhancing Telecom Churn Prediction: Adaboost With Oversampling And Recursive Feature Elimination Approach, Long Dinh Tran

Master's Theses

Churn prediction is a critical task for businesses to retain their valuable customers. This paper presents a comprehensive study of churn prediction in the telecom sector using 15 approaches, including popular algorithms such as Logistic Regression, Support Vector Machine, Decision Tree, Random Forest, and AdaBoost.

The study is segmented into three sets of experiments, each focusing on a different approach to building the churn prediction model. The model is constructed using the original training set in the first set of experiments. The second set involves oversampling the training set to address the issue of imbalanced data. Lastly, the third set …


A Novel Approach To Extending Music Using Latent Diffusion, Keon Roohparvar, Franz J. Kurfess Jun 2023

A Novel Approach To Extending Music Using Latent Diffusion, Keon Roohparvar, Franz J. Kurfess

Master's Theses

Using deep learning to synthetically generate music is a research domain that has gained more attention from the public in the past few years. A subproblem of music generation is music extension, or the task of taking existing music and extending it. This work proposes the Continuer Pipeline, a novel technique that uses deep learning to take music and extend it in 5 second increments. It does this by treating the musical generation process as an image generation problem; we utilize latent diffusion models (LDMs) to generate spectrograms, which are image representations of music. The Continuer Pipeline is able to …


Balancing Privacy And Accuracy In Iot Using Domain-Specific Features For Time Series Classification, Pranshul Lakhanpal Jun 2023

Balancing Privacy And Accuracy In Iot Using Domain-Specific Features For Time Series Classification, Pranshul Lakhanpal

Master's Theses

ε-Differential Privacy (DP) has been popularly used for anonymizing data to protect sensitive information and for machine learning (ML) tasks. However, there is a trade-off in balancing privacy and achieving ML accuracy since ε-DP reduces the model’s accuracy for classification tasks. Moreover, not many studies have applied DP to time series from sensors and Internet-of-Things (IoT) devices. In this work, we try to achieve the accuracy of ML models trained with ε-DP data to be as close to the ML models trained with non-anonymized data for two different physiological time series. We propose to transform time series into domain-specific 2D …


Tempers Rising: The Effect Of Heat On Spite, Jake C. Cosgrove May 2023

Tempers Rising: The Effect Of Heat On Spite, Jake C. Cosgrove

Master's Theses

The relationship between heat and harmful outcomes is well documented, with research connecting various adverse economic outcomes to the climate. In the presence of increasing global warming and climate change, understanding why the climate leads to negative economic outcomes is essential for forming peaceful institutions of the future. We study how behavioral economic outcomes change in the presence of heat through a lab experiment involving 1,110 observations conducted in five different countries. This paper specifically focuses on the social preference outcome of spite. We find that increased time exposure to the treatment effect of heat is required to elicit an …


Modeling Daily Fantasy Basketball, Martin Jiang Mar 2023

Modeling Daily Fantasy Basketball, Martin Jiang

Master's Theses

Daily fantasy basketball presents interesting problems to researchers due to the extensive amounts of data that needs to be explored when trying to predict player performance. A large amount of this data can be noisy due to the variance within the sport of basketball. Because of this, a high degree of skill is required to consistently win in daily fantasy basketball contests. On any given day, users are challenged to predict how players will perform and create a lineup of the eight best players under fixed salary and positional requirements. In this thesis, we present a tool to assist daily …


Analysis And Usage Of Natural Language Features In Success Prediction Of Legislative Testimonies, Marine Cossoul Mar 2023

Analysis And Usage Of Natural Language Features In Success Prediction Of Legislative Testimonies, Marine Cossoul

Master's Theses

Committee meetings are a fundamental part of the legislative process in which
constituents, lobbyists, and legislators alike can speak on proposed bills at the
local and state level. Oftentimes, unspoken “rules” or standards are at play in
political processes that can influence the trajectory of a bill, leaving constituents
without a political background at an inherent disadvantage when engaging with
the legislative process. The work done in this thesis aims to explore the extent to
which the language and phraseology of a general public testimony can influence a
vote, and examine how this information can be used to promote civic …


Predicting Startup Success Using Publicly Available Data, Emily Gavrilenko Dec 2022

Predicting Startup Success Using Publicly Available Data, Emily Gavrilenko

Master's Theses

Predicting the success of an early-stage startup has always been a major effort for investors and venture funds. Statistically, there are about 305 million total startups created in a year, but less than 10% of them succeed to become profitable businesses. Accurately identifying the signs of startup growth is the work of countless investors, and in recent years, research has turned to machine learning in hopes of improving the accuracy and speed of startup success prediction.

To learn about a startup, investors have to navigate many different internet sources and often rely on personal intuition to determine the startup’s potential …


A Design Of A Digital Lockout Tagout System With Machine Learning, Brandon H. Chen Dec 2022

A Design Of A Digital Lockout Tagout System With Machine Learning, Brandon H. Chen

Master's Theses

Lockout Tagout (LOTO) is a safety procedure instated by the Occupational Safety and Health Administration (OSHA) when doing maintenance on dangerous machinery and hazardous power sources. In this procedure, authorized workers shut off the machinery and use physical locks and tags to prevent operation during maintenance. LOTO has been the industry standard for 32 years since it was instantiated, being used in many different industries such as industrial work, mining, and agriculture. However, LOTO is not without its issues. The LOTO procedure requires employees to be trained and is prone to human error. As well, there is a clash between …


Panodepth – Panoramic Monocular Depth Perception Model And Framework, Adley K. Wong Dec 2022

Panodepth – Panoramic Monocular Depth Perception Model And Framework, Adley K. Wong

Master's Theses

Depth perception has become a heavily researched area as companies and researchers are striving towards the development of self-driving cars. Self-driving cars rely on perceiving the surrounding area, which heavily depends on technology capable of providing the system with depth perception capabilities. In this paper, we explore developing a single camera (monocular) depth prediction model that is trained on panoramic depth images. Our model makes novel use of transfer learning efficient encoder models, pre-training on a larger dataset of flat depth images, and optimizing the model for use with a Jetson Nano. Additionally, we present a training and optimization framework …


Deep Learning For Detecting Trees In The Urban Environment From Lidar, Julian R. Rice Aug 2022

Deep Learning For Detecting Trees In The Urban Environment From Lidar, Julian R. Rice

Master's Theses

Cataloguing and classifying trees in the urban environment is a crucial step in urban and environmental planning. However, manual collection and maintenance of this data is expensive and time-consuming. Algorithmic approaches that rely on remote sensing data have been developed for tree detection in forests, though they generally struggle in the more varied urban environment. This work proposes a novel method for the detection of trees in the urban environment that applies deep learning to remote sensing data. Specifically, we train a PointNet-based neural network to predict tree locations directly from LIDAR data augmented with multi-spectral imaging. We compare this …


A Study Of Grammar-Based Fuzzing Approaches, Ziwei Wu Jun 2022

A Study Of Grammar-Based Fuzzing Approaches, Ziwei Wu

Master's Theses

Fuzzing is the process of finding security vulnerabilities in code by creating inputs that will activate the exploits. Grammar-based fuzzing uses a grammar, which represents the syntax of all inputs a target program will accept, allowing the fuzzer to create well-formed complex inputs. This thesis conducts an in-depth study on two blackbox grammar-based fuzzing methods, GLADE and Learn&Fuzz, on their performance and usability to the average user. The blackbox fuzzer Radamsa was also used to compare fuzzing effectiveness. From our results in fuzzing PDF objects, GLADE beats both Radamsa and Learn&Fuzz in terms of coverage and pass rate. XML inputs …


Comparing Learned Representations Between Unpruned And Pruned Deep Convolutional Neural Networks, Parker Mitchell Jun 2022

Comparing Learned Representations Between Unpruned And Pruned Deep Convolutional Neural Networks, Parker Mitchell

Master's Theses

While deep neural networks have shown impressive performance in computer vision tasks, natural language processing, and other domains, the sizes and inference times of these models can often prevent them from being used on resource-constrained systems. Furthermore, as these networks grow larger in size and complexity, it can become even harder to understand the learned representations of the input data that these networks form through training. These issues of growing network size, increasing complexity and runtime, and ambiguity in the understanding of internal representations serve as guiding points for this work.

In this thesis, we create a neural network that …


Legislative Language For Success, Sanjana Gundala Jun 2022

Legislative Language For Success, Sanjana Gundala

Master's Theses

Legislative committee meetings are an integral part of the lawmaking process for local and state bills. The testimony presented during these meetings is a large factor in the outcome of the proposed bill. This research uses Natural Language Processing and Machine Learning techniques to analyze testimonies from California Legislative committee meetings from 2015-2016 in order to identify what aspects of a testimony makes it successful. A testimony is considered successful if the alignment of the testimony matches the bill outcome (alignment is "For" and the bill passes or alignment is "Against" and the bill fails). The process of finding what …


Improving Relation Extraction From Unstructured Genealogical Texts Using Fine-Tuned Transformers, Carloangello Parrolivelli Jun 2022

Improving Relation Extraction From Unstructured Genealogical Texts Using Fine-Tuned Transformers, Carloangello Parrolivelli

Master's Theses

Though exploring one’s family lineage through genealogical family trees can be insightful to developing one’s identity, this knowledge is typically held behind closed doors by private companies or require expensive technologies, such as DNA testing, to uncover. With the ever-booming explosion of data on the world wide web, many unstructured text documents, both old and new, are being discovered, written, and processed which contain rich genealogical information. With access to this immense amount of data, however, entails a costly process whereby people, typically volunteers, have to read large amounts of text to find relationships between people. This delays having genealogical …


Strainer: State Transcript Rating For Informed News Entity Retrieval, Thomas M. Gerrity Jun 2022

Strainer: State Transcript Rating For Informed News Entity Retrieval, Thomas M. Gerrity

Master's Theses

Over the past two decades there has been a rapid decline in public oversight of state and local governments. From 2003 to 2014, the number of journalists assigned to cover the proceedings in state houses has declined by more than 30\%. During the same time period, non-profit projects such as Digital Democracy sought to collect and store legislative bill and hearing information on behalf of the public. More recently, AI4Reporters, an offshoot of Digital Democracy, seeks to actively summarize interesting legislative data.

This thesis presents STRAINER, a parallel project with AI4Reporters, as an active data retrieval and filtering system for …


Plant Disease Detection Through Convolutional Neural Networks: A Survey Of Existing Literature, Best Practices, And Implementation, Kevin Label Dec 2021

Plant Disease Detection Through Convolutional Neural Networks: A Survey Of Existing Literature, Best Practices, And Implementation, Kevin Label

Master's Theses

In the United States alone, common diseases spread among plants account for billions of dollars lost in crop yield each year. This issue is exacerbated in countries with less infrastructure to defend against crop epidemics, and can lead to famine and forced migration. Farmers can seek the help of plant pathology experts to defend against diseases and detect crop irregularities early on. However, access to experts can be difficult, and even those trained in the field may miss symptoms before it is too late. To assist in early disease detection, a number of papers have been released on the potential …


Map-Gan: Unsupervised Learning Of Inverse Problems, Brandon S. Campanella Dec 2021

Map-Gan: Unsupervised Learning Of Inverse Problems, Brandon S. Campanella

Master's Theses

In this paper we outline a novel method for training a generative adversarial network based denoising model from an exclusively corrupted and unpaired dataset of images. Our model can learn without clean data or corrupted image pairs, and instead only requires that the noise distribution is able to be expressed analytically and that the noise at each pixel is independent. We utilize maximum a posteriori estimation as the underlying solution framework, optimizing over the analytically expressed noise generating distribution as the likelihood and employ the GAN as the prior. We then evaluate our method on several popular datasets of varying …


Classifying Electrocardiogram With Machine Learning Techniques, Hillal Jarrar Dec 2021

Classifying Electrocardiogram With Machine Learning Techniques, Hillal Jarrar

Master's Theses

Classifying the electrocardiogram is of clinical importance because classification can be used to diagnose patients with cardiac arrhythmias. Many industries utilize machine learning techniques that consist of feature extraction methods followed by Naive- Bayesian classification in order to detect faults within machinery. Machine learning techniques that analyze vibrational machine data in a mechanical application may be used to analyze electrical data in a physiological application. Three of the most common feature extraction methods used to prepare machine vibration data for Naive-Bayesian classification are the Fourier transform, the Hilbert transform, and the Wavelet Packet transform. Each machine learning technique consists of …


Subnational Map Of Poverty Generated From Remote-Sensing Data In Africa: Using Machine Learning Models And Advanced Regression Methods For Poverty Estimation, Lionel N. Hanke Sep 2021

Subnational Map Of Poverty Generated From Remote-Sensing Data In Africa: Using Machine Learning Models And Advanced Regression Methods For Poverty Estimation, Lionel N. Hanke

Master's Theses

According to the 2020 poverty estimates from the World Bank, it is estimated that 9.1% - 9.4% of the global population lived on less than $1.90 per day. It is estimated that the Covid-19 pandemic further aggravated the issue by pushing more than 1% of the global population below the international poverty line of $1.90 per day (WorldBank, 2020). To provide help and formulate effective measures, poverty needs to be located as exact as possible. For this purpose, it was investigated whether regression methods with aggregated remote-sensing data could be used to estimate poverty in Africa. Therefore, five distinct regression …


Take The Lead: Toward A Virtual Video Dance Partner, Ty Farris Aug 2021

Take The Lead: Toward A Virtual Video Dance Partner, Ty Farris

Master's Theses

My work focuses on taking a single person as input and predicting the intentional movement of one dance partner based on the other dance partner's movement. Human pose estimation has been applied to dance and computer vision, but many existing applications focus on a single individual or multiple individuals performing. Currently there are very few works that focus specifically on dance couples combined with pose prediction. This thesis is applicable to the entertainment and gaming industry by training people to dance with a virtual dance partner.

Many existing interactive or virtual dance partners require a motion capture system, multiple cameras …


Soarnet, Deep Learning Thermal Detection For Free Flight, Jake T. Tallman Jun 2021

Soarnet, Deep Learning Thermal Detection For Free Flight, Jake T. Tallman

Master's Theses

Thermals are regions of rising hot air formed on the ground through the warming of the surface by the sun. Thermals are commonly used by birds and glider pilots to extend flight duration, increase cross-country distance, and conserve energy. This kind of powerless flight using natural sources of lift is called soaring. Once a thermal is encountered, the pilot flies in circles to keep within the thermal, so gaining altitude before flying off to the next thermal and towards the destination. A single thermal can net a pilot thousands of feet of elevation gain, however estimating thermal locations is not …


Implementing A Data Acquisition System For The Training Of Cloud Coverage Neural Networks, Weston C. Montgomery Jun 2021

Implementing A Data Acquisition System For The Training Of Cloud Coverage Neural Networks, Weston C. Montgomery

Master's Theses

Cal Poly is home to a solar farm designed to nominally generate 4.5 MW of electricity. The Gold Tree Solar Farm (GTSF) is currently the largest photovoltaic array in the California State University (CSU) system, and it was claimed to be able to produce approximately 11 GWh per year. These types of projections come from power generation models which have been developed to predict power production of these large solar fields. However, when it comes to near-term forecasting of power generation with variable sources such as wind and solar, there is definitely room for improvement.

The two primary factors that …


A Survey Of Computer Graphics Facial Animation Methods: Comparing Traditional Approaches To Machine Learning Methods, Joseph A. Johnson Jun 2021

A Survey Of Computer Graphics Facial Animation Methods: Comparing Traditional Approaches To Machine Learning Methods, Joseph A. Johnson

Master's Theses

Human communications rely on facial expression to denote mood, sentiment, and intent. Realistic facial animation of computer graphic models of human faces can be difficult to achieve as a result of the many details that must be approximated in generating believable facial expressions. Many theoretical approaches have been researched and implemented to create more and more accurate animations that can effectively portray human emotions. Even though many of these approaches are able to generate realistic looking expressions, they typically require a lot of artistic intervention to achieve a believable result. To reduce the intervention needed to create realistic facial animation, …


Application Of Machine Learning Techniques To Forecast Harmful Algal Blooms In Gulf Of Mexico, Bala Tripura Sundari Yerrapothu May 2021

Application Of Machine Learning Techniques To Forecast Harmful Algal Blooms In Gulf Of Mexico, Bala Tripura Sundari Yerrapothu

Master's Theses

The Harmful Algal Blooms (HABs) forecast is crucial for the mitigation of health hazards and to inform actions for the protection of ecosystems and fisheries in the Gulf of Mexico (GoM). For the sake of simplicity of our application we assume ocean color satellite imagery from the National Oceanic and Atmospheric Administration as a proxy for HABs.

In this study we use a deep neural network trained on the 2-Dimensional time series proxy data to provide a forecast of the HABs’ manifestations in the GoM.Our approach analyzes between both spatial and temporal features simultaneously. In addition, the network also helps …


Node Classification On Relational Graphs Using Deep-Rgcns, Nagasai Chandra Mar 2021

Node Classification On Relational Graphs Using Deep-Rgcns, Nagasai Chandra

Master's Theses

Knowledge Graphs are fascinating concepts in machine learning as they can hold usefully structured information in the form of entities and their relations. Despite the valuable applications of such graphs, most knowledge bases remain incomplete. This missing information harms downstream applications such as information retrieval and opens a window for research in statistical relational learning tasks such as node classification and link prediction. This work proposes a deep learning framework based on existing relational convolutional (R-GCN) layers to learn on highly multi-relational data characteristic of realistic knowledge graphs for node property classification tasks. We propose a deep and improved variant, …


Adaptive Learning Technique For Facial Recognition, Rachana Dineshkumar Bumb Dec 2020

Adaptive Learning Technique For Facial Recognition, Rachana Dineshkumar Bumb

Master's Theses

This research describes the adaptive learning technique for facial recognition. It is a common practice in convolutional neural network(CNN) based facial recognition to save its trained result on a large dataset and then load and apply it to ongoing facial recognition tasks. This generally used method lacks adaptation, and the ongoing evolution of new knowledge poses a key technical challenge. In this research, we propose a continued learning technique to incorporate new knowledge derived in each facial recognition process. A positive recognition with confidence score is assigned, and the image associated with this confidence is added to the image dataset …


Transfer Learning For Hyperspectral Images Utilizing Channel Selection Techniques And Ensemble Methods, Scott Daniel Vogel Dec 2020

Transfer Learning For Hyperspectral Images Utilizing Channel Selection Techniques And Ensemble Methods, Scott Daniel Vogel

Master's Theses

Hyperspectral images contain information from a wider range of the electromagnetic spectrum than natural images which gives them potential for better classification ability. However, hyperspectral datasets are typically small due to the expensive equipment needed to obtain the images, which can limit classification performance. One solution to this problem is transfer learning, in which a model trained on one dataset is reused for a separate dataset. Research has shown that transfer learning between hyperspectral datasets can give improved performance over models without transfer learning when training data are limited. Since extra hyperspectral data are not always available, the solution proposed …


Comparison Of Classification Algorithms And Undersampling Methods On Employee Churn Prediction: A Case Study Of A Tech Company, Heather Cooper Dec 2020

Comparison Of Classification Algorithms And Undersampling Methods On Employee Churn Prediction: A Case Study Of A Tech Company, Heather Cooper

Master's Theses

Churn prediction is a common data mining problem that many companies face across industries. More commonly, customer churn has been studied extensively within the telecommunications industry where there is low customer retention due to high market competition. Similar to customer churn, employee churn is very costly to a company and by not deploying proper risk mitigation strategies, profits cannot be maximized, and valuable employees may leave the company. The cost to replace an employee is exponentially higher than finding a replacement, so it is in any company’s best interest to prioritize employee retention.

This research combines machine learning techniques with …


Attentional Parsing Networks, Marcus Karr Dec 2020

Attentional Parsing Networks, Marcus Karr

Master's Theses

Convolutional neural networks (CNNs) have dominated the computer vision field since the early 2010s, when deep learning largely replaced previous approaches like hand-crafted feature engineering and hierarchical image parsing. Meanwhile transformer architectures have attained preeminence in natural language processing, and have even begun to supplant CNNs as the state of the art for some computer vision tasks.

This study proposes a novel transformer-based architecture, the attentional parsing network, that reconciles the deep learning and hierarchical image parsing approaches to computer vision. We recast unsupervised image representation as a sequence-to-sequence translation problem where image patches are mapped to successive layers …