Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences

Theses/Dissertations

2022

Machine learning

Institution
Publication

Articles 1 - 30 of 72

Full-Text Articles in Physical Sciences and Mathematics

A Neural Analysis-Synthesis Approach To Learning Procedural Audio Models, Danzel Serrano Dec 2022

A Neural Analysis-Synthesis Approach To Learning Procedural Audio Models, Danzel Serrano

Theses

The effective sound design of environmental sounds is crucial to demonstrating an immersive experience. Classical Procedural Audio (PA) models have been developed to give the sound designer a fast way to synthesize a specific class of environmental sounds in a physically accurate and computationally efficient manner. These models are controllable due to the choice of parameters from analyzing a class of sound. However, the resulting synthesis lacks the fidelity for the preferred immersive experience; thus, the sound designer would rather search through an extensive database for real recordings of a target sound class. This thesis proposes the Procedural audio Variational …


Algorithmic Solutions To Combat Online Fake News, Xinyi Zhou Dec 2022

Algorithmic Solutions To Combat Online Fake News, Xinyi Zhou

Dissertations - ALL

The unprecedented growth of new information producing, distributing, and consuming every moment on the Web has fostered the rise of ``fake news.'' Because of its detrimental effect on democracy, global economies, and public health, effectively combating online fake news has become an essential and urgent task.

This dissertation starts with making typological, theoretical, and empirical efforts to promote the public's comprehension of fake news and lay the foundation for algorithmically combating fake news. As there has been no universal definition of fake news, this dissertation discusses the definition of fake news from three dimensions: veracity, intention, and news, comparing it …


Probabilistic Forecasting Of Winter Mixed Precipitation Types In New York State Utilizing A Random Forest, Brian Chandler Filipiak Dec 2022

Probabilistic Forecasting Of Winter Mixed Precipitation Types In New York State Utilizing A Random Forest, Brian Chandler Filipiak

Legacy Theses & Dissertations (2009 - 2024)

Operational forecasters face a plethora of challenges when making a forecast; they must consider multiple data sources ranging from radar and satellites to surface and upper air observations, to numerical weather prediction output. Forecasts must be done in a limited window of time, which adds an additional layer of difficulty to the task. These challenges are exacerbated by winter mixed precipitation events where slight differences in thermodynamic profiles or changes in terrain create different precipitation types across small areas. In addition to being difficult to forecast, mixed precipitation events can have large-scale impacts on our society.


Improved Computational Prediction Of Function And Structural Representation Of Self-Cleaving Ribozymes With Enhanced Parameter Selection And Library Design, James D. Beck Dec 2022

Improved Computational Prediction Of Function And Structural Representation Of Self-Cleaving Ribozymes With Enhanced Parameter Selection And Library Design, James D. Beck

Boise State University Theses and Dissertations

Biomolecules could be engineered to solve many societal challenges, including disease diagnosis and treatment, environmental sustainability, and food security. However, our limited understanding of how mutational variants alter molecular structures and functional performance has constrained the potential of important technological advances, such as high-throughput sequencing and gene editing. Ribonuleic Acid (RNA) sequences are thought to play a central role within many of these challenges. Their continual discovery throughout all domains of life is evidence of their significant biological importance (Weinreb et al., 2016). The self-cleaving ribozyme is a class of noncoding Ribonuleic Acid (ncRNA) that has been useful for …


Atomlbs: An Atom Based Convolutional Neural Network For Druggable Ligand Binding Site Prediction, Md Ashraful Islam Dec 2022

Atomlbs: An Atom Based Convolutional Neural Network For Druggable Ligand Binding Site Prediction, Md Ashraful Islam

Theses and Dissertations

Despite advances in drug research and development, there are few and ineffective treatments for a variety of diseases. Virtual screening can drastically reduce costs and accelerate the drug discovery process. Binding site identification is one of the initial and most important steps in structure-based virtual screening. Identifying and defining protein cavities that are likely to bind to a small compound is the objective of this task. In this research, we propose four different convolutional neural networks for predicting ligand-binding sites in proteins. A parallel optimized data pipeline is created to enable faster training of these neural network models on minimal …


Divide-And-Conquer Distributed Learning: Privacy-Preserving Offloading Of Neural Network Computations, Lewis C.L. Brown Dec 2022

Divide-And-Conquer Distributed Learning: Privacy-Preserving Offloading Of Neural Network Computations, Lewis C.L. Brown

Graduate Theses and Dissertations

Machine learning has become a highly utilized technology to perform decision making on high dimensional data. As dataset sizes have become increasingly large so too have the neural networks to learn the complex patterns hidden within. This expansion has continued to the degree that it may be infeasible to train a model from a singular device due to computational or memory limitations of underlying hardware. Purpose built computing clusters for training large models are commonplace while access to networks of heterogeneous devices is still typically more accessible. In addition, with the rise of 5G networks, computation at the edge becoming …


Cyber Resilience Analytics For Cyber-Physical Systems, Md Ariful Haque Dec 2022

Cyber Resilience Analytics For Cyber-Physical Systems, Md Ariful Haque

Electrical & Computer Engineering Theses & Dissertations

Cyber-physical systems (CPSs) are complex systems that evolve from the integrations of components dealing with physical processes and real-time computations, along with networking. CPSs often incorporate approaches merging from different scientific fields such as embedded systems, control systems, operational technology, information technology systems (ITS), and cybernetics. Today critical infrastructures (CIs) (e.g., energy systems, electric grids, etc.) and other CPSs (e.g., manufacturing industries, autonomous transportation systems, etc.) are experiencing challenges in dealing with cyberattacks. Major cybersecurity concerns are rising around CPSs because of their ever-growing use of information technology based automation. Often the security concerns are limited to probability-based possible attack …


Machine Learning-Based Event Generator, Yasir Alanazi Dec 2022

Machine Learning-Based Event Generator, Yasir Alanazi

Computer Science Theses & Dissertations

Monte Carlo-based event generators have been the primary source for simulating particle collision experiments for the study of interesting physics scenarios. Monte Carlo generators rely on theoretical assumptions, which limit their ability to capture the full range of possible correlations between particle’s momenta. In addition, the simulations of the complete pipeline often take minutes to generate a single event even with the help of supercomputers.

In recent years, much attention has been devoted to the development of machine learning event generators. They demonstrate attractive advantages, including fast simulations, data compression, and being agnostic of theoretical assumptions. However, most of the …


Potential Of Vision Transformers For Advanced Driver-Assistance Systems: An Evaluative Approach, Andrew Katoch Oct 2022

Potential Of Vision Transformers For Advanced Driver-Assistance Systems: An Evaluative Approach, Andrew Katoch

Electronic Thesis and Dissertation Repository

In this thesis, we examine the performance of Vision Transformers concerning the current state of Advanced Driving Assistance Systems (ADAS). We explore the Vision Transformer model and its variants on the problems of vehicle computer vision. Vision transformers show performance competitive to convolutional neural networks but require much more training data. Vision transformers are also more robust to image permutations than CNNs. Additionally, Vision Transformers have a lower pre-training compute cost but can overfit on smaller datasets more easily than CNNs. Thus we apply this knowledge to tune Vision transformers on ADAS image datasets, including general traffic objects, vehicles, traffic …


Cnn-Based Dendrite Core Detection From Microscopic Images Of Directionally Solidified Ni-Base Alloys, Xiaoguang Li Oct 2022

Cnn-Based Dendrite Core Detection From Microscopic Images Of Directionally Solidified Ni-Base Alloys, Xiaoguang Li

Theses and Dissertations

Dendrite core is the center point of the dendrite. The information of dendrite core is very helpful for material scientists to analyze the properties of materials. Therefore, detecting the dendrite core is a very important task in the material science field. Meanwhile, because of some special properties of the dendrites, this task is also very challenging. Different from the typical detection problems in the computer vision field, detecting the dendrite core aims to detect a single point location instead of the bounding-box. As a result, the existing regressing bounding-box based detection methods can not work well on this task because …


Analyzing Microarchitectural Residue In Various Privilege Strata To Identify Computing Tasks, Tor J. Langehaug Sep 2022

Analyzing Microarchitectural Residue In Various Privilege Strata To Identify Computing Tasks, Tor J. Langehaug

Theses and Dissertations

Modern multi-tasking computer systems run numerous applications simultaneously. These applications must share hardware resources including the Central Processing Unit (CPU) and memory while maximizing each application’s performance. Tasks executing in this shared environment leave residue which should not reveal information. This dissertation applies machine learning and statistical analysis to evaluate task residue as footprints which can be correlated to identify tasks. The concept of privilege strata, drawn from an analogy with physical geology, organizes the investigation into the User, Operating System, and Hardware privilege strata. In the User Stratum, an adversary perspective is taken to build an interrogator program that …


Leveraging Subject Matter Expertise To Optimize Machine Learning Techniques For Air And Space Applications, Philip Y. Cho Sep 2022

Leveraging Subject Matter Expertise To Optimize Machine Learning Techniques For Air And Space Applications, Philip Y. Cho

Theses and Dissertations

We develop new machine learning and statistical methods that are tailored for Air and Space applications through the incorporation of subject matter expertise. In particular, we focus on three separate research thrusts that each represents a different type of subject matter knowledge, modeling approach, and application. In our first thrust, we incorporate knowledge of natural phenomena to design a neural network algorithm for localizing point defects in transmission electron microscopy (TEM) images of crystalline materials. In our second research thrust, we use Bayesian feature selection and regression to analyze the relationship between fighter pilot attributes and flight mishap rates. We …


The Interaction Of Different Primary Producers And Physical And Chemical Dynamics Of An Urban Shallow Lake, Majid Sahin Sep 2022

The Interaction Of Different Primary Producers And Physical And Chemical Dynamics Of An Urban Shallow Lake, Majid Sahin

Dissertations, Theses, and Capstone Projects

An artificial urban shallow lake, Prospect Park Lake (PPL), is situated on a terminal moraine in Brooklyn New York, and supplied with municipal water treated with ortho-phosphates. The constant input of the phosphate nutrient is the primary source of eutrophication in the lake. The numerous pools along the water course houses various aquatic phototrophs, which influence the water quality and the state of the system, driving conditions into favoring the survival of their species. In the first half of the dissertation, the focus of the project is on analyzing how the different primary producers in different regions of PPL affect …


Respiratory Pattern Analysis For Covid-19 Digital Screening Using Ai Techniques, Annita Tahsin Priyoti Aug 2022

Respiratory Pattern Analysis For Covid-19 Digital Screening Using Ai Techniques, Annita Tahsin Priyoti

Electronic Thesis and Dissertation Repository

Corona Virus (COVID-19) is a highly contagious respiratory disease that the World Health Organization (WHO) has declared a worldwide epidemic. This virus has spread worldwide, affecting various countries until now, causing millions of deaths globally. To tackle this public health crisis, medical professionals and researchers are working relentlessly, applying different techniques and methods. In terms of diagnosis, respiratory sound has been recognized as an indicator of one’s health condition. Our work is based on cough sound analysis. This study has included an in-depth analysis of the diagnosis of COVID-19 based on human cough sound. Based on cough audio samples from …


Design And Analysis Of Strategic Behavior In Networks, Sixie Yu Aug 2022

Design And Analysis Of Strategic Behavior In Networks, Sixie Yu

McKelvey School of Engineering Theses & Dissertations

Networks permeate every aspect of our social and professional life.A networked system with strategic individuals can represent a variety of real-world scenarios with socioeconomic origins. In such a system, the individuals' utilities are interdependent---one individual's decision influences the decisions of others and vice versa. In order to gain insights into the system, the highly complicated interactions necessitate some level of abstraction. To capture the otherwise complex interactions, I use a game theoretic model called Networked Public Goods (NPG) game. I develop a computational framework based on NPGs to understand strategic individuals' behavior in networked systems. The framework consists of three …


Human-Centered Machine Learning: Algorithm Design And Human Behavior, Wei Tang Aug 2022

Human-Centered Machine Learning: Algorithm Design And Human Behavior, Wei Tang

McKelvey School of Engineering Theses & Dissertations

Machine learning is increasingly engaged in a large number of important daily decisions and has great potential to reshape various sectors of our modern society. To fully realize this potential, it is important to understand the role that humans play in the design of machine learning algorithms and investigate the impacts of the algorithm on humans.

Towards the understanding of such interactions between humans and algorithms, this dissertation takes a human-centric perspective and focuses on investigating the interplay between human behavior and algorithm design. Accounting for the roles of humans in algorithm design creates unique challenges. For example, humans might …


Classification Models For 2,4-D Formulations In Damaged Enlist Crops Through The Application Of Ftir Spectroscopy And Machine Learning Algorithms, Benjamin Blackburn Aug 2022

Classification Models For 2,4-D Formulations In Damaged Enlist Crops Through The Application Of Ftir Spectroscopy And Machine Learning Algorithms, Benjamin Blackburn

Theses and Dissertations

With new 2,4-Dichlorophenoxyacetic acid (2,4-D) tolerant crops, increases in off-target movement events are expected. New formulations may mitigate these events, but standard lab techniques are ineffective in identifying these 2,4-D formulations. Using Fourier-transform infrared spectroscopy and machine learning algorithms, research was conducted to classify 2,4-D formulations in treated herbicide-tolerant soybeans and cotton and observe the influence of leaf treatment status and collection timing on classification accuracy. Pooled Classification models using k-nearest neighbor classified 2,4-D formulations with over 65% accuracy in cotton and soybean. Tissue collected 14 DAT and 21 DAT for cotton and soybean respectively produced higher accuracies than the …


Secrep : A Framework For Automating The Extraction And Prioritization Of Security Requirements Using Machine Learning And Nlp Techniques, Shada Khanneh Aug 2022

Secrep : A Framework For Automating The Extraction And Prioritization Of Security Requirements Using Machine Learning And Nlp Techniques, Shada Khanneh

Theses, Dissertations and Culminating Projects

Gathering and extracting security requirements adequately requires extensive effort, experience, and time, as large amounts of data need to be analyzed. While many manual and academic approaches have been developed to tackle the discipline of Security Requirements Engineering (SRE), a need still exists for automating the SRE process. This need stems mainly from the difficult, error-prone, and time-consuming nature of traditional and manual frameworks. Machine learning techniques have been widely used to facilitate and automate the extraction of useful information from software requirements documents and artifacts. Such approaches can be utilized to yield beneficial results in automating the process of …


Deep Active Genetic Learning With Evidential Uncertainty For Agriculture Crops And Lake Water Quality Assessment, Oguz M. Aranay Aug 2022

Deep Active Genetic Learning With Evidential Uncertainty For Agriculture Crops And Lake Water Quality Assessment, Oguz M. Aranay

Legacy Theses & Dissertations (2009 - 2024)

Despite significant advancements in the field of machine learning, there are two issues that still require further exploration. First, how to learn from a small dataset; and second, how to select appropriate features from the data. Although there exist many techniques to address these issues, choosing a combination of the techniques from these two groups is challenging, and worth investigating. To address these concerns, this thesis presents a learning framework that is based on a deep learning model utilizing active learning (with evidential uncertainty as a basis for acquisition function) for the first issue and a genetic algorithm for the …


Stability And Differential Privacy Of Stochastic Gradient Methods, Zhenhuan Yang Aug 2022

Stability And Differential Privacy Of Stochastic Gradient Methods, Zhenhuan Yang

Legacy Theses & Dissertations (2009 - 2024)

Recently there are a considerable amount of work devoted to the study of the algorithmic stability as well as differential privacy (DP) for stochastic gradient methods (SGM). However, most of the existing work focus on the empirical risk minimization (ERM) and the population risk minimization problems. In this paper, we study two types of optimization problems that enjoy wide applications in modern machine learning, namely the minimax problem and the pairwise learning problem.


Data Collection And Machine Learning Methods For Automated Pedestrian Facility Detection And Mensuration, Joseph Bailey Luttrell Iv Aug 2022

Data Collection And Machine Learning Methods For Automated Pedestrian Facility Detection And Mensuration, Joseph Bailey Luttrell Iv

Dissertations

Large-scale collection of pedestrian facility (crosswalks, sidewalks, etc.) presence data is vital to the success of efforts to improve pedestrian facility management, safety analysis, and road network planning. However, this kind of data is typically not available on a large scale due to the high labor and time costs that are the result of relying on manual data collection methods. Therefore, methods for automating this process using techniques such as machine learning are currently being explored by researchers. In our work, we mainly focus on machine learning methods for the detection of crosswalks and sidewalks from both aerial and street-view …


Towards Making Transformer-Based Language Models Learn How Children Learn, Yousra Mahdy Aug 2022

Towards Making Transformer-Based Language Models Learn How Children Learn, Yousra Mahdy

Boise State University Theses and Dissertations

Transformer-based Language Models (LMs), learn contextual meanings for words using a huge amount of unlabeled text data. These models show outstanding performance on various Natural Language Processing (NLP) tasks. However, what the LMs learn is far from what the meaning is for humans, partly due to the fact that humans can differentiate between concrete and abstract words, but language models make no distinction. Concrete words are words that have a physical representation in the world such as “chair”, while abstract words are ideas such as “democracy”. The process of learning word meanings starts from early childhood when children acquire their …


Solving The Challenges Of Concept Drift In Data Stream Classification., Hanqing Hu Aug 2022

Solving The Challenges Of Concept Drift In Data Stream Classification., Hanqing Hu

Electronic Theses and Dissertations

The rise of network connected devices and applications leads to a significant increase in the volume of data that are continuously generated overtime time, called data streams. In real world applications, storing the entirety of a data stream for analyzing later is often not practical, due to the data stream’s potentially infinite volume. Data stream mining techniques and frameworks are therefore created to analyze streaming data as they arrive. However, compared to traditional data mining techniques, challenges unique to data stream mining also emerge, due to the high arrival rate of data streams and their dynamic nature. In this dissertation, …


Data-Driven Research On Engineering Design Thinking And Behaviors In Computer-Aided Systems Design: Analysis, Modeling, And Prediction, Molla Hafizur Rahman Aug 2022

Data-Driven Research On Engineering Design Thinking And Behaviors In Computer-Aided Systems Design: Analysis, Modeling, And Prediction, Molla Hafizur Rahman

Graduate Theses and Dissertations

Research on design thinking and design decision-making is vital for discovering and utilizing beneficial design patterns, strategies, and heuristics of human designers in solving engineering design problems. It is also essential for the development of new algorithms embedded with human intelligence and can facilitate human-computer interactions. However, modeling design thinking is challenging because it takes place in the designer’s mind, which is intricate, implicit, and tacit. For an in-depth understanding of design thinking, fine-grained design behavioral data are important because they are the critical link in studying the relationship between design thinking, design decisions, design actions, and design performance. Therefore, …


Emotion Detection Using An Ensemble Model Trained With Physiological Signals And Inferred Arousal-Valence States, Matthew Nathanael Gray Aug 2022

Emotion Detection Using An Ensemble Model Trained With Physiological Signals And Inferred Arousal-Valence States, Matthew Nathanael Gray

Electrical & Computer Engineering Theses & Dissertations

Affective computing is an exciting and transformative field that is gaining in popularity among psychologists, statisticians, and computer scientists. The ability of a machine to infer human emotion and mood, i.e. affective states, has the potential to greatly improve human-machine interaction in our increasingly digital world. In this work, an ensemble model methodology for detecting human emotions across multiple subjects is outlined. The Continuously Annotated Signals of Emotion (CASE) dataset, which is a dataset of physiological signals labeled with discrete emotions from video stimuli as well as subject-reported continuous emotions, arousal and valence, from the circumplex model, is used for …


Developing Artificial Intelligence And Machine Learning To Support Primary Care Research And Practice, Jacqueline K. Kueper Jul 2022

Developing Artificial Intelligence And Machine Learning To Support Primary Care Research And Practice, Jacqueline K. Kueper

Electronic Thesis and Dissertation Repository

This thesis was motivated by the potential to use "everyday data", especially that collected in electronic health records (EHRs) as part of healthcare delivery, to improve primary care for clients facing complex clinical and/or social situations. Artificial intelligence (AI) techniques can identify patterns or make predictions with these data, producing information to learn about and inform care delivery. Our first objective was to understand and critique the body of literature on AI and primary care. This was achieved through a scoping review wherein we found the field was at an early stage of maturity, primarily focused on clinical decision support …


Learning From Machines: Insights In Forest Transpiration Using Machine Learning Methods, Morgan Tholl Jul 2022

Learning From Machines: Insights In Forest Transpiration Using Machine Learning Methods, Morgan Tholl

Dissertations and Theses

Machine learning has been used as a tool to model transpiration for individual sites, but few models are capable of generalizing to new locations without calibration to site data. Using the global SAPFLUXNET database, 95 tree sap flow data sites were grouped using three clustering strategies: by biome, by tree functional type, and through use of a k-means unsupervised clustering algorithm. Two supervised machine learning algorithms, a random forest algorithm and a neural network algorithm, were used to build machine learning models that predicted transpiration for each cluster. The performance and feature importance in each model were analyzed and compared …


Unpaired Style Transfer Conditional Generative Adversarial Network For Scanned Document Generation, David Jonathan Hawbaker Jul 2022

Unpaired Style Transfer Conditional Generative Adversarial Network For Scanned Document Generation, David Jonathan Hawbaker

Dissertations and Theses

Neural networks are a powerful machine learning tool, especially when trained on a large dataset of relevant high-quality data. Generative adversarial networks, image super resolution and most other image manipulation neural networks require a dataset of images and matching target images for training. Collecting and compiling that data can be time consuming and expensive. This work explores an approach for building a dataset of paired document images with a matching scanned version of each document without physical printers or scanners. A dataset of these document image pairs could be used to train a generative adversarial network or image super resolution …


Image-Based Crack Detection By Extracting Depth Of The Crack Using Machine Learning, Nishat Tabassum Jul 2022

Image-Based Crack Detection By Extracting Depth Of The Crack Using Machine Learning, Nishat Tabassum

Theses and Dissertations

Concrete structures have been a major aspect of social infrastructure since the ancient Roman times, so they have been used for many centuries. Concrete is used for the durability and support it provides to buildings and bridges. Assessing the state of these structures is important in preserving the longevity of structures and the safety of the public. Detecting cracks in their early stage allows repairs to be made without the need to replace the whole structure, so it reduces the cost. Traditional methods are slowly falling behind as technology advances and an increase in demand for a practical method of …


Leveraging Context Patterns For Medical Entity Classification, Garrett Johnston Jun 2022

Leveraging Context Patterns For Medical Entity Classification, Garrett Johnston

Computer Science Senior Theses

The ability of patients to understand health-related text is important for optimal health outcomes. A system that can automatically annotate medical entities could help patients better understand health-related text. Such a system would also accelerate manual data annotation for this low-resource domain as well as assist in down- stream medical NLP tasks such as finding textual similarity, identifying conflicting medical advice, and aspect-based sentiment analysis. In this work, we investigate a state-of-the-art entity set expansion model, BootstrapNet, for the task of medical entity classification on a new dataset of medical advice text. We also propose EP SBERT, a simple model …