Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

2022

Machine Learning

Discipline
Institution
Publication
Publication Type
File Type

Articles 91 - 117 of 117

Full-Text Articles in Physical Sciences and Mathematics

Third-Integer Resonant Extraction Regulation System For Mu2e, Aakaash Narayanan Jan 2022

Third-Integer Resonant Extraction Regulation System For Mu2e, Aakaash Narayanan

Graduate Research Theses & Dissertations

A third-integer resonant slow extraction system is being developed for Fermilab's Delivery Ring to deliver protons to the upcoming Mu2e experiment. The timescale of the extraction (or spill) duration is 43 milliseconds, which is extremely short and unprecedented. Additionally, the experiment's strict and challenging requirements on the quality of the spill at this time scale has led to the development of a new Spill Regulation System (SRS) design. The SRS primarily consists of three components - slow regulation, fast regulation, and harmonic content suppressor. Contributions to the first two components of the SRS, i.e., Slow Regulation and Fast Regulation subsystems, …


Covidalert - A Wristwatch-Based System To Alert Users From Face Touching, Mrinmoy Roy Jan 2022

Covidalert - A Wristwatch-Based System To Alert Users From Face Touching, Mrinmoy Roy

Graduate Research Theses & Dissertations

Worldwide 219 million people have been infected and 4.5 million have lost their lives in ongoing Covid-19 pandemic. Until vaccines became widely available, precautions and safety measures like wearing masks, physical distancing, avoiding face touching were some of the primary means to curb the spread of virus. Face touching is a compulsive human behavior that can not be prevented without constantly making a conscious effort, even then it is inevitable. To address this problem, we have designed a smartwatch-based solution, CovidAlert, that leverages Random Forest algorithm trained on accelerometer and gyroscope data from the smartwatch to detect hand transition to …


Analyzing Behavioral Adaptation To Covid-19 And Return To Pre-Pandemic Baselines In A Cohort Of College Seniors, Vlado Vojdanovski Jan 2022

Analyzing Behavioral Adaptation To Covid-19 And Return To Pre-Pandemic Baselines In A Cohort Of College Seniors, Vlado Vojdanovski

Computer Science Senior Theses

As the critical phase of the COVID-19 pandemic seems to be winding down, it is important to analyze the adjustment to COVID-19 and return to normalcy of various populations. In this study we focus on the behavioral adjustments exhibited by a cohort of N=114 college seniors. To infer COVID-19 adjustment we compare the 2021 year (second year of COVID-19) to the 2020 year (first year of COVID-19) and 2019 (prepandemic baseline year). We begin with a broad analysis between the second and first covid year, finding that the second year of COVID-19 shows significant returns to pre-pandemic baselines on multiple …


Application Of Machine Learning In Geophysics: Ranking Teleseismic Shear Wave Splitting Measurements And Classifying Different Types Of Earthquakes, Yanwei Zhang Jan 2022

Application Of Machine Learning In Geophysics: Ranking Teleseismic Shear Wave Splitting Measurements And Classifying Different Types Of Earthquakes, Yanwei Zhang

Doctoral Dissertations

"During the past decades, applications of Machine Learning have been explosively developed to solve various academic and industrial problems, and over-human performance has been shown in diverse areas. In geophysical research, Machine Learning, especially Convolutional Neural Network (CNN), has been applied in numerous studies and demonstrated considerable potential. In this study, we applied CNN to solve two geophysical problems, ranking teleseismic shear splitting (SWS) measurements and classifying different types of earthquakes.

For ranking teleseismic SWS measurements, we utilized a CNN-based method to automatically select reliable SWS measurements. The CNN was trained by human-verified teleseismic SWS measurements and tested using synthetic …


Genetic Algorighm Representation Selection Impact On Binary Classification Problems, Stephen V. Maldonado Jan 2022

Genetic Algorighm Representation Selection Impact On Binary Classification Problems, Stephen V. Maldonado

Honors Undergraduate Theses

In this thesis, we explore the impact of problem representation on the ability for the genetic algorithms (GA) to evolve a binary prediction model to predict whether a physical therapist is paid above or below the median amount from Medicare. We explore three different problem representations, the vector GA (VGA), the binary GA (BGA), and the proportional GA (PGA). We find that all three representations can produce models with high accuracy and low loss that are better than Scikit-Learn’s logistic regression model and that all three representations select the same features; however, the PGA representation tends to create lower weights …


A Machine Learning Approach To Intended Motion Prediction For Upper Extremity Exoskeletons, Justin Berdell Jan 2022

A Machine Learning Approach To Intended Motion Prediction For Upper Extremity Exoskeletons, Justin Berdell

Graduate Research Theses & Dissertations

A fully solid-state, software-defined, one-handed, handle-type control device built around a machine-learning (ML) model that provides intuitive and simultaneous control in position and orientation each in a full three degrees-of-freedom (DOF) is proposed in this paper. The device, referred to as the “Smart Handle”, and it is compact, lightweight, and only reliant on low-cost and readily available sensors and materials for construction. Mobility chairs for persons with motor difficulties could make use of a control device that can learn to recognize arbitrary inputs as control commands. Upper-extremity exoskeletons used in occupational settings and rehabilitation require a natural control device like …


Forecasting Bitcoin, Ethereum And Litecoin Prices Using Machine Learning, Sai Prabhu Jaligama Jan 2022

Forecasting Bitcoin, Ethereum And Litecoin Prices Using Machine Learning, Sai Prabhu Jaligama

Graduate Research Theses & Dissertations

This research aims to predict the cryptocurrencies Bitcoin, Litecoin and Ethereum using Time Series Modelling with daily data of closing price from 16th of October 2018 to 9th of September 2021for a total of 1073 days. Augmented Dickey Fuller test was first used to check stationarity of the time series, then two forecasting algorithms called ARIMA, and PROPHET were used to make predictions. The findings show similar results for both the models for each of Bitcoin, Ethereum and Litecoin. The results achieved show modelling cryptocurrencies which are volatile using a single variable produces satisfying results.


Interpretable Machine Learning For Self-Service High-Risk Decision Making, Charles Recaido Jan 2022

Interpretable Machine Learning For Self-Service High-Risk Decision Making, Charles Recaido

All Master's Theses

This research contributes to interpretable machine learning via visual knowledge discovery in General Line Coordinates (GLC). The concepts of hyperblocks as interpretable dataset units and GLC are combined to create a visual self-service machine learning model. Two variants of GLC known as Dynamic Scaffold Coordinates (DSC) are proposed. DSC1 and DSC2 can map in a lossless manner multiple dataset attributes to a single two-dimensional (X, Y) Cartesian plane using a dynamic scaffolding graph construction algorithm.

Hyperblock analysis is used to determine visually appealing dataset attribute orders and to reduce line occlusion. It is shown that hyperblocks can generalize decision tree …


A Machine Learning Algorithm Improves Surface Freeze-Thaw Classification, Fredrick Bunt Jan 2022

A Machine Learning Algorithm Improves Surface Freeze-Thaw Classification, Fredrick Bunt

Graduate Student Theses, Dissertations, & Professional Papers

The frozen or thawed state of the land surface is an important factor affecting a wide range of natural processes such as surface water movement, the carbon cycle, and ecosystem development. It is also important for human endeavors such as permafrost engineering and agricultural planning. This makes having an accurate record important. The Freeze-Thaw (FT) Earth System Data Record (FT-ESDR) is a global, daily product that strives to be a reliable record of the FT ground state. In its current form, the FT-ESDR uses annual regression analysis of reanalysis surface air temperatures (SAT) and brightness temperatures (Tb) at each grid …


Caption And Image Based Next-Word Auto-Completion, Meet Patel Jan 2022

Caption And Image Based Next-Word Auto-Completion, Meet Patel

Master's Projects

With the increasing number of options or choices in terms of entities like products, movies, songs, etc. which are now available to users, they try to save time by looking for an application or system that provides automatic recommendations. Recommender systems are automated computing processes that leverage concepts of Machine Learning, Data Mining and Artificial Intelligence towards generating product recommendations based on a user’s preferences. These systems have given a significant boost to businesses across multiple segments as a result of reduced human intervention. One similar aspect of this is content writing. It would save users a lot of time …


Searching For Anomalous Extensive Air Showers Using The Pierre Auger Observatory Fluorescence Detector, Andrew Puyleart Jan 2022

Searching For Anomalous Extensive Air Showers Using The Pierre Auger Observatory Fluorescence Detector, Andrew Puyleart

Dissertations, Master's Theses and Master's Reports

Anomalous extensive air showers have yet to be detected by cosmic ray observatories. Fluorescence detectors provide a way to view the air showers created by cosmic rays with primary energies reaching up to hundreds of EeV . The resulting air showers produced by these highly energetic collisions can contain features that deviate from average air showers. Detection of these anomalous events may provide information into unknown regions of particle physics, and place constraints on cross-sectional interaction lengths of protons. In this dissertation, I propose measurements of extensive air shower profiles that are used in a machine learning pipeline to distinguish …


Learning Robot Motion From Creative Human Demonstration, Charles C. Dietzel Jan 2022

Learning Robot Motion From Creative Human Demonstration, Charles C. Dietzel

Theses and Dissertations

This thesis presents a learning from demonstration framework that enables a robot to learn and perform creative motions from human demonstrations in real-time. In order to satisfy all of the functional requirements for the framework, the developed technique is comprised of two modular components, which integrate together to provide the desired functionality. The first component, called Dancing from Demonstration (DfD), is a kinesthetic learning from demonstration technique. This technique is capable of playing back newly learned motions in real-time, as well as combining multiple learned motions together in a configurable way, either to reduce trajectory error or to generate entirely …


Smart City Management Using Machine Learning Techniques, Mostafa Zaman Jan 2022

Smart City Management Using Machine Learning Techniques, Mostafa Zaman

Theses and Dissertations

In response to the growing urban population, "smart cities" are designed to improve people's quality of life by implementing cutting-edge technologies. The concept of a "smart city" refers to an effort to enhance a city's residents' economic and environmental well-being via implementing a centralized management system. With the use of sensors and actuators, smart cities can collect massive amounts of data, which can improve people's quality of life and design cities' services. Although smart cities contain vast amounts of data, only a percentage is used due to the noise and variety of the data sources. Information and communication technology (ICT) …


Hydrocarbon Pay Zone Prediction Using Ai Neural Network Modeling., Darren D. Guedon Jan 2022

Hydrocarbon Pay Zone Prediction Using Ai Neural Network Modeling., Darren D. Guedon

Graduate Theses, Dissertations, and Problem Reports

This paper captures the ability of AI neural network technology to analyze petrophysical datasets for pattern recognition and accurate prediction of the pay zone of a vertical well from the Santa Fe field in Kansas.

During this project, data from 10 completed wells in the Santa Fe field were gathered, resulting in a dataset with 25,580 records, ten predictors (logs data), and a single binary output (Yes or No) to identify the availability of Hydrocarbon over a half feet depth segment in the well. Several models composed of different predictors combinations were also tested to determine how impactful some logs …


Batch Normalization Preconditioning For Neural Network Training, Susanna Luisa Gertrude Lange Jan 2022

Batch Normalization Preconditioning For Neural Network Training, Susanna Luisa Gertrude Lange

Theses and Dissertations--Mathematics

Batch normalization (BN) is a popular and ubiquitous method in deep learning that has been shown to decrease training time and improve generalization performance of neural networks. Despite its success, BN is not theoretically well understood. It is not suitable for use with very small mini-batch sizes or online learning. In this work, we propose a new method called Batch Normalization Preconditioning (BNP). Instead of applying normalization explicitly through a batch normalization layer as is done in BN, BNP applies normalization by conditioning the parameter gradients directly during training. This is designed to improve the Hessian matrix of the loss …


A Low-Cost Machine Learning Based Network Intrusion Detection System With Data Privacy Preservation, Jyoti Fakirah, Lauhim Mahfuz Zishan, Roshni Mooruth, Michael L. Johnstone, Wencheng Yang Jan 2022

A Low-Cost Machine Learning Based Network Intrusion Detection System With Data Privacy Preservation, Jyoti Fakirah, Lauhim Mahfuz Zishan, Roshni Mooruth, Michael L. Johnstone, Wencheng Yang

Research outputs 2022 to 2026

Network intrusion is a well-studied area of cyber security. Current machine learning-based network intrusion detection systems (NIDSs) monitor network data and the patterns within those data but at the cost of presenting significant issues in terms of privacy violations which may threaten end-user privacy. Therefore, to mitigate risk and preserve a balance between security and privacy, it is imperative to protect user privacy with respect to intrusion data. Moreover, cost is a driver of a machine learning-based NIDS because such systems are increasingly being deployed on resource-limited edge devices. To solve these issues, in this paper we propose a NIDS …


Predicting Outcomes Of El Clásico Using Random Forests And Extreme Gradient Boosting, Emanuel Jarquin Jan 2022

Predicting Outcomes Of El Clásico Using Random Forests And Extreme Gradient Boosting, Emanuel Jarquin

CMC Senior Theses

In the modern era, sports betting is becoming increasingly popular. This is especially true in the realm of soccer (or ‘football’ as it is known outside the United States). As a result, the concept of attempting to predict the outcomes of soccer matches using machine learning has garnered much attention in recent years. In this thesis, I utilize well-known machine learning techniques to predict the outcomes of El Clásico matchups and compare the predictive performance of these techniques. The predictive methods employed for this thesis are random forests using the party package in R and extreme gradient boosting using the …


Integrated Gradients Is A Nonlinear Generalization Of The Industry Standard Approach To Variable Attribution For Credit Risk Models, Jonathan Boardman, Md Shafiul Alam, Xiao Huang, Ying Xie Jan 2022

Integrated Gradients Is A Nonlinear Generalization Of The Industry Standard Approach To Variable Attribution For Credit Risk Models, Jonathan Boardman, Md Shafiul Alam, Xiao Huang, Ying Xie

Published and Grey Literature from PhD Candidates

In modern society, epistemic uncertainty limits trust in financial relationships, necessitating transparency and accountability mechanisms for both consumers and lenders. One upshot is that credit risk assessments must be explainable to the consumer. In the United States regulatory milieu, this entails both the identification of key factors in a decision and the provision of consistent actions that would improve standing. The traditionally accepted approach to explainable credit risk modeling involves generating scores with Generalized Linear Models (GLMs) - usually logistic regression, calculating the contribution of each predictor to the total points lost from the theoretical maximum, and generating reason codes …


Reinforcement Learning: Low Discrepancy Action Selection For Continuous States And Actions, Jedidiah Lindborg Jan 2022

Reinforcement Learning: Low Discrepancy Action Selection For Continuous States And Actions, Jedidiah Lindborg

Electronic Theses and Dissertations

In reinforcement learning the process of selecting an action during the exploration or exploitation stage is difficult to optimize. The purpose of this thesis is to create an action selection process for an agent by employing a low discrepancy action selection (LDAS) method. This should allow the agent to quickly determine the utility of its actions by prioritizing actions that are dissimilar to ones that it has already picked. In this way the learning process should be faster for the agent and result in more optimal policies.


The Burning Bush: Linking Lidar-Derived Shrub Architecture To Flammability, Michelle S. Bester Jan 2022

The Burning Bush: Linking Lidar-Derived Shrub Architecture To Flammability, Michelle S. Bester

Graduate Theses, Dissertations, and Problem Reports

Light detection and ranging (LiDAR) and terrestrial laser scanning (TLS) sensors are powerful tools for characterizing vegetation structure and for constructing three-dimensional (3D) models of trees, also known as quantitative structural models (QSM). 3D models and structural traits derived from them provide valuable information for biodiversity conservation, forest management, and fire behavior modeling. However, vegetation studies and 3D modeling methodologies often only focus on the forest canopy, with little attention given to understory vegetation. In particular, 3D structural information of shrubs is limited or not included in fire behavior models. Yet, understory vegetation is an important component of forested ecosystems, …


From Evaluating The Performance Of Approximations In Density Functional Theory To A Machine Learning Design, Pedram Tavazohi Jan 2022

From Evaluating The Performance Of Approximations In Density Functional Theory To A Machine Learning Design, Pedram Tavazohi

Graduate Theses, Dissertations, and Problem Reports

Density-functional theory (DFT) has gained popularity because of its ability to predict the properties of a large group of materials a priori. Even though DFT is exact, there are inaccuracies introduced into the theory due to the approximations in the exchange-correlation (XC) functionals. Over the 50 years of its existence, scientists have tried to improve the design of the XC functionals. The errors introduced by these functionals are not consistent across all types of solid-state materials. In this project, a high throughput framework was utilized to compare the theoretical DFT predictions with the experimental results available in the Inorganic Crystal …


Efficacy Of Reported Issue Times As A Means For Effort Estimation, Paul Phillip Maclean Jan 2022

Efficacy Of Reported Issue Times As A Means For Effort Estimation, Paul Phillip Maclean

Graduate Theses, Dissertations, and Problem Reports

Software effort is a measure of manpower dedicated to developing and maintaining and software. Effort estimation can help project managers monitor their software, teams, and timelines. Conversely, improper effort estimation can result in budget overruns, delays, lost contracts, and accumulated Technical Debt (TD). Issue Tracking Systems (ITS) have become mainstream project management tools, with over 65,000 companies using Jira alone. ITS are an untapped resource for issue resolution effort research. Related work investigates issue effort for specific issue types, usually Bugs or similar. They model their developer-documented issue resolution times using features from the issues themselves. This thesis explores a …


Novel Natural Language Processing Models For Medical Terms And Symptoms Detection In Twitter, Farahnaz Golrooy Motlagh Jan 2022

Novel Natural Language Processing Models For Medical Terms And Symptoms Detection In Twitter, Farahnaz Golrooy Motlagh

Browse all Theses and Dissertations

This dissertation focuses on disambiguation of language use on Twitter about drug use, consumption types of drugs, drug legalization, ontology-enhanced approaches, and prediction analysis of data-driven by developing novel NLP models. Three technical aims comprise this work: (a) leveraging pattern recognition techniques to improve the quality and quantity of crawled Twitter posts related to drug abuse; (b) using an expert-curated, domain-specific DsOn ontology model that improve knowledge extraction in the form of drug-to-symptom and drug-to-side effect relations; and (c) modeling the prediction of public perception of the drug’s legalization and the sentiment analysis of drug consumption on Twitter. We collected …


Deep Understanding Of Technical Documents : Automated Generation Of Pseudocode From Digital Diagrams & Analysis/Synthesis Of Mathematical Formulas, Nikolaos Gkorgkolis Jan 2022

Deep Understanding Of Technical Documents : Automated Generation Of Pseudocode From Digital Diagrams & Analysis/Synthesis Of Mathematical Formulas, Nikolaos Gkorgkolis

Browse all Theses and Dissertations

The technical document is an entity that consists of several essential and interconnected parts, often referred to as modalities. Despite the extensive attention that certain parts have already received, per say the textual information, there are several aspects that severely under researched. Two such modalities are the utility of diagram images and the deep automated understanding of mathematical formulas. Inspired by existing holistic approaches to the deep understanding of technical documents, we develop a novel formal scheme for the modelling of digital diagram images. This extends to a generative framework that allows for the creation of artificial images and their …


Exploiting Context In Linear Influence Games: Improved Algorithms For Model Selection And Performance Evaluation, Daniel Little Jan 2022

Exploiting Context In Linear Influence Games: Improved Algorithms For Model Selection And Performance Evaluation, Daniel Little

Honors Projects

In the recent past, extensive experimental works have been performed to predict joint voting outcomes in Congress based on a game-theoretic model of voting behavior known as Linear Influence Games. In this thesis, we improve the model selection and evaluation procedure of these past experiments. First, we implement two methods, Nested Cross-Validation with Tuning (Nested CVT) and Bootstrap Bias Corrected Cross-Validation (BBC-CV), to perform model selection and evaluation with less bias than previous methods. While Nested CVT is a commonly used method, it requires learning a large number of models; BBC-CV is a more recent method boasting less computational cost. …


Graph Neural Networks For Malware Classification, Vrinda Malhotra Jan 2022

Graph Neural Networks For Malware Classification, Vrinda Malhotra

Master's Projects

Malware is a growing threat to the digital world. The first step to managing this threat is malware detection and classification. While traditional techniques rely on static or dynamic analysis of malware, the generation of these features requires expert knowledge. Function call graphs (FCGs) consist of program functions as their nodes and their interprocedural calls as their edges, providing a wealth of knowledge that can be utilized to classify malware without feature extraction that requires experts. This project treats malware classification as a graph classification problem, setting node features using the Local Degree Profile (LDP) model and using different graph …


A Novel Handover Method Using Destination Prediction In 5g-V2x Networks, Pooja Shyamsundar Jan 2022

A Novel Handover Method Using Destination Prediction In 5g-V2x Networks, Pooja Shyamsundar

Master's Projects

This paper proposes a novel approach to handover optimization in fifth generation vehicular networks. A key principle in designing fifth generation vehicular network technology is continuous connectivity. This makes it important to ensure that there are no gaps in communication for mobile user equipment. Handovers can cause disruption in connectivity as the process involves switching from one base station to another. Issues in the handover process include poor load management for moving traffic resulting in low bandwidth or connectivity gaps, too many hops resulting in multiple unneccessary handovers, short dwell times and ineffective base station selection resulting in delays and …