Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Theses/Dissertations

2020

Machine learning

Discipline
Institution
Publication

Articles 1 - 30 of 260

Full-Text Articles in Entire DC Network

Countering Internet Packet Classifiers To Improve User Online Privacy, Sina Fathi-Kazerooni Dec 2020

Countering Internet Packet Classifiers To Improve User Online Privacy, Sina Fathi-Kazerooni

Dissertations

Internet traffic classification or packet classification is the act of classifying packets using the extracted statistical data from the transmitted packets on a computer network. Internet traffic classification is an essential tool for Internet service providers to manage network traffic, provide users with the intended quality of service (QoS), and perform surveillance. QoS measures prioritize a network's traffic type over other traffic based on preset criteria; for instance, it gives higher priority or bandwidth to video traffic over website browsing traffic. Internet packet classification methods are also used for automated intrusion detection. They analyze incoming traffic patterns and identify malicious …


Leveraging The Inductive Bias Of Large Language Models For Abstract Textual Reasoning, Christopher Michael Rytting Dec 2020

Leveraging The Inductive Bias Of Large Language Models For Abstract Textual Reasoning, Christopher Michael Rytting

Theses and Dissertations

Large natural language models (such as GPT-2 or T5) demonstrate impressive abilities across a range of general NLP tasks. Here, we show that the knowledge embedded in such models provides a useful inductive bias, not just on traditional NLP tasks, but also in the nontraditional task of training a symbolic reasoning engine. We observe that these engines learn quickly and generalize in a natural way that reflects human intuition. For example, training such a system to model block-stacking might naturally generalize to stacking other types of objects because of structure in the real world that has been partially captured by …


A Comparative Study On Statistical And Machine Learning Forecasting Methods For An Fmcg Company, Zenah Yaser Alzubaidi Dec 2020

A Comparative Study On Statistical And Machine Learning Forecasting Methods For An Fmcg Company, Zenah Yaser Alzubaidi

Theses

Demand forecasting has been an area of study among scholars and businessmen ever since the start of the industrial revolution and has only gained focus in recent years with the advancements in AI. Accurate forecasts are no longer a luxury, but a necessity to have for effective decisions made in planning production and marketing. Many aspects of the business depend on demand, and this is particularly true for the Fast-Moving Consumer Goods industry where the high volume and demand volatility poses a challenge for planners to generate accurate forecasts as consumer demand complexity rises. Inaccurate demand forecasts lead to multiple …


Improving A Wireless Localization System Via Machine Learning Techniques And Security Protocols, Zachary Yorio Dec 2020

Improving A Wireless Localization System Via Machine Learning Techniques And Security Protocols, Zachary Yorio

Masters Theses, 2020-current

The recent advancements made in Internet of Things (IoT) devices have brought forth new opportunities for technologies and systems to be integrated into our everyday life. In this work, we investigate how edge nodes can effectively utilize 802.11 wireless beacon frames being broadcast from pre-existing access points in a building to achieve room-level localization. We explain the needed hardware and software for this system and demonstrate a proof of concept with experimental data analysis. Improvements to localization accuracy are shown via machine learning by implementing the random forest algorithm. Using this algorithm, historical data can train the model and make …


Reasoning About User Feedback Under Identity Uncertainty In Knowledge Base Construction, Ariel Kobren Dec 2020

Reasoning About User Feedback Under Identity Uncertainty In Knowledge Base Construction, Ariel Kobren

Doctoral Dissertations

Intelligent, automated systems that are intertwined with everyday life---such as Google Search and virtual assistants like Amazon’s Alexa or Apple’s Siri---are often powered in part by knowledge bases (KBs), i.e., structured data repositories of entities, their attributes, and the relationships among them. Despite a wealth of research focused on automated KB construction methods, KBs are inevitably imperfect, with errors stemming from various points in the construction pipeline. Making matters more challenging, new data is created daily and must be integrated with existing KBs so that they remain up-to-date. As the primary consumers of KBs, human users have tremendous potential to …


Intelligent Networks For High Performance Computing, William Whitney Schonbein Dec 2020

Intelligent Networks For High Performance Computing, William Whitney Schonbein

Computer Science ETDs

There exists a resurgence of interest in `smart' network interfaces that can operate on data as it flows through a network. However, while smart capabilities have been expanding, what they can do for high-performance computing (HPC) is not well-understood. In this work, we advance our understanding of the capabilities and contributions of smart network interfaces to HPC. First, we show current offloaded message demultiplexing can mitigate (but not eliminate) overheads incurred by multithreaded communication. Second, we demonstrate current offloaded capabilities can be leveraged to provide Turing complete program execution on the interface. We elaborate with a framework for offloading arbitrary …


Forecasting Bitcoin Prices Using N-Beats Deep Learning Architecture, Alikhan Bulatov Dec 2020

Forecasting Bitcoin Prices Using N-Beats Deep Learning Architecture, Alikhan Bulatov

Student Theses

The use of computationally intensive systems that employ machine learning algorithms is increasingly common in the field of finance. New state of the art deep learning architectures for time series forecasting are being developed each year making them more accurate than ever. This study evaluates the predictive power of the N-BEATS deep learning architecture trained on Bitcoin daily, hourly, and up-to-the-minute data in comparison with other popular time series forecasting methods such as LSTM and ARIMA. Prediction errors are measured with Mean Average Percentage Error (MAPE), and Root Mean Squared Error (RMSE). The results suggest that the developed N-BEATS model …


Metarec: Meta-Learning Meets Recommendation Systems, James Le Dec 2020

Metarec: Meta-Learning Meets Recommendation Systems, James Le

Theses

Artificial neural networks (ANNs) have recently received increasing attention as powerful modeling tools to improve the performance of recommendation systems. Meta-learning, on the other hand, is a paradigm that has re-surged in popularity within the broader machine learning community over the past several years. In this thesis, we will explore the intersection of these two domains and work on developing methods for integrating meta-learning to design more accurate and flexible recommendation systems.

In the present work, we propose a meta-learning framework for the design of collaborative filtering methods in recommendation systems, drawing from ideas, models, and solutions from modern approaches …


The Role Of Ai & Big Data In Habit Formation, Jingyu Cao Dec 2020

The Role Of Ai & Big Data In Habit Formation, Jingyu Cao

Theses

Forming habits are not easy for everyone. It requires professional methods and strong perseverance, which people usually feel hard to do by themself. However, people are eager to form good habits to have a better life.

This study aims to determine how AI & big data could help people to form habits. There are many applications on the market that already use this method to study user behavior in order to provide better service. My research has focused on how to conduct the personal plan and its effects on the action.

In this context, Marvelous is defined as the AI …


Clustered Hyperspectral Target Detection, Sean Onufer Stalley Dec 2020

Clustered Hyperspectral Target Detection, Sean Onufer Stalley

Dissertations and Theses

Aerial target detection is often used to search for relatively small things over large areas of land. Depending on the size and signature of the target, detection can be a very easy or very difficult task. By capturing images with several hundred color channels, hyperspectral sensors provide a new way of looking at this task, both literally and figuratively. Hyperspectral sensors can be used in many aerial target detection tasks such as identifying unhealthy trees in a forest, searching for minerals at a mining site, or finding the sources of chemical leaks at a factory. The high spectral resolution of …


Machine Learning Based Applications For Data Visualization, Modeling, Control, And Optimization For Chemical And Biological Systems, Yan Ma Dec 2020

Machine Learning Based Applications For Data Visualization, Modeling, Control, And Optimization For Chemical And Biological Systems, Yan Ma

LSU Doctoral Dissertations

This dissertation report covers Yan Ma’s Ph.D. research with applicational studies of machine learning in manufacturing and biological systems. The research work mainly focuses on reaction modeling, optimization, and control using a deep learning-based approaches, and the work mainly concentrates on deep reinforcement learning (DRL). Yan Ma’s research also involves with data mining with bioinformatics. Large-scale data obtained in RNA-seq is analyzed using non-linear dimensionality reduction with Principal Component Analysis (PCA), t-Distributed Stochastic Neighbor Embedding (t-SNE), and Uniform Manifold Approximation and Projection (UMAP), followed by clustering analysis using k-Means and Hierarchical Density-Based Spatial Clustering with Noise (HDBSCAN). This report focuses …


An Investigation Of Grammar Gender-Bias Correction For Google Translate When Translating From English To French, Ahmed Samy Merah Dec 2020

An Investigation Of Grammar Gender-Bias Correction For Google Translate When Translating From English To French, Ahmed Samy Merah

Student Theses

This work investigated how to address the Google Translate's gender-bias when translating from English to French. The developed solution is called GT gender-bias corrector that was built based on combining natural language processing and machine learning methods. The natural language processing was used to analyze the original sentences and their translations grammatically identifying parts of speech. The parts of speech analysis facilitated the identification of three patterns that are associated with the gender bias of Google Translate when translating from English to French. The three patterns were labeled simple, intermediate and complex to reflect the structure complexity. Samples of texts …


Using Machine Learning To Regulate Intensity Of Immersion Therapy Treatment Of Phobias Through Vital Feedback, Mark Beauchamp Dec 2020

Using Machine Learning To Regulate Intensity Of Immersion Therapy Treatment Of Phobias Through Vital Feedback, Mark Beauchamp

Student Theses

The treatment of acrophobia has been trying to keep up with newer technology with the incorporation of virtual reality for exposure therapy, but that approach still lacks automation and still leaves a good portion for human error. The proposed method introduced in this paper is that a machine learning model could replace the need for continuous human intervention. With a few different models of bridges and buildings and the ability for a machine learning model to dynamically alter the height of these building we could theoretically put the patient in the exact situation that will maximize the efficiency of their …


Reducing Body Contact Using Smart Mobile App And Machine Learning Soltutions, Rashed Saeed Abdulrahman Shaliya Dec 2020

Reducing Body Contact Using Smart Mobile App And Machine Learning Soltutions, Rashed Saeed Abdulrahman Shaliya

Theses

The physical contact or the daily body interaction with people by shaking hands, using electronic and payment cards or touching objects such as devices, pens, access cards and gates, all these habits increase the proportion of spreading microbes, viruses and spread diseases among the people all over the world. This project illustrates how body contact can lead to a global disaster by spreading dangerous diseases and deadly viruses among people because of their daily dealings and routine. Analytical techniques were used to explore the relevant data and visualize how body contact increases the infection of a disease to become a …


Automated Intelligent Cueing Device To Improve Ambient Gait Behaviors For Patients With Parkinson's Disease, Nader Naghavi Dec 2020

Automated Intelligent Cueing Device To Improve Ambient Gait Behaviors For Patients With Parkinson's Disease, Nader Naghavi

Doctoral Dissertations

Freezing of gait (FoG) is a common motor dysfunction in individuals with Parkinson’s disease (PD). FoG impairs walking and is associated with increased fall risk. Although pharmacological treatments have shown promise during ON-medication periods, FoG remains difficult to treat during medication OFF state and in advanced stages of the disease. External cueing therapy in the forms of visual, auditory, and vibrotactile, has been effective in treating gait deviations. Intelligent (or on-demand) cueing devices are novel systems that analyze gait patterns in real-time and activate cues only at moments when specific gait alterations are detected. In this study we developed methods …


Fire Code Violation Detection, Salim Elewa Dec 2020

Fire Code Violation Detection, Salim Elewa

Student Theses

his paper explores the creation of an object detection system for mobile using YOLO(You Only Look Once) algorithm., a real-time object detection model that is developed to run on a portable device such as a cellphone that does not have a Graphics Processing Unit (GPU). This algorithm is utilized to detect fire code violations, specifically the obstructed door in a fire separation: the areas surround- ing the door opening shall be kept clear of anything that would be likely to ob- struct. The machine learning algorithm utilized has been fine-tuned to fit the model based on accuracy levels. The author …


Fall Detection Using Neural Networks, Warren Zajac Dec 2020

Fall Detection Using Neural Networks, Warren Zajac

Student Theses

Falls inside of the home is a major concern facing the aging population. Monitoring the home environment to detect a fall can prevent profound consequences due to delayed emergency response. One option to monitor a home environment is to use a camera-based fall detection system. Conceptual designs vary from 3D positional monitoring (multi-camera monitoring) to body position and limb speed classification. Research shows varying degree of success with such concepts when designed with multi-camera setup. However, camera-based systems are inherently intrusive and costly to implement. In this research, we use a sound-based system to detect fall events. Acoustic sensors are …


A Targeted Adversarial Attack On Support Vector Machine Using The Boundary Line, Yessenia Rodriguez Dec 2020

A Targeted Adversarial Attack On Support Vector Machine Using The Boundary Line, Yessenia Rodriguez

Theses and Dissertations

In this thesis, a targeted adversarial attack is explored on a Support Vector Machine (SVM). SVM is defined by creating a separating boundary between two classes. Using a target class, any input can be modified to cross the “boundary line,” making the model predict the target class. To limit the modification, a percentage of an image of the target class is used to get several random sections. Using these sections, the input will be moved in small steps closer to the boundary point. The section that took the least number of steps to cause the model to predict the target …


Nature-Inspired Topology Optimization Of Recurrent Neural Networks, Abdelrahman A. Elsaid Dec 2020

Nature-Inspired Topology Optimization Of Recurrent Neural Networks, Abdelrahman A. Elsaid

Theses

Hand-crafting effective and efficient structures for recurrent neural networks (RNNs) is a difficult, expensive, and time-consuming process. To address this challenge, this work presents three nature-inspired (NI) algorithms for neural architecture search (NAS), introducing the subfield of nature-inspired neural architecture search (NI-NAS). These algorithms, based on ant colony optimization (ACO), progress from memory cell structure optimization, to bounded discrete-space architecture optimization, and finally to unbounded continuous-space architecture optimization. These methods were applied to real-world data sets representing challenging engineering problems, such as data from a coal-fired power plant, wind-turbine power generators, and aircraft flight data recorder (FDR) data.

Initial work …


Unifying Chemistry And Machine Learning For The Study Of Noncovalent Interactions, Jacob A. Townsend Dec 2020

Unifying Chemistry And Machine Learning For The Study Of Noncovalent Interactions, Jacob A. Townsend

Doctoral Dissertations

Gas separations are in great demand for carbon emission reduction, natural gas purification, oxygen isolation, and much more. Many of these separations rely on cost-prohibitive methods such as cryogenic distillation or strong-binding solvents. As a result, novel materials are being developed to subvert the energetic expense of gas separation processes. These studies focus on improving the performance of alternative materials, including (but not limited to) metal-organic frameworks, covalent organic frameworks, dense polymeric membranes, porous polymers, and ionic liquids.

In this work, the atomistic effects of functional units are explored for gas separations processes using electronic structure theory and machine learning. …


Price Prediction And Valuation Using Data Mining In Dubai Real Estate Market, Abdulla Alhathboor Dec 2020

Price Prediction And Valuation Using Data Mining In Dubai Real Estate Market, Abdulla Alhathboor

Theses

The purpose of this study is to find out the impact of data mining in predicting prices and values of real estate units in the Dubai real estate market. This market has always been one of the biggest markets in the economy of any nation worldwide and has always been considered one of the biggest indicators on the health of any economy. After the devastating crash of the world economy in 2008, many real estate projects were halted and economies are still recovering from that incident. Real estate brokers and agents found it difficult to sell any property during that …


Application Of Machine Learning Models: Stock Price Forecasting Of The Philippines' Top Six Conglomerates, Anthony Rey Llanos Dec 2020

Application Of Machine Learning Models: Stock Price Forecasting Of The Philippines' Top Six Conglomerates, Anthony Rey Llanos

Theses

The advent of digital age dramatically changed the way all aspects of commerce is conducted. From the largest multi-national conglomerates to the least small-and-medium enterprises and to the unassuming business-savvy individuals have adapted to take advantage of the benefits afforded by the resulting digital technology. Investing in, and profiting from, shares of stocks of companies listed in an organized stock exchange is one such instance. Gone are the days wherein stock investors and brokers are inseparable from their telephones to handle trades. Online platforms, powered by machine learning algorithms, have made investing in stocks not only accessible and convenient but …


Acquisition, Processing, And Analysis Of Video, Audio And Meteorological Data In Multi-Sensor Electronic Beehive Monitoring, Sarbajit Mukherjee Dec 2020

Acquisition, Processing, And Analysis Of Video, Audio And Meteorological Data In Multi-Sensor Electronic Beehive Monitoring, Sarbajit Mukherjee

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

In recent years, a widespread decline has been seen in honey bee population and this is widely attributed to colony collapse disorder. Hence, it is of utmost importance that a system is designed to gather relevant information. This will allow for a deeper understanding of the possible reasons behind the above phenomenon to aid in the design of suitable countermeasures.

Electronic Beehive Monitoring is one such way of gathering critical information regarding a colony’s health and behavior without invasive beehive inspections. In this dissertation, we have presented an electronic beehive monitoring system called BeePi that can be placed on top …


Deep Q Learning Applied To Stock Trading, Agnibh Dasgupta Dec 2020

Deep Q Learning Applied To Stock Trading, Agnibh Dasgupta

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Developing a strategy for stock trading is a vital task for investors. However, it is challenging to obtain an optimal strategy, given the complex and dynamic nature of the stock market. This thesis aims to explore the applications of Reinforcement Learning with the goal of maximizing returns from market investment, keeping in mind the human aspect of trading by utilizing stock prices represented as candlestick graphs. Furthermore, the algorithm studies public interest patterns in form of graphs extracted from Google Trends to make predictions. Deep Q learning has been used to train an agent based on fused images of stock …


Hierarchical Aggregation Of Multidimensional Data For Efficient Data Mining, Safaa Khalil Alwajidi Dec 2020

Hierarchical Aggregation Of Multidimensional Data For Efficient Data Mining, Safaa Khalil Alwajidi

Dissertations

Big data analysis is essential for many smart applications in areas such as connected healthcare, intelligent transportation, human activity recognition, environment, and climate change monitoring. Traditional data mining algorithms do not scale well to big data due to the enormous number of data points and the velocity of their generation. Mining and learning from big data need time and memory efficiency techniques, albeit the cost of possible loss in accuracy. This research focuses on the mining of big data using aggregated data as input. We developed a data structure that is to be used to aggregate data at multiple resolutions. …


How Negative Sampling Provides Class Balance To Rare Event Case Data Using A Vehicular Accident Prediction Project As A Use Case Scenario, Jeremy Roland Dec 2020

How Negative Sampling Provides Class Balance To Rare Event Case Data Using A Vehicular Accident Prediction Project As A Use Case Scenario, Jeremy Roland

Masters Theses and Doctoral Dissertations

Rare event case data occur at such an infrequent rate that even having high amounts of it can leave researchers starving for more information. There has always existed a tug and pull relationship among rare event case data, where a higher count of entries often leads to a lack of explanatory variables, and vice versa. In the research spectrum of rare event case probability prediction, several methods of data sampling exist to remedy the main issue of rare event case data: a lack of data to collect and learn from. The most effective methods often involve altering the distribution of …


Modified-Half-Normal Distribution And Different Methods To Estimate Average Treatment Effect., Jingchao Sun Dec 2020

Modified-Half-Normal Distribution And Different Methods To Estimate Average Treatment Effect., Jingchao Sun

Electronic Theses and Dissertations

This dissertation consists of three projects related to Modified-Half-Normal distribution and causal inference. In my first project, a new distribution called Modified-Half-Normal distribution was introduced. I explored a few of its distributional properties, the procedures for generating random samples based on Bayesian approaches, and the parameter estimation based on the method of moments. The second project deals with the problem of selection bias of average treatment effect (ATE) if we use the observational data. I combined the propensity score based inverse probability of treatment weighting (IPTW) method and the directed acyclic graph (DAG) to solve this problem. The third project …


In The Margins: Reconsidering The Range And Contribution Of Diazotrophs In Nearshore Environments, Corday R. Selden Dec 2020

In The Margins: Reconsidering The Range And Contribution Of Diazotrophs In Nearshore Environments, Corday R. Selden

OES Theses and Dissertations

Dinitrogen (N2) fixation enables primary production and, consequently, carbon dioxide drawdown in nitrogen (N) limited marine systems, exerting a powerful influence over the coupled carbon and N cycles. Our understanding of the environmental factors regulating its distribution and magnitude are largely based on the range and sensitivity of one genus, Trichodesmium. However, recent work suggests that the niche preferences of distinct diazotrophic (N2 fixing) clades differ due to their metabolic and ecological diversity, hampering efforts to close the N budget and model N2 fixation accurately. Here, I explore the range of N2 fixation …


Enhanced Traffic Incident Analysis With Advanced Machine Learning Algorithms, Zhenyu Wang Dec 2020

Enhanced Traffic Incident Analysis With Advanced Machine Learning Algorithms, Zhenyu Wang

Computational Modeling & Simulation Engineering Theses & Dissertations

Traffic incident analysis is a crucial task in traffic management centers (TMCs) that typically manage many highways with limited staff and resources. An effective automatic incident analysis approach that can report abnormal events timely and accurately will benefit TMCs in optimizing the use of limited incident response and management resources. During the past decades, significant efforts have been made by researchers towards the development of data-driven approaches for incident analysis. Nevertheless, many developed approaches have shown limited success in the field. This is largely attributed to the long detection time (i.e., waiting for overwhelmed upstream detection stations; meanwhile, downstream stations …


Unsupervised Structural Graph Node Representation Learning, Mikel Joaristi Dec 2020

Unsupervised Structural Graph Node Representation Learning, Mikel Joaristi

Boise State University Theses and Dissertations

Unsupervised Graph Representation Learning methods learn a numerical representation of the nodes in a graph. The generated representations encode meaningful information about the nodes' properties, making them a powerful tool for tasks in many areas of study, such as social sciences, biology or communication networks. These methods are particularly interesting because they facilitate the direct use of standard Machine Learning models on graphs. Graph representation learning methods can be divided into two main categories depending on the information they encode, methods preserving the nodes connectivity information, and methods preserving nodes' structural information. Connectivity-based methods focus on encoding relationships between nodes, …