Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Theses/Dissertations

Machine Learning

2023

Discipline
Institution
Publication

Articles 1 - 30 of 91

Full-Text Articles in Physical Sciences and Mathematics

Smart Applications And Resource Management In Internet Of Things, Zeinab Akhavan Dec 2023

Smart Applications And Resource Management In Internet Of Things, Zeinab Akhavan

Computer Science ETDs

Internet of Things (IoT) technologies are currently the principal solutions driving smart cities. These new technologies such as Cyber Physical Systems, 5G and data analytic have emerged to address various cities' infrastructure issues ranging from transportation and energy management to healthcare systems. An IoT setting primarily consists of a wide range of users and devices as a massive network interacting with different layers of the city infrastructure resulting in generating sheer volume of data to enable smart city services. The goal of smart city services is to create value for the entire ecosystem, whether this is health, education, transportation, energy, …


Study Of Augmentations On Historical Manuscripts Using Trocr, Erez Meoded Dec 2023

Study Of Augmentations On Historical Manuscripts Using Trocr, Erez Meoded

Theses and Dissertations

Historical manuscripts are an essential source of original content. For many reasons, it is hard to recognize these manuscripts as text. This thesis used a state-of-the-art Handwritten Text Recognizer, TrOCR, to recognize a 16th-century manuscript. TrOCR uses a vision transformer to encode the input images and a language transformer to decode them back to text. We showed that carefully preprocessed images and designed augmentations can improve the performance of TrOCR. We suggest an ensemble of augmented models to achieve an even better performance.


Generalized Differentiable Neural Architecture Search With Performance And Stability Improvements, Emily J. Herron Dec 2023

Generalized Differentiable Neural Architecture Search With Performance And Stability Improvements, Emily J. Herron

Doctoral Dissertations

This work introduces improvements to the stability and generalizability of Cyclic DARTS (CDARTS). CDARTS is a Differentiable Architecture Search (DARTS)-based approach to neural architecture search (NAS) that uses a cyclic feedback mechanism to train search and evaluation networks concurrently, thereby optimizing the search process by enforcing that the networks produce similar outputs. However, the dissimilarity between the loss functions used by the evaluation networks during the search and retraining phases results in a search-phase evaluation network, a sub-optimal proxy for the final evaluation network utilized during retraining. ICDARTS, a revised algorithm that reformulates the search phase loss functions to ensure …


Decoding Usage And Adoption Behavior Of The Low-Carbon Transportation Market: An Ai-Driven Exploration, Vuban Chowdhury Dec 2023

Decoding Usage And Adoption Behavior Of The Low-Carbon Transportation Market: An Ai-Driven Exploration, Vuban Chowdhury

Graduate Theses and Dissertations

The transportation sector stands as a significant contributor to greenhouse gas emissions in the United States, with its environmental impact steadily escalating over the past few decades. This has prompted government agencies to facilitate the adoption and usage of low-carbon transportation (LCT) options as alternatives to fossil-fuel-powered transportation. LCTs include modes of transportation that minimize the overall carbon footprint of the transportation sector by relying on energy sources that are environmentally sustainable. These sustainable transportation options have also garnered significant interest in the transportation research community. For government agencies and researchers alike, a comprehensive understanding of the adoption and usage …


Integrating Machine Learning Methods For Medical Diagnosis, Jazmin Quezada Dec 2023

Integrating Machine Learning Methods For Medical Diagnosis, Jazmin Quezada

Open Access Theses & Dissertations

Abstract:The rapid advancement of machine learning techniques has revolutionized the field of medical diagnosis by offering powerful tools to analyze complex data sets and make accurate predictions. In this proposed method, we present a novel approach that integrates machine learning and optimization models to enhance the accuracy of medical diagnoses. Our method focuses on fine-tuning and optimizing the parameters of machine learning algorithms commonly used in medical diagnosis, such as logistic regression, support vector machines, and neural networks. By employing optimization techniques, we systematically explore the parameter space of these algorithms to discover the most optimal configurations. Moreover, by representing …


A Design Strategy To Improve Machine Learning Resiliency Of Physically Unclonable Functions Using Modulus Process, Yuqiu Jiang Dec 2023

A Design Strategy To Improve Machine Learning Resiliency Of Physically Unclonable Functions Using Modulus Process, Yuqiu Jiang

Theses and Dissertations

Physically unclonable functions (PUFs) are hardware security primitives that utilize non-reproducible manufacturing variations to provide device-specific challenge-response pairs (CRPs). Such primitives are desirable for applications such as communication and intellectual property protection. PUFs have been gaining considerable interest from both the academic and industrial communities because of their simplicity and stability. However, many recent studies have exposed PUFs to machine-learning (ML) modeling attacks. To improve the resilience of a system to general ML attacks instead of a specific ML technique, a common solution is to improve the complexity of the system. Structures, such as XOR-PUFs, can significantly increase the nonlinearity …


Towards Explaining Neural Networks: Tools For Visualizing Activations And Parameters, Juan Puebla Dec 2023

Towards Explaining Neural Networks: Tools For Visualizing Activations And Parameters, Juan Puebla

Open Access Theses & Dissertations

There is a growing number of applications using neural networks for making decisions. However, there is a general lack of understanding of how neural networks work. Neural networks have even been described as black boxes which has led to a lack of trust in artificially intelligent programs. To remedy this, explainable artificial intelligence has risen as a means to validate the decision-making processes and the results of computer programs that use artificial intelligence. The work in this masterâ??s thesis is our contribution to explainable artificial intelligence, focusing on neural networks with the goal of helping users make more sense of …


Towards Long-Term Fairness In Sequential Decision Making, Yaowei Hu Dec 2023

Towards Long-Term Fairness In Sequential Decision Making, Yaowei Hu

Graduate Theses and Dissertations

With the development of artificial intelligence, automated decision-making systems are increasingly integrated into various applications, such as hiring, loans, education, recommendation systems, and more. These machine learning algorithms are expected to facilitate faster, more accurate, and impartial decision-making compared to human judgments. Nevertheless, these expectations are not always met in practice due to biased training data, leading to discriminatory outcomes. In contemporary society, countering discrimination has become a consensus among people, leading the EU and the US to enact laws and regulations that prohibit discrimination based on factors such as gender, age, race, and religion. Consequently, addressing algorithmic discrimination has …


Generative Adversarial Game With Tailored Quantum Feature Maps For Enhanced Classification, Anais Sandra Nguemto Guiawa Dec 2023

Generative Adversarial Game With Tailored Quantum Feature Maps For Enhanced Classification, Anais Sandra Nguemto Guiawa

Doctoral Dissertations

In the burgeoning field of quantum machine learning, the fusion of quantum computing and machine learning methodologies has sparked immense interest, particularly with the emergence of noisy intermediate-scale quantum (NISQ) devices. These devices hold the promise of achieving quantum advantage, but they grapple with limitations like constrained qubit counts, limited connectivity, operational noise, and a restricted set of operations. These challenges necessitate a strategic and deliberate approach to crafting effective quantum machine learning algorithms.

This dissertation revolves around an exploration of these challenges, presenting innovative strategies that tailor quantum algorithms and processes to seamlessly integrate with commercial quantum platforms. A …


Context-Aware Temporal Embeddings For Text And Video Data, Ahnaf Farhan Dec 2023

Context-Aware Temporal Embeddings For Text And Video Data, Ahnaf Farhan

Open Access Theses & Dissertations

Recent years have seen an exponential increase in unstructured data, primarily in the form of text, images, and videos. Extracting useful features and trends from large-scale unstructured datasets -- such as news outlets, scientific papers, and videos like security cameras or body cam recordings -- is faced with substantial challenges of volume, scalability, complexity, and semantic understanding. In analyzing trends, comprehending the temporal context is vital for uncovering patterns and narratives that are not apparent from a single video frame or text document. Despite its importance, many existing data mining and machine learning approaches overlook extracting evolutionary contextual features in …


General Population Projection Model With Census Population Data, Takenori Tsuruga Dec 2023

General Population Projection Model With Census Population Data, Takenori Tsuruga

Electronic Theses, Projects, and Dissertations

The US Census Bureau offers a wide range of data, and within this array, the American Community Survey 5-Year Estimate (ACS5) serves as a valuable resource for understanding the US population. This project embarks on an exploration of Machine Learning and the Software Development process with the goal of generating effective population projections from ACS5 data. The project aims to provide methods to make predictions for every city and town in the US, encompassing their total population and population divided into 5-year age groups. It's worth noting that while the generation of these projections is grounded in the generalized statistical …


Hypothyroid Disease Analysis By Using Machine Learning, Sanjana Seelam Dec 2023

Hypothyroid Disease Analysis By Using Machine Learning, Sanjana Seelam

Electronic Theses, Projects, and Dissertations

Thyroid illness frequently manifests as hypothyroidism. It is evident that people with hypothyroidism are primarily female. Because the majority of people are unaware of the illness, it is quickly becoming more serious. It is crucial to catch it early on so that medical professionals can treat it more effectively and prevent it from getting worse. Machine learning illness prediction is a challenging task. Disease prediction is aided greatly by machine learning. Once more, unique feature selection strategies have made the process of disease assumption and prediction easier. To properly monitor and cure this illness, accurate detection is essential. In order …


Analysis Of Student Behavior And Score Prediction In Assistments Online Learning, Aswani Yaramala Dec 2023

Analysis Of Student Behavior And Score Prediction In Assistments Online Learning, Aswani Yaramala

All Graduate Theses and Dissertations, Fall 2023 to Present

Understanding and analyzing student behavior is paramount in enhancing online learning, and this thesis delves into the subject by presenting an in-depth analysis of student behavior and score prediction in the ASSISTments online learning platform. We used data from the EDM Cup 2023 Kaggle Competition to answer four key questions. First, we explored how students seeking hints and explanations affect their performance in assignments, shedding light on the role of guidance in learning. Second, we looked at the connection between students mastering specific skills and their performance in related assignments, giving insights into the effectiveness of curriculum alignment. Third, we …


Ai Assisted Workflows For Computational Electromagnetics And Antenna Design, Oameed Noakoasteen Nov 2023

Ai Assisted Workflows For Computational Electromagnetics And Antenna Design, Oameed Noakoasteen

Electrical and Computer Engineering ETDs

These days large volumes of data can be recorded and manipulated with relative ease. If valuable information can be extracted from them, these vast amounts of data can be a rich resource not just for the digital economy but also for scientific discovery and development of technology. When it comes to deriving valuable information from data, Machine Learning (ML) emerges as the key solution. To unlock the potential benefits of ML to science and technology, extensive research is needed to explore what algorithms are suitable and how they can be applied.

To shine light on various ways that ML can …


Bayesian Structural Causal Inference With Probabilistic Programming, Sam A. Witty Nov 2023

Bayesian Structural Causal Inference With Probabilistic Programming, Sam A. Witty

Doctoral Dissertations

Reasoning about causal relationships is central to the human experience. This evokes a natural question in our pursuit of human-like artificial intelligence: how might we imbue intelligent systems with similar causal reasoning capabilities? Better yet, how might we imbue intelligent systems with the ability to learn cause and effect relationships from observation and experimentation? Unfortunately, reasoning about cause and effect requires more than just data: it also requires partial knowledge about data generating mechanisms. Given this need, our task then as computational scientists is to design data structures for representing partial causal knowledge, and algorithms for updating that knowledge in …


Machine Learning Modeling Of Polymer Coating Formulations: Benchmark Of Feature Representation Schemes, Nelson I. Evbarunegbe Nov 2023

Machine Learning Modeling Of Polymer Coating Formulations: Benchmark Of Feature Representation Schemes, Nelson I. Evbarunegbe

Masters Theses

Polymer coatings offer a wide range of benefits across various industries, playing a crucial role in product protection and extension of shelf life. However, formulating them can be a non-trivial task given the multitude of variables and factors involved in the production process, rendering it a complex, high-dimensional problem. To tackle this problem, machine learning (ML) has emerged as a promising tool, showing considerable potential in enhancing various polymer and chemistry-based applications, particularly those dealing with high dimensional complexities.

Our research aims to develop a physics-guided ML approach to facilitate the formulations of polymer coatings. As the first step, this …


Deciphering Trends And Tactics: Data-Driven Techniques For Forecasting Information Spread And Detecting Coordinated Campaigns In Social Media, Kin Wai Ng Lugo Nov 2023

Deciphering Trends And Tactics: Data-Driven Techniques For Forecasting Information Spread And Detecting Coordinated Campaigns In Social Media, Kin Wai Ng Lugo

USF Tampa Graduate Theses and Dissertations

The main objective of this dissertation is to develop models that predict and investigate the spread of information in social media over time. In this context, we consider topics of discussions as the information that spreads. Thus, we are interested in forecasting the number of messages per day in a future interval of time. We take a data-driven approach, in which we compare our results with real datasets from a multitude of socio-political contexts and from multiple social media platforms, specifically, Twitter and YouTube.

We identified a number of challenges related to forecasting social media time series per topic. First, …


Optimization And Application Of Graph Neural Networks, Shuo Zhang Sep 2023

Optimization And Application Of Graph Neural Networks, Shuo Zhang

Dissertations, Theses, and Capstone Projects

Graph Neural Networks (GNNs) are widely recognized for their potential in learning from graph-structured data and solving complex problems. However, optimal performance and applicability of GNNs have been an open-ended challenge. This dissertation presents a series of substantial advances addressing this problem. First, we investigate attention-based GNNs, revealing a critical shortcoming: their ignorance of cardinality information that impacts their discriminative power. To rectify this, we propose Cardinality Preserved Attention (CPA) models that can be applied to any attention-based GNNs, which exhibit a marked improvement in performance. Next, we introduce the Directional Node Pair (DNP) descriptor and the Robust Molecular Graph …


Data-Driven 2d Materials Discovery For Next-Generation Electronics, Zeyu Zhang Aug 2023

Data-Driven 2d Materials Discovery For Next-Generation Electronics, Zeyu Zhang

Dissertations

The development of material discovery and design has lasted centuries in human history. After the concept of modern chemistry and material science was established, the strategy of material discovery relies on the experiments. Such a strategy becomes expensive and time-consuming with the increasing number of materials nowadays. Therefore, a novel strategy that is faster and more comprehensive is urgently needed. In this dissertation, an experiment-guided material discovery strategy is developed and explained using metal-organic frameworks (MOFs) as instances. The advent of 7r-stacked layered MOFs, which offer electrical conductivity on top of permanent porosity and high surface area, opened up new …


Global Cyber Attack Forecast Using Ai Techniques, Nusrat Kabir Samia Aug 2023

Global Cyber Attack Forecast Using Ai Techniques, Nusrat Kabir Samia

Electronic Thesis and Dissertation Repository

The advancement of internet technology and growing involvement in the cyber world have made us prone to cyber-attacks inducing severe damage to individuals and organizations, including financial loss, identity theft, and reputational damage. The rapid emergence and evolution of new networks and new opportunities for businesses and technologies are increasing threats to security vulnerabilities. Hence cyber-crime analysis is one of the wide range applications of Data Mining that can be eventually used to predict and detect crime. However, there are several constraints while analyzing cyber-attacks, which are yet to be resolved for more accurate cyber security inspection.

Although there are …


Predicting Network Failures With Ai Techniques, Chandrika Saha Aug 2023

Predicting Network Failures With Ai Techniques, Chandrika Saha

Electronic Thesis and Dissertation Repository

Network failure is the unintentional interruption of internet services, resulting in widespread client frustration. It is especially true for time-sensitive services in the healthcare industry, smart grid control, and mobility control, among others. In addition, the COVID-19 pandemic has compelled many businesses to operate remotely, making uninterrupted internet access essential. Moreover, Internet Service Providers (ISPs) lose millions of dollars annually due to network failure, which has a negative impact on their businesses. Currently, redundant network equipment is used as a restoration technique to resolve this issue of network failure. This technique requires a strategy for failure identification and prediction to …


Towards Automated Mineral Identification In Martian Rocks From X-Ray Diffraction Patterns, Luke Tambakis Aug 2023

Towards Automated Mineral Identification In Martian Rocks From X-Ray Diffraction Patterns, Luke Tambakis

Electronic Thesis and Dissertation Repository

The CheMin (Chemistry and Mineralogy) instrument on the Curiosity rover has provided a rich set of X-ray diffraction (XRD) patterns from Martian rocks and regolith. These XRD patterns have allowed geologists to make exciting new discoveries about the mineralogy and the geological history of Mars. These discoveries pave the way for further Martian exploration and provide a deeper understanding of Martian geology. The Curiosity rover is very slow by design, travelling at about 4 cm/s. New, faster rovers are being developed to increase scientific throughput and exploration. XRD is valuable for future missions as it can produce new discov- eries …


Application Of Crystal Engineering In Multicomponent Pharmaceutical Crystals: A Study Of Theory And Practice, Soroush Ahmadi Nasrabadi Aug 2023

Application Of Crystal Engineering In Multicomponent Pharmaceutical Crystals: A Study Of Theory And Practice, Soroush Ahmadi Nasrabadi

Electronic Thesis and Dissertation Repository

Multicomponent crystallization, a prominent strategy in crystal engineering, offers the ability to modify the physicochemical properties of crystals by introducing a secondary component to their lattice structure. Such multicomponent crystals have found widespread application in the pharmaceutical industry. This thesis explores the experimental screening, characterization, application, and theoretical prediction of multicomponent crystals of Active Pharmaceutical Ingredients (APIs).

The first case study investigates a new solvate of Dasatinib which exhibits high instability at room temperature and transforms into a different polymorph upon desolvation. The crystal structure of this compound is obtained, revealing insights into its transient nature and the potential application …


Secondary Features Of Importance For A Url Ranking, Atajan Abdyyev Aug 2023

Secondary Features Of Importance For A Url Ranking, Atajan Abdyyev

Dissertations and Theses

This paper investigates the impact of secondary ranking factors on webpage relevance and rankings in the context of Search Engine Optimization (SEO), focusing on the jewelry domain within the United States e-commerce market. By generating a keyword list related to jewelry and retrieving top URLs from Google's search results, the study employs machine learning models including XGBoost, CatBoost, and Linear Regression to identify key features influencing webpage relevance and rankings.The findings highlight specific optimal ranges for features like Outlinks, Unique Inlinks, Flesch Reading Ease Score, and others, indicating their significant impact on better rankings. Notably, Random Forest model performed best …


An Interval-Valued Random Forests, Paul Gaona Partida Aug 2023

An Interval-Valued Random Forests, Paul Gaona Partida

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

There is a growing demand for the development of new statistical models and the refinement of established methods to accommodate different data structures. This need arises from the recognition that traditional statistics often assume the value of each observation to be precise, which may not hold true in many real-world scenarios. Factors such as the collection process and technological advancements can introduce imprecision and uncertainty into the data.

For example, consider data collected over a long period of time, where newer measurement tools may offer greater accuracy and provide more information than previous methods. In such cases, it becomes crucial …


Stressor: An R Package For Benchmarking Machine Learning Models, Samuel A. Haycock Aug 2023

Stressor: An R Package For Benchmarking Machine Learning Models, Samuel A. Haycock

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Many discipline specific researchers need a way to quickly compare the accuracy of their predictive models to other alternatives. However, many of these researchers are not experienced with multiple programming languages. Python has recently been the leader in machine learning functionality, which includes the PyCaret library that allows users to develop high-performing machine learning models with only a few lines of code. The goal of the stressor package is to help users of the R programming language access the advantages of PyCaret without having to learn Python. This allows the user to leverage R’s powerful data analysis workflows, while simultaneously …


Genetic Programming To Optimize Performance Of Machine Learning Algorithms On Unbalanced Data Set, Asitha Thumpati Aug 2023

Genetic Programming To Optimize Performance Of Machine Learning Algorithms On Unbalanced Data Set, Asitha Thumpati

Electronic Theses, Projects, and Dissertations

Data collected from the real world is often imbalanced, meaning that the distribution of data across known classes is biased or skewed. When using machine learning classification models on such imbalanced data, predictive performance tends to be lower because these models are designed with the assumption of balanced classes or a relatively equal number of instances for each class. To address this issue, we employ data preprocessing techniques such as SMOTE (Synthetic Minority Oversampling Technique) for oversampling data and random undersampling for undersampling data on unbalanced datasets. Once the dataset is balanced, genetic programming is utilized for feature selection to …


On Training Neurons With Bounded Compilations, Lance Kennedy Jul 2023

On Training Neurons With Bounded Compilations, Lance Kennedy

Master of Science in Computer Science Theses

Knowledge compilation offers a formal approach to explaining and verifying the behavior of machine learning systems, such as neural networks. Unfortunately, compiling even an individual neuron into a tractable representation such as an Ordered Binary Decision Diagram (OBDD), is an NP-hard problem. In this thesis, we consider the problem of training a neuron from data, subject to the constraint that it has a compact representation as an OBDD. Our approach is based on the observation that a neuron can be compiled into an OBDD in polytime if (1) the neuron has integer weights, and (2) its aggregate weight is bounded. …


Using Machine Learning Techniques To Model Encoder/Decoder Pair For Non-Invasive Electroencephalographic Wireless Signal Transmission, Ernst Fanfan Jul 2023

Using Machine Learning Techniques To Model Encoder/Decoder Pair For Non-Invasive Electroencephalographic Wireless Signal Transmission, Ernst Fanfan

Master of Science in Computer Science Theses

This study investigated the application and enhancement of Non-Invasive Brain-Computer Interfaces (NI-BCIs), focused on enhancing the efficiency and effectiveness of this technology for individuals with severe physical limitations. The core research goal was to improve current limitations associated with wires, noise, and invasive procedures often associated with BCI technology. The key discussed solution involves developing an optimized Encoder/Decoder (E/D) pair using machine learning techniques, particularly those borrowed from Generative Adversarial Networks (GAN) and other Deep Neural Networks, to minimize data transmission and ensure robustness against data degradation. The study highlighted the crucial role of machine learning in self-adjusting and isolating …


Extending The Convolution In Graph Neural Networks To Solve Materials Science And Node Classification Problems, Steph-Yves Mike Louis Jul 2023

Extending The Convolution In Graph Neural Networks To Solve Materials Science And Node Classification Problems, Steph-Yves Mike Louis

Theses and Dissertations

The usage of graph to represent one's data in machine learning has grown in popularity in both academia and the industry due to its inherent benefits. With its flexible nature and immediate translation to real life observed objects, graph representation had a considerable contribution in advancing the state-of-the-art performance of machine learning in materials.

In this dissertation proposal, we discuss how machines can learn from graph encoded data and provide excellent results through graph neural networks (GNN). Notably, we focus our adaptation of graph neural networks on three tasks: predicting crystal materials properties, nullifying the negative impact of inferior graph …