Open Access. Powered by Scholars. Published by Universities.®

Theses/Dissertations

McKelvey School of Engineering Theses & Dissertations

Discipline
Keyword
Publication Year

Articles 1 - 17 of 17

Full-Text Articles in Artificial Intelligence and Robotics

Feature Selection From Clinical Surveys Using Semantic Textual Similarity, Benjamin Warner May 2023

Feature Selection From Clinical Surveys Using Semantic Textual Similarity, Benjamin Warner

McKelvey School of Engineering Theses & Dissertations

Survey data collected from human subjects can contain a high number of features while having a comparatively low quantity of examples. Machine learning models that attempt to predict outcomes from survey data under these conditions can overfit and result in poor generalizability. One remedy to this issue is feature selection, which attempts to select an optimal subset of features to learn upon. A relatively unexplored source of information in the feature selection process is the usage of textual names of features, which may be semantically indicative of which features are relevant to a target outcome. The relationships between feature names …


Evaluating The Problem Solving Abilities Of Chatgpt, Fankun Zeng May 2023

Evaluating The Problem Solving Abilities Of Chatgpt, Fankun Zeng

McKelvey School of Engineering Theses & Dissertations

This thesis addresses the need for a fair evaluation of language models' problem solving abilities by presenting a unified evaluation framework for ChatGPT on 16 problem solving datasets (e.g., NaturalQA, HellaSwag, MMLU, etc.). We evaluate the model's performance using F1, exact match, and quasi-exact match metrics and find that ChatGPT is highly accurate in solving tasks that require commonsense and knowledge. However, we also identify truncated text bias and few-shot scenarios as challenges that may impact ChatGPT's performance. Our research highlights the importance of standardizing datasets and developing a unified evaluation system for the fair evaluation of language models. Overall, …


Understanding Societal Values Of Chatgpt, Yidan Tang May 2023

Understanding Societal Values Of Chatgpt, Yidan Tang

McKelvey School of Engineering Theses & Dissertations

As Large language models (LLMs) become increasingly pervasive in various domains, it is crucial to ensure that their outputs adhere to societal values and ethical considerations. In this thesis, we investigate the alignment of ChatGPT, a recent state-of-the-art large language model developed by OpenAI, with societal values. Specifically, we define the problem of societal values of LLMs and assemble a representative collection of 7 datasets covering 4 topics related to societal values. In-context learning techniques are applied and appropriate prompts are designed. The performance of each dataset is measured using a standardized evaluation system focused on accuracy. We then display …


Design And Analysis Of Strategic Behavior In Networks, Sixie Yu Aug 2022

Design And Analysis Of Strategic Behavior In Networks, Sixie Yu

McKelvey School of Engineering Theses & Dissertations

Networks permeate every aspect of our social and professional life.A networked system with strategic individuals can represent a variety of real-world scenarios with socioeconomic origins. In such a system, the individuals' utilities are interdependent---one individual's decision influences the decisions of others and vice versa. In order to gain insights into the system, the highly complicated interactions necessitate some level of abstraction. To capture the otherwise complex interactions, I use a game theoretic model called Networked Public Goods (NPG) game. I develop a computational framework based on NPGs to understand strategic individuals' behavior in networked systems. The framework consists of three …


Development Of The Assessment Of Clinical Prediction Model Transportability (Apt) Checklist, Sean Chonghwan Yu Aug 2022

Development Of The Assessment Of Clinical Prediction Model Transportability (Apt) Checklist, Sean Chonghwan Yu

McKelvey School of Engineering Theses & Dissertations

Clinical Prediction Models (CPM) have long been used for Clinical Decision Support (CDS) initially based on simple clinical scoring systems, and increasingly based on complex machine learning models relying on large-scale Electronic Health Record (EHR) data. External implementation – or the application of CPMs on sites where it was not originally developed – is valuable as it reduces the need for redundant de novo CPM development, enables CPM usage by low resource organizations, facilitates external validation studies, and encourages collaborative development of CPMs. Further, adoption of externally developed CPMs has been facilitated by ongoing interoperability efforts in standards, policy, and …


Machine Learning In Complex Scientific Domains: Hospitalization Records, Drug Interactions, Predictive Modeling And Fairness For Class Imbalanced Data, Arghya Datta Aug 2021

Machine Learning In Complex Scientific Domains: Hospitalization Records, Drug Interactions, Predictive Modeling And Fairness For Class Imbalanced Data, Arghya Datta

McKelvey School of Engineering Theses & Dissertations

Machine learning has demonstrated potential in analyzing large, complex datasets and has become ubiquitous across many fields of scientific research. As machine learning is actively deployed in many complex and critical domains, it is essential for machine learning to engage with domain expertise to aid in knowledge discovery as well as address challenges in predictive modeling in complex domains. Domain expertise represents an essential and elaborate collection of knowledge that is often under-utilized when applying machine learning in complex domains. In this dissertation, I have addressed existing challenges regarding knowledge discovery in complex domains via engagement with domain expertise, particularly …


A Neuromorphic Machine Learning Framework Based On The Growth Transform Dynamical System, Ahana Gangopadhyay Aug 2021

A Neuromorphic Machine Learning Framework Based On The Growth Transform Dynamical System, Ahana Gangopadhyay

McKelvey School of Engineering Theses & Dissertations

As computation increasingly moves from the cloud to the source of data collection, there is a growing demand for specialized machine learning algorithms that can perform learning and inference at the edge in energy and resource-constrained environments. In this regard, we can take inspiration from small biological systems like insect brains that exhibit high energy-efficiency within a small form-factor, and show superior cognitive performance using fewer, coarser neural operations (action potentials or spikes) than the high-precision floating-point operations used in deep learning platforms. Attempts at bridging this gap using neuromorphic hardware has produced silicon brains that are orders of magnitude …


Improving Additional Adversarial Robustness For Classification, Michael Guo May 2021

Improving Additional Adversarial Robustness For Classification, Michael Guo

McKelvey School of Engineering Theses & Dissertations

Although neural networks have achieved remarkable success on classification, adversarial robustness is still a significant concern. There are now a series of approaches for designing adversarial examples and methods to defending against them. This paper consists of two projects. In our first work, we propose an approach by leveraging cognitive salience to enhance additional robustness on top of these methods. Specifically, for image classification, we split an image into the foreground (salient region) and background (the rest) and allow significantly larger adversarial perturbations in the background to produce stronger attacks. Furthermore, we show that adversarial training with dual-perturbation attacks yield …


Stochastic Goal Recognition Design, Christabel Wayllace May 2021

Stochastic Goal Recognition Design, Christabel Wayllace

McKelvey School of Engineering Theses & Dissertations

Goal Recognition Design (GRD) is the problem of finding the least amount of environment modifications to force an acting agent to reveal its goal as early as possible. Figuring out an agent’s goal by observing its behavior is a problem studied in Psychology, Economics, and Artificial Intelligence, where it is known as goal recognition. Contrary to most common approaches where the focus is on finding faster algorithms to detect the goal, GRD takes an offline approach and focuses on environment design to facilitate goal recognition. This thesis investigates GRD problems when action outcomes are stochastic, which is the case of …


Predicting Disease Progression Using Deep Recurrent Neural Networks And Longitudinal Electronic Health Record Data, Seunghwan Kim May 2020

Predicting Disease Progression Using Deep Recurrent Neural Networks And Longitudinal Electronic Health Record Data, Seunghwan Kim

McKelvey School of Engineering Theses & Dissertations

Electronic Health Records (EHR) are widely adopted and used throughout healthcare systems and are able to collect and store longitudinal information data that can be used to describe patient phenotypes. From the underlying data structures used in the EHR, discrete data can be extracted and analyzed to improve patient care and outcomes via tasks such as risk stratification and prospective disease management. Temporality in EHR is innately present given the nature of these data, however, and traditional classification models are limited in this context by the cross- sectional nature of training and prediction processes. Finding temporal patterns in EHR is …


Predicting Disease Progression Using Deep Recurrent Neural Networks And Longitudinal Electronic Health Record Data, Seunghwan Kim May 2020

Predicting Disease Progression Using Deep Recurrent Neural Networks And Longitudinal Electronic Health Record Data, Seunghwan Kim

McKelvey School of Engineering Theses & Dissertations

Electronic Health Records (EHR) are widely adopted and used throughout healthcare systems and are able to collect and store longitudinal information data that can be used to describe patient phenotypes. From the underlying data structures used in the EHR, discrete data can be extracted and analyzed to improve patient care and outcomes via tasks such as risk stratification and prospective disease management. Temporality in EHR is innately present given the nature of these data, however, and traditional classification models are limited in this context by the cross-sectional nature of training and prediction processes. Finding temporal patterns in EHR is especially …


Graph Deep Learning: Methods And Applications, Muhan Zhang Dec 2019

Graph Deep Learning: Methods And Applications, Muhan Zhang

McKelvey School of Engineering Theses & Dissertations

The past few years have seen the growing prevalence of deep neural networks on various application domains including image processing, computer vision, speech recognition, machine translation, self-driving cars, game playing, social networks, bioinformatics, and healthcare etc. Due to the broad applications and strong performance, deep learning, a subfield of machine learning and artificial intelligence, is changing everyone's life.Graph learning has been another hot field among the machine learning and data mining communities, which learns knowledge from graph-structured data. Examples of graph learning range from social network analysis such as community detection and link prediction, to relational machine learning such as …


Differential Estimation Of Audiograms Using Gaussian Process Active Model Selection, Trevor Larsen May 2019

Differential Estimation Of Audiograms Using Gaussian Process Active Model Selection, Trevor Larsen

McKelvey School of Engineering Theses & Dissertations

Classical methods for psychometric function estimation either require excessive resources to perform, as in the method of constants, or produce only a low resolution approximation of the target psychometric function, as in adaptive staircase or up-down procedures. This thesis makes two primary contributions to the estimation of the audiogram, a clinically relevant psychometric function estimated by querying a patient’s for audibility of a collection of tones. First, it covers the implementation of a Gaussian process model for learning an audiogram using another audiogram as a prior belief to speed up the learning procedure. Second, it implements a use case of …


An Improved Algorithm For Learning To Perform Exception-Tolerant Abduction, Mengxue Zhang May 2017

An Improved Algorithm For Learning To Perform Exception-Tolerant Abduction, Mengxue Zhang

McKelvey School of Engineering Theses & Dissertations

Abstract

Inference from an observed or hypothesized condition to a plausible cause or explanation for this condition is known as abduction. For many tasks, the acquisition of the necessary knowledge by machine learning has been widely found to be highly effective. However, the semantics of learned knowledge are weaker than the usual classical semantics, and this necessitates new formulations of many tasks. We focus on a recently introduced formulation of the abductive inference task that is thus adapted to the semantics of machine learning. A key problem is that we cannot expect that our causes or explanations will be perfect, …


Indoor Scene Localization To Fight Sex Trafficking In Hotels, Abigail Stylianou Dec 2016

Indoor Scene Localization To Fight Sex Trafficking In Hotels, Abigail Stylianou

McKelvey School of Engineering Theses & Dissertations

Images are key to fighting sex trafficking. They are: (a) used to advertise for sex services,(b) shared among criminal networks, and (c) connect a person in an image to the place where the image was taken. This work explores the ability to link images to indoor places in order to support the investigation and prosecution of sex trafficking. We propose and develop a framework that includes a database of open-source information available on the Internet, a crowd-sourcing approach to gathering additional images, and explore a variety of matching approaches based both on hand-tuned features such as SIFT and learned features …


Automatically Characterizing Product And Process Incentives In Collective Intelligence, Allen Brockhurst Lavoie May 2016

Automatically Characterizing Product And Process Incentives In Collective Intelligence, Allen Brockhurst Lavoie

McKelvey School of Engineering Theses & Dissertations

Social media facilitate interaction and information dissemination among an unprecedented number of participants. Why do users contribute, and why do they contribute to a specific venue? Does the information they receive cover all relevant points of view, or is it biased? The substantial and increasing importance of online communication makes these questions more pressing, but also puts answers within reach of automated methods. I investigate scalable algorithms for understanding two classes of incentives which arise in collective intelligence processes. Product incentives exist when contributors have a stake in the information delivered to other users. I investigate product-relevant user behavior changes, …


Applying Bayesian Machine Learning Methods To Theoretical Surface Science, Shane Carr Dec 2015

Applying Bayesian Machine Learning Methods To Theoretical Surface Science, Shane Carr

McKelvey School of Engineering Theses & Dissertations

Machine learning is a rapidly evolving field in computer science with increasingly many applications to other domains. In this thesis, I present a Bayesian machine learning approach to solving a problem in theoretical surface science: calculating the preferred active site on a catalyst surface for a given adsorbate molecule. I formulate the problem as a low-dimensional objective function. I show how the objective function can be approximated into a certain confidence interval using just one iteration of the self-consistent field (SCF) loop in density functional theory (DFT). I then use Bayesian optimization to perform a global search for the solution. …