Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 785

Full-Text Articles in Entire DC Network

A Limited-Preemption Scheduling Model Inspired By Security Considerations, Benjamin Standaert, Fatima Raadia, Marion Sudvarg, Sanjoy Baruah, Thidapat Chantem, Nathan Fisher, Christopher Gill Sep 2024

A Limited-Preemption Scheduling Model Inspired By Security Considerations, Benjamin Standaert, Fatima Raadia, Marion Sudvarg, Sanjoy Baruah, Thidapat Chantem, Nathan Fisher, Christopher Gill

Computer Science and Engineering Publications and Presentations

Safety-critical embedded systems such as autonomous vehicles typically have only very limited computational capabilities on board that must be carefully managed to provide required enhanced functionalities. As these systems become more complex and inter-connected, some parts may need to be secured to prevent unauthorized access, or isolated to ensure correctness.

We propose the multi-phase secure (MPS) task model as a natural extension of the widely used sporadic task model for modeling both the timing and the security (and isolation) requirements for such systems. Under MPS, task phases reflect execution using different security mechanisms which each have associated execution time costs …


Experimental Design For Scientific Discovery, Quan Minh Nguyen Jul 2024

Experimental Design For Scientific Discovery, Quan Minh Nguyen

McKelvey School of Engineering Theses & Dissertations

Experimental design offers an elegant model of many problems where one navigates within a vast search space seeking data points with certain characteristics. A multitude of applications in science and engineering fall under this umbrella, with drug and materials discovery being prime examples. The experimental design approach maintains a probabilistic model of the search space, and uses Bayesian decision theory accounting for this model to guide the accumulation of observed data to maximize an experimentation objective of interest. This dissertation explores Bayesian optimization and active search, two realizations of the experimental design framework that model discovery tasks. While existing solutions …


Tree Recovery By Dynamic Programming, Gustavo Alberto Gratacos May 2024

Tree Recovery By Dynamic Programming, Gustavo Alberto Gratacos

McKelvey School of Engineering Theses & Dissertations

Tree-like structures are common, naturally occurring objects that are of interest to many fields of study, such as plant science and biomedicine. Analysis of these structures is typically based on skeletons extracted from captured data, which often contain spurious segments or cycles that need to be removed. We propose a dynamic programming algorithm which seeks to recover directed trees from these noisy skeletons. Our method recovers trees by removing edges and duplicating nodes while adhering to edge-label constraints. Our algorithm proceeds by iteratively merging graph nodes, such that the solution on the original graph can be obtained from those on …


Mending Trust In Ai: Trust Repair Policy Interventions For Large Language Models In Visual Data Journalism, Hangxiao Zhu May 2024

Mending Trust In Ai: Trust Repair Policy Interventions For Large Language Models In Visual Data Journalism, Hangxiao Zhu

McKelvey School of Engineering Theses & Dissertations

Trust in Large Language Models (LLMs) emerged as a pivotal concern. This is because, despite the transformative potential of LLMs in enhancing the interpretability and interactivity of complex datasets, the opacity of these models and instances of inaccuracies or biases have led to a significant trust deficit among end-users. Moreover, there is a tendency for people to personify AI tools that utilize these LLMs, attributing abilities and sensibilities that they do not truly possess. This thesis exploits this personification and proposes a comprehensive framework of trust repair policies tailored to address the challenges inherent in LLM annotations within data journalism …


Interpretable Deep Learning Via Sparse Representation For Protein-Dna Interactions, Shane Kuei-Hsien Chu Dec 2023

Interpretable Deep Learning Via Sparse Representation For Protein-Dna Interactions, Shane Kuei-Hsien Chu

McKelvey School of Engineering Theses & Dissertations

A central theme in modeling in regulatory genomics is to consider what representation is suited for DNA sequences that give the most helpful information. Traditionally, there are k-mers, position-weight-matrices, parametric statistical models like Hidden Markov models, and, more recently, deep neural network. In this work, we introduce sparse representations as a principled framework for problems in regulatory genomics. We show that sparse representation is a framework that allows us to build techniques to answer challenging inferential questions in regulatory genomics. Leveraging sparse representations, we reveal that gapped and long motifs are prevalent in in-vivo datasets like ChIP-Seq, often corresponding to …


An Assistive Interface For Displaying Novice's Code History, Ruiwei Xiao May 2023

An Assistive Interface For Displaying Novice's Code History, Ruiwei Xiao

McKelvey School of Engineering Theses & Dissertations

As Teaching Assistant (TA) programs grow in number and size in introductory CS courses, TAs play a significant role in novice programmers' experience and contribute to their success. However, many TAs are also relative beginners themselves and thus have limited experience in programming and teaching. Thus the effectiveness and consistency of their guidance can vary significantly. To improve interaction quality and assist TAs in providing better support, we examine the difficulties encountered by inexperienced TAs in previous literature and then identify the potential for the high cognitive load as an unaddressed difficulty that may prevent new TAs from initiating effective …


Feature Selection From Clinical Surveys Using Semantic Textual Similarity, Benjamin Warner May 2023

Feature Selection From Clinical Surveys Using Semantic Textual Similarity, Benjamin Warner

McKelvey School of Engineering Theses & Dissertations

Survey data collected from human subjects can contain a high number of features while having a comparatively low quantity of examples. Machine learning models that attempt to predict outcomes from survey data under these conditions can overfit and result in poor generalizability. One remedy to this issue is feature selection, which attempts to select an optimal subset of features to learn upon. A relatively unexplored source of information in the feature selection process is the usage of textual names of features, which may be semantically indicative of which features are relevant to a target outcome. The relationships between feature names …


Understanding Societal Values Of Chatgpt, Yidan Tang May 2023

Understanding Societal Values Of Chatgpt, Yidan Tang

McKelvey School of Engineering Theses & Dissertations

As Large language models (LLMs) become increasingly pervasive in various domains, it is crucial to ensure that their outputs adhere to societal values and ethical considerations. In this thesis, we investigate the alignment of ChatGPT, a recent state-of-the-art large language model developed by OpenAI, with societal values. Specifically, we define the problem of societal values of LLMs and assemble a representative collection of 7 datasets covering 4 topics related to societal values. In-context learning techniques are applied and appropriate prompts are designed. The performance of each dataset is measured using a standardized evaluation system focused on accuracy. We then display …


Evaluating The Problem Solving Abilities Of Chatgpt, Fankun Zeng May 2023

Evaluating The Problem Solving Abilities Of Chatgpt, Fankun Zeng

McKelvey School of Engineering Theses & Dissertations

This thesis addresses the need for a fair evaluation of language models' problem solving abilities by presenting a unified evaluation framework for ChatGPT on 16 problem solving datasets (e.g., NaturalQA, HellaSwag, MMLU, etc.). We evaluate the model's performance using F1, exact match, and quasi-exact match metrics and find that ChatGPT is highly accurate in solving tasks that require commonsense and knowledge. However, we also identify truncated text bias and few-shot scenarios as challenges that may impact ChatGPT's performance. Our research highlights the importance of standardizing datasets and developing a unified evaluation system for the fair evaluation of language models. Overall, …


Comments Of The Cordell Institute On Ai Accountability, Neil M. Richards, Woodrow Hartzog, Jordan Francis Jan 2023

Comments Of The Cordell Institute On Ai Accountability, Neil M. Richards, Woodrow Hartzog, Jordan Francis

Scholarship@WashULaw

These comments are a response to the National Telecommunications and Information Administration's 2023 request for comment on AI accountability (AI Accountability RFC, NTIA–2023–0005).

Responding to NTIA’s recent inquiry into AI assurance and accountability, we offer two main arguments regarding the importance of substantive legal protections. First, a myopic focus on concepts of transparency, bias mitigation, and ethics (for which procedural compliance efforts such as audits, assessments, and certifications are proxies) is insufficient when it comes to the design and implementation of accountable AI systems. We call rules built around transparency and bias mitigation “AI half-measures,” because they provide the appearance …


Speeding Up The Quantification Of Contrast Sensitivity Functions Using Multidimensional Bayesian Active Learning, Shohaib Shaffiey Aug 2022

Speeding Up The Quantification Of Contrast Sensitivity Functions Using Multidimensional Bayesian Active Learning, Shohaib Shaffiey

McKelvey School of Engineering Theses & Dissertations

No abstract provided.


Design And Analysis Of Strategic Behavior In Networks, Sixie Yu Aug 2022

Design And Analysis Of Strategic Behavior In Networks, Sixie Yu

McKelvey School of Engineering Theses & Dissertations

Networks permeate every aspect of our social and professional life.A networked system with strategic individuals can represent a variety of real-world scenarios with socioeconomic origins. In such a system, the individuals' utilities are interdependent---one individual's decision influences the decisions of others and vice versa. In order to gain insights into the system, the highly complicated interactions necessitate some level of abstraction. To capture the otherwise complex interactions, I use a game theoretic model called Networked Public Goods (NPG) game. I develop a computational framework based on NPGs to understand strategic individuals' behavior in networked systems. The framework consists of three …


Smart Sensing And Clinical Predictions With Wearables: From Physiological Signals To Mental Health, Ruixuan Dai Aug 2022

Smart Sensing And Clinical Predictions With Wearables: From Physiological Signals To Mental Health, Ruixuan Dai

McKelvey School of Engineering Theses & Dissertations

Wearable devices such as smartwatches and wristbands are gaining adoption. Recent advances in technology in wearables enable remote health monitoring. However, there are challenges in exploiting wearables in healthcare applications. First, sensor readings from wearables are vulnerable to motion and noise artifacts. A robust pipeline is needed to extract reliable measurements from noisy signals. Second, while wearables support an increasing number of sensing modalities, there is a significant need to generate more clinically meaningful measurements with wearables. Finally, to incorporate wearables into clinical practice, we need to establish the link between wearable measurements and clinical outcomes, thus supporting clinical decisions. …


Dynamic Continuous Distributed Constraint Optimization Problems, Khoi Hoang Aug 2022

Dynamic Continuous Distributed Constraint Optimization Problems, Khoi Hoang

McKelvey School of Engineering Theses & Dissertations

The Distributed Constraint Optimization Problem (DCOP) formulation is a powerful tool to model multi-agent coordination problems that are distributed by nature. The formulation is suitable for problems where the environment does not change over time and where agents seek their value assignment from a discrete domain. However, in many real-world applications, agents often interact in a more dynamic environment and their variables usually require a more complex domain. Thus, the DCOP formulation lacks the capabilities to model the problems in such dynamic and complex environments. To address these limitations, researchers have proposed Dynamic DCOPs (D-DCOPs) to model how DCOPs dynamically …


Tfa Inference: Using Mathematical Modeling Of Gene Expression Data To Infer The Activity Of Transcription Factors, Cynthia Ma Aug 2022

Tfa Inference: Using Mathematical Modeling Of Gene Expression Data To Infer The Activity Of Transcription Factors, Cynthia Ma

McKelvey School of Engineering Theses & Dissertations

Transcription factors (TFs) are a set of proteins that play a key role in the information processing system that enables a cell to respond to changes in internal and external state. By binding near a gene in a cell’s DNA, a TF can influence that gene’s expression level, triggering the appropriate increase or decrease in production levels of proteins that are needed to handle stressors like a change in nutrient availability or damage to the cell’s internal structures. Transcription factor activity (TFA) is a measure of how much effect a TF has on its target genes in a given sample …


Development Of The Assessment Of Clinical Prediction Model Transportability (Apt) Checklist, Sean Chonghwan Yu Aug 2022

Development Of The Assessment Of Clinical Prediction Model Transportability (Apt) Checklist, Sean Chonghwan Yu

McKelvey School of Engineering Theses & Dissertations

Clinical Prediction Models (CPM) have long been used for Clinical Decision Support (CDS) initially based on simple clinical scoring systems, and increasingly based on complex machine learning models relying on large-scale Electronic Health Record (EHR) data. External implementation – or the application of CPMs on sites where it was not originally developed – is valuable as it reduces the need for redundant de novo CPM development, enables CPM usage by low resource organizations, facilitates external validation studies, and encourages collaborative development of CPMs. Further, adoption of externally developed CPMs has been facilitated by ongoing interoperability efforts in standards, policy, and …


Model-Based Deep Learning For Computational Imaging, Xiaojian Xu Aug 2022

Model-Based Deep Learning For Computational Imaging, Xiaojian Xu

McKelvey School of Engineering Theses & Dissertations

This dissertation addresses model-based deep learning for computational imaging. The motivation of our work is driven by the increasing interests in the combination of imaging model, which provides data-consistency guarantees to the observed measurements, and deep learning, which provides advanced prior modeling driven by data. Following this idea, we develop multiple algorithms by integrating the classical model-based optimization and modern deep learning to enable efficient and reliable imaging. We demonstrate the performance of our algorithms by validating their performance on various imaging applications and providing rigorous theoretical analysis.

The dissertation evaluates and extends three general frameworks, plug-and-play priors (PnP), regularized …


Machine Learning And Scalable Informatics Methods To Predict Disease Status From Multimodal Biomedical Data, Hossein Mohammadian Foroushani Aug 2022

Machine Learning And Scalable Informatics Methods To Predict Disease Status From Multimodal Biomedical Data, Hossein Mohammadian Foroushani

McKelvey School of Engineering Theses & Dissertations

Biological understanding of complex diseases such as stroke and obesity is critical for the advancement of medicine. Further knowledge discovery can provide effective biomarkers to improve disease diagnosis and prognosis, identify driver mutations, predict individual genetic susceptibility for early prevention and effective disease management, and facilitate development of personalized drugs. Stroke is the second leading cause of death and long-term disability in the world. Thus, stroke management is a time-sensitive emergency. The initial hours after stroke onset map the trajectory of subsequent neurologic complications. Cerebral edema develops hours to days after acute ischemic stroke and may result in midline shift …


Integrating Physical Models And Deep Priors For Computational Imaging, Yu Sun Aug 2022

Integrating Physical Models And Deep Priors For Computational Imaging, Yu Sun

McKelvey School of Engineering Theses & Dissertations

This dissertation addresses integrating physical models and learning priors for computational imaging. The motivation of our work is driven by the recent discussion of learning-based methods that solve the imaging inverse problem by directly learning a measurement-to-image mapping from the existing data: they achieve superior performance over the traditional model-based methods but lack the physical model to impose sufficient interpretation and guarantee of the final image. We adopt the classic statistical inference as the underlying formulation and integrate learning models as implicit image priors, such that our framework is able to simultaneously leverage physical models and learning priors. Additionally, the …


Human-Centered Machine Learning: Algorithm Design And Human Behavior, Wei Tang Aug 2022

Human-Centered Machine Learning: Algorithm Design And Human Behavior, Wei Tang

McKelvey School of Engineering Theses & Dissertations

Machine learning is increasingly engaged in a large number of important daily decisions and has great potential to reshape various sectors of our modern society. To fully realize this potential, it is important to understand the role that humans play in the design of machine learning algorithms and investigate the impacts of the algorithm on humans.

Towards the understanding of such interactions between humans and algorithms, this dissertation takes a human-centric perspective and focuses on investigating the interplay between human behavior and algorithm design. Accounting for the roles of humans in algorithm design creates unique challenges. For example, humans might …


Scheduling For High Throughput And Small Latency In Parallel And Distributed Systems, Zhe Wang Aug 2022

Scheduling For High Throughput And Small Latency In Parallel And Distributed Systems, Zhe Wang

McKelvey School of Engineering Theses & Dissertations

Parallel and distributed systems are pervasive, such as web services, clouds, and cyber-physical systems. We often desire high throughput and small latency in the parallel and distributed system. However, since the system is distributed and the input is online, scheduling for high throughput while keeping the latency small is often challenging. In this dissertation, we developed scheduling algorithms, policies, and mechanisms to approach high throughput with small latency in various parallel and distributed applications. First, we developed AMCilk runtime system for running multi-programmed parallel jobs on many-processor machines. When running parallel jobs, the allocation of processors to the parallel jobs …


Geometric Algorithms For Modeling Plant Roots From Images, Dan Zeng Aug 2022

Geometric Algorithms For Modeling Plant Roots From Images, Dan Zeng

McKelvey School of Engineering Theses & Dissertations

Roots, considered as the ”hidden half of the plant”, are essential to a plant’s health and pro- ductivity. Understanding root architecture has the potential to enhance efforts towards im- proving crop yield. In this dissertation we develop geometric approaches to non-destructively characterize the full architecture of the root system from 3D imaging while making com- putational advances in topological optimization. First, we develop a global optimization algorithm to remove topological noise, with applications in both root imaging and com- puter graphics. Second, we use our topology simplification algorithm, other methods from computer graphics, and customized algorithms to develop a high-throughput …


Dataset Evaluation For Data Trading Using Expected Loss And Homomorphic Encryption, Minsung Joo May 2022

Dataset Evaluation For Data Trading Using Expected Loss And Homomorphic Encryption, Minsung Joo

Senior Honors Papers / Undergraduate Theses

Supervised machine learning suffers from the ``garbage-in garbage-out" phenomenon where the performance of a model is limited by the quality of the data. While a myriad of data is collected every second, there is no general rigorous method of evaluating the quality of a given dataset. This hinders fair pricing of data in scenarios where a buyer may look to buy data for use with machine learning. In this work, I propose using the expected loss corresponding to a dataset as a measure of its quality, relying on Bayesian methods for uncertainty quantification. Furthermore, I present a secure multi-party computation …


The Challenges Of Applying Computational Legal Analysis To Mhealth Security And Privacy Regulations, Brian Tung Aug 2021

The Challenges Of Applying Computational Legal Analysis To Mhealth Security And Privacy Regulations, Brian Tung

McKelvey School of Engineering Theses & Dissertations

As our world has grown in complexity, so have our laws. By one measure, the United States Code has grown over 30x as long since 1935, and the 186,000-page Code of Federal Regulations has grown almost 10x in length since 1938. Our growing legal system is too complicated; it’s impossible for people to know all the laws that apply to them. However, people are still subject to the law, even if they are unfamiliar with it. Therein lies the need for computational legal analysis. Tools of computation (e.g., data visualization, algorithms, and artificial intelligence) have the potential to transform civic …


Machine Learning For Analog/Mixed-Signal Integrated Circuit Design Automation, Weidong Cao Aug 2021

Machine Learning For Analog/Mixed-Signal Integrated Circuit Design Automation, Weidong Cao

McKelvey School of Engineering Theses & Dissertations

Analog/mixed-signal (AMS) integrated circuits (ICs) play an essential role in electronic systems by processing analog signals and performing data conversion to bridge the analog physical world and our digital information world.Their ubiquitousness powers diverse applications ranging from smart devices and autonomous cars to crucial infrastructures. Despite such critical importance, conventional design strategies of AMS circuits still follow an expensive and time-consuming manual process and are unable to meet the exponentially-growing productivity demands from industry and satisfy the rapidly-changing design specifications from many emerging applications. Design automation of AMS IC is thus the key to tackling these challenges and has been …


Photoacoustic Imaging, Feature Extraction, And Machine Learning Implementation For Ovarian And Colorectal Cancer Diagnosis, Eghbal Amidi Aug 2021

Photoacoustic Imaging, Feature Extraction, And Machine Learning Implementation For Ovarian And Colorectal Cancer Diagnosis, Eghbal Amidi

McKelvey School of Engineering Theses & Dissertations

Among all cancers related to women’s reproductive systems, ovarian cancer has the highest mortality rate. Pelvic examination, transvaginal ultrasound (TVUS), and blood testing for cancer antigen 125 (CA-125), are the conventional screening tools for ovarian cancer, but they offer very low specificity. Other tools, such as magnetic resonance imaging (MRI), computed tomography (CT), and positron emission tomography (PET), also have limitations in detecting small lesions. In the USA, considering men and women separately, colorectal cancer is the third most common cause of death related to cancer; for men and women combined, it is the second leading cause of cancer deaths. …


Continuous-Time And Complex Growth Transforms For Analog Computing And Optimization, Oindrila Chatterjee Aug 2021

Continuous-Time And Complex Growth Transforms For Analog Computing And Optimization, Oindrila Chatterjee

McKelvey School of Engineering Theses & Dissertations

Analog computing is a promising and practical candidate for solving complex computational problems involving algebraic and differential equations. At the fundamental level, an analog computing framework can be viewed as a dynamical system that evolves following fundamental physical principles, like energy minimization, to solve a computing task. Additionally, conservation laws, such as conservation of charge, energy, or mass, provide a natural way to couple and constrain spatially separated variables. Taking a cue from these observations, in this dissertation, I have explored a novel dynamical system-based computing framework that exploits naturally occurring analog conservation constraints to solve a variety of optimization …


A Neuromorphic Machine Learning Framework Based On The Growth Transform Dynamical System, Ahana Gangopadhyay Aug 2021

A Neuromorphic Machine Learning Framework Based On The Growth Transform Dynamical System, Ahana Gangopadhyay

McKelvey School of Engineering Theses & Dissertations

As computation increasingly moves from the cloud to the source of data collection, there is a growing demand for specialized machine learning algorithms that can perform learning and inference at the edge in energy and resource-constrained environments. In this regard, we can take inspiration from small biological systems like insect brains that exhibit high energy-efficiency within a small form-factor, and show superior cognitive performance using fewer, coarser neural operations (action potentials or spikes) than the high-precision floating-point operations used in deep learning platforms. Attempts at bridging this gap using neuromorphic hardware has produced silicon brains that are orders of magnitude …


Bayesian Quadrature With Prior Information: Modeling And Policies, Henry Chai Aug 2021

Bayesian Quadrature With Prior Information: Modeling And Policies, Henry Chai

McKelvey School of Engineering Theses & Dissertations

Quadrature is the problem of estimating intractable integrals. Such integrals regularly arise in engineering and the natural sciences, especially when Bayesian methods are applied; examples include model evidences, normalizing constants and marginal distributions. This dissertation explores Bayesian quadrature, a probabilistic, model-based quadrature method. Specifically, we study different ways in which Bayesian quadrature can be adapted to account for different kinds of prior information one may have about the task. We demonstrate that by taking into account prior knowledge, Bayesian quadrature can outperform commonly used numerical methods that are agnostic to prior knowledge, such as Monte Carlo based integration. We focus …


Machine Learning In Complex Scientific Domains: Hospitalization Records, Drug Interactions, Predictive Modeling And Fairness For Class Imbalanced Data, Arghya Datta Aug 2021

Machine Learning In Complex Scientific Domains: Hospitalization Records, Drug Interactions, Predictive Modeling And Fairness For Class Imbalanced Data, Arghya Datta

McKelvey School of Engineering Theses & Dissertations

Machine learning has demonstrated potential in analyzing large, complex datasets and has become ubiquitous across many fields of scientific research. As machine learning is actively deployed in many complex and critical domains, it is essential for machine learning to engage with domain expertise to aid in knowledge discovery as well as address challenges in predictive modeling in complex domains. Domain expertise represents an essential and elaborate collection of knowledge that is often under-utilized when applying machine learning in complex domains. In this dissertation, I have addressed existing challenges regarding knowledge discovery in complex domains via engagement with domain expertise, particularly …