Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Theses/Dissertations

Reinforcement learning

Discipline
Institution
Publication Year
Publication

Articles 1 - 30 of 50

Full-Text Articles in Engineering

Location Aware Task Offloading Framework For Edge Computing Empowered By Reconfigurable Intelligent Surfaces, Md Sahabul Hossain Dec 2023

Location Aware Task Offloading Framework For Edge Computing Empowered By Reconfigurable Intelligent Surfaces, Md Sahabul Hossain

Electrical and Computer Engineering ETDs

In this thesis, an energy efficient task offloading mechanism in a multi-access edge computing (MEC) environment is introduced, based on the principles of Contract Theory. The technology of Reconfigurable Intelligent Surfaces (RIS) is adopted and serves as the enabler for the energy efficient task offloading, from the perspective of location-awareness and improved communication environment. Initially a novel positioning, navigation, and timing solution is designed, based on the RIS technology and an artificial intelligent method that selects a set of RISs to perform the multilateration technique and determine the Internet of Things (IoT) nodes’ positions in an efficient and accurate manner …


Adaptive Personalized Drug Delivery Method For Warfarin And Anemia Management: Modeling And Control., Affan Affan Dec 2023

Adaptive Personalized Drug Delivery Method For Warfarin And Anemia Management: Modeling And Control., Affan Affan

Electronic Theses and Dissertations

Personalized precision medicine aims to develop the appropriate treatments for suitable patients at the right time to obtain optimal results. Personalized medicine is challenging due to inter- and intra-patient variability, narrow therapeutic window, the effect of other medications, comorbidity (more than one disease at a time), nonlinear patient dynamics, and time-varying patient dose response characteristics which include bleeding (internal and external). This research aims to develop a framework for an adaptive personalized modeling and control method with minimum clinical patient specific dose response data for optimal drug dosing. The proposed methodology is applied to anemia and warfarin management. It is …


Deep Reinforcement Learning For The Design Of Structural Topologies, Nathan Brown Dec 2023

Deep Reinforcement Learning For The Design Of Structural Topologies, Nathan Brown

All Dissertations

Advances in machine learning algorithms and increased computational efficiencies have given engineers new capabilities and tools for engineering design. The presented work investigates using deep reinforcement learning (DRL), a subset of deep machine learning that teaches an agent to complete a task through accumulating experiences in an interactive environment, to design 2D structural topologies. Three unique structural topology design problems are investigated to validate DRL as a practical design automation tool to produce high-performing designs in structural topology domains.

The first design problem attempts to find a gradient-free alternative to solving the compliance minimization topology optimization problem. In the proposed …


Near-Optimal Control Of A Quadcopter Using Reinforcement Learning, Alberto Velazquez-Estrada May 2023

Near-Optimal Control Of A Quadcopter Using Reinforcement Learning, Alberto Velazquez-Estrada

Theses and Dissertations

This paper presents a novel control method for quadcopters that achieves near-optimal tracking control for input-affine nonlinear quadcopter dynamics. The method uses a reinforcement learning algorithm called Single Network Adaptive Critics (SNAC), which approximates a solution to the discrete-time Hamilton-Jacobi-Bellman (DT-HJB) equation using a single neural network trained offline. The control method involves two SNAC controllers, with the outer loop controlling the linear position and velocities (position control) and the inner loop controlling the angular position and velocities (attitude control). The resulting quadcopter controller provides optimal feedback control and tracks a trajectory for an infinite-horizon, and it is compared with …


Using Actor-Critic Reinforcement Learning For Control Of A Quadrotor Dynamics, Edgar Adrian Torres May 2023

Using Actor-Critic Reinforcement Learning For Control Of A Quadrotor Dynamics, Edgar Adrian Torres

Theses and Dissertations

This paper presents a quadrotor controller using reinforcement learning to generate near-optimal control signals. Two actor-critic algorithms are trained to control quadrotor dynamics. The dynamics are further simplified using small angle approximation. The actor-critic algorithm’s control policy is derived from Bellman’s equation providing a sufficient condition to optimality. Additionally, a smoother converter is implemented into the trajectory providing more reliable results. This paper provides derivations to the quadrotor’s dynamics and explains the control using the actor-critic algorithm. The results and simulations are compared to solutions from a commercial, optimal control solver, called DIDO.


Optimizing Constraint Selection In A Design Verification Environment For Efficient Coverage Closure, Vanessa Cooper Jan 2023

Optimizing Constraint Selection In A Design Verification Environment For Efficient Coverage Closure, Vanessa Cooper

CCE Theses and Dissertations

No abstract provided.


Power System Dynamic Control And Performance Improvement Based On Reinforcement Learning, Wei Gao Jan 2023

Power System Dynamic Control And Performance Improvement Based On Reinforcement Learning, Wei Gao

Electronic Theses and Dissertations

This dissertation investigates the feasibility and effectiveness of using Reinforcement Learning (RL) techniques for power system dynamic control, particularly voltage and frequency control. The conventional control strategies used in power systems are complex and time-consuming due to the complicated high-order nonlinearities of the system. RL, which is a type of neural network-based technique, has shown promise in solving these complex problems by fitting any nonlinear system with the proper network structure.

The proposed RL algorithm, called Guided Surrogate Gradient-based Evolution Strategy (GSES) determines the weights of the policy (which generates the action for our control reference signal) without back-propagation process …


Low-Reynolds-Number Locomotion Via Reinforcement Learning, Yuexin Liu Aug 2022

Low-Reynolds-Number Locomotion Via Reinforcement Learning, Yuexin Liu

Dissertations

This dissertation summarizes computational results from applying reinforcement learning and deep neural network to the designs of artificial microswimmers in the inertialess regime, where the viscous dissipation in the surrounding fluid environment dominates and the swimmer’s inertia is completely negligible. In particular, works in this dissertation consist of four interrelated studies of the design of microswimmers for different tasks: (1) a one-dimensional microswimmer in free-space that moves towards the target via translation, (2) a one-dimensional microswimmer in a periodic domain that rotates to reach the target, (3) a two-dimensional microswimmer that switches gaits to navigate to the designated targets in …


Effective Resource Scheduling For Collaborative Computing In Edge-Assisted Internet Of Things Systems, Qianqian Wang Aug 2022

Effective Resource Scheduling For Collaborative Computing In Edge-Assisted Internet Of Things Systems, Qianqian Wang

Electronic Thesis and Dissertation Repository

Along with rapidly evolving communications technologies and data analytics, Internet of Things (IoT) systems interconnect billions of smart devices to gather, exchange, analyze data, and perform tasks autonomously, which poses a huge pressure on IoT devices' computing capabilities. Taking advantage of collaborative computing enabled by cloud computing and edge computing technologies, IoT devices can offload computation tasks to idle computing devices and remote servers, thus alleviating their pressure. However, scheduling resources effectively to realize collaborative computing remains a severe challenge due to diverse application objectives, limited distributed resources, and unpredictable environments. To overcome the above challenges, this thesis aims to …


Outdoor Operations Of Multiple Quadrotors In Windy Environment, Deepan Lobo May 2022

Outdoor Operations Of Multiple Quadrotors In Windy Environment, Deepan Lobo

Dissertations

Coordinated multiple small unmanned aerial vehicles (sUAVs) offer several advantages over a single sUAV platform. These advantages include improved task efficiency, reduced task completion time, improved fault tolerance, and higher task flexibility. However, their deployment in an outdoor environment is challenging due to the presence of wind gusts. The coordinated motion of a multi-sUAV system in the presence of wind disturbances is a challenging problem when considering collision avoidance (safety), scalability, and communication connectivity. Performing wind-agnostic motion planning for sUAVs may produce a sizeable cross-track error if the wind on the planned route leads to actuator saturation. In a multi-sUAV …


Efficient Deep Learning And Its Applications, Zi Wang May 2022

Efficient Deep Learning And Its Applications, Zi Wang

Doctoral Dissertations

Deep neural networks (DNNs) have achieved huge successes in various tasks such as object classification and detection, image synthesis, game-playing, and biological developmental system simulation. State-or-the-art performance on these tasks is usually achieved by designing deeper and wider DNNs with the cost of huge storage size and high computational complexity. However, the over-parameterization problem of DNNs constrains their deployment in resource-limited devices, such as drones and mobile phones.

With these concerns, many network compression approaches are developed, such as quantization, neural architecture search, network pruning, and knowledge distillation. These approaches reduce the sizes and computational costs of DNNs while maintaining …


Pulse-Coupled Oscillator Networks: Achieving Phase Continuity And Learning Optimal Control In Physical Systems, Timothy Anglea May 2022

Pulse-Coupled Oscillator Networks: Achieving Phase Continuity And Learning Optimal Control In Physical Systems, Timothy Anglea

All Dissertations

In this dissertation, we consider the application of pulse-coupled oscillator theory to real-world, physical networks. When the phase of an oscillator is associated with a physical measure, such as clock timing or robotic heading, discontinuous adjustments of the oscillator's phase is undesirable and potentially disadvantageous. Rather, continuous adjustment of the oscillator phase value is needed over a certain amount of time. To ensure that both synchronization and desynchronization can still be achieved under the constraint of continuous phase value changes, we pursue a novel approach to analyze the generalization of a pulse-coupled oscillator network with a time-varying coupling strength. We …


A Deep Reinforcement Learning Approach With Prioritized Experience Replay And Importance Factor For Makespan Minimization In Manufacturing, Jose Napoleon Martinez Apr 2022

A Deep Reinforcement Learning Approach With Prioritized Experience Replay And Importance Factor For Makespan Minimization In Manufacturing, Jose Napoleon Martinez

LSU Doctoral Dissertations

In this research, we investigated the application of deep reinforcement learning (DRL) to a common manufacturing scheduling optimization problem, max makespan minimization. In this application, tasks are scheduled to undergo processing in identical processing units (for instance, identical machines, machining centers, or cells). The optimization goal is to assign the jobs to be scheduled to units to minimize the maximum processing time (i.e., makespan) on any unit.

Machine learning methods have the potential to "learn" structures in the distribution of job times that could lead to improved optimization performance and time over traditional optimization methods, as well as to adapt …


Distributed Control And Learning Of Connected And Autonomous Vehicles Approaching And Departing Signalized Intersections, Joshua Onyeka Ogbebor Apr 2022

Distributed Control And Learning Of Connected And Autonomous Vehicles Approaching And Departing Signalized Intersections, Joshua Onyeka Ogbebor

LSU Master's Theses

This thesis outlines methods for achieving energy-optimal control policies for autonomous vehicles approaching and departing a signalized traffic intersection. Connected and autonomous vehicle technology has gained wide interest from both research institutions and government agencies because it offers immense promise in advancing efficient energy usage and abating hazards that beset the current transportation system. Energy minimization is itself crucial in reducing the greenhouse emissions from fossil-fuel-powered vehicles and extending the battery life of electric vehicles which are presently the major alternative to fossil-fuel-powered vehicles. Two major forms of fuel minimization are studied. First, the eco-driving problem is solved for a …


Multiagent Routing Problem With Dynamic Target Arrivals Solved Via Approximate Dynamic Programming, Andrew E. Mogan Mar 2022

Multiagent Routing Problem With Dynamic Target Arrivals Solved Via Approximate Dynamic Programming, Andrew E. Mogan

Theses and Dissertations

This research formulates and solves the multiagent routing problem with dynamic target arrivals (MRP-DTA), a stochastic system wherein a team of autonomous unmanned aerial vehicles (AUAVs) executes a strike coordination and reconnaissance (SCAR) mission against a notional adversary. Dynamic target arrivals that occur during the mission present the team of AUAVs with a sequential decision-making process which we model via a Markov Decision Process (MDP). To combat the curse of dimensionality, we construct and implement a hybrid approximate dynamic programming (ADP) algorithmic framework that employs a parametric cost function approximation (CFA) which augments a direct lookahead (DLA) model via a …


Team Air Combat Using Model-Based Reinforcement Learning, David A. Mottice Mar 2022

Team Air Combat Using Model-Based Reinforcement Learning, David A. Mottice

Theses and Dissertations

We formulate the first generalized air combat maneuvering problem (ACMP), called the MvN ACMP, wherein M friendly AUCAVs engage against N enemy AUCAVs, developing a Markov decision process (MDP) model to control the team of M Blue AUCAVs. The MDP model leverages a 5-degree-of-freedom aircraft state transition model and formulates a directed energy weapon capability. Instead, a model-based reinforcement learning approach is adopted wherein an approximate policy iteration algorithmic strategy is implemented to attain high-quality approximate policies relative to a high performing benchmark policy. The ADP algorithm utilizes a multi-layer neural network for the value function approximation regression mechanism. One-versus-one …


Developing Reactive Distributed Aerial Robotics Platforms For Real-Time Contaminant Mapping, Joshua Ashley Jan 2022

Developing Reactive Distributed Aerial Robotics Platforms For Real-Time Contaminant Mapping, Joshua Ashley

Theses and Dissertations--Electrical and Computer Engineering

The focus of this research is to design a sensor data aggregation system and centralized sensor-driven trajectory planning algorithm for fixed-wing aircraft to optimally assist atmospheric simulators in mapping the local environment in real-time. The proposed application of this work is to be used in the event of a hazardous contaminant leak into the atmosphere as a fleet of sensing unmanned aerial vehicles (UAVs) could provide valuable information for evacuation measures. The data aggregation system was designed using a state-of-the-art networking protocol and radio with DigiMesh and a process/data management system in the ROS2 DDS. This system was tested to …


Reinforcement Learning For Process Control: Applications To Energy Systems, Elijah Ballard Hedrick Jan 2022

Reinforcement Learning For Process Control: Applications To Energy Systems, Elijah Ballard Hedrick

Graduate Theses, Dissertations, and Problem Reports

Reinforcement learning (RL) is a machine learning method that has recently seen significant research activity owing to its successes in the areas of robotics and gameplaying (Silver et al., 2017). However, significant challenges exist in the extension of these control methods to process control problems, where state and input signals are nearly always continuous and more stringent performance guarantees are required. The goal of this work is to explore ways that modern RL algorithms can be adapted to handle process control problems; avenues for this work include using RL with existing controllers such as model predictive control (MPC) and adapting …


Network Management, Optimization And Security With Machine Learning Applications In Wireless Networks, Mariam Nabil Dec 2021

Network Management, Optimization And Security With Machine Learning Applications In Wireless Networks, Mariam Nabil

Theses and Dissertations

Wireless communication networks are emerging fast with a lot of challenges and ambitions. Requirements that are expected to be delivered by modern wireless networks are complex, multi-dimensional, and sometimes contradicting. In this thesis, we investigate several types of emerging wireless networks and tackle some challenges of these various networks. We focus on three main challenges. Those are Resource Optimization, Network Management, and Cyber Security. We present multiple views of these three aspects and propose solutions to probable scenarios. The first challenge (Resource Optimization) is studied in Wireless Powered Communication Networks (WPCNs). WPCNs are considered a very promising approach towards sustainable, …


Multi-Stage Stochastic Optimization And Reinforcement Learning For Forestry Epidemic And Covid-19 Control Planning, Sabah Bushaj Aug 2021

Multi-Stage Stochastic Optimization And Reinforcement Learning For Forestry Epidemic And Covid-19 Control Planning, Sabah Bushaj

Dissertations

This dissertation focuses on developing new modeling and solution approaches based on multi-stage stochastic programming and reinforcement learning for tackling biological invasions in forests and human populations. Emerald Ash Borer (EAB) is the nemesis of ash trees. This research introduces a multi-stage stochastic mixed-integer programming model to assist forest agencies in managing emerald ash borer insects throughout the U.S. and maximize the public benets of preserving healthy ash trees. This work is then extended to present the first risk-averse multi-stage stochastic mixed-integer program in the invasive species management literature to account for extreme events. Significant computational achievements are obtained using …


Learning Of Radar System For Target Detection, Wei Jiang Aug 2021

Learning Of Radar System For Target Detection, Wei Jiang

Dissertations

In this dissertation, the problem of data-driven joint design of transmitted waveform and detector in a radar system is addressed. Two novel learning-based approaches to waveform and detector design are proposed based on end-to-end training of the radar system. The first approach consists of alternating supervised training of the detector for a fixed waveform and reinforcement learning of the transmitter for a fixed detector. In the second approach, the transmitter and detector are trained simultaneously. Various operational waveform constraints, such as peak-to-average-power ratio (PAR) and spectral compatibility, are incorporated into the design. Unlike traditional radar design methods that rely on …


Improving Reinforcement Learning Techniques For Medical Decision Making, Matthew Baucum Aug 2021

Improving Reinforcement Learning Techniques For Medical Decision Making, Matthew Baucum

Doctoral Dissertations

Reinforcement learning (RL) is a powerful tool for developing personalized treatment regimens from healthcare data. In RL, an agent samples experiences from an environment (such as a model of patient health) to learn a policy that maximizes long-term reward. This dissertation proposes methodological and practical developments in the application of RL to treatment planning problems.

First, we develop a novel time series model for simulating patient health states from observed clinical data. We use a generative neural network architecture that learns a direct mapping between distributions over clinical measurements at adjacent time points. We show that this model produces realistic …


Model-Based And Model-Free Approaches For Power System Security Assessment, Mariana Magdy Mounir Kamel Aug 2021

Model-Based And Model-Free Approaches For Power System Security Assessment, Mariana Magdy Mounir Kamel

Doctoral Dissertations

Continuous security assessment of a power system is necessary to insure a reliable, stable, and continuous supply of electrical power to customers. To this end, this dissertation identifies and explores some of the various challenges encountered in the field of power system security assessment. Accordingly, several model-based and/or model-free approaches were developed to overcome these challenges.

First, a voltage stability index, named TAVSI, is proposed. This index has three important features: TAVSI applies to general load models including ZIP, exponential, and induction motor loads; TAVSI can be used for both measurement-based and model-based voltage stability assessment; and finally, TAVSI is …


High-Density Parking For Autonomous Vehicles., Parag J. Siddique Aug 2021

High-Density Parking For Autonomous Vehicles., Parag J. Siddique

Electronic Theses and Dissertations

In a common parking lot, much of the space is devoted to lanes. Lanes must not be blocked for one simple reason: a blocked car might need to leave before the car that blocks it. However, the advent of autonomous vehicles gives us an opportunity to overcome this constraint, and to achieve a higher storage capacity of cars. Taking advantage of self-parking and intelligent communication systems of autonomous vehicles, we propose puzzle-based parking, a high-density design for a parking lot. We introduce a novel method of vehicle parking, which leads to maximum parking density. We then propose a heuristic method …


Development And Implementation Of Novel Intelligent Motor Control For Performance Enhancement Of Pmsm Drive In Electrified Vehicle Application, Soumava Bhattacharjee Jul 2021

Development And Implementation Of Novel Intelligent Motor Control For Performance Enhancement Of Pmsm Drive In Electrified Vehicle Application, Soumava Bhattacharjee

Electronic Theses and Dissertations

The demand for electrified vehicles has grown significantly over the last decade causing a shift in the automotive industry from traditional gasoline vehicles to electric vehicles (EVs). With the growing evolution of EVs, high power density, and high efficiency of electric powertrains (e–drive) are of the utmost need to achieve an extended driving range. However, achieving an extended driving range with enhanced e-drive performance is still a bottleneck.

The control algorithm of e–drive plays a vital role in its performance and reliability over time. Artificial intelligence (AI) and machine learning (ML) based intelligent control methods have proven their continued success …


Maximizing User Engagement In Short Marketing Campaigns Within An Online Living Lab: A Reinforcement Learning Perspective, Aniekan Michael Ini-Abasi Jan 2021

Maximizing User Engagement In Short Marketing Campaigns Within An Online Living Lab: A Reinforcement Learning Perspective, Aniekan Michael Ini-Abasi

Wayne State University Dissertations

ABSTRACT

MAXIMIZING USER ENGAGEMENT IN SHORT MARKETING CAMPAIGNS WITHIN AN ONLINE LIVING LAB: A REINFORCEMENT LEARNING PERSPECTIVE

by

ANIEKAN MICHAEL INI-ABASI

August 2021

Advisor: Dr. Ratna Babu Chinnam Major: Industrial & Systems Engineering Degree: Doctor of Philosophy

User engagement has emerged as the engine driving online business growth. Many firms have pay incentives tied to engagement and growth metrics. These corporations are turning to recommender systems as the tool of choice in the business of maximizing engagement. LinkedIn reported a 40% higher email response with the introduction of a new recommender system. At Amazon 35% of sales originate from recommendations, …


Machine Learning Based Applications For Data Visualization, Modeling, Control, And Optimization For Chemical And Biological Systems, Yan Ma Dec 2020

Machine Learning Based Applications For Data Visualization, Modeling, Control, And Optimization For Chemical And Biological Systems, Yan Ma

LSU Doctoral Dissertations

This dissertation report covers Yan Ma’s Ph.D. research with applicational studies of machine learning in manufacturing and biological systems. The research work mainly focuses on reaction modeling, optimization, and control using a deep learning-based approaches, and the work mainly concentrates on deep reinforcement learning (DRL). Yan Ma’s research also involves with data mining with bioinformatics. Large-scale data obtained in RNA-seq is analyzed using non-linear dimensionality reduction with Principal Component Analysis (PCA), t-Distributed Stochastic Neighbor Embedding (t-SNE), and Uniform Manifold Approximation and Projection (UMAP), followed by clustering analysis using k-Means and Hierarchical Density-Based Spatial Clustering with Noise (HDBSCAN). This report focuses …


Intelligent Data-Driven Energy Flow Controllers For Renewable Energy And Electrified Transportation Systems, Juan Rafael Nunez Forestieri Nov 2020

Intelligent Data-Driven Energy Flow Controllers For Renewable Energy And Electrified Transportation Systems, Juan Rafael Nunez Forestieri

LSU Doctoral Dissertations

In recent years, large scale deployments of electrical energy generation using renewable sources (RES) such as wind, solar and ocean wave power, along with more sustainable means of transformation have emerged in response to different initiatives oriented toward reducing greenhouse gas emissions. Strategies facilitating the integration of renewable generation into the grid and electric propulsion in transportation systems are proposed in this work.

Chapter 2 investigates the grid-connected operation of a wave energy converter (WEC) along with a hybrid supercapacitor/undersea energy storage system (HESS). A combined sizing and energy management strategy (EMS) based on reinforcement learning (RL) is proposed. Comparisons …


Gpu Resource Optimization And Scheduling For Shared Execution Environments, Ryan Seamus Luley Aug 2020

Gpu Resource Optimization And Scheduling For Shared Execution Environments, Ryan Seamus Luley

Dissertations - ALL

General purpose graphics processing units have become a computing workhorse for a variety of data- and compute-intensive applications, from large supercomputing systems for massive data analytics to small, mobile embedded devices for autonomous vehicles. Making effective and efficient use of these processors traditionally relies on extensive programmer expertise to design and develop kernel methods which simultaneously trade off task decomposition and resource exploitation. Often, new architecture designs force code refinements in order to continue to achieve optimal performance. At the same time, not all applications require full utilization of the system to achieve that optimal performance. In this case, the …


Monte Carlo Tree Search Applied To A Modified Pursuit/Evasion Scotland Yard Game With Rendezvous Spaceflight Operation Applications, Joshua A. Daughtery Jun 2020

Monte Carlo Tree Search Applied To A Modified Pursuit/Evasion Scotland Yard Game With Rendezvous Spaceflight Operation Applications, Joshua A. Daughtery

Theses and Dissertations

This thesis takes the Scotland Yard board game and modifies its rules to mimic important aspects of space in order to facilitate the creation of artificial intelligence for space asset pursuit/evasion scenarios. Space has become a physical warfighting domain. To combat threats, an understanding of the tactics, techniques, and procedures must be captured and studied. Games and simulations are effective tools to capture data lacking historical context. Artificial intelligence and machine learning models can use simulations to develop proper defensive and offensive tactics, techniques, and procedures capable of protecting systems against potential threats. Monte Carlo Tree Search is a bandit-based …