Open Access. Powered by Scholars. Published by Universities.®

Operations Research, Systems Engineering and Industrial Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 8 of 8

Full-Text Articles in Operations Research, Systems Engineering and Industrial Engineering

Online Reinforcement Learning-Based Neural Network Controller Design For Affine Nonlinear Discrete-Time Systems, Qinmin Yang, Jagannathan Sarangapani Jul 2007

Online Reinforcement Learning-Based Neural Network Controller Design For Affine Nonlinear Discrete-Time Systems, Qinmin Yang, Jagannathan Sarangapani

Electrical and Computer Engineering Faculty Research & Creative Works

In this paper, a novel reinforcement learning neural network (NN)-based controller, referred to adaptive critic controller, is proposed for general multi-input and multi- output affine unknown nonlinear discrete-time systems in the presence of bounded disturbances. Adaptive critic designs consist of two entities, an action network that produces optimal solution and a critic that evaluates the performance of the action network. The critic is termed adaptive as it adapts itself to output the optimal cost-to-go function and the action network is adapted simultaneously based on the information from the critic. In our online learning method, one NN is designated as the …


Near Optimal Neural Network-Based Output Feedback Control Of Affine Nonlinear Discrete-Time Systems, Qinmin Yang, Jagannathan Sarangapani Jan 2007

Near Optimal Neural Network-Based Output Feedback Control Of Affine Nonlinear Discrete-Time Systems, Qinmin Yang, Jagannathan Sarangapani

Electrical and Computer Engineering Faculty Research & Creative Works

In this paper, a novel online reinforcement learning neural network (NN)-based optimal output feedback controller, referred to as adaptive critic controller, is proposed for affine nonlinear discrete-time systems, to deliver a desired tracking performance. The adaptive critic design consist of three entities, an observer to estimate the system states, an action network that produces optimal control input and a critic that evaluates the performance of the action network. The critic is termed adaptive as it adapts itself to output the optimal cost-to-go function which is based on the standard Bellman equation. By using the Lyapunov approach, the uniformly ultimate boundedness …


Online Reinforcement Learning Control Of Unknown Nonaffine Nonlinear Discrete Time Systems, Qinmin Yang, Jagannathan Sarangapani Jan 2007

Online Reinforcement Learning Control Of Unknown Nonaffine Nonlinear Discrete Time Systems, Qinmin Yang, Jagannathan Sarangapani

Electrical and Computer Engineering Faculty Research & Creative Works

In this paper, a novel neural network (NN) based online reinforcement learning controller is designed for nonaffine nonlinear discrete-time systems with bounded disturbances. The nonaffine systems are represented by nonlinear auto regressive moving average with exogenous input (NARMAX) model with unknown nonlinear functions. An equivalent affine-like representation for the tracking error dynamics is developed first from the original nonaffine system. Subsequently, a reinforcement learning-based neural network (NN) controller is proposed for the affine-like nonlinear error dynamic system. The control scheme consists of two NNs. One NN is designated as the critic, which approximates a predefined long-term cost function, whereas an …


Reinforcement Learning Based Output-Feedback Controller For Complex Nonlinear Discrete-Time Systems, Peter Shih, Jagannathan Sarangapani Jan 2007

Reinforcement Learning Based Output-Feedback Controller For Complex Nonlinear Discrete-Time Systems, Peter Shih, Jagannathan Sarangapani

Electrical and Computer Engineering Faculty Research & Creative Works

A novel reinforcement-learning based output-adaptive neural network (NN) controller, also referred as the adaptive-critic NN controller, is developed to track a desired trajectory for a class of complex feedback nonlinear discrete-time systems in the presence of bounded and unknown disturbances. This nonlinear discrete-time system consists of a second order system in nonstrict form and an affine nonlinear discrete-time system tightly coupled together. Two adaptive critic NN controllers are designed - primary one for the nonstrict system and the secondary one for the affine system. A Lyapunov function shows the uniformly ultimate boundedness (UUB) of the closed-loop tracking error, weight estimates …


Discrete-Time Neural Network Output Feedback Control Of Nonlinear Systems In Non-Strict Feedback Form, Pingan He, Jagannathan Sarangapani Jan 2004

Discrete-Time Neural Network Output Feedback Control Of Nonlinear Systems In Non-Strict Feedback Form, Pingan He, Jagannathan Sarangapani

Electrical and Computer Engineering Faculty Research & Creative Works

An adaptive neural network (NN)-based output feedback controller is proposed to deliver a desired tracking performance for a class of discrete-time nonlinear systems, which is represented in non-strict feedback form. The NN backstepping approach is utilized to design the adaptive output feedback controller consisting of: 1) a NN observer to estimate the system states with the input-output data, and 2) two NNs to generate the virtual and actual control inputs, respectively. The non-causal problem in the discrete-time backstepping design is avoided by using the universal NN approximator. The persistence excitation (PE) condition is relaxed both in the NN observer and …


Neuro Emission Controller For Minimizing Cyclic Dispersion In Spark Ignition Engines, Pingan He, Jagannathan Sarangapani Jan 2003

Neuro Emission Controller For Minimizing Cyclic Dispersion In Spark Ignition Engines, Pingan He, Jagannathan Sarangapani

Electrical and Computer Engineering Faculty Research & Creative Works

A novel neural network (NN) controller is developed to control spark ignition (SI) engines at extreme lean conditions. The purpose of neurocontroller is to reduce the cyclic dispersion at lean operation even when the engine dynamics are unknown. The stability analysis of the closed-loop control system is given and the boundedness of all signals is ensured. Results demonstrate that the cyclic dispersion is reduced significantly using the proposed controller. The neuro controller can also be extended to minimize engine emissions with high EGR levels, where similar complex cyclic dynamics are observed. Further, the proposed approach can be applied to control …


Adaptive Critic-Based Neural Network Controller For Uncertain Nonlinear Systems With Unknown Deadzones, Pingan He, Jagannathan Sarangapani, S. N. Balakrishnan Jan 2002

Adaptive Critic-Based Neural Network Controller For Uncertain Nonlinear Systems With Unknown Deadzones, Pingan He, Jagannathan Sarangapani, S. N. Balakrishnan

Electrical and Computer Engineering Faculty Research & Creative Works

A multilayer neural network (NN) controller in discrete-time is designed to deliver a desired tracking performance for a class of nonlinear systems with input deadzones. This multilayer NN controller has an adaptive critic NN architecture with two NNs for compensating the deadzone nonlinearity and a third NN for approximating the dynamics of the nonlinear system. A reinforcement learning scheme in discrete-time is proposed for the adaptive critic NN deadzone compensator, where the learning is performed based on a certain performance measure, which is supplied from a critic. The adaptive generating NN rejects the errors induced by the deadzone whereas a …


A Robust Unit Commitment Algorithm For Hydro-Thermal Optimization, Chao-An Li, Chung-Li Tseng, E. Hsu, R. B. Johnson, A. J. Svoboda May 1997

A Robust Unit Commitment Algorithm For Hydro-Thermal Optimization, Chao-An Li, Chung-Li Tseng, E. Hsu, R. B. Johnson, A. J. Svoboda

Engineering Management and Systems Engineering Faculty Research & Creative Works

This paper presents a unit commitment algorithm which combines the Lagrangian relaxation (LR), sequential unit commitment (SUC), and optimal unit decommitment (UD) methods to solve a general hydro-thermal optimization (HTO) problem. The authors argue that this approach retains the advantages of the LR method while addressing the method''s observed weaknesses to improve overall algorithm performance and quality of solution. The proposed approach has been implemented in a version of PG&E''s HTO program, and test results are presented.