Open Access. Powered by Scholars. Published by Universities.®

Operations Research, Systems Engineering and Industrial Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 2 of 2

Full-Text Articles in Operations Research, Systems Engineering and Industrial Engineering

Reinforcement-Learning-Based Output-Feedback Control Of Nonstrict Nonlinear Discrete-Time Systems With Application To Engine Emission Control, Peter Shih, Brian C. Kaul, Jagannathan Sarangapani, J. A. Drallmeier Oct 2009

Reinforcement-Learning-Based Output-Feedback Control Of Nonstrict Nonlinear Discrete-Time Systems With Application To Engine Emission Control, Peter Shih, Brian C. Kaul, Jagannathan Sarangapani, J. A. Drallmeier

Electrical and Computer Engineering Faculty Research & Creative Works

A novel reinforcement-learning-based output adaptive neural network (NN) controller, which is also referred to as the adaptive-critic NN controller, is developed to deliver the desired tracking performance for a class of nonlinear discrete-time systems expressed in nonstrict feedback form in the presence of bounded and unknown disturbances. The adaptive-critic NN controller consists of an observer, a critic, and two action NNs. The observer estimates the states and output, and the two action NNs provide virtual and actual control inputs to the nonlinear discrete-time system. The critic approximates a certain strategic utility function, and the action NNs minimize the strategic utility …


Neural Network Control Of Mobile Robot Formations Using Rise Feedback, Jagannathan Sarangapani, Travis Alan Dierks Apr 2009

Neural Network Control Of Mobile Robot Formations Using Rise Feedback, Jagannathan Sarangapani, Travis Alan Dierks

Electrical and Computer Engineering Faculty Research & Creative Works

In this paper, an asymptotically stable (AS) combined kinematic/torque control law is developed for leader-follower-based formation control using backstepping in order to accommodate the complete dynamics of the robots and the formation, and a neural network (NN) is introduced along with robust integral of the sign of the error feedback to approximate the dynamics of the follower as well as its leader using online weight tuning. It is shown using Lyapunov theory that the errors for the entire formation are as and that the NN weights are bounded as opposed to uniformly ultimately bounded stability which is typical with most …