Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Missouri University of Science and Technology

Learning Systems

Engineering Management and Systems Engineering Faculty Research & Creative Works

Publication Year

Articles 1 - 2 of 2

Full-Text Articles in Engineering

An Enhanced Least-Squares Approach For Reinforcement Learning, Hailin Li, Cihan H. Dagli Jan 2003

An Enhanced Least-Squares Approach For Reinforcement Learning, Hailin Li, Cihan H. Dagli

Engineering Management and Systems Engineering Faculty Research & Creative Works

This paper presents an enhanced least-squares approach for solving reinforcement learning control problems. Model-free least-squares policy iteration (LSPI) method has been successfully used for this learning domain. Although LSPI is a promising algorithm that uses linear approximator architecture to achieve policy optimization in the spirit of Q-learning, it faces challenging issues in terms of the selection of basis functions and training samples. Inspired by orthogonal least-squares regression (OLSR) method for selecting the centers of RBF neural network, we propose a new hybrid learning method. The suggested approach combines LSPI algorithm with OLSR strategy and uses simulation as a tool to …


An Empirical Analysis Of Backpropagation Error Surface Initiation For Injection Molding Process Control, Alice E. Smith, Elaine R. Raterman, Cihan H. Dagli Jan 1991

An Empirical Analysis Of Backpropagation Error Surface Initiation For Injection Molding Process Control, Alice E. Smith, Elaine R. Raterman, Cihan H. Dagli

Engineering Management and Systems Engineering Faculty Research & Creative Works

Backpropagation neural networks are trained by adjusting initially random interconnecting weights according to the steepest local error surface gradient. The authors examine the practical implications of the arbitrary starting point on the error landscape of the ensuing trained network. The effects on network convergence and performance are tested empirically, varying parameters such as network size, training rate, transfer function and data representation. The data used are live process control data from an injection molding plant