Open Access. Powered by Scholars. Published by Universities.®

Operations Research, Systems Engineering and Industrial Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

2003

Learning (Artificial Intelligence)

Articles 1 - 2 of 2

Full-Text Articles in Operations Research, Systems Engineering and Industrial Engineering

An Enhanced Least-Squares Approach For Reinforcement Learning, Hailin Li, Cihan H. Dagli Jan 2003

An Enhanced Least-Squares Approach For Reinforcement Learning, Hailin Li, Cihan H. Dagli

Engineering Management and Systems Engineering Faculty Research & Creative Works

This paper presents an enhanced least-squares approach for solving reinforcement learning control problems. Model-free least-squares policy iteration (LSPI) method has been successfully used for this learning domain. Although LSPI is a promising algorithm that uses linear approximator architecture to achieve policy optimization in the spirit of Q-learning, it faces challenging issues in terms of the selection of basis functions and training samples. Inspired by orthogonal least-squares regression (OLSR) method for selecting the centers of RBF neural network, we propose a new hybrid learning method. The suggested approach combines LSPI algorithm with OLSR strategy and uses simulation as a tool to …


Combining Evolving Neural Network Classifiers Using Bagging, Sunghwan Sohn, Cihan H. Dagli Jan 2003

Combining Evolving Neural Network Classifiers Using Bagging, Sunghwan Sohn, Cihan H. Dagli

Engineering Management and Systems Engineering Faculty Research & Creative Works

The performance of the neural network classifier significantly depends on its architecture and generalization. It is usual to find the proper architecture by trial and error. This is time consuming and may not always find the optimal network. For this reason, we apply genetic algorithms to the automatic generation of neural networks. Many researchers have provided that combining multiple classifiers improves generalization. One of the most effective combining methods is bagging. In bagging, training sets are selected by resampling from the original training set and classifiers trained with these sets are combined by voting. We implement the bagging technique into …