Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

2019

Series

PDF

Computer Engineering

University of Nevada, Las Vegas

Exploration and exploitation

Articles 1 - 1 of 1

Full-Text Articles in Engineering

A Graph-Based Reinforcement Learning Method With Converged State Exploration And Exploitation, Han Li, Tianding Chen, Hualiang Teng, Yingtao Jiang Jan 2019

A Graph-Based Reinforcement Learning Method With Converged State Exploration And Exploitation, Han Li, Tianding Chen, Hualiang Teng, Yingtao Jiang

Civil and Environmental Engineering and Construction Faculty Research

In any classical value-based reinforcement learning method, an agent, despite of its continuous interactions with the environment, is yet unable to quickly generate a complete and independent description of the entire environment, leaving the learning method to struggle with a difficult dilemma of choosing between the two tasks, namely exploration and exploitation. This problem becomes more pronounced when the agent has to deal with a dynamic environment, of which the configuration and/or parameters are constantly changing. In this paper, this problem is approached by first mapping a reinforcement learning scheme to a directed graph, and the set that contains all …