Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Mathematics

Georgia Southern University

2019

Count-based exploration

Full-Text Articles in Entire DC Network

Exploration Using Without-Replacement Sampling Of Actions Is Sometimes Inferior, Stephen W. Carden, S. Dalton Walker, May 2019

Department of Mathematical Sciences Faculty Publications

In many statistical and machine learning applications, without-replacement sampling is considered superior to with-replacement sampling. In some cases, this has been proven, and in others the heuristic is so intuitively attractive that it is taken for granted. In reinforcement learning, many count-based exploration strategies are justified by reliance on the aforementioned heuristic. This paper will detail the non-intuitive discovery that when measuring the goodness of an exploration strategy by the stochastic shortest path to a goal state, there is a class of processes for which an action selection strategy based on without-replacement sampling of actions can be worse than with-replacement …
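
To make the comparison in the abstract concrete, the following is a minimal sketch of the two action-selection strategies and of a steps-to-goal measurement. It is not the authors' code or experimental setup: the function names, the toy chain process in `toy_step_factory`, and all parameter values are hypothetical and chosen only for illustration.

```python
import random
from collections import defaultdict


def with_replacement_policy(rng, n_actions):
    """Selector that picks uniformly at random, ignoring past choices."""
    def select(state):
        return rng.randrange(n_actions)
    return select


def without_replacement_policy(rng, n_actions):
    """Selector that tries every action in a state once before repeating any."""
    remaining = defaultdict(list)  # state -> actions not yet tried in this pass

    def select(state):
        if not remaining[state]:
            # Start a new pass over all actions in a fresh random order.
            remaining[state] = list(range(n_actions))
            rng.shuffle(remaining[state])
        return remaining[state].pop()
    return select


def steps_to_goal(step, select, start, goal, max_steps=10_000):
    """Count transitions until the goal is reached (capped at max_steps)."""
    state, t = start, 0
    while state != goal and t < max_steps:
        state = step(state, select(state))
        t += 1
    return t


def toy_step_factory(rng, n_states=4, p_forward=0.5):
    """Hypothetical chain (an assumption, not taken from the paper).

    Action 0 advances one state with probability p_forward and otherwise
    stays put; every other action resets the process to state 0.
    """
    def step(state, action):
        if action == 0:
            if rng.random() < p_forward:
                return min(state + 1, n_states - 1)
            return state
        return 0
    return step


def mean_steps(make_policy, n_runs=2000, seed=0, n_actions=2, n_states=4):
    """Monte Carlo estimate of the mean number of steps to reach the goal."""
    rng = random.Random(seed)
    step = toy_step_factory(rng, n_states)
    total = 0
    for _ in range(n_runs):
        select = make_policy(rng, n_actions)  # fresh counts for each run
        total += steps_to_goal(step, select, 0, n_states - 1)
    return total / n_runs
```

Calling `mean_steps(with_replacement_policy)` and `mean_steps(without_replacement_policy)` gives one estimate per strategy on this toy process. Which strategy comes out ahead depends on the process being explored, so the toy chain should not be read as a reproduction of the class of processes the paper identifies.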