Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Virginia Commonwealth University

2006

Bandit problem

Full-Text Articles in Physical Sciences and Mathematics

Bayesian Analysis, Endogenous Data, And Convergence Of Beliefs, Andrew T. Foerster Jan 2006

Theses and Dissertations

Problems in statistical analysis, economics, and many other disciplines often involve a trade-off between rewards and additional information that could yield higher future rewards. This thesis investigates such a trade-off, using a class of problems known as bandit problems. In these problems, a reward-seeking agent makes decisions based upon his beliefs about a parameter that controls rewards. While some choices may generate higher short-term rewards, other choices may provide information that allows the agent to learn about the parameter, thereby potentially increasing future rewards. Learning occurs if the agent's subjective beliefs about the parameter converge over time to the parameter's …
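The trade-off described in the abstract can be made concrete with a small simulation. The sketch below is an illustration only and is not taken from the thesis: it assumes a Bernoulli bandit, independent Beta(1, 1) priors on each arm's success probability, and Thompson sampling as the decision rule. The function name `run_bandit` and its parameters are hypothetical.

```python
import random


def run_bandit(true_probs, steps, seed=0):
    """Simulate a Bayesian agent on a Bernoulli bandit (illustrative sketch).

    Assumptions, not taken from the thesis: Beta(1, 1) priors on each arm
    and Thompson sampling as the decision rule.
    """
    rng = random.Random(seed)
    # One [alpha, beta] pair per arm; Beta(1, 1) is the uniform prior.
    beliefs = [[1.0, 1.0] for _ in true_probs]
    total_reward = 0

    for _ in range(steps):
        # Draw one plausible success rate per arm from the current beliefs
        # and play the arm whose draw is largest.
        draws = [rng.betavariate(a, b) for a, b in beliefs]
        arm = max(range(len(draws)), key=draws.__getitem__)

        # The hidden parameter controls the reward the agent observes.
        reward = 1 if rng.random() < true_probs[arm] else 0

        # Conjugate Beta-Bernoulli update: only the played arm is learned
        # about, which is what makes the data endogenous.
        beliefs[arm][0] += reward
        beliefs[arm][1] += 1 - reward
        total_reward += reward

    posterior_means = [a / (a + b) for a, b in beliefs]
    pulls = [int(a + b - 2) for a, b in beliefs]
    return posterior_means, pulls, total_reward


if __name__ == "__main__":
    means, pulls, reward = run_bandit([0.2, 0.8], steps=2000)
    print("posterior means:", means)
    print("pulls per arm:  ", pulls)
    print("total reward:   ", reward)
```

Because the agent only observes rewards from the arm it plays, its beliefs about a rarely played arm are updated rarely and can stay imprecise. Comparing the posterior means with the pull counts for each arm makes that dependence of learning on the agent's own choices visible.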