An Experimental Analysis of the Two-Armed Bandit Program
- Creators
- Banks, Jeffrey
- Olson, Mark
- Porter, David
Abstract
We investigate, in an experimental setting, the behavior of single decision makers who at discrete time intervals over an "infinite" horizon may choose one action from a set of possible actions where this set is constant over time, i.e. a bandit problem. Two bandit environments are examined, one in which the predicted behavior should always be myopic (the two-armed bandit) and the other in which the predicted behavior should never be myopic (the one-armed bandit). We also investigate the comparative static predictions as the underlying parameter of the bandit environments are changed. The aggregate results show that the cutpoint behavior in the two bandit environments are quantitatively different and in the direction of the theoretical predictions. Furthermore, while a significant number of individual cutpoints exhibit nonstationarity (contrary to the theory), the most likely, i.e. maximum likelihood estimates, collection of decision rules that best explain overall behavior are those that are consistent with the underlying theory.
Additional Information
Published as Banks, Jeffrey, Mark Olson, and David Porter. "An experimental analysis of the bandit problem." Economic Theory 10, no. 1 (1997): 55-77.
Attached Files
Submitted - sswp892.pdf
Files
Name | Size | Download all |
---|---|---|
md5:4de3544f756027b52391b3318199ddb8
|
726.6 kB | Preview Download |
Additional details
- Alternative title
- An experimental analysis of the bandit problem
- Eprint ID
- 80690
- Resolver ID
- CaltechAUTHORS:20170822-142127351
- URL
- http://resolver.caltech.edu/CaltechAUTHORS:20160525-081816722
- Created
-
2017-08-23Created from EPrint's datestamp field
- Updated
-
2019-10-03Created from EPrint's last_modified field
- Caltech groups
- Social Science Working Papers