Error bounds for calculation of the Gittins indices
Wang, Y-G. (1997) Error bounds for calculation of the Gittins indices. Australian Journal of Statistics, 39(2), pp. 225-233.
For a wide class of semi-Markov decision processes the optimal policies are expressible in terms of the Gittins indices, which have been found useful in sequential clinical trials and pharmaceutical research planning. In general, the indices can be approximated via calibration based on dynamic programming of finite horizon. This paper provides some results on the accuracy of such approximations, and, in particular, gives the error bounds for some well known processes (Bernoulli reward processes, normal reward processes and exponential target processes).
Impact and interest:
Citation counts are sourced monthly from and citation databases.
These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.
Citations counts from theindexing service can be viewed at the linked Google Scholar™ search.
|Item Type:||Journal Article|
|Additional Information:||ISI Document Delivery No.: XX286 Times Cited: 3 Cited Reference Count: 19 Wang, YG Australian statistical publishing assoc inc Canberra|
|Keywords:||bandit process, clinical trials, dynamic programming, stopping time, multi-armed bandits, dynamic allocation indexes, delayed-responses, clinical-trials|
|Divisions:||Current > QUT Faculties and Divisions > Science & Engineering Faculty|
|Copyright Owner:||Copyright CSIRO|
|Deposited On:||24 Nov 2015 04:33|
|Last Modified:||24 Nov 2015 04:33|
Repository Staff Only: item control page