TY - JOUR
T1 - An efficient heuristic for a partially observable Markov decision process of machine replacement
AU - Sinuany-Stern, Zilla
AU - David, Israel
AU - Biran, Sigal
N1 - Funding Information:
Acknowledgements: FORTRAN software for the Lovejoy algorithm, together with full documentation, has been made available to us by courtesy of W.S. Lovejoy and D.D. Ayers. This work has been partly supported by the Paul Evanier Center for Robotics and Production Management at Ben-Gurion University.
PY - 1997/1/1
Y1 - 1997/1/1
N2 - There is, so far, only limited practical experience applying solution schemes for real-life partially observable Markov decision processes (POMDP's). In this work we address the special-case POMDP associated with the famous machine-replacement problem. The machine deteriorates down a series of states according to known transition probabilities. A state is identified by a probability of producing a defective item. Only a sample of the produced items is observable at each stage, in which it is to be decided whether to replace the machine or not. We suggest a very simple heuristic decision-rule that can easily handle replacement-type problems of large size and which is based on the Howard solution of the fully observable version of the problem. By a simulation experimental design we compare the performance of this heuristic relative to the generic POMDP solution algorithm which has been proposed by Lovejoy.
AB - There is, so far, only limited practical experience applying solution schemes for real-life partially observable Markov decision processes (POMDP's). In this work we address the special-case POMDP associated with the famous machine-replacement problem. The machine deteriorates down a series of states according to known transition probabilities. A state is identified by a probability of producing a defective item. Only a sample of the produced items is observable at each stage, in which it is to be decided whether to replace the machine or not. We suggest a very simple heuristic decision-rule that can easily handle replacement-type problems of large size and which is based on the Howard solution of the fully observable version of the problem. By a simulation experimental design we compare the performance of this heuristic relative to the generic POMDP solution algorithm which has been proposed by Lovejoy.
UR - http://www.scopus.com/inward/record.url?scp=0031077014&partnerID=8YFLogxK
U2 - 10.1016/S0305-0548(96)00043-3
DO - 10.1016/S0305-0548(96)00043-3
M3 - Article
AN - SCOPUS:0031077014
VL - 24
SP - 117
EP - 126
JO - Computers and Operations Research
JF - Computers and Operations Research
SN - 0305-0548
IS - 2
ER -