TY - JOUR
T1 - Using mathematical programming to solve Factored Markov Decision Processes with Imprecise Probabilities
AU - Delgado, Karina Valdivia
AU - De Barros, Leliane Nunes
AU - Cozman, Fabio Gagliardi
AU - Sanner, Scott
PY - 2011/10/1
Y1 - 2011/10/1
N2 - This paper investigates Factored Markov Decision Processes with Imprecise Probabilities (MDPIPs); that is, Factored Markov Decision Processes (MDPs) where transition probabilities are imprecisely specified. We derive efficient approximate solutions for Factored MDPIPs based on mathematical programming. To do this, we extend previous linear programming approaches for linear approximations in Factored MDPs, resulting in a multilinear formulation for robust "maximin" linear approximations in Factored MDPIPs. By exploiting the factored structure in MDPIPs we are able to demonstrate orders of magnitude reduction in solution time over standard exact non-factored approaches, in exchange for relatively low approximation errors, on a difficult class of benchmark problems with millions of states. © 2011 Elsevier Inc. All rights reserved.
AB - This paper investigates Factored Markov Decision Processes with Imprecise Probabilities (MDPIPs); that is, Factored Markov Decision Processes (MDPs) where transition probabilities are imprecisely specified. We derive efficient approximate solutions for Factored MDPIPs based on mathematical programming. To do this, we extend previous linear programming approaches for linear approximations in Factored MDPs, resulting in a multilinear formulation for robust "maximin" linear approximations in Factored MDPIPs. By exploiting the factored structure in MDPIPs we are able to demonstrate orders of magnitude reduction in solution time over standard exact non-factored approaches, in exchange for relatively low approximation errors, on a difficult class of benchmark problems with millions of states. © 2011 Elsevier Inc. All rights reserved.
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=80052406723&origin=inward
UR - https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=80052406723&origin=inward
U2 - 10.1016/j.ijar.2011.04.002
DO - 10.1016/j.ijar.2011.04.002
M3 - Article
SN - 0888-613X
SP - 1000
EP - 1017
JO - International Journal of Approximate Reasoning
JF - International Journal of Approximate Reasoning
ER -