TY - GEN
T1 - Strong probabilistic planning
AU - Do Lago Pereira, Silvio
AU - De Barros, Leliane Nunes
AU - Cozman, Fábio Gagliardi
PY - 2008/12/5
Y1 - 2008/12/5
N2 - We consider the problem of synthesizing policies, in domains where actions have probabilistic effects, that are optimal in the expected-case among the optimal worst-case strong policies. Thus we combine features from nondeterministic and probabilistic planning in a single framework. We present an algorithm that combines dynamic programming and model checking techniques to find plans satisfying the problem requirements: the strong preimage computation from model checking is used to avoid actions that lead to cycles or dead ends, reducing the model to a Markov Decision Process where all possible policies are strong and worst-case optimal (i.e., successful and minimum length with probability 1). We show that backward induction can then be used to select a policy in this reduced model. The resulting algorithm is presented in two versions (enumerative and symbolic); we show that the latter version allows planning with extended reachability goals. © 2008 Springer Berlin Heidelberg.
AB - We consider the problem of synthesizing policies, in domains where actions have probabilistic effects, that are optimal in the expected-case among the optimal worst-case strong policies. Thus we combine features from nondeterministic and probabilistic planning in a single framework. We present an algorithm that combines dynamic programming and model checking techniques to find plans satisfying the problem requirements: the strong preimage computation from model checking is used to avoid actions that lead to cycles or dead ends, reducing the model to a Markov Decision Process where all possible policies are strong and worst-case optimal (i.e., successful and minimum length with probability 1). We show that backward induction can then be used to select a policy in this reduced model. The resulting algorithm is presented in two versions (enumerative and symbolic); we show that the latter version allows planning with extended reachability goals. © 2008 Springer Berlin Heidelberg.
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=57049184587&origin=inward
UR - https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=57049184587&origin=inward
U2 - 10.1007/978-3-540-88636-561
DO - 10.1007/978-3-540-88636-561
M3 - Conference contribution
SN - 3540886354
SN - 9783540886358
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 636
EP - 652
BT - Strong probabilistic planning
T2 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Y2 - 1 January 2018
ER -