TY - GEN

T1 - Learning Probably Approximately Complete and Safe Action Models for Stochastic Worlds

AU - Juba, Brendan

AU - Stern, Roni

PY - 2022

Y1 - 2022

N2 - We consider the problem of learning action models for planning in unknown stochastic environments that can be defined using the Probabilistic Planning Domain Description Language (PPDDL). As input, we are given a set of previously executed trajectories, and the main challenge is to learn an action model that has a similar goal achievement probability to the policies used to create these trajectories. To this end, we introduce a variant of PPDDL in which there is uncertainty about the transition probabilities, specified by an interval for each factor that contains the respective true transition probabilities. Then, we present SAM+, an algorithm that learns such an imprecise-PPDDL environment model. SAM+ has polynomial time and sample complexity, and guarantees that with high probability, the true environment is indeed captured by the defined intervals. We prove that the action model SAM+ outputs has a goal achievement probability that is almost as good as or better than that of the policies used to produce the training trajectories. Then, we show how to produce a PPDDL model based on this imprecise-PPDDL model that has similar properties.

M3 - Conference contribution

SP - 9795

EP - 9804

BT - The Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

ER -