TY - JOUR
T1 - Privacy preserving planning in stochastic environments
AU - Shani, Guy
AU - Stern, Roni
AU - Hefner, Tommy
N1 - Funding Information:
This research was supported by ISF grant #210/17 to Roni Stern. This research was also supported by the ISF fund under under grant #1210/18.
Publisher Copyright:
Copyright © 2020, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2020/5/29
Y1 - 2020/5/29
N2 - Collaborative privacy preserving planning (CPPP) has gained much attention in the past decade. To date, CPPP has focused on domains with deterministic action effects. In this paper, we extend CPPP to domains with stochastic action effects. We show how such environments can be modeled as an MDP. We then focus on the popular Real-Time Dynamic Programming (RTDP) algorithm for computing value functions for MDPs, extending it to the stochastic CPPP setting. We provide two versions of RTDP: a complete version identical to executing centralized RTDP, and an approximate version that sends significantly fewer messages and computes competitive policies in practice. We experiment on domains adapted from the deterministic CPPP literature.
AB - Collaborative privacy preserving planning (CPPP) has gained much attention in the past decade. To date, CPPP has focused on domains with deterministic action effects. In this paper, we extend CPPP to domains with stochastic action effects. We show how such environments can be modeled as an MDP. We then focus on the popular Real-Time Dynamic Programming (RTDP) algorithm for computing value functions for MDPs, extending it to the stochastic CPPP setting. We provide two versions of RTDP: a complete version identical to executing centralized RTDP, and an approximate version that sends significantly fewer messages and computes competitive policies in practice. We experiment on domains adapted from the deterministic CPPP literature.
UR - http://www.scopus.com/inward/record.url?scp=85088535297&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85088535297
SN - 2334-0835
VL - 30
SP - 258
EP - 262
JO - Proceedings International Conference on Automated Planning and Scheduling, ICAPS
JF - Proceedings International Conference on Automated Planning and Scheduling, ICAPS
T2 - 30th International Conference on Automated Planning and Scheduling, ICAPS 2020
Y2 - 26 October 2020 through 30 October 2020
ER -