Abstract
Collaborative privacy-preserving planning (CPPP) has gained much attention in the past decade. To date, CPPP has focused on domains with deterministic action effects. In this paper, we extend CPPP to domains with stochastic action effects. We show how such environments can be modeled as an MDP. We then focus on the popular Real-Time Dynamic Programming (RTDP) algorithm for computing value functions for MDPs, extending it to the stochastic CPPP setting. We provide two versions of RTDP: a complete version identical to executing centralized RTDP, and an approximate version that sends significantly fewer messages and computes competitive policies in practice. We experiment on domains adapted from the deterministic CPPP literature.
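For readers unfamiliar with RTDP, the following is a minimal sketch of the standard centralized, single-agent algorithm on a goal-directed MDP with action costs. It is not the distributed or privacy-preserving variants described in the abstract. The function name `rtdp`, its parameters, and the toy chain domain are all assumptions made for illustration.

```python
import random

def rtdp(states, actions, transition, cost, goals, start,
         trials=200, max_steps=100, seed=0):
    """Illustrative centralized RTDP (hypothetical helper, not the paper's code).

    transition(s, a) returns a list of (next_state, probability) pairs.
    Values start at 0, which is admissible for non-negative costs.
    """
    rng = random.Random(seed)
    V = {s: 0.0 for s in states}

    def q(s, a):
        # Expected cost of taking a in s, then following the current estimate V.
        return cost(s, a) + sum(p * V[t] for t, p in transition(s, a))

    for _ in range(trials):
        s = start
        for _ in range(max_steps):
            if s in goals:
                break
            best = min(actions(s), key=lambda a: q(s, a))  # greedy action
            V[s] = q(s, best)                              # Bellman backup at the visited state
            succ = transition(s, best)
            # Sample the next state from the chosen action's outcome distribution.
            s = rng.choices([t for t, _ in succ],
                            weights=[p for _, p in succ])[0]
    return V

# Toy chain 0 -> 1 -> 2 -> 3 (goal), assumed for illustration only.
# "go" costs 1 and advances with probability 0.8; "slow" costs 2 and always advances.
def chain_transition(s, a):
    if a == "go":
        return [(s + 1, 0.8), (s, 0.2)]
    return [(s + 1, 1.0)]

V = rtdp(states=range(4),
         actions=lambda s: ["go", "slow"],
         transition=chain_transition,
         cost=lambda s, a: 1.0 if a == "go" else 2.0,
         goals={3},
         start=0)
```

In this toy domain the optimal policy always chooses "go", with an expected cost of 1.25 per step, so the estimate for the start state approaches 3.75 as the number of trials grows.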
Original language | English |
---|---|
Pages (from-to) | 258-262 |
Number of pages | 5 |
Journal | Proceedings International Conference on Automated Planning and Scheduling, ICAPS |
Volume | 30 |
State | Published - 29 May 2020 |
Event | 30th International Conference on Automated Planning and Scheduling, ICAPS 2020 - Nancy, France; Duration: 26 Oct 2020 → 30 Oct 2020 |
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Science Applications
- Information Systems and Management