Fingerprint
Dive into the research topics of 'R-MAX - A general polynomial time algorithm for near-optimal reinforcement learning'. Together they form a unique fingerprint.- Sort by
- Weight
- Alphabetically
Ronen I. Brafman, Moshe Tennenholtz
Research output: Contribution to journal › Conference article › peer-review