TY - GEN
T1 - Resolving perceptual aliasing in the presence of noisy sensors
AU - Brafman, Ronen I.
AU - Shani, Guy
PY - 2005/1/1
Y1 - 2005/1/1
N2 - Agents learning to act in a partially observable domain may need to overcome the problem of perceptual aliasing - i.e., different states that appear similar but require different responses. This problem is exacerbated when the agent's sensors are noisy, i.e., sensors may produce different observations in the same state. We show that many well-known reinforcement learning methods designed to deal with perceptual aliasing, such as Utile Suffix Memory, finite size history windows, eligibility traces, and memory bits, do not handle noisy sensors well. We suggest a new algorithm, Noisy Utile Suffix Memory (NUSM), based on USM, that uses a weighted classification of observed trajectories. We compare NUSM to the above methods and show it to be more robust to noise.
AB - Agents learning to act in a partially observable domain may need to overcome the problem of perceptual aliasing - i.e., different states that appear similar but require different responses. This problem is exacerbated when the agent's sensors are noisy, i.e., sensors may produce different observations in the same state. We show that many well-known reinforcement learning methods designed to deal with perceptual aliasing, such as Utile Suffix Memory, finite size history windows, eligibility traces, and memory bits, do not handle noisy sensors well. We suggest a new algorithm, Noisy Utile Suffix Memory (NUSM), based on USM, that uses a weighted classification of observed trajectories. We compare NUSM to the above methods and show it to be more robust to noise.
UR - http://www.scopus.com/inward/record.url?scp=84899012421&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84899012421
SN - 0262195348
SN - 9780262195348
T3 - Advances in Neural Information Processing Systems
BT - Advances in Neural Information Processing Systems 17 - Proceedings of the 2004 Conference, NIPS 2004
PB - Neural information processing systems foundation
T2 - 18th Annual Conference on Neural Information Processing Systems, NIPS 2004
Y2 - 13 December 2004 through 16 December 2004
ER -