TY - GEN
T1 - Decentralized Anomaly Detection via Deep Multi-Agent Reinforcement Learning
AU - Szostak, Hadar
AU - Cohen, Kobi
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022/1/1
Y1 - 2022/1/1
N2 - We consider a decentralized anomaly detection problem, where multiple agents collaborate to localize a single anomalous process among a finite number M of processes. At each time, a subset of the processes can be observed by each agent, and the observations from each chosen process follow two different distributions, depending on whether the process is normal or abnormal. The communication channel between agents is rate-limited. The objective is a sequential search strategy that minimizes the Bayes risk, consisting of the sampling cost, and the joint terminal cost among the agents. This problem generalizes previous studies that considered anomaly detection by a single detector. We develop a novel algorithm based on deep multi-agent reinforcement learning optimization to minimize the Bayes risk. Numerical experiments demonstrate the ability of the algorithm to learn good policies in this challenging problem, and improve the single-agent performance by applying the proposed multi-agent collaborative learning method.
AB - We consider a decentralized anomaly detection problem, where multiple agents collaborate to localize a single anomalous process among a finite number M of processes. At each time, a subset of the processes can be observed by each agent, and the observations from each chosen process follow two different distributions, depending on whether the process is normal or abnormal. The communication channel between agents is rate-limited. The objective is a sequential search strategy that minimizes the Bayes risk, consisting of the sampling cost, and the joint terminal cost among the agents. This problem generalizes previous studies that considered anomaly detection by a single detector. We develop a novel algorithm based on deep multi-agent reinforcement learning optimization to minimize the Bayes risk. Numerical experiments demonstrate the ability of the algorithm to learn good policies in this challenging problem, and improve the single-agent performance by applying the proposed multi-agent collaborative learning method.
UR - http://www.scopus.com/inward/record.url?scp=85142646070&partnerID=8YFLogxK
U2 - 10.1109/Allerton49937.2022.9929423
DO - 10.1109/Allerton49937.2022.9929423
M3 - Conference contribution
AN - SCOPUS:85142646070
T3 - 2022 58th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2022
BT - 2022 58th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2022
PB - Institute of Electrical and Electronics Engineers
T2 - 58th Annual Allerton Conference on Communication, Control, and Computing, Allerton 2022
Y2 - 27 September 2022 through 30 September 2022
ER -