TY - GEN
T1 - Simultaneous Sensing and Channel Access based on Partial Observations via Deep Reinforcement Learning
AU - Bokobza, Yoel
AU - Dabora, Ron
AU - Cohen, Kobi
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022/1/1
Y1 - 2022/1/1
N2 - This paper is eligible for the Jack Keil Wolf ISIT Student Paper Award. In this paper we study dynamic spectrum access (DSA) in cognitive wireless networks, consisting of primary users (PUs) and a secondary user (SU) which has only partial observations. The traffic patterns of the PUs are modeled as finite-memory Markov chains, and are unknown to the SU. It is noted that as observations are partial, then both channel sensing and channel access actions affect the throughput. Our objective in this work is to design a DSA algorithm such that the SU's long-term throughput is maximized. To that aim, we show theoretically that the DSA problem can be formulated as a single-agent problem with a single policy for both sensing and access, and propose a novel algorithm that learns both the optimal access policy and the optimal sensing policy via deep Q-learning, which is referred to as Double Deep Q-network for Sensing and Access (DDQSA). To the best of our knowledge, this is the first instance of a deep Q-learning-based DSA algorithm, which learns both sensing and access policies. Our results show that the DDQSA algorithm learns a policy that implements both sensing and channel access, and achieves significantly better performance compared to existing approaches.
AB - This paper is eligible for the Jack Keil Wolf ISIT Student Paper Award. In this paper we study dynamic spectrum access (DSA) in cognitive wireless networks, consisting of primary users (PUs) and a secondary user (SU) which has only partial observations. The traffic patterns of the PUs are modeled as finite-memory Markov chains, and are unknown to the SU. It is noted that as observations are partial, then both channel sensing and channel access actions affect the throughput. Our objective in this work is to design a DSA algorithm such that the SU's long-term throughput is maximized. To that aim, we show theoretically that the DSA problem can be formulated as a single-agent problem with a single policy for both sensing and access, and propose a novel algorithm that learns both the optimal access policy and the optimal sensing policy via deep Q-learning, which is referred to as Double Deep Q-network for Sensing and Access (DDQSA). To the best of our knowledge, this is the first instance of a deep Q-learning-based DSA algorithm, which learns both sensing and access policies. Our results show that the DDQSA algorithm learns a policy that implements both sensing and channel access, and achieves significantly better performance compared to existing approaches.
KW - Cognitive radio networks
KW - deep reinforcement learning
KW - dynamic spectrum access
KW - wireless channels
UR - http://www.scopus.com/inward/record.url?scp=85136257453&partnerID=8YFLogxK
U2 - 10.1109/ISIT50566.2022.9834756
DO - 10.1109/ISIT50566.2022.9834756
M3 - Conference contribution
AN - SCOPUS:85136257453
T3 - IEEE International Symposium on Information Theory - Proceedings
SP - 2684
EP - 2689
BT - 2022 IEEE International Symposium on Information Theory, ISIT 2022
PB - Institute of Electrical and Electronics Engineers
T2 - 2022 IEEE International Symposium on Information Theory, ISIT 2022
Y2 - 26 June 2022 through 1 July 2022
ER -