RUIJIN DING, FEIFEI GAO, LING XING. Intelligent routing strategy in the Internet of things based on deep reinforcement learning. [J]. Chinese journal on internet of things, 2019, 3(2): 56-63.
DOI:
RUIJIN DING, FEIFEI GAO, LING XING. Intelligent routing strategy in the Internet of things based on deep reinforcement learning. [J]. Chinese journal on internet of things, 2019, 3(2): 56-63. DOI: 10.11959/j.issn.2096-3750.2019.00097.
Intelligent routing strategy in the Internet of things based on deep reinforcement learning
networking mode that connects everything would bring tremendous increase in the data volume and challenge the traditional routing protocols.The limitations of the existing routing protocols was analyzed when facing the data explosion and then the routing selection problem was re-modeled as a Markov decision process.On this basis
the deep reinforcement learning technique was utilized to choose the next-hop router for data transmission task in order to shorten the transmission path length while network congestion was avoided.The simulation results demonstrate that the congestion probability can be reduced significantly and the network throughput can be enhanced by the proposed strategy.
关键词
深度强化学习路由物联网网络堵塞
Keywords
deep reinforcement learningroutingInternet of thingsnetwork congestion
SUN Q B, LIU J, LI S ,et al. Internet of things:summarize on concepts,architecture and key technology problem[J]. Beijing:Journal of Beijing University of Posts and Telecommunications, 2010,33(3): 1-9.
LIU V, PARKS A, TALLA V ,et al. Ambient backscatter:wireless communication out of thin air[C]// ACM SIGCOMM Computer Communication Review. ACM, 2013,43(4): 39-50.
QIAN J, GAO F, WANG G ,et al. Noncoherent detections for ambient backscatter system[J]. IEEE Transactions on Wireless Communications, 2017,16(3): 1412-1422.
NORDRUM A . The Internet of fewer things[J]. IEEE Spectrum, 2016,53(10): 12-13.
QIAN J, PARKS A N, SMITH J R ,et al. IoT communications with M-PSK modulated ambient backscatter:algorithm,analysis and implementation[J]. IEEE Internet of Things Journal, 2019,6(1): 844-855.
FORTZ B, THORUP M . Internet traffic engineering by optimizing OSPF weights[J]. IEEE INFOCOM, 2000,2(3): 519-528.
HEDRICK C L . Routing information protocol[R]. 1988.
FORTZ B, THORUP M . Optimizing OSPF/IS-IS weights in a changing world[J]. IEEE Journal on Selected Areas in Communications, 2002,20(4): 756-767.
GRIFFIN T G, SHEPHERD F B, WILFONG G . The stable paths problem and interdomain routing[J]. IEEE/ACM Transactions on Networking (ToN), 2002,10(2): 232-243.
SUN Z J, XUE L, XU Y M ,et al. Overview of deep learning[J]. Application Research of Computers, 2012,29(8): 2806-2810.
KATO N, FADLULLAH Z M, MAO B ,et al. The deep learning vision for heterogeneous network traffic control:proposal,challenges and future perspective[J]. IEEE Wireless Communications, 2017,24(3): 146-153.
TANG F, MAO B, FADLULLAH Z M ,et al. On removing routing protocol from future wireless networks:a real-time deep learning approach for intelligent traffic control[J]. IEEE Wireless Communications, 2018,25(1): 154-160.
GAO Y, CHEN S F, LU X . Research on reinforcement learning technology:a review[J]. ACTA Automatica Sinica, 2004,30(1): 86-100.
MNIH V, KAVUKCUOGLU K, SILVER D ,et al. Human-level control through deep reinforcement learning[J]. Nature, 2015,518(7540):529.
BOYAN J A, LITTMAN M L . Packet routing in dynamically changing networks:a reinforcement learning approach[C]// Advances in Neural Information Processing Systems. Morgan Kaufmann Publishers Inc, 1994: 671-678.