Intelligent routing strategy in the Internet of things based on deep reinforcement learning

Ruijin DING; Feifei GAO; Ling XING

doi:10.11959/j.issn.2096-3750.2019.00097

您当前的位置：

首页 >

文章列表页 >

Intelligent routing strategy in the Internet of things based on deep reinforcement learning

Theory and Technology | 更新时间：2024-06-05

- Intelligent routing strategy in the Internet of things based on deep reinforcement learning
- Chinese Journal on Internet of Things Vol. 3, Issue 2, Pages: 56-63(2019)
- 作者机构：
  
  1. 清华大学自动化系，北京 100084
  2. 河南科技大学，河南洛阳 471023
- 作者简介：
- 基金信息：
- DOI：10.11959/j.issn.2096-3750.2019.00097
  CLC： TN915
- Published：30 June 2019，
  
  Published Online：2019-06，
- 稿件说明：
移动端阅览
RUIJIN DING, FEIFEI GAO, LING XING. Intelligent routing strategy in the Internet of things based on deep reinforcement learning. [J]. Chinese journal on internet of things, 2019, 3(2): 56-63.
DOI：

RUIJIN DING, FEIFEI GAO, LING XING. Intelligent routing strategy in the Internet of things based on deep reinforcement learning. [J]. Chinese journal on internet of things, 2019, 3(2): 56-63. DOI： 10.11959/j.issn.2096-3750.2019.00097.

摘要

随着物联网时代的到来，万物互联的传输模式引发数据量爆炸式增长，给传统路由协议带来了严峻挑战。阐述了在数据量急剧增长的情况下，已有路由协议的局限性，并将路由选择问题重新建模为马尔可夫决策过程。在此基础上，采用深度强化学习方法为每项数据传输任务选择下一跳路由器，从而在避免数据堵塞的前提下尽可能缩短数据传输路径长度。仿真结果表明，所提方法能够显著降低数据堵塞概率，增大网络吞吐量。

Abstract

At the era of the Internet of things

networking mode that connects everything would bring tremendous increase in the data volume and challenge the traditional routing protocols.The limitations of the existing routing protocols was analyzed when facing the data explosion and then the routing selection problem was re-modeled as a Markov decision process.On this basis

the deep reinforcement learning technique was utilized to choose the next-hop router for data transmission task in order to shorten the transmission path length while network congestion was avoided.The simulation results demonstrate that the congestion probability can be reduced significantly and the network throughput can be enhanced by the proposed strategy.

关键词

深度强化学习路由物联网网络堵塞

Keywords

deep reinforcement learningroutingInternet of thingsnetwork congestion

references

孙其博, 刘杰, 黎羴 ,等. 物联网:概念、架构与关键技术研究综述[J]. 北京:北京邮电大学学报, 2010,33(3): 1-9.

SUN Q B, LIU J, LI S ,et al. Internet of things:summarize on concepts,architecture and key technology problem[J]. Beijing:Journal of Beijing University of Posts and Telecommunications, 2010,33(3): 1-9.

LIU V, PARKS A, TALLA V ,et al. Ambient backscatter:wireless communication out of thin air[C]// ACM SIGCOMM Computer Communication Review. ACM, 2013，43(4): 39-50.

QIAN J, GAO F, WANG G ,et al. Noncoherent detections for ambient backscatter system[J]. IEEE Transactions on Wireless Communications, 2017,16(3): 1412-1422.

NORDRUM A . The Internet of fewer things[J]. IEEE Spectrum, 2016,53(10): 12-13.

QIAN J, PARKS A N, SMITH J R ,et al. IoT communications with M-PSK modulated ambient backscatter:algorithm,analysis and implementation[J]. IEEE Internet of Things Journal, 2019,6(1): 844-855.

FORTZ B, THORUP M . Internet traffic engineering by optimizing OSPF weights[J]. IEEE INFOCOM, 2000,2(3): 519-528.

HEDRICK C L . Routing information protocol[R]. 1988.

FORTZ B, THORUP M . Optimizing OSPF/IS-IS weights in a changing world[J]. IEEE Journal on Selected Areas in Communications, 2002,20(4): 756-767.

GRIFFIN T G, SHEPHERD F B, WILFONG G . The stable paths problem and interdomain routing[J]. IEEE/ACM Transactions on Networking (ToN), 2002,10(2): 232-243.

孙志军, 薛磊, 许阳明 ,等. 深度学习研究综述[J]. 计算机应用研究, 2012,29(8): 2806-2810.

SUN Z J, XUE L, XU Y M ,et al. Overview of deep learning[J]. Application Research of Computers, 2012,29(8): 2806-2810.

KATO N, FADLULLAH Z M, MAO B ,et al. The deep learning vision for heterogeneous network traffic control:proposal,challenges and future perspective[J]. IEEE Wireless Communications, 2017,24(3): 146-153.

TANG F, MAO B, FADLULLAH Z M ,et al. On removing routing protocol from future wireless networks:a real-time deep learning approach for intelligent traffic control[J]. IEEE Wireless Communications, 2018,25(1): 154-160.

高阳, 陈世福, 陆鑫 . 强化学习研究综述[J]. 自动化学报, 2004,30(1): 86-100.

GAO Y, CHEN S F, LU X . Research on reinforcement learning technology:a review[J]. ACTA Automatica Sinica, 2004,30(1): 86-100.

MNIH V, KAVUKCUOGLU K, SILVER D ,et al. Human-level control through deep reinforcement learning[J]. Nature, 2015,518(7540):529.

BOYAN J A, LITTMAN M L . Packet routing in dynamically changing networks:a reinforcement learning approach[C]// Advances in Neural Information Processing Systems. Morgan Kaufmann Publishers Inc, 1994: 671-678.

Views

1038

下载量

CSCD

Alert me when the article has been cited

提交

Tools

Publicity Resources

Collaborative offloading computing scheme based on energy harvesting technology

Reinforcement learning-based channel access mechanism for multi-base station slotted Aloha with cooperative reception

Multi-data fusionaided indoor localization based on continuous action space deep reinforcement learning

An algorithm for joint optimization of dynamic routing and scheduling in time-sensitive networking

A survey of federated learning for 6G networks

Related Author

WANG Jun

ZHAO Haodong

HUANG Yuankang

ZHAN Wen

SUN Xinghua

Xuechen CHEN

Jiaxuan YI

Aixiang WANG

Related Institution

Nanjing University of Posts and Telecommunications

Sun Yat-sen University

Sun Yat-sen University Shenzhen Campus

School of Computer Science and Engineering, Central South University

School of Electronic Information, Central South University

⁰