

浏览全部资源
扫码关注微信
1. 电子科技大学信息与通信工程学院,四川 成都 611731
2. 电子科技大学(深圳)高等研究院,广东 深圳 518110
Online First:2023-03,
Published:30 March 2023
移动端阅览
Zhihong WANG, Supeng LENG, Kai XIONG. Multi-agent resource allocation strategy for UAV swarm-based cooperative sensing[J]. Chinese Journal on Internet of Things, 2023, 7(1): 18-26.
Zhihong WANG, Supeng LENG, Kai XIONG. Multi-agent resource allocation strategy for UAV swarm-based cooperative sensing[J]. Chinese Journal on Internet of Things, 2023, 7(1): 18-26. DOI: 10.11959/j.issn.2096-3750.2023.00326.
在智能物联网技术发展的推动下,无人机集群已广泛用于应急、救援等场景的感知监测。无人机在任务区域自动感知发现任务目标,邻近无人机组成协作感知与协作计算任务群组,协同完成数据的感知、采集和处理。然而,重复的感知资源以及多任务间的计算资源供需不平衡,会造成额外的计算与通信开销,增大端到端处理时延。为了应对这一挑战,提出了一种结合仿生学和多智能体独立强化学习的多任务资源分配策略,基于局部的任务信息进行资源协同分配决策。该方法用任务情景信息浓度表示各个任务的资源需求,并通过情景信息在各任务群组间的扩散,动态更新各任务异构资源需求。同时,结合独立多智能体强化学习方法进行智能决策,以对各任务异构资源进行智能协同分配。仿真结果表明,所提方案不仅能够有效缩短任务执行时间,还可显著提高计算资源利用率。
Driven by the development of intelligent internet of things (IoT) technology
unmanned aerial vehicle (UAV) swarms have been widely used for sensing and monitoring in emergency and rescue scenarios.The UAVs automatically sense and discover mission targets in the mission area
recruiting neighboring UAVs to form perception and computation task groups to collaboratively complete the perception
acquisition and processing of data.However
repetitive sensory data and imbalance in the supply and demand of computational resources between multiple tasks cause additional computational and communication overheads and increase the end-to-end processing latency.To address this challenge
a multi-task resource allocation approach combining bionics and multi-agent independent reinforcement learning was proposed
making collaborative resource allocation decisions based on local task information.The method represents the resource requirements of individual tasks as situational information concentrations and dynamically updates the heterogeneous resource requirements of each task by spreading the situational information across task groups.At the same time
it combines multi-agent independent reinforcement learning methods for intelligent decision making in order to collaboratively allocate the heterogeneous resources of each task.Simulation results show that this solution can not only effectively reduce the task execution time
but also significantly improve the computational resource utilization.
ERDELJ M , NATALIZIO E , CHOWDHURY K R , et al . Help from the sky:leveraging UAVs for disaster management [J ] . IEEE Pervasive Computing , 2017 , 16 ( 1 ): 24 - 32 .
GUPTA L , JAIN R , VASZKUN G . Survey of important issues in UAV communication networks [J ] . IEEE Communications Surveys & Tutorials , 2016 , 18 ( 2 ): 1123 - 1152 .
MA H J , LIU Y L , REN Y H , et al . Improved CNN classification method for groups of buildings damaged by earthquake,based on high resolution remote sensing images [J ] . Remote Sensing , 2020 , 12 ( 2 ): 260 .
LI T Y , LENG S P , WAWG Z H , et al . Intelligent resource allocation schemes for UAV-swarm-based cooperative sensing [J ] . IEEE Internet of Things Journal , 2022 , 9 ( 21 ): 21570 - 21582 .
MEI H B , YANG K , LIU Q , et al . Joint trajectory-resource optimization in UAV-enabled edge-cloud system with virtualized mobile clone [J ] . IEEE Internet of Things Journal , 2020 , 7 ( 7 ): 5906 - 5921 .
SEID A M , BOATENG G O , ANOKYE S , et al . Collaborative computation offloading and resource allocation in multi-UAV-assisted IoT networks:a deep reinforcement learning approach [J ] . IEEE Internet of Things Journal , 2021 , 8 ( 15 ): 12203 - 12218 .
CHOI H L , BRUNET L , HOW J P . Consensus-based decentralized auctions for robust task allocation [J ] . IEEE Transactions on Robotics , 2009 , 25 ( 4 ): 912 - 926 .
FU X W , FENG P , GAO X G . Swarm UAVs task and resource dynamic assignment algorithm based on task sequence mechanism [J ] . IEEE Access , 2019 ( 7 ): 41090 - 41100 .
BAKOLAS E , LEE Y . Decentralized game theoretic control for dynamic task allocation problems for multi-agent systems [C ] // Proceedings of 2021 American Control Conference (ACC) . Piscataway:IEEE Press , 2021 : 3228 - 3233 .
ZHANG C Y , LI Q Y , ZHU Y Y , et al . Dynamics of task allocation based on game theory in multi-agent systems [J ] . IEEE Transactions on Circuits and Systems II:Express Briefs , 2019 , 66 ( 6 ): 1068 - 1072 .
BAKSHI S , FENG T H , YAN Z Y , et al . A regularized quadratic programming approach to real-time scheduling of autonomous mobile robots in a prioritized task space [C ] // Proceedings of 2019 American Control Conference (ACC) . Piscataway:IEEE Press , 2019 : 1361 - 1366 .
ZHANG S H , ZHANG H L , DI B Y , et al . Cellular cooperative unmanned aerial vehicle networks with sense-and-send protocol [J ] . IEEE Internet of Things Journal , 2019 , 6 ( 2 ): 1754 - 1767 .
HOSSAIN A , CHAKRABARTI S , BISWAS P K . Impact of sensing model on wireless sensor network coverage [J ] . IET Wireless Sensor Systems , 2012 , 2 ( 3 ): 272 .
CHAKRABORTY A , ROUT R R , CHAKRABARTI A , et al . On network lifetime expectancy with realistic sensing and traffic generation model in wireless sensor networks [J ] . IEEE Sensors Journal , 2013 , 13 ( 7 ): 2771 - 2779 .
SHAKHOV V V , KOO I . Experiment design for parameter estimation in probabilistic sensing models [J ] . IEEE Sensors Journal , 2017 , 17 ( 24 ): 8431 - 8437 .
CHARLES R Q,HAOS , MOK C , et al . PointNet:deep learning on point sets for 3D classification and segmentation [C ] // Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2017 : 77 - 85 .
QI C R , YI L , SU H , et al . PointNet++:deep hierarchical feature learning on point sets in a metric space [C ] // Proceedings of the 31st International Conference on Neural Information Processing Systems . New York:ACM Press , 2017 : 5105 - 5114 .
ZHAO H T , WANG H J , WU W Y , et al . Deployment algorithms for UAV airborne networks toward on-demand coverage [J ] . IEEE Journal on Selected Areas in Communications , 2018 , 36 ( 9 ): 2015 - 2031 .
WANG B W , SUN Y J , DO-DUY T , , et al . Adaptive D-hop connected dominating set in highly dynamic flying ad-hoc networks [J ] . IEEE Transactions on Network Science and Engineering , 2021 , 8 ( 3 ): 2651 - 2664 .
VAW DYKE PARUNAK H , BRUECKNER S A , SAUTER J . Digital pheromones for coordination of unmanned vehicles [C ] // Proceedings of the 1st International Conference on Environments for Multi-Agent Systems . New York:ACM Press , 2004 : 246 - 263 .
XU X , LI R P , ZHAO Z F , et al . Stigmergic independent reinforcement learning for multiagent collaboration [J ] . IEEE Transactions on Neural Networks and Learning Systems , 2022 , 33 ( 9 ): 4285 - 4299 .
HOLLAND O E , . Multiagent systems:lessons from social insects and collective robotics [C ] // Adaptation,Coevolution,and Learning in Multiagent Systems:Papers from the 1996 AAAI Spring Symposium .[S.l.:s.n. ] , 1996 : 57 - 62 .
DORIGO M , BONABEAU E , THERAULAZ G . Ant algorithms and stigmergy [J ] . Future Generation Computer Systems , 2000 , 16 ( 8 ): 851 - 871 .
DI CARO G . AntNet:distributed stigmergetic control for communications networks [J ] . Journal of Artificial Intelligence Research , 1998 , 9 : 317 - 365 .
ZHANG C , JIN S , XUE W , et al . Independent reinforcement learning for weakly cooperative multiagent traffic control problem [J ] . IEEE Transcations on Vehicular Technology , 2021 , 70 ( 8 ): 7426 - 7436 .
LANCTOT M , ZAMBALDI V , GRUSLYS A , et al . A unified game-theoretic approach to multiagent reinforcement learning [C ] // Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS) . New York:ACM Press , 2017 : 4193 - 4206 .
MATIGNON L , LAUREWT G J , LE FORT-PIAT N . Independent reinforcement learners in cooperative Markov games:a survey regarding coordination problems [J ] . The Knowledge Engineering Review , 2012 , 27 ( 1 ): 1 - 31 .
ARULKUM ARAN K , DEISEWRC TH M P , BRUNDAGI M , et al . Deep reinforcement learning:a brief survey [J ] . IEEE Signal Processing Magazine , 2017 , 34 ( 6 ): 26 - 38 .
ZHANG T K , LEI J Y , LIU Y W , et al . Trajectory optimization for UAV emergency communication with limited user equipment energy:a safe-DQN approach [J ] . IEEE Transactions on Green Communications and Networking , 2021 , 5 ( 3 ): 1236 - 1247 .
FOERSTER J , NARDELLI N , FARQUHAR G , et al . Stabilising experience replay for deep multi-agent reinforcement learning [C ] // Proceedings of the 34th International Conference on Machine Learning . New York:ACM Press , 2017 : 1146 - 1155 .
0
Views
668
下载量
0
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621