AoI Optimal Trajectory Planning for Cooperative UAVs: A Multi-Agent Deep Reinforcement Learning Approach

2022 IEEE 5th International Conference on Electronic Information and Communication Technology (ICEICT)(2022)

引用 0|浏览2
暂无评分
摘要
This paper considers a multiple Unmanned Aerial Vehicles (UAVs)-assisted IoT network, where the UAVs cooperatively collect data packets generated by the IoT devices (IDs) and transmit them to the Base Station (BS) continuously to improve the information freshness, in terms of the age of information (AoI). Particularly, we first formulate multi- UAV distributed cooperative dynamic trajectory planning problem as a decentralized partially observable Markov decision process (Dec-POMDP), where the update arrivals at IDs are stochastic and are unknown to the UAVs. Furthermore, in order to address the challenges arising from unknown environmental dynamics and conflict collision constraints, we devise a multi-agent deep rein-forcement learning (MADRL) based dynamic trajectory planning algorithm. The algorithm leverages the advantages of both the QMIX and Gated Recurrent Unit (GRU) techniques. Finally, simulation results are presented to demonstrate the effectiveness of our proposed algorithm.
更多
查看译文
关键词
Age of Information,Dec-POMDP,multi-agent deep reinforcement learning,UAV,trajectory plan
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要