基于强化学习的无人机集群航迹规划研究综述

    Trajectory planning for UAV swarm based on reinforcement rearning: a review

    • 摘要: 强化学习通过与环境持续交互更新策略,为缺乏精确模型的无人机航迹规划提供了新的求解思路。目前尚缺乏针对强化学习在无人机集群航迹规划场景下应用的系统综述。因此,文章对强化学习驱动的无人机集群航迹规划研究进行系统回顾。文章先阐明集群航迹规划的问题定义与核心挑战,再梳理强化学习的基本原理并分类讨论其在航迹规划中的典型应用,最后归纳现有成果与不足,展望后续发展方向。文章可为后续理论研究与工程应用提供可借鉴思路。

       

      Abstract: Reinforcement learning (RL) refines policies through continuous interaction with the environment, offering a model-free paradigm for unmanned aerial vehicle (UAV) trajectory generation. A systematic review devoted to RL-based cooperative path planning of multi-UAV systems is, however, still missing. To close this gap, we present a comprehensive survey of the relevant literature. First, we formalize the problem statement and identify the key challenges inherent in large-scale swarm trajectory optimization. Next, we summarize the fundamentals of RL and taxonomize its representative algorithms with emphasis on their specific adaptations to UAV path planning. Finally, we synthesize the achieved advances and remaining limitations, and outline prospective research directions. The delivered insights are intended to serve as a reference for future theoretical investigations and practical implementations.