Abstract: In multi-UAV cooperative area coverage tasks, the effectiveness of path planning directly impacts coverage efficiency and mission completion time. Traditional reinforcement learning (RL) ...