Skypilot

Abstract

Autonomous aerial vehicles (AAVs) have played a pivotal role in coverage operations and search missions. Recent advances in large language models (LLMs) offer promising opportunities to augment AAV intelligence. These advances help address complex challenges like area coverage optimization, dynamic path planning, and adaptive decision-making. However, the absence of physical grounding in LLMs leads to hallucination and reproducibility problems in spatial reasoning and decision-making. To tackle these issues, we present Skypilot, an LLM-enhanced two-stage framework that grounds language models in physical reality by integrating monte carlo tree search (MCTS). In the first stage, we introduce a diversified action space that encompasses generate, regenerate, fine-tune, and evaluate operations, coupled with physics-informed reward functions to ensure trajectory feasibility. In the second stage, we fine-tune Qwen3-4B on 23,000 MCTS-generated samples, achieving substantial inference acceleration while maintaining solution quality. Extensive numerical simulations and real-world flight experiments validate the efficiency and superiority of our proposed approach.

Keywords: Coverage search, LLM, Monte carlo tree search, Physical grounding.

💻 GitHub Code 🎬 Demo Video

Skypilot: Fine-Tuning LLM with Physical Grounding for AAV Coverage Search

Abstract

Method

Pipeline Overview

Radar-Based Localization

Monte Carlo Tree Search

Full Parameter Fine-tuning

Deployment

Results

Flight Video

Code