LLM-based Visual Code Completion for Aerospace Geometric Design

Hau Kit Yong, Robert Marsh, Edmar A. Silva, András Sóbester, Stuart E. Middleton

Published: Jun 15, 2026 — 14:46 UTC

Problem
The aerospace industry lacks LLM-based geometric design copilot systems, primarily due to safety and explainability concerns. The paper addresses this gap with a copilot application tailored to aerospace engineering design tasks, built on recent advances in Large Language Models (LLMs) and Vision Language Models (VLMs). The work is a preprint and has not undergone peer review, so its findings should be interpreted with caution.

Method
The authors propose a visual programming copilot application utilizing a visual programming variant of the ReAct methodology in conjunction with GPT 5.4. The copilot is integrated into a new Grasshopper plugin library named Wingbuilder, which includes custom components specifically designed for aerospace geometry abstraction. Additionally, the authors introduce the Aerospace Visual Programming Dataset (AVPD), comprising 18 tasks of varying difficulty levels, each crafted by aerospace experts, along with their corresponding ground truth solutions. The evaluation involved a user trial with two experienced aerospace engineers from a major aircraft manufacturing company, focusing on the copilot’s effectiveness in generating design suggestions.
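For readers unfamiliar with ReAct, the loop below is a minimal sketch of the general pattern (alternating a model "thought", a tool action, and an observation) applied to a toy node graph. It is not the authors' implementation: the `Graph` class, the `add_node` and `connect` tools, the component names, and the scripted stand-in for the model are all hypothetical and do not reflect Wingbuilder's or Grasshopper's actual API.

```python
from dataclasses import dataclass, field


@dataclass
class Graph:
    """Toy stand-in for a visual program: components plus wires between them."""
    nodes: list = field(default_factory=list)
    wires: list = field(default_factory=list)


def add_node(graph, name):
    """Hypothetical tool: place a component on the canvas."""
    graph.nodes.append(name)
    return f"added node {name}"


def connect(graph, src, dst):
    """Hypothetical tool: wire one component's output into another."""
    if src not in graph.nodes or dst not in graph.nodes:
        return f"error: unknown node in {src} -> {dst}"
    graph.wires.append((src, dst))
    return f"connected {src} -> {dst}"


TOOLS = {"add_node": add_node, "connect": connect}


def scripted_model(history):
    """Assumption: replaces the real LLM call with a fixed plan for illustration."""
    plan = [
        {"thought": "Need an airfoil section first.", "action": "add_node", "args": ["Airfoil"]},
        {"thought": "A wing component consumes the section.", "action": "add_node", "args": ["Wing"]},
        {"thought": "Wire the section into the wing.", "action": "connect", "args": ["Airfoil", "Wing"]},
        {"thought": "The graph is complete.", "action": "finish", "args": []},
    ]
    # The number of observations so far tells us which step comes next.
    steps_taken = sum(1 for line in history if line.startswith("Observation:"))
    return plan[steps_taken]


def react_loop(model, graph, max_steps=8):
    """Alternate thought -> action -> observation until the model finishes."""
    history = []
    for _ in range(max_steps):
        step = model(history)
        history.append(f"Thought: {step['thought']}")
        if step["action"] == "finish":
            break
        observation = TOOLS[step["action"]](graph, *step["args"])
        history.append(f"Observation: {observation}")
    return history


graph = Graph()
trace = react_loop(scripted_model, graph)
print("\n".join(trace))
```

Each iteration in a real system would involve at least one model call, which is consistent with the inference-time concern the summary reports for longer tasks.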

Results
The user trial indicated that the copilot’s visual programming ReAct methodology successfully generated helpful suggestions, with participants expressing a willingness to use the tool in the future. However, the inference times for the ReAct methodology were noted as a limitation, particularly for complex tasks where the time spent waiting for suggestions was a critical factor. While specific quantitative metrics were not disclosed, the qualitative feedback from participants highlighted the tool’s potential utility in aerospace design workflows.

Limitations
The authors acknowledge the slow inference times of the ReAct methodology as a significant limitation, particularly for more complex tasks that require timely suggestions. The study's small sample size (only two engineers) also limits the generalizability of the findings. Because the work is a preprint, the results remain provisional until validated through peer review.

Why it matters
This work bears on the integration of AI tools into aerospace engineering, particularly the use of visual programming to support design workflows. The AVPD and the Wingbuilder plugin could facilitate further research and development in this domain, potentially leading to broader adoption of LLM-based systems in safety-critical industries. The findings also add to the ongoing discussion of how applicable LLMs are in specialized engineering fields.

By Callan Zhang · Jun 15, 2026

Summarised from the primary source with AI assistance under human editorial oversight. Turing Wire is not a primary source — read the original for the authoritative account.

Source: arXiv cs.CL