Hardware- and Vision-in-the-Loop Validation of Deep Monocular Pose Estimation for Autonomous Maritime UAV Flight
Maneesha Wickramasuriya, Beomyeol Yu, Jaden Shin, Mason Huslig, Taeyoung Lee, Murray Snyder
- Published
- Jun 17, 2026 — 15:18 UTC
Problem
The paper addresses the gap in reliable validation methods for vision-based relative pose estimation in autonomous UAV operations on maritime vessels. Current validation approaches are often costly, weather-dependent, and risky, particularly when conducted at sea. The authors propose a preprint framework that allows for indoor testing while simulating photorealistic maritime environments, thus providing a safer and more controlled setting for UAV autonomy development.
Method
The proposed framework integrates a deep transformer-based monocular pose estimator that processes rendered maritime views onboard the UAV. The system employs a delayed Kalman filter to fuse delayed vision measurements with high-rate inertial measurement unit (IMU) data, ensuring consistent state estimates for geometric control. The architecture captures critical embedded effects such as perception latency, asynchronous updates, and computational constraints, which are typically overlooked in pure simulation environments. The training compute specifics are not disclosed, but the focus is on real-time processing capabilities to facilitate autonomous flight tasks, including takeoff, trajectory tracking, and landing.
Results
The authors conducted a series of experiments demonstrating the system’s capabilities in closed-loop flight. The results indicate stable performance during autonomous operations, although specific quantitative metrics (e.g., accuracy, latency) against named baselines are not provided in the abstract. The framework’s effectiveness is underscored by its ability to handle the complexities of maritime environments, which are critical for real-world UAV deployment.
Limitations
The authors acknowledge that the framework is still in a preprint stage, indicating that further validation and peer review are necessary. Additionally, the reliance on indoor simulations may not fully capture the dynamic and unpredictable nature of outdoor maritime conditions. The paper does not discuss potential limitations related to the scalability of the approach or the computational overhead introduced by the delayed Kalman filter in real-time applications.
Why it matters
This work has significant implications for the development of autonomous UAV systems in maritime contexts, providing a safer and more effective validation method prior to real-world deployment. By addressing the challenges of pose estimation in complex environments, this framework can facilitate advancements in UAV autonomy, potentially influencing future research and applications in maritime operations. The findings are relevant for researchers and engineers working on UAV systems, as they highlight the importance of hardware-in-the-loop testing for reliable performance in real-world scenarios, as published in arXiv cs.AI.
By Callan Zhang · Jun 17, 2026 · Editorial standards →
Summarised from the primary source with AI assistance under human editorial oversight. Turing Wire is not a primary source — read the original for the authoritative account.
Source: arXiv cs.AI