Main View
The adaptive control and maneuvering capabilities of Autonomous Underwater Vehicles (AUVs) have drawn significant attention, but achieving precise maneuvering control of AUVs is challenging due to their highly nonlinear dynamics, time-varying hydrodynamics, strong six-degree-of-freedom coupling, and environmental uncertainties. The contributions of this paper mainly include three parts:
The following figure illustrates the overall design of our controller. To fully leverage the advantages of the LLM-enhanced RL based S-Surface controller, while achieving simulation and perception of extreme marine conditions to evaluate the disturbance rejection performance, we decompose the proposed framework into three core modules.
Figure👇: The overall framework of our proposed controller, which comprises of three modules: (A) RL-based S-Surface Controller Module. (B) LLM-Enhanced Iterative Joint Optimization Module. (C) Environment-Aware & Simulation Module.
Video👇: 2D and 3D visualization of the extreme sea condition.
Main View
Top View
Side View
We validate the effectiveness of our proposed controller utilizing a REMUS 100 AUV, and we introduce two high-level tasks:
LLMs achieve flexible adjustments for joint optimization based on the bottleneck analysis, and terminate optimization when reaching system control limitations.
Video👇: Parameters for yaw tracking controller and reward weights, along with 2D projections of AUV trajectories from the 3D data collection tasks during the LLM optimization phase.
Main View
Figure👇: Performance results of the S-surface controller in tracking reference signals taken from a target tracking task, during the LLM optimization phase.
Figure👇: Comparative results of three controllers tracking reference signals taken from a target tracking task under extreme sea(ES) and very extreme sea(VES) conditions. When transitioning to VES conditions, the controllers exhibit progressive performance deterioration, with the PID controller showing worse flexibility and stability compared to the S-Surface controller. Additionally, the PVS suffers a complete loss of depth regulation capability.
Table👇: Performance metrics of different control methods evaluated during the data collection task under ES and VES conditions. Under the extreme sea condition, the S-Surface controller achieves performance close to the ideal setting, and under the very extreme sea condition, the S-Surface controller exhibits significantly lower performance degradation compared to PID.
The following videos visualize the 3D data collection and target tracking tasks performed by two AUVs. In the former case, the AUV must judiciously control its direction to efficiently serve sensor nodes due to its restricted tuning capability, while the latter requires more real-time control capabilities.
Video👇: 3D visualizations of two AUVs performing the data collection task. The AUVs can plan optimal routes as much as possible, achieving performance close to ideal control conditions.
Video👇: 3D visualizations of two AUVs performing the target tracking task. The AUVs also demonstrate high maneuverability in response to target turns.
⚠ UNDER CONSTRUCTION! COMING SOON
@article{xie2025AUVRScontrol,
title={Never too Prim to Swim: An LLM-Enhanced RL-based Adaptive S-Surface Controller for AUVs under Extre Sea Conditions},
author={Xie, Guanwen and Xu, Jingzehua and Ding, Yimian and Zhang, Zhi and Zhang, Shuai and Li, Yi},
journal={arXiv preprint arXiv:2503.00527},
year={2025}
}