Hello! I’m Guanwen Xie, a first-year graduate student at Tsinghua University, supervised by Prof. Daoyi Chen and Prof. Yong Ren. Before this, I obtained a B. Eng. in Ocean Technology from Zhejiang University, supervised by Prof. Dongfang Ma.

Since my undergraduate studies, I have been passionate about underwater robotics. My interest lies in combining ocean technology with the latest advances in reinforcement learning and machine learning, such as learning from demonstrations (LfD), experience mining, and large language models (LLMs). I continuously explore new possibilities and strive to make research outcomes as universal and user-friendly as possible for researchers.

Feel free to email me at gwxie360@outlook.com!

🔥 News

  • 2024.09: 🎉🎉 I have rebuilt my homepage to make it more compact and easier to read.
  • 2024.09: A POV video recording my first long-distance drive, from Beihai to Shenzhen, has been uploaded to Bilibili.
  • 2024.08: I started a new stage of my studies at Tsinghua University.

📝 Publications

( * denotes equal contribution )


Arxiv

LLMs as Efficient Reward Function Searchers for Custom-Environment MORL

Guanwen Xie, Jingzehua Xu, Yiyuan Yang, Yimian Ding, Shuai Zhang

[Website&Code] [Arxiv] [BibTeX]

TLDR: An efficient reward function searcher using LLMs (ERFSL) is achieved by decomposing multi-objective tasks to provide clear textual task feedback, utilizing LLM’s strong semantic understanding capabilities, and incorporating versatile search strategies.

IEEE Transactions on Mobile Computing 2024

Is FISHER All You Need in The Multi-AUV Underwater Target Tracking Task?

Guanwen Xie*, Jingzehua Xu*, Ziqi Zhang, Xiangwang Hou, Dongfang Ma, Shuai Zhang, Yong Ren, Dusit Niyato

[PDF] [BibTeX]

TLDR: A sim2sim expert trajectory transformation process facilitates the generation of demonstrations. Based on this, a two-stage framework called FISHER, consisting of MADAC imitation learning (IL) and MAIGDT offline reinforcement learning (ORL), is employed to achieve highly generalizable and applicable multi-AUV target tracking policies without designing reward functions.

IEEE Internet of Things Journal 2024

HPTVSim: A Simulator for Unmanned Underwater Vehicles Dedicated in the Underwater Pursuit-Evasion Game

Jingzehua Xu*, Guanwen Xie*, Zekai Zhang, Xiangwang Hou, Shuai Zhang, Yong Ren, Dusit Niyato

[PDF] [BibTeX]

TLDR: A highly customizable, high-precision physics simulation platform called HPTVSim has been developed based on ROS and Gazebo. Combined with an RL compatibility layer based on the RoboEnv class, it can be easily applied to a variety of RL tasks. Based on this platform, an efficient training framework combining scenario-transfer RL training and offline reinforcement learning is used for the underwater pursuit-evasion game (UPEG).

Arxiv

USV-AUV Collaboration Framework for Underwater Tasks under Extreme Sea Conditions

Jingzehua Xu*, Guanwen Xie*, Xinqi Wang, Yimian Ding, Shuai Zhang

[Code] [Arxiv] [BibTeX]

TLDR: The USV-AUV collaborative framework enhances the accuracy of AUV positioning in extreme sea conditions. The USV and AUV cooperatively execute the data collection task for the Internet of Underwater Things (IoUT).

  • Environment and Energy-Aware AUV-Assisted Data Collection for the Internet of Underwater Things, Zekai Zhang*, Jingzehua Xu*, Guanwen Xie, Jingjing Wang, Zhu Han and Yong Ren. IEEE Transactions on Mobile Computing 2024 [Online Publication] [PDF] [BibTeX]

  • FISHER: An Efficient Sim2sim Training Framework Dedicated in Multi-AUV Target Tracking via Learning from Demonstrations, Guanwen Xie*, Xinqi Wang*, Yimian Ding, Jingzehua Xu, Dongfang Ma, Jingjing Wang and Yong Ren. ICONIP 2024 [PDF] [BibTeX]
  • IMTVSim: An Integrated Modular Training and Verification Simulator for Unmanned Underwater Vehicles, Jingzehua Xu*, Guanwen Xie*, Zekai Zhang, Tianxiang Xing, Jingjing Wang and Yong Ren. OCEANS 2024 Halifax [PDF] [BibTeX]
  • Advanced Framework for Underwater Node Repair via Multi-AUV Based on Multi-Agent Offline Reinforcement Learning, Yimian Ding*, Jingzehua Xu*, Guanwen Xie, Gang Li, Jingjing Wang and Yong Ren. OCEANS 2024 Halifax [PDF] [BibTeX]

  • Enhancing Information Freshness: An AoI Optimized Markov Decision Process Dedicated in The Underwater Task, Yimian Ding*, Jingzehua Xu*, Yiyuan Yang, Guanwen Xie and Shuai Zhang. Arxiv [Code] [Arxiv] [BibTeX]
  • Multi-AUV Assisted Seamless Underwater Target Tracking Relying on Deep Learning and Reinforcement Learning, Jingzehua Xu*, Yimian Ding*, Zekai Zhang, Guanwen Xie, Ziyuan Wang, Yongming Zeng and Gang Li. IEEE WCCI 2024 [PDF] [BibTeX] [News Coverage(中文报道)]
  • Fisher-Information-Matrix-Based USBL Cooperative Location in USV–AUV Networks, Ziyuan Wang, Jingzehua Xu, Yuanzhe Feng, Yijing Wang, Guanwen Xie, Xiangwang Hou, Wei Men, and Yong Ren. Sensors 2023 [Online Publication] [PDF] [BibTeX]

🥇 Selected Honors and Awards

Scholarship

  • 2024 Outstanding Graduate (awarded to top undergraduates in Zhejiang University)
  • 2023 Runhe Scholarship (top 1% in ZJU)
  • 2022 Zhejiang Provincial Government Scholarship (top 3% in ZJU)

Experience

  • 2023 First Prize of “Aotuo Cup” National Underwater Robot Designing Competition (top 5% of all finalists)
  • 2022 First Prize of Zhejiang Provincial Student Physics Innovation Competition (Theory)

📖 Education & Skills

2024.09 - present, Tsinghua University, M. S. in Electronic Information

2020.09 - 2024.06, Zhejiang University, B. Eng. in Ocean Technology

  • GPA: 4.74 / 5.00 or score 92.4 / 100 (rank 1st / 143)
  • Relevant courses: Calculus (94), Linear Algebra (88), Probability Theory and Statistics (96), Partial Differential Equations (99), Fundamentals of Ocean Engineering Modeling (ML-related, 94), Software Development and Applications (95), Introduction to Computer Systems (93), Embedded Systems (96), Signals and Systems (97), Digital Signal Processing (98), Automatic Control Theory (95), Underwater Robot Design (96)

Programming 💻 & Debugging 🐛🔧: Python/PyTorch, C/C++, ROS, Linux (Arch Linux/Ubuntu…), LaTeX

Languages: English (CET-6: 600)

📊 Selected Experiences

2024.6 - 2024.12 New Jersey Institute of Technology, Department of Data Science, Research Assistant (Advisor - Prof. Shuai Zhang)

  • Efficient LLM reward searcher for MORL: An efficient reward function searcher using LLMs (ERFSL) is achieved by decomposing multi-objective tasks to provide clear textual task feedback, utilizing LLM’s strong semantic understanding capabilities, and incorporating versatile search strategies.

2023.7 - 2024.9 Zhejiang University & Tsinghua University, Undergraduate Dissertation (Advisors - Prof. Dongfang Ma (ZJU) & Prof. Yong Ren (THU))

  • AUV Target Tracking via Learning from Demonstration on a High-Precision Simulation Platform: A highly customizable, high-precision physics simulation platform called HPTVSim has been developed based on ROS and Gazebo. Combined with an RL compatibility layer based on the RoboEnv class, it can be easily applied to a variety of RL tasks. Then, a two-stage framework called FISHER, consisting of MADAC imitation learning (IL) and MAIGDT offline reinforcement learning (ORL), is employed to achieve highly generalizable and applicable multi-AUV target tracking policies without designing reward functions.

2023.2 - 2023.6 Zhejiang University, Ocean College, Undergraduate Researcher (Advisor - Prof. Yulin Si)

  • Underwater Robot Design: Developed a compact, energy-saving, and easy-to-control underwater robot based on Raspberry Pi and STM32, with navigation, obstacle avoidance, and letter and color recognition functions. A data augmentation procedure based on BAGAN is used to improve the accuracy of the recognition task. We participated in the “Aotuo Cup” national underwater robot designing competition and won first prize.

2022.10 - 2023.7 Zhejiang University, Intelligent Underwater Optical Laboratory, Undergraduate Researcher (Advisor - Prof. Hong Song)

  • Underwater Image Enhancement Based on Polarization Imaging: An enhancement procedure is proposed that maximizes the EME (measure of enhancement by entropy) of underwater polarized images, based on Stokes vector parameterization and a haze removal algorithm. I also participated in applying this algorithm to an underwater 3D reconstruction project and studied the basics of SLAM.

💻 Miscellaneous

I have a passion for computer-related knowledge and a wide range of interests. I enjoy video editing and used to be a video uploader on Bilibili, where I uploaded fun videos about computer knowledge and computer viruses. By 2019.6, I had gained 22k+ subscribers, ranking within the top 10,000 on Bilibili. During university, I accumulated two years of computer repair experience with the Electrical Volunteer Association (EVA) of Zhejiang University. I also love running: during the winter vacation of my final undergraduate year, I ran 10 kilometers daily. These experiences have taught me that no matter how difficult the situation, I have the courage to persevere!

This page was last updated at UTC 2024/09/28 02:39.