Currently, I am a second-year PhD student at Nanyang Technological University, working under the supervision of Dacheng Tao.
Previously, I completed my master's studies and began my research journey at
Tsinghua University. Earlier, I obtained my bachelor's degree at
Hunan University, where I cherished four wonderful years at the foothills of Yuelu Mountain.
My research goal is to unlock the true potential of Deep Reinforcement Learning toward practical real-world deployment. This requires RL agents capable of offline-online cooperation, handling multiple tasks, and never stopping learning in open-ended environments. To realize this vision, I currently focus on investigating the fundamental challenges inherent in DRL, particularly optimization pathologies, scalability limitations and exploration inefficiencies.
Meanwhile, I view RL as a paradigm for understanding and developing intelligence rather than just a technique. This inspires my interest in exploring its potential across domains, from large reasoning models and embodied agents to psychology and social sciences.
Greatness cannot be planned, so I play with joyful explorations!
ICML 2025
ICLR 2024
NeurIPS 2023
IJCV