 
        
        
Currently, I am a second-year PhD student at Nanyang Technological University, working under the supervision of Dacheng Tao.
Previously, I completed my master's studies and began my research journey at Tsinghua University. Earlier, I obtained my bachelor's degree at Hunan University, where I cherished four wonderful years at the foothills of Yuelu Mountain.
My research goal is to unlock the true potential of Deep Reinforcement Learning (DRL) for practical real-world deployment. This requires RL agents capable of offline-online cooperation, multi-task handling, and continual learning in open-ended environments. To realize this vision, I currently focus on the fundamental challenges inherent in DRL, particularly optimization pathologies, scalability limitations, and exploration inefficiencies.
Meanwhile, I view RL as a paradigm for understanding and developing intelligence rather than just a technique. This inspires my interest in exploring its potential across domains, from large reasoning models and embodied agents to psychology and the social sciences.
Greatness cannot be planned, so I play with joyful explorations!
 ICML 2025
         ICLR 2024
         NeurIPS 2023
         IJCV
         AI TIME, Recording (in Chinese)