Xiong-Hui Chen

P.hD. Student, LAMDA Group
Department of Computer Science and Technology
National Key Laboratory for Novel Software Technology Nanjing University
Supervisor: Prof. Yang Yu
Email: xiong-hui.chenn [at] outlook or chenxh [at] lamda.nju.edu.cn

[ Google scholar ] [ DBLP ] [ Research gate ] [ Github ] [ Twitter ] [ Zhihu ] [ LinkedIn ]

Currently, I am a third-year PhD student of School of Artificial Intelligence in Nanjing University under the supervision of Prof. Yang Yu and also a member of LAMDA Group, which is led by Prof. Zhi-Hua Zhou. Before my PhD research, I received my B.Sc. degree in Department of Software Engineering in 2018 from Southeast University . In September 2018, I was admitted to study for a M.Sc. degree in Nanjing University under the supervision of Prof. Yang Yu without entrance examination. From September 2020, I started studying for PhD degree under the supervision of Prof. Yang Yu.

Research interest: I am interested in handling challenges of Reinforcement Learning (RL) in real-world applications. In particular, I focus my research topics on sim2real transfer, offline RL, causal inference for RL, and real-world environment reconstruction. I am also working to develop RL policies for autonomous driving, recommender systems, robotics and industrial control systems. Currently, my research topic also focuses on large language models for decision-making and train large decision-making models.

OPEN to job/research/startup opportunities. Please feel free to contact me if you would like to receive my detailed CV.

News

Jan 17, 2024 Two Papers are accepted by ICLR 2023! [ Language Model Self-improvement, Policy-rehearsing]
Sep 24, 2023 Our two papers are accepted by NeurIPS 2023, where Adversarial Counterfactual Environment Model Learning is selected as spotlight! [ Adversarial Counterfactual Environment Model, Natural Language Instruction-following ]
Sep 1, 2023 Our paper Offline Model-Based Adaptable Policy Learning for Decision-Making in Out-of-Support Regions are accepted by TPAMI! [link]
Jun 22, 2023 Paper Object-Oriented Option Framework for Robotics Manipulation in Clutter is accepted by IROS 2023!
Apr 13, 2023 Our Scientific Research Cooperation with Meituan in 2021-2022 is awarded with ``Best Cooperation Award’’ (PI: Prof. Yang Yu
). [ Link ]
Feb 14, 2023 Paper Sim2Rec: A Simulator-based Decision-Making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems is accepted by ICDE 2023! [ Link ]

Selected publications

* indicates equal contribution
  1. ArXiv
    Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments | [ Link ]
    Xiong-Hui Chen , Junyin Ye, Hang Zhao, Yi-Chen Li, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Anqi Huang, Kai Xu, Zongzhang Zhang, and Yang Yu.
    In ArXiv. 2023.
  1. NeurIPS
    Adversarial Counterfactual Environment Model Learning | [ Link Code ] (Spotlight, rate
    Xiong-Hui Chen , Yang Yu, Zheng-Mao Zhu, Zhihua Yu, Zhenjun Chen, Chenghe Wang, Yinan Wu, Hongqiu Wu, Rong-Jun Qin, Ruijin Ding, and Fangsheng Huang.
    In Advances in Neural Information Processing Systems 37. 2023.
  2. NeurIPS
    Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning | [ Link Code ]
    Xiong-Hui Chen , Shengyi Jiang, Feng Xu, Zongzhang Zhang, and Yang Yu.
    In Advances in Neural Information Processing Systems 34. 2021.
  3. NeurIPS
    Offline Model-based Adaptable Policy Learning | [ Link Code ]
    Xiong-Hui Chen , Yang Yu, Qingyang Li, Fan-Ming Luo, Zhiwei (Tony) Qin, Wenjie Shang, and Jieping Ye.
    In Advances in Neural Information Processing Systems 34. 2021.


Correspondence


Laboratory: Computer Science Building, Xianlin Campus of Nanjing University

Address: Xiong-Hui Chen, National Key Laboratory for Novel Software Technology, Nanjing University, Xianlin Campus Mailbox 603, 163 Xianlin Avenue, Qixia District, Nanjing 210023, China.
南京市栖霞区仙林大道163号, 南京大学仙林校区603信箱, 软件新技术国家重点实验室, 210023.