Ph.D. Student Profile

I am a third-year Ph.D. student at Eindhoven University of Technology, under the supervision of Mykola Pechenizkiy and Meng Fang. I am also very fortunate to work closely with Prof. Yali Du at King’s College London, and Prof. Biwei Huang at University of California San Diego. Prior to joining TU/e, I was a master student in Shandong University (SDU), supervised by Prof. Wei Zhang. I also obtained my bachelor's degree from Shandong University.

My current research interests lie in reinforcement learning (RL), especially causal RL, multi-agent RL, RL for LLMs.

News

May 2025, invited tutorial at OxML summer school.
Jan 2025, one paper accecpted by ICLR 2025.
Oct 2024, two papers accecpted by NeurIPS 2024 CRL workshop.
Oct 2024, invited talk at Women in AI & Robotics Reading Group.
Dec 2023, one papers accecpted by AAAI 2024.
Oct 2023, invited talk at RLChina.
Sep 2023, two papers accecpted by NeurIPS 2023.

Publication

*: Equal contribution, ✉: Corresponding author

[1] RuAG: Learned-rule-augmented Generation for Large Language Models. Yudi Zhang*, Pei Xiao*, Lu Wang, Chaoyun Zhang, Meng Fang, Yali Du, Yevgeniy Puzyrev, Randolph Yao, Si Qin, Qingwei Lin, Mykola Pechenizkiy, Dongmei Zhang, Saravan Rajmohan, Qi Zhang. The Thirteenth International Conference on Learning Representations (ICLR 2025).
[Paper]

[2] Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach. Yudi Zhang, Yali Du, Biwei Huang, Ziyan Wang, Jun Wang, Meng Fang, Mykola Pechenizkiy. Advances in Neural Information Processing Systems (NeurIPS), 2023.
[Project], [Paper]

[3] COOM: A Game Benchmark for Continual Reinforcement Learning. Tristan Tomilin, Meng Fang, Yudi Zhang, Mykola Pechenizkiy. Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track (NeurIPS D&B), 2023.
[Paper] [Code]

[4] Large Language Models are Neurosymbolic Reasoners. Meng Fang*, Shilong Deng*, Yudi Zhang*, Zijing Shi, Ling Chen, Mykola Pechenizkiy, Jun Wang. Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI), 2024.
[Paper]

[5] RSPT: Reconstruct Surroundings and Predict Trajectories for Generalizable Active Object Tracking. Fangwei Zhong*, Xiao Bi*, Yudi Zhang, Wei Zhang, Yizhou Wang. Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), 2023 (Oral).
[Project], [Paper]

Workshops & Preprint:

[6] CAST: A causality-inspired spatial-temporal return decomposition approach for multi-agent reinforcement learning. Yudi Zhang, Yali Du, Biwei Huang, Meng Fang, Mykola Pechenizkiy. In Proceedings of the NeurIPS 2024 Causal Representation Learning Workshop.
[Paper]

[7] MACCA: Offline multi-agent reinforcement learning with causal credit assignment. Ziyan Wang, Yali Du, Yudi Zhang, Meng Fang, Biwei Huang. In Proceedings of the NeurIPS 2024 Causal Representation Learning Workshop.
[Paper]

Internship

Microsoft Research Asia, mentored by Dr. Lu Wang, 2024.

Award

NeurIPS 2023 Travel Award
Outstanding Graduates of Shandong Province (2019)
Competitions: Second Prize in the Chinese Graduate Mathematical Modeling Competition (2019), First Prize in the National Electronic Design Competition, Shandong Province (2017), International Aquatic Robot Competition Champion (2018, 2019)
Scholarships: Shandong University first-class scholarship, Shandong University outstanding students special scholarship, etc.

Service

Journal Reviewer: Transactions on Machine Learning Research, IEEE Transactions on Artificial Intelligence.
Conference Reviewer: AAMAS 2024, ICML 2024, NeurIPS 2024, AAAI 2025, ICLR 2025, AISTATS 2025, ICML 2025.
Teaching Assistant: Generative AI in OxML 2024, 2IIG0 Data Mining and Machine Learning.
Supervised MSc students: Schipper Olivier, Beuningen Niels van, Dirk Michielsen.