Weipu Zhang (张维璞)

Ph.D. student since 2025, Beijing Institute of Technology,
National Key Lab of Autonomous Intelligent Unmanned System, Supervisor: Prof. Gang Wang 北京理工大学自动化学院，自主智能无人系统全国重点实验室，导师：王钢教授

Joint Ph.D. research track since 2026, Zhongguancun Academy (ZGCA)
AI & Game Research group, Supervisor: Prof. Jian Zhao 北京中关村学院，AI+游戏课题组，导师：赵鉴教授

Email: mail@weipuzhang.com

GitHub Google Scholar Bilibili

Research interests

My research interests center on building game agents that can learn, understand, and interact with games at a level of efficiency comparable to humans. To move toward this goal, I am particularly interested in reinforcement learning, computer vision, world models, and game generation, especially in settings that require strong generalization, long-horizon decision making, and efficient use of data and supervision.

Highlights

ICLR26: OC-STORM on Hollow Knight. Among the game-agent results I have worked on so far, this is the one that feels the most amazing to me: seeing an agent act coherently in a rich, difficult game world still feels remarkable.

Education

2026 - Present
Joint Ph.D. research track, Zhongguancun Academy
Supervisor: Prof. Jian Zhao
2025 - Present
Ph.D. student, Beijing Institute of Technology
Supervisor: Prof. Gang Wang
2023 - 2024
MSc in Cognitive Science, The University of Edinburgh
Graduated with Distinction
2019 - 2023
B.Eng. in Automation, Beijing Institute of Technology

Awards

2024
Cognitive Science MSc Dissertation Prize, The University of Edinburgh
Awarded to one student in the programme
2024
Poster Competition Prize, The University of Edinburgh
Awarded to one student in the School of Informatics
2021
Facebook Image Similarity Challenge 2021, NeurIPS 2021 Workshop
Global first place in Track 1 and third place in Track 2 Link

Publications

Object-Centric World Models from Few-Shot Annotations for Sample-Efficient Reinforcement Learning

Weipu Zhang, Adam Jelley, Trevor McInroe, Amos Storkey, and Gang Wang

ICLR, 2026.

Paper Project page Code
STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning

Weipu Zhang, Gang Wang, Jian Sun, Yetian Yuan, and Gao Huang

NeurIPS, 2023.

Paper Code
Mixture-of-World Models: Scaling Multi-Task Reinforcement Learning with Modular Latent Dynamics

Boxuan Zhang, Weipu Zhang, Zhaohan Feng, Wei Xiao, Jian Sun, Jie Chen, and Gang Wang

ICLR, 2026.

Paper
DyMoDreamer: World Modeling with Dynamic Modulation

Boxuan Zhang, Runqing Wang, Wei Xiao, Weipu Zhang, Jian Sun, Gao Huang, Jie Chen, and Gang Wang

NeurIPS, 2025.

Paper Code
Results and findings of the 2021 Image Similarity Challenge

Zoe Papakipos, Giorgos Tolias, Tomas Jenicek, Ed Pizzi, Shuhei Yokoo, Wenhao Wang, Yifan Sun, Weipu Zhang, Yi Yang, Sanjay Addicam, Sergio Manuel Papadakis, Cristian Canton Ferrer, Ondrej Chum, and Matthijs Douze

NeurIPS Competition Track, PMLR, 2022.

Paper

Research interests

Highlights

Education

Awards

Publications

Object-Centric World Models from Few-Shot Annotations for Sample-Efficient Reinforcement Learning

STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning

Mixture-of-World Models: Scaling Multi-Task Reinforcement Learning with Modular Latent Dynamics

DyMoDreamer: World Modeling with Dynamic Modulation

Results and findings of the 2021 Image Similarity Challenge