About Me

My name is Zheng Wu. I am a Master’s student at the School of Computer Science, Shanghai Jiao Tong University, supervised by Prof. Zhuosheng Zhang. I received my Bachelor’s degree from the same institution in 2025, and I will continue into the Ph.D. program, also advised by Prof. Zhuosheng Zhang.

My research interests include (multimodal) large language models and general agents (especially GUI agents).

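If you are interested in my research, feel free to contact me to discuss possible collaboration. I especially welcome undergraduates who intend to choose Prof. Zhuosheng Zhang as their advisor, as well as undergraduates at Shanghai Jiao Tong University, for research internships and collaboration.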

Experience

B.Eng. in School of Computer Science, Shanghai Jiao Tong University
Sept. 2021 – June 2025

Supervisor: Prof. Zhuosheng Zhang

M.Eng. in School of Computer Science, Shanghai Jiao Tong University
Sept. 2025 – Present

Supervisor: Prof. Zhuosheng Zhang

Awards & Honors

Best Bachelor Thesis, 2025 (top 1% in SJTU)

National Scholarship, 2024 (top 0.2% nationwide)

National Scholarship, 2022 (top 0.2% nationwide)

Publications

* Equal contribution.
# Corresponding Author.


[1] GEM: Gaussian Embedding Modeling for Out-of-Distribution Detection for GUI Agents
Zheng Wu, Pengzhou Cheng, Zongru Wu, Lingzhong Dong, Zhuosheng Zhang#.
AAAI 2026 Paper Code

[2] Agent-Dice: Disentangling Knowledge Updates via Geometric Consensus for Agent Continual Learning
Zheng Wu, Xingyu Lou, Xinbei Ma, Yansi Li, Weiwen Liu, Weinan Zhang, Jun Wang#, Zhuosheng Zhang#.
ACL 2026 Paper Code

[3] Mobile-Aptus: Confidence-Driven Proactive and Robust Interaction in MLLM-based Mobile-Using Agents
Zheng Wu*, Pengzhou Cheng*#, Zongru Wu, Yuan Guo, Tianjie Ju, Aston Zhang, Gongshen Liu, Zhuosheng Zhang#.
TASLP Paper Code

[4] OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents
Pengzhou Cheng*, Zheng Wu*, Zongru Wu, Aston Zhang, Zhuosheng Zhang#, Gongshen Liu#.
ACL 2025 Paper Code

[5] GUI-CIDER: Mid-training GUI Agents via Causal Internalization and Density-aware Exemplar Reselection
Zheng Wu, Chengcheng Han, Zhengxi Lu, Tianjie Ju, Yanyu Chen, Qi Gu#, Xunliang Cai, Zhuosheng Zhang#.
Preprint Paper Code

[6] OS-SPEAR: A Toolkit for the Safety, Performance, Efficiency, and Robustness Analysis of OS Agents
Zheng Wu, Yi Hua, Zhaoyuan Huang, Chenhao Xue, Yijie Lu, Pengzhou Cheng, Zongru Wu, Lingzhong Dong, Gongshen Liu, Xinghao Jiang, Zhuosheng Zhang#.
Preprint Paper Code

[7] Quick on the Uptake: Eliciting Implicit Intents from Human Demonstrations for Personalized Mobile-Use Agents
Zheng Wu, Heyuan Huang, Yanjia Yang, Yuanyi Song, Xingyu Lou, Weiwen Liu, Weinan Zhang, Jun Wang#, Zhuosheng Zhang#.
Preprint Paper Code

[8] VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents
Zheng Wu, Heyuan Huang, Xingyu Lou, Xiangmou Qu, Pengzhou Cheng, Zongru Wu, Weiwen Liu, Weinan Zhang, Jun Wang, Zhaoxiang Wang#, Zhuosheng Zhang#.
Preprint Paper Code

[9] ColorEcosystem: Powering Personalized, Standardized, and Trustworthy Agentic Service in Massive-agent Ecosystem
Fangwen Wu*, Zheng Wu*, Jihong Wang, Yunku Chen, Ruiguang Pei, Heyuan Huang, Xin Liao, Xingyu Lou, Huarong Deng, Zhihui Fu, Weiwen Liu, Zhuosheng Zhang, Weinan Zhang, Jun Wang#.
Position Paper Paper Code

[10] ColorAgent: Building a Robust, Personalized, and Interactive OS Agent
Ning Li*, Qiqiang Lin*, Zheng Wu, Xiaoyun Mo, Weiming Zhang, Yin Zhao, Xiangmou Qu, Jiamu Zhou, Jun Wang, Congmin Zheng, Yuanyi Song, Hongjiang Chen, Heyuan Huang, Jihong Wang, Jiaxin Yin, Jingwei Yu, Junwei Liao, Qiuying Peng, Xingyu Lou#, Jun Wang, Weiwen Liu#, Zhuosheng Zhang#, Weinan Zhang.
Technical Report Paper

[11] Faithful Mobile GUI Agents with Guided Advantage Estimator
Haowen Hu*, Pengzhou Cheng*, Zheng Wu, Lingzhong Dong, Gongshen Liu#, Zhuosheng Zhang#.
ICML 2026 Paper Code

[12] Training High-Level Schedulers with Execution-Feedback Reinforcement Learning for Long-Horizon GUI Automation
Zehao Deng, Tianjie Ju, Zheng Wu, Zhuosheng Zhang#, Gongshen Liu.
CVPR 2026 Paper Code

[13] Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-powered Mobile GUI Agents
Pengzhou Cheng, Haowen Hu, Zheng Wu, Zongru Wu, Tianjie Ju, Zhuosheng Zhang#, Gongshen Liu#.
EMNLP 2025 Paper Code

[14] See, Think, Act: Teaching Multimodal Agents to Effectively Interact with GUI by Identifying Toggles
Zongru Wu, Rui Mao, Zhiyuan Tian, Pengzhou Cheng, Tianjie Ju, Zheng Wu, Lingzhong Dong, Haiyue Sheng, Zhuosheng Zhang#, Gongshen Liu#.
CVPR 2026 Paper Code

[15] LC-ERD: Mining Latent Logic for Self-Evolving Reasoning via Consistency-Regulated Reward Decomposition
Yanyu Chen, Jiyue Jiang, Dianzhi Yu, Zheng Wu, Jiahong Liu, Jiaming Han, Xiao Guo, Jinhu Qi, Yu Li, Yifei Zhang, Irwin King#.
KDD 2026 Paper

[16] MineExplorer: Evaluating Open-World Exploration of MLLM Agents in Minecraft
Tianjie Ju, Yueqing Sun, Zheng Wu, Wei Zhang, Yaqi Huo, Xi Su, Qi Gu#, Xunliang Cai, Gongshen Liu, Zhuosheng Zhang#.
Preprint Paper Code

[17] Causal Probing for Internal Visual Representations in Multimodal Large Language Models
Zehao Deng*, Tianjie Ju*, Zheng Wu, Liangbo He, Jun Lan, Huijia Zhu, Weiqiang Wang, Zhuosheng Zhang#.
Preprint Paper Code

[18] Agent-ScanKit: Unraveling Memory and Reasoning of MLLM-Based Agents via Sensitivity Perturbations
Pengzhou Cheng, Lingzhong Dong, Zheng Wu, Zongru Wu, Xiangru Tang, Chengwei Qin, Zhuosheng Zhang#, Gongshen Liu#.
Preprint Paper Code

[19] Atomic-to-Compositional Generalization for Mobile Agents with Systematic Scheduling
Yuan Guo, Tingjia Miao, Zheng Wu, Pengzhou Cheng, Ming Zhou, Zhuosheng Zhang#.
Preprint Paper Code

[20] Smoothing Grounding and Reasoning for MLLM-Powered GUI Agents with Query-Oriented Pivot Tasks
Zongru Wu, Pengzhou Cheng, Zheng Wu, Tianjie Ju, Zhuosheng Zhang#, Gongshen Liu#.
Preprint Paper Code

[21] Domain Adaptation of MLLM-based Computer-Using Agents with Standard Operating Procedure
Lingzhong Dong, Ziqi Zhou, Pengzhou Cheng, Zongru Wu, Zheng Wu, Gongshen Liu#, Zhuosheng Zhang#.
KSEM 2026

[22] Say One Thing, Do Another? Diagnosing Reasoning-Execution Gaps in VLM-Powered Mobile-Use Agents
Lingzhong Dong, Ziqi Zhou, Shuaibo Yang, Haiyue Sheng, Pengzhou Cheng, Zongru Wu, Zheng Wu, Gongshen Liu#, Zhuosheng Zhang#.
Preprint Paper Code


Tutorials and Contributions

Participating: Dive into LLMs (《动手学大模型》) Course Series Course Link

Participating: 《大模型开发全流程》 (The Full Workflow of Large Model Development) Course Series Course Link

Academic Services

Reviewer of conferences:

AAAI, ACL, ICML, EMNLP

Reviewer of journals:

IJHCI
