About Me
My name is Zheng Wu, and I am currently a Master’s student at the School of Computer Science, Shanghai Jiao Tong University, under the supervision of Prof. Zhuosheng Zhang. I received my Bachelor’s degree from the same institution in 2025. I also will continue my academic journey by transitioning into the Ph.D. program advised by Prof. Zhuosheng Zhang.
My research interests include (multimodal) large language models and general agents (especially GUI agents).
如果您对我的研究方向感兴趣,欢迎与我联系、探讨合作可能性。尤其欢迎有意向选择Zhuosheng Zhang老师作为导师的本科生或上海交通大学的校内本科生进行科研实习与合作。
Experience
B.Eng. in School of Computer Science, Shanghai Jiao Tong University
Sept. 2021 – June 2025
M.Eng. in School of Computer Science, Shanghai Jiao Tong University
Sept. 2025 – Present
Awards & Honors
Best Bachelor Thesis, 2025(top 1% in SJTU)
National Scholarship, 2024(top 0.2% nationwide)
National Scholarship, 2022(top 0.2% nationwide)
Publications
* Equal contribution.
# Corresponding Author.
[1] GEM: Gaussian Embedding Modeling for Out-of-Distribution Detection for GUI Agents
Zheng Wu, Pengzhou Cheng, Zongru Wu, Lingzhong Dong, Zhuosheng Zhang#.
AAAI 2026
[2] Agent-Dice: Disentangling Knowledge Updates via Geometric Consensus for Agent Continual Learning
Zheng Wu, Xingyu Lou, Xinbei Ma, Yansi Li, Weiwen Liu, Weinan Zhang, Jun Wang#, Zhuosheng Zhang#.
ACL 2026
[3] Mobile-Aptus: Confidence-Driven Proactive and Robust Interaction in MLLM-based Mobile-Using Agents
Zheng Wu*, Pengzhou Cheng*#, Zongru Wu, Yuan Guo, Tianjie Ju, Aston Zhang, Gongshen Liu, Zhuosheng Zhang#.
TASLP
[4] OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents
Pengzhou Cheng*, Zheng Wu*, Zongru Wu, Aston Zhang, Zhuosheng Zhang#, Gongshen Liu#.
ACL 2025
[5] GUI-CIDER: Mid-training GUI Agents via Causal Internalization and Density-aware Exemplar Reselection
Zheng Wu, Chengcheng Han, Zhengxi Lu, Tianjie Ju, Yanyu Chen, Qi Gu#, Xunliang Cai, Zhuosheng Zhang#.
Preprint
[6] OS-SPEAR: A Toolkit for the Safety, Performance, Efficiency, and Robustness Analysis of OS Agents
Zheng Wu, Yi Hua, Zhaoyuan Huang, Chenhao Xue, Yijie Lu, Pengzhou Cheng, Zongru Wu, Lingzhong Dong, Gongshen Liu, Xinghao Jiang, Zhuosheng Zhang#.
Preprint
[7] Quick on the Uptake: Eliciting Implicit Intents from Human Demonstrations for Personalized Mobile-Use Agents
Zheng Wu, Heyuan Huang, Yanjia Yang, Yuanyi Song, Xingyu Lou, Weiwen Liu, Weinan Zhang, Jun Wang#, Zhuosheng Zhang#.
Preprint
[8] VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents
Zheng Wu, Heyuan Huang, Xingyu Lou, Xiangmou Qu, Pengzhou Cheng, Zongru Wu, Weiwen Liu, Weinan Zhang, Jun Wang, Zhaoxiang Wang#, Zhuosheng Zhang#.
Preprint
[9] ColorEcosystem: Powering Personalized, Standardized, and Trustworthy Agentic Service in Massive-agent Ecosystem
Fangwen Wu*, Zheng Wu*, Jihong Wang, Yunku Chen, Ruiguang Pei, Heyuan Huang, Xin Liao, Xingyu Lou, Huarong Deng, Zhihui Fu, Weiwen Liu, Zhuosheng Zhang, Weinan Zhang, Jun Wang#.
Position Paper
[10] ColorAgent: Building a Robust, Personalized, and Interactive OS Agent
Ning Li*, Qiqiang Lin*, Zheng Wu, Xiaoyun Mo, Weiming Zhang, Yin Zhao, Xiangmou Qu, Jiamu Zhou, Jun Wang, Congmin Zheng, Yuanyi Song, Hongjiang Chen, Heyuan Huang, Jihong Wang, Jiaxin Yin, Jingwei Yu, Junwei Liao, Qiuying Peng, Xingyu Lou#, Jun Wang, Weiwen Liu#, Zhuosheng Zhang#, Weinan Zhang.
Technical Report
[11] Faithful Mobile GUI Agents with Guided Advantage Estimator
Haowen Hu*, Pengzhou Cheng*, Zheng Wu, Lingzhong Dong, Gongshen Liu#, Zhuosheng Zhang#.
ICML 2026
[12] Training High-Level Schedulers with Execution-Feedback Reinforcement Learning for Long-Horizon GUI Automation
Zehao Deng, Tianjie Ju, Zheng Wu, Zhuosheng Zhang#, Gongshen Liu.
CVPR 2026
[13] Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-powered Mobile GUI Agents
Pengzhou Cheng, Haowen Hu, Zheng Wu, Zongru Wu, Tianjie Ju, Zhuosheng Zhang#, Gongshen Liu#.
EMNLP 2025
[14] See, Think, Act: Teaching Multimodal Agents to Effectively Interact with GUI by Identifying Toggles
Zongru Wu, Rui Mao, Zhiyuan Tian, Pengzhou Cheng, Tianjie Ju, Zheng Wu, Lingzhong Dong, Haiyue Sheng, Zhuosheng Zhang#, Gongshen Liu#.
CVPR 2026
[15] LC-ERD: Mining Latent Logic for Self-Evolving Reasoning via Consistency-Regulated Reward Decomposition
Yanyu Chen, Jiyue Jiang, Dianzhi Yu, Zheng Wu, Jiahong Liu, Jiaming Han, Xiao Guo, Jinhu Qi, Yu Li, Yifei Zhang, Irwin King#.
KDD 2026
[16] MineExplorer: Evaluating Open-World Exploration of MLLM Agents in Minecraft
Tianjie Ju, Yueqing Sun, Zheng Wu, Wei Zhang, Yaqi Huo, Xi Su, Qi Gu#, Xunliang Cai, Gongshen Liu, Zhuosheng Zhang#.
Preprint
[17] Causal Probing for Internal Visual Representations in Multimodal Large Language Models
Zehao Deng*, Tianjie Ju*, Zheng Wu, Liangbo He, Jun Lan, Huijia Zhu, Weiqiang Wang, Zhuosheng Zhang#.
Preprint
[18] Agent-ScanKit: Unraveling Memory and Reasoning of MLLM-Based Agents via Sensitivity Perturbations
Pengzhou Cheng, Lingzhong Dong, Zheng Wu, Zongru Wu, Xiangru Tang, Chengwei Qin, Zhuosheng Zhang#, Gongshen Liu#.
Preprint
[19] Atomic-to-Compositional Generalization for Mobile Agents with Systematic Scheduling
Yuan Guo, Tingjia Miao, Zheng Wu, Pengzhou Cheng, Ming Zhou, Zhuosheng Zhang#.
Preprint
[20] Smoothing Grounding and Reasoning for MLLM-Powered GUI Agents with Query-Oriented Pivot Tasks
Zongru Wu, Pengzhou Cheng, Zheng Wu, Tianjie Ju, Zhuosheng Zhang#, Gongshen Liu#.
Preprint
[21] Domain Adaptation of MLLM-based Computer-Using Agents with Standard Operating Procedure
Lingzhong Dong, Ziqi Zhou, Pengzhou Cheng, Zongru Wu, Zheng Wu, Gongshen Liu#, Zhuosheng Zhang#.
KSEM 2026
[22] Say One Thing, Do Another? Diagnosing Reasoning-Execution Gaps in VLM-Powered Mobile-Use Agents
Lingzhong Dong, Ziqi Zhou, Shuaibo Yang, Haiyue Sheng, Pengzhou Cheng, Zongru Wu, Zheng Wu, Gongshen Liu#, Zhuosheng Zhang#.
Preprint
Tutorials and Contributions
Participating: Dive into LLMs《动手学大模型》Course Series
Participating: 《大模型开发全流程》Course Series
Academic Services
Reviewer of conferences:
AAAI, ACL, ICML, EMNLP
Reviewer of journals:
IJHCI
