About Me
My name is Zheng Wu, and I am currently a Master’s student at the School of Computer Science, Shanghai Jiao Tong University, under the supervision of Prof. Zhuosheng Zhang. I received my Bachelor’s degree from the same institution in 2025. I will continue my studies in the Ph.D. program, also advised by Prof. Zhuosheng Zhang.
My research interests include natural language processing, (multimodal) large language models, and GUI agents.
Experience
B.Eng. in School of Computer Science, Shanghai Jiao Tong University
Sept. 2021 – June 2025
M.Eng. in School of Computer Science, Shanghai Jiao Tong University
Sept. 2025 – Present
Awards & Honors
Best Bachelor Thesis, 2025 (top 1% in SJTU)
National Scholarship, 2024 (top 0.2% nationwide)
National Scholarship, 2022 (top 0.2% nationwide)
Publications
* Equal contribution.
# Corresponding Author.
Zheng Wu*, Pengzhou Cheng*, Zongru Wu, Yuan Guo, Tianjie Ju, Aston Zhang, Gongshen Liu#, Zhuosheng Zhang#. Universal Confidence Integration Framework for Adaptive Interaction in Computer-Using Agent. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, under review)
Pengzhou Cheng*, Zheng Wu*, Zongru Wu, Aston Zhang, Zhuosheng Zhang#, Gongshen Liu#. OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Findings of ACL 2025)
Zheng Wu, Pengzhou Cheng, Zongru Wu, Lingzhong Dong, Zhuosheng Zhang#. GEM: Gaussian Embedding Modeling for Out-of-Distribution Detection for GUI Agents. (Preprint)
Zheng Wu, Heyuan Huang, Yanjia Yang, Yuanyi Song, Xingyu Lou, Weiwen Liu, Weinan Zhang, Jun Wang#, Zhuosheng Zhang#. Quick on the Uptake: Eliciting Implicit Intents from Human Demonstrations for Personalized Mobile-Use Agents. (Preprint)
Zheng Wu, Heyuan Huang, Xingyu Lou, Xiangmou Qu, Pengzhou Cheng, Zongru Wu, Weiwen Liu, Weinan Zhang, Jun Wang, Zhaoxiang Wang#, Zhuosheng Zhang#. VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents. (Preprint)
Fangwen Wu*, Zheng Wu*, Jihong Wang, Yunku Chen, Ruiguang Pei, Heyuan Huang, Xin Liao, Xingyu Lou, Huarong Deng, Zhihui Fu, Weiwen Liu, Zhuosheng Zhang, Weinan Zhang, Jun Wang#. ColorEcosystem: Powering Personalized, Standardized, and Trustworthy Agentic Service in Massive-agent Ecosystem. (Position Paper)
Ning Li*, Qiqiang Lin*, Zheng Wu, Xiaoyun Mo, Weiming Zhang, Yin Zhao, Xiangmou Qu, Jiamu Zhou, Jun Wang, Congmin Zheng, Yuanyi Song, Hongjiang Chen, Heyuan Huang, Jihong Wang, Jiaxin Yin, Jingwei Yu, Junwei Liao, Qiuying Peng, Xingyu Lou#, Jun Wang, Weiwen Liu#, Zhuosheng Zhang#, Weinan Zhang. ColorAgent: Building a Robust, Personalized, and Interactive OS Agent. (Technical Report)
Pengzhou Cheng, Haowen Hu, Zheng Wu, Zongru Wu, Tianjie Ju, Zhuosheng Zhang#, Gongshen Liu#. Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-powered Mobile GUI Agents. The 2025 Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP 2025)
Pengzhou Cheng, Lingzhong Dong, Zheng Wu, Zongru Wu, Xiangru Tang, Chengwei Qin, Zhuosheng Zhang#, Gongshen Liu#. Agent-ScanKit: Unraveling Memory and Reasoning of MLLM-Based Agents via Sensitivity Perturbations. (Preprint)
Yuan Guo, Tingjia Miao, Zheng Wu, Pengzhou Cheng, Ming Zhou, Zhuosheng Zhang#. Atomic-to-Compositional Generalization for Mobile Agents with Systematic Scheduling. (Preprint)
Zongru Wu, Pengzhou Cheng, Zheng Wu, Tianjie Ju, Zhuosheng Zhang#, Gongshen Liu#. Smoothing Grounding and Reasoning for MLLM-Powered GUI Agents with Query-Oriented Pivot Tasks. (Preprint)
Zongru Wu, Rui Mao, Zhiyuan Tian, Pengzhou Cheng, Tianjie Ju, Zheng Wu, Lingzhong Dong, Haiyue Sheng, Zhuosheng Zhang#, Gongshen Liu#. See, Think, Act: Teaching Multimodal Agents to Effectively Interact with GUI by Identifying Toggles. (Preprint)
Lingzhong Dong, Pengzhou Cheng, Zongru Wu, Zheng Wu, Gongshen Liu#, Zhuosheng Zhang#. Domain Adaptation of MLLM-based Computer-Using Agents with Standard Operating Procedure. (Under Review)
Lingzhong Dong, Ziqi Zhou, Shuaibo Yang, Haiyue Sheng, Pengzhou Cheng, Zongru Wu, Zheng Wu, Gongshen Liu#, Zhuosheng Zhang#. Say One Thing, Do Another? Diagnosing Reasoning-Execution Gaps in VLM-Powered Mobile-Use Agents. (Preprint)
Tutorials and Contributions
Participating: Dive into LLMs《动手学大模型》Course Series
Participating: 《大模型开发全流程》 (The Full Workflow of Large Model Development) Course Series
Academic Services
Reviewer of conferences:
AAAI 2026
