About Me

My name is Zheng Wu, and I am currently a Master’s student at the School of Computer Science, Shanghai Jiao Tong University, under the supervision of Prof. Zhuosheng Zhang. I received my Bachelor’s degree from the same institution in 2025. I also will continue my academic journey by transitioning into the Ph.D. program advised by Prof. Zhuosheng Zhang.

My research interests include natural language processing, (multimodal) large language models, and GUI agents.

Experience

B.Eng. in School of Computer Science, Shanghai Jiao Tong University
Sept. 2021 – June 2025

Supervisor: Prof. Zhuosheng Zhang

M.Eng. in School of Computer Science, Shanghai Jiao Tong University
Sept. 2025 – Present

Supervisor: Prof. Zhuosheng Zhang

Awards & Honors

Best Bachelor Thesis, 2025(top 1% in SJTU)

National Scholarship, 2024(top 0.2% nationwide)

National Scholarship, 2022(top 0.2% nationwide)

Publications

* Equal contribution.

# Corresponding Author.


  • Zheng Wu*, Pengzhou Cheng*, Zongru Wu, Yuan Guo, Tianjie Ju, Aston Zhang, Gongshen Liu#, Zhuosheng Zhang#. Universal Confidence Integration Framework for Adaptive Interaction in Computer-Using Agent. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, under review)

  • Pengzhou Cheng*, Zheng Wu*, Zongru Wu, Aston Zhang, Zhuosheng Zhang#, Gongshen Liu#. OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Findings of ACL 2025) Paper

  • Zheng Wu, Pengzhou Cheng, Zongru Wu, Lingzhong Dong, Zhuosheng Zhang#. GEM: Gaussian Embedding Modeling for Out-of-Distribution Detection for GUI Agents. (arXiv, under review) Paper

  • Zheng Wu, Heyuan Huang, Yanjia Yang, Yuanyi Song, Xingyu Lou, Weiwen Liu, Weinan Zhang, Jun Wang#, Zhuosheng Zhang#. Quick on the Uptake: Eliciting Implicit Intents from Human Demonstrations for Personalized Mobile-Use Agents. (arXiv, under review) Paper

  • Zheng Wu, Heyuan Huang, Xingyu Lou, Xiangmou Qu, Pengzhou Cheng, Zongru Wu, Weiwen Liu, Weinan Zhang, Jun Wang, Zhaoxiang Wang, Zhuosheng Zhang#. VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents. (arXiv, under review) Paper

  • Pengzhou Cheng, Haowen Hu, Zheng Wu, Zongru Wu, Tianjie Ju, Zhuosheng Zhang#, Gongshen Liu#. Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-powered Mobile GUI Agents. The 2025 Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP 2025) Paper

  • Yuan Guo, Tingjia Miao, Zheng Wu, Pengzhou Cheng, Ming Zhou, Zhuosheng Zhang#. Atomic-to-Compositional Generalization for Mobile Agents with Systematic Scheduling. (arXiv, under review) Paper

  • Zongru Wu, Pengzhou Cheng, Zheng Wu, Tianjie Ju, Zhuosheng Zhang#, Gongshen Liu#. Smoothing Grounding and Reasoning for MLLM-Powered GUI Agents with Query-Oriented Pivot Tasks. (arXiv, under review) Paper

  • Lingzhong Dong, Pengzhou Cheng, Zongru Wu, Zheng Wu, Gongshen Liu#, Zhuosheng Zhang#. Domain Adaptation of MLLM-based Computer-Using Agents with Standard Operating Procedure. (under review)


Tutorials and Contributions

Participating: Dive into LLMs《动手学大模型》Course Series GitHub starsCourse Link

Participating: 《大模型开发全流程》Course Series Course Link

Academic Services

Reviewer of conferences:

AAAI 2026