Zhuosheng Zhang

Tenure-Track Assistant Professor
School of Computer Science
Shanghai Jiao Tong University
Email: zhangzs@sjtu.edu.cn
Office: School of Software 5213
800 Dongchuan Road, Shanghai

Profile

I am a tenure-track assistant professor at Shanghai Jiao Tong University. I received my Ph.D. degree and my M.S. degree from Shanghai Jiao Tong University in 2023 and 2020, respectively. I was an intern at Amazon Web Services, Microsoft Research Redmond, Langboat Tech, NICT (Japan), and IBM. I have served as an action editor for ACL Rolling Review, and a (senior) area chair for ACL 2025, NeurIPS 2025, and EMNLP 2025.

My primary research interests include natural language processing, LLM Reasoning, and LLM Safety. I have published over 80 papers in top-tier conferences and journals, including TPAMI, ICML, ICLR, ACL, AAAI, EMNLP, TNNLS, TASLP, and COLING. I have won 1st place in various language understanding and reasoning leaderboards, such as SQuAD2.0, MuTual, RACE, ShARC, and CMRC. I was awarded as an Academic Star at Shanghai Jiao Tong University and was selected as one of the Global Top 100 Chinese Rising Stars in Artificial Intelligence. I won the Excellent Doctoral Thesis of Chinese Information Processing Society (CIPS), WAIC 2024 Youth Outstanding Paper Award, WAIC 2024 YunFan Award: Bright Star, and Baidu Scholarship.

Tutorials

CVPR 2024: From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond
Hao Fei, Yuan Yao, Ao Zhang, Haotian Liu, Fuxiao Liu, Zhuosheng Zhang, Shuicheng Yan.
Seattle WA, USA
[Website]
LREC-COLING 2024: From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning, Efficiency and Beyond
Hao Fei, Yuan Yao, Zhuosheng Zhang, Fuxiao Liu, Ao Zhang, Tat-Seng Chua.
Torino, Italia
[Website]
IJCNLP-AACL 2023: Learning WHO Saying WHAT to WHOM in Multi-Party Conversations
Jia-Chen Gu, Zhuosheng Zhang, and Zhen-Hua Ling.
Bali, Indonesia.
[Website]
IJCAI 2021: Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond
Zhuosheng Zhang and Hai Zhao.
Montreal, Canada (Virtual)
[Website]
For Beginners: Dive into LLMs《动手学大模型》系列编程实践教程 New Updates! (May 2025)
dive-into-llms

Selected Publications [Show All]

Discover

.
[2025]

Can Knowledge be Transferred from Unimodal to Multimodal? Investigating the Transitivity of Multimodal Knowledge Editing
Lingyong Fang, Xinzhong Wang, Depeng Wang, Zongru Wu, Ya Guo, Huijia Zhu, Zhuosheng Zhang, Gongshen Liu.
ICCV, 2025
[PDF] [Abstract]

Revisiting Data Auditing in Large Vision-Language Models
Hongyu Zhu, Sichu Liang, Wenwen Wang, Boheng Li, Tongxin Yuan, Fangqi Li, Hanyi Wang, Shi-Lin Wang, Zhuosheng Zhang.
ACM MM, 2025
[PDF] [Abstract]

Stealing Knowledge from Auditable Datasets
Hongyu Zhu, Sichu Liang, Wentao Hu, Wenwen Wang, Fangqi Li, Shilin Wang, Zhuosheng Zhang.
ECAI, 2025
[PDF] [Abstract]

The success of modern deep learning hinges on vast training data, much of which is scraped from the web and may include copyrighted or private content—raising serious legal and ethical concerns when used without authorization. Dataset-level provenance seeks to identify whether a model has been trained on specific data collections, thus protecting copyright holders while preserving data utility. Existing techniques either \textit{watermark} datasets to embed distinctive behaviors, or directly \textit{infer} usage from discrepancies in model outputs between seen and unseen samples. These approaches exploit the fundamental problem of empirical risk minimization to overfit to seen features. Hence, provenance signals are considered inherently hard to erase, while the adversary’s perspective remains largely overlooked, limiting our ability to assess reliability in real-world scenarios. In this work, we present a unified framework that interprets both watermarking and inference-based provenance as manifestations of output divergence, modeling the interaction between auditor and adversary as a min-max game over such divergences. This perspective motivates DivMin, a simple yet effective learning strategy that minimizes the relevant divergence to suppress provenance cues. Experiments across diverse image datasets demonstrate that, starting from a pretrained vision-language model, DivMin retains over 93\% of the full fine-tuning performance relative to a zero-shot baseline, while evading all six state-of-the-art auditing methods. Our findings establish divergence minimization as a direct and practical path to obfuscating provenance, offering a realistic simulation of potential adversary strategies to guide the development of more robust auditing techniques.

Probing then Editing Response Personality of Large Language Models
Tianjie Ju, Zhenyu Shao, Bowen Wang, Yujia Chen, Zhuosheng Zhang, Hao Fei, Mong-Li Lee, Wynne Hsu, Sufeng Duan, Gongshen Liu.
COLM, 2025
[PDF] [Abstract]

Caution for the Environment: LLM Agents are Susceptible to Environmental Distractions
Xinbei Ma, Yiting Wang, Yao Yao, Tongxin Yuan, Aston Zhang, Zhuosheng Zhang*, Hai Zhao*.
ACL, 2025
[PDF] [Abstract]

GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents
Lingxiao Diao, Xinyue Xu, Wanxuan Sun, Cheng Yang*, Zhuosheng Zhang*.
ACL, 2025
[PDF] [Abstract]

OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents
Pengzhou Cheng, Zheng Wu, Zongru Wu, Tianjie Ju, Aston Zhang, Zhuosheng Zhang*, Gongshen Liu*
ACL-Findings, 2025
[PDF] [Abstract]

MEGen: Generative Backdoor into Large Language Models via Model Editing
Jiyang Qiu, Xinbei Ma, Zhuosheng Zhang, Hai Zhao*, Yun Li, Qianren Wang.
ACL-Findings, 2025
[PDF] [Abstract]

Risks of AI Scientists: Prioritizing Safeguarding Over Autonomy
Xiangru Tang, Qiao Jin, Kunlun Zhu, Tongxin Yuan, Yichi Zhang, Wangchunshu Zhou, Meng Qu, Yilun Zhao, Jian Tang, Zhuosheng Zhang, Arman Cohan, Zhiyong Lu, Mark Gerstein.
Nature Communications, 2025 (IF: 14.7)
[PDF] [Abstract]

Do NOT Think That Much for 2+ 3=? On the Overthinking of o1-Like LLMs
Xingyu Chen, Jiahao Xu, Tian Liang, Zhiwei He, Jianhui Pang, Dian Yu, Linfeng Song, Qiuzhi Liu, Mengfei Zhou, Zhuosheng Zhang, Rui Wang, Zhaopeng Tu, Haitao Mi, Dong Yu.
ICML, 2025
[PDF] [Abstract]

Watch Out Your Album! On the Inadvertent Privacy Memorization in Multi-Modal Large Language Models
Tianjie Ju, Yi Hua, Hao Fei, Zhenyu Shao, Yubin Zheng, Haodong Zhao, Mong-Li Lee, Wynne Hsu, Zhuosheng Zhang*, Gongshen Liu*.
ICML, 2025
[PDF] [Abstract]

ProbingPrivacy

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Zhuosheng Zhang#, Yao Yao#, Aston Zhang, Xiangru Tang, Xinbei Ma, Zhiwei He, Yiming Wang, Mark Gerstein, Rui Wang, Gongshen Liu, Hai Zhao.
CSUR, 2025 (IF: 23.8)
"Join us on an exciting journey from chain-of-thought reasoning to language agent!"
[PDF] [Abstract]

CoT-Igniting-Agent

Large language models (LLMs) have dramatically enhanced the field of language intelligence, as demonstrably evidenced by their formidable empirical performance across a spectrum of complex reasoning tasks. Additionally, theoretical proofs have illuminated their emergent reasoning capabilities, providing a compelling showcase of their advanced cognitive abilities in linguistic contexts. Critical to their remarkable efficacy in handling complex reasoning tasks, LLMs leverage the intriguing chain-of-thought (CoT) reasoning techniques, obliging them to formulate intermediate steps en route to deriving an answer. The CoT reasoning approach has not only exhibited proficiency in amplifying reasoning performance but also in enhancing interpretability, controllability, and flexibility. In light of these merits, recent research endeavors have extended CoT reasoning methodologies to nurture the development of autonomous language agents, which adeptly adhere to language instructions and execute actions within varied environments. This survey paper orchestrates a thorough discourse, penetrating vital research dimensions, encompassing: (i) the foundational mechanics of CoT techniques, with a focus on elucidating the circumstances and justification behind its efficacy; (ii) the paradigm shift in CoT; and (iii) the burgeoning of language agents fortified by CoT approaches. Prospective research avenues envelop explorations into generalization, efficiency, customization, scaling, and safety. We hope to offer readers a comprehensive understanding of prevalent research areas such as CoT reasoning and language agents and illuminate the interconnections weaving through these areas. This paper caters to a wide audience, including beginners seeking comprehensive knowledge of CoT reasoning and language agents, as well as experienced researchers interested in foundational mechanics and engaging in cutting-edge discussions on these topics. A repository for the related papers is available at https://github.com/Zoeyyao27/CoT-Igniting-Agent.

RaSA: Rank-Sharing Low-Rank Adaptation
Zhiwei He, Zhaopeng Tu, Xing Wang, Xingyu Chen, Zhijie Wang, Jiahao Xu, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Rui Wang.
ICLR, 2025
[PDF] [Abstract]

RaSA

ChemAgent: Self-updating Memories in Large Language Models Improves Chemical Reasoning
Xiangru Tang, Tianyu Hu, Muyang Ye, Yanjun Shao, Xunjian Yin, Siru Ouyang, Wangchunshu Zhou, Pan Lu, Zhuosheng Zhang, Yilun Zhao, Arman Cohan, Mark Gerstein.
ICLR, 2025
[PDF] [Abstract]

chemagent

SynGhost: Invisible and Universal Task-agnostic Backdoor Attack via Syntactic Transfer
Pengzhou Cheng, Wei Du, Zongru Wu, Fengwei Zhang, Libo Chen, Zhuosheng Zhang*, Gongshen Liu*.
NAACL-Findings, 2025
[PDF] [Abstract]

SynGhost

Look before You Leap: Enhancing Attention and Vigilance regarding Harmful Content with GuidelineLLM
Shaoqing Zhang, Zhuosheng Zhang, Kehai Chen, Rongxiang Weng, Muyun Yang, Tiejun Zhao, Min Zhang.
AAAI, 2025
[PDF] [Abstract]

GuidelineLLM

Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining
Zongru Wu, Pengzhou Cheng, Lingyong Fang, Zhuosheng Zhang*, Gongshen Liu*.
COLING, 2025
[PDF] [Abstract]

GraceFul

[2024 & Before]

Is it Possible to Edit Large Language Models Robustly?
Xinbei Ma, Tianjie Ju, Jiyang Qiu, Zhuosheng Zhang*, Hai Zhao*, Lifeng Liu, Yulong Wang.
EMNLP, 2024
"The robustness of model editing remains an open question."
[PDF] [Abstract]

edit_analysis

R-Judge: Benchmarking Safety Risk Awareness for LLM Agents
Tongxin Yuan, Zhiwei He, Lingzhong Dong, Yiming Wang, Ruijie Zhao, Tian Xia, Lizhen Xu, Binglin Zhou, Fangqi Li, Zhuosheng Zhang*, Rui Wang, Gongshen Liu.
EMNLP-Findings, 2024
"Are LLM agents aware of safety risks in real-world applications? Let's find out with R-Judge!"
[PDF] [Abstract]

R-Judge

Multimodal Chain-of-Thought Reasoning in Language Models
Zhuosheng Zhang, Aston Zhang, Mu Li, Hai Zhao, George Karypis, Alex Smola.
TMLR, 2024
"Imagine learning a textbook with no figures: Multimodal-CoT surpasses humans on ScienceQA."
Featured in Dive into Deep Learning (Adopted at 500 universities from 70 countries)
[Top Trending Research on paperswithcode] [Idea Inspiration] [PDF] [Abstract]

MM-CoT

You Only Look at Screens: Multimodal Chain-of-Action Agents
Zhuosheng Zhang, Aston Zhang.
ACL-Findings, 2024
"Perform a task on smart phones? Train an agent using screenshots."
[PDF] [Abstract] [slides]

Auto-GUI

Automatic Chain of Thought Prompting in Large Language Models
Zhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola.
ICLR, 2023
"Let's think not just step by step, but also one by one."
Featured in Dive into Deep Learning (Adopted at 400 universities from 60 countries)
[PDF] [Abstract] [bilibili] [slides]

Auto-CoT

Universal Multimodal Representation for Language Understanding
Zhuosheng Zhang#, Kehai Chen, Rui Wang#, Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao
TPAMI, 2023 (IF: 20.8)
"Let's retrieve images to overcome the lack of large-scale bilingual pairs."
[PDF] [Abstract]

UVR-NMT

SG-Net: Syntax Guided Transformer for Language Representation
Zhuosheng Zhang, Yuwei Wu, Junru Zhou, Sufeng Duan, Hai Zhao, Rui Wang.
TPAMI, 2022 (IF: 20.8)
[PDF] [Abstract]

SG-Net

Text Compression-aided Transformer Encoding
Zuchao Li, Zhuosheng Zhang, Hai Zhao, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita.
TPAMI, 2022 (IF: 20.8)
[PDF] [Abstract]

esc4nmt

Shared Tasks

[May 2022] HellaSwag Leaderboard on Commonsense Reasoning

The best among all submissions.

Leaderboard

Paper

[January 2021] ShARC Leaderboard on Conversational Question Answering

The best among all submissions.

Leaderboard

Paper

[September 2020] MuTual Leaderboard on Dialogue Reasoning Challenge

The best among all submissions.

Leaderboard

Paper

[July 2019] SQuAD2.0 Leaderboard on Machine Reading Comprehension

The best models for both single and ensemble settings among all submissions (2020.01).
The first to surpass human benchmark on both EM and F1 scores with a single model (from 2019.07-09).
The first time to exceed 90% F1 score with ensemble models.
[Leaderboard] [Paper] [Report]

[March 2019] RACE Leaderboard on Machine Reading Comprehension

The best among all submissions.
The best among all academic submissions.
[Leaderboard] [Paper] [Report]

[April 2019] SNLI Leaderboard on Language Inference

The best among all submissions.
[Leaderboard] [Paper]

[March 2019] GLUE Leaderboard on Language Understanding

The 3rd best among all submissions.
The best among all academic submissions.
[Leaderboard] [Paper]

[August 2017] Chinese Machine Reading Comprehension (CCL-CMRC 2017)

The best single system and the second ensemble system (Silver Medal).
[Leaderboard] [Paper] [Source]

Awards & Honors

2024: WAIC Youth Outstanding Paper Award, World Artificial Intelligence Conference.
2024: WAIC YunFan Award: Bright Star, World Artificial Intelligence Conference.
2023: Excellent Doctoral Thesis of Chinese Information Processing Society (CIPS).
2023: Shanghai Outstanding Doctoral Graduate.
2022: Academic Stars of Graduate Students (10 recipients), Shanghai Jiao Tong University.
2021: Global Top 100 Chinese Rising Stars in Artificial Intelligence (Top 10 recommended), Baidu Research.
2021: Baidu Scholarship (10 recipients, worldwide), Baidu.
2020: National Scholarship of China, Ministry of Education of the P.R. China.
2019: Yang Yuanqing Education Fund, The foundation of Class 1988 in CS @ Shanghai Jiao Tong University.
2018: Academic Stars of Graduate Students (The only master student awardee), Shanghai Jiao Tong University.
2016: National Figures Nomination of College Students (20 total recipients), Ministry of Education of the P.R. China.
2015: CCF Elite Collegiate Award, China Computer Federation.

Teaching

NIS3353: Artificial Intelligence Security
Undergraduate, Shanghai Jiao Tong University, Spring 2025.
NIS8021: Frontier Technology in Natural Language Processing
Graduate, Shanghai Jiao Tong University, Fall 2024.
NIS3353: Artificial Intelligence Security
Undergraduate, Shanghai Jiao Tong University, Spring 2024.

Academic Service

Organization:
- Co-chair of Hot Paper Session at CCL 2025
- Session Chair at RL China 2024
- Session Chair at CJNLP 2024
- Session Chair at IJCNLP-AACL 2023
- Co-chair of Student Seminar at CCL 2022
- President of IBM Tech Club at Wuhan University, 2014-2015
(Senior) Area Chair / Action Editor/ SPC:
- ACL Rolling Review
- NeurIPS 2025
- EMNLP 2025
- ACL 2025
- LREC-COLING 2024
- IJCAI 2024
- ICLR 2023 TinyPapers
Program Committee Member:
- ML/AI conferences: ICLR, ICML, NeurIPS, AAAI, IJCAI, etc.
- CL/NLP conferences: ARR, ACL, EMNLP, COLING, NAACL, AACL, NLPCC, CCL, etc.
Journal Reviewer: Artificial Intelligence, IEEE/ACM TASLP, IEEE TNNLS, IEEE TETCI, IEEE Communications Magazine, ACM TALLIP, ACM TOIS, TMLR, Neurocomputing, Multimedia Systems, Neural Computing and Applications, Expert Systems With Applications.

Experience

Jul. 2022 - Aug. 2023, Amazon Web Services AI, CA, USA.
Applied Scientist Intern, advised by Dr. Aston Zhang, Mu Li, Alex Smola.
Feb. 2022 - June. 2022, Microsoft Cognitive Services Research Group, WA, USA.
Research Intern, advised by Dr. Shuohang Wang.
Mar. 2021 - Dec. 2021, Langboat Tech, Beijing, China.
Research Intern, advised by Prof. Ming Zhou.
Jun. 2019 - Jul. 2020, NICT, Kyoto, Japan.
Internship Research Fellow, advised by Prof. Rui Wang, Kehai Chen, Masao Utiyama, and Eiichiro Sumita.

Education

Sept. 2020 - Sept. 2023
Ph.D., Dept. of Computer Science and Engineering, Shanghai Jiao Tong University, advised by Prof. Hai Zhao.
Sept. 2016 - Mar. 2020
M.S., Dept. of Computer Science and Engineering, Shanghai Jiao Tong University, advised by Prof. Hai Zhao.
Sept. 2012 - Jun. 2016
B.S., Dept. of Computer Science and Engineering, Wuhan University, advised by Prof. Haojun Ai.

Lab Members

I am always fortunate to work with these brilliant young researchers. Those are the students I am (was) collaborating with.

Ph.D Students:
- Yansi Li (incoming)
- Zihe Yan (2024-)
- Yiming Wang, co-advising with Prof. Rui Wang (2023-)
- Zongru Wu, co-advising with Prof. Gongshen Liu (2022-)
- Haodong Zhao, co-advising with Prof. Gongshen Liu (2021-)
- Tianjie Ju, co-advising with Prof. Gongshen Liu (2021-)
- Zhiwei He, co-advising with Prof. Rui Wang (2021-)
- Xinbei Ma, co-advising with Prof. Hai Zhao (2021-)
Master Students:
- 2025: Zheng Wu (incoming), Lingxiao Diao (incoming)
- 2024: Lingzhong Dong
- 2023: Tongxin Yuan, co-advising with Prof. Gongshen Liu
- 2022: Anni Zou, co-advising with Prof. Hai Zhao ( → Alibaba Tongyi Lab)
Undergraduate Students:
- 2026: Yuan Guo, Yijie Lu
- 2025: Yiting Wang ( → MS Student at UCSD), Xuanchang Zhang ( → RA at UIUC)
- 2024: Yexin Wu ( → MS Student at UIUC)
- 2023: Sizhe Zhou ( → MS Student at UIUC)
- 2022: Siru Ouyang ( → PhD Student at UIUC), Jialin Chen ( → PhD Student at Yale University), Junlong Li ( → MS Student at SJTU), Dongjie Yang ( → PhD Student at SJTU), Yuchen He ( → MS Student at SJTU)
- 2021: Longxiang Liu ( → MS Student at ICT/CAS)
- 2020: Yuwei Wu ( → MS Student at CMU)