Zhuosheng Zhang

Tenure-Track Assistant Professor
School of Electronic Information and Electrical Engineering
Shanghai Jiao Tong University
Email: zhangzs@sjtu.edu.cn
Office: School of Software 5213
800 Dongchuan Road, Shanghai

Profile

I am a tenure-track assistant professor at Shanghai Jiao Tong University. I received my Ph.D. degree and my M.S. degree from Shanghai Jiao Tong University in 2023 and 2020, respectively. I was an intern at Amazon Web Services, Microsoft Research Redmond, Langboat Tech, NICT (Japan), and IBM. I have served as an action editor for ACL Rolling Review, and a (senior) area chair for ACL 2025, NeurIPS 2025, and EMNLP 2025.

My primary research interests include natural language processing, LLM reasoning, and LLM safety. I have published over 80 papers in top-tier conferences and journals, including TPAMI, ICML, ICLR, ACL, AAAI, EMNLP, TNNLS, TASLP, and COLING. I have won 1st place on various language understanding and reasoning leaderboards, such as SQuAD2.0, MuTual, RACE, ShARC, and CMRC. I was named an Academic Star at Shanghai Jiao Tong University and was selected as one of the Global Top 100 Chinese Rising Stars in Artificial Intelligence. I have also received the Excellent Doctoral Thesis of the Chinese Information Processing Society (CIPS), the WAIC 2024 Youth Outstanding Paper Award, the WAIC 2024 YunFan Award: Bright Star, and the Baidu Scholarship.

Tutorials

  • CVPR 2024: From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond
    Hao Fei, Yuan Yao, Ao Zhang, Haotian Liu, Fuxiao Liu, Zhuosheng Zhang, Shuicheng Yan.
    Seattle WA, USA
    [Website]
  • LREC-COLING 2024: From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning, Efficiency and Beyond
    Hao Fei, Yuan Yao, Zhuosheng Zhang, Fuxiao Liu, Ao Zhang, Tat-Seng Chua.
    Torino, Italia
    [Website]
  • IJCNLP-AACL 2023: Learning WHO Saying WHAT to WHOM in Multi-Party Conversations
    Jia-Chen Gu, Zhuosheng Zhang, and Zhen-Hua Ling.
    Bali, Indonesia.
    [Website]
  • IJCAI 2021: Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond
    Zhuosheng Zhang and Hai Zhao.
    Montreal, Canada (Virtual)
    [Website]
  • For Beginners: Dive into LLMs (《动手学大模型》), a hands-on programming tutorial series. New Updates! (May 2025)

Selected Publications

Find my publications on Google Scholar | Semantic Scholar | DBLP.
[2025]
  • Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
    Xingyu Chen, Jiahao Xu, Tian Liang, Zhiwei He, Jianhui Pang, Dian Yu, Linfeng Song, Qiuzhi Liu, Mengfei Zhou, Zhuosheng Zhang, Rui Wang, Zhaopeng Tu, Haitao Mi, Dong Yu.
    ICML, 2025
    [PDF] [Abstract] [Bib]
  • Watch Out Your Album! On the Inadvertent Privacy Memorization in Multi-Modal Large Language Models
    Tianjie Ju, Yi Hua, Hao Fei, Zhenyu Shao, Yubin Zheng, Haodong Zhao, Mong-Li Lee, Wynne Hsu, Zhuosheng Zhang*, Gongshen Liu*.
    ICML, 2025
    [PDF] [Abstract] [Bib]
  • Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science
    Xiangru Tang, Qiao Jin, Kunlun Zhu, Tongxin Yuan, Yichi Zhang, Wangchunshu Zhou, Meng Qu, Yilun Zhao, Jian Tang, Zhuosheng Zhang, Arman Cohan, Zhiyong Lu, Mark Gerstein.
    Nature Communications, 2025 (IF: 14.7)
    [PDF] [Abstract] [Bib]
  • Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
    Zhuosheng Zhang#, Yao Yao#, Aston Zhang, Xiangru Tang, Xinbei Ma, Zhiwei He, Yiming Wang, Mark Gerstein, Rui Wang, Gongshen Liu, Hai Zhao.
    CSUR, 2025 (IF: 23.8)
    "Join us on an exciting journey from chain-of-thought reasoning to language agent!"
    [PDF] [Abstract] [Bib]
  • RaSA: Rank-Sharing Low-Rank Adaptation
    Zhiwei He, Zhaopeng Tu, Xing Wang, Xingyu Chen, Zhijie Wang, Jiahao Xu, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Rui Wang.
    ICLR, 2025
    [PDF] [Abstract] [Bib]
  • ChemAgent: Self-updating Memories in Large Language Models Improves Chemical Reasoning
    Xiangru Tang, Tianyu Hu, Muyang Ye, Yanjun Shao, Xunjian Yin, Siru Ouyang, Wangchunshu Zhou, Pan Lu, Zhuosheng Zhang, Yilun Zhao, Arman Cohan, Mark Gerstein.
    ICLR, 2025
    [PDF] [Abstract] [Bib]
  • SynGhost: Invisible and Universal Task-agnostic Backdoor Attack via Syntactic Transfer
    Pengzhou Cheng, Wei Du, Zongru Wu, Fengwei Zhang, Libo Chen, Zhuosheng Zhang*, Gongshen Liu*.
    NAACL-Findings, 2025
    [PDF] [Abstract] [Bib]
  • Look before You Leap: Enhancing Attention and Vigilance regarding Harmful Content with GuidelineLLM
    Shaoqing Zhang, Zhuosheng Zhang, Kehai Chen, Rongxiang Weng, Muyun Yang, Tiejun Zhao, Min Zhang.
    AAAI, 2025
    [PDF] [Abstract] [Bib]
  • Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining
    Zongru Wu, Pengzhou Cheng, Lingyong Fang, Zhuosheng Zhang*, Gongshen Liu*.
    COLING, 2025
    [PDF] [Abstract] [Bib]
[2024 & Before]
  • Is it Possible to Edit Large Language Models Robustly?
    Xinbei Ma, Tianjie Ju, Jiyang Qiu, Zhuosheng Zhang*, Hai Zhao*, Lifeng Liu, Yulong Wang.
    EMNLP, 2024
    "The robustness of model editing remains an open question."
    [PDF] [Abstract] [Bib]
  • R-Judge: Benchmarking Safety Risk Awareness for LLM Agents
    Tongxin Yuan, Zhiwei He, Lingzhong Dong, Yiming Wang, Ruijie Zhao, Tian Xia, Lizhen Xu, Binglin Zhou, Fangqi Li, Zhuosheng Zhang*, Rui Wang, Gongshen Liu.
    EMNLP-Findings, 2024
    "Are LLM agents aware of safety risks in real-world applications? Let's find out with R-Judge!"
    [PDF] [Abstract] [Bib]
  • Multimodal Chain-of-Thought Reasoning in Language Models
    Zhuosheng Zhang, Aston Zhang, Mu Li, Hai Zhao, George Karypis, Alex Smola.
    TMLR, 2024
    "Imagine learning a textbook with no figures: Multimodal-CoT surpasses humans on ScienceQA."
    Featured in Dive into Deep Learning (Adopted at 500 universities from 70 countries)
    [Top Trending Research on paperswithcode] [Idea Inspiration] [PDF] [Abstract] [Bib]
  • You Only Look at Screens: Multimodal Chain-of-Action Agents
    Zhuosheng Zhang, Aston Zhang.
    ACL-Findings, 2024
    "Perform a task on smart phones? Train an agent using screenshots."
    [PDF] [Abstract] [Bib] [slides]
  • Automatic Chain of Thought Prompting in Large Language Models
    Zhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola.
    ICLR, 2023
    "Let's think not just step by step, but also one by one."
    Featured in Dive into Deep Learning (Adopted at 400 universities from 60 countries)
    [PDF] [Abstract] [Bib] [bilibili] [slides]
  • Universal Multimodal Representation for Language Understanding
    Zhuosheng Zhang#, Kehai Chen, Rui Wang#, Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao.
    TPAMI, 2023 (IF: 20.8)
    "Let's retrieve images to overcome the lack of large-scale bilingual pairs."
    [PDF] [Abstract] [Bib]
  • SG-Net: Syntax Guided Transformer for Language Representation
    Zhuosheng Zhang, Yuwei Wu, Junru Zhou, Sufeng Duan, Hai Zhao, Rui Wang.
    TPAMI, 2022 (IF: 20.8)
    [PDF] [Abstract] [Bib]
  • Text Compression-aided Transformer Encoding
    Zuchao Li, Zhuosheng Zhang, Hai Zhao, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita.
    TPAMI, 2022 (IF: 20.8)
    [PDF] [Abstract] [Bib]

Shared Tasks

[May 2022] HellaSwag Leaderboard on Commonsense Reasoning
[January 2021] ShARC Leaderboard on Conversational Question Answering
[September 2020] MuTual Leaderboard on Dialogue Reasoning Challenge
[July 2019] SQuAD2.0 Leaderboard on Machine Reading Comprehension
  • The best models for both single and ensemble settings among all submissions (2020.01).
  • The first to surpass human benchmark on both EM and F1 scores with a single model (from 2019.07-09).
  • The first to exceed a 90% F1 score with ensemble models.
    [Leaderboard] [Paper] [Report]
[March 2019] RACE Leaderboard on Machine Reading Comprehension
[April 2019] SNLI Leaderboard on Language Inference
[March 2019] GLUE Leaderboard on Language Understanding
  • The 3rd best among all submissions.
  • The best among all academic submissions.
    [Leaderboard] [Paper]
[August 2017] Chinese Machine Reading Comprehension (CCL-CMRC 2017)

Awards & Honors

  • 2024: WAIC Youth Outstanding Paper Award, World Artificial Intelligence Conference.

  • 2024: WAIC YunFan Award: Bright Star, World Artificial Intelligence Conference.

  • 2023: Excellent Doctoral Thesis of Chinese Information Processing Society (CIPS).

  • 2023: Shanghai Outstanding Doctoral Graduate.

  • 2022: Academic Stars of Graduate Students (10 recipients), Shanghai Jiao Tong University.

  • 2021: Global Top 100 Chinese Rising Stars in Artificial Intelligence (Top 10 recommended), Baidu Research.

  • 2021: Baidu Scholarship (10 recipients, worldwide), Baidu.

  • 2020: National Scholarship of China, Ministry of Education of the P.R. China.

  • 2019: Yang Yuanqing Education Fund, The foundation of Class 1988 in CS @ Shanghai Jiao Tong University.

  • 2018: Academic Stars of Graduate Students (The only master student awardee), Shanghai Jiao Tong University.

  • 2016: National Figures Nomination of College Students (20 total recipients), Ministry of Education of the P.R. China.

  • 2015: CCF Elite Collegiate Award, China Computer Federation.

Teaching

  • NIS3353: Artificial Intelligence Security
    Undergraduate, Shanghai Jiao Tong University, Spring 2025.
  • NIS8021: Frontier Technology in Natural Language Processing
    Graduate, Shanghai Jiao Tong University, Fall 2024.
  • NIS3353: Artificial Intelligence Security
    Undergraduate, Shanghai Jiao Tong University, Spring 2024.

Academic Service

  • Organization:
    • Session Chair at RL China 2024.
    • Session Chair at CJNLP 2024.
    • Session Chair at IJCNLP-AACL 2023.
    • Co-chair of CCL Student Seminar, 2022.
    • President of IBM Tech Club at Wuhan University, 2014-2015.
  • (Senior) Area Chair / Action Editor / SPC:
    • ACL Rolling Review
    • NeurIPS 2025
    • EMNLP 2025
    • ACL 2025
    • LREC-COLING 2024
    • IJCAI 2024
    • ICLR 2023 TinyPapers
  • Program Committee Member:
    • ML/AI conferences: ICLR, ICML, NeurIPS, AAAI, IJCAI, etc.
    • CL/NLP conferences: ARR, ACL, EMNLP, COLING, NAACL, AACL, NLPCC, CCL, etc.

  • Journal Reviewer: Artificial Intelligence, IEEE/ACM TASLP, IEEE TNNLS, IEEE TETCI, IEEE Communications Magazine, ACM TALLIP, ACM TOIS, TMLR, Neurocomputing, Multimedia Systems, Neural Computing and Applications, Expert Systems With Applications.

Experience

  • Jul. 2022 - Aug. 2023, Amazon Web Services AI, CA, USA.
    Applied Scientist Intern (remote), advised by Dr. Aston Zhang, Mu Li, and Alex Smola.
  • Feb. 2022 - Jun. 2022, Microsoft Cognitive Services Research Group, WA, USA.
    Research Intern (remote), advised by Dr. Shuohang Wang.
  • Mar. 2021 - Dec. 2021, Langboat Tech, Beijing, China.
    Research Intern (remote), advised by Prof. Ming Zhou.
  • Jun. 2019 - Jul. 2020, NICT, Kyoto, Japan.
    Internship Research Fellow, advised by Prof. Rui Wang, Kehai Chen, Masao Utiyama, and Eiichiro Sumita.

Education

  • Sept. 2020 - Sept. 2023
    Ph.D., Dept. of Computer Science and Engineering, Shanghai Jiao Tong University, advised by Prof. Hai Zhao.
  • Sept. 2016 - Mar. 2020
    M.S., Dept. of Computer Science and Engineering, Shanghai Jiao Tong University, advised by Prof. Hai Zhao.
  • Sept. 2012 - Jun. 2016
    B.S., Dept. of Computer Science and Engineering, Wuhan University, advised by Prof. Haojun Ai.

Lab Members

I am fortunate to work with these brilliant young researchers. These are the students I am collaborating with or have collaborated with.