(#: equal contribution; ★: open resources available on GitHub)
[Preprints]
This paper investigates the faithfulness of multimodal large language model (MLLM) agents in the graphical user interface (GUI) environment, aiming to address the research question of whether multimodal GUI agents can be distracted by environmental context. A general setting is proposed where both the user and the agent are benign, and the environment, while not malicious, contains unrelated content. A wide range of MLLMs are evaluated as GUI agents using our simulated dataset, following three working patterns with different levels of perception. Experimental results reveal that even the most powerful models, whether generalist agents or specialist GUI agents, are susceptible to distractions. While recent studies predominantly focus on the helpfulness (i.e., action accuracy) of multimodal agents, our findings indicate that these agents are prone to environmental distractions, resulting in unfaithful behaviors. Furthermore, we switch to the adversarial perspective and implement environment injection, demonstrating that such unfaithfulness can be exploited, leading to unexpected risks.
@article{ma2024caution, title={Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions}, author={Ma, Xinbei and Wang, Yiting and Yao, Yao and Yuan, Tongxin and Zhang, Aston and Zhang, Zhuosheng and Zhao, Hai}, journal={arXiv preprint arXiv:2408.02544}, year={2024} }
The rapid adoption of large language models (LLMs) in multi-agent systems has highlighted their impressive capabilities in various applications, such as collaborative problem-solving and autonomous negotiation. However, the security implications of these LLM-based multi-agent systems have not been thoroughly investigated, particularly concerning the spread of manipulated knowledge. In this paper, we investigate this critical issue by constructing a detailed threat model and a comprehensive simulation environment that mirrors real-world multi-agent deployments in a trusted platform. Subsequently, we propose a novel two-stage attack method involving Persuasiveness Injection and Manipulated Knowledge Injection to systematically explore the potential for manipulated knowledge (i.e., counterfactual and toxic knowledge) spread without explicit prompt manipulation. Our method leverages the inherent vulnerabilities of LLMs in handling world knowledge, which attackers can exploit to make agents unwittingly spread fabricated information. Through extensive experiments, we demonstrate that our attack method can successfully induce LLM-based agents to spread both counterfactual and toxic knowledge without degrading their foundational capabilities during agent communication. Furthermore, we show that these manipulations can persist through popular retrieval-augmented generation frameworks, where several benign agents store and retrieve manipulated chat histories for future interactions. This persistence indicates that even after the interaction has ended, the benign agents may continue to be influenced by manipulated knowledge. Our findings reveal significant security risks in LLM-based multi-agent systems, emphasizing the imperative need for robust defenses against manipulated knowledge spread, such as introducing "guardian" agents and advanced fact-checking tools.
@article{ju2024flooding, title={Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities}, author={Ju, Tianjie and Wang, Yiting and Ma, Xinbei and Cheng, Pengzhou and Zhao, Haodong and Wang, Yulong and Liu, Lifeng and Xie, Jian and Zhang, Zhuosheng and Liu, Gongshen}, journal={arXiv preprint arXiv:2407.07791}, year={2024} }
Large language models (LLMs) have raised concerns about potential security threats despite their strong performance in natural language processing (NLP). Backdoor attacks were among the first to show that LLMs can be substantially harmed at every stage of their pipeline, but their cost and robustness have been criticized: attacking LLMs is inherently risky under security review, while prohibitively expensive, and the continuous iteration of LLMs degrades the robustness of backdoors. In this paper, we propose TrojanRAG, which employs a joint backdoor attack on Retrieval-Augmented Generation (RAG), thereby manipulating LLMs in universal attack scenarios. Specifically, the adversary constructs elaborate target contexts and trigger sets. Multiple pairs of backdoor shortcuts are orthogonally optimized by contrastive learning, constraining the triggering conditions to a parameter subspace to improve matching. To improve the recall of the RAG for the target contexts, we introduce a knowledge graph to construct structured data and achieve hard matching at a fine-grained level. Moreover, we normalize the backdoor scenarios in LLMs to analyze the real harm caused by backdoors from both the attacker's and the user's perspectives, and further verify whether the context is a favorable tool for jailbreaking models. Extensive experimental results on truthfulness, language understanding, and harmfulness show that TrojanRAG exhibits versatile threats while maintaining retrieval capability on normal queries.
@article{cheng2024trojanrag, title={TrojanRAG: Retrieval-Augmented Generation Can Be Backdoor Driver in Large Language Models}, author={Cheng, Pengzhou and Ding, Yidong and Ju, Tianjie and Wu, Zongru and Du, Wei and Yi, Ping and Zhang, Zhuosheng and Liu, Gongshen}, journal={arXiv preprint arXiv:2405.13401}, year={2024} }
Intelligent agents powered by large language models (LLMs) have demonstrated substantial promise in autonomously conducting experiments and facilitating scientific discoveries across various disciplines. While their capabilities are promising, they also introduce novel vulnerabilities that demand careful consideration for safety. However, there exists a notable gap in the literature, as there has been no comprehensive exploration of these vulnerabilities. This position paper fills this gap by conducting a thorough examination of vulnerabilities in LLM-based agents within scientific domains, shedding light on potential risks associated with their misuse and emphasizing the need for safety measures. We begin by providing a comprehensive overview of the potential risks inherent to scientific LLM agents, taking into account user intent, the specific scientific domain, and their potential impact on the external environment. Then, we delve into the origins of these vulnerabilities and provide a scoping review of the limited existing works. Based on our analysis, we propose a triadic framework involving human regulation, agent alignment, and an understanding of environmental feedback (agent regulation) to mitigate these identified risks. Furthermore, we highlight the limitations and challenges associated with safeguarding scientific agents and advocate for the development of improved models, robust benchmarks, and comprehensive regulations to address these issues effectively.
@article{tang2024prioritizing, title={Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science}, author={Tang, Xiangru and Jin, Qiao and Zhu, Kunlun and Yuan, Tongxin and Zhang, Yichi and Zhou, Wangchunshu and Qu, Meng and Zhao, Yilun and Tang, Jian and Zhang, Zhuosheng and Cohan, Arman and Lu, Zhiyong and Gerstein, Mark}, journal={arXiv preprint arXiv:2402.04247}, year={2024} }
Large language models (LLMs) have dramatically enhanced the field of language intelligence, as evidenced by their formidable empirical performance across a spectrum of complex reasoning tasks. Additionally, theoretical proofs have illuminated their emergent reasoning capabilities, providing a compelling showcase of their advanced cognitive abilities in linguistic contexts. Critical to their remarkable efficacy in handling complex reasoning tasks, LLMs leverage the intriguing chain-of-thought (CoT) reasoning technique, which obliges them to formulate intermediate steps en route to deriving an answer. CoT reasoning has proven effective not only in amplifying reasoning performance but also in enhancing interpretability, controllability, and flexibility. In light of these merits, recent research has extended CoT reasoning methodologies to nurture the development of autonomous language agents, which adeptly adhere to language instructions and execute actions within varied environments. This survey offers a thorough discussion of vital research dimensions, encompassing: (i) the foundational mechanics of CoT techniques, with a focus on elucidating the circumstances and justification behind their efficacy; (ii) the paradigm shift in CoT; and (iii) the burgeoning of language agents fortified by CoT approaches. Prospective research avenues include explorations into generalization, efficiency, customization, scaling, and safety. We hope to offer readers a comprehensive understanding of prevalent research areas such as CoT reasoning and language agents and to illuminate the interconnections between these areas. This paper caters to a wide audience, including beginners seeking comprehensive knowledge of CoT reasoning and language agents, as well as experienced researchers interested in foundational mechanics and cutting-edge discussions on these topics. A repository of related papers is available at https://github.com/Zoeyyao27/CoT-Igniting-Agent.
@article{zhang2023igniting, title={Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents}, author={Zhang, Zhuosheng and Yao, Yao and Zhang, Aston and Tang, Xiangru and Ma, Xinbei and He, Zhiwei and Wang, Yiming and Gerstein, Mark and Wang, Rui and Liu, Gongshen and others}, journal={arXiv preprint arXiv:2311.11797}, year={2023} }
Real-world data deviating from the independent and identically distributed (i.i.d.) assumption of in-distribution training data poses security threats to deep networks, thus advancing out-of-distribution (OOD) detection algorithms. Detection methods in generative language models (GLMs) mainly focus on uncertainty estimation and embedding distance measurement, with the latter proven to be most effective in traditional linguistic tasks like summarization and translation. However, another complex generative scenario, mathematical reasoning, poses significant challenges to embedding-based methods due to the high density of its output space, yet this very feature causes larger discrepancies in the embedding shift trajectories of different samples in latent space. Hence, we propose TV score, a trajectory-based method that uses trajectory volatility for OOD detection in mathematical reasoning. Experiments show that our method outperforms all traditional algorithms on GLMs under mathematical reasoning scenarios and can be extended to more applications with high-density features in output spaces, such as multiple-choice questions.
@article{wang2024trajectory, title={Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning}, author={Wang, Yiming and Zhang, Pei and Yang, Baosong and Wong, Derek F and Zhang, Zhuosheng and Wang, Rui}, journal={arXiv preprint arXiv:2405.14039}, year={2024} }
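The trajectory idea above lends itself to a compact sketch. The Python fragment below is a minimal, illustrative take on trajectory volatility, assuming per-layer hidden states have already been extracted; the paper's exact TV score definition and threshold calibration differ in detail.

    import numpy as np

    def trajectory_volatility(hidden_states: np.ndarray) -> float:
        # hidden_states: (num_layers, hidden_dim) -- one sample's embedding
        # "trajectory" across the model's layers.
        shifts = np.linalg.norm(np.diff(hidden_states, axis=0), axis=1)
        # Volatility of the layer-to-layer shifts; the paper builds its TV
        # score on this kind of trajectory signal.
        return float(np.var(shifts))

    def is_ood(hidden_states: np.ndarray, threshold: float) -> bool:
        # threshold is an assumption, e.g., tuned on in-distribution data.
        return trajectory_volatility(hidden_states) > threshold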
Large language models (LLMs) have played a pivotal role in building communicative AI to imitate human behaviors but face the challenge of efficient customization. To tackle this challenge, recent studies have delved into the realm of model editing, which manipulates specific memories of language models and changes the related language generation. However, the robustness of model editing remains an open question. This work seeks to understand the strengths and limitations of editing methods, thus facilitating robust, realistic applications of communicative AI. Concretely, we conduct extensive analysis to address the three key research questions. Q1: Can edited LLMs behave consistently resembling communicative AI in realistic situations? Q2: To what extent does the rephrasing of prompts lead LLMs to deviate from the edited knowledge memory? Q3: Which knowledge features are correlated with the performance and robustness of editing? Our experimental results uncover a substantial disparity between existing editing methods and the practical application of LLMs. On rephrased prompts that are complex and flexible but common in realistic applications, the performance of editing experiences a significant decline. Further analysis shows that more popular knowledge is memorized better, easier to recall, and more challenging to edit effectively.
@article{ma2024possible, title={Is it Possible to Edit Large Language Models Robustly?}, author={Ma, Xinbei and Ju, Tianjie and Qiu, Jiyang and Zhang, Zhuosheng and Zhao, Hai and Liu, Lifeng and Wang, Yulong}, journal={arXiv preprint arXiv:2402.05827}, year={2024} }
Despite the rapid progress of large language models (LLMs), their task performance remains sensitive to prompt design. Recent studies have explored leveraging the LLM itself as an optimizer to identify optimal prompts that maximize task accuracy. However, when evaluating prompts, such approaches heavily rely on manually annotated gold labels, which are often elusive, to calculate task accuracy for each candidate prompt; this hinders widespread implementation and generality. To overcome this limitation, this work proposes gold label-agnostic prompt evaluation (GLaPE) to alleviate dependence on gold labels. Motivated by the observed correlation between self-consistency and answer accuracy, we adopt self-consistency as the initial evaluation score. Subsequently, we refine the scores of prompts producing identical answers to be mutually consistent. Experimental results show that GLaPE provides reliable evaluations uniform with accuracy, even in the absence of gold labels. Moreover, on six popular reasoning tasks, our GLaPE-based prompt optimization yields effective prompts comparable to accuracy-based ones.
@article{zhang2024glape, title={GLaPE: Gold Label-agnostic Prompt Evaluation and Optimization for Large Language Model}, author={Zhang, Xuanchang and Zhang, Zhuosheng and Zhao, Hai}, journal={arXiv preprint arXiv:2402.02408}, year={2024} }
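As a rough sketch of the GLaPE procedure, the Python fragment below computes self-consistency as the initial label-free score and then averages scores across prompts whose majority answers coincide; sample is an assumed LLM sampler, and the refinement rule is simplified relative to the paper.

    from collections import Counter, defaultdict

    def majority_and_consistency(prompt, question, sample, n=8):
        # sample(prompt, question) -> answer string from any stochastic LLM.
        answers = [sample(prompt, question) for _ in range(n)]
        answer, count = Counter(answers).most_common(1)[0]
        return answer, count / n

    def glape_scores(prompts, questions, sample):
        # Step 1: self-consistency as the gold-label-agnostic initial score.
        majority, score = {}, {}
        for p in prompts:
            per_q = [majority_and_consistency(p, q, sample) for q in questions]
            majority[p] = tuple(a for a, _ in per_q)
            score[p] = sum(c for _, c in per_q) / len(per_q)
        # Step 2: prompts producing identical answers receive mutually
        # consistent (here: averaged) scores.
        groups = defaultdict(list)
        for p in prompts:
            groups[majority[p]].append(p)
        for members in groups.values():
            mean = sum(score[p] for p in members) / len(members)
            for p in members:
                score[p] = mean
        return score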
Large language models (LLMs) have exhibited great potential in autonomously completing tasks across real-world applications. Despite this, these LLM agents introduce unexpected safety risks when operating in interactive environments. Whereas most prior studies center on the safety of LLM-generated content, this work addresses the imperative need to benchmark the behavioral safety of LLM agents within diverse environments. We introduce R-Judge, a benchmark crafted to evaluate the proficiency of LLMs in judging safety risks given agent interaction records. R-Judge comprises 162 agent interaction records, encompassing 27 key risk scenarios among 7 application categories and 10 risk types. It incorporates human consensus on safety with annotated safety risk labels and high-quality risk descriptions. Utilizing R-Judge, we conduct a comprehensive evaluation of 8 prominent LLMs commonly employed as the backbone for agents. The best-performing model, GPT-4, achieves 72.29% in contrast to the human score of 89.38%, showing considerable room for enhancing the risk awareness of LLMs. Notably, leveraging risk descriptions as environment feedback significantly improves model performance, revealing the importance of salient safety risk feedback. Furthermore, we design an effective chain-of-safety-analysis technique to aid the judgment of safety risks and conduct an in-depth case study to facilitate future research. R-Judge is publicly available at https://github.com/Lordog/R-Judge.
@article{yuan2024r, title={R-Judge: Benchmarking Safety Risk Awareness for LLM Agents}, author={Yuan, Tongxin and He, Zhiwei and Dong, Lingzhong and Wang, Yiming and Zhao, Ruijie and Xia, Tian and Xu, Lizhen and Zhou, Binglin and Li, Fangqi and Zhang, Zhuosheng and Wang, Rui and Liu, Gongshen}, journal={arXiv preprint arXiv:2401.10019}, year={2024} }
The advent of large language models (LLMs) has spurred considerable interest in advancing autonomous LLM-based agents, particularly in intriguing applications within smartphone graphical user interfaces (GUIs). When presented with a task goal, these agents typically emulate human actions within a GUI environment until the task is completed. However, a key challenge lies in devising effective plans to guide action prediction in GUI tasks, even though planning has been widely recognized as effective for decomposing complex tasks into a series of steps. Specifically, given the dynamic nature of environmental GUIs following action execution, it is crucial to dynamically adapt plans based on environmental feedback and action history. We show that the widely used ReAct approach fails due to excessively long historical dialogues. To address this challenge, we propose a novel approach called Dynamic Planning of Thoughts (D-PoT) for LLM-based GUI agents. D-PoT involves the dynamic adjustment of planning based on environmental feedback and execution history. Experimental results reveal that the proposed D-PoT significantly surpasses the strong GPT-4V baseline by +12.7% (34.66% → 47.36%) in accuracy. The analysis highlights the generality of dynamic planning across different backbone LLMs, as well as its benefits in mitigating hallucinations and adapting to unseen tasks. Code is available at https://github.com/sqzhang-lazy/D-PoT.
@article{zhang2024dynamic, title={Dynamic Planning for LLM-based Graphical User Interface Automation}, author={Zhang, Shaoqing and Zhang, Zhuosheng and Chen, Kehai and Ma, Xinbei and Yang, Muyun and Zhao, Tiejun and Zhang, Min}, journal={arXiv preprint arXiv:2410.00467}, year={2024} }
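A minimal sketch of the dynamic-planning loop follows; llm and the env interface (observe/execute) are assumed placeholders, and the prompts are illustrative rather than the paper's exact templates.

    def run_gui_agent(goal, env, llm, max_steps=20):
        # llm(prompt) -> str; env.observe() -> screen description;
        # env.execute(action) -> feedback string. All assumed interfaces.
        history = []
        for _ in range(max_steps):
            screen = env.observe()
            # Re-plan at every step from the goal, the current screen, and
            # the execution history -- the core of dynamic planning.
            plan = llm(f"Goal: {goal}\nScreen: {screen}\nHistory: {history}\n"
                       "Update the remaining plan as a numbered list.")
            action = llm(f"Plan: {plan}\nScreen: {screen}\nNext single action:")
            feedback = env.execute(action)
            history.append((action, feedback))
            if "TASK_COMPLETE" in feedback:
                break
        return history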
Large language models (LLMs) have shown impressive performance on complex reasoning by leveraging chain-of-thought (CoT) prompting to generate intermediate reasoning chains as the rationale to infer the answer. However, existing CoT studies are mostly isolated in the language modality with LLMs, where LLMs are hard to deploy. To elicit CoT reasoning in multimodality, a possible solution is to fine-tune small language models by fusing the vision and language features to perform CoT reasoning. The key challenge is that those language models tend to generate hallucinated reasoning chains that mislead the answer inference. To mitigate the effect of such mistakes, we propose Multimodal-CoT that incorporates vision features in a decoupled training framework. The framework separates the rationale generation and answer inference into two stages. By incorporating the vision features in both stages, the model is able to generate effective rationales that contribute to answer inference. With Multimodal-CoT, our model under 1 billion parameters outperforms the previous state-of-the-art LLM (GPT-3.5) by 16% (75.17% → 91.68%) on the ScienceQA benchmark and even surpasses human performance. Code is publicly available at https://github.com/amazon-science/mm-cot.
@article{zhang2023multicot, title={Multimodal Chain-of-Thought Reasoning in Language Models}, author={Zhang, Zhuosheng and Zhang, Aston and Li, Mu and Zhao, Hai and Karypis, George and Smola, Alex}, journal={arXiv preprint arXiv:2302.00923}, year={2023} }
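The two-stage framework can be summarized in a few lines; model.generate is an assumed interface over a fused vision-language model, not the released code's API.

    def multimodal_cot(model, question, context, image_features):
        # Stage 1: rationale generation, conditioned on text + vision.
        rationale = model.generate(text=f"{question} {context}",
                                   vision=image_features, target="rationale")
        # Stage 2: answer inference, conditioned on the same inputs plus the
        # generated rationale, so vision features ground both stages.
        answer = model.generate(text=f"{question} {context} {rationale}",
                                vision=image_features, target="answer")
        return rationale, answer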
Recent work has showcased the powerful capability of large language models (LLMs) in recalling knowledge and reasoning. However, the reliability of LLMs in combining these two capabilities into reasoning through multi-hop facts has not been widely explored. This paper systematically investigates the possibilities for LLMs to utilize shortcuts based on direct connections between the initial and terminal entities of multi-hop knowledge. We first explore the existence of factual shortcuts through Knowledge Neurons, revealing that: (i) the strength of factual shortcuts is highly correlated with the frequency of co-occurrence of initial and terminal entities in the pre-training corpora; (ii) few-shot prompting leverages more shortcuts than chain-of-thought prompting in answering multi-hop questions. Then, we analyze the risks posed by factual shortcuts from the perspective of multi-hop knowledge editing. Analysis shows that approximately 20% of the failures are attributed to shortcuts, and the initial and terminal entities in these failure instances usually have higher co-occurrences in the pre-training corpus. Finally, we propose erasing shortcut neurons to mitigate the associated risks and find that this approach significantly reduces failures in multi-hop knowledge editing caused by shortcuts.
@article{ju2024investigating, title={Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models}, author={Ju, Tianjie and Chen, Yijin and Yuan, Xinwei and Zhang, Zhuosheng and Du, Wei and Zheng, Yubin and Liu, Gongshen}, journal={arXiv preprint arXiv:2402.11900}, year={2024} }
Despite the notable success of language models (LMs) in various natural language processing (NLP) tasks, the reliability of LMs is susceptible to backdoor attacks. Prior research attempts to mitigate backdoor learning while training the LMs on the poisoned dataset, yet struggles against complex backdoor attacks in real-world scenarios. In this paper, we investigate the learning mechanisms of backdoor LMs in the frequency space by Fourier analysis. Our findings indicate that the backdoor mapping presented on the poisoned datasets exhibits a more discernible inclination towards lower frequency compared to clean mapping, resulting in the faster convergence of backdoor mapping. To alleviate this dilemma, we propose Multi-Scale Low-Rank Adaptation (MuScleLoRA), which deploys multiple radial scalings in the frequency space with low-rank adaptation to the target model and further aligns the gradients when updating parameters. Through downscaling in the frequency space, MuScleLoRA encourages the model to prioritize the learning of relatively high-frequency clean mapping, consequently mitigating backdoor learning. Experimental results demonstrate that MuScleLoRA outperforms baselines significantly. Notably, MuScleLoRA reduces the average success rate of diverse backdoor attacks to below 15% across multiple datasets and generalizes to various backbone LMs, including BERT, RoBERTa, and Llama2.
@article{wu2024acquiring, title={Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space}, author={Wu, Zongru and Zhang, Zhuosheng and Cheng, Pengzhou and Liu, Gongshen}, journal={arXiv preprint arXiv:2402.12026}, year={2024} }
Text watermarking technology aims to tag and identify content produced by large language models (LLMs) to prevent misuse. In this study, we introduce the concept of "cross-lingual consistency" in text watermarking, which assesses the ability of text watermarks to maintain their effectiveness after being translated into other languages. Preliminary empirical results from two LLMs and three watermarking methods reveal that current text watermarking technologies lack consistency when texts are translated into various languages. Based on this observation, we propose a Cross-lingual Watermark Removal Attack (CWRA) to bypass watermarking by first obtaining a response from an LLM in a pivot language, which is then translated into the target language. CWRA can effectively remove watermarks by reducing the Area Under the Curve (AUC) from 0.95 to 0.67 without performance loss. Furthermore, we analyze two key factors that contribute to the cross-lingual consistency in text watermarking and propose a defense method that increases the AUC from 0.67 to 0.88 under CWRA.
@article{he2024can, title={Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models}, author={He, Zhiwei and Zhou, Binglin and Hao, Hongkun and Liu, Aiwei and Wang, Xing and Tu, Zhaopeng and Zhang, Zhuosheng and Wang, Rui}, journal={arXiv preprint arXiv:2402.14007}, year={2024} }
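The CWRA pipeline reduces to two translation hops around the watermarked model, as in this hedged sketch; llm and translate are assumed callables standing in for the watermarked model and any MT system.

    def cwra(query, llm, translate, pivot="zh", target="en"):
        # 1) Query the watermarked LLM in a pivot language.
        pivot_query = translate(query, src=target, tgt=pivot)
        pivot_response = llm(pivot_query)  # watermark is embedded here
        # 2) Translate the response back into the target language; the
        # watermark does not survive the cross-lingual mapping.
        return translate(pivot_response, src=pivot, tgt=target)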
Autonomous user interface (UI) agents aim to facilitate task automation by interacting with the user interface without manual intervention. Recent studies have investigated eliciting the capabilities of large language models (LLMs) for effective engagement in diverse environments. To align with the input-output requirement of LLMs, existing approaches are developed under a sandbox setting where they rely on external tools and application-specific APIs to parse the environment into textual elements and interpret the predicted actions. Consequently, those approaches often grapple with inference inefficiency and error propagation risks. To mitigate the challenges, we introduce Auto-UI, a multimodal solution that directly interacts with the interface, bypassing the need for environment parsing or reliance on application-dependent APIs. Moreover, we propose a chain-of-action technique -- leveraging a series of intermediate previous action histories and future action plans -- to help the agent decide what action to execute. We evaluate our approach on a new device-control benchmark AITW with 30K unique instructions, spanning multi-step tasks such as application operation, web searching, and web shopping. Experimental results show that Auto-UI achieves state-of-the-art performance with an action type prediction accuracy of 90% and an overall action success rate of 74%. Code is publicly available at https://github.com/cooelf/Auto-UI.
@article{zhang2023autoui, title={You Only Look at Screens: Multimodal Chain-of-Action Agents}, author={Zhang, Zhuosheng and Zhang, Aston}, journal={arXiv preprint arXiv:2309.11436}, year={2023} }
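The chain-of-action conditioning can be pictured as prompt assembly, as in the illustrative sketch below; note that Auto-UI itself fuses screen features directly rather than through text, and all field names here are hypothetical.

    def chain_of_action_prompt(goal, screen, previous_actions, future_plan):
        # Condition the next-action prediction on both the executed action
        # history and the current plan of future actions.
        history = "\n".join(f"{i + 1}. {a}" for i, a in enumerate(previous_actions))
        return (f"Goal: {goal}\n"
                f"Previous actions:\n{history}\n"
                f"Planned future actions: {future_plan}\n"
                f"Screen: {screen}\n"
                "Predict the next action (type, touch point, typed text):")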
Bargaining is an important and unique part of negotiation between humans. As LLM-driven agents learn to negotiate and act like real humans, how to evaluate agents' bargaining abilities remains an open problem. For the first time, we formally describe the bargaining task as an asymmetric incomplete-information game, defining the gains of the Buyer and Seller across multiple bargaining processes. This allows us to quantitatively assess an agent's performance in the bargaining task. We collected a real product price dataset, AmazonHistoryPrice, and evaluated the bargaining abilities of various LLM agents. We find that playing the Buyer is much harder than playing the Seller, and that increasing model size cannot effectively improve the Buyer's performance. To address this challenge, we propose a novel approach called OG-Narrator that integrates a deterministic Offer Generator to control the price range of the Buyer's offers and an LLM Narrator to create natural-language utterances for the generated offers. Experimental results show that OG-Narrator improves the Buyer's deal rate from 26.67% to 88.88% and yields a tenfold increase in profits across all baselines, even for a model that has not been aligned.
@article{xia2024measuring, title={Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method}, author={Xia, Tian and He, Zhiwei and Ren, Tong and Miao, Yibo and Zhang, Zhuosheng and Yang, Yang and Wang, Rui}, journal={arXiv preprint arXiv:2402.15813}, year={2024} }
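A compact sketch of the OG-Narrator split is below; the concession step (0.3) and the prompt are illustrative assumptions, not the paper's exact offer rule.

    def og_narrator_turn(state, llm):
        # Deterministic Offer Generator: a rule, not the LLM, fixes the price.
        offer = state["buyer_last"] + 0.3 * (state["seller_price"] - state["buyer_last"])
        # LLM Narrator: wrap the numeric offer in natural bargaining language.
        utterance = llm("You are the buyer. Politely offer exactly "
                        f"${offer:.2f}, given this dialogue: {state['dialogue']}")
        return offer, utterance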
@article{ma2024comprehensive, title={Comprehensive Cognitive LLM Agent for Smartphone GUI Automation}, author={Ma, Xinbei and Zhang, Zhuosheng and Zhao, Hai}, journal={arXiv preprint arXiv:2402.11941}, year={2024} }
Large language models (LLMs), despite their remarkable progress across various general domains, encounter significant barriers in medicine and healthcare. This field faces unique challenges such as domain-specific terminologies and reasoning over specialized knowledge. To address these issues, we propose a novel Multi-disciplinary Collaboration (MC) framework for the medical domain that leverages role-playing LLM-based agents participating in a collaborative multi-round discussion, thereby enhancing LLM proficiency and reasoning capabilities. This training-free and interpretable framework encompasses five critical steps: gathering domain experts, proposing individual analyses, summarizing these analyses into a report, iterating over discussions until a consensus is reached, and ultimately making a decision. Our work focuses on the zero-shot scenario; our results on nine datasets (MedQA, MedMCQA, PubMedQA, and six subtasks from MMLU) establish that the proposed MC framework excels at mining and harnessing the medical expertise in LLMs, as well as extending their reasoning abilities. Based on these outcomes, we further conduct a human evaluation to pinpoint and categorize common errors within our method, as well as ablation studies aimed at understanding the impact of various factors on overall performance.
@article{tang2023medagents, title={MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning}, author={Tang, Xiangru and Zou, Anni and Zhang, Zhuosheng and Zhao, Yilun and Zhang, Xingyao and Cohan, Arman and Gerstein, Mark}, journal={arXiv preprint arXiv:2311.10537}, year={2023} }
This paper studies the problem of solving complex chemistry problems with large language models (LLMs). Despite the extensive general knowledge in LLMs (such as GPT-4), they struggle with chemistry reasoning, which requires faithful grounded reasoning with diverse chemical knowledge and an integrative understanding of chemical interactions. We propose InstructChem, a new structured reasoning approach that substantially boosts LLMs' chemical reasoning capabilities. InstructChem explicitly decomposes the reasoning into three critical phases: chemical formulae generation by LLMs, which offers the basis for subsequent grounded reasoning; step-by-step reasoning, which makes multi-step derivations with the identified formulae for a preliminary answer; and iterative review-and-refinement, which steers LLMs to progressively revise the previous phases for increased confidence, leading to the final high-confidence answer. We conduct extensive experiments on four different chemistry challenges, including quantum chemistry, quantum mechanics, physical chemistry, and chemical kinetics. Our approach significantly enhances GPT-4 on chemistry reasoning, yielding an 8% average absolute improvement and a 30% peak improvement. We further use the reasoning generated by GPT-4 to fine-tune smaller LMs (e.g., Vicuna) and observe strong improvements in the smaller LMs. This validates our approach and enables LLMs to generate high-quality reasoning.
@article{ouyang2023structured, title={Structured Chemistry Reasoning with Large Language Models}, author={Ouyang, Siru and Zhang, Zhuosheng and Yan, Bing and Liu, Xuan and Han, Jiawei and Qin, Lianhui}, journal={arXiv preprint arXiv:2311.09656}, year={2023} }
Open-Domain Question Answering (ODQA) aims to answer questions without explicitly providing specific background documents. This task becomes notably challenging in a zero-shot setting where no data is available to train tailored retrieval-reader models. While recent Large Language Models (LLMs) like GPT-3 have demonstrated their effectiveness in zero-shot ODQA using direct prompting methods, these methods still fall short of fully harnessing the potential of LLMs when implicitly invoked. In this paper, we propose a Self-Prompting framework to explicitly utilize the massive knowledge encoded in the parameters of LLMs and their strong instruction understanding abilities. Concretely, we prompt LLMs step by step to generate multiple pseudo QA pairs with background passages and explanations entirely from scratch. These generated elements are then utilized for in-context learning. Experimental results show that our method significantly surpasses previous state-of-the-art zero-shot methods on three widely-used ODQA datasets and even achieves comparable performance with various customized fine-tuned models on full training data. Our code is available at https://github.com/lockon-n/self-prompting.
@article{li2022self, title={Self-Prompting Large Language Models for Open-Domain QA}, author={Li, Junlong and Zhang, Zhuosheng and Zhao, Hai}, journal={arXiv preprint arXiv:2212.08635}, year={2022} }
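The self-prompting loop is easy to sketch; llm is an assumed completion function and the prompts are illustrative, not the paper's templates.

    def self_prompting_odqa(question, llm, n_demos=4):
        demos = []
        for _ in range(n_demos):
            # Generate pseudo (passage, QA, explanation) demonstrations from
            # scratch, purely from the LLM's parametric knowledge.
            passage = llm("Write a short factual background passage on any topic.")
            qa = llm(f"Passage: {passage}\nWrite one question answerable from the "
                     "passage, its short answer, and a one-line explanation.")
            demos.append(f"Passage: {passage}\n{qa}")
        # Use the self-generated demonstrations for in-context learning.
        prompt = "\n\n".join(demos) + f"\n\nQuestion: {question}\nAnswer:"
        return llm(prompt)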
Insufficient modeling of human preferences within the reward model is a major obstacle for leveraging human feedback to improve translation quality. Fortunately, quality estimation (QE), which predicts the quality of a given translation without reference, has achieved impressive alignment with human evaluations in the last two years. In this work, we investigate the potential of employing the QE model as the reward model (the QE-based reward model) to predict human preferences for feedback training. We first identify the overoptimization problem during QE-based feedback training, manifested as an increase in reward while translation quality declines. We examine the problem and argue that the vulnerability of the QE model might lead to high rewards for incorrect translations, resulting in overoptimization and error propagation. To address the problem, we adopt a simple yet effective method that uses heuristic rules to detect the incorrect translations and assigns a penalty term to the QE-based rewards for the detected incorrect translations. Experimental results show that the proposed QE-based feedback training achieves consistent and significant improvements across various settings, further verified through human preference studies. Our subsequent analysis demonstrates the high data efficiency of the proposed QE-based feedback training: the proposed approach using a small amount of monolingual data can outperform systems using larger parallel corpora.
@article{he2024improving, title={Improving machine translation with human feedback: An exploration of quality estimation as a reward model}, author={He, Zhiwei and Wang, Xing and Jiao, Wenxiang and Zhang, Zhuosheng and Wang, Rui and Shi, Shuming and Tu, Zhaopeng}, journal={arXiv preprint arXiv:2401.12873}, year={2024} }
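The penalty idea amounts to one conditional on top of the QE score, as in this sketch; qe_score, looks_incorrect, and the penalty weight are assumed stand-ins for the QE model and the paper's heuristic rules.

    def penalized_qe_reward(source, translation, qe_score, looks_incorrect,
                            penalty=1.0):
        # qe_score(source, translation) -> reference-free quality estimate.
        reward = qe_score(source, translation)
        # Heuristic rules (e.g., empty or truncated output, copied source,
        # wrong language) flag incorrect translations that QE may overrate.
        if looks_incorrect(source, translation):
            reward -= penalty  # damp over-optimization on degenerate outputs
        return reward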
Large language models have manifested remarkable capabilities by leveraging chain-of-thought (CoT) reasoning techniques to solve intricate questions through step-by-step reasoning chains. Despite its success, the efficacy of such reasoning is inherently contingent upon the quality of CoT. However, flawless CoT reasoning cannot be guaranteed due to the presence of indecomposable questions and the potential for erroneous reasoning chains, particularly in the case of small-scale language models. To tackle this challenge, we propose a novel approach called the selective filtering reasoner (SelF-Reasoner) that assesses the entailment relationship between the question and the candidate reasoning chain. Then, we proceed with CoT reasoning when the reasoning chain demonstrates confidence; otherwise, we opt to predict the answer directly. SelF-Reasoner improves the fine-tuned T5 baseline consistently over the ScienceQA, ECQA, and LastLetter tasks.
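The selective-filtering decision is a single gate, sketched below; generate_chain, entail_score, answer, and the 0.5 threshold are assumptions standing in for the fine-tuned components.

    def self_reasoner(question, generate_chain, entail_score, answer, tau=0.5):
        chain = generate_chain(question)
        # Keep the chain only if the entailment model judges it consistent
        # with the question; otherwise answer directly without CoT.
        if entail_score(question, chain) >= tau:
            return answer(f"{question}\nReasoning: {chain}\nTherefore:")
        return answer(question)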
Large language models (LLMs) empowered by chain-of-thought (CoT) prompting have yielded remarkable prowess in reasoning tasks. Nevertheless, current methods predominantly lean on handcrafted or task-specific demonstrations and lack a reliable knowledge basis, and thus struggle to produce trustworthy responses in an automated fashion. While recent works endeavor to improve one particular aspect, they ignore the importance and necessity of establishing an integrated and interpretable reasoning system. To address these drawbacks and provide a universal solution, we propose AuRoRA: a one-for-all platform for augmented reasoning and refining based on CoT prompting that excels in adaptability, reliability, integrity, and interpretability. The system exhibits superior performance across six reasoning tasks and offers real-time visual analysis, which has pivotal academic and application value in the era of LLMs.
Dialogue-related machine reading comprehension requires language models to effectively decouple and model multi-turn dialogue passages. As a dialogue develops following the intentions of its participants, its topic may not remain constant throughout the passage. Hence, it is non-trivial to detect and leverage topic shifts in dialogue modeling. Topic modeling, although widely studied for plain text, deserves far more attention in dialogue reading comprehension. This paper proposes to model multi-turn dialogues from a topic-aware perspective. We start with a dialogue segmentation algorithm that splits a dialogue passage into topic-concentrated fragments in an unsupervised way. We then use these fragments as topic-aware language processing units in further dialogue comprehension. On one hand, the split segments indicate specific topics rather than mixed intentions, making them convenient for in-domain topic detection and location. For this task, we design a clustering system with a self-training auto-encoder and construct two datasets for evaluation. On the other hand, the split segments are appropriate elements for multi-turn dialogue response selection. For this purpose, we further present a novel model, the Topic-Aware Dual-Attention Matching (TADAM) network, which takes topic segments as processing elements and matches response candidates with dual cross-attention. Empirical studies on three public benchmarks show great improvements over baselines. Our work continues previous studies on document topics and brings dialogue modeling to a novel topic-aware perspective with exhaustive experiments and analyses.
@article{ma2023multi, title={Multi-turn Dialogue Comprehension from a Topic-aware Perspective}, author={Ma, Xinbei and Xu, Yi and Zhao, Hai and Zhang, Zhuosheng}, journal={arXiv preprint arXiv:2309.09666}, year={2023} }
Recent years have witnessed an increasing interest in training machines with reasoning ability, which deeply relies on accurately and clearly presented clue forms. The clues are usually modeled as entity-aware knowledge in existing studies. However, those entity-aware clues are primarily focused on commonsense, making them insufficient for tasks that require knowledge of temporary facts or events, particularly in logical reasoning for reading comprehension. To address this challenge, we are motivated to cover both commonsense and temporary knowledge clues hierarchically. Specifically, we propose a general formalism of knowledge units by extracting backbone constituents of the sentence, such as the subject-verb-object formed "facts". We then construct a supergraph on top of the fact units, allowing for the benefit of sentence-level (relations among fact groups) and entity-level interactions (concepts or actions inside a fact). Experimental results on logical reasoning benchmarks and dialogue modeling datasets show that our approach improves the baselines substantially, and it is general across backbone models.
@article{ouyang2024fact, title={Fact-driven Logical Reasoning for Machine Reading Comprehension}, author={Ouyang, Siru and Zhang, Zhuosheng and Zhao, Hai}, journal={The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)}, year={2024} }
Large language models (LLMs) have demonstrated impressive capabilities in general scenarios, exhibiting a level of aptitude that approaches, and in some aspects even surpasses, human-level intelligence. Among their numerous skills, the translation abilities of LLMs have received considerable attention. In contrast to traditional machine translation that focuses solely on source-target mapping, LLM-based translation can potentially mimic the human translation process, which takes many preparatory steps to ensure high-quality translation. This work aims to explore this possibility by proposing the MAPS framework, which stands for Multi-Aspect Prompting and Selection. Specifically, we enable LLMs to first analyze the given source text and extract three aspects of translation-related knowledge: keywords, topics, and relevant demonstrations to guide the translation process. To filter out noisy and unhelpful knowledge, we employ a selection mechanism based on quality estimation. Experiments suggest that MAPS brings significant and consistent improvements over text-davinci-003 and Alpaca on eight translation directions from the latest WMT22 test sets. Our further analysis shows that the extracted knowledge is critical in resolving up to 59% of hallucination mistakes in translation. Code is available at https://github.com/zwhe99/MAPS-mt.
@article{he2023exploring, title={Exploring Human-Like Translation Strategy with Large Language Models}, author={He, Zhiwei and Liang, Tian and Jiao, Wenxiang and Zhang, Zhuosheng and Yang, Yujiu and Wang, Rui and Tu, Zhaopeng and Shi, Shuming and Wang, Xing}, journal={arXiv preprint arXiv:2305.04118}, year={2023} }
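MAPS is essentially knowledge elicitation, candidate generation, and QE-based selection; the sketch below assumes llm and qe_score callables and illustrative prompts.

    def maps_translate(src, llm, qe_score, direction="English to German"):
        # Step 1: elicit three kinds of translation knowledge from the source.
        keywords = llm(f"Extract keyword pairs for translating ({direction}): {src}")
        topic = llm(f"State the topic of this text in a few words: {src}")
        demo = llm(f"Write one short related example translation ({direction}): {src}")
        # Step 2: translate with each knowledge type (and with none).
        hints = ["", f"Keywords: {keywords}", f"Topic: {topic}", f"Demonstration: {demo}"]
        candidates = [llm(f"{h}\nTranslate ({direction}): {src}") for h in hints]
        # Step 3: filter noisy knowledge by keeping the QE-preferred candidate.
        return max(candidates, key=lambda t: qe_score(src, t))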
Spurred by advancements in scale, large language models (LLMs) have demonstrated the ability to perform a variety of natural language processing (NLP) tasks zero-shot -- i.e., without adaptation on downstream data. Recently, the debut of ChatGPT has drawn a great deal of attention from the NLP community because it can generate high-quality responses to human input and self-correct previous mistakes based on subsequent conversations. However, it is not yet known whether ChatGPT can serve as a generalist model that can perform many NLP tasks zero-shot. In this work, we empirically analyze the zero-shot learning ability of ChatGPT by evaluating it on 20 popular NLP datasets covering 7 representative task categories. With extensive empirical studies, we demonstrate both the effectiveness and limitations of the current version of ChatGPT. We find that ChatGPT performs well on many tasks favoring reasoning capabilities (e.g., arithmetic reasoning), while it still faces challenges when solving specific tasks such as sequence tagging. We additionally provide in-depth analysis through qualitative case studies.
@article{qin2023chatgpt, title={Is ChatGPT a General-Purpose Natural Language Processing Task Solver?}, author={Qin, Chengwei and Zhang, Aston and Zhang, Zhuosheng and Chen, Jiaao and Yasunaga, Michihiro and Yang, Diyi}, journal={arXiv preprint arXiv:2302.06476}, year={2023} }
Automatic summarization generates concise summaries that contain the key ideas of source documents. As the most mainstream datasets for the news sub-domain, CNN/DailyMail and BBC XSum have been widely used for performance benchmarking. However, the reference summaries of those datasets turn out to be noisy, mainly in terms of factual hallucination and information redundancy. To address this challenge, we first annotate new expert-written Element-aware test sets following the "Lasswell Communication Model" proposed by Lasswell (1948), allowing reference summaries to focus objectively and comprehensively on more fine-grained news elements. Utilizing the new test sets, we observe the surprising zero-shot summarization ability of LLMs, which resolves the inconsistency between human preference and automatic evaluation metrics of LLMs' zero-shot summaries reported in prior work. Further, we propose a Summary Chain-of-Thought (SumCoT) technique to elicit LLMs to generate summaries step by step, which helps them integrate more fine-grained details of source documents into the final summaries, in line with the human writing mindset. Experimental results show our method outperforms state-of-the-art fine-tuned PLMs and zero-shot LLMs by +4.33/+4.77 in ROUGE-L on the two datasets, respectively. Dataset and code are publicly available at https://github.com/Alsace08/SumCoT.
@inproceedings{wang2023element, title={Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method}, author={Wang, Yiming and Zhang, Zhuosheng and Wang, Rui}, booktitle={The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)}, year={2023} }
Masked Language Modeling (MLM) has been widely used as the denoising objective in pre-training language models (PrLMs). Existing PrLMs commonly adopt a random-token masking strategy where a fixed masking ratio is applied and different contents are masked with equal probability throughout training. However, the model may be affected in complicated ways by the pre-training status, which changes as training proceeds. In this paper, we show that such time-invariant MLM settings for masking ratio and masked content are unlikely to deliver an optimal outcome, which motivates us to explore the influence of time-variant MLM settings. We propose two scheduled masking approaches that adaptively tune the masking ratio and masked content in different training stages, improving the pre-training efficiency and effectiveness as verified on downstream tasks. Our work is a pioneering study of time-variant masking strategies for ratio and content, and it provides a better understanding of how masking ratio and masked content influence MLM pre-training.
@inproceedings{yang2023learning, title={Learning Better Masking for Better Language Model Pre-training}, author={Yang, Dongjie and Zhang, Zhuosheng and Zhao, Hai}, booktitle={The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)}, year={2023} }
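A time-variant masking ratio can be as simple as a schedule evaluated at each step; the linear decay and endpoints below are illustrative, not the paper's tuned settings.

    def masking_ratio(step, total_steps, start=0.30, end=0.15):
        # Decay the masking ratio over pre-training instead of holding it at
        # a fixed value (e.g., BERT's constant 15%).
        t = step / max(1, total_steps)
        return start + t * (end - start)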
Commonsense fact verification, as a challenging branch of commonsense question answering (QA), aims to verify through facts whether a given commonsense claim is correct. Answering commonsense questions necessitates a combination of knowledge from various levels. However, existing studies primarily rest on grasping either unstructured evidence or potential reasoning paths from structured knowledge bases, failing to exploit the benefits of heterogeneous knowledge simultaneously. In light of this, we propose Decker, a commonsense fact verification model capable of bridging heterogeneous knowledge by uncovering latent relationships between structured and unstructured knowledge. Experimental results on two commonsense fact verification benchmark datasets, CSQA2.0 and CREAK, demonstrate the effectiveness of our Decker, and further analysis verifies its capability to capture more valuable information through reasoning.
@inproceedings{zou2023decker, title={Decker: Double Check with Heterogeneous Knowledge for Commonsense Fact Verification}, author={Zou, Anni and Zhang, Zhuosheng and Zhao, Hai}, booktitle={The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)}, year={2023} }
Large language models (LLMs) can perform complex reasoning by generating intermediate reasoning steps. Providing these steps for prompting demonstrations is called chain-of-thought (CoT) prompting. CoT prompting has two major paradigms. One leverages a simple prompt like "Let's think step by step" to facilitate step-by-step thinking before answering a question. The other uses a few manual demonstrations one by one, each composed of a question and a reasoning chain that leads to an answer. The superior performance of the second paradigm hinges on the hand-crafting of task-specific demonstrations one by one. We show that such manual efforts may be eliminated by leveraging LLMs with the "Let's think step by step" prompt to generate reasoning chains for demonstrations one by one, i.e., let's think not just step by step, but also one by one. However, these generated chains often come with mistakes. To mitigate the effect of such mistakes, we find that diversity matters for automatically constructing demonstrations. We propose an automatic CoT prompting method: Auto-CoT. It samples questions with diversity and generates reasoning chains to construct demonstrations. On ten public benchmark reasoning tasks with GPT-3, Auto-CoT consistently matches or exceeds the performance of the CoT paradigm that requires manual designs of demonstrations. Code is available at https://github.com/amazon-research/auto-cot.
@inproceedings{zhang2023automatic, title={Automatic Chain of Thought Prompting in Large Language Models}, author={Zhang, Zhuosheng and Zhang, Aston and Li, Mu and Smola, Alex}, booktitle={The Eleventh International Conference on Learning Representations (ICLR 2023)}, year={2023} }
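The Auto-CoT recipe (cluster for diversity, then zero-shot-generate one chain per cluster) fits in a short sketch; embed and llm are assumed callables, and picking the first member of each cluster simplifies the paper's distance-and-heuristics selection.

    import numpy as np
    from sklearn.cluster import KMeans

    def auto_cot_demos(questions, embed, llm, k=8):
        # Step 1: partition questions into k clusters to encourage diversity.
        X = np.stack([embed(q) for q in questions])
        labels = KMeans(n_clusters=k, n_init=10).fit_predict(X)
        demos = []
        for c in range(k):
            rep = questions[[i for i, l in enumerate(labels) if l == c][0]]
            # Step 2: generate the chain zero-shot with the magic prompt.
            chain = llm(f"Q: {rep}\nA: Let's think step by step.")
            demos.append(f"Q: {rep}\nA: Let's think step by step. {chain}")
        return demos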
Multi-party multi-turn dialogue comprehension brings unprecedented challenges in handling complicated scenarios, as the co-occurrence of multiple speakers causes complexity and inconsistency. As a result of the multiple participants, the shift of speaker roles and the crisscrossed discourse relations among utterances hinder reading comprehension. Motivated by this, we integrate enhancements of speaker-related features for dialogue comprehension. This work proposes a novel model with enhancement from both sides: speaker roles and speaker-aware relations. At the token level, we apply a speaker mask for attention, while at the discourse level, we utilize heterogeneous graph networks for comprehensive speaker-aware discourse clues. Experimental results show that our Enhanced Speaker-Aware (ESA) method helps achieve state-of-the-art performance on the Molweni dataset, as well as significant improvements on the FriendsQA dataset. We find that our method makes steady improvements on stronger backbones. Analysis shows that our model enhances the connections between utterances and their own speakers and captures the speaker-aware discourse relations. Discussions on data features and error cases are presented, along with a visualized case. The findings reveal the importance of speaker-aware signals in dialogue comprehension.
@article{10147329, author={Ma, Xinbei and Zhang, Zhuosheng and Zhao, Hai}, journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, title={Enhanced Speaker-aware Multi-party Multi-turn Dialogue Comprehension}, year={2023}, volume={}, number={}, pages={1-16}, doi={10.1109/TASLP.2023.3284516} }
Representation learning is the foundation of natural language processing (NLP). This work presents new methods to employ visual information as assistant signals for general NLP tasks. For each sentence, we first retrieve a flexible number of images either from a light topic-image lookup table extracted over the existing sentence-image pairs or from a shared cross-modal embedding space that is pre-trained on off-the-shelf text-image pairs. Then, the text and images are encoded by a Transformer encoder and a convolutional neural network, respectively. The two sequences of representations are further fused by an attention layer for the interaction of the two modalities. In this study, the retrieval process is controllable and flexible. The universal visual representation overcomes the lack of large-scale bilingual sentence-image pairs. Our method can be easily applied to text-only tasks without manually annotated multimodal parallel corpora. We apply the proposed method to a wide range of natural language generation and understanding tasks, including neural machine translation, natural language inference, and semantic similarity. Experimental results show that our method is generally effective for different tasks and languages. Analysis indicates that the visual signals enrich textual representations of content words, provide fine-grained grounding information about the relationship between concepts and events, and potentially contribute to disambiguation.
@article{zhang2023universal, author={Zhang, Zhuosheng and Chen, Kehai and Wang, Rui and Utiyama, Masao and Sumita, Eiichiro and Li, Zuchao and Zhao, Hai}, journal={IEEE Transactions on Pattern Analysis and Machine Intelligence}, title={Universal Multimodal Representation for Language Understanding}, year={2023}, volume={}, number={}, pages={1-18}, doi={10.1109/TPAMI.2023.3234170}}
Discriminative pre-trained language models (PLMs) learn to predict original texts from intentionally corrupted ones. Taking the former text as positive and the latter as negative samples, the PLM can be trained effectively for contextualized representation. However, the training of such PLMs highly relies on the quality of the automatically constructed samples. Existing PLMs simply treat all corrupted texts as equally negative without any examination, which inevitably lets the resulting model suffer from the false-negative issue, where training is carried out on pseudo-negative data, leading to less efficiency and less robustness in the resulting PLMs. In this work, on the basis of defining the long-ignored false-negative issue in discriminative PLMs, we design enhanced pre-training methods to counteract false-negative predictions and encourage pre-training language models on true negatives by correcting the harmful gradient updates subject to false-negative predictions. Experimental results on GLUE and SQuAD benchmarks show that our counter-false-negative pre-training methods indeed bring about better performance together with stronger robustness.
@inproceedings{zhang2023TrueNeg, title={Language Model Pre-training on True Negatives}, author={Zhang, Zhuosheng and Zhao, Hai and Utiyama, Masao and Sumita, Eiichiro}, booktitle={The Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023)}, year={2023} }
Understanding human language is one of the key themes of artificial intelligence. For language representation, the capacity to effectively model the linguistic knowledge in detail-riddled and lengthy texts while discarding the noise is essential for performance. Traditional attentive models attend to all words without explicit constraint, which results in inaccurate concentration on some dispensable words. In this work, we propose using syntax to guide text modeling by incorporating explicit syntactic constraints into attention mechanisms for better linguistically motivated word representations. In detail, for a Transformer-based encoder built on the self-attention network (SAN), we introduce a syntactic dependency of interest (SDOI) design into the SAN to form an SDOI-SAN with syntax-guided self-attention. The syntax-guided network (SG-Net) is then composed of this extra SDOI-SAN and the SAN from the original Transformer encoder through a dual contextual architecture for better linguistically inspired representation. The proposed SG-Net is applied to typical Transformer encoders. Extensive experiments on popular benchmark tasks, including machine reading comprehension, natural language inference, and neural machine translation, show the effectiveness of the proposed SG-Net design.
@article{zhang2022sg, title={SG-Net: Syntax Guided Transformer for Language Representation}, author={Zhang, Zhuosheng and Wu, Yuwei and Zhou, Junru and Duan, Sufeng and Zhao, Hai and Wang, Rui}, journal={IEEE Transactions on Pattern Analysis \& Machine Intelligence}, volume={44}, number={06}, pages={3285--3299}, year={2022}, publisher={IEEE Computer Society} }
Text encoding is one of the most important steps in Natural Language Processing (NLP). It has been done well by the self-attention mechanism in the current state-of-the-art Transformer encoder, which has brought about significant improvements in the performance of many NLP tasks. Though the Transformer encoder may effectively capture general information in its resulting representations, the backbone information, meaning the gist of the input text, is not specifically focused on. In this paper, we propose explicit and implicit text compression approaches to enhance the Transformer encoding and evaluate models using this approach on several typical downstream tasks that rely on the encoding heavily. Our explicit text compression approaches use dedicated models to compress text, while our implicit text compression approach simply adds an additional module to the main model to handle text compression. We propose three ways of integration, namely backbone source-side fusion, target-side fusion, and both-side fusion, to integrate the backbone information into Transformer-based models for various downstream tasks. Our evaluation on benchmark datasets shows that the proposed explicit and implicit text compression approaches improve results in comparison to strong baselines. We therefore conclude, when comparing the encodings to the baseline models, text compression helps the encoders to learn better language representations.
@article{li2022text, title={Text Compression-Aided Transformer Encoding}, author={Li, Zuchao and Zhang, Zhuosheng and Zhao, Hai and Wang, Rui and Chen, Kehai and Utiyama, Masao and Sumita, Eiichiro}, journal={IEEE Transactions on Pattern Analysis \& Machine Intelligence}, volume={44}, number={07}, pages={3840--3857}, year={2022}, publisher={IEEE Computer Society} }
Training machines to understand natural language and interact with humans is one of the major goals of artificial intelligence. Recent years have witnessed an evolution from matching networks to pre-trained language models (PrLMs). In contrast to the plain-text modeling that is the focus of PrLMs, dialogue texts involve multiple speakers and reflect special characteristics such as topic transitions and structure dependencies between distant utterances. However, the related PrLM models commonly represent dialogues sequentially by processing the pairwise dialogue history as a whole. Thus, the hierarchical information on either utterance interrelation or speaker roles coupled in such representations is not well addressed. In this work, we propose compositional learning for holistic interaction across the utterances beyond the sequential contextualization from PrLMs, in order to capture the utterance-aware and speaker-aware representations entailed in a dialogue history. We decouple the contextualized word representations by masking mechanisms in the Transformer-based PrLM, making each word focus only on the words in the current utterance, other utterances, and the two speaker roles (i.e., utterances of the sender and utterances of the receiver), respectively. In addition, we employ domain-adaptive training strategies to help the model adapt to the dialogue domains. Experimental results show that our method substantially boosts strong PrLM baselines on four public benchmark datasets, achieving new state-of-the-art performance over previous methods. |
@article{zhang2022cdn, author={Zhang, Zhuosheng and Zhao, Hai and Liu, Longxiang}, journal={IEEE Transactions on Neural Networks and Learning Systems}, title={Channel-Aware Decoupling Network for Multiturn Dialog Comprehension}, year={2022}, volume={}, number={}, pages={1-12}, doi={10.1109/TNNLS.2022.3220047} }
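A rough sketch of the masking idea (the four channel definitions below are one plausible reading built from per-token utterance and speaker ids; the paper's exact masks may differ):

```python
import torch

def decoupling_masks(utt_ids, spk_ids, sender_id):
    """utt_ids: (L,) utterance index per token; spk_ids: (L,) speaker id per token.
    Returns four (L, L) boolean masks, one attention channel each."""
    same_utt = utt_ids.unsqueeze(0) == utt_ids.unsqueeze(1)
    cur_mask = same_utt                                  # words in the same utterance
    oth_mask = ~same_utt                                 # words in other utterances
    snd_mask = (spk_ids == sender_id).unsqueeze(0).expand_as(same_utt)  # sender's words
    rcv_mask = (spk_ids != sender_id).unsqueeze(0).expand_as(same_utt)  # receiver's words
    return cur_mask, oth_mask, snd_mask, rcv_mask

# Each mask gates one attention channel: scores.masked_fill(~mask, float("-inf"))
```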
Discriminative pre-trained language models (PrLMs) can be generalized as denoising auto-encoders that work with two procedures, ennoising and denoising. First, an ennoising process corrupts texts with arbitrary noising functions to construct training instances. Then, a denoising language model is trained to restore the corrupted tokens. Existing studies have made progress by optimizing independent strategies of either ennoising or denoising. They treat training instances equally throughout the training process, paying little attention to the individual contribution of each instance. To model explicit signals of instance contribution, this work proposes to estimate the complexity of restoring the original sentences from corrupted ones in language model pre-training. The estimations involve the corruption degree in the ennoising data construction process and the prediction confidence in the denoising counterpart. Experimental results on natural language understanding and reading comprehension benchmarks show that our approach improves pre-training efficiency, effectiveness, and robustness. |
@article{zhang2022instance, title={Instance Regularization for Discriminative Language Model Pre-training}, author={Zhang, Zhuosheng and Zhao, Hai and Zhou, Ming}, journal={arXiv preprint arXiv:2210.05471}, year={2022} }
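A minimal sketch of such instance-level weighting, assuming the simplest estimator where heavily corrupted instances restored with low confidence count as harder (alpha and beta are illustrative hyperparameters; the paper's estimator may differ):

```python
import torch

def instance_weights(corruption_degree, restore_confidence, alpha=1.0, beta=1.0):
    """corruption_degree: (B,) fraction of corrupted tokens per instance;
    restore_confidence: (B,) mean model confidence when restoring them."""
    complexity = alpha * corruption_degree + beta * (1.0 - restore_confidence)
    return complexity / complexity.mean().clamp(min=1e-8)   # mean weight ~ 1

# Usage: loss = (per_instance_loss * instance_weights(deg, conf)).mean()
```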
Leveraging task-aware annotated data as supervised signals to assist with self-supervised learning on large-scale unlabeled data has become a new trend in pre-training language models. Existing studies show that multi-task learning with large-scale supervised tasks suffers from negative effects across tasks. To tackle this challenge, we propose a task-prefix-guided multi-task pre-training framework to explore the relationships among tasks. We conduct extensive experiments on 40 datasets, which show that our model can not only serve as a strong foundation backbone for a wide range of tasks but is also feasible as a probing tool for analyzing task relationships. The task relationships reflected by the prefixes align with transfer learning performance between tasks. They also suggest directions for data augmentation with complementary tasks, which help our model achieve human-parity results on commonsense reasoning leaderboards. |
@article{zhang2022task, title={Task Compass: Scaling Multi-task Pre-training with Task Prefix}, author={Zhang, Zhuosheng and Wang, Shuohang and Xu, Yichong and Fang, Yuwei and Yu, Wenhao and Liu, Yang and Zhao, Hai and Zhu, Chenguang and Zeng, Michael}, journal={arXiv preprint arXiv:2210.06277}, year={2022} }
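A toy illustration of the prefix mechanism (the bracketed format and the task name are invented for illustration; the paper's actual prefixes may differ):

```python
def add_task_prefix(task_name: str, text: str) -> str:
    """Prepend a task prefix so one multi-task model can condition on task identity."""
    return f"[{task_name.upper()}] {text}"

# add_task_prefix("nli", "A man plays guitar. Someone makes music.")
# -> "[NLI] A man plays guitar. Someone makes music."
```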
Multi-turn dialogue modeling, as a challenging branch of natural language understanding (NLU), aims to build representations for machines to understand human dialogues, providing a solid foundation for multiple downstream tasks. Recent studies of dialogue modeling commonly employ pre-trained language models (PrLMs) to encode the dialogue history as successive tokens, which is insufficient for capturing the temporal characteristics of dialogues. Therefore, we propose the Bidirectional Information Decoupling Network (BiDeN) as a universal dialogue encoder, which explicitly incorporates both the past and future contexts and can be generalized to a wide range of dialogue-related tasks. Experimental results on datasets of different downstream tasks demonstrate the universality and effectiveness of our BiDeN. |
@article{li2022back, title={Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Modeling}, author={Li, Yiyang and Zhao, Hai and Zhang, Zhuosheng}, journal={arXiv preprint arXiv:2204.08152}, year={2022} }
Machine reading comprehension (MRC) poses new challenges to logical reasoning, which aims to understand the implicit logical relations entailed in the given contexts and perform inference over them. Due to the complexity of logic, logical connections exist at different granularity levels. However, most existing methods of logical reasoning individually focus on either entity-aware or discourse-based information but ignore the hierarchical relations that may even have mutual effects. This paper proposes a holistic graph network (HGN) that deals with context at both discourse-level and word-level as the basis for logical reasoning to provide a more fine-grained relation extraction. Specifically, node-level and type-level relations, which can be interpreted as bridges in the reasoning process, are modeled by a hierarchical interaction mechanism to improve the interpretation of MRC systems. Experimental results on logical reasoning QA datasets (ReClor and LogiQA) and natural language inference datasets (SNLI and ANLI) show the effectiveness and generalization of our method, and in-depth analysis verifies its capability to understand complex logical relations. |
@inproceedings{chen2022hgm, title={Modeling Hierarchical Reasoning Chains by Linking Discourse Units and Key Phrases for Reading Comprehension}, author={Chen, Jialin and Zhang, Zhuosheng and Zhao, Hai}, booktitle={The 29th International Conference on Computational Linguistics (COLING 2022)}, year={2022} }
Machine reading comprehension is a heavily studied research and test field for evaluating new pre-trained models and fine-tuning strategies, and recent studies have enriched pre-trained models with syntactic, semantic, and other linguistic information to improve model performance. In this paper, we imitate the human reading process of connecting anaphoric expressions and explicitly leverage coreference information to enhance the word embeddings from the pre-trained model, in order to highlight the coreference mentions that must be identified for coreference-intensive question answering in QUOREF, a relatively new dataset specifically designed to evaluate the coreference-related performance of a model. We use an additional BERT layer to focus on the coreference mentions and a Relational Graph Convolutional Network to model the coreference relations. We demonstrate that explicitly incorporating coreference information in the fine-tuning stage performs better than incorporating it when training a pre-trained language model. |
@inproceedings{huang2021tracing, title={Tracing Origins: Coref-aware Machine Reading Comprehension}, author={Huang, Baorong and Zhang, Zhuosheng and Zhao, Hai}, booktitle={The 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022)}, year={2022} }
Tangled multi-party dialogue contexts lead to challenges for dialogue reading comprehension, where multiple dialogue threads flow simultaneously within the same dialogue history, increasing the difficulty of understanding a dialogue history for both humans and machines. Dialogue disentanglement aims to clarify conversation threads in a multi-party dialogue history, thus reducing the difficulty of comprehending the long, disordered dialogue passage. Existing studies commonly focus on utterance encoding with carefully designed feature-engineering-based methods but pay inadequate attention to dialogue structure. This work designs a novel model to disentangle a multi-party history into threads by taking dialogue structure features into account. Specifically, based on the fact that dialogues are constructed through successive participation of speakers and interactions between users of interest, we extract clues of speaker properties and references to users to model the structure of a long dialogue record. The novel method is evaluated on the Ubuntu IRC dataset and achieves state-of-the-art experimental results in dialogue disentanglement. |
@inproceedings{ma2022structural, title={Structural Modeling for Dialogue Disentanglement}, author={Ma, Xinbei and Zhang, Zhuosheng and Zhao, Hai}, booktitle={The 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022)}, year={2022} }
Training dense passage representations via contrastive learning has been shown effective for Open-Domain Passage Retrieval (ODPR). Existing studies focus on further optimization by improving the negative sampling strategy or adding extra pre-training. However, these studies overlook passages with internal representation conflicts that arise from improper modeling granularity. This work thus presents a refined model built on a smaller granularity, contextual sentences, to alleviate the concerned conflicts. In detail, we introduce an in-passage negative sampling strategy to encourage diverse sentence representations within the same passage. Experiments on three benchmark datasets verify the efficacy of our method, especially on datasets where conflicts are severe. Extensive experiments further show good transferability of our method across datasets. |
@inproceedings{wu2022sentence, title={Sentence-aware Contrastive Learning for Open-Domain Passage Retrieval}, author={Wu, Bohong and Zhang, Zhuosheng and Wang, Jinyuan and Zhao, Hai}, booktitle={The 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022)}, year={2022} }
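A minimal sketch of in-passage negative sampling in an InfoNCE-style objective (the full method also uses standard cross-passage negatives; the shapes and temperature here are illustrative):

```python
import torch
import torch.nn.functional as F

def in_passage_contrastive(query, pos_sent, in_passage_negs, temp=0.05):
    """query, pos_sent: (d,) encodings; in_passage_negs: (k, d) encodings of
    other sentences from the same passage, pushed apart to stay diverse."""
    cands = torch.cat([pos_sent.unsqueeze(0), in_passage_negs], dim=0)  # (1+k, d)
    sims = F.cosine_similarity(query.unsqueeze(0), cands, dim=-1) / temp
    target = torch.zeros(1, dtype=torch.long)            # index 0 = the positive
    return F.cross_entropy(sims.unsqueeze(0), target)
```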
Recently, the robustness of pre-trained language models (PrLMs) has received increasing research interest. The latest studies on adversarial attacks achieve high attack success rates against PrLMs, claiming that PrLMs are not robust. However, we find that the adversarial samples on which PrLMs fail are mostly non-natural and do not appear in reality. We question the validity of current evaluations of PrLM robustness based on these non-natural adversarial samples and propose an anomaly detector to evaluate the robustness of PrLMs with more natural adversarial samples. We also investigate two applications of the anomaly detector: (1) in data augmentation, we employ the anomaly detector to force the generation of augmented data that are distinguished as non-natural, which brings larger gains in the accuracy of PrLMs; (2) we apply the anomaly detector to a defense framework to enhance the robustness of PrLMs. It can be used to defend against all types of attacks and achieves higher accuracy on both adversarial samples and compliant samples than other defense frameworks. |
@inproceedings{wang2022distinguishing, title={Distinguishing Non-natural from Natural Adversarial Samples for More Robust Pre-trained Language Model}, author={Wang, Jiayi and Bao, Rongzhou and Zhang, Zhuosheng and Zhao, Hai}, booktitle={Findings of the Association for Computational Linguistics: ACL 2022}, year={2022} }
In this paper, we report our discovery of named entity distribution in a general word embedding space, which supports an open definition of multilingual named entities rather than the previous closed and constrained definition given by a named entity dictionary, which is usually derived from human labor and relies on scheduled updates. Our initial visualization of monolingual word embeddings indicates that named entities tend to gather together regardless of entity types and language differences, which enables us to model all named entities using a specific geometric structure inside the embedding space, namely, the named entity hypersphere. For the monolingual case, the proposed named entity model gives an open description of diverse named entity types and different languages. For the cross-lingual case, mapping the proposed named entity model provides a novel way to build named entity datasets for resource-poor languages. Finally, the proposed named entity model may serve as a very useful clue to significantly enhance state-of-the-art named entity recognition systems in general. |
@article{luo2022open, author={Luo, Ying and Zhao, Hai and Zhang, Zhuosheng and Tang, Bingjie}, journal={IEEE Transactions on Knowledge and Data Engineering}, title={Open Named Entity Modeling From Embedding Distribution}, year={2022}, volume={34}, number={11}, pages={5472-5483}, doi={10.1109/TKDE.2021.3049654} }
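A minimal sketch of the hypersphere model, assuming the simplest fitting rule of a centroid plus a quantile radius (the paper's fitting procedure may differ):

```python
import numpy as np

def fit_hypersphere(entity_vecs, coverage=0.95):
    """entity_vecs: (N, d) embeddings of known named entities.
    Returns (center, radius) covering `coverage` of the entities."""
    center = entity_vecs.mean(axis=0)
    dists = np.linalg.norm(entity_vecs - center, axis=1)
    return center, np.quantile(dists, coverage)

def looks_like_entity(vec, center, radius):
    return np.linalg.norm(vec - center) <= radius
```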
Recent pre-trained language models (PrLMs) offer a new performant method of contextualized word representations by leveraging the sequence-level context for modeling. Although the PrLMs generally provide more effective contextualized word representations than non-contextualized models, they are still subject to a sequence of text contexts without diverse hints from multimodality. This paper thus proposes a visual representation method to explicitly enhance conventional word embedding with multiple-aspect senses from visual guidance. In detail, we build a small-scale word-image dictionary from a multimodal seed dataset where each word corresponds to diverse related images. Experiments on 12 natural language understanding and machine translation tasks further verify the effectiveness and the generalization capability of the proposed approach. Analysis shows that our method with visual guidance pays more attention to content words, improves the representation diversity, and is potentially beneficial for enhancing the accuracy of disambiguation. |
@article{zhang2022apple, title={Which Apple Keeps Which Doctor Away? Colorful Word Representations With Visual Oracles}, author={Zhang, Zhuosheng and Yu, Haojie and Zhao, Hai and Utiyama, Masao}, journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, volume={30}, pages={49--59}, year={2022}, publisher={IEEE} }
This paper presents a novel method to generate answers for non-extractive machine reading comprehension (MRC) tasks whose answers cannot be simply extracted as one span from the given passages. Using a pointer-network-style extractive decoder for this type of MRC may result in unsatisfactory performance when the ground-truth answers are given by human annotators or are highly re-paraphrased from parts of the passages. On the other hand, using a generative decoder cannot guarantee that the resulting answers have well-formed syntax and semantics when encountering long sentences. Therefore, to alleviate the obvious drawbacks of both sides, we propose a method that composes answers from extracted multi-spans learned by our model as highly confident n-gram candidates in the given passage. That is, the returned answers are composed of discontinuous multi-spans rather than just one consecutive span in the given passages. The proposed method is simple but effective: empirical experiments on MS MARCO show that the proposed method performs better at accurately generating long answers and substantially outperforms two typical competitive one-span and Seq2Seq baseline decoders. |
@article{zhang2022syntax, title={Syntax-Aware Multi-Spans Generation for Reading Comprehension}, author={Zhang, Zhuosheng and Zhang, Yiqing and Zhao, Hai}, journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, volume={30}, pages={260--268}, year={2022}, publisher={IEEE} }
Multi-choice Machine Reading Comprehension (MRC) requires models to decide the correct answer from a set of answer options when given a passage and a question. Thus, in addition to a powerful Pre-trained Language Model (PrLM) as an encoder, multi-choice MRC especially relies on a matching network design that is supposed to effectively capture the relationships among the triplet of passage, question, and answers. While newer and more powerful PrLMs have shown their strength even without the support of a matching network, we propose a new DUal Multi-head Co-Attention (DUMA) model. It is inspired by the human transposition thinking process for solving the multi-choice MRC problem: considering each other's focus from the standpoints of the passage and the question. The proposed DUMA has been shown to be effective and is capable of generally promoting PrLMs. Our proposed method is evaluated on two benchmark multi-choice MRC tasks, DREAM and RACE. The results show that, on top of powerful PrLMs, DUMA can further boost the models to obtain higher performance. |
@article{9664302, author={Zhu, Pengfei and Zhang, Zhuosheng and Zhao, Hai and Li, Xiaoguang}, journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, title={DUMA: Reading Comprehension With Transposition Thinking}, year={2022}, volume={30}, pages={269--279}, doi={10.1109/TASLP.2021.3138683} }
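A minimal sketch of dual co-attention between the passage and the question-answer sequence (mean pooling and fusion by concatenation are simple placeholder choices; the paper's fusion may differ):

```python
import torch
import torch.nn as nn

class DualCoAttention(nn.Module):
    def __init__(self, d_model=768, n_heads=12):
        super().__init__()
        self.p2q = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.q2p = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, passage, qa):
        # passage: (B, Lp, d), qa: (B, Lq, d) -- encoder outputs.
        p_attn, _ = self.p2q(passage, qa, qa)        # passage attends to QA
        q_attn, _ = self.q2p(qa, passage, passage)   # QA attends to passage
        fused = torch.cat([p_attn.mean(dim=1), q_attn.mean(dim=1)], dim=-1)
        return fused   # (B, 2*d); feed to a classifier over answer options
```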
Multi-choice Machine Reading Comprehension (MRC) is a challenging task that requires a model to select the most appropriate answer from a set of candidates given a passage and a question. Most existing research focuses on modeling the task datasets without explicitly referring to external fine-grained knowledge sources, which could greatly compensate for the deficiencies of the given passage. Thus, we propose a novel reference-based knowledge enhancement model called Reference Knowledgeable Network (RekNet), which refines critical information from the passage and quotes explicit knowledge when necessary. In detail, RekNet refines fine-grained critical information, defined as the Reference Span, and then quotes explicit knowledge quadruples using the co-occurrence information of the Reference Span and the candidates. The proposed RekNet is evaluated on three multi-choice MRC benchmarks, RACE, DREAM, and Cosmos QA, and shows consistent and remarkable performance improvement with statistical significance over strong baselines. |
@article{9748021, author={Zhao, Yilin and Zhang, Zhuosheng and Zhao, Hai}, journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, title={Reference Knowledgeable Network for Machine Reading Comprehension}, year={2022}, volume={30}, pages={1461--1473}, doi={10.1109/TASLP.2022.3164219} }
Although pre-trained language models (PrLMs) have achieved significant success, recent studies demonstrate that PrLMs are vulnerable to adversarial attacks. By generating adversarial examples with slight perturbations at different levels (sentence / word / character), adversarial attacks can fool PrLMs into generating incorrect predictions, which calls the robustness of PrLMs into question. However, we find that most existing textual adversarial examples are unnatural and can be easily distinguished by both humans and machines. Based on a general anomaly detector, we propose a novel metric (Degree of Anomaly) as a constraint that enables current adversarial attack approaches to generate more natural and imperceptible adversarial examples. Under this new constraint, the success rate of existing attacks drastically decreases, which reveals that the robustness of PrLMs is not as fragile as claimed. In addition, we find that four types of randomization can invalidate a large portion of textual adversarial examples. Based on the anomaly detector and randomization, we design a universal defense framework, which is among the first to perform textual adversarial defense without knowing the specific attack. Empirical results show that our universal defense framework achieves comparable or even higher after-attack accuracy than other, attack-specific defenses, while preserving higher original accuracy at the same time. Our work discloses the essence of textual adversarial attacks and indicates that (i) further work on adversarial attacks should focus more on how to overcome detection and resist randomization, otherwise the adversarial examples would be easily detected and invalidated; and (ii) compared with unnatural and perceptible adversarial examples, it is the undetectable adversarial examples that pose real risks to PrLMs and require more attention in future robustness-enhancing strategies. |
@article{9833338, author={Wang, Jiayi and Bao, Rongzhou and Zhang, Zhuosheng and Zhao, Hai}, journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, title={Rethinking Textual Adversarial Defense for Pre-Trained Language Models}, year={2022}, pages={1--15}, doi={10.1109/TASLP.2022.3192097} }
Conversational machine reading (CMR) requires machines to communicate with humans through multi-turn interactions between two salient dialogue states: decision making and question generation. In the open CMR setting, the more realistic scenario, the retrieved background knowledge can be noisy, which poses severe challenges for information transmission. Existing studies commonly train independent or pipeline systems for the two subtasks. However, those methods rely on hard-label decisions to activate question generation, which eventually hinders model performance. In this work, we propose an effective gating strategy that smooths the two dialogue states in a single decoder and bridges decision making and question generation to provide a richer dialogue state reference. Experiments on the OR-ShARC dataset show the effectiveness of our method, which achieves new state-of-the-art results. |
@inproceedings{zhang2021oscar, title={Smoothing Dialogue States for Open Conversational Machine Reading}, author={Zhang, Zhuosheng and Ouyang, Siru and Zhao, Hai and Utiyama, Masao and Sumita, Eiichiro}, booktitle={The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)}, year={2021} }
Pre-trained language models (PrLMs) have to carefully manage input units when training on very large texts with vocabularies of millions of words. Previous works have shown that incorporating span-level information over consecutive words in pre-training can further improve the performance of PrLMs. However, given that span-level clues are introduced and fixed in pre-training, such methods are time-consuming and lack flexibility. To alleviate this inconvenience, this paper presents a novel span fine-tuning method for PrLMs, in which the span setting is adaptively determined by the specific downstream task during the fine-tuning phase. In detail, any sentence processed by the PrLM is segmented into multiple spans according to a pre-sampled dictionary. The segmentation information is then sent through a hierarchical CNN module together with the representation outputs of the PrLM to ultimately generate a span-enhanced representation. Experiments on the GLUE benchmark show that the proposed span fine-tuning method significantly enhances the PrLM and, at the same time, offers more flexibility in an efficient way. |
@inproceedings{bao2021spanft, title={Span Fine-tuning for Pre-trained Language Models}, author={Bao, Rongzhou and Zhang, Zhuosheng and Zhao, Hai}, booktitle={Findings of the Association for Computational Linguistics: EMNLP 2021}, year={2021} }
Multi-party dialogue machine reading comprehension (MRC) raises an even more challenging understanding goal on dialogues with more than two involved speakers, compared with traditional plain-passage-style MRC. To accurately perform the question-answering (QA) task on such multi-party dialogues, models have to handle fundamentally different discourse relationships from common non-dialogue plain text, where discourse relations are supposed to connect two far-apart utterances in a linguistically motivated way. To further explore the role of such unusual discourse structure in the correlated QA task in terms of MRC, we propose the first multi-task model for jointly performing QA and discourse parsing (DP) on the multi-party dialogue MRC task. Our proposed model is evaluated on the latest benchmark, Molweni, and the results indicate that training with complementary tasks indeed benefits not only the QA task but also the DP task itself. We further find that the joint model is distinctly stronger when handling longer dialogues, which again verifies the necessity of DP in the related MRC. |
@inproceedings{he2021mtldlg, title={Multi-tasking Dialogue Comprehension with Discourse Parsing}, author={He, Yuchen and Zhang, Zhuosheng and Zhao, Hai}, booktitle={The 35th Pacific Asia Conference on Language, Information and Computation (PACLIC 35)}, year={2021} }
Multi-turn dialogue reading comprehension aims to teach machines to read dialogue contexts and solve tasks such as response selection and answering questions. The major challenges involve noisy history contexts and the special prerequisite of commonsense knowledge that is unseen in the given material. Existing works mainly focus on context and response matching approaches. This work thus makes the first attempt to tackle the above two challenges by extracting substantially important turns as pivot utterances and utilizing external knowledge to enhance the representation of context. We propose a pivot-oriented deep selection model (PoDS) on top of Transformer-based language models for dialogue comprehension. In detail, our model first picks out the pivot utterances from the conversation history according to their semantic matching with the candidate response or question, if any. Besides, knowledge items related to the dialogue context are extracted from a knowledge graph as external knowledge. Then, the pivot utterances and the external knowledge are combined with a well-designed mechanism for refining predictions. Experimental results on four dialogue comprehension benchmark tasks show that our proposed model achieves substantial improvements over baselines. A series of empirical comparisons is conducted to show how our selection strategies and the extra knowledge injection influence the results. |
@article{zhang2021multi, title={Multi-Turn Dialogue Reading Comprehension With Pivot Turns and Knowledge}, author={Zhang, Zhuosheng and Li, Junlong and Zhao, Hai}, journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, volume={29}, pages={1161--1173}, year={2021}, publisher={IEEE} }
Pre-trained language models (PrLMs) have demonstrated superior performance due to their strong ability to learn universal language representations from self-supervised pre-training. However, even with the help of the powerful PrLMs, it is still challenging to effectively capture task-related knowledge from dialogue texts which are enriched by correlations among speaker-aware utterances. In this work, we present SPIDER, Structural Pre-traIned DialoguE Reader, to capture dialogue exclusive features. To simulate the dialogue-like features, we propose two training objectives in addition to the original LM objectives: 1) utterance order restoration, which predicts the order of the permuted utterances in dialogue context; 2) sentence backbone regularization, which regularizes the model to improve the factual correctness of summarized subject-verb-object triplets. Experimental results on widely used dialogue benchmarks verify the effectiveness of the newly introduced self-supervised tasks. |
@inproceedings{zhang2021structural, title={Structural Pre-training for Dialogue Comprehension}, author={Zhang, Zhuosheng and Zhao, Hai}, booktitle={The 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021)}, year={2021} }
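A small sketch of constructing a training instance for the utterance order restoration objective (the sentence backbone regularization objective needs an SVO extractor and is omitted here):

```python
import random

def order_restoration_instance(utterances):
    """Shuffle a dialogue's utterances; the labels are each shuffled
    utterance's original position, which the model must predict."""
    order = list(range(len(utterances)))
    random.shuffle(order)
    shuffled = [utterances[i] for i in order]
    return shuffled, order   # order[k] = original index of shuffled[k]
```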
Conversational Machine Reading (CMR) aims at answering questions in complicated interactive scenarios. The machine needs to answer questions through interactions with the user based on a given rule document, the user scenario, and the dialogue history, and even initiatively asks questions for clarification if necessary. Namely, the machine needs to respond with either \textsl{Yes, No, Irrelevant} or a follow-up question for further clarification. To effectively capture the multiple objects in such a challenging task, graph modeling is supposed to be adopted; surprisingly, this had not happened before this work, which proposes a dialogue graph modeling framework that incorporates two complementary graph models, i.e., an explicit discourse graph and an implicit discourse graph, which respectively capture the explicit and implicit interactions hidden in the rule documents. The proposed model is evaluated on the ShARC benchmark and achieves a new state of the art, being the first to exceed the milestone accuracy score of 80\%. |
@inproceedings{ouyang2021dialogue, title={Dialogue Graph Modeling for Conversational Machine Reading}, author={Ouyang, Siru and Zhang, Zhuosheng and Zhao, Hai}, booktitle={Findings of the Association for Computational Linguistics: ACL 2021}, year={2021} }
Machine reading comprehension (MRC) is an AI challenge that requires machines to determine the correct answers to questions based on a given passage. MRC systems must not only answer questions when necessary but also distinguish when no answer is available according to the given passage and then tactfully abstain from answering. When unanswerable questions are involved in the MRC task, an essential verification module called a verifier is especially required in addition to the encoder, though the latest practice in MRC modeling still benefits most from adopting well pre-trained language models as the encoder block while focusing only on the "reading". This paper devotes itself to exploring better verifier design for the MRC task with unanswerable questions. Inspired by how humans solve reading comprehension questions, we propose a retrospective reader (Retro-Reader) that integrates two stages of reading and verification strategies: 1) sketchy reading that briefly investigates the overall interactions of passage and question and yields an initial judgment; 2) intensive reading that verifies the answer and gives the final prediction. The proposed reader is evaluated on two benchmark MRC challenge datasets, SQuAD2.0 and NewsQA, achieving new state-of-the-art results. Significance tests show that our model is significantly better than the strong ELECTRA and ALBERT baselines. A series of analyses is also conducted to interpret the effectiveness of the proposed reader. |
@inproceedings{zhang2021retro, title={Retrospective Reader for Machine Reading Comprehension}, author={Zhang, Zhuosheng and Yang, Junjie and Zhao, Hai}, booktitle={The Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021)}, year={2021} }
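A schematic of combining the two stages into an answerability decision, assuming a weighted sum of the sketchy reader's external verification score and the intensive reader's null-versus-span score against a threshold (the weights and threshold here are illustrative; the paper tunes its own):

```python
def rear_verification(span_score, null_score, ext_verifier_score,
                      beta1=0.5, beta2=0.5, delta=0.0):
    """span_score / null_score: intensive reader's best-span and no-answer scores;
    ext_verifier_score: sketchy reader's unanswerability score."""
    internal = null_score - span_score           # higher => more likely no answer
    v = beta1 * internal + beta2 * ext_verifier_score
    return "no_answer" if v > delta else "answer"
```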
A multi-turn dialogue is composed of multiple utterances from two or more different speaker roles. Thus, utterance- and speaker-aware clues are supposed to be well captured in models. However, in existing retrieval-based multi-turn dialogue modeling, the pre-trained language models (PrLMs) used as encoders represent the dialogues coarsely by taking the pairwise dialogue history and candidate response as a whole, so the hierarchical information on either utterance interrelation or speaker roles coupled in such representations is not well addressed. In this work, we propose a novel model to fill this gap by modeling the effective utterance-aware and speaker-aware representations entailed in a dialogue history. In detail, we decouple the contextualized word representations by masking mechanisms in the Transformer-based PrLM, making each word focus only on the words in the current utterance, other utterances, and the two speaker roles (i.e., utterances of the sender and utterances of the receiver), respectively. Experimental results show that our method substantially boosts the strong ELECTRA baseline on four public benchmark datasets and achieves new state-of-the-art performance over previous methods. A series of ablation studies is conducted to demonstrate the effectiveness of our method. |
@inproceedings{liu2021filling, title={Filling the Gap of Utterance-aware and Speaker-aware Representation for Multi-turn Dialogue}, author={Liu, Longxiang and Zhang, Zhuosheng and Zhao, Hai and Zhou, Xi and Zhou, Xiang}, booktitle={The Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021)}, year={2021} }
In retrieval-based multi-turn dialogue modeling, it remains a challenge to select the most appropriate response by extracting salient features from context utterances. As a conversation goes on, topic shifts at the discourse level naturally happen throughout the continuous multi-turn dialogue context. However, all known retrieval-based systems settle for exploiting local topic words for context utterance representation but fail to capture such essential global topic-aware clues at the discourse level. Instead of taking topic-agnostic n-gram utterances as the processing unit for matching purposes, as in existing systems, this paper presents a novel topic-aware solution for multi-turn dialogue modeling, which segments and extracts topic-aware utterances in an unsupervised way, so that the resulting model is capable of capturing salient topic shifts at the discourse level as needed and thus effectively tracks topic flow during multi-turn conversation. Our topic-aware modeling is implemented by a newly proposed unsupervised topic-aware segmentation algorithm and a Topic-Aware Dual-attention Matching (TADAM) Network, which matches each topic segment with the response in a dual cross-attention way. Experimental results on three public datasets show that TADAM outperforms the state-of-the-art method by a large margin, especially by 3.4% on the E-commerce dataset, which has obvious topic shifts. |
@inproceedings{xu2021topic, title={Topic-aware multi-turn dialogue modeling}, author={Xu, Yi and Zhao, Hai and Zhang, Zhuosheng}, booktitle={The Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021)}, year={2021} }
Though visual information has been introduced for enhancing neural machine translation (NMT), its effectiveness strongly relies on the availability of large amounts of bilingual parallel sentence pairs with manual image annotations. In this paper, we present a universal visual representation learned over monolingual corpora with image annotations, which overcomes the lack of large-scale bilingual sentence-image pairs, thereby extending image applicability in NMT. In detail, a group of images with topics similar to the source sentence is retrieved from a light topic-image lookup table learned over existing sentence-image pairs and then encoded as image representations by a pre-trained ResNet. An attention layer with gated weighting is used to fuse the visual information and text information as input to the decoder for predicting target translations. In particular, the proposed method enables the visual information to be integrated into large-scale text-only NMT in addition to multimodal NMT. Experiments on four widely used translation datasets, including WMT'16 English-to-Romanian, WMT'14 English-to-German, WMT'14 English-to-French, and Multi30K, show that the proposed approach achieves significant improvements over strong baselines. |
@inproceedings{zhang2020neural, title={Neural Machine Translation with Universal Visual Representation}, author={Zhuosheng Zhang and Kehai Chen and Rui Wang and Masao Utiyama and Eiichiro Sumita and Zuchao Li and Hai Zhao}, booktitle={International Conference on Learning Representations}, year={2020}, url={https://openreview.net/forum?id=Byl8hhNYPS} }
The latest work on language representations carefully integrates contextualized features into language model training, which enables a series of successes, especially in various machine reading comprehension and natural language inference tasks. However, existing language representation models, including ELMo, GPT, and BERT, only exploit plain context-sensitive features such as character or word embeddings. They rarely consider incorporating structured semantic information, which can provide rich semantics for language representation. To promote natural language understanding, we propose to incorporate explicit contextual semantics from pre-trained semantic role labeling and introduce an improved language representation model, Semantics-aware BERT (SemBERT), which is capable of explicitly absorbing contextual semantics over a BERT backbone. SemBERT keeps the convenient usability of its BERT precursor with light fine-tuning and without substantial task-specific modifications. Compared with BERT, SemBERT is as simple in concept but more powerful. It obtains new state-of-the-art results or substantially improves on existing results on ten reading comprehension and language inference tasks. |
@inproceedings{zhang2020semantics, title={Semantics-aware BERT for Language Understanding}, author={Zhang, Zhuosheng and Wu, Yuwei and Zhao, Hai and Li, Zuchao and Zhang, Shuailiang and Zhou, Xi and Zhou, Xiang}, booktitle={Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)}, volume={34}, number={05}, pages={9628--9635}, year={2020} }
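A minimal sketch of fusing semantic role label embeddings with BERT outputs (concatenation plus a linear projection is one simple fusion; SemBERT's actual aggregation of multiple predicate-wise label sequences is more involved):

```python
import torch
import torch.nn as nn

class SemanticFusion(nn.Module):
    def __init__(self, hidden=768, n_labels=100, label_dim=10):
        super().__init__()
        self.label_emb = nn.Embedding(n_labels, label_dim)
        self.proj = nn.Linear(hidden + label_dim, hidden)

    def forward(self, bert_out, srl_labels):
        # bert_out: (B, L, hidden); srl_labels: (B, L) ids from an SRL tagger.
        fused = torch.cat([bert_out, self.label_emb(srl_labels)], dim=-1)
        return self.proj(fused)   # semantics-enriched token representations
```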
For machine reading comprehension, the capacity to effectively model the linguistic knowledge in detail-riddled and lengthy passages while getting rid of the noise is essential for improving performance. Traditional attentive models attend to all words without explicit constraint, which results in inaccurate concentration on some dispensable words. In this work, we propose using syntax to guide text modeling by incorporating explicit syntactic constraints into the attention mechanism for better linguistically motivated word representations. In detail, for the Transformer-based encoder built on the self-attention network (SAN), we introduce a syntactic dependency of interest (SDOI) design into the SAN to form an SDOI-SAN with syntax-guided self-attention. The syntax-guided network (SG-Net) is then composed of this extra SDOI-SAN and the SAN from the original Transformer encoder through a dual contextual architecture for better linguistically inspired representation. To verify its effectiveness, the proposed SG-Net is applied to the typical pre-trained language model BERT, which is built directly on a Transformer encoder. Extensive experiments on popular benchmarks, including SQuAD 2.0 and RACE, show that the proposed SG-Net design helps achieve substantial performance improvement over strong baselines. |
@inproceedings{zhang2020sg, title={SG-Net: Syntax-Guided Machine Reading Comprehension}, author={Zhang, Zhuosheng and Wu, Yuwei and Zhou, Junru and Duan, Sufeng and Zhao, Hai and Wang, Rui}, booktitle={Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)}, pages={9636--9643}, year={2020} }
Multi-choice reading comprehension is a challenging task in which an answer must be selected from a set of candidate options given a passage and a question. Previous approaches usually only calculate the question-aware passage representation and ignore the passage-aware question representation when modeling the relationship between passage and question, which obviously cannot make the best use of the information between passage and question. In this work, we propose the dual co-matching network (DCMN), which models the relationship among passage, question, and answer options bidirectionally. Besides, inspired by how humans solve multi-choice questions, we integrate two reading strategies into our model: (i) passage sentence selection, which finds the most salient supporting sentences to answer the question, and (ii) answer option interaction, which encodes the comparison information between answer options. DCMN integrated with the two strategies (DCMN+) obtains state-of-the-art results on five multi-choice reading comprehension datasets from different domains: RACE, SemEval-2018 Task 11, ROCStories, COIN, and MCTest. |
@inproceedings{zhang2020dcmn+, title={{DCMN+}: Dual co-matching network for multi-choice reading comprehension}, author={Zhang, Shuailiang and Zhao, Hai and Wu, Yuwei and Zhang, Zhuosheng and Zhou, Xi and Zhou, Xiang}, booktitle={Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)}, volume={34}, number={05}, pages={9563--9570}, year={2020} }
In this paper, we present Linguistic Informed Multi-Task BERT (LIMIT-BERT), which learns language representations across multiple linguistic tasks by Multi-Task Learning (MTL). LIMIT-BERT includes five key linguistic syntax and semantics tasks: Part-Of-Speech (POS) tagging, constituent and dependency syntactic parsing, and span and dependency semantic role labeling (SRL). Besides, LIMIT-BERT adopts a linguistics mask strategy, Syntactic and Semantic Phrase Masking, which masks all of the tokens corresponding to a syntactic/semantic phrase. Different from recent Multi-Task Deep Neural Networks (MT-DNN) (Liu et al., 2019), LIMIT-BERT is linguistically motivated and learns in a semi-supervised manner, which provides large amounts of linguistic-task data on the same scale as the BERT learning corpus. As a result, LIMIT-BERT not only improves performance on linguistic tasks but also benefits from a regularization effect and linguistic information that leads to more general representations, helping it adapt to new tasks and domains. LIMIT-BERT obtains new state-of-the-art or competitive results on both span and dependency semantic parsing on Propbank benchmarks and on both dependency and constituent syntactic parsing on the Penn Treebank. |
@inproceedings{zhou2020limit, title={LIMIT-BERT: Linguistic informed multi-task bert}, author={Zhou, Junru and Zhang, Zhuosheng and Zhao, Hai and Zhang, Shuailiang}, booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2020", year = "2020", publisher = "Association for Computational Linguistics", url = "https://www.aclweb.org/anthology/2020.findings-emnlp.399", doi = "10.18653/v1/2020.findings-emnlp.399", pages = "4450--4461", }
Memory-based learning can be characterized as a lazy learning method in machine learning terminology because it delays the processing of input by storing the input until needed. Linguistic structure parsing, which has been in a performance improvement bottleneck since the latest series of works was presented, determines the syntactic or semantic structure of a sentence. In this article, we construct a memory component and use it to augment a linguistic structure parser which allows the parser to directly extract patterns from the known training treebank to form memory. The experimental results show that existing state-of-the-art parsers reach new heights of performance on the main benchmarks for dependency parsing and semantic role labeling with this memory network. |
@article{li2020memory, title={Memory Network for Linguistic Structure Parsing}, author={Li, Zuchao and Guan, Chaoyu and Zhao, Hai and Wang, Rui and Parnow, Kevin and Zhang, Zhuosheng}, journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, volume={28}, pages={2743--2755}, year={2020}, publisher={IEEE} }
Pinyin-to-character (P2C) conversion is the core component of the pinyin-based Chinese input method engine (IME). However, the conversion is seriously compromised by the ambiguity of Chinese characters corresponding to pinyin as well as by predefined fixed vocabularies. To alleviate these inconveniences, we propose a neural P2C conversion model augmented by a large online-updating vocabulary with a target vocabulary sampling mechanism to support open vocabulary learning while the IME is working. Our experiments show that the proposed approach reduces decoding time on CPUs by up to 50$\%$ on P2C tasks with the same or only negligible change in conversion accuracy, and the online-updated vocabulary indeed helps our IME effectively follow user input behavior. |
@inproceedings{zhang2019acl, title = "Open Vocabulary Learning for Neural {Chinese} Pinyin {IME}", author = "Zhang, Zhuosheng and Huang, Yafang and Zhao, Hai", booktitle = "Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019)", url = "https://www.aclweb.org/anthology/P19-1154", pages = "1584--1594", year = "2019", }
Representation learning is the foundation of machine reading comprehension and inference. In state-of-the-art models, character-level representations have been broadly adopted to alleviate the problem of effectively representing rare or complex words. However, the character itself is not a natural minimal linguistic unit for representation or word embedding composition, as this ignores the linguistic coherence of consecutive characters inside a word. This paper presents a general subword-augmented embedding framework for learning and composing computationally derived subword-level representations. We survey a series of unsupervised segmentation methods for subword acquisition and different subword-augmented strategies for text understanding, showing that subword-augmented embedding significantly improves our baselines in various text understanding tasks on both English and Chinese benchmarks. |
@article{Zhang2019subword, title={Effective Subword Segmentation for Text Comprehension}, author={Zhang, Zhuosheng and Zhao, Hai and Ling, Kangwei and Li, Jiangtong and He, Shexia and Fu, Guohong}, journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP)}, year={2019}, volume={27}, number={11}, pages={1664-1674}, doi={10.1109/TASLP.2019.2922537} }
Who did what to whom is a major focus in natural language understanding, which is precisely the aim of the semantic role labeling (SRL) task. Despite sharing many processing characteristics and even task purpose, it is surprising that jointly considering these two related tasks has never been formally reported in previous work. Thus, this paper makes the first attempt to let SRL enhance text comprehension and inference by specifying verbal predicates and their corresponding semantic roles. In terms of deep learning models, our embeddings are enhanced by explicit contextual semantic role labels for more fine-grained semantics. We show that the salient labels can be conveniently added to existing models and significantly improve deep learning models on challenging text comprehension tasks. Extensive experiments on benchmark machine reading comprehension and inference datasets verify that the proposed semantic learning helps our system reach a new state of the art over strong baselines that have been enhanced by well pre-trained language models from the latest progress. |
@inproceedings{zhang2019explicit, title = "Explicit Contextual Semantics for Text Comprehension", author = "Zhang, Zhuosheng and Wu, Yuwei and Li, Zuchao and Zhao, Hai", booktitle = "Proceedings of the 33rd Pacific Asia Conference on Language, Information and Computation (PACLIC 33)", year = "2019", }
Semantic role labeling (SRL) aims to discover the predicate-argument structure of a sentence. End-to-end SRL without syntactic input has received great attention. However, most of them focus on either span-based or dependency-based semantic representation form and only show specific model optimization respectively. Meanwhile, handling these two SRL tasks uniformly was less successful. This paper presents an end-to-end model for both dependency and span SRL with a unified argument representation to deal with two different types of argument annotations in a uniform fashion. Furthermore, we jointly predict all predicates and arguments, especially including the long-term ignored predicate identification subtask. Our single model achieves new state-of-the-art results on both span (CoNLL 2005, 2012) and dependency (CoNLL 2008, 2009) SRL benchmarks. |
Multi-turn conversation understanding is a major challenge for building intelligent dialogue systems. This work focuses on retrieval-based response matching for multi-turn conversation, where related work simply concatenates the conversation utterances, ignoring the interactions among previous utterances for context modeling. In this paper, we formulate previous utterances into context using a proposed deep utterance aggregation model to form a fine-grained context representation. In detail, a self-matching attention is first introduced to route the vital information in each utterance. Then the model matches a response with each refined utterance, and the final matching score is obtained after attentive turn aggregation. Experimental results show that our model outperforms the state-of-the-art methods on three multi-turn conversation benchmarks, including a newly introduced e-commerce dialogue corpus. |
@inproceedings{zhang2018dua, title = {Modeling Multi-turn Conversation with Deep Utterance Aggregation}, author = {Zhang, Zhuosheng and Li, Jiangtong and Zhu, Pengfei and Zhao, Hai}, booktitle = {Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018)}, pages = {3740--3752}, year = {2018}, }
Representation learning is the foundation of machine reading comprehension. In state-of-the-art models, deep learning methods broadly use word- and character-level representations. However, the character is not naturally the minimal linguistic unit. In addition, with a simple concatenation of character and word embeddings, previous models actually give a suboptimal solution. In this paper, we propose to use subwords rather than characters for word embedding enhancement. We also empirically explore different augmentation strategies on subword-augmented embedding to enhance the cloze-style reading comprehension model reader. In detail, we present a reader that uses subword-level representations to augment word embeddings with a short list to handle rare words effectively. A thorough examination is conducted to evaluate the comprehensive performance and generalization ability of the proposed reader. Experimental results show that the proposed approach helps the reader significantly outperform the state-of-the-art baselines on various public datasets. |
@inproceedings{zhang2018mrc, title = {Subword-augmented Embedding for Cloze Reading Comprehension}, author = {Zhang, Zhuosheng and Huang, Yafang and Zhao, Hai}, booktitle = {Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018)}, pages = {1802--1814}, year = {2018}, }
Answering questions from university admission exams (Gaokao in Chinese) is a challenging AI task, since it requires effective representations to capture the complicated semantic relations between questions and answers. In this work, we propose a hybrid neural model for the deep question-answering task on history examinations. Our model employs a cooperative gated neural network to retrieve answers with the assistance of extra labels given by a neural Turing machine labeler. An empirical study shows that the labeler works well with only a small training dataset, and the gated mechanism is good at fetching the semantic representation of lengthy answers. Experiments on question answering demonstrate that the proposed model obtains substantial performance gains over various neural baselines in terms of multiple evaluation metrics. |
@inproceedings{zhang2018gaokao, title = {One-shot Learning for Question-Answering in Gaokao History Challenge}, author = {Zhang, Zhuosheng and Zhao, Hai}, booktitle = {Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018)}, pages = {449--461}, year = {2018}, }
Traditional chatbots usually need a mass of human dialogue data, especially when using supervised machine learning methods. Though they can easily deal with single-turn question answering, their performance on multi-turn conversations is usually unsatisfactory. In this paper, we present Lingke, an information-retrieval-augmented chatbot that is able to answer questions based on a given product introduction document and deal with multi-turn conversations. We introduce a fine-grained pipeline processing to distill responses from unstructured documents, and attentive sequential context-response matching for multi-turn conversations. |
@inproceedings{zhu2018lingke, title = {Lingke: A Fine-grained Multi-turn Chatbot for Customer Service}, author = {Zhu, Pengfei and Zhang, Zhuosheng and Li, Jiangtong and Huang, Yafang and Zhao, Hai}, booktitle = {Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018), System Demonstrations}, pages = {108--112}, year = {2018}, }
Machine reading comprehension is a task of modeling the relationship between passage and query. In terms of deep learning frameworks, most state-of-the-art models simply concatenate word- and character-level representations, which has been shown to be suboptimal for the concerned task. In this paper, we empirically explore different integration strategies for word and character embeddings and propose a character-augmented reader that attends to character-level representations to augment word embeddings with a short list to improve word representations, especially for rare words. Experimental results show that the proposed approach helps the baseline model significantly outperform state-of-the-art baselines on various public benchmarks. |
@inproceedings{zhang2018char, title = {Effective Character-augmented Word Embedding for Machine Reading Comprehension}, author = {Zhang, Zhuosheng and Huang, Yafang and Zhu, Pengfei and Zhao, Hai}, booktitle = {Proceedings of the Seventh CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC 2018)}, pages = {27-39}, year = {2018}, }
The Chinese pinyin input method engine (IME) lets users conveniently input Chinese into a computer by typing pinyin through a common keyboard. In addition to offering high conversion quality, a modern pinyin IME is supposed to aid user input with extended association functions. However, existing solutions for such functions are roughly based on oversimplified word-level matching algorithms, whose resulting products provide only limited extensions associated with user inputs. This work presents the Moon IME, a pinyin IME that integrates an attention-based neural machine translation (NMT) model and information retrieval (IR) to offer an amusive and customizable association ability. The released IME is implemented on Windows via the Text Services Framework. |
@inproceedings{Huang2018Moon, title={{Moon IME:} Neural-based Chinese Pinyin Aided Input Method with Customizable Association}, author={Huang, Yafang and Li, Zuchao and Zhang, Zhuosheng and Zhao, Hai}, booktitle={Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), System Demonstrations}, pages={140--145}, year={2018} }
Semantic role labeling (SRL) aims to recognize the predicate-argument structure of a sentence. Great attention has been paid to the role of syntactic information in enhancing SRL. However, the latest advances show that syntax may not be so important for SRL, given the emerging, much smaller gap between syntax-aware and syntax-agnostic SRL. To comprehensively explore the role of syntax in the SRL task, we extend existing models and propose a unified framework to investigate more effective and more diverse ways of incorporating syntax into sequential neural networks. Exploring the effect of syntactic input quality on SRL performance, we confirm that high-quality syntactic parses can still effectively enhance syntactically driven SRL. Using an empirically optimized integration strategy, we even enlarge the gap between syntax-aware and syntax-agnostic SRL. Our framework achieves state-of-the-art results on the CoNLL-2009 benchmarks for both English and Chinese, substantially outperforming all previous models. |
@inproceedings{li2018unified, title={A unified syntax-aware framework for semantic role labeling}, author={Li, Zuchao and He, Shexia and Cai, Jiaxun and Zhang, Zhuosheng and Zhao, Hai and Liu, Gongshen and Li, Linlin and Si, Luo}, booktitle={Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018)}, pages={2401--2411}, year={2018} }