王 瑞 (Wang, Rui)

上海交通大学计算机系 副教授 & 博士生导师
Associate Professor & Ph.D. Advisor
Department of Computer Science and Engineering
Shanghai Jiao Tong University

Email: wangrui12 (as you know) sjtu.edu.cn; wangrui.nlp (as you know) gmail.com
Address: 3-501 SEIEE Building, 800 Dongchuan Road, Shanghai 200240, China

Biography

Dr. Rui Wang is a computational linguist working as an associate professor at Shanghai Jiao Tong University since 2021. Before that, he was a researcher (tenured in 2020) at Japan National Institute of Information and Communications Technology (NICT) from 2016 to 2020. His research interests are traditional linguistic based and cutting-edge deep learning based approaches for machine translation (MT) and multi-lingual NLP. He has published more than 40 papers in top-tier NLP/ML/AI conferences and journals, such as ACL, EMNLP, ICLR, AAAI, IJCAI, TPAMI, TASLP, etc. He has also won several first places in top-tier MT/NLP shared tasks, such as WMT-2018, WMT-2019, WMT-2020, CoNLL-2019, etc. He served as the area chairs of ICLR-2021 and NAACL-2021. He gave cutting-edge tutorials at EACL-2021 and EMNLP-2021.


Machine Translation and Multilingual NLP Group

I am always fortunate to work with these brilliant researchers and Ph.D./master/undergraduate students. Please send your CV and research proposal (optional) to me if you want to join us.
非常有幸能和这样一群优秀的年轻人共事,希望我们能从彼此身上学习到有意义和有意思的东西,共同进步!

Ph.D. Students @SJTU

Zhiwei He (2021.9-)

Researcher @NICT

Kehai Chen (Postdoctoral Researcher, 2018.10-2020.12)
Zuchao Li (Technical Researcher, 2020.11-2020.12)

Interns @NICT

Shintaro Harada (Prof. Taro Watanabe's lab, NAIST, Japan, 2020.8-2020.10)
Chaoqun Duan (Prof. Tiejun Zhao's lab, HIT, China, 2019.6-2020.9)
Fengshun Xiao (Prof. Hai Zhao's lab, SJTU, China, 2019.10-2020.3)
Zhuosheng Zhang (Prof. Hai Zhao's lab, SJTU, China, 2019.6-2020.7)
Zuchao Li (Prof. Hai Zhao's lab, SJTU, China, 2019.4-2019.10)
Mingming Yang (Prof. Min Zhang's lab, Soochow University, China, 2018.9-2019.7)
Shu Jiang (Prof. Bao-Liang Lu's lab, SJTU, China, 2018.8-2019.1)
Haipeng Sun (Prof. Tiejun Zhao's lab, HIT, China, 2018.8-2020.4)
Zhisong Zhang (Prof. Hai Zhao's lab, SJTU, China, 2017.7-2018.4)
Kehai Chen (Prof. Tiejun Zhao's lab, HIT, China, 2017.1-2018.4)

Teaching

Tutorial

EACL-2021: Advances and Challenges in Unsupervised Neural Machine Translation. Rui Wang and Hai Zhao
    --This talk has also been given at 日本言語処理学会年次大会 (2019, 2020), Kyoto University (2020), etc.
EMNLP-2021: Syntax in End-to-End Natural Language Processing. Hai Zhao, Rui Wang, and Kehai Chen
CCMT-2019: Domain Adaptation for Neural Machine Translation. Chenhui Chu and Rui Wang
    --You also refer to our survey paper in [COLING-2018]
    --This talk has also been given at Shanghai Jiao Tong Univerisity (2019), ByteDance (2019), Chinese Academy of Sciences (2019), etc.

Academic Services

Area Chairs: ICLR-2021, NAACL-2021, CCL-2019, and CCL-2018
Organization Chairs: PACLIC-29 and YCCL-2012
PC Members: ACL, EMNLP, NAACL, ICLR, AAAI, IJCAI, etc.
Reviewers: CL, TACL, IEEE TASLP, etc.

Shared Tasks

WMT-2020: 1st in three tasks (supervised English->Chinese, supervised Polish->English, and unsupervised/low-resource German-Upper Sorbian) [Results][Paper]
CoNLL-2019: 1st in the DM sub-task and the 2nd overall [Results][Paper]
WMT-2019: 1st in the only unsupervised MT task (German-Czech) [Results] [Paper]
WAT-2018: 1st places in Myanmar (Burmese) <- English [Results][Paper]
WMT-2018: 1st places in four tasks (English<->Estonian and English<->Finnish) [Results][Paper]


Fundings

2019-2020: NICT tenure-track startup fund: "Toward Intelligent Machine Translation", PI, 8,000,000JPY
2019-2020 (19K20354): PI of Japan national fund (JSPS) for early-career scientists: "Unsupervised Neural Machine Translation in Universal Scenarios", PI, 4,180,000JPY

Selected Publication [Google Scholar] [DBLP]

Note: If you have a technical question about a research paper, it is best to try to get in contact with all of the authors, so the most appropriate person can respond as quickly as possible.

2021

Advances and Challenges in Unsupervised Neural Machine Translation
    Rui Wang and Hai Zhao
    16th conference of the European Chapter of the Association for Computational Linguistics (EACL-Tutorial), 2021

Syntax in End-to-End Natural Language Processing
    Hai Zhao, Rui Wang, and Kehai Chen
    The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP-Tutorial), 2021

SG-Net: Syntax Guided Transformer for Language Representation
    Zhuosheng Zhang, Yuwei Wu, Junru Zhou, Sufeng Duan, Hai Zhao*, and Rui Wang
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

Modeling Future Cost for Neural Machine Translation
    Chaoqun Duan, Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Conghui Zhu*, and Tiejun Zhao
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2021

Unsupervised Neural Machine Translation for Similar and Distant Language Pairs: An Empirical Study
    Haipeng Sun, Rui Wang, Masao Utiyama, Benjamin Marie, Kehai Chen, Eiichiro Sumita, and Tiejun Zhao*
    ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 2021

2020

Data-dependent Gaussian Prior Objective for Language Generation
    Zuchao Li, Rui Wang*, Kehai Chen, Masao Utiyama, Eiichiro Sumita, Zhuosheng Zhang, and Hai Zhao*
    International Conference on Learning Representations (ICLR-2020), Addis Ababa, Ethiopia
    [Codes] Note: this is a full-score paper and a long-time talk presentation

Neural Machine Translation with Universal Visual Representation
    Zhuosheng Zhang, Kehai Chen, Rui Wang*, Masao Utiyama, Eiichiro Sumita, Zuchao Li, and Hai Zhao*
    International Conference on Learning Representations (ICLR-2020), Addis Ababa, Ethiopia
    [Codes]

Knowledge Distillation for Multilingual Unsupervised Neural Machine Translation
    Haipeng Sun, Rui Wang*, Kehai Chen, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao*
    The 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020), Seattle, USA

Content Word Aware Neural Machine Translation
    Kehai Chen, Rui Wang*, Masao Utiyama, and Eiichiro Sumita
    The 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020), Seattle, USA

Regularized Context Gates on Transformer for Machine Translation
    Xintong Li, Lemao Liu, Rui Wang, Guoping Huang, and Max Meng
    The 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020), Seattle, USA

High-order Semantic Role Labeling
    Zuchao Li, Hai Zhao*, Rui Wang and Kevin Parnow
    The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP-2020-Findings), Punta Cana, Dominican Republic
    [Codes]

Reference Language based Unsupervised Neural Machine Translation
    Zuchao Li, Hai Zhao*, Rui Wang*, Masao Utiyama and Eiichiro Sumita
    The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP-2020-Findings), Punta Cana, Dominican Republic
    [Codes]

Robust Unsupervised Neural Machine Translation with Adversarial Denoising Training
    Haipeng Sun, Rui Wang, Kehai Chen, Xugang Lu, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao*
    The 28th International Conference on Computational Linguistics (COLING-2020), Barcelona, Spain

Explicit Sentence Compression for Neural Machine Translation
    Zuchao Li, Rui Wang*, Kehai Chen, Masao Utiyama, Eiichiro Sumita, Zhuosheng Zhang, and Hai Zhao*
    Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-2020), New York, USA
    [Codes]

SG-Net: Syntax-Guided Machine Reading Comprehension
    Zhuosheng Zhang, Yuwei Wu, Junru Zhou, Sufeng Duan, Hai Zhao*, and Rui Wang*
    Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-2020), New York, USA
    [Codes]

Memory Network for Linguistic Structure Parsing
    Zuchao Li, Chaoyu Guan, Hai Zhao, Rui Wang, Kevin Parnow, and Zhuosheng Zhang
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020

A Novel Sentence-Level Agreement Architecture for Neural Machine Translation
    Mingming Yang, Rui Wang, Kehai Chen, Xing Wang, Tiejun Zhao, and Min Zhang
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020

Towards More Diverse Input Representation for Neural Machine Translation
    Kehai Chen, Rui Wang*, Masao Utiyama, Eiichiro Sumita, Tiejun Zhao, Munyun Yang, and Hai Zhao
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020

Unsupervised Neural Machine Translation with Cross-lingual Language Representation Agreement
    Haipeng Sun, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020

2019

Unsupervised Bilingual Word Embedding Agreement for Unsupervised Neural Machine Translation
    Haipeng Sun, Rui Wang*, Kehai Chen, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao*
    The 57th Annual Meeting of the Association for Computational Linguistics (ACL-2019), Florence, Italy

Neural Machine Translation with Reordering Embeddings
    Kehai Chen, Rui Wang*, Masao Utiyama, and Eiichiro Sumita
    The 57th Annual Meeting of the Association for Computational Linguistics (ACL-2019), Florence, Italy

Sentence-Level Agreement for Neural Machine Translation
     Mingming Yang, Rui Wang*, Kehai Chen, Masao Utiyama, Eiichiro Sumita, Min Zhang*, and Tiejun Zhao
    The 57th Annual Meeting of the Association for Computational Linguistics (ACL-2019), Florence, Italy

Lattice-Based Transformer Encoder for Neural Machine Translation
     Fengshun Xiao, Jiangtong Li, Hai Zhao*, Rui Wang, and Kehai Chen
    The 57th Annual Meeting of the Association for Computational Linguistics (ACL-2019), Florence, Italy

Recurrent Positional Embedding for Neural Machine Translation
    Kehai Chen, Rui Wang*, Masao Utiyama, and Eiichiro Sumita
    2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP2019), Hong Kong, China

Neural Machine Translation with Sentence-level Topic Context
    Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2019

2018

Dynamic Sentence Sampling for Efficient Training of Neural Machine Translation
    Rui Wang, Masao Utiyama, and Eiichiro Sumita
    The 56th Annual Meeting of the Association for Computational Linguistics (ACL-2018), Melbourne, Australia

Exploring Recombination for Efficient Decoding of Neural Machine Translation
    Zhisong Zhang, Rui Wang*, Masao Utiyama, Eiichiro Sumita, and Hai Zhao*
    2018 Conference on Empirical Methods in Natural Language Processing (EMNLP-2018), Brussels, Belgium
    [Codes]

A Survey of Domain Adaptation for Neural Machine Translation
    Chenhui Chu and Rui Wang
    The 27th International Conference on Computational Linguistics (COLING-2018), Santa Fe, USA

Syntax-Directed Attention for Neural Machine Translation
    Kehai Chen, Rui Wang*, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao
    The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-2018), New Orleans, Lousiana, USA

Sentence Selection and Weighting for Neural Machine Translation Domain Adaptation
    Rui Wang, Masao Utiyama, Andrew Finch, Lemao Liu, Kehai Chen, and Eiichiro Sumita
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2018

A Neural Approach to Source Dependency-Based Context Model for Statistical Machine Translation
    Kehai Chen, Tiejun Zhao, Muyun Yang, Lemao Liu*, Akihiro Tamura, Rui Wang, Masao Utiyama, and Eiichiro Sumita
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2018

Graph-based Bilingual Word Embedding for Statistical Machine Translation
    Rui Wang, Hai Zhao*, Sabine Ploux*, Bao-Liang Lu, Masao Utiyama, and Eiichiro Sumita
    ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 2018

2017

Sentence Embedding for Neural Machine Translation Domain Adaptation
    Rui Wang, Andrew Finch, Masao Utiyama, and Eiichro Sumita
    The 55th annual meeting of the Association for Computational Linguistics (ACL-2017), Vancouver, Canada

Instance Weighting for Neural Machine Translation Domain Adaptation
    Rui Wang, Masao Utiyama, Lemao Liu, Kehai Chen, and Eiichro Sumita
    Conference on Empirical Methods in Natural Language Processing (EMNLP-2017), Copenhagen, Denmark

Neural Machine Translation with Source Dependency Representation
    Kehai Chen, Rui Wang*, Masao Utiyama, Lemao Liu, Akihiro Tamura, Eiichiro Sumita, and Tiejun Zhao
    Conference on Empirical Methods in Natural Language Processing (EMNLP-2017), Copenhagen, Denmark

Context-Aware Smoothing for Neural Machine Translation
    Kehai Chen, Rui Wang*, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao
    The 8th International Joint Conference on Natural Language Processing (IJCNLP 2017), Taipei, China

2016 and Before

A Bilingual Graph-based Semantic Model for Statistical Machine Translation
    Rui Wang, Hai Zhao*, Sabine Ploux*, Bao-Liang Lu, and Masao Utiyama
    25th International Joint Conference on Artificial Intelligence (IJCAI-16), New York, USA
    [Codes]

Converting Continuous-Space Language Models into N-gram Language Models with Efficient Bilingual Pruning for Statistical Machine Translation
    Rui Wang, Masao Utiyama*, Isao Goto, Eiichiro Sumita, Hai Zhao*, and Bao-Liang Lu
    ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 2016

Connecting Phrase based Statistical Machine Translation Adaptation
    Rui Wang, Hai Zhao*, Bao-Liang Lu, Masao Utiyama*, and Eiichro Sumita
    The 26th International Conference on Computational Linguistics (COLING-2016), Osaka, Japan

Bilingual Continuous-Space Language Model Growing for Statistical Machine Translation
    Rui Wang, Hai Zhao*, Bao-Liang Lu, Masao Utiyama, and Eiichiro Sumita
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2015

Neural Network Based Bilingual Language Model Growing for Statistical Machine Translation
    Rui Wang, Hai Zhao*, Bao-Liang Lu, Masao Utiyama, and Eiichro Sumita
    Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP-2014), Doha, Qatar

Converting Continuous-Space Language Models into N-gram Language Models for Statistical Machine Translation
    Rui Wang, Masao Utiyama, Isao Goto, Eiichro Sumita, Hai Zhao, and Bao-Liang Lu
    Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP-2013), Seattle, USA