PARABANK: Monolingual Bitext Generation and Sentential Paraphrasing via Lexically-Constrained Neural Machine Translation

J. Edward Hu; Rachel Rudinger; Matt Post; Benjamin Van Durme

doi:10.1609/aaai.v33i01.33016521

PARABANK: Monolingual Bitext Generation and Sentential Paraphrasing via Lexically-Constrained Neural Machine Translation

Authors

J. Edward Hu Johns Hopkins University
Rachel Rudinger Johns Hopkins University
Matt Post Johns Hopkins University
Benjamin Van Durme Johns Hopkins University

DOI:

https://doi.org/10.1609/aaai.v33i01.33016521

Abstract

We present PARABANK, a large-scale English paraphrase dataset that surpasses prior work in both quantity and quality. Following the approach of PARANMT (Wieting and Gimpel, 2018), we train a Czech-English neural machine translation (NMT) system to generate novel paraphrases of English reference sentences. By adding lexical constraints to the NMT decoding procedure, however, we are able to produce multiple high-quality sentential paraphrases per source sentence, yielding an English paraphrase resource with more than 4 billion generated tokens and exhibiting greater lexical diversity. Using human judgments, we also demonstrate that PARABANK’s paraphrases improve over PARANMT on both semantic similarity and fluency. Finally, we use PARABANK to train a monolingual NMT model with the same support for lexically-constrained decoding for sentence rewriting tasks.

Downloads

Published

2019-07-17

How to Cite

Hu, J. E., Rudinger, R., Post, M., & Van Durme, B. (2019). PARABANK: Monolingual Bitext Generation and Sentential Paraphrasing via Lexically-Constrained Neural Machine Translation. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 6521-6528. https://doi.org/10.1609/aaai.v33i01.33016521

Download Citation

Issue

Vol. 33 No. 01: AAAI-19, IAAI-19, EAAI-20

Section

AAAI Technical Track: Natural Language Processing

PARABANK: Monolingual Bitext Generation and Sentential Paraphrasing via Lexically-Constrained Neural Machine Translation

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription