Mandatory Fields

Authors

Peyman Passban, Qun Liu and Andy Way

Conference Title

HLT-NAACL 2018, the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Title of Paper

Improving Character-based Decoding Using Target-Side Morphological Information for Neural Machine Translation

Year

2018

Month

June

Status

Published

Peer Reviewed

Times Cited


Optional Fields

Search Keyword

Editors

Start Page

End Page

Location

New Orleans, USA

Start Date

01-JUN-18

End Date

06-JUN-18

Abstract

Recently, neural machine translation (NMT) has emerged as a powerful alternative to conventional statistical approaches. However, its performance drops considerably in the presence of morphologically rich languages (MRLs). Neural engines usually fail to tackle the large vocabulary and high out-of-vocabulary (OOV) word rate of MRLs. Therefore, it is not suitable to exploit existing word-based models to translate this set of languages. In this paper, we propose an extension to the state-of-the-art model of Chung et al. (2016), which works at the character level and boosts the decoder with target-side morphological information. In our architecture, an additional morphology table is plugged into the model. Each time the decoder samples from a target vocabulary, the table sends auxiliary signals from the most relevant affixes in order to enrich the decoder’s current state and constrain it to provide better predictions. We evaluated our model to translate English into German, Russian, and Turkish as three MRLs and observed significant improvements.

Funded By

URL

https://www.aclweb.org/anthology/N18-1006

DOI Link

10.18653/v1/n18-1006

Grant Details

Funding Body

Science Foundation Ireland (SFI)

Grant Details

SFI Research Centres Programme (Grant 13/RC/2106)