Mandatory Fields

Authors

Alberto Poncelas, Gideon Maillette de Buy Wenniger and Andy Way

Conference Title

EAMT 2018 - 21st Annual Conference of the European Association for Machine Translation

Title of Paper

Feature Decay Algorithms for Neural Machine Translation.

Year

2018

Month

May

Status

Published

Peer Reviewed

Times Cited

()

Optional Fields

Search Keyword

Editors

Start Page

239

End Page

248

Location

Alicante, Spain

Start Date

28-MAY-18

End Date

30-MAY-18

Abstract

Neural Machine Translation (NMT) systems require a lot of data to be competitive. For this reason, data selection techniques are used only for finetuning systems that have been trained with larger amounts of data. In this work we aim to use Feature Decay Algorithms (FDA) data selection techniques not only to fine-tune a system but also to build a complete system with less data. Our findings reveal that it is possible to find a subset of sentence pairs, that outperforms by 1.11 BLEU points the full training corpus, when used for training a GermanEnglish NMT system .

Funded By

URL

https://rua.ua.es/dspace/bitstream/10045/76084/1/EAMT2018-Proceedings_26.pdf

DOI Link

Grant Details

Funding Body

Science Foundation Ireland (SFI)

Grant Details

SFI Research Centres Programme (Grant 13/RC/2106); H2020 Marie SkłodowskaCurie grant agreement No 713567