Conference Publication Details
Mandatory Fields
Alberto Poncelas, Gideon Maillette de Buy Wenniger and Andy Way
EAMT 2018 - 21st Annual Conference of the European Association for Machine Translation
Feature Decay Algorithms for Neural Machine Translation.
2018
May
Published
1
()
Optional Fields
239
248
Alicante, Spain
28-MAY-18
30-MAY-18
Neural Machine Translation (NMT) systems require a lot of data to be competitive. For this reason, data selection techniques are used only for finetuning systems that have been trained with larger amounts of data. In this work we aim to use Feature Decay Algorithms (FDA) data selection techniques not only to fine-tune a system but also to build a complete system with less data. Our findings reveal that it is possible to find a subset of sentence pairs, that outperforms by 1.11 BLEU points the full training corpus, when used for training a GermanEnglish NMT system .
https://rua.ua.es/dspace/bitstream/10045/76084/1/EAMT2018-Proceedings_26.pdf
Grant Details
Science Foundation Ireland (SFI)
SFI Research Centres Programme (Grant 13/RC/2106); H2020 Marie SkłodowskaCurie grant agreement No 713567