In this paper, we study how humans perceive
the use of images as an additional
knowledge source to machine-translate user-generated
product listings in an e-commerce
company. We conduct a human evaluation
where we assess how a multi-modal neural
machine translation (NMT) model compares
to two text-only approaches: a conventional
state-of-the-art attention-based NMT and a
phrase-based statistical machine translation
(PBSMT) model. We evaluate translations
obtained with the different systems and also discuss
our data set, which comprises user-generated
product listings paired with their associated
images. We find that humans prefer translations
produced by the PBSMT system to those of both the
text-only and the multi-modal NMT models over 56% of the time.
Nonetheless, human evaluators rank translations
from the multi-modal NMT model as better
than those of the text-only NMT model over 88% of
the time, which suggests that images do help
NMT in this use case.