Mandatory Fields

Authors

Okita T.;Way A.

Conference Title

Proceedings - 2010 International Conference on Asian Language Processing, IALP 2010

Title of Paper

Hierarchical Pitman-Yor language model for machine translation

Year

2010

Month

December

Status

Published

Peer Reviewed

Times Cited

()

Optional Fields

Search Keyword

Hierarchical Pitman-Yor process Statistical machine translation Statistical smoothing method

Editors

Start Page

245

End Page

248

Location

Start Date

End Date

Abstract

The hierarchical Pitman-Yor process-based smoothing method applied to language model was proposed by Goldwater and by Teh; the performance of this smoothing method is shown comparable with the modified Kneser-Ney method in terms of perplexity. Although this method was presented four years ago, there has been no paper which reports that this language model indeed improves translation quality in the context of Machine Translation (MT). This is important for the MT community since an improvement in perplexity does not always lead to an improvement in BLEU score; for example, the success of word alignment measured by Alignment Error Rate (AER) does not often lead to an improvement in BLEU. This paper reports in the context of MT that an improvement in perplexity really leads to an improvement in BLEU score. It turned out that an application of the Hierarchical Pitman-Yor Language Model (HPYLM) requires a minor change in the conventional decoding process. Additionally to this, we propose a new Pitman-Yor process-based statistical smoothing method similar to the Good-Turing method although the performance of this is inferior to HPYLM. We conducted experiments; HPYLM improved by 1.03 BLEU points absolute and 6% relative for 50k EN-JP, which was statistically significant. © 2010 IEEE.

Funded By

URL

DOI Link

10.1109/IALP.2010.34

Grant Details

Funding Body

Grant Details