Stanford InfoLab Publication Server

Extensions to HMM-based Statistical Word Alignment Models

Toutanova, Kristina and Ilhan, H. Tolga and Manning, Christopher (2002) Extensions to HMM-based Statistical Word Alignment Models. In: 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP 2002), July 6-7, 2002, Philadelphia, PA.




This paper describes improved HMM-based word level alignment models for statistical machine translation. We present a method for using part of speech tag information to improve alignment accuracy, and an approach to modeling fertility and correspondence to the empty word in an HMM alignment model. We present accuracy results from evaluating Viterbi alignments against human-judged alignments on the Canadian Hansards corpus, as compared to a bigram HMM, and IBM model 4. The results show up to 16% alignment error reduction.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:statistical machine translation, word level alignment models
Related URLs:Project Homepage
ID Code:557
Deposited By:Import Account
Deposited On:16 Oct 2002 17:00
Last Modified:25 Dec 2008 10:11

Download statistics

Repository Staff Only: item control page