Toutanova, Kristina and Ilhan, H. Tolga and Manning, Christopher (2002) Extensions to HMM-based Statistical Word Alignment Models. In: 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP 2002), July 6-7, 2002, Philadelphia, PA.
BibTeX | DublinCore | EndNote | HTML |
![]()
| PDF 122Kb |
Abstract
This paper describes improved HMM-based word level alignment models for statistical machine translation. We present a method for using part of speech tag information to improve alignment accuracy, and an approach to modeling fertility and correspondence to the empty word in an HMM alignment model. We present accuracy results from evaluating Viterbi alignments against human-judged alignments on the Canadian Hansards corpus, as compared to a bigram HMM, and IBM model 4. The results show up to 16% alignment error reduction.
Item Type: | Conference or Workshop Item (Paper) | |
---|---|---|
Uncontrolled Keywords: | statistical machine translation, word level alignment models | |
Subjects: | Miscellaneous | |
Projects: | Miscellaneous | |
Related URLs: | Project Homepage | http://www-nlp.stanford.edu/ |
ID Code: | 557 | |
Deposited By: | Import Account | |
Deposited On: | 16 Oct 2002 17:00 | |
Last Modified: | 25 Dec 2008 10:11 |
Download statistics
Repository Staff Only: item control page