figshare
Browse
Simultaneous Word-Morpheme Alignment for Statistical Machine Tran.pdf (285.2 kB)

Simultaneous Word-Morpheme Alignment for Statistical Machine Translation

Download (285.2 kB)
journal contribution
posted on 2013-06-09, 00:00 authored by Elif Eyigoz, Daniel Gildea, Kemal OflazerKemal Oflazer
Current word alignment models for statistical machine translation do not address morphology beyond merely splitting words. We present a two-level alignment model that distinguishes between words and morphemes, in which we embed an IBM Model 1 inside an HMM based word alignment model. The model jointly induces word and morpheme alignments using an EM algorithm. We evaluated our model on Turkish-English parallel data. We obtained significant improvement of BLEU scores over IBM Model 4. Our results indicate that utilizing information from morphology improves the quality of word alignments.

History

Publisher Statement

This is the published version of Eyigoz, E., Gildea, D., & Oflazer, K. (2013). Simultaneous Word-Morpheme Alignment for Statistical Machine Translation. Proceedings of NAACL-HLT 2013, 32-40. © 2013 Association for Computational Linguistics

Date

2013-06-09

Usage metrics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC