Nonparametric Word Segmentation for Machine Translation

journal contribution
posted on 2010-08-01, 00:00 authored by ThuyLinh Nguyen, Stephan Vogel, Noah A. Smith

We present an unsupervised word segmentation model for machine translation. The model uses existing monolingual segmentation techniques and models the joint distribution over source sentence segmentations and alignments to the target sentence. During inference, the monolingual segmentation model and the bilingual word alignment model are coupled so that the alignments to the target sentence guide the segmentation of the source sentence. The experiments show improvements on Arabic-English and Chinese-English translation tasks.

History

Publisher Statement

Copyright 2010 ACL

Date

2010-08-01
