Type

Conference Proceedings

Authors

Andy Way
Qun Liu
Peyman Passban

Subjects

Linguistics

Topics
probability neural network semantic language pairs statistical machine translation phrase based statistical machine translation machine translating experimental

Enriching phrase tables for statistical machine translation using mixed embeddings (2016)

Abstract The phrase table is considered to be the main bilingual resource for the phrase-based statistical machine translation (PBSMT) model. During translation, a source sentence is decomposed into several phrases. The best match of each source phrase is selected among several target-side counterparts within the phrase table, and processed by the decoder to generate a sentence-level translation. The best match is chosen according to several factors, including a set of bilingual features. PBSMT engines by default provide four probability scores in phrase tables which are considered as the main set of bilingual features. Our goal is to enrich that set of features, as a better feature set should yield better translations. We propose new scores generated by a Convolutional Neural Network (CNN) which indicate the semantic relatedness of phrase pairs. We evaluate our model in different experimental settings with different language pairs. We observe significant improvements when the proposed features are incorporated into the PBSMT pipeline.
Collections Ireland -> Dublin City University -> Publication Type = Conference or Workshop Item
Ireland -> Dublin City University -> DCU Faculties and Centres = DCU Faculties and Schools: Faculty of Engineering and Computing: School of Computing
Ireland -> Dublin City University -> DCU Faculties and Centres = Research Initiatives and Centres: ADAPT
Ireland -> Dublin City University -> Status = Published
Ireland -> Dublin City University -> Subject = Computer Science: Machine translating

Full list of authors on original publication

Andy Way, Qun Liu, Peyman Passban

Experts in our system

1
Andy Way
Dublin City University
Total Publications: 229
 
2
Qun Liu
Dublin City University
Total Publications: 31
 
3
Peyman Passban
Dublin City University
Total Publications: 9