Type

Conference Proceedings

Authors

Andy Way
Chao-Hong Liu
Haithem Afli
Pintu Lohar

Subjects

Linguistics

Topics
machine translation english french named entities system alignment natural language processing machine translating participation

The ADAPT bilingual document alignment system at WMT16 (2016)

Abstract Comparable corpora have been shown to be useful in several multilingual natural language processing (NLP) tasks. Many previous papers have focused on how to improve the extraction of parallel data from this kind of corpus on different levels. In this paper, we are interested in improving the quality of bilingual comparable corpora according to increased document alignment score. We describe our participation in the bilingual document alignment shared task of the First Conference on Machine Translation (WMT16). We propose a technique based on sourceto-target sentence- and word-based scores and the fraction of matched source named entities. We performed our experiments on English-to-French document alignments for this bilingual task.
Collections Ireland -> Dublin City University -> Publication Type = Conference or Workshop Item
Ireland -> Dublin City University -> DCU Faculties and Centres = DCU Faculties and Schools: Faculty of Engineering and Computing: School of Computing
Ireland -> Dublin City University -> DCU Faculties and Centres = Research Initiatives and Centres: ADAPT
Ireland -> Dublin City University -> Status = Published
Ireland -> Dublin City University -> Subject = Computer Science: Machine translating

Full list of authors on original publication

Andy Way, Chao-Hong Liu, Haithem Afli, Pintu Lohar

Experts in our system

1
Andy Way
Dublin City University
Total Publications: 229
 
2
Chao-Hong Liu
Dublin City University
Total Publications: 5
 
3
Haithem Afli
Dublin City University
Total Publications: 14
 
4
Pintu Lohar
Dublin City University
Total Publications: 10