Type

Conference Proceedings

Authors

Maria Giagkou
Prokopis Prokopidis
Vassilis Papavassiliou
Andy Way
Antonio Toral
Pavel Pecina

Subjects

Linguistics

Topics
web data, language modelling, environment, domain adaptation, english, statistical machine translation, language pairs, machine translating

Towards using web-crawled data for domain adaptation in statistical machine translation (2011)

Abstract

This paper reports on ongoing work on domain adaptation of statistical machine translation using domain-specific data obtained by domain-focused web crawling. We present a strategy for crawling monolingual and parallel data and for exploiting them in testing, language modelling, and system tuning in a phrase-based machine translation framework. The proposed approach is evaluated on two domains, Natural Environment and Labour Legislation, and two language pairs, English–French and English–Greek.

Collections

Ireland -> Dublin City University -> Publication Type = Conference or Workshop Item
Ireland -> Dublin City University -> DCU Faculties and Centres = DCU Faculties and Schools: Faculty of Engineering and Computing: School of Computing
Ireland -> Dublin City University -> Subject = Computer Science
Ireland -> Dublin City University -> DCU Faculties and Centres = DCU Faculties and Schools
Ireland -> Dublin City University -> Status = Published
Ireland -> Dublin City University -> Subject = Computer Science: Machine translating
Ireland -> Dublin City University -> DCU Faculties and Centres = DCU Faculties and Schools: Faculty of Engineering and Computing

Experts in our system

1. Andy Way, Dublin City University (Total Publications: 229)
2. Antonio Toral, Dublin City University (Total Publications: 18)
3. Pavel Pecina, Dublin City University (Total Publications: 12)