Type

Conference Proceedings

Authors

Andy Way
Pintu Lohar
Walid Aransa
Haithem Afli

Subjects

Linguistics

Topics
social media machine translating machine translation automatic error correction internet arabic language arabic user generated content pre processing

From Arabic user-generated content to machine translation: integrating automatic error correction (2016)

Abstract With the wide spread of the social media and online forums, individual users have been able to actively participate in the generation of online content in different languages and dialects. Arabic is one of the fastest growing languages used on Internet, but dialects (like Egyptian and Saudi Arabian) have a big share of the Arabic online content. There are many differences between Dialectal Arabic and Modern Standard Arabic which cause many challenges for Machine Translation of informal Arabic language. In this paper, we investigate the use of Automatic Error Correction method to improve the quality of Arabic User-Generated texts and its automatic translation. Our experiments show that the new system with automatic correction module outperforms the baseline system by nearly 22.59% of relative improvement.
Collections Ireland -> Dublin City University -> Publication Type = Conference or Workshop Item
Ireland -> Dublin City University -> DCU Faculties and Centres = DCU Faculties and Schools: Faculty of Engineering and Computing: School of Computing
Ireland -> Dublin City University -> DCU Faculties and Centres = Research Initiatives and Centres: ADAPT
Ireland -> Dublin City University -> Status = Published
Ireland -> Dublin City University -> Subject = Computer Science: Machine translating

Full list of authors on original publication

Andy Way, Pintu Lohar, Walid Aransa, Haithem Afli

Experts in our system

1
Andy Way
Dublin City University
Total Publications: 229
 
2
Pintu Lohar
Dublin City University
Total Publications: 10
 
3
Haithem Afli
Dublin City University
Total Publications: 14