Type

Conference Proceedings

Authors

Andy Way
Pintu Lohar
Haithem Afli

Subjects

Computer Science

Topics
computer vision; aligned; machine translating; collection; web; multimodal corpus; multimodal data; natural language processing

MultiNews: a web collection of an aligned multimodal and multilingual corpus (2017)

Abstract Integrating Natural Language Processing (NLP) and computer vision is a promising effort. However, the applicability of these methods depends directly on the availability of specific multimodal data that includes both images and text. In this paper, we present a multimodal corpus of comparable documents and their images in 9 languages, collected from the web news articles of the Euronews website. This corpus has found widespread use in the NLP community in multilingual and multimodal tasks. Here, we focus on the acquisition of the image and text data and their multilingual alignment.
Collections Ireland -> Dublin City University -> Publication Type = Conference or Workshop Item
Ireland -> Dublin City University -> DCU Faculties and Centres = DCU Faculties and Schools: Faculty of Engineering and Computing: School of Computing
Ireland -> Dublin City University -> DCU Faculties and Centres = Research Initiatives and Centres: ADAPT
Ireland -> Dublin City University -> Status = Published
Ireland -> Dublin City University -> Subject = Computer Science: Machine translating

Experts in our system

1
Andy Way
Dublin City University
Total Publications: 229
 
2
Pintu Lohar
Dublin City University
Total Publications: 10
 
3
Haithem Afli
Dublin City University
Total Publications: 14