Type

Conference Proceedings

Authors

Ying Zhang
Eamonn Newman
Fabio Fantino
Gareth J. F. Jones

Subjects

Computer Science

Topics
information systems cultural heritage query machine translation standard system hybrid system information retrieval access information

Domain-specific query translation for multilingual information access using machine translation augmented with dictionaries mined from Wikipedia (2008)

Abstract Accurate high-coverage translation is a vital component of reliable cross language information access (CLIA) systems. While machine translation (MT) has been shown to be effective for CLIA tasks in previous evaluation workshops, it is not well suited to specialized tasks where domain specific translations are required. We demonstrate that effective query translation for CLIA can be achieved in the domain of cultural heritage (CH). This is performed by augmenting a standard MT system with domainspecific phrase dictionaries automatically mined from the online Wikipedia. Experiments using our hybrid translation system with sample query logs from users of CH websites demonstrate a large improvement in the accuracy of domain specific phrase detection and translation.
Collections Ireland -> Dublin City University -> Publication Type = Conference or Workshop Item
Ireland -> Dublin City University -> Subject = Computer Science
Ireland -> Dublin City University -> DCU Faculties and Centres = Research Initiatives and Centres: Centre for Digital Video Processing (CDVP)
Ireland -> Dublin City University -> Status = Published
Ireland -> Dublin City University -> Subject = Computer Science: Information retrieval
Ireland -> Dublin City University -> DCU Faculties and Centres = Research Initiatives and Centres

Full list of authors on original publication

Ying Zhang, Eamonn Newman, Fabio Fantino, Gareth J. F. Jones

Experts in our system

1
Eamonn Newman
Dublin City University
Total Publications: 27
 
2
Gareth J. F. Jones
Dublin City University
Total Publications: 265