Mixed up with Machine Translation: Multi-word Units Disambiguation Challenge

Monti, Johanna - Barreiro, Anabela - Elia, Annibale - Monteleone, Mario (2010) Mixed up with Machine Translation: Multi-word Units Disambiguation Challenge. In: Translating and the Computer ASLIB Conference 2010, 18 e 19 novembre 2010, Londra. (In corso di pubblicazione)

Documento PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
75Kb

Abstract

With the rapid evolution of the Internet, translation has become part of the daily life of ordinary users, not only of professional translators. Machine translation has evolved along with different types of computer-assisted translation tools. Qualitative progress has been made in the field of machine translation, but not all problems have been solved. The current times are auspicious for the development of more sophisticated evaluation tools that measure the performance of specific linguistic phenomena. One problem in particular, namely the poor analysis and translation of multi-word units, is an arena where investment in linguistic knowledge systems with the goal of improving machine translation would be beneficial. This paper addresses the difficulties multi-word units present to machine translation, by comparing translations performed by systems adopting different approaches to machine translation. It proposes a solution for improving the quality of the translation of multi-word units by adopting a methodology that combines Lexicon Grammar resources with OpenLogos lexical resources and semantico-syntactic rules. Finally, it discusses how an ideal machine translation evaluation tool might look to correctly evaluate the performance of machine translation engines with regards to multi-word units and thus to contribute to the improvement of translation quality.

Tipologia del documento:Contributo a convegno o workshop (Comunicazione)
Parole chiave:Machine Translation, multiword units, Lexicon Grammar, OpenLogos, evaluation
Settori scientifico-disciplinari del MIUR:AREA 10 - Scienze dell'antichita' / filologico letterarie e storico artistiche > GLOTTOLOGIA E LINGUISTICA
Codice identificativo (ID):337
Depositato da:Dott.ssa Johanna Monti
Depositato il:27 Ott 2010 09:27
Ultima modifica:27 Ott 2010 09:27

Riservato allo Staff dell'Archivio: Accedi al record del documento