Mixed up with Machine Translation: Multi-word Units Disambiguation Challenge

Monti, Johanna and Barreiro, Anabela and Elia, Annibale and Monteleone, Mario (2010) Mixed up with Machine Translation: Multi-word Units Disambiguation Challenge. In: Translating and the Computer ASLIB Conference 2010, 18 e 19 novembre 2010, Londra. (In Press)

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader


With the rapid evolution of the Internet, translation has become part of the daily life of ordinary users, not only of professional translators. Machine translation has evolved along with different types of computer-assisted translation tools. Qualitative progress has been made in the field of machine translation, but not all problems have been solved. The current times are auspicious for the development of more sophisticated evaluation tools that measure the performance of specific linguistic phenomena. One problem in particular, namely the poor analysis and translation of multi-word units, is an arena where investment in linguistic knowledge systems with the goal of improving machine translation would be beneficial. This paper addresses the difficulties multi-word units present to machine translation, by comparing translations performed by systems adopting different approaches to machine translation. It proposes a solution for improving the quality of the translation of multi-word units by adopting a methodology that combines Lexicon Grammar resources with OpenLogos lexical resources and semantico-syntactic rules. Finally, it discusses how an ideal machine translation evaluation tool might look to correctly evaluate the performance of machine translation engines with regards to multi-word units and thus to contribute to the improvement of translation quality.

Item Type:Conference or Workshop Item (Lecture)
Uncontrolled Keywords:Machine Translation, multiword units, Lexicon Grammar, OpenLogos, evaluation
Subjects:AREA 10 - Scienze dell'antichita' / filologico letterarie e storico artistiche > GLOTTOLOGIA E LINGUISTICA
ID Code:337
Deposited By:Dott.ssa Johanna Monti
Deposited On:27 Oct 2010 09:27
Last Modified:27 Oct 2010 09:27

Repository Staff Only: item control page