This document discusses using machine learning to improve contextual translation by choosing between conflicting translation mappings extracted from bilingual corpora. It presents a machine learning approach that builds decision tree classifiers to select the most appropriate mapping based on linguistic features in the source language. The selected features provide insight into important contextual factors for translation. The approach is evaluated on a Spanish-English translation task using a corpus of 351,026 aligned sentence pairs, achieving significantly better translated output than prior methods that did not distinguish context for conflicting mappings.
Related topics: