This document discusses multi-lingual search and machine translation. It introduces Tommaso Teofili and Suneel Marthi, who work on Apache projects related to natural language processing. They discuss why multi-lingual search is important to embrace diversity online. Statistical machine translation generates translations from models trained on parallel text corpora. Phrase-based models can translate phrases as units and handle reordering better than word-based models. Apache Joshua is an open source machine translation decoder used by many organizations.
Related topics: