This document provides an overview of supervised and semi-supervised machine learning techniques for natural language processing (NLP). It begins with an introduction on why machine learning is important for NLP. It then discusses three examples of supervised learning tasks: review classification, relevance ranking, and machine translation. The remainder of the document covers supervised learning methods including generative models, discriminative models such as AdaBoost and support vector machines (SVMs), and applications to NLP problems. It concludes by mentioning opportunities to work with Microsoft on related research.
Related topics: