This document discusses the use of data mining techniques to predict annual crop yields in India. Its abstract notes that agriculture is central to the Indian economy, but that crop production depends on seasonal and environmental factors, which makes yield prediction difficult. The introduction then describes data mining and how it could be applied to yield prediction. The literature review covers methods such as linear regression and the k-nearest neighbor algorithm, which use historical data on climate, soil conditions, and other factors to predict the yields of major crops in India. The goal is to help farmers choose optimal crops and improve farm productivity and profits.
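
To make the two methods named in the literature review concrete, here is a minimal Python sketch of a one-variable linear regression and a k-nearest neighbor regressor. It is not taken from the document: the records, the feature choice (rainfall and temperature), and the function names `fit_simple_linear` and `knn_predict` are all hypothetical, and the numbers are invented purely for illustration.

```python
from math import sqrt

# Hypothetical records, invented for illustration only (not real yield data):
# (seasonal rainfall in mm, mean temperature in deg C, yield in t/ha)
RECORDS = [
    (600.0, 26.0, 2.0),
    (700.0, 27.0, 2.4),
    (800.0, 25.0, 2.9),
    (900.0, 28.0, 3.1),
    (1000.0, 26.0, 3.6),
]


def fit_simple_linear(xs, ys):
    """Ordinary least squares fit of y = a + b * x with a single predictor."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def knn_predict(records, query, k=3):
    """Predict yield as the mean yield of the k records nearest to `query`.

    Distance is plain Euclidean distance on the unscaled features, so
    rainfall dominates here; a real analysis would normally rescale the
    features first.
    """
    def distance(record):
        return sqrt(sum((f - q) ** 2 for f, q in zip(record[:-1], query)))

    nearest = sorted(records, key=distance)[:k]
    return sum(record[-1] for record in nearest) / len(nearest)


if __name__ == "__main__":
    rainfall = [r[0] for r in RECORDS]
    yields = [r[2] for r in RECORDS]
    intercept, slope = fit_simple_linear(rainfall, yields)
    print("linear estimate at 850 mm:", intercept + slope * 850.0)
    print("3-NN estimate at (850 mm, 26 C):", knn_predict(RECORDS, (850.0, 26.0)))
```

The linear model summarizes the whole history with one fitted line, while the nearest-neighbor model predicts from the few most similar past seasons. Which of the two suits a given crop and region better is an empirical question that this sketch does not answer.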