WEKA is a collection of machine learning algorithms that can be used for data mining tasks. It supports various file formats like ARFF. The document discusses how to load data, perform univariate analysis, feature selection using info gain and correlation, and divide data into training, validation and test sets. It also demonstrates building a logistic regression model, analyzing it using ROC curves and cost-benefit analysis. WEKA allows reapplying models on new data and has advantages like being platform independent with a GUI, APIs and CLI, but has limitations such as limited visualizations and file formats.