This document provides instructions for Assignment 2 of the BMI 214 course, on machine learning for expression data and genotype-phenotype associations. It explains how to use the Weka machine learning tool to perform supervised and unsupervised learning on gene expression datasets. In the supervised learning part, students classify leukemia samples and evaluate different classifiers. In the unsupervised learning part, students perform k-means clustering on a yeast gene expression dataset. The assignment also includes feature selection exercises to identify genes that are informative for classification.