This document discusses dimensionality reduction techniques for genomic data. It begins with an introduction to genomics data science and the "curse of dimensionality" for high-dimensional data. Popular dimensionality reduction methods like PCA, ISOMap and t-SNE are then described. Specific use cases are discussed, such as using PCA to analyze population variations, ISOMap to infer cell populations from single-cell RNA-seq data, and t-SNE to visualize tissue expression profiles from the GTEx dataset. The presentation encourages exploring different dimensionality reduction methods and visualizing the results.
Related topics: