This document summarizes the speaker's work developing GPU rigid body simulation techniques over three generations. It discusses optimizations made to port various stages of rigid body simulation like collision detection and constraint solving to the GPU. These include using parallel sorting, grid structures, and graph coloring to parallelize pairs testing and constraint solving while avoiding write conflicts between GPU threads. The techniques have allowed achieving CPU-level rigid body simulation quality entirely on the GPU.