This document summarizes research on dynamic load balancing techniques for graphics processors. It describes static and dynamic load balancing methods like blocking queues, non-blocking queues, task stealing, and static task lists. It evaluates these methods experimentally on graphics processors for problems like octree partitioning and a computation intensive game. The research finds that lock-free synchronization scales better than locks, work stealing can outperform static techniques, and synchronization plays a key role in dynamic load balancing performance on GPUs.
Related topics: