This paper presents a hybrid parallel breadth-first search (BFS) algorithm for distributed memory systems that uses two stacks. BFS is important for graph algorithms and parallelizing it is important for large graphs. The paper's contributions include using a sparse matrix representation and a hybrid 1D partitioning approach. Experimental results on large systems show the 1D hybrid algorithm scales better than flat 1D partitioning for larger concurrencies and is up to 1.8 times faster than 2D partitioning algorithms. The paper concludes by conjecturing that level synchronous BFS can be implemented without errors using relaxed queues.
Related topics: