Evaluation of Connected-Component Labeling Algorithms for Distributed-Memory Systems

Date of Submission: 
July 30, 2014
Report Number: 
Report PDF: 
Connected-component labeling is a key step in a wide-range of applications, such as community detection in social networks and coherent structure identification in massively-parallel scientific simulations. There have been several distributed-memory connected-component algorithms described in literature; however, little has been done regarding their scalability analysis. We present theoretical and experimental results for five algorithms: three that are direct implementations of previous approaches, one that is an implementation of a previous approach that is optimized to reduce communication, and one that is a novel approach based on graph contraction. Under weak scaling and for certain classes of graphs, the graph contraction algorithm scales consistently better than the four other algorithms. Furthermore, it uses significantly less memory than two of the alternative methods and is of the same order in terms of memory as the other two.