Show simple item record

dc.contributor.authorWanye, Frank
dc.contributor.authorGleyzer, Vitaliy
dc.contributor.authorKao, Edward
dc.contributor.authorFeng, Wu-chun
dc.date.accessioned2025-10-07T20:35:34Z
dc.date.available2025-10-07T20:35:34Z
dc.date.issued2025-07-20
dc.identifier.isbn979-8-4007-1869-4
dc.identifier.urihttps://hdl.handle.net/1721.1/163070
dc.descriptionHPDC ’25, Notre Dame, IN, USAen_US
dc.description.abstractStochastic block partitioning (SBP) is a statistical inference-based algorithm for clustering vertices within a graph. It has been shown to be statistically robust and highly accurate even on graphs with a complex structure, but its poor scalability limits its usability to smaller-sized graphs. In this manuscript we argue that one reason for its poor scalability is the agglomerative, or bottom-up, nature of SBP’s algorithmic design; the agglomerative computations cause high memory usage and create a large search space that slows down statistical inference, particularly in the algorithm’s initial iterations. To address this bottleneck, we propose Top-Down SBP, a novel algorithm that replaces the agglomerative (bottom-up) block merges in SBP with a block-splitting operation. This enables the algorithm to start with all vertices in one cluster and subdivide them over time into smaller clusters. We show that Top-Down SBP is up to 7.7× faster than Bottom-Up SBP without sacrificing accuracy and can process larger graphs than Bottom-Up SBP on the same hardware due to an up to 4.1× decrease in memory usage. Additionally, we adapt existing methods for accelerating BottomUp SBP to the Top-Down approach, leading to up to 13.2× speedup over accelerated Bottom-Up SBP and up to 403× speedup over sequential Bottom-Up SBP on 64 compute nodes. Thus, Top-Down SBP represents substantial improvements to the scalability of SBP, enabling the analysis of larger datasets on the same hardware.en_US
dc.publisherACM|The 34th International Symposium on High-Performance Parallel and Distributed Computingen_US
dc.relation.isversionofhttps://doi.org/10.1145/3731545.3731589en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceAssociation for Computing Machineryen_US
dc.titleTop-Down SBP: Turning Graph Clustering Upside Downen_US
dc.typeArticleen_US
dc.identifier.citationWanye, Frank, Gleyzer, Vitaliy, Kao, Edward and Feng, Wu-chun. 2025. "Top-Down SBP: Turning Graph Clustering Upside Down."
dc.contributor.departmentLincoln Laboratoryen_US
dc.identifier.mitlicensePUBLISHER_POLICY
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2025-10-01T07:47:07Z
dc.language.rfc3066en
dc.rights.holderThe author(s)
dspace.date.submission2025-10-01T07:47:07Z
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record