Show simple item record

dc.contributor.authorCheng, David R.
dc.contributor.authorShah, Viral
dc.contributor.authorGilbert, John R.
dc.contributor.authorEdelman, Alan
dc.date.accessioned2004-12-13T06:10:47Z
dc.date.available2004-12-13T06:10:47Z
dc.date.issued2005-01
dc.identifier.urihttp://hdl.handle.net/1721.1/7418
dc.description.abstractWe consider the often-studied problem of sorting, for a parallel computer. Given an input array distributed evenly over p processors, the task is to compute the sorted output array, also distributed over the p processors. Many existing algorithms take the approach of approximately load-balancing the output, leaving each processor with Θ(n/p) elements. However, in many cases, approximate load-balancing leads to inefficiencies in both the sorting itself and in further uses of the data after sorting. We provide a deterministic parallel sorting algorithm that uses parallel selection to produce any output distribution exactly, particularly one that is perfectly load-balanced. Furthermore, when using a comparison sort, this algorithm is 1-optimal in both computation and communication. We provide an empirical study that illustrates the efficiency of exact data splitting, and shows an improvement over two sample sort algorithms.en
dc.description.sponsorshipSingapore-MIT Alliance (SMA)en
dc.format.extent122908 bytes
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.relation.ispartofseriesComputer Science (CS);
dc.subjectParallel sortingen
dc.subjectdistributed-memory algorithmsen
dc.subjectHigh Performance Computingen
dc.titleFast Sorting on a Distributed-Memory Architectureen
dc.typeArticleen


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record