GPU-accelerated Chemical Similarity Assessment for Large Scale Databases
Author(s)
Maggioni, Marco; Santambrogio, Marco Domenico; Liang, Jie
DownloadMaggioni-2011-GPU-accelerated Chem.pdf (392.4Kb)
PUBLISHER_CC
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordAbstract
The assessment of chemical similarity between molecules is a basic operation in chemoinformatics, a computational area concerning with the manipulation of chemical structural information. Comparing molecules is the basis for a wide range of applications such as searching in chemical databases, training prediction models for virtual screening or aggregating clusters of similar compounds. However, currently available multimillion databases represent a challenge for conventional chemoinformatics algorithms raising the necessity for faster similarity methods. In this paper, we extensively analyze the advantages of using many-core architectures for calculating some commonly-used chemical similarity coe_cients such as Tanimoto, Dice or Cosine. Our aim is to provide a wide-breath proof-of-concept regarding the usefulness of GPU architectures to chemoinformatics, a class of computing problems still uncovered. In our work, we present a general GPU algorithm for all-to-all chemical comparisons considering both binary fingerprints and floating point descriptors as molecule representation. Subsequently, we adopt optimization techniques to minimize global memory accesses and to further improve e_ciency. We test the proposed algorithm on different experimental setups, a laptop with a low-end GPU and a desktop with a more performant GPU. In the former case, we obtain a 4-to-6-fold speed-up over a single-core implementation for fingerprints and a 4-to-7-fold speed-up for descriptors. In the latter case, we respectively obtain a 195-to-206-fold speed-up and a 100-to-328-fold speed-up.
Date issued
2011Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence LaboratoryJournal
Procedia Computer Science
Publisher
Elsevier B.V.
Citation
Maggioni, Marco, Marco Domenico Santambrogio, and Jie Liang. “GPU-Accelerated Chemical Similarity Assessment for Large Scale Databases.” Procedia Computer Science 4 (2011): 2007–2016. © 2011 Elsevier B.V.
Version: Final published version
ISSN
18770509