Show simple item record

dc.contributor.authorMaggioni, Marco
dc.contributor.authorSantambrogio, Marco Domenico
dc.contributor.authorLiang, Jie
dc.date.accessioned2014-12-12T19:16:42Z
dc.date.available2014-12-12T19:16:42Z
dc.date.issued2011
dc.identifier.issn18770509
dc.identifier.urihttp://hdl.handle.net/1721.1/92298
dc.description.abstractThe assessment of chemical similarity between molecules is a basic operation in chemoinformatics, a computational area concerning with the manipulation of chemical structural information. Comparing molecules is the basis for a wide range of applications such as searching in chemical databases, training prediction models for virtual screening or aggregating clusters of similar compounds. However, currently available multimillion databases represent a challenge for conventional chemoinformatics algorithms raising the necessity for faster similarity methods. In this paper, we extensively analyze the advantages of using many-core architectures for calculating some commonly-used chemical similarity coe_cients such as Tanimoto, Dice or Cosine. Our aim is to provide a wide-breath proof-of-concept regarding the usefulness of GPU architectures to chemoinformatics, a class of computing problems still uncovered. In our work, we present a general GPU algorithm for all-to-all chemical comparisons considering both binary fingerprints and floating point descriptors as molecule representation. Subsequently, we adopt optimization techniques to minimize global memory accesses and to further improve e_ciency. We test the proposed algorithm on different experimental setups, a laptop with a low-end GPU and a desktop with a more performant GPU. In the former case, we obtain a 4-to-6-fold speed-up over a single-core implementation for fingerprints and a 4-to-7-fold speed-up for descriptors. In the latter case, we respectively obtain a 195-to-206-fold speed-up and a 100-to-328-fold speed-up.en_US
dc.description.sponsorshipNational Institutes of Health (U.S.) (grant GM079804)en_US
dc.description.sponsorshipNational Institutes of Health (U.S.) (grant GM086145)en_US
dc.language.isoen_US
dc.publisherElsevier B.V.en_US
dc.relation.isversionofhttp://dx.doi.org/10.1016/j.procs.2011.04.219en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/en_US
dc.sourceElsevieren_US
dc.titleGPU-accelerated Chemical Similarity Assessment for Large Scale Databasesen_US
dc.typeArticleen_US
dc.identifier.citationMaggioni, Marco, Marco Domenico Santambrogio, and Jie Liang. “GPU-Accelerated Chemical Similarity Assessment for Large Scale Databases.” Procedia Computer Science 4 (2011): 2007–2016. © 2011 Elsevier B.V.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.mitauthorSantambrogio, Marco Domenicoen_US
dc.relation.journalProcedia Computer Scienceen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsMaggioni, Marco; Santambrogio, Marco Domenico; Liang, Jieen_US
mit.licensePUBLISHER_CCen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record