Show simple item record

dc.contributor.advisorLalana Kagal.en_US
dc.contributor.authorDas, Sourav.en_US
dc.contributor.otherMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.en_US
dc.date.accessioned2020-03-24T15:35:50Z
dc.date.available2020-03-24T15:35:50Z
dc.date.copyright2019en_US
dc.date.issued2019en_US
dc.identifier.urihttps://hdl.handle.net/1721.1/124239
dc.descriptionThis electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.en_US
dc.descriptionThesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2019en_US
dc.descriptionCataloged from student-submitted PDF version of thesis.en_US
dc.descriptionIncludes bibliographical references (pages 67-68).en_US
dc.description.abstractDefined as undesirable effects of a medication that occur during or after usual clinical use, Adverse Drug Reactions (ADRs) pose a major health risk and result in the hospitalization of millions of patients each year. While pre-marketing clinical trials evaluate the safety and efficacy of a new drug, post-marketing surveillance identifies and monitors ADRs that were not previously identified during trials. Traditionally, most approaches tend to focus on ADR detection in the post-marketing phase. Also current approaches mostly use supervised machine learning, requiring significant preprocessing of the data and feature engineering. I developed a customizable framework based on unsupervised learning that allows users to run prediction tasks on different types of labeled graph data. The framework first creates a knowledge graph from the data and then uses an unsupervised algorithm to create embeddings (vector representations) of the nodes in the knowledge graph, and finally runs the prediction task. The framework enables an embedding to be learned for any newly added node as long as it is connected with the other nodes, and users can create embeddings for any pre-marketed drug as long as its related drug attributes are present in the knowledge graph. Using DrugBank and FAERS, I created a knowledge graph of drugs and drug attributes. To emulate drugs in the pre-marketing stage, I removed all the drug-ADR edges in the test dataset. Then, I experimented with different parameters of the node embedding algorithm and three different classifiers namely MLP, KNN and random forest. The models were trained to predict 9 different ADR associations for any drug, and our results showed that the MLP classifier was the best model with an AUROC score of 0.79, which is comparable to existing approaches but with much greater customizability. This approach has potential to improve how ADRs are predicted and allow them to be detected at a far earlier stage thus improving patient safetyen_US
dc.description.statementofresponsibilityby Sourav Das.en_US
dc.format.extent68 pagesen_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsMIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectElectrical Engineering and Computer Science.en_US
dc.titlePredicting unknown adverse drug reactions using an unsupervised node embedding algorithmen_US
dc.typeThesisen_US
dc.description.degreeM. Eng.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.identifier.oclc1144999394en_US
dc.description.collectionM.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Scienceen_US
dspace.imported2020-03-24T15:35:49Zen_US
mit.thesis.degreeMasteren_US
mit.thesis.departmentEECSen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record