Show simple item record

dc.contributor.advisorManolis Kellis.en_US
dc.contributor.authorShea, Andrew(Andrew L.)en_US
dc.contributor.otherMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.en_US
dc.date.accessioned2020-09-15T22:02:00Z
dc.date.available2020-09-15T22:02:00Z
dc.date.copyright2020en_US
dc.date.issued2020en_US
dc.identifier.urihttps://hdl.handle.net/1721.1/127522
dc.descriptionThesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, May, 2020en_US
dc.descriptionCataloged from the official PDF of thesis.en_US
dc.descriptionIncludes bibliographical references (pages 61-66).en_US
dc.description.abstractElectronic health records (EHR) and their wealth of patient health information present new opportunities for understanding relationships between patients and their conditions. However, EHR data sparsity, quality, and accessibility present various computational challenges. To address these challenges, we apply spectral clustering and variational autoencoders to obtain compact patient representations and clusters from EHR in an unsupervised manner. We apply these methods to the MIMIC dataset, from which we only use ICD-9 diagnostic codes to ensure data accessibility. After obtaining clusters, we conduct high-resolution analysis by examining the 5 most frequent phenotypes within each cluster. We then conduct low-resolution analysis by examining the distribution of phenotypes within each cluster, examining the relationships amongst the most prevalent phenotypes in each cluster by constructing a cluster network, and comparing our findings to existing medical literature. While preliminary, these results suggest that learning from sparse EHR data is sufficient for uncovering associations between conditions and diseases.en_US
dc.description.statementofresponsibilityby Andrew Shea.en_US
dc.format.extent66 pagesen_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsMIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectElectrical Engineering and Computer Science.en_US
dc.titlePatient clustering using electronic medical recordsen_US
dc.typeThesisen_US
dc.description.degreeM. Eng.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.identifier.oclc1193029579en_US
dc.description.collectionM.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Scienceen_US
dspace.imported2020-09-15T22:02:00Zen_US
mit.thesis.degreeMasteren_US
mit.thesis.departmentEECSen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record