Show simple item record

dc.contributor.authorEvans, Ethan D
dc.contributor.authorDuvallet, Claire
dc.contributor.authorChu, Nathaniel D
dc.contributor.authorOberst, Michael K
dc.contributor.authorMurphy, Michael A
dc.contributor.authorRockafellow, Isaac
dc.contributor.authorSontag, David
dc.contributor.authorAlm, Eric J
dc.date.accessioned2021-10-27T19:56:29Z
dc.date.available2021-10-27T19:56:29Z
dc.date.issued2020
dc.identifier.urihttps://hdl.handle.net/1721.1/133758
dc.description.abstract© 2020, The Author(s). Biofluid-based metabolomics has the potential to provide highly accurate, minimally invasive diagnostics. Metabolomics studies using mass spectrometry typically reduce the high-dimensional data to only a small number of statistically significant features, that are often chemically identified—where each feature corresponds to a mass-to-charge ratio, retention time, and intensity. This practice may remove a substantial amount of predictive signal. To test the utility of the complete feature set, we train machine learning models for health state-prediction in 35 human metabolomics studies, representing 148 individual data sets. Models trained with all features outperform those using only significant features and frequently provide high predictive performance across nine health state categories, despite disparate experimental and disease contexts. Using only non-significant features it is still often possible to train models and achieve high predictive performance, suggesting useful predictive signal. This work highlights the potential for health state diagnostics using all metabolomics features with data-driven analysis.
dc.language.isoen
dc.publisherSpringer Science and Business Media LLC
dc.relation.isversionof10.1038/s41598-020-74823-1
dc.rightsCreative Commons Attribution 4.0 International license
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.sourceScientific Reports
dc.titlePredicting human health from biofluid-based metabolomics using machine learning
dc.typeArticle
dc.contributor.departmentMassachusetts Institute of Technology. Department of Biological Engineering
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.relation.journalScientific Reports
dc.eprint.versionFinal published version
dc.type.urihttp://purl.org/eprint/type/JournalArticle
eprint.statushttp://purl.org/eprint/status/PeerReviewed
dc.date.updated2021-02-02T17:20:14Z
dspace.orderedauthorsEvans, ED; Duvallet, C; Chu, ND; Oberst, MK; Murphy, MA; Rockafellow, I; Sontag, D; Alm, EJ
dspace.date.submission2021-02-02T17:20:21Z
mit.journal.volume10
mit.journal.issue1
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Needed


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record