Show simple item record

dc.contributor.authorSeverson, Kristen
dc.contributor.authorMolaro, Mark
dc.contributor.authorBraatz, Richard D
dc.date.accessioned2020-06-02T18:39:46Z
dc.date.available2020-06-02T18:39:46Z
dc.date.issued2017-07
dc.date.submitted2017-05
dc.identifier.issn2227-9717
dc.identifier.urihttps://hdl.handle.net/1721.1/125630
dc.description.abstractDatasets with missing values arising from causes such as sensor failure, inconsistent sampling rates, and merging data from different systems are common in the process industry. Methods for handling missing data typically operate during data pre-processing, but can also occur during model building. This article considers missing data within the context of principal component analysis (PCA), which is a method originally developed for complete data that has widespread industrial application in multivariate statistical process control. Due to the prevalence of missing data and the success of PCA for handling complete data, several PCA algorithms that can act on incomplete data have been proposed. Here, algorithms for applying PCA to datasets with missing values are reviewed. A case study is presented to demonstrate the performance of the algorithms and suggestions are made with respect to choosing which algorithm is most appropriate for particular settings. An alternating algorithm based on the singular value decomposition achieved the best results in the majority of test cases involving process datasets. Keywords: principal component analysis; missing data; process data analytics; chemometrics; machine learning; multivariable statistical process control; process monitoring; Tennessee Eastman problemen_US
dc.language.isoen
dc.publisherMDPI AGen_US
dc.relation.isversionofhttp://dx.doi.org/10.3390/pr5030038en_US
dc.rightsCreative Commons Attribution 4.0 International licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceMDPIen_US
dc.titlePrincipal Component Analysis of Process Datasets with Missing Valuesen_US
dc.typeArticleen_US
dc.identifier.citationSeverson, Kristen et al. “Principal Component Analysis of Process Datasets with Missing Values.” Processes 5, 4 (July 2017): 38. © 2017 The Authorsen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemical Engineeringen_US
dc.relation.journalProcessesen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2019-08-14T18:20:09Z
dspace.date.submission2019-08-14T18:20:10Z
mit.journal.volume5en_US
mit.journal.issue4en_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record