Show simple item record

dc.contributor.authorHsu, Yuin-Jen David
dc.date.accessioned2019-02-14T19:26:40Z
dc.date.available2019-02-14T19:26:40Z
dc.date.issued2015-09
dc.date.submitted2015-08
dc.identifier.issn0306-2619
dc.identifier.urihttp://hdl.handle.net/1721.1/120459
dc.description.abstractClustering methods are often used to model energy consumption for two reasons. First, clustering is often used to process data and to improve the predictive accuracy of subsequent energy models. Second, stable clusters that are reproducible with respect to non-essential changes can be used to group, target, and interpret observed subjects. However, it is well known that clustering methods are highly sensitive to the choice of algorithms and variables. This can lead to misleading assessments of predictive accuracy and mis-interpretation of clusters in policymaking. This paper therefore introduces two methods to the modeling of energy consumption in buildings: clusterwise regression, also known as latent class regression, which integrates clustering and regression simultaneously; and cluster validation methods to measure stability. Using a large dataset of multifamily buildings in New York City, clusterwise regression is compared to common two-stage algorithms that use K-means and model-based clustering with linear regression. Predictive accuracy is evaluated using 20-fold cross validation, and the stability of the perturbed clusters is measured using the Jaccard coefficient. These results show that there seems to be an inherent tradeoff between prediction accuracy and cluster stability. This paper concludes by discussing which clustering methods may be appropriate for different analytical purposes. Keywords: Cluster-wise regression; Buildings; Energy consumption; Prediction accuracy; Cluster stability; Latent class regressionen_US
dc.description.sponsorshipUnited States. Department of Energy (Grant DE-EE0004261)en_US
dc.publisherElsevieren_US
dc.relation.isversionofhttp://dx.doi.org/10.1016/j.apenergy.2015.08.126en_US
dc.rightsCreative Commons Attribution 4.0 International licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceElsevieren_US
dc.titleComparison of integrated clustering methods for accurate and stable prediction of building energy consumption dataen_US
dc.typeArticleen_US
dc.identifier.citationHsu, David. “Comparison of Integrated Clustering Methods for Accurate and Stable Prediction of Building Energy Consumption Data.” Applied Energy 160 (December 2015): 153–163 © 2015 The Authoren_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Urban Studies and Planningen_US
dc.contributor.mitauthorHsu, Yuin-Jen David
dc.relation.journalApplied Energyen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2019-01-22T15:50:06Z
dspace.orderedauthorsHsu, Daviden_US
dspace.embargo.termsNen_US
dc.identifier.orcidhttps://orcid.org/0000-0003-1108-9656
mit.licensePUBLISHER_CCen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record