Show simple item record

dc.contributor.authorCollell Talleda, Guillem
dc.contributor.authorPrelec, Drazen
dc.contributor.authorPatil, Kaustubh R
dc.date.accessioned2019-02-28T18:46:09Z
dc.date.available2019-02-28T18:46:09Z
dc.date.issued2017-09
dc.date.submitted2017-07
dc.identifier.issn0925-2312
dc.identifier.urihttp://hdl.handle.net/1721.1/120577
dc.description.abstractClass imbalance presents a major hurdle in the application of classification methods. A commonly taken approach is to learn ensembles of classifiers using rebalanced data. Examples include bootstrap averaging (bagging) combined with either undersampling or oversampling of the minority class examples. However, rebalancing methods entail asymmetric changes to the examples of different classes, which in turn can introduce their own biases. Furthermore, these methods often require specifying the performance measure of interest a priori, i.e., before learning. An alternative is to employ the threshold moving technique, which applies a threshold to the continuous output of a model, offering the possibility to adapt to a performance measure a posteriori, i.e., a plug-in method. Surprisingly, little attention has been paid to this combination of a bagging ensemble and threshold-moving. In this paper, we study this combination and demonstrate its competitiveness. Contrary to the other resampling methods, we preserve the natural class distribution of the data resulting in well-calibrated posterior probabilities. Additionally, we extend the proposed method to handle multiclass data. We validated our method on binary and multiclass benchmark data sets by using both, decision trees and neural networks as base classifiers. We perform analyses that provide insights into the proposed method. Keywords: Imbalanced data; Binary classification; Multiclass classification; Bagging ensembles; Resampling; Posterior calibrationen_US
dc.description.sponsorshipBurroughs Wellcome Fund (Grant 103811AI)en_US
dc.publisherElsevier BVen_US
dc.relation.isversionofhttp://dx.doi.org/10.1016/J.NEUCOM.2017.08.035en_US
dc.rightsCreative Commons Attribution 4.0 International licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceElsevieren_US
dc.titleA simple plug-in bagging ensemble based on threshold-moving for classifying binary and multiclass imbalanced dataen_US
dc.typeArticleen_US
dc.identifier.citationCollell, Guillem et al. “A Simple Plug-in Bagging Ensemble Based on Threshold-Moving for Classifying Binary and Multiclass Imbalanced Data.” Neurocomputing 275 (January 2018): 330–340 © 2017 The Author(s)en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Brain and Cognitive Sciencesen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Economicsen_US
dc.contributor.departmentSloan School of Managementen_US
dc.contributor.mitauthorCollell Talleda, Guillem
dc.contributor.mitauthorPrelec, Drazen
dc.contributor.mitauthorPatil, Kaustubh R
dc.relation.journalNeurocomputingen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2019-02-27T18:04:13Z
dspace.orderedauthorsCollell, Guillem; Prelec, Drazen; Patil, Kaustubh R.en_US
dspace.embargo.termsNen_US
dc.identifier.orcidhttps://orcid.org/0000-0002-9507-5368
dc.identifier.orcidhttps://orcid.org/0000-0002-0289-5480
mit.licensePUBLISHER_CCen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record