Show simple item record

dc.contributor.authorSontag, David Alexander
dc.date.accessioned2021-04-27T17:45:25Z
dc.date.available2021-04-27T17:45:25Z
dc.date.issued2017-08
dc.identifier.issn2640-3498
dc.identifier.urihttps://hdl.handle.net/1721.1/130531
dc.description.abstractWe consider multi-class classification where the predictor has a hierarchical structure that allows for a very large number of labels both at train and test time. The predictive power of such models can heavily depend on the structure of the tree, and although past work showed how to learn the tree structure, it expected that the feature vectors remained static. We provide a novel algorithm to simultaneously perform representation learning for the input data and learning of the hierarchical predictor. Our approach optimizes an objective function which favors balanced and easily-separable multi-way node partitions. We theoretically analyze this objective, showing that it gives rise to a boosting style property and a bound on classification error. We next show how to extend the algorithm to conditional density estimation. We empirically validate both variants of the algorithm on text classification and language modeling, respectively, and show that they compare favorably to common baselines in terms of accuracy and running time.en_US
dc.language.isoen
dc.publisherInternational Machine Learning Societyen_US
dc.relation.isversionofhttp://proceedings.mlr.press/v70/en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceProceedings of Machine Learning Researchen_US
dc.titleSimultaneous learning of trees and representations for extreme classification and density estimationen_US
dc.typeArticleen_US
dc.identifier.citationJernite, Yacine et al. “Simultaneous learning of trees and representations for extreme classification and density estimation.” Paper in the Proceedings of Machine Learning Research, 70, 34th International Conference on Machine Learning, Sydney, Australia, August 6-11 2017, International Machine Learning Society © 2017 The Author(s)en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.relation.journalProceedings of Machine Learning Researchen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2021-04-06T17:50:50Z
dspace.orderedauthorsJernite, Y; Choromanska, A; Sontag, Den_US
dspace.date.submission2021-04-06T17:50:51Z
mit.journal.volume4en_US
mit.licensePUBLISHER_POLICY
mit.metadata.statusAuthority Work and Publication Information Needed


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record