Show simple item record

dc.contributor.authorCutting, Douglass R
dc.contributor.authorKarger, David R
dc.contributor.authorPedersen, Jan O
dc.contributor.authorTukey, John W
dc.date.accessioned2021-10-27T20:09:28Z
dc.date.available2021-10-27T20:09:28Z
dc.date.issued2017
dc.identifier.urihttps://hdl.handle.net/1721.1/134850
dc.description.abstract<jats:p>Document clustering has not been well received as an information retrieval tool. Objections to its use fall into two main categories: first, that clustering is too slow for large corpora (with running time often quadratic in the number of documents); and second, that clustering does not appreciably improve retrieval.</jats:p> <jats:p>We argue that these problems arise only when clustering is used in an attempt to improve conventional search techniques. However, looking at clustering as an information access tool in its own right obviates these objections, and provides a powerful new access paradigm. We present a document browsing technique that employs docum-ent clustering as its primary operation. We also present fast (linear time) clustering algorithm.</jats:p>
dc.language.isoen
dc.publisherAssociation for Computing Machinery (ACM)
dc.relation.isversionof10.1145/3130348.3130362
dc.rightsCreative Commons Attribution-Noncommercial-Share Alike
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/
dc.sourceOther repository
dc.titleScatter/Gather: A Cluster-based Approach to Browsing Large Document Collections
dc.typeArticle
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.relation.journalACM SIGIR Forum
dc.eprint.versionAuthor's final manuscript
dc.type.urihttp://purl.org/eprint/type/ConferencePaper
eprint.statushttp://purl.org/eprint/status/NonPeerReviewed
dc.date.updated2019-06-04T17:16:09Z
dspace.orderedauthorsCutting, DR; Karger, DR; Pedersen, JO; Tukey, JW
dspace.date.submission2019-06-04T17:16:10Z
mit.journal.volume51
mit.journal.issue2
mit.metadata.statusAuthority Work and Publication Information Needed


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record