DSpace@MIT (MIT Libraries)


Comparison of generality based algorithm variants for automatic taxonomy generation

Author(s)
Madnick, Stuart E.; Henschel, Andreas; Wachter, Thomas; Woon, Wei Lee
Terms of use
Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
Metadata
Abstract
We compare a family of algorithms for the automatic generation of taxonomies, obtained by adapting the Heymann algorithm in various ways. The core algorithm determines the generality of terms and iteratively inserts them into a growing taxonomy. Variants are created by altering how, and how often, the generality of terms is calculated. We analyse the performance and complexity of the variants, combined with a systematic threshold evaluation, on a set of seven manually created benchmark sets. We find that betweenness centrality calculated on unweighted similarity graphs often performs best, but it requires threshold fine-tuning and is computationally more expensive than closeness centrality. Finally, we show how an entropy-based filter can lead to more precise taxonomies.
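The core loop described in the abstract (score each term's generality, then insert terms one by one into a growing taxonomy) can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes cosine similarity over term-document counts, an unweighted similarity graph obtained by thresholding, and closeness centrality as the generality measure (the cheaper of the two measures the abstract mentions). The function names, the two threshold values, and the toy data are all hypothetical.

```python
from collections import deque
from math import sqrt


def cosine(u, v):
    """Cosine similarity of two sparse count vectors (dicts)."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm = sqrt(sum(x * x for x in u.values())) * sqrt(sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0


def closeness(graph, source):
    """Closeness centrality on an unweighted graph, via breadth-first search.

    Scaled by the fraction of reachable nodes so that terms in small,
    isolated components are not favoured.
    """
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbour in graph[node]:
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    total = sum(dist.values())
    reached = len(dist) - 1
    if total == 0:
        return 0.0
    return (reached / (len(graph) - 1)) * (reached / total)


def build_taxonomy(vectors, graph_threshold=0.3, taxonomy_threshold=0.3, root="ROOT"):
    """Return a {term: parent} mapping; thresholds are assumed, untuned values."""
    terms = sorted(vectors)
    sim = {(a, b): cosine(vectors[a], vectors[b]) for a in terms for b in terms if a != b}
    # Unweighted similarity graph: keep an edge when similarity passes the threshold.
    graph = {t: [o for o in terms if o != t and sim[(t, o)] >= graph_threshold] for t in terms}
    # Most general (most central) terms are inserted first; ties break alphabetically.
    order = sorted(terms, key=lambda t: (-closeness(graph, t), t))
    parent = {}
    for term in order:
        # Attach to the most similar term already in the taxonomy, if similar enough.
        best = max(parent, key=lambda p: sim[(term, p)], default=None)
        if best is not None and sim[(term, best)] >= taxonomy_threshold:
            parent[term] = best
        else:
            parent[term] = root
    return parent


# Toy data (hypothetical): term -> {document id: occurrence count}
example = {
    "animal": {"d1": 1, "d2": 1, "d3": 1, "d4": 1},
    "dog": {"d1": 1, "d2": 1},
    "cat": {"d3": 1, "d4": 1},
    "poodle": {"d1": 1},
}
taxonomy = build_taxonomy(example)
print(taxonomy)
```

In this sketch generality is computed once, before any insertion. Variants of the kind the abstract compares would swap the centrality function or recompute it at a different frequency, and both thresholds would need tuning per data set.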
Description
Supplementary material can be found at http://ssm-vm011.mit.edu/henschel/IIT09/.
Date issued
2010-04
URI
http://hdl.handle.net/1721.1/59371
Department
Sloan School of Management
Journal
International Conference on Innovations in Information Technology, 2009. IIT '09.
Publisher
Institute of Electrical and Electronics Engineers
Citation
Henschel, A. et al. “Comparison of generality based algorithm variants for automatic taxonomy generation.” Innovations in Information Technology, 2009. IIT '09. International Conference on. 2009. 160-164. © Copyright 2010 IEEE
Version: Final published version
Other identifiers
INSPEC Accession Number: 11227263
ISBN
978-1-4244-5698-7

Collections
  • MIT Open Access Articles
