| dc.contributor.author | Madnick, Stuart E. | |
| dc.contributor.author | Henschel, Andreas | |
| dc.contributor.author | Wachter, Thomas | |
| dc.contributor.author | Woon, Wei Lee | |
| dc.date.accessioned | 2010-10-15T15:19:21Z | |
| dc.date.available | 2010-10-15T15:19:21Z | |
| dc.date.issued | 2010-04 | |
| dc.date.submitted | 2009-12 | |
| dc.identifier.isbn | 978-1-4244-5698-7 | |
| dc.identifier.other | INSPEC Accession Number: 11227263 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/59371 | |
| dc.description | Supplementary Material can be found on http://
ssm-vm011.mit.edu/henschel/IIT09/. | en_US |
| dc.description.abstract | We compare a family of algorithms for the automatic generation of taxonomies by adapting the Heymann-algorithm in various ways. The core algorithm determines the generality of terms and iteratively inserts them in a growing taxonomy. Variants of the algorithm are created by altering the way and the frequency, generality of terms is calculated. We analyse the performance and the complexity of the variants combined with a systematic threshold evaluation on a set of seven manually created benchmark sets. As a result, betweenness centrality calculated on unweighted similarity graphs often performs best but requires threshold fine-tuning and is computationally more expensive than closeness centrality. Finally, we show how an entropy-based filter can lead to more precise taxonomies. | en_US |
| dc.language.iso | en_US | |
| dc.publisher | Institute of Electrical and Electronics Engineers | en_US |
| dc.relation.isversionof | http://dx.doi.org/10.1109/IIT.2009.5413365 | en_US |
| dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
| dc.source | IEEE | en_US |
| dc.title | Comparison of generality based algorithm variants for automatic taxonomy generation | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Henschel, A. et al. “Comparison of generality based algorithm variants for automatic taxonomy generation.” Innovations in Information Technology, 2009. IIT '09. International Conference on. 2009. 160-164. © Copyright 2010 IEEE | en_US |
| dc.contributor.department | Sloan School of Management | en_US |
| dc.contributor.approver | Madnick, Stuart E. | |
| dc.contributor.mitauthor | Madnick, Stuart E. | |
| dc.relation.journal | International Conference on Innovations in Information Technology, 2009. IIT '09. | en_US |
| dc.eprint.version | Final published version | en_US |
| dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
| eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
| dspace.orderedauthors | Henschel, Andreas; Woon, Wei Lee; Wachter, Thomas; Madnick, Stuart | en |
| dc.identifier.orcid | https://orcid.org/0000-0001-9240-2573 | |
| mit.license | PUBLISHER_POLICY | en_US |
| mit.metadata.status | Complete | |