MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Open Access Articles
  • MIT Open Access Articles
  • View Item
  • DSpace@MIT Home
  • MIT Open Access Articles
  • MIT Open Access Articles
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Tuning Database-Friendly Random Projection Matrices for Improved Distance Preservation on Specific Data

Author(s)
López-Sánchez, Daniel; de Bodt, Cyril; Lee, John A.; Arrieta, Angélica G.; Corchado, Juan M.
Thumbnail
Download10489_2021_Article_2626.pdf (18.96Mb)
Publisher with Creative Commons License

Publisher with Creative Commons License

Creative Commons Attribution

Terms of use
Creative Commons Attribution https://creativecommons.org/licenses/by/4.0/
Metadata
Show full item record
Abstract
Random Projection is one of the most popular and successful dimensionality reduction algorithms for large volumes of data. However, given its stochastic nature, different initializations of the projection matrix can lead to very different levels of performance. This paper presents a guided random search algorithm to mitigate this problem. The proposed method uses a small number of training data samples to iteratively adjust a projection matrix, improving its performance on similarly distributed data. Experimental results show that projection matrices generated with the proposed method result in a better preservation of distances between data samples. Conveniently, this is achieved while preserving the database-friendliness of the projection matrix, as it remains sparse and comprised exclusively of integers after being tuned with our algorithm. Moreover, running the proposed algorithm on a consumer-grade CPU requires only a few seconds.
Date issued
2021-07
URI
https://hdl.handle.net/1721.1/133016
Department
Massachusetts Institute of Technology. Media Laboratory
Journal
Applied Intelligence
Publisher
Springer US
Citation
López-Sánchez, D., de Bodt, C., Lee, J.A. et al. Tuning Database-Friendly Random Projection Matrices for Improved Distance Preservation on Specific Data. Appl Intell (2021)
Version: Final published version
ISSN
1573-7497
0924-669X

Collections
  • MIT Open Access Articles

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.