| dc.contributor.author | Shaul, Hayim | |
| dc.contributor.author | Feldman, Dan | |
| dc.contributor.author | Rus, Daniela | |
| dc.date.accessioned | 2021-10-27T19:56:18Z | |
| dc.date.available | 2021-10-27T19:56:18Z | |
| dc.date.issued | 2020 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/133710 | |
| dc.description.abstract | <jats:title>Abstract</jats:title>
<jats:p>The <jats:italic>k</jats:italic>-nearest neighbors (<jats:italic>k</jats:italic>NN) classifier predicts a class of a query, <jats:italic>q</jats:italic>, by taking the majority class of its <jats:italic>k</jats:italic> neighbors in an existing (already classified) database, <jats:italic>S</jats:italic>. In secure <jats:italic>k</jats:italic>NN, <jats:italic>q</jats:italic> and <jats:italic>S</jats:italic> are owned by two different parties and <jats:italic>q</jats:italic> is classified without sharing data. In this work we present a classifier based on <jats:italic>k</jats:italic>NN, that is more efficient to implement with homomorphic encryption (HE). The efficiency of our classifier comes from a relaxation we make to consider <jats:italic>κ</jats:italic> nearest neighbors for <jats:italic>κ ≈k</jats:italic> with probability that increases as the statistical distance between Gaussian and the distribution of the distances from <jats:italic>q</jats:italic> to <jats:italic>S</jats:italic> decreases. We call our classifier <jats:italic>k</jats:italic>-ish Nearest Neighbors (<jats:italic>k</jats:italic>-ish NN). For the implementation we introduce <jats:italic>double-blinded coin-toss</jats:italic> where the bias and output of the toss are encrypted. We use it to approximate the average and variance of the distances from <jats:italic>q</jats:italic> to <jats:italic>S</jats:italic> in a scalable circuit whose depth is independent of |<jats:italic>S</jats:italic>|. We believe these to be of independent interest. We implemented our classifier in an open source library based on HElib and tested it on a breast tumor database. Our classifier has accuracy and running time comparable to current state of the art (non-HE) MPC solution that have better running time but worse communication complexity. It also has communication complexity similar to naive HE implementation that have worse running time.</jats:p> | |
| dc.language.iso | en | |
| dc.publisher | Walter de Gruyter GmbH | |
| dc.relation.isversionof | 10.2478/POPETS-2020-0045 | |
| dc.rights | Creative Commons Attribution-NonCommercial-NoDerivs License | |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.source | De Gruyter | |
| dc.title | Secure k -ish Nearest Neighbors Classifier | |
| dc.type | Article | |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
| dc.relation.journal | Proceedings on Privacy Enhancing Technologies | |
| dc.eprint.version | Final published version | |
| dc.type.uri | http://purl.org/eprint/type/JournalArticle | |
| eprint.status | http://purl.org/eprint/status/PeerReviewed | |
| dc.date.updated | 2021-04-02T18:33:09Z | |
| dspace.orderedauthors | Shaul, H; Feldman, D; Rus, D | |
| dspace.date.submission | 2021-04-02T18:33:10Z | |
| mit.journal.volume | 2020 | |
| mit.journal.issue | 3 | |
| mit.license | PUBLISHER_CC | |
| mit.metadata.status | Authority Work and Publication Information Needed | |