Show simple item record

dc.contributor.authorBlanca, Antonio
dc.contributor.authorChen, Zongchen
dc.contributor.authorStefankovic, Daniel
dc.contributor.authorVigoda, Eric
dc.date.accessioned2024-09-04T17:26:49Z
dc.date.available2024-09-04T17:26:49Z
dc.identifier.issn1549-6325
dc.identifier.urihttps://hdl.handle.net/1721.1/156665
dc.description.abstractWe study the identity testing problem for high-dimensional distributions. Given as input an explicit distribution , an > 0, and access to sampling oracle(s) for a hidden distribution , the goal in identity testing is to distinguish whether the two distributions and are identical or are at least -far apart. When there is only access to full samples from the hidden distribution , it is known that exponentially many samples (in the dimension) may be needed for identity testing, and hence previous works have studied identity testing with additional access to various “conditional” sampling oracles. We consider a significantly weaker conditional sampling oracle, which we call the Coordinate Oracle, and provide a computational and statistical characterization of the identity testing problem in this new model. We prove that if an analytic property known as approximate tensorization of entropy holds for an -dimensional visible distribution , then there is an efficient identity testing algorithm for any hidden distribution using e(/) queries to the Coordinate Oracle. Approximate tensorization of entropy is a pertinent condition as recent works have established it for a large class of high-dimensional distributions. We also prove a computational phase transition: for a well-studied class of -dimensional distributions, specifically sparse antiferromagnetic Ising models over {+1, −1} , we show that in the regime where approximate tensorization of entropy fails, there is no efficient identity testing algorithm unless RP = NP. We complement our results with a matching Ω(/) statistical lower bound for the sample complexity of identity testing in the Coordinate Oracle model.en_US
dc.publisherACMen_US
dc.relation.isversionofhttp://dx.doi.org/10.1145/3686799en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceAssociation for Computing Machineryen_US
dc.titleComplexity of High-Dimensional Identity Testing with Coordinate Conditional Samplingen_US
dc.typeArticleen_US
dc.identifier.citationAntonio Blanca, Zongchen Chen, Daniel Štefankovič, and Eric Vigoda. 2024. Complexity of High-Dimensional Identity Testing with Coordinate Conditional Sampling. ACM Trans. Algorithms Just Accepted (August 2024).en_US
dc.relation.journalACM Transactions on Algorithmsen_US
dc.identifier.mitlicensePUBLISHER_CC
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2024-09-01T07:45:14Z
dc.language.rfc3066en
dc.rights.holderThe author(s)
dspace.date.submission2024-09-01T07:45:14Z
mit.licensePUBLISHER_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record