Show simple item record

dc.contributor.authorCocco, S.
dc.contributor.authorDe Leonardis, E.
dc.contributor.authorMonasson, R.
dc.contributor.authorBarton, John P.
dc.date.accessioned2015-06-17T14:53:26Z
dc.date.available2015-06-17T14:53:26Z
dc.date.issued2014-07
dc.date.submitted2014-06
dc.identifier.issn1539-3755
dc.identifier.issn1550-2376
dc.identifier.urihttp://hdl.handle.net/1721.1/97450
dc.description.abstractThe mean-field (MF) approximation offers a simple, fast way to infer direct interactions between elements in a network of correlated variables, a common, computationally challenging problem with practical applications in fields ranging from physics and biology to the social sciences. However, MF methods achieve their best performance with strong regularization, well beyond Bayesian expectations, an empirical fact that is poorly understood. In this work, we study the influence of pseudocount and L[subscript 2]-norm regularization schemes on the quality of inferred Ising or Potts interaction networks from correlation data within the MF approximation. We argue, based on the analysis of small systems, that the optimal value of the regularization strength remains finite even if the sampling noise tends to zero, in order to correct for systematic biases introduced by the MF approximation. Our claim is corroborated by extensive numerical studies of diverse model systems and by the analytical study of the m-component spin model for large but finite m. Additionally, we find that pseudocount regularization is robust against sampling noise and often outperforms L[subscript 2]-norm regularization, particularly when the underlying network of interactions is strongly heterogeneous. Much better performances are generally obtained for the Ising model than for the Potts model, for which only couplings incoming onto medium-frequency symbols are reliably inferred.en_US
dc.description.sponsorshipFrance. Agence nationale de la recherche (Coevstat Project Grant ANR-13-BS04-0012-01)en_US
dc.language.isoen_US
dc.publisherAmerican Physical Societyen_US
dc.relation.isversionofhttp://dx.doi.org/10.1103/PhysRevE.90.012132en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceAmerican Physical Societyen_US
dc.titleLarge pseudocounts and L[subscript 2]-norm penalties are necessary for the mean-field inference of Ising and Potts modelsen_US
dc.typeArticleen_US
dc.identifier.citationBarton, J. P., S. Cocco, E. De Leonardis, and R. Monasson. “Large Pseudocounts and L[subscript 2]-Norm Penalties Are Necessary for the Mean-Field Inference of Ising and Potts Models.” Phys. Rev. E 90, no. 1 (July 2014). © 2014 American Physical Societyen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Chemical Engineeringen_US
dc.contributor.departmentRagon Institute of MGH, MIT and Harvarden_US
dc.contributor.mitauthorBarton, John P.en_US
dc.relation.journalPhysical Review Een_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsBarton, J. P.; Cocco, S.; De Leonardis, E.; Monasson, R.en_US
dc.identifier.orcidhttps://orcid.org/0000-0003-1467-421X
mit.licensePUBLISHER_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record