Show simple item record

dc.contributor.authorDesrochers, Theresa M.
dc.contributor.authorGoodman, Noah D.
dc.contributor.authorGraybiel, Ann M.
dc.contributor.authorJin, Dezhe Z.
dc.date.accessioned2012-03-30T18:17:53Z
dc.date.available2012-03-30T18:17:53Z
dc.date.issued2010-09
dc.date.submitted2010-08
dc.identifier.issn0027-8424
dc.identifier.issn1091-6490
dc.identifier.urihttp://hdl.handle.net/1721.1/69896
dc.description.abstractHabits and rituals are expressed universally across animal species. These behaviors are advantageous in allowing sequential behaviors to be performed without cognitive overload, and appear to rely on neural circuits that are relatively benign but vulnerable to takeover by extreme contexts, neuropsychiatric sequelae, and processes leading to addiction. Reinforcement learning (RL) is thought to underlie the formation of optimal habits. However, this theoretic formulation has principally been tested experimentally in simple stimulus-response tasks with relatively few available responses. We asked whether RL could also account for the emergence of habitual action sequences in realistically complex situations in which no repetitive stimulus-response links were present and in which many response options were present. We exposed naïve macaque monkeys to such experimental conditions by introducing a unique free saccade scan task. Despite the highly uncertain conditions and no instruction, the monkeys developed a succession of stereotypical, self-chosen saccade sequence patterns. Remarkably, these continued to morph for months, long after session-averaged reward and cost (eye movement distance) reached asymptote. Prima facie, these continued behavioral changes appeared to challenge RL. However, trial-by-trial analysis showed that pattern changes on adjacent trials were predicted by lowered cost, and RL simulations that reduced the cost reproduced the monkeys’ behavior. Ultimately, the patterns settled into stereotypical saccade sequences that minimized the cost of obtaining the reward on average. These findings suggest that brain mechanisms underlying the emergence of habits, and perhaps unwanted repetitive behaviors in clinical disorders, could follow RL algorithms capturing extremely local explore/exploit tradeoffs.en_US
dc.description.sponsorshipNational Eye Institute (Grant EY012848)en_US
dc.description.sponsorshipUnited States. Office of Naval Research (Grant N000014-07-10903)en_US
dc.description.sponsorshipNational Defense Science and Engineering Graduate Fellowshipen_US
dc.description.sponsorshipMcGovern Institute for Brain Research at MIT (Friends of the McGovern Fellowship)en_US
dc.description.sponsorshipAlfred P. Sloan Foundation (Sloan Research Fellowship)en_US
dc.language.isoen_US
dc.publisherProceedings of the National Academy of Sciences (PNAS)en_US
dc.relation.isversionofhttp://dx.doi.org/10.1073/pnas.1013470107en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourcePNASen_US
dc.titleOptimal habits can develop spontaneously through sensitivity to local costen_US
dc.typeArticleen_US
dc.identifier.citationDesrochers, T. M. et al. “From the Cover: Optimal Habits Can Develop Spontaneously Through Sensitivity to Local Cost.” Proceedings of the National Academy of Sciences 107.47 (2010): 20512–20517. Web. 30 Mar. 2012.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Brain and Cognitive Sciencesen_US
dc.contributor.departmentMcGovern Institute for Brain Research at MITen_US
dc.contributor.approverGraybiel, Ann M.
dc.contributor.mitauthorDesrochers, Theresa M.
dc.contributor.mitauthorGoodman, Noah D.
dc.contributor.mitauthorGraybiel, Ann M.
dc.relation.journalProceedings of the National Academy of Sciencesen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsDesrochers, T. M.; Jin, D. Z.; Goodman, N. D.; Graybiel, A. M.en
dc.identifier.orcidhttps://orcid.org/0000-0002-7223-0556
dc.identifier.orcidhttps://orcid.org/0000-0002-4326-7720
mit.licensePUBLISHER_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record