Show simple item record

dc.contributor.authorEnglhardt, Zachary
dc.contributor.authorMa, Chengqian
dc.contributor.authorMorris, Margaret E.
dc.contributor.authorChang, Chun-Cheng
dc.contributor.authorXu, Xuhai "Orson"
dc.contributor.authorQin, Lianhui
dc.contributor.authorMcDuff, Daniel
dc.contributor.authorLiu, Xin
dc.contributor.authorPatel, Shwetak
dc.contributor.authorIyer, Vikram
dc.date.accessioned2024-06-06T16:40:01Z
dc.date.available2024-06-06T16:40:01Z
dc.date.issued2024-05-13
dc.identifier.issn2474-9567
dc.identifier.urihttps://hdl.handle.net/1721.1/155207
dc.description.abstractPassively collected behavioral health data from ubiquitous sensors could provide mental health professionals valuable insights into patient's daily lives, but such efforts are impeded by disparate metrics, lack of interoperability, and unclear correlations between the measured signals and an individual's mental health. To address these challenges, we pioneer the exploration of large language models (LLMs) to synthesize clinically relevant insights from multi-sensor data. We develop chain-of-thought prompting methods to generate LLM reasoning on how data pertaining to activity, sleep and social interaction relate to conditions such as depression and anxiety. We then prompt the LLM to perform binary classification, achieving accuracies of 61.1%, exceeding the state of the art. We find models like GPT-4 correctly reference numerical data 75% of the time. While we began our investigation by developing methods to use LLMs to output binary classifications for conditions like depression, we find instead that their greatest potential value to clinicians lies not in diagnostic classification, but rather in rigorous analysis of diverse self-tracking data to generate natural language summaries that synthesize multiple data streams and identify potential concerns. Clinicians envisioned using these insights in a variety of ways, principally for fostering collaborative investigation with patients to strengthen the therapeutic alliance and guide treatment. We describe this collaborative engagement, additional envisioned uses, and associated concerns that must be addressed before adoption in real-world contexts.en_US
dc.publisherAssociation for Computing Machineryen_US
dc.relation.isversionof10.1145/3659604en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceAssociation for Computing Machineryen_US
dc.titleFrom Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Modelsen_US
dc.typeArticleen_US
dc.identifier.citationEnglhardt, Zachary, Ma, Chengqian, Morris, Margaret E., Chang, Chun-Cheng, Xu, Xuhai "Orson" et al. 2024. "From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models." Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 8 (2).
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.relation.journalProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologiesen_US
dc.identifier.mitlicensePUBLISHER_CC
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2024-06-01T07:58:35Z
dc.language.rfc3066en
dc.rights.holderThe author(s)
dspace.date.submission2024-06-01T07:58:36Z
mit.journal.volume8en_US
mit.journal.issue2en_US
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record