dc.contributor.author | Wu, Julia | |
dc.contributor.author | Sivaraman, Venkatesh | |
dc.contributor.author | Kumar, Dheekshita | |
dc.contributor.author | Banda, Juan M. | |
dc.contributor.author | Sontag, David Alexander | |
dc.date.accessioned | 2021-07-22T16:03:29Z | |
dc.date.available | 2021-07-22T16:03:29Z | |
dc.date.issued | 2021-08 | |
dc.date.submitted | 2021-06 | |
dc.identifier.issn | 1532-0464 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/131127 | |
dc.description.abstract | The rapid evolution of the COVID-19 pandemic has underscored the need to quickly disseminate the latest clinical knowledge during a public-health emergency. One surprisingly effective platform for healthcare professionals (HCPs) to share knowledge and experiences from the front lines has been social media (for example, the "#medtwitter" community on Twitter). However, identifying clinically-relevant content in social media without manual labeling is a challenge because of the sheer volume of irrelevant data. We present an unsupervised, iterative approach to mine clinically relevant information from social media data, which begins by heuristically filtering for HCP-authored texts and incorporates topic modeling and concept extraction with MetaMap. This approach identifies granular topics and tweets with high clinical relevance from a set of about 52 million COVID-19-related tweets from January to mid-June 2020. We also show that because the technique does not require manual labeling, it can be used to identify emerging topics on a week-to-week basis. Our method can aid in future public-health emergencies by facilitating knowledge transfer among healthcare workers in a rapidly-changing information environment, and by providing an efficient and unsupervised way of highlighting potential areas for clinical research. | en_US |
dc.language.iso | en | |
dc.publisher | Elsevier BV | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1016/j.jbi.2021.103844 | en_US |
dc.rights | Creative Commons Attribution-NonCommercial-NoDerivs License | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | en_US |
dc.source | arXiv | en_US |
dc.title | Pulse of the pandemic: Iterative topic filtering for clinical information extraction from social media | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Wu, Julia et al. "Pulse of the pandemic: Iterative topic filtering for clinical information extraction from social media." Journal of Biomedical Informatics 120 (August 2021): 103844. © 2021 Elsevier | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.relation.journal | Journal of Biomedical Informatics | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
dc.date.updated | 2021-07-22T12:03:30Z | |
dspace.orderedauthors | Wu, J; Sivaraman, V; Kumar, D; Banda, JM; Sontag, D | en_US |
dspace.date.submission | 2021-07-22T12:03:33Z | |
mit.journal.volume | 120 | en_US |
mit.license | PUBLISHER_CC | |
mit.metadata.status | Complete | |