Unsupervised Clinical Language Translation
Author(s)
Weng, Wei-Hung; Chung, Yu-An; Szolovits, Peter
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Abstract
© 2019 Copyright held by the owner/author(s). Publication rights licensed to ACM.
As patients gain routine access to their doctors' clinical notes, translating professional clinical jargon into language a layperson can understand is essential to improving patient-clinician communication. Such translation yields better clinical outcomes by enhancing patients' understanding of their own health conditions and thus improving their involvement in their own care. Existing research has addressed this need with dictionary-based word replacement or definition insertion. However, these methods depend on expert curation, which is hard to scale and generalizes poorly to unseen datasets that do not share an overlapping vocabulary. In contrast, we approach the clinical word and sentence translation problem in a completely unsupervised manner. We show that a framework combining representation learning, bilingual dictionary induction, and statistical machine translation achieves a best precision at 10 of 0.827 on professional-to-consumer word translation, and mean opinion scores of 4.10 and 4.28 out of 5 for clinical correctness and layperson readability, respectively, on sentence translation. Our fully unsupervised strategy overcomes the curation problem, and the clinically meaningful evaluation reduces bias from inappropriate evaluators; both are critical in clinical machine learning.
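The abstract reports precision at 10 for word translation, but this record does not say exactly how the metric is computed. One common convention in bilingual dictionary induction counts a source term as correct when any of its top-k retrieved candidates is an accepted translation, then averages over source terms. The following is a minimal sketch under that assumption; the function name `precision_at_k` and the toy term lists are invented for illustration and are not taken from the paper.

```python
from typing import Dict, List, Set


def precision_at_k(
    ranked: Dict[str, List[str]],
    gold: Dict[str, Set[str]],
    k: int = 10,
) -> float:
    """Fraction of source terms with at least one gold translation in the top k.

    Assumed convention: a term counts as a hit if ANY of its first k ranked
    candidates appears in its gold translation set.
    """
    # Score only terms that have both a gold entry and a ranked candidate list.
    scored = [term for term in gold if term in ranked]
    if not scored:
        return 0.0
    hits = sum(
        1
        for term in scored
        if any(candidate in gold[term] for candidate in ranked[term][:k])
    )
    return hits / len(scored)


# Invented toy data: professional terms mapped to ranked consumer candidates.
ranked = {
    "dyspnea": ["breathlessness", "cough", "wheeze"],
    "emesis": ["nausea", "fever", "vomiting"],
    "pruritus": ["rash", "swelling", "redness"],
}
# Invented gold dictionary of acceptable consumer translations.
gold = {
    "dyspnea": {"breathlessness", "shortness"},
    "emesis": {"vomiting"},
    "pruritus": {"itching"},
}

p_at_1 = precision_at_k(ranked, gold, k=1)
p_at_3 = precision_at_k(ranked, gold, k=3)
print(f"P@1 = {p_at_1:.3f}, P@3 = {p_at_3:.3f}")
```

In this toy example only "dyspnea" is translated correctly at rank 1, while "emesis" is recovered once the cutoff widens to 3, which is why the score rises with k.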
Date issued
2019-08
Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Journal
Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Publisher
Association for Computing Machinery (ACM)
Citation
Weng, Wei-Hung, Chung, Yu-An and Szolovits, Peter. 2019. "Unsupervised Clinical Language Translation." Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
Version: Author's final manuscript