Multimodal inductive transfer learning for detection of Alzheimer's dementia and its severity
Author(s)
Sarawgi, Utkarsh; Zulfikar, Wazeer; Soliman, Nouran; Maes, Pattie
DownloadAccepted version (414.4Kb)
Open Access Policy
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
Copyright © 2020 ISCA Alzheimer's disease is estimated to affect around 50 million people worldwide and is rising rapidly, with a global economic burden of nearly a trillion dollars. This calls for scalable, cost-effective, and robust methods for detection of Alzheimer's dementia (AD). We present a novel architecture that leverages acoustic, cognitive, and linguistic features to form a multimodal ensemble system. It uses specialized artificial neural networks with temporal characteristics to detect AD and its severity, which is reflected through Mini-Mental State Exam (MMSE) scores. We first evaluate it on the ADReSS challenge dataset, which is a subject-independent and balanced dataset matched for age and gender to mitigate biases, and is available through DementiaBank. Our system achieves state-of-the-art test accuracy, precision, recall, and F1-score of 83.3% each for AD classification, and state-of-the-art test root mean squared error (RMSE) of 4.60 for MMSE score regression. To the best of our knowledge, the system further achieves state-of-the-art AD classification accuracy of 88.0% when evaluated on the full benchmark DementiaBank Pitt database. Our work highlights the applicability and transferability of spontaneous speech to produce a robust inductive transfer learning model, and demonstrates generalizability through a task-agnostic feature-space. The source code is available at https://github.com/wazeerzulfikar/alzheimers-dementia.
Date issued
2020-10Department
Massachusetts Institute of Technology. Media Laboratory; Massachusetts Institute of Technology. Computer Science and Artificial Intelligence LaboratoryJournal
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publisher
ISCA
Citation
Sarawgi, U, Zulfikar, W, Soliman, N and Maes, P. 2020. "Multimodal inductive transfer learning for detection of Alzheimer's dementia and its severity." Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020-October.
Version: Original manuscript
ISBN
9781713820697