Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech

Haulcy, R'mani(R'mani Symon); Glass, James R

dc.contributor.author	Haulcy, R'mani(R'mani Symon)
dc.contributor.author	Glass, James R
dc.date.accessioned	2021-09-22T17:40:42Z
dc.date.available	2021-09-22T17:40:42Z
dc.date.issued	2021-01
dc.date.submitted	2020-10
dc.identifier.issn	1664-1078
dc.identifier.uri	https://hdl.handle.net/1721.1/132627
dc.description.abstract	Alzheimer's Disease (AD) is a form of dementia that affects the memory, cognition, and motor skills of patients. Extensive research has been done to develop accessible, cost-effective, and non-invasive techniques for the automatic detection of AD. Previous research has shown that speech can be used to distinguish between healthy patients and afflicted patients. In this paper, the ADReSS dataset, a dataset balanced by gender and age, was used to automatically classify AD from spontaneous speech. The performance of five classifiers, as well as a convolutional neural network and long short-term memory network, was compared when trained on audio features (i-vectors and x-vectors) and text features (word vectors, BERT embeddings, LIWC features, and CLAN features). The same audio and text features were used to train five regression models to predict the Mini-Mental State Examination score for each patient, a score that has a maximum value of 30. The top-performing classification models were the support vector machine and random forest classifiers trained on BERT embeddings, which both achieved an accuracy of 85.4% on the test set. The best-performing regression model was the gradient boosting regression model trained on BERT embeddings and CLAN features, which had a root mean squared error of 4.56 on the test set. The performance on both tasks illustrates the feasibility of using speech to classify AD and predict neuropsychological scores.	en_US
dc.publisher	Frontiers Media SA	en_US
dc.relation.isversionof	https://doi.org/10.3389/fpsyg.2020.624137	en_US
dc.rights	Creative Commons Attribution 4.0 International license	en_US
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	en_US
dc.source	Frontiers	en_US
dc.title	Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech	en_US
dc.type	Article	en_US
dc.identifier.citation	Haulcy, R'mani and James Glass. "Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech." Frontiers in Psyhcology 11 (January 2021): 624137. © 2021 Haulcy and Glass	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.relation.journal	Frontiers in Psyhcology	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dspace.date.submission	2021-04-28T12:45:53Z
mit.journal.volume	11	en_US
mit.license	PUBLISHER_CC
mit.metadata.status	Complete	en_US

Files in this item

Name:: fpsyg-11-624137.pdf
Size:: 621.1Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record