Show simple item record

dc.contributor.authorCao, Lei
dc.contributor.authorTao, Wenbo
dc.contributor.authorAn, Sungtae
dc.contributor.authorJin, Jing
dc.contributor.authorYan, Yizhou
dc.contributor.authorLiu, Xiaoyu
dc.contributor.authorGe, Wendong
dc.contributor.authorSah, Adam
dc.contributor.authorBattle, Leilani
dc.contributor.authorSun, Jimeng
dc.contributor.authorChang, Remco
dc.contributor.authorWestover, Brandon
dc.contributor.authorMadden, Samuel
dc.contributor.authorStonebraker, Michael
dc.date.accessioned2021-11-05T15:34:30Z
dc.date.available2021-11-05T15:34:30Z
dc.date.issued2019
dc.identifier.urihttps://hdl.handle.net/1721.1/137531
dc.description.abstract© 2019 VLDB Endowment. In order to reduce the possibility of neural injury from seizures and sidestep the need for a neurologist to spend hours on manually reviewing the EEG recording, it is critical to automatically detect and classify "interictal-ictal continuum" (IIC) patterns from EEG data. However, the existing IIC classification techniques are shown to be not accurate and robust enough for clinical use because of the lack of high quality labels of EEG segments as training data. Obtaining high-quality labeled data is traditionally a manual process by trained clinicians that can be tedious, time-consuming, and errorprone. In this work, we propose Smile, an industrial scale system that provides an end-to-end solution to the IIC pattern classification problem. The core components of Smile include a visualizationbased time series labeling module and a deep-learning based active learning module. The labeling module enables the users to explore and label 350 million EEG segments (30TB) at interactive speed. The multiple coordinated views allow the users to examine the EEG signals from both time domain and frequency domain simultaneously. The active learning module first trains a deep neural network that automatically extracts both the local features with respect to each segment itself and the long term dynamics of the EEG signals to classify IIC patterns. Then leveraging the output of the deep learning model, the EEG segments that can best improve the model are selected and prompted to clinicians to label. This process is iterated until the clinicians and the models show high degree of agreement. Our initial experimental results show that our Smile system allows the clinicians to label the EEG segments at will with a response time below 500 ms. The accuracy of the model is progressively improved as more and more high quality labels are acquired over time.en_US
dc.language.isoen
dc.publisherVLDB Endowmenten_US
dc.relation.isversionof10.14778/3352063.3352138en_US
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivs Licenseen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/en_US
dc.sourceVLDB Endowmenten_US
dc.titleSmile: a system to support machine learning on EEG data at scaleen_US
dc.typeArticleen_US
dc.identifier.citationCao, Lei, Tao, Wenbo, An, Sungtae, Jin, Jing, Yan, Yizhou et al. 2019. "Smile: a system to support machine learning on EEG data at scale." Proceedings of the VLDB Endowment, 12 (12).
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.relation.journalProceedings of the VLDB Endowmenten_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2021-01-29T17:48:42Z
dspace.orderedauthorsCao, L; Tao, W; An, S; Jin, J; Yan, Y; Liu, X; Ge, W; Sah, A; Battle, L; Sun, J; Chang, R; Westover, B; Madden, S; Stonebraker, Men_US
dspace.date.submission2021-01-29T17:49:06Z
mit.journal.volume12en_US
mit.journal.issue12en_US
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record