A machine-learning approach to aerosol classification for single-particle mass spectrometry
Author(s)
Christopoulos, Costa (Costa D.)![Thumbnail](/bitstream/handle/1721.1/115038/1031985395-MIT.pdf.jpg?sequence=3&isAllowed=y)
DownloadFull printable version (2.977Mb)
Other Contributors
Massachusetts Institute of Technology. Department of Earth, Atmospheric, and Planetary Sciences.
Advisor
David McGee.
Terms of use
Metadata
Show full item recordAbstract
Compositional analysis of atmospheric and laboratory aerosols is often conducted via single-particle mass spectrometry (SPMS), an in situ and real-time analytical technique that produces mass spectra on a single particle basis. In this study, machine learning classification algorithms are created using a dataset of SPMS spectra to automatically differentiate particles on the basis of chemistry and size. While clustering methods have been used to group aerosols into broad categories based on similarity, these models do not incorporate known aerosols labels and are not explicitly formulated for classification. Furthermore, traditional methods often rely on a smaller set of well-known, important variables whereas the proposed method is more general and flexible, allowing researchers to automatically quantify and select important variables from any aerosol subset. In this work, machine learning algorithms build a predictive model from a training set in which the aerosol type associated with each mass spectrum is known. Several such classification models were created to differentiate aerosol types in four broad categories: fertile soils, mineral/metallic particles, biological, and all other aerosols using -40 common positive and negative spectral features. For this broad categorization, machine learning resulted in a classification accuracy of -93%. More complex models were developed to classify aerosols into specific categories which resulted in a classification accuracy of -87%. The trained model was then applied to a 'blind' mixture of aerosols with model agreement on the presence of secondary organic aerosol, coated and uncoated mineral dust and fertile soil. Additionally, the model is used to characterize an ambient atmospheric dataset collected from the free troposphere.
Description
Thesis: S.B., Massachusetts Institute of Technology, Department of Earth, Atmospheric, and Planetary Sciences, 2017. Cataloged from PDF version of thesis. Includes bibliographical references (pages 21-26).
Date issued
2017Department
Massachusetts Institute of Technology. Department of Earth, Atmospheric, and Planetary SciencesPublisher
Massachusetts Institute of Technology
Keywords
Earth, Atmospheric, and Planetary Sciences.