A Machine Learning Approach to Zeolite Synthesis Enabled by Automatic Literature Data Extraction
Author(s)
Jensen, Zach; Kim, Edward; Kwon, Soonhyoung; Gani, Terry ZH; Román-Leshkov, Yuriy; Moliner, Manuel; Corma, Avelino; Olivetti, Elsa; ... Show more Show less
DownloadPublished version (1.264Mb)
Terms of use
Metadata
Show full item recordAbstract
© 2019 American Chemical Society. Zeolites are porous, aluminosilicate materials with many industrial and "green" applications. Despite their industrial relevance, many aspects of zeolite synthesis remain poorly understood requiring costly trial and error synthesis. In this paper, we create natural language processing techniques and text markup parsing tools to automatically extract synthesis information and trends from zeolite journal articles. We further engineer a data set of germanium-containing zeolites to test the accuracy of the extracted data and to discover potential opportunities for zeolites containing germanium. We also create a regression model for a zeolite's framework density from the synthesis conditions. This model has a cross-validated root mean squared error of 0.98 T/1000 Å 3 , and many of the model decision boundaries correspond to known synthesis heuristics in germanium-containing zeolites. We propose that this automatic data extraction can be applied to many different problems in zeolite synthesis and enable novel zeolite morphologies.
Date issued
2019Department
Massachusetts Institute of Technology. Department of Materials Science and Engineering; Massachusetts Institute of Technology. Department of Chemical EngineeringJournal
ACS Central Science
Publisher
American Chemical Society (ACS)