Machine learning for well rate estimation : integrated imputation and stacked ensemble modeling
Author(s)
Wilson, Oliver John.
Download1263351398-MIT.pdf (10.33Mb)
Other Contributors
Massachusetts Institute of Technology. Engineering and Management Program.
System Design and Management Program.
Terms of use
Metadata
Show full item recordAbstract
This thesis describes a stacked ensemble, supervised machine learning problem for well rate estimations utilizing well test features that are far from independent and identically distributed (IID), and exhibit missing data with a not missing at random (MNAR) classification from three different oil fields. This research introduces a novel integrated imputation procedure that combines the imputation model selection with the cross-validation procedure for downstream model tuning without data "leakage"--the primary objective shifts from minimizing the imputation data error to minimizing the downstream hold-out error. A stratified time-slicing rolling forecast cross-validation procedure is implemented to minimize over-fitting from the plethora of statistical assumptions that are violated. This thesis seeks to test a framework that will enable well rate estimations for fields available well test data to improve well surveillance capabilities in order to maximize production metrics and minimize adverse health and environmental impacts.
Description
Thesis: S.M. in Engineering and Management, Massachusetts Institute of Technology, System Design and Management Program, September, 2020 Cataloged from the official version of thesis. "September 2020." Includes bibliographical references (pages 115-118).
Date issued
2020Department
Massachusetts Institute of Technology. Engineering and Management ProgramPublisher
Massachusetts Institute of Technology
Keywords
Engineering and Management Program., System Design and Management Program.