Text Analytics to Inform Deviation Root Cause Analysis in Biomanufacturing

Nersesian, Lois E.

dc.contributor.advisor	Levi, Retsef
dc.contributor.advisor	Braatz, Richard D.
dc.contributor.author	Nersesian, Lois E.
dc.date.accessioned	2022-08-29T16:16:02Z
dc.date.available	2022-08-29T16:16:02Z
dc.date.issued	2022-05
dc.date.submitted	2022-05-12T16:46:19.700Z
dc.identifier.uri	https://hdl.handle.net/1721.1/144849
dc.description.abstract	In biomanufacturing, product quality and safety are critical, and many controls are in place to ensure that processes are followed within the prescribed operating limits. However, deviations from these processes inevitably occur, sometimes requiring in-depth investigations to determine the cause and prevent recurrence. Understanding quality trends on the manufacturing line is also critical in preventing quality issues. At Amgen, a leading biotechnology company, results of such investigations are stored long-term but only in a partially structured manner, making it hard to leverage this historical data to enhance deviation investigation efficiency and study long-term quality trends. The goal of this project is to use these historical records to draw insights into the investigation process and help increase the efficiency and accuracy of future deviation investigations and overall quality assurance. To achieve this, we use natural language processing tools to derive information from text describing deviations and causal factors. Several methods are explored: unsupervised clustering, which uses machine learning and natural language processing to identify and cluster similar causal factors; explicit text extraction, which identifies known key terms such as equipment mentioned in the text; and process-dependent step classification, which leverages reference documents describing the manufacturing process to assign records to process steps. The outputs of these methods are presented in a proof-of-concept tool which can be used to assist investigators. Our results indicate that all these methods have benefits and drawbacks but can be used together for maximal insights.
Based on the status of each method, we suggest that Amgen work to create a tool to present potential causal factors to investigators immediately, incorporating clustering and text extraction methods after minor refinement, and continue to explore the potential of process-driven methodologies.
dc.publisher	Massachusetts Institute of Technology
dc.rights	In Copyright - Educational Use Permitted
dc.rights	Copyright retained by author(s)
dc.rights.uri	https://rightsstatements.org/page/InC-EDU/1.0/
dc.title	Text Analytics to Inform Deviation Root Cause Analysis in Biomanufacturing
dc.type	Thesis
dc.description.degree	M.B.A.
dc.description.degree	S.M.
dc.contributor.department	Massachusetts Institute of Technology. Department of Chemical Engineering
dc.contributor.department	Sloan School of Management
mit.thesis.degree	Master
thesis.degree.name	Master of Business Administration
thesis.degree.name	Master of Science in Chemical Engineering

Files in this item

Name: nersesian-loisn-sm-cheme-2022- ...
Size: 17.10Mb
Format: PDF
Description: Thesis PDF

This item appears in the following Collection(s)

Graduate Theses
