dc.contributor.advisor | Gupta, Amar | |
dc.contributor.advisor | Szolovits, Peter | |
dc.contributor.advisor | Rhodes, Donna H. | |
dc.contributor.author | Chen, Ta Hang | |
dc.date.accessioned | 2022-01-14T15:20:41Z | |
dc.date.available | 2022-01-14T15:20:41Z | |
dc.date.issued | 2021-06 | |
dc.date.submitted | 2021-06-25T20:15:58.184Z | |
dc.identifier.uri | https://hdl.handle.net/1721.1/139571 | |
dc.description.abstract | Automatic document processing is always a strategy for business executives to improve operational efficiency. With Optical Character Recognition (OCR) and machine learning techniques, businesses are able to apply Artificial Intelligence (AI) to automate the process. However, introducing an AI application to business is challenging; it is easy to fail because of the complexity between the technical and organizational components. This thesis considers document processing from a sociotechnical system perspective and leverages a four-step system analysis approach to identify the critical components.
This research also proposes a machine learning model using Support Vector Machine (SVM) as the classifier and Word2vec embeddings as document features to classify business documents. The proposed model reaches a 0.872 Macro F1-score using scanned business documents from the RVL-CDIP dataset. The proposed model outperforms the other commonly used rule-based algorithms, RIPPER and PART, showing that the proposed model is potentially suitable to be deployed into business to classify the
documents. | |
dc.publisher | Massachusetts Institute of Technology | |
dc.rights | In Copyright - Educational Use Permitted | |
dc.rights | Copyright MIT | |
dc.rights.uri | http://rightsstatements.org/page/InC-EDU/1.0/ | |
dc.title | An Artificial Intelligence Based Approach to Automate Document Processing in Business Area | |
dc.type | Thesis | |
dc.description.degree | S.M. | |
dc.description.degree | S.M. | |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
dc.contributor.department | System Design and Management Program. | |
mit.thesis.degree | Master | |
thesis.degree.name | Master of Science in Engineering and Management | |
thesis.degree.name | Master of Science in Electrical Engineering and Computer Science | |