Show simple item record

dc.contributor.advisorGupta, Amar
dc.contributor.advisorSzolovits, Peter
dc.contributor.advisorRhodes, Donna H.
dc.contributor.authorChen, Ta Hang
dc.date.accessioned2022-01-14T15:20:41Z
dc.date.available2022-01-14T15:20:41Z
dc.date.issued2021-06
dc.date.submitted2021-06-25T20:15:58.184Z
dc.identifier.urihttps://hdl.handle.net/1721.1/139571
dc.description.abstractAutomatic document processing is always a strategy for business executives to improve operational efficiency. With Optical Character Recognition (OCR) and machine learning techniques, businesses are able to apply Artificial Intelligence (AI) to automate the process. However, introducing an AI application to business is challenging; it is easy to fail because of the complexity between the technical and organizational components. This thesis considers document processing from a sociotechnical system perspective and leverages a four-step system analysis approach to identify the critical components. This research also proposes a machine learning model using Support Vector Machine (SVM) as the classifier and Word2vec embeddings as document features to classify business documents. The proposed model reaches a 0.872 Macro F1-score using scanned business documents from the RVL-CDIP dataset. The proposed model outperforms the other commonly used rule-based algorithms, RIPPER and PART, showing that the proposed model is potentially suitable to be deployed into business to classify the documents.
dc.publisherMassachusetts Institute of Technology
dc.rightsIn Copyright - Educational Use Permitted
dc.rightsCopyright MIT
dc.rights.urihttp://rightsstatements.org/page/InC-EDU/1.0/
dc.titleAn Artificial Intelligence Based Approach to Automate Document Processing in Business Area
dc.typeThesis
dc.description.degreeS.M.
dc.description.degreeS.M.
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.contributor.departmentSystem Design and Management Program.
mit.thesis.degreeMaster
thesis.degree.nameMaster of Science in Engineering and Management
thesis.degree.nameMaster of Science in Electrical Engineering and Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record