Show simple item record

dc.contributor.advisorMatt Pokress and George Verghese.en_US
dc.contributor.authorZahray, Lisa(Lisa A.)en_US
dc.contributor.otherMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.en_US
dc.date.accessioned2020-11-23T17:39:28Z
dc.date.available2020-11-23T17:39:28Z
dc.date.copyright2019en_US
dc.date.issued2019en_US
dc.identifier.urihttps://hdl.handle.net/1721.1/128575
dc.descriptionThis electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.en_US
dc.descriptionThesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, June, 2019en_US
dc.descriptionCataloged from student-submitted PDF of thesis.en_US
dc.descriptionIncludes bibliographical references (pages 61-63).en_US
dc.description.abstractManual data entry from a form into a database is a time consuming and error-prone task. In the case of prescription documents, errors are especially important to avoid in order to protect patients' health and safety. This project discusses the design and evaluation of a system that automates portions of data entry workflow, focusing on prescription information originating from fax forms. The first part of the thesis discusses the approaches used for faxes of a known format, using techniques including denoising, deskewing, template matching, and handwritten digit recognition. One successful task in this area was checkbox detection to identify whether prescriptions were renewed or denied. The second part of the thesis focuses on faxes of unknown formats, utilizing optical character recognition (OCR) technology and a customized implementation of an approximate string matching algorithm. Customer and prescriber information were extracted with high accuracy, and drug name extraction was investigated with suggestions for further improvement.en_US
dc.description.statementofresponsibilityby Lisa Zahray.en_US
dc.format.extent63 pagesen_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsMIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectElectrical Engineering and Computer Science.en_US
dc.titleAutomating data extraction from prescription document images to reduce human erroren_US
dc.typeThesisen_US
dc.description.degreeM. Eng.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.identifier.oclc1220877663en_US
dc.description.collectionM.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Scienceen_US
dspace.imported2020-11-23T17:39:27Zen_US
mit.thesis.degreeMasteren_US
mit.thesis.departmentEECSen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record