dc.contributor.advisor | Aziz Boxwala. | en_US |
dc.contributor.author | Stephen, Reejis, 1977- | en_US |
dc.contributor.other | Harvard University--MIT Division of Health Sciences and Technology. | en_US |
dc.date.accessioned | 2005-09-27T18:11:59Z | |
dc.date.available | 2005-09-27T18:11:59Z | |
dc.date.copyright | 2004 | en_US |
dc.date.issued | 2004 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/28760 | |
dc.description | Thesis (S.M.)--Harvard-MIT Division of Health Sciences and Technology, 2004. | en_US |
dc.description | Includes bibliographical references (leaves 66-67). | en_US |
dc.description.abstract | To automate data extraction from electronic medical documents, it is important to identify the correct context of the extracted information. Context in medical documents is provided by the layout of documents, which are partitioned into sections by virtue of a medical culture instilled through common practice and the training of physicians. Unfortunately, formatting and labeling conventions are inconsistently adhered to in practice, and human experts are usually required to identify sections in medical documents. A series of experiments tested the hypothesis that section identification independent of the label on sections could be achieved by using a neural network to elucidate relationships between features of sections (such as size and position from the start of the document) and the content characteristic of certain sections (subject-specific strings). Results showed that certain sections can be reliably identified using two different methods, and the costs involved are described. The stratification of documents by document type (such as History and Physical Examination Documents or Discharge Summaries), patient diagnoses, and department influenced the accuracy of identification. Future improvements suggested by the results, needed to fully outline the approach, are described. | en_US |
dc.description.statementofresponsibility | by Reejis Stephen. | en_US |
dc.format.extent | 144 leaves | en_US |
dc.format.extent | 4712642 bytes | |
dc.format.extent | 4731797 bytes | |
dc.format.mimetype | application/pdf | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en_US | |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | |
dc.subject | Harvard University--MIT Division of Health Sciences and Technology. | en_US |
dc.title | Context identification in electronic medical records | en_US |
dc.type | Thesis | en_US |
dc.description.degree | S.M. | en_US |
dc.contributor.department | Harvard University--MIT Division of Health Sciences and Technology | |
dc.identifier.oclc | 59823293 | en_US |