Show simple item record

dc.contributor.advisorG. Octo Barnett, Henry Chueh and Shawn N. Murphy.en_US
dc.contributor.authorChung, Jeanhee, 1972-en_US
dc.contributor.otherHarvard University--MIT Division of Health Sciences and Technology.en_US
dc.date.accessioned2005-09-27T17:09:43Z
dc.date.available2005-09-27T17:09:43Z
dc.date.copyright2004en_US
dc.date.issued2004en_US
dc.identifier.urihttp://hdl.handle.net/1721.1/28584
dc.descriptionThesis (S.M.)--Harvard-MIT Division of Health Sciences and Technology, 2004.en_US
dc.descriptionIncludes bibliographical references (p. 38-39).en_US
dc.description.abstractThe task of gathering detailed patient information from narrative clinical text presents a significant barrier to clinical research. A prototype information extraction system was developed to extract pre-specified findings from narrative echocardiogram reports. The system which uses a Unified Medical Language System compatible architecture is very simple and takes advantage of canonical language use patterns to identify sentence templates with which concepts and their values can be identified. The data extracted from this system will be used to enrich an existing database used by clinical researchers in a large university healthcare system to identify potential research candidates fulfilling clinical inclusion criteria. The system was developed and evaluated using ten pre-determined clinical concepts. Concept-value pairs extracted by the system related to these ten conditions were compared with findings extracted manually by the author. The system was able to recall 78% of the relevant findings (CI, 76% to 80%), with a precision of 99% (CI, 98%-99%). Because data acquired from the system will ultimately be used in document and patient retrieval, preliminary analysis was done to evaluate document retrieval effectiveness. Median recall across the ten conditions was 36% (range, 0% to 93%). The system retrieved no documents for two of the ten conditions; median precision for the remaining eight conditions was 100% (range, 92% to 100%).en_US
dc.description.statementofresponsibilityby Jeanhee Chung.en_US
dc.format.extent39 p.en_US
dc.format.extent2470218 bytes
dc.format.extent2472397 bytes
dc.format.mimetypeapplication/pdf
dc.format.mimetypeapplication/pdf
dc.language.isoen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsM.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582
dc.subjectHarvard University--MIT Division of Health Sciences and Technology.en_US
dc.titleConcept-value pair extraction from semi-structured clinical reports : a case study using echocardiogram reportsen_US
dc.typeThesisen_US
dc.description.degreeS.M.en_US
dc.contributor.departmentHarvard University--MIT Division of Health Sciences and Technology
dc.identifier.oclc57470726en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record