MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Concept-value pair extraction from semi-structured clinical reports : a case study using echocardiogram reports

Author(s)
Chung, Jeanhee, 1972-
Thumbnail
DownloadFull printable version (2.357Mb)
Other Contributors
Harvard University--MIT Division of Health Sciences and Technology.
Advisor
G. Octo Barnett, Henry Chueh and Shawn N. Murphy.
Terms of use
M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582
Metadata
Show full item record
Abstract
The task of gathering detailed patient information from narrative clinical text presents a significant barrier to clinical research. A prototype information extraction system was developed to extract pre-specified findings from narrative echocardiogram reports. The system which uses a Unified Medical Language System compatible architecture is very simple and takes advantage of canonical language use patterns to identify sentence templates with which concepts and their values can be identified. The data extracted from this system will be used to enrich an existing database used by clinical researchers in a large university healthcare system to identify potential research candidates fulfilling clinical inclusion criteria. The system was developed and evaluated using ten pre-determined clinical concepts. Concept-value pairs extracted by the system related to these ten conditions were compared with findings extracted manually by the author. The system was able to recall 78% of the relevant findings (CI, 76% to 80%), with a precision of 99% (CI, 98%-99%). Because data acquired from the system will ultimately be used in document and patient retrieval, preliminary analysis was done to evaluate document retrieval effectiveness. Median recall across the ten conditions was 36% (range, 0% to 93%). The system retrieved no documents for two of the ten conditions; median precision for the remaining eight conditions was 100% (range, 92% to 100%).
Description
Thesis (S.M.)--Harvard-MIT Division of Health Sciences and Technology, 2004.
 
Includes bibliographical references (p. 38-39).
 
Date issued
2004
URI
http://hdl.handle.net/1721.1/28584
Department
Harvard University--MIT Division of Health Sciences and Technology
Publisher
Massachusetts Institute of Technology
Keywords
Harvard University--MIT Division of Health Sciences and Technology.

Collections
  • Graduate Theses

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.