MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Concept extraction for disability insurance payment evaluation

Author(s)
Lai, Jeremy
Thumbnail
DownloadFull printable version (1.722Mb)
Alternative title
Evaluation of electronic medical records for insurance qualification
Other Contributors
Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science.
Advisor
Peter Szolovits and William J. Long.
Terms of use
M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582
Metadata
Show full item record
Abstract
Automated evaluation of claims for medical and disability insurance benefits poses a difficult challenge that will take years to be solved. The precise wording of insurance rules and the terse language in medical history files make it difficult for humans, let alone computers, to assess insurance payment qualification accurately. In this thesis, we work towards building a tool that will aid, but not replace, human evaluators. We automate the extraction of relevant parts of medical history files; if sufficiently accurate, this would eliminate the need for human evaluators to comb through hundreds of pages of medical history files. We first create a list of medical concepts, mainly disease and procedure names, from the cardiovascular section of the "Blue Book" for Disability Evaluation under Social Security. Then, using a variation of the longest common substring algorithm, we characterize each medical file line using its substring overlaps with the list of medical concepts. Finally, with human annotations of whether each medical file line is relevant or not, we build machine learning classifiers predicting each line's relevance using its overlap characterization. The classifiers we use are Naive Bayes and Support Vector Machines.
Description
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2011.
 
Cataloged from PDF version of thesis.
 
Includes bibliographical references (p. 27-28).
 
Date issued
2011
URI
http://hdl.handle.net/1721.1/66432
Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Publisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.

Collections
  • Graduate Theses

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.