Show simple item record

dc.contributor.authorKeller, Mikaela
dc.contributor.authorFreifeld, Clark C.
dc.contributor.authorBrownstein, John S.
dc.date.accessioned2010-03-10T16:07:28Z
dc.date.available2010-03-10T16:07:28Z
dc.date.issued2009-11
dc.date.submitted2009-06
dc.identifier.issn1471-2105
dc.identifier.urihttp://hdl.handle.net/1721.1/52463
dc.description.abstractBackground Automated surveillance of the Internet provides a timely and sensitive method for alerting on global emerging infectious disease threats. HealthMap is part of a new generation of online systems designed to monitor and visualize, on a real-time basis, disease outbreak alerts as reported by online news media and public health sources. HealthMap is of specific interest for national and international public health organizations and international travelers. A particular task that makes such a surveillance useful is the automated discovery of the geographic references contained in the retrieved outbreak alerts. This task is sometimes referred to as "geo-parsing". A typical approach to geo-parsing would demand an expensive training corpus of alerts manually tagged by a human. Results Given that human readers perform this kind of task by using both their lexical and contextual knowledge, we developed an approach which relies on a relatively small expert-built gazetteer, thus limiting the need of human input, but focuses on learning the context in which geographic references appear. We show in a set of experiments, that this approach exhibits a substantial capacity to discover geographic locations outside of its initial lexicon. Conclusion The results of this analysis provide a framework for future automated global surveillance efforts that reduce manual input and improve timeliness of reporting.en
dc.description.sponsorshipGoogle.orgen
dc.description.sponsorshipNational Library of Medicine and the National Institutes of Health (grant G08LM009776-01A2)en
dc.language.isoen_US
dc.publisherBioMed Central Ltd.en
dc.relation.isversionofhttp://dx.doi.org/10.1186/1471-2105-10-385en
dc.rightsCreative Commons Attributionen
dc.rights.urihttp://creativecommons.org/licenses/by/2.0/en
dc.sourceBioMed Centralen
dc.titleAutomated vocabulary discovery for geo-parsing online epidemic intelligenceen
dc.typeArticleen
dc.identifier.citationKeller, Mikaela, Clark Freifeld, and John Brownstein. “Automated vocabulary discovery for geo-parsing online epidemic intelligence.” BMC Bioinformatics 10.1 (2009): 385.en
dc.contributor.departmentHarvard University--MIT Division of Health Sciences and Technologyen_US
dc.contributor.departmentProgram in Media Arts and Sciences (Massachusetts Institute of Technology)en_US
dc.contributor.approverFreifeld, Clark C.
dc.contributor.mitauthorFreifeld, Clark C.
dc.relation.journalBMC Bioinformaticsen
dc.eprint.versionFinal published versionen
dc.identifier.pmid19930702
dc.type.urihttp://purl.org/eprint/type/JournalArticleen
eprint.statushttp://purl.org/eprint/status/PeerRevieweden
dspace.orderedauthorsKeller, Mikaela; Freifeld, Clark C; Brownstein, John Sen
dspace.mitauthor.errortrue
mit.licensePUBLISHER_CCen
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record