MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Beagle : automated extraction and interpretation of visualizations from the web

Author(s)
Duan, Peitong
Thumbnail
DownloadFull printable version (2.033Mb)
Other Contributors
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.
Advisor
Michael Stonebraker.
Terms of use
MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582
Metadata
Show full item record
Abstract
In this paper, we present Beagle, an automated data collection system to mine the web for SVG-based visualization images, label them with their corresponding visualization type (i.e., bar, scatter, pie, etc.), and make them available as a queryable data store. The key idea behind Beagle is a new SVG-based classification design to more effectively label visualizations rendered in a browser. Furthermore, Beagle is designed from the ground up to be extendable and modifiable in a straightforward way, to anticipate when new artifacts appear on the web, such as new JavaScript libraries, new visualization types, and better browser support for SVG. We evaluated Beagle's classification techniques on multiple collections of SVG-based visualizations extracted from the web, and found that Beagle provides a significant boost in accuracy compared to existing classification techniques, across a wide variety of visualization types.
Description
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2017.
 
This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.
 
Cataloged from student-submitted student-submitted PDF version of thesis.
 
Includes bibliographical references (pages 71-74).
 
Date issued
2017
URI
http://hdl.handle.net/1721.1/112665
Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Publisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.

Collections
  • Graduate Theses

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.