dc.contributor.advisor | Michael Stonebraker. | en_US |
dc.contributor.author | Duan, Peitong | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. | en_US |
dc.date.accessioned | 2017-12-08T21:20:37Z | |
dc.date.available | 2017-12-08T21:20:37Z | |
dc.date.copyright | 2017 | en_US |
dc.date.issued | 2017 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/112665 | |
dc.description | Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2017. | en_US |
dc.description | This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. | en_US |
dc.description | Cataloged from student-submitted student-submitted PDF version of thesis. | en_US |
dc.description | Includes bibliographical references (pages 71-74). | en_US |
dc.description.abstract | In this paper, we present Beagle, an automated data collection system to mine the web for SVG-based visualization images, label them with their corresponding visualization type (i.e., bar, scatter, pie, etc.), and make them available as a queryable data store. The key idea behind Beagle is a new SVG-based classification design to more effectively label visualizations rendered in a browser. Furthermore, Beagle is designed from the ground up to be extendable and modifiable in a straightforward way, to anticipate when new artifacts appear on the web, such as new JavaScript libraries, new visualization types, and better browser support for SVG. We evaluated Beagle's classification techniques on multiple collections of SVG-based visualizations extracted from the web, and found that Beagle provides a significant boost in accuracy compared to existing classification techniques, across a wide variety of visualization types. | en_US |
dc.description.statementofresponsibility | by Peitong Duan. | en_US |
dc.format.extent | 74 pages | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
dc.subject | Electrical Engineering and Computer Science. | en_US |
dc.title | Beagle : automated extraction and interpretation of visualizations from the web | en_US |
dc.type | Thesis | en_US |
dc.description.degree | M. Eng. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
dc.identifier.oclc | 1014123603 | en_US |