Show simple item record

dc.contributor.advisorRegina Barzilay.en_US
dc.contributor.authorDirie, Abdi-Hakin Aen_US
dc.contributor.otherMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.en_US
dc.date.accessioned2018-12-11T20:38:30Z
dc.date.available2018-12-11T20:38:30Z
dc.date.copyright2017en_US
dc.date.issued2017en_US
dc.identifier.urihttp://hdl.handle.net/1721.1/119519
dc.descriptionThesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2017.en_US
dc.descriptionThis electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.en_US
dc.descriptionCataloged from student-submitted PDF version of thesis.en_US
dc.descriptionIncludes bibliographical references (pages 63-64).en_US
dc.description.abstractE-commerce sites are increasingly becoming the norm for how consumers search, purchase, and review products. Such sites internally list millions of products, creating a torrent of product options that can overwhelm a browsing consumer. To facilitate their search, it helps to annotate each product with a table of attributes describing general features such as color, size, etc. However, the tables must be provided by the merchant, so there is a business incentive to automate this task by extracting attribute-value information directly from product titles and descriptions. However, while past methods have done extraction for only a handful of attributes, in practice their exists hundreds of diverse attributes. In this thesis, we present a single model for extracting information on all attributes. In addition, we show that incorporating extra information about intra-attribute similarity improves performance for data-poor attributes.en_US
dc.description.statementofresponsibilityby Abdi-Hakin A. Dirie.en_US
dc.format.extent64 pagesen_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsMIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectElectrical Engineering and Computer Science.en_US
dc.titleExtracting diverse attribute-value information from product catalog text via transfer learningen_US
dc.typeThesisen_US
dc.description.degreeM. Eng.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.identifier.oclc1066345161en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record