| dc.contributor.advisor | Antonio Torralba. | en_US |
| dc.contributor.author | Shimanuki, Brian. | en_US |
| dc.contributor.other | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. | en_US |
| dc.date.accessioned | 2019-12-05T18:05:51Z | |
| dc.date.available | 2019-12-05T18:05:51Z | |
| dc.date.copyright | 2019 | en_US |
| dc.date.issued | 2019 | en_US |
| dc.identifier.uri | https://hdl.handle.net/1721.1/123143 | |
| dc.description | This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. | en_US |
| dc.description | Thesis: M. Eng. in Computer Science and Engineering, Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2019 | en_US |
| dc.description | Cataloged from student-submitted PDF version of thesis. | en_US |
| dc.description | Includes bibliographical references (pages 59-62). | en_US |
| dc.description.abstract | The computer vision and natural language processing communities have come together on image captioning related problems, but the fields have remained largely disjoint. There has been work in transforming text to text, images to text, text to images, and images to images, and work on generating images from nothing, and text from nothing, but no work on generating images and text together. This work looks at using GAN methods in order to generate images and text simultaneously using a shared representation. Visual representations of text are employed to use GAN techniques for both images and text. We present a framework for jointly generating images and text with a similarity loss that allows the model to learn a semantic representation. | en_US |
| dc.description.statementofresponsibility | by Brian Shimanuki. | en_US |
| dc.format.extent | 62 pages | en_US |
| dc.language.iso | eng | en_US |
| dc.publisher | Massachusetts Institute of Technology | en_US |
| dc.rights | MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. | en_US |
| dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
| dc.subject | Electrical Engineering and Computer Science. | en_US |
| dc.title | Joint generation of image and text with GANs | en_US |
| dc.title.alternative | Joint generation of image and text with generative adversarial networks | en_US |
| dc.type | Thesis | en_US |
| dc.description.degree | M. Eng. in Computer Science and Engineering | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.identifier.oclc | 1128823720 | en_US |
| dc.description.collection | M.Eng.inComputerScienceandEngineering Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science | en_US |
| dspace.imported | 2019-12-05T18:05:50Z | en_US |
| mit.thesis.degree | Master | en_US |
| mit.thesis.department | EECS | en_US |