Cross-Modal Scene Networks

Aytar, Yusuf; Castrejon, Lluis; Vondrick, Carl; Pirsiavash, Hamed; Torralba, Antonio

Notice

This is not the latest version of this item. The latest version can be found at:https://dspace.mit.edu/handle/1721.1/134371.2

Show simple item record

dc.contributor.author	Aytar, Yusuf
dc.contributor.author	Castrejon, Lluis
dc.contributor.author	Vondrick, Carl
dc.contributor.author	Pirsiavash, Hamed
dc.contributor.author	Torralba, Antonio
dc.date.accessioned	2021-10-27T20:04:40Z
dc.date.available	2021-10-27T20:04:40Z
dc.date.issued	2018
dc.identifier.uri	https://hdl.handle.net/1721.1/134371
dc.description.abstract	© 1979-2012 IEEE. People can recognize scenes across many different modalities beyond natural images. In this paper, we investigate how to learn cross-modal scene representations that transfer across modalities. To study this problem, we introduce a new cross-modal scene dataset. While convolutional neural networks can categorize scenes well, they also learn an intermediate representation not aligned across modalities, which is undesirable for cross-modal transfer applications. We present methods to regularize cross-modal convolutional neural networks so that they have a shared representation that is agnostic of the modality. Our experiments suggest that our scene representation can help transfer representations across modalities for retrieval. Moreover, our visualizations suggest that units emerge in the shared representation that tend to activate on consistent concepts independently of the modality.
dc.language.iso	en
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.isversionof	10.1109/TPAMI.2017.2753232
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source	other univ website
dc.title	Cross-Modal Scene Networks
dc.type	Article
dc.relation.journal	IEEE Transactions on Pattern Analysis and Machine Intelligence
dc.eprint.version	Author's final manuscript
dc.type.uri	http://purl.org/eprint/type/JournalArticle
eprint.status	http://purl.org/eprint/status/PeerReviewed
dc.date.updated	2019-07-11T16:55:26Z
dspace.orderedauthors	Aytar, Y; Castrejon, L; Vondrick, C; Pirsiavash, H; Torralba, A
dspace.date.submission	2019-07-11T16:55:28Z
mit.journal.volume	40
mit.journal.issue	10
mit.metadata.status	Authority Work and Publication Information Needed

Files in this item

Name:: cmplaces_pami.pdf
Size:: 13.43Mb
Format:: PDF
Description:: Accepted version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record

Version	Item	Date	Summary
2	1721.1/134371.2	2022-06-24T17:38:44Z	Metadata Changed: Verified or entered author name and department authority metadata.
1	1721.1/134371*	2021-10-27T20:04:40Z

*Selected version

DSpace@MIT

Notice

Cross-Modal Scene Networks

Files in this item

This item appears in the following Collection(s)

Version History