
dc.contributor.author: Wang, Lichen
dc.contributor.author: Wu, Jiaxiang
dc.contributor.author: Huang, Shao-Lun
dc.contributor.author: Zheng, Lizhong
dc.contributor.author: Xu, Xiangxiang
dc.contributor.author: Zhang, Lin
dc.contributor.author: Huang, Junzhou
dc.date.accessioned: 2021-11-08T19:30:22Z
dc.date.available: 2021-11-08T19:30:22Z
dc.date.issued: 2019
dc.identifier.uri: https://hdl.handle.net/1721.1/137795
dc.description.abstract: Copyright © 2019, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. One primary focus in multimodal feature extraction is to find representations of the individual modalities that are maximally correlated. As a well-known measure of dependence, the Hirschfeld-Gebelein-Rényi (HGR) maximal correlation is an appealing objective because of its operational meaning and desirable properties. However, the strict whitening constraints formalized in the HGR maximal correlation limit its application. To address this problem, this paper proposes Soft-HGR, a novel framework for extracting informative features from multiple data modalities. Specifically, our framework avoids the "hard" whitening constraints while preserving the same feature geometry as the HGR maximal correlation. The Soft-HGR objective is straightforward, involving only two inner products, which guarantees efficiency and stability in optimization. We further generalize the framework to handle more than two modalities as well as missing modalities. When labels are partially available, we enhance the discriminative power of the feature representations through a semi-supervised adaptation. Empirical evaluation shows that our approach learns more informative feature mappings and is more efficient to optimize. [en_US]
dc.language.iso: en
dc.publisher: Association for the Advancement of Artificial Intelligence (AAAI) [en_US]
dc.relation.isversionof: 10.1609/AAAI.V33I01.33015281 [en_US]
dc.rights: Creative Commons Attribution-Noncommercial-Share Alike [en_US]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/4.0/ [en_US]
dc.source: arXiv [en_US]
dc.title: An Efficient Approach to Informative Feature Extraction from Multimodal Data [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Wang, Lichen, Wu, Jiaxiang, Huang, Shao-Lun, Zheng, Lizhong, Xu, Xiangxiang et al. 2019. "An Efficient Approach to Informative Feature Extraction from Multimodal Data." Proceedings of the AAAI Conference on Artificial Intelligence, 33.
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.relation.journal: Proceedings of the AAAI Conference on Artificial Intelligence [en_US]
dc.eprint.version: Author's final manuscript [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
eprint.status: http://purl.org/eprint/status/NonPeerReviewed [en_US]
dc.date.updated: 2021-01-25T19:11:05Z
dspace.orderedauthors: Wang, L; Wu, J; Huang, S-L; Zheng, L; Xu, X; Zhang, L; Huang, J [en_US]
dspace.date.submission: 2021-01-25T19:11:13Z
mit.journal.volume: 33 [en_US]
mit.license: OPEN_ACCESS_POLICY
mit.metadata.status: Authority Work and Publication Information Needed [en_US]
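
Note on the abstract: the "two inner products" it mentions refer to a Soft-HGR objective commonly stated as maximizing E[f(X)^T g(Y)] - (1/2) tr(cov(f(X)) cov(g(Y))) over zero-mean feature mappings f and g. The following is a minimal NumPy sketch of an empirical estimate of an objective of that form; the function name soft_hgr_score, the sample-covariance estimators, and the random example data are assumptions of this sketch rather than code from the paper (see the DOI above for the exact formulation).

import numpy as np

def soft_hgr_score(f, g):
    # Empirical estimate, for feature matrices f, g of shape (n_samples, k), of
    # E[<f(X), g(Y)>] - 0.5 * tr(cov(f) cov(g)) -- a Soft-HGR-style objective.
    # Higher is better; a training loss would negate this value.
    n = f.shape[0]
    # Center the features, matching the zero-mean constraints E[f] = E[g] = 0.
    f_c = f - f.mean(axis=0, keepdims=True)
    g_c = g - g.mean(axis=0, keepdims=True)
    # First inner product: mean of <f(x_i), g(y_i)> over paired samples.
    inner = (f_c * g_c).sum(axis=1).mean()
    # Second inner product: trace of the product of the two sample covariances.
    cov_f = f_c.T @ f_c / (n - 1)
    cov_g = g_c.T @ g_c / (n - 1)
    return inner - 0.5 * np.trace(cov_f @ cov_g)

# Example with random two-modality features: 128 paired samples, 16 dims each.
rng = np.random.default_rng(0)
f = rng.standard_normal((128, 16))
g = rng.standard_normal((128, 16))
print(soft_hgr_score(f, g))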

