ViTac: Feature Sharing Between Vision and Tactile Sensing for Cloth Texture Recognition

Luo, Shan; Yuan, Wenzhen; Adelson, Edward; Cohn, Anthony G.; Fuentes, Raul

Notice

This is not the latest version of this item. The latest version can be found at:https://dspace.mit.edu/handle/1721.1/137949.2

Author(s)

Luo, Shan; Yuan, Wenzhen; Adelson, Edward; Cohn, Anthony G.; Fuentes, Raul

DownloadSubmitted version (2.733Mb)

Open Access Policy

Terms of use

Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/

Metadata

Show full item record

Abstract

© 2018 IEEE. Vision and touch are two of the important sensing modalities for humans and they offer complementary information for sensing the environment. Robots could also benefit from such multi-modal sensing ability. In this paper, addressing for the first time (to the best of our knowledge) texture recognition from tactile images and vision, we propose a new fusion method named Deep Maximum Covariance Analysis (DMCA) to learn a joint latent space for sharing features through vision and tactile sensing. The features of camera images and tactile data acquired from a GelSight sensor are learned by deep neural networks. But the learned features are of a high dimensionality and are redundant due to the differences between the two sensing modalities, which deteriorates the perception performance. To address this, the learned features are paired using maximum covariance analysis. Results of the algorithm on a newly collected dataset of paired visual and tactile data relating to cloth textures show that a good recognition performance of greater than 90% can be achieved by using the proposed DMCA framework. In addition, we find that the perception performance of either vision or tactile sensing can be improved by employing the shared representation space, compared to learning from unimodal data.

Date issued

2018-05

URI

https://hdl.handle.net/1721.1/137949

Publisher

IEEE

Citation

Luo, Shan, Yuan, Wenzhen, Adelson, Edward, Cohn, Anthony G. and Fuentes, Raul. 2018. "ViTac: Feature Sharing Between Vision and Tactile Sensing for Cloth Texture Recognition."

Version: Original manuscript

Collections

MIT Open Access Articles

Version	Item	Date	Summary
2	1721.1/137949.2	2022-01-07T17:31:33Z	Authority information verified/added.
1	1721.1/137949*	2021-11-09T16:11:12Z

DSpace@MIT

Notice