Deep Nets: What have they ever done for Vision?

Yuille, Alan L.; Liu, Chenxi

Abstract

This is an opinion paper about the strengths and weaknesses of Deep Nets. They are at the center of recent progress in Artificial Intelligence and are of growing importance in Cognitive Science and Neuroscience, since they enable the development of computational models that can deal with a large range of visually realistic stimuli and visual tasks. They have clear limitations but they also have enormous successes. There is also a gradual, though incomplete, understanding of their inner workings. It seems unlikely that Deep Nets in their current form will be the best long-term solution either for building general-purpose intelligent machines or for understanding the mind/brain, but it is likely that many aspects of them will remain. At present Deep Nets do very well on specific types of visual tasks and on specific benchmarked datasets. But Deep Nets are much less general-purpose, flexible, and adaptive than the human visual system. Moreover, methods like Deep Nets may run into fundamental difficulties when faced with the enormous complexity of natural images. To illustrate our main points while keeping the number of references small, this paper is slightly biased towards work from our group.

Date issued

2018-05-10

URI

http://hdl.handle.net/1721.1/115292

Publisher

Center for Brains, Minds and Machines (CBMM)

Series/Report no.

CBMM Memo Series;088

Collections

CBMM Memo Series