Browsing Publications by Author "Rouditchenko, Andrew"
Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset
Palmer, Ian; Rouditchenko, Andrew; Barbu, Andrei; Katz, Boris; Glass, James (Center for Brains, Minds and Machines (CBMM), The 22nd Annual Conference of the International Speech Communication Association (Interspeech), 2021-08-30)
Visually-grounded spoken language datasets can enable models to learn cross-modal correspondences with very weak supervision. However, modern audio-visual datasets contain biases that undermine the real-world performance ...