dc.contributor.author | Zlateski, Aleksandar | |
dc.contributor.author | Jaroensri, Ronnachai | |
dc.contributor.author | Sharma, Prafull | |
dc.contributor.author | Durand, Frederic | |
dc.date.accessioned | 2020-03-30T13:45:05Z | |
dc.date.available | 2020-03-30T13:45:05Z | |
dc.date.issued | 2018-12 | |
dc.identifier.isbn | 9781538664209 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/124403 | |
dc.description.abstract | Convolutional networks (ConvNets) have become the dominant approach to semantic image segmentation. Producing accurate, pixel-level labels required for this task is a tedious and time consuming process; however, producing approximate, coarse labels could take only a fraction of the time and effort. We investigate the relationship between the quality of labels and the performance of ConvNets for semantic segmentation. We create a very large synthetic dataset with perfectly labeled street view scenes. From these perfect labels, we synthetically coarsen labels with different qualities and estimate human-hours required for producing them. We perform a series of experiments by training ConvNets with a varying number of training images and label quality. We found that the performance of ConvNets mostly depends on the time spent creating the training labels. That is, a larger coarsely-annotated dataset can yield the same performance as a smaller finely-annotated one. Furthermore, fine-tuning coarsely pre-trained ConvNets with few finely-annotated labels can yield comparable or superior performance to training it with a large amount of finely-annotated labels alone, at a fraction of the labeling cost. We demonstrate that our result is also valid for different network architectures, and various object classes in an urban scene. | en_US |
dc.language.iso | en | |
dc.publisher | IEEE | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1109/cvpr.2018.00160 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | Computer Vision Foundation | en_US |
dc.title | On the Importance of Label Quality for Semantic Segmentation | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Zlateski, Aleksandar et al. "On the Importance of Label Quality for Semantic Segmentation." 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, IEEE, December 2018. © 2018 IEEE. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.relation.journal | 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2019-05-29T13:33:27Z | |
dspace.date.submission | 2019-05-29T13:33:28Z | |
mit.metadata.status | Complete | |