Lan, Shiyi and Yu, Zhiding and Choy, Christopher and Radhakrishnan, Subhashree and Liu, Guilin and Zhu, Yuke and Davis, Larry S. and Anandkumar, Anima (2021) DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE , Piscataway, NJ, pp. 3386-3396. ISBN 978-1-6654-2812-5. https://resolver.caltech.edu/CaltechAUTHORS:20210831-203854134
![]() |
PDF
- Submitted Version
See Usage Policy. 2MB |
Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20210831-203854134
Abstract
We introduce DiscoBox, a novel framework that jointly learns instance segmentation and semantic correspondence using bounding box supervision. Specifically, we propose a self-ensembling framework where instance segmentation and semantic correspondence are jointly guided by a structured teacher in addition to the bounding box supervision. The teacher is a structured energy model incorporating a pairwise potential and a cross-image potential to model the pairwise pixel relationships both within and across the boxes. Minimizing the teacher energy simultaneously yields refined object masks and dense correspondences between intra-class objects, which are taken as pseudo-labels to supervise the task network and provide positive/negative correspondence pairs for dense contrastive learning. We show a symbiotic relationship where the two tasks mutually benefit from each other. Our best model achieves 37.9% AP on COCO instance segmentation, surpassing prior weakly supervised methods and is competitive to supervised methods. We also obtain state of the art weakly supervised results on PASCAL VOC12 and PF-PASCAL with real-time inference.
Item Type: | Book Section | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Related URLs: |
| |||||||||
ORCID: |
| |||||||||
Additional Information: | © 2021 IEEE. Work done during an internship at NVIDIA Research. | |||||||||
Funders: |
| |||||||||
DOI: | 10.1109/ICCV48922.2021.00339 | |||||||||
Record Number: | CaltechAUTHORS:20210831-203854134 | |||||||||
Persistent URL: | https://resolver.caltech.edu/CaltechAUTHORS:20210831-203854134 | |||||||||
Official Citation: | S. Lan et al., "DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision," 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 3386-3396, doi: 10.1109/ICCV48922.2021.00339 | |||||||||
Usage Policy: | No commercial reproduction, distribution, display or performance rights in this work are provided. | |||||||||
ID Code: | 110644 | |||||||||
Collection: | CaltechAUTHORS | |||||||||
Deposited By: | George Porter | |||||||||
Deposited On: | 01 Sep 2021 14:54 | |||||||||
Last Modified: | 26 Jul 2022 22:02 |
Repository Staff Only: item control page