Neglected Free Lunch – Learning Image Classifiers Using Annotation Byproducts | ICCV'23 Paper
Dongyoon Han1*, Junsuk Choe2*, Seonghyeok Chun3, John Joon Young Chung4
Minsuk Chang5, Sangdoo Yun1, Jean Y. Song6, Seong Joon Oh7†
* Equal contribution † Corresponding author
1 NAVER AI LAB 2 Sogang University 3 Dante Company 4 University of Michigan 5 NAVER AI LAB, currently at Google 6 DGIST 7 University of Tübingen
Supervised learning of image classifiers distills human knowledge into a parametric model f through pairs of images and corresponding labels (X,Y). We argue that this simple and widely used representation of human knowledge neglects rich auxiliary information from the annotation procedure, such as the time-series of mouse traces and clicks.
Our insight is that such annotation byproducts Z provide approximate human attention that weakly guides the model to focus on the foreground cues, reducing spurious correlations and discouraging shortcut learning.
We have created ImageNet-AB and COCO-AB to verify this:
They are ImageNet and COCO training sets enriched with sample-wise annotation byproducts, collected by replicating the respective original annotation tasks.
We refer to the new paradigm of training models with annotation byproducts as learning using annotation byproducts (LUAB).
We show that a simple multitask loss for regressing Z together with Y already improves the generalisability and robustness of the learned models. Compared to the original supervised learning, LUAB does not require extra annotation costs.
We provide example dataloaders for the annotation byproducts.
- Dataloader for ImageNet-AB: imagenet_dataloader.ipynb
- Dataloader for COCO-AB: coco_dataloader.ipynb
- Annotation tool for ImageNet: github.com/naver-ai/imagenet-annotation-tool
- Annotation tool for COCO: github.com/naver-ai/coco-annotation-tool
MIT License
Copyright (c) 2023-present NAVER Cloud Corp.
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
@inproceedings{han2023iccv,
title = {Neglected Free Lunch – Learning Image Classifiers Using Annotation Byproducts},
author = {Han, Dongyoon and Choe, Junsuk and Chun, Seonghyeok and Chung, John Joon Young and Chang, Minsuk and Yun, Sangdoo and Song, Jean Y. and Oh, Seong Joon},
booktitle = {International Conference on Computer Vision (ICCV)},
year = {2023}
}