Abstract
Few-shot learning aims to classify unseen classes with a few training examples. While recent works have shown that standard mini-batch training with carefully designed training strategies can improve generalization ability for unseen classes, well-known problems in deep networks such as memorizing training statistics have been less explored for few-shot learning. To tackle this issue, we propose self-augmentation that consolidates self-mix and self-distillation. Specifically, we propose a regional dropout technique called self-mix, in which a patch of an image is substituted into other values in the same image. With this dropout effect, we show that the generalization ability of deep networks can be improved as it prevents us from learning specific structures of a dataset. Then, we employ a backbone network that has auxiliary branches with its own classifier to enforce knowledge sharing. This sharing of knowledge forces each branch to learn diverse optimal points during training. Additionally, we present a local representation learner to further exploit a few training examples of unseen classes by generating fake queries and novel weights. Experimental results show that the proposed method outperforms the state-of-the-art methods for prevalent few-shot benchmarks and improves the generalization ability.
Original language | English |
---|---|
Pages (from-to) | 140-149 |
Number of pages | 10 |
Journal | Neural Networks |
Volume | 138 |
DOIs | |
Publication status | Published - 2021 Jun |
Bibliographical note
Publisher Copyright:© 2021 Elsevier Ltd
Keywords
- Classification
- Few-shot learning
- Generalization
- Knowledge distillation
ASJC Scopus subject areas
- Cognitive Neuroscience
- Artificial Intelligence