Self-augmentation: Generalizing deep networks to unseen classes for few-shot learning

Jin Woo Seo, Hong Gyu Jung, Seong Whan Lee

    Research output: Contribution to journalArticlepeer-review

    39 Citations (Scopus)

    Abstract

    Few-shot learning aims to classify unseen classes with a few training examples. While recent works have shown that standard mini-batch training with carefully designed training strategies can improve generalization ability for unseen classes, well-known problems in deep networks such as memorizing training statistics have been less explored for few-shot learning. To tackle this issue, we propose self-augmentation that consolidates self-mix and self-distillation. Specifically, we propose a regional dropout technique called self-mix, in which a patch of an image is substituted into other values in the same image. With this dropout effect, we show that the generalization ability of deep networks can be improved as it prevents us from learning specific structures of a dataset. Then, we employ a backbone network that has auxiliary branches with its own classifier to enforce knowledge sharing. This sharing of knowledge forces each branch to learn diverse optimal points during training. Additionally, we present a local representation learner to further exploit a few training examples of unseen classes by generating fake queries and novel weights. Experimental results show that the proposed method outperforms the state-of-the-art methods for prevalent few-shot benchmarks and improves the generalization ability.

    Original languageEnglish
    Pages (from-to)140-149
    Number of pages10
    JournalNeural Networks
    Volume138
    DOIs
    Publication statusPublished - 2021 Jun

    Bibliographical note

    Publisher Copyright:
    © 2021 Elsevier Ltd

    Keywords

    • Classification
    • Few-shot learning
    • Generalization
    • Knowledge distillation

    ASJC Scopus subject areas

    • Cognitive Neuroscience
    • Artificial Intelligence

    Fingerprint

    Dive into the research topics of 'Self-augmentation: Generalizing deep networks to unseen classes for few-shot learning'. Together they form a unique fingerprint.

    Cite this