Skip to main navigation Skip to search Skip to main content

TASKNORM: Rethinking Batch Normalization for Meta-Learning

  • John Bronskill
  • , Jonathan Gordon
  • , James Requeima
  • , Sebastian Nowozin
  • , Richard E. Turner

Research output: Contribution to journalConference articlepeer-review

8 Scopus citations

Abstract

Modern meta-learning approaches for image classification rely on increasingly deep networks to achieve state-of-the-art performance, making batch normalization an essential component of meta-learning pipelines. However, the hierarchical nature of the meta-learning setting presents several challenges that can render conventional batch normalization ineffective, giving rise to the need to rethink normalization in this setting. We evaluate a range of approaches to batch normalization for meta-learning scenarios, and develop a novel approach that we call TASKNORM. Experiments on fourteen datasets demonstrate that the choice of batch normalization has a dramatic effect on both classification accuracy and training time for both gradient basedand gradientfree meta-learning approaches. Importantly, TAS-KNORM is found to consistently improve performance. Finally, we provide a set of best practices for normalization that will allow fair comparison of meta-learning algorithms.

Original languageEnglish
JournalProceedings of Machine Learning Research
Volume119
StatePublished - 1 Jan 2020
Externally publishedYes
Event37th International Conference on Machine Learning, ICML 2020 - Virtual, Online
Duration: 13 Jul 202018 Jul 2020

ASJC Scopus subject areas

  • Software
  • Control and Systems Engineering
  • Statistics and Probability
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'TASKNORM: Rethinking Batch Normalization for Meta-Learning'. Together they form a unique fingerprint.

Cite this