[Paper note] Aggregated Residual Transformations for Deep Neural Networks

  • paper
  • Author: Saining Xie, Ross Girshick, Piotr Dollar´, Zhuowen Tu, Kaiming He

Model

  • Solution space of Inception architecture is a strict subspace of the solution space of a single large layer (e.g., 5×5) operating on a high-dimensional embedding.
  • The transformations to be aggregated are all of the same topology.
  • Equivalent forms:
    [Paper note] Aggregated Residual Transformations for Deep Neural Networks_第1张图片
  • This strategy exposes a new dimension, which we call “cardinality” (the size of the set of transformations), as an essential factor in addition to the dimensions of depth and width.

Experiemnt

  • See original paper for training details.
  • Increasing cardinality is more efficient than increasing depth/width.
  • ImageNet
    • 224x224: 20.4% top-1, 5.3% top-5
    • 320x320/crop-to-299x299: 19.1% top-1, 4.4 top-5
  • ImageNet 5K (5000 class)
  • CIFAR
    • -10 3.58%
    • -100 17.31%
  • COCO detection

你可能感兴趣的:(paper-note)