[CVPR 2019 论文笔记] Semantic Projection Network for Zero- and Few-Label Semantic Segmentation

Zero- and Few-Label Semantic Segmentation

[CVPR 2019 论文笔记] Semantic Projection Network for Zero- and Few-Label Semantic Segmentation_第1张图片Figure 1: We propose (generalized) zero- and few-label semantic segmentation tasks, i.e. segmenting classes whose labels are not seen by the model during training or the model has a few labeled samples of those classes. To tackle these tasks, we propose a model that transfers knowledge from seen classes to unseen classes using side information, e.g. semantic word embedding trained on free text corpus.

思路

在分割网络中嵌入类别语义信息,使用辅助信息(例如基于文本语料库训练得到的语义词嵌入)将已见类的知识迁移到未见类别。

[CVPR 2019 论文笔记] Semantic Projection Network for Zero- and Few-Label Semantic Segmentation_第2张图片

Figure 2: Our zero-label and few-label semantic segmentation model, i.e. SPNet, consists of two steps: visual semantic embedding and semantic projection. Zero-label semantic segmentation is drawn as an instance of our model. Replacing different components of SPNet, four tasks are addressed (Solid/dashed lines show the training/test procedures respectively).

两步:
一、视觉语义映射;
二、语义映射

域漂移校正
The extreme case of the imbalanced data problem occurs when there is no labeled training images of unseen classes, and this results in predictions being biased to seen classes. To fix this issue, we follow [8] and calibrate the prediction by reducing the scores of seen classes, which leads to:

(5) arg ⁡ max ⁡ u ∈ S ∪ U p ( y ^ i j = u ∣ x ; [ W s ; W u ] ) − γ I [ u ∈ S ] \arg\max_{u ∈ \mathcal {S∪U}} p({\hat y}_{ij} = u | x; [W_s ; W_u]) − γI[u ∈ \mathcal{S}] \tag 5 arguSUmaxp(y^ij=ux;[Ws;Wu])γI[uS](5)

where I = 1 I= 1 I=1 if u u u is a seen class and 0 otherwise, γ ∈ [ 0 , 1 ] γ∈[0,1] γ[0,1] is the calibration factor tuned on a held-out validation set.

在分类任务中,一整图片对应一个类别,语义信息有对应的视觉区域。那么,在分割任务中,每个类别的所有像素似乎无差异,带有语义信息的视觉区域不明显。

实验

词向量的效果

[CVPR 2019 论文笔记] Semantic Projection Network for Zero- and Few-Label Semantic Segmentation_第3张图片

网络结构的效果

[CVPR 2019 论文笔记] Semantic Projection Network for Zero- and Few-Label Semantic Segmentation_第4张图片

对象大小的效果

[CVPR 2019 论文笔记] Semantic Projection Network for Zero- and Few-Label Semantic Segmentation_第5张图片
Figure 3: mIoU of unseen classes on COCO-Stuff ordered wrt average object size (left to right).

GZSL结果

[CVPR 2019 论文笔记] Semantic Projection Network for Zero- and Few-Label Semantic Segmentation_第6张图片
Figure 4: GZLSS results on COCO-Stuff and PASCALVOC. We report mean IoU of unseen classes, seen classes and their harmonic mean (perception model is based on ResNet101 and the semantic embedding is ft + w2v). SPNet-C represents SPNet with calibration.

Generalized Zero-Shot Image Classification

[CVPR 2019 论文笔记] Semantic Projection Network for Zero- and Few-Label Semantic Segmentation_第7张图片

Few-Label Semantic Segmentation Task

[CVPR 2019 论文笔记] Semantic Projection Network for Zero- and Few-Label Semantic Segmentation_第8张图片

定性结果

[CVPR 2019 论文笔记] Semantic Projection Network for Zero- and Few-Label Semantic Segmentation_第9张图片

你可能感兴趣的:(paper)