cross_domain_adaptation

Feature Transformation Ensemble Model with Batch Spectral Regularization for Cross-Domain Few-Shot Classification

setup: source domain with labeled data; target domain with data but no labels

Three main components:

Construct an ensemble prediction model by performing diverse feature transformations after a feature extraction network
1. A batch spectral regularization (BSR) mechanism: suppress all the singular values of the feature matrix during pre-training so that the pre-trained model avoids overfitting to the source domain and generalizes well to the target domain (a minimal code sketch follows the list below)
2. feature transformation ensemble model: build multiple predictors in projected diverse feature spaces to facilitate cross-domain adaptation and increase prediction robustness
3. to mitigate the shortage of labeled data in the target domain:

  • exploit the unlabeled query set in fine-tuning through entropy minimization
  • label propagation (LP) step to refine the original classification results
    $Y^{*} = (I - \alpha L)^{-1} \hat{Y}^{0}$
  • data augmentation techniques to augment both the few-shot and test instances from different angles to improve prediction performance
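
A minimal PyTorch sketch of two of the pieces above, based only on these descriptions: the BSR penalty is assumed to be the sum of squared singular values of a batch's feature matrix, and $L$ in the label-propagation formula is taken to be the symmetrically normalized affinity matrix over support and query features; `alpha` and the affinity construction are illustrative choices, not necessarily the paper's exact ones.

```python
import torch

def bsr_penalty(features: torch.Tensor) -> torch.Tensor:
    """Batch spectral regularization: penalize every singular value of the
    (batch_size x feature_dim) feature matrix so that no spectral direction
    dominates and the pre-trained features overfit less to the source domain."""
    return (torch.linalg.svdvals(features) ** 2).sum()

def label_propagation(affinity: torch.Tensor, y_hat0: torch.Tensor, alpha: float = 0.5) -> torch.Tensor:
    """Refine initial predictions with Y* = (I - alpha * L)^{-1} @ Y_hat0.
    affinity: (n, n) non-negative similarities over support + query samples.
    y_hat0:   (n, num_classes) initial (soft) classification results."""
    d_inv_sqrt = affinity.sum(dim=1).clamp(min=1e-8).rsqrt()
    L = d_inv_sqrt[:, None] * affinity * d_inv_sqrt[None, :]   # normalized graph
    n = affinity.size(0)
    return torch.linalg.solve(torch.eye(n) - alpha * L, y_hat0)

# hypothetical use during pre-training (lambda_bsr is an illustrative weight):
# loss = F.cross_entropy(logits, labels) + lambda_bsr * bsr_penalty(features)
```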

others:

penalizing smaller singular values of a feature matrix can help mitigate negative transfer in fine-tuning

Why multiply the features by different orthogonal matrices to obtain multiple diverse feature representation spaces?
(1) The algorithm performs well, but it stacks many techniques (with little connection between them) and provides no ablation study, so it is hard to tell which technique actually drives the improvement.

(2) Like the first paper, this paper also applies a feature transformation to the input, but with a new transformation scheme. Finding a good transformation scheme is therefore worthwhile. Good transformation schemes include ...

Cross-domain Self-supervised Learning for Domain Adaptation with Few Source Labels


setup: source domain with data and few labels; target domain with data but no labels

related

Semi-supervised learning techniques such as entropy minimization [9], pseudo-labeling [16], and Virtual Adversarial Training (VAT) [21] have often been used in domain adaptation (e.g., [17,31,44]).
domain adaptation methods such as [7,18,31] with few source labels.
Prior work accomplished this by using an adversarial domain classifier [7]
or Maximum Mean Discrepancy [19] to align the feature distributions of the two domains. Optimal transport [38] is often used to find a matching between two distributions, but it scales poorly [4] and is limited to finding matches within a batch [1].

main idea

learns features that are not only domain-invariant but also class-discriminative

work

captures apparent visual similarity with in-domain self-supervision in a domain-adaptive manner and performs cross-domain feature matching with across-domain self-supervision.
Our CDS consists of two objectives: (1) learning visual similarity with in-domain supervision and (2) cross-domain matching with across-domain supervision.
in-domain self-supervision encourages a model to learn discriminative features by separating every instance within a domain

  • Instance Discrimination [39] (ID): treating all the other images as negative pairs
  • measure the similarity of features in-domain, and then perform in-domain instance discrimination to learn visual similarity in each domain
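
A hedged sketch of the in-domain instance-discrimination objective described above: a non-parametric softmax over similarities to that domain's memory bank, with the instance's own slot as the positive; the temperature value and the use of a (stale) memory bank are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def instance_discrimination_loss(feat: torch.Tensor, indices: torch.Tensor,
                                 memory: torch.Tensor, temperature: float = 0.05) -> torch.Tensor:
    # feat:    (batch, dim) features of images from ONE domain
    # indices: (batch,) position of each image in that domain's memory bank
    # memory:  (num_images, dim) memory bank of features for the same domain
    # Each image is treated as its own class: its memory slot is the positive,
    # every other slot is a negative pair.
    feat = F.normalize(feat, dim=1)
    bank = F.normalize(memory, dim=1)
    logits = feat @ bank.t() / temperature
    return F.cross_entropy(logits, indices)
```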

across-domain self-supervision enables better knowledge transfer from the source domain to the target domain by performing instance-to-instance matching across domains.

  • measure similarity between a feature and cross-domain features from the cross-domain memory bank and then minimize the entropy for cross-domain matching

separate source and target memory banks store the instance features used for these comparisons
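
A matching sketch of the across-domain objective, under the same assumptions: similarities to the other domain's memory bank are turned into a distribution whose entropy is minimized, encouraging each feature to commit to a few cross-domain matches.

```python
import torch
import torch.nn.functional as F

def cross_domain_matching_loss(feat: torch.Tensor, cross_memory: torch.Tensor,
                               temperature: float = 0.05) -> torch.Tensor:
    # feat:         (batch, dim) features from one domain
    # cross_memory: (num_items, dim) memory bank of the OTHER domain
    feat = F.normalize(feat, dim=1)
    bank = F.normalize(cross_memory, dim=1)
    p = F.softmax(feat @ bank.t() / temperature, dim=1)    # distribution over cross-domain items
    entropy = -(p * torch.log(p + 1e-8)).sum(dim=1)
    return entropy.mean()                                  # minimize entropy of the matching
```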

other

This resembles a certain DG paper: two networks run at the same time, one pushing features apart and one pulling them together; here, however, self-supervision is added and the setting is different.

Cross-Domain Few-Shot Learning with Meta Fine-Tuning

In cross-domain few-shot learning, the model has to be fine-tuned on tasks from the new domain; the authors want to find a set of initialization weights that is well suited to this fine-tuning.

To this end, they train with a meta-learning procedure, yielding a meta fine-tuning model.
explore the integration of transfer learning (fine-tuning) with meta-learning algorithms, to train a network that has specific layers that are designed to be adapted at a later fine-tuning stage

main idea

modify the episodic training process to include a first-order MAML-based meta-learning algorithm, and use a Graph Neural Network model as the subsequent meta-learning module to compare the feature vectors.

work

The meta fine-tuning framework follows MAML and consists of two steps:

(1) Fine-tuning

Fine-tune the model on the support set (only the last k layers, as is customary in transfer learning) (step 1).

(2) Predict with the fine-tuned model

Use the fine-tuned model to predict on the query set, compute the loss, back-propagate, and update the parameters of the original model (step 2).

Repeating these two steps, the model learns a set of initialization parameters that fine-tune well on new tasks.
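
A minimal first-order sketch of this two-step loop (the optimizer choice, `inner_lr`, `inner_steps`, and `last_k` are illustrative; the GNN metric module the paper adds on top is omitted here):

```python
import copy
import torch
import torch.nn.functional as F

def meta_fine_tune_step(model, support_x, support_y, query_x, query_y,
                        meta_optimizer, inner_lr=0.01, inner_steps=5, last_k=2):
    # Step 1: fine-tune a copy of the model on the support set, updating only
    # the last `last_k` parameter tensors (the transfer-learning habit above).
    adapted = copy.deepcopy(model)
    inner_opt = torch.optim.SGD(list(adapted.parameters())[-last_k:], lr=inner_lr)
    for _ in range(inner_steps):
        inner_opt.zero_grad()
        F.cross_entropy(adapted(support_x), support_y).backward()
        inner_opt.step()

    # Step 2: predict on the query set with the fine-tuned copy and apply the
    # resulting (first-order) gradients to the ORIGINAL model's parameters.
    query_loss = F.cross_entropy(adapted(query_x), query_y)
    grads = torch.autograd.grad(query_loss, list(adapted.parameters()), allow_unused=True)
    meta_optimizer.zero_grad()
    for p, g in zip(model.parameters(), grads):
        if g is not None:
            p.grad = g.clone()
    meta_optimizer.step()
    return query_loss.item()
```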

relevant

Graph-based convolutions can create more flexible representations of data beyond a simple Euclidean space [1].

other

Not particularly valuable.

Cross-domain few-shot classification via learned feature-wise transformation


main idea

use feature-wise transformation layers to augment the image features with affine transforms, simulating various feature distributions from different domains during training.
tackle the domain generalization problem of recognizing novel categories in the few-shot classification setting

work

  • feature-wise transformation layers
  • learning-to-learn method to optimize the hyper-parameters of the FT layers (an FT-layer sketch follows the steps below)
    (1) Take several domains (i.e., several datasets; the paper uses five).

(2) Pick one of them as the unseen domain; the rest serve as seen domains.

(3) Make a small modification to a standard ProtoNet: insert an FT layer after every batch normalization layer.

(4) Compute the loss on the seen domains, back-propagate, and update all parameters except those of the FT layers (θe and θm in the paper's figure).

(5) Remove the FT layers.

(6) Compute the loss on the unseen domain, back-propagate, and update only the FT parameters (θf).
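
A hedged sketch of such an FT layer, assuming the per-channel affine parameters are sampled from Gaussians whose softplus-transformed standard deviations are the learnable hyper-parameters θf; the initial values and the per-sample sampling granularity are illustrative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FeatureWiseTransform(nn.Module):
    """Inserted after a batch-norm layer. During training it perturbs the
    activation map with a randomly sampled per-channel affine transform to
    simulate feature distributions of other domains; at evaluation time it is
    a no-op, matching step (5) above."""
    def __init__(self, num_channels: int):
        super().__init__()
        # Learnable hyper-parameters (theta_f in the notes), to be optimized by
        # the outer learning-to-learn loop on the pseudo-unseen-domain loss.
        self.theta_gamma = nn.Parameter(torch.full((num_channels,), 0.3))
        self.theta_beta = nn.Parameter(torch.full((num_channels,), 0.5))

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (B, C, H, W)
        if not self.training:
            return x
        b, c = x.size(0), x.size(1)
        gamma = 1.0 + torch.randn(b, c, 1, 1, device=x.device) * F.softplus(self.theta_gamma).view(1, c, 1, 1)
        beta = torch.randn(b, c, 1, 1, device=x.device) * F.softplus(self.theta_beta).view(1, c, 1, 1)
        return gamma * x + beta
```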

questions

(1) Why remove the FT layers on the unseen task during training?

My view: removing the FT layers pulls the source domain toward the target domain; keeping them would pull the source toward the target and the target toward the source at the same time.

However, the authors' motivation is not to bring the source and target domains closer (even though that is effectively what happens); their goal is to make the source-domain image features more diverse and thus avoid overfitting. So when target-domain data is fed into the model, there is no need to diversify its features as well.

Thought: if the FT layers were kept, perhaps the source and target domains could be pulled toward each other, which might work even better.

(2) Why not update the FT parameters on the source domain as well?

Because the FT layers serve the target domain; only on the target domain can we judge whether FT actually mitigates overfitting.

(3) Is the authors' way of updating the FT parameters reasonable?

The authors remove the FT layers on the unseen task, yet update the FT parameters with the loss on that unseen task, and the update rule is rather crude.

So this method is not particularly well justified; a more reasonable scheme should be sought.

takeaways

(1) For any method that needs hyper-parameters, one can try meta-learning a suitable set of hyper-parameters.

(2) Rather than transferring the model from one domain to another, the authors use meta-learning to transfer from many domains to another domain, effectively learning how to transfer.

related

metric-based meta-learning methods (Garcia & Bruna, 2018; Sung et al., 2018; Vinyals et al., 2016; Snell et al., 2017; Oreshkin et al., 2018) have received considerable attention due to their simplicity and effectiveness.
few-shot, DA, DG, data augmentation, conditional normalization, regularization

Large Margin Mechanism and Pseudo Query Set on Cross-Domain Few-Shot Learning

work

  1. First, we propose the pseudo query set and analyze its importance.
    Generate a pseudo query set to fine-tune the model.
    At the meta-test stage, each image in the support set is augmented to generate multiple pseudo query images (these images are labeled; each keeps the label of the image it was augmented from). With the pseudo query set, the few-shot model can update its parameters during fine-tuning, just as it would with an ordinary query set during meta-training.

  2. Improve the original prototypical network with a triplet loss
    The original ProtoNet only pulls samples of each class toward their prototype (class centroid).
    The authors instead use a triplet loss that pulls samples of a class closer to their own prototype while pushing them away from the prototypes of the other classes; the loss becomes (a code sketch follows this list):
    $\text{PT Loss} = \sum_{i=1}^{N \times K} \sum_{j=1}^{N} \mathbb{1}\left(c_{s_i} \neq j\right)\,\text{triplet}\left(s_i, p_{c_{s_i}}, p_j\right)$

  3. Large margin mechanism
    The authors further apply a large margin mechanism to sharpen the classifier's discriminative power.
    (This technique is mostly used in face recognition; the authors argue that the face-recognition setting resembles the meta-learning one.)
    Following the earlier analysis, if $|w_1||x|\cos(\theta_1) > |w_2||x|\cos(\theta_2)$, the softmax assigns the feature $x$ to class 1. During training we can therefore use the sample labels to artificially make classification harder by shrinking $\cos(\theta)$; the decision condition becomes (if $x$ belongs to class 1):
    $|w_1||x|\cos(m\theta_1) > |w_2||x|\cos(\theta_2)$
    Enlarging the angle makes training harder, forcing samples away from the original decision boundary while tightening each class, which yields new, wider-margin decision boundaries between classes.
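
A sketch of the PT loss above, using a standard hinge triplet on Euclidean distances (the margin value and the distance metric are assumptions):

```python
import torch
import torch.nn.functional as F

def pt_triplet_loss(support: torch.Tensor, support_labels: torch.Tensor,
                    prototypes: torch.Tensor, margin: float = 1.0) -> torch.Tensor:
    # support:        (N*K, dim) support embeddings s_i
    # support_labels: (N*K,) class index c_{s_i} of each support sample
    # prototypes:     (N, dim) class prototypes p_j
    # For every support sample, the anchor is s_i, the positive is its own
    # prototype p_{c_{s_i}}, and every other prototype p_j (j != c_{s_i}) is a negative.
    d = torch.cdist(support, prototypes)                       # (N*K, N) distances
    pos = d.gather(1, support_labels.view(-1, 1))              # distance to own prototype
    neg_mask = torch.ones_like(d).scatter_(1, support_labels.view(-1, 1), 0.0)
    return (F.relu(pos - d + margin) * neg_mask).sum()         # hinge over all other prototypes
```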

other

(1) The authors first use data augmentation to generate pseudo-labeled query images, then add two techniques that increase the classifier's discriminative power, and obtain good results. But it is unclear which of these mechanisms actually does the work.

(2) Besides generating pseudo labels, for a 5-way 5-shot task we could perhaps use 4 shots per class as the support set and the remaining shot as a query set to update the parameters.

related

Y. Guo, N. C. F. Codella, L. Karlinsky, J. R Smith, T. Rosing, and R. Feris. A new benchmark for evaluation of cross-domain few-shot learning. arXiv preprint arXiv:1912.07200, 2019.

Self-Supervised Prototypical Transfer Learning for Few-Shot Classification

setup: the support set is unlabeled

This paper points out that the small amount of support-set data (i.e., the very small batch size) is what causes overfitting.

Finding a way to enlarge the batch size is therefore the key.

The paper's way of enlarging the batch size is still fairly ordinary (simple data augmentation, already used by others last year); there is room for further innovation.

related

However, several works (Chen et al., 2019; Guo et al., 2019) show that common (non-episodical) transfer learning outperforms meta-learning methods on the realistic cross-domain setting, where training and novel classes come from different distributions.
unsupervised meta-learning approaches, constructing episodes via pseudo-labeling (Hsu et al., 2019; Ji et al., 2019) or image augmentations (Khodadadeh et al., 2019; Antoniou and Storkey, 2019; Qin et al., 2020), have addressed this problem.
In this, we draw inspiration from recent progress in unsupervised meta-learning (Khodadadeh et al., 2019) and self-supervised visual contrastive learning of representations (Chen et al., 2020; Ye et al., 2019).
