PAMI-2022
最新成果:https://github.com/ZitongYu/DeepFAS
人脸识别系统,automatic face recognition (AFR) system:
人脸活检:
人脸活检方法分类:
(1)传统方法
Most traditional algorithms are designed based on human liveness cues and handcrafted features
liveness cues 包括但不限于:
classical handcrafted descriptors designed for extracting effective spoofing patterns from various color spaces(RGB, HSV, and YCbCr)
(2)深度学习的方法
攻击类型,automatic face recognition (AFR) system 经常分为这两类
本文重点讨论 PAs——misleads the real-world AFR systems via presenting face upon physical mediums in front of the imaging sensors
PAs 不同切入角度有不同的分类形式
根据 attackers’ intention
根据 geometry property
根据 facial region covering
Sensor:
数据集(prevailing public FAS datasets):data amount, subject numbers, modality / sensor, environmental setup, and attack types.
samples(图片数) and subjects(IDs)
公开数据发展的趋势:
评价指标
测试方式(evaluation protocols):
open-set problem in practice,需要考虑 unseen domain generalization
有如下三种混合形式
(1)crafted features highly rely on the expert knowledge and not learnable, which are inefficient once enough training data are available;
(2)there might be feature gaps/incompatibility between handcrafted and deep features, resulting in performance saturation.
下面展开说说
treat FAS as a binary classification problem (e.g., ‘0’ for live while ‘1’ for spoofing faces, or vice versa)
bonafide versus PA
与常见的二分类视觉任务的不同点在于
very challenging to distinguish by even human eyes,性别二分类关注的是 semantic features,活检关注的是 arbitrary and unfaithful clues (e.g., screen bezel) for spoofing patterns,such intrinsic live/spoof clues are usually closely related with some position-aware auxiliary tasks.
存在的缺点:
these supervision signals only provide global (spatial/temporal) constraints for live/spoof embedding learning, which may causes FAS models to easily overfit to unfaithful patterns.
usually black-box and the characteristic of their learned features are hard to understand
(1)Pixel-Wise Supervision With Auxiliary Task
provide more fine-grained and contextual task-related clues for better intrinsic feature learning,而不是 unfaithful patterns (e.g., screen bezel).
auxiliary supervision signals:
存在的缺点:
(2)Pixel-Wise Supervision With Generative Model
usually relaxes the expert-designed hard constraints (e.g.,auxiliary tasks), and leaves the decoder to reconstruct more natural spoof-related trace.
The generated spoof patterns are visually insightful, and are challenging to manually describe with human prior knowledge.
缺点
such soft pixel-wise supervision might easily fall into the local optimum and overfit on unexpected interference (e.g., sensor noise),
解决方式之一
Pixel-Wise Supervision With Generative Model + Pixel-Wise Supervision With Auxiliary Task
Traditional end-to-end deep learning 缺点,下面场景会翻车
Generalized Deep Learning Method
Domain adaptation(DA) vs Domain Generalization(DG)
一个需要无标签的 target domain 数据,一个不需要 target domain 的数据
(1)Domain adaptation(DA)
The distribution of source and target features are usually matched in a learned feature space
minimize the distribution discrepancy between the source and the target domain by utilizing unlabeled target data,
缺点
(2)Domain Generalization(DG)
缺点
domain generalization benefits FAS models to perform well in unseen domain, but it is still unknown whether it deteriorates the discrimination capability for spoofing detection under the seen scenarios.
(1)Zero/Few-Shot Learning
缺点
few-shot learning 在 zero-shot case 场景会翻车
the failed detection usually occurs in the challenging attack types (e.g., transparent mask, funny eye, and makeup), which share similar appearance distribution with the bonafide
(2)Anomaly Detection
first trains a reliable one-class classifier to accurately cluster the live samples. Then any samples (e.g., unknown attacks) outside the margin of the live sample cluster would be detected as attacks
缺点
suffer from discrimination degradation compared with conventional live/spoof classification in the real-world open-set scenarios (i.e., both known and unknown attacks).
绿色框还有个 medium 的评价,P < M < G < VG
NIR (900 to 1800nm), poor imaging quality in long distance
SWIR(940nm and 1450nm)
dynamic flash is sensitive under outdoor environments and is not user-friendly due to the long temporal activation time
(1)Multi-Modal Fusion
(2)Cross-Modal Translation
pseudo modalities could be generated via cross-modality translation
missing modal data for multi-modal FAS
the limitations of the current development
automatically search and find the best-suited temporal architectures especially for multi-modal usage
rich temporal context vs binary or pixel-wise supervision
More advanced feature visualization manners and fine-grained pixel-wise spoof segmentation should be developed for interpretable FAS
transfer learning——缓解过拟合
disentangled learning——disentangle the intrinsic spoofing clues from the noisy representation
metric learning
self-supervised and semi-supervised learning
GrandTest
AFR-aware and FAS-aware
digital and physical attack types
federated learning
向「假脸」说 No:用OpenCV搭建活体检测器
活体检测的方法有很多,包括:
纹理分析(Texture analysis),该方法计算了面部区域的局部二值模式(Local Binary Patterns,LBP),用 SVM 将面部分为真实面部和伪造面部;
频率分析(Frequency analysis),比如检查面部的傅立叶域;
可变聚焦分析(Variable focusing analysis),例如检查连续两帧间像素值的变化;
启发式算法(Heuristic-Based algorithms),包括眼球运动、嘴唇运动和眨眼检测。这些算法试图追踪眼球运动和眨眼行为,来确保用户不是拿着谁的照片(因为照片不会眨眼也不会动嘴唇);
光流算法(Optical Flow algorithm),即检测 3D 对象和 2D 平面产生的光流的属性和差异;
3D 面部形状(3D face shape),类似于 iPhone 上的面部识别系统,这种算法可以让面部识别系统区分真实面部和其他人的照片或打印出来的图像;
暂时下载不到的文章 for free
阅读笔记
【DDFD】《Multi-view Face Detection Using Deep Convolutional Neural Networks》(ICMR-2015)
【IoU Loss】《UnitBox: An Advanced Object Detection Network》(ACM MM-2016)
【FAS】《Face Anti-Spoofing Using Patch and Depth-Based CNNs》(IJCB-2017)
【MLFP】《Face Presentation Attack with Latex Masks in Multispectral Videos》(CVPRW-2017)
【Face Detection】《Face Detection using Deep Learning: An Improved Faster RCNN Approach》(Neurocomputing-2018)
【GDConv】《MobileFaceNets:Efficient CNNs for Accurate RealTime Face Verification on Mobile Devices》(CCBR-2018)
【CASIA-SURF】《A Dataset and Benchmark for Large-scale Multi-modal Face Anti-spoofing》(CVPR-2019)
【FAS-FRN】《Recognizing Multi-modal Face Spoofing with Face Recognition Networks》(CVPR-2019 workshop)
【FaceBagNet】《FaceBagNet:Bag-of-local-features Model for Multi-modal Face Anti-spoofing》(CVPR-2019 workshop)
【FeatherNets】《FeatherNets:Convolutional Neural Networks as Light as Feather for Face Anti-spoofing》(CVPR-2019 workshop)
【WMCA】《Biometric Face Presentation Attack Detection with Multi-Channel Convolutional Neural Network》(TIFS-2019)
【WebFace260M】《WebFace260M:A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition》(CVPR-2021)
【EMFace】《EMface: Detecting Hard Faces by Exploring Receptive Field Pyramids》(arXiv-2021)