Yu Zhang, Xiaoguang Di, Bin Zhang, and Chunhui Wang
本文用信息熵理论和Retinex 模型构建了一个自监督图像增强网络,实现了弱光图像中,亮度与反射系数的分解。其中,最为重要的约束是:反射系数的最大通道与弱光图像的最大通道是conform的,并且前者的熵在本文的模型中应该是最大的。
问题:什么是最大通道?为什么 conform?为什么熵最大?
This paper proposes a self-supervised low light image enhancement method based on deep learning.
Inspired by information entropy theory and Retinex model, we proposed a maximum entropy based Retinex model. With this model, a very simple network can separate the illumination and reflectance, and the network can be trained with low light images only. We introduce a constraint that the maximum channel of the reflectance conforms to the maximum channel of the low light image and its entropy should be largest in our model to achieve self-supervised learning.
Our model is very simple and does not rely on any well-designed data set (even one low light image can complete the training). The network only needs minute-level training to achieve image enhancement. It can be proved through experiments that the proposed method has reached the state-of-the-art in terms of processing speed and effect.
As for low light image enhancement task, in the previous work, some solutions such as synthesizing low light images [7], using different exposure time images to obtain data [8], and so on, have achieved good visual effects. However, there are still two problems with those methods. One is how to ensure that the pre-trained network can be used for images collected from different devices, different scenes, and different lighting conditions rather than building new training data set. The other is how to determine whether the normal light image used for supervision is the best, there can be lots of normal light images for a low light image. Usually, the builder of the data set gets the normal light images by experience or artificial adjustment, which will cost lots of time and energy and we cannot make sure that the enhanced image can show the information contained in the low light image to the greatest extent with those normal light images.
传统基于数据集网络模型的两个问题:
采用了三个损失函数,分别如下:
Retinex model : S = R ◦ I
reconstruction loss : lrcon = || S − R ◦ I ||1
reflectance loss : lR =|| max c∈R,G,B Rc − F( max c∈R,G,B Sc ) ||1 + λ||△R||1 ,
where, F() means the histogram equalization operator to input image.
这个损失函数是本文的核心,下面解释了为什么提出这样一个损失函数:
This loss function means that maximum channel of the reflectance should conform to the maximum channel of the low light image and has the maximum entropy.
There are three main reasons why we choose the maximum channel to constrain.
Firstly, for a low light image, the maximum channel has the greatest impact on its visual effect.
Secondly, if other channels are selected, there is no doubt that saturation will occur according to the prior that the maximum channel must be greater than the other two channels.
Thirdly, if we choose one of the color channel, such as R, G or B channel, it is obviously not in line with the natural image.
本文的一个重要先验:反射图像的损失。
该损失函数意味着反射系数的最大通道应符合弱光图像的最大通道,且具有最大熵。这个先验的原因在于:
illumination loss : adopt structure-aware smoothness loss lI = ||△I ◦ exp (−λ3 △R)||1
最后的损失函数就是三者相加。这个公式的解很难,且费时。因此,这个混合损失函数,采用深度学习的方式求其解。
only need a very simple CNN structure to achieve the decomposition of the illumination and reflectance.
网络结构如下图所示。