Distilling the Knowledge in a Neural Network公式2推导

在这里插入图片描述

推导: \large\bf推导: 推导:
∂ C ∂ z i = ∑ j = 1 N ∂ C ∂ q j ∂ q j ∂ z i = ∂ C ∂ q i ∂ q i ∂ z i + ∑ j = 1 , j ≠ i N ∂ C ∂ q j ∂ q j ∂ z i = 1 T [ p i ( q i − 1 ) + ∑ j = 1 , j ≠ i N p j q i ] = 1 T ( q i − p i ) \begin{aligned} \frac{\partial C}{\partial z_i}&=\sum_{j=1}^N\frac{\partial C}{\partial q_j}\frac{\partial q_j}{\partial z_i}\\ &=\frac{\partial C}{\partial q_i}\frac{\partial q_i}{\partial z_i}+\sum _{j=1,j\neq i}^N \frac{\partial C}{\partial q_j}\frac{\partial q_j}{\partial z_i}\\&=\frac{1}{T}[p_i(q_i-1)+\sum _{j=1,j\neq i}^Np_jq_i]\\&=\frac{1}{T}(q_i-p_i)\end{aligned} ziC=j=1NqjCziqj=qiCziqi+j=1,j=iNqjCziqj=T1[pi(qi1)+j=1,j=iNpjqi]=T1(qipi)

你可能感兴趣的:(笔记,算法,机器学习,深度学习)