TITLE: Chained Predictions Using Convolutional Neural Networks
AUTHER: Georgia Gkioxari, Alexander Toshev, Navdeep Jaitly
ASSOCIATION: UC Berkeley, Google
FROM: arXiv:1605.02346
There are two formulations of the chain model in this work. The one used for single image is taken as an example here. It is a similar procedure in video version.
The inference stage is illustrated in the figure. The input is the image and the image is first fed to a CNN denoted as CNNx. For every stage, a joint of the person is localized by a CNN denoted as CNNy, denoted as “Predictio@0”. Then both the input and output of CNNy is used to predict next joint in the next stage. The procedure can be formalized as:
where h0 =CNNx(x), e(⋅) is a full neural net, mt is the operation of CNNy on ht , and P is the probability of the location of a joint.