We combine convolutional neural net (CNN) and dynamic programming (DP) in an end-to-end pipeline to accomplish left ventricle segmentation. While CNN is differentiable, DP implementation is non-differentiable due to its dependence on "argmin" function. We have approximated the gradient across DP by a neural network (see Fig. 1). Similar technique, known as synthetic gradients, has been used for asynchronous training of deep neural networks.
Our results (Fig. 2) show that EDPCNN can train with fewer labeled data. Further EDPCNN is quite resistant to overfitting compared to U-Net (see publication). We will continue to work with this direction, which we refer to as ADEPT: Architecture for Differentiable End-to-end Programming and Training.
Leg muscle segmentation from MRI is challenging because often there is no strong edge/boundary that delineates a muscle. Thus, we need to resort of supervised learning, where expert segmentations are first created to form a training set of images. Since the segmentation problem is 3D here, we can represent a stack of 2D contours by two 2D images (see details in our publications). Then we can learn a CNN that regresses this representation from 3D MRI. However, we noted that using a pre-trained PCA layer in conjunction with the CNN helps to improve the accuracy of segmentation. We train this system end-to-end.