In this work, we investigate a scheme for crowd sourcing image quality using machine learned metrics from user rankings of corrupted images. Using an HTML application, experienced observers ranked pairs of corrupted images with respect to image quality. A convolution neural network (CNN) was then trained to produce a quality score that was higher in the preferred images. The trained CNN was found to be more sensitive to artifacts from image blurring and wavelet compression than mean square error. Finally, preliminary use in training a machine learned image reconstruction is demonstrated.
Image Corruption: 3D complex, coil combined, T1 weighted brain structural images from 15 subjects were used to create 7,500 2D image pairs in the axial, sagittal, and coronal planes. For each pair, images were transformed to k-space, randomly undersampled (1-1.25x), apodized with a random Gaussian kernel (sigma=0-2kmax), and randomly wavelet compressed (0-50%). Images were scaled to minimize the MSE with the input, ground truth image, and saved as floating point complex values and as portable graphics format. Images were window leveled automatically using the central quarter of the image.
User Ranking: Image pairs were presented side by side to users experienced in evaluating T1 structural images in an HTML application, as shown in Figure 1. For each pair, users were asked to select the best image or skip the pair if the images were too similar to distinguish. Image pairs were selected at random from the database with repeated evaluation allowed.
Imaging Metric RankNet: To derive a quality metric, image pairs were fed to a RankNet4 based training schema shown in Figure 1. Each image is independently passed through a convolutional neural network (CNN) which aims to output a quality score. The quality scores are then subtracted and sigmoid activated to provide a rank preference. The CNN is trained using user ranks and cross entropy loss, augmentation, and 0.9/0.1 validation split. In this work, we used a ResNet architecture with few base filters (16) and bottlenecks per loop (2).
Analysis and use in Reconstruction: To investigate the effect of image corruptions, CNN image quality scores and MSE were computed and compared in a 1,000 images blurred with a Gaussian kernel, randomly undersampled in k-space, and compressed in the wavelet domain. As proof of concept, the CNN image quality was also used in a neural network image reconstruction based on 1D Fourier undersampling, and a network consisting of 2 partially fully connected layers (connected in only 1D) followed by a 3 layer CNN. Images were trained using the CNN metric as the loss function as shown in Figure 2.
1. Zhu B, Liu JZ, Cauley SF, et al . Image reconstruction by domain-transform manifold learning. Nature. 2018;555(7697):487-92.
2. Hammernik K, Klatzer T, Kobler E, et al. Learning a variational network for reconstruction of accelerated MRI data. Magn Reson Med. 2018;79(6):3055-71.
3. Yang G, Yu S, Dong H, et al. DAGAN: Deep De-Aliasing Generative Adversarial Networks for Fast Compressed Sensing MRI Reconstruction. IEEE Trans Med Imaging. 2018;37(6):1310-21.
4. Burges CJC, Shaked T, Renshaw E, et al. Learning to Rank using Gradient Descent. Proceedings of the Twenty Second International Conference on Machine Learning; 2005.