The novel method for quality control applicable for commonly used breast MR images (T1WI, T2WI and DWI) in terms of semi-quantitative and quantitative analysis was proposed. The scan was performed across multiple sites on both a breast phantom and a volunteer. The provided scores and apparent spatial resolution were comparable between a phantom and a volunteer. Quality of breast MR images across sites were variable, and standardization using a dedicated breast phantom is considered necessary to assure good diagnostic performance of breast MRI.
A bilateral breast phantom was developed; one side contained a plate with multiple widths of slits (8 slits each for 0.5, 0.6, 0.8, 1, 2 and 3mm widths) for the estimation of spatial resolution, and the other side incorporated compartment with a range of ADC values (using polyvinylpyrrolidone (PVP) and nonane), covering clinical values from malignant to benign lesions6 (Fig.1). T1W, T2W and DW images of the breast phantom and a volunteer were acquired across 4 centers (two scanners each for 3T and 1.5T; detailed acquisition protocols shown in Table 1, with room temperature at 25℃). T1W and T2W images with high resolution were also acquired to generate the “reference” images to directly compare the images across multiple sites. “Apparent spatial resolution” at each site was inferred by following steps; sample images of a normal volunteer were scanned with 0.5mm in-plane resolution (total acquisition time (TA) ~8min), which was much higher than in clinical setting (TA~2.5min). Reference image sets were calculated by applying rectangular window filter in k-space of the sample images corresponding to 0.5~2.0mm (0.1mm step) spatial resolution using matlab (Mathwork, Natick, MA). Two breast radiologists assigned “apparent spatial resolution” to volunteer images at each site by comparing the reference image sets (regarding contour of breast etc) on consensus.
Semi- quantitative analysis: T1W, T2W and DW images in a phantom and normal breasts were read by two independent breast radiologists and 0-4 scores were assigned to evaluate image quality for each site (Fig.1), and their agreement was evaluated using weighted kappa.
Quantitative analysis: ADC values (obtained using b=0 and 600s/mm2) were calculated using ROIs in the compartments containing different PVP solutions and nonane for each DWI dataset.
The representative images of a breast phantom along the equivalent apparent resolution for all sites are shown in Fig.2-3. The equivalent apparent resolution varied more in T1W compared to T2W images. The apparent resolution seems not so different from their real MR images.
The result of scores is summarized in Fig.4. The scores from 2 readers for each site was averaged due to good agreement (weighted Kappa: 0.95-0.975). The scores for 4 sites became variable with 0.8-1.0mm slit for both T1W and T2W images. The appreciation of 3mm slit was most variable across 4 sites for DW images. All breast MR images of a volunteer varied across scanners, however, the scores obtained from a phantom and a volunteer as well as apparent resolutions were comparable at each site.
We proposed the novel method for quality control applicable for all breast MR images, which can provide objective assessments of both phantom and volunteer images. Quality of breast MR images across sites were variable, and standardization using breast phantoms is recommended to assure good diagnostic performance of breast MRI.
1. American College of Radiology. (2016). ACR practice parameter for the performance of contrast-enhanced magnetic resonance imaging (MRI) or the breast. Updated 2013.
2. Partridge SC, Nissan N, Rahbar H, et al. Diffusion-weighted breast MRI: Clinical applications and emerging techniques. J Magn Reson Imaging. 2017;45(2):337-355.
3. Malyarenko D, Galbán CJ, Londy FJ, et al. Multi-system repeatability and reproducibility of apparent diffusion coefficient measurement using an ice-water phantom. J Magn Reson Imaging. 2013; 37(5): 1238–1246.
4. Kinetic Curves of Malignant Lesions Are Not Consistent Across MRI Systems: Need for Improved Standardization of Breast Dynamic Contrast-Enhanced MRI Acquisition. AJR Am J Roentgenol. 2009; 193(3): 832–839.
5. Tuong B, Gardiner I. Development of a novel breast MRI phantom for quality control. AJR Am J Roentgenol. 2013;201(3):W511-5
6. Keenan KE, Peskin AP, Wilmes LJ, et al. Variability and bias assessment in breast ADC measuremnt across multiple systems. J Magn Reson Imaging. 2016 ;44(4):846-55.
Figure 2: The representative T1-weighted images of a breast phantom and a volunteer for 4 sites.
The equivalent apparent resolution of each breast image is also shown in the upper left corner of the breast images.
Figure 3: The representative T2-weighted images of a breast phantom and a volunteer with the equivalent apparent resolution for 4 sites.
The equivalent apparent resolution of each breast image is also shown in the upper left corner of the breast images.
Figure 4: Box-whisker plots of the scores and ADC values for a phantom, and the scores for a volunteer.
T1WI, T2WI and DWI scores at each slit width are provided for a phantom. ADC values were measured in 10%, 14%, 18%, 25% and 40% PVP, water and nonane of a phantom. T1WI, T2WI and DWI scores for each breast image is shown for a volunteer.