Harmonizing diffusion MRI data from multiple scanners

Hengameh Mirzaalian¹, Lipeng Ning¹, Peter Savadjiev¹, Ofer Pasternak¹, Sylvain Bouix¹, Oleg Michailovich², Marek Kubicki¹, Carl Fredrik Westin¹, Martha E. Shenton¹, and Yogesh Rathi¹

¹Harvard Medical School and Brigham and Women’s Hospital, Boston, USA., Boston, MA, United States, ²University of Waterloo, Toronto, ON, Canada

Synopsis

Diffusion MRI (dMRI) is increasing being used to study neuropsychiatric brain disorders. To increase sample size and statistical power of neuroscience studies, we need to aggregate data from multiple sites¹. However this is a challenging problem due to the presence of inter-site variability in the signal originating from several sources, e.g. number of head coils and their sensitivity, non-linearity in the imaging gradient, and other scanner related parameters². Prior works have addressed this issue either using meta analysis³, or by adding a statistical covariate⁴, which are not model free and may produce erroneous results.

Propose

In this work, we propose a novel method to harmonize dMRI data acquired from multiple sites and scanners, for a joint analysis of the data for increased statistical power.

Method

In our proposed pipeline, the inputs are dMRI images acquired at multiple sites. We set one of these sites as our reference site and the rest as our target sites. As output, we provide harmonized dMRI signals of the target sites, which are mapped to the reference site by removing inter-site variabilities but preserving intra-site variability. To account for regional-variations in the data, we use Freesurfer parcellated label maps, which also provide region-wise correspondences. The current work assumes that the acquired data has the same b-value and spatial resolution, even though the number of gradient directions could be slightly different. In our pipeline, we start by computing spherical harmonic (SH) coefficients of the dMRI signal S=[s₁...s_G]^T acquired along G unique gradient directions. In the SH basis, the signal S can be written as S=Σ_i,jY_ijC_ij, where Y_ij is a SH basis function of order i and phase j and C_ij are the corresponding SH coefficients. It is well-known that the l₂ norm of the SH coefficients at each order forms a set of rotation invariant (RISH) features⁵: |C_i|²=Σ_jC_ij².

Given the Freesurfer label maps for each subject, at each site, we compute the expected value of the region-wise RISH features as the average of the voxel-wise RISH features. As shown in Figure 1, these features vary significantly across sites as well as for different regions. Considering this fact, in the next step of our pipeline, we introduce a region based mapping Π such that average RISH features computed over all subjects in the target site match those in the reference site, i.e. E_t(Π(|C_i|²))=E_r(|C_i|²) . A simple Π, which satisfies this equality is Π(|C_i|²)=|C_i|²+E_r-E_t. Given this region-wise map of the RISH features, we need to find a voxel-wise mapping π to estimate harmonized SH coefficients:

Π(|C_i|²)=Σ_j [π(C_ij)]² =Σ_j|C_ij|² +E_r-E_t

We set π as a scaling function:

π(C_ij)=[(Π(|C_i|²))/(|C_i|²)] C_ij

The computed voxel-wise scales for a subject is shown in Figure 2. Applying the above mapping over all the regions of the brain, we can re-compute the signal at each voxel as ζ=Σ_iΣ_jπ(C_ij)Y_ij and project it back to the reference site.

Results

We validated our method on diffusion data acquired from seven different sites (including two GE, three Philips, and two Siemens scanners) on a group of age-matched healthy subjects. Scanner details and subject numbers for each site are reported in Table 1. Since all the data was from matched healthy subjects, we do not expect to see biological differences between the group of subjects from each site. Therefore, we hypothesize that the RISH or standard diffusion measures at the reference and the target sites are statistically not different. To validate our hypothesis, we used t-test to compute p-values for RISH features and standard single tensor based diffusion measures between the reference and the target sites; these p-values were computed before and after applying our harmonization method. Computed p-values for generalized FA (GFA) are shown in Table 2. It can be seen that all p-values-after harmonization are larger than 0.05, which indicates that we could not reject the null hypothesis (similar results are observed for all the RISH features and FA). We also computed the whole brain within site coefficient of variation (CV) in GFA, which did not change much after our harmonization. For independent validation, we also show results using tract-based spatial statistics before and after harmonization. In Figure 3, it can be seen that after data harmonization, all scanner specific group differences in the white matter were removed. Our experimental results demonstrate that, for nearly identical acquisition protocol across sites, scanner-specific differences can be accurately removed using the proposed method.

Discussion and Conclusion

To the best of our knowledge, this is a first method that has addressed the issue of harmonizing dMRI data acquired from multiple sites⁶. Our proposed method is independent of compartmental modeling (e.g., tensor, intra/extra cellular compartments, etc.), subject-dependent, and region specific. It maintains inter-subject variability, removes scanner specific differences, and makes it feasible to do joint analysis to significantly increase the sample size and statistical power of neuroimaging studies.

Acknowledgements

The authors would like to acknowledge the followinggrants which sup- ported this work: R01MH099797(PI: Rathi), R00EB012107 (PI: Setsompop), P41RR14075(PI: Rosen), R01MH074794 (PI:Westin), P41EB015902 (PI:Kiki- nis), Swedish Research Council (VR) grant 2012-3682, Swedish Foundation for Strategic Research (SSF) grant AM13-0090, VA Merit (PI: Shenton) and W81XWH-08-2-0159 (Imaging Core PI: Shenton).

References

[1] Venkatraman V, Gonzalez C, Landman B, Goh J, Reiter D, An Y, Resnick S, Region of interest correction factors improve reliability of diffusion imaging measures within and across scanners and field strengths. NeuroImage. 2015; 406–415.

[2] Zhu T, Hu R, Qiu X, Taylor M, Tso Y, Yiannoutsos C, Navia B, Mori S, Ekholm S, Schifitto G, Zhong J, Quantification of accuracy and precision of multi-center dti measurements: a diffusion phantom and human brain study. Neuroimage; 2011, 1398–1411.

[3] Salimi-Khorshidi G, Smith S, Keltner J, Wager T, Nichols T, Meta-analysis of neuroimaging data: a comparison of image-based and coordinate-based pooling of studies. Neuroimage; 2009.

[4] Forsyth J, Cannon T, Reliability of functional magnetic resonance imaging activation during working memory in a multisite study: analysis from the north american prodrome longitudinal study. Neuroimage; 2014. 41–52.

[5] Kazhdan M, Funkhouser T, Rusinkiewicz S, Rotation invariant spherical harmonic representation of 3D shape descriptors. Symposium on Geometry Processing; 2003.

[6] Mirzaalian H, Pierrefeu A, Savadjiev P, Pasternak O, Bouix S, Kubicki M, Westin C.F, Shenton M.E, Rathi Y, Harmonizing diffusion MRI data across multiple sites and scanners, MICCAI, 2015.

Figures

Figure 1. Freesurfer region based RISH features for different SH orders and sites.

Figure 2. First row (left to right): RISH features of order {0,2,4,6,8}. Second row: amount of shift for each region introduced by Π (Er-Et) different columns correspond to different order of spherical harmonics {0,2,4,6,8}. Third row: Scale computed at each voxel by π; different columns correspond to SH of order {0,2,4,6,8} respectively.

Table 1. Scanner details and subject numbers for each site (M - Male, F - Female, R - right handed, L - left handed).

Table 2. P-values before and after harmonization for GFA for different sites and ROIs.

Figure 3. TBSS results for the target sites before (a-f) and after (g) applying our method. The yellow-red colormap displays p-values less than 0.05.

Proc. Intl. Soc. Mag. Reson. Med. 24 (2016)

1042