1780

Towards Template-Invariant Voxel-Wise Analysis
Roman Fleysher1, Nelson Gil1, and Michael L Lipton 1
1Radiology, Albert Einstein College of Medicine, Bronx, NY, United States

Synopsis

Voxel-wise cluster analysis of any MR-derived metric necessitates non-linear registration to a common brain template. Because quality of inter-brain registrations is sensitive to the similarity between the brains, the results of such analyses are sensitive to the choice of the template. Despite judicious selection of the template may reduce registration errors, inherently the results should not depend on template choice. We show that a dramatic reduction in this sensitivity is achieved by filtering out poorly registered images. Consequently, voxel-wise cluster analyses of the remaining data become more robust and less sensitive to the choice of the template.

Introduction

Voxel-wise analyses of brain DTI, fMRI or any other MRI-derived metric, comparing groups or one patient to a group, rely on registration to a brain template to ensure that homologous regions are being compared. The neuroanatomical variability among individuals virtually ensures that one brain can never be exactly mapped onto another.1-4 Because quality of inter-brain registrations is sensitive to the similarity between them, results of voxel-wise analyses are sensitive to the choice of the template.4

Registration errors cause two main difficulties for studying sensitivity to the choice of template: absence of the “ground truth” and comparison of results obtained over different individual templates. In an ideal scenario (Fig 1), subject data is registered to a master template without errors for a voxel-wise analysis to generate the “ground truth”. The same analysis is performed over the various candidate templates. Subsequently, clusters are carried over to the master template without misalignment for comparison with the ground truth. If the clusters match the ground truth, the candidate template may be termed as “good”. Otherwise, it is “bad”. This ideal scenario would allow detailed analysis of the quality of the individual morphisms and development of a criterion with which to screen and filter out aberrant transformations thus converting a “bad” template into a “good”, albeit for a subset of subjects.

In the present study, we develop an “induction” process to manufacture the needed error-free morphisms. We then examine how template selection affects the results of a voxel-wise fractional anisotropy (FA) analysis that seeks to reproduce the well-known decline in white matter integrity with age. We demonstrate that excluding poorly-registered images dramatically increases robustness of the voxel-wise analysis to the choice of template.

METHODS

This study was approved by IRB and includes 96 datasets obtained as part of ongoing Einstein Lifespan Study (ELS). Images were reviewed by an experienced neuroradiologist and determined to be free of visible structural abnormalities. Imaging was performed using a 3.0T Philips Achieva TX scanner (Philips Medical Systems, Best, The Netherlands) utilizing its 32-channel head coil with the following protocol. T1W: TR/TE/TI = 9.9/4.6/900 msec, flip angle 8deg, 1mm3 isotropic resolution, 128x116x220 matrix; DTI: TR/TE = 10,000/65msec, 32 diffusion directions, b-value = 800 sec/mm2, 2mm3 isotropic resolution, 240x188x70 matrix; and field map to remove EPI distortions in DTI and small distortions in T1W: TR/TE = 20/2.4 msec, delta TE=2.3msec, flip angle 20deg, 4mm3 isotropic resolution, 64x64x50 matrix. DTI data were eddy corrected and registered to T1W. All brain extractions and registrations were visually inspected. All non-linear registrations (morphisms) were performed using ART.2 Clusters of voxels where FA was significantly correlated with age were identified by performing voxel-wise t-test with gender as covariate at p=0.005 and retaining clusters of 100 or more contiguous voxels.

We recognize that even with the best available algorithms, morph errors between brains A and B can never be turned off. However, a morphism can transform image A into another image B'. Image B' does not match image B exactly, but transformation between A and B' is exact by construction (Fig. 2A). When we take A to be the master template, we refer to image B' as the “induced template” with image B being the “inductor”. Specifically, we selected the JHU brain as the master template and morphed it onto the 96 T1W images of the ELS as inductors to produce 96 induced templates to be used as candidate templates (Fig. 2B). A similar induction process is used to turn off morph errors between subjects and the master template (Fig. 2C).

All 96 induced subjects were morphed to all 96 induced templates (red arrows in Figure 1). Each of these 9216 morphisms underwent quality filtering based on displacement of anatomical landmarks delineated by the ASEG module of FreeSurfer5 as previously reported.6 We defined a morphism as “good” if the mean displacement was less than 0.14mm and as “bad” if it was more than 0.15mm. Morphisms in between are discarded (Figure 3). To eliminate sensitivity to sample size introduced by these quality cuts, we selected 30 subjects, 25 good and 25 bad templates such that all subjects are morphed well to all good and poorly to all bad templates.

RESULTS

Because the same data were used throughout, the differences between FA clusters are not random but due to registration errors (red arrows in Figure 1). Each of the 50 sets of clusters was compared to the gold standard set using the Jaccard index (Figure 4) demonstrating marked improvement when registration quality filtering is applied. Jaccard index between shifted and unshifted gold standard clusters shows that “good” morphisms produce as good a match as a single diagonal 1mm voxel shift.

CONCLUSION.

Clusters in voxel-wise analyses are sensitive to registration errors which makes them sensitive to the choice of template. Filtering out poorly morphed images dramatically reduces this sensitivity, making cluster analysis more template independent. To eliminate spurious clusters such filtering should be performed even if the template is study-specific. Filtering is especially important when the choice of the template is dictated by some other considerations such as the individual subject T1W image in subject-based analysis.4

Acknowledgements

Support for this research was provided in part by the National Institute on Aging, grant 5P01AG003949-34.

References

1. Ardekani B. A., Guckemus S., Bachman A., et al. Quantitative Comparison of Algorithms for Inter-Subject Registration of 3D Volumetric Brain MRI Scans. J. Neurosci. Methods 2005; 142: 67

2. Klein A., Andersson J., Ardekani B. A., et al. Evaluation of 14 Nonlinear Deformation Algorithms Applied to Human Brain MRI Registration. NeuroImage 2009; 46: 786

3. Grachev I. D., Berdichevsky D., Rauch S. L., et al. A method for assessing the accuracy of intersubject registration of the human brain using anatomic landmarks. Neuroimage 1999; 9: 250.

4. Suri A. K., Fleysher R, Lipton M. L. Subject Based Registration for Individualized Analysis of Diffusion Tensor MRI. PLoS ONE 2015; 10(11): e0142288.

5. Fischl B., Salat D. H., Busa E., et al. Whole Brain Segmentation: Automated Labeling of Neuroanatomical Structures in the Human Brain. Neuron 2002; 33: 341.

6. Fleysher R, Kim N, Suri A, Lipton M, Branch C. Characterization of Registration Errors to Screen Aberrant Subject Results Prior to Voxel-Wise Whole Brain Analysis. Proceedings of 25th ISMRM 2017; 4684

Figures

Figure 1. Ideal scenario and study design. Subjects’ data is registered to candidate templates for voxel-wise analyses. Obtained clusters are carried over to the master template using exact morphisms for comparison with the “ground truth”. The “ground truth” clusters over the master template are obtained by applying error-free morphisms to the set of subjects’ data. Green double arrows represent non-linear registrations (morphisms) which are known exactly (Figure 2). Red arrows represent morphisms under investigation which contain errors.

Figure 2. Induction process. Even the best available algorithms cannot map brain A to brain B exactly (A). However, mapping from A to “brain B+error” (i.e. image B’ in main text) is exact. This principle is utilized to generate induced templates (B) and induced subjects (C): Morphism from each subject to the master is computed and applied to the FA map to obtain “ground truth”. Because we are not focused on studying FA dependence on age, we treat these morphisms as exact. These morphisms are inverted and applied to the master to produce induced T1W image for each subject.

Figure 3. Morphism quality filtering. Histogram of mean displacement error due to errors in morphisms between induced subjects and templates. Based on this bimodal distribution, registrations leading to atlas distance of less than 0.14mm are classified as “good”, while those leading to more than 0.15mm as “bad”. Intermediate transformations with their corresponding induced subject/template are discarded.

Figure 4. Improvement. Jaccard index J representing the spatial overlap between “ground truth” clusters XG and clusters of interest XC: the 25 “good” (red), the 25 “bad” (black). For reference, 12 white points are Jaccard indices with the “ground truth” shifted by a single 1mm voxel in three coordinates and diagonals. In 3D, Jaccard index can be much less than 1, still indicating close overlap. Overlap between the red and white points suggests that “good” morphisms produce clusters in close agreement with the ground truth independent of the template thanks to the morph quality filter.

Proc. Intl. Soc. Mag. Reson. Med. 28 (2020)
1780