This study demonstrates auditory human fMRI conducted at 9.4T field strength and submillimeter resolution for the first time. Tonotopic maps were measured robustly and reliably. Further, cortical regions with preference for natural sound categories were delineated. We generated ripple control sounds that closely match low level acoustical properties of natural sounds in four natural sound categories, such that the original category is not recognizable. We show that, in areas preferring speech sounds over other natural sounds, ripple control sounds of speech elicit stronger responses than ripple control sounds of non-speech. This indicates tuning to the low-level acoustical properties of speech.
Participants and set-up: 7 healthy human volunteers participated in the fMRI measurements performed with a 9.4 T scanner (Siemens, Erlangen, Germany) equipped with a head gradient set (80mT/m @ 400mT/m/s) and an 8Ch-TX/32ch-RX coil (Life Services LLC, Minneapolis, MN) focused on temporal lobe. 2 participants were excluded due to large motion artefacts or technical problems. A non-subject-specific B1+ shim was used, which was optimized for efficiency in bilateral auditory cortex and validated for safety through SAR modeling.
Stimuli and analysis: Amplitude modulated tone stimuli (0.25 – 4.0 kHz) were presented in a block design for tonotopic mapping (see Fig. 1 caption for design details). Tonotopic maps were obtained after standard pre-processing by best frequency mapping. 4 For the second experiment, stimuli were real life sounds from 4 categories (human speech, human voices, animal calls, and instruments/music). Control sounds were generated from the real life stimuli and consisted of ripples having the same main spectro-temporal modulation as the corresponding sound, and additionally matched in spectral envelope and dynamic temporal envelope. 14 stimuli of all eight categories (4 original + 4 control) were presented in a fast event-related design in silent gaps during MB-EPI acquisitions (in pseudo-randomized order, 112 stimuli in total, 5 repetitions each), while participants performed a 1-back task (by pressing a button). Target trials (5%) were discarded. We analyzed the time series by applying GLM denoise 5 to determine noise regressors and estimated the single voxel responses to all sound categories using multiple regression (i.e. General Linear Model).
BOLD fMRI imaging parameters: 48 transversal slices, TE/TR/TA = 17/2400/1200 ms, silent gap 1200 ms, matrix size 156x200, 0.8 mm isotropic nominal resolution, MB/GRAPPA-Factor 2/3 (FLEET reference lines 6), nominal flip angle 90°.
Anatomical imaging: T1-weighted weighted MPRAGE: TE/TR/TI = 2/3600/1200 ms, matrix size 384x384x256, 0.6 mm isotropic nominal resolution, GRAPPA-Factor 3, nominal flip angle 4°. Proton density-weighted MPRAGE: Same as T1-weighted MPRAGE except no inversion and TR = 1260 ms.
1. Binder JR, Frost JA, Hammeke TA, Bellgowan PS, Springer JA, Kaufman JN, Possing ET. Human temporal lobe activation by speech and nonspeech sounds. Cerebral cortex 2000;10(5):512-528.
2. Belin P, Zatorre RJ, Lafaille P, Ahad P, Pike B. Voice-selective areas in human auditory cortex. Nature 2000;403(6767):309-312.
3. Norman-Haignere S, Kanwisher NG, McDermott JH. Distinct Cortical Pathways for Music and Speech Revealed by Hypothesis-Free Voxel Decomposition. Neuron 2015;88(6):1281-1296.
4. Formisano E, Kim DS, Di Salle F, van de Moortele PF, Ugurbil K, Goebel R. Mirror-symmetric tonotopic maps in human primary auditory cortex. Neuron 2003;40(4):859-869.
5. Kay KN, Rokem A, Winawer J, Dougherty RF, Wandell BA. GLMdenoise: a fast, automated technique for denoising task-based fMRI data. Front Neurosci 2013;7(247):10.3389.
6. Polimeni JR, Bhat H, Witzel T, Benner T, Feiweier T, Inati SJ, Renvall V, Heberlein K, Wald LL. Reducing sensitivity losses due to respiration and motion in accelerated echo planar imaging by reordering the autocalibration data acquisition. Magnetic resonance in medicine 2015.