Real time MRI (rtMRI) in human speech is an active field of research, with a particular clinical focus on the assessment of speech disorders. In this work, a numerical phantom is developed to allow acquisition and reconstructions schemes for rtMRI to be compared to a ‘gold standard’. Previously acquired 2D rtMRI images of speech were used to create anatomical masks of various speech organs. An interpolation method was then used to create a continuous time model of the moving structures, which forms the dynamic phantom. The model is then tested using different k-space sampling schemes (Cartesian, radial and spiral).
The computational phantom was developed using a prototyping software development framework created in MATLAB (version 2016b, Mathworks, Natick, MA, USA). The whole development process was split into two stages; (I) Phantom Development and (II) Testing and Implementation (II). A flow diagram of the overall process can be seen in figure 1.
Stage I. Phantom Development: Previously acquired 2D rtMR dimages of a volunteer phonating a standard speech sample were used. Images were acquired at 3T (Achieva Tx, Philips Medical Systems, Best, The Netherlands) with a temporal resolution of 15 frames-per-second (fps) and a resolution of 1.719 x 1.719 mm2 to adequately capture the motion of the velum.5,9. These images were then edge enhanced using the Canny method10 and the relevant speech organs and structures segmented using a bespoke semi-automatic threshold tool. These segmentations were used to create binary masks, which were processed using morphological operators to make them more uniform resulting in 6 anatomical masks: ‘Mandible’, ‘Maxilla’, ‘Epiglottis’, ’Velum’, ’Tongue’ and ‘Head’, the latter representing the parts of the head not included in the other masks. A continuous time motion model was then created by linearly interpolating between two given masks in the time series. Finally, the 2D k-space phantom data was derived as a time series using FFT and a non-uniform fast Fourier transform (NUFFT) for Cartesian and non-Cartesian sampling trajectories respectively. The novel phantom has a simulated symmetrical FOV of 30 cm, image matrix size of 256 x 256, k-space matrix size of 256 x 256, spatial resolution of 1.719 x 1.719 mm2, a temporal resolution of 30 fps and slice thickness of 10mm.
Stage II. Implementation and Testing: The phantom was used to simulate and reconstruct a range of k-space sampling trajectories. The dynamic k-space phantom (two spatial frequency dimensions, kx and ky. and one temporal, t) was sampled in a manner that simulated Cartesian, radial and spiral trajectories, with optional added Gaussian noise. An inverse FTT and inverse NUFFT were used to reconstruct simulated images for Cartesian and non-Cartesian sampling trajectories respectively.
[1] Scott AD, Wylezinska M, Birch MJ, Miquel ME (2014) Speech MRI: Morphology and Function. European Journal of Medical Physics – Medica Physica 6:604-18.
[2] Scott AD, Boubertakh R, Birch MJ, Miquel ME (2012) Towards clinical assessment of velopharyngeal closure using MRI: Evaluation of real-time MRI sequences at 1.5T and 3T. British Journal of Radiology 85:e1083-92.
[3] Lingala SG, Sutton BP, Miquel ME, Nayak KS (2016) Recommendations for real-time speech MRI. Journal of Magnetic Resonance Imaging 43: 28-44.
[4] Burdumy, M., Traser, L., Richter, B., Echternach, M., Korvink, J. G., Hennig, J. and Zaitsev, M. (2015), Acceleration of MRI of the vocal tract provides additional insight into articulator modifications. J. Magn. Reson. Imaging, 42: 925–935. doi:10.1002/jmri.24857
[5] Lingala, S. G., Zhu, Y., Kim, Y.-C., Toutios, A., Narayanan, S. and Nayak, K. S. (2017), A fast and flexible MRI system for the study of dynamic vocal tract shaping. Magn. Reson. Med., 77: 112–125. doi:10.1002/mrm.26090
[6] Freitas AC, Wylezinska M, Birch MJ, Petersen SE, Miquel ME (2016) Comparison of Cartesian and non-Cartesian real-time MRI sequences at 1.5T to assess velar motion and velopharyngeal closure during speech. PLOS ONE, DOI: http://dx.doi.org/10.1371/journal.pone.0153322
[7] Freitas AC, Ruthven M, Boubertakh R, Miquel ME (2017) Real-time speech MRI: commercial Cartesian and non-Cartesian sequences at 3T and feasibility of offline TGV reconstruction to visualise velopharyngeal motion. Physica Medica
[8] Wissmann L, Santelli C, Segars WP, Kozerke S (2014) MRXCAT: Realistic numerical phantoms for cardiovascular magnetic resonance. Journal of Cardiovascular Magnetic Resonance 16:63.
[9] Ruthven M, Freitas A, Keevil S, Miquel M. Real-time speech MRI: What is the optimal temporal resolution for clinical velopharyngeal closure assessment?. 2016;24:(3208.).
[10] Canny J. A computational approach to edge detection. IEEE Trans Pattern Anal Mach Intell. 1986(6):679-698.
[11] Huang, F., Akao, J., Vijayakumar, S., Duensing, G.R. and Limkeman, M., 2005. K-t GRAPPA: A k-space implementation for dynamic MRI with high reduction factor. Magnetic Resonance in Medicine, 54(5), pp.1172-1184.