Exploratory Data Analysis Reveals Spatio-temporal Structure of Null fMRI Data R. Baumgartner, L. Ryner, R. Somorjai, R. Summers Institute for Biodiagnostics, National Research Council Canada, Winnipeg, Manitoba, Canada Introduction: Exploratory data analysis (EDA) as developed in computational statistics (2) provides methods for exploration of multidimensional data structures. It is conceptually different from the classical statistical confirmatory methods as it aims to let the data speak for itself. However, once the structure of the data is revealed, the question must be asked: do the structures arise only by chance, or do they actually reflect some real effect? Here we investigated the application of EDA tools to in vivo fMR1 data analysis in a systematic study with varying sampling rate. fMR1 data were acquired under the null hypothesis, i.e. no activation. Materials and Methods Water phantom: Five (2D) data sets were acquired with the following parameters: FAITEIMA = 60/50/128x128, and with TR variation TR = 334/625/1250/2500 ms (note that TR may be considered as the sampling frequency of the physiological effects). In vivo fMR1 data acquired under null condition: Three healthy subjects were investigated. For each subject, five in vivo data sets were acquired with the same parameters as for the water phantom. EDA (PreprocessineX3): Our aim was to divide the fMR1 timecourses (TCs) into two groups. The first group consisted of those TCs that are contaminated by white noise and the second comprised TCs that were either contaminated by colored (physiological) noise or contain the signal of interest. To separate the two types of TCs the l-lag (rl) and 2-lag (r2) Pearson autocorrelation coefficient was calculated for each TC. Then a rlr2 plot was created (the horizontal and vertical axes correspond to rl and r2, respectively). For white data this plot is symmetrically distributed in a disk around the origin of the coordinate system. Spatial distribution of the non-white TCs was also displayed. Fuzzv Clustering EDA (4,5) was applied in the time domain only to those TCs that survived the preprocessing step, in order to investigate their temporal behaviour across the TR range. Results and Conclusion: In Fig. 1 the rl maps obtained from the in vivo data for the TRs: (a) 3500, (b) 2500, (c) 1250, (d) 625, (e) 335 are displayed. Note that for in vivo data, the spatial pattern of the pixels contaminated by “physiological” noise is similar and yields overlapping regions across the whole TR range and the area contaminated comprises mostly grey matter. The rl-map for the water phantom (f) (TR=2500) is also displayed. There is no spatial (4 (4 (b) (cl (4 0-J (4 (e) (f) Fig. 2 The rl-r2 plots for the in vivo data (a,b,c,d,e) as in Fig. 1 and the water phantom (f). Arrows point to the regions where the rl-r2 plots are deformed due to colored noise. In Fig. 2 the rl-r2 plots for in vivo data (a,b,c,d,e as in Fig.1) and the water phantom (f) are shown. The rl-r2 plot for the water phantom is symmetrically distributed around the origin of the coordinates (OC). For in vivo data the rl-r2 plots are deformed and contain of two parts. One which is symmetrically distributed around the OC as in the case of water phantom and another one which appears as a tail in the upper right corner of this plot. These points correspond to the pixels highly contaminated by autocorrelated physiological noise. To identify them a threshold based on statistical significance of the rl was used (p-value=0.05). The TCs of the survivors were used as an input to FCA. For all subjects investigated KA was able to dctcct various kinds of physi&giutl noise artifacts for each ‘I’R and consistent tendencies could be identified. For low TR (high sampling fi’cqucncy), brcatbing and heart beat artifacts wcrc identified, The higher the TR, the more the ‘KS were contaminntcd by trends. As tbc regions co~~t~lt~iliate(iacross rhc ‘IX range overlap, this suggests aliasing from bigher freyttcncics as the sampling rate (TR) approaches the Nyquist bar&. Systematic inv~sti~~ltior~of Abealiasillg paitefns in fMR1 time-series is an importtmt challenge for better identification of “true activations” in f?vlRl (6). WC showed that iii&based methods that do not require prior knowledge about ‘the temporal shapes of the ‘KS offer valuable toois for MR time-series (spatiotemporal) artifact detection. References 1. Tukey J, Exploratory Data Analysis, Addison-Wesley, 1977. 2. Buja A, Tukey P, Computing and graphics in statistics, SpringerVerlag, 1991. http://www.research.att.com/-andreas 3. Somorjai R, JarmaszM, Baumgartner R, ISMRMP9, Philadelphia, 1999. 4. Somorjai R, Jarmasz M, Baumgartner R, NeuroImage, submitted, 1999. 5. JarmaszM, Somorjai R, ISMRMPI, Sydney, 1998, 2068. Evident a 3D analysissoftware package, http: //www.ibd.urc.ca/informaGcs 6. Mayhew J, Hu D, Zheng Y, et al. NeuroImage, 7,49-71,1998 Fig. 1 The rl-maps for the in vivo data TR= (a) 3500, (b) 2500, Proc. Intl. Sot. Mag. Reson. Med. 8 (2000) (c) 1250, (d) 665 (d) and (e) 335 ms. The water phantom rl-map (f) is also displayed. 1717
© Copyright 2026 Paperzz