Empirical Mode Decomposition in data-driven fMRI

Empirical Mode Decomposition in data-driven fMRI analysis
John McGonigle
Department of Computer Science
University of Bristol
Bristol, UK
[email protected]
Majid Mirmehdi
Department of Computer Science
University of Bristol
Bristol, UK
[email protected]
Abstract—Empirical Mode Decomposition has emerged in
recent years as a promising data analysis method to adaptively
decompose non-linear and non-stationary signals. Here we
introduce multi-EMD, to be used where there are many
thousands of signals to analyse and compare, such as is common
in the analysis of functional neuroimages. The number of component signals found through Empirical Mode Decomposition
varies at each location in the brain. We seek to rearrange
these components so that they may be compared to others
at a similar temporal scale. This is a data-driven process
based on grouping those components which have similar
dominant frequencies to target frequencies which have been
found to be most common from the initial decomposition. This
new set of rearranged components is then clustered so that
regions behaving synchronously at each temporal scale may be
discovered. Results are presented for both simulated and real
data from a functional MRI experiment.
Keywords-Magnetic resonance; Signal resolution; Timefrequency analysis; Clustering methods;
I. I NTRODUCTION
Since its development by Huang in 1998 [1], Empirical
Mode Decomposition (EMD) has been applied across many
diverse disciplines as a useful technique in signal decomposition. These have included tomographic reconstruction
in PET [2], diagnosis of mechanical problems in rotating
machinery [3], examination of EEG data [4] and bandpass
filtering [5]. Specifically, its adaptive nature is useful in a
data-driven analysis since no filter size or basis function is
specified beforehand, but rather discovered from the data
itself. We propose to use this technique as one step in a
data-driven analysis of functional neuroimages.
Functional magnetic resonance imaging (fMRI) is a form
of functional neuroimaging which takes many 3D scans
of the human brain over time. Each spatial position will
therefore have several hundred samples, which will vary in
intensity due to factors such as blood oxygenation, slight
subject motion, or artefacts inherent in the imaging process.
The resulting data will therefore be made up of many
thousands of these time courses. The strength and temporal
characteristics of these signals is of interest since they enable
us to elucidate and localise neural behaviour in the brain.
A wide variety of analysis techniques are applied to this
type of data. The most popular of these uses the general
Andrea L. Malizia
Psychopharmacology Unit
University of Bristol
Bristol, UK
[email protected]
linear model to identify voxels whose time course correlates
with the a priori experimental procedure, implemented as
statistical parametric mapping (SPM) [6]. Other techniques
attempt to group voxels which exhibit similar behaviour.
In wavelet-based cluster analysis (WCA), coefficients from
wavelet decompositions are grouped through optimisationbased clustering [7]. One drawback of decomposition methods such as that used in WCA is that the number and
temporal resolution of each component is defined a priori.
As such, there may be non-ideal situations where a true
component is smeared across several different frequency
ranges, or temporal scales.
In EMD, a signal may be adaptively decomposed into a
number of component signals, where, in the case of fMRI,
this number will vary across the many thousands of voxels
in the brain. This creates a difficulty when attempting to
compare decomposed signals from different voxels. Here
we propose a solution to this problem by the introduction
of multi-EMD and explore its use as a non-supervised and
completely data-driven decomposition step, splitting the data
into a number of different components, each corresponding
to a temporal scale discovered from the data itself before
carrying out clustering at each of these scales. It is applied
here on simulated and real data from an fMRI experiment.
II. P ROPOSED M ETHOD
A. Empirical Mode Decomposition
Empirical Mode Decomposition, as introduced in [1],
attempts to decompose a signal into a finite and often low
number of component signals or Intrinsic Mode Functions
(IMFs). These IMFs are defined as having (i) the same number of zero crossings as they have extrema (or a difference
of 1) and (ii) symmetric envelopes defined by local extrema.
Following [1] we describe the sifting process used to
discover each IMF:
1) Taking the original signal X, find all local extrema.
Connect all local maxima by a cubic spline to form an
upper envelope. Repeat for all local minima to form a
lower envelope.
2) Take the mean of these envelopes, m1 . The difference
between X and m1 is h1 , which we regard as a protoIMF.
non-stationary signals. Figure 1 shows an example of the
decomposition of a signal into its IMFs and residual.
Original Time Course
4
0
B. Use in brain imaging
−4
IMF 1
2
0
−2
IMF 2
Amplitude
2
0
−2
IMF 3
2
0
−2
IMF 4
2
0
−2
IMF 5
2
0
−2
Residual
2
0
−2
0
50
100
150
Timepoints
200
250
Figure 1.
An original voxel time course (from fMRI of the human
brain) together with its IMFs and residual following Empirical Mode
Decomposition.
EMD has been predominantly used on one signal at a time
and often with supervision. In fMRI one often has many tens
to hundreds of thousands of signals, or voxel time courses, to
examine. As in WCA, it can be useful to examine this data
at a suitable temporal scale, that is, one is often interested
in looking at the data without its higher frequency noise or
lower frequency drift. However, the number of component
temporal scales each signal is broken down into in WCA
will depend on the basis function imposed by the user.
By using EMD to perform a decomposition, this number
of component temporal scales and their frequency content
is adaptively driven by the data itself. Indeed, one IMF
may contain several different frequencies. This creates the
situation whereby different voxel time courses are broken
down into different numbers of IMFs. Unlike WCA, it is no
longer possible to choose just one or more temporal scales to
cluster the data at, since, for example, the 3rd IMF from one
voxel may be more comparable to the 4th IMF from another.
This will often depend on the signal conditions which will
vary spatially in the brain.
C. multi-EMD
3) If h1 does not fulfill the properties of an IMF steps
1 and 2 are carried out on it, so that h11 = h1 −
m11 . This continues for k iterations until h1k satisfies
the properties of an IMF or the stopping criteria are
reached (see below).
4) After the first IMF is found, it is subtracted from
the original data, and this remaining signal, X − h1k ,
treated as the original data in a repeat of steps 1 to 4.
5) This is all carried out n times, finding a component
signal, IMFj , at each level until no further IMFs can
be discovered and only the residual, r, remains.
The original data may therefore always be reconstructed
from the IMFs and residual as
X=
n
X
IMFj
+ r.
(1)
j=1
So that the process will not carry on indefinitely a
stopping criteria may be used for those proto-IMFs that do
not satisfy the definition of an IMF. Also following [1],
this can be achieved by limiting the size of the standard
deviation, SD, found from two consecutive sifting results as
PT
|hk−1 (t) − hk (t)|2
.
(2)
SDk = t=0PT
2
t=0 hk−1 (t)
Since the decomposition is based on the local characteristics of the data it may be applied to non-linear and
If the original data is to be broken down into a number
of temporal scales, it is necessary to discover what this
number should be, and also which IMF or residual from
each voxel should be used to represent the data at each
scale. To achieve this we introduce the Empirical Mode
Decomposition of multiple signals, or multi-EMD. This is
carried out as follows:
1) Perform EMD at every voxel, producing n IMFs for
each.
2) Find the median number of IMFs plus one (for residuals) that the data has been decomposed into. This is the
number of multi-EMD components, n0, to rearrange
the IMFs into.
3) Find the median most powerful frequency across each
of the initially decomposed IMF levels, and use these
as the target frequency for each multi-EMD component.
4) A new data-matrix is constructed for each multi-EMD
component and filled, at each voxel, with the IMF
or residual from that spatial position which has a
dominant frequency closest to the target for that scale.
K-means clustering may then be performed on any or all
of these new multi-EMD components.
D. Experimental set-up
1) Phantom: The phantom data used here is based on
that in [7]. It is a single square slice of side 128 with each
pixel having 256 timepoints (Figure 2(a)). Each row contains
B. Phantom
(a) One frame of the
phantom data (at a
CNR 5 times that used
here for ease of visualisation).
Figure 2.
(b) The number of
IMFs
each
pixel
may be decomposed
into. Darker colours
represent fewer IMFs.
(c) K-means using 5
clusters on the undecomposed phantom
data.
Different views of the simulated phantom data
circles, of radius 9 pixels, of a sine wave at decreasing
1
1
1
1
, 8π
, 12π
and 16π
Hz),
frequencies (from top to bottom 4π
with the columns being this signal with a contrast-to-noise
ratio (CNR) of 1, 0.6, 0.5 and 0.25 from left to right. These
are surrounded by a square area of side 14 containing a
Gaussian taper of the circles’ signal. This is the equivalent to
signals with maximum percentage changes of 2%, 1.2%, 1%
and 0.5% respectively. The background contains Gaussian
noise with a standard deviation of 2% signal change. These
are the type of noise levels one can expect to see in fMRI.
2) Functional MRI: The experimental data used here
comes from a block design sensorimotor task which involved
bilateral finger tapping, audio tones and a flashing checkerboard, all in synchrony at 3Hz. The data was collected as part
of a site inter-variability experiment, of which one subject’s
data from one day is examined here [8]. Total time was
85s with a repetition time of 3s. Each block was 15s long,
interleaved with a control condition of staring at a cross for
the same period. This design would be expected to show
activity in the motor, visual and auditory areas.
3) Wavelet-based Cluster Analysis: The most similar
related method to that which we are presenting here is the
wavelet-based cluster analysis (WCA) of Whitcher et al.
[7]. It seeks to decompose voxel time courses using Haar
wavelets so that whole frequency scales may be removed
from the data and only those frequency scales of relevance to
the experimental design retained for an optimization-based
clustering step. Here we will perform WCA on our phantom
data as a comparison to our multi-EMD method.
1) Pure k-means: The results of k-means clustering on
the undecomposed phantom data can be seen in Figure
2(c) (in all the clustering results shown here k-means was
used with 5 clusters). Only those pixels which have their
signal at a CNR of 1 and 0.6 are sensibly grouped, while
those pixels with a CNR of 0.5 and 0.25 are clustered with
the background noise, the second signal type cannot be
distinguished at all.
2) WCA: In Figure 3 the results of clustering at each level
of wavelet coefficients can be seen. At each step the effective
frequency is halved along with the temporal resolution.
Although the lower CNR signals are better discerned than
without any decomposition, it is clear that most scales retain
signal which would not be expected. In effect, information
from the single frequency simulated signals is smeared
throughout all the levels of wavelet coefficients.
3) multi-EMD: In Figure 4 the results of our multi-EMD
based clustering can be seen. Our algorithm has broken the
data up into 7 different components which have the target
frequencies of 0.4, 0.18, 0.098, 0.058, 0.035, 0.020 and
0.011 Hz. The simulated signals have the actual frequencies
of 0.080, 0.040, 0.026 and 0.020 Hz. It is seen that there is
a better discrimination between temporal scales when using
multi-EMD than with WCA. In a perfect decomposition of
the data only one row would be visible in each component
image. In effect, this EMD based decomposition is similar
to an adaptive and data-driven bandpass filter.
C. Functional MRI
The fMRI data was broken down into 5 different components, with target frequencies of 0.12, 0.055, 0.031, 0.020
and 0.0078 Hz. The frequency of the onset of each block
of activity in the experimental design was 0.033 Hz, corresponding to the 3rd component found by our algorithm
(shown in Figure 5). The highest percentage signal changes
are contained withing the green and blue clusters, while
other voxels behaving in synchrony but of lesser magnitude
(though still having a range above 0.5%) are shown in red.
Slices 12, 20 and 28 show activity corresponding to the
visual, auditory and motor cortex respectively as could be
expected from the experimental design.
IV. D ISCUSSION
III. R ESULTS
A. Number of IMFs
The total number of IMFs that make up a signal can be
a useful relative measure in its own right of whether it is
of possible interest or merely represents background noise.
In Figure 2(b) it can be seen that those areas of the image
where signals have been encoded can be decomposed into
fewer IMFs. This is to be expected since the less complex
a signal is the fewer component signals will make it up.
Due to the mixed modes in most IMFs, it may be useful
to perform a wider search for the best temporal scale,
incorporating the top 2 or 3 most powerful frequencies. This
would allow the process to be more resilient to aliasing from
other temporal scales. The technique could also be used
as part of an adaptive noise and artefact reduction step,
where groups of voxels found by the user to be due to
motion or other unwanted effects could be removed from
any further analysis, or have their signals used as unwanted
effects regressors in a later model driven analysis approach.
Figure 3. WCA applied to the phantom data. Each image is a different temporal scale. In a perfect decomposition each image would only show one row.
Effective frequency decreases from left to right in approximate powers of two. Colours do not correspond between scales.
Figure 4. The results of our multi-EMD based cluster analysis on the phantom data. The number of scales was chosen by our algorithm. An ideal temporal
decomposition would show only one row in each image. Colours do not correspond between scales.
A similar artefact removal process is often used in an
ICA based analysis. We have presented a novel approach
to the analysis of large groups of signals such as those
found in fMRI. Our method performs the Empirical Mode
Decomposition on many signals followed by the selection
and grouping of individual Intrinsic Mode Functions by
how similar their most powerful frequencies are to target
frequencies (themselves found from the data). This ensures
that the most ideal IMF from each signal is chosen since
different signals will often have a varying number of IMFs.
In brain imaging this will often be the case where separate
regions may have the same underlying neural response, but
experience different noise conditions. The process is datadriven, splitting the original data up into several components,
each at a different temporal scale. Further optimisation
based clustering may then be carried out on the rearranged
data. As such, multi-EMD based clustering appears to be
a promising approach in an exploratory and data-driven
analysis of functional brain imagery.
R EFERENCES
[1] N. E. Huang, Z. Shen, S. R. Long, M. C. Wu, H. H. Shih,
Q. Zheng, N.-C. Yen, C. C. Tung, and H. H. Liu, “The
empirical mode decomposition and the hilbert spectrum for
nonlinear and non-stationary time series analysis,” Proceedings
of the Royal Society A, vol. 454, no. 1971, pp. 903–995, 1998.
[2] A. McLennan, M. Brady, and M. Kelly, “Reconstruction
of clinical dPET data using empirical mode decomposition
temporal regularisation,” in Proceedings of Medical Image
Understanding and Analysis 2009, Kingston, July 2009.
[3] S. J. Loutridis, “Damage detection in gear systems using empirical mode decomposition,” Engineering Structures, vol. 26,
no. 12, pp. 1833–1841, Oct. 2004.
[4] C. M. Sweeney-Reed and S. J. Nasuto, “Detection of neural
correlates of self-paced motor activity using empirical mode
decomposition phase locking analysis,” Journal of Neuroscience Methods, vol. 184, no. 1, pp. 54–70, Oct. 2009.
[5] P. Ciuciu, P. Abry, C. Rabrait, H. Wendt, and A. Roche,
“Leader-based multifractal analysis for EVI fMRI time series:
Ongoing vs task-related brain activity,” in Proceedings of the
2007 IEEE International Symposium on Biomedical Imaging:
From Nano to Macro, Washington D.C., USA, April 2007.
[6] K. Friston, A. Holmes, K. Worsley, J. Poline, C. Frith, and
R. Frackowiak, “Statistical parametric maps in functional
imaging: A general linear approach,” Human Brain Mapping,
vol. 2, pp. 189–210, 1995.
Figure 5. Several slices from the 3rd multi-EMD scale. The algorithmically
found target frequency for this scale was 0.031Hz which is close to that of
the experimental activity (0.033 Hz). 8 clusters were used, with those having
a signal range of more than 0.5% being displayed here. The multi-EMD
algorithm has no knowledge of the spatial relationships between voxels.
[7] B. Whitcher, A. J. Schwarz, H. Barjat, S. C. Smart, R. I.
Grundy, and M. F. James, “Wavelet-based cluster analysis:
data-driven grouping of voxel time courses with application to
perfusion-weighted and pharmacological MRI of the rat brain,”
NeuroImage, vol. 24, no. 2, pp. 281–295, Jan. 2005.
[8] L. Friedman and G. H. Glover, “Report on a multicenter fMRI
quality assurance protocol,” Journal of Magnetic Resonance
Imaging, vol. 23, no. 6, pp. 827–839, 2006.