Random Field Theory & Alternatives

Topological Inference
Guillaume Flandin
Wellcome Trust Centre for Neuroimaging
University College London
With thanks to Justin Chumbley and Tom Nichols
SPM Course
London, May 2015
𝑦=𝑋
Preprocessings
𝛽+πœ€
General
Linear
Model
𝛽 = 𝑋𝑇 𝑋
βˆ’1
Random
Contrast c Field Theory
Statistical
Inference
𝑋𝑇 𝑦
π‘‡πœ€
πœ€
𝜎2 =
π‘Ÿπ‘Žπ‘›π‘˜(𝑋)
𝑆𝑃𝑀{𝑇, 𝐹}
Statistical Parametric Maps
frequency
time
mm
fMRI, VBM,
M/EEG source reconstruction
M/EEG 2D time-frequency
time
mm
M/EEG 1D channel-time
M/EEG
2D+t
scalp-time
mm
mm
mm
time
Inference at a single voxel
u
Null Hypothesis H0:
zero activation
Decision rule (threshold) u:
determines false positive
rate Ξ±
οƒž Choose u to give acceptable
Ξ± under H0
Null distribution of test statistic T
𝛼 = 𝑝(𝑑 > 𝑒|𝐻0 )
Multiple tests
uu

u
t
t
t
t
t

u

uu







If we have 100,000 voxels,
Ξ±=0.05
οƒž 5,000 false positive voxels.
This is clearly undesirable; to correct
for this we can define a null hypothesis
for a collection of tests.
t
Noise
Signal
Multiple tests
uu

u
t
t
t
t
t

u

uu







If we have 100,000 voxels,
Ξ±=0.05
οƒž 5,000 false positive voxels.
This is clearly undesirable; to correct
for this we can define a null hypothesis
for a collection of tests.
t
Use of β€˜uncorrected’ p-value, Ξ± =0.1
11.3%
11.3%
12.5% 10.8% 11.5% 10.0% 10.7% 11.2%
Percentage of Null Pixels that are False Positives
10.2%
9.5%
Family-Wise Null Hypothesis
Family-Wise Null Hypothesis:
Activation is zero everywhere
If we reject a voxel null hypothesis at any voxel,
we reject the family-wise Null hypothesis
A FP anywhere in the image gives a Family Wise Error (FWE)
Family-Wise Error rate (FWER) = β€˜corrected’ p-value
Use of β€˜uncorrected’ p-value, Ξ± =0.1
Use of β€˜corrected’ p-value, Ξ± =0.1
FWE
Bonferroni correction
The Family-Wise Error rate (FWER), Ξ±FWE, for a family of N
tests follows the inequality:
π›ΌπΉπ‘ŠπΈ ≀ 𝑁𝛼
where Ξ± is the test-wise error rate.
Therefore, to ensure a particular FWER choose:
π›ΌπΉπ‘ŠπΈ
𝛼=
𝑁
This correction does not require the tests to be independent but
becomes very stringent if dependence.
Spatial correlations
100 x 100 independent tests
Spatially correlated tests (FWHM=10)
Discrete data
Spatially extended data
Bonferroni is too conservative for spatial correlated data.
0.05
10,000 voxels οƒž α𝐡𝑂𝑁𝐹 = 10,000οƒžπ‘’π‘ = 4.42 (uncorrected 𝑒 = 1.64)
Random Field Theory
οƒž Consider a statistic image as a discretisation of a
continuous underlying random field.
οƒž Use results from continuous random field theory.
lattice
representation
Topological inference
Peak level inference
Topological feature:
Peak height
space
Topological inference
Cluster level inference
Topological feature:
Cluster extent
uclus : cluster-forming threshold
uclus
space
Topological inference
Set level inference
Topological feature:
Number of clusters
uclus : cluster-forming threshold
uclus
space
c
RFT and Euler Characteristic
Euler Characteristic πœ’π‘’ :
 Topological measure
πœ’π‘’ = # blobs - # holes
 at high threshold u:
πœ’π‘’ = # blobs
πΉπ‘ŠπΈπ‘… = 𝑝 πΉπ‘ŠπΈ
β‰ˆ 𝐸 πœ’π‘’
Expected Euler Characteristic
2D Gaussian Random Field
𝐸 πœ’π‘’ = πœ† Ξ© Ξ›
Search volume
1 2
𝑒 exp(βˆ’π‘’2 /2)/(2πœ‹)3/2
Roughness
(1/smoothness)
100 x 100 Gaussian Random Field
with FWHM=10 smoothing
Ξ±πΉπ‘ŠπΈ = 0.05 οƒž 𝑒𝑅𝐹𝑇 = 3.8
(𝑒𝐡𝑂𝑁𝐹 = 4.42, π‘’π‘’π‘›π‘π‘œπ‘Ÿπ‘Ÿ = 1.64)
Threshold
Smoothness
Smoothness parameterised in terms of FWHM:
Size of Gaussian kernel required to smooth i.i.d. noise to have
same smoothness as observed null (standardized) data.
FWHM
RESELS (Resolution Elements):
1 RESEL = πΉπ‘Šπ»π‘€π‘₯ πΉπ‘Šπ»π‘€π‘¦ πΉπ‘Šπ»π‘€π‘§
RESEL Count R = volume of search region in units of smoothness
1
2
3
4
5
6
7
8
9
10
Eg: 10 voxels, 2.5 FWHM, 4 RESELS
2
3
4
The number of resels is similar, but not identical
to the number independent observations.
voxels
data matrix
scans
=
design matrix
1
ο‚΄
Y = X
?
parameters
+

+
errors
?
ο₯
variance
Smoothness estimated from spatial
derivatives of standardised residuals:
Yields an RPV image containing local roughness
estimation.
οƒœ estimate
οƒœ
parameter
estimates
^

οƒœ
ο‚Έ
=
residuals
estimated variance
estimated
component
fields
s2
Random Field intuition
Corrected p-value for statistic value t
𝑝𝑐 = 𝑝 max 𝑇 > 𝑑
β‰ˆ 𝐸 πœ’π‘‘
∝ πœ† Ξ© Ξ› 1 2 𝑑 exp(βˆ’π‘‘ 2 /2)
 Statistic value t increases ?
– 𝑝𝑐 decreases (better signal)
 Search volume increases ( () ↑ ) ?
– 𝑝𝑐 increases (more severe correction)
 Smoothness increases ( ||1/2 ↓ ) ?
– 𝑝𝑐 decreases (less severe correction)
Random Field: Unified Theory
General form for expected Euler characteristic
β€’ t, F & 2 fields β€’ restricted search regions β€’ D dimensions β€’
𝐷
𝐸 πœ’π‘’ (Ξ©) =
𝑅𝑑 (Ξ©)ρ𝑑 (𝑒)
𝑑=0
Rd (): d-dimensional Lipschitz-Killing
curvatures of  (β‰ˆ intrinsic volumes):
– function of dimension,
space  and smoothness:
R0() = () Euler characteristic of 
R1() = resel diameter
R2() = resel surface area
R3() = resel volume
rd (u) : d-dimensional EC density of the field
– function of dimension and threshold,
specific for RF type:
E.g. Gaussian RF:
r0(u) = 1- (u)
r1(u) = (4 ln2)1/2 exp(-u2/2) / (2p)
r2(u) = (4 ln2) u exp(-u2/2) / (2p)3/2
r3(u) = (4 ln2)3/2 (u2 -1) exp(-u2/2) / (2p)2
r4(u) = (4 ln2)2 (u3 -3u) exp(-u2/2) / (2p)5/2

Peak, cluster and set level inference
Sensitivity
Regional
specificity
Peak level test:
height of local maxima
Cluster level test:
spatial extent above u
Set level test:
number of clusters
above u
: significant at the set level
: significant at the cluster level
: significant at the peak level
L1 > spatial extent threshold
L2 < spatial extent threshold
οƒŒ
οƒŒ
Random Field Theory
 The statistic image is assumed to be a good lattice
representation of an underlying continuous stationary
random field.
Typically, FWHM > 3 voxels
(combination of intrinsic and extrinsic smoothing)
 Smoothness of the data is unknown and estimated:
very precise estimate by pooling over voxels οƒž stationarity
assumptions (esp. relevant for cluster size results).
 A priori hypothesis about where an activation should be,
reduce search volume οƒž Small Volume Correction:
β€’
β€’
β€’
β€’
mask defined by (probabilistic) anatomical atlases
mask defined by separate "functional localisers"
mask defined by orthogonal contrasts
(spherical) search volume around previously reported coordinates
Conclusion
 There is a multiple testing problem and corrections have
to be applied on p-values (for the volume of interest only
(see SVC)).
 Inference is made about topological features (peak height,
spatial extent, number of clusters).
Use results from the Random Field Theory.
 Control of FWER (probability of a false positive anywhere
in the image) for a space of any dimension and shape.
References
 Friston KJ, Frith CD, Liddle PF, Frackowiak RS. Comparing functional
(PET) images: the assessment of significant change. J Cereb Blood Flow
Metab, 1991.
 Worsley KJ, Marrett S, Neelin P, Vandal AC, Friston KJ, Evans AC. A
unified statistical approach for determining significant signals in images of
cerebral activation. Human Brain Mapping, 1996.
 Taylor JE and Worsley KJ. Detecting Sparse Signals in Random Fields,
with an Application to Brain Mapping. J. Am. Statist. Assoc., 2007.
 Chumbley J & Friston KJ. False Discovery Rate Revisited: FDR and
Topological Inference Using Gaussian Random Fields. NeuroImage, 2008
 Chumbley J, Worsley KJ , Flandin G, and Friston KJ. Topological FDR for
neuroimaging. NeuroImage, 2010.
 Flandin G &Friston KJ. Topological Inference. Brain Mapping: An
Encyclopedic Reference, 2015.