prior distribution

Supplementary Materials to
Development of a diagnostic test based on multiple
continuous-biomarkers based on an imperfect reference-test.
Leandro García Barrado a, Els Coart b, Tomasz Burzykowski a,b, for the Alzheimerโ€™s
Disease Neuroimaging Initiative*
a
Interuniversity Institute for Biostatistics and statistical Bioinformatics (I-BioStat), Hasselt
University, Agoralaan Building D, 3590 Diepenbeek, Belgium
b
International Drug Development Institute (IDDI), Avenue Provinciale 30, 1340 Louvain-laNeuve, Belgium
*
Data used in preparation of this article were obtained from the Alzheimerโ€™s Disease Neuroimaging
Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to
the design and implementation of ADNI and/or provided data but did not participate in analysis or
writing of this report. A complete listing of ADNI investigators can be found at:
http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf.
Appendix A: The ๐‘†๐‘’ + ๐‘†๐‘ > 1 restriction
To implement the ๐‘†๐‘’ + ๐‘†๐‘ > 1 restriction, a joint distribution for ๐‘†๐‘’ and ๐‘†๐‘ has to be
assumed. The chosen form of the joint distribution will have consequences for the marginal ๐‘†๐‘’
and ๐‘†๐‘ priors because of the potential dependence induced by the joint distribution.
One might attempt to implement the restriction while retaining the flat standard uniform
marginal priors for ๐‘†๐‘’ and ๐‘†๐‘. This is, however, not possible. A formal argumentation could be
built considering the copula representation of the joint distribution function of two standard
uniform marginal distribution functions. From the representation it follows that the joint
1
distribution function has to be bounded by the Fréchet lower (๐ถ๐ฟ ) and upper (๐ถ๐‘ˆ ) bound
copulas. These are expressed as follows:
๐ถ๐ฟ (๐‘ข1 , ๐‘ข2 ) = max{0, ๐‘ข1 + ๐‘ข2 โˆ’ 1} , ๐’– โˆˆ [0,1]2 , and
๐ถ๐‘ˆ (๐‘ข1 , ๐‘ข2 ) = min{๐‘ข1 , ๐‘ข2 } , ๐’– โˆˆ [0,1]2 .
It can be shown that, for ๐ถ๐ฟ we get ๐‘(๐‘ข1 + ๐‘ข2 = 1) = 1, while for ๐ถ๐‘ˆ we have ๐‘(๐‘ข1 + ๐‘ข2 >
1) = 0.5. In other words, no joint distribution with standard uniform margins will lead to
๐‘(๐‘ข1 + ๐‘ข2 > 1) = 1 for the desired restriction.
For the independent ๐‘†๐‘’ and ๐‘†๐‘ case, one could think of two uniform marginal distributions of
๐‘†๐‘’ and ๐‘†๐‘ restricted to be strictly higher than 0.5 to ensure that their sum exceeds 1.
To allow ๐‘†๐‘’ or ๐‘†๐‘ assuming values smaller than 0.5, a possible solution is to use a standard
uniform marginal distribution for ๐‘†๐‘’ or ๐‘†๐‘ and define a suitable conditional distribution of the
other parameter. For instance,
๐‘†๐‘’ ~๐‘ˆ(0,1)
๐‘†๐‘|๐‘†๐‘’ ~๐‘ˆ(1.001 โˆ’ ๐‘†๐‘’, 1).
Note that this solution is โ€˜asymmetricโ€™ in that one of the parameters is selected to be uniformly
distributed on the (0,1) interval. Obviously, other implementations of the ๐‘†๐‘’ + ๐‘†๐‘ > 1
restriction are also possible. They will differ in terms of informativeness of the marginal ๐‘†๐‘’ and
๐‘†๐‘ distributions and of the amount of dependence between ๐‘†๐‘’ and ๐‘†๐‘. The choice of the
2
implementation may require a careful consideration of the characteristics of the particular
problem at hand.
3
Appendix B: The ๐ด๐‘ˆ๐ถ๐‘Ž -prior approximation
Under the reparameterized ๐ด๐‘ˆ๐ถ๐‘Ž -prior definition it can be shown that when assuming that:
๐œน~๐‘ต๐’Œ (๐œฟ, ๐œณ),
the implied ๐ด๐‘ˆ๐ถ๐‘Ž prior distribution can be approximated based on quadratic forms theory [B1,
B2]. In particular, the distribution can be represented by an expansion in non-central chi-square
distributions:
โˆž
2
๐‘“(๐‘ฅ) = โˆ‘ ๐‘๐‘˜ ๐œ’(๐‘+2๐‘˜;๐œ)
(๐œ™ โˆ’1 (๐‘ฅ)2 ) × |
๐‘˜=0
2๐œ™ โˆ’1 (๐‘ฅ)
|;
๐œ™(๐œ™ โˆ’1 (๐‘ฅ))
๐‘“๐‘œ๐‘Ÿ ๐‘ฅ โˆˆ [0.5,1]
With
๐‘
๐œ = โˆšโˆ‘ ๐‘๐‘—2
๐‘—=0
1
๐‘
1 2
๐‘0 = โˆ ( )
๐œ†๐‘—
๐‘—=1
๐‘˜โˆ’1
๐‘๐‘˜ =
1
โˆ‘ ๐‘‘๐‘˜โˆ’๐‘Ÿ ๐‘๐‘Ÿ
2๐‘˜
๐‘Ÿ=0
๐‘
๐‘‘1 = โˆ‘(1 โˆ’ ๐‘๐‘—2 )(1 โˆ’
๐‘—=1
๐‘
๐‘˜
๐‘ƒ
1
)
๐œ†๐‘—
๐‘˜โˆ’1
๐‘๐‘—2
1
1
๐‘‘๐‘˜ = โˆ‘ (1 โˆ’ ) + ๐‘˜ โˆ‘ ( ) (1 โˆ’ )
๐œ†๐‘—
๐œ†๐‘—
๐œ†๐‘—
๐‘—=1
๐‘—=1
1
โ€ฒ
where ฮป1 , โ€ฆ , ฮปp are the eigenvalues of ฮจ and the vector bโ€ฒ = (b1 , โ€ฆ , bp ) = (P โ€ฒ ฮจ โˆ’2 ฮš) , which is
a by-product of diagonalizing ฮจ by ๐‘ƒ, the ๐‘ × ๐‘ orthonormal matrix of eigenvectors of ฮจ. By
4
using a finite number of terms in the series expansion, it is possible to approximate the
distribution of ๐ด๐‘ˆ๐ถ๐‘Ž for different choices of ๐›‹ and ฮจ, with arbitrary precision.
For instance, assume that ๐œฟ = (0,0,0)โ€ฒ and that standard deviations and correlation coefficients
resulting from the variance-covariance matrix ๐œณ vary between 0.1 and 1, and 0 and 0.9,
respectively. Figure B1 presents the histograms of 100,000 simulated values of the ๐ด๐‘ˆ๐ถ๐‘Ž for the
90 resulting combinations of standard deviations and correlations. Additionally, the figure
presents approximations computed from the appropriate expansion of 200 non-central chisquare distributions. From the figure it can be seen that the approximations correspond quite
closely to the histograms when standard deviations are larger than 0.4 and correlation
coefficients are smaller than 0.7. For more extreme values, the approximation tends to break
down at the higher tail of the distribution. Despite the issues for some of the cases, the seriesexpansion can be used to explore the specified prior distribution for the ๐ด๐‘ˆ๐ถ๐‘Ž .
5
Density
Standard deviation
0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
Correlation
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9
AUC
Figure B1: Histogram (grey) and approximated densities (solid black line) of the simulated AUC
values.
[B1] Mathai AM, Provost SB. Quadratic forms in random variables. Marcel Dekker: Inc., New York, USA,
1992.
[B2] Ruben H. Probability content of regions under spherical normal distributions, IV: the distribution of
homogeneous and non-homogeneous quadratic functions of normal variables. The Annals of
Mathematical Statistics 1962; 33:542-570.
6
Appendix C: Implied ๐œ‡1 -prior distribution
Figure C1 shows the implied prior distribution for the components of parameter vector ๐1 . The
histograms shown in the figure are derived by simulating 100,000 values of ๐1 = ๐‘ธโˆ’1 ๐œน + ๐0
based on the proposed prior distributions for ๐œ‡0 , ๐›ด0 , ๐›ด1 , and ๐›ฟ. The solid lines in the panels
shown in Figure C1 represent the assumed flat priors for the components of ๐0 (note that the xaxis ranges from -15,000 to 15,000).
-15000
-5000
0
5000
10000
15000
-15000
0.00012
Density
0
1
0.00008
1
0.00000
0.00004
0
0.00000
0.00004
Density
0.00008
1
0.00004
0
Prior-distributions
mean components biomarker 3
0.00008
0.00012
Prior-distributions
mean components biomarker 2
0.00000
Density
0.00012
Prior-distributions
mean components biomarker 1
-5000
0
5000
10000
15000
-15000
-5000
0
5000
10000
Figure C1: Simulated prior distribution for ฮผ1 (histogram) based on proposed prior distributions
for ๐œน, ๐œฎ0 , and, ๐œฎ1 , and ๐0 (solid line).
7
Appendix D: Se/Sp-prior distribution sensitivity
Results of the analysis of the ADNI data using two different forms of the prior distribution for Se
and Sp.
Se/Sp Prior
Parameter
๐‘บ๐’† ~ ๐‘ฉ๐’†๐’•๐’‚(๐Ÿ, ๐Ÿ)
๐‘ฉ๐’†๐’•๐’‚(๐Ÿ, ๐Ÿ)๐‘ป[๐ŸŽ. ๐Ÿ“๐Ÿ, ๐Ÿ)
๐‘บ๐’‘|๐‘บ๐’† ~๐‘ฉ๐’†๐’•๐’‚(๐Ÿ, ๐Ÿ)๐‘ป[๐Ÿ. ๐ŸŽ๐ŸŽ๐Ÿ โˆ’ ๐‘บ๐’†, ๐Ÿ)
0.983
[0.959,0.994]
0.984 [0.959,0.994]
๐‘จ๐‘ผ๐‘ช๐’‚
0.826 [0.726,0.908]
0.825 [0.726,0.905]
๐‘บ๐’†
0.887
[0.803,0.949]
0.888 [0.805,0.951]
๐‘บ๐’‘
0.498 [0.412,0.584]
0.499 [0.414,0.584]
๐œฝ
8
Appendix E: Probability of AD of the resulting diagnostic score
Figure E1 shows the probabilities of AD based on the results from the proposed Bayesian latentclass model fitted to the ADNI dataset. Panel ๐‘Ž of Figure E1 shows the estimated probability of
AD for the resulting score based on the optimal linear-combination of the three biomarkers by
clinical diagnosis. Panel ๐‘ of Figure E1 illustrates the posterior probability of AD based on the
combined information from the biomarkers and clinical diagnosis, by clinical diagnosis.
a.
b.
10
12
14
16
18
20
0.2
0.4
0.6
0.8
ooooooooo
oo
ooooo
o
oooooooooo
+o+oooo o o o
+ooo
+oooo
ooooo
++
ooo o+ +
+
+
o
oo
+
oo
+
o
+
o
+
+
+
o
+
o
+
+
+
o o ++
++ +
++++
+++
+
+
+++
o
+++
o
o+++ +++++++++o
+++++++
+++++++
+++
++
+++++o+
+++++
o
0.0
Posterior probability of AD
0.8
0.6
0.4
0.2
ClDiag=0
ClDiag=1
0.0
Estimated probability of AD
1.0
Posterior probabilities by clinical diagnosis
1.0
Estimated probabilities by clinical diagnosis
10
Diagnostics index
12
14
+ ClDiag=0
o ClDiag=1
16
18
20
Diagnostic index
Figure E1: Probability of AD for estimated optimal combination score by clinical diagnosis.
Clinical controls indicated in grey, clinical cases in black. a. Estimated probability of AD. b.
Posterior probability of AD.
9