eAppendix for:
Distributed Lag Models
Examining Associations Between the Built Environment and Health
Jonggyu Baek, Brisa N. Sánchez, Veronica J. Berrocal, and Emma V. Sanchez-Vaznaugh
SIMULATION
We performed a small scale simulation study to improve our understanding of estimation and
inference of associations of interest in DLMs. In particular, we wanted to assess how estimates
of the associations of interest are affected by the degree of spatial correlation in the built
environment, and the shape of association between measured environment factors and an
outcome across distance. Further, we compared results obtained from DLMs to those obtained
from traditional approaches based on linear models whose goal is to estimate the average
association between features of the built environment and an outcome up to a-priori specified
distances.
For our simulations, we used as spatial domain the square (0, 500) × (0, 500). In the
square, we simulated food store locations (e.g., features of the built environment) by simulating
realizations from an inhomogeneous Poisson point process. The intensity of the inhomogeneous
Poisson process was taken to be a realization of a log Gaussian process with mean ๐๐ฅ , marginal
variance ๐๐ฅ2 , and exponential correlation function. In other words, the correlation between the log
intensity at two points on the 500 × 500 grid is given by ๐๐ฅ๐(โ๐/๐), where ๐ is the distance
between two points and ๐ is the decay parameter, i.e., the rate at which the correlation decays.
We considered three scenarios for the spatial variability of the intensity function: 1) the
marginal variance of the intensity function ๐๐ฅ2 is set equal to 0; this implies that the intensity is
constant over space and store locations are realizations of a homogeneous Poisson point process
with intensity equal to log(๐๐ฅ ); 2) ๐๐ฅ2 = 1 and ๐ = 5โ3; this corresponds to an intensity with a
spatial correlation that is equal to 0.05 when the distance between two points is equal to 5 units,
resulting in sampled food stores that display a small amount of clustering; and 3) ๐๐ฅ2 = 1 and
๐ = 20โ3; this corresponds to an intensity function with a correlation that decays to 0.05 at a
distance of 20 units, resulting in sampled food stores that display a large amount of clustering. In
each case, the mean of the log Gaussian process used to simulate the intensity of the
inhomogeneous Poisson process was taken to be equal to 0.15 (see Figure 2 in the manuscript).
For each of three built environment settings, we simulated one realization of the built
environment; however, given a realization of the built environment, we simulated 1000 datasets
with different locations for the health outcomes (e.g., the schools in our motivating application)
and different outcome values (e.g., childrenโs BMIz at the various schools).
To simulate schoolโs locations within the (0, 500) × (0, 500) region, we proceeded as
follows: we sampled ๐ โ {1,000, 6,000} schoolsโ (๐ฅ๐ , ๐ฆ๐ ) coordinates from a Uniform(0, 500)
distribution, for ๐ = 1, . . , ๐. Finally, after counting the number of locations in the built
environment around each outcome location, we obtained, for each location ๐, ๐๐ (๐๐โ1 ; ๐๐ ). We
used these coordinates to generate values of outcome ๐๐ by sampling from the model: ๐๐ =
โ๐ฟ๐=1 ๐ฝ(๐๐ )๐๐ (๐๐โ1 ; ๐๐ ) + ๐๐ , where ๐0 = 0, ๐๐ฟ = 10, ๐ฟ = 100 and ๐๐ ~ ๐(0, ๐ 2 ). We used two
function shapes for ๐ฝ(๐): 1) a step function given by ๐ฝ(๐) = 0.1 if ๐ โค 5 and 0 otherwise, which
results in the true data generating model ๐๐ = 0.1๐๐ (0; 5) + ๐๐ (Figure 3A in the manuscript),
and 2) a smooth function ๐ฝ(๐) = 0.1๐๐ (๐)/๐๐ (0), where ๐๐ (๐) is a normal density function with
mean 0 and standard deviation 5โ3 (Figure 3B in the manuscript). Note that that in the
traditional models used to study the effect of the built environment on health, the tacit
assumption is that the effect of the environment on health can be described by a step function of
distance; in other words, the association ๐ฝ(๐๐โ1 ; ๐๐ ) is deemed constant up to specified distance
๐๐ but is zero beyond ๐๐ . A step function ๐ฝ(๐) is likely unrealistic since it is hard to believe that
the association abruptly vanishes beyond distance 5, yet this assumption is frequently (implicitly)
made in practice. In contrast, the second function used for ๐ฝ(๐) implies that the association
decays smoothly with distance and is near zero by distance 5. We chose the variance ๐ 2 of the
error term so that the model ๐
2 was equal to either 0.2, 0.5 or 0.8 for the three different built
environment schemes. In our motivating example the number of available schools is near 6,000,
and the model ๐
2 was near 0.2 when the DLM was fitted without adjustment of confounders.
In fitting DLMs, we chose 100 lags, ๐ฟ = 100, with ๐๐ฟ = 10. We fitted the model within a
Bayesian framework and specified the following prior distributions: ๐ฝ0 โ 1, ๐ถ โ ๐,
๐1 ~ ๐(0, ๐๐2 ๐ฐ๐ฟโ2 ), ๐๐2 ~ ๐ผ๐บ(0.1, 1 × 10โ6 ), and ๐ 2 ~ ๐ผ๐บ(0.1, 1 × 10โ6 ). Details on posterior
inference and the MCMC algorithm are provided in the next section. For comparison, we also
fitted the traditional linear model, ๐๐ = ๐ฝ0 + ๐ฝ1 ๐๐ (0; ๐๐ ) + ๐๐ which assumes a constant effect of
the built environment up to a distance ๐๐ . We used ๐๐ = 2.5, 5, and 7.5, respectively, and
compared the estimate of ๐ฝ1 with the estimate of ๐ฝฬ
(0; ๐๐ ) obtained from the DLM for these three
distances.
To examine how well DLMs capture true buffer effects at given distance lags, bias,
variance, mean squared error (MSE), and coverage rate were calculated at each ๐๐ , ๐ = 1, 2, โฆ , ๐ฟ,
using the formulas:
ฬ
๐ต๐๐๐ (๐๐ ) = โ1000
๐=1 (๐ฝ๐ (๐๐โ1 ; ๐๐ ) โ ๐ฝ(๐๐โ1 ; ๐๐ ))โ1000,
ฬ ฬ
๐๐๐(๐๐ ) = โ1000
๐=1 ๐๐๐ (๐ฝ๐ (๐๐โ1 ; ๐๐ ))โ1000,
2
ฬ
๐๐๐ธ(๐๐ ) = โ1000
๐=1 (๐ฝ๐ (๐๐โ1 ; ๐๐ ) โ ๐ฝ(๐๐โ1 ; ๐๐ )) โ1000,
ฬ
ฬ
๐ถ๐๐ฃ๐๐๐๐๐(๐๐ ) = โ1000
๐=1 ๐ผ (๐ฝ๐,2.5% (๐๐ ) โค ๐ฝ(๐๐ ) โค ๐ฝ๐,97.5% (๐๐ )).
To summarize their overall performance and compare DLMs with classical regression models,
๐
we calculated the integrated MSE, ๐ผ๐๐๐ธ = ๐ฟ๐ฟ โ๐ฟ๐=1 ๐๐๐ธ(๐๐ ), for both models. In evaluating
IMSE for the classical regression models, ๐ฝฬ (๐๐โ1 ; ๐๐ ) was set equal to ๐ฝฬ1 for ๐๐ โค ๐๐ and zero
otherwise.
When the true DL coefficient function ๐ฝ(๐) is a step function, bias occurs around
distance lags where the step happens (eFigure 2A and 2B). Since the fitted DLM assumes that
the buffer effect is a continuous function of distance, bias at those lags is expected, and that
results in low coverage rates as well. When ๐ฝ(๐) varies continuously in r (eFigure 2C and 2D),
much less bias is present, and the bias primarily occurs at the smallest lags because the estimated
buffer effects are smoother than the true ๐ฝ(๐). Some degree of over-smoothing is expected to
occur when using random effect variances (vs GCV) to compute smoothing parameters.1 Also, at
the first few lags, there is relatively smaller amount of information since many DL covariates
๐๐ (๐๐โ1 ; ๐๐ ) in the first few lags have many zero values. Hence bias at smallest lags is expected.
Additionally, when the degree of clustering in the built environment becomes large, the range of
lags at which bias occurs becomes wider and coverage rates tend to be smaller.
For both functions ๐ฝ(๐), variance of the estimated buffer effects is larger at the first few
distance lags due to less information in DL covariates as previously explained. Note also that the
variance of the estimated coefficients at both end points (๐๐ = 0.1 and 10) tends to be larger than
for other values of ๐๐ because at the end points the coefficients are constrained only in one
direction. The estimated buffer effects are more variable when the spatial dependence in the
intensity function controlling the spatial distribution of the built environment features decays at a
slower rate. This can be anticipated because the amount of independent contributions of built
environment covariates ๐๐ (๐๐โ1 ; ๐๐ ) is decreased (compare panels eFigure 2A vs 2B, and 2C vs
2D). The MSE is primarily dominated by bias since the variance is fairly constant across a range
of distances, except at the endpoints, as mentioned above.
The comparison of estimated average association up to distance ๐๐ , with ๐๐ = 2.5, 5, and
7.5, obtained from DLMs and traditional linear models is reported in Table 1 of the manuscript.
The true average association up to distance ๐๐ , ๐ฝฬ
(0; ๐๐ ), is calculated using
2 )โ
โ๐๐=1 ๐ฝ(๐๐โ1 ; ๐๐ )๐(๐๐2 โ ๐๐โ1
๐๐๐2 . When locations of food stores are generated from a
homogeneous Poisson point process, the estimated associations from the traditional linear
models are very close to the true values and their coverage rates are close to 95% (i.e., valid
inference) for both functions used for ๐ฝ(๐). However, if there is clustering of locations in the
built environment, the estimated associations from the traditional models are positively biased
(away from the null) giving invalid inference unless the model is correctly specified (i.e., when
๐ฝ(๐) is the step function with ๐๐ = 5). In particular, when ๐๐ = 2.5, a huge amount of bias
occurs in the traditional models due to failure in adjusting the effects at longer lags. When the
buffer size selected was greater than the true buffer size in traditional models (e.g., ๐๐ = 7.5), the
amount of bias in estimates was smaller; however standard errors of the estimated coefficients
were grossly underestimated yielding invalid inference (e.g., very low coverage). Note that when
negative and positive bias is cancelled up to specified distances in the fitted DLMs, bias in
estimating the average buffer effect is close to zero (eFigure 2). In general, compared to the
traditional regression models, estimated average buffer effects obtained using DLMs generally
performed better having much less bias and better coverage rates except when the fitted
traditional models coincide with the true data generating models.
Since both the traditional regression models and the DLMs have some degree of bias, we
summarize their relative performance in terms of integrated mean squared error (IMSE) up to
distance ๐๐ฟ = 10 (eTable 1).1 When the true form of the ๐ฝ(๐) function is the step function, the
IMSE was minimum for the traditional regression models using the a-priori distance lag ๐๐ = 5,
which is not surprising since the estimated model is the data generating model. However, when
๐ฝ(๐) decays with distance ๐, the DLMs consistently yield the smallest IMSE.
To conserve space and avoid redundancy, here we only reported results for the simulation
setting with ๐ = 6000 and ๐
2 = 0.2, since this scenario corresponds to the data in our
motivating example. For the smaller sample sizes, bias and coverage rates of the DLM estimates
deteriorate, and the strong confounding bias in the traditional regression models persists. The
bias in the DLM is largely attenuated when the model ๐
2 increases, but this does not happen for
the traditional regression models.
To further examine assumptions used by the fitted DLMs we conducted additional
simulations: 1) we specified different numbers of lags, i.e., ๐ฟ = 25, 50, 200, to define ringshaped areas that differ from the ones (๐ฟ = 100) used in the data generating model; and 2) we
assumed different maximum distance ๐๐ฟ = 3, 20. As expected, using a smaller numbers of lags
in DLMs (๐ฟ = 25), resulted in smoother estimated DL coefficients because the DL coefficients
are estimated in wider ring shaped area and thus become coarser. A larger number of lags (๐ฟ =
200) yielded similar results as ๐ฟ = 100. When the maximum distance was misspecified and
๐๐ฟ = 3, we observed bias in the DL coefficients when there is clustering of locations in the built
environment. However, the amount of bias in estimates of the average buffer effect at ๐๐ = 2.5
was less than that from traditional regression models. Results were consistent to those with ๐๐ฟ =
10 when the maximum lag distance used to fit the DLMs was equal to 20.
ESIMATION
We constrain the coefficients ๐ฝ(๐๐โ1 ; ๐๐ ) to vary as a smooth function of distance ๐๐ , ๐ =
1, 2, โฆ , ๐ฟ, by using splines.2,3 This ensures that coefficients corresponding to adjacent areas are
similar, as we would not typically expect associations to change abruptly across distance. It also
alleviates possible numerical problems that may arise when many locations have zero food stores
between two given radii ๐๐โ1 and ๐๐ . In particular, we model the association coefficients
๐ฝ(๐๐โ1 ; ๐๐ ) using a radial basis function; that is
๐ฝ(๐๐โ1 ; ๐๐ ) = ๐ผ0 + ๐ผ1 ๐๐ + โ๐ฟ๐=1 ๐ผฬ๐ |๐๐ โ ๐๐ |3,
(1a)
1 ๐1
ฬ , where ๐ช0 = [ โฎ โฎ ], ๐ช1 =
In a matrix form, (1a) can be written as ๐ท = ๐ช0 ๐ถ + ๐ช1 ๐ถ
1 ๐๐ฟ
๐ผ
0
[|๐๐ โ ๐๐ |3 ]1โค๐,๐โค๐ฟ , ๐ถ = [๐ผ ], and ๐ถ
ฬ = (๐ผฬ1 , โฆ , ๐ผฬ๐ฟ )๐ . The coefficients ๐ถ
ฬ are penalized so the
1
squared second derivative of the estimated DL coefficient function is penalized. The objective is
ฬ ) โฅ2 subject to the
to minimize โฅ ๐ โ ๐๐ ๐ฝ0 โ ๐ฟ๐ท โฅ2 = โฅ ๐ โ ๐๐ ๐ฝ0 โ ๐ฟ(๐ช0 ๐ถ + ๐ช1 ๐ถ
ฬ ๐ ๐ช1 ๐ถ
ฬ โค ๐๐๐๐ ๐ก, and ๐ช๐0 ๐ถ
ฬ = 0. The latter constraint implies that there are really ๐ฟ
constrains ๐ถ
ฬ rather than ๐ฟ + 2 implied from the columns of ๐ช0 and ๐ช1 .1,4 As is well
free parameters ๐ถ and ๐ถ
ฬ = ๐ด1 ๐1 ,
known, the optimization problem can be re-written as a mixed model by redefining ๐ถ
where ๐ด1 is an ๐ฟ × (๐ฟ โ 2) orthogonal matrix to ๐ช0 , where ๐ด1 can be determined using the ๐๐
decomposition [๐ช0 ๐ช1 ] = ๐ธ๐ ๐น๐ and setting ๐ด1 as the 3rd to last columns of ๐ธ๐ .4 Further,
1/2
1/2
1/2
finding ๐ด2 that satisfies ๐ด2 = ๐ด2 ๐ด2 = ๐ด1๐ ๐ช1 ๐ด1 , and defining ๐1 through the
โ1/2
โ1โ2
transformation ๐1 to ๐ด2 ๐1, and re-structuring the data ๐ฟโ = ๐ฟ๐ช0 and ๐โ = ๐ฟ๐ช1 ๐ด1 ๐ด2 , the
mixed model becomes ๐ = ๐ฝ0 ๐๐ + ๐ฟโ ๐ถ + ๐โ ๐1 + ๐, where ๐ ~ ๐ต๐ (๐, ๐ 2 ๐ฐ) and
๐1 ~ ๐ต๐ฟโ2 (๐, ๐๐2 ๐ฐ). The smoothing parameter is ๐ = ๐ 2 /๐๐2 .
The mixed model can be fitted with packaged software for mixed models in the
frequentist framework. Once we have the estimates from the fitted regression, the estimates of
๐ถ
๐ถ
the DL coefficients can be obtained as ๐ท = ๐ [๐ ] and ๐ถ๐๐ฃ(๐ท) = ๐๐ถ๐๐ฃ ([๐ ]) ๐๐ where ๐ =
1
1
[๐ช0 ๐ช1 ๐ด1 ๐ด2โ1/2 ].
Alternatively, the model can be estimated in the Bayesian framework. With prior
distributions of ๐ฝ0 โ 1, ๐ถ โ ๐, ๐1 ~ ๐(0, ๐๐2 ๐ฐ๐ฟโ2 ) ๐๐2 ~ ๐ผ๐บ(๐๐ , ๐๐ ), ๐ 2 ~ ๐ผ๐บ(๐๐ , ๐๐ ), the full
conditionals are all available in closed forms. Let ๐ซโ = [๐๐ ๐ฟโ ๐โ ] =
โ2
[๐๐ ๐ฟ๐ช0 ๐ฟ๐ช1 ๐ด1 ๐ดโ1
], then the full conditional for ๐ฝ0 , ๐, ๐1 is ๐(๐ฝ0 , ๐, ๐1 | โ
) = ๐(๐, ๐บ),
2
โ1
where ๐บ = (๐ซโ ๐ ๐ซโ โ๐ 2 + ๐๐โ2 ๐ฎ) , ๐ฎ = ๐๐๐๐{๐3 , ๐๐ฟโ2 } and ๐ = ๐บ๐ซโ ๐ ๐โ๐ 2 . The full
conditional distribution for ๐๐2 is ๐(๐๐2 | โ
) = ๐ผ๐บ(๐๐ + (๐ฟ โ 2)โ2 , ๐๐ + ๐1 ๐ ๐ฎ๐1 โ2), while, the
full conditional distribution of ๐ 2 is ๐(๐ 2 | โ
) = ๐ผ๐บ(๐๐ + ๐โ2 , ๐๐ + (๐๐ ๐)โ2), where ๐ = ๐ โ
๐ซโ (๐ฝ0 , ๐, ๐1 )๐ . Inference for DL coefficients ๐ท is obtained by transforming posterior samples of
๐ถ
๐, ๐1 by ๐ [๐ ] with ๐ as described above. Inference for average lag effects, ๐ฝฬ
(0; ๐๐ ), can be
1
easily determined from posterior samples.
References
1.
Ruppert D, Wand MP, Carroll RJ. Semiparametric Regression. Cambridge University
Press; 2003.
2.
Hastie TJ, Tibshirani RJ. Generalized Additive Models. CRC Press; 1990.
3.
Zanobetti A, Wand MP, Schwartz J, Ryan LM. Generalized additive distributed lag
modelsโฏ: quantifying. Biostatistics. 2000;1(3):279-292.
4.
Green PJ, Silverman BW. Nonparametric Regression and Generalized Linear Models: A
Roughness Penalty Approac. CRC Press; 1993.
5.
Simon Wood. Generalized Additive Models: An Introduction with R. CRC Press; 2006.
eFigure 1. The estimated DL coefficients up to 7 miles from schools in the student characteristics
adjusted DLM. Food environment associations are only for (A) boys or (B) girls, and (C) the
difference of association by sex; associations are only for (D) non-Hispanic Whites or (E)
Hispanics, and (F) the difference of association by race/ethnicity; associations are only for (G)
5th grade children or (H) 7th grade children, and (I) the difference of association by grade.
eFigure 2. Bias, variance, MSE, and coverage rate at each ๐๐ , ๐ = 1, 2, โฆ , 100 for the cases when
๐ฝ(๐) is: (A) a step function under the built environment without clustering. (B) the step function
under the built environment with a large amount of clustering. (C) ๐ฝ(๐) is the normal pdf under
the built environment without clustering. (D) ๐ฝ(๐) is the normal pdf under the built environment
with a large amount of clustering. Reported results are from a simulation case with n = 6,000
and ๐
2 = 0.2.
eTable 1. Integrated MSE from fitted traditional linear models with distance lag ๐๐ = 2.5, 5, and
7.5 and from fitted DLMs with a maximum distance ๐๐ฟ = 10. Reported results are from a
simulation case with ๐ = 6,000 and ๐
2 = 0.2.
๐ฝ(๐)
Step
Curve
Spatial range
in the built
environment
Independence
5
20
Traditional
linear
model
(๐๐ =2.5)
Traditional
linear
model
(๐๐ =5)
Traditional
linear
model
(๐๐ =7.5)
DLM
IMSE*
2.512
7.500
12.883
IMSE*
0.004
0.004
0.003
IMSE*
2.055
1.861
1.945
IMSE*
0.125
0.269
0.360
0.790
0.743
0.767
1.113
1.060
1.083
0.006
0.010
0.013
Independence
0.205
5
0.169
20
0.179
* IMSE is multiplied by 100.
eTable 2. Simulation findings regarding the use of DIC for model selection. For each of 1000 datasets simulated for each scenario, we calculated DIC
for the pre-specified distances in Table 1 of the manuscript and the DLM. Because selection of traditional vs DL model may depend on the a-priori
specified distance, as a more comprehensive way to select the best traditional models, we also computed DIC for L traditional models using buffer
sizes ๐๐ โ {๐1 , โฆ , ๐๐ฟ }. The distance ๐(min DIC) that gave the traditional model with minimum DIC was selected as the โbestโ buffer size. The bias in
coefficients for the pre-specified buffer sizes is given in Table 1. Percent bias in the coefficient from the โbestโ traditional model,
๐ฬ1,๐๐๐๐(๐ท๐ผ๐ถ) compared to ๐ฝฬ
(0, ๐(min DIC) ), was calculated, as well as percent bias in the cumulative effect up to ๐(min DIC) computed from the DLM. The
minimum DIC value among traditional models was compared with the DIC from the DLM. DIC selected the model that generated the data in almost
all cases, except when the curve (Figure 3B in the manuscript) was used to generate data and there was lower power (e.g., n=1000). However, in
these cases, the estimates from even best traditional model (even when its DIC is lower compared to the DLM) remain more biased. While DIC may
select models that fit better, it may not select models that give unbiased effect estimates.
True
ฮฒ(r)
Step
Spatial
range in the
built
environment
Mean DIC from
N
DLM
traditional model with buffer size:
2.5 /
5/
7.5 /r(min DIC)
P(DLM is
selected)
via DIC
Mean of Mean %Bias in
r(min DIC)
Mean %Bias in
๐ฬ1,๐๐๐๐(๐ท๐ผ๐ถ)
chosen via
๐ฝฬฬ
(0, ๐(min DIC) )
from
DIC in
traditional
from DLM
traditional
model
model
5.0
5.4
9.4
5.0
2.3
3.8
0
1000
6000
2410
14391
2526 / 2394 / 2491 / 2393
15155 / 14353 / 14939 / 14353
0.01
0.00
5
1000
6000
4340
25988
4400 / 4333 / 4368 / 4332
26378 / 25973 / 26181 / 25972
0.03
0.01
5.0
5.0
7.4
2.3
12.5
7.6
20
1000
6000
4694
28146
4744 / 4689 / 4710 / 4687
28467 / 28133 / 28256 / 28132
0.03
0.01
5.0
5.0
8.1
2.3
14.0
8.7
0
1000
6000
186
1082
213 /
1276 /
277 / 314 /
1651 / 1877 /
204
1256
0.96
1.00
2.6
2.6
7.6
3.2
6.5
2.6
5
1000
6000
1643
9810
1652 / 1680 / 1721 /
9886 / 10054 / 10302 /
1643
9861
0.50
1.00
2.9
2.9
24.2
20.1
7.2
3.1
20
1000
6000
1793
10739
1802 / 1819 / 1845 / 1793
10813 / 10913 / 11071 / 10785
0.44
1.00
3.0
2.9
26.1
22.1
9.0
3.4
Curve
eTable 3. Results from simulation study that examines whether accounting for spatial patterning in the outcome resolves issues of bias in the
traditional model. Results are shown for simulation settings where true ๐ฝ(๐) is the curve shown in Figure 3A of the manuscript, ๐ โ
{1000, 6000}, and small or large spatial correlation in the built environment. Results from a traditional linear model including spatial
smoothing effects (model fitted is ๐๐ = ๐ฝ0 + ๐ฝ1 ๐๐ (0; ๐๐ ) + ๐ (๐ฅ๐, ๐ฆ๐ ) + ๐๐ , where ๐ (๐ฅ๐ , ๐ฆ๐ ) is a bivariate smoothing term of longitude (x) and
latitude (y) constructed using tensor product basis functions5) are compared to the traditional model without spatial terms. Summaries are
derived from 1000 simulated datasets. Accounting for the spatial patterning in the outcome does not resolve the bias in the estimated built
environment association. Est. beta is the mean of estimates in 1000 datasets. Coverage rate is the percent of 95% confidence intervals
including the true association. SD(beta) is the standard deviation of 1000 estimates. Mean(SE) is the mean of 1000 standard error estimates.
Spatial
range in the
built
environment
๐๐ = 2.5
๐๐ = 5
๐๐ = 7.5
Coverage
rate
-
SD*
(beta)
Mean*
(SE)
Est.
beta
0.021
Coverage
rate
-
SD*
(beta)
Mean*
(SE)
Est.
beta
0.010
Coverage
rate
-
SD*
(beta)
Mean*
(SE)
True value
Est.
beta
0.058
5
5
20
20
0.074
0.074
0.076
0.076
0.154
0.000
0.252
0.000
5.241
2.130
6.901
2.717
5.231
2.120
6.471
2.632
0.024
0.024
0.022
0.022
0.723
0.106
0.878
0.658
1.842
0.802
2.159
0.870
1.799
0.732
1.999
0.813
0.011
0.011
0.010
0.010
0.510
0.004
0.813
0.361
1.097
0.480
1.085
0.446
1.007
0.410
1.000
0.407
1000
5
0.074
0.152 5.234
6000
5
0.074
0.000 2.125
TLM
1000
20
0.076
0.216 6.782
6000
20
0.076
0.000 2.643
* SD(beta) and Mean(SE) are multiplied by 1000 for readability.
5.218
2.116
6.376
2.594
0.024
0.024
0.022
0.022
0.727
0.104
0.888
0.662
1.839
0.798
2.093
0.839
1.792
0.729
1.957
0.796
0.011
0.011
0.010
0.010
0.499
0.006
0.833
0.390
1.090
0.476
1.043
0.426
1.002
0.408
0.974
0.396
Model
TLM with
spatial
smoothing terms
N
1000
6000
1000
6000
© Copyright 2026 Paperzz