Trebuchet 48pt

EHS 655
Lecture 7:
Exposure grouping strategies
Exposure analysis in a nutshell
2
What we’ll cover today

Attenuation bias

Sampling issues and temporal trends

Exposure groups

Stata commands
3
ATTENUATION BIAS
Heederik, Attfield, 2000
4
Repeated measurements
Loomis, Kromhout, 2004
5
Ways to reduce attenuation bias

Increase number of exposure measurements per
person

Increase between-subject variability so interindividual exposure range is larger
6
SAMPLING ISSUES

Several different sampling strategies




Haphazard/convenience
Worst case
Representative
Random
Question: what are the
strengths and weaknesses of
each of these approaches?

How exposures were collected may determine what
we can infer from them

Also need to consider temporal trends
7
Sampling issues - temporal trends
Heederik, Attfield, 2000
8
Sampling issues – temporal trends
Davies, Teschke, Kennedy, Hodgson, Demers, 2008
9
Individual-level assessment

Exposure varies
greatly over time and
space

Measure individuals’
exposures


Use repeated
measurements on
individual to calculate
individual average
(only their data)
Often considered gold
standard approach
Ignacio and Bullock, AIHA, 2006
10
Question

If individual-level approach is gold standard, come up
with at least two reasons we might want to create
exposure groups?
11
Reasons we might not want individuallevel assessments

Even with repeated measurements, measured average
level is at best approximation of true exposure

N of repeated measurements per individual usually small
(scarcity of data)

Higher within-person variability and/or smaller interperson variability = more attenuation of exposureresponse relationship

Logistical/financial challenges

Lack of direct access to individuals
12
EXPOSURE GROUPS

To address scarce individual data, we commonly pool
data across individuals

Create group estimates

Increased amount of data in groups reduces random
variability associated with estimate

Apply group estimates to individuals who likely have
similar exposures

May not be truly applicable to all individuals in group
13
Group-level assessment

Create subgroups based on common features of
exposure


Ideally, all workers within each group measured
Must consider between-group, within-group, within-individual
variability
Tielemans, Kupper, Kromhout, Heederik, Houba, 1998
14
Advantages of grouping

Often more effective than individual-level assessment

Especially when temporal variability large

Logistically less demanding than individual approach

Should result in almost unbiased estimated
coefficients of exposure-response relationship

Attenuation very small with grouped exposure data
15
Grouping and attenuation
Seixas and Sheppard, 1996
16
Exposure groups

Originally “Homogeneous exposure groups” (HEGs)

Current trend is “Similar exposure groups” (SEGs)

Groups frequently defined by common structures



E.g., job title, work area, activity, behavior, agent, street,
etc
Individuals in different groups assumed to have different
exposures
Individuals within groups also have different exposures

Differences in activities, behaviors, protective equipment
17
Exposure groups

Statistical measures of central tendency often applied
to groups



Mean, median, mode
We treat every individual in the group as though they are
exposed at the this measure of central tendency
For categorical exposures, we assume groups are
exposed (1) or unexposed (0)

Assumes zero variability in groups
18
What do we need to think about when
grouping people?

Specificity



Summarizes variance within group
Large variance = highly specific
Precision


Summarizes variance between groups
Large variance = highly precise
19
Exposure grouping goals
Goal 1: create groups which
retain true individual
differences (specificity)

i.e., within-group
variability small compared
to between-group
variability
Goal 2: create groups that
are as large as possible
(precision)

Leads to exposure
estimates that are more
precise than
individual worker
means
20
Tradeoff between specificity and
precision

Individual-based strategy


Group-based strategy


Precise (small standard error) but biased estimates of
exposure-response relationship
Unbiased (high validity) but imprecise (large standard error)
estimates of exposure-response relationship
Validity, not precision, most important
21
Specificity vs precision (from the reading)
Precision
Specificity
Werner, Attfield, 2000
22
Specificity vs precision
Within-group variation
Between-group variation
Heederik, Attfield, 2000
23
Grouping strategies

A priori


Grouping takes place before measurement is made
A posteriori

Grouping takes place after measurement is made
24
Ways to create exposure groups

Group by







Activity type
Process
Agent
Exposure pathway
Location
Time
Etc.
25
Example: grouping by agent

Note variation in group size, exposure range, repeated
measurements
26
Example: grouping by job title
Lewne, Plato, Bellander, Alderling, Gustavsson, 2010
27
Example - grouping outcomes

Possibilities


Within group variability > between group variability
Between group variability > within group variability
Question: which
of these
grouping
approaches is
preferred?
Rappaport, Kromhout, Symanski, 1993
28
Group size and attenuation bias

Reducing attenuation bias

Increasing number of subjects per group can be as effective
as increasing number of measurements per subject
29
Question
70
80
90
100
110
What do you think of this as a grouping strategy, and why?
Carpenter
Electrician
Operator
Insulation
Cement Mason
Ironworker
Sheet Metal
Masonry
30
Resources

EPA ExpoBox

https://www.epa.gov/expobox
31
On to Stata

More basic data manipulation commands



Create dummy variables for groups
example: tabulate varname, gen(newvarname)
Let’s make or identify some groups and compare them
32
On to Stata

Bivariate analysis examples








tabulate varname1 varname2
tab2 varname1 varname2 varname3
tabstat varname, stat(mean sd count)
bysort varname1: tabstat varname2, stat(mean sd count)
table varname1, contents(mean varnamex sd varnamex)
by(varname2)
twoway scatter varname1 varname2
Graph matrix varname1 varname2 varname3, half
Graph box varname1, over(varname2)
33