Confidence limits for the ratio of two rates based on likelihood scores

STATISTICS IN MEDICINE
Statist. Med. 2003; 22:2071–2083 (DOI: 10.1002/sim.1405)
Condence limits for the ratio of two rates based
on likelihood scores: non-iterative method
P. L. Graham1; ∗; † , K. Mengersen1 and A. P. Morton2
1 Statistics;
School of Mathematical and Physical Sciences; University of Newcastle;
Callaghan NSW 2308; Australia
2 Infectious Diseases Department; Princess Alexandra Hospital; Brisbane Qld 4102; Australia
SUMMARY
This paper describes a method for creating a condence interval for the ratio of rates using the score
statistic. This non-iterative and easy to apply procedure produces condence intervals that are suitable
for use with Poisson data and simulation results indicate that it is close to the nominal level for a wide
range of scenarios. Copyright ? 2003 John Wiley & Sons, Ltd.
KEY WORDS:
condence interval, coverage probability; rate ratio; Poisson rates
1. INTRODUCTION
Rates are commonly used to describe the frequency of an event and are widely used in health
and many other applications. For example, consider the description of independent hospital
acquired infections such as device related bacteremias. Here, it is important to monitor the
number of such infections over particular time periods, but the baseline may dier over weeks,
months, or years depending on the size and practice of the hospital and the incidence of the
infection. Hence the rate, rather than a simple count, provides a standardized measure that can
be conveniently monitored or compared. A second example, also borrowed from biostatistics,
is taken from Rothman and Greenland [1] and involves the comparison of incidence of breast
cancer. Here two groups of women were compared to discover whether those who had been
examined using x-ray uoroscopy during treatment for tuberculosis had a higher rate of breast
cancer than those who had not been examined in that way.
It is a common aim in such applications not only to compute rates but to compare them.
Typically, such a comparison takes the form of a dierence or a ratio. While the latter loses
the absolute size of the comparison, it has a strong advantage in providing a natural reference
∗
Correspondence to: P. L. Graham, Statistics, School of Mathematical and Physical Sciences, University of
Newcastle, Callaghan NSW 2308, Australia.
† E-mail: [email protected]
Contract=grant sponsor: Australian Research Council SPIRT; contract=grant number: C10024120.
Published online 18 March 2003
Copyright ? 2003 John Wiley & Sons, Ltd.
Received September 2001
Accepted August 2002
2072
P. L. GRAHAM, K. MENGERSEN AND A. P. MORTON
point of unity. An example of a widely employed ratio is the relative risk used in areas such
as evidence-based medicine. The statistical vehicle for comparison may be a hypothesis test
of equality of the rates (or, equivalently, that the ratio is equal to unity), but considerably
more information can be gained from a condence interval for the ratio.
To date, focus has been concentrated on developing condence intervals for rate ratios under
the assumption that the underlying rates have a binomial distribution. Hence the probability,
p, of an event is described through P(x) = ( nx )p x (1 − p)n−x where x = 0; 1; : : : ; n is the number
of events of interest observed and n is the total number of observed events. In a comparison
of two such proportions, p0 and p1 , say, a condence interval may be based on the ratio
p1 =p0 . A widely favoured maximum likelihood approach to the derivation of such condence
intervals uses score methods [2–5], however these methods require iterative calculation for
their solution. Nam [2] recently introduced a non-iterative approach which is further discussed,
along with the other score-based methods, in Section 2. Many other approaches are also
available such as large-sample Wald-type approximations using logs and other derivations of
score-based intervals [6, 7] but these will not be discussed further in this paper.
For rare events, or in the situation where denominator data may be unavailable or dicult
to collect, a Poisson distribution may better describe the behaviour of the observed
events
x
or be an adequate approximation. The Poisson distribution is dened as P(x) = x! e− where
x = 0; 1; 2; : : : is the number of events and is the rate of an event. Now interest is in the
ratio of two such rates, 1 = 0 , and its associated condence interval.
Condence intervals on the ratio of Poisson rates have been developed by Jaech [8] and
Rothman and Greenland [1], as described in Section 2. However, of the two alternatives
proposed in Jaech’s paper, one of these initially treats the data as binomial and the other uses
a transformation of the Poisson statistic. Rothman and Greenland on the other hand use a
large-sample Wald-based approximation.
The aim of this paper is thus to develop condence intervals for the ratio of two independent
rates with underlying Poisson distributions using a non-iterative likelihood score method. The
approach is a direct but useful generalization of Nam’s [2] development under the binomial
assumption. We show by example a comparison of the new interval in contexts of interest in
Section 4.
The paper proceeds as follows. Section 2 provides a background on the relevant methods
developed to date for condence intervals of ratios of rates under binomial and Poisson
assumptions. Section 3 presents the new non-iterative condence intervals assuming Poisson
distributions for the rates. In Section 4 an illustration is given and the result is compared with
those obtained from alternative methods. Finally, a simulation study in Section 5 reveals the
true level of the condence interval for a range of values and allows comparison to be made
between the proposed condence interval and alternative condence intervals.
2. PREVIOUS WORK ON CONFIDENCE INTERVALS FOR
THE RATIO OF RATES
Much of the earliest work on condence intervals using likelihood score methods was by
Bartlett [3]. Bartlett’s theoretical development was subsequently used by Gart [4] in his development of a condence interval for the ratio of two independent binomial proportions. Gart
and Nam [5] further developed this work by implementing a skewness correction. However
the solution of both condence interval formulae requires an iterative calculation that could
Copyright ? 2003 John Wiley & Sons, Ltd.
Statist. Med. 2003; 22:2071–2083
CONFIDENCE LIMITS FOR THE RATIO OF TWO RATES
2073
be computationally laborious, and can be dicult with certain data sets (see Nam [2] for
details). Thus Nam [2] derived a non-iterative method to calculate the condence interval for
the ratio of two binomial proportions. All of the above score methods for proportions use the
same procedures initially:
Take two binomial variables x 0 and x1 with corresponding sample sizes n0 and n1 and
parameters p0 and p1 . Dene to be p1 =p0 and let n· = n0 + n1 and qi = 1 − pi for i = 0 or
1. They reparameterize the log-likelihood by p1 = p0 giving
ln L = (x 0 + x1 ) ln p0 + (n0 − x 0 ) ln q0 + x1 ln + (n1 − x1 ) ln(1 − p0 )
and nd that the score for is
S (; p0 ) =
x1 − n1p1
q1 with variance var{S (; p0 )} = [2 {q0 =(n0 p0 ) + q1 =(n1p1 )}]−1 . The approximate 1 − condence limits are the two solutions to
(x 0 − n0 p̃0 )2
n0 (1 − p̃0 )
2
2
1+
= z=2
(1)
X () =
n0 p̃0 q̃0
n1 q̃0
where p̃0 is the maximum likelihood estimate of p0 in
Sp0 (; p0 ) =
x 0 − n0p0 x1 − n1p1
+
q0
q1
(2)
At this point Gart [4] and Gart and Nam [5] nd the approximate condence limits satisfying
(1) and (2) using an iterative procedure whereas Nam [2] uses a series of substitutions to
nd the condence limits analytically.
Various methods for calculating the condence interval for the ratio of two independent
Poisson rates have also been developed. Now taking x 0 and x1 to be independent Poisson
variables, with associated rates 0 and 1 and denominators n0 and n1 , Jaech [8] describes
two dierent methods for nding a condence interval of the ratio = 1 = 0 . The rst (we
call this the conditional binomial method) treats the data as binomial by initially ignoring
the Poisson denominators and instead using x1 =(x 0 + x1 ). Tables or software are used to nd
condence limits (denoted L and U ) on the new numerator and denominator and the resultant
interval, [B lower = n2 L=n1 (1 − L); B upper = n2 U=n1 (1 − U )], is found to be conservative (that is,
the true ratio, , is rejected less often than expected). The second method (we call this
method Jaech’s transformation), which may be used as an alternative to the score method,
uses a transformation of the Poisson statistic to approximate
√ a standard√normal distribution
when is suciently large. The transformation used is 2[ (x + 0:5) − ] and after some
manipulation the limits, k lower and k upper are found through
√[(x + 0:5)(x + 0:5)] ± 0:5z √[x + x + 1 − (z 2 =4)]
√
1
0
1
0
=2
n0
=2
k=
2 =4)
n1
x 0 + 0:5 − (z=2
At the nominal = 0:05 level Jaech nds this interval to be less conservative than the conditional binomial method and sometimes a little liberal (that is, the true ratio, , is rejected
more often than expected).
Copyright ? 2003 John Wiley & Sons, Ltd.
Statist. Med. 2003; 22:2071–2083
2074
P. L. GRAHAM, K. MENGERSEN AND A. P. MORTON
Rothman and Greenland [1] describe a large-sample Wald-based approximation. Here again
x 0 and x1 are taken to be independent Poisson variables with respective denominators n0 and
n1 and associated rates 0 and 1 . The Wald condence limits are then found through


1 
x1 =n1
1

± z=2
+
explog

x 0 =n0
x1 x 0
Rothman and Greenland state that under certain conditions based on sample sizes that this
condence interval should be ‘adequate’, although they do not describe what they mean by
this term.
For many of the above intervals situations arise for which the intervals cannot be sensibly
calculated. Naturally, for all of these intervals nothing meaningful may be calculated when
both x 0 and x1 are 0. In other circumstances there are a variety of ways for intervals to have
aberrations. When these situations are encountered we advise following the directions of Gart
and Nam [5] or Koopman [7] in which either the oending x 0 or x1 is redened as 0.5 or
the lower and upper condence bounds are set at 0 and ∞, respectively.
Note that Newcombe [9] provides a useful discussion on these problems with respect to a
variety of condence intervals for single proportions.
3. CONFIDENCE LIMITS FOR THE RATIO OF RATES UNDER THE ASSUMPTION
OF POISSON DISTRIBUTIONS
Consider two independent Poisson variables x 0 and x1 with corresponding parameters 0 and
1 . The log-likelihood for this is
ln L( 0 ; 1 ) = x 0 ln 0 − ln(x 0 !) − 0 + x1 ln 1 − ln(x1 !) − 1
Since in practice rates may be computed using dierent denominators we allow for this by
n0 1
1
dening = 10=n
=n0 = n1 0 as the ratio of the standardized rates and let x· = x 0 + x1 . Because n0
and n1 are strictly greater than zero in any practical situation, we assume n0 ; n1 ¿0 for the
remainder of this paper. If the rates are already standardized this reduces to (the more usual)
= 10 .
The log-likelihood is reparameterized by 1 = 0 nn10 so that it becomes a function of and
0 where is now the parameter of interest and 0 is now a nuisance parameter
n1
n1
− ln(x 0 !x1 !)
ln L(; 0 ) = x· ln 0 − 0
+ x1 ln − 0 + x1 ln
n0
n0
Now the score for is
S (; 0 ) =
=
Copyright ? 2003 John Wiley & Sons, Ltd.
@ ln L(; 0 )
@
x1 n1
− 0
n0
(3)
Statist. Med. 2003; 22:2071–2083
CONFIDENCE LIMITS FOR THE RATIO OF TWO RATES
2075
and the score for 0 is
@ ln L(; 0 )
@ 0
x 0 + x1
n1
=
−
−1
0
n0
S 0 (; 0 ) =
(4)
To apply Bartlett’s method [3] we need the maximum likelihood estimator of 0 given ,
which is the solution to [S 0 (; 0 )] 0 = ˜ 0 = 0, that is
x 0 + x1
˜ 0 = n1
n0 + 1
(5)
and (using notation by Gart [4]) we nd
var{S (; ˜ 0 )} = I (; ˜ 0 ) −
=
1
2 ( nn10 + 1)
(I 0 (; ˜ 0 ))2
I 0 0 (; ˜ 0 )
where I (; ˜ 0 ) = 1 =2 is the variance of (3), I 0 0 (; ˜ 0 ) = ( nn10 + 1)= ˜ 0 is the variance of
(4) and I 0 (; ˜ 0 ) = nn10 is the covariance of (3) and (4). It is interesting to note here, that of
the three elements of the I matrix, it is only I 0 0 that involves the optimization of 0 .
Now, approximate 1 − condence limits are the two solutions to the equation
X 2 () =
{S (; ˜ 0 )}2
var{S (; ˜ 0 )}
2
= z=2
Substituting, we nd, for x 0 ¿0
n0
L =
n1
n0
U =
n1
2
x· −
2x 0 x1 + z=2
√
2
2
{z=2
x· (4x 0 x1 + z=2
x· )}
(6)
2x20
2
2x 0 x1 + z=2
x· +
√
2
2
{z=2
x· (4x 0 x1 + z=2
x· )}
2x20
(7)
Note that by earlier discussion n0 and n1 are greater than zero and hence is greater than
zero. In the special case x 0 = 0 one might follow the directions discussed in the previous
section in which we either redene x 0 as x 0 = 0:5 or dene the upper and lower condence
limits as 0 and ∞, respectively. Note also that although the previous situation had special
cases, setting @ ln L=@ 0 = 0 leads to the maximum likelihood estimator of 0 given as
shown in (5), always, with no special cases unlike related situations in Newcombe [10, 11]
and Tango [12].
Copyright ? 2003 John Wiley & Sons, Ltd.
Statist. Med. 2003; 22:2071–2083
2076
P. L. GRAHAM, K. MENGERSEN AND A. P. MORTON
4. EXAMPLE
This illustration examines an example used in Rothman and Greenland [1]. Here two groups
of women were compared to discover whether those who had been examined using x-ray
uoroscopy during treatment for tuberculosis had a higher rate of breast cancer than those
who had not been examined using x-ray uoroscopy. In the given data set, the treatment
group consisted of women who had received x-ray uoroscopy and this group had 41 cases
of breast cancer in 28 010 person-years at risk. The control group consisted of women who
had not received x-ray uoroscopy and in this group there were 15 cases of breast cancer in
19 017 person-years at risk. In summary we have
x 0 = 15;
n0 = 19 017;
x1 = 41
and
n1 = 28 010:
Hence a 95 per cent condence interval on using (6) and (7) is [1:0358; 3:3249].
Because this data set has large denominators and small rates it is reasonable to consider
a comparison of the above interval with alternatives based on the binomial assumption [14].
Strictly speaking, methods such as that of Nam, which assume underlying binomial distributions, are inapplicable when denominators are person-years at risk. We employ it here simply
for comparison. Applying Nam’s method, which is the binomial equivalent of the above approach, we nd the condence interval to be [1.0361, 3.3241] which is almost the same as
that derived under the Poisson score method. Gart and Nams’ [5] score-based condence interval with skewness correction for binomial proportions gives a condence interval of [1.0440,
3.4364]. This interval has a wider upper limit than our original interval perhaps due to the
method or the skewness correction. Using Minitab [13] to nd binomial condence limits on
41=56 we then nd that Jaech’s [8] conditional binomial method gives an approximate interval
of [1.0057, 3.6093]. This is, as expected, wider than that obtained under the other methods.
Using Jaech’s transformation on the Poisson statistic we nd a 95 per cent condence interval
on to be [1.0436, 3.4339]. This interval appears to yield similar results to those found using
Gart and Nam’s iterative score method.
For further comparison, the Wald-based method described by Rothman and Greenland [1]
produces a 95 per cent condence interval of [1.0272, 3.3526]. This method appears to produce
a slightly wider interval than that obtained using the Poisson score method. Rothman and
Greenland describe the numbers used in this calculation as being suciently large enough to
produce an adequate approximation.
Note that for the conditional binomial method, Jaech [8] refers readers to the abridged tables
by Hald [15]. These tables are not recommended due to limited numerator and denominator
values. Newcombe (personal communication) recommended the use of more comprehensive
tables such as the Geigy Scientic Tables [16] or software programs such as SAS or Minitab
to nd the required limits. Note that the beta function in Excel may also be used to do this.
For the above comparison we have used moderate numerators and large denominators and it
is reasonable to expect that all of the methods will broadly agree. When small numerators are
used larger dierences between the methods are observed. Although this dierence varies a
great deal depending on the dierence between the numerators (the further apart the numerators
are, the more the methods agree), we found in general that Nam’s method and the Poisson
score-based method are very similar and are always the narrowest intervals and the conditional
binomial method is always widest (generally at least twice as wide as the score-based interval).
Copyright ? 2003 John Wiley & Sons, Ltd.
Statist. Med. 2003; 22:2071–2083
CONFIDENCE LIMITS FOR THE RATIO OF TWO RATES
2077
5. MONTE CARLO SIMULATION
A Monte Carlo simulation study was undertaken to examine the performance of the proposed
score-based interval, Jaech’s transformation interval and Rothman and Greenland’s Waldbased interval for all integer values of 0 and 1 between 1 and 40. The equivalence of
n1 = n0 was maintained for simplicity and without loss of generality. A methodology similar
to that described by Miettinen and Nurminen [6] was adopted. This involved generating a
random value from each of Poisson( 0 ) and Poisson(1 ) distributions. A nominal level = 0:05
condence interval was calculated based on these observed values and a test performed to see
whether the true ratio 10 was contained in the interval. This was repeated 1 000 000 times and
the number of times the ratio was not in the interval was recorded. As discussed in Sections
2 and 3, when x 0 is 0 there is a division by zero in the score-based interval and also for
Rothman and Greenland’s interval when either x 0 or x1 is 0. When this happened we followed
the directions of Gart and Nam [5] and Koopman [7] and tried two dierent approaches to
calculating the coverage probabilities for these intervals, rst dening x 0 = 0:5 for the scorebased interval and xi = 0:5 for i = 0; 1 in Rothman and Greenland’s Wald-based interval and
second dening the lower and upper bounds as 0 and ∞, respectively. The simulations under
the amendments produced very similar results so we will only discuss the case in which we
replaced xi = 0 with xi = 0:5. Jaech’s condence interval could only produce zero values with
certain non-integer values of x 0 depending on the value of z=2 . Since the chosen values for
0 ; 1 and z=2 for this interval could not produce zero values, amendments were not required.
Table I illustrates in detail the simulated coverage probabilities when 0 and 1 are small
(each between 1 and 8). The results of this simulation indicated that when 0 = 1 or 2 and
1 ¿4 (and vice versa) then the score-based interval is closest to nominal. When 0 = 1 then
Jaech’s method is closer to nominal, however the results for the score-based interval and
Jaech’s interval are very close when both rates are greater than 4. Elsewhere, Jaech’s interval
is usually closer in absolute value to the nominal 95 per cent level but this is not entirely
consistent and both are generally close to the nominal level. We also note that results are
very similar at both the 90 per cent and 99 per cent levels. The only major dierence occurs
at the 99 per cent level when 0 and 1 are small. Here, when one rate is equal to 1 and the
other rate is between 2 and 5 (see Table II), we nd that Jaech’s interval is very liberal and
the score-based interval is conservative.
Figure 1 shows a scatter plot comparing the coverage probabilities for the score method
and Jaech’s method. Here each point represents the coverage probability for a 0 − 1 pair.
This plot clearly presents the dierences in true levels and it is easy to see that many of the
intervals fall close to the 95 per cent level. Where this is not true the score-based intervals
are closer to the nominal level than Jaech’s intervals. A further comparison of the score-based
and Jaech’s intervals in a small simulation (100 iterations and for rates varying over 1 to 8
for each ) at the 95 per cent level showed that Jaech’s interval is usually wider than the
score-based interval.
Similarly, Figure 2 shows a comparison of the simulated coverage probabilities for the
score-based interval and Rothman and Greenland’s large-sample approximation method based
on Wald limits at the nominal 95 per cent level. Here we see that most of the score intervals
are around the 95 per cent level whereas the Wald-based intervals are almost all conservative
and on average at least half a per cent too large. Similar results are found when comparing
Jaech’s interval with the Wald interval. Lastly a box plot (Figure 3) shows that the distribution
Copyright ? 2003 John Wiley & Sons, Ltd.
Statist. Med. 2003; 22:2071–2083
2078
P. L. GRAHAM, K. MENGERSEN AND A. P. MORTON
Table I. True simulated coverage probability at the nominal 95 per cent level. For each rate: numbers in the rst row correspond to Jaech’s interval; numbers in the second row correspond to
the proposed score interval; numbers in the third row correspond to the Wald-based interval.
0
1
1
2
3
4
5
6
7
8
1
98.59
99.27
100.00
98.12
95.77
99.82
95.10
96.56
99.03
97.46
94.86
97.24
97.86
95.86
97.08
97.41
95.58
96.48
97.49
95.91
96.52
97.51
95.67
96.22
2
98.13
98.26
99.82
95.83
97.56
99.93
96.25
97.04
99.54
95.65
96.54
98.84
96.70
96.07
98.34
96.46
96.12
97.53
96.85
96.50
96.93
97.04
96.50
96.63
3
95.10
96.57
99.04
96.21
96.93
99.53
95.29
96.64
99.54
95.48
95.89
99.36
95.11
96.42
98.18
94.72
96.23
97.82
95.41
96.10
97.87
95.20
96.52
97.56
4
97.43
95.51
97.20
95.60
96.42
98.84
95.46
95.82
99.34
95.54
96.18
98.81
95.00
95.85
98.43
95.40
95.97
97.98
95.79
96.18
97.62
94.91
95.71
97.38
5
97.86
96.12
97.09
96.69
95.88
98.35
95.13
95.72
98.18
94.99
95.39
98.44
95.64
95.88
97.99
95.34
95.55
97.61
94.99
95.33
97.57
95.58
94.76
97.83
6
97.41
95.61
96.44
96.47
95.88
97.54
94.76
95.46
97.85
95.35
95.43
97.95
95.31
95.35
97.63
95.55
95.62
97.26
95.21
95.24
97.25
95.15
95.30
96.72
7
97.51
95.93
96.53
96.84
96.32
96.93
95.43
95.05
97.88
95.78
95.67
97.62
94.99
95.15
97.56
95.19
95.21
97.26
95.40
95.41
96.76
95.68
95.14
96.70
8
97.48
95.65
96.20
97.09
96.44
96.66
95.23
95.80
97.58
94.86
95.19
97.36
95.57
94.55
97.82
95.13
95.25
96.72
95.66
95.13
96.66
95.18
95.16
96.32
Table II. True simulated coverage probability at the nominal 99 per cent level. For each
rate: numbers in the rst row correspond to Jaech’s interval; numbers in the second row
correspond to the proposed score interval.
0
1
1
2
3
4
5
6
7
8
1
99.96
100.00
89.77
99.64
94.36
99.22
96.95
98.49
98.41
98.46
97.57
98.34
98.58
98.32
99.15
98.34
2
89.76
99.85
99.54
99.94
99.51
99.65
98.42
99.25
98.98
99.37
99.44
99.16
99.42
98.79
99.42
98.79
Copyright ? 2003 John Wiley & Sons, Ltd.
Statist. Med. 2003; 22:2071–2083
CONFIDENCE LIMITS FOR THE RATIO OF TWO RATES
2079
1.00
0.99
JAECH
0.98
0.97
0.96
0.95
0.94
0.94
0.95
0.96
0.97
SCORE
0.98
0.99
1.00
Figure 1. Coverage probabilities of the score-based interval versus the
Jaech’s interval at the 95 per cent level.
1.00
0.99
ROTHMAN
0.98
0.97
0.96
0.95
0.94
0.94
0.95
0.96
0.97
SCORE
0.98
0.99
1.00
Figure 2. Coverage probabilities of the score-based interval versus the
Wald-based interval at the 95 per cent level.
of coverage probabilities for the simulated intervals discussed are all skewed right and all are
generally conservative, however, the Wald-based interval is more conservative than the others.
Table III shows the summary statistics for the coverage probabilities at the nominal 95 per
cent level. For example, ‘Max error-’ means the maximum amount the coverage was below
nominal. Here the minimum coverage probability for the score-based interval was 0.9455
Copyright ? 2003 John Wiley & Sons, Ltd.
Statist. Med. 2003; 22:2071–2083
2080
P. L. GRAHAM, K. MENGERSEN AND A. P. MORTON
1.00
0.99
0.98
0.97
0.96
0.95
0.94
JAECH
ROTHMAN
SCORE
Figure 3. Box plots of coverage probabilities at the nominal 95 per cent level.
Table III. Summary statistics for the coverage probabilities at the 95 per cent level.
Median
Mean
Max error −
Max error +
Mean error −
Mean error +
SD errors −
SD errors +
Proportion −
Proportion ¡94:9 per cent
Proportion +
Proportion ¿96 per cent
Score
Jaech
Rothman
0.9512
0.9525
0.0045
0.0427
0.0009
0.0033
0.0009
0.0041
0.1638
0.0046
0.8150
0.0838
0.9509
0.9532
0.0061
0.0359
0.0008
0.0041
0.0010
0.0077
0.1650
0.0034
0.8069
0.0944
0.9564
0.9593
0.0012
0.0500
0.0010
0.0093
0.0002
0.0072
0.0013
0.0006
0.9988
0.3581
+ indicates above nominal.
− indicates below nominal.
which is 0.0045 below nominal. Note that this table also shows the proportion of times
coverage probabilities are above or below nominal for each interval and also the proportion
of times the intervals are above 96 per cent or below 94.9 per cent.
Newcombe [9] points out that many studies (including this one) use integer values in the
simulations and the extreme discreteness of the coverage behaviour that this causes may invite
discussion about the conclusions. Thus, to provide a fuller picture, we performed a simulation
study similar to the previous one but now using values of 0 and 1 between 1 and 10 in
Copyright ? 2003 John Wiley & Sons, Ltd.
Statist. Med. 2003; 22:2071–2083
CONFIDENCE LIMITS FOR THE RATIO OF TWO RATES
2081
1.00
0.99
JAECH
0.98
0.97
0.96
0.95
0.94
0.94
0.95
0.96
0.97
SCORE
0.98
0.99
1.00
Figure 4. Coverage probabilities of the score-based interval
versus Jaech’s interval at the 95 per cent level.
1.00
0.99
ROTHMAN
0.98
0.97
0.96
0.95
0.94
0.94
0.95
0.96
0.97
SCORE
0.98
0.99
1.00
Figure 5. Coverage probabilities of the score-based interval versus the
Wald-based interval at the 95 per cent level.
steps of 0.1. Figure 4 shows that the majority of coverage probabilities produced by both the
score-based method and Jaech’s method are close to nominal. However, for those coverage
probabilities that are not close to nominal, Jaech’s interval has more points scattered between
97 per cent and 98 per cent and the score-based interval has some points that are a lot
more conservative than Jaech’s interval. On the other hand, Figure 5 shows that Rothman and
Copyright ? 2003 John Wiley & Sons, Ltd.
Statist. Med. 2003; 22:2071–2083
2082
P. L. GRAHAM, K. MENGERSEN AND A. P. MORTON
Greenland’s interval is always more conservative than the score-based interval. Note also for
these small rates that Rothman and Greenland’s interval is always at least 1 per cent above
the nominal 95 per cent level.
6. DISCUSSION
The score-based condence interval for the ratio of two Poisson rates developed here is
competitive with Jaech’s interval as evidenced by the example and simulations shown in previous sections. Furthermore, we may determine that the score-based method is preferable over
Jaech’s method, in many situations, because the coverage probabilities are less variable, are
more often closer to the nominal level and also because the intervals are generally narrower.
The summary statistics and plots shown in Section 5 helped to illustrate the similarities
between the score-based method and Jaech’s method and further conrm the conservative
nature of Rothman and Greenland’s interval. Table III also showed that although the scorebased method and Jaech’s method may be liberal at times, less than 0.5 per cent of simulated
coverage probabilities are less than 94.9 per cent. Thus we may conclude that provided it
is acceptable to have the occasional slightly liberal interval, then both Jaech’s transformation
and the score-based intervals are clearly preferable to the Wald-based interval of Rothman
and Greenland.
Finally, the score-based interval is easy to apply, requires no iterative calculations and is
maintained at close to the nominated coverage.
ACKNOWLEDGEMENTS
The authors are greatly indebted to the reviewers, including R. Newcombe, for their extremely useful
comments and suggestions. They would also like to thank Dr Nam for his helpful comments.
REFERENCES
1. Rothman KJ, Greenland S. Modern Epidemiology. 2nd edn. Lippincott-Raven: Philadelphia, 1998.
2. Nam J. Condence limits for the ratio of two binomial proportions based on likelihood scores: non-iterative
method. Biometrical Journal 1995; 37:375– 379.
3. Bartlett MS. Approximate condence intervals, II. More than one unknown parameter. Biometrika 1953; 40:
306– 317.
4. Gart JJ. Approximate tests and interval estimation of the common relative risk in the combination of 2×2 tables.
Biometrika 1985; 72(3):673 – 677.
5. Gart JJ, Nam J. Approximate interval estimation of the ratio of binomial parameters: a review and corrections
for skewness. Biometrics 1988; 44:323 – 338.
6. Miettinen O, Nurminen M. Comparative analysis of two rates. Statistics in Medicine 1985; 4:213 – 226.
7. Koopman PAR. Condence intervals for the ratio of two binomial proportions. Biometrics 1984; 40:513 – 517.
8. Jaech JL. Comparing two methods of obtaining a condence interval for the ratio of Poisson parameters.
Technometrics 1970; 12(2):383 – 387.
9. Newcombe RG. Two-sided condence intervals for the single proportion: comparison of seven methods.
Statistics in Medicine 1998; 17:857– 872.
10. Newcombe RG. Interval estimation for the dierence between independent proportions: comparison of eleven
methods. Statistics in Medicine 1998; 17:873 – 890.
11. Newcombe RG. Improved condence intervals for the dierence between binomial proportions based on paired
data. Statistics in Medicine 1998; 17:2635– 2650.
Copyright ? 2003 John Wiley & Sons, Ltd.
Statist. Med. 2003; 22:2071–2083
CONFIDENCE LIMITS FOR THE RATIO OF TWO RATES
2083
12. Tango T. Equivalence test and condence interval for the dierence in proportions for the paired-sample design.
Statistics in Medicine 1998; 17:891–908.
13. Minitab Inc. Minitab Release 12.1. Available from www.minitab.com, 1998.
14. Rao PV. Statistical Research Methods in the Life Sciences. Duxbury Press: California, 1998.
15. Hald A. Statistical Tables and Formulas. Wiley: New York, 1952.
16. Lentner C (ed.). Geigy Scientic Tables: Volume 2. 8th edn. CIBA-GEIGY: Basle, 1982; 89 – 102.
Copyright ? 2003 John Wiley & Sons, Ltd.
Statist. Med. 2003; 22:2071–2083