Frequency and Outcome of Cervical Cancer Prevention Failures in

Anatomic Pathology / DISCREPANCY ANALYSIS AND OUTCOMES
Frequency and Outcome of Cervical Cancer Prevention
Failures in the United States
Stephen S. Raab, MD,1 Dana Marie Grzybicki, MD, PhD,1 Richard J. Zarbo, MD, DMD,2
Chris Jensen, MD,3 Stanley J. Geyer, MD,4 Janine E. Janosky, PhD,5 Frederick A. Meier, MD,2
Colleen M. Vrbin,1 Gloria Carter, MD,1 and Kim R. Geisinger, MD6
Key Words: Papanicolaou test; Cervical cancer; Patient safety; Colposcopy
DOI: 10.1309/97JHG6GLY69BVF4Y
Abstract
We measured the frequency and outcome of cervical
cancer prevention failures that occurred in the
Papanicolaou (Pap) and colposcopy testing phases
involving 1,646,580 Pap tests in 4 American hospital
systems between January 1, 1998, and December 31,
2004. We defined a screening failure as a 2-step or
greater discordant Pap test result and follow-up biopsy
diagnosis. A total of 5,278 failures were detected
(0.321% of all Pap tests); 48% and 52% of failures
occurred in the Pap test and colposcopy phases,
respectively. Missed squamous cancers (1 in 187,786
Pap tests), glandular cancers (1 in 19,426 Pap tests),
and high-grade lesions (1 in 6,870 Pap tests)
constituted 4.1% of all failures. Unnecessary repeated
tests or diagnostic delays occurred in 70.8% and 63.9%
of failures involving high- and low-grade lesions,
respectively. We conclude that cervical cancer
prevention practices are remarkably successful in
preventing squamous cancers, although a high
frequency of failures results in low-impact negative
outcomes.
The success of cervical cancer prevention depends on
optimizing the performance of a number of unique testing
steps. Much of the medical literature has focused on the
Papanicolaou (Pap) test screening portion of cervical cancer
prevention, and numerous studies have evaluated the accuracy
of the Pap test and investigated the root cause of Pap test failures.1-3 Despite the well-recognized success of the Pap test in
reducing the incidence and mortality rates of cervical cancer,1,2
the Pap test portion of cervical cancer prevention services has
been the phase most radically transformed by new technologies and the effect of governmental regulations.4-9
Pap test screening failures may occur in any testing phase
from ordering, performing, processing, interpreting, reporting,
and then acting on the result of a test. Yet, if cervical cancer
prevention is viewed from the perspective of the total testing
process,10 the steps of initial patient recruitment, screening, and
follow-up also are vitally important for cancer prevention. It is
well documented that the majority of women in whom cervical
cancer develops never had a Pap test or did not have a recent
Pap test.11,12 The frequency and the effect of failures in phases
of the cervical cancer prevention testing cycle beyond the initial screening examination have been less well studied.
One method of detecting Pap test failures is by using the
cytologic-histopathologic (CH) correlation method by comparing the Pap test and follow-up cervical biopsy diagnoses.7,13,14 The College of American Pathologists and others
documented the usefulness of CH correlation in identifying
overall performance and error-prone steps in Pap testing.13,14
However, CH correlation also may be used to detect failures in
the follow-up colposcopy testing cycle.13 In the ASCUS-LSIL
[Atypical Squamous Cells of Undetermined Significance–LowGrade Squamous Intraepithelial Lesion] Triage Study (ALTS),
Am J Clin Pathol 2007;128:817-824
© American Society for Clinical Pathology
817
DOI: 10.1309/97JHG6GLY69BVF4Y
817
817
Raab et al / DISCREPANCY ANALYSIS AND OUTCOMES
Ferris et al15 reported that the mean colposcopist sensitivity
for detecting cervical intraepithelial neoplasia (CIN) 2 or 3
was lower than expected, indicating that failures in the colposcopy phase of testing are not uncommon.
Our primary goal in this study was to establish a baseline
level of cervical cancer screening failures occurring in the Pap
test and the colposcopy phases of testing using the CH correlation method. Our approach considered that each test in the
prevention process was unique and that each false-negative or
false-positive diagnosis had the potential to result in unnecessary delay or overtesting and/or overtreatment or undertreatment. We measured the frequency and impact of false-negative and false-positive Pap test and colposcopy biopsy diagnoses on patient care.
Materials and Methods
Cervical Cancer Prevention
In most developed nations, cervical cancer prevention
involves the successful performance of several distinct tests,4-9,16
and, conceptually, these tests conjointly may be viewed as a
total testing process.10 Women undergo a screening test, such
as a Pap test, follow-up colposcopy for specific Pap test
epithelial cell abnormality diagnoses,4,17 and a second clinic
visit for lesion excision for specific histopathologic diagnoses.18-20 The implementation of new Pap test technologies,
screening intervals, and follow-up protocols was variably
standardized during our study time frame. In general, women
who had a Pap test diagnosis of LSIL; high-grade SIL (HSIL);
atypical glandular cells; cancer, atypical squamous cells
(ASC), cannot exclude HSIL; or ASC, undetermined significance (ASC-US) (with a positive high-risk human papilloma
virus [HPV] DNA test result) were referred for colposcopy.6,17
Study Population
The study population included all women who received a
Pap test between January 1, 1998, and December 31, 2004, in
4 health care systems. This study is part of a larger project
funded by the Agency for Healthcare Research and Quality
and by the Centers for Disease Control and Prevention as a
means to study and improve patient safety in diagnostic testing and screening.13 The health care systems were the
University of Pittsburgh Medical Center, Pittsburgh, PA;
Henry Ford Health System, Detroit, MI; Western
Pennsylvania Hospital, Pittsburgh; and the University of Iowa
Healthcare, Iowa City; they were deidentified for the purposes of this study. All sites obtained institutional review board
approval for project performance. The 4 institutions adopted
liquid-based cytology between 2000 and 2004 and high-risk
HPV DNA testing for ASC-US diagnoses between 2002 and
2004. Our goal was not to specifically study the implementation of these technologies because they were introduced variably and gradually.
Definitions of Discrepancy and Test Failure
We defined a discrepancy as a difference between the cytologic and histopathologic diagnoses that indicated the presence
or absence of a pathologic entity or a definite difference in the
degree to which the pathologic condition was judged to be present.11 Cytologic and histopathologic diagnostic schema are different, and we developed a scaled hierarchy of categories to
determine if a discrepancy occurred ❚Table 1❚.13 We used the
2001 Bethesda System to classify Pap test results and the CIN
system to classify histopathologic diagnoses.17,21
The CH correlation discrepancy proportion depends on the
method of case detection.13 Correlation processes were standardized at the 4 study sites in 2002 ❚Table 2❚, and the effectiveness of cervical cancer screening before (n = 1,083,221
❚Table 1❚
Discordant and Nondiscordant Diagnoses
Histopathologic Specimen
Papanicolaou test diagnosis
NILM or unsatisfactory
ASC-US
ASC-H
Low-grade SIL (CIN 1)
High-grade SIL (CIN 2/3)
Dysplasia
Squamous cell carcinoma
AGC, favor neoplasia
AGC
AIS
Adenocarcinoma
≥ 2 steps)
Discordant Diagnoses (≥
Nondiscordant Diagnoses (<2 steps)
CIN 1 or higher (including dysplasia, not
otherwise specified)
CIN 2 or higher
Negative
Negative or cancer
Negative
Negative
Negative or CIN 1
Negative or any CIN
Any CIN
Negative or CIN 1
Negative or CIN 1
Negative
Negative or CIN 1
CIN 1 or higher
CIN 1 or CIN 2/3
CIN 1 or higher
CIN 1 or higher
CIN 2 or higher
AIS or adenocarcinoma
Negative or AIS or adenocarcinoma
CIN 2 or higher or AIS or adenocarcinoma
CIN 2 or higher or AIS or adenocarcinoma
AGC, atypical glandular cells; AIS, adenocarcinoma in situ; ASC-H, atypical squamous cells, cannot exclude HSIL; ASC-US, atypical squamous cells of undetermined
significance; CIN, cervical intraepithelial neoplasia; NILM, no evidence of intraepithelial lesion or malignancy; SIL, squamous intraepithelial lesion.
818
818
Am J Clin Pathol 2007;128:817-824
DOI: 10.1309/97JHG6GLY69BVF4Y
© American Society for Clinical Pathology
Anatomic Pathology / ORIGINAL ARTICLE
Pap tests) and after (n = 563,359 Pap tests) standardization
was evaluated separately. We defined a test failure as a 2-step
or greater discrepancy, because in most cases, one of the diagnoses did not accurately describe the presence or absence of a
disease in a patient.13,21,22 We examined only 2-step or greater
discrepancies because 1-step discrepancies were exceedingly
common, generally were associated with minimal impact on
clinical outcome, and were associated with poorly reproducible adjudication of failure cause.13 Thus, some 1-step differences with clinical import (eg, HSIL Pap test and CIN 1
histopathologic specimen) were not examined.
The review pathologist performed a root cause analysis
by reviewing all microscopic slides associated with the discrepant Pap test and histopathologic biopsy results. The
pathologist then adjudicated if the failure was secondary to the
preanalytic (sampling) or analytic (screening or interpretation)
phases of the Pap or colposcopy testing cycles.14 An analytic
failure was a failure in diagnostic categorization by the individual who originally screened or examined the case. A preanalytic failure was a failure in which diagnostic material was
not present on the slides of one case and present on the slides
of the other. Our previous work has shown that traditional
methods of adjudication of test failure root cause exhibited
moderate to low reproducibility. However, this adjudication
method may be used as a first step in exploring more detailed
analysis of failure cause.
We separately examined the relationship between specific diagnostic categorical differences and impact on care for
the following:
1. High-grade lesions (adenocarcinoma in situ and HSIL)
and cancers (squamous carcinoma and endocervical adenocarcinoma) not detected by Pap testing. The ability of the Pap
test to detect some glandular lesions is controversial,23,24 and,
therefore, discrepant cases of endometrial vault adenocarcinomas were excluded from our analysis. Because CH correlation
traditionally is not used to compare Pap test and histopathologic diagnoses with long intervals between specimen
procurement,14 we excluded women who had specimens
procured more than 6 months apart (eg, false-negative Pap test
results based on a 1-year look back). We determined the number of women who had a benign Pap test result with cancer or
high-grade lesion follow-up result.
2. High-grade lesions and cancers not detected in
histopathologic specimens when the Pap test result was abnormal. Because high-grade lesions and cancers are unlikely to
regress in an interval of 6 months or less, most of these discrepancies were secondary to failures in the colposcopy testing phase.
3. Disease misclassification on Pap tests and histopathologic specimens, excluding specimens from women who had
a missed cancer or high-grade lesion. This number represented the diagnostic noise in the screening system and was the
number of underdiagnosed or overdiagnosed lesions (excluding underdiagnosed cancers and high-grade lesions). We recognized LSIL disease regression complicated the test failure
adjudication process.25 We evaluated the cases of women who
had an original and review LSIL Pap test and follow-up
benign histopathologic diagnoses in 2 ways. First, we considered the failure as a preanalytic colposcopy failure regardless
of postcolposcopy follow-up because this discrepant pair
resulted in at least clinical confusion and patient anxiety.
Clinicians did not know whether the LSIL actually had
regressed and responded variably. Second, we only considered
the cases of women who had an abnormal Pap test or
❚Table 2❚
Prestudy and Consensus Methods of Cytologic-Histopathologic Correlation Detection and Adjudication Method
Step
Difference
Cytotechnologist
Prescreening
Reviewer
Adjudication Method
of Error Cause
6-mo wide using previous
month histopathologic
specimens to search for
cytologic cases; search
performed monthly
2-step
disagreement
No
1 of 3 pathologists
Cases shown to
original pathologist
for input; final
decision based on
reviewer
4-mo wide search
Any
disagreement
2-step
disagreement
Any
disagreement
Yes
1 pathologist
None
Yes
2 pathologists
None
No
Pathologist signing
Case shown to
out histopathologic
original cytologist
specimen
if disagreement
Yes
1 pathologist
Site
Case Retrieval Method
Search Interval
A
B
Computer search for all
correlating histopathologic and cytologic
specimens followed
by manual review for
those not correlating
Same as A
C
Same as A
D
12-mo search; cases
reviewed bimonthly
Reviewed daily
Cytologic case identified
when triggered by
positive histopathologic
specimen
Consensus Computer search for
6-mo wide using previous
all correlating histomonth histopathologic
pathologic and cytologic specimens to search for
specimens followed by
cytologic cases
manual review
2-step
disagreement
Final decision based
on reviewer
Am J Clin Pathol 2007;128:817-824
© American Society for Clinical Pathology
819
DOI: 10.1309/97JHG6GLY69BVF4Y
819
819
Raab et al / DISCREPANCY ANALYSIS AND OUTCOMES
histopathologic biopsy follow-up as preanalytic failures with
the remainder falling in an indeterminate failure class.
Impact on Patient Care
We performed clinical record review on all cases with Pap
test–histopathologic specimen discrepancies that had original
or review diagnoses of HSIL/CIN 2 or greater and a random
sample of 10% of all discrepancies that had original or review
diagnoses of less than HSIL/CIN 2.13 A 10% clinical record
review was performed on the latter subset because preliminary
studies showed that women with these discrepancies had very
similar assessments of outcomes and that the annual volume of
these discrepancies was prohibitively large. We extrapolated
the reviewed sample proportions of outcomes to all unreviewed
cases. The clinical record reviewer recorded additional tests
ordered, additional treatment protocols initiated, morbidity or
mortality related to additional tests or treatments, and delays in
diagnosis. A pathologist assessed the clinical impact of the discrepancies using the categories shown in ❚Table 3❚. This
scheme was based on error impact schema previously published in the medical literature.13,26,27 We note that other classification schemes use the term harm to indicate a negative clinical outcome that may be graded in degree.26,27
Data Analysis
All data analyses were performed using SPSS software
(version 14, SPSS, Chicago, IL). Data initially were analyzed
descriptively. Discrepant case frequencies were calculated for
different categories of initial Pap test or histopathologic diagnoses. Proportions of discrepant cases occurring during specific intervals were calculated as n/d, with n the number of
discrepant cases and d the total number of Pap tests examined
during the same interval. Appropriate denominator values
were retrieved from existing electronic laboratory information
systems using query tools designed to extract Pap test workload data.
Differences in discrepancy proportions were examined by
using the Fisher exact test or the χ2 test. Statistical significance was assumed at a P value of .05 or less.
Discrepant Pap test–histopathologic case pairs were
obtained concurrently or nonconcurrently. For nonconcurrent
discrepant case pairs, the Pap test was done during a separate
clinic or office visit before colposcopy. Concurrently obtained
discrepant specimens were obtained during the same colposcopy procedure. The practice of obtaining a Pap test specimen at the time of colposcopy with biopsy was variable;
therefore, we performed separate analyses including and
excluding these data.
Results
The number of all and nonconcurrent testing failures is
shown in ❚Table 4❚ and ❚Table 5❚, respectively. Standardized
CH correlation resulted in greater discrepancy detection compared with nonstandardized methods (P < .001). Excluding
concurrent case pairs, the proportion of women who had a Pap
test and a CH correlation discrepancy was 0.5% (or 1 in 204
Pap tests); 0.2% of the total (1 in 596 Pap tests) had a discrepancy secondary to an analytic failure. In the standardized time
frame, more women had a discrepancy secondary to a colposcopic failure compared with a Pap test failure.
❚Table 3❚
Severity Classification of Discrepancy
Classification
Description
No harm
The clinician acted regardless of an erroneous diagnosis; Example: A patient had a Pap test and
histopathologic biopsy during same colposcopic session; Pap test diagnosed as NILM and biopsy
tissue as CIN 1 (discrepancy adjudicated as a Pap test sampling error); clinician acted on the
histopathologic diagnosis regardless of the Pap test diagnosis
The clinician intervened before harm occurred or the clinician did not act on an erroneous diagnosis;
Example: Pap test diagnosed as HSIL and follow-up cervical biopsy diagnosed as benign (adjudicated
as a histopathologic sampling error); clinician proceeded with a therapeutic procedure regardless of
the histopathologic diagnosis
Near miss
Negative clinical outcome events
Minimal (grade I)
Mild (grade II)
Moderate (grade III)
Severe (grade IV)
Further unnecessary noninvasive diagnostic test(s) performed (eg, Pap test, blood test, or noninvasive
radiologic examination)
Delay in diagnosis or therapy of ≤6 mo
Unnecessary invasive further diagnostic test(s) (eg, tissue biopsy, reexcision, angiogram, or radionuclide study)
Delay in diagnosis or therapy of >6 mo
Minor morbidity owing to delay in therapy or unnecessary further diagnostic efforts or therapy
predicated on presence of (unjustified) diagnosis
Moderate morbidity owing to delay in therapy or (otherwise) unnecessary further diagnostic efforts or
therapy predicated on presence of (unjustified) diagnosis
Loss of life, limb, or other body part or long-lasting morbidity (>6 mo)
CIN, cervical intraepithelial neoplasia; HSIL, high-grade squamous intraepithelial lesion; NILM, no evidence of intraepithelial lesion or malignancy; Pap, Papanicolaou.
820
820
Am J Clin Pathol 2007;128:817-824
DOI: 10.1309/97JHG6GLY69BVF4Y
© American Society for Clinical Pathology
Anatomic Pathology / ORIGINAL ARTICLE
The number of nonconcurrent false-negative Pap test
results (original Pap test diagnosed as no evidence of intraepithelial lesion or malignancy [NILM]) and histopathologic
specimens (original histopathologic specimen diagnosed as
benign) is shown in ❚Table 6❚. CH correlation standardization
resulted in increased detection of discrepancies (P < .001). In
the poststandardized cohort, only 3 squamous cell carcinomas
(5.33 × 10–4% or 1 in 187,786 Pap tests) had an NILM Pap test
result, and none was adjudicated as an analytic failure. The
proportion of women who had an adenocarcinoma diagnosis
on a histopathologic specimen and an NILM Pap test result
was far more common (0.005% or 1 in 19,426 Pap tests with
1 in 46,947 Pap tests having an analytic failure). As a result of
CH correlation standardization, the number of benign
histopathologic specimens with LSIL Pap test results
increased by 3-fold, and 75% of these discrepancies were secondary to a preanalytic colposcopy failure.
The majority of false-negative colposcopy failures (bottom of Table 6) were associated with an LSIL Pap test result.
In this analysis, all cases in which the Pap diagnosis was
adjudicated as LSIL and the colposcopy diagnosis was not
adjudicated as an analytic failure were considered preanalytic colposcopy failures. Review of the follow-up of these
cases showed that 61.0% had a follow-up abnormal Pap test
or histopathologic diagnosis, 30.5% had no abnormal diagnoses, and 8.5% were lost to follow-up. Thus, 33.3% of
women with follow-up may have had LSIL regression before
colposcopy, reducing the total number of false-negative
histopathologic diagnoses from 2,104 to 1,710 cases.
The impact of failures is shown in ❚Table 7❚. We separately examined the impact of 2 discrepancy categories: HSIL/CIN
2+ and LSIL/CIN 1 and lower. In the pre-CH correlation standardized cohorts, more women had no-harm outcomes for all
discrepancy categories. In the standardized cohorts, there was a
❚Table 4❚
Total Cytologic-Histologic Correlation Discrepancies Including Concurrent Cases*
Papanicolaou Test Failures
Year
All Cases
1998-2001
2002-2004
Total
LSIL/CIN 1–
1998-2001
2002-2004
Total
HSIL/CIN 2+
1998-2001
2002-2004
Total
Colposcopy Failures
Total
Total
Preanalytic
Analytic
Total
Preanalytic
Analytic
1,837
3,441
5,278
1,083 (59.0)
1,653 (48.0)
2,736 (51.8)
694 (37.8)
1,016 (29.5)
1,710 (32.4)
389 (21.2)
637 (18.5)
1,026 (19.4)
754 (41.0)
1788 (52.0)
2,542 (48.2)
443 (24.1)
1372 (39.9)
1,815 (34.4)
311 (16.9)
416 (12.1)
727 (13.8)
1,517
3,051
4,568
954 (62.9)
1,508 (49.4)
2,462 (53.9)
662 (43.6)
974 (31.9)
1,636 (35.8)
292 (19.2)
534 (17.5)
826 (18.1)
563 (37.1)
1,543 (50.6)
2,106 (46.1)
294 (19.4)
1,159 (38.0)
1,453 (31.8)
269 (17.7)
384 (12.6)
653 (14.3)
320
390
710
129 (40.3)
145 (37.2)
274 (38.6)
32 (10.0)
33 (8.5)
65 (9.2)
97 (30.3)
112 (28.7)
209 (29.4)
191 (59.7)
245 (62.8)
436 (61.4)
149 (46.6)
213 (54.6)
362 (51.0)
42 (13.1)
32 (8.2)
74 (10.4)
CIN, cervical intraepithelial lesion; CIN 1–, CIN 1 or lower; CIN 2+, CIN 2 or higher; HSIL, high-grade squamous intraepithelial lesion; LSIL, low-grade squamous
intraepithelial lesion.
* Data are given as number (percentage).
❚Table 5❚
Total Cytologic-Histologic Correlation Discrepancies Excluding Concurrent Cases*
Papanicolaou Test Failures
Year
All Cases
1998-2001
2002-2004
Total
LSIL/CIN 1–
1998-2001
2002-2004
Total
HSIL/CIN 2+
1998-2001
2002-2004
Total
Colposcopy Failures
Total
Total
Preanalytic
Analytic
Total
Preanalytic
Analytic
1,379
2,763
4,142
766 (55.5)
1,125 (40.7)
1,891 (45.7)
439 (31.8)
559 (20.2)
998 (24.1)
327 (23.7)
566 (20.5)
893 (21.6)
613 (44.5)
1,638 (59.3)
2,251 (54.3)
386 (28.0)
1,259 (45.6)
1,645 (39.7)
227 (16.5)
379 (13.7)
606 (14.6)
1,110
2,402
3,512
659 (59.4)
992 (41.3)
1,651 (47.0)
416 (37.5)
526 (21.9)
942 (26.8)
243 (21.9)
466 (19.4)
709 (20.2)
451 (40.6)
1,410 (58.7)
1,861 (53.0)
260 (23.4)
1,059 (44.1)
1,319 (37.6)
191 (17.2)
351 (14.6)
542 (15.4)
269
361
630
107 (39.8)
133 (36.8)
240 (38.1)
23 (8.6)
33 (9.1)
56 (8.9)
84 (31.2)
100 (27.7)
184 (29.2)
162 (60.2)
228 (63.2)
390 (61.9)
126 (46.8)
200 (55.4)
326 (51.7)
36 (13.4)
28 (7.8)
64 (10.2)
CIN, cervical intraepithelial lesion; CIN 1–, CIN 1 or lower; CIN 2+, CIN 2 or higher; HSIL, high-grade squamous intraepithelial lesion; LSIL, low-grade squamous intraepithelial lesion.
* Data are given as number (percentage).
Am J Clin Pathol 2007;128:817-824
© American Society for Clinical Pathology
821
DOI: 10.1309/97JHG6GLY69BVF4Y
821
821
Raab et al / DISCREPANCY ANALYSIS AND OUTCOMES
shift of outcome classification from no harm to near miss. For
the nonconcurrent and concurrent HSIL/CIN 2+ groups, negative outcomes were observed in 70.8% to 22.7% of all failures, and severe negative outcomes were identified in a small
minority of cases. In the LSIL/CIN 1 or lower groups, negative outcomes were identified in 63.9% and 15.1% of the
nonconcurrent and concurrent discrepant cases, respectively;
the negative outcome was generally minimal or mild.
cancers, less effective in preventing adenocarcinomas, and
associated with a relatively large yearly number of lowgrade test failures with women having an excessive number
of repeated Pap tests, colposcopic procedures with unnecessary biopsies, anxiety, and delays in diagnosis. These data
also indicate that the majority of test failures, particularly
failures in diagnosing high-grade lesions, occur more frequently in the colposcopy rather than the initial Pap testing
cycle of cervical cancer prevention.
The level of squamous cancer prevention produced by
the American screening system is remarkable, and our data
corroborate the findings of numerous previously published
studies.1,2,16,18,19 Good screening tests trade off sensitivity
Discussion
These correlation failure data show that cervical cancer
prevention services are highly effective in preventing squamous
❚Table 6❚
Nonconcurrent False-Negative Pap Test and Colposcopic Examination Results
1998-2001
False-negative Pap test results
Adjudicated histopathologic diagnosis
SCC
Adenocarcinoma or AIS
CIN 2/3
CIN 1
Total
False-negative colposcopic results
Adjudicated cytologic diagnosis
SCC
Adenocarcinoma or AIS
HSIL
LSIL
Atypical (ASC-H or AGC)
Total
2002-2004
All Years
Total
Preanalytic
Analytic
Total
Preanalytic
Analytic
Total
Preanalytic Analytic
5
24
118
203
350
3
18
100
148
269
2
6
18
55
81
3
29
82
355
469
3
17
61
257
338
0
12
21
98
131
8
53
200
558
819
6
35
161
405
607
2
18
39
153
212
0
0
124
399
13
536
0
0
99
261
13
373
0
0
25
138
0
163
1
3
189
1,228
147
1,568
1
3
165
920
147
1,236
0
0
24
308
0
332
1
3
313
1,627
160
2,104
1
3
264
1,181
160
1,609
0
0
49
446
0
495
AGC, atypical glandular cells; AIS, adenocarcinoma in situ; ASC-H, atypical squamous cells, cannot exclude HSIL; CIN, cervical intraepithelial neoplasia; HSIL, high-grade
squamous intraepithelial lesion; LSIL, low-grade squamous intraepithelial lesion; Pap, Papanicolaou; SCC, squamous cell carcinoma.
❚Table 7❚
Impact of Discrepancy on Patient Care
HSIL/CIN 2+
% Discrepancy Type/Impact
Including concurrent case pairs
No harm
Near miss
Negative clinical outcome
Minimal
Mild
Moderate
Severe
Unknown
Excluding concurrent case pairs
No harm
Near miss
Negative clinical outcome
Minimal
Mild
Moderate
Severe
Unknown
LSIL/CIN 1 or Lower
1998-2001
2002-2004
1998-2001
2002-2004
23.2
16.5
3.2
25.2
57.2
12.8
8.9
34.9
36.3
23.2
0.0
0.4
0.4
39.6
25.8
0.3
2.1
3.8
19.0
10.9
0.0
0.0
0.0
36.8
14.3
0.0
0.4
4.9
23.0
9.8
3.1
22.3
50.8
10.4
3.4
28.7
41.7
25.0
0.0
0.0
0.5
42.3
26.0
0.3
2.2
3.8
25.2
13.6
0.0
0.0
0.0
45.9
17.5
0.0
0.5
4.0
CIN, cervical intraepithelial neoplasia; CIN 2+, CIN 2 or higher; HSIL, high-grade squamous intraepithelial lesion; LSIL, low-grade squamous intraepithelial lesion.
822
822
Am J Clin Pathol 2007;128:817-824
DOI: 10.1309/97JHG6GLY69BVF4Y
© American Society for Clinical Pathology
Anatomic Pathology / ORIGINAL ARTICLE
for specificity to maximize the detection of at-risk people.
However, it is false to assume that there is a value competition between detecting high-risk lesions and preventing the
95.9% of test failures detected by the CH correlation
process. The cervical cancer screening industry generally
has not focused on reducing these other test failures, which,
assuming 60 million Pap tests per year in the United States,28
occur in more than 285,000 women. Many American laboratories do not track or follow up the majority of CH correlation discrepancies,29 and the focus has been on increasing
Pap test squamous intraepithelial lesion detection through
Pap test technologies5,6,8,9 and regulation7 and not on preventing other failure types, particularly those related to colposcopy failures.
Determining the cause of test failure has been challenging because of observer bias and lack of adequate tools
and frontline skills to perform root cause analysis. Not
unexpectedly, our consortium found that the cause of test
failures generally was multifactorial and secondary to a
host of factors, including organizational structure and lack
of standardization, not simply practitioner lack of skill.30,31
Some pathologists understandably have advocated a medical-legal, panel-review–based approach to CH correlation,32
although this process has resulted in a measure of reproducibility of adjudicated cause rather than a focus on the
interplay of contributing factors. We found that many test
failures were secondary to poor specimen interpretability
composed of factors in specimen collection, processing, and
diagnostic interpretation.30 Adjudication of test failure into
preanalytic or analytic steps and Pap test or colposcopy
phases serves as a basis for quality improvement study and
is not meant to be punitive.13,30 Our data indicate that phases other than the Pap test interpretation phase need to be
addressed from a quality improvement perspective.
Interestingly, cervical cancer screening was far less
effective in the prevention of endocervical adenocarcinoma
than in the prevention of squamous carcinoma. We found
that for every 100,000 Pap tests, approximately 5 endocervical adenocarcinomas were missed. Endocervical adenocarcinomas are increasing in incidence,33 and in the United
States, missed adenocarcinomas are becoming an increasing
source of litigation.34 Not all experts believe that the Pap test
was designed to detect adenocarcinoma,35 and they attribute
false-negative Pap test results as secondary to patient-related
factors. Only rare studies have shown a decrease in mortality due to adenocarcinoma in screened populations.36 This
belief limits quality improvement because our findings indicate that these failures are secondary to combinations of factors, including patient-related factors as a contributing cause.
Our study has several limitations. First, our case selection criteria reduced our discrepancy proportion because we
excluded 1-step discordances and cases of women who had
a discrepant Pap test–histopathologic specimen separated by
more than 6 months. In the United States, the CH correlation
method traditionally has been used to evaluate case pairs
obtained within a relatively short time of each other.13,14
Second, we could not control for preanalytic variation in
screening intervals and follow-up protocols, including colposcopy referral rates and performance of concurrent Pap
tests with histopathologic biopsy. Because correlation practices were not standardized before 2002 and liquid-based
and high-risk HPV technologies were gradually introduced
during this time, we were not able to evaluate how technologies affected failure frequency.
We recognize that assessment of the impact of test failure on care was subjective,37 particularly for women who
may have lesions that regressed.38 However, excluding
cases of possible LSIL regression only reduced the overall
number of discrepant cases by 393. Some argue that cervical cancer screening may be viewed as a set of tests in
which the ultimate goal is the detection of disease rather
than the agreement of individual step diagnoses. In contrast,
we considered the diagnosis of each individual Pap or
histopathologic test as a unique step revealing information
that may conflict or concur with the preceding or follow-up
test diagnosis. We believe that the individual test approach
is a more patient-centric view of safety, and diagnostic discrepancies, at a minimum, may cause confusion and anxiety. An individual test focus also allows for more specific
quality improvement targeting.
The relatively high CH correlation discrepancy proportion justifies the development of error reduction interventions
in cervical cancer screening and further study of patient preference. Quality improvement in cervical cancer screening to
date primarily has not consisted of process improvement and
work redesign. Paradoxically, interventions that decrease the
false-negative Pap test result proportion may result in higher
discrepancy proportions with more women undergoing colposcopy with noncorrelating biopsy results. We note that our
error detection system did not evaluate other process breakdowns such as postanalytic recall failures or the preanalytic
problem of women who are not screened.
From the Departments of 1Pathology and 5Family Medicine and
Clinical Epidemiology, University of Pittsburgh School of
Medicine, Pittsburgh, PA; 2Pathology, Henry Ford Health System,
Detroit, MI; 3Pathology, University of Iowa Healthcare, Iowa City;
4Pathology, Western Pennsylvania Hospital, Pittsburgh; and
6Pathology, Wake Forest School of Medicine, Winston-Salem, NC.
Supported by grant HS13321-01 from the Agency for
Healthcare Research and Quality, Rockville, MD, and a grant from
the Centers for Disease Control and Prevention, Atlanta, GA.
Address reprint requests to Dr Raab: Dept of Pathology,
University of Pittsburgh School of Medicine, 5230 Centre Ave,
Pittsburgh, PA 15232.
Am J Clin Pathol 2007;128:817-824
© American Society for Clinical Pathology
823
DOI: 10.1309/97JHG6GLY69BVF4Y
823
823
Raab et al / DISCREPANCY ANALYSIS AND OUTCOMES
References
1. Kitchener HC, Castle PE, Cox JT. Chapter 7: achievements
and limitations of cervical cytology screening. Vaccine.
2006;24(suppl 3):S63-S70.
2. Saslow D, Runowicz CD, Solomon D, et al. American Cancer
Society guideline for the early detection of cervical neoplasia
and cancer. Cancer. 2002;52:342-362.
3. Renshaw AA. An accurate and precise methodology for
routine determination of the false-negative rate of
Papanicolaou smear screening. Cancer. 2001;93:86-92.
4. Wright TC Jr, Cox JT, Massad LS, et al. 2001 Consensus
guidelines for the management of women with cervical
cytological abnormalities. JAMA. 2002:287:295-304.
5. Davey E, Barratt A, Irwig L, et al. Effect of study design and
quality on unsatisfactory rates, cytology classifications, and
accuracy in liquid-based versus conventional cervical cytology:
a systematic review. Lancet. 2006;367:122-132.
6. Arbyn M, Sasieni P, Meijer CJ, et al. Chapter 9: clinical
applications of HPV testing: a summary of meta-analyses.
Vaccine. 2006;24(suppl 3);S78-S89.
7. Department of Health and Human Services, Health Care
Financing Administration. Medicare, Medicaid and CLIA
programs: regulations implementing the Clinical Laboratory
Improvement Amendments of 1988 (CLIA), 57 Federal
Register 7146 (1992) (codified at 42 CFR §493).
8. Brown AD, Garber AM. Cost-effectiveness of three methods
to enhance the sensitivity of Papanicolaou testing. JAMA.
1999;281:347-353.
9. Lehtinen M, Herrero R, Mayaud P, et al. Chapter 28: studies
to assess the long-term efficacy and effectiveness of HPV
vaccination in developed and developing countries. Vaccine.
2006;24(suppl 3):S233-S241.
10. Lundberg GD. Acting on significant laboratory results
[editorial]. JAMA. 1981;245:1762-1763.
11. Stenkvist B, Soderstrom J. Reasons for cervical cancer despite
extensive screening. J Med Screen. 1996;3:204-207.
12. Kenter GG, Schooderwald EM, Koelma IA, et al. The
cytological screening history of 469 patients with squamous
cell carcinoma of the cervix: does interval carcinoma exist?
Acta Obstet Gynecol Scand. 1996;75:400-403.
13. Raab SS, Grzybicki DM, Janosky JE, et al. Clinical impact and
frequency of anatomic pathology errors in cancer diagnosis.
Cancer. 2005;104:2205-2213.
14. Jones BA, Novis DA. Cervical biopsy–cytology correlation:
a College of American Pathologists Q-Probes study of 22,439
correlations in 348 laboratories. Arch Pathol Lab Med.
1996;120:523-531.
15. Ferris DG, Litaker MS, ALTS Group. Prediction of cervical
histologic results using an abbreviated Reid Colposcopic Index
during ALTS. Am J Obstet Gynecol. 2006;194:704-710.
16. van den Akker-van Marle ME, van Ballegooijen M, van
Oortmarssen GJ, et al. Cost-effectiveness of cervical cancer
screening: comparison of screening policies. J Natl Cancer Inst.
2002;94:193-204.
17. Solomon D, Davey D, Kurman R, et al. The 2001 Bethesda
System: terminology for reporting results of cervical cytology.
JAMA. 2002;287:2114-2119.
18. Eddy DM. Screening for cervical cancer. Ann Intern Med.
1990;113:214-226.
19. Goldie SJ, Kim JJ, Myers E. Chapter 19: cost effectiveness of
cervical cancer screening. Vaccine. 2006;24(suppl 3):S164S170.
824
824
Am J Clin Pathol 2007;128:817-824
DOI: 10.1309/97JHG6GLY69BVF4Y
20. Colgan TJ, Clarke A, Hakh N, et al. Screening for cervical
disease in mature women: strategies for improvement. Cancer.
2002;96:195-203.
21. Flowers LC, McCall MA. Diagnosis and management of
cervical intraepithelial neoplasia. Obstet Gynecol Clin North
Am. 2001;28:667-684, viii.
22. Kohn LT, Corrigan JM, Donaldson MS. To Err Is Human:
Building a Safer Health System. Washington, DC: National
Academy Press; 1999.
23. Wang SS, Sherman ME, Hildesheim A, et al. Cervical
adenocarcinoma and squamous cell carcinoma incidence
trends among white women and black women in the United
States for 1976-2000. Cancer. 2004;100:1035-1044.
24. Etherington IJ, Luesley DM. Adenocarcinoma in situ of the
cervix: controversies in diagnosis and treatment. J Lower Genit
Tract Dis. 2001;5:94-98.
25. Ostor AG. Natural history of cervical intraepithelial neoplasia:
a critical review. Int J Gynecol Pathol. 1993;12:186-192.
26. Battles JB, Kaplan HS, Van der Schaaf TW, et al. The
attributes of medical event reporting systems: experience with
a prototype medical event reporting system for transfusion
medicine. Arch Pathol Lab Med. 1998;122:231-238.
27. Rubin G, George A, Chinn DJ, et al. Errors in general
practice: development of an error classification and pilot study
of a method for detecting errors. Qual Saf Health Care.
2003;12:443-447.
28. Sirovich BR, Welch HG. The frequency of Pap smear
screening in the United States. J Gen Intern Med.
2004;19:243-250.
29. Vrbin CM, Grzybicki DM, Zaleski MS, et al. Variability in
cytologic-histologic correlation practices and implications on
patient safety. Arch Pathol Lab Med. 2005;129:893-898.
30. Raab SS, Stone CH, Wojcik EM, et al. Use of a new method
in reaching consensus on the cause of cytologic-histologic
correlation discrepancy. Am J Clin Pathol. 2006;126:836-842.
31. Raab SS, Vrbin CM, Grzybicki DM, et al. Errors in thyroid
gland fine-needle aspiration. Am J Clin Pathol. 2006:125:873882.
32. Renshaw AA, Young ML, Holladay EB. Blinded review of
Papanicolaou smears in the context of litigation. Cancer.
2004;102:136-141.
33. Vinh-Hung V, Bourgain C, Vlastos G, et al. Prognostic value
of histopathology and trends in cervical cancer: a SEER
population study. BMC Cancer. 2007;7:164 [Epub ahead
of print].
34. Cox JT. AGUS Pap smears: a follow-up strategy. Obgyn
Manage. 1998;27:74-97.
35. van Aspert-van Erp AJ, Smedts FM, Vooijs GP. Severe
cervical glandular lesions and severe cervical combined
lesions: predictive value of the Papanicolaou smear. Cancer.
2004;102:210-217.
36. Nieminen P, Kallio M, Hakama M. The effect of mass
screening on incidence and mortality of squamous and
adenocarcinoma of cervix uteri. Obstet Gynecol. 1995;85:10171021.
37. Gandhi TK, Kachalia A, Thomas EJ, et al. Missed and delayed
diagnoses in the ambulatory setting: a study of closed
malpractice claims. Ann Intern Med. 2006;145:488-496.
38. Jones BA, Novis DA. Follow-up of abnormal gynecologic
cytology: a College of American Pathologists Q-Probes study
of 16,132 cases from 306 laboratories. Arch Pathol Lab Med.
2000;124:665-671.
© American Society for Clinical Pathology