Deletions and Insertions in the p53 Tumor

[CANCER RESEARCH 56. 2130-2136.
May 1. 1996]
Deletions and Insertions in the p53 Tumor Suppressor Gene in Human Cancers:
Confirmation of the DNA Polymerase Slippage/Misalignment Model
Marc S. Greenblatt, Arthur P. Grollman, and Curtis C. Harris1
Laboratory of Human Carcinogenesis, National Cancer Institute, National Institutes of Health, Bethesda,
Pharmacological Sciences. Slate University of New York at Stony Brook, Stony Brook, New York 11794 [A. P. G.J
ABSTRACT
We analyzed all published deletions and insertions in the p53 gene to
assess the relevance of mutagenesis
models. Almost all deletions and
insertions can be explained by one or more of the following DNA sequence
features: monotonie base runs, adjacent or nonadjacent repeats of short
tandem sequences, palindromes, and runs of purines or pyrimidines (homocopolymer runs). Increased length of monotonie runs correlates posi
tively with increased frequency of events. Complex frameshift mutations
can be explained by the formation of quasi-palindromes, with mismatch
excision and replication using one strand of the palindrome as a template.
Deletions and insertions in the p53 tumor suppressor gene may reflect
both spontaneous and carcinogen-induced mutagenesis.
INTRODUCTION
The somatic mutations that accumulate during multistep carcinogenesis include major chromosomal events (such as translocations,
allelic deletions, and gene rearrangements) and point mutations (such
as base substitutions and small deletions or insertions; Ref. l). The
relative frequencies of different classes of mutations vary among
cancer-related genes. For example, mutations found in the APC tumor
suppressor gene are most often short deletions or insertions, whereas
activating mutations in the ras proto-oncogene are base substitutions
(1). The largest data base of somatic mutations in human cancers
exists for the p53 tumor suppressor gene (2, 3). All classes of muta
tions are found. Point mutations, often accompanied by deletion of the
second alÃ-ele,are the most common event. The majority of p53 point
mutations are base substitutions, but 10—15%of both somatic and
germ line p53 mutations are small, intragenic deletions or insertions
(1,4). This data set is large enough to test hypotheses regarding the in
vivo mechanisms of intragenic deletions and insertions.
In vitro mutagenesis studies have established that DNA sequence
context is the most important factor in determining the specificity of
deletions and insertions (reviewed in Refs. 5 and 6). Almost all small
deletions and insertions in model systems occur at monotonie runs of
two or more identical bases, or at repeats of 2-8-bp DNA motifs,
either in tandem or separated by intervening nucleotides. Several
mechanisms have been proposed. The most well-studied mechanism,
first proposed by Streisinger et al. (7) in 1966, involves slippage or
misalignment of the template DNA strands in iterated bases during
replication. This leads to either base deletion (if the nucleotides
excluded from pairing are on the template strand) or insertion (if they
are on the primer strand); replication "fixes" these mutations in the
daughter cells (5, 6). Misalignment can occur "spontaneously" in
monotonie runs or can be initiated by a spontaneous or adductinduced base misincorporation that creates a context for slippage (5,
8, 9).
Misalignment may also occur in repeat sequences that are not
Received 9/11/95; accepted 3/4/96.
The cosls of publication of this article were defrayed in part by the payment of page
charges. This article must therefore be hereby marked advertisement in accordance with
18 U.S.C. Section 1734 solely to indicate this fact.
1To whom requests for reprints should be addressed, at Laboratory of Human
Maryland
20892
[M. S. G., C. C. H.J, and Department
of
monotonie runs. When a repeat sequence mispairs with a tandem
(adjacent) or nontandem (nonadjacent) complementary motif, the loop
formed by the intervening oligonucleotide sequence may be deleted or
duplicated (10-12). Such repeats have been identified at the sites of
spontaneous and ionizing radiation-induced deletions (13, 14) and at
the sites of deletions in synthetic DNA sequences with adducted bases
(9). Loops may also form between nonadjacent inverted repeats (pal
indromes), leading to deletion of the loop and sometimes to complex
frameshift mutations involving both deletion and base substitution
(12, 15-17). A review of short (<20-bp) germ line deletions and
insertions in genes associated with human genetic disease confirmed
that these DNA sequence features are the most important factors
determining the specificity of these events (11).
Quantitative mutational spectrum analysis in vitro shows that: (a)
the frequency of frameshift mutations increases as the length of the
monotonie run increases; and (b) single-base deletions are much more
frequent in vitro than are single-base insertions (reviewed in Ref. 5).
The proposed explanation is superior stability of the hydrogen bond
ing of extensive repeats, and G:C-rich regions, and intermediate DNA
structures associated with deletions. DNA polymerase infidelity may
lead to spontaneous frameshift errors; different polymerases vary in
the frequency of errors and the ratio of deletions:insertions (5). The
surrounding sequence context is another major factor affecting the
likelihood of misalignment mutations (9, 18-20).
Limited analysis of smaller p53 mutation data bases has confirmed
the presence of some of these high-risk sequence features, including
monotonie and tandem repeats, at the sites of p53 somatic mutations
(10). Long runs of purines or pyrimidines, known as homocopolymers
(e.g., AGAGAG, CCTTCC, and AAAGGG) have also been noted at
sites ofp53 deletions (21). We updated and reviewed the data base of
all reported somatic p53 mutations (2) and analyzed the subset of all
deletions and insertions in more detail to assess: (a) the extent to
which the mechanisms determined in experimental systems could
account for somatic p53 deletions and insertions; (b) how patterns of
observed p53 deletions and insertions might refine existing models;
and (c) whether relative frequencies of deletions and insertions pro
vide information to distinguish between spontaneous or carcinogeninduced events.
MATERIALS
AND METHODS
The p53 mutation data base has been described and is available via E-mail
from the European Molecular Biology Library (2). We updated the data base
to include all p53 mutations published before September 1994, as identified by
searches of MEDLINE and Current Contents. All intron and exon sequences at
the sites of, adjacent to, and up to 40 bp away from insertions and deletions
were examined, in some cases with the aid of the Gene Jockey sequence
analysis program (Biosoft, Cambridge, United Kingdom). Regression and x2
analyses were performed using standard statistical software.
RESULTS
Since the publication of our original p53 data base (2), the preva
lence of deletions and insertions among all p53 mutations has de
creased from 13 to 11%. This reduction probably reflects a detection
2130
Carcinogenesis. National Cancer Institute. National Institutes of Health. Building 37.
Room 2C05, Bethesda, MD 20892-4255. Phone: (301) 496-2048; Fax: (301) 496-0497.
Downloaded from cancerres.aacrjournals.org on June 15, 2017. © 1996 American Association for Cancer Research.
p53 DELETIONS AND INSERTIONS
Table I DNA sequence features associated with deletions and insertions of pS3 in human cancers
(%)Class
eventAllAllof
Sequence features associated with events
high-risksequence
feature9797951009794961009796Monotonieruns72807775868356867471Tandemrepeats57585667484467436683Tandemrepeats
only1112801083308gHomocopolymerruns565531426665521004738Nontandemrepeats192351700404
deletionsAll
insertionsComplex
frameshifts1
deletions1-bp
-bp
insertions2-bp
deletions2-bp
insertions3
deletions3
+ -bp
+ -bp insertionsn34925978121184827711324Any
bias, because most recent reports have examined only exons 5-8, in
which missense mutations are more common, and have neglected
exons 2-4 and 9-11, in which the majority of mutations are deletions
and insertions (1). The data base contains 370 deletions and insertions
in thep5J gene. Of these, 21 detected by sequencing of cDNA involve
entire exons or large deletions at splice sites, indicating aberrant
mRNA splicing. Because splicing mutations often result from either
point mutations in splice sequences at the intron-exon junction or
dysregulation of the splicing mechanism and not from loss or gain of
DNA, these mutations were not classified as DNA deletions or inser
tions. The patterns and DNA sequence features of the remaining 349
deletions and insertions (or "events") were analyzed.
Sizes and Relative Frequencies ofp53 Deletions and Insertions.
Simple deletions constituted 74% (259 of 349) of all events (71% of
one 1 bp, 76% of two bp, and 82% of &3 bp). Simple insertions
constituted 22% (78 of 349) of the events; the remaining 3% (12 of
349) were complex frameshift mutations that combined deletion,
insertion, and base substitution. In 49% (166 of 349, 118 deletions and
48 insertions) only 1 bp was affected; 10% (34 of 349, 27 deletions
and 7 insertions) involved 2 bp; and 39% (137 of 349, 113 deletions
and 24 insertions) altered 3 or more bp. Only six intragenic deletions
in p53 larger than 30 bp and unassociated with splice sites were
reported; the largest was 137 bp.
DNA Sequence Features at Sites of Deletions and Insertions.
We calculated the proportions of all DNA deletions and insertions
attributable to each of the following mechanisms (Table 1 and Fig. 1):
monotonie base runs, repeats of short tandem sequences, palindromes
(inverted repeats of dyad symmetry), and runs of four or more purines
or pyrimidines (homocopolymer runs).
Almost all [341 (98%) of 349] deletions and insertions can be
explained by one or more of these DNA sequence features. The most
common sequence motifs seen at the site of deletions or insertions are
monotonie runs (two to five consecutive identical bases), present at
the sites of 77% (269 of 349) of all events; only 46% of all p53 bp are
within these runs (odds ratio for an event occurring at a monotonie
run, 3.95; 95% confidence interval, 3.00-5.20; P < 0.0001). Re
peated tandem sequences, either adjacent to or distant from each
other, encompass or flank 30% of deletions or insertions, and palin
dromes flank another 5%. Homocopolymer runs of 4 or more bp occur
at the sites of 56% of events, but in only 3% are they the only
associated feature. Homocopolymer runs of 4 or more bp encompass
42% of all bp in p53-coding exons (odds ratio for an event occurring
at a homocopolymer run, 1.75; 95% confidence interval, 1.38-2.23;
P < 0.0001).
Short deletions and insertions ( 1 bp) are associated with a different
pattern of sequence features than are longer events (2 or more bp;
Table 1). For 1-bp changes (n = 118 deletions; n = 48 insertions),
monotonie runs and direct tandem repeat sequences account for al
most all events. Monotonie runs of 2-5 bp occur at 86 and 83% of
1-bp deletions and insertions, respectively. Where no monotonie
repeat exists (n = 24; 14% of 1-bp events), tandem repeat sequences
are present at 15 (94%) of 16 1-bp deletions and 7 (88%) of 8 1-bp
insertions. Overall, tandem repeats accompany 48% of 1-bp deletions
and 44% of 1-bp insertions; they are the only high-risk DNA se
quences present at 10% of 1-bp deletions and 8% of 1-bp insertions.
Runs of 4 or more homocopolymer bp occur at 64% of 1-bp events,
but in all but one instance (1%), they accompany other high-risk
sequence features.
Deletions of 2 bp (n = 27) are less likely to be associated with
monotonie runs (56%) and more likely to be associated with tandem
repeats (67%). In 33% of these 2-bp deletions, tandem repeats are the
only deletion-promoting DNA sequence features present. However,
six of seven insertions of 2 bp are at monotonie runs, and only three
are at tandem repeats. Homocopolymer runs accompany 52% of
deletions and all seven insertions of 2 bp. Of deletions of 3 or more
bp (n = 134), 68% were at monotonie runs, 58% were at direct
Fig. 1. Sequence features associated with p53 deletions and insertions.
MONO RUN. monotonie base run; TAN RPT. tandem repeat sequence;
HCP RUN, homocopolymer run of four or more bases; NONTAN RPT.
nonlandem repeated sequence (nonadjacent repeat sequence flanks
event); PAL, palindrome.
ANY
HCP
RUN
NONTAN
RPT
Sequence Feature
2131
Downloaded from cancerres.aacrjournals.org on June 15, 2017. © 1996 American Association for Cancer Research.
PAL
pS3 DELETIONS
AND INSERTIONS
DELETION FREQUENCY
0.6
0.5-
0.4 -J
El
0.3
1 BP DELETIONS
•¿
ALL DELETIONS
0.2
0.1
0.0
Fig. 2. Relationship between length of monotonie
repeat and frequency of deletions and insertions in p53.
A, deletions increase in a linear, positive relationship with
length of repeat [R = 0.930 for 1 bp (Q), 0.864 for all
(*)]. B, insertions increase in frequency only at monotonic runs of five bases.
LENGTH OF REPEAT
INSERTION FREQUENCY
B
0.5
0.4
0.3
s
ALL INSERTIONS
•¿
1 BP INSERTIONS
0.2
0.1
0.0
LENGTH OF REPEAT
repeats, and only 43% were accompanied by homocopolymer se
quences. Notably, 43% were flanked by nontandem repeat sequences,
and 10% were flanked by palindromes, a sequence feature that rarely
accompanies shorter events. The same general patterns are seen for
insertions of 3 or more bp (n = 27; Table 1).
Krawczak and Cooper (11) identified a consensus sequence of 6 bp
(TG A/G A/G G/T A/C), which occurs frequently at sites of germ line
deletions in a variety of genetic diseases. We identified this sequence
six times in the coding exons of the p53 gene; 14 somatic p53
deletions or insertions are associated with it, including 11 at one site
in exon 5 (codons 173-175, TG AGG C). With only one exception, at
least one other deletion-associated sequence feature (usually tandem
repeat or palindrome) also is associated with these events. The region
of p53 that most frequently contains deletions or insertions is codons
151-159: CGC CCG CGC CGC ACC CGC GTC CGC GCC. Thirty
mutations (9% of all deletions and insertions, and 1% of all p53
mutations) have been reported at this G:C-rich sequence with multiple
runs and direct repeats.
Increased Length of Monotonie Runs Correlates with Increased
Frequency of Events. To test the prediction of the DNA polymerase
slippage model that deletions and insertions should be more common
at longer repeats (5), we calculated a regression curve of the frequency
of events per site. We divided the number of deletions and insertions
occurring at monotonie runs of 2, 3, 4, and 5 bp by the number of runs
of each length in p53-coding exons times the number of bases per run.
The resulting number describes the frequency of events per bp at
monotonie runs. One run of 6 bp occurs in exon 11, but because this
region is infrequently sequenced (only one deletion here has been
reported), it was not included in this analysis. Fig. 2 shows that for
deletions of 1 bp, the relationship is positive and linear; deletions are
proportionally more frequent at longer runs (R = 0.930). This rela
tionship is present but weaker when deletions of all lengths are
2132
Downloaded from cancerres.aacrjournals.org on June 15, 2017. © 1996 American Association for Cancer Research.
p53 DELETIONS
Table 2 Features of deletions and insertions flanked by palindromes and tandems
nDeletionsInsertionsLength
(%)2 of paired sequence
bp3bp4bp5
bp>5
bpLength
(%)1-3
of deletion
bp4-9
bp10-20bp>20
bpLength
(%)1-3
of insertion
bp4-9
bp10-20
bp>20bpSegment
AND INSERTIONS
sequence context, with both stable palindromes and tandems absent
from the site.
Exogenous versus Endogenous Mutation Analysis by Tumor
Type. The mutational spectrum may be an index of the relative
frequencies of endogenous versus exogenous mutations in tumors and
may help detect patterns characteristic of exogenous mutagens. Pre
viously, we analyzed base substitution patterns among tumor types
and noted that substitutions that are thought to be due to exogenous
carcinogens are more frequent in tumors for which epidemiological
studies indicate a link with carcinogen exposure (1, 4). Endogenous
deamination of 5-methylcytosine at CpG sites results in G:C to A:T
transitions at CpG dinucleotides; CpG site transitions therefore have
been referred to as "spontaneous" mutations, which can occur in the
absence of exogenous mutagens. The prevalence of CpG site transi
tions varies widely among tumor types, generally correlating nega
tively with carcinogen association (1).
(%)Bothdeleted
Because DNA polymerase infidelity alone can produce frameshift
primersIntervening
mutations, deletions and insertions also have been suggested as a
onlyOne
loop
marker of spontaneous mutation (10). However, chemical carcinogens
loopPartial
primer +
loopOtherSegment
primer +
and their adducts can induce frameshift mutations in vitro (6, 18) and,
therefore, may contribute to frameshift mutations in human cancers.
(%)Bothinserted
We compared the prevalence of deletions and insertions among tu
primersIntervening
onlyOne
loop
mors to determine whether they correlated either positively or nega
loopPartial
primer +
tively
with carcinogen exposure and could thus be used as an indicator
loopOtherPalindromes1814401739222271443362500751421-290-757-710SO250250Tandems6560523129201533843130602020710-1257-6517-23000602020
primer +
of endogenous or exogenous mutations.
Deletions and insertions are most common in tumors of the thyroid
(17%), head and neck (15%), breast (15%), and sarcomas (15%); they
are least common in melanoma (5%) and other skin cancers (8%) and
considered, because these events are often associated with other
colon (8%), hepatocellular (8%), and cervical carcinomas (8%). Both
sequence features. Insertions of 1 or more bp are uncommon at runs
of these lists contain tumors that have been frequently (thyroid, head
of two, three, or four bases; this portion of the curve is flat. However,
and neck, skin, hepatocellular, and cervical) and infrequently (sarco
insertions increase sharply in frequency at 5-bp runs (Fig. 2). When
mas and colon) associated with environmental mutagens, and both
only mutations and sequences in the commonly studied exons 5-8
contain tumors that frequently (thyroid and colon) and infrequently
were considered, similar results were obtained (deletions at 4-bp runs
(head and neck and hepatocellular) contain CpG site transitions,
are slightly underrepresented; data not shown). Therefore, selection
markers of mutations unassociated with exogenous mutagens (1).
bias in detection of deletions is unlikely to affect these results.
The prevalence of deletions and insertions varies in tobacco-asso
Nonadjacent Tandem Repeat Sequences and Palindromes. Six
ciated cancers (e.g., cervix, 8%; lung, 10%; bladder, 12%; and head
ty-five mutations occur at nonadjacent tandem repeats, and 18 occur
and neck, 15%). The hot spot region from codons 151-159 is affected
at inverted repeats (palindromes). In all but one of these events, the
in 16% of 36 lung cancer deletions and insertions, including 3 around
length of the repeated motif is 3 or more bp (Table 2). More than
codon 157. Codon 157 is a hot spot for base substitutions in lung
three-fourths of the sequences deleted or inserted between tandem
cancer (1); this sequence may be the target of an important promutarepeats are between 4 and 20 bp long; 40% are between 4 and 9 bp,
genic, tobacco-associated adduci.
and 43% are between 10 and 20 bp, but only 12% are more than 20
bp. Of sequences deleted or inserted between palindromes, however,
DISCUSSION
more than three-fourths are 10 bp or greater; 40% are greater than 20
bp, and few are between 4 and 9 bp. Deletions or insertions flanked by
This analysis confirms many of the prior observations of deletions
tandem sequences usually include one full copy of the repeat and the
and insertions in human disease (10, 11, 21), highlights the relevance
complete sequence between the repeats. Those between inverted re
of many of the mechanistic models for deletion and insertion derived
peats usually include a portion, but not all, of the palindrome; in about
from in vitro mutagenesis studies (5, 6, 9), and suggests refinement of
one-fourth of cases, only the sequence between the repeat is affected
some models.
(Table 2).
The regression curve of the frequency of deletions and insertions
Complex Frameshift Mutations. Complex frameshifts are muta
versus the length of a monotonie base run shows that as the length of
a run increases, the probability of deletions (especially 1-bp deletions)
tions which involve combinations of deletion, insertion, and base
substitution. In a model derived from in vitro studies (6, 17), these
increases in a linear relationship. However, a different relationship is
mutations are explained by formation of quasi-palindromic loops,
noted for insertions. A run of repeated bases must reach a threshold of
which are treated as double-stranded DNA. Deletions, insertions, and
5 bp before insertions are likely to occur in the p53 gene. This
substitutions are generated by excision of mismatches and templatethreshold pattern has been demonstrated for other systems in vitro,
directed replacement of bases complementary to one side of the loop.
such as deletions in DNA polymerase a replication of free DNA (5).
Twelve complex frameshift mutations appear in the data base. Nine
Sequence context surrounding runs of identical length can contribute
of these can be explained by the formation of palindromes with
to differences in deletion and insertion frequency (5, 19, 20); there
template-directed mutation as described by the above model (exam
fore, a simple length-frequency relationship is difficult to interpret.
However, the same sequences in p53 are at risk for both deletions and
ples in Fig. 3; Ref. 17). Two others occur at the sites of tandem repeats
insertions; the fact that their incidence-versHj-length curves differ
of 5-6 bp. Only one complex frameshift is not explained by DNA
2133
Downloaded from cancerres.aacrjournals.org on June 15, 2017. © 1996 American Association for Cancer Research.
pSJ DELETIONS
AND INSERTIONS
CODONS 235 - 241
CODONS 155-161
CC
T
AcxcG
G¡/Ac/
A
C*•
C
AC
fcC (r)
\G/
GT
CG
TC
del 4 C.GCC
ins 1 T
""
G—C
C—
GT©G
mutations fit a model
with DNA excision
of the palindrome as
circled; bold italics.
yAtoT^
AAC fc
^
A-T
C
A^v\cT\\TT
G.
A°T
Q
AG— GC
_AAdel
—¿ins
6 to
10
"~
CTG
/G\T—
—¿
fA\T—
[AG
/GG VA
AT^AA
*QT^lTTGGGGA
GT
C—
T
A— T
C
GA—
—¿
TTG
INTRON 6 - EXON 7, CODON 227
TA
CA
T
T
A
C
C—G
GCTCAC
TAG
jG ^^
CC
CA—
T
A—
TSs$£?GC Cde/3
C
Gr-A7"G—
CODONS 199-213
Fig. 3. Complex frameshift
of quasi-palindrome formation
and replication using one side
a template. Deleted bases are
new bases.
AA
CC
GT
T1
1A
A
C
C
C —¿
G
T
T
G
CTT
G
CT
C
A
C
TTGCGTGTGAGTATTTGG
1G^
T
:C
C
GG
TC
C/l
TC
TA
Ggc
GG
GA
AT
AC
CA
TTO
GAG
C
C
T
C
G
Cmult
fc
subs,
ñdelie
AC
T
AT
G
G
C
C —¿
VT/
C —¿
TT—¿
TC
GT
GT
TU'GGATTCQTCTATA
xG_L/'>»delie
suggests that the intermediate events involved in their generation
differ. One potential explanation is that repair of short heteroduplex
DNA structures predisposing to deletions is less efficient than that of
those leading to insertion (see below).
The proportion of p53 deletions and insertions that represent spon
taneous events due to DNA replication infidelity versus mutageninduced changes remains unclear. Frameshift mutations that seem to
form spontaneously in genomic DNA may be generated by DNA
polymerase-induced errors in replication or may arise as a result of
endogenous or exogenous damage to template DNA. Endogenous
DNA damage is generated by reactive oxygen species (22), products
of lipid peroxidation (23, 24), or other metabolic reactions that alter
DNA structure (25), thereby complicating the assignment of these
mutations to specific environmental agents.
We compared the prevalence of frameshift mutations in tumor
types with epidemiological and mutational spectrum evidence sup
porting carcinogen-induced and endogenous mutagenesis (1); associ
ation with cancers linked to carcinogens would strengthen the infer
ence that frameshifts were induced by exogenous damage. Deletions
and insertions did not correlate well with these indices. This suggests
that differences among tumor types in their prevalence may reflect
both endogenous and carcinogen-induced mutagenesis, such as organspecific carcinogens or tissue-specific properties that regulate repli
cation fidelity. The presence in lung cancers of a hot spot for frameshift mutations in the G:C-rich sequence at codons 151-159, near a
base substitution hot spot (1). suggests that carcinogen-mediated
frameshifts may occur in vivo.
Several models involving
DNA damage-induced
template mis
alignment or slippage during DNA replication have been proposed as
mechanisms of frameshift mutagenesis (5, 6, 8, 9, 17-19, 26-29). The
following sequence of reactions may lead to base deletions: DNA
polymerase encounters a damaged or adducted base at the replication
fork, and DNA synthesis is blocked, often transiently, by helix de
formation or adduci bulk. If chain extension is delayed, the nucleotide(s) at the 3' primer terminus may pair with a complementary
base(s) 5' to the lesion. This misaligned template generates a repli
cation product with deleted bases. The complementary base pairing at
the primer-template junction facilitates extension of the misaligned
chain. Base substitutions occur when insertion of an "incorrect" base
opposite the lesion is followed rapidly by chain extension. Adductinduced base misincorporation may also increase the stability of a
misaligned intermediate.
Frameshift mutations are induced in prokaryotic models by some
bulky chemical mutagens, including aflatoxin B, and arylamines. The
ability of a damaged nucleotide to generate frameshift mutations is
determined by the nature of the base inserted opposite the adduci, the
surrounding sequence context (adduct-induced frameshift mutations
occur more frequently in monotonie or tandem base runs), and the rate
of translesional DNA synthesis estimated from kinetic analysis (9, 19,
20, 30-32). Studies with chemically defined DNA adducts have
established the mutational specificity of some carcinogen-induced
lesions in vitro (31, 33). Although the frequency and sites of muta
tions are not always predictable, the relative propensity for some DNA
lesions to promote frameshift deletions in vitro can be estimated (9).
2134
Downloaded from cancerres.aacrjournals.org on June 15, 2017. © 1996 American Association for Cancer Research.
p5J DELETIONS
AND INSERTIONS
In addition to the effects of specific mutagens, DNA repair capacity
is an important factor affecting mutagenesis. The rates and types of
spontaneous and induced mutations in Salmonella typhimurium can
vary dramatically depending on the DNA repair capacity of the strain
(17, 34). In eukaryotes, mismatch repair involves a complex of several
repair enzymes. One of these, MSH2, binds to palindromes with high
affinity, and to large and small insertions and G/T mismatches with
moderate affinity (35). MSH2 mutants are defective in the repair of
mismatches and small insertion/deletion mispairs (36); thus, this sys
tem is likely responsible for repair of the misalignments leading to
frameshifts. Germ line mutations in hMSH2 (37, 38) and other mis
match repair enzymes have been linked to hereditary nonpolyposis
colon cancer (39-41). Individuals who inherit these mismatch repair
abnormalities demonstrate genome-wide insertions and deletions in
repetitive DNA sequences (replication errors). This "mutator phenotype" may contribute to intragenic deletions in cancer-related genes
(42). However, replication errors and the mismatch repair status in
cases of colon or other tumors containing p53 mutations have not yet
been reported.
These mismatch repair enzymes also are involved in homologous
recombination, as are other proteins, including RecA and its eukaryotic homologues, which bind to homologous DNA regions and pro
mote pairing and strand exchange (43-46). The heteroduplex DNA
sequences formed during this process are generally longer than the
intermediate structures proposed for the mispairs that lead to frameshifts (47, 48), and it is uncertain how these recognition and repair
systems are involved in mutagenesis.
Palindromes and nonadjacent repeat sequences often flank dele
tions and insertions larger than 3 bp in vitro (5, 6, 16) and in this
report. The patterns we observed suggest that the intermediate path
ways differ for events initiated by these two sequence motifs. Palin
dromes generally flank larger deletions, suggesting that short loops
between palindromic dyad elements are less likely to be deleted than
are longer loops. The presence of short loops in the dyads associated
with complex frameshifts (Fig. 3) supports this explanation. The data
also indicate that, although direct repeats of 2-bp motifs can predis
pose to short deletions, more than 2 bp homology are required for both
palindromes and nontandem repeat elements to form stable structures.
Longer dyad sequences are required for palindromes than for tandems.
The deletions associated with tandem sequences usually exactly in
corporate the intervening loop and one copy of the repeat, confirming
the existing model (11, 13, 14, 49). However, the p53 deletions
associated with palindromes usually include portions of both flanking
motifs; the exact deletion generally cannot be predicted. This suggests
that different deletion mechanisms occur, perhaps involving variable
actions of DNA repair proteins (see above). The specific system or
mutagen also may influence repair. For example, the spectrum of
spontaneous and induced mutations in 5. typhimurium usually changes
when a gene for an error-prone DNA repair enzyme is inserted, but no
change is seen in mutations induced by the heterocyclic amine GluP-l (17, 26).
Analysis of complex frameshift mutations confirms that DNA
replication and repair mechanisms may be involved in their genesis.
Prior studies of complex frameshift mutations induced in S. typhi
murium generated a model of adduct-mediated mispairing, quasipalindromes, and DNA replication in palindromes (17). In this system,
the presence of an error-prone DNA repair gene enhances the forma
tion of complex frameshifts. We show that complex mutations in p53
are consistent with DNA excision repair and synthesis in palindromes,
with one strand of the palindrome used as a template (Fig. 3). This
may be triggered by an adduct-induced substitution mutation that
related genes, including APC in colon cancers (50) and pl6'"k4
(CDKN2 and MTS-1) in a variety of tumors (51-54). Interestingly,
1-bp deletions in APC are less often associated with monotonie runs
but are often associated with tandem repeats, perhaps reflecting the
influence of impaired mismatch repair due to abnormalities in the
hMLHl and HMSH2 genes in colon tissue. Of 24 deletions and five
insertions described in pl6'nt4, only 38% (11 of 29) affect 1 bp, and
48% (14 of 29) affect 3 or more bp. DNA sequence features in this
preliminary data base are similar to those seen in p53. Monotonie runs
(86%) and tandem sequences (72%) are the most frequent motifs
present at the sites of deletions.
Deletions and insertions in p53 are likely underreported, because
many investigators concentrate on exons 5-8, the highly conserved
DNA-binding region in which most missense mutations occur. More
complete analysis of all p5J-coding exons would give a more thor
ough picture of mutational patterns. Missense mutations constitute
77% of mutations in exons 5-8 but less than half of the mutations
outside of these regions (1). This is probably due to less frequent
selection of missense mutants in exons 2-4 and 9-11. Single-base
substitutions in the DNA-binding domain often alter p5J-induced
transactivation (1). However, single-amino acid substitutions in the
transactivation domain (amino acids 20-42) do not affect transacti
vation; some double substitutions do (55). Frameshifts occurring
within or 5' to the DNA-binding domain eliminate the p53 DNAbinding function and thus would alter p53 function and cell cycle
kinetics every time they occur. The carboxyl terminus (exons 9-11) of
p53 also is important in the oligomerization and nuclear localization
of the p53 protein (reviewed in Ref. 56) and regulates its sequencespecific DNA binding (57-60).
increases the stability of a palindrome (17).
Germ line and somatic mutation have been reported in other cancer-
ACKNOWLEDGMENTS
We are grateful to Dorothea Dudek for excellent editorial assistance. Peter
Shields for assistance with statistical analysis, and Thomas Kunkel for critical
review of the manuscript.
REFERENCES
1. Greenblatt. M. S., Bennett, W. P., Hollstein. M., and Harris. C. C. Mutations in the
p53 tumor suppressor gene: clues to cancer etiology and molecular pathogenesis.
Cancer Res., 54: 4855-4878. 1994.
2. Hollstein. M.. Rice, K.. Greenblatt, M. S.. Soussi, T.. Fuchs, R.. Sorlie. T.. Hovig. E.,
Smith-Sorensen, B., Montesano, R., and Harris. C. C. Database of p53 gene somatic
mutations in human tumors and cell lines. Nucleic Acids Res., 22; 3547-3551. 1994.
3. Hollstein. M., Shomer, B.. Greenblatt. M.. Soussi, T., Hovig, E., Montesano, R., and
Harris, C. C. Somatic point mutations in the p53 gene of human tumors and cell lines:
updated compilation. Nucleic Acids Res., 24: 141-146. 1996.
4. Hollstein, M., Sidransky, D., Vogelstein, B., and Harris. C. C. p53 mutations in
human cancers. Science (Washington DC), 253: 49-53. 1991.
5. Kunkel. T. A. Misalignment-mediated
DNA synthesis errors. Biochemistry. 29.
8003-8011. 1990.
6. Ripley, L. S. Frameshift mutation: determinants of specificity. Annu. Rev. Genet., 24:
189-213, 1990.
7. Streisinger, G., Okada. Y., Emrich. 1, Newton. J.. Tsugita. A.. Terzaghi, E., and
Inouye. M. Frameshift mutations and the genetic code. This paper is dedicated to
Professor Theodosius Dobzhansky on the occasion of his 66th birthday. Cold Spring
Harbor Symp. Quant. Biol.. 31: 77-84. 1966.
8. Benebek. K.. and Kunkel. T. A. Frameshift errors initialed by nucleotide misincorporation. Proc. Nati. Acad. Sci. USA. 87: 4946-4950. 1995.
9. Shibulani. S.. and Grollman, A. P. On the mechanism of frameshift (deletion)
mutagenesis in vitro. !. Biol. Chem., 268: 11703-11710, 1993.
10. Jego, N., Thomas, G., and Hamelin. R. Short direct repeats flanking deletions, and
duplicating insertions in p53 gene in human cancers. Oncogene, 8: 209-213. 1993.
11. Krawczak. M.. and Cooper. D. N. Gene deletions causing human genetic disease:
mechanisms of mutagenesis and the role of the local DNA sequence environment.
Hum. Genet., 86: 425-441, 1991.
12. Albertini, A. M.. Hofer, M., Calos, M. P., and Miller, J. H. On the formation of
spontaneous deletions: the importance of short sequence homologies in the generation
of large deletions. Cell, 29: 319-328, 1982.
13. Lutze, L. H., Cleaver, J. E., Morgan, W. F., and Winegar, R. A. Mechanisms involved
in rejoining DNA double-strand breaks induced by ionizing radiation and restriction
enzymes. Mutât.Res., 299: 225-232, 1993.
14. Lutze, L. H., Winegar. R. A., Jostes, R., Cross, F. T., and Cleaver, J. E. Radon-
2135
Downloaded from cancerres.aacrjournals.org on June 15, 2017. © 1996 American Association for Cancer Research.
pS3 DELETIONS
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.
30.
31.
32.
33.
34.
35.
36.
37.
38.
AND INSERTIONS
induced deletions in human cells: role of nonhomologous strand rejoining. Cancer
Res., 52: 5126-5129, 1992.
Nalbantoglu. J.. Hartley. D., Phear, G., Tear, G.. and Meuth. M. Spontaneous deletion
formation a( the aprt locus of hamster cells: the presence of short sequence homologies and dyad symmetries at deletion termini. EMBO J., 5: 1199-1204. 1986.
Glickman, B. W.. and Ripley, L. S. Structural intermediates of deletion mutagenesis:
a role for palindromic DNA. Proc. Nati. Acad. Sci. USA, 81: 512-516, 1984.
Levine, J. G.. Schaaper. R. M.. and DeMarini, D. M. Complex frameshift mutations
mediated by plasmid pKM 101 : mutational mechanisms deduced from 4-aminobiphenyl-induced mutation spectra in Salmonella. Genetics, 136: 731-746, 1994.
Lambert. I. B., Napolitano. R. L., and Fuchs. R. P. Carcinogen-induced frameshift
mutagenesis in repetitive sequences. Proc. Nati. Acad. Sci. USA, 89: 1310-1314.
1992.
Belguise-Valladier, P., and Fuchs. R. P. Strong sequence-dependent polymorphism in
adduct-induced DNA structure: analysis of single /V-2-acetylaminofluorene residues
bound within the Narl mutation hot spot. Biochemistry. 30: 10091-10100, 1991.
Parris, C. N., Levy, D. D., Jessee, J.. and Seidman, M. M. Proximal and distal effects
of sequence context on ultraviolet mutational hotspots in a shuttle vector replicated in
xeroderma cells. J. Mol. Biol.. 236: 491-502. 1994.
Redston. M. S.. Caldas, C., Seymour, A. B.. Hruban, R. H., da Costa, L., Yeo. C. J..
and Kem, S. E. p53 mutations in pancreatic carcinoma and evidence of common
involvement of homocopolymer tracts in DNA microdeletions. Cancer Res.. 54:
3025-3033, 1994.
Breimer, L. H. Molecular mechanisms of oxygen radical carcinogenesis and mu
tagenesis: the role of DNA base damage. Mol. Carcinog., 3: 188-197. 1990.
Nath, R. G.. and Chung, F. L. Detection of exocyclic 1JV2-propanodeoxyguanosine
adducts as common DNA lesions in rodents and humans. Proc. Nati. Acad. Sci. USA,
91: 7491-7495, 1994.
Chaudhary, A. K., Nokubo, M.. Reddy, G. R.. Yeola, S. N., Morrow, J. D.. Blair.
I. A., and Marne»,L. J. Detection of endogenous malondialdehyde-deoxyguanosine
adducts in human liver. Science (Washington DC). 265: 1580-1582, 1994.
Lindahl. T. Instability and decay of the primary structure of DNA. Nature (Lond.).
362: 709-715, 1993.
Levine, J. G.. Knasmuller, S., Shelton. M. L., and DeMarini, D. M. Mutation spectra
of Glu-P-1 in Salmonella: induction of hotspot frameshifts and site-specific base
substitutions. Environ. Mol. Mutagen., 24: 11-22, 1994.
Gupta, P. K.. Johnson, D. L., Reid, T. M., Lee, M. S., Romano. L. J., and King, C. M.
Mutagenesis by single site-specific arylamine-DNA adducts. Induction of mutations
at multiple sites. J. Biol. Chem.. 264: 20120-20130, 1989.
Moriya, M., Takeshita. M., Johnson, F.. Peden, K.. Will. S., and Grollman. A. P.
Targeted mutations induced by a single acetylaminofluorene DNA adduci in mam
malian cells and bacteria. Proc. Nati. Acad. Sci. USA, 85: 1586-1589. 1988.
Wang, C. I., and Taylor, J. S. In vitro evidence that UV-induced frameshift and
substitution mutations at T tracts are the result of misalignment-mediated replication
past a specific thymine dimer. Biochemistry, 31: 3671-3681, 1992.
Burnouf. D., Koehl, P.. and Fuchs. R. P. Single adduct mutagenesis: strong effect of
the position of a single acetylaminofluorene adduct within a mutation hot spot. Proc.
Nati. Acad. Sci. USA, 86: 4147-4151, 1989.
Grollman. A. P.. and Shibutani. S. DNA Adducts: identification and Biological
Significance. Mutagenic specificity of chemical carcinogens as determined by studies
of single DNA adducts. IARC Sci. Pubi.. /25: 385-397, 1994.
Mendelman. L. V.. Petruska. J.. and Goodman. M. F. Base mispair extension kinetics.
Comparison of DNA polymerase alpha and reverse transcriptase. J. Biol. Chem., 265:
2338-2346, 1990.
Singer, B., and Essigmann, J. M. Site-specific mutagenesis: retrospective and pro
spective. Carcinogenesis (Lond.), 12: 949-955. 1991.
Bronstein, S. M.. Cochrane, J. E., Craft, T. R.. Swenberg, J. A., and Skopek. T. R.
Toxicity, mutagenicity, and mutational spectra of /V-ethyl-W-nitrosourea in human cell
lines with different DNA repair phenotypes. Cancer Res., 51: 5188-5197, 1991.
Alani. E., Chi, N. W., and Kolodner, R. The Sacchartnn\ces cerevisiae Msh2 protein
specifically binds to duplex oligonucleotides containing mismatched DNA base pairs
and insertions. Genes & Dev.. 9: 234-247, 1995.
Alani. E., Reenan. R. A., and Kolodner. R. D. Interaction between mismatch repair
and genetic recombination in Saccharomyces cerevisiae. Genetics, 137: 19-39, 1994.
Fishel, R., Lescoe, M. K., Rao, M. R.. Copeland, N. G., Jenkins, N. A.. Garber, J.,
Kane. M.. and Kolodner, R. The human mutator gene homolog MSH2 and its
association with hereditary nonpolyposis colon cancer. Cell, 75: 1027-1038, 1993.
Leach, F. S., Nicolaides, N. C., Papadopoulos, N., Liu. B., Jen, J., Parsons, R.,
Peltomaki. P.. Sistonen. P.. Aaltonen, L. A.. Nystrom-Lahti. M., Guan. X. Y.. Zhang.
J.. Meltzer. P. S., Yu, J. W., Kao. F. T.. Chen, D. J. Cerosaletti, K. M.. Foumier,
R. E., Todd. S., Lewis, T., Leach. R. J., Naylor. S. L., Weissenbach, J., Mecklin. J. P.,
Jarvinen, H.. Petersen, G. M.. Hamilton, S. R., Green. J.. Jass, J.. Watson. P., Lynch,
39.
40.
41.
42.
43.
44.
45.
46.
47.
48.
49.
50.
51.
52.
53.
54.
55.
56.
57.
58.
59.
60.
H. T., Trent. J. M., de la Chapelle. A., Kinzler. K. W.. and Vogelstein, B. Mutations
of a mutS homolog in hereditary nonpolyposis colorectal cancer. Cell, 75: 12151235, 1993.
Peltomaki, P.. Aaltonen, L. A., Sistonen. P.. Pylkkanen, L., Mecklin, J. P.. Jarvinen,
H.. Green, J. S., Jass, J. R., Weber, J. L., Leach, F. S., Peterson, G. M.. Hamilton.
S. R.. de la Chapelle, A., and Vogelstein, B. Genetic mapping of a locus predisposing
to human colorectal cancer. Science (Washington DC), 260: 810-812, 1993.
Aaltonen, L. A.. Peltomaki, P., Leach, F. S.. Sistonen. P.. Pylkkanen, L., Mecklin,
J. P., Jarvinen. H., Powell. S. M.. Jen, J.. Hamilton, S. R.. Peterson, G. M., Kinzler,
K. W., Vogelstein, B., and de la Chapelle, A. Clues to the pathogenesis of familial
colorectal cancer. Science (Washington DC). 260: 812-815, 1993.
Thibodeau, S. N.. Bren. G., and Schaid. D. Microsatellite instability in cancer of the
proximal colon. Science (Washington DC), 260: 816-819, 1993.
Markowitz. S., Wang. J., Myeroff. L.. Parsons. R.. Sun, L.. Lutterbaugh, J., Fan, R. S.,
Zborowska, E., Kinzler, K. W., Vogelstein, B., Brattain, M.. and Willson, J. K.
Inactivation of the type II TGF-ßreceptor in colon cancer cells with microsatellite
instability. Science (Washington DC). 26S: 1336-1338, 1995.
Radman, M.. and Wagner, R. Mismatch recognition in chromosomal interactions and
speciation. Chromosoma (Beri.), 102: 369-373, 1993.
Hollingsworth, N. M.. Ponte, L.. and Halsey, C. MSH5. a novel MutS homolog,
facilitates meiotic reciprocal recombination between homologs in Saccharom\ces
cerevisiae but not mismatch repair. Genes & Dev., 9: 1728-1739. 1995.
Baker, S. M., Bronner, C. E., Zhang, L., Plug, A. W., Robatzek, M., Warren, G.,
Elliott, E. A., Yu, J., Ashley, T., and Amheim. N. Male mice defective in the DNA
mismatch repair gene PMS2 exhibit abnormal chromosome synapsis in meiosis. Cell,
82: 309-319, 1995.
de Wind, N.. Dekker, M.. Berns, A., Radman, M.. and te Riele. H. Inactivation of the
mouse Afs/i2 gene results in mismatch repair deficiency, methylation tolerance,
hypcrrecombination, and predisposition to cancer. Cell. 82: 321-330, 1995.
Conley, E. C.. and West, S. C. Underwinding of DNA associated with duplex-duplex
pairing by RecA protein. J. Biol. Chem., 265: 10156-10163, 1990.
Hsieh, P., Camerini-Otero. C. S.. and Camerini-Otero, R. D. The synapsis event in the
homologous pairing of DNAs: RecA recognizes and pairs less than one helical repeat
of DNA. Proc. Nati. Acad. Sci. USA, 89: 6492-6496. 1992.
Studier, F. W., Rosenberg, A. H., Simon. M. N., and Dunn. J. J. Genetic and physical
mapping in the early region of bacteriophage T7 DNA. J. Mol. Biol.. 135: 917-937,
1979.
Nagase. H., and Nakamura, Y. Mutations of the APC gene. Hum. Mutât..2: 425-434,
1993.
Yeudall, W. A., Crawford, R. Y., Ensley, J. F., and Robbins. K. C. MTSI/CDK4I is
altered in cell lines derived from primary and metastatic oral squamous cell carci
noma. Carcinogenesis (Lond.). /5: 2683-2686. 1994.
Ohta. M., Nagai, H., Shimizu. M.. Rasio, D., Berd. D., Mastrangelo, M.. Singh, A. D.,
Shields, J. A., Shields, C. L., Croce. C. M., and Huebner, K. Rarity of somatic and
germline mutations of the cyclin-dependent kinase 4 inhibitor gene CDK41 in mela
noma. Cancer Res., 54: 5269-5272, 1994.
Spruck, C. H., Gonzalez-Zulueta, M., Shibata, A., Simoneau, A. R., Lin, M. F..
Gonzales, F., Tsai, Y. C., and Jones, P. A. pió gene in uncultured tumours. Nature
(Lond.), 370: 183-184. 1994.
Okamoto, A., Hussain, S. P., Hagiwara, K., Spillare. E. A.. Rusin. M. R., Demetrick,
D. J., Serrano, M.. Hannon. G. J.. Shiseki, M., Zariwala, M., Xiong, Y., Beach, D. H..
Yokota, J.. and Harris. C. C. Mutations in the pl6INK4/MTSl/CDKN2.
pl5INK4B/
MTS2 and pl8 genes in primary and metastatic lung cancer. Cancer Res.. 55:
1448-1451. 1995.
Lin. J., Chen, J., Elenbaas, B.. and Levine, A. J. Several hydrophobic amino acids in
the p53 amino-terminal domain are required for tran.scriptional activation, binding to
mdm-2 and the adenovirus 5 E1B 55-kD protein. Genes & Dev., 8: 1235-1246. 1994.
Hupp. T. R., and Lane. D. P. Allosteric activation of latent p53 tetramers. Curr. Biol.,
4: 865-875, 1995.
Bayle, J. H., Elenbaas, B.. and Levine, A. J. The carboxyl-terminal domain of the p53
protein regulates sequence-specific DNA binding through its nonspecific nucleic
acid-binding activity. Proc. Nati. Acad. Sci. USA, 92: 5729-5733. 1995.
Foord, O. S.. Bhattacharya. P.. Reich. Z.. and Rotter, V. A DNA binding domain is
contained in the C-terminus of wild type p53 protein. Nucleic Acids Res., 19:
5191-5198, 1991.
Wolkowicz, R., Peled, A., Elkind, N. B., and Rotter, V. Augmented DNA-binding
activity of p53 protein encoded by a carboxyl-terminal alternatively spliced mRNA is
blocked by p53 protein encoded by the regularly spliced form. Proc. Nati. Acad. Sci.
USA. 92: 6842-6846. 1995.
Jayaraman, J.. and Prives, C. Activation of p53 sequence-specific DNA binding by
short single strands of DNA requires the p53 C-terminus. Cell, 81: 1021-1029. 1995.
2136
Downloaded from cancerres.aacrjournals.org on June 15, 2017. © 1996 American Association for Cancer Research.
Deletions and Insertions in the p53 Tumor Suppressor Gene in
Human Cancers: Confirmation of the DNA Polymerase
Slippage/Misalignment Model
Marc S. Greenblatt, Arthur P. Grollman and Curtis C. Harris
Cancer Res 1996;56:2130-2136.
Updated version
E-mail alerts
Reprints and
Subscriptions
Permissions
Access the most recent version of this article at:
http://cancerres.aacrjournals.org/content/56/9/2130
Sign up to receive free email-alerts related to this article or journal.
To order reprints of this article or to subscribe to the journal, contact the AACR Publications
Department at [email protected].
To request permission to re-use all or part of this article, contact the AACR Publications
Department at [email protected].
Downloaded from cancerres.aacrjournals.org on June 15, 2017. © 1996 American Association for Cancer Research.