Microdeletion associated with the integration process of hepatitis B

volume 16 Number 11 1988
Nucleic Acids Research
Mkrodeletion associated with the integration process of hepatitis B virus DNA
Takaaki Nakamura, Takashi Tokino, Tsutomu Nagaya + and Kenichi Matsubara*
Institute for Molecular and Cellular Biology, Osaka University, Yamada-oka, Suita, Osaka 565, Japan
Received March 9, 1988; Revised and Accepted May 4, 1988
ABSTRACT
Hepatitis B vinos (HBV) DNA is often found in integrated form in
hepatocellular carcinomas (HCC) and in non-cancerous liver cells of chronic
carriers of HBV. However, the process of integration has not been well
understood. Analyses of integrant DNA was expected to give clues. However, the
majority of the integrants are products of multistep rearrangements following
integrations, and analysis of randomly selected samples do not give clues for
understanding the process of primary integrant formation. Therefore, one must
select an appropriate integrants) that has a simple structure. We surveyed a
collection of integrants prepared from many HCC's, and found one integrant that
has the simplest structure so far studied: The viral genome is almost complete, is
joined to cellular DNA using the cohesive end of the viral DNA, and furthermore,
the "left" and "right" flanking cellular DNA's are almost contiguous. Analysis of
the unoccupied sites in cellular DNA showed that, although almost contiguous, it
has generated a microdeletion (15 base pairs) in the target sequence. This target
sequence has a short region of homology to the sequence in the viral genome
located close to the junction. One integrant with strikingly similar features has
been reported independently. Two similar, but not identical cases from literatures
could be added to this category. Therefore, the integrants with these properties
may represent a unique category among those prepared from hepatocellular
carcinomas.
Based on these findings, we propose that this integrant represents the
primary product of integration, and discuss the intermediate acting in the process
of integration.
INTRODUCTION
Hepatitis B virus (HBV) is shown epidemiologically as a causative agent for
hepatocellular carcinoma (HCC) (1). The finding that HBV DNA is integrated in
many HCC's has raised much interest in the studies on the process of HBV DNA
integration and its subsequent effect on the induction of HCC (2). Because the
integration process can not be reproduced in cultured cells, attempts have been
madetoobtain insights into this problem by analyzing the integrants in HCC's (3,
4, 5, 6, 7, 8, 9, 10, 11). However, the results and models derived in different
research groups differed from each other, because many integrants are complex in
structure, reflecting the different process of their formation and multiple stages of
© IRL Press Limited, Oxford, England.
4865
Nucleic Acids Research
rearrangements (5, 7, 8, 12). Apparently, an appropriate integrants) must be
selected with which one can infer the integration process.
To select such integrants, two criteria must be met (i) The integrant
structure, of necessity, should be simple, (ii) The integrant should not be an
isolated case, but integrants with the similar structure should be found in multiple
members of HCC. We surveyed nineteen integrants obtained in our laboratory as
prepared from seven different HCC's (3) and looked for those that have the
simplest structure. The criteria for the "simple structure" was that the integrated
viral genome, as well as the flanking cellular DNA's, was not in rearranged form
or was not associated with large deletions. Absence of a large deletion could be an
unjustified criteria, but this was unavoidable for practical purposes in the
selection of candidates. We also took into account our previous inference (3, 9,13)
that the cohesive end region of the viral genome may be used in production of
primary integrants (3).
We found one integrant that meets these criteria. This paper reports the
structure of this integrant, and that of the unoccupied site of the cellular DNA
sequence used as a target for integration.
The results, compared with three reported cases (13, 14, 15) showed that
these integrants share the same basic features, indicating that they constitute a
unique category among the integrants. Based on these findings, we proposed a
model to account for the integration process of HBV DNA into liver cell
chromosomes.
MATERIALS AND METHODS
The integrant clone
The clone pY has been reported elsewhere (3).
Southern analyses
Restriction endonuclease fragments with unique cellular DNA sequences
were extracted from the "left" and "right" flanking cellular sequences of the
integrated clone pY (see Fig. 1A), and used as "left" and "right" probes for
Southern hybridization (16).
Cloning and analysis of cellular counterpart DNA sequences
A cosmid library was prepared from human leukocyte DNA partially
digested with Mbol (17). This library was screened by the left and the right probes
described above to isolate the cellular counterpart fragments covering the HBV
integration site. The fragment thus obtained was recloned into plasmid pBR325,
named pNY, and its sequence was analyzed as according to Sanger (18).
4866
Nucleic Acids Research
RESULTS AND DISCUSSION
HBV integration site in clone pY
The structure of an integrant clone pY is shown in Fig. 1 A. The viral insert
is 3.1 kb DNA of continuous subgenomic fragment that covers the genes C, S and
X. The left border, position 1822 lies 2 bp away from the DR1, that serves as a
specific site for initiation and termination of viral DNA and RNA syntheses (19,
20). Therightborder, position 1708, lies in the X gene.
To examine the structure of the cellular DNA in this integrant, we isolated
restriction fragments of cellular DNA containing unique sequences from the "left"
and the "right" flunking sequences of the integrant viral DNA. These DNA
fragments termed left and right probes were used as labeled probes in Southern
analyses. Fig. 2 shows the results of such analyses with human leukocyte DNA,
digested with different restriction enzymes. Both probes hybridized to one and the
same HindUl genomic DNA fragment, indicating that they lie very close together
in the unoccupied genome. The L and R probes did not hybridize to one DNA
fragment in other restriction digests, as there is more than one cleavage site for
such enzymes in between the L and R probe sites. We, therefore, cloned the
corresponding DNA fragment from a cosmid library, and recloned it into pBR325
for further analyses. This plasmid was designated pNY, whose restriction map is
HBV
lA)
DBl C
O—*•
H^Q G
X
9^_g
BCE
B
0
B
H
(B)
pNY
integration sita
Fig. l
Restriction map of the integrated HBV DNA and its flanking cellular
sequences in the clone pY (A), and that of the corresponding cellular sequence
around the unoccupied site (B). The open horizontal arrow denotes the HBV DNA
pointing the direction of its transcription. The thin line represents the flanking
cellular DNA. The open bars denote the unique cellular sequences used as the left
(L) and the right (R) probes for Southern analyses. The map of the partial HBV
genome is also displayed to demonstrate the location of the viral genes. C, HBcAg;
S, HBsAg; X, X gene. Restriction sites are : £coRI (E), flamHI (B), Bgltt (G),
Hindm (H), and Xhol (X). DR1 denotes the 11-base pair direct repeat sequence
that acts as a signal for transcription and replication (19, 20). The vertical arrow
in B represents the site of the viral DNA integration.
4867
Nucleic Acids Research
kb
L probe
E B G H
23.19.4 —
6.6 —
23.19.46.6-
—
#
4.4_
4.4 —
2.3 2.0-
kb
R probe
E B G H
1
'
#
2.32.0-
ft
Fig. 2
Southern blots of unoccupied cellular DNA using the left and right flanking
cellular sequences from an integrant as probes. Digests of normal human
leukocyte DNA by restriction enzymes as shown were electrophoresed and blotted
onto nitrocellulose filters and hybridized to the left (L) and the right (R) probes
prepared from the flanking sequence of the integrant clonejpY. The following
restriction enzymes were used: EcoBI (E), BaniSl (B), BgUI (G), and Hindm (H).
Numerals show the size of reference DNA fragments.
shown in Fig. IB. Comparison of this map with that of pY shows that the HBV
integration did not cause major rearrangements in the cellular sequence.
Nucleotide sequences at the unoccupied sites
To see the effect of HBV integration on cellular DNA, the sequence of the
flanking cellular DNA around the junction of the integrated HBV DNA in pY was
compared with that of the unoccupied cellular counterpart DNA in pNY. Fig. 3
shows that the sequence of the flanking cellular DNA is identical to that of the
unoccupied cellular DNA, except that it has a 15 bp (underlined) deletion at the
viral insertion site. This microdeletion may have been formed in association with
the HBV integration. No extensive homology was found between the HBV and
cellular DNA sequences, but we noticed a short (5 bp) sequence CCTCT being
shared between the viral DNA located close to the left junction and the deleted
portion of the cellular DNA. We also noticed a 3 bp sequence (TAA) at the right
junction homologous to the deleted portion of the cellular DNA (Fig. 4B).
A single nucleotide change, T to A, is observed in the pY at its 3' end (Fig. 3).
4868
Nucleic Acids Research
1S32
1708
HBV
CCATGCAACTTTTTCACCTCTGC
ATACTTCAAAGACTGT
pY
GTAQTTCCCqrTTTCACCTCTGC
ATACTfoTGATTTQTA
pNY
GTAGTTCC^AAATTCCTCTTGTTATTGATTTGTA
Fig. 3
DNA sequences around the HBV integration site. The sequences around the
left and right viral-cellular junctions in pY^are compared with the HBV sequence
(subtype adr, ref. 27, 28) and the unintegrated cellular sequence in the
corresponding region of the normal allele. The nucleotide numbers of the HBV
genome are indicated. The integrant HBV DNA sequence in the pY is boxed. The
deleted cellular sequence in the normal allele is underlined. Snort homologous
sequence that is present both in the viral DNA and in the deleted portion of
cellular DNA is demonstrated by thick bars.
This might reflect the polymorphism, since pY and pNY have been derived from
different individuals. Alternatively, the change might be accompanied with the
HBV integration. A similar one nucleotide substitution has been observed in
another integrant, DTI. These possibilities must be elucidated through collection
of more data (see Fig. 4).
Comparison with reported cases
These findings can be summarized as follows: (i) the "left" and "right"
flanking sequences are essentially contiguous in the normal unoccupied allele
DNA, but the target cellular DNA has generated a microdeletion, 15 bp in size, (ii)
the region of a microdeletion and the region of the viral genome that lies very close
to the viral-cell junction point carry a sequence homology, 5 bp, or 3 bp in size, (iii)
nonrearranged, and nearly the unit length of the viral genome , from DR1 at 5' end
to the gene X at 3' end, was integrated, (iv) at least one (in this case, both) of the
integrant viral DNA ends lies in the cohesive end region of the viral genome.
Upon surveying literatures, we noticed one case reported by Hino (14)
describing an integrant called C3 that has these four featureB in common. The C3
carries nearly the unit length of the viral genome, from the position 1824 on the
DR1 to 1762, almost identical to our case, and it has generated a 6 to 11 bp
microdeletion in the target cellular sequence in which a 5 bp sequence is
homologous to the viral sequence appearing at the junction. The striking
similarity between the pY and C3 leads us to propose that they can not be mere
coincidences of isolated cases among the HBV DNA integrants, but they represent
a unique category. Based on the simplest structures of both the HBV DNA and
cellular DNA in the integrants so far analyzed, we also propose, as noted in the
introduction, that they may represent the primary products of the virus
integration, reflecting the process of integration. The two integrated viral
4869
Nucleic Acids Research
OAGOCATAC^TCATTTOTAO •
TTGATITGTAG •
ATAGCjGATACACCCT •
GATACACCGT-
ACCGCAAATCCJCCOTCCAOTGTA-- CCCTCGAOTOTA-- -
Fig. 4
Four integrants that have the simplest structure, with several unique
properties in common, as gathered from this work and literatures ( ref. 13,14,15 ).
(A) Structures of the integrated HBV genome. Heavy lines and wavy lines
represent, respectively, viral genomes and the flanking cellular sequences. The
two 11-base pair direct repeats (DR1, DR2) are boxed. (B) The sequences around
the cellular-viral junctions in the integrants and those in the target cellular DNA'
s. The nucleotide numbers of the HBV genomes are indicated. The integrated
HBV genomes are boxed. DR1 and DR2 are the 11-base pair direct repeats.
Arrows show the direction of transcription of the viral genome. The
microdeletions in cellular sequences are underlined, and the small homologous
sequences between cellular and viral DNA's are underlined by thick bars and
weavy lines. Asterisks indicate mismatch bases.
genomes look very similar to thereversetranscripts of the pregenome DNA (2,19,
20), from which a region copying the 3' terminal portion of the mRNA or the 5'
portion of the first DNA strand, has been missing. We can speculate that such
DNA molecules, appearing as replicative intermediates, or as dead-end products
duringreplication,may have been used as substrates for integration.
4870
Nucleic Acids Research
Two integrants with similar, but not identical properties have been reported.
Dejean et al (13) found an integrant ADT1 covering 1591 to 2931, about 50% of the
viral genome, and Berger et al (15) found an integrant clone 26 covering 2536 to
1797, about three quarters of the viral genome. Both of these integrants are
associated with microdeletions in the target cellular sequence, 7 to 12 bp and 17
bp, and in both cases, one of the junctions terminates within the cohesive end
region of the viral DNA. Furthermore, in both clones, short homologies are found
between the deleted cellular sequence and the viral sequence that lies close to.or at
the junction (Fig. 4). However, the regions of the viral genome covered by these
two integrants are different from that of pY and C3, viz. they carry only a portion
of the viral genome, whereas the latter two carry almost the entire genome. In
addition, the ends of ADT1 and 26 do not coincide with those in pY and C3. It looks
as though the integration precursors for ADT1 and 26 are the same as those for pY
and C3, only that the former two have lost wider regions from both ends. If so, the
intermediate for the four integration events could have been viral replicative
intermediates that have generated some deletions, large or small, at one end or
both ends. Whether the missing regions have been absent from the beginning, viz.
in the replicative intermediates, or lost during integration is not clear. Other
replicative intermediates can also be imagined, but it seems to be too premature at
this time to further speculate the origin of the intermediates.
The similar features of the four integrants strongly suggests that the process
in which the viral DNA was integrated into cellular DNA was almost identical.
An attractive model would be to assume viral replicative intermediates, with one
of their broken ends in double stranded or in transient single stranded form, or one
end still attached to a protein (21) invading into cellular DNA carrying out strand
exchange using the short homology sequence. It is well known that illegitimate
recombinations are often associated with some deletion formation (22, 23). Fig. 4
shows that the four integrants carry the short homologous sequence close to,or at
one of the ends. It is likely that such a short homology may have served some role
in strand exchange. The nature and the location of the homologous sequence are
not uniform with these four samples. Thus, further accumulation of data are
awaited before we consider a detailed model to correlate the homology, directions
of strand invasion, and the structure of the substrate for integration. Such a
mechanism is quite different from that acting in retrovirus integration where the
direct repeat (in this case LTR) sequences are employed, giving rise to duplication
of the target cellular sequence (24,25).
Yaginuma et al (9) found an integrant whrerein the HBV DNA is flanked by
directly repeating DNA's derived from the host target sequence. No deletion in
4871
Nucleic Acids Research
cellular DNA was found in this case. They claimed, based on this finding, that the
HBV genome may be integrated by a mechanism similar to that acting in
integration of retrovirus (24). However, this case seems to be an exceptional one,
because no other case conforming to their model has been found since then.
Yaginuma et al (26) also reported an integrant 1707-1, that is associated
with 16 bp short deletion of the cellular DNA. However, a 5 bp viral sequence was
observed to repeat at both junctions, and no short homology as in the four cases in
Fig.4 was observed. This clone is .therefore, considered to belong to a different
category of integrants.
Many of the clones so far analyzed were found to comprise another category
of integrants wherein the structure of viral DNA, as well as cellular DNA, show
signs of complex rearrangements. Such rearrangements include chromosome
translocations, amplification of the integrated viral genome followed by their
transpositions along with the joined cellular DNA, and large deletions incurred
within the host chromosomes. We are tempted to hypothesize that such complex
integrants may have been formed from the simple primary integrants by
subsequent DNA,rearrangements. Tokino et al (12) have discussed the possibility
that such secondary rearrangement reactions may have been mediated by
transient activation of the cohesive end region in the integrant viral DNA. The
intriguing problems related to HBV replication, integration and subsequent
chromosome rearrangement will be solved in the future by establishing an
appropriate cell line. If secondary rearrangements are in fact frequent events, as
suggested, it would be interesting to know whether there is any correlation
between specific categories of integration and the type of HCC or non-cancerous
chronic carriers.
ACKNOWLEDGMENTS
We thank Dr. Y. Nakamura at Utah University for providing a cosmid library.
This work was supported by a Grant-in-Aid for Special Project Research in CancerBioScience from the Ministry of Education, Science and Culture of Japan.
*To whom correspondence should be addressed
••"Present address: Institute for Bioscience, Nippon Zeon Co., Yako 1-2-1, Kawasaki, 210 Japan
REFERENCES
1. Beasley, RJ5., Lin, C.C., Hwang, L.-Y., and Chin, C. S. (1981) Lancet ii,
1129-1133.
2. Tiollais, P., Pourcel, C. and Deiean, A. (1985) Nature 317,489-495.
3. Nagaya, T., Nakamura, T., Tokino, T., Tsurimoto, T., Imai, M., Mayumi, T.,
4872
Nucleic Acids Research
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26
27.
28.
Kamino, K., Yamamura, K., and Matsubara, K. (1987) Genes and
Development 1,773-782.
Dejean, A., Sonigo, P., Wain-Hobson, S., and Tiollais, P. (1984) Proc. Natl.
Acad.Sci. 81,5350-5354.
ffino, O., Shows, T. B., and Rogler, C. E. (1986) Proc. Natl. Acad. Sci. 83,
8338-8342.
Koch, S., Freytag von Loringhoven, A., Kahman, R., Hofscheider, P. H. and
Koshy, R. (1986) Nucl. Acids Res. 12, (17) 6871-6886.
Mizusawa, H., Taira, M., Yaginuma, K., Kobayashi, M., Yoshida, E., and
Koike, K. (1985) Proc. Natl. Acad. Sci. 82,208-212.
Rogler, C. E., Sherman, M., Su, C. Y., Shafritz, D. A. Summers, J., Shows, T.
B. Henderson, A., and Kews, M. (1985) Science 230,319-322.
Yaginuma, K., Kobayashi, M., Yoshida, E., and Koike, K. (1985) Proc. Natl.
Acad. Sci. 82,4458-4462.
Ziemer, M., Garcia, P., Shaul, Y., and Rutter, W. J. (1985) J. Virol. 53,885892.
Shih, C , Burke, K., Chou, M.^J., Zeldis, J. B., Yang, C.-S., Lee, C.-S.,
Isselbacher, K J., Wands, J.R. and Goodman, H. M. (1987) J. Virol. 61,34913498.
Tokino, T., Fukushige, S., Nakamura, T., Nagaya, T., Murotsu, T., Shiga, K.,
Aoki, N., and Matsubara, K. (1987) J. Virol. 61,3848-3854.
Dejean, A., Bougueleret, L., Grzescbik, K. H., and Tiollais, P. (1986) Nature.
322, 70-72.
Hino, O. (1986) KAN. TAN. SUI. (Japan) 13,455^161 (in Japanese).
Berger, I. and Shaul, Y. (1987) J. Virol. 61,1180-1186.
Southern, E. M. (1975) J. Mol. Biol. 98,503-517.
Collins, J. and Hohn, B. (1978) Proc. Natl. Acad. Sci. 75,4242^246.
Sanger, F., Nicklen, S., and Coulson. A. R. (1977) Proc. Natl. Acad. Sci. 74,
5463-5467.
Seeger, C , Ganem, D. and Varmus, H. E. (1986) Science 232,477-484.
Will, H., Reiser, W., Weimer, T., PfafT, M., Buescher, M., Sprengel, R.,
Catteneo, R. and Schaller, H. (1987) J. Virol. 61,904-911.
Gerlich, W. H. and Robinson, W. S. (1980) Cell 21,801-809.
Szostak, J. W., Orr-Weaver, T. L. and Rothstein, R. J. (1983) Cell 33,25-35.
Klar, A. J. S. and Strathern, J. N. (eds) (1984) Cold Spring Harbor Symp.
Quant Biol. 49.
Varmus, H. and Swanstrom, R. (1985) Replication of retroviruses. In
Weiss.R. (ed.), RNA tumor virus, pp.75-134. Cold Spring Harbor Laboratory,
Cold Spring Harbor, N. Y.
Panganibian, A. T. (1985) Cell 42,5-6.
Yaginuma, K., Kobayashi, H., Kobayashi, M., Morishima, T., Matsuyama,
K. and Koike, K. (1987) J. Virol. 61,1808-1813.
Fujiyama, A., Miyanohara, A., Nozaki, C , Yoneyama, T. Ohtomo, N. and
Matsubara, K. (1983) Nucl. Acids Res. 11 (13), 4601-4610.
Ohno, Y., Onda, H., Sasada, R., Igarashi, K. Sugino, Y. and Nishioka, K.
(1983) Nucl. Acids Res. 11 (6), 1747-1757.
4873