Detection, PCR-amplification, and characterization

GLOBAL JOURNAL OF BIOCHEMISTRY
Detection, PCR-amplification and characterization of a 4 intron-containing
non-symbiotic hemoglobin gene from the moss Physcomitrella patens
Fernando Martínez-Ocampo, Consuelo Vázquez-Limón, Raúl Arredondo-Peter*
Laboratorio de Biofísica y Biología Molecular, Departamento de Bioquímica y Biología Molecular,
Facultad de Ciencias, Universidad Autónoma del Estado de Morelos, Av. Universidad 1001,
Col. Chamilpa, 62210 Cuernavaca, Morelos, México
*
Author for Correspondence: Raúl Arredondo-Peter, email: [email protected]
Received 14 Mar 2012; Accepted 12 Apr 2012; Available Online 13 Apr 2012
Abstract
It has been well established that land plant non-symbiotic hemoglobin (nshb), symbiotic hemoglobin (shb) and leghemoglobin (lb)
genes are interrupted by 3 introns (IVS-I, IVS-II and IVS-III). Here, we report the detection, PCR-amplification and DNA sequencing of a 4
intron-containing nshb (nshb2) gene from the Physcomitrella patens DNA. Sequence analysis revealed that nshb2 is interrupted by introns IVSI, IVS-II and IVS-III, and by an extra intron (pre-IVS-I) interrupting coding sequences upstream to IVS-I. Phylogenetic analysis showed that P.
patens nsHb2 clusters with bryophyte nsHbs, however nsHb2 is divergent from P. patens nsHb1. Modeling of the tertiary structure of P. patens
nsHb2 showed that this protein could fold into the myoglobin-fold. The existence of intron pre-IVS-I in P. patens nshb2 but not in other known
land plant nshbs, shbs and lbs suggests that this intron inserted after gene duplication that originated nshb2. If this is correct intron insertion
possibly occurred in land plant nshbs other than P. patens nshb2. This will be verified as novel nshbs become available from newly sequenced
land plant genomes.
Keywords: Gene structure; Hemoglobin; Intron; Non-symbiotic; Physcomitrella patens; Plants
Abbreviations: Hb, hemoglobin; IVS, intervening sequence (intron); Lb, leghemoglobin; Mb, myoglobin; nsHb, non-symbiotic hemoglobin;
ORF, open reading frame; sHb, symbiotic hemoglobin; tHb, truncated hemoglobin
1. Introduction
Globins (Hbs) are a large family of proteins that have
been detected in organisms from the three kingdoms of life
[1]. In land plants three types of Hbs exist: symbiotic Hbs
(sHbs), non-symbiotic Hbs (nsHbs) and truncated Hbs (tHbs)
[2-4]. Symbiotic Hbs (or leghemoglobins (Lbs) when isolated
from legume species) are only localized in nodules of N2fixing land plants [5-9]. In contrast, nsHbs and tHbs are
widely distributed in land plants and are localized in tissues
from symbiotic and non-symbiotic plant organs [10-16].
Phylogenetic analysis of land plant Hbs suggests that these
proteins vertically evolved and diversified with land plants,
that the nsHb, sHb and Lb and tHb lineages evolved from
different ancestors previously to the emergence of
Archaeplastida (i.e. algae and land plants), and that sHbs and
Lbs originated from a nsHb ancestor [4, 17, 18].
During the last decades the availability of sequenced
genomes permitted the identification and characterization of
hb genes from several land plants, ranging from primitive
bryophytes to evolved monocots and dicots. Known land plant
nshb, shb and lb genes are interrupted by 3 introns (IVS-I,
IVS-II and IVS-III) mostly conserved at positions B12.2,
E15.0 and G7.0 (i.e. intron position within the myoglobin
(Mb)-fold and phase). Thus, the apparent gene structure for
land plant nshbs, shbs and lbs is 4 exons/3 introns. In contrast,
intron position and number is variable in known land plant thb
genes [18]. During the search for nshb gene copies in the
genome of the moss Physcomitrella patens at the COSSMOS
database (see subsection 2.1.) we detected a nshb (nshb2) that
is interrupted by 4 introns. This observation suggested that
gene structure for nshb2 is 5 exons/4 introns. Here, we report
the detection, PCR-amplification and characterization of 4
intron-containing nshb2 from the P. patens DNA. Results
from this work revealed that nshbs with gene structure other
than 4 exons/3 introns exist in land plants.
2. Experimental
2.1. Database search and mapping of the nshb2 gene in the
P. patens genome
Physcomitrella patens nshb (nshb1) gene and protein
sequences (GenBank accession number EF028055 and
ABK20873, respectively) were used as probes to search for
nshb gene copies in the P. patens genome at the COSMOSS
database (http://www.cosmoss.org/, last accessed on
September 2011) using the Blast tool [19]. Putative nsHb
sequences were subjected to a FUGE analysis (http://wwwcryst.bioc.cam.ac.uk, last accessed on September 2011) to
determine the most similar globin structure and presence of
proximal His at the Mb-fold position F8. Putative globins had
to satisfy the following criteria: length higher than 100 amino
acids, a FUGE Z score higher than 6 (which corresponds to
99% specificity [20]) with globin structures, and the presence
of proximal His at position F8. Scaffolds containing copies of
the nshb gene were used for mapping nshbs. This included the
detection of open reading frames (ORFs) 10-16 and 5-9 Kb
Global J. Biochem. 2012, 3: 12
1
www.simplex-academic-publishers.com
© 2012 Simplex Academic Publishers. All rights reserved.
GLOBAL JOURNAL OF BIOCHEMISTRY
Archaeplastida nsHbs and Lbs, respectively,
were performed using the ClustalX program [23].
Archaeplastida nsHb and Lb sequences were
obtained
from
the
GenBank
database
(http://www.ncbi.nlm.nih.gov, last accessed on
September 2011) using the accession numbers
reported by Garrocho-Villegas et al. [4],
Fernández et al. [24] and Vinogradov et al. [18].
Sequence alignment was manually verified using
the procedure described by Kapp et al. [25] based
on the Mb-fold [26]. Phenogram was obtained
from aligned sequences using the MEGA
program (version 5.05) [27] and the Neighbor
Joining method [28] with 1000000 bootstrap
replicates.
2.4. Molecular modeling and analysis of the
predicted P. patens nsHb2 tertiary structure
The tertiary structure of P. patens nsHb2
was modeled using the automated mode of the IFigure 1. Mapping of the nshb1 (A) and 2 (B) genes in P. patens scaffolds _71 and
TASSER
server
_660, respectively. Numbers in parenthesis indicate the scaffold total length.
(http://zhanglab.ccmb.med.umich.edu/IArrows indicate open reading frames (ORFs), and ORF length, transcription
TASSER/, last accessed on November 2011) [29direction, and localization in the +/- strand.
31] and the crystal structure of rice Hb1 [32]
(PDB ID 1D8U) as a template. Model reliability
up- and downstream to nshbs, respectively, and ORF length,
was
evaluated
using
the
Verify3D
program
transcription direction and localization in the +/- strand.
(http://nihserver.mbi.ucla.edu/Verify_3D/, last accessed on
November 2011) [33]. Model was edited using the VMD
2.2. Plant growth, isolation of total DNA, PCR-amplification
program [34] and Adobe Photoshop® software. Distance of
and DNA sequencing
amino acid residues at the heme prosthetic group were
Physcomitrella patens var. Gransden was grown
calculated using the distance tool of the SwissPDBViewer
essentially as described by Garrocho-Villegas et al. [21].
program as described by Gopalasubramaniam et al. [35].
Briefly, protonemas were cultured in Knop medium in a plant
growth chamber (Biotronette Plant Growth Chamber, Lab3. Results and Discussion
Line) at 20oC with 16/8 h light/dark periods for 1 to 2 months,
plants were harvested, immediately frozen in liquid N2 and
3.1. Detection and mapping of nshb2 in the P. patens
stored at -70oC until used. Total DNA was isolated from 100
genome
mg
of
P.
patens
protonemas
using
the
The search for copies of nshb genes at the
cetyltrimethylammonium bromide (CTAB) method [22].
COSMOSS database using P. patens nshb1 sequences
Primers were designed for PCR-amplification using sequences
(GenBank accession number EF028055 and ABK20873) as
at the start and stop codons from the P. patens nshb2 gene
probe showed that a number of scaffolds containing hb-like
detected at the COSMOSS database (see subsection 2.1.).
sequences exist into the P. patens genome. After sequence
Primer sequences were: (sense) primer P. patens Hb2fwd: 5'evaluation only nshb sequences from scaffold_71 and
ATGGCATCATCGATTGGCG-3’, and (antisense) primer P.
scaffold_660 satisfied the criteria to be considered authentic
patens Hb2rvs: 5'-TCAAGTATGAGATTTAGCTGCTGC-3’.
globins (see subsection 2.1.). Scaffold_71 contains the P.
Total P. patens DNA (1.6 ng) was used as template for PCRpatens nshb1 sequence used as probe, and scaffold_660
amplification. PCR components and concentrations were 14
contains a copy (nshb2) of nshb1. Mapping of nshb1 and 2
pmol of each sense and antisense primer, 0.2 mM of each
revealed that these genes are located at the – and + strands
dNTP (Invitrogen), and 1 U of Taq DNA polymerase
from scaffolds _71 and _660, respectively, that genes coding
(Invitrogen) in 1X PCR buffer containing 2 mM MgCl2 in a
for a PAP2_haloperoxidase and hypothetical proteins are
final volume of 25 l. PCR-amplification was carried out for
flanking to nshb1, and that no ORFs exist 16 and 5 Kb up35 cycles at 58oC for annealing using a thermalcycler
and downstream to nshb2, respectively (Figure 1). Translation
(Minicycler, MJ Research). PCR products were detected in a
of nshb1 and 2 into predicted nsHb1 and 2 polypeptides and
1.2% agarose gel after staining with ethidium bromide,
sequence comparison between nshb1 and 2 genes revealed that
isolated from the gel using the GeneClean kit (Q-BIOgene)
nshb1 and 2 are interrupted by 3 and 4 introns, respectively.
and sequenced by PCR in both orientations using the sense
Gene nshb1 is interrupted by IVS-I, IVS-II and IVS-III
and antisense primers at the Institute of Biotechnology of the
previously recognized in known land plant nshbs, shbs and
National Autonomous University of México.
lbs. Gene nshb2 is interrupted by IVS-I, IVS-II and IVS-III
identically to those in nshb1, and by an extra intron
2.3. Sequence alignment and phenetic analysis
interrupting coding sequences upstream to IVS-I (see
Pairwise and multiple sequence alignment of P.
subsection 3.2.). Because sequences deposited in genome
patens nsHb1 and nsHb2, and P. patens nsHb2 and selected
databases occasionally result from sequencing and assembly
Global J. Biochem. 2012, 3: 12
2
www.simplex-academic-publishers.com
© 2012 Simplex Academic Publishers. All rights reserved.
GLOBAL JOURNAL OF BIOCHEMISTRY
ag/gg, tg/gt and ag/ac, and ag/gt and ag/gt, respectively) from
P.. patens nshb2 and other land plant nshb
nshbs,
s, such as P. patens
nshb1 (GenBank accession number EF028055), rice hb1 [36]
and Parasponia hb [37],
[37], are highly
highly conserved.
3.3. Sequence alignment and phenetic analysis of P. patens
nsHb2 and selected Archaeplastida hemoglobins
Sequence alignment showed that nsHb1 and 2 are 63
and 78% identical and similar, respectively, and that these
proteins contain the highly conserved distal (H84) and
proximal (H119) histidine. Also, pairwise sequence alignment
showed that intron pre-IVS-I
pre
I from nsHb2 is located at position
preA11.1, and that introns IVS-I,
IVS I, IVS
IVS-II
II and IVS-III
IVS III are
located in nsHb1 and 2 at positions B12.2
B12.2,, E15.0 and G7.0,
respectively (Figure
(
4). Phylogenetic analysis of selected
Archaeplastida nsHbs and Lbs showed that P. patens nsHb2
clusters with bryophyte nsHbs, however nsHb2 is divergent to
P.. patens nsHb1 and Ceratodon purpureus nsHb (Figure
(
5).
The above observations indicate that P.. patens nsHb1 and 2
are conserved and evolved from a common ancestor.
However, the existence of intron pre-IVSpre
-I in nshb2 but not in
nshb1 and other known land plant nshb
nshbss suggests that pre
preIVS I inserted into nshb2 after gene duplication that originated
IVS-I
nshb2 (see section 4).
Figure 2. PCR-amplification
amplification of P. patens nshb2 (arrow).
Samples were subjected to electrophoresis in a 1.2% agarose gel
and stained with ethidium bromide. Line 1, molecular size
markers (1 kb plus, Invitrogen); line 2, a 5 ll aliquot from the
total PCR reaction. Molecular size markers are shown in bp.
artifacts we verified the existence of nshb2 in the P. patens
DNA by PCR-amplification
PCR amplification and DNA sequencing.
3.2. PCRPCR-amplification,
amplification, DNA sequencing and analysis of
P. patens nshb2
Total DNA isolated from P. patens protonemas was
used as template for PCR-amplification
PCR amplification in combination with
primers designed from the nshb2 sequence detected in
scaffold_660 (see subsection 2.2.). Figure 2 shows that a
single fragment of 1,832 bp in size was amplified from the P.
patens total DNA. This fragment was isolated and sequenced
in both orientations. Sequence comparison showed that 1,832
bp fragment and nshb2 from scaffold_660 are identical, thus
indicating that nshb2 exists in the P. patens DNA. Sequence
analysis showed that nshb2 is interrupted by introns at
positions 61, 341, 613 and 1,519 ((Figure
Figure 3). Introns at
positions 341, 613 and 1,519 interrupt nshb2 identically to
IVS-I,
IVS I, IVS-II
IVS II and IVS-III
IVS II from known land plant nshbs,
nshb
including to P. patens nshb1.. Intron at position 61 interrupts
nshb2 at coding sequences located upstream to IVS
IVS-I,
I, thus we
named this intron as pre-IVS
pre IVS-I.
I. Sequences flanking the
exon/intron boundaries to prepre-IVS-II are tg/gt and ag/ag. With
the exception of the 5´-end
5´ end of exon/intron boundary in IVS-II
IVS
(tg/gt), which is identical to the 55-end
end of exon/intron boundary
in pre-IVSpre
-I,
I, the exon/intron boundaries among pre
pre-IVS-II and
IVS-I,
IVS I, IVS-II
IVS II and IVS-III
IVS III are different ((Figure 3). However,
exon/intron boundaries in IVSIVS-I, IVS-II
II and IVS-III
IVS III (ga/gt and
Global J. Biochem. 2012,
2012 3: 12
3.4. Molecular modeling and analysis of the predicted P.
patens nsHb2 tertiary structure
The crystal structure of few land plant nsHbs has
been reported [17, 32, 38, 39],
39], but no crystallographic analysis
has been performed
performed on bryophyte nsHbs. However, the
predicted structure of C. purpureus and P
P. patens nsHbs was
reported by Garrocho-Villegas
Garrocho Villegas et al. [21] and is deposited in
the Caspur database
database (http://mi.caspur.it/PMDB/
(http://mi.caspur.it/PMDB/,
http://mi.caspur.it/PMDB/
ID:
PM0074985), respectively. Elucidating the tertiary structure of
nsHbs is important to understand the evolution and clarify the
function of these proteins in land plants. We modeled the
tertiary structure of predicted P. patens nsHb2 using the
automated mode of the I-TASSER
I TASSER server and the crystal
structure of rice Hb1 [32] (PDB ID 1D8U) as a template.
Evaluation using the Verify 3D program indicated that model
for P. patens nsHb2 is reliable, as values along the sequence
were higher than 0. Figure 6 shows that the predicted structure
of P. patens nsHb1 and 2 are highly similar, and that P. patens
nsHb2 could fold into the Mb-fold.
Mb fold. Minor differences between
the predicted structure of P. patens nsHb1 and 2 were detected
at the conformation of pre-helix
pre helix A and CD
CD- and GH-loop.
GH loop.
However, a major difference was that length of helix H is
longer in predicted P. patens nsHb2 than nsHb1. The
examination of amino acids that are essential for binding of
ligands to the Fe-heme
Fe heme showed that position of proximal and
distal His are similar among P. patens nsHb1
Hb1 and 2 and
hexacoordinate land plant nsHbs, such as rice Hb1 [32] and
nd
maize Hb [40] (not shown). This observation suggests that Fe
Feheme in P. patens nsHb1 and 2 is hexacoordinate.
4. Conclusions
Based on available sequences, for many years it was
known that land plant nshbs, shbs
shb and lbss are interrupted by 3
introns. As result from these observations it was postulated
that the ancestor to land plant nshbss was interrupted by 3
introns and that the 4 exons/3 introns gene structure
3
www.simplex-academic
www.simplex academic-publishers.com
publishers.com
© 2012 Simplex Academic Publishers.
Publishers All rights reserved.
reserved
GLOBAL JOURNAL OF BIOCHEMISTRY
1
1
............................................................
ATGGCATCATCGATTGGCGCCGCACCAGTTTCCGTTGCGCCAGAAACTGCACATGGTGCT
AMAAAAASAASAAIAAGAAAAAAAAPAAVAASAAVAAAAAPAAEAATAAAAAHAAGAAA
60
20
61
............................................................
Ggtaagcacggcgccgcatttgctcagctccatatttcgccgtcaccattacccccagtt
120
121
............................................................
ttcttaggaagttctgcttgggaatagaagcgcggctgagtatcttagaagctgttttgg
180
181
............................................................
tgtcccaatctcgattgaagaacctttctgtaactaatcatttgtaaccgtgaattcgtt
240
241
21
............................................................
gcagAGAAGGTTTACTCCAAGGAGAGCGTGGCCCTGGTGAAGCAGTCTTGGGAGCTTCTA
AAAAEAAKAAVAAYAASAAKAAEAASAAVAAAAALAAVAAKAAQAASAAWAAEAALAAL
300
39
301
40
............................................................
AAAGTGGACGCACAAGCAAACGCCGTCGCTTTCTTCAAAGAgtaaatacatctcacattc
AKAAVAADAAAAAQAAAAANAAAAAVAAAAAFAAFAAKAAE
360
53
361
............................................................
cgatattgtcgtgactggctgtagctcgagttttagtggatctgcctgtcgcagatgtga
420
421
............................................................
tgggcagtggtgctcgtgttactgccaatagagtcgtttcctgtctttcgtgaggtgcta
480
481
54
............................................................
agttggtggcatcatcagGGTATTCGAGATTGCACCAGCAGCCAAGGGGTTCTTCTCCTT
AAAAAAAAAAAAAAAAAAAAVAAFAAEAAIAAAAAPAAAAAAAAKAAGAAFAAFAASAAF
540
67
541
68
............................................................
CATGCAGGACACCAGCATCCCCTTCGAGGATAACCCCAAAGTGAAATTCCATGCTCTTCA
AAMAAQAADAATAASAAIAAPAAFAAEAADAANAAPAAKAAVAAKAAFAAHAAAAALAAQ
600
87
601
88
............................................................
GGTCTTCAAGTTGgtaagcactgagaatcctaaattccaactccctgagttttgagaaac
AAVAAFAAKAAL
660
91
661
............................................................
gtctgcacagctattgttatccgttacctagatgcaggtatatgttttgagattataact
720
721
............................................................
aagaacatcctgtcagggtggccgtaacaacattcttccttcatcacgcgcatgcccagt
780
781
............................................................
tttaggtacagtgatggaccacgtttgggccctgtgctctttgccttgcctgtgattaaa
840
841
............................................................
caagcaattctttagagcgatcgtgtaaaggaaatagttacagtcattctcaggggatct
900
901
............................................................
gacagatggtagagctgtgttcgaccagtacatccggctgcatgacgagtgtctggcatg
960
961
............................................................
ctcatagagcttggttgtccttttaattttcttcgttgaagacttgaatctaaaacttat
1020
1021
............................................................
tttcgtcctaatcattcagtttaattgctacgtcattcaagggtttacaatctagttgtg
1080
Figure 3. Nucleotide and predicted protein sequence of P. patens nshb2 amplified by PCR (Figure 2). Coding (exon) and non-coding
(intron) sequences are shown in upper and lowercase letters, respectively. Sequences flanking the exon/intron boundaries are underlined.
Distal (H84) and proximal (H119) His residues are shown with black background. The P. patens nshb2 gene sequence was deposited in the
GenBank database under the accession number JQ692084.
Global J. Biochem. 2012, 3: 12
4
www.simplex-academic-publishers.com
© 2012 Simplex Academic Publishers. All rights reserved.
GLOBAL JOURNAL OF BIOCHEMISTRY
1081
............................................................
attagatggaacctatatcttgataacatggatcaactgagtaatttccggcagataagc
1140
1141
............................................................
atcattttgaaacttacggaaaccattcattacgttattccacttcagtcattcgttata
1200
1201
............................................................
tccctcgtgatctttgcccgtcctcatcaacgtgatgtttcgcccaacctcggttatcag
1260
1261
............................................................
ttcttggctaaattactttgtgaaaacttaagaatgaagtcccaagaaccgacttgaata
1320
1321
............................................................
ccggaaatcggcatgcttctaaagtagtaagccagacttgaactttaaatttagggctcc
1380
1381
92
............................................................
ttacacttcttaatggattcagACAGGGGACGCGGCGGCGCAACTCGGAGAGAAAGGCGC
AAAAAAAAAAAAAAAAAAAAAAATAAGAADAAAAAAAAAAAQAALAAGAAEAAKAAGAAA
1440
104
1441
105
............................................................
ATACGAGCTTCTCCAGTCTCGTCTTCACAGTCTGGCCGCCAAGCATCTCAGTAAGGGAGT
AAYAAEAALAALAAQAASAARAALAAHAASAALAAAAAAAAKAAHAALAASAAKAAGAAV
1500
124
1501
125
............................................................
TCAAGATGCTCATTTCGAGgtaagatccctgcctcatgccattaacttgaccccggtcat
AAQAADAAAAAHAAFAAE
1560
130
1561
............................................................
ctgcacatgcttccagcgatgctcattcccaattccgacaggcgtagacgaacatttcta
1620
1621
131
............................................................
atacgtgaaatgtaaatggaaaattaacccatgatcggttataccagGTGGTAAAAGAGG
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAVAAVAAKAAE
1680
134
1681
135
............................................................
CTTTGCTGCGGACGATCGCGGCAGGCTTGCCAGACCTCTGGTCCCCCGCATTGAAACAGG
AAALAALAARAATAAIAAAAAAAAGAALAAPAADAALAAWAASAAPAAAAALAAKAAQ
1740
154
1741
155
............................................................
CATGGGCCGATGCCTACGACGCACTTGCCACCACTCTCAAGAATGAAATGCACGCTCAAG
AAAWAAAAADAAAAAYAADAAAAALAAAAATAATAALAAKAANAAEAAMAAHAAAAAQ
1800
174
1801
175
..............................
CCGCCGCGGCAGCAGCTAAATCTCATACTTGA
AAAAAAAAAAAAAAAAAAKAASAAHAATAA*
1832
184
Figure 3. (Continued)
conserved in land plant nshbs, shbs and lbs [4, 21, 41]. These
postulates were confirmed over the last years as novel nshbs,
shbs and lbs became available from newly sequenced land
plant genomes. However, our search in land plant genomes
revealed the existence of a nshb gene (nshb2) interrupted by 4
introns in P. patens (see subsections 3.1. and 3.2.). This
observation indicated that gene structure other than 4 exons/3
introns may exist in land plants.
Introns interrupting P. patens nshb2 correspond to
canonical IVS-I, IVS-II and IVS-III from known land plant
nshbs, shbs and lbs plus pre-IVS-I. Thus, an intriguing
question is: what was the origin of intron pre-IVS-I in P.
patens nshb2? One possibility is that intron pre-IVS-I already
existed in an ancestral algal nshb and conserved into first land
plant nshbs. This implicates that nshbs from a number of
primeval land plants, such as liverworts and mosses other than
P. patens, are also interrupted by intron pre-IVS-I, and that a
nshb copy only interrupted by 3 introns (i.e. introns IVS-I,
IVS-II and IVS-III) originated from an ancestral 4 introncontaining nshb after (pre-IVS-I) intron loss. The resulting 3
intron-containing nshb was conserved and the ancestral 4
intron-containing nshb was lost during the subsequent
evolution of land plants. Other possibility is that nshb2
inserted into the P. patens genome by horizontal gene transfer
from a 4 intron-containing hb donor. This implicates that hb
donor was interrupted by introns pre-IVS-I, IVS-I, IVS-II and
IVS-III. However, a candidate for a 4 intron-containing hb
donor is not known. We consider the above possibilities
unlikely and favor the following most parsimonious
hypothesis: a 3 intron (IVS-I, IVS-II and IVS-III)-containing
nshb1 duplicated into the P. patens genome originating nshb2,
which was subsequently interrupted by insertion of intron preIVS-I thus resulting in 4 intron (pre-IVS-I, IVS-I, IVS-II
and IVS-III)-containing P. patens nshb2. If this hypothesis
Global J. Biochem. 2012, 3: 12
5
www.simplex-academic-publishers.com
© 2012 Simplex Academic Publishers. All rights reserved.
GLOBAL JOURNAL OF BIOCHEMISTRY
Figure 4. Sequence alignment of P. patens nsHb1 and 2. Conserved amino acid residues, and distal and proximal His residues are shown
with asterisks and black background, respectively. Intron (IVS) position within the Mb-fold
Mb fold is indicated in parenthesis an
and
d with gray
background. Helix regions were calculated from the structure of rice Hb1 [32].
[32]
Figure 5. Phenetic relationships of P. patens nsHb1 and 2 (arrow) and selected Archaeplastida nsHbs and Lbs. Phenogram was constructed
from Hb sequences reported by Garrocho
Garrocho-Villegas
Villegas et al. [4],, Fernández et al. [24] and Vinogradov et al. [18]..
Global J. Biochem. 2012, 3: 12
6
www.simplex
www.simplex-academic
academic-publishers.com
publishers.com
© 2012
201 Simplex Academic Publishers.
Publishers All rights reserved.
reserved
GLOBAL JOURNAL OF BIOCHEMISTRY
8.
9.
10.
11.
Figure 6. Overlay of predicted P. patens nsHb1 (red) and 2 (blue)
tertiary structures. Helices (including prehelix A) are indicated
with letters A to H. The predicted structure of P. patens nsHb1
and 2 is deposited in the Protein Model Database (http://mi.
caspur.it/PMDB/) under the ID number PM0074985 and
PM0077988, respectively.
is correct, it is possible that intron insertion similar to that
postulated for P. patens nshb2 has occurred in other land plant
nshbs. This hypothesis will be verified as novel nshbs become
available from newly sequenced land plant genomes.
12.
13.
14.
Acknowledgements
Work in the authors´ laboratory has been financed by
SEP-PROMEP (grant number UAEMor-PTC-01-01/PTC23)
and Consejo Nacional de Ciencia y Tecnología (CoNaCyT
grant numbers 25229N and 42873Q), México. C. V.-L. is a
postdoctoral fellow supported by CoNaCyT.
15.
16.
References
1.
2.
3.
4.
5.
6.
7.
Vinogradov S. N., Hoogewijs D., Bailly X., ArredondoPeter R., Gough J., Dewilde S., Moens L., Vanfleteren J.
R., A phylogenomic profile of globins, BMC Evol. Biol. 6
(2006) 31-47.
Arredondo-Peter R., Hargrove M. S., Moran J. F., Sarath
G., Klucas R. V., Plant hemoglobins, Plant Physiol. 118
(1998) 1121-1126.
Ross E. J. H., Lira-Ruan V., Arredondo-Peter R., Klucas
R. V., Sarath G., Recent insights into plant hemoglobins,
Rev. Plant Biochem. Biotechnol. 1 (2002) 173-189.
Garrocho-Villegas V., Gopalasubramaniam S. K.,
Arredondo-Peter R., Plant hemoglobins: what we know
six decades after their discovery, Gene: Funct. Evol.
Genom. 398 (2007) 78-85.
Appleby C. A., Leghemoglobin and Rhizobium
respiration, Annu. Rev. Plant Physiol. 35 (1984) 443-478.
Appleby C. A., The origin and functions of haemoglobin
in plants, Sci. Prog. 76 (1992) 365-398.
Appleby C. A., Tjepkema J. D.,Trinick M. J.,
Hemoglobin in a nonleguminous plant Parasponia:
17.
18.
19.
20.
21.
22.
possible genetic origin and function in nitrogen fixation,
Science 220 (1983) 951-953.
Fleming A. I., Wittenberg J. B., Wittenberg B. A.,
Dudman W. F., Appleby C. A., The purification,
characterization and ligand-binding kinetics of
hemoglobin from root nodules of the non-leguminous
Casuarina glauca-Frankia symbiosis, Biochim. Biophys.
Acta 911 (1987) 209-220.
Tjepkema J. D., Hemoglobins in the nitrogen-fixing root
nodules of actinorhizal plants, Can. J. Bot. 61 (1983)
2924-2929.
Andersson C. R., Jensen E. O., Llewellyn D. J., Dennis E.
S., Peacock W. J., A new hemoglobin gene from soybean:
a role for hemoglobin in all plants, Proc. Natl. Acad. Sci.
USA 93 (1996) 5682-5687.
Lira-Ruan V., Ruiz-Kubli M., Arredondo-Peter R.,
Expression of non-symbiotic hemoglobin 1 and 2 genes in
rice (Oryza sativa) embryonic organs, Commun. Integrat.
Biol. 4 (2011) 457-458.
Lira-Ruan V., Sarath G., Klucas R. V., Arredondo-Peter
R., Synthesis of hemoglobins in rice (Oryza sativa var.
Jackson) plants growing in normal and stress conditions,
Plant Sci. 161 (2001) 279-287.
Ross E. J. H., Shearman L., Mathiesen M., Zhou J.,
Arredondo-Peter R., Sarath G., Klucas R. V., Nonsymbiotic hemoglobins are synthesized during
germination and in differentiating cell types, Protoplasma
218 (2001) 125-133.
Taylor E. R., Nie X. Z., MacGregor A. W., Hill R. D., A
cereal haemoglobin gene is expressed in seed and root
tissues under anaerobic conditions, Plant Mol. Biol. 24
(1994) 853-862.
Vieweg M. F., Hohnjec N.,Küster H., Two genes
encoding different truncated hemoglobins are regulated
during root nodule and arbuscular symbiosis of Medicago
truncatula, Planta 220 (2005) 757-766.
Watts R. A., Hunt P. W., Hvitved A. N., Hargrove M. S.,
Peacock W. J., Dennis E. S., A hemoglobin from plants
homologous to truncated hemoglobins of microorganisms,
Proc. Natl. Acad. Sci. USA 98 (2001) 10119-10124.
Kakar S., Hoffman F. G., Storz J. F., Fabian M., Hargrove
M. S., Structure and reactivity of hexacoordinate
hemoglobins, Biophys. Chem. 152 (2010) 1-14.
Vinogradov S. N., Fernández I., Hoogewijs D.,
Arredondo-Peter R., Phylogenetic relationships of plant
3/3 and 2/2 hemoglobins to bacterial and other eukaryotic
hemoglobins, Mol. Plant 4 (2011) 42-58.
Altschul S. F., Gish W., Miller W., Myers E. W., Lipman
D. J., Basic local alignment search tool, J. Mol. Biol. 215
(1990) 403-410.
Shi J., Blundell T., Miziguchi K., FUGUE: sequencestructure homology recognition using environmentspecific substitution tables and structure-dependent gap
penalties, J. Mol. Biol. 310 (2001) 243-257.
Garrocho-Villegas V., Arredondo-Peter R., Molecular
cloning and characterization of a moss (Ceratodon
purpureus) non-symbiotic hemoglobin provides insight
into the early evolution of plant non-symbiotic
hemoglobins, Mol. Biol. Evol. 25 (2008) 1482-1487.
Crose J., Amorese B., Isolation of plant DNA from fresh
tissue, Focus 9 (1978) 3-6.
Global J. Biochem. 2012, 3: 12
7
www.simplex-academic-publishers.com
© 2012 Simplex Academic Publishers. All rights reserved.
GLOBAL JOURNAL OF BIOCHEMISTRY
23. Thompson J. D., Gibson T. J., Plewniak F., Jeanmougin
F., Higgins D. G., The clustal X windows interface:
flexible strategies for multiple sequence alignment aided
by quality analysis tools, Nucl. Acids Res. 24 (1997)
4876-4882.
24. Fernández I., Vinogradov S. N., Arredondo-Peter R.,
Identification and in silico characterization of a putative
ancestor to land plant non-symbiotic hemoglobins from
the prasinophyceae algae Micromonas and Ostreococcus,
Global J. Biochem. 1 (2010) 18-30.
25. Kapp O. H., Moens L., Vanfleteren J., Trotman C. N. A.,
Suzuki T., Vinogradov S. N., Alignment of 700 globin
sequences: extent of amino acid substitution and its
correlation with variation in volume, Protein Sci. 4 (1995)
2179-2190.
26. Lesk A. M., Chothia C., How different amino acid
sequences determine similar protein structures: the
structure and evolutionary dynamics of the globins, J.
Mol. Biol. 136 (1980) 225-270.
27. Tamura K., Peterson D., Peterson N., Stecher G., Nei M.,
Kumar S., MEGA 5: Molecular evolutionary genetics
analysis using maximum likelihood, evolutionary
distance, and maximum parsimony methods, Mol. Biol.
Evol. 28 (2011) 2731-2739.
28. Saitou N., Nei M., The neighbor- joining method, a new
method for reconstructing phylogenetic trees, Mol. Biol.
Evol. 4 (1987) 406-425.
29. Roy A., Kucukural A., Zhang Y., I-TASSER: a unified
platform for automated protein structure and function
prediction, Nature Protocols 5 (2010) 725-737.
30. Roy A., Xu D., Poisson J., Zhang Y., A protocol for
computer-based protein structure and function prediction,
J. Visual. Exp. 57 (2011) e3259.
31. Zhang Y., I-TASSER server for protein 3D structure
prediction, BMC Bioinformatics 9 (2008) 40.
32. Hargrove M., Brucker E. A., Stec B., Sarath G.,
Arredondo-Peter R., Klucas R. V., Olson J. S., PhilipsJr.
G. N., Crystal structure of a non-symbiotic hemoglobin,
Structure 8 (2000) 1005-1014.
33. Eisenberg D., Lüthy R., Bowie J. U., VERIFY3D:
assessment of protein models with three-dimensional
profiles, Meth. Enzymol. 277 (1997) 396-404.
34. Humphrey W., Dalke A., Schulten K., VMD-Visual
molecular dynamics, J. Mol. Graph. 14 (1996) 33-38.
35. Gopalasubramaniam S. K., Garrocho-Villegas V., Bustos
G., Pastor N., Arredondo-Peter R., Use of in silico
(computer) methods to predict and analyze the tertiary
structure of plant hemoglobins, Meth. Enzymol. 436
(2008) 393-410.
36. Arredondo-Peter R., Hargrove M. S., Sarath G., Moran J.
F., Lohrman J., Olson J. S., Klucas R. V., Rice
hemoglobins: gene cloning, analysis and oxygen-binding
37.
38.
39.
40.
41.
kinetics of a recombinant protein synthesized in
Escherichia coli, Plant Physiol. 115 (1997) 1259-1266.
Appleby C. A., Bogusz V., Dennis E. S., Peacock W. J.,
A role for hemoglobin in all plant roots?, Plant Cell
Environ. 11 (1988) 359-367.
Smagghe B. J., Kundu S., Hoy J. A., Halder P., Weiland
T. R., Savage A., Venugopal A., Goodman M., Premer S.,
Hargrove M. S., Role of phenylalanine B10 in plant
nonsymbiotic hemoglobins, Biochemistry 45 (2006)
9735-9745.
Sturms R., Kakar S., Trent J. T., Hargrove M. S., Trema
and Parasponia hemoglobins reveal convergent evolution
of oxygen transport in plants, Biochemistry 49 (2010)
4085-4093.
Saenz-Rivera J., Sarath G., Arredondo-Peter R., Modeling
the tertiary structure of a maize (Zea mays ssp. mays)
non-symbiotic hemoglobin, Plant Physiol. Biochem. 42
(2004) 891-897.
Arredondo-Peter R., Ramírez M., Sarath G., Klucas R. V.,
Sequence analysis of an ancient hemoglobin cDNA
isolated from the moss Physcomitrella patens (Accession
No. AF218049), Plant Physiol. 122 (2000) 1457.
Cite this article as:
Fernando Martínez-Ocampo et al.: Detection, PCR-amplification and characterization of a 4 intron-containing non-symbiotic
hemoglobin gene from the moss Physcomitrella patens. Global J. Biochem. 2012, 3: 12
Global J. Biochem. 2012, 3: 12
8
www.simplex-academic-publishers.com
© 2012 Simplex Academic Publishers. All rights reserved.