full text pdf

Amylase 2017; 1: 1–11
Research Article
Open Access
Vincent Valk, Rachel M. van der Kaaij, Lubbert Dijkhuizen*
The evolutionary origin and possible functional
roles of FNIII domains in two Microbacterium
aurum B8.A granular starch degrading enzymes,
and in other carbohydrate acting enzymes
DOI 10.1515/amylase-2017-0001
Received October 19, 2016; accepted December 17, 2016
Abstract: Fibronectin type III (FNIII) domains were first
identified in the eukaryotic plasma protein fibronectin,
where they act as structural spacers or enable proteinprotein interactions. Recently we characterized two large
and multi-domain amylases in Microbacterium aurum
B8.A that both carry multiple FNIII and carbohydrate
binding modules (CBMs). The role of (multiple) FNIII
domains in such carbohydrate acting enzymes is
currently unclear. Four hypothetical functions are
considered here: a substrate surface disruption domain,
a carbohydrate binding module, as a stable linker, or
enabling protein-protein interactions. We performed a
phylogenetic analysis of all FNIII domains identified in
proteins listed in the CAZy database. These data clearly
show that the FNIII domains in eukaryotic and archaeal
CAZy proteins are of bacterial origin and also provides
examples of interkingdom gene transfer from Bacteria
to Archaea and Eucarya. FNIII domains occur in a
wide variety of CAZy enzymes acting on many different
substrates, suggesting that they have a non-specific role
in these proteins. While CBM domains are mostly found
at protein termini, FNIII domains are commonly located
between other protein domains. FNIII domains in
carbohydrate acting enzymes thus may function mainly
as stable linkers to allow optimal positioning and/or
*Corresponding author: Lubbert Dijkhuizen, Microbial Physiology,
Groningen Biomolecular Sciences and Biotechnology Institute
(GBB), University of Groningen, Nijenborgh 7, 9747 AG Groningen,
The Netherlands; E-mail: [email protected]
Vincent Valk, Rachel M. van der Kaaij, Microbial Physiology, Groningen Biomolecular Sciences and Biotechnology Institute (GBB),
University of Groningen, Nijenborgh 7, 9747 AG Groningen, The
Netherlands
Vincent Valk, Top Institute of Food and Nutrition (TIFN), Niewe Kanaal 9A, 6709 PA Wageningen, The Netherlands
flexibility of the catalytic domain and other domains,
such as CBM.
Keywords: Microbacterium aurum; FNIII domains; protein
evolution; carbohydrate acting enzymes; phylogeny; CBM
Abbreviations: AA, auxiliary activities; CAZy,
Carbohydrate-Active
enZymes
database;
CBM,
carbohydrate binding module; CDD, Conserved Domain
Database; CE, carbohydrate esterases; FNIII, fibronectin
type III; GH, glycoside hydrolases; GT, glycosyl
transferases; PDB, Protein Data Bank; PL, polysaccharide
lyases.
1 Introduction
The Gram-positive bacterium Microbacterium aurum B8.A
was isolated from the sludge of a potato starch-processing
factory on the basis of its ability to use granular starch as
carbon- and energy source for growth [1]. Its extracellular
amylolytic enzymes were able to degrade starch granules,
initially by introducing pores. In biochemical studies we
have characterized MaAmyA and MaAmyB, two large
and multi-domain amylase enzymes. MaAmyA (148 kDa)
carries two carbohydrate binding modules (CBM25),
four fibronectin type III (FNIII) domains and one CBM74
(Fig. 1) [2,3]. Characterization of MaAmyA mutant proteins
with C-terminal deletions of various lengths showed that
the CBM25 domains are essential for the ability of this
enzyme to degrade raw starch. Deletion of the C-terminal
CBM74 in MaAmyA resulted in a three-fold reduction in
granular starch pore size [2,3]. Deletion of CBM25 and
CBM74 negatively affected degradation of raw starch,
but not that of soluble starch [2]. In MaAmyA a total of
three FNIII domains are present between the two CBM25
domains and the CBM74 domain (Fig. 1). It has remained
© 2017 Vincent Valk et al., published by De Gruyter Open.
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License.
Unauthenticated
Download Date | 6/17/17 6:37 AM
2 V. Valk, et al.
unclear what the precise function is of these three FNIII
domains. MaAmyB (135 kDa) is the first characterized
member of a new glycoside hydrolase family 13 subfamily
(GH13_42), which has a strongly conserved general
domain organization featuring one to four FNIII domains
between the two CBM25 domains and the catalytic domain
(with an aberrant C-region) (Fig. 1) [4]. In both enzymes
the precise function of these four FNIII domains has
remained unclear; therefore we decided to study this in
a more detail, focusing on carbohydrate acting enzymes
in general.
The FNIII domain is an evolutionarily conserved
protein domain of approximately 100 amino acids,
with a β-sandwich structure. It is one of the three
types of internal repeats (FNI-FNII-FNIII), originally
identified in the eukaryotic plasma protein fibronectin.
FNIII domains in eukaryotic proteins usually act as
structural spacers, to arrange other domains in space,
as in fibronectin itself. FNIII domains also may have
functional roles in the formation of protein-protein
interactions [5]. FNIII domains are widely found in
extracellular eukaryotic proteins, especially in animals,
but also occur in plant and yeast proteins. Whereas
other fibronectin repeats (FNI and FNII domains)
only occur in proteins of Eucarya, FNIII domains also
are found in Bacteria, Archaea and viruses. The first
FNIII domains in prokaryotes were reported in 1990
[6]. Due to the low number of recognized bacterial
FNIII domains at that time, and phylogenetic analysis
placing them separate from the eukaryotic domains,
it was suggested that prokaryotes had acquired FNIII
domains from eukaryotes [7]. All initially identified
prokaryotic FNIII domains were associated with
carbohydrate acting enzymes, but more recently these
domains have also been identified in fibronectin
binding proteins and multiple other prokaryotic protein
types [8-10]. Our database searches showed that about
33% of all FNIII containing proteins included in SMART
database (15125) [10] are from bacteria. Approximately
19% of all bacterial FNIII domains occur in proteins
that are directly related to carbohydrates, such as
carbohydrate acting enzymes or proteins that contain
CBMs [10]. In this work we focus on FNIII domains that
are part of carbohydrate acting enzymes classified in
the Carbohydrate-Active enZymes (CAZy) database [11].
Microbial carbohydrate acting enzymes are able to
degrade complex carbohydrates, making them available
as a carbon and/or energy source for growth. Many are
listed in the CAZy database, which shows five different
enzyme classes based on their activities and mode of
action: glycoside hydrolases (GH), glycosyl transferases
(GT), polysaccharide lyases (PL), carbohydrate esterases
(CE) and auxiliary activities (AA) [11]. These classes have
been separated into families and some (larger) families
have been further divided into subfamilies, based on
the primary structure of their protein members [11,12].
Approximately 10% of these enzymes include one or more
CBMs [3,13] (this work).
The function of FNIII domains in carbohydrate
acting enzymes is currently unclear. So far no reciprocal
protein-protein interactions have been reported to exist
between bacterial FNIII domains, although an interaction
has been suggested to exist between fibronectin binding
proteins FNIII domains and eukaryotic fibronectin
[9]. Characterization of different carbohydrate acting
enzymes resulted in three suggestions for the role of their
FNIII domains: as stable linker [14], carbohydrate surface
disruption domain [15] and CBM [16].
In this work we traced all FNIII domains present in all
carbohydrate active enzymes listed in the CAZy database
of February 2016, most of which (98%) were from bacterial
origin. We used both bioinformatics approaches as well
as the available literature to analyse the possible roles of
FNIII domains in carbohydrate acting enzymes.
MaAmyA
1,417
MaAmyB
GH13_42
1,278
993
Figure 1. Domain organization of MaAmyA [2], MaAmyB [3] and the general domain organization of GH13_42 members. Numbers refer to
the number of amino acids in the protein. The length of GH13_42 varies for each member, as a reference the length of AmlC from S. lividans
(GenBank: CAB06816.1) is shown. Domains are indicated as follows: white, signal sequence; red, AB-regions of the catalytic domain;
pale yellow, C-region; blue, FNIII domain; green, CBM25 domain; pink, CBM74, refers to a novel CBM domain [3]; orange yellow, aberrant
C-region.
Unauthenticated
Download Date | 6/17/17 6:37 AM
2 Materials and methods
2.1 Extracting FNIII domain sequences from
all CAZy entries
We extracted the (linked) GenBank accession number for
each CAZy entry in the 01-28-2016 release [11]. When no
linked GenBank accession number was found, the linked
Protein Data Bank (PDB) entry was used instead. If neither
was present, the best matching sequence of the non-linked
entries was used. In case no database links were listed, we
attempted to trace the sequence in the databases using the
name and strain information. Obsolete sequences were
updated to their newer version as suggested by GenBank, if
available [17]. The accession numbers were used to extract
the corresponding protein FASTA sequences from GenBank.
All FASTA sequences were separated in files of 25,000
sequences and then submitted to batch CD-search, using
the live search option and Conserved Domain Database
(CDD) database [18]. The concise output was used and all
lines containing “specific” in the “Hit type field” and “FN3”
(FNIII domains are designated as FN3 in CDD) in the “short
name field” were extracted. The lines contained the start
and end amino acids for each FNIII domain. Microsoft
Excel 2010 was used to extract the sequence of each FNIII
domain from the full FASTA file. Since some sequences
contained more than one FNIII domain, sequences of FNIII
domains were named as follows: “accession number of
full sequence”-“start aa number of the FNIII domain” The
output was formatted as a FASTA file.
2.2 Extracting FASTA sequences for all CBMs
listed in CAZy
All enzymes containing one or more CBMs according to the
CAZy annotation in the 01-28-2016 release were extracted
as FASTA sequences, as described above. All FASTA
sequences were separated in files of 2,500 sequences
and submitted to dbCAN [19]. The output contained the
domain organization of the full-length proteins with
the positions of the CBMs up to CBM67 as annotated by
dbCAN. CBM families 68 and higher are currently not
included in dbCAN and are therefore not included in the
analysis, except for CBM74, which we annotated manually
based on our previous work [3].
2.3 Bioinformatics tools
All BLAST searches were performed with NCBI BLASTP
[20] using standard settings. Conserved domains were
detected using both the NCBI conserved domain finder
FNIII domains in carbohydrate acting enzymes
3
[18] (with forced live search, without low-complexity filter,
using the CDD), and dbCAN [19] with standard settings.
Alignments were made with Mega6.0 [21] using its build-in
muscle alignment with standard settings. Alignments
were visualized with Jalview 2.8.1 [22]. Phylogenetic trees
were made with Mega6.0 using maximum likelihood
method with gaps/missing data treatment set on partial
deletion instead of full deletion. Trees were visualized
with Interactive Tree Of Life v2 [23]. Information about
enzyme activity (EC number) was obtained from the CAZy
database [11]. Domain organization shown in the tree is
based on combined data of dbCAN and CDD. Enzyme class
information was obtained from CAZy. Secondary structure
prediction was done using the Phyre2 server with standard
settings [24].
2.4 Alignment and phylogenetic tree
An initial alignment and tree using all 3,219 FNIII domain
sequences extracted from proteins in the CAZy database
was constructed. Due to the high number of sequences
the alignment was not manually tuned but directly used to
produce a phylogenetic tree. This large tree was used to select
a smaller number of sequences including all eukaryotic and
all archaeal FNIII domains and a diverse selection (based
on the initial tree) of bacterial FNIII domains as present
in CAZy proteins. These selected sequences were used
for an alignment, which was manually tuned and used
to produce the phylogenetic tree shown (Fig. 2). For each
selected protein, all FNIII domains present were included
in the tree. A total of 107 proteins with 158 FNIII domains
were selected and used for the alignment and phylogenetic
tree construction.
Two additional phylogenetic trees were constructed
based on a random selection of FNIII domains found in
an equal number of proteins from each of the 4 kingdoms
(Archaea, Bacteria, Eucarya, viruses), obtained from
the SMART database [10]: one tree based on 100 FNIII
domains (Fig. S1), and one based on 1,000 domains (data
not shown).
2.5 Determination of carbohydrate related
FNIII domains
The FASTA files of all bacterial FNIII containing protein
sequences listed in the SMART database were extracted.
All 15,125 bacterial protein sequences were analysed (in
batches of maximal 3,000 sequences) with dbCAN. Results
were screened for carbohydrate related catalytic domain
hits. Proteins were considered carbohydrate related when
such a catalytic domain was identified in the sequence.
Unauthenticated
Download Date | 6/17/17 6:37 AM
4 V. Valk, et al.
Legend:
Shapes:
Complete domain
Incomplete domain
Domain incomplete N-terminus
Domain incomplete C-terminus
Position in tree based on aa sequence of this domain
Domain colors:
GH13 amylase AB region
GH13 amylase C region
GH catalytic domain
AA catalytic domain
CE catalytic domain
GT catalytic domain
FNIII
FNIII-Like2
CBM25
CBM74
Other CBM
Other domain
Legend:
C-terminus
Domain
organization
Domain of which the
sequence was used to
determine the position
in the tree
N-terminus
Outer ring
Center ring
Legend.1-111
Start aa no. of sequence used to
determine the position in the tree
Name (accession no.)
Inner ring
branch line
Inner ring colors (indicating genera/species
of the bacteria containing the protein)
0.1
Amycolatopsis
Bacillus
Cellulomonas
Corallococcus
Halorhabdus
Listeria
Microbacterium aurum B8.A
Paenibacillus
Ruminiclostridium
Streptomyces
Yersinia
Other species (present twice in this tree)
Other species (present once in this tree)
Center ring & branch line colors
(indicating a selection of clusters)
Outer ring colors
(indicating kingdom)
Eukaryotic cluster
Archaea
Archaeal cluster
Bacteria
C. thermocellum cluster
Eukarya
Microbacterium aurum cluster
Unclassified
1st FNIII of MaAmyA cluster
Cluster rich in interkingdom horizontal gene transfer
Figure 2. Phylogenetic tree with all eukaryotic, archaeal and (a random selection of) bacterial FNIII containing proteins in CAZy. MaAmyA,
MaAmyB and Clostridium thermocellum CbhA (GenBank: CAA56918.1) were also included. Phylogeny was based on the FNIII sequences as
defined by CDD. All FNIII domain sequences were obtained as described in the Methods section, except for the FNIII domains from CbhA,
which were extracted manually based on the published information [16]. Domain organization is based on combined dbCAN and CDD data.
Unauthenticated
Download Date | 6/17/17 6:37 AM
FNIII domains in carbohydrate acting enzymes
3 Results and discussion
5
identified were found in proteins that belong to the GH
class (64.2%). This also reflects the fact that nearly half
(48.5%) of all CAZy proteins belong to the GH class.
When we look at the number of FNIII domain containing
proteins in each enzyme class separately, the AA class has
the most FNIII containing proteins (2.3%). Interestingly,
the number of FNIII domains in CBM containing proteins
is even larger: 3.5% of all CBM containing proteins also
have one or more FNIII domains, and 71.7% of all FNIII
containing proteins also contain one or more CBMs (Table
1). This last number is even higher (76.0%) when dbCAN is
used instead of CAZy to annotate the CBM domains. This
clearly points to a link between the presence of FNIII and
CBM domains in proteins.
FNIII domains are conserved protein domains of about
60-120 amino acid residues that have an all β-sheet
secondary structure. In this work, FNIII domains are
defined as domains indicated by CD-search as a “Specific”
hit with the short name “FN3”. All domains matching
these criteria had one of the following defined domain
ID’s: cd00063 [18], pfam00041 [25] or smart00060 [10].
3.1 FNIII domains in carbohydrate acting
enzymes listed in CAZy
In total, we retrieved 575,089 unique FASTA protein
sequences from the CAZy database. These were submitted
to a CD-search, which identified 3,219 FNIII domains,
present in 2,486 CAZy proteins. Less than 0.5% of all
CAZy proteins thus contain one or more FNIII domains,
as defined by the criteria used in this study. Of these 2,486
proteins, 76% contained one FNIII domain, 21% contained
two FNIII domains and 3% contained three or more FNIII
domains, up to a maximum of seven FNIII domains in
proteins CBL14028.1, CBL07763.1 and CBL13779.1. The
3,219 FNIII domains identified in CAZy proteins in this
study is in sharp contrast with the 24 FNIII domains that
thus far explicitly have been reported in characterized
carbohydrate acting enzymes in the literature (see
below). Clearly, little is known about the abundance and
functionality of FNIII domains in CAZy proteins.
FNIII domains were identified in proteins belonging
to all five CAZy enzyme classes (Table 1). This is in contrast
to the specific CBM families, which are usually linked to
one or two different enzyme classes, but so far never to
all five [11]. A relatively large number of the FNIII domains
3.2 Phylogenetic analysis
A phylogenetic tree with all eukaryotic, archaeal and
(a random selection of) bacterial FNIII containing
carbohydrate acting enzymes (with phylogeny based
on the FNIII sequences as annotated by CDD) listed in
CAZy was made (Fig. 2). Two FNIII-like domains from
Clostridium thermocellum were also included (discussed
in the next paragraph). The tree shows six clusters. Three
clusters contain mainly bacterial FNIII domains, another
cluster is of a mixed type with several examples of interkingdom gene transfer (indicated in grey in Figure 2) The
two remaining clusters are clearly distant from the others.
One contains most of the FNIII domains in eukaryotic
CAZy proteins (indicated in green in Figure 2), while the
other contains most of the FNIII domains obtained from
archaeal CAZy proteins (indicated in blue in Figure 2).
There is no clear FNIII clustering evident at the bacterial
genus/species levels. For example, FNIII domains from
Table 1. Distribution of FNIII domains over proteins in the different CAZy enzyme classes.a
Associated
modulesc
CAZy enzyme classesb
AA
CE
GH
GT
PL
CBM
2.0%
5.5%
48.5%
39.4%
1.3%
8.7%
Percentage of proteins per class containing (one or more) FNIII domain(s)
2.3 %
2.3%
0.6%
0.1%
0.7%
3.5%
Number of proteins per class containing (one or more) FNIII domain(s)
264
720
1,596
280
54
1,783
Percentage of all FNIII containing proteins belonging to the class
10.6 %
29.0%
64.2%
11.3%
2.2%
71.7%
Percentages of total CAZy entries per classd
d
d
e
Some proteins in this table belong to multiple CAZy enzyme classes, therefore the percentages shown do not add up to 100%.
GH, Glycoside Hydrolases; GT, Glycosyl Transferases; PL, Polysaccharide Lyases; CE, Carbohydrate Esterases; AA, Auxiliary Activities.
c
The last column indicates the distribution of FNIII domains over CAZy entries with one or more CBM (based on the 01-28-2016 release of
CAZy); as defined by CAZy (http://www.cazy.org/; [11]).
d
Total number of proteins in each class: AA:11731, CE:31583, GH:279001, GT:226733, PL: 7245, CBM:50301.
e
Total number of FNIII domain containing proteins: 2486.
a
b
Unauthenticated
Download Date | 6/17/17 6:37 AM
6 V. Valk, et al.
similar enzymes obtained from different Streptomyces
species are scattered throughout the tree. This again
suggests that these FNIII domains have been obtained
through horizontal gene transfer. The tree suggests that
the FNIII domains in CAZy proteins originated from
bacteria, since the eukaryotic cluster (indicated in green in
Figure 2) and archaeal cluster (indicated in blue in Figure
2) are further from the origin of the tree. This contrasts to
literature, where it has been stated that prokaryotic FNIII
domains originate from Eucarya [7,26]. To analyse whether
the subset of FNIII in CAZy enzymes is representative
for all FNIII domains, two additional phylogenetic trees
were produced including 100 (Fig. S1) or 1,000 (data not
shown) FNIII domains randomly picked from SMART [10],
but containing an equal number of domains from each
of the four kingdoms. Neither of these two additional
phyloogenetic trees produced showed clear clustering of
FNIII domains according to kingdoms, or a clear origin of
the tree (Fig. S1). Clearly, these trees also do not support
the hypothesis that bacterial FNIII domains were originally
obtained from eukaryotes. This conclusion was reached
in 1992 based on phylogenetic analysis of the 13 bacterial
FNIII domains known at that time, all from carbohydrate
acting enzymes, when compared to 26 eukaryotic FNIII
domains [7]. This conclusion was restated in 2002, when
the first bacterial FNIII domain tertiary structure was
solved, and based on the phylogenetic analysis of 20 of
the 135 bacterial FNIII domains, which were at that time
(July 2001) listed in SMART [10]. This appears to be the last
time that the subject was studied using a bioinformatics
analysis, but subsequently it has been referred to in many
other FNIII domain studies and reviews (for example [2731]). The phylogenetic trees presented in this study (Fig.
2, Fig. S1) do not support the conclusion that Eucarya are
at the root for FNIII domains. Since about 33% of all FNIII
domains currently listed in SMART is from prokaryotes,
also the statement that there are many more eukaryotic
FNIII domains than prokaryotic ones has become
outdated [7,26]. We conclude that on basis of all available
information there is no evidence that bacterial FNIII
domains were originally obtained from eukaryotes.
3.3 FNIII domains as cellulose surface
modifiers
Two FNIII-like domains from Clostridium thermocellum
cellobiohydrolase CbhA (GenBank CAA56918.1) were
shown to act as cellulose surface modifiers [15], making the
smooth and even surface of the cellulose substrate rough
and uneven. These two domains were not among the 3,219
FNIII domains identified in this study, because they were
not identified as such by CD-search [19]. Our additional
Pfam analysis [25] also did not identify them as FNIII
domains. Therefore, these CbhA domains were extracted
manually from the protein sequence, using the published
information [15]. Their sequences were used for alignment
together with all other bacterial FNIII sequences obtained
from characterized carbohydrate acting proteins (Fig. 3).
The two FNIII-like domains extracted from CbhA
only show 7-14% identity and 15-30% similarity to the
24 characterized FNIII domains from the bacterial
carbohydrate acting enzymes included in the alignment
(Fig. 3), while identity among the 24 true FNIII domains is
17-80% (30-84% similarity). In a phylogenetic analysis, the
two CbhA domains do not cluster with other FNIII domains
(Fig. 2). Four residues that are known to be conserved in
FNIII domains [32] are also fully conserved in the FNIII
domains included in the alignment (Fig. 3; highlighted
in blue, pink, yellow and green), while only the leucine
residue is conserved in the two domains from ChbA. We
conclude that these two domains of ChbA, in fact, are not
FNIII domains at all according to the current definitions.
This also is reflected in their listing in CDD, where they
are currently indicated as “FNIII-like2 domains”. Since
the suggestion that FNIII domains function as surface
disruption domains was based only on the experimental
results obtained with these two domains in ChbA, this
hypothesis is now no longer supported by experimental
data.
3.4 Location of FNIII domain in the protein
Since domains acting as stable linkers between other
protein domains are not expected to be found at the
terminus of a protein, the position of FNIII domains
within the 2,486 FNIII containing enzymes listed in CAZy
was studied. For comparison, the position of CBMs in
CAZy proteins (based on annotation of CBM domains with
dbCAN) was included. We defined domains as terminal
domains when there are less than 100 amino acid residues
between the predicted domain sequence and the protein
N- or C-terminus. Also, it was not allowed for the remaining
amino acids to contain a protein domain as identified by
either CD-search or dbCAN.
In total, there are only 332 proteins that have a FNIII
domain either at the N- or at the C-terminus including five
proteins with both an N- and C-terminal FNIII domain.
This is 14% of all FNIII containing proteins. In comparison,
82% of all CBM containing proteins have a CBM located
at the protein terminus (Fig. 4). Only in a small number
of CBM families (13 out of 66 families), the majority of
the CBM members is not located at one of the protein
Unauthenticated
Download Date | 6/17/17 6:37 AM
FNIII domains in carbohydrate acting enzymes
10
ACR15165.1-927
ACR15165.1-1166
AAA19800.1-931
AAA19800.1-1169
ABY95795.1-928
ABY95795.1-1158
EFB38194.1-652
ACY75514.1-359
ADR71224.1-491
ADR71224.1-570
AAA81528.1-465
AAA81528.1-560
AAU21943.2-460
BAA76623.1-338
AEN08307.1-298
AEN08307.1-391
MaAmyA-530
MaAmyA-833
MaAmyA-925
MaAmyA-1017
MaAMyB-264
MaAMyB-356
MaAMyB-448
MaAMyB-540
First FNIII cellulase
Second FNIII cellulase
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
Y
-
L
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - DE Q
- - -
Y
-
T
-
- - - - - - - - - - - - - - - - - - - - - - - - DS
V I
E
V
E
T
K
P Q A P S N V V V T - - S G NG K V D L
- - - - - - - - - - - -T E S S RVS L
P Q V P S N V V A T - - S G NG K V D L
- - - - - - - - - - - -VE S S RVS L
P Q P I T D L K A V - - S G NG Q V D L
P P T A L G L QQ PG - I E S S R V T L
P AF P TG L S AV L DS S G NT ANL
PS APTGL RVTG - T T T S S I S L
- - - - - - - - - - - - - - - - - - -V
PS APTGL RVTG - T T T S T VAL
P S V PG NAR S TG - V T ANS V T L
P T AP T NL AS T A -QT T S S I T L
- S AP V NV R V TG - K T AT S V S L
P NAP T AL T V AE - L G AT S L K L
P T AP SG L AF T E - P ATGQV K L
P T A P S G L A F T E - P A AG Q I K L
P T V P S N V K A T - - A NG T T I T V
PAVPSGL KAT - - VSGT S VAL
PAVPSGL KAT - - VSGT S VAL
PAVPSGL KAT - - VSGT S VAL
PAVPSGL KAT - - VSGT S VAL
PAVPSGL KAT - - VSGT S VAL
PAVPSGL KAT - - VSGT S VAL
P A V P T G V K T V - - V S GQQ V S V
DK V T I DS P V AG E R F E AG K D I
P T V K L T AP K S NV V AYG NE F L
40
EGG LY E K I AS
ACR15165.1-927 39
ACR15165.1-1166
DACR15165.1-1166
YE I
30
DG G T F N K I A T
T GAAA19800.1-931
YAAA19800.1-931
N I S S V39
EGG LY E K I AS
AAA19800.1-1169
NAAA19800.1-1169
Y E I - Y R - - S30
DG G T F N K I A T
V SABY95795.1-928
YABY95795.1-928
N I - Y R S T V39
KGGL Y E K I AS
ABY95795.1-1158
GABY95795.1-1158
Y E I - Y K S L S44
E TG PF VK I AT
N SEFB38194.1-652
YEFB38194.1-652
N V - K R S T K39
SGGP Y T T I AT
YACY75514.1-359
R V - F R - - -39
GACY75514.1-359
- DG - - T Q V A E
YADR71224.1-491
T V - L V - - -21
- DG - - A A V G T
T GADR71224.1-491
- DG - - T Q V A E
T GADR71224.1-570
YADR71224.1-570
R V - F R - - -39
T GAAA81528.1-465
YAAA81528.1-465
NV - - -38
- -G - - ANL AT
T GAAA81528.1-560
YAAA81528.1-560
DV - - -38
- -G - - T AL AT
EAAU21943.2-460
YAAU21943.2-460
VVS
- - - 38
- - - - - NR S I S
A SBAA76623.1-338
YBAA76623.1-338
T V - - -39
- NG - - N P I G Q
YAEN08307.1-298
D I
A - - -39
- G G - - T L L AG
T GAEN08307.1-298
YAEN08307.1-391
DV
A - - -39
- DN - - V L R K S
T GAEN08307.1-391
MaAmyA-530
MaAmyA-530
T G G43
TGG - - S V T L T
SG YQ
I MaAmyA-833
T K Y RMaAmyA-833
VA
- - -37
DAG V V K Y V T V
MaAmyA-925
T K Y RMaAmyA-925
VA
- - -37
DAG V V K Y V T V
MaAmyA-1017
T K YMaAmyA-1017
RVA
- - -37
DAG V V K Y V T V
T K Y MaAMyB-264
RMaAMyB-264
VA
- - -37
DAG V V K Y V T V
T K Y MaAMyB-356
RMaAMyB-356
VA
- - -37
DAG V V K Y V T V
T K Y MaAMyB-448
RMaAMyB-448
VA
- - -37
DAG V V K Y V T V
T S Y MaAMyB-540
TMaAMyB-540
I S
- - T38
G SGAAT T V AS
First
K V FNIII
EFNIII
F -cellulase
Y N - - -51
-GD- - T L I S S
SFirst
cellulase
Second
I S R FNIII
V D -cellulase
- - V47
DG - - - E V I G S
20
50
NV T E V S NE V
NV TG V S NE V
NV TQ V ADT V
N I T S VAA - T S T - I S A - S VTG T VTG VKE - T AG - V AG DV
V AG DV
S T S - T SG - T SG - T SG - T SG - T SG - T SG - T T - - DT T AP
DR E AP
60
T T F E D A N V T NG L K Y V Y
Y N Y I D T S V I NG V T Y N Y
T T F E D T N V T NG L K Y V Y
Y N Y V D T S V I NG T T Y S Y
I T Y I D T D V T NG L K Y V Y
Y N Y V D T D V V NG K V Y Y Y
T NY T DTG V ATG T K Y Y Y
T S F T DT G L T AG T A H V Y
T S F T V T G L E AG T S Y A V
T S F T DT G L A AG T A H V Y
T T A T I S G L T AG T S Y T F
T T AT I S G L AADT S Y T F
T S AE I GG L K PG T AY S F
L S L T DS G L T P ATQ Y S Y
T T Y T DT R R ADE T - V T Y
T T Y T DT Q - P AG T T V S Y
TQ L I DS G L AE NT T Y S Y
T T L T V S D L A AG T A Y WF
T T L T V S G L S AQ T A Y WF
T T L T V S G L S AQ T A Y WF
T T L T V S D L A AG T A Y WF
T T L T V S G L S AQ T A Y WF
T T L T V S G L S AQ T A Y WF
AAK V I TG L AAS T T Y NF
Y T AK I TG AAVG AY NL K
Y E Y E WK A V E G N H E I S V
7
30
S WS Q S D - - - - G A T G Y N I - Y
T WN P S T D N V - G I Y D Y E I - Y
S WS Q S D - - - - G A T G Y N I - Y
T WS P S T D N V - G I Y N Y E I - Y
S WS A V D - - - - R A V S Y N I - Y
NWS L S T D N V - A I Y G Y E I - Y
T WN A A P - - - - G A N S Y N V - K
AWN A S T D D V - G V AG Y R V - F
S WT A S T D D R - G V T G Y T V - L
AWN A S T D D T - G V T G Y R V - F
AWN A S T D N V - G V T G Y N V - Y
S WT A S T D N V - G V T G Y D V - Y
AWD A P S S G A - N I A E Y V V S F
S WA A A T G A L - P I A S Y T V - Y
T WK A S S D D T - G V T G Y D I - Y
TW
WQ A S S D D K - G V T G Y D V - Y
T WT A S T D E C S G V S G Y Q I - T
S WS A V S - - - - G A T K Y R V A Y
S WS A V S - - - - G A T K Y R V A Y
S WS A V S - - - - G A T K Y R V A Y
S WS A V S - - - - G A T K Y R V A Y
S WS A V S - - - - G A T K Y R V A Y
S WS A V S - - - - G A T K Y R V A Y
S WT A V S - - - - G A T S Y T I S Y
N I RT VKS KT -PVS KVE F -Y
K I T A T A S D S DG K I S R V D - F
70
R S S V E G G L 38
R - - S D G G T 29
R S S V E G G L 38
R - - S D G G T 29
Y
R S T V K G G L 38
K S L S E T G P 43
R S T K S G G P 38
Y
R - - - - D G - 38
V - - - - D G - 20
R - - - - D G - 38
N - - - - - G - 37
N - - - - - G - 37
E - - - - - - - 37
R - - - - N G - 38
A - - - - G G - 38
A - - - - D N - 38
R T G G T G G - 42
R - - - D A G 36
R - - - D A G 36
R - - - D A G 36
R - - - D A G 36
R - - - D A G 36
R - - - D A G 36
S - - T G S G A 37
A
N - - - - G D 50
46
L - - V DG
80
A I S A I DE L - - K V V AV DL S F NR
A I S AV DE L G NE
KVV - - - - - - - S V T A V D S DG N E
KVVA - - - - - - VVS AVS N - - - AV R AV DAA - - T V R A R DA AG NV
A V R A V DA AG NL
T I K A K DA AG NL
T V K A K DA AG NV
T V S A K D A DG K L
F V T A T DS QG NT
H V R A R D A A G NQ
V V R A K DA AG N I
T V K A K DG A G - K V R A E NS AG Y S
K V R A E NS AG Y S
K V R A E NS AG T S
K V R A E NS AG Y S
K V R A E NS AG Y S
K V R A E NS AG T S
K V R A E NS AG NS
A V A V L S DG R R I
I AY DDDDAAS T
- - - - - - - - - T E S NV V T I - S E MS I DT V A - - - - - - - - - S AL S NE V E A - - - - - - - - - - - - - - - - - - - - - - - - - - - S AAS APL T VT
S AT SGT VTGE
S AAS NAV T V S
S AAS NAV S V K
H AG P T V E V T S L PS S AL AVK
S ADS NT V - - S AP S NT V - - - - - - - - - - - A Y S AQ V DV T A Y S AQ V DV T AY S TQ V DV T A Y S AQ V DV T A Y S AQ V DV T AY S TQ V DV T AY - - - - - - - E S PVT PVL VK
P DS V K I F V KQ
- - - - - - - - - - - - - - - - - - - - - - - - - AR
- - - - - - - - - - - - - - - - - - - - - - - - - DV
K
74
77
86
61
86
76
75
71
66
84
84
84
81
84
82
82
76
83
83
83
83
83
83
76
98
98
Figure 3. Alignment of all 24 bacterial FNIII domains explicitly reported in literature [2,4,6,15,16,34,36,38-43], in carbohydrate acting
enzymes. Sequences were extracted as described in the Methods section, except for the two “FNIII-like domains” from C. thermocellum
CbhA (GenBank: CAA56918.1), which were not recognized as such by the domain annotation tools, and were therefore extracted manually
[15]. Predicted (Phyre2 [24]) conserved β-sheets are indicated in red. A conserved tryptophan in the 2nd β-sheet is indicated in blue, a conserved tyrosine in the 3rd β-sheet is indicated in pink, a conserved leucine is indicated in yellow, while a conserved aromatic residue in the
6th β-sheet is indicated in green (only the leucine residue is conserved in the two FNIII-like domains of CbhA). Other conserved residues are
indicated in grey. Amino acid numbering is based on MaAmyA-833.
Unauthenticated
Download Date | 6/17/17 6:37 AM
8 V. Valk, et al.
3.5 Enzyme activities associated with FNIII
domains
termini (Fig. 4). Interestingly, terminal FNIII domains are
found at a similar ratio at either terminus of the protein.
In contrast, most of the individual CBM families seem to
have a preference for either the N- or C-terminus (Fig. 4).
Taken together, these results show that FNIII domains
are mainly found in between other protein domains. This
supports the hypothesis that FNIII domains may function
as stable and/or flexible linkers.
Of all enzymes listed in CAZy, 2,486 proteins contain one or
more FNIII domains (Table 1). Of these 2,486 enzymes, 124
have been characterized and have at least one EC number
in the CAZy database. In Figure 5 an overview is shown of
the number of different EC numbers associated with these
80
70
60
50
Both
40
C-terminal
N-terminal
30
20
CBM15
CBM38
CBM37
CBM28
CBM40
CBM46
CBM45
CBM53
CBM17
CBM30
CBM54
CBM44
CBM58
0
FNIII
10
CBM
% proteins with domain at terminus
90
Figure 4. Percentage of FNIII or CBM containing proteins (excluding CBM68-CBM73) with at least one domain located at the terminal protein
end (defined as < 100 amino acid residues from the protein terminus, not containing any other defined domains). CBM: percentage of all
CBM containing proteins with a terminal CBM. For specific CBM families, the percentage of proteins with less than 50% terminal CBMs is
also shown.
20
15
Lyases
Hydrolases
10
Transferases
Oxidoreductases
5
0
FNIII
CBM1
CBM13
CBM32
CBM35
CBM2
CBM6
CBM20
CBM48
CBM3
CBM5
CBM16
CBM22
CBM4
CBM25
CBM10
CBM34
CBM50
CBM72
Number of unique EC numbers
25
Figure 5. Number of unique EC numbers associated with characterized FNIII and CBM containing proteins. Only CBM containing proteins
associated with at least five unique EC numbers are shown. The major EC activity groups are indicated: Oxidoreductases (EC 1.x), Transferases (EC 2.x), Hydrolases (EC 3.x), Lyases (EC 4.x). None of the proteins was associated with the Isomerase (EC 5.x) or Ligase (EC 6.x)
classes.
Unauthenticated
Download Date | 6/17/17 6:37 AM
enzymes. The enzyme classes, to which the EC numbers
belong, are also indicated. The same analysis was also
performed for the 50,301 CBM containing enzymes (Table
1), of which 2,273 have one or more EC numbers listed in
CAZy. CBM families associated with five or more different
EC numbers are included in Figure 5.
Enzymes that contain FNIII domains are associated
with 24 unique EC numbers (Table S1). In addition,
they are associated with all four main enzyme activities
listed in CAZy, more than for any of the CBMs (Fig. 5).
CBM1, CBM13 and CBM32 containing enzymes are
associated with 20 different EC numbers, the highest
number of unique EC numbers associated with a single
CBM family (Fig. 5). For individual CBM families it is
to be expected that they do not associate with a large
number of activities, since their main function is to bind
to specific substrates that also the associated catalytic
domain can interact with. For example, a CBM type
able to specifically bind cellulose would be of little use
in a starch degrading enzyme. This is, indeed, largely
reflected in the results on CBM association with EC
numbers (Fig. 5). Some CBMs are associated with a large
number of EC numbers, but in those cases the enzymes
mainly have activities related to the substrate that is
bound by the CBM or related substrates. We consider
a substrate related when in a natural environment it is
common for the carbohydrate that is bound by the CBM
to be in close proximity to the substrate of the catalytic
domain. For example, some xylanases have a cellulose
binding domain, but since both xylan and cellulose are
known to be in close proximity to each other in plant
cell walls [33] this is considered as a related CBM. FNIII
domains, on the other hand, are associated with many
different enzymatic activities, ranging from chitinacting (chitinase) and cellulose-acting (cellulose) to
starch-acting (α-amylase and pullulanase). Such a broad
distribution does not point to a role for FNIII as a binding
domain that supports interaction of the resident catalytic
domain with a specific substrate. The appearance of
FNIII domains in many different enzyme types instead
suggests a less specific role, e.g., as a stable linker.
3.6 FNIII domains function mainly as stable
linkers
Since 2002, three functions have been suggested for FNIII
domains in carbohydrate active enzymes [14-16]. This
paper shows that two previously characterized domains
of C. thermocellum cellobiohydrolase CbhA, in fact,
are not FNIII domains at all (Figs. 2, 3). Their proposed
function in substrate surface disruption thus no longer
FNIII domains in carbohydrate acting enzymes
9
holds for true FNIII domains. When FNIII and CBM
domains are compared for their position in the protein
and association with specific enzyme activities, a clear
difference is observed. While CBMs generally display a
certain substrate binding specificity and usually are part
of enzymes with specific catalytic activities for the same
substrate, this is not the case for FNIII domains. The
suggested role for FNIII domains in substrate binding
thus appears unlikely. It is well known that CBMs use
aromatic residues to interact and bind to carbohydrates.
However, the three-dimensional structure of the FNIII
domain from chitinase A1 showed that the domain did
not have any accessible aromatic residues [26], which
makes it less likely that it can function as a CBM. As
shown in Table 1, 71.7% of the FNIII containing proteins
also contain a CBM (as annotated by CAZy), which
increases up to 76.0% when dbCAN is used to annotate
the CBM domains. Finally, FNIII domains are mainly
located between other domains, unlike CBMs, which are
often present at the terminal end of proteins. On basis
of these considerations we conclude that the available
evidence suggests that FNIII domains function as stable
linkers in multi-domain proteins.
In truncation studies, the full-length enzyme is
usually compared with constructs, in which either the
CBM or both the CBM and FNIII domain(s) were removed.
In these studies, removal of only the CBM usually affects
activity on insoluble substrates [15,16,34-40], while the
additional removal of the FNIII domains showed no
further effects. In a study of the chitinase A1 from Bacillus
circulans WL-12a, the single CBM present was re-attached
after removal of one or two intermediate FNIII domains,
allowing a direct test for the FNIII function [14]. Removal
of one FNIII domain showed a ~20% activity decrease
on insoluble colloidal chitin, and removal of both FNIII
domains resulted in a ~50% activity decrease. Removal of
only the CBM also caused a ~50% activity decrease. When
the CBM was not re-attached, the removal of one or both
FNIII domains did not further affect activity. In all cases
the activity on a soluble substrate remained unaffected
[14]. These experimental data thus support the suggested
function of FNIII domains as stable protein linkers, acting
in the optimal placement of different enzyme domains
towards each other, and towards the substrate.
In conclusion, the multiple FNIII domains in the M.
aurum B8.A MaAmyA and MaAmyB proteins thus may act
as a spacer between the catalytic domain, the two CBM25
domains, and the CBM74 domain (in case of MaAmyA)
(Fig. 1), to provide optimal position and/or flexibility of
these domains with respect to each other and towards the
substrate.
Unauthenticated
Download Date | 6/17/17 6:37 AM
10 V. Valk, et al.
Acknowledgments: This study was partly funded by the
Top Institute of Food & Nutrition (project B1003) and by
the University of Groningen.
Conflict of interest: The authors declare no conflict of
interests.
References
[1] Sarian F., van der Kaaij R., Kralj S., Wijbenga D.J., Binnema D.,
van der Maarel M., et al., Enzymatic degradation of granular
potato starch by Microbacterium aurum strain B8.A, Appl.
Microbiol. Biotechnol., 2012, 93, 645–654.
[2] Valk V., Eeuwema W., Sarian F.D., van der Kaaij R.M.,
Dijkhuizen L., Degradation of granular starch by the bacterium
Microbacterium aurum strain B8.A involves a modular
α-amylase enzyme system with FNIII and CBM25 domains,
Appl. Environ. Microbiol., 2015, 81, 6610–6620.
[3] Valk V., Lammerts van Bueren A., van der Kaaij R.M., Dijkhuizen
L., Carbohydrate binding module 74 is a novel starch binding
domain associated with large and multi-domain α-amylase
enzymes, FEBS J., 2016, 283, 2354–2368.
[4] Valk V., van der Kaaij R.M., Dijkhuizen L., Characterization
of the starch-acting MaAmyB enzyme from Microbacterium
aurum B8.A representing the novel subfamily GH13_42 with an
unusual, multi-domain organization, Sci. Rep., 2016, 6, 36100.
[5] Campbell I.D., Spitzfaden C., Building proteins with fibronectin
type III modules, Structure, 1994, 2, 333–337.
[6] Watanabe T., Suzuki K., Oyanagi W., Ohnishi K., Tanaka H.,
Gene cloning of chitinase A1 from Bacillus circulans WL-12
revealed its evolutionary relationship to Serratia chitinase and
to the type III homology units of fibronectin, J. Biol. Chem.,
1990, 265, 15659–15665.
[7] Bork P., Doolittle R.F., Proposed acquisition of an animal
protein domain by bacteria, Proc. Natl. Acad. Sci. USA, 1992,
89, 8990–8994.
[8] Henderson B., Nair S., Pallas J., Williams M.A., Fibronectin: a
multidomain host adhesin targeted by bacterial fibronectinbinding proteins, FEMS Microbiol. Rev., 2011, 35, 147–200.
[9] Konkel M.E., Larson C.L., Flanagan R.C., Campylobacter jejuni
FlpA binds fibronectin and is required for maximal host cell
adherence, J. Bacteriol., 2010, 192, 68–76.
[10] Letunic I., Doerks T., Bork P., SMART: recent updates, new
developments and status in 2015, Nucleic Acids Res., 2014,
10–13.
[11] Lombard V., Golaconda Ramulu H., Drula E., Coutinho P.M.,
Henrissat B., The carbohydrate-active enzymes database
(CAZy) in 2013, Nucleic Acids Res., 2014, 42, D490–D495.
[12] Stam M.R., Danchin E.G.J., Rancurel C., Coutinho P.M.,
Henrissat B., Dividing the large glycoside hydrolase family 13
into subfamilies: towards improved functional annotations
of α-amylase-related proteins, Protein Eng. Des. Sel., 2006, 19,
555–562
[13] Machovič M., Janeček Š., The evolution of putative starchbinding domains, FEBS Lett., 2006, 580, 6349–6356.
[14] Watanabe T., Ito Y., Yamada T., Hashimoto M., Sekine S.,
Tanaka H., The roles of the C-terminal domain and type III
domains of chitinase A1 from Bacillus circulans WL-12 in chitin
degradation, J. Bacteriol., 1994, 176, 4465–4472.
[15] Kataeva I.A., Seidel R.D., Shah A., West L.T., Li X.L., Ljungdahl
L.G., The fibronectin type 3-like repeat from the Clostridium
thermocellum cellobiohydrolase CbhA promotes hydrolysis of
cellulose by modifying its surface, Appl. Environ. Microbiol.,
2002, 68, 4292–4300.
[16] Lin F.P., Ma H.Y., Lin H.J., Liu S.M., Tzou W.S., Biochemical
characterization of two truncated forms of amylopullulanase
from Thermoanaerobacterium saccharolyticum NTOU1 to
identify its enzymatically active region, Appl. Biochem.
Biotechnol., 2011, 165, 1047–1056.
[17] Benson D.A., Karsch-Mizrachi I., Lipman D.J., Ostell J., Sayers
E.W., GenBank, Nucleic Acids Res., 2009, 37, D26–D31.
[18] Marchler-Bauer A., Lu S., Anderson J.B., Chitsaz F., Derbyshire
M.K., DeWeese-Scott C., et al., CDD: a Conserved Domain
Database for the functional annotation of proteins, Nucleic
Acids Res., 2011, 39, D225–D229.
[19] Yin Y., Mao X., Yang J., Chen X., Mao F., Xu Y., dbCAN: a
web resource for automated carbohydrate-active enzyme
annotation, Nucleic Acids Res., 2012, 40, W445–451.
[20] Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J., Basic
local alignment search tool, J. Mol. Biol., 1990, 215, 403–410.
[21] Tamura K., Stecher G., Peterson D., Filipski A., Kumar S.,
MEGA6: Molecular Evolutionary Genetics Analysis version 6.0,
Mol. Biol. Evol., 2013, 30, 2725–2729.
[22] Waterhouse A.M., Procter J.B., Martin D.M., Clamp M., Barton
G.J., Jalview version 2 - a multiple sequence alignment editor
and analysis workbench, Bioinformatics, 2009, 25, 1189–1191.
[23] Letunic I., Bork P., Interactive Tree Of Life v2: online annotation
and display of phylogenetic trees made easy, Nucleic Acids
Res., 2011, 39, W475–W478.
[24] Kelley L., Mezulis S., Yates C.M., Wass M.N., Sternberg M.J.E.,
The Phyre2 web portal for protein modeling, prediction and
analysis, Nat. Protoc., 2015, 10, 845–858.
[25] Finn R.D., Bateman A., Clements J., Coggill P., Eberhardt R.Y.,
Eddy S.R., et al., Pfam: the protein families database, Nucleic
Acids Res., 2014, 42, D222–D230.
[26] Jee J.G., Ikegami T., Hashimoto M., Kawabata T., Ikeguchi M.,
Watanabe T., et al., Solution structure of the fibronectin type
III domain from Bacillus circulans WL-12 chitinase A1, J. Biol.
Chem., 2002, 277, 1388–1397.
[27] Giglio K.M., Fong J.C., Yildiz F.H., Sondermann H., Structural
basis for biofilm formation via the Vibrio cholerae matrix
protein RbmA, J. Bacteriol., 2013, 195, 3277–3286.
[28] Grant J.R., Katz L.A., Phylogenomic study indicates widespread
lateral gene transfer in Entamoeba and suggests a past
intimate relationship with parabasalids, Genome Biol. Evol.,
2014, 6, 2350–2360.
[29] Han Y., Agarwal V., Dodd D., Kim J., Bae B., Mackie R.I., et al.,
Biochemical and structural insights into xylan utilization by the
thermophilic bacterium Caldanaerobius polysaccharolyticus, J.
Biol. Chem., 2012, 287, 34946–34960.
[30] Ma W.T., Lin J.H., Chen H.J., Chen S.Y., Shaw G.C., Identification
and characterization of a novel class of extracellular poly(3hydroxybutyrate) depolymerase from Bacillus sp. strain NRRL
B-14911, Appl. Environ. Microbiol., 2011, 77, 7924–7932.
[31] Rappoport N., Linial M., Viral proteins acquired from a host
converge to simplified domain architectures, PLoS Comput.
Biol., 2012, 8, e1002364.
Unauthenticated
Download Date | 6/17/17 6:37 AM
[32] Hoxha E., Campion S.R., Structure-critical distribution of
aromatic residues in the fibronectin type III protein family,
Protein J., 2014, 33, 165–173.
[33] Varner J.E., Louis S., Lin L.S., Louis S., Lin L.S., Plant cell wall
architecture, Cell, 1989, 56, 231–239.
[34] Chuang H.H., Lin H.Y., Lin F.P., Biochemical characteristics
of C-terminal region of recombinant chitinase from Bacillus
licheniformis implication of necessity for enzyme properties,
FEBS J., 2008, 275, 2240–2254.
[35] Kahar U.M., Chan K.G., Salleh M.M., Hii S.M., Goh K.M., A
high molecular-mass Anoxybacillus sp. SK3-4 amylopullulanase: characterization and its relationship in carbohydrate
utilization, Int. J. Mol. Sci., 2013, 14, 11302–11318.
[36] Kim D.Y., Han M.K., Park D.S., Lee J.S., Oh H.W., Shin D.H., et
al., Novel GH10 xylanase, with a fibronectin type 3 domain,
from Cellulosimicrobium sp. strain HY-13, a bacterium in the
gut of Eisenia fetida, Appl. Environ. Microbiol., 2009, 75,
7275–7279.
[37] Lin H.Y., Chuang H.H., Lin F.P., Biochemical characterization
of engineered amylopullulanase from Thermoanaerobacter
ethanolicus 39E-implicating the non-necessity of its 100
C-terminal amino acid residues, Extremophiles, 2008, 12,
641–650.
[38] Lin F.P., Ho Y.H., Lin H.Y., Lin H.J., Effect of C-terminal truncation
on enzyme properties of recombinant amylopullulanase from
Thermoanaerobacter pseudoethanolicus, Extremophiles, 2012,
16, 395–403.
FNIII domains in carbohydrate acting enzymes
11
[39] Suzuki K., Taiyoji M., Sugawara N., Nikaidou N., Henrissat
B., Watanabe T., The third chitinase gene (chiC) of Serratia
marcescens 2170 and the relationship of its product to other
bacterial chitinases, Biochem. J., 1999, 343, 587–596.
[40] Takasuka T.E., Acheson J.F., Bianchetti C.M., Prom B.M.,
Bergeman L.F., Book A.J., et al., Biochemical properties and
atomic resolution structure of a proteolytically processed
β-mannanase from cellulolytic Streptomyces sp. SirexAA-E,
PLoS One, 2014, 9, e94166.
[41] Alahuhta M., Xu Q., Brunecky R., Adney W.S., Ding S.Y., Himmel
M.E., et al., Structure of a fibronectin type III-like module from
Clostridium thermocellum, Acta Crystallogr. Sect. F, Struct. Biol.
Cryst. Commun., 2010, 66, 878–880.
[42] Kim D.Y., Ham S.J., Kim H.J., Kim J., Lee M.H., Cho H.Y., et al.,
Novel modular endo-β-1,4-xylanase with transglycosylation
activity from Cellulosimicrobium sp. strain HY-13 that is
homologous to inverting GH family 6 enzymes, Bioresour.
Technol., 2012, 107, 25–32.
[43] Mathupala S.P., Lowe S.E., Podkovyrov S.M., Zeikus JG.,
Sequencing of the amylopullulanase (apu) gene of Thermoanaerobacter ethanolicus 39E, and identification of the active
site by site-directed mutagenesis, J. Biol. Chem., 1993, 268,
16332–16344.
Supplemental Material: The online version of this article
(DOI: 10.1515/amylase-2017-0001) offers supplementary material.
Unauthenticated
Download Date | 6/17/17 6:37 AM