21st Century Directions in Biology Molecular Tools for Discovering the Secrets of Diatoms ANASTASIA SAADE AND CHRIS BOWLER Diatoms are photosynthetic unicellular eukaryotes found in most aquatic environments. They are major players in global biogeochemical cycles, and generate as much oxygen through photosynthesis as terrestrial rainforests do. Insights into their evolutionary origins have been revealed by the whole-genome sequencing of Thalassiosira pseudonana and Phaeodactylum tricornutum. We now know that diatoms contain unusual assortments of genes derived from different sources, including those acquired by horizontal gene transfer from bacteria. These genes confer novel metabolic and signaling capacities that may underlie the extraordinary ecological success of diatoms on Earth today. The availability of a suite of techniques that can be used to monitor and manipulate diatom genes is enhancing our knowledge of their novel characteristics. We highlight these recent developments and illustrate how they are being used to understand different aspects of diatom biology. We also discuss the use of diatoms in commercial applications, such as for nanotechnology and biofuel production. Keywords: genomics, microarrays, nanotechnology, transgenic technology, biofuel I n the upper zone of the ocean and other water bodies, down to depths where light can penetrate, one can typically find an abundant group of eukaryotic algae known as diatoms. These microscopic, unicellular organisms are characterized by ornate, lacework-like, silicified shells and are distributed all around the world. Diatoms are photosynthetic organisms that can convert the energy from sunlight into chemical energy in the form of ATP (adenosine triphosphate). This chemical reaction confers on diatoms the ability to produce their own nutrients (sugars), thus they have an autonomous metabolism and are called photoautotrophs. Diatoms absorb and fix large amounts of atmospheric carbon dioxide (CO2) while capturing light and water to generate a major fraction of the oxygen generated on Earth by photosynthesis. They are in fact believed to contribute between 20% and 25% of global primary production, equivalent to all terrestrial rainforests combined (Falkowski et al. 1998, Field et al. 1998, Smetacek 1999), and consequently play an essential role in the well-being of our global ecosystem. The scientist and artist Ernst Haeckel was one of the first to observe and describe diatoms in the late 19th century (Breidbach 2005). German biologist Robert Lauterborn subsequently made exquisite microscopic descriptions of subcellular events occurring during diatom cell division. A century later, Jeremy Pickett-Heaps translated Lauterborn’s observations from the original German and verified his discoveries using light and electron microscopy (Pickett-Heaps et al. 1984, De Martino et al. 2009). Others began describing diatoms’ habitats: Allen did research in the early 20th century (Allen WE 1926), and experimental culturing became more reliable with the finding that, in addition to light and macronutrients, certain micronutrients and vitamins were required to cultivate them (Harvey 1939). Ecological and descriptive studies continue to this day, with researchers now incorporating advanced techniques of molecular and cellular biology. Consequently, knowledge of diatoms’ basic biology and their potential for a range of commercial exploitations is now advancing rapidly. Diatoms can be recognized in the microscope by their highly ornamented silicified cell walls, known as frustules (figure 1). How diatoms generate these beautiful structures is largely unknown, although some insights are now being revealed (see below). The process is termed “biomineralization” (defined as the formation of inorganic materials under biological control), and the species-specific patterns indicate that it is genetically determined. Because marine organisms use more than 6.7 gigatons of silicon per year (Tréguer et al. 1995), it is particularly important to understand silicon uptake and deposition processes in diatoms. Furthermore, diatoms are used as bioindicators of pollution and water quality. Because many heavy metals and organic xenobiotics inhibit diatoms’ growth, other algae such as cyanobacteria come to dominate (Berland et al. 1976). It is therefore possible to determine water quality by analyzing plankton diversity. Diatoms are also used as hydrographic tracers because biogenic silica retains its primary oxygen BioScience 59: 757–765. ISSN 0006-3568, electronic ISSN 1525-3244. © 2009 by American Institute of Biological Sciences. All rights reserved. Request permission to photocopy or reproduce article content at the University of California Press’s Rights and Permissions Web site at www.ucpressjournals. com/reprintinfo.asp. doi:10.1525/bio.2009.59.9.7 www.biosciencemag.org October 2009 / Vol. 59 No. 9 • BioScience 757 21st Century Directions in Biology Figure 1. Electron micrograph of the elaborate silicified cell wall of a diatom (Thalassiosira oestrupii var. venrickae). The cell has a diameter of 9.5 microns. Image: Courtesy of Diana Sarno (Service for Taxonomy and Identification of Marine Phytoplankton, Stazione Zoologica Anton Dohrn, Naples, Italy). isotopic composition after burial (Sancetta 1981). This property can be used to monitor past surface temperatures and isotopic compositions of seawater (Shemesh et al. 1992). Additionally, because the frustule can retain its structural features over geological timescales, the diatom fossil record is of high quality. These observations reveal that diatoms have been major players in marine environments for at least the past 90 million years (Kooistra et al. 2007). Diatoms belong to the heterokont branch of the eukaryotes. This group lies within the hypothesized Chromalveolata kingdom within which several major lineages, including algae, can be found (Harper et al. 2005). These lineages were originally defined using morphological and developmental characters, and have subsequently been refined using molecular approaches—for example, sequence analysis of ribosomal RNA genes and highly conserved proteins such as RuBisCo (ribulose-1,5-bisphosphate carboxylase oxygenase) and elongation factor Tu (Baldauf et al. 1996, 2000). More recent, larger-scale phylogenomics approaches based on multiple sequence alignments are providing further insights into the evolutionary relationships between diatoms (Baldauf 2003, Li et al. 2006). Algal chloroplasts are believed to be derived from photosynthetic prokaryotes that invaded or were engulfed by a eukaryotic cell and then became endosymbionts more than 1.5 billion years ago (Gibbs 1981, Cavalier-Smith 1982, 1986). This event subsequently gave rise to the green and red algal lineages. The chromalveolates are thought to have derived from a second endosymbiotic event that occurred around 1 billion years ago (Yoon et al. 2004), in which a red, algal-like organism became associated a second time with a heterotrophic eukaryote (figure 2). The most striking evidence for this is the presence of four membranes surrounding the 758 BioScience • October 2009 / Vol. 59 No. 9 Figure 2. Schematic representation of the secondary endosymbiotic process thought to have given rise to the diatoms. An autotrophic red algal–like ancestor was endocytosed by a heterotrophic host cell. In the resulting cell, gene transfer occurred between the endosymbiont nucleus and the host nucleus, and probably also from the plastid and mitochondrial genomes. The resulting diatom cell contains the endosymbiont chloroplast, surrounded by four membranes, the host nucleus, and the host mitochondria. New genes have also been acquired by horizontal gene transfer from bacteria. Nuclei are shown in blue. Abbreviations: D, diatom; HGT, horizontal gene transfer; m, mitochondria; pp, primary plastid; SE, secondary endosymbiosis; sp, secondary plastid. chloroplasts in many photosynthetic chromalveolates such as the diatoms (Gibbs 1981). Diatoms are further divided into two groups, the centrics and pennates, on the basis of their radial and bilateral symmetry, respectively. Diatom fossils representing centric species date from the Cretaceous, whereas pennate diatoms appear to have arisen later, around 90 million years ago. Studies of diatom biology have gone through a paradigm shift following the recent incorporation of molecular and cellular methods to dissect their biology. Most of these studies have been performed on two species, Thalassiosira pseudonana and Phaeodactylum tricornutum, now considered model species for the centrics and pennates, respectively, because of the availability of whole-genome sequences and molecular tools to assess gene function (Armbrust et al. 2004, Poulsen et al. 2007, Siaut et al. 2007, Bowler at al. 2008). Diatom genome sequencing confirms novel evolutionary histories Both diatom genomes have been sequenced by the Joint Genome Institute in California. The sequence from the centric diatom T. pseudonana was the first to be reported (Armbrust et al. 2004), and it was the first of any eukaryotic marine phytoplankton species to be sequenced. The P. tricornutum genome was subsequently completed (Bowler et al. 2008). Both www.biosciencemag.org 21st Century Directions in Biology species contain around 11,000 predicted genes in approximately 30 million base-pair (Mbp; 32 Mbp for T. pseudonana and 27 Mbp for P. tricornutum) genomes. A careful functional and phylogenetic annotation of these genes, facilitated by the use of powerful computational approaches for predicting functional domains and subcellular locations, has provided new information to help understand the biology and evolutionary origins of diatoms. Whole-genome sequences from a wider range of other algal species have also become available, including a red alga, Cyanidioschyzon merolae (Matsuzaki et al. 2004), and three green algae species: Chlamydomonas reinhardtii (Merchant et al. 2007), Ostreococcus tauri (Derelle et al. 2006), and Ostreococcus lucimarinus (Palenik et al. 2007). In addition to whole-genome sequencing, expressed sequence tags (ESTs) provide a cheaper and simpler way to begin to acquire genomic data. The ESTs are generated from mRNA extracted from cells of a species of interest, transformed into complementary DNA (cDNA), and cloned into plasmids. A small region of each cDNA can then be sequenced to generate a tag that can serve to identify what the gene encodes. The ESTs from a range of unicellular algae have now been reported, including those from Fragilariopsis cylindrus, a diatom found within the ice in polar regions, and Pseudo-nitzschia multiseries, a bloom-forming diatom capable of synthesizing the toxin domoic acid. Both of these genome sequences are now being completed as well. As noted above, before genome sequencing technologies had been developed, a prevailing hypothesis was that diatoms originated from a secondary endosymbiotic event between a heterotrophic and an autotrophic eukaryote (figure 2). This hypothesis is supported by genome analysis, which revealed the presence of genes typical of both animal and plant classes of eukaryotes, such as components encoding the urea cycle and fatty acid oxidation, typical of animals, and genes encoding photosynthesis, found in plants. The proposed red algal origin of the diatom chloroplast is also supported by genome analysis (Oudot-Le Secq et al. 2007, Bowler et al. 2008). In addition to providing strong support for these hypotheses, examination of the predicted gene sets has also revealed the presence of hundreds of genes that are likely to be derived from horizontal gene transfer between bacteria and diatoms. Diatom genomes therefore appear to be “melting pots” of genes that have been derived from a variety of sources over evolutionary time, and it has been hypothesized that this unique cocktail of genes has conferred new metabolic and regulatory capacities that have been key in establishing their ecological finesse (Bowler et al. 2008). Comparisons of gene repertoires between T. pseudonana and P. tricornutum can also serve as a basis to explain the differences between centric and pennate diatoms. For example, in contrast to centric diatoms, raphid pennate diatoms possess a raphe, which permits them to move actively. They are major biofoulers, they include toxic species, and they generally respond most strongly to mesoscale iron fertilization (de Baar et al. 2005, Boyd et al. 2007, Kooistra et al. www.biosciencemag.org 2007). They also have amoeboid isogametes in contrast to motile sperm and oogamy in centric species. The availability of these two genome sequences, combined with the tools described below, allow the molecular bases of these differences to be explored and understood. Analysis of gene expression in diatoms Analysis of gene expression over time and in different conditions is a useful proxy for identifying the conditions in which a gene product plays an important role. Gene-expression studies are most often performed by quantifying levels of mRNA for the gene of interest. This is most commonly done by first converting mRNA into cDNA, and then quantifying the amount of expression with quantitative real-time polymerase chain reaction (qRT-PCR) using fluorescent dyes or fluorescent probes. This technique can be carried out with small amounts of mRNA, but it is crucial to have accurate reference genes to normalize expression levels. Siaut and colleagues (2007) identified several housekeeping genes in diatoms whose expression remains relatively constant in different conditions, and in particular proposed the use of RPS, a gene that encodes a 30S ribosomal protein subunit, and TBP, a gene encoding the TATA-box binding protein, as rather stable reference genes. A more general approach is to use ESTs to identify whole suites of genes expressed under particular conditions. When performed on a large scale and without normalizing for differences in mRNA levels of each gene, a global picture of gene expression can be obtained. In P. tricornutum, ESTs have been generated from cells grown in 16 different conditions, such as on different nitrogen sources, under iron limitation, or in high CO2. Between 6000 and 12,000 ESTs are available from each library, constituting a total of more than 130,000 ESTs. These sequences have been organized into a digital gene expression database that permits expression patterns of individual genes to be examined, and also allows the facile identification of genes displaying similar expression profiles (Maheswari et al. 2009; www.biologie.ens.fr/diatomics/EST3). Another approach to examining gene expression at the whole-genome level is to use microarrays. Technologies are now available to generate high-density arrays at a low cost, such as those from Agilent (www.agilent.com) and Nimblegen (www.nimblegen.com). A microarray typically contains oligonucleotides representing each gene of interest, for example, for each annotated gene in a diatom genome. Compared with ESTs, an advantage of using microarrays is that genes that are both up- and downregulated in a particular condition can be identified. On the other hand, ESTs can permit the identification of bona fide expressed genes that were not predicted by the in silico methods used for genome annotation. This limitation of typical gene-specific microarrays was circumvented by Mock and colleagues (2008), who identified additional unpredicted genes in T. pseudonana using tiled arrays. Tiled arrays are a kind of microarray in which a whole genome is represented by oligonucleotides, often on both DNA strands. Using this method, they identified 3500 putaOctober 2009 / Vol. 59 No. 9 • BioScience 759 21st Century Directions in Biology tive new genes, some of which corresponded to noncoding and antisense RNAs. Among those genes, 75 were identified by gene-specific expression profiles as potentially involved in silicon metabolism, and half of them encode proteins of unknown function. Interestingly, these genes also provided evidence of a link between silicon and iron metabolism pathways. The above-described methods for studying gene expression can help infer a function of diatom-specific genes, and several examples have now been reported in which these methods have been used to explore specific aspects of diatom biology, such as nutrient assimilation (Allen AE et al. 2008, Mock et al. 2008), light responses (Siaut et al. 2007), and gene expression during cell division (Gillard et al. 2008). Such studies are especially important, given the unusual combinations of genes that have been found in diatoms, such that empirical studies of gene expression in different conditions are required to understand how they function together in a coordinated manner. Transgenic technology It is also important to be able to manipulate the expression of single genes and to assess the consequences of that modulation for the organism under study. Genetic transformation technologies offer powerful ways of doing this by reverse genetics. In diatoms, the most commonly used technique for generating transgenic cells is based on helium-accelerated bombardment of microparticles coated with the DNA that is to be introduced. The methodology was initially reported for Cyclotella cryptica (Roessler et al. 1994) and subsequently for P. tricornutum (Apt et al. 1996), but has now been applied to a range of other diatoms, including most recently T. pseudonana (Poulsen et al. 2007). Notwithstanding, the tools are most highly developed for P. tricornutum—in particular, a series of different transformation vectors made from the Gateway cloning system from Invitrogen (www.invitrogen.com/gateway. html; Siaut et al. 2007), which greatly facilitates the generation of chimeric gene constructs for a range of different applications, such as for the inactivation and overexpression of genes, and for the localization of a gene product inside the cell by fusing it to fluorescent reporters. The green fluorescent protein (GFP) from the jellyfish Aequorea victoria, the most versatile fluorescent tag currently used in biology, was the subject of the project that won the 2008 Nobel prize for chemistry. Besides the wild-type green version, a series of differently colored variants are now available, which can even be combined to label different proteins in the same cell (figure 3). Such technologies complement the more traditional protein localization approaches by subcellular fractionation and immunolocalization, in that they permit localization to be visualized inside living cells. In addition to GFP, other reporter proteins have also been used in diatoms, such as luciferase and beta-glucuronidase, which allow studies of gene expression in response to particular conditions (Falciatore et al. 1999). Using GFP as a reporter for protein localization, Kroth (2007a, 2007b) studied the mechanisms of protein translocation and import into diatom chloroplasts, a fundamental but little-understood process in diatoms. This is of further interest because diatom plastids are surrounded by four instead of two membranes. These studies have shown that the outer two membranes of diatom plastids appear to be derived from Figure 3. Fluorescent image of a pair of transgenic Phaeodactylum tricornutum cells cotransformed with a Sec4 protein-YFP fusion (green) and a Histone H4-CFP fusion (blue). The Sec4 protein localizes to intracellular vesicles and to the plasma membrane, whereas H4 localizes to the nucleus. Chlorophyll autofluorescence from the plastid is shown in red. A brightfield view of two P. tricornutum cells (left) is shown for comparison. Images: Courtesy of Anton Montsant. 760 BioScience • October 2009 / Vol. 59 No. 9 www.biosciencemag.org 21st Century Directions in Biology plasma membrane and endoplasmic reticulum, whereas the inner two membranes resemble those found in green algae and higher plants, as would be predicted if diatom plastids were indeed acquired through secondary endosymbiosis (Gibbs 1981). Other studies have shown proteins localized in diatom mitochondria and other subcellular organelles (Siaut et al. 2007). Furthermore, Vardi and colleagues (2008) recently found that a protein associated with nitric oxide (NO) production localizes to diatom chloroplasts, in contrast to its plant ortholog, which was found in mitochondria (Guo and Crawford 2005). Such fundamental differences were instrumental in defining the role of the protein in diatoms (see below), thus showing the utility of GFP-based reporters for protein localization studies. Transgenes introduced in diatoms are under the control of promoters, DNA sequences upstream of protein coding sequences that spatiotemporally regulate gene expression. When a transgene is placed under the control of a chosen promoter, it will usually display the expression pattern of the gene from which the promoter was derived. Transgenes are most often expressed from strong promoters such as FCP (fucoxanthin chlorophyl protein) promoters (derived from genes encoding light-harvesting fucoxanthin-chlorophyll a/c-binding proteins). Although these promoters are to some extent regulated by light, they are generally considered to be rather constitutive. Transgenic technology also enables the modulation of gene expression using inducible promoter systems, which can be particularly useful when expression of a gene is lethal for the cell. Poulsen and Kröger (2005) reported the first method for inducible gene expression to study gene function in diatoms, based on a nitrate reductase promoter that is responsive to exogenous nitrate concentrations. With this system, a transgene can in principle be switched on and off simply by controlling the amount of nitrate in the growth medium. Overexpression of a gene of interest can help in understanding its function, particularly when its inactivation is lethal. An overexpressed gene can cause a change in phenotype, thereby providing information that is useful for understanding its function. Overexpression of specific genes has been reported several times in diatoms. A notable example is the overexpression of a gene encoding a glucose transporter to convert P. tricornutum cells from photoautotrophy to heterotrophy (Zaslavskaia et al. 2001). The inhibition of expression of a gene of interest has also become a crucial method for elucidating gene function. The method is most typically called gene silencing, and it consists of generating small RNAs complementary to the target gene. These small RNAs will bind to the transcribed product and inhibit its translation into protein. The technique has recently been reported in diatoms, providing for the first time a method of inactivating gene expression in these organisms (De Riso et al. 2009). In addition to the methods of reverse genetics described above, forward genetics can discover genes responsible for a defined phenotype. In forward genetics, cells are usually first www.biosciencemag.org mutagenized and then screened for interesting and unusual phenotypes in the mutagenized population. This method has proved to be extremely powerful for dissecting the basic biology of a wide range of both unicellular and multicellular organisms (Candela and Hake 2008, Carradice and Lieschke 2008), although it has not yet been reported in diatoms because of the difficulty of controlling their sexual cycle (Chepurnov et al. 2008). This is important because diatom cells are diploid, so any recessive mutation needs first to be fixed on both copies by going through a round of meiosis, that is, sex. To circumvent this shortcoming, a technique known as activation tagging has been used in other organisms (Candela and Hake 2008, Carradice and Lieschke 2008). The approach is based on the random introduction into the genome of enhancers, DNA sequences that positively affect expression of a gene when inserted close by. The use of such techniques to generate dominant mutations in diatoms has not yet been reported, although it is likely to be a useful strategy for isolating mutants in forward genetic screens. The benefits of using modern molecular technologies to study diatoms The use of molecular biology together with more classical studies of diatom biology has led to a range of advances for understanding their cell biology. The targeting of proteins to the plastid (Gruber et al. 2007) and the dissection of diatom cell-division mechanisms (Gillard et al. 2008) were mentioned previously. The availability of diatom genome sequences has also provided valuable starting points for exploring their responses to key limiting nutrients such as nitrogen, silicate, and iron (Allen AE et al. 2006). Iron metabolism is of particular interest because diatoms tend to dominate in mesoscale iron fertilization experiments (de Baar et al. 2005, Boyd et al. 2007), suggesting that they are iron limited under natural conditions. Iron responses have been studied at the transcriptional level in both sequenced diatoms (Allen AE et al. 2008, Bidle and Bender 2008, Mock et al. 2008). While both diatoms possess classical ferric reductase enzymes, these enzymes are more numerous in P. tricornutum. Furthermore, P. tricornutum possesses a number of gene clusters—absent in the T. pseudonana genome—that are highly expressed under iron limitation. Some of these genes, prokaryotic in character, point to new iron acquisition systems that have not yet been described in eukaryotic algae (Allen AE et al. 2008). Also notable is the presence of the iron-storage protein ferritin in P. tricornutum but not in T. pseudonana (Marchetti et al. 2008). These differences between the sequenced centric and pennate diatoms may partly explain the higher tolerance of P. tricornutum, and pennate diatoms in general, to iron limitation (Kustka et al. 2002, de Baar et al. 2005, Boyd et al. 2007). Carbon fixation in aquatic organisms can be enhanced by CO2-concentrating mechanisms (CCMs) that increase the availability of CO2 for the carbon-fixing RuBisCo enzyme. These mechanisms are defined as biophysical, because of the action of inorganic carbon uptake systems and carbonic October 2009 / Vol. 59 No. 9 • BioScience 761 21st Century Directions in Biology anhydrases, and biochemical, which is based on the cycling of CO2 through C4 intermediates. The mechanisms used by diatoms remain controversial despite intensive research (Giordano et al. 2005). Phaeodactylum tricornutum has been proposed to have a greater capacity for biophysical CCM than other diatoms (Kroth et al. 2008), and this is also supported by the higher numbers, compared with T. pseudonana, of bicarbonate transporters (seven and three, respectively) and carbonic anhydrases (four and one) encoded in its genome. Furthermore, one of the carbonic anhydrases is a beta-type, plastid-localized enzyme that is absent in T. pseudonana (Montsant et al. 2005). On the other hand, both diatoms may use a C4-based biochemical CCM involving the cycling of C4 intermediates between the inside of the plastid, the periplastidic space, and the mitochondria (Kroth et al. 2008). If confirmed, such a mechanism would be a highly novel means of capturing CO2, and may also help the cells to dissipate excess light energy. Illustrating the utility of molecular approaches to study diatom responses of ecological relevance, recent studies reveal the presence of complex inter- and intracellular signaling mechanisms that regulate population proliferation and even programmed cell death in response to environmental signals. When zooplankton graze on diatom populations, the diatoms release aldehydes such as decadienal that can reduce the reproductive capacity of the grazer population, potentially providing an antigrazing strategy. Increased aldehyde production by diatoms can also occur as a general response to wounding (Pohnert et al. 2004).Vardi and colleagues (2006) showed that both P. tricornutum and T. weissflogii respond to treatment with the aldehyde by producing NO, a phenomenon that is most likely regulated by changes in intracellular calcium concentrations. At high concentrations, the aldehyde causes cell death, whereas pretreatment with low concentrations can prime cells to become immune. In a subsequent study, Vardi and colleagues (2008) showed that expression of the PtNOA gene (NO associated), which encodes a GTP-binding protein belonging to the highly conserved YqeH subfamily, is increased in response to the aldehyde, and P. tricornutum cells overexpressing PtNOA display increased NO production and the appearance of several features symptomatic of stress. Adhesion of cells to surfaces was also compromised, implying the importance of NO-regulated events for biofilm formation (Thompson et al. 2008). These studies therefore identify a diatom gene that appears to be central for regulating stress sensitivity in diatoms. Diatom silicification is one of the most distinctive features of diatoms (figure 1). The frustule (the cell wall) is composed of two halves, a larger half and a smaller half called the epivalve and the hypovalve, respectively. The frustule is partly organic (proteins and polysaccharides) and partly bioinorganic (hydrated silicon, SiO2·[H2O]n). This incredibly robust and highly ornamented structure has several proposed roles, including protection from grazers and parasites through its mechanical resistance (Hamm et al. 2003, Pondaven et al. 2007), or as a proton-buffering agent (Milligan 762 BioScience • October 2009 / Vol. 59 No. 9 and Morel 2002). The frustule is synthesized during cell division within a membrane-bound organelle, termed the silica deposition vesicle (SDV), which rapidly extends to form a flat, large vesicle in which the new valve is synthesized. When the new valve is complete, it is bulk exocytosed and becomes the hypovalve of the new cell (Zurzolo and Bowler 2001). Pioneering biochemical studies of frustule composition have been performed by Kröger and colleagues in the pennate diatom Cylindrotheca fusiformis, and have led to the identification of several components found only in diatoms (Kröger et al. 1999, Kröger and Poulsen 2008). This work led to the discovery of silaffins, novel peptides that may participate in the basic biomineralization process within the SDV (Kröger et al. 1999). Remarkably, these silaffins can promote the formation of nanoscale silica spheres in vitro, and are the first peptides shown to be able to do this. They are encoded by modular genes whose gene product requires extensive posttranslational modifications such as the addition of phosphate and sugar groups during maturation from the precursor protein to the mature peptides. Other major organic constituents of diatom biosilica are putrescine-derived, long-chain polyamines, which, like the silaffins, can also induce rapid silicic acid precipitation in vitro (Kröger et al. 2000). Different diatoms are likely to have different complements of silaffins and polyamines that confer species-specific differences to silica precipitation and thereby result in species-specific nanopatterning, although they have so far been poorly characterized because of the difficulties of identifying the genes involved on the basis of only homology. Furthermore, it appears that the posttranslational modifications to these peptides are in fact more important than their amino acid sequence per se. Notwithstanding, silaffin genes have been fused to GFP, and the fusion proteins are incorporated into the silicified cell walls (Poulsen et al. 2007). Silaffin gene expression is also upregulated significantly during valve formation. Other proteinaceous components of diatom cell walls include frustulins and pleuralins (formerly called HEP proteins; Kröger and Poulsen 2008), containing conserved calcium-binding domains separated by hydroxyproline, polyproline/hydroxyproline, or polyglycine-rich regions. Like the silaffins, pleuralins are also tightly bound to silica and can be removed from diatom cell walls only after the solubilization of silica with hydrogen fluoride. Pleuralins are encoded by a small multigene family in C. fusiformis but have not been found in T. pseudonana or P. tricornutum, and so they are likely to represent a specific structural feature of this diatom. Pleuralin-1 is not targeted to the SDV but is directly secreted into the cleavage furrow that forms between the two daughter cells (Kröger and Wetherbee 2000). Association with the terminal girdle band of the hypotheca therefore occurs in the extracellular space. It will be interesting to determine how many other wall-associated proteins avoid the SDV during their secretion and incorporation into diatom frustules. www.biosciencemag.org 21st Century Directions in Biology Frustulins are much more loosely associated with diatom cell walls than are silaffins and pleuralins, and can be extracted with EDTA (ethylenediaminetetraacetic acid, a common chelating agent; Kröger and Poulsen 2008). They are glycoproteins that can bind calcium because of the presence of EF hands (helix-loop-helix structures in a family of calciumbinding proteins), and also contain characteristic acidic, cysteine-rich domains. Although frustulins are most likely conserved in all diatoms, including T. pseudonana and P. tricornutum, their function is not yet known. They are not thought to be involved in silica deposition. Biotechnology. The precision of the nanoscale pattern and architecture of the frustule far exceeds the capabilities of current materials and science engineers, suggesting that understanding diatom cell-wall biosynthesis will one day be exploitable in nanotechnological applications (Parkinson and Gordon 1999, Lopez et al. 2005). Although the metabolic pathways that drive cell-wall biosynthesis remain largely unexplored, they constitute a clear target for the discovery of novel protein functions that are unlikely to be found in other organisms and can be exploited in biotechnological applications. For example, diatom frustules can be incorporated into membranes and used for the size-selective separation of nanoparticles (Losic et al. 2006). Biomimetic studies also seem promising: For example, Vrieling and colleagues (2005) reported using water glass– based and polyethylene oxide–based polymers to control the synthesis of silica to generate ordered porous structures at the nanometer and micrometer levels. The use of frustules to make other functional materials by chemical conversion has also been reported (Bao et al. 2007). These technologies open up new opportunities to produce three-dimensional (3-D) siliceous materials that have never before been engineered (Kröger 2007, Kröger and Poulsen 2008). In addition, Gordon and Parkinson (2005) proposed another role for silica in linear lithographic techniques that are used to engineer microelectronics and that consist in replacement of silicon with another atom while maintaining the 3-D structure of origin. The genomic-enabled techniques described in this review can be of great utility for understanding and ultimately exploiting the silicon nanaofabrication capacities of diatoms, and some progress has already been made, as evidenced in the previous section. Further progress will very likely require additional high-quality biochemistry, as well as high-throughput, genetic-based screens, to identify diatom-specific genes of currently unknown function. The conversion of solar energy into chemical energy by photosynthesis has become of great interest for the generation of renewable energy resources. Diatoms have a high lipid content (up to 70% dry weight; Chisti 2007), and so they have been proposed as a source of biofuels (Kroth 2007a). Furthermore, the residual biomass is rich in protein and so could be used as animal feed. Moreover, because diatoms do not contain complex carbohydrate-based polymers such as cellulose, ruminants’ digestion of diatom-derived biomass www.biosciencemag.org generates less methane and other potent greenhouse gases than would be the case with other feed. Now that fossil fuels are being depleted and becoming more expensive, these diatombased applications are particularly appealing. Compared with plant-based sources of biofuels, diatoms and other algae are much more efficient converters of solar energy and have a much higher energy potential (Chisti 2008). Furthermore, they do not compete with food production, they can be grown in saltwater on marginal land, and they require less water inputs (Lebeau and Robert 2003, Dismukes et al. 2008). Phaeodactylum tricornutum is an attractive target for proof of principle because it has a high lipid content (up to 30%), it can be genetically manipulated, and it is already widely cultivated for commercial aquaculture. Genetic manipulation can potentially be used to increase photosynthetic efficiency to enable increased biomass yield, to enhance biomass growth rate, to increase oil content in biomass, to improve temperature tolerance to reduce the expense of cooling, to eliminate light saturation of photosynthesis, and to reduce photoinhibition and photooxidation. In addition, there is a need to identify new diatom strains with high oil content or to breed or select for improved strains. Conclusions The recently available genome sequences from two diatoms demonstrated the novel multilineage history of their gene repertoires and revealed that their genomes encode an enormous metabolic and regulatory potential that perhaps underlies their ecological success. The noncanonical nature of their genomes indicates that the functional exploration of diatomspecific genes is required to dissect their roles in diatom biology. Revealing the functions of these thousands of diatom genes that do not have proxies in conventional model organisms is going to be a major challenge, and it is highly unlikely (because of the lack of financial resources and dedicated personnel) that each can be experimentally assigned a function through reverse-genetics approaches such as gene knockouts. Nonetheless, new molecular techniques combined with biochemical approaches provide an excellent starting point for exploring novel aspects of diatom biology and for developing biotechnological applications. Additional computational approaches are likely to be required to help us predict protein functions, and proteomics approaches can help to associate specific proteins with specific cellular structures or protein complexes. As knowledge of diatom biology grows through laboratorybased experiments, additional technologies will need to be developed for exploring diatom biology in natural environments. Metabolomics technologies could be of some help, in that they can reveal the metabolic signatures of cells grown in specific conditions, as was recently illustrated (Allen AE et al. 2008). High-throughput genomics technologies that have yet to be developed for diatoms would also be a major boost for rapidly identifying mutations that result in major cellular perturbations (e.g., in the silicon nanofabrication process). In all such scenarios, the available genome sequences clearly provide a October 2009 / Vol. 59 No. 9 • BioScience 763 21st Century Directions in Biology major advance in our knowledge and in our opportunities to explore diatom biology. We eagerly await the forthcoming sequences from the polar diatom Fragilariopsis cylindrus and the toxin-producing Pseudo-nitzschia multiseries. Acknowledgments Funding for our work has been obtained from the European Union (EU)–funded FP6 Diatomics project (LSHG-CT2004-512035), the EU-FP6 Marine Genomics Network of Excellence (GOCE-CT-2004-505403), an ATIP (Actions Thématiques Initatives sur Programmes) Blanche grant from the Centre National de la Recherche Scientifique, and the Agence Nationale de la Recherche (France). References cited Allen AE, Vardi A, Bowler C. 2006. An ecological and evolutionary context for integrated nitrogen metabolism and related signaling pathways in marine diatoms. Current Opinion in Plant Biology 9: 264–273. Allen AE, LaRoche J, Maheswari U, Lommer M, Schauer N, Lopez PJ, Finazzi G, Fernie AR, Bowler C. 2008. Whole-cell response of the pennate diatom Phaeodactylum tricornutum to iron limitation. Proceedings of the National Academy of Sciences 105: 10438–10443. Allen WE. 1926. Remarks on surface distribution of marine plankton diatoms in the East Pacific. Science 63: 96–97. Apt KE, Kroth-Pancic PG, Grossman AR. 1996. Stable nuclear transformation of the diatom Phaeodactylum tricornutum. Molecular Genetics and Genomics 252: 572–579. Armbrust EV, et al. 2004. The genome of the diatom Thalassiosira pseudonana: Ecology, evolution, and metabolism. Science 306: 79–86. Baldauf SL. 2003. The deep roots of eukaryotes. Science 300: 1703–1706. Baldauf SL, Palmer JD, Doolittle WF. 1996. The root of the universal tree and the origin of eukaryotes based on elongation factor phylogeny. Proceedings of the National Academy of Sciences 93: 7749–7754. Baldauf SL, Roger AJ, Wenk-Siefert I, Doolittle WF. 2000. A kingdom-level phylogeny of eukaryotes based on combined protein data. Science 290: 972–977. Bao Z, et al. 2007. Chemical reduction of three-dimensional silica microassemblies into microporous silicon replicas. Nature 446: 172–175. Berland BR, Kapkov VI, Maestrini SY, Arlhac DP. 1976. Toxic effect of four heavy metals on the growth of unicellular marine algae. Comptes rendus hebdomadaires des séances de l’Académie des Sciences D 282: 633–636. Bidle KD, Bender SJ. 2008. Iron starvation and culture age activate metacaspases and programmed cell death in the marine diatom Thalassiosira pseudonana. Eukaryotic Cell 7: 223–236. Bowler C, et al. 2008. The Phaeodactylum genome reveals the evolutionary history of diatom genomes. Nature 456: 239–244. Boyd PW, et al. 2007. Mesoscale iron enrichment experiments 1993–2005: Synthesis and future directions. Science 315: 612–617. Breidbach O. 2005. Art Forms from the Ocean: The Radiolarian Prints of Ernst Haeckel. Prestel. Candela H, Hake S. 2008. The art and design of genetic screens: Maize. Nature Reviews Genetics 9: 192–203. Carradice D, Lieschke GJ. 2008. Zebrafish in hematology: Sushi or science? Blood 111: 3331–3342. Cavalier-Smith T. 1982. The evolutionary origin and phylogeny of eukaryote flagella. Symposia of the Society for Experimental Biology 35: 465–493. ———. 1986. The kingdoms of organisms. Nature 324: 416–417. Chepurnov VA, Mann DG, Von Dassow P, Vanormelingen P, Gillard J, Inzé D, Sabbe K, Vyverman W. 2008. In search of new tractable diatoms for experimental biology. Bioessays 30: 692–702. Chisti Y. 2007. Biodiesel from microalgae. Biotechnology Advances 25: 294–306. 764 BioScience • October 2009 / Vol. 59 No. 9 ———. 2008. Biodiesel from microalgae beats bioethanol. Trends in Biotechnology 26: 126–131. de Baar HJW, et al. 2005. Synthesis of iron fertilization experiments: From the iron age in the age of enlightenment. Journal of Geophysical Research Oceans 110: C09S16. doi: 10.1029/2004JC002601 De Martino A, Amato A, Bowler C. 2009. Mitosis in diatoms: Rediscovering an old model for cell division. Bioessays 31: 874–884. De Riso V, Raniello R, Maumus F, Rogato A, Bowler C, Falciatore A. 2009. Gene silencing in the marine diatom Phaeodactylum tricornutum. Nucleic Acids Research 37: e96. doi:10.1093/nar/gkp448 Derelle E, et al. 2006. Genome analysis of the smallest free-living eukaryote Ostreococcus tauri unveils many unique features. Proceedings of the National Academy of Sciences 103: 11647–11652. Dismukes GC, Carrieri D, Bennette N, Ananyev GM, Posewitz MC. 2008. Aquatic phototrophs: Efficient alternatives to land-based crops for biofuels. Current Opinions in Biotechnology 19: 235–240. Falciatore A, Casotti R, Leblanc C, Abrescia C, Bowler C. 1999. Transformation of nonselectable reporter genes in marine diatoms. Marine Biotechnology 1: 239–251. Falkowski PG, Barber RT, Smetacek V. 1998. Biogeochemical controls and feedbacks on ocean primary production. Science 281: 200–207. Field CB, Behrenfeld MJ, Randerson JT, Falkowski P. 1998. Primary production of the biosphere: Integrating terrestrial and oceanic components. Science 281: 237–240. Gibbs SP. 1981. The chloroplasts of some algal groups may have evolved from endosymbiotic eukaryotic algae. Annals of the New York Academy of Sciences 361: 193–208. Gillard J, et al. 2008. Physiological and transcriptomic evidence for a close coupling between chloroplast ontogeny and cell cycle progression in the pennate diatom Seminavis robusta. Plant Physiology 148: 1394–1411. Giordano M, Beardall J, Raven JA. 2005. CO2 concentrating mechanisms in algae: Mechanisms, environmental modulation, and evolution. Annual Review of Plant Biology 56: 99–131. Gordon R, Parkinson J. 2005. Potential roles for diatomists in nanotechnology. Journal of Nanoscience and Nanotechnology 5: 35–40. Gruber A, Vugrinec S, Hempel F, Gould SB, Maier UG, Kroth PG. 2007. Protein targeting into complex diatom plastids: Functional characterisation of a specific targeting motif. Plant Molecular Biology 64: 519–530. Guo FQ, Crawford NM. 2005. Arabidopsis nitric oxide synthase1 is targeted to mitochondria and protects against oxidative damage and dark-induced senescence. Plant Cell 17: 3436–3450. Hamm CE, Merkel R, Springer O, Jurkojc P, Maier C, Prechtel K, Smetacek V. 2003. Architecture and material properties of diatom shells provide effective mechanical protection. Nature 421: 841–843. Harper JT, Waanders E, Keeling PJ. 2005. On the monophyly of chromalveolates using a six-protein phylogeny of eukaryotes. International Journal of Systematic and Evolutionary Microbiology 55: 487–496. Harvey PH. 1939. Hereditary variation in plant nutrition. Genetics 24: 437–461. Kooistra WHCF, Gersonde R, Medlin LK, Mann DG. 2007. Pages 207–249 in Falkowski PG, Knoll AH, eds. Evolution of Primary Producers in the Sea. Academic Press. Kröger N. 2007. Prescribing diatom morphology: Toward genetic engineering of biological nanomaterials. Current Opinion in Chemical Biology 11: 662–669. Kröger N, Poulsen N. 2008. Diatoms—from cell wall biogenesis to nanotechnology. Annual Review of Genetics 42: 83–107. Kröger N, Wetherbee R. 2000. Pleuralins are involved in theca differentiation in the diatom Cylindrotheca fusiformis. Protist 151: 263–273. Kröger N, Deutzmann R, Sumper M. 1999. Polycationic peptides from diatom biosilica that direct silica nanosphere formation. Science 286: 1129–1132. Kröger N, Deutzmann R, Bergsdorf C, Sumper M. 2000. Species-specific polyamines from diatoms control silica morphology. Proceedings of the National Academy of Sciences 97: 14133–14138. Kroth PG. 2007a. Molecular biology and the biotechnological potential of diatoms. Advances in Experimental Medicine and Biology 616: 23–33. www.biosciencemag.org 21st Century Directions in Biology ———. 2007b. Genetic transformation: A tool to study protein targeting in diatoms. Methods in Molecular Biology 390: 257–268. Kroth PG, et al. 2008. A model for carbohydrate metabolism in the diatom Phaeodactylum tricornutum deduced from comparative whole genome analysis. PLoS One 3: e1426. Kustka A, Carpenter EJ,Sañudo-Wilhelmy SA. 2002. Iron and marine nitrogen fixation: Progress and future directions. Research in Microbiology 153: 255–262. Lebeau T, Robert JM. 2003. Diatom cultivation and biotechnologically relevant products, pt. 1: Cultivation at various scales. Applied Microbiology and Biotechnology 60: 612–623. Li S, Nosenko T, Hackett JD, Bhattacharya D. 2006. Phylogenomic analysis identifies red algal genes of endosymbiotic origin in the chromalveolates. Molecular Biology and Evolution 23: 663–674. Lopez PJ, Desclés J, Allen AE, Bowler C. 2005. Prospects in diatom research. Current Opinion in Biotechnology 16: 180–186. Losic D, Rosengarten G, Mitchell JG, Voelcker NH. 2006. Pore architecture of diatom frustules: Potential nanostructured membranes for molecular and particle separations. Journal of Nanoscience and Nanotechnology 6: 982–989. Maheswari U, Mock T, Armbrust EV, Bowler C. 2009. Update of the Diatom EST Database: A new tool for digital transcriptomics. Nucleic Acids Research 37: D1001–D1005. Matsuzaki M, et al. 2004. Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D. Nature 428 : 653–657. Marchetti A, Parker MS, Moccia LP, Lin EO, Arrieta AL, Ribalet F, Murphy ME, Maldonado MT, Armbrust EV. 2008. Ferritin is used for iron storage in bloom-forming marine pennate diatoms. Nature 457: 467–470. Merchant SS, et al. 2007. The Chlamydomonas genome reveals the evolution of key animal and plant functions. Science 318: 245–250. Milligan AJ, Morel FM. 2002. A proton buffering role for silica in diatoms. Science 297: 1848–1850. Mock T, et al. 2008. Whole-genome expression profiling of the marine diatom Thalassiosira pseudonana identifies genes involved in silicon bioprocesses. Proceedings of the National Academy of Sciences 105: 1579–1584. Montsant A, Jabbari K, Maheswari U, Bowler C. 2005. Comparative genomics of the pennate diatom Phaeodactylum tricornutum. Plant Physiology 137: 500–513. Oudot-Le Secq MP, Grimwood J, Shapiro H, Armbrust EV, Bowler C, Green BR. 2007. Chloroplast genomes of the diatoms Phaeodactylum tricornutum and Thalassiosira pseudonana: Comparison with other plastid genomes of the red lineage. Molecular Genetics and Genomics 277: 427–439. Palenik B, et al. 2007. The tiny eukaryote Ostreococcus provides genomic insights into the paraox of plankton speciation. Proceedings of the National Academy of Sciences 104: 7705–7710. Parkinson J, Gordon R. 1999. Beyond micromachining: The potential of diatoms. Trends in Biotechnology 17: 190–196. Pickett-Heaps J, Schmid A-MM, Tippit DH. 1984. Cell division in diatoms: A translation of part of Robert Lauterborn’s treatise of 1896 with some modern confirmatory observations. Protoplasma 120: 132–154. Pohnert G, Adolph S, Wichard T. 2004. Short synthesis of labeled and unlabeled 6Z, 9Z, 12Z, 15-hexadecatetraenoic acid as metabolic probes www.biosciencemag.org for biosynthetic studies on diatoms. Chemistry and Physics of Lipids 131: 159–166. Pondaven P, Gallinari M, Chollet S, Bucciarelli E, Sarthou G, Schultes S, Jean F. 2007. Grazing-induced changes in cell wall silicification in a marine diatom. Protist 158: 21–28. Poulsen N, Kröger N. 2005. A new molecular tool for transgenic diatoms: Control of mRNA and protein biosynthesis by an inducible promoterterminator cassette. FEBS Journal 272: 3413–3423. Poulsen N, Berne C, Spain J, Kröger N. 2007. Silica immobilization of an enzyme through genetic engineering of the diatom Thalassiosira pseudonana. Angewandte Chemie International Edition in English 46: 1843– 1846. Roessler PG, Bleibaum JL, Thompson GA, Ohlrogge JB. 1994. Characteristics of the gene that encodes acetyl-CoA carboxylase in the diatom Cyclotella cryptica. Annals of the New York Academy of Sciences 721: 250–256. Sancetta C. 1981. Diatoms as hydrographic tracers: Example from Bering Sea sediments. Science 211: 279–281. Shemesh A, Charles CD, Fairbanks RG. 1992. Oxygen isotopes in biogenic silica: Global changes in ocean temperature and isotopic composition. Science 256: 1434–1436. Siaut M, Heijde M, Mangogna M, Montsant A, Coesel S, Allen A, Manfredonia A, Falciatore A, Bowler C. 2007. Molecular toolbox for studying diatom biology in Phaeodactylum tricornutum. Gene 406: 23–35. Smetacek V. 1999. Diatoms and the ocean carbon cycle. Protist 150: 25–32. Thompson SEM, Taylor AR, Brownlee C, Callow ME, Callow JA. 2008. The role of nitric oxide in diatom adhesion in relation to substratum properties. Journal of Phycology 44: 967–976. Tréguer P, Nelson DM, Van Bennekom AJ, Demaster DJ, Leynaert A, Quéguiner B. 1995. The silica balance in the world ocean: A reestimate. Science 268: 375–379. Vardi A, Formiggini F, Casotti R, De Martino A, Ribalet F, Miralto A, Bowler C. 2006. A stress surveillance system based on calcium and nitric oxide in marine diatoms. PLoS Biology 4: e60. Vardi A, Bidle KD, Kwityn C, Hirsh DJ, Thompson SM, Callow JA, Falkowski P, Bowler C. 2008. A diatom gene regulating nitric-oxide signaling and susceptibility to diatom-derived aldehydes. Current Biology 18: 895–899. Vrieling EG, Sun Q, Beelen TP, Hazelaar S, Gieskes WW, van Santen RA, Sommerdijk NA. 2005. Controlled silica synthesis inspired by diatom silicon biomineralization. Journal of Nanoscience and Nanotechnology 5: 68–78. Yoon HS, Hackett JD, Ciniglia C, Pinto G, Bhattacharya D. 2004. A molecular timeline for the origin of photosynthetic eukaryotes. Molecular Biology and Evolution 21: 809–818. Zaslavskaia LA, Lippmeier JC, Shih C, Ehrhardt D, Grossman AR, Apt KE. 2001. Trophic conversion of an obligate photoautotrophic organism through metabolic engineering. Science 292: 2073–2075. Zurzolo C, Bowler C. 2001. Exploring bioinorganic pattern formation in diatoms. A story of polarized trafficking. Plant Physiology 127: 1339–1345. Anastasia Saade ([email protected]) and Chris Bowler (cbowler@biologie. ens.fr) are with the Department of Biology at the École Normale Supérieure in Paris. Bowler is the director of research, and his laboratory studies signaling in higher plants and marine diatoms. October 2009 / Vol. 59 No. 9 • BioScience 765
© Copyright 2026 Paperzz