Transfer functions for estimating paleoecological conditions (pH) from East African diatoms F. Gassel & F. Tekaia 2 l Ecole Normale Superieure, 92260, Fontenay-aux-Roses, France 2 Laboratoirede Statistique, 4 Place Jussieu, 75230, Paris Cedex 5, France Keywords: paleolimnology, diatoms, East Africa, transfer functions, pH-indicators Abstract Our purpose is to establish the quantitative relationship between recent diatom floras and ecological parameters, in order to extrapolate the results to the past. The parameter pH is here considered as an example. This work is based on the study of about 160 diatom samples from East Africa and of their corresponding biotopes. We propose some statistical methods to interpret the data. Correspondence analysis allows us to define the pH-indicator species. The regression calculations allow pH values to be calculated using the percentage of the diatom species in a sample. Introduction Material studied and data collection Our goal is to use fossil diatoms to quantify paleoecological conditions in East African lakes. This objective necessitates the following steps: a) studying the relationships between recent diatom communities and their corresponding biotopes of a statistically representative number of samples taken from East Africa; b) establishing an equation (transfer function) which would allow the value of an ecological parameter to be calculated, from the relative percentages of diatom species in a sample; c) extending the correlation to fossil samples once transfer functions are established for recent diatoms. This paper is intended to demonstrate the methods of approach used rather than to present final results. The parameter pH is exemplified since so far it appears to be the environmental factor indicated best by diatom floras. East African lakes are numerous and diversified. During the Quaternary they underwent wide fluctuations in water-level and water chemistry which have been recorded by changes in fossil diatom assemblages (Richardson & Richardson 1972; Holdship 1976; Harvey 1976; Richardson et al. 1978; Gasse 1975, 1977, 1980). The modern samples were collected from 98 stations (lakes, swamps, peat-pogs, rivers and thermal springs) situated between 12 N and 12 S latitude, 26 ° E and 43 E longitude, and ranging from 155 m below sea-level to 4 000 m.a.s.l. Detailed description of the localities will appear elsewhere (Gasse 1983). In most cases, pH, conductivity and temper'ature were measured in the field when the diatoms were collected. The pH ranges from about 5 to 10.9, and the conductivity from about 10 to 50 000 S cm 1. Chemical analyses of the corresponding waters were published by Talling & Tailing (1965), Tailing (1976), Kilham (1971), and Gasse (1975). The water temperature varies from a few degrees above zero in the mountains to 35 C in the deserts. Hydrobiologia 103, 85-90 (1983). © Dr W. Junk Publishers, The Hague. Printed in the Netherlands. 86 It reaches 55 C in thermal springs. Two principal chemical types of water can be distinguished: the sodium-bicarbonate-carbonate type, corresponding to the majority of the samples, were the factors pH, alkalinity and conductivity are strongly and positively correlated between themselves, and the sodium-chloride type which has a pH near 7. The following analyses are based on 156 contemporary diatom samples including phytoplankton, periphyton and mud. For each sample, a systematic inventory was established and the percentage of each taxon was evaluated by counting 300 to 1000 valves distributed on four slides. 579 species and varieties were identified. The check-list and autecological data will appear elsewhere (Gasse 1983). Statistical methods used As considerable progress has been made in the establishment of transfer functions between microorganism assemblages and ecological parameters (Kipp 1976; Roux 1979; Bryson& Kutzbach 1974), we attempt here to apply such methods to diatoms. Correspondenceanalysis (CA) The CA method (Benzecri 1973; Lebart et al. 1977; Benzecri 1980) aims at synthethizing the information contained in a data matrix and visualizing the relationships between the elements of two sets i (e.g. taxa percentages) and j (e.g. samples). The main properties and examples for defining diatom assemblages are given by Gasse & Tekaia (1979, 1982). Table I Table 3 classes pH pH, species I i pHZ pH3 pH4 - - kc s~~~I Table 2 samples 1 pH I PH j n I pH 22 ~0 PH3 0 pH4 4 0 Table 4 Table 5 classes pH pH pH 2 pH 3 pH 4 ,[ species i __ - s~~~I Tables 1 5. Tables submitted to correspondence analysis (CA). k! 87 We present here the types of tables which have undergone factor analysis and indicate for each table whether it is considered as a principal table (defining a factorial axis), or as a supplementary element of another table. The supplementary element does not participate in the definition of the axes, but is placed with regard to the axes defined by the principal table. In Tables 1-5, n and s refer to the number of samples and species, respectively. Table 1 is the initial table where the intersection of the row i and the column j is the percentage of species i in the sample j. In Table 2, sample j is also defined by its pH. One can consider breaking down this parameter into classes as follows: pH,, 5-6.9 (37 samples); pH 2, 7-7.9 (36 samples); pH 3, 8-8.6 (38 samples); pH 4 , 8.7-10.9 (37 samples). If the pH of sample j is equal to 5.5 (class pH,), the value is coded as 1000, meaning that this parameter is I in this class, where it is found, but 0 in all others. Table 2 will be placed as supplementary to Table 1. Table 3 is constructed from Table 1. If kic is designated as the value found at the intersection of the species i and the pH class c, ki, is the sum of the percentages pij of species i in the samples j situated in the class c of pH. Table 3 will be analyzed as a principal table. Table 1 is added to it as the supplementary table. Table 4 is the table of presence-absence; it is obtained from Table I by replacing the values that are not 0 by . If the species i is present in the sample j, pij = , if not pij = 0. Table 5 is constructed from Table 4 in the same manner as Table 3 from Table I. In this case, ki is the number of times that the species i is present in the samples found in the class c of pH. This table will be analyzed as a principal table and Table 4 will be added to it as a supplementary element. In our case, the number of species is high (579). Therefore, we cannot carry out regressions from Table I because it would give illusory results. It is known that the quality of a regression is based on the coefficient of multiple correlation R; the more this coefficient approaches 1, the better the regression. In cases where the independent variables are numerous, a high coefficient is obtained even if some are independent of the variable to be explained. It is therefore necessary to reduce the number of independent variables so that a significant result can be obtained. To do this a CA of the initial table (Table 1) is carried out (Cazes 1978; Roux 1979) and the 579 species are reduced to a number of factors. Each sample is defined by its co-ordinates on the factors which are now considered as the independent variables. An advantage of proceeding in this manner is that the new variables are not correlated between themselves, and the problem of an unstable coefficient of multiple correlation is avoided. On the other hand the inconvenience of this procedure is that it involves a table of 7 dimensions (if 7 factors are kept), and only a part of the initial formation is retained. It is necessary to ensure that the most important part of the information has been kept on the factorial space obtained. It is for this reason that we have analyzed Table 3 with Table I as a supplementary table. CA of Table 3 defines only 3 factors, and allows all the information to be maintained. Establishing a transferfunction Starting with the results obtained from CA and the regression analysis, it is possible to establish a transfer function which is represented by thd following formula: S y(j) = aipij + ej Regression analysis The goal of regression analysis (Cazes 1978; Benzecri 1978) is to establish a transfer function that uses the distribution of the species in a sample to determine the approximate value of an ecological parameter. The species play the role of independent variables, and the ecological parameter which is to be estimated is the dependent variable. where: s is the number of species YO) is the estimated value of the parameter y in sample j Pij is the percentage of the species i in the sample j y is the mean of the parameter y for all the samples 88 ej ai ai 1 - Results is the difference between the estimated value and the measured value of the parameter y in sample j is the coefficient of species i 1 b! b2 X/Xl GI (i) + 100 'kI dAX2 pH-indicatorspecies b3 G2 (i) + dAX3 G3 (i) where A1, X2, X3 are the relative inertias, respectively, to the I st, 2nd and 3rd factorial axis constructed by CA of Table 3. Gl (i), G2 (i) and G3 (i) are the coordinates of the species i on the I st, 2nd and 3rd factorial axis from Table 3, respectively. b l , b2 and b3 are the regression coefficients obtained from Table 3 with Table I as the supplementary table. The selection of indicator species is carried out by treating the data with CA. We considered the species percentages and analysed Table 2 followed by Table as a supplementary table (CA-percentage). We eliminated those species which were present in less than 3 samples and those with a percentage of less than 5% in any sample. The CA was then conducted with 245 taxa and 156 samples. Figure shows the projection of the species points and pHclasses on the factorial plane 1-2. The species characterizing each class of pH are situated close to the corresponding pH point. In Table 6, we have selected some of the species which have a strong influence on the definition of the factorial axes. AXIS 2 109 I 3708 4040 4 4 1.L 620 is907 2903 Lo10 12 391291 ty!Ly 2503 o 101 3520 5338 2935 2507 1630 ~93 4501 5325 2630 2630 502 4030 15325 1 \ ~~~~~~~~~1907 \104 5409 34 3910 34 3913742 391 203748 2908 2610 640 1655 3204 4203 3756 2904 3765 1401 0 3201 470 37 32606 5424 EZ. 3810 3938 ~ ,629 ...... ___1 1904 1917 19182611 52- \ ;. 604208 70 370 370139 926 39065 325 3701 11~ ~~~~ 4_____3925 1301 1912 5407 5003 ~~~~~pHCLASSES:~ p5304 pH CLASSES : pH1 5-6.9 pH2 : 7-7.9 pH3 8-8.6 pH4 :8.7-10.9501 8.7-109 pH4 360 : DIATOMSPECIES,NUMERICAL CODE 360 :pH-INDICATOR 2 295 9 362 361 36 366 361 37 37' 37 371 393 394 ~~~~~~~~~~~~~~~~~~~~~~4808 L931 4022 4044 3730 2401 241 1 351 5430 / 3932 4602 ___ ___ 3922 3911 375 5005 4610 904 3654 4036 3507 B2 1403 607 ~~~~~~~~-540142 2305 3728 32 2403 5401 / 1 6 365 / -- AXIS 394 ~~~~~~~~~~4 2 4614 3625 2613 32613 3923 361 \3712 3511 102 2915 3 4017 42 422 42 6-- 4003 393640353935 5417 3608 4240 3518 4224 25 I1 H 2916 3508 3914 1620N 3 3504 2912 3631 3717 3762 3T 1939 01 2930 2612 3501 3501 3647 -l 207 J K360 37i 23~03 30 3731 36 3506 3509 1002 -764 1670\ 3616 901 16351690 0 3940 - 2404 3901 3740 / 5335\ 3671 3916 3512 \ = 335 / 2l'5 302 \302/ 3513 3744 302 3j 13 30 301 / _. 25 _~~~~~~~~~~~0 Fig. 1. Correspondence analysis (CA) based on Table I (initial table, 245 taxa, 156 samples), and Table 2 (supplementary table, 156 samples, 4 pH classes). Projection of the species points and pH classes on factorial plane 1-2. The underlined species are those having an important contribution in the definition of the factorial axes. Their taxonomy is given in Table 6. On the right, the points corresponding to the species indicated in the vertical column were clustered round point pH ,. 89 Table 6. pH-indicator diatoms in East Africa. pH . (4 < U pH pH : < U pH : 5 6.9 Pinnularia graciloides Pinnulariaappendiculata Pinnulariaobscura Pinnulariamicrostauron Pinnulariacardinalis Eunotia praerupta Eunotia af. pseudoveneris Melosira distans v. africana Cymbellafonticola Navicula bryophila Navicula subtilissima Navicula tantula Stauroneis nana Surirella ovata Gomphonema olivaceum Diploneiselliptica near 7 Pinnulariatropica Pinnulariaacrosphaeria Pinnulariaborealis Eunotia lunaris Eunotiaflexuosa Cocconeis thumensis Stauroneis anceps Cocconeis diminuta Nitzschia linearis Nitzschia umbonata Navicula perpusilla Achnanthes minutissima Navicula symmetrica Amphora tenerrima Mastogloia elliptica Pinnulariainterrupta Nitzschia kutzingiana : 7-7.9 Synedra acus + v. Cyclotella stelligera Gomphonema parvulum Navicula salinicola Navicula iranensis Navicula cryptocephala Nitzschia elegantula Nitzschia palea Melosira distans Melosira varians Amphora coffaeaformis Melosira agassizii Fragilariapinnata : 8 8.6 Melosira italica+ v. Melosira granulata Melosira nyassensis Fragilariabrevistriata Fragilarialeptostauron Un U Amphora ovalis Melosira granulatav. angustissima Epithemia zebra Cocconeisplacentula Nitzschia amphibia Campylodiscus clypeus pH near 8.6 Stephanodiscus astraea Rhopalodiagibba Synedra rumpens v. neogena Nitzschia lancettula Nitzschiafonticola Fragilarialapponica Fragilariaconstruens Fragilariapinnatav. trigona Stephanodiscusdamasii Stephanodiscus hantzschii Synedra berolinensis Nitzschiafrustulum Cyclotella meneghiniana Chaetoceros muelleri pH : 8.6-10.9 Anomoeoneis sphaerophora+ v. Rhopalodia gibberula Cyclotella ocellata Epithemia argus Nitzschia subrostrata Nitzschia vitraea Nitzschia sigma Nitzschia estohensis Nitzschia latens Nitzschia pusilla Anomoeoneis costata Navicula damasii Navicula irregularis Navicula elkab Thalassiosirafaurii Thalassiosirarudolfi Cyclotella iris e cn < These species seem to display a clear preference for a given pH or pH class. If they are abundant, and if several taxa characteristic of the same pH class are associated in a sample, they can be considered as good pH-indicators for the investigated area. A CA taking into account the presence-absence of taxa was also carried out. Most of the species situated around the pH l point in Fig. I also characterize this class by their presence. But the effects of pH above 7 do not show up clearly. Many species are able to tolerate a wide range of pH, but the 90 species preference, deduced from the CA percentage, is a more sensitive indicator than the species tolerance. Transferfunction We have carried out regressions using the methods described above, and successively considered presence-absence (CA with principal Table 5, supplementary Table 4) and species percentage (CA with principal Table 3, supplementary Table 1) situations. The coefficients of multiple correlation, calculated by the method of least squares, are R = 0.745, and R' = 0.857, respectively. The best result is obtained by taking into account species percentage. We applied the transfer function established above to the 156 recent samples. The difference between the estimated values and the measured values is <±0.3 in 43% of the cases, <±0.5 in 65% of the cases, <± 1.0 in 90% of the cases. Large differences (<1.2) are observed in only two cases which correspond to the highest measured values (10.3, 10.9) that were underestimated by our calculation. Before using the transfer function on fossil samples the stability of the formula must be tested by applying it to new modern samples. In addition other improvements might be added; we are especially hoping to reduce the number of taxa necessary in its application. Acknowledgements We are grateful to Dr. J. Talling, Professor R. B. Wood, Professor J. Kalff, Dr. P. Kilham, and Dr. F. A. Perrott for having collected and sent numerous recent diatom samples. We would also like to thank Professor J. P. Benzecri for his criticism and comments. This work was supported by the Centre National de la Recherche Scientifique and the Ecole Normale Superieure de Fontenay-auxRoses. References Benzecri, J. P., 1973. La taxinomie. L'analyse des donnees 1. Dunod. Paris. 615 pp. Benzecri, J. P., 1978. Problemes statistiques et methodes geometriques. Cah. Anal. Donnbes 3: 131-146. Benzecri, J. P., 1980. Pratique de l'analyse des donnees. Analyse des correspondances. Dunod, Paris. 315 pp. Bryson, R. A. & Kutzbach, J. E., 1974. On the analysis of Pollen-Climate Canonical Transfer Functions, Quater. Res. 4: 162 174. Cazes, P., 1978. Mthodes de regression Ill. L'analyse des donnees. Cah. Anal. Donntes 3: 385-391. Gasse, F., 1975. L'evolution des lacs de l'Afar Central (Ethiopie et T.F.A.I.) du Plio-Pleistocene al'Actuel. Thesis, University of Paris 1: 390 pp., 3: 59 pl. Gasse, F., 1977. Evolution of Lake Abhe (Ethiopia and T.F.A.I.) from 70 000 b.p. Nature 265: 42-45. Gasse, F., 1980. Late Quaternary changes in lake levels and diatom assemblages on the Southeastern margin of the Sahara. Paleoecology of Africa 12: 333 350. Gasse, F., 1983. Diatoms from East Africa. Bibliotheca Phycologica, in press. Gasse, F. & Tekaia, F., 1979. La paleoecologie des diatomees. Cah. Anal. Donnees 4: 81-94. Gasse, F. & Tekaia, F., 1982. Tentative definition, comparison and interpretation of fossil diatom assemblages from Eastern Africa. Proceedings of VIth Symp. on Recent and Fossil diatoms, Budapest, 1980. Acta Geol. Ac. Scient. Hung, 25: 135-140. Harvey, T. K., 1976. The paleolimnology of Lake Mobutu Sese Seko, Uganda-Zaire: the last 28 000 years. Ph. D. Thesis, Duke University, U.S.A. 103 pp. Holdship, S. A., 1976. The paleolimnology of Lake Manyara, Tanzania; a diatom analysis of a 56 meter sediment core. Ph. D. Thesis, Duke University, U.S.A. 112 pp. Kilham, P., 1971. Biogeochemistry of African lakes and rivers. Ph. D. Thesis, Duke University, U.S.A. 199 pp. Kipp, N. G., 1976. New transfer function for estimating past sea-surface conditions from sea-bed distribution of planktonic foraminiferal assemblages in the North Atlantic, Geol. Soc. Am. Mem. 145: 3-41. Lebart, L., Morineau, A. &Tabard, N., 1977. Techniques de la description statistique, Dunod, Paris. Richardson, J. L., Harvey, T. J., Holdship, S. A., 1978. Diatoms in the history of shallow East African lakes. Pol. Arch. Hydrobiol. 25: 341-343. Richardson, J. L. & Richardson, A. E., 1972. History of an African Rift lake and its climatic implications. Ecol. M onog. 42: 499-534. Roux, M., 1979. Estimation des paleoclimats d'apres l'ecologie des Foraminifres. Cah. Anal. Donnees 4: 61-79. Tailing, J. F., 1976. Phytoplankton: composition, development and productivity. In: J. Rzoska (ed.). 'The Nile, Biology of an ancient River', Junk, The Hague. pp. 385-402. TaIling, J. F. &Tailing, . B., 1965. The chemical composition of African lake waters. Int. Rev. ges. Hydrobiol. 50: 421-463.
© Copyright 2026 Paperzz