volume 14 Number 2 1986 Nucleic Acids Research Nonepidermal members of the keratin multigene family: cDNA sequences and in situ localization of the mRNAs Bernhard Knapp, Martin Rentrop, JOrgen Schweizer and Hermelita Winter Institute of Experimental Pathology, German Cancer Research Center, Im Neuenheimer Feld 280, 6900 Heidelberg, FRG Received 22 October 1985; Revised and Accepted 13 December 1985 ABSTRACT A keratin set which consists of a type I 47kd and a type II 57kd protein occurs as a major constituent of the keratin patterns of various Internal stratified epitheHa of the mouse. We have Isolated specific cDNA clones of the two+complementary keratin subunits from a cDNA library constructed with polyA RNA of mouse tongue epithelium and present the complete nucleot1de and deduced amino a d d sequences of the 57kd protein and about 75% of the corresponding data of the 47kd protein. The comparison of the sequence data with those of known epidermal keratin mRNAs coding for the two types of keratin proteins reveals a fundamentally Identical and type-specific organization of the mRNAs Into both highly conserved and variable domains. In order to avoid cross-reactions wtth other members of the keratin multigene family, appropriately taylored S-labeled cDNA probes comprising the low and non-homologous 31 coding and noncoding domains of the mRNAs were used for 1n situ hybridization to tissue sections. The localization and distribution of the corresponding transcripts Indicates a strongly compartmentalized keratin expression 1n mouse tongue epithel1um. INTRODUCTION Keratins, a multigene family of water-Insoluble proteins are the constituent proteins of Intermediate filaments (IF) present 1n almost all epithelial cells. The complexity of the keratin polypeptide patterns Increases from simple to stratified non-kerat1n1z1ng tissues and 1s most complex 1n kerat1n1z1ng terminally differentiating epithelia (1-5). There 1s strong evidence that at least two complementary, oppositely charged keratin proteins are required for keratin IF assembly 1n vivo (6,7). Recently, sequence data from both cDNA and genomic clones of epidermal keratin proteins have extended our knowledge on the structural organization of this class of IF proteins (8-20). There 1s now general agreement that both addle type I proteins and basic to neutral type II keratin proteins possess a common structure 1n which a highly conserved central a-helical domain 1s flanked by rather variable non o-hel1cal carboxy and amino termi- © IRL Press Limited, Oxford, England. 751 Nucleic Acids Research nal regions. At the nucleotide level, the mRNAs of both types of keratins within one species reveal a high degree of divergence within their 3' and (as far as determined) 5' noncoding regions (12). These data stem rather exclusively from epidermal keratin subunits of different species. In the present paper we report on the Isolation and characterization of cDNA clones of nonepidermal members of both keratin subfamilies. The complementary type I 47kd and type II 57kd proteins are coexpressed 1n Internal, stratified mouse epithelia and represent the main keratin subunits of palate, cheek (21), esophagus, forestomach (22) and tongue (23). Comparisons with presently known sequence data provided evidence of generally Identical structural properties of keratin proteins 1n epithelia displaying different forms of cell differentiation and enabled us to prepare highly specific cDNA probes for the study of details of epithelial differentiation by means of 1n situ hybridization. METHODS Cloning of cDNAs and screening procedures PolyA+RNA from heat-separated mouse tongue epithelium was Isolated as described (24). Double-stranded cDNA > 600 bp was prepared from 10 ug of polyA+RNA (25), blunt-end Ugated Into pUC8 plasmid which was used to transform E. coll 0M 214 cells (26). The l i b r a r y was screened for keratin cDNA clones (27) with 32P-labeled enriched mRNAs (26,28) coding either for the 57kd or the 47kd keratin protein. Clones strongly hybridizing with one of the enriched keratin mRNAs were selected and screened by positive hybrid selection (26). The clones pkt57-l and pkt47-l containing the largest Inserts were selected for further investigation. All experiments were carried out according to the guide-lines for recombinant DNA published by the "Bundesm1n1ster fur Forschung und Technologie" of the Federal Republic of Germany. RNA blot analysis Ten ug polyA+RNA from different tissues were electrophoresed 1n 1.4% agarose gels containing 15 mH methylmercury hydroxide (28), transferred to nitrocellulose f i l t e r s (29), and hybridized overnight at 42"C (50% formamide) with the 32 P-labeled Inserts (30) of pkt57-l and pkt47-l. Blots were washed at 68"C 1n 0,lxSSC/0,ll SDS and autoradiographed. DNA sequence analysis Restriction site mapping was carried out according to Smith and B1rnst1el (31). 752 Suitable restriction fragments were subcloned Into the H13 mplO and Nucleic Acids Research mpll vectors (32), and sequencing was performed by the dideoxy chain terminator method of Sanger et al. (33). Both strands of the two clones were completely sequenced. The translation of the two cDNA clone sequences Into amino a d d sequences, the nucleotide and protein sequence comparisons, and molecular weight determinations were performed using the programs of Osterburg et al. (34). For the prediction of the secondary structure of the two keratin proteins both, the probabilistic methods of Chou and Fasman (35,36) and Gamier et al. (37) as well as the heptade convention analysis (10,38,39) were used. In situ hybridization In situ hybridization to tissue sections was performed according to the method described by Lawrence and Singer (40) with minor modifications. Briefly, 5 |jm frozen sections of mouse tongue were hybridized with Slabeled cDNA probes (specific activities 3-4x10 cpm/ug). As a control for the specificity of hybridization tissue sections were treated with RNAse (100 ug RNAse A and 100 units RNAse Tl/ml) prior to hybridization. After hybridization slides were washed and dried by passage through ethanolseries. Subsequently slides were dipped 1n Kodak NTB2 emulsion and exposed for 4-21 days at 4*C. Slides were developed in Kodak D19 and sections were stained with hematoxyl1n-eos1n. Details of the 1n situ hybridization technique will be published elsewhere. RESULTS AND DISCUSSION Isolation and identification of cDNA clones of the 57kd and 47 kd keratin proteins A cDNA library was constructed with po1yA+RNA from mouse tongue epithelium and screened with enriched mRNA probes coding for the 57kd and 47kd proteins. These probes were obtained by size fractionation of polyA+RNA from tongue epithelium using methylmercury hydroxide-agarose gels. Strongly positive recombinants Identified with these mRNA probes by means of colony hybridization were further screened by positive hybrid selection (results not shown). These screening procedures yielded the plasmids pkt57-l and pkt47-l with Inserts of about 2200 bp and 1300 bp, respectively. Size of mRNAs of the 57kd and 47kd keratin proteins and tissue-specific expression The size of the mRNAS coding for the two keratin proteins was determined by Northern blot analysis. Methylmercury hydroxide-agarose gel fractionated polyA+RNA from mouse tongue epithelium was hybridized under stringent con- 753 Nucleic Acids Research b c d ttb —4,8- X pkt 57- 1 Pkt 47-1 F1g. 1 . Identification 1n different tissues of mRNAs coding for the 57kd ana 47kd keratin proteins. Ten |jg of polyA RNA from mouse tongue epithelium (slots a ) , forestomach epithelium (slots b ) , footpad epidermis (slots c) and l i v e r (slots d) were separated by gel electrophoresis on 1.4% agarose gels containing 15 mM methyl mercury hydroxide. The RNA was blotted onto nitrocellulose paper and the blots were hybridized with T-labeled Insert DNAs of plasmid pkt57-l and pkt47-l, washed at 68'C with 0,lxSSC/0.1% SDS and autoradiographed. The 16S/23S and 18S/28S rRNAs (Indicated 1n kilobases) were run on the same gel as size markers. ditions with 32P-labeled pkt57-l and pkt47-l Inserts (F1g. l a ) . Each probe hybridized with a single mRNA species containing 2250 ± 100 and 1750 ± 50 bases, respectively. As expected (22), mRNAs corresponding to tongue mRNAs could be Identified when polyA+RNA from mouse forestomach epithelium was subjected to blot analysis (Fig* l b ) . In contrast under the highly s t r i n gent hybridization and wash conditions, epidermal and l i v e r mRNAs do not show cross reaction with pkt57-l and pkt47-l Inserts (F1g. 1 c,d). cDNA sequences of pkt57-l and pkt47-l The nucleotide sequence of the pkt57-l Insert, which represents the f u l l length transcript of the mRNA 1s shown 1n Figure 2. The size of this single Insert Is 2205 nucleotides. I t Includes the complete 3' end and extends far (117 nucleotides) Into the non-coding 5' end of the mRNA. Thirty five nucleotides upstream from the ATG i n i t i a t i o n codon appears a putative ribosome binding consensus sequence CTTCTG which 1s frequently encountered 1n eukaryotic mRNAs (41,42) and has also been described 1n the mRNA coding for the human 50kd keratin (14). The non-coding 3' end of the sequence comprises 492 nucleotides and contains a polyadenylation signal AATAAA (43) 754 Nucleic Acids Research 1 CCT OCC ATA OCC OCC OOC TTO OCA OGA CCC AOC CAC OCT CAT Q M CAT ACT CTT GTO ACC OCC GAO OTT OTA AAQ rt ACT TCT QTT CTA AOC fCC I TO CAO CTO CCT CCT CTC TCA OCC ATQ ATC CCC AOA CM] TCA AQT OTC CQC OCA OCC N J W TCC ATO AOC CTO OCT COS TCC TOC CAA OOC OOC OOC TAT OOO QOT OCC OCA OOC TTC OOC GTT OOA OOA TAC OCT tor last tor Va I Ala Ol f toi Cya Ola Qly Oly Cty Tyr Oly Oly Ala Oly Ql r Ptv« Cly V*l Qly Qly Tyf Qly i r > OCT aoo TTT OUT OCC oar oar r r r oac oar oac m OQA OQC TCC TTC AAC oai OCA OCA oar ccr aac r r c CCA A l * Qly 4M arc Val pfc* Qly A l a Oly Qly P*« aiy dy TOT CCT OCT OOA oar ATT CA<I QAA arc Cy« P r * A l a Q l y O l y M a O l a O l * V a l Qly Pi»* Qly Qly tor Ptw A«a Qly M X ATC AAC CAQ AOC CTO era Thr I I * A m a i a tor L*a L * * Afg Qiy aty Pr» Oly PW Pr» ACA OCT e r r c*o era OAO ATT OAC Thr P r a L * - * Qlf> V a l O l a I t * A a * H lit IM •-Hit •It CCA QAQ ATC CAO AAA ATC COC ACC OCG CAO COT QAO CAO ATC AAQ ACC CTC AAC AAC AAA TTC OCO TCC TTC ATC P r a O l * I I * Q l a Lys I I * A r § Thr A l a Q l a A r | Glw Q l " l l a L y * Thr L a * Aaa Aaa Lya P M A l a Ear P * * I I * TCI AOC AAT OAC AAA OOT COC CTA CAO TCA OAC CTA AAO ATO ATQ CAO OAC AOT CTQ OAO OAC TTC A M I ACC AAO TAT tor A M Aaai Lya Q l y A*m * • • • O l * Bar Qlw L*a Lya bhit ktol Q l * A*# t * r V a l Q l a A * p P h * L y a Thr Ly» Tyr OAA QAO OAO ATC AAC AAO COT ACA OCT OCA OAO AAT OAC TTC QTO OTC CTC AAO AAA OAT QTO OAT OCT OCC TAC • r « oco QAQ era ace CAQ ATO CAO ACA CAT arc TCA OAC ACA TCT OTA QTO era TCC ATO OAC AAC AAC COO AAC era A l a Q l * La« A l a O l a tm\ O I B Thr H t a V a l Sar A«# Thr t a r V a l V « l L*a %•! hail Aap A « J I A « M AI% A M L * « H i t TCC TOO TAC C M ATC AAQ OTC CAO CAO CTC CAO ATO TCA OCT GAC CAA CAT OQA OAT AOC CT« AAO ACC ACC AAO II** AAT QAO ATC TCA OAA CTC AAC AOO ATO ATC CAO AOQ CTQ COO OCA OAfl ATT OAfl AAC ATC AM) AAO CAO AOC CAQ 1*M ACT CCO C M OCA TCT OTO OCT OAT OCA OAO C M COT OQA O M CTC OCC CTC AAA OAT OCC TAC AOC AAA OOC OCA D M OAA CTQ OAC ACT OCA CTO CAO AAQ OOC AAO OAO OAC CTO OCC COO CTO CTC AflQ OAC TAC C M OCT CTC ATO AAC « . . L . * O l - T»r A l a L a - Q l - L y . A l a L y . O l * A « L a , A l a A T , L « L - A,, A * . Ty. O l . A l . L . u ^ t o ^ A « Iltt QTC AAO CTQ OCC CTO OAT OTO OAO ATT «0C ACC TAC AQO AAO CTQ CTO OAO OOC OAO QAQ TOC AOO ATO TCT OQA V a l L y * L*a A l a L*a A * » V a l O l a I I * A l a Thr Tyr A j f Lya L * « L*a Q l a O l y O l * O l * Cya Ax» a t * l t « r O l y \$— OAA TOC AAO AST OCT OTO AOC ATC TOO OTA OTT OOC OOC AOC CAO CAT TOO OQC ACT OOC CTT COC CTO OQT AOC t\\ 1 t T » OOT. TTT TOC TCC OQT TCT OQT TCT QOA AOT OOC TTT QQA TTT QOT QOT OOC ATT TAT OOC OQT TCT OQT TCC AMI 1 T 2 i AOQ QTC AOC OQT TCC TOA TAT CTC CTQ TOC OAT CTT CAC TTC ACC TCC ACT CTT CCT OOT OCT CAT CTT QTT ACT 1|M CTT CCC CCT CTQ CCT TOO TCC CTC AOT TCC AOQ AOQ ATC TTC TTO QCC ATO OTO TOA CCT CTO TTC TAT TOO AAC liri CTQ OTA OCT TCT TCT CAA CTT OQO TCA ACT OOC CCC CCA CCC CCT CCA TOC GAO OOO TOT CTO ATT TCA CCC CAA 1IU TOO ACA QAO OOO ATC AAO AOC AOA AAC TCC TTC TOC AOA TOT ACA CTO TCA CTC TCA OCT CCT CTC OTC ACA COO Itlt OCT TCC ACC TCA TCC TOO AOA TCT ATO TTC TAC AAT CAT OCT CAT TCT CCA TCA CCT OOC AOC AAC OOQ CTC TOC 1 1 M CAQ OTC AAC AAQ OAC TOC TTO OTQ TCC COT OCA ACC CTO CTC* CTC TCT CTA CTC ATC CCC AAT AAA ATO CTT TTO JIT* TCA TTC AAA AAA AAA AAA AAA AAA AAA AAA F1g. 2. Complete nucleotide sequence of the pkt57-l Insert and deduced amino a d d sequence of the 57kd keratin protein of mouse tongue epithelium. The nucieotide sequence 1s shown 1n the 51 to 3' direction of the mRNA strand. Nucleotides are numbered from the first base of the cDNA Insert at the left side and amino a d d s are numbered at the right side. The stop codon Is marked by three asterisks. Arrows denote the deduced demarcation of the central o-hel1cal domain. HI, VI, N and H2, V2, C Indicate the proposed threefold subdivision of the non-a-hel1cal end domains (19). Both the putative polyadenylation signal and the putative ribosome binding consensus sequence are underlined. 755 Nucleic Acids Research I GIG A M I ATT CCT G A C TGG C A T C T C AAA CAO AGC CCA OCT ACT C C A O A O COG G A C TAC ACT OCT TAT TAT A A G ACC t% ATC G A C G A C C T C C O G A T C AAG A T T C T C QAA G C C ACC A C O GAC AAC AAC C O O ATC A T C C T O G A G ATT GAC AAT O C C II* Gto Qlu Lau Araj lla Lyi II* L«u Gl w Ala Thr Thr Asp Asn Asn Arg II* Its Lau Olw lla Asp A a n Ala SO IbO AGG CTG OCT C C A O A T G A C TTC A C C CTC AAG TAC GAA AAT GAG C T Q A C C TTO COC C A O A O C Q T G G A Q G C C O A C ATC Af s L s u AI a AI a A s p f^ 9P P^i t Ar s Lau Lys Tvf QI u A sftGI u L su Thr Lf u Af s1 OIft^s f Va I GI u AI a A s p lla 22fc <VSC OOC CTG COC AOfi G T G C T G G A T OAG CTO ACT CTG OCT WC1 ACT O A C C T O GAA A T C f*ftQ ATT O A C AQC C T G AAT A«n Gly Lau Art Af s Val Lay A s p Glu Ltu Thr Ltu Ala Lya Thf Asp Law Qlu las I G i n lla O l u •*• Lau A»n i GQC C A O GTC Gly G i n Val 37* AAT G T G GAG ATG O A C O C C ACC CCT OOC ATT OAT CTG ACC CGT G T G C T G GCT GAG ATG AGG G A G C A O TAT GAA OCC Aan Val Glu ••••• t A s p A l a Thf Pro Glv M a Asp Lau Thr Af s Val Lau Ala Glu y § t Ar sj O l u G i n Tyr G l u Ala 460 C T Q GCA GAO AAC AAT C O G A G G G A T GCT GAG GAA TGG TTC C A O A C C A A C AGT O C A C A O C T C AAC A A O GAA O T A TCC Lau Ala Glu Lya Asn Af aj Af % A s p Ala Glu Glu Tfp Pha Gin Thr Lys tar Ala G l u Lau Asn L y s Glu Val fin ft?& TCC AAC GCT GAA A T G A T C C A G A C C AOC AAG ACA GAO ATC ACA GAA C T C A G O C G C ACA C T C C A O O G C C T O G A O ATT S a r A s n A l a G l u M s I l l a G i n T h f b a r L y s T h f G l u l l a T h f G l u L a u A f sj A i l T h f L a u O l n G l ^ L a u G l u l l a ; AAA GCC OGC CTQ GAA ACT ACQ TTG OCA GAO ACA GAG TGC COC TAC GCC CTG Lya Ala Gly Lau Glu Gar Thr Lau Ala Glu Tbr Qlu Cya Arf Tyi Ala Lau *S too 125 130 175 2OO 223 TbO CAO AAC CAG GAA TAC AAO ATG CTO TTG G*C ATC AAA ACA AGO CTG QAA CAO GAC ATC QCC ACC TAC COC ACC CTG Qln Aan Gin Glu Tyi Lya Mat Lau Lau Asp lla Lya Thr Arg Lau Glu Qln Glu lla Ala Thr Tyr Arf Bar Lau 275 • 2S CTA GAO GGC CAO CAT OCT AAC ATG ACC GCC TTC AAC TCC OOA GCA AAT AAC ACT ACT ACC TCC AAC OCC TCC CCC Lau Glu Qly Gin Aap Ala Lya Mai Thr Qly Ptta Aan Car Gly Qly Asn Asn Thr Thr Thr Bar Aart Gly Gar Pro 300 • 00 TCC TCC AAT TCC OGT COC CCA OAT TTC CCA AAO TAT TAA CTC ACC CTO GCO TCC CTT CCC TGG CCT TCA GCA GCC • 7B ACA GOA GQA GAG CTO ATG TCO CCT QTA OQA TGA GAG OAC TOC ACC ATO TOA TCC CCA GTC CCA TTO TCC CCC CAG lOaO GAC CCA TGG CTC TGA CAA CCC CTG AO0 TGG OCC CTT CCC CAT CTC TTC TGA TTG GAA TCT TTC TTC TCC CTA CCC I12§ CAT TCT CTC TAT TCT TTO GCT TCT CAG TGT CTC TAC AAC AAA GAA JM3 ATO OTT TCA ATA AAC TTC TCT OAT TTC 1200 CTO GAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA I2T& AAA AAA AAA AAA AA Fig. 3. Nucleotide sequence of the pkt47-l Insert and deduced amino a d d sequence of the 47kd keratin protein of mouse tongue epithelium. The stop codon, the end of the o-heiical domain and the putative polyadenylation signal are Indicated as described 1n legend to Figure 4. fifteen nucleotMes upstream from the polyA tall (A24K The polyadenylation signal 1s followed by a ATGC-sequence occurring at the same position 1n the mRNAs of the type II mouse 60kd (11, see also F1g.5) and bovine 69kd keratin proteins (13). The central 1572 nucleotides represent the coding region for the entire length of the 57kd keratin protein. The nucleotide sequence of the pkt47-l Insert 1s shown In Figure 3. The size of this Insert 1s 1289 nucleotides, thus comprising approximately 75X of the calculated length of the mRNA. The reading frame reveals a stop codon TAA which divides the sequence Into a coding region (936 nucleotides) and a 3' non-coding region (268 nucleotides) containing a putative polyadenylation signal AATAAA followed 17 nucleotides downstream by a 85 nucleotide long polyA tract. These sequence Informations supplement the 11st of currently available 756 Nucleic Acids Research 57kD 1A 47KD >, MO jjo 2A V c 2S L2 « 4 0 «4 w F1g. 4. Structural organization of the 57kd (upper part) and the 47kd (lower part) proteins. N and C Indicate the non-o-hel 1cal atnino and carboxy terminal regions. 1A, IB, 2A and 2B represent the colled coll regions and L I , L12 and L2 the non-co1led coll linker regions of the a-hel1cal domains. [Adoption of the nomenclature used by Steinert et a l . (11)]. The numbers denote (from l e f t to right) the entering positions of the Individual structural segments. NO = not determined. sequence data of epidermal keratins by one representative nonepidermal member of the type I and type I I keratin multigene family. Structural organization of the 57kd and 47kd keratin proteins The 57kd protein comprises 523 amino adds 1f the I n i t i a l meth1on1ne residue 1s excluded (Fig. 2). The calculated molecular size of 56,429 Da (without the unknown phosphate content) 1s 1n good agreement with the size estimated by gel electrophoresis (21). The amino add sequence of the 47kd protein (Fig. 3) could be deduced over a length of 312 amino adds and comprises the carboxy terminal part and almost the complete a-hel1cal region. Secondary structure analysis (10,35-39) of both nonepidermal proteins revealed a high conformity with the structural organization elucidated so far for epidermal keratin proteins. This 1s especially relevant for the typical subregions of the central a-hel1cal domain Into four colled coll regions and three short linker regions which cannot adopt colled colls (l9;F1g.4). In addition both the non a-hel1cal N and C termini of the type I I 57kd protein can be divided Into three distinct subdomains according to the natures of their amino add sequences (19). These comprise highly conserved H subdomains on either side of the rod domain, glydne and serine rich V subdomains of varible size and sequence and highly variable, usually basic N and C termini (F1g.2). In contrast these structural particularities cannot be detected 1n the C terminus of the type I 47kd protein (F1g.3). The entire region encompasses only 29 amino add residues. I t 1s therefore the shortest type I protein carboxy terminal part determined so far and confirms the prediction (19) that especially small type I keratin proteins may lack the substructuration of their non a-hel1cal end domains. 757 Nucleic Acids Research H i T^TCOCCAOk—CA—OTCA*OTOTCCQCOtiMKCTCCCCOOOCTTI*Ca>CTClCKTC*arr*TIOCTOQTOCICtCA*3CG«Ia>XTTT*OCT^^ Q--«ar--<xKM*o«TTCQocoTTa(MoaAT«3MTacToaaTTTOOToccaa^ -CTOOCOO JIT TATOO !•• C TQOCT r*Toocj^Kca*TTCocwoo**orrrci»c*TTOCToa-T--oa»ociOQ—TAOTaacTTTaocTtcoaTGGtoa—AOC T QOC TTTTOOTOQTOOCTATOQOQCWOCTOOCT TOOTOQTOOCT ATOOOOTWOC TOOC TTCCCO ;TTCCCA TtMATCCT T TCTT TQ*QACT TACATCAACOCC T TOMOAAQMCC ro6*OM: T rrCAOATOCAAACTCACATCTC*a^AC»TCTOTOOTCCTCTCCATOOTCA*CAACCaTAOCCTC r ^ TTOgc-aMTGOCCTTQOCCtQd»TAOC(KlriII<^IO-rtnTICICQT1CTO(y^OT(^ f * CACCTCCAO--ICTTCCTOQt-aCTCA--rcrTO--TT«rCTTCCCCCTCtaCCTTGG-TCCCICA—OIICC--*CGACOA--ICt-'CrTCCCCATQa-tCTQACCT-<:TaiTCTAT CACCAAaA*KtTQTCTCTOCTCCCAO*TQTCAT<>^T(kUCCTCAACCA<>TCCTn —TCCTTCT TCTQACTTCClCCAQMnCIClA^TJULtfclQCCCCCACAACAAAC • OTCAArAAOOACTQeTTCCrQTCCCOTOCAACCCTQCICCICTCTCTACTCATCCC^T11^-• Fig. 5> Comparison of the nucieotide sequences of the mRNAs coding for the nonepidermai 57kd keratin protein and the mouse epidermal 60kd keratin protein (11). Large arrows Indicate the positions of the region coding for the o^heHcal domains, small arrows denote the ends of conserved regions coding Tor the non a-hel1cal H subdomains (compare F1g. 2 ) . I n i t i a t i o n and stop codons and the putative polyadenylation signals are In boxes. Asterisks Indicate conserved nucleotides. Sequence comparisons of nonepidermai and epidermal keratin mRNAs mRNAs which code for epidermal type I and type I I keratin proteins of mouse and man (8-11, 14-19) were compared with the mRNAs of the nonepider758 Nucleic Acids Research mal 47kd and the 57kd keratin proteins. In each case, nucleotide sequences with the highest degree of sequence Identity correspond to those coding for the <*-hel1ca1 domains of the proteins. The calculated values are 1n the order of magnitude of 70-80% and therefore ressemble those determined for epidermal keratin mRNAs coding for the same type of proteins (12). Figure 5 which shows the alignment of the 57kd protein mRNA and the mRNA of the mouse epidermal type II 60kd protein (11) 1s representative for these multiple comparisons. In this case the sequence Identity of the a-hel 1x-cod1ng region 1s 75%. In addition the alignment reveals highly homologous regions of constant lengths Immediately upstream (108 nucleotides; sequence Identity 70%) and downstream (39 nucleotides; sequence Identity 53%) of the a-hel1xcoding region. These sequences are common to all known epidermal type II keratin mRNAs (11,16,19) and code for the non a-hel1cal H subdomains of this keratin protein subfamily (F1g. 2 ) . They are absent from both type I epidermal (10,17-19) and the type I nonepidermal keratin mRNA (F1g. 3 ) . In contrast both the lengths and the sequences of the remaining 5' and 31 regions, encompassing coding as well as noncoding sequences of the mRNAs are highly variable, unless functionally related epidermal keratin mRNAs of different species are compared (12). However, Figure 5 shows that the 5' end regions of type II keratin mRNAs are slightly more conserved than the 3' end regions. Occasionally they contain rather extended homologous nucleotide sequences I.e. positions 254-278 of the 57kd protein mRNA which code for a characteristic type II protein octade (11, 16, 19). As a rule, this 1s not the case for the 3'end regions which at most exhibit rare stretches of 4-6 conserved nucleotides and therefore seem to be most specific for any given keratin mRNA. Besides the demonstration of fundamentally Identically organized keratin mRNAs 1n epithelia showing different forms of cell differentiation, these comparisons are of paramount Importance 1n view of the generation of specific cDNA probes for 1n situ hybridization experiments. cDNA probes containig sequences complementary to the 3' non-homologous end regions of a keratin mRNA should exclude the danger of crossreactivity and therefore be Ideally suited to Investigate the specific cellular location of the corresponding mRNA. In situ hybridization Figure 6 Illustrates the results of the 1n situ hybridization to sections of adult mouse tongue using a 35S-labeled 553bp HhaI-H1ndII fragment derived from the 3' region of the pkt57-l Insert (nucleotides 1553-2106, 759 Nucleic Acids Research Fig. 6. In situ hybridization of the 553 bp 35S-labe1ed Hha I - H1nd II fragment of the pkt57-l Insert. The probe was hybridized to a sagittal section of the ventral side (a) and to a sagittal (b), frontal (c) and tangential (d) section of the dorsal side of adult mouse tongue. Similar results were obtained with a specific probe of the 47kd Insert. Control sections treated with RNAse prior to hybridization showed only background noise. Competition experiments with Increasing amounts of unlabelled specific cDNA fragments abolished the label, whereas unrelated DNA was Ineffective. BC, basal cells; SC, stratum corneum; CT, connective tissue; IP, Interpapillary region; FP, filiform papillae; DP, dermal papillae (H&E; x200). I.e. containing parts of the low homologous terminal coding region and the noncoding region without the polyA tract; F1g. 2 ) . Figure 6a shows a sagittal section of the ventral side of the tongue which 1s covered by a conti- 760 Nucleic Acids Research nuous, multilayered epithelium (44). It can easily be seen that the label 1s restricted to the epithelial cells of this tissue with negligible background density of silver grains over non-ep1thel1al cells. It should be emphasized that treatment of the sections with RNAse prior to hybridization did not lead to any detectable label 1n the tissue. Whereas most basal cells show a level of hybridization that 1s not significantly above background noise, the entire suprabasal compartment of living cells 1s heavily labeled. The highest density of silver grains 1s found over the spinous cell layers. The label gradually decreases 1n the granular cell layers and 1s absent from the dead stratum corneum. Unlike the ventral epithelium, the dorsal epithelium of the tongue shows a rather complex morphology 1n that numerous papillae - mainly of the filiform type - are distributed 1n parallel rows within an 1nterpap1llary epithelium (44,45). In situ hybridization with the same specific probe reveals that the corresponding mRNA 1s abundantly present 1n the suprabasal Interpapillary epithelium, however, again absent from Its basal cell layer as well as from the entire filiform papillae (F1g. 6b-d). As observed for the morphologically related ventral epithelium, there 1s a clear cut decrease of the suprabasal 1nterpap1llary label from the spinous to the granular cell layers (best visible 1n F1g. 6d). Whether the progressive death of cells at the transition from the granular to the cornified layer correlates with a simultaneous decrease of keratin gene expression or 1f there exists a temporal difference between cell death and cessation of keratin expression remains to be Investigated. Modulations of epithelial keratin mRNA expression by certain drugs (46,47) and the localization of these alterations at the cellular level by 1n situ hybridization with specific cDNA pobes may be a promising approach to study elements which control the differential expression of this multigene family. ACKNOWLEDGMENTS We thank H.P. Schmitt and A. Alonso for helpful advice and discussion and C. Frees for skillful technical assistance. We are Indebted to E. Aratmann for facilities with methylmercury hydroxide-agarose gel electrophoresis. This work was supported by the Deutsche Forschungsgemeinschaft (W1 489/2-1). REFERENCES 1. Moll , R., Franke, W.W., Schiller, D.L., Geiger, B. and Krepler, R. (1982) Cell 31, 11-24. 2. Tseng, S.C.G., Jarvinen, M.J., Nelson, W.G., Huang, J.W., WoodcockMitthell, J. and Sun, T.-T. (1982) Cell 30, 361-372. 761 Nucleic Acids Research 3. Wu, Y . J . , Parker, L.M., Binder, N.E., Beckett, M.A., Sinard, J . H . , G r i f f i t h s , C. T. and Rheinwald, J.G. (1982) Cell 3 1 , 693-703. 4 . Sun, T . - T . , Eichner, R., Nelson, W.G., Tseng, S.C.G., Weiss, R.A., J a r v i n e n , W. and Woodcock-Mitchell, J . (1983) J . I n v e s t . Dermatol. 8 1 , 109s-115s. 5. Schweizer, J . , K1njo, M., Fiirstenberger, G. and Winter, H. (1984) Cell 37, 159-170. 6 . Franke, W.W., S c h i l l e r , D.L., H a t z f e l d , H. and Winter, S. (1983) Proc. N a t l . Acad. Sc1. USA 80, 7113-7117. 7. S t e i n e r t , P.M., Jones, J.C.R. and Goldman, R.D. (1984) J . Cell B1ol. 99, 22s-27s. 8 . Hanukoglu, I . and Fuchs, E. (1982) Cell 3 1 , 243-252. 9 . Hanukoglu, I . and Fuchs, E. (1983) Cell 33, 915-924. 10. S t e i n e r t , P.W., R1ce, R.H., Roop, D.R., Trus, B.L. and Steven, A.C. (1983) Nature 302, 794-800. 1 1 . S t e i n e r t , P.M., Parry, D.A.D., Racoosin, E.L., I d l e r , W.W., Steven, • A . C , Trus, B.L. and Roop, D.R. (1984) Proc. N a t l . Acad. Sc1. USA 8 1 , 5709-5713. 12. Jorcano, J . L . , Rieger, M., Franz, J . K . , S c h i l l e r , D.L., M o l l , R. and Franke, W.W (1984) J . Mol. B1ol. 179, 257-281. 13. Jorcano, J . L . , Franz, J.K. and Franke, W.W (1984) D i f f e r e n t i a t i o n 28, 155-163. 14. Marchuk, D., McCrohon, S. and Fuchs, E. (1984) Cell 39, 491-498. 15. Johnson, L . D . , I d l e r , W.W., Zhou, X., Roop, D.R. and S t e i n e r t , P.M. (1985) Proc. N a t l . Acad. Sc1. USA 82, 1896-1900. 16. Tyner, A . L . , Eichman, M.J. and Fuchs, E. (1985) Proc. N a t l . Acad. Sc1. USA 82, 4683-4687. 17. Marchuk, 0 . , McCrohon, S. and Fuchs, E. (1985) Proc. N a t l . Acad. Sc1. USA 82, 1609-1613. 18. K r i e g , T . M . , Schafer, M.P., Cheng, C.K., F i l p u l a , D., F l a h e r t y , P., S t e i n e r t , P.M. and Roop, D.R. (1985) J . B1ol . Chem. 260, 5867-5870. 19. S t e i n e r t , P.M., Parry, D.A.D., I d l e r , W.W., Johnson, L . D . , Steven, A.C. and Roop, D.R. (1985) J . B1ol. Chem. 260, 7142-7149. 20. Rieger, M., Jorcano, J . L . and Franke, W.W (1985) EMBO J . 4 , 2261-2267. 2 1 . Schweizer, J . , W i n t e r , H. f M i l , M.W. and Mackenzie, I . C . (1984) D i f f e r e n t i a t i o n 2 6 , 144-153. 22. Schweizer, J . and Winter, H. (1983) J . B i o l . Chem. 258, 13268-13272. 23. Franke, W.W., Schmid, E., S c h i l l e r , D.L., Winter, S . , Jarasch, D.E., M o l l , R., Oenk, H . , Jackson, B.W. and Ilmensee, tU (1982) Cold Spring Harbor Symposium on Quantitative Biology, V o l . XLVI, 431-453. 24. Schweizer, J . and G o e r t t l e r , K. (1980) Eur. J . Biochem. 112, 243-249. 25. Wickens, H.P., B u e l l , G.N. and Schimke, R.T. (1978) J . B 1 o l . Chem. 253, 2483-2495. 2 6 . Man1at1s, T . , F r i t s c h , E.F. and Sambrook, J . (1982) Molecular c l o n i n g . A Laboratory Manual. Cold Spring Harbor Laboratory. 27. Grunstein, M. and Hogness, D.S. (1975) Proc. N a t l . Acad. Sc1. USA 72, 3961-3965. 2 8 . B a i l e y , J.M. and Davidson, N. (1975) A n a l . Biochem. 70, 75-85. 29. Thomas, P. (1980) Proc. N a t l . Acad. Sc1. USA 77, 5201-5205. 30. Rigby, P.W.J., Dieckmann, M., Rhodes, C. and Berg, P. (1977) J . Mol. B1ol. 113, 237-251. 3 1 . Smith, H.0. and B1rnst1el, M.L. (1976) Nucleic Acids Res. 3, 2387-2398. 3 2 . Messing, J . (1983) Meth. Enzymol. 101, 20-89. 33. Sanger, P., N i c k l e n , S. and Coulson, A.R. (1977) Proc. N a t l . Acad. Sc1. USA 74, 5463-5467. 34. Osterburg, G., G l a t t i n g , K.H. and Sommer, R. (1982) Nucleic Adds Res. 1 0 . 207-216. 762 Nucleic Acids Research 35. Chou, P.Y. and Fasman, G.D. (1978) J . B 1 o l . Chan. 252, 2127-2133. 36. Chou, P.Y. and Fasman, G.D. (1979) Biophys. J . 26, 367-383. 37. G a m i e r , J . , Osguthorpe, D.J. and Robson, B. (1978) J . Mol. B 1 o l . 120, 97-120. 38. P a r r y , D.A.D., Crewther, W.G., Fraser, R.D.B. and HcRae, T.P. (1977) J . Mol. B1ol. 133, 449-454. 3 9 . Crewther, W.G., I n g l i s , A.S. and McKern, N.H. (1978) Biochem. J . 173, 365-371. 40. Lawrence, J . B . and Singer, R.H. (1985) Nucleic A d d s Res. 13, 1777-1799. 4 1 . B a r a l l e , F.E. and Brownlee, G.G. (1978) Nature 274, 84-87. 42. Hagenbuechle, 0 . , Santer, H . , Argets1nger-Ste1tz, J . and Hans, R.J. (1978) Cell 13, 551-563. 4 3 . F i t z g e r a l d , M. and Shenk, T. (1981) Cell 24, 251-260. 4 4 . Cameron, I . L . (1966) J . Exp. Z o o l . 163, 271-284. 45. P o t t e n , C . S . , S c h o f i e l d , R. and L a j t h a , L.G. (1979) B1och1m. Biophys. Acta 560, 281-299. 46. Schweizer, J . , and W i n t e r , H. (1982) Cancer Res. 4 2 , 1517-1529. 47. Fuchs, E., and Green, H. (1981) Cell 25, 617-625. 763
© Copyright 2026 Paperzz