245. Elsevier. MOLBIO 01451. Isolation and characterization of the gene expressing the major salivary gland protein of the female mosquito, Aedes aegypti. *1.
Molecular and Biochemical Parasitolow, 44 (19911 245-254 Elsevier
245
MOLBIO 01451
Isolation and characterization of the gene expressing the major salivary gland protein of the female mosquito, Aedes aegypti A n t h o n y A. James, Karen B l a c k m e r *1 , O s v a l d o • "*2 Martnotn , Corine R. Ghosn, and Jeffrey V. Racioppi .3
Department of Molecular Biology and Biochemistry. University e#/'Calitbrnia, lrvine, CA. U.S.A.
(Received 13 July 1990; accepted 27 August 19901
We have undertaken a molecular analysis of the salivary glands of hematophagous insects in order to better understand their role in blood feeding and in the transmission of infectious diseases. To that end, genomic and cDNA clones of a gene designated D7, expressed abundantly in the adult female salivary glands of the vector mosquito Aedes aewpti, have been isolated and characterized. This gene encodes a mRNA shown by Northern analysis and in situ hybridization to tissue sections to be specifically transcribed in the distal-lateral and medial lobes of the glands, regions that are highly differentiated in females. The deduced gene product, a protein of approximately 37 kDa appears to be novel. Polyclonal antibodies made to a recombinant D7 product recognize a protein with the proper molecular weight in female salivary glands and saliva. These studies indicate that the D7 gene probably encodes the major secreted protein synthesized in the female salivary glands. The stage- and sex-limited expression of the D7 gene, and the secretion of its product, indicate that the product is most likely involved with the blood feeding capabilities of the female mosquito. Key words: Salivary gland: Sex specific gene: Vector biology
Introduction The possibility of using genetically engineered vector insects in disease control programs has prompted studies of insect organs and tissues where significant interactions with pathogens take Corre,spondem'e address." A.A. James, Department of Molecular Biology and Biochemistry, University of California. Irvine, CA 92717. U.S.A. *Present addresses." ITufts Medical School, Tufts University, Boston MA, 02118, U.S.A.: ZDepartamento de Parasitologia, ICB-Universidade de Silo Paulo C.P. 4365, 05580, Silo Paulo, S.P. Brazil; SDivision of Science, College of Basic Studies, Boston University, 871 Commonwealth Ave.. Boston, MA 02215, U.S.A. Note. Nucleotide sequence data reported in this paper have been submitted to the GenBankTM database with the accession numbers M33156 and M33157. Abbreviations." AMV, avian myeloblastosis virus: BSA, bovine serum albumin: cDNA, complementary DNA; poly(A)+, polyadenylaled: rRNA, ribosomal RNA: SDS. sodium dodecyl sulfate: SDS-PAGE, SDS-polyacrylamide electrophoresis: SSC, saline sodium citrate: TBS. Tris-buffered saline.
place. One objective is to use promoters of genes active in those specific tissues to control the expression of exogenous gene products selected to interfere with pathogen d e v e l o p m e n t . O u r efforts are focused on the salivary glands of adult female m o s q u i t o e s because prior to t r a n s m i s s i o n most pathogens must reside for some period of time in these organs• In addition, adult female mosquitoes are highly adapted ectoparasites of vertebrates, and as such, their salivary glands produce and secrete proteins that enable them to blood feed efficiently [1 ]. We began our analysis by isolating and characterizing the m a j o r genes expressed specifically in the salivary glands. These genes are principally those that encode the secreted salivary proteins. O u r characterization of each gene includes the det e r m i n a t i o n of the p r i m a r y structure, and its expression pattern, and where possible, the function of the gene product. Here we report the molecular and d e v e l o p m e n t a l properties of the m a j o r salivary gland-specific gene expressed a b u n d a n t l y and preferentially in k e d e s a e o ' p t i females. The
0166-6851/91/S03.50 © 1991 Elsevier Science Publishers B.V. (Biomedical Division)
246
abundance of the D7 gene product and its sexand stage-specificity suggest a role in blood feeding and/or digestion.
A .D'CIJ7 i l
! @'1 0'2[ ,7;'3 0'41 01~ 0'61 07 0'8 3'9 I!, ~E
Materials and Methods
Mosquito strains. The mosquitoes used in this study are the Rockefeller strain of A. aegypti.
47 ASGDGI2 °
Mosquitoes were reared using standard protocols [2] and the insects used in the following procedures were usually 3-5 days old after adult emergence.
_
-
[01i 02i C,3 04i
016 I
.
s!8
.
.
.
d9
.
I0i k~ IOObp
C ,~AEGD7 50kb
Nucleic acid isolation and labeling protocol,s'. RNA was prepared from whole adults and isolated tissues using a protocol from Chirgwin et al. [3]. Polyadenylated (poly(A) +) RNA was prepared following the protocol of Maniatis et al. [4]. D N A was prepared from adult mosquitoes using the protocol of Bender et al. [5]. Filter hybridization analyses of genomie and cloned DNAs were done according to Southern 16] using the highstringency buffer from Maniatis et al. [4]. DNA was labeled using a random primer kit supplied by Boehringer Mannheim, or by a standard nick translation reaction [71. RNA was end-labeled with [~2p] gamma-labeled ATP [8]. Riboprobes were synthesized from coding sequences cloned into the plasmid, p G E M I , using SH- and >Ssubstituted ribonucleotides in reaction conditions supplied by the manufacturer (Promega Biotec).
Developmental and spatial attalyses q~ RNA expression. Stage-, tissue- and sex-specific RNAs were examined by Northern analysis [9}. Exposures of Northern blots to X-ray film were typically overnight. Hybridizations in situ to tissue sections were performed as described l l0]. Exposures of sections of female glands were from 1 3 days, while male glands were exposed for 45 days.
Sequence determination and analysis. Subcloned genomic and eDNA fragments were sequenced in both directions (Fig. l) by the method of Sanger et al. [ l l i using a kit supplied by USBiochemicals. Oligonucleotide primers were either obtained commercially or synthesized with a Biosearch DNA synthesizer. RNA sequencing experiments were essentially identical to the DNA
i 0!5 0'4 0'6 cs ,'0 ,~
E
g
,'~ ,'s 2'o 2'2 I- as ee s0 se s~ s~sdIK~ S
E
2(;~ :b,~
Fig. I. Molecular maps of two cDNAs and a genomic clone homologous to the D7 gene. The thin horizontal lines in (A), (B) and (C) represent the sequenced DNA. The thick horizontal line in C represents the coding region of the D7 gene. lmron locations are marked by the Roman numerals I IV in (A) and (B), and by' the spaces in the thick horizontal line in (C). The arro,,,~s indicate the length and direction of the different sequencing gel readings. In (C). various restriclion endonuclease sites are indicated: B. BamHl: E, EuoRl; S, ,Slid. sequencing experiments, except that 3.25 l~.g total salivary gland RNA was used as the template and 5 units of AMV reverse transcriptase as the polymerase. The primer extension protocols are described in James et al. [l(t} and 0.9 pg total salivary gland RNA was used per reaction. Two 40-mer primers were used: D7R12, 5'GGTTCGTTAATCCATTCTTTCCATTTCGCTAA C A T T G G A A - 3 ' ; and RD7GEN 12,5'-CGGGATGAATGTTAGAC A G G C C T A ATGCGTTTTATATGCA-Y. D 7 R I 2 is complementary to nucleotides +223 through +262, and R D 7 G E N I 2 to nucleotides - 3 3 through +7 of the genomic sequence (Fig. 4). Samples were run on 6cA polyacrylamide sequencing gels for visualization. Computer analysis of sequence intormation was performed using the Amersham Staden software package. An analysis of similarity was performed using the Lipman-Pearson rapid search algorithms and RDF analysis [121, resident in the Georgetown University PIR database programs.
247
Preparation of" antibodies and immunodetection protocols. The full length ASGD7-12 cDNA was cloned into the pT7 expression plasmid [13] generating pT7-D7. Recombinant D7 protein synthesis was induced by heat shock of transformed Escherichia coli containing both pT7-D7 and pGP12. The recombinant D7 protein was purified by SDS-PAGE [14] and injected subcutaneously into a rabbit. A booster injection was given 40 days after the first immunization, and the rabbit bled 7 days later. The serum obtained was kept at - 2 0 ° until used for immunoblotting procedures. Material for immunoblotting was collected by dissecting salivary glands of 3-5-day-old males and females, or by collecting saliva as described [15]. Saliva and glands were stored at - 2 0 ° in TBS (10 mM Tris, pH 7.5/150 mM NaCI). An equal volume of loading buffer containing SDS and 2-mercaptoethanol was added to each sample and the samples resolved by SDS-PAGE on a 10% gel. Proteins were electrophoretically transferred to nitrocellulose using a custom apparatus. The resulting filter was blocked for 3 h in 3% BSA in TBS. The membrane was then incubated overnight in a 1/500 dilution of rabbit polyclonal anti-D7 serum in 3% BSA/TBS. After washing in 0.05% Triton X-100/TBS, the filter was incubated in the presence of '25I-labeled protein G (105 cpm ml - l ) for 1 h. The filter was washed as before, air-dried, and exposed to Kodak AR film with a Dupont Cronex intensifying screen at - 7 0 ° C .
library and genomic clones from an A. aegypti genomic library. Fig. 1 shows a molecular map of a genomic clone, AAEGD7 that includes the D7 gene.
Temporal and spatial e.~pression qf" the D7 gene. Northern analysis using 32p-labeled ASGD7-1 and total RNA from different developmental stages and isolated tissues demonstrates that the D7 cDNA is homologous to an approx. 1.2-kb polyadenylated RNA present in adult female salivary glands (Fig. 2). No expression is seen in embryos, larvae or pupae, and little is observed in adult males. The RNA can be detected up to 48 h following blood feeding (O. Marinotti and A.A. James, unpublished). The intensity of the signal in the female salivary gland lane after an exposure of 18 h indicates that this is an abundantly expressed gene. Hybridizations in situ to sections of adult salivary glands were used to reveal those cells containing D7 transcripts. Antisense RNA was synthesized from a 461-bp, 5'-end EcoRI-HinclI fragE
EL
L3
L4 EP LP S
S
A
C
M
F
C D7
1.2
rib
Results
Isolation of clones homologous to the D7 gene. A differential screening of an adult female cDNA library with end-labeled salivary gland RNA resulted in the isolation of clones homologous to RNAs expressed specifically in the adult salivary gland [10]. Six of these clones were apparently identical and were designated D7 after a representative clone, ASGD7-1. Approximately 150 bp were sequenced at both ends of ASGD7-1 using universal primers (New England Biolabs) and double-stranded template sequencing techniques. Using the information from these sequences, we synthesized two oligonucleotides and used them to select other clones from a salivary gland eDNA
Fig. 2. Developmental expression of the D7 gene. RNA wax prepared from various stages of development and different adult tissues, resolved on 0.9(/c agarose-t\)rmaldehyde denaturing gels, and transterred to nitrocellulose. The resulting Northem blot was probed with ~-'Pdabeled D7 eDNA (A and CI. After exposure, the probe was boiled off and the tilter was reprobed with a 3-'P-labeled clone AAEgl [10] with sequences homologous to the rRNA (B). (A) 5 l~g each of embryonic (E), early larval (EL), third instar larval (L3), flmnh instar larval (k4), early pupal (EP). late pupal (LP) total RNA, and 1 l~g of adult female salivary gland (S) total RNA, probed with D7. (B) same as (A), but probed with the ribosomal clone. (C) 2 /~g each of adult female salivary gland (S) and carcass (whole body with salivary glands removed) (C) total RNA, and 1 t/g each of male (M) poly(AV RNA and female (F) poly(A) + RNA probed with SGD7 1.
24g
Fig. 3. Spatial distribution of the D7 gene expression in adult female salivarv glands. Adult female salivary glands were dissected and prepared for m situ hybridization as in James et al. [10]. Sections were probed with antisense RNA probes nlade to the coding region of D7. (a) Giemsa-stained section of the female gland showing the medial (M), proximal lateral (PL) and distal-lateral (DL) lobes. (b) Dark-field image of the same tissue section as in (a) showing the grains corresponding to hybridization. Note that the proximal-lateral lobes (arrows) are the only regions not expressing the D7 RNA. ment o f the A S G D 7 - 1 2 c D N A and h y b r i d i z e d as in J a m e s et al. [10i, A total o f 52 a n t i s e n s e - p r o b e d and 7 s e n s e - p r o b e d f e m a l e s a l i v a r y g l a n d s were e x a m i n e d . Positive h y b r i d i z a t i o n was seen o n l y with the antisense p r o b e and was confined to the distal-lateral and m e d i a l lobes o f the female salivary glands (Fig. 3). T h e s e regions o f the g l a n d are h i g h l y d i f f e r e n t i a t e d in f e m a l e s as c o m p a r e d to males, and this sexual d i m o r p h i s m c o i n c i d e s with the f e m a l e - e n r i c h e d e x p r e s s i o n o f the D7 gene. R e p e a t e d efforts to detect l o c a l i z e d h y b r i d i z a t i o n in m a l e salivary glands were u n p r o d u c t i v e ; after 6 - w e e k e x p o s u r e s , a few grains were scattered o v e r all r e g i o n s o f the m a l e s a l i v a r y glands.
Characterization of the D 7 x,ene. A genomic clone, A A E G D 7 , was selected by h o m o l o g y to the c D N A clone A S G D 7 - 1 2 . F r o m this g e n o m i c clone a single EcoRI restriction e n d o n u c l e a s e f r a g m e n t 3819 bp in length was s e q u e n c e d (Fig. 4). This f r a g m e n t was d e t e r m i n e d to contain the entire
c o d i n g region o f the D7 gene by c o m p a r i s o n to s e q u e n c e d c D N A s . F o u r introns, I - I V , interrupt the c o d i n g region. The o b s e r v e d splice j u n c t i o n s c o n f o r m to k n o w n c o n s e n s u s a c c e p t o r and d o n o r sequences. O t h e r features consistent with k n o w n e u k a r y o t i c genes, a T A T A A A s e q u e n c e and a consensus p o l y a d e n y l a t i o n sequence, A A T A A A , were o b s e r v e d . The p r o b a b l e start o f transcription was defined by c o m p a r i n g p r i m e r e x t e n s i o n products and d i d e o x y s e q u e n c i n g o f R N A , and thus the nuc l e o t i d e s C and A were d e s i g n a t e d as +1 and +2, r e s p e c t i v e l y (Fig. 5). A l t h o u g h p r i m e r extension analysis and R N A s e q u e n c i n g can be c o n f o u n d e d by s e c o n d a r y structure o f the R N A , the d e s i g n a t e d transcription start site falls b e t w e e n a c o n s e n s u s T A T A A A sequence and the start of the translated portion o f the gene. The c o d i n g region o f this g e n o m i c clone disp l a y e d n u c l e o t i d e sequence p o l y m o r p h i s m s with the s e q u e n c e d c D N A s , ASGD7-11 and S G D 7 12 (Fig. 6). In addition, c D N A s ASGD7-11 and A S G D 7 - 1 2 also show sequence p o l y m o r p h i s m s b e t w e e n t h e m s e l v e s . Thus we have sequence i n f o r m a t i o n on three variants of the D7 gene from the R o c k e f e l l e r strain. A s d i s c u s s e d below, w h e t h e r this variation results from m u l t i p l e alleles or m u l t i p l e c o p i e s o f the D7 gene is still to be d e t e r m i n e d .
Characterization o / the D7 cDNAs and putative translation products. E l e v e n different c D N A s with h o m o l o g y to the original clone, ,kSGD7-1 were isolated from a salivary gland c D N A library and the largest were s e l e c t e d for further characterization. A s m e n t i o n e d p r e v i o u s l y , the s e q u e n c e d c D N A s , A S G D 7 - 1 1 , A S G D 7 - 1 2 , d i s p l a y e d nucleotide sequence p o l y m o r p h i s m s (Fig. 6). The s e q u e n c e of an additional c D N A chine, A S G D 7 8, was identical to that o f A S G D 7 - 1 2 . In general, the 5' ends o f the c D N A s are m o r e c o n s e r v e d than
Fig. 4. Primary nucleotide sequence of a genomic DNA fragment containing the D7 gene. A 3.8-kb EcoRl fragincnl c{mtammg the complete coding region of the D7 gene was subcloned from AAEGD7 and sequenced as described in Materials and Methods and Fig. 1. Various features of the sequence are listed in bold. A consensus eukaryotic TATAAA and polyadenylation consensus sequence (AATAAA) are present. The most likely translation initiation (ATG) and termination (TAG) codons are indicated. The dinucleotide CA were identified as the transcription initiation sites by primer extension analysis and direct didenxy sequencing of RNA and the C has been denoted +1 in the sequence, lntrons I-IV are in lower-case and underlined. In addition, the putative intron m the 3' end non-translated region is underlined. The restriction endonuclease cleavage sties for Slul and BamHl used for subclnning portions of the putative promoter are indicated.
249
GAATTCAAGGAACC~TTAAATGAATCTCTTGAAATATC~CTGGAAAC~TTACTCGTGAAGTCCTGGAGAATTTTCAGTTATA~ATCTGGGAAGA~TCCTTGCATGAATCCCTGGGGT~AT -2345 -2230 ~A~GAAGAAA~CC~GAGG~AT~C~GcAAGAA~C~CT~GACG~GAA~G~A~GG~TTGG~GAAATA~AAGGHT~GG~GAAA~AGAA~AC~GGAGAAA~C~G~GA~AA -2110 TAT~GAA~A~AT~GAAA~AA~T~AGAAAGAG~AC~GCA~AA~GGAAC~GAAA~CGC~GG~GGAA~G~GAAAA~CTTCAAGAAA~CAA~ATG~C~GAAAAAAC~ -1990 CTG~AGGAAA~T~TAAAGGATAA~GA~AGA~T~TTTAT~A~TGA~TCT~ATTATGGAGAAA~CAGCCC~AG~AGTTCA~CC~GAcAGAA~TAC~G~AAAACGA~G -1870 TT~TTTGTAATCTACTAATT~AGAGTCCAACGCTCGGGTTTTGAATTT~CCATATTTCCTACACAAATCCATGAAAGAATAATAGTCGAGAACTGT~AAAAAGTTATGAAAAATTATTGA -1750 AAACGGAA~CGAGACTAG~A~TG~AAAAGAG~ATCTATTG~CATTT~AGTTA~AAGGGCCAAT~AAT~GACA~AAACACAAG~TTCA~AACTC~T~AGAAATNTAAAGcA~C~A~ -1630 GATA~TGGCT~AGA~AAT~A~A~GTA~TC~C~AGA~T~GGC~AGTTGTTAGT~A~A~GAAAGGCCAAATGT~AAAGG~TAA~TATCCGCAGAG~C~A~GTCAA~AGTGAAG~AT -1510 ~TAAGCCAGCATGAAAGGG~ATTAT~AA~ATTT~TT~CAAAACCC~CCGGGTTCA~TATGC~GAGAAGGAAAAAC~GAAGTGA~TTTTTTG~GCTCTCG~G~CTG~ATAATAA~T -1390 ACCGTCCTT~AA~TAAAC~TTCCGTCAAAGT~CATAG~CCAAGCAATAAAAAAAAAGATGAAAA~T~TTCAATAAGGTATcACCAAAGA~ACATA~CATAC~GAGGGACCAAATGCA -1270 GTA~TAGAAGTGG~A~CAA~TCAGAGCC~GAGTGGGA~GGA~TGGTATCA~T~GA~ATAT~CAC~GTT~ACTGATGTCGTACAAACAGCTATT~G~GTAATTCT~CG~C~GCCACTTC -I150 AAATGAAAAC~GC~AGG~GAAACGGA~GAACAA~CAAAC~T~CC~TCAGCAAA~C~TG~C~A~CAT~AGAAAGTG~A~TTA~GAGGA~GAGCGG~AC~GCAA~GC~¢TCA~ -1030 A~GAGCGCTGGA~GGCACTGACG~CAC~T~C~GGA~¢cTCG~GGAG~GCT~G¢TGT~TCGACG~cATCT~CGA~GAAC~G~AG~AT~AC~AA~AGA~GCAGTAAG~CA~CT BamMI -910 TAAGGCTAAGTAAC~GTC~TCGTTT~GGCAACAATGATGAC~TTTCAG~T~G~ATTT~AAA~TGATAA~CGC~GTCTTGATAGTTTA~ACTGA~GAAAAAGTA~cAC~GTAcG~ -790 ~TACA~G~ATAA~GTATGCTGA~AC~TTT~CAGCTG~GTCAGTG~AAAGCA~GATTTTCT~GAT~GAA~CATGAGATG~AT~AGCAAC~AcCAT~AACGACG~G~AC~AA~TT -670 TAATGACGG~TAC~T~G~TAA~AGCCT~G~GTGATGA~GATCCTTGC~G~TGCG~AGG~ATA~T~CGACAA~T~GCTAGATTTC~GAT~TG~AGCAG~T~GTTAAA~AAAA
-550 ATAT~T~AGT~GAGAAA~GA~T~c~CA~C~AGAGAG~A~CGGATC~AACCG~A~TA~AGG~G~AA~AGGA~TGCGCA~CCAG~AC~G~CC~GG~AACGATGGCAA~ -~30 CGA~TAAAAC~G~GCC~ACGCA~AA~AGTTCTAG~TG~C~CTAAGCAATC~ATC~GG~C~TTTGG~C~G~GAAATC~GAT~AA~TCA~CGA~A~C~G~G~GAGCCTGGAG~ -310 ~GA~GCGA~A~ACATA~TGA~CGAT~GC~G~ATACA~A~G~T~TG~AAG~A~TA~CAATAA~AGA~CT~AGAGAcG~A~GGAAGCA~AC~A~G~A~A~AA~ -190 ~GAACAT~G~TAG~CTG~G~ACG~GTG~ACACA~CA~G~A~A~A~GGA~GA~TGAT~AGc~ACTCAA~C~CT~C~CA~CTGC~GG~GT~A~ATT~GCG~AAA~C~AT~ -70 A~TG~TAT~AGCTGA~GCTA~ATCACA~CAAC~TGCA~CGCA~A~G~TAACA~cATCC~GA~A~ACT~ACAA~AC~AGAA~AGC~A~T~C~A~CGCA ~u f +i +51 ATTTTTACAAC~TTTC~GTGgtaaga~gtt~cta9~9~at~caac~tcatttc~taa~aaatac~c¢a¢cttttaaqGTTG~CTCAATGGGACCATTTGATCCGGAGGAGATG~TGT I +171 T~ATCTTT~CG~G~TG~ATGGAAGACAA~TTGG~AGATGGAG~GAA~GA~T~C~AATGTTAGCGAAATGGAAAGAA~GGAT~A~GAAC~GGTAGACAGCCC~G~AA~T~AGTGTTTCG +291 GCAAATGCGTCC~GGT~AGAACAGGTC~G~ACGATCCGGTAG~CCAAAAGTTCGA~9~aa9Ct9~t~gC%ga~aC~gg~aat~ggataa~C~caCcgct~¢aa~9C~taqG~GT II +&ll ~GGTGATC~GG~GCAATTTAAGG~TT~TCCGT~TTGGGGGAAAAGAGCAAAGTTGAAGCAT~TGCTAACGcAGTTAAA~GTTGC~TT~CACAAATAACGA~TGTGCCGCTGTTTT~A +531 AAGCG~A~GAT~CTG~TcATAAGGCGCA~A~GGACACCAGCAAGAACTTG~Tc~ATGGAAA~AAGGAGTTGAcC~AGGGCcTcTATGAG~AGTTG~aa9~t~c¢~t~at~c~ III +651 ~aaaca~g~a~gaaca~a~att~c~c~GGAAAAGACATTCG~CAGAAGAAGCAATCCTACTT~GAGTTTTGCGAGAA~A~GTA~TACCCAGCTGGG~CAGA~AAGCG~A III +771 GCAACTT~TCAGA~AAGGC~TA~ACTGT~TTAGA~GATG~G~TGTTCAAGGAGCA~ACTG~TTGCGTGATGAAGGGTA~T~G~TACATTACGAAGGATAA~AA~TGGAT~ ~8gl ataataaatatca~t9caaacatcata9jaatc~acatt~cct~tca~GTGGAAGAGGTGAAGCGGGACTTCAAG~TAGTGAATAAAGATACGAAGGCACTCGAAAAGGTTTTGAATGAcT IV +I011 GTAAGT~TAAGG~AC~A~GCAA~GCAAAAGAGAAAT~ATGG~A~TA~TA~AAATGTTTGGTGGAAT~TTCGGTTAAGGATGATTT~AAAGAGG~THTGACTAT~GTGAGG~A~GGTCAC +1131 AGATTTATGC~TT~AATTTGC~CAAGAAA~GGCTTACAGCA~A~AGCAGTGCAATCT~AAGTGATGGAAATCGA~GGTAAA~AGTGTC~ACAA~AGAAT~GTGC~TTGAAAATGTCTT +1251 CcAATTCAATGGTTT~ATCGAAGT~GAAAGT~A~T~A(;'ATATAATAA~CATCA~GCAG~GGAAC~TTGTTTTAACTTATATTG~TTATTTGGT~ATTGTTGTAGTTATTTTTAAAHA +1371 TTTTA~TTTGGG~AG~CGAA~AAC~GT~AA~AGGAAAT~A~GTAGTT~TTGAA~CAT~GTTATTA~AA~ATATCGCAT~GTTCTGTACTGAAGA~TT~ +L470
250
TCGARD
G
Fig. 5. Comparison of primer extension and dideoxy sequencing products of female salivary gland RNA. RNA was annealed to oligonucleotide primers D7RI2 or RD7GEN12, and subjected to dideoxy sequencing reactions (lanes T, C, G, A) or primer extension analysis {lanes R and D), as described in Materials and Methods. Both methods produce strong stops at identical locations, 26 and 27 nucleolides upstream from the initiation codon ATG using the D7R 12 primer. The RD7GENI2 primer overlaps the putative transcription initiation site and therefore produces no extension product. Lanes T, C, G, A are the T,C.G. and A, sequencing reactions, respectively (note that the RNA sequence m the figure has been presented as the complementary, sense strand, so thai it may be read directly), and lanes R and D are the primer extension products generated by primers RD7GEN12 and D7RI2, respectively.
finning the reading frame of the D7 m R N A (O. Marinotti and A.A. James, unpublished). The deduced protein is 321 amino acids in length with a predicted minimum size of 36 920 Da. The protein is lysine-rich (12% of amino acid residues), has 10 cysteine residues, and a hydrophobic amino terminal region characteristic of a secretory signal peptide [16,17]. Several of the salivary gland secretory proteins are glycosylated [10,15], however, the deduced D7 protein lacks consensus sites for asparagine-linked glycosylation. A search of the PIR database for similarities of the D7 deduced protein with other reported proteins yielded no definitive functional similarity with other proteins. SDS-PAGE-isolated recombinant protein was injected into a rabbit to produce antibodies to the D7 product. Fig. 7 shows that the serum recognizes a 37-kDa protein present in female salivary glands and saliva but not in males. Additionally, the antibodies recognize recombinant and native D7 protein following both native and denaturing electrophoretic separation and transfer protocols (C. Ghosn, O. Marinotti, and A.A. James, unpublished). These data indicate that we have been able to generate antibodies to the D7 protein, that the protein is expressed only in females, and that the D7 protein is a secreted protein as predicted from the signal peptide.
Discussion the 3' ends. Of the 31 bases differing between the coding region of ASGD7-11 and ASGD7-12, only 5 cause changes in amino acids. ASGD7-12 was the longest c D N A recovered, and by comparison with the genomic sequence was missing only 21 nucleotides from the 5' end. However, this cDNA contains a deletion in the 3'-end non-translated region proximal to the consensus polyadenylation signal sequence that is not observed in ASGD7-11. Nucleotide variation in that region may have created a small intron that was excised before this particular c D N A was synthesized. We have generated a putative protein sequence based on the major open reading frame present in the sequenced cDNAs (Fig. 6). The putative openreading frame of ASGD7-12 was cloned and expressed in the phage T7 system [13], generating a protein with the predicted molecular weight, con-
The D7 cDNAs and homologous gene were isolated using standard differential screening techniques with labelled salivary gland-specific RNA. The D7 gene encodes an m R N A that is preferentially expressed in the distal-lateral and medial lobes of the adult female salivary glands. The high number of D7 representative clones in the salivary gland library, the short time of exposures of the in situ hybridizations, and the strong signal on the Northern blots indicate that the D7 gene is abundantly expressed. The D7 gene encodes a major secreted salivary gland protein of 37 kDa. The putative signal peptide in the D7 protein indicates that it is a secreted salivary gland product, confirmed by the antibody detection of the D7 protein in saliva. The abundance and molecular weight of the D7 deduced gene product indicate that it is most likely the
251
& ,~CTAGAATI~AAGCTGCCTCTATTACTCGCAATAGTTACAACCTT TTCCGTG'GTTGCTTCAACGGGACCAT TTGATCCGGAGGAGATGCTGTTCAOCT TTACGCGCTGCATGGAAGACAAT M K L P L L L A I .......................................
I0
20
30
V T T F S V V A S T G P F D P E E M L F T F T R C M E I) N ~TTTT~CGTGGTTGCcT~AACGGGAC~TTTGAT~CGGAGG~GATGCTGTTCACCTTTACGCGCT~CATGGAAGACAAT
40
50
60
70
80
90
100
110
120
TTGGAAGATGGACCGAATCGACTTCCAATGTTAGCGAAATGGAAAGAATGGATTAACGAACCGGTAGACAGCCCCGCAACTCAGTGTTTCGGCAAATGCGTCCTGGTAAGAACAGGTCTG L E D G P N R L P M L a K W K E W I N E P V D S P A T Q C F G K C V L V R T G L TTG~AAGA~G~ACCGA~TCGACTT~CAATGTTAGC~AAATGG~A~GAATGGATTAA~GAACCGGTAGA~AGCCCCGCAACTC~GTGTTTCGGCAAATGCGTCCTGGTAAGAACAGGTCTG 130 140 150 160 170 180 190 200 210 220 230 240 TACG~TCCGGT~GCCC~AAAGTTCGAT~CGTCGGTGATCCAGGAGCAATTTAAGGCTTATCCGTCCT TGGGGGAAAAG~GCAAAGTTGAAGCATATGCTAACGCAGTTCA~CAGTTGCCT Y D PV~Q KF DASV I Q E Q F KAY P S L GE K S K V E ~ Y ~ N A V Q Q L P TACGATCCGGTAGCCCAAAAGTTCGATGC~;T~GGTGATCCAGGAGCAATTTAAGGCTTATCCGTCCTTGGGGGAAAAGAGCAAAGTTGAAGCATATGCTAACGCAGTTCAACAGTTGCCT 250 260 270 280 290 300 310 320 330 340 350 360 TCCACAAATAACGACTGTG~GCTGTTTTCAAAG~GTACGATCCTGTTCATAAGGCGCATAAGGACACCAGCAAGAACTTGTTCCATGGAAACAAGGAGfIGAccAAGGGCCTCTATGAG S T N N D C A A V F K A Y D P V H K A H K D T S K N L F H G N K E L T K G L Y E TC~ACAAATAACGACTGTGC~G~TGTTTTCAAA~CG~ACGMC~TGTTCATAAGGCGCATAAGGA~AC~AGCAAGAACTTGTT~CATGGAAACAAGGAGTTGACCAAGGGCCTCTATGAG 370 380 390 400 410 420 430 440 450 460 470 480 AAGTT~GAAAAGA~ATTCGCCAGAAGAAGAAATCCTA~TTCGAGTTTTGCGAGAA~AAGTACTA~CCAG~TGGATCAGAT~AGCG~CAGCA~T~G~A~GAT~GG~ATAC~C~GTC K L G K D I R O K K K/e S Y F E F C E N K Y Y P A G S D K R Q Q L C K I R 0 Y T V AAGTTGGGAAAAGACATTCGCCAGAAGAAG~A~TCCTACTTCGAGTTTTGIGAGAACAAGTACTACCCAGCTGG~TCAGATAAGCGCCAGCAA~TTTGTAAGATAAGGCAA~ACACTGT~ 490 500 510 520 530 540 550 560 570 580 590 600
~TAGATGATGCGCTGTTC~AGGAGCACACTGATTGCGTGATGAAGGGTMTCGC~ACATAACGAAG~AT~G~AC~GGATGC~GAAGAGG7~AAA~GGGACTTCAAGCTAG~GAATaAA L D D A L F K E H T D C V M K G I R Y I T K N N E L D A E E V K R D F ~/NL/e V N K TTAGATGMGCGCTGTTCA~GGAGCACACTGATTGCGTGaTGAAGGGTATTCGCT~CAT~A~GAAGA~TAATGAACTGGATGCTGAAGAGGTGAA~CGAGA~TT~AIGC~GTGA~TAAA 610 620 630 640 650 660 670 080 690 700 710 720 GATACGAAGG~ACTTGAAAAGGTTTTGAATGACTGTAAGTCTAAGGAACCAAGCAACGCAAAAGAGAAATCATGGCACTACTACAAATGTTTGGTGGAATCTTCGGTTAAGGATGATTTC D T K A L E K V L N D C K S K E P S N A K/G E K S W H Y Y K C L V E S S V K D D F GACACGAAGGCACTTGAAAAGGTTTTGAATGACTGCAAGTCTAAAGAACCAAGCAACGCA~GAGAGAAATCATGGCACTAIT~CAAATGTTTGGTCGAGTCTTCGGTTAAGGATGATT TC 730 740 750 760 770 780 790 800 810 820 830 840 AAAGAGGCTTTTGACTATCGTGAGGTACGGTCACAGATTTATGCCTTCAATTTG•CCAAGAAGCAGGCTTA•AGCAAA•CAGCAGTGCAATCTCAAGTGATGGAAATCGACGGTAAACAG K E A F D Y R E V R S O I Y A F N L P K K Q A/V Y S K P A V g S Q V M E I D G K Q AAGGAGGCTTTTGACTATCGTGAGGTACGGTCACAGATTTATGCCTTCAATTTGCCfAAGAAGCAAGfTTACAGCAAACCGGCCGTACAGTCCCAAGTIATGGAAATCGATGGTAAGCAG
850
860
870
880
890
900
TGTCCACAATAGAATAGTGCATTGAAAATGTCTTCCAATTCAATGG ..................
910
920
g30
940
950
960
TTAAATAAAGAAATAATGATCATCACGCAAAAAAAAAAAAAAAA
C P Q TGCCCACAATAGAA~AGTGTA~TGAAAATGTCTTCCA~TTTAATGGTCTCATCGAAGTAGAAAATTAAATAAAGAAATAATGA~nAT~ATGCAAAAAAAAAAAAAAAA 970 980 990 1000 1010 1020 1030 i040 1050 1060
Fig. 6. Primary nucleotide sequence of the D7 c D N A s and their deduced amino acid sequence. The cDNA clones, ASG7-12 (top) and ASGD7-11 (bottom) were sequenced as described in Materials and Methods and in Fig. 1, and aligned to show sequence homology. The deduced amino acid sequence is listed in the middle with the putatitive secretory signal peptide underlined. Numbering refers to the nucleotide sequences and dashed regions represent sequences not found in the respective clones. Nucleotide sequence polymorphisms are highlighted in bold in ASGD7-1 I. A m i n o acids differences resulting from sequence polymorphisms are indicated as doublets with the first amino acid derived from ASGT-12 and the second amino acid from ASGD7-11. A translation initiation codon (ATG) and consensus polyadenylation signal sequence, and the restriction endonuclease Hi,loll site (GTTGAC) are listed in bold in the ASGT-12 sequence. The positions of introns I-IV are indicated by the vertical arrows.
252
C
F
M
SI
$5
24-1iiiiiiiiiii~i+ili! ~
Fig. 7. Antibody detection of D7 protein in female salivary glands and saliva. C, Coomassie Blue-stained proteins from 6 pairs of whole female salivary glands separated by SDSPAGE. F, antibody detection of D7 from 6 pairs of female saliva U glands. M, antibody detection of D7 from 20 pairs of male salivary glands. S1, $5, antibody detection of D7 in saliva collected from 26 females, 1 and 5-day exposures, respectively. Proteins from saliva and salivary glands were resolved by SDS+PAGE, transferred to nitrocellulose filters, and specitic recognition determined as described in Materials and Methods. Numbers refer to molecular weight markers in kDa.
major secreted salivary gland protein, designated polypeptide 7, by Racioppi and Spielman [15]. We take the signal evident in Northern analysis of male poly(A) + RNA and the lack of a discernible pattern of expression the in situ hybridization to tissue sections of male thorax as an indication that all of the cells in the male glands synthesize the D7 RNA at a low level. However, it is possible that some other tissue in males is expressing the D7 RNA. If D7 expression is confined to the salivary glands, then sex-specitic regulation of the D7 gene product appears to occur both transcriptionally and translationally. The low level of RNA expressed in males does not yield a protein detectable by polyclonal antibodies. Efforts to resolve the origin of the nucleotide sequence differences between the genomic and cDNA clones have been so far unsuccessful. We do not yet know if these differences result from allelic variation of a single locus or from multiple genes. Southern analyses have revealed the presence of a copy of a dispersed, highly repetitive DNA sequence in a region adjacent to the D7 gene (C. Ghosn and A.A. James, unpublished), and variation in this region may account for the multiple bands often seen in these analyses and
complicate the standard approaches to evaluati n g gene copy number. We have isolated additional D7 genomic clones and the analysis of these should provide an answer. Mosquitoes are holometabolous, having distinct larval, pupal, and adult stages. Larvae feed on bacteria and other particulate organic matter in their aqueous environment, adult males feed on nectar [18], and adult females feed on nectar and blood. The restricted expression of D7 in adult female salivary glands indicates that it is involved in one of the two adult feeding modes. The female salivary glands are morphologically distinct from the male salivary glands, possessing large distallateral and medial lobes not found in males. The fact that only females feed on blood is coincident with this structural dimorphism and it has been proposed that the female-specific lobes synthesize and secrete the salivary components involved in blood feeding [19]. For example, an apyrase (ATP-diphosphohydrolase) has been described in the female salivary glands. The apyrase acts as an anti-platelet aggregating factor and helps the female ingest blood [1]. The apyrase activity has been localized to the medial lobe and distal-lateral lobes of the female salivary gland [20]. Conversely, two salivary gland activities found in both males and females+ an (~-glucosidase [21] and a lysozyme-like activity [22], are restricted to the proximal-lateral lobes of the female salivary gland. These observations combined with the stage, sex, and spatial restricted expression of the D7 gene make it likely that the D7 gene product is involved in blood feeding. The lack of reported proteins with sequence similarities to the D7 gene product is not surprising. Efforts to investigate the biochemistry and molecular biology of the secreted salivary gland proteins of hematophagous arthropods have only recently been initiated and there is a potential that many new families of proteins will be discovered. In addition to the characterization o1 novel pharmacologically active products, the analysis of genes encoding the salivary gland proteins provides information about the DNA sequences that are good candidates for genetic transformation of vectors. The promoters of these abundantly expressed genes could be used to direct the expression of exogenous DNA sequences, which en-
253 c o d e a n t i p a t h o g e n activities, in the f e m a l e saliv a r y g l a n d . T h i s o r g a n is a g o o d target, as m e n t i o n e d before, b e c a u s e it is a site w h e r e m o s t p a t h o g e n s m u s t e n t e r b e f o r e t r a n s m i s s i o n to a vertebrate host. The functional analysis of vector insect promoters a n d their a p p l i c a t i o n to d i r e c t i n g the e x p r e s sion o f e x o g e n o u s g e n e s is n o w d e p e n d e n t on the d e v e l o p m e n t o f t e c h n i q u e s that e n a b l e the integ r a t i o n a n d s t a b i l i z a t i o n o f e x o g e n o u s D N A seq u e n c e s in the target i n s e c t g e n o m e . O u r efforts in c h a r a c t e r i z i n g the 5 ' - e n d c o n t r o l l i n g D N A seq u e n c e s c o n t a i n i n g the p u t a t i v e p r o m o t e r s o f the abundantly expressed salivary gland genes anticipate the d e v e l o p m e n t o f p r o t o c o l s for g e n e t i c a l l y transforming mosquitoes.
Acknowledgements T h i s w o r k was s u p p o r t e d b y f u n d s m a d e a v a i l able b y the J o h n D. a n d C a t h e r i n e T. M a c A r t h u r F o u n d a t i o n , by a g r a n t f r o m the N a t i o n a l Institutes o f H e a l t h , N I A I D , A I 2 9 7 4 6 , a n d in part b y B R S G S07 R R 0 7 0 0 8 a w a r d e d b y the B i o m e d i cal R e s e a r c h S u p p o r t G r a n t P r o g r a m , D i v i s i o n o f Research Resources, NIH.
References l Ribeiro, J.M.C. (1987) Role of saliva in blood feeding by arthropods. Annu. Rev. Entomol. 32, 463478. 2 Rossignol, P.A. and Spielman, A. (1982) Fluid transport across the ducts of the salivary glands of the mosquito. J. Insect. Physiol. 28, 574 581. 3 Chirgwim J.M., Przybyla, A.E., MacDonald, R.J. and Rutter, W.J. (1979) Isolation of biological active ribonucleic acid from sources enriched in ribonuclease. Biochemistry 18, 5294-5299. 4 Maniatis, T., Fritsch. E.F. and Sambrook, J. (1982) Molecular Cloning. A Laboratory Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor, New York. 5 Bender, W., Spierer, P. and Hogness, D.S. (1983) Chromosomal walking and jumping to isolate DNA from the Ace
and Rosy loci and the Bithorax Complex in Drosophila melanogaster. J. Mol. Biol. 168, 17-33. 6 Southern, E.M. (1975) Detection of specific sequences among DNA fragments separated by gel electrophoresis. J. Mol. Biol. 98, 503-517. 7 Rigby, P.W.J., Dieckmann, M., Rhodes, C. and Berg, P. (1977) Labeling deoxyribonucleic acid to high specific activity in vitro by nick translation with DNA polymerase I. J. Mol. Biol. 113, 237 251. 8 Maizels, N. (1976) Dictyostelium 17S, 25S, and 5S rDNAs lie within a 38,000 base pair repeated unit. Cell 9, 43 IM-38. 9 Color, H.V. and Rosbash, M. (1982) Behavior of individual maternal pA + RNAs during embryogenesis of Xenopus laevis. Dev. Biol. 94, 79-86. 10 James, A.A., Blackmer, K. and Racioppi, J.V. (1989) A salivary gland-specific, maltase-like gene of the vector mosquito, Aedes ae~,,ypti. Gene 75, 73-83. 11 Sanger, F., Nicklen, S. and Coulson, A.R. (1977) DNA sequencing with chain-terminating inhibitors. Proc. Natl. Acad. Sci. USA 74. 5463-5467. 12 Lipman, D.J. and Pearson, W.R. (1985) Rapid and sensitive protein similarity searches. Science 227, 1435 1441. 13 Tabor, S. and Richardson, C.C. (1985) A bacteriophage T7 RNA polymerase/promoter system for controlled exclusive expression of specific genes. Proc. Natl. Acad. Sci. USA 82, 1074-1078. 14 Laemmli, U.K. (1970) Cleavage of structural proteins during the assembly of the head of the bacteriophage T4. Nature 227, 680 685. 15 Racioppi, J. and Spielman, A. (1987) Secretory proteins from the salivary glands of the adult Aedes ae~)7~ti mosquitoes. Insect Biochem. 17, 503-511. 16 Kyte, J. and Doolittle, R.F. (1982) A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157, 105 132. 17 Von Heijne, G. (1985) Signal sequences-the limits of variation. J. Mol. Biol. 184, 99-105. 18 Christophers, S.R. (1960)Aedes aegypti, the Yellow Fever mosquito. Cambridge University Press, Cambridge. 19 Marinotti, O., James, A.A. and Ribeiro, J.M.C. (In Press) Diet and salivation in Aedes ae~ypti female mosquitoes. J. Insect Physiol. 20 Ribeiro, J.M.C., Sarkis, J.J.F., Rossignol, P.A. and Spielman, A. (1984) Salivary apyrase of Aedes ae;,ypti: characterization and secretory fate. Comp. Biochem. Physiol. 79B, 81 86. 21 Marinotti, O. and James, A.A. (In Press) An alphaglucosidase in the salivary glands of the vector mosquito, Aedes aegypti. Insect Biochem. 22 Rossignol, P.A. and Leuders, A.M. (1986) Bacteriolytic factor in the salivary glands of the Aedes aegypti. Comp. Biochem. Physiol. 83B, 819-822.