United States Patent | 6,165,461 |
Cobb ,   et al. | December 26, 2000 |
Compositions and methods are provided for potentiating the activity of the mitogen-activated protein kinase p38. In particular, the mitogen-activated protein kinase kinase MEK6, and variants thereof that stimulate phosphorylation of p38, are provided. Such compounds may be used, for example, for therapy of diseases associated with the p38 cascade and to identify antibodies and other agents that inhibit or activate signal transduction via p38.
Inventors: | Cobb; Melanie (Dallas, TX); Hutchison; Michele (Dallas, TX); Chen; Zhu (Dallas, TX); Berman; Kevin (Dallas, TX) |
Assignee: | Board of Regents, University of Texas System (Austin, TX) |
Appl. No.: | 060410 |
Filed: | April 14, 1998 |
Current U.S. Class: | 424/94.5; 435/194 |
Intern'l Class: | A61K 038/51; C12N 009/12 |
Field of Search: | 435/194 424/94.5 |
Foreign Patent Documents | |||
WO 99/02699 | Jan., 1999 | WO. |
Boulton et al., "An Insulin-Stimulated Protein Kinase Similar to Yeast Kinases Involved in Cell Cycle Control," Science 249:64-67, 1990.
Burbelo et al., "A Conserved Binding Motif Defines Numerous Candidate Target Proteins for Both Cdc42 and Rac GTPases," J. Biol. Chem. 270:29071-29074, 1995.
Courchesne et al., "A Putative Protein Kinase Overcomes Pheromone-Induced Arrest of Cell Cycling in S. cerevisiae," Cell 58:1107-1119, 1989.
Elion et al., "FUS3 Encodes a cdc2+/CDC28-Related Kinase Required for the Transition from Mitosis into Conjugation," Cell 60:649-664, 1990.
Hunter and Plowman, "The protein kinases of budding yeast: six score and more," Trends in Biochem. Sci. 22:18-22, 1997.
Leberer et al., "The protein kinase homologue Ste20p is required to link the yeast pheromone response G-protein βγ subunits to downstream signalling components," The EMBO Journal 11:4815-4824, 1992.
Ramer and Davis, "A dominant truncation allele identifies a gene, STE20, that encodes a putative kinase necessary for mating in Saccharomyces cerevisiae," Proc. Natl. Acad. Sci. USA 90:452-456, 1993.
Rhodes et al., "STE11 is a protein kinase required for cell-type-specific transcription and signal transduction in yeast," Genes and Development 4:1862-1874, 1990.
Robinson et al., "Contributions of the Mitogen-activated Protein (MAP) Kinase Backbone and Phosphorylation Loop to MEK Specificity," The Journal of Biological Chemistry 271(47):29734-29739, 1996.
Su et al., "NIK is a new Ste20-related kinase that binds NCK and MEKK1 and activates the SAPK/JNK cascade via a conserved regulatory domain," The EMBO Journal 16(6):1279-1290, 1997.
Teague et al., "Nucleotide sequence of the yeast regulatory gene STE7 predicts a protein homologous to protein kinases," Proc. Natl. Acad. Sci. USA 83:7371-7375, 1986.
Wu et al., "Molecular Characterization of Ste20p, a Potential Mitogen-activated Protein or Extracellular Signal-regulated Kinase Kinase (MEK) Kinase Kinase from Saccharomyces cerevisiae," Journal of Biological Chemistry 270(27):15984-15992, 1995.
Allen et al., "PAK3 Mutation in Nonsyndromic X-Linked Mental Retardation," Nature Genetics 20:25-30, 1998.
Creasy and Chernoff, "Cloning and Characterization of a Member of the MST Subfamily of Ste20-Like Kinases," Gene 167:303-306, 1995.
Database EMBL Accession No. AA234623, Mar. 6, 1997.
Database EMBL Accession No. AF068864, Sep. 23, 1998.
Hutchison et al., "Isolation of TAO1, a Protein Kinase That Activates MEKs in Stress-Activated Protein Kinase Cascades," J. Biol. Chem. 273(44):28625-28632, 1998.
Marra et al., genban-est111 database, Accession No. g1541866, Sep. 1996.
TABLE 1
__________________________________________________________________________
          TAO1
TAO2      90    TAO2
ceTAO     65    61    ceTAO
STE20d    40    39    37    STE20
GCK       43    42    35    40    GCK
MLK1      32    30    27    30    29    MLK1
MST1      47    43    42    42    47    28    MST1
MEKK1     34    33    27    30    30    30    29
__________________________________________________________________________
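The staircase layout of TABLE 1 gives one value per pair of kinases, so it can be read as a symmetric lookup. The following is a minimal Python sketch of that reading, not part of the patent. It assumes that each value is a pairwise comparison score between the two named proteins (this excerpt does not state the metric), and the names `build_lookup`, `score`, and `closest_to` are hypothetical helpers introduced only for illustration.

```python
# Values transcribed from the lower triangle of TABLE 1.
# Assumption: each number compares the row kinase with the column kinase;
# the excerpt does not say which metric (e.g. percent identity) is used.
# Note: the row label is printed as "STE20d" and the column label as "STE20".
NAMES = ["TAO1", "TAO2", "ceTAO", "STE20", "GCK", "MLK1", "MST1", "MEKK1"]
LOWER = [
    [],                            # TAO1
    [90],                          # TAO2
    [65, 61],                      # ceTAO
    [40, 39, 37],                  # STE20
    [43, 42, 35, 40],              # GCK
    [32, 30, 27, 30, 29],          # MLK1
    [47, 43, 42, 42, 47, 28],      # MST1
    [34, 33, 27, 30, 30, 30, 29],  # MEKK1
]


def build_lookup(names, lower):
    """Turn the lower-triangular rows into an order-independent pair lookup."""
    table = {}
    for i, row in enumerate(lower):
        for j, value in enumerate(row):
            table[frozenset((names[i], names[j]))] = value
    return table


TABLE1 = build_lookup(NAMES, LOWER)


def score(a, b):
    """Return the TABLE 1 value for the pair (a, b), in either order."""
    if a == b:
        raise ValueError("TABLE 1 has no diagonal values")
    return TABLE1[frozenset((a, b))]


def closest_to(name):
    """Return the other kinase with the highest TABLE 1 value against `name`."""
    others = [n for n in NAMES if n != name]
    return max(others, key=lambda n: score(name, n))


if __name__ == "__main__":
    print(score("TAO1", "TAO2"))
    print(closest_to("ceTAO"))
```

Keying the dictionary on a `frozenset` of the two names means a lookup works regardless of argument order, which mirrors the fact that only one triangle of the matrix is printed.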
__________________________________________________________________________ # SEQUENCE LISTING - - - - (1) GENERAL INFORMATION: - - (iii) NUMBER OF SEQUENCES: 26 - - - - (2) INFORMATION FOR SEQ ID NO: 1: - - (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 3312 base - #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - - (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 121..3123 - - (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - #1: - - TCTGCAGTAT GGTAGATTAT TATTTATGCA TTTATGCCAG TGTGGCTTCA TT - #CATACAGA 60 - - TGAACCAAGC TTTGGGATAG CAGTATAAAA TTAGAATCAG ACAGCTGACT GC - #TCAGCAGG 120 - - ATG CCA TCA ACT AAC AGA GCA GGC AGT CTA AA - #G GAC CCT GAA ATC GCA 168 Met Pro Ser Thr Asn Arg Ala Gly Ser Leu Ly - #s Asp Pro Glu Ile Ala 1 5 - # 10 - # 15 - - GAG CTC TTC TTC AAA GAA GAT CCG GAA AAA CT - #C TTC ACA GAT CTC AGA 216 Glu Leu Phe Phe Lys Glu Asp Pro Glu Lys Le - #u Phe Thr Asp Leu Arg 20 - # 25 - # 30 - - GAA ATC GGC CAT GGG AGC TTT GGA GCA GTT TA - #T TTT GCA CGA GAT GTG 264 Glu Ile Gly His Gly Ser Phe Gly Ala Val Ty - #r Phe Ala Arg Asp Val 35 - # 40 - # 45 - - CGT ACT AAT GAA GTG GTG GCC ATC AAG AAA AT - #G TCT TAT AGT GGA AAG 312 Arg Thr Asn Glu Val Val Ala Ile Lys Lys Me - #t Ser Tyr Ser Gly Lys 50 - # 55 - # 60 - - CAG TCT ACT GAG AAA TGG CAG GAT ATT ATT AA - #G GAA GTC AAG TTT CTA 360 Gln Ser Thr Glu Lys Trp Gln Asp Ile Ile Ly - #s Glu Val Lys Phe Leu 65 - # 70 - # 75 - # 80 - - CAA AGA ATA AAA CAT CCC AAC AGT ATA GAA TA - #C AAA GGC TGC TAT TTA 408 Gln Arg Ile Lys His Pro Asn Ser Ile Glu Ty - #r Lys Gly Cys Tyr Leu 85 - # 90 - # 95 - - CGT GAA CAC ACA GCA TGG CTT GTA ATG GAA TA - #T TGT TTA GGA TCT GCT 456 Arg Glu His Thr Ala Trp Leu Val Met Glu Ty - #r Cys Leu Gly Ser Ala 100 - # 105 - # 110 - - TCG GAT TTA CTA GAA GTT CAT AAA AAG CCA TT - #A CAA GAA GTG GAA ATA 504 Ser Asp Leu Leu Glu Val His Lys Lys Pro Le - #u Gln Glu Val Glu Ile 115 - # 120 - # 125 - - GCA GCA ATT ACA CAT GGT GCT CTC CAG GGA TT - #A GCT TAT TTA CAT TCT 552 Ala Ala Ile Thr His 
Gly Ala Leu Gln Gly Le - #u Ala Tyr Leu His Ser 130 - # 135 - # 140 - - CAT ACC ATG ATC CAT AGA GAT ATC AAA GCA GG - #A AAT ATC CTT CTG ACA 600 His Thr Met Ile His Arg Asp Ile Lys Ala Gl - #y Asn Ile Leu Leu Thr 145 1 - #50 1 - #55 1 - #60 - - GAA CCA GGC CAA GTG AAA CTT GCT GAC TTT GG - #A TCT GCT TCC ATG GCC 648 Glu Pro Gly Gln Val Lys Leu Ala Asp Phe Gl - #y Ser Ala Ser Met Ala 165 - # 170 - # 175 - - TCC CCT GCC AAT TCT TTT GTG GGA ACA CCA TA - #T TGG ATG GCC CCA GAA 696 Ser Pro Ala Asn Ser Phe Val Gly Thr Pro Ty - #r Trp Met Ala Pro Glu 180 - # 185 - # 190 - - GTA ATT TTA GCC ATG GAT GAA GGA CAA TAT GA - #T GGC AAA GTT GAT GTA 744 Val Ile Leu Ala Met Asp Glu Gly Gln Tyr As - #p Gly Lys Val Asp Val 195 - # 200 - # 205 - - TGG TCT CTT GGA ATA ACA TGT ATT GAA TTA GC - #C GAG AGG AAG CCT CCT 792 Trp Ser Leu Gly Ile Thr Cys Ile Glu Leu Al - #a Glu Arg Lys Pro Pro 210 - # 215 - # 220 - - TTA TTT AAT ATG AAT GCA ATG AGT GCC TTA TA - #T CAC ATA GCC CAA AAT 840 Leu Phe Asn Met Asn Ala Met Ser Ala Leu Ty - #r His Ile Ala Gln Asn 225 2 - #30 2 - #35 2 - #40 - - GAA TCC CCT ACA CTA CAG TCT AAT GAA TGG TC - #T GAT TAT TTT CGA AAC 888 Glu Ser Pro Thr Leu Gln Ser Asn Glu Trp Se - #r Asp Tyr Phe Arg Asn 245 - # 250 - # 255 - - TTT GTA GAT TCT TGC CTC CAG AAA ATC CCT CA - #A GAT CGC CCT ACA TCA 936 Phe Val Asp Ser Cys Leu Gln Lys Ile Pro Gl - #n Asp Arg Pro Thr Ser 260 - # 265 - # 270 - - GAG GAA CTT TTA AAG CAC ATG TTT GTT CTT CG - #A GAG CGC CCT GAA ACA 984 Glu Glu Leu Leu Lys His Met Phe Val Leu Ar - #g Glu Arg Pro Glu Thr 275 - # 280 - # 285 - - GTG TTA ATA GAT CTT ATT CAA AGG ACA AAG GA - #T GCA GTA AGA GAG CTG 1032 Val Leu Ile Asp Leu Ile Gln Arg Thr Lys As - #p Ala Val Arg Glu Leu 290 - # 295 - # 300 - - GAC AAT CTA CAA TAT CGA AAG ATG AAG AAA CT - #C CTT TTC CAG GAG GCA 1080 Asp Asn Leu Gln Tyr Arg Lys Met Lys Lys Le - #u Leu Phe Gln Glu Ala 305 3 - #10 3 - #15 3 - #20 - - CAT AAT GGA CCA GCA GTA GAA GCA CAG GAA GA - #A GAG GAG GAG CAA GAT 1128 His Asn Gly Pro Ala Val 
Glu Ala Gln Glu Gl - #u Glu Glu Glu Gln Asp 325 - # 330 - # 335 - - CAT GGT GGT GGC CGG ACA GGA ACA GTA AAT AG - #T GTT GGA AGC AAT CAG 1176 His Gly Gly Gly Arg Thr Gly Thr Val Asn Se - #r Val Gly Ser Asn Gln 340 - # 345 - # 350 - - TCT ATC CCC AGT ATG TCT ATC AGT GCC AGT AG - #C CAA AGC AGC AGT GTT 1224 Ser Ile Pro Ser Met Ser Ile Ser Ala Ser Se - #r Gln Ser Ser Ser Val 355 - # 360 - # 365 - - AAT AGT CTT CCA GAT GCA TCG GAT GAC AAG AG - #T GAG CTA GAC ATG ATG 1272 Asn Ser Leu Pro Asp Ala Ser Asp Asp Lys Se - #r Glu Leu Asp Met Met 370 - # 375 - # 380 - - GAG GGA GAC CAT ACA GTG ATG TCT AAC AGT TC - #T GTC ATC CAC TTA AAA 1320 Glu Gly Asp His Thr Val Met Ser Asn Ser Se - #r Val Ile His Leu Lys 385 3 - #90 3 - #95 4 - #00 - - CCT GAG GAG GAA AAT TAC CAA GAA GAA GGA GA - #T CCT AGA ACA AGA GCA 1368 Pro Glu Glu Glu Asn Tyr Gln Glu Glu Gly As - #p Pro Arg Thr Arg Ala 405 - # 410 - # 415 - - TCA GCT CCA CAG TCT CCA CCT CAA GTG TCT CG - #T CAC AAA TCA CAT TAT 1416 Ser Ala Pro Gln Ser Pro Pro Gln Val Ser Ar - #g His Lys Ser His Tyr 420 - # 425 - # 430 - - CGT AAT AGA GAA CAC TTT GCA ACT ATA CGA AC - #A GCA TCA CTG GTT ACA 1464 Arg Asn Arg Glu His Phe Ala Thr Ile Arg Th - #r Ala Ser Leu Val Thr 435 - # 440 - # 445 - - AGA CAG ATG CAA GAA CAT GAG CAG GAC TCT GA - #A CTT AGA GAA CAG ATG 1512 Arg Gln Met Gln Glu His Glu Gln Asp Ser Gl - #u Leu Arg Glu Gln Met 450 - # 455 - # 460 - - TCT GGT TAT AAG CGG ATG AGG CGA CAG CAT CA - #G AAG CAG CTG ATG ACT 1560 Ser Gly Tyr Lys Arg Met Arg Arg Gln His Gl - #n Lys Gln Leu Met Thr 465 4 - #70 4 - #75 4 - #80 - - CTG GAA AAT AAA CTG AAG GCA GAA ATG GAC GA - #A CAT CGG CTC AGA TTA 1608 Leu Glu Asn Lys Leu Lys Ala Glu Met Asp Gl - #u His Arg Leu Arg Leu 485 - # 490 - # 495 - - GAC AAA GAT CTT GAA ACT CAG CGC AAC AAT TT - #C GCT GCA GAA ATG GAG 1656 Asp Lys Asp Leu Glu Thr Gln Arg Asn Asn Ph - #e Ala Ala Glu Met Glu 500 - # 505 - # 510 - - AAA CTT ATT AAG AAA CAC CAA GCT TCT ATG GA - #A AAA GAG GCT AAA GTG 1704 Lys Leu Ile Lys Lys His Gln 
Ala Ser Met Gl - #u Lys Glu Ala Lys Val 515 - # 520 - # 525 - - ATG GCC AAC GAG GAG AAA AAA TTC CAA CAA CA - #C ATT CAG GCT CAA CAG 1752 Met Ala Asn Glu Glu Lys Lys Phe Gln Gln Hi - #s Ile Gln Ala Gln Gln 530 - # 535 - # 540 - - AAG AAA GAA CTG AAT AGC TTT TTG GAG TCT CA - #A AAA AGA GAA TAT AAA 1800 Lys Lys Glu Leu Asn Ser Phe Leu Glu Ser Gl - #n Lys Arg Glu Tyr Lys 545 5 - #50 5 - #55 5 - #60 - - CTT CGA AAA GAG CAG CTT AAG GAG GAG CTG AA - #T GAA AAC CAG AGC ACA 1848 Leu Arg Lys Glu Gln Leu Lys Glu Glu Leu As - #n Glu Asn Gln Ser Thr 565 - # 570 - # 575 - - CCT AAA AAA GAA AAG CAG GAA TGG CTT TCA AA - #G CAG AAG GAG AAT ATT 1896 Pro Lys Lys Glu Lys Gln Glu Trp Leu Ser Ly - #s Gln Lys Glu Asn Ile 580 - # 585 - # 590 - - CAA CAT TTT CAG GCA GAA GAA GAA GCT AAT CT - #T CTT CGA CGT CAA AGG 1944 Gln His Phe Gln Ala Glu Glu Glu Ala Asn Le - #u Leu Arg Arg Gln Arg 595 - # 600 - # 605 - - CAG TAT CTA GAG CTA GAA TGT CGT CGC TTC AA - #A AGA AGA ATG TTA CTT 1992 Gln Tyr Leu Glu Leu Glu Cys Arg Arg Phe Ly - #s Arg Arg Met Leu Leu 610 - # 615 - # 620 - - GGT CGG CAT AAC TTG GAA CAG GAC CTT GTC AG - #G GAG GAG TTA AAC AAA 2040 Gly Arg His Asn Leu Glu Gln Asp Leu Val Ar - #g Glu Glu Leu Asn Lys 625 6 - #30 6 - #35 6 - #40 - - AGG CAG ACT CAG AAG GAC TTA GAA CAT GCA AT - #G TTA CTG CGA CAG CAT 2088 Arg Gln Thr Gln Lys Asp Leu Glu His Ala Me - #t Leu Leu Arg Gln His 645 - # 650 - # 655 - - GAA TCC ATG CAA GAA CTG GAG TTT CGC CAC CT - #C AAC ACT ATT CAG AAG 2136 Glu Ser Met Gln Glu Leu Glu Phe Arg His Le - #u Asn Thr Ile Gln Lys 660 - # 665 - # 670 - - ATG CGC TGT GAG TTG ATC AGA CTG CAA CAT CA - #A ACT GAG CTT ACT AAC 2184 Met Arg Cys Glu Leu Ile Arg Leu Gln His Gl - #n Thr Glu Leu Thr Asn 675 - # 680 - # 685 - - CAG CTG GAA TAC AAT AAG AGA AGG GAA CGG GA - #A CTA AGA CGG AAA CAT 2232 Gln Leu Glu Tyr Asn Lys Arg Arg Glu Arg Gl - #u Leu Arg Arg Lys His 690 - # 695 - # 700 - - GTC ATG GAA GTT CGA CAG CAG CCT AAG AGT TT - #G AAG TCT AAA GAA CTC 2280 Val Met Glu Val Arg Gln Gln Pro 
Lys Ser Le - #u Lys Ser Lys Glu Leu 705 7 - #10 7 - #15 7 - #20 - - CAA ATA AAA AAG CAG TTT CAG GAT ACC TGC AA - #A ATT CAA ACC AGA CAG 2328 Gln Ile Lys Lys Gln Phe Gln Asp Thr Cys Ly - #s Ile Gln Thr Arg Gln 725 - # 730 - # 735 - - TAC AAA GCA TTA AGG AAT CAC CTA CTG GAG AC - #T ACA CCA AAG AGT GAG 2376 Tyr Lys Ala Leu Arg Asn His Leu Leu Glu Th - #r Thr Pro Lys Ser Glu 740 - # 745 - # 750 - - CAC AAA GCT GTT CTG AAA AGA CTC AAG GAG GA - #A CAG ACT CGG AAG TTA 2424 His Lys Ala Val Leu Lys Arg Leu Lys Glu Gl - #u Gln Thr Arg Lys Leu 755 - # 760 - # 765 - - GCC ATC TTG GCT GAG CAG TAT GAT CAT AGC AT - #T AAT GAA ATG CTC TCC 2472 Ala Ile Leu Ala Glu Gln Tyr Asp His Ser Il - #e Asn Glu Met Leu Ser 770 - # 775 - # 780 - - ACA CAA GCT CTG CGT TTG GAT GAA GCA CAG GA - #A GCA GAA TGC CAG GTT 2520 Thr Gln Ala Leu Arg Leu Asp Glu Ala Gln Gl - #u Ala Glu Cys Gln Val 785 7 - #90 7 - #95 8 - #00 - - TTG AAG ATG CAG CTA CAG CAG GAA CTG GAG CT - #G TTG AAT GCA TAT CAG 2568 Leu Lys Met Gln Leu Gln Gln Glu Leu Glu Le - #u Leu Asn Ala Tyr Gln 805 - # 810 - # 815 - - AGC AAA ATC AAG ATG CAG GCT GAG GCC CAA CA - #T GAT CGA GAG CTT CGA 2616 Ser Lys Ile Lys Met Gln Ala Glu Ala Gln Hi - #s Asp Arg Glu Leu Arg 820 - # 825 - # 830 - - GAG CTG GAA CAA AGG GTC TCC CTT CGG AGA GC - #A CTC TTA GAA CAG AAG 2664 Glu Leu Glu Gln Arg Val Ser Leu Arg Arg Al - #a Leu Leu Glu Gln Lys 835 - # 840 - # 845 - - ATT GAA GAA GAG ATG TTG GCT TTG CAG AAT GA - #A CGC ACA GAA CGA ATA 2712 Ile Glu Glu Glu Met Leu Ala Leu Gln Asn Gl - #u Arg Thr Glu Arg Ile 850 - # 855 - # 860 - - CGT AGC CTG CTC GAG CGC CAG GCC AGA GAA AT - #T GAA GCT TTT GAC TCT 2760 Arg Ser Leu Leu Glu Arg Gln Ala Arg Glu Il - #e Glu Ala Phe Asp Ser 865 8 - #70 8 - #75 8 - #80 - - GAA AGC ATG AGA TTA GGT TTT AGT AAC ATG GT - #C CTT TCT AAT CTC TCC 2808 Glu Ser Met Arg Leu Gly Phe Ser Asn Met Va - #l Leu Ser Asn Leu Ser 885 - # 890 - # 895 - - CCT GAG GCA TTC AGC CAC AGC TAC CCA GGA GC - #T TCT AGC TGG TCT CAC 2856 Pro Glu Ala Phe Ser His Ser 
Tyr Pro Gly Al - #a Ser Ser Trp Ser His
900 - # 905 - # 910 - - AAT CCT ACT GGG GGT TCA GGA CCT CAC TGG GG - #T CAT CCC ATG GGT GGC 2904 Asn Pro Thr Gly Gly Ser Gly Pro His Trp Gl - #y His Pro Met Gly Gly 915 - # 920 - # 925 - - ACA CCA CAA GCT TGG GGT CAT CCG ATG CAA GG - #C GGA CCC CAA CCA TGG 2952 Thr Pro Gln Ala Trp Gly His Pro Met Gln Gl - #y Gly Pro Gln Pro Trp 930 - # 935 - # 940 - - GGT CAC CCC TCA GGG CCA ATG CAA GGG GTA CC - #T CGA GGT AGC AGT ATA 3000 Gly His Pro Ser Gly Pro Met Gln Gly Val Pr - #o Arg Gly Ser Ser Ile 945 9 - #50 9 - #55 9 - #60 - - GGA GTC CGC AAT AGC CCC CAG GCT CTG AGG CG - #G ACA GCT TCT GGG GGA 3048 Gly Val Arg Asn Ser Pro Gln Ala Leu Arg Ar - #g Thr Ala Ser Gly Gly 965 - # 970 - # 975 - - CGG ACG GAA CAG GGC ATG AGC AGA AGC ACG AG - #T GTC ACT TCA CAA ATA 3096 Arg Thr Glu Gln Gly Met Ser Arg Ser Thr Se - #r Val Thr Ser Gln Ile 980 - # 985 - # 990 - - TCC AAT GGG TCA CAC ATG TCT TAC ACA TAATAATTG - #A AAGTGGCAAT 3143 Ser Asn Gly Ser His Met Ser Tyr Thr 995 - # 1000 - - TCCGCTGGAG CTGTCTGCCA AAAGAAACTG CCTACAGACA TCAGCACAGC AG - #CCTCCTCA 3203 - - CTTGGGTACT ACCGGGTGGA AGCTGTGCAT ATGGTATATT TTATTCGTCT TT - #GTAAAGCG 3263 - - TTATGTTTTG TGTTTACTAA TTGGGATGTC ATAGTATTTG GCTGCCGGG - # 3312 - - - - (2) INFORMATION FOR SEQ ID NO: 2: - - (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1001 amino - #acids (B) TYPE: amino acid (D) TOPOLOGY: linear - - (ii) MOLECULE TYPE: protein - - (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - #2: - - Met Pro Ser Thr Asn Arg Ala Gly Ser Leu Ly - #s Asp Pro Glu Ile Ala 1 5 - # 10 - # 15 - - Glu Leu Phe Phe Lys Glu Asp Pro Glu Lys Le - #u Phe Thr Asp Leu Arg 20 - # 25 - # 30 - - Glu Ile Gly His Gly Ser Phe Gly Ala Val Ty - #r Phe Ala Arg Asp Val 35 - # 40 - # 45 - - Arg Thr Asn Glu Val Val Ala Ile Lys Lys Me - #t Ser Tyr Ser Gly Lys 50 - # 55 - # 60 - - Gln Ser Thr Glu Lys Trp Gln Asp Ile Ile Ly - #s Glu Val Lys Phe Leu 65 - # 70 - # 75 - # 80 - - Gln Arg Ile Lys His Pro Asn Ser Ile Glu Ty - #r Lys Gly Cys Tyr Leu 85 - # 90 - # 95 - - Arg Glu His Thr Ala Trp 
Leu Val Met Glu Ty - #r Cys Leu Gly Ser Ala 100 - # 105 - # 110 - - Ser Asp Leu Leu Glu Val His Lys Lys Pro Le - #u Gln Glu Val Glu Ile 115 - # 120 - # 125 - - Ala Ala Ile Thr His Gly Ala Leu Gln Gly Le - #u Ala Tyr Leu His Ser 130 - # 135 - # 140 - - His Thr Met Ile His Arg Asp Ile Lys Ala Gl - #y Asn Ile Leu Leu Thr 145 1 - #50 1 - #55 1 - #60 - - Glu Pro Gly Gln Val Lys Leu Ala Asp Phe Gl - #y Ser Ala Ser Met Ala 165 - # 170 - # 175 - - Ser Pro Ala Asn Ser Phe Val Gly Thr Pro Ty - #r Trp Met Ala Pro Glu 180 - # 185 - # 190 - - Val Ile Leu Ala Met Asp Glu Gly Gln Tyr As - #p Gly Lys Val Asp Val 195 - # 200 - # 205 - - Trp Ser Leu Gly Ile Thr Cys Ile Glu Leu Al - #a Glu Arg Lys Pro Pro 210 - # 215 - # 220 - - Leu Phe Asn Met Asn Ala Met Ser Ala Leu Ty - #r His Ile Ala Gln Asn 225 2 - #30 2 - #35 2 - #40 - - Glu Ser Pro Thr Leu Gln Ser Asn Glu Trp Se - #r Asp Tyr Phe Arg Asn 245 - # 250 - # 255 - - Phe Val Asp Ser Cys Leu Gln Lys Ile Pro Gl - #n Asp Arg Pro Thr Ser 260 - # 265 - # 270 - - Glu Glu Leu Leu Lys His Met Phe Val Leu Ar - #g Glu Arg Pro Glu Thr 275 - # 280 - # 285 - - Val Leu Ile Asp Leu Ile Gln Arg Thr Lys As - #p Ala Val Arg Glu Leu 290 - # 295 - # 300 - - Asp Asn Leu Gln Tyr Arg Lys Met Lys Lys Le - #u Leu Phe Gln Glu Ala 305 3 - #10 3 - #15 3 - #20 - - His Asn Gly Pro Ala Val Glu Ala Gln Glu Gl - #u Glu Glu Glu Gln Asp 325 - # 330 - # 335 - - His Gly Gly Gly Arg Thr Gly Thr Val Asn Se - #r Val Gly Ser Asn Gln 340 - # 345 - # 350 - - Ser Ile Pro Ser Met Ser Ile Ser Ala Ser Se - #r Gln Ser Ser Ser Val 355 - # 360 - # 365 - - Asn Ser Leu Pro Asp Ala Ser Asp Asp Lys Se - #r Glu Leu Asp Met Met 370 - # 375 - # 380 - - Glu Gly Asp His Thr Val Met Ser Asn Ser Se - #r Val Ile His Leu Lys 385 3 - #90 3 - #95 4 - #00 - - Pro Glu Glu Glu Asn Tyr Gln Glu Glu Gly As - #p Pro Arg Thr Arg Ala 405 - # 410 - # 415 - - Ser Ala Pro Gln Ser Pro Pro Gln Val Ser Ar - #g His Lys Ser His Tyr 420 - # 425 - # 430 - - Arg Asn Arg Glu His Phe Ala Thr Ile Arg Th - #r Ala Ser Leu 
Val Thr 435 - # 440 - # 445 - - Arg Gln Met Gln Glu His Glu Gln Asp Ser Gl - #u Leu Arg Glu Gln Met 450 - # 455 - # 460 - - Ser Gly Tyr Lys Arg Met Arg Arg Gln His Gl - #n Lys Gln Leu Met Thr 465 4 - #70 4 - #75 4 - #80 - - Leu Glu Asn Lys Leu Lys Ala Glu Met Asp Gl - #u His Arg Leu Arg Leu 485 - # 490 - # 495 - - Asp Lys Asp Leu Glu Thr Gln Arg Asn Asn Ph - #e Ala Ala Glu Met Glu 500 - # 505 - # 510 - - Lys Leu Ile Lys Lys His Gln Ala Ser Met Gl - #u Lys Glu Ala Lys Val 515 - # 520 - # 525 - - Met Ala Asn Glu Glu Lys Lys Phe Gln Gln Hi - #s Ile Gln Ala Gln Gln 530 - # 535 - # 540 - - Lys Lys Glu Leu Asn Ser Phe Leu Glu Ser Gl - #n Lys Arg Glu Tyr Lys 545 5 - #50 5 - #55 5 - #60 - - Leu Arg Lys Glu Gln Leu Lys Glu Glu Leu As - #n Glu Asn Gln Ser Thr 565 - # 570 - # 575 - - Pro Lys Lys Glu Lys Gln Glu Trp Leu Ser Ly - #s Gln Lys Glu Asn Ile 580 - # 585 - # 590 - - Gln His Phe Gln Ala Glu Glu Glu Ala Asn Le - #u Leu Arg Arg Gln Arg 595 - # 600 - # 605 - - Gln Tyr Leu Glu Leu Glu Cys Arg Arg Phe Ly - #s Arg Arg Met Leu Leu 610 - # 615 - # 620 - - Gly Arg His Asn Leu Glu Gln Asp Leu Val Ar - #g Glu Glu Leu Asn Lys 625 6 - #30 6 - #35 6 - #40 - - Arg Gln Thr Gln Lys Asp Leu Glu His Ala Me - #t Leu Leu Arg Gln His 645 - # 650 - # 655 - - Glu Ser Met Gln Glu Leu Glu Phe Arg His Le - #u Asn Thr Ile Gln Lys 660 - # 665 - # 670 - - Met Arg Cys Glu Leu Ile Arg Leu Gln His Gl - #n Thr Glu Leu Thr Asn 675 - # 680 - # 685 - - Gln Leu Glu Tyr Asn Lys Arg Arg Glu Arg Gl - #u Leu Arg Arg Lys His 690 - # 695 - # 700 - - Val Met Glu Val Arg Gln Gln Pro Lys Ser Le - #u Lys Ser Lys Glu Leu 705 7 - #10 7 - #15 7 - #20 - - Gln Ile Lys Lys Gln Phe Gln Asp Thr Cys Ly - #s Ile Gln Thr Arg Gln 725 - # 730 - # 735 - - Tyr Lys Ala Leu Arg Asn His Leu Leu Glu Th - #r Thr Pro Lys Ser Glu 740 - # 745 - # 750 - - His Lys Ala Val Leu Lys Arg Leu Lys Glu Gl - #u Gln Thr Arg Lys Leu 755 - # 760 - # 765 - - Ala Ile Leu Ala Glu Gln Tyr Asp His Ser Il - #e Asn Glu Met Leu Ser 770 - # 775 - # 780 - - Thr 
Gln Ala Leu Arg Leu Asp Glu Ala Gln Gl - #u Ala Glu Cys Gln Val 785 7 - #90 7 - #95 8 - #00 - - Leu Lys Met Gln Leu Gln Gln Glu Leu Glu Le - #u Leu Asn Ala Tyr Gln 805 - # 810 - # 815 - - Ser Lys Ile Lys Met Gln Ala Glu Ala Gln Hi - #s Asp Arg Glu Leu Arg 820 - # 825 - # 830 - - Glu Leu Glu Gln Arg Val Ser Leu Arg Arg Al - #a Leu Leu Glu Gln Lys 835 - # 840 - # 845 - - Ile Glu Glu Glu Met Leu Ala Leu Gln Asn Gl - #u Arg Thr Glu Arg Ile 850 - # 855 - # 860 - - Arg Ser Leu Leu Glu Arg Gln Ala Arg Glu Il - #e Glu Ala Phe Asp Ser 865 8 - #70 8 - #75 8 - #80 - - Glu Ser Met Arg Leu Gly Phe Ser Asn Met Va - #l Leu Ser Asn Leu Ser 885 - # 890 - # 895 - - Pro Glu Ala Phe Ser His Ser Tyr Pro Gly Al - #a Ser Ser Trp Ser His 900 - # 905 - # 910 - - Asn Pro Thr Gly Gly Ser Gly Pro His Trp Gl - #y His Pro Met Gly Gly 915 - # 920 - # 925 - - Thr Pro Gln Ala Trp Gly His Pro Met Gln Gl - #y Gly Pro Gln Pro Trp 930 - # 935 - # 940 - - Gly His Pro Ser Gly Pro Met Gln Gly Val Pr - #o Arg Gly Ser Ser Ile 945 9 - #50 9 - #55 9 - #60 - - Gly Val Arg Asn Ser Pro Gln Ala Leu Arg Ar - #g Thr Ala Ser Gly Gly 965 - # 970 - # 975 - - Arg Thr Glu Gln Gly Met Ser Arg Ser Thr Se - #r Val Thr Ser Gln Ile 980 - # 985 - # 990 - - Ser Asn Gly Ser His Met Ser Tyr Thr 995 - # 1000 - - - - (2) INFORMATION FOR SEQ ID NO: 3: - - (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 4296 base - #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - - (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 193..3171 - - (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - #3: - - AGGGGAGGCT TCCCGGGCCC GCCCCTCAGG AAGGGCGAAA GCTGAGGAAG AG - #GTGGCGAG 60 - - GGGGAAGGTC TCCTTGCCCC TCTCCCCGCT TGTCAGAGCA ACTGGAGTAC CC - #CAGGCGGA 120 - - AGCGGAGGCG CTGGGGCACC ATAGTGACCC CTACCAGGCA AGATCCCAAT TT - #CAGGGCCC 180 - - CCAGGGGCCA TC ATG CCA GCT GGG GGC CGG GCC GGG - # AGC CTG AAG GAC 228 Met Pro Al - #a Gly Gly Arg Ala Gly Ser Leu Lys Asp 1 - # 5 - # 10 - - CCT GAT GTA GCT GAG CTC TTC TTC AAA GAT GA - #C CCT GAG AAG CTT TTC 276 
Pro Asp Val Ala Glu Leu Phe Phe Lys Asp As - #p Pro Glu Lys Leu Phe 15 - # 20 - # 25 - - TCT GAC CTC CGG GAA ATT GGC CAT GGC AGT TT - #T GGA GCT GTG TAC TTT 324 Ser Asp Leu Arg Glu Ile Gly His Gly Ser Ph - #e Gly Ala Val Tyr Phe 30 - # 35 - # 40 - - GCC CGG GAT GTC CGG AAC AGT GAG GTG GTG GC - #C ATC AAG AAG ATG TCC 372 Ala Arg Asp Val Arg Asn Ser Glu Val Val Al - #a Ile Lys Lys Met Ser 45 - # 50 - # 55 - # 60 - - TAT AGT GGG AAG CAA TCA AAT GAG AAA TGG CA - #G GAT ATC ATC AAG GAG 420 Tyr Ser Gly Lys Gln Ser Asn Glu Lys Trp Gl - #n Asp Ile Ile Lys Glu 65 - # 70 - # 75 - - GTG CGG TTC TTA CAG AAG CTA CGG CAT CCT AA - #T ACC ATT CAG TAC CGG 468 Val Arg Phe Leu Gln Lys Leu Arg His Pro As - #n Thr Ile Gln Tyr Arg 80 - # 85 - # 90 - - GGC TGT TAC CTG AGG GAG CAC ACA GCT TGG CT - #G GTG ATG GAG TAT TGC 516 Gly Cys Tyr Leu Arg Glu His Thr Ala Trp Le - #u Val Met Glu Tyr Cys 95 - # 100 - # 105 - - CTG GGT TCA GCT TCT GAT CTT CTC GAA GTG CA - #C AAG AAG CCG CTG CAG 564 Leu Gly Ser Ala Ser Asp Leu Leu Glu Val Hi - #s Lys Lys Pro Leu Gln 110 - # 115 - # 120 - - GAG GTA GAG ATT GCA GCT GTG ACC CAT GGT GC - #G CTT CAG GGC CTG GCC 612 Glu Val Glu Ile Ala Ala Val Thr His Gly Al - #a Leu Gln Gly Leu Ala 125 1 - #30 1 - #35 1 - #40 - - TAT CTA CAT TCA CAC AAC ATG ATC CAT AGA GA - #T GTG AAG GCT GGG AAC 660 Tyr Leu His Ser His Asn Met Ile His Arg As - #p Val Lys Ala Gly Asn 145 - # 150 - # 155 - - ATC TTG CTG TCA GAA CCA GGC TTG GTG AAA CT - #G GGG GAC TTT GGC TCC 708 Ile Leu Leu Ser Glu Pro Gly Leu Val Lys Le - #u Gly Asp Phe Gly Ser 160 - # 165 - # 170 - - GCA TCC ATC ATG GCA CCT GCC AAC TCA TTT GT - #G GGC ACT CCA TAC TGG 756 Ala Ser Ile Met Ala Pro Ala Asn Ser Phe Va - #l Gly Thr Pro Tyr Trp 175 - # 180 - # 185 - - ATG GCT CCA GAG GTG ATC CTA GCC ATG GAT GA - #G GGA CAA TAT GAT GGC 804 Met Ala Pro Glu Val Ile Leu Ala Met Asp Gl - #u Gly Gln Tyr Asp Gly
190 - # 195 - # 200 - - AAA GTG GAT GTC TGG TCC TTG GGG ATA ACC TG - #T ATT GAG CTA GCG GAG 852 Lys Val Asp Val Trp Ser Leu Gly Ile Thr Cy - #s Ile Glu Leu Ala Glu 205 2 - #10 2 - #15 2 - #20 - - CGG AAG CCA CCA CTG TTT AAC ATG AAT GCA AT - #G AGT GCC TTA TAC CAC 900 Arg Lys Pro Pro Leu Phe Asn Met Asn Ala Me - #t Ser Ala Leu Tyr His 225 - # 230 - # 235 - - ATT GCA CAG AAT GAA TCC CCT GCT CTC CAG TC - #A GGA CAC TGG TCT GAG 948 Ile Ala Gln Asn Glu Ser Pro Ala Leu Gln Se - #r Gly His Trp Ser Glu 240 - # 245 - # 250 - - TAC TTC CGG AAT TTT GTT GAC TCC TGT CTT CA - #G AAA ATC CCT CAA GAC 996 Tyr Phe Arg Asn Phe Val Asp Ser Cys Leu Gl - #n Lys Ile Pro Gln Asp 255 - # 260 - # 265 - - AGA CCA ACC TCA GAG GTT CTT TTG AAG CAC CG - #C TTT GTG CTC CGG GAG 1044 Arg Pro Thr Ser Glu Val Leu Leu Lys His Ar - #g Phe Val Leu Arg Glu 270 - # 275 - # 280 - - CGG CCA CCC ACA GTC ATC ATG GAC CTA ATT CA - #G AGG ACC AAG GAT GCT 1092 Arg Pro Pro Thr Val Ile Met Asp Leu Ile Gl - #n Arg Thr Lys Asp Ala 285 2 - #90 2 - #95 3 - #00 - - GTA CGG GAA CTA GAT AAC CTG CAG TAC CGA AA - #G ATG AAG AAG ATA CTA 1140 Val Arg Glu Leu Asp Asn Leu Gln Tyr Arg Ly - #s Met Lys Lys Ile Leu 305 - # 310 - # 315 - - TTC CAA GAG GCA CCC AAT GGC CCT GGT GCT GA - #G GCC CCA GAG GAA GAG 1188 Phe Gln Glu Ala Pro Asn Gly Pro Gly Ala Gl - #u Ala Pro Glu Glu Glu 320 - # 325 - # 330 - - GAG GAA GCA GAA CCT TAC ATG CAC CGA GCA GG - #G ACA CTG ACC AGT CTA 1236 Glu Glu Ala Glu Pro Tyr Met His Arg Ala Gl - #y Thr Leu Thr Ser Leu 335 - # 340 - # 345 - - GAG AGT AGC CAT TCA GTG CCC AGC ATG TCC AT - #C AGC GCC TCC AGC CAA 1284 Glu Ser Ser His Ser Val Pro Ser Met Ser Il - #e Ser Ala Ser Ser Gln 350 - # 355 - # 360 - - AGC AGC TCA GTC AAC AGC CTA GCA GAT GCC TC - #A GAT AAT GAA GAA GAG 1332 Ser Ser Ser Val Asn Ser Leu Ala Asp Ala Se - #r Asp Asn Glu Glu Glu 365 3 - #70 3 - #75 3 - #80 - - GAG GAG GAG GAA GAG GAA GAA GAA GAG GAG GA - #G GAA GAA GAA GGC CCT 1380 Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Gl - #u Glu Glu Glu Gly Pro 
385 - # 390 - # 395 - - GAA TCC CGA GAG ATG GCC ATG ATG CAG GAG GG - #G GAG CAT ACA GTC ACT 1428 Glu Ser Arg Glu Met Ala Met Met Gln Glu Gl - #y Glu His Thr Val Thr 400 - # 405 - # 410 - - TCC CAC AGC TCC ATC ATC CAC CGG CTG CCG GG - #C TCA GAC AAC CTA TAT 1476 Ser His Ser Ser Ile Ile His Arg Leu Pro Gl - #y Ser Asp Asn Leu Tyr 415 - # 420 - # 425 - - GAT GAT CCC TAC CAG CCA GAG ATG ACC CCA GG - #T CCA CTC CAA CCA CCT 1524 Asp Asp Pro Tyr Gln Pro Glu Met Thr Pro Gl - #y Pro Leu Gln Pro Pro 430 - # 435 - # 440 - - GCA GCC CCT CCC ACC TCC ACC TCC TCC TCT TC - #T GCT CGC CGC AGA GCT 1572 Ala Ala Pro Pro Thr Ser Thr Ser Ser Ser Se - #r Ala Arg Arg Arg Ala 445 4 - #50 4 - #55 4 - #60 - - TAT TGC CGC AAC CGA GAC CAC TTT GCC ACC AT - #C CGT ACT GCC TCC CTG 1620 Tyr Cys Arg Asn Arg Asp His Phe Ala Thr Il - #e Arg Thr Ala Ser Leu 465 - # 470 - # 475 - - GTC AGC CGT CAG ATC CAG GAG CAT GAG CAG GA - #C TCG GCC CTG CGG GAG 1668 Val Ser Arg Gln Ile Gln Glu His Glu Gln As - #p Ser Ala Leu Arg Glu 480 - # 485 - # 490 - - CAA CTA AGT GGC TAC AAG CGG ATG CGG CGT CA - #G CAC CAG AAG CAA CTG 1716 Gln Leu Ser Gly Tyr Lys Arg Met Arg Arg Gl - #n His Gln Lys Gln Leu 495 - # 500 - # 505 - - CTG GCC CTG GAG TCC CGT CTG AGG GGT GAA CG - #T GAG GAG CAC AGT GGG 1764 Leu Ala Leu Glu Ser Arg Leu Arg Gly Glu Ar - #g Glu Glu His Ser Gly 510 - # 515 - # 520 - - CGG TTG CAG CGT GAA CTC GAG GCA CAG CGG GC - #T GGC TTT GGG ACT GAG 1812 Arg Leu Gln Arg Glu Leu Glu Ala Gln Arg Al - #a Gly Phe Gly Thr Glu 525 5 - #30 5 - #35 5 - #40 - - GCT GAG AAG CTG GCC CGG AGG CAC CAG GCC AT - #T GGT GAG AAG GAA GCA 1860 Ala Glu Lys Leu Ala Arg Arg His Gln Ala Il - #e Gly Glu Lys Glu Ala 545 - # 550 - # 555 - - CGA GCT GCT CAG GCT GAG GAG CGG AAG TTC CA - #G CAG CAC ATC TTG GGG 1908 Arg Ala Ala Gln Ala Glu Glu Arg Lys Phe Gl - #n Gln His Ile Leu Gly 560 - # 565 - # 570 - - CAG CAG AAG AAG GAA CTG GCT GCC CTG CTG GA - #G GCA CAG AAG CGA ACC 1956 Gln Gln Lys Lys Glu Leu Ala Ala Leu Leu Gl - #u Ala Gln Lys Arg Thr 575 
- # 580 - # 585 - - TAT AAG CTT CGG AAG GAG CAG TTG AAA GAG GA - #G CTC CAG GAG AAC CCT 2004 Tyr Lys Leu Arg Lys Glu Gln Leu Lys Glu Gl - #u Leu Gln Glu Asn Pro 590 - # 595 - # 600 - - AGC ACA CCC AAA CGA GAG AAG GCT GAG TGG CT - #G TTG AGG CAG AAA GAG 2052 Ser Thr Pro Lys Arg Glu Lys Ala Glu Trp Le - #u Leu Arg Gln Lys Glu 605 6 - #10 6 - #15 6 - #20 - - CAG TTG CAA CAG TGC CAG GCA GAG GAG GAG GC - #A GGG CTA CTG CGG AGG 2100 Gln Leu Gln Gln Cys Gln Ala Glu Glu Glu Al - #a Gly Leu Leu Arg Arg 625 - # 630 - # 635 - - CAA CGC CAG TAC TTT GAG CTT CAG TGT CGC CA - #A TAC AAG CGC AAG ATG 2148 Gln Arg Gln Tyr Phe Glu Leu Gln Cys Arg Gl - #n Tyr Lys Arg Lys Met 640 - # 645 - # 650 - - CTA CTG GCT CGG CAC AGC CTA GAC CAG GAC CT - #G CTT CGA GAG GAC TTG 2196 Leu Leu Ala Arg His Ser Leu Asp Gln Asp Le - #u Leu Arg Glu Asp Leu 655 - # 660 - # 665 - - AAT AAG AAA CAG ACA CAG AAG GAC TTG GAG TG - #T GCT CTG CTG TTA CGG 2244 Asn Lys Lys Gln Thr Gln Lys Asp Leu Glu Cy - #s Ala Leu Leu Leu Arg 670 - # 675 - # 680 - - CAG CAT GAG GCT ACC CGA GAG CTG GAG CTA CG - #A CAG CTC CAG GCT GTC 2292 Gln His Glu Ala Thr Arg Glu Leu Glu Leu Ar - #g Gln Leu Gln Ala Val 685 6 - #90 6 - #95 7 - #00 - - CAG CGC ACA CGT GCT GAA CTC ACC CGC CTT CA - #G CAC CAG ACA GAG CTA 2340 Gln Arg Thr Arg Ala Glu Leu Thr Arg Leu Gl - #n His Gln Thr Glu Leu 705 - # 710 - # 715 - - GGC AAC CAG TTG GAG TAC AAC AAG CGA CGG GA - #G CAA GAG TTG CGG CAG 2388 Gly Asn Gln Leu Glu Tyr Asn Lys Arg Arg Gl - #u Gln Glu Leu Arg Gln 720 - # 725 - # 730 - - AAG CAC GCG GCC CAG GTT CGC CAG CAG CCC AA - #G AGC CTC AAA GTA CGT 2436 Lys His Ala Ala Gln Val Arg Gln Gln Pro Ly - #s Ser Leu Lys Val Arg 735 - # 740 - # 745 - - GCA GGC CAG CTA CCC ATG GGC CTC CCT GCT AC - #C GGG GCT CTG GGA CCA 2484 Ala Gly Gln Leu Pro Met Gly Leu Pro Ala Th - #r Gly Ala Leu Gly Pro 750 - # 755 - # 760 - - CTC AGC ACA GGC ACC CTT AGT GAA GAG CAG CC - #C TGC TCA TCT GGC CAG 2532 Leu Ser Thr Gly Thr Leu Ser Glu Glu Gln Pr - #o Cys Ser Ser Gly Gln 765 7 - 
#70 7 - #75 7 - #80

GAG GCA ATC CTG GGC CAA AGG ATG CTG GGA GAG GAG GAG GAA GCA GTG  2580
Glu Ala Ile Leu Gly Gln Arg Met Leu Gly Glu Glu Glu Glu Ala Val  796

CCA GAG AGA ATG ATT CTG GGA AAG GAA GGG ACT ACT TTG GAG CCA GAG  2628
Pro Glu Arg Met Ile Leu Gly Lys Glu Gly Thr Thr Leu Glu Pro Glu  812

GAG CAG AGG ATT CTG GGG GAA GAA ATG GGA ACC TTT AGT TCC AGC CCA  2676
Glu Gln Arg Ile Leu Gly Glu Glu Met Gly Thr Phe Ser Ser Ser Pro  828

CAA AAA CAT AGG AGT CTG GTT AAT GAG GAA GAT TGG GAT ATA TCT AAA  2724
Gln Lys His Arg Ser Leu Val Asn Glu Glu Asp Trp Asp Ile Ser Lys  844

GAA ATG AAG GAG AGT AGA GTC CCA TCC CTG GCA TCC CAG GAG AGA AAT  2772
Glu Met Lys Glu Ser Arg Val Pro Ser Leu Ala Ser Gln Glu Arg Asn  860

ATT ATT GGC CAG GAA GAG GCT GGG GCA TGG AAT CTG TGG GAG AAG GAG  2820
Ile Ile Gly Gln Glu Glu Ala Gly Ala Trp Asn Leu Trp Glu Lys Glu  876

CAT GGA AAC CTT GTG GAT ATG GAG TTC AAG CTT GGC TGG GTC CAG GGT  2868
His Gly Asn Leu Val Asp Met Glu Phe Lys Leu Gly Trp Val Gln Gly  892

CCA GTT CTG ACT CCA GTG CCT GAG GAG GAA GAG GAG GAG GAA GAG GAG  2916
Pro Val Leu Thr Pro Val Pro Glu Glu Glu Glu Glu Glu Glu Glu Glu  908

GGA GGG GCT CCA ATT GGA ACC CCC AGG GAC CCT GGA GAT GGC TGT CCT  2964
Gly Gly Ala Pro Ile Gly Thr Pro Arg Asp Pro Gly Asp Gly Cys Pro  924

TCC CCA GAT ATC CCC CCA GAG CCA CCT CCA TCA CAT CTG AGA CAG TAC  3012
Ser Pro Asp Ile Pro Pro Glu Pro Pro Pro Ser His Leu Arg Gln Tyr  940

CCT GCT AGC CAG CTT CCT GGA TTC TTG TCT CAT GGC CTC CTG ACT GGC  3060
Pro Ala Ser Gln Leu Pro Gly Phe Leu Ser His Gly Leu Leu Thr Gly  956

CTC TCC TTT GCA GTG GGG TCC TCC TCT GGC CTC TTG CCC CTA CTA CTT  3108
Leu Ser Phe Ala Val Gly Ser Ser Ser Gly Leu Leu Pro Leu Leu Leu  972

CTG CTG CTA CTC CCA TTG CTG GCA CCC AGG TGG AGG TGG CTT GCA GGC  3156
Leu Leu Leu Leu Pro Leu Leu Ala Pro Arg Trp Arg Trp Leu Ala Gly  988

AGC ACT GCT GGC CCT T GAGGTAGGAC TAGTGGGCCT GGGGGCTTCA  3202
Ser Thr Ala Gly Pro  993

TACCTGTTCC TTTGTACAGC TCTACACCTG CCACCCAGTC TGTTCTTACT CCTGGCTCAG  3262
GGCACTGCAC TGGGGGCTGT CCTTAGCCTG AGCTGGCGCA GAGGCCTTAT GGGTGTGCCT  3322
CTGGGCCTTG GGGCTGCCTG GCTCCTAGCT TGGCCCAGCC TGGCTTTACC TCTGGCAGCT  3382
ATGGCGGCTG GGGGCAAATG GGTACGGCAG CAAGGCCCCC AGATGCGTCG GGGCATCTCT  3442
CGACTCTGGT TGCGGGTTCT GCTACGCCTG TCACCCATGG TCTTTCGGGC CCTACAGGGC  3502
TGTGCGGCTG TGGGAGACCG GGGGCTGTTT GCCCTGTACC CTAAGACCAA TAAGAATGGT  3562
TTCCGAAGTC GACTGCCTGT CCCTTGGCCC CGTCAGGGAA ATCCTCGCAC TACACAGCAC  3622
CCACTAGCTC TGTTAGCAAG AGTTTGGGCT CTGTGCAAGG GCTGGAACTG GCGCCTAGCA  3682
CGGGCTAGCC ATAGATTAGC TTCTTGTTTG CCCCCCTGGG CTGTTCATAT ACTAGCTAGC  3742
TGGGGCCTGC TTAAGGGTGA AAGGCCCAGT CGGATCCCTC GGCTGCTACC GCGAAGCCAA  3802
CGCCGTCTTG GGCTCTCAGC TTCCCGACAG CTACCACCAG GGACTGTAGC TGGGCGGAGA  3862
TCTCAGACCC GCAGGGCCCT GCCTCCCTGG AGGTAACCAG TTCTAACCCT CCACCCAAAT  3922
TTAGGGCATT GAGCACTTTA TCTCCCATGA CTCAGTAAAG TCTCTCCAGT CCCTTGGCCT  3982
CTCCTCCCCT TCTGACCTTT CTTCCTCAGT ATGTTTCCCC AGGTCCAATC CCAGCCCCAG  4042
ATGTAGATTT CTAGACAGGC AGCCTCCTCT ACTGTGGAGT CCAGAATGAC ACTCTTGTGT  4102
TTTCCCCAGT CCCCTAAGTT ATTGCTGTCC CCTGCTGTGT GTGTGCTCAT CCTCACCCTC  4162
ATCGGCTCAG GCCTGGGGCC AGGGGTGGCA GGGAGGGAAG TCATGGGGGT TTTCCCTCTT  4222
TGATTTTGTT TTTCTGTCTC CCTTCCAACC TGTCCCCTTC CCCTCCACCA AAAGAGAAAA  4282
AAAAAAAAAA AAAA  4296

(2) INFORMATION FOR SEQ ID NO: 4:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 993 amino acids
    (B) TYPE: amino acid
    (D) TOPOLOGY: linear

  (ii) MOLECULE TYPE: protein

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Pro Ala Gly Gly Arg Ala Gly Ser Leu Lys Asp Pro Asp Val Ala  16
Glu Leu Phe Phe Lys Asp Asp Pro Glu Lys Leu Phe Ser Asp Leu Arg  32
Glu Ile Gly His Gly Ser Phe Gly Ala Val Tyr Phe Ala Arg Asp Val  48
Arg Asn Ser Glu Val Val Ala Ile Lys Lys Met Ser Tyr Ser Gly Lys  64
Gln Ser Asn Glu Lys Trp Gln Asp Ile Ile Lys Glu Val Arg Phe Leu  80
Gln Lys Leu Arg His Pro Asn Thr Ile Gln Tyr Arg Gly Cys Tyr Leu  96
Arg Glu His Thr Ala Trp Leu Val Met Glu Tyr Cys Leu Gly Ser Ala  112
Ser Asp Leu Leu Glu Val His Lys Lys Pro Leu Gln Glu Val Glu Ile  128
Ala Ala Val Thr His Gly Ala Leu Gln Gly Leu Ala Tyr Leu His Ser  144
His Asn Met Ile His Arg Asp Val Lys Ala Gly Asn Ile Leu Leu Ser  160
Glu Pro Gly Leu Val Lys Leu Gly Asp Phe Gly Ser Ala Ser Ile Met  176
Ala Pro Ala Asn Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu  192
Val Ile Leu Ala Met Asp Glu Gly Gln Tyr Asp Gly Lys Val Asp Val  208
Trp Ser Leu Gly Ile Thr Cys Ile Glu Leu Ala Glu Arg Lys Pro Pro  224
Leu Phe Asn Met Asn Ala Met Ser Ala Leu Tyr His Ile Ala Gln Asn  240
Glu Ser Pro Ala Leu Gln Ser Gly His Trp Ser Glu Tyr Phe Arg Asn  256
Phe Val Asp Ser Cys Leu Gln Lys Ile Pro Gln Asp Arg Pro Thr Ser  272
Glu Val Leu Leu Lys His Arg Phe Val Leu Arg Glu Arg Pro Pro Thr  288
Val Ile Met Asp Leu Ile Gln Arg Thr Lys Asp Ala Val Arg Glu Leu  304
Asp Asn Leu Gln Tyr Arg Lys Met Lys Lys Ile Leu Phe Gln Glu Ala  320
Pro Asn Gly Pro Gly Ala Glu Ala Pro Glu Glu Glu Glu Glu Ala Glu  336
Pro Tyr Met His Arg Ala Gly Thr Leu Thr Ser Leu Glu Ser Ser His  352
Ser Val Pro Ser Met Ser Ile Ser Ala Ser Ser Gln Ser Ser Ser Val  368
Asn Ser Leu Ala Asp Ala Ser Asp Asn Glu Glu Glu Glu Glu Glu Glu  384
Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Gly Pro Glu Ser Arg Glu  400
Met Ala Met Met Gln Glu Gly Glu His Thr Val Thr Ser His Ser Ser  416
Ile Ile His Arg Leu Pro Gly Ser Asp Asn Leu Tyr Asp Asp Pro Tyr  432
Gln Pro Glu Met Thr Pro Gly Pro Leu Gln Pro Pro Ala Ala Pro Pro  448
Thr Ser Thr Ser Ser Ser Ser Ala Arg Arg Arg Ala Tyr Cys Arg Asn  464
Arg Asp His Phe Ala Thr Ile Arg Thr Ala Ser Leu Val Ser Arg Gln  480
Ile Gln Glu His Glu Gln Asp Ser Ala Leu Arg Glu Gln Leu Ser Gly  496
Tyr Lys Arg Met Arg Arg Gln His Gln Lys Gln Leu Leu Ala Leu Glu  512
Ser Arg Leu Arg Gly Glu Arg Glu Glu His Ser Gly Arg Leu Gln Arg  528
Glu Leu Glu Ala Gln Arg Ala Gly Phe Gly Thr Glu Ala Glu Lys Leu  544
Ala Arg Arg His Gln Ala Ile Gly Glu Lys Glu Ala Arg Ala Ala Gln  560
Ala Glu Glu Arg Lys Phe Gln Gln His Ile Leu Gly Gln Gln Lys Lys  576
Glu Leu Ala Ala Leu Leu Glu Ala Gln Lys Arg Thr Tyr Lys Leu Arg  592
Lys Glu Gln Leu Lys Glu Glu Leu Gln Glu Asn Pro Ser Thr Pro Lys  608
Arg Glu Lys Ala Glu Trp Leu Leu Arg Gln Lys Glu Gln Leu Gln Gln  624
Cys Gln Ala Glu Glu Glu Ala Gly Leu Leu Arg Arg Gln Arg Gln Tyr  640
Phe Glu Leu Gln Cys Arg Gln Tyr Lys Arg Lys Met Leu Leu Ala Arg  656
His Ser Leu Asp Gln Asp Leu Leu Arg Glu Asp Leu Asn Lys Lys Gln  672
Thr Gln Lys Asp Leu Glu Cys Ala Leu Leu Leu Arg Gln His Glu Ala  688
Thr Arg Glu Leu Glu Leu Arg Gln Leu Gln Ala Val Gln Arg Thr Arg  704
Ala Glu Leu Thr Arg Leu Gln His Gln Thr Glu Leu Gly Asn Gln Leu  720
Glu Tyr Asn Lys Arg Arg Glu Gln Glu Leu Arg Gln Lys His Ala Ala  736
Gln Val Arg Gln Gln Pro Lys Ser Leu Lys Val Arg Ala Gly Gln Leu  752
Pro Met Gly Leu Pro Ala Thr Gly Ala Leu Gly Pro Leu Ser Thr Gly  768
Thr Leu Ser Glu Glu Gln Pro Cys Ser Ser Gly Gln Glu Ala Ile Leu  784
Gly Gln Arg Met Leu Gly Glu Glu Glu Glu Ala Val Pro Glu Arg Met  800
Ile Leu Gly Lys Glu Gly Thr Thr Leu Glu Pro Glu Glu Gln Arg Ile  816
Leu Gly Glu Glu Met Gly Thr Phe Ser Ser Ser Pro Gln Lys His Arg  832
Ser Leu Val Asn Glu Glu Asp Trp Asp Ile Ser Lys Glu Met Lys Glu  848
Ser Arg Val Pro Ser Leu Ala Ser Gln Glu Arg Asn Ile Ile Gly Gln  864
Glu Glu Ala Gly Ala Trp Asn Leu Trp Glu Lys Glu His Gly Asn Leu  880
Val Asp Met Glu Phe Lys Leu Gly Trp Val Gln Gly Pro Val Leu Thr  896
Pro Val Pro Glu Glu Glu Glu Glu Glu Glu Glu Glu Gly Gly Ala Pro  912
Ile Gly Thr Pro Arg Asp Pro Gly Asp Gly Cys Pro Ser Pro Asp Ile  928
Pro Pro Glu Pro Pro Pro Ser His Leu Arg Gln Tyr Pro Ala Ser Gln  944
Leu Pro Gly Phe Leu Ser His Gly Leu Leu Thr Gly Leu Ser Phe Ala  960
Val Gly Ser Ser Ser Gly Leu Leu Pro Leu Leu Leu Leu Leu Leu Leu  976
Pro Leu Leu Ala Pro Arg Trp Arg Trp Leu Ala Gly Ser Thr Ala Gly  992
Pro  993

(2) INFORMATION FOR SEQ ID NO: 5:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 414 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

ACGANTCACC AGTTGGAAGT TACTCCAAAG AATGAGCACA AAACAATCTT AAAGACACTG  60
AAAGATGAGC AGACAAGAAA ACTTGCCATT TNGGCAGAGC AGTATGAACA GAGTATAAAT  120
GAAATGATGG CCTCTCANGC GTTACGGCTA GATGAGGCTC AAGAAGCAGA ATGCCAGGCC  180
TTGAGGCTAC AGCTCCAGCA GGAAATGGAG CTGCTCAACG CCTACCAGAG CAAAATCAAG  240
ATGCAAACAG AGGCACAACA TGAACGTGAG CTCCAGAAGC TAGAGCAGAG AGTGTCTCTG  300
CGCAGAGCAC ACCTTGAGCA GAAGATTGAA GAGGAGCTGG CTGCCCTTCA GAAGGAACGC  360
AGCGAGAGAA TAAAGAACCT ATTGGAAAGG CAAGAGCGAG AGATTGGAAA CTTT  414

(2) INFORMATION FOR SEQ ID NO: 6:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 314 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

GAACAAAGTC ATGCCTTAAT AGTTCTGCTG ATGTTGGCCT TTCCTGAGGT ATTTTCTGCA  60
AGCAGTAATC AACAAATCTC CTAAAGGAGT CTGTCCATTC ATTAGACTGT AACGTTGGGG  120
AGTCATTCTG GGCAATGTGA TATAAGGCAC TCATTGCATT CATGTTGAAA AGGGGCGGCT  180
TCCGTTCCGC CAATTCAATA CAAGTGATGC CAAGTGACCA AATATCAACT TTCCCATCAT  240
ACTGTCCTTC ATCCATAGCT AAGATCACCT CTGGAGCCAT CCAGTAAGGT GTGCCCACGA  300
AGGAGTTGGC CAGG  314

(2) INFORMATION FOR SEQ ID NO: 7:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 370 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

ACCAAATTCC CAAATCCCAT TCTGAGGCTC TCCATGTCAA AAGTTTCAAT CTCTCGCTCT  60
TGCCTTTCCA ATAGGTTCTT TATTCTCTCG CTGCGTTCCT TCTGAAGGGC AGCCAGCTCC  120
TCTTCAATCT TCTGCTCAAG GTGTGGTCTG CGCAGAGACA CTCTCTGCTC TAGCTTCTGG  180
AGCTCACGTT CATGTTGTGC CTCTGTTNGN ATCTTGATTT GGNTCTGGTA GGCGTTGAGC  240
AGCTCCATTT CCTGCTGGAG CTGTAGCCTC AAGGCCTGGC ATTCTGCTTC TTGAGCCTCA  300
TCTAGCCGTA ACGCTTGAGA GGCCATCATT TCATTTATAC TCTGTTCATA CTGCTCTGCC  360
AAAATGGCAA  370

(2) INFORMATION FOR SEQ ID NO: 8:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 190 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

CAACAGCAGA AAAACTTAAA GGCCATGGAA ATGCAAATTA AAAAACAGTT TCAGGACACT  60
TGCAAAGTAC AGACCAAACA GTATAAAGCA CTCAAGAATC ACCAGTTGGA AGTTACTCCA  120
AAGAATGAGC ACAAAACAAT CTTAAAGACA CTGAAAGATG AGCAGACAAG AAAACTTGCC  180
ATTTTGGCAG  190

(2) INFORMATION FOR SEQ ID NO: 9:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 65 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

GAGCAGTATG AACAGAGTAT AAATGAAATG ATGGCCTCTC AAGCGTTACG GCTAGATGAG  60
GCTCA  65

(2) INFORMATION FOR SEQ ID NO: 10:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 219 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

ACGAGTCCCC CCGAGAGCTA GAGTACAGGC AGCTGCACAC GTTACAGAAG CTACGCATGG  60
ATCTGATCCG TTTACAGCAC CAGACGGAAC TGGAAAACCA GCTGGAGTAC AATAAGAGGC  120
GAGAAAGAGA ACTGCACAGA AAGCATGTCA TGGAACTTCG GCAACAGCCA AAAAACTTAA  180
AGGCCATGGA ANTGCAATTT AAAAAACAGT TCCAGGAAA  219

(2) INFORMATION FOR SEQ ID NO: 11:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 85 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

GTGCATATGG TATATTTNAT TCATTTTTGT AAAGCGTTCT GTTTTGTGTT TACTAATTGG  60
GATGTCATAG TACTTGGCTG CCGGG  85

(2) INFORMATION FOR SEQ ID NO: 12:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 46 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

CTCACTTGGG TACTACAGTG TGGAAGCTGA GTGCATATGG TATATT  46

(2) INFORMATION FOR SEQ ID NO: 13:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 116 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

GATATTTGGT CATTGGGTAT CACGTGTATA GAGCTGGCCG AACGTCGTCC ACCATTGTTC  60
AGTATGAATG CAATGTCTGC CCTCTACCAT ATTGCTCAAA ATGATCCTCC AACTCT  116

(2) INFORMATION FOR SEQ ID NO: 14:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 118 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

CTGAAAGGCC TGGATTATCT GCACTCAGAG CGCAAGATCC ACCGAGATAT CAAAGCTGCC  60
AACGTGCTGC TCTCGGAGCA GGGTGATGTG AAGATGGCAG ACTTCGGTGT GGCTGGCA  118

(2) INFORMATION FOR SEQ ID NO: 15:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 110 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

GACCCAGAGG AACTCTTCAC CAAGCTTGAC CGCATTGGCA AAGGCTCATT TGGGGAGGTG  60
TACAAGGGGA TCGACAACCA CACCAAGGAA GTGGTGGCCA TCAAGATCAT  110

(2) INFORMATION FOR SEQ ID NO: 16:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 134 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

TCAGGATTCT GGAGCTCTGG AGTTCCATTA GTGGCTATCA GATACAATGC CCTGAGTGGA  60
TTTTCATTAA GGTAAGGGGG TTCACCTTCC ACCATTTCAA TTGCCATAAT TCCAAGAGAC  120
CAGATATCAA CTTT  134

(2) INFORMATION FOR SEQ ID NO: 17:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 278 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS:
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Met Ala Pro Ala Val Leu Gln Lys Pro Gly Val Ile Lys Asp Pro Ser  16
Ile Ala Ala Leu Phe Ser Asn Lys Asp Pro Glu Gln Asp Leu Arg Glu  32
Ile Gly His Gly Ser Phe Gly Ala Val Tyr Phe Ala Tyr Asp Lys Lys  48
Asn Glu Gln Thr Val Ala Ile Lys Lys Met Asn Phe Ser Gly Lys Gln  64
Ala Val Glu Lys Trp Asn Asp Ile Leu Lys Glu Val Ser Phe Leu Asn  80
Thr Val Val His Pro His Ile Val Asp Tyr Lys Ala Cys Phe Leu Lys  96
Asp Thr Thr Cys Trp Leu Val Met Glu Tyr Cys Ile Gly Ser Ala Ala  112
Asp Ile Val Asp Val Leu Arg Lys Gly Met Arg Glu Val Glu Ile Ala  128
Ala Ile Cys Ser Gln Thr Leu Asp Ala Leu Arg Tyr Leu His Ser Leu  144
Lys Arg Ile His Arg Asp Ile Lys Ala Gly Asn Ile Leu Leu Ser Asp  160
His Ala Ile Val Lys Leu Ala Asp Phe Gly Ser Ala Ser Leu Val Asp  176
Pro Ala Gln Thr Phe Ile Gly Thr Pro Phe Phe Met Ala Pro Glu Val  192
Ile Leu Ala Met Asp Glu Gly His Tyr Thr Asp Arg Ala Asp Ile Trp  208
Ser Leu Gly Ile Thr Cys Ile Glu Leu Ala Glu Arg Arg Pro Pro Leu  224
Phe Ser Met Asn Ala Met Ser Ala Leu Tyr His Ile Ala Gln Asn Asp  240
Pro Pro Thr Leu Ser Pro Ile Asp Thr Ser Glu Gln Pro Glu Trp Ser  256
Leu Glu Phe Val Gln Phe Ile Asp Lys Cys Leu Arg Lys Pro Ala Glu  272
Glu Arg Met Ser Ala Glu  278

(2) INFORMATION FOR SEQ ID NO: 18:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 273 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS:
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Arg Glu Glu Arg Glu Arg Arg Lys Lys Gln Leu Tyr Ala Lys Leu Asn  16
Glu Ile Cys Ser Asp Gly Asp Pro Ser Thr Lys Tyr Ala Asn Leu Val  32
Lys Ile Gly Gln Gly Ala Ser Gly Gly Val Tyr Thr Ala Tyr Glu Ile  48
Gly Thr Asn Val Ser Val Ala Ile Lys Gln Met Asn Leu Glu Lys Gln  64
Pro Lys Lys Glu Leu Ile Ile Asn Glu Ile Leu Val Met Lys Gly Ser  80
Lys His Pro Asn Ile Val Asn Phe Ile Asp Ser Tyr Val Leu Lys Gly  96
Asp Leu Trp Val Ile Met Glu Tyr Met Glu Gly Gly Ser Leu Thr Val  112
Asp Val Val Thr His Cys Ile Leu Thr Glu Gly Gln Ile Gly Ala Val  128
Cys Arg Glu Thr Leu Ser Gly Leu Glu Phe Leu His Ser Lys Gly Val  144
Leu His Arg Asp Ile Lys Ser Asp Asn Ile Leu Leu Ser Met Glu Gly  160
Asp Ile Lys Leu Thr Asp Phe Gly Phe Cys Ala Gln Ile Asn Glu Leu  176
Asn Leu Lys Arg Thr Thr Met Val Gly Thr Pro Tyr Trp Met Ala Pro  192
Glu Val Val Ser Arg Lys Glu Tyr Gly Pro Lys Val Asp Ile Trp Ser  208
Leu Gly Ile Met Ile Ile Glu Met Ile Glu Gly Glu Pro Pro Tyr Leu  224
Asn Glu Thr Pro Leu Arg Ala Leu Tyr Leu Ile Ala Thr Asn Gly Thr  240
Pro Lys Leu Lys Glu Pro Glu Asn Leu Ser Ser Ser Leu Lys Lys Phe  256
Leu Asp Trp Cys Leu Cys Cys Val Glu Pro Glu Asp Arg Ala Ser Ala  272
Thr  273

(2) INFORMATION FOR SEQ ID NO: 19:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 33 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (ix) FEATURE:
    (A) NAME/KEY: -
    (B) LOCATION: 24
    (D) OTHER INFORMATION: /note= "Where N is inosine"

  (ix) FEATURE:
    (A) NAME/KEY: -
    (B) LOCATION: 31
    (D) OTHER INFORMATION: /note= "Where N is inosine"

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

GACGCTGGAT CCAAAGATAC TGGNCAAGGG NGC  33

(2) INFORMATION FOR SEQ ID NO: 20:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 21 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (ix) FEATURE:
    (A) NAME/KEY: -
    (B) LOCATION: 3
    (D) OTHER INFORMATION: /note= "Where N is inosine"

  (ix) FEATURE:
    (A) NAME/KEY: -
    (B) LOCATION: 6
    (D) OTHER INFORMATION: /note= "Where N is inosine"

  (ix) FEATURE:
    (A) NAME/KEY: -
    (B) LOCATION: 13
    (D) OTHER INFORMATION: /note= "Where N is inosine"

  (ix) FEATURE:
    (A) NAME/KEY: -
    (B) LOCATION: 16
    (D) OTHER INFORMATION: /note= "Where N is inosine"

  (ix) FEATURE:
    (A) NAME/KEY: -
    (B) LOCATION: 19
    (D) OTHER INFORMATION: /note= "Where N is inosine"

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

GGNGTNCCAG TTNGTNGCNA T  21

(2) INFORMATION FOR SEQ ID NO: 21:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 28 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (ix) FEATURE:
    (A) NAME/KEY: -
    (B) LOCATION: 11
    (D) OTHER INFORMATION: /note= "Where N is inosine"

  (ix) FEATURE:
    (A) NAME/KEY: -
    (B) LOCATION: 14
    (D) OTHER INFORMATION: /note= "Where N is inosine"

  (ix) FEATURE:
    (A) NAME/KEY: -
    (B) LOCATION: 18
    (D) OTHER INFORMATION: /note= "Where N is inosine"

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

AAAGGAAGCA NAGNCAGNAA CGGAAGAT  28

(2) INFORMATION FOR SEQ ID NO: 22:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 30 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (ix) FEATURE:
    (A) NAME/KEY: -
    (B) LOCATION: 19
    (D) OTHER INFORMATION: /note= "Where N is inosine"

  (ix) FEATURE:
    (A) NAME/KEY: -
    (B) LOCATION: 22
    (D) OTHER INFORMATION: /note= "Where N is inosine"

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

GACGCTGAAT TCACCTTCNG GNGCCATCCA  30

(2) INFORMATION FOR SEQ ID NO: 23:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 20 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS:
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Thr Lys Asp Ala Val Arg Glu Leu Asp Asn Leu Gln Tyr Arg Lys Met  16
Lys Lys Leu Leu  20

(2) INFORMATION FOR SEQ ID NO: 24:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 19 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS:
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Lys Lys Glu Leu Asn Ser Phe Leu Glu Ser Gln Lys Arg Glu Tyr Lys  16
Leu Arg Lys  19

(2) INFORMATION FOR SEQ ID NO: 25:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 20 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS:
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Arg Glu Leu Arg Glu Leu Glu Gln Arg Val Ser Leu Arg Arg Ala Leu  16
Leu Glu Gln Lys  20

(2) INFORMATION FOR SEQ ID NO: 26:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 8 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS:
    (D) TOPOLOGY: linear

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

His Arg Asp Ile Lys Ala Gly Asn  8
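The eight-residue sequence of SEQ ID NO: 26 can be located inside the longer listed proteins by converting the three-letter residues to one-letter code and doing a plain substring search. The following is only an illustrative sketch: the helper names `to_one_letter` and `find_motif` are hypothetical, and the second input is just residues 145 to 160 of SEQ ID NO: 17 as printed in this listing, not the full sequence.

```python
from typing import List

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter: str) -> str:
    """Convert a whitespace-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[res] for res in three_letter.split())


def find_motif(protein: str, motif: str, first_residue: int = 1) -> List[int]:
    """Return the residue numbers at which `motif` starts in `protein`.

    `first_residue` is the residue number of protein[0], so a fragment
    taken from the middle of a listing still reports listing coordinates.
    """
    hits = []
    idx = protein.find(motif)
    while idx != -1:
        hits.append(first_residue + idx)
        idx = protein.find(motif, idx + 1)  # allow overlapping matches
    return hits


# SEQ ID NO: 26, copied from the listing.
SEQ_ID_26 = "His Arg Asp Ile Lys Ala Gly Asn"
# Residues 145-160 of SEQ ID NO: 17, copied from the listing (fragment only).
SEQ_ID_17_ROW_145_160 = (
    "Lys Arg Ile His Arg Asp Ile Lys Ala Gly Asn Ile Leu Leu Ser Asp"
)

motif = to_one_letter(SEQ_ID_26)
row = to_one_letter(SEQ_ID_17_ROW_145_160)
hits = find_motif(row, motif, first_residue=145)
print(motif, hits)  # HRDIKAGN [148]
```

An exact substring search is the simplest choice here; it reports no hit in the corresponding row of SEQ ID NO: 4, where the fourth residue is printed as Val rather than Ile, so a tolerant comparison would be needed to pick up such near matches.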