United States Patent 6,159,469
Choi, et al.    December 12, 2000

Streptococcus pneumoniae antigens and vaccines

Abstract

The present invention relates to novel vaccines for the prevention or attenuation of infection by Streptococcus pneumoniae. The invention further relates to isolated nucleic acid molecules encoding antigenic polypeptides of Streptococcus pneumoniae. Antigenic polypeptides are also provided, as are vectors, host cells and recombinant methods for producing the same. The invention additionally relates to diagnostic methods for detecting Streptococcus nucleic acids, polypeptides and antibodies in a biological sample.


Inventors: Choi; Gil H. (Rockville, MD); Kunsch; Charles A. (Atlanta, GA); Barash; Steven C. (Rockville, MD); Dillon; Patrick J. (Carlsbad, CA); Dougherty; Brian (Killingworth, CT); Fannon; Michael R. (Silver Spring, MD); Rosen; Craig A. (Laytonsville, MD)
Assignee: Human Genome Sciences, Inc. (Rockville, MD)
Appl. No.: 961083
Filed: October 30, 1997

Intern'l Class: A61K 039/40; A61K 039/00; A61K 039/385; A61K 039/02
Field of Search: 424/165.1,184.1,197.11,200.1,244.1 435/7.2,7.34,7.36,69.3,252.3,253.4,340,388.4 536/24.3,24.32,25.3


References Cited
Foreign Patent Documents
622081      Nov., 1994    EP
687688      Dec., 1995    EP
93/10238    May, 1993     WO
95/06732    Mar., 1995    WO
95/14712    Jun., 1995    WO
95/31548    Nov., 1995    WO
96/05859    Feb., 1996    WO
96/08582    Mar., 1996    WO
96/16082    May, 1996     WO
96/33276    Oct., 1996    WO
97/43303    Nov., 1997    WO
98/18930    May, 1998     WO
98/26072    Jun., 1998    WO


Other References

Martin et al., The EMBO Journal, 11(11):3831-3836 (1992).
Altschul et al., J. Mol. Biol., 215:403-410 (1990).
Pearson et al., Proc. Natl. Acad. Sci., 85:2444-2448 (1988).

Primary Examiner: Housel; James C.
Assistant Examiner: Hines; Ja-Na A.
Attorney, Agent or Firm: Human Genome Sciences, Inc.

Parent Case Text



This application claims the benefit of the filing date of Provisional Application Ser. No. 60/029,960 filed on Oct. 31, 1996.
Claims



What is claimed is:

1. An isolated nucleic acid molecule comprising a polynucleotide selected from the group consisting of:

(a) a polynucleotide encoding amino acids 1 to 796 of SEQ ID NO:56;

(b) a polynucleotide encoding amino acids 2 to 796 of SEQ ID NO:56;

(c) a polynucleotide encoding an epitope-bearing portion of amino acid residues 1 to 796 of SEQ ID NO:56;

(d) a polynucleotide probe encoding at least 15 contiguous amino acid residues of amino acid residues 1 to 796 of SEQ ID NO:56;

(e) a polynucleotide consisting of at least 50 contiguous nucleotides of nucleotides 1-2389 of SEQ ID NO:55; and

(f) a polynucleotide completely complementary to a polynucleotide in (a), (b), (c), (d), or (e) above.

2. The isolated nucleic acid molecule of claim 1, wherein said polynucleotide is (a).

3. The isolated nucleic acid molecule of claim 1, wherein said polynucleotide is (b).

4. The isolated nucleic acid molecule of claim 1, wherein said polynucleotide is (c).

5. The isolated nucleic acid molecule of claim 1, wherein said polynucleotide is (d).

6. The isolated nucleic acid molecule of claim 5, wherein said polynucleotide encodes at least 30 contiguous amino acid residues of amino acid residues 1 to 796 of SEQ ID NO:56.

7. The isolated nucleic acid molecule of claim 1, wherein said polynucleotide is (e).

8. The isolated nucleic acid molecule of claim 7, wherein said polynucleotide consists of at least 100 contiguous nucleotides of nucleotides 1-2389 of SEQ ID NO:55.

9. The isolated nucleic acid molecule of claim 1, wherein said polynucleotide is (f).

10. The isolated nucleic acid molecule of claim 2, wherein said polynucleotide is SEQ ID NO:55.

11. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Arg-10 to Arg-17 of SEQ ID NO:56.

12. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Lys-29 to Ser-39 of SEQ ID NO:56.

13. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Ser-140 to Ala-153 of SEQ ID NO:56.

14. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Arg-158 to Tyr-169 of SEQ ID NO:56.

15. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Asp-175 to Ala-183 of SEQ ID NO:56.

16. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Gly-216 to Asn-236 of SEQ ID NO:56.

17. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Ala-261 to Leu-270 of SEQ ID NO:56.

18. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Arg-282 to Phe-291 of SEQ ID NO:56.

19. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Thr-297 to Ala-305 of SEQ ID NO:56.

20. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Pro-342 to Gln-362 of SEQ ID NO:56.

21. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Phe-455 to Asp-463 of SEQ ID NO:56.

22. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues His-497 to Thr-511 of SEQ ID NO:56.

23. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Ala-521 to Gly-529 of SEQ ID NO:56.

24. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Ile-537 to Val-546 of SEQ ID NO:56.

25. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Ile-556 to Ala-568 of SEQ ID NO:56.

26. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Pro-581 to Ser-595 of SEQ ID NO:56.

27. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Glu-670 to Ala-685 of SEQ ID NO:56.

28. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Ser-696 to Ala-705 of SEQ ID NO:56.

29. The isolated nucleic acid molecule of claim 4, wherein said epitope-bearing portion comprises amino residues Leu-782 to Ser-791 of SEQ ID NO:56.

30. A method of producing a polypeptide comprising:

(a) culturing a host cell containing the polynucleotide of claim 4 under conditions suitable to produce a polypeptide encoded by said polynucleotide;

(b) isolating said polypeptide from said host cell.

31. A polypeptide produced by the method of claim 30.

32. A polypeptide produced according to the method of claim 30.

33. The isolated nucleic acid molecule of claim 1, wherein said nucleic acid molecule comprises a heterologous polynucleotide sequence.

34. The isolated nucleic acid molecule of claim 33, wherein said heterologous polynucleotide sequence encodes a heterologous polypeptide.

35. A method for making a recombinant vector comprising inserting the isolated nucleic acid molecule of claim 1 into a vector.

36. A recombinant vector comprising the isolated nucleic acid molecule of claim 1.

37. The recombinant vector of claim 36, wherein said nucleic acid molecule is operably associated with a heterologous regulatory sequence that controls gene expression.

38. A recombinant host cell comprising the isolated nucleic acid molecule of claim 1.

39. The recombinant host cell of claim 38, wherein said nucleic acid molecule is operably associated with a heterologous regulatory sequence that controls gene expression.

40. A method of using the polynucleotide of claim 1 to detect a nucleic acid molecule which hybridizes to said polynucleotide in a biological sample comprising:

(a) obtaining a biological sample suspected of containing said nucleic acid molecule;

(b) contacting said biological sample with said polynucleotide under conditions suitable for hybridization of said polynucleotide to said nucleic acid molecule; and

(c) determining the presence or absence of said nucleic acid molecule in said biological sample.

41. A method of detecting Streptococcus nucleic acids in a biological sample obtained from an animal comprising:

(a) contacting the biological sample with the nucleic acid of claim 1 under conditions such that hybridization occurs, and

(b) detecting hybridization of said nucleic acid to the Streptococcus nucleic acid sequences present in the biological sample, wherein the detection of said hybridization is indicative of the presence of Streptococcus nucleic acids in said biological sample.

42. An isolated polypeptide comprising a member selected from the group consisting of:

(a) a polypeptide of amino acid residues 1 to 796 of SEQ ID NO:56;

(b) a polypeptide of amino acid residues 2 to 796 of SEQ ID NO:56; and

(c) an epitope-bearing portion of a polypeptide consisting of amino acid residues 1 to 796 of SEQ ID NO:56.

43. The isolated polypeptide of claim 42, wherein said polypeptide is (c).

44. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Arg-10 to Arg-17 of SEQ ID NO:56.

45. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Lys-29 to Ser-39 of SEQ ID NO:56.

46. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Ser-140 to Ala-153 of SEQ ID NO:56.

47. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Arg-158 to Tyr-169 of SEQ ID NO:56.

48. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Asp-175 to Ala-183 of SEQ ID NO:56.

49. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Gly-216 to Asn-236 of SEQ ID NO:56.

50. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Ala-261 to Leu-270 of SEQ ID NO:56.

51. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Arg-282 to Phe-291 of SEQ ID NO:56.

52. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Thr-297 to Ala-305 of SEQ ID NO:56.

53. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Pro-342 to Gln-362 of SEQ ID NO:56.

54. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Phe-455 to Asp-463 of SEQ ID NO:56.

55. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues His-497 to Thr-511 of SEQ ID NO:56.

56. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Ala-521 to Gly-529 of SEQ ID NO:56.

57. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Ile-537 to Val-546 of SEQ ID NO:56.

58. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Ile-556 to Ala-568 of SEQ ID NO:56.

59. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Pro-581 to Ser-595 of SEQ ID NO:56.

60. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Glu-670 to Ala-685 of SEQ ID NO:56.

61. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Ser-696 to Ala-705 of SEQ ID NO:56.

62. The isolated polypeptide of claim 43, wherein said epitope-bearing portion comprises amino residues Leu-782 to Ser-791 of SEQ ID NO:56.

63. The isolated polypeptide of claim 42, wherein said polypeptide is (b).

64. The isolated polypeptide of claim 42, wherein said polypeptide is fused to a heterologous polypeptide.

65. A method of producing the polypeptide of claim 42 comprising:

(a) culturing a host cell capable of expressing said polypeptide under conditions suitable to produce said polypeptide; and

(b) isolating said polypeptide from said host cell.

66. A polypeptide produced according to the method of claim 65.

67. The isolated polypeptide of claim 42, wherein said polypeptide is (a).

68. A method of using the polypeptide of claim 67 to detect an antibody which specifically binds said polypeptide comprising:

(a) obtaining a biological sample containing said antibody;

(b) obtaining a biological sample containing said polypeptide;

(c) contacting said antibody containing sample with said polypeptide containing sample under conditions suitable for said antibody to specifically bind said polypeptide; and

(d) detecting the presence of said antibody bound to said polypeptide.

69. A method of attenuating an infection caused by a member of the Streptococcus genus in an animal, comprising administering to said animal the polypeptide of claim 42, wherein said polypeptide is administered in an amount effective to attenuate said infection.

70. The method of claim 69 wherein the polypeptide comprises amino acid residues 1 to 796 of SEQ ID NO:56.

71. The method of claim 70 wherein the animal is a human.

72. The method of claim 69 wherein the polypeptide comprises an epitope-bearing portion of amino acid residues 1 to 796 of SEQ ID NO:56.

73. The method of claim 72 wherein the animal is a human.
Description



FIELD OF THE INVENTION

The present invention relates to novel Streptococcus pneumoniae antigens for the detection of Streptococcus and for the prevention or attenuation of disease caused by Streptococcus. The invention further relates to isolated nucleic acid molecules encoding antigenic polypeptides of S. pneumoniae. Antigenic polypeptides are also provided, as are vectors, host cells and recombinant methods for producing the same. The invention additionally relates to diagnostic methods for detecting Streptococcus gene expression.

BACKGROUND OF THE INVENTION

Streptococcus pneumoniae has been one of the most extensively studied microorganisms since its first isolation in 1881. It was the object of many investigations that led to important scientific discoveries. In 1928, Griffith observed that when heat-killed encapsulated pneumococci and live strains constitutively lacking any capsule were concomitantly injected into mice, the nonencapsulated could be converted into encapsulated pneumococci with the same capsular type as the heat-killed strain. Years later, the nature of this "transforming principle," or carrier of genetic information, was shown to be DNA. (Avery, O. T., et al., J. Exp. Med., 79:137-157 (1944)).

In spite of the vast number of publications on S. pneumoniae many questions about its virulence are still unanswered, and this pathogen remains a major causative agent of serious human disease, especially community-acquired pneumonia. (Johnston, R. B., et al., Rev. Infect. Dis. 13(Suppl. 6):S509-517 (1991)). In addition, in developing countries, the pneumococcus is responsible for the death of a large number of children under the age of 5 years from pneumococcal pneumonia. The incidence of pneumococcal disease is highest in infants under 2 years of age and in people over 60 years of age. Pneumococci are the second most frequent cause (after Haemophilus influenzae type b) of bacterial meningitis and otitis media in children. With the recent introduction of conjugate vaccines for H. influenzae type b, pneumococcal meningitis is likely to become increasingly prominent. S. pneumoniae is the most important etiologic agent of community-acquired pneumonia in adults and is the second most common cause of bacterial meningitis behind Neisseria meningitidis.

The antibiotic generally prescribed to treat S. pneumoniae is benzylpenicillin, although resistance to this and to other antibiotics is found occasionally. Pneumococcal resistance to penicillin results from mutations in its penicillin-binding proteins. In uncomplicated pneumococcal pneumonia caused by a sensitive strain, treatment with penicillin is usually successful unless started too late. Erythromycin or clindamycin can be used to treat pneumonia in patients hypersensitive to penicillin, but strains resistant to these drugs exist. Broad spectrum antibiotics (e.g., the tetracyclines) may also be effective, although tetracycline-resistant strains are not rare. In spite of the availability of antibiotics, the mortality of pneumococcal bacteremia in the last four decades has remained stable between 25 and 29%. (Gillespie, S. H., et al., J. Med. Microbiol. 28:237-248 (1989)).

S. pneumoniae is carried in the upper respiratory tract by many healthy individuals. It has been suggested that attachment of pneumococci is mediated by a disaccharide receptor on fibronectin, present on human pharyngeal epithelial cells. (Anderson, B. J., et al., J. Immunol. 142:2464-2468 (1989)). The mechanisms by which pneumococci translocate from the nasopharynx to the lung, thereby causing pneumonia, or migrate to the blood, giving rise to bacteremia or septicemia, are poorly understood. (Johnston, R. B., et al., Rev. Infect. Dis. 13(Suppl. 6):S509-517 (1991)).

Various proteins have been suggested to be involved in the pathogenicity of S. pneumoniae; however, only a few of them have actually been confirmed as virulence factors. Pneumococci produce an IgA1 protease that might interfere with host defense at mucosal surfaces. (Kornfeld, S. J., et al., Rev. Infect. Dis. 3:521-534 (1981)). S. pneumoniae also produces neuraminidase, an enzyme that may facilitate attachment to epithelial cells by cleaving sialic acid from the host glycolipids and gangliosides. Partially purified neuraminidase was observed to induce meningitis-like symptoms in mice; however, the reliability of this finding has been questioned because the neuraminidase preparations used were probably contaminated with cell wall products. Other pneumococcal proteins besides neuraminidase are involved in the adhesion of pneumococci to epithelial and endothelial cells. These pneumococcal proteins have not yet been identified. Recently, Cundell et al. reported that peptide permeases can modulate pneumococcal adherence to epithelial and endothelial cells. It was, however, unclear whether these permeases function directly as adhesins or whether they enhance adherence by modulating the expression of pneumococcal adhesins. (DeVelasco, E. A., et al., Microbiol. Rev. 59:591-603 (1995)). A better understanding of the virulence factors determining its pathogenicity will need to be developed to cope with the devastating effects of pneumococcal disease in humans.

Ironically, despite the prominent role of S. pneumoniae in the discovery of DNA, little is known about the molecular genetics of the organism. The S. pneumoniae genome consists of one circular, covalently closed, double-stranded DNA and a collection of so-called variable accessory elements, such as prophages, plasmids, transposons and the like. Most physical characteristics and almost all of the genes of S. pneumoniae are unknown. Among the few that have been identified, most have not been physically mapped or characterized in detail. Only a few genes of this organism have been sequenced. (See, for instance, current versions of GENBANK and other nucleic acid databases, and references that relate to the genome of S. pneumoniae such as those set out elsewhere herein.) Identification of in vivo-expressed, and broadly protective, antigens of S. pneumoniae has remained elusive.

SUMMARY OF THE INVENTION

The present invention provides isolated nucleic acid molecules comprising polynucleotides encoding the S. pneumoniae polypeptides described in Table 1 and having the amino acid sequences shown as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and so on through SEQ ID NO:226. Thus, one aspect of the invention provides isolated nucleic acid molecules comprising polynucleotides having a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence encoding any of the amino acid sequences of the polypeptides shown in Table 1; and (b) a nucleotide sequence complementary to any of the nucleotide sequences in (a).

Further embodiments of the invention include isolated nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at least 90% identical, and more preferably at least 95%, 96%, 97%, 98% or 99% identical, to any of the nucleotide sequences in (a) or (b) above, or a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide in (a) or (b) above. This polynucleotide which hybridizes does not hybridize under stringent hybridization conditions to a polynucleotide having a nucleotide sequence consisting of only A residues or of only T residues. Additional nucleic acid embodiments of the invention relate to isolated nucleic acid molecules comprising polynucleotides which encode the amino acid sequences of epitope-bearing portions of an S. pneumoniae polypeptide having an amino acid sequence in (a) above.

The present invention also relates to recombinant vectors, which include the isolated nucleic acid molecules of the present invention, and to host cells containing the recombinant vectors, as well as to methods of making such vectors and host cells and for using these vectors for the production of S. pneumoniae polypeptides or peptides by recombinant techniques.

The invention further provides isolated S. pneumoniae polypeptides having an amino acid sequence selected from the group consisting of an amino acid sequence of any of the polypeptides described in Table 1.

The polypeptides of the present invention also include polypeptides having an amino acid sequence with at least 70% similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% similarity to those described in Table 1, as well as polypeptides having an amino acid sequence at least 70% identical, more preferably at least 75% identical, and still more preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to those above; as well as isolated nucleic acid molecules encoding such polypeptides.
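The percent-identity thresholds recited above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned to equal length, with "-" marking gaps, and simply counts identical columns. The function names and the example sequences are hypothetical, and this naive count is a stand-in for, not a reproduction of, whatever alignment program is used in practice.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding the same residue in both sequences.

    Assumption: inputs are pre-aligned, equal-length strings; '-' marks a gap.
    Columns where both sequences carry a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # column carries no information
        compared += 1
        if a == b:  # a gap opposite a residue is never a match
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True when the pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical 10-base sequences differing at one position (9 of 10 match).
reference = "ACGTACGTAC"
candidate = "ACGTACGTAA"
print(percent_identity(reference, candidate))
print(meets_threshold(reference, candidate, 90.0))
```

Under this counting convention a single mismatch in ten positions sits exactly at the 90% threshold; other conventions (for example, excluding every gapped column) would give different values for gapped alignments.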

The present invention further provides a vaccine, preferably a multi-component vaccine comprising one or more of the S. pneumoniae polynucleotides or polypeptides described in Table 1, or fragments thereof, together with a pharmaceutically acceptable diluent, carrier, or excipient, wherein the S. pneumoniae polypeptide(s) are present in an amount effective to elicit an immune response to members of the Streptococcus genus in an animal. The S. pneumoniae polypeptides of the present invention may further be combined with one or more immunogens of one or more other streptococcal or non-streptococcal organisms to produce a multi-component vaccine intended to elicit an immunological response against members of the Streptococcus genus and, optionally, one or more non-streptococcal organisms.

The vaccines of the present invention can be administered in a DNA form, e.g., "naked" DNA, wherein the DNA encodes one or more streptococcal polypeptides and, optionally, one or more polypeptides of a non-streptococcal organism. The DNA encoding one or more polypeptides may be constructed such that these polypeptides are expressed as fusion proteins.

The vaccines of the present invention may also be administered as a component of a genetically engineered organism. Thus, a genetically engineered organism which expresses one or more S. pneumoniae polypeptides may be administered to an animal. For example, such a genetically engineered organism may contain one or more S. pneumoniae polypeptides of the present invention intracellularly, on its cell surface, or in its periplasmic space. Further, such a genetically engineered organism may secrete one or more S. pneumoniae polypeptides.

The vaccines of the present invention may be co-administered to an animal with an immune system modulator (e.g., CD86 and GM-CSF).

The invention also provides a method of inducing an immunological response in an animal to one or more members of the Streptococcus genus, preferably one or more isolates of the S. pneumoniae species, comprising administering to the animal a vaccine as described above.

The invention further provides a method of inducing a protective immune response in an animal, sufficient to prevent or attenuate an infection by members of the Streptococcus genus, preferably at least S. pneumoniae, comprising administering to the animal a composition comprising one or more of the polynucleotides or polypeptides described in Table 1, or fragments thereof. Further, these polypeptides, or fragments thereof, may be conjugated to another immunogen and/or administered in admixture with an adjuvant.

The invention further relates to antibodies elicited in an animal by the administration of one or more S. pneumoniae polypeptides of the present invention and to methods for producing such antibodies.

The invention also provides diagnostic methods for detecting the expression of genes of members of the Streptococcus genus in an animal. One such method involves assaying for the expression of a gene encoding S. pneumoniae peptides in a sample from an animal. This expression may be assayed either directly (e.g., by assaying polypeptide levels using antibodies elicited in response to amino acid sequences described in Table 1) or indirectly (e.g., by assaying for antibodies having specificity for amino acid sequences described in Table 1). An example of such a method involves the use of the polymerase chain reaction (PCR) to amplify and detect Streptococcus nucleic acid sequences.

The present invention also relates to nucleic acid probes having all or part of a nucleotide sequence described in Table 1 (shown as SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, and so on through SEQ ID NO:225) which are capable of hybridizing under stringent conditions to Streptococcus nucleic acids. The invention further relates to a method of detecting one or more Streptococcus nucleic acids in a biological sample obtained from an animal, said one or more nucleic acids encoding Streptococcus polypeptides, comprising: (a) contacting the sample with one or more of the above-described nucleic acid probes, under conditions such that hybridization occurs, and (b) detecting hybridization of said one or more probes to the Streptococcus nucleic acid present in the biological sample.

The invention also includes immunoassays, including an immunoassay for detecting Streptococcus, preferably at least isolates of the S. pneumoniae species, comprising incubation of a sample (which is suspected of being infected with Streptococcus) with a probe antibody directed against an antigen/epitope of S. pneumoniae to be detected, under conditions allowing the formation of an antigen-antibody complex; and detecting the antigen-antibody complex which contains the probe antibody. The invention further includes an immunoassay for the detection of antibodies which are directed against a Streptococcus antigen, comprising the incubation of a sample (containing antibodies from a mammal suspected of being infected with Streptococcus) with a probe polypeptide including an epitope of S. pneumoniae, under conditions that allow the formation of antigen-antibody complexes which contain the probe epitope-containing antigen.

Some aspects of the invention pertaining to kits are those for: investigating samples for the presence of polynucleotides derived from Streptococcus which comprise a polynucleotide probe including a nucleotide sequence selected from Table 1 or a fragment thereof of approximately 15 or more nucleotides, in an appropriate container; analyzing the samples for the presence of antibodies directed against a Streptococcus antigen made up of a polypeptide which contains a S. pneumoniae epitope present in the polypeptide, in a suitable container; and analyzing samples for the presence of Streptococcus antigens made up of an anti-S. pneumoniae antibody, in a suitable container.

DETAILED DESCRIPTION

The present invention relates to recombinant antigenic S. pneumoniae polypeptides and fragments thereof. The invention also relates to methods for using these polypeptides to produce immunological responses and to confer immunological protection against disease caused by members of the genus Streptococcus, at least isolates of the S. pneumoniae species. The invention further relates to nucleic acid sequences which encode antigenic S. pneumoniae polypeptides and to methods for detecting S. pneumoniae nucleic acids and polypeptides in biological samples. The invention also relates to S. pneumoniae-specific antibodies and methods for detecting such antibodies produced in a host animal.

DEFINITIONS

The following definitions are provided to clarify the subject matter which the inventors consider to be the present invention.

As used herein, the phrase "pathogenic agent" means an agent which causes a disease state or affliction in an animal. Included within this definition, for example, are bacteria, protozoans, fungi, viruses and metazoan parasites which either produce a disease state or render an animal infected with such an organism susceptible to a disease state (e.g., a secondary infection). Further included are species and strains of the genus Streptococcus which produce disease states in animals.

As used herein, the term "organism" means any living biological system, including viruses, regardless of whether it is a pathogenic agent.

As used herein, the term "Streptococcus" means any species or strain of bacteria which is a member of the genus Streptococcus. Such species and strains are known to those of skill in the art, and include those that are pathogenic and those that are not.

As used herein, the phrase "one or more S. pneumoniae polypeptides of the present invention" means polypeptides comprising the amino acid sequence of one or more of the S. pneumoniae polypeptides described in Table 1 and disclosed as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and so on through SEQ ID NO:226. These polypeptides may be expressed as fusion proteins wherein the S. pneumoniae polypeptides of the present invention are linked to additional amino acid sequences which may be of streptococcal or non-streptococcal origin. This phrase further includes polypeptides comprising fragments of the S. pneumoniae polypeptides of the present invention.

Additional definitions are provided throughout the specification.

EXPLANATION OF TABLE 1

Table 1, below, provides information describing 113 open reading frames (ORFs) which encode potentially antigenic polypeptides of S. pneumoniae of the present invention. The table lists the ORF identifier, which consists of the letters SP, which denote S. pneumoniae, followed immediately by a three-digit numeric code, which arbitrarily numbers the potentially antigenic polypeptides of S. pneumoniae of the present invention, and the nucleotide or amino acid sequence of each ORF and encoded polypeptide. The table further correlates the ORF identifier with a sequence identification number (SEQ ID NO:). The actual nucleotide or amino acid sequence of each ORF identifier is also shown in the Sequence Listing under the corresponding SEQ ID NO.

Thus, for example, the designation "SP126" refers to both the nucleotide and amino acid sequences of S. pneumoniae polypeptide number 126 of the present invention. Further, "SP126" correlates with the nucleotide sequence shown as SEQ ID NO:223 and with the amino acid sequence shown as SEQ ID NO:224 as is described in Table 1.

The open reading frame within each "ORF" begins with the second nucleotide shown. Thus, the first codon for each nucleotide sequence shown is bases 2-4, the second 5-7, the third 8-10, and so on.
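The reading-frame convention just described can be sketched in a few lines. This is an illustrative example only: the function names and the 13-base toy sequence are hypothetical and are not taken from the Sequence Listing, and the standard genetic code is assumed for translation.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def codon_bases(n: int) -> tuple:
    """1-based (first, last) base positions of the n-th codon: 2-4, 5-7, 8-10, ..."""
    if n < 1:
        raise ValueError("codon number must be 1 or greater")
    return (3 * n - 1, 3 * n + 1)


def codon_count(length: int) -> int:
    """Number of complete codons in a sequence whose frame starts at base 2."""
    return max(length - 1, 0) // 3


def split_codons(seq: str) -> list:
    """Split a sequence into complete codons, skipping the first nucleotide."""
    seq = seq.upper()
    return [seq[i:i + 3] for i in range(1, len(seq) - 2, 3)]


def translate(seq: str) -> str:
    """Translate in the frame that begins at the second nucleotide ('X' = unknown codon)."""
    return "".join(CODON_TABLE.get(codon, "X") for codon in split_codons(seq))


toy = "GATGAAAATTTAA"  # hypothetical: one leading base, then ATG AAA ATT TAA
print(split_codons(toy))
print(translate(toy))
print(codon_count(2389))
```

With this convention a 2389-nucleotide sequence holds (2389 - 1) / 3 = 796 complete codons, which agrees with the lengths recited in the claims for nucleotides 1-2389 of SEQ ID NO:55 and amino acids 1 to 796 of SEQ ID NO:56.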

EXPLANATION OF TABLE 2

Table 2 lists the antigenic epitopes present in each of the S. pneumoniae polypeptides described in Table 1 as predicted by the inventors. Each S. pneumoniae polypeptide shown in Table 1 has one or more antigenic epitopes described in Table 2. It will be appreciated that depending on the analytical criteria used to predict antigenic determinants, the exact address of the determinant may vary slightly. The exact location of the antigenic determinant may shift by about 1 to 5 residues, more likely 1 to 2 residues, depending on the criteria used. Thus, the first antigenic determinant described in Table 2, "Lys-1 to Ile-10" of SP001, represents a peptide comprising the lysine at position 1 in SEQ ID NO:2 through and including the isoleucine at position 10 in SEQ ID NO:2, but may include more or fewer residues than those 10. It will also be appreciated that, generally speaking, amino acids can be added to either terminus of a peptide or polypeptide containing an antigenic epitope without affecting its activity, whereas removing residues from a peptide or polypeptide containing only the antigenic determinant is much more likely to destroy activity. It will be appreciated that the residues and locations described in Table 2 correspond to the amino acid sequences for each ORF shown in Table 1 and in the Sequence Listing.

EXPLANATION OF TABLE 3

Table 3 shows PCR primers designed by the inventors for the amplification of polynucleotides encoding polypeptides of the present invention according to the method of Example 1. PCR primer design is routine in the art and those shown in Table 3 are provided merely for the convenience of the skilled artisan. It will be appreciated that others can be used with equal success.

For each primer, the table lists the corresponding ORF designation from Table 1 followed by either an "A" or a "B". The "A" primers are the 5' primers and the "B" primers 3'. A restriction enzyme site was built into each primer to allow ease of cloning. The restriction enzyme which will recognize and cleave a sequence within each primer is shown in Table 3, as well, under the heading "RE" for restriction enzyme. Finally the sequence identifier is shown in Table 3 for each primer for easy correlation with the Sequence Listing.

SELECTION OF NUCLEIC ACID SEQUENCES ENCODING ANTIGENIC S. PNEUMONIAE POLYPEPTIDES

The present invention provides a select number of ORFs from those presented in the fragments of the S. pneumoniae genome which may prove useful for the generation of a protective immune response. The sequenced S. pneumoniae genomic DNA was obtained from a sub-cultured isolate of S. pneumoniae Strain 7/87 14.8.91, which has been deposited at the American Type Culture Collection, as a convenience to those of skill in the art. The S. pneumoniae isolate was deposited on Oct. 10, 1996 at the ATCC, 10801 University Blvd., Manassas, Va. 20110-2209, U.S.A., and given accession number 55840. A genomic library constructed from DNA isolated from the S. pneumoniae isolate was also deposited at the ATCC on Oct. 11, 1996 and given ATCC Deposit No. 97755. A more complete listing of the sequence obtained from the S. pneumoniae genome may be found in co-pending U.S. Provisional application Ser. No. 60/029,960, filed Oct. 31, 1996, incorporated herein by reference in its entirety. Some ORFs contained in the subset of fragments of the S. pneumoniae genome disclosed herein were derived through the use of a number of screening criteria detailed below.

The selected ORFs do not consist of complete ORFs. Although a polypeptide representing a complete ORF may be the closest approximation of a protein native to an organism, it is not always preferred to express a complete ORF in a heterologous system. It may be challenging to express and purify a highly hydrophobic protein by common laboratory methods. Thus, the polypeptide vaccine candidates described herein may have been modified slightly to simplify the production of recombinant protein. For example, nucleotide sequences which encode highly hydrophobic domains, such as those found at the amino terminal signal sequence, have been excluded from some constructs used for in vitro expression of the polypeptides. Furthermore, any highly hydrophobic amino acid sequences occurring at the carboxy terminus have also been excluded from the recombinant expression constructs. Thus, in one embodiment, a polypeptide which represents a truncated or modified ORF may be used as an antigen.

While numerous methods are known in the art for selecting potentially immunogenic polypeptides, many of the ORFs disclosed herein were selected on the basis of screening all theoretical S. pneumoniae ORFs for several aspects of potential immunogenicity. One set of selection criteria are as follows:

1. Type I signal sequence: An amino terminal type I signal sequence generally directs a nascent protein across the plasma and outer membranes to the exterior of the bacterial cell. Experimental evidence obtained from studies with Escherichia coli suggests that the typical type I signal sequence consists of the following biochemical and physical attributes (Izard, J. W. and Kendall, D. A. Mol. Microbiol. 13:765-773 (1994)). The length of the type I signal sequence is approximately 15 to 25 primarily hydrophobic amino acid residues with a net positive charge in the extreme amino terminus. In addition, the central region of the signal sequence adopts an alpha-helical conformation in a hydrophobic environment. Finally, the region surrounding the actual site of cleavage is ideally six residues long, with small side-chain amino acids in the -1 and -3 positions.

2. Type IV signal sequence: The type IV signal sequence is an example of the several types of functional signal sequences which exist in addition to the type I signal sequence detailed above. Although functionally related, the type IV signal sequence possesses a unique set of biochemical and physical attributes (Strom, M. S. and Lory, S., J. Bacteriol. 174:7345-7351 (1992)). These are typically six to eight amino acids with a net basic charge followed by an additional sixteen to thirty primarily hydrophobic residues. The cleavage site of a type IV signal sequence is typically after the initial six to eight amino acids at the extreme amino terminus. In addition, type IV signal sequences generally contain a phenylalanine residue at the +1 site relative to the cleavage site.

3. Lipoprotein: Studies of the cleavage sites of twenty-six bacterial lipoprotein precursors have allowed the definition of a consensus amino acid sequence for lipoprotein cleavage. Nearly three-fourths of the bacterial lipoprotein precursors examined contained the sequence L-(A,S)-(G,A)-C at positions -3 to +1, relative to the point of cleavage (Hayashi, S. and Wu, H. C., J. Bioenerg. Biomembr. 22:451-471 (1990)).

4. LPXTG motif: It has been experimentally determined that most anchored proteins found on the surface of gram-positive bacteria possess a highly conserved carboxy terminal sequence. More than fifty such proteins from organisms such as S. pyogenes, S. mutans, E. faecalis, S. pneumoniae, and others, have been identified based on their extracellular location and carboxy terminal amino acid sequence (Fischetti, V. A., ASM News 62:405-410 (1996)). The conserved region consists of six charged amino acids at the extreme carboxy terminus coupled to 15-20 hydrophobic amino acids presumed to function as a transmembrane domain. Immediately adjacent to the transmembrane domain is a six amino acid sequence conserved in nearly all proteins examined. The amino acid sequence of this region is L-P-X-T-G-X, where X is any amino acid.

An algorithm for selecting antigenic and immunogenic S. pneumoniae polypeptides including the foregoing criteria was developed. Use of the algorithm by the inventors to select immunologically useful S. pneumoniae polypeptides resulted in the selection of a number of the disclosed ORFs. Polypeptides comprising the polypeptides identified in this group may be produced by techniques standard in the art and as further described herein.
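Two of the foregoing criteria, the lipoprotein cleavage consensus L-(A,S)-(G,A)-C and the L-P-X-T-G-X anchor motif, are simple sequence patterns and may be screened for with regular expressions. The following Python sketch is offered for illustration only and is not the algorithm used by the inventors. The function name screen_motifs, the window sizes n_window and c_window, and the test sequence are all hypothetical assumptions. The type I and type IV signal sequence criteria depend on charge and hydrophobicity and are not addressed by this sketch.

```python
import re

# Lipoprotein cleavage consensus L-(A,S)-(G,A)-C from criterion 3.
LIPOBOX = re.compile(r"L[AS][GA]C")
# Six-residue anchor motif L-P-X-T-G-X from criterion 4 (X = any residue).
LPXTG = re.compile(r"LP.TG.")


def screen_motifs(protein, n_window=40, c_window=50):
    """Report 1-based start positions of the two motifs, or None if absent.

    n_window and c_window are assumed search windows: the lipoprotein
    consensus is sought near the amino terminus and the LPXTG motif near
    the carboxy terminus. Neither value comes from the specification.
    """
    seq = protein.upper()
    lipo = LIPOBOX.search(seq[:n_window])
    tail_offset = max(0, len(seq) - c_window)
    anchor = LPXTG.search(seq[tail_offset:])
    return {
        "lipoprotein_site": lipo.start() + 1 if lipo else None,
        "lpxtg_site": tail_offset + anchor.start() + 1 if anchor else None,
    }


# Hypothetical 72-residue test sequence containing both motifs.
candidate = "MKKLLALSACGS" + "A" * 30 + "LPETGE" + "V" * 18 + "KRKEEN"
hits = screen_motifs(candidate)
```

A sequence scoring positive in such a screen would still need to be assessed against the remaining criteria before being considered a candidate.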

NUCLEIC ACID MOLECULES

The present invention provides isolated nucleic acid molecules comprising polynucleotides encoding the S. pneumoniae polypeptides having the amino acid sequences described in Table 1 and shown as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and so on through SEQ ID NO:226, which were determined by sequencing the genome of S. pneumoniae and selected as putative immunogens.

Unless otherwise indicated, all nucleotide sequences determined by sequencing a DNA molecule herein were determined using an automated DNA sequencer (such as the Model 373 from Applied Biosystems, Inc.), and all amino acid sequences of polypeptides encoded by DNA molecules determined herein were predicted by translation of DNA sequences determined as above. Therefore, as is known in the art for any DNA sequence determined by this automated approach, any nucleotide sequence determined herein may contain some errors. Nucleotide sequences determined by automation are typically at least about 90% identical, more typically at least about 95% to at least about 99.9% identical to the actual nucleotide sequence of the sequenced DNA molecule. The actual sequence can be more precisely determined by other approaches including manual DNA sequencing methods well known in the art. As is also known in the art, a single insertion or deletion in a determined nucleotide sequence compared to the actual sequence will cause a frame shift in translation of the nucleotide sequence such that the predicted amino acid sequence encoded by a determined nucleotide sequence will be completely different from the amino acid sequence actually encoded by the sequenced DNA molecule, beginning at the point of such an insertion or deletion.

Unless otherwise indicated, each "nucleotide sequence" set forth herein is presented as a sequence of deoxyribonucleotides (abbreviated A, G, C and T). However, by "nucleotide sequence" of a nucleic acid molecule or polynucleotide is intended, for a DNA molecule or polynucleotide, a sequence of deoxyribonucleotides, and for an RNA molecule or polynucleotide, the corresponding sequence of ribonucleotides (A, G, C and U), where each thymidine deoxyribonucleotide (T) in the specified deoxyribonucleotide sequence is replaced by the ribonucleotide uridine (U). For instance, reference to an RNA molecule having a sequence described in Table 1 set forth using deoxyribonucleotide abbreviations is intended to indicate an RNA molecule having a sequence in which each deoxyribonucleotide A, G or C described in Table 1 has been replaced by the corresponding ribonucleotide A, G or C, and each deoxyribonucleotide T has been replaced by a ribonucleotide U.
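The correspondence between a deoxyribonucleotide sequence and its RNA counterpart described above amounts to replacing each T with U. A minimal Python sketch follows; the function name to_rna and the example sequence are hypothetical.

```python
def to_rna(dna_seq):
    """Return the RNA sequence corresponding to a DNA sequence.

    A, G and C are retained and every T is replaced by U. Ambiguity
    symbols such as N, R and Y are passed through unchanged.
    """
    return dna_seq.upper().replace("T", "U")


# Hypothetical example sequence, not taken from Table 1.
rna = to_rna("ATGTTGACC")
```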

Nucleic acid molecules of the present invention may be in the form of RNA, such as mRNA, or in the form of DNA, including, for instance, cDNA and genomic DNA obtained by cloning or produced synthetically. The DNA may be double-stranded or single-stranded. Single-stranded DNA or RNA may be the coding strand, also known as the sense strand, or it may be the non-coding strand, also referred to as the anti-sense strand.

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, DNA or RNA, which has been removed from its native environment. For example, recombinant DNA molecules contained in a vector are considered isolated for the purposes of the present invention. Further examples of isolated DNA molecules include recombinant DNA molecules maintained in heterologous host cells or purified (partially or substantially) DNA molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts of the DNA molecules of the present invention. Isolated nucleic acid molecules according to the present invention further include such molecules produced synthetically.

Isolated nucleic acid molecules of the present invention include DNA molecules comprising a nucleotide sequence described in Table 1 and shown as SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, and so on through SEQ ID NO:225; DNA molecules comprising the coding sequences for the polypeptides described in Table 1 and shown as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and so on through SEQ ID NO:226; and DNA molecules which comprise sequences substantially different from those described above but which, due to the degeneracy of the genetic code, still encode the S. pneumoniae polypeptides described in Table 1. Of course, the genetic code is well known in the art. Thus, it would be routine for one skilled in the art to generate such degenerate variants.

The invention also provides nucleic acid molecules having sequences complementary to any one of those described in Table 1. Such isolated molecules, particularly DNA molecules, are useful as probes for detecting expression of Streptococcal genes, for instance, by Northern blot analysis or the polymerase chain reaction (PCR).
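For illustration, the sequence complementary to a given strand may be derived computationally by complementing each base and reversing the result, so that the output reads 5' to 3'. The Python sketch below uses a hypothetical function name, reverse_complement, and hypothetical example sequences. It also maps the ambiguity symbols used in Table 1, on the assumption that the complement of a purine (R) is a pyrimidine (Y) and vice versa, and that N remains N.

```python
# Base-pairing map, including the R, Y and N ambiguity symbols.
_COMPLEMENT = str.maketrans("ACGTRYN", "TGCAYRN")


def reverse_complement(dna_seq):
    """Return the complementary strand of a DNA sequence, written 5' to 3'."""
    return dna_seq.upper().translate(_COMPLEMENT)[::-1]


# Hypothetical example sequences, not taken from Table 1.
probe = reverse_complement("ATGC")
probe_with_ambiguity = reverse_complement("AARN")
```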

The present invention is further directed to fragments of the isolated nucleic acid molecules described herein. By a fragment of an isolated nucleic acid molecule having a nucleotide sequence described in Table 1, is intended fragments at least about 15 nt, and more preferably at least about 17 nt, still more preferably at least about 20 nt, and even more preferably, at least about 25 nt in length which are useful as diagnostic probes and primers as discussed herein. Of course, larger fragments 50-100 nt in length are also useful according to the present invention as are fragments corresponding to most, if not all, of a nucleotide sequence described in Table 1. By a fragment at least 20 nt in length, for example, is intended fragments which include 20 or more contiguous bases of a nucleotide sequence as described in Table 1. Since the nucleotide sequences identified in Table 1 are provided as SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, and so on through SEQ ID NO:225, generating such DNA fragments would be routine to the skilled artisan. For example, such fragments could be generated synthetically.
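The notion of a fragment comprising a stated number of contiguous bases of a reference sequence may be illustrated with the following Python sketch. The names contiguous_fragments and contains_fragment_of, and the short sequences used, are hypothetical; a real reference sequence from Table 1 would be far longer.

```python
def contiguous_fragments(reference, length):
    """Yield every fragment of exactly `length` contiguous bases."""
    ref = reference.upper()
    for start in range(len(ref) - length + 1):
        yield ref[start:start + length]


def contains_fragment_of(candidate, reference, min_length=20):
    """Return True if `candidate` includes at least `min_length`
    contiguous bases of `reference`."""
    cand = candidate.upper()
    return any(frag in cand for frag in contiguous_fragments(reference, min_length))


# Hypothetical short sequences for illustration only.
windows = list(contiguous_fragments("ATGCA", 3))
```

Enumerating every window in this way is practical only for illustration; for long sequences an indexed search would ordinarily be preferred.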

Preferred nucleic acid fragments of the present invention also include nucleic acid molecules comprising nucleotide sequences encoding epitope-bearing portions of the S. pneumoniae polypeptides identified in Table 1. Such nucleic acid fragments of the present invention include, for example, nucleotide sequences encoding polypeptide fragments comprising from about the amino terminal residue to about the carboxy terminal residue of each fragment shown in Table 2. The above referred to polypeptide fragments are antigenic regions of the S. pneumoniae polypeptides identified in Table 1.

In another aspect, the invention provides isolated nucleic acid molecules comprising polynucleotides which hybridize under stringent hybridization conditions to a portion of a polynucleotide in a nucleic acid molecule of the invention described above, for instance, a nucleic acid sequence identified in Table 1. By "stringent hybridization conditions" is intended overnight incubation at 42° C. in a solution comprising: 50% formamide, 5×SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5×Denhardt's solution, 10% dextran sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 0.1×SSC at about 65° C.

By polynucleotides which hybridize to a "portion" of a polynucleotide is intended polynucleotides (either DNA or RNA) which hybridize to at least about 15 nucleotides (nt), and more preferably at least about 17 nt, still more preferably at least about 20 nt, and even more preferably about 25-70 nt of the reference polynucleotide. These are useful as diagnostic probes and primers as discussed above and in more detail below.

Of course, polynucleotides hybridizing to a larger portion of the reference polynucleotide, for instance, a portion 50-100 nt in length, or even to the entire length of the reference polynucleotide, are also useful as probes according to the present invention, as are polynucleotides corresponding to most, if not all, of a nucleotide sequence as identified in Table 1. By a portion of a polynucleotide of "at least 20 nt in length," for example, is intended 20 or more contiguous nucleotides from the nucleotide sequence of the reference polynucleotide (e.g., a nucleotide sequence as described in Table 1). As noted above, such portions are useful diagnostically either as probes according to conventional DNA hybridization techniques or as primers for amplification of a target sequence by PCR, as described in the literature (for instance, in Molecular Cloning, A Laboratory Manual, 2nd. edition, Sambrook, J., Fritsch, E. F. and Maniatis, T., eds., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989), the entire disclosure of which is hereby incorporated herein by reference).

Since nucleic acid sequences encoding the S. pneumoniae polypeptides of the present invention are identified in Table 1 and provided as SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, and so on through SEQ ID NO:225, generating polynucleotides which hybridize to portions of these sequences would be routine to the skilled artisan. For example, the hybridizing polynucleotides of the present invention could be generated synthetically according to known techniques.

As indicated, nucleic acid molecules of the present invention which encode S. pneumoniae polypeptides of the present invention may include, but are not limited to those encoding the amino acid sequences of the polypeptides by themselves; and additional coding sequences which code for additional amino acids, such as those which provide additional functionalities. Thus, the sequences encoding these polypeptides may be fused to a marker sequence, such as a sequence encoding a peptide which facilitates purification of the fused polypeptide. In certain preferred embodiments of this aspect of the invention, the marker amino acid sequence is a hexa-histidine peptide, such as the tag provided in a pQE vector (Qiagen, Inc.), among others, many of which are commercially available. As described by Gentz and colleagues (Proc. Natl. Acad. Sci. USA 86:821-824 (1989)), for instance, hexa-histidine provides for convenient purification of the resulting fusion protein.

Thus, the present invention also includes genetic fusions wherein the S. pneumoniae coding sequences identified in Table 1 are linked to additional nucleic acid sequences to produce fusion proteins. These fusion proteins may include epitopes of streptococcal or non-streptococcal origin designed to produce proteins having enhanced immunogenicity. Further, the fusion proteins of the present invention may contain antigenic determinants known to provide helper T-cell stimulation, peptides encoding sites for post-translational modifications which enhance immunogenicity (e.g., acylation), peptides which facilitate purification (e.g., histidine "tag"), or amino acid sequences which target the fusion protein to a desired location (e.g., a heterologous leader sequence).

In all cases of bacterial expression, an N-terminal methionine residue is added. In many cases, however, the N-terminal methionine residue is cleaved off post-translationally. Thus, the invention includes polypeptides shown in Table 1 with and without an N-terminal methionine.

The present invention thus includes nucleic acid molecules and sequences which encode fusion proteins comprising one or more S. pneumoniae polypeptides of the present invention fused to an amino acid sequence which allows for post-translational modification to enhance immunogenicity. This post-translational modification may occur either in vitro or when the fusion protein is expressed in vivo in a host cell. An example of such a modification is the introduction of an amino acid sequence which results in the attachment of a lipid moiety.

Thus, as indicated above, the present invention includes genetic fusions wherein a S. pneumoniae nucleic acid sequence identified in Table 1 is linked to a nucleotide sequence encoding another amino acid sequence. These other amino acid sequences may be of streptococcal origin (e.g., another sequence selected from Table 1) or non-streptococcal origin.

The present invention further relates to variants of the nucleic acid molecules of the present invention, which encode portions, analogs or derivatives of the S. pneumoniae polypeptides described in Table 1. Variants may occur naturally, such as a natural allelic variant. By an "allelic variant" is intended one of several alternate forms of a gene occupying a given locus on a chromosome of an organism (Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985)). Non-naturally occurring variants may be produced using art-known mutagenesis techniques.

Such variants include those produced by nucleotide substitutions, deletions or additions. The substitutions, deletions or additions may involve one or more nucleotides. These variants may be altered in coding regions, non-coding regions, or both. Alterations in the coding regions may produce conservative or non-conservative amino acid substitutions, deletions or additions. Especially preferred among these are silent substitutions, additions and deletions, which do not alter the properties and activities of the S. pneumoniae polypeptides disclosed herein or portions thereof. Silent substitutions are most likely to be made in non-epitopic regions. Guidance regarding those regions containing epitopes is provided herein, for example, in Table 2. Also especially preferred in this regard are conservative substitutions.

Further embodiments of the invention include isolated nucleic acid molecules comprising a polynucleotide having a nucleotide sequence at least 90% identical, and more preferably at least 95%, 96%, 97%, 98% or 99% identical to: (a) a nucleotide sequence encoding any of the amino acid sequences of the polypeptides identified in Table 1; and (b) a nucleotide sequence complementary to any of the nucleotide sequences in (a) above.

By a polynucleotide having a nucleotide sequence at least, for example, 95% "identical" to a reference nucleotide sequence encoding a S. pneumoniae polypeptide described in Table 1, is intended that the nucleotide sequence of the polynucleotide is identical to the reference sequence except that the polynucleotide sequence may include up to five point mutations per 100 nucleotides of the reference nucleotide sequence encoding the subject S. pneumoniae polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in the reference sequence may be deleted or substituted with another nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference sequence may be inserted into the reference sequence. These mutations of the reference sequence may occur at the 5' or 3' terminal positions of the reference nucleotide sequence or anywhere between those terminal positions, interspersed either individually among nucleotides in the reference sequence or in one or more contiguous groups within the reference sequence.
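The arithmetic implied by the foregoing definition may be sketched as follows. The Python function max_point_mutations is a hypothetical name. It takes the identity threshold as a whole-number percentage and uses integer arithmetic, which is an implementation choice made here to avoid floating-point rounding; fractional results are rounded down, which is likewise an assumption of this sketch.

```python
def max_point_mutations(reference_length, percent_identity):
    """Largest number of point mutations consistent with the stated
    percent identity over a reference of the given length.

    For 95% identity this is five per 100 reference nucleotides.
    """
    if not 0 <= percent_identity <= 100:
        raise ValueError("percent_identity must be between 0 and 100")
    return reference_length * (100 - percent_identity) // 100


# Hypothetical reference lengths for illustration.
allowed_100 = max_point_mutations(100, 95)
allowed_300 = max_point_mutations(300, 95)
```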

Certain nucleotides within some of the nucleic acid sequences shown in Table 1 were ambiguous upon sequencing. Completely unknown sequences are shown as an "N". Other unresolved nucleotides are known to be either a purine, shown as "R", or a pyrimidine, shown as "Y". Accordingly, when determining identity between two nucleotide sequences, identity is met where any nucleotide, including an "R", "Y" or "N", is found in a test sequence and at the corresponding position in the reference sequence (from Table 1). Likewise, an A, G or "R" in a test sequence is identical to an "R" in the reference sequence; and a T, C or "Y" in a test sequence is identical to a "Y" in the reference sequence.
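The matching rules for ambiguous positions may be illustrated with the Python sketch below. It is a simplified illustration, not the Bestfit program: it assumes the test and reference sequences have already been aligned and are of equal length, and it does not handle gaps. The names position_identical and percent_identity are hypothetical. The treatment of "N" in the reference as matching any symbol in the test sequence is this sketch's reading of the rule stated above.

```python
# Symbols in the test sequence accepted at each ambiguous reference position.
REFERENCE_MATCHES = {
    "R": set("AGR"),      # purine in the reference
    "Y": set("TCY"),      # pyrimidine in the reference
    "N": set("ACGTRYN"),  # assumption: unknown reference base matches anything
}


def position_identical(test_base, ref_base):
    """Apply the ambiguity rules to a single aligned position."""
    allowed = REFERENCE_MATCHES.get(ref_base)
    if allowed is None:
        return test_base == ref_base
    return test_base in allowed


def percent_identity(test_seq, ref_seq):
    """Percent identity over the full length of the reference sequence.

    Both sequences must be pre-aligned, ungapped and of equal length.
    """
    if not ref_seq or len(test_seq) != len(ref_seq):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(
        position_identical(t, r)
        for t, r in zip(test_seq.upper(), ref_seq.upper())
    )
    return 100.0 * matches / len(ref_seq)


# Hypothetical four-base sequences for illustration.
full_match = percent_identity("ATGA", "ATRN")
one_mismatch = percent_identity("ATCA", "ATRA")
```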

As a practical matter, whether any particular nucleic acid molecule is at least 90%, 95%, 96%, 97%, 98% or 99% identical to, for instance, a nucleotide sequence described in Table 1 can be determined conventionally using known computer programs such as the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711). Bestfit uses the local homology algorithm of Smith and Waterman (Advances in Applied Mathematics 2:482-489 (1981)), to find the best segment of homology between two sequences. When using Bestfit or any other sequence alignment program to determine whether a particular sequence is, for instance, 95% identical to a reference sequence according to the present invention, the parameters are set, of course, such that the percentage of identity is calculated over the full length of the reference nucleotide sequence and that gaps in homology of up to 5% of the total number of nucleotides in the reference sequence are allowed.

The present application is directed to nucleic acid molecules at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence described in Table 1. One of skill in the art would still know how to use the nucleic acid molecule, for instance, as a hybridization probe or a polymerase chain reaction (PCR) primer. Uses of the nucleic acid molecules of the present invention include, inter alia, (1) isolating Streptococcal genes or allelic variants thereof from either a genomic or cDNA library and (2) Northern Blot or PCR analysis for detecting Streptococcal mRNA expression.

Of course, due to the degeneracy of the genetic code, one of ordinary skill in the art will immediately recognize that a large number of nucleic acid molecules having a sequence at least 90%, 95%, 96%, 97%, 98%, or 99% identical to a nucleic acid sequence identified in Table 1 will encode the same polypeptide. In fact, since degenerate variants of these nucleotide sequences all encode the same polypeptide, this will be clear to the skilled artisan even without performing the above described comparison assay.

It will be further recognized in the art that, for such nucleic acid molecules that are not degenerate variants, a reasonable number will also encode proteins having antigenic epitopes of the S. pneumoniae polypeptides of the present invention. This is because the skilled artisan is fully aware of amino acid substitutions that are either less likely or not likely to significantly affect the antigenicity of a polypeptide (e.g., replacement of an amino acid in a region which is not believed to form an antigenic epitope). For example, since antigenic epitopes have been identified which contain as few as six amino acids (see Harlow, et al., Antibodies: A Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1988), page 76), in instances where a polypeptide has multiple antigenic epitopes the alteration of several amino acid residues would often not be expected to eliminate all of the antigenic epitopes of that polypeptide. This is especially so when the alterations are in regions believed to not constitute antigenic epitopes.

VECTORS AND HOST CELLS

The present invention also relates to vectors which include the isolated DNA molecules of the present invention, host cells which are genetically engineered with the recombinant vectors, and the production of S. pneumoniae polypeptides or fragments thereof by recombinant techniques.

Recombinant constructs may be introduced into host cells using well known techniques such as infection, transduction, transfection, transvection, electroporation and transformation. The vector may be, for example, a phage, plasmid, viral or retroviral vector. Retroviral vectors may be replication competent or replication defective. In the latter case, viral propagation generally will occur only in complementing host cells.

The polynucleotides may be joined to a vector containing a selectable marker for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is a virus, it may be packaged in vitro using an appropriate packaging cell line and then transduced into host cells.

Preferred are vectors comprising cis-acting control regions to the polynucleotide of interest. Appropriate trans-acting factors may be supplied by the host, supplied by a complementing vector or supplied by the vector itself upon introduction into the host.

In certain preferred embodiments in this regard, the vectors provide for specific expression, which may be inducible and/or cell type-specific. Particularly preferred among such vectors are those inducible by environmental factors that are easy to manipulate, such as temperature and nutrient additives.

Expression vectors useful in the present invention include chromosomal-, episomal- and virus-derived vectors, e.g., vectors derived from bacterial plasmids, bacteriophage, yeast episomes, yeast chromosomal elements, viruses such as baculoviruses, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, such as cosmids and phagemids.

The DNA insert should be operatively linked to an appropriate promoter, such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to name a few. Other suitable promoters will be known to the skilled artisan. The expression constructs will further contain sites for transcription initiation, termination and, in the transcribed region, a ribosome binding site for translation. The coding portion of the mature transcripts expressed by the constructs will preferably include a translation initiating site at the beginning and a termination codon (UAA, UGA or UAG) appropriately positioned at the end of the polypeptide to be translated.
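The requirement that the coding portion of a transcript begin with a translation initiating site and end with a termination codon (UAA, UGA or UAG) may be checked computationally before a construct is made. The following Python sketch is illustrative only. The function name check_coding_region and the example sequences are hypothetical, and treating AUG as the only initiator is an assumption of this sketch, adjustable through the start_codons parameter.

```python
STOP_CODONS = {"UAA", "UGA", "UAG"}


def check_coding_region(coding_seq, start_codons=("AUG",)):
    """Report basic frame checks for a coding sequence.

    DNA input is accepted and converted to RNA notation (T to U).
    Only complete codons are examined.
    """
    seq = coding_seq.upper().replace("T", "U")
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return {
        "length_multiple_of_3": len(seq) % 3 == 0,
        "starts_with_initiator": bool(codons) and codons[0] in start_codons,
        "ends_with_stop": bool(codons) and codons[-1] in STOP_CODONS,
        "internal_stop": any(c in STOP_CODONS for c in codons[:-1]),
    }


# Hypothetical sequences: one well formed, one with a premature stop codon.
good = check_coding_region("AUGAAAUAA")
premature = check_coding_region("ATGTAAAAATGA")
```

A sequence reporting an internal stop codon would be expected to yield a truncated polypeptide on expression.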

As indicated, the expression vectors will preferably include at least one selectable marker. Such markers include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture and tetracycline or ampicillin resistance genes for culturing in E. coli and other bacteria. Representative examples of appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS and Bowes melanoma cells; and plant cells. Appropriate culture media and conditions for the above-described host cells are known in the art.

Vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, available from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A available from Stratagene; pET series of vectors available from Novagen; and ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXT1 and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan.

Known bacterial promoters suitable for use in the present invention include the E. coli lacI and lacZ promoters, the T3 and T7 promoters, the gpt promoter, the lambda PR and PL promoters and the trp promoter. Suitable eukaryotic promoters include the CMV immediate early promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, the promoters of retroviral LTRs, such as those of the Rous sarcoma virus (RSV), and metallothionein promoters, such as the mouse metallothionein-I promoter.

Introduction of the construct into the host cell can be effected by calcium phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, electroporation, transduction, infection or other methods. Such methods are described in many standard laboratory manuals (for example, Davis et al., Basic Methods In Molecular Biology (1986)).

Transcription of DNA encoding the polypeptides of the present invention by higher eukaryotes may be increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually from about 10 to 300 bp, that act to increase transcriptional activity of a promoter in a given host cell-type. Examples of enhancers include the SV40 enhancer, which is located on the late side of the replication origin at bp 100 to 270, the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.

For secretion of the translated polypeptide into the lumen of the endoplasmic reticulum, into the periplasmic space or into the extracellular environment, appropriate secretion signals may be incorporated into the expressed polypeptide. The signals may be endogenous to the polypeptide or they may be heterologous signals.

The polypeptide may be expressed in a modified form, such as a fusion protein, and may include not only secretion signals, but also additional heterologous functional regions. For instance, a region of additional amino acids, particularly charged amino acids, may be added to the N-terminus of the polypeptide to improve stability and persistence in the host cell, during purification, or during subsequent handling and storage. Also, peptide moieties may be added to the polypeptide to facilitate purification. Such regions may be removed prior to final preparation of the polypeptide. The addition of peptide moieties to polypeptides to engender secretion or excretion, to improve stability and to facilitate purification, among others, are familiar and routine techniques in the art. A preferred fusion protein comprises a heterologous region from immunoglobulin that is useful to solubilize proteins. For example, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion proteins comprising various portions of the constant region of immunoglobulin molecules together with another human protein or part thereof. In many cases, the Fc part in a fusion protein is thoroughly advantageous for use in therapy and diagnosis and thus results, for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other hand, for some uses it would be desirable to be able to delete the Fc part after the fusion protein has been expressed, detected and purified in the advantageous manner described. This is the case when the Fc portion proves to be a hindrance to use in therapy and diagnosis, for example when the fusion protein is to be used as antigen for immunizations. In drug discovery, for example, human proteins, such as the hIL-5 receptor, have been fused with Fc portions for the purpose of high-throughput screening assays to identify antagonists of hIL-5. See Bennett, D. et al., J. Molec. Recogn. 8:52-58 (1995) and Johanson, K. et al., J. Biol. Chem. 270 (16):9459-9471 (1995).

The S. pneumoniae polypeptides can be recovered and purified from recombinant cell cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite chromatography, lectin chromatography and high performance liquid chromatography ("HPLC"). Polypeptides of the present invention include naturally purified products, products of chemical synthetic procedures, and products produced by recombinant techniques from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, insect and mammalian cells.

POLYPEPTIDES AND FRAGMENTS

The invention further provides isolated polypeptides having the amino acid sequences described in Table 1, and shown as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and so on through SEQ ID NO:226, and peptides or polypeptides comprising portions of the above polypeptides. The terms "peptide" and "oligopeptide" are considered synonymous (as is commonly recognized) and each term can be used interchangeably as the context requires to indicate a chain of at least two amino acids coupled by peptidyl linkages. The word "polypeptide" is used herein for chains containing more than ten amino acid residues. All oligopeptide and polypeptide formulas or sequences herein are written from left to right and in the direction from amino terminus to carboxy terminus.

Some amino acid sequences of the S. pneumoniae polypeptides described in Table 1 can be varied without significantly affecting the antigenicity of the polypeptides. If such differences in sequence are contemplated, it should be remembered that there will be critical areas on the polypeptide which determine antigenicity. In general, it is possible to replace residues which do not form part of an antigenic epitope without significantly affecting the antigenicity of a polypeptide. Guidance for such alterations is given in Table 2, wherein the epitopes for each polypeptide are delineated.

The polypeptides of the present invention are preferably provided in an isolated form. By "isolated polypeptide" is intended a polypeptide removed from its native environment. Thus, a polypeptide produced and/or contained within a recombinant host cell is considered isolated for purposes of the present invention. Also intended as an "isolated polypeptide" is a polypeptide that has been purified, partially or substantially, from a recombinant host cell. For example, recombinantly produced versions of the S. pneumoniae polypeptides described in Table 1 can be substantially purified by the one-step method described by Smith and Johnson (Gene 67:31-40 (1988)).

The polypeptides of the present invention include: (a) an amino acid sequence of any of the polypeptides described in Table 1; and (b) an amino acid sequence of an epitope-bearing portion of any one of the polypeptides of (a); as well as polypeptides with at least 70% similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% similarity to those described in (a) or (b) above, as well as polypeptides having an amino acid sequence at least 70% identical, more preferably at least 75% identical, and still more preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to those above.

By "% similarity" for two polypeptides is intended a similarity score produced by comparing the amino acid sequences of the two polypeptides using the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711) and the default settings for determining similarity. Bestfit uses the local homology algorithm of Smith and Waterman (Advances in Applied Mathematics 2:482-489 (1981)) to find the best segment of similarity between two sequences.

By a polypeptide having an amino acid sequence at least, for example, 95% "identical" to a reference amino acid sequence of a S. pneumoniae polypeptide is intended that the amino acid sequence of the polypeptide is identical to the reference sequence except that the polypeptide sequence may include up to five amino acid alterations per each 100 amino acids of the reference amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at least 95% identical to a reference amino acid sequence, up to 5% of the amino acid residues in the reference sequence may be deleted or substituted with another amino acid, or a number of amino acids up to 5% of the total amino acid residues in the reference sequence may be inserted into the reference sequence. These alterations of the reference sequence may occur at the amino or carboxy terminal positions of the reference amino acid sequence or anywhere between those terminal positions, interspersed either individually among residues in the reference sequence or in one or more contiguous groups within the reference sequence.

The amino acid sequences shown in Table 1 may have one or more "X" residues. "X" represents unknown. Thus, for purposes of defining identity, if any amino acid is present at the same position in a reference amino acid sequence (shown in Table 1) where an X is shown, the two sequences are identical at that position.

As a practical matter, whether any particular polypeptide is at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to, for instance, an amino acid sequence shown in Table 1, can be determined conventionally using known computer programs such as the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711). When using Bestfit or any other sequence alignment program to determine whether a particular sequence is, for instance, 95% identical to a reference sequence according to the present invention, the parameters are set, of course, such that the percentage of identity is calculated over the full length of the reference amino acid sequence and that gaps in homology of up to 5% of the total number of amino acid residues in the reference sequence are allowed.
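The identity definition above can be illustrated with a short calculation. The following is a minimal sketch only, not the Bestfit program: it assumes the two sequences have already been aligned by some other tool, that gaps are written as "-", and that the function name percent_identity is hypothetical. It counts substitutions, deletions and insertions as alterations, treats an "X" in the reference as identical to any residue, and reports identity over the full length of the reference sequence.

```python
def percent_identity(reference_aligned: str, query_aligned: str) -> float:
    """Percent identity over the full length of the reference sequence.

    Assumption: both strings come from the same pairwise alignment,
    have equal length, and use '-' for gap positions.
    """
    if len(reference_aligned) != len(query_aligned):
        raise ValueError("aligned sequences must have equal length")

    reference_length = 0  # ungapped residues of the reference
    alterations = 0       # substitutions + deletions + insertions

    for ref, qry in zip(reference_aligned.upper(), query_aligned.upper()):
        if ref == "-":
            # Residue inserted into the reference sequence.
            alterations += 1
            continue
        reference_length += 1
        if qry == "-":
            # Reference residue deleted.
            alterations += 1
        elif ref == "X" or ref == qry:
            # 'X' in the reference is unknown and counts as identical.
            pass
        else:
            # Reference residue substituted with another amino acid.
            alterations += 1

    if reference_length == 0:
        raise ValueError("reference sequence contains no residues")

    identity = 100.0 * (1.0 - alterations / reference_length)
    return max(identity, 0.0)


# Illustrative, made-up sequences (not taken from Table 1).
score = percent_identity("MKTAXLLV-G", "MKTAQLIVAG")
is_at_least_95 = score >= 95.0
```

In this made-up example the reference has nine residues and the query differs by one substitution and one insertion, so the pair falls well short of the 95% threshold.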

As described below, the polypeptides of the present invention can also be used to raise polyclonal and monoclonal antibodies, which are useful in assays for detecting Streptococcal protein expression.

In another aspect, the invention provides peptides and polypeptides comprising epitope-bearing portions of the S. pneumoniae polypeptides of the invention. These epitopes are immunogenic or antigenic epitopes of the polypeptides of the invention. An "immunogenic epitope" is defined as a part of a protein that elicits an antibody response when the whole protein or polypeptide is the immunogen. These immunogenic epitopes are believed to be confined to a few loci on the molecule. On the other hand, a region of a protein molecule to which an antibody can bind is defined as an "antigenic determinant" or "antigenic epitope." The number of immunogenic epitopes of a protein generally is less than the number of antigenic epitopes (Geysen, et al, Proc. Natl. Acad. Sci. USA 81:3998-4002 (1983)). Predicted antigenic epitopes are shown in Table 2, below.

As to the selection of peptides or polypeptides bearing an antigenic epitope (i.e., that contain a region of a protein molecule to which an antibody can bind), it is well known in the art that relatively short synthetic peptides that mimic part of a protein sequence are routinely capable of eliciting an antiserum that reacts with the partially mimicked protein (for instance, Sutcliffe, J., et al., Science 219:660-666 (1983)). Peptides capable of eliciting protein-reactive sera are frequently represented in the primary sequence of a protein, can be characterized by a set of simple chemical rules, and are confined neither to immunodominant regions of intact proteins (i.e., immunogenic epitopes) nor to the amino or carboxyl terminals. Peptides that are extremely hydrophobic and those of six or fewer residues generally are ineffective at inducing antibodies that bind to the mimicked protein; longer peptides, especially those containing proline residues, usually are effective (Sutcliffe, et al., supra, p. 661). For instance, 18 of 20 peptides designed according to these guidelines, containing 8-39 residues covering 75% of the sequence of the influenza virus hemagglutinin HA1 polypeptide chain, induced antibodies that reacted with the HA1 protein or intact virus; and 12/12 peptides from the MuLV polymerase and 18/18 from the rabies glycoprotein induced antibodies that precipitated the respective proteins.

Antigenic epitope-bearing peptides and polypeptides of the invention are therefore useful to raise antibodies, including monoclonal antibodies, that bind specifically to a polypeptide of the invention. Thus, a high proportion of hybridomas obtained by fusion of spleen cells from donors immunized with an antigen epitope-bearing peptide generally secrete antibody reactive with the native protein (Sutcliffe, et al., supra, p. 663). The antibodies raised by antigenic epitope-bearing peptides or polypeptides are useful to detect the mimicked protein, and antibodies to different peptides may be used for tracking the fate of various regions of a protein precursor which undergoes post-translational processing. The peptides and anti-peptide antibodies may be used in a variety of qualitative or quantitative assays for the mimicked protein, for instance in competition assays since it has been shown that even short peptides (e.g., about 9 amino acids) can bind and displace the larger peptides in immunoprecipitation assays (for instance, Wilson, et al., Cell 37:767-778 (1984) p. 777). The anti-peptide antibodies of the invention also are useful for purification of the mimicked protein, for instance, by adsorption chromatography using methods well known in the art.

Antigenic epitope-bearing peptides and polypeptides of the invention designed according to the above guidelines preferably contain a sequence of at least seven, more preferably at least nine and most preferably between about 15 to about 30 amino acids contained within the amino acid sequence of a polypeptide of the invention. However, peptides or polypeptides comprising a larger portion of an amino acid sequence of a polypeptide of the invention, containing about 30 to about 50 amino acids, or any length up to and including the entire amino acid sequence of a polypeptide of the invention, also are considered epitope-bearing peptides or polypeptides of the invention and also are useful for inducing antibodies that react with the mimicked protein. Preferably, the amino acid sequence of the epitope-bearing peptide is selected to provide substantial solubility in aqueous solvents (i.e., the sequence includes relatively hydrophilic residues and highly hydrophobic sequences are preferably avoided); and sequences containing proline residues are particularly preferred.
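The selection guidelines above (a minimum length of seven residues, avoidance of highly hydrophobic sequences, and a preference for proline-containing sequences) can be sketched as a simple screen. This is an illustrative sketch under stated assumptions and not the method used to generate Table 2: the Kyte-Doolittle hydropathy scale is substituted here as one common measure of hydrophobicity, the cutoff of 0.0 is arbitrary, and the names screen_peptides and mean_hydropathy are hypothetical.

```python
# Kyte-Doolittle hydropathy values (assumption: the text above does not
# name any particular hydrophobicity scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def mean_hydropathy(peptide: str) -> float:
    """Average hydropathy of a peptide; more negative is more hydrophilic."""
    return sum(KYTE_DOOLITTLE[residue] for residue in peptide) / len(peptide)


def screen_peptides(sequence: str, length: int = 15,
                    max_mean_hydropathy: float = 0.0):
    """Return (start, peptide, mean hydropathy, has_proline) tuples.

    Start positions are 1-based. Windows containing unknown residues
    (for example 'X') are skipped. Proline-containing windows are listed
    first, then the most hydrophilic windows.
    """
    if length < 7:
        raise ValueError("peptides of six or fewer residues are not considered")

    hits = []
    for start in range(len(sequence) - length + 1):
        peptide = sequence[start:start + length]
        if any(residue not in KYTE_DOOLITTLE for residue in peptide):
            continue
        score = mean_hydropathy(peptide)
        if score <= max_mean_hydropathy:
            hits.append((start + 1, peptide, round(score, 2), "P" in peptide))

    hits.sort(key=lambda hit: (not hit[3], hit[2]))
    return hits


# Illustrative, made-up sequence (not taken from Table 1).
candidates = screen_peptides("KDEPSKRT", length=7)
```

A screen of this kind only narrows the candidates; whether a given peptide actually elicits protein-reactive antibodies still has to be determined experimentally.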

Non-limiting examples of antigenic polypeptides or peptides that can be used to generate Streptococcal-specific antibodies include portions of the amino acid sequences identified in Table 1. More specifically, Table 2 discloses antigenic fragments of polypeptides of the present invention, which antigenic fragments comprise amino acid sequences from about the first amino acid residue indicated to about the last amino acid residue indicated for each fragment. The polypeptide fragments disclosed in Table 2 are believed to be antigenic regions of the S. pneumoniae polypeptides described in Table 1. Thus the invention further includes isolated peptides and polypeptides comprising an amino acid sequence of an epitope shown in Table 2 and polynucleotides encoding said polypeptides.

The epitope-bearing peptides and polypeptides of the invention may be produced by any conventional means for making peptides or polypeptides including recombinant means using nucleic acid molecules of the invention. For instance, an epitope-bearing amino acid sequence of the present invention may be fused to a larger polypeptide which acts as a carrier during recombinant production and purification, as well as during immunization to produce anti-peptide antibodies. Epitope-bearing peptides also may be synthesized using known methods of chemical synthesis. For instance, Houghten has described a simple method for synthesis of large numbers of peptides, such as 10-20 mg of 248 different 13 residue peptides representing single amino acid variants of a segment of the HA1 polypeptide which were prepared and characterized (by ELISA-type binding studies) in less than four weeks (Houghten, R. A. Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985)). This "Simultaneous Multiple Peptide Synthesis (SMPS)" process is further described in U.S. Pat. No. 4,631,211 to Houghten and coworkers (1986). In this procedure the individual resins for the solid-phase synthesis of various peptides are contained in separate solvent-permeable packets, enabling the optimal use of the many identical repetitive steps involved in solid-phase methods. A completely manual procedure allows 500-1000 or more syntheses to be conducted simultaneously (Houghten, et al., supra, p. 5134).

Epitope-bearing peptides and polypeptides of the invention are used to induce antibodies according to methods well known in the art (for instance, Sutcliffe, et al., supra; Wilson, et al., supra; Chow, M., et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J., et al., J. Gen. Virol. 66:2347-2354 (1985)). Generally, animals may be immunized with free peptide; however, anti-peptide antibody titer may be boosted by coupling of the peptide to a macromolecular carrier, such as keyhole limpet hemocyanin (KLH) or tetanus toxoid. For instance, peptides containing cysteine may be coupled to carrier using a linker such as m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may be coupled to carrier using a more general linking agent such as glutaraldehyde. Animals such as rabbits, rats and mice are immunized with either free or carrier-coupled peptides, for instance, by intraperitoneal and/or intradermal injection of emulsions containing about 100 .mu.g peptide or carrier protein and Freund's adjuvant. Several booster injections may be needed, for instance, at intervals of about two weeks, to provide a useful titer of anti-peptide antibody which can be detected, for example, by ELISA assay using free peptide adsorbed to a solid surface. The titer of anti-peptide antibodies in serum from an immunized animal may be increased by selection of anti-peptide antibodies, for instance, by adsorption to the peptide on a solid support and elution of the selected antibodies according to methods well known in the art.

Immunogenic epitope-bearing peptides of the invention, i.e., those parts of a protein that elicit an antibody response when the whole protein is the immunogen, are identified according to methods known in the art. For instance, Geysen, et al., supra, discloses a procedure for rapid concurrent synthesis on solid supports of hundreds of peptides of sufficient purity to react in an enzyme-linked immunosorbent assay. Interaction of synthesized peptides with antibodies is then easily detected without removing them from the support. In this manner a peptide bearing an immunogenic epitope of a desired protein may be identified routinely by one of ordinary skill in the art. For instance, the immunologically important epitope in the coat protein of foot-and-mouth disease virus was located by Geysen et al., supra, with a resolution of seven amino acids by synthesis of an overlapping set of all 208 possible hexapeptides covering the entire 213 amino acid sequence of the protein. Then, a complete replacement set of peptides, in which all 20 amino acids were substituted in turn at every position within the epitope, was synthesized, and the particular amino acids conferring specificity for the reaction with antibody were determined. Thus, peptide analogs of the epitope-bearing peptides of the invention can be made routinely by this method. U.S. Pat. No. 4,708,781 to Geysen (1987) further describes this method of identifying a peptide bearing an immunogenic epitope of a desired protein.
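The peptide sets described above can be enumerated with a few lines of code. The following is a minimal sketch for illustration: the function names are hypothetical, the 213-residue sequence is a synthetic placeholder rather than the actual coat protein sequence, and the epitope "GDLQVL" is used only as an example input. A protein of n residues yields n - 5 overlapping hexapeptides, which gives 208 for n = 213. The replacement set here lists the 19 non-parent substitutions at each position; the parent peptide itself accounts for the twentieth amino acid.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def overlapping_peptides(sequence: str, length: int = 6):
    """All overlapping peptides of the given length, stepping one residue."""
    return [sequence[i:i + length]
            for i in range(len(sequence) - length + 1)]


def replacement_set(epitope: str):
    """Every single-residue substitution of the epitope.

    Each position is replaced in turn by each of the other 19 amino acids.
    """
    variants = []
    for position, original in enumerate(epitope):
        for amino_acid in AMINO_ACIDS:
            if amino_acid != original:
                variants.append(
                    epitope[:position] + amino_acid + epitope[position + 1:]
                )
    return variants


# Synthetic 213-residue placeholder (not a real protein sequence).
placeholder_protein = (AMINO_ACIDS * 11)[:213]
hexapeptides = overlapping_peptides(placeholder_protein)
print(len(hexapeptides))  # prints 208

variants = replacement_set("GDLQVL")
print(len(variants))  # prints 114 (6 positions x 19 substitutions)
```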

Further still, U.S. Pat. No. 5,194,392, to Geysen (1990), describes a general method of detecting or determining the sequence of monomers (amino acids or other compounds) which is a topological equivalent of the epitope (i.e., a "mimotope") which is complementary to a particular paratope (antigen binding site) of an antibody of interest. More generally, U.S. Pat. No. 4,433,092, also to Geysen (1989), describes a method of detecting or determining a sequence of monomers which is a topographical equivalent of a ligand which is complementary to the ligand binding site of a particular receptor of interest. Similarly, U.S. Pat. No. 5,480,971 to Houghten, R. A. et al. (1996) discloses linear C.sub.1 -C.sub.7 -alkyl peralkylated oligopeptides and sets and libraries of such peptides, as well as methods for using such oligopeptide sets and libraries for determining the sequence of a peralkylated oligopeptide that preferentially binds to an acceptor molecule of interest. Thus, non-peptide analogs of the epitope-bearing peptides of the invention also can be made routinely by these methods.

The entire disclosure of each document cited in this section on "Polypeptides and Fragments" is hereby incorporated herein by reference.

As one of skill in the art will appreciate, the polypeptides of the present invention and the epitope-bearing fragments thereof described above can be combined with parts of the constant domain of immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins facilitate purification and show an increased half-life in vivo. This has been shown, e.g., for chimeric proteins consisting of the first two domains of the human CD4-polypeptide and various domains of the constant regions of the heavy or light chains of mammalian immunoglobulins (EPA 0,394,827; Traunecker et al., Nature 331:84-86 (1988)). Fusion proteins that have a disulfide-linked dimeric structure due to the IgG part can also be more efficient in binding and neutralizing other molecules than a monomeric S. pneumoniae polypeptide or fragment thereof alone (Fountoulakis et al., J. Biochem. 270:3958-3964 (1995)).

DIAGNOSTIC ASSAYS

The present invention further relates to a method for assaying for Streptococcal infection in an animal via detecting the expression of genes encoding Streptococcal polypeptides (e.g., the polypeptides described in Table 1). This method comprises analyzing tissue or body fluid from the animal for Streptococcus-specific antibodies or Streptococcal nucleic acids or proteins. Analysis of nucleic acid specific to Streptococcus can be done by PCR or hybridization techniques using nucleic acid sequences of the present invention as either hybridization probes or primers (cf. Molecular Cloning: A Laboratory Manual, second edition, edited by Sambrook, Fritsch, & Maniatis, Cold Spring Harbor Laboratory, 1989; Eremeeva et al., J. Clin. Microbiol. 32:803-810 (1994), which describes differentiation among spotted fever group Rickettsiae species by analysis of restriction fragment length polymorphism of PCR-amplified DNA). Methods for detecting B. burgdorferi nucleic acids via PCR are described, for example, in Chen et al., J. Clin. Microbiol. 32:589-595 (1994).

Where diagnosis of a disease state related to infection with Streptococcus has already been made, the present invention is useful for monitoring progression or regression of the disease state whereby patients exhibiting enhanced Streptococcus gene expression will experience a worse clinical outcome relative to patients expressing these gene(s) at a lower level.

By "assaying for Streptococcal infection in an animal via detection of genes encoding Streptococcal polypeptides" is intended qualitatively or quantitatively measuring or estimating the level of one or more Streptococcus polypeptides or the level of nucleic acid encoding Streptococcus polypeptides in a first biological sample either directly (e.g., by determining or estimating absolute protein level or nucleic acid level) or relatively (e.g., by comparing to the Streptococcus polypeptide level or mRNA level in a second biological sample). The Streptococcus polypeptide level or nucleic acid level in the second sample used for a relative comparison may be undetectable if obtained from an animal which is not infected with Streptococcus. When monitoring the progression or regression of a disease state, the Streptococcus polypeptide level or nucleic acid level may be compared to a second sample obtained from either an animal infected with Streptococcus or the same animal from which the first sample was obtained but taken from that animal at a different time than the first. As will be appreciated in the art, once a standard Streptococcus polypeptide level or nucleic acid level which corresponds to a particular stage of a Streptococcus infection is known, it can be used repeatedly as a standard for comparison.

By "biological sample" is intended any biological sample obtained from an animal, cell line, tissue culture, or other source which contains Streptococcus polypeptide, mRNA, or DNA. Biological samples include body fluids (such as plasma and synovial fluid) which contain Streptococcus polypeptides, and muscle, skin, and cartilage tissues. Methods for obtaining tissue biopsies and body fluids are well known in the art.

The present invention is useful for detecting diseases related to Streptococcus infections in animals. Preferred animals include monkeys, apes, cats, dogs, cows, pigs, mice, horses, rabbits and humans. Particularly preferred are humans.

Total RNA can be isolated from a biological sample using any suitable technique such as the single-step guanidinium-thiocyanate-phenol-chloroform method described in Chomczynski and Sacchi, Anal. Biochem. 162:156-159 (1987). mRNA encoding Streptococcus polypeptides having sufficient homology to the nucleic acid sequences identified in Table 1 to allow for hybridization between complementary sequences are then assayed using any appropriate method. These include Northern blot analysis, S1 nuclease mapping, the polymerase chain reaction (PCR), reverse transcription in combination with the polymerase chain reaction (RT-PCR), and reverse transcription in combination with the ligase chain reaction (RT-LCR).

Northern blot analysis can be performed as described in Harada et al., Cell 63:303-312 (1990). Briefly, total RNA is prepared from a biological sample as described above. For the Northern blot, the RNA is denatured in an appropriate buffer (such as glyoxal/dimethyl sulfoxide/sodium phosphate buffer), subjected to agarose gel electrophoresis, and transferred onto a nitrocellulose filter. After the RNAs have been linked to the filter by a UV linker, the filter is prehybridized in a solution containing formamide, SSC, Denhardt's solution, denatured salmon sperm DNA, SDS, and sodium phosphate buffer. A S. pneumoniae polypeptide DNA sequence shown in Table 1 labeled according to any appropriate method (such as the .sup.32 P-multiprimed DNA labeling system (Amersham)) is used as probe. After hybridization overnight, the filter is washed and exposed to x-ray film. DNA for use as probe according to the present invention is described in the sections above and will preferably be at least 15 bp in length.

S1 mapping can be performed as described in Fujita et al., Cell 49:357-367 (1987). To prepare probe DNA for use in S1 mapping, the sense strand of an above-described S. pneumoniae DNA sequence of the present invention is used as a template to synthesize labeled antisense DNA. The antisense DNA can then be digested using an appropriate restriction endonuclease to generate further DNA probes of a desired length. Such antisense probes are useful for visualizing protected bands corresponding to the target mRNA (i.e., mRNA encoding Streptococcus polypeptides).

Preferably, levels of mRNA encoding Streptococcus polypeptides are assayed using the RT-PCR method described in Makino et al., Technique 2:295-301 (1990). By this method, the radioactivities of the "amplicons" in the polyacrylamide gel bands are linearly related to the initial concentration of the target mRNA. Briefly, this method involves adding total RNA isolated from a biological sample to a reaction mixture containing a RT primer and appropriate buffer. After incubating for primer annealing, the mixture can be supplemented with a RT buffer, dNTPs, DTT, RNase inhibitor and reverse transcriptase. After incubation to achieve reverse transcription of the RNA, the RT products are then subjected to PCR using labeled primers. Alternatively, rather than labeling the primers, a labeled dNTP can be included in the PCR reaction mixture. PCR amplification can be performed in a DNA thermal cycler according to conventional techniques. After a suitable number of rounds to achieve amplification, the PCR reaction mixture is electrophoresed on a polyacrylamide gel. After drying the gel, the radioactivity of the appropriate bands (corresponding to the mRNA encoding the Streptococcus polypeptides) is quantified using an imaging analyzer. RT and PCR reaction ingredients and conditions, reagent and gel concentrations, and labeling methods are well known in the art. Variations on the RT-PCR method will be apparent to the skilled artisan.

Assaying Streptococcus polypeptide levels in a biological sample can occur using any art-known method. Preferred for assaying Streptococcus polypeptide levels in a biological sample are antibody-based techniques. For example, Streptococcus polypeptide expression in tissues can be studied with classical immunohistological methods. In these, the specific recognition is provided by the primary antibody (polyclonal or monoclonal) but the secondary detection system can utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a result, an immunohistological staining of tissue section for pathological examination is obtained. Tissues can also be extracted, e.g., with urea and neutral detergent, for the liberation of Streptococcus polypeptides for Western-blot or dot/slot assay (Jalkanen, M., et al., J. Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell . Biol. 105:3087-3096 (1987)). In this technique, which is based on the use of cationic solid phases, quantitation of a Streptococcus polypeptide can be accomplished using an isolated Streptococcus polypeptide as a standard. This technique can also be applied to body fluids.

Other antibody-based methods useful for detecting Streptococcus polypeptide gene expression include immunoassays, such as the enzyme linked immunosorbent assay (ELISA) and the radioimmunoassay (RIA). For example, Streptococcus polypeptide-specific monoclonal antibodies can be used both as an immunoabsorbent and as an enzyme-labeled probe to detect and quantify a Streptococcus polypeptide. The amount of a Streptococcus polypeptide present in the sample can be calculated by reference to the amount present in a standard preparation using a linear regression computer algorithm. Such an ELISA for detecting a tumor antigen is described in Iacobelli et al., Breast Cancer Research and Treatment 11:19-30 (1988). In another ELISA assay, two distinct specific monoclonal antibodies can be used to detect Streptococcus polypeptides in a body fluid. In this assay, one of the antibodies is used as the immunoabsorbent and the other as the enzyme-labeled probe.
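The calculation by reference to a standard preparation can be sketched as an ordinary least-squares fit of signal against known amounts, followed by interpolation of the unknown. This is a minimal illustration under stated assumptions: the function names are hypothetical, the standard amounts and absorbance readings are invented for the example, and the assay response is assumed to be linear over the range used.

```python
def fit_standard_curve(amounts, signals):
    """Least-squares line through (amount, signal) points of the standards.

    Returns (slope, intercept) such that signal = slope * amount + intercept.
    """
    n = len(amounts)
    if n < 2 or n != len(signals):
        raise ValueError("need at least two paired standard measurements")

    mean_x = sum(amounts) / n
    mean_y = sum(signals) / n
    sxx = sum((x - mean_x) ** 2 for x in amounts)
    if sxx == 0:
        raise ValueError("standard amounts must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(amounts, signals))

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def amount_from_signal(signal, slope, intercept):
    """Invert the standard curve to estimate the amount in a sample."""
    if slope == 0:
        raise ValueError("standard curve is flat; cannot interpolate")
    return (signal - intercept) / slope


# Invented standard preparation: amounts (arbitrary units) and absorbances.
standard_amounts = [0.0, 10.0, 20.0, 40.0]
standard_signals = [0.05, 0.25, 0.45, 0.85]

slope, intercept = fit_standard_curve(standard_amounts, standard_signals)
sample_amount = amount_from_signal(0.65, slope, intercept)
```

With these invented readings the fitted line has a slope of about 0.02 absorbance units per unit amount, so a sample reading of 0.65 corresponds to roughly 30 units. Estimates are only meaningful for readings that fall within the range spanned by the standards.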

The above techniques may be conducted essentially as a "one-step" or "two-step" assay. The "one-step" assay involves contacting the Streptococcus polypeptide with immobilized antibody and, without washing, contacting the mixture with the labeled antibody. The "two-step" assay involves washing before contacting the mixture with the labeled antibody. Other conventional methods may also be employed as suitable. It is usually desirable to immobilize one component of the assay system on a support, thereby allowing other components of the system to be brought into contact with the component and readily removed from the sample.

Streptococcus polypeptide-specific antibodies for use in the present invention can be raised against an intact S. pneumoniae polypeptide of the present invention or fragment thereof. These polypeptides and fragments may be administered to an animal (e.g., rabbit or mouse) either with a carrier protein (e.g., albumin) or, if long enough (e.g., at least about 25 amino acids), without a carrier.

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant to include intact molecules as well as antibody fragments (such as, for example, Fab and F(ab').sub.2 fragments) which are capable of specifically binding to a Streptococcus polypeptide. Fab and F(ab').sub.2 fragments lack the Fc fragment of intact antibody, clear more rapidly from the circulation, and may have less non-specific tissue binding than an intact antibody (Wahl et al., J. Nucl. Med. 24:316-325 (1983)). Thus, these fragments are preferred.

The antibodies of the present invention may be prepared by any of a variety of methods. For example, the S. pneumoniae polypeptides identified in Table 1, or fragments thereof, can be administered to an animal in order to induce the production of sera containing polyclonal antibodies. In a preferred method, a preparation of a S. pneumoniae polypeptide of the present invention is prepared and purified to render it substantially free of natural contaminants. Such a preparation is then introduced into an animal in order to produce polyclonal antisera of high specific activity.

In the most preferred method, the antibodies of the present invention are monoclonal antibodies. Such monoclonal antibodies can be prepared using hybridoma technology (Kohler et al., Nature 256:495 (1975); Kohler et al., Eur. J. Immunol. 6:511 (1976); Kohler et al., Eur. J. Immunol. 6:292 (1976); Hammerling et al., In: Monoclonal Antibodies and T-Cell Hybridomas, Elsevier, N.Y., (1981) pp. 563-681). In general, such procedures involve immunizing an animal (preferably a mouse) with a S. pneumoniae polypeptide antigen of the present invention. Suitable cells can be recognized by their capacity to bind anti-Streptococcus polypeptide antibody. Such cells may be cultured in any suitable tissue culture medium; however, it is preferable to culture cells in Earle's modified Eagle's medium supplemented with 10% fetal bovine serum (inactivated at about 56.degree. C.), and supplemented with about 10 g/l of nonessential amino acids, about 1,000 U/ml of penicillin, and about 100 .mu.g/ml of streptomycin. The splenocytes of such mice are extracted and fused with a suitable myeloma cell line. Any suitable myeloma cell line may be employed in accordance with the present invention; however, it is preferable to employ the parent myeloma cell line (SP.sub.2 O), available from the American Type Culture Collection, Rockville, Md. After fusion, the resulting hybridoma cells are selectively maintained in HAT medium, and then cloned by limiting dilution as described by Wands et al. (Gastroenterology 80:225-232 (1981)). The hybridoma cells obtained through such a selection are then assayed to identify clones which secrete antibodies capable of binding the Streptococcus polypeptide antigen administered to the immunized animal.

Alternatively, additional antibodies capable of binding to Streptococcus polypeptide antigens may be produced in a two-step procedure through the use of anti-idiotypic antibodies. Such a method makes use of the fact that antibodies are themselves antigens, and that, therefore, it is possible to obtain an antibody which binds to a second antibody. In accordance with this method, Streptococcus polypeptide-specific antibodies are used to immunize an animal, preferably a mouse. The splenocytes of such an animal are then used to produce hybridoma cells, and the hybridoma cells are screened to identify clones which produce an antibody whose ability to bind to the Streptococcus polypeptide-specific antibody can be blocked by a Streptococcus polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to the Streptococcus polypeptide-specific antibody and can be used to immunize an animal to induce formation of further Streptococcus polypeptide-specific antibodies.

It will be appreciated that Fab and F(ab').sub.2 and other fragments of the antibodies of the present invention may be used according to the methods disclosed herein. Such fragments are typically produced by proteolytic cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab').sub.2 fragments). Alternatively, Streptococcus polypeptide-binding fragments can be produced through the application of recombinant DNA technology or through synthetic chemistry.

Of special interest to the present invention are antibodies to Streptococcus polypeptide antigens which are produced in humans, or are "humanized" (i.e., non-immunogenic in a human) by recombinant or other technology. Humanized antibodies may be produced, for example, by replacing an immunogenic portion of an antibody with a corresponding, but non-immunogenic portion (i.e., chimeric antibodies) (Robinson, R. R. et al., International Patent Publication PCT/US86/02269; Akira, K. et al., European Patent Application 184,187; Taniguchi, M., European Patent Application 171,496; Morrison, S. L. et al., European Patent Application 173,494; Neuberger, M. S. et al., PCT Application WO 86/01533; Cabilly, S. et al., European Patent Application 125,023; Better, M. et al., Science 240:1041-1043 (1988); Liu, A. Y. et al., Proc. Natl. Acad. Sci. USA 84:3439-3443 (1987); Liu, A. Y. et al., J. Immunol. 139:3521-3526 (1987); Sun, L. K. et al., Proc. Natl. Acad. Sci. USA 84:214-218 (1987); Nishimura, Y. et al., Canc. Res. 47:999-1005 (1987); Wood, C. R. et al., Nature 314:446-449 (1985); Shaw et al., J. Natl. Cancer Inst. 80:1553-1559 (1988)). General reviews of "humanized" chimeric antibodies are provided by Morrison, S. L. (Science, 229:1202-1207 (1985)) and by Oi, V. T. et al. (BioTechniques 4:214 (1986)). Suitable "humanized" antibodies can be alternatively produced by CDR or CEA substitution (Jones, P. T. et al., Nature 321:522-525 (1986); Verhoeyen et al., Science 239:1534 (1988); Beidler, C. B. et al., J. Immunol. 141:4053-4060 (1988)).

Suitable enzyme labels include, for example, those from the oxidase group, which catalyze the production of hydrogen peroxide by reacting with substrate. Glucose oxidase is particularly preferred as it has good stability and its substrate (glucose) is readily available. Activity of an oxidase label may be assayed by measuring the concentration of hydrogen peroxide formed by the enzyme-labeled antibody/substrate reaction. Besides enzymes, other suitable labels include radioisotopes, such as iodine (.sup.125 I, .sup.121 I ), carbon (.sup.14 C), sulphur (.sup.35 S), tritium (.sup.3 H), indium (.sup.112 In), and technetium (.sup.99m Tc), and fluorescent labels, such as fluorescein and rhodamine, and biotin.

Further suitable labels for the Streptococcus polypeptide-specific antibodies of the present invention are provided below. Examples of suitable enzyme labels include malate dehydrogenase, staphylococcal nuclease, delta-5-steroid isomerase, yeast-alcohol dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose phosphate isomerase, peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and acetylcholine esterase.

Examples of suitable radioisotopic labels include .sup.3 H, .sup.111 In, .sup.125 I, .sup.131 I, .sup.32 P, .sup.35 S, .sup.14 C, .sup.51 Cr, .sup.57 Co, .sup.58 Co, .sup.59 Fe, .sup.75 Se, .sup.152 Eu, .sup.90 Y, .sup.67 Cu, .sup.217 Ci, .sup.211 At, .sup.212 Pb, .sup.47 Sc, .sup.109 Pd, etc. .sup.111 In is a preferred isotope where in vivo imaging is used since it avoids the problem of dehalogenation of the .sup.125 I or .sup.131 I-labeled monoclonal antibody by the liver. In addition, this radionuclide has a more favorable gamma emission energy for imaging (Perkins et al., Eur. J. Nucl. Med. 10:296-301 (1985); Carasquillo et al., J. Nucl. Med. 28:281-287 (1987)). For example, .sup.111 In coupled to monoclonal antibodies with 1-(P-isothiocyanatobenzyl)-DTPA has shown little uptake in non-tumorous tissues, particularly the liver, and therefore enhances specificity of tumor localization (Esteban et al., J. Nucl. Med. 28:861-870 (1987)).

Examples of suitable non-radioactive isotopic labels include .sup.157 Gd, .sup.55 Mn, .sup.162 Dy, .sup.52 Tr, and .sup.56 Fe.

Examples of suitable fluorescent labels include an .sup.152 Eu label, a fluorescein label, an isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin label, an allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label.

Examples of suitable toxin labels include diphtheria toxin, ricin, and cholera toxin.

Examples of chemiluminescent labels include a luminol label, an isoluminol label, an aromatic acridinium ester label, an imidazole label, an acridinium salt label, an oxalate ester label, a luciferin label, a luciferase label, and an aequorin label.

Examples of nuclear magnetic resonance contrasting agents include heavy metal nuclei such as Gd, Mn, and iron.

Typical techniques for binding the above-described labels to antibodies are provided by Kennedy et al., Clin. Chim. Acta 70:1-31 (1976), and Schurs et al., Clin. Chim. Acta 81:1-40 (1977). Coupling techniques mentioned in the latter are the glutaraldehyde method, the periodate method, the dimaleimide method, and the m-maleimidobenzyl-N-hydroxy-succinimide ester method, all of which methods are incorporated by reference herein.

In a related aspect, the invention includes a diagnostic kit for use in screening serum containing antibodies specific against S. pneumoniae infection. Such a kit may include an isolated S. pneumoniae antigen comprising an epitope which is specifically immunoreactive with at least one anti-S. pneumoniae antibody. Such a kit also includes means for detecting the binding of said antibody to the antigen. In specific embodiments, the kit may include a recombinantly produced or chemically synthesized peptide or polypeptide antigen. The peptide or polypeptide antigen may be attached to a solid support.

In a more specific embodiment, the detecting means of the above-described kit includes a solid support to which said peptide or polypeptide antigen is attached. Such a kit may also include a non-attached reporter-labelled anti-human antibody. In this embodiment, binding of the antibody to the S. pneumoniae antigen can be detected by binding of the reporter labelled antibody to the anti-S. pneumoniae antibody.

In a related aspect, the invention includes a method of detecting S. pneumoniae infection in a subject. This detection method includes reacting a body fluid, preferably serum, from the subject with an isolated S. pneumoniae antigen, and examining the antigen for the presence of bound antibody. In a specific embodiment, the method includes a polypeptide antigen attached to a solid support, and serum is reacted with the support. Subsequently, the support is reacted with a reporter-labeled anti-human antibody. The support is then examined for the presence of reporter-labeled antibody.

The solid surface reagent employed in the above assays and kits is prepared by known techniques for attaching protein material to solid support material, such as polymeric beads, dip sticks, 96-well plates or filter material. These attachment methods generally include non-specific adsorption of the protein to the support or covalent attachment of the protein, typically through a free amine group, to a chemically reactive group on the solid support, such as an activated carboxyl, hydroxyl, or aldehyde group. Alternatively, streptavidin coated plates can be used in conjunction with biotinylated antigen(s).

THERAPEUTICS AND MODES OF ADMINISTRATION

The present invention also provides vaccines comprising one or more polypeptides of the present invention. Heterogeneity in the composition of a vaccine may be provided by combining S. pneumoniae polypeptides of the present invention. Multi-component vaccines of this type are desirable because they are likely to be more effective in eliciting protective immune responses against multiple species and strains of the Streptococcus genus than single polypeptide vaccines. Thus, as discussed in detail below, a multi-component vaccine of the present invention may contain one or more, preferably 2 to about 20, more preferably 2 to about 15, and most preferably 3 to about 8, of the S. pneumoniae polypeptides identified in Table 1, or fragments thereof.

Multi-component vaccines are known in the art to elicit antibody production to numerous immunogenic components. Decker, M. and Edwards, K., J. Infect. Dis. 174:S270-275 (1996). In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent vaccine has recently been demonstrated to elicit protective levels of antibodies in human infants against all four pathogenic agents. Aristegui, J. et al., Vaccine 15:7-9 (1997).

The present invention thus also includes multi-component vaccines. These vaccines comprise more than one polypeptide, immunogen or antigen. An example of such a multi-component vaccine would be a vaccine comprising more than one of the S. pneumoniae polypeptides described in Table 1. A second example is a vaccine comprising one or more, for example 2 to 10, of the S. pneumoniae polypeptides identified in Table 1 and one or more, for example 2 to 10, additional polypeptides of either streptococcal or non-streptococcal origin. Thus, a multi-component vaccine which confers protective immunity to both a Streptococcal infection and infection by another pathogenic agent is also within the scope of the invention.

As indicated above, the vaccines of the present invention are expected to elicit a protective immune response against infections caused by species and strains of Streptococcus other than the strain of S. pneumoniae deposited with the ATCC.

Further within the scope of the invention are whole cell and whole viral vaccines. Such vaccines may be produced recombinantly and involve the expression of one or more of the S. pneumoniae polypeptides described in Table 1. For example, the S. pneumoniae polypeptides of the present invention may be either secreted or localized intracellularly, on the cell surface, or in the periplasmic space. Further, when a recombinant virus is used, the S. pneumoniae polypeptides of the present invention may, for example, be localized in the viral envelope, on the surface of the capsid, or internally within the capsid. Whole cell vaccines which employ cells expressing heterologous proteins are known in the art. See, e.g., Robinson, K. et al., Nature Biotech. 15:653-657 (1997); Sirard, J. et al., Infect. Immun. 65:2029-2033 (1997); Chabalgoity, J. et al., Infect. Immun. 65:2402-2412 (1997). These cells may be administered live or may be killed prior to administration. Chabalgoity, J. et al., supra, for example, report the successful use in mice of a live attenuated Salmonella vaccine strain which expresses a portion of a platyhelminth fatty acid-binding protein as a fusion protein on its cell surface.

A multi-component vaccine can also be prepared using techniques known in the art by combining one or more S. pneumoniae polypeptides of the present invention, or fragments thereof, with additional non-streptococcal components (e.g., diphtheria toxin or tetanus toxin, and/or other compounds known to elicit an immune response). Such vaccines are useful for eliciting protective immune responses to both members of the Streptococcus genus and non-streptococcal pathogenic agents.

The vaccines of the present invention also include DNA vaccines. DNA vaccines are currently being developed for a number of infectious diseases. Boyer, J. et al., Nat. Med. 3:526-532 (1997); reviewed in Spier, R., Vaccine 14:1285-1288 (1996). Such DNA vaccines contain a nucleotide sequence encoding one or more S. pneumoniae polypeptides of the present invention oriented in a manner that allows for expression of the subject polypeptide. The direct administration of plasmid DNA encoding B. burgdorferi OspA has been shown to elicit protective immunity in mice against borrelial challenge. Luke, C. et al., J. Infect. Dis. 175:91-97 (1997).

The present invention also relates to the administration of a vaccine which is co-administered with a molecule capable of modulating immune responses. Kim, J. et al., Nature Biotech. 15:641-646 (1997), for example, report the enhancement of immune responses produced by DNA immunizations when DNA sequences encoding molecules which stimulate the immune response are co-administered. In a similar fashion, the vaccines of the present invention may be co-administered with either nucleic acids encoding immune modulators or the immune modulators themselves. These immune modulators include granulocyte macrophage colony stimulating factor (GM-CSF) and CD86.

The vaccines of the present invention may be used to confer resistance to streptococcal infection by either passive or active immunization. When the vaccines of the present invention are used to confer resistance to streptococcal infection through active immunization, a vaccine of the present invention is administered to an animal to elicit a protective immune response which either prevents or attenuates a streptococcal infection. When the vaccines of the present invention are used to confer resistance to streptococcal infection through passive immunization, the vaccine is provided to a host animal (e.g., human, dog, or mouse), and the antisera elicited by this vaccine are recovered and directly provided to a recipient suspected of having an infection caused by a member of the Streptococcus genus.

The ability to label antibodies, or fragments of antibodies, with toxin molecules provides an additional method for treating streptococcal infections when passive immunization is conducted. In this embodiment, antibodies, or fragments of antibodies, capable of recognizing the S. pneumoniae polypeptides disclosed herein, or fragments thereof, as well as other Streptococcus proteins, are labeled with toxin molecules prior to their administration to the patient. When such toxin derivatized antibodies bind to Streptococcus cells, toxin moieties will be localized to these cells and will cause their death.

The present invention thus concerns and provides a means for preventing or attenuating a streptococcal infection resulting from organisms which have antigens that are recognized and bound by antisera produced in response to the polypeptides of the present invention. As used herein, a vaccine is said to prevent or attenuate a disease if its administration to an animal results either in the total or partial attenuation (i.e., suppression) of a symptom or condition of the disease, or in the total or partial immunity of the animal to the disease.

The administration of the vaccine (or the antisera which it elicits) may be for either a "prophylactic" or "therapeutic" purpose. When provided prophylactically, the compound(s) are provided in advance of any symptoms of streptococcal infection. The prophylactic administration of the compound(s) serves to prevent or attenuate any subsequent infection. When provided therapeutically, the compound(s) is provided upon or after the detection of symptoms which indicate that an animal may be infected with a member of the Streptococcus genus. The therapeutic administration of the compound(s) serves to attenuate any actual infection. Thus, the S. pneumoniae polypeptides, and fragments thereof, of the present invention may be provided either prior to the onset of infection (so as to prevent or attenuate an anticipated infection) or after the initiation of an actual infection.

The polypeptides of the invention, whether encoding a portion of a native protein or a functional derivative thereof, may be administered in pure form or may be coupled to a macromolecular carrier. Examples of such carriers are proteins and carbohydrates. Suitable proteins which may act as macromolecular carriers for enhancing the immunogenicity of the polypeptides of the present invention include keyhole limpet hemocyanin (KLH), tetanus toxoid, pertussis toxin, bovine serum albumin, and ovalbumin. Methods for coupling the polypeptides of the present invention to such macromolecular carriers are disclosed in Harlow et al., Antibodies: A Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1988), the entire disclosure of which is incorporated by reference herein.

A composition is said to be "pharmacologically acceptable" if its administration can be tolerated by a recipient animal and is otherwise suitable for administration to that animal. Such an agent is said to be administered in a "therapeutically effective amount" if the amount administered is physiologically significant. An agent is physiologically significant if its presence results in a detectable change in the physiology of a recipient patient.

While in all instances the vaccine of the present invention is administered as a pharmacologically acceptable compound, one skilled in the art would recognize that the composition of a pharmacologically acceptable compound varies with the animal to which it is administered. For example, a vaccine intended for human use will generally not be co-administered with Freund's adjuvant. Further, the level of purity of the S. pneumoniae polypeptides of the present invention will normally be higher when administered to a human than when administered to a non-human animal.

As would be understood by one of ordinary skill in the art, when the vaccine of the present invention is provided to an animal, it may be in a composition which may contain salts, buffers, adjuvants, or other substances which are desirable for improving the efficacy of the composition. Adjuvants are substances that can be used to specifically augment a specific immune response. These substances generally perform two functions: (1) they protect the antigen(s) from being rapidly catabolized after administration and (2) they nonspecifically stimulate immune responses.

Normally, the adjuvant and the composition are mixed prior to presentation to the immune system, or presented separately, but into the same site of the animal being immunized. Adjuvants can be loosely divided into several groups based upon their composition. These groups include oil adjuvants (for example, Freund's complete and incomplete), mineral salts (for example, AlK(SO.sub.4).sub.2, AlNa(SO.sub.4).sub.2, AlNH.sub.4 (SO.sub.4), silica, kaolin, and carbon), polynucleotides (for example, poly IC and poly AU acids), and certain natural substances (for example, wax D from Mycobacterium tuberculosis, as well as substances found in Corynebacterium parvum, or Bordetella pertussis, and members of the genus Brucella). Other substances useful as adjuvants are the saponins such as, for example, Quil A (Superfos A/S, Denmark). Preferred adjuvants for use in the present invention include aluminum salts, such as AlK(SO.sub.4).sub.2, AlNa(SO.sub.4).sub.2, and AlNH.sub.4 (SO.sub.4). Examples of materials suitable for use in vaccine compositions are provided in Remington's Pharmaceutical Sciences (Osol, A, Ed, Mack Publishing Co, Easton, Pa., pp. 1324-1341 (1980), which reference is incorporated herein by reference).

The therapeutic compositions of the present invention can be administered parenterally by injection, rapid infusion, nasopharyngeal absorption (intranasopharyngeally), dermoabsorption, or orally. The compositions may alternatively be administered intramuscularly, or intravenously. Compositions for parenteral administration include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Carriers or occlusive dressings can be used to increase skin permeability and enhance antigen absorption. Liquid dosage forms for oral administration may generally comprise a liposome solution containing the liquid dosage form. Suitable forms for suspending liposomes include emulsions, suspensions, solutions, syrups, and elixirs containing inert diluents commonly used in the art, such as purified water. Besides the inert diluents, such compositions can also include adjuvants, wetting agents, emulsifying and suspending agents, or sweetening, flavoring, or perfuming agents.

Therapeutic compositions of the present invention can also be administered in encapsulated form. For example, intranasal immunization of mice against Bordetella pertussis infection using vaccines encapsulated in biodegradable microspheres composed of poly(DL-lactide-co-glycolide) has been shown to stimulate protective immune responses. Shahin, R. et al., Infect. Immun. 63:1195-1200 (1995). Similarly, orally administered encapsulated Salmonella typhimurium antigens have also been shown to elicit protective immunity in mice. Allaoui-Attarki, K. et al., Infect. Immun. 65:853-857 (1997). Encapsulated vaccines of the present invention can be administered by a variety of routes including those involving contacting the vaccine with mucous membranes (e.g., intranasally, intracolonically, intraduodenally).

Many different techniques exist for the timing of the immunizations when a multiple administration regimen is utilized. It is possible to use the compositions of the invention more than once to increase the levels and diversities of expression of the immunoglobulin repertoire expressed by the immunized animal. Typically, if multiple immunizations are given, they will be given one to two months apart.

According to the present invention, an "effective amount" of a therapeutic composition is one which is sufficient to achieve a desired biological effect. Generally, the dosage needed to provide an effective amount of the composition will vary depending upon such factors as the animal's or human's age, condition, sex, and extent of disease, if any, and other variables which can be adjusted by one of ordinary skill in the art.

The antigenic preparations of the invention can be administered by either single or multiple dosages of an effective amount. Effective amounts of the compositions of the invention can vary from 0.01-1,000 .mu.g/ml per dose, more preferably 0.1-500 .mu.g/ml per dose, and most preferably 10-300 .mu.g/ml per dose.

Having now generally described the invention, the same will be more readily understood through reference to the following example which is provided by way of illustration, and is not intended to be limiting of the present invention, unless specified.

EXAMPLES

EXAMPLE 1

Expression and Purification of S. pneumoniae Polypeptides in E. coli

The bacterial expression vector pQE10 (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, Calif., 91311) is used in this example for cloning of the nucleotide sequences shown in Table 1 and for expressing the polypeptides identified in Table 1. The components of the pQE10 plasmid are arranged such that the inserted DNA sequence encoding a polypeptide of the present invention expresses the polypeptide with six His residues (i.e., a "6.times.His tag") covalently linked to the amino terminus.

The DNA sequences encoding the desired portions of the polypeptides of Table 1 are amplified using PCR oligonucleotide primers from either a DNA library constructed from S. pneumoniae, such as the one deposited by the inventors at the ATCC for convenience, ATCC Deposit No. 97755, or from DNA isolated from the same organism such as the S. pneumoniae strain deposited with the ATCC as Deposit No. 55840. A list of PCR primers which can be used for this purpose is provided in Table 3, below. The PCR primers anneal to the nucleotide sequences encoding both the amino terminal and carboxy terminal amino acid sequences of the desired portion of the polypeptides of Table 1. Additional nucleotides containing restriction sites to facilitate cloning in the pQE10 vector were added to the 5' and 3' primer sequences, respectively. Such restriction sites are listed in Table 3 for each primer. In each case, the primer comprises, from the 5' end, 4 random nucleotides to prevent "breathing" during the annealing process, a restriction site (shown in Table 3), and approximately 15 nucleotides of S. pneumoniae ORF sequence (the complete sequence of each cloning primer is shown as SEQ ID NO:227 through SEQ ID NO:452).
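The primer layout described above (4 random nucleotides, a restriction site, and approximately 15 nucleotides of ORF sequence) can be sketched as follows. This is an illustration only: the ORF sequence is a made-up placeholder, the BamHI (GGATCC) and HindIII (AAGCTT) recognition sites are assumed for the example rather than taken from Table 3, and the function names are hypothetical. The sketch does not check the reading frame relative to the vector-encoded His tag.

```python
import random

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def random_pad(rng, length=4):
    """Random 5' nucleotides placed ahead of the restriction site."""
    return "".join(rng.choice("ACGT") for _ in range(length))


def design_primers(orf, site_5, site_3, anneal_len=15, rng=None):
    """Build 5' and 3' cloning primers: pad + restriction site + ORF sequence."""
    rng = rng or random.Random()
    forward = random_pad(rng) + site_5 + orf[:anneal_len]
    # The 3' primer anneals to the opposite strand, so the ORF end is
    # reverse-complemented before the pad and site are prepended.
    reverse = random_pad(rng) + site_3 + reverse_complement(orf[-anneal_len:])
    return forward, reverse


# Placeholder ORF fragment (hypothetical, not a sequence from Table 1).
example_orf = "ATGAAAAAACTTGTAGGTCTGGCTGCAACTTTAGCAGCGGAT"

forward, reverse = design_primers(
    example_orf, site_5="GGATCC", site_3="AAGCTT", rng=random.Random(0)
)
print("5' primer:", forward)
print("3' primer:", reverse)
```

Moving the start of the slice taken from the ORF corresponds to varying the point at which each primer begins, and thereby the portion of the polypeptide that is amplified.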

For cloning the polypeptides of Table 1, the 5' and 3' primers were selected to amplify their respective nucleotide coding sequences. One of ordinary skill in the art would appreciate that the point in the protein coding sequence where the 5' primer begins may be varied to amplify a DNA segment encoding any desired portion of the complete amino acid sequences described in Table 1. Similarly, one of ordinary skill in the art would further appreciate that the point in the protein coding sequence where the 3' primer begins may also be varied to amplify a DNA segment encoding any desired portion of the complete amino acid sequences described in Table 1.

The amplified DNA fragment and the pQE10 vector are digested with the appropriate restriction enzyme(s) and the digested DNAs are then ligated together. The ligation mixture is transformed into competent E. coli cells using standard procedures such as those described in Sambrook et al., Molecular Cloning: a Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). Transformants are identified by their ability to grow under selective pressure on LB plates. Plasmid DNA is isolated from resistant colonies and the identity of the cloned DNA confirmed by restriction analysis, PCR and DNA sequencing.

Clones containing the desired constructs are grown overnight ("O/N") in liquid culture under selection. The O/N culture is used to inoculate a large culture, at a dilution of approximately 1:25 to 1:250. The cells are grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-beta-D-thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 mM to induce transcription from the lac repressor sensitive promoter, by inactivating the lacI repressor. Cells subsequently are incubated further for 3 to 4 hours. Cells are then harvested by centrifugation.
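The volumes implied by the inoculation and induction steps above follow from simple dilution arithmetic (C1 x V1 = C2 x V2). The sketch below assumes a 1 liter culture and a 1 M IPTG stock solution, neither of which is specified in this example, reads "1:N" as one part O/N culture in N parts final volume, and ignores the small volume added by the stock; the function names are hypothetical.

```python
def inoculum_volume_ml(culture_ml, dilution_factor):
    """Volume of O/N culture for a 1:dilution_factor inoculation."""
    return culture_ml / dilution_factor


def stock_volume_ml(culture_ml, final_mM, stock_mM):
    """Volume of stock needed to reach final_mM in culture_ml (C1*V1 = C2*V2)."""
    return culture_ml * final_mM / stock_mM


culture_ml = 1000.0  # assumed culture volume

# Range of inoculum volumes for the 1:25 to 1:250 dilutions.
high_inoculum = inoculum_volume_ml(culture_ml, 25)
low_inoculum = inoculum_volume_ml(culture_ml, 250)

# IPTG to 1 mM final concentration from an assumed 1 M (1000 mM) stock.
iptg_ml = stock_volume_ml(culture_ml, final_mM=1.0, stock_mM=1000.0)

print(f"O/N culture: {low_inoculum:.0f} to {high_inoculum:.0f} ml per {culture_ml:.0f} ml")
print(f"IPTG stock: {iptg_ml:.2f} ml")
```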

The cells are stirred for 3-4 hours at 4.degree. C. in 6 M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the supernatant containing the protein of interest is loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (available from QIAGEN, Inc., supra). Proteins with a 6.times.His tag bind to the Ni-NTA resin with high affinity and can be purified in a simple one-step procedure (for details see: The QIAexpressionist, 1995, QIAGEN, Inc., supra). Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 8, the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed with 10 volumes of 6 M guanidine-HCl, pH 6, and finally the polypeptide is eluted with 6 M guanidine-HCl, pH 5.0.

The purified protein is then renatured by dialyzing it against phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the protein can be successfully refolded while immobilized on the Ni-NTA column. The recommended conditions are as follows: renature using a linear 6 M-1 M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, containing protease inhibitors. The renaturation should be performed over a period of 1.5 hours or more. After renaturation the proteins can be eluted by the addition of 250 mM imidazole. Imidazole is removed by a final dialyzing step against PBS or 50 mM sodium acetate pH 6 buffer plus 200 mM NaCl. The purified protein is stored at 4.degree. C. or frozen at -80.degree. C.
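The linear urea gradient used for on-column refolding can be expressed as a simple interpolation between the starting and final concentrations. The sketch below assumes the minimum stated duration of 1.5 hours (90 minutes); the function name and the 15 minute reporting interval are hypothetical choices made for illustration.

```python
def urea_molar(elapsed_min, total_min=90.0, start_molar=6.0, end_molar=1.0):
    """Urea concentration (M) at a given time along a linear gradient."""
    if not 0.0 <= elapsed_min <= total_min:
        raise ValueError("elapsed_min must lie within the gradient duration")
    fraction = elapsed_min / total_min
    return start_molar + (end_molar - start_molar) * fraction


# Tabulate the gradient every 15 minutes over the assumed 90 minute run.
for minute in range(0, 91, 15):
    print(f"{minute:3d} min: {urea_molar(minute):.2f} M urea")
```

A longer renaturation period only changes `total_min`; the start and end concentrations stay at 6 M and 1 M.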

The DNA sequences encoding the amino acid sequences of Table 1 may also be cloned and expressed as fusion proteins by a protocol similar to that described directly above, wherein the pET-32b(+) vector (Novagen, 601 Science Drive, Madison, Wis. 53711) is preferentially used in place of pQE10.

Each of the polynucleotides shown in Table 1 was successfully amplified and subcloned into pQE10 as described above using the PCR primers shown in Table 3. These pQE10 plasmids containing the DNAs of Table 1, except SP023, SP042, SP054, SP063, SP081, SP092, SP114, SP122, SP123, SP126, and SP127, were deposited with the ATCC as a pooled deposit as a convenience to those of skill in the art. This pooled deposit was deposited on Oct. 16, 1997 and given ATCC Deposit No. 209369. Those of ordinary skill in the art appreciate that isolating an individual plasmid from the pooled deposit is trivial provided the information and reagents described herein. Each of the deposited clones is capable of expressing its encoded S. pneumoniae polypeptide.

EXAMPLE 2

Immunization and Detection of Immune Responses

Methods

Growth of bacterial inoculum, immunization of mice and challenge with S. pneumoniae.

Propagation and storage of, and challenge by, S. pneumoniae are performed essentially as described in Aaberge, I. S. et al., Virulence of Streptococcus pneumoniae in mice: a standardized method for preparation and frozen storage of the experimental bacterial inoculum, Microbial Pathogenesis, 18:141 (1995), incorporated herein by reference.

Briefly, Todd Hewitt (TH) broth (Difco Laboratories, Detroit, Mich.) with 17% FCS, and horse blood agar plates are used for culturing the bacteria. Both broth and blood plates are incubated at 37° C. in a 5% CO2 atmosphere. Blood plates are incubated for 18 hr. The culture broth is regularly 10-fold serially diluted in TH broth kept at room temperature and bacterial suspensions are kept at room temperature until challenge of mice.

For active immunizations, C3H/HeJ mice (The Jackson Laboratory, Bar Harbor, Me.) are injected intraperitoneally (i.p.) at week 0 with 20 µg of recombinant streptococcal protein, or phosphate-buffered saline (PBS), emulsified with complete Freund's adjuvant (CFA), given a similar booster immunization in incomplete Freund's adjuvant (IFA) at week 4, and challenged at week 6. For challenge, S. pneumoniae are diluted in TH broth from exponentially growing cultures and mice are injected subcutaneously (s.c.) at the base of the tail with 0.1 ml of these dilutions (serial dilutions are used to find the median infectious dose). Streptococci used for challenge are passaged fewer than six times in vitro. To assess infection, blood samples are obtained from the distal part of the lateral femoral vein into heparinized capillary tubes. A 25 µl blood sample is serially 10-fold diluted in TH broth, and 25 µl of diluted and undiluted blood is plated onto blood agar plates. The plates are incubated for 18 hr. and colonies are counted.
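The colony counts obtained from the plated 10-fold dilutions can be converted to a bacterial concentration in the original blood sample. The following is a minimal sketch of the conventional plate-count arithmetic, which is not spelled out in the text above; the function name and the example colony count are hypothetical, and the 25 µl plated volume is taken from the procedure described.

```python
def cfu_per_ml(colonies, dilution_exponent, plated_volume_ul=25.0):
    """Estimate colony-forming units per ml of the undiluted sample.

    colonies:           colonies counted on one plate
    dilution_exponent:  n for a plate made from the 10^-n dilution
                        (0 means undiluted blood)
    plated_volume_ul:   volume spread on the plate, in microliters
    """
    if colonies < 0 or dilution_exponent < 0 or plated_volume_ul <= 0:
        raise ValueError("invalid plate-count input")
    dilution_factor = 10 ** dilution_exponent
    plates_per_ml = 1000.0 / plated_volume_ul  # 1 ml = 1000 microliters
    return colonies * dilution_factor * plates_per_ml


if __name__ == "__main__":
    # Hypothetical count: 42 colonies on the plate from the 10^-3 dilution.
    print(cfu_per_ml(42, 3))
```

In practice only plates with a countable number of colonies would be used, and counts from replicate plates would be averaged before applying the calculation.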

Other methods are known in the art, for example, see Langermann, S. et al., J. Exp. Med., 180:2277 (1994), incorporated herein by reference.

Immunoassays

Several immunoassay formats are used to quantify levels of streptococcal-specific antibodies (ELISA and immunoblot), and to evaluate the functional properties of these antibodies (growth inhibition assay). The ELISA and immunoblot assays are also used to detect and quantify antibodies elicited in response to streptococcal infection that react with specific streptococcal antigens. Where antibodies to certain streptococcal antigens are elicited by infection this is taken as evidence that the streptococcal proteins in question are expressed in vivo. Absence of infection-derived antibodies (that is, absence of seroconversion) following streptococcal challenge is evidence that infection is prevented or suppressed. The immunoblot assay is also used to ascertain whether antibodies raised against recombinant streptococcal antigens recognize a protein of similar size in extracts of whole streptococci. Where the natural protein is of similar, or identical, size in the immunoblot assay to the recombinant version of the same protein, this is taken as evidence that the recombinant protein is the product of a full-length clone of the respective gene.

Enzyme-Linked Immunosorbent Assay (ELISA).

The ELISA is used to quantify levels of antibodies reactive with streptococcus antigens elicited in response to immunization with these streptococcal antigens. Wells of 96-well microtiter plates (Immulon 4, Dynatech, Chantilly, Va., or equivalent) are coated with antigen by incubating 50 µl of 1 µg/ml protein antigen solution in a suitable buffer, typically 0.1 M sodium carbonate buffer at pH 9.6. After decanting unbound antigen, additional binding sites are blocked by incubating 100 µl of 3% nonfat milk in wash buffer (PBS, 0.2% Tween 20, pH 7.4). After washing, duplicate serial two-fold dilutions of sera in PBS, Tween 20, 1% fetal bovine serum, are incubated for 1 hr, removed, wells are washed three times, and incubated with horseradish peroxidase-conjugated goat anti-mouse IgG. After three washes, bound antibodies are detected with H2O2 and 2,2'-azino-di-(3-ethylbenzthiazoline sulfonate) (Schwan, T. G., et al., Proc. Natl. Acad. Sci. USA 92:2909-2913 (1995)) (ABTS®, Kirkegaard & Perry Labs., Gaithersburg, Md.) and A405 is quantified with a Molecular Devices Corp. (Menlo Park, Calif.) Vmax™ plate reader. IgG levels twice the background level in serum from naive mice are assigned the minimum titer of 1:100.
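One way to turn the absorbance readings from a two-fold dilution series into a titer is sketched below. This is an illustrative assumption rather than the procedure actually used: it takes the endpoint titer to be the reciprocal of the highest consecutive dilution whose A405 is at least twice the naive-serum background, starting from the 1:100 minimum titer stated above. The function name and the example readings are hypothetical.

```python
def endpoint_titer(absorbances, background, start_dilution=100, fold=2):
    """Return the reciprocal endpoint titer, or None if no well is positive.

    absorbances:    A405 readings ordered from the lowest dilution
                    (1:start_dilution) to the highest
    background:     A405 of serum from naive mice
    A well counts as positive when its reading is at least twice background.
    """
    cutoff = 2 * background
    titer = None
    dilution = start_dilution
    for reading in absorbances:
        if reading < cutoff:
            break  # stop at the first dilution that falls below the cutoff
        titer = dilution
        dilution *= fold
    return titer


if __name__ == "__main__":
    # Hypothetical readings for dilutions 1:100, 1:200, 1:400, 1:800, 1:1600.
    readings = [1.8, 1.2, 0.7, 0.3, 0.12]
    print(endpoint_titer(readings, background=0.1))
```

Because duplicate dilutions are run for each serum, the readings passed to such a function would typically be the mean of the two duplicate wells.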

Sodium Dodecyl Sulfate-Polyacrylamide Gel Electrophoresis (SDS-PAGE) and Immunoblotting

Using a single-well format, total streptococcal protein extracts or recombinant streptococcal antigen are boiled in SDS/2-ME sample buffer before electrophoresis through 3% acrylamide stacking gels, and resolving gels of higher acrylamide concentration, typically 10-15% acrylamide monomer. Gels are electro-blotted to nitrocellulose membranes and lanes are probed with dilutions of antibody to be tested for reactivity with specific streptococcal antigens, followed by the appropriate secondary antibody-enzyme (horseradish peroxidase) conjugate. When it is desirable to confirm that the protein has transferred following electro-blotting, membranes are stained with Ponceau S. Immunoblot signals from bound antibodies are detected on x-ray film as chemiluminescence using ECL™ reagents (Amersham Corp., Arlington Heights, Ill.).

EXAMPLE 3

Detection of Streptococcus mRNA expression

Northern blot analysis is carried out using methods described by, among others, Sambrook et al., supra, to detect the expression of the S. pneumoniae nucleotide sequences of the present invention in animal tissues. A cDNA probe containing an entire nucleotide sequence shown in Table 1 is labeled with 32P using the rediprime™ DNA labeling system (Amersham Life Science), according to manufacturer's instructions. After labeling, the probe is purified using a CHROMA SPIN-100™ column (Clontech Laboratories, Inc.), according to manufacturer's protocol number PT1200-1. The purified labeled probe is then used to detect the expression of Streptococcus mRNA in an animal tissue sample.

Animal tissues, such as blood or spinal fluid, are examined with the labeled probe using ExpressHyb™ hybridization solution (Clontech) according to manufacturer's protocol number PT1190-1. Following hybridization and washing, the blots are mounted and exposed to film at -70° C. overnight, and films developed according to standard procedures.

It will be clear that the invention may be practiced otherwise than as particularly described in the foregoing description and examples.

Numerous modifications and variations of the present invention are possible in light of the above teachings and, therefore, are within the scope of the appended claims.

The entire disclosures of all publications (including patents, patent applications, journal articles, laboratory manuals, books, or other documents) cited herein are hereby incorporated by reference.
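Each entry of Table 1 pairs a nucleotide sequence with the amino acid sequence it encodes, so the two can be cross-checked by translating the nucleotide sequence with the standard genetic code. The following is a minimal sketch of such a check; the function name and the offset parameter are hypothetical and introduced only for illustration. The offset is needed because some listed nucleotide sequences appear to begin one base before the first codon of the listed amino acid sequence (for example SP006), while others begin in frame (for example SP008). Codons containing an undetermined base (N) are reported as X.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna, offset=0):
    """Translate dna from position offset; unknown codons become 'X'."""
    dna = dna.upper()
    residues = []
    for i in range(offset, len(dna) - 2, 3):
        residues.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(residues)


if __name__ == "__main__":
    # First 31 bases of the SP006 nucleotide sequence (SEQ ID NO:5).
    sp006_start = "TGAGAATCAAGCTACACCCAAAGAGACTAGC"
    print(translate(sp006_start, offset=1))
```

Comparing the output with the corresponding listed amino acid sequence (SEQ ID NO:6 begins ENQATPKETS) is a quick way to locate positions where the two listings disagree.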

                                      TABLE 1
    __________________________________________________________________________
    SP001  nucleotide  (SEQ  ID  NO:1)
    TAAAATCTACGACAATAAAAATCAACTCATTGCTGACTTGGGTTCTGAACGCCGCGTCAATGCCCAAGC
    TAATGATATTCCCACAGATTTGGTTAAGGCAATCGTTTCTATCGAAGACCATCGCTTCTTCGACCACAG
    GGGGATTGATACCATCCGTATCCTGGGAGCTTTCTTGCGCAATCTGCAAAGCAATTCCCTCCAAGGTGG
    ATCAACTCTCACCCAACAGTTGATTAAGTTGACTTACTTTTCAACTTCGACTTCCGACCAGACTATTTC
    TCGTAAGGCTCAGGAAGCTTGGTTAGCGATTCAGTTAGAACAAAAAGCAACCAAGCAAGAAATCTTGAC
    CTACTATATAAATAAGGTCTACATGTCTAATGGGAACTATGGAATGCAGACAGCAGCTCAAAACTACTA
    TGGTAAAGACCTCAATAATTTAAGTTTACCTCAGTTAGCCTTGCTGGCTGGAATGCCTCAGGCACCAAA
    CCAATATGACCCCTATTCACATCCAGAAGCAGCCCAAGACCGCCGAAACTTGGTCTTATCTGAAATGAA
    AAATCAAGGCTACATCTCTGCTGAACAGTATGAGAAAGCAGTCAATACACCAATTACTGATGGACTACA
    AAGTCTCAAATCAGCAAGTAATTACCCTGCTTACATGGATAATTACCTCAAGGAAGTCATCAATCAAGT
    TGAAGAAGAAACAGGCTATAACCTACTCACAACTGGGATGGATGTCTACACAAATGTAGACCAAGAAGC
    TCAAAAACATCTGTGGGATATTTACAATACAGACGAATACGTTGCCTATCCAGACGATGAATTGCAAGT
    CGCTTCTACCATTGTTGATGTTTCTAACGGTAAAGTCATTGCCCAGCTAGGAGCACGCCATCAGTCAAG
    TAATGTTTCCTTCGGAATTAACCAAGCAGTAGAAACAAACCGCGACTGGGGATCAACTATGAAACCGAT
    CACAGACTATGCTCCTGCCTTGGAGTACGGTGTCTACGATTCAACTGCTACTATCGTTCACGATGAGCC
    CTATAACTACCCTGGGACAAATACTCCTGTTTATAACTGGGATAGGGGCTACTTTGGCAACATCACCTT
    GCAATACGCCCTGCAACAATCGCGAAACGTCCCAGCCGTGGAAACTCTAAACAAGGTCGGACTCAACCG
    CGCCAAGACTTTCCTAAATGGTCTAGGAATCGACTACCCAAGTATTCACTACTCAAATGCCATTTCAAG
    TAACACAACCGAATCAGACAAAAAATATGGAGCAAGTAGTGAAAAGATGGCTGCTGCTTACGCTGCCTT
    TGCAAATGGTGGAACTTACTATAAACCAATGTATATCCATAAAGTCGTCTTTAGTGATGGGAGTGAAAA
    AGAGTTCTCTAATGTCGGAACTCGTGCCATGAAGGAAACGACAGCCTATATGATGACCGACATGATGAA
    AACAGTCTTGACTTATGGAACTGGACGAAATGCCTATCTTGCTTGGCTCCCTCAGGCTGGTAAAACAGG
    AACCTCTAACTATACAGACGAGGAAATTGAAAACCACATCAAGACCTCTCAATTTGTAGCACCTGATGA
    ACTATTTGCTGGCTATACGCGTAAATATTCAATGGCTGTATGGACAGGCTATTCTAACCGTCTGACACC
    ACTTGTAGGCAATGGCCTTACGGTCGCTGCCAAAGTTTACCGCTCTATGATGACCTACCTGTCTGAAGG
    AAGCAATCCAGAAGATTGGAATATACCAGAGGGGCTCTACAGAAATGGAGAATTCGTATTTAAAAATGG
    TGCTCGTTCTACGTGGAACTCACCTGCTCCACAACAACCCCCATCAACTGAAAGTTCAAGCTCATCATC
    AGATAGTTCAACTTCACAGTCTAGCTCAACCACTCCAAGCACAAATAATAGTACGACTACCAATCCTAA
    CAATAATACGCAACAATCAAATACAACCCCTGATCAACAAAATCAGAATCCTCAACCAGCACAACCA
    SP001  amino  acid  (SEQ  ID  NO:2)
    KIYDNKNQLIADLGSERRVNAQANDIPTDLVKAIVSIEDHRFFDHRGIDTIRILGAFLRNLQSNSLQGG
    STLTQQLIKLTYFSTSTSDQTISRKAQEAWLAIQLEQKATKQEILTYYINKVYMSNGNYGMQTAAQNYY
    GKDLNNLSLPQLALLAGMPQAPNQYDPYSHPEAAQDRRNLVLSEMKNQGYISAEQYEKAVNTPITDGLQ
    SLKSASNYPAYMDNYLKEVINQVEEETGYNLLTTGMDVYTNVDQEAQKHLWDIYNTDEYVAYPDDELQV
    ASTIVDVSNGKVIAQLGARHQSSNVSFGINQAVETNRDWGSTMKPITDYAPALEYGVYDSTATIVHDEP
    YNYPGTNTPVYNWDRGYFGNITLQYALQQSRNVPAVETLNKVGLNRAKTFLNGLGIDYPSIHYSNAISS
    NTTESDKKYGASSEKMAAAYAAFANGGTYYKPMYIHKVVFSDGSEKEFSNVGTRAMKETTAYMMTDMMK
    TVLTYGTGRNAYLAWLPQAGKTGTSNYTDEEIENHIKTSQFVAPDELFAGYTRKYSMAVWTGYSNRLTP
    LVGNGLTVAAKVYRSMMTYLSEGSNPEDWNIPEGLYRNGEFVFKNGARSTWNSPAPQQPPSTESSSSSS
    DSSTSQSSSTTPSTNNSTTTNPNNNTQQSNTTPDQQNQNPQPAQP
    SP004  nucleotide  (SEQ  ID  NO:3)
    AAATTACAATACGGACTATGAATTGACCTCTGGAGAAAAATTACCTCTTCCTAAAGAGATTTCAGGTTA
    CACTTATATTGGATATATCAAAGAGGGAAAAACGACTTCTGAGTCTGAAGTAAGTAATCAAAAGAGTTC
    AGTTGCCACTCCTACAAAACAACAAAAGGTGGATTATAATGTTACACCGAATTTTGTAGACCATCCATC
    AACAGTACAAGCTATTCAGGAACAAACACCTGTTTCTTCAACTAAGCCGACAGAAGTTCAAGTAGTTGA
    AAAACCTTTCTCTACTGAATTAATCAATCCAAGAAAAGAAGAGAAACAATCTTCAGATTCTCAAGAACA
    ATTAGCCGAACATAAGAATCTAGAAACGAAGAAAGAGGAGAAGATTTCTCCAAAAGAAAAGACTGGGGT
    AAATACATTAAATCCACAGGATGAAGTTTTATCAGGTCAATTGAACAAACCTGAACTCTTATATCGTGA
    GGAAACTATGGAGACAAAAATAGATTTTCAAGAAGAAATTCAAGAAAATCCTGATTTAGCTGAAGGAAC
    TGTAAGAGTAAAACAAGAAGGTAAATTAGGTAAGAAAGTTGAAATCGTCAGAATATTCTCTGTAAACAA
    GGAAGAAGTTTCGCGAGAAATTGTTTCAACTTCAACGACTGCGCCTAGTCCAAGAATAGTCGAAAAAGG
    TACTAAAAAAACTCAAGTTATAAAGGAACAACCTGAGACTGGTGTAGAACATAAGGACGTACAGTCTGG
    AGCTATTGTTGAACCCGCAATTCAGCCTGAGTTGCCCGAAGCTGTAGTAAGTGACAAAGGCGAACCAGA
    AGTTCAACCTACATTACCCGAAGCAGTTGTGACCGACAAAGGTGAGACTGAGGTTCAACCAGAGTCGCC
    AGATACTGTGGTAAGTGATAAAGGTGAACCAGAGCAGGTAGCACCGCTTCCAGAATATAAGGGTAATAT
    TGAGCAAGTAAAACCTGAAACTCCGGTTGAGAAGACCAAAGAACAAGGTCCAGAAAAAACTGAAGAAGT
    TCCAGTAAAACCAACAGAAGAAACACCAGTAAATCCAAATGAAGGTACTACAGAAGGAACCTCAATTCA
    AGAAGCAGAAAATCCAGTTCAACCTGCAGAAGAATCAACAACGAATTCAGAGAAAGTATCACCAGATAC
    ATCTAGCAAAAATACTGGGGAAGTGTCCAGTAATCCTAGTGATTCGACAACCTCAGTTGGAGAATCAAA
    TAAACCAGAACATAATGACTCTAAAAATGAAAATTCAGAAAAAACTGTAGAAGAAGTTCCAGTAAATCC
    AAATGAAGGCACAGTAGAAGGTACCTCAAATCAAGAAACAGAAAAACCAGTTCAACCTGCAGAAGAAAC
    ACAAACAAACTCTGGGAAAATAGCTAACGAAAATACTGGAGAAGTATCCAATAAACCTAGTGATTCAAA
    ACCACCAGTTGAAGAATCAAATCAACCAGAAAAAAACGGAACTGCAACAAAACCAGAAAATTCAGGTAA
    TACAACATCAGAGAATGGACAAACAGAACCAGAACCATCAAACGGAAATTCAACTGAGGATGTTTCAAC
    CGAATCAAACACATCCAATTCAAATGGAAACGAAGAAATTAAACAAGAAAATGAACTAGACCCTGATAA
    AAAGGTAGAAGAACCAGAGAAAACACTTGAATTAAGAAATGTTTCCGACCTAGAGTTA
    SP004  amino  acid  (SEQ  ID  NO:4)
    NYNTDYELTSGEKLPLPKEISGYTYIGYIKEGKTTSESEVSNQKSSVATPTKQQKVDYNVTPNFVDHPS
    TVQAIQEQTPVSSTKPTEVQVVEKPFSTELINPRKEEKQSSDSQEQLAEHKNLETKKEEKISPKEKTGV
    NTLNPQDEVLSGQLNKPELLYREETMETKIDFQEEIQENPDLAEGTVRVKQEGKLGKKVEIVRIFSVNK
    EEVSREIVSTSTTAPSPRIVEKGTKKTQVIKEQPETGVEHKDVQSGAIVEPAIQPELPEAVVSDKGEPE
    VQPTLPEAVVTDKGETEVQPESPDTVVSDKGEPEQVAPLPEYKGNIEQVKPETPVEKTKEQGPEKTEEV
    PVKPTEETPVNPNEGTTEGTSIQEAENPVQPAEESTTNSEKVSPDTSSKNTGEVSSNPSDSTTSVGESN
    KPEHNDSKNENSEKTVEEVPVNPNEGTVEGTSNQETEKPVQPAEETQTNSGKIANENTGEVSNKPSDSK
    PPVEESNQPEKNGTATKPENSGNTTSENGQTEPEPSNGNSTEDVSTESNTSNSNGNEEIKQENELDPDK
    KVEEPEKTLELRNVSDLEL
    SP006  nucleotide  (SEQ  ID  NO:5)
    TGAGAATCAAGCTACACCCAAAGAGACTAGCGCTCAAAAGACAATCGTCCTTGCTACAGCTGGCGACGT
    GCCACCATTTGACTACGAAGACAAGGGCAATCTGACAGGCTTTGATATCGAAGTTTTAAAGGCAGTAGA
    TGAAAAACTCAGCGACTACGAGATTCAATTCCAAAGAACCGCCTGGGAGAGCATCTTCCCAGGACTTGA
    TTCTGGTCACTATCAGGCTGCGGCCAATAACTTGAGTTACACAAAAGAGCGTGCTGAAAAATACCTTTA
    CTCGCTTCCAATTTCCAACAATCCCCTCGTCCTTGTCAGCAACAAGAAAAATCCTTTGACTTCTCTTGA
    CCAGATCGCTGGTAAAACAACACAAGAGGATACCGGAACTTCTAACGCTCAATTCATCAATAACTGGAA
    TCAGAAACACACTGATAATCCCGCTACAATTAATTTTTCTGGTGAGGATATTGGTAAACGAATCCTAGA
    CCTTGCTAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCAAAAGATTATCAAGGACCG
    TGGTTTAGACCTCTCAGTCGTTGATTTACCTTCTGCAGATAGCCCCAGCAATTATATCATTTTCTCAAG
    CGACCAAAAAGAGTTTAAAGAGCAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCTTGA
    AAAACTCAGCAATACCTATCTAGGTGGTTCTTACCTCCCAGATCAATCTCAGTTACAA
    SP006  amino  acid  (SEQ  ID  NO:6)
    ENQATPKETSAQKTIVLATAGDVPPFDYEDKGNLTGFDIEVLKAVDEKLSDYEIQFQRTAWESIFPGLD
    SGHYQAAANNLSYTKERAEKYLYSLPISNNPLVLVSNKKNPLTSLDQIAGKTTQEDTGTSNAQFINNWN
    QKHTDNPATINFSGEDIGKRILDLANGEFDFLVFDKVSVQKIIKDRGLDLSVVDLPSADSPSNYIIFSS
    DQKEFKEQFDKALKELYQDGTLEKLSNTYLGGSYLPDQSQLQ
    SP007  nucleotide  (SEQ  ID  NO:7)
    TGGTAACCGCTCTTCTCGTAACGCAGCTTCATCTTCTGATGTGAAGACAAAAGCAGCAATCGTCACTGA
    TACTGGTGGTGTTGATGACAAATCATTCAACCAATCAGCTTGGGAAGGTTTGCAGGCTTGGGGTAAAGA
    ACACAATCTTTCAAAAGATAACGGTTTCACTTACTTCCAATCAACAAGTGAAGCTGACTACGCTAACAA
    CTTGCAACAAGCGGCTGGAAGTTACAACCTAATCTTCGGTGTTGGTTTTGCCCTTAATAATGCAGTTAA
    AGATGCAGCAAAAGAACACACTGACTTGAACTATGTCTTGATTGATGATGTGATTAAAGACCAAAAGAA
    TGTTGCGAGCGTAACTTTCGCTGATAATGAGTCAGGTTACCTTGCAGGTGTGGCTGCAGCAAAAACAAC
    TAAGACAAAACAAGTTGGTTTTGTAGGTGGTATCGAATCTGAAGTTATCTCTCGTTTTGAAGCAGGATT
    CAAGGCTGGTGTTGCGTCAGTAGACCCATCTATCAAAGTCCAAGTTGACTACGCTGGTTCATTTGGTGA
    TGCGGCTAAAGGTAAAACAATTGCAGCCGCACAATACGCAGCCGGTGCAGATATTGTTTACCAAGTAGC
    TGGTGGTACAGGTGCAGGTGTCTTTGCAGAGGCAAAATCTCTCAACGAAAGCCGTCCTGAAAATGAAAA
    AGTTTGGGTTATCGGTGTTGATCGTGACCAAGAAGCAGAAGGTAAATACACTTCTAAAGATGGCAAAGA
    ATCAAACTTTGTTCTTGTATCTACTTTGAAACAAGTTGGTACAACTGTAAAAGATATTTCTAACAAGGC
    AGAAAGAGGAGAATTCCCTGGCGGTCAAGTGATCGTTTACTCATTGAAGGATAAAGGGGTTGACTTGGC
    AGTAACAAACCTTTCAGAAGAAGGTAAAAAAGCTGTCGAAGATGCAAAAGCTAAAATCCTTGATGGAAG
    CGTAAAAGTTCCTGAAAAA
    SP007  amino  acid  (SEQ  ID  NO:8)
    GNRSSRNAASSSDVKTKAAIVTDTGGVDDKSFNQSAWEGLQAWGKEHNLSKDNGFTYFQSTSEADYANN
    LQQAAGSYNLIFGVGFALNNAVKDAAKEHTDLNYVLIDDVIKDQKNVASVTFADNESGYLAGVAAAKTT
    KTKQVGFVGGIESEVISRFEAGFKAGVASVDPSIKVQVDYAGSFGDAAKGKTIAAAQYAAGADIVYQVA
    GGTGAGVFAEAKSLNESRPENEKVWVIGVDRDQEAEGKYTSKDGKESNFVLVSTLKQVGTTVKDISNKA
    ERGEFPGGQVIVYSLKDKGVDLAVTNLSEEGKKAVEDAKAKILDGSVKVPEK
    SP008  nucleotide  (SEQ  ID  NO:9)
    TGTGGAAATTTGACAGGTAACAGCAAAAAAGCTGCTGATTCAGGTGACAAACCTCTTATCAAAATGTAC
    CAAATCGGTGACAAACCAGACAACTTGGATGAATTGTTAGCAAATGCCAACAAAATCATTGAAGAAAAA
    GTTGGTGCCAAATTGGATATCCAATACCTTGGCTGGGGTGACTATGGTAAGAAAATGTCAGTTATCACA
    TCATCTGGTGAAAACTATGATATTGCCTTTGCAGATAACTATATTGTAAATGCTCAAAAAGGTGCTTAC
    GCTGACTTGACAGAATTGTACAAAAAAGAAGGTAAAGACCTTTACAAAGCACTTGACCCAGCTTACATC
    AAGGGTAATACTGTAAATGGTAAGATTTACGCTGTTCCAGTTGCAGCCAACGTTGCATCATCTCAAAAC
    TTTGCCTTCAACGGAACTCTCCTTGCTAAATATGGTATCGATATTTCAGGTGTTACTTCTTACGAAACT
    CTTGAGCCAGTCTTGAAACAAATCAAAGAAAAAGCTCCAGACGTAGTACCATTTGCTATTGGTAAAGTT
    TTCATCCCATCTCATAATTTTGACTACCCAGTAGCAAACGGTCTTCCATTCGTTATCGACCTTGAAGGC
    GATACTACTAAAGTTGTAAACCGTTACGAAGTGCCTCGTTTCAAAGAACACTTGAAGACTCTTCACAAA
    TTCTATGAAGCTGGCTACATTCCAAAAGACGTCGCAACAAGCGATACTTCCTTTGACCTTCAACAAGAT
    ACTTGGTTCGTTCGTGAAGAAACAGTAGGACCAGCTGACTACGGTAACAGCTTGCTTTCACGTGTTGCC
    AACAAAGATATCCAAATCAAACCAATTACTAACTTCATCAAGNAAAACCAAACAACACAAGTTGCTAAC
    TTTGTCATCTCAAACAACTCTAAGAACAAAGAAAAATCAATGGAAATCTTGAACCTCTTGAATACGAAC
    CCAGAACTCTTGAACGGTCTTGTTTACGGTCCAGAAGGCAAGAACTGGGAAAAAATTGAAGGTAAAGAA
    AACCGTGTTCGCGTTCTTGATGGCTACAAAGGAAACACTCACATGGGTGGATGGAACACTGGTAACAAC
    TGGATCCTTTACATCAACGAAAACGTTACAGACCAACAAATCGAAAATTCTAAGAAAGAATTGGCAGAA
    GCTAAAGAATCTCCAGCGCTTGGATTTATCTTCAATACTGACAATGTGAAATCTGAAATCTCAGCTATT
    GCTAACACAATGCAACAATTTGATACAGCTATCAACACTGGTACTGTAGACCCAGATAAAGCGATTCCA
    GAATTGATGGAAAAATTGAAATCTGAAGGTGCCTACGAAAAAGTATTGAACGAAATGCAAAAACAATAC
    GATGAATTCTTGAAAAACAAAAAA
    SP008  amino  acid  (SEQ  ID  NO:10)
    CGNLTGNSKKAADSGDKPVIKMYQIGDKPDNLDELLANANKIIEEKVGAKLDIQYLGWGDYGKKMSVIT
    SSGENYDIAFADNYIVNAQKGAYADLTELYKKEGKDLYKALDPAYIKGNTVNGKIYAVPVAANVASSQN
    FAFNGTLLAKYGIDISGVTSYETLEPVLKQIKEKAPDVVPFAIGKVFIPSDNFDYPVANGLPFVIDLEG
    DTTKVVNRYEVPRFKEHLKTLHKFYEAGYIPKDVATSDTSFDLQQDTWFVREETVGPADYGNSLLSRVA
    NKDIQIKPITNFIKXNQTTQVANFVISNNSKNKEKSMEILNLLNTNPELLNGLVYGPEGKNWEKIEGKE
    NRVRVLDGYKGNTHMGGWNTGNNWILYINENVTDQQIENSKKELAEAKESPALGFIFNTDNVKSEISAI
    ANTMQQFDTAINTGTVDPDKAIPELMEKLKSEGAYEKVLNEMQKQYDEFLKNKK
    SP009  nucleotide  (SEQ  ID  NO:11)
    TGGTCAAGGAACTGCTTCTAAAGACAACAAAGAGGCAGAACTTAAGAAGGTTGACTTTATCCTAGACTG
    GACACCAAATACCAACCACACAGGGCTTTATGTTGCCAAGGAAAAAGGTTATTTCAAAGAAGCTGGAGT
    GGATGTTGATTTGAAATTGCCACCAGAAGAAAGTTCTTCTGACTTGGTTATCAACGGAAAGGCACCATT
    TGCAGTCTATTTCCAAGACTACATGGCTAAGAAATTGGAAAAAGGAGCAGGAATCACTGCCGTTGCAGC
    TATTGTTGAACACAATACATCAGGAATCATCTCTCGTAAATCTGATAATGTAAGCAGTCCAAAAGACTT
    GGTTGGTAAGAAATATGGGACATGGAATGACCCAACTGAACTTGCTATGTTGAAAACCTTGGTAGAATC
    TCAAGGTGGAGACTTTGAGAAGGTTGAAAAAGTACCAAATAACGACTCAAACTCAATCACACCGATTGC
    CAATGGCGTCTTTGATACTGCTTGGATTTACTACGGTTGGGATGGTATCCTTGCTAAATCTCAAGGTGT
    AGATGCTAACTTCATGTACTTGAAAGACTATGTCAAGGAGTTTGACTACTATTCACCAGTTATCATCGC
    AAACAACGACTATCTGAAAGATAACAAAGAAGAAGCTCGCAAAGTCATCCAAGCCATCAAAAAAGGCTA
    CCAATATGCCATGGAACATCCAGAAGAAGCTGCAGATATTCTCATCAAGAATGCACCTGAACTCAAGGA
    AAAACGTGACTTTGTCATCGAATCTCAAAAATACTTGTCAAAAGAATACGCAAGCGACAAGGAAAAATG
    GGGTCAATTTGACGCAGCTCGCTGGAATGCTTTCTACAAATGGGATAAAGAAAATGGTATCCTTAAAGA
    AGACTTGACAGACAAAGGCTTCACCAACGAATTTGTGAAA
    SP009  amino  acid  (SEQ  ID  NO:12)
    GQGTASKDNKEAELKKVDFILDWTPNTNHTGLYVAKEKGYFKEAGVDVDLKLPPEESSSDLVINGKAPF
    AVYFQDYMAKKLEKGAGITAVAAIVEHNTSGIISRKSDNVSSPKDLVGKKYGTWNDPTELAMLKTLVES
    QGGDFEKVEKVPNNDSNSITPIANGVFDTAWIYYGWDGILAKSQGVDANFMYLKDYVKEFDYYSPVIIA
    NNDYLKDNKEEARKVIQAIKKGYQYAMEHPEEAADILIKNAPELKEKRDFVIESQKYLSKEYASDKEKW
    GQFDAARWNAFYKWDKENGILKEDLTDKGFTNEFVK
    SP010  nucleotide  (SEQ  ID  NO:13)
    TAGCTCAGGTGGAAACGCTGGTTCATCCTCTGGAAAAACAACTGCCAAAGCTCGCACTATCGATGAAAT
    CAAAAAAAGCGGTGAACTGCGAATCGCCGTGTTTGGAGATAAAAAACCGTTTGGCTACGTTGACAATGA
    TGGTTCTACCAAGGTACGCTACGATATTGAACTAGGGAACCAACTAGCTCAAGACCTTGGTGTCAAGGT
    TAAATACATTTCAGTCGATGCTGCCAACCGTGCGGAATACTTGATTTCAAACAAGGTAGATATTACTCT
    TGCTAACTTTACAGTAACTGACGAACGTAAGAAACAAGTTGATTTTGCCCTTCCATATATGAAAGTTTC
    TCTGGGTGTCGTATCACCTAAGACTGGTCTCATTACAGACGTCAAACAACTTGAAGGTAAAACCTTAAT
    TGTCACAAAAGGAACGACTGCTGAGACTTATTTTGAAAAGAATCATCCAGAAATCAAACTCCAAAAATA
    CGACCAATACAGTGACTCTTACCAAGCTCTTCTTGACGGACGTGGAGATGCCTTTTCAACTGACAATAC
    GGAAGTTCTAGCTTGGGCGCTTGAAAATAAAGGATTTGAAGTAGGAATTACTTCCCTCGGTGATCCCGA
    TACCATTGCGGCAGCAGTTCAAAAAGGCAACCAAGAATTGCTAGACTTCATCAATAAAGATATTGAAAA
    ATTAGGCAAGGAAAACTTCTTCCACAAGGCCTATGAAAAGACACTTCACCCAACCTACGGTGACGCTGC
    TAAAGCAGATGACCTGGTTGTTGAAGGTGGAAAAGTTGAT
    SP010  amino  acid  (SEQ  ID  NO:14)
    SSGGNAGSSSGKTTAKARTIDEIKKSGELRIAVFGDKKPFGYVDNDGSTKVRYDIELGNQLAQDLGVKV
    KYISVDAANRAEYLISNKVDITLANFTVTDERKKQVDFALPYMKVSLGVVSPKTGLITDVKQLEGKTLI
    VTKGTTAETYFEKNHPEIKLQKYDQYSDSYQALLDGRGDAFSTDNTEVLAWALENKGFEVGITSLGDPD
    TIAAAVQKGNQELLDFINKDIEKLGKENFFHKAYEKTLHPTYGDAAKADDLVVEGGKVD
    SP011  nucleotide  (SEQ  ID  NO:15)
    CTCCAACTATGGTAAATCTGCGGATGGCACAGTGACCATCGAGTATTTCAACCAGAAAAAAGAAATGAC
    CAAAACCTTGGAAGAAATCACTCGTGATTTTGAGAAGGAAAACCCTAAGATCAAGGTCAAAGTCGTCAA
    TGTACCAAATGCTGGTGAAGTATTGAAGACACGCGTTCTCGCAGGAGATGTGCCTGATGTGGTCAATAT
    TTACCCACAGTCCATCGAACTGCAAGAATGGGCAAAAGCAGGTGTTTTTGAAGATTTGAGCAACAAAGA
    CTACCTGAAACGCGTGAAAAATGGCTACGCTGAAAAATATGCTGTAAACGAAAAAGTTTACAACGTTCC
    TTTTACAGCTAATGCTTATGGAATTTACTACAACAAAGATAAATTCGAAGAACTGGGCTTGAAGGTTCC
    TGAAACCTGGGATGAATTTGAACAGTTAGTCAAAGATATCGTTGCTAAAGGACAAACACCATTTGGAAT
    TGCAGGTGCAGATGCTTGGACACTCAATGGTTACAATCAATTAGCCTTTGCGACAGCAACAGGTGGAGG
    AAAAGAAGCAAATCAATACCTTCGTTATTCTCAACCAAATGCCATTAAATTGTCGGATCCGATTATGAA
    AGATGATATCAAGGTCATGGACATCCTTCGCATCAATGGATCTAAGCAAAAGAACTGGGAAGGTGCTGG
    CTATACCGATGTTATCGGAGCCTTCGCACGTGGGGATGTCCTCATGACACCAAATGGGTCTTGGGCGAT
    CACAGCGATTAATGAACAAAAACCGAACTTTAAGATTGGGACCTTCATGATTCCAGGAAAAGAAAAAGG
    ACAAAGCTTAACCGTTGGTGCGGGAGACTTGGCATGGTCTATCTCAGCCACCACCAAACATCCAAAAGA
    AGCCAATGCCTTTGTGGAATATATGACCCGTCCAGAAGTCATGCAAAAATACTACGATGTGGACGGATC
    TCCAACAGCGATCGAAGGGGTCAAACAAGCAGGAGAAGATTCACCGCTTGCTGGTATGACCGAATATGC
    CTTTACGGATCGTCACTTGGTCTGGTTGCAACAATACTGGACCAGTGAAGCAGACTTCCATACCTTGAC
    CATGAACTATGTCTTGACCGGTGATAAACAAGGCATGGTCAATGATTTGAATGCCTTCTTTAACCCGAT
    GAAAGCGGATGTGGAT
    SP011  amino  acid  (SEQ  ID  NO:16)
    SNYGKSADGTVTIEYFNQKKEMTKTLEEITRDFEKENPKIKVKVVNVPNAGEVLKTRVLAGDVPDVVNI
    YPQSIELQEWAKAGVFEDLSNKDYLKRVKNGYAEKYAVNEKVYVVPFTANAYGIYYNKDKFEELGLKVP
    ETWDEFEQLVKDIVAKGQTPFGIAGADAWTLNGYNQLAFATATGGGKEANQYLRYSQPNAIKLSDPIMK
    DDIKVMDILRINGSKQKNWEGAGYTDVIGAFARGDVLMTPNGSWAITAINEQKPNFKIGTFMIPGKEKG
    QSLTVGAGDLAWSISATTKHPKEANAFVEYMTRPEVMQKYYDVDGSPTAIEGVKQAGEDSPLAGMTEYA
    FTDRHLVWLQQYWTSEADFHTLTMYYVLTGDKQGMVNDLNAFFNPMKADVD
    SP012  nucleotide  (SEQ  ID  NO:17)
    TGGGAAAAATTCTAGCGAAACTAGTGGAGATAATTGGTCAAAGTACCAGTCTAACAAGTCTATTACTAT
    TGGATTTGATAGTACTTTTGTTCCAATGGGATTTGCTCAGAAAGATGGTTCTTATGCAGGATTTGATAT
    TGATTTAGCTACAGCTGTTTTTGAAAAATACGGAATCACGGTAAATTGGCAACCGATTGATTGGGATTT
    GAAAGAAGCTGAATTGACAAAAGGAACGATTGATCTGATTTGGAATGGCTATTCCGCTACAGACGAACG
    CCGTGAAAAGGTGGCTTTCAGTAACTCATATATGAAGAATGAGCAGGTATTGGTTACGAAGAAATCATC
    TGGTATCACGACTGCAAAGGATATGACTGGAAAGACATTAGGAGCTCAAGCTGGTTCATCTGGTTATGC
    GGACTTTGAAGCAAATCCAGAAATTTTGAAGAATATTGTCGCTAATAAGGAAGCGAATCAATACCAAAC
    CTTTAATGAAGCCTTGATTGATTTGAAAAACGATCGAATTGATGGTCTATTGATTGACCGTGTCTATGC
    AAACTATTATTTAGAAGCAGAAGGTGTTTTAAACGATTATAATGTCTTTACAGTTGGACTAGAAACAGA
    AGCTTTTGCGGTTGGAGCCCGTAAGGAAGATACAAACTTGGTTAAGAAGATAAATGAAGCTTTTTCTAG
    TCTTTACAAGGACGGCAAGTTCCAAGAAATCAGCCAAAAATGGTTTGGAGAAGATGTAGCAACCAAAGA
    AGTAAAAGAAGGACAG
    SP012  amino  acid  (SEQ  ID  NO:18)
    GKNSSETSGDNWSKYQSNKSITIGFDSTFVPMGFAQKDGSYAGFDIDLATAVFEKYGITVNWQPIDWDL
    KEAELTKGTIDLIWNGYSATDERREKVAFSNSYMKNEQVLVTKKSSGITTAKDMTGKTLGAQAGSSGYA
    DFEANPEILKNIVANKEANQYQTFNEALIDLKNDRIDGLLIDRVYANYYLEAEGVLNDYNVFTVGLETE
    AFAVGARKEDTNLVKKINEAFSSLYKDGKFQEISQKWFGEDVATKEVKEGQ
    SP013  nucleotide  (SEQ  ID  NO:19)
    TGCTAGCGGAAAAAAAGATACAACTTCTGGTCAAAAACTAAAAGTTGTTGCTACAAACTCAATCATCGC
    TGATATTACTAAAAATATTGCTGGTGACAAAATTGACCTTCATAGTATCGTTCCGATTGGGCAAGACCC
    ACACGAATACGAACCACTTCCTGAAGACGTTAAGAAAACTTCTGAGGCTAATTTGATTTTCTATAACGG
    TATCAACCTTGAAACAGGTGGCAATGCTTGGTTTACAAAATTGGTAGAAAATGCCAAGAAAACTGAAAA
    CAAAGACTACTTCGCAGTCAGCGACGGCGTTGATGTTATCTACCTTGAAGGTCAAAATGAAAAAGGAAA
    AGAAGACCCACACGCTTGGCTTAACCTTGAAAACGGTATTATTTTTGCTAAAAATATCGCCAAACAATT
    GAGCGCCAAAGACCCTAACAATAAAGAATTCTATGAAAAAAATCTCAAAGAATATACTGATAAGTTAGA
    CAAACTTGATAAAGAAAGTAAGGATAAATTTAATAAGATCCCTGCTGAAAAGAAACTCATTGTAACCAG
    CGAAGGAGCATTCAAATACTTCTCTAAAGCCTATGGTGTCCCAAGTGCTTACATCTGGGAAATCAATAC
    TGAAGAAGAAGGAACTCCTGAACAAATCAAGACCTTGGTTGAAAAACTTCGCCAAACAAAAGTTCCATC
    ACTCTTTGTAGAATCAAGTGTGGATGACCGTCCAATGAAAACTGTTTCTCAAGACACAAACATCCCAAT
    CTACGCTCAAATCTTTACTGACTCTATCGCAGAACAAGGTAAAGAAGGCGACAGCTACTACAGCATGAT
    GAAATACAACCTTGACAAGATTGCTGAAGGATTGGCAAAA
    SP013  amino  acid  (SEQ  ID  NO:20)
    ASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIVPIGQDPHEYEPLPEDVKKTSEANLIFYNG
    INLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQL
    SAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINT
    EEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMM
    KYNLDKIAEGLAK
    SP014  nucleotide  (SEQ  ID  NO:21)
    TGGCTCAAAAAATACAGCTTCAAGTCCAGATTATAAGTTGGAAGGTGTAACATTCCCGCTTCAAGAAAA
    GAAAACATTGAAGTTTATGACAGCCAGTTCACCGTTATCTCCTAAAGACCCAAATGAAAAGTTAATTTT
    GCAACGTTTGGAGAAGGAAACTGGCGTTCATATTGACTGGACCAACTACCAATCCGACTTTGCAGAAAA


ACGTAACTTGGATATTTCTAGTGGTGATTTACCAGATGCTATCCACAACGACGGAGCTTCAGATGTGGA CTTGATGAACTGGGCTAAAAAAGGTGTTATTATTCCAGTTGAAGATTTGATTGATAAATACATGCCAAA TCTTAAGAAAATTTTGGATGAGAAACCAGAGTACAAGGCCTTGATGACAGCACCTGATGGGCACATTTA CTCATTTCCATGGATTGAAGAGCTTGGAGATGGTAAAGAGTCTATTCACAGTGTCAACGATATGGCTTG GATTAACAAAGATTGGCTTAAGAAACTTGGTCTTGAAATGCCAAAAACTACTGATGAITTGATTAAAGT CCTAGAAGCTTTCAAAAACGGGGATCCAAATGGAAATGGAGAGGCTGATGAAATTCCATTTTCATTTAT TAGTGGTAACGGAAACGAAGATTTTAAATTCCTATTTGCTGCATTTGGTATAGGGGATAACGATGATCA TTTAGTAGTAGGAAATGATGGCAAAGTTGACTTCACAGCAGATAACGATAACTATAAAGAAGGTGTCAA ATTTATCCGTCAATTGCAAGAAAAAGGCCTGATTGATAAAGAAGCTTTCGAACATGATTGGAATAGTTA CATTGCTAAAGGTCATGATCAGAAATTTGGTGTTTACTTTACATGGGATAAGAATAATGTTACTGGAAG TAACGAAAGTTATGATGTTTTACCAGTACTTGCTGGACCAAGTGGTCAAAAACACGTAGCTCGTACAAA CGGTATGGGATTTGCACGTGACAAGATGGTTATTACCAGTGTAAACAAAAACCTAGAATTGACAGCTAA ATGGATTGATGCACAATACGCTCCACTCCAATCTGTGCAAAATAACTGGGGAACTTACGGAGATGACAA ACAACAAAACATCTTTGAATTGGATCAAGCGTCAAATAGTCTAAAACACTTACCACTAAACGGAACTGC ACCAGCAGAACTTCGTCAAAAGACTGAAGTAGGAGGACCACTAGCTATCCTAGATTCATACTATGGTAA AGTAACAACCATGCCTGATGATGCCAAATGGCGTTTGGATCTTATCAAAGAATATTATGTTCCTTACAT GAGCAATGTCAATAACTATCCAAGAGTCTTTATGACACAGGAAGATTTGGACAAGATTGCCCATATCGA AGCAGATATGAATGACTATATCTACCGTAAACGTGCTGAATGGATTGTAAATGGCAATATTGATACTGA GTGGGATGATTACAAGAAAGAACTTGAAAAATACGGACTTTCTGATTACCTCGCTATTAAACAAAAATA CTACGACCAATACCAAGCAAACAAAAAC SP014 amino acid (SEQ ID NO:22) GSKNTASSPDYKLEGVTFPLQEKKTLKFMTASSPLSPKDPNEKLILQRLEKETGVHIDWTNYQSDFAEK RNLDISSGDLPDAIHNDGASDVDLMNWAKKGVIIPVEDLIDKYMPNLKKILDEKPEYKALMTAPDGHIY SFPWIEELGDGKESIHSVNDMAWINKDWLKKLGLEMPKTTDDLIKVLEAFKNGDPNGNGEADEIPFSFI SGNGNEDFKFLFAAFGIGDNDDHLVVGNDGKVDFTADNDNYKEGVKFIRQLQEKGLIDKEAFEHDWNSY IAKGHDQKFGVYFTWDKNNVTGSNESYDVLPVLAGPSGQKHVARTNGMGFARDKMVITSVNKNLELTAK WIDAQYAPLQSVQNNWGTYGDDKQQNIFELDQASNSLKHLPLNGTAPAELRQKTEVGGPLAILDSYYGK VTTMPDDAKWRLDLIKEYYVPYMSNNNNYPRVFMTQEDLDKIAHIEADMMDYIYRKPAEWIVNGNIDTE WDDYKKELEKYGLSDYLAIKQKYYDQYQANKN SP015 nucleotide (SEQ ID NO:23) 
TAGTACAAACTCAAGCACTAGTCAGACAGAGACCAGTAGCTCTGCTCCAACAGAGGTAACCATTAAAAG TTCACTGGACGAGGTCAAACTTTCCAAAGTTCCTGAAAAGATTGTGACCTTTGACCTCGGCGCTGCGGA TACTATTCGCGCTTTAGGATTTGAAAAAAATATCGTCGGAATGCCTACAAAAACTGTTCCGACTTATCT AAAAGACCTAGTGGGAACTGTCAAAAATGTTGGTTCTATGAAAGAACCTGATTTAGAAGCTATCGCCGC CCTTGAGCCTGATTTGATTATCGCTTCGCCACGTACACAAAAATTCGTAGACAAATTCAAAGAAATCGC CCCAACCGTTCTCTTCCAAGCAAGCAAGGACGACTACTGGACTTCTACCAAGGCTAATATCGAATCCTT AGCAAGTGCCTTCGGCGAAACTGGTACACAGAAAGCCAAGGAAGAATTGACCAAGCTAGACAAGAGCAT CCAAGAAGTCGCTACTAAAAATGAAAGCTCTGACAAAAAAGCCCTTGCGATCCTCCTTAATGAAGGAAA AATGGCAGCCTTTGGTGCCAAATCTCGTTTCTCTTTCTTGTACCAAACCTTGAAATTCAAACCAACTGA TACAAAATTTGAAGACTCACGCCACGGACAAGAAGTCAGCTTTGAAAGTGTCAAAGAAATCAACCCTGA CATCCTCTTTGTCATCAACCGTACCCTTGCCATCGGTGGGGACAACTCTAGCAACGACGGTGTCCTAGA AAATGCCCTTATCGCTGAAACACCTGCTGCTAAAAATGGTAAGATTATCCAACTAACACCAGACCTCTG GTATCTAAGCGGAGGCGGACTTGAATCAACAAAACTCATGATTGAAGACATACAAAAAGCTTTGAAA SP015 amino acid (SEQ ID NO:24) STNSSTSQTETSSSAPTEVTIKSSLDEVKLSKVPEKIVTFDLGAADTIPALGFEKNIVGMPTKTVPTYL KDLVGTVKNVGSMKEPDLEAIAALEPDLIIASPRTQKFVDKFKEIAPTVLFQASKDDYWTSTKANIESL ASAFGETGTQKAKEELTKLDKSIQEVATKNESSDKKALAILLNEGKNAAFGAKSRFSFLYQTLKFKPTD TKFEDSRHGQEVSFESVKEINPDILFVINRTLAIGGDNSSNDGVLENALIAETPAAKNGKIIQLTPDLW YLSGGGLESTKLMIEDIQKALK SP016 nucleotide (SEQ ID NO:25) TGGCAATTCTGGCGGAAGTAAAGATGCTGCCAAATCAGGTGGTGACGGTGCCAAAACAGAAATCACTTG GTGGGCATTCCCAGTATTTACCCAAGAAAAAACTGGTGACGGTGTTGGAACTTATGAAAAATCAATCAT CGAAGCGTTTGAAAAAGCAAACCCAGATATAAAAGTGAAATTGGAAACCATCGACTTCAAGTCAGGTCC TGAAAAAATCACAACAGCCATCGAAGCAGGAACAGCTCCAGACGTACTCTTTGATGCACCAGGACGTAT CATCCAATACGGTAAAAACGGTAAATTGGCTGAGTTGAATGACCTCTTCACAGATGAATTTGTTAAAGA TGTCAACAATGAAAACATCGTACAAGCAAGTAAAGCTGGAGACAAGGCTTATATGTATCCGATTAGTTC TGCCCCATTCTACATGGCAATGAACAAGAAAATGTTAGAAGATGCTGGAGTAGCAAACCTTGTAAAAGA AGGTTGGACAACTGATGATTTTGAAAAAGTATTGAAAGCACTTAAAGACAAGGGTTACACACCAGGTTC ATTGTTCAGTTCTGGTCAAGGGGGAGACCAAGGAACACGTGCCTTTATCTCTAACCTTTATAGCGGTTC TGTAACAGATGAAAAAGTTAGCAAATATACAACTGATGATCCTAAATTCGTCAAAGGTCTTGAAAAAGC 
AACTAGCTGGATTAAAGACAATTTGATCAATAATGGTTCACAATTTGACGGTGGGGCAGATATCCAAAA CTTTGCCAACGGTCAAACATCTTACACAATCCTTTGGGCACCAGCTCAAAATGGTATCCAAGCTAAACT TTTAGAAGCAAGTAAGGTAGAAGTGGTAGAAGTACCATTCCCATCAGACGAAGGTAAGCCAGCTCTTGA GTACCTTGTAAACGGGTTTGCAGTATTCAACAATAAAGACGACAAGAAAGTCGCTGCATCTAAGAAATT CATCCAGTTTATCGCAGATGACAAGGAGTGGGGACCTAAAGACGTAGTTCGTACAGGTGCTTTCCCAGT CCGTACTTCATTTGGAAAACTTTATGAAGACAAACGCATGGAAACAATCAGCGGCTGGACTCAATACTA CTCACCATACTACAACACTATTGATGGATTTGCTGAAATGAGAACACTTTGGTTCCCAATGTTGCAATC TGTATCAAATGGTGACGAAAAACCAGCAGATGCTTTGAAAGCCTTCACTGAAAAAGCGAACGAAACAAT CAAAAAAGCTATGAAACAA SP016 amino acid (SEQ ID NO:26) GNSGGSKDAAKSGGDGAKTEITWWAFPVFTQEKTGDGVGTYEKSIIEAFEKANPDIKVKLETIDFKSGP EKITTAIEAGTAPDVLFDAPGRIIQYGKNGKLAELNDLFTDEFVKDVNNENIVQASKAGDKAYMYPISS APFYMAMNKKMLEDAGVANLVKEGWTTDDFEKVLKALKDKGYTPGSLFSSGQGGDQGTRAFISNLYSGS VTDEKVSKYTTDDPKFVKGLEKATSWIKDNLINNGSQFDGGADIQNFANGQTSYTILWAPAQNGIQAKL LEASKVEVVEVPFPSDEGKPALEYLVNGFAVFNNKDDKKVAASKKFIQFIADDKEWGPKDVVRTGAFPV RTSFGKLYEDKRMETISGWTQYYSPYYNTIDGFAEMRTLWFPMLQSVSNGDEKPADALKAFTEKANETI KKAMKQ SP017 nucleotide (SEQ ID NO:27) TTCACAAGAAAAAACAAAAAATGAAGATGGAGAAACTAAGACAGAACAGACAGCCAAAGCTGATGGAAC AGTCGGTAGTAAGTCTCAAGGAGCTGCCCAGAAGAAAGCAGAAGTGGTCAATAAAGGTGATTACTACAG CATTCAAGGGAAATACGATGAAATCATCGTAGCCAACAAACACTATCCATTGTCTAAAGACTATAATCC AGGGGAAAATCCAACAGCCAAGGCAGAGTTGGTCAAACTCATCAAAGCGATGCAAGAGGCAGGTTTCCC TATTAGTGATCATTACAGTGGTTTTAGAAGTTATGAAACTCAGACCAAGCTCTATCAAGATTATGTCAA CCAAGATGGAAAGGCAGCAGCTGACCGTTACTCTGCCCGTCCTGGCTATAGCGAACACCAGACAGGCTT GGCCTTTGATGTGATTGGGACTGATGGTGATTTGGTGACAGAAGAAAAAGCAGCCCAATGGCTCTTGGA TCATGCAGCTGATTATGGCTTTGTTGTCCGTTATCTCAAAGGCAAGGAAAAGGAAACAGGCTATATGGC TGAAGAATGGCACCTGCGTTATGTAGGAAAAGAAGCTAAAGAAATTGCTGCAAGTGGTCTCAGTTTGGA AGAATACTATGGCTTTGAAGGCGGAGACTACGTCGAT SP017 amino acid (SEQ ID NO:28) SQEKTKNEDGETKTEQTAKADGTVGSKSQGAAQKKAEVVNKGDYYSIQGKYDEIIVANKHYPLSKDYNP GENPTAKAELVKLIKAMQEAGFPISDHYSGFRSYETQTKLYQDYVNQDGKAAADRYSARPGYSEHQTGL AFDVIGTDGDLVTEEKAAQWLLDHAADYGFVVRYLKGKEKETGYMAEEWHLRYVGKEAKEIAASGLSLE EYYGFEGGDYVD SP019 
nucleotide (SEQ ID NO:29) GAAAGGTCTGTGGTCAAATAATCTTACCTGCGGTTATGATGAAAAAATAATCTTGGAAAATATAAATAT AAAAATACCTGAAGAAAAAATATCAGTTATTATTGGGTCAAATGGTTGTGGGAAATCAACACTCATTAA AACCTTGTCTCGACTTATAAAGCCATTAGAGGGAGAAGTATTGCTTGATAATAAATCAATTAATTCTTA TAAAGAAAAAGATTTAGCAAAACACATAGCTATATTACCTCAATCTCCAATAATCCCTGAATCAATAAC AGTAGCTGATCTTGTAAGCCGTGGTCGTTTCCCCTACAGAAAGCCTTTTAAGAGTCTTGGAAAAGATGA CCTTGAAATAATAAACAGATCAATGGTTAAGGCCAATGTTGAAGATCTAGCAAATAACCTAGTTGAAGA ACTTTCTGGGGGTCAAAGGCAAAGAGTATGGATAGCTCTAGCCCTAGCCCAAGATACAAGTATCCTACT TTTAGATGAGCCAACTACTTACTTGGATATCTCATATCAAATAGAACTATTAGACCTCTTGACTGATCT AAACCAAAAATATAAGACAACCATTTGCATGATTTTGCACGATATAAATCTAACAGCAAGATACGCTGA TTACCTATTTGCAATTAAAGAAGGTAAACTTGTTGCAGAGGGAAAGCCTGAAGATATACTAAATGATAA ACTAGTTAAAGATATCTTTAATCTTGAAGCAAAAATTATACGTGACCCTATTTCCAATTCGCCTCTAAT GATTCCTATTGGCAAGCACCATGTTAACTCT SP019 amino acid (SEQ ID NO:30) KGLWSNNLTCGYDEKIILENINIKIPEEKISVIIGSNGCGKSTLIKTLSRLIKPLEGEVLLDNKSINSY KEKDLAKHIAILPQSPIIPESITVADLVSRGRFPYRKPFKSLGKDDLEIINRSMVKANVEDLANNLVEE LSGGQRQRVWIALALAQDTSILLLDEPTTYLDISYQIELLDLLTDLNQKYKTTICMILHDINLTARYAD YLFAIKEGKLVAEGKPEDILNDKLVKDIFNLEAKIIRDPISNSPLMIPTGKHHVS SP020 nucleotide (SEQ ID NO:31) AAACTCAGAAAAGAAAGCAGACAATGCAACAACTATCAAAATCGCAACTGTTAACCGTAGCGGTTCTGA AGAAAAACGTTGGGACAAAATCCAAGAATTGGTTAAAAAAGACGGAATTACCTTGGAATTTACAGAGTT CACAGACTACTCACAACCAAACAAAGCAACTGCTGATGGCGAAGTAGATTTGAACGCTTTCCAACACTA TAACTTCTTGAACAACTGGAACAAAGAAAACGGAAAAGACCTTGTAGCGATTGCAGATACTTACATCTC TCCAATCCGCCTTTACTCAGGTTTGAATGGAAGTGCCAACAAGTACACTAAAGTAGAAGACATCCCAGC AAACGGAGAAATCGCTGTACCGAATGACGCTACAAACGAAAGCCGTGCGCTTTATTTGCTTCAATCAGC TGGCTTGATTAAATTGGATGTTTCTGGAACTGCTCTTGCAACAGTTGCCAACATCAAAGAAAATCCAAA GAACTTGAAAATCACTGAATTGGACGCTAGCCAAACAGCTCGTTCATTGTCATCAGTTGACGCTGCCGT TGTAAACAATACCTTCGTTACAGAAGCAAAATTGGACTACAAGAAATCACTTTTCAAAGAACAAGCTGA TGAAAACTCAAAACAATGGTACAACATCATTGTTGCAAAAAAAGATTGGGAAACATCACCTAAGGCTGA TGCTATCAAGAAAGTAATCGCAGCTTACCACACAGATGACGTGAAAAAAGTTATCGAAGAATCATCAGA TGGTTTGGATCAACCAGTTTGG SP020 amino acid (SEQ ID NO:32) 
NSEKKADNATTIKIATVNRSGSEEKRWDKIQELVKKDGITLEFTEFTDYSQPNKATADGEVDLNAFQHY NFLNNWNKENGKDLVAIADTYISPIRLYSGLNGSANKYTKVEDIPANGEIAVPNDATNESRALYLLQSA GLIKLDVSGTALATVANIKENPKNLKITELDASQTARSLSSVDAAVVNNTFVTEAKLDYKKSLFKEQAD ENSKQWYNIIVAKKDWETSPKADAIKKVIAAYHTDDVKKVIEESSDGLDQPVW SP021 nucleotide (SEQ ID NO:33) TTCGAAAGGGTCAGAAGGTGCAGACCTTATCAGCATGAAAGGGGATGTCATTACAGAACATCAATTTTA TGAGCAAGTGAAAAGCAACCCTTCAGCCCAACAAGTCTTGTTAAATATGACCATCCAAAAAGTTTTTGA AAAACAATATGGCTCAGAGCTTGATGATAAAGAGGTTGATGATACTATTGCCGAAGAAAAAAAACAATA TGGCGAAAACTACCAACGTGTCTTGTCACAAGCAGGTATGACTCTTGAAACACGTAAAGCTCAAATTCG TACAAGTAAATTAGTTGAGTTGGCAGTTAAGAAGGTAGCAGAAGCTGAATTGACAGATGAAGCCTATAA GAAAGCCTTTGATGAGTACACTCCAGATGTAACGGCTCAAATCATCCGTCTTAATAATGAAGATAAGGC CAAAGAAGTTCTCGAAAAAGCCAAGGCAGAAGGTGCTGATTTTGCTCAATTAGCCAAAGATAATTCAAC TGATGAAAAAACAAAAGAAAATGGTGGAGAAATTACCTTTGATTCTGCTTCAACAGAAGTACCTGGAGC AAGTCCAAAAAAGCCGCTTTTCGCTTTTAGATGTGGGATGGTGTTTCTGGATGTGGATTACAGCAACTG GGGCACACCAAGCCTACAG SP021 amino acid (SEQ ID NO:34) SKGSEGADLISMKGDVITEHQFYEQVKSNPSAQQVLLNMTIQKVFEKQYGSELDDKEVDDTIAEEKKQY GENYQRVLSQAGMTLETRKAQIRTSKLVELAVKKVAEAELTDEAYKKAFDEYTPDVTAQIIRLNNEDKA KEVLEKAKAEGADFAQLAKDNSTDEKTKENGGEITFDSASTEVPGASPKKPLFAFRCGMVFLDVDYSNW GTPSLQ SP022 nucleotide (SEQ ID NO:35) GGGGATGGCAGCTTTTAAAAATCCTAACAATCAATACAAAGCTATTACAATTGCTCAAACTCTAGGTGA TGATGCTTCTTCAGAGGAATTGGCTGGTAGATATGGTTCTGCTGTTCAGTGTACAGAAGTGACTGCCTC AAACCTTTCAACAGTTAAAACTAAAGCTACGGTTGTAGAAAAACCACTGAAAGATTTTAGAGCGTCTAC GTCTGATCAGTCTGGTTGGGTGGAATCTAATGGTAAATGGTATTTCTATGAGTCTGGTGATGTGAAGAC AGGTTGGGTGAAAACAGATGGTAAATGGTACTATTTGAATGACTTAGGTGTCATGCAGACTGGATTTGT AAAATTTTCTGGTAGCTGGTATTACTTGAGCAATTCAGGTGCTATGTTTACAGGCTGGGGAACAGATGG TAGCAGATGGTTCTACTTTGACGGCTCAGGAGCTATGAAGACAGGCTGGTACAAGGAAAATGGCACTTG GTATTACCTTGACGAAGCAGGTATCATGAAGACAGGTTGGTTTAAAGTCGGACCACACTGGTACTATGC CTACGGTTCAGGAGCTTTGGCTGTGAGCACAACAACACCAGATGGTTACCGTGTAAATGGTAATGGTGA ATGGGTAAAC SP022 amino acid (SEQ ID NO:36) GMAAFKNPNNQYKAITIAQTLGDDASSEELAGRYGSAVQCTEVTASNLSTVKTKATVVEKPLKDFRAST 
SDQSGWVESNGKWYFYESGDVKTGWVKTDGKWYYLNDLGVMQTGFVKFSGSWYYLSNSGAMFTGWGTDG SRWFYFDGSGAMKTGWYKENGTWYYLDEAGIMKTGWFKVGPHWYYAYGSGALAVSTTTPDGYRVNGNGE WVN SP023 nucleotide (SEQ ID NO:37) AGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAGAGTAAACAAGCTGAGGCTACAAGGTTAAA AAAAATCAAGACAGATCGTGAAGAAGCAGAAGAAGAAGCTAAACGAAGAGCAGATGCTAAAGAGCAAGG TAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTAGCAACACCTGATAAAAAAGAAAATGA TGCGAAGTCTTCAGATTCTAGCGTAGGTGAAGAAACTCTTCCAAGCCCATCCCTGAAACCAGAAAAAAA GGTAGCAGAAGCTGAGAAGAAGGTTGAAGAAGCTAAGAAAAAAGCCGAGGATCAAAAAGAAGAAGATCG CCGTAACTACCCAACCAATACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAA AAAAGCGGAGCTTGAACTAGTAAAAGAGGAAGCTAAGGAACCTCGAAACGAGGAAAAAGTTAAGCAAGC AAAAGCGGAAGTTGAGAGTAAAAAAGCTGAGGCTACAAGGTTAGAAAAAATCAAGACAGATCGTAAAAA AGCAGAAGAAGAAGCTAAACGAAAAGCAGCAGAAGAAGATAAAGTTAAAGAAAAACCAGCTGAACAACC ACAACCAGCGCCGGCTCCAAAAGCAGAAAAACCAGCTCCAGCTCCAAAACCAGAGAATCCAGCTGAACA ACCAAAAGCAGAAAAACCAGCTGATCAACAAGCTGAAGAAGACTATGCTCGTAGATCAGAAGAAGAATA TAATCGCTTGACTCAACAGCAACCGCCAAAAACTGAAAAACCAGCACAACCATCTACTCCAAAAACAGG CTGGAAACAAGAAAACGGTATGTGGTACTTCTACAATACTGATGGTTCAATGGCGACAGGATGGCTCCA AAACAATGGCTCATGGTACTACCTCAACAGCAATGGCGCTATGGCGACAGGATGGCTCCAAAACAATGG TTCATGGTACTATCTAAACGCTAATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGTTCATGGTA CTACCTAAACGCTAATGGTTCAATGGCGACAGGATGGCTCCAATACAATGGCTCATGGTACTACCTAAA CGCTAATGGTTCAATGGCGACAGGATGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGG TGATATGGCGACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAA AGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATGTCAATGGCTCAGGTGCCCTTGCAGTCAA CACAACTGTAGATGGCTATGGAGTCAATGCCAATGGTGAATGGGTAAAC SP023 amino acid (SEQ ID NO:38) DEQKIKQAEAEVESKQAEATRLKKIKTDREEAEEEAKRRADAKEQGKPKGRAKRGVPGELATPDKKEND AKSSDSSVGEETLPSPSLKPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEVK KAELELVKEEAKEPRNEEKVKQAKAEVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKPAEQP QPAPAPKAEKPAPAPKPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTG WKQENGMWYFYNTDGSMATGWLQNNGSWYYLNSNGAMATGWLQNNGSWYYLNANGSMATGWLQNNGSWY 
YLNANGSMATGWLQYNGSWYYLNANGSNATGWLQYNGSWYYLNANGDMATGWVKDGDTWYYLEASGAMK ASQWFKVSDKWYYVNGSGALAVNTTVDGYGVNANGEWVN SP025 nucleotide (SEQ ID NO:39) CTGTGGTGAGGAAGAAACTAAAAAGACTCAAGCAGCACAACAGCCAAAACAACAAACGACTGTACAACA AATTGCTGTTGGAAAAGATGCTCCAGACTTCACATTGCAATCCATGGATGGCAAAGAAGTTAAGTTATC TGATTTTAAGGGTAAAAAGGTTTACTTGAAGTTTTGGGCTTCATGGTGTGGTCCATGCAAGAAAAGTAT GCCAGAGTTGATGGAACTAGCGGCGAAACCAGATCGTGATTTCGAAATTCTTACTGTCATTGCACCAGG AATTCAAGGTGAAAAAACTGTTGAGCAATTCCCACAATGGTTCCAGGAACAAGGATATAAGGATATCCC AGTTCTTTATGATACCAAAGCAACCACTTCCAAGCTTATCAAATTCGAAGCATTCCTACAGAATATT SP025 amino acid (SEQ ID NO:40) CGEEETKKTQAAQQPKQQTTVQQIAVGKDAPDFTLQSMDGKEVKLSDFKGKKVYLKFWASWCGPCKKSM PELMELAAKPDRDFEILTVIAPGIQGEKTVEQFPQWFQEQGYKDIPVLYDTKATTSKLIKFEAFLQNI SP026 nucleotide (SEQ ID NO:41) GACTTTTAACAATAAAACTATTGAAGAGTTGCACAATCTCCTTGTCTCTAAGGAAATTTCTGCAACAGA ATTGACCCAAGCAACACTTGAAAATATCAAGTCTCGTGAGGAAGCCCTCAATTCATTTGTCACCATCGC TGAGGAGCAAGCTCTTGTTCAAGCTAAAGCCATTGATGAAGCtGGAATTGATGCTGACAATGTCCTTTC AGGAATTCCACTTGCTGTTAAGGATAACATCTCTACAGACGGTATTCTCACAACTGCTGCCTCAAAAAT GCTCTACAACTATGAGCCAATCTTTGATGCGACagCTgTTGCCAATGCAAAAACCAAGGGCATGATTGT CGTTGGAAAGACCAACATGGACGAATTTGCTATGGGTGGTTCAGGtGAAACTTCACACTACGGAGCAAC TAAAAACGCTTGGAACCACAGCAAGGTTCCTGGTGGGTCATCAAGTGGTTCTGCCGCAGCTGTAGCCTC AGGACAAGTTCGCTTGTCACTTGGTTCTGATACTGGTGGTTCCATCCGCCAACCTGCTGCCTTCAACGG AATCGTTGGTCTCAAACCAACCTACGGAACAGTTTCACGTTTCGGTCTCATTGCCTTTGGTAGCTCATT AGACCAGATTGGACCTTTTGCTCCTACTGTTAAGGAAAATGCCCTCTTGCTCAACGCTATTGCCAGCGA AGATGCTAAAGACTCTACTTCTGCTCCTGTCCGCATCGCCGACTTTACTTCAAAAATCGGCCAAGACAT CAAGGGTATGAAAATCGCTTTGCCTAAGGAATACCTAGGCGAAGGAATTGATCCAGAGGTTAAGGAAAC AATCTTAAACGCGGCCAAACACTTTGAAAAATTGGGTGCTATCGTCGAAGAAGTCAGCCTTCCTCACTC TAAATACGGTGTTGCCGTTTATTACATCATCGCTTCATCAGAAGCTTCATCAAACTTGCAACGCTTCGA CGGTATCCGTTACGGCTATCGCGCAGAAGATGCAACCAACCTTGATGAAATCTATGTAAACAGCCGAAG CCAAGGTTTTGGTGAAGAGGTAAAACGTCGTATCATGCTGGGTACTTTCAGTCTTTCATCAGGTTACTA TGATGCCTACTACAAAAAGGCTGGTCAAGTCCGTACCCTCATCATTCAAGATTTCGAAAAAGTCTTCGC 
GGATTACGATTTGATTTTGGGTCCAACTGCTCCAAGTGTTGCCTATGACTTGGATTCTCTCAACCATGA CCCAGTTGCCATGTACTTAGCCGACCTATTGACCATACCTGTAAACTTGGCAGGACTGCCTGGAATTTC GATTCCTGCTGGATTCTCTCAAGGTCTACCTGTCGGACTCCAATTGATTGGTCCCAAGTACTCTGAGGA AACCATTTACCAAGCTGCTGCTGCTTTTGAAGCAACAACAGACTACCACAAACAACAACCCGTGATTTT TGGAGGTGACAAC SP026 amino acid (SEQ ID NO:42) TFNNKTIEELHNLLVSKEISATELTQATLENIKSREEALNSFVTIAEEQALVQAKAIDEAGIDADNVLS GIPLAVKDNISTDGILTTAASKMLYNYEPIFDATAVANAKTKGMIVVGKTNMDEFAMGGSGETSHYGAT KNAWNHSKVPGGSSSGSAAAVASGQVRLSLGSDTGGSIRQPAAFNGIVGLKPTYGTVSRFGLIAFGSSL DQIGPFAPTVKENALLLNAIASEDAKDSTSAPVRIADFTSKIGQDIKGMKIALPKEYLGEGIDPEVKET ILNAAKHFEKLGAIVEEVSLPHSKYGVAVYYIIASSEASSNLQRFDGIRYGYRAEDATNLDEIYVNSRS QGFGEEVKRRIMLGTFSLSSGYYDAYYKKAGQVRTLIIQDFEKVFADYDLILGPTAPSVAYDLDSLNHD PVAMYLADLLTIPVNLAGLPGISIPAGFSQGLPVGLQLIGPKYSEETIYQAAAAFEATTDYHKQQPVIF GGDN SP030 nucleotide (SEQ ID NO:43) CTTTACAGGTAAACAACTACAAGTCGGCGACAAGGCGCTTGATTTTTCTCTTACTACAACAGATCTTTC TAAAAAATCTCTGGCTGATTTTGATGGCAAGAAAAAAGTCTTGAGTGTCGTTCCTTCTATCGATACAGG CATCTGCTCAACTCAAACACGTCGTTTTAATGAAGAATTGGCTGGACTGGACAACACGGTCGTATTGAC TGTTTCAATGGACCTACCTTTTGCTCAAAAACGTTGGTGCGGTGCTGAAGGCCTTGACAATGCCATTAT GCTTTCAGACTACTTTGACCATTCTTTCGGGCGCGATTATGCCCTCTTGATCAACGAATGGCACCTATT AGCACGCGCAGTCTTTGTCCTCGATACTGACAATACGATTCGCTACGTTGAATACGTGGATAATATCAA TTCTGAGCCAAACTTCGAA SP030 amino acid (SEQ ID NO:44) FTGKQLQVGDKALDFSLTTTDLSKKSLADFDGKKKVLSVVPSIDTGICSTQTRRFNEELAGLDNTVVLT VSMDLPFAQKRWCGAEGLDNAIMLSDYFDHSFGRDYALLINEWHLLARAVFVLDTDNTIRYVEYVDNIN SEPNFE SP031 nucleotide (SEQ ID NO:45) CCAGGCTGATACAAGTATCGCAGACATTCAAAAAAGAGGCGAACTGGTTGTCGGTGTCAAACAAGACGT TCCCAATTTTGGTTACAAnGATCCCAAGACCGGTACTTATTCTGGTATCGAAaCCGACTTGGCCAAGAT GGTAGCTGATGAACTCAAGGTCAAGATTCGCTATGTGCCGGTTACAGCACAAACCCGCGGCCCCCTTCT AGACAATGAACAGGTCGATATGGATATCGCGACCTTTACCATCACGGACGAACGCAAAAAACTCTACAA CTTTACCAGTCCCTACTACACAGACGCTTCTGGATTTTTGGTCAATAAATCTGCCAAAATCAAAAAGAT TGAGGACCTAAACGGCAAAACCATCGGAGTCGCCCAAGGTTCTATCACCCAACGCCTGATTACTGAACT GGGTAAAAAGAAAGGTCTGAAGTTTAAATTCGTCGAACTTGGTTCCTACCCAGAATTGATTACTTCCCT 
GCACGCTCATCGTATCGATACCTTTTCCGTTGACCGCTCTATTCTATCTGGCTACACTAGTAAACGGAC AGCACTACTAGATGATAGTTTCAAGCCATCTGACTACGGTATTGTTACCAAGAAATCAAATACAGAGCT CAACGACTATCTTGATAACTTGGTTACTAAATGGAGCAAGGATGGTAGTTTGCAGAAACTTTATGACCG
TTACAAGCTCAAACCATCTAGCCATACTGCAGAT SP031 amino acid (SEQ ID NO:46) QADTSIADIQKRGELVVGVKQDVPNFGYXDPKTGTYSGIETDLAKMVADELKVKIRYVPVTAQTRGPLL DNEQVDMDIATFTITDERKKLYNFTSPYYTDASGFLVNKSAKIKKIEDLNGKTIGVAQGSITQRLITEL GKKKGLKFKFVELGSYPELITSLHAHRIDTFSVDRSILSGYTSKRTALLDDSFKPSDYGIVTKKSNTEL NDYLDNLVTKWSKDGSLQKLYDRYKLKPSSHTAD SP032 nucleotide (SEQ ID NO:47) GTCTGTATCATTTGAAAACAAAGAAACAAACCGTGGTGTCTTgACTTTCACTATCTCTCAAGACCAAAT CAAACCAGAATTGGACCGTGTCTTCAAGtCAGTGAAGAAATCTCTTAATGTTCCAGGTTTCCGTAAAGG TCACCTTCCACGCCCTATCTTCGACCAAAAATTTGGTGAAGAAGCTCTTTATCAAGATGCAATGAACGC ACTTTTGCCAAACGCTTATGAAGCAGCTGTAAAAGAAGCTGGTCTTGAAGTGGTTGCCCAACCAAAAAT TGACGTAACTTCAATGGAAAAAGGTCAAGACTGGGTTATCACTGCTGAAGTCGTTACAAAACCTGAAGT AAAATTGGGTGACTACAAAAACCTTGAAGTATCAGTTGATGTAGAAAAAGAAGTAACTGACGCTGATGT CGAAGAGCGTATCGAACGCGAACGCAACAACCTGGCTGAATTGGTTATCAAGGAAGCTGCTGCTGAAAA CGGCGACACTGTTGTGATCGACTTCGTTGGTTCTATCGACGGTGTTGAATTTGACGGTGGAAAAGGTGA AAACTTCTCACTTGGACTTGGTTCAGGTCAATTCATCCCTGGTTTCGAAGACCAATTGGTAGGTCACTC AGCTGGCGAAACCGTTGATGTTATCGTAACATTCCCAGAAGACTACCAAGCAGAAGACCTTGCAGGTAA AGAAGCTAAATTCGTGACAACTATCCACGAAGTAAAAGCTAAAGAAGTTCCGGCTCTTGACGATGAACT TGCAAAAGACATTGATGAAGAAGTTGAAACACTTGCTGACTTGAAAGAAAAATACAGCAAAGAATTGGC TGCTGCTAAAGAAGAAGCTTACAAAGATGCAGTTGAAGGTGCAGCAATTGATACAGCTGTAGAAAATGC TGAAATCGTAGAACTTCCAGAAGAAATGATCCATGAAGAAGTTCACCGTTCAGTAAATGAATTCCTTGG GAATTTGCAACGTCAAGGGATCAACCCTGACATGTACTTCCAAATCACTGGAACTACTCAAGAAGACCT TCACAACCAATACCAAGCAGAAGCTGAGTCACGTACTAAGACTAACCTTGTTATCGAAGCAGTTGCCAA AGCTGAAGGATTTGATGCTTCAGAAGAAGAAATCCAAAAAGAAGTTGAGCAATTGGCAGCAGACTACAA CATGGAAGTTGCACAAGTTCAAAACTTGCTTTCAGCTGACATGTTGAAACATGATATCACTATCAAAAA AGCTGTTGAATTGATCACAAGCACAGCAACAGTAAAA SP032 amino acid (SEQ ID NO:48) SVSFENKETNRGVLTFTISQDQIKPELDRVFKSVKKSLNVPGFRKGHLPRPIFDQKFGEEALYQDAMNA LLPNAYEAAVKEAGLEVVAQPKIDVTSMEKGQDWVITAEVVTKPEVKLGDYKNLEVSVDVEKEVTDADV EERIERERNNLAELVIKEAAAENGDTVVIDFVGSIDGVEFDGGKGENFSLGLGSGQFIPGFEDQLVGHS AGETVDVIVTFPEDYQAEDLAGKEAKFVTTIHEVKAKEVPALDDELAKDIDEEVETLADLKEKYSKELA 
AAKEEAYKDAVEGAAIDTAVENAEIVELPEEMIHEEVHRSVNEFLGNLQRQGINPDMYFQITGTTQEDL HNQYQAEAESRTKTNLVIEAVAKAEGFDASEEEIQKEVEQLAADYNMEVAQVQNLLSADMLKHDITIKK AVELITSTATVK SP033 nucleotide (SEQ ID NO:49) TGGTCAAAAGGAAAGTCAGACAGGAAAGGGGATGAAAATTGTGACCAGTTTTTATCCTATCTACGCTAT GGTTAAGGAAGTATCTGGTGACTTGAATGATGTTCGGATGATTCAGTCAAGTAGTGGTATTCACTCCTT TGAACCTTCGGCAAATGATATCGCAGCCATCTATGATGCAGATGTCTTTGTTTACCATTCTCATACACT CGAATCTTGGGCAGGAAGTCTGGATCCAAATCTAAAAAAATCCAAAGTGAAGGTCTTAGAGGCTTCTGA GGGAATGACCTTGGAACGTGTCCCTGGACTAGAGGATGTGGAAGCAGGGGATGGAGTTGATGAAAAAAC GCTCTATGACCCTCACACATGGCTAGATCCTGAAAAAGCTGGAGAAGAAGCCCAAATTATCGCTGATAA ACTTTCAGAGGTGGATAGTGAGCATAAAGAGACTTATCAAAAAAATGCGCAACCTTTATCAAAAAAGCT CAGGAAT SP033 amino acid (SEQ ID NO:50) GQKESQTGKGMKIVTSFYPIYAMVKEVSGDLNDVRMIQSSSGIHSFEPSANDIAAIYDADVFVYHSHTL ESWAGSLDPNLKKSKVKVLEASEGMTLERVPGLEDVEAGDGVDEKTLYDPHTWLDPEKAGEEAQIIADK LSEVDSEHKETYQKNAQPLSKKLRN SP034 nucleotide (SEQ ID NO:51) GAAGGATAGATATATTTTAGCATTTGAGACATCCTGTGATGAGACCAGTGTCGCCGTCTTGAAAAACGA CGATGAGCTCTTGTCCAATGTCATTGCTAGTCAAATTGAGAGTCACAAACGTTTTGGTGGCGTAGTGCC CGAAGTAGCCAGTCGTCACCATGTCGAGGTCATTACAGCCTGTATCGAGGAGGCATTGGCAGAAGCAGG GATTACCGAAGAGGACGTGACAGCTGTTGCGGTTACCTACGGACCAGGCTTGGTCGGAGCCTTGCTAGT TGGTTTGTCAGCTGCCAAGGCCTTTGCTTGGGCTCACGGACTTCCACTGATTCCTGTTAATCACATGGC TGGGCACCTCATGGCAGCTCAGAGTGTGGAGCCTTTGGAGTTTCCCTTGCTAGCCCTCTTGGTCAGCGG CGGACACACAGAGTTGGTTTATGTTTCGGAGGCAGGAGATTATAAGATTGTTGGGGAAACCCGTGATGA TGCGGTTGGTGAGGCTTATGATAAGGTCGGCCGTGTCATGGGCTTGACCTATCCTGCAGGTCGTGAGAT TGACGAGCTGGCTCATCAGGGGCAGGATATTTATGATTTCCCCCGTGCCATGATTAAGGAAGATAATCT GGAGTTCTCCTTCTCAGGTTTGAAATCTGCCTTTATCAATCTTCATCACAATGCCGAGCAAAAGGGAGA AAGCCTGTCTACAGAAGATTTGTGTGCTTCCTTCCAAGCAGCAGTTATGGACATTCTCATGGCAAAAAC CAAGAAGGCTTTGGAGAAATATCCTGTTAAAATCCTAGTTGTGGCAGGTGGTGTGGCAGCCAATAAAGG TCTCAGAGAACGCCTAGCAGCCGAAATCACAGATGTCAAGGTTATCATCCCCCCTCTGCGACTCTGCGG AGACAATGCAGGTATGATTGCCTATGCCAGCGTCAGCNAGTGGAACAAAGAAAACTTCGCAGGCTGGGA CCTCAATGCCAAACCAAGTCTTGCCTTTGATACCATGGAA SP034 amino acid (SEQ ID NO:52) 
KDRYILAFETSCDETSVAVLKNDDELLSNVIASQIESHKRFGGVVPEVASRHHVEVITACIEEALAEAG ITEEDVTAVAVTYGPGLVGALLVGLSAAKAFAWAHGLPLIPVNHMAGHLMAAQSVEPLEFPLLALLVSG GHTELVYVSEAGDYKIVGETRDDAVGEAYDKVGRVMGLTYPAGREIDELAHQGQDIYDFPRAMIKEDNL EFSFSGLKSAFINLHHNAEQKGESLSTEDLCASFQAAVMDILMAKTKKALEKYPVKILVVAGGVAANKG LRERLAAEITDVKVIIPPLRLCGDNAGMIAYASVSXWNKENFAGWDLNAKPSLAFDTME SP035 nucleotide (SEQ ID NO:53) GGTAGTTAAAGTTGGTATTAACGGTTTCGGACGTATCGGTCGTCTTGCTTTCCGTCGTATCCAAAACGT AGAAGGTGTTGAAGTTACACGCATCAACGACCTTACAGATCCAGTTATGCTTGCACACTTGTTGAAATA CGACACAACTCAAGGTCGTTTCGACGGTACTGTTGAAGTTAAAGAAGGTGGATTTGAAGTTAACGGTAA ATTCATCAAAGTTTCTGCTGAACGTGATCCAGAACAAATCGACTGGGCTACTGACGGTGTAGAAATCGT TCTTGAAGCTACTGGTTTCTTTGCTAAGAAAGAAGCAGCTGAAAAACACCTTAAAGGTGGAGCTAAAAA AGTTGTTATCACTGCTCCTGGTGGAAACGACGTTAAAACAGTTGTATTCAACACTAACCACGACGTTCT TGACGGTACTGAAACAGTTATCTCAGGTGCTTCATGTACTACAAACTGCTTGGCTCCAATGGCTAAAGC TCTTCAAGACAACTTTGGTGTTGTTGAAGGATTGATGACTACTATCCACGCTTACACTGGTGACCAAAT GATCCTTGACGGACCACACCGTGGTGGTGACCTTCGCCGTGCTCGCGCTGGTGCTGCAAACATCGTTCC TAACTCAACTGGTGCTGCAAAAGCTATCGGTCTTGTAATCCCAGAATTGAATGGTAAACTTGACGGATC TGCACAACGCGTTCCAACTCCAACTGGATCAGTTACTGAATTGGTAGCAGTTCTTGAAAAGAACGTTAC TGTTGATGAAGTGAACGCAGCTATGAAAGCAGCTTCAAACGAATCATACGGTTACACAGAAGATCCAAT CGTATCTTCAGATATCGTAGGTATGTCTTACGGTTCATTGTTTGACGCAACTCAAACTAAAGTTCTTGA CGTTGACGGTAAACAATTGGTTAAAGTTGTATCATGGTACGACAACGAAATGTCATACACTGCACAACT TGTTCGTACTCTTGGAATACTTCGCAAAAATTGC SP035 amino acid (SEQ ID NO:54) VVKVGINGFGRIGRLAFRRIQNVEGVEVTRINDLTDPVMLAHLLKYDTTQGRFDGTVEVKEGGFEVNGK FIKVSAERDPEQIDWATDGVEIVLEATGFFAKKEAAEKHLKGGAKKVVITAPGGNDVKTVVFNTNHDVL DGTETVISGASCTTNCLAPMAKALQDNFGVVEGLMTTIHAYTGDQMILDGPHRGGDLRRARAGAANIVP NSTGAAKAIGLVIPELNGKLDGSAQRVPTPTGSVTELVAVLEKNVTVDEVNAAMKAASNESYGYTEDPI VSSDIVGMSYGSLFDATQTKVLDVDGKQLVKVVSWYDNEMSYTAQLVRTLGILRKNC SP036 nucleotide (SEQ ID NO:55) TTCTTACGAGTTGGGACTGTATCAAGCTAGAACGGTTAAGGAAAATAATCGTGTTTCCTATATAGATGG AAAACAAGCGACGCAAAAAACGGAGAATTTGACTCCTGATGAGGTTAGCAAGCGTGAAGGAATCAATGC TGAGCAAATCGTCATCAAGATAACAGACCAAGGCTATGTCACTTCACATGGCGACCACTATCATTATTA 
CAATGGTAAGGTTCCTTATGACGCTATCATCAGTGAAGAATTACTCATGAAAGATCCAAACTATAAGCT AAAAGATGAGGATATTGTTAATGAGGTCAAGGGTGGATATGTTATCAAGGTAGATGGAAAATACTATGT TTACCTTAAGGATGCTGCCCACGCGGATAACGTCCGTACAAAAGAGGAAATCAATCGACAAAAACAAGA GCATAGTCAACATCGTGAAGGTGGAACTCCAAGAAACGATGGTGCTGTTGCCTTGGCACGTTCGCAAGG ACGCTATACTACAGATGATGGTTATATCTTTAATGCTTCTGATATCATAGAGGATACTGGTGATGCTTA TATCGTTCCTCATGGAGATCATTACCATTACATTCCTAAGAATGAGTTATCAGCTAGCGAGTTGGCTGC TGCAGAAGCCTTCCTATCTGGTCGAGGAAATCTGTCAAATTCAAGAACCTATCGCCGACAAAATAGCGA TAACACTTCAAGAACAAACTGGGTACCTTCTGTAAGCAATCCAGGAACTACAAATACTAACACAAGCAA CAACAGCAACACTAACAGTCAAGCAAGTCAAAGTAATGACATTGATAGTCTCTTGAAACAGCTCTACAA ACTGCCTTTGAGTCAACGACATGTAGAATCTGATGGCCTTGTCTTTGATCCAGCACAAATCACAAGTCG AACAGCTAGAGGTGTTGCAGTGCCACACGGAGATCATTACCACTTCATCCCTTACTCTCAAATGTCTGA ATTGGAAGAACGAATCGCTCGTATTATTCCCCTTCGTTATCGTTCAAACCATTGGGTACCAGATTCAAG GCCAGAACAACCAAGTCCACAACCGACTCCGGAACCTAGTCCAGGCCCGCAACCTGCACCAAATCTTAA AATAGACTCAAATTCTTCTTTGGTTAGTCAGCTGGTACGAAAAGTTGGGGAAGGATATGTATTCGAAGA AAAGGGCATCTCTCGTTATGTCTTTGCGAAAGATTTACCATCTGAAACTGTTAAAAATCTTGAAAGCAA GTTATCAAAACAAGAGAGTGTTTCACACACTTTAACTGCTAAAAAAGAAAATGTTGCTCCTCGTGACCA AGAATTTTATGATAAAGCATATAATCTGTTAACTGAGGCTCATAAAGCCTTGTTTGNAAATAAGGGTCG TAATTCTGATTTCCAAGCCTTAGACAAATTATTAGAACGCTTGAATGATGAATCGACTAATAAAGAAAA ATTGGTAGATGATTTATTGGCATTCCTAGCACCAATTACCCATCCAGAGCGACTTGGCAAACCAAATTC TCAAATTGAGTATACTGAAGACGAAGTTCGTATTGCTCAATTAGCTGATAAGTATACAACGTCAGATGG TTACATTTTTGATGAACATGATATAATCAGTGATGAAGGAGATGCATATGTAACGCCTCATATGGGCCA TAGTCACTGGATTGGAAAAGATAGCCTTTCTGATAAGGAAAAAGTTGCAGCTCAAGCCTATACTAAAGA AAAAGGTATCCTACCTCCATCTCCAGACGCAGATGTTAAAGCAAATCCAACTGGAGATAGTGCAGCAGC TATTTACAATCGTGTGAAAGGGGAAAAACGAATTCCACTCGTTCGACTTCCATATATGGTTGAGCATAC AGTTGAGGTTAAAAACGGTAATTTGATTATTCCTCATAAGGATCATTACCATAATATTAAATTTGCTTG GTTTGATGATCACACATACAAAGCTCCAAATGGCTATACCTTGGAAGATTTGTTTGCGACGATTAAGTA CTACGTAGAACACCCTGACGAACGTCCACATTCTAATGATGGATGGGGCAATGCCAGTGAGCATGTGTT AGGCAAGAAAGACCACAGTGAAGATCCAAATAAGAACTTCAAAGCGGATGAAGAGCCAGTAGAGGAAAC 
ACCTGCTGAGCCAGAAGTCCCTCAAGTAGAGACTGAAAAAGTAGAAGCCCAACTCAAAGAAGCAGAAGT TTTGCTTGCGAAAGTAACGGATTCTAGTCTGAAAGCCAATGCAACAGAAACTCTAGCTGGTTTACGAAA TAATTTGACTCTTCAAATTATGGATAACAATAGTATCATGGCAGAAGCAGAAAAATTACTTGCGTTGTT AAAAGGAAGTAATCCTTCATCTGTAAGTAAGGAAAAAATAAAC SP036 amino acid (SEQ ID NO:56) SYELGLYQARTVKENNRVSYIDGKQATQKTENLTPDEVSKREGINAEQIVIKITDQGYVTSHGDHYHYY NGKVPYDAIISEELLMKDPNYKLKDEDIVNEVKGGYVIKVDGKYYVYLKDAAHADMVRTKEEINRQKQE HSQHREGGTPRNDGAVALARSQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAA AEAFLSGRGNLSNSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYK LPLSQRHVESDGLVFDPAQITSRTARGVAVPHGDHYHFIPYSQMSELEERIARIIPLRYRSNHWVPDSR PEQPSPQPTPEPSPGPQPAPNLKIDSNSSLVSQLVRKVGEGYVFEEKGISRYVFAKDLPSETVKNLESK LSKQESVSHTLTAKKENVAPRDQEFYDKAYNLLTEAHKALFXNKGRNSDFQALDKLLERLNDESTNKEK LVDDLLAFLAPITHPERLGKPNSQIEYTEDEVRIAQLADKYTTSDGYIFDEHDIISDEGDAYVTPHMGH SHWIGKDSLSDKEKVAAQAYTKEKGILPPSPDADVKANPTGDSAAAIYNRVKGEKRIPLVRLPYHVEHT VEVKNGNLIIPHKDHYHNIKFAWFDDHTYKAPNGYTLEDLFATIKYYVEHPDERPHSNDGWGNASEHVL GKKDHSEDPNKNFKADEEPVEETPAEPEVPQVETEKVEAQLKEAEVLLAKVTDSSLKANATETLAGLRN NLTLQIMDNNSIMAEAEKLLALLKGSNPSSVSKEKIN SP038 nucleotide (SEQ ID NO:57) TACTGAGATGCATCATAATCTAGGAGCTGAAAAGCGTTCAGCAGTGGCTACTACTATCGATAGTTTTAA GGAGCGAAGTCAAAAAGTCAGAGCACTATCTGATCCAAATGTGCGTTTTGTTCCCTTCTTTGGCTCTAG TGAATGGCTTCGTTTTGACGGTGCTCATTCTGCGGTATTAGCTGAGAAATACAATCGTTCCTACCGTCC TTATCTTTTAGGACAGGGGGGAGCTGCATCGCTTAACCAATATTTTGGAATGCAACAGATGTTACCACA GCTGGAGAATAAACAAGTTGTGTATGTTATCTCACCTCAGTGGTTCAGTAAAAATGGCTATGATCCAGC AGCCTTCCAGCAGTATTTTAATGGAGACCAGTTGACTAGTTTTCTGAAACATCAATCTGGGGATCAGGC TAGTCAATATGCAGCGACTCGCTTACTGCAACAGTTCCCAAACGTAGCTATGAAGGACCTGGTTCAGAA GTTGGCAAGTAAAGAAGAATTGTCGACAGCAGACAATGAAATGATTGAATTATTGGCTCGTTTTAATGA ACGCCAAGCTTCCTTTTTTGGTCAGTTTTCGGTTAGAGGCTATGTTAACTACGATAAGCATGTAGCTAA GTATTTAAAAATCTTGCCAGACCAGTTTTCTTATCAGGCAATAGAAGATGTTGTCAAAGCAGATGCTGA AAAAAATACTTCCAATAATGAGATGGGAATGGAAAATTATTTCTATAATGAGCAGATCAAGAAGGATTT GAAGAAATTAAAGGATTCTCAGAAAAGCTTTACCTATCTCAAGTCGCCAGAGTATAATGNNTTGCAGTT 
GGTTTTAACACAGTTTTCTAAATCTAAGGTAAACCCGATTTTTATCATTCCACCTGTTAATAAAAAATG GATGNACTATGCTGGTCTACGAGAGGATATGTACCAACAAACGGTGCAGAAGATTCGCTACCAGTTAGA AAGTCAAGGTTTTACCAATATAGCAGATTTTTCTAAGGACGGCGGGGAGCCTTTCTTTATGAAGGACAC CATTCACCTTGGTTGGTTGGGTTGGTTGGCTTTTGACAAGGCAGTTGATCCTTTCCTATCCAATCCCAC ACCAGCTCCGACTTACCATCTGAATGAGCGCTTTTTCAGCAAAGATTGGGCGACTTATGATGGAGATGT CAAAGAA SP038 amino acid (SEQ ID NO:58) TEMHHNLGAEKRSAVATTIDSFKERSQKVRALSDPNVRFVPFFGSSEWLRFDGAHSAVLAEKYNRSYRP YLLGQGGAASLNQYFGMQQMLPQLENKQVVYVISPQWFSKNGYDPAAFQQYFNGDQLTSFLKHQSGDQA SQYAATRLLQQFPNVAMKDLVQKLASKEELSTADNEMIELLARFNERQASFFGQFSVRGYVNYDKHVAK YLKILPDQFSYQAIEDVVKADAEKNTSNNEMGMENYFYNEQIKKDLKKLKDSQKSFTYLKSPEYNXLQL VLTQFSKSKVNPIFIIPPVNKKWMXYAGLREDMYQQTVQKIRYQLESQGFTNIADFSKDGGEPFFMKDT IHLGWLGWLAFDKAVDPFLSNPTPAPTYHLNERFFSKDWATYDGDVKE SP039 nucleotide (SEQ ID NO:59) GGTTTTGAGAAAGTATTTGCAGGGGGCCCTGATTGAGTCGATTGAGCAAGTGGAAAATGACCGTATTGT GGAAATTACAGTTTCCAATAAAAACGAGATTGGAGACCATATCCAGGCTACCTTGATTATCGAAATTAT GGGGAAACACAGTAATATTCTACTGGTCGATAAAAGCAGTCATAAAATCCTCGAAGTTATCAAACACGT CGGCTTTTCACAAAATAGCTACCGCACCTTACTTCCAGGATCGACCTATATCGCTCCGCCAAGTACAAA ATCTCTCAATCCTTTTACTATCAAGGATGAAAAGCTCTTTGAAATCCTGCAAACCCAAGAACTAACAGC AAAAAATCTTCAAAGCCTCTTTCAAGGTCTGGGACGCGATACGGCAAATGAATTGGAAAGGATACTGGT TAGTGAAAAACTTTCCGCTTTCCGAAATTTTTTCAATCAAGAAACCAAGCCATGCTTGACTGAGACTTC CTTCAGTCCAGTTCCTTTTGCAAATCAGGTGGGAGAGCCTTTTGCAAATCTTTCTGATTTGTTGGACAC CTACTATAAGGATAAGGCTGAGCGCGACCGCGTCAAACAGCAGGCCAGTGAACTGATTCGTCGTGTTGA AAATGAACTTCAGAAAAACCGACACAAACTCAAAAAACAGGAAAAAGAGTTACTGGCGACAGACAACGC TGAAGAATTTCGTCAAAAAGGAGAATTGCTGACAACCTTCCTCCACCAAGTGCCTAACGACCAAGACCA GGTTATCCTAGACAACTACTATACCAACCAACCTATCATGATTGCGCTTGATAAGGCTCTGACTCCCAA CCAGAATGCCCAACGCTATTTTAAACGGTATCAGAAACTCAAAGAAGCTGTCAAATACTTGACTGATTT GATTGAAGAAACCAAAGCCACTATTCTCTATCTGGAAAGTGTAGAAACCGTCCTCAACCAAGCTGGACT GGAAGAAATCGCTGAAATCCGTGAAGAATTGATTCAAACAGGTTTTATCCGCAGAAGACAACGGGAGAA AATCCAGAAACGCAAAAAACTAGAACAATATCTAGCAAGCGATGGCAAAACCATCATCTATGTCGGACG 
AAACAATCTTCAAAATGAGGAATTGACCTTTAAAATGGCCCGCAAGGAGGAACTTTGGTTCCATGCTAA GGACATTCCTGGAAGCCATGTTGTCATCTCAGGAAATCTTGACCCATCTGATGCAGTCAAGACAGACGC AGCAGAGTTAGCTGCCTACTTCTCTCAAGGGCGCCTGTCGAATCTGGTGCAGGTAGATATGATTGAAGT CAAAAACTCAATAAACCAACTGGTGGAAAACCCGGCTTTGTCACTTACACAGGACAAAAGACCCTCCG CGTCACACCAGACTCCAAAAAAATTGCATCCATGAAAAAATCC SP039 amino acid (SEQ ID NO:60) VLRKYLQGALIESIEQVENDRIVEITVSNKNEIGDHIQATLIIEIMGKHSNILLVDKSSHKILEVIKHV GFSQNSYRTLLPGSTYIAPPSTKSLNPFTIKDEKLFEILQTQELTAKNLQSLFQGLGRDTANELERILV SEKLSAFRNFFNQETKPCLTETSFSPVPFANQVGEPFANLSDLLDTYYKDKAERDRVKQQASELIRRVE NELQKNRHKLKKQEKELLATDNAEEFRQKGELLTTFLHQVPNDQDQVILDNYYTNQPIMIALDKALTPN QNAQRYFKRYQKLKEAVKYLTDLIEETKATILYLESVETVLNQAGLEEIAEIREELIQTGFIRRRQREK IQKRKKLEQYLASDGKTIIYVGRNNLQNEELTFKNARKEELWFHAKDIPGSHVVISGNLDPSDAVKTDA AELAAYFSQGRLSNLVQVDMIEVKKLNKPTGGKPGFVTYTGQKTLRVTPDSKKIASMKKS SP040 nucleotide (SEQ ID NO:61) GACAACATTTACTATCCATACAGTAGAGTCAGCACCAGCAGAAGTGAAAGAAATTCTTGAAACAGTAGA AAAAGACAACAATGGCTATATTCCCAACCTAATCGGTCTCTTGGCCAATGCCCCGACTGTTTTAGAAGC CTACCAAATTGTCTCATCTATCCACCGTCGCAACAGCCTGACACCCGTTGAGCGTGAAGTGGTGCAAAT CACGGCAGCCGTGACCAATGGTTGTGCCTTCTGTGTCGCAGGTCACACAGCCTTTTCCATCAAACAAAT CCAGATGAATGATGACTTGATTCAAGCTCTTCGCAATCGTACTCCAATTGAAACAGATCCTAAATTGGA TACCCTAGCTAAGTTTACCTTGGCAGTTATCAATACCAAGGGTCGTGTAGGAGATGAAGCCTTGTCTGA GTTTTTAGAAGCTGGCTACACTCAACAAAATGCCTTGGATGTGGTTTTTGGTGTCAGCCTAGCAATCCT CTGTAACTATGCCAACAACTTAGCTAATACACCAATTAATCCAGAATTGCAACCTTATGCC SP040 amino acid (SEQ ID NO:62) TTFTIHTVESAPAEVKEILETVEKDNNGYIPNLIGLLANAPTVLEAYQIVSSIHRRNSLTPVEREVVQI TAAVTNGCAFCVAGHTAFSIKQIQMNDDLIQALRNRTPIETDPKLDTLAKFTLAVINTKGRVGDEALSE FLEAGYTQQNALDVVFGVSLAILCNYANNLANTPINPELQPYA SP041 nucleotide (SEQ ID NO:63) GGCTAAGGAAAGAGTGGATGTACTAGCTTATAAACAGGGGTTGTTTGAAACGAGAGAGCAGGCCAAGCG AGGTGTGATGGCTGGCCTAGTCGTAGCAGTCCTTAATGGAGAACGGTTTGACAAGCCAGGAGAGAAAAT TCCAGATGACACCGAATTAAAACTCAAGGGGGAGAAACTCAAGTATGTCAGCCGTGGTGGTTTGAAACT GGAAAAGGCCTTGCAGGTCTTTGATTTGTCGGTGGATGGCGCGACTACGATTGATATCGGGGCCTCTAC 
TGGAGGTTTTACCGATGTCATGCTACAGAATAGTGCCAAGTTGGTCTTTGCAGTCGATGTTGGTACCAA TCAGTTGGCTTGGAAATTACGCCAAGACCCACGAGTTGTCAGCATGGAGCAGTTCAATTTCCGCTATGC TGAAAAGACTGATTTCGAGCAGGAGCCGAGCTTTGCCAGTATTGATGTGAGTTTCATTTCCCTTAGTCT GATTTTGCCAGCCTTGCACCGTGTCTTGGCTGATCAAGGTCAGGTGGTAGCACTTGTCAAACCTCAGTT TGAGGCAGGACGTGAGCAGATTGGGAAAAATGGAATTATTCGAGATGCTAAGGTTCATCAGAATGTCCT TGAATCTGTAACAGCTATGGCAGTAGAGGTAGGTTTTTCAGTCCTTGGCTTGGACTTTTCTCCCATCCA AGGTGGACATGGAAATATTGAATTTTTAGCGTATTTGAAAAAAGAAAAGTCAGCAAGCAATCAGATTCT TGCTGAGATTAAAGAAGCAGTAGAGAGGGCGCATAGTCAATTTAAAAATGAA SP041 amino acid (SEQ ID NO:64) AKERVDVLAYKQGLFETREQAKRGVMAGLVVAVLNGERFDKPGEKIPDDTELKLKGEKLKYVSRGGLKL EKALQVFDLSVDGATTIDIGASTGGFTDVMLQNSAKLVFAVDVGTNQLAWKLRQDPRVVSMEQFNFRYA EKTDFEQEPSFASIDVSFISLSLILPALHRVLADQGQVVALVKPQFEAGREQIGKNGIIRDAKVHQNVL ESVTAMAVEVGFSVLGLDFSPIQGGHGNIEFLAYLKKEKSASNQILAEIKEAVERAHSQFKNE SP042 nucleotide (SEQ ID NO:65) TTGTTCCTATGAACTTGGTCGTCACCAAGCTGGTCAGGTTAAGAAAGAGTCTAATCGAGTTTCTTATAT AGATGGTGATCAGGCTGGTCAAAAGGCAGAAAACTTGACACCAGATGAAGTCAGTAAGAGGGAGGGGAT CAACGCCGAACAAATNGTNATCAAGATTACGGATCAAGGTTATGTGACCTCTCATGGAGACCATTATCA TTACTATAATGGCAAGGTTCCTTATGATGCCATCATCAGTGAAGAGCTCCTCATGAAAGATCCGAATTA TCAGTTGAAGGATTCAGACATTGTCAATGAAATCAAGGGTGGTTATGTCATTAAGGTAAACGGTAAATA CTATGTNTACCTTAAGGATGCAGCTCATGCGGATAATATTCGGACAAAAGAAGAGATTAAACGTCAGAA GCAGGAACGCAGTCATAATCATAACTCAAGAGCAGATAATGCTGTTGCTGCAGCCAGAGCCCAAGGACG TTATACAACGGATGATGGGTATATCTTCAATGCATCTGATATCATTGAGGACACGGGTGATGCTTATAT CGTTCCTCACGGCGACCATTACCATTACATTCCTAAGAATGAGTTATCAGCTAGCGAGTTAGCTGCTGC AGAAGCCTATTGGAATGGGAAGCAGGGATCTCGTCCTTCTTCAAGTTCTAGTTATAATGCAAATCCAGC TCAACCAAGATTGTCAGAGAACCACAATCTGACTGTCACTCCAACTTATCATCAAAATCAAGGGGAAAA CATTTCAAGCCTTTTACGTGAATTGTATGCTAAACCCTTATCAGAACGCCATGTGGAATCTGATGGCCT TATTTTCGACCCAGCGCAAATCACAAGTCGAACCGCCAGAGGTGTAGCTGTCCCTCATGGTAACCATTA CCACTTTATCCCTTATGAACAAATGTCTGAATTGGAAAAACGAATTGCTCGTATTATTCCCCTTCGTTA TCGTTCAAACCATTGGGTACCAGATTCAAGACCAGAACAACCAAGTCCACAATCGACTCCGGAACCTAG 
TCCAAGTCCGCAACCTGCACCAAATCCTCAACCAGCTCCAAGCAATCCAATTGATGAGAAATTGGTCAA AGAAGCTGTTCGAAAAGTAGGCGATGGTTATGTCTTTGAGGAGAATGGAGTTTCTCGTTATATCCCAGC CAAGGATCTTTCAGCAGAAACAGCAGCAGGCATTGATAGCAAACTGGCCAAGCAGGAAAGTTTATCTCA TAAGCTAGGAGCTAAGAAAACTGACCTCCCATCTAGTGATCGAGAATTTTACAATAAGGCTTATGACTT ACTAGCAAGAATTCACCAAGATTTACTTGATAATAAAGGTCGACAAGTTGATTTTGAGGCTTTGGATAA CCTGTTGGAACGACTCAAGGATGTCNCAAGTGATAAAGTCAAGTTAGTGGANGATATTCTTGCCTTCTT AGCTCCGATTCGTCATCCAGAACGTTTAGGAAAACCAAATGCGCAAATTACCTACACTGATGATGAGAT TCAAGTAGCCAAGTTGGCAGGCAAGTACACAACAGAAGACGGTTATATCTTTGATCCTCGTGATATAAC CAGTGATGAGGGGGATGCCTATGTAACTCCACATATGACCCATAGCCACTGGATTAAAAAAGATAGTTT GTCTGAAGCTGAGAGAGCGGCAGCCCAGGCTTATGCTAAAGAGAAAGGTTTGACCCCTCCTTCGACAGA CCATCAGGATTCAGGAAATACTGAGGCAAAAGGAGCAGAAGCTATCTACAACCGCGTGAAAGCAGCTAA GAAGGTGCCACTTGATCGTATGCCTTACAATCTTCAATATACTGTAGAAGTCAAAAACGGTAGTTTAAT
CATACCTCATTATGACCATTACCATAACATCAAATTTGAGTGGTTTGACGAAGGCCTTTATGAGGCACC TAAGGGGTATACTCTTGAGGATCTTTTGGCGACTGTCAAGTACTATGTCGAACATCCAAACGAACGTCC GCATTCAGATAATGGTTTTGGTAACGCTAGCGACCATGTTCAAAGAAACAAAAATGGTCAAGCTGATAC CAATCAAACGGAAAAACCAAGCGAGGAGAAACCTCAGACAGAAAAACCTGAGGAAGAAACCCCTCGAGA AGAGAAACCGCAAAGCGAGAAACCAGAGTCTCCAAAACCAACAGAGGAACCAGAAGAATCACCAGAGGA ATCAGAAGAACCTCAGGTCGAGACTGAAAAGGTTGAAGAAAAACTGAGAGAGGCTGAAGATTTACTTGG AAAAATCCAGGAT SP042 amino acid (SEQ ID NO:66) CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQXVIKITDQGYVTSHGDHYH YYNGKVPYDAIISEELLMKDPNYQLKDSDIVNEIKGGYVIKVNGKYYVYLKDAAHADNIRTKEEIKRQK QERSHNHNSRADNAVAAARAQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAA EAYWNGKQGSRPSSSSSYNANPAQPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGL IFDPAQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPS PSPQPAPNPQPAPSNPIDEKLVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSKLAKQESLSH KLGAKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERLKDVXSDKVKLVXDILAFL APIRHPERLGKPNAQITYTDDEIQVAKLAGKYTTEDGYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSL SEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI IPHYDHYHNIKFEWFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADT NQTEKPSEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEESPEESEEPQVETEKVEEKLREAEDLLG KIQD SP043 nucleotide (SEQ ID NO:67) TTATAAGGGTGAATTAGAAAAAGGATACCAATTTGATGGTTGGGAAATTTCTGGTTTCGAAGGTAAAAA AGACGCTGGCTATGTTATTAATCTATCAAAAGATACCTTTATAAAACCTGTATTCAAGAAAATAGAGGA GAAAAAGGAGGAAGAAAATAAACCTACTTTTGATGTATCGAAAAAGAAAGATAACCCACAAGTAAACCA TAGTCAATTAAATGAAAGTCACAGAAAAGAGGATTTACAAAGAGAAGAGCATTCACAAAAATCTGATTC AACTAAGGATGTTACAGCTACAGTTCTTGATAAAAACAATATCAGTAGTAAATCAACTACTAACAATCC TAATAAG SP043 amino acid (SEQ ID NO:68) YKGELEKGYQFDGWEISGFEGKKDAGYVINLSKDTFIKPVFKKIEEKKEEENKPTFDVSKKKDNPQVNH SQLNESHRKEDLQREEHSQKSDSTKDVTATVLDKNNISSKSTTNNPNK SP044 nucleotide (SEQ ID NO:69) GAATGTTCAGGCTCAAGAAAGTTCAGGAAATAAAATCCACTTTATCAATGTTCAAGAAGGTGGCAGTGA TGCGATTATTCTTGAAAGCAATGGACATTTTGCCATGGTGGATACAGGAGAAGATTATGATTTCCCAGA 
TGGAAGTGATTCTCGCTATCCATGGAGAGAAGGAATTGAAACGTCTTATAAGCATGTTCTAACAGACCG TGTCTTTCGTCGTTTGAAGGAATTGGGTGTCCAAAAACTTGATTTTATTTTGGTGACCCATACCCACAG TGATCATATTGGAAATGTTGATGAATTACTGTCTACCTATCCAGTTGACCGAGTCTATCTTAAGAAATA TAGTGATAGTCGTATTACTAATTCTGAACGTCTATGGGATAATCTGTATGGCTATGATAAGGTTTTACA GACTGCTGCAGAAAAAGGTGTTTCAGTTATTCAAAATATCACACAAGGGGATGCTCATTTTCAGTTTGG GGACATGGATATTCAGCTCTATAATTATGAAAATGAAACTGATTCATCGGGTGAATTAAAGAAAATTTG GGATGACAATTCCAATTCCTTGATTAGCGTGGTGAAAGTCAATGGCAAGAAAATTTACCTTGGGGGCGA TTTAGATAATGTTCATGGAGCAGAAGACAAGTATGGTCCTCTCATTGGAAAAGTTGATTTGATGAAGTT TAATCATCACCATGATACCAACAAATCAAATACCAAGGATTTCATTAAAAATTTGAGTCCGAGTTTGAT TGTTCAAACTTCGGATAGTCTACCTTGGAAAAATGGTGTTGATAGTGAGTATGTTAATTGGCTCAAAGA ACGAGGAATTGAGAGAATCAACGCAGCCAGCAAAGACTATGATGCAACAGTTTTTGATATTCGAAAAGA CGGTTTTGTCAATATTTCAACATCCTACAAGCCGATTCCAAGTTTTCAAGCTGGTTGGCATAAGAGTGC ATATGGGAACTGGTGGTATCAAGCGCCTGATTCTACAGGAGAGTATGCTGTCGGTTGGAATGAAATCGA AGGTGAATGGTATTACTTTAACCAAACGGGTATCTTGTTACAGAATCAATGGAAAAAATGGAACAATCA TTGGTTCTATTTGACAGACTCTGGTGCTTCTGCTAAAAATTGGAAGAAAATCGCTGGAATCTGGTATTA TTTTAACAAAGAAAACCAGATGGAAATTGGTTGGATTCAAGATAAAGAGCAGTGGTATTATTTGGATGT TGATGGTTCTATGAAGACAGGATGGCTTCAATATATGGGGCAATGGTATTACTTTGCTCCATCAGGGGA SP044 amino acid (SEQ ID NO:70) NVQAQESSGNKIHFINVQEGGSDAIILESNGHFAMVDTGEDYDFPDGSDSRYPWREGIETSYKHVLTDR VFRRLKELGVQKLDFILVTHTHSDHIGNVDELLSTYPVDRVYLKKYSDSRITNSERLWDNLYGYDKVLQ TAAEKGVSVIQNITQGDAHFQFGDMDIQLYNYENETDSSGELKKIWDDNSNSLISVVKVNGKKIYLGGD LDNVHGAEDKYGPLIGKVDLMKFNHHHDTNKSNTKDFIKNLSPSLIVQTSDSLPWKNGVDSEYVNWLKE RGIERINAASKDYDATVFDIRKDGFVNISTSYKPIPSFQAGWHKSAYGNWWYQAPDSTGEYAVGWNEIE GEWYYFNQTGILLQNQWKKWNNHWFYLTDSGASAKNWKKIAGIWYYFNKENQMEIGWIQDKEQWYYLDV DGSMKTGWLQYMGQWYYFAPSGE SP045 nucleotide (SEQ ID NO:71) CTTGGGTGTAACCCATATCCAGCTCCTTCCAGTCTTGTCTTACTACTTTGTCAATGAATTGAAAAACCA TGAACGCTTGTCTGACTACGCTTCAAGCAACAGCAACTACAACTGGGGATATGACCCTCAAAACTACTT CTCCTTGACTGGTATGTACTCAAGCGATCCTAAGAATCCAGAAAAACGAATCGCAGAATTTAAAAACCT CATCAACGAAATCCACAAACGTGGTATGGGAGCTATCCTAGATGTCGTTTATAACCACACAGCCAAAGT 
CGATCTCTTTGAAGATTTGGAACCAAACTACTACCACTTTATGGATGCCGATGGCACACCTCGAACTAG CTTTGGTGGTGGACGCTTGGGGACAACCCACCATATGACCAAACGGCTCCTAATTGACTCTATCAAATA CCTAGTTGATACCTACAAAGTGGATGGCTTCCGTTTCGATATGATGGGAGACCATGACGCCGCTTCTAT CGAAGAAGCTTACAAGGCTGCACGCGCCCTCAATCCAAACCTCATCATGCTTGGTGAAGGTTGGAGAAC CTATGCCGGTGATGAAAACATGCCTACTAAAGCTGCTGACCAAGATTGGATGAAACATACCGATACTGT CGCTGTCTTTTCAGATGACATCCGTAACAACCTCAAATCTGGTTATCCAAACGAAGGTCAACCTGCCTT TATCACAGGTGGCAAGCGTGATGTCAACACCATCTTTAAAAATCTCATTGCTCAACCAACTAACTTTGA AGCTGACAGCCCTGGAGATGTCATCCAATACATCGCAGCCCATGATAACTTGACCCTCTTTGACATCAT TGCCCAGTCTATCAAAAAAGACCCAAGCAAGGCTGAGAACTATGCTGAAATCCACCGTCGTTTACGACT TGGAAATCTCATGGTCTTGACAGCTCAAGGAACTCCATTTATCCACTCCGGTCAGGAATATGGACGTAC TAAACAATTCCGTGACCCAGCCTACAAGACTCCAGTAGCAGAGGATAAGGTTCCAAACAAATCTCACTT GTTGCGTGATAAGGACGGCAACCCATTTGACTATCCTTACTTCATCCATGACTCTTACGATTCTAGTGA TGCAGTCAACAAGTTTGACTGGACTAAGGCTACAGATGGTAAAGCTTATCCTGAAAATGTCAAGAGCCG TGACTATATGAAAGGTTTGATTGCCCTTCGTCAATCTACAGATGCCTTCCGACTTAAGAGTCTTCAAGA TATCAAAGACCGTGTCCACCTCATCACTGTCCCAGGCCAAAATGGTGTGGAAAAAGAGGATGTAGTGAT TGGCTACCAAATCACTGCTCCAAACGGCGATATCTACGCAGTCTTTGTCAATGCGGATGAAAAAGCTCG CGAATTTAATTTGGGAACTGCCTTTGCACATCTAAGAAATGCGGAAGTTTTGGCAGATGAAAACCAAGC AGGACCAGTCGGAATTGCCAACCCGAAAGGACTTGAATGGACTGAAAAAGGCTTGAAATTGAATGCCCT TACAGCTACTGTTCTTCGAGTCTCTCAAAATGGAACTAGCCATGAGTCAACTGCAGAAGAGAAACCAGA CTCAACCCCTTCCAAGCCTGAACATCAAAATGAAGCTTCTCACCCTGCACATCAAGACCCAGCTCCAGA AGCTAGACCTGATTCTACTAAACCAGATGCCAAAGTAGCTGATGCGGAAAATAAACCTAGCCAAGCTAC AGCTGATTCACAAGCTGAACAACCAGCACAAGAAGCACAAGCATCATCTGTAAAAGAAGCGGTTCGAAA CGAATCGGTAGAAAACTCTAGCAAGGAAAATATACCTGCAACCCCAGATAAACAAGCTGAA SP045 amino acid (SEQ ID NO:72) LGVTHIQLLPVLSYYFVNELKNHERLSDYASSNSNYNWGYDPQNYFSLTGMYSSDPKNPEKRIAEFKNL INEIHKRGMGAILDVVYNHTAKVDLFEDLEPNYYHFMDADGTPRTSFGGGRLGTTHHMTKRLLIDSIKY LVDTYKVDGFRFDMMGDHDAASIEEAYKAARALNPNLIMLGEGWRTYAGDENMPTKAADQDWMKHTDTV AVFSDDIRNNLKSGYPNEGQPAFITGGKRDVNTIFKNLIAQPTNFEADSPGDVIQYIAAHDNLTLFDII AQSIKKDPSKAENYAEIHRRLRLGNLMVLTAQGTPFIHSGQEYGRTKQFRDPAYKTPVAEDKVPNKSHL 
LRDKDGNPFDYPYFIHDSYDSSDAVNKFDWTKATDGKAYPENVKSRDYMKGLIALRQSTDAFRLKSLQD IKDRVHLITVPGQNGVEKEDVVIGYQITAPNGDIYAVFVNADEKAREFNLGTAFAHLRNAEVLADENQA GPVGIANPKGLEWTEKGLKLNALTATVLRVSQNGTSHESTAEEKPDSTPSKPEHQNEASHPAHQDPAPE ARPDSTKPDAKVADAENKPSQATADSQAEQPAQEAQASSVKEAVRNESVENSSKENIPATPDKQAE SP046 nucleotide (SEQ ID NO:73) TAGTGATGGTACTTGGCAAGGAAAACAGTATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGT TTTNGATACTCATTATCAATCTTGGTTCTATATAAAAGCAGATGCTAACTATGCTGAAAATGAATGGCT AAAGCAAGGTGACGACTATTTTTACCTCAAATCTGGTGGCTATATGGCCAAATCAGAATGGGTAGAAGA CAAGGGAGCCTTTTATTATCTTGACCAAGATGGAAAGATGAAAAGAAATGCTTGGGTAGGAACTTCCTA TGTTGGTGCAACAGGTGCCAAAGTAATAGAAGACTGGGTCTATGATTCTCAATACGATGCTTGGTTTTA TATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTCCAAATTAAAGGGAAGGACTATTATTTCAA ATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTATGTGAATGCTAGTGGTGCCAAAGT ACAGCAAGGTTGGCTTTTTGACAAACAATACCAATCTTGGTTTTACATCAAAGAAAATGGAAACTATGC TGATAAAGAATGGATTTTCGAGAATGGTCACTATTATTATCTAAAATCCGGTGGCTACATGGCAGCCAA TGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATTTGATGGGAAAATGGCTGAAAAAGAATG GGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACATGACAGCCAATGAATG GATTTGGGATAAGGAATCTTGGTTTTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTA CGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCTGGTGGCTACATGGCGAAAAATGAGACAGTAGA TGGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGAGGAAAAACTACAAATGAAAATGCTGCTTACTA TCAAGTAGTGCCTGTTACAGCCAATGTTTATGATTCAGATGGTGAAAAGCTTTCCTATATATCGCAAGG TAGTGTCGTATGGCTAGATAAGGATAGAAAAAGTGATGACAAGCGCTTGGCTATTACTATTTCTGGTTT GTCAGGCTATATGAAAACAGAAGATTTACAAGCGCTAGATGCTAGTAAGGACTTTATCCCTTATTATGA GAGTGATGGCCACCGTTTTTATCACTATGTGGCTCAGAATGCTAGTATCCCAGTAGCTTCTCATCTTTC TGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCCTGCATTTTGATGGTTTTAAGCTTGAGAA TCCCTTCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGAAGAATTGGATAAGGTATTTAG TTTGCTAAACATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAAGGAAGCCGAAGAACATTA CCATATCAATGCTCTTTATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAATTGC CAAAGATAAGAATAATTTCTTTGGCATTACAGCCTATGATACGACCCCTTACCTTTCTGCTAAGACATT TGATGATGTGGATAAGGGAATTTTAGGTGCAACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAG 
AACTTTCCTTGGAAACAAGGCTTCTGGTATGAATGTGGAATATGCTTCAGACCCTTATTGGGGCGAAAA AATTGCTAGTGTGATGATGAAAATCAATGAGAAGCTAGGTGGCAAAGAT SP046 amino acid (SEQ ID NO:74) SDGTWQGKQYLKEDGSQAANEWVXDTHYQSWFYIKADANYAENEWLKQGDDYFYLKSGGYMAKSEWVED KGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIKADGQHAEKEWLQIKGKDYYFK SGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLKSGGYMAAN EWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVY DSHSQAWYYFKSGGYMAKNETVDGYQLGSDGKWLGGKTTNENAAYYQVVPVTANNYDSDGEKLSYISQG SVVWLDKDRKSDDKRLAITISGLSGYMKTEDLQALDASKDFIPYYESDGHRFYHYVAQNASIPVASHLS DMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAEELDKVFSLLNINNSLLENKGATFKEAEEHY HINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFDDVDKGILGATKWIKENYIDRGR TFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKD SP048 nucleotide (SEQ ID NO:75) TGGGATTCAATATGTCAGAGATGATACTAGAGATAAAGAAGAGGGAATAGAGTATGATGACGCTGACAA TGGGGATATTATTGTAAAAGTAGCGACTAAACCTAAGGTAGTAACCAAGAAAATTTCAAGTACGCGAAT TCGTTATGAAAAAGATGAAACAAAAGACCGTAGTGAAAATCCTGTTACAATTGATGGAGAGGATGGCTA TGTAACTACGACAAGGACCTACGATGTTAATCCAGAGACTGGTTATGTTACCGAACAGGTTACTGTTGA TAGAAAAGAAGCCACGGATACAGTTATCAAAGTTCCAGCTAAAAGCAAGGTTGAAGAAGTTCTTGTTCC ATTTGCTACTAAATATGAAGCAGACAATGACCTTTCTGCAGGACAGGAGCAAGAGATTACTCTAGGAAA GAATGGGAAAACAGTTACAACGATAACTTATAATGTAGATGGAAAGAGTGGACAAGTAACTGAGAGTAC TTTAAGTCAAAAAAAAGACTCtCAAACAAGAGTTGTTAAAAAAAGaACCArkCCCCAAGTTCTTGTCCA AGAAATTCCAATCGAAACAGAATATCTCGATGGCCCaACTCTTGATAAAAGTCAAGAAGTAGAAGAAGT AGGAGAAATTGGTAAATTACTCTTACTACAATCTATACTGGTAGATGAACGTGATGGAACAATTGAAGA AACTACTTCTCGTCAAATTACTAAAGAGATGGTAAAAAGACGTATAAGGAGAGGGACGAGAGAACCTGA AAAAGTTGTTGTTCCTGAGCAATCATCTATTCCTTCGTATCCTGTATCTGTTACATCTAACCAAGGAAC AGATGTAGCAGTAGAACCAGCTAAAGCAGTTGCTCCAACAACAGACTGGAAACAAGAAAATGGTATGTG GTATTTTTATAATACTGATGGTTCCATGGCAACAGGTTGGGTACAAGTTAATAGTTCATGGTACTACCT CAACAGCAACGGTTCTATGAAAGTCAATCAATGGTTCCAAGTTGGTGGTAAATGGTATTATGTAAATAC ATCGGGTGAGTTAGCGGTCAATACAAGTATAGATGGCTATAGAGTCAATGATAATGGTGAATGGGTGCG T SP048 amino acid (SEQ ID NO:76) 
GIQYVRDDTRDKEEGIEYDDADNGDIIVKVATKPKVVTKKISSTRIRYEKDETKDRSENPVTIDGEDGY VTTTRTYDVNPETGYVTEQVTVDRKEATDTVIKVPAKSKVEEVLVPFATKYEADNDLSAGQEQEITLGK NGKTVTTITYNVDGKSGQVTESTLSQKKDSQTRVVKKRTXPQVLVQEIPIETEYLDGPTLDKSQEVEEV GEIGKLLLLQSILVDERDGTIEETTSRQITKEMVKRRIRRGTREPEKVVVPEQSSIPSYPVSVTSNQGT DVAVEPAKAVAPTTDWKQENGMWYFYNTDGSMATGWVQVNSSWYYLNSNGSMKVNQWFQVGGKWYYVNT SGELAVNTSIDGYRVNDNGEWVR SP049 nucleotide (SEQ ID NO:77) GGATAATAGAGAAGCATTAAAAACCTTTATGACGGGTGAAAATTTTTATCTCCAACATTATCTAGGAGC ACATAGGGAAGAACTAAATGGAGAGCATGGCTATACCTTCCGTGTTTGGGCACCTAATGCTCAGGCTGT TCACTTGGTTGGTGATTTTACCAACTGGATTGAAAATCAGATTCCAATGGTAAGAAATGATTTTGGGGT CTGGGAAGTCTTTACCAATATGGCTCAAGAAGGGCATATTTACAAATATCATGTCACACGTCAAAATGG TCATCAACTGATGAAGATTGACCCTTTTGCTGTCAGGTATGAGGCTCGTCCAGGAACAGGGGCAATCGT AACAGAGCTTCCTGAGAAGAAATGGAAGGATGGACTTTGGCTGCCACGAAGAAAACGTTGGGGCTTTGA AGAGCGTCCTGTCAATATTTATGAAGTTCACGCTGGATCATGGAAAAGAAATTCTGATGGCAGTCCTTA TAGTTTTGCCCAGCTCAAGGATGAACTCATTCCTTATCTCGTTGAAATGAACTATACTCATATTGAGTT TATGCCCTTGATGTCCCATCCTTTGGGCTTGAGTTGGGGGTATCAGCTTATGGGTTACTTCGCTTTAGA GCATGCTTATGGCCGACCAGAGGAGTTTCAAGATTTTGTC SP049 amino acid (SEQ ID NO:78) DNREALKTFMTGENFYLQHYLGAHREELNGEHGYTFRVWAPNAQAVHLVGDFTNWIENQIPMVRNDFGV WEVFTNMAQEGHIYKYHVTRQNGHQLMKIDPFAVRYEARPGTGAIVTELPEKKWKDGLWLARRKRWGFE ERPVNIYEVHAGSWKRNSDGSPYSFAQLKDELIPYLVEMNYTHIEFMPLMSHPLGLSWGYQLMGYFALE HAYGRPEEFQDFV SP050 nucleotide (SEQ ID NO:79) AGATTTTGTCGAGGAGTGTCATACCCATAATATTGGGGTTATTGTGGACTGGGTACCAGNTCACTTTAC CATCAACGATGATGCCTTAGCCTATTATGATGGGACACCGACTTTTGAATACCAAGACCATAATAAGGC TCATAACCATGGTTGGGGTGCCCTTAATTTTGACCTTGGAAAAAATGAAGTCCAGTCCTTCTTAATTTC TTGCATTAAGCATTGGATTGATGTCTATCATTTGGATGGTATTCGTGTGGATGCTGTTAGCAACATGCT CTATTTGGACTATGATGATGCTCCATGGACACCTAATAAAGATGGCGGAAATCTCAACTATGAAGGTTA TTATTTCCTTCAGCGCTTGAATGAGGTTATTAAGTTAGAATATCCAGATGTGATGATGATTGCAGAAGA AAGTTCGTCTGCGATCAAGATTACGGGAATGAAAGAGATTGGTGGTCTAGGATTTGACTACAAATGGAA CATGGGCTGGATGAATGATATCCTCCGTTTCTACGAAGAAGATCCGATCTATCGTAAATATGACTTTAA CCTGGTGACTTTCAGCTTTATGTATGTTTNCAAGGAGAATTATCTCTTGCCATTCTCGCACGATGAAGT 
GGTTCATGGCAAGAAGAGTATGATGCATAAGATGTGGGGAGATCGTTACAATCAATTCGCAGGCTTGCG CAATCTCTATACGTACCAAATTTGTCACCCTGGTAAGAAATTGCTCTTCATGGGTAGCGAATACGGTCA ATTCCTAGAATGGAAATCTGAAGAACAGTTGGAATGGTCTAACCTAGAAGACCCAATGAATGCTAAGAT GAAGTATTTCGCTTCTCAGCTAAACCAGTTTTACAAAGATCATCGCTGTCTGTGGGAAATTGATACCAG CTATGATGGTATTGAAATCATTGATGCGGATAATCGAGACCAGAGTGTTCTTTCCTTTATTCGTAAGGG TAAAAAGGGA SP050 amino acid (SEQ ID NO:80) DFVEECHTHNIGVIVDWVPXHFTINDDALAYYDGTPTFEYQDHNKAHNHGWGALNFDLGKNEVQSFLIS CIKHWIDVYHLDGIRVDAVSNMLYLDYDDAPWTPNKDGGNLMYEGYYFLQRLNEVIKLEYPDVMMIAEE SSSAIKITGMKEIGGLGFDYKWNMGWMNDILRFYEEDPIYRKYDFNLVTFSFMYVXKENYLLPFSHDEV VHGKKSMMHKMWGDRYNQFAGLRNLYTYQICHPGKKLLFMGSEYGQFLEWKSEEQLEWSNLEDPMNAKM KYFASQLNQFYKDHRCLWEIDTSYDGIEIIDADNRDQSVLSFIRKGKKG SP051 nucleotide (SEQ ID NO:81) ATCTGTAGTTTATGCGGATGAAACACTTATTACTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGAT AGTAGAAGAAAAGGCTGATAAAGCTTTGGAAACTAAAAATATAGTTGAAAGGACAGAACAAAGTGAACC TAGTTCAACTGAGGCTATTGCATCTGAGNAGAAAGAAGATGAAGCCGTAACTCCAAAAGAGGAAAAAGT GTCTGCTAAACCGGAAGAAAAAGCTCCAAGGATAGAATCACAAGCTTCAAATCAAGAAAAACCGCTCAA GGAAGATGCTAAAGCTGTAACAAATGAAGAAGTGAATCAAATGATTGAAGACAGGAAAGTGGATTTTAA TCAAAATTGGTACTTTAAACTCAATGCAAATTCTAAGGAAGCCATTAAACCTGATGCAGACGTATCTAC GTGGAAAAAATTAGATTTACCGTATGACTGGAGTATCTTTAACGATTTCGATCATGAATCTCCTGCACA AAATGAAGGTGGACAGCTCAACGGTGGGGAAGCTTGGTATCGCAAGACTTTCAAACTAGATGAAAAAGA CCTCAAGAAAAATGTTCGCCTTACTTTTGATGGCGTCTACATGGATTCTCAAGTTTATGTCAATGGTCA GTTAGTGGGGCATTATCCAAATGGTTATAACCAGTTCTCATATGATATCACCAAATACCTTCAAAAAGA TGGTCGTGAGAATGTGATTGCTGTCCATGCAGTCAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAG TGGTATCTATCGTGATGTGACTTTACAAGTGACAGATAAGGTGCATGTTGAGAAAAATGGGACAACTAT TTTAACACCAAAACTTGAAGAACAACAACATGGCAAGGTTGAAACTCATGTGACCAGCAAAATCGTCAA TACGGACGACAAAGACCATGAACTTGTAGCCGAATATCAAATCGTTGAACGAGGTGGTCATGCTGTAAC AGGCTTAGTTCGTACAGCGAGTCGTACCTTAAAAGCACATGAATCAACAAGCCTAGATGCGATTTTAGA AGTTGAAAGACCAAAACTCTGGACTGTTTTAAATGACAAACCTGCCTTGTACGAATTGATTACGCGTGT TTACCGTGACGGTCAATTGGTTGATGCTAAGAAGGATTTGTTTGGTTACCGTTACTATCACTGGACTCC 
AAATGAAGGTTTCTCTTTGAATGGTGAACGTATTAAATTCCATGGAGTATCCTTGCACCACGACCATGG GGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCGCCGTCTCAAACAAATGAAGGAGATGGGAGT TAACTCCATCCGTACAACCCACAACCCTGCTAGTGAGCAAACCTTGCAAATCGCAGCAGAACTAGGTTT ACTCGTTCAGGAAGAGGCCTTTGATACGTGGTATGGTGGCAAGAAACCTTATGACTATGGACGTTTCTT TGAAAAAGATGCCACTCACCCAGAAGCTCGAAAAGGTGAAAAATGGTCTGATTTTGACCTACGTACCAT GGTCGAAAGAGGCAAAAACAACCCTGCTATCTTCATGTGGTCAATTGGTAATGAAATAGGTGAAGCTAA TGGTGATGCCCACTCTTTAGCAACTGTTAAACGTTTGGTTAAGGTTATCAAGGATGTTGATAAGACTCG CTATGTTACCATGGGAGCAGATAAATTCCGTTTCGGTAATGGTAGCGGAGGGCATGAGAAAATTGCTGA TGAACTCGATGCTGTTGGATTTAACTATTCTGAAGATAATTACAAAGCCCTTAGAGCTAAGCATCCAAA ATGGTTGATTTATGGATCAGAAACATCTTCAGCTACCCGTACACGTGGAAGTTACTATCGCCCTGAACG TGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGATTATGGAAATGATCGTGTGGG TTGGGGGAAAACAGCAACCGCTTCATGGACTTTTGACCGTGACAACGCTGGCTATGCTGGACAGTTTAT CTGGACAGGTACGGACTATATTGGTGAACCTACACCATGGCACAACCAAAATCAAACTCCTGTTAAGAG CTCTTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTATCTCTACCAAAGC SP051 amino acid (SEQ ID NO:82) SVVYADETLITHTAEKPKEEKMIVEEKADKALETKNIVERTEQSEPSSTEAIASEXKEDEAVTPKEEKV SAKPEEKAPRIESQASNQEKPLKEDAKAVTNEEVNQMIEDRKVDFNQNWYFKLNANSKEAIKPDADVST WKKLDLPYDWSIFNDFDHESPAQNEGGQLNGGEAWYRKTFKLDEKDLKKNVRLTFDGVYMDSQVYVNGQ LVGHYPNGYNQFSYDITKYLQKDGRENVIAVHAVNKQPSSRWYSGSGIYRDVTLQVTDKVHVEKNGTTI LTPKLEEQQHGKVETHVTSKIVNTDDKDHELVAEYQIVERGGHAVTGLVRTASRTLKAHESTSLDAILE VERPKLWTVLNDKPALYELITRVYRDGQLVDAKKDLFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHG ALGAEENYKAEYRRLKQMKEMGVNSIRTTHNPASEQTLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFF EKDATHPEARKGEKWSDFDLRTMVERGKNNPAIFMWSIGNEIGEANGDAHSLATVKRLVKVIKDVDKTR YVTMGADKFRFGNGSGGHEKIADELDAVGFNYSEDNYKALRAKHPKWLIYGSETSSATRTRGSYYRPER ELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFDRDNAGYAGQFIWTGTDYIGEPTPWHNQNQTPVKS SYFGIVDTAGIPKHDFYLYQS SP052 nucleotide (SEQ ID NO:83) TTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTATCTCTACCAAAGCCAATGGGT TTCTGTTAAGAAGAAACCGATGGTACACCTTCTTCCTCACTGGAACTGGGAAAACAAAGAATTAGCATC CAAAGTAGCTGACTCAGAAGGTAAGATTCCAGTTCGTGCTTATTCGAATGCTTCTAGTGTAGAATTGTT 
CTTGAATGGAAAATCTCTTGGTCTTAAGACTTTCAATAAAAAACAAACCAGCGATGGGCGGACTTACCA AGAAGGTGCAAATGCTAATGAACTTTATCTTGAATGGAAAGTTGCCTATCAACCAGGTACCTTGGAAGC AATTGCTCGTGATGAATCTGGCAAGGAAATTGCTCGAGATAAGATTACGACTGCTGGTAAGCCAGCGGC AGTTCGTCTTATTAAGGAAGACCATGCGATTGCAGCAGATGGAAAAGACTTGACTTACATCTACTATGA AATTGTTGACAGCCAGGGGAATGTGGTTCCAACTGCTAATAATCTGGTTCGCTTCCAATTGCATGGCCA AGGTCAACTGGTCGGTGTAGATAACGGAGAACAAGCCAGCCGTGAACGCTATAAGGCGCAAGCAGATGG TTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTGTCAAATCAACTGAACAAGCAGGGAA ATTCACCCTGACTGCCCACTCTGATCTCTTGAAATCGAACCAAGTCACTGTCTTTACTGGTAAGAAAGA AGGACAAGAGAAGACTGTTTTGGGGACAGAAGTGCCAAAAGTACAGACCATTATTGGAGAGGCACCTGA AATGCCTACCACTGTTCCGTTTGTATACAGTGATGGTAGCCGTGCAGAACGTCCTGTAACCTGGTCTTC AGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTAGAAGCTCG
TGTAGAAGTGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGTATTGCTCCAAATACTGACTTGAA TTCTGTAGACAAATCTGTTTCCTATGTTTTGATTGATGGAAGTGTTGAAGAGTATGAAGTGGACAAGTG GGAGATTGCCGAAGAAGATAAAGCTAAGTTAGCAATTCCAGGTTCTCGTATTCAAGCGACCGGTTATTT AGAAGGTCAACCAATTCATGCAACCCTTGTGGTAGAAGAAGGCAATCCTGCGGCACCTGCAGTACCAAC TGTAACGGTTGGTGGTGAGGCAGTAACAGGTCTTACTAGTCAAAAACCAATGCAATACCGCACTCTTGC TTATGGAGCTAAGTTGCCAGAAGTCACAGCAAGTGCTAAAAATGCAGCTGTTACAGTTCTTCAAGCAAG CGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGATGGTGGCCCTCTTCAAACCTATGC AATTCAATTCCTTGAAGAAGCGCCAAAAATTGCTCACTTGAGCTTGCAAGTGGAAAAAGCTGACAGTCT CAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAAGCTGTATTACC AGCTGATAAAGTAACCTTCTCTACAAGTGGTGAAGGGGAAGTCGCAATTCGTAAAGGAATGCTTGAGTT GCATAAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGACCAAGTTGAACTCACTAT CCAAGCCAATACTGAGAAGAAGATTGCGCAATCCATCCGTCCTGTAAATGTAGTGACAGATTTGCATCA GGAACCAAGTCTTCCAGCAACAGTAACAGTTGAGTATGACAAAGGTTTCCCTAAAACTCATAAAGTCAC TTGGCAAGCTATTCCGAAAGAAAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAAAGTTGAAGG AATTGACCTTGAAGCGCGTGCAAAAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGAC AACTCCAATCGCAGAAGCACCACAATTACCAGAAAGTGTTCGGACATATGATTCAAATGGTCACGTTTC ATCAGCTAAGGTTGCATGGGATGCGATTCGTCCAGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAA TGGTCGCTTAGAAGGTACGCAATTAACA SP052 amino acid (SEQ ID NO:84) YFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWWWENKELASKVADSEGKIPVRAYSNASSVELF LNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQPGTLEAIARDESGKEIARDKITTAGKPAA VRLIKEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRFQLHGQGQLVGVDNGEQASRERYKAQADG SWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFTGKKEGQEKTVLGTEVPKVQTIIGEAPE MPTTVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREVEARVEVIALKSELPVVKRIAPNTDLN SVDKSVSYVLIDGSVEEYEVDKWEIAEEDKAKLAIPGSRIQATGYLEGQPIHATLVVEEGNPAAPAVPT VTVGGEAVTGLTSQKPMQYRTLAYGAKLPEVTASAKNAAVTVLQASAANGMRASIFIQPKDGGPLQTYA IQFLEEAPKIAHLSLQVEKADSLKEDQTVKLSVRAHYQDGTQAVLPADKVTFSTSGEGEVAIRKGMLEL HKPGAVTLNAEYEGAKDQVELTIQANTEKKIAQSIRPVNVVTDLHQEPSLPATVTVEYDKGFPKTHKVT WQAIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTPIAEAPQLPESVRTYDSNGHVS SAKVAWDAIRPEQYAKEGVFTVNGRLEGTQLT SP053 
nucleotide (SEQ ID NO:85) AGCTAAGGTTGCATGGGATGCGATTCGTCCAGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGG TCGCTTAGAAGGTACGCAATTAACAACTAAACTTCATGTTCGCGTATCTGCTCAAACTGAGCAAGGTGC AAACATTTCTGACCAATGGACCGGTTCAGAATTGCCACTTGCCTTTGCTTCAGACTCAAATCCAAGCGA CCCAGTTTCAAATGTTAATGACAAGCTCATTTCCTACAATAACCAACCAGCCAATCGTTGGACAAACTG GAATCGTACTAATCCAGAAGCTTCAGTCGGTGTTCTGTTTGGAGATTCAGGTATCTTGAGCAAACGCTC CGTTGATAATCTAAGTGTCGGATTCCATGAAGACCATGGAGTTGGTGTACCGAAGTCTTATGTGATTGA GTATTATGTTGGTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGGACCATGT CTTTAATGATTCTGCCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTGCTCAACTCAAGGCTGGAGA AATGAACCACTTTAGCTTTGATAAAGTTGAAACCTATGCTGTTCGTATTCGCATGGTTAAAGCAGATAA CAAGCGTGGAACGTCTATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAGCAAGGACA AACAAGAATCCAAGTTGACGGCAAAGACTTAGCAAACTTCAACCCTGATTTGACAGACTACTACCTTGA GTCTGTAGATGGAAAAGTTCCGGCAGTCACAGCAAGTGTTAGCAACAATGGTCTCGCTACCGTCGTTCC AAGCGTTCGTGAAGGTGAGCCAGTTCGTGTCATCGCGAAAGCTGAAAATGGCGACATCTTAGGAGAATA CCGTCTGCACTTCACTAAGGATAAGAGCTTACTTTCTCATAAACCAGTTGCTGCGGTTAAACAAGCTCG CTTGCTACAAGTAGGTCAAGCACTTGAATTGCCGACTAAGGTTCCAGTTTACTTCACAGGTAAAGACGG CTACGAAACAAAAGACCTGACAGTTGAATGGGAAGAAGTTCCAGCGGAAAATCTGACAAAAGCAGGTCA ATTTACTGTTCGAGGCCGTGTCCTTGGTAGTAACCTTGTTGCTGAGATCACTGTACGAGTGACAGACAA ACTTGGTGAGACTCTTTCAGATAACCCTAACTATGATGAAAACAGTAACCAGGCCTTTGCTTCAGCAAC CAATGATATTGACAAAAACTCTCATGACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATCG TCGTTGGACAAACTGGTCACCAACACCATCTTCTAATCCAGAAGTATCAGCGGGTGTGATTTTCCGTGA AAATGGTAAGATTGTAGAACGGACTGTTACACAAGGAAAAGTTCAGTTCTTTGCAGATAGTGGTACGGA TGCACCATCTAAACTCGTTTTAGAACGCTATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAA CTACCAAGCCTACGACGCAGACCATCCATTCAACAATCCAGAAAATTGGGAAGCTGTTCCTTATCGTGC GGATAAAGACATTGCAGCTGGTGATGAAATCAACGTAACATTTAAAGCTATCAAAGCCAAAGCTATGAG ATGGCGTATGGAGCGTAAAGCAGATAAGAGCGGTGTTGCGATGATTGAGATGACCTTCCTTGCACCAAG TGAATTGCCTCAAGAAAGCACTCAATCAAAGATTCTTGTAGATGGAAAAGAACTTGCTGATTTCGCTGA AAATCGTCAAGACTATCAAATTACCTATAAAGGTCAACGGCCAAAAGTCTCAGTTGAAGAAAACAATCA AGTAGCTTCAACTGTGGTAGATAGTGGAGAAGATAGCTTTCCAGTACTTGTTCGCCTCGTTTCAGAAAG 
TGGAAAACAAGTCAAGGAATACCGTATCCACTTGACTAAGGAAAAACCAGTTTCTGAGAAGACAGTTGC TGCTGTACAAGAAGATCTTCCAAAAATCGAATTTGTTGAAAAAGATTTGGCATACAAGACAGTTGAGAA AAAAGATTCAACACTGTATCTAGGTGAAACTCGTGTAGAACAAGAAGGAAAAGTTGGAAAAGAACGTAT CTTTACAGCGATTAATCCTGATGGAAGTAAGGAAGAAAAACTCCGTGAAGTGGTAGAAGTTCCGACAGA CCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAAGAAGCTAAAAAACCACAAGTGTCAGAAAAAGC AGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAACTAATAAAGCCCAG SP053 amino acid (SEQ ID NO:86) AKVAWDAIRPEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANISDQWTGSELPLAFASDSNPSD PVSNVNDKLISYNNQPANRWTNWNRTNPEASVGVLFGDSGILSKRSVDNLSVGFHEDHGVGVPKSYVIE YYVGKTVPTAPKNPSFVGNEDHVFNDSAWWKPVTNLKAPAQLKAGEMNHFSFDKVETYAVRIRMVKADN KRGTSITEVQIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGKVPAVTASVSNNGLATVVP SVREGEPVRVIAKAENGDILGEYRLHFTKDKSLLSHKPVAAVKQARLLQVGQALELPTKVPVYFTGKDG YETKDLTVEWEEVPAENLTKAGQFTVRGRVLGSNLVAEITVRVTDKLGETLSDNPNYDENSNQAFASAT NDIDKNSHDRVDYLNDGDHSENRRWTNWSPTPSSNPEVSAGVIFRENGKIVERTVTQGKVQFFADSGTD APSKLVLERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYRADKDIAAGDEINVTFKAIKAKAMR WRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQITYKGQRPKVSVEENNQ VASTVVDSGEDSFPVLVRLVSESGKQVKEYRIHLTKEKPVSEKTVAAVQEDLPKIEFVEKDLAYKPVEK KDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKLREVVEVPTDRIVLVGTKPVAQEAKKPQVSEKA DTKPIDSSEASQTNKAQ SP054 nucleotide (SEQ ID NO:87) CTATCACTATGTAAATAAAGAGATTATTTCACAAGAAGCTAAAGATTTAATTCAGACAGGAAAGCCTGA CAGGAATGAAGTTGTATATGGTTTGGTGTATCAAAAAGATCAGTTGCCTCAAACAGGGACAGAA SP054 amino acid (SEQ ID NO:88) YHYVNKEIISQEAKDLIQTGKPDRNEVVYGLVYQKDQLPQTGTE SP055 nucleotide (SEQ ID NO:89) TGAGACTCCTCAATCAATAACAAATCAGGAGCAAGCTAGGACAGAAAACCAAGTAGTAGAGACAGAGGA AGCTCCAAAAGAAGAAGCACCTAAAACAGAAGAAAGTCCAAAGGAAGAACCAAAATCGGAGGTAAAACC TACTGACGACACCCTTCCTAAAGTAGAAGAGGGGAAAGAAGATTCAGCAGAACCAGCTCCAGTTGAAGA AGTAGGTGGAGAAGTTGAGTCAAAACCAGAGGAAAAAGTAGCAGTTAAGCCAGAAAGTCAACCATCAGA CAAACCAGCTGAGGAATCAAAAGTTGAACAAGCAGGTGAACCAGTCGCGCCAAGAGAAGACGAAAAGGC ACCAGTCGAGCCAGAAAAGCAACCAGAAGCTCCTGAAGAAGAGAAGGCTGTAGAGGAAACACCGAAACA AGAAGAGTCAACTCCAGATACCAAGGCTGAAGAAACTGTAGAACCAAAAGAGGAGACTGTTAATCAATC 
TATTGAACAACCAAAAGTTGAAACGCCTGCTGTAGAAAAACAAACAGAACCAACAGAGGAACCAAAAGT TGAACAAGCAGGTGAACCAGTCGCGCCAAGAGAAGACGAACAGGCACCAACGGCACCAGTTGAGCCAGA AAAGCAACCAGAAGTTCCTGAAGAAGAGAAGGCTGTAGAGGAAACACCGAAACCAGAAGATAAAATAAA GGGTATTGGTACTAAAGAACCAGTTGATAAAAGTGAGTTAAATAATCAAATTGATAAAGCTAGTTCAGT TTCTCCTACTGATTAT SP055 amino acid (SEQ ID NO:90) ETPQSITNQEQARTENQVVETEEAPKEEAPKTEESPKEEPKSEVKPTDDTLPKVEEGKEDSAEPAPVEE VGGEVESKPEEKVAVKPESQPSDKPAEESKVEQAGEPVAPREDEKAPVEPEKQPEAPEEEKAVEETPKQ EESTPDTKAEETVEPKEETVNQSIEQPKVETPAVEKQTEPTEEPKVEQAGEPVAPREDEQAPTAPVEPE KQPEVPEEEKAVEETPKPEDKIKGIGTKEPVDKSELNNQIDKASSVSPTDY SP056 nucleotide (SEQ ID NO:91) GGATGCTCAAGAAACTGCGGGAGTTCACTATAAATATGTGGCAGATTCAGAGCTATCATCAGAAGAAAA GAAGCAGCTTGTCTATGATATTCCGACATACGTGGAGAATGATGATGAAACTTATTATCTTGTTTATAA GTTAAATTCTCAAAATCAACTGGCGGAATTGCCAAATACTGGAAGCAAGAATGAGAGGCAA SP056 amino acid (SEQ ID NO:92) DAQETAGVHYKYVADSELSSEEKKQLVYDIPTYVENDDETYYLVYKLNSQNQLAELPNTGSKNERQ SP057 nucleotide (SEQ ID NO:93) CGACAAAGGTGAGACTGAGGTTCAACCAGAGTCGCCAGATACTGTGGTAAGTGATAAAGGTGAACCAGA GCAGGTAGCACCGCTTCCAGAATATAAGGGTAATATTGAGCAAGTAAAACCTGAAACTCCGGTTGAGAA GACCAAAGAACAAGGTCCAGAAAAAACTGAAGAAGTTCCAGTAAAACCAACAGAAGAAACACCAGTAAA TCCAAATGAAGGTACTACAGAAGGAACCTCAATTCAAGAAGCAGAAAATCCAGTTCAACCTGCAGAAGA ATCAACAACGAATTCAGAGAAAGTATCACCAGATACATCTAGCAAAAATACTGGGGAAGTGTCCAGTAA TCCTAGTGATTCGACAACCTCAGTTGGAGAATCAAATAAACCAGAACATAATGACTCTAAAAATGAAAA TTCAGAAAAAACTGTAGAAGAAGTTCCAGTAAATCCAAATGAAGGCACAGTAGAAGGTACCTCAAATCA AGAAACAGAAAAACCAGTTCAACCTGCAGAAGAAACACAAACAAACTCTGGGAAAATAGCTAACGAAAA TACTGGAGAAGTATCCAATAAACCTAGTGATTCAAAACCACCAGTTGAAGAATCAAATCAACCAGAAAA AAACGGAACTGCAACAAAACCAGAAAATTCAGGTAATACAACATCAGAGAATGGACAAACAGAACCAGA ACCATCAAACGGAAATTCAACTGAGGATGTTTCAACCGAATCAAACACATCCAATTCAAATGGAAACGA AGAAATTAAACAAGAAAATGAACTAGACCCTGATAAAAAGGTAGAAGAACCAGAGAAAACACTTGAATT AAGAAAT SP057 amino acid (SEQ ID NO:94) DKGETEVQPESPDTVVSDKGEPEQVAPLPEYKGNIEQVKPETPVEKTKEQGPEKTEEVPVKPTEETPVN PNEGTTEGTSIQEAENPVQPAEESTTNSEKVSPDTSSKNTGEVSSNPSDSTTSVGESNKPEHNDSKNEN 
SEKTVEEVPVNPNEGTVEGTSNQETEKPVQPAEETQTNSGKIANENTGEVSNKPSDSKPPVEESNQPEK NGTATKPENSGNTTSENGQTEPEPSNGNSTEDVSTESNTSNSNGNEEIKQENELDPDKKVEEPEKTLEL RN SP058 nucleotide (SEQ ID NO:95) AAATCAATTGGTAGCACAAGATCCAAAAGCACAAGATAGCACTAAACTGACTGCTGAAAAATCAACTGT TAAAGCACCTGCTCAAAGAGTAGATGTAAAAGATATAACTCATTTAACAGATGAAGAAAAAGTTAAGGT TGCTATTTTACAAGCAAATGGTTCAGCATTAGACGGAGCGACAATCAATGTAGCTGGAGATGGTACAGC AACAATCACATTCCCAGATGGTTCAGTAGTGACGATTCTAGGAAAAGATACAGTTCAACAATCTGCGAA AGGTGAATCTGTAACTCAAGAAGCTACACCAGAGTATAAGCTAGAAAATACACCAGGTGGAGATAAGGG AGGCAATACTGGAAGCTCAGATGCTAATGCGAATGAAGGCGGTGGTAGCCAGGCGGGTGGATCAGCTCA CACAGGTTCACAAAACTCAGCTCAATCACAAGCTTCTAAGCAATTAGCTACTGAAAAAGAATCAGCTAA AAATGCCATTGAAAAAGCAGCCAAGGACAAGCAGGATGAAATCAAAGGCGCACCGCTTTCTGATAAAGA AAAAGCAGAACTTTTAGCAAGAGTGGAAGCAGAAAAACAAGCAGCTCTCAAAGAGATTGAAAATGCGAA AACTATGGAAGATGTGAAGGAAGCAGAAACGATTGGAGTGCAAGCCATTGCCATGGTTACAGTTCCTAA GAGACCAGTGGCTCCTAAT SP058 amino acid (SEQ ID NO:96) NQLVAQDPKAQDSTKLTAEKSTVKAPAQRVDVKDITHLTDEEKVKVAILQANGSALDGATINVAGDGTA TITFPDGSVVTILGKDTVQQSAKGESVTQEATPEYKLENTPGGDKGGNTGSSDANANEGGGSQAGGSAH TGSQNSAQSQASKQLATEKESAKNAIEKAAKDKQDEIKGAPLSDKEKAELLARVEAEKQAALKEIENAK TMEDVKEAETIGVQAIAMVTVPKRPVAPN SP059 nucleotide (SEQ ID NO:97) CAAACAGTCAGCTTCAGGAACGATTGAGGTGATTTCACGAGAAAATGGCTCTGGGACACGGGGTGCCTT CACAGAAATCACAGGGATTCTCAAAAAAGACGGTGATAAAAAAATTGACAACACTGCCAAAACAGCTGT GATTCAAAATAGTACAGAAGGTGTTCTCTCAGCAGTTCAAGGGAATGCTAATGCTATCGGCTACATCTC CTTGGGATCTTTAACGAAATCTGTCAAGGCTTTAGAGATTGATGGTGTCAAGGCTAGTCGAGACACAGT TTTAGATGGTGAATACCCTCTTCAACGTCCCTTCAACATTGTTTGGTCTTCTAATCTTTCCAAGCTAGG TCAAGATTTTATCAGCTTTATCCACTCCAAACAAGGTCAACAAGTGGTCACAGATAATAAATTTATTGA AGCTAAAACCGAAACCACGGAATATACAAGCCAACACTTATCAGGCAAGTTGTCTGTTGTAGGTTCCAC TTCAGTATCTTCTTTAATGGAAAAATTAGCAGAAGCTTATAAAAAAGAAAATCCAGAAGTTACGATTGA TATTACCTCTAATGGGTCTTCAGCAGGTATTACCGCTGTTAAGGAGAAAACCGCTGATATTGGTATGGT TTCTAGGGAATTAACTCCTGAAGAAGGTAAGAGTCTCACCCATGATGCTATTGCTTTAGACGGTATTGC TGTTGTGGTCAATAATGACAATAAGGCAAGCCAAGTCAGTATGGCTGAACTTGCAGACGTTTTTAGTGG CAAATTAACCACCTGGGACAAGATTAAA 
SP059 amino acid (SEQ ID NO:98) KQSASGTIEVISRENGSGTRGAFTEITGILKKDGDKKIDNTAKTAVIQNSTEGVLSAVQGNANAIGYIS LGSLTKSVKALEIDGVKASRDTVLDGEYPLQRPFNIVWSSNLSKLGQDFISFIHSKQGQQVVTDNKFIE AKTETTEYTSQHLSGKLSVVGSTSVSSLMEKLAEAYKKENPEVTIDITSNGSSAGITAVKEKTADIGMV SRELTPEEGKSLTHDAIALDGIAVVVNNDNKASQVSMAELADVFSGKLTTWDKIK SP060 nucleotide (SEQ ID NO:99) ATTCGATGATGCGGATGAAAAGATGACCCGTGATGAAATTGCCTATATGCTGACAAATAGTGAAGAAAC ATTGGATGCTGATGAGATTGAGATGCTACAAGGTGTCTTTTCGCTCGATGAACTGATGGCACGAGAGGT TATGGTTCCTCGAACGGATGCCTTTATGGTGGATATTCAGGATGATAGTCAAGCCATTATCCAAAGTAT TTTAAAACAAAATTATTCTCGTATCCCGGTTTATGATGGGGATAAGGACAATGTAATTGGAATCATTCA CACCAAGAGTCTCCTTAAGGCAGGCTTTGTGGACGGTTTTGACAATATTGTTTGGAAGAGAATTTTACA AGATCCACTTTTTGTACCTGAAACTATTTTTGTGGATGACTTGCTAAAAGAACTGCGAAATACCCAAAG ACAAATG SP060 amino acid (SEQ ID NO:100) FDDADEKMTRDEIAYMLTNSEETLDADEIEMLQGVFSLDELMAREVMVPRTDAFMVDIQDDSQAIIQSI LKQNYSRIPVYDGDKDNVIGIIHTKSLLKAGFVDGFDNIVWKRILQDPLFVPETIFVDDLLKELRNTQR QM SP062 nucleotide (SEQ ID NO:101) GGAGAGTCGATCAAAAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGACTCATCTTCTTCGTCAAGTTC AGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAACAAGCCGACAGAACCAGGAGAAAA GGTAGCAGAAGCTAAGAAGAAGGTTGAAGAAGCTGAGAAAAAAGCCAAGGATCAAAAAGAAGAAGATCG TCGTAACTACCCAACCATTACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAA AAAAGCGGAGCTTGAACTAGTAAAAGTGAAAGCTAACGAACCTCGAGACGAGCAA SP062 amino acid (SEQ ID NO:102) ESRSKVDEAVSKFEKDSSSSSSSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKDQKEEDR RNYPTITYKTLELEIAESDVEVKKAELELVKVKANEPRDEQ SP063 nucleotide (SEQ ID NO:103) ATGGACAACAGGAAACTGGGACGAGGTTATATCTGGTAAGATTGACAAGTACAAAGATCCAGATATTCC AACAGTTGAATCACAAGAAGTTACGTCAGACTCTAGTGATAAAGAAATAACGGTAAGGTATGACCGTTT ATCAACACCAGAAAAACCAATCCCACAACCAAATCCAGAGCATCCAAGTGTTCCGACACCAAACCCAGA ACTACCAAATCAAGAGACTCCAACACCAGATAAACCAACTCCAGAACCAGGTACTCCAAAAACTGAAAC TCCAGTGAATCCAGACCCAGAAGTTCCGACTTATGAGACAGGTAAGAGAGAGGAATTGCCAAACACAGG TACAGAAGCTAAT SP063 amino acid (SEQ ID NO:104) WTTGNWDEVISGKIDKYKDPDIPTVESQEVTSDSSDKEITVRYDRLSTPEKPIPQPNPEHPSVPTPNPE LPNQETPTPDKPTPEPGTPKTETPVNPDPEVPTYETGKREELPNTGTEAN 
SP064 nucleotide (SEQ ID NO:105) CGATGGGCTCAATCCAACCCCAGGTCAAGTCTTACCTGAAGAGACATCGGGAACGAAAGAGGGTGACTT ATCAGAAAAACCAGGAGACACCGTTCTCACTCAAGCGAAACCTGAGGGCGTTACTGGAAATACGAATTC ACTTCCGACACCTACAGAAAGAACTGAAGTGAGCGAGGAAACAAGCCCTTCTAGTCTGGATACACTTTT TGAAAAAGATGAAGAAGCTCAAAAAAATCCAGAGCTAACAGATGTCTTAAAAGAAACTGTAGATACAGC TGATGTGGATGGGACACAAGCAAGTCCAGCAGAAACTACTCCTGAACAAGTAAAAGGTGGAGTGAAAGA AAATACAAAAGACAGCATCGATGTTCCTGCTGCTTATCTTGAAAAAGCTGAAGGGAAAGGTCCTTTCAC TGCCGGTGTAAACCAAGTAATTCCTTATGAACTATTCGCTGGTGATGGTATGTTAACTCGTCTATTACT AAAAGCTTCGGATAATGCTCCTTGGTCTGACAATGGTACTGCTAAAAATCCTGCTTTACCTCCTCTTGA AGGATTAACAAAAGGGAAATACTTCTATGAAGTAGACTTAAATGGCAATACTGTTGGTAAACAAGGTCA AGCTTTAATTGATCAACTTCGCGCTAATGGTACTCAAACTTATAAAGCTACTGTTAAAGTTTACGGAAA TAAAGACGGTAAAGCTGACTTGACTAATCTAGTTGCTACTAAAAATGTAGACATCAACATCAATGGATT AGTTGCTAAAGAAACAGTTCAAAAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGC CTACCTAGAAAAAGCCAAGGGTGAAGGTCCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACT CTTCGCAGGTGATGGCATGTTGACTCGTCTCTTGCTCAAGGCATCTGACAAGGCACCATGGTCAGATAA CGGCGACGCTAAAAACCCAGCCCTATCTCCACTAGGCGAAAACGTGAAGACCAAAGGTCAATACTTCTA TCAANTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACAAGCGCTCATTGACCAGTTCCGAGCAAA NGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGACGGTAAACCAGACTTGGACAA CATCGTAGCAACTAAAAAAGTCACTATTAACATAAACGGTTTAATTTCTAAAGAAACAGTTCAAAAAGC CGTTGCAGACAACGTTAANGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTGAAGG TCCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTGATGGTATGTTGACTCG TCTCTTGCTCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGNGACGCTAAAAACCCAGCNCTATC TCCACTAGGTGAAAACGTGAAGACCAAAGGTCAATACTTCTATCAANTAGCCTTGGACGGAAATGTAGC TGGCAAAGAAAAACAAGCGCTCATTGACCAGTTCCGAGCAAACGGTACTCAAACTTACAGCGCTACAGT CAATGTCTATGGTAACAAAGACGGTAAACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTAT TAAGATAAATGTTAAAGAAACATCAGACACAGCAAATGGTTCATTATCACCTTCTAACTCTGGTTCTGG CGTGACTCCGATGAATCACAATCATGCTACAGGTACTACAGATAGCATGCCTGCTGACACCATGACAAG TTCTACCAACACGATGGCAGGTGAAAACATGGCTGCTTCTGCTAACAAGATGTCTGATACGATGATGTC AGAGGATAAAGCTATG SP064 amino acid (SEQ ID NO:106) 
DGLNPTPGQVLPEETSGTKEGDLSEKPGDTVLTQAKPEGVTGNTNSLPTPTERTEVSEETSPSSLDTLF EKDEEAQKNPELTDVLKETVDTADVDGTQASPAETTPEQVKGGVKENTKDSIDVPAAYLEKAEGKGPFT AGVNQVIPYELFAGDGMLTRLLLKASDNAPWSDNGTAKNPALPPLEGLTKGKYFYEVDLNGNTVGKQGQ ALIDQLRANGTQTYKATVKVYGNKDGKADLTNLVATKNVDININGLVAKETVQKAVADNVKDSIDVPAA YLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFY QXALDGNVAGKEKQALIDQFRAXGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKA VADNVXDSIDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALS PLGENVKTKGQYFYQXALDGNVAGKEKQALIDQFRANGTQTYSATVVVYGNKDGKPDLDNIVATKKVTI KINVKETSDTANGSLSPSNSGSGVTPMNHNHATGTTDSMPADTMTSSTNTMAGENAAASANKMSDTMMS EDKAM SP065 nucleotide (SEQ ID NO:107) TTCCAATCAAAAACAGGCAGATGGTAAACTCAATATCGTGACAACCTTTTACCCTGTCTATGArTTTAC CAAGCAAGTCGCAGGAGATACGGCTAATGTAGAACTCCTAATCGGTGCTGGGACAGAACCTCATGAATA CGAACCATCTGCCAAGGCAGTTGCCAAAATCCAAGATGCAGATACCTTCGTTTATGAAAATGAAAACAT GGAAACATGGGTACCTAAATTGCTAGATACCTTGGATAAGAAAAAAGTGAAAACCATCAAGGCGACAGG CGATATGTTGCTCTTGCCAGGTGGCGAGGAAGAAGAGGGAGACCATGACCATGGAGAAGAAGGTCATCA CCATGAGTTTGACCCCCATGTTTGGTTATCACCAGTTCGTGCCATtAAACTAGTAGAGCACCATCCGCG ACACTTGTCAGCAGATTATCCTGATAAAAAAGAGACCTTTGAGAAGAATGCAGCTGCCTATATCGAAAA ATTGCAAGCCTTGGATAAGGCTTACGCAGAAGGTTTGTCTCAAGCAAAACAAAAGAGCTTTGTGACTCA ACACGCAgCCTTTAACTaTCTTGCCTTGGACTATGGGACTC SP065 amino acid (SEQ ID NO:108) SNQKQADGKLNIVTTFYPVYEFTKQVAGDTANVELLIGAGTEPHEYEPSAKAVAKIQDADTFVYENENM ETWVPKLLDTLDKKKVKTIKATGDMLLLPGGEEEEGDHDHGEEGHHHEFDPHVWLSPVRAIKLVEHHPR HLSADYPDKKETFEKNAAAYIEKLQALDKAYAEGLSQAKQKSFVTQHAAFNYLALDYGT SP067 nucleotide (SEQ ID NO:109) TATCACAGGATCGAACGGTAAGACAACCACAACGACTATGATTGGGGAAGTTTTGACTGCTGCTGGCCA ACATGGTCTTTTATCAGGGAATATCGGCTATCCAGCTAGTCAGGTTGCTCAAATAGCATCAGATAAGGA CACGCTTGTTATGGAACTTTCTTCTTTCCAACTCATGGGTGTTCAAGAATTCCATCCAGAGATTGCGGT TATTACCAACCTCATGCCAACTCATATCGACTACCATGGGTCATTTTCGGAATATGTAGCAGCCAAGTG GAATATCCAGAACAAGATGACAGCAGCTGATTTCCTTGTCTTGAACTTTAATCAAGACTTGGCAAAAGA CTTGACTTCCAAGACAGAAGCCACTGTTGTACCATTTTCAACACTTGAAAAGGTTGATGGAGCTTATCT 
GGAAGATGGTCAACTCTACTTCCGTGGTGAAGTAGTCATGGCAGCGAATGAAATCGGTGTTCCAGGTAG
CCACAATGTGGAAAATGCCCTTGCGACTATTGCTGTAGCCAAGCTTCGTGATGTGGACAATCAAACCAT CAAGGAAACTCTTTCAGCCTTCGGTGGTGTCAAACACCGTCTCCAGTTTGTGGATGACATCAAGGGTGT TAAATTCTATAACGACAGTAAATCAACTAATATCTTGGCTACTCAAAAAGCCTTGTCAGGATTTGACAA CAGCAAGGTCGTCTTGATTGCAGGTGGTTTGGACCGTGGCAATGAGTTTGACGAATTGGTGCCAGACAT TACTGGACTCAAGAAGATGGTCATCCTGGGTCAATCTGCAGAACGTGTCAAACGGGCAGCAGACAAGGC TGGTGTCGCTTATGTGGAGGCGACAGATATTGCAGATGCGACCCGCAAGGCCTATGAGCTTGCGACTCA AGGAGATGTGGTTCTTCTTAGTCCTGCCAATGCTAGCTGGGATATGTATGCTAACTTTGAAGTACGTGG CGACCTCTTTATCGACACAGTAGCGGAGTTAAAAGAA SP067 amino acid (SEQ ID NO:110) GITGSNGKTTTTTMIGEVLTAAGQHGLLSGNIGYPASQVAQIASDKDTLVMELSSFQLMGVQEFHPEIA VITNLMPTHIDYHGSFSEYVAAKWNIQNKMTAADFLVLNFNQDLAKDLTSKTEATVVPFSTLEKVDGAY LEDGQLYFRGEVVMAANEIGVPGSHNVENALATIAVAKLRDVDNQTIKETLSAFGGVKHRLQFVDDIKG VKFYNDSKSTNILATQKALSGFDNSKVVLIAGGLDRGNEFDELVPDITGLKKMVILGQSAERVKRAADK AGVAYVEATDIADATRKAYELATQGDVVLLSPANASWDMYANFEVRGDLFIDTVAELKE SP068 nucleotide (SEQ ID NO:111) AAGTTCATCGAAGATGGTTGGGAAGTCCACTATATCGGGGACAAGTGTGGTATCGAACACCAAGAAATC CTTAAGTCAGGTTTGGATGTCACCTTCCATTCTATTGCGACTGGAAAATTGCGTCGCTATTTCTCTTGG CAAAATATGCTGGACGTCTTCAAAGTTGGTTGGGGAATTGTCCAATCGCTCTTTATCATGTTGCGACTG CGTCCACAGACCCTTTTTTCAAAGGGGGGCTTTGTCTCAGTACCGCCTGTTATCGCTGCGCGTGTGTCA GGAGTGCCTGTCTTTATTCACGAATCTGACCTGTCTATGGGCTTGGCCAATAAAATCGCCTATAAATTT GCGACTAAGATGTATTCAACCTTTGAACAAGCTTCGAGTTTGGCTAAGGTTGAGCATGTGGGAGCGG SP068 amino acid (SEQ ID NO:112) SSSKMVGKSTISGTSVVSNTKKSLSQVWMSPSILLRLENCVAISLGKICWTSSKLVGELSNRSLSCCDC VHRPFFQRGALSQYRLLSLRVCQECLSLFTNLTCLWAWPIKSPINLRLRCIQPLNKLRVWLRLSMWER SP069 nucleotide (SEQ ID NO:113) ATCGCTAGCTAGTGAAATGCAAGAAAGTACACGTAAATTCAAGGTTACTGCTGACCTAACAGATGCCGG TGTTGGAACGATTGAAGTTCCTTTGAGCATTGAAGATTTACCCAATGGGCTGACCGCTGTGGCGACTCC GCAAAAAATTACAGTCAAGATTGGTAAGAAGGCTCAGAAGGATAAGGTAAAGATTGTACCAGAGATTGA CCCTAGTCAAATTGATAGTCGGGTACAAATTGAAAATGTCATGGTGTCAGATAAAGAAGTGTCTATTAC GAGTGACCAAGAGACATTGGATAGAATTGATAAGATTATCGCTGTTTTGCCAACTAGCGAACGTATAAC AGGTAATTACAGTGGTTCAGTACCTTTGCAGGCAATCGACCGCAATGGTGTTGTCTTACCGGCAGTTAT 
CACTCCGTTTGATACAATAATGAAGGTGACTACAAAACCAGTAGCACCAAGTTCAAGCACATCAAATTC AAGTACAAGCAGTTCATCGGAGACATCTTCGTCAACGAAAGCAACTAGTTCAAAAACGAAT SP069 amino acid (SEQ ID NO:114) SLASEMQESTRKFKVTADLTDAGVGTIEVPLSIEDLPNGLTAVATPQKITVKIGKKAQKDKVKIVPEID PSQIDSRVQIENVNVSDKEVSITSDQETLDRIDKIIAVLPTSERITGNYSGSVPLQAIDRNGVVLPAVI TPFDTIMKVTTKPVAPSSSTSNSSTSSSSETSSSTKATSSKTN SP070 nucleotide (SEQ ID NO:115) GCACCAGATGGGGCACAAGGTTCAGGGATCAGATGTTGAAAAGTACTACTTTACCCAACGCGGTCTTGA GCAGGCAGGAATTACCATTCTTCCTTTTGATGAAAAAAATCTAGACGGTGATATGGAAATTATCGCTGG AAATGCCTTTCGTCCAGATAACAACGTCGAAATTGCCTATGCGGACCAAAATGGTATCAGCTACAAACG TTACCATGAGTTTCTAGGTAGCTTTATGCGTGACTTTGTTAGCATGGGAGTAGCAGGAGCACATGGAAA AACTTCAACGACAGGTATGTTGTCTCATGTCTTGTCTCACATTACAGATACCAGCTTCTTGATTGGAGA TGGGACAGGTCGTGGTTCGGCCAATGCCAAATATTTTGTCTTTGAATCTGACGAATATGAGCGTCACTT CATGCCTTACCACCCAGAATACTCTATTATCACCAACATTGACTTTGACCATCCAGATTATTTCACAAG TCTCGAGGATGTTTTTAATGCCTTTAACGACTATGCCAAACAAATCACCAAGGGTCTTTTTGTCTATGG TGAAGATGCTGAATTGCGTAAGATTACGTCTGATGCACCAATTTATTATTATGGTTTTGAAGCTGAAGG CAATGACTTTGTAGCTAGTGATCTTCTTCGTTCAATAACTGGTTCAACCTTCACCGTTCATTTCCGTGG ACAAAACTTGGGGCAATTCCACATTCCAACCTTTGGTCGTCACAATATCATGAATGCGACAGCCGTTAT TGGTCTTCTTTACACAGCAGGATTTGATTTGAACTTGGTGCGTGAGCACTTGAAAACATTTGCCGGTGT TAAACGTCGTTTCACTGAGAAAATTGTCAATGATACAGTGATTATCGATGACTTTGCCCACCATCCAAC AGAAATTATTGCGACCTTGGATGCGGCTCGTCAGAAATACCCAAGCAAGGAAATTGTAGCAGTCTTTCA ACCGCATACCTTTACAAGAACCATTGCCTTGTTGGACGACTTTGCCCATGCTTTAAACCAAGCAGATGC TGTTTATCTAGCGCAAATTTATGGCTCGGCTCGTGAAGTAGATCATGGTGACGTTAAGGTAGAAGACCT AGCCAACAAAATCAACAAAAAACACCAAGTGATTACTGTTGAAAATGTTTCTCCACTCCTAGACCATGA CAATGCTGTTTACGTCTTTATGGGAGCAGGAGACATCCAAACCTATGAATACTCATTTGAGCGTCTCTT GTCTAACTTGACAAGCAATGTTCAA SP070 amino acid (SEQ ID NO:116) HQMGHKVQGSDVEKYYFTQRGLEQAGITILPFDEKNLDGDMEIIAGNAFRPDNNVEIAYADQNGISYKR YHEFLGSFMRDFVSMGVAGAHGKTSTTGMLSHVLSHITDTSFLIGDGTGRGSANAKYFVFESDEYERHF MPYHPEYSIITNIDFDHPDYFTSLEDVFNAFNDYAKQITKGLFVYGEDAELRKITSDAPIYYYGFEAEG NDFVASDLLRSITGSTFTVHFRGQNLGQFHIPTFGRHNIMNATAVIGLLYTAGFDLNLVREHLKTFAGV 
KRRFTEKIVNDTVIIDDFAHHPTEIIATLDAARQKYPSKEIVAVFQPHTFTRTIALLDDFAHALNQADA VYLAQIYGSAREVDHGDVKVEDLANKINKKHQVITVENVSPLLDHDNAVYVFMGAGDIQTYEYSFERLL SNLTSNVQ SP071 nucleotide (SEQ ID NO:117) TTTTAACCCAACTGTTGGTACTTTCCTTTTTACTGCAGGATTGAGCTTGTTAGTTTTATTGGTTTCTAA AAGGGAAAATGGAAAGAAACGACTTGTTCATTTTCTGCTGTTGACTAGCATGGGAGTTCAATTGTTGCC GGCCAGTGCTTTTGGGTTGACCAGCCAGATTTTATCTGCCTATAATAGTCAGCTTTCTATCGGAGTCGG GGAACATTTACCAGAGCCTCTGAAAATCGAAGGTTATCAATATATTGGTTATATCAAAACTAAGAAACA GGATAATACAGAGCTTTCAAGGACAGTTGATGGGAAATACTCTGCTCAAAGAGATAGTCAACCAAACTC TACAAAAACATCAGATGTAGTTCATTCAGCTGATTTAGAATGGAACCAAGGACAGGGGAAGGTTAGTTT ACAAGGTGAAGCATCAGGGGATGATGGACTTTCAGAAAAATCTTCTATAGCAGCAGACAATCTATCTTC TAATGATTCATTCGCAAGTCAAGTTGAGCAGAATCCGGATCACAAAGGAGAATCTGTAGTTCGACCAAC AGTGCCAGAACAAGGAAATCCTGTGTCTGCTACAACGGTGCAGAGTGCGGAAGAGGAAGTATTGGCGAC GACAAATGATCGACCAGAGTATAAACTTCCATTGGAAACCAAAGGCACGCAAGAACCCGGTCATGAGGG TGAAGCCGCAGTCCGTGAAGACTTACCAGTCTACACTAAGCCACTAGAAACCAAAGGTACACAAGGACC CGGACATGAAGGTGAAGCTGCAGTTCGCGAGGAAGAACCAGCTTACACAGAACCGTTAGCAACGAAAGG CACGCAAGAGCCAGGTCATGAGGGCAAAGCTACAGTCCGCGAAGAGACTCTAGAGTACACGGAACCGGT AGCGACAAAAGGCACACAAGAACCCGAACATGAGGGCGAaCGGSCAGTAGAAGAAGAACTTCCGGCTTT AGAGGTCACTACACGAAATAGAACGGAAATCCAGAATATTCCTTATACAACAGAAGAAATTCAGGATCC AACACTTCTGAAAAATCGTCGTAAGATTGAACGACAAGGGCAAGCAGGGACACGTACAATTCAATATGA AGACTACATCGTAAATGGTAATGTCGTAGAAACTAAAGAAGTGTCACGAACTGAAGTAGCTCCGGTCAA CGAAGTCGTTAAAGTAGGAACACTTGTGAAAGTTAAACCTACAGTAGAAATTACAAACTTAACAAAAGT TGAGAACAAAAAATCTATAACTGTAAGTTATAACTTAATAGACACTACCTCAGCATATGTTTCTGCAAA AACGCAAGTTTTCCATGGAGACAAGCTAGTTAAAGAGGTGGATATAGAAAATCCTGCCAAAGAGCAAGT AATATCAGGTTTAGATTACTACACACCGTATACAGTTAAAACACACCTAACTTATAATTTGGGTGAAAA TAATGAGGAAAATACTGAAACATCAACTCAAGATTTCCAATTAGAGTATAAGAAAATAGAGATTAAAGA TATTGATTCAGTAGAATTATACGGTAAAGAAAATGATCGTTATCGTAGATATTTAAGTCTAAGTGAAGC GCCGACTGATACGGCTAAATACTTTGTAAAAGTGAAATCAGATCGCTTCAAAGAAATGTACCTACCTGT AAAATCTATTACAGAAAATACGGATGGAACGTATAAAGTGACGGTAGCCGTTGATCAACTTGTCGAAGA 
AGGTACAGACGGTTACAAAGATGATTACACATTTACTGTAGCTAAATCTAAAGCAGAGCAACCAGGAGT TTACACATCCTTTAAACAGCTGGTAACAGCCATGCAAAGCAATCTGTCTGGTGTCTATACATTGGCTTC AGATATGACCGCAGATGAGGTGAGCTTAGGCGATAAGCAGACAAGTTATCTCACAGGTGCATTTACAGG GAGCTTGATCGGTTCTGATCGAACAAAATCGTATGCCATTTATGATTTGAAGAAACCATTATTTGATAC ATTAAATGGTGCTACAGTTAGAGATTTGGATATTAAAACTGTTTCTGCTGATAGTAAAGAAAATGTCGC AGCGCTGGCGAAGGCAGCGAATAGCGCGAATATTAATAATGTTGCAGTAGAAGGAAAAATCTCAGGTGC GAAATCTGTTGCGGGATTAGTAGCGAGCGCAACAAATACAGTGATAGAAAACAGCTCGTTTACAGGGAA ACTTATCGCAAATCACCAGGACAGTAATAAAAATGATACTGGAGGAATAGTAGGTAATATAACAGGAAA TAGTTCGAGAGTTAATAAAGTTAGGGTAGATGCCTTAATCTCTACTAATGCACGCAATAATAACCAAAC AGCTGGAGGGATAGTAGGTAGATTAGAAAATGGTGCATTGATATCTAATTCGGTTGCTACTGGAGAAAT ACGAAATGGTCAAGGATATTCTAGAGTCGGAGGAATAGTAGGATCTACGTGGCAAAACGGTCGAGTAAA TAATGTTGTGAGTAACGTAGATGTTGGAGATGGTTATGTTATCACCGGTGATCAATACGCAGCAGCAGA TGTGAAAAATGCAAGTACATCAGTTGATAATAGAAAAGCAGACAGATTCGCTACAAAATTATCAAAAGA CCAAATAGACGCGAAAGTTGCTGATTATGGAATCACAGTAACTCTTGATGATACTGGGCAAGATTTAAA ACGTAATCTAAGAGAAGTTGATTATACAAGACTAAATAAAGCAGAAGCTGAAAGAAAAGTAGCTTATAG CAACATAGAAAAACTGATGCCATTCTACAATAAAGACCTAGTAGTTCACTATGGTAACAAAGTAGCGAC AACAGATAAACTTTACACTACAGAATTGTTAGATGTTGTGCCGATGAAAGATGATGAAGTAGTAACGGA TATTAATAATAAGAAAAATTCAATAAATAAAGTTATGTTACATTTCAAAGATAATACAGTAGAATACCT AGATGTAACATTCAAAGAAAACTTCATAAACAGTCAAGTAATCGAATACAATGTTACAGGAAAAGAATA TATATTCACACCAGAAGCATTTGTTTCAGACTATACAGCGATAACGAATAACGTACTAAGCGACTTGCA AAATGTAACACTTAAC SP071 amino acid (SEQ ID NO:118) FNPTVGTFLFTAGLSLLVLLVSKRENGKKRLVHFLLLTSMGVQLLPASAFGLTSQILSAYNSQLSIGVG EHLPEPLKIEGYQYIGYIKTKKQDNTELSRTVDGKYSAQRDSQPNSTKTSDVVHSADLEWNQGQGKVSL QGEASGDDGLSEKSSIAADNLSSNDSFASQVEQNPDHKGESVVRPTVPEQGNPVSATTVQSAEEEVLAT TNDRPEYKLPLETKGTQEPGHEGEAAVREDLPVYTKPLETKGTQGPGHEGEAAVREEEPAYTEPLATKG TQEPGHEGKATVREETLEYTEPVATKGTQEPEHEGERXVEEELPALEVTTRNRTEIQNIPYTTEEIQDP TLLKNRRKIERQGQAGTRTIQYEDYIVNGNVVETKEVSRTEVAPVNEVVKVGTLVKVKPTVEITNLTKV ENKKSITVSYNLIDTTSAYVSAKTQVFHGDKLVKEVDIENPAKEQVISGLDYYTPYTVKTHLTYNLGEN 
NEENTETSTQDFQLEYKKIEIKDIDSVELYGKENDRYRRYLSLSEAPTDTAKYFVKVKSDRFKEMYLPV KSITENTDGTYKVTVAVDQLVEEGTDGYKDDYTFTVAKSKAEQPGVYTSFKQLVTAMQSNLSGVYTLAS DMTADEVSLGDKQTSYLTGAFTGSLIGSDGTKSYAIYDLKKPLFDTLNGATVRDLDIKTVSADSKENVA ALAKAANSANINNVAVEGKISGAKSVAGLVASATNTVIENSSFTGKLIANHQDSNKNDTGGIVGNITGN SSRVNKVRVDALISTNARNNNQTAGGIVGRLENGALISNSVATGEIRNGQGYSRVGGIVGSTWQNGRVN NVVSNVDVGDGYVITGDQYAAADVKNASTSVDNRKADRFATKLSKDQIDAKVADYGITVTLDDTGQDLK RNLREVDYTRLNKAEAERKVAYSNIEKLMPFYNKDLVVHYGNKVATTDKLYTTELLDVVPMKDDEVVTD INNKKNSINKVMLHFKDNTVEYLDVTFKENFINSQVIEYNVTGKEYIFTPEAFVSDYTAITNNVLSDLQ NVTLN SP072 nucleotide (SEQ ID NO:119) TTTTAACCCAACTGTTGGTACTTTCCTTTTTACTGCAGGATTGAGCTTGTTAGTTTTATTGGTTTCTAA AAGGGAAAATGGAAAGAAACGACTTGTTCATTTTCTGCTGTTGACTAGCATGGGAGTTCAATTGTTGCC GGCCAGTGCTTTTGGGTTGACCAGCCAGATTTTATCTGCCTATAATAGTCAGCTTTCTATCGGAGTCGG GGAACATTTACCAGAGCCTCTGAAAATCGAAGGTTATCAATATATTGGTTATATCAAAACTAAGAAACA GGATAATACAGAGCTTTCAAGGACAGTTGATGGGAAATACTCTGCTCAAAGAGATAGTCAACCAAACTC TACAAAAACATCAGATGTAGTTCATTCAGCTGATTTAGAATGGAACCAAGGACAGGGGAAGGTTAGTTT ACAAGGTGAAGCATCAGGGGATGATGGACTTTCAGAAAAATCTTCTATAGCAGCAGACAATCTATCTTC TAATGATTCATTCGCAAGTCAAGTTGAGCAGAATCCGGATCACAAAGGAGAATCTGTAGTTCGACCAAC AGTGCCAGAACAAGGAAATCCTGTGTCTGCTACAACGGTGCAGAGTGCGGAAGAGGAAGTATTGGCGAC GACAAATGATCGACCAGAGTATAAACTTCCATTGGAAACCAAAGGCACGCAAGAACCCGGTCATGAGGG TGAAGCCGCAGTCCGTGAAGACTTACCAGTCTACACTAAGCCACTAGAAACCAAAGGTACACAAGGACC CGGACATGAAGGTGAAGCTGCAGTTCGCGAGGAAGAACCAGCTTACACAGAACCGTTAGCAACGAAAGG CACGCAAGAGCCAGGTCATGAGGGCAAAGCTACAGTCCGCGAAGAGACTCTAGAGTACACGGAACCGGT AGCGACAAAAGGCACACAAGAACCCGAACATGAGGGCGAaCGGsCAGTAGAAGAAGAACTTCCGGCTTT AGAGGTCACTACACGAAATAGAACGGAAATCCAGAATATTCCTTATACAACAGAAGAAATTCAGGATCC AACACTTCTGAAAAATCGTCGTAAGATTGAACGACAAGGGCAAGCAGGGACACGTACAATTCAATATGA AGACTACATCGTAAATGGTAATGTCGTAGAAACTAAAGAAGTGTCACGAACTGAAGTAGCTCCGGTCAA CGAAGTCGTTAAAGTAGGAACACTTGTGAAAGTTAAACCTACAGTAGAAATTACAAACTTAACAAAAGT TGAGAACAAAAAATCTATAACTGTAAGTTATAACTTAATAGACACTACCTCAGCATATGTTTCTGCAAA AACGCAAGTTTTCCATGGAGACAAGCTAGTTAAAGAGGTGGATATAGAAAATCCTGCCAAAGAGCAAGT 
AATATCAGGTTTAGATTACTACACACCGTATACAGTTAAAACACACCTAACTTATAATTTGGGTGAAAA TAATGAGGAAAATACTGAAACATCAACTCAAGATTTCCAATTAGAGTATAAGAAAATAGAGATTAAAGA TATTGATTCAGTAGAATTATACGGTAAAGAAAATGATCGTTATCGTAGA SP072 amino acid (SEQ ID NO:120) FNPTVGTFLFTAGLSLLVLLVSKRENGKKRLVHFLLLTSMGVQLLPASAFGLTSQILSAYNSQLSIGVG EHLPEPLKIEGYQYIGYIKTKKQDNTELSRTVDGKYSAQRDSQPNSTKTSDVVHSADLEWNQGQGKVSL QGEASGDDGLSEKSSIAADNLSSNDSFASQVEQNPDHKGESVVRPTVPEQGNPVSATTVQSAEEEVLAT TNDRPEYKLPLETKGTQEPGHEGEAAVREDLPVYTKPLETKGTQGPGHEGEAAVREEEPAYTEPLATKG TQEPGHEGKATVREETLEYTEPVATKGTQEPEHEGERXVEEELPALEVTTRNRTEIQNIPYTTEEIQDP TLLKNRRKIERQGQAGTRTIQYEDYIVNGNVVETKEVSRTEVAPVNEVVKVGTLVKVKPTVEITNLTKV ENKKSITVSYNLIDTTSAYVSAKTQVFHGDKLVKEVDIENPAKEQVTSGLDYYTPYTVKTHLTYNLGEN NEENTETSTQDFQLEYKKIEIKDIDSVELYGKENDRYRR SP073 nucleotide (SEQ ID NO:121) TCGTAGATATTTAAGTCTAAGTGAAGCGCCGACTGATACGGCTAAATACTTTGTAAAAGTGAAATCAGA TCGCTTCAAAGAAATGTACCTACCTGTAAAATCTATTACAGAAAATACGGATGGAACGTATAAAGTGAC GGTAGCCGTTGATCAACTTGTCGAAGAAGGTACAGACGGTTACAAAGATGATTACACATTTACTGTAGC TAAATCTAAAGCAGAGCAACCAGGAGTTTACACATCCTTTAAACAGCTGGTAACAGCCATGCAAAGCAA TCTGTCTGGTGTCTATACATTGGCTTCAGATATGACCGCAGATGAGGTGAGCTTAGGCGATAAGCAGAC AAGTTATCTCACAGGTGCATTTACAGGGAGCTTGATCGGTTCTGATGGAACAAAATCGTATGCCATTTA TGATTTGAAGAAACCATTATTTGATACATTAAATGGTGCTACAGTTAGAGATTTGGATATTAAAACTGT TTCTGCTGATAGTAAAGAAAATGTCGCAGCGCTGGCGAAGGCAGCGAATAGCGCGAATATTAATAATGT TGCAGTAGAAGGAAAAATCTCAGGTGCGAAATCTGTTGCGGGATTAGTAGCGAGCGCAACAAATACAGT GATAGAAAACAGCTCGTTTACAGGGAAACTTATCGCAAATCACCAGGACAGTAATAAAAATGATACTGG AGGAATAGTAGGTAATATAACAGGAAATAGTTCGAGAGTTAATAAAGTTAGGGTAGATGCCTTAATCTC TACTAATGCACGCAATAATAACCAAACAGCTGGAGGGATAGTAGGTAGATTAGAAAATGGTGCATTGAT ATCTAATTCGGTTGCTACTGGAGAAATACGAAATGGTCAAGGATATTCTAGAGTCGGAGGAATAGTAGG ATCTACGTGGCAAAACGGTCGAGTAAATAATGTTGTGAGTAACGTAGATGTTGGAGATGGTTATGTTAT CACCGGTGATCAATACGCAGCAGCAGATGTGAAAAATGCAAGTACATCAGTTGATAATAGAAAAGCAGA CAGATTCGCTACAAAATTATCAAAAGACCAAATAGACGCGAAAGTTGCTGATTATGGAATCACAGTAAC TCTTGATGATACTGGGCAAGATTTAAAACGTAATCTAAGAGAAGTTGATTATACAAGACTAAATAAAGC 
AGAAGCTGAAAGAAAAGTAGCTTATAGCAACATAGAAAAACTGATGCCATTCTACAATAAAGACCTAGT AGTTCACTATGGTAACAAAGTAGCGACAACAGATAAACTTTACACTACAGAATTGTTAGATGTTGTGCC GATGAAAGATGATGAAGTAGTAACGGATATTAATAATAAGAAAAATTCAATAAATAAAGTTATGTTACA TTTCAAAGATAATACAGTAGAATACCTAGATGTAACATTCAAAGAAAACTTCATAAACAGTCAAGTAAT CGAATACAATGTTACAGGAAAAGAATATATATTCACACCAGAAGCATTTGTTTCAGACTATACAGCGAT AACGAATAACGTACTAAGCGACTTGCAAAATGTAACACTTAAC SP073 amino acid (SEQ ID NO:122) RRYLSLSEAPTDTAKYFVKVKSDRFKEMYLPVKSITENTDGTYKVTVAVDQLVEEGTDGYKDDYTFTVA KSKAEQPGVYTSFKQLVTAMQSNLSGVYTLASDMTADEVSLGDKQTSYLTGAFTGSLIGSDGTKSYAIY DLKKPLFDTLNGATVRDLDIKTVSADSKENVAALAKAANSANINNVAVEGKISGAKSVAGLVASATNTV IENSSFTGKLIANHQDSNKNDTGGIVGNITGNSSRVNKVRVDALISTNARNNNQTAGGIVGRLENGALI SNSVATGEIRNGQGYSRVGGIVGSTWQNGRVNVVVSNVDVGDGYVITGDQYAAADVKNASTSVDNRKAD RFATKLSKDQIDAKVADYGITVTLDDTGQDLKRNLREVDYTRLNKAEAERKVAYSNIEKLMPFYNKDLV VHYGNKVATTDKLYTTELLDVVPMKDDEVVTDINNKKNSINKVMLHFKDNTVEYLDVTFKENFINSQVI EYNVTGKEYIFTPEAFVSDYTAITNNVLSDLQNVTLN SP074 nucleotide (SEQ ID NO:123) CTTTGGTTTTGAAGGAAGTAAGCGTGGACAATTTGCTGTAGAAGGAATCAATCAACTTCGTGAGCATGT AGACACTCTATTGATTATCTCAAACAACAATTTGCTTGAAATTGTTGATAAGAAAACACCGCTTTTGGA GGCTCTTAGCGAAGCGGATAACGTTCTTCGTCAAGGTGTTCAAGGGATTACCGATTTGATTACCAATCC AGGATTGATTAACCTTGACTTTGCCGATGTGAAAACGGTAATGGCAAACAAAGGGAATGCTCTTATGGG TATTGGTATCGGTAGTGGAGAAGAACGTGTGGTAGAAGCGGCACGTAAGGCAATCTATTCACCACTTCT TGAAACAACTATTGACGGTGCTGAGGATGTTATCGTCAACGTTACTGGTGGTCTTGACTTAACCTTGAT TGAGGCAGAAGAGGCTTCACAAATTGTGAACCAGGCAGCAGGTCAAGGAGTGAACATCTGGCTCGGTAC TTCAATTGATGAAAGTATGCGTGATGAAATTCGTGTAACAGTTGTTGCAACGGGTGTTCGTCAAGACCG CGTAGAAAAGGTTGTGGCTCCACAAGCTAGATCTGCTACTAACTACCGTGAGACAGTGAAACCAGCTCA TTCACATGGCTTTGATCGTCATTTTGATATGGCAGAAACAGTTGAATTGCCAAAACAAAATCCACGTCG TTTGGAACCAACTCAGGCATCTGCTTTTGGTGATTGGGATCTTCGCCGTGAATCGATTGTTCGTACAAC AGATTCAGTCGTTTCTCCAGTCGAGCGCTTTGAAGCCCCAATTTCACAAGATGAAGATGAATTGGATAC ACCTCCATTTTTCAAAAATCGT SP074 amino acid (SEQ ID NO:124) FGFEGSKRGQFAVEGINQLREHVDTLLIISNNNLLEIVDKKTPLLEALSEADNVLRQGVQGITDLITNP 
GLINLDFADVKTVNANKGNALMGIGIGSGEERVVEAARKAIYSPLLETTIDGAEDVIVNVTGGLDLTLI EAEEASQIVNQAAGQGVNIWLGTSIDESMRDEIRVTVVATGVRQDRVEKVVAPQARSATNYRETVKPAH SHGFDRHFDMAETVELPKQNPRRLEPTQASAFGDWDLRRESIVRTTDSVVSPVERFEAPISQDEDELDT PPFFKNR SP075 nucleotide (SEQ ID NO:125) CTACTACCTCTCGAGAGAAAGTGACCTAGAGGTGACCGTTTTTGACCATGAGCAAGGTCAAGCCACCAA GGCCGCAGCAGGAATTATCAGTCCTTGGTTTTCCAAACGCCGTAATAAAGCCTGGTACAAGATGGCGCG CTTGGGGGCTGATTTTTATGTGGATTTATTAGCTGATTTAGAGAAATCAGGACAAGAAATCGACTTTTA CCAGCGTTCGGGAGTCTTTCTCTTGAAAAAGGATGAATCCAATTTGGAAGAACTTTATCAACTGGCCCT CCAGCGCAGAGAAGAATCTCCCTTGATAGGGCAATTAGCCATTCTGAACCAAGCCTCAGCTAATGAATT ATTCCCTGGTTTGCAGGGATTTGACCGCCTGCTCTATGCTTCTGGTGGAGCGAGAGTAGATGGCCAACT TTTAGTGACTCGTTTGCTGGAAGTCAGTCATGTCAAGCTGGTCAAAGAAAAAGTGACTCTGACACCGTT AGCATCAGGCTACCAGATTGGTGAAGAGGAGTTTGAGCAGGTTATTTTGGCGACGGGAGCTTGGTTGGG GGACATGTTAGAGCCTTTAGGTTATGAAGTGGATGTCCGTCCTCAAAAAGGACAACTACGAGATTATCA GCTTGCCCAAGACATGGAAGATTACCCTGTTGTCATGCCAGAAGGGGAGTGGGATTTGATTCCCTTTGC AGGTGGGAAATTATCCTTAGGCGCTACCCACGAAAATGACATGGGATTTGATTTGACGGTAGATGAAAC CTTGCTCCAACAAATGGAGGAGGCCACCTTGACTCACTATCTGATTTTGGCTGAAGCTACTTCAAAATC TGAGCGTGTTGGAATCCGTGCCTACACCAGTGATTTCTCTCCTTTCTTTGGGCAGGTGCCTGACTTAAC TGGTGTCTATGCAGCCAGTGGACTAGGTTCATCAGGCCTCACAACTGGTCCTATCATTGGTTACCATCT AGCCCAACTGATCCAAGACAAGGAGTTGACCTTGGACCCTCTAAATTACCCAATTGAAAACTATGTCAA ACGAGTAAAAAGCGAA SP075 amino acid (SEQ ID NO:126) YYLSRESDLEVTVFDHEQGQATKAAAGIISPWFSKRRNKAWYKMARLGADFYVDLLADLEKSGQEIDFY QRSGVFLLKKDESNLEELYQLALQRREESPLIGQLAILNQASANELFPGLQGFDRLLYASGGARVDGQL LVTRLLEVSHVKLVKEKVTLTPLASGYQIGEEEFEQVILATGAWLGDMLEPLGYEVDVRPQKGQLRDYQ LAQDMEDYPVVMPEGEWDLIPFAGGKLSLGATHENDMGFDLTVDETLLQQMEEATLTHYLILAEATSKS ERVGIRAYTSDFSPFFGQVPDLTGVYAASGLGSSGLTTGPIIGYHLAQLIQDKELTLDPLNYPIENYVK RVKSE SP076 nucleotide (SEQ ID NO:127) TAAGGTCAAAAGTCAGACCGCTAAGAAAGTGCTAGAAAAGATTGGAGCTGACTCGGTTATCTCGCCAGA GTATGAAATGGGGCAGTCTCTAGCACAGACCATTCTTTTCCATAATAGTGTTGATGTCTTTCAGTTGGA TAAAAATGTGTCTATCGTGGAGATGAAAATTCCTCAGTCTTGGGCAGGTCAAAGTCTGAGTAAATTAGA 
CCTCCGTGGCAAATACAATCTGAATATTTTGGGTTTCCGAGAGCAGGAAAATTCCCCATTGGATGTTGA ATTTGGACCAGATGACCTCTTGAAAGCAGATACCTATATTTTGGCAGTCATCAACAACCAGTATTTGGA TACCCTA SP076 amino acid (SEQ ID NO:128) KVKSQTAKKVLEKIGADSVISPEYEMGQSLAQTILFHNSVDVFQLDKNVSIVEMKIPQSWAGQSLSKLD LRGKYNLNILGFREQENSPLDVEFGPDDLLKADTYILAVINNQYLDTL SP077 nucleotide (SEQ ID NO:129) TGACGGGTCTCAGGATCAGACTCAGGAAATCGCTGAGTGTTTAGCTAGCAAGTATCCTAATATCGTTAG AGCCATCTATCAGGAAAATAAATGCCATGGCGGTGCGGTCAATCGTGGCTTGGTAGAGGCTTCTGGGCG CTATTTTAAAGTAGTTGACAGTGATGACTGGGTGGATCCTCGTGCCTACTTGAAAATTCTTGAAACTTG CAGGAACTTGAGAGCAAAGGTCAAGAGGTGGATGTCTTTG SP077 amino acid (SEQ ID NO:130)

DGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLVEASGRYFKVVDSDDWVDPRAYLKILETC RNLRAKVKRWMSL SP078 nucleotide (SEQ ID NO:131) TAGAGGCTTTGCCAAATGGTGGGAAGGGCACGAGCGTCGAAAAGAGGAACGCTTTGTCAAACAAGAAGA AAAAGCTCGCCAAAAGGCTGAGAAAGAGGCTAGATTAGAACAAGAAGAGACTGAAAAAGCCTTACTCGA TTTGCCTCCTGTTGATATGGAAACGGGTGAAATTCTGACAGAGGAAGCTGTTCAAAATCTTCCACCTAT TCCAGAAGAAAAGTGGGTGGAACCAGAAATCATCCTGCCTCAAGCTGAACTTAAATTCCCTGAACAGGA AGATGACTCAGATGACGAAGATGTTCAGGTCGATTTTTCAGCCAAAGAAGCCCTTGAATACAAACTTCC AAGCTTACAACTCTTTGCACCAGATAAACCAAAAGATCAGTCTAAAGAGAAGAAAATTGTCAGAGAAAA TATCAAAATCTTAGAAGCAACCTTTGCTAGCTTTGGTATTAAGGTAACAGTTGAACGGGCCGAAATTGG GCCATCAGTGACCAAGTATGAAGTCAAGCCGGCTGTTGGTGTAAGGGTCAACCGCATTTCCAATCTATC AGATGACCTCGCTCTAGCCTTGGCTGCCAAAGATGTCCGGATTGAAGCACCAATCCCTGGGAAATCCCT AATCGGAATTGAAGTGCCCAACTCCGATATTGCCACTGTATCTTTCCGAGAACTATGGGAACAATCGCA AACGAAAGCAGAAAATTTCTTGGAAATTCCTTTAGGGAAGGCTGTTAATGGAACCGCAAGAGCTTTTGA CCTTTCTAAAATGCCCCACTTGCTAGTTGCAGGTTCAACGGGTTCAGGGAAGTCAGTAGCAGTTAACGG CATTATTGCTAGCATTCTCATGAAGGCGAGACCAGATCAAGTTAAATTTATGATGGTCGATCCCAAGAT GGTTGAGTTATCTGTTTACAATGATATTCCCCACCTCTTGATTCCAGTCGTGACCAATCCACGCAAAGC CAGCAAGGCTCTGCAAAAGGTTGTGGATGAAATGGAAAACCGTTATGAACTCTTTGCCAAGGTGGGAGT TCGGAATATTGCAGGTTTTAATGCCAAGGTAGAAGAGTTCAATTCCCAGTCTGAGTACAAGCAAATTCC GCTACCATTCATTGTCGTGATTGTGGATGAGTTGGCTGACCTCATGATGGTGGCCAGCAAGGAAGTGGA AGATGCTATCATCCGTCTTGGGCAGAAGGCGCGTGCTGCAGGTATCCACATGATTCTTGCAACTCAGCG TCCATCTGTTGATGTCATCTCTGGTTTGATTAAGGCCAATGTTCCATCTCGTGTAGCATTTGCGGTTTC ATCAGGAACAGACTCCCGTACGATTTTGGATGAAAATGGAGCAGAAAAACTTCTTGGTCGAGGAGACAT GCTCTTTAAACCGATTGATGAAAATCATCCAGTTCGTCTCCAAGGCTCCTTTATCTCGGATGACGATGT TGAGCGCATTGTGAACTTCATCAAGACTCAGGCAGATGCAGACTACGATGAGAGTTTTGATCCAGGTGA GGTTTCTGAAAATGAAGGAGAATTTTCGGATGGAGATGCTGGTGGTGATCCGCTTTTTGAAGAAGCTAA GTCTTTGGTTATCGAAACACAGAAAGCCAGTGCGTCTATGATTCAGCGTCGTTTATCAGTTGGATTTAA CCGTGCGACCCGTCTCATGGAAGAACTGGAGATAGCAGGTGTCATCGGTCCAGCTGAAGGTACCAAACC TCGAAAAGTGTTACAACAA SP078 amino acid (SEQ ID NO:132) RGFAKWWEGHERRKEERFVKQEEKARQKAEKEARLEQEETEKALLDLPPVDMETGEILTEEAVQNLPPI 
PEEKWVEPEIILPQAELKFPEQEDDSDDEDVQVDFSAKEALEYKLPSLQLFAPDKPKDQSKEKKIVREN IKILEATFASFGIKVTVERAEIGPSVTKYEVKPAVGVRVNRISNLSDDLALALAAKDVRIEAPIPGKSL IGIEVPNSDIATVSFRELWEQSQTKAENFLEIPLGKAVNGTARAFDLSKMPHLLVAGSTGSGKSVAVNG IIASILMKARPDQVKFMMPVDPKMVELSVYNDIPHLLIPWTNPRKASKALQKVVDEMENRYELFAKVGV PNIAGFNAKVEEFNSQSEYKQIPLPFIVVIVDELADLMMVASKEVEDAIIRLGQKARAAGIHMILATQR PSVDVISGLIKAAVPSRVAFAVSSGTDSRTILDENGAEKLLGRGDMLFKPIDENHPVRLQGSFISDDDV ERIVNFIKTQADADYDESFDPGEVSENEGEFSDGDAGGDPLFEEAKSLVIETQKASASMIQRRLSVGFN RATRLMEELEIAGVIGPAEGTKPRKVLQQ SP079 nucleotide (SEQ ID NO:133) TCAAAAAGAGAAGGAAAACTTGGTTATTGCTGGGAAAATAGGTCCAGAACCAGAAATTTTGGCCAATAT GTATAAGTTGCTGATTGAAGAAAATACCAGCATGACTGCGACTGTTAAACCGAATTTTGGGAAGACAAG CTTCCTTTATGAAGCTCTGAAAAAAGGCGATATTGACATCTATCCTGAATTTACTGGTACGGTGACTGA AAGTTTGCTTCAACCATCACCCAAGGTGAGTCATGAACCAGAACAGGTTTATCAGGTGGCGCGTGATGG CATTGCTAAGCAGGATCATCTAGCCTATCTCAAACCCATGTCTTATCAAAACACCTATGCTGTAGCTGT TCCGAAAAAGATTGCTCAAGAATATGGCTTGAAGACCATTTCAGACTTGAAAAAAGTGGAAGGGCAGTT GAAGGCAGGTTTTACACTCGAGTTTAACGACCGTGAAGATGGAAATAAGGGCTTGCAATCAATGTATGG TCTCAATCTCAATGTAGCGACCATTGAGCCAGCCCTTCGCTATCAGGCTATTCAGTCAGGGGATATTCA AATCACGGATGCCTATTCGACTGATGCGGAATTGGAGCGTTATGATTTACAGGTCTTGGAAGATGACAA GCAACTCTTCCCACCTTATCAAGGGGCTCCACTCATGAAAGAAGCTCTTCTCAAGAAACACCCAGAGTT GGAAAGAGTTCTTAATACATTGGCTGGTAAGATTACAGAAAGCCAGATGAGCCAGCTCAACTACCAAGT CGGTGTTGAAGGCAAGTCAGCAAAGCAAGTAGCCAAGGAGTTTCTCCAAGAACAAGGTTTGTTGAAGAA A SP079 amino acid (SEQ ID NO:134) QKEKENLVIAGKIGPEPEILANMYKLLIEENTSMTATVKPNFGKTSFLYEALKKGDIDIYPEFTGTVTE SLLQPSPKVSHEPEQVYQVARDGIAKQDHLAYLKPMSYQNTYAVAVPKKIAQEYGLKTISDLKKVEGQL KAGFTLEFNDREDGNKGLQSMYGLNLNVATIEPALRYQAIQSGDIQITDAYSTDAELERYDLQVLEDDK QLFPPYQGAPLMKEALLKKHPELERVLNTLAGKITESQMSQLNYQVGVEGKSAKQVAKEFLQEQGLLKK SP080 nucleotide (SEQ ID NO:135) ACGTTCTATTGAGGACCACTTTGATTCAAACTTCGAATTGGAATATAACCTCAAAGAAAAAGGGAAAAC AGATCTTTTGAAGCTAGTTGATAAAACAACTGACATGCGTCTGCATTTTATCCGCCAAACTCATCCACG CGGTCTCGGAGATGCTGTTTTGCAAGCCAAGGCTTTCGTCGGAAATGAACCTTTTGTCGTTATGCTTGG 
TGATGACTTGATGGATATCACAGACGAAAAGGCTGTTCCACTTACCAAACAACTCATGGATGACTACGA GCGTACCCACGCGTCTACTATCGCTGTCATGCCAGTCCCTCATGACGAAGTATCTGCTTACGGGGTTAT TGCTCCGCAAGGCGAAGGAAAAGATGGTCTTTACAGTGTTGAAACCTTTGTTGAAAAACCAGCTCCAGA GGACGCTCCTAGCGACCTTGCTATTATCGGACGCTACCTCCTCACGCCTGAAATTTTTGAGATTCTCGA AAAGCAAGCTCCAGGTGCAGGAAATGAAATTCAGCTGACAGATGCAATCGACACCCTCAATAAAACACA ACGTGTATTTGCTCGTGAGTTCAAAGGGGCTCGTTACGATGTCGGAGACAAGTTTGGCTTCATGAAAAC ATCCATCGACTACGCCCTCAAACACCCACAAGTCAAAGATGATTTGAAGAATTACCTCATCCAACTTGG AAAAGAATTGACTGAGAAGGAA SP080 amino acid (SEQ ID NO:136) RSIEDHFDSNFELEYNLKEKGKTDLLKLVDKTTDMRLHFIRQTHPRGLGDAVLQAKAFVGNEPFVVMLG DDLMDITDEKAVPLTKQLMDDYERTHASTIAVMPVPHDEVSAYGVIAPQGEGKDGLYSVETFVEKPAPE DAPSDLAIIGRYLLTPEIFEILEKQAPGAGNEIQLTDAIDTLNKTQRVFAREFKGARYDVGDKFGFMKT SIDYALKHPQVKDDLKNYLIQLGKELTEKE SP081 nucleotide (SEQ ID NO:137) CGCTCAAAATACCAGAGGTGTTCAGCTAATCGAGCACGTTTCTCCTCAAATGTTGAAAGCCCAATTGGA GAGTGTCTTTTCTGATATTCCACCTCAGGCTGTAAAAACTGGAATGTTGGCTACTACTGAAATCATGGA AATCATCCAACCCTATCTTAAAAAACTGGATTGTCCCTATGTCCTTGATCCTGTTATGGTTGCTACAAG TGGAGATGCCTTGATTGACTCAAATGCTAGAGACTATCTCAAAACAAACTTACTACCTCTAGCAACTAT TATTACGCCAAATCTTCCTGAAGCAGAAGAGATTGTTGGTTTTTCAATCCATGACCCCGAAGACATGCA GCGTGCTGGTCGCCTGATTTTAAAAGAATTTGGTCCTCAGTCTGTGGTTATCAAAGGCGGACATCTCAA AGGTGGTGCTAAAGATTTCCTCTTTACCAAGAATGAACAATTTGTCTGGGAAAGCCCACGAATTCAAAC CTGTCACACCCATGGTACT SP081 amino acid (SEQ ID NO:138) AQNTRGVQLIEHVSPQMLKAQLESVFSDIPPQAVKTGMLATTEIMEIIQPYLKKLDCPYVLDPVMVATS GDALIDSNARDYLKTNLLPLATIITPNLPEAEEIVGFSIHDPEDMQRAGRLILKEFGPQSVVIKGGHLK GGAKDFLFTKNEQFVWESPRIQTCHTHGT SP082 nucleotide (SEQ ID NO:139) AATTGTACAATTAGAAAAAGATAGCAAATCAGACAAAGAACAAGTTGATAAACTATTTGAATCATTTGA TGCATCTTCAGATGAATCTATTTCTAAATTAAAAGAACTATCTGAAACTTCACTTAAAACCGATGCAGG TAAAGACTATCTTAATAACAAAGTCAAAGAATCATCTAAAGCAATTGTAGATTTTCATTTGCAAAAAGG TTTGGCTTATGATGTTAAAGATTCAGATGACAAATTTAAAGATAAAGCAACTCTTGAAACAAATGTAAA AGAAATTACAAAACAAATTGATTTTATCAAAAAAGTTGATGAAACTTTTAAACAAGAGAATTTGGAAGA AACTCTTAAATCTCTAAATGATCTTGTTGATAAATATCAAAAACAAATCGAACTTTTGAAGAAAGAAGA 
AGAAAAAGCTGCTGAAAAAGCTGCTGAAAAAGCAAAGGAATCTTCTAGTCAAAGTAATTCTTCTGGTAG TGCTTCTAATGAGTCTTATAATGGATCTTCCAATTCAAATGTAGATTATAGTTCATCTGAACAAACTAA TGGATATTCAAATAATTATGGCGGTCAAGATTATTCTGGTTCAGGAGATAGTTCAACAAATGGTGGATC ATCAGAACAATATTCATCTAGCAATTCAAACAGCGGAGCAAATAATGTCTACAGATATAAAGGCACTGG TGCTGACGGCTATCAAAGATACTACTACAAAGATCATAATAATGGAGATGTGTATGATGACGATGGAAA TTACCTTGGGAACTTTGGTGGCGGCATTGCAGAACCTAGTCAACGC SP082 amino acid (SEQ ID NO:140) IVQLEKDSKSDKEQVDKLFESFDASSDESISKLKELSETSLKTDAGKDYLNNKVKESSKAIVDFHLQKG LAYDVKDSDDKFKDKATLETNVKEITKQIDFIKKVDETFKQENLEETLKSLNDLVDKYQKQIELLKKEE EKAAEKAAEKAKESSSQSNSSGSASNESYNGSSNSNVDYSSSEQTNGYSNNYGGQDYSGSGDSSTNGGS SEQYSSSNSNSGAVVVYRYKGTGADGYQRYYYKDHNNGDVYDDDGNYLGNFGGGIAEPSQR SP083 nucleotide (SEQ ID NO:141) TCTGACCAAGCAAAAAGAAGCAGTCAATGACAAAGGAAAAGCAGCTGTTGTTAAGGTGGTGGAAAGCCA GGCAGAACTTTATAGCTTAGAAAAGAATGAAGATGCTAGCCTAAGAAAGTTACAAGCAGATGGACGCAT CACGGAAGAACAGGCTAAAGCTTATAAAGAATACAATGATAAAAATGGAGGAGCAAATCGTAAAGTCAA TGAT SP083 amino acid (SEQ ID NO:142) LTKQKEAVNDKGKAAVVKVVESQAELYSLEKNEDASLRKLQADGRITEEQAKAYKEYNDKNGGANRKVN D SP084 nucleotide (SEQ ID NO:143) GTCCGGCTCTGTCCAGTCCACTTTTTCAGCGGTAGAGGAACAGATTTTCTTTATGGAGTTTGAAGAACT CTATCGGGAAACCCAAAAACGCAGTGTAGCCAGTCAGCAAAAGACTAGTCTGAACTTAGATGGGCAGAC GCTTAGCAATGGCAGTCAAAAGTTGCCAGTCCCTAAAGGAATTCAGGCCCCATCAGGCCAAAGTATTAC ATTTGACCGAGCTGGGGGCAATTCGTCCCTGGCTAAGGTTGAATTTCAGACCAGTAAAGGAGCGATTCG CTATCAATTATATCTAGGAAATGGAAAAATTAAACGCATTAAGGAAACAAAAAAT SP084 amino acid (SEQ ID NO:144) SGSVQSTFSAVEEQIFFMEFEELYRETQKRSVASQQKTSLNLDGQTLSNGSQKLPVPKGIQAPSGQSIT FDRAGGNSSLAKVEFQTSKGAIRYQLYLGNGKIKRIKETKN SP085 nucleotide (SEQ ID NO:145) GGGACAAATTCAAAAAAATAGGCAAGAGGAAGCAAAAATCTTGCAAAAGGAAGAAGTCTTGAGGGTAGC TAAGATGGCCCTGCAGACGGGGCAAAATCAGGTAAGCATCAACGGAGTTGAGATTCAGGTATTTTCTAG TGAAAAAGGATTGGAGGTCTACCATGGTTCAGAACAGTTGTTGGCAATCAAAGAGCCA SP085 amino acid (SEQ ID NO:146) GQIQKNRQEEAKILQKEEVLRVAKMALQTGQNQVSINGVEIQVFSSEKGLEVYHGSEQLLAIKEP SP086 nucleotide (SEQ ID NO:147) 
TCGCTACCAGCAACAAAGCGAGCAAAAGGAGTGGCTCTTGTTTGTGGACCAACTTGAGGTAGAATTAGA CCGTTCGCAGTTCGAAAAAGTAGAAGGCAATCGCCTATACATGAAGCAAGATGGCAAGGACATCGCCAT CGGTAAGTCAAAGTCAGATGATTTCCGTAAAACGAATGCTCGTGGTCGAGGTTATCAGCCTATGGTTTA TGGACTCAAATCTGTACGGATTACAGAGGACAATCAACTGGTTCGCTTTCATTTCCAGTTCCAAAAAGG CTTAGAAAGGGAGTTCATCTATCGTGTGGAAAAAGAAAAAAGT SP086 amino acid (SEQ ID NO:148) RYQQQSEQKEWLLFVDQLEVELDRSQFEKVEGNRLYMKQDGKDIAIGKSKSDDFRKTNARGRGYQPMVY GLKSVRITEDNQLVRFHFQFQKGLEREFIYRVEKEKS SP087 nucleotide (SEQ ID NO:149) GAACCGACAAGTCGCCCACTATCAAGACTATGCTTTGAATAAAGAAAAATTGGTTGCTTTTGCTATGGC TAAACGAACCAAAGATAAGGTTGAGCAAGAAAGTGGGGAACAGTTTTTTAATCTAGGTCAGGTAAGCTA TCAAAACAAGAAAACTGGCTTAGTGACGAGGGTTCGTACGGATAAGAGCCAATATGAGTTTCTGTTTCC TTCAGTCAAAATCAAAGAAGAGAAAAGAGATAAAAAGGAAGAGGTAGCGACCGATTCAAGCGAAAAAGT GGAGAAGAAAAAATCAGAAGAGAAGCCTGAAAAGAAAGAGAATTCA SP087 amino acid (SEQ ID NO:150) NRQVAHYQDYALNKEKLVAFAMAKRTKDKVEQESGEQFFNLGQVSYQNKKTGLVTRVRTDKSQYEFLFP SVKIKEEKRDKKEEVATDSSEKVEKKKSEEKPEKKENS SP088 nucleotide (SEQ ID NO:151) GGTTGTCGGCTGGCAATATATCCCGTTTCCATCTAAAGGTAGTACAATTGGTCCTTACCCAAATGGTAT CAGATTAGAAGGTTTTCCAAAGTCAGAGTGGTACTACTTCGATAAAAATGGAGTGCTACAAGAGTTTGT TGGTTGGAAAACATTAGAGATTAAAACTAAAGACAGTGTTGGAAGAAAGTACGGGGAAAAACGTGAAGA TTCAGAAGATAAAGAAGAGAAGCGTTATTATACGAACTATTACTTTAATCAAAATCATTCTTTAGAGAC AGGTTGGCTTTATGATCAGTCTAACTGGTATTATCTAGCTAAGACGGAAATTAATGGAGAAAACTACCT TGGTGGTGAAAGACGTGCGGGGTGGATAAACGATGATTCGACTTGGTACTACCTAGATCCAACAACTGG TATTATGCAAACAGGTTGGCAATATCTAGGTAATAAGTGGTACTACCTCCGTTCCTCAGGAGCAATGGC CACTGGCTGGTATCAGGAAGGTACCACTTGGTATTATTTAGACCACCCAAATGGCGATATGAAAACAGG TTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTGGTTGGTATCA AGATGGTTCAACTTGGTACTACCTAAATGCAGGTAATGGAGACATGAAGACAGGTTGGTTCCAGGTCAA TGGCAACTGGTACTATGCTTATAGCTCAGGTGCTTTGGCAGTGAATACGACCGTAGATGGCTATTCTGT CAACTATAATGGCGAATGGGTTCGG SP088 amino acid (SEQ ID NO:152) VVGWQYIPFPSKGSTIGPYPNGIRLEGFPKSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGEKRED SEDKEEKRYYTNYYFNQNHSLETGWLYDQSNWYYLAKTEINGENYLGGERRAGWINDDSTWYYLDPTTG 
IMQTGWQYLGNKWYYLRSSGAMATGWYQEGTTWYYLDHPNGDMKTGWQNLGNKWYYLRSSGAMATGWYQ DGSTWYYLNAGNGDMKTGWFQVNGNWYYAYSSGALAVNTTVDGYSVYYNGEWVR SP089 nucleotide (SEQ ID NO:153) GGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCTTGACCAAGATGGAAAGATGAAAAG AAATGCTTGGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGACTGGGTCTATGA TTCTCAATACGATGCTTGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTCCAAAT TAAAGGGAAGGACTATTATTTCAAATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTA TGTGAATGCTAGTGGTGCCAAAGTACAGCAAGGTTGGCTTTTTGACAAACAATACCAATCTTGGTTTTA CATCAAAGAAAATGGAAACTATGCTGATAAAGAATGGATTTTCGAGAATGGTCACTATTATTATCTAAA ATCCGGTGGCTACATGGCAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATTTGA TGGGAAAATGGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGG TGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATCTGATGGGAA AATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTA CATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTACCTCAAATCTGATGGGAAAATAGC TGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCTGGTGGCTACATGGC GAAAAATGAGACAGTAGATGGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGAGGAAAAACTACAAA TGAAAATGCTGCTTACTATCAAGTAGTGCCTGTTACAGCCAATGTTTATGATTCAGATGGTGAAAAGCT TTCCTATATATCGCAAGGTAGTGTCGTATGGCTAGATAAGGATAGAAAAAGTGATGACAAGCGCTTGGC TATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGATTTACAAGCGCTAGATGCTAGTAAGGA CTTTATCCCTTATTATGAGAGTGATGGCCACCGTTTTTATCACTATGTGGCTCAGAATGCTAGTATCCC AGTAGCTTCTCATCTTTCTGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCCTGCATTTTGA TGGTTTTAAGCTTGAGAATCCCTTCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGAAGA ATTGGATAAGGTATTTAGTTTGCTAAACATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAA GGAAGCCGAAGAACATTACCATATCAATGCTCTTTATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTG GGGAAGAAGTAAAATTGCCAAAGATAAGAATAATTTCTTTGGCATTACAGCCTATGATACGACCCCTTA CCTTTCTGCTAAGACATTTGATGATGTGGATAAGGGAATTTTAGGTGCAACCAAGTGGATTAAGGAAAA TTATATCGATAGGGGAAGAACTTTCCTTGGAAACAAGGCTTCTGGTATGAATGTGGAATATGCTTCAGA CCCTTATTGGGGCGAAAAAATTGCTAGTGTGATGATGAAAATCAATGAGAAG SP089 amino acid (SEQ ID NO:154) AKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIKADGQHAEKEWLQI 
KGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLK SGGYMAANEWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGK IAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYDSHSQAWYYFKSGGYMA KNETVDGYQLGSDGKWLGGKTTNENAAYYQVVPVTANVYDSDGEKLSYISQGSVVWLDKDRKSDDKRLA ITISGLSGYMKTEDLQALDASKDFIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFD GFKLENPFLFKDLTEATNYSAEELDKVFSLLNINNSLLENKGATFKEAEEHYHINALYLLAHSALESNW GRSKIAKDKNNFFGITAYDTTPYLSAKTFDDVDKGILGATKWIKENYIDRGRTFLGNKASGMNVEYASD PYWGEKIASVMMKINEK SP090 nucleotide (SEQ ID NO:155) ATTTGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAATGGTAGAACCTACTACAAAAAGGGGGA TCTAAAAGAAACCTACTGGAGAGTGATAGATGGGAAGTACTATTATTTTGATCCTTTATCCGGAGAGAT GGTTGTCGGCTGGCAATATATACCTGCTCCACACAAGGGGGTTACGATTGGTCCTTCTCCAAGAATAGA GATTGCTCTTAGACCAGATTGGTTTTATTTTGGTCAAGATGGTGTATTACAAGAATTTGTTGGCAAGCA AGTTTTAGAAGCAAAAACTGCTACGAATACCAACAAACATCATGGGGAAGAATATGATAGCCAAGCAGA GAAACGAGTCTATTATTTTGAAGATCAGCGTAGTTATCATACTTTAAAAACTGGTTGGATTTATGAAGA GGGTCATTGGTATTATTTACAGAAGGATGGTGGCTTTGATTCGCGCATCAACAGATTGACGGTTGGAGA GCTAGCACGTGGTTGGGTTAAGGATTACCCTCTTACGTATGATGAAGAGAAGCTAAAAGCAGCTCCATG GTACTATCTAAATCCAGCAACTGGCATTATGCAAACAGGTTGGCAATATCTAGGTAATAGATGGTACTA CCTCCATTCGTCAGGAGCTATGGCAACTGGCTGGTATAAGGAAGGCTCAACTTGGTACTATCTAGATGC TGAAAATGGTGATATGAGAACTGGCTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGG AGCTATGGCAACTGGTTGGTATCAGGAAAGTTCGACTTGGTACTATCTAAATGCAAGTAATGGAGATAT GAAAACAGGCTGGTTCCAAGTCAATGGTAACTGGTACTATGCCTATGATTCAGGTGCTTTAGCTGTTAA TACCACAGTAGGTGGTTACTACTTAAACTATAATGGTGAATGGGTTAAG SP090 amino acid (SEQ ID NO:156) VFADDSEGWQFVQENGRTYYKKGDLKETYWRVIDGKYYYFDPLSGEMVVGWQYIPAPHKGVTIGPSPRI EIALRPDWFYFGQDGVLQEFVGKQVLEAKTATNTNKHHGEEYDSQAEKRVYYFEDQRSYHTLKTGWIYE EGHWYYLQKDGGFDSRINRLTVGELARGWVKDYPLTYDEEKLKAAPWYYLNPATGIMQTGWQYLGNRWY YLHSSGAMATGWYKEGSTWYYLDAENGDMRTGWQNLGNKWYYLRSSGAMATGWYQESSTWYYLNASNGD MKTGWFQVNGNWYYAYDSGALAVNTTVGGYYLNYNGEWVK SP091 nucleotide (SEQ ID NO:157) TGTCGCTGCAAATGAAACTGAAGTAGCAAAAACTTCGCAGGATACAACGACAGCTTCAAGTAGTTCAGA 
GCAAAATCAGTCTTCTAATAAAACGCAAACGAGCGCAGAAGTACAGACTAATGCTGCTGCCCACTGGGA TGGGGATTATTATGTAAAGGATGATGGTTCTAAAGCTCAAAGTGAATGGATTTTTGACAACTACTATAA GGCTTGGTTTTATATTAATTCAGATGGTCGTTACTCGCAGAATGAATGGCATGGAAATTACTACCTGAA ATCAGGTGGATATATGGCCCAAAACGAGTGGATCTATGACAGTAATTACAAGAGTTGGTTTTATCTCAA GTCAGATGGGGCTTATGCTCATCAAGAATGGCAATTGATTGGAAATAAGTGGTACTACTTCAAGAAGTG GGGTTACATGGCTAAAAGCCAATGGCAAGGAAGTTATTTCTTGAATGGTCAAGGAGCTATGATGCAAAA TGAATGGCTSCTATGATCCAGCCTATTCTGCTTATTTTTATCTAAAATCCGATGGAACTTATGCTAACC AAGAGTGGCAAAAAGTGGGCGGCAAATGGTACTATTTCAAGAAGTGGGGCTATATGGCTCGGAATGAGT GGCAAGGCAACTACTATTTGACTGGAAGTGGTGCCATGGCGACTGACGAAGTGATTATGGATGGTACTC GCTATATCTTTGCGGCCTCTGGTGAGCTCAAAGAAAAAAAAGATTTGAATGTCGGCTGGGTTCACAGAG ATGGTAAGCGCTATTTCTTTAATAATAGAGAAGAACAAGTGGGAACCGAACATGCTAAGAAAGTCATTG ATATTAGTGAGCACAATGGTCGTATCAATGATTGGAAAAAGGTTATTGATGAGAACGAAGTGGATGGTG TCATTGTTCGTCTAGGTTATAGCGGTAAAGAAGACAAGGAATTGGCGCATAACATTAAGGAGTTAAACC GTCTGGGAATTCCTTATGGTGTCTATCTCTATACCTATGCTGAAAATGAGACCGATGCTGAGAGTGACG CTAAACAGACCATTGAACTTATAAAGAAATACAATATGAACCTGTCTTACCCTATCTATTATGATGTTG AGAATTGGGAATATGTAAATAAGAGCAAGAGAGCTCCAAGTGATACAGGCACTTGGGTTAAAATCATCA ACAAGTACATGGACACGATGAAGCAGGCGGGTTATCAAAATGTGTATGTCTATAGCTATCGTAGTTTAT TACAGACGCGTTTAAAACACCCAGATATTTTAAAACATGTAAACTGGGTAGCGGCCTATACGAATGCTT TAGAATGGGAAAACCCTCATTATTCAGGAAAAAAAGGTTGGCAATATACCTCTTCTGAATACATGAAAG GAATCCAAGGGCGCGTAGATGTCAGCGTTTGGTAT SP091 amino acid (SEQ ID NO:158) VAANETEVAKTSQDTTTASSSSEQNQSSNKTQTSAEVQTNAAAHWDGDYYVKDDGSKAQSEWIFDNYYK AWFYINSDGRYSQNEWHGNYYLKSGGYMAQNEWIYDSNYKSWFYLKSDGAYAHQEWQLIGNKWYYFKKW GYMAKSQWQGSYFLNGQGAMMQNEWLYDPAYSAYFYLKSDGTYANQEWQKVGGKWYYFKKWGYMARNEW QGNYYLTGSGAMATDEVIMDGTRYIFAASGELKEKKDLNVGWVHRDGKRYFFNNREEQVGTEHAKKVID ISEHNGRINDWKKVIDENEVDGVIVRLGYSGKEDKELAHNIKELNRLGIPYGVYLYTYAENETDAESDA KQTIELIKKYNMNLSYPIYYDVENWEYVNKSKRAPSDTGTWVKIINKYMDTMKQAGYQNVYVYSYRSLL QTRLKHPDILKHVNWVAAYTNALEWENPHYSGKKGWQYTSSEYMKGIQGRVDVSVWY SP092 nucleotide (SEQ ID NO:159) TACGTCTCAGCCTACTTTTGTAAGAGCAGAAGAATCTCCACAAGTTGTCGAAAAATCTTCATTAGAGAA 
GAAATATGAGGAAGCAAAAGCAAAAGCTGATACTGCCAAGAAAGATTACGAAACGGCTAAAAAGAAAGC

AGAAGACGCTCAGAAAAAGTATGAAGATGATCAGAAGAGAACTGAGGAGAAAGCTCGAAAAGAAGCAGA AGCATCTCAAAAATTGAATGATGTGGCGCTTGTTGTTCAAAATGCATATAAAGAGTACCGAGAAGTTCA AAATCAACGTAGTAAATATAAATCTGACGCTGAATATCAGAAAAAATTAACAGAGGTCGACTCTAAAAT AGAGAAGGCTAGGAAAGAGCAACAGGACTTGCAAAATAAATTTAATGAAGTAAGAGCAGTTGTAGTTCC TGAACCAAATGCGTTGGCTGAGACTAAGAAAAAAGCAGAAGAAGCTAAAGCAGAAGAAAAAGTAGCTAA GAGAAAATATGATTATGCAACTCTAAAGGTAGCACTAGCGAAGAAAGAAGTAGAGGCTAAGGAACTTGA AATTGAAAAACTTCAATATGAAATTTCTACTTTGGAACAAGAAGTTGCTACTGCTCAACATCAAGTAGA TAATTTGAAAAAACTTCTTGCTGGTGCGGATCCTGATGATGGCACAGAAGTTATAGAAGCTAAATTAAA AAAAGGAGAAGCTGAGCTAAACGCTAAACAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACT TCTTGACAGCCTTGATCCTGAAGGTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGAGTT GGATAAAAAAGCTGATGAACTTCAAAATAAAGTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAAT ATTACTTGGAGGGGCTGATNCTGAAGATGATACTGCTGCTCTTCAAAATAAATTAGCTACTAAAAAAGC TGAATTGGAAAAAACTCAAAAAGAATTAGATGCAGCTCTTAATGAGTTAGGCCCTGATGGAGATGAAGA AGAAACTCCAGCGCCGGCTCCTCAACCAGAGCAACCAGCTCCTGCACCAAAACCAGAGCAACCAGCTCC AGCTCCAAAACCAGAGCAACCAGCTCCTGCACCAAAACCAGAGCAACCAGCTCCAGCTCCAAAACCAGA GCAACCAGCTCCAGCTCCAAAACCAGAGCAACCAGCTAAGCCGGAGAAACCAGCTGAAGAGCCTACTCA ACCAGAAAAACCAGCCACTCCAAAAACAGGCTGGAAACAAGAAAACGGTATGTGGTATTTCTACAATAC TGATGGTTCAATGGCAATAGGTTGGCTCCAAAACAACGGTTCATGGTACTACCTAAACGCTAACGGCGC TATGGCAACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGC AAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATGTCAACAGCAATGGCGCTATGGCGACAGGCTG GCTCCAATACAATGGCTCATGGTACTACCTCAACGCTAATGGTGATATGGCGACAGGATGGCTCCAATA CAACGGTTCATGGTATTACCTCAACGCTAATGGTGATATGGCGACAGGATGGGCTAAAGTCAACGGTTC ATGGTACTACCTAAACGCTAACGGTGCTATGGCTACAGGTTGGGCTAAAGTCAACGGTTCATGGTACTA CCTAAACGCTAACGGTTCAATGGCAACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGC ATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATGTCAATGGCTTAGG TGCCCTTGCAGTCAACACAACTGTAGATGGCTATAAAGTCAATGCCAATGGTGAATGGGTT SP092 amino acid (SEQ ID NO:160) TSQPTFVRAEESPQVVEKSSLEKKYEEAKAKADTAKKDYETAKKKAEDAQKKYEDDQKRTEEKARKEAE ASQKLNDVALVVQNAYKEYREVQNQRSKYKSDAEYQKKLTEVDSKIEKARKEQQDLQNKFNEVRAVVVP 
EPNALAETKKKAEEAKAEEKVAKRKYDYATLKVALAKKEVEAKELEIEKLQYEISTLEQEVATAQHQVD NLKKLLAGADPDDGTEVIEAKLKKGEAELNAKQAELAKKQTELEKLLDSLDPEGKTQDELDKEAEEAEL DKKADELQNKVADLEKEISNLEILLGGADXEDDTAALQNKLATKKAELEKTQKELDAALNELGPDGDEE ETPAPAPQPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAKPEKPAEEPTQ PEKPATPKTGWKQENGMWYFYNTDGSMAIGWLQNNGSWYYLNANGANATGWVKDGDTWYYLEASGAMKA SQWFKVSDKWYYVNSNGAMATGWLQYNGSWYYLNANGDNATGWLQYNGSWYYLNANGDMATGWAKVNGS WYYLNANGANATGWAKVNGSWYYLNANGSNATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNGLG ALAVNTTVDGYKVNANGEWV SP093 nucleotide (SEQ ID NO:161) TGGACAGGTGAAAGGTCATGCTACATTTGTGAAATCCATGACAACTGAAATGTACCAAGAACAACAGAA CCATTCTCTCGCCTACAATCAACGCTTGGNTTCGCAAAATCGCATTGTAGATCCTTTTTTGGCGGAGGG ATATGAGGTCAATTACCAAGTGTCTGACGACCCTGATGCAGTCTATGGTTACTTGTCTATTCCAAGTTT GGAAATCATGGAGCCGGTTTATTTGGGAGCAGATTATCATCATTTAGGGATGGGCTTGGCTCATGTGGA TGGTACACCGCTGCCTCTGGATGGTACAGGGATTCGCTCAGTGATTGCTGGGCACCGTGCAGAGCCAAG CCATGTCTTTTTCCGCCATTTGGATCAGCTAAAAGTTGGAGATGCTCTTTATTATGATAATGGCCAGGA AATTGTAGAATATCAGATGATGGACACAGAGATTATTTTACCGTCGGAATGGGAAAAATTAGAATCGGT TAGCTCTAAAAATATCATGACCTTGATAACCTGCGATCCGATTCCTACCTTTAATAAACGCTTATTAGT GAATTTTGAACGAGTCGCTGTTTATCAAAAATCAGATCCACAAACAGCTGCAGTTGCGAGGGTTGCTTT TACGAAAGAAGGACAATCTGTATCGCGTGTTGCAACCTCTCAATGGTTG SP093 amino acid (SEQ ID NO:162) GQVKGHATFVKSMTTEMYQEQQNHSLAYNQRLXSQNRIVDPFLAEGYEVNYQVSDDPDAVYGYLSTPSL EIMEPVYLGADYHHLGMGLAHVDGTPLPLDGTGIRSVIAGHRAEPSHVFFRHLDQLKVGDALYYDNGQE IVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAVYQKSDPQTAAVARVAF TKEGQSVSRVATSQWL SP094 nucleotide (SEQ ID NO:163) GATTGCTCCTTTGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCTTCGTGC CAAGGAAGTTGGTGCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTGGATCAGATTGA TCAGTTGATGGTAGCTATTCGTAGCCAGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCTTTCGAG CCAGATTAATCCACATTTCCTCTATAACACTTTGGACACCATCATCTGGATGGCTGAATTTCATGATAG TCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCAACCTATTTCCGCTTGGCGCTCAATCAAGGCAAGGA CTTGATTTGTCTCTCTGACGAAATCAATCATGTCCGCCAGTATCTCTTTATCCAGAAACAACGCTATGG 
AGATAAGCTGGAATACGAAATTAATGAAAATGTTGCCTTTGATAATTTAGTCTTACCCAAGCTGGTCCT ACAACCCCTTGTAGAAAATGCTCTTTACCATGGCATTAAGGAAAAGGAAGGTCAGGGCCATATTAAACT TTCTGTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGTTGGCTTCCAAGATGC TGGTGATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCTCAA ACTTCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGAAAGTTGAAAT ATATATAAATAGAATAGAAACTAGC SP094 amino acid (SEQ ID NO:164) IAPLKDLRETMLEIASGAQNLRAKEVGAYELREVTRQFNAMLDQIDQLMVAIRSQEETTRQYQLQALSS QINPHFLYNTLDTIIWMAEFHDSQRVVQVTKSLATYFRLALNQGKDLICLSDEINHVRQYLFIQKQRYG DKLEYEINENVAFDNLVLPKLVLQPLVENALYHGIKEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDA GDSSQSQLKRGGVGLQNVDQRLKLHFGANYHMKIDSRPQKGTKVEIYINRIETS SP095 nucleotide (SEQ ID NO:165) TAGGTCATATGGGACTTTTTTTCTACAACAAAATAGGCTCCATAATATCTATAAGGGATTTACCCACTA CAAATATTATAGAGCCGAAAATTCACATCTAATATATGCAGACTACTTTGAAATGAAATTAAAAAAATT ATTAAAGGATGACACAAAAGTTTTTGAAAAATCTACATTCAAATTTGTAGAAGGATATAAAATATACCT GACAGAATCTAAAGAATCTGGAATTAAACAAATGGACAATGTCATAAAATATTTTGAGTTTATTGAATC TAAAAGTATTGCTTTATATTTTCAAAAACGATTAAATGAGCTGATAGAT SP095 amino acid (SEQ ID NO:166) RSYGTFFLQQNRLHNIYKGFTHYKYYRAENSHLIYADYFEMKLKKLLKDDTKVFEKSTFKFVEGYKIYL TESKESGIKQMDNVIKYFEFIESKSIALYFQKRLNELID SP096 nucleotide (SEQ ID NO:167) CAACGTTGAGAATTATTTGCGAATGTGTTTGGATAGCATTCAGAATCAGACGTATCAAAATTTTGAGTG TTTATTAATCAATGATGGCTCTCCAGATCATTCATCCAAAATATGTGAAGAATTTGTAGAGAAAGATTC TCGTTTCAAATATTTTGAGAAAGCAAACGGCGGTCTTTCATCAGCTCGTAACCTAGGTATTGAATGTTC GGGGGGGGGCGTACATTACTTTTGTAGACTC SP096 amino acid (SEQ ID NO:168) NVENYLRMCLDSIQNQTYQNFECLLINDGSPDHSSKICEEFVEKDSRFKYFEKANGGLSSARNLGIECS GGGVHYFCRL SP097 nucleotide (SEQ ID NO:169) CTACTATCAATCAAGTTCTTCAGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGACCATCAGCCA GACTAGCCACTTTATTCAGTCTTATATCAAAAAACTAGAAACCACCTCGACTGGTTTGACCCAGCAGAC GGATGTTCTGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTTTTGAC CATCTTGAAGTCAGATAAGGACTTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTCATTTCTAC AGATGACAGTGTGCAGATGAAAACTTCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCA 
TCAGGGAGCTATGCCTGTTTTGACTCCAGCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCA AGAACTTGTTGATGCAAAGGGAGCCAATCTTGGTGTGCTTCGTTTGGATATTTCTTATGAAACTCTGGA AGCCTATCTCAATCAACTCCAGTTGGGGCAGCAGGGCTTTGCCTTCATTATCAATGAAAACCATGAATT TGTCTACCATCCTCAACACACAGTTTATAGTTCGTCTAGCAAAATGGAGGCTATGAAACCCTACATCGA TACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATTGCAGGAACTGATTG GACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAG SP097 amino acid (SEQ ID NO:170) YYQSSSSAIEATIEGNSQTTISQTSHFIQSYIKKLETTSTGLTQQTDVLAYAENPSQDKVEGIRDLFLT ILKSDKDLKTVVLVTKSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQWVISVTQ ELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENHEFVYHPQHTVYSSSSKMEAMKPYID TGQGYTPGHKSYVSQEKIAGTDWTVLGVSSLEKLDQVRSQ SP098 nucleotide (SEQ ID NO:171) GACAAAAACATTAAAACGTCCTGAGGTTTTATCACCTGCAGGGACTTTAGAGAAGCTAAAGGTAGCTGT TCAGTATGGAGCAGATGCTGTCTTTATCGGTGGTCAGGCCTATGGTCTTCGTAGCCGTGCGGGAAACTT TACTTTCGAACAGATGGAAGAAGGCGTGCAGTTTGCGGCCAAGTATGGTGCCAAGGTCTATGTAGCGGC TAATATGGTTATGCACGAAGGAAATGAAGCTGGTGCTGGTGAGTGGTTCCGTAAACTGCGTGATATCGG GATTGCAGCAGTTATCGTATCTGACCCAGCCTTGATTATGATTGCAGTGACTGAAGCACCAGGCCTTGA AATCCACCTTTCTACCCAAGCCAGTGCCACTAACTATGAAACCCTTGAGTTCTGGAAAGAGCTAGGCTT GACTCGTGTCGTTTTAGCGCGTGAGGTTTCAATGGAAGAATTAGCTGAGATCCGCAAACGTACAGATGT TGAAATTGAAGCCTTTGTCCATGGAGCTATGTGTATTTCATACTCTGGACGTTGTACTCTTTCAAACCA CATGAGTATGCGTGATGCCAACCGTGGTGGATGTTCTCAGTCATGCCGTTGGAAATACGACCTTTACGA TATGCCATTTGGGAAAGAACGTAAGAGTTTGCAGGGTGAGATTCCAGAAGAATTTTCAATGTCAGCCGT TGACATGTCTATGATTGACCANATTCCAGATATGATTGAAAATGGTGTGGACAGTCTAAAAATCGAAGG ACGTATGNAGTCTATTCACTANGTATCAACAGTAACCAACTGCTACAAGGCGGCTGTGGATGCCTATCT TGAAAGTCCTGAAAAGTTTGAAGCTATCAAACAAGACTTGGTGGACGAGATGTGGAAGGTTGCCCAACG TGAACTGGCTACAGGATTTTACTATGGTACACCATCTGAAAATGAGCAGTTGTTTGGTGCTCGTCGTAA AATCCCTGAGTACAAGTTTGTCGCTGAAGTGGTTTCTTATGATGATGCGGCACAAACAGCAACTATTCG TCAACGAAACGTCATTAACGAAGGGGACCAAGTTGAGTTTTATGGTCCAGGTTTCCGTCATTTTGAAAC CTATATTGAAGATTTGCATGATGCTAAAGGCAATAAAATCGACCGCGCTCCAAATCCAATGGAACTATT GACTATTAAAGTCCCACAACCTGTTCAATCAGGAGACATGGTTCGAGCTCTTAAAGAGGGGCTTATCAA 
TCTTTATAAGGAAGATGGAACCAGCGTCACAGTTCGTGCT SP098 amino acid (SEQ ID NO:172) TKTLKRPEVLSPAGTLEKLKVAVQYGADAVFIGGQAYGLRSRAGNFTFEQMEEGVQFAAKYGAKVYVAA NMVMHEGNEAGAGEWFRKLRDIGIAAVIVSDPALIMIAVTEAPGLEIHLSTQASATNYETLEFWKELGL TRVVLAREVSMEELAEIRKRTDVEIEAFVHGAMCISYSGRCTLSNHMSMRDANRGGCSQSCRWKYDLYD MPFGKERKSLQGEIPEEFSMSAVDMSMIDXIPDMIENGVDSLKIEGRMXSIHXVSTVTNCYKAAVDAYL ESPEKFEAIKQDLVDEMWKVAQRELATGFYYGTPSENEQLFGARRKIPEYKFVAEVVSYDDAAQTATIR QRNVINEGDQVEFYGPGFRHFETYIEDLHDAKGNKIDRAPNPMELLTIKVPQPVQSGDMVRALKEGLIN LYKEDGTSVTVRA SP099 nucleotide (SEQ ID NO:173) TTCTCAGGAGACCTTTAAAAATATCACCAATAGCTTCTCCATGCAAATCAATCGTCGCGTCAACCAAGG AACGCCTCGTGGTGCTGGGAATATCAAGGGTGAAGACATCAAAAAAATCACCGAAAACAAGGCCATTGA GTCTTATGTCAAACGTATCAACGCTATCGGAGATTTGACTGGATATGACCTGATTGAAACGCCAGAAAC CAAGAAGAATCTCACTGCTGATCGTGCCAAGCGTTTTGGAAGTAGCTTGATGATTACAGGTGTCAATGA CTCCTCTAAAGAAGACAAGTTTGTCTCTGGTTCTTATAAACTAGTCGAAGGAGAGCACTTAACCAACGA CGACAAGGATAAAATCCTCTTGCACAAGGACTTGGCAGCCAAACACGGCTGGAAAGTAGGGGACAAGGT TAAACTGGACTCTAATATCTACGATGCAGATAATGAAAAAGGAGCCAAGGAAACAGTTGAAGTGACAAT CAAGGGACTCTTTGATGGTCATAATAAGTCAGCAGTAACCTACTCACAAGAACTTTACGAAAACACAGC TATTACAGACATTCACACTGCTGCAAAACTTTATGGATACACAGAAGACACAGCCATTTATGGGGACGC AACCTTCTTTGTAACAGCAGACAAGAACTTGGATGATGTTATGAAAGAGTTGAATGGCATCAGTGGTAT CAACTGGAAGAGCTACACACTCGTCAAGAGCTCCTCTAACTACCCAGCTCTTGAGCAATCTATCTCTGG TATGTACAAGATGGCCAAC SP099 amino acid (SEQ ID NO:174) SQETFKNITNSFSMQINRRVNQGTPRGAGNIKGEDIKKITENKAIESYVKRINAIGDLTGYDLIETPET KKNLTADRAKRFGSSLMITGVNDSSKEDKFVSGSYKLVEGEHLTNDDKDKILLHKDLAAKHGWKVGDKV KLDSNIYDADNEKGAKETVEVTIKGLFDGHNKSAVTYSQELYENTAITDIHTAAKLYGYTEDTAIYGDA TFRVTADKNLDDVMKELNGISGIMWKSYTLVKSSSNYPALEQSISGMYKMAN SP100 nucleotide (SEQ ID NO:175) AGTAAATGCGCAATCAAATTCATTAATATTAATAGATGAACCTGAAATCTCACTTCATCCGAGTGCAAT CTATAAATTTAAAGAGTTTTTACTTCAAGAGTGTTTAAATAAAAAACATCAAATTATTATCACTACACA TTCTACACAACTTATAAAAGATTTTCCTAGAGAAGCCGTGAAACTTTTAGTGAAAAACGGAGAAAAGGT AGATGTTATTGAAAATATTGATTATCAGGATGCATTTTTTGAATTAGGTGATGTGTATCATTCTAGGAA 
GATGATTTATGTTGAAGATAGACTAGCTAAATATATTCTAGAGTTTGTTATCACTCATTCAGGTAGTGA GAATCTTAAACAGAATTTAGTAGTGAGATATATTCCTGGTGGAGCAAATCAAATAATTTGTAATAATAT TTTAAACTCATCGTATTTAGATTCCGATAACCATTATTTTTGGCTTGATGGAGATCAAAACACTAATGT TAGTGAATCAAATAATTTAATGAACTATCTTGAAAATGGTGTTGTTATATCAGATAAAATTCCTGAATC AGATAATAAAAATCTTGATGATATTATAAAATTGATAANGGGATGTCCAATTAAATTTAATGTTTCAGG TAATAAAGGGCAAAAAAATAATATTGAATTAATTGCGAAACAAAGAAGCTTTATAGATTATTGGGCTAA ATAC SP100 amino acid (SEQ ID NO:176) VNAQSNSLILIDEPEISLHPSAIYKFKEFLLQECLNKKHQIIITTHSTQLIKDFPREAVKLLVKNGEKV DVIENIDYQDAFFELGDVYHSRKMIYVEDRLAKYILEFVITHSGSENLKQNLVVRYIPGGANQIICNNI LNSSYLDSDNHYFWLDGDQNTNVSESNNLMNYLENGVVISDKIPESDNKNLDDIIKLIXGCPIKFNVSG NKGQKNNIELIAKQRSFIDYWAKY SP101 nucleotide (SEQ ID NO:177) TTACCGCGTTCATCAAGATGTCAAACAAGTCATGACCTATCAACCCATGGTGCGAGAAATATTGAGTGA ACAAGACACCCCAGCAAACGAAGAGCTTGTGCTTGCTATGATTTATACTGAAACAAAAGGAAAAGAAGG CGATGTTATGCAGTCTAGTGAGTCTGCAAGTGGTTCCACCAACACCATCAATGATAATGCCTCTAGCAT TCGGCAAGGCATTCAAACTCTGACAGGCAATCTCTATCTGGCGCAGAAGAAGGGGGTAGATATCTGGAC AGCTGTTCAAGCCTATAATTTTGGACCTGCCTATATCGATTTTATCGCCCAAAATGGCAAGGAAAATAC CCTGGCTCTAGCCAAACAGTACTCTCGTGAGACTGTTGCCCCCTTGCTTGGTAATAGGACTGGAAAGAC TTATAGTTATATTCACCCCATTTCCATTTTTCACGGTGCTGAACTCTATGTAAATGGAGGAAACTATTA TTATTCTAGACAGGTACGACTTAACCTTTACATCATCAAATGTTTCACTCTCTTTTCAACATCTGGC SP101 amino acid (SEQ ID NO:178) YRVHQDVKQVMTYQPMVREILSEQDTPANEELVLAMIYTETKGKEGDVMQSSESASGSTNTINDNASSI RQGIQTLTGNLYLAQKKGVDIWTAVQAYNFGPAYIDFIAQNGKENTLALAKQYSRETVAPLLGNRTGKT YSYIHPISIFHGAELYVNGGNYYYSRQVRLNLYIIKCFTLFSTSG SP102 nucleotide (SEQ ID NO:179) GTGGATGGGCTTTAACTATCTTCGTATTCGCCGTGCGGCTAAAATTGTGGACAATGAGGAGTTTGAAGC CTTGATTCGTACGGGTCAATTGATTGATTTGCGCGACCCAGCAGAATTCCACAGAAAACATATCCTTGG TGCACGCAATATTCCTTCAAGTCAGTTGAAAACTAGTCTTGCAGCCCTTCGTAAAGATAAACCTGTCCT TCTCTACGAAAACCAACGTGCGCAACGAGTTACAAATGCAGCTCTTTACTTGAAAAAACAAGGTTTTTC TGAGATTTATATCCTTTCTTATGGCTTGGATTCTTGGAAAGGGAAAGTGAAGACTAGC SP102 amino acid (SEQ ID NO:180) WMGFNYLRIRRAAKIVDNEEFEALIRTGQLIDLRDPAEFHRKHILGARNIPSSQLKTSLAALRKDKPVL 
LYENQRAQRVTNAALYLKKQGFSEIYILSYGLDSWKGKVKTS SP103 nucleotide (SEQ ID NO:181) ACTAAACCAGCATCGTTCGCAGGAAAATAAGGACAATAATCGTGTCTCTTATGTGGATGGCAGCCAGTC AAGTCAGAAAAGTGAAAACTTGACACCAGACCAGGTTAGCCAGAAAGAAGGAATTCAGGCTGAGCAAAT TGTAATCAAAATTACAGATCAGGGCTATGTAACGTCACACGGTGACCACTATCATTACTATAATGGGAA AGTTCCTTATGATGCCCTCTTTAGTGAAGAACTCTTGATGAAGGATCCAAACTATCAACTTAAAGACGC TGATATTGTCAATGAAGTCAAGGGTGGTTATATCATCAAGGTCGATGGAAAATATTATGTCTACCTGAA AGATGCAGCTCATGCTGATAATGTTCGAACTAAAGATGAAATCAATCGTCAAAAACAAGAACATGTCAA AGATAATGAGAAGGTTAACTCTAATGTTGCTGTAGCAAGGTCTCAGGGACGATATACGACAAATGATGG TTATGTCTTTAATCCAGCTGATATTATCGAAGATACGGGTAATGCTTATATCGTTCCTCATGGAGGTCA CTATCACTACATTCCCAAAAGCGATTTATCTGCTAGTGAATTAGCAGCAGCTAAAGCACATCTGGCTGG AAAAAATATGCAACCGAGTCAGTTAAGCTATTCTTCAACAGCTAGTGACAATAACACGCAATCTGTAGC AAAAGGATCAACTAGCAAGCCAGCAAATAAATCTGAAAATCTCCAGAGTCTTTTGAAGGAACTCTATGA TTCACCTAGCGCCCAACGTTACAGTGAATCAGATGGCCTGGTCTTTGACCCTGCTAAGATTATCAGTCG TACACCAAATGGAGTTGCGATTCCGCATGGCGACCATTACCACTTTATTCCTTACAGCAAGCTTTCTGC CTTAGAAGAAAAGATTGCCAGAATGGTGCCTATCAGTGGAACTGGTTCTACAGTTTCTACAAATGCAAA ACCTAATGAAGTAGTGTCTAGTCTAGGCAGTCTTTCAAGCAATCCTTCTTCTTTAACGACAAGTAAGGA GCTCTCTTCAGCATCTGATGGTTATATTTTTAATCCAAAAGATATCGTTGAAGAAACGGCTACAGCTTA TATTGTAAGACATGGTGATCATTTCCATTACATTCCAAAATCAAATCAAATTGGGCAACCGACTCTTCC AAACAATAGTCTAGCAACACCTTCTCCATCTCTTCCAATCAATCCAGGAACTTCACATGAGAAACATGA AGAAGATGGATACGGATTTGATGCTAATCGTATTATCGCTGAAGATGAATCAGGTTTTGTCATGAGTCA CGGAGACCACAATCATTATTTCTTCAAGAAG SP103 amino acid (SEQ ID NO:182) LNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGK VPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQKQEHVK DNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAG KNMQPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR TPNGVAIPHGDHYHFIPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEVVSSLGSLSSNPSSLTTSKE LSSASDGYIFNPKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHE EDGYGFDANRIIAEDESGFVMSHGDHNHYFFKK SP105 nucleotide (SEQ ID NO:183) 
TGACTACCTTGAAATCCCACTTTACAGCTATCTTGGTGGATTCAACACTAAAGTTCTTCCAACTCCAAT GATGAACATCATCAACGGTGGTTCTCACTCTGACGCTCCAATCGCTTTCCAAGAGTTCATGATCTTGCC AGTTGGTGCGCCAACATTTAAAGAAGCCCTTCGTTACGGTGCTGAAATCTTCCACGCTCTTAAGAAAAT CCTTAAATCACGTGGTTTGGAAACTGCCGTAGGTGACGAAGGTGGATTCGCTCCTCGTTTCGAAGGAAC TGAAGATGGTGTTGAAACTATCCTTGCTGCGATTGAAGCTGCTGGATATGTACCAGGTAAAGACGTATT TATCGGATTTGACTGTGCTTCATCAGAATTCTACGATAAAGAACGTAAAGTTTACGACTACACTAAATT TGAAGGTGAAGGTGCTGCTGTTCGTACATCTGCAGAACAAATCGACTACCTTGAAGAATTGGTTAACAA ATACCCAATCATCACTATTGAAGATGGTATGGATGAAAACGACTGGGATGGTTGGAAAGCTCTTACTGA ACGTCTTGGTAAGAAAGTACAACTTGTTGGTGACGACTTCTTCGTAACAAACACTGACTACCTTGCACG TGGTATCCAAGAAGGTGCTGCTAACTCAATCCTTATCAAAGTTAACCAAATCGGTACTCTTACTGAAAC TTTTGAAGCTATCGAAATGGCTAAAGAAGCTGGTTACACTGCTGTTGTATCACACCGTTCAGGTGAAAC TGAAGATTCAACAATCGCTGATATTGCAGTTGCAACTAACGCAGGACAAATCAAGACTGGTTCACTTTC ACGTACAGACCGCATCGCTAAATACAACCAATTGCTTCGTATCGAAGACCAACTTGGTGAAGTAGCTGA ATATCGTGGATTGAAATCATTCTACAACCTTAAAAAA SP105 amino acid (SEQ ID NO:184) DYLEIPLYSYLGGFNTKVLPTPMMNIINGGSHSDAPIAFQEFMILPVGAPTFKEALRYGAETFHALKKI LKSRGLETAVGDEGGFAPRFEGTEDGVETILAAIEAAGYVPGKDVFIGFDCASSEFYDKERKVYDYTKF EGEGAAVRTSAEQIDYLEELVNKYPIITIEDGMDENDWDGWKALTERLGKKVQLVGDDFFVTNTDYLAR GIQEGAANSILIKVNQIGTLTETFEAIEMAKEAGYTAVVSHRSGETEDSTIADIAVATNAGQIKTGSLS RTDRIAKYNQLLRIEDQLGEVAEYRGLKSFYNLKK SP106 nucleotide (SEQ ID NO:185) TCGTATCTTTTTTTGGAGCAATGTTCGCGTAGAAGGACATTCCATGGATCCGACCCTAGCGGATGGCGA AATTCTCTTCGTTGTAAAACACCTTCCTATTGACCGTTTTGATATCGTGGTGGCCCATGAGGAAGATGG CAATAAGGACATCGTCAAGCGCGTGATTGGAATGCCTGGCGACACCATTCGTTACGAAAATGATAAACT CTACATCAATGACAAAGAAACGGACGAGCCTTATCTAGCAGACTATATCAAACGCTTCAAGGATGACAA ACTCCAAAGCACTTACTCAGGCAAGGGCTTTGAAGGAAATAAAGGAACTTTCTTTAGAAGTATCGCTCA AAAAGCTCAAGCCTTCACAGTTGATGTCAACTACAACACCAACTTTAGCTTTACTGTTCCAGAAGGAGA ATACCTTCTCCTCGGAGATGACCGCTTGGTTTCGAGCGACAGCCGCCACGTAGGTACCTTCAAAGCAAA AGATATCACAGGGGAAGCTAAATTCCGCTTATGGCCAATCACCCGTATCGGAACATTT SP106 amino acid (SEQ ID NO:186) RIFFWSNVRVEGHSMDPTLADGEILFVVKHLPIDRFDIVVAHEEDGNKDIVKRVIGMPGDTIRYENDKL 
YINDKETDEPYLADYIKRFKDDKLQSTYSGKGFEGNKGTFFRSIAQKAQAFTVDVNYNTNFSFTVPEGE YLLLGDDRLVSSDSRHVGTFKAKDITGEAKFRLWPITRIGTF SP107 nucleotide (SEQ ID NO:187)

GGACTCTCTCAAAGATGTGAAAGCAAATGCTAGCGACAGCAAGCCTGCACAGGACAAGAAGGATGCAAA ACAAGGAACGGAAGATAGTAAGGATTCAGATAAGATGACTGAAACAAACTCAGTTCCGGCAGGAGTGAT TGTGGTCAGTCTACTTGCCCTCCTAGGCGTGATTGCCTTCTGGCTGATTCGCCGTAAGAAAGAGTCAGA AATCCAGCAATTAAGCACGGAATTGATCAAGGTTCTAGGACAGCTAGATGCAGAAAAAGCGGATAAAAA AGTCCTTGCCAAAGCCCAAAACCTTCTCCAAGAAACCCTTGATTTCGTGAAAGAAGAAAATGGCTCAGC AGAGACAGAAACTAAACTAGTAGAGGAGCTTAAAGCAATCCTTGACAAACTCAAG SP107 amino acid (SEQ ID NO:188) DSLKDVKANASDSKPAQDKKDAKQGTEDSKDSDKMTETNSVPAGVIVVSLLALLGVIAFWLIRRKKESE IQQLSTELIKVLGQLDAEKADKKVLAKAQNLLQETLDFVKEENGSAETETKLVEELKAILDKLK SP108 nucleotide (SEQ ID NO:189) CAAGAAATCCTATCATCTCTTCCAGAAGCAAACAGAGACGAGGGGAATTCAGACTCAGTTGATTGAAGA ATCGCTTAGTCAGCAGACTATAATCCAGTCCTTCAATGCTCAAACAGAATTTATCCAAAGATTGCGTGA GGCTCATGACAACTACTCAGGCTATTCTCAGTCAGCCATCTTTTATTCTTCAACGGTCAATCCTTCGAC TCGCTTTGTAAATGCACTCATTTATGCCCTTTTAGCTGGAGTAGGAGCTTATCGTATCATGATGGGTTC AGCCTTGACCGTCGGTCGTTTAGTGACTTTTTTGAACTATGTTCAGCAATACACCAAGCCCTTTAACGA TATTTCTTCAGTGCTAGCTGAGTTGCAAAGTGCTCTGGCTTGCGTAGAGCGTATCTATGGAGTCTTAGA TAGCCCTGAAGTGGCTGAAACAGGTAAGGAAGTCTTGACGACCAGTGACCAAGTTAAGGGAGCTATTTC CTTTAAACATGTCTCTTTTGGCTACCATCCTGAAAAAATTTTGATTAAGGACTTGTCTATCGATATTCC AGCTGGTAGTAAGGTAGCCATCGTTGGTCCGACAGGTGCTGGAAAATCAACTCTTATCAATCTCCTTAT GCGTTTTTATCCCATTAGCTCGGGAGATATCTTGCTGGATGGGCAATCCATTTATGATTATACACGAGT ATCATTGAGACAGCAGTTTGGTATGGTGCTTCAAGAAACCTGGCTCACACAAGGGACCATTCATGATAA TATTGCCTTTGGCAATCCTGAAGCCAGTCGAGAGCAAGTAATTGCTGCTGCCAAAGCAGCTAATGCAGA CTTTTTCATCCAACAGTTGCCACAGGGATACGATACCAAGTTGGAAAATGCTGGAGAATCTCTCTCTGT CGGCCAAGCTCAGCTCTTGACCATAGCCCGAGTCTTTCTGGCTATTCCAAAGATTCTTATCTTAGACGA GGCAACTTCTTCCATTGATACACGGACAGAAGTGCTGGTACAGGATGCCTTTGCAAAACTCATGAAGGG CCGCACAAGTTTCATCATTGCTCACCGTTTGTCAACCATTCAGGATGCGGATTTAATTCTTGTCTTAGT AGATGGTGATATTGTTGAATATGGTAACCATCAAGAACTCATGGATAGAAAGGGTAAGTATTACCAAAT GCAAAAAGCTGCGGCTTTTAGTTCTGA A SP108 amino acid (SEQ ID NO:190) KKSYHLFQKQTETRGIQTQLIEESLSQQTIIQSFNAQTEFIQRLREAHDNYSGYSQSAIFYSSTVNPST RFVNALIYALLAGVGAYRIMMGSALTVGRLVTFLNYVQQYTKPFNDISSVLAELQSALACVERIYGVLD 
SPEVAETGKEVLTTSDQVKGAISFKHVSFGYHPEKILIKDLSIDIPAGSKVAIVGPTGAGKSTLINLLM RFYPISSGDILLDGQSIYDYTRVSLRQQFGMVLQETWLTQGTIHDNIAFGNPEASREQVIAAAKAANAD FFIQQLPQGYDTKLENAGESLSVGQAQLLTIARVFLAIPKILILDEATSSIDTRTEVLVQDAFAKLMKG RTSFIIAHRLSTIQDADLILVLVDGDIVEYGNHQELMDRKGKYYQMQKAAAFSSE SP109 nucleotide (SEQ ID NO:191) ACGAAATGCAGGGCAGACAGATGCCTCGCAAATTGAAAAGGCGGCAGTTAGCCAAGGAGGAAAAGCAGT GAAAAAAACAGAAATTAGTAAAGACGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGG AGTGGAGGAATATTTCTCACGTGTTCCCGGGGTGACGGATGCCGTTTCAGGCTATGCAAATGGTAGAGG AGAAACAACCAAGTACGAATTGATTAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGC CAAGCAAATTTCTCTCAAGGAAATCCTGCTTCACTATTTCCGCATTATCAATCCAACCAGCAAAAATAA ACAAGGAAATGATGTGGGGACCCAGTACCGTACTGGTGTTTATTACACAGATGACAAGGATTTGGAAGT GATTAACCAAGTCTTTGATGAGGTGGCTAAGAAATACGATCAACCTCTAGCAGTTGAAAAGGAAAACTT GAAGAATTTTGTGGTGGCTGAGGATTACCATCAAGACTATCTCAAGAAAAATCCAAATGGCTACTGCCA TATCAATGTTAATCAGGCGGCCTATCCTGTCATTGATGCCAGCAAATATCCAAAACCAAGTGATGAGGA ATTGAAAAAGACCCTGTCACCTGAGGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCTC AAACCGTTACTGGGATAAATTTGAATCCGGTATCTATGTGGATATAGCAACTGGGGAACCTCTCTTTTC ATCAAAAGACAAATTTGAGTCTGGTTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGT CACCTACAAGGAAGATAAGTCCTACAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCA CCTTGGGCATGTCTTTACGGATGGTCCACAGGACAAGGGCGGCTTACGTTACTGTATCAATAGCCTCTC TATCCGCTTTATTCCCAAAGACCAAATGGAAGAAAAAGGCTACGCTTATTTACTAGATTATGTTGAT SP109 amino acid (SEQ ID NO:192) RNAGQTDASQIEKAAVSQGGKAVKKTEISKDADLHEIYLAGGCFWGVEEYFSRVPGVTDAVSGYANGRG ETTKYELINQTGHAETVHVTYDAKQISLKEILLHYFRIINPTSKNKQGNDVGTQYRTGVYYTDDKDLEV INQVFDEVAKKYDQPLAVEKENLKNFVVAEDYHQDYLKKNPNGYCHINVNQAAYPVIDASKYPKPSDEE LKKTLSPEEYAVTQENQTERAFSNRYWDKFESGIYVDIATGEPLFSSKDKFESGCGWPSFTQPISPDVV TYKEDKSYNMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLSIRFIPKDQMEEKGYAYLLDYVD SP110 nucleotide (SEQ ID NO:193) TGTATAGTTTTTAGCGCTTGTTCTTCTAATTCTGNTAAAAATGAAGAAAATACTTCTAAAGAGCATGCG CCTGATAAAATAGTTTTAGATCATGCTTTCGGTCAAACTATATTAGATAAAAAACCTGAAAGAGTTGCA ACTATTGCTTGGGGAAATCATGATGTAGCATTAGCTTTAGGAATAGTTCCTGTTGGATTTTCAAAAGCA 
AATTACGGTGTAAGTGCTGATAAAGGAGTTTTACCATGGACAGAAGAAAAAATCAAAGAACTAAATGGT AAAGCTAACCTATTTGACGATTTGGATGGACTTAACTTTGAAGCAATATCAAATTCTAAACCAGATGTT ATCTTAGCAGGTTATTCTGGTATAACTAAAGAAGATTATGACACTCTATCA SP110 amino acid (SEQ ID NO:194) CIVFSACSSNSXKNEENTSKEHAPDKIVLDHAFGQTILDKKPERVATIAWGNHDVALALGIVPVGFSKA NYGVSADKGVLPWTEEKIKELNGKANLFDDLDGLNFEAISNSKPDVILAGYSGITKEDYDTLS SP111 nucleotide (SEQ ID NO:195) GTGTGTCGAGCATATTCTGAAGCAAACCTATCAAAATATAGAAATTATTTTAGTTGATGACGGTTCTAC GGATAATTCTGGGGAAATTTGTGATGCTTTTATGATGCAAGATAATCGTGTGCGAGTATTGCATCAAGA AAATAAGGGGGGGGCAGCACAAGCTAAAAATATGGGGATTAGTGTAGCTAAGGGAGAGTACATCACGAT TGTTGATTCAGATGATATCGTAAAAGAAAATATGATTGAAACTCTTTATCAGCAAGTCCAAGAAAAGGA TGCAGATGTTGTTATAGGGAATTACTATAATTATGACGAAAGTGACGGGAATTTTTATTTTTATGTAAC AGGGCAAGATTTTTGCGTCGAAGAATTAGCTATACAAGAAATTATGAACCGTCAAGCAGGAGATTGGAA ATTCAATAGCTCGGCCTTTATATTGCCGACATTTAAGTTGATTAAAAAAGAATTATTCAATGAAGTTCA CTTTTCAAATGGTCGCCGCTTTGATGATGAAGCAACTATGCATCGCTTTTATCTTTTAGCCTCTAAAAT CGTCTTTATAAACGATAATCTCTATCTGTATAGAAGACGTTCAGGAAGCATCATGAGAACGGAATTTGA TCTTTCCTGGGCAAGAGATATTGTTGAAGTGTTTTCTAAGAAAATATCGGATTGTGTCTTGGCTGGTTT GGATGTCTCCGTTCTGCGTATTCGATTTGTCAATCTTTTAAAAGATTATAAGCAAACTTTAGAATACCA TCAATTAACAGATACTGAGGAATATAAAGATATTTGTTTCAGATTAAAGTTGTTTTTTGATGCAGAACA AAGAAATGGTAAAAGT SP111 amino acid (SEQ ID NO:196) CVEHILKQTYQNIEIILVDDGSTDNSGEICDAFMMQDNRVRVLHQENKGGAAQAKNMGISVAKGEYITI VDSDDIVKENMIETLYQQVQEKDADVVIGNYYNYDESDGNFYFYVTGQDFCVEELAIQEIMNRQAGDWK FNSSAFILPTFKLIKKELFNEVHFSNGRRFDDEATMHRFYLLASKIVFINDNLYLYRRRSGSIMRTEFD LSWARDIVEVFSKKISDCVLAGLDVSVLRIRFVNLLKDYKQTLEYHQLTDTEEYKDICFRLKLFFDAEQ RNGKS SP0112 nucleotide (SEQ ID NO:197) GTGTTTGGATAGCATTCAGAATCAGACGTATCAAAATTTTGAGTGTTTATTAATCAATGATGGCTCTCC AGATCATTCATCCAAAATATGTGAAGAATTTGTAGAGAAAGATTCTCGTTTCAAATATTTTGAGAAAGC AAACGGCGGTCTTTCATCAGCTCGTAACCTAGGTATTGAATGTTCGGGGGGGGCGTACATTACTTTTGT AGACTCTGATGATTGGTTGGAACATGATGCTTTAGACCGATTATATGGTGCTTTGAAAAAGGAAAACGC AGATATTAGTATCGGGCGTTATAATTCTTATGATGAAACACGCTATGTGTATATGACTTATGTTACGGA 
TCCAGATGATTCTCTAGAAGTGATAGAAGGTAAAGCAATTATGGATAGGGAAGGTGTCGAAGAAGTCAG AAATGGGAACTGGACTGTAGCTGTCTTGAAGTTATTCAAGAGAGAGTTACTACAAGATTTACCATTTCC TATAGGAAAAATTGCAGAGGATACTTACTGGACATGGAAGGTACTTCTAAGAGCTTCGAGGATAGTCTA TTTGAATCGTTGTGTTTACTGGTACCGTGTTGGTTTATCTGATACTTTATCGAATACATGGAGTGAAAA GCGTATGTATGATGAAATTGGGGCTAGGGAAGAAAAGATAGCTATTTTAGCAAGTTCAGACTATGACTT GACCAATCATATTTTGATTTATAAAAATAGATTACAAAGAGTGATAGCAAAATTAGAAGAACAAAATAT GCAGTTCACAGAGATTTACAGAAGAATGATGGAAAAATTGTCTTTACTTCCG SP0112 amino acid (SEQ ID NO:198) CLDSIQNQTYQNFECLLINDGSPDHSSKICEEFVEKDSRFKYFEKANGGLSSARNLGIECSGGAYITFV DSDDWLEHDALDRLYGALKKENADISIGRYNSYDETRYVYMTYVTDPDDSLEVIEGKAIMDREGVEEVR NGNWTVAVLKLFKRELLQDLPFPIGKIAEDTYWTWKVLLRASRIVYLNRCVYWYRVGLSDTLSNTWSEK RMYDEIGAREEKIAILASSDYDLTNHILIYKNRLQRVIAKLEEQNMQFTEIYRRMMEKLSLLP SP113 nucleotide (SEQ ID NO:199) GTGCCTAGATAGTATTATTACTCAAACATATAAAAATATTGAGATTGTTGTCGTTAATGATGGTTCTAC GGATGCTTCAGGTGAAATTTGTAAAGAATTTTCAGAAATGGATCACCGAATTCTCTATATAGAACAAGA AAATGCTGGTCTTTCTGCCGCACGAAACACCGGTCTGAATAATATGTCCGGAAATTATGTGACCTTTGT GGACTCGGATGATTGGATTGAGCAAGATTATGTAGAAACTCTATATAAAAAAATAGTAGAGTATCAGGC TGATATTGCAGTTGGTAATTATTATTCTTTCAACGAAAGTGAAGGAATGTTCTACTTTCATATATTGGG AGACTCCTATTATGAGAAAGTATATGATAATGTTTCTATCTTTGAGAACTTGTATGAAACTCAAGAAAT GAAGAGTTTTGCTTTGATATCTGCTTGGGGTAAACTCTATAAGGCAAGATTGTTTGAGCAGTTGCGCTT TGACATAGGTAAATTAGGAGAAGATGGTTACCTCAATCAAAAGGTATATTTATTATCAGAAAAGGTAAT TTATTTAAATAAAAGTCTTTATGCTTATCGGATTAGAAAAGGTAGTTTATCAAGAGTTTGGACAGAAAA GTGGATGCACGCTTTAGTTGATGCTATGTCTGAACGTATTACGCTACTAGCTAATATGGGTTATCCTCT AGAGAAACACTTGGCAGTTTATCGTCAGATGTTGGAAGTCAGTCTCGCCAACGGTCAAGCTAGTGGTTT ATCTGACACAGCAACGTATAAAGAGTTTGAAATGAAACAAAGGCTTTTAAATCAGCTATCGAGACAAGA GGAAAGTGAAAAGAAAGCCATTGTCCTCGCAGCAAACTATGGCTATGTAGACCAAGTTTTAACGACAAT CAAGTCTATTTGTTATCATAATCGTTCGATTCGTTTTTATCTGATTCATAGCGATTTTCCAAATGAATG GATTAAGCAATTAAATAAGCGCTTAGAGAAGTTTGACTCAGAAATTATTAATTGTCGGGTAACTTCTGA GCAAATTTCATGTTATAAATCGGATATTAGTTACACAGTCTTTTTACGCTATTTCATAGCTGATTTCGT 
GCAAGAAGACAAGGCCCTCTACTTGGACTGTGATCTAGTTGTAACGAAAAATCTGGATGACTTGTTTGC TACAGACTTACAAGATTATCCTTTGGCTGCTGTTAGAGATTTTGGGGGCAGAGCTTATTTTGGTCAAGA AATCTTTAATGCCGGTGTTCTCTTGGTAAACAATGCTTTTTGGAAAAAAGAGAATATGACCCAAAAATT AATTGATGTAACCAATGAATGGCATGATAAGGTGGATCAGGCAGATCAGAGCATCTTGAATATGCTTTT TGAACATAAATGGTTGGAATTGGACTTTGATTATAATCATATTGTCATTCATAAACAGTTTGCTGATTA TCAATTGCCTGAGGGTCAGGATTATCCTGCTATTATTCACTATCTTTCTCATCGGAAACCGTGGAAAGA TTTGGCGGCCCAAACCTATCGTGAAGTTTGGTGGTACTATCATGGGCTTGAATGGACAGAATTGGGACA AAACCATCATTTACATCCATTACAAAGATCTCACATCTATCCAATAAAGGAACCTTTCACTTGTCTAAT CTATACTGCCTCAGACCATATTGAACAAATTGAGACATTGGTTCAATCCTTGCCTGATATTCAGTTTAA GATAGCAGCTAGAGTAATAGTTAGTGATCGATTGGCTCAGATGACAATTTATCCAAACGTGACTATATT TAACGGAATTCACTATTTGGTAGATGTCGATAATGAATTGGTAGAAACCAGTCAAGTACTTTTAGATAT TAATCATGGCGAAAAGACAGAAGAAATTCTCGATCAATTTGCTAATCTTGGCAAGCCTATCTTATCCTT TGAAAATACTAAAACCTATGAAGTAGGTCAGGAGGCATATGCTGTTGACCAAGTTCAAGCAATGATTGA AAAATTGAGAGAAATAAGCAAA SP113 amino acid (SEQ ID NO:200) CLDSIITQTYKNIEIVVVNDGSTDASGEICKEFSEMDHRILYIEQENAGLSAARNTGLNNMSGNYVTFV DSDDWIEQDYVETLYKKIVEYQADIAVGNYYSFNESEGMFYFHILGDSYYEKVYDNVSIFENLYETQEM KSFALISAWGKLYKARLFEQLRFDIGKLGEDGYLNQKVYLLSEKVIYLNKSLYAYRIRKGSLSRVWTEK WMHALVDAMSERITLLANMGYPLEKHLAVYRQMLEVSLANGQASGLSDTATYKEFEMKQRLLNQLSRQE ESEKKAIVLAANYGYVDQVLTTIKSICYHNRSIRFYLIHSDFPNEWIKQLNKRLEKFDSEIINCRVTSE QISCYKSDISYTVFLRYFIADFVQEDKALYLDCDLVVTKNLDDLFATDLQDYPLAAVRDFGGRAYFGQE IFNAGVLLVNNAFWKKENMTQKLIDVTNEWHDKVDQADQSILNMLFEHKWLELDFDYNHIVIHKQFADY QLPEGQDYPAIIHYLSHRKPWKDLAAQTYREVWWYYHGLEWTELGQNHHLHPLQRSHIYPIKEPFTCLI YTASDHIEQIETLVQSLPDIQFKIAARVIVSDRLAQMTIYPNVTIFNGIHYLVDVDNELVETSQVLLDI NHGEKTEEILDQFANLGKPILSFENTKTYEVGQEAYAVDQVQAMIEKLREISK SP114 nucleotide (SEQ ID NO:201) CATTCAGAAGCAGACCTATCAAAATCTGGAAATTATTCTTGTTGATGATGGTGCAACAGATGAAAGTGG TCGCTTGTGTGATTCAATCGCTGAACAAGATGACAGGGTGTCAGTGCTTCATAAAAAGAACGAAGGATT GTCGCAAGCACGAAATGATGGGATGAAGCAGGCTCACGGGGATTATCTGATTTTTATTGACTCAGATGA TTATATCCATCCAGAAATGATTCAGAGCTTATATGAGCAATTAGTTCAAGAAGATGCGGATGTTTCGAG 
CTGTGGTGTCATGAATGTCTATGCTAATGATGAAAGCCCACAGTCAGCCAATCAGGATGACTATTTTGT CTGTGATTCTCAAACATTTCTAAAGGAATACCTCATAGGTGAAAAAATACCTGGGACGATTTGCAATAA GCTAATCAAGAGACAGATTGCAACTGCCCTATCCTTTCCTAAGGGGTTGATTTACGAAGATGCCTATTA CCATTTTGATTTAATCAAGTTGGCCAAGAAGTATGTGGTTAATACTAAACCCTATTATTACTATTTCCA TAGAGGGGATAGTATTACGACCAAACCCTATGCAGAGAAGGATTTAGCCTATATTGATATCTACCAAAA GTTTTATAATGAAGTTGTGAAAAACTATCCTGACTTGAAAGAGGTCGCTTTTTTCAGATTGGCCTATGC CCACTTCTTTATTCTGGATAAGATGTTGCTAGATGATCAGTATAAACAGTTTGAAGCCTATTCTCAGAT TCATCGTTTTTTAAAAGGCCATGCCTTTGCTATTTCTAGGAATCCAATTTTCCGTAAGGGGAGAAGAAT TAGTGCTTTGGCCCTATTCATAAATATTTCCTTATATCGATTCTTATTACTGAAAAATATTGAAAAATC TAAAAAATTACAT SP114 amino acid (SEQ ID NO:202) IQKQTYQNLEIILVDDGATDESGRLCDSIAEQDDRVSVLHKKNEGLSQARNDGMKQAHGDYLIFIDSDD YIHPEMIQSLYEQLVQEDADVSSCGVMMVYANDESPQSANQDDYFVCDSQTFLKEYLIGEKIPGTICNK LIKRQIATALSFPKGLIYEDAYYHFDLIKLAKKYVVNTKPYYYYFHRGDSITTKPYAEKDLAYIDIYQK FYNEVVKNYPDLKEVAFFRLAYAHFFILDKMLLDDQYKQFEAYSQIHRFLKGHAFAISRNPIFRKGRRI SALALFINISLYRFLLLKNIEKSKKLH SP115 nucleotide (SEQ ID NO:203) TAAGGCTGATAATCGTGTTCAAATGAGAACGACGATTAATAATGAATCGCCATTGTTGCTTTCTCCGTT GTATGGCAATGATAATGGTAACGGATTATGGTGGGGGAACACATTGAAGGGAGCATGGGAAGCTATTCC TGAAGATGTAAAGCCATATGCAGCGATTGAACTTCATCCTGCAAAAGTCTGTAAACCAACAAGTTGTAT TCCACGAGATACGAAAGAATTGAGAGAATGGTATGTCAAGATGTTGGAGGAAGCTCAAAGTCTAAACAT TCCAGTTTTCTTGGTTATTATGTCGGCTGGAGAGCGTAATACAGTTCCTCCAGAGTGGTTAGATGAACA ATTCCAAAAGTATAGTGTGTTAAAAGGTGTTTTAAATATTGAGAATTATTGGATTTACAATAACCAGTT AGCTCCGCATAGTGCTAAATATTTGGAAGTTTGTGCCAAATATGGAGCGCATTTTATCTGGCATGATCA TGAAAAATGGTTCTGGGAAACTATTATGAATGATCCGACATTCTTTGAAGCGAGTCAAAAATATCATAA AAATTTGGTGTTGGCAACTAAAAATACGCCAATAAGAGATGATGCGGGTACAGATTCTATCGTTAGTGG ATTTTGGTTGAGTGGCTTATGTGATAACTGGGGCTCATCAACAGATACATGGAAATGGTGGGAAAAACA TTATACAAACACATTTGAAACTGGAAGAGCTAGGGATATGAGATCCTATGCATCGGAACCAGAATCAAT GATTGCTATGGAAATGATGAATGTATATACTGGGGGAGGCACAGTTTATAATTTCGAATGTGCCGCGTA TACATTTATGACAAATGATGTACCAACTCCAGCATTTACTAAAGGTATTATTCCTTTCTTTAGACATGC TATACAAAATCCAGCTCCAAGTAAGGAAGAAGTTGTAAATAGAACAAAAGCTGTATTTTGGAATGGAGA 
AGGTAGGATTAGTTCATTAAACGGATTTTATCAAGGACTTTATTCGAATGATGAAACAATGCCTTTATA TAATAATGGGAGATATCATATTCTTCCTGTAATACATGAGAAAATTGATAAGGAAAAGATTTCATCTAT ATTCCCTAATGCAAAAATTTTGACTAAAAATAGTGAGGAATTGTCTAGTAAAGTCAACTATTTAAACTC GCTTTATCCAAAACTTTATGAAGGAGATGGGTATGCTCAGCGTGTAGGTAATTCCTGGTATATTTATAA TAGTAATGCTAATATCAATAAAAATCAGCAAGTAATGTTGCCTATGTATACTAATAATACAAAGTCGTT ATCGTTAGATTTGACGCCACATACTTACGCTGTTGTTAAAGAAAATCCAAATAATTTACATATTTTATT GAATAATTACAGGACAGATAAGACAGCTATGTGGGCATTATCAGGAAATTTTGATGCATCAAAAAGTTG GAAGAAAGAAGAATTAGAGTTAGCGAACTGGATAAGCAAAAATTATTCCATCAATCCTGTAGATAATGA CTTTAGGACAACAACACTTACATTAAAAGGGCATACTGGTCATAAACCTCAGATAAATATAAGTGGCGA TAAAAATCATTATACTTATACAGAAAATTGGGATGAGAATACCCATGTTTATACCATTACGGTTAATCA TAATGGAATGGTAGAGATGTCTATAAATACTGAGGGGACAGGTCCAGTCTCTTTCCCAACACCAGATAA ATTTAATGATGGTAATTTGAATATAGCATATGCAAAACCAACAACACAAAGTTCTGTAGATTACAATGG AGACCCTAATAGAGCTGTGGATGGTAACAGAAATGGTAATTTTAACTCTGGTTCGGTAACACACACTAG GGCAGATAATCCCTCTTGGTGGGAAGTCGATTTGAAAAAAATGGATAAAGTTGGGCTTGTTAAAATTTA TAATCGCACAGATGCTGAGACTCAACGTCTATCTAATTTT SP115 amino acid (SEQ ID NO:204) KADNRVQMRTTINNESPLLLSPLYGNDNGNGLWWGNTLKGAWEAIPEDVKPYAAIELHPAKVCKPTSCI PRDTKELREWYVKMLEEAQSLNIPVFLVIMSAGERNTVPPEWLDEQFQKYSVLKGVLNIENYWIYNNQL APHSAKYLEVCAKYGAHFIWHDHEKWFWETIMNDPTFFEASQKYHKNLVLATKNTPIRDDAGTDSIVSG FWLSGLCDNWGSSTDTWKWWEKHYTNTFETGRARDMRSYASEPESMIAMEMMNVYTGGGTVYNFECAAY TFMTNDVPTPAFTKGIIPFFRHAIQNPAPSKEEVVNRTKAVFWNGEGRISSLNGFYQGLYSNDETMPLY NNGRYHILPVIHEKIDKEKISSIFPNAKILTKNSEELSSKVVYLNSLYPKLYEGDGYAQRVGNSWYIYN SNANINKNQQVMLPMYTNNTKSLSLDLTPHTYAVVKENPNNLHILLNNYRTDKTAMWALSGNFDASKSW KKEELELANWISKNYSINPVDNDFRTTTLTLKGHTGHKPQINISGDKNHYTYTENWDENTHVYTITVNH NGMVEMSINTEGTGPVSFPTPDKFNDGNLNIAYAKPTTQSSVDYNGDPNRAVDGNRNGNFNSGSVTHTR ADNPSWWEVDLKKMDKVGLVKIYNRTDAETQRLSNF SP117 nucleotide (SEQ ID NO:205) CTGTGGCAATCAGTCAGCTGCTTCCAAACAGTCAGCTTCAGGAACGATTGAGGTGATTTCACGAGAAAA TGGCTCTGGGACACGGGGTGCCTTCACAGAAATCACAGGGATTCTCAAAAAAGACGGTGATAAAAAAAT TGACAACACTGCCAAAACAGCTGTGATTCAAAATAGTACAGAAGGTGTTCTCTCAGCAGTTCAAGGGAA 
TGCTAATGCTATCGGCTACATCTCCTTGGGATCTTTAACGAAATCTGTCAAGGCTTTAGAGATTGATGG TGTCAAGGCTAGTCGAGACACAGTTTTAGATGGTGAATACCCTCTTCAACGTCCCTTCAACATTGTTTG GTCTTCTAATCTTTCCAAGCTAGGTCAAGATTTTATCAGCTTTATCCACTCCAAACAAGGTCAACAAGT GGTCACAGATAATAAATTTATTGAAGCTAAAACCGAAACCACGGAATATACAAGCCAACACTTATCAGG CAAGTTGTCTGTTGTAGGTTCCACTTCAGTATCTTCTTTAATGGAAAAATTAGCAGAAGCTTATAAAAA AGAAAATCCAGAAGTTACGATTGATATTACCTCTAATGGGTCTTCAGCAGGTATTACCGCTGTTAAGGA GAAAACCGCTGATATTGGTATGGTTTCTAGGGAATTAACTCCTGAAGAAGGTAAGAGTCTCACCCATGA TGCTATTGCTTTAGACGGTATTGCTGTTGTGGTCAATAATGACAATAAGGCAAGCCAAGTCAGTATGGC TGAACTTGCAGACGTTTTTAGTGGCAAATTAACCACCTGGGACAAGATTAAA SP117 amino acid (SEQ ID NO:206) CGNQSAASKQSASGTIEVISRENGSGTRGAFTEITGILKKDGDKKIDNTAKTAVIQNSTEGVLSAVQGN ANAIGYISLGSLTKSVKALEIDGVKASRDTVLDGEYPLQRPFNIVWSSNLSKLGQDFISFIHSKQGQQV VTDNKFIEAKTETTEYTSQHLSGKLSVVGSTSVSSLMEKLAEAYKKENPEVTIDITSNGSSAGITAVKE KTADIGMVSRELTPEEGKSLTHDAIALDGIAVVVNNDNKASQVSMAELADVFSGKLTTWDKIK SP118 nucleotide (SEQ ID NO:207) TTGTCAACAACAACATGCTACTTCTGAGGGGACGAATCAAAGGCAAAGCAGTTCAGCGAAAGTTCCATG GAAAGCTTCATACACCAACCTAAACAACCAGGTAAGTACAGAAGAGGTCAAATCTCTCTTATCAGCTCA CTTGGATCCAAATAGTGTTGATGCATTTTTTAATCTCGTTAATGACTATAATACCATTGTCGGCTCAAC TGGCTTATCAGGAGATTTCACTTCCTTTACTCACACCGAATACGATGTTGAGAAAATCAGTCATCTCTG GAATCAAAAGAAGGGCGATTTTGTTGGGACCAACTGCCGTATCAATAGTTATTGTCTTTTGAAAAATTC AGTCACCATTCCAAAGCTTGAAAAGAATGACCAGTTGCTTTTCCTAGATAATGATGCGATTGATAAAGG AAAGGTCTTTGATTCACAAGATAAGGAAGAGTTTGATATTCTATTTTCGAGAGTTCCAACTGAGTCAAC TACAGATGTCAAGGTTCACGCTGAAAAGATGGAAGCATTCTTCTCACAATTTCAATTCAATGAAAAAGC TCGAATGCTGTCTGTAGTCTTGCACGACAATTTGGATGGCGAGTATCTGTTTGTAGGCCACGTTGGGGT CTTAGTACCTGCTGATGACGGTTTCTTATTTGTAGAGAAATTGACTTTCGAAGAGCCCTACCAAGCGAT TAAATTTGCTAGTAAGGAAGATTGCTACAAGTATTTGGGCACCAAGTATGCGGATTATACAGGCGAGGG ACTGGCTAAGCCTTTTATCATGGATAATGATAAGTGGGTTAAACTT SP118 amino acid (SEQ ID NO:208) CQQQHATSEGTNQRQSSSAKVPWKASYTNLNNQVSTEEVKSLLSAHLDPNSVDAFFNLVNDYNTIVGST GLSGDFTSFTHTEYDVEKISHLWNQKKGDFVGTNCRINSYCLLKNSVTIPKLEKNDQLLFLDNDAIDKG 
KVFDSQDKEEFDILFSRVPTESTTDVKVHAEKMEAFFSQFQFNEKARMLSVVLHDNLDGEYLFVGHVGV LVPADDGFLFVEKLTFEEPYQAIKFASKEDCYKYLGTKYADYTGEGLAKPFIMDNDKWVKL SP119 nucleotide (SEQ ID NO:209) TTGTTCAGGCAAGTCCGTGACTAGTGAACACCAAACGAAAGATGAAATGAAGACGGAGCAGACAGCTAG TAAAACAAGCGCAGCTAAAGGGAAAGAGGTGGCTGATTTTGAATTGATGGGAGTAGATGGCAAGACCTA CCGTTTATCTGATTACAAGGGCAAGAAAGTCTATCTCAAATTCTGGGCTTCTTGGTGTTCCATCTGTCT GGCTAGTCTTCCAGATACGGATGAGATTGCTAAAGAAGCTGGTGATGACTATGTGGTCTTGACAGTAGT GTCACCAGGACATAAGGGAGAGCAATCTGAAGCGGACTTTAAGAATTGGTATAAGGGATTGGATTATAA AAATCTCCCAGTCCTAGTTGACCCATCAGGCAAACTTTTGGAAACTTATGGTGTCCGTTCTTACCCAAC CCAAGCCTTTATAGACAAAGAAGGCAAGCTGGTCAAAACACATCCAGGATTCATGGAAAAAGATGCAAT TTTGCAAACTTTGAAGGAATTAGCC

SP119 amino acid (SEQ ID NO:210) CSGKSVTSEHQTKDEMKTEQTASKTSAAKGKEVADFELMGVDGKTYRLSDYKGKKVYLKFWASWCSICL ASLPDTDEIAKEAGDDYVVLTVVSPGHKGEQSEADFKNWYKGLDYKNLPVLVDPSGKLLETYGVRSYPT QAFIDKEGKLVKTHPGFMEKDAILQTLKELA SP120 nucleotide (SEQ ID NO:211) CTCGCAAATTGAAAAGGCGGCAGTTAGCCAAGGAGGAAAAGCAGTGAAAAAAACAGAAATTAGTAAAGA CGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGGAGTGGAGGAATATTTCTCACGTGT TCCCGGGGTGACGGATGCCGTTTCAGGCTATGCAAATGGTAGAGGAGAAACAACCAAGTACGAATTGAT TAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGCCAAGCAAATTTCTCTCAAGGAAAT CCTGCTTCACTATTTCCGCATTATCAATCCAACCAGCAAAAATAAACAAGGAAATGATGTGGGGACCCA GTACCGTACTGGTGTTTATTACACAGATGACAAGGATTTGGAAGTGATTAACCAAGTCTTTGATGAGGT GGCTAAGAAATACGATCAACCTCTAGCAGTTGAAAAGGAAAACTTGAAGAATTTTGTGGTGGCTGAGGA TTACCATCAAGACTATCTCAAGAAAAATCCAAATGGCTACTGCCATATCAATGTTAATCAGGCGGCCTA TCCTGTCATTGATGCCAGCAAATATCCAAAACCAAGTGATGAGGAATTGAAAAAGACCCTGTCACCTGA GGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCTCAAACCGTTACTGGGATAAATTTGA ATCCGGTATCTATGTGGATATAGCAACTGGGGAACCTCTCTTTTCATCAAAAGACAAATTTGAGTCTGG TTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGTCACCTACAAGGAAGATAAGTCCTA CAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCACCTTGGGCATGTCTTTACGGATGG TCCACAGGACAAGGGCGGCTTACGTTACTGTATCAATAGCCTCTCTATCCGCTTTATTCCCAAAGACCA AATGGAAGAAAAAGGTACGCTTATTTAC SP120 amino acid (SEQ ID NO:212) SQIEKAAVSQGGKAVKKTEISKDADLHEIYLAGGCFWGVEEYFSRVPGVTDAVSGYANGRGETTKYELI NQTGHAETVHVTYDAKQISLKEILLHYFRIINPTSKNKQGNDVGTQYRTGVYYTDDKDLEVINQVFDEV AKKYDQPLAVEKENLKNFVVAEDYHQDYLKKNPNGYCHINVNQAAYPVIDASKYPKPSDEELKKTLSPE EYAVTQENQTERAFSNRYWDKFESGIYVDIATGEPLFSSKDKFESGCGWPSFTQPISPDVVTYKEDKSY NMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLSIRFIPKDQMEEKGTLIY SP121 nucleotide (SEQ ID NO:213) TTGTCAGTCAGGTTCTAATGGTTCTCAGTCTGCTGTGGATGCTATCAAACAAAAAGGGAAATTAGTTGT GGCAACCAGTCCTGACTATGCACCCTTTGAATTTCAATCATTGGTTGATGGAAAGAACCAGGTAGTCGG TGCAGACATCGACATGGCTCAGGCTATCGCTGATGAACTTGGGGTTAAGTTGGAAATCTCAAGCATGAG TTTTGACAATGTTTTGACCAGTCTTCAAACTGGTAAGGCTGACCTAGCAGTTGCAGGAATTAGTGCTAC TGACGAGAGAAAAGAAGTCTTTGATTTTTCAATCCCATACTATGAAAACAAGATTAGTTTCTTGGTTCG 
TAAGGCTGATGTGGAAAAATACAAGGATTTAACTAGCCTAGAAAGTGCTAATATTGCAGCCCAAAAAGG GACTGTTCCAGAATCAATGGTCAAGGAACAATTGCCAAAAGTTCAATTAACTTCCCTAACTAATATGGG TGAAGCAGTCAATGAATTGCAGGCTGGAAAAATAGATGCTGTTCATATGGATGAGCCTGTTGCACTTAG TTATGCTGCTAAAAACGCTGGCTTAGCTGTCGCAACTGTCAGCTTGAAGATGAAGGACGGCGACGCCAA TGCC SP121 amino acid (SEQ ID NO:214) CQSGSNGSQSAVDAIKQKGKLVVATSPDYAPFEFQSLVDGKNQVVGADIDMAQAIADELGVKLEISSMS FDNVLTSLQTGKADLAVAGISATDERKEVFDFSIPYYENKISFLVRKADVEKYKDLTSLESANIAAQKG TVPESMVKEQLPKVQLTSLTNMGEAVNELQAGKIDAVHMDEPVALSYAAKNAGLAVATVSLKMKDGDAN A SP122 nucleotide (SEQ ID NO:215) GGAAACTTCACAGGATTTTAAAGAGAAGAAAACAGCAGTCATTAAGGAAAAAGAAGTTGTTAGTAAAAA TCCTGTGATAGACAATAACACTAGCAATGAAGAAGCAAAAATCAAAGAAGAAAATTCCAATAAATCCCA AGGAGATTATACGGACTCATTTGTGAATAAAAACACAGAAAATCCCAAAAAAGAAGATAAAGTTGTCTA TATTGCTGAATTTAAAGATAAAGAATCTGGAGAAAAAGCAATCAAGGAACTATCCAGTCTTAAGAATAC AAAAGTTTTATATACTTATGATAGAATTTTTAACGGTAGTGCCATAGAAACAACTCCAGATAACTTGGA CAAAATTAAACAAATAGAAGGTATTTCATCGGTTGAAAGGGCACAAAAAGTCCAACCCATGATGAATCA TGCCAGAAAGGAAATTGGAGTTGAGGAAGCTATTGATTACCTAAAGTCTATCAATGCTCCGTTTGGGAA AAATTTTGATGGTAGAGGTATGGTCATTTCAAATATCGATACTGGAACAGATTATAGACATAAGGCTAT GAGAATCGATGATGATGCCAAAGCCTCAATGAGATTTAAAAAAGAAGACTTAAAAGGCACTGATAAAAA TTATTGGTTGAGTGATAAAATCCCTCATGCGTTCAATTATTATAATGGTGGCAAAATCACTGTAGAAAA ATATGATGATGGAAGGGATTATTTTGACCCACATGGGATGCATATTGCAGGGATTCTTGCTGGAAATGA TACTGAACAAGACATCAAAAACTTTAACGGCATAGATGGAATTGCACCTAATGCACAAATTTTCTCTTA CAAAATGTATTCTGACGCAGGATCTGGGTTTGCGGGTGATGAAACAATGTTTCATGCTATTGAAGATTC TATCAAACACAACGTTGATGTTGTTTCGGTATCATCTGGTTTTACAGGAACAGGTCTTGTAGGTGAGAA ATATTGGCAAGCTATTCGGGCATTAAGAAAAGCAGGCATTCCAATGGTTGTCGCTACGGGTAACTATGC GACTTCTGCTTCAAGTTCTTCATGGGATTTAGTAGCAAATAATCATCTGAAAATGACCGACACTGGAAA TGTAACACGAACTGCAGCACATGAAGATGCGATAGCGGTCGCTTCTGCTAAAAATCAAACAGTTGAGTT TGATAAAGTTAACATAGGTGGAGAAAGTTTTAAATACAGAAATATAGGGGCCTTTTTCGATAAGAGTAA AATCACAACAAATGAAGATGGAACAAAAGCTCCTAGTAAATTAAAATTTGTATATATAGGCAAGGGGCA AGACCAAGATTTGATAGGTTTGGATCTTAGGGGCAAAATTGCAGTAATGGATAGAATTTATACAAAGGA 
TTTAAAAAATGCTTTTAAAAAAGCTATGGATAAGGGTGCACGCGCCATTATGGTTGTAAATACTGTAAA TTACTACAATAGAGATAATTGGACAGAGCTTCCAGCTATGGGATATGAAGCGGATGAAGGTACTAAAAG TCAAGTGTTTTCAATTTCAGGAGATGATGGTGTAAAGCTATGGAACATGATTAATCCTGATAAAAAAAC TGAAGTCAAAAGAAATAATAAAGAAGATTTTAAAGATAAATTGGAGCAATACTATCCAATTGATATGGA AAGTTTTAATTCCAACAAACCGAATGTAGGTGACGAAAAAGAGATTGACTTTAAGTTTGCACCTGACAC AGACAAAGAACTCTATAAAGAAGATATCATCGTTCCAGCAGGATCTACATCTTGGGGGCCAAGAATAGA TTTACTTTTAAAACCCGATGTTTCAGCACCTGGTAAAAATATTAAATCCACGCTTAATGTTATTAATGG CAAATCAACTTATGGCTATATGTCAGGAACTAGTATGGCGACTCCAATCGTGGCAGCTTCTACTGTTTT GATTAGACCGAAATTAAAGGAAATGCTTGAAAGACCTGTATTGAAAAATCTTAAGGGAGATGACAAAAT AGATCTTACAAGTCTTACAAAAATTGCCCTACAAAATACTGCGCGACCTATGATGGATGCAACTTCTTG GAAAGAAAAAAGTCAATACTTTGCATCACCTAGACAACAGGGAGCAGGCCTAATTAATGTGGCCAATGC TTTGAGAAATGAAGTTGTAGCAACTTTCAAAAACACTGATTCTAAAGGTTTGGTAAACTCATATGGTTC CATTTCTCTTAAAGAAATAAAAGGTGATAAAAAATACTTTACAATCAAGCTTCACAATACATCAAACAG ACCTTTGACTTTTAAAGTTTCAGCATCAGCGATAACTACAGATTCTCTAACTGACAGATTAAAACTTGA TGAAACATATAAAGATGAAAAATCTCCAGATGGTAACCAAATTGTTCCAGAAATTCACCCAGAAAAAGT CAAAGGAGCAAATATCACATTTGAGCATGATACTTTCACTATAGGCGCAAATTCTAGCTTTGATTTGAA TGCGGTTATAAATGTTGGAGAGGCCAAAAACAAAAATAAATTTGTAGAATCATTTATTCATTTTGAGTC AGTGGAAGCGATGGAAGCTCTAAACTCCAGCGGGAAGAAAATAAACTTCCAACCTTCTTTGTCGATGCC TCTAATGGGATTTGCTGGGAATTGGAACCACGAACCAATCCTTGATAAATGGGCTTGGGAAGAAGGGTC AAGATCAAAAACACTGGGAGGTTATGATGATGATGGTAAACCGAAAATTCCAGGAACCTTAAATAAGGG AATTGGTGGAGAACATGGTATAGATAAATTTAATCCAGCAGGAGTTATACAAAATAGAAAAGATAAAAA TACAACATCCCTGGATCAAAATCCAGAATTATTTGCTTTCAATAACGAAGGGATCAACGCTCCATCATC AAGTGGTTCTAAGATTGCTAACATTTATCCTTTAGATTCAAATGGAAATCCTCAAGATGCTCAACTTGA AAGAGGATTAACACCTTCTCCACTTGTATTAAGAAGTGCAGAAGAAGGATTGATT SP122 amino acid (SEQ ID NO:216) ETSQDFKEKKTAVIKEKEVVSKNPVIDNNTSNEEAKIKEENSNKSQGDYTDSFVNKNTENPKKEDKVVY IAEFKDKESGEKAIKELSSLKNTKVLYTYDRIFNGSAIETTPDNLDKIKQIEGISSVERAQKVQPMMNH ARKEIGVEEAIDYLKSINAPFGKNFDGRGMVISNIDTGTDYRHKAMRIDDDAKASMRFKKEDLKGTDKN YWLSDKIPHAFNYYNGGKITVEKYDDGRDYFDPHGMHIAGILAGNDTEQDIKNFNGIDGIAPNAQIFSY 
KMYSDAGSGFAGDETMFHAIEDSIKHMVDVVSVSSGFTGTGLVGEKYWQAIRALRKAGIPMVVATGNYA TSASSSSWDLVANNHLKMTDTGNVTRTAAHEDAIAVASAKNQTVEFDKVNIGGESFKYRNIGAFFDKSK ITTNEDGTKAPSKLKFVYIGKGQDQDLIGLDLRGKIAVMDRIYTKDLKNAFKKAMDKGARAIMVVNTVN YYNRDNWTELPAMGYEADEGTKSQVFSISGDDGVKLWNMINPDKKTEVKRNNKEDFKDKLEQYYPIDME SFNSNKPNVGDEKEIDFKFAPDTDKELYKEDIIVPAGSTSWGPRIDLLLKPDVSAPGKNIKSTLNVING KSTYGYMSGTSMATPIVAASTVLIRPKLKEMLERPVLKNLKGDDKIDLTSLTKIALQNTARPMMDATSW KEKSQYFASPRQQGAGLINVANALRNEVVATFKNTDSKGLVNSYGSISLKEIKGDKKYFTIKLHNTSNR PLTFKVSASAITTDSLTDRLKLDETYKDEKSPDGKQIVPEIHPEKVKGANITFEHDTFTIGANSSFDLN AVINVGEAKNKNKFVESFIHFESVEAMEALNSSGKKINFQPSLSMPLMGFAGNWNHEPILDKWAWEEGS RSKTLGGYDDDGKPKIPGTLNKGIGGEHGIDKFNPAGVIQNRKDKNTTSLDQNPELFAFNNEGINAPSS SGSKIANIYPLDSNGNPQDAQLERGLTPSPLVLRSAEEGLI SP123 nucleotide (SEQ ID NO:217) TGTGGTCGAAGTTGAGACTCCTCAATCAATAACAAATCAGGAGCAAGCTAGGACAGAAAACCAAGTAGT AGAGACAGAGGAAGCTCCAAAAGAAGAAGCACCTAAAACAGAAGAAAGTCCAAAGGAAGAACCAAAATC GGAGGTAAAACCTACTGACGACACCCTTCCTAAAGTAGAAGAGGGGAAAGAAGATTCAGCAGAACCAGC TCCAGTTGAAGAAGTAGGTGGAGAAGTTGAGTCAAAACCAGAGGAAAAAGTAGCAGTTAAGCCAGAAAG TCAACCATCAGACAAACCAGCTGAGGAATCAAAAGTTGAACAAGCAGGTGAACCAGTCGCGCCAAGAGA AGACGAAAAGGCACCAGTCGAGCCAGAAAAGCAACCAGAAGCTCCTGAAGAAGAGAAGGCTGTAGAGGA AACACCGAAACAAGAAGAGTCAACTCCAGATACCAAGGCTGAAGAAACTGTAGAACCAAAAGAGGAGAC TGTTAATCAATCTATTGAACAACCAAAAGTTGAAACGCCTGCTGTAGAAAAACAAACAGAACCAACAGA GGAACCAAAAGTTGAACAAGCAGGTGAACCAGTCGCGCCAAGAGAAGACGAACAGGCACCAACGGCACC AGTTGAGCCAGAAAAGCAACCAGAAGTTCCTGAAGAAGAGAAGGCTGTAGAGGAAACACCGAAACCAGA AGATAAAATAAAGGGTATTGGTACTAAAGAACCAGTTGATAAAAGTGAGTTAAATAATCAAATTGATAA AGCTAGTTCAGTTTCTCCTACTGATTATTCTACAGCAAGTTACAATGCTCTTGGACCTGTTTTAGAAAC TGCAAAAGGTGTCTATGCTTCAGAGCCTGTAAAACAGCCTGAGGTAAATAGCGAGACAAATAAACTTAA AACGGCTATTGACGCTCTAAACGTTGATAAAACTGAATTAAACAATACGATTGCAGATGCAAAAACAAA GGTAAAAGAACATTACAGTGATAGAAGTTGGCAAAACCTCCAAACTGAAGTTACAAAGGCTGAAAAAGT TGCAGCTAATACAGATGCTAAACAAAGTGAAGTTAACGAAGCTGTTGAAAAATTAACTGCAACTATTGA AAAATTGGTTGAATTATCTGAAAAGCCAATATTAACATTGACTAGTACCGATAAGAAAATATTGGAACG 
TGAAGCTGTTGCTAAGTATACTCTAGAAAATCAAAACAAAACAAAAATCAAATCAATCACAGCTGAATT GAAAAAAGGAGAAGAAGTTATTAATACTGTAGTCCTTACAGATGACAAGGTAACAACAGAAACTATAAG CGCTGCATTTAAGAACCTAGAGTACTACAAAGAATACACCCTATCTACAACTATGATTTACGACAGAGG TAACGGTGAAGAAACTGAAACTCTAGAAAATCAAAATATTCAATTAGATCTTAAAAAAGTTGAGCTTAA AAATATTAAACGTACAGATTTAATCAAATACGAAAATGGAAAAGAAACTAATGAATCACTGATAACAAC TATTCCTGATGATAAGAGCAATTATTATTTAAAAATAACTTCAAATAATCAGAAAACTACATTACTAGC TGTTAAAAATATAGAAGAAACTACGGTTAACGGAACACCTGTATATAAAGTTACAGCAATCGCAGACAA TTTAGTCTCTAGAACTGCTGATAATAAATTTGAAGAAGAA SP123 amino acid (SEQ ID NO:218) VVEVETPQSITNQEQARTENQVVETEEAPKEEAPKTEESPKEEPKSEVKPTDDTLPKVEEGKEDSAEPA PVEEVGGEVESKPEEKVAVKPESQPSDKPAEESKVEQAGEPVAPREDEKAPVEPEKQPEAPEEEKAVEE TPKQEESTPDTKAEETVEPKEETVNQSIEQPKVETPAVEKQTEPTEEPKVEQAGEPVAPREDEQAPTAP VEPEKQPEVPEEEKAVEETPKPEDKIKGIGTKEPVDKSELNNQIDKASSVSPTDYSTASYNALGPVLET AKGVYASEPVKQPEVNSETNKLKTAIDALNVDKTELNNTIADAKTKVKEHYSDRSWQNLQTEVTKAEKV AANTDAKQSEVNEAVEKLTATIEKLVELSEKPILTLTSTDKKILEREAVAKYTLENQNKTKIKSITAEL KKGEEVINTVVLTDDKVTTETISAAFKNLEYYKEYTLSTTMIYDRGNGEETETLENQNIQLDLKKVELK NIKRTDLIKYENGKETNESLITTIPDDKSNYYLKITSNNQKTTLLAVKNIEETTVNGTPVYKVTAIADN LVSRTADNKFEEE SP124 nucleotide (SEQ ID NO:219) AACACCTGTATATAAAGTTACAGCAATCGCAGACAATTTAGTCTCTAGAACTGCTGATAATAAATTTGA AGAAGAATACGTTCACTATATTGAAAAACCTAAAGTCCACGAAGATAATGTATATTATAATTTCAAAGA ATTAGTGGAAGCTATTCAAAACGATCCTTCAAAAGAATATCGTCTGGGACAATCAATGAGCGCTAGAAA TGTTGTTCCTAATGGAAAATCATATATCACTAAAGAATTCACAGGAAAACTTTTAAGTTCTGAAGGAAA ACAATTTGCTATTACTGAATTGGAACATCCATTATTTAATGTGATAACAAACGCAACGATAAATAATGT GAATTTTGAAAATGTAGAGATAGAACGTTCTGGTCAAGATAATATTGCATCATTAGCCAATACTATGAA AGGTTCTTCAGTTATTACAAATGTCAAAATTACAGGCACACTTTCAGGTCGTAATAATGTTGCTGGATT TGTAAATAATATGAATGATGGAACTCGTATTGAAAATGTTGCTTTCTTTGGCAAACTACACTCTACAAG TGGAAATGGCTCTCATACAGGGGGAATTGCAGGTACAAACTATAGAGGAATTGTTAGAAAAGCATATGT TGATGCTACTATTACAGGAAACAAAACACGCGCCAGCTTGTTAGTTCCTAAAGTAGATTATGGATTAAC TCTAGACCATCTTATTGGTACAAAAGCTCTCCTAACTGAGTCGGTTGTAAAAGGTAAAATAGATGTTTC 
AAATCCAGTAGAAGTTGGAGCAATAGCAAGTAAGACTTGGCCTGTAGGTACGGTAAGTAATTCTGTCAG CTATGCTAAGATTATCCGTGGAGAGGAGTTATTCGGCTCTAACGACGTTGATGATTCTGATTATGCTAG TGCTCATATAAAAGATTTATATGCGGTAGAGGGATATTCGTCAGGTAATAGATCATTTAGGAAATCTAA AACATTTACTAAATTAACTAAAGAACAAGCTGATGCTAAAGTTACTACTTTCAATATTACTGCTGATAA ATTAGAAAGTGATCTATCTCCTCTTGCAAAACTTAATGAAGAAAAAGCCTATTCTAGTATTCAAGATTA TAACGCTGAATATAACCAAGCCTATAAAAATCTTGAAAAATTAATACCATTCTACAATAAAGATTATAT TGTATATCAAGGTAATAAATTAAATAAAGAACACCATCTAAATACTAAAGAAGTTCTTTCTGTTACCGC GATGAACAACAATGAGTTTATCACAAACCTAGATGAAGCTAATAAAATTATTGTTCACTATGCGGACGG TACAAAAGATTACTTTAACTTGTCTTCTAGCAGTGAAGGTTTAAGTAATGTAAAAGAATATACTATAAC TGACTTAGGAATTAAATATACACCTAATATCGTTCAAAAAGATAACACTACTCTTGTTAATGATATAAA ATCTATTTTAGAATCAGTAGAGCTTCAGTCTCAAACGATGTATCAGCATCTAAATCGATTAGGTGACTA TAGAGTTAATGCAATCAAAGATTTATATTTAGAAGAAAGCTTCACAGATGTTAAAGAAAACTTAACAAA CCTAATCACAAAATTAGTTCAAAACGAAGAACATCAACTAAATGATTCTCCAGCTGCTCGTCAAATGAT TCGTGATAAAGTCGAGAAAAACAAAGCAGCTTTATTACTAGGTTTAACTTACCTAAATCGTTACTATGG AGTTAAATTTGGTGATGTTAATATTAAAGAATTAATGCTATTCAAACCAGATTTCTATGGTGAAAAAGT TAGCGTATTAGACAGATTAATTGAAATCGGTTCTAAAGAGAACAACATTAAAGGTTCACGTACATTCGA CGCATTCGGTCAAGTA SP124 amino acid (SEQ ID NO:220) TPVYKVTAIADNLVSRTADNKFEEEYVHYIEKPKVHEDNVYYNFKELVEAIQNDPSKEYRLGQSMSARN VVPNGKSYITKEFTGKLLSSEGKQFAITELEHPLFNVITNATINNVNFENVEIERSGQDNIASLANTMK GSSVITNVKITGTLSGRNNVAGFVNNMNDGTRIENVAFFGKLHSTSGNGSHTGGIAGTNYRGIVRKAYV DATITGNKTRASLLVPKVDYGLTLDHLIGTKALLTESVVKGKIDVSNPVEVGAIASKTWPVGTVSNSVS YAKIIRGEELFGSNDVDDSDYASAHIKDLYAVEGYSSGNRSFRKSKTFTKLTKEQADAKVTTFNITADK LESDLSPLAKLNEEKAYSSIQDYNAEYNQAYKNLEKLIPFYNKDYIVYQGNKLNKEHHLNTKEVLSVTA MNNNEFITNLDEANKIIVHYADGTKDYFNLSSSSEGLSNVKEYTITDLGIKYTPNIVQKDNTTLVNDIK SILESVELQSQTMYQHLNRLGDYRVNAIKDLYLEESFTDVKENLTNLITKLVQNEEHQLNDSPAARQMI RDKVEKNKAALLLGLTYLNRYYGVKFGDVNIKELMLFKPDFYGEKVSVLDRLIEIGSKENNIKGSRTFD AFGQV SP125 nucleotide (SEQ ID NO:221) ATTAGACAGATTAATTGAAATCGGTTCTAAAGAGAACAACATTAAAGGTTCACGTACATTCGACGCATT CGGTCAAGTATTGGCTAAATATACTAAATCAGGTAATTTAGATGCATTTTTAAATTATAATAGACAATT 
GTTCACAAATATAGACAATATGAACGATTGGTTTATTGATGCTACAGAAGACCATGTCTACATCGCAGA ACGCGCTTCTGAGGTCGAAGAAATTAAAAATTCTAAACATCGTGCATTCGATAATTTAAAACGAAGTCA CCTTAGAAATACTATACTCCCACTACTGAATATTGATAAAGCACATCTTTATTTAATTTCAAATTATAA TGCAATTGCCTTTGGTAGTGCAGAGCGATTAGGTAAAAAATCATTAGAAGATATTAAAGATATCGTTAA CAAAGCTGCAGATGGTTATAGAAACTATTATGATTTCTGGTATCGTCTAGCGTCTGATAACGTTAAACA ACGACTACTAAGAGATGCTGTTATTCCTATTTGGGAAGGTTATAACGCTCCTGGTGGATGGGTTGAAAA ATATGGCCGCTATAATACCGACAAAGTATATACTCCTCTTAGAGAATTCTTTGGTCCTATGGATAAGTA TTATAATTATAATGGAACAGGAGCTTATGCTGCTATATATCCTAACTCTGATGATATTAGAACTGATGT AAAATATGTTCATTTAGAAATGGTTGGTGAATACGGTATTTCAGTTTACACACATGAAACAACACACGT CAACGACCGTGCGATTTACTTAGGTGGCTTTGGACACCGTGAAGGTACTGATGCTGAAGCATATGCTCA GGGTATGCTACAAACTCCTGTTACTGGTAGTGGATTTGATGAGTTTGGTTCTTTAGGTATTAATATGGT ATTTAAACGCAAAAATGATGGGAATCAGTGGTATATTACAGATCCAAAAACTCTAAAAACACGAGAAGA TATTAATAGATATATGAAGGGTTATAATGACACTTTAACTCTTCTTGATGAAATTGAGGCTGAATCTGT GATTTCTCAACAAAATAAAGATTTAAATAGTGCATGGTTCAAAAAAATAGATAGAGAATACCGTGATAA CAATAAATTAAATCAATGGGATAAAATTCGAAATCTAAGTCAAGAAGAGAAAAATGAATTAAATATTCA ATCTGTTAATGATTTAGTTGATCAACAATTAATGACTAATCGCAATCCAGGTAATGGTATCTATAAACC CGAAGCAATTAGCTATAACGATCAATCACCTTATGTAGGTGTTAGAATGATGACCGGTATCTACGGAGG TAATACTAGTAAAGGTGCTCCTGGAGCTGTTTCATTCAAACATAATGCTTTTAGATTATGGGGTTACTA CGGATACGAAAATGGGTTCTTAGGTTATGCTTCAAATAAATATAAACAACAATCTAAAACAGATGGTGA GTCTGTTCTAAGTGATGAATATATTATCAAGAAAATATCTAACAATACATTTAATACTATTGAAGAATT TAAAAAAGCTTACTTCAAAGAAGTTAAAGATAAAGCAACGAAAGGATTAACAACATTCGAAGTAAATGG TTCTTCCGTTTCATCATACGATGATTTACTGACATTGTTTAAAGAAGCTGTTAAAAAAGATGCCGAAAC TCTTAAACAAGAAGCAAACGGTAATAAAACAGTATCTATGAATAATACAGTTAAATTAAAAGAAGCTGT TTATAAGAAACTTCTTCAACAAACAAATAGCTTTAAAACTTCAATCTTTAAA SP125 amino acid (SEQ ID NO:222) LDRLIEIGSKENNIKGSRTFDAFGQVLAKYTKSGNLDAFLNYNRQLFTNIDNMNDWFIDATEDHVYIAE RASEVEEIKNSKHRAFDNLKRSHLRNTILPLLNIDKAHLYLISNYNAIAFGSAERLGKKSLEDIKDIVN KAADGYRNYYDFWYRLASDNVKQRLLRDAVIPIWEGYNAPGGWVEKYGRYNTDKVYTPLREFFGPMDKY YNYNGTGAYAAIYPNSDDIRTDVKYVHLEMVGEYGISVYTHETTHVNDRAIYLGGFGHREGTDAEAYAQ 
GMLQTPVTGSGFDEFGSLGINMVFKRKNDGNQWYITDPKTLKTREDINRYMKGYNDTLTLLDEIEAESV ISQQNKDLNSAWFKKIDREYRDNNKLNQWDKIRNLSQEEKNELNIQSVNDLVDQQLMTNRNPGNGIYKP EAISYNDQSPYVGVRMMTGIYGGNTSKGAPGAVSFKHNAFRLWGYYGYENGFLGYASNKYKQQSKTDGE SVLSDEYIIKKISNNTFNTIEEFKKAYFKEVKDKATKGLTTFEVNGSSVSSYDDLLTLFKEAVKKDAET LKQEANGNKTVSMNNTVKLKEAVYKKLLQQTNSFKTSIFK SP126 nucleotide (SEQ ID NO:223) TAAGACAGATGAACGGAGCAAGGTGTTTGACTTTTCCATTCCCTACTATACTGCAAAAAATAAACTCAT TGTCAAAAAATCTGACTTGACTACTTATCAGTCTGTAAACGACTTGGCGCAGAAAAAGGTTGGAGCGCA GAAAGGTTCGATTCAAGAGACGATGGCGAAAGATTTGCTACAAAATTCTTCCCTCGTATCTCTGCCTAA AAATGGGAATTTAATCACAGATTTAAAATCAGGACAAGTGGATGCCGTTATCTTTGAAGAACCTGTTTC CAAGGGATTTGTGGAAAATAATCCTGATTTAGCAATCGCAGACCTCAATTTTGAAAAAGAGCAAGATGA TTCCTACGCGGTAGCCATgAAAAAAGATAGCAAGAAATTGAAGAGGCAGTTCGATAAAACCATTCAAAA GTTGAAGGAGTCTGGGGAATTAGACAAACTCATTGAGGAAGCCTTA SP126 amino acid (SEQ ID NO:224) KTDERSKVFDFSIPYYTAKNKLIVKKSDLTTYQSVNDLAQKKVGAQKGSIQETMAKDLLQNSSLVSLPK NGNLITDLKSGQVDAVIFEEPVSKGFVENNPDLAIADLNFEKEQDDSYAVAMKKDSKKLKRQFDKTIQK LKESGELDKLIEEAL SP127 nucleotide (SEQ ID NO:225) CTGTGAGAATCAAGCTACACCCAAAGAGACTAGCGCTCAAAAGACAATCGTCCTTGCTACAGCTGGCGA CGTGCCACCATTTGACTACGAAGACAAGGGCAATCTGACAGGCTTTGATATCGAAGTTTTAAAGGCAGT AGATGAAAAACTCAGCGACTACGAGATTCAATTCCAAAGAACCGCCTGGGAGAGCATCTTCCCAGGACT TGATTCTGGTCACTATCAGGCTGCGGCCAATAACTTGAGTTACACAAAAGAGCGTGCTGAAAAATACCT TTACTCGCTTCCAATTTCCAACAATCCCCTCGTCCTTGTCAGCAACAAGAAAAATCCTTTGACTTCTCT TGACCAGATCGCTGGTAAAACAACACAAGAGGATACCGGAACTTCTAACGCTCAATTCATCAATAACTG GAATCAGAAACACACTGATAATCCCGCTACAATTAATTTTTCTGGTGAGGATATTGGTAAACGAATCCT AGACCTTGCTAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCAAAAGATTATCAAGGA CCGTGGTTTAGACCTCTCAGTCGTTGATTTACCTTCTGCAGATAGCCCCAGCAATTATATCATTTTCTC AAGCGACCAAAAAGAGTTTAAAGAGCAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCT TGAAAAACTCAGCAATACCTATCTAGGTGGTTCTTACCTCCCAGATCAATCTCAGTTACAA SP127 amino acid (SEQ ID NO:226) CENQATPKETSAQKTIVLATAGDVPPFDYEDKGNLTGFDIEVLKAVDEKLSDYEIQFQRTAWESIFPGL DSGHYQAAANNLSYTKERAEKYLYSLPISNNPLVLVSNKKNPLTSLDQIAGKTTQEDTGTSNAQFINNW 
NQKHTDNPATINFSGEDIGKRILDLANGEFDFLVFDKVSVQKIIKDRGLDLSVVDLPSADSPSNYIIFS SDQKEFKEQFDKALKELYQDGTLEKLSNTYLGGSYLPDQSQLQ
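The nucleotide and amino acid listings above can be cross-checked against each other. The following is a minimal sketch, not part of the disclosed methods, that translates the opening bases of the SP126 nucleotide sequence (SEQ ID NO:223) with the standard genetic code and compares the result to the opening residues of SEQ ID NO:224. The function name `translate` and the `frame` parameter are illustrative assumptions; the offset of one base is simply what reproduces the listed residues for this fragment.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, frame: int = 0) -> str:
    """Translate DNA into one-letter amino acids, starting at offset `frame`.

    Trailing bases that do not fill a complete codon are ignored.
    """
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First 37 bases of the SP126 nucleotide listing (SEQ ID NO:223).
SP126_START = "TAAGACAGATGAACGGAGCAAGGTGTTTGACTTTTCC"

# Reading from the second base reproduces the start of SEQ ID NO:224.
print(translate(SP126_START, frame=1))
```

The same check can be applied to any nucleotide and amino acid pair in the listing. A mismatch points to a transcription error in one of the two sequences or to a different reading-frame offset.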

TABLE 2
______________________________________
S. pneumoniae Antigenic Epitopes
______________________________________
SP001 Lys-1 to Ile-10; Leu-13 to Lys-32; Arg-41 to Ile-51; Ser-85 to Glu-97; Ala-159 to His-168; Val-309 to Thr-318; Val-341 to Asn-352; Asn-415 to Met-430; Phe-454 to Asn-464; Ser-573 to Gly-591; Asn-597 to Thr-641; and Asn-644 to Ala-664.
SP004 Thr-9 to Thr-24; Ile-29 to Ala-48; Thr-49 to Val-56; Val-286 to Val-312; Pro-316 to Glu-344; Val-345 to Ile-367; Gln-368 to Val-399; Ser-400 to Glu-431; Asn-436 to Ala-457; Ile-467 to Ala-498; and Thr-499 to Glu-540.
SP006 Glu-1 to Lys-13; Pro-24 to Gly-36; Val-104 to Thr-112; Ala-118 to Asn-130; Trp-137 to Ala-146; Ser-151 to Ile-159; Ile-181 to Leu-188; and Pro-194 to Tyr-202.
SP007 Gly-1 to Asn-7; Tyr-24 to Gln-34; His-47 to Phe-55; Ser-60 to Ala-67; Ala-122 to Leu-129; Leu-221 to Lys-230; Val-236 to Phe-256; Asp-271 to Gly-283; and Leu-291 to Asp-297.
SP008 Leu-4 to Lys-17; Gln-24 to Leu-32; Asp-60 to Ser-66; Ser-70 to Asp-76; Ala-276 to Lys-283; Asn-304 to Lys-311; and Thr-429 to Pro-437.
SP009 Thr-4 to Glu-11; Leu-50 to Asp-60; Ile-102 to Trp-123; and Ser-138 to Ile-157.
SP010 Phe-34 to Gly-41; Asp-44 to Lys-50; Leu-172 to Val-186; Leu-191 to Val-198; Ser-202 to Ile-209; and Val-213 to Leu-221.
SP011 Asn-2 to Thr-10; Asp-87 to Ala-102; Tyr-125 to Glu-132; Thr-181 to Tyr-189; Arg-217 to Thr-232; Asn-257 to Lys-264; Pro-271 to Ser-278; Tyr-317 to Ala-325; Glu-327 to Pro-337; and Thr-374 to Val-381.
SP012 Gly-1 to Lys-19; Phe-34 to Tyr-41; Leu-109 to Lys-126; and Leu-231 to Glu-247.
SP013 Ala-1 to Lys-12; Ile-42 to Pro-53; Leu-138 to Lys-146; Ile-205 to Lys-217; Ser-235 to Ile-251; and Ser-261 to Tyr-272.
SP014 Gly-1 to Val-16; Leu-35 to Leu-44; Asp-73 to Asp-81; Ile-83 to Asp-92; Glu-145 to Ile-153; Phe-188 to Asn-196; Ser-208 to Phe-215; Ile-224 to Leu-231; and Asn-235 to Ala-243.
SP015 Ser-1 to Pro-16; Asn-78 to Glu-88; Ala-100 to Val-108; Ala-122 to Thr-129; Thr-131 to Ser-137; Leu-201 to Ser-220; and Gly-242 to Val-251.
SP016 Gly-1 to Glu-20; Thr-30 to Val-38; Gln-94 to Asn-105; Lys-173 to Pro-182; Gly-189 to Arg-197; Ser-207 to Val-224; Pro-288 to Leu-298; Ala-327 to Ala-342; and Ser-391 to Ala-402.
SP017 Ser-1 to Thr-12; Ala-36 to Tyr-45; Gln-48 to Ile-54; Lys-59 to Lys-76; Tyr-113 to Leu-138; and Phe-212 to Asp-219.
SP019 Val-97 to Glu-117; Asp-163 to Leu-169; Thr-182 to Thr-191; and Lys-241 to Ser-250.
SP020 Asn-18 to Lys-25; Thr-47 to Glu-60; Trp-75 to Val-84; Gly-102 to Val-110; Pro-122 to Ala-131; and Glu-250 to Pro-258.
SP021 Ser-1 to Asp-8; Val-44 to Asp-54; Ala-117 to Val-125; Thr-165 to Thr-173; and Glu-180 to Pro-189.
SP022 Phe-5 to Lys-13; Thr-20 to Ser-36; Glu-59 to Lys-81; Tyr-85 to Gly-93; Trp-94 to Trp-101; and Thr-195 to Trp-208.
SP023 Gln-45 to Glu-59; Asp-69 to Pro-85; Lys-111 to Asn-121; Pro-218 to Ala-228; and Glu-250 to Asn-281.
SP025 Gln-14 to Thr-20; Gly-27 to Phe-33; Gly-63 to Glu-71; and Ile-93 to Phe-102.
SP028 Asp-171 to Pro-179; Tyr-340 to Glu-350; Pro-455 to Tyr-463; and Asp-474 to Pro-480.
SP030 Leu-22 to Leu-37; Trp-81 to Ala-90; Phe-101 to Ala-106; Thr-124 to Tyr-130; and Asn-138 to Glu-144.
SP031 Asp-8 to Val-16; Gly-27 to Thr-35; Gly-178 to Asp-195; Thr-200 to Asp-209; Trp-218 to Leu-224; and Lys-226 to Asp-241.
SP032 Ser-9 to Asp-28; Phe-31 to Val-40; Gly-42 to Arg-50; Ile-52 to Leu-60; Asp-174 to Phe-186; Leu-324 to Met-333; and Thr-340 to Asn-347.
SP033 Gln-2 to Ile-13; Phe-46 to Ile-53; and Asp-104 to Thr-121.
SP034 Glu-36 to Gly-43; Ala-188 to Asp-196; Trp-313 to Gly-320; and Leu-323 to Leu-329.
SP035 Arg-19 to Asp-36; Asp-47 to Val-57; Asn-134 to Thr-143; Asp-187 to Arg-196; and Glu-222 to Ser-230.
SP036 Arg-10 to Arg-17; Lys-29 to Ser-39; Ser-140 to Ala-153; Arg-158 to Tyr-169; Asp-175 to Ala-183; Gly-216 to Asn-236; Ala-261 to Leu-270; Arg-282 to Phe-291; Thr-297 to Ala-305; Pro-342 to Gln-362; Phe-455 to Asp-463; His-497 to Thr-511; Ala-521 to Gly-529; Ile-537 to Val-546; Ile-556 to Ala-568; Pro-581 to Ser-595; Glu-670 to Ala-685; Ser-696 to Ala-705; and Leu-782 to Ser-791.
SP038 Glu-61 to Pro-69; Phe-107 to Ala-115; Leu-130 to Tyr-141; Ala-229 to Glu-237; Ser-282 to Asn-287; Ala-330 to Glu-338; and Tyr-387 to Glu-393.
SP039 Ser-28 to Asp-35; Pro-88 to Pro-96; Leu-125 to Arg-135; Phe-149 to Leu-157; Gln-246 to Val-254; Ala-357 to Thr-362; Gly-402 to Lys-411; and Leu-440 to Pro-448.
SP040 Thr-21 to Ile-30; His-54 to Gln-68; Arg-103 to Leu-117; and Thr-127 to Leu-136.
SP041 Gly-36 to Asp-49; Leu-121 to Val-128; and Ala-186 to Ile-196.
SP042 Gly-11 to Arg-19; Ile-23 to Lys-31; His-145 to Asn-151; Gln-159 to Asp-166; Ile-175 to Asp-181; Gly-213 to Tyr-225; Ile-283 to Val-291; Pro-329 to Glu-364; Arg-372 to Ser-386; Thr-421 to Phe-430; Leu-445 to Val-453; Ile-486 to Ala-497; Asp-524 to Ala-535; His-662 to Gly-674; and His-679 to Gln-702.
SP043 Lys-2 to Asp-12; Val-58 to Asn-68; Ser-87 to Asp-95; and Asp-102 to Lys-117.
SP044 Gln-3 to Lys-11; Asp-37 to Tyr-52; Glu-171 to Leu-191; His-234 to Asn-247; and Asn-283 to Ala-291.
SP045 Tyr-52 to Ile-63; Asp-212 to Gln-227; Ser-315 to Thr-332; Leu-345 to Phe-354; Asp-362 to Val-370; Thr-518 to Asn-539; Ala-545 to Lys-559; and Val-601 to Pro-610.
SP046 Gln-9 to Ala-18; Glu-179 to Lys-186; Lys-264 to Glu-271; Gly-304 to Glu-17; Ser-503 to Asn-511; Asn-546 to Thr-553; and Asn-584 to Asp-591.
SP048 Tyr-4 to Asp-25; Lys-33 to Val-70; Asp-151 to Thr-170; Asp-222 to Val-257; Thr-290 to Phe-301; and Gly-357 to Val-367.
SP049 Ala-23 to Arg-37; Tyr-85 to Gln-95; Glu-106 to Ile-118; Arg-131 to Ile-144; Gly-150 to Ser-162; and Ala-209 to Asp-218.
SP050 Asp-95 to Glu-113; Gly-220 to Gly-228; Asn-284 to Glu-295; and Thr-298 to Val-315.
SP051 Lys-16 to Glu-50; Lys-57 to Asn-104; Ser-158 to Trp-173; Asp-265 to Pro-279; Val-368 to Tyr-386; Glu-420 to Ile-454; Pro-476 to Ile-516; Phe-561 to Gly-581; Thr-606 to Gly-664; and Glu-676 to Val-696.
SP052 Asn-41 to Tyr-60; Phe-80 to Glu-103; Ala-117 to Val-139; Ile-142 to Leu-155; Val-190 to Lys-212; Glu-276 to Phe-283; Arg-290 to Ser-299; Leu-328 to Val-351; Gly-358 to Thr-388; Glu-472 to Ala-483; Val-533 to Asn-561; Asp-595 to Val-606; Glu-609 to Val-620; and Glu-672 to Ser-691.
SP053 Ala-62 to Val-101; Thr-147 to Leu-174; Lys-204 to Val-216; Gln-228 to Val-262; Ser-277 to Gly-297; Thr-341 to Glyn-368; Thr-385 to Ala-409; Thr-414 to Ser-453; Asn-461 to Leu-490; Glu-576 to Thr-625; Gly-630 to Arg-639; and Asp-720 to Leu-740.
SP054 Glu-7 to Val-28; and Tyr-33 to Glu-44.
SP055 Pro-3 to Val-18; Thr-21 to Lys-53; Val-84 to Lys-99; Ile-162 to Val-172; and Val-204 to Ser-241.
SP056 Val-34 to Tyr-41; Leu-47 to Glu-55; and Pro-57 to Gln-66.
SP057 Asp-1 to Val-25; Pro-29 to Ile-80; Asn-96 to Val-145; and Pro-150 to Glu-172.
SP058 Ala-64 to Thr-70; Leu-82 to His-138; and Val-228 to Asn-236.
SP059 Val-10 to Thr-24; Ser-76 to Pro-102; Ser-109 to Ile-119; Ser-124 to Val-130; Thr-186 to Ile-194; and Asn-234 to Ser-243.
SP060 Leu-70 to Arg-76; and Val-79 to Ile-88.
SP062 Glu-14 to Lys-28; Ser-32 to Lys-46; and Glu-66 to Thr-74.
SP063 Ile-10 to Val-25; Val-30 to Thr-40; Asp-44 to Pro-54; Asn-57 to Val-63; Pro-71 to Val-100; and Thr-105 to Thr-116.
SP064 Pro-12 to Leu-32; Val-40 to Leu-68; Asp-95 to Ala-125; Ser-164 to Glu-184; Ser-314 to Glu-346; Asn-382 to Val-393; Leu-463 to Gln-498; Asn-534 to Lys-548; and Lys-557 to Gly-605.
SP065 Asn-2 to Ile-12; Ala-39 to Thr-61; and His-135 to Ala-155.
SP067 Gly-1 to Thr-13; Asp-203 to Asn-218; and Gly-240 to Asp-253.
SP068 Ser-2 to Ser-12; Val-17 to Gln-26; and Lys-54 to Cys-67.
SP069 Ser-32 to Thr-41; Pro-66 to Glu-80; Thr-110 to Val-122; and Val-147 to Thr-180.
SP070 Lys-6 to Tyr-16; Gln-19 to Ile-27; Arg-50 to Ala-58; Leu-112 to Val-128; Ile-151 to Asn-167; and Leu-305 to Phe-321.
SP071 Gln-92 to Asn-158; Gln-171 to Gln-188; Val-204 to Val-240; Thr-247 to Ala-273; Glu-279 to Thr-338; Pro-345 to Glu-368; Asn-483 to Lys-539; Val-552 to Ala-568; Glu-575 to Ser-591; Ser-621 to Gly-640; and Gln-742 to Gly-758.
SP072 Val-68 to Tyr-81; Tyr-86 to Val-121; Leu-127 to Gly-140; Gly-144 to Ala-155; Gln-168 to Val-185; Asp-210 to Try-241; Glu-246 to Thr-269; Lys-275 to Tyr-295; Gly-303 to Pro-320; Arg-327 to Ile-335; Thr-338 to Thr-364; Tyr-478 to Phe-495; and Tyr-499 to Arg-521.
SP073 Glu-37 to Val-45; Glu-55 to Val-68; Thr-104 to Thr-119; Ile-127 to Tyr-135; Asn-220 to Ile-232; Thr-237 to Ala-250; Ser-253 to Ala-263; Glu-284 to Ile-297; and Met-438 to Asn-455.
SP074 Gly-2 to Ala-12; Gly-96 to Ile-110; and Thr-220 to Phe-239.
SP075 Phe-33 to Tyr-42; Gln-93 to Gly-102; and Val-196 to Asp-211.
SP076 Ser-64 to Leu-76; and Phe-81 to Ala-101.
SP077 Asp-1 to Glu-12; Tyr-26 to Val-36; and Val-51 to Try-62.
SP078 Ala-193 to Ile-208; Tyr-266 to Asn-275; Glu-356 to Leu-369; Ala-411 to Gly-422; Ser-437 to Pro-464; Thr-492 to Glu-534; and Glu-571 to Gln-508.
SP079 Gly-11 to Leu-20; Lys-39 to Leu-48; Leu-72 to Val-85; Asn-147 to Ser-158; Ile-178 to Asp-187; Tyr-189 to Gln-201; and Leu-203 to Ala-216.
SP080 Ser-2 to Glu-12; Gln-42 to Ala-51; Ala-116 to Ser-127; Phe-131 to Asp-143; and Ile-159 to Ile-171.
SP081 Gln-2 to Leu-9; Gln-49 to Cys-57; Ile-108 to Val-131; Gly-134 to Leu-145; and Trp-154 to Cys-162.
SP082 Ile-101 to Ser-187; Gly-191 to Asn-221; Arg-225 to Arg-236; Tyr-239 to Leu-255; and Gly-259 to Arg-268.
SP083 Ser-28 to Asp-70.
SP084 Leu-42 to Gln-66; Thr-69 to Lys-81; Glu-83 to Arg-92; and Gly-98 to Asn-110.
SP085 Gln-2 to Val-22; and Ser-45 to Glu-51.
SP086 Leu-18 to Gln-65; and Lys-72 to Val-83.
SP087 Ser-45 to Leu-53; and Thr-55 to Gln-63.
SP088 Pro-8 to Ile-16; Leu-25 to Trp-33; Tyr-35 to Gln-43; Leu-51 to Val-59; Val-59 to Arg-67; Thr-55 to Tyr-63; Asn-85 to Gly-93; Thr-107 to Leu-115; Leu-115 to Trp-123; Ala-121 to Thr-129; Tyr-153 to Ala-161; His-176 to Gly-184; Tyr-194 to Ala-202; Ala-217 to Gly-225; and Asn-85 to Gly-93.
SP089 Trp-43 to Ala-51; Gln-68 to Phe-76; Val-93 to Gln-101; Phe-106 to Phe-114; Lys-117 to Lys-125; Trp-148 to Phe-156; Glu-168 to Gln-176; Ile-193 to Tyr-201; Lys-203 to Lys-211; Glu-212 to Gln-220; Ile-237 to Tyr-245; Lys-247 to Lys-255; Glu-256 to Gln-264; Met-275 to Gly-283; Lys-286 to Gly-294; Trp-292 to Glu-300; Asp-289 to Thr-297; Tyr-315 to Ser-323; Asp-334 to Lys-342; Pro-371 to Arg-379; Arg-485 to Asn-493; Lys-527 to Arg-535; Phe-537 to Met-545; and Tyr-549 to Glu-557.
SP090 Phe-2 to Gln-10; Gln-13 to Lys-21; Tyr-19 to Glu-27; Tyr-39 to Met-47; Pro-65 to Leu-73; Tyr-121 to His-129; Lys-147 to Ile-155; Gly-161 to Lys-169; Gly-218 to Trp-226; Asp-230 to Thr-238; Tyr-249 to Ala-257; and Ala-272 to Gly-280.
SP091 Ser-19 to Ser-27; Asn-25 to Thr-33; Val-51 to Gln-59; Asn-75 to Asn-83; Ile-103 to Trp-111; Tyr-113 to Ala-121; Leu-175 to Asn-183; Glu-185 to Trp-193; Ala-203 to Tyr-211; Val-250 to Phe-258; Asn-260 to Thr-268; Ser-278 to Asp-286; Tyr-305 to Leu-313; Asn-316 to Gly-324; Asn-374 to Asp-382; Asn-441 to Gly-449; and Ser-454 to Gln-462.
SP092 Arg-95 to Glu-103; Ala-216 to Val-224; Leu-338 to Glu-346; Pro-350 to Ala-358; Pro-359 to Ala-367; Pro-368 to Ala-376; Pro-377 to Ala-385; Pro-386 to Ala-394; Pro-395 to Ala-403; Pro-350 to Ala-358; Gln-414 to Lys-422; Pro-421 to Asn-429; Trp-465 to Tyr-473; Phe-487 to Tyr-495; Asn-517 to Gly-525; Trp-586 to Tyr-594; Phe-608 to Tyr-616; and Asp-630 to Gly-638.
SP093 Gln-30 to Ile-38; Gln-52 to Val-60; Ala-108 to His-116; Tyr-133 to Glu-141; Tyr-192 to Ala-200; and Phe-207 to Ser-215.
SP094 Ala-87 to Val-95; Leu-110 to Cys-118; Gln-133 to Leu-141; Ser-185 to Leu-193; Ile-195 to Gly-203; Asp-206 to Gln-214; Ser-211 to Gly-219; and Ile-241 to Thr-249.
SP095 Arg-1 to Gln-9; Phe-7 to Asn-15; Thr-21 to Asn-30; Leu-46 to Phe-54; and Ser-72 to Met-80.
SP096 Gly-29 to Ile-37; Glu-52 to Ser-60; and Leu-64 to Gly-72.
SP097 Ala-11 to Thr-19; Glu-53 to Glu-61; Ser-91 to Lys-99; Thr-123 to Gln-131; and Gly-209 to Lys-217.
SP098 Thr-3 to Ser-11; Gly-38 to Phe-46; Tyr-175 to Asn-183; Met-187 to Cys-195; Gln-197 to Leu-205; Tyr-307 to Gln-315; Gly-318 to Tyr-326; Asn-348 to Val-356; Lys-377 to Pro-385; and Leu-415 to Val-423.
SP099 Arg-19 to Gly-27; Asp-76 to Ser-84; Val-90 to Lys-98; Phe-165 to Val-173; and Leu-237 to Pro-245.
SP100 His-111 to Gln-119; Ser-141 to His-149; Asp-154 to Ser-162; Gln-158 to Gln-166; Asp-154 to Gln-166; Lys-180 to Gln-188; and Ser-206 to Gln-214.
SP101 Glu-23 to Glu-31; Glu-40 to Val-48; Gln-50 to Ser-58; Thr-61 to Ile-69; Leu-82 to Ile-90; Ala-108 to Leu-116; Gln-121 to Pro-129; and Leu-130 to Thr-138.
SP102 Asp-32 to His-40; Arg-48 to Lys-56; and Asp-102 to Thr-110.
SP103 Arg-5 to Gln-13; Gln-22 to Leu-30; Arg-151 to Gln-159; Arg-167 to Gln-175; Pro-189 to Glu-197; Gly-207 to Leu-215; Ser-219 to Gln-227; Ser-233 to Ser-241; Pro-255 to Asp-264; Lys-272 to Gly-280; Ser-318 to Val-326; Thr-341 to Asp-351; Asn-356 to Thr-364; Val-370 to Tyr-378; Ile-379 to Gln-387; and Met-435 to Tyr-443.
SP105 Asn-28 to Pro-36; Thr-77 to Phe-85; Arg-88 to Val-96; Gly-107 to Phe-115; Asp-169 to Asp-177; His-248 to Ser-256; and Ser-274 to Ala-282.
SP106 Val-10 to Thr-18; Ile-62 to Tyr-70; Ile-71 to Pro-79; Lys-86 to Gln-94; Lys-100 to Thr-108; Phe-132 to Leu-140; and Asp-145 to Arg-153.
SP107 Asp-33 to Val-41; and Arg-63 to Gln-71.
SP108 Lys-9 to Gln-17; Leu-44 to Ser-52; Ser-63 to Phe-71; Tyr-109 to Ser-117; Ile-183 to Ile-191; Pro-194 to Leu-202; Gly-257 to Gln-265; Ala-323 to Thr-331; and Leu-381 to Tyr-389.
SP109 Asn-2 to Gln-10; Ala-65 to Lys-73; Leu-76 to Glu-84; Thr-111 to Asp-119; Gln-116 to Tyr-124; Tyr-130 to Val-138; Asp-173 to Gly-181; Asp-196 to Ser-204; Asn-231 to Ser-239; Phe-252 to Ser-260; Phe-270 to Tyr-278; Val-291 to His-299; Asp-306 to Leu-314; and Pro-327 to Gly-335.
SP110 Ser-8 to Glu-16; Ile-37 to Val-45; Ala-107 to Val-115; and Gly-122 to Thr-130.
SP111 Asp-19 to Glu-28; Leu-43 to Ala-51; Asn-102 to Phe-110; Gln-133 to Ser-141; Phe-162 to Asp-170; Tyr-194 to Met-202; and Asp-273 to Ser-281.
SP112 Asp-3 to Gln-11; Gly-21 to Ile-29; Ala-46 to Arg-54; Arg-98 to Arg-106; Thr-114 to Val-122; Gln-133 to Asn-141; and Leu-223 to Thr-231.
SP113 Asn-19 to Gly-27; Arg-54 to Ser-62; Val-69 to Gln-77; Ser-117 to Asn-125; Gly-164 to Leu-172; Tyr-193 to Ser-201; Cys-303 to Phe-311; His-315 to Ile-323; Arg-341 to Cys-349; Ile-347 to Ser-355; Arg-403 to Phe-411; Gln-484 to Pro-492; Ser-499 to Leu-507; Ile-541 to Thr-549; Asn-622 to Ile-630; and Glu-645 to Gly-653.
SP114 Gly-17 to Leu-25; His-40 to Gln-48; Arg-49 to Arg-57; Ile-65 to Pro-73; Asn-101 to Asp-111; Gly-128 to Cys-136; Phe-183 to Thr-191; and Pro-268 to Ile-276.
SP115 Met-8 to Ser-16; Tyr-24 to Leu-32; Cys-68 to Leu-76; Ser-100 to Pro-108; Thr-193 to Thr-201; Gly-238 to Pro-250; Thr-280 to Phe-288; Pro-303 to Asn-312; Trp-319 to Leu-328; Leu-335 to Leu-344; Lys-395 to Ala-403; Asn-416 to Gln-424; Tyr-430 to Ser-438; Val-448 to Leu-456; Leu-460 to Thr-468; Pro-502 to Thr-510; Lys-515 to Ile-524; Gln-523 to His-532; Tyr-535 to Thr-543; Ser-559 to Pro-567; Thr-572 to Asn-580; Val-594 to Arg-602; Arg-603 to Asn-611; Thr-620 to Trp-628; and Tyr-644 to Arg-653.
SP117 Ala-6 to Gly-14; Ile-19 to Thr-27; Thr-99 to Leu-107; Ser-117 to Asp-125; His-131 to Val-139; Ile-193 to Gly-201; and Val-241 to Gln-249.
SP118 Ser-8 to Trp-23; His-46 to Ala-54; Asn-93 to Gly-101; Val-100 to Ser-108; Arg-155 to Asp-163; and His-192 to Leu-200.
SP119 Tyr-46 to Lys-54; Ser-93 to Ser-101; Trp-108 to Asn-116; Val-121 to Glu-129; and Tyr-131 to Gln-139.
SP120 Ala-57 to Lys-65; Leu-68 to Glu-76; Thr-103 to Tyr-116; Tyr-122 to Val-130; His-163 to Gly-173; Asp-188 to Ser-196; Ser-222 to Ser-231; Phe-244 to Ser-252; Pro-262 to Tyr-270; Val-283 to His-291; and Asp-298 to Leu-306.
SP121 Ser-3 to Ala-11; Asp-13 to Leu-21; Ser-36 to Val-44; and Gln-136 to Met-144.
SP122 Asn-28 to Lys-36; Glu-39 to Thr-50; Val-54 to Lys-62; Asn-106 to Leu-114; Phe-159 to Gly-167; Asn-172 to Arg-180; Glu-199 to Asn-207; Lys-230 to His-241; Asn-252 to Gly-263; Met-278 to Ala-287; Thr-346 to Asp-354; Lys-362 to Thr-370; Asp-392 to Asn-405; Asp-411 to Ala-424; Gly-434 to Gly-443; Tyr-484 to Glu-492; Ile-511 to Leu-519; Asn-524 to Asp-538; Glu-552 to Ile-567; Val-605 to Lys-613; Phe-697 to Ala-705; Phe-722 to Leu-730; Leu-753 to Leu-761; Asp-787 to Gln-795; Leu-858 to Asn-866; Ala-892 to Thr-901; Gly-903 to Ile-913; Ile-921 to Asn-931; Asn-938 to Pro-951; Gly-960 to Lys-970; Leu-977 to Asp-985; and Leu-988 to Pro-996.
SP123 Val-4 to Asn-12; Glu-47 to Leu-55; Lys-89 to Glu-100; Ser-165 to Thr-173; Lys-234 to Val-242; Ser-258 to Ser-266; Glu-284 to Asn-292; Tyr-327 to Leu-335; Tyr-457 to Thr-465; Tyr-493 to Glu-501; Thr-506 to Tyr-514; Lys-517 to Thr-525; Asn-532 to Gly-540; and Arg-556 to Glu-564.
SP124 Arg-16 to Glu-24; Gln-52 to Arg-60; Asn-69 to Tyr-77; Glu-121 to Asn-129; Ala-134 to Val-142; Thr-151 to Ala-159; Asn-164 to Glu-172; His-181 to His-189; Thr-210 to Ala-218; Ser-244 to Val-252; Phe-287 to Tyr-297; Ser-312 to Thr-323; His-433 to Tyr-441; Ser-445 to Asn-453; Asn-469 to Thr-477; Asn-501 to Asn-509; Gln-536 to Ala-547; and Gln-608 to Asp-621.
SP125 Ser-9 to Asp-21; Ala-28 to Leu-36; Asn-49 to Phe-57; Val-137 to Arg-145; Asn-155 to Leu-163; Glu-183 to Asp-191; Gly-202 to Tyr-210; Pro-221 to Asp-229; Phe-263 to Ala-271; Phe-300 to Gln-308; Asp-313 to Glu-321; Asn-324 to Asp-332; Ile-346 to Asn-354; Asp-362 to Lys-370; Met-402 to Gly-410; Gly-437 to Gly-445; Ser-471 to Glu-483; Gly-529 to Asp-537; Gln-555 to Val-563; and Leu-579 to Lys-587.
SP126 Leu-22 to Thr-30; Val-65 to Leu-73; and Thr-75 to Asp-83.
SP127 Glu-2 to Ala-12; Asp-28 to Thr-36; Val-105 to Thr-113; Lys-121 to Thr-129; Trp-138 to Pro-146; Ser-152 to Ile-160; Lys-180 to Asp-188; Leu-194 to Asn-202; and Gly-228 to Thr-236.
______________________________________
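Each Table 2 entry names an epitope by its first and last residue, numbered from position 1 of the corresponding amino acid listing. The following is a minimal sketch, under that reading, of how an entry such as "Leu-22 to Thr-30" for SP126 could be mapped back onto SEQ ID NO:224. The function `epitope`, the lookup table, and the regular expression are illustrative assumptions and not part of the disclosure; the boundary check simply rejects a range whose named residues do not match the sequence.

```python
import re

# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

RANGE_RE = re.compile(r"([A-Z][a-z]{2})-(\d+) to ([A-Z][a-z]{2})-(\d+)")


def epitope(sequence: str, span: str) -> str:
    """Return the residues covered by a Table 2 style range.

    Positions are 1-based and inclusive. A ValueError is raised when the
    range is malformed, out of bounds, or its boundary residues do not
    match the sequence.
    """
    match = RANGE_RE.fullmatch(span.strip())
    if match is None:
        raise ValueError(f"unrecognized range: {span!r}")
    first, last = match.group(1), match.group(3)
    start, end = int(match.group(2)), int(match.group(4))
    if first not in THREE_TO_ONE or last not in THREE_TO_ONE:
        raise ValueError(f"unknown residue name in {span!r}")
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"range out of bounds: {span!r}")
    if (sequence[start - 1] != THREE_TO_ONE[first]
            or sequence[end - 1] != THREE_TO_ONE[last]):
        raise ValueError(f"boundary residues do not match: {span!r}")
    return sequence[start - 1:end]


# First 32 residues of the SP126 amino acid listing (SEQ ID NO:224).
SP126_START = "KTDERSKVFDFSIPYYTAKNKLIVKKSDLTTY"

print(epitope(SP126_START, "Leu-22 to Thr-30"))
```

Because the boundary residues are checked, the same routine can flag table entries whose residue names or positions were garbled in transcription.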

TABLE 3
__________________________________________________________________________
S. pneumoniae ORF Cloning Primers
Primer Name  SEQ ID  Sequence  RE
__________________________________________________________________________
SP001A  NO:227  GACTGGATCCTAAAATCTACGACAATAAAAATC  Bam HI
SP001B  NO:228  CTGAGTCGACTGGTTGTGCTGGTTGAG  Sal I
SP004A  NO:229  GTCAGGATCCAAATTACAATACGGACTATG  Bam HI
SP004B  NO:230  CAGTGTCGACTAACTCTAGGTCGGAAAC  Sal I
SP006A  NO:231  GACTGGATCCTGAGAATCAAGCTACACCCAAAGAG  Bam HI
SP006B  NO:232  AGTCAAGCTTTTGTAACTGAGATTGATCTGG  Hind III
SP007A  NO:233  GACTGGATCCTGGTAACCGCTCTTCTCGTAACGCAGC  Bam HI
SP007B  NO:234  AGTCAAGCTTTTTCAGGAACTTTTACGCTTCC  Hind III
SP008A  NO:235  AGTCAGATCTTGTGGAAATTTGACAGGTAACAGCAAAAAAGCTGC  Bgl II
SP008B  NO:236  ACTGAAGCTTTTTTGTTTTTCAAGAATTCATCG  Hind III
SP009A  NO:237  GACTGGATCCTGGTCAAGGAACTGCTTCTAAAGAC  Bam HI
SP009B  NO:238  AGTCAAGCTTTCACAAATTCGTTGGTGAAGCC  Hind III
SP010A  NO:239  GACTGGATCCTAGCTCAGGTGGAAACGCTGGTTCATCC  Bam HI
SP010B  NO:240  AGTCAAGCTTATCAACTTTTCCACCTTCAACAACC  Hind III
SP011A  NO:241  GTCAAGATCTCTCCAACTATGGTAAATCTGCGGATGG  Bgl II
SP011B  NO:242  AGTCCTGCAGATCCACATCCGCTTTCATCGGGTTAAAGAAGG  Pst I
SP012A  NO:243  GACTGGATCCTGGGAAAAATTCTAGCGAAACTAGTGG  Bam HI
SP012B  NO:244  GTCACTGCAGCTGTCCTTCTTTTACTTCTTTGGTTGC  Pst I
SP013A  NO:245  GACTGGATCCTGCTAGCGGAAAAAAAGATACAACTTCTGG  Bam HI
SP013B  NO:246  CTGAAAGCTTTTTTGCCAATCCTTCAGCAATCTTGTC  Hind III
SP014A  NO:247  GACTAGATCTTGGCTCAAAAAATACAGCTTCAAGTCC  Bgl II
SP014B  NO:248  AGTCCTGCAGGTTTTTGTTTGCTTGGTATTGGTCG  Pst I
SP015A  NO:249  GACTGGATCCTAGTACAAACTCAAGCACTAGTCAGACAGAG  Bam HI
SP015B  NO:250  CAGTCTGCAGTTTCAAAGCTTTTTGTATGTCTTC  Pst I
SP016A  NO:251  GACTGGATCCTGGCAATTCTGGCGGAAGTAAAGATGC  Bam HI
SP016B  NO:252  AGTCAAGCTTGTTTCATAGCTTTTTTGATTGTTTCG  Hind III
SP017A  NO:253  GACTGGATCCTTCACAAGAAAAAACAAAAAATGAAGATGG  Bam HI
SP017B  NO:254  AGTCAAGCTTATCGACGTAGTCTCCGCCTTC  Hind III
SP019A  NO:255  GACTGGATCCGAAAGGTCTGTGGTCAAATAATCTTACC  Bam HI
SP019B  NO:256  AGTCAAGCTTAGAGTTAACATGGTGCTTGCCAATAGG  Hind III
SP020A  NO:257  GACTGGATCCAAACTCAGAAAAGAAAGCAGACAATGC  Bam HI
SP020B  NO:258  AGTCAAGCTTCCAAACTGGTTGATCCAAACCATCTG  Hind III
SP021A  NO:259  GACTGGATCCTTCGAAAGGGTCAGAAGGTGCAGACC  Bam HI
SP021B  NO:260  AGTCAAGCTTCTGTAGGCTTGGTGTGCCCCAGTTGC  Hind III
SP022A  NO:261  CTGAGGATCCGGGGATGGCAGCTTTTAAAAATC  Bam HI
SP022B  NO:262  CAGTAAGCTTGTTTACCCATTCACCATTACC  Hind III
SP023A  NO:263  CAGTGGATCCAGACGAGCAAAAAATTAAG  Bam HI
SP023B  NO:264  TCAGAAGCTTGTTTACCCATTCACCATT  Hind III
SP025A  NO:265  GACTGGATCCCTGTGGTGAGGAAGAAACTAAAAAG  Bam HI
SP025B  NO:266  CTGAGTCGACAATATTCTGTAGGAATGCTTCGAATTTG  Sal I
SP028A  NO:267  CTGAGGATCCGACTTTTAACAATAAAACTATTGAAGAG  Bam HI
SP028B  NO:268  GTCACTGCAGGTTGTCACCTCCAAAAATCACGG  Pst I
SP030A  NO:269  GACTGGATCCCTTTACAGGTAAACAACTACAAGTCGG  Bam HI
SP030B  NO:270  CAGTAAGCTTTTCGAAGTTTGGCTCAGAATTG  Hind III
SP031A  NO:271  GACTGGATCCCCAGGCTGATACAAGTATCGCA  Bam HI
SP031B  NO:272  CAGTAAGCTTATCTGCAGTATGGCTAGATGG  Hind III
SP032A  NO:273  GACTGGATCCGTCTGTATCATTTGAAAACAAAGAAAC  Bam HI
SP032B  NO:274  CAGTCTGCAGTTTTACTGTTGCTGTGCTTGTG  Pst I
SP033A  NO:275  ACTGAGATCTTGGTCAAAAGGAAAGTCAGACAGGAAAGG  Bgl II
SP033B  NO:276  CAGTAAGCTTATTCCTGAGCTTTTTTGATAAAGGTTGCGCA  Hind III
SP034A  NO:277  ACTGGGATCCGAAGGATAGATATATTTTAGCATTTGAGAC  Bam HI
SP034B  NO:278  AGTCAAGCTTCCATGGTATCAAAGGCAAGACTTGG  Hind III
SP035A  NO:279  GTCAGGATCCGGTAGTTAAAGTTGGTATTAACGG  Bam HI
SP035B  NO:280  AGTCAAGCTTGCAATTTTTGCGAAGTATTCCAAGAG  Hind III
SP036A  NO:281  AGTCGGATCCTTCTTACGAGTTGGGACTGTATCAAGC  Bam HI
SP036B  NO:282  AGTCAAGCTTGTTTATTTTTTCCTTACTTACAGATGAAGG  Hind III
SP038A  NO:283  AGTCGGATCCTACTGAGATGCATCATAATCTAGGAGC  Bam HI
SP038B  NO:284  TCAGCTCGAGTTCTTTGACATCTCCATCATAAGTCGC  Xho I
SP039A  NO:285  GACTGGATCCGGTTTTGAGAAAGTATTTGCAGGGG  Bam HI
SP039B  NO:286  CAGTAAGCTTGGATTTTTTCATGGATGCAATTTTTTTGG  Hind III
SP040A  NO:287  GACTGGATCCGACAACATTTACTATCCATACAGTAGAGTCAGC  Bam HI
SP040B  NO:288  GACTAAGCTTGGCATAAGGTTGCAATTCTGGATTAATTGG  Hind III
SP041A  NO:289  GACTGGATCCGGCTAAGGAAAGAGTGGATG  Bam HI

SP041B  NO:290  GACTAAGCTTTTCATTTTTAAATTGACTATGCGCCCG  Hind III
SP042A  NO:291  GACTGGATCCTTGTTCCTATGAACTTGGTCGTCACC  Bam HI
SP042B  NO:292  CATGAAGCTTATCCTGGATTTTTCCAAGTAAATCT  Hind III
SP043A  NO:293  GACTGGATCCTTATAAGGGTGAATTAGAAAAAGG  Bam HI
SP043B  NO:294  GACTAAGCTTCTTATTAGGATTGTTAGTAGTTG  Hind III
SP044A  NO:295  GACTGGATCCGAATGTTCAGGCTCAAGAAAGTTCAGG  Bam HI
SP044B  NO:296  GACTAAGCTTTTCCCCTGATGGAGCAAAGTAATACC  Hind III
SP045A  NO:297  GACTGGATCCCTTGGGTGTAACCCATATCCAGCTCCTTCC  Bam HI
SP045B  NO:298  GACTGTCGACTTCAGCTTGTTTATCTGGGGTTGC  Sal I
SP046A  NO:299  GACTGGATCCTAGTGATGGTACTTGGCAAGGAAAACAG  Bam HI
SP046B  NO:300  ACTGCTGCAGATCTTTGCCACCTAGCTTCTCATTG  Pst I
SP048A  NO:301  GTCAGGATCCTGGGATTCAATATGTCAGAGATGATACTAG  Bam HI
SP048B  NO:302  CTAGAAGCTTACGCACCCATTCACCATTATCATTG  Hind III
SP049A  NO:303  GTCAGGATCCGGATAATAGAGAAGCATTAAAAACC  Bam HI
SP049B  NO:304  AGTCAAGCTTGACAAAATCTTGAAACTCCTCTGGTC  Hind III
SP050A  NO:305  GTCAGGATCCAGATTTTGTCGAGGAGTGTCATACC  Bam HI
SP050B  NO:306  AGTCAAGCTTTCCCTTTTTACCCTTACGAATCCAGG  Hind III
SP051A  NO:307  GACTGGATCCATCTGTAGTTTATGCGGATGAAACACTTATTAC  Bam HI
SP051B  NO:308  GACTGTCGACGCTTTGGTAGAGATAGAAGTCATG  Sal I
SP052A  NO:309  GACTGGATCCTTACTTTGGTATCGTAGATACAGCCGGC  Bam HI
SP052B  NO:310  AGTCAAGCTTTGTTAATTGCGTACCTTCTAAGCGACC  Hind III
SP053A  NO:311  GACTGGATCCAGCTAAGGTTGCATGGGATGCGATTCG  Bam HI
SP053B  NO:312  GACTGTCGACCTGGGCTTTATTAGTTTGACTAGC  Sal I
SP054A  NO:313  CAGTGGATCCCTATCACTATGTAAATAAAGAGA  Bam HI
SP054B  NO:314  ACTGAAGCTTTTCTGTCCCTGTTTGAGGCA  Hind III
SP055A  NO:315  CAGTGGATCCTGAGACTCCTCAATCAATAACAAA  Bam HI
SP055B  NO:316  ACGTAAGCTTATAATCAGTAGGAGAAACTGAACT  Hind III
SP056A  NO:317  CAGTGGATCCGGATGCTCAAGAAACTGCGG  Bam HI
SP056B  NO:318  GACTAAGCTTTTGCCTCTCATTCTTGCTTCC  Hind III
SP057A  NO:319  CAGTGGATCCCGACAAAGGTGAGACTGAG  Bam HI
SP057B  NO:320  ACGTAAGCTTATTTCTTAATTCAAGTGTTTTCTCTG  Hind III
SP058A  NO:321  GACTGGATCCAAATCAATTGGTAGCACAAGATCC  Bam HI
SP058B  NO:322  CAGTGTCGACATTAGGAGCCACTGGTCTC  Sal I
SP059A  NO:323  CAGTGGATCCCAAACAGTCAGCTTCAGGAAC  Bam HI
SP059B  NO:324  GACTCTGCAGTTTAATCTTGTCCCAGGTGG  Pst I
SP060A  NO:325  GACTGGATCCATTCGATGATGCGGATGAAAAG  Bam HI
SP060B  NO:326  GACTAAGCTTCATTTGTCTTTGGGTATTTCGCA  Hind III
SP062A  NO:327  CAGTGGATCCGGAGAGTCGATCAAAAGTAG  Bam HI
SP062B  NO:328  GTCACTGCAGTTGCTCGTCTCGAGGTTC  Pst I
SP063A  NO:329  CAGTGGATCCATGGACAACAGGAAACTGGGAC  Bam HI
SP063B  NO:330  CAGTAAGCTTATTAGCTTCTGTACCTGTGTTTG  Hind III
SP064A  NO:331  GACTGGATCCCGATGGGCTCAATCCAACCCCAGGTCAAGTC  Bam HI
SP064B  NO:332  GACTCTGCAGCATAGCTTTATCCTCTGACATCATCGTATC  Pst I
SP065A  NO:333  GACTGGATCCTTCCAATCAAAAACAGGCAGATGG  Bam HI
SP065B  NO:334  GACTAAGCTTGAGTCCCATAGTCCAAGGCA  Hind III
SP067A  NO:335  AGTCGGATCCTATCACAGGATCGAACGGTAAGACAACC  Bam HI
SP067B  NO:336  ACTGGTCGACTTCTTTTAACTCCGCTACTGTGTC  Sal I
SP068A  NO:337  CAGTGGATCCAAGTTCATCGAAGATGGTTGGGAAGTCC  Bam HI
SP068B  NO:338  GATCGTCGACCCGCTCCCACATGCTCAACCTT  Sal I
SP069A  NO:339  TGACGGATCCATCGCTAGCTAGTGAAATGCAAGAAAG  Bam HI
SP069B  NO:340  TGACAAGCTTATTCGTTTTTGAACTAGTTGCTTTCGT  Hind III
SP070A  NO:341  GACTGGATCCGCACCAGATGGGGCACAAGGTTCAGGG  Bam HI
SP070B  NO:342  TGACAAGCTTAACTTGTAACGAACAGTTCAATCTG  Hind III
SP071A  NO:343  GACTAGATCTTTTTAACCCAACTGTTGGTACTTTCC  Bgl II
SP071B  NO:344  TGACAAGCTTGTTAGGTGTTACATTTTGACCGTC  Hind III
SP072A  NO:345  ACTGAGATCTTTTTAACCCAACTGTTGGTACTTTC  Bgl II
SP072B  NO:346  GACTAAGCTTTCTACGATAACGATCATTTTCTTTACC  Hind III
SP073A  NO:347  GACTGTCGACTCGTAGATATTTAAGTCTAAGTGAAGCG  Sal I
SP073B  NO:348  AGTCAAGCTTGTTAGGTGTTACATTTTGCAAGTC  Hind III
SP074A  NO:349  GACTGGATCCCTTTGGTTTTGAAGGAAGTAAG  Bam HI
SP074B  NO:350  TGACCTGCAGACGATTTTTGAAAAATGGAGGTGTATC  Pst I
SP075A  NO:351  CAGTGGATCCCTACTACCTCTCGAGAGAAAG  Bam HI
SP075B  NO:352  ACTGAAGCTTTTCGCTTTTTACTCGTTTGACA  Hind III
SP076A  NO:353  CAGTGGATCCTAAGGTCAAAAGTCAGACCGCTAAGAAAGTGC  Bam HI

SP076B NO:354 CAGTAAGCTTTAGGGTATCCAAATACTGGTTGTTGATG Hind III
SP077A NO:355 TGACAGATCTTGACGGGTCTCAGGATCAGACTCAGG Bgl II
SP077B NO:356 TGACAAGCTTCAAAGACATCCACCTCTTGACCTTTG Hind III
SP078A NO:357 GACTGGATCCTAGAGGCTTTGCCAAATGGTGGGAAGGG Bam HI
SP078B NO:358 GTCAGTCGACTTGTTGTAACACTTTTCGAGGTTTGGTACC Sal I
SP079A NO:359 CAGTGGATCCTCAAAAAGAGAAGGAAAACTTGG Bam HI
SP079B NO:360 CAGTCTGCAGTTTCTTCAACAAACCTTGTTCTTG Pst I
SP080A NO:361 CAGTGGATCCACGTTCTATTGAGGACCACTT Bam HI
SP080B NO:362 CAGTAAGCTTTTCCTTCTCAGTCAATTCTTTTCC Hind III
SP081A NO:363 GACTGGATCCCGCTCAAAATACCAGAGGTGTTCAG Bam HI
SP081B NO:364 GACTAAGCTTAGTACCATGGGTGTGACAGGTTTGAA Hind III
SP082A NO:365 CTGAGGATCCAATTGTACAATTAGAAAAAGATAGC Bam HI
SP082B NO:366 TGACAAGCTTGCGTTGACTAGGTTCTGCAATGCC Hind III
SP083A NO:367 GACTGGATCCTCTGACCAAGCAAAAAGAAGCAGTCAATGA Bam HI
SP083B NO:368 TCAGCAGCTGATCATTGACTTTACGATTTGCTCC Bgl II
SP084A NO:369 GACTGGATCCGTCCGGCTCTGTCCAGTCCACTTTTTCAGCG Bam HI
SP084B NO:370 TCAGAAGCTTATTTTTTGTTTCCTTAATGCGTT Hind III
SP085A NO:371 GACTGGATCCGGGACAAATTCAAAAAAATAGGCAAGAGG Bam HI
SP085B NO:372 GTCAAAGCTTTGGCTCTTTGATTGCCAACAACTG Hind III
SP086A NO:373 GACTGGATCCTCGCTACCAGCAACAAAGCGAGCAAAAGG Bam HI
SP086B NO:374 GACTAAGCTTACTTTTTTCTTTTTCCACACGA Hind III
SP087A NO:375 CAGTGGATCCGAACCGACAAGTCGCCCACTATCAAGACT Bam HI
SP087B NO:376 CTGAAAGCTTTGAATTCTCTTTCTTTTCAGGCT Hind III
SP088A NO:377 TCGAGGATCCGGTTGTCGGCTGGCAATATATCCCGT Bam HI
SP088B NO:378 CAGTAAGCTTCCGAACCCATTCGCCATTATAGTTGAC Hind III
SP089A NO:379 AGTCGGATCCGGCCAAATCAGAATGGGTAGAAGAC Bam HI
SP089B NO:380 TGACCTGCAGCTTCTCATTGATTTTCATCATCAC Pst I
SP090A NO:381 GACTGGATCCATTTGCAGATGATTCTGAAGGATGG Bam HI
SP090B NO:382 TCAGCTGCAGCTTAACCCATTCACCATTCTAGTTTAAG Pst I
SP091A NO:383 GACTGGATCCTGTCGCTGCAAATGAAACTGAAGTAGC Bam HI
SP091B NO:384 GACTAAGCTTATACCAAACGCTGACATCTACGCG Hind III
SP092A NO:385 AGTCAGATCTTACGTCTCAGCCTACTTTTGTAAGAGC Bgl II
SP092B NO:386 GACTAAGCTTAACCCATTCACCATTGGCATTGAC Hind III
SP093A NO:387 CAGTGGATCCTGGACAGGTGAAAGGTCATGCTACATTTGTG Bam HI
SP093B NO:388 GACTAAGCTTCAACCATTGAGACCTTGCAACAC Hind III
SP094A NO:389 GTCAGGATCCGATTGCTCCTTTGAAGGATTTGAGAGAAACC Bam HI
SP094B NO:390 GACTAAGCTTCGATCAAAGATAAGATAAATATATATAAAGT Hind III
SP095A NO:391 GACTGGATCCTAGGTCATATGGGACTTTTTTTCTACAACAAAATAGG Bam HI
SP095B NO:392 TGACAAGCTTATCTATCAGCTCATTTAATCGTTTTTG Hind III
SP096A NO:393 CTGAGGATCCCAACGTTGAGAATTATTTGCGAATG Bam HI
SP096B NO:394 TGACAAGCTTGAGTCTACAAAAGTAATGTAC Hind III
SP097A NO:395 GTCAGGATCCCTACTATCAATCAAGTTCTTCAGCC Bam HI
SP097B NO:396 TGACAAGCTTGACTGAGGCTTGGACCAGATTGAAAAG Hind III
SP098A NO:397 GACTGGATCCGACAAAAACATTAAAACGTCCTGAGG Bam HI
SP098B NO:398 GACTAAGCTTAGCACGAACTGTGACGCTGGTTCC Hind III
SP099A NO:399 GACTGGATCCTTCTCAGGAGACCTTTAAAAATATC Bam HI
SP099B NO:400 GACTAAGCTTGTTGGCCATCTTGTACATACC Hind III
SP100A NO:401 GACTGGATCCAGTAAATGCGCAATCAAATTC Bam HI
SP100B NO:402 AGTCCTGCAGGTATTTAGCCCAATAATCTATAAAGCT Pst I
SP101A NO:403 CAGTGGATCCTTACCGCGTTCATCAAGATGTC Bam HI
SP101B NO:404 GACTAAGCTTGCCAGATGTTGAAAAGAGAGTG Hind III
SP102A NO:405 GACTGGATCCGTGGATGGGCTTTAACTATCTTCGTATTCG Bam HI
SP102B NO:406 AGTCAAGCTTGCTAGTCTTCACTTTCCCTTTCC Hind III
SP103A NO:407 GACTGTCGACACTAAACCAGCATCGTTCGCAGGA Sal I
SP103B NO:408 CTGACTGCAGCTTCTTGAAGAAATAATGATTGTGG Pst I
SP105A NO:409 CAGTGGATCCTGACTACCTTGAAATCCCACTT Bam HI
SP105B NO:410 CAGTAAGCTTTTTTTTAAGGTTGTAGAATGATTTCAATC Hind III
SP106A NO:411 CAGTGTCGACTCGTATCTTTTTTTGGAGCAATGTT Sal I
SP106B NO:412 GACTAAGCTTAAATGTTCCGATACGGGTGATTG Hind III
SP107A NO:413 CAGTGGATCCGGACTCTCTCAAAGATGTGAAAG Bam HI
SP107B NO:414 GACTAAGCTTCTTGAGTTTGTCAAGGATTGCTTT Hind III
SP108A NO:415 CAGTGGATCCCAAGAAATCCTATCATCTCTTCCAGAAG Bam HI
SP108B NO:416 GACTAAGCTTTTCAGAACTAAAAGCCGCAGCTT Hind III
SP109A NO:417 GACTGGATCCACGAAATGCAGGGCAGACAG Bam HI
SP109B NO:418 CAGTAAGCTTATCAACATAATCTAGTAAATAAGCGT Hind III
SP110A NO:419 CAGTGGATCCTGTATAGTTTTTAGCGCTTGTTCTTC Bam HI
SP110B NO:420 GTCAAAGCTTTGATAGAGTGTCATAATCTTCTTTAG Hind III
SP111A NO:421 GACTGGATCCGTGTGTCGAGCATATTCTGAAG Bam HI
SP111B NO:422 CAGTAAGCTTACTTTTACCATTTCTTTGTTCTGCATC Hind III
SP112A NO:423 GACTGTCGACGTGTTTGGATAGCATTCAGAATCAGACG Sal I
SP112B NO:424 CAGTAAGCTTCGGAAGTAAAGACAATTTTTCC Hind III
SP113A NO:425 CAGTGGATCCGTGCCTAGATAGTATTATTACTCAAAC Bam HI
SP113B NO:426 GACTAAGCTTTTTGCTTATTTCTCTCAATTTTTC Hind III
SP114A NO:427 CAGTGGATCCCATTCAGAAGCAGACCTATCAAAATC Bam HI
SP114B NO:428 ACTGAAGCTTATGTAATTTTTTAGATTTTTCAATATTTTTCAG Hind III
SP115A NO:429 AGTCGGATCCTAAGGCTGATAATCGTGTTCAAATG Bam HI
SP115B NO:430 GACTAAGCTTAAAATTAGATAGACGTTGAGT Hind III
SP117A NO:431 AGTCGGATCCCTGTGGCAATCAGTCAGCTGCTTCC Bam HI
SP117B NO:432 GACTGTCGACTTTAATCTTGTCCCAGGTGGTTAATTTGCC Sal I
SP118A NO:433 ACTGGTCGACTTGTCAACAACAACATGCTACTTCTGAG Sal I
SP118B NO:434 GACTCTGCAGAAGTTTAACCCACTTATCATTATCC Pst I
SP119A NO:435 ACTGGGATCCTTGTTCAGGCAAGTCCGTGACTAGTGAAC Bam HI
SP119B NO:436 GACTAAGCTTGGCTAATTCCTTCAAAGTTTGCA Hind III
SP120A NO:437 AGTCGGATCCCTCGCAAATTGAAAAGGCGGCAGTTAGCC Bam HI
SP120B NO:438 GACTAAGCTTGTAAATAAGCGTACCTTTTTCTTCC Hind III
SP121A NO:439 TCAGGGATCCTTGTCAGTCAGGTTCTAATGGTTCTCAG Bam HI
SP121B NO:440 AGTCAAGCTTGGCATTGGCGTCGCCGTCCTTC Hind III
SP122A NO:441 GACTGGATCCGGAAACTTCACAGGATTTTAAAGAGAAG Bam HI
SP122B NO:442 GACTGTCGACAATCAATCCTTCTTCTGCACTTCT Sal I
SP123A NO:443 CAGTGGATCCTGTGGTCGAAGTTGAGACTCCTCAATC Bam HI
SP123B NO:444 GACTAAGCTTTTCTTCAAATTTATTATCAGC Hind III
SP124A NO:445 AGTCGGATCCAACACCTGTATATAAAGTTACAGCAATCG Bam HI
SP124B NO:446 GACTGTCGACTACTTGACCGAATGCGTCGAATGTACG Sal I
SP125A NO:447 CTGAGGATCCATTAGACAGATTAATTGAAATCGG Bam HI
SP125B NO:448 GACTGTCGACTTTAAAGATTGAAGTTTTAAAGCT Sal I
SP126A NO:449 TGACGGATCCTAAGACAGATGAACGGAGCAAGGTG Bam HI
SP126B NO:450 CTGAAAGCTTTAAGGCTTCCTCAATGAGTTTGTCT Hind III
SP127A NO:451 GACTGGATCCCTGTGAGAATCAAGCTACACCCA Bam HI
SP127B NO:452 CTGAAAGCTTTTGTAACTGAGATTGATCTGGGAG Hind III
__________________________________________________________________________
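
Each primer in the table above is listed with the restriction enzyme whose site it is designed to carry. As an illustrative aid only, the following Python sketch shows one way a reader might check that a primer, as printed here, contains the recognition sequence of its listed enzyme. The recognition sequences are the standard ones for these enzymes; the names RECOGNITION_SITES, find_site and EXAMPLE_PRIMERS are hypothetical and form no part of the patent disclosure.

```python
# Illustrative sketch only; not part of the patent disclosure.
# Standard recognition sequences (5'->3') for the enzymes named in the table.
RECOGNITION_SITES = {
    "Bam HI": "GGATCC",
    "Hind III": "AAGCTT",
    "Sal I": "GTCGAC",
    "Pst I": "CTGCAG",
    "Bgl II": "AGATCT",
}


def find_site(primer, enzyme):
    """Return the 0-based index of the enzyme's site in primer, or None if absent."""
    index = primer.upper().find(RECOGNITION_SITES[enzyme])
    return index if index >= 0 else None


# A few rows copied from the table above: (primer name, sequence, listed enzyme).
EXAMPLE_PRIMERS = [
    ("SP041B", "GACTAAGCTTTTCATTTTTAAATTGACTATGCGCCCG", "Hind III"),
    ("SP042A", "GACTGGATCCTTGTTCCTATGAACTTGGTCGTCACC", "Bam HI"),
    ("SP045B", "GACTGTCGACTTCAGCTTGTTTATCTGGGGTTGC", "Sal I"),
    ("SP046B", "ACTGCTGCAGATCTTTGCCACCTAGCTTCTCATTG", "Pst I"),
    ("SP071A", "GACTAGATCTTTTTAACCCAACTGTTGGTACTTTCC", "Bgl II"),
    ("SP083B", "TCAGCAGCTGATCATTGACTTTACGATTTGCTCC", "Bgl II"),
]

for name, sequence, enzyme in EXAMPLE_PRIMERS:
    position = find_site(sequence, enzyme)
    if position is None:
        print(f"{name}: no {enzyme} site found in the sequence as printed")
    else:
        print(f"{name}: {enzyme} site at index {position}")
```

In this sketch the site is found at index 4 in the first five example primers, that is, directly after a four-base 5' extension. For SP083B the text as printed here contains no AGATCT; whether that reflects the original filing or a transcription issue cannot be determined from this text.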

__________________________________________________________________________ # SEQUENCE LISTING - (1) GENERAL INFORMATION: - (iii) NUMBER OF SEQUENCES: 452 - (2) INFORMATION FOR SEQ ID NO: 1: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1999 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #1: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TAAAATCTAC GACAATAAAA ATCAACTCAT TGCTGACTTG GGTTCTGAAC GC - #CGCGTCAA 60 - TGCCCAAGCT AATGATATTC CCACAGATTT GGTTAAGGCA ATCGTTTCTA TC - #GAAGACCA 120 - TCGCTTCTTC GACCACAGGG GGATTGATAC CATCCGTATC CTGGGAGCTT TC - #TTGCGCAA 180 - TCTGCAAAGC AATTCCCTCC AAGGTGGATC AACTCTCACC CAACAGTTGA TT - #AAGTTGAC 240 - TTACTTTTCA ACTTCGACTT CCGACCAGAC TATTTCTCGT AAGGCTCAGG AA - #GCTTGGTT 300 - AGCGATTCAG TTAGAACAAA AAGCAACCAA GCAAGAAATC TTGACCTACT AT - #ATAAATAA 360 - GGTCTACATG TCTAATGGGA ACTATGGAAT GCAGACAGCA GCTCAAAACT AC - #TATGGTAA 420 - AGACCTCAAT AATTTAAGTT TACCTCAGTT AGCCTTGCTG GCTGGAATGC CT - #CAGGCACC 480 - AAACCAATAT GACCCCTATT CACATCCAGA AGCAGCCCAA GACCGCCGAA AC - #TTGGTCTT 540 - ATCTGAAATG AAAAATCAAG GCTACATCTC TGCTGAACAG TATGAGAAAG CA - #GTCAATAC 600 - ACCAATTACT GATGGACTAC AAAGTCTCAA ATCAGCAAGT AATTACCCTG CT - #TACATGGA 660 - TAATTACCTC AAGGAAGTCA TCAATCAAGT TGAAGAAGAA ACAGGCTATA AC - #CTACTCAC 720 - AACTGGGATG GATGTCTACA CAAATGTAGA CCAAGAAGCT CAAAAACATC TG - #TGGGATAT 780 - TTACAATACA GACGAATACG TTGCCTATCC AGACGATGAA TTGCAAGTCG CT - #TCTACCAT 840 - TGTTGATGTT TCTAACGGTA AAGTCATTGC CCAGCTAGGA GCACGCCATC AG - #TCAAGTAA 900 - TGTTTCCTTC GGAATTAACC AAGCAGTAGA AACAAACCGC GACTGGGGAT CA - #ACTATGAA 960 - ACCGATCACA GACTATGCTC CTGCCTTGGA GTACGGTGTC TACGATTCAA CT - #GCTACTAT 1020 - CGTTCACGAT GAGCCCTATA ACTACCCTGG GACAAATACT CCTGTTTATA AC - #TGGGATAG 1080 - GGGCTACTTT GGCAACATCA CCTTGCAATA CGCCCTGCAA CAATCGCGAA AC - #GTCCCAGC 1140 - CGTGGAAACT CTAAACAAGG TCGGACTCAA CCGCGCCAAG ACTTTCCTAA AT - #GGTCTAGG 1200 - AATCGACTAC CCAAGTATTC ACTACTCAAA TGCCATTTCA AGTAACACAA CC - #GAATCAGA 1260 - CAAAAAATAT GGAGCAAGTA GTGAAAAGAT 
GGCTGCTGCT TACGCTGCCT TT - #GCAAATGG 1320 - TGGAACTTAC TATAAACCAA TGTATATCCA TAAAGTCGTC TTTAGTGATG GG - #AGTGAAAA 1380 - AGAGTTCTCT AATGTCGGAA CTCGTGCCAT GAAGGAAACG ACAGCCTATA TG - #ATGACCGA 1440 - CATGATGAAA ACAGTCTTGA CTTATGGAAC TGGACGAAAT GCCTATCTTG CT - #TGGCTCCC 1500 - TCAGGCTGGT AAAACAGGAA CCTCTAACTA TACAGACGAG GAAATTGAAA AC - #CACATCAA 1560 - GACCTCTCAA TTTGTAGCAC CTGATGAACT ATTTGCTGGC TATACGCGTA AA - #TATTCAAT 1620 - GGCTGTATGG ACAGGCTATT CTAACCGTCT GACACCACTT GTAGGCAATG GC - #CTTACGGT 1680 - CGCTGCCAAA GTTTACCGCT CTATGATGAC CTACCTGTCT GAAGGAAGCA AT - #CCAGAAGA 1740 - TTGGAATATA CCAGAGGGGC TCTACAGAAA TGGAGAATTC GTATTTAAAA AT - #GGTGCTCG 1800 - TTCTACGTGG AACTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT CA - #AGCTCATC 1860 - ATCAGATAGT TCAACTTCAC AGTCTAGCTC AACCACTCCA AGCACAAATA AT - #AGTACGAC 1920 - TACCAATCCT AACAATAATA CGCAACAATC AAATACAACC CCTGATCAAC AA - #AATCAGAA 1980 # 199 - #9 - (2) INFORMATION FOR SEQ ID NO:2: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 666 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: - Lys Ile Tyr Asp Asn Lys Asn Gln Leu Ile Al - #a Asp Leu Gly Ser Glu # 15 - Arg Arg Val Asn Ala Gln Ala Asn Asp Ile Pr - #o Thr Asp Leu Val Lys # 30 - Ala Ile Val Ser Ile Glu Asp His Arg Phe Ph - #e Asp His Arg Gly Ile # 45 - Asp Thr Ile Arg Ile Leu Gly Ala Phe Leu Ar - #g Asn Leu Gln Ser Asn # 60 - Ser Leu Gln Gly Gly Ser Thr Leu Thr Gln Gl - #n Leu Ile Lys Leu Thr #80 - Tyr Phe Ser Thr Ser Thr Ser Asp Gln Thr Il - #e Ser Arg Lys Ala Gln # 95 - Glu Ala Trp Leu Ala Ile Gln Leu Glu Gln Ly - #s Ala Thr Lys Gln Glu # 110 - Ile Leu Thr Tyr Tyr Ile Asn Lys Val Tyr Me - #t Ser Asn Gly Asn Tyr # 125 - Gly Met Gln Thr Ala Ala Gln Asn Tyr Tyr Gl - #y Lys Asp Leu Asn Asn # 140 - Leu Ser Leu Pro Gln Leu Ala Leu Leu Ala Gl - #y Met Pro Gln Ala Pro 145 1 - #50 1 - #55 1 - #60 - Asn Gln Tyr Asp Pro Tyr Ser His Pro Glu Al - #a Ala Gln Asp Arg Arg # 175 - Asn Leu Val 
Leu Ser Glu Met Lys Asn Gln Gl - #y Tyr Ile Ser Ala Glu # 190 - Gln Tyr Glu Lys Ala Val Asn Thr Pro Ile Th - #r Asp Gly Leu Gln Ser # 205 - Leu Lys Ser Ala Ser Asn Tyr Pro Ala Tyr Me - #t Asp Asn Tyr Leu Lys # 220 - Glu Val Ile Asn Gln Val Glu Glu Glu Thr Gl - #y Tyr Asn Leu Leu Thr 225 2 - #30 2 - #35 2 - #40 - Thr Gly Met Asp Val Tyr Thr Asn Val Asp Gl - #n Glu Ala Gln Lys His # 255 - Leu Trp Asp Ile Tyr Asn Thr Asp Glu Tyr Va - #l Ala Tyr Pro Asp Asp # 270 - Glu Leu Gln Val Ala Ser Thr Ile Val Asp Va - #l Ser Asn Gly Lys Val # 285 - Ile Ala Gln Leu Gly Ala Arg His Gln Ser Se - #r Asn Val Ser Phe Gly # 300 - Ile Asn Gln Ala Val Glu Thr Asn Arg Asp Tr - #p Gly Ser Thr Met Lys 305 3 - #10 3 - #15 3 - #20 - Pro Ile Thr Asp Tyr Ala Pro Ala Leu Glu Ty - #r Gly Val Tyr Asp Ser # 335 - Thr Ala Thr Ile Val His Asp Glu Pro Tyr As - #n Tyr Pro Gly Thr Asn # 350 - Thr Pro Val Tyr Asn Trp Asp Arg Gly Tyr Ph - #e Gly Asn Ile Thr Leu # 365 - Gln Tyr Ala Leu Gln Gln Ser Arg Asn Val Pr - #o Ala Val Glu Thr Leu # 380 - Asn Lys Val Gly Leu Asn Arg Ala Lys Thr Ph - #e Leu Asn Gly Leu Gly 385 3 - #90 3 - #95 4 - #00 - Ile Asp Tyr Pro Ser Ile His Tyr Ser Asn Al - #a Ile Ser Ser Asn Thr # 415 - Thr Glu Ser Asp Lys Lys Tyr Gly Ala Ser Se - #r Glu Lys Met Ala Ala # 430 - Ala Tyr Ala Ala Phe Ala Asn Gly Gly Thr Ty - #r Tyr Lys Pro Met Tyr # 445 - Ile His Lys Val Val Phe Ser Asp Gly Ser Gl - #u Lys Glu Phe Ser Asn # 460 - Val Gly Thr Arg Ala Met Lys Glu Thr Thr Al - #a Tyr Met Met Thr Asp 465 4 - #70 4 - #75 4 - #80 - Met Met Lys Thr Val Leu Thr Tyr Gly Thr Gl - #y Arg Asn Ala Tyr Leu # 495 - Ala Trp Leu Pro Gln Ala Gly Lys Thr Gly Th - #r Ser Asn Tyr Thr Asp # 510 - Glu Glu Ile Glu Asn His Ile Lys Thr Ser Gl - #n Phe Val Ala Pro Asp # 525 - Glu Leu Phe Ala Gly Tyr Thr Arg Lys Tyr Se - #r Met Ala Val Trp Thr # 540 - Gly Tyr Ser Asn Arg Leu Thr Pro Leu Val Gl - #y Asn Gly Leu Thr Val 545 5 - #50 5 - #55 5 - #60 - Ala Ala Lys Val Tyr Arg Ser Met Met Thr Ty - #r Leu Ser Glu Gly Ser # 575 - 
Asn Pro Glu Asp Trp Asn Ile Pro Glu Gly Le - #u Tyr Arg Asn Gly Glu # 590 - Phe Val Phe Lys Asn Gly Ala Arg Ser Thr Tr - #p Asn Ser Pro Ala Pro # 605 - Gln Gln Pro Pro Ser Thr Glu Ser Ser Ser Se - #r Ser Ser Asp Ser Ser # 620 - Thr Ser Gln Ser Ser Ser Thr Thr Pro Ser Th - #r Asn Asn Ser Thr Thr 625 6 - #30 6 - #35 6 - #40 - Thr Asn Pro Asn Asn Asn Thr Gln Gln Ser As - #n Thr Thr Pro Asp Gln # 655 - Gln Asn Gln Asn Pro Gln Pro Ala Gln Pro # 665 - (2) INFORMATION FOR SEQ ID NO: 3: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1714 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #3: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - AAATTACAAT ACGGACTATG AATTGACCTC TGGAGAAAAA TTACCTCTTC CT - #AAAGAGAT 60 - TTCAGGTTAC ACTTATATTG GATATATCAA AGAGGGAAAA ACGACTTCTG AG - #TCTGAAGT 120 - AAGTAATCAA AAGAGTTCAG TTGCCACTCC TACAAAACAA CAAAAGGTGG AT - #TATAATGT 180 - TACACCGAAT TTTGTAGACC ATCCATCAAC AGTACAAGCT ATTCAGGAAC AA - #ACACCTGT 240 - TTCTTCAACT AAGCCGACAG AAGTTCAAGT AGTTGAAAAA CCTTTCTCTA CT - #GAATTAAT 300 - CAATCCAAGA AAAGAAGAGA AACAATCTTC AGATTCTCAA GAACAATTAG CC - #GAACATAA 360 - GAATCTAGAA ACGAAGAAAG AGGAGAAGAT TTCTCCAAAA GAAAAGACTG GG - #GTAAATAC 420 - ATTAAATCCA CAGGATGAAG TTTTATCAGG TCAATTGAAC AAACCTGAAC TC - #TTATATCG 480 - TGAGGAAACT ATGGAGACAA AAATAGATTT TCAAGAAGAA ATTCAAGAAA AT - #CCTGATTT 540 - AGCTGAAGGA ACTGTAAGAG TAAAACAAGA AGGTAAATTA GGTAAGAAAG TT - #GAAATCGT 600 - CAGAATATTC TCTGTAAACA AGGAAGAAGT TTCGCGAGAA ATTGTTTCAA CT - #TCAACGAC 660 - TGCGCCTAGT CCAAGAATAG TCGAAAAAGG TACTAAAAAA ACTCAAGTTA TA - #AAGGAACA 720 - ACCTGAGACT GGTGTAGAAC ATAAGGACGT ACAGTCTGGA GCTATTGTTG AA - #CCCGCAAT 780 - TCAGCCTGAG TTGCCCGAAG CTGTAGTAAG TGACAAAGGC GAACCAGAAG TT - #CAACCTAC 840 - ATTACCCGAA GCAGTTGTGA CCGACAAAGG TGAGACTGAG GTTCAACCAG AG - #TCGCCAGA 900 - TACTGTGGTA AGTGATAAAG GTGAACCAGA GCAGGTAGCA CCGCTTCCAG AA - #TATAAGGG 960 - TAATATTGAG CAAGTAAAAC CTGAAACTCC GGTTGAGAAG ACCAAAGAAC AA - #GGTCCAGA 1020 - AAAAACTGAA GAAGTTCCAG TAAAACCAAC AGAAGAAACA 
CCAGTAAATC CA - #AATGAAGG 1080 - TACTACAGAA GGAACCTCAA TTCAAGAAGC AGAAAATCCA GTTCAACCTG CA - #GAAGAATC 1140 - AACAACGAAT TCAGAGAAAG TATCACCAGA TACATCTAGC AAAAATACTG GG - #GAAGTGTC 1200 - CAGTAATCCT AGTGATTCGA CAACCTCAGT TGGAGAATCA AATAAACCAG AA - #CATAATGA 1260 - CTCTAAAAAT GAAAATTCAG AAAAAACTGT AGAAGAAGTT CCAGTAAATC CA - #AATGAAGG 1320 - CACAGTAGAA GGTACCTCAA ATCAAGAAAC AGAAAAACCA GTTCAACCTG CA - #GAAGAAAC 1380 - ACAAACAAAC TCTGGGAAAA TAGCTAACGA AAATACTGGA GAAGTATCCA AT - #AAACCTAG 1440 - TGATTCAAAA CCACCAGTTG AAGAATCAAA TCAACCAGAA AAAAACGGAA CT - #GCAACAAA 1500 - ACCAGAAAAT TCAGGTAATA CAACATCAGA GAATGGACAA ACAGAACCAG AA - #CCATCAAA 1560 - CGGAAATTCA ACTGAGGATG TTTCAACCGA ATCAAACACA TCCAATTCAA AT - #GGAAACGA 1620 - AGAAATTAAA CAAGAAAATG AACTAGACCC TGATAAAAAG GTAGAAGAAC CA - #GAGAAAAC 1680 # 1714 GTTT CCGACCTAGA GTTA - (2) INFORMATION FOR SEQ ID NO:4: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 571 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: - Asn Tyr Asn Thr Asp Tyr Glu Leu Thr Ser Gl - #y Glu Lys Leu Pro Leu # 15 - Pro Lys Glu Ile Ser Gly Tyr Thr Tyr Ile Gl - #y Tyr Ile Lys Glu Gly # 30 - Lys Thr Thr Ser Glu Ser Glu Val Ser Asn Gl - #n Lys Ser Ser Val Ala # 45 - Thr Pro Thr Lys Gln Gln Lys Val Asp Tyr As - #n Val Thr Pro Asn Phe

# 60 - Val Asp His Pro Ser Thr Val Gln Ala Ile Gl - #n Glu Gln Thr Pro Val #80 - Ser Ser Thr Lys Pro Thr Glu Val Gln Val Va - #l Glu Lys Pro Phe Ser # 95 - Thr Glu Leu Ile Asn Pro Arg Lys Glu Glu Ly - #s Gln Ser Ser Asp Ser # 110 - Gln Glu Gln Leu Ala Glu His Lys Asn Leu Gl - #u Thr Lys Lys Glu Glu # 125 - Lys Ile Ser Pro Lys Glu Lys Thr Gly Val As - #n Thr Leu Asn Pro Gln # 140 - Asp Glu Val Leu Ser Gly Gln Leu Asn Lys Pr - #o Glu Leu Leu Tyr Arg 145 1 - #50 1 - #55 1 - #60 - Glu Glu Thr Met Glu Thr Lys Ile Asp Phe Gl - #n Glu Glu Ile Gln Glu # 175 - Asn Pro Asp Leu Ala Glu Gly Thr Val Arg Va - #l Lys Gln Glu Gly Lys # 190 - Leu Gly Lys Lys Val Glu Ile Val Arg Ile Ph - #e Ser Val Asn Lys Glu # 205 - Glu Val Ser Arg Glu Ile Val Ser Thr Ser Th - #r Thr Ala Pro Ser Pro # 220 - Arg Ile Val Glu Lys Gly Thr Lys Lys Thr Gl - #n Val Ile Lys Glu Gln 225 2 - #30 2 - #35 2 - #40 - Pro Glu Thr Gly Val Glu His Lys Asp Val Gl - #n Ser Gly Ala Ile Val # 255 - Glu Pro Ala Ile Gln Pro Glu Leu Pro Glu Al - #a Val Val Ser Asp Lys # 270 - Gly Glu Pro Glu Val Gln Pro Thr Leu Pro Gl - #u Ala Val Val Thr Asp # 285 - Lys Gly Glu Thr Glu Val Gln Pro Glu Ser Pr - #o Asp Thr Val Val Ser # 300 - Asp Lys Gly Glu Pro Glu Gln Val Ala Pro Le - #u Pro Glu Tyr Lys Gly 305 3 - #10 3 - #15 3 - #20 - Asn Ile Glu Gln Val Lys Pro Glu Thr Pro Va - #l Glu Lys Thr Lys Glu # 335 - Gln Gly Pro Glu Lys Thr Glu Glu Val Pro Va - #l Lys Pro Thr Glu Glu # 350 - Thr Pro Val Asn Pro Asn Glu Gly Thr Thr Gl - #u Gly Thr Ser Ile Gln # 365 - Glu Ala Glu Asn Pro Val Gln Pro Ala Glu Gl - #u Ser Thr Thr Asn Ser # 380 - Glu Lys Val Ser Pro Asp Thr Ser Ser Lys As - #n Thr Gly Glu Val Ser 385 3 - #90 3 - #95 4 - #00 - Ser Asn Pro Ser Asp Ser Thr Thr Ser Val Gl - #y Glu Ser Asn Lys Pro # 415 - Glu His Asn Asp Ser Lys Asn Glu Asn Ser Gl - #u Lys Thr Val Glu Glu # 430 - Val Pro Val Asn Pro Asn Glu Gly Thr Val Gl - #u Gly Thr Ser Asn Gln # 445 - Glu Thr Glu Lys Pro Val Gln Pro Ala Glu Gl - #u Thr Gln Thr Asn Ser # 460 - Gly Lys 
Ile Ala Asn Glu Asn Thr Gly Glu Va - #l Ser Asn Lys Pro Ser 465 4 - #70 4 - #75 4 - #80 - Asp Ser Lys Pro Pro Val Glu Glu Ser Asn Gl - #n Pro Glu Lys Asn Gly # 495 - Thr Ala Thr Lys Pro Glu Asn Ser Gly Asn Th - #r Thr Ser Glu Asn Gly # 510 - Gln Thr Glu Pro Glu Pro Ser Asn Gly Asn Se - #r Thr Glu Asp Val Ser # 525 - Thr Glu Ser Asn Thr Ser Asn Ser Asn Gly As - #n Glu Glu Ile Lys Gln # 540 - Glu Asn Glu Leu Asp Pro Asp Lys Lys Val Gl - #u Glu Pro Glu Lys Thr 545 5 - #50 5 - #55 5 - #60 - Leu Glu Leu Arg Asn Val Ser Asp Leu Glu Le - #u # 570 - (2) INFORMATION FOR SEQ ID NO: 5: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 748 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear - (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: - TGAGAATCAA GCTACACCCA AAGAGACTAG CGCTCAAAAG ACAATCGTCC TT - #GCTACAGC 60 - TGGCGACGTG CCACCATTTG ACTACGAAGA CAAGGGCAAT CTGACAGGCT TT - #GATATCGA 120 - AGTTTTAAAG GCAGTAGATG AAAAACTCAG CGACTACGAG ATTCAATTCC AA - #AGAACCGC 180 - CTGGGAGAGC ATCTTCCCAG GACTTGATTC TGGTCACTAT CAGGCTGCGG CC - #AATAACTT 240 - GAGTTACACA AAAGAGCGTG CTGAAAAATA CCTTTACTCG CTTCCAATTT CC - #AACAATCC 300 - CCTCGTCCTT GTCAGCAACA AGAAAAATCC TTTGACTTCT CTTGACCAGA TC - #GCTGGTAA 360 - AACAACACAA GAGGATACCG GAACTTCTAA CGCTCAATTC ATCAATAACT GG - #AATCAGAA 420 - ACACACTGAT AATCCCGCTA CAATTAATTT TTCTGGTGAG GATATTGGTA AA - #CGAATCCT 480 - AGACCTTGCT AACGGAGAGT TTGATTTCCT AGTTTTTGAC AAGGTATCCG TT - #CAAAAGAT 540 - TATCAAGGAC CGTGGTTTAG ACCTCTCAGT CGTTGATTTA CCTTCTGCAG AT - #AGCCCCAG 600 - CAATTATATC ATTTTCTCAA GCGACCAAAA AGAGTTTAAA GAGCAATTTG AT - #AAAGCGCT 660 - CAAAGAACTC TATCAAGACG GAACCCTTGA AAAACTCAGC AATACCTATC TA - #GGTGGTTC 720 # 748 TCTC AGTTACAA - (2) INFORMATION FOR SEQ ID NO:6: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 249 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: - Glu Asn Gln Ala Thr Pro Lys Glu Thr Ser Al - #a Gln Lys Thr Ile Val # 15 - 
Leu Ala Thr Ala Gly Asp Val Pro Pro Phe As - #p Tyr Glu Asp Lys Gly # 30 - Asn Leu Thr Gly Phe Asp Ile Glu Val Leu Ly - #s Ala Val Asp Glu Lys # 45 - Leu Ser Asp Tyr Glu Ile Gln Phe Gln Arg Th - #r Ala Trp Glu Ser Ile # 60 - Phe Pro Gly Leu Asp Ser Gly His Tyr Gln Al - #a Ala Ala Asn Asn Leu #80 - Ser Tyr Thr Lys Glu Arg Ala Glu Lys Tyr Le - #u Tyr Ser Leu Pro Ile # 95 - Ser Asn Asn Pro Leu Val Leu Val Ser Asn Ly - #s Lys Asn Pro Leu Thr # 110 - Ser Leu Asp Gln Ile Ala Gly Lys Thr Thr Gl - #n Glu Asp Thr Gly Thr # 125 - Ser Asn Ala Gln Phe Ile Asn Asn Trp Asn Gl - #n Lys His Thr Asp Asn # 140 - Pro Ala Thr Ile Asn Phe Ser Gly Glu Asp Il - #e Gly Lys Arg Ile Leu 145 1 - #50 1 - #55 1 - #60 - Asp Leu Ala Asn Gly Glu Phe Asp Phe Leu Va - #l Phe Asp Lys Val Ser # 175 - Val Gln Lys Ile Ile Lys Asp Arg Gly Leu As - #p Leu Ser Val Val Asp # 190 - Leu Pro Ser Ala Asp Ser Pro Ser Asn Tyr Il - #e Ile Phe Ser Ser Asp # 205 - Gln Lys Glu Phe Lys Glu Gln Phe Asp Lys Al - #a Leu Lys Glu Leu Tyr # 220 - Gln Asp Gly Thr Leu Glu Lys Leu Ser Asn Th - #r Tyr Leu Gly Gly Ser 225 2 - #30 2 - #35 2 - #40 - Tyr Leu Pro Asp Gln Ser Gln Leu Gln 245 - (2) INFORMATION FOR SEQ ID NO: 7: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 985 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear - (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: - TGGTAACCGC TCTTCTCGTA ACGCAGCTTC ATCTTCTGAT GTGAAGACAA AA - #GCAGCAAT 60 - CGTCACTGAT ACTGGTGGTG TTGATGACAA ATCATTCAAC CAATCAGCTT GG - #GAAGGTTT 120 - GCAGGCTTGG GGTAAAGAAC ACAATCTTTC AAAAGATAAC GGTTTCACTT AC - #TTCCAATC 180 - AACAAGTGAA GCTGACTACG CTAACAACTT GCAACAAGCG GCTGGAAGTT AC - #AACCTAAT 240 - CTTCGGTGTT GGTTTTGCCC TTAATAATGC AGTTAAAGAT GCAGCAAAAG AA - #CACACTGA 300 - CTTGAACTAT GTCTTGATTG ATGATGTGAT TAAAGACCAA AAGAATGTTG CG - #AGCGTAAC 360 - TTTCGCTGAT AATGAGTCAG GTTACCTTGC AGGTGTGGCT GCAGCAAAAA CA - #ACTAAGAC 420 - AAAACAAGTT GGTTTTGTAG GTGGTATCGA ATCTGAAGTT ATCTCTCGTT TT - #GAAGCAGG 480 - ATTCAAGGCT GGTGTTGCGT CAGTAGACCC 
ATCTATCAAA GTCCAAGTTG AC - #TACGCTGG 540 - TTCATTTGGT GATGCGGCTA AAGGTAAAAC AATTGCAGCC GCACAATACG CA - #GCCGGTGC 600 - AGATATTGTT TACCAAGTAG CTGGTGGTAC AGGTGCAGGT GTCTTTGCAG AG - #GCAAAATC 660 - TCTCAACGAA AGCCGTCCTG AAAATGAAAA AGTTTGGGTT ATCGGTGTTG AT - #CGTGACCA 720 - AGAAGCAGAA GGTAAATACA CTTCTAAAGA TGGCAAAGAA TCAAACTTTG TT - #CTTGTATC 780 - TACTTTGAAA CAAGTTGGTA CAACTGTAAA AGATATTTCT AACAAGGCAG AA - #AGAGGAGA 840 - ATTCCCTGGC GGTCAAGTGA TCGTTTACTC ATTGAAGGAT AAAGGGGTTG AC - #TTGGCAGT 900 - AACAAACCTT TCAGAAGAAG GTAAAAAAGC TGTCGAAGAT GCAAAAGCTA AA - #ATCCTTGA 960 # 985 CCTG AAAAA - (2) INFORMATION FOR SEQ ID NO:8: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 328 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: - Gly Asn Arg Ser Ser Arg Asn Ala Ala Ser Se - #r Ser Asp Val Lys Thr # 15 - Lys Ala Ala Ile Val Thr Asp Thr Gly Gly Va - #l Asp Asp Lys Ser Phe # 30 - Asn Gln Ser Ala Trp Glu Gly Leu Gln Ala Tr - #p Gly Lys Glu His Asn # 45 - Leu Ser Lys Asp Asn Gly Phe Thr Tyr Phe Gl - #n Ser Thr Ser Glu Ala # 60 - Asp Tyr Ala Asn Asn Leu Gln Gln Ala Ala Gl - #y Ser Tyr Asn Leu Ile #80 - Phe Gly Val Gly Phe Ala Leu Asn Asn Ala Va - #l Lys Asp Ala Ala Lys # 95 - Glu His Thr Asp Leu Asn Tyr Val Leu Ile As - #p Asp Val Ile Lys Asp # 110 - Gln Lys Asn Val Ala Ser Val Thr Phe Ala As - #p Asn Glu Ser Gly Tyr # 125 - Leu Ala Gly Val Ala Ala Ala Lys Thr Thr Ly - #s Thr Lys Gln Val Gly # 140 - Phe Val Gly Gly Ile Glu Ser Glu Val Ile Se - #r Arg Phe Glu Ala Gly 145 1 - #50 1 - #55 1 - #60 - Phe Lys Ala Gly Val Ala Ser Val Asp Pro Se - #r Ile Lys Val Gln Val # 175 - Asp Tyr Ala Gly Ser Phe Gly Asp Ala Ala Ly - #s Gly Lys Thr Ile Ala # 190 - Ala Ala Gln Tyr Ala Ala Gly Ala Asp Ile Va - #l Tyr Gln Val Ala Gly # 205 - Gly Thr Gly Ala Gly Val Phe Ala Glu Ala Ly - #s Ser Leu Asn Glu Ser # 220 - Arg Pro Glu Asn Glu Lys Val Trp Val Ile Gl - #y Val Asp Arg Asp Gln 225 2 - #30 2 - #35 2 - 
#40 - Glu Ala Glu Gly Lys Tyr Thr Ser Lys Asp Gl - #y Lys Glu Ser Asn Phe # 255 - Val Leu Val Ser Thr Leu Lys Gln Val Gly Th - #r Thr Val Lys Asp Ile # 270 - Ser Asn Lys Ala Glu Arg Gly Glu Phe Pro Gl - #y Gly Gln Val Ile Val # 285 - Tyr Ser Leu Lys Asp Lys Gly Val Asp Leu Al - #a Val Thr Asn Leu Ser # 300 - Glu Glu Gly Lys Lys Ala Val Glu Asp Ala Ly - #s Ala Lys Ile Leu Asp 305 3 - #10 3 - #15 3 - #20 - Gly Ser Val Lys Val Pro Glu Lys 325 - (2) INFORMATION FOR SEQ ID NO: 9: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1404 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #9: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGTGGAAATT TGACAGGTAA CAGCAAAAAA GCTGCTGATT CAGGTGACAA AC - #CTGTTATC 60 - AAAATGTACC AAATCGGTGA CAAACCAGAC AACTTGGATG AATTGTTAGC AA - #ATGCCAAC 120 - AAAATCATTG AAGAAAAAGT TGGTGCCAAA TTGGATATCC AATACCTTGG CT - #GGGGTGAC 180 - TATGGTAAGA AAATGTCAGT TATCACATCA TCTGGTGAAA ACTATGATAT TG - #CCTTTGCA 240 - GATAACTATA TTGTAAATGC TCAAAAAGGT GCTTACGCTG ACTTGACAGA AT - #TGTACAAA 300

- AAAGAAGGTA AAGACCTTTA CAAAGCACTT GACCCAGCTT ACATCAAGGG TA - #ATACTGTA 360 - AATGGTAAGA TTTACGCTGT TCCAGTTGCA GCCAACGTTG CATCATCTCA AA - #ACTTTGCC 420 - TTCAACGGAA CTCTCCTTGC TAAATATGGT ATCGATATTT CAGGTGTTAC TT - #CTTACGAA 480 - ACTCTTGAGC CAGTCTTGAA ACAAATCAAA GAAAAAGCTC CAGACGTAGT AC - #CATTTGCT 540 - ATTGGTAAAG TTTTCATCCC ATCTGATAAT TTTGACTACC CAGTAGCAAA CG - #GTCTTCCA 600 - TTCGTTATCG ACCTTGAAGG CGATACTACT AAAGTTGTAA ACCGTTACGA AG - #TGCCTCGT 660 - TTCAAAGAAC ACTTGAAGAC TCTTCACAAA TTCTATGAAG CTGGCTACAT TC - #CAAAAGAC 720 - GTCGCAACAA GCGATACTTC CTTTGACCTT CAACAAGATA CTTGGTTCGT TC - #GTGAAGAA 780 - ACAGTAGGAC CAGCTGACTA CGGTAACAGC TTGCTTTCAC GTGTTGCCAA CA - #AAGATATC 840 - CAAATCAAAC CAATTACTAA CTTCATCAAG NAAAACCAAA CAACACAAGT TG - #CTAACTTT 900 - GTCATCTCAA ACAACTCTAA GAACAAAGAA AAATCAATGG AAATCTTGAA CC - #TCTTGAAT 960 - ACGAACCCAG AACTCTTGAA CGGTCTTGTT TACGGTCCAG AAGGCAAGAA CT - #GGGAAAAA 1020 - ATTGAAGGTA AAGAAAACCG TGTTCGCGTT CTTGATGGCT ACAAAGGAAA CA - #CTCACATG 1080 - GGTGGATGGA ACACTGGTAA CAACTGGATC CTTTACATCA ACGAAAACGT TA - #CAGACCAA 1140 - CAAATCGAAA ATTCTAAGAA AGAATTGGCA GAAGCTAAAG AATCTCCAGC GC - #TTGGATTT 1200 - ATCTTCAATA CTGACAATGT GAAATCTGAA ATCTCAGCTA TTGCTAACAC AA - #TGCAACAA 1260 - TTTGATACAG CTATCAACAC TGGTACTGTA GACCCAGATA AAGCGATTCC AG - #AATTGATG 1320 - GAAAAATTGA AATCTGAAGG TGCCTACGAA AAAGTATTGA ACGAAATGCA AA - #AACAATAC 1380 # 1404ACAA AAAA - (2) INFORMATION FOR SEQ ID NO:10: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 468 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: - Cys Gly Asn Leu Thr Gly Asn Ser Lys Lys Al - #a Ala Asp Ser Gly Asp # 15 - Lys Pro Val Ile Lys Met Tyr Gln Ile Gly As - #p Lys Pro Asp Asn Leu # 30 - Asp Glu Leu Leu Ala Asn Ala Asn Lys Ile Il - #e Glu Glu Lys Val Gly # 45 - Ala Lys Leu Asp Ile Gln Tyr Leu Gly Trp Gl - #y Asp Tyr Gly Lys Lys # 60 - Met Ser Val Ile Thr Ser Ser Gly Glu Asn Ty - #r Asp Ile Ala Phe Ala 
#80 - Asp Asn Tyr Ile Val Asn Ala Gln Lys Gly Al - #a Tyr Ala Asp Leu Thr # 95 - Glu Leu Tyr Lys Lys Glu Gly Lys Asp Leu Ty - #r Lys Ala Leu Asp Pro # 110 - Ala Tyr Ile Lys Gly Asn Thr Val Asn Gly Ly - #s Ile Tyr Ala Val Pro # 125 - Val Ala Ala Asn Val Ala Ser Ser Gln Asn Ph - #e Ala Phe Asn Gly Thr # 140 - Leu Leu Ala Lys Tyr Gly Ile Asp Ile Ser Gl - #y Val Thr Ser Tyr Glu 145 1 - #50 1 - #55 1 - #60 - Thr Leu Glu Pro Val Leu Lys Gln Ile Lys Gl - #u Lys Ala Pro Asp Val # 175 - Val Pro Phe Ala Ile Gly Lys Val Phe Ile Pr - #o Ser Asp Asn Phe Asp # 190 - Tyr Pro Val Ala Asn Gly Leu Pro Phe Val Il - #e Asp Leu Glu Gly Asp # 205 - Thr Thr Lys Val Val Asn Arg Tyr Glu Val Pr - #o Arg Phe Lys Glu His # 220 - Leu Lys Thr Leu His Lys Phe Tyr Glu Ala Gl - #y Tyr Ile Pro Lys Asp 225 2 - #30 2 - #35 2 - #40 - Val Ala Thr Ser Asp Thr Ser Phe Asp Leu Gl - #n Gln Asp Thr Trp Phe # 255 - Val Arg Glu Glu Thr Val Gly Pro Ala Asp Ty - #r Gly Asn Ser Leu Leu # 270 - Ser Arg Val Ala Asn Lys Asp Ile Gln Ile Ly - #s Pro Ile Thr Asn Phe # 285 - Ile Lys Xaa Asn Gln Thr Thr Gln Val Ala As - #n Phe Val Ile Ser Asn # 300 - Asn Ser Lys Asn Lys Glu Lys Ser Met Glu Il - #e Leu Asn Leu Leu Asn 305 3 - #10 3 - #15 3 - #20 - Thr Asn Pro Glu Leu Leu Asn Gly Leu Val Ty - #r Gly Pro Glu Gly Lys # 335 - Asn Trp Glu Lys Ile Glu Gly Lys Glu Asn Ar - #g Val Arg Val Leu Asp # 350 - Gly Tyr Lys Gly Asn Thr His Met Gly Gly Tr - #p Asn Thr Gly Asn Asn # 365 - Trp Ile Leu Tyr Ile Asn Glu Asn Val Thr As - #p Gln Gln Ile Glu Asn # 380 - Ser Lys Lys Glu Leu Ala Glu Ala Lys Glu Se - #r Pro Ala Leu Gly Phe 385 3 - #90 3 - #95 4 - #00 - Ile Phe Asn Thr Asp Asn Val Lys Ser Glu Il - #e Ser Ala Ile Ala Asn # 415 - Thr Met Gln Gln Phe Asp Thr Ala Ile Asn Th - #r Gly Thr Val Asp Pro # 430 - Asp Lys Ala Ile Pro Glu Leu Met Glu Lys Le - #u Lys Ser Glu Gly Ala # 445 - Tyr Glu Lys Val Leu Asn Glu Met Gln Lys Gl - #n Tyr Asp Glu Phe Leu # 460 - Lys Asn Lys Lys 465 - (2) INFORMATION FOR SEQ ID NO: 11: - (i) SEQUENCE 
CHARACTERISTICS: #pairs (A) LENGTH: 937 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #11: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGGTCAAGGA ACTGCTTCTA AAGACAACAA AGAGGCAGAA CTTAAGAAGG TT - #GACTTTAT 60 - CCTAGACTGG ACACCAAATA CCAACCACAC AGGGCTTTAT GTTGCCAAGG AA - #AAAGGTTA 120 - TTTCAAAGAA GCTGGAGTGG ATGTTGATTT GAAATTGCCA CCAGAAGAAA GT - #TCTTCTGA 180 - CTTGGTTATC AACGGAAAGG CACCATTTGC AGTGTATTTC CAAGACTACA TG - #GCTAAGAA 240 - ATTGGAAAAA GGAGCAGGAA TCACTGCCGT TGCAGCTATT GTTGAACACA AT - #ACATCAGG 300 - AATCATCTCT CGTAAATCTG ATAATGTAAG CAGTCCAAAA GACTTGGTTG GT - #AAGAAATA 360 - TGGGACATGG AATGACCCAA CTGAACTTGC TATGTTGAAA ACCTTGGTAG AA - #TCTCAAGG 420 - TGGAGACTTT GAGAAGGTTG AAAAAGTACC AAATAACGAC TCAAACTCAA TC - #ACACCGAT 480 - TGCCAATGGC GTCTTTGATA CTGCTTGGAT TTACTACGGT TGGGATGGTA TC - #CTTGCTAA 540 - ATCTCAAGGT GTAGATGCTA ACTTCATGTA CTTGAAAGAC TATGTCAAGG AG - #TTTGACTA 600 - CTATTCACCA GTTATCATCG CAAACAACGA CTATCTGAAA GATAACAAAG AA - #GAAGCTCG 660 - CAAAGTCATC CAAGCCATCA AAAAAGGCTA CCAATATGCC ATGGAACATC CA - #GAAGAAGC 720 - TGCAGATATT CTCATCAAGA ATGCACCTGA ACTCAAGGAA AAACGTGACT TT - #GTCATCGA 780 - ATCTCAAAAA TACTTGTCAA AAGAATACGC AAGCGACAAG GAAAAATGGG GT - #CAATTTGA 840 - CGCAGCTCGC TGGAATGCTT TCTACAAATG GGATAAAGAA AATGGTATCC TT - #AAAGAAGA 900 # 937 TTCA CCAACGAATT TGTGAAA - (2) INFORMATION FOR SEQ ID NO:12: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 312 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: - Gly Gln Gly Thr Ala Ser Lys Asp Asn Lys Gl - #u Ala Glu Leu Lys Lys # 15 - Val Asp Phe Ile Leu Asp Trp Thr Pro Asn Th - #r Asn His Thr Gly Leu # 30 - Tyr Val Ala Lys Glu Lys Gly Tyr Phe Lys Gl - #u Ala Gly Val Asp Val # 45 - Asp Leu Lys Leu Pro Pro Glu Glu Ser Ser Se - #r Asp Leu Val Ile Asn # 60 - Gly Lys Ala Pro Phe Ala Val Tyr Phe Gln As - #p Tyr Met Ala Lys Lys #80 - Leu Glu Lys Gly Ala Gly Ile Thr Ala Val Al - #a Ala Ile Val 
Glu His # 95 - Asn Thr Ser Gly Ile Ile Ser Arg Lys Ser As - #p Asn Val Ser Ser Pro # 110 - Lys Asp Leu Val Gly Lys Lys Tyr Gly Thr Tr - #p Asn Asp Pro Thr Glu # 125 - Leu Ala Met Leu Lys Thr Leu Val Glu Ser Gl - #n Gly Gly Asp Phe Glu # 140 - Lys Val Glu Lys Val Pro Asn Asn Asp Ser As - #n Ser Ile Thr Pro Ile 145 1 - #50 1 - #55 1 - #60 - Ala Asn Gly Val Phe Asp Thr Ala Trp Ile Ty - #r Tyr Gly Trp Asp Gly # 175 - Ile Leu Ala Lys Ser Gln Gly Val Asp Ala As - #n Phe Met Tyr Leu Lys # 190 - Asp Tyr Val Lys Glu Phe Asp Tyr Tyr Ser Pr - #o Val Ile Ile Ala Asn # 205 - Asn Asp Tyr Leu Lys Asp Asn Lys Glu Glu Al - #a Arg Lys Val Ile Gln # 220 - Ala Ile Lys Lys Gly Tyr Gln Tyr Ala Met Gl - #u His Pro Glu Glu Ala 225 2 - #30 2 - #35 2 - #40 - Ala Asp Ile Leu Ile Lys Asn Ala Pro Glu Le - #u Lys Glu Lys Arg Asp # 255 - Phe Val Ile Glu Ser Gln Lys Tyr Leu Ser Ly - #s Glu Tyr Ala Ser Asp # 270 - Lys Glu Lys Trp Gly Gln Phe Asp Ala Ala Ar - #g Trp Asn Ala Phe Tyr # 285 - Lys Trp Asp Lys Glu Asn Gly Ile Leu Lys Gl - #u Asp Leu Thr Asp Lys # 300 - Gly Phe Thr Asn Glu Phe Val Lys 305 3 - #10 - (2) INFORMATION FOR SEQ ID NO: 13: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 799 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #13: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TAGCTCAGGT GGAAACGCTG GTTCATCCTC TGGAAAAACA ACTGCCAAAG CT - #CGCACTAT 60 - CGATGAAATC AAAAAAAGCG GTGAACTGCG AATCGCCGTG TTTGGAGATA AA - #AAACCGTT 120 - TGGCTACGTT GACAATGATG GTTCTACCAA GGTACGCTAC GATATTGAAC TA - #GGGAACCA 180 - ACTAGCTCAA GACCTTGGTG TCAAGGTTAA ATACATTTCA GTCGATGCTG CC - #AACCGTGC 240 - GGAATACTTG ATTTCAAACA AGGTAGATAT TACTCTTGCT AACTTTACAG TA - #ACTGACGA 300 - ACGTAAGAAA CAAGTTGATT TTGCCCTTCC ATATATGAAA GTTTCTCTGG GT - #GTCGTATC 360 - ACCTAAGACT GGTCTCATTA CAGACGTCAA ACAACTTGAA GGTAAAACCT TA - #ATTGTCAC 420 - AAAAGGAACG ACTGCTGAGA CTTATTTTGA AAAGAATCAT CCAGAAATCA AA - #CTCCAAAA 480 - ATACGACCAA TACAGTGACT CTTACCAAGC TCTTCTTGAC GGACGTGGAG AT - #GCCTTTTC 540 - AACTGACAAT 
ACGGAAGTTC TAGCTTGGGC GCTTGAAAAT AAAGGATTTG AA - #GTAGGAAT 600 - TACTTCCCTC GGTGATCCCG ATACCATTGC GGCAGCAGTT CAAAAAGGCA AC - #CAAGAATT 660 - GCTAGACTTC ATCAATAAAG ATATTGAAAA ATTAGGCAAG GAAAACTTCT TC - #CACAAGGC 720 - CTATGAAAAG ACACTTCACC CAACCTACGG TGACGCTGCT AAAGCAGATG AC - #CTGGTTGT 780 #799 GAT - (2) INFORMATION FOR SEQ ID NO:14: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 266 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: - Ser Ser Gly Gly Asn Ala Gly Ser Ser Ser Gl - #y Lys Thr Thr Ala Lys # 15 - Ala Arg Thr Ile Asp Glu Ile Lys Lys Ser Gl - #y Glu Leu Arg Ile Ala # 30 - Val Phe Gly Asp Lys Lys Pro Phe Gly Tyr Va - #l Asp Asn Asp Gly Ser # 45 - Thr Lys Val Arg Tyr Asp Ile Glu Leu Gly As - #n Gln Leu Ala Gln Asp # 60 - Leu Gly Val Lys Val Lys Tyr Ile Ser Val As - #p Ala Ala Asn Arg Ala #80 - Glu Tyr Leu Ile Ser Asn Lys Val Asp Ile Th - #r Leu Ala Asn Phe Thr # 95 - Val Thr Asp Glu Arg Lys Lys Gln Val Asp Ph - #e Ala Leu Pro Tyr Met # 110 - Lys Val Ser Leu Gly Val Val Ser Pro Lys Th - #r Gly Leu Ile Thr Asp # 125

- Val Lys Gln Leu Glu Gly Lys Thr Leu Ile Va - #l Thr Lys Gly Thr Thr # 140 - Ala Glu Thr Tyr Phe Glu Lys Asn His Pro Gl - #u Ile Lys Leu Gln Lys 145 1 - #50 1 - #55 1 - #60 - Tyr Asp Gln Tyr Ser Asp Ser Tyr Gln Ala Le - #u Leu Asp Gly Arg Gly # 175 - Asp Ala Phe Ser Thr Asp Asn Thr Glu Val Le - #u Ala Trp Ala Leu Glu # 190 - Asn Lys Gly Phe Glu Val Gly Ile Thr Ser Le - #u Gly Asp Pro Asp Thr # 205 - Ile Ala Ala Ala Val Gln Lys Gly Asn Gln Gl - #u Leu Leu Asp Phe Ile # 220 - Asn Lys Asp Ile Glu Lys Leu Gly Lys Glu As - #n Phe Phe His Lys Ala 225 2 - #30 2 - #35 2 - #40 - Tyr Glu Lys Thr Leu His Pro Thr Tyr Gly As - #p Ala Ala Lys Ala Asp # 255 - Asp Leu Val Val Glu Gly Gly Lys Val Asp # 265 - (2) INFORMATION FOR SEQ ID NO: 15: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1189 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #15: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CTCCAACTAT GGTAAATCTG CGGATGGCAC AGTGACCATC GAGTATTTCA AC - #CAGAAAAA 60 - AGAAATGACC AAAACCTTGG AAGAAATCAC TCGTGATTTT GAGAAGGAAA AC - #CCTAAGAT 120 - CAAGGTCAAA GTCGTCAATG TACCAAATGC TGGTGAAGTA TTGAAGACAC GC - #GTTCTCGC 180 - AGGAGATGTG CCTGATGTGG TCAATATTTA CCCACAGTCC ATCGAACTGC AA - #GAATGGGC 240 - AAAAGCAGGT GTTTTTGAAG ATTTGAGCAA CAAAGACTAC CTGAAACGCG TG - #AAAAATGG 300 - CTACGCTGAA AAATATGCTG TAAACGAAAA AGTTTACAAC GTTCCTTTTA CA - #GCTAATGC 360 - TTATGGAATT TACTACAACA AAGATAAATT CGAAGAACTG GGCTTGAAGG TT - #CCTGAAAC 420 - CTGGGATGAA TTTGAACAGT TAGTCAAAGA TATCGTTGCT AAAGGACAAA CA - #CCATTTGG 480 - AATTGCAGGT GCAGATGCTT GGACACTCAA TGGTTACAAT CAATTAGCCT TT - #GCGACAGC 540 - AACAGGTGGA GGAAAAGAAG CAAATCAATA CCTTCGTTAT TCTCAACCAA AT - #GCCATTAA 600 - ATTGTCGGAT CCGATTATGA AAGATGATAT CAAGGTCATG GACATCCTTC GC - #ATCAATGG 660 - ATCTAAGCAA AAGAACTGGG AAGGTGCTGG CTATACCGAT GTTATCGGAG CC - #TTCGCACG 720 - TGGGGATGTC CTCATGACAC CAAATGGGTC TTGGGCGATC ACAGCGATTA AT - #GAACAAAA 780 - ACCGAACTTT AAGATTGGGA CCTTCATGAT TCCAGGAAAA GAAAAAGGAC AA - #AGCTTAAC 840 - CGTTGGTGCG GGAGACTTGG 
CATGGTCTAT CTCAGCCACC ACCAAACATC CA - #AAAGAAGC 900 - CAATGCCTTT GTGGAATATA TGACCCGTCC AGAAGTCATG CAAAAATACT AC - #GATGTGGA 960 - CGGATCTCCA ACAGCGATCG AAGGGGTCAA ACAAGCAGGA GAAGATTCAC CG - #CTTGCTGG 1020 - TATGACCGAA TATGCCTTTA CGGATCGTCA CTTGGTCTGG TTGCAACAAT AC - #TGGACCAG 1080 - TGAAGCAGAC TTCCATACCT TGACCATGAA CTATGTCTTG ACCGGTGATA AA - #CAAGGCAT 1140

# 1189TGCCT TCTTTAACCC GATGAAAGCG GATGTGGAT - (2) INFORMATION FOR SEQ ID NO:16: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 396 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: - Ser Asn Tyr Gly Lys Ser Ala Asp Gly Thr Va - #l Thr Ile Glu Tyr Phe # 15 - Asn Gln Lys Lys Glu Met Thr Lys Thr Leu Gl - #u Glu Ile Thr Arg Asp # 30 - Phe Glu Lys Glu Asn Pro Lys Ile Lys Val Ly - #s Val Val Asn Val Pro # 45 - Asn Ala Gly Glu Val Leu Lys Thr Arg Val Le - #u Ala Gly Asp Val Pro # 60 - Asp Val Val Asn Ile Tyr Pro Gln Ser Ile Gl - #u Leu Gln Glu Trp Ala #80 - Lys Ala Gly Val Phe Glu Asp Leu Ser Asn Ly - #s Asp Tyr Leu Lys Arg # 95 - Val Lys Asn Gly Tyr Ala Glu Lys Tyr Ala Va - #l Asn Glu Lys Val Tyr # 110 - Asn Val Pro Phe Thr Ala Asn Ala Tyr Gly Il - #e Tyr Tyr Asn Lys Asp # 125 - Lys Phe Glu Glu Leu Gly Leu Lys Val Pro Gl - #u Thr Trp Asp Glu Phe # 140 - Glu Gln Leu Val Lys Asp Ile Val Ala Lys Gl - #y Gln Thr Pro Phe Gly 145 1 - #50 1 - #55 1 - #60 - Ile Ala Gly Ala Asp Ala Trp Thr Leu Asn Gl - #y Tyr Asn Gln Leu Ala # 175 - Phe Ala Thr Ala Thr Gly Gly Gly Lys Glu Al - #a Asn Gln Tyr Leu Arg # 190 - Tyr Ser Gln Pro Asn Ala Ile Lys Leu Ser As - #p Pro Ile Met Lys Asp # 205 - Asp Ile Lys Val Met Asp Ile Leu Arg Ile As - #n Gly Ser Lys Gln Lys # 220 - Asn Trp Glu Gly Ala Gly Tyr Thr Asp Val Il - #e Gly Ala Phe Ala Arg 225 2 - #30 2 - #35 2 - #40 - Gly Asp Val Leu Met Thr Pro Asn Gly Ser Tr - #p Ala Ile Thr Ala Ile # 255 - Asn Glu Gln Lys Pro Asn Phe Lys Ile Gly Th - #r Phe Met Ile Pro Gly # 270 - Lys Glu Lys Gly Gln Ser Leu Thr Val Gly Al - #a Gly Asp Leu Ala Trp # 285 - Ser Ile Ser Ala Thr Thr Lys His Pro Lys Gl - #u Ala Asn Ala Phe Val # 300 - Glu Tyr Met Thr Arg Pro Glu Val Met Gln Ly - #s Tyr Tyr Asp Val Asp 305 3 - #10 3 - #15 3 - #20 - Gly Ser Pro Thr Ala Ile Glu Gly Val Lys Gl - #n Ala Gly Glu Asp Ser # 335 - Pro Leu Ala Gly Met Thr Glu Tyr Ala Phe Th - #r Asp Arg His 
Leu Val # 350 - Trp Leu Gln Gln Tyr Trp Thr Ser Glu Ala As - #p Phe His Thr Leu Thr # 365 - Met Asn Tyr Val Leu Thr Gly Asp Lys Gln Gl - #y Met Val Asn Asp Leu # 380 - Asn Ala Phe Phe Asn Pro Met Lys Ala Asp Va - #l Asp 385 3 - #90 3 - #95 - (2) INFORMATION FOR SEQ ID NO: 17: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 775 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #17: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGGGAAAAAT TCTAGCGAAA CTAGTGGAGA TAATTGGTCA AAGTACCAGT CT - #AACAAGTC 60 - TATTACTATT GGATTTGATA GTACTTTTGT TCCAATGGGA TTTGCTCAGA AA - #GATGGTTC 120 - TTATGCAGGA TTTGATATTG ATTTAGCTAC AGCTGTTTTT GAAAAATACG GA - #ATCACGGT 180 - AAATTGGCAA CCGATTGATT GGGATTTGAA AGAAGCTGAA TTGACAAAAG GA - #ACGATTGA 240 - TCTGATTTGG AATGGCTATT CCGCTACAGA CGAACGCCGT GAAAAGGTGG CT - #TTCAGTAA 300 - CTCATATATG AAGAATGAGC AGGTATTGGT TACGAAGAAA TCATCTGGTA TC - #ACGACTGC 360 - AAAGGATATG ACTGGAAAGA CATTAGGAGC TCAAGCTGGT TCATCTGGTT AT - #GCGGACTT 420 - TGAAGCAAAT CCAGAAATTT TGAAGAATAT TGTCGCTAAT AAGGAAGCGA AT - #CAATACCA 480 - AACCTTTAAT GAAGCCTTGA TTGATTTGAA AAACGATCGA ATTGATGGTC TA - #TTGATTGA 540 - CCGTGTCTAT GCAAACTATT ATTTAGAAGC AGAAGGTGTT TTAAACGATT AT - #AATGTCTT 600 - TACAGTTGGA CTAGAAACAG AAGCTTTTGC GGTTGGAGCC CGTAAGGAAG AT - #ACAAACTT 660 - GGTTAAGAAG ATAAATGAAG CTTTTTCTAG TCTTTACAAG GACGGCAAGT TC - #CAAGAAAT 720 - CAGCCAAAAA TGGTTTGGAG AAGATGTAGC AACCAAAGAA GTAAAAGAAG GA - #CAG 775 - (2) INFORMATION FOR SEQ ID NO:18: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 258 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: - Gly Lys Asn Ser Ser Glu Thr Ser Gly Asp As - #n Trp Ser Lys Tyr Gln # 15 - Ser Asn Lys Ser Ile Thr Ile Gly Phe Asp Se - #r Thr Phe Val Pro Met # 30 - Gly Phe Ala Gln Lys Asp Gly Ser Tyr Ala Gl - #y Phe Asp Ile Asp Leu # 45 - Ala Thr Ala Val Phe Glu Lys Tyr Gly Ile Th - #r Val Asn Trp Gln Pro # 60 - Ile Asp Trp Asp Leu Lys Glu 
Ala Glu Leu Th - #r Lys Gly Thr Ile Asp #80 - Leu Ile Trp Asn Gly Tyr Ser Ala Thr Asp Gl - #u Arg Arg Glu Lys Val # 95 - Ala Phe Ser Asn Ser Tyr Met Lys Asn Glu Gl - #n Val Leu Val Thr Lys # 110 - Lys Ser Ser Gly Ile Thr Thr Ala Lys Asp Me - #t Thr Gly Lys Thr Leu # 125 - Gly Ala Gln Ala Gly Ser Ser Gly Tyr Ala As - #p Phe Glu Ala Asn Pro # 140 - Glu Ile Leu Lys Asn Ile Val Ala Asn Lys Gl - #u Ala Asn Gln Tyr Gln 145 1 - #50 1 - #55 1 - #60 - Thr Phe Asn Glu Ala Leu Ile Asp Leu Lys As - #n Asp Arg Ile Asp Gly # 175 - Leu Leu Ile Asp Arg Val Tyr Ala Asn Tyr Ty - #r Leu Glu Ala Glu Gly # 190 - Val Leu Asn Asp Tyr Asn Val Phe Thr Val Gl - #y Leu Glu Thr Glu Ala # 205 - Phe Ala Val Gly Ala Arg Lys Glu Asp Thr As - #n Leu Val Lys Lys Ile # 220 - Asn Glu Ala Phe Ser Ser Leu Tyr Lys Asp Gl - #y Lys Phe Gln Glu Ile 225 2 - #30 2 - #35 2 - #40 - Ser Gln Lys Trp Phe Gly Glu Asp Val Ala Th - #r Lys Glu Val Lys Glu # 255 - Gly Gln - (2) INFORMATION FOR SEQ ID NO: 19: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 868 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #19: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGCTAGCGGA AAAAAAGATA CAACTTCTGG TCAAAAACTA AAAGTTGTTG CT - #ACAAACTC 60 - AATCATCGCT GATATTACTA AAAATATTGC TGGTGACAAA ATTGACCTTC AT - #AGTATCGT 120 - TCCGATTGGG CAAGACCCAC ACGAATACGA ACCACTTCCT GAAGACGTTA AG - #AAAACTTC 180 - TGAGGCTAAT TTGATTTTCT ATAACGGTAT CAACCTTGAA ACAGGTGGCA AT - #GCTTGGTT 240 - TACAAAATTG GTAGAAAATG CCAAGAAAAC TGAAAACAAA GACTACTTCG CA - #GTCAGCGA 300 - CGGCGTTGAT GTTATCTACC TTGAAGGTCA AAATGAAAAA GGAAAAGAAG AC - #CCACACGC 360 - TTGGCTTAAC CTTGAAAACG GTATTATTTT TGCTAAAAAT ATCGCCAAAC AA - #TTGAGCGC 420 - CAAAGACCCT AACAATAAAG AATTCTATGA AAAAAATCTC AAAGAATATA CT - #GATAAGTT 480 - AGACAAACTT GATAAAGAAA GTAAGGATAA ATTTAATAAG ATCCCTGCTG AA - #AAGAAACT 540 - CATTGTAACC AGCGAAGGAG CATTCAAATA CTTCTCTAAA GCCTATGGTG TC - #CCAAGTGC 600 - TTACATCTGG GAAATCAATA CTGAAGAAGA AGGAACTCCT GAACAAATCA AG - #ACCTTGGT 660 - TGAAAAACTT 
CGCCAAACAA AAGTTCCATC ACTCTTTGTA GAATCAAGTG TG - #GATGACCG 720 - TCCAATGAAA ACTGTTTCTC AAGACACAAA CATCCCAATC TACGCTCAAA TC - #TTTACTGA 780 - CTCTATCGCA GAACAAGGTA AAGAAGGCGA CAGCTACTAC AGCATGATGA AA - #TACAACCT 840 # 868 GGAT TGGCAAAA - (2) INFORMATION FOR SEQ ID NO:20: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 289 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: - Ala Ser Gly Lys Lys Asp Thr Thr Ser Gly Gl - #n Lys Leu Lys Val Val # 15 - Ala Thr Asn Ser Ile Ile Ala Asp Ile Thr Ly - #s Asn Ile Ala Gly Asp # 30 - Lys Ile Asp Leu His Ser Ile Val Pro Ile Gl - #y Gln Asp Pro His Glu # 45 - Tyr Glu Pro Leu Pro Glu Asp Val Lys Lys Th - #r Ser Glu Ala Asn Leu # 60 - Ile Phe Tyr Asn Gly Ile Asn Leu Glu Thr Gl - #y Gly Asn Ala Trp Phe #80 - Thr Lys Leu Val Glu Asn Ala Lys Lys Thr Gl - #u Asn Lys Asp Tyr Phe # 95 - Ala Val Ser Asp Gly Val Asp Val Ile Tyr Le - #u Glu Gly Gln Asn Glu # 110 - Lys Gly Lys Glu Asp Pro His Ala Trp Leu As - #n Leu Glu Asn Gly Ile # 125 - Ile Phe Ala Lys Asn Ile Ala Lys Gln Leu Se - #r Ala Lys Asp Pro Asn # 140 - Asn Lys Glu Phe Tyr Glu Lys Asn Leu Lys Gl - #u Tyr Thr Asp Lys Leu 145 1 - #50 1 - #55 1 - #60 - Asp Lys Leu Asp Lys Glu Ser Lys Asp Lys Ph - #e Asn Lys Ile Pro Ala # 175 - Glu Lys Lys Leu Ile Val Thr Ser Glu Gly Al - #a Phe Lys Tyr Phe Ser # 190 - Lys Ala Tyr Gly Val Pro Ser Ala Tyr Ile Tr - #p Glu Ile Asn Thr Glu # 205 - Glu Glu Gly Thr Pro Glu Gln Ile Lys Thr Le - #u Val Glu Lys Leu Arg # 220 - Gln Thr Lys Val Pro Ser Leu Phe Val Glu Se - #r Ser Val Asp Asp Arg 225 2 - #30 2 - #35 2 - #40 - Pro Met Lys Thr Val Ser Gln Asp Thr Asn Il - #e Pro Ile Tyr Ala Gln # 255 - Ile Phe Thr Asp Ser Ile Ala Glu Gln Gly Ly - #s Glu Gly Asp Ser Tyr # 270 - Tyr Ser Met Met Lys Tyr Asn Leu Asp Lys Il - #e Ala Glu Gly Leu Ala # 285 - Lys - (2) INFORMATION FOR SEQ ID NO: 21: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1546 base (B) TYPE: nucleic 
acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #21: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGGCTCAAAA AATACAGCTT CAAGTCCAGA TTATAAGTTG GAAGGTGTAA CA - #TTCCCGCT 60 - TCAAGAAAAG AAAACATTGA AGTTTATGAC AGCCAGTTCA CCGTTATCTC CT - #AAAGACCC 120 - AAATGAAAAG TTAATTTTGC AACGTTTGGA GAAGGAAACT GGCGTTCATA TT - #GACTGGAC 180 - CAACTACCAA TCCGACTTTG CAGAAAAACG TAACTTGGAT ATTTCTAGTG GT - #GATTTACC 240 - AGATGCTATC CACAACGACG GAGCTTCAGA TGTGGACTTG ATGAACTGGG CT - #AAAAAAGG 300 - TGTTATTATT CCAGTTGAAG ATTTGATTGA TAAATACATG CCAAATCTTA AG - #AAAATTTT 360 - GGATGAGAAA CCAGAGTACA AGGCCTTGAT GACAGCACCT GATGGGCACA TT - #TACTCATT 420 - TCCATGGATT GAAGAGCTTG GAGATGGTAA AGAGTCTATT CACAGTGTCA AC - #GATATGGC 480 - TTGGATTAAC AAAGATTGGC TTAAGAAACT TGGTCTTGAA ATGCCAAAAA CT - #ACTGATGA 540 - TTTGATTAAA GTCCTAGAAG CTTTCAAAAA CGGGGATCCA AATGGAAATG GA - #GAGGCTGA 600 - TGAAATTCCA TTTTCATTTA TTAGTGGTAA CGGAAACGAA GATTTTAAAT TC - #CTATTTGC 660 - TGCATTTGGT ATAGGGGATA ACGATGATCA TTTAGTAGTA GGAAATGATG GC - #AAAGTTGA 720 - CTTCACAGCA GATAACGATA ACTATAAAGA AGGTGTCAAA TTTATCCGTC AA - #TTGCAAGA 780 - AAAAGGCCTG ATTGATAAAG AAGCTTTCGA ACATGATTGG AATAGTTACA TT - #GCTAAAGG

840 - TCATGATCAG AAATTTGGTG TTTACTTTAC ATGGGATAAG AATAATGTTA CT - #GGAAGTAA 900 - CGAAAGTTAT GATGTTTTAC CAGTACTTGC TGGACCAAGT GGTCAAAAAC AC - #GTAGCTCG 960 - TACAAACGGT ATGGGATTTG CACGTGACAA GATGGTTATT ACCAGTGTAA AC - #AAAAACCT 1020 - AGAATTGACA GCTAAATGGA TTGATGCACA ATACGCTCCA CTCCAATCTG TG - #CAAAATAA 1080 - CTGGGGAACT TACGGAGATG ACAAACAACA AAACATCTTT GAATTGGATC AA - #GCGTCAAA 1140 - TAGTCTAAAA CACTTACCAC TAAACGGAAC TGCACCAGCA GAACTTCGTC AA - #AAGACTGA 1200 - AGTAGGAGGA CCACTAGCTA TCCTAGATTC ATACTATGGT AAAGTAACAA CC - #ATGCCTGA 1260 - TGATGCCAAA TGGCGTTTGG ATCTTATCAA AGAATATTAT GTTCCTTACA TG - #AGCAATGT 1320 - CAATAACTAT CCAAGAGTCT TTATGACACA GGAAGATTTG GACAAGATTG CC - #CATATCGA 1380 - AGCAGATATG AATGACTATA TCTACCGTAA ACGTGCTGAA TGGATTGTAA AT - #GGCAATAT 1440 - TGATACTGAG TGGGATGATT ACAAGAAAGA ACTTGAAAAA TACGGACTTT CT - #GATTACCT 1500 # 1546ACT ACGACCAATA CCAAGCAAAC AAAAAC - (2) INFORMATION FOR SEQ ID NO:22: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 515 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: - Gly Ser Lys Asn Thr Ala Ser Ser Pro Asp Ty - #r Lys Leu Glu Gly Val # 15 - Thr Phe Pro Leu Gln Glu Lys Lys Thr Leu Ly - #s Phe Met Thr Ala Ser # 30 - Ser Pro Leu Ser Pro Lys Asp Pro Asn Glu Ly - #s Leu Ile Leu Gln Arg # 45 - Leu Glu Lys Glu Thr Gly Val His Ile Asp Tr - #p Thr Asn Tyr Gln Ser # 60 - Asp Phe Ala Glu Lys Arg Asn Leu Asp Ile Se - #r Ser Gly Asp Leu Pro #80 - Asp Ala Ile His Asn Asp Gly Ala Ser Asp Va - #l Asp Leu Met Asn Trp # 95 - Ala Lys Lys Gly Val Ile Ile Pro Val Glu As - #p Leu Ile Asp Lys Tyr # 110 - Met Pro Asn Leu Lys Lys Ile Leu Asp Glu Ly - #s Pro Glu Tyr Lys Ala # 125 - Leu Met Thr Ala Pro Asp Gly His Ile Tyr Se - #r Phe Pro Trp Ile Glu # 140 - Glu Leu Gly Asp Gly Lys Glu Ser Ile His Se - #r Val Asn Asp Met Ala 145 1 - #50 1 - #55 1 - #60 - Trp Ile Asn Lys Asp Trp Leu Lys Lys Leu Gl - #y Leu Glu Met Pro Lys # 175 - Thr Thr Asp Asp Leu 
Ile Lys Val Leu Glu Al - #a Phe Lys Asn Gly Asp # 190 - Pro Asn Gly Asn Gly Glu Ala Asp Glu Ile Pr - #o Phe Ser Phe Ile Ser # 205 - Gly Asn Gly Asn Glu Asp Phe Lys Phe Leu Ph - #e Ala Ala Phe Gly Ile # 220 - Gly Asp Asn Asp Asp His Leu Val Val Gly As - #n Asp Gly Lys Val Asp 225 2 - #30 2 - #35 2 - #40 - Phe Thr Ala Asp Asn Asp Asn Tyr Lys Glu Gl - #y Val Lys Phe Ile Arg # 255 - Gln Leu Gln Glu Lys Gly Leu Ile Asp Lys Gl - #u Ala Phe Glu His Asp # 270 - Trp Asn Ser Tyr Ile Ala Lys Gly His Asp Gl - #n Lys Phe Gly Val Tyr # 285 - Phe Thr Trp Asp Lys Asn Asn Val Thr Gly Se - #r Asn Glu Ser Tyr Asp # 300 - Val Leu Pro Val Leu Ala Gly Pro Ser Gly Gl - #n Lys His Val Ala Arg 305 3 - #10 3 - #15 3 - #20 - Thr Asn Gly Met Gly Phe Ala Arg Asp Lys Me - #t Val Ile Thr Ser Val # 335 - Asn Lys Asn Leu Glu Leu Thr Ala Lys Trp Il - #e Asp Ala Gln Tyr Ala # 350 - Pro Leu Gln Ser Val Gln Asn Asn Trp Gly Th - #r Tyr Gly Asp Asp Lys # 365 - Gln Gln Asn Ile Phe Glu Leu Asp Gln Ala Se - #r Asn Ser Leu Lys His # 380 - Leu Pro Leu Asn Gly Thr Ala Pro Ala Glu Le - #u Arg Gln Lys Thr Glu 385 3 - #90 3 - #95 4 - #00 - Val Gly Gly Pro Leu Ala Ile Leu Asp Ser Ty - #r Tyr Gly Lys Val Thr # 415 - Thr Met Pro Asp Asp Ala Lys Trp Arg Leu As - #p Leu Ile Lys Glu Tyr # 430 - Tyr Val Pro Tyr Met Ser Asn Val Asn Asn Ty - #r Pro Arg Val Phe Met # 445 - Thr Gln Glu Asp Leu Asp Lys Ile Ala His Il - #e Glu Ala Asp Met Asn # 460 - Asp Tyr Ile Tyr Arg Lys Arg Ala Glu Trp Il - #e Val Asn Gly Asn Ile 465 4 - #70 4 - #75 4 - #80 - Asp Thr Glu Trp Asp Asp Tyr Lys Lys Glu Le - #u Glu Lys Tyr Gly Leu # 495 - Ser Asp Tyr Leu Ala Ile Lys Gln Lys Tyr Ty - #r Asp Gln Tyr Gln Ala # 510 - Asn Lys Asn 515 - (2) INFORMATION FOR SEQ ID NO: 23: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 895 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #23: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TAGTACAAAC TCAAGCACTA GTCAGACAGA GACCAGTAGC TCTGCTCCAA CA - #GAGGTAAC 60 - CATTAAAAGT TCACTGGACG AGGTCAAACT 
TTCCAAAGTT CCTGAAAAGA TT - #GTGACCTT 120 - TGACCTCGGC GCTGCGGATA CTATTCGCGC TTTAGGATTT GAAAAAAATA TC - #GTCGGAAT 180 - GCCTACAAAA ACTGTTCCGA CTTATCTAAA AGACCTAGTG GGAACTGTCA AA - #AATGTTGG 240 - TTCTATGAAA GAACCTGATT TAGAAGCTAT CGCCGCCCTT GAGCCTGATT TG - #ATTATCGC 300 - TTCGCCACGT ACACAAAAAT TCGTAGACAA ATTCAAAGAA ATCGCCCCAA CC - #GTTCTCTT 360 - CCAAGCAAGC AAGGACGACT ACTGGACTTC TACCAAGGCT AATATCGAAT CC - #TTAGCAAG 420 - TGCCTTCGGC GAAACTGGTA CACAGAAAGC CAAGGAAGAA TTGACCAAGC TA - #GACAAGAG 480 - CATCCAAGAA GTCGCTACTA AAAATGAAAG CTCTGACAAA AAAGCCCTTG CG - #ATCCTCCT 540 - TAATGAAGGA AAAATGGCAG CCTTTGGTGC CAAATCTCGT TTCTCTTTCT TG - #TACCAAAC 600 - CTTGAAATTC AAACCAACTG ATACAAAATT TGAAGACTCA CGCCACGGAC AA - #GAAGTCAG 660 - CTTTGAAAGT GTCAAAGAAA TCAACCCTGA CATCCTCTTT GTCATCAACC GT - #ACCCTTGC 720 - CATCGGTGGG GACAACTCTA GCAACGACGG TGTCCTAGAA AATGCCCTTA TC - #GCTGAAAC 780 - ACCTGCTGCT AAAAATGGTA AGATTATCCA ACTAACACCA GACCTCTGGT AT - #CTAAGCGG 840 - AGGCGGACTT GAATCAACAA AACTCATGAT TGAAGACATA CAAAAAGCTT TG - #AAA 895 - (2) INFORMATION FOR SEQ ID NO:24: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 298 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: - Ser Thr Asn Ser Ser Thr Ser Gln Thr Glu Th - #r Ser Ser Ser Ala Pro # 15 - Thr Glu Val Thr Ile Lys Ser Ser Leu Asp Gl - #u Val Lys Leu Ser Lys # 30 - Val Pro Glu Lys Ile Val Thr Phe Asp Leu Gl - #y Ala Ala Asp Thr Ile # 45 - Arg Ala Leu Gly Phe Glu Lys Asn Ile Val Gl - #y Met Pro Thr Lys Thr # 60 - Val Pro Thr Tyr Leu Lys Asp Leu Val Gly Th - #r Val Lys Asn Val Gly #80 - Ser Met Lys Glu Pro Asp Leu Glu Ala Ile Al - #a Ala Leu Glu Pro Asp # 95 - Leu Ile Ile Ala Ser Pro Arg Thr Gln Lys Ph - #e Val Asp Lys Phe Lys # 110 - Glu Ile Ala Pro Thr Val Leu Phe Gln Ala Se - #r Lys Asp Asp Tyr Trp # 125 - Thr Ser Thr Lys Ala Asn Ile Glu Ser Leu Al - #a Ser Ala Phe Gly Glu # 140 - Thr Gly Thr Gln Lys Ala Lys Glu Glu Leu Th - #r Lys Leu Asp 
Lys Ser 145 1 - #50 1 - #55 1 - #60 - Ile Gln Glu Val Ala Thr Lys Asn Glu Ser Se - #r Asp Lys Lys Ala Leu # 175 - Ala Ile Leu Leu Asn Glu Gly Lys Met Ala Al - #a Phe Gly Ala Lys Ser # 190 - Arg Phe Ser Phe Leu Tyr Gln Thr Leu Lys Ph - #e Lys Pro Thr Asp Thr # 205 - Lys Phe Glu Asp Ser Arg His Gly Gln Glu Va - #l Ser Phe Glu Ser Val # 220 - Lys Glu Ile Asn Pro Asp Ile Leu Phe Val Il - #e Asn Arg Thr Leu Ala 225 2 - #30 2 - #35 2 - #40 - Ile Gly Gly Asp Asn Ser Ser Asn Asp Gly Va - #l Leu Glu Asn Ala Leu # 255 - Ile Ala Glu Thr Pro Ala Ala Lys Asn Gly Ly - #s Ile Ile Gln Leu Thr # 270 - Pro Asp Leu Trp Tyr Leu Ser Gly Gly Gly Le - #u Glu Ser Thr Lys Leu # 285 - Met Ile Glu Asp Ile Gln Lys Ala Leu Lys # 295 - (2) INFORMATION FOR SEQ ID NO: 25: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1261 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #25: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGGCAATTCT GGCGGAAGTA AAGATGCTGC CAAATCAGGT GGTGACGGTG CC - #AAAACAGA 60 - AATCACTTGG TGGGCATTCC CAGTATTTAC CCAAGAAAAA ACTGGTGACG GT - #GTTGGAAC 120 - TTATGAAAAA TCAATCATCG AAGCGTTTGA AAAAGCAAAC CCAGATATAA AA - #GTGAAATT 180 - GGAAACCATC GACTTCAAGT CAGGTCCTGA AAAAATCACA ACAGCCATCG AA - #GCAGGAAC 240 - AGCTCCAGAC GTACTCTTTG ATGCACCAGG ACGTATCATC CAATACGGTA AA - #AACGGTAA 300 - ATTGGCTGAG TTGAATGACC TCTTCACAGA TGAATTTGTT AAAGATGTCA AC - #AATGAAAA 360 - CATCGTACAA GCAAGTAAAG CTGGAGACAA GGCTTATATG TATCCGATTA GT - #TCTGCCCC 420 - ATTCTACATG GCAATGAACA AGAAAATGTT AGAAGATGCT GGAGTAGCAA AC - #CTTGTAAA 480 - AGAAGGTTGG ACAACTGATG ATTTTGAAAA AGTATTGAAA GCACTTAAAG AC - #AAGGGTTA 540 - CACACCAGGT TCATTGTTCA GTTCTGGTCA AGGGGGAGAC CAAGGAACAC GT - #GCCTTTAT 600 - CTCTAACCTT TATAGCGGTT CTGTAACAGA TGAAAAAGTT AGCAAATATA CA - #ACTGATGA 660 - TCCTAAATTC GTCAAAGGTC TTGAAAAAGC AACTAGCTGG ATTAAAGACA AT - #TTGATCAA 720 - TAATGGTTCA CAATTTGACG GTGGGGCAGA TATCCAAAAC TTTGCCAACG GT - #CAAACATC 780 - TTACACAATC CTTTGGGCAC CAGCTCAAAA TGGTATCCAA GCTAAACTTT TA - #GAAGCAAG 840 - TAAGGTAGAA 
GTGGTAGAAG TACCATTCCC ATCAGACGAA GGTAAGCCAG CT - #CTTGAGTA 900 - CCTTGTAAAC GGGTTTGCAG TATTCAACAA TAAAGACGAC AAGAAAGTCG CT - #GCATCTAA 960 - GAAATTCATC CAGTTTATCG CAGATGACAA GGAGTGGGGA CCTAAAGACG TA - #GTTCGTAC 1020 - AGGTGCTTTC CCAGTCCGTA CTTCATTTGG AAAACTTTAT GAAGACAAAC GC - #ATGGAAAC 1080 - AATCAGCGGC TGGACTCAAT ACTACTCACC ATACTACAAC ACTATTGATG GA - #TTTGCTGA 1140 - AATGAGAACA CTTTGGTTCC CAATGTTGCA ATCTGTATCA AATGGTGACG AA - #AAACCAGC 1200 - AGATGCTTTG AAAGCCTTCA CTGAAAAAGC GAACGAAACA ATCAAAAAAG CT - #ATGAAACA 1260 # 1261 - (2) INFORMATION FOR SEQ ID NO:26: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 420 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: - Gly Asn Ser Gly Gly Ser Lys Asp Ala Ala Ly - #s Ser Gly Gly Asp Gly # 15 - Ala Lys Thr Glu Ile Thr Trp Trp Ala Phe Pr - #o Val Phe Thr Gln Glu # 30 - Lys Thr Gly Asp Gly Val Gly Thr Tyr Glu Ly - #s Ser Ile Ile Glu Ala # 45 - Phe Glu Lys Ala Asn Pro Asp Ile Lys Val Ly - #s Leu Glu Thr Ile Asp # 60 - Phe Lys Ser Gly Pro Glu Lys Ile Thr Thr Al - #a Ile Glu Ala Gly Thr

#80 - Ala Pro Asp Val Leu Phe Asp Ala Pro Gly Ar - #g Ile Ile Gln Tyr Gly # 95 - Lys Asn Gly Lys Leu Ala Glu Leu Asn Asp Le - #u Phe Thr Asp Glu Phe # 110 - Val Lys Asp Val Asn Asn Glu Asn Ile Val Gl - #n Ala Ser Lys Ala Gly # 125 - Asp Lys Ala Tyr Met Tyr Pro Ile Ser Ser Al - #a Pro Phe Tyr Met Ala # 140 - Met Asn Lys Lys Met Leu Glu Asp Ala Gly Va - #l Ala Asn Leu Val Lys 145 1 - #50 1 - #55 1 - #60 - Glu Gly Trp Thr Thr Asp Asp Phe Glu Lys Va - #l Leu Lys Ala Leu Lys # 175 - Asp Lys Gly Tyr Thr Pro Gly Ser Leu Phe Se - #r Ser Gly Gln Gly Gly # 190 - Asp Gln Gly Thr Arg Ala Phe Ile Ser Asn Le - #u Tyr Ser Gly Ser Val # 205 - Thr Asp Glu Lys Val Ser Lys Tyr Thr Thr As - #p Asp Pro Lys Phe Val # 220 - Lys Gly Leu Glu Lys Ala Thr Ser Trp Ile Ly - #s Asp Asn Leu Ile Asn 225 2 - #30 2 - #35 2 - #40 - Asn Gly Ser Gln Phe Asp Gly Gly Ala Asp Il - #e Gln Asn Phe Ala Asn # 255 - Gly Gln Thr Ser Tyr Thr Ile Leu Trp Ala Pr - #o Ala Gln Asn Gly Ile # 270 - Gln Ala Lys Leu Leu Glu Ala Ser Lys Val Gl - #u Val Val Glu Val Pro # 285 - Phe Pro Ser Asp Glu Gly Lys Pro Ala Leu Gl - #u Tyr Leu Val Asn Gly # 300 - Phe Ala Val Phe Asn Asn Lys Asp Asp Lys Ly - #s Val Ala Ala Ser Lys 305 3 - #10 3 - #15 3 - #20 - Lys Phe Ile Gln Phe Ile Ala Asp Asp Lys Gl - #u Trp Gly Pro Lys Asp # 335 - Val Val Arg Thr Gly Ala Phe Pro Val Arg Th - #r Ser Phe Gly Lys Leu # 350 - Tyr Glu Asp Lys Arg Met Glu Thr Ile Ser Gl - #y Trp Thr Gln Tyr Tyr # 365 - Ser Pro Tyr Tyr Asn Thr Ile Asp Gly Phe Al - #a Glu Met Arg Thr Leu # 380 - Trp Phe Pro Met Leu Gln Ser Val Ser Asn Gl - #y Asp Glu Lys Pro Ala 385 3 - #90 3 - #95 4 - #00 - Asp Ala Leu Lys Ala Phe Thr Glu Lys Ala As - #n Glu Thr Ile Lys Lys # 415 - Ala Met Lys Gln 420 - (2) INFORMATION FOR SEQ ID NO: 27: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 658 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #27: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TTCACAAGAA AAAACAAAAA ATGAAGATGG AGAAACTAAG ACAGAACAGA CA - #GCCAAAGC 60 - 
TGATGGAACA GTCGGTAGTA AGTCTCAAGG AGCTGCCCAG AAGAAAGCAG AA - #GTGGTCAA 120 - TAAAGGTGAT TACTACAGCA TTCAAGGGAA ATACGATGAA ATCATCGTAG CC - #AACAAACA 180 - CTATCCATTG TCTAAAGACT ATAATCCAGG GGAAAATCCA ACAGCCAAGG CA - #GAGTTGGT 240 - CAAACTCATC AAAGCGATGC AAGAGGCAGG TTTCCCTATT AGTGATCATT AC - #AGTGGTTT 300 - TAGAAGTTAT GAAACTCAGA CCAAGCTCTA TCAAGATTAT GTCAACCAAG AT - #GGAAAGGC 360 - AGCAGCTGAC CGTTACTCTG CCCGTCCTGG CTATAGCGAA CACCAGACAG GC - #TTGGCCTT 420 - TGATGTGATT GGGACTGATG GTGATTTGGT GACAGAAGAA AAAGCAGCCC AA - #TGGCTCTT 480 - GGATCATGCA GCTGATTATG GCTTTGTTGT CCGTTATCTC AAAGGCAAGG AA - #AAGGAAAC 540 - AGGCTATATG GCTGAAGAAT GGCACCTGCG TTATGTAGGA AAAGAAGCTA AA - #GAAATTGC 600 - TGCAAGTGGT CTCAGTTTGG AAGAATACTA TGGCTTTGAA GGCGGAGACT AC - #GTCGAT 658 - (2) INFORMATION FOR SEQ ID NO:28: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 219 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: - Ser Gln Glu Lys Thr Lys Asn Glu Asp Gly Gl - #u Thr Lys Thr Glu Gln # 15 - Thr Ala Lys Ala Asp Gly Thr Val Gly Ser Ly - #s Ser Gln Gly Ala Ala # 30 - Gln Lys Lys Ala Glu Val Val Asn Lys Gly As - #p Tyr Tyr Ser Ile Gln # 45 - Gly Lys Tyr Asp Glu Ile Ile Val Ala Asn Ly - #s His Tyr Pro Leu Ser # 60 - Lys Asp Tyr Asn Pro Gly Glu Asn Pro Thr Al - #a Lys Ala Glu Leu Val #80 - Lys Leu Ile Lys Ala Met Gln Glu Ala Gly Ph - #e Pro Ile Ser Asp His # 95 - Tyr Ser Gly Phe Arg Ser Tyr Glu Thr Gln Th - #r Lys Leu Tyr Gln Asp # 110 - Tyr Val Asn Gln Asp Gly Lys Ala Ala Ala As - #p Arg Tyr Ser Ala Arg # 125 - Pro Gly Tyr Ser Glu His Gln Thr Gly Leu Al - #a Phe Asp Val Ile Gly # 140 - Thr Asp Gly Asp Leu Val Thr Glu Glu Lys Al - #a Ala Gln Trp Leu Leu 145 1 - #50 1 - #55 1 - #60 - Asp His Ala Ala Asp Tyr Gly Phe Val Val Ar - #g Tyr Leu Lys Gly Lys # 175 - Glu Lys Glu Thr Gly Tyr Met Ala Glu Glu Tr - #p His Leu Arg Tyr Val # 190 - Gly Lys Glu Ala Lys Glu Ile Ala Ala Ser Gl - #y Leu Ser Leu Glu Glu # 205 - 
Tyr Tyr Gly Phe Glu Gly Gly Asp Tyr Val As - #p # 215 - (2) INFORMATION FOR SEQ ID NO: 29: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 790 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #29: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GAAAGGTCTG TGGTCAAATA ATCTTACCTG CGGTTATGAT GAAAAAATAA TC - #TTGGAAAA 60 - TATAAATATA AAAATACCTG AAGAAAAAAT ATCAGTTATT ATTGGGTCAA AT - #GGTTGTGG 120 - GAAATCAACA CTCATTAAAA CCTTGTCTCG ACTTATAAAG CCATTAGAGG GA - #GAAGTATT 180 - GCTTGATAAT AAATCAATTA ATTCTTATAA AGAAAAAGAT TTAGCAAAAC AC - #ATAGCTAT 240 - ATTACCTCAA TCTCCAATAA TCCCTGAATC AATAACAGTA GCTGATCTTG TA - #AGCCGTGG 300 - TCGTTTCCCC TACAGAAAGC CTTTTAAGAG TCTTGGAAAA GATGACCTTG AA - #ATAATAAA 360 - CAGATCAATG GTTAAGGCCA ATGTTGAAGA TCTAGCAAAT AACCTAGTTG AA - #GAACTTTC 420 - TGGGGGTCAA AGGCAAAGAG TATGGATAGC TCTAGCCCTA GCCCAAGATA CA - #AGTATCCT 480 - ACTTTTAGAT GAGCCAACTA CTTACTTGGA TATCTCATAT CAAATAGAAC TA - #TTAGACCT 540 - CTTGACTGAT CTAAACCAAA AATATAAGAC AACCATTTGC ATGATTTTGC AC - #GATATAAA 600 - TCTAACAGCA AGATACGCTG ATTACCTATT TGCAATTAAA GAAGGTAAAC TT - #GTTGCAGA 660 - GGGAAAGCCT GAAGATATAC TAAATGATAA ACTAGTTAAA GATATCTTTA AT - #CTTGAAGC 720 - AAAAATTATA CGTGACCCTA TTTCCAATTC GCCTCTAATG ATTCCTATTG GC - #AAGCACCA 780 # 790 - (2) INFORMATION FOR SEQ ID NO:30: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 262 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: - Lys Gly Leu Trp Ser Asn Asn Leu Thr Cys Gl - #y Tyr Asp Glu Lys Ile # 15 - Ile Leu Glu Asn Ile Asn Ile Lys Ile Pro Gl - #u Glu Lys Ile Ser Val # 30 - Ile Ile Gly Ser Asn Gly Cys Gly Lys Ser Th - #r Leu Ile Lys Thr Leu # 45 - Ser Arg Leu Ile Lys Pro Leu Glu Gly Glu Va - #l Leu Leu Asp Asn Lys # 60 - Ser Ile Asn Ser Tyr Lys Glu Lys Asp Leu Al - #a Lys His Ile Ala Ile #80 - Leu Pro Gln Ser Pro Ile Ile Pro Glu Ser Il - #e Thr Val Ala Asp Leu # 95 - Val Ser Arg Gly Arg Phe Pro Tyr Arg Lys Pr - #o Phe Lys 
Ser Leu Gly # 110 - Lys Asp Asp Leu Glu Ile Ile Asn Arg Ser Me - #t Val Lys Ala Asn Val # 125 - Glu Asp Leu Ala Asn Asn Leu Val Glu Glu Le - #u Ser Gly Gly Gln Arg # 140 - Gln Arg Val Trp Ile Ala Leu Ala Leu Ala Gl - #n Asp Thr Ser Ile Leu 145 1 - #50 1 - #55 1 - #60 - Leu Leu Asp Glu Pro Thr Thr Tyr Leu Asp Il - #e Ser Tyr Gln Ile Glu # 175 - Leu Leu Asp Leu Leu Thr Asp Leu Asn Gln Ly - #s Tyr Lys Thr Thr Ile # 190 - Cys Met Ile Leu His Asp Ile Asn Leu Thr Al - #a Arg Tyr Ala Asp Tyr # 205 - Leu Phe Ala Ile Lys Glu Gly Lys Leu Val Al - #a Glu Gly Lys Pro Glu # 220 - Asp Ile Leu Asn Asp Lys Leu Val Lys Asp Il - #e Phe Asn Leu Glu Ala 225 2 - #30 2 - #35 2 - #40 - Lys Ile Ile Arg Asp Pro Ile Ser Asn Ser Pr - #o Leu Met Ile Pro Ile # 255 - Gly Lys His His Val Ser 260 - (2) INFORMATION FOR SEQ ID NO: 31: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 781 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #31: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - AAACTCAGAA AAGAAAGCAG ACAATGCAAC AACTATCAAA ATCGCAACTG TT - #AACCGTAG 60 - CGGTTCTGAA GAAAAACGTT GGGACAAAAT CCAAGAATTG GTTAAAAAAG AC - #GGAATTAC 120 - CTTGGAATTT ACAGAGTTCA CAGACTACTC ACAACCAAAC AAAGCAACTG CT - #GATGGCGA 180 - AGTAGATTTG AACGCTTTCC AACACTATAA CTTCTTGAAC AACTGGAACA AA - #GAAAACGG 240 - AAAAGACCTT GTAGCGATTG CAGATACTTA CATCTCTCCA ATCCGCCTTT AC - #TCAGGTTT 300 - GAATGGAAGT GCCAACAAGT ACACTAAAGT AGAAGACATC CCAGCAAACG GA - #GAAATCGC 360 - TGTACCGAAT GACGCTACAA ACGAAAGCCG TGCGCTTTAT TTGCTTCAAT CA - #GCTGGCTT 420 - GATTAAATTG GATGTTTCTG GAACTGCTCT TGCAACAGTT GCCAACATCA AA - #GAAAATCC 480 - AAAGAACTTG AAAATCACTG AATTGGACGC TAGCCAAACA GCTCGTTCAT TG - #TCATCAGT 540 - TGACGCTGCC GTTGTAAACA ATACCTTCGT TACAGAAGCA AAATTGGACT AC - #AAGAAATC 600 - ACTTTTCAAA GAACAAGCTG ATGAAAACTC AAAACAATGG TACAACATCA TT - #GTTGCAAA 660 - AAAAGATTGG GAAACATCAC CTAAGGCTGA TGCTATCAAG AAAGTAATCG CA - #GCTTACCA 720 - CACAGATGAC GTGAAAAAAG TTATCGAAGA ATCATCAGAT GGTTTGGATC AA - #CCAGTTTG 780 # 781 - (2) INFORMATION FOR 
SEQ ID NO:32: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 260 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: - Asn Ser Glu Lys Lys Ala Asp Asn Ala Thr Th - #r Ile Lys Ile Ala Thr # 15 - Val Asn Arg Ser Gly Ser Glu Glu Lys Arg Tr - #p Asp Lys Ile Gln Glu # 30 - Leu Val Lys Lys Asp Gly Ile Thr Leu Glu Ph - #e Thr Glu Phe Thr Asp # 45 - Tyr Ser Gln Pro Asn Lys Ala Thr Ala Asp Gl - #y Glu Val Asp Leu Asn # 60 - Ala Phe Gln His Tyr Asn Phe Leu Asn Asn Tr - #p Asn Lys Glu Asn Gly #80 - Lys Asp Leu Val Ala Ile Ala Asp Thr Tyr Il - #e Ser Pro Ile Arg Leu # 95 - Tyr Ser Gly Leu Asn Gly Ser Ala Asn Lys Ty - #r Thr Lys Val Glu Asp # 110 - Ile Pro Ala Asn Gly Glu Ile Ala Val Pro As - #n Asp Ala Thr Asn Glu # 125 - Ser Arg Ala Leu Tyr Leu Leu Gln Ser Ala Gl - #y Leu Ile Lys Leu Asp # 140 - Val Ser Gly Thr Ala Leu Ala Thr Val Ala As - #n Ile Lys Glu Asn Pro 145 1 - #50 1 - #55 1 -

#60 - Lys Asn Leu Lys Ile Thr Glu Leu Asp Ala Se - #r Gln Thr Ala Arg Ser # 175 - Leu Ser Ser Val Asp Ala Ala Val Val Asn As - #n Thr Phe Val Thr Glu # 190 - Ala Lys Leu Asp Tyr Lys Lys Ser Leu Phe Ly - #s Glu Gln Ala Asp Glu # 205 - Asn Ser Lys Gln Trp Tyr Asn Ile Ile Val Al - #a Lys Lys Asp Trp Glu # 220 - Thr Ser Pro Lys Ala Asp Ala Ile Lys Lys Va - #l Ile Ala Ala Tyr His 225 2 - #30 2 - #35 2 - #40 - Thr Asp Asp Val Lys Lys Val Ile Glu Glu Se - #r Ser Asp Gly Leu Asp # 255 - Gln Pro Val Trp 260 - (2) INFORMATION FOR SEQ ID NO: 33: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 640 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #33: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TTCGAAAGGG TCAGAAGGTG CAGACCTTAT CAGCATGAAA GGGGATGTCA TT - #ACAGAACA 60 - TCAATTTTAT GAGCAAGTGA AAAGCAACCC TTCAGCCCAA CAAGTCTTGT TA - #AATATGAC 120 - CATCCAAAAA GTTTTTGAAA AACAATATGG CTCAGAGCTT GATGATAAAG AG - #GTTGATGA 180 - TACTATTGCC GAAGAAAAAA AACAATATGG CGAAAACTAC CAACGTGTCT TG - #TCACAAGC 240 - AGGTATGACT CTTGAAACAC GTAAAGCTCA AATTCGTACA AGTAAATTAG TT - #GAGTTGGC 300 - AGTTAAGAAG GTAGCAGAAG CTGAATTGAC AGATGAAGCC TATAAGAAAG CC - #TTTGATGA 360 - GTACACTCCA GATGTAACGG CTCAAATCAT CCGTCTTAAT AATGAAGATA AG - #GCCAAAGA 420 - AGTTCTCGAA AAAGCCAAGG CAGAAGGTGC TGATTTTGCT CAATTAGCCA AA - #GATAATTC 480 - AACTGATGAA AAAACAAAAG AAAATGGTGG AGAAATTACC TTTGATTCTG CT - #TCAACAGA 540 - AGTACCTGGA GCAAGTCCAA AAAAGCCGCT TTTCGCTTTT AGATGTGGGA TG - #GTGTTTCT 600 # 640 AACT GGGGCACACC AAGCCTACAG - (2) INFORMATION FOR SEQ ID NO:34: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 213 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: - Ser Lys Gly Ser Glu Gly Ala Asp Leu Ile Se - #r Met Lys Gly Asp Val # 15 - Ile Thr Glu His Gln Phe Tyr Glu Gln Val Ly - #s Ser Asn Pro Ser Ala # 30 - Gln Gln Val Leu Leu Asn Met Thr Ile Gln Ly - #s Val Phe Glu Lys Gln # 45 - Tyr Gly Ser Glu Leu Asp Asp 
Lys Glu Val Asp Asp Thr Ile Ala Glu # 60 - Glu Lys Lys Gln Tyr Gly Glu Asn Tyr Gln Arg Val Leu Ser Gln Ala # 80 - Gly Met Thr Leu Glu Thr Arg Lys Ala Gln Ile Arg Thr Ser Lys Leu # 95 - Val Glu Leu Ala Val Lys Lys Val Ala Glu Ala Glu Leu Thr Asp Glu # 110 - Ala Tyr Lys Lys Ala Phe Asp Glu Tyr Thr Pro Asp Val Thr Ala Gln

# 125 - Ile Ile Arg Leu Asn Asn Glu Asp Lys Ala Ly - #s Glu Val Leu Glu Lys # 140 - Ala Lys Ala Glu Gly Ala Asp Phe Ala Gln Le - #u Ala Lys Asp Asn Ser 145 1 - #50 1 - #55 1 - #60 - Thr Asp Glu Lys Thr Lys Glu Asn Gly Gly Gl - #u Ile Thr Phe Asp Ser # 175 - Ala Ser Thr Glu Val Pro Gly Ala Ser Pro Ly - #s Lys Pro Leu Phe Ala # 190 - Phe Arg Cys Gly Met Val Phe Leu Asp Val As - #p Tyr Ser Asn Trp Gly # 205 - Thr Pro Ser Leu Gln 210 - (2) INFORMATION FOR SEQ ID NO: 35: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 631 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #35: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GGGGATGGCA GCTTTTAAAA ATCCTAACAA TCAATACAAA GCTATTACAA TT - #GCTCAAAC 60 - TCTAGGTGAT GATGCTTCTT CAGAGGAATT GGCTGGTAGA TATGGTTCTG CT - #GTTCAGTG 120 - TACAGAAGTG ACTGCCTCAA ACCTTTCAAC AGTTAAAACT AAAGCTACGG TT - #GTAGAAAA 180 - ACCACTGAAA GATTTTAGAG CGTCTACGTC TGATCAGTCT GGTTGGGTGG AA - #TCTAATGG 240 - TAAATGGTAT TTCTATGAGT CTGGTGATGT GAAGACAGGT TGGGTGAAAA CA - #GATGGTAA 300 - ATGGTACTAT TTGAATGACT TAGGTGTCAT GCAGACTGGA TTTGTAAAAT TT - #TCTGGTAG 360 - CTGGTATTAC TTGAGCAATT CAGGTGCTAT GTTTACAGGC TGGGGAACAG AT - #GGTAGCAG 420 - ATGGTTCTAC TTTGACGGCT CAGGAGCTAT GAAGACAGGC TGGTACAAGG AA - #AATGGCAC 480 - TTGGTATTAC CTTGACGAAG CAGGTATCAT GAAGACAGGT TGGTTTAAAG TC - #GGACCACA 540 - CTGGTACTAT GCCTACGGTT CAGGAGCTTT GGCTGTGAGC ACAACAACAC CA - #GATGGTTA 600 # 631 GGTG AATGGGTAAA C - (2) INFORMATION FOR SEQ ID NO:36: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 210 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: - Gly Met Ala Ala Phe Lys Asn Pro Asn Asn Gl - #n Tyr Lys Ala Ile Thr # 15 - Ile Ala Gln Thr Leu Gly Asp Asp Ala Ser Se - #r Glu Glu Leu Ala Gly # 30 - Arg Tyr Gly Ser Ala Val Gln Cys Thr Glu Va - #l Thr Ala Ser Asn Leu # 45 - Ser Thr Val Lys Thr Lys Ala Thr Val Val Gl - #u Lys Pro Leu Lys Asp # 60 - Phe Arg Ala Ser Thr Ser Asp Gln 
Ser Gly Tr - #p Val Glu Ser Asn Gly #80 - Lys Trp Tyr Phe Tyr Glu Ser Gly Asp Val Ly - #s Thr Gly Trp Val Lys # 95 - Thr Asp Gly Lys Trp Tyr Tyr Leu Asn Asp Le - #u Gly Val Met Gln Thr # 110 - Gly Phe Val Lys Phe Ser Gly Ser Trp Tyr Ty - #r Leu Ser Asn Ser Gly # 125 - Ala Met Phe Thr Gly Trp Gly Thr Asp Gly Se - #r Arg Trp Phe Tyr Phe # 140 - Asp Gly Ser Gly Ala Met Lys Thr Gly Trp Ty - #r Lys Glu Asn Gly Thr 145 1 - #50 1 - #55 1 - #60 - Trp Tyr Tyr Leu Asp Glu Ala Gly Ile Met Ly - #s Thr Gly Trp Phe Lys # 175 - Val Gly Pro His Trp Tyr Tyr Ala Tyr Gly Se - #r Gly Ala Leu Ala Val # 190 - Ser Thr Thr Thr Pro Asp Gly Tyr Arg Val As - #n Gly Asn Gly Glu Trp # 205 - Val Asn 210 - (2) INFORMATION FOR SEQ ID NO: 37: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1360 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #37: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - AGACGAGCAA AAAATTAAGC AAGCAGAAGC GGAAGTTGAG AGTAAACAAG CT - #GAGGCTAC 60 - AAGGTTAAAA AAAATCAAGA CAGATCGTGA AGAAGCAGAA GAAGAAGCTA AA - #CGAAGAGC 120 - AGATGCTAAA GAGCAAGGTA AACCAAAGGG GCGGGCAAAA CGAGGAGTTC CT - #GGAGAGCT 180 - AGCAACACCT GATAAAAAAG AAAATGATGC GAAGTCTTCA GATTCTAGCG TA - #GGTGAAGA 240 - AACTCTTCCA AGCCCATCCC TGAAACCAGA AAAAAAGGTA GCAGAAGCTG AG - #AAGAAGGT 300 - TGAAGAAGCT AAGAAAAAAG CCGAGGATCA AAAAGAAGAA GATCGCCGTA AC - #TACCCAAC 360 - CAATACTTAC AAAACGCTTG AACTTGAAAT TGCTGAGTCC GATGTGGAAG TT - #AAAAAAGC 420 - GGAGCTTGAA CTAGTAAAAG AGGAAGCTAA GGAACCTCGA AACGAGGAAA AA - #GTTAAGCA 480 - AGCAAAAGCG GAAGTTGAGA GTAAAAAAGC TGAGGCTACA AGGTTAGAAA AA - #ATCAAGAC 540 - AGATCGTAAA AAAGCAGAAG AAGAAGCTAA ACGAAAAGCA GCAGAAGAAG AT - #AAAGTTAA 600 - AGAAAAACCA GCTGAACAAC CACAACCAGC GCCGGCTCCA AAAGCAGAAA AA - #CCAGCTCC 660 - AGCTCCAAAA CCAGAGAATC CAGCTGAACA ACCAAAAGCA GAAAAACCAG CT - #GATCAACA 720 - AGCTGAAGAA GACTATGCTC GTAGATCAGA AGAAGAATAT AATCGCTTGA CT - #CAACAGCA 780 - ACCGCCAAAA ACTGAAAAAC CAGCACAACC ATCTACTCCA AAAACAGGCT GG - #AAACAAGA 840 - AAACGGTATG TGGTACTTCT ACAATACTGA 
TGGTTCAATG GCGACAGGAT GG - #CTCCAAAA 900 - CAATGGCTCA TGGTACTACC TCAACAGCAA TGGCGCTATG GCGACAGGAT GG - #CTCCAAAA 960 - CAATGGTTCA TGGTACTATC TAAACGCTAA TGGTTCAATG GCAACAGGAT GG - #CTCCAAAA 1020 - CAATGGTTCA TGGTACTACC TAAACGCTAA TGGTTCAATG GCGACAGGAT GG - #CTCCAATA 1080 - CAATGGCTCA TGGTACTACC TAAACGCTAA TGGTTCAATG GCGACAGGAT GG - #CTCCAATA 1140 - CAATGGCTCA TGGTACTACC TAAACGCTAA TGGTGATATG GCGACAGGTT GG - #GTGAAAGA 1200 - TGGAGATACC TGGTACTATC TTGAAGCATC AGGTGCTATG AAAGCAAGCC AA - #TGGTTCAA 1260 - AGTATCAGAT AAATGGTACT ATGTCAATGG CTCAGGTGCC CTTGCAGTCA AC - #ACAACTGT 1320 # 1360 AATG CCAATGGTGA ATGGGTAAAC - (2) INFORMATION FOR SEQ ID NO:38: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 453 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: - Asp Glu Gln Lys Ile Lys Gln Ala Glu Ala Gl - #u Val Glu Ser Lys Gln # 15 - Ala Glu Ala Thr Arg Leu Lys Lys Ile Lys Th - #r Asp Arg Glu Glu Ala # 30 - Glu Glu Glu Ala Lys Arg Arg Ala Asp Ala Ly - #s Glu Gln Gly Lys Pro # 45 - Lys Gly Arg Ala Lys Arg Gly Val Pro Gly Gl - #u Leu Ala Thr Pro Asp # 60 - Lys Lys Glu Asn Asp Ala Lys Ser Ser Asp Se - #r Ser Val Gly Glu Glu #80 - Thr Leu Pro Ser Pro Ser Leu Lys Pro Glu Ly - #s Lys Val Ala Glu Ala # 95 - Glu Lys Lys Val Glu Glu Ala Lys Lys Lys Al - #a Glu Asp Gln Lys Glu # 110 - Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr Ty - #r Lys Thr Leu Glu Leu # 125 - Glu Ile Ala Glu Ser Asp Val Glu Val Lys Ly - #s Ala Glu Leu Glu Leu # 140 - Val Lys Glu Glu Ala Lys Glu Pro Arg Asn Gl - #u Glu Lys Val Lys Gln 145 1 - #50 1 - #55 1 - #60 - Ala Lys Ala Glu Val Glu Ser Lys Lys Ala Gl - #u Ala Thr Arg Leu Glu # 175 - Lys Ile Lys Thr Asp Arg Lys Lys Ala Glu Gl - #u Glu Ala Lys Arg Lys # 190 - Ala Ala Glu Glu Asp Lys Val Lys Glu Lys Pr - #o Ala Glu Gln Pro Gln # 205 - Pro Ala Pro Ala Pro Lys Ala Glu Lys Pro Al - #a Pro Ala Pro Lys Pro # 220 - Glu Asn Pro Ala Glu Gln Pro Lys Ala Glu Ly - #s Pro Ala Asp Gln Gln 
225 2 - #30 2 - #35 2 - #40 - Ala Glu Glu Asp Tyr Ala Arg Arg Ser Glu Gl - #u Glu Tyr Asn Arg Leu # 255 - Thr Gln Gln Gln Pro Pro Lys Thr Glu Lys Pr - #o Ala Gln Pro Ser Thr # 270 - Pro Lys Thr Gly Trp Lys Gln Glu Asn Gly Me - #t Trp Tyr Phe Tyr Asn # 285 - Thr Asp Gly Ser Met Ala Thr Gly Trp Leu Gl - #n Asn Asn Gly Ser Trp # 300 - Tyr Tyr Leu Asn Ser Asn Gly Ala Met Ala Th - #r Gly Trp Leu Gln Asn 305 3 - #10 3 - #15 3 - #20 - Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gl - #y Ser Met Ala Thr Gly # 335 - Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr Le - #u Asn Ala Asn Gly Ser # 350 - Met Ala Thr Gly Trp Leu Gln Tyr Asn Gly Se - #r Trp Tyr Tyr Leu Asn # 365 - Ala Asn Gly Ser Met Ala Thr Gly Trp Leu Gl - #n Tyr Asn Gly Ser Trp # 380 Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Th - #r Gly Trp Val Lys Asp 385 3 - #90 3 - #95 4 - #00 - Gly Asp Thr Trp Tyr Tyr Leu Glu Ala Ser Gl - #y Ala Met Lys Ala Ser # 415 - Gln Trp Phe Lys Val Ser Asp Lys Trp Tyr Ty - #r Val Asn Gly Ser Gly # 430 - Ala Leu Ala Val Asn Thr Thr Val Asp Gly Ty - #r Gly Val Asn Ala Asn # 445 - Gly Glu Trp Val Asn 450 - (2) INFORMATION FOR SEQ ID NO: 39: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 412 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #39: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CTGTGGTGAG GAAGAAACTA AAAAGACTCA AGCAGCACAA CAGCCAAAAC AA - #CAAACGAC 60 - TGTACAACAA ATTGCTGTTG GAAAAGATGC TCCAGACTTC ACATTGCAAT CC - #ATGGATGG 120 - CAAAGAAGTT AAGTTATCTG ATTTTAAGGG TAAAAAGGTT TACTTGAAGT TT - #TGGGCTTC 180 - ATGGTGTGGT CCATGCAAGA AAAGTATGCC AGAGTTGATG GAACTAGCGG CG - #AAACCAGA 240 - TCGTGATTTC GAAATTCTTA CTGTCATTGC ACCAGGAATT CAAGGTGAAA AA - #ACTGTTGA 300 - GCAATTCCCA CAATGGTTCC AGGAACAAGG ATATAAGGAT ATCCCAGTTC TT - #TATGATAC 360 - CAAAGCAACC ACTTCCAAGC TTATCAAATT CGAAGCATTC CTACAGAATA TT - # 412 - (2) INFORMATION FOR SEQ ID NO:40: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 137 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE 
TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: - Cys Gly Glu Glu Glu Thr Lys Lys Thr Gln Ala Ala Gln Gln Pro Lys # 15 - Gln Gln Thr Thr Val Gln Gln Ile Ala Val Gly Lys Asp Ala Pro Asp # 30 - Phe Thr Leu Gln Ser Met Asp Gly Lys Glu Val Lys Leu Ser Asp Phe # 45 - Lys Gly Lys Lys Val Tyr Leu Lys Phe Trp Ala Ser Trp Cys Gly Pro # 60 - Cys Lys Lys Ser Met Pro Glu Leu Met Glu Leu Ala Ala Lys Pro Asp # 80 - Arg Asp Phe Glu Ile Leu Thr Val Ile Ala Pro Gly Ile Gln Gly Glu # 95 - Lys Thr Val Glu Gln Phe Pro Gln Trp Phe Gln Glu Gln Gly Tyr Lys # 110 - Asp Ile Pro Val Leu Tyr Asp Thr Lys Ala Thr Thr Ser Lys Leu Ile # 125 - Lys Phe Glu Ala Phe Leu Gln Asn Ile # 135 - (2) INFORMATION FOR SEQ ID NO: 41: - (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1462 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear - (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

- GACTTTTAAC AATAAAACTA TTGAAGAGTT GCACAATCTC CTTGTCTCTA AG - #GAAATTTC 60 - TGCAACAGAA TTGACCCAAG CAACACTTGA AAATATCAAG TCTCGTGAGG AA - #GCCCTCAA 120 - TTCATTTGTC ACCATCGCTG AGGAGCAAGC TCTTGTTCAA GCTAAAGCCA TT - #GATGAAGC 180 - tGGAATTGAT GCTGACAATG TCCTTTCAGG AATTCCACTT GCTGTTAAGG AT - #AACATCTC 240 - TACAGACGGT ATTCTCACAA CTGCTGCCTC AAAAATGCTC TACAACTATG AG - #CCAATCTT 300 - TGATGCGACa gCTgTTGCCA ATGCAAAAAC CAAGGGCATG ATTGTCGTTG GA - #AAGACCAA 360 - CATGGACGAA TTTGCTATGG GTGGTTCAGG tGAAACTTCA CACTACGGAG CA - #ACTAAAAA 420 - CGCTTGGAAC CACAGCAAGG TTCCTGGTGG GTCATCAAGT GGTTCTGCCG CA - #GCTGTAGC 480 - CTCAGGACAA GTTCGCTTGT CACTTGGTTC TGATACTGGT GGTTCCATCC GC - #CAACCTGC 540 - TGCCTTCAAC GGAATCGTTG GTCTCAAACC AACCTACGGA ACAGTTTCAC GT - #TTCGGTCT 600 - CATTGCCTTT GGTAGCTCAT TAGACCAGAT TGGACCTTTT GCTCCTACTG TT - #AAGGAAAA 660 - TGCCCTCTTG CTCAACGCTA TTGCCAGCGA AGATGCTAAA GACTCTACTT CT - #GCTCCTGT 720 - CCGCATCGCC GACTTTACTT CAAAAATCGG CCAAGACATC AAGGGTATGA AA - #ATCGCTTT 780 - GCCTAAGGAA TACCTAGGCG AAGGAATTGA TCCAGAGGTT AAGGAAACAA TC - #TTAAACGC 840 - GGCCAAACAC TTTGAAAAAT TGGGTGCTAT CGTCGAAGAA GTCAGCCTTC CT - #CACTCTAA 900 - ATACGGTGTT GCCGTTTATT ACATCATCGC TTCATCAGAA GCTTCATCAA AC - #TTGCAACG 960 - CTTCGACGGT ATCCGTTACG GCTATCGCGC AGAAGATGCA ACCAACCTTG AT - #GAAATCTA 1020 - TGTAAACAGC CGAAGCCAAG GTTTTGGTGA AGAGGTAAAA CGTCGTATCA TG - #CTGGGTAC 1080 - TTTCAGTCTT TCATCAGGTT ACTATGATGC CTACTACAAA AAGGCTGGTC AA - #GTCCGTAC 1140 - CCTCATCATT CAAGATTTCG AAAAAGTCTT CGCGGATTAC GATTTGATTT TG - #GGTCCAAC 1200 - TGCTCCAAGT GTTGCCTATG ACTTGGATTC TCTCAACCAT GACCCAGTTG CC - #ATGTACTT 1260 - AGCCGACCTA TTGACCATAC CTGTAAACTT GGCAGGACTG CCTGGAATTT CG - #ATTCCTGC 1320 - TGGATTCTCT CAAGGTCTAC CTGTCGGACT CCAATTGATT GGTCCCAAGT AC - #TCTGAGGA 1380 - AACCATTTAC CAAGCTGCTG CTGCTTTTGA AGCAACAACA GACTACCACA AA - #CAACAACC 1440 # 1462ACA AC - (2) INFORMATION FOR SEQ ID NO:42: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 487 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) 
TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: - Thr Phe Asn Asn Lys Thr Ile Glu Glu Leu His Asn Leu Leu Val Ser # 15 - Lys Glu Ile Ser Ala Thr Glu Leu Thr Gln Ala Thr Leu Glu Asn Ile # 30 - Lys Ser Arg Glu Glu Ala Leu Asn Ser Phe Val Thr Ile Ala Glu Glu # 45 - Gln Ala Leu Val Gln Ala Lys Ala Ile Asp Glu Ala Gly Ile Asp Ala # 60 - Asp Asn Val Leu Ser Gly Ile Pro Leu Ala Val Lys Asp Asn Ile Ser # 80 - Thr Asp Gly Ile Leu Thr Thr Ala Ala Ser Lys Met Leu Tyr Asn Tyr # 95 - Glu Pro Ile Phe Asp Ala Thr Ala Val Ala Asn Ala Lys Thr Lys Gly # 110 - Met Ile Val Val Gly Lys Thr Asn Met Asp Glu Phe Ala Met Gly Gly # 125 - Ser Gly Glu Thr Ser His Tyr Gly Ala Thr Lys Asn Ala Trp Asn His # 140 - Ser Lys Val Pro Gly Gly Ser Ser Ser Gly Ser Ala Ala Ala Val Ala 145 150 155 160 - Ser Gly Gln Val Arg Leu Ser Leu Gly Ser Asp Thr Gly Gly Ser Ile # 175 - Arg Gln Pro Ala Ala Phe Asn Gly Ile Val Gly Leu Lys Pro Thr Tyr # 190 - Gly Thr Val Ser Arg Phe Gly Leu Ile Ala Phe Gly Ser Ser Leu Asp # 205 - Gln Ile Gly Pro Phe Ala Pro Thr Val Lys Glu Asn Ala Leu Leu Leu # 220 - Asn Ala Ile Ala Ser Glu Asp Ala Lys Asp Ser Thr Ser Ala Pro Val 225 230 235 240 - Arg Ile Ala Asp Phe Thr Ser Lys Ile Gly Gln Asp Ile Lys Gly Met # 255 - Lys Ile Ala Leu Pro Lys Glu Tyr Leu Gly Glu Gly Ile Asp Pro Glu # 270 - Val Lys Glu Thr Ile Leu Asn Ala Ala Lys His Phe Glu Lys Leu Gly # 285 - Ala Ile Val Glu Glu Val Ser Leu Pro His Ser Lys Tyr Gly Val Ala # 300 - Val Tyr Tyr Ile Ile Ala Ser Ser Glu Ala Ser Ser Asn Leu Gln Arg 305 310 315 320 - Phe Asp Gly Ile Arg Tyr Gly Tyr Arg Ala Glu Asp Ala Thr Asn Leu # 335 - Asp Glu Ile Tyr Val Asn Ser Arg Ser Gln Gly Phe Gly Glu Glu Val # 350 - Lys Arg Arg Ile Met Leu Gly Thr Phe Ser Leu Ser Ser Gly Tyr Tyr # 365 - Asp Ala Tyr Tyr Lys Lys Ala Gly Gln Val Arg Thr Leu Ile Ile Gln # 380 - Asp Phe Glu Lys Val Phe 
Ala Asp Tyr Asp Le - #u Ile Leu Gly Pro Thr 385 3 - #90 3 - #95 4 - #00 - Ala Pro Ser Val Ala Tyr Asp Leu Asp Ser Le - #u Asn His Asp Pro Val # 415 - Ala Met Tyr Leu Ala Asp Leu Leu Thr Ile Pr - #o Val Asn Leu Ala Gly # 430 - Leu Pro Gly Ile Ser Ile Pro Ala Gly Phe Se - #r Gln Gly Leu Pro Val # 445 - Gly Leu Gln Leu Ile Gly Pro Lys Tyr Ser Gl - #u Glu Thr Ile Tyr Gln # 460 - Ala Ala Ala Ala Phe Glu Ala Thr Thr Asp Ty - #r His Lys Gln Gln Pro 465 4 - #70 4 - #75 4 - #80 - Val Ile Phe Gly Gly Asp Asn 485 - (2) INFORMATION FOR SEQ ID NO: 43: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 433 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #43: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CTTTACAGGT AAACAACTAC AAGTCGGCGA CAAGGCGCTT GATTTTTCTC TT - #ACTACAAC 60 - AGATCTTTCT AAAAAATCTC TGGCTGATTT TGATGGCAAG AAAAAAGTCT TG - #AGTGTCGT 120 - TCCTTCTATC GATACAGGCA TCTGCTCAAC TCAAACACGT CGTTTTAATG AA - #GAATTGGC 180 - TGGACTGGAC AACACGGTCG TATTGACTGT TTCAATGGAC CTACCTTTTG CT - #CAAAAACG 240 - TTGGTGCGGT GCTGAAGGCC TTGACAATGC CATTATGCTT TCAGACTACT TT - #GACCATTC 300 - TTTCGGGCGC GATTATGCCC TCTTGATCAA CGAATGGCAC CTATTAGCAC GC - #GCAGTCTT 360 - TGTCCTCGAT ACTGACAATA CGATTCGCTA CGTTGAATAC GTGGATAATA TC - #AATTCTGA 420 # 433 - (2) INFORMATION FOR SEQ ID NO:44: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 144 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: - Phe Thr Gly Lys Gln Leu Gln Val Gly Asp Ly - #s Ala Leu Asp Phe Ser # 15 - Leu Thr Thr Thr Asp Leu Ser Lys Lys Ser Le - #u Ala Asp Phe Asp Gly # 30 - Lys Lys Lys Val Leu Ser Val Val Pro Ser Il - #e Asp Thr Gly Ile Cys # 45 - Ser Thr Gln Thr Arg Arg Phe Asn Glu Glu Le - #u Ala Gly Leu Asp Asn # 60 - Thr Val Val Leu Thr Val Ser Met Asp Leu Pr - #o Phe Ala Gln Lys Arg #80 - Trp Cys Gly Ala Glu Gly Leu Asp Asn Ala Il - #e Met Leu Ser Asp Tyr # 95 - Phe Asp His Ser Phe Gly Arg Asp Tyr Ala Le - #u Leu Ile 
Asn Glu Trp # 110 - His Leu Leu Ala Arg Ala Val Phe Val Leu As - #p Thr Asp Asn Thr Ile # 125 - Arg Tyr Val Glu Tyr Val Asp Asn Ile Asn Se - #r Glu Pro Asn Phe Glu # 140 - (2) INFORMATION FOR SEQ ID NO: 45: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 724 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #45: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CCAGGCTGAT ACAAGTATCG CAGACATTCA AAAAAGAGGC GAACTGGTTG TC - #GGTGTCAA 60 - ACAAGACGTT CCCAATTTTG GTTACAAnGA TCCCAAGACC GGTACTTATT CT - #GGTATCGA 120 - AaCCGACTTG GCCAAGATGG TAGCTGATGA ACTCAAGGTC AAGATTCGCT AT - #GTGCCGGT 180 - TACAGCACAA ACCCGCGGCC CCCTTCTAGA CAATGAACAG GTCGATATGG AT - #ATCGCGAC 240 - CTTTACCATC ACGGACGAAC GCAAAAAACT CTACAACTTT ACCAGTCCCT AC - #TACACAGA 300 - CGCTTCTGGA TTTTTGGTCA ATAAATCTGC CAAAATCAAA AAGATTGAGG AC - #CTAAACGG 360 - CAAAACCATC GGAGTCGCCC AAGGTTCTAT CACCCAACGC CTGATTACTG AA - #CTGGGTAA 420 - AAAGAAAGGT CTGAAGTTTA AATTCGTCGA ACTTGGTTCC TACCCAGAAT TG - #ATTACTTC 480 - CCTGCACGCT CATCGTATCG ATACCTTTTC CGTTGACCGC TCTATTCTAT CT - #GGCTACAC 540 - TAGTAAACGG ACAGCACTAC TAGATGATAG TTTCAAGCCA TCTGACTACG GT - #ATTGTTAC 600 - CAAGAAATCA AATACAGAGC TCAACGACTA TCTTGATAAC TTGGTTACTA AA - #TGGAGCAA 660 - GGATGGTAGT TTGCAGAAAC TTTATGACCG TTACAAGCTC AAACCATCTA GC - #CATACTGC 720 # 724 - (2) INFORMATION FOR SEQ ID NO:46: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 241 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein #ID NO:46:(xi) SEQUENCE DESCRIPTION: SEQ - Gln Ala Asp Thr Ser Ile Ala Asp - # Ile Gln Lys Arg Gly Glu Leu Val # 15 - Val Gly Val Lys Gln Asp Val Pro - # Asn Phe Gly Tyr Xaa Asp Pro Lys # 30 - Thr Gly Thr Tyr Ser Gly Ile Glu - # Thr Asp Leu Ala Lys Met Val Ala # 45 - Asp Glu Leu Lys Val Lys Ile Arg - # Tyr Val Pro Val Thr Ala Gln Thr # 60 - Arg Gly Pro Leu Leu Asp Asn Glu - # Gln Val Asp Met Asp Ile Ala Thr # 80 - Phe Thr Ile Thr Asp Glu Arg Lys - # Lys Leu Tyr Asn Phe Thr Ser Pro # 95 - Tyr Tyr Thr Asp 
Ala Ser Gly Phe Leu Val Asn Lys Ser Ala Lys Ile # 110 - Lys Lys Ile Glu Asp Leu Asn Gly Lys Thr Ile Gly Val Ala Gln Gly # 125 - Ser Ile Thr Gln Arg Leu Ile Thr Glu Leu Gly Lys Lys Lys Gly Leu # 140 - Lys Phe Lys Phe Val Glu Leu Gly Ser Tyr Pro Glu Leu Ile Thr Ser # 160 - Leu His Ala His Arg Ile Asp Thr Phe Ser Val Asp Arg Ser Ile Leu # 175 - Ser Gly Tyr Thr Ser Lys Arg Thr Ala Leu Leu Asp Asp Ser Phe Lys # 190 - Pro Ser Asp Tyr Gly Ile Val Thr Lys Lys Ser Asn Thr Glu Leu Asn # 205 - Asp Tyr Leu Asp Asn Leu Val Thr Lys Trp Ser Lys Asp Gly Ser Leu # 220 - Gln Lys Leu Tyr Asp Arg Tyr Lys Leu Lys Pro Ser Ser His Thr

Ala # 240 - Asp - (2) INFORMATION FOR SEQ ID NO: 47: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1279 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #SEQ ID NO: 47: SEQUENCE DESCRIPTION: - GTCTGTATCA TTTGAAAACA AAGAAACAAA CCGTGGTGTC TTgACTTTCA CT - #ATCTCTCA 60 - AGACCAAATC AAACCAGAAT TGGACCGTGT CTTCAAGtCA GTGAAGAAAT CT - #CTTAATGT 120 - TCCAGGTTTC CGTAAAGGTC ACCTTCCACG CCCTATCTTC GACCAAAAAT TT - #GGTGAAGA 180 - AGCTCTTTAT CAAGATGCAA TGAACGCACT TTTGCCAAAC GCTTATGAAG CA - #GCTGTAAA 240 - AGAAGCTGGT CTTGAAGTGG TTGCCCAACC AAAAATTGAC GTAACTTCAA TG - #GAAAAAGG 300 - TCAAGACTGG GTTATCACTG CTGAAGTCGT TACAAAACCT GAAGTAAAAT TG - #GGTGACTA 360 - CAAAAACCTT GAAGTATCAG TTGATGTAGA AAAAGAAGTA ACTGACGCTG AT - #GTCGAAGA 420 - GCGTATCGAA CGCGAACGCA ACAACCTGGC TGAATTGGTT ATCAAGGAAG CT - #GCTGCTGA 480 - AAACGGCGAC ACTGTTGTGA TCGACTTCGT TGGTTCTATC GACGGTGTTG AA - #TTTGACGG 540 - TGGAAAAGGT GAAAACTTCT CACTTGGACT TGGTTCAGGT CAATTCATCC CT - #GGTTTCGA 600 - AGACCAATTG GTAGGTCACT CAGCTGGCGA AACCGTTGAT GTTATCGTAA CA - #TTCCCAGA 660 - AGACTACCAA GCAGAAGACC TTGCAGGTAA AGAAGCTAAA TTCGTGACAA CT - #ATCCACGA 720 - AGTAAAAGCT AAAGAAGTTC CGGCTCTTGA CGATGAACTT GCAAAAGACA TT - #GATGAAGA 780 - AGTTGAAACA CTTGCTGACT TGAAAGAAAA ATACAGCAAA GAATTGGCTG CT - #GCTAAAGA 840 - AGAAGCTTAC AAAGATGCAG TTGAAGGTGC AGCAATTGAT ACAGCTGTAG AA - #AATGCTGA 900 - AATCGTAGAA CTTCCAGAAG AAATGATCCA TGAAGAAGTT CACCGTTCAG TA - #AATGAATT 960 - CCTTGGGAAT TTGCAACGTC AAGGGATCAA CCCTGACATG TACTTCCAAA TC - #ACTGGAAC 1020 - TACTCAAGAA GACCTTCACA ACCAATACCA AGCAGAAGCT GAGTCACGTA CT - #AAGACTAA 1080 - CCTTGTTATC GAAGCAGTTG CCAAAGCTGA AGGATTTGAT GCTTCAGAAG AA - #GAAATCCA 1140 - AAAAGAAGTT GAGCAATTGG CAGCAGACTA CAACATGGAA GTTGCACAAG TT - #CAAAACTT 1200 - GCTTTCAGCT GACATGTTGA AACATGATAT CACTATCAAA AAAGCTGTTG AA - #TTGATCAC 1260 # 127 - #9 - (2) INFORMATION FOR SEQ ID NO:48: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 426 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - 
(ii) MOLECULE TYPE: protein #ID NO:48:(xi) SEQUENCE DESCRIPTION: SEQ - Ser Val Ser Phe Glu Asn Lys Glu - # Thr Asn Arg Gly Val Leu Thr Phe # 15 - Thr Ile Ser Gln Asp Gln Ile Lys - # Pro Glu Leu Asp Arg Val Phe Lys # 30 - Ser Val Lys Lys Ser Leu Asn Val - # Pro Gly Phe Arg Lys Gly His Leu # 45 - Pro Arg Pro Ile Phe Asp Gln Lys - # Phe Gly Glu Glu Ala Leu Tyr Gln # 60 - Asp Ala Met Asn Ala Leu Leu Pro - # Asn Ala Tyr Glu Ala Ala Val Lys # 80 - Glu Ala Gly Leu Glu Val Val Ala - # Gln Pro Lys Ile Asp Val Thr Ser # 95 - Met Glu Lys Gly Gln Asp Trp Val - # Ile Thr Ala Glu Val Val Thr Lys # 110 - Pro Glu Val Lys Leu Gly Asp Tyr - # Lys Asn Leu Glu Val Ser Val Asp # 125 - Val Glu Lys Glu Val Thr Asp Ala - # Asp Val Glu Glu Arg Ile Glu Arg # 140 - Glu Arg Asn Asn Leu Ala Glu Leu - # Val Ile Lys Glu Ala Ala Ala Glu # 160 - Asn Gly Asp Thr Val Val Ile Asp - # Phe Val Gly Ser Ile Asp Gly Val # 175 - Glu Phe Asp Gly Gly Lys Gly Glu - # Asn Phe Ser Leu Gly Leu Gly Ser # 190 - Gly Gln Phe Ile Pro Gly Phe Glu - # Asp Gln Leu Val Gly His Ser Ala # 205 - Gly Glu Thr Val Asp Val Ile Val - # Thr Phe Pro Glu Asp Tyr Gln Ala # 220 - Glu Asp Leu Ala Gly Lys Glu Ala - # Lys Phe Val Thr Thr Ile His Glu # 240 - Val Lys Ala Lys Glu Val Pro Ala - # Leu Asp Asp Glu Leu Ala Lys Asp # 255 - Ile Asp Glu Glu Val Glu Thr Leu - # Ala Asp Leu Lys Glu Lys Tyr Ser # 270 - Lys Glu Leu Ala Ala Ala Lys Glu - # Glu Ala Tyr Lys Asp Ala Val Glu # 285 - Gly Ala Ala Ile Asp Thr Ala Val - # Glu Asn Ala Glu Ile Val Glu Leu # 300 - Pro Glu Glu Met Ile His Glu Glu - # Val His Arg Ser Val Asn Glu Phe # 320 - Leu Gly Asn Leu Gln Arg Gln Gly - # Ile Asn Pro Asp Met Tyr Phe Gln # 335 - Ile Thr Gly Thr Thr Gln Glu Asp - # Leu His Asn Gln Tyr Gln Ala Glu # 350 - Ala Glu Ser Arg Thr Lys Thr Asn - # Leu Val Ile Glu Ala Val Ala Lys # 365 - Ala Glu Gly Phe Asp Ala Ser Glu - # Glu Glu Ile Gln Lys Glu Val Glu # 380 - Gln Leu Ala Ala Asp Tyr Asn Met - # Glu Val Ala Gln Val Gln Asn Leu # 400 - Leu Ser Ala Asp Met Leu Lys His - 
# Asp Ile Thr Ile Lys Lys Ala Val # 415 - Glu Leu Ile Thr Ser Thr Ala Thr - # Val Lys # 425 - (2) INFORMATION FOR SEQ ID NO: 49: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 490 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #SEQ ID NO: 49: SEQUENCE DESCRIPTION: - TGGTCAAAAG GAAAGTCAGA CAGGAAAGGG GATGAAAATT GTGACCAGTT TT - #TATCCTAT 60 - CTACGCTATG GTTAAGGAAG TATCTGGTGA CTTGAATGAT GTTCGGATGA TT - #CAGTCAAG 120 - TAGTGGTATT CACTCCTTTG AACCTTCGGC AAATGATATC GCAGCCATCT AT - #GATGCAGA 180 - TGTCTTTGTT TACCATTCTC ATACACTCGA ATCTTGGGCA GGAAGTCTGG AT - #CCAAATCT 240 - AAAAAAATCC AAAGTGAAGG TCTTAGAGGC TTCTGAGGGA ATGACCTTGG AA - #CGTGTCCC 300 - TGGACTAGAG GATGTGGAAG CAGGGGATGG AGTTGATGAA AAAACGCTCT AT - #GACCCTCA 360 - CACATGGCTA GATCCTGAAA AAGCTGGAGA AGAAGCCCAA ATTATCGCTG AT - #AAACTTTC 420 - AGAGGTGGAT AGTGAGCATA AAGAGACTTA TCAAAAAAAT GCGCAACCTT TA - #TCAAAAAA 480 # 490 - (2) INFORMATION FOR SEQ ID NO:50: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 163 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein #ID NO:50:(xi) SEQUENCE DESCRIPTION: SEQ - Gly Gln Lys Glu Ser Gln Thr Gly - # Lys Gly Met Lys Ile Val Thr Ser # 15 - Phe Tyr Pro Ile Tyr Ala Met Val - # Lys Glu Val Ser Gly Asp Leu Asn # 30 - Asp Val Arg Met Ile Gln Ser Ser - # Ser Gly Ile His Ser Phe Glu Pro # 45 - Ser Ala Asn Asp Ile Ala Ala Ile - # Tyr Asp Ala Asp Val Phe Val Tyr # 60 - His Ser His Thr Leu Glu Ser Trp - # Ala Gly Ser Leu Asp Pro Asn Leu # 80 - Lys Lys Ser Lys Val Lys Val Leu - # Glu Ala Ser Glu Gly Met Thr Leu # 95 - Glu Arg Val Pro Gly Leu Glu Asp - # Val Glu Ala Gly Asp Gly Val Asp # 110 - Glu Lys Thr Leu Tyr Asp Pro His - # Thr Trp Leu Asp Pro Glu Lys Ala # 125 - Gly Glu Glu Ala Gln Ile Ile Ala - # Asp Lys Leu Ser Glu Val Asp Ser # 140 - Glu His Lys Glu Thr Tyr Gln Lys - # Asn Ala Gln Pro Leu Ser Lys Lys # 160 - Leu Arg Asn - (2) INFORMATION FOR SEQ ID NO: 51: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 
1006 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #SEQ ID NO: 51: SEQUENCE DESCRIPTION: - GAAGGATAGA TATATTTTAG CATTTGAGAC ATCCTGTGAT GAGACCAGTG TC - #GCCGTCTT 60 - GAAAAACGAC GATGAGCTCT TGTCCAATGT CATTGCTAGT CAAATTGAGA GT - #CACAAACG 120 - TTTTGGTGGC GTAGTGCCCG AAGTAGCCAG TCGTCACCAT GTCGAGGTCA TT - #ACAGCCTG 180 - TATCGAGGAG GCATTGGCAG AAGCAGGGAT TACCGAAGAG GACGTGACAG CT - #GTTGCGGT 240 - TACCTACGGA CCAGGCTTGG TCGGAGCCTT GCTAGTTGGT TTGTCAGCTG CC - #AAGGCCTT 300 - TGCTTGGGCT CACGGACTTC CACTGATTCC TGTTAATCAC ATGGCTGGGC AC - #CTCATGGC 360 - AGCTCAGAGT GTGGAGCCTT TGGAGTTTCC CTTGCTAGCC CTCTTGGTCA GC - #GGCGGACA 420 - CACAGAGTTG GTTTATGTTT CGGAGGCAGG AGATTATAAG ATTGTTGGGG AA - #ACCCGTGA 480 - TGATGCGGTT GGTGAGGCTT ATGATAAGGT CGGCCGTGTC ATGGGCTTGA CC - #TATCCTGC 540 - AGGTCGTGAG ATTGACGAGC TGGCTCATCA GGGGCAGGAT ATTTATGATT TC - #CCCCGTGC 600 - CATGATTAAG GAAGATAATC TGGAGTTCTC CTTCTCAGGT TTGAAATCTG CC - #TTTATCAA 660 - TCTTCATCAC AATGCCGAGC AAAAGGGAGA AAGCCTGTCT ACAGAAGATT TG - #TGTGCTTC 720 - CTTCCAAGCA GCAGTTATGG ACATTCTCAT GGCAAAAACC AAGAAGGCTT TG - #GAGAAATA 780 - TCCTGTTAAA ATCCTAGTTG TGGCAGGTGG TGTGGCAGCC AATAAAGGTC TC - #AGAGAACG 840 - CCTAGCAGCC GAAATCACAG ATGTCAAGGT TATCATCCCC CCTCTGCGAC TC - #TGCGGAGA 900 - CAATGCAGGT ATGATTGCCT ATGCCAGCGT CAGCNAGTGG AACAAAGAAA AC - #TTCGCAGG 960 # 1006AAC CAAGTCTTGC CTTTGATACC ATGGAA - (2) INFORMATION FOR SEQ ID NO:52: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 335 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein #ID NO:52:(xi) SEQUENCE DESCRIPTION: SEQ - Lys Asp Arg Tyr Ile Leu Ala Phe - # Glu Thr Ser Cys Asp Glu Thr Ser # 15

- Val Ala Val Leu Lys Asn Asp Asp - # Glu Leu Leu Ser Asn Val Ile Ala # 30 - Ser Gln Ile Glu Ser His Lys Arg - # Phe Gly Gly Val Val Pro Glu Val # 45 - Ala Ser Arg His His Val Glu Val - # Ile Thr Ala Cys Ile Glu Glu Ala # 60 - Leu Ala Glu Ala Gly Ile Thr Glu - # Glu Asp Val Thr Ala Val Ala Val # 80 - Thr Tyr Gly Pro Gly Leu Val Gly - # Ala Leu Leu Val Gly Leu Ser Ala # 95 - Ala Lys Ala Phe Ala Trp Ala His - # Gly Leu Pro Leu Ile Pro Val Asn # 110 - His Met Ala Gly His Leu Met Ala - # Ala Gln Ser Val Glu Pro Leu Glu # 125 - Phe Pro Leu Leu Ala Leu Leu Val - # Ser Gly Gly His Thr Glu Leu Val # 140 - Tyr Val Ser Glu Ala Gly Asp Tyr - # Lys Ile Val Gly Glu Thr Arg Asp # 160 - Asp Ala Val Gly Glu Ala Tyr Asp - # Lys Val Gly Arg Val Met Gly Leu # 175 - Thr Tyr Pro Ala Gly Arg Glu Ile - # Asp Glu Leu Ala His Gln Gly Gln # 190 - Asp Ile Tyr Asp Phe Pro Arg Ala - # Met Ile Lys Glu Asp Asn Leu Glu # 205 - Phe Ser Phe Ser Gly Leu Lys Ser - # Ala Phe Ile Asn Leu His His Asn # 220 - Ala Glu Gln Lys Gly Glu Ser Leu - # Ser Thr Glu Asp Leu Cys Ala Ser # 240 - Phe Gln Ala Ala Val Met Asp Ile - # Leu Met Ala Lys Thr Lys Lys Ala # 255 - Leu Glu Lys Tyr Pro Val Lys Ile - # Leu Val Val Ala Gly Gly Val Ala # 270 - Ala Asn Lys Gly Leu Arg Glu Arg - # Leu Ala Ala Glu Ile Thr Asp Val # 285 - Lys Val Ile Ile Pro Pro Leu Arg - # Leu Cys Gly Asp Asn Ala Gly Met # 300 - Ile Ala Tyr Ala Ser Val Ser Xaa - # Trp Asn Lys Glu Asn Phe Ala Gly # 320 - Trp Asp Leu Asn Ala Lys Pro Ser - # Leu Ala Phe Asp Thr Met Glu # 335 - (2) INFORMATION FOR SEQ ID NO: 53: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1000 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #SEQ ID NO: 53: SEQUENCE DESCRIPTION: - GGTAGTTAAA GTTGGTATTA ACGGTTTCGG ACGTATCGGT CGTCTTGCTT TC - #CGTCGTAT

60 - CCAAAACGTA GAAGGTGTTG AAGTTACACG CATCAACGAC CTTACAGATC CA - #GTTATGCT 120 - TGCACACTTG TTGAAATACG ACACAACTCA AGGTCGTTTC GACGGTACTG TT - #GAAGTTAA 180 - AGAAGGTGGA TTTGAAGTTA ACGGTAAATT CATCAAAGTT TCTGCTGAAC GT - #GATCCAGA 240 - ACAAATCGAC TGGGCTACTG ACGGTGTAGA AATCGTTCTT GAAGCTACTG GT - #TTCTTTGC 300 - TAAGAAAGAA GCAGCTGAAA AACACCTTAA AGGTGGAGCT AAAAAAGTTG TT - #ATCACTGC 360 - TCCTGGTGGA AACGACGTTA AAACAGTTGT ATTCAACACT AACCACGACG TT - #CTTGACGG 420 - TACTGAAACA GTTATCTCAG GTGCTTCATG TACTACAAAC TGCTTGGCTC CA - #ATGGCTAA 480 - AGCTCTTCAA GACAACTTTG GTGTTGTTGA AGGATTGATG ACTACTATCC AC - #GCTTACAC 540 - TGGTGACCAA ATGATCCTTG ACGGACCACA CCGTGGTGGT GACCTTCGCC GT - #GCTCGCGC 600 - TGGTGCTGCA AACATCGTTC CTAACTCAAC TGGTGCTGCA AAAGCTATCG GT - #CTTGTAAT 660 - CCCAGAATTG AATGGTAAAC TTGACGGATC TGCACAACGC GTTCCAACTC CA - #ACTGGATC 720 - AGTTACTGAA TTGGTAGCAG TTCTTGAAAA GAACGTTACT GTTGATGAAG TG - #AACGCAGC 780 - TATGAAAGCA GCTTCAAACG AATCATACGG TTACACAGAA GATCCAATCG TA - #TCTTCAGA 840 - TATCGTAGGT ATGTCTTACG GTTCATTGTT TGACGCAACT CAAACTAAAG TT - #CTTGACGT 900 - TGACGGTAAA CAATTGGTTA AAGTTGTATC ATGGTACGAC AACGAAATGT CA - #TACACTGC 960 # 1000 CTTG GAATACTTCG CAAAAATTGC - (2) INFORMATION FOR SEQ ID NO:54: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 333 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein #ID NO:54:(xi) SEQUENCE DESCRIPTION: SEQ - Val Val Lys Val Gly Ile Asn Gly - # Phe Gly Arg Ile Gly Arg Leu Ala # 15 - Phe Arg Arg Ile Gln Asn Val Glu - # Gly Val Glu Val Thr Arg Ile Asn # 30 - Asp Leu Thr Asp Pro Val Met Leu - # Ala His Leu Leu Lys Tyr Asp Thr # 45 - Thr Gln Gly Arg Phe Asp Gly Thr - # Val Glu Val Lys Glu Gly Gly Phe # 60 - Glu Val Asn Gly Lys Phe Ile Lys - # Val Ser Ala Glu Arg Asp Pro Glu # 80 - Gln Ile Asp Trp Ala Thr Asp Gly - # Val Glu Ile Val Leu Glu Ala Thr # 95 - Gly Phe Phe Ala Lys Lys Glu Ala - # Ala Glu Lys His Leu Lys Gly Gly # 110 - Ala Lys Lys Val Val Ile Thr Ala - # Pro Gly Gly Asn Asp Val 
Lys Thr # 125 - Val Val Phe Asn Thr Asn His Asp - # Val Leu Asp Gly Thr Glu Thr Val # 140 - Ile Ser Gly Ala Ser Cys Thr Thr - # Asn Cys Leu Ala Pro Met Ala Lys # 160 - Ala Leu Gln Asp Asn Phe Gly Val - # Val Glu Gly Leu Met Thr Thr Ile # 175 - His Ala Tyr Thr Gly Asp Gln Met - # Ile Leu Asp Gly Pro His Arg Gly # 190 - Gly Asp Leu Arg Arg Ala Arg Ala - # Gly Ala Ala Asn Ile Val Pro Asn # 205 - Ser Thr Gly Ala Ala Lys Ala Ile - # Gly Leu Val Ile Pro Glu Leu Asn # 220 - Gly Lys Leu Asp Gly Ser Ala Gln - # Arg Val Pro Thr Pro Thr Gly Ser # 240 - Val Thr Glu Leu Val Ala Val Leu - # Glu Lys Asn Val Thr Val Asp Glu # 255 - Val Asn Ala Ala Met Lys Ala Ala - # Ser Asn Glu Ser Tyr Gly Tyr Thr # 270 - Glu Asp Pro Ile Val Ser Ser Asp - # Ile Val Gly Met Ser Tyr Gly Ser # 285 - Leu Phe Asp Ala Thr Gln Thr Lys - # Val Leu Asp Val Asp Gly Lys Gln # 300 - Leu Val Lys Val Val Ser Trp Tyr - # Asp Asn Glu Met Ser Tyr Thr Ala # 320 - Gln Leu Val Arg Thr Leu Gly Ile - # Leu Arg Lys Asn Cys # 330 - (2) INFORMATION FOR SEQ ID NO: 55: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 2389 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #SEQ ID NO: 55: SEQUENCE DESCRIPTION: - TTCTTACGAG TTGGGACTGT ATCAAGCTAG AACGGTTAAG GAAAATAATC GT - #GTTTCCTA 60 - TATAGATGGA AAACAAGCGA CGCAAAAAAC GGAGAATTTG ACTCCTGATG AG - #GTTAGCAA 120 - GCGTGAAGGA ATCAATGCTG AGCAAATCGT CATCAAGATA ACAGACCAAG GC - #TATGTCAC 180 - TTCACATGGC GACCACTATC ATTATTACAA TGGTAAGGTT CCTTATGACG CT - #ATCATCAG 240 - TGAAGAATTA CTCATGAAAG ATCCAAACTA TAAGCTAAAA GATGAGGATA TT - #GTTAATGA 300 - GGTCAAGGGT GGATATGTTA TCAAGGTAGA TGGAAAATAC TATGTTTACC TT - #AAGGATGC 360 - TGCCCACGCG GATAACGTCC GTACAAAAGA GGAAATCAAT CGACAAAAAC AA - #GAGCATAG 420 - TCAACATCGT GAAGGTGGAA CTCCAAGAAA CGATGGTGCT GTTGCCTTGG CA - #CGTTCGCA 480 - AGGACGCTAT ACTACAGATG ATGGTTATAT CTTTAATGCT TCTGATATCA TA - #GAGGATAC 540 - TGGTGATGCT TATATCGTTC CTCATGGAGA TCATTACCAT TACATTCCTA AG - #AATGAGTT 600 - ATCAGCTAGC GAGTTGGCTG CTGCAGAAGC CTTCCTATCT 
GGTCGAGGAA AT - #CTGTCAAA 660 - TTCAAGAACC TATCGCCGAC AAAATAGCGA TAACACTTCA AGAACAAACT GG - #GTACCTTC 720 - TGTAAGCAAT CCAGGAACTA CAAATACTAA CACAAGCAAC AACAGCAACA CT - #AACAGTCA 780 - AGCAAGTCAA AGTAATGACA TTGATAGTCT CTTGAAACAG CTCTACAAAC TG - #CCTTTGAG 840 - TCAACGACAT GTAGAATCTG ATGGCCTTGT CTTTGATCCA GCACAAATCA CA - #AGTCGAAC 900 - AGCTAGAGGT GTTGCAGTGC CACACGGAGA TCATTACCAC TTCATCCCTT AC - #TCTCAAAT 960 - GTCTGAATTG GAAGAACGAA TCGCTCGTAT TATTCCCCTT CGTTATCGTT CA - #AACCATTG 1020 - GGTACCAGAT TCAAGGCCAG AACAACCAAG TCCACAACCG ACTCCGGAAC CT - #AGTCCAGG 1080 - CCCGCAACCT GCACCAAATC TTAAAATAGA CTCAAATTCT TCTTTGGTTA GT - #CAGCTGGT 1140 - ACGAAAAGTT GGGGAAGGAT ATGTATTCGA AGAAAAGGGC ATCTCTCGTT AT - #GTCTTTGC 1200 - GAAAGATTTA CCATCTGAAA CTGTTAAAAA TCTTGAAAGC AAGTTATCAA AA - #CAAGAGAG 1260 - TGTTTCACAC ACTTTAACTG CTAAAAAAGA AAATGTTGCT CCTCGTGACC AA - #GAATTTTA 1320 - TGATAAAGCA TATAATCTGT TAACTGAGGC TCATAAAGCC TTGTTTGNAA AT - #AAGGGTCG 1380 - TAATTCTGAT TTCCAAGCCT TAGACAAATT ATTAGAACGC TTGAATGATG AA - #TCGACTAA 1440 - TAAAGAAAAA TTGGTAGATG ATTTATTGGC ATTCCTAGCA CCAATTACCC AT - #CCAGAGCG 1500 - ACTTGGCAAA CCAAATTCTC AAATTGAGTA TACTGAAGAC GAAGTTCGTA TT - #GCTCAATT 1560 - AGCTGATAAG TATACAACGT CAGATGGTTA CATTTTTGAT GAACATGATA TA - #ATCAGTGA 1620 - TGAAGGAGAT GCATATGTAA CGCCTCATAT GGGCCATAGT CACTGGATTG GA - #AAAGATAG 1680 - CCTTTCTGAT AAGGAAAAAG TTGCAGCTCA AGCCTATACT AAAGAAAAAG GT - #ATCCTACC 1740 - TCCATCTCCA GACGCAGATG TTAAAGCAAA TCCAACTGGA GATAGTGCAG CA - #GCTATTTA 1800 - CAATCGTGTG AAAGGGGAAA AACGAATTCC ACTCGTTCGA CTTCCATATA TG - #GTTGAGCA 1860 - TACAGTTGAG GTTAAAAACG GTAATTTGAT TATTCCTCAT AAGGATCATT AC - #CATAATAT 1920 - TAAATTTGCT TGGTTTGATG ATCACACATA CAAAGCTCCA AATGGCTATA CC - #TTGGAAGA 1980 - TTTGTTTGCG ACGATTAAGT ACTACGTAGA ACACCCTGAC GAACGTCCAC AT - #TCTAATGA 2040 - TGGATGGGGC AATGCCAGTG AGCATGTGTT AGGCAAGAAA GACCACAGTG AA - #GATCCAAA 2100 - TAAGAACTTC AAAGCGGATG AAGAGCCAGT AGAGGAAACA CCTGCTGAGC CA - #GAAGTCCC 2160 - TCAAGTAGAG ACTGAAAAAG TAGAAGCCCA ACTCAAAGAA 
GCAGAAGTTT TG - #CTTGCGAA 2220 - AGTAACGGAT TCTAGTCTGA AAGCCAATGC AACAGAAACT CTAGCTGGTT TA - #CGAAATAA 2280 - TTTGACTCTT CAAATTATGG ATAACAATAG TATCATGGCA GAAGCAGAAA AA - #TTACTTGC 2340 # 2389TAATC CTTCATCTGT AAGTAAGGAA AAAATAAAC - (2) INFORMATION FOR SEQ ID NO:56: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 796 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein #ID NO:56:(xi) SEQUENCE DESCRIPTION: SEQ - Ser Tyr Glu Leu Gly Leu Tyr Gln - # Ala Arg Thr Val Lys Glu Asn Asn # 15 - Arg Val Ser Tyr Ile Asp Gly Lys - # Gln Ala Thr Gln Lys Thr Glu Asn # 30 - Leu Thr Pro Asp Glu Val Ser Lys - # Arg Glu Gly Ile Asn Ala Glu Gln # 45 - Ile Val Ile Lys Ile Thr Asp Gln - # Gly Tyr Val Thr Ser His Gly Asp # 60 - His Tyr His Tyr Tyr Asn Gly Lys - # Val Pro Tyr Asp Ala Ile Ile Ser # 80 - Glu Glu Leu Leu Met Lys Asp Pro - # Asn Tyr Lys Leu Lys Asp Glu Asp # 95 - Ile Val Asn Glu Val Lys Gly Gly - # Tyr Val Ile Lys Val Asp Gly Lys # 110 - Tyr Tyr Val Tyr Leu Lys Asp Ala - # Ala His Ala Asp Asn Val Arg Thr # 125 - Lys Glu Glu Ile Asn Arg Gln Lys - # Gln Glu His Ser Gln His Arg Glu # 140 - Gly Gly Thr Pro Arg Asn Asp Gly - # Ala Val Ala Leu Ala Arg Ser Gln # 160 - Gly Arg Tyr Thr Thr Asp Asp Gly - # Tyr Ile Phe Asn Ala Ser Asp Ile # 175 - Ile Glu Asp Thr Gly Asp Ala Tyr - # Ile Val Pro His Gly Asp His Tyr # 190 - His Tyr Ile Pro Lys Asn Glu Leu - # Ser Ala Ser Glu Leu Ala Ala Ala # 205 - Glu Ala Phe Leu Ser Gly Arg Gly - # Asn Leu Ser Asn Ser Arg Thr Tyr # 220 - Arg Arg Gln Asn Ser Asp Asn Thr - # Ser Arg Thr Asn Trp Val Pro Ser # 240 - Val Ser Asn Pro Gly Thr Thr Asn - # Thr Asn Thr Ser Asn Asn Ser Asn # 255 - Thr Asn Ser Gln Ala Ser Gln Ser - # Asn Asp Ile Asp Ser Leu Leu Lys # 270 - Gln Leu Tyr Lys Leu Pro Leu Ser - # Gln Arg His Val Glu Ser Asp Gly # 285 - Leu Val Phe Asp Pro Ala Gln Ile - # Thr Ser Arg Thr Ala Arg Gly Val # 300 - Ala Val Pro His Gly Asp His Tyr - # His Phe Ile Pro Tyr Ser Gln Met
# 320 - Ser Glu Leu Glu Glu Arg Ile Ala - # Arg Ile Ile Pro Leu Arg Tyr Arg # 335 - Ser Asn His Trp Val Pro Asp Ser - # Arg Pro Glu Gln Pro Ser Pro Gln # 350 - Pro Thr Pro Glu Pro Ser Pro Gly - # Pro Gln Pro Ala Pro Asn Leu Lys # 365 - Ile Asp Ser Asn Ser Ser Leu Val - # Ser Gln Leu Val Arg Lys Val Gly # 380 - Glu Gly Tyr Val Phe Glu Glu Lys - # Gly Ile Ser Arg Tyr Val Phe Ala # 400 - Lys Asp Leu Pro Ser Glu Thr Val - # Lys Asn Leu Glu Ser Lys Leu Ser # 415 - Lys Gln Glu Ser Val Ser His Thr - # Leu Thr Ala Lys Lys Glu Asn Val # 430 - Ala Pro Arg Asp Gln Glu Phe Tyr - # Asp Lys Ala Tyr Asn Leu Leu Thr # 445 - Glu Ala His Lys Ala Leu Phe Xaa - # Asn Lys Gly Arg Asn Ser Asp Phe # 460 - Gln Ala Leu Asp Lys Leu Leu Glu - # Arg Leu Asn Asp Glu Ser Thr Asn # 480 - Lys Glu Lys Leu Val Asp Asp Leu - # Leu Ala Phe Leu Ala Pro Ile Thr # 495 - His Pro Glu Arg Leu Gly Lys Pro - # Asn Ser Gln Ile Glu Tyr Thr Glu # 510 - Asp Glu Val Arg Ile Ala Gln Leu - # Ala Asp Lys Tyr Thr Thr Ser Asp # 525 - Gly Tyr Ile Phe Asp Glu His Asp - # Ile Ile Ser Asp Glu Gly Asp Ala # 540 - Tyr Val Thr Pro His Met Gly His - # Ser His Trp Ile Gly Lys Asp Ser # 560 - Leu Ser Asp Lys Glu Lys Val Ala - # Ala Gln Ala Tyr Thr Lys Glu Lys # 575 - Gly Ile Leu Pro Pro Ser Pro Asp - # Ala Asp Val Lys Ala Asn Pro Thr # 590 - Gly Asp Ser Ala Ala Ala Ile Tyr - # Asn Arg Val Lys Gly Glu Lys Arg # 605 - Ile Pro Leu Val Arg Leu Pro Tyr - # Met Val Glu His Thr Val Glu Val # 620 - Lys Asn Gly Asn Leu Ile Ile Pro - # His Lys Asp His Tyr His Asn Ile # 640 - Lys Phe Ala Trp Phe Asp Asp His - # Thr Tyr Lys Ala Pro Asn Gly Tyr # 655 - Thr Leu Glu Asp Leu Phe Ala Thr - # Ile Lys Tyr Tyr Val Glu His Pro # 670 - Asp Glu Arg Pro His Ser Asn Asp - # Gly Trp Gly Asn Ala Ser Glu His # 685 - Val Leu Gly Lys Lys Asp His Ser - # Glu Asp Pro Asn Lys Asn Phe Lys # 700 - Ala Asp Glu Glu Pro Val Glu Glu - # Thr Pro Ala Glu Pro Glu Val Pro # 720 - Gln Val Glu Thr Glu Lys Val Glu - # Ala Gln Leu Lys Glu Ala Glu Val # 735 - Leu Leu Ala Lys 
Val Thr Asp Ser - # Ser Leu Lys Ala Asn Ala Thr Glu # 750 - Thr Leu Ala Gly Leu Arg Asn Asn - # Leu Thr Leu Gln Ile Met Asp Asn # 765 - Asn Ser Ile Met Ala Glu Ala Glu - # Lys Leu Leu Ala Leu Leu Lys Gly # 780 - Ser Asn Pro Ser Ser Val Ser Lys - # Glu Lys Ile Asn # 795 - (2) INFORMATION FOR SEQ ID NO: 57: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1180 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #SEQ ID NO: 57: SEQUENCE DESCRIPTION: - TACTGAGATG CATCATAATC TAGGAGCTGA AAAGCGTTCA GCAGTGGCTA CT - #ACTATCGA 60 - TAGTTTTAAG GAGCGAAGTC AAAAAGTCAG AGCACTATCT GATCCAAATG TG - #CGTTTTGT 120 - TCCCTTCTTT GGCTCTAGTG AATGGCTTCG TTTTGACGGT GCTCATTCTG CG - #GTATTAGC 180 - TGAGAAATAC AATCGTTCCT ACCGTCCTTA TCTTTTAGGA CAGGGGGGAG CT - #GCATCGCT 240 - TAACCAATAT TTTGGAATGC AACAGATGTT ACCACAGCTG GAGAATAAAC AA - #GTTGTGTA 300 - TGTTATCTCA CCTCAGTGGT TCAGTAAAAA TGGCTATGAT CCAGCAGCCT TC - #CAGCAGTA 360 - TTTTAATGGA GACCAGTTGA CTAGTTTTCT GAAACATCAA TCTGGGGATC AG - #GCTAGTCA 420 - ATATGCAGCG ACTCGCTTAC TGCAACAGTT CCCAAACGTA GCTATGAAGG AC - #CTGGTTCA 480 - GAAGTTGGCA AGTAAAGAAG AATTGTCGAC AGCAGACAAT GAAATGATTG AA - #TTATTGGC 540 - TCGTTTTAAT GAACGCCAAG CTTCCTTTTT TGGTCAGTTT TCGGTTAGAG GC - #TATGTTAA 600 - CTACGATAAG CATGTAGCTA AGTATTTAAA AATCTTGCCA GACCAGTTTT CT - #TATCAGGC 660 - AATAGAAGAT GTTGTCAAAG CAGATGCTGA AAAAAATACT TCCAATAATG AG - #ATGGGAAT 720 - GGAAAATTAT TTCTATAATG AGCAGATCAA GAAGGATTTG AAGAAATTAA AG - #GATTCTCA 780 - GAAAAGCTTT ACCTATCTCA AGTCGCCAGA GTATAATGNN TTGCAGTTGG TT - #TTAACACA 840 - GTTTTCTAAA TCTAAGGTAA ACCCGATTTT TATCATTCCA CCTGTTAATA AA - #AAATGGAT 900 - GNACTATGCT GGTCTACGAG AGGATATGTA CCAACAAACG GTGCAGAAGA TT - #CGCTACCA 960 - GTTAGAAAGT CAAGGTTTTA CCAATATAGC AGATTTTTCT AAGGACGGCG GG - #GAGCCTTT 1020 - CTTTATGAAG GACACCATTC ACCTTGGTTG GTTGGGTTGG TTGGCTTTTG AC - #AAGGCAGT 1080 - TGATCCTTTC CTATCCAATC CCACACCAGC TCCGACTTAC CATCTGAATG AG - #CGCTTTTT 1140 # 1180 ACTT ATGATGGAGA TGTCAAAGAA - (2) INFORMATION FOR SEQ ID NO:58: - (i) 
SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 393 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: - Thr Glu Met His His Asn Leu Gly Ala Glu Ly - #s Arg Ser Ala Val Ala # 15 - Thr Thr Ile Asp Ser Phe Lys Glu Arg Ser Gl - #n Lys Val Arg Ala Leu # 30 - Ser Asp Pro Asn Val Arg Phe Val Pro Phe Ph - #e Gly Ser Ser Glu Trp # 45 - Leu Arg Phe Asp Gly Ala His Ser Ala Val Le - #u Ala Glu Lys Tyr Asn # 60 - Arg Ser Tyr Arg Pro Tyr Leu Leu Gly Gln Gl - #y Gly Ala Ala Ser Leu #80 - Asn Gln Tyr Phe Gly Met Gln Gln Met Leu Pr - #o Gln Leu Glu Asn Lys # 95 - Gln Val Val Tyr Val Ile Ser Pro Gln Trp Ph - #e Ser Lys Asn Gly Tyr # 110 - Asp Pro Ala Ala Phe Gln Gln Tyr Phe Asn Gl - #y Asp Gln Leu Thr Ser # 125 - Phe Leu Lys His Gln Ser Gly Asp Gln Ala Se - #r Gln Tyr Ala Ala Thr # 140 - Arg Leu Leu Gln Gln Phe Pro Asn Val Ala Me - #t Lys Asp Leu Val Gln 145 1 - #50 1 - #55 1 - #60 - Lys Leu Ala Ser Lys Glu Glu Leu Ser Thr Al - #a Asp Asn Glu Met Ile # 175 - Glu Leu Leu Ala Arg Phe Asn Glu Arg Gln Al - #a Ser Phe Phe Gly Gln # 190 - Phe Ser Val Arg Gly Tyr Val Asn Tyr Asp Ly - #s His Val Ala Lys Tyr # 205 - Leu Lys Ile Leu Pro Asp Gln Phe Ser Tyr Gl - #n Ala Ile Glu Asp Val # 220 - Val Lys Ala Asp Ala Glu Lys Asn Thr Ser As - #n Asn Glu Met Gly Met 225 2 - #30 2 - #35 2 - #40 - Glu Asn Tyr Phe Tyr Asn Glu Gln Ile Lys Ly - #s Asp Leu Lys Lys Leu # 255 - Lys Asp Ser Gln Lys Ser Phe Thr Tyr Leu Ly - #s Ser Pro Glu Tyr Asn # 270 - Xaa Leu Gln Leu Val Leu Thr Gln Phe Ser Ly - #s Ser Lys Val Asn Pro # 285 - Ile Phe Ile Ile Pro Pro Val Asn Lys Lys Tr - #p Met Xaa Tyr Ala Gly # 300 - Leu Arg Glu Asp Met Tyr Gln Gln Thr Val Gl - #n Lys Ile Arg Tyr Gln 305 3 - #10 3 - #15 3 - #20 - Leu Glu Ser Gln Gly Phe Thr Asn Ile Ala As - #p Phe Ser Lys Asp Gly # 335 - Gly Glu Pro Phe Phe Met Lys Asp Thr Ile Hi - #s Leu Gly Trp Leu Gly # 350 - Trp Leu Ala Phe Asp Lys Ala Val Asp Pro Ph - #e Leu Ser Asn Pro Thr # 
365 - Pro Ala Pro Thr Tyr His Leu Asn Glu Arg Ph - #e Phe Ser Lys Asp Trp # 380 - Ala Thr Tyr Asp Gly Asp Val Lys Glu 385 3 - #90 - (2) INFORMATION FOR SEQ ID NO: 59: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1423 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #59: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GGTTTTGAGA AAGTATTTGC AGGGGGCCCT GATTGAGTCG ATTGAGCAAG TG - #GAAAATGA 60 - CCGTATTGTG GAAATTACAG TTTCCAATAA AAACGAGATT GGAGACCATA TC - #CAGGCTAC 120 - CTTGATTATC GAAATTATGG GGAAACACAG TAATATTCTA CTGGTCGATA AA - #AGCAGTCA 180 - TAAAATCCTC GAAGTTATCA AACACGTCGG CTTTTCACAA AATAGCTACC GC - #ACCTTACT 240 - TCCAGGATCG ACCTATATCG CTCCGCCAAG TACAAAATCT CTCAATCCTT TT - #ACTATCAA 300 - GGATGAAAAG CTCTTTGAAA TCCTGCAAAC CCAAGAACTA ACAGCAAAAA AT - #CTTCAAAG 360 - CCTCTTTCAA GGTCTGGGAC GCGATACGGC AAATGAATTG GAAAGGATAC TG - #GTTAGTGA 420 - AAAACTTTCC GCTTTCCGAA ATTTTTTCAA TCAAGAAACC AAGCCATGCT TG - #ACTGAGAC 480 - TTCCTTCAGT CCAGTTCCTT TTGCAAATCA GGTGGGAGAG CCTTTTGCAA AT - #CTTTCTGA 540 - TTTGTTGGAC ACCTACTATA AGGATAAGGC TGAGCGCGAC CGCGTCAAAC AG - #CAGGCCAG 600 - TGAACTGATT CGTCGTGTTG AAAATGAACT TCAGAAAAAC CGACACAAAC TC - #AAAAAACA 660 - GGAAAAAGAG TTACTGGCGA CAGACAACGC TGAAGAATTT CGTCAAAAAG GA - #GAATTGCT 720 - GACAACCTTC CTCCACCAAG TGCCTAACGA CCAAGACCAG GTTATCCTAG AC - #AACTACTA 780 - TACCAACCAA CCTATCATGA TTGCGCTTGA TAAGGCTCTG ACTCCCAACC AG - #AATGCCCA 840 - ACGCTATTTT AAACGGTATC AGAAACTCAA AGAAGCTGTC AAATACTTGA CT - #GATTTGAT 900 - TGAAGAAACC AAAGCCACTA TTCTCTATCT GGAAAGTGTA GAAACCGTCC TC - #AACCAAGC 960 - TGGACTGGAA GAAATCGCTG AAATCCGTGA AGAATTGATT CAAACAGGTT TT - #ATCCGCAG 1020 - AAGACAACGG GAGAAAATCC AGAAACGCAA AAAACTAGAA CAATATCTAG CA - #AGCGATGG 1080 - CAAAACCATC ATCTATGTCG GACGAAACAA TCTTCAAAAT GAGGAATTGA CC - #TTTAAAAT 1140 - GGCCCGCAAG GAGGAACTTT GGTTCCATGC TAAGGACATT CCTGGAAGCC AT - #GTTGTCAT 1200 - CTCAGGAAAT CTTGACCCAT CTGATGCAGT CAAGACAGAC GCAGCAGAGT TA - #GCTGCCTA 1260 - CTTCTCTCAA GGGCGCCTGT CGAATCTGGT GCAGGTAGAT ATGATTGAAG TC 
- #AAAAAACT 1320 - CAATAAACCA ACTGGTGGAA AACCCGGCTT TGTCACTTAC ACAGGACAAA AG - #ACCCTCCG 1380 # 142 - #3ATTGCATC CATGAAAAAA TCC - (2) INFORMATION FOR SEQ ID NO:60: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 474 amino (B) TYPE: amino acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: - Val Leu Arg Lys Tyr Leu Gln Gly Ala Leu Il - #e Glu Ser Ile Glu Gln # 15 - Val Glu Asn Asp Arg Ile Val Glu Ile Thr Va - #l Ser Asn Lys Asn Glu # 30 - Ile Gly Asp His Ile Gln Ala Thr Leu Ile Il - #e Glu Ile Met Gly Lys # 45 - His Ser Asn Ile Leu Leu Val Asp Lys Ser Se - #r His Lys Ile Leu Glu # 60 - Val Ile Lys His Val Gly Phe Ser Gln Asn Se - #r Tyr Arg Thr Leu Leu #80 - Pro Gly Ser Thr Tyr Ile Ala Pro Pro Ser Th - #r Lys Ser Leu Asn Pro # 95 - Phe Thr Ile Lys Asp Glu Lys Leu Phe Glu Il - #e Leu Gln Thr Gln Glu # 110 - Leu Thr Ala Lys Asn Leu Gln Ser Leu Phe Gl - #n Gly Leu Gly Arg Asp # 125 - Thr Ala Asn Glu Leu Glu Arg Ile Leu Val Se - #r Glu Lys Leu Ser Ala # 140 - Phe Arg Asn Phe Phe Asn Gln Glu Thr Lys Pr - #o Cys Leu Thr Glu Thr 145 1 - #50 1 - #55 1 - #60 - Ser Phe Ser Pro Val Pro Phe Ala Asn Gln Va - #l Gly Glu Pro Phe Ala # 175 - Asn Leu Ser Asp Leu Leu Asp Thr Tyr Tyr Ly - #s Asp Lys Ala Glu Arg # 190 - Asp Arg Val Lys Gln Gln Ala Ser Glu Leu Il - #e Arg Arg Val Glu Asn # 205 - Glu Leu Gln Lys Asn Arg His Lys Leu Lys Ly - #s Gln Glu Lys Glu Leu # 220 - Leu Ala Thr Asp Asn Ala Glu Glu Phe Arg Gl - #n Lys Gly Glu Leu Leu 225 2 - #30 2 - #35 2 - #40 - Thr Thr Phe Leu His Gln Val Pro Asn Asp Gl - #n Asp Gln Val Ile Leu # 255 - Asp Asn Tyr Tyr Thr Asn Gln Pro Ile Met Il - #e Ala Leu Asp Lys Ala # 270 - Leu Thr Pro Asn Gln Asn Ala Gln Arg Tyr Ph - #e Lys Arg Tyr Gln Lys # 285 - Leu Lys Glu Ala Val Lys Tyr Leu Thr Asp Le - #u Ile Glu Glu Thr Lys # 300 - Ala Thr Ile Leu Tyr Leu Glu Ser Val Glu Th - #r Val Leu Asn Gln Ala 305 3 - #10 3 - #15 3 - #20 - Gly Leu Glu Glu Ile Ala Glu Ile Arg Glu Gl - #u Leu Ile Gln Thr Gly # 335 - Phe Ile Arg Arg Arg Gln Arg Glu Lys Ile Gl - #n Lys Arg Lys Lys Leu # 350 - Glu Gln Tyr Leu Ala Ser Asp Gly Lys Thr Il - #e Ile Tyr Val Gly Arg # 365 - Asn Asn Leu Gln Asn Glu Glu Leu Thr Phe Ly - #s Met Ala Arg Lys Glu # 
380 - Glu Leu Trp Phe His Ala Lys Asp Ile Pro Gl - #y Ser His Val Val Ile 385 3 - #90 3 - #95 4 - #00 - Ser Gly Asn Leu Asp Pro Ser Asp Ala Val Ly - #s Thr Asp Ala Ala Glu # 415 - Leu Ala Ala Tyr Phe Ser Gln Gly Arg Leu Se - #r Asn Leu Val Gln Val # 430 - Asp Met Ile Glu Val Lys Lys Leu Asn Lys Pr - #o Thr Gly Gly Lys Pro # 445 - Gly Phe Val Thr Tyr Thr Gly Gln Lys Thr Le - #u Arg Val Thr Pro Asp # 460 - Ser Lys Lys Ile Ala Ser Met Lys Lys Ser 465 4 - #70 - (2) INFORMATION FOR SEQ ID NO: 61: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 544 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #61: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GACAACATTT ACTATCCATA CAGTAGAGTC AGCACCAGCA GAAGTGAAAG AA - #ATTCTTGA 60 - AACAGTAGAA AAAGACAACA ATGGCTATAT TCCCAACCTA ATCGGTCTCT TG - #GCCAATGC 120 - CCCGACTGTT TTAGAAGCCT ACCAAATTGT CTCATCTATC CACCGTCGCA AC - #AGCCTGAC 180 - ACCCGTTGAG CGTGAAGTGG TGCAAATCAC GGCAGCCGTG ACCAATGGTT GT - #GCCTTCTG 240 - TGTCGCAGGT CACACAGCCT TTTCCATCAA ACAAATCCAG ATGAATGATG AC - #TTGATTCA 300 - AGCTCTTCGC AATCGTACTC CAATTGAAAC AGATCCTAAA TTGGATACCC TA - #GCTAAGTT 360 - TACCTTGGCA GTTATCAATA CCAAGGGTCG TGTAGGAGAT GAAGCCTTGT CT - #GAGTTTTT 420 - AGAAGCTGGC TACACTCAAC AAAATGCCTT GGATGTGGTT TTTGGTGTCA GC - #CTAGCAAT 480 - CCTCTGTAAC TATGCCAACA ACTTAGCTAA TACACCAATT AATCCAGAAT TG - #CAACCTTA 540 # 544 - (2) INFORMATION FOR SEQ ID NO:62: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 181 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: - Thr Thr Phe Thr Ile His Thr Val Glu Ser Al - #a Pro Ala Glu Val Lys # 15 - Glu Ile Leu Glu Thr Val Glu Lys Asp Asn As - #n Gly Tyr Ile Pro Asn # 30 - Leu Ile Gly Leu Leu Ala Asn Ala Pro Thr Va - #l Leu Glu Ala Tyr Gln # 45 - Ile Val Ser Ser Ile His Arg Arg Asn Ser Le - #u Thr Pro Val Glu Arg # 60 - Glu Val Val Gln Ile Thr Ala Ala Val Thr As - #n Gly Cys Ala Phe Cys #80 - Val Ala Gly His Thr Ala Phe 
Ser Ile Lys Gl - #n Ile Gln Met Asn Asp # 95 - Asp Leu Ile Gln Ala Leu Arg Asn Arg Thr Pr - #o Ile Glu Thr Asp Pro # 110 - Lys Leu Asp Thr Leu Ala Lys Phe Thr Leu Al - #a Val Ile Asn Thr Lys # 125 - Gly Arg Val Gly Asp Glu Ala Leu Ser Glu Ph - #e Leu Glu Ala Gly Tyr # 140 - Thr Gln Gln Asn Ala Leu Asp Val Val Phe Gl - #y Val Ser Leu Ala Ile 145 1 - #50 1 - #55 1 - #60 - Leu Cys Asn Tyr Ala Asn Asn Leu Ala Asn Th - #r Pro Ile Asn Pro Glu # 175 - Leu Gln Pro Tyr Ala 180 - (2) INFORMATION FOR SEQ ID NO: 63: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 811 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #63: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GGCTAAGGAA AGAGTGGATG TACTAGCTTA TAAACAGGGG TTGTTTGAAA CG - #AGAGAGCA 60 - GGCCAAGCGA GGTGTGATGG CTGGCCTAGT CGTAGCAGTC CTTAATGGAG AA - #CGGTTTGA 120 - CAAGCCAGGA GAGAAAATTC CAGATGACAC CGAATTAAAA CTCAAGGGGG AG - #AAACTCAA 180 - GTATGTCAGC CGTGGTGGTT TGAAACTGGA AAAGGCCTTG CAGGTCTTTG AT - #TTGTCGGT 240 - GGATGGCGCG ACTACGATTG ATATCGGGGC CTCTACTGGA GGTTTTACCG AT - #GTCATGCT 300 - ACAGAATAGT GCCAAGTTGG TCTTTGCAGT CGATGTTGGT ACCAATCAGT TG - #GCTTGGAA 360 - ATTACGCCAA GACCCACGAG TTGTCAGCAT GGAGCAGTTC AATTTCCGCT AT - #GCTGAAAA 420 - GACTGATTTC GAGCAGGAGC CGAGCTTTGC CAGTATTGAT GTGAGTTTCA TT - #TCCCTTAG 480 - TCTGATTTTG CCAGCCTTGC ACCGTGTCTT GGCTGATCAA GGTCAGGTGG TA - #GCACTTGT 540 - CAAACCTCAG TTTGAGGCAG GACGTGAGCA GATTGGGAAA AATGGAATTA TT - #CGAGATGC 600 - TAAGGTTCAT CAGAATGTCC TTGAATCTGT AACAGCTATG GCAGTAGAGG TA - #GGTTTTTC 660 - AGTCCTTGGC TTGGACTTTT CTCCCATCCA AGGTGGACAT GGAAATATTG AA - #TTTTTAGC 720 - GTATTTGAAA AAAGAAAAGT CAGCAAGCAA TCAGATTCTT GCTGAGATTA AA - #GAAGCAGT 780 # 811 CAAT TTAAAAATGA A - (2) INFORMATION FOR SEQ ID NO:64: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 270 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: - Ala Lys Glu Arg Val Asp Val Leu Ala Tyr Ly - #s Gln Gly Leu Phe Glu 
# 15 - Thr Arg Glu Gln Ala Lys Arg Gly Val Met Al - #a Gly Leu Val Val Ala # 30 - Val Leu Asn Gly Glu Arg Phe Asp Lys Pro Gl - #y Glu Lys Ile Pro Asp # 45 - Asp Thr Glu Leu Lys Leu Lys Gly Glu Lys Le - #u Lys Tyr Val Ser Arg # 60 - Gly Gly Leu Lys Leu Glu Lys Ala Leu Gln Va - #l Phe Asp Leu Ser Val #80 - Asp Gly Ala Thr Thr Ile Asp Ile Gly Ala Se - #r Thr Gly Gly Phe Thr # 95 - Asp Val Met Leu Gln Asn Ser Ala Lys Leu Va - #l Phe Ala Val Asp Val # 110 - Gly Thr Asn Gln Leu Ala Trp Lys Leu Arg Gl - #n Asp Pro Arg Val Val # 125 - Ser Met Glu Gln Phe Asn Phe Arg Tyr Ala Gl - #u Lys Thr Asp Phe Glu # 140 - Gln Glu Pro Ser Phe Ala Ser Ile Asp Val Se - #r Phe Ile Ser Leu Ser 145 1 - #50 1 - #55 1 - #60 - Leu Ile Leu Pro Ala Leu His Arg Val Leu Al - #a Asp Gln Gly Gln Val # 175 - Val Ala Leu Val Lys Pro Gln Phe Glu Ala Gl - #y Arg Glu Gln Ile Gly # 190 - Lys Asn Gly Ile Ile Arg Asp Ala Lys Val Hi - #s Gln Asn Val Leu Glu # 205 - Ser Val Thr Ala Met Ala Val Glu Val Gly Ph - #e Ser Val Leu Gly Leu # 220 - Asp Phe Ser Pro Ile Gln Gly Gly His Gly As - #n Ile Glu Phe Leu Ala 225 2 - #30 2 - #35 2 - #40 - Tyr Leu Lys Lys Glu Lys Ser Ala Ser Asn Gl - #n Ile Leu Ala Glu Ile # 255 - Lys Glu Ala Val Glu Arg Ala His Ser Gln Ph - #e Lys Asn Glu # 270 - (2) INFORMATION FOR SEQ ID NO: 65: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 2290 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear - (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 65: - TTGTTCCTAT GAACTTGGTC GTCACCAAGC TGGTCAGGTT AAGAAAGAGT CT - #AATCGAGT 60 - TTCTTATATA GATGGTGATC AGGCTGGTCA AAAGGCAGAA AACTTGACAC CA - #GATGAAGT 120 - CAGTAAGAGG GAGGGGATCA ACGCCGAACA AATNGTNATC AAGATTACGG AT - #CAAGGTTA 180 - TGTGACCTCT CATGGAGACC ATTATCATTA CTATAATGGC AAGGTTCCTT AT - #GATGCCAT 240 - CATCAGTGAA GAGCTCCTCA TGAAAGATCC GAATTATCAG TTGAAGGATT CA - #GACATTGT 300 - CAATGAAATC AAGGGTGGTT ATGTCATTAA GGTAAACGGT AAATACTATG TN - #TACCTTAA 360 - GGATGCAGCT CATGCGGATA ATATTCGGAC AAAAGAAGAG ATTAAACGTC AG - #AAGCAGGA 420 - 
ACGCAGTCAT AATCATAACT CAAGAGCAGA TAATGCTGTT GCTGCAGCCA GA - #GCCCAAGG 480 - ACGTTATACA ACGGATGATG GGTATATCTT CAATGCATCT GATATCATTG AG - #GACACGGG 540 - TGATGCTTAT ATCGTTCCTC ACGGCGACCA TTACCATTAC ATTCCTAAGA AT - #GAGTTATC 600 - AGCTAGCGAG TTAGCTGCTG CAGAAGCCTA TTGGAATGGG AAGCAGGGAT CT - #CGTCCTTC 660 - TTCAAGTTCT AGTTATAATG CAAATCCAGC TCAACCAAGA TTGTCAGAGA AC - #CACAATCT 720 - GACTGTCACT CCAACTTATC ATCAAAATCA AGGGGAAAAC ATTTCAAGCC TT - #TTACGTGA 780 - ATTGTATGCT AAACCCTTAT CAGAACGCCA TGTGGAATCT GATGGCCTTA TT - #TTCGACCC 840 - AGCGCAAATC ACAAGTCGAA CCGCCAGAGG TGTAGCTGTC CCTCATGGTA AC - #CATTACCA 900 - CTTTATCCCT TATGAACAAA TGTCTGAATT GGAAAAACGA ATTGCTCGTA TT - #ATTCCCCT 960 - TCGTTATCGT TCAAACCATT GGGTACCAGA TTCAAGACCA GAACAACCAA GT - #CCACAATC 1020 - GACTCCGGAA CCTAGTCCAA GTCCGCAACC TGCACCAAAT CCTCAACCAG CT - #CCAAGCAA 1080 - TCCAATTGAT GAGAAATTGG TCAAAGAAGC TGTTCGAAAA GTAGGCGATG GT - #TATGTCTT 1140 - TGAGGAGAAT GGAGTTTCTC GTTATATCCC AGCCAAGGAT CTTTCAGCAG AA - #ACAGCAGC 1200 - AGGCATTGAT AGCAAACTGG CCAAGCAGGA AAGTTTATCT CATAAGCTAG GA - #GCTAAGAA 1260 - AACTGACCTC CCATCTAGTG ATCGAGAATT TTACAATAAG GCTTATGACT TA - #CTAGCAAG
1320 - AATTCACCAA GATTTACTTG ATAATAAAGG TCGACAAGTT GATTTTGAGG CT - #TTGGATAA 1380 - CCTGTTGGAA CGACTCAAGG ATGTCNCAAG TGATAAAGTC AAGTTAGTGG AN - #GATATTCT 1440 - TGCCTTCTTA GCTCCGATTC GTCATCCAGA ACGTTTAGGA AAACCAAATG CG - #CAAATTAC 1500 - CTACACTGAT GATGAGATTC AAGTAGCCAA GTTGGCAGGC AAGTACACAA CA - #GAAGACGG 1560 - TTATATCTTT GATCCTCGTG ATATAACCAG TGATGAGGGG GATGCCTATG TA - #ACTCCACA 1620 - TATGACCCAT AGCCACTGGA TTAAAAAAGA TAGTTTGTCT GAAGCTGAGA GA - #GCGGCAGC 1680 - CCAGGCTTAT GCTAAAGAGA AAGGTTTGAC CCCTCCTTCG ACAGACCATC AG - #GATTCAGG 1740 - AAATACTGAG GCAAAAGGAG CAGAAGCTAT CTACAACCGC GTGAAAGCAG CT - #AAGAAGGT 1800 - GCCACTTGAT CGTATGCCTT ACAATCTTCA ATATACTGTA GAAGTCAAAA AC - #GGTAGTTT 1860 - AATCATACCT CATTATGACC ATTACCATAA CATCAAATTT GAGTGGTTTG AC - #GAAGGCCT 1920 - TTATGAGGCA CCTAAGGGGT ATACTCTTGA GGATCTTTTG GCGACTGTCA AG - #TACTATGT 1980 - CGAACATCCA AACGAACGTC CGCATTCAGA TAATGGTTTT GGTAACGCTA GC - #GACCATGT 2040 - TCAAAGAAAC AAAAATGGTC AAGCTGATAC CAATCAAACG GAAAAACCAA GC - #GAGGAGAA 2100 - ACCTCAGACA GAAAAACCTG AGGAAGAAAC CCCTCGAGAA GAGAAACCGC AA - #AGCGAGAA 2160 - ACCAGAGTCT CCAAAACCAA CAGAGGAACC AGAAGAATCA CCAGAGGAAT CA - #GAAGAACC 2220 - TCAGGTCGAG ACTGAAAAGG TTGAAGAAAA ACTGAGAGAG GCTGAAGATT TA - #CTTGGAAA 2280 # 2290 - (2) INFORMATION FOR SEQ ID NO:66: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 763 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: - Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gl - #y Gln Val Lys Lys Glu # 15 - Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gl - #n Ala Gly Gln Lys Ala # 30 - Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Ar - #g Glu Gly Ile Asn Ala # 45 - Glu Gln Xaa Val Ile Lys Ile Thr Asp Gln Gl - #y Tyr Val Thr Ser His # 60 - Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Va - #l Pro Tyr Asp Ala Ile #80 - Ile Ser Glu Glu Leu Leu Met Lys Asp Pro As - #n Tyr Gln Leu Lys Asp # 95 - Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Ty - #r Val Ile Lys Val Asn 
# 110 - Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Al - #a His Ala Asp Asn Ile # 125 - Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gl - #n Glu Arg Ser His Asn # 140 - His Asn Ser Arg Ala Asp Asn Ala Val Ala Al - #a Ala Arg Ala Gln Gly 145 1 - #50 1 - #55 1 - #60 - Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe As - #n Ala Ser Asp Ile Ile # 175 - Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro Hi - #s Gly Asp His Tyr His # 190
- Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Gl - #u Leu Ala Ala Ala Glu # 205 - Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pr - #o Ser Ser Ser Ser Ser # 220 - Tyr Asn Ala Asn Pro Ala Gln Pro Arg Leu Se - #r Glu Asn His Asn Leu 225 2 - #30 2 - #35 2 - #40 - Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gl - #y Glu Asn Ile Ser Ser # 255 - Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Se - #r Glu Arg His Val Glu # 270 - Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Il - #e Thr Ser Arg Thr Ala # 285 - Arg Gly Val Ala Val Pro His Gly Asn His Ty - #r His Phe Ile Pro Tyr # 300 - Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Al - #a Arg Ile Ile Pro Leu 305 3 - #10 3 - #15 3 - #20 - Arg Tyr Arg Ser Asn His Trp Val Pro Asp Se - #r Arg Pro Glu Gln Pro # 335 - Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Se - #r Pro Gln Pro Ala Pro # 350 - Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile As - #p Glu Lys Leu Val Lys # 365 - Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Va - #l Phe Glu Glu Asn Gly # 380 - Val Ser Arg Tyr Ile Pro Ala Lys Asp Leu Se - #r Ala Glu Thr Ala Ala 385 3 - #90 3 - #95 4 - #00 - Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Se - #r Leu Ser His Lys Leu # 415 - Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser As - #p Arg Glu Phe Tyr Asn # 430 - Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gl - #n Asp Leu Leu Asp Asn # 445 - Lys Gly Arg Gln Val Asp Phe Glu Ala Leu As - #p Asn Leu Leu Glu Arg # 460 - Leu Lys Asp Val Xaa Ser Asp Lys Val Lys Le - #u Val Xaa Asp Ile Leu 465 4 - #70 4 - #75 4 - #80 - Ala Phe Leu Ala Pro Ile Arg His Pro Glu Ar - #g Leu Gly Lys Pro Asn # 495 - Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gl - #n Val Ala Lys Leu Ala # 510 - Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Ph - #e Asp Pro Arg Asp Ile # 525 - Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pr - #o His Met Thr His Ser # 540 - His Trp Ile Lys Lys Asp Ser Leu Ser Glu Al - #a Glu Arg Ala Ala Ala 545 5 - #50 5 - #55 5 - #60 - Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pr - #o Pro Ser Thr Asp His # 575 - Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Al - #a Glu Ala Ile Tyr 
Asn # 590 - Arg Val Lys Ala Ala Lys Lys Val Pro Leu As - #p Arg Met Pro Tyr Asn # 605 - Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Se - #r Leu Ile Ile Pro His # 620 - Tyr Asp His Tyr His Asn Ile Lys Phe Glu Tr - #p Phe Asp Glu Gly Leu 625 6 - #30 6 - #35 6 - #40 - Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu As - #p Leu Leu Ala Thr Val # 655 - Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pr - #o His Ser Asp Asn Gly # 670 - Phe Gly Asn Ala Ser Asp His Val Gln Arg As - #n Lys Asn Gly Gln Ala # 685 - Asp Thr Asn Gln Thr Glu Lys Pro Ser Glu Gl - #u Lys Pro Gln Thr Glu # 700 - Lys Pro Glu Glu Glu Thr Pro Arg Glu Glu Ly - #s Pro Gln Ser Glu Lys 705 7 - #10 7 - #15 7 - #20 - Pro Glu Ser Pro Lys Pro Thr Glu Glu Pro Gl - #u Glu Ser Pro Glu Glu # 735 - Ser Glu Glu Pro Gln Val Glu Thr Glu Lys Va - #l Glu Glu Lys Leu Arg # 750 - Glu Ala Glu Asp Leu Leu Gly Lys Ile Gln As - #p # 760 - (2) INFORMATION FOR SEQ ID NO: 67: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 352 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #67: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TTATAAGGGT GAATTAGAAA AAGGATACCA ATTTGATGGT TGGGAAATTT CT - #GGTTTCGA 60 - AGGTAAAAAA GACGCTGGCT ATGTTATTAA TCTATCAAAA GATACCTTTA TA - #AAACCTGT 120 - ATTCAAGAAA ATAGAGGAGA AAAAGGAGGA AGAAAATAAA CCTACTTTTG AT - #GTATCGAA 180 - AAAGAAAGAT AACCCACAAG TAAACCATAG TCAATTAAAT GAAAGTCACA GA - #AAAGAGGA 240 - TTTACAAAGA GAAGAGCATT CACAAAAATC TGATTCAACT AAGGATGTTA CA - #GCTACAGT 300 - TCTTGATAAA AACAATATCA GTAGTAAATC AACTACTAAC AATCCTAATA AG - # 352 - (2) INFORMATION FOR SEQ ID NO:68: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 117 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: - Tyr Lys Gly Glu Leu Glu Lys Gly Tyr Gln Ph - #e Asp Gly Trp Glu Ile # 15 - Ser Gly Phe Glu Gly Lys Lys Asp Ala Gly Ty - #r Val Ile Asn Leu Ser # 30 - Lys Asp Thr Phe Ile Lys Pro Val Phe Lys Ly - #s Ile Glu Glu Lys Lys # 45 - Glu Glu 
Glu Asn Lys Pro Thr Phe Asp Val Se - #r Lys Lys Lys Asp Asn # 60 - Pro Gln Val Asn His Ser Gln Leu Asn Glu Se - #r His Arg Lys Glu Asp #80 - Leu Gln Arg Glu Glu His Ser Gln Lys Ser As - #p Ser Thr Lys Asp Val # 95 - Thr Ala Thr Val Leu Asp Lys Asn Asn Ile Se - #r Ser Lys Ser Thr Thr # 110 - Asn Asn Pro Asn Lys 115 - (2) INFORMATION FOR SEQ ID NO: 69: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1312 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #SEQ ID NO: 69: SEQUENCE DESCRIPTION: - GAATGTTCAG GCTCAAGAAA GTTCAGGAAA TAAAATCCAC TTTATCAATG TT - #CAAGAAGG 60 - TGGCAGTGAT GCGATTATTC TTGAAAGCAA TGGACATTTT GCCATGGTGG AT - #ACAGGAGA 120 - AGATTATGAT TTCCCAGATG GAAGTGATTC TCGCTATCCA TGGAGAGAAG GA - #ATTGAAAC 180 - GTCTTATAAG CATGTTCTAA CAGACCGTGT CTTTCGTCGT TTGAAGGAAT TG - #GGTGTCCA 240 - AAAACTTGAT TTTATTTTGG TGACCCATAC CCACAGTGAT CATATTGGAA AT - #GTTGATGA 300 - ATTACTGTCT ACCTATCCAG TTGACCGAGT CTATCTTAAG AAATATAGTG AT - #AGTCGTAT 360 - TACTAATTCT GAACGTCTAT GGGATAATCT GTATGGCTAT GATAAGGTTT TA - #CAGACTGC 420 - TGCAGAAAAA GGTGTTTCAG TTATTCAAAA TATCACACAA GGGGATGCTC AT - #TTTCAGTT 480 - TGGGGACATG GATATTCAGC TCTATAATTA TGAAAATGAA ACTGATTCAT CG - #GGTGAATT 540 - AAAGAAAATT TGGGATGACA ATTCCAATTC CTTGATTAGC GTGGTGAAAG TC - #AATGGCAA 600 - GAAAATTTAC CTTGGGGGCG ATTTAGATAA TGTTCATGGA GCAGAAGACA AG - #TATGGTCC 660 - TCTCATTGGA AAAGTTGATT TGATGAAGTT TAATCATCAC CATGATACCA AC - #AAATCAAA 720 - TACCAAGGAT TTCATTAAAA ATTTGAGTCC GAGTTTGATT GTTCAAACTT CG - #GATAGTCT 780 - ACCTTGGAAA AATGGTGTTG ATAGTGAGTA TGTTAATTGG CTCAAAGAAC GA - #GGAATTGA 840 - GAGAATCAAC GCAGCCAGCA AAGACTATGA TGCAACAGTT TTTGATATTC GA - #AAAGACGG 900 - TTTTGTCAAT ATTTCAACAT CCTACAAGCC GATTCCAAGT TTTCAAGCTG GT - #TGGCATAA 960 - GAGTGCATAT GGGAACTGGT GGTATCAAGC GCCTGATTCT ACAGGAGAGT AT - #GCTGTCGG 1020 - TTGGAATGAA ATCGAAGGTG AATGGTATTA CTTTAACCAA ACGGGTATCT TG - #TTACAGAA 1080 - TCAATGGAAA AAATGGAACA ATCATTGGTT CTATTTGACA GACTCTGGTG CT - #TCTGCTAA 1140 - AAATTGGAAG AAAATCGCTG 
GAATCTGGTA TTATTTTAAC AAAGAAAACC AG - #ATGGAAAT 1200 - TGGTTGGATT CAAGATAAAG AGCAGTGGTA TTATTTGGAT GTTGATGGTT CT - #ATGAAGAC 1260 - AGGATGGCTT CAATATATGG GGCAATGGTA TTACTTTGCT CCATCAGGGG AA - # 1312 - (2) INFORMATION FOR SEQ ID NO:70: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 437 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: - Asn Val Gln Ala Gln Glu Ser Ser Gly Asn Ly - #s Ile His Phe Ile Asn # 15 - Val Gln Glu Gly Gly Ser Asp Ala Ile Ile Le - #u Glu Ser Asn Gly His # 30 - Phe Ala Met Val Asp Thr Gly Glu Asp Tyr As - #p Phe Pro Asp Gly Ser # 45 - Asp Ser Arg Tyr Pro Trp Arg Glu Gly Ile Gl - #u Thr Ser Tyr Lys His # 60 - Val Leu Thr Asp Arg Val Phe Arg Arg Leu Ly - #s Glu Leu Gly Val Gln #80 - Lys Leu Asp Phe Ile Leu Val Thr His Thr Hi - #s Ser Asp His Ile Gly # 95 - Asn Val Asp Glu Leu Leu Ser Thr Tyr Pro Va - #l Asp Arg Val Tyr Leu # 110 - Lys Lys Tyr Ser Asp Ser Arg Ile Thr Asn Se - #r Glu Arg Leu Trp Asp # 125 - Asn Leu Tyr Gly Tyr Asp Lys Val Leu Gln Th - #r Ala Ala Glu Lys Gly # 140 - Val Ser Val Ile Gln Asn Ile Thr Gln Gly As - #p Ala His Phe Gln Phe 145 1 - #50 1 - #55 1 - #60 - Gly Asp Met Asp Ile Gln Leu Tyr Asn Tyr Gl - #u Asn Glu Thr Asp Ser # 175 - Ser Gly Glu Leu Lys Lys Ile Trp Asp Asp As - #n Ser Asn Ser Leu Ile # 190 - Ser Val Val Lys Val Asn Gly Lys Lys Ile Ty - #r Leu Gly Gly Asp Leu # 205 - Asp Asn Val His Gly Ala Glu Asp Lys Tyr Gl - #y Pro Leu Ile Gly Lys # 220 - Val Asp Leu Met Lys Phe Asn His His His As - #p Thr Asn Lys Ser Asn 225 2 - #30 2 - #35 2 - #40 - Thr Lys Asp Phe Ile Lys Asn Leu Ser Pro Se - #r Leu Ile Val Gln Thr # 255 - Ser Asp Ser Leu Pro Trp Lys Asn Gly Val As - #p Ser Glu Tyr Val Asn # 270 - Trp Leu Lys Glu Arg Gly Ile Glu Arg Ile As - #n Ala Ala Ser Lys Asp # 285 - Tyr Asp Ala Thr Val Phe Asp Ile Arg Lys As - #p Gly Phe Val Asn Ile # 300 - Ser Thr Ser Tyr Lys Pro Ile Pro Ser Phe Gl - #n Ala Gly Trp His Lys 305 3 - #10 
3 - #15 3 - #20 - Ser Ala Tyr Gly Asn Trp Trp Tyr Gln Ala Pr - #o Asp Ser Thr Gly Glu # 335 - Tyr Ala Val Gly Trp Asn Glu Ile Glu Gly Gl - #u Trp Tyr Tyr Phe Asn # 350 - Gln Thr Gly Ile Leu Leu Gln Asn Gln Trp Ly - #s Lys Trp Asn Asn His # 365 - Trp Phe Tyr Leu Thr Asp Ser Gly Ala Ser Al - #a Lys Asn Trp Lys Lys # 380 - Ile Ala Gly Ile Trp Tyr Tyr Phe Asn Lys Gl - #u Asn Gln Met Glu Ile 385 3 - #90 3 - #95 4 - #00 - Gly Trp Ile Gln Asp Lys Glu Gln Trp Tyr Ty - #r Leu Asp Val Asp Gly # 415 - Ser Met Lys Thr Gly Trp Leu Gln Tyr Met Gl - #y Gln Trp Tyr Tyr Phe # 430 - Ala Pro Ser Gly Glu 435 - (2) INFORMATION FOR SEQ ID NO: 71: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1855 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #71: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CTTGGGTGTA ACCCATATCC AGCTCCTTCC AGTCTTGTCT TACTACTTTG TC - #AATGAATT 60 - GAAAAACCAT GAACGCTTGT CTGACTACGC TTCAAGCAAC AGCAACTACA AC - #TGGGGATA 120 - TGACCCTCAA AACTACTTCT CCTTGACTGG TATGTACTCA AGCGATCCTA AG - #AATCCAGA 180 - AAAACGAATC GCAGAATTTA AAAACCTCAT CAACGAAATC CACAAACGTG GT - #ATGGGAGC
240 - TATCCTAGAT GTCGTTTATA ACCACACAGC CAAAGTCGAT CTCTTTGAAG AT - #TTGGAACC 300 - AAACTACTAC CACTTTATGG ATGCCGATGG CACACCTCGA ACTAGCTTTG GT - #GGTGGACG 360 - CTTGGGGACA ACCCACCATA TGACCAAACG GCTCCTAATT GACTCTATCA AA - #TACCTAGT 420 - TGATACCTAC AAAGTGGATG GCTTCCGTTT CGATATGATG GGAGACCATG AC - #GCCGCTTC 480 - TATCGAAGAA GCTTACAAGG CTGCACGCGC CCTCAATCCA AACCTCATCA TG - #CTTGGTGA 540 - AGGTTGGAGA ACCTATGCCG GTGATGAAAA CATGCCTACT AAAGCTGCTG AC - #CAAGATTG 600 - GATGAAACAT ACCGATACTG TCGCTGTCTT TTCAGATGAC ATCCGTAACA AC - #CTCAAATC 660 - TGGTTATCCA AACGAAGGTC AACCTGCCTT TATCACAGGT GGCAAGCGTG AT - #GTCAACAC 720 - CATCTTTAAA AATCTCATTG CTCAACCAAC TAACTTTGAA GCTGACAGCC CT - #GGAGATGT 780 - CATCCAATAC ATCGCAGCCC ATGATAACTT GACCCTCTTT GACATCATTG CC - #CAGTCTAT 840 - CAAAAAAGAC CCAAGCAAGG CTGAGAACTA TGCTGAAATC CACCGTCGTT TA - #CGACTTGG 900 - AAATCTCATG GTCTTGACAG CTCAAGGAAC TCCATTTATC CACTCCGGTC AG - #GAATATGG 960 - ACGTACTAAA CAATTCCGTG ACCCAGCCTA CAAGACTCCA GTAGCAGAGG AT - #AAGGTTCC 1020 - AAACAAATCT CACTTGTTGC GTGATAAGGA CGGCAACCCA TTTGACTATC CT - #TACTTCAT 1080 - CCATGACTCT TACGATTCTA GTGATGCAGT CAACAAGTTT GACTGGACTA AG - #GCTACAGA 1140 - TGGTAAAGCT TATCCTGAAA ATGTCAAGAG CCGTGACTAT ATGAAAGGTT TG - #ATTGCCCT 1200 - TCGTCAATCT ACAGATGCCT TCCGACTTAA GAGTCTTCAA GATATCAAAG AC - #CGTGTCCA 1260 - CCTCATCACT GTCCCAGGCC AAAATGGTGT GGAAAAAGAG GATGTAGTGA TT - #GGCTACCA 1320 - AATCACTGCT CCAAACGGCG ATATCTACGC AGTCTTTGTC AATGCGGATG AA - #AAAGCTCG 1380 - CGAATTTAAT TTGGGAACTG CCTTTGCACA TCTAAGAAAT GCGGAAGTTT TG - #GCAGATGA 1440 - AAACCAAGCA GGACCAGTCG GAATTGCCAA CCCGAAAGGA CTTGAATGGA CT - #GAAAAAGG 1500 - CTTGAAATTG AATGCCCTTA CAGCTACTGT TCTTCGAGTC TCTCAAAATG GA - #ACTAGCCA 1560 - TGAGTCAACT GCAGAAGAGA AACCAGACTC AACCCCTTCC AAGCCTGAAC AT - #CAAAATGA 1620 - AGCTTCTCAC CCTGCACATC AAGACCCAGC TCCAGAAGCT AGACCTGATT CT - #ACTAAACC 1680 - AGATGCCAAA GTAGCTGATG CGGAAAATAA ACCTAGCCAA GCTACAGCTG AT - #TCACAAGC 1740 - TGAACAACCA GCACAAGAAG CACAAGCATC ATCTGTAAAA GAAGCGGTTC GA - #AACGAATC 1800 - 
GGTAGAAAAC TCTAGCAAGG AAAATATACC TGCAACCCCA GATAAACAAG CT - #GAA 1855 - (2) INFORMATION FOR SEQ ID NO:72: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 618 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: - Leu Gly Val Thr His Ile Gln Leu Leu Pro Va - #l Leu Ser Tyr Tyr Phe # 15 - Val Asn Glu Leu Lys Asn His Glu Arg Leu Se - #r Asp Tyr Ala Ser Ser # 30 - Asn Ser Asn Tyr Asn Trp Gly Tyr Asp Pro Gl - #n Asn Tyr Phe Ser Leu # 45 - Thr Gly Met Tyr Ser Ser Asp Pro Lys Asn Pr - #o Glu Lys Arg Ile Ala # 60 - Glu Phe Lys Asn Leu Ile Asn Glu Ile His Ly - #s Arg Gly Met Gly Ala #80 - Ile Leu Asp Val Val Tyr Asn His Thr Ala Ly - #s Val Asp Leu Phe Glu # 95 - Asp Leu Glu Pro Asn Tyr Tyr His Phe Met As - #p Ala Asp Gly Thr Pro # 110 - Arg Thr Ser Phe Gly Gly Gly Arg Leu Gly Th - #r Thr His His Met Thr # 125 - Lys Arg Leu Leu Ile Asp Ser Ile Lys Tyr Le - #u Val Asp Thr Tyr Lys # 140 - Val Asp Gly Phe Arg Phe Asp Met Met Gly As - #p His Asp Ala Ala Ser 145 1 - #50 1 - #55 1 - #60 - Ile Glu Glu Ala Tyr Lys Ala Ala Arg Ala Le - #u Asn Pro Asn Leu Ile # 175 - Met Leu Gly Glu Gly Trp Arg Thr Tyr Ala Gl - #y Asp Glu Asn Met Pro # 190 - Thr Lys Ala Ala Asp Gln Asp Trp Met Lys Hi - #s Thr Asp Thr Val Ala # 205 - Val Phe Ser Asp Asp Ile Arg Asn Asn Leu Ly - #s Ser Gly Tyr Pro Asn # 220 - Glu Gly Gln Pro Ala Phe Ile Thr Gly Gly Ly - #s Arg Asp Val Asn Thr 225 2 - #30 2 - #35 2 - #40 - Ile Phe Lys Asn Leu Ile Ala Gln Pro Thr As - #n Phe Glu Ala Asp Ser # 255 - Pro Gly Asp Val Ile Gln Tyr Ile Ala Ala Hi - #s Asp Asn Leu Thr Leu # 270 - Phe Asp Ile Ile Ala Gln Ser Ile Lys Lys As - #p Pro Ser Lys Ala Glu # 285 - Asn Tyr Ala Glu Ile His Arg Arg Leu Arg Le - #u Gly Asn Leu Met Val # 300 - Leu Thr Ala Gln Gly Thr Pro Phe Ile His Se - #r Gly Gln Glu Tyr Gly 305 3 - #10 3 - #15 3 - #20 - Arg Thr Lys Gln Phe Arg Asp Pro Ala Tyr Ly - #s Thr Pro Val Ala Glu # 335 - Asp Lys Val Pro Asn Lys Ser His Leu 
Leu Ar - #g Asp Lys Asp Gly Asn # 350 - Pro Phe Asp Tyr Pro Tyr Phe Ile His Asp Se - #r Tyr Asp Ser Ser Asp # 365 - Ala Val Asn Lys Phe Asp Trp Thr Lys Ala Th - #r Asp Gly Lys Ala Tyr # 380 - Pro Glu Asn Val Lys Ser Arg Asp Tyr Met Ly - #s Gly Leu Ile Ala Leu 385 3 - #90 3 - #95 4 - #00 - Arg Gln Ser Thr Asp Ala Phe Arg Leu Lys Se - #r Leu Gln Asp Ile Lys # 415 - Asp Arg Val His Leu Ile Thr Val Pro Gly Gl - #n Asn Gly Val Glu Lys # 430 - Glu Asp Val Val Ile Gly Tyr Gln Ile Thr Al - #a Pro Asn Gly Asp Ile # 445 - Tyr Ala Val Phe Val Asn Ala Asp Glu Lys Al - #a Arg Glu Phe Asn Leu # 460 - Gly Thr Ala Phe Ala His Leu Arg Asn Ala Gl - #u Val Leu Ala Asp Glu 465 4 - #70 4 - #75 4 - #80 - Asn Gln Ala Gly Pro Val Gly Ile Ala Asn Pr - #o Lys Gly Leu Glu Trp # 495 - Thr Glu Lys Gly Leu Lys Leu Asn Ala Leu Th - #r Ala Thr Val Leu Arg # 510 - Val Ser Gln Asn Gly Thr Ser His Glu Ser Th - #r Ala Glu Glu Lys Pro # 525 - Asp Ser Thr Pro Ser Lys Pro Glu His Gln As - #n Glu Ala Ser His Pro # 540 - Ala His Gln Asp Pro Ala Pro Glu Ala Arg Pr - #o Asp Ser Thr Lys Pro 545 5 - #50 5 - #55 5 - #60 - Asp Ala Lys Val Ala Asp Ala Glu Asn Lys Pr - #o Ser Gln Ala Thr Ala # 575 - Asp Ser Gln Ala Glu Gln Pro Ala Gln Glu Al - #a Gln Ala Ser Ser Val # 590 - Lys Glu Ala Val Arg Asn Glu Ser Val Glu As - #n Ser Ser Lys Glu Asn # 605 - Ile Pro Ala Thr Pro Asp Lys Gln Ala Glu # 615 - (2) INFORMATION FOR SEQ ID NO: 73: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1774 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #73: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TAGTGATGGT ACTTGGCAAG GAAAACAGTA TCTGAAAGAA GATGGCAGTC AA - #GCAGCAAA 60 - TGAGTGGGTT TTNGATACTC ATTATCAATC TTGGTTCTAT ATAAAAGCAG AT - #GCTAACTA 120 - TGCTGAAAAT GAATGGCTAA AGCAAGGTGA CGACTATTTT TACCTCAAAT CT - #GGTGGCTA 180 - TATGGCCAAA TCAGAATGGG TAGAAGACAA GGGAGCCTTT TATTATCTTG AC - #CAAGATGG 240 - AAAGATGAAA AGAAATGCTT GGGTAGGAAC TTCCTATGTT GGTGCAACAG GT - #GCCAAAGT 300 - AATAGAAGAC TGGGTCTATG ATTCTCAATA 
CGATGCTTGG TTTTATATCA AA - #GCAGATGG 360 - ACAGCACGCA GAGAAAGAAT GGCTCCAAAT TAAAGGGAAG GACTATTATT TC - #AAATCCGG 420 - TGGTTATCTA CTGACAAGTC AGTGGATTAA TCAAGCTTAT GTGAATGCTA GT - #GGTGCCAA 480 - AGTACAGCAA GGTTGGCTTT TTGACAAACA ATACCAATCT TGGTTTTACA TC - #AAAGAAAA 540 - TGGAAACTAT GCTGATAAAG AATGGATTTT CGAGAATGGT CACTATTATT AT - #CTAAAATC 600 - CGGTGGCTAC ATGGCAGCCA ATGAATGGAT TTGGGATAAG GAATCTTGGT TT - #TATCTCAA 660 - ATTTGATGGG AAAATGGCTG AAAAAGAATG GGTCTACGAT TCTCATAGTC AA - #GCTTGGTA 720 - CTACTTCAAA TCCGGTGGTT ACATGACAGC CAATGAATGG ATTTGGGATA AG - #GAATCTTG 780 - GTTTTACCTC AAATCTGATG GGAAAATAGC TGAAAAAGAA TGGGTCTACG AT - #TCTCATAG 840 - TCAAGCTTGG TACTACTTCA AATCTGGTGG CTACATGGCG AAAAATGAGA CA - #GTAGATGG 900 - TTATCAGCTT GGAAGCGATG GTAAATGGCT TGGAGGAAAA ACTACAAATG AA - #AATGCTGC 960 - TTACTATCAA GTAGTGCCTG TTACAGCCAA TGTTTATGAT TCAGATGGTG AA - #AAGCTTTC 1020 - CTATATATCG CAAGGTAGTG TCGTATGGCT AGATAAGGAT AGAAAAAGTG AT - #GACAAGCG 1080 - CTTGGCTATT ACTATTTCTG GTTTGTCAGG CTATATGAAA ACAGAAGATT TA - #CAAGCGCT 1140 - AGATGCTAGT AAGGACTTTA TCCCTTATTA TGAGAGTGAT GGCCACCGTT TT - #TATCACTA 1200 - TGTGGCTCAG AATGCTAGTA TCCCAGTAGC TTCTCATCTT TCTGATATGG AA - #GTAGGCAA 1260 - GAAATATTAT TCGGCAGATG GCCTGCATTT TGATGGTTTT AAGCTTGAGA AT - #CCCTTCCT 1320 - TTTCAAAGAT TTAACAGAGG CTACAAACTA CAGTGCTGAA GAATTGGATA AG - #GTATTTAG 1380 - TTTGCTAAAC ATTAACAATA GCCTTTTGGA GAACAAGGGC GCTACTTTTA AG - #GAAGCCGA 1440 - AGAACATTAC CATATCAATG CTCTTTATCT CCTTGCCCAT AGTGCCCTAG AA - #AGTAACTG 1500 - GGGAAGAAGT AAAATTGCCA AAGATAAGAA TAATTTCTTT GGCATTACAG CC - #TATGATAC 1560 - GACCCCTTAC CTTTCTGCTA AGACATTTGA TGATGTGGAT AAGGGAATTT TA - #GGTGCAAC 1620 - CAAGTGGATT AAGGAAAATT ATATCGATAG GGGAAGAACT TTCCTTGGAA AC - #AAGGCTTC 1680 - TGGTATGAAT GTGGAATATG CTTCAGACCC TTATTGGGGC GAAAAAATTG CT - #AGTGTGAT 1740 # 1774 AAGC TAGGTGGCAA AGAT - (2) INFORMATION FOR SEQ ID NO:74: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 591 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - 
(ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: - Ser Asp Gly Thr Trp Gln Gly Lys Gln Tyr Le - #u Lys Glu Asp Gly Ser # 15 - Gln Ala Ala Asn Glu Trp Val Xaa Asp Thr Hi - #s Tyr Gln Ser Trp Phe # 30 - Tyr Ile Lys Ala Asp Ala Asn Tyr Ala Glu As - #n Glu Trp Leu Lys Gln # 45 - Gly Asp Asp Tyr Phe Tyr Leu Lys Ser Gly Gl - #y Tyr Met Ala Lys Ser # 60 - Glu Trp Val Glu Asp Lys Gly Ala Phe Tyr Ty - #r Leu Asp Gln Asp Gly #80 - Lys Met Lys Arg Asn Ala Trp Val Gly Thr Se - #r Tyr Val Gly Ala Thr # 95 - Gly Ala Lys Val Ile Glu Asp Trp Val Tyr As - #p Ser Gln Tyr Asp Ala # 110 - Trp Phe Tyr Ile Lys Ala Asp Gly Gln His Al - #a Glu Lys Glu Trp Leu # 125 - Gln Ile Lys Gly Lys Asp Tyr Tyr Phe Lys Se - #r Gly Gly Tyr Leu Leu # 140 - Thr Ser Gln Trp Ile Asn Gln Ala Tyr Val As - #n Ala Ser Gly Ala Lys 145 1 - #50 1 - #55 1 - #60 - Val Gln Gln Gly Trp Leu Phe Asp Lys Gln Ty - #r Gln Ser Trp Phe Tyr # 175 - Ile Lys Glu Asn Gly Asn Tyr Ala Asp Lys Gl - #u Trp Ile Phe Glu Asn # 190 - Gly His Tyr Tyr Tyr Leu Lys Ser Gly Gly Ty - #r Met Ala Ala Asn Glu # 205 - Trp Ile Trp Asp Lys Glu Ser Trp Phe Tyr Le - #u Lys Phe Asp Gly Lys # 220 - Met Ala Glu Lys Glu Trp Val Tyr Asp Ser Hi - #s Ser Gln Ala Trp Tyr 225 2 - #30 2 - #35 2 - #40 - Tyr Phe Lys Ser Gly Gly Tyr Met Thr Ala As - #n Glu Trp Ile Trp Asp # 255

- Lys Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gl - #y Lys Ile Ala Glu Lys # 270 - Glu Trp Val Tyr Asp Ser His Ser Gln Ala Tr - #p Tyr Tyr Phe Lys Ser # 285 - Gly Gly Tyr Met Ala Lys Asn Glu Thr Val As - #p Gly Tyr Gln Leu Gly # 300 - Ser Asp Gly Lys Trp Leu Gly Gly Lys Thr Th - #r Asn Glu Asn Ala Ala 305 3 - #10 3 - #15 3 - #20 - Tyr Tyr Gln Val Val Pro Val Thr Ala Asn Va - #l Tyr Asp Ser Asp Gly # 335 - Glu Lys Leu Ser Tyr Ile Ser Gln Gly Ser Va - #l Val Trp Leu Asp Lys # 350 - Asp Arg Lys Ser Asp Asp Lys Arg Leu Ala Il - #e Thr Ile Ser Gly Leu # 365 - Ser Gly Tyr Met Lys Thr Glu Asp Leu Gln Al - #a Leu Asp Ala Ser Lys # 380 - Asp Phe Ile Pro Tyr Tyr Glu Ser Asp Gly Hi - #s Arg Phe Tyr His Tyr 385 3 - #90 3 - #95 4 - #00 - Val Ala Gln Asn Ala Ser Ile Pro Val Ala Se - #r His Leu Ser Asp Met # 415 - Glu Val Gly Lys Lys Tyr Tyr Ser Ala Asp Gl - #y Leu His Phe Asp Gly # 430 - Phe Lys Leu Glu Asn Pro Phe Leu Phe Lys As - #p Leu Thr Glu Ala Thr # 445 - Asn Tyr Ser Ala Glu Glu Leu Asp Lys Val Ph - #e Ser Leu Leu Asn Ile # 460 - Asn Asn Ser Leu Leu Glu Asn Lys Gly Ala Th - #r Phe Lys Glu Ala Glu 465 4 - #70 4 - #75 4 - #80 - Glu His Tyr His Ile Asn Ala Leu Tyr Leu Le - #u Ala His Ser Ala Leu # 495 - Glu Ser Asn Trp Gly Arg Ser Lys Ile Ala Ly - #s Asp Lys Asn Asn Phe # 510 - Phe Gly Ile Thr Ala Tyr Asp Thr Thr Pro Ty - #r Leu Ser Ala Lys Thr # 525 - Phe Asp Asp Val Asp Lys Gly Ile Leu Gly Al - #a Thr Lys Trp Ile Lys # 540 - Glu Asn Tyr Ile Asp Arg Gly Arg Thr Phe Le - #u Gly Asn Lys Ala Ser 545 5 - #50 5 - #55 5 - #60 - Gly Met Asn Val Glu Tyr Ala Ser Asp Pro Ty - #r Trp Gly Glu Lys Ile # 575 - Ala Ser Val Met Met Lys Ile Asn Glu Lys Le - #u Gly Gly Lys Asp # 590 - (2) INFORMATION FOR SEQ ID NO: 75: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1105 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #75: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGGGATTCAA TATGTCAGAG ATGATACTAG AGATAAAGAA GAGGGAATAG AG - #TATGATGA 60 - CGCTGACAAT GGGGATATTA TTGTAAAAGT 
AGCGACTAAA CCTAAGGTAG TA - #ACCAAGAA 120 - AATTTCAAGT ACGCGAATTC GTTATGAAAA AGATGAAACA AAAGACCGTA GT - #GAAAATCC 180 - TGTTACAATT GATGGAGAGG ATGGCTATGT AACTACGACA AGGACCTACG AT - #GTTAATCC 240 - AGAGACTGGT TATGTTACCG AACAGGTTAC TGTTGATAGA AAAGAAGCCA CG - #GATACAGT 300 - TATCAAAGTT CCAGCTAAAA GCAAGGTTGA AGAAGTTCTT GTTCCATTTG CT - #ACTAAATA 360 - TGAAGCAGAC AATGACCTTT CTGCAGGACA GGAGCAAGAG ATTACTCTAG GA - #AAGAATGG 420 - GAAAACAGTT ACAACGATAA CTTATAATGT AGATGGAAAG AGTGGACAAG TA - #ACTGAGAG 480 - TACTTTAAGT CAAAAAAAAG ACTCtCAAAC AAGAGTTGTT AAAAAAAGaA CC - #ArkCCCCA 540 - AGTTCTTGTC CAAGAAATTC CAATCGAAAC AGAATATCTC GATGGCCCaA CT - #CTTGATAA 600 - AaGTCAAGAA GTAGAAGAAG TAGGAGAAAT TGGTAAATTA CTCTTACTAC AA - #TCTATACT 660 - GGTAGATGAA CGTGATGGAA CAATTGAAGA AACTACTTCT CGTCAAATTA CT - #AAAGAGAT 720 - GGTAAAAAGA CGTATAAGGA GAGGGACGAG AGAACCTGAA AAAGTTGTTG TT - #CCTGAGCA 780 - ATCATCTATT CCTTCGTATC CTGTATCTGT TACATCTAAC CAAGGAACAG AT - #GTAGCAGT 840 - AGAACCAGCT AAAGCAGTTG CTCCAACAAC AGACTGGAAA CAAGAAAATG GT - #ATGTGGTA 900 - TTTTTATAAT ACTGATGGTT CCATGGCAAC AGGTTGGGTA CAAGTTAATA GT - #TCATGGTA 960 - CTACCTCAAC AGCAACGGTT CTATGAAAGT CAATCAATGG TTCCAAGTTG GT - #GGTAAATG 1020 - GTATTATGTA AATACATCGG GTGAGTTAGC GGTCAATACA AGTATAGATG GC - #TATAGAGT 1080 # 1105 TGGG TGCGT - (2) INFORMATION FOR SEQ ID NO:76: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 368 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: - Gly Ile Gln Tyr Val Arg Asp Asp Thr Arg As - #p Lys Glu Glu Gly Ile # 15 - Glu Tyr Asp Asp Ala Asp Asn Gly Asp Ile Il - #e Val Lys Val Ala Thr # 30 - Lys Pro Lys Val Val Thr Lys Lys Ile Ser Se - #r Thr Arg Ile Arg Tyr # 45 - Glu Lys Asp Glu Thr Lys Asp Arg Ser Glu As - #n Pro Val Thr Ile Asp # 60 - Gly Glu Asp Gly Tyr Val Thr Thr Thr Arg Th - #r Tyr Asp Val Asn Pro #80 - Glu Thr Gly Tyr Val Thr Glu Gln Val Thr Va - #l Asp Arg Lys Glu Ala # 95 - Thr Asp Thr Val Ile Lys Val Pro 
Ala Lys Se - #r Lys Val Glu Glu Val # 110 - Leu Val Pro Phe Ala Thr Lys Tyr Glu Ala As - #p Asn Asp Leu Ser Ala # 125 - Gly Gln Glu Gln Glu Ile Thr Leu Gly Lys As - #n Gly Lys Thr Val Thr # 140 - Thr Ile Thr Tyr Asn Val Asp Gly Lys Ser Gl - #y Gln Val Thr Glu Ser 145 1 - #50 1 - #55 1 - #60 - Thr Leu Ser Gln Lys Lys Asp Ser Gln Thr Ar - #g Val Val Lys Lys Arg # 175 - Thr Xaa Pro Gln Val Leu Val Gln Glu Ile Pr - #o Ile Glu Thr Glu Tyr # 190 - Leu Asp Gly Pro Thr Leu Asp Lys Ser Gln Gl - #u Val Glu Glu Val Gly # 205 - Glu Ile Gly Lys Leu Leu Leu Leu Gln Ser Il - #e Leu Val Asp Glu Arg # 220 - Asp Gly Thr Ile Glu Glu Thr Thr Ser Arg Gl - #n Ile Thr Lys Glu Met 225 2 - #30 2 - #35 2 - #40 - Val Lys Arg Arg Ile Arg Arg Gly Thr Arg Gl - #u Pro Glu Lys Val Val # 255 - Val Pro Glu Gln Ser Ser Ile Pro Ser Tyr Pr - #o Val Ser Val Thr Ser # 270 - Asn Gln Gly Thr Asp Val Ala Val Glu Pro Al - #a Lys Ala Val Ala Pro # 285 - Thr Thr Asp Trp Lys Gln Glu Asn Gly Met Tr - #p Tyr Phe Tyr Asn Thr # 300 - Asp Gly Ser Met Ala Thr Gly Trp Val Gln Va - #l Asn Ser Ser Trp Tyr 305 3 - #10 3 - #15 3 - #20 - Tyr Leu Asn Ser Asn Gly Ser Met Lys Val As - #n Gln Trp Phe Gln Val # 335 - Gly Gly Lys Trp Tyr Tyr Val Asn Thr Ser Gl - #y Glu Leu Ala Val Asn # 350 - Thr Ser Ile Asp Gly Tyr Arg Val Asn Asp As - #n Gly Glu Trp Val Arg # 365 - (2) INFORMATION FOR SEQ ID NO: 77: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 661 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #77: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GGATAATAGA GAAGCATTAA AAACCTTTAT GACGGGTGAA AATTTTTATC TC - #CAACATTA 60 - TCTAGGAGCA CATAGGGAAG AACTAAATGG AGAGCATGGC TATACCTTCC GT - #GTTTGGGC 120 - ACCTAATGCT CAGGCTGTTC ACTTGGTTGG TGATTTTACC AACTGGATTG AA - #AATCAGAT 180 - TCCAATGGTA AGAAATGATT TTGGGGTCTG GGAAGTCTTT ACCAATATGG CT - #CAAGAAGG 240 - GCATATTTAC AAATATCATG TCACACGTCA AAATGGTCAT CAACTGATGA AG - #ATTGACCC 300 - TTTTGCTGTC AGGTATGAGG CTCGTCCAGG AACAGGGGCA ATCGTAACAG AG - #CTTCCTGA 360 - 
GAAGAAATGG AAGGATGGAC TTTGGCTGGC ACGAAGAAAA CGTTGGGGCT TT - #GAAGAGCG 420 - TCCTGTCAAT ATTTATGAAG TTCACGCTGG ATCATGGAAA AGAAATTCTG AT - #GGCAGTCC 480 - TTATAGTTTT GCCCAGCTCA AGGATGAACT CATTCCTTAT CTCGTTGAAA TG - #AACTATAC 540 - TCATATTGAG TTTATGCCCT TGATGTCCCA TCCTTTGGGC TTGAGTTGGG GG - #TATCAGCT 600 - TATGGGTTAC TTCGCTTTAG AGCATGCTTA TGGCCGACCA GAGGAGTTTC AA - #GATTTTGT 660 # 661 - (2) INFORMATION FOR SEQ ID NO:78: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 220 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: - Asp Asn Arg Glu Ala Leu Lys Thr Phe Met Th - #r Gly Glu Asn Phe Tyr # 15 - Leu Gln His Tyr Leu Gly Ala His Arg Glu Gl - #u Leu Asn Gly Glu His # 30 - Gly Tyr Thr Phe Arg Val Trp Ala Pro Asn Al - #a Gln Ala Val His Leu # 45 - Val Gly Asp Phe Thr Asn Trp Ile Glu Asn Gl - #n Ile Pro Met Val Arg # 60 - Asn Asp Phe Gly Val Trp Glu Val Phe Thr As - #n Met Ala Gln Glu Gly #80 - His Ile Tyr Lys Tyr His Val Thr Arg Gln As - #n Gly His Gln Leu Met # 95 - Lys Ile Asp Pro Phe Ala Val Arg Tyr Glu Al - #a Arg Pro Gly Thr Gly # 110 - Ala Ile Val Thr Glu Leu Pro Glu Lys Lys Tr - #p Lys Asp Gly Leu Trp # 125 - Leu Ala Arg Arg Lys Arg Trp Gly Phe Glu Gl - #u Arg Pro Val Asn Ile # 140 - Tyr Glu Val His Ala Gly Ser Trp Lys Arg As - #n Ser Asp Gly Ser Pro 145 1 - #50 1 - #55 1 - #60 - Tyr Ser Phe Ala Gln Leu Lys Asp Glu Leu Il - #e Pro Tyr Leu Val Glu # 175 - Met Asn Tyr Thr His Ile Glu Phe Met Pro Le - #u Met Ser His Pro Leu # 190 - Gly Leu Ser Trp Gly Tyr Gln Leu Met Gly Ty - #r Phe Ala Leu Glu His # 205 - Ala Tyr Gly Arg Pro Glu Glu Phe Gln Asp Ph - #e Val # 220 - (2) INFORMATION FOR SEQ ID NO: 79: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 976 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #79: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - AGATTTTGTC GAGGAGTGTC ATACCCATAA TATTGGGGTT ATTGTGGACT GG - #GTACCAGN 60 - TCACTTTACC ATCAACGATG 
ATGCCTTAGC CTATTATGAT GGGACACCGA CT - #TTTGAATA 120 - CCAAGACCAT AATAAGGCTC ATAACCATGG TTGGGGTGCC CTTAATTTTG AC - #CTTGGAAA 180 - AAATGAAGTC CAGTCCTTCT TAATTTCTTG CATTAAGCAT TGGATTGATG TC - #TATCATTT 240 - GGATGGTATT CGTGTGGATG CTGTTAGCAA CATGCTCTAT TTGGACTATG AT - #GATGCTCC 300 - ATGGACACCT AATAAAGATG GCGGAAATCT CAACTATGAA GGTTATTATT TC - #CTTCAGCG 360 - CTTGAATGAG GTTATTAAGT TAGAATATCC AGATGTGATG ATGATTGCAG AA - #GAAAGTTC 420 - GTCTGCGATC AAGATTACGG GAATGAAAGA GATTGGTGGT CTAGGATTTG AC - #TACAAATG 480 - GAACATGGGC TGGATGAATG ATATCCTCCG TTTCTACGAA GAAGATCCGA TC - #TATCGTAA 540 - ATATGACTTT AACCTGGTGA CTTTCAGCTT TATGTATGTT TNCAAGGAGA AT - #TATCTCTT 600 - GCCATTCTCG CACGATGAAG TGGTTCATGG CAAGAAGAGT ATGATGCATA AG - #ATGTGGGG 660 - AGATCGTTAC AATCAATTCG CAGGCTTGCG CAATCTCTAT ACGTACCAAA TT - #TGTCACCC 720 - TGGTAAGAAA TTGCTCTTCA TGGGTAGCGA ATACGGTCAA TTCCTAGAAT GG - #AAATCTGA 780 - AGAACAGTTG GAATGGTCTA ACCTAGAAGA CCCAATGAAT GCTAAGATGA AG - #TATTTCGC 840 - TTCTCAGCTA AACCAGTTTT ACAAAGATCA TCGCTGTCTG TGGGAAATTG AT - #ACCAGCTA 900 - TGATGGTATT GAAATCATTG ATGCGGATAA TCGAGACCAG AGTGTTCTTT CC - #TTTATTCG 960 # 976 - (2) INFORMATION FOR SEQ ID NO:80:

- (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 325 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: - Asp Phe Val Glu Glu Cys His Thr His Asn Il - #e Gly Val Ile Val Asp # 15 - Trp Val Pro Xaa His Phe Thr Ile Asn Asp As - #p Ala Leu Ala Tyr Tyr # 30 - Asp Gly Thr Pro Thr Phe Glu Tyr Gln Asp Hi - #s Asn Lys Ala His Asn # 45 - His Gly Trp Gly Ala Leu Asn Phe Asp Leu Gl - #y Lys Asn Glu Val Gln # 60 - Ser Phe Leu Ile Ser Cys Ile Lys His Trp Il - #e Asp Val Tyr His Leu #80 - Asp Gly Ile Arg Val Asp Ala Val Ser Asn Me - #t Leu Tyr Leu Asp Tyr # 95 - Asp Asp Ala Pro Trp Thr Pro Asn Lys Asp Gl - #y Gly Asn Leu Asn Tyr # 110 - Glu Gly Tyr Tyr Phe Leu Gln Arg Leu Asn Gl - #u Val Ile Lys Leu Glu # 125 - Tyr Pro Asp Val Met Met Ile Ala Glu Glu Se - #r Ser Ser Ala Ile Lys # 140 - Ile Thr Gly Met Lys Glu Ile Gly Gly Leu Gl - #y Phe Asp Tyr Lys Trp 145 1 - #50 1 - #55 1 - #60 - Asn Met Gly Trp Met Asn Asp Ile Leu Arg Ph - #e Tyr Glu Glu Asp Pro # 175 - Ile Tyr Arg Lys Tyr Asp Phe Asn Leu Val Th - #r Phe Ser Phe Met Tyr # 190 - Val Xaa Lys Glu Asn Tyr Leu Leu Pro Phe Se - #r His Asp Glu Val Val # 205 - His Gly Lys Lys Ser Met Met His Lys Met Tr - #p Gly Asp Arg Tyr Asn # 220 - Gln Phe Ala Gly Leu Arg Asn Leu Tyr Thr Ty - #r Gln Ile Cys His Pro 225 2 - #30 2 - #35 2 - #40 - Gly Lys Lys Leu Leu Phe Met Gly Ser Glu Ty - #r Gly Gln Phe Leu Glu # 255 - Trp Lys Ser Glu Glu Gln Leu Glu Trp Ser As - #n Leu Glu Asp Pro Met # 270 - Asn Ala Lys Met Lys Tyr Phe Ala Ser Gln Le - #u Asn Gln Phe Tyr Lys # 285 - Asp His Arg Cys Leu Trp Glu Ile Asp Thr Se - #r Tyr Asp Gly Ile Glu # 300 - Ile Ile Asp Ala Asp Asn Arg Asp Gln Ser Va - #l Leu Ser Phe Ile Arg 305 3 - #10 3 - #15 3 - #20 - Lys Gly Lys Lys Gly 325 - (2) INFORMATION FOR SEQ ID NO: 81: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 2134 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #81: (xi) SEQUENCE 
DESCRIPTION: SEQ ID NO:
ATCTGTAGTT TATGCGGATG AAACACTTAT TACTCATACT GCTGAGAAAC CTAAAGAGGA  60
AAAAATGATA GTAGAAGAAA AGGCTGATAA AGCTTTGGAA ACTAAAAATA TAGTTGAAAG  120
GACAGAACAA AGTGAACCTA GTTCAACTGA GGCTATTGCA TCTGAGNAGA AAGAAGATGA  180
AGCCGTAACT CCAAAAGAGG AAAAAGTGTC TGCTAAACCG GAAGAAAAAG CTCCAAGGAT  240

- AGAATCACAA GCTTCAAATC AAGAAAAACC GCTCAAGGAA GATGCTAAAG CT - #GTAACAAA 300 - TGAAGAAGTG AATCAAATGA TTGAAGACAG GAAAGTGGAT TTTAATCAAA AT - #TGGTACTT 360 - TAAACTCAAT GCAAATTCTA AGGAAGCCAT TAAACCTGAT GCAGACGTAT CT - #ACGTGGAA 420 - AAAATTAGAT TTACCGTATG ACTGGAGTAT CTTTAACGAT TTCGATCATG AA - #TCTCCTGC 480 - ACAAAATGAA GGTGGACAGC TCAACGGTGG GGAAGCTTGG TATCGCAAGA CT - #TTCAAACT 540 - AGATGAAAAA GACCTCAAGA AAAATGTTCG CCTTACTTTT GATGGCGTCT AC - #ATGGATTC 600 - TCAAGTTTAT GTCAATGGTC AGTTAGTGGG GCATTATCCA AATGGTTATA AC - #CAGTTCTC 660 - ATATGATATC ACCAAATACC TTCAAAAAGA TGGTCGTGAG AATGTGATTG CT - #GTCCATGC 720 - AGTCAACAAA CAGCCAAGTA GCCGTTGGTA TTCAGGAAGT GGTATCTATC GT - #GATGTGAC 780 - TTTACAAGTG ACAGATAAGG TGCATGTTGA GAAAAATGGG ACAACTATTT TA - #ACACCAAA 840 - ACTTGAAGAA CAACAACATG GCAAGGTTGA AACTCATGTG ACCAGCAAAA TC - #GTCAATAC 900 - GGACGACAAA GACCATGAAC TTGTAGCCGA ATATCAAATC GTTGAACGAG GT - #GGTCATGC 960 - TGTAACAGGC TTAGTTCGTA CAGCGAGTCG TACCTTAAAA GCACATGAAT CA - #ACAAGCCT 1020 - AGATGCGATT TTAGAAGTTG AAAGACCAAA ACTCTGGACT GTTTTAAATG AC - #AAACCTGC 1080 - CTTGTACGAA TTGATTACGC GTGTTTACCG TGACGGTCAA TTGGTTGATG CT - #AAGAAGGA 1140 - TTTGTTTGGT TACCGTTACT ATCACTGGAC TCCAAATGAA GGTTTCTCTT TG - #AATGGTGA 1200 - ACGTATTAAA TTCCATGGAG TATCCTTGCA CCACGACCAT GGGGCGCTTG GA - #GCAGAAGA 1260 - AAACTATAAA GCAGAATATC GCCGTCTCAA ACAAATGAAG GAGATGGGAG TT - #AACTCCAT 1320 - CCGTACAACC CACAACCCTG CTAGTGAGCA AACCTTGCAA ATCGCAGCAG AA - #CTAGGTTT 1380 - ACTCGTTCAG GAAGAGGCCT TTGATACGTG GTATGGTGGC AAGAAACCTT AT - #GACTATGG 1440 - ACGTTTCTTT GAAAAAGATG CCACTCACCC AGAAGCTCGA AAAGGTGAAA AA - #TGGTCTGA 1500 - TTTTGACCTA CGTACCATGG TCGAAAGAGG CAAAAACAAC CCTGCTATCT TC - #ATGTGGTC 1560 - AATTGGTAAT GAAATAGGTG AAGCTAATGG TGATGCCCAC TCTTTAGCAA CT - #GTTAAACG 1620 - TTTGGTTAAG GTTATCAAGG ATGTTGATAA GACTCGCTAT GTTACCATGG GA - #GCAGATAA 1680 - ATTCCGTTTC GGTAATGGTA GCGGAGGGCA TGAGAAAATT GCTGATGAAC TC - #GATGCTGT 1740 - TGGATTTAAC TATTCTGAAG ATAATTACAA AGCCCTTAGA GCTAAGCATC CA - #AAATGGTT 1800 - 
GATTTATGGA TCAGAAACAT CTTCAGCTAC CCGTACACGT GGAAGTTACT AT - #CGCCCTGA 1860 - ACGTGAATTG AAACATAGCA ATGGACCTGA GCGTAATTAT GAACAGTCAG AT - #TATGGAAA 1920 - TGATCGTGTG GGTTGGGGGA AAACAGCAAC CGCTTCATGG ACTTTTGACC GT - #GACAACGC 1980 - TGGCTATGCT GGACAGTTTA TCTGGACAGG TACGGACTAT ATTGGTGAAC CT - #ACACCATG 2040 - GCACAACCAA AATCAAACTC CTGTTAAGAG CTCTTACTTT GGTATCGTAG AT - #ACAGCCGG 2100 # 2134 TTCT ATCTCTACCA AAGC - (2) INFORMATION FOR SEQ ID NO:82: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 711 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: - Ser Val Val Tyr Ala Asp Glu Thr Leu Ile Th - #r His Thr Ala Glu Lys # 15 - Pro Lys Glu Glu Lys Met Ile Val Glu Glu Ly - #s Ala Asp Lys Ala Leu # 30 - Glu Thr Lys Asn Ile Val Glu Arg Thr Glu Gl - #n Ser Glu Pro Ser Ser # 45 - Thr Glu Ala Ile Ala Ser Glu Xaa Lys Glu As - #p Glu Ala Val Thr Pro # 60 - Lys Glu Glu Lys Val Ser Ala Lys Pro Glu Gl - #u Lys Ala Pro Arg Ile #80 - Glu Ser Gln Ala Ser Asn Gln Glu Lys Pro Le - #u Lys Glu Asp Ala Lys # 95 - Ala Val Thr Asn Glu Glu Val Asn Gln Met Il - #e Glu Asp Arg Lys Val # 110 - Asp Phe Asn Gln Asn Trp Tyr Phe Lys Leu As - #n Ala Asn Ser Lys Glu # 125 - Ala Ile Lys Pro Asp Ala Asp Val Ser Thr Tr - #p Lys Lys Leu Asp Leu # 140 - Pro Tyr Asp Trp Ser Ile Phe Asn Asp Phe As - #p His Glu Ser Pro Ala 145 1 - #50 1 - #55 1 - #60 - Gln Asn Glu Gly Gly Gln Leu Asn Gly Gly Gl - #u Ala Trp Tyr Arg Lys # 175 - Thr Phe Lys Leu Asp Glu Lys Asp Leu Lys Ly - #s Asn Val Arg Leu Thr # 190 - Phe Asp Gly Val Tyr Met Asp Ser Gln Val Ty - #r Val Asn Gly Gln Leu # 205 - Val Gly His Tyr Pro Asn Gly Tyr Asn Gln Ph - #e Ser Tyr Asp Ile Thr # 220 - Lys Tyr Leu Gln Lys Asp Gly Arg Glu Asn Va - #l Ile Ala Val His Ala 225 2 - #30 2 - #35 2 - #40 - Val Asn Lys Gln Pro Ser Ser Arg Trp Tyr Se - #r Gly Ser Gly Ile Tyr # 255 - Arg Asp Val Thr Leu Gln Val Thr Asp Lys Va - #l His Val Glu Lys Asn # 270 - Gly Thr Thr Ile Leu 
Thr Pro Lys Leu Glu Gl - #u Gln Gln His Gly Lys # 285 - Val Glu Thr His Val Thr Ser Lys Ile Val As - #n Thr Asp Asp Lys Asp # 300 - His Glu Leu Val Ala Glu Tyr Gln Ile Val Gl - #u Arg Gly Gly His Ala 305 3 - #10 3 - #15 3 - #20 - Val Thr Gly Leu Val Arg Thr Ala Ser Arg Th - #r Leu Lys Ala His Glu # 335 - Ser Thr Ser Leu Asp Ala Ile Leu Glu Val Gl - #u Arg Pro Lys Leu Trp # 350 - Thr Val Leu Asn Asp Lys Pro Ala Leu Tyr Gl - #u Leu Ile Thr Arg Val # 365 - Tyr Arg Asp Gly Gln Leu Val Asp Ala Lys Ly - #s Asp Leu Phe Gly Tyr # 380 - Arg Tyr Tyr His Trp Thr Pro Asn Glu Gly Ph - #e Ser Leu Asn Gly Glu 385 3 - #90 3 - #95 4 - #00 - Arg Ile Lys Phe His Gly Val Ser Leu His Hi - #s Asp His Gly Ala Leu # 415 - Gly Ala Glu Glu Asn Tyr Lys Ala Glu Tyr Ar - #g Arg Leu Lys Gln Met # 430 - Lys Glu Met Gly Val Asn Ser Ile Arg Thr Th - #r His Asn Pro Ala Ser # 445 - Glu Gln Thr Leu Gln Ile Ala Ala Glu Leu Gl - #y Leu Leu Val Gln Glu # 460 - Glu Ala Phe Asp Thr Trp Tyr Gly Gly Lys Ly - #s Pro Tyr Asp Tyr Gly 465 4 - #70 4 - #75 4 - #80 - Arg Phe Phe Glu Lys Asp Ala Thr His Pro Gl - #u Ala Arg Lys Gly Glu # 495 - Lys Trp Ser Asp Phe Asp Leu Arg Thr Met Va - #l Glu Arg Gly Lys Asn # 510 - Asn Pro Ala Ile Phe Met Trp Ser Ile Gly As - #n Glu Ile Gly Glu Ala # 525 - Asn Gly Asp Ala His Ser Leu Ala Thr Val Ly - #s Arg Leu Val Lys Val # 540 - Ile Lys Asp Val Asp Lys Thr Arg Tyr Val Th - #r Met Gly Ala Asp Lys 545 5 - #50 5 - #55 5 - #60 - Phe Arg Phe Gly Asn Gly Ser Gly Gly His Gl - #u Lys Ile Ala Asp Glu # 575 - Leu Asp Ala Val Gly Phe Asn Tyr Ser Glu As - #p Asn Tyr Lys Ala Leu # 590 - Arg Ala Lys His Pro Lys Trp Leu Ile Tyr Gl - #y Ser Glu Thr Ser Ser # 605 - Ala Thr Arg Thr Arg Gly Ser Tyr Tyr Arg Pr - #o Glu Arg Glu Leu Lys # 620 - His Ser Asn Gly Pro Glu Arg Asn Tyr Glu Gl - #n Ser Asp Tyr Gly Asn 625 6 - #30 6 - #35 6 - #40 - Asp Arg Val Gly Trp Gly Lys Thr Ala Thr Al - #a Ser Trp Thr Phe Asp # 655 - Arg Asp Asn Ala Gly Tyr Ala Gly Gln Phe Il - #e Trp Thr Gly Thr Asp # 670 - Tyr Ile 
Gly Glu Pro Thr Pro Trp His Asn Gl - #n Asn Gln Thr Pro Val # 685 - Lys Ser Ser Tyr Phe Gly Ile Val Asp Thr Al - #a Gly Ile Pro Lys His # 700 - Asp Phe Tyr Leu Tyr Gln Ser 705 7 - #10 - (2) INFORMATION FOR SEQ ID NO: 83: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 2167 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #83: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TTACTTTGGT ATCGTAGATA CAGCCGGCAT TCCAAAACAT GACTTCTATC TC - #TACCAAAG 60 - CCAATGGGTT TCTGTTAAGA AGAAACCGAT GGTACACCTT CTTCCTCACT GG - #AACTGGGA 120 - AAACAAAGAA TTAGCATCCA AAGTAGCTGA CTCAGAAGGT AAGATTCCAG TT - #CGTGCTTA 180 - TTCGAATGCT TCTAGTGTAG AATTGTTCTT GAATGGAAAA TCTCTTGGTC TT - #AAGACTTT 240 - CAATAAAAAA CAAACCAGCG ATGGGCGGAC TTACCAAGAA GGTGCAAATG CT - #AATGAACT 300 - TTATCTTGAA TGGAAAGTTG CCTATCAACC AGGTACCTTG GAAGCAATTG CT - #CGTGATGA 360 - ATCTGGCAAG GAAATTGCTC GAGATAAGAT TACGACTGCT GGTAAGCCAG CG - #GCAGTTCG 420 - TCTTATTAAG GAAGACCATG CGATTGCAGC AGATGGAAAA GACTTGACTT AC - #ATCTACTA 480 - TGAAATTGTT GACAGCCAGG GGAATGTGGT TCCAACTGCT AATAATCTGG TT - #CGCTTCCA 540 - ATTGCATGGC CAAGGTCAAC TGGTCGGTGT AGATAACGGA GAACAAGCCA GC - #CGTGAACG 600 - CTATAAGGCG CAAGCAGATG GTTCTTGGAT TCGTAAAGCA TTTAATGGTA AA - #GGTGTTGC 660 - CATTGTCAAA TCAACTGAAC AAGCAGGGAA ATTCACCCTG ACTGCCCACT CT - #GATCTCTT 720 - GAAATCGAAC CAAGTCACTG TCTTTACTGG TAAGAAAGAA GGACAAGAGA AG - #ACTGTTTT 780 - GGGGACAGAA GTGCCAAAAG TACAGACCAT TATTGGAGAG GCACCTGAAA TG - #CCTACCAC 840 - TGTTCCGTTT GTATACAGTG ATGGTAGCCG TGCAGAACGT CCTGTAACCT GG - #TCTTCAGT 900 - AGATGTGAGC AAGCCTGGTA TTGTAACGGT GAAAGGTATG GCTGACGGAC GA - #GAAGTAGA 960 - AGCTCGTGTA GAAGTGATTG CTCTTAAATC AGAGCTACCA GTTGTGAAAC GT - #ATTGCTCC 1020 - AAATACTGAC TTGAATTCTG TAGACAAATC TGTTTCCTAT GTTTTGATTG AT - #GGAAGTGT 1080 - TGAAGAGTAT GAAGTGGACA AGTGGGAGAT TGCCGAAGAA GATAAAGCTA AG - #TTAGCAAT 1140 - TCCAGGTTCT CGTATTCAAG CGACCGGTTA TTTAGAAGGT CAACCAATTC AT - #GCAACCCT 1200 - TGTGGTAGAA GAAGGCAATC CTGCGGCACC TGCAGTACCA ACTGTAACGG TT - #GGTGGTGA 1260 - 
GGCAGTAACA GGTCTTACTA GTCAAAAACC AATGCAATAC CGCACTCTTG CT - #TATGGAGC 1320 - TAAGTTGCCA GAAGTCACAG CAAGTGCTAA AAATGCAGCT GTTACAGTTC TT - #CAAGCAAG 1380 - CGCAGCAAAC GGCATGCGTG CGAGCATCTT TATTCAGCCT AAAGATGGTG GC - #CCTCTTCA 1440 - AACCTATGCA ATTCAATTCC TTGAAGAAGC GCCAAAAATT GCTCACTTGA GC - #TTGCAAGT 1500 - GGAAAAAGCT GACAGTCTCA AAGAAGACCA AACTGTCAAA TTGTCGGTTC GA - #GCTCACTA 1560 - TCAAGATGGA ACGCAAGCTG TATTACCAGC TGATAAAGTA ACCTTCTCTA CA - #AGTGGTGA 1620 - AGGGGAAGTC GCAATTCGTA AAGGAATGCT TGAGTTGCAT AAGCCAGGAG CA - #GTCACTCT 1680 - GAACGCTGAA TATGAGGGAG CTAAAGACCA AGTTGAACTC ACTATCCAAG CC - #AATACTGA 1740 - GAAGAAGATT GCGCAATCCA TCCGTCCTGT AAATGTAGTG ACAGATTTGC AT - #CAGGAACC 1800 - AAGTCTTCCA GCAACAGTAA CAGTTGAGTA TGACAAAGGT TTCCCTAAAA CT - #CATAAAGT 1860 - CACTTGGCAA GCTATTCCGA AAGAAAAACT AGACTCCTAT CAAACATTTG AA - #GTACTAGG 1920 - TAAAGTTGAA GGAATTGACC TTGAAGCGCG TGCAAAAGTC TCTGTAGAAG GT - #ATCGTTTC 1980 - AGTTGAAGAA GTCAGTGTGA CAACTCCAAT CGCAGAAGCA CCACAATTAC CA - #GAAAGTGT 2040 - TCGGACATAT GATTCAAATG GTCACGTTTC ATCAGCTAAG GTTGCATGGG AT - #GCGATTCG 2100 - TCCAGAGCAA TACGCTAAGG AAGGTGTCTT TACAGTTAAT GGTCGCTTAG AA - #GGTACGCA 2160 # 2167 - (2) INFORMATION FOR SEQ ID NO:84: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 722 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein

- (xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: - Tyr Phe Gly Ile Val Asp Thr Ala Gly Ile Pr - #o Lys His Asp Phe Tyr # 15 - Leu Tyr Gln Ser Gln Trp Val Ser Val Lys Ly - #s Lys Pro Met Val His # 30 - Leu Leu Pro His Trp Asn Trp Glu Asn Lys Gl - #u Leu Ala Ser Lys Val # 45 - Ala Asp Ser Glu Gly Lys Ile Pro Val Arg Al - #a Tyr Ser Asn Ala Ser # 60 - Ser Val Glu Leu Phe Leu Asn Gly Lys Ser Le - #u Gly Leu Lys Thr Phe #80 - Asn Lys Lys Gln Thr Ser Asp Gly Arg Thr Ty - #r Gln Glu Gly Ala Asn # 95 - Ala Asn Glu Leu Tyr Leu Glu Trp Lys Val Al - #a Tyr Gln Pro Gly Thr # 110 - Leu Glu Ala Ile Ala Arg Asp Glu Ser Gly Ly - #s Glu Ile Ala Arg Asp # 125 - Lys Ile Thr Thr Ala Gly Lys Pro Ala Ala Va - #l Arg Leu Ile Lys Glu # 140 - Asp His Ala Ile Ala Ala Asp Gly Lys Asp Le - #u Thr Tyr Ile Tyr Tyr 145 1 - #50 1 - #55 1 - #60 - Glu Ile Val Asp Ser Gln Gly Asn Val Val Pr - #o Thr Ala Asn Asn Leu # 175 - Val Arg Phe Gln Leu His Gly Gln Gly Gln Le - #u Val Gly Val Asp Asn # 190 - Gly Glu Gln Ala Ser Arg Glu Arg Tyr Lys Al - #a Gln Ala Asp Gly Ser # 205 - Trp Ile Arg Lys Ala Phe Asn Gly Lys Gly Va - #l Ala Ile Val Lys Ser # 220 - Thr Glu Gln Ala Gly Lys Phe Thr Leu Thr Al - #a His Ser Asp Leu Leu 225 2 - #30 2 - #35 2 - #40 - Lys Ser Asn Gln Val Thr Val Phe Thr Gly Ly - #s Lys Glu Gly Gln Glu # 255 - Lys Thr Val Leu Gly Thr Glu Val Pro Lys Va - #l Gln Thr Ile Ile Gly # 270 - Glu Ala Pro Glu Met Pro Thr Thr Val Pro Ph - #e Val Tyr Ser Asp Gly # 285 - Ser Arg Ala Glu Arg Pro Val Thr Trp Ser Se - #r Val Asp Val Ser Lys # 300 - Pro Gly Ile Val Thr Val Lys Gly Met Ala As - #p Gly Arg Glu Val Glu 305 3 - #10 3 - #15 3 - #20 - Ala Arg Val Glu Val Ile Ala Leu Lys Ser Gl - #u Leu Pro Val Val Lys # 335 - Arg Ile Ala Pro Asn Thr Asp Leu Asn Ser Va - #l Asp Lys Ser Val Ser # 350 - Tyr Val Leu Ile Asp Gly Ser Val Glu Glu Ty - #r Glu Val Asp Lys Trp # 365 - Glu Ile Ala Glu Glu Asp Lys Ala Lys Leu Al - #a Ile Pro Gly Ser Arg # 380 - Ile Gln Ala Thr Gly Tyr Leu Glu Gly Gln Pr - #o Ile His Ala Thr Leu 385 
3 - #90 3 - #95 4 - #00 - Val Val Glu Glu Gly Asn Pro Ala Ala Pro Al - #a Val Pro Thr Val Thr # 415 - Val Gly Gly Glu Ala Val Thr Gly Leu Thr Se - #r Gln Lys Pro Met Gln # 430 - Tyr Arg Thr Leu Ala Tyr Gly Ala Lys Leu Pr - #o Glu Val Thr Ala Ser # 445 - Ala Lys Asn Ala Ala Val Thr Val Leu Gln Al - #a Ser Ala Ala Asn Gly # 460 - Met Arg Ala Ser Ile Phe Ile Gln Pro Lys As - #p Gly Gly Pro Leu Gln 465 4 - #70 4 - #75 4 - #80 - Thr Tyr Ala Ile Gln Phe Leu Glu Glu Ala Pr - #o Lys Ile Ala His Leu # 495 - Ser Leu Gln Val Glu Lys Ala Asp Ser Leu Ly - #s Glu Asp Gln Thr Val # 510 - Lys Leu Ser Val Arg Ala His Tyr Gln Asp Gl - #y Thr Gln Ala Val Leu # 525 - Pro Ala Asp Lys Val Thr Phe Ser Thr Ser Gl - #y Glu Gly Glu Val Ala # 540 - Ile Arg Lys Gly Met Leu Glu Leu His Lys Pr - #o Gly Ala Val Thr Leu 545 5 - #50 5 - #55 5 - #60 - Asn Ala Glu Tyr Glu Gly Ala Lys Asp Gln Va - #l Glu Leu Thr Ile Gln # 575 - Ala Asn Thr Glu Lys Lys Ile Ala Gln Ser Il - #e Arg Pro Val Asn Val # 590 - Val Thr Asp Leu His Gln Glu Pro Ser Leu Pr - #o Ala Thr Val Thr Val # 605 - Glu Tyr Asp Lys Gly Phe Pro Lys Thr His Ly - #s Val Thr Trp Gln Ala # 620 - Ile Pro Lys Glu Lys Leu Asp Ser Tyr Gln Th - #r Phe Glu Val Leu Gly 625 6 - #30 6 - #35 6 - #40 - Lys Val Glu Gly Ile Asp Leu Glu Ala Arg Al - #a Lys Val Ser Val Glu # 655 - Gly Ile Val Ser Val Glu Glu Val Ser Val Th - #r Thr Pro Ile Ala Glu # 670 - Ala Pro Gln Leu Pro Glu Ser Val Arg Thr Ty - #r Asp Ser Asn Gly His # 685 - Val Ser Ser Ala Lys Val Ala Trp Asp Ala Il - #e Arg Pro Glu Gln Tyr # 700 - Ala Lys Glu Gly Val Phe Thr Val Asn Gly Ar - #g Leu Glu Gly Thr Gln 705 7 - #10 7 - #15 7 - #20 - Leu Thr - (2) INFORMATION FOR SEQ ID NO: 85: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 2329 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #85: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - AGCTAAGGTT GCATGGGATG CGATTCGTCC AGAGCAATAC GCTAAGGAAG GT - #GTCTTTAC 60 - AGTTAATGGT CGCTTAGAAG GTACGCAATT AACAACTAAA CTTCATGTTC GC - #GTATCTGC 
120 - TCAAACTGAG CAAGGTGCAA ACATTTCTGA CCAATGGACC GGTTCAGAAT TG - #CCACTTGC 180 - CTTTGCTTCA GACTCAAATC CAAGCGACCC AGTTTCAAAT GTTAATGACA AG - #CTCATTTC 240 - CTACAATAAC CAACCAGCCA ATCGTTGGAC AAACTGGAAT CGTACTAATC CA - #GAAGCTTC 300 - AGTCGGTGTT CTGTTTGGAG ATTCAGGTAT CTTGAGCAAA CGCTCCGTTG AT - #AATCTAAG 360 - TGTCGGATTC CATGAAGACC ATGGAGTTGG TGTACCGAAG TCTTATGTGA TT - #GAGTATTA 420 - TGTTGGTAAG ACTGTCCCAA CAGCTCCTAA AAACCCTAGT TTTGTTGGTA AT - #GAGGACCA 480 - TGTCTTTAAT GATTCTGCCA ACTGGAAACC AGTTACTAAT CTAAAAGCCC CT - #GCTCAACT 540 - CAAGGCTGGA GAAATGAACC ACTTTAGCTT TGATAAAGTT GAAACCTATG CT - #GTTCGTAT 600 - TCGCATGGTT AAAGCAGATA ACAAGCGTGG AACGTCTATC ACAGAGGTAC AA - #ATCTTTGC 660 - GAAACAAGTT GCGGCAGCCA AGCAAGGACA AACAAGAATC CAAGTTGACG GC - #AAAGACTT 720 - AGCAAACTTC AACCCTGATT TGACAGACTA CTACCTTGAG TCTGTAGATG GA - #AAAGTTCC 780 - GGCAGTCACA GCAAGTGTTA GCAACAATGG TCTCGCTACC GTCGTTCCAA GC - #GTTCGTGA 840 - AGGTGAGCCA GTTCGTGTCA TCGCGAAAGC TGAAAATGGC GACATCTTAG GA - #GAATACCG 900 - TCTGCACTTC ACTAAGGATA AGAGCTTACT TTCTCATAAA CCAGTTGCTG CG - #GTTAAACA 960 - AGCTCGCTTG CTACAAGTAG GTCAAGCACT TGAATTGCCG ACTAAGGTTC CA - #GTTTACTT 1020 - CACAGGTAAA GACGGCTACG AAACAAAAGA CCTGACAGTT GAATGGGAAG AA - #GTTCCAGC 1080 - GGAAAATCTG ACAAAAGCAG GTCAATTTAC TGTTCGAGGC CGTGTCCTTG GT - #AGTAACCT 1140 - TGTTGCTGAG ATCACTGTAC GAGTGACAGA CAAACTTGGT GAGACTCTTT CA - #GATAACCC 1200 - TAACTATGAT GAAAACAGTA ACCAGGCCTT TGCTTCAGCA ACCAATGATA TT - #GACAAAAA 1260 - CTCTCATGAC CGCGTTGACT ATCTCAATGA CGGAGATCAT TCAGAAAATC GT - #CGTTGGAC 1320 - AAACTGGTCA CCAACACCAT CTTCTAATCC AGAAGTATCA GCGGGTGTGA TT - #TTCCGTGA 1380 - AAATGGTAAG ATTGTAGAAC GGACTGTTAC ACAAGGAAAA GTTCAGTTCT TT - #GCAGATAG 1440 - TGGTACGGAT GCACCATCTA AACTCGTTTT AGAACGCTAT GTCGGTCCAG AG - #TTTGAAGT 1500 - GCCAACCTAC TATTCAAACT ACCAAGCCTA CGACGCAGAC CATCCATTCA AC - #AATCCAGA 1560 - AAATTGGGAA GCTGTTCCTT ATCGTGCGGA TAAAGACATT GCAGCTGGTG AT - #GAAATCAA 1620 - CGTAACATTT AAAGCTATCA AAGCCAAAGC TATGAGATGG CGTATGGAGC GT - #AAAGCAGA 1680 - 
TAAGAGCGGT GTTGCGATGA TTGAGATGAC CTTCCTTGCA CCAAGTGAAT TG - #CCTCAAGA 1740 - AAGCACTCAA TCAAAGATTC TTGTAGATGG AAAAGAACTT GCTGATTTCG CT - #GAAAATCG 1800 - TCAAGACTAT CAAATTACCT ATAAAGGTCA ACGGCCAAAA GTCTCAGTTG AA - #GAAAACAA 1860 - TCAAGTAGCT TCAACTGTGG TAGATAGTGG AGAAGATAGC TTTCCAGTAC TT - #GTTCGCCT 1920 - CGTTTCAGAA AGTGGAAAAC AAGTCAAGGA ATACCGTATC CACTTGACTA AG - #GAAAAACC 1980 - AGTTTCTGAG AAGACAGTTG CTGCTGTACA AGAAGATCTT CCAAAAATCG AA - #TTTGTTGA 2040 - AAAAGATTTG GCATACAAGA CAGTTGAGAA AAAAGATTCA ACACTGTATC TA - #GGTGAAAC 2100 - TCGTGTAGAA CAAGAAGGAA AAGTTGGAAA AGAACGTATC TTTACAGCGA TT - #AATCCTGA 2160 - TGGAAGTAAG GAAGAAAAAC TCCGTGAAGT GGTAGAAGTT CCGACAGACC GC - #ATCGTCTT 2220 - GGTTGGAACC AAACCAGTAG CTCAAGAAGC TAAAAAACCA CAAGTGTCAG AA - #AAAGCAGA 2280 # 2329TTCAA GTGAAGCTAG TCAAACTAAT AAAGCCCAG - (2) INFORMATION FOR SEQ ID NO:86: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 776 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: - Ala Lys Val Ala Trp Asp Ala Ile Arg Pro Gl - #u Gln Tyr Ala Lys Glu # 15 - Gly Val Phe Thr Val Asn Gly Arg Leu Glu Gl - #y Thr Gln Leu Thr Thr # 30 - Lys Leu His Val Arg Val Ser Ala Gln Thr Gl - #u Gln Gly Ala Asn Ile # 45 - Ser Asp Gln Trp Thr Gly Ser Glu Leu Pro Le - #u Ala Phe Ala Ser Asp # 60 - Ser Asn Pro Ser Asp Pro Val Ser Asn Val As - #n Asp Lys Leu Ile Ser #80 - Tyr Asn Asn Gln Pro Ala Asn Arg Trp Thr As - #n Trp Asn Arg Thr Asn # 95 - Pro Glu Ala Ser Val Gly Val Leu Phe Gly As - #p Ser Gly Ile Leu Ser # 110 - Lys Arg Ser Val Asp Asn Leu Ser Val Gly Ph - #e His Glu Asp His Gly # 125 - Val Gly Val Pro Lys Ser Tyr Val Ile Glu Ty - #r Tyr Val Gly Lys Thr # 140 - Val Pro Thr Ala Pro Lys Asn Pro Ser Phe Va - #l Gly Asn Glu Asp His 145 1 - #50 1 - #55 1 - #60 - Val Phe Asn Asp Ser Ala Asn Trp Lys Pro Va - #l Thr Asn Leu Lys Ala # 175 - Pro Ala Gln Leu Lys Ala Gly Glu Met Asn Hi - #s Phe Ser Phe Asp Lys # 190 - Val Glu Thr Tyr Ala 
Val Arg Ile Arg Met Va - #l Lys Ala Asp Asn Lys # 205 - Arg Gly Thr Ser Ile Thr Glu Val Gln Ile Ph - #e Ala Lys Gln Val Ala # 220 - Ala Ala Lys Gln Gly Gln Thr Arg Ile Gln Va - #l Asp Gly Lys Asp Leu 225 2 - #30 2 - #35 2 - #40 - Ala Asn Phe Asn Pro Asp Leu Thr Asp Tyr Ty - #r Leu Glu Ser Val Asp # 255 - Gly Lys Val Pro Ala Val Thr Ala Ser Val Se - #r Asn Asn Gly Leu Ala # 270 - Thr Val Val Pro Ser Val Arg Glu Gly Glu Pr - #o Val Arg Val Ile Ala # 285 - Lys Ala Glu Asn Gly Asp Ile Leu Gly Glu Ty - #r Arg Leu His Phe Thr # 300 - Lys Asp Lys Ser Leu Leu Ser His Lys Pro Va - #l Ala Ala Val Lys Gln 305 3 - #10 3 - #15 3 - #20 - Ala Arg Leu Leu Gln Val Gly Gln Ala Leu Gl - #u Leu Pro Thr Lys Val # 335 - Pro Val Tyr Phe Thr Gly Lys Asp Gly Tyr Gl - #u Thr Lys Asp Leu Thr # 350 - Val Glu Trp Glu Glu Val Pro Ala Glu Asn Le - #u Thr Lys Ala Gly Gln # 365 - Phe Thr Val Arg Gly Arg Val Leu Gly Ser As - #n Leu Val Ala Glu Ile # 380 - Thr Val Arg Val Thr Asp Lys Leu Gly Glu Th - #r Leu Ser Asp Asn Pro 385 3 - #90 3 - #95 4 - #00 - Asn Tyr Asp Glu Asn Ser Asn Gln Ala Phe Al - #a Ser Ala Thr Asn Asp # 415 - Ile Asp Lys Asn Ser His Asp Arg Val Asp Ty - #r Leu Asn Asp Gly Asp # 430 - His Ser Glu Asn Arg Arg Trp Thr Asn Trp Se - #r Pro Thr Pro Ser Ser # 445 - Asn Pro Glu Val Ser Ala Gly Val Ile Phe Ar - #g Glu Asn Gly Lys Ile # 460 - Val Glu Arg Thr Val Thr Gln Gly Lys Val Gl - #n Phe Phe Ala Asp Ser

465 4 - #70 4 - #75 4 - #80 - Gly Thr Asp Ala Pro Ser Lys Leu Val Leu Gl - #u Arg Tyr Val Gly Pro # 495 - Glu Phe Glu Val Pro Thr Tyr Tyr Ser Asn Ty - #r Gln Ala Tyr Asp Ala # 510 - Asp His Pro Phe Asn Asn Pro Glu Asn Trp Gl - #u Ala Val Pro Tyr Arg # 525 - Ala Asp Lys Asp Ile Ala Ala Gly Asp Glu Il - #e Asn Val Thr Phe Lys # 540 - Ala Ile Lys Ala Lys Ala Met Arg Trp Arg Me - #t Glu Arg Lys Ala Asp 545 5 - #50 5 - #55 5 - #60 - Lys Ser Gly Val Ala Met Ile Glu Met Thr Ph - #e Leu Ala Pro Ser Glu # 575 - Leu Pro Gln Glu Ser Thr Gln Ser Lys Ile Le - #u Val Asp Gly Lys Glu # 590 - Leu Ala Asp Phe Ala Glu Asn Arg Gln Asp Ty - #r Gln Ile Thr Tyr Lys # 605 - Gly Gln Arg Pro Lys Val Ser Val Glu Glu As - #n Asn Gln Val Ala Ser # 620 - Thr Val Val Asp Ser Gly Glu Asp Ser Phe Pr - #o Val Leu Val Arg Leu 625 6 - #30 6 - #35 6 - #40 - Val Ser Glu Ser Gly Lys Gln Val Lys Glu Ty - #r Arg Ile His Leu Thr # 655 - Lys Glu Lys Pro Val Ser Glu Lys Thr Val Al - #a Ala Val Gln Glu Asp # 670 - Leu Pro Lys Ile Glu Phe Val Glu Lys Asp Le - #u Ala Tyr Lys Thr Val # 685 - Glu Lys Lys Asp Ser Thr Leu Tyr Leu Gly Gl - #u Thr Arg Val Glu Gln # 700 - Glu Gly Lys Val Gly Lys Glu Arg Ile Phe Th - #r Ala Ile Asn Pro Asp 705 7 - #10 7 - #15 7 - #20 - Gly Ser Lys Glu Glu Lys Leu Arg Glu Val Va - #l Glu Val Pro Thr Asp # 735 - Arg Ile Val Leu Val Gly Thr Lys Pro Val Al - #a Gln Glu Ala Lys Lys # 750 - Pro Gln Val Ser Glu Lys Ala Asp Thr Lys Pr - #o Ile Asp Ser Ser Glu # 765 - Ala Ser Gln Thr Asn Lys Ala Gln # 775 - (2) INFORMATION FOR SEQ ID NO: 87: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 133 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #87: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CTATCACTAT GTAAATAAAG AGATTATTTC ACAAGAAGCT AAAGATTTAA TT - #CAGACAGG 60 - AAAGCCTGAC AGGAATGAAG TTGTATATGG TTTGGTGTAT CAAAAAGATC AG - #TTGCCTCA 120 # 133 - (2) INFORMATION FOR SEQ ID NO:88: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 44 amino (B) TYPE: amino acid (C) 
STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: - Tyr His Tyr Val Asn Lys Glu Ile Ile Ser Gl - #n Glu Ala Lys Asp Leu # 15 - Ile Gln Thr Gly Lys Pro Asp Arg Asn Glu Va - #l Val Tyr Gly Leu Val # 30 - Tyr Gln Lys Asp Gln Leu Pro Gln Thr Gly Th - #r Glu # 40 - (2) INFORMATION FOR SEQ ID NO: 89: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 775 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #89: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGAGACTCCT CAATCAATAA CAAATCAGGA GCAAGCTAGG ACAGAAAACC AA - #GTAGTAGA 60 - GACAGAGGAA GCTCCAAAAG AAGAAGCACC TAAAACAGAA GAAAGTCCAA AG - #GAAGAACC 120 - AAAATCGGAG GTAAAACCTA CTGACGACAC CCTTCCTAAA GTAGAAGAGG GG - #AAAGAAGA 180 - TTCAGCAGAA CCAGCTCCAG TTGAAGAAGT AGGTGGAGAA GTTGAGTCAA AA - #CCAGAGGA 240 - AAAAGTAGCA GTTAAGCCAG AAAGTCAACC ATCAGACAAA CCAGCTGAGG AA - #TCAAAAGT 300 - TGAACAAGCA GGTGAACCAG TCGCGCCAAG AGAAGACGAA AAGGCACCAG TC - #GAGCCAGA 360 - AAAGCAACCA GAAGCTCCTG AAGAAGAGAA GGCTGTAGAG GAAACACCGA AA - #CAAGAAGA 420 - GTCAACTCCA GATACCAAGG CTGAAGAAAC TGTAGAACCA AAAGAGGAGA CT - #GTTAATCA 480 - ATCTATTGAA CAACCAAAAG TTGAAACGCC TGCTGTAGAA AAACAAACAG AA - #CCAACAGA 540 - GGAACCAAAA GTTGAACAAG CAGGTGAACC AGTCGCGCCA AGAGAAGACG AA - #CAGGCACC 600 - AACGGCACCA GTTGAGCCAG AAAAGCAACC AGAAGTTCCT GAAGAAGAGA AG - #GCTGTAGA 660 - GGAAACACCG AAACCAGAAG ATAAAATAAA GGGTATTGGT ACTAAAGAAC CA - #GTTGATAA 720 - AAGTGAGTTA AATAATCAAA TTGATAAAGC TAGTTCAGTT TCTCCTACTG AT - #TAT 775 - (2) INFORMATION FOR SEQ ID NO:90: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 258 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: - Glu Thr Pro Gln Ser Ile Thr Asn Gln Glu Gl - #n Ala Arg Thr Glu Asn # 15 - Gln Val Val Glu Thr Glu Glu Ala Pro Lys Gl - #u Glu Ala Pro Lys Thr # 30 - Glu Glu Ser Pro Lys Glu Glu Pro Lys Ser Gl - #u Val Lys Pro Thr Asp # 45 - Asp Thr Leu Pro Lys 
Val Glu Glu Gly Lys Gl - #u Asp Ser Ala Glu Pro # 60 - Ala Pro Val Glu Glu Val Gly Gly Glu Val Gl - #u Ser Lys Pro Glu Glu #80 - Lys Val Ala Val Lys Pro Glu Ser Gln Pro Se - #r Asp Lys Pro Ala Glu # 95 - Glu Ser Lys Val Glu Gln Ala Gly Glu Pro Va - #l Ala Pro Arg Glu Asp # 110 - Glu Lys Ala Pro Val Glu Pro Glu Lys Gln Pr - #o Glu Ala Pro Glu Glu # 125 - Glu Lys Ala Val Glu Glu Thr Pro Lys Gln Gl - #u Glu Ser Thr Pro Asp # 140 - Thr Lys Ala Glu Glu Thr Val Glu Pro Lys Gl - #u Glu Thr Val Asn Gln 145 1 - #50 1 - #55 1 - #60 - Ser Ile Glu Gln Pro Lys Val Glu Thr Pro Al - #a Val Glu Lys Gln Thr # 175 - Glu Pro Thr Glu Glu Pro Lys Val Glu Gln Al - #a Gly Glu Pro Val Ala # 190 - Pro Arg Glu Asp Glu Gln Ala Pro Thr Ala Pr - #o Val Glu Pro Glu Lys # 205 - Gln Pro Glu Val Pro Glu Glu Glu Lys Ala Va - #l Glu Glu Thr Pro Lys # 220 - Pro Glu Asp Lys Ile Lys Gly Ile Gly Thr Ly - #s Glu Pro Val Asp Lys 225 2 - #30 2 - #35 2 - #40 - Ser Glu Leu Asn Asn Gln Ile Asp Lys Ala Se - #r Ser Val Ser Pro Thr # 255 - Asp Tyr - (2) INFORMATION FOR SEQ ID NO: 91: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 199 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #91: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GGATGCTCAA GAAACTGCGG GAGTTCACTA TAAATATGTG GCAGATTCAG AG - #CTATCATC 60 - AGAAGAAAAG AAGCAGCTTG TCTATGATAT TCCGACATAC GTGGAGAATG AT - #GATGAAAC 120 - TTATTATCTT GTTTATAAGT TAAATTCTCA AAATCAACTG GCGGAATTGC CA - #AATACTGG 180 #199 CAA - (2) INFORMATION FOR SEQ ID NO:92: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 66 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: - Asp Ala Gln Glu Thr Ala Gly Val His Tyr Ly - #s Tyr Val Ala Asp Ser # 15 - Glu Leu Ser Ser Glu Glu Lys Lys Gln Leu Va - #l Tyr Asp Ile Pro Thr # 30 - Tyr Val Glu Asn Asp Asp Glu Thr Tyr Tyr Le - #u Val Tyr Lys Leu Asn # 45 - Ser Gln Asn Gln Leu Ala Glu Leu Pro Asn Th - #r Gly Ser Lys Asn Glu # 60 - 
Arg Gln 65 - (2) INFORMATION FOR SEQ ID NO: 93: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 835 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #93: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CGACAAAGGT GAGACTGAGG TTCAACCAGA GTCGCCAGAT ACTGTGGTAA GT - #GATAAAGG 60 - TGAACCAGAG CAGGTAGCAC CGCTTCCAGA ATATAAGGGT AATATTGAGC AA - #GTAAAACC 120 - TGAAACTCCG GTTGAGAAGA CCAAAGAACA AGGTCCAGAA AAAACTGAAG AA - #GTTCCAGT 180 - AAAACCAACA GAAGAAACAC CAGTAAATCC AAATGAAGGT ACTACAGAAG GA - #ACCTCAAT 240 - TCAAGAAGCA GAAAATCCAG TTCAACCTGC AGAAGAATCA ACAACGAATT CA - #GAGAAAGT 300 - ATCACCAGAT ACATCTAGCA AAAATACTGG GGAAGTGTCC AGTAATCCTA GT - #GATTCGAC 360 - AACCTCAGTT GGAGAATCAA ATAAACCAGA ACATAATGAC TCTAAAAATG AA - #AATTCAGA 420 - AAAAACTGTA GAAGAAGTTC CAGTAAATCC AAATGAAGGC ACAGTAGAAG GT - #ACCTCAAA 480 - TCAAGAAACA GAAAAACCAG TTCAACCTGC AGAAGAAACA CAAACAAACT CT - #GGGAAAAT 540 - AGCTAACGAA AATACTGGAG AAGTATCCAA TAAACCTAGT GATTCAAAAC CA - #CCAGTTGA 600 - AGAATCAAAT CAACCAGAAA AAAACGGAAC TGCAACAAAA CCAGAAAATT CA - #GGTAATAC 660 - AACATCAGAG AATGGACAAA CAGAACCAGA ACCATCAAAC GGAAATTCAA CT - #GAGGATGT 720 - TTCAACCGAA TCAAACACAT CCAATTCAAA TGGAAACGAA GAAATTAAAC AA - #GAAAATGA 780 - ACTAGACCCT GATAAAAAGG TAGAAGAACC AGAGAAAACA CTTGAATTAA GA - #AAT 835 - (2) INFORMATION FOR SEQ ID NO:94: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 278 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: - Asp Lys Gly Glu Thr Glu Val Gln Pro Glu Se - #r Pro Asp Thr Val Val # 15 - Ser Asp Lys Gly Glu Pro Glu Gln Val Ala Pr - #o Leu Pro Glu Tyr Lys # 30 - Gly Asn Ile Glu Gln Val Lys Pro Glu Thr Pr - #o Val Glu Lys Thr Lys # 45 - Glu Gln Gly Pro Glu Lys Thr Glu Glu Val Pr - #o Val Lys Pro Thr Glu # 60 - Glu Thr Pro Val Asn Pro Asn Glu Gly Thr Th - #r Glu Gly Thr Ser Ile #80 - Gln Glu Ala Glu Asn Pro Val Gln Pro Ala Gl - #u Glu Ser Thr Thr Asn # 95 - Ser Glu Lys Val Ser Pro Asp Thr 
Ser Ser Ly - #s Asn Thr Gly Glu Val # 110 - Ser Ser Asn Pro Ser Asp Ser Thr Thr Ser Va - #l Gly Glu Ser Asn Lys # 125 - Pro Glu His Asn Asp Ser Lys Asn Glu Asn Se - #r Glu Lys Thr Val Glu # 140 - Glu Val Pro Val Asn Pro Asn Glu Gly Thr Va - #l Glu Gly Thr Ser Asn 145 1 - #50 1 - #55 1 - #60 - Gln Glu Thr Glu Lys Pro Val Gln Pro Ala Gl - #u Glu Thr Gln Thr Asn # 175 - Ser Gly Lys Ile Ala Asn Glu Asn Thr Gly Gl - #u Val Ser Asn Lys Pro # 190 - Ser Asp Ser Lys Pro Pro Val Glu Glu Ser As - #n Gln Pro Glu Lys Asn # 205 - Gly Thr Ala Thr Lys Pro Glu Asn Ser Gly As - #n Thr Thr Ser Glu Asn # 220 - Gly Gln Thr Glu Pro Glu Pro Ser Asn Gly As - #n Ser Thr Glu Asp Val 225 2 - #30 2 - #35 2 - #40 - Ser Thr Glu Ser Asn Thr Ser Asn Ser Asn Gl - #y Asn Glu Glu Ile Lys # 255 - Gln Glu Asn Glu Leu Asp Pro Asp Lys Lys Va - #l Glu Glu Pro Glu Lys

# 270 - Thr Leu Glu Leu Arg Asn 275 - (2) INFORMATION FOR SEQ ID NO: 95: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 709 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #95: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - AAATCAATTG GTAGCACAAG ATCCAAAAGC ACAAGATAGC ACTAAACTGA CT - #GCTGAAAA 60 - ATCAACTGTT AAAGCACCTG CTCAAAGAGT AGATGTAAAA GATATAACTC AT - #TTAACAGA 120 - TGAAGAAAAA GTTAAGGTTG CTATTTTACA AGCAAATGGT TCAGCATTAG AC - #GGAGCGAC 180 - AATCAATGTA GCTGGAGATG GTACAGCAAC AATCACATTC CCAGATGGTT CA - #GTAGTGAC 240 - GATTCTAGGA AAAGATACAG TTCAACAATC TGCGAAAGGT GAATCTGTAA CT - #CAAGAAGC 300 - TACACCAGAG TATAAGCTAG AAAATACACC AGGTGGAGAT AAGGGAGGCA AT - #ACTGGAAG 360 - CTCAGATGCT AATGCGAATG AAGGCGGTGG TAGCCAGGCG GGTGGATCAG CT - #CACACAGG 420 - TTCACAAAAC TCAGCTCAAT CACAAGCTTC TAAGCAATTA GCTACTGAAA AA - #GAATCAGC 480 - TAAAAATGCC ATTGAAAAAG CAGCCAAGGA CAAGCAGGAT GAAATCAAAG GC - #GCACCGCT 540 - TTCTGATAAA GAAAAAGCAG AACTTTTAGC AAGAGTGGAA GCAGAAAAAC AA - #GCAGCTCT 600 - CAAAGAGATT GAAAATGCGA AAACTATGGA AGATGTGAAG GAAGCAGAAA CG - #ATTGGAGT 660 # 709GGTTA CAGTTCCTAA GAGACCAGTG GCTCCTAAT - (2) INFORMATION FOR SEQ ID NO:96: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 236 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: - Asn Gln Leu Val Ala Gln Asp Pro Lys Ala Gl - #n Asp Ser Thr Lys Leu # 15 - Thr Ala Glu Lys Ser Thr Val Lys Ala Pro Al - #a Gln Arg Val Asp Val # 30 - Lys Asp Ile Thr His Leu Thr Asp Glu Glu Ly - #s Val Lys Val Ala Ile # 45 - Leu Gln Ala Asn Gly Ser Ala Leu Asp Gly Al - #a Thr Ile Asn Val Ala # 60 - Gly Asp Gly Thr Ala Thr Ile Thr Phe Pro As - #p Gly Ser Val Val Thr #80 - Ile Leu Gly Lys Asp Thr Val Gln Gln Ser Al - #a Lys Gly Glu Ser Val # 95 - Thr Gln Glu Ala Thr Pro Glu Tyr Lys Leu Gl - #u Asn Thr Pro Gly Gly # 110 - Asp Lys Gly Gly Asn Thr Gly Ser Ser Asp Al - #a Asn Ala Asn Glu Gly # 125 - Gly Gly Ser Gln Ala Gly Gly Ser Ala 
His Th - #r Gly Ser Gln Asn Ser # 140 - Ala Gln Ser Gln Ala Ser Lys Gln Leu Ala Th - #r Glu Lys Glu Ser Ala 145 1 - #50 1 - #55 1 - #60 - Lys Asn Ala Ile Glu Lys Ala Ala Lys Asp Ly - #s Gln Asp Glu Ile Lys # 175 - Gly Ala Pro Leu Ser Asp Lys Glu Lys Ala Gl - #u Leu Leu Ala Arg Val # 190 - Glu Ala Glu Lys Gln Ala Ala Leu Lys Glu Il - #e Glu Asn Ala Lys Thr

# 205 - Met Glu Asp Val Lys Glu Ala Glu Thr Ile Gl - #y Val Gln Ala Ile Ala # 220 - Met Val Thr Val Pro Lys Arg Pro Val Ala Pr - #o Asn 225 2 - #30 2 - #35 - (2) INFORMATION FOR SEQ ID NO: 97: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 787 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #97: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CAAACAGTCA GCTTCAGGAA CGATTGAGGT GATTTCACGA GAAAATGGCT CT - #GGGACACG 60 - GGGTGCCTTC ACAGAAATCA CAGGGATTCT CAAAAAAGAC GGTGATAAAA AA - #ATTGACAA 120 - CACTGCCAAA ACAGCTGTGA TTCAAAATAG TACAGAAGGT GTTCTCTCAG CA - #GTTCAAGG 180 - GAATGCTAAT GCTATCGGCT ACATCTCCTT GGGATCTTTA ACGAAATCTG TC - #AAGGCTTT 240 - AGAGATTGAT GGTGTCAAGG CTAGTCGAGA CACAGTTTTA GATGGTGAAT AC - #CCTCTTCA 300 - ACGTCCCTTC AACATTGTTT GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG AT - #TTTATCAG 360 - CTTTATCCAC TCCAAACAAG GTCAACAAGT GGTCACAGAT AATAAATTTA TT - #GAAGCTAA 420 - AACCGAAACC ACGGAATATA CAAGCCAACA CTTATCAGGC AAGTTGTCTG TT - #GTAGGTTC 480 - CACTTCAGTA TCTTCTTTAA TGGAAAAATT AGCAGAAGCT TATAAAAAAG AA - #AATCCAGA 540 - AGTTACGATT GATATTACCT CTAATGGGTC TTCAGCAGGT ATTACCGCTG TT - #AAGGAGAA 600 - AACCGCTGAT ATTGGTATGG TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AG - #AGTCTCAC 660 - CCATGATGCT ATTGCTTTAG ACGGTATTGC TGTTGTGGTC AATAATGACA AT - #AAGGCAAG 720 - CCAAGTCAGT ATGGCTGAAC TTGCAGACGT TTTTAGTGGC AAATTAACCA CC - #TGGGACAA 780 # 787 - (2) INFORMATION FOR SEQ ID NO:98: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 262 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: - Lys Gln Ser Ala Ser Gly Thr Ile Glu Val Il - #e Ser Arg Glu Asn Gly # 15 - Ser Gly Thr Arg Gly Ala Phe Thr Glu Ile Th - #r Gly Ile Leu Lys Lys # 30 - Asp Gly Asp Lys Lys Ile Asp Asn Thr Ala Ly - #s Thr Ala Val Ile Gln # 45 - Asn Ser Thr Glu Gly Val Leu Ser Ala Val Gl - #n Gly Asn Ala Asn Ala # 60 - Ile Gly Tyr Ile Ser Leu Gly Ser Leu Thr Ly - #s Ser Val Lys Ala Leu #80 - Glu Ile Asp Gly Val Lys Ala 
Ser Arg Asp Th - #r Val Leu Asp Gly Glu # 95 - Tyr Pro Leu Gln Arg Pro Phe Asn Ile Val Tr - #p Ser Ser Asn Leu Ser # 110 - Lys Leu Gly Gln Asp Phe Ile Ser Phe Ile Hi - #s Ser Lys Gln Gly Gln # 125 - Gln Val Val Thr Asp Asn Lys Phe Ile Glu Al - #a Lys Thr Glu Thr Thr # 140 - Glu Tyr Thr Ser Gln His Leu Ser Gly Lys Le - #u Ser Val Val Gly Ser 145 1 - #50 1 - #55 1 - #60 - Thr Ser Val Ser Ser Leu Met Glu Lys Leu Al - #a Glu Ala Tyr Lys Lys # 175 - Glu Asn Pro Glu Val Thr Ile Asp Ile Thr Se - #r Asn Gly Ser Ser Ala # 190 - Gly Ile Thr Ala Val Lys Glu Lys Thr Ala As - #p Ile Gly Met Val Ser # 205 - Arg Glu Leu Thr Pro Glu Glu Gly Lys Ser Le - #u Thr His Asp Ala Ile # 220 - Ala Leu Asp Gly Ile Ala Val Val Val Asn As - #n Asp Asn Lys Ala Ser 225 2 - #30 2 - #35 2 - #40 - Gln Val Ser Met Ala Glu Leu Ala Asp Val Ph - #e Ser Gly Lys Leu Thr # 255 - Thr Trp Asp Lys Ile Lys 260 - (2) INFORMATION FOR SEQ ID NO: 99: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 421 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #99: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - ATTCGATGAT GCGGATGAAA AGATGACCCG TGATGAAATT GCCTATATGC TG - #ACAAATAG 60 - TGAAGAAACA TTGGATGCTG ATGAGATTGA GATGCTACAA GGTGTCTTTT CG - #CTCGATGA 120 - ACTGATGGCA CGAGAGGTTA TGGTTCCTCG AACGGATGCC TTTATGGTGG AT - #ATTCAGGA 180 - TGATAGTCAA GCCATTATCC AAAGTATTTT AAAACAAAAT TATTCTCGTA TC - #CCGGTTTA 240 - TGATGGGGAT AAGGACAATG TAATTGGAAT CATTCACACC AAGAGTCTCC TT - #AAGGCAGG 300 - CTTTGTGGAC GGTTTTGACA ATATTGTTTG GAAGAGAATT TTACAAGATC CA - #CTTTTTGT 360 - ACCTGAAACT ATTTTTGTGG ATGACTTGCT AAAAGAACTG CGAAATACCC AA - #AGACAAAT 420 # 421 - (2) INFORMATION FOR SEQ ID NO:100: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 140 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:100: - Phe Asp Asp Ala Asp Glu Lys Met Thr Arg As - #p Glu Ile Ala Tyr Met # 15 - Leu Thr Asn Ser Glu Glu Thr Leu Asp Ala As - #p Glu Ile 
Glu Met Leu # 30 - Gln Gly Val Phe Ser Leu Asp Glu Leu Met Al - #a Arg Glu Val Met Val # 45 - Pro Arg Thr Asp Ala Phe Met Val Asp Ile Gl - #n Asp Asp Ser Gln Ala # 60 - Ile Ile Gln Ser Ile Leu Lys Gln Asn Tyr Se - #r Arg Ile Pro Val Tyr #80 - Asp Gly Asp Lys Asp Asn Val Ile Gly Ile Il - #e His Thr Lys Ser Leu # 95 - Leu Lys Ala Gly Phe Val Asp Gly Phe Asp As - #n Ile Val Trp Lys Arg # 110 - Ile Leu Gln Asp Pro Leu Phe Val Pro Glu Th - #r Ile Phe Val Asp Asp # 125 - Leu Leu Lys Glu Leu Arg Asn Thr Gln Arg Gl - #n Met # 140 - (2) INFORMATION FOR SEQ ID NO: 101: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 331 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #101: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GGAGAGTCGA TCAAAAGTAG ATGAAGCTGT GTCTAAGTTT GAAAAGGACT CA - #TCTTCTTC 60 - GTCAAGTTCA GACTCTTCCA CTAAACCGGA AGCTTCAGAT ACAGCGAAGC CA - #AACAAGCC 120 - GACAGAACCA GGAGAAAAGG TAGCAGAAGC TAAGAAGAAG GTTGAAGAAG CT - #GAGAAAAA 180 - AGCCAAGGAT CAAAAAGAAG AAGATCGTCG TAACTACCCA ACCATTACTT AC - #AAAACGCT 240 - TGAACTTGAA ATTGCTGAGT CCGATGTGGA AGTTAAAAAA GCGGAGCTTG AA - #CTAGTAAA 300 # 331 CCTC GAGACGAGCA A - (2) INFORMATION FOR SEQ ID NO:102: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 110 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:102: - Glu Ser Arg Ser Lys Val Asp Glu Ala Val Se - #r Lys Phe Glu Lys Asp # 15 - Ser Ser Ser Ser Ser Ser Ser Asp Ser Ser Th - #r Lys Pro Glu Ala Ser # 30 - Asp Thr Ala Lys Pro Asn Lys Pro Thr Glu Pr - #o Gly Glu Lys Val Ala # 45 - Glu Ala Lys Lys Lys Val Glu Glu Ala Glu Ly - #s Lys Ala Lys Asp Gln # 60 - Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr Il - #e Thr Tyr Lys Thr Leu #80 - Glu Leu Glu Ile Ala Glu Ser Asp Val Glu Va - #l Lys Lys Ala Glu Leu # 95 - Glu Leu Val Lys Val Lys Ala Asn Glu Pro Ar - #g Asp Glu Gln # 110 - (2) INFORMATION FOR SEQ ID NO: 103: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 358 base (B) 
TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #103: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - ATGGACAACA GGAAACTGGG ACGAGGTTAT ATCTGGTAAG ATTGACAAGT AC - #AAAGATCC 60 - AGATATTCCA ACAGTTGAAT CACAAGAAGT TACGTCAGAC TCTAGTGATA AA - #GAAATAAC 120 - GGTAAGGTAT GACCGTTTAT CAACACCAGA AAAACCAATC CCACAACCAA AT - #CCAGAGCA 180 - TCCAAGTGTT CCGACACCAA ACCCAGAACT ACCAAATCAA GAGACTCCAA CA - #CCAGATAA 240 - ACCAACTCCA GAACCAGGTA CTCCAAAAAC TGAAACTCCA GTGAATCCAG AC - #CCAGAAGT 300 - TCCGACTTAT GAGACAGGTA AGAGAGAGGA ATTGCCAAAC ACAGGTACAG AA - #GCTAAT 358 - (2) INFORMATION FOR SEQ ID NO:104: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 119 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: - Trp Thr Thr Gly Asn Trp Asp Glu Val Ile Se - #r Gly Lys Ile Asp Lys # 15 - Tyr Lys Asp Pro Asp Ile Pro Thr Val Glu Se - #r Gln Glu Val Thr Ser # 30 - Asp Ser Ser Asp Lys Glu Ile Thr Val Arg Ty - #r Asp Arg Leu Ser Thr # 45 - Pro Glu Lys Pro Ile Pro Gln Pro Asn Pro Gl - #u His Pro Ser Val Pro # 60 - Thr Pro Asn Pro Glu Leu Pro Asn Gln Glu Th - #r Pro Thr Pro Asp Lys #80 - Pro Thr Pro Glu Pro Gly Thr Pro Lys Thr Gl - #u Thr Pro Val Asn Pro # 95 - Asp Pro Glu Val Pro Thr Tyr Glu Thr Gly Ly - #s Arg Glu Glu Leu Pro # 110 - Asn Thr Gly Thr Glu Ala Asn 115 - (2) INFORMATION FOR SEQ ID NO: 105: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1879 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #105: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CGATGGGCTC AATCCAACCC CAGGTCAAGT CTTACCTGAA GAGACATCGG GA - #ACGAAAGA 60 - GGGTGACTTA TCAGAAAAAC CAGGAGACAC CGTTCTCACT CAAGCGAAAC CT - #GAGGGCGT 120 - TACTGGAAAT ACGAATTCAC TTCCGACACC TACAGAAAGA ACTGAAGTGA GC - #GAGGAAAC 180 - AAGCCCTTCT AGTCTGGATA CACTTTTTGA AAAAGATGAA GAAGCTCAAA AA - #AATCCAGA 240 - GCTAACAGAT GTCTTAAAAG AAACTGTAGA TACAGCTGAT GTGGATGGGA CA - #CAAGCAAG 300 - TCCAGCAGAA ACTACTCCTG AACAAGTAAA AGGTGGAGTG 
AAAGAAAATA CA - #AAAGACAG 360 - CATCGATGTT CCTGCTGCTT ATCTTGAAAA AGCTGAAGGG AAAGGTCCTT TC - #ACTGCCGG 420 - TGTAAACCAA GTAATTCCTT ATGAACTATT CGCTGGTGAT GGTATGTTAA CT - #CGTCTATT 480 - ACTAAAAGCT TCGGATAATG CTCCTTGGTC TGACAATGGT ACTGCTAAAA AT - #CCTGCTTT 540 - ACCTCCTCTT GAAGGATTAA CAAAAGGGAA ATACTTCTAT GAAGTAGACT TA - #AATGGCAA 600 - TACTGTTGGT AAACAAGGTC AAGCTTTAAT TGATCAACTT CGCGCTAATG GT - #ACTCAAAC 660 - TTATAAAGCT ACTGTTAAAG TTTACGGAAA TAAAGACGGT AAAGCTGACT TG - #ACTAATCT 720 - AGTTGCTACT AAAAATGTAG ACATCAACAT CAATGGATTA GTTGCTAAAG AA - #ACAGTTCA 780 - AAAAGCCGTT GCAGACAACG TTAAAGACAG TATCGATGTT CCAGCAGCCT AC - #CTAGAAAA 840 - AGCCAAGGGT GAAGGTCCAT TCACAGCAGG TGTCAACCAT GTGATTCCAT AC - #GAACTCTT 900 - CGCAGGTGAT GGCATGTTGA CTCGTCTCTT GCTCAAGGCA TCTGACAAGG CA - #CCATGGTC 960 - AGATAACGGC GACGCTAAAA ACCCAGCCCT ATCTCCACTA GGCGAAAACG TG - #AAGACCAA 1020

- AGGTCAATAC TTCTATCAAN TAGCCTTGGA CGGAAATGTA GCTGGCAAAG AA - #AAACAAGC 1080 - GCTCATTGAC CAGTTCCGAG CAAANGGTAC TCAAACTTAC AGCGCTACAG TC - #AATGTCTA 1140 - TGGTAACAAA GACGGTAAAC CAGACTTGGA CAACATCGTA GCAACTAAAA AA - #GTCACTAT 1200 - TAACATAAAC GGTTTAATTT CTAAAGAAAC AGTTCAAAAA GCCGTTGCAG AC - #AACGTTAA 1260 - NGACAGTATC GATGTTCCAG CAGCCTACCT AGAAAAAGCC AAGGGTGAAG GT - #CCATTCAC 1320 - AGCAGGTGTC AACCATGTGA TTCCATACGA ACTCTTCGCA GGTGATGGTA TG - #TTGACTCG 1380 - TCTCTTGCTC AAGGCATCTG ACAAGGCACC ATGGTCAGAT AACGGNGACG CT - #AAAAACCC 1440 - AGCNCTATCT CCACTAGGTG AAAACGTGAA GACCAAAGGT CAATACTTCT AT - #CAANTAGC 1500 - CTTGGACGGA AATGTAGCTG GCAAAGAAAA ACAAGCGCTC ATTGACCAGT TC - #CGAGCAAA 1560 - CGGTACTCAA ACTTACAGCG CTACAGTCAA TGTCTATGGT AACAAAGACG GT - #AAACCAGA 1620 - CTTGGACAAC ATCGTAGCAA CTAAAAAAGT CACTATTAAG ATAAATGTTA AA - #GAAACATC 1680 - AGACACAGCA AATGGTTCAT TATCACCTTC TAACTCTGGT TCTGGCGTGA CT - #CCGATGAA 1740 - TCACAATCAT GCTACAGGTA CTACAGATAG CATGCCTGCT GACACCATGA CA - #AGTTCTAC 1800 - CAACACGATG GCAGGTGAAA ACATGGCTGC TTCTGCTAAC AAGATGTCTG AT - #ACGATGAT 1860 # 187 - #9 - (2) INFORMATION FOR SEQ ID NO:106: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 626 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: - Asp Gly Leu Asn Pro Thr Pro Gly Gln Val Le - #u Pro Glu Glu Thr Ser # 15 - Gly Thr Lys Glu Gly Asp Leu Ser Glu Lys Pr - #o Gly Asp Thr Val Leu # 30 - Thr Gln Ala Lys Pro Glu Gly Val Thr Gly As - #n Thr Asn Ser Leu Pro # 45 - Thr Pro Thr Glu Arg Thr Glu Val Ser Glu Gl - #u Thr Ser Pro Ser Ser # 60 - Leu Asp Thr Leu Phe Glu Lys Asp Glu Glu Al - #a Gln Lys Asn Pro Glu #80 - Leu Thr Asp Val Leu Lys Glu Thr Val Asp Th - #r Ala Asp Val Asp Gly # 95 - Thr Gln Ala Ser Pro Ala Glu Thr Thr Pro Gl - #u Gln Val Lys Gly Gly # 110 - Val Lys Glu Asn Thr Lys Asp Ser Ile Asp Va - #l Pro Ala Ala Tyr Leu # 125 - Glu Lys Ala Glu Gly Lys Gly Pro Phe Thr Al - #a Gly Val Asn Gln Val 
# 140 - Ile Pro Tyr Glu Leu Phe Ala Gly Asp Gly Me - #t Leu Thr Arg Leu Leu 145 1 - #50 1 - #55 1 - #60 - Leu Lys Ala Ser Asp Asn Ala Pro Trp Ser As - #p Asn Gly Thr Ala Lys # 175 - Asn Pro Ala Leu Pro Pro Leu Glu Gly Leu Th - #r Lys Gly Lys Tyr Phe # 190 - Tyr Glu Val Asp Leu Asn Gly Asn Thr Val Gl - #y Lys Gln Gly Gln Ala # 205 - Leu Ile Asp Gln Leu Arg Ala Asn Gly Thr Gl - #n Thr Tyr Lys Ala Thr # 220 - Val Lys Val Tyr Gly Asn Lys Asp Gly Lys Al - #a Asp Leu Thr Asn Leu 225 2 - #30 2 - #35 2 - #40 - Val Ala Thr Lys Asn Val Asp Ile Asn Ile As - #n Gly Leu Val Ala Lys # 255 - Glu Thr Val Gln Lys Ala Val Ala Asp Asn Va - #l Lys Asp Ser Ile Asp # 270 - Val Pro Ala Ala Tyr Leu Glu Lys Ala Lys Gl - #y Glu Gly Pro Phe Thr # 285 - Ala Gly Val Asn His Val Ile Pro Tyr Glu Le - #u Phe Ala Gly Asp Gly # 300 - Met Leu Thr Arg Leu Leu Leu Lys Ala Ser As - #p Lys Ala Pro Trp Ser 305 3 - #10 3 - #15 3 - #20 - Asp Asn Gly Asp Ala Lys Asn Pro Ala Leu Se - #r Pro Leu Gly Glu Asn # 335 - Val Lys Thr Lys Gly Gln Tyr Phe Tyr Gln Xa - #a Ala Leu Asp Gly Asn # 350 - Val Ala Gly Lys Glu Lys Gln Ala Leu Ile As - #p Gln Phe Arg Ala Xaa # 365 - Gly Thr Gln Thr Tyr Ser Ala Thr Val Asn Va - #l Tyr Gly Asn Lys Asp # 380 - Gly Lys Pro Asp Leu Asp Asn Ile Val Ala Th - #r Lys Lys Val Thr Ile 385 3 - #90 3 - #95 4 - #00 - Asn Ile Asn Gly Leu Ile Ser Lys Glu Thr Va - #l Gln Lys Ala Val Ala # 415 - Asp Asn Val Xaa Asp Ser Ile Asp Val Pro Al - #a Ala Tyr Leu Glu Lys # 430 - Ala Lys Gly Glu Gly Pro Phe Thr Ala Gly Va - #l Asn His Val Ile Pro # 445 - Tyr Glu Leu Phe Ala Gly Asp Gly Met Leu Th - #r Arg Leu Leu Leu Lys # 460 - Ala Ser Asp Lys Ala Pro Trp Ser Asp Asn Gl - #y Asp Ala Lys Asn Pro 465 4 - #70 4 - #75 4 - #80 - Ala Leu Ser Pro Leu Gly Glu Asn Val Lys Th - #r Lys Gly Gln Tyr Phe # 495 - Tyr Gln Xaa Ala Leu Asp Gly Asn Val Ala Gl - #y Lys Glu Lys Gln Ala # 510 - Leu Ile Asp Gln Phe Arg Ala Asn Gly Thr Gl - #n Thr Tyr Ser Ala Thr # 525 - Val Asn Val Tyr Gly Asn Lys Asp Gly Lys Pr - #o Asp Leu 
Asp Asn Ile # 540 - Val Ala Thr Lys Lys Val Thr Ile Lys Ile As - #n Val Lys Glu Thr Ser 545 5 - #50 5 - #55 5 - #60 - Asp Thr Ala Asn Gly Ser Leu Ser Pro Ser As - #n Ser Gly Ser Gly Val # 575 - Thr Pro Met Asn His Asn His Ala Thr Gly Th - #r Thr Asp Ser Met Pro # 590 - Ala Asp Thr Met Thr Ser Ser Thr Asn Thr Me - #t Ala Gly Glu Asn Met # 605 - Ala Ala Ser Ala Asn Lys Met Ser Asp Thr Me - #t Met Ser Glu Asp Lys # 620 - Ala Met 625 - (2) INFORMATION FOR SEQ ID NO: 107: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 593 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #107: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TTCCAATCAA AAACAGGCAG ATGGTAAACT CAATATCGTG ACAACCTTTT AC - #CCTGTCTA 60 - TGArTTTACC AAGCAAGTCG CAGGAGATAC GGCTAATGTA GAACTCCTAA TC - #GGTGCTGG 120 - GACAGAACCT CATGAATACG AACCATCTGC CAAGGCAGTT GCCAAAATCC AA - #GATGCAGA 180 - TACCTTCGTT TATGAAAATG AAAACATGGA AACATGGGTA CCTAAATTGC TA - #GATACCTT 240 - GGATAAGAAA AAAGTGAAAA CCATCAAGGC GACAGGCGAT ATGTTGCTCT TG - #CCAGGTGG 300 - CGAGGAAGAA GAGGGAGACC ATGACCATGG AGAAGAAGGT CATCACCATG AG - #TTTGACCC 360 - CCATGTTTGG TTATCACCAG TTCGTGCCAT tAAACTAGTA GAGCACCATC CG - #CGACACTT 420 - GTCAGCAGAT TATCCTGATA AAAAAGAGAC CTTTGAGAAG AATGCAGCTG CC - #TATATCGA 480 - AAAATTGCAA GCCTTGGATA AGGCTTACGC AGAAGGTTTG TCTCAAGCAA AA - #CAAAAGAG 540 - CTTTGTGACT CAACACGCAg CCTTTAACTa TCTTGCCTTG GACTATGGGA CT - #C 593 - (2) INFORMATION FOR SEQ ID NO:108: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 197 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:108: - Ser Asn Gln Lys Gln Ala Asp Gly Lys Leu As - #n Ile Val Thr Thr Phe # 15 - Tyr Pro Val Tyr Glu Phe Thr Lys Gln Val Al - #a Gly Asp Thr Ala Asn # 30 - Val Glu Leu Leu Ile Gly Ala Gly Thr Glu Pr - #o His Glu Tyr Glu Pro # 45 - Ser Ala Lys Ala Val Ala Lys Ile Gln Asp Al - #a Asp Thr Phe Val Tyr # 60 - Glu Asn Glu Asn Met Glu Thr Trp Val Pro Ly - #s Leu Leu Asp 
Thr Leu #80 - Asp Lys Lys Lys Val Lys Thr Ile Lys Ala Th - #r Gly Asp Met Leu Leu # 95 - Leu Pro Gly Gly Glu Glu Glu Glu Gly Asp Hi - #s Asp His Gly Glu Glu # 110 - Gly His His His Glu Phe Asp Pro His Val Tr - #p Leu Ser Pro Val Arg # 125 - Ala Ile Lys Leu Val Glu His His Pro Arg Hi - #s Leu Ser Ala Asp Tyr # 140 - Pro Asp Lys Lys Glu Thr Phe Glu Lys Asn Al - #a Ala Ala Tyr Ile Glu 145 1 - #50 1 - #55 1 - #60 - Lys Leu Gln Ala Leu Asp Lys Ala Tyr Ala Gl - #u Gly Leu Ser Gln Ala # 175 - Lys Gln Lys Ser Phe Val Thr Gln His Ala Al - #a Phe Asn Tyr Leu Ala # 190 - Leu Asp Tyr Gly Thr 195 - (2) INFORMATION FOR SEQ ID NO: 109: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1003 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #109: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TATCACAGGA TCGAACGGTA AGACAACCAC AACGACTATG ATTGGGGAAG TT - #TTGACTGC 60 - TGCTGGCCAA CATGGTCTTT TATCAGGGAA TATCGGCTAT CCAGCTAGTC AG - #GTTGCTCA 120 - AATAGCATCA GATAAGGACA CGCTTGTTAT GGAACTTTCT TCTTTCCAAC TC - #ATGGGTGT 180 - TCAAGAATTC CATCCAGAGA TTGCGGTTAT TACCAACCTC ATGCCAACTC AT - #ATCGACTA 240 - CCATGGGTCA TTTTCGGAAT ATGTAGCAGC CAAGTGGAAT ATCCAGAACA AG - #ATGACAGC 300 - AGCTGATTTC CTTGTCTTGA ACTTTAATCA AGACTTGGCA AAAGACTTGA CT - #TCCAAGAC 360 - AGAAGCCACT GTTGTACCAT TTTCAACACT TGAAAAGGTT GATGGAGCTT AT - #CTGGAAGA 420 - TGGTCAACTC TACTTCCGTG GTGAAGTAGT CATGGCAGCG AATGAAATCG GT - #GTTCCAGG 480 - TAGCCACAAT GTGGAAAATG CCCTTGCGAC TATTGCTGTA GCCAAGCTTC GT - #GATGTGGA 540 - CAATCAAACC ATCAAGGAAA CTCTTTCAGC CTTCGGTGGT GTCAAACACC GT - #CTCCAGTT 600 - TGTGGATGAC ATCAAGGGTG TTAAATTCTA TAACGACAGT AAATCAACTA AT - #ATCTTGGC 660 - TACTCAAAAA GCCTTGTCAG GATTTGACAA CAGCAAGGTC GTCTTGATTG CA - #GGTGGTTT 720 - GGACCGTGGC AATGAGTTTG ACGAATTGGT GCCAGACATT ACTGGACTCA AG - #AAGATGGT 780 - CATCCTGGGT CAATCTGCAG AACGTGTCAA ACGGGCAGCA GACAAGGCTG GT - #GTCGCTTA 840 - TGTGGAGGCG ACAGATATTG CAGATGCGAC CCGCAAGGCC TATGAGCTTG CG - #ACTCAAGG 900 - AGATGTGGTT CTTCTTAGTC CTGCCAATGC TAGCTGGGAT ATGTATGCTA 
AC - #TTTGAAGT 960 # 100 - #3ACAGTAGC GGAGTTAAAA GAA - (2) INFORMATION FOR SEQ ID NO:110: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 335 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:110: - Gly Ile Thr Gly Ser Asn Gly Lys Thr Thr Th - #r Thr Thr Met Ile Gly # 15 - Glu Val Leu Thr Ala Ala Gly Gln His Gly Le - #u Leu Ser Gly Asn Ile # 30 - Gly Tyr Pro Ala Ser Gln Val Ala Gln Ile Al - #a Ser Asp Lys Asp Thr # 45 - Leu Val Met Glu Leu Ser Ser Phe Gln Leu Me - #t Gly Val Gln Glu Phe # 60 - His Pro Glu Ile Ala Val Ile Thr Asn Leu Me - #t Pro Thr His Ile Asp #80 - Tyr His Gly Ser Phe Ser Glu Tyr Val Ala Al - #a Lys Trp Asn Ile Gln # 95 - Asn Lys Met Thr Ala Ala Asp Phe Leu Val Le - #u Asn Phe Asn Gln Asp # 110 - Leu Ala Lys Asp Leu Thr Ser Lys Thr Glu Al - #a Thr Val Val Pro Phe # 125 - Ser Thr Leu Glu Lys Val Asp Gly Ala Tyr Le - #u Glu Asp Gly Gln Leu # 140 - Tyr Phe Arg Gly Glu Val Val Met Ala Ala As - #n Glu Ile Gly Val Pro 145 1 - #50 1 - #55 1 - #60 - Gly Ser His Asn Val Glu Asn Ala Leu Ala Th - #r Ile Ala Val Ala Lys
# 175 - Leu Arg Asp Val Asp Asn Gln Thr Ile Lys Gl - #u Thr Leu Ser Ala Phe # 190 - Gly Gly Val Lys His Arg Leu Gln Phe Val As - #p Asp Ile Lys Gly Val # 205 - Lys Phe Tyr Asn Asp Ser Lys Ser Thr Asn Il - #e Leu Ala Thr Gln Lys # 220 - Ala Leu Ser Gly Phe Asp Asn Ser Lys Val Va - #l Leu Ile Ala Gly Gly 225 2 - #30 2 - #35 2 - #40 - Leu Asp Arg Gly Asn Glu Phe Asp Glu Leu Va - #l Pro Asp Ile Thr Gly # 255 - Leu Lys Lys Met Val Ile Leu Gly Gln Ser Al - #a Glu Arg Val Lys Arg # 270 - Ala Ala Asp Lys Ala Gly Val Ala Tyr Val Gl - #u Ala Thr Asp Ile Ala # 285 - Asp Ala Thr Arg Lys Ala Tyr Glu Leu Ala Th - #r Gln Gly Asp Val Val # 300 - Leu Leu Ser Pro Ala Asn Ala Ser Trp Asp Me - #t Tyr Ala Asn Phe Glu 305 3 - #10 3 - #15 3 - #20 - Val Arg Gly Asp Leu Phe Ile Asp Thr Val Al - #a Glu Leu Lys Glu # 335 - (2) INFORMATION FOR SEQ ID NO: 111: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 412 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #111: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - AAGTTCATCG AAGATGGTTG GGAAGTCCAC TATATCGGGG ACAAGTGTGG TA - #TCGAACAC 60 - CAAGAAATCC TTAAGTCAGG TTTGGATGTC ACCTTCCATT CTATTGCGAC TG - #GAAAATTG 120 - CGTCGCTATT TCTCTTGGCA AAATATGCTG GACGTCTTCA AAGTTGGTTG GG - #GAATTGTC 180 - CAATCGCTCT TTATCATGTT GCGACTGCGT CCACAGACCC TTTTTTCAAA GG - #GGGGCTTT 240 - GTCTCAGTAC CGCCTGTTAT CGCTGCGCGT GTGTCAGGAG TGCCTGTCTT TA - #TTCACGAA 300 - TCTGACCTGT CTATGGGCTT GGCCAATAAA ATCGCCTATA AATTTGCGAC TA - #AGATGTAT 360 - TCAACCTTTG AACAAGCTTC GAGTTTGGCT AAGGTTGAGC ATGTGGGAGC GG - # 412 - (2) INFORMATION FOR SEQ ID NO:112: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 137 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:112: - Ser Ser Ser Lys Met Val Gly Lys Ser Thr Il - #e Ser Gly Thr Ser Val # 15 - Val Ser Asn Thr Lys Lys Ser Leu Ser Gln Va - #l Trp Met Ser Pro Ser # 30 - Ile Leu Leu Arg Leu Glu Asn Cys Val Ala Il - #e Ser Leu Gly Lys Ile 
# 45 - Cys Trp Thr Ser Ser Lys Leu Val Gly Glu Le - #u Ser Asn Arg Ser Leu # 60 - Ser Cys Cys Asp Cys Val His Arg Pro Phe Ph - #e Gln Arg Gly Ala Leu #80 - Ser Gln Tyr Arg Leu Leu Ser Leu Arg Val Cy - #s Gln Glu Cys Leu Ser # 95 - Leu Phe Thr Asn Leu Thr Cys Leu Trp Ala Tr - #p Pro Ile Lys Ser Pro # 110 - Ile Asn Leu Arg Leu Arg Cys Ile Gln Pro Le - #u Asn Lys Leu Arg Val # 125 - Trp Leu Arg Leu Ser Met Trp Glu Arg # 135 - (2) INFORMATION FOR SEQ ID NO: 113: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 544 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #113: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - ATCGCTAGCT AGTGAAATGC AAGAAAGTAC ACGTAAATTC AAGGTTACTG CT - #GACCTAAC 60 - AGATGCCGGT GTTGGAACGA TTGAAGTTCC TTTGAGCATT GAAGATTTAC CC - #AATGGGCT 120 - GACCGCTGTG GCGACTCCGC AAAAAATTAC AGTCAAGATT GGTAAGAAGG CT - #CAGAAGGA 180 - TAAGGTAAAG ATTGTACCAG AGATTGACCC TAGTCAAATT GATAGTCGGG TA - #CAAATTGA 240 - AAATGTCATG GTGTCAGATA AAGAAGTGTC TATTACGAGT GACCAAGAGA CA - #TTGGATAG 300 - AATTGATAAG ATTATCGCTG TTTTGCCAAC TAGCGAACGT ATAACAGGTA AT - #TACAGTGG 360 - TTCAGTACCT TTGCAGGCAA TCGACCGCAA TGGTGTTGTC TTACCGGCAG TT - #ATCACTCC 420 - GTTTGATACA ATAATGAAGG TGACTACAAA ACCAGTAGCA CCAAGTTCAA GC - #ACATCAAA 480 - TTCAAGTACA AGCAGTTCAT CGGAGACATC TTCGTCAACG AAAGCAACTA GT - #TCAAAAAC 540 # 544 - (2) INFORMATION FOR SEQ ID NO:114: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 181 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:114: - Ser Leu Ala Ser Glu Met Gln Glu Ser Thr Ar - #g Lys Phe Lys Val Thr # 15 - Ala Asp Leu Thr Asp Ala Gly Val Gly Thr Il - #e Glu Val Pro Leu Ser # 30 - Ile Glu Asp Leu Pro Asn Gly Leu Thr Ala Va - #l Ala Thr Pro Gln Lys # 45 - Ile Thr Val Lys Ile Gly Lys Lys Ala Gln Ly - #s Asp Lys Val Lys Ile # 60 - Val Pro Glu Ile Asp Pro Ser Gln Ile Asp Se - #r Arg Val Gln Ile Glu #80 - Asn Val Met Val Ser Asp Lys Glu Val Ser Il - #e Thr Ser Asp 
Gln Glu # 95 - Thr Leu Asp Arg Ile Asp Lys Ile Ile Ala Va - #l Leu Pro Thr Ser Glu # 110 - Arg Ile Thr Gly Asn Tyr Ser Gly Ser Val Pr - #o Leu Gln Ala Ile Asp # 125 - Arg Asn Gly Val Val Leu Pro Ala Val Ile Th - #r Pro Phe Asp Thr Ile # 140 - Met Lys Val Thr Thr Lys Pro Val Ala Pro Se - #r Ser Ser Thr Ser Asn 145 1 - #50 1 - #55 1 - #60 - Ser Ser Thr Ser Ser Ser Ser Glu Thr Ser Se - #r Ser Thr Lys Ala Thr # 175 - Ser Ser Lys Thr Asn 180 - (2) INFORMATION FOR SEQ ID NO: 115: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1267 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #115: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GCACCAGATG GGGCACAAGG TTCAGGGATC AGATGTTGAA AAGTACTACT TT - #ACCCAACG 60 - CGGTCTTGAG CAGGCAGGAA TTACCATTCT TCCTTTTGAT GAAAAAAATC TA - #GACGGTGA 120 - TATGGAAATT ATCGCTGGAA ATGCCTTTCG TCCAGATAAC AACGTCGAAA TT - #GCCTATGC 180 - GGACCAAAAT GGTATCAGCT ACAAACGTTA CCATGAGTTT CTAGGTAGCT TT - #ATGCGTGA 240 - CTTTGTTAGC ATGGGAGTAG CAGGAGCACA TGGAAAAACT TCAACGACAG GT - #ATGTTGTC 300 - TCATGTCTTG TCTCACATTA CAGATACCAG CTTCTTGATT GGAGATGGGA CA - #GGTCGTGG 360 - TTCGGCCAAT GCCAAATATT TTGTCTTTGA ATCTGACGAA TATGAGCGTC AC - #TTCATGCC 420 - TTACCACCCA GAATACTCTA TTATCACCAA CATTGACTTT GACCATCCAG AT - #TATTTCAC 480 - AAGTCTCGAG GATGTTTTTA ATGCCTTTAA CGACTATGCC AAACAAATCA CC - #AAGGGTCT 540 - TTTTGTCTAT GGTGAAGATG CTGAATTGCG TAAGATTACG TCTGATGCAC CA - #ATTTATTA 600 - TTATGGTTTT GAAGCTGAAG GCAATGACTT TGTAGCTAGT GATCTTCTTC GT - #TCAATAAC 660 - TGGTTCAACC TTCACCGTTC ATTTCCGTGG ACAAAACTTG GGGCAATTCC AC - #ATTCCAAC 720 - CTTTGGTCGT CACAATATCA TGAATGCGAC AGCCGTTATT GGTCTTCTTT AC - #ACAGCAGG 780 - ATTTGATTTG AACTTGGTGC GTGAGCACTT GAAAACATTT GCCGGTGTTA AA - #CGTCGTTT 840 - CACTGAGAAA ATTGTCAATG ATACAGTGAT TATCGATGAC TTTGCCCACC AT - #CCAACAGA 900 - AATTATTGCG ACCTTGGATG CGGCTCGTCA GAAATACCCA AGCAAGGAAA TT - #GTAGCAGT 960 - CTTTCAACCG CATACCTTTA CAAGAACCAT TGCCTTGTTG GACGACTTTG CC - #CATGCTTT 1020 - AAACCAAGCA GATGCTGTTT ATCTAGCGCA AATTTATGGC 
TCGGCTCGTG AA - #GTAGATCA 1080 - TGGTGACGTT AAGGTAGAAG ACCTAGCCAA CAAAATCAAC AAAAAACACC AA - #GTGATTAC 1140 - TGTTGAAAAT GTTTCTCCAC TCCTAGACCA TGACAATGCT GTTTACGTCT TT - #ATGGGAGC 1200 - AGGAGACATC CAAACCTATG AATACTCATT TGAGCGTCTC TTGTCTAACT TG - #ACAAGCAA 1260 # 1267 - (2) INFORMATION FOR SEQ ID NO:116: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 422 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:116: - His Gln Met Gly His Lys Val Gln Gly Ser As - #p Val Glu Lys Tyr Tyr # 15 - Phe Thr Gln Arg Gly Leu Glu Gln Ala Gly Il - #e Thr Ile Leu Pro Phe # 30 - Asp Glu Lys Asn Leu Asp Gly Asp Met Glu Il - #e Ile Ala Gly Asn Ala # 45 - Phe Arg Pro Asp Asn Asn Val Glu Ile Ala Ty - #r Ala Asp Gln Asn Gly # 60 - Ile Ser Tyr Lys Arg Tyr His Glu Phe Leu Gl - #y Ser Phe Met Arg Asp #80 - Phe Val Ser Met Gly Val Ala Gly Ala His Gl - #y Lys Thr Ser Thr Thr # 95 - Gly Met Leu Ser His Val Leu Ser His Ile Th - #r Asp Thr Ser Phe Leu # 110 - Ile Gly Asp Gly Thr Gly Arg Gly Ser Ala As - #n Ala Lys Tyr Phe Val # 125 - Phe Glu Ser Asp Glu Tyr Glu Arg His Phe Me - #t Pro Tyr His Pro Glu # 140 - Tyr Ser Ile Ile Thr Asn Ile Asp Phe Asp Hi - #s Pro Asp Tyr Phe Thr 145 1 - #50 1 - #55 1 - #60 - Ser Leu Glu Asp Val Phe Asn Ala Phe Asn As - #p Tyr Ala Lys Gln Ile # 175 - Thr Lys Gly Leu Phe Val Tyr Gly Glu Asp Al - #a Glu Leu Arg Lys Ile # 190 - Thr Ser Asp Ala Pro Ile Tyr Tyr Tyr Gly Ph - #e Glu Ala Glu Gly Asn # 205 - Asp Phe Val Ala Ser Asp Leu Leu Arg Ser Il - #e Thr Gly Ser Thr Phe # 220 - Thr Val His Phe Arg Gly Gln Asn Leu Gly Gl - #n Phe His Ile Pro Thr 225 2 - #30 2 - #35 2 - #40 - Phe Gly Arg His Asn Ile Met Asn Ala Thr Al - #a Val Ile Gly Leu Leu # 255 - Tyr Thr Ala Gly Phe Asp Leu Asn Leu Val Ar - #g Glu His Leu Lys Thr # 270 - Phe Ala Gly Val Lys Arg Arg Phe Thr Glu Ly - #s Ile Val Asn Asp Thr # 285 - Val Ile Ile Asp Asp Phe Ala His His Pro Th - #r Glu Ile Ile Ala Thr # 300 - Leu Asp 
Ala Ala Arg Gln Lys Tyr Pro Ser Ly - #s Glu Ile Val Ala Val 305 3 - #10 3 - #15 3 - #20 - Phe Gln Pro His Thr Phe Thr Arg Thr Ile Al - #a Leu Leu Asp Asp Phe # 335 - Ala His Ala Leu Asn Gln Ala Asp Ala Val Ty - #r Leu Ala Gln Ile Tyr # 350 - Gly Ser Ala Arg Glu Val Asp His Gly Asp Va - #l Lys Val Glu Asp Leu # 365 - Ala Asn Lys Ile Asn Lys Lys His Gln Val Il - #e Thr Val Glu Asn Val # 380 - Ser Pro Leu Leu Asp His Asp Asn Ala Val Ty - #r Val Phe Met Gly Ala 385 3 - #90 3 - #95 4 - #00 - Gly Asp Ile Gln Thr Tyr Glu Tyr Ser Phe Gl - #u Arg Leu Leu Ser Asn # 415 - Leu Thr Ser Asn Val Gln 420 - (2) INFORMATION FOR SEQ ID NO: 117: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 3121 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #117: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TTTTAACCCA ACTGTTGGTA CTTTCCTTTT TACTGCAGGA TTGAGCTTGT TA - #GTTTTATT 60 - GGTTTCTAAA AGGGAAAATG GAAAGAAACG ACTTGTTCAT TTTCTGCTGT TG - #ACTAGCAT
120 - GGGAGTTCAA TTGTTGCCGG CCAGTGCTTT TGGGTTGACC AGCCAGATTT TA - #TCTGCCTA 180 - TAATAGTCAG CTTTCTATCG GAGTCGGGGA ACATTTACCA GAGCCTCTGA AA - #ATCGAAGG 240 - TTATCAATAT ATTGGTTATA TCAAAACTAA GAAACAGGAT AATACAGAGC TT - #TCAAGGAC 300 - AGTTGATGGG AAATACTCTG CTCAAAGAGA TAGTCAACCA AACTCTACAA AA - #ACATCAGA 360 - TGTAGTTCAT TCAGCTGATT TAGAATGGAA CCAAGGACAG GGGAAGGTTA GT - #TTACAAGG 420 - TGAAGCATCA GGGGATGATG GACTTTCAGA AAAATCTTCT ATAGCAGCAG AC - #AATCTATC 480 - TTCTAATGAT TCATTCGCAA GTCAAGTTGA GCAGAATCCG GATCACAAAG GA - #GAATCTGT 540 - AGTTCGACCA ACAGTGCCAG AACAAGGAAA TCCTGTGTCT GCTACAACGG TG - #CAGAGTGC 600 - GGAAGAGGAA GTATTGGCGA CGACAAATGA TCGACCAGAG TATAAACTTC CA - #TTGGAAAC 660 - CAAAGGCACG CAAGAACCCG GTCATGAGGG TGAAGCCGCA GTCCGTGAAG AC - #TTACCAGT 720 - CTACACTAAG CCACTAGAAA CCAAAGGTAC ACAAGGACCC GGACATGAAG GT - #GAAGCTGC 780 - AGTTCGCGAG GAAGAACCAG CTTACACAGA ACCGTTAGCA ACGAAAGGCA CG - #CAAGAGCC 840 - AGGTCATGAG GGCAAAGCTA CAGTCCGCGA AGAGACTCTA GAGTACACGG AA - #CCGGTAGC 900 - GACAAAAGGC ACACAAGAAC CCGAACATGA GGGCGAaCGG sCAGTAGAAG AA - #GAACTTCC 960 - GGCTTTAGAG GTCACTACAC GAAATAGAAC GGAAATCCAG AATATTCCTT AT - #ACAACAGA 1020 - AGAAATTCAG GATCCAACAC TTCTGAAAAA TCGTCGTAAG ATTGAACGAC AA - #GGGCAAGC 1080 - AGGGACACGT ACAATTCAAT ATGAAGACTA CATCGTAAAT GGTAATGTCG TA - #GAAACTAA 1140 - AGAAGTGTCA CGAACTGAAG TAGCTCCGGT CAACGAAGTC GTTAAAGTAG GA - #ACACTTGT 1200 - GAAAGTTAAA CCTACAGTAG AAATTACAAA CTTAACAAAA GTTGAGAACA AA - #AAATCTAT 1260 - AACTGTAAGT TATAACTTAA TAGACACTAC CTCAGCATAT GTTTCTGCAA AA - #ACGCAAGT 1320 - TTTCCATGGA GACAAGCTAG TTAAAGAGGT GGATATAGAA AATCCTGCCA AA - #GAGCAAGT 1380 - AATATCAGGT TTAGATTACT ACACACCGTA TACAGTTAAA ACACACCTAA CT - #TATAATTT 1440 - GGGTGAAAAT AATGAGGAAA ATACTGAAAC ATCAACTCAA GATTTCCAAT TA - #GAGTATAA 1500 - GAAAATAGAG ATTAAAGATA TTGATTCAGT AGAATTATAC GGTAAAGAAA AT - #GATCGTTA 1560 - TCGTAGATAT TTAAGTCTAA GTGAAGCGCC GACTGATACG GCTAAATACT TT - #GTAAAAGT 1620 - GAAATCAGAT CGCTTCAAAG AAATGTACCT ACCTGTAAAA TCTATTACAG AA - #AATACGGA 1680 - 
TGGAACGTAT AAAGTGACGG TAGCCGTTGA TCAACTTGTC GAAGAAGGTA CA - #GACGGTTA 1740 - CAAAGATGAT TACACATTTA CTGTAGCTAA ATCTAAAGCA GAGCAACCAG GA - #GTTTACAC 1800 - ATCCTTTAAA CAGCTGGTAA CAGCCATGCA AAGCAATCTG TCTGGTGTCT AT - #ACATTGGC 1860 - TTCAGATATG ACCGCAGATG AGGTGAGCTT AGGCGATAAG CAGACAAGTT AT - #CTCACAGG 1920 - TGCATTTACA GGGAGCTTGA TCGGTTCTGA TGGAACAAAA TCGTATGCCA TT - #TATGATTT 1980 - GAAGAAACCA TTATTTGATA CATTAAATGG TGCTACAGTT AGAGATTTGG AT - #ATTAAAAC 2040 - TGTTTCTGCT GATAGTAAAG AAAATGTCGC AGCGCTGGCG AAGGCAGCGA AT - #AGCGCGAA 2100
- TATTAATAAT GTTGCAGTAG AAGGAAAAAT CTCAGGTGCG AAATCTGTTG CG - #GGATTAGT 2160 - AGCGAGCGCA ACAAATACAG TGATAGAAAA CAGCTCGTTT ACAGGGAAAC TT - #ATCGCAAA 2220 - TCACCAGGAC AGTAATAAAA ATGATACTGG AGGAATAGTA GGTAATATAA CA - #GGAAATAG 2280 - TTCGAGAGTT AATAAAGTTA GGGTAGATGC CTTAATCTCT ACTAATGCAC GC - #AATAATAA 2340 - CCAAACAGCT GGAGGGATAG TAGGTAGATT AGAAAATGGT GCATTGATAT CT - #AATTCGGT 2400 - TGCTACTGGA GAAATACGAA ATGGTCAAGG ATATTCTAGA GTCGGAGGAA TA - #GTAGGATC 2460 - TACGTGGCAA AACGGTCGAG TAAATAATGT TGTGAGTAAC GTAGATGTTG GA - #GATGGTTA 2520 - TGTTATCACC GGTGATCAAT ACGCAGCAGC AGATGTGAAA AATGCAAGTA CA - #TCAGTTGA 2580 - TAATAGAAAA GCAGACAGAT TCGCTACAAA ATTATCAAAA GACCAAATAG AC - #GCGAAAGT 2640 - TGCTGATTAT GGAATCACAG TAACTCTTGA TGATACTGGG CAAGATTTAA AA - #CGTAATCT 2700 - AAGAGAAGTT GATTATACAA GACTAAATAA AGCAGAAGCT GAAAGAAAAG TA - #GCTTATAG 2760 - CAACATAGAA AAACTGATGC CATTCTACAA TAAAGACCTA GTAGTTCACT AT - #GGTAACAA 2820 - AGTAGCGACA ACAGATAAAC TTTACACTAC AGAATTGTTA GATGTTGTGC CG - #ATGAAAGA 2880 - TGATGAAGTA GTAACGGATA TTAATAATAA GAAAAATTCA ATAAATAAAG TT - #ATGTTACA 2940 - TTTCAAAGAT AATACAGTAG AATACCTAGA TGTAACATTC AAAGAAAACT TC - #ATAAACAG 3000 - TCAAGTAATC GAATACAATG TTACAGGAAA AGAATATATA TTCACACCAG AA - #GCATTTGT 3060 - TTCAGACTAT ACAGCGATAA CGAATAACGT ACTAAGCGAC TTGCAAAATG TA - #ACACTTAA 3120 # 3121 - (2) INFORMATION FOR SEQ ID NO:118: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 1040 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:118: - Phe Asn Pro Thr Val Gly Thr Phe Leu Phe Th - #r Ala Gly Leu Ser Leu # 15 - Leu Val Leu Leu Val Ser Lys Arg Glu Asn Gl - #y Lys Lys Arg Leu Val # 30 - His Phe Leu Leu Leu Thr Ser Met Gly Val Gl - #n Leu Leu Pro Ala Ser # 45 - Ala Phe Gly Leu Thr Ser Gln Ile Leu Ser Al - #a Tyr Asn Ser Gln Leu # 60 - Ser Ile Gly Val Gly Glu His Leu Pro Glu Pr - #o Leu Lys Ile Glu Gly #80 - Tyr Gln Tyr Ile Gly Tyr Ile Lys Thr Lys Ly - #s Gln Asp Asn Thr Glu 
# 95 - Leu Ser Arg Thr Val Asp Gly Lys Tyr Ser Al - #a Gln Arg Asp Ser Gln # 110 - Pro Asn Ser Thr Lys Thr Ser Asp Val Val Hi - #s Ser Ala Asp Leu Glu # 125 - Trp Asn Gln Gly Gln Gly Lys Val Ser Leu Gl - #n Gly Glu Ala Ser Gly # 140 - Asp Asp Gly Leu Ser Glu Lys Ser Ser Ile Al - #a Ala Asp Asn Leu Ser 145 1 - #50 1 - #55 1 - #60 - Ser Asn Asp Ser Phe Ala Ser Gln Val Glu Gl - #n Asn Pro Asp His Lys # 175 - Gly Glu Ser Val Val Arg Pro Thr Val Pro Gl - #u Gln Gly Asn Pro Val # 190 - Ser Ala Thr Thr Val Gln Ser Ala Glu Glu Gl - #u Val Leu Ala Thr Thr # 205 - Asn Asp Arg Pro Glu Tyr Lys Leu Pro Leu Gl - #u Thr Lys Gly Thr Gln # 220 - Glu Pro Gly His Glu Gly Glu Ala Ala Val Ar - #g Glu Asp Leu Pro Val 225 2 - #30 2 - #35 2 - #40 - Tyr Thr Lys Pro Leu Glu Thr Lys Gly Thr Gl - #n Gly Pro Gly His Glu # 255 - Gly Glu Ala Ala Val Arg Glu Glu Glu Pro Al - #a Tyr Thr Glu Pro Leu # 270 - Ala Thr Lys Gly Thr Gln Glu Pro Gly His Gl - #u Gly Lys Ala Thr Val # 285 - Arg Glu Glu Thr Leu Glu Tyr Thr Glu Pro Va - #l Ala Thr Lys Gly Thr # 300 - Gln Glu Pro Glu His Glu Gly Glu Arg Xaa Va - #l Glu Glu Glu Leu Pro 305 3 - #10 3 - #15 3 - #20 - Ala Leu Glu Val Thr Thr Arg Asn Arg Thr Gl - #u Ile Gln Asn Ile Pro # 335 - Tyr Thr Thr Glu Glu Ile Gln Asp Pro Thr Le - #u Leu Lys Asn Arg Arg # 350 - Lys Ile Glu Arg Gln Gly Gln Ala Gly Thr Ar - #g Thr Ile Gln Tyr Glu # 365 - Asp Tyr Ile Val Asn Gly Asn Val Val Glu Th - #r Lys Glu Val Ser Arg # 380 - Thr Glu Val Ala Pro Val Asn Glu Val Val Ly - #s Val Gly Thr Leu Val 385 3 - #90 3 - #95 4 - #00 - Lys Val Lys Pro Thr Val Glu Ile Thr Asn Le - #u Thr Lys Val Glu Asn # 415 - Lys Lys Ser Ile Thr Val Ser Tyr Asn Leu Il - #e Asp Thr Thr Ser Ala # 430 - Tyr Val Ser Ala Lys Thr Gln Val Phe His Gl - #y Asp Lys Leu Val Lys # 445 - Glu Val Asp Ile Glu Asn Pro Ala Lys Glu Gl - #n Val Ile Ser Gly Leu # 460 - Asp Tyr Tyr Thr Pro Tyr Thr Val Lys Thr Hi - #s Leu Thr Tyr Asn Leu 465 4 - #70 4 - #75 4 - #80 - Gly Glu Asn Asn Glu Glu Asn Thr Glu Thr Se - #r Thr Gln 
Asp Phe Gln # 495 - Leu Glu Tyr Lys Lys Ile Glu Ile Lys Asp Il - #e Asp Ser Val Glu Leu # 510 - Tyr Gly Lys Glu Asn Asp Arg Tyr Arg Arg Ty - #r Leu Ser Leu Ser Glu # 525 - Ala Pro Thr Asp Thr Ala Lys Tyr Phe Val Ly - #s Val Lys Ser Asp Arg # 540 - Phe Lys Glu Met Tyr Leu Pro Val Lys Ser Il - #e Thr Glu Asn Thr Asp 545 5 - #50 5 - #55 5 - #60 - Gly Thr Tyr Lys Val Thr Val Ala Val Asp Gl - #n Leu Val Glu Glu Gly # 575 - Thr Asp Gly Tyr Lys Asp Asp Tyr Thr Phe Th - #r Val Ala Lys Ser Lys # 590 - Ala Glu Gln Pro Gly Val Tyr Thr Ser Phe Ly - #s Gln Leu Val Thr Ala # 605 - Met Gln Ser Asn Leu Ser Gly Val Tyr Thr Le - #u Ala Ser Asp Met Thr # 620 - Ala Asp Glu Val Ser Leu Gly Asp Lys Gln Th - #r Ser Tyr Leu Thr Gly 625 6 - #30 6 - #35 6 - #40 - Ala Phe Thr Gly Ser Leu Ile Gly Ser Asp Gl - #y Thr Lys Ser Tyr Ala # 655 - Ile Tyr Asp Leu Lys Lys Pro Leu Phe Asp Th - #r Leu Asn Gly Ala Thr # 670 - Val Arg Asp Leu Asp Ile Lys Thr Val Ser Al - #a Asp Ser Lys Glu Asn # 685 - Val Ala Ala Leu Ala Lys Ala Ala Asn Ser Al - #a Asn Ile Asn Asn Val # 700 - Ala Val Glu Gly Lys Ile Ser Gly Ala Lys Se - #r Val Ala Gly Leu Val 705 7 - #10 7 - #15 7 - #20 - Ala Ser Ala Thr Asn Thr Val Ile Glu Asn Se - #r Ser Phe Thr Gly Lys # 735 - Leu Ile Ala Asn His Gln Asp Ser Asn Lys As - #n Asp Thr Gly Gly Ile # 750 - Val Gly Asn Ile Thr Gly Asn Ser Ser Arg Va - #l Asn Lys Val Arg Val # 765 - Asp Ala Leu Ile Ser Thr Asn Ala Arg Asn As - #n Asn Gln Thr Ala Gly # 780 - Gly Ile Val Gly Arg Leu Glu Asn Gly Ala Le - #u Ile Ser Asn Ser Val 785 7 - #90 7 - #95 8 - #00 - Ala Thr Gly Glu Ile Arg Asn Gly Gln Gly Ty - #r Ser Arg Val Gly Gly # 815 - Ile Val Gly Ser Thr Trp Gln Asn Gly Arg Va - #l Asn Asn Val Val Ser # 830 - Asn Val Asp Val Gly Asp Gly Tyr Val Ile Th - #r Gly Asp Gln Tyr Ala # 845 - Ala Ala Asp Val Lys Asn Ala Ser Thr Ser Va - #l Asp Asn Arg Lys Ala # 860 - Asp Arg Phe Ala Thr Lys Leu Ser Lys Asp Gl - #n Ile Asp Ala Lys Val 865 8 - #70 8 - #75 8 - #80 - Ala Asp Tyr Gly Ile Thr Val Thr Leu Asp As - 
#p Thr Gly Gln Asp Leu # 895 - Lys Arg Asn Leu Arg Glu Val Asp Tyr Thr Ar - #g Leu Asn Lys Ala Glu # 910 - Ala Glu Arg Lys Val Ala Tyr Ser Asn Ile Gl - #u Lys Leu Met Pro Phe # 925 - Tyr Asn Lys Asp Leu Val Val His Tyr Gly As - #n Lys Val Ala Thr Thr # 940 - Asp Lys Leu Tyr Thr Thr Glu Leu Leu Asp Va - #l Val Pro Met Lys Asp 945 9 - #50 9 - #55 9 - #60 - Asp Glu Val Val Thr Asp Ile Asn Asn Lys Ly - #s Asn Ser Ile Asn Lys # 975 - Val Met Leu His Phe Lys Asp Asn Thr Val Gl - #u Tyr Leu Asp Val Thr # 990 - Phe Lys Glu Asn Phe Ile Asn Ser Gln Val Il - #e Glu Tyr Asn Val Thr # 10050 - Gly Lys Glu Tyr Ile Phe Thr Pro Glu Ala Ph - #e Val Ser Asp Tyr Thr # 10205 - Ala Ile Thr Asn Asn Val Leu Ser Asp Leu Gl - #n Asn Val Thr Leu Asn # 10401030 - # 1035 - (2) INFORMATION FOR SEQ ID NO: 119: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1567 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #119: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TTTTAACCCA ACTGTTGGTA CTTTCCTTTT TACTGCAGGA TTGAGCTTGT TA - #GTTTTATT 60 - GGTTTCTAAA AGGGAAAATG GAAAGAAACG ACTTGTTCAT TTTCTGCTGT TG - #ACTAGCAT 120 - GGGAGTTCAA TTGTTGCCGG CCAGTGCTTT TGGGTTGACC AGCCAGATTT TA - #TCTGCCTA 180 - TAATAGTCAG CTTTCTATCG GAGTCGGGGA ACATTTACCA GAGCCTCTGA AA - #ATCGAAGG 240 - TTATCAATAT ATTGGTTATA TCAAAACTAA GAAACAGGAT AATACAGAGC TT - #TCAAGGAC 300 - AGTTGATGGG AAATACTCTG CTCAAAGAGA TAGTCAACCA AACTCTACAA AA - #ACATCAGA 360 - TGTAGTTCAT TCAGCTGATT TAGAATGGAA CCAAGGACAG GGGAAGGTTA GT - #TTACAAGG 420 - TGAAGCATCA GGGGATGATG GACTTTCAGA AAAATCTTCT ATAGCAGCAG AC - #AATCTATC 480 - TTCTAATGAT TCATTCGCAA GTCAAGTTGA GCAGAATCCG GATCACAAAG GA - #GAATCTGT 540 - AGTTCGACCA ACAGTGCCAG AACAAGGAAA TCCTGTGTCT GCTACAACGG TG - #CAGAGTGC 600 - GGAAGAGGAA GTATTGGCGA CGACAAATGA TCGACCAGAG TATAAACTTC CA - #TTGGAAAC 660 - CAAAGGCACG CAAGAACCCG GTCATGAGGG TGAAGCCGCA GTCCGTGAAG AC - #TTACCAGT 720 - CTACACTAAG CCACTAGAAA CCAAAGGTAC ACAAGGACCC GGACATGAAG GT - #GAAGCTGC 780 - AGTTCGCGAG GAAGAACCAG CTTACACAGA ACCGTTAGCA 
ACGAAAGGCA CG - #CAAGAGCC 840 - AGGTCATGAG GGCAAAGCTA CAGTCCGCGA AGAGACTCTA GAGTACACGG AA - #CCGGTAGC 900 - GACAAAAGGC ACACAAGAAC CCGAACATGA GGGCGAaCGG sCAGTAGAAG AA - #GAACTTCC 960 - GGCTTTAGAG GTCACTACAC GAAATAGAAC GGAAATCCAG AATATTCCTT AT - #ACAACAGA 1020 - AGAAATTCAG GATCCAACAC TTCTGAAAAA TCGTCGTAAG ATTGAACGAC AA - #GGGCAAGC 1080 - AGGGACACGT ACAATTCAAT ATGAAGACTA CATCGTAAAT GGTAATGTCG TA - #GAAACTAA 1140 - AGAAGTGTCA CGAACTGAAG TAGCTCCGGT CAACGAAGTC GTTAAAGTAG GA - #ACACTTGT 1200 - GAAAGTTAAA CCTACAGTAG AAATTACAAA CTTAACAAAA GTTGAGAACA AA - #AAATCTAT 1260 - AACTGTAAGT TATAACTTAA TAGACACTAC CTCAGCATAT GTTTCTGCAA AA - #ACGCAAGT 1320 - TTTCCATGGA GACAAGCTAG TTAAAGAGGT GGATATAGAA AATCCTGCCA AA - #GAGCAAGT 1380 - AATATCAGGT TTAGATTACT ACACACCGTA TACAGTTAAA ACACACCTAA CT - #TATAATTT 1440 - GGGTGAAAAT AATGAGGAAA ATACTGAAAC ATCAACTCAA GATTTCCAAT TA - #GAGTATAA 1500 - GAAAATAGAG ATTAAAGATA TTGATTCAGT AGAATTATAC GGTAAAGAAA AT - #GATCGTTA 1560 # 1567 - (2) INFORMATION FOR SEQ ID NO:120: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 522 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:120: - Phe Asn Pro Thr Val Gly Thr Phe Leu Phe Th - #r Ala Gly Leu Ser Leu # 15 - Leu Val Leu Leu Val Ser Lys Arg Glu Asn Gl - #y Lys Lys Arg Leu Val
# 30 - His Phe Leu Leu Leu Thr Ser Met Gly Val Gl - #n Leu Leu Pro Ala Ser # 45 - Ala Phe Gly Leu Thr Ser Gln Ile Leu Ser Al - #a Tyr Asn Ser Gln Leu # 60 - Ser Ile Gly Val Gly Glu His Leu Pro Glu Pr - #o Leu Lys Ile Glu Gly #80 - Tyr Gln Tyr Ile Gly Tyr Ile Lys Thr Lys Ly - #s Gln Asp Asn Thr Glu # 95 - Leu Ser Arg Thr Val Asp Gly Lys Tyr Ser Al - #a Gln Arg Asp Ser Gln # 110 - Pro Asn Ser Thr Lys Thr Ser Asp Val Val Hi - #s Ser Ala Asp Leu Glu # 125 - Trp Asn Gln Gly Gln Gly Lys Val Ser Leu Gl - #n Gly Glu Ala Ser Gly # 140 - Asp Asp Gly Leu Ser Glu Lys Ser Ser Ile Al - #a Ala Asp Asn Leu Ser 145 1 - #50 1 - #55 1 - #60 - Ser Asn Asp Ser Phe Ala Ser Gln Val Glu Gl - #n Asn Pro Asp His Lys # 175 - Gly Glu Ser Val Val Arg Pro Thr Val Pro Gl - #u Gln Gly Asn Pro Val # 190 - Ser Ala Thr Thr Val Gln Ser Ala Glu Glu Gl - #u Val Leu Ala Thr Thr # 205 - Asn Asp Arg Pro Glu Tyr Lys Leu Pro Leu Gl - #u Thr Lys Gly Thr Gln # 220 - Glu Pro Gly His Glu Gly Glu Ala Ala Val Ar - #g Glu Asp Leu Pro Val 225 2 - #30 2 - #35 2 - #40 - Tyr Thr Lys Pro Leu Glu Thr Lys Gly Thr Gl - #n Gly Pro Gly His Glu # 255 - Gly Glu Ala Ala Val Arg Glu Glu Glu Pro Al - #a Tyr Thr Glu Pro Leu # 270 - Ala Thr Lys Gly Thr Gln Glu Pro Gly His Gl - #u Gly Lys Ala Thr Val # 285 - Arg Glu Glu Thr Leu Glu Tyr Thr Glu Pro Va - #l Ala Thr Lys Gly Thr # 300 - Gln Glu Pro Glu His Glu Gly Glu Arg Xaa Va - #l Glu Glu Glu Leu Pro 305 3 - #10 3 - #15 3 - #20 - Ala Leu Glu Val Thr Thr Arg Asn Arg Thr Gl - #u Ile Gln Asn Ile Pro # 335 - Tyr Thr Thr Glu Glu Ile Gln Asp Pro Thr Le - #u Leu Lys Asn Arg Arg # 350 - Lys Ile Glu Arg Gln Gly Gln Ala Gly Thr Ar - #g Thr Ile Gln Tyr Glu # 365 - Asp Tyr Ile Val Asn Gly Asn Val Val Glu Th - #r Lys Glu Val Ser Arg # 380 - Thr Glu Val Ala Pro Val Asn Glu Val Val Ly - #s Val Gly Thr Leu Val 385 3 - #90 3 - #95 4 - #00 - Lys Val Lys Pro Thr Val Glu Ile Thr Asn Le - #u Thr Lys Val Glu Asn # 415 - Lys Lys Ser Ile Thr Val Ser Tyr Asn Leu Il - #e Asp Thr Thr Ser Ala # 430 - Tyr Val 
Ser Ala Lys Thr Gln Val Phe His Gl - #y Asp Lys Leu Val Lys # 445 - Glu Val Asp Ile Glu Asn Pro Ala Lys Glu Gl - #n Val Ile Ser Gly Leu # 460 - Asp Tyr Tyr Thr Pro Tyr Thr Val Lys Thr Hi - #s Leu Thr Tyr Asn Leu 465 4 - #70 4 - #75 4 - #80 - Gly Glu Asn Asn Glu Glu Asn Thr Glu Thr Se - #r Thr Gln Asp Phe Gln # 495 - Leu Glu Tyr Lys Lys Ile Glu Ile Lys Asp Il - #e Asp Ser Val Glu Leu # 510 - Tyr Gly Lys Glu Asn Asp Arg Tyr Arg Arg # 520 - (2) INFORMATION FOR SEQ ID NO: 121: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1561 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #121: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TCGTAGATAT TTAAGTCTAA GTGAAGCGCC GACTGATACG GCTAAATACT TT - #GTAAAAGT 60 - GAAATCAGAT CGCTTCAAAG AAATGTACCT ACCTGTAAAA TCTATTACAG AA - #AATACGGA 120 - TGGAACGTAT AAAGTGACGG TAGCCGTTGA TCAACTTGTC GAAGAAGGTA CA - #GACGGTTA 180 - CAAAGATGAT TACACATTTA CTGTAGCTAA ATCTAAAGCA GAGCAACCAG GA - #GTTTACAC 240 - ATCCTTTAAA CAGCTGGTAA CAGCCATGCA AAGCAATCTG TCTGGTGTCT AT - #ACATTGGC 300 - TTCAGATATG ACCGCAGATG AGGTGAGCTT AGGCGATAAG CAGACAAGTT AT - #CTCACAGG 360 - TGCATTTACA GGGAGCTTGA TCGGTTCTGA TGGAACAAAA TCGTATGCCA TT - #TATGATTT 420 - GAAGAAACCA TTATTTGATA CATTAAATGG TGCTACAGTT AGAGATTTGG AT - #ATTAAAAC 480 - TGTTTCTGCT GATAGTAAAG AAAATGTCGC AGCGCTGGCG AAGGCAGCGA AT - #AGCGCGAA 540 - TATTAATAAT GTTGCAGTAG AAGGAAAAAT CTCAGGTGCG AAATCTGTTG CG - #GGATTAGT 600 - AGCGAGCGCA ACAAATACAG TGATAGAAAA CAGCTCGTTT ACAGGGAAAC TT - #ATCGCAAA 660 - TCACCAGGAC AGTAATAAAA ATGATACTGG AGGAATAGTA GGTAATATAA CA - #GGAAATAG 720 - TTCGAGAGTT AATAAAGTTA GGGTAGATGC CTTAATCTCT ACTAATGCAC GC - #AATAATAA 780 - CCAAACAGCT GGAGGGATAG TAGGTAGATT AGAAAATGGT GCATTGATAT CT - #AATTCGGT 840 - TGCTACTGGA GAAATACGAA ATGGTCAAGG ATATTCTAGA GTCGGAGGAA TA - #GTAGGATC 900 - TACGTGGCAA AACGGTCGAG TAAATAATGT TGTGAGTAAC GTAGATGTTG GA - #GATGGTTA 960 - TGTTATCACC GGTGATCAAT ACGCAGCAGC AGATGTGAAA AATGCAAGTA CA - #TCAGTTGA 1020 - TAATAGAAAA GCAGACAGAT TCGCTACAAA ATTATCAAAA 
GACCAAATAG AC - #GCGAAAGT 1080 - TGCTGATTAT GGAATCACAG TAACTCTTGA TGATACTGGG CAAGATTTAA AA - #CGTAATCT 1140 - AAGAGAAGTT GATTATACAA GACTAAATAA AGCAGAAGCT GAAAGAAAAG TA - #GCTTATAG 1200 - CAACATAGAA AAACTGATGC CATTCTACAA TAAAGACCTA GTAGTTCACT AT - #GGTAACAA 1260 - AGTAGCGACA ACAGATAAAC TTTACACTAC AGAATTGTTA GATGTTGTGC CG - #ATGAAAGA 1320 - TGATGAAGTA GTAACGGATA TTAATAATAA GAAAAATTCA ATAAATAAAG TT - #ATGTTACA 1380 - TTTCAAAGAT AATACAGTAG AATACCTAGA TGTAACATTC AAAGAAAACT TC - #ATAAACAG 1440 - TCAAGTAATC GAATACAATG TTACAGGAAA AGAATATATA TTCACACCAG AA - #GCATTTGT 1500 - TTCAGACTAT ACAGCGATAA CGAATAACGT ACTAAGCGAC TTGCAAAATG TA - #ACACTTAA 1560 # 1561 - (2) INFORMATION FOR SEQ ID NO:122: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 520 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:122: - Arg Arg Tyr Leu Ser Leu Ser Glu Ala Pro Th - #r Asp Thr Ala Lys Tyr # 15 - Phe Val Lys Val Lys Ser Asp Arg Phe Lys Gl - #u Met Tyr Leu Pro Val # 30 - Lys Ser Ile Thr Glu Asn Thr Asp Gly Thr Ty - #r Lys Val Thr Val Ala # 45 - Val Asp Gln Leu Val Glu Glu Gly Thr Asp Gl - #y Tyr Lys Asp Asp Tyr # 60 - Thr Phe Thr Val Ala Lys Ser Lys Ala Glu Gl - #n Pro Gly Val Tyr Thr #80 - Ser Phe Lys Gln Leu Val Thr Ala Met Gln Se - #r Asn Leu Ser Gly Val # 95 - Tyr Thr Leu Ala Ser Asp Met Thr Ala Asp Gl - #u Val Ser Leu Gly Asp # 110 - Lys Gln Thr Ser Tyr Leu Thr Gly Ala Phe Th - #r Gly Ser Leu Ile Gly # 125 - Ser Asp Gly Thr Lys Ser Tyr Ala Ile Tyr As - #p Leu Lys Lys Pro Leu # 140 - Phe Asp Thr Leu Asn Gly Ala Thr Val Arg As - #p Leu Asp Ile Lys Thr 145 1 - #50 1 - #55 1 - #60 - Val Ser Ala Asp Ser Lys Glu Asn Val Ala Al - #a Leu Ala Lys Ala Ala # 175 - Asn Ser Ala Asn Ile Asn Asn Val Ala Val Gl - #u Gly Lys Ile Ser Gly # 190 - Ala Lys Ser Val Ala Gly Leu Val Ala Ser Al - #a Thr Asn Thr Val Ile # 205 - Glu Asn Ser Ser Phe Thr Gly Lys Leu Ile Al - #a Asn His Gln Asp Ser # 220 - Asn Lys Asn Asp Thr Gly 
Gly Ile Val Gly As - #n Ile Thr Gly Asn Ser 225 2 - #30 2 - #35 2 - #40 - Ser Arg Val Asn Lys Val Arg Val Asp Ala Le - #u Ile Ser Thr Asn Ala # 255 - Arg Asn Asn Asn Gln Thr Ala Gly Gly Ile Va - #l Gly Arg Leu Glu Asn # 270 - Gly Ala Leu Ile Ser Asn Ser Val Ala Thr Gl - #y Glu Ile Arg Asn Gly # 285 - Gln Gly Tyr Ser Arg Val Gly Gly Ile Val Gl - #y Ser Thr Trp Gln Asn # 300 - Gly Arg Val Asn Asn Val Val Ser Asn Val As - #p Val Gly Asp Gly Tyr 305 3 - #10 3 - #15 3 - #20 - Val Ile Thr Gly Asp Gln Tyr Ala Ala Ala As - #p Val Lys Asn Ala Ser # 335 - Thr Ser Val Asp Asn Arg Lys Ala Asp Arg Ph - #e Ala Thr Lys Leu Ser # 350 - Lys Asp Gln Ile Asp Ala Lys Val Ala Asp Ty - #r Gly Ile Thr Val Thr # 365 - Leu Asp Asp Thr Gly Gln Asp Leu Lys Arg As - #n Leu Arg Glu Val Asp # 380 - Tyr Thr Arg Leu Asn Lys Ala Glu Ala Glu Ar - #g Lys Val Ala Tyr Ser 385 3 - #90 3 - #95 4 - #00 - Asn Ile Glu Lys Leu Met Pro Phe Tyr Asn Ly - #s Asp Leu Val Val His # 415 - Tyr Gly Asn Lys Val Ala Thr Thr Asp Lys Le - #u Tyr Thr Thr Glu Leu # 430 - Leu Asp Val Val Pro Met Lys Asp Asp Glu Va - #l Val Thr Asp Ile Asn # 445 - Asn Lys Lys Asn Ser Ile Asn Lys Val Met Le - #u His Phe Lys Asp Asn # 460 - Thr Val Glu Tyr Leu Asp Val Thr Phe Lys Gl - #u Asn Phe Ile Asn Ser 465 4 - #70 4 - #75 4 - #80 - Gln Val Ile Glu Tyr Asn Val Thr Gly Lys Gl - #u Tyr Ile Phe Thr Pro # 495 - Glu Ala Phe Val Ser Asp Tyr Thr Ala Ile Th - #r Asn Asn Val Leu Ser # 510 - Asp Leu Gln Asn Val Thr Leu Asn # 520 - (2) INFORMATION FOR SEQ ID NO: 123: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 850 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #123: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CTTTGGTTTT GAAGGAAGTA AGCGTGGACA ATTTGCTGTA GAAGGAATCA AT - #CAACTTCG 60 - TGAGCATGTA GACACTCTAT TGATTATCTC AAACAACAAT TTGCTTGAAA TT - #GTTGATAA 120 - GAAAACACCG CTTTTGGAGG CTCTTAGCGA AGCGGATAAC GTTCTTCGTC AA - #GGTGTTCA 180 - AGGGATTACC GATTTGATTA CCAATCCAGG ATTGATTAAC CTTGACTTTG CC - #GATGTGAA 240 - AACGGTAATG 
GCAAACAAAG GGAATGCTCT TATGGGTATT GGTATCGGTA GT - #GGAGAAGA 300 - ACGTGTGGTA GAAGCGGCAC GTAAGGCAAT CTATTCACCA CTTCTTGAAA CA - #ACTATTGA 360 - CGGTGCTGAG GATGTTATCG TCAACGTTAC TGGTGGTCTT GACTTAACCT TG - #ATTGAGGC 420 - AGAAGAGGCT TCACAAATTG TGAACCAGGC AGCAGGTCAA GGAGTGAACA TC - #TGGCTCGG 480 - TACTTCAATT GATGAAAGTA TGCGTGATGA AATTCGTGTA ACAGTTGTTG CA - #ACGGGTGT 540 - TCGTCAAGAC CGCGTAGAAA AGGTTGTGGC TCCACAAGCT AGATCTGCTA CT - #AACTACCG 600 - TGAGACAGTG AAACCAGCTC ATTCACATGG CTTTGATCGT CATTTTGATA TG - #GCAGAAAC 660 - AGTTGAATTG CCAAAACAAA ATCCACGTCG TTTGGAACCA ACTCAGGCAT CT - #GCTTTTGG 720 - TGATTGGGAT CTTCGCCGTG AATCGATTGT TCGTACAACA GATTCAGTCG TT - #TCTCCAGT 780 - CGAGCGCTTT GAAGCCCCAA TTTCACAAGA TGAAGATGAA TTGGATACAC CT - #CCATTTTT 840 # 850 - (2) INFORMATION FOR SEQ ID NO:124: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 283 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:124: - Phe Gly Phe Glu Gly Ser Lys Arg Gly Gln Ph - #e Ala Val Glu Gly Ile # 15 - Asn Gln Leu Arg Glu His Val Asp Thr Leu Le - #u Ile Ile Ser Asn Asn # 30
- Asn Leu Leu Glu Ile Val Asp Lys Lys Thr Pr - #o Leu Leu Glu Ala Leu # 45 - Ser Glu Ala Asp Asn Val Leu Arg Gln Gly Va - #l Gln Gly Ile Thr Asp # 60 - Leu Ile Thr Asn Pro Gly Leu Ile Asn Leu As - #p Phe Ala Asp Val Lys #80 - Thr Val Met Ala Asn Lys Gly Asn Ala Leu Me - #t Gly Ile Gly Ile Gly # 95 - Ser Gly Glu Glu Arg Val Val Glu Ala Ala Ar - #g Lys Ala Ile Tyr Ser # 110 - Pro Leu Leu Glu Thr Thr Ile Asp Gly Ala Gl - #u Asp Val Ile Val Asn # 125 - Val Thr Gly Gly Leu Asp Leu Thr Leu Ile Gl - #u Ala Glu Glu Ala Ser # 140 - Gln Ile Val Asn Gln Ala Ala Gly Gln Gly Va - #l Asn Ile Trp Leu Gly 145 1 - #50 1 - #55 1 - #60 - Thr Ser Ile Asp Glu Ser Met Arg Asp Glu Il - #e Arg Val Thr Val Val # 175 - Ala Thr Gly Val Arg Gln Asp Arg Val Glu Ly - #s Val Val Ala Pro Gln # 190 - Ala Arg Ser Ala Thr Asn Tyr Arg Glu Thr Va - #l Lys Pro Ala His Ser # 205 - His Gly Phe Asp Arg His Phe Asp Met Ala Gl - #u Thr Val Glu Leu Pro # 220 - Lys Gln Asn Pro Arg Arg Leu Glu Pro Thr Gl - #n Ala Ser Ala Phe Gly 225 2 - #30 2 - #35 2 - #40 - Asp Trp Asp Leu Arg Arg Glu Ser Ile Val Ar - #g Thr Thr Asp Ser Val # 255 - Val Ser Pro Val Glu Arg Phe Glu Ala Pro Il - #e Ser Gln Asp Glu Asp # 270 - Glu Leu Asp Thr Pro Pro Phe Phe Lys Asn Ar - #g # 280 - (2) INFORMATION FOR SEQ ID NO: 125: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1051 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #125: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CTACTACCTC TCGAGAGAAA GTGACCTAGA GGTGACCGTT TTTGACCATG AG - #CAAGGTCA 60 - AGCCACCAAG GCCGCAGCAG GAATTATCAG TCCTTGGTTT TCCAAACGCC GT - #AATAAAGC 120 - CTGGTACAAG ATGGCGCGCT TGGGGGCTGA TTTTTATGTG GATTTATTAG CT - #GATTTAGA 180 - GAAATCAGGA CAAGAAATCG ACTTTTACCA GCGTTCGGGA GTCTTTCTCT TG - #AAAAAGGA 240 - TGAATCCAAT TTGGAAGAAC TTTATCAACT GGCCCTCCAG CGCAGAGAAG AA - #TCTCCCTT 300 - GATAGGGCAA TTAGCCATTC TGAACCAAGC CTCAGCTAAT GAATTATTCC CT - #GGTTTGCA 360 - GGGATTTGAC CGCCTGCTCT ATGCTTCTGG TGGAGCGAGA GTAGATGGCC AA - #CTTTTAGT 420 - GACTCGTTTG 
CTGGAAGTCA GTCATGTCAA GCTGGTCAAA GAAAAAGTGA CT - #CTGACACC 480 - GTTAGCATCA GGCTACCAGA TTGGTGAAGA GGAGTTTGAG CAGGTTATTT TG - #GCGACGGG 540 - AGCTTGGTTG GGGGACATGT TAGAGCCTTT AGGTTATGAA GTGGATGTCC GT - #CCTCAAAA 600 - AGGACAACTA CGAGATTATC AGCTTGCCCA AGACATGGAA GATTACCCTG TT - #GTCATGCC 660 - AGAAGGGGAG TGGGATTTGA TTCCCTTTGC AGGTGGGAAA TTATCCTTAG GC - #GCTACCCA 720 - CGAAAATGAC ATGGGATTTG ATTTGACGGT AGATGAAACC TTGCTCCAAC AA - #ATGGAGGA 780 - GGCCACCTTG ACTCACTATC TGATTTTGGC TGAAGCTACT TCAAAATCTG AG - #CGTGTTGG 840 - AATCCGTGCC TACACCAGTG ATTTCTCTCC TTTCTTTGGG CAGGTGCCTG AC - #TTAACTGG 900 - TGTCTATGCA GCCAGTGGAC TAGGTTCATC AGGCCTCACA ACTGGTCCTA TC - #ATTGGTTA 960 - CCATCTAGCC CAACTGATCC AAGACAAGGA GTTGACCTTG GACCCTCTAA AT - #TACCCAAT 1020 # 1051 CGAG TAAAAAGCGA A - (2) INFORMATION FOR SEQ ID NO:126: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 350 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: - Tyr Tyr Leu Ser Arg Glu Ser Asp Leu Glu Va - #l Thr Val Phe Asp His # 15 - Glu Gln Gly Gln Ala Thr Lys Ala Ala Ala Gl - #y Ile Ile Ser Pro Trp # 30 - Phe Ser Lys Arg Arg Asn Lys Ala Trp Tyr Ly - #s Met Ala Arg Leu Gly # 45 - Ala Asp Phe Tyr Val Asp Leu Leu Ala Asp Le - #u Glu Lys Ser Gly Gln # 60 - Glu Ile Asp Phe Tyr Gln Arg Ser Gly Val Ph - #e Leu Leu Lys Lys Asp #80 - Glu Ser Asn Leu Glu Glu Leu Tyr Gln Leu Al - #a Leu Gln Arg Arg Glu # 95 - Glu Ser Pro Leu Ile Gly Gln Leu Ala Ile Le - #u Asn Gln Ala Ser Ala # 110 - Asn Glu Leu Phe Pro Gly Leu Gln Gly Phe As - #p Arg Leu Leu Tyr Ala # 125 - Ser Gly Gly Ala Arg Val Asp Gly Gln Leu Le - #u Val Thr Arg Leu Leu # 140 - Glu Val Ser His Val Lys Leu Val Lys Glu Ly - #s Val Thr Leu Thr Pro 145 1 - #50 1 - #55 1 - #60 - Leu Ala Ser Gly Tyr Gln Ile Gly Glu Glu Gl - #u Phe Glu Gln Val Ile # 175 - Leu Ala Thr Gly Ala Trp Leu Gly Asp Met Le - #u Glu Pro Leu Gly Tyr # 190 - Glu Val Asp Val Arg Pro Gln Lys Gly Gln Le - #u Arg Asp 
Tyr Gln Leu # 205 - Ala Gln Asp Met Glu Asp Tyr Pro Val Val Me - #t Pro Glu Gly Glu Trp # 220 - Asp Leu Ile Pro Phe Ala Gly Gly Lys Leu Se - #r Leu Gly Ala Thr His 225 2 - #30 2 - #35 2 - #40 - Glu Asn Asp Met Gly Phe Asp Leu Thr Val As - #p Glu Thr Leu Leu Gln # 255 - Gln Met Glu Glu Ala Thr Leu Thr His Tyr Le - #u Ile Leu Ala Glu Ala # 270 - Thr Ser Lys Ser Glu Arg Val Gly Ile Arg Al - #a Tyr Thr Ser Asp Phe # 285 - Ser Pro Phe Phe Gly Gln Val Pro Asp Leu Th - #r Gly Val Tyr Ala Ala # 300 - Ser Gly Leu Gly Ser Ser Gly Leu Thr Thr Gl - #y Pro Ile Ile Gly Tyr 305 3 - #10 3 - #15 3 - #20 - His Leu Ala Gln Leu Ile Gln Asp Lys Glu Le - #u Thr Leu Asp Pro Leu # 335 - Asn Tyr Pro Ile Glu Asn Tyr Val Lys Arg Va - #l Lys Ser Glu # 350 - (2) INFORMATION FOR SEQ ID NO: 127: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 352 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear - (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 127: - TAAGGTCAAA AGTCAGACCG CTAAGAAAGT GCTAGAAAAG ATTGGAGCTG AC - #TCGGTTAT 60 - CTCGCCAGAG TATGAAATGG GGCAGTCTCT AGCACAGACC ATTCTTTTCC AT - #AATAGTGT 120 - TGATGTCTTT CAGTTGGATA AAAATGTGTC TATCGTGGAG ATGAAAATTC CT - #CAGTCTTG 180 - GGCAGGTCAA AGTCTGAGTA AATTAGACCT CCGTGGCAAA TACAATCTGA AT - #ATTTTGGG 240 - TTTCCGAGAG CAGGAAAATT CCCCATTGGA TGTTGAATTT GGACCAGATG AC - #CTCTTGAA 300 - AGCAGATACC TATATTTTGG CAGTCATCAA CAACCAGTAT TTGGATACCC TA - # 352 - (2) INFORMATION FOR SEQ ID NO:128: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 117 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:128: - Lys Val Lys Ser Gln Thr Ala Lys Lys Val Le - #u Glu Lys Ile Gly Ala # 15 - Asp Ser Val Ile Ser Pro Glu Tyr Glu Met Gl - #y Gln Ser Leu Ala Gln # 30 - Thr Ile Leu Phe His Asn Ser Val Asp Val Ph - #e Gln Leu Asp Lys Asn # 45 - Val Ser Ile Val Glu Met Lys Ile Pro Gln Se - #r Trp Ala Gly Gln Ser # 60 - Leu Ser Lys Leu Asp Leu Arg Gly Lys Tyr As - #n Leu Asn 
Ile Leu Gly #80 - Phe Arg Glu Gln Glu Asn Ser Pro Leu Asp Va - #l Glu Phe Gly Pro Asp # 95 - Asp Leu Leu Lys Ala Asp Thr Tyr Ile Leu Al - #a Val Ile Asn Asn Gln # 110 - Tyr Leu Asp Thr Leu 115 - (2) INFORMATION FOR SEQ ID NO: 129: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 247 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear - (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - # 129: - TGACGGGTCT CAGGATCAGA CTCAGGAAAT CGCTGAGTGT TTAGCTAGCA AG - #TATCCTAA 60 - TATCGTTAGA GCCATCTATC AGGAAAATAA ATGCCATGGC GGTGCGGTCA AT - #CGTGGCTT 120 - GGTAGAGGCT TCTGGGCGCT ATTTTAAAGT AGTTGACAGT GATGACTGGG TG - #GATCCTCG 180 - TGCCTACTTG AAAATTCTTG AAACTTGCAG GAACTTGAGA GCAAAGGTCA AG - #AGGTGGAT 240 # 247 - (2) INFORMATION FOR SEQ ID NO:130: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 82 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:130: - Asp Gly Ser Gln Asp Gln Thr Gln Glu Ile Al - #a Glu Cys Leu Ala Ser # 15 - Lys Tyr Pro Asn Ile Val Arg Ala Ile Tyr Gl - #n Glu Asn Lys Cys His # 30 - Gly Gly Ala Val Asn Arg Gly Leu Val Glu Al - #a Ser Gly Arg Tyr Phe # 45 - Lys Val Val Asp Ser Asp Asp Trp Val Asp Pr - #o Arg Ala Tyr Leu Lys # 60 - Ile Leu Glu Thr Cys Arg Asn Leu Arg Ala Ly - #s Val Lys Arg Trp Met #80 - Ser Leu - (2) INFORMATION FOR SEQ ID NO: 131: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1744 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #131: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TAGAGGCTTT GCCAAATGGT GGGAAGGGCA CGAGCGTCGA AAAGAGGAAC GC - #TTTGTCAA 60 - ACAAGAAGAA AAAGCTCGCC AAAAGGCTGA GAAAGAGGCT AGATTAGAAC AA - #GAAGAGAC 120 - TGAAAAAGCC TTACTCGATT TGCCTCCTGT TGATATGGAA ACGGGTGAAA TT - #CTGACAGA 180 - GGAAGCTGTT CAAAATCTTC CACCTATTCC AGAAGAAAAG TGGGTGGAAC CA - #GAAATCAT 240 - CCTGCCTCAA GCTGAACTTA AATTCCCTGA ACAGGAAGAT GACTCAGATG AC - #GAAGATGT 300 - TCAGGTCGAT TTTTCAGCCA AAGAAGCCCT TGAATACAAA CTTCCAAGCT TA - 
#CAACTCTT 360 - TGCACCAGAT AAACCAAAAG ATCAGTCTAA AGAGAAGAAA ATTGTCAGAG AA - #AATATCAA 420 - AATCTTAGAA GCAACCTTTG CTAGCTTTGG TATTAAGGTA ACAGTTGAAC GG - #GCCGAAAT 480 - TGGGCCATCA GTGACCAAGT ATGAAGTCAA GCCGGCTGTT GGTGTAAGGG TC - #AACCGCAT 540 - TTCCAATCTA TCAGATGACC TCGCTCTAGC CTTGGCTGCC AAAGATGTCC GG - #ATTGAAGC 600 - ACCAATCCCT GGGAAATCCC TAATCGGAAT TGAAGTGCCC AACTCCGATA TT - #GCCACTGT 660 - ATCTTTCCGA GAACTATGGG AACAATCGCA AACGAAAGCA GAAAATTTCT TG - #GAAATTCC 720 - TTTAGGGAAG GCTGTTAATG GAACCGCAAG AGCTTTTGAC CTTTCTAAAA TG - #CCCCACTT 780 - GCTAGTTGCA GGTTCAACGG GTTCAGGGAA GTCAGTAGCA GTTAACGGCA TT - #ATTGCTAG 840 - CATTCTCATG AAGGCGAGAC CAGATCAAGT TAAATTTATG ATGGTCGATC CC - #AAGATGGT 900 - TGAGTTATCT GTTTACAATG ATATTCCCCA CCTCTTGATT CCAGTCGTGA CC - #AATCCACG 960 - CAAAGCCAGC AAGGCTCTGC AAAAGGTTGT GGATGAAATG GAAAACCGTT AT - #GAACTCTT 1020 - TGCCAAGGTG GGAGTTCGGA ATATTGCAGG TTTTAATGCC AAGGTAGAAG AG - #TTCAATTC 1080 - CCAGTCTGAG TACAAGCAAA TTCCGCTACC ATTCATTGTC GTGATTGTGG AT - #GAGTTGGC 1140 - TGACCTCATG ATGGTGGCCA GCAAGGAAGT GGAAGATGCT ATCATCCGTC TT - #GGGCAGAA

1200 - GGCGCGTGCT GCAGGTATCC ACATGATTCT TGCAACTCAG CGTCCATCTG TT - #GATGTCAT 1260 - CTCTGGTTTG ATTAAGGCCA ATGTTCCATC TCGTGTAGCA TTTGCGGTTT CA - #TCAGGAAC 1320 - AGACTCCCGT ACGATTTTGG ATGAAAATGG AGCAGAAAAA CTTCTTGGTC GA - #GGAGACAT 1380 - GCTCTTTAAA CCGATTGATG AAAATCATCC AGTTCGTCTC CAAGGCTCCT TT - #ATCTCGGA 1440 - TGACGATGTT GAGCGCATTG TGAACTTCAT CAAGACTCAG GCAGATGCAG AC - #TACGATGA 1500 - GAGTTTTGAT CCAGGTGAGG TTTCTGAAAA TGAAGGAGAA TTTTCGGATG GA - #GATGCTGG 1560 - TGGTGATCCG CTTTTTGAAG AAGCTAAGTC TTTGGTTATC GAAACACAGA AA - #GCCAGTGC 1620 - GTCTATGATT CAGCGTCGTT TATCAGTTGG ATTTAACCGT GCGACCCGTC TC - #ATGGAAGA 1680 - ACTGGAGATA GCAGGTGTCA TCGGTCCAGC TGAAGGTACC AAACCTCGAA AA - #GTGTTACA 1740 # 1744 - (2) INFORMATION FOR SEQ ID NO:132: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 581 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:132: - Arg Gly Phe Ala Lys Trp Trp Glu Gly His Gl - #u Arg Arg Lys Glu Glu # 15 - Arg Phe Val Lys Gln Glu Glu Lys Ala Arg Gl - #n Lys Ala Glu Lys Glu # 30 - Ala Arg Leu Glu Gln Glu Glu Thr Glu Lys Al - #a Leu Leu Asp Leu Pro # 45 - Pro Val Asp Met Glu Thr Gly Glu Ile Leu Th - #r Glu Glu Ala Val Gln # 60 - Asn Leu Pro Pro Ile Pro Glu Glu Lys Trp Va - #l Glu Pro Glu Ile Ile #80 - Leu Pro Gln Ala Glu Leu Lys Phe Pro Glu Gl - #n Glu Asp Asp Ser Asp # 95 - Asp Glu Asp Val Gln Val Asp Phe Ser Ala Ly - #s Glu Ala Leu Glu Tyr # 110 - Lys Leu Pro Ser Leu Gln Leu Phe Ala Pro As - #p Lys Pro Lys Asp Gln # 125 - Ser Lys Glu Lys Lys Ile Val Arg Glu Asn Il - #e Lys Ile Leu Glu Ala # 140 - Thr Phe Ala Ser Phe Gly Ile Lys Val Thr Va - #l Glu Arg Ala Glu Ile 145 1 - #50 1 - #55 1 - #60 - Gly Pro Ser Val Thr Lys Tyr Glu Val Lys Pr - #o Ala Val Gly Val Arg # 175 - Val Asn Arg Ile Ser Asn Leu Ser Asp Asp Le - #u Ala Leu Ala Leu Ala # 190 - Ala Lys Asp Val Arg Ile Glu Ala Pro Ile Pr - #o Gly Lys Ser Leu Ile # 205 - Gly Ile Glu Val Pro Asn Ser Asp Ile Ala Th - #r 
Val Ser Phe Arg Glu # 220 - Leu Trp Glu Gln Ser Gln Thr Lys Ala Glu As - #n Phe Leu Glu Ile Pro 225 2 - #30 2 - #35 2 - #40 - Leu Gly Lys Ala Val Asn Gly Thr Ala Arg Al - #a Phe Asp Leu Ser Lys # 255 - Met Pro His Leu Leu Val Ala Gly Ser Thr Gl - #y Ser Gly Lys Ser Val # 270 - Ala Val Asn Gly Ile Ile Ala Ser Ile Leu Me - #t Lys Ala Arg Pro Asp # 285 - Gln Val Lys Phe Met Met Val Asp Pro Lys Me - #t Val Glu Leu Ser Val

# 300 - Tyr Asn Asp Ile Pro His Leu Leu Ile Pro Va - #l Val Thr Asn Pro Arg 305 3 - #10 3 - #15 3 - #20 - Lys Ala Ser Lys Ala Leu Gln Lys Val Val As - #p Glu Met Glu Asn Arg # 335 - Tyr Glu Leu Phe Ala Lys Val Gly Val Arg As - #n Ile Ala Gly Phe Asn # 350 - Ala Lys Val Glu Glu Phe Asn Ser Gln Ser Gl - #u Tyr Lys Gln Ile Pro # 365 - Leu Pro Phe Ile Val Val Ile Val Asp Glu Le - #u Ala Asp Leu Met Met # 380 - Val Ala Ser Lys Glu Val Glu Asp Ala Ile Il - #e Arg Leu Gly Gln Lys 385 3 - #90 3 - #95 4 - #00 - Ala Arg Ala Ala Gly Ile His Met Ile Leu Al - #a Thr Gln Arg Pro Ser # 415 - Val Asp Val Ile Ser Gly Leu Ile Lys Ala As - #n Val Pro Ser Arg Val # 430 - Ala Phe Ala Val Ser Ser Gly Thr Asp Ser Ar - #g Thr Ile Leu Asp Glu # 445 - Asn Gly Ala Glu Lys Leu Leu Gly Arg Gly As - #p Met Leu Phe Lys Pro # 460 - Ile Asp Glu Asn His Pro Val Arg Leu Gln Gl - #y Ser Phe Ile Ser Asp 465 4 - #70 4 - #75 4 - #80 - Asp Asp Val Glu Arg Ile Val Asn Phe Ile Ly - #s Thr Gln Ala Asp Ala # 495 - Asp Tyr Asp Glu Ser Phe Asp Pro Gly Glu Va - #l Ser Glu Asn Glu Gly # 510 - Glu Phe Ser Asp Gly Asp Ala Gly Gly Asp Pr - #o Leu Phe Glu Glu Ala # 525 - Lys Ser Leu Val Ile Glu Thr Gln Lys Ala Se - #r Ala Ser Met Ile Gln # 540 - Arg Arg Leu Ser Val Gly Phe Asn Arg Ala Th - #r Arg Leu Met Glu Glu 545 5 - #50 5 - #55 5 - #60 - Leu Glu Ile Ala Gly Val Ile Gly Pro Ala Gl - #u Gly Thr Lys Pro Arg # 575 - Lys Val Leu Gln Gln 580 - (2) INFORMATION FOR SEQ ID NO: 133: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 829 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #133: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TCAAAAAGAG AAGGAAAACT TGGTTATTGC TGGGAAAATA GGTCCAGAAC CA - #GAAATTTT 60 - GGCCAATATG TATAAGTTGC TGATTGAAGA AAATACCAGC ATGACTGCGA CT - #GTTAAACC 120 - GAATTTTGGG AAGACAAGCT TCCTTTATGA AGCTCTGAAA AAAGGCGATA TT - #GACATCTA 180 - TCCTGAATTT ACTGGTACGG TGACTGAAAG TTTGCTTCAA CCATCACCCA AG - #GTGAGTCA 240 - TGAACCAGAA CAGGTTTATC AGGTGGCGCG TGATGGCATT GCTAAGCAGG AT - #CATCTAGC 
300 - CTATCTCAAA CCCATGTCTT ATCAAAACAC CTATGCTGTA GCTGTTCCGA AA - #AAGATTGC 360 - TCAAGAATAT GGCTTGAAGA CCATTTCAGA CTTGAAAAAA GTGGAAGGGC AG - #TTGAAGGC 420 - AGGTTTTACA CTCGAGTTTA ACGACCGTGA AGATGGAAAT AAGGGCTTGC AA - #TCAATGTA 480 - TGGTCTCAAT CTCAATGTAG CGACCATTGA GCCAGCCCTT CGCTATCAGG CT - #ATTCAGTC 540 - AGGGGATATT CAAATCACGG ATGCCTATTC GACTGATGCG GAATTGGAGC GT - #TATGATTT 600 - ACAGGTCTTG GAAGATGACA AGCAACTCTT CCCACCTTAT CAAGGGGCTC CA - #CTCATGAA 660 - AGAAGCTCTT CTCAAGAAAC ACCCAGAGTT GGAAAGAGTT CTTAATACAT TG - #GCTGGTAA 720 - GATTACAGAA AGCCAGATGA GCCAGCTCAA CTACCAAGTC GGTGTTGAAG GC - #AAGTCAGC 780 # 829GGAGT TTCTCCAAGA ACAAGGTTTG TTGAAGAAA - (2) INFORMATION FOR SEQ ID NO:134: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 276 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:134: - Gln Lys Glu Lys Glu Asn Leu Val Ile Ala Gl - #y Lys Ile Gly Pro Glu # 15 - Pro Glu Ile Leu Ala Asn Met Tyr Lys Leu Le - #u Ile Glu Glu Asn Thr # 30 - Ser Met Thr Ala Thr Val Lys Pro Asn Phe Gl - #y Lys Thr Ser Phe Leu # 45 - Tyr Glu Ala Leu Lys Lys Gly Asp Ile Asp Il - #e Tyr Pro Glu Phe Thr # 60 - Gly Thr Val Thr Glu Ser Leu Leu Gln Pro Se - #r Pro Lys Val Ser His #80 - Glu Pro Glu Gln Val Tyr Gln Val Ala Arg As - #p Gly Ile Ala Lys Gln # 95 - Asp His Leu Ala Tyr Leu Lys Pro Met Ser Ty - #r Gln Asn Thr Tyr Ala # 110 - Val Ala Val Pro Lys Lys Ile Ala Gln Glu Ty - #r Gly Leu Lys Thr Ile # 125 - Ser Asp Leu Lys Lys Val Glu Gly Gln Leu Ly - #s Ala Gly Phe Thr Leu # 140 - Glu Phe Asn Asp Arg Glu Asp Gly Asn Lys Gl - #y Leu Gln Ser Met Tyr 145 1 - #50 1 - #55 1 - #60 - Gly Leu Asn Leu Asn Val Ala Thr Ile Glu Pr - #o Ala Leu Arg Tyr Gln # 175 - Ala Ile Gln Ser Gly Asp Ile Gln Ile Thr As - #p Ala Tyr Ser Thr Asp # 190 - Ala Glu Leu Glu Arg Tyr Asp Leu Gln Val Le - #u Glu Asp Asp Lys Gln # 205 - Leu Phe Pro Pro Tyr Gln Gly Ala Pro Leu Me - #t Lys Glu Ala Leu Leu # 220 - Lys Lys His Pro Glu Leu 
Glu Arg Val Leu As - #n Thr Leu Ala Gly Lys 225 2 - #30 2 - #35 2 - #40 - Ile Thr Glu Ser Gln Met Ser Gln Leu Asn Ty - #r Gln Val Gly Val Glu # 255 - Gly Lys Ser Ala Lys Gln Val Ala Lys Glu Ph - #e Leu Gln Glu Gln Gly # 270 - Leu Leu Lys Lys 275 - (2) INFORMATION FOR SEQ ID NO: 135: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 712 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #135: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - ACGTTCTATT GAGGACCACT TTGATTCAAA CTTCGAATTG GAATATAACC TC - #AAAGAAAA 60 - AGGGAAAACA GATCTTTTGA AGCTAGTTGA TAAAACAACT GACATGCGTC TG - #CATTTTAT 120 - CCGCCAAACT CATCCACGCG GTCTCGGAGA TGCTGTTTTG CAAGCCAAGG CT - #TTCGTCGG 180 - AAATGAACCT TTTGTCGTTA TGCTTGGTGA TGACTTGATG GATATCACAG AC - #GAAAAGGC 240 - TGTTCCACTT ACCAAACAAC TCATGGATGA CTACGAGCGT ACCCACGCGT CT - #ACTATCGC 300 - TGTCATGCCA GTCCCTCATG ACGAAGTATC TGCTTACGGG GTTATTGCTC CG - #CAAGGCGA 360 - AGGAAAAGAT GGTCTTTACA GTGTTGAAAC CTTTGTTGAA AAACCAGCTC CA - #GAGGACGC 420 - TCCTAGCGAC CTTGCTATTA TCGGACGCTA CCTCCTCACG CCTGAAATTT TT - #GAGATTCT 480 - CGAAAAGCAA GCTCCAGGTG CAGGAAATGA AATTCAGCTG ACAGATGCAA TC - #GACACCCT 540 - CAATAAAACA CAACGTGTAT TTGCTCGTGA GTTCAAAGGG GCTCGTTACG AT - #GTCGGAGA 600 - CAAGTTTGGC TTCATGAAAA CATCCATCGA CTACGCCCTC AAACACCCAC AA - #GTCAAAGA 660 - TGATTTGAAG AATTACCTCA TCCAACTTGG AAAAGAATTG ACTGAGAAGG AA - # 712 - (2) INFORMATION FOR SEQ ID NO:136: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 237 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:136: - Arg Ser Ile Glu Asp His Phe Asp Ser Asn Ph - #e Glu Leu Glu Tyr Asn # 15 - Leu Lys Glu Lys Gly Lys Thr Asp Leu Leu Ly - #s Leu Val Asp Lys Thr # 30 - Thr Asp Met Arg Leu His Phe Ile Arg Gln Th - #r His Pro Arg Gly Leu # 45 - Gly Asp Ala Val Leu Gln Ala Lys Ala Phe Va - #l Gly Asn Glu Pro Phe # 60 - Val Val Met Leu Gly Asp Asp Leu Met Asp Il - #e Thr Asp Glu Lys Ala #80 - Val Pro Leu Thr Lys Gln 
Leu Met Asp Asp Ty - #r Glu Arg Thr His Ala # 95 - Ser Thr Ile Ala Val Met Pro Val Pro His As - #p Glu Val Ser Ala Tyr # 110 - Gly Val Ile Ala Pro Gln Gly Glu Gly Lys As - #p Gly Leu Tyr Ser Val # 125 - Glu Thr Phe Val Glu Lys Pro Ala Pro Glu As - #p Ala Pro Ser Asp Leu # 140 - Ala Ile Ile Gly Arg Tyr Leu Leu Thr Pro Gl - #u Ile Phe Glu Ile Leu 145 1 - #50 1 - #55 1 - #60 - Glu Lys Gln Ala Pro Gly Ala Gly Asn Glu Il - #e Gln Leu Thr Asp Ala # 175 - Ile Asp Thr Leu Asn Lys Thr Gln Arg Val Ph - #e Ala Arg Glu Phe Lys # 190 - Gly Ala Arg Tyr Asp Val Gly Asp Lys Phe Gl - #y Phe Met Lys Thr Ser # 205 - Ile Asp Tyr Ala Leu Lys His Pro Gln Val Ly - #s Asp Asp Leu Lys Asn # 220 - Tyr Leu Ile Gln Leu Gly Lys Glu Leu Thr Gl - #u Lys Glu 225 2 - #30 2 - #35 - (2) INFORMATION FOR SEQ ID NO: 137: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 502 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #137: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CGCTCAAAAT ACCAGAGGTG TTCAGCTAAT CGAGCACGTT TCTCCTCAAA TG - #TTGAAAGC 60 - CCAATTGGAG AGTGTCTTTT CTGATATTCC ACCTCAGGCT GTAAAAACTG GA - #ATGTTGGC 120 - TACTACTGAA ATCATGGAAA TCATCCAACC CTATCTTAAA AAACTGGATT GT - #CCCTATGT 180 - CCTTGATCCT GTTATGGTTG CTACAAGTGG AGATGCCTTG ATTGACTCAA AT - #GCTAGAGA 240 - CTATCTCAAA ACAAACTTAC TACCTCTAGC AACTATTATT ACGCCAAATC TT - #CCTGAAGC 300 - AGAAGAGATT GTTGGTTTTT CAATCCATGA CCCCGAAGAC ATGCAGCGTG CT - #GGTCGCCT 360 - GATTTTAAAA GAATTTGGTC CTCAGTCTGT GGTTATCAAA GGCGGACATC TC - #AAAGGTGG 420 - TGCTAAAGAT TTCCTCTTTA CCAAGAATGA ACAATTTGTC TGGGAAAGCC CA - #CGAATTCA 480 # 502GTA CT - (2) INFORMATION FOR SEQ ID NO:138: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 167 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:138: - Ala Gln Asn Thr Arg Gly Val Gln Leu Ile Gl - #u His Val Ser Pro Gln # 15 - Met Leu Lys Ala Gln Leu Glu Ser Val Phe Se - #r Asp Ile Pro Pro Gln # 30 - Ala Val Lys Thr Gly 
Met Leu Ala Thr Thr Gl - #u Ile Met Glu Ile Ile # 45 - Gln Pro Tyr Leu Lys Lys Leu Asp Cys Pro Ty - #r Val Leu Asp Pro Val # 60 - Met Val Ala Thr Ser Gly Asp Ala Leu Ile As - #p Ser Asn Ala Arg Asp #80 - Tyr Leu Lys Thr Asn Leu Leu Pro Leu Ala Th - #r Ile Ile Thr Pro Asn # 95 - Leu Pro Glu Ala Glu Glu Ile Val Gly Phe Se - #r Ile His Asp Pro Glu # 110 - Asp Met Gln Arg Ala Gly Arg Leu Ile Leu Ly - #s Glu Phe Gly Pro Gln # 125 - Ser Val Val Ile Lys Gly Gly His Leu Lys Gl - #y Gly Ala Lys Asp Phe # 140 - Leu Phe Thr Lys Asn Glu Gln Phe Val Trp Gl - #u Ser Pro Arg Ile Gln 145 1 - #50 1 - #55 1 - #60 - Thr Cys His Thr His Gly Thr 165 - (2) INFORMATION FOR SEQ ID NO: 139: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 805 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #139: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - AATTGTACAA TTAGAAAAAG ATAGCAAATC AGACAAAGAA CAAGTTGATA AA - #CTATTTGA 60

- ATCATTTGAT GCATCTTCAG ATGAATCTAT TTCTAAATTA AAAGAACTAT CT - #GAAACTTC 120 - ACTTAAAACC GATGCAGGTA AAGACTATCT TAATAACAAA GTCAAAGAAT CA - #TCTAAAGC 180 - AATTGTAGAT TTTCATTTGC AAAAAGGTTT GGCTTATGAT GTTAAAGATT CA - #GATGACAA 240 - ATTTAAAGAT AAAGCAACTC TTGAAACAAA TGTAAAAGAA ATTACAAAAC AA - #ATTGATTT 300 - TATCAAAAAA GTTGATGAAA CTTTTAAACA AGAGAATTTG GAAGAAACTC TT - #AAATCTCT 360 - AAATGATCTT GTTGATAAAT ATCAAAAACA AATCGAACTT TTGAAGAAAG AA - #GAAGAAAA 420 - AGCTGCTGAA AAAGCTGCTG AAAAAGCAAA GGAATCTTCT AGTCAAAGTA AT - #TCTTCTGG 480 - TAGTGCTTCT AATGAGTCTT ATAATGGATC TTCCAATTCA AATGTAGATT AT - #AGTTCATC 540 - TGAACAAACT AATGGATATT CAAATAATTA TGGCGGTCAA GATTATTCTG GT - #TCAGGAGA 600 - TAGTTCAACA AATGGTGGAT CATCAGAACA ATATTCATCT AGCAATTCAA AC - #AGCGGAGC 660 - AAATAATGTC TACAGATATA AAGGCACTGG TGCTGACGGC TATCAAAGAT AC - #TACTACAA 720 - AGATCATAAT AATGGAGATG TGTATGATGA CGATGGAAAT TACCTTGGGA AC - #TTTGGTGG 780 # 805 AGTC AACGC - (2) INFORMATION FOR SEQ ID NO:140: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 268 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:140: - Ile Val Gln Leu Glu Lys Asp Ser Lys Ser As - #p Lys Glu Gln Val Asp # 15 - Lys Leu Phe Glu Ser Phe Asp Ala Ser Ser As - #p Glu Ser Ile Ser Lys # 30 - Leu Lys Glu Leu Ser Glu Thr Ser Leu Lys Th - #r Asp Ala Gly Lys Asp # 45 - Tyr Leu Asn Asn Lys Val Lys Glu Ser Ser Ly - #s Ala Ile Val Asp Phe # 60 - His Leu Gln Lys Gly Leu Ala Tyr Asp Val Ly - #s Asp Ser Asp Asp Lys #80 - Phe Lys Asp Lys Ala Thr Leu Glu Thr Asn Va - #l Lys Glu Ile Thr Lys # 95 - Gln Ile Asp Phe Ile Lys Lys Val Asp Glu Th - #r Phe Lys Gln Glu Asn # 110 - Leu Glu Glu Thr Leu Lys Ser Leu Asn Asp Le - #u Val Asp Lys Tyr Gln # 125 - Lys Gln Ile Glu Leu Leu Lys Lys Glu Glu Gl - #u Lys Ala Ala Glu Lys # 140 - Ala Ala Glu Lys Ala Lys Glu Ser Ser Ser Gl - #n Ser Asn Ser Ser Gly 145 1 - #50 1 - #55 1 - #60 - Ser Ala Ser Asn Glu Ser Tyr Asn Gly Ser Se - #r Asn 
Ser Asn Val Asp # 175 - Tyr Ser Ser Ser Glu Gln Thr Asn Gly Tyr Se - #r Asn Asn Tyr Gly Gly # 190 - Gln Asp Tyr Ser Gly Ser Gly Asp Ser Ser Th - #r Asn Gly Gly Ser Ser # 205 - Glu Gln Tyr Ser Ser Ser Asn Ser Asn Ser Gl - #y Ala Asn Asn Val Tyr # 220 - Arg Tyr Lys Gly Thr Gly Ala Asp Gly Tyr Gl - #n Arg Tyr Tyr Tyr Lys 225 2 - #30 2 - #35 2 - #40 - Asp His Asn Asn Gly Asp Val Tyr Asp Asp As - #p Gly Asn Tyr Leu Gly # 255 - Asn Phe Gly Gly Gly Ile Ala Glu Pro Ser Gl - #n Arg # 265 - (2) INFORMATION FOR SEQ ID NO: 141: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 211 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #141: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TCTGACCAAG CAAAAAGAAG CAGTCAATGA CAAAGGAAAA GCAGCTGTTG TT - #AAGGTGGT 60 - GGAAAGCCAG GCAGAACTTT ATAGCTTAGA AAAGAATGAA GATGCTAGCC TA - #AGAAAGTT 120 - ACAAGCAGAT GGACGCATCA CGGAAGAACA GGCTAAAGCT TATAAAGAAT AC - #AATGATAA 180 # 211 CGTA AAGTCAATGA T - (2) INFORMATION FOR SEQ ID NO:142: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 70 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:142: - Leu Thr Lys Gln Lys Glu Ala Val Asn Asp Ly - #s Gly Lys Ala Ala Val # 15 - Val Lys Val Val Glu Ser Gln Ala Glu Leu Ty - #r Ser Leu Glu Lys Asn # 30 - Glu Asp Ala Ser Leu Arg Lys Leu Gln Ala As - #p Gly Arg Ile Thr Glu # 45 - Glu Gln Ala Lys Ala Tyr Lys Glu Tyr Asn As - #p Lys Asn Gly Gly Ala # 60 - Asn Arg Lys Val Asn Asp #70 - (2) INFORMATION FOR SEQ ID NO: 143: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 331 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #143: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GTCCGGCTCT GTCCAGTCCA CTTTTTCAGC GGTAGAGGAA CAGATTTTCT TT - #ATGGAGTT 60 - TGAAGAACTC TATCGGGAAA CCCAAAAACG CAGTGTAGCC AGTCAGCAAA AG - #ACTAGTCT 120 - GAACTTAGAT GGGCAGACGC TTAGCAATGG CAGTCAAAAG TTGCCAGTCC CT - #AAAGGAAT 180 - TCAGGCCCCA TCAGGCCAAA GTATTACATT TGACCGAGCT 
GGGGGCAATT CG - #TCCCTGGC 240 - TAAGGTTGAA TTTCAGACCA GTAAAGGAGC GATTCGCTAT CAATTATATC TA - #GGAAATGG 300 # 331 AAGG AAACAAAAAA T - (2) INFORMATION FOR SEQ ID NO:144: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 110 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:144: - Ser Gly Ser Val Gln Ser Thr Phe Ser Ala Va - #l Glu Glu Gln Ile Phe # 15 - Phe Met Glu Phe Glu Glu Leu Tyr Arg Glu Th - #r Gln Lys Arg Ser Val # 30 - Ala Ser Gln Gln Lys Thr Ser Leu Asn Leu As - #p Gly Gln Thr Leu Ser # 45 - Asn Gly Ser Gln Lys Leu Pro Val Pro Lys Gl - #y Ile Gln Ala Pro Ser # 60 - Gly Gln Ser Ile Thr Phe Asp Arg Ala Gly Gl - #y Asn Ser Ser Leu Ala #80 - Lys Val Glu Phe Gln Thr Ser Lys Gly Ala Il - #e Arg Tyr Gln Leu Tyr # 95 - Leu Gly Asn Gly Lys Ile Lys Arg Ile Lys Gl - #u Thr Lys Asn # 110 - (2) INFORMATION FOR SEQ ID NO: 145: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 196 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #145: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GGGACAAATT CAAAAAAATA GGCAAGAGGA AGCAAAAATC TTGCAAAAGG AA - #GAAGTCTT 60 - GAGGGTAGCT AAGATGGCCC TGCAGACGGG GCAAAATCAG GTAAGCATCA AC - #GGAGTTGA 120 - GATTCAGGTA TTTTCTAGTG AAAAAGGATT GGAGGTCTAC CATGGTTCAG AA - #CAGTTGTT 180 # 196 - (2) INFORMATION FOR SEQ ID NO:146: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 65 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:146: - Gly Gln Ile Gln Lys Asn Arg Gln Glu Glu Al - #a Lys Ile Leu Gln Lys # 15 - Glu Glu Val Leu Arg Val Ala Lys Met Ala Le - #u Gln Thr Gly Gln Asn # 30 - Gln Val Ser Ile Asn Gly Val Glu Ile Gln Va - #l Phe Ser Ser Glu Lys # 45 - Gly Leu Glu Val Tyr His Gly Ser Glu Gln Le - #u Leu Ala Ile Lys Glu # 60 - Pro 65 - (2) INFORMATION FOR SEQ ID NO: 147: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 319 base (B) TYPE: nucleic acid 
(C) STRANDEDNESS: double (D) TOPOLOGY: linear #147: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TCGCTACCAG CAACAAAGCG AGCAAAAGGA GTGGCTCTTG TTTGTGGACC AA - #CTTGAGGT 60 - AGAATTAGAC CGTTCGCAGT TCGAAAAAGT AGAAGGCAAT CGCCTATACA TG - #AAGCAAGA 120 - TGGCAAGGAC ATCGCCATCG GTAAGTCAAA GTCAGATGAT TTCCGTAAAA CG - #AATGCTCG 180 - TGGTCGAGGT TATCAGCCTA TGGTTTATGG ACTCAAATCT GTACGGATTA CA - #GAGGACAA 240 - TCAACTGGTT CGCTTTCATT TCCAGTTCCA AAAAGGCTTA GAAAGGGAGT TC - #ATCTATCG 300 #319 AGT - (2) INFORMATION FOR SEQ ID NO:148: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 106 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:148: - Arg Tyr Gln Gln Gln Ser Glu Gln Lys Glu Tr - #p Leu Leu Phe Val Asp # 15 - Gln Leu Glu Val Glu Leu Asp Arg Ser Gln Ph - #e Glu Lys Val Glu Gly # 30 - Asn Arg Leu Tyr Met Lys Gln Asp Gly Lys As - #p Ile Ala Ile Gly Lys # 45 - Ser Lys Ser Asp Asp Phe Arg Lys Thr Asn Al - #a Arg Gly Arg Gly Tyr # 60 - Gln Pro Met Val Tyr Gly Leu Lys Ser Val Ar - #g Ile Thr Glu Asp Asn #80 - Gln Leu Val Arg Phe His Phe Gln Phe Gln Ly - #s Gly Leu Glu Arg Glu # 95 - Phe Ile Tyr Arg Val Glu Lys Glu Lys Ser # 105 - (2) INFORMATION FOR SEQ ID NO: 149: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 322 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #149: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GAACCGACAA GTCGCCCACT ATCAAGACTA TGCTTTGAAT AAAGAAAAAT TG - #GTTGCTTT 60 - TGCTATGGCT AAACGAACCA AAGATAAGGT TGAGCAAGAA AGTGGGGAAC AG - #TTTTTTAA 120 - TCTAGGTCAG GTAAGCTATC AAAACAAGAA AACTGGCTTA GTGACGAGGG TT - #CGTACGGA 180 - TAAGAGCCAA TATGAGTTTC TGTTTCCTTC AGTCAAAATC AAAGAAGAGA AA - #AGAGATAA 240 - AAAGGAAGAG GTAGCGACCG ATTCAAGCGA AAAAGTGGAG AAGAAAAAAT CA - #GAAGAGAA 300 # 322ATT CA - (2) INFORMATION FOR SEQ ID NO:150: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 107 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE 
TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:150: - Asn Arg Gln Val Ala His Tyr Gln Asp Tyr Al - #a Leu Asn Lys Glu Lys # 15 - Leu Val Ala Phe Ala Met Ala Lys Arg Thr Ly - #s Asp Lys Val Glu Gln # 30 - Glu Ser Gly Glu Gln Phe Phe Asn Leu Gly Gl - #n Val Ser Tyr Gln Asn # 45 - Lys Lys Thr Gly Leu Val Thr Arg Val Arg Th - #r Asp Lys Ser Gln Tyr # 60 - Glu Phe Leu Phe Pro Ser Val Lys Ile Lys Gl - #u Glu Lys Arg Asp Lys #80 - Lys Glu Glu Val Ala Thr Asp Ser Ser Glu Ly - #s Val Glu Lys Lys Lys # 95 - Ser Glu Glu Lys Pro Glu Lys Lys Glu Asn Se - #r # 105 - (2) INFORMATION FOR SEQ ID NO: 151: - (i) SEQUENCE CHARACTERISTICS:

#pairs (A) LENGTH: 784 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #151: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GGTTGTCGGC TGGCAATATA TCCCGTTTCC ATCTAAAGGT AGTACAATTG GT - #CCTTACCC 60 - AAATGGTATC AGATTAGAAG GTTTTCCAAA GTCAGAGTGG TACTACTTCG AT - #AAAAATGG 120 - AGTGCTACAA GAGTTTGTTG GTTGGAAAAC ATTAGAGATT AAAACTAAAG AC - #AGTGTTGG 180 - AAGAAAGTAC GGGGAAAAAC GTGAAGATTC AGAAGATAAA GAAGAGAAGC GT - #TATTATAC 240 - GAACTATTAC TTTAATCAAA ATCATTCTTT AGAGACAGGT TGGCTTTATG AT - #CAGTCTAA 300 - CTGGTATTAT CTAGCTAAGA CGGAAATTAA TGGAGAAAAC TACCTTGGTG GT - #GAAAGACG 360 - TGCGGGGTGG ATAAACGATG ATTCGACTTG GTACTACCTA GATCCAACAA CT - #GGTATTAT 420 - GCAAACAGGT TGGCAATATC TAGGTAATAA GTGGTACTAC CTCCGTTCCT CA - #GGAGCAAT 480 - GGCCACTGGC TGGTATCAGG AAGGTACCAC TTGGTATTAT TTAGACCACC CA - #AATGGCGA 540 - TATGAAAACA GGTTGGCAAA ACCTTGGGAA CAAATGGTAC TATCTCCGTT CA - #TCAGGAGC 600 - TATGGCAACT GGTTGGTATC AAGATGGTTC AACTTGGTAC TACCTAAATG CA - #GGTAATGG 660 - AGACATGAAG ACAGGTTGGT TCCAGGTCAA TGGCAACTGG TACTATGCTT AT - #AGCTCAGG 720 - TGCTTTGGCA GTGAATACGA CCGTAGATGG CTATTCTGTC AACTATAATG GC - #GAATGGGT 780 # 784 - (2) INFORMATION FOR SEQ ID NO:152: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 261 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:152: - Val Val Gly Trp Gln Tyr Ile Pro Phe Pro Se - #r Lys Gly Ser Thr Ile # 15 - Gly Pro Tyr Pro Asn Gly Ile Arg Leu Glu Gl - #y Phe Pro Lys Ser Glu # 30 - Trp Tyr Tyr Phe Asp Lys Asn Gly Val Leu Gl - #n Glu Phe Val Gly Trp # 45 - Lys Thr Leu Glu Ile Lys Thr Lys Asp Ser Va - #l Gly Arg Lys Tyr Gly # 60 - Glu Lys Arg Glu Asp Ser Glu Asp Lys Glu Gl - #u Lys Arg Tyr Tyr Thr #80 - Asn Tyr Tyr Phe Asn Gln Asn His Ser Leu Gl - #u Thr Gly Trp Leu Tyr # 95 - Asp Gln Ser Asn Trp Tyr Tyr Leu Ala Lys Th - #r Glu Ile Asn Gly Glu # 110 - Asn Tyr Leu Gly Gly Glu Arg Arg Ala Gly Tr - #p Ile Asn Asp Asp Ser # 125 - Thr Trp Tyr Tyr Leu Asp 
Pro Thr Thr Gly Il - #e Met Gln Thr Gly Trp # 140 - Gln Tyr Leu Gly Asn Lys Trp Tyr Tyr Leu Ar - #g Ser Ser Gly Ala Met 145 1 - #50 1 - #55 1 - #60 - Ala Thr Gly Trp Tyr Gln Glu Gly Thr Thr Tr - #p Tyr Tyr Leu Asp His # 175 - Pro Asn Gly Asp Met Lys Thr Gly Trp Gln As - #n Leu Gly Asn Lys Trp # 190 - Tyr Tyr Leu Arg Ser Ser Gly Ala Met Ala Th - #r Gly Trp Tyr Gln Asp # 205 - Gly Ser Thr Trp Tyr Tyr Leu Asn Ala Gly As - #n Gly Asp Met Lys Thr # 220 - Gly Trp Phe Gln Val Asn Gly Asn Trp Tyr Ty - #r Ala Tyr Ser Ser Gly 225 2 - #30 2 - #35 2 - #40 - Ala Leu Ala Val Asn Thr Thr Val Asp Gly Ty - #r Ser Val Asn Tyr Asn # 255 - Gly Glu Trp Val Arg 260 - (2) INFORMATION FOR SEQ ID NO: 153: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1708 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #153: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GGCCAAATCA GAATGGGTAG AAGACAAGGG AGCCTTTTAT TATCTTGACC AA - #GATGGAAA 60 - GATGAAAAGA AATGCTTGGG TAGGAACTTC CTATGTTGGT GCAACAGGTG CC - #AAAGTAAT 120 - AGAAGACTGG GTCTATGATT CTCAATACGA TGCTTGGTTT TATATCAAAG CA - #GATGGACA 180 - GCACGCAGAG AAAGAATGGC TCCAAATTAA AGGGAAGGAC TATTATTTCA AA - #TCCGGTGG 240 - TTATCTACTG ACAAGTCAGT GGATTAATCA AGCTTATGTG AATGCTAGTG GT - #GCCAAAGT 300 - ACAGCAAGGT TGGCTTTTTG ACAAACAATA CCAATCTTGG TTTTACATCA AA - #GAAAATGG 360 - AAACTATGCT GATAAAGAAT GGATTTTCGA GAATGGTCAC TATTATTATC TA - #AAATCCGG 420 - TGGCTACATG GCAGCCAATG AATGGATTTG GGATAAGGAA TCTTGGTTTT AT - #CTCAAATT 480 - TGATGGGAAA ATGGCTGAAA AAGAATGGGT CTACGATTCT CATAGTCAAG CT - #TGGTACTA 540 - CTTCAAATCC GGTGGTTACA TGACAGCCAA TGAATGGATT TGGGATAAGG AA - #TCTTGGTT 600 - TTATCTCAAA TCTGATGGGA AAATAGCTGA AAAAGAATGG GTCTACGATT CT - #CATAGTCA 660 - AGCTTGGTAC TACTTCAAAT CCGGTGGTTA CATGACAGCC AATGAATGGA TT - #TGGGATAA 720 - GGAATCTTGG TTTTACCTCA AATCTGATGG GAAAATAGCT GAAAAAGAAT GG - #GTCTACGA 780 - TTCTCATAGT CAAGCTTGGT ACTACTTCAA ATCTGGTGGC TACATGGCGA AA - #AATGAGAC 840 - AGTAGATGGT TATCAGCTTG GAAGCGATGG TAAATGGCTT GGAGGAAAAA CT - #ACAAATGA 
900 - AAATGCTGCT TACTATCAAG TAGTGCCTGT TACAGCCAAT GTTTATGATT CA - #GATGGTGA 960 - AAAGCTTTCC TATATATCGC AAGGTAGTGT CGTATGGCTA GATAAGGATA GA - #AAAAGTGA 1020 - TGACAAGCGC TTGGCTATTA CTATTTCTGG TTTGTCAGGC TATATGAAAA CA - #GAAGATTT 1080 - ACAAGCGCTA GATGCTAGTA AGGACTTTAT CCCTTATTAT GAGAGTGATG GC - #CACCGTTT 1140 - TTATCACTAT GTGGCTCAGA ATGCTAGTAT CCCAGTAGCT TCTCATCTTT CT - #GATATGGA 1200 - AGTAGGCAAG AAATATTATT CGGCAGATGG CCTGCATTTT GATGGTTTTA AG - #CTTGAGAA 1260 - TCCCTTCCTT TTCAAAGATT TAACAGAGGC TACAAACTAC AGTGCTGAAG AA - #TTGGATAA 1320 - GGTATTTAGT TTGCTAAACA TTAACAATAG CCTTTTGGAG AACAAGGGCG CT - #ACTTTTAA 1380 - GGAAGCCGAA GAACATTACC ATATCAATGC TCTTTATCTC CTTGCCCATA GT - #GCCCTAGA 1440 - AAGTAACTGG GGAAGAAGTA AAATTGCCAA AGATAAGAAT AATTTCTTTG GC - #ATTACAGC 1500 - CTATGATACG ACCCCTTACC TTTCTGCTAA GACATTTGAT GATGTGGATA AG - #GGAATTTT 1560 - AGGTGCAACC AAGTGGATTA AGGAAAATTA TATCGATAGG GGAAGAACTT TC - #CTTGGAAA 1620 - CAAGGCTTCT GGTATGAATG TGGAATATGC TTCAGACCCT TATTGGGGCG AA - #AAAATTGC 1680 # 1708 ATCA ATGAGAAG - (2) INFORMATION FOR SEQ ID NO:154: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 569 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:154: - Ala Lys Ser Glu Trp Val Glu Asp Lys Gly Al - #a Phe Tyr Tyr Leu Asp # 15 - Gln Asp Gly Lys Met Lys Arg Asn Ala Trp Va - #l Gly Thr Ser Tyr Val # 30 - Gly Ala Thr Gly Ala Lys Val Ile Glu Asp Tr - #p Val Tyr Asp Ser Gln # 45 - Tyr Asp Ala Trp Phe Tyr Ile Lys Ala Asp Gl - #y Gln His Ala Glu Lys # 60 - Glu Trp Leu Gln Ile Lys Gly Lys Asp Tyr Ty - #r Phe Lys Ser Gly Gly #80 - Tyr Leu Leu Thr Ser Gln Trp Ile Asn Gln Al - #a Tyr Val Asn Ala Ser # 95 - Gly Ala Lys Val Gln Gln Gly Trp Leu Phe As - #p Lys Gln Tyr Gln Ser # 110 - Trp Phe Tyr Ile Lys Glu Asn Gly Asn Tyr Al - #a Asp Lys Glu Trp Ile # 125 - Phe Glu Asn Gly His Tyr Tyr Tyr Leu Lys Se - #r Gly Gly Tyr Met Ala # 140 - Ala Asn Glu Trp Ile Trp Asp Lys Glu Ser Tr - #p Phe Tyr 
Leu Lys Phe 145 1 - #50 1 - #55 1 - #60 - Asp Gly Lys Met Ala Glu Lys Glu Trp Val Ty - #r Asp Ser His Ser Gln # 175 - Ala Trp Tyr Tyr Phe Lys Ser Gly Gly Tyr Me - #t Thr Ala Asn Glu Trp # 190 - Ile Trp Asp Lys Glu Ser Trp Phe Tyr Leu Ly - #s Ser Asp Gly Lys Ile # 205 - Ala Glu Lys Glu Trp Val Tyr Asp Ser His Se - #r Gln Ala Trp Tyr Tyr # 220 - Phe Lys Ser Gly Gly Tyr Met Thr Ala Asn Gl - #u Trp Ile Trp Asp Lys 225 2 - #30 2 - #35 2 - #40 - Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gly Ly - #s Ile Ala Glu Lys Glu # 255 - Trp Val Tyr Asp Ser His Ser Gln Ala Trp Ty - #r Tyr Phe Lys Ser Gly # 270 - Gly Tyr Met Ala Lys Asn Glu Thr Val Asp Gl - #y Tyr Gln Leu Gly Ser # 285 - Asp Gly Lys Trp Leu Gly Gly Lys Thr Thr As - #n Glu Asn Ala Ala Tyr # 300 - Tyr Gln Val Val Pro Val Thr Ala Asn Val Ty - #r Asp Ser Asp Gly Glu 305 3 - #10 3 - #15 3 - #20 - Lys Leu Ser Tyr Ile Ser Gln Gly Ser Val Va - #l Trp Leu Asp Lys Asp # 335 - Arg Lys Ser Asp Asp Lys Arg Leu Ala Ile Th - #r Ile Ser Gly Leu Ser # 350 - Gly Tyr Met Lys Thr Glu Asp Leu Gln Ala Le - #u Asp Ala Ser Lys Asp # 365 - Phe Ile Pro Tyr Tyr Glu Ser Asp Gly His Ar - #g Phe Tyr His Tyr Val # 380 - Ala Gln Asn Ala Ser Ile Pro Val Ala Ser Hi - #s Leu Ser Asp Met Glu 385 3 - #90 3 - #95 4 - #00 - Val Gly Lys Lys Tyr Tyr Ser Ala Asp Gly Le - #u His Phe Asp Gly Phe # 415 - Lys Leu Glu Asn Pro Phe Leu Phe Lys Asp Le - #u Thr Glu Ala Thr Asn # 430 - Tyr Ser Ala Glu Glu Leu Asp Lys Val Phe Se - #r Leu Leu Asn Ile Asn # 445 - Asn Ser Leu Leu Glu Asn Lys Gly Ala Thr Ph - #e Lys Glu Ala Glu Glu # 460 - His Tyr His Ile Asn Ala Leu Tyr Leu Leu Al - #a His Ser Ala Leu Glu 465 4 - #70 4 - #75 4 - #80 - Ser Asn Trp Gly Arg Ser Lys Ile Ala Lys As - #p Lys Asn Asn Phe Phe # 495 - Gly Ile Thr Ala Tyr Asp Thr Thr Pro Tyr Le - #u Ser Ala Lys Thr Phe # 510 - Asp Asp Val Asp Lys Gly Ile Leu Gly Ala Th - #r Lys Trp Ile Lys Glu # 525 - Asn Tyr Ile Asp Arg Gly Arg Thr Phe Leu Gl - #y Asn Lys Ala Ser Gly # 540 - Met Asn Val Glu Tyr Ala Ser Asp Pro Tyr Tr - 
#p Gly Glu Lys Ile Ala 545 5 - #50 5 - #55 5 - #60 - Ser Val Met Met Lys Ile Asn Glu Lys 565 - (2) INFORMATION FOR SEQ ID NO: 155: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 946 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #155: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - ATTTGCAGAT GATTCTGAAG GATGGCAGTT TGTCCAAGAA AATGGTAGAA CC - #TACTACAA 60 - AAAGGGGGAT CTAAAAGAAA CCTACTGGAG AGTGATAGAT GGGAAGTACT AT - #TATTTTGA 120 - TCCTTTATCC GGAGAGATGG TTGTCGGCTG GCAATATATA CCTGCTCCAC AC - #AAGGGGGT 180 - TACGATTGGT CCTTCTCCAA GAATAGAGAT TGCTCTTAGA CCAGATTGGT TT - #TATTTTGG 240 - TCAAGATGGT GTATTACAAG AATTTGTTGG CAAGCAAGTT TTAGAAGCAA AA - #ACTGCTAC 300 - GAATACCAAC AAACATCATG GGGAAGAATA TGATAGCCAA GCAGAGAAAC GA - #GTCTATTA 360 - TTTTGAAGAT CAGCGTAGTT ATCATACTTT AAAAACTGGT TGGATTTATG AA - #GAGGGTCA 420 - TTGGTATTAT TTACAGAAGG ATGGTGGCTT TGATTCGCGC ATCAACAGAT TG - #ACGGTTGG 480 - AGAGCTAGCA CGTGGTTGGG TTAAGGATTA CCCTCTTACG TATGATGAAG AG - #AAGCTAAA 540 - AGCAGCTCCA TGGTACTATC TAAATCCAGC AACTGGCATT ATGCAAACAG GT - #TGGCAATA 600 - TCTAGGTAAT AGATGGTACT ACCTCCATTC GTCAGGAGCT ATGGCAACTG GC - #TGGTATAA 660
- GGAAGGCTCA ACTTGGTACT ATCTAGATGC TGAAAATGGT GATATGAGAA CT - #GGCTGGCA 720 - AAACCTTGGG AACAAATGGT ACTATCTCCG TTCATCAGGA GCTATGGCAA CT - #GGTTGGTA 780 - TCAGGAAAGT TCGACTTGGT ACTATCTAAA TGCAAGTAAT GGAGATATGA AA - #ACAGGCTG 840 - GTTCCAAGTC AATGGTAACT GGTACTATGC CTATGATTCA GGTGCTTTAG CT - #GTTAATAC 900 # 946ACT TAAACTATAA TGGTGAATGG GTTAAG - (2) INFORMATION FOR SEQ ID NO:156: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 316 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:156: - Val Phe Ala Asp Asp Ser Glu Gly Trp Gln Ph - #e Val Gln Glu Asn Gly # 15 - Arg Thr Tyr Tyr Lys Lys Gly Asp Leu Lys Gl - #u Thr Tyr Trp Arg Val # 30 - Ile Asp Gly Lys Tyr Tyr Tyr Phe Asp Pro Le - #u Ser Gly Glu Met Val # 45 - Val Gly Trp Gln Tyr Ile Pro Ala Pro His Ly - #s Gly Val Thr Ile Gly # 60 - Pro Ser Pro Arg Ile Glu Ile Ala Leu Arg Pr - #o Asp Trp Phe Tyr Phe #80 - Gly Gln Asp Gly Val Leu Gln Glu Phe Val Gl - #y Lys Gln Val Leu Glu # 95 - Ala Lys Thr Ala Thr Asn Thr Asn Lys His Hi - #s Gly Glu Glu Tyr Asp # 110 - Ser Gln Ala Glu Lys Arg Val Tyr Tyr Phe Gl - #u Asp Gln Arg Ser Tyr # 125 - His Thr Leu Lys Thr Gly Trp Ile Tyr Glu Gl - #u Gly His Trp Tyr Tyr # 140 - Leu Gln Lys Asp Gly Gly Phe Asp Ser Arg Il - #e Asn Arg Leu Thr Val 145 1 - #50 1 - #55 1 - #60 - Gly Glu Leu Ala Arg Gly Trp Val Lys Asp Ty - #r Pro Leu Thr Tyr Asp # 175 - Glu Glu Lys Leu Lys Ala Ala Pro Trp Tyr Ty - #r Leu Asn Pro Ala Thr # 190 - Gly Ile Met Gln Thr Gly Trp Gln Tyr Leu Gl - #y Asn Arg Trp Tyr Tyr # 205 - Leu His Ser Ser Gly Ala Met Ala Thr Gly Tr - #p Tyr Lys Glu Gly Ser # 220 - Thr Trp Tyr Tyr Leu Asp Ala Glu Asn Gly As - #p Met Arg Thr Gly Trp 225 2 - #30 2 - #35 2 - #40 - Gln Asn Leu Gly Asn Lys Trp Tyr Tyr Leu Ar - #g Ser Ser Gly Ala Met # 255 - Ala Thr Gly Trp Tyr Gln Glu Ser Ser Thr Tr - #p Tyr Tyr Leu Asn Ala # 270 - Ser Asn Gly Asp Met Lys Thr Gly Trp Phe Gl - #n Val Asn Gly Asn Trp # 285 - Tyr Tyr Ala 
Tyr Asp Ser Gly Ala Leu Ala Va - #l Asn Thr Thr Val Gly # 300 - Gly Tyr Tyr Leu Asn Tyr Asn Gly Glu Trp Va - #l Lys 305 3 - #10 3 - #15 - (2) INFORMATION FOR SEQ ID NO: 157: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1415 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #157: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGTCGCTGCA AATGAAACTG AAGTAGCAAA AACTTCGCAG GATACAACGA CA - #GCTTCAAG
60 - TAGTTCAGAG CAAAATCAGT CTTCTAATAA AACGCAAACG AGCGCAGAAG TA - #CAGACTAA 120 - TGCTGCTGCC CACTGGGATG GGGATTATTA TGTAAAGGAT GATGGTTCTA AA - #GCTCAAAG 180 - TGAATGGATT TTTGACAACT ACTATAAGGC TTGGTTTTAT ATTAATTCAG AT - #GGTCGTTA 240 - CTCGCAGAAT GAATGGCATG GAAATTACTA CCTGAAATCA GGTGGATATA TG - #GCCCAAAA 300 - CGAGTGGATC TATGACAGTA ATTACAAGAG TTGGTTTTAT CTCAAGTCAG AT - #GGGGCTTA 360 - TGCTCATCAA GAATGGCAAT TGATTGGAAA TAAGTGGTAC TACTTCAAGA AG - #TGGGGTTA 420 - CATGGCTAAA AGCCAATGGC AAGGAAGTTA TTTCTTGAAT GGTCAAGGAG CT - #ATGATGCA 480 - AAATGAATGG CTSCTATGAT CCAGCCTATT CTGCTTATTT TTATCTAAAA TC - #CGATGGAA 540 - CTTATGCTAA CCAAGAGTGG CAAAAAGTGG GCGGCAAATG GTACTATTTC AA - #GAAGTGGG 600 - GCTATATGGC TCGGAATGAG TGGCAAGGCA ACTACTATTT GACTGGAAGT GG - #TGCCATGG 660 - CGACTGACGA AGTGATTATG GATGGTACTC GCTATATCTT TGCGGCCTCT GG - #TGAGCTCA 720 - AAGAAAAAAA AGATTTGAAT GTCGGCTGGG TTCACAGAGA TGGTAAGCGC TA - #TTTCTTTA 780 - ATAATAGAGA AGAACAAGTG GGAACCGAAC ATGCTAAGAA AGTCATTGAT AT - #TAGTGAGC 840 - ACAATGGTCG TATCAATGAT TGGAAAAAGG TTATTGATGA GAACGAAGTG GA - #TGGTGTCA 900 - TTGTTCGTCT AGGTTATAGC GGTAAAGAAG ACAAGGAATT GGCGCATAAC AT - #TAAGGAGT 960 - TAAACCGTCT GGGAATTCCT TATGGTGTCT ATCTCTATAC CTATGCTGAA AA - #TGAGACCG 1020 - ATGCTGAGAG TGACGCTAAA CAGACCATTG AACTTATAAA GAAATACAAT AT - #GAACCTGT 1080 - CTTACCCTAT CTATTATGAT GTTGAGAATT GGGAATATGT AAATAAGAGC AA - #GAGAGCTC 1140 - CAAGTGATAC AGGCACTTGG GTTAAAATCA TCAACAAGTA CATGGACACG AT - #GAAGCAGG 1200 - CGGGTTATCA AAATGTGTAT GTCTATAGCT ATCGTAGTTT ATTACAGACG CG - #TTTAAAAC 1260 - ACCCAGATAT TTTAAAACAT GTAAACTGGG TAGCGGCCTA TACGAATGCT TT - #AGAATGGG 1320 - AAAACCCTCA TTATTCAGGA AAAAAAGGTT GGCAATATAC CTCTTCTGAA TA - #CATGAAAG 1380 # 1415 AGAT GTCAGCGTTT GGTAT - (2) INFORMATION FOR SEQ ID NO:158: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 471 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:158: - Val Ala Ala Asn Glu Thr Glu Val Ala Lys Th - #r 
Ser Gln Asp Thr Thr # 15 - Thr Ala Ser Ser Ser Ser Glu Gln Asn Gln Se - #r Ser Asn Lys Thr Gln # 30 - Thr Ser Ala Glu Val Gln Thr Asn Ala Ala Al - #a His Trp Asp Gly Asp # 45 - Tyr Tyr Val Lys Asp Asp Gly Ser Lys Ala Gl - #n Ser Glu Trp Ile Phe # 60 - Asp Asn Tyr Tyr Lys Ala Trp Phe Tyr Ile As - #n Ser Asp Gly Arg Tyr #80 - Ser Gln Asn Glu Trp His Gly Asn Tyr Tyr Le - #u Lys Ser Gly Gly Tyr # 95 - Met Ala Gln Asn Glu Trp Ile Tyr Asp Ser As - #n Tyr Lys Ser Trp Phe # 110 - Tyr Leu Lys Ser Asp Gly Ala Tyr Ala His Gl - #n Glu Trp Gln Leu Ile # 125 - Gly Asn Lys Trp Tyr Tyr Phe Lys Lys Trp Gl - #y Tyr Met Ala Lys Ser # 140 - Gln Trp Gln Gly Ser Tyr Phe Leu Asn Gly Gl - #n Gly Ala Met Met Gln 145 1 - #50 1 - #55 1 - #60 - Asn Glu Trp Leu Tyr Asp Pro Ala Tyr Ser Al - #a Tyr Phe Tyr Leu Lys # 175 - Ser Asp Gly Thr Tyr Ala Asn Gln Glu Trp Gl - #n Lys Val Gly Gly Lys # 190 - Trp Tyr Tyr Phe Lys Lys Trp Gly Tyr Met Al - #a Arg Asn Glu Trp Gln # 205 - Gly Asn Tyr Tyr Leu Thr Gly Ser Gly Ala Me - #t Ala Thr Asp Glu Val # 220 - Ile Met Asp Gly Thr Arg Tyr Ile Phe Ala Al - #a Ser Gly Glu Leu Lys 225 2 - #30 2 - #35 2 - #40 - Glu Lys Lys Asp Leu Asn Val Gly Trp Val Hi - #s Arg Asp Gly Lys Arg # 255 - Tyr Phe Phe Asn Asn Arg Glu Glu Gln Val Gl - #y Thr Glu His Ala Lys # 270 - Lys Val Ile Asp Ile Ser Glu His Asn Gly Ar - #g Ile Asn Asp Trp Lys # 285 - Lys Val Ile Asp Glu Asn Glu Val Asp Gly Va - #l Ile Val Arg Leu Gly # 300 - Tyr Ser Gly Lys Glu Asp Lys Glu Leu Ala Hi - #s Asn Ile Lys Glu Leu 305 3 - #10 3 - #15 3 - #20 - Asn Arg Leu Gly Ile Pro Tyr Gly Val Tyr Le - #u Tyr Thr Tyr Ala Glu # 335 - Asn Glu Thr Asp Ala Glu Ser Asp Ala Lys Gl - #n Thr Ile Glu Leu Ile # 350 - Lys Lys Tyr Asn Met Asn Leu Ser Tyr Pro Il - #e Tyr Tyr Asp Val Glu # 365 - Asn Trp Glu Tyr Val Asn Lys Ser Lys Arg Al - #a Pro Ser Asp Thr Gly # 380 - Thr Trp Val Lys Ile Ile Asn Lys Tyr Met As - #p Thr Met Lys Gln Ala 385 3 - #90 3 - #95 4 - #00 - Gly Tyr Gln Asn Val Tyr Val Tyr Ser Tyr Ar - #g Ser Leu Leu Gln 
Thr # 415 - Arg Leu Lys His Pro Asp Ile Leu Lys His Va - #l Asn Trp Val Ala Ala # 430 - Tyr Thr Asn Ala Leu Glu Trp Glu Asn Pro Hi - #s Tyr Ser Gly Lys Lys # 445 - Gly Trp Gln Tyr Thr Ser Ser Glu Tyr Met Ly - #s Gly Ile Gln Gly Arg # 460 - Val Asp Val Ser Val Trp Tyr 465 4 - #70 - (2) INFORMATION FOR SEQ ID NO: 159: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1924 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #159: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TACGTCTCAG CCTACTTTTG TAAGAGCAGA AGAATCTCCA CAAGTTGTCG AA - #AAATCTTC 60 - ATTAGAGAAG AAATATGAGG AAGCAAAAGC AAAAGCTGAT ACTGCCAAGA AA - #GATTACGA 120 - AACGGCTAAA AAGAAAGCAG AAGACGCTCA GAAAAAGTAT GAAGATGATC AG - #AAGAGAAC 180 - TGAGGAGAAA GCTCGAAAAG AAGCAGAAGC ATCTCAAAAA TTGAATGATG TG - #GCGCTTGT 240 - TGTTCAAAAT GCATATAAAG AGTACCGAGA AGTTCAAAAT CAACGTAGTA AA - #TATAAATC 300 - TGACGCTGAA TATCAGAAAA AATTAACAGA GGTCGACTCT AAAATAGAGA AG - #GCTAGGAA 360 - AGAGCAACAG GACTTGCAAA ATAAATTTAA TGAAGTAAGA GCAGTTGTAG TT - #CCTGAACC 420 - AAATGCGTTG GCTGAGACTA AGAAAAAAGC AGAAGAAGCT AAAGCAGAAG AA - #AAAGTAGC 480 - TAAGAGAAAA TATGATTATG CAACTCTAAA GGTAGCACTA GCGAAGAAAG AA - #GTAGAGGC 540 - TAAGGAACTT GAAATTGAAA AACTTCAATA TGAAATTTCT ACTTTGGAAC AA - #GAAGTTGC 600 - TACTGCTCAA CATCAAGTAG ATAATTTGAA AAAACTTCTT GCTGGTGCGG AT - #CCTGATGA 660 - TGGCACAGAA GTTATAGAAG CTAAATTAAA AAAAGGAGAA GCTGAGCTAA AC - #GCTAAACA 720 - AGCTGAGTTA GCAAAAAAAC AAACAGAACT TGAAAAACTT CTTGACAGCC TT - #GATCCTGA 780 - AGGTAAGACT CAGGATGAAT TAGATAAAGA AGCAGAAGAA GCTGAGTTGG AT - #AAAAAAGC 840 - TGATGAACTT CAAAATAAAG TTGCTGATTT AGAAAAAGAA ATTAGTAACC TT - #GAAATATT 900 - ACTTGGAGGG GCTGATNCTG AAGATGATAC TGCTGCTCTT CAAAATAAAT TA - #GCTACTAA 960 - AAAAGCTGAA TTGGAAAAAA CTCAAAAAGA ATTAGATGCA GCTCTTAATG AG - #TTAGGCCC 1020 - TGATGGAGAT GAAGAAGAAA CTCCAGCGCC GGCTCCTCAA CCAGAGCAAC CA - #GCTCCTGC 1080 - ACCAAAACCA GAGCAACCAG CTCCAGCTCC AAAACCAGAG CAACCAGCTC CT - #GCACCAAA 1140 - ACCAGAGCAA CCAGCTCCAG CTCCAAAACC AGAGCAACCA GCTCCAGCTC CA - 
#AAACCAGA 1200 - GCAACCAGCT AAGCCGGAGA AACCAGCTGA AGAGCCTACT CAACCAGAAA AA - #CCAGCCAC 1260 - TCCAAAAACA GGCTGGAAAC AAGAAAACGG TATGTGGTAT TTCTACAATA CT - #GATGGTTC 1320 - AATGGCAATA GGTTGGCTCC AAAACAACGG TTCATGGTAC TACCTAAACG CT - #AACGGCGC 1380 - TATGGCAACA GGTTGGGTGA AAGATGGAGA TACCTGGTAC TATCTTGAAG CA - #TCAGGTGC 1440 - TATGAAAGCA AGCCAATGGT TCAAAGTATC AGATAAATGG TACTATGTCA AC - #AGCAATGG 1500 - CGCTATGGCG ACAGGCTGGC TCCAATACAA TGGCTCATGG TACTACCTCA AC - #GCTAATGG 1560 - TGATATGGCG ACAGGATGGC TCCAATACAA CGGTTCATGG TATTACCTCA AC - #GCTAATGG 1620 - TGATATGGCG ACAGGATGGG CTAAAGTCAA CGGTTCATGG TACTACCTAA AC - #GCTAACGG 1680 - TGCTATGGCT ACAGGTTGGG CTAAAGTCAA CGGTTCATGG TACTACCTAA AC - #GCTAACGG 1740 - TTCAATGGCA ACAGGTTGGG TGAAAGATGG AGATACCTGG TACTATCTTG AA - #GCATCAGG 1800 - TGCTATGAAA GCAAGCCAAT GGTTCAAAGT ATCAGATAAA TGGTACTATG TC - #AATGGCTT 1860 - AGGTGCCCTT GCAGTCAACA CAACTGTAGA TGGCTATAAA GTCAATGCCA AT - #GGTGAATG 1920 # 1924 - (2) INFORMATION FOR SEQ ID NO:160: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 641 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:160: - Thr Ser Gln Pro Thr Phe Val Arg Ala Glu Gl - #u Ser Pro Gln Val Val # 15 - Glu Lys Ser Ser Leu Glu Lys Lys Tyr Glu Gl - #u Ala Lys Ala Lys Ala # 30 - Asp Thr Ala Lys Lys Asp Tyr Glu Thr Ala Ly - #s Lys Lys Ala Glu Asp # 45 - Ala Gln Lys Lys Tyr Glu Asp Asp Gln Lys Ar - #g Thr Glu Glu Lys Ala # 60 - Arg Lys Glu Ala Glu Ala Ser Gln Lys Leu As - #n Asp Val Ala Leu Val #80 - Val Gln Asn Ala Tyr Lys Glu Tyr Arg Glu Va - #l Gln Asn Gln Arg Ser # 95 - Lys Tyr Lys Ser Asp Ala Glu Tyr Gln Lys Ly - #s Leu Thr Glu Val Asp # 110 - Ser Lys Ile Glu Lys Ala Arg Lys Glu Gln Gl - #n Asp Leu Gln Asn Lys # 125 - Phe Asn Glu Val Arg Ala Val Val Val Pro Gl - #u Pro Asn Ala Leu Ala # 140 - Glu Thr Lys Lys Lys Ala Glu Glu Ala Lys Al - #a Glu Glu Lys Val Ala 145 1 - #50 1 - #55 1 - #60 - Lys Arg Lys Tyr Asp Tyr Ala Thr Leu 
Lys Va - #l Ala Leu Ala Lys Lys # 175 - Glu Val Glu Ala Lys Glu Leu Glu Ile Glu Ly - #s Leu Gln Tyr Glu Ile # 190 - Ser Thr Leu Glu Gln Glu Val Ala Thr Ala Gl - #n His Gln Val Asp Asn # 205 - Leu Lys Lys Leu Leu Ala Gly Ala Asp Pro As - #p Asp Gly Thr Glu Val # 220 - Ile Glu Ala Lys Leu Lys Lys Gly Glu Ala Gl - #u Leu Asn Ala Lys Gln 225 2 - #30 2 - #35 2 - #40 - Ala Glu Leu Ala Lys Lys Gln Thr Glu Leu Gl - #u Lys Leu Leu Asp Ser # 255 - Leu Asp Pro Glu Gly Lys Thr Gln Asp Glu Le - #u Asp Lys Glu Ala Glu # 270 - Glu Ala Glu Leu Asp Lys Lys Ala Asp Glu Le - #u Gln Asn Lys Val Ala # 285 - Asp Leu Glu Lys Glu Ile Ser Asn Leu Glu Il - #e Leu Leu Gly Gly Ala # 300 - Asp Xaa Glu Asp Asp Thr Ala Ala Leu Gln As - #n Lys Leu Ala Thr Lys 305 3 - #10 3 - #15 3 - #20 - Lys Ala Glu Leu Glu Lys Thr Gln Lys Glu Le - #u Asp Ala Ala Leu Asn # 335 - Glu Leu Gly Pro Asp Gly Asp Glu Glu Glu Th - #r Pro Ala Pro Ala Pro # 350 - Gln Pro Glu Gln Pro Ala Pro Ala Pro Lys Pr - #o Glu Gln Pro Ala Pro # 365 - Ala Pro Lys Pro Glu Gln Pro Ala Pro Ala Pr - #o Lys Pro Glu Gln Pro # 380 - Ala Pro Ala Pro Lys Pro Glu Gln Pro Ala Pr - #o Ala Pro Lys Pro Glu 385 3 - #90 3 - #95 4 - #00 - Gln Pro Ala Lys Pro Glu Lys Pro Ala Glu Gl - #u Pro Thr Gln Pro Glu # 415 - Lys Pro Ala Thr Pro Lys Thr Gly Trp Lys Gl - #n Glu Asn Gly Met Trp
# 430 - Tyr Phe Tyr Asn Thr Asp Gly Ser Met Ala Il - #e Gly Trp Leu Gln Asn # 445 - Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gl - #y Ala Met Ala Thr Gly # 460 - Trp Val Lys Asp Gly Asp Thr Trp Tyr Tyr Le - #u Glu Ala Ser Gly Ala 465 4 - #70 4 - #75 4 - #80 - Met Lys Ala Ser Gln Trp Phe Lys Val Ser As - #p Lys Trp Tyr Tyr Val # 495 - Asn Ser Asn Gly Ala Met Ala Thr Gly Trp Le - #u Gln Tyr Asn Gly Ser # 510 - Trp Tyr Tyr Leu Asn Ala Asn Gly Asp Met Al - #a Thr Gly Trp Leu Gln # 525 - Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala As - #n Gly Asp Met Ala Thr # 540 - Gly Trp Ala Lys Val Asn Gly Ser Trp Tyr Ty - #r Leu Asn Ala Asn Gly 545 5 - #50 5 - #55 5 - #60 - Ala Met Ala Thr Gly Trp Ala Lys Val Asn Gl - #y Ser Trp Tyr Tyr Leu # 575 - Asn Ala Asn Gly Ser Met Ala Thr Gly Trp Va - #l Lys Asp Gly Asp Thr # 590 - Trp Tyr Tyr Leu Glu Ala Ser Gly Ala Met Ly - #s Ala Ser Gln Trp Phe # 605 - Lys Val Ser Asp Lys Trp Tyr Tyr Val Asn Gl - #y Leu Gly Ala Leu Ala # 620 - Val Asn Thr Thr Val Asp Gly Tyr Lys Val As - #n Ala Asn Gly Glu Trp 625 6 - #30 6 - #35 6 - #40 - Val - (2) INFORMATION FOR SEQ ID NO: 161: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 670 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #161: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGGACAGGTG AAAGGTCATG CTACATTTGT GAAATCCATG ACAACTGAAA TG - #TACCAAGA 60 - ACAACAGAAC CATTCTCTCG CCTACAATCA ACGCTTGGNT TCGCAAAATC GC - #ATTGTAGA 120 - TCCTTTTTTG GCGGAGGGAT ATGAGGTCAA TTACCAAGTG TCTGACGACC CT - #GATGCAGT 180 - CTATGGTTAC TTGTCTATTC CAAGTTTGGA AATCATGGAG CCGGTTTATT TG - #GGAGCAGA 240 - TTATCATCAT TTAGGGATGG GCTTGGCTCA TGTGGATGGT ACACCGCTGC CT - #CTGGATGG 300 - TACAGGGATT CGCTCAGTGA TTGCTGGGCA CCGTGCAGAG CCAAGCCATG TC - #TTTTTCCG 360 - CCATTTGGAT CAGCTAAAAG TTGGAGATGC TCTTTATTAT GATAATGGCC AG - #GAAATTGT 420 - AGAATATCAG ATGATGGACA CAGAGATTAT TTTACCGTCG GAATGGGAAA AA - #TTAGAATC 480 - GGTTAGCTCT AAAAATATCA TGACCTTGAT AACCTGCGAT CCGATTCCTA CC - #TTTAATAA 540 - ACGCTTATTA GTGAATTTTG AACGAGTCGC 
TGTTTATCAA AAATCAGATC CA - #CAAACAGC 600 - TGCAGTTGCG AGGGTTGCTT TTACGAAAGA AGGACAATCT GTATCGCGTG TT - #GCAACCTC 660 # 670 - (2) INFORMATION FOR SEQ ID NO:162: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 223 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:162: - Gly Gln Val Lys Gly His Ala Thr Phe Val Ly - #s Ser Met Thr Thr Glu # 15 - Met Tyr Gln Glu Gln Gln Asn His Ser Leu Al - #a Tyr Asn Gln Arg Leu # 30 - Xaa Ser Gln Asn Arg Ile Val Asp Pro Phe Le - #u Ala Glu Gly Tyr Glu # 45 - Val Asn Tyr Gln Val Ser Asp Asp Pro Asp Al - #a Val Tyr Gly Tyr Leu # 60 - Ser Ile Pro Ser Leu Glu Ile Met Glu Pro Va - #l Tyr Leu Gly Ala Asp #80 - Tyr His His Leu Gly Met Gly Leu Ala His Va - #l Asp Gly Thr Pro Leu # 95 - Pro Leu Asp Gly Thr Gly Ile Arg Ser Val Il - #e Ala Gly His Arg Ala # 110 - Glu Pro Ser His Val Phe Phe Arg His Leu As - #p Gln Leu Lys Val Gly # 125 - Asp Ala Leu Tyr Tyr Asp Asn Gly Gln Glu Il - #e Val Glu Tyr Gln Met # 140 - Met Asp Thr Glu Ile Ile Leu Pro Ser Glu Tr - #p Glu Lys Leu Glu Ser 145 1 - #50 1 - #55 1 - #60 - Val Ser Ser Lys Asn Ile Met Thr Leu Ile Th - #r Cys Asp Pro Ile Pro # 175 - Thr Phe Asn Lys Arg Leu Leu Val Asn Phe Gl - #u Arg Val Ala Val Tyr # 190 - Gln Lys Ser Asp Pro Gln Thr Ala Ala Val Al - #a Arg Val Ala Phe Thr # 205 - Lys Glu Gly Gln Ser Val Ser Arg Val Ala Th - #r Ser Gln Trp Leu # 220 - (2) INFORMATION FOR SEQ ID NO: 163: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 784 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #163: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GATTGCTCCT TTGAAGGATT TGAGAGAAAC CATGTTGGAA ATTGCTTCTG GT - #GCTCAAAA 60 - TCTTCGTGCC AAGGAAGTTG GTGCCTATGA ACTGAGAGAA GTAACTCGCC AA - #TTTAATGC 120 - TATGTTGGAT CAGATTGATC AGTTGATGGT AGCTATTCGT AGCCAGGAAG AA - #ACGACCCG 180 - TCAGTACCAA CTTCAAGCCC TTTCGAGCCA GATTAATCCA CATTTCCTCT AT - #AACACTTT 240 - GGACACCATC ATCTGGATGG CTGAATTTCA TGATAGTCAG 
CGAGTGGTGC AG - #GTGACCAA 300 - GTCCTTGGCA ACCTATTTCC GCTTGGCGCT CAATCAAGGC AAGGACTTGA TT - #TGTCTCTC 360 - TGACGAAATC AATCATGTCC GCCAGTATCT CTTTATCCAG AAACAACGCT AT - #GGAGATAA 420 - GCTGGAATAC GAAATTAATG AAAATGTTGC CTTTGATAAT TTAGTCTTAC CC - #AAGCTGGT 480 - CCTACAACCC CTTGTAGAAA ATGCTCTTTA CCATGGCATT AAGGAAAAGG AA - #GGTCAGGG 540 - CCATATTAAA CTTTCTGTCC AGAAACAGGA TTCGGGATTG GTCATCCGTA TT - #GAGGATGA 600 - TGGCGTTGGC TTCCAAGATG CTGGTGATAG TAGTCAAAGT CAACTCAAAC GT - #GGGGGAGT 660 - TGGTCTTCAA AATGTCGATC AACGGCTCAA ACTTCATTTT GGAGCCAATT AC - #CATATGAA 720 - GATTGATTCT AGACCCCAAA AAGGGACGAA AGTTGAAATA TATATAAATA GA - #ATAGAAAC 780 # 784 - (2) INFORMATION FOR SEQ ID NO:164: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 261 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:164: - Ile Ala Pro Leu Lys Asp Leu Arg Glu Thr Me - #t Leu Glu Ile Ala Ser # 15 - Gly Ala Gln Asn Leu Arg Ala Lys Glu Val Gl - #y Ala Tyr Glu Leu Arg # 30 - Glu Val Thr Arg Gln Phe Asn Ala Met Leu As - #p Gln Ile Asp Gln Leu # 45 - Met Val Ala Ile Arg Ser Gln Glu Glu Thr Th - #r Arg Gln Tyr Gln Leu # 60 - Gln Ala Leu Ser Ser Gln Ile Asn Pro His Ph - #e Leu Tyr Asn Thr Leu #80 - Asp Thr Ile Ile Trp Met Ala Glu Phe His As - #p Ser Gln Arg Val Val # 95 - Gln Val Thr Lys Ser Leu Ala Thr Tyr Phe Ar - #g Leu Ala Leu Asn Gln # 110 - Gly Lys Asp Leu Ile Cys Leu Ser Asp Glu Il - #e Asn His Val Arg Gln # 125 - Tyr Leu Phe Ile Gln Lys Gln Arg Tyr Gly As - #p Lys Leu Glu Tyr Glu # 140 - Ile Asn Glu Asn Val Ala Phe Asp Asn Leu Va - #l Leu Pro Lys Leu Val 145 1 - #50 1 - #55 1 - #60 - Leu Gln Pro Leu Val Glu Asn Ala Leu Tyr Hi - #s Gly Ile Lys Glu Lys # 175 - Glu Gly Gln Gly His Ile Lys Leu Ser Val Gl - #n Lys Gln Asp Ser Gly # 190 - Leu Val Ile Arg Ile Glu Asp Asp Gly Val Gl - #y Phe Gln Asp Ala Gly # 205 - Asp Ser Ser Gln Ser Gln Leu Lys Arg Gly Gl - #y Val Gly Leu Gln Asn # 220 - Val Asp Gln Arg Leu Lys Leu His Phe 
Gly Al - #a Asn Tyr His Met Lys 225 2 - #30 2 - #35 2 - #40 - Ile Asp Ser Arg Pro Gln Lys Gly Thr Lys Va - #l Glu Ile Tyr Ile Asn # 255 - Arg Ile Glu Thr Ser 260 - (2) INFORMATION FOR SEQ ID NO: 165: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 325 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #165: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TAGGTCATAT GGGACTTTTT TTCTACAACA AAATAGGCTC CATAATATCT AT - #AAGGGATT 60 - TACCCACTAC AAATATTATA GAGCCGAAAA TTCACATCTA ATATATGCAG AC - #TACTTTGA 120 - AATGAAATTA AAAAAATTAT TAAAGGATGA CACAAAAGTT TTTGAAAAAT CT - #ACATTCAA 180 - ATTTGTAGAA GGATATAAAA TATACCTGAC AGAATCTAAA GAATCTGGAA TT - #AAACAAAT 240 - GGACAATGTC ATAAAATATT TTGAGTTTAT TGAATCTAAA AGTATTGCTT TA - #TATTTTCA 300 # 325 CTGA TAGAT - (2) INFORMATION FOR SEQ ID NO:166: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 108 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:166: - Arg Ser Tyr Gly Thr Phe Phe Leu Gln Gln As - #n Arg Leu His Asn Ile # 15 - Tyr Lys Gly Phe Thr His Tyr Lys Tyr Tyr Ar - #g Ala Glu Asn Ser His # 30 - Leu Ile Tyr Ala Asp Tyr Phe Glu Met Lys Le - #u Lys Lys Leu Leu Lys # 45 - Asp Asp Thr Lys Val Phe Glu Lys Ser Thr Ph - #e Lys Phe Val Glu Gly # 60 - Tyr Lys Ile Tyr Leu Thr Glu Ser Lys Glu Se - #r Gly Ile Lys Gln Met #80 - Asp Asn Val Ile Lys Tyr Phe Glu Phe Ile Gl - #u Ser Lys Ser Ile Ala # 95 - Leu Tyr Phe Gln Lys Arg Leu Asn Glu Leu Il - #e Asp # 105 - (2) INFORMATION FOR SEQ ID NO: 167: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 238 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #167: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CAACGTTGAG AATTATTTGC GAATGTGTTT GGATAGCATT CAGAATCAGA CG - #TATCAAAA 60 - TTTTGAGTGT TTATTAATCA ATGATGGCTC TCCAGATCAT TCATCCAAAA TA - #TGTGAAGA 120 - ATTTGTAGAG AAAGATTCTC GTTTCAAATA TTTTGAGAAA GCAAACGGCG GT - #CTTTCATC 180 - AGCTCGTAAC CTAGGTATTG AATGTTCGGG 
GGGGGGCGTA CATTACTTTT GT - #AGACTC 238 - (2) INFORMATION FOR SEQ ID NO:168: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 79 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:168: - Asn Val Glu Asn Tyr Leu Arg Met Cys Leu As - #p Ser Ile Gln Asn Gln # 15 - Thr Tyr Gln Asn Phe Glu Cys Leu Leu Ile As - #n Asp Gly Ser Pro Asp # 30 - His Ser Ser Lys Ile Cys Glu Glu Phe Val Gl - #u Lys Asp Ser Arg Phe # 45 - Lys Tyr Phe Glu Lys Ala Asn Gly Gly Leu Se - #r Ser Ala Arg Asn Leu # 60 - Gly Ile Glu Cys Ser Gly Gly Gly Val His Ty - #r Phe Cys Arg Leu #75 - (2) INFORMATION FOR SEQ ID NO: 169: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 742 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear
#169: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CTACTATCAA TCAAGTTCTT CAGCCATTGA GGCCACCATT GAGGGCAACA GC - #CAAACGAC 60 - CATCAGCCAG ACTAGCCACT TTATTCAGTC TTATATCAAA AAACTAGAAA CC - #ACCTCGAC 120 - TGGTTTGACC CAGCAGACGG ATGTTCTGGC CTATGCTGAG AATCCCAGTC AA - #GACAAGGT 180 - CGAGGGAATC CGAGATTTGT TTTTGACCAT CTTGAAGTCA GATAAGGACT TG - #AAAACTGT 240 - TGTGCTGGTG ACCAAATCTG GTCAGGTCAT TTCTACAGAT GACAGTGTGC AG - #ATGAAAAC 300 - TTCCTCTGAT ATGATGGCTG AGGATTGGTA CCAAAAGGCC ATTCATCAGG GA - #GCTATGCC 360 - TGTTTTGACT CCAGCTCGTA AATCAGATAG TCAGTGGGTC ATTTCTGTCA CT - #CAAGAACT 420 - TGTTGATGCA AAGGGAGCCA ATCTTGGTGT GCTTCGTTTG GATATTTCTT AT - #GAAACTCT 480 - GGAAGCCTAT CTCAATCAAC TCCAGTTGGG GCAGCAGGGC TTTGCCTTCA TT - #ATCAATGA 540 - AAACCATGAA TTTGTCTACC ATCCTCAACA CACAGTTTAT AGTTCGTCTA GC - #AAAATGGA 600 - GGCTATGAAA CCCTACATCG ATACAGGTCA GGGTTATACT CCTGGTCACA AA - #TCCTACGT 660 - CAGTCAAGAG AAGATTGCAG GAACTGATTG GACGGTGCTT GGCGTGTCAT CA - #TTGGAAAA 720 # 742GTC AG - (2) INFORMATION FOR SEQ ID NO:170: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 247 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:170: - Tyr Tyr Gln Ser Ser Ser Ser Ala Ile Glu Al - #a Thr Ile Glu Gly Asn # 15 - Ser Gln Thr Thr Ile Ser Gln Thr Ser His Ph - #e Ile Gln Ser Tyr Ile # 30 - Lys Lys Leu Glu Thr Thr Ser Thr Gly Leu Th - #r Gln Gln Thr Asp Val # 45 - Leu Ala Tyr Ala Glu Asn Pro Ser Gln Asp Ly - #s Val Glu Gly Ile Arg # 60 - Asp Leu Phe Leu Thr Ile Leu Lys Ser Asp Ly - #s Asp Leu Lys Thr Val #80 - Val Leu Val Thr Lys Ser Gly Gln Val Ile Se - #r Thr Asp Asp Ser Val # 95 - Gln Met Lys Thr Ser Ser Asp Met Met Ala Gl - #u Asp Trp Tyr Gln Lys # 110 - Ala Ile His Gln Gly Ala Met Pro Val Leu Th - #r Pro Ala Arg Lys Ser # 125 - Asp Ser Gln Trp Val Ile Ser Val Thr Gln Gl - #u Leu Val Asp Ala Lys # 140 - Gly Ala Asn Leu Gly Val Leu Arg Leu Asp Il - #e Ser Tyr Glu Thr Leu 145 1 - #50 1 - #55 1 - #60 - Glu Ala Tyr Leu 
Asn Gln Leu Gln Leu Gly Gl - #n Gln Gly Phe Ala Phe # 175 - Ile Ile Asn Glu Asn His Glu Phe Val Tyr Hi - #s Pro Gln His Thr Val # 190 - Tyr Ser Ser Ser Ser Lys Met Glu Ala Met Ly - #s Pro Tyr Ile Asp Thr # 205 - Gly Gln Gly Tyr Thr Pro Gly His Lys Ser Ty - #r Val Ser Gln Glu Lys # 220 - Ile Ala Gly Thr Asp Trp Thr Val Leu Gly Va - #l Ser Ser Leu Glu Lys 225 2 - #30 2 - #35 2 - #40 - Leu Asp Gln Val Arg Ser Gln 245 - (2) INFORMATION FOR SEQ ID NO: 171: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1282 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #171: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GACAAAAACA TTAAAACGTC CTGAGGTTTT ATCACCTGCA GGGACTTTAG AG - #AAGCTAAA 60 - GGTAGCTGTT CAGTATGGAG CAGATGCTGT CTTTATCGGT GGTCAGGCCT AT - #GGTCTTCG 120 - TAGCCGTGCG GGAAACTTTA CTTTCGAACA GATGGAAGAA GGCGTGCAGT TT - #GCGGCCAA 180 - GTATGGTGCC AAGGTCTATG TAGCGGCTAA TATGGTTATG CACGAAGGAA AT - #GAAGCTGG 240 - TGCTGGTGAG TGGTTCCGTA AACTGCGTGA TATCGGGATT GCAGCAGTTA TC - #GTATCTGA 300 - CCCAGCCTTG ATTATGATTG CAGTGACTGA AGCACCAGGC CTTGAAATCC AC - #CTTTCTAC 360 - CCAAGCCAGT GCCACTAACT ATGAAACCCT TGAGTTCTGG AAAGAGCTAG GC - #TTGACTCG 420 - TGTCGTTTTA GCGCGTGAGG TTTCAATGGA AGAATTAGCT GAGATCCGCA AA - #CGTACAGA 480 - TGTTGAAATT GAAGCCTTTG TCCATGGAGC TATGTGTATT TCATACTCTG GA - #CGTTGTAC 540 - TCTTTCAAAC CACATGAGTA TGCGTGATGC CAACCGTGGT GGATGTTCTC AG - #TCATGCCG 600 - TTGGAAATAC GACCTTTACG ATATGCCATT TGGGAAAGAA CGTAAGAGTT TG - #CAGGGTGA 660 - GATTCCAGAA GAATTTTCAA TGTCAGCCGT TGACATGTCT ATGATTGACC AN - #ATTCCAGA 720 - TATGATTGAA AATGGTGTGG ACAGTCTAAA AATCGAAGGA CGTATGNAGT CT - #ATTCACTA 780 - NGTATCAACA GTAACCAACT GCTACAAGGC GGCTGTGGAT GCCTATCTTG AA - #AGTCCTGA 840 - AAAGTTTGAA GCTATCAAAC AAGACTTGGT GGACGAGATG TGGAAGGTTG CC - #CAACGTGA 900 - ACTGGCTACA GGATTTTACT ATGGTACACC ATCTGAAAAT GAGCAGTTGT TT - #GGTGCTCG 960 - TCGTAAAATC CCTGAGTACA AGTTTGTCGC TGAAGTGGTT TCTTATGATG AT - #GCGGCACA 1020 - AACAGCAACT ATTCGTCAAC GAAACGTCAT TAACGAAGGG GACCAAGTTG AG - #TTTTATGG 1080 
- TCCAGGTTTC CGTCATTTTG AAACCTATAT TGAAGATTTG CATGATGCTA AA - #GGCAATAA 1140 - AATCGACCGC GCTCCAAATC CAATGGAACT ATTGACTATT AAAGTCCCAC AA - #CCTGTTCA 1200 - ATCAGGAGAC ATGGTTCGAG CTCTTAAAGA GGGGCTTATC AATCTTTATA AG - #GAAGATGG 1260 # 1282GTG CT - (2) INFORMATION FOR SEQ ID NO:172: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 427 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:172: - Thr Lys Thr Leu Lys Arg Pro Glu Val Leu Se - #r Pro Ala Gly Thr Leu # 15 - Glu Lys Leu Lys Val Ala Val Gln Tyr Gly Al - #a Asp Ala Val Phe Ile # 30 - Gly Gly Gln Ala Tyr Gly Leu Arg Ser Arg Al - #a Gly Asn Phe Thr Phe # 45 - Glu Gln Met Glu Glu Gly Val Gln Phe Ala Al - #a Lys Tyr Gly Ala Lys # 60 - Val Tyr Val Ala Ala Asn Met Val Met His Gl - #u Gly Asn Glu Ala Gly #80 - Ala Gly Glu Trp Phe Arg Lys Leu Arg Asp Il - #e Gly Ile Ala Ala Val # 95 - Ile Val Ser Asp Pro Ala Leu Ile Met Ile Al - #a Val Thr Glu Ala Pro # 110 - Gly Leu Glu Ile His Leu Ser Thr Gln Ala Se - #r Ala Thr Asn Tyr Glu # 125 - Thr Leu Glu Phe Trp Lys Glu Leu Gly Leu Th - #r Arg Val Val Leu Ala # 140 - Arg Glu Val Ser Met Glu Glu Leu Ala Glu Il - #e Arg Lys Arg Thr Asp 145 1 - #50 1 - #55 1 - #60 - Val Glu Ile Glu Ala Phe Val His Gly Ala Me - #t Cys Ile Ser Tyr Ser # 175 - Gly Arg Cys Thr Leu Ser Asn His Met Ser Me - #t Arg Asp Ala Asn Arg # 190 - Gly Gly Cys Ser Gln Ser Cys Arg Trp Lys Ty - #r Asp Leu Tyr Asp Met # 205 - Pro Phe Gly Lys Glu Arg Lys Ser Leu Gln Gl - #y Glu Ile Pro Glu Glu # 220 - Phe Ser Met Ser Ala Val Asp Met Ser Met Il - #e Asp Xaa Ile Pro Asp 225 2 - #30 2 - #35 2 - #40 - Met Ile Glu Asn Gly Val Asp Ser Leu Lys Il - #e Glu Gly Arg Met Xaa # 255 - Ser Ile His Xaa Val Ser Thr Val Thr Asn Cy - #s Tyr Lys Ala Ala Val # 270 - Asp Ala Tyr Leu Glu Ser Pro Glu Lys Phe Gl - #u Ala Ile Lys Gln Asp # 285 - Leu Val Asp Glu Met Trp Lys Val Ala Gln Ar - #g Glu Leu Ala Thr Gly # 300 - Phe Tyr Tyr Gly Thr Pro Ser Glu 
Asn Glu Gl - #n Leu Phe Gly Ala Arg 305 3 - #10 3 - #15 3 - #20 - Arg Lys Ile Pro Glu Tyr Lys Phe Val Ala Gl - #u Val Val Ser Tyr Asp # 335 - Asp Ala Ala Gln Thr Ala Thr Ile Arg Gln Ar - #g Asn Val Ile Asn Glu # 350 - Gly Asp Gln Val Glu Phe Tyr Gly Pro Gly Ph - #e Arg His Phe Glu Thr # 365 - Tyr Ile Glu Asp Leu His Asp Ala Lys Gly As - #n Lys Ile Asp Arg Ala # 380 - Pro Asn Pro Met Glu Leu Leu Thr Ile Lys Va - #l Pro Gln Pro Val Gln 385 3 - #90 3 - #95 4 - #00 - Ser Gly Asp Met Val Arg Ala Leu Lys Glu Gl - #y Leu Ile Asn Leu Tyr # 415 - Lys Glu Asp Gly Thr Ser Val Thr Val Arg Al - #a # 425 - (2) INFORMATION FOR SEQ ID NO: 173: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 778 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #173: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TTCTCAGGAG ACCTTTAAAA ATATCACCAA TAGCTTCTCC ATGCAAATCA AT - #CGTCGCGT 60 - CAACCAAGGA ACGCCTCGTG GTGCTGGGAA TATCAAGGGT GAAGACATCA AA - #AAAATCAC 120 - CGAAAACAAG GCCATTGAGT CTTATGTCAA ACGTATCAAC GCTATCGGAG AT - #TTGACTGG 180 - ATATGACCTG ATTGAAACGC CAGAAACCAA GAAGAATCTC ACTGCTGATC GT - #GCCAAGCG 240 - TTTTGGAAGT AGCTTGATGA TTACAGGTGT CAATGACTCC TCTAAAGAAG AC - #AAGTTTGT 300 - CTCTGGTTCT TATAAACTAG TCGAAGGAGA GCACTTAACC AACGACGACA AG - #GATAAAAT 360 - CCTCTTGCAC AAGGACTTGG CAGCCAAACA CGGCTGGAAA GTAGGGGACA AG - #GTTAAACT 420 - GGACTCTAAT ATCTACGATG CAGATAATGA AAAAGGAGCC AAGGAAACAG TT - #GAAGTGAC 480 - AATCAAGGGA CTCTTTGATG GTCATAATAA GTCAGCAGTA ACCTACTCAC AA - #GAACTTTA 540 - CGAAAACACA GCTATTACAG ACATTCACAC TGCTGCAAAA CTTTATGGAT AC - #ACAGAAGA 600 - CACAGCCATT TATGGGGACG CAACCTTCTT TGTAACAGCA GACAAGAACT TG - #GATGATGT 660 - TATGAAAGAG TTGAATGGCA TCAGTGGTAT CAACTGGAAG AGCTACACAC TC - #GTCAAGAG 720 - CTCCTCTAAC TACCCAGCTC TTGAGCAATC TATCTCTGGT ATGTACAAGA TG - #GCCAAC 778 - (2) INFORMATION FOR SEQ ID NO:174: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 259 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) 
SEQUENCE DESCRIPTION: SEQ ID NO:174: - Ser Gln Glu Thr Phe Lys Asn Ile Thr Asn Se - #r Phe Ser Met Gln Ile # 15 - Asn Arg Arg Val Asn Gln Gly Thr Pro Arg Gl - #y Ala Gly Asn Ile Lys # 30 - Gly Glu Asp Ile Lys Lys Ile Thr Glu Asn Ly - #s Ala Ile Glu Ser Tyr # 45 - Val Lys Arg Ile Asn Ala Ile Gly Asp Leu Th - #r Gly Tyr Asp Leu Ile # 60 - Glu Thr Pro Glu Thr Lys Lys Asn Leu Thr Al - #a Asp Arg Ala Lys Arg #80 - Phe Gly Ser Ser Leu Met Ile Thr Gly Val As - #n Asp Ser Ser Lys Glu # 95 - Asp Lys Phe Val Ser Gly Ser Tyr Lys Leu Va - #l Glu Gly Glu His Leu # 110 - Thr Asn Asp Asp Lys Asp Lys Ile Leu Leu Hi - #s Lys Asp Leu Ala Ala # 125 - Lys His Gly Trp Lys Val Gly Asp Lys Val Ly - #s Leu Asp Ser Asn Ile # 140 - Tyr Asp Ala Asp Asn Glu Lys Gly Ala Lys Gl - #u Thr Val Glu Val Thr 145 1 - #50 1 - #55 1 - #60 - Ile Lys Gly Leu Phe Asp Gly His Asn Lys Se - #r Ala Val Thr Tyr Ser # 175 - Gln Glu Leu Tyr Glu Asn Thr Ala Ile Thr As - #p Ile His Thr Ala Ala # 190 - Lys Leu Tyr Gly Tyr Thr Glu Asp Thr Ala Il - #e Tyr Gly Asp Ala Thr # 205 - Phe Phe Val Thr Ala Asp Lys Asn Leu Asp As - #p Val Met Lys Glu Leu # 220 - Asn Gly Ile Ser Gly Ile Asn Trp Lys Ser Ty - #r Thr Leu Val Lys Ser
225 2 - #30 2 - #35 2 - #40 - Ser Ser Asn Tyr Pro Ala Leu Glu Gln Ser Il - #e Ser Gly Met Tyr Lys # 255 - Met Ala Asn - (2) INFORMATION FOR SEQ ID NO: 175: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 694 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #175: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - AGTAAATGCG CAATCAAATT CATTAATATT AATAGATGAA CCTGAAATCT CA - #CTTCATCC 60 - GAGTGCAATC TATAAATTTA AAGAGTTTTT ACTTCAAGAG TGTTTAAATA AA - #AAACATCA 120 - AATTATTATC ACTACACATT CTACACAACT TATAAAAGAT TTTCCTAGAG AA - #GCCGTGAA 180 - ACTTTTAGTG AAAAACGGAG AAAAGGTAGA TGTTATTGAA AATATTGATT AT - #CAGGATGC 240 - ATTTTTTGAA TTAGGTGATG TGTATCATTC TAGGAAGATG ATTTATGTTG AA - #GATAGACT 300 - AGCTAAATAT ATTCTAGAGT TTGTTATCAC TCATTCAGGT AGTGAGAATC TT - #AAACAGAA 360 - TTTAGTAGTG AGATATATTC CTGGTGGAGC AAATCAAATA ATTTGTAATA AT - #ATTTTAAA 420 - CTCATCGTAT TTAGATTCCG ATAACCATTA TTTTTGGCTT GATGGAGATC AA - #AACACTAA 480 - TGTTAGTGAA TCAAATAATT TAATGAACTA TCTTGAAAAT GGTGTTGTTA TA - #TCAGATAA 540 - AATTCCTGAA TCAGATAATA AAAATCTTGA TGATATTATA AAATTGATAA NG - #GGATGTCC 600 - AATTAAATTT AATGTTTCAG GTAATAAAGG GCAAAAAAAT AATATTGAAT TA - #ATTGCGAA 660 # 694 GATT ATTGGGCTAA ATAC - (2) INFORMATION FOR SEQ ID NO:176: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 231 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:176: - Val Asn Ala Gln Ser Asn Ser Leu Ile Leu Il - #e Asp Glu Pro Glu Ile # 15 - Ser Leu His Pro Ser Ala Ile Tyr Lys Phe Ly - #s Glu Phe Leu Leu Gln # 30 - Glu Cys Leu Asn Lys Lys His Gln Ile Ile Il - #e Thr Thr His Ser Thr # 45 - Gln Leu Ile Lys Asp Phe Pro Arg Glu Ala Va - #l Lys Leu Leu Val Lys # 60 - Asn Gly Glu Lys Val Asp Val Ile Glu Asn Il - #e Asp Tyr Gln Asp Ala #80 - Phe Phe Glu Leu Gly Asp Val Tyr His Ser Ar - #g Lys Met Ile Tyr Val # 95 - Glu Asp Arg Leu Ala Lys Tyr Ile Leu Glu Ph - #e Val Ile Thr His Ser # 110 - Gly Ser Glu Asn Leu Lys Gln Asn Leu Val Va 
- #l Arg Tyr Ile Pro Gly # 125 - Gly Ala Asn Gln Ile Ile Cys Asn Asn Ile Le - #u Asn Ser Ser Tyr Leu # 140 - Asp Ser Asp Asn His Tyr Phe Trp Leu Asp Gl - #y Asp Gln Asn Thr Asn 145 1 - #50 1 - #55 1 - #60 - Val Ser Glu Ser Asn Asn Leu Met Asn Tyr Le - #u Glu Asn Gly Val Val # 175 - Ile Ser Asp Lys Ile Pro Glu Ser Asp Asn Ly - #s Asn Leu Asp Asp Ile
# 190 - Ile Lys Leu Ile Xaa Gly Cys Pro Ile Lys Ph - #e Asn Val Ser Gly Asn # 205 - Lys Gly Gln Lys Asn Asn Ile Glu Leu Ile Al - #a Lys Gln Arg Ser Phe # 220 - Ile Asp Tyr Trp Ala Lys Tyr 225 2 - #30 - (2) INFORMATION FOR SEQ ID NO: 177: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 550 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #177: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TTACCGCGTT CATCAAGATG TCAAACAAGT CATGACCTAT CAACCCATGG TG - #CGAGAAAT 60 - ATTGAGTGAA CAAGACACCC CAGCAAACGA AGAGCTTGTG CTTGCTATGA TT - #TATACTGA 120 - AACAAAAGGA AAAGAAGGCG ATGTTATGCA GTCTAGTGAG TCTGCAAGTG GT - #TCCACCAA 180 - CACCATCAAT GATAATGCCT CTAGCATTCG GCAAGGCATT CAAACTCTGA CA - #GGCAATCT 240 - CTATCTGGCG CAGAAGAAGG GGGTAGATAT CTGGACAGCT GTTCAAGCCT AT - #AATTTTGG 300 - ACCTGCCTAT ATCGATTTTA TCGCCCAAAA TGGCAAGGAA AATACCCTGG CT - #CTAGCCAA 360 - ACAGTACTCT CGTGAGACTG TTGCCCCCTT GCTTGGTAAT AGGACTGGAA AG - #ACTTATAG 420 - TTATATTCAC CCCATTTCCA TTTTTCACGG TGCTGAACTC TATGTAAATG GA - #GGAAACTA 480 - TTATTATTCT AGACAGGTAC GACTTAACCT TTACATCATC AAATGTTTCA CT - #CTCTTTTC 540 # 550 - (2) INFORMATION FOR SEQ ID NO:178: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 183 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:178: - Tyr Arg Val His Gln Asp Val Lys Gln Val Me - #t Thr Tyr Gln Pro Met # 15 - Val Arg Glu Ile Leu Ser Glu Gln Asp Thr Pr - #o Ala Asn Glu Glu Leu # 30 - Val Leu Ala Met Ile Tyr Thr Glu Thr Lys Gl - #y Lys Glu Gly Asp Val # 45 - Met Gln Ser Ser Glu Ser Ala Ser Gly Ser Th - #r Asn Thr Ile Asn Asp # 60 - Asn Ala Ser Ser Ile Arg Gln Gly Ile Gln Th - #r Leu Thr Gly Asn Leu #80 - Tyr Leu Ala Gln Lys Lys Gly Val Asp Ile Tr - #p Thr Ala Val Gln Ala # 95 - Tyr Asn Phe Gly Pro Ala Tyr Ile Asp Phe Il - #e Ala Gln Asn Gly Lys # 110 - Glu Asn Thr Leu Ala Leu Ala Lys Gln Tyr Se - #r Arg Glu Thr Val Ala # 125 - Pro Leu Leu Gly Asn Arg Thr Gly Lys Thr Ty - #r Ser Tyr 
Ile His Pro # 140 - Ile Ser Ile Phe His Gly Ala Glu Leu Tyr Va - #l Asn Gly Gly Asn Tyr 145 1 - #50 1 - #55 1 - #60 - Tyr Tyr Ser Arg Gln Val Arg Leu Asn Leu Ty - #r Ile Ile Lys Cys Phe # 175 - Thr Leu Phe Ser Thr Ser Gly 180 - (2) INFORMATION FOR SEQ ID NO: 179: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 334 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #179: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GTGGATGGGC TTTAACTATC TTCGTATTCG CCGTGCGGCT AAAATTGTGG AC - #AATGAGGA 60 - GTTTGAAGCC TTGATTCGTA CGGGTCAATT GATTGATTTG CGCGACCCAG CA - #GAATTCCA 120 - CAGAAAACAT ATCCTTGGTG CACGCAATAT TCCTTCAAGT CAGTTGAAAA CT - #AGTCTTGC 180 - AGCCCTTCGT AAAGATAAAC CTGTCCTTCT CTACGAAAAC CAACGTGCGC AA - #CGAGTTAC 240 - AAATGCAGCT CTTTACTTGA AAAAACAAGG TTTTTCTGAG ATTTATATCC TT - #TCTTATGG 300 # 334 GGGA AAGTGAAGAC TAGC - (2) INFORMATION FOR SEQ ID NO:180: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 111 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:180: - Trp Met Gly Phe Asn Tyr Leu Arg Ile Arg Ar - #g Ala Ala Lys Ile Val # 15 - Asp Asn Glu Glu Phe Glu Ala Leu Ile Arg Th - #r Gly Gln Leu Ile Asp # 30 - Leu Arg Asp Pro Ala Glu Phe His Arg Lys Hi - #s Ile Leu Gly Ala Arg # 45 - Asn Ile Pro Ser Ser Gln Leu Lys Thr Ser Le - #u Ala Ala Leu Arg Lys # 60 - Asp Lys Pro Val Leu Leu Tyr Glu Asn Gln Ar - #g Ala Gln Arg Val Thr #80 - Asn Ala Ala Leu Tyr Leu Lys Lys Gln Gly Ph - #e Ser Glu Ile Tyr Ile # 95 - Leu Ser Tyr Gly Leu Asp Ser Trp Lys Gly Ly - #s Val Lys Thr Ser # 110 - (2) INFORMATION FOR SEQ ID NO: 181: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1342 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #181: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - ACTAAACCAG CATCGTTCGC AGGAAAATAA GGACAATAAT CGTGTCTCTT AT - #GTGGATGG 60 - CAGCCAGTCA AGTCAGAAAA GTGAAAACTT GACACCAGAC CAGGTTAGCC AG - #AAAGAAGG 120 - AATTCAGGCT GAGCAAATTG 
TAATCAAAAT TACAGATCAG GGCTATGTAA CG - #TCACACGG 180 - TGACCACTAT CATTACTATA ATGGGAAAGT TCCTTATGAT GCCCTCTTTA GT - #GAAGAACT 240 - CTTGATGAAG GATCCAAACT ATCAACTTAA AGACGCTGAT ATTGTCAATG AA - #GTCAAGGG 300 - TGGTTATATC ATCAAGGTCG ATGGAAAATA TTATGTCTAC CTGAAAGATG CA - #GCTCATGC 360 - TGATAATGTT CGAACTAAAG ATGAAATCAA TCGTCAAAAA CAAGAACATG TC - #AAAGATAA 420 - TGAGAAGGTT AACTCTAATG TTGCTGTAGC AAGGTCTCAG GGACGATATA CG - #ACAAATGA 480 - TGGTTATGTC TTTAATCCAG CTGATATTAT CGAAGATACG GGTAATGCTT AT - #ATCGTTCC 540 - TCATGGAGGT CACTATCACT ACATTCCCAA AAGCGATTTA TCTGCTAGTG AA - #TTAGCAGC 600 - AGCTAAAGCA CATCTGGCTG GAAAAAATAT GCAACCGAGT CAGTTAAGCT AT - #TCTTCAAC 660 - AGCTAGTGAC AATAACACGC AATCTGTAGC AAAAGGATCA ACTAGCAAGC CA - #GCAAATAA 720 - ATCTGAAAAT CTCCAGAGTC TTTTGAAGGA ACTCTATGAT TCACCTAGCG CC - #CAACGTTA 780 - CAGTGAATCA GATGGCCTGG TCTTTGACCC TGCTAAGATT ATCAGTCGTA CA - #CCAAATGG 840 - AGTTGCGATT CCGCATGGCG ACCATTACCA CTTTATTCCT TACAGCAAGC TT - #TCTGCCTT 900 - AGAAGAAAAG ATTGCCAGAA TGGTGCCTAT CAGTGGAACT GGTTCTACAG TT - #TCTACAAA 960 - TGCAAAACCT AATGAAGTAG TGTCTAGTCT AGGCAGTCTT TCAAGCAATC CT - #TCTTCTTT 1020 - AACGACAAGT AAGGAGCTCT CTTCAGCATC TGATGGTTAT ATTTTTAATC CA - #AAAGATAT 1080 - CGTTGAAGAA ACGGCTACAG CTTATATTGT AAGACATGGT GATCATTTCC AT - #TACATTCC 1140 - AAAATCAAAT CAAATTGGGC AACCGACTCT TCCAAACAAT AGTCTAGCAA CA - #CCTTCTCC 1200 - ATCTCTTCCA ATCAATCCAG GAACTTCACA TGAGAAACAT GAAGAAGATG GA - #TACGGATT 1260 - TGATGCTAAT CGTATTATCG CTGAAGATGA ATCAGGTTTT GTCATGAGTC AC - #GGAGACCA 1320 # 1342AGA AG - (2) INFORMATION FOR SEQ ID NO:182: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 447 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:182: - Leu Asn Gln His Arg Ser Gln Glu Asn Lys As - #p Asn Asn Arg Val Ser # 15 - Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Se - #r Glu Asn Leu Thr Pro # 30 - Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Al - #a Glu Gln Ile Val Ile # 45 - Lys Ile Thr Asp 
Gln Gly Tyr Val Thr Ser Hi - #s Gly Asp His Tyr His # 60 - Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Le - #u Phe Ser Glu Glu Leu #80 - Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys As - #p Ala Asp Ile Val Asn # 95 - Glu Val Lys Gly Gly Tyr Ile Ile Lys Val As - #p Gly Lys Tyr Tyr Val # 110 - Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Va - #l Arg Thr Lys Asp Glu # 125 - Ile Asn Arg Gln Lys Gln Glu His Val Lys As - #p Asn Glu Lys Val Asn # 140 - Ser Asn Val Ala Val Ala Arg Ser Gln Gly Ar - #g Tyr Thr Thr Asn Asp 145 1 - #50 1 - #55 1 - #60 - Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Gl - #u Asp Thr Gly Asn Ala # 175 - Tyr Ile Val Pro His Gly Gly His Tyr His Ty - #r Ile Pro Lys Ser Asp # 190 - Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Al - #a His Leu Ala Gly Lys # 205 - Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Se - #r Thr Ala Ser Asp Asn # 220 - Asn Thr Gln Ser Val Ala Lys Gly Ser Thr Se - #r Lys Pro Ala Asn Lys 225 2 - #30 2 - #35 2 - #40 - Ser Glu Asn Leu Gln Ser Leu Leu Lys Glu Le - #u Tyr Asp Ser Pro Ser # 255 - Ala Gln Arg Tyr Ser Glu Ser Asp Gly Leu Va - #l Phe Asp Pro Ala Lys # 270 - Ile Ile Ser Arg Thr Pro Asn Gly Val Ala Il - #e Pro His Gly Asp His # 285 - Tyr His Phe Ile Pro Tyr Ser Lys Leu Ser Al - #a Leu Glu Glu Lys Ile # 300 - Ala Arg Met Val Pro Ile Ser Gly Thr Gly Se - #r Thr Val Ser Thr Asn 305 3 - #10 3 - #15 3 - #20 - Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gl - #y Ser Leu Ser Ser Asn # 335 - Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Se - #r Ser Ala Ser Asp Gly # 350 - Tyr Ile Phe Asn Pro Lys Asp Ile Val Glu Gl - #u Thr Ala Thr Ala Tyr # 365 - Ile Val Arg His Gly Asp His Phe His Tyr Il - #e Pro Lys Ser Asn Gln # 380 - Ile Gly Gln Pro Thr Leu Pro Asn Asn Ser Le - #u Ala Thr Pro Ser Pro 385 3 - #90 3 - #95 4 - #00 - Ser Leu Pro Ile Asn Pro Gly Thr Ser His Gl - #u Lys His Glu Glu Asp # 415 - Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Al - #a Glu Asp Glu Ser Gly # 430 - Phe Val Met Ser His Gly Asp His Asn His Ty - #r Phe Phe Lys Lys # 445 - (2) INFORMATION FOR SEQ ID NO: 183: 
- (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 934 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #183: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGACTACCTT GAAATCCCAC TTTACAGCTA TCTTGGTGGA TTCAACACTA AA - #GTTCTTCC 60 - AACTCCAATG ATGAACATCA TCAACGGTGG TTCTCACTCT GACGCTCCAA TC - #GCTTTCCA 120 - AGAGTTCATG ATCTTGCCAG TTGGTGCGCC AACATTTAAA GAAGCCCTTC GT - #TACGGTGC 180 - TGAAATCTTC CACGCTCTTA AGAAAATCCT TAAATCACGT GGTTTGGAAA CT - #GCCGTAGG 240 - TGACGAAGGT GGATTCGCTC CTCGTTTCGA AGGAACTGAA GATGGTGTTG AA - #ACTATCCT 300 - TGCTGCGATT GAAGCTGCTG GATATGTACC AGGTAAAGAC GTATTTATCG GA - #TTTGACTG 360 - TGCTTCATCA GAATTCTACG ATAAAGAACG TAAAGTTTAC GACTACACTA AA - #TTTGAAGG 420 - TGAAGGTGCT GCTGTTCGTA CATCTGCAGA ACAAATCGAC TACCTTGAAG AA - #TTGGTTAA 480 - CAAATACCCA ATCATCACTA TTGAAGATGG TATGGATGAA AACGACTGGG AT - #GGTTGGAA 540 - AGCTCTTACT GAACGTCTTG GTAAGAAAGT ACAACTTGTT GGTGACGACT TC - #TTCGTAAC 600 - AAACACTGAC TACCTTGCAC GTGGTATCCA AGAAGGTGCT GCTAACTCAA TC - #CTTATCAA 660
- AGTTAACCAA ATCGGTACTC TTACTGAAAC TTTTGAAGCT ATCGAAATGG CT - #AAAGAAGC 720 - TGGTTACACT GCTGTTGTAT CACACCGTTC AGGTGAAACT GAAGATTCAA CA - #ATCGCTGA 780 - TATTGCAGTT GCAACTAACG CAGGACAAAT CAAGACTGGT TCACTTTCAC GT - #ACAGACCG 840 - CATCGCTAAA TACAACCAAT TGCTTCGTAT CGAAGACCAA CTTGGTGAAG TA - #GCTGAATA 900 # 934 TTCT ACAACCTTAA AAAA - (2) INFORMATION FOR SEQ ID NO:184: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 311 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:184: - Asp Tyr Leu Glu Ile Pro Leu Tyr Ser Tyr Le - #u Gly Gly Phe Asn Thr # 15 - Lys Val Leu Pro Thr Pro Met Met Asn Ile Il - #e Asn Gly Gly Ser His # 30 - Ser Asp Ala Pro Ile Ala Phe Gln Glu Phe Me - #t Ile Leu Pro Val Gly # 45 - Ala Pro Thr Phe Lys Glu Ala Leu Arg Tyr Gl - #y Ala Glu Ile Phe His # 60 - Ala Leu Lys Lys Ile Leu Lys Ser Arg Gly Le - #u Glu Thr Ala Val Gly #80 - Asp Glu Gly Gly Phe Ala Pro Arg Phe Glu Gl - #y Thr Glu Asp Gly Val # 95 - Glu Thr Ile Leu Ala Ala Ile Glu Ala Ala Gl - #y Tyr Val Pro Gly Lys # 110 - Asp Val Phe Ile Gly Phe Asp Cys Ala Ser Se - #r Glu Phe Tyr Asp Lys # 125 - Glu Arg Lys Val Tyr Asp Tyr Thr Lys Phe Gl - #u Gly Glu Gly Ala Ala # 140 - Val Arg Thr Ser Ala Glu Gln Ile Asp Tyr Le - #u Glu Glu Leu Val Asn 145 1 - #50 1 - #55 1 - #60 - Lys Tyr Pro Ile Ile Thr Ile Glu Asp Gly Me - #t Asp Glu Asn Asp Trp # 175 - Asp Gly Trp Lys Ala Leu Thr Glu Arg Leu Gl - #y Lys Lys Val Gln Leu # 190 - Val Gly Asp Asp Phe Phe Val Thr Asn Thr As - #p Tyr Leu Ala Arg Gly # 205 - Ile Gln Glu Gly Ala Ala Asn Ser Ile Leu Il - #e Lys Val Asn Gln Ile # 220 - Gly Thr Leu Thr Glu Thr Phe Glu Ala Ile Gl - #u Met Ala Lys Glu Ala 225 2 - #30 2 - #35 2 - #40 - Gly Tyr Thr Ala Val Val Ser His Arg Ser Gl - #y Glu Thr Glu Asp Ser # 255 - Thr Ile Ala Asp Ile Ala Val Ala Thr Asn Al - #a Gly Gln Ile Lys Thr # 270 - Gly Ser Leu Ser Arg Thr Asp Arg Ile Ala Ly - #s Tyr Asn Gln Leu Leu # 285 - Arg Ile Glu Asp Gln 
Leu Gly Glu Val Ala Gl - #u Tyr Arg Gly Leu Lys # 300 - Ser Phe Tyr Asn Leu Lys Lys 305 3 - #10 - (2) INFORMATION FOR SEQ ID NO: 185: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 541 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #185: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TCGTATCTTT TTTTGGAGCA ATGTTCGCGT AGAAGGACAT TCCATGGATC CG - #ACCCTAGC 60 - GGATGGCGAA ATTCTCTTCG TTGTAAAACA CCTTCCTATT GACCGTTTTG AT - #ATCGTGGT 120 - GGCCCATGAG GAAGATGGCA ATAAGGACAT CGTCAAGCGC GTGATTGGAA TG - #CCTGGCGA 180 - CACCATTCGT TACGAAAATG ATAAACTCTA CATCAATGAC AAAGAAACGG AC - #GAGCCTTA 240 - TCTAGCAGAC TATATCAAAC GCTTCAAGGA TGACAAACTC CAAAGCACTT AC - #TCAGGCAA 300 - GGGCTTTGAA GGAAATAAAG GAACTTTCTT TAGAAGTATC GCTCAAAAAG CT - #CAAGCCTT 360 - CACAGTTGAT GTCAACTACA ACACCAACTT TAGCTTTACT GTTCCAGAAG GA - #GAATACCT 420 - TCTCCTCGGA GATGACCGCT TGGTTTCGAG CGACAGCCGC CACGTAGGTA CC - #TTCAAAGC 480 - AAAAGATATC ACAGGGGAAG CTAAATTCCG CTTATGGCCA ATCACCCGTA TC - #GGAACATT 540 # 541 - (2) INFORMATION FOR SEQ ID NO:186: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 180 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:186: - Arg Ile Phe Phe Trp Ser Asn Val Arg Val Gl - #u Gly His Ser Met Asp # 15 - Pro Thr Leu Ala Asp Gly Glu Ile Leu Phe Va - #l Val Lys His Leu Pro # 30 - Ile Asp Arg Phe Asp Ile Val Val Ala His Gl - #u Glu Asp Gly Asn Lys # 45 - Asp Ile Val Lys Arg Val Ile Gly Met Pro Gl - #y Asp Thr Ile Arg Tyr # 60 - Glu Asn Asp Lys Leu Tyr Ile Asn Asp Lys Gl - #u Thr Asp Glu Pro Tyr #80 - Leu Ala Asp Tyr Ile Lys Arg Phe Lys Asp As - #p Lys Leu Gln Ser Thr # 95 - Tyr Ser Gly Lys Gly Phe Glu Gly Asn Lys Gl - #y Thr Phe Phe Arg Ser # 110 - Ile Ala Gln Lys Ala Gln Ala Phe Thr Val As - #p Val Asn Tyr Asn Thr # 125 - Asn Phe Ser Phe Thr Val Pro Glu Gly Glu Ty - #r Leu Leu Leu Gly Asp # 140 - Asp Arg Leu Val Ser Ser Asp Ser Arg His Va - #l Gly Thr Phe Lys Ala 145 1 - #50 1 - 
#55 1 - #60 - Lys Asp Ile Thr Gly Glu Ala Lys Phe Arg Le - #u Trp Pro Ile Thr Arg # 175 - Ile Gly Thr Phe 180 - (2) INFORMATION FOR SEQ ID NO: 187: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 400 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #187: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GGACTCTCTC AAAGATGTGA AAGCAAATGC TAGCGACAGC AAGCCTGCAC AG - #GACAAGAA 60 - GGATGCAAAA CAAGGAACGG AAGATAGTAA GGATTCAGAT AAGATGACTG AA - #ACAAACTC 120 - AGTTCCGGCA GGAGTGATTG TGGTCAGTCT ACTTGCCCTC CTAGGCGTGA TT - #GCCTTCTG 180 - GCTGATTCGC CGTAAGAAAG AGTCAGAAAT CCAGCAATTA AGCACGGAAT TG - #ATCAAGGT 240 - TCTAGGACAG CTAGATGCAG AAAAAGCGGA TAAAAAAGTC CTTGCCAAAG CC - #CAAAACCT 300 - TCTCCAAGAA ACCCTTGATT TCGTGAAAGA AGAAAATGGC TCAGCAGAGA CA - #GAAACTAA 360 # 400 AAAG CAATCCTTGA CAAACTCAAG - (2) INFORMATION FOR SEQ ID NO:188: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 133 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:188: - Asp Ser Leu Lys Asp Val Lys Ala Asn Ala Se - #r Asp Ser Lys Pro Ala # 15 - Gln Asp Lys Lys Asp Ala Lys Gln Gly Thr Gl - #u Asp Ser Lys Asp Ser # 30 - Asp Lys Met Thr Glu Thr Asn Ser Val Pro Al - #a Gly Val Ile Val Val # 45 - Ser Leu Leu Ala Leu Leu Gly Val Ile Ala Ph - #e Trp Leu Ile Arg Arg # 60 - Lys Lys Glu Ser Glu Ile Gln Gln Leu Ser Th - #r Glu Leu Ile Lys Val #80 - Leu Gly Gln Leu Asp Ala Glu Lys Ala Asp Ly - #s Lys Val Leu Ala Lys # 95 - Ala Gln Asn Leu Leu Gln Glu Thr Leu Asp Ph - #e Val Lys Glu Glu Asn # 110 - Gly Ser Ala Glu Thr Glu Thr Lys Leu Val Gl - #u Glu Leu Lys Ala Ile # 125 - Leu Asp Lys Leu Lys 130 - (2) INFORMATION FOR SEQ ID NO: 189: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1201 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #189: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CAAGAAATCC TATCATCTCT TCCAGAAGCA AACAGAGACG AGGGGAATTC AG - #ACTCAGTT 60 - GATTGAAGAA TCGCTTAGTC AGCAGACTAT 
AATCCAGTCC TTCAATGCTC AA - #ACAGAATT 120 - TATCCAAAGA TTGCGTGAGG CTCATGACAA CTACTCAGGC TATTCTCAGT CA - #GCCATCTT 180 - TTATTCTTCA ACGGTCAATC CTTCGACTCG CTTTGTAAAT GCACTCATTT AT - #GCCCTTTT 240 - AGCTGGAGTA GGAGCTTATC GTATCATGAT GGGTTCAGCC TTGACCGTCG GT - #CGTTTAGT 300 - GACTTTTTTG AACTATGTTC AGCAATACAC CAAGCCCTTT AACGATATTT CT - #TCAGTGCT 360 - AGCTGAGTTG CAAAGTGCTC TGGCTTGCGT AGAGCGTATC TATGGAGTCT TA - #GATAGCCC 420 - TGAAGTGGCT GAAACAGGTA AGGAAGTCTT GACGACCAGT GACCAAGTTA AG - #GGAGCTAT 480 - TTCCTTTAAA CATGTCTCTT TTGGCTACCA TCCTGAAAAA ATTTTGATTA AG - #GACTTGTC 540 - TATCGATATT CCAGCTGGTA GTAAGGTAGC CATCGTTGGT CCGACAGGTG CT - #GGAAAATC 600 - AACTCTTATC AATCTCCTTA TGCGTTTTTA TCCCATTAGC TCGGGAGATA TC - #TTGCTGGA 660 - TGGGCAATCC ATTTATGATT ATACACGAGT ATCATTGAGA CAGCAGTTTG GT - #ATGGTGCT 720 - TCAAGAAACC TGGCTCACAC AAGGGACCAT TCATGATAAT ATTGCCTTTG GC - #AATCCTGA 780 - AGCCAGTCGA GAGCAAGTAA TTGCTGCTGC CAAAGCAGCT AATGCAGACT TT - #TTCATCCA 840 - ACAGTTGCCA CAGGGATACG ATACCAAGTT GGAAAATGCT GGAGAATCTC TC - #TCTGTCGG 900 - CCAAGCTCAG CTCTTGACCA TAGCCCGAGT CTTTCTGGCT ATTCCAAAGA TT - #CTTATCTT 960 - AGACGAGGCA ACTTCTTCCA TTGATACACG GACAGAAGTG CTGGTACAGG AT - #GCCTTTGC 1020 - AAAACTCATG AAGGGCCGCA CAAGTTTCAT CATTGCTCAC CGTTTGTCAA CC - #ATTCAGGA 1080 - TGCGGATTTA ATTCTTGTCT TAGTAGATGG TGATATTGTT GAATATGGTA AC - #CATCAAGA 1140 - ACTCATGGAT AGAAAGGGTA AGTATTACCA AATGCAAAAA GCTGCGGCTT TT - #AGTTCTGA 1200 # 1201 - (2) INFORMATION FOR SEQ ID NO:190: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 400 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:190: - Lys Lys Ser Tyr His Leu Phe Gln Lys Gln Th - #r Glu Thr Arg Gly Ile # 15 - Gln Thr Gln Leu Ile Glu Glu Ser Leu Ser Gl - #n Gln Thr Ile Ile Gln # 30 - Ser Phe Asn Ala Gln Thr Glu Phe Ile Gln Ar - #g Leu Arg Glu Ala His # 45 - Asp Asn Tyr Ser Gly Tyr Ser Gln Ser Ala Il - #e Phe Tyr Ser Ser Thr # 60 - Val Asn Pro Ser Thr Arg Phe Val Asn 
Ala Le - #u Ile Tyr Ala Leu Leu #80 - Ala Gly Val Gly Ala Tyr Arg Ile Met Met Gl - #y Ser Ala Leu Thr Val # 95 - Gly Arg Leu Val Thr Phe Leu Asn Tyr Val Gl - #n Gln Tyr Thr Lys Pro # 110 - Phe Asn Asp Ile Ser Ser Val Leu Ala Glu Le - #u Gln Ser Ala Leu Ala # 125 - Cys Val Glu Arg Ile Tyr Gly Val Leu Asp Se - #r Pro Glu Val Ala Glu # 140 - Thr Gly Lys Glu Val Leu Thr Thr Ser Asp Gl - #n Val Lys Gly Ala Ile 145 1 - #50 1 - #55 1 - #60 - Ser Phe Lys His Val Ser Phe Gly Tyr His Pr - #o Glu Lys Ile Leu Ile # 175 - Lys Asp Leu Ser Ile Asp Ile Pro Ala Gly Se - #r Lys Val Ala Ile Val # 190 - Gly Pro Thr Gly Ala Gly Lys Ser Thr Leu Il - #e Asn Leu Leu Met Arg # 205 - Phe Tyr Pro Ile Ser Ser Gly Asp Ile Leu Le - #u Asp Gly Gln Ser Ile # 220 - Tyr Asp Tyr Thr Arg Val Ser Leu Arg Gln Gl - #n Phe Gly Met Val Leu 225 2 - #30 2 - #35 2 - #40 - Gln Glu Thr Trp Leu Thr Gln Gly Thr Ile Hi - #s Asp Asn Ile Ala Phe # 255 - Gly Asn Pro Glu Ala Ser Arg Glu Gln Val Il - #e Ala Ala Ala Lys Ala
# 270 - Ala Asn Ala Asp Phe Phe Ile Gln Gln Leu Pr - #o Gln Gly Tyr Asp Thr # 285 - Lys Leu Glu Asn Ala Gly Glu Ser Leu Ser Va - #l Gly Gln Ala Gln Leu # 300 - Leu Thr Ile Ala Arg Val Phe Leu Ala Ile Pr - #o Lys Ile Leu Ile Leu 305 3 - #10 3 - #15 3 - #20 - Asp Glu Ala Thr Ser Ser Ile Asp Thr Arg Th - #r Glu Val Leu Val Gln # 335 - Asp Ala Phe Ala Lys Leu Met Lys Gly Arg Th - #r Ser Phe Ile Ile Ala # 350 - His Arg Leu Ser Thr Ile Gln Asp Ala Asp Le - #u Ile Leu Val Leu Val # 365 - Asp Gly Asp Ile Val Glu Tyr Gly Asn His Gl - #n Glu Leu Met Asp Arg # 380 - Lys Gly Lys Tyr Tyr Gln Met Gln Lys Ala Al - #a Ala Phe Ser Ser Glu 385 3 - #90 3 - #95 4 - #00 - (2) INFORMATION FOR SEQ ID NO: 191: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1033 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #191: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - ACGAAATGCA GGGCAGACAG ATGCCTCGCA AATTGAAAAG GCGGCAGTTA GC - #CAAGGAGG 60 - AAAAGCAGTG AAAAAAACAG AAATTAGTAA AGACGCAGAC TTGCACGAAA TT - #TATCTAGC 120 - TGGAGGTTGT TTCTGGGGAG TGGAGGAATA TTTCTCACGT GTTCCCGGGG TG - #ACGGATGC 180 - CGTTTCAGGC TATGCAAATG GTAGAGGAGA AACAACCAAG TACGAATTGA TT - #AACCAAAC 240 - AGGTCATGCA GAAACCGTCC ATGTCACCTA TGATGCCAAG CAAATTTCTC TC - #AAGGAAAT 300 - CCTGCTTCAC TATTTCCGCA TTATCAATCC AACCAGCAAA AATAAACAAG GA - #AATGATGT 360 - GGGGACCCAG TACCGTACTG GTGTTTATTA CACAGATGAC AAGGATTTGG AA - #GTGATTAA 420 - CCAAGTCTTT GATGAGGTGG CTAAGAAATA CGATCAACCT CTAGCAGTTG AA - #AAGGAAAA 480 - CTTGAAGAAT TTTGTGGTGG CTGAGGATTA CCATCAAGAC TATCTCAAGA AA - #AATCCAAA 540 - TGGCTACTGC CATATCAATG TTAATCAGGC GGCCTATCCT GTCATTGATG CC - #AGCAAATA 600 - TCCAAAACCA AGTGATGAGG AATTGAAAAA GACCCTGTCA CCTGAGGAGT AT - #GCAGTTAC 660 - CCAGGAAAAT CAAACAGAAC GAGCTTTCTC AAACCGTTAC TGGGATAAAT TT - #GAATCCGG 720 - TATCTATGTG GATATAGCAA CTGGGGAACC TCTCTTTTCA TCAAAAGACA AA - #TTTGAGTC 780 - TGGTTGTGGC TGGCCTAGTT TTACCCAACC CATCAGTCCA GATGTTGTCA CC - #TACAAGGA 840 - AGATAAGTCC TACAATATGA CGCGTATGGA AGTGCGGAGC CGAGTAGGAG AT - 
#TCTCACCT 900 - TGGGCATGTC TTTACGGATG GTCCACAGGA CAAGGGCGGC TTACGTTACT GT - #ATCAATAG 960 - CCTCTCTATC CGCTTTATTC CCAAAGACCA AATGGAAGAA AAAGGCTACG CT - #TATTTACT 1020 # 1033 - (2) INFORMATION FOR SEQ ID NO:192: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 344 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:192: - Arg Asn Ala Gly Gln Thr Asp Ala Ser Gln Il - #e Glu Lys Ala Ala Val # 15 - Ser Gln Gly Gly Lys Ala Val Lys Lys Thr Gl - #u Ile Ser Lys Asp Ala # 30 - Asp Leu His Glu Ile Tyr Leu Ala Gly Gly Cy - #s Phe Trp Gly Val Glu # 45 - Glu Tyr Phe Ser Arg Val Pro Gly Val Thr As - #p Ala Val Ser Gly Tyr # 60 - Ala Asn Gly Arg Gly Glu Thr Thr Lys Tyr Gl - #u Leu Ile Asn Gln Thr #80 - Gly His Ala Glu Thr Val His Val Thr Tyr As - #p Ala Lys Gln Ile Ser # 95 - Leu Lys Glu Ile Leu Leu His Tyr Phe Arg Il - #e Ile Asn Pro Thr Ser # 110 - Lys Asn Lys Gln Gly Asn Asp Val Gly Thr Gl - #n Tyr Arg Thr Gly Val # 125 - Tyr Tyr Thr Asp Asp Lys Asp Leu Glu Val Il - #e Asn Gln Val Phe Asp # 140 - Glu Val Ala Lys Lys Tyr Asp Gln Pro Leu Al - #a Val Glu Lys Glu Asn 145 1 - #50 1 - #55 1 - #60 - Leu Lys Asn Phe Val Val Ala Glu Asp Tyr Hi - #s Gln Asp Tyr Leu Lys # 175 - Lys Asn Pro Asn Gly Tyr Cys His Ile Asn Va - #l Asn Gln Ala Ala Tyr # 190 - Pro Val Ile Asp Ala Ser Lys Tyr Pro Lys Pr - #o Ser Asp Glu Glu Leu # 205 - Lys Lys Thr Leu Ser Pro Glu Glu Tyr Ala Va - #l Thr Gln Glu Asn Gln # 220 - Thr Glu Arg Ala Phe Ser Asn Arg Tyr Trp As - #p Lys Phe Glu Ser Gly 225 2 - #30 2 - #35 2 - #40 - Ile Tyr Val Asp Ile Ala Thr Gly Glu Pro Le - #u Phe Ser Ser Lys Asp # 255 - Lys Phe Glu Ser Gly Cys Gly Trp Pro Ser Ph - #e Thr Gln Pro Ile Ser # 270 - Pro Asp Val Val Thr Tyr Lys Glu Asp Lys Se - #r Tyr Asn Met Thr Arg # 285 - Met Glu Val Arg Ser Arg Val Gly Asp Ser Hi - #s Leu Gly His Val Phe # 300 - Thr Asp Gly Pro Gln Asp Lys Gly Gly Leu Ar - #g Tyr Cys Ile Asn Ser 305 3 - #10 3 - #15 3 - #20 - Leu 
Ser Ile Arg Phe Ile Pro Lys Asp Gln Me - #t Glu Glu Lys Gly Tyr # 335 - Ala Tyr Leu Leu Asp Tyr Val Asp 340 - (2) INFORMATION FOR SEQ ID NO: 193: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 396 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #193: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGTATAGTTT TTAGCGCTTG TTCTTCTAAT TCTGNTAAAA ATGAAGAAAA TA - #CTTCTAAA 60 - GAGCATGCGC CTGATAAAAT AGTTTTAGAT CATGCTTTCG GTCAAACTAT AT - #TAGATAAA 120 - AAACCTGAAA GAGTTGCAAC TATTGCTTGG GGAAATCATG ATGTAGCATT AG - #CTTTAGGA 180 - ATAGTTCCTG TTGGATTTTC AAAAGCAAAT TACGGTGTAA GTGCTGATAA AG - #GAGTTTTA 240 - CCATGGACAG AAGAAAAAAT CAAAGAACTA AATGGTAAAG CTAACCTATT TG - #ACGATTTG 300 - GATGGACTTA ACTTTGAAGC AATATCAAAT TCTAAACCAG ATGTTATCTT AG - #CAGGTTAT 360 # 396 AAGA TTATGACACT CTATCA - (2) INFORMATION FOR SEQ ID NO:194: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 132 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:194: - Cys Ile Val Phe Ser Ala Cys Ser Ser Asn Se - #r Xaa Lys Asn Glu Glu # 15 - Asn Thr Ser Lys Glu His Ala Pro Asp Lys Il - #e Val Leu Asp His Ala # 30 - Phe Gly Gln Thr Ile Leu Asp Lys Lys Pro Gl - #u Arg Val Ala Thr Ile # 45 - Ala Trp Gly Asn His Asp Val Ala Leu Ala Le - #u Gly Ile Val Pro Val # 60 - Gly Phe Ser Lys Ala Asn Tyr Gly Val Ser Al - #a Asp Lys Gly Val Leu #80 - Pro Trp Thr Glu Glu Lys Ile Lys Glu Leu As - #n Gly Lys Ala Asn Leu # 95 - Phe Asp Asp Leu Asp Gly Leu Asn Phe Glu Al - #a Ile Ser Asn Ser Lys # 110 - Pro Asp Val Ile Leu Ala Gly Tyr Ser Gly Il - #e Thr Lys Glu Asp Tyr # 125 - Asp Thr Leu Ser 130 - (2) INFORMATION FOR SEQ ID NO: 195: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 844 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #195: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GTGTGTCGAG CATATTCTGA AGCAAACCTA TCAAAATATA GAAATTATTT TA - #GTTGATGA 60 - CGGTTCTACG GATAATTCTG GGGAAATTTG TGATGCTTTT 
ATGATGCAAG AT - #AATCGTGT 120 - GCGAGTATTG CATCAAGAAA ATAAGGGGGG GGCAGCACAA GCTAAAAATA TG - #GGGATTAG 180 - TGTAGCTAAG GGAGAGTACA TCACGATTGT TGATTCAGAT GATATCGTAA AA - #GAAAATAT 240 - GATTGAAACT CTTTATCAGC AAGTCCAAGA AAAGGATGCA GATGTTGTTA TA - #GGGAATTA 300 - CTATAATTAT GACGAAAGTG ACGGGAATTT TTATTTTTAT GTAACAGGGC AA - #GATTTTTG 360 - CGTCGAAGAA TTAGCTATAC AAGAAATTAT GAACCGTCAA GCAGGAGATT GG - #AAATTCAA 420 - TAGCTCGGCC TTTATATTGC CGACATTTAA GTTGATTAAA AAAGAATTAT TC - #AATGAAGT 480 - TCACTTTTCA AATGGTCGCC GCTTTGATGA TGAAGCAACT ATGCATCGCT TT - #TATCTTTT 540 - AGCCTCTAAA ATCGTCTTTA TAAACGATAA TCTCTATCTG TATAGAAGAC GT - #TCAGGAAG 600 - CATCATGAGA ACGGAATTTG ATCTTTCCTG GGCAAGAGAT ATTGTTGAAG TG - #TTTTCTAA 660 - GAAAATATCG GATTGTGTCT TGGCTGGTTT GGATGTCTCC GTTCTGCGTA TT - #CGATTTGT 720 - CAATCTTTTA AAAGATTATA AGCAAACTTT AGAATACCAT CAATTAACAG AT - #ACTGAGGA 780 - ATATAAAGAT ATTTGTTTCA GATTAAAGTT GTTTTTTGAT GCAGAACAAA GA - #AATGGTAA 840 # 844 - (2) INFORMATION FOR SEQ ID NO:196: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 281 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:196: - Cys Val Glu His Ile Leu Lys Gln Thr Tyr Gl - #n Asn Ile Glu Ile Ile # 15 - Leu Val Asp Asp Gly Ser Thr Asp Asn Ser Gl - #y Glu Ile Cys Asp Ala # 30 - Phe Met Met Gln Asp Asn Arg Val Arg Val Le - #u His Gln Glu Asn Lys # 45 - Gly Gly Ala Ala Gln Ala Lys Asn Met Gly Il - #e Ser Val Ala Lys Gly # 60 - Glu Tyr Ile Thr Ile Val Asp Ser Asp Asp Il - #e Val Lys Glu Asn Met #80 - Ile Glu Thr Leu Tyr Gln Gln Val Gln Glu Ly - #s Asp Ala Asp Val Val # 95 - Ile Gly Asn Tyr Tyr Asn Tyr Asp Glu Ser As - #p Gly Asn Phe Tyr Phe # 110 - Tyr Val Thr Gly Gln Asp Phe Cys Val Glu Gl - #u Leu Ala Ile Gln Glu # 125 - Ile Met Asn Arg Gln Ala Gly Asp Trp Lys Ph - #e Asn Ser Ser Ala Phe # 140 - Ile Leu Pro Thr Phe Lys Leu Ile Lys Lys Gl - #u Leu Phe Asn Glu Val 145 1 - #50 1 - #55 1 - #60 - His Phe Ser Asn Gly Arg Arg Phe Asp 
Asp Gl - #u Ala Thr Met His Arg # 175 - Phe Tyr Leu Leu Ala Ser Lys Ile Val Phe Il - #e Asn Asp Asn Leu Tyr # 190 - Leu Tyr Arg Arg Arg Ser Gly Ser Ile Met Ar - #g Thr Glu Phe Asp Leu # 205 - Ser Trp Ala Arg Asp Ile Val Glu Val Phe Se - #r Lys Lys Ile Ser Asp # 220 - Cys Val Leu Ala Gly Leu Asp Val Ser Val Le - #u Arg Ile Arg Phe Val 225 2 - #30 2 - #35 2 - #40 - Asn Leu Leu Lys Asp Tyr Lys Gln Thr Leu Gl - #u Tyr His Gln Leu Thr # 255 - Asp Thr Glu Glu Tyr Lys Asp Ile Cys Phe Ar - #g Leu Lys Leu Phe Phe # 270 - Asp Ala Glu Gln Arg Asn Gly Lys Ser # 280 - (2) INFORMATION FOR SEQ ID NO: 197: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 811 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #197: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GTGTTTGGAT AGCATTCAGA ATCAGACGTA TCAAAATTTT GAGTGTTTAT TA - #ATCAATGA 60 - TGGCTCTCCA GATCATTCAT CCAAAATATG TGAAGAATTT GTAGAGAAAG AT - #TCTCGTTT 120
- CAAATATTTT GAGAAAGCAA ACGGCGGTCT TTCATCAGCT CGTAACCTAG GT - #ATTGAATG 180 - TTCGGGGGGG GCGTACATTA CTTTTGTAGA CTCTGATGAT TGGTTGGAAC AT - #GATGCTTT 240 - AGACCGATTA TATGGTGCTT TGAAAAAGGA AAACGCAGAT ATTAGTATCG GG - #CGTTATAA 300 - TTCTTATGAT GAAACACGCT ATGTGTATAT GACTTATGTT ACGGATCCAG AT - #GATTCTCT 360 - AGAAGTGATA GAAGGTAAAG CAATTATGGA TAGGGAAGGT GTCGAAGAAG TC - #AGAAATGG 420 - GAACTGGACT GTAGCTGTCT TGAAGTTATT CAAGAGAGAG TTACTACAAG AT - #TTACCATT 480 - TCCTATAGGA AAAATTGCAG AGGATACTTA CTGGACATGG AAGGTACTTC TA - #AGAGCTTC 540 - GAGGATAGTC TATTTGAATC GTTGTGTTTA CTGGTACCGT GTTGGTTTAT CT - #GATACTTT 600 - ATCGAATACA TGGAGTGAAA AGCGTATGTA TGATGAAATT GGGGCTAGGG AA - #GAAAAGAT 660 - AGCTATTTTA GCAAGTTCAG ACTATGACTT GACCAATCAT ATTTTGATTT AT - #AAAAATAG 720 - ATTACAAAGA GTGATAGCAA AATTAGAAGA ACAAAATATG CAGTTCACAG AG - #ATTTACAG 780 # 811 TTGT CTTTACTTCC G - (2) INFORMATION FOR SEQ ID NO:198: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 270 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:198: - Cys Leu Asp Ser Ile Gln Asn Gln Thr Tyr Gl - #n Asn Phe Glu Cys Leu # 15 - Leu Ile Asn Asp Gly Ser Pro Asp His Ser Se - #r Lys Ile Cys Glu Glu # 30 - Phe Val Glu Lys Asp Ser Arg Phe Lys Tyr Ph - #e Glu Lys Ala Asn Gly # 45 - Gly Leu Ser Ser Ala Arg Asn Leu Gly Ile Gl - #u Cys Ser Gly Gly Ala # 60 - Tyr Ile Thr Phe Val Asp Ser Asp Asp Trp Le - #u Glu His Asp Ala Leu #80 - Asp Arg Leu Tyr Gly Ala Leu Lys Lys Glu As - #n Ala Asp Ile Ser Ile # 95 - Gly Arg Tyr Asn Ser Tyr Asp Glu Thr Arg Ty - #r Val Tyr Met Thr Tyr # 110 - Val Thr Asp Pro Asp Asp Ser Leu Glu Val Il - #e Glu Gly Lys Ala Ile # 125 - Met Asp Arg Glu Gly Val Glu Glu Val Arg As - #n Gly Asn Trp Thr Val # 140 - Ala Val Leu Lys Leu Phe Lys Arg Glu Leu Le - #u Gln Asp Leu Pro Phe 145 1 - #50 1 - #55 1 - #60 - Pro Ile Gly Lys Ile Ala Glu Asp Thr Tyr Tr - #p Thr Trp Lys Val Leu # 175 - Leu Arg Ala Ser Arg Ile Val Tyr Leu Asn Ar - #g 
Cys Val Tyr Trp Tyr # 190 - Arg Val Gly Leu Ser Asp Thr Leu Ser Asn Th - #r Trp Ser Glu Lys Arg # 205 - Met Tyr Asp Glu Ile Gly Ala Arg Glu Glu Ly - #s Ile Ala Ile Leu Ala # 220 - Ser Ser Asp Tyr Asp Leu Thr Asn His Ile Le - #u Ile Tyr Lys Asn Arg 225 2 - #30 2 - #35 2 - #40 - Leu Gln Arg Val Ile Ala Lys Leu Glu Glu Gl - #n Asn Met Gln Phe Thr # 255 - Glu Ile Tyr Arg Arg Met Met Glu Lys Leu Se - #r Leu Leu Pro # 270
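
The protein entries in this listing are printed as three-letter residue codes, interleaved with position markers (for example "# 15" or "145 1 - #50") and with tokens split by the text extraction (for example "Ar - #g" for Arg). The following is a minimal sketch, not part of the patent, of how such rows might be turned back into a plain one-letter sequence so that the count can be compared with the declared LENGTH. The function names are hypothetical, and the sketch assumes that every split token follows the "xx - #y" pattern seen in this text.

```python
import re

# Standard three-letter to one-letter amino acid codes.
# "Xaa" marks an undetermined residue and is mapped to "X".
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def rejoin_split_tokens(text):
    """Undo the extraction split, e.g. 'Ar - #g' -> 'Arg', '1 - #50' -> '150'."""
    return re.sub(r"(\w)\s+-\s+#(\w)", r"\1\2", text)


def protein_from_listing(text):
    """Return the one-letter sequence found in a damaged listing fragment.

    Position markers, row separators and any other token that is not a
    three-letter residue code are ignored.
    """
    tokens = rejoin_split_tokens(text).split()
    return "".join(THREE_TO_ONE[t] for t in tokens if t in THREE_TO_ONE)


if __name__ == "__main__":
    # First row of SEQ ID NO:180 as it appears in this text.
    row = ("Trp Met Gly Phe Asn Tyr Leu Arg Ile Arg Ar - #g "
           "Ala Ala Lys Ile Val # 15")
    seq = protein_from_listing(row)
    print(seq, len(seq))
```

Applied to a complete entry, the length of the returned string can be checked against the "(A) LENGTH" field of that entry. Rows whose residues were lost during extraction cannot be recovered this way.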

- (2) INFORMATION FOR SEQ ID NO: 199: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 2023 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #199: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GTGCCTAGAT AGTATTATTA CTCAAACATA TAAAAATATT GAGATTGTTG TC - #GTTAATGA 60 - TGGTTCTACG GATGCTTCAG GTGAAATTTG TAAAGAATTT TCAGAAATGG AT - #CACCGAAT 120 - TCTCTATATA GAACAAGAAA ATGCTGGTCT TTCTGCCGCA CGAAACACCG GT - #CTGAATAA 180 - TATGTCCGGA AATTATGTGA CCTTTGTGGA CTCGGATGAT TGGATTGAGC AA - #GATTATGT 240 - AGAAACTCTA TATAAAAAAA TAGTAGAGTA TCAGGCTGAT ATTGCAGTTG GT - #AATTATTA 300 - TTCTTTCAAC GAAAGTGAAG GAATGTTCTA CTTTCATATA TTGGGAGACT CC - #TATTATGA 360 - GAAAGTATAT GATAATGTTT CTATCTTTGA GAACTTGTAT GAAACTCAAG AA - #ATGAAGAG 420 - TTTTGCTTTG ATATCTGCTT GGGGTAAACT CTATAAGGCA AGATTGTTTG AG - #CAGTTGCG 480 - CTTTGACATA GGTAAATTAG GAGAAGATGG TTACCTCAAT CAAAAGGTAT AT - #TTATTATC 540 - AGAAAAGGTA ATTTATTTAA ATAAAAGTCT TTATGCTTAT CGGATTAGAA AA - #GGTAGTTT 600 - ATCAAGAGTT TGGACAGAAA AGTGGATGCA CGCTTTAGTT GATGCTATGT CT - #GAACGTAT 660 - TACGCTACTA GCTAATATGG GTTATCCTCT AGAGAAACAC TTGGCAGTTT AT - #CGTCAGAT 720 - GTTGGAAGTC AGTCTCGCCA ACGGTCAAGC TAGTGGTTTA TCTGACACAG CA - #ACGTATAA 780 - AGAGTTTGAA ATGAAACAAA GGCTTTTAAA TCAGCTATCG AGACAAGAGG AA - #AGTGAAAA 840 - GAAAGCCATT GTCCTCGCAG CAAACTATGG CTATGTAGAC CAAGTTTTAA CG - #ACAATCAA 900 - GTCTATTTGT TATCATAATC GTTCGATTCG TTTTTATCTG ATTCATAGCG AT - #TTTCCAAA 960 - TGAATGGATT AAGCAATTAA ATAAGCGCTT AGAGAAGTTT GACTCAGAAA TT - #ATTAATTG 1020 - TCGGGTAACT TCTGAGCAAA TTTCATGTTA TAAATCGGAT ATTAGTTACA CA - #GTCTTTTT 1080 - ACGCTATTTC ATAGCTGATT TCGTGCAAGA AGACAAGGCC CTCTACTTGG AC - #TGTGATCT 1140 - AGTTGTAACG AAAAATCTGG ATGACTTGTT TGCTACAGAC TTACAAGATT AT - #CCTTTGGC 1200 - TGCTGTTAGA GATTTTGGGG GCAGAGCTTA TTTTGGTCAA GAAATCTTTA AT - #GCCGGTGT 1260 - TCTCTTGGTA AACAATGCTT TTTGGAAAAA AGAGAATATG ACCCAAAAAT TA - #ATTGATGT 1320 - AACCAATGAA TGGCATGATA AGGTGGATCA GGCAGATCAG AGCATCTTGA AT - #ATGCTTTT 1380 - TGAACATAAA TGGTTGGAAT 
TGGACTTTGA TTATAATCAT ATTGTCATTC AT - #AAACAGTT 1440 - TGCTGATTAT CAATTGCCTG AGGGTCAGGA TTATCCTGCT ATTATTCACT AT - #CTTTCTCA 1500 - TCGGAAACCG TGGAAAGATT TGGCGGCCCA AACCTATCGT GAAGTTTGGT GG - #TACTATCA 1560 - TGGGCTTGAA TGGACAGAAT TGGGACAAAA CCATCATTTA CATCCATTAC AA - #AGATCTCA 1620 - CATCTATCCA ATAAAGGAAC CTTTCACTTG TCTAATCTAT ACTGCCTCAG AC - #CATATTGA 1680 - ACAAATTGAG ACATTGGTTC AATCCTTGCC TGATATTCAG TTTAAGATAG CA - #GCTAGAGT 1740 - AATAGTTAGT GATCGATTGG CTCAGATGAC AATTTATCCA AACGTGACTA TA - #TTTAACGG 1800 - AATTCACTAT TTGGTAGATG TCGATAATGA ATTGGTAGAA ACCAGTCAAG TA - #CTTTTAGA 1860 - TATTAATCAT GGCGAAAAGA CAGAAGAAAT TCTCGATCAA TTTGCTAATC TT - #GGCAAGCC 1920 - TATCTTATCC TTTGAAAATA CTAAAACCTA TGAAGTAGGT CAGGAGGCAT AT - #GCTGTTGA 1980 # 202 - #3AAATTGAG AGAAATAAGC AAA - (2) INFORMATION FOR SEQ ID NO:200: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 674 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:200: - Cys Leu Asp Ser Ile Ile Thr Gln Thr Tyr Ly - #s Asn Ile Glu Ile Val # 15 - Val Val Asn Asp Gly Ser Thr Asp Ala Ser Gl - #y Glu Ile Cys Lys Glu # 30 - Phe Ser Glu Met Asp His Arg Ile Leu Tyr Il - #e Glu Gln Glu Asn Ala # 45 - Gly Leu Ser Ala Ala Arg Asn Thr Gly Leu As - #n Asn Met Ser Gly Asn # 60 - Tyr Val Thr Phe Val Asp Ser Asp Asp Trp Il - #e Glu Gln Asp Tyr Val #80 - Glu Thr Leu Tyr Lys Lys Ile Val Glu Tyr Gl - #n Ala Asp Ile Ala Val # 95 - Gly Asn Tyr Tyr Ser Phe Asn Glu Ser Glu Gl - #y Met Phe Tyr Phe His # 110 - Ile Leu Gly Asp Ser Tyr Tyr Glu Lys Val Ty - #r Asp Asn Val Ser Ile # 125 - Phe Glu Asn Leu Tyr Glu Thr Gln Glu Met Ly - #s Ser Phe Ala Leu Ile # 140 - Ser Ala Trp Gly Lys Leu Tyr Lys Ala Arg Le - #u Phe Glu Gln Leu Arg 145 1 - #50 1 - #55 1 - #60 - Phe Asp Ile Gly Lys Leu Gly Glu Asp Gly Ty - #r Leu Asn Gln Lys Val # 175 - Tyr Leu Leu Ser Glu Lys Val Ile Tyr Leu As - #n Lys Ser Leu Tyr Ala # 190 - Tyr Arg Ile Arg Lys Gly Ser Leu Ser Arg Va - #l Trp 
Thr Glu Lys Trp # 205 - Met His Ala Leu Val Asp Ala Met Ser Glu Ar - #g Ile Thr Leu Leu Ala # 220 - Asn Met Gly Tyr Pro Leu Glu Lys His Leu Al - #a Val Tyr Arg Gln Met 225 2 - #30 2 - #35 2 - #40 - Leu Glu Val Ser Leu Ala Asn Gly Gln Ala Se - #r Gly Leu Ser Asp Thr # 255 - Ala Thr Tyr Lys Glu Phe Glu Met Lys Gln Ar - #g Leu Leu Asn Gln Leu # 270 - Ser Arg Gln Glu Glu Ser Glu Lys Lys Ala Il - #e Val Leu Ala Ala Asn # 285 - Tyr Gly Tyr Val Asp Gln Val Leu Thr Thr Il - #e Lys Ser Ile Cys Tyr # 300 - His Asn Arg Ser Ile Arg Phe Tyr Leu Ile Hi - #s Ser Asp Phe Pro Asn 305 3 - #10 3 - #15 3 - #20 - Glu Trp Ile Lys Gln Leu Asn Lys Arg Leu Gl - #u Lys Phe Asp Ser Glu # 335 - Ile Ile Asn Cys Arg Val Thr Ser Glu Gln Il - #e Ser Cys Tyr Lys Ser # 350 - Asp Ile Ser Tyr Thr Val Phe Leu Arg Tyr Ph - #e Ile Ala Asp Phe Val # 365 - Gln Glu Asp Lys Ala Leu Tyr Leu Asp Cys As - #p Leu Val Val Thr Lys # 380 - Asn Leu Asp Asp Leu Phe Ala Thr Asp Leu Gl - #n Asp Tyr Pro Leu Ala 385 3 - #90 3 - #95 4 - #00 - Ala Val Arg Asp Phe Gly Gly Arg Ala Tyr Ph - #e Gly Gln Glu Ile Phe # 415 - Asn Ala Gly Val Leu Leu Val Asn Asn Ala Ph - #e Trp Lys Lys Glu Asn # 430 - Met Thr Gln Lys Leu Ile Asp Val Thr Asn Gl - #u Trp His Asp Lys Val # 445 - Asp Gln Ala Asp Gln Ser Ile Leu Asn Met Le - #u Phe Glu His Lys Trp # 460 - Leu Glu Leu Asp Phe Asp Tyr Asn His Ile Va - #l Ile His Lys Gln Phe 465 4 - #70 4 - #75 4 - #80 - Ala Asp Tyr Gln Leu Pro Glu Gly Gln Asp Ty - #r Pro Ala Ile Ile His # 495 - Tyr Leu Ser His Arg Lys Pro Trp Lys Asp Le - #u Ala Ala Gln Thr Tyr # 510 - Arg Glu Val Trp Trp Tyr Tyr His Gly Leu Gl - #u Trp Thr Glu Leu Gly # 525 - Gln Asn His His Leu His Pro Leu Gln Arg Se - #r His Ile Tyr Pro Ile # 540 - Lys Glu Pro Phe Thr Cys Leu Ile Tyr Thr Al - #a Ser Asp His Ile Glu 545 5 - #50 5 - #55 5 - #60 - Gln Ile Glu Thr Leu Val Gln Ser Leu Pro As - #p Ile Gln Phe Lys Ile # 575 - Ala Ala Arg Val Ile Val Ser Asp Arg Leu Al - #a Gln Met Thr Ile Tyr # 590 - Pro Asn Val Thr Ile Phe Asn Gly Ile His 
Ty - #r Leu Val Asp Val Asp # 605 - Asn Glu Leu Val Glu Thr Ser Gln Val Leu Le - #u Asp Ile Asn His Gly # 620 - Glu Lys Thr Glu Glu Ile Leu Asp Gln Phe Al - #a Asn Leu Gly Lys Pro 625 6 - #30 6 - #35 6 - #40 - Ile Leu Ser Phe Glu Asn Thr Lys Thr Tyr Gl - #u Val Gly Gln Glu Ala # 655 - Tyr Ala Val Asp Gln Val Gln Ala Met Ile Gl - #u Lys Leu Arg Glu Ile # 670 - Ser Lys - (2) INFORMATION FOR SEQ ID NO: 201: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 910 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #201: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CATTCAGAAG CAGACCTATC AAAATCTGGA AATTATTCTT GTTGATGATG GT - #GCAACAGA 60 - TGAAAGTGGT CGCTTGTGTG ATTCAATCGC TGAACAAGAT GACAGGGTGT CA - #GTGCTTCA 120 - TAAAAAGAAC GAAGGATTGT CGCAAGCACG AAATGATGGG ATGAAGCAGG CT - #CACGGGGA 180 - TTATCTGATT TTTATTGACT CAGATGATTA TATCCATCCA GAAATGATTC AG - #AGCTTATA 240 - TGAGCAATTA GTTCAAGAAG ATGCGGATGT TTCGAGCTGT GGTGTCATGA AT - #GTCTATGC 300 - TAATGATGAA AGCCCACAGT CAGCCAATCA GGATGACTAT TTTGTCTGTG AT - #TCTCAAAC 360 - ATTTCTAAAG GAATACCTCA TAGGTGAAAA AATACCTGGG ACGATTTGCA AT - #AAGCTAAT 420 - CAAGAGACAG ATTGCAACTG CCCTATCCTT TCCTAAGGGG TTGATTTACG AA - #GATGCCTA 480 - TTACCATTTT GATTTAATCA AGTTGGCCAA GAAGTATGTG GTTAATACTA AA - #CCCTATTA 540 - TTACTATTTC CATAGAGGGG ATAGTATTAC GACCAAACCC TATGCAGAGA AG - #GATTTAGC 600 - CTATATTGAT ATCTACCAAA AGTTTTATAA TGAAGTTGTG AAAAACTATC CT - #GACTTGAA 660 - AGAGGTCGCT TTTTTCAGAT TGGCCTATGC CCACTTCTTT ATTCTGGATA AG - #ATGTTGCT 720 - AGATGATCAG TATAAACAGT TTGAAGCCTA TTCTCAGATT CATCGTTTTT TA - #AAAGGCCA 780 - TGCCTTTGCT ATTTCTAGGA ATCCAATTTT CCGTAAGGGG AGAAGAATTA GT - #GCTTTGGC 840 - CCTATTCATA AATATTTCCT TATATCGATT CTTATTACTG AAAAATATTG AA - #AAATCTAA 900 # 910 - (2) INFORMATION FOR SEQ ID NO:202: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 303 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:202: - Ile Gln Lys Gln Thr Tyr Gln Asn 
Leu Glu Il - #e Ile Leu Val Asp Asp # 15 - Gly Ala Thr Asp Glu Ser Gly Arg Leu Cys As - #p Ser Ile Ala Glu Gln # 30 - Asp Asp Arg Val Ser Val Leu His Lys Lys As - #n Glu Gly Leu Ser Gln # 45 - Ala Arg Asn Asp Gly Met Lys Gln Ala His Gl - #y Asp Tyr Leu Ile Phe # 60 - Ile Asp Ser Asp Asp Tyr Ile His Pro Glu Me - #t Ile Gln Ser Leu Tyr #80 - Glu Gln Leu Val Gln Glu Asp Ala Asp Val Se - #r Ser Cys Gly Val Met # 95 - Asn Val Tyr Ala Asn Asp Glu Ser Pro Gln Se - #r Ala Asn Gln Asp Asp # 110 - Tyr Phe Val Cys Asp Ser Gln Thr Phe Leu Ly - #s Glu Tyr Leu Ile Gly # 125 - Glu Lys Ile Pro Gly Thr Ile Cys Asn Lys Le - #u Ile Lys Arg Gln Ile # 140 - Ala Thr Ala Leu Ser Phe Pro Lys Gly Leu Il - #e Tyr Glu Asp Ala Tyr 145 1 - #50 1 - #55 1 - #60 - Tyr His Phe Asp Leu Ile Lys Leu Ala Lys Ly - #s Tyr Val Val Asn Thr # 175 - Lys Pro Tyr Tyr Tyr Tyr Phe His Arg Gly As - #p Ser Ile Thr Thr Lys # 190 - Pro Tyr Ala Glu Lys Asp Leu Ala Tyr Ile As - #p Ile Tyr Gln Lys Phe # 205 - Tyr Asn Glu Val Val Lys Asn Tyr Pro Asp Le - #u Lys Glu Val Ala Phe # 220 - Phe Arg Leu Ala Tyr Ala His Phe Phe Ile Le - #u Asp Lys Met Leu Leu 225 2 - #30 2 - #35 2 - #40 - Asp Asp Gln Tyr Lys Gln Phe Glu Ala Tyr Se - #r Gln Ile His Arg Phe # 255 - Leu Lys Gly His Ala Phe Ala Ile Ser Arg As - #n Pro Ile Phe Arg Lys
# 270 - Gly Arg Arg Ile Ser Ala Leu Ala Leu Phe Il - #e Asn Ile Ser Leu Tyr # 285 - Arg Phe Leu Leu Leu Lys Asn Ile Glu Lys Se - #r Lys Lys Leu His # 300 - (2) INFORMATION FOR SEQ ID NO: 203: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1972 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #203: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TAAGGCTGAT AATCGTGTTC AAATGAGAAC GACGATTAAT AATGAATCGC CA - #TTGTTGCT 60 - TTCTCCGTTG TATGGCAATG ATAATGGTAA CGGATTATGG TGGGGGAACA CA - #TTGAAGGG 120 - AGCATGGGAA GCTATTCCTG AAGATGTAAA GCCATATGCA GCGATTGAAC TT - #CATCCTGC 180 - AAAAGTCTGT AAACCAACAA GTTGTATTCC ACGAGATACG AAAGAATTGA GA - #GAATGGTA 240 - TGTCAAGATG TTGGAGGAAG CTCAAAGTCT AAACATTCCA GTTTTCTTGG TT - #ATTATGTC 300 - GGCTGGAGAG CGTAATACAG TTCCTCCAGA GTGGTTAGAT GAACAATTCC AA - #AAGTATAG 360 - TGTGTTAAAA GGTGTTTTAA ATATTGAGAA TTATTGGATT TACAATAACC AG - #TTAGCTCC 420 - GCATAGTGCT AAATATTTGG AAGTTTGTGC CAAATATGGA GCGCATTTTA TC - #TGGCATGA 480 - TCATGAAAAA TGGTTCTGGG AAACTATTAT GAATGATCCG ACATTCTTTG AA - #GCGAGTCA 540 - AAAATATCAT AAAAATTTGG TGTTGGCAAC TAAAAATACG CCAATAAGAG AT - #GATGCGGG 600 - TACAGATTCT ATCGTTAGTG GATTTTGGTT GAGTGGCTTA TGTGATAACT GG - #GGCTCATC 660 - AACAGATACA TGGAAATGGT GGGAAAAACA TTATACAAAC ACATTTGAAA CT - #GGAAGAGC 720 - TAGGGATATG AGATCCTATG CATCGGAACC AGAATCAATG ATTGCTATGG AA - #ATGATGAA 780 - TGTATATACT GGGGGAGGCA CAGTTTATAA TTTCGAATGT GCCGCGTATA CA - #TTTATGAC 840 - AAATGATGTA CCAACTCCAG CATTTACTAA AGGTATTATT CCTTTCTTTA GA - #CATGCTAT 900 - ACAAAATCCA GCTCCAAGTA AGGAAGAAGT TGTAAATAGA ACAAAAGCTG TA - #TTTTGGAA 960 - TGGAGAAGGT AGGATTAGTT CATTAAACGG ATTTTATCAA GGACTTTATT CG - #AATGATGA 1020 - AACAATGCCT TTATATAATA ATGGGAGATA TCATATTCTT CCTGTAATAC AT - #GAGAAAAT 1080 - TGATAAGGAA AAGATTTCAT CTATATTCCC TAATGCAAAA ATTTTGACTA AA - #AATAGTGA 1140 - GGAATTGTCT AGTAAAGTCA ACTATTTAAA CTCGCTTTAT CCAAAACTTT AT - #GAAGGAGA 1200 - TGGGTATGCT CAGCGTGTAG GTAATTCCTG GTATATTTAT AATAGTAATG CT - #AATATCAA 1260 - TAAAAATCAG CAAGTAATGT 
TGCCTATGTA TACTAATAAT ACAAAGTCGT TA - #TCGTTAGA 1320 - TTTGACGCCA CATACTTACG CTGTTGTTAA AGAAAATCCA AATAATTTAC AT - #ATTTTATT 1380 - GAATAATTAC AGGACAGATA AGACAGCTAT GTGGGCATTA TCAGGAAATT TT - #GATGCATC 1440 - AAAAAGTTGG AAGAAAGAAG AATTAGAGTT AGCGAACTGG ATAAGCAAAA AT - #TATTCCAT 1500 - CAATCCTGTA GATAATGACT TTAGGACAAC AACACTTACA TTAAAAGGGC AT - #ACTGGTCA 1560 - TAAACCTCAG ATAAATATAA GTGGCGATAA AAATCATTAT ACTTATACAG AA - #AATTGGGA 1620 - TGAGAATACC CATGTTTATA CCATTACGGT TAATCATAAT GGAATGGTAG AG - #ATGTCTAT 1680 - AAATACTGAG GGGACAGGTC CAGTCTCTTT CCCAACACCA GATAAATTTA AT - #GATGGTAA 1740 - TTTGAATATA GCATATGCAA AACCAACAAC ACAAAGTTCT GTAGATTACA AT - #GGAGACCC 1800 - TAATAGAGCT GTGGATGGTA ACAGAAATGG TAATTTTAAC TCTGGTTCGG TA - #ACACACAC 1860 - TAGGGCAGAT AATCCCTCTT GGTGGGAAGT CGATTTGAAA AAAATGGATA AA - #GTTGGGCT 1920 - TGTTAAAATT TATAATCGCA CAGATGCTGA GACTCAACGT CTATCTAATT TT - # 1972 - (2) INFORMATION FOR SEQ ID NO:204: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 657 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:204: - Lys Ala Asp Asn Arg Val Gln Met Arg Thr Th - #r Ile Asn Asn Glu Ser # 15 - Pro Leu Leu Leu Ser Pro Leu Tyr Gly Asn As - #p Asn Gly Asn Gly Leu # 30 - Trp Trp Gly Asn Thr Leu Lys Gly Ala Trp Gl - #u Ala Ile Pro Glu Asp # 45 - Val Lys Pro Tyr Ala Ala Ile Glu Leu His Pr - #o Ala Lys Val Cys Lys # 60 - Pro Thr Ser Cys Ile Pro Arg Asp Thr Lys Gl - #u Leu Arg Glu Trp Tyr #80 - Val Lys Met Leu Glu Glu Ala Gln Ser Leu As - #n Ile Pro Val Phe Leu # 95 - Val Ile Met Ser Ala Gly Glu Arg Asn Thr Va - #l Pro Pro Glu Trp Leu # 110 - Asp Glu Gln Phe Gln Lys Tyr Ser Val Leu Ly - #s Gly Val Leu Asn Ile # 125 - Glu Asn Tyr Trp Ile Tyr Asn Asn Gln Leu Al - #a Pro His Ser Ala Lys # 140 - Tyr Leu Glu Val Cys Ala Lys Tyr Gly Ala Hi - #s Phe Ile Trp His Asp 145 1 - #50 1 - #55 1 - #60 - His Glu Lys Trp Phe Trp Glu Thr Ile Met As - #n Asp Pro Thr Phe Phe # 175 - Glu Ala Ser Gln 
Lys Tyr His Lys Asn Leu Va - #l Leu Ala Thr Lys Asn # 190 - Thr Pro Ile Arg Asp Asp Ala Gly Thr Asp Se - #r Ile Val Ser Gly Phe # 205 - Trp Leu Ser Gly Leu Cys Asp Asn Trp Gly Se - #r Ser Thr Asp Thr Trp # 220 - Lys Trp Trp Glu Lys His Tyr Thr Asn Thr Ph - #e Glu Thr Gly Arg Ala 225 2 - #30 2 - #35 2 - #40 - Arg Asp Met Arg Ser Tyr Ala Ser Glu Pro Gl - #u Ser Met Ile Ala Met # 255 - Glu Met Met Asn Val Tyr Thr Gly Gly Gly Th - #r Val Tyr Asn Phe Glu # 270 - Cys Ala Ala Tyr Thr Phe Met Thr Asn Asp Va - #l Pro Thr Pro Ala Phe # 285 - Thr Lys Gly Ile Ile Pro Phe Phe Arg His Al - #a Ile Gln Asn Pro Ala # 300 - Pro Ser Lys Glu Glu Val Val Asn Arg Thr Ly - #s Ala Val Phe Trp Asn 305 3 - #10 3 - #15 3 - #20 - Gly Glu Gly Arg Ile Ser Ser Leu Asn Gly Ph - #e Tyr Gln Gly Leu Tyr # 335 - Ser Asn Asp Glu Thr Met Pro Leu Tyr Asn As - #n Gly Arg Tyr His Ile # 350 - Leu Pro Val Ile His Glu Lys Ile Asp Lys Gl - #u Lys Ile Ser Ser Ile # 365 - Phe Pro Asn Ala Lys Ile Leu Thr Lys Asn Se - #r Glu Glu Leu Ser Ser # 380 - Lys Val Asn Tyr Leu Asn Ser Leu Tyr Pro Ly - #s Leu Tyr Glu Gly Asp 385 3 - #90 3 - #95 4 - #00 - Gly Tyr Ala Gln Arg Val Gly Asn Ser Trp Ty - #r Ile Tyr Asn Ser Asn # 415 - Ala Asn Ile Asn Lys Asn Gln Gln Val Met Le - #u Pro Met Tyr Thr Asn # 430 - Asn Thr Lys Ser Leu Ser Leu Asp Leu Thr Pr - #o His Thr Tyr Ala Val # 445 - Val Lys Glu Asn Pro Asn Asn Leu His Ile Le - #u Leu Asn Asn Tyr Arg # 460 - Thr Asp Lys Thr Ala Met Trp Ala Leu Ser Gl - #y Asn Phe Asp Ala Ser 465 4 - #70 4 - #75 4 - #80 - Lys Ser Trp Lys Lys Glu Glu Leu Glu Leu Al - #a Asn Trp Ile Ser Lys # 495 - Asn Tyr Ser Ile Asn Pro Val Asp Asn Asp Ph - #e Arg Thr Thr Thr Leu # 510 - Thr Leu Lys Gly His Thr Gly His Lys Pro Gl - #n Ile Asn Ile Ser Gly # 525 - Asp Lys Asn His Tyr Thr Tyr Thr Glu Asn Tr - #p Asp Glu Asn Thr His # 540 - Val Tyr Thr Ile Thr Val Asn His Asn Gly Me - #t Val Glu Met Ser Ile 545 5 - #50 5 - #55 5 - #60 - Asn Thr Glu Gly Thr Gly Pro Val Ser Phe Pr - #o Thr Pro Asp Lys Phe # 575 - Asn 
Asp Gly Asn Leu Asn Ile Ala Tyr Ala Ly - #s Pro Thr Thr Gln Ser # 590 - Ser Val Asp Tyr Asn Gly Asp Pro Asn Arg Al - #a Val Asp Gly Asn Arg # 605 - Asn Gly Asn Phe Asn Ser Gly Ser Val Thr Hi - #s Thr Arg Ala Asp Asn # 620 - Pro Ser Trp Trp Glu Val Asp Leu Lys Lys Me - #t Asp Lys Val Gly Leu 625 6 - #30 6 - #35 6 - #40 - Val Lys Ile Tyr Asn Arg Thr Asp Ala Glu Th - #r Gln Arg Leu Ser Asn # 655 - Phe - (2) INFORMATION FOR SEQ ID NO: 205: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 811 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #205: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CTGTGGCAAT CAGTCAGCTG CTTCCAAACA GTCAGCTTCA GGAACGATTG AG - #GTGATTTC 60 - ACGAGAAAAT GGCTCTGGGA CACGGGGTGC CTTCACAGAA ATCACAGGGA TT - #CTCAAAAA 120 - AGACGGTGAT AAAAAAATTG ACAACACTGC CAAAACAGCT GTGATTCAAA AT - #AGTACAGA 180 - AGGTGTTCTC TCAGCAGTTC AAGGGAATGC TAATGCTATC GGCTACATCT CC - #TTGGGATC 240 - TTTAACGAAA TCTGTCAAGG CTTTAGAGAT TGATGGTGTC AAGGCTAGTC GA - #GACACAGT 300 - TTTAGATGGT GAATACCCTC TTCAACGTCC CTTCAACATT GTTTGGTCTT CT - #AATCTTTC 360 - CAAGCTAGGT CAAGATTTTA TCAGCTTTAT CCACTCCAAA CAAGGTCAAC AA - #GTGGTCAC 420 - AGATAATAAA TTTATTGAAG CTAAAACCGA AACCACGGAA TATACAAGCC AA - #CACTTATC 480 - AGGCAAGTTG TCTGTTGTAG GTTCCACTTC AGTATCTTCT TTAATGGAAA AA - #TTAGCAGA 540 - AGCTTATAAA AAAGAAAATC CAGAAGTTAC GATTGATATT ACCTCTAATG GG - #TCTTCAGC 600 - AGGTATTACC GCTGTTAAGG AGAAAACCGC TGATATTGGT ATGGTTTCTA GG - #GAATTAAC 660 - TCCTGAAGAA GGTAAGAGTC TCACCCATGA TGCTATTGCT TTAGACGGTA TT - #GCTGTTGT 720 - GGTCAATAAT GACAATAAGG CAAGCCAAGT CAGTATGGCT GAACTTGCAG AC - #GTTTTTAG 780 # 811 TGGG ACAAGATTAA A - (2) INFORMATION FOR SEQ ID NO:206: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 270 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: - Cys Gly Asn Gln Ser Ala Ala Ser Lys Gln Se - #r Ala Ser Gly Thr Ile # 15 - Glu Val Ile Ser Arg Glu Asn Gly Ser Gly Th - #r Arg Gly Ala 
Phe Thr # 30 - Glu Ile Thr Gly Ile Leu Lys Lys Asp Gly As - #p Lys Lys Ile Asp Asn # 45 - Thr Ala Lys Thr Ala Val Ile Gln Asn Ser Th - #r Glu Gly Val Leu Ser # 60 - Ala Val Gln Gly Asn Ala Asn Ala Ile Gly Ty - #r Ile Ser Leu Gly Ser #80 - Leu Thr Lys Ser Val Lys Ala Leu Glu Ile As - #p Gly Val Lys Ala Ser # 95 - Arg Asp Thr Val Leu Asp Gly Glu Tyr Pro Le - #u Gln Arg Pro Phe Asn # 110 - Ile Val Trp Ser Ser Asn Leu Ser Lys Leu Gl - #y Gln Asp Phe Ile Ser # 125 - Phe Ile His Ser Lys Gln Gly Gln Gln Val Va - #l Thr Asp Asn Lys Phe # 140 - Ile Glu Ala Lys Thr Glu Thr Thr Glu Tyr Th - #r Ser Gln His Leu Ser 145 1 - #50 1 - #55 1 - #60 - Gly Lys Leu Ser Val Val Gly Ser Thr Ser Va - #l Ser Ser Leu Met Glu # 175 - Lys Leu Ala Glu Ala Tyr Lys Lys Glu Asn Pr - #o Glu Val Thr Ile Asp # 190 - Ile Thr Ser Asn Gly Ser Ser Ala Gly Ile Th - #r Ala Val Lys Glu Lys # 205 - Thr Ala Asp Ile Gly Met Val Ser Arg Glu Le - #u Thr Pro Glu Glu Gly # 220 - Lys Ser Leu Thr His Asp Ala Ile Ala Leu As - #p Gly Ile Ala Val Val 225 2 - #30 2 - #35 2 - #40 - Val Asn Asn Asp Asn Lys Ala Ser Gln Val Se - #r Met Ala Glu Leu Ala # 255 - Asp Val Phe Ser Gly Lys Leu Thr Thr Trp As - #p Lys Ile Lys # 270 - (2) INFORMATION FOR SEQ ID NO: 207:
- (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 805 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #207: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TTGTCAACAA CAACATGCTA CTTCTGAGGG GACGAATCAA AGGCAAAGCA GT - #TCAGCGAA 60 - AGTTCCATGG AAAGCTTCAT ACACCAACCT AAACAACCAG GTAAGTACAG AA - #GAGGTCAA 120 - ATCTCTCTTA TCAGCTCACT TGGATCCAAA TAGTGTTGAT GCATTTTTTA AT - #CTCGTTAA 180 - TGACTATAAT ACCATTGTCG GCTCAACTGG CTTATCAGGA GATTTCACTT CC - #TTTACTCA 240 - CACCGAATAC GATGTTGAGA AAATCAGTCA TCTCTGGAAT CAAAAGAAGG GC - #GATTTTGT 300 - TGGGACCAAC TGCCGTATCA ATAGTTATTG TCTTTTGAAA AATTCAGTCA CC - #ATTCCAAA 360 - GCTTGAAAAG AATGACCAGT TGCTTTTCCT AGATAATGAT GCGATTGATA AA - #GGAAAGGT 420 - CTTTGATTCA CAAGATAAGG AAGAGTTTGA TATTCTATTT TCGAGAGTTC CA - #ACTGAGTC 480 - AACTACAGAT GTCAAGGTTC ACGCTGAAAA GATGGAAGCA TTCTTCTCAC AA - #TTTCAATT 540 - CAATGAAAAA GCTCGAATGC TGTCTGTAGT CTTGCACGAC AATTTGGATG GC - #GAGTATCT 600 - GTTTGTAGGC CACGTTGGGG TCTTAGTACC TGCTGATGAC GGTTTCTTAT TT - #GTAGAGAA 660 - ATTGACTTTC GAAGAGCCCT ACCAAGCGAT TAAATTTGCT AGTAAGGAAG AT - #TGCTACAA 720 - GTATTTGGGC ACCAAGTATG CGGATTATAC AGGCGAGGGA CTGGCTAAGC CT - #TTTATCAT 780 # 805 GTTA AACTT - (2) INFORMATION FOR SEQ ID NO:208: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 268 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:208: - Cys Gln Gln Gln His Ala Thr Ser Glu Gly Th - #r Asn Gln Arg Gln Ser # 15 - Ser Ser Ala Lys Val Pro Trp Lys Ala Ser Ty - #r Thr Asn Leu Asn Asn # 30 - Gln Val Ser Thr Glu Glu Val Lys Ser Leu Le - #u Ser Ala His Leu Asp # 45 - Pro Asn Ser Val Asp Ala Phe Phe Asn Leu Va - #l Asn Asp Tyr Asn Thr # 60 - Ile Val Gly Ser Thr Gly Leu Ser Gly Asp Ph - #e Thr Ser Phe Thr His #80 - Thr Glu Tyr Asp Val Glu Lys Ile Ser His Le - #u Trp Asn Gln Lys Lys # 95 - Gly Asp Phe Val Gly Thr Asn Cys Arg Ile As - #n Ser Tyr Cys Leu Leu # 110 - Lys Asn Ser Val Thr Ile Pro Lys Leu Glu Ly - #s Asn Asp 
Gln Leu Leu # 125 - Phe Leu Asp Asn Asp Ala Ile Asp Lys Gly Ly - #s Val Phe Asp Ser Gln # 140 - Asp Lys Glu Glu Phe Asp Ile Leu Phe Ser Ar - #g Val Pro Thr Glu Ser 145 1 - #50 1 - #55 1 - #60 - Thr Thr Asp Val Lys Val His Ala Glu Lys Me - #t Glu Ala Phe Phe Ser # 175 - Gln Phe Gln Phe Asn Glu Lys Ala Arg Met Le - #u Ser Val Val Leu His # 190 - Asp Asn Leu Asp Gly Glu Tyr Leu Phe Val Gl - #y His Val Gly Val Leu # 205 - Val Pro Ala Asp Asp Gly Phe Leu Phe Val Gl - #u Lys Leu Thr Phe Glu # 220 - Glu Pro Tyr Gln Ala Ile Lys Phe Ala Ser Ly - #s Glu Asp Cys Tyr Lys 225 2 - #30 2 - #35 2 - #40 - Tyr Leu Gly Thr Lys Tyr Ala Asp Tyr Thr Gl - #y Glu Gly Leu Ala Lys # 255 - Pro Phe Ile Met Asp Asn Asp Lys Trp Val Ly - #s Leu # 265 - (2) INFORMATION FOR SEQ ID NO: 209: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 508 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #209: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TTGTTCAGGC AAGTCCGTGA CTAGTGAACA CCAAACGAAA GATGAAATGA AG - #ACGGAGCA 60 - GACAGCTAGT AAAACAAGCG CAGCTAAAGG GAAAGAGGTG GCTGATTTTG AA - #TTGATGGG 120 - AGTAGATGGC AAGACCTACC GTTTATCTGA TTACAAGGGC AAGAAAGTCT AT - #CTCAAATT 180 - CTGGGCTTCT TGGTGTTCCA TCTGTCTGGC TAGTCTTCCA GATACGGATG AG - #ATTGCTAA 240 - AGAAGCTGGT GATGACTATG TGGTCTTGAC AGTAGTGTCA CCAGGACATA AG - #GGAGAGCA 300 - ATCTGAAGCG GACTTTAAGA ATTGGTATAA GGGATTGGAT TATAAAAATC TC - #CCAGTCCT 360 - AGTTGACCCA TCAGGCAAAC TTTTGGAAAC TTATGGTGTC CGTTCTTACC CA - #ACCCAAGC 420 - CTTTATAGAC AAAGAAGGCA AGCTGGTCAA AACACATCCA GGATTCATGG AA - #AAAGATGC 480 # 508 AAGG AATTAGCC - (2) INFORMATION FOR SEQ ID NO:210: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 169 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:210: - Cys Ser Gly Lys Ser Val Thr Ser Glu His Gl - #n Thr Lys Asp Glu Met # 15 - Lys Thr Glu Gln Thr Ala Ser Lys Thr Ser Al - #a Ala Lys Gly Lys Glu # 30 - Val Ala Asp Phe Glu Leu Met Gly Val Asp 
Gl - #y Lys Thr Tyr Arg Leu # 45 - Ser Asp Tyr Lys Gly Lys Lys Val Tyr Leu Ly - #s Phe Trp Ala Ser Trp # 60 - Cys Ser Ile Cys Leu Ala Ser Leu Pro Asp Th - #r Asp Glu Ile Ala Lys #80 - Glu Ala Gly Asp Asp Tyr Val Val Leu Thr Va - #l Val Ser Pro Gly His # 95 - Lys Gly Glu Gln Ser Glu Ala Asp Phe Lys As - #n Trp Tyr Lys Gly Leu # 110 - Asp Tyr Lys Asn Leu Pro Val Leu Val Asp Pr - #o Ser Gly Lys Leu Leu # 125 - Glu Thr Tyr Gly Val Arg Ser Tyr Pro Thr Gl - #n Ala Phe Ile Asp Lys # 140 - Glu Gly Lys Leu Val Lys Thr His Pro Gly Ph - #e Met Glu Lys Asp Ala 145 1 - #50 1 - #55 1 - #60 - Ile Leu Gln Thr Leu Lys Glu Leu Ala 165 - (2) INFORMATION FOR SEQ ID NO: 211: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 994 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #211: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CTCGCAAATT GAAAAGGCGG CAGTTAGCCA AGGAGGAAAA GCAGTGAAAA AA - #ACAGAAAT 60 - TAGTAAAGAC GCAGACTTGC ACGAAATTTA TCTAGCTGGA GGTTGTTTCT GG - #GGAGTGGA 120 - GGAATATTTC TCACGTGTTC CCGGGGTGAC GGATGCCGTT TCAGGCTATG CA - #AATGGTAG 180 - AGGAGAAACA ACCAAGTACG AATTGATTAA CCAAACAGGT CATGCAGAAA CC - #GTCCATGT 240 - CACCTATGAT GCCAAGCAAA TTTCTCTCAA GGAAATCCTG CTTCACTATT TC - #CGCATTAT 300 - CAATCCAACC AGCAAAAATA AACAAGGAAA TGATGTGGGG ACCCAGTACC GT - #ACTGGTGT 360 - TTATTACACA GATGACAAGG ATTTGGAAGT GATTAACCAA GTCTTTGATG AG - #GTGGCTAA 420 - GAAATACGAT CAACCTCTAG CAGTTGAAAA GGAAAACTTG AAGAATTTTG TG - #GTGGCTGA 480 - GGATTACCAT CAAGACTATC TCAAGAAAAA TCCAAATGGC TACTGCCATA TC - #AATGTTAA 540 - TCAGGCGGCC TATCCTGTCA TTGATGCCAG CAAATATCCA AAACCAAGTG AT - #GAGGAATT 600 - GAAAAAGACC CTGTCACCTG AGGAGTATGC AGTTACCCAG GAAAATCAAA CA - #GAACGAGC 660 - TTTCTCAAAC CGTTACTGGG ATAAATTTGA ATCCGGTATC TATGTGGATA TA - #GCAACTGG 720 - GGAACCTCTC TTTTCATCAA AAGACAAATT TGAGTCTGGT TGTGGCTGGC CT - #AGTTTTAC 780 - CCAACCCATC AGTCCAGATG TTGTCACCTA CAAGGAAGAT AAGTCCTACA AT - #ATGACGCG 840 - TATGGAAGTG CGGAGCCGAG TAGGAGATTC TCACCTTGGG CATGTCTTTA CG - #GATGGTCC 900 - ACAGGACAAG GGCGGCTTAC 
GTTACTGTAT CAATAGCCTC TCTATCCGCT TT - #ATTCCCAA 960 # 994 AAAG GTACGCTTAT TTAC - (2) INFORMATION FOR SEQ ID NO:212: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 331 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:212: - Ser Gln Ile Glu Lys Ala Ala Val Ser Gln Gl - #y Gly Lys Ala Val Lys # 15 - Lys Thr Glu Ile Ser Lys Asp Ala Asp Leu Hi - #s Glu Ile Tyr Leu Ala # 30 - Gly Gly Cys Phe Trp Gly Val Glu Glu Tyr Ph - #e Ser Arg Val Pro Gly # 45 - Val Thr Asp Ala Val Ser Gly Tyr Ala Asn Gl - #y Arg Gly Glu Thr Thr # 60 - Lys Tyr Glu Leu Ile Asn Gln Thr Gly His Al - #a Glu Thr Val His Val #80 - Thr Tyr Asp Ala Lys Gln Ile Ser Leu Lys Gl - #u Ile Leu Leu His Tyr # 95 - Phe Arg Ile Ile Asn Pro Thr Ser Lys Asn Ly - #s Gln Gly Asn Asp Val # 110 - Gly Thr Gln Tyr Arg Thr Gly Val Tyr Tyr Th - #r Asp Asp Lys Asp Leu # 125 - Glu Val Ile Asn Gln Val Phe Asp Glu Val Al - #a Lys Lys Tyr Asp Gln # 140 - Pro Leu Ala Val Glu Lys Glu Asn Leu Lys As - #n Phe Val Val Ala Glu 145 1 - #50 1 - #55 1 - #60 - Asp Tyr His Gln Asp Tyr Leu Lys Lys Asn Pr - #o Asn Gly Tyr Cys His # 175 - Ile Asn Val Asn Gln Ala Ala Tyr Pro Val Il - #e Asp Ala Ser Lys Tyr # 190 - Pro Lys Pro Ser Asp Glu Glu Leu Lys Lys Th - #r Leu Ser Pro Glu Glu # 205 - Tyr Ala Val Thr Gln Glu Asn Gln Thr Glu Ar - #g Ala Phe Ser Asn Arg # 220 - Tyr Trp Asp Lys Phe Glu Ser Gly Ile Tyr Va - #l Asp Ile Ala Thr Gly 225 2 - #30 2 - #35 2 - #40 - Glu Pro Leu Phe Ser Ser Lys Asp Lys Phe Gl - #u Ser Gly Cys Gly Trp # 255 - Pro Ser Phe Thr Gln Pro Ile Ser Pro Asp Va - #l Val Thr Tyr Lys Glu # 270 - Asp Lys Ser Tyr Asn Met Thr Arg Met Glu Va - #l Arg Ser Arg Val Gly # 285 - Asp Ser His Leu Gly His Val Phe Thr Asp Gl - #y Pro Gln Asp Lys Gly # 300 - Gly Leu Arg Tyr Cys Ile Asn Ser Leu Ser Il - #e Arg Phe Ile Pro Lys 305 3 - #10 3 - #15 3 - #20 - Asp Gln Met Glu Glu Lys Gly Thr Leu Ile Ty - #r # 330 - (2) INFORMATION FOR SEQ ID NO: 213: - (i) 
SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 625 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #213: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TTGTCAGTCA GGTTCTAATG GTTCTCAGTC TGCTGTGGAT GCTATCAAAC AA - #AAAGGGAA 60 - ATTAGTTGTG GCAACCAGTC CTGACTATGC ACCCTTTGAA TTTCAATCAT TG - #GTTGATGG 120 - AAAGAACCAG GTAGTCGGTG CAGACATCGA CATGGCTCAG GCTATCGCTG AT - #GAACTTGG 180 - GGTTAAGTTG GAAATCTCAA GCATGAGTTT TGACAATGTT TTGACCAGTC TT - #CAAACTGG 240 - TAAGGCTGAC CTAGCAGTTG CAGGAATTAG TGCTACTGAC GAGAGAAAAG AA - #GTCTTTGA 300 - TTTTTCAATC CCATACTATG AAAACAAGAT TAGTTTCTTG GTTCGTAAGG CT - #GATGTGGA 360 - AAAATACAAG GATTTAACTA GCCTAGAAAG TGCTAATATT GCAGCCCAAA AA - #GGGACTGT 420 - TCCAGAATCA ATGGTCAAGG AACAATTGCC AAAAGTTCAA TTAACTTCCC TA - #ACTAATAT 480 - GGGTGAAGCA GTCAATGAAT TGCAGGCTGG AAAAATAGAT GCTGTTCATA TG - #GATGAGCC 540 - TGTTGCACTT AGTTATGCTG CTAAAAACGC TGGCTTAGCT GTCGCAACTG TC - #AGCTTGAA 600 # 625 GCCA ATGCC - (2) INFORMATION FOR SEQ ID NO:214: - (i) SEQUENCE CHARACTERISTICS:
#acids (A) LENGTH: 208 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:214: - Cys Gln Ser Gly Ser Asn Gly Ser Gln Ser Al - #a Val Asp Ala Ile Lys # 15 - Gln Lys Gly Lys Leu Val Val Ala Thr Ser Pr - #o Asp Tyr Ala Pro Phe # 30 - Glu Phe Gln Ser Leu Val Asp Gly Lys Asn Gl - #n Val Val Gly Ala Asp # 45 - Ile Asp Met Ala Gln Ala Ile Ala Asp Glu Le - #u Gly Val Lys Leu Glu # 60 - Ile Ser Ser Met Ser Phe Asp Asn Val Leu Th - #r Ser Leu Gln Thr Gly #80 - Lys Ala Asp Leu Ala Val Ala Gly Ile Ser Al - #a Thr Asp Glu Arg Lys # 95 - Glu Val Phe Asp Phe Ser Ile Pro Tyr Tyr Gl - #u Asn Lys Ile Ser Phe # 110 - Leu Val Arg Lys Ala Asp Val Glu Lys Tyr Ly - #s Asp Leu Thr Ser Leu # 125 - Glu Ser Ala Asn Ile Ala Ala Gln Lys Gly Th - #r Val Pro Glu Ser Met # 140 - Val Lys Glu Gln Leu Pro Lys Val Gln Leu Th - #r Ser Leu Thr Asn Met 145 1 - #50 1 - #55 1 - #60 - Gly Glu Ala Val Asn Glu Leu Gln Ala Gly Ly - #s Ile Asp Ala Val His # 175 - Met Asp Glu Pro Val Ala Leu Ser Tyr Ala Al - #a Lys Asn Ala Gly Leu # 190 - Ala Val Ala Thr Val Ser Leu Lys Met Lys As - #p Gly Asp Ala Asn Ala # 205 - (2) INFORMATION FOR SEQ ID NO: 215: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 3022 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #215: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - GGAAACTTCA CAGGATTTTA AAGAGAAGAA AACAGCAGTC ATTAAGGAAA AA - #GAAGTTGT 60 - TAGTAAAAAT CCTGTGATAG ACAATAACAC TAGCAATGAA GAAGCAAAAA TC - #AAAGAAGA 120 - AAATTCCAAT AAATCCCAAG GAGATTATAC GGACTCATTT GTGAATAAAA AC - #ACAGAAAA 180 - TCCCAAAAAA GAAGATAAAG TTGTCTATAT TGCTGAATTT AAAGATAAAG AA - #TCTGGAGA 240 - AAAAGCAATC AAGGAACTAT CCAGTCTTAA GAATACAAAA GTTTTATATA CT - #TATGATAG 300 - AATTTTTAAC GGTAGTGCCA TAGAAACAAC TCCAGATAAC TTGGACAAAA TT - #AAACAAAT 360 - AGAAGGTATT TCATCGGTTG AAAGGGCACA AAAAGTCCAA CCCATGATGA AT - #CATGCCAG 420 - AAAGGAAATT GGAGTTGAGG AAGCTATTGA TTACCTAAAG TCTATCAATG CT - #CCGTTTGG 480 - 
GAAAAATTTT GATGGTAGAG GTATGGTCAT TTCAAATATC GATACTGGAA CA - #GATTATAG 540 - ACATAAGGCT ATGAGAATCG ATGATGATGC CAAAGCCTCA ATGAGATTTA AA - #AAAGAAGA 600 - CTTAAAAGGC ACTGATAAAA ATTATTGGTT GAGTGATAAA ATCCCTCATG CG - #TTCAATTA 660 - TTATAATGGT GGCAAAATCA CTGTAGAAAA ATATGATGAT GGAAGGGATT AT - #TTTGACCC 720 - ACATGGGATG CATATTGCAG GGATTCTTGC TGGAAATGAT ACTGAACAAG AC - #ATCAAAAA 780 - CTTTAACGGC ATAGATGGAA TTGCACCTAA TGCACAAATT TTCTCTTACA AA - #ATGTATTC
840 - TGACGCAGGA TCTGGGTTTG CGGGTGATGA AACAATGTTT CATGCTATTG AA - #GATTCTAT 900 - CAAACACAAC GTTGATGTTG TTTCGGTATC ATCTGGTTTT ACAGGAACAG GT - #CTTGTAGG 960 - TGAGAAATAT TGGCAAGCTA TTCGGGCATT AAGAAAAGCA GGCATTCCAA TG - #GTTGTCGC 1020 - TACGGGTAAC TATGCGACTT CTGCTTCAAG TTCTTCATGG GATTTAGTAG CA - #AATAATCA 1080 - TCTGAAAATG ACCGACACTG GAAATGTAAC ACGAACTGCA GCACATGAAG AT - #GCGATAGC 1140 - GGTCGCTTCT GCTAAAAATC AAACAGTTGA GTTTGATAAA GTTAACATAG GT - #GGAGAAAG 1200 - TTTTAAATAC AGAAATATAG GGGCCTTTTT CGATAAGAGT AAAATCACAA CA - #AATGAAGA 1260 - TGGAACAAAA GCTCCTAGTA AATTAAAATT TGTATATATA GGCAAGGGGC AA - #GACCAAGA 1320 - TTTGATAGGT TTGGATCTTA GGGGCAAAAT TGCAGTAATG GATAGAATTT AT - #ACAAAGGA 1380 - TTTAAAAAAT GCTTTTAAAA AAGCTATGGA TAAGGGTGCA CGCGCCATTA TG - #GTTGTAAA 1440 - TACTGTAAAT TACTACAATA GAGATAATTG GACAGAGCTT CCAGCTATGG GA - #TATGAAGC 1500 - GGATGAAGGT ACTAAAAGTC AAGTGTTTTC AATTTCAGGA GATGATGGTG TA - #AAGCTATG 1560 - GAACATGATT AATCCTGATA AAAAAACTGA AGTCAAAAGA AATAATAAAG AA - #GATTTTAA 1620 - AGATAAATTG GAGCAATACT ATCCAATTGA TATGGAAAGT TTTAATTCCA AC - #AAACCGAA 1680 - TGTAGGTGAC GAAAAAGAGA TTGACTTTAA GTTTGCACCT GACACAGACA AA - #GAACTCTA 1740 - TAAAGAAGAT ATCATCGTTC CAGCAGGATC TACATCTTGG GGGCCAAGAA TA - #GATTTACT 1800 - TTTAAAACCC GATGTTTCAG CACCTGGTAA AAATATTAAA TCCACGCTTA AT - #GTTATTAA 1860 - TGGCAAATCA ACTTATGGCT ATATGTCAGG AACTAGTATG GCGACTCCAA TC - #GTGGCAGC 1920 - TTCTACTGTT TTGATTAGAC CGAAATTAAA GGAAATGCTT GAAAGACCTG TA - #TTGAAAAA 1980 - TCTTAAGGGA GATGACAAAA TAGATCTTAC AAGTCTTACA AAAATTGCCC TA - #CAAAATAC 2040 - TGCGCGACCT ATGATGGATG CAACTTCTTG GAAAGAAAAA AGTCAATACT TT - #GCATCACC 2100 - TAGACAACAG GGAGCAGGCC TAATTAATGT GGCCAATGCT TTGAGAAATG AA - #GTTGTAGC 2160 - AACTTTCAAA AACACTGATT CTAAAGGTTT GGTAAACTCA TATGGTTCCA TT - #TCTCTTAA 2220 - AGAAATAAAA GGTGATAAAA AATACTTTAC AATCAAGCTT CACAATACAT CA - #AACAGACC 2280 - TTTGACTTTT AAAGTTTCAG CATCAGCGAT AACTACAGAT TCTCTAACTG AC - #AGATTAAA 2340 - ACTTGATGAA ACATATAAAG ATGAAAAATC TCCAGATGGT AAGCAAATTG TT - #CCAGAAAT 
2400 - TCACCCAGAA AAAGTCAAAG GAGCAAATAT CACATTTGAG CATGATACTT TC - #ACTATAGG 2460 - CGCAAATTCT AGCTTTGATT TGAATGCGGT TATAAATGTT GGAGAGGCCA AA - #AACAAAAA 2520 - TAAATTTGTA GAATCATTTA TTCATTTTGA GTCAGTGGAA GCGATGGAAG CT - #CTAAACTC 2580 - CAGCGGGAAG AAAATAAACT TCCAACCTTC TTTGTCGATG CCTCTAATGG GA - #TTTGCTGG 2640 - GAATTGGAAC CACGAACCAA TCCTTGATAA ATGGGCTTGG GAAGAAGGGT CA - #AGATCAAA 2700 - AACACTGGGA GGTTATGATG ATGATGGTAA ACCGAAAATT CCAGGAACCT TA - #AATAAGGG 2760 - AATTGGTGGA GAACATGGTA TAGATAAATT TAATCCAGCA GGAGTTATAC AA - #AATAGAAA 2820 - AGATAAAAAT ACAACATCCC TGGATCAAAA TCCAGAATTA TTTGCTTTCA AT - #AACGAAGG 2880 - GATCAACGCT CCATCATCAA GTGGTTCTAA GATTGCTAAC ATTTATCCTT TA - #GATTCAAA 2940 - TGGAAATCCT CAAGATGCTC AACTTGAAAG AGGATTAACA CCTTCTCCAC TT - #GTATTAAG 3000 # 3022TGA TT - (2) INFORMATION FOR SEQ ID NO:216: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 1007 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:216: - Glu Thr Ser Gln Asp Phe Lys Glu Lys Lys Th - #r Ala Val Ile Lys Glu # 15 - Lys Glu Val Val Ser Lys Asn Pro Val Ile As - #p Asn Asn Thr Ser Asn # 30 - Glu Glu Ala Lys Ile Lys Glu Glu Asn Ser As - #n Lys Ser Gln Gly Asp # 45 - Tyr Thr Asp Ser Phe Val Asn Lys Asn Thr Gl - #u Asn Pro Lys Lys Glu # 60 - Asp Lys Val Val Tyr Ile Ala Glu Phe Lys As - #p Lys Glu Ser Gly Glu #80 - Lys Ala Ile Lys Glu Leu Ser Ser Leu Lys As - #n Thr Lys Val Leu Tyr # 95 - Thr Tyr Asp Arg Ile Phe Asn Gly Ser Ala Il - #e Glu Thr Thr Pro Asp # 110 - Asn Leu Asp Lys Ile Lys Gln Ile Glu Gly Il - #e Ser Ser Val Glu Arg # 125 - Ala Gln Lys Val Gln Pro Met Met Asn His Al - #a Arg Lys Glu Ile Gly # 140 - Val Glu Glu Ala Ile Asp Tyr Leu Lys Ser Il - #e Asn Ala Pro Phe Gly 145 1 - #50 1 - #55 1 - #60 - Lys Asn Phe Asp Gly Arg Gly Met Val Ile Se - #r Asn Ile Asp Thr Gly # 175 - Thr Asp Tyr Arg His Lys Ala Met Arg Ile As - #p Asp Asp Ala Lys Ala # 190 - Ser Met Arg Phe Lys Lys Glu Asp Leu Lys Gl 
- #y Thr Asp Lys Asn Tyr # 205 - Trp Leu Ser Asp Lys Ile Pro His Ala Phe As - #n Tyr Tyr Asn Gly Gly # 220 - Lys Ile Thr Val Glu Lys Tyr Asp Asp Gly Ar - #g Asp Tyr Phe Asp Pro 225 2 - #30 2 - #35 2 - #40 - His Gly Met His Ile Ala Gly Ile Leu Ala Gl - #y Asn Asp Thr Glu Gln # 255 - Asp Ile Lys Asn Phe Asn Gly Ile Asp Gly Il - #e Ala Pro Asn Ala Gln # 270 - Ile Phe Ser Tyr Lys Met Tyr Ser Asp Ala Gl - #y Ser Gly Phe Ala Gly # 285 - Asp Glu Thr Met Phe His Ala Ile Glu Asp Se - #r Ile Lys His Asn Val # 300 - Asp Val Val Ser Val Ser Ser Gly Phe Thr Gl - #y Thr Gly Leu Val Gly 305 3 - #10 3 - #15 3 - #20 - Glu Lys Tyr Trp Gln Ala Ile Arg Ala Leu Ar - #g Lys Ala Gly Ile Pro # 335 - Met Val Val Ala Thr Gly Asn Tyr Ala Thr Se - #r Ala Ser Ser Ser Ser # 350 - Trp Asp Leu Val Ala Asn Asn His Leu Lys Me - #t Thr Asp Thr Gly Asn # 365 - Val Thr Arg Thr Ala Ala His Glu Asp Ala Il - #e Ala Val Ala Ser Ala # 380 - Lys Asn Gln Thr Val Glu Phe Asp Lys Val As - #n Ile Gly Gly Glu Ser 385 3 - #90 3 - #95 4 - #00 - Phe Lys Tyr Arg Asn Ile Gly Ala Phe Phe As - #p Lys Ser Lys Ile Thr # 415 - Thr Asn Glu Asp Gly Thr Lys Ala Pro Ser Ly - #s Leu Lys Phe Val Tyr # 430 - Ile Gly Lys Gly Gln Asp Gln Asp Leu Ile Gl - #y Leu Asp Leu Arg Gly # 445 - Lys Ile Ala Val Met Asp Arg Ile Tyr Thr Ly - #s Asp Leu Lys Asn Ala # 460 - Phe Lys Lys Ala Met Asp Lys Gly Ala Arg Al - #a Ile Met Val Val Asn 465 4 - #70 4 - #75 4 - #80 - Thr Val Asn Tyr Tyr Asn Arg Asp Asn Trp Th - #r Glu Leu Pro Ala Met # 495 - Gly Tyr Glu Ala Asp Glu Gly Thr Lys Ser Gl - #n Val Phe Ser Ile Ser # 510 - Gly Asp Asp Gly Val Lys Leu Trp Asn Met Il - #e Asn Pro Asp Lys Lys # 525 - Thr Glu Val Lys Arg Asn Asn Lys Glu Asp Ph - #e Lys Asp Lys Leu Glu # 540 - Gln Tyr Tyr Pro Ile Asp Met Glu Ser Phe As - #n Ser Asn Lys Pro Asn 545 5 - #50 5 - #55 5 - #60 - Val Gly Asp Glu Lys Glu Ile Asp Phe Lys Ph - #e Ala Pro Asp Thr Asp # 575 - Lys Glu Leu Tyr Lys Glu Asp Ile Ile Val Pr - #o Ala Gly Ser Thr Ser # 590 - Trp Gly Pro Arg Ile Asp Leu Leu 
Leu Lys Pr - #o Asp Val Ser Ala Pro # 605 - Gly Lys Asn Ile Lys Ser Thr Leu Asn Val Il - #e Asn Gly Lys Ser Thr # 620 - Tyr Gly Tyr Met Ser Gly Thr Ser Met Ala Th - #r Pro Ile Val Ala Ala 625 6 - #30 6 - #35 6 - #40 - Ser Thr Val Leu Ile Arg Pro Lys Leu Lys Gl - #u Met Leu Glu Arg Pro # 655 - Val Leu Lys Asn Leu Lys Gly Asp Asp Lys Il - #e Asp Leu Thr Ser Leu # 670 - Thr Lys Ile Ala Leu Gln Asn Thr Ala Arg Pr - #o Met Met Asp Ala Thr # 685 - Ser Trp Lys Glu Lys Ser Gln Tyr Phe Ala Se - #r Pro Arg Gln Gln Gly # 700 - Ala Gly Leu Ile Asn Val Ala Asn Ala Leu Ar - #g Asn Glu Val Val Ala 705 7 - #10 7 - #15 7 - #20 - Thr Phe Lys Asn Thr Asp Ser Lys Gly Leu Va - #l Asn Ser Tyr Gly Ser # 735 - Ile Ser Leu Lys Glu Ile Lys Gly Asp Lys Ly - #s Tyr Phe Thr Ile Lys # 750 - Leu His Asn Thr Ser Asn Arg Pro Leu Thr Ph - #e Lys Val Ser Ala Ser # 765 - Ala Ile Thr Thr Asp Ser Leu Thr Asp Arg Le - #u Lys Leu Asp Glu Thr # 780 - Tyr Lys Asp Glu Lys Ser Pro Asp Gly Lys Gl - #n Ile Val Pro Glu Ile 785 7 - #90 7 - #95 8 - #00 - His Pro Glu Lys Val Lys Gly Ala Asn Ile Th - #r Phe Glu His Asp Thr # 815 - Phe Thr Ile Gly Ala Asn Ser Ser Phe Asp Le - #u Asn Ala Val Ile Asn # 830 - Val Gly Glu Ala Lys Asn Lys Asn Lys Phe Va - #l Glu Ser Phe Ile His # 845 - Phe Glu Ser Val Glu Ala Met Glu Ala Leu As - #n Ser Ser Gly Lys Lys # 860 - Ile Asn Phe Gln Pro Ser Leu Ser Met Pro Le - #u Met Gly Phe Ala Gly 865 8 - #70 8 - #75 8 - #80 - Asn Trp Asn His Glu Pro Ile Leu Asp Lys Tr - #p Ala Trp Glu Glu Gly # 895 - Ser Arg Ser Lys Thr Leu Gly Gly Tyr Asp As - #p Asp Gly Lys Pro Lys # 910 - Ile Pro Gly Thr Leu Asn Lys Gly Ile Gly Gl - #y Glu His Gly Ile Asp # 925 - Lys Phe Asn Pro Ala Gly Val Ile Gln Asn Ar - #g Lys Asp Lys Asn Thr # 940 - Thr Ser Leu Asp Gln Asn Pro Glu Leu Phe Al - #a Phe Asn Asn Glu Gly 945 9 - #50 9 - #55 9 - #60 - Ile Asn Ala Pro Ser Ser Ser Gly Ser Lys Il - #e Ala Asn Ile Tyr Pro # 975 - Leu Asp Ser Asn Gly Asn Pro Gln Asp Ala Gl - #n Leu Glu Arg Gly Leu # 990 - Thr Pro Ser Pro Leu 
Val Leu Arg Ser Ala Gl - #u Glu Gly Leu Ile # 10050 - (2) INFORMATION FOR SEQ ID NO: 217: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1696 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #217: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TGTGGTCGAA GTTGAGACTC CTCAATCAAT AACAAATCAG GAGCAAGCTA GG - #ACAGAAAA 60 - CCAAGTAGTA GAGACAGAGG AAGCTCCAAA AGAAGAAGCA CCTAAAACAG AA - #GAAAGTCC 120 - AAAGGAAGAA CCAAAATCGG AGGTAAAACC TACTGACGAC ACCCTTCCTA AA - #GTAGAAGA 180 - GGGGAAAGAA GATTCAGCAG AACCAGCTCC AGTTGAAGAA GTAGGTGGAG AA - #GTTGAGTC 240 - AAAACCAGAG GAAAAAGTAG CAGTTAAGCC AGAAAGTCAA CCATCAGACA AA - #CCAGCTGA 300 - GGAATCAAAA GTTGAACAAG CAGGTGAACC AGTCGCGCCA AGAGAAGACG AA - #AAGGCACC 360 - AGTCGAGCCA GAAAAGCAAC CAGAAGCTCC TGAAGAAGAG AAGGCTGTAG AG - #GAAACACC 420 - GAAACAAGAA GAGTCAACTC CAGATACCAA GGCTGAAGAA ACTGTAGAAC CA - #AAAGAGGA 480 - GACTGTTAAT CAATCTATTG AACAACCAAA AGTTGAAACG CCTGCTGTAG AA - #AAACAAAC 540 - AGAACCAACA GAGGAACCAA AAGTTGAACA AGCAGGTGAA CCAGTCGCGC CA - #AGAGAAGA 600 - CGAACAGGCA CCAACGGCAC CAGTTGAGCC AGAAAAGCAA CCAGAAGTTC CT - #GAAGAAGA 660 - GAAGGCTGTA GAGGAAACAC CGAAACCAGA AGATAAAATA AAGGGTATTG GT - #ACTAAAGA 720 - ACCAGTTGAT AAAAGTGAGT TAAATAATCA AATTGATAAA GCTAGTTCAG TT - #TCTCCTAC 780 - TGATTATTCT ACAGCAAGTT ACAATGCTCT TGGACCTGTT TTAGAAACTG CA - #AAAGGTGT 840 - CTATGCTTCA GAGCCTGTAA AACAGCCTGA GGTAAATAGC GAGACAAATA AA - #CTTAAAAC
900 - GGCTATTGAC GCTCTAAACG TTGATAAAAC TGAATTAAAC AATACGATTG CA - #GATGCAAA 960 - AACAAAGGTA AAAGAACATT ACAGTGATAG AAGTTGGCAA AACCTCCAAA CT - #GAAGTTAC 1020 - AAAGGCTGAA AAAGTTGCAG CTAATACAGA TGCTAAACAA AGTGAAGTTA AC - #GAAGCTGT 1080 - TGAAAAATTA ACTGCAACTA TTGAAAAATT GGTTGAATTA TCTGAAAAGC CA - #ATATTAAC 1140 - ATTGACTAGT ACCGATAAGA AAATATTGGA ACGTGAAGCT GTTGCTAAGT AT - #ACTCTAGA 1200 - AAATCAAAAC AAAACAAAAA TCAAATCAAT CACAGCTGAA TTGAAAAAAG GA - #GAAGAAGT 1260 - TATTAATACT GTAGTCCTTA CAGATGACAA GGTAACAACA GAAACTATAA GC - #GCTGCATT 1320 - TAAGAACCTA GAGTACTACA AAGAATACAC CCTATCTACA ACTATGATTT AC - #GACAGAGG 1380 - TAACGGTGAA GAAACTGAAA CTCTAGAAAA TCAAAATATT CAATTAGATC TT - #AAAAAAGT 1440 - TGAGCTTAAA AATATTAAAC GTACAGATTT AATCAAATAC GAAAATGGAA AA - #GAAACTAA 1500 - TGAATCACTG ATAACAACTA TTCCTGATGA TAAGAGCAAT TATTATTTAA AA - #ATAACTTC 1560 - AAATAATCAG AAAACTACAT TACTAGCTGT TAAAAATATA GAAGAAACTA CG - #GTTAACGG 1620 - AACACCTGTA TATAAAGTTA CAGCAATCGC AGACAATTTA GTCTCTAGAA CT - #GCTGATAA 1680 # 1696 - (2) INFORMATION FOR SEQ ID NO:218: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 565 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:218: - Val Val Glu Val Glu Thr Pro Gln Ser Ile Th - #r Asn Gln Glu Gln Ala # 15 - Arg Thr Glu Asn Gln Val Val Glu Thr Glu Gl - #u Ala Pro Lys Glu Glu # 30 - Ala Pro Lys Thr Glu Glu Ser Pro Lys Glu Gl - #u Pro Lys Ser Glu Val # 45 - Lys Pro Thr Asp Asp Thr Leu Pro Lys Val Gl - #u Glu Gly Lys Glu Asp # 60 - Ser Ala Glu Pro Ala Pro Val Glu Glu Val Gl - #y Gly Glu Val Glu Ser #80 - Lys Pro Glu Glu Lys Val Ala Val Lys Pro Gl - #u Ser Gln Pro Ser Asp # 95 - Lys Pro Ala Glu Glu Ser Lys Val Glu Gln Al - #a Gly Glu Pro Val Ala # 110 - Pro Arg Glu Asp Glu Lys Ala Pro Val Glu Pr - #o Glu Lys Gln Pro Glu # 125 - Ala Pro Glu Glu Glu Lys Ala Val Glu Glu Th - #r Pro Lys Gln Glu Glu # 140 - Ser Thr Pro Asp Thr Lys Ala Glu Glu Thr Va - #l Glu Pro Lys Glu Glu 
145 1 - #50 1 - #55 1 - #60 - Thr Val Asn Gln Ser Ile Glu Gln Pro Lys Va - #l Glu Thr Pro Ala Val # 175 - Glu Lys Gln Thr Glu Pro Thr Glu Glu Pro Ly - #s Val Glu Gln Ala Gly # 190 - Glu Pro Val Ala Pro Arg Glu Asp Glu Gln Al - #a Pro Thr Ala Pro Val # 205 - Glu Pro Glu Lys Gln Pro Glu Val Pro Glu Gl - #u Glu Lys Ala Val Glu # 220 - Glu Thr Pro Lys Pro Glu Asp Lys Ile Lys Gl - #y Ile Gly Thr Lys Glu 225 2 - #30 2 - #35 2 - #40 - Pro Val Asp Lys Ser Glu Leu Asn Asn Gln Il - #e Asp Lys Ala Ser Ser # 255 - Val Ser Pro Thr Asp Tyr Ser Thr Ala Ser Ty - #r Asn Ala Leu Gly Pro # 270 - Val Leu Glu Thr Ala Lys Gly Val Tyr Ala Se - #r Glu Pro Val Lys Gln # 285 - Pro Glu Val Asn Ser Glu Thr Asn Lys Leu Ly - #s Thr Ala Ile Asp Ala # 300 - Leu Asn Val Asp Lys Thr Glu Leu Asn Asn Th - #r Ile Ala Asp Ala Lys 305 3 - #10 3 - #15 3 - #20 - Thr Lys Val Lys Glu His Tyr Ser Asp Arg Se - #r Trp Gln Asn Leu Gln # 335 - Thr Glu Val Thr Lys Ala Glu Lys Val Ala Al - #a Asn Thr Asp Ala Lys # 350 - Gln Ser Glu Val Asn Glu Ala Val Glu Lys Le - #u Thr Ala Thr Ile Glu # 365 - Lys Leu Val Glu Leu Ser Glu Lys Pro Ile Le - #u Thr Leu Thr Ser Thr # 380 - Asp Lys Lys Ile Leu Glu Arg Glu Ala Val Al - #a Lys Tyr Thr Leu Glu 385 3 - #90 3 - #95 4 - #00 - Asn Gln Asn Lys Thr Lys Ile Lys Ser Ile Th - #r Ala Glu Leu Lys Lys # 415 - Gly Glu Glu Val Ile Asn Thr Val Val Leu Th - #r Asp Asp Lys Val Thr # 430 - Thr Glu Thr Ile Ser Ala Ala Phe Lys Asn Le - #u Glu Tyr Tyr Lys Glu # 445 - Tyr Thr Leu Ser Thr Thr Met Ile Tyr Asp Ar - #g Gly Asn Gly Glu Glu # 460 - Thr Glu Thr Leu Glu Asn Gln Asn Ile Gln Le - #u Asp Leu Lys Lys Val 465 4 - #70 4 - #75 4 - #80 - Glu Leu Lys Asn Ile Lys Arg Thr Asp Leu Il - #e Lys Tyr Glu Asn Gly # 495 - Lys Glu Thr Asn Glu Ser Leu Ile Thr Thr Il - #e Pro Asp Asp Lys Ser # 510 - Asn Tyr Tyr Leu Lys Ile Thr Ser Asn Asn Gl - #n Lys Thr Thr Leu Leu # 525 - Ala Val Lys Asn Ile Glu Glu Thr Thr Val As - #n Gly Thr Pro Val Tyr # 540 - Lys Val Thr Ala Ile Ala Asp Asn Leu Val Se - #r Arg Thr 
Ala Asp Asn 545 5 - #50 5 - #55 5 - #60 - Lys Phe Glu Glu Glu 565 - (2) INFORMATION FOR SEQ ID NO: 219: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1879 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #219: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - AACACCTGTA TATAAAGTTA CAGCAATCGC AGACAATTTA GTCTCTAGAA CT - #GCTGATAA 60 - TAAATTTGAA GAAGAATACG TTCACTATAT TGAAAAACCT AAAGTCCACG AA - #GATAATGT 120 - ATATTATAAT TTCAAAGAAT TAGTGGAAGC TATTCAAAAC GATCCTTCAA AA - #GAATATCG 180 - TCTGGGACAA TCAATGAGCG CTAGAAATGT TGTTCCTAAT GGAAAATCAT AT - #ATCACTAA 240 - AGAATTCACA GGAAAACTTT TAAGTTCTGA AGGAAAACAA TTTGCTATTA CT - #GAATTGGA 300 - ACATCCATTA TTTAATGTGA TAACAAACGC AACGATAAAT AATGTGAATT TT - #GAAAATGT 360 - AGAGATAGAA CGTTCTGGTC AAGATAATAT TGCATCATTA GCCAATACTA TG - #AAAGGTTC 420 - TTCAGTTATT ACAAATGTCA AAATTACAGG CACACTTTCA GGTCGTAATA AT - #GTTGCTGG 480 - ATTTGTAAAT AATATGAATG ATGGAACTCG TATTGAAAAT GTTGCTTTCT TT - #GGCAAACT 540 - ACACTCTACA AGTGGAAATG GCTCTCATAC AGGGGGAATT GCAGGTACAA AC - #TATAGAGG 600 - AATTGTTAGA AAAGCATATG TTGATGCTAC TATTACAGGA AACAAAACAC GC - #GCCAGCTT 660 - GTTAGTTCCT AAAGTAGATT ATGGATTAAC TCTAGACCAT CTTATTGGTA CA - #AAAGCTCT 720 - CCTAACTGAG TCGGTTGTAA AAGGTAAAAT AGATGTTTCA AATCCAGTAG AA - #GTTGGAGC 780 - AATAGCAAGT AAGACTTGGC CTGTAGGTAC GGTAAGTAAT TCTGTCAGCT AT - #GCTAAGAT 840 - TATCCGTGGA GAGGAGTTAT TCGGCTCTAA CGACGTTGAT GATTCTGATT AT - #GCTAGTGC 900 - TCATATAAAA GATTTATATG CGGTAGAGGG ATATTCGTCA GGTAATAGAT CA - #TTTAGGAA 960 - ATCTAAAACA TTTACTAAAT TAACTAAAGA ACAAGCTGAT GCTAAAGTTA CT - #ACTTTCAA 1020 - TATTACTGCT GATAAATTAG AAAGTGATCT ATCTCCTCTT GCAAAACTTA AT - #GAAGAAAA 1080 - AGCCTATTCT AGTATTCAAG ATTATAACGC TGAATATAAC CAAGCCTATA AA - #AATCTTGA 1140 - AAAATTAATA CCATTCTACA ATAAAGATTA TATTGTATAT CAAGGTAATA AA - #TTAAATAA 1200 - AGAACACCAT CTAAATACTA AAGAAGTTCT TTCTGTTACC GCGATGAACA AC - #AATGAGTT 1260 - TATCACAAAC CTAGATGAAG CTAATAAAAT TATTGTTCAC TATGCGGACG GT - #ACAAAAGA 1320 - TTACTTTAAC TTGTCTTCTA GCAGTGAAGG 
TTTAAGTAAT GTAAAAGAAT AT - #ACTATAAC 1380 - TGACTTAGGA ATTAAATATA CACCTAATAT CGTTCAAAAA GATAACACTA CT - #CTTGTTAA 1440 - TGATATAAAA TCTATTTTAG AATCAGTAGA GCTTCAGTCT CAAACGATGT AT - #CAGCATCT 1500 - AAATCGATTA GGTGACTATA GAGTTAATGC AATCAAAGAT TTATATTTAG AA - #GAAAGCTT 1560 - CACAGATGTT AAAGAAAACT TAACAAACCT AATCACAAAA TTAGTTCAAA AC - #GAAGAACA 1620 - TCAACTAAAT GATTCTCCAG CTGCTCGTCA AATGATTCGT GATAAAGTCG AG - #AAAAACAA 1680 - AGCAGCTTTA TTACTAGGTT TAACTTACCT AAATCGTTAC TATGGAGTTA AA - #TTTGGTGA 1740 - TGTTAATATT AAAGAATTAA TGCTATTCAA ACCAGATTTC TATGGTGAAA AA - #GTTAGCGT 1800 - ATTAGACAGA TTAATTGAAA TCGGTTCTAA AGAGAACAAC ATTAAAGGTT CA - #CGTACATT 1860 # 187 - #9 - (2) INFORMATION FOR SEQ ID NO:220: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 626 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:220: - Thr Pro Val Tyr Lys Val Thr Ala Ile Ala As - #p Asn Leu Val Ser Arg # 15 - Thr Ala Asp Asn Lys Phe Glu Glu Glu Tyr Va - #l His Tyr Ile Glu Lys # 30 - Pro Lys Val His Glu Asp Asn Val Tyr Tyr As - #n Phe Lys Glu Leu Val # 45 - Glu Ala Ile Gln Asn Asp Pro Ser Lys Glu Ty - #r Arg Leu Gly Gln Ser # 60 - Met Ser Ala Arg Asn Val Val Pro Asn Gly Ly - #s Ser Tyr Ile Thr Lys #80 - Glu Phe Thr Gly Lys Leu Leu Ser Ser Glu Gl - #y Lys Gln Phe Ala Ile # 95 - Thr Glu Leu Glu His Pro Leu Phe Asn Val Il - #e Thr Asn Ala Thr Ile # 110 - Asn Asn Val Asn Phe Glu Asn Val Glu Ile Gl - #u Arg Ser Gly Gln Asp # 125 - Asn Ile Ala Ser Leu Ala Asn Thr Met Lys Gl - #y Ser Ser Val Ile Thr # 140 - Asn Val Lys Ile Thr Gly Thr Leu Ser Gly Ar - #g Asn Asn Val Ala Gly 145 1 - #50 1 - #55 1 - #60 - Phe Val Asn Asn Met Asn Asp Gly Thr Arg Il - #e Glu Asn Val Ala Phe # 175 - Phe Gly Lys Leu His Ser Thr Ser Gly Asn Gl - #y Ser His Thr Gly Gly # 190 - Ile Ala Gly Thr Asn Tyr Arg Gly Ile Val Ar - #g Lys Ala Tyr Val Asp # 205 - Ala Thr Ile Thr Gly Asn Lys Thr Arg Ala Se - #r Leu Leu Val Pro Lys # 220 - Val Asp 
Tyr Gly Leu Thr Leu Asp His Leu Il - #e Gly Thr Lys Ala Leu 225 2 - #30 2 - #35 2 - #40 - Leu Thr Glu Ser Val Val Lys Gly Lys Ile As - #p Val Ser Asn Pro Val # 255 - Glu Val Gly Ala Ile Ala Ser Lys Thr Trp Pr - #o Val Gly Thr Val Ser # 270 - Asn Ser Val Ser Tyr Ala Lys Ile Ile Arg Gl - #y Glu Glu Leu Phe Gly # 285 - Ser Asn Asp Val Asp Asp Ser Asp Tyr Ala Se - #r Ala His Ile Lys Asp # 300 - Leu Tyr Ala Val Glu Gly Tyr Ser Ser Gly As - #n Arg Ser Phe Arg Lys 305 3 - #10 3 - #15 3 - #20 - Ser Lys Thr Phe Thr Lys Leu Thr Lys Glu Gl - #n Ala Asp Ala Lys Val # 335 - Thr Thr Phe Asn Ile Thr Ala Asp Lys Leu Gl - #u Ser Asp Leu Ser Pro # 350 - Leu Ala Lys Leu Asn Glu Glu Lys Ala Tyr Se - #r Ser Ile Gln Asp Tyr # 365 - Asn Ala Glu Tyr Asn Gln Ala Tyr Lys Asn Le - #u Glu Lys Leu Ile Pro # 380 - Phe Tyr Asn Lys Asp Tyr Ile Val Tyr Gln Gl - #y Asn Lys Leu Asn Lys 385 3 - #90 3 - #95 4 - #00 - Glu His His Leu Asn Thr Lys Glu Val Leu Se - #r Val Thr Ala Met Asn # 415 - Asn Asn Glu Phe Ile Thr Asn Leu Asp Glu Al - #a Asn Lys Ile Ile Val # 430 - His Tyr Ala Asp Gly Thr Lys Asp Tyr Phe As - #n Leu Ser Ser Ser Ser # 445 - Glu Gly Leu Ser Asn Val Lys Glu Tyr Thr Il - #e Thr Asp Leu Gly Ile # 460 - Lys Tyr Thr Pro Asn Ile Val Gln Lys Asp As - #n Thr Thr Leu Val Asn
465 4 - #70 4 - #75 4 - #80 - Asp Ile Lys Ser Ile Leu Glu Ser Val Glu Le - #u Gln Ser Gln Thr Met # 495 - Tyr Gln His Leu Asn Arg Leu Gly Asp Tyr Ar - #g Val Asn Ala Ile Lys # 510 - Asp Leu Tyr Leu Glu Glu Ser Phe Thr Asp Va - #l Lys Glu Asn Leu Thr # 525 - Asn Leu Ile Thr Lys Leu Val Gln Asn Glu Gl - #u His Gln Leu Asn Asp # 540 - Ser Pro Ala Ala Arg Gln Met Ile Arg Asp Ly - #s Val Glu Lys Asn Lys 545 5 - #50 5 - #55 5 - #60 - Ala Ala Leu Leu Leu Gly Leu Thr Tyr Leu As - #n Arg Tyr Tyr Gly Val # 575 - Lys Phe Gly Asp Val Asn Ile Lys Glu Leu Me - #t Leu Phe Lys Pro Asp # 590 - Phe Tyr Gly Glu Lys Val Ser Val Leu Asp Ar - #g Leu Ile Glu Ile Gly # 605 - Ser Lys Glu Asn Asn Ile Lys Gly Ser Arg Th - #r Phe Asp Ala Phe Gly # 620 - Gln Val 625 - (2) INFORMATION FOR SEQ ID NO: 221: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 1777 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #221: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - ATTAGACAGA TTAATTGAAA TCGGTTCTAA AGAGAACAAC ATTAAAGGTT CA - #CGTACATT 60 - CGACGCATTC GGTCAAGTAT TGGCTAAATA TACTAAATCA GGTAATTTAG AT - #GCATTTTT 120 - AAATTATAAT AGACAATTGT TCACAAATAT AGACAATATG AACGATTGGT TT - #ATTGATGC 180 - TACAGAAGAC CATGTCTACA TCGCAGAACG CGCTTCTGAG GTCGAAGAAA TT - #AAAAATTC 240 - TAAACATCGT GCATTCGATA ATTTAAAACG AAGTCACCTT AGAAATACTA TA - #CTCCCACT 300 - ACTGAATATT GATAAAGCAC ATCTTTATTT AATTTCAAAT TATAATGCAA TT - #GCCTTTGG 360 - TAGTGCAGAG CGATTAGGTA AAAAATCATT AGAAGATATT AAAGATATCG TT - #AACAAAGC 420 - TGCAGATGGT TATAGAAACT ATTATGATTT CTGGTATCGT CTAGCGTCTG AT - #AACGTTAA 480 - ACAACGACTA CTAAGAGATG CTGTTATTCC TATTTGGGAA GGTTATAACG CT - #CCTGGTGG 540 - ATGGGTTGAA AAATATGGCC GCTATAATAC CGACAAAGTA TATACTCCTC TT - #AGAGAATT 600 - CTTTGGTCCT ATGGATAAGT ATTATAATTA TAATGGAACA GGAGCTTATG CT - #GCTATATA 660 - TCCTAACTCT GATGATATTA GAACTGATGT AAAATATGTT CATTTAGAAA TG - #GTTGGTGA 720 - ATACGGTATT TCAGTTTACA CACATGAAAC AACACACGTC AACGACCGTG CG - #ATTTACTT 780 - AGGTGGCTTT GGACACCGTG AAGGTACTGA TGCTGAAGCA 
TATGCTCAGG GT - #ATGCTACA 840 - AACTCCTGTT ACTGGTAGTG GATTTGATGA GTTTGGTTCT TTAGGTATTA AT - #ATGGTATT 900 - TAAACGCAAA AATGATGGGA ATCAGTGGTA TATTACAGAT CCAAAAACTC TA - #AAAACACG 960 - AGAAGATATT AATAGATATA TGAAGGGTTA TAATGACACT TTAACTCTTC TT - #GATGAAAT 1020 - TGAGGCTGAA TCTGTGATTT CTCAACAAAA TAAAGATTTA AATAGTGCAT GG - #TTCAAAAA 1080 - AATAGATAGA GAATACCGTG ATAACAATAA ATTAAATCAA TGGGATAAAA TT - #CGAAATCT 1140 - AAGTCAAGAA GAGAAAAATG AATTAAATAT TCAATCTGTT AATGATTTAG TT - #GATCAACA 1200 - ATTAATGACT AATCGCAATC CAGGTAATGG TATCTATAAA CCCGAAGCAA TT - #AGCTATAA 1260 - CGATCAATCA CCTTATGTAG GTGTTAGAAT GATGACCGGT ATCTACGGAG GT - #AATACTAG 1320 - TAAAGGTGCT CCTGGAGCTG TTTCATTCAA ACATAATGCT TTTAGATTAT GG - #GGTTACTA 1380 - CGGATACGAA AATGGGTTCT TAGGTTATGC TTCAAATAAA TATAAACAAC AA - #TCTAAAAC 1440 - AGATGGTGAG TCTGTTCTAA GTGATGAATA TATTATCAAG AAAATATCTA AC - #AATACATT 1500 - TAATACTATT GAAGAATTTA AAAAAGCTTA CTTCAAAGAA GTTAAAGATA AA - #GCAACGAA 1560 - AGGATTAACA ACATTCGAAG TAAATGGTTC TTCCGTTTCA TCATACGATG AT - #TTACTGAC 1620 - ATTGTTTAAA GAAGCTGTTA AAAAAGATGC CGAAACTCTT AAACAAGAAG CA - #AACGGTAA 1680 - TAAAACAGTA TCTATGAATA ATACAGTTAA ATTAAAAGAA GCTGTTTATA AG - #AAACTTCT 1740 # 1777 TTTA AAACTTCAAT CTTTAAA - (2) INFORMATION FOR SEQ ID NO:222: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 592 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:222: - Leu Asp Arg Leu Ile Glu Ile Gly Ser Lys Gl - #u Asn Asn Ile Lys Gly # 15 - Ser Arg Thr Phe Asp Ala Phe Gly Gln Val Le - #u Ala Lys Tyr Thr Lys # 30 - Ser Gly Asn Leu Asp Ala Phe Leu Asn Tyr As - #n Arg Gln Leu Phe Thr # 45 - Asn Ile Asp Asn Met Asn Asp Trp Phe Ile As - #p Ala Thr Glu Asp His # 60 - Val Tyr Ile Ala Glu Arg Ala Ser Glu Val Gl - #u Glu Ile Lys Asn Ser #80 - Lys His Arg Ala Phe Asp Asn Leu Lys Arg Se - #r His Leu Arg Asn Thr # 95 - Ile Leu Pro Leu Leu Asn Ile Asp Lys Ala Hi - #s Leu Tyr Leu Ile Ser # 110 - Asn Tyr Asn Ala Ile 
Ala Phe Gly Ser Ala Gl - #u Arg Leu Gly Lys Lys # 125 - Ser Leu Glu Asp Ile Lys Asp Ile Val Asn Ly - #s Ala Ala Asp Gly Tyr # 140 - Arg Asn Tyr Tyr Asp Phe Trp Tyr Arg Leu Al - #a Ser Asp Asn Val Lys 145 1 - #50 1 - #55 1 - #60 - Gln Arg Leu Leu Arg Asp Ala Val Ile Pro Il - #e Trp Glu Gly Tyr Asn # 175 - Ala Pro Gly Gly Trp Val Glu Lys Tyr Gly Ar - #g Tyr Asn Thr Asp Lys # 190 - Val Tyr Thr Pro Leu Arg Glu Phe Phe Gly Pr - #o Met Asp Lys Tyr Tyr # 205 - Asn Tyr Asn Gly Thr Gly Ala Tyr Ala Ala Il - #e Tyr Pro Asn Ser Asp # 220 - Asp Ile Arg Thr Asp Val Lys Tyr Val His Le - #u Glu Met Val Gly Glu 225 2 - #30 2 - #35 2 - #40 - Tyr Gly Ile Ser Val Tyr Thr His Glu Thr Th - #r His Val Asn Asp Arg # 255 - Ala Ile Tyr Leu Gly Gly Phe Gly His Arg Gl - #u Gly Thr Asp Ala Glu # 270 - Ala Tyr Ala Gln Gly Met Leu Gln Thr Pro Va - #l Thr Gly Ser Gly Phe # 285 - Asp Glu Phe Gly Ser Leu Gly Ile Asn Met Va - #l Phe Lys Arg Lys Asn # 300 - Asp Gly Asn Gln Trp Tyr Ile Thr Asp Pro Ly - #s Thr Leu Lys Thr Arg 305 3 - #10 3 - #15 3 - #20 - Glu Asp Ile Asn Arg Tyr Met Lys Gly Tyr As - #n Asp Thr Leu Thr Leu # 335 - Leu Asp Glu Ile Glu Ala Glu Ser Val Ile Se - #r Gln Gln Asn Lys Asp # 350 - Leu Asn Ser Ala Trp Phe Lys Lys Ile Asp Ar - #g Glu Tyr Arg Asp Asn # 365 - Asn Lys Leu Asn Gln Trp Asp Lys Ile Arg As - #n Leu Ser Gln Glu Glu # 380 - Lys Asn Glu Leu Asn Ile Gln Ser Val Asn As - #p Leu Val Asp Gln Gln 385 3 - #90 3 - #95 4 - #00 - Leu Met Thr Asn Arg Asn Pro Gly Asn Gly Il - #e Tyr Lys Pro Glu Ala # 415 - Ile Ser Tyr Asn Asp Gln Ser Pro Tyr Val Gl - #y Val Arg Met Met Thr # 430 - Gly Ile Tyr Gly Gly Asn Thr Ser Lys Gly Al - #a Pro Gly Ala Val Ser # 445 - Phe Lys His Asn Ala Phe Arg Leu Trp Gly Ty - #r Tyr Gly Tyr Glu Asn # 460 - Gly Phe Leu Gly Tyr Ala Ser Asn Lys Tyr Ly - #s Gln Gln Ser Lys Thr 465 4 - #70 4 - #75 4 - #80 - Asp Gly Glu Ser Val Leu Ser Asp Glu Tyr Il - #e Ile Lys Lys Ile Ser # 495 - Asn Asn Thr Phe Asn Thr Ile Glu Glu Phe Ly - #s Lys Ala Tyr Phe Lys # 510 - Glu Val 
Lys Asp Lys Ala Thr Lys Gly Leu Th - #r Thr Phe Glu Val Asn # 525 - Gly Ser Ser Val Ser Ser Tyr Asp Asp Leu Le - #u Thr Leu Phe Lys Glu # 540 - Ala Val Lys Lys Asp Ala Glu Thr Leu Lys Gl - #n Glu Ala Asn Gly Asn 545 5 - #50 5 - #55 5 - #60 - Lys Thr Val Ser Met Asn Asn Thr Val Lys Le - #u Lys Glu Ala Val Tyr # 575 - Lys Lys Leu Leu Gln Gln Thr Asn Ser Phe Ly - #s Thr Ser Ile Phe Lys # 590 - (2) INFORMATION FOR SEQ ID NO: 223: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 460 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #223: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - TAAGACAGAT GAACGGAGCA AGGTGTTTGA CTTTTCCATT CCCTACTATA CT - #GCAAAAAA 60 - TAAACTCATT GTCAAAAAAT CTGACTTGAC TACTTATCAG TCTGTAAACG AC - #TTGGCGCA 120 - GAAAAAGGTT GGAGCGCAGA AAGGTTCGAT TCAAGAGACG ATGGCGAAAG AT - #TTGCTACA 180 - AAATTCTTCC CTCGTATCTC TGCCTAAAAA TGGGAATTTA ATCACAGATT TA - #AAATCAGG 240 - ACAAGTGGAT GCCGTTATCT TTGAAGAACC TGTTTCCAAG GGATTTGTGG AA - #AATAATCC 300 - TGATTTAGCA ATCGCAGACC TCAATTTTGA AAAAGAGCAA GATGATTCCT AC - #GCGGTAGC 360 - CATgAAAAAA GATAGCAAGA AATTGAAGAG GCAGTTCGAT AAAACCATTC AA - #AAGTTGAA 420 # 460 GACA AACTCATTGA GGAAGCCTTA - (2) INFORMATION FOR SEQ ID NO:224: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 153 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:224: - Lys Thr Asp Glu Arg Ser Lys Val Phe Asp Ph - #e Ser Ile Pro Tyr Tyr # 15 - Thr Ala Lys Asn Lys Leu Ile Val Lys Lys Se - #r Asp Leu Thr Thr Tyr # 30 - Gln Ser Val Asn Asp Leu Ala Gln Lys Lys Va - #l Gly Ala Gln Lys Gly # 45 - Ser Ile Gln Glu Thr Met Ala Lys Asp Leu Le - #u Gln Asn Ser Ser Leu # 60 - Val Ser Leu Pro Lys Asn Gly Asn Leu Ile Th - #r Asp Leu Lys Ser Gly #80 - Gln Val Asp Ala Val Ile Phe Glu Glu Pro Va - #l Ser Lys Gly Phe Val # 95 - Glu Asn Asn Pro Asp Leu Ala Ile Ala Asp Le - #u Asn Phe Glu Lys Glu # 110 - Gln Asp Asp Ser Tyr Ala Val Ala Met Lys Ly - #s Asp Ser Lys Lys 
Leu # 125 - Lys Arg Gln Phe Asp Lys Thr Ile Gln Lys Le - #u Lys Glu Ser Gly Glu # 140 - Leu Asp Lys Leu Ile Glu Glu Ala Leu 145 1 - #50 - (2) INFORMATION FOR SEQ ID NO: 225: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 751 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #225: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - CTGTGAGAAT CAAGCTACAC CCAAAGAGAC TAGCGCTCAA AAGACAATCG TC - #CTTGCTAC 60 - AGCTGGCGAC GTGCCACCAT TTGACTACGA AGACAAGGGC AATCTGACAG GC - #TTTGATAT 120 - CGAAGTTTTA AAGGCAGTAG ATGAAAAACT CAGCGACTAC GAGATTCAAT TC - #CAAAGAAC 180 - CGCCTGGGAG AGCATCTTCC CAGGACTTGA TTCTGGTCAC TATCAGGCTG CG - #GCCAATAA 240 - CTTGAGTTAC ACAAAAGAGC GTGCTGAAAA ATACCTTTAC TCGCTTCCAA TT - #TCCAACAA 300 - TCCCCTCGTC CTTGTCAGCA ACAAGAAAAA TCCTTTGACT TCTCTTGACC AG - #ATCGCTGG 360 - TAAAACAACA CAAGAGGATA CCGGAACTTC TAACGCTCAA TTCATCAATA AC - #TGGAATCA 420 - GAAACACACT GATAATCCCG CTACAATTAA TTTTTCTGGT GAGGATATTG GT - #AAACGAAT 480 - CCTAGACCTT GCTAACGGAG AGTTTGATTT CCTAGTTTTT GACAAGGTAT CC - #GTTCAAAA 540 - GATTATCAAG GACCGTGGTT TAGACCTCTC AGTCGTTGAT TTACCTTCTG CA - #GATAGCCC 600 - CAGCAATTAT ATCATTTTCT CAAGCGACCA AAAAGAGTTT AAAGAGCAAT TT - #GATAAAGC
660 - GCTCAAAGAA CTCTATCAAG ACGGAACCCT TGAAAAACTC AGCAATACCT AT - #CTAGGTGG 720 # 751 CAAT CTCAGTTACA A - (2) INFORMATION FOR SEQ ID NO:226: - (i) SEQUENCE CHARACTERISTICS: #acids (A) LENGTH: 250 amino (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear - (ii) MOLECULE TYPE: protein - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:226: - Cys Glu Asn Gln Ala Thr Pro Lys Glu Thr Se - #r Ala Gln Lys Thr Ile # 15 - Val Leu Ala Thr Ala Gly Asp Val Pro Pro Ph - #e Asp Tyr Glu Asp Lys # 30 - Gly Asn Leu Thr Gly Phe Asp Ile Glu Val Le - #u Lys Ala Val Asp Glu # 45 - Lys Leu Ser Asp Tyr Glu Ile Gln Phe Gln Ar - #g Thr Ala Trp Glu Ser # 60 - Ile Phe Pro Gly Leu Asp Ser Gly His Tyr Gl - #n Ala Ala Ala Asn Asn #80 - Leu Ser Tyr Thr Lys Glu Arg Ala Glu Lys Ty - #r Leu Tyr Ser Leu Pro # 95 - Ile Ser Asn Asn Pro Leu Val Leu Val Ser As - #n Lys Lys Asn Pro Leu # 110 - Thr Ser Leu Asp Gln Ile Ala Gly Lys Thr Th - #r Gln Glu Asp Thr Gly # 125 - Thr Ser Asn Ala Gln Phe Ile Asn Asn Trp As - #n Gln Lys His Thr Asp # 140 - Asn Pro Ala Thr Ile Asn Phe Ser Gly Glu As - #p Ile Gly Lys Arg Ile 145 1 - #50 1 - #55 1 - #60 - Leu Asp Leu Ala Asn Gly Glu Phe Asp Phe Le - #u Val Phe Asp Lys Val # 175 - Ser Val Gln Lys Ile Ile Lys Asp Arg Gly Le - #u Asp Leu Ser Val Val # 190 - Asp Leu Pro Ser Ala Asp Ser Pro Ser Asn Ty - #r Ile Ile Phe Ser Ser # 205 - Asp Gln Lys Glu Phe Lys Glu Gln Phe Asp Ly - #s Ala Leu Lys Glu Leu # 220 - Tyr Gln Asp Gly Thr Leu Glu Lys Leu Ser As - #n Thr Tyr Leu Gly Gly 225 2 - #30 2 - #35 2 - #40 - Ser Tyr Leu Pro Asp Gln Ser Gln Leu Gln # 250 - (2) INFORMATION FOR SEQ ID NO: 227: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 33 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #227: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 33 CTAC GACAATAAAA ATC - (2) INFORMATION FOR SEQ ID NO: 228: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 27 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #228: (xi) SEQUENCE 
DESCRIPTION: SEQ ID NO: # 27 TGCT GGTTGAG - (2) INFORMATION FOR SEQ ID NO: 229: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 30 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double
(D) TOPOLOGY: linear #229: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 30 CAAT ACGGACTATG - (2) INFORMATION FOR SEQ ID NO: 230: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 28 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #230: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 28 TAGG TCGGAAAC - (2) INFORMATION FOR SEQ ID NO: 231: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #231: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 TCAA GCTACACCCA AAGAG - (2) INFORMATION FOR SEQ ID NO: 232: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 31 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #232: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 31 CTGA GATTGATCTG G - (2) INFORMATION FOR SEQ ID NO: 233: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #233: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 CCGC TCTTCTCGTA ACGCAGC - (2) INFORMATION FOR SEQ ID NO: 234: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 32 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #234: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 32 GAAC TTTTACGCTT CC - (2) INFORMATION FOR SEQ ID NO: 235: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 45 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #235: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: #45 AATT TGACAGGTAA CAGCAAAAAA GCTGC - (2) INFORMATION FOR SEQ ID NO: 236: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 33 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #236: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 33 TTTT CAAGAATTCA TCG - (2) INFORMATION FOR SEQ ID NO: 237: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #237: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 AGGA ACTGCTTCTA AAGAC - (2) INFORMATION FOR 
SEQ ID NO: 238: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 32 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #238: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 32 ATTC GTTGGTGAAG CC - (2) INFORMATION FOR SEQ ID NO: 239: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 38 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #239: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 38 AGGT GGAAACGCTG GTTCATCC - (2) INFORMATION FOR SEQ ID NO: 240: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #240: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 TTTT CCACCTTCAA CAACC - (2) INFORMATION FOR SEQ ID NO: 241: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #241: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 CTAT GGTAAATCTG CGGATGG - (2) INFORMATION FOR SEQ ID NO: 242: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 42 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #242: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 42 ATCC GCTTTCATCG GGTTAAAGAA GG - (2) INFORMATION FOR SEQ ID NO: 243: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #243: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 AAAT TCTAGCGAAA CTAGTGG - (2) INFORMATION FOR SEQ ID NO: 244: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #244: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 TTCT TTTACTTCTT TGGTTGC - (2) INFORMATION FOR SEQ ID NO: 245: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 40 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #245: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 40 CGGA AAAAAAGATA CAACTTCTGG - (2) INFORMATION FOR SEQ ID NO: 246: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: 
nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #246: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 CAAT CCTTCAGCAA TCTTGTC - (2) INFORMATION FOR SEQ ID NO: 247: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #247: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 AAAA AATACAGCTT CAAGTCC - (2) INFORMATION FOR SEQ ID NO: 248: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #248: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 GTTT GCTTGGTATT GGTCG - (2) INFORMATION FOR SEQ ID NO: 249: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 41 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #249: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 41 AAAC TCAAGCACTA GTCAGACAGA G - (2) INFORMATION FOR SEQ ID NO: 250: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #250: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 AGCT TTTTGTATGT CTTC - (2) INFORMATION FOR SEQ ID NO: 251: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #251: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 TTCT GGCGGAAGTA AAGATGC - (2) INFORMATION FOR SEQ ID NO: 252: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #252: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 TAGC TTTTTTGATT GTTTCG - (2) INFORMATION FOR SEQ ID NO: 253: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 40 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #253: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 40 AGAA AAAACAAAAA ATGAAGATGG - (2) INFORMATION FOR SEQ ID NO: 254: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 31 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #254: (xi) SEQUENCE 
DESCRIPTION: SEQ ID NO: # 31 GTAG TCTCCGCCTT C - (2) INFORMATION FOR SEQ ID NO: 255: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 38 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #255: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 38 TCTG TGGTCAAATA ATCTTACC - (2) INFORMATION FOR SEQ ID NO: 256: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #256: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 AACA TGGTGCTTGC CAATAGG - (2) INFORMATION FOR SEQ ID NO: 257: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #257: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 AGAA AAGAAAGCAG ACAATGC - (2) INFORMATION FOR SEQ ID NO: 258: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #258: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 TGGT TGATCCAAAC CATCTG - (2) INFORMATION FOR SEQ ID NO: 259: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #259: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 AGGG TCAGAAGGTG CAGACC - (2) INFORMATION FOR SEQ ID NO: 260: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #260: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 GCTT GGTGTGCCCC AGTTGC - (2) INFORMATION FOR SEQ ID NO: 261: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 33 base (B) TYPE: nucleic acid
(C) STRANDEDNESS: double (D) TOPOLOGY: linear #261: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 33 GGCA GCTTTTAAAA ATC - (2) INFORMATION FOR SEQ ID NO: 262: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 31 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #262: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 31 CCAT TCACCATTAC C - (2) INFORMATION FOR SEQ ID NO: 263: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 29 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #263: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 29 GCAA AAAATTAAG - (2) INFORMATION FOR SEQ ID NO: 264: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 28 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #264: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 28 CCAT TCACCATT - (2) INFORMATION FOR SEQ ID NO: 265: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #265: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 TGAG GAAGAAACTA AAAAG - (2) INFORMATION FOR SEQ ID NO: 266: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 38 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #266: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 38 CTGT AGGAATGCTT CGAATTTG - (2) INFORMATION FOR SEQ ID NO: 267: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 38 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #267: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 38 TAAC AATAAAACTA TTGAAGAG - (2) INFORMATION FOR SEQ ID NO: 268: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 33 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #268: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 33 ACCT CCAAAAATCA CGG - (2) INFORMATION FOR SEQ ID NO: 269: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #269: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 AGGT AAACAACTAC AAGTCGG - 
(2) INFORMATION FOR SEQ ID NO: 270: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 32 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #270: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 32 GTTT GGCTCAGAAT TG - (2) INFORMATION FOR SEQ ID NO: 271: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 32 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #271: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 32 TGAT ACAAGTATCG CA - (2) INFORMATION FOR SEQ ID NO: 272: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 31 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #272: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 31 AGTA TGGCTAGATG G - (2) INFORMATION FOR SEQ ID NO: 273: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #273: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 ATCA TTTGAAAACA AAGAAAC - (2) INFORMATION FOR SEQ ID NO: 274: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 32 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #274: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 32 TGTT GCTGTGCTTG TG - (2) INFORMATION FOR SEQ ID NO: 275: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 39 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #275: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 39 AAAG GAAAGTCAGA CAGGAAAGG - (2) INFORMATION FOR SEQ ID NO: 276: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 41 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #276: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 41 GAGC TTTTTTGATA AAGGTTGCGC A - (2) INFORMATION FOR SEQ ID NO: 277: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 40 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #277: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 40 TAGA TATATTTTAG CATTTGAGAC - (2) INFORMATION FOR SEQ ID NO: 278: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) 
TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #278: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 TATC AAAGGCAAGA CTTGG - (2) INFORMATION FOR SEQ ID NO: 279: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #279: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 TAAA GTTGGTATTA ACGG - (2) INFORMATION FOR SEQ ID NO: 280: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #280: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 TTTG CGAAGTATTC CAAGAG - (2) INFORMATION FOR SEQ ID NO: 281: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #281: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 CGAG TTGGGACTGT ATCAAGC - (2) INFORMATION FOR SEQ ID NO: 282: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 40 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #282: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 40 TTTT TCCTTACTTA CAGATGAAGG - (2) INFORMATION FOR SEQ ID NO: 283: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #283: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 GATG CATCATAATC TAGGAGC - (2) INFORMATION FOR SEQ ID NO: 284: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #284: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 GACA TCTCCATCAT AAGTCGC - (2) INFORMATION FOR SEQ ID NO: 285: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #285: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 GAGA AAGTATTTGC AGGGG - (2) INFORMATION FOR SEQ ID NO: 286: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 39 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #286: (xi) SEQUENCE 
DESCRIPTION: SEQ ID NO: # 39 TTTC ATGGATGCAA TTTTTTTGG - (2) INFORMATION FOR SEQ ID NO: 287: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 43 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #287: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 43 ATTT ACTATCCATA CAGTAGAGTC AGC - (2) INFORMATION FOR SEQ ID NO: 288: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 40 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #288: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 40 AGGT TGCAATTCTG GATTAATTGG - (2) INFORMATION FOR SEQ ID NO: 289: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 30 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #289: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 30 GGAA AGAGTGGATG - (2) INFORMATION FOR SEQ ID NO: 290: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #290: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 TTTA AATTGACTAT GCGCCCG - (2) INFORMATION FOR SEQ ID NO: 291: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #291: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 CTAT GAACTTGGTC GTCACC - (2) INFORMATION FOR SEQ ID NO: 292: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #292: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 GATT TTTCCAAGTA AATCT - (2) INFORMATION FOR SEQ ID NO: 293: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base

(B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #293: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 GGGT GAATTAGAAA AAGG - (2) INFORMATION FOR SEQ ID NO: 294: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 33 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #294: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 33 AGGA TTGTTAGTAG TTG - (2) INFORMATION FOR SEQ ID NO: 295: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #295: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 TCAG GCTCAAGAAA GTTCAGG - (2) INFORMATION FOR SEQ ID NO: 296: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #296: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 TGAT GGAGCAAAGT AATACC - (2) INFORMATION FOR SEQ ID NO: 297: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 40 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #297: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 40 TGTA ACCCATATCC AGCTCCTTCC - (2) INFORMATION FOR SEQ ID NO: 298: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #298: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 TTGT TTATCTGGGG TTGC - (2) INFORMATION FOR SEQ ID NO: 299: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 38 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #299: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 38 TGGT ACTTGGCAAG GAAAACAG - (2) INFORMATION FOR SEQ ID NO: 300: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #300: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 GCCA CCTAGCTTCT CATTG - (2) INFORMATION FOR SEQ ID NO: 301: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 40 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #301: (xi) SEQUENCE 
DESCRIPTION: SEQ ID NO: # 40 TCAA TATGTCAGAG ATGATACTAG - (2) INFORMATION FOR SEQ ID NO: 302: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #302: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 CCAT TCACCATTAT CATTG - (2) INFORMATION FOR SEQ ID NO: 303: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #303: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 TAGA GAAGCATTAA AAACC - (2) INFORMATION FOR SEQ ID NO: 304: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #304: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 ATCT TGAAACTCCT CTGGTC - (2) INFORMATION FOR SEQ ID NO: 305: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #305: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 TGTC GAGGAGTGTC ATACC - (2) INFORMATION FOR SEQ ID NO: 306: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #306: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 TTTA CCCTTACGAA TCCAGG - (2) INFORMATION FOR SEQ ID NO: 307: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 43 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #307: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 43 AGTT TATGCGGATG AAACACTTAT TAC - (2) INFORMATION FOR SEQ ID NO: 308: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #308: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 GTAG AGATAGAAGT CATG - (2) INFORMATION FOR SEQ ID NO: 309: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 38 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #309: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 38 TGGT ATCGTAGATA CAGCCGGC - (2) INFORMATION FOR SEQ ID NO: 
310: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #310: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 TTGC GTACCTTCTA AGCGACC - (2) INFORMATION FOR SEQ ID NO: 311: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #311: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 GGTT GCATGGGATG CGATTCG - (2) INFORMATION FOR SEQ ID NO: 312: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #312: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 TTTA TTAGTTTGAC TAGC - (2) INFORMATION FOR SEQ ID NO: 313: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 33 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #313: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 33 CTAT GTAAATAAAG AGA - (2) INFORMATION FOR SEQ ID NO: 314: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 30 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #314: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 30 CCCT GTTTGAGGCA - (2) INFORMATION FOR SEQ ID NO: 315: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #315: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 TCCT CAATCAATAA CAAA - (2) INFORMATION FOR SEQ ID NO: 316: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #316: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 AGTA GGAGAAACTG AACT - (2) INFORMATION FOR SEQ ID NO: 317: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 30 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #317: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 30 TCAA GAAACTGCGG - (2) INFORMATION FOR SEQ ID NO: 318: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 31 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) 
TOPOLOGY: linear #318: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 31 CTCA TTCTTGCTTC C - (2) INFORMATION FOR SEQ ID NO: 319: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 29 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #319: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 29 AGGT GAGACTGAG - (2) INFORMATION FOR SEQ ID NO: 320: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #320: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 TAAT TCAAGTGTTT TCTCTG - (2) INFORMATION FOR SEQ ID NO: 321: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #321: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 ATTG GTAGCACAAG ATCC - (2) INFORMATION FOR SEQ ID NO: 322: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 29 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #322: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 29 AGCC ACTGGTCTC - (2) INFORMATION FOR SEQ ID NO: 323: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 31 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #323: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 31 GTCA GCTTCAGGAA C - (2) INFORMATION FOR SEQ ID NO: 324: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 30 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #324: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 30 CTTG TCCCAGGTGG - (2) INFORMATION FOR SEQ ID NO: 325: - (i) SEQUENCE CHARACTERISTICS:

#pairs (A) LENGTH: 32 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #325: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 32 TGAT GCGGATGAAA AG - (2) INFORMATION FOR SEQ ID NO: 326: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 33 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #326: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 33 TCTT TGGGTATTTC GCA - (2) INFORMATION FOR SEQ ID NO: 327: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 30 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #327: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 30 TCGA TCAAAAGTAG - (2) INFORMATION FOR SEQ ID NO: 328: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 28 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #328: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 28 GTCT CGAGGTTC - (2) INFORMATION FOR SEQ ID NO: 329: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 32 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #329: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 32 AACA GGAAACTGGG AC - (2) INFORMATION FOR SEQ ID NO: 330: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 33 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #330: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 33 TTCT GTACCTGTGT TTG - (2) INFORMATION FOR SEQ ID NO: 331: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 41 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #331: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 41 GCTC AATCCAACCC CAGGTCAAGT C - (2) INFORMATION FOR SEQ ID NO: 332: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 40 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #332: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 40 TTTA TCCTCTGACA TCATCGTATC - (2) INFORMATION FOR SEQ ID NO: 333: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double

(D) TOPOLOGY: linear #333: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 TCAA AAACAGGCAG ATGG - (2) INFORMATION FOR SEQ ID NO: 334: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 30 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #334: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 30 CATA GTCCAAGGCA - (2) INFORMATION FOR SEQ ID NO: 335: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 38 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #335: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 38 AGGA TCGAACGGTA AGACAACC - (2) INFORMATION FOR SEQ ID NO: 336: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #336: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 TAAC TCCGCTACTG TGTC - (2) INFORMATION FOR SEQ ID NO: 337: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 38 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #337: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 38 ATCG AAGATGGTTG GGAAGTCC - (2) INFORMATION FOR SEQ ID NO: 338: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 32 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #338: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 32 CCAC ATGCTCAACC TT - (2) INFORMATION FOR SEQ ID NO: 339: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #339: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 AGCT AGTGAAATGC AAGAAAG - (2) INFORMATION FOR SEQ ID NO: 340: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #340: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 TTTT GAACTAGTTG CTTTCGT - (2) INFORMATION FOR SEQ ID NO: 341: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #341: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 GATG GGGCACAAGG TTCAGGG - (2) 
INFORMATION FOR SEQ ID NO: 342: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #342: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 TAAC GAACAGTTCA ATCTG - (2) INFORMATION FOR SEQ ID NO: 343: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #343: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 CCCA ACTGTTGGTA CTTTCC - (2) INFORMATION FOR SEQ ID NO: 344: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #344: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 TGTT ACATTTTGAC CGTC - (2) INFORMATION FOR SEQ ID NO: 345: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #345: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 CCCA ACTGTTGGTA CTTTC - (2) INFORMATION FOR SEQ ID NO: 346: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #346: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 ATAA CGATCATTTT CTTTACC - (2) INFORMATION FOR SEQ ID NO: 347: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 38 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #347: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 38 ATAT TTAAGTCTAA GTGAAGCG - (2) INFORMATION FOR SEQ ID NO: 348: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #348: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 TGTT ACATTTTGCA AGTC - (2) INFORMATION FOR SEQ ID NO: 349: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 32 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #349: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 32 TTTT GAAGGAAGTA AG - (2) INFORMATION FOR SEQ ID NO: 350: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: 
nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #350: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 TTTG AAAAATGGAG GTGTATC - (2) INFORMATION FOR SEQ ID NO: 351: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 31 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #351: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 31 CCTC TCGAGAGAAA G - (2) INFORMATION FOR SEQ ID NO: 352: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 32 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #352: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 32 TTTT ACTCGTTTGA CA - (2) INFORMATION FOR SEQ ID NO: 353: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 42 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #353: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 42 CAAA AGTCAGACCG CTAAGAAAGT GC - (2) INFORMATION FOR SEQ ID NO: 354: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 38 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #354: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 38 ATCC AAATACTGGT TGTTGATG - (2) INFORMATION FOR SEQ ID NO: 355: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #355: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 GTCT CAGGATCAGA CTCAGG - (2) INFORMATION FOR SEQ ID NO: 356: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #356: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 CATC CACCTCTTGA CCTTTG - (2) INFORMATION FOR SEQ ID NO: 357: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 38 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #357: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 38 CTTT GCCAAATGGT GGGAAGGG - (2) INFORMATION FOR SEQ ID NO: 358: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 40 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #358: (xi) SEQUENCE DESCRIPTION: 
SEQ ID NO: # 40 TAAC ACTTTTCGAG GTTTGGTACC - (2) INFORMATION FOR SEQ ID NO: 359: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 33 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #359: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 33 AGAG AAGGAAAACT TGG - (2) INFORMATION FOR SEQ ID NO: 360: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #360: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 CAAC AAACCTTGTT CTTG - (2) INFORMATION FOR SEQ ID NO: 361: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 31 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #361: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 31 TATT GAGGACCACT T - (2) INFORMATION FOR SEQ ID NO: 362: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #362: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 CTCA GTCAATTCTT TTCC - (2) INFORMATION FOR SEQ ID NO: 363: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #363: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 AAAT ACCAGAGGTG TTCAG - (2) INFORMATION FOR SEQ ID NO: 364: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #364: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 ATGG GTGTGACAGG TTTGAA - (2) INFORMATION FOR SEQ ID NO: 365: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid

(C) STRANDEDNESS: double (D) TOPOLOGY: linear #365: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 ACAA TTAGAAAAAG ATAGC - (2) INFORMATION FOR SEQ ID NO: 366: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #366: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 ACTA GGTTCTGCAA TGCC - (2) INFORMATION FOR SEQ ID NO: 367: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 40 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #367: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 40 CAAG CAAAAAGAAG CAGTCAATGA - (2) INFORMATION FOR SEQ ID NO: 368: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #368: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 GACT TTACGATTTG CTCC - (2) INFORMATION FOR SEQ ID NO: 369: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 41 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #369: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 41 CTCT GTCCAGTCCA CTTTTTCAGC G - (2) INFORMATION FOR SEQ ID NO: 370: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 33 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #370: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 33 TGTT TCCTTAATGC GTT - (2) INFORMATION FOR SEQ ID NO: 371: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 39 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #371: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 39 AATT CAAAAAAATA GGCAAGAGG - (2) INFORMATION FOR SEQ ID NO: 372: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #372: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 TTTG ATTGCCAACA ACTG - (2) INFORMATION FOR SEQ ID NO: 373: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 39 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #373: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 
39 CCAG CAACAAAGCG AGCAAAAGG - (2) INFORMATION FOR SEQ ID NO: 374: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 32 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #374: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 32 TTCT TTTTCCACAC GA - (2) INFORMATION FOR SEQ ID NO: 375: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 39 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #375: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 39 ACAA GTCGCCCACT ATCAAGACT - (2) INFORMATION FOR SEQ ID NO: 376: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 33 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #376: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 33 CTCT TTCTTTTCAG GCT - (2) INFORMATION FOR SEQ ID NO: 377: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #377: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 36 CGGC TGGCAATATA TCCCGT - (2) INFORMATION FOR SEQ ID NO: 378: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #378: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 CCAT TCGCCATTAT AGTTGAC - (2) INFORMATION FOR SEQ ID NO: 379: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #379: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 ATCA GAATGGGTAG AAGAC - (2) INFORMATION FOR SEQ ID NO: 380: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #380: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 ATTG ATTTTCATCA TCAC - (2) INFORMATION FOR SEQ ID NO: 381: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #381: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 AGAT GATTCTGAAG GATGG - (2) INFORMATION FOR SEQ ID NO: 382: - (i) SEQUENCE CHARACTERISTICS: 
#pairs (A) LENGTH: 38 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #382: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 38 CCAT TCACCATTCT AGTTTAAG - (2) INFORMATION FOR SEQ ID NO: 383: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #383: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 TGCA AATGAAACTG AAGTAGC - (2) INFORMATION FOR SEQ ID NO: 384: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #384: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 AACG CTGACATCTA CGCG - (2) INFORMATION FOR SEQ ID NO: 385: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #385: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 TCAG CCTACTTTTG TAAGAGC - (2) INFORMATION FOR SEQ ID NO: 386: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 34 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #386: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 34 TTCA CCATTGGCAT TGAC - (2) INFORMATION FOR SEQ ID NO: 387: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 41 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #387: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 41 GGTG AAAGGTCATG CTACATTTGT G - (2) INFORMATION FOR SEQ ID NO: 388: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 33 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #388: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 33 TTGA GACCTTGCAA CAC - (2) INFORMATION FOR SEQ ID NO: 389: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 41 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #389: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 41 TCCT TTGAAGGATT TGAGAGAAAC C - (2) INFORMATION FOR SEQ ID NO: 390: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 41 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) 
TOPOLOGY: linear #390: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 41 AAGA TAAGATAAAT ATATATAAAG T - (2) INFORMATION FOR SEQ ID NO: 391: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 47 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #391: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 47 ATAT GGGACTTTTT TTCTACAACA AAATAGG - (2) INFORMATION FOR SEQ ID NO: 392: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #392: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 CAGC TCATTTAATC GTTTTTG - (2) INFORMATION FOR SEQ ID NO: 393: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #393: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 TGAG AATTATTTGC GAATG - (2) INFORMATION FOR SEQ ID NO: 394: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 31 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #394: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 31 ACAA AAGTAATGTA C - (2) INFORMATION FOR SEQ ID NO: 395: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 35 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #395: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 35 TCAA TCAAGTTCTT CAGCC - (2) INFORMATION FOR SEQ ID NO: 396: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 37 base (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: linear #396: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: # 37 GGCT TGGACCAGAT TGAAAAG - (2) INFORMATION FOR SEQ ID NO: 397: - (i) SEQUENCE CHARACTERISTICS: #pairs (A) LENGTH: 36 base

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397:
AACA TTAAAACGTC CTGAGG  36

(2) INFORMATION FOR SEQ ID NO: 398:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398:
AACT GTGACGCTGG TTCC  34

(2) INFORMATION FOR SEQ ID NO: 399:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399:
GGAG ACCTTTAAAA ATATC  35

(2) INFORMATION FOR SEQ ID NO: 400:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400:
CATC TTGTACATAC C  31

(2) INFORMATION FOR SEQ ID NO: 401:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401:
TGCG CAATCAAATT C  31

(2) INFORMATION FOR SEQ ID NO: 402:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 37 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402:
AGCC CAATAATCTA TAAAGCT  37

(2) INFORMATION FOR SEQ ID NO: 403:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403:
CGTT CATCAAGATG TC  32

(2) INFORMATION FOR SEQ ID NO: 404:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404:
TGTT GAAAAGAGAG TG  32

(2) INFORMATION FOR SEQ ID NO: 405:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405:
GGGC TTTAACTATC TTCGTATTCG  40

(2) INFORMATION FOR SEQ ID NO: 406:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406:
CTTC ACTTTCCCTT TCC  33

(2) INFORMATION FOR SEQ ID NO: 407:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407:
CCAG CATCGTTCGC AGGA  34

(2) INFORMATION FOR SEQ ID NO: 408:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408:
GAAG AAATAATGAT TGTGG  35

(2) INFORMATION FOR SEQ ID NO: 409:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409:
CCTT GAAATCCCAC TT  32

(2) INFORMATION FOR SEQ ID NO: 410:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410:
AAGG TTGTAGAATG ATTTCAATC  39

(2) INFORMATION FOR SEQ ID NO: 411:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411:
CTTT TTTTGGAGCA ATGTT  35

(2) INFORMATION FOR SEQ ID NO: 412:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412:
TCCG ATACGGGTGA TTG  33

(2) INFORMATION FOR SEQ ID NO: 413:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413:
TCTC AAAGATGTGA AAG  33

(2) INFORMATION FOR SEQ ID NO: 414:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414:
TTTG TCAAGGATTG CTTT  34

(2) INFORMATION FOR SEQ ID NO: 415:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 38 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415:
ATCC TATCATCTCT TCCAGAAG  38

(2) INFORMATION FOR SEQ ID NO: 416:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416:
ACTA AAAGCCGCAG CTT  33

(2) INFORMATION FOR SEQ ID NO: 417:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417:
TGCA GGGCAGACAG  30

(2) INFORMATION FOR SEQ ID NO: 418:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418:
ATAA TCTAGTAAAT AAGCGT  36

(2) INFORMATION FOR SEQ ID NO: 419:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419:
GTTT TTAGCGCTTG TTCTTC  36

(2) INFORMATION FOR SEQ ID NO: 420:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420:
AGTG TCATAATCTT CTTTAG  36

(2) INFORMATION FOR SEQ ID NO: 421:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421:
CGAG CATATTCTGA AG  32

(2) INFORMATION FOR SEQ ID NO: 422:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 37 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422:
ACCA TTTCTTTGTT CTGCATC  37

(2) INFORMATION FOR SEQ ID NO: 423:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 38 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423:
GGAT AGCATTCAGA ATCAGACG  38

(2) INFORMATION FOR SEQ ID NO: 424:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424:
TAAA GACAATTTTT CC  32

(2) INFORMATION FOR SEQ ID NO: 425:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 37 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425:
AGAT AGTATTATTA CTCAAAC  37

(2) INFORMATION FOR SEQ ID NO: 426:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426:
TATT TCTCTCAATT TTTC  34

(2) INFORMATION FOR SEQ ID NO: 427:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427:
GAAG CAGACCTATC AAAATC  36

(2) INFORMATION FOR SEQ ID NO: 428:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428:
TTTT TTAGATTTTT CAATATTTTT CAG  43

(2) INFORMATION FOR SEQ ID NO: 429:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429:
TGAT AATCGTGTTC AAATG  35

(2) INFORMATION FOR SEQ ID NO: 430:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430:
AGAT AGACGTTGAG T  31

(2) INFORMATION FOR SEQ ID NO: 431:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431:
CAAT CAGTCAGCTG CTTCC  35

(2) INFORMATION FOR SEQ ID NO: 432:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432:
CTTG TCCCAGGTGG TTAATTTGCC  40

(2) INFORMATION FOR SEQ ID NO: 433:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 38 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433:
ACAA CAACATGCTA CTTCTGAG  38

(2) INFORMATION FOR SEQ ID NO: 434:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434:
AACC CACTTATCAT TATCC  35

(2) INFORMATION FOR SEQ ID NO: 435:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435:
AGGC AAGTCCGTGA CTAGTGAAC  39

(2) INFORMATION FOR SEQ ID NO: 436:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436:
TTCC TTCAAAGTTT GCA  33

(2) INFORMATION FOR SEQ ID NO: 437:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437:
AATT GAAAAGGCGG CAGTTAGCC  39

(2) INFORMATION FOR SEQ ID NO: 438:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438:
AAGC GTACCTTTTT CTTCC  35

(2) INFORMATION FOR SEQ ID NO: 439:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 38 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439:
GTCA GGTTCTAATG GTTCTCAG  38

(2) INFORMATION FOR SEQ ID NO: 440:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440:
GGCG TCGCCGTCCT TC  32

(2) INFORMATION FOR SEQ ID NO: 441:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 38 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441:
TTCA CAGGATTTTA AAGAGAAG  38

(2) INFORMATION FOR SEQ ID NO: 442:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442:
TCCT TCTTCTGCAC TTCT  34

(2) INFORMATION FOR SEQ ID NO: 443:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 37 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443:
CGAA GTTGAGACTC CTCAATC  37

(2) INFORMATION FOR SEQ ID NO: 444:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444:
AAAT TTATTATCAG C  31

(2) INFORMATION FOR SEQ ID NO: 445:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445:
TGTA TATAAAGTTA CAGCAATCG  39

(2) INFORMATION FOR SEQ ID NO: 446:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 37 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446:
ACCG AATGCGTCGA ATGTACG  37

(2) INFORMATION FOR SEQ ID NO: 447:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447:
CAGA TTAATTGAAA TCGG  34

(2) INFORMATION FOR SEQ ID NO: 448:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448:
GATT GAAGTTTTAA AGCT  34

(2) INFORMATION FOR SEQ ID NO: 449:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449:
AGAT GAACGGAGCA AGGTG  35

(2) INFORMATION FOR SEQ ID NO: 450:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450:
TTCC TCAATGAGTT TGTCT  35

(2) INFORMATION FOR SEQ ID NO: 451:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451:
GAAT CAAGCTACAC CCA  33

(2) INFORMATION FOR SEQ ID NO: 452:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452:
CTGA GATTGATCTG GGAG  34
__________________________________________________________________________


