Back to EveryPatent.com
United States Patent |
5,776,680
|
Leibowitz
,   et al.
|
July 7, 1998
|
Diagnostic probes for pneumocystis carini
Abstract
The present invention pertains to a method for diagnosing for Pneumocystis
carinii by detecting the presence of a nucleic acid sequence containing
the 26S rRNA gene specific for Pneumocystis carinii. More particularly,
this invention relates to a method for diagnosing for Pneumocystis carinii
which comprises amplifying a sample of DNA from Pneumocystis carinii by
polymerase chain reaction (PCR) using species specific primers and
detecting the PCR products with species specific radioactive or
non-radioactive oligonucleotide probes. This invention also relates to a
method for diagnosing for various species of Pneumocystis carinii by
detecting the presence of a nucleic acid sequence containing the
particular 16S or 26S rRNA gene sequence specific for that species of
Pneumocystis carinii.
Inventors:
|
Leibowitz; Michael J. (Manalapan, NJ);
Liu; Yong (Piscataway, NJ)
|
Assignee:
|
University of Medicine & Dentistry of New Jersey (Newark, NJ)
|
Appl. No.:
|
505509 |
Filed:
|
July 21, 1995 |
Current U.S. Class: |
435/6; 435/91.2; 536/23.74; 536/24.32; 536/24.33 |
Intern'l Class: |
C12Q 001/68; C12P 019/34; C07H 021/04 |
Field of Search: |
435/6,91.2
536/23.74,24.32,24.33
935/8,77,78
|
References Cited
Foreign Patent Documents |
8803957 | Jun., 1988 | WO.
| |
9102092 | Feb., 1991 | WO.
| |
9119005 | Dec., 1991 | WO.
| |
Other References
Erlich et al, Science (1991) 252:1643-1651.
Regensburger, J. Gen Microb (1988) 134:1197-1204.
Williams BioTechniques (1989) 17:762-768.
White et al PCR Protocols: A Guide to Methods & Applications, 1990, Innis
et al, Eds, pp. 315-322.
|
Primary Examiner: Myers; Carla J.
Attorney, Agent or Firm: Muccino; Richard R.
Parent Case Text
This is a continuation application of application Ser. No. 08/298,087,
filed on 31 Aug. 1994, now abandoned, which application is a continuation
of Ser. No. 07/922,987, filed on 30 Jul. 1992, now abandoned.
Claims
We claim:
1. A method for diagnosing for Pneumocystis carinii which comprises
detecting the presence of a nucleic acid sequence containing the 26S rRNA
gene specific for Pneumocystis carinii in a sample which comprises the
steps of:
(a) treating the sample with an oligodeoxyribonucleotide polymerase chain
reaction primer for each strand of the nucleic acid sequence, four
different nucleoside triphosphates, and an agent for polymerization under
hybridizing conditions, such that for each strand an extension product of
each primer is synthesized which is sufficiently complementary to each
strand of the nucleic acid sequence being detected to hybridize therewith
and contains the 26S rRNA gene specific for Pneumocystis carinii, wherein
the primers are selected such that the extension product synthesized from
one primer, when it is separated from its complement, can serve as a
template for synthesis of the extension product of the other primer;
(b) treating the sample from step (a) under denaturing conditions to
separate the primer extension products from the templates on which they
are synthesized if the sequence to be detected is present;
(c) treating the product from step (b) with oligodeoxyribonucleotide
primers, four different nucleoside triphosphates, and an agent for
polymerization such that a primer extension product is synthesized using
each of the single strands produced in step (b) as a template, resulting
in amplification of the sequence to be detected if present;
(d) hybridizing the primer extension products from step (c) with a labeled
oligodeoxyribonucleotide probe complementary to the 26S rRNA gene specific
for Pneumocystis carinii;
(e) determining whether hybridization in step (d) has occurred to diagnose
for Pneumocystis carinii, wherein hybridization is directly proportional
to the amount of nucleic acid sequence containing the 26S rRNA gene
specific for Pneumocystis carinii present in the sampler wherein the
primers and probes are selected from the group of polynucleotides
consisting of SEQ ID NOs: 6, 7, 13, 14, 17, 19-26, and 28-30.
2. The method according to claim 1, wherein in step (d) the probe is
specific for a sequence lying between two polymerase chain reaction (PCR)
primers on the Pneumocystis carinii gene.
3. The method according to claim 1, further comprising in steps (d) and (e)
a positive control which contains the 26S rRNA gene specific for
Pneumocystis carinii and a negative control which does not contain the 26S
rRNA gene.
4. The method according to claim 1, wherein the nucleic acid sequence
containing the 26S rRNA gene specific for Pneumocystis carinii is a CDNA
copy of RNA.
5. A method for diagnosing for a species of Pneumocystis carinii which
comprises detecting the presence of a nucleic acid sequence containing the
26S rRNA gene specific for that species of Pneumocystis carinii in a
sample which comprises the steps of:
(a) treating the sample with an oligodeoxyribonucleotide polymerase chain
reaction primer for each strand of the nucleic acid sequence, four
different nucleoside triphosphates, and an agent for polymerization under
hybridizing conditions, such that for each strand an extension product of
each primer is synthesized which is sufficiently complementary to each
strand of the nucleic acid sequence being detected to hybridize therewith
and contains the 26S rRNA gene specific for that species of Pneumocystis
carinii, wherein the primers are selected such that the extension product
synthesized from one primer, when it is separated from its complement, can
serve as a template for synthesis of the extension product of the other
primer;
(b) treating the sample from step (a) under denaturing conditions to
separate the primer extension products from the templates on which they
are synthesized if the sequence to be detected is present;
(c) treating the product from step (b) with oligodeoxyribonucleotide
primers, four different nucleoside triphosphates, and an agent for
polymerization such that a primer extension product is synthesized using
each of the single strands produced in step (b) as a template, resulting
in amplification of the sequence to be detected if present;
(d) hybridizing the primer extension products from step (c) with a labeled
oligodeoxyribonucleotide probe complementary to the 26S rRNA gene specific
for that species of Pneumocystis carinii;
(e) determining whether hybridization in step (d) has occurred to diagnose
for Pneumocystis carinii, wherein hybridization is directly proportional
to the amount of nucleic acid sequence containing the 26S rRNA gene
specific for that species of Pneumocystis carinii present in the sample;
wherein the primers and probes are selected from the group of
polynucleotides consisting of SEQ ID NOs: 6, 7, 13, 14, 17, 19-26, and
28-38.
6. The method according to claim 5, wherein in step (d) the probe is
specific for a sequence lying between two polymerase chain reaction (PCR)
primers on the Pneumosystis carinii gene.
7. The method according to claim 5, further comprising in steps (d) and (e)
a positive control which contains the 26S rRNA gene specific for
Pneumocystis carinii and a negative control which does not contain the 26S
rRNA gene.
8. The method according to claim 5, wherein the nucleic acid sequence
containing the 26S rRNA gene specific for that species of Pneumocystis
carinii is a cDNA copy of RNA.
9. A method for diagnosing for a species of Pneumocystis carinii which
comprises detecting the presence of a nucleic acid sequence containing the
16S rRNA gene specific for that species of Pneumocystis carinii in a
sample which comprises the steps of:
(a) treating the sample with an oligodeoxyribonucleotide polymerase chain
reaction primer for each strand of the nucleic acid sequence, four
different nucleoside triphosphates, and an agent for polymerization under
hybridizing conditions, such that for each strand an extension product of
each primer is synthesized which is sufficiently complementary to each
strand of the nucleic acid sequence being detected to hybridize therewith
and contains the 16S rRNA gene specific for that species of Pneumocystis
carinii, wherein the primers are selected such that the extension product
synthesized from one primer, when it is separated from its complement, can
serve as a template for synthesis of the extension product of the other
primer;
(b) treating the sample from step (a) under denaturing conditions to
separate the primer extension products from the templates on which they
are synthesized if the sequence to be detected is present;
(c) treating the product from step (b) with oligodeoxyribonucleotide
primers, four different nucleoside triphosphates, and an agent for
polymerization such that a primer extension product is synthesized using
each of the single strands produced in step (b) as a template, resulting
in amplification of the sequence to be detected if present;
(d) hybridizing the primer extension products from step (c) with a labeled
oligodeoxyribonucleotide probe complementary to the 16S rRNA gene specific
for that species of Pneumocystis carinii;
(e) determining whether hybridization in step (d) has occurred to diagnose
for Pneumocystis carinii, wherein hybridization is directly proportional
to the amount of nucleic acid sequence containing the 16S rRNA gene
specific for that species of Pneumocystis carinii present in the sample;
wherein the primers and probes are selected from the group of
polynucleotides consisting of SEQ ID NOs: 4, 5, 15, 16, 18, and 27.
10. The method according to claim 9, wherein in step (d) the probe is
specific for a sequence lying between two polymerase chain reaction (PCR)
primers on the Pneumocystis carinii gene.
11. The method according to claim 9, further comprising in steps (d) and
(e) a positive control which contains the 16S rRNA gene specific for
Pneumocystis carinii and a negative control which does not contain the 16S
rRNA gene.
12. The method according to claim 9, wherein the nucleic acid sequence
containing the 16S rRNA gene specific for that species of Pneumocystis
carinii is a cDNA copy of RNA.
Description
BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention relates to a method for diagnosing for Pneumocystis carinii
by detecting the presence of a nucleic acid sequence containing the 26S
rRNA gene specific for Pneumocystis carinii. More particularly, this
invention relates to a method for diagnosing for Pneumocystis carinii
which comprises amplifying a sample of DNA from Pneumocystis carinii by
polymerase chain reaction (PCR) using species specific primers and
detecting the PCR products with species specific radioactive or
non-radioactive oligonucleotide probes. This invention also relates to a
method for diagnosing for various species of Pneumocystis carinii by
detecting the presence of a nucleic acid sequence containing the
particular 16S or 26S rRNA gene sequence specific for that species of
Pneumocystis carinii.
DESCRIPTION OF THE BACKGROUND
The disclosures referred to herein to illustrate the background of the
invention and to provide additional detail with respect to its practice
are incorporated herein by reference. For convenience, the disclosures are
referenced in the following text and respectively grouped in the appended
bibliography.
Pneumocystis carinii (P. carinii) is a ubiquitous eukaryotic microorganism
causing asymptomatic infections in most humans early in childhood (1) but
causing life-threatening pneumonia in immunosuppressed hosts including
patients with Acquired Immune Deficiency Syndrome (AIDS, 2). Although
morphologically P. carinii has properties associated with both protozoa
and yeasts, the 16S rRNA coding sequence of P. carinii grown in
immunosuppressed rats most resembled that of the yeast Saccharomyces
cerevisiae (S. cerevisiae, 3). This sequence also included a 390 base pair
insertion resembling a Group I intron, located 31 nucleotides from the 3'
end of the rRNA gene (3). Absence of this sequence from mature 16S rRNA
(4) and demonstration of its ability to spontaneously excise from
transcripts of cloned fragments of the gene (5) confirmed its identity as
a self-splicing intron (6-7). The sequence of the 5S rRNA of P. carinii
grown in nude rats showed closer similarity to 5S rRNA of Amoeba and
Myxomycota than to that of Ascomycetes such as Saccharomyces (8). However,
the validity of 5S rRNA sequence analysis as a taxonomic tool has been
questioned (9). In S. cerevisiae, the 5S rRNA is encoded in the same
genomic repeated element encoding 16S, 5.8S and 26S rRNAs, but on the
opposite strand (reviewed in 10), although most eukaryotes studied do not
have the gene for 5S rRNA linked to those for the other rRNA species.
Hybridization of chromosomal DNA separated by pulsed field electrophoresis
with 16S rRNA-derived probes has localized the 16S rRNA gene of
Pneumocystis to one or two 500 kbp. chromosomal DNAS, with the gene for 5S
rRNA apparently located elsewhere (11-12).
BRIEF DESCRIPTION OF THE FIGURES
FIGS. 1A and 1B show the DNA sequence of a portion of the rRNA-encoding
gene(s) of P. carinii isolated from immunosuppressed Sprague-Dawley rats
(Sasco) and the PCR amplifications which were subsequently cloned and
sequenced. The top line represents the DNA sequence of a portion of the
rRNA-encoding gene(s) of P. carinii isolated from immunosuppressed
Sprague-Dawley rats (Sasco). The horizontal lines below represent PCR
amplifications which were subsequently cloned and sequenced. Thin lines
(FIG. 1A) refer to PCR products from Sprague-Dawley rats (Sasco) and heavy
lines (FIG. 1B) refer to PCR products from Hooded rats. Numbers refer to
oligonucleotide primers (Table 1) used in each PCR reaction.
FIG. 2 shows the total contiguous sequence determined for P. carinii from
immunosuppressed Sprague-Dawley rats (Sasco) by the strategy shown in FIG.
1A. Except for the last 18 nucleotides (shown in lower case), capital
letters indicate rRNA coding sequences (positive strand), lower case
letters indicate spacers, and underlined lower case letters indicate Group
I introns. The initial 22 nucleotides are from the 3'-terminal portion of
the Group I intron in 16S rRNA. Nucleotides 23-53 are the second exon of
16S rRNA, 54-216 are internal transcribed spacer 1 (ITS1), 217-374 the
gene for 5.8S rRNA (identified by similarity to other 5.8S rRNA
sequences), 375-556 ITS2, and 557-4256 are the gene for 26S rRNA, with a
Group I intron sequence in lower case underlined. This sequence has been
deposited at EMBL/GenBank under accession No. M86760.
FIG. 3 shows a comparison of the sequence of the 5.8S rRNA gene of P.
carinii shown in FIG. 2 with the homologous sequences from Saccharomyces
cerevisiae (23) shown as Sc, Tetrahymena pyriformis (T. pyriformis) (24)
shown as Tp, and Homo sapiens (25) shown as Hs. Since the actual 5.8S rRNA
sequence was not determined, the termini of the P. carinii gene have been
chosen based on the known sequence of the homologous gene of S.
cerevisiae, to which it appears to be closely related. The three
nucleotides 5' to the proposed rRNA 5' terminus are shown here in lower
case letters.
FIG. 4 is a dendrogram generated by the "pileup" program of the Wisconsin-
GCG package indicating sequence similarity (but not necessarily
evolutionary relationships) among the 5.8S rRNAs compared in Table II.
FIG. 5 shows a comparison of the sequence of the 26S rRNA genes of P.
carinii (Pc) from FIG. 2, with homologous sequences from S. cerevisiae
(Sc), and T. pyriformis (Tp). The Group I self-splicing introns in the P.
carinii and T. pyriformis genes have been omitted. The final 18
nucleotides of the P. carinii sequence were determined from organisms from
immunosuppressed Hooded rats as shown in FIG. 2.
FIG. 6A shows the secondary structure into which the apparent Group I
intron in the gene for 26S rRNA of P. carinii can be folded. The helices
P1-P9 are conserved among Group I introns (6-7). The bases in the intron
are numbered 1 through 355, and the flanking exon regions are shown in
lower case letters. The consensus sequences P (nucleotides 80-91), Q
(nucleotides 202-211), R (nucleotides 247-260) and S (nucleotides 316-327)
are shown in boldface. FIG. 6B shows an alternative folding for the P8
helix of the intron (5) in the 16S rRNA gene.
FIG. 7 shows the sequence of the region from nucleotides 485 through 964 of
the 26S rRNA gene from P. carinii from Sprague-Dawley rats, as shown in
FIG. 5 (Pcd). This sequence was determined for three PCR products made
using oligonucleotides 4016 and 2892 as primers and for PCR products made
using the oligonucleotide pair 3425 and 3426, and the pair 2893 and 2982,
each resulting in products partially overlapping this region. This entire
sequence was thus determined on four or five isolates, with four separate
sequence determinations made for each PCR product. The sequence of DNA
amplified using the same primers (4016 and 2892) from P. carinii from
Hooded rats is shown as Pc2. The homologous regions of genes from S.
cerevisiae (Sc) and T. pyriformis (Tp) are also shown. The numbering is
according to the 26S rRNA sequence of Pcd as in FIG. 5. The sequence
denoted Pc2 has been deposited at EMBL/GenBank under accession No. 86761.
FIG. 8 shows a comparison of the sequences of the region from nucleotides
2911 through 3327 of the 26S rRNA gene of P. carinii (Pcd) from
Sprague-Dawley rats (FIG. 5) with the homologous regions from P. carinii
from Hooded rats (Pc2) and from S. cerevisiae (Sc) and T. pyriformis (Tp).
The fragment denoted Pc1 was amplified using primers 4138 and 4170. The
sequence shown for Pc2 was determined based on amplifications using primer
pair 4138 and 4139 and pair 4169 and 4170, and ligation-dependent PCR
amplification of a fragment extending from oligonucleotide 3427 through a
PstI site 381 nucleotides past the 3' end of the 26S rRNA gene. The
sequences of homologous regions of the 26S rRNA genes of S. cerevisiae
(Sc) and T. pyriformis (Tp) are shown.
FIG. 9 shows the results of PCR amplification confirming the sequence
differences between Pc1 and Pc2 shown in FIGS. 8 and 10. Primers 4358 and
4746 were used to amplify Pcd (lane 1) or Pc2 (lane2) DNA templates.
Primers 4743 and 4744 were used to amplify Pc1 (lane 3) or Pc2 (lane 4)
DNA. Lanes N contain a mixture of HindIII digested bacteriophage lambda
DNA and HaeIII digested replicative form DNA of bacteriophage phiXl74
(BRL).
FIG. 10 shows the sequence of the spacer region 3' to the 26S rRNA gene of
P. carinii from Hooded rats (FIG. 10), which was determined by
ligation-dependent PCR as described in the text. The sequences shown in
FIGS. 8 and 10 have been deposited at EMBL/GenBank under accession No.
M86759.
SUMMARY OF THE INVENTION
The present invention pertains to a method for diagnosing for Pneumocystis
carinii which comprises detecting the presence of a nucleic acid sequence
containing the 26S rRNA gene specific for Pneumocystis carinii in a sample
which comprises the steps of:
(a) treating the sample with an oligodeoxyribonucleotide primer for each
strand of the nucleic acid sequence, four different nucleoside
triphosphates, and an agent for polymerization under hybridizing
conditions, such that for each strand an extension product of each primer
is synthesized which is sufficiently complementary to each strand of the
nucleic acid sequence being detected to hybridize therewith and contains
the 26S rRNA gene specific for Pneumocystis carinii, wherein the primers
are selected such that the extension product synthesized from one primer,
when it is separated from its complement, can serve as a template for
synthesis of the extension product of the other primer;
(b) treating the sample from step (o) under denaturing conditions to
separate the primer extension products from the templates on which they
are synthesized if the sequence to be detected is present;
(c) treating the product from step (b) with oligodeoxyribonucleotide
primers, four different nucleoside triphosphates, and an agent for
polymerization such that a primer extension product is synthesized using
each of the single strands produced in step (b) as a template, resulting
in amplification of the sequence to be detected if present;
(d) hybridizing the primer extension products from step (c) with a labeled
oligodeoxyribonucleotide probe complementary to the 26S rRNA gene specific
for Pneumocystis carinii;
(e) determining whether hybridization in step (d) has occurred.
In another embodiment, the present invention pertains to a method for
diagnosing for a species of Pneumocystis carinii which comprises detecting
the presence of a nucleic acid sequence containing the 26S rRNA gene
specific for that species of Pneumocystis carinii in a sample which
comprises the steps of:
(a) treating the sample with an oligodeoxyribonucleotide primer for each
strand of the nucleic acid sequence, four different nucleoside
triphosphates, and an agent for polymerization under hybridizing
conditions, such that for each strand an extension product of each primer
is synthesized which is sufficiently complementary to each strand of the
nucleic acid sequence being detected to hybridize therewith and contains
the 26S rRNA gene specific for that species of Pneumocystis carinii,
wherein the primers are selected such that the extension product
synthesized from one primer, when it is separated from its complement, can
serve as a template for synthesis of the extension product of the other
primer;
(b) treating the sample from step (a) under denaturing conditions to
separate the primer extension products from the templates on which they
are synthesized if the sequence to be detected is present;
(c) treating the product from step (b) with oligodeoxyribonucleotide
primers, four different nucleoside triphosphates, and an agent for
polymerization such that a primer extension product is synthesized using
each of the single strands produced in step (b) as a template, resulting
in amplification of the sequence to be detected if present;
(d) hybridizing the primer extension products from step (c) with a labeled
oligodeoxyribonucleotide probe complementary to the 26S rRNA gene specific
for that species of Pneumocystis carinii;
(e) determining whether hybridization in step (d) has occurred.
In yet another embodiment, the present invention pertains to a method for
diagnosing for a species of Pneumocystis carinii which comprises detecting
the presence of a nucleic acid sequence containing the 16S rRNA gene
specific for that species of Pneumocystis carinii in a sample which
comprises the steps of:
(a) treating the sample with an oligodeoxyribonucleotide primer for each
strand of the nucleic acid sequence, four different nucleoside
triphosphates, and an agent for polymerization under hybridizing
conditions, such that for each strand an extension product of each primer
is synthesized which is sufficiently complementary to each strand of the
nucleic acid sequence being detected to hybridize therewith and contains
the 16S rRNA gene specific for that species of Pneumocystis carinii,
wherein the primers are selected such that the extension product
synthesized from one primer, when it is separated from its complement, can
serve as a template for synthesis of the extension product of the other
primer;
(b) treating the sample from step (a) under denaturing conditions to
separate the primer extension products from the templates on which they
are synthesized if the sequence to be detected is present;
(c) treating the product from step (b) with oligodeoxyribonucleotide
primers, four different nucleoside triphosphates, and an agent for
polymerization such that a primer extension product is synthesized using
each of the single strands produced in step (b) as a template, resulting
in amplification of the sequence to be detected if present;
(d) hybridizing the primer extension products from step (c) with a labeled
oligodeoxyribonucleotide probe complementary to the 16S rRNA gene specific
for that species of Pneumocystis carinii;
(e) determining whether hybridization in step (d) has occurred.
DETAILED DESCRIPTION OF THE INVENTION
This invention relates to a method for diagnosing for Pneumocystis carinii
by detecting the presence of a nucleic acid sequence containing the 26S
rRNA gene specific for Pneumocystis carinii. More particularly, this
invention relates to a method for diagnosing for Pneumocystis carinii
which comprises amplifying a sample of DNA from Pneumocystis carinii by
polymerase chain reaction (PCR) using species specific primers and
detecting the PCR products with species specific radioactive or
non-radioactive oligonucleotide probes. This invention also relates to a
method for diagnosing for various species of Pneumocystis carinii by
detecting the presence of a nucleic acid sequence containing the
particular 16S or 26S rRNA gene sequence specific for that species of
Pneumocystis carinii.
The term "oligonucleotide" as used herein refers to primers, probes,
oligomer fragments to be detected, oligomer controls, and unlabeled
blocking oligomers. Oligonucleotide are molecules comprised of two or more
deoxyribonucleotides or ribonucleotides.
The term "primer" as used herein refers to an oligonucleotide, preferably
an oligodeoxyribonucleotide, either naturally occurring such as a purified
restriction digest or synthetically produced, which is capable of acting
as a point of initiation of synthesis when subjected to conditions in
which synthesis of a primer extension product, which is complementary to a
nucleic acid strand, is induced, i.e., in the presence of nucleotides, an
agent for polymerization such as a DNA polymerase, and a suitable
temperature and pH. The primer must be sufficiently long to prime the
synthesis of extension products in the presence of the polymerization
agent.
In accord with the method of the present invention, the sequence of the
portion of the major rRNA-encoding operon (encoding the 16S, 5.8S and 26S
rRNA molecules specific for P. carinii) from organisms derived from the
lungs of immunosuppressed rats, including the genes for 5.8S and 26S
rRNAs, has been determined. These two genes show similarity to the
homologous genes of S. cerevisiae, with the gene for 26S rRNA also
containing an apparent Group I self-splicing intron.
The relatedness of different Pneumocystis isolates has been difficult to
determine in the absence of a long-term culture method for this organism.
The 5S rRNA gene amplified by polymerase chain reaction (PCR) from
multiple infected humans and rats had the identical sequences (13).
However, rat and human-derived organisms showed sequence differences in
their mitochondrial DNA (14). When portions of the 26S rRNA gene from two
different sources were sequenced, phylogenetically variable regions of the
gene were found to be different between these two organisms. This marked
sequence difference between 26S rRNA gene sequences may represent
differences between clones of the same species or may indicate the
existence of more than one species within the genus Pneumocystis. In
either case, such differences may provide a mechanism of recognizing the
relationships between different individual Pneumocystis isolates for
epidemiological studies. This appears to be the first such difference
reported between Pneumocystis isolates in the sequence of a chromosomal
gene.
The rRNA Operon of P. carinii
Although the exact phylogenetic relationship of P. carinii to other species
remains unknown, the 5.8S and 26S rRNA genes, like that for 16S rRNA (3),
are similar in primary sequence to the homologous genes of S. cerevisiae.
This finding contrasts with the report that the 5S rRNA gene most
resembles the sequence of the homologous genes of Amoeba or Myxomycota
rather than those of the Ascomycetes (8). The organization of the major
rRNA operon of P. carinii differs from that of S. cerevisiae in that for
the former there is no evidence that the 5S rRNA and 16S-5.8S-26S rRNA
operon genes are part of the same repeated DNA unit, based on pulsed field
electrophoresis studies (11-12). Linkage of the 5S rRNA gene to genes
encoding 16S rRNA or 26S rRNA by PCR techniques has not been observed. The
amount of DNA obtained from P. carinii was limited, and so classical
Southern analysis was not attempted.
The presence of Group I self-splicing introns in the 16S and 26S rRNA genes
of P. carinii distinguishes this organism from S. cerevisiae and from its
mammalian hosts. Since various compounds can specifically inhibit the
splicing of Group I introns in vitro (31), Group I intron splicing may
provide a specific target for development of new therapeutic agents
against P. carinii.
Taxonomy of P. carinii
The exact taxonomic relationships of P. carinii remain uncertain, in part
due to the limited number of eukaryotic microorganisms whose rRNA
sequences are known. Furthermore, the definitions of the groups denoted as
Fungi and Protozoa are so broad and imprecise that each includes very
distantly related organisms. It is possible that once more organisms of
this type are studied, these two groupings may prove to be inadequate, and
the taxonomy of the eukaryotic microorganisms may require some
redefinition. This has already proven to be the case for the
aicrosporidia, which have been placed in a group distinct from all other
eukaryotic microorganisms on the basis of their rRNA sequences (32).
In the absence of a long-term culture method or other tools for comparison
of different P. carinii organisms, the number of species within the genus
Pneumocystis is undefined. Antigenic differences between P. carinii
obtained from different mammalian host species have been demonstrated
(33-36), although their genetic basis is not proven. Although the 5S rRNA
gene sequences of multiple human and rat isolates of P. carinii are
identical (13), such isolates differ in the sequence of their
mitochondrial DNA (14). DNA hybridization methods with a cloned DNA
fragment have also suggested the non-identity of human and rat-derived P.
carinii, with differences noted among different human, but not rat,
isolates (37). Based on these results, it has been suggested that
subspecies of P. carinii may be designated based on the hosts from which
they are isolated (38).
The data presented herein show that multiple differences exist between the
26S rRNA gene sequences of P. carinii from Sprague-Dawley rats from Sasco
which were immunosuppressed in isolation (and therefore presumably
infected at some other location prior to their arrival here) and Hooded
rats which were immunosuppressed here without isolation (and therefore
presumably infected in this building or at some geographic location
distinct from the site at which the Sprague-Dawley rats were infected).
Since multiple independent PCR amplifications of portions of the 26S rRNA
gene prepared from templates derived from different individual rats of the
same type yielded identical sequences, there is no evidence that the
differences observed between the two sources represent PCR artefacts,
sequencing errors, or heterogeneity of rRNA sequences within an individual
cell, as has been reported in Plasmodium species (39). This variation
between different P. carinii isolates resembles that seen between
different individual humans, which also occurs in regions of the 26S rRNA
gene which are phylogenetically non-conserved (40). Sequence differences
in rRNA genes have been suggested as defining species differences within
the genus Giardia (41).
When Pc1 DNA template was amplified by PCR using the primer pair 4358
(universal) and 4746 (Pc1-specific), the expected 2,067 bp product was
produced; in contrast, no product was generated from Pc2 template with
these same primers (FIG. 9). Similarly, primers 4743 (Pc2-specific) and
4744 (Pc2-specific) amplified an approximately 3.0 kbp product from Pc2
template; no similar product was seen with Pc1 template (FIG. 9). Note
that in some reactions a barely detectable band of the same size seen with
Pc2 template was seen with Pc1 template using the latter primer pair.
These data are consistent with Pc1 and Pc2 each containing predominantly
genes encoding single distinct major 26S rRNA sequences.
Comparisons of the sequences of multiple P. carinii rRNA gene regions
should determine the extent of variability present. If different human
isolates of this organism vary as much as do different rat isolates, then
these sequences could be useful as epidemiological markers for identifying
strains of P. carinii and studying the spread of the organism and the
relative roles of new infection versus reactivation of earlier
asymptomatic colonization in the development of P. carinii pneumonitis in
immunosuppressed humans, including patients with AIDS. Since different
species of Tetrahymena differ more in their intron sequences than in the
sequences of adjacent conserved regions encoding rRNA (27), such regions
may prove to be even more variable between different P. carinii organisms.
Further studies may determine the variability within and between species
of the internal transcribed spacers (between the 16S and 5.8S rRNA and
5.8S and 26S rRNA genes) and external transcribed spacers (flanking the
rRNA coding regions). If these spacers contain regions with specific
functions in rRNA transcription or processing (30), such regions may show
sequence conservation.
The present invention is further illustrated by the following examples
which are not intended to limit the effective scope of the claims. All
parts and percentages in the examples and throughout the specification and
claims are by weight of the final composition unless otherwise specified.
EXAMPLES
METHODS
Growth and Purification of Pneumocystis carinii Sprague-Dawley rats from
Sasco, Inc. (Omaha, Neb.) were maintained in isolation cages with
protective filters (Lab Products, Maplewood, N.J.) with immunosuppression
by addition of dexamethasone (1 mg/ml) and tetracycline (0.5 mg/ml) to
their drinking water. Water and autoclaved 8% protein diet (ICN) were
provided ad libitum. Hooded rats (Harlan-Sprague-Dawley, Indianapolis,
Ind.), were treated in the same way but not isolated. Rats were sacrificed
after 8-12 weeks of immunosuppression or when signs of respiratory
distress were observed. All subsequent procedures were done at 4.degree.
C. Each pair of lungs was removed, minced with a scissors and the
homogenate was suspended in 25 ml of Dulbecco's Modified Eagle's Medium
(DMEM) and centrifuged for 10 minutes at 200.times.g to remove tissue
debris and lung cells. The supernatant was then transferred to a fresh
tube, cells were collected at 1,600.times.g and resuspended in 3 ml of
phosphate buffered saline (PBS). Suspended cells were loaded on
discontinuous Percoll gradients (10-40% in 10% steps) and after
centrifugation at 1,600.times.g for 30 minutes, trophozoites were found at
the 10-20% interface, cysts with some trophozoites and a few mammalian
cells at the 20-30% interface, and predominantly mammalian cells with some
cysts at the 30-40% interface.
For in vitro cultivation of P. carinii, mink lung cells of line ATCC CCL64
(15) grown to 80% confluence in 10 cm petri dishes in DMEM supplemented
with 10% fetal calf serum were used as feeder cells. Percoll gradient
purified cysts (5.times.10.sup.5) were added to each plate in the presence
of penicillin, streptomycin, gentamicin and fungizone, followed by
incubation at 37.degree. C. in a humidified 5% CO.sub.2 incubator. After
1-3 days in culture, the plates were gently agitated and the
Pneumocystis-containing medium was collected and centrifuged at
100.times.g for 5 minutes to pellet contaminating detached mammalian
cells. Only a few mammalian cells detached during the culture period and
these were efficiently removed by the centrifugation.
Microscopic Techniques
Pneumocystis trophozoites were quantitated in 5 ul samples air dried on
microscope slides and stained with Diff-Quik (Baxter Healthcare Co.,
Miami, Fla.). Cysts were identified by toluidine blue 0 stain (16). All
quantitation was done by counting three 5 ul samples for a total of 30 oil
immersion fields for each sample. All cultures and purified Pneumocystis
preparations were negative for fungal and bacterial contamination by
microscopy and culture, and for Mycoplasma contamination by MycoTect kit
(Gibco BRL).
Extraction of Nucleic Acids from Trophozoites
P. carinii cells from mink lung cell cultures were harvested by
centrifugation at 3,000 rpm for 30 minutes at 4.degree. C. in a Sorvall
SS-34 rotor, and were washed with chilled PBS. Cells were resuspended in
50 mM Tris-HCl ›Tris (hydroxymethyl) aminomethane hydrochloride!, 50 mM
Na-EDTA (sodium ethylenediaminetetraacetic acid), pH 8.0, and were lysed
by in cubation at 65.degree. C. for 30 minutes in the presence of 1% SDS
(sodium dodecyl sulfate). Proteins were removed by precipitation on ice in
the presence of 1.25N potassium acetate followed by centrifugation at room
temperature. Total nucleic acids were then concentrated by precipitation
in an equal volume of absolute ethanol on ice.
Oligonucleotides
DNA oligonucleotides were synthesized by beta-cyanoethyl phosphoramidite
chemistry on automated DNA synthesizers (Cyclone, Milligen and 380B,
Applied Biosystems), and were purified by chromatography on NENsorb-Prep
cartridges (NEN-DuPont) prior to use. Oligonucleotides used are listed in
Table 1.
TABLE 1
__________________________________________________________________________
Oligonucleotides Used for
PCR Amplifications and Sequencing
No. Sequence 5' Coordinate
Ref.
__________________________________________________________________________
228A
AACAGCTATGACCATGAT pUC polylinker
SEQ ID NO:1
229
TTCCCAGTCACGACGTTG pUC polylinker
SEQ ID NO:2
230
TGTAAAACGACGGCCAGT pUC polylinker
SEQ ID NO:3
1138
AGGGATTGGTTGGCCTGGTCCTCCGAA
637(+), 16S
3 SEQ ID NO:4
1887
CTTTCCAGTAATAGGCTTATCG
1726(-), 16S
3 SEQ ID NO:5
2892
GCTATCCTGAGGGAAACTTCGG
964(-), 26S SEQ ID NO:6
2893
CCCGTCTTGAAACACGGACCAAGG
635(+), 26S SEQ ID NO:7
2894
CCCGCGATCAGCAAAAGCTAATCTGG
1374(-), 16S
3 SEQ ID NO:8
2917
CCATACAGAAGACCATTCTTTATCCC
507(-), DHFR
18 SEQ ID NO:9
2918
GGCCGATCAAACTCTCTTCC
58(+), DHFR
18 SEQ ID NO:10
2919
GGGAAAAGGTCGTGGGGAGCG
977(-), TS
17 SEQ ID NO:11
2920
GGGGAAGACCGCCCTGATAGG
58(+), TS
17 SEQ ID NO:12
2982
GAGCCAATCCTTATCCCGAAGTTACG
1933(-), 26S
SEQ ID NO:13
2983
GTCTAAACCCAGCTCACGTTCCC
2933(-), 26S
SEQ ID NO:14
3175
GGGTGGTGGTGCATGGCCG
1262(+), 16S
3 SEQ ID NO:15
3176
CCTTCCGCAGGTTCACCTACGG
1796(-), 16S
3 SEQ ID NO:16
3243
CCGCAGCAGGTCTCCAAG 1833(+), 26S
SEQ ID NO:17
3425
CGAAAGAGAGGAGGTAGCACC
368(+), intron, 16S
5 SEQ ID NO:18
3426
GGTCCGTGTTTCAAGACGGG
654(-), 26S SEQ ID NO:19
3427
GGGAACGTGAGCTGGGTTTAG
2911(+), 26S
SEQ ID NO:20
4016
GGTTTGGCAGGCCAACATCGG
485(+), 26S SEQ ID NO:21
4138
CCATGAAAGTGTGGCCTATCG
2715(+), 26S
SEQ ID NO:22
4139
GCCTGGTCAGACAACCGC 3049(-), 26S
SEQ ID NO:23
4169
GGATTATGGCTGAACGCC 3074(+), 26S
SEQ ID NO:24
4170
GGCTTAATCTCAGCAGATCG
3328(-), 26S
SEQ ID NO:25
4358
GACGAGGCATTTGGCTACC
2267(-), 26S
SEQ ID NO:26
4443
GTACACACCGCCCGTCGC 1631(+), 16S
3 SEQ ID NO:27
4743
TTTAGCTCTTGATTGTAG 556(+), 26S, Pc2
SEQ ID NO:28
4744
CGCATATTTTATATTATG 3234(-), 26S, Pc2
SEQ ID NO:29
4746
GTTAGCTCTTGGCTTCTG 556(+), 26S, Pc1
SEQ ID NO:30
__________________________________________________________________________
TS refers to the thymidylate synthase (17) and DHFR refers to the
dihydrofolate reductase (18) genes of P. carinii.
Table 1 lists all primers used for PCR amplifications and sequencing. The
underlined G in 3243 was predicted for the 26S rRNA gene sequence based on
sequences from other organisms, but was A in the actual 26S rRNA sequence
of P. carinii. The underlined C in 4169 was present in the 26S rRNA gene
of P. carinii from Hooded rats but was A in the homologous location in
organisms from Sprague-Dawley rats, as described in the text. The
underlined C in 3425 is from the published intron sequence (5) but was T
in a clone of the intron amplified using flanking exon-derived primers
4434 and 3176.
Table II shows the extent of genetic identity as indicated by the
Wisconsin-GCG "Distances" program. Sequences are from GenBank with the
following accession numbers: Neurosopora crassa, Nc X02447; Cephalosporium
acremonium, Ca X06574; Alternaria alternata, Aa X17454; Saccharomyces
cerevisiae, Sc K01051; Schizosaccharomyces pombe, Sp J01359; Pneumocystis
carinii, Pc; Acanthamoeba castellani, Ac K00471; Chlamydomonas
reinhardtii, Cr M35013; Tetrahymena pyriformis, Tp M10752; Trypanosoma
brucei, Tb X05682; Plasmodium falciparum, Pf J04683; Dictyostelium
discoideum, Dd V00192; Phyarum polycephalum, Pp M13612; and Giardia
lamblia, Gl M35013.
TABLE II
__________________________________________________________________________
Sequence Similarity of 5.8S rRNAs of Simple Eukaryotes
Nc Ca Aa Sc Sp Pc Ac Cr Tp Tb Pf Dd Pp Gl
__________________________________________________________________________
Nc
1.0000
.9299
.9236
.9172
.8599
.8854
.7771
.7308
.6883
.6624
.5159
.5414
.5097
.4483
Ca 1.0000
.8924
.8797
.8544
.8418
.7215
.7244
.6688
.6519
.4873
.5506
.4968
.4828
Aa 1.0000
.9494
.8987
.8671
.7722
.7436
.6883
.6582
.5380
.5506
.5161
.4483
Sc 1.0000
.9114
.8734
.7848
.7564
.7143
.6392
.5316
.5696
.5161
.4483
Sp 1.0000
.8165
.7407
.7500
.7143
.5879
.5273
.5432
.5290
.4759
Pc 1.0000
.7468
.7051
.6753
.6519
5063
.5443
.5032
.4207
Ac 1.000
.7500
.6818
.5679
.5185
.5000
.5032
.4828
Cr 1.0000
.6429
.5641
.5513
.4744
.4516
.4552
Tp 1.0000
.5844
.5714
.5130
.5000
.4414
Tb 1.0000
.4702
.4691
.5161
.4138
Pf 1.0000
.4753
.4452
.3793
Dd 1.0000
.4065
.3862
Pp 1.0000
.4483
Gl 1.0000
__________________________________________________________________________
TABLE III
______________________________________
Sequence Similarity of 26S rRNAs of Simple Eukaryotes
Pc Sc Tp Pp
______________________________________
Pc -- 0.833 0.739
0.623
Sc -- 0.734
0.602
Tp -- 0.605
______________________________________
Table III shows the extent of genetic identity of 26S rRNA gene sequences,
calculated as in Table II. Abbreviations are as in Table II; sequences
from GenBank include Sc, J01355; Tp, X54004; and Pp, V01159.
Amplification and Cloning of DNA
Pneumocystis carinii DNA was amplified by means of PCR performed in a DNA
Thermal Cycler (Perkin Elmer Cetus) using thermostable DNA polymerase from
Thermus aquaticus (AmpliTaq, Perkin Elmer Cetus). Reactions were run in
the presence of 0.2 mM of each dNTP, 0.4 uM of each of the indicated
primers, 10 mM Tris-HCl (pH 8.3), 50 mM potassium chloride, 1.5 mM
MgCl.sub.2, gelatin (0.001% w/v), and 5 units of AmpliTaq DNA polymerase
in 10 ul total volume. Amplifications of segments over 1 kb. were
performed by incubation at 95.degree. C. for 2 minutes followed by 30
cycles of 94.degree. C. for 1 minute, 50.degree. C. for 1 minute, and
72.degree. C. for 1.5 minutes, followed by a 7 minute incubation at
72.degree. C. Amplifications of fragments of less than 1 kb. were
performed by 2 cycles of 94.degree. C. for 2 minutes, 58.degree. C. for 1
minute, and 72.degree. C. for 45 seconds, followed by 30 cycles of
94.degree. C. for 1 minute, 58.degree. C. for 1 minute, and 72.degree. C.
for 1 minute, followed by incubation at 72.degree. C. for 1 minute. For
some PCR reactions, the thermostable DNA polymerase from Thermus
thermophilus (Hot Tub, Amersham) was used, under reaction conditions
recommended by the manufacturer using 1.5 units of polymerase in a 100 ul
reaction, using 2 cycles of 94.degree. C. for 2 minutes, 58.degree. C. for
1 minute, and 70.degree. C. for 2 minutes, followed by 30 cycles of
94.degree. C. for 1 minute, 59.degree. C. for 1 minute, and 70.degree. C.
for 3 minutes, followed by incubation at 70.degree. C. for 10 minutes.
After PCR reaction, products were purified by agarose gel electrophoresis,
treated with T4 DNA polymerase (BRL) to generate blunt ends,
phosphorylated with T4 polynucleotide kinase (Pharmacia), ligated under
blunt end ligation conditions to SmaI-cut pUC18 DNA, and transformed into
E. coli DH5-alpha competent cells (BRL, Bethesda, Md.) as described (19).
Cells were grown in LB medium and plasmid DNA was extracted and purified
as described (19).
DNA Sequence Determination
DNA sequence determination was performed on the Genesis 2,000 Automated DNA
Sequencer (DuPont) according to the manufacturer's instructions for
sequencing reactions run on covalently closed superhelical DNA templates,
using DNA polymerase from bacteriophage T7 (Sequenase version 1.0, U.S.
Biochemicals). Primers used included oligonucleotides 228A, 229, and 230
(Table 1), which base pair with regions flanking the pUC18 polylinker, and
others listed in Table 1. For inserts of over 300 nucleotides without
convenient internal primer binding sites, nested deletions were generated
as described (19), which were then sequenced using the standard primers.
All sequences reported were determined at least twice for each DNA strand.
RESULTS
Sequence of the rRNA Operon of P. carinii
Prior to use for these experiments, nucleic acids from P. carinii were
shown to be from that source by confirmation of previously published
sequences using PCR methods. Primers 2920 and 2919 used in a PCR reaction
yielded a single 920 bP. product (based on agarose gel electrophoresis),
the size predicted for the thymidylate synthase gene with its 4
intervening sequences (17). A PCR utilizing primers 2918 and 2917
amplified a single 493 bp. product, as predicted for the dihydrofolate
reductase gene with a 43 bp. intervening sequence (18). The P.
carinii-specific primers for 16S rRNA, 1138 and 2894, yielded a single PCR
product of the predicted 738 bp. size (3). The "universal" 16S rRNA
primers, 3175 and 3176, generated two PCR products: one was 925 bp. in
length, the size predicted for the 16S rRNA gene with its Group I intron
(3, 5), and the other was 535 bp. in length. This smaller fragment had a
sequence identical to the corresponding region of human 18S rRNA (21), and
presumably represents amplification of contaminating mink lung cell
ribosomal DNA. The sequence of mink 16S rRNA is unknown, but is presumably
closely related to the human sequence.
FIG. 1 shows the DNA sequence of a portion of the rRNA-encoding gene(s) of
P. carinii isolated from immunosuppressed Sprague-Dawley rats (Sasco) and
the PCR amplifications which were subsequently cloned and sequenced. The
top line represents the DNA sequence of a portion of the rRNA-encoding
gene(s) of P. carinii isolated from immunosuppressed Sprague-Dawley rats
(Sasco). The horizontal lines below represent PCR amplifications which
were subsequently cloned and sequenced. Thin lines (FIG. 1A) refer to PCR
products from Sprague-Dawley rats (Sasco) and heavy lines (FIG. 1B) refer
to PCR products from Hooded rats. Numbers refer to oligonucleotide primers
(Table 1) used in each PCR reaction. Each PCR product, produced using
primers listed in Table 1, was cloned into pUC18 and both strands were
sequenced at least twice. All overlapping segments yielded the same
sequence, indicating an error rate of Taq polymerase-catalyzed PCR (22) of
less than one per 500 nucleotides. Rare misincorporation events in the
regions which were only amplified once cannot be ruled out.
FIG. 2 shows the total contiguous sequence determined for P. carinii from
immunosuppressed Sprague-Dawley rats (Sasco) by the strategy shown in FIG.
1A. Except for the last 18 nucleotides (shown in lower case), capital
letters indicate rRNA coding sequences (positive strand), lower case
letters indicate spacers, and underlined lower case letters indicate Group
I introns. The initial 22 nucleotides are from the 3'-terminal portion of
the Group I intron in 16S rRNA. Nucleotides 23-53 are the second exon of
16S rRNA, 54-216 are internal transcribed spacer 1 (ITS1), 217-374 the
gene for 5.8S rRNA (identified by similarity to other 5.8S rRNA
sequences), 375-556 ITS2, and 557-4256 are the gene for 26S rRNA, with a
Group I intron sequence in lower case underlined. This sequence has been
deposited at EMBL/GenBank under accession No. M86760. The sequence of the
final exon of the 16S rRNA gene agrees with that previously reported (3),
although the third base from the 3' end of the intron (C) previously
reported (5) is absent in our sequence. This sequence has been confirmed
in an additional amplified fragment including the entire intron sequence.
FIG. 3 shows a comparison of the sequence of the 5.8S rRNA gene of P.
carinii shown in FIG. 2 with the homologous sequences from Saccharomyces
cerevisiae (23) shown as Sc, Tetrahymena pyriformis (24) shown as Tp, and
Homo sapiens (25) shown as Hs. Since the actual 5.8S rRNA sequence was not
determined, the termini of the P. carinii gene have been chosen based on
the known sequence of the homologous gene of S. cerevisiae, to which it
appears to be closely related. The three nucleotides 5' to the proposed
rRNA 5' terminus are shown here in lower case letters. The 5.8S rRNA
sequence is 87% identical with the homologous rRNA of S. cerevisiae, which
was also the species to which P. carinii showed closest relatedness of its
16S rRNA gene (3). In contrast, the 5.8S rRNA sequence was 67% and 69%
identical with the homologous genes of T. pyriformis and H. sapiens,
respectively.
FIG. 5 shows the sequence of the 26S rRNA gene from FIG. 2 compared to
homologous genes from S. cerevisiae (26) and T. pyriformis (27). The
indicated P. carinii sequence has an apparent Group I self-splicing intron
sequence (see below) omitted after nucleotide 2241, and the T. pyriformis
sequence has an intron of the same type omitted from a location four
nucleotides 3' to the homologous site in the P. carinii gene (27). The
final 18 nucleotides of the P. carinii sequence were determined from
organisms from immunosuppressed Hooded rats as shown in FIG. 2. Thus the
26S rRNA genes of both P. carinii and T. pyriformis have Group I
self-splicing introns inserted into the same relatively conserved region.
Comparison of the three sequences shown in FIG. 5 indicates the relative
conservation of some regions of the 26S rRNA genes, and the greater
phylogenetic variability of other regions. The sequence of the coding
region of the P. carinii 26S rRNA gene shown in FIG. 5 is 83.3% identical
with the homologous gene of S. cerevisiae and 73.9% identical with that of
T. pyriformis. Therefore, based upon all three genes (encoding 16S, 5.8S
and 26S rRNA) of the major rRNA operon, P. carinii appears to be more
closely related to S. cerevisiae than to representative "protozoa."
Group I Self-splicing Introns of rRNA Genes
As set out in FIG. 2, an apparent Group I self-splicing intron interrupts
the 26S rRNA gene sequence in P. carinii. This intron is recognizable by
the presence of the conserved P, Q, R, and S segments (boldface in FIG.
6A)) present in all introns of this class, as previously reviewed (6-7).
There is 74% identity between the sequence of the putative Group I intron
in the 26S rRNA gene and that previously reported (5) in the 16S rRNA
gene. The entire sequence of the 16S rRNA gene intron in the P. carinii
isolate has been confirmed, and is identical to that reported (5) except
for the absence of the third nucleotide from the 3' end of the intron (C).
FIG. 6A shows the secondary structure into which the apparent Group I
intron in the gene for 26S rRNA of P. carinii can be folded. The helices
P1-P9 are conserved among Group I introns (6-7). The bases in the intron
are numbered 1 through 355, and the flanking exon regions are shown in
lower case letters. The consensus sequences P (nucleotides 80-91), Q
(nucleotides 202-211), R (nucleotides 247-260) and S (nucleotides 316-327)
are shown in boldface. FIG. 6B shows an alternative folding for the P8
helix of the intron (5) in the 16S rRNA gene.
FIG. 6A shows that the 26S rRNA gene intron can be folded into a structure
similar to that reported for other Group I self-splicing introns (6-7),
including that in the gene encoding 16S rRNA in P. carinii (5). This
structure is not necessarily the most stable folded structure possible
(28), but is most consistent with the consensus folding proposed for Group
I introns (7). The structure in FIG. 6A contains the conserved P1
double-helix made up of a pairing of the 5' exon-intron junction with an
internal guiding intron sequence (IGS). It also contains an unusually long
P8 helix with a bulge-loop on its 5' side. Although the previously
proposed structure for the 16S intron (5) does not have such an elongated
P8 helix, its structure also can be drawn in this way (FIG. 6B).
PCR primers pairing to the exons on either side of the 26S rRNA gene intron
were utilized, including a 5' primer with a 17-nucleotide 5' extension
consisting of a bacteriophage SP6 promoter (29), to generate a DNA product
consisting of the intron sequence with portions of both flanking exons
with an SP6 promoter at the 5' end of the positive strand. Transcription
of this DNA by bacteriophage SP6 RNA polymerase (Promega) results in
production of RNA catalyzing self-splicing under similar conditions to
those reported (5) for self-splicing of the intron in the 16S rRNA gene.
Thus the three rRNA genes encoding 16S, 5.8S and 26S rRNA of P. carinii
closely resemble their homologues in S. cerevisiae in sequence. However,
they contain Group I self-splicing introns in the 16S and 26S rRNA genes,
unlike most known fungi but like some protozoa (27).
Sequence Variation between P. carinii Isolates
In the course of studies to confirm the sequence shown in FIG. 2, various
regions of the rRNA operon of P. carinii were repeatedly amplified and
sequenced. Organisms obtained from the lungs of Sprague-Dawley rats
(Sasco) immunosuppressed in isolation chambers yielded the same sequences
for duplicate or overlapping amplifications, as summarized in FIG. 1. When
portions of the 26S rDNA were amplified, cloned and sequenced from P.
carinii obtained from Hooded rats immunosuppressed without isolation, they
were found to differ in sequence from the same regions obtained from
organisms from Sprague-Dawley rats from Sasco (FIGS. 7 and 8).
FIG. 7 shows the sequence of a region of the 26S rRNA gene which was
determined for five independent PCR products (summarized in FIG. 1) using
three different sets of primers from P. carinii from Sprague-Dawley rats,
for the region of nucleotides 485-964 as shown in FIG. 5. This sequence is
denoted Pc1 in FIG. 7, and was identical in all five determinations,
including three derived using PCR primers shown by the underlined
sequences in FIG. 7 and two using one primer outside this region and one
within it, as shown in the legend of FIG. 7. When the pair of primers
shown in FIG. 7 was used to amplify DNA from P. carinii from Hooded rats,
the sequence shown as Pc2 was obtained. Comparison of these sequences with
those of S. cerevisiae and T. pyriformis 26S rRNA sequences demonstrates
that the DNA sequences of the two P. carinii isolates differ from each
other at multiple positions, with the differences occurring mostly in
phylogenetically variable regions of the rRNA sequence. However, the two
P. carinii sequences are clearly more similar to each other than to the
sequence of the S. cerevisiae gene, indicating the phylogenetic
relatedness of these two isolates.
FIG. 8 shows a comparison of the sequences of the region from nucleotides
2911 through 3327 of the 26S rRNA gene of P. carinii (Pc1) from
Sprague-Dawley rats (FIG. 5) with the homologous regions from P. carinii
from Hooded rats (Pc2) and from S. cerevisiae (Sc) and T. pyriformis (Tp).
The fragment denoted Pc1 was amplified using primers 4138 and 4170. The
sequence shown for Pc2 was determined based on amplifications using primer
pair 4138 and 4139 and pair 4169 and 4170, and ligation-dependent PCR
amplification of a fragment extending from oligonucleotide 3427 through a
PstI site 381 nucleotides past the 3' end of the 26S rRNA gene. The
sequences of homologous regions of the 26S rRNA genes of S. cerevisiae
(Sc) and T. pyriformis (Tp) are shown. The 3'-terminal region of the 26S
rRNA gene of P. carinii from these two sources differed from each other,
with most of the differences in phylogenetically non-conserved regions.
Again the two P. carinii genes showed greater similarity to each other
than to the genes from other species.
When Pc1 DNA template was amplified by PCR using the primer pair 4358
(universal) and 4746 (Pc1-specific), the expected 2,067 bp product was
produced. In contrast, no product was generated from Pc2 template with
these same primers (FIG. 9). Similarly, primers 4743 (Pc2-specific) and
4744 (Pc2- specific) amplified an approximately 3.0 kbp product from Pc2
template; no similar product was seen with Pc1 template (FIG. 9). Note
that in some reactions a barely detectable band of the same size seen with
Pc2 template was seen with Pc1 template using the latter primer pair.
These data are consistent with Pc1 and Pc2 each containing predominantly
genes encoding single distinct major 26S rRNA sequences.
External Transcribed Spacer Sequence
The sequence of the 26S rRNA gene shown in FIG. 3 contains a
phylogenetically conserved EcoRI site at position 2875, which is located
in a highly conserved region of the sequence. DNA isolated from P. carinii
from Hooded rats was restricted with pairs of restriction enzymes,
including EcoRI and various other "6-cutters," and the resulting fragments
were then ligated into pUC18 cut with the same pairs of restriction
enzymes. The product of each of the ligation reactions was then subjected
to PCR amplification, with thermostable DNA polymerase from Thermus
thermophilus (Hot Tub, Amersham) using the primer pair: oligonucleotide
3427, which pairs on the positive strand at positions 2911-2931, and
oligonucleotide 230, which pairs with a pUC18 region 3' to the polylinker
(on the negative strand). When such PCR reactions were analyzed by agarose
gel electrophoresis with visualization of bands by ultraviolet
light-induced fluorescence in the presence of ethidium bromide, only the
pair of restriction enzymes EcoRI and PstI generated a visible DNA band.
When this band was cloned and sequenced, its 5' region had the sequence
shown as Pc2 in FIG. 8, followed by the final 18 nucleotides of the 26S
rRNA gene as shown in FIG. 5 and 381 nucleotides of the following spacer
region shown in FIG. 10, which would correspond to the external
transcribed spacer region in the homologous operon of most eukaryotes
(reviewed in 30). When the same ligation-dependent PCR procedure was
followed using the DNA from P. carinii from Sprague-Dawley rats, no
visible band of DNA was detected. This presumably indicates that the PstI
site in the spacer of the DNA denoted Pc2 is absent in Pc1 DNA, and the
next one is presumably too distant to support ligation-dependent PCR.
FIG. 10 shows the sequence of the spacer region 3' to the 26S rRNA gene of
P. carinii from Hooded rats (FIG. 8), which was determined by
ligation-dependent PCR. The sequences shown in FIGS. 8 and 10 have been
deposited at EMBL/GenBank under accession No. 86759.
In accord with the present invention, a method is provided for diagnosing
for Pneumocystis carinii which comprises detecting the presence of a
nucleic acid sequence containing the 26S rRNA gene specific for
Pneumocystis carinii in a sample which comprises the steps of:
(a) treating the sample with an oligodeoxyribonucleotide primer for each
strand of the nucleic acid sequence, four different nucleoside
triphosphates, and an agent for polymerization under hybridizing
conditions, such that for each strand an extension product of each primer
is synthesized which is sufficiently complementary to each strand of the
nucleic acid sequence being detected to hybridize therewith and contains
the 26S rRNA gene specific for Pneumocystis carinii, wherein the primers
are selected such that the extension product synthesized from one primer,
when it is separated from its complement, can serve as a template for
synthesis of the extension product of the other primer;
(b) treating the sample from step (a) under denaturing conditions to
separate the primer extension products from the templates on which they
are synthesized if the sequence to be detected is present;
(c) treating the product from step (b) with oligodeoxyribonucleotide
primers, four different nucleoside triphosphates, and an agent for
polymerization such that a primer extension product is synthesized using
each of the single strands produced in step (b) as a template, resulting
in amplification of the sequence to be detected if present;
(d) hybridizing the primer extension products from step (c) with a labeled
oligodeoxyribonucleotide probe complementary to the 26S rRNA gene specific
for Pneumocystis carinii;
(e) determining whether hybridization in step (d) has occurred.
Amplified products may be detected by electrophoresis on agarose gels
followed by hybridization with a radioactive or nonradioactive probe
consisting of a third oligonucleotide specific for a sequence lying
between two PCR primers on the P. carinii gene. The method may further
comprise in steps (d) and (e) a positive control which contains the 26S
rRNA gene specific for Pneumocystis carinii and a negative control which
does not contain the 26S rRNA gene.
This invention also provides a method for diagnosing for various species of
P. carinii by detecting the presence of a nucleic acid sequence containing
the particular 16S or 26S rRNA gene sequence specific for that species of
P. carinii. Specific PCR primers and hybridization probes for specific
subtypes of P. carinii may be employed based on sequence analysis of
different subtypes found in infected rats. Alternatively, single pairs of
PCR primers based on sequences shared by all isolates may be used for
strain identification if the distances between sequences shared by
different isolates are distinct. This latter approach may prove useful if
different strains differ in the location of the introns in their genes.
Preliminary data show that this is the case for the introns in 16S rRNA in
the two rat-derived P. carinii isolates described above.
Methods for amplifying and detecting nucleic acid sequences are described
in detail in U.S. Pat. Nos. 4,683,195, 4,683,202, and 4,965,188, which
disclosures are incorporated herein by reference.
The present invention is also directed at methods for diagnosing for
Pneumocystis carinii which comprise detecting the presence of RNA
complementary to a nucleic acid sequence containing the 26S rRNA gene
specific for Pneumocystis carinii, the 26S rRNA gene specific for a
species of Pneumocystis carinii, and the 16S rRNA gene specific for a
species of Pneumocystis carinii. The methods involve using PCR to amplify
mRNA sequences from cDNA. In this method, the enzyme reverse transcriptase
and a primer specific for the RNA are employed to make a DNA copy of the
RNA. The DNA copy may then be amplified and detected by the methods of the
present invention. Examples of reverse transcriptase enzymes which may be
employed include Moloney murine leukemia virus (MuLV) and Avian
Myeloblastosis virus (AMV) enzymes. Methods for employing PCR to amplify
mRNA sequences from cDNA are more fully described in G. Veres et al.,
Science, 237:415-417 (1987) and PCR Protocols; A Guide to Methods and
Applications, Edited by M. A. Innis, D. H. Gelfand, J. J. Sninsky, and T.
J. White, Academic Press, 1990, pp. 21-27, which disclosures are
incorporated herein by reference.
Appendium of References
1. Pifer, L. L., Hughes, W. T., Stagno, S., and Woods, D. (1978)
Pediatrics, 61, 35-41.
2. Hughes, W. T. (1991) Annu. Rev. Med., 42, 287-295.
3. Edman, J. C., Kovacs, J. A., Masur, H., Santi, D. V., Elwood, H. J., and
Sogin, M. L. (1988) Nature, 334, 519-522.
4. Stringer, S. L., Stringer, J. R., Blase, M. A., Walzer, P. D., and
Cushion, M. T. (1989) Exptal. Parasitol., 68, 450-461.
5. Sogin, M. L., and Edman, J. C. (1989) Nucleic Acids Res., 17, 5349-5359.
6. Cech, T. R. (1990) Annu. Rev. Biochem., 59, 543-568.
7. Cech, T. R. (1988) Gene, 73, 259-271.
8. Watanabe, J., Hori, H., Tanabe, K., and Nakamura, Y. (1989) Mol.
Biochem. Parasitol., 32, 163-168.
9. Halanych, K. M. (1991) Mol. Biol. Evol., 8, 249-253.
10. Warner, J. (1989) Microbiol. Rev., 53, 256-271.
11. Yonagathan, T., Lin, H., and Buck, G. A. (1989). Molec. Microbiol., 3,
1473-1480.
12. Lundgren, B., Cotton, R., Lundgren, J. D., Edman, J. C., and Kovacs, J.
A. (1990) Infect. Immun., 58, 1705-1710.
13. Kitada, K., Oka, S., Kimura, S., Shimada, K., Serikawa, T., Yamada, J.,
Tsunoo, H., Egawa, K., and Nakamura, Y. (1991) J. Clin. Microbiol., 29,
1985-1990.
14. Sinclair, K., Wakefield, A. E., Banerji, S., and Hopkin, J. M. (1991)
Mol. Biochem. Parasitol., 45, 183-184.
15. Radding, J. A., Armstrong, M. Y. K., Ullu, E., and Richards, F. F.
(1989) Infect. Immun., 57, 2149-2157.
16. Witebsky, F. G., Andrews, J. W. B., Gill, V. J., and MacLowry, J. D.
(1988) J. Clin. Microbiol., 26, 774-775.
17. Edman, U., Edman, J. C., Lundgren, B., and Santi, D. V. (1989) Proc.
Natl. Acad. Sci. USA, 86, 6503-6507.
18. Edman, J. C., Edman, U., Cao, M., Lundgren, B., Kovacs, J. A., and
Santi, D. V. (1989) Proc. Natl. Acad. Sci. USA, 86, 8625-8629.
19. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989) Molecular
Cloning: A Laboratory Manual, Second Edition. Cold Spring Harbor
Laboratory Press. Cold Spring Harbor.
20. Torczynski, R. M., Fuke, M., and Bollon, A. P. (1985) DNA, 4, 282-291.
21. Jones, M. D., and Foulkes, N. S. (1989) Nucleic Acids Res., 17,
8387-8388.
22. Zhou, Y., Zhang, X., and Ebright, R. H. (1991) Nucleic Acids Res., 19,
6052.
23. Bell, G. I., Degennaro, L. J., Gelfand, D. H., Bishop, R. J.,
Valenzuela, P., and Rutter, W. J. (1977) J. Biol. Chem., 252, 8118-8125.
24. Fujiwara, H., and Ishikawa, H. (1982) Nucleic Acids Res., 10,
5173-5182.
25. Nazar, R. N., Sitz, T. O., and Busch, H. (1976) Biochemistry, 15,
505-508.
26. Georgiev, O. I., Nikolaev, N., and Hadjiolov, A. A. (1981) Nucleic
Acids Res., 9, 6953-6958.
27. Nielsen, H., and Engberg, J. (1985) Nucleic Acids Res., 13, 7445-7455.
28. Zuker, M., and Stiegler, P. (1981) Nucleic Acids Res., 9, 133-148.
29. Nam, S. -C., and Kang, C. (1988) J. Biol. Chem., 263, 18123-18127.
30. Musters, W., Planta, R. J., van Heerikhuizen, H., and Raue (1990) in
Hill, W. E., Dahlberg, A., Garrett, R. A., Moore, P. B., Schlessinger, D.,
and Warner, J. R. (eds.), The Ribosome, Amer. Soc. Microbiol., New York,
pp. 435-442.
31. van Ahsen, U., Davies, J., and Schroeder, R. (1991) Nature, 353,
368-370.
32. Vossbrinck, C. R., Maddox, J. V., Friedman, S., Debrunner-Vossbrinck,
P. A., and Woese, C. R. (1987) Nature, 326, 411-414.
33. Kim, H. K., Hughes, W. T., and Feldman, S. (1972) Proc. Soc. Exptal.
Biol. Med., 142, 304-309.
34. Walzer, P. D., and Rutledge, M. E. (1980) J. Infect. Dis., 142, 449.
35. Gigliotti, F., Stokes, D. C., Cheatham, A. B., Davis, D. S., and
Hughes, W. T. (1986) J. Infect. Dis., 154, 315-322.
36. Link, M. J., Cushion, M. T., and Walzer, P. D. (1989) Infect. Immun.,
57, 1547-1555.
37. Tanabe, K., Fuchimoto, M., Egawa, K., and Nakamura, Y. (1988) J.
Infect. Dis., 157, 593-596.
38. Hughes, W. T., and Gigliotti, F. (1988) J. Infect. Dis., 157, 432-433.
39. Gunderson, J. J., Sogin, M. L., Wollett, G., Hollingdale, M., de la
Cruz, V. F., Waters, A. P., and McCutchan, T. F. (1987) Science, 238,
933-937.
40. Gonzalez, I. L., Gorski, J. L., Campen, T. J., Dorney, D. J., Erickson,
J. M., Sylvester, J. E., and Schmickel, R. D. (1985) Proc. Natl. Acad.
Sci. USA, 82, 7666-7670.
41. van Keulen, H., Campbell, S. L., Erlandsen, S. L., and Jarroll, E. L.
(1991) Mol. Biochem. Parasitol., 46, 275-284.
Throughout this application, various publications have been referenced. The
disclosures in these publications are incorporated herein by reference in
order to more fully describe the state of the art.
The invention being thus described, it will be obvious that the same may be
varied in many ways. Such variations are not to be regarded as a departure
from the spirit and scope of the invention and all such modifications are
intended to be included within the scope of the following claims.
__________________________________________________________________________
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(iii) NUMBER OF SEQUENCES: 32
(2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
AACAGCTATGACCATGAT18
(2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
TTCCCAGTCACGACGTTG18
(2) INFORMATION FOR SEQ ID NO:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
TGTAAAACGACGGCCAGT18
(2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
AGGGATTGGTTGGCCTGGTCCTCCGAA27
(2) INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
CTTTCCAGTAATAGGCTTATCG22
(2) INFORMATION FOR SEQ ID NO:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
GCTATCCTGAGGGAAACTTCGG22
(2) INFORMATION FOR SEQ ID NO:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
CCCGTCTTGAAACACGGACCAAGG24
(2) INFORMATION FOR SEQ ID NO:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
CCCGCGATCAGCAAAAGCTAATCTGG26
(2) INFORMATION FOR SEQ ID NO:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:
CCATACAGAAGACCATTCTTTATCCC26
(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
GGCCGATCAAACTCTCTTCC20
(2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
GGGAAAAGGTCGTGGGGAGCG21
(2) INFORMATION FOR SEQ ID NO:12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:
GGGGAAGACCGCCCTGATAGG21
(2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
GAGCCAATCCTTATCCCGAAGTTACG26
(2) INFORMATION FOR SEQ ID NO:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:
GTCTAAACCCAGCTCACGTTCCC23
(2) INFORMATION FOR SEQ ID NO:15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
GGGTGGTGGTGCATGGCCG19
(2) INFORMATION FOR SEQ ID NO:16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
CCTTCCGCAGGTTCACCTACGG22
(2) INFORMATION FOR SEQ ID NO:17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:
CCGCAGCAGGTCTCCAAG18
(2) INFORMATION FOR SEQ ID NO:18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:
CGAAAGAGAGGAGGTAGCACC21
(2) INFORMATION FOR SEQ ID NO:19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:
GGTCCGTGTTTCAAGACGGG20
(2) INFORMATION FOR SEQ ID NO:20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:
GGGAACGTGAGCTGGGTTTAG21
(2) INFORMATION FOR SEQ ID NO:21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:
GGTTTGGCAGGCCAACATCGG21
(2) INFORMATION FOR SEQ ID NO:22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:
CCATGAAAGTGTGGCCTATCG21
(2) INFORMATION FOR SEQ ID NO:23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
GCCTGGTCAGACAACCGC18
(2) INFORMATION FOR SEQ ID NO:24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:
GGATTATGGCTGAACGCC18
(2) INFORMATION FOR SEQ ID NO:25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:
GGCTTAATCTCAGCAGATCG20
(2) INFORMATION FOR SEQ ID NO:26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:
GACGAGGCATTTGGCTACC19
(2) INFORMATION FOR SEQ ID NO:27:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:
GTACACACCGCCCGTCGC18
(2) INFORMATION FOR SEQ ID NO:28:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:
TTTAGCTCTTGATTGTAG18
(2) INFORMATION FOR SEQ ID NO:29:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:
CGCATATTTTATATTATG18
(2) INFORMATION FOR SEQ ID NO:30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:
GTTAGCTCTTGGCTTCTG18
(2) INFORMATION FOR SEQ ID NO:31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4256 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:
CGAAAGAGAGGAGGTAGCACCGTTCCGTAGGTGAACCTGCGGAAGGATCATTAATGAAAT60
GTTGTCAAGAACTAGTTTATCTGGTTCTTGACATTTTCATCATAACACTTGTGAACATTA120
AAGATTTGCTTTGACAGGATGGGAGTTAGCTTTCGTCCTGTCAGAGGTTTTCAATTAAAA180
CTTTTTTGGTGTTTCGGTTAAAAATATAATTTTTAAAAACTTTCAGCAATGGATCTCTTG240
GTTCCCGCGTCGATGAAGAACGTGGCAAAATGCGATAAGTAGTGTGAATTGCAGAATTCA300
GTGACTCATCGAATTTTTGAACGCATATTGCGCTCCTCAGTATTCTGTGGAGCATGCCTG360
TTTGAGCGTCATTTTTATACTTGAACCTTTTTAAGGTTTGTGTTGGGCTATGCATTTTAG420
TATTTTTACAAGATGCTAGTCTAAAATGGAATCCAGAATATTATTTCGTGCAGCGTAATA480
GGGTTAAATTCCAATTCGCTGTTTTTAGAAATGATAGACTGGTTTGTCTATTGTTCCTAG540
AGAGCAATTTTTGAACCTTTGACCTCAAATCAGGTAGGATTACCCGCTGAACTTAAGCAT600
ATCAATAAGCGGAGGAAAAGAAACTAACAAGGATTCCCTCAGTAACGGCGAGTGAAGTGG660
GAAAAGCTCAAAATTAAAATCTGGCGAGGATCCTCGTCCGAGTTGTAATTTAGAGAAGTG720
CTTTTGGCTTGATGCTCTATTTAAAGTCCTTTGGAACAAGGCATCATAGAGGGTGATAAT780
CCCGTACGAGTAGGGTTATTAAGCTATGTAAAAGCACATTCGAAGAGTCGAGTTGTTTGG840
GATTGCAGCTCAAAATGGGTGGTAAATTTCATCTAAAGCTAAATATTAGCGGGAGACCGA900
TAGCGAACAAGTAGAGTGATCGAAAGATGAAAAGAACTTTGAAAAGAGAGTTAAATAGTA960
CGTGAAATTGCTGAAAGGGAAGCGCTTGCGATCAGACATGCCTTATCAGGATGTTGTTGT1020
CTTGACAATAACTATTACTTGGTTTGGCAGGCCAACATCGGTTTCAGCTGCTAGGTAAGT1080
GTCAAGAGAGGGTAGCCTCTTTCGTGGGGTGGTTAGCTCTTGGCTTCTGTAGTAGCAGGG1140
ACCGGAAGGTCTAGCGTCAGCTTGGTTGTTGGCTTAATGGTCTTAAGCGACCCGTCTTGA1200
AACACGGACCAAGGAGTCTAATATCTATGCGAGTGTTTGAGTGGAAAACTCATACGCGAA1260
ATGAAAGTGAAGCAAAAGGTAGGAACCCTTTAAGGGTGCACTATCGACCGGTTCAAATTT1320
ATTTGGATTGAGTAAGAGCATAGCTATTGGGACCCGAAAGATGGTGAACTATGCCTGAAT1380
AGGGTGAAGCCAGAGGAAACTCTGGTGGAGGCTCGTAGCGGTTCTGACGTGCAAATCGAT1440
CGTCAAATTTGGGCATAGGGGCGAAAGACTAATCGAACCATCTAGTAGCTGGTTCCTGCC1500
GAAGTTTCCCTCAGGATAGCAGAAACTCAATATCAGTTTTATGAGGTAAAGCGAATGATT1560
AGAGGCATTGGGGTTGAAACAACCTTAACCTATTCTCAAACTTTAAATATGTAAGAAGTC1620
CTTGTTGCTTAATTGAACATGGACATTAGAATGAGAGTTTCTAGTGGGCCATTTTTGGTA1680
AGCAGAACTGGCGATGCGGGATGAACCGAACGCGAGGTTAAGGTGCCGGAAGCACGCTCA1740
TCAGATACCACAAAAGGTGTTAGTTCATCTAGACAGTAGGACGGTGGCCATGGAAGTCGG1800
AATCCGCTAAGGAGTGTGTAACAACTCACCTACCGAATGAACTGGCCCTGAAAATGGATG1860
GCGCTCAAGCGTGCTACCTATACCTCGCCGTCTGGGATAATGATTCCTAGACGAGTAGGC1920
AGGCGTGGGGGTCGTGGCGAAGCCTAGGGCGTGAGCCCGGGTTGAACGGCCTCTAGTGCA1980
GATCTTGGTGGTAGTAGCAAATATTCAAATGAGGACTTTGAAGACTGAAGTGGGGAAAGG2040
TTCCATGCGAACAGTTATTGGGCATGGGTTAGTCGATCCTAAGAGATAGGGAAACTCCGT2100
TTTAAAGTGCGCGATTTTTCGCGCCTCTATCGAAAGGGAATCCGGTTAATATTCCGGAAC2160
CAGGATATGGATTCTTCACGGCAACGTAAATGAAGTCGGAGACGTCAGCGGGGGGCCTGG2220
GAAGAGTTATCTTTTCTTCTTAACAGCCTATCACCCTGGAATCGGTTTATCCGGAGATAG2280
GGTTCAATGGCTGGTAGAGTTCAGCACTTCTGTTGAATCCAGTGCGCTTTCGATGACCCT2340
TGAAAATCCGACGGAAGGAATAGTTTTCATGCCTGGTCGTACTCATAACCGCAACAGGTC2400
TCCAAGGTGAACAGCCTCTAGTTGATAGAATAATGTAGATAAGGGAAGTCGGCAAAATAG2460
ATCCGTAACTTCGGGATAAGGATTGGCTCTAAGGATTGGGTGCATTGGGCTTTAATCGGA2520
AGCTATTGGACCAGACGGGAACTACCTTGGGAAACCGAGGCGGATCCTGTTAGGATCGAT2580
CAGTGAATGATTTTAGCAGCCCTTTGGGCGTCCGATGCACGCTTAACAATCAACTTAGAA2640
CTGGTACGGACAAGGGGAATCTGACTGTCTAATTAAAACATAGCATTGCGATGGCCAGAA2700
AGTGGTGTTGACGCGATGTGATTTCTGCCCAGTGCTCTGAATGTCAAAGTGAAGAAATTC2760
AACCAAGCGCGGGTAAACGGCGGGAGTAACTATGACTCACCTTTTGAGGGTCATGAAAGC2820
GGCGCGAAAGTGTTAGCTAGTGATCCGAAAAATAAATTCGGGTTGCGACACTGTCAAATT2880
GCGGGGAGTCCCTAAAGATTCAACTACTAAGCAGCTTGTGGAAACACAGTTGTGGCCGAG2940
TTAATAGCCCTGGGTATAGTAACAATGTTGAATATGACTCTTAATTGAGGAAATGGGTGA3000
TCCGCAGCCAAATCCTAAGGACATTTTATTGTCTATGGATGCAGTTCAGCGACTAGACGG3060
CAGTGGGTATTGTAGAGATATGGGGTTATTTATGGCCTTATCTACAATGCTTAAGGTATA3120
GTCTAATCTCTTTCGAAAGAAAGAGTAGTGTGCTCTTAAGGTAGCCAAATGCCTCGTCAT3180
CTGATTAGTGACGCGCATGAATGGATTAACGAGATTCCCACTGTCCCTATCTACGATCTA3240
GCGAAACCACAGCCAAGGGAATGGGCTTGGCAAAATCAGCGGGGAAAGAAGACCCTGTTG3300
AGCTTGACTCTAGTTTGACATTGTGAAAAGACATAGAGGATGTAGAATAGGTGGGAGCTT3360
CGGCGCCTGTGAAATACCACCGCCTTTATTGTTTTTTTACTTAATCAGTGGAGCGGGACT3420
GAGCTTTTGCTCATCTTTTAGCGTTAAGGTCCTTTTACGGGCCGACCCGAGTTGATGACA3480
TTGTCAGATGGGGAGTTTGGCTGGGGCGGCACATCTGTCAAAAGATAACGCAGGTGTCCT3540
AAGGGGAGCTCATTGAGAACAGAAATCTCAAGTAGAATAAAAGGGTAAAAGTTCCCTTGA3600
TTTTGATTTTCAGTACGAATACAAACCATGAAAGTGTGGCCTATCGATCCTCTAAATCCT3660
CGAAATTTGAGGCTAGGGGTGCCAGAAAAGTTACCACAGGGATAACTGGCTTGTGGCAGC3720
CAAGCGTTCATAGCGACGTTGCTTTTTGATCCTTCGATGTCGGCTCTTCCTATCATACCG3780
AAGCAGAATTCGGTAAGCGTTGGATTGTTCACCCACTAATAGGGAACGTGAGCTGGGTTT3840
AGACCGTCGTGAGACAGGTTAGTTTTACCCTGCTGATGAAGTTATCGCAATGGTAATTCA3900
GCTTAGTACGAGAGGAACCGTTGATTCAGATATTTGGTTTTTGCGGTTGTCTGACCAGGC3960
AGTGCCGCGAAGCTATCATCTGTTGGATTATGGCTGAAAGCCTCTAAGTCAGAATCCATG4020
CCAGAAAGCGATGATATTTCCTCACGTTTTTTGATACAAATAGGCATCTTGCCAATATCA4080
GTATTTGGACGGGTGGAGGCGGACGGAAGTGTTCGTCTCTGTCCATTAATATTAATTAAT4140
ATTCGTGAGGGCGAATCCTTTGTAGACGACTTAGTTGAGGAACGGGGTATTGTAAGCAGT4200
AGAGTAGCCTTGTTGTTACGATCTGCTGAGATTAAGCCTTTGTTCCCAAGATTTGT4256
(2) INFORMATION FOR SEQ ID NO:32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 381 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:
TCAAAAAGAACATTTCTTCTGAGTGGTGAGGGGTCCGTTAGAGCACACTCGCTCCTTGGA60
AGAGATGTTTTTTTTGATATTAGGAACCAATAGAATATTTAGAATTTAATTTAGATTAAA120
TTATAGAAGGGTATCTGTAGCGATAAGTTTCCATTTCAAATTTTTCTGATGCAGTAGTAT180
GTTCTTTTCTAAAATAAAATGATAGTTTATTAATGATTAAACTAATTATTATCCTTTGGC240
CATCTTTTTCTACATTTTCCAGAAACAGATCTAATTACGTTTTTGCTATCTATAATTATT300
AAAAATAATCATATATCTTTAAAGTTGACCTCAACGTCTTAAAATGTTTAGTTTTTTAAT360
TAACCCTAAACCCTAGAACAC381
__________________________________________________________________________
Top