Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
PURIFIED THERMOSTABLE ENZYME
Document Type and Number:
WIPO Patent Application WO/1989/006691
Kind Code:
A2
Abstract:
Recombinant DNA sequences encoding a thermostable DNA polymerase from Thermus aquaticus can be used to produce a recombinant protein with a molecular weight of about 86,000-95,000 daltons. The thermostable recombinant enzyme can be used in a temperature-cycling chain reaction wherein at least one nucleic acid sequence is amplified in quantity from an existing sequence with the aid of selected primers and nucleotide triphosphates. The enzyme is preferably stored in a buffer containing non-ionic detergents that lends stability to the enzyme.

Inventors:
GELFAND DAVID H (US)
STOFFEL SUSANNE (US)
LAWYER FRANCES C (US)
SAIKI RANDALL K (US)
Application Number:
PCT/US1989/000127
Publication Date:
July 27, 1989
Filing Date:
January 12, 1989
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
CETUS CORP (US)
International Classes:
C11D3/386; C12N15/09; C12N1/21; C12N9/12; C12N15/54; C12N15/70; C12P19/34; C12R1/01; C12R1/19; (IPC1-7): C12N9/12
Foreign References:
EP0258017A21988-03-02
EP0237362A11987-09-16
Other References:
Abstracts of the Annual Meeting of the American Society for Microbiology, Vol. 75, 1975 (US) D. EDGAR et al.: "Purification and Characterization of a DNA Polymerase from an Extreme Thermophile, Thermus Aquaticus", page 151
CHEMICAL ABSTRACTS, Vol. 85, No. 21, 22 November 1976, (Columbus, Ohio, US), A. CHIEN et al.: "Deoxyribonucleic Acid Polymerase from the Extreme Thermophile Thermus Aquaticus", see page 180* Abstract 155559t, & J. Bacteriol. 1976, 127 (3), 1550-7*
CHEMICAL ABSTRACTS, Vol. 93, No. 5, 4 August 1980, (Columbus, Ohio, US), A.S. KALEDIN et al.: "Isolation and Properties of DNA Polymerase from Extremal Thermophylic Bacteria Thermus Aquaticus YT-1", see page 377* Abstract 40169P & Biokhimiya (Moscow) 1980, 45 (4), 644-51*
CHEMICAL ABSTRACTS, Vol. 98, No. 7, 14 February 1983, (Columbus, Ohio, US), A.S. KALEDIN et al.: "Isolation and Properties of DNA Polymerase from the Extremely Thermophylic Bacteria Thermus Ruber", see page 298 * Abstract 49311q, & Biokhimiya (Moscow) 1982, 47 (11), 1785-91*
CHEMICAL ABSTRACTS, Vol. 105, No. 1, 7 July 1986, (Columbus, Ohio, US), A.K. FREY et al.: "Recovery of beta-Glactosidase by Adsorption from Unclarified Escherichia Coli Homogenate" see page 491 * Abstract 5024g & Eur. Congr. Biotechnol. 3rd, 1984, 1, 655-63*
Proc. Natl. Acad. Sci. USA, Vol. 85, December 1988 M.A. INNIS et al.: "DNA Sequencing with Thermus Aquaticus DNA Polymerase and direct Sequencing of Polymerase Chain Reaction-Amplified DNA", pages 9436-9440
Science, Vol. 239, 29 January 1988, R.K. SAIKI et al.: "Primer-Directed Enzymatic Amplification of DNA with a Thermostable DNA Polymerase" pages 487-491
Download PDF:
Claims:
WHAT IS CLAIMED IS:
1. A gεπe encoding a purified native thermostable DNA polymerase from Thermus aquaticus having a molecular we ght of 35 95,000 daltons and haying at least half of the activity at pH 6.4 that it has at pH 8.0.
2. Thε gεnε of claim 1 that was clonεd from the genome of Thεrmus aquaticus.
3. Thε gεnε of claim 2 that has the DNA sequence of Figure 1 or an allelic variant thereof.
4. The gene of claim 2 εncoding a polymerasε having a molεcular wεight of about 86,00095,000 daltons.
5. Thε gene of claim 4 encoding a polymerasε having the amino acid residuεs of 4832 of Figure 1.
6. The gene of dlaim 4 encoding a polymerase having the 832 amino acid sequεncε of Figurε 1.
7. '.
8. Thε gεnε of claim 2 εncoding a polymεrasε having a molεcular wεight of about 60,00065,000 daltons. * .
9. Thε gεnε of claim 7 εncoding a polymεrasε having the amino acid residuεs 290832 of Figurε 1.
10. A thεrmostablε εnzymε that is a polymεrase containing at least 50% homology to any contiguous stretch of ninε or morε amino acids shown in Figurε 1.
11. Thε thεrmostablε polymεrasε of claim 9 whεrεin said contiguous stretch of nine or morε amino acids is sεlεctεd from thε following sεquεncεs: 81 a) residues 190204; b) residues 262270; c) residues 569587; d) residues 713732; a) residues 743759; and f) rεsiduεs 778790.
12. The enzyme produced recombinantly from thε gεnε of claim 1.
13. The enzyme of claim 11 which has a nonblocked amino 10 terminus, .
14. The enzyme produced recombinantly from the gene of claim 4.
15. The enzyme produced recombinantly from the gene of claim 7.
16. 15. A stable enzyme composition comprising the enzyme of claim 11 in a buffer comprising one or more nonionic polymeric detergents. « .
17. A method for purifying a ther ostablε polymerasε which comprises trεating an aquεous mixture containing the thermostablε •20 polymerasε with a hydrophobic interaction support undεr conditions which promotε hydrophobic intεractions and εluting said thεrmostable polymεrasε from said support with a solvεnt which attεnuates hydrophobic intεractioπs.
18. Thε mεthod of claim 16 whεrεin thε hydrophobic 25 chromato graphic support is Phεnyl Sεpharosε. IS. Thε method of claim 16 whεrein said hydrophobic interactions are provided by a buffer with an ionic strength corresponding to greater than or aqual to 0.
19. 05 M NaCl.
20. The method of claim 13 wherein said hydrophobic interaction promotion conditions are provided using a buffer containing greater than or equal to 0.2 M ammonium sulfata.
21. The method of claim 16 wherεin said elution solvent uses a 04 M urεa gradiεnt.
22. Thε method of claim 15 wherein the thermostable polymerase is DNA polymerase isolated from Thermus aquaticus.
23. The method of claim 16 wherεin said thεrmostable polymεrasε is a rεcombinant εnzy e.
24. The method of claim 22 wherein the aquεous mixture has previously bεεn εnriched in thermostable polymerase activity by heat treating the cell lysatε.
25. Thε mεthod of claim 23 whεrεin thε hεat trεatmεnt is conducted at tεmperatures in thε range of at least 45 C to about 90 C.
26. A method for purifying a rεcombinant thermostable polymerasε produced in a heat labilε host cell which method comprisεs treating the cell lysate with tempεraturεs in thε rangε of at lεast 45 C to about 90 C and rεcovεring thε thεrmostablε polymεrasε activity.
27. Thε mεthod of claim 25 whεrεin said thεrmostablε polymεrasε is from Thεr us aquaticus.
Description:
PURIFIED THERMOSTABLE ENZYME

The present invention relates to a purified thermostable enzyme. In one embodiment the enzyme is DNA polymerase purified from

Thermus aquaticus and has a molecular weight of about 86,000-95,000. In another embodiment the enzyme is DNA polymerase produced by recombinant means.

Extensive research has been conducted on the isolation of DNA poly erases from mesophilic microorganisms such as E. coli. See, for example, Bessman et al., J. Biol. Che . (1957) 233:171-177 and Buttin and Kornberg (1966) J. Biol. Chem. 241:5419-5427.

In contrast, relatively little investigation has been made on the isolation and purification of DNA polymerases from ther ophiles, such as Thermus aquaticus. Kaledin et al., Biokhymiya (1980) 5_:644-651 discloses a six-step isolation and purification procedure of DNA polymerase from cells of T_ ; _ aquaticus YTl strain. These steps involve isolation of crude extract, DEAE-cellulose chromatography, fractionation on hydroxyapatite, fractionation on DEAE-cellulose, and chromatography on single-strand DNA-cellulose. The pools from each stage were not screened for contaminating endo- and exonuclease(s). The molecular weight of the purified enzyme is reported as 62,000 daltons per monomeric unit.

A second purification scheme for a polyπerase from T. aquaticus is described by A. Chi en et al., J. Bacteriol. (1976) 127:1550-1557. In this process, the crude extract is applied to a DEAE-Sephadex column. The dialyzed pooled fractions are then subjected to treatment on a phosphocellulose column. The pooled fractions are dialyzed and bovine serum albumin (BSA) is added to prevent loss of polymerase activity. The resulting mixture is loaded on a DNA-cellulose column. The pooled material from the column is dialyzed and analyzed by gel filtration to have a molecular weight of about 63,000 daltons, and, by sucrose gradient centrifugation of about 68,000 daltons.

The use of a thermostable enzyme to amplify existing nucleic acid sequences in amounts that are large compared to the amount initially present has been suggested in U.S. Patent No. 4,683,195. Primers, nucleotide triphosphates, and a polymerase are used in the process, which involves denaturation, synthesis of template strands and hybridization. The extension product of each primer becomes a template for the production of the desired nucleic acid sequence. The patent discloses that if the polymerase employed is a thermostable enzyme, it need not be added after every denaturation step, because the heat will not destroy its activity. No other advantages or details are provided on the use of a purified thermostable DNA polymerase. Furthermore, New England Biolabs had marketed a polymerase from J\_ aquaticus, but was unaware that the polymerase activity decreased substantially with time in a storage buffer not containing non-ionic detergents.

Accordingly, there is a desire in the art to produce a purified, stable thermostable enzyme that may be used to improve the nucleic acid amplification process described above.

Accordingly, the present invention provides a purified thermostable enzyme that catalyzes combination of nucleotide triphosphates to form a nucleic acid strand complementary to a nucleic acid template strand. Preferably the purified enzyme is DNA polymerase from Thermus aquaticus and has a molecular weight of about 86,000-95,000 daltons. This purified material may be used in a temperature-cycling amplification reaction wherein nucleic acid sequences are produced from a given nucleic acid sequence in amounts that are large compared to the amount initially present so that they can be manipulated and/or analyzed easily.

The gene encoding the DNA polymerase enzyme from Thermus aquaticus has also been identified and cloned and provides yet another means to prepare the thermostable enzyme of the present invention. In addition to the gene encoding the approximately .86-000-95,000 da!ton enzyme, gene derivatives encoding DNA polymerase activity are also presented.

The invention also encompasses a stable enzyme composition comprising a purified, thermostable enzyme as described above in a buffer containing one or more non-ionic polymeric detergents.

Finally, the invention provides a method of purification for the thermostable polymerase of the invention which comprises treating an aqueous mixture containing the thermostable polymerase with a hydrophobic interaction chromatographic support under conditions which promote hydrophobic interactions and eluting the bound thermostable pol,ymerase from said support with a solvent which attenuates hydrophobic interactions.

The purified enzyme, as well as the enzymes produced by recombinant DNA techniques, provides much more specificity than the Klenow fragment, which is not thermostable, when used in the temperature-cycling amplification reaction. In addition, the purified enzyme and the recombinantly produced enzymes exhibit the appropriate activity expected when TTP or other nucleotide triphosphates are not present in the incubation mixture with the DNA template. Also, the enzymes herein have a broader pH profile than that of the thermostable enzyme from Thermus aquaticus described in the literature, with more than 50% of the activity at pH 6.4 as at pH 8.

Figure 1 is the DNA sequence and the predicted amino acid sequence for Taq polymerase. The amino acid sequence corresponding to the deduced primary translation product is numbered 1-832.

Figure 2 is. a restriction site map of plasmid pFC83 that contains the Λ»<4.5 kb Hindi II T. aquaticus DNA insert subcloned into plasmid BSM13+.

Figure 3 is a restriction site map of plasmid pFC85 that contains the ~2.68 kb Hindlll to Asp718 T. aquaticus DNA insert subcloned into plasmid BSM13+. As used herein, "cell", "cell line", and "cell culture" can be used interchangeably and all such designations include progeny. Thus, the words "trans formants" or "transformed cells" includes the primary subject cell and cultures derived therefrom without regard for

the number of transfers. It is also understood that all progeny may not be precisely identical in DNA content, due to deliberate or inadvertent mutations. Mutant progeny that have the same functionality as screened for in the originally transformed cell are included.

The term "control sequences" refers to DNA sequences necessary for the expression of an operably linked coding sequence in a particular host organism. The control sequences that are suitable for procaryotes, for example, include a promoter, optionally an operator sequence, a ribosome binding site, and possibly, other as yet poorly understood sequences. Eucaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers.

The term "expression system" refers to DNA sequences containing a desired coding sequence and control sequences in operable linkage, so that hosts transformed with these sequences are capable of producing the encoded proteins. In order to effect transformation, the expression system may be included on a vector; however, the relevant DNA may then also be integrated into the host chromosome.

The term "gene" as used herein refers to a DNA sequence that encodes a recoverable bioactive polypeptide or precursor. The polypeptide can be encoded by a full-length gene sequence or any portion of the coding sequence so long as the enzymatic activity is retained.

In one embodiment of the invention, the DNA sequence encoding a full-length thermostable DNA polymerase of Thermus aquaticus (Taq) is provided. Figure 1 shows this DNA sequence and the deduced amino acid sequence. For convenience, the amino acid sequence of this Taq polymerase will be used as a reference and other forms of the thermostable enzyme will be designated by referring to the sequence shown in Figure 1. Since the N-terminal ethionine may or may not be present, both forms are included in all cases wherein the thermostabl-e enzyme is produced in bacteria.

"Operably linked" refers to juxtaposition such that the normal function of the components can be performed. Thus, a coding

sequence "operably linked" to control sequences refers to a configuration wherein the coding sequences can be expressed under the control of the control sequences.

The term "mixture" as it relates to mixtures containing Taq polymerase refers to a collection of materials which includes Taq polymerase but which also includes alternative proteins. If the Taq polymerase is derived from recombinant host cells, the other proteins will ordinarily be those associated with the host. Where the host is bacterial, the comtaminating proteins will, of course, be bacterial proteins.

"Non-ionic polymeric detergents" refers to surface-active agents that have no ionic charge and that are characterized, for purposes of this invention, by their ability to stabilize the enzyme herein at a pH range of from about 3.5 to about 9.5, preferably from 4 to 8.5.

The term "oligonucleotide" as used herein is defined as a molecule comprised of two or more deoxyribonucleotides or ribonucleotides, preferably, more than three. Its exact size will depend on many factors, which in turn depend on the ultimate function or use of the oligonucleotide. The oligonucleotide may be derived synthetically or by cloning.

The term "primer" as used herein refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, which is capable of acting as a point of initiation of synthesis when placed under conditions in which synthesis of a primer extension product which is complementary to a nucleic acid strand is initiated, i.e., in the presence of four different nucleotide triphosphates and thermostable enzyme in an appropriate buffer ("buffer" includes pH, ionic strength, cof actors, etc.) and at a suitable temperature. For Taq polymerase the buffer herein preferably contains 1.5-2 mM of a magnesium salt, preferably MgCl 2 _ 150-200 uM of each nucleotide, and IIJLM of each primer, along with preferably 50 mM KC1, 10 mM Tris buffer, pH 8-8.4, and lOO Ug/ml gelatin.

The primer is preferably single-stranded for maximum efficiency in ampl fication, but may alternatively be double- stranded. If double- stranded, the primer is first treated to separate its strands before being used to prepare extension products. Preferably, the primer is an oligodeoxyribonucleotide. The primer must be sufficiently long to prime the synthesis of extension products in the presence of the thermostable enzyme. The exact lengths of the primers will depend on many factors, including temperature, source of primer and use of the method. For example, depending on the complexity of the target sequence, the oligonucleotide primer typically contains 15-25 nucleotides, although it may contain more or fewer nucleotides. Short primer molecules generally require cooler temperatures to form sufficiently stable hybrid complexes with template. The primers herein are selected to be "substantially" complementary to the different strands of each specific sequence to be amplified. This means that the primers must be sufficiently complementary to hybridize with their respective strands. Therefore, the primer sequence need " not reflect the exact sequence of the template. For example, a non-complementary nucleotide fragment may be attached to the 5' end of the primer, with the remainder of the primer sequence being complementary to the strand. Alternativel , non- complementary bases or longer sequences can be interspersed into the primer, provided 'that the primer sequence has sufficient complementarity with the sequence of the strand to be amplified to hybridize therewith and thereby form a template for synthesis of the extension product of the other primer. However, for detection purposes, particulary using labeled sequence-specific probes, the primers typically have exact complementarity to obtain the best results.

As used herein, the terms "restriction endonucl eases" and "restriction enzymes" refer to bacterial enzymes each of which cut double-stranded DNA at or near a specific nucleotide sequence.

As used herein, the term "thermostable enzyme" refers to an enzyme which is stable to heat and is heat resistant and catalyzes (facilitates) combination of the nucleotides in the proper manner to form the primer extension products that are complementary to each nucleic acid strand. Generally, the synthesis will be initiated at the 3 1 end of each primer and will proceed in the 5' direction along the template strand, until synthesis terminates, producing molecules of different lengths. There may be a thermostable enzyme, however, which initiates synthesis at the 5' end and proceeds in the other direction, using the same process as described above.

The thermostable enzyme herein must satisfy a single criterion to be effective for the amplification reaction, i.e., the enzyme must not become irreversibly denatured (inactivated) when subjected to the elevated temperatures for the time necessary to effect denaturation of double-stranded nucleic acids. Irreversible denaturation for purposes herein refers to permanent and complete loss of enzymatic activity. The heating conditions necessary for nucleic acid denaturation will depend, e.g., on the buffer salt concentration and composition and the Vength and nucleotide composition of the nucleic acids being denatured, but typically range from about 90 to about 105 C for a time depending mainly on the temperature and the nucleic acid length, typically about 0.5 to four minutes. Higher temperatures may be tolerated as the buffer salt concentration and/or GC composition of the nucleic acid is increased. Preferably, the enzyme will not become ' irreversibl denatured at about 90-100 C.

The thermostable enzyme herein preferably has an optimum temperature at which it functions that is higher than about 40°C, which is the temperature below which hybridization of primer to template is promoted, although, depending on (1) salt concentration and composition and (2) composition and length of primer, hybridization can occur at higher temperature (e.g., 45-70°C). The higher the temperature optimum for the enzyme, the greater the specificity and/or selectivity of the primer-directed extension process. However, enzymes that are active below 40*C, e.g., at 37*C, are also within the scope of this invention provided they are heat-

stable. Preferably, the optimum temperature ranges from about 50 to 90 C C, more preferably 60-80°C.

The thermostable enzyme herein may be obtained from any source and may be a native or recombinant protein. Examples of enzymes that have been reported in the literature as being resistant to heat include heat-stable polymerases, such as, e.g., polymerases extracted from the thermophilic bacteria Thermus flavus, Thermus ruber, Thermus thermophilus, Bacillus stearothermophilus (which has a somewhat lower temperature optimum than the others listed), Thermus aquaticus, Thermus lacteus, Thermus rubens, and Methanothermus ferv dus. In addition, thermostable polymerases isolated from the thermophilic archaebacteria include, for example, Sulfolobus sol fataricus, Sulfolobus acidocaldarius, Thermoplasma acidophilum, Methanobacterium thermoautotrophicum, and Desulfurococcus mobilis. The thermostable enzyme of the invention has the amino acid sequence presented in Figure 1. In addition, any thermostable polymerase containing at least 50% homology to any contiguous stretch of nine or more amino acids presented therein is also intended to be within the scope of the invention. This homology can be determined using commercially available data banks such as the European Molecular Biology Laboratory (EMBL) or Genbank. Moreover, as new thermostable polymerases are identified, specific regions of homology between the newly identified sequences and the Taq polymerase sequence may be determined using, for example, the Sequence Analysis Software Package of the Genetics Computer Group of the University of Wisconsin. Specific regions of homology include the following sequences (numbered according to the numbering of amino acids in Figure 1): residues 190- 204, 262-270, 569-587, 718-732, 743-759, and 778-790.

The preferred thermostable enzyme herein is a DNA polymerase isolated from Thermus aquaticus. Various strains thereof are available from the American Type Culture Collection, Rockville, Maryland, and is described by T.D. Brock, J. Bact. (1969) 98:289-297, and by T. Oshima, Arch. Microbiol. (1978) 117: 189-196. One of these preferred strains is strain YT-1.

For recovering the native protein the cells are grown using any suitable technique. One such technique is described by Kaledin et al., Biokhimi.ya (1980), supra, the disclosure of which is incorporated herein by reference. Briefly, the cells are grown on a medium, in one liter, of nitrilotriacetic acid (100 mg), tryptone (3 g), yeast extract (3 g), succinic acid (5 g), sodium sulfite (50 mg), riboflavin (1 mg), K 2 HP0 4 (522 mg), MgS0 4 (480 mg), CaCl 2 (222 mg), NaCl (20 mg), and trace elements. The pH of the medium is adjusted to 8.0 ± 0.2 with KOH. The yield is increased up to 20 grams of cells/liter if cultivated with vigorous aeration at a temperature of 70 C. Cells in the late logarithmic growth stage (determined by absorbance at 550 nm) are collected by centrifugation, washed with a buffer and stored frozen at -20°C.

In another method for growing the cells, described in Chien et al., J. Bacteriol. (1976), supra, the disclosure of which is incorporated herein by reference, a defined mineral salts medium containing 0.3% glutamic acid supplemented with 0.1 mg/1 biotin, 0.1 mg/1 thiamine, and 0.05 mg/1 nicotinic acid is employed. The salts include nitrilotriacetic acid, CaS0 , MgS0 4 , NaCl, KN0 3 , NaN0 3 , ZnS0 4 , H3BO3, CuS0 4 , NaMo0 , CoCl 2 , FeCl 3 , MnS0 , and Na 2 HP0 4 . The pH of the medium is adjusted to 8.0 with NaOH.

In the Chien et al . technique, the cells are grown initially at 75 C in a water bath shaker. On reaching a certain density, 1 liter of these cells is transferred to 16-liter carboys which are placed in hot-air incubators. Sterile air is bubbled through the cultures and the temperature maintained at 75 C. The cells are allowed to grow for 20 hours before being collected by centrifuge.

After cell growth, the isolation and purification of the enzyme take place in six stages, each of which is carried out at a temperature below room temperature, preferably about 4*C.

In the first stage or step, the cells, if frozen, are thawed, disintegrated by ultrasound, suspended in a buffer at about pH 7.5, and centrifuged.

In the second stage, the supernatant is collected and then fractionated by adding a salt such as dry ammonium sulfate. The appropriate fraction (typically 45-75% of saturation) is collected, dissolved in a 0.2 M potassium phosphate buffer preferably at pH 6.5, and dialyzed against the same buffer.

The third step removes nucleic acids and some protein. The fraction from the second stage is applied to a DEAE-cellulose column equilibrated with the same buffer as used above. Then the column is washed with the same buffer and the flow-through protein-containing fractions, determined by absorbance at 280 nm, are collected and dialyzed against a 10 mM potassium phosphate buffer, preferably with the same ingredients as the first buffer, but at a pH of 7.5.

In the fourth step, the fraction so collected is applied to a hydroxyapatite column equilibrated with the buffer used for dialysis in the third step. The column is then washed and the enzyme eluted with a linear gradient of a buffer such as 0.01 M to 0.5 M potassium phosphate buffer at pH 7.5 containing 10 mM 2-mercaptoethanol and 5% glycerine. The pooled fractions containing thermostable enzyme (e.g., DNA polymerase) activity are dialyzed against the same buffer used for dialysis in the third step.

In the fifth stage, the dialyzed fraction is applied to a DEAE-cellulose column, equilibrated with the buffer used for dialysis in the third step. . The column is then washed and the enzyme eluted with a linear gradient of a buffer- such as 0.01 to 0.6 M C1 in the buffer used for dialysis in the third step. Fractions with thermostable enzyme activity are then tested for contaminating deoxyribonucl eases (endo- and exonucl eases) using any suitable procedure. For example, the endonuclease activity may be determined electrophoreticall from the change in molecular weight of phage lambda DNA or supercoiled plasmid DNA after incubation with an excess of DNA polymerase. Similarly, exonucl ease activity may be determined electrophoretically from the change in molecular weight of DNA after treatment with a restriction enzyme that cleaves at several sites.

The fractions determined to have no deoxyribonuclease activity are pooled and dialyzed against the same buffer used in the third step.

In the sixth step, the pooled fractions are placed on a phosphocellulose column with a set bed volume. The column is washed and the enzyme eluted with a linear gradient of a buffer such as 0.01 to 0.4 M KC1 in a potassium phosphate buffer at pH 7.5. The pooled fractions having thermostable polymerase activity and no deoxyribonuclease activity are dialyzed against a buffer at pH 8.0. The molecular weight of the dialyzed product may be determined by any technique, for example, by SDS-PAGE analysis using protein molecular weight markers. The molecular weight of one of the preferred enzymes herein, the DNA polymerase purified from Thermus aquaticus, is determined by the above method to be about 86,000-90,000 daltons. The molecular weight of this same DNA polymerase as determined by the predicted amino acid sequence is calculated to be approximately 94,000 daltons. Thus, the molecular weight of the full length DNA polymerase is dependent upon the method employed to determine this number and falls within the range of 86,000-95,000 daltons.

The thermostable enzyme of this invention may also be produced by recombinant DNA techniques, as the gene encoding this enzyme has been cloned from Thermus aquaticus genomic DNA. The complete coding sequence for the Thermus aquaticus (Taq) polymerase can be derived from bacteriophage CH35:Taq#4-2 on an approximately 3.5 kilobase (kb) BglII-Asp718 (partial) restriction fragment contained within an ^18 kb genomic DNA insert fragment. This bacteriophage was deposited with the American Type Culture Collection (ATCC) on May 29, 1987 and has accession no. 40,366. Alternatively, the gene can be constructed by ligating an • » 730 base pair (bp) Bqlll-Hindlll restriction fragnent isolated from plasmid pFC83 (ATCC 67,422 deposited Mβy 29, 1987) to an Λ/2.68 kb HindIII-Asp718 restriction fragment isolated from plasmid pFC85 (ATCC 67,421 deposited May 29, 1987). The pFC83 restriction fragment comprises the ami no- terminus of

the Taq polymerase gene while the restriction fragment from pFC85 comprises the carboxy-terminus. Thus, ligation of these two fragments into a correspondingly digested vector with appropriate control sequences will result in the translation of a full-length Taq polymerase.

As stated previously, the DNA and deduced amino acid sequence of a preferred thermostable enzyme is provided in Figure 1. In addition to the N-terminal deletion described supra, it has also been found that the entire coding sequence of the Taq polymerase gene is not required to recover a biologically active gene product with DNA polymerase activity. Amino-terminal deletions wherein approximately one-third of the coding sequence is absent has resulted in producing a gene product that is quite active in polymerase assays.

In addition to the N-terminal deletions, individual amino acid residues in the peptide chain comprising Taq polymerase may be modified by oxidation, reduction, or other derivatization, and the protein may be cleaved to obtain fragments that retain activity. Such alterations that do not destroy activity do not remove the protein from the definition, and are specifically included. Thus, modifications to the primary structure itself by deletion, addition, or alteration of the amino acids incorporated into the sequence during translation can be made without destroying the high temperature DNA polymerase activity of the protein. Such substitutions or other alterations result in proteins having an amino acid sequence encoded by DNA falling within the contemplated scope of the present invention.

Polyclonal antiserum from rabbits immunized with the purified 86,000-95,000 da!ton polymerase of this invention was used to probe a Thermus aquaticus partial genomic expression library to obtain the appropriate coding sequence as described below. The cloned genomic sequence can be expressed as a fusion polypeptide, expressed directly using its own control sequences, or expressed by constructions using control sequences appropriate to the particular host used for expression of the enzyme.

Of course, the availability of DNA encoding these sequences provides the opportunity to modify the codon sequence so as to generate mutein (mutant protein) forms also having DNA polymerase activity. Thus, these tools can provide the complete coding sequence for Taq DNA polymerase from which expression vectors applicable to a variety of host systems can be constructed and the coding sequence expressed. Portions of the Taq polymerase-encoding sequence are useful as probes to retrieve other thermostable polymerase-encoding sequences in a variety of species. Accordingly, portions of the genomic DNA encoding at least four to six amino acids can be replicated in £^_ coli and the denatured forms used as probes or ol igodeoxyri onucleotide probes can be synthesized which encode at least four to six amino acids and used to retrieve additional DNAs encoding a thermostable polymerase. Because there may not be a precisely exact match between the nucleotide sequence in the Thermus aquaticus form and that in the corresponding portion of other species, oligomers containing approximately 12-18 nucleotides (encoding the four to six amino acid stretch) are probably necessary to obtain hybridization under conditions of sufficient stringency to eliminate false positives. The sequences encoding six amino acids would supply information sufficient for such probes.

In general, terms, the production of a recombinant form of Taq polymerase typically involves the following:

First, a DNA is obtained that encodes the mature (used here to include all muteins) enzyme or a fusion of the Taq polymerase to an additional sequence that does not destroy its activity or to an additional sequence cleavable under controlled conditions (such as treatment with peptidase) to give an active protein. If the sequence is uninterrupted by introns it is suitable for expression in any host. This sequence should be in an excisable and recoverable form.

The excised or recovered coding sequence is then preferably placed in operable linkage with suitable control sequences in a replicable expression vector. The vector is used to transform a

suitable host and the transformed host cultured under favorable conditions to effect the production of the recombinant Taq polymerase. Optionally the Taq polymerase is isolated from the medium or from the cells; recovery and purification of the protein may not be necessary in some instances, where some impurities may be tolerated.

Each of the foregoing steps can be done in a variety of ways. For example, the desired coding sequences may be obtained from genomic fragments and used directly in appropriate hosts. The constructions for expression vectors operable in a variety of hosts are made using appropriate replicons and control sequences, as set forth below. Suitable restriction sites can, if not normally available, be added to the ends of the coding sequence so as to provide an excisable gene to insert into these vectors.

The control sequences, expression vectors, and transformation methods are dependent on the type of host cell used to express the gene. Generally, procaryotic, yeast, insect or mammalian cells are presently useful as hosts. Procaryotic hosts are in general the most efficient and convenient for the production of recombinant proteins and therefore preferred for the expression of Taq polymerase. In the particular case of Taq polymerase, evidence indicates that considerable deletion at the N-terminus of the protein may occur under both recombinant and native conditions, and that the DNA polymerase activity Q the protein is still retained. It appears that the native proteins previously isolated may be the result of proteolytic degradation, and not translation of a truncated gene. The mutein produced from the truncated gene of plasmid pFC85 is, however, fully active in assays for DNA polymerase, as is that produced from DNA encoding the full-length sequence. Since it is clear that certain N-terminal shortened forms of the polymerase are active, the gene constructs used for expression of these polymerases may also include the corresponding shortened forms of the coding sequence.

Procaryotes most frequently are represented by various strains of E^_ col i . However, other microbial strains may also be used, such as bacilli, for example, Bacillus subtil is, various species of Pseudomonas, or other bacterial strains. In such procaryotic systems, plasmid vectors that contain replication sites and control sequences derived from a species compatible with the host are used. For example, ~ coli is typically transformed using derivatives of pBR322, a plasmid derived from an E. coli species by Bolivar, et al., Gene (1977) 2_:95. pBR322 contains genes for ampicillin and tetra- cycline resistance, and thus provides additional markers that can be either retained or destroyed in constructing the desired vector. Commonly used procaryotic control sequences, which are defined herein to include promoters for transcription initiation, optionally with an operator, along with ribosome binding site sequences, include such commonly used promoters as the beta-lactamase (penicillinase) and lactose (lac) promoter systems (Chang, et al., Nature (1977) 198:1056), the tryptophan (trp) promoter system (Goeddel , et al . , Nucleic Acids Res. (1980) 8_:4057) and the lambda-derived P L promoter (Shi atake, et al . , Nature (1981) 292:128) and N-gene ribosome binding site, which has been made useful as a portable control cassette (as set forth in U.S. Patent No. 4,711,845, issued December 8, 1987), which comprises a first DNA sequence that is the P^ promoter operably linked to a second DNA seque"^ correspond ng to ~-~ upstream of a third DNA sequence having at least one restriction site that permits cleavage within six bp 3' of the N RBS sequence. Also useful is the phosphatase A (phoA) system described by Chang, et al. in European Patent Publication No. 196,864 published October 8, 1986, assigned to the same assignee and incorporated herein by reference. However, any available promoter system compatible with procaryotes can be used. In addition to bacteria, eucaryotic microbes, such as yeast, may also be used as hosts. Laboratory strains of Saccharomyces cerevisiae, Baker's yeast, are most used, although a number of other strains are commonly available. While vectors employing the 2 micron origin of replication are illustrated (Broach, J. R., Meth. Enz.

(1983) 101:307), other plasmid vectors suitable for yeast expression are known (see, for example, Stinchcomb, et al., Nature (1979) 282:39, Tschempe, et al . , Gene (1980) 10:157 and Clarke, L., et al., Meth. Enz. (1983) 101:300). Control sequences for yeast vectors include promoters for the synthesis of glycolytic enzymes (Hess, et al., J. Adv. Enzyme Reg. (1968) 7 l49; Holland, et al., Biotechnology (1978) 17_:4900).

Additional promoters known in the art include the promoter for 3-phosphoglycerate kinase (Hitzeman, et al., J. Biol. Chem. (1980) 255:2073), and those for other glycolytic enzymes, such as glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofruc to kinase, glucose-6-phosphate isomerase, 3- phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and gluco kinase. Other promoters that have the additional advantage of transcription controlled by growth conditions are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, and enzymes responsible for maltose and galactose ultilization (Holland, supra).

It is also believed that terminator sequences are desirable at the 3' end of the coding sequences. Such terminators are found in the 3' untranslated region following the coding sequences in yeast- derived genes. Many of the vectors illustrated contain control sequences derived from the enolase gene containing plasmid peno46 (Holland, M. J., et al., J. Biol. Chem. (1981) 256_:1385) or the LEU2 gene obtained from YEpl3 (Broach, J., et al., Gene (1978) 8_:121); however, any vector containing a yeast-compatible promoter, origin of replication, and other control sequences is suitable.

It is also, of course, possible to express genes encoding polypeptides in eucaryotic host cell cultures derived from multi cellular organisms. See, for example, Tissue Culture, Academic Press, Cruz and Patterson, editors (1973). Useful host cell lines include murine myelomas N51, VERO and HeLa cells, and Chinese hamster ovary (CH0) cells. Expression vectors for such cells ordinarily

include promoters and control sequences compatible with mammalian cells such as, for example, the commonly used early and late promoters from Simian Virus 40 (SV 40) (Fiers, et al., Nature (1978) 273:113), or other viral promoters such as those derived from polyoma, Adenovirus 2, bovine papiloma virus, or avian sarcoma viruses, or immunoglobulin promoters and heat shock promoters. A system for expressing DNA in mammalian systems using the BPV as a vector is disclosed in U.S. Patent 4,419,446. A modification of this system is described in U.S. Patent 4,601,978. General aspects of mammalian cell host system transformations have been described by Axel, U.S. Patent No. 4,399,216. It now appears, also, that "enhancer" regions are important in optimizing expression; these are, generally, sequences found upstream of the promoter region. Origins of replication may be obtained, if needed, from viral sources. However, integration into the chromosome is a common mechanism for DNA replication in eucaryotes.

Plant cells are also now available as hosts, and control sequences compatible with plant cells such as the nopaline. synthase promoter and polyadenylation signal sequences (Depicker, A., et al . , J. Mol. Appl. Gen. (1982) :561) are available.

Recently, in addition, expression systems employing insect cells utilizing the control systems provided by baculovirus vectors have been described (Miller, D. W., et al . , in Ge.netic Engineering

(1986) Setlow, J. K. et al., eds., Plenum Publishing, Vol. 8, pp. 277- 297). These systems are also successful in producing Taq polymerase.

Depending on the host cell used, transformation is done using standard techniques appropriate to such cells. The calcium treatment employing calcium chloride, as described by Cohen, S. N., Proc. Natl. Acad. Sci. (USA) (1972) ^9_:2110 is used for procaryotes or other cells that contain substantial cell wall barriers. Infection with Agrobacterium tumefaciens (Shaw, C. H., et al., Gene (1983) 23_:315) is -used for certain plant cells. For mammalian cells without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb, Virology (1978) 52_:546 is preferred. Transformations

into yeast are carried out according to the method of Van Sol ngen, P., et al., J. Bact. (1977) 130:946 and Hsiao, C. L., et al., Proc. Natl. Acad. Sci. (USA) (1979) 76_:3829.

The strategy for isolating DNA encoding desired proteins, such as the Taq polymerase encoding DNA, using the bacteriophage vector lambda gtll, is as follows. A library can be constructed of EcoRI-flanked Alul fragnents, generated by complete digestion of Thermus aquaticus DNA, inserted at the EcoRI site in the lambda gtll phage (Young and Davis, Proc. Natl. Acad. Sci USA (1983) 80_:1194- 1198). Because the unique EcoRI site in this bacteriophage is located in the carboxy-terminus of the -galactosidase gene, inserted DNA (in the appropriate frame and orientation) is expressed as protein fused with -galactosidase under the control of the lactose operon promoter/operator.

Genomic expression libraries are then screened using the antibody plaque hybridization procedure. A modification of this procedure, referred to as "epitope selection," uses antiserum against the fusion protein sequence encoded by the phage, to confirm the identification of hybridized plaques. Thus, this library of recombinant phages could be screened with antibodies that recognize the 86,000-95,000 dalton Taq polymerase in order to identify phage that carry DNA segnents encoding the aπtigenic determinants of this protein.

Approximately 2 x 10^ recombinant phage are screened using total rabbit Taq polymerase antiserum. In this primary screen, positive signals are detected and one or more of these phages are purified from candidate plaques which failed to react with preimmune serum and reacted with immune serum and analyzed in some detail. To examine the fusion proteins produced by the recombinant phage, lysogens of the phage in the host Y1089 are produced. Upon induction of the lysogens and gel electrophoresis of the resulting proteins, each lysogen may be observed to produce a new protein, not found in the other lysogens, or duplicate sequences may result. Phage containing positive signals are picked; in this case, one positive

plaque was picked for further identification and replated at lower densities to purify recombinants and the purified clones were analyzed by size class via digestion with EcoRI restriction enzyme. Probes can then be made of the isolated DNA insert sequences and labeled appropriately and these probes can be used in conventional colony or plaque hybridization assays described in Maniatis et al., Molecular Cloning: A Laboratory Manual (1982), the disclosure of which is incorporated herein by reference.

The labeled probe was used to probe a second genomic library constructed in a Charon 35 bacteriophage (Wilhelmine, A. M. et al., Gene (1983) 2!6_:171-179). This library was made from Sau3A partial digestions of genomic Thermus aquaticus DNA and size fractionated fragments (15-20 kb) were cloned into the BamHI site of the Charon 35 phage. The probe was used to isolate phage containing DNA encoding the Taq polymerase. One of the resulting phage, designated

CH35 :Taq#4-2, was found to contain the entire gene sequence. Partial sequences encoding portions of the gene were also isolated.

Construction of suitable vectors containing the desired coding and control sequences employs standard ligation and restriction techniques that are well understood in the art. Isolated plasmids,

DNA sequences, or synthesized oligonucleotides are cleaved, tailored, and reli gated in the form desired.

Site-specific DNA cleavage is performed by treating with the suitable restriction enzyme (or enzymes) under conditions that are generally understood in the art, and the particulars of which are specified by the manufacturer of these commercially available restriction enzymes. See, e.g., New England Biolabs, Product Catalog. In general, about 1/. of plasmid or DNA sequence is cleaved by one unit of enzyme in about 20^1 of buffer solution; in the examples herein, typically an excess of restriction enzyme is used to ensure complete digestion of the DNA substrate. Incubation times of e> about one -hour to two hours at about 37 C are workable, although variations can be tolerated. After each incubation, protein is removed by extraction with phenol/chloroform, and may be followed by

ether extraction, and the nucleic acid recovered from aqueous fractions by precipitation with ethanol. If desired, size separation of the cleaved fragments may be performed by polyacryl amide gel or agarose gel electrophoresis using standard techniques. A general description of size separations is found in Methods in Enzymology (1980) 65_:499-560.

Restrict ion -cleaved fragments may be blunt-ended by treating with the large fragment of E. coli DNA polymerase I (Klenow) in the presence of the four deoxynucleotide triphosphates (dNTPs) using incubation times of about 15 to 25 minutes at 20 to 25 C in 50 mM Tris pH 7.6, 50 mM NaCl, 10 mM MgCl 2 , 10 mM DTT and 50-100 μW dNTPs. The Klenow fragment fills in at 5' sticky ends, but chews back protruding 3' single strands, even though the four dNTPs are present. If desired, selective repair can be performed by supplying only one of the, or selected, dNTPs within the limitations dictated by the nature of the sticky ends. After treatment with Klenow, the mixture is extracted with phenol /chloroform and ethanol precipitated. Treatment under appropriate conditions with SI nuclease results in hydrolysis of any single- stranded portion.-

Synthetic ol igonucleotides may be prepared using the tri ester method of Matteucci, et al., (J. Am. Chem. Soc. (1981) 103 :3185-3I91) or using automated synthesis methods. Kinasing of single strands prior to annealing or for labeling is achieved using an excess, e.g., approximately 10 units of polynucleotide kinase to 1 nM substrate in the presence of 50 mM Tris, pH 7.6, 10 mM MgCl , 5 mM dithiothreitol, 1-2 M ATP. If kinasing is for labeling of probe, the ATP will contain high specific activity gamma- P.

Ligations are performed in 15-30 uλ volumes under the following standard conditions and temperatures: 20 mM Tris-Cl pH 7.5, 10 mM MgCl 2 , 10 mM DTT, 33 BSA, 10 mM-50 M NaCl, and either 40 ^M ATP, 0.01-0.02 (Weiss) units T4 DNA ligase at 0 C C (for "sticky end" ligati-on) or 1 M ATP, 0.3-0.6 (Weiss) units T4 DNA ligase at 14 P C (for "blunt end" ligation). Intermolecular "sticky end" ligations are usually performed at 33-100 ^g/ml total DNA

concentrations (5-100 nM total end concentration). Intermolecular blunt end ligations (usually employing a 10-30 fold molar excess of linkers) are performed at l^M total ends concentration.

In vector construction employing "vector fragments", the vector fragment is commonly treated with bacterial alkaline phosphatase (BAP) in order to remove the 5' phosphate and prevent religation of the vector. BAP digestions are conducted at pH 8 in approximately 150 mM Tris, in the presence of Na + and Mg +2 using about 1 unit of BAP per mg of vector at 60° C for about one hour. In order to recover the nucleic acid fragments, the preparation is extracted with phenol /chloroform and ethanol precipitated. Alternatively, religation can be prevented in vectors that have been double digested by additional restriction enzyme digestion of the unwanted fragments.

For portions of vectors derived from cDNA or genomic DNA that require sequence modifications, site-specific primer-directed utagenesis is used. This technique is now standard in the art, and is conducted using a synthetic oligonucleotide primer complementary to a single-stranded phage DNA to be mutagenized except for limited mismatching, representing the desired mutation. Briefly, the synthetic oligonucl-eo±itle is ased as a -primer "to direct synthesis of a strand complementary to the phage, and the resulting double-stranded DNA is transformed into a phage-supporting host bacterium. Cultures of the transformed bacteria are plated in top agar, permitting plaque formation from single cells that harbor the phage. Theoretically, 50% of the new plaques will contain the phage having, as a single strand, the mutated form; 50% will have the original sequence. The plaques are transferred to nitrocellulose filters and the "Ivfts" hybridized with kinased synthetic primer at a temperature that permits hybridization of an exact match, but at which the mismatches with the original strand are sufficient to prevent hybridization, flaques ±tiat hybridize with the probe are then picked and cultured, and the DNA is recovered.

In the constructions set forth below, correct ligations for plasmid construction are confirmed by first transforming E. col i

strain MM294, or other suitable host, with the ligation mixture. Successful transformants are selected by ampicillin, tetracycline or other antibiotic resistance or using other markers, depending on the mode of plasmid construction, as is understood in the art. Plasmids from the transformants are then prepared according to the method of Clewell, D.B., et al., Proc. Natl. Acad. Sci. (USA) (1969) 62_:1159, optionally following chloramphenicol amplification (Clewell, D.B., J. Bacteriol. (1972) 110_:667). The isolated DNA is analyzed by restriction and/or sequenced by the dideoxy method of Sanger, F., et al., Proc. Natl. Acad. Sci. (USA) (1977) 74_:5463 as further described by Messing, et al . , Nucleic Acids Res. (1981) :309 » or b the method of Maxam, et al., Methods in E_nzymology (1980) 65 :499.

Host strains used in cloning and expression herein are as follows: For cloning and sequencing, and for expression of constructions under control of most bacterial promoters, JE^ coli strain MM294 obtained from E_ ; _ coli Genetic Stock Center GCSC #6135, was used as the host. For expression under control of the PiN j - j g promoter, E. coli strain K12 MC1000 lambda lysogen, N 7 N 53 cI857 SusP 80 , ATCC 39531 may be used. Used herein are ^ coli DG116, which was deposited with ATCC (ATCC 53606) on April 7, 1987 and _E^ col KB2, which was deposited with ATCC (ATCC 53075) on March 29, 1985.

For M13 phage recombinants, E. coli strains susceptible to phage infection, such .as E. coli K12 strain DG98, are employed. The DG98 strain has been deposited with ATCC July 13, 1984 and has accession number 39768.

Mammalian expression can be accomplished in COS-7 C0S-A2, CV-1, and murine cells, and insect cell-based expression in Spodoptera frugipeida).

In addition to the purification procedures previously described, the thermostable polymerase of the invention may be purified using hydrophobic interaction chromatography. Hydrophobic interaction chromatography is a separation technique in which substances are separated on the basis of differing strengths of

hydrophobic interaction with an uncharged bed material containing hydrophobic groups. Typically, the column is first equilibrated under conditions favorable to hydrophobic binding, e.g., high ionic strength. A descending salt gradient may be used to elute the sample.

According to the invention, the aqueous mixture (containing either native or recombinant polymerase) is loaded onto a column containing a relatively strong hydrophobic gel such as Phenyl Sepharose (manufactured by Pharmacia) or Phenyl TSK (manufactured by Toyo Soda). To promote hydrophobic interaction with a Phenyl Sepharose column, a solvent is used which contains, for example, greater than or equal to 0.2 M ammonium sulfate, with 0.2 M being preferred. Thus the column and the sample are adjusted to 0.2 M ammonium sulfate in 50 mM Tris-lmM EDTA buffer and the sample applied to the column. The column is washed with the 0.2 M ammonium sulfate buffer. The enzyme may then be eluted with solvents which attenuate hydrophobic interactions such as, for example, decreasing salt gradients, ethylene or propylene glycol, or urea. For the recombinant Taq polymerase, a preferred embodiment involves washing the column sequentially with the Tris-EDTA buffer and the Tris-EDTA buffer containing 20% ethylene glycol. The Taq polymerase is subsequently eluted from the column with a 0-4 M urea gradient in the Tris-EDTA ethylene glycol buffer.

For long-term stability, the enzyme herein must be stored in a buffer that contains one or more non-ionic polymeric detergents. Such detergents are generally those that have a molecular weight in the range of approximately 100 to 250,000, preferably about 4,000 to 200,000 daltons and stabilize the enzyme at a pH of from about 3.5 to about 9.5, preferably from about 4 to 8.5. .Examples of such detergents include those specified on pages 295-298 of McCutcheon's Emulsifiers & Detergents, North American edition (1983), published by the McCutcheon Division of MC Publishing Co., 175 Rock Road, Glen Rock, NJ (USA), the entire disclosure of which is incorporated herein by reference. Preferably, the detergents are selected from the group comprising ethoxylated fatty alcohol ethers and lauryl ethers, ethoxylated al kyl phenols, octylphenoxy polyethoxy ethanol compounds,

modified oxyethylated and/or oxypropylated straight-chain alcohols, polyethylene glycol monooleate compounds, polysorbate compounds, and phenolic fatty alcohol ethers. More particularly preferred are Tween 20, from ICI Americas Inc., Wilmington, DE, which is a pol yo xy ethyl at ed (20) sorbitan mono! au rate, and Iconol NP-40, from BASF Wyandotte Corp. Parsippany, NJ, which is an ethoxylated alkyl phenol (nonyl).

The thermostable enzyme of this invention may be used for any purpose in which such enzyme is necessary or desirable. In a particularly preferred embodiment, the enzyme herein is employed in the amplification protocol set forth below.

The amplification protocol using the enzyme of the present invention may be the process for amplifying existing nucleic acid sequences that is disclosed and claimed in U.S. Patent No. 4,683,202, issued July 28, 1987, the disclosure of which is incorporated herein by reference. Preferably, however, the enzyme herein is used in the amplification process disclosed below.

Specifically, the- ampl if i cation method involves amplifying at least one specific nucleic acid sequence contained in a nucleic acid or a mixture of nucleic acids, wherein if the nucleic acid is double-stranded, it consists of two separated complementary strands of equal or unequal length, which process comprises:

(a) contacting each nucleic acid strand with four different nucleotide triphosphates and one oligonucleotide primer for each different specific sequence being amplified, wherein each primer is selected to be substantially complementary to different strands of each specific sequence, such that the extension product synthesized from one primer, when it is separated from its complement, can serve as a template for synthesis of the extension product of the other primer, said contacting being at a temperature which promotes hybridization of each primer to its complementary nucleic acid strand;

(b) contacting each nucleic acid strand, at the same time as or after step (a), with a DNA polymerase from Thermus aquaticus which enables combination of the nucleotide triphosphates to form

primer extension products complementary to each strand of each nucleic acid;

(c) maintaining the mixture from step (b) at an effective temperature for an effective time to promote the activity of the enzyme, and to synthesize, for each different sequence being amplified, an extension product of each primer which is complementary to each nucleic acid strand template, but not so high as to separate each extension product from its complementary strand template;

(d) heating the mixture from step (c) for an effective time and at an effective temperature to separate the primer extension products from the templates on which they were synthesized to produce single-stranded molecules, but not so high as to denature irreversibly the enzyme;

(e) cooling the mixture from step (d) for an effective time and to an effective temperature to promote hybridization of each primer to each of the single-stranded molecules produced in step (d); and

(f) maintaining the mixture from step (e) at an effective temperature for an effective time to promote the activity of the enzyme and to synthesize, for each different sequence being amplified, an extension product of each primer which is complementary to each nucleic acid strand template produced in step (d), but not so h gh as to separate each extension product from its complemnntary strand template wherein the effective time and temperatures in steps (e) and (f) may coincide (steps (e) and (f) are carried out simultaneously), or may be separate.

Steps (d)-(f) may be repeated until the desired level of sequence amplification is obtained.

The amplification method is useful not only for producing large amounts of an existing completely specified nucleic acid sequence, but also for producing nucleic acid sequences which are known to exist but are not completely specified. In either case an initial copy of the sequence to be amplified must be available, although it need not be pure or a discrete molecule.

In general, the amplification process involves a chain reaction for producing, in exponential quantities relative to the number of reaction steps involved, at least one specific nucleic acid sequence given (a) that the ends of the required sequence are known in sufficient detail that oligonucleotides can be synthesized which will hybridize to them, and (b) that a small amount of the sequence is available to initiate the chain reaction. The product of the chain reaction will be a discrete nucleic acid duplex with termini corresponding to the ends of the specific primers employed. Any nucleic acid sequence, in purified or nonpurified form, can be util zed as the starting nucleic acid(s), provided it contains or is suspected to contain the specific nucleic acid sequence desired. Thus, the process may employ, for example, DNA or RNA, including messenger RNA, which DNA or RNA may be single-stranded or double-stranded. In addition, a DNA-RNA hybrid which contains one strand of each may be utilized. A mixture of any of these nucleic acids may also be employed, or the nucleic acids produced from a previous amplification reaction herein using the same or different primers may be so utilized. * The specific nucleic acid sequence to be amplified may be only a fraction of a larger molecule or can be present initially as a discrete molecule, so that the specific sequence constitutes the entire nucleic acid.

It is not necessary that the sequence to be amplified be present initially in a pure form; it may be a minor fraction of a complex mixture, such as a portion of the beta-globin gene contained in whole human DNA (as exemplified in Saiki et al., Science, 230, 1530-1534 (1985)) or a portion of a nucleic acid sequence due to a particular microorganism which organism might constitute only a very minor fraction of a particular biological sample. The starting nucleic acid sequence may contain more than one desired specific nucleic acid sequence which may be the same or different. Therefore, the amplification process is useful not only for producing large amounts of one specific nucleic acid sequence, but also for amplifying simultaneously more than one different specific nucleic acid sequence located on the same or different nucleic acid molecules.

The nucleic acid(s) may be obtained from any source, for example, from plasmids such as pBR322, from cloned DNA or RNA, or from natural DNA or RNA from any source, including bacteria, yeast, viruses, organelles, and higher organisms such as plants or animals. DNA or RNA may be extracted from blood, tissue material such as chorionic villi, or amn otic cells by a variety of techniques such as that described by Maniatis et al . , supra, p. 280-281.

If probes are used which are specific to a sequence being amplified and thereafter detected, the cells may be directly used without extraction of the nucleic acid if they are suspended in hypotonic buffer and heated to about 90-100°C, until cell lysis and dispersion of intracellular components occur, generally 1 to 15 minutes. After the heating step the amplification reagents may be added directly to the lysed cells. Any specific nucleic acid sequence can be produced by the amplification process. It is only necessary that a sufficient number of bases at both ends of the sequence be known in sufficient detail so that two oligonucleotide primers can be prepared which will hybridize to different strands of the desired sequence and at relative positions along the sequence such that an extension product synthesized from one primer, when it is separated from its template (complement), can serve as a template for extension of the other primer into a nucleic acid sequence of defined length. The greater the knowledge about the bases at both ends of the sequence, the greater can be the specificity of the primers for the target nucleic acid sequence, and thus the greater the efficiency of the process.

It will be understood that the word "primer" as used hereinafter may refer to more than one primer, particularly in the case where there is some ambiguity in the information regarding the terminal sequence(s) of the fragment to be amplified. For instance, in the case where a nucleic acid sequence is inferred from protein sequence information, a collection of primers containing sequences representing all possible codon variations based on degeneracy of the genetic code will be used for each strand. One primer from this

collection will be homologous with the end of the desired sequence to be amplified.

The ol gonucleotide primers may be prepared using any suitable method, such as, for example, the phosphotriester and phosphodiester methods described above, or automated embodiments thereof. In one such automated embodiment, diethylphosphoramidites are used as starting materials and may be synthesized as described by Beaucage et al., Tetrahedron Letters (1981), 22_:1859-1862. One method for synthesizing oligonucleotides on a modified solid support is described in U.S. Patent No. 4,458,066. It is also possible to use a primer which has been isolated from a biological source (such as a restriction endonuclease digest).

The specific nucleic acid sequence is produced by using the nucleic acid containing that sequence as a template. The first step involves contacting each nucleic acid strand with four different nucleotide triphosphates and one oligonucleotide primer for each different nucleic acid sequence being amplified or detected. If the nucleic acids to be amplified or detected are DNA, then the nucleotide triphosphates are dATP, dCTP, dGTP and TTP. The nucleic acid strands are used as a template for the synthesis of additional nucleic acid strands. This synthesis can be performed using any suitable method. Generally it occurs in a buffered aqueous solution, preferably at a pH of 7-9, most preferably about 8. Preferably, a molar excess (for cloned nucleic acid, usually about 1000:1 primer:tempiate, and for genomic nucleic acid, usually about 10°:1 primer:tempiate) of the two oligonucleotide primers is added to the buffer containing the separated template strands. It is understood, however, that the amount of complementary strand may not be known if the process herein is used for diagnostic applications, so that the amount of primer relative to the amount of complementary strand cannot be determined with certainty. As a practical matter, however, the amount of primer added will generally be in molar excess over the amount of complementary strand (template) when the sequence to be amplified is contained in a mixture of complicated long-chain

nucleic acid strands. A large molar excess is preferred to improve the efficiency of the process.

Preferably the concentration of nucleotide triphosphates is 150-200^M each in the buffer for amplification and MgCl 2 is present in the buffer in an amount of 1.5-2 mM to increase the efficiency and specificity of the reaction.

The resulting solution is then treated according to whether the nucleic acids being amplified or detected are double or single- stranded. If the nucleic acids are single- stranded, then no denaturation step need be employed, and the reaction mixture is held at a temperature which promotes hybridization of the primer to its complementary target (template) sequence. Such temperature is generally from about 35" C to 65° C or more, preferably about 37-60 β C for an effective time, generally one-half to five minutes, preferably one-three minutes. Preferably, 45-58 p C is used for Taq polymerase and 15-mer primers to increase the specificity of primer hybridization. Shorter primers need lower temperatures.

The complement to the original single-stranded nucleic acid may be synthesized by add ' ing one or two oligonucleotide primers thereto. If an appropriate single primer is added, a primer extension product is synthesized in the presence of the primer, the DNA polymerase from Thermus aquaticus and the nucleotide triphosphates. The product will be partially complementary to the single-stranded nucleic acid and wilt hybridize with the nucleic acid strand to form a duplex of strands of unequal length which may then be separated into single strands as described above to produce two single separated complementary strands. Alternatively, two appropriate primers may be added to the single-stranded nucleic acid and the reaction carried out. If the nucleic acid contains two strands, it is necessary to separate the strands of the nucleic acid before it can be used as the template. This strand separation can be accomplished by any suitable denaturing method including physical, chemical or enzymatic means. One preferred physical method of separating the strands of the nucleic

aci d invol ves heati ng the nucl ei c aci d until it i s compl etel y ( _ 99%) denatured . Typical heat denaturation invol ves temperatures rangi ng from about 90 to 105"C for times general ly rangi ng from about 0.5 to 5 c minutes . Preferably the effecti ve denaturing temperature is 90-100 C for 0.5 to 3 minutes . Strand separation may al so be induced by an enzyme from the class of enzymes known as helicases or the enzyme RecA, whi ch has hel icase activity and in the presence of ri boATP is known to denature DNA. The reaction conditions suitabl e for separating the strands of nucl ei c aci ds with hel icases are descri bed by Kuhn Hoffmann-Berl ing, CSH-Quantitative Biology, 43 :63 (1978), and techni ques for usi ng RecA are revi ewed in C . Raddi ng, Ann . Rev . Genetics , 16_:405-37 (1982) . The denaturation produces two separated compl ementary strands of equal or unequal l ength.

If the double-stranded nucleic acid is denatured by heat, the reaction mixture is allowed to cool to a temperature which promotes hybridization of each primer present to its complementary target (template) sequence. This temperature is usually from about 35 C* to 65 C or more, depending on reagents, preferably 37-60 C, maintained for an effective time, generally 0.5 to 5 minutes, and preferably 1-3 minutes. In " practical terms, the temperature is simply lowered from about "~ ° Z to as low as 37 C, preferably to about 45-58* C for Taq polymerase, and hybridization occurs at a temperature within this range.

Whether the nucleic acid is single- or double-stranded, the DNA polymerase from Thermus aquaticus may be added at the denaturation step or when the temperature is being reduced to or is in the range for promoting hybridization. The reaction mixture is then heated to a temperature at which the activity of the enzyme is promoted or optimized, i.e., a temperature sufficient to increase the activity of the enzyme in facilitating synthesis of the primer extension products from the hybridized primer and template. The temperature must actually be sufficient to synthesize an extension product of each primer which is complementary to each nucleic acid template, but must not be so high as to denature each extension product from its complementary template (i.e., the temperature is generally less than about 80*0-90*0.

Depending mainly on the types of enzyme and nucleic acid(s) employed, the typical temperature effective for this synthesis

0 0 reaction generally ranges from about 40 to 80 C, preferably 50-75 C. The temperature more preferably ranges from about 65-75°C when a DNA polymerase from Thermus aquaticus is employed. The period of time required for this synthesis may range from about 0.5 to 40 minutes or more, depending mainly on the temperature, the length of the nucleic acid, the enzyme and the complexity of the nucleic acid mixture, preferably one to three minutes. If the nucleic acid is longer, a longer time period is generally required. The presence of dimethyl sul foxide (DMSO) is not necessary or recommended because DMSO was found to inhibit Taq polymerase enzyme activity.

The newly synthesized strand and its complementary nucleic acid strand form a double-stranded molecule which is used in the succeeding steps of the process. In the next step, the strands of the double-stranded molecule are separated by heat denaturation at a temperature effective to denature the molecule, but not so high that the thermostable enzyme is completely and irreversibly denatured or inactivated. Depending mainly on the type of enzyme and the length of nucleic acid, this temperature generally ranges from about 90 to 105 C, more preferably 90-100 C, and the time for denaturation typically ranges from 0.5 to four minutes, depending mainly on the temperature and nucleic acid length.

After this time, the temperature is decreased to a level which promotes hybridization of the primer to its complementary single-stranded molecule (template) produced from the previous step. Such temperature is described above.

After this hybridization step, or in lieu of (or concurrently with) the hybridization step, the temperature is adjusted to a temperature that is effective to promote the activity of the thermostable enzyme to enable synthesis of a primer extension product using as template the newly synthesized strand from the previous step. The temperature again must not be so high as to separate (denature) the extension product from its template, as previously

described (usually from 40 to 80 C for 0.5 to 40 minutes, preferably

50 to 70 C for one-three minutes). Hybridization may occur during this step, so that the previous step of cooling after denaturation is not required. In such a case, using simultaneous steps, the preferred

__> - temperature range is 50-70 C.

The heating and cooling steps of strand separation, hybridization, and extension product synthesis can be repeated as often as needed to produce the desired quantity of the specific nucleic acid sequence, depending on the ultimate use. The only Q limitation is the amount of the primers, thermostable enzyme and nucleotide triphosphates present. Preferably, the steps are repeated at least twice. For use in detection, the number of cycles will depend, e.g., on the nature of the sample. For example, fewer cycles will be required if the sample being amplified is pure. If the sample 5 is a complex mixture of nucleic acids, more cycles will be required to amplify the signal sufficiently for its detection. For general amplification and detection, preferably the process is repeated at least 20 times.

When labeled sequence-specific probes are employed as 0 described below, preferably the steps are repeated at least five times. When human genomic DNA is employed with such probes, the process is repeated preferably 15-30 times to amplify the sequence sufficiently that a. clearly detectable signal is produced, i.e., so that background noise does not interfere with detection. 5 As will be described in further detail below, the amount of the specific nucleic acid sequence produced will accumulate in an exponential fashion.

No additional nucleotides, primers, or thermostable enzyme need be added after the initial addition, provided that the enzyme has 0 not become denatured or inactivated irreversibly, in which case it is necessary to replenish the enzyme after each denaturing step. Addition of such materials at each step, however, will not adversely affect the reaction.

When it is desired to produce more than one specific nucleic acid sequence from the first nucleic acid or mixture of nucleic acids, the appropriate number of different oligonucleotide primers are utilized. For example, if two different specific nucleic acid ' sequences are to be produced, four primers are utilized. Two of the primers are specific for one of the specific nucleic acid sequences and the other two primers are specific for the second specific nucleic acid sequence. In this manner, each of the two different specific sequences can be produced exponentially by the present process. After the appropriate length of time has passed to produce the desired amount of the specific nucleic acid sequence, the reaction may be halted by inactivating the enzyme in any known manner (e.g., by adding EDTA, phenol, SDS or CHCI3) or by separating the components of the reaction. The amplification process may be conducted continuously. In one embodiment of an automated process, the reaction mixture may be temperature cycled such that the temperature is programmed to be controlled at a certain level for a certain time.

One such instrument for this purpose utilizes a liquid handling system under computer control to make liquid transfers of enzyme stored at a controlled temperature in a first receptacle into a second receptacle whose temperature is controlled by the computer to conform to a curtain incubation profile. The second receptacle stores the nucleic acid sequence(s) to be amplified plus the nucleotide tri hosphates and primers. The computer includes a user interface through which a user can enter process parameters that control the characteristics of the various steps in the amplification sequence such as the times and temperatures of incubation, the amount of enzyme to transfer, etc. A preferred machine that may be employed utilizes temperature cycling without a liquid handling system because the enzyme need not be transferred at every cycle. Such a machine is described more completely in European Patent Publication No. 236,069, published September 9, 1987, the disclosure of which is incorporated

herein by reference. Briefly, this instrument consists of the following systems:

1. A heat-conducting container for holding a given number of tubes, preferably 500^)1 tubes, which contain the reaction mixture

5 of nucleotide triphosphates, primers, nucleic acid sequences, and enzyme.

2. A means to heat, cool, and maintain the heat-conducting container above and below room temperature, which means has an input for receiving a control signal for controlling which of the 0 temperatures at or to which the container is heated, cooled or maintained. (These may be Peltier heat pumps available from Materials

Electronics Products Corporation in Trenton, NJ or a water heat exchanger.)

3. A computer means (e.g., a microprocessor controller), * * * coupled to the input of said means, to generate the signals that control automatically the amplification sequence, the temperature levels, and the temperature ramping and timing.

A representative amplification protocol for double-stranded DNA containing the desired sequence [β comprised of complementary 0 strands [S + J a* 10" L S" J is as f°π° ws - During the first and each subsequent reaction cycle, extension of each oligonucleotide primer on the original template will produce one new ssDNA molecule product of indefinite length that terminates with only one of the primers. These products, hereafter referred to as "long products," will accumulate in 5 a linear fashion; that is, the amount present after any number of cycles will be proportional to the number of cycles.

The long products thus produced will act as templates for one or the other of the oligonucleotide primers during subsequent cycles and will produce molecules of the desired sequence j_S + J or 0 s " 7. These molecules will also function as templates for one or the other of the oligonucleotide primers, producing further [s + J and [S " J, and thus a chain reaction can be sustained that will result in the accumulation of S at an exponential rate relative to the number of cycles.

By-products formed by oligonucleotide hybridizations other than those intended are not self-catalytic (except in rare instances) and thus accumulate at a linear rate. Each strand which terminates with the oligonucleotide sequence of one primer and the complementary sequence of the other is the specific nucleic acid sequence S that is desired to be produced.

The amount of original nucleic acid remains constant in the entire process, because it is not replicated. The amount of the long products increases linearly because they are produced only from the original nucleic acid. The amount of the specific sequence increases exponentially. Thus, the specific sequence will become the predominant species. This is illustrated in the following table, which indicates the relative amounts of the species theoretically present after n cycles, assuming 100% efficiency at each cycle:

Number of Double Strands After 0 to n C cles

When a single-stranded nucleic acid is utilized as the template, only one long product is formed per cycle.

A sequence within a given sequence can be amplified after a given number of amplifications to obtain greater specificity of the

reaction by adding after at l east one cycl e of ampl ification a set of primers that are complementary to internal sequences (that are not on the ends) of the sequence to be amplified. Such primers may be added at any stage and will provide a shorter amplified fragment. Alternatively, a longer fragment can be prepared by using primers with non-complementary ends but having some overlap with the primers previously utilized in the amplification.

The ampl fication method may be utilized to clone a particular nucleic acid sequence for insertion into a suitable expression vector. The vector may be used to transform an appropriate host organism to produce the gene product of the sequence by standard methods of recombinant DNA technology. Such cloning may involve direct ligation into a vector using blunt-end ligation, or use of restriction enzymes to cleave at sites contained within the primers.

In addition, the amplification process can be used for j_n_ vitro mutagenesis. The oligodeoxyribonucleotide primers need not be exactly complementary to the DNA sequence that is being amplified. It is only necessary that they be able to hybridize to the sequence sufficiently well to be extended by the thermostable enzyme. The product of an ampl fication reaction wherein the primers employed are not exactly complementary to the original template will contain the sequence of the primer rather than the template, thereby introducing an i_n_ vitro mutation. In further cycles this mutation will be amplified with an undiminished efficiency because no further mispaired priming is required. The mutant thus produced may be inserted into an appropriate vector by standard molecular biological techniques and might confer mutant properties on this vector such as the potential for production of an altered protein.

The process of making an altered DNA sequence as described above could be repeated on the altered DNA using different primers to induce further sequence changes. In this way, a series of mutated sequences cόαld gradually be produced wherein each new addition to the series could differ from the last in a minor way, but from the original DNA source sequence in an increasingly major way. In this

manner, changes could be made ultimately which were not feasible in a single step due to the inability of a v ry seriously mismatched primer to function.

In addition, the primer can contain as part of its sequence a non- complementary sequence, provided that a sufficient amount of the primer contains a sequence that is complementary to the strand to be amplified. For example, a nucleotide sequence that is not complementary to the template sequence (such as, e.g., a promoter, linker, coding sequence, etc.) may be attached at the 5' end of one or both of the primers, and thereby appended to the product of the amplification process. After the extension primer is added, sufficient cycles are run to achieve the desired amount of new template containing the non- complementary nucleotide insert. This allows production of large quantities of the combined fragments in a relatively short period of time (e.g., two hours or less) using a simple technique.

The amplification method may also be used to enable detection and/or characterization of specific nucleic acid sequences associated with infectious diseases, genetic disorders or cellular disorders such as cancer, e.g., oncogenes. Amplification is useful when the amount of nucleic acid available for analysis is very small, as, for example, in the prenatal diagnosis of sickle cell anemia using DNA obtained from fetal cells. Amplification is particularly useful if such an analysis is to be done on a small sample using non- radioactive detection techniques which may be inherently insensitive, or where radioactive techniques are being employed, but where rapid detection is desirable.

For the purposes of this invention, genetic diseases may include specific deletions and/or mutations in genomic DNA from any organism, such as, e.g., sickle cell anemia, cystic fibrosis, alpha- thai assemi a, beta-thai assemi a, and the like. Sickle cell anemia can be readily detected via oligomer restriction analysis as described by EP Patent Publication 164,054 published December 11, 1985, or via a RFLP-like analysis following amplification of the appropriate DNA

sequence by the amplification method. Alpha-Thai assemi a can be detected by the absence of a sequence, and beta-thai assemi a can be detected by the presence of a polymorphic restriction site closely linked to a mutation that causes the disease. All of these genetic diseases may be detected by amplifying the appropriate sequence and analyzing it by Southern blots without using radioactive probes. In such a process, for example, a small sample of DNA from, e.g., amniotic fluid containing a very low level of the desired sequence is amplified, cut with a restriction enzyme, and analyzed via a Southern blotting technique. The use of non- radioactive probes is facilitated by the high level of the amplified signal .

In another embodiment, a small sample of DNA may be amplified to a convenient level and then a further cycle of extension reactions performed wherein nucleotide derivatives which are readily detectable (such as "p-labeled or biotin-labeled nucleotide triphosphates) are incorporated directly into the final DNA product, which may be analyzed by restriction and electrophoretic separation or any other appropriate method. in a further embodiment, the nucleic acid may be exposed to a particul-ar restriction endonuclease prior to amplification. Since a sequence which has been cut cannot be amplified, the appearance of an amplified fragment,' despite prior restriction of the DNA sample, implies the absence' of a site for the endonuclease within the amplified sequence. The presence or absence of an amplified sequence can be detected by an appropriate method.

A practical application of the amplification technique, that is, in facilitating the detection of sickle cell anemia via the oligomer restriction technique described in EP 164,054, supra, and by Saiki et al., Bio/Technology, Vol. 3, pp. 1008-1012 (1985) is described in detail in the Saiki et al . .Science article cited above. In that Science article, a specific amplification protocol is exemplified using a beta-globin gene segment.

The amplification method herein may also be used to detect directly single-nucleotide variations in nucleic acid sequence (such as genomic DNA) using sequence-specific oligonucleotides, as described more fully in European Patent Publication 237,362, published September 16, 1987, the disclosure of which is incorporated herein by reference.

Briefly, in this process, the amplified sample is spotted directly on a series of membranes, and each membrane is hybridized with a different labeled sequence-specific oligonucleotide probe.

After hybridization the sample is washed and the label is detected. This technique is especially useful in detecting DNA polymorphisms.

Various infectious diseases can be diagnosed by the presence in clinical samples of specific DNA sequences characteristic of the causative microorganism. These include bacteria, such as Salmonella, Chla ydia, Neisseria; viruses, such as the hepatitis viruses, and parasites, such as the Plasmodiu responsible for malaria. U.S. Patent Reexamination Certificate Bl 4,358,535 issued to Fal kow et al . on May 13, 1986 describes the use of specific DNA hybridization probes for the diagnosis of infectious diseases. A relatively small number of pathogenic organisms may be present in a clinical sample from an infected patient and the DNA extracted from these may constitute only a very small fraction of the total DNA in the sample. Specific amplification of suspected pathogen-specific sequences prior to immobilization and detection by hybridization of the DNA samples could greatly improve the sensitivity and specificity of traditional procedures.

Routine clinical use of DNA probes for the diagnosis of infectious diseases would be simplified considerably if non- radioactively labeled probes could be employed as described in EP 63,879 to Ward. In this procedure biotin-containing DNA probes are detected by chromogenic enzymes linked to avidin or biotin-specific antibodies. This type of detection is convenient, but relatively insensitive The combination of specific DNA amplification by the present method and the use of stably labeled probes could provide the convenience and sensitivity required to make the Fal kow et al . and Ward procedures useful in a routine clinical setting.

A specific use of the amplification technology for detecting or monitoring for the AIDS virus is described in European Patent Publication 229,701, published July 22, 1987, the disclosure of which is incorporated herein by reference. Briefly, the amplification and detection process is used with primers and probes which are designed to amplify and detect, respectively, nucleic acid sequences that are substantially conserved among the nucleic acids in AIDS viruses and specific to the nucleic acids in AIDS viruses. Thus, the sequence to be detected must be sufficiently complementary to the nucleic acids in AIDS viruses to initiate polymerization preferably at room temperature in the presence of the enzyme and nucleotide triphosphates.

A preferred amplification process uses labeled primers. The label on the amplified product may be used to "capture" or immobilize the product for subsequent detection (e.g., biotinylated amplification primers yield labeled products that can be "captured" by their interaction with avidin or strepavidin). As demonstrated in the aforementioned amplification protocols, the extension product of one labeled primer when hybridized to the other becomes a template for the production of the desired " specific nucleic acid sequence, and vice versa, and the process is repeated as often as necessary to produce the desired amount of the sequence. Examples of specific preferred reagents that can be employed as the label are provided in U.S. Patent No. 4,582,789, the disclosure of which is incorporated herein by reference. The amplification process can also be utilized to produce sufficient quantities of DNA from a single copy human gene such that detection by a simple non-specific DNA stain such as ethidium bromide can be employed to diagnose DNA directly.

In addition to detecting infectious diseases and pathological abnormalities in the genome of organisms, the amplification process can also be used to detect DNA polymorphisms which may not be associated with any pathological state.

In summary, the ampl fication process is seen to provide a process for amplifying one or more specific nucleic acid sequences

using a chain reaction and a thermostable enzyme, in which reaction primer extension products are produced which can subsequently act as templates for further primer extension reactions. The process is especially useful in detecting nucleic acid sequences which are initially present in only very small amounts.

The following examples are offered by way of illustration only and are by no means intended to limit the scope of the claimed invention. In these examples, all percentages are by weight if for solids and by volume if for liquids, unless otherwise noted, and all temperatures are given in degrees Celsius.

EXAMPLE I

A. Synthesis of the Primers

The following two oligonucleotide primers were prepared by the method described below: 5'-ACACAACTGTGTTCACTAGC-3' (PC03)

5'-CAACπCATCCACGπCACC-3' (PC04)

These primers, both 20-mers, anneal to opposite strands of the genomic DNA with their 5' ends separated by a distance of 110 base pairs.

1. Automated Synthesis Procedures: The di ethyl phosphoramidites, synthesized according to Beaucage and

Caruthers (Tetrahedron Letters (1981) 22:1859-1862) were sequentially condensed to a nucleσside derivatized controlled pore glass support using a Biosearch SAM-1. The procedure included detritylation with trichloroacetic acid in dichloro ethane, condensation using benzotriazole as activating proton donor, and capping with acetic anhydride and dimethyl arainopyridine in tetrahydrofuran and pyridine. Cycle time was approximately 30 minutes. Yields at each step were essentially quantitative and were determined by collection and spectroscopic examination of the dimethoxytrityl alcohol released during detritylation.

2. Oligodeoxyribonucleotide Deprotection and Unification Procedures: The solid support was removed from the column and exposed

to 1 ml concentrated ammonium hydroxide at room temperature for four hours in a closed tube. The support was then removed by filtration and the solution containing the partially protected oligodeoxynucleotide was brought to 55 C for five hours. Ammonia was removed and the residue was applied to a preparative polyacrylamide gel. Electrophoresis was carried out at 30 volts/cm for 90 minutes after which the band containing the product was identified by UV shadowing of a fluorescent plate. The band was excised and eluted e with 1 ml distilled water overnight at 4 C. This solution was applied to an Altech RP18 column and eluted with a 7-13% gradient of acetonitrile in 1% ammonium acetate buffer at pH 6.0. The elution was monitored by UV absorbance at 260 nm and the appropriate fraction collected, quantitated by UV absorbance in a fixed volume and evaporated to dryness at room temperature in a vacuum centrifuge. 3. Characterization of Oligodeoxyribonucleotides: Test aliquots of the purified oligonucleotides were 32 P labeled with polynucleotide kinase and gamma- 32 P-ATP. The labeled compounds were examined by autoradiography of 14-20% polyacryl amide gels after electrophoresis for 45 minutes at 50 volts/cm. This procedure verifies the molecular weight. Base composition was determined by digestion of the oligodeoxyribonucleotide to nucleosides by use of venom diesterase and bacterial alkaline phosphatase and subsequent separation and quantisation of the derived nucleosides using a reverse phase HPLC column afid a 10% acetonitrile, 1% ammonium acetate mobile phase.

B. Isolation of Human Genomic DNA from Cell Line

High molecular weight genomic DNA was isolated from a T cell line, Molt 4, homozygous for normal beta-globin available from the Human Genetic Mutant Cell Depository, Camden, NJ as GM2219C using essentially the method of Maniatis et al., supra , p. 280-281.

C. Purification of a Polymerase From Thermus aquaticus

Thermus aquaticus strain YTl, available without restriction from the American Type Culture Collection, 12301 Park! awn Drive, Rockville, MD, as ATCC No. 25,104 was grown in flasks in the following medium:

Sodium Citrate 1 M

Potassium Phosphate, pH 7.9 5 mM

Ammonium Chloride 10 mM

Magnesium Sulfate 0.2 mM Calcium Chloride 0.1 mM

Sodium Chloride 1 g/1

Yeast Extract 1 g/1

Tryptone 1 g/1

Glucose 2 g/1 Ferrous Sulfate 0.01 mM

(The pH was adjusted to 8.0 prior to autoclaving. )

A 10-liter fermentor was inoculated from a seed flask o cultured overnight in the above medium at 70 C. A total of 600 ml from the seed flask was used to inoculate 10 liters of the same medium. The pH was controlled at 8.0 with ammonium hydroxide with the dissolved oxygen at 40%, with the temperature at 70 C, and with the stirring rate at 400 rpm.

After growth of the cells, they were purified using the protocol (with slight modification) of Kaledin et al., supra, through the first five stages and using a different protocol for the sixth

* o stage. All six steps were conducted at 4 C. The rate of fractionation on columns was 0.5 columns/hour and the volumes of gradients during elution were 10 column volumes. An alternative and preferred purification protocol is provided in Example XIII below. Briefly, the above culture of the ^ aquaticus cells was harvested by centrifugation after nine hours of cultivation, in late log phase, at a cell density of 1.4 g dry weight/1. Twenty grams of cells were resuspended in 80 ml of a buffer consisting of 50 mM TrisΗCl pH 7.5, 0.1 mM EDTA. Cells were lysed and the lysate was centrifuged for two hours at 35,000 rpm in a Beckman TI 45 rotor at 4 C. The supernatant was collected (fraction A) and the protein

fraction prec pitating between 45 and 75% saturation of ammonium sulfate was collected, dissolved in a buffer consisting of 0.2 M potassium phosphate buffer, pH 6.5, 10 mM 2-mercaptoethanol , and 5% glycerine, and finally dialyzed against the same buffer to yield fraction B. Fraction B was applied to a 2.2 x 30-cm column of DEAE- cellulose, equilibrated with the above described buffer. The column was then washed with the same buffer and the fractions containing protein (determined by absorbance at 280 nm) were collected. The combined protein fraction was dialyzed against a second buffer, containing 0.01 M potassium phosphate buffer, pH 7.5, 10 mM 2- ercaptoethanol , and 5% glycerine, to yield fraction C.

Fraction C was applied to a 2-6 x 21-cm column of hydroxyapatite, equilibrated with a second buffer. The column was then washed and the enzyme was eluted with a linear gradient of 0. Oi¬ 0.5 M potassium phosphate buffer, pH 7.5, containing 10 mM 2- mercaptoethanol and 5% glycerine. Fractions containing DNA polymerase activity (90-1130 mM .potassium phosphate) were combined, concentrated four-fold using an Amicon stirred cell and YMIO membrane, and dialyzed against the second buffer to yield fraction D. Fraction D was applied to a 1.6 x 28-cm column of DEAE- cellulose, equilibrated with the second buffer. The column was washed and the polymerase was eluted with a linear gradient of 0.01-0.5 M potassium phosphate in the second buffer. The fractions were assayed for contaminating endonuclease(s) and exonucl ease (s) by electrophoretically detecting the change in molecular weight of phage lambda DNA or supercoiled plasmid DNA after incubation with an excess of DNA polymerase (for endonuclease) and after treatment with a restriction enzyme that cleaves the DNA into several fragments (for exonuclease). Only those DNA polymerase fractions (65-95 mM potassium phosphate) having minimal nuclease contaminatiwi were pooled. To the pool was added autoclaved gelatin in an amount of 250 /Lg/ml , and dialysis was conducted against the second buffer to yield Fraction E.

Fraction E was applied to a phosphocellulose column and eluted with a 100 ml gradient (0.01-0.4 M KC1 gradient in 20 mM potassium phosphate buffer pH 7.5). The fractions were assayed for contaminating endo/exonuclease(s) as described above as well as for polymerase activity (by the method of aledin et al . ) and then pooled. The pooled fractions were dialyzed against the second buffer, then concentrated by dialysis against 30% glycerine and the second buffer.

The molecular weight of the polymerase was determined by SDS-PAGE analysis. Marker proteins (Bio-Rad low molecular weight standards) were phosphorylase B (92,500), bovine serum albumin

(66,200), ovalbumin (45,000), carbonic anhydrase (31,000), soybean trypsin inhibitor (21,500), and lysozyme (14,40D).

Preliminary data suggest that the polymerase has a molecular weight of about 86,000-95,000 daltons, not 62,000-63,000 daltons reported in the literature (e.g., by Kaledin et al.).

The polymerase was incubated in 50 j ~ of a mixture containing either 25 mM Tris-HCl pH 6.4 or pH 8.0, and 0.1 M KC1, 10 mM MgCl , 1 mM 2-mercaptoethanol , 10 nmoles each of dGTP, dATP, and TTP, and 0.5 ^ ( 3 H) dCTP, 8/<g "activatei* -c-alf thy us DNA, and 0.5- 5 units of the polymerase. "Activated" DNA is a native preparation of DNA after partial hydrolysis with DNase I until 5% of the DNA was transferred to the acid-soluble fraction. The reaction was conducted at 70 C for 30 minutes, and stopped by adding 50 j j of a saturated aqueous solution of sodium pyrophosphatε containing 0.125 M EDTA- Na . Samples were processed and activity was determined as described by Kaledin et al . , supra.

The results showed that at pH 6.4 the polymerase was more than one-half as active as at pH 8.0. In. contrast, Kaledin et al. found that at pH about 7,0, the enzyme therein had 8% of the activity at pH 8.3. Therefore, the pH profile for the thermostable enzyme herein is broader than that for the Kaledin -et al. enzyme.

Finally, when only one or more nucleotide tr phosphates were eliminated from a DNA polymerase assay reaction mixture, very little,

if any, activity was observed using the enzyme herein, and the activity was consistent with the expected value, and with an enzyme exhibiting high fidelity. In contrast, the activity observed using the Kaledin et al. (supra) enzyme is not consistent with the expected value, and suggests misincorporation of nucleotide tri phosphate (s).

D. Amplification Reaction

One microgram of the genomic DNA described above was diluted in an initial 100 1 aqueous reaction volume containing 25 mM Tris'HCl buffer (pH 8.0), 50 M KC1, 10 mM MgCl 2 , 5 M dithiothre tol, 200^g/ml gelatin, 1 M of primer PC03, l^M of primer PC04, 1.5 mM dATP, 1.5 mM dCTP, 1.5 mM dGTP and 1.5 mM TTP. The sample was heated for 10 minutes at 98 ' C to denature the genomic DNA, then cooled to room temperature. Four microliters of the polymerase from Thermus aquaticus was added to the reaction mixture and overlaid with a 100 l mineral oil cap. The sample was then placed in the aluminum heating block of the liquid handling and heating instrument described above.

The DNA sample underwent 20 cycles of amplification in the machine, repeating the following program cycle:

1) heating from 37° C to 98 C in heating block over a period of 2.5 minutes; and

2) cooling from 98 # C to 37 C over a period of three minutes to allow the primers and DNA to anneal.

After the * last cycle, the sample was incubated for an additional 10 minutes at 55" C to complete the final extension reaction.

E. Synthesis and Phosphorylat on of Oligodeoxyribonucleotide Probes

A labeled DNA probe, designated RS24, of the following sequence was prepared:

5 ' -*CCCACAGGGCAGTAACGGCAGACTTCTCCTCAGGAGTCAG-3 ' ( RS24)

where * indicates the label. This probe is 40 bases long, spans the fourth through seventeenth codons of the gene, and is complementary to the normal beta-globin allele (beta A ). The schematic diagram of primers and probes is given below:

110 bp

<- beta-αlobin

PC03 RS24 PC04

This probe was synthesized according to the procedures described in Section I of .Example I. The probe was labeled by contacting 20 pmole thereof with 4 units of T4 polynucleotide kinase (New England Biolabs) and about 40 pmole gamma "32 P-ATP (New England Nuclear, about 7000 Ci/mmole) in a 40 μ} reaction volume containing 70 mM Tris buffer (pH 7.6), 10 mM MgCl 2 , 1.5 mM spermine, and 10 mM dithiothreitol for 60 minutes at 37 C. The total volume was then adjusted to 100 πl with 25 mM EDTA and the probe purified according to the procedure of Maniatis et al., Molecular Cloning (1982), 466-467 over a 1 ml Bio Gel P-4 (BioRad) spin dialysis column equilibrated with Tris-EDTA ( TE) buffer (10 M Tris buffer, 0.1 mM EDTA, pH 8.0). TCA precipitation of the reaction product indicated that for RS24 the specific activity was 4.3 riCi/pmole and the final concentration was 0.118 pmole/μl.

F. Dot Blot Hybridizations

Four micro! iters of the amplified sample from Section IV and 5.6 μ) of appropriate dilutions of beta-globin plasmid DNA calculated to represent amplification efficiencies of 70, 75, 80, 85, 90, 95 and 100% were diluted with 200 jβ 0.4 N NaOH, 25 mM EDTA and spotted onto a Genatran 45 (Plasco) nylon filter by first wetting the filter with water, placing it in a Bio-Dot (Bio-Rad, Richmond, CA) apparatus for preparing dot blots which holds the filters in place, applying the samples, and rinsing each well with 0.1 ml of 20 x SSPE (3.6 M NaCl, 200 mM NaH 2 P0 , 20 mM EDTA), as disclosed by Reed and Mann, Nucleic

Acids Research, 13, 7202-7221 (1985). The filters were then removed, o rinsed in 20 x SSPE, and baked for 30 minutes at 80 C in a vacuum oven.

After baking, each filter was then contacted with 16 ml of a hybridization solution consisting of 3 x SSPE, 5 x Denhardt's solution (1 x = 0.02% polyvinyl pyrrol i done, 0.02% Ficoll, 0.02% bovine serum albumin, 0.2 M Tris, 0.2 mM EDTA, pH 8.0), 0.5% SDS and 30% formamide, and incubated for two hours at 42°C. Then 2 pmole of probe RS24 was added to the hybridization solution and the filter was incubated for two minutes at 42 C.

Finally, each hybridized filter was washed twice with 100 ml of 2 x SSPE and 0.1% SDS for 10 minutes at room temperature. Then the filters were treated once with 100 ml of 2 x SSPE, 0.1% SDS at 60° C for 10 minutes.

Each filter was then autoradiographed, with the signal readily apparent after two hours.

G. Discussion of Autoradiogram The autoradiogram of the dot blots was analyzed after two hours and compared in intensity to standard serial dilution beta- globin reconstructions prepared with Hae_III/Mae_I -digested pBR:beta A , where beta^ is the wild-type allele, as described in Saiki et al., Science, supra. Analysis of the reaction product indicated that the overall amplification efficiency was about 95%, corresponding to a 630, 000-fold increase in the beta-globin target sequence.

EXAMPLE II

A. Amplification Reaction

Two 1 μg samples of genomic DNA extracted from the Molt 4 cell line as described in .Example I were each diluted in a 100 J reaction volume containing 50 mM KC1, 25 mM TrisΗCl buffer pH 8.0, 10 mM MgCl 2 , 1 fl of primer PC03, 1 ιM of primer PC04, 200 / ig/ml gelatin, 10% dimethyl sul foxide (by volume), and 1.5 mM each of dATP, dCTP, dGTP and TTP. After this mixture was heated for 10 minutes at 98 C to denature the genomic DNA, the samples were cooled to room temperature and 4 Λ of the polymerase from Thermus aquaticus described in Example I was added to each sample. The samples were overlaid with mineral oil to prevent condensation and evaporative loss.

One of the samples was placed in the heating block of the machine described in Example I and subjected to 25 cycles of amplification, repeating the following program cycle:

(1) heating from 37 to 93 C over a period of 2.5 minutes;

(2) cooling from 93 °C to 37 "c over a period of three minutes to allow the primers and DNA to anneal; and

(3) maintaining at 37 C for two minutes.

After the last cycle the sample was incubated for an additiona 1l 10 minutes at 60 ° C to complete the final extension reaction.

The second sample was placed in the heat-conducting container of the machine, described in more detail in EP 236,069, supra. The heat-conducting container is attached to Peltier heat pumps which adjust the temperature upwards or downwards and a microprocessor controller to control automatically the amplificat on sequence, the temperature levels, the temperature ramping and the timing of the temperature.

The second sample was subjected to 25 cycles of amplification, repeating the following program cycle: (i) heating from 37 to 95 C over a period of three minutes;

(2) maintaining at 95 c C for 0.5 minutes to allow denaturation to occur; o

(3) cooling from 95 to 37 C over a period of one minute; and (4) maintaining at 37 C for one minute.

B. Analysis

Two tests were done for analysis, a dot blot and an agarose gel analysis.

For the dot blot analysis, a labeled DNA probe, designated RS18, of the following sequence was prepared.

S , -*(;TCCTGAGGAGAAGTCTGC-3 ' (RS18) where * indicatss the label. This probe is 19 bases long, spans the fourth through seventeenth coαons of the gene, and is complementary to the normal oeta-glo in allele (beta"). The schematic diagram of primers and probes is given below: 110 bp h bueattaa--αglloohbiinn *

PC03 RSI 8 PC04

This probe was synthesized according to the procedures 0 described in Section I of Example I. The probe was labeled by contacting 10 pmole thereof with 4 units of T4 polynucleo ide kinase

(New England Biolabs) and about 40 pmole gamma 3 P- ATP (New England

Nuclear, about 7000 Ci/mmole) in a 0^.1 reaction volume containing 70 mM Tris'HCl buffer (pH 7.6), 10 mM MgCl 2 ,.1.5 mM sper ine and 10 M

15 dithiothreitol for 60 minutes at 37°C. The total volume was then adjusted to 100 μ> with 25 mM EDTA and purified according to the procedure of Maniatis et al;, supra, p. 466-467 over a 1 ml Bio Gel P-

4 (BioRad) spin dialysis column equilibrated with Tris-EDTA (TE) buffer (10 mM Tris -HCl buffer, 0.1 M EDTA, pH 8.0). TCA

20 prec itation of the reaction product indicated that for RSI 8 the specific activity was 4.6 JuCi/ mole and the final concentration was

0.114 pmole/wl.

Five micro! iters of the amplified sample from Section I and of a sample amplified as described above except using the Klenow

' 25 fragment of ^ coli DNA Polymerase I instead of the thermostable enzyme were diluted with 195^1 0.4 N NaOH, 25 mM EDTA and spotted onto two replicate Genatran 45 (Plasco) nylon filters by first wetting the filters with water, placing them in a Bio-Dot (Bio-Rad, Richmond, CA) apparatus for preparing dot blots which holds the filters in

30 place, applying the samp es, and rinsing each well with 0.4 ml of 20 x SSPE (3.6 M NaCl, 200 mM NaH 2 P0 4 , 20 mM EDTA), as disclosed by Reed and Mann, supra. The filters were then removed, rinsed in 20 x SSPE, and baked for 30 minutes at 80 C in a vacuum oven.

After baking, each filter was then contacted with δ ml of a hybridization solution consisting of 5 x SSPE, 5 x Oenhardt's solution (1 x = 0.02% polyvinyl pyrrol idone, 0.02% Ficoll, 0.02% bovine serum albumin, 0.2 M Tris, 0.2 M EDTA, pH 8.0) and 0.5% SDS, and incubated for 50 minutes at 55 C. Tnen 5 vl of probe RS18 as added to the hybridization solution and the filter was incubated for 60 minutes at 55 C.

Finally, each hybridized filter was washed twice with 100 ml of 2 x SSPE and 0.1% SDS for 10 minutes at room temperature. Then the filters were treated twice more with 100 ml of 5 x SSPE, 0.1% SDS at 60 C for 1) one minute and 2) three minutes, respectively.

Each filter was then autoradiographed, with the signal readily apparent after 90 minutes.

In the agarose gel analysis, 5^1 each amplification reaction was loaded onto 4% NuSieve/0.5% agarose gel in 1 x TBE buffer

(0.089 M Tris, 0.089 M boric acid, and 2 mM EDTA) and electrophoresed for 60 minutes at 100V. After staining with ethidium bromide, DNA was visualized by UV fluorescence.

The results show that the machines used in Example I and this example were equally effective in amplifying the DNA, showing discrete -high-intensity 110-base pair bands of similar intensity, corresponding to the desired sequence, as well as a few other discrete bands of much lower- intensity. In contrast, the amplification method which involves reagent transfer after each cycle using the Klenow fragment of E. col Polymerase I, gave a DNA smear resulting from the non-specific amplification of many unrelated DNA sequences.

It is expected that similar improvements in amplification and detection would be achieved in evaluating HLA-DQ, DR and DP regions. If in the above experiments the amplification reaction buffer contains 2 mM MgCl instead of 10 mM MgCl 2 and 150-200 ,M of

each nucleotide rather than 1.5 M of each, and if the lower temperature of 37 C is raised to 45-58 C during amplification, better specificity and efficiency of amplification occurs. Also, OMSO was found not necessary or preferred for amplifi ation.

EXAMPLE III

Amplification and Cloning

For ampl fication of a 119-base pair fragπent on t.he human beta-globin gene, a total of 1 icrogram each of human genomic DNA isolated from the Molt 4 cell line or from the GM2064 cell line (representing a ho o∑ygous deletion of the beta- and delta-hemoglobin region and available from the Human Genetic Mutant Cell Depository, Camden, NJ) as described above was amplified in a lOO^αl reaction volume containing 50 mM KC1 , 25 mM Tris* HCl pH 8, 10 mM MgCl 2 , 200/jLg/ml gelatin, 5 mM 2-mercaptoethanol , 1.5 mM each of dATP, dCTP, TTP, and dGTP, and l of each of the following primers:

5 '-CTTCTGcagCAACTGTGTTCAεTAGC-3 (GH18) 5 ' -CACaAgCTTCATCCACGTTCACC-3 ' ( GH19) where lower case letters denote mismatches from wild- type sequence to create restriction enzyme sites. GH18 is a 26-base oligonucleotide complementary to the negative strand and contains an internal Pstl site. GH19 is a 23-base oligonucleotide complementary to the plus strand and contains .an internal Hindlll recognition sequence. These primers were selected- by first screening the regions of the gene for homology to the Pstl and Hindlll restriction sites. The primers were then prepared as described in Example I.

The above reaction mixtures were heated for 10 minutes at 95

^ and then cooled to room temperature. A total of 4 μλ of the polymerase described in Example I was added to each reaction mixture, and then each mixture was overlayed with mineral oil. The reaction mixtures were subjected to 30 cycles of amplification with the following program:

2.5 min. ramp, 37 to 98 C 3 min. ramp, 98 to 37 C 2 min. soak, 37 C

After the last cycle, the reaction mixtures were incubated tor 20 minutes at 65 C to complete the final extension. The mineral ooii ' l was extracted with chloroform and the mixtures were stored at -20

c.

A total of 10 l of the amplified product was digested with 0.5 μg M13mpl0 cloning vector, which is publicly available from .Boehringer-Mannheim, in a 50 volume containing 50 mM NaCl, 10 mM Tris'HCl, pH 7.8, 10 mM MgCl 2 , 20 units Pstl and 26 units Hindlll for 90 minutes at 37 C Q. The reaction was stopped by freezing at -20 C. The volume was adjusted to 110^1 with TE buffer and loaded (10Q/_u) onto a 1 ml BioGel P-4 spin dialysis column. One 0.1 ml fraction was collected and ethanol precipitated.

(At this point it was discovered that there was beta-globin amplification product in the GM2064 sample. Subsequent experiments traced the source of contamination to the primers, either GH18 or GH19. Because no other " source of primers was available, the experiment was continued with the understanding that some cloned sequences would be derived from the contaminating DNA in the primers.)

The ethanol pellet was resuspended in 15 \ water, then adjusted to 20^1 volume containing 50 mM Tris'HCl, pH 7.8, 10 mM MgCl 2 » 0.5 mM ATP, 10 M dithiothreitol, and 400 units ligase. This mixture was incubated for three hours at 16 C.

Ten microliters of ligation reaction mixture containing Molt 4 DNA was transformed into E. coli strain JM103 competent cells, which are publicly available from BRL in Bethesda, MD. The procedure followed for preparing the transformed strain is described in Messing, j. (1981) Third Cleveland Symposium on Macromolecules: Recombinant DNA, ed. A. Walton, Elsevier, Amsterdam, 143-163. A total of 651 colorless plaques (ami 0 blue plaques) were obtained. Of these, 119 had a (+)- strand insert (18%) and 19 had a (-)- strand insert (3%). This is an increase of almost 20-fold over the percentage of beta-globin positive

plaques among the primer- pos tive plaques from the ampl fication technique using Klenow fragment of t^ col i Polymerase I, where the reaction proceeded for two minutes at 25 β C, after which the steps of heating to 100 β Cfor two minutes, cooling, adding Klenow fragment, and reacting were repeated nine times. These results confirm the improved specificity of the amplification reaction employing the thermostable enzyme herein.

In a later cloning experiment with GM2064 and the contaminated primers, 43 out of 510 colorless plagues (8%) had the (+)- strand insert. This suggests that approximately one-half of the 119 clones from Molt 4 contain the contaminant sequence.

Ten of the (+)- strand clones from Molt 4 were sequenced.

Five were normal wild-type sequence and five had a single C to T mutation in the third position of the second codon of the gene (CAC to CAT). Four of the contaminant clones from GM2064 were sequenced and all four were normal.

Restriction site-modified primers may also be used to amplify and clone and parti.ally sequence the human N-ras oncogene and to clone base pair segments of the HLA DQ-alpha, DQ-beta and DR-beta genes using the above technique.

Again, if the concentrations of MgCl and nucleotides are reduced to 2 mM and 150-200 μ * respectively, and the minimum cycling

<_> 6 temperature is increased from 37 C to 45-58 C, the specificity and efficiency of the amplification reaction can be increased.

EXAMPLE IV

Gene Retrieval

A. IDENTIFICATION OF A DNA SEQUENCE PROBE FOR THE TAQ POLYMERASE GENE

A specific DNA sequence probe for the Taq pol gene was obtained following immunological screening of a lambdagtll expression library. J_. aquaticus DNA was digested to completion with Alul, ligated with EcoRI 12-mer l nkers (CCGGAATTCCGG, New England Biolabs), digested with EcoRI and ligated with dephosphorylated, EcoRI-digested

la bdagtll DNA ( Pro ega Biotech). The ligated DNA was packaged (Gigapack Plus, Stratagene) and transfected into " . coli K-12 strain Y1090 (provided by R. Young).

The initial library of 2 x 10 5 plaques was screened (Young. R.A., and R.W. Dayis (1983) Science, 222_:778-782) with a 1:2000 dilution of a rabbit polyclonal antiserum raised to purified Taq polymerase (see Examples I and XIII). Candidate plaques were replated at limiting dilution and rescreened until homogeneous {*- ' cycles). Phage were purified from candidate plaques which failed to react with preim une serum and reacted with immune serum.

Candidate phage were used to lysogenize E. col K-12 strain Y1089 (R. Young). Lysogens were screened for the production of an IPTG inducible fusion protein (larger than beta-galactosidase) which reacted with the Taq polymerase antiserum. Solid phase, size- fractionated fusion proteins were used to affinity purify epi tope- specific- antibodies from the total polyclonal antiserum (Goldstein, L.S.B., et al. (1986) J. Cell Biol. 102:2076-2087).

The "fished", ep-i tope- selected antibodies were used, in turn, in a Western analysis to identify which lambdagtll phage candidates encoded DNA sequences uniquely specific to Taq polymerase. One lambdagtll phage candidate, designated lambdagtll, specifically selected antibodies from the total rabbit polyclonal Taq polymerase antiserum- which uniquely reacted with both purified Taq polymerase and crude- extract fractions containing Taq polymerase. This phage, lambdagtrl, was used for further study.

The -^llS bp EcoRI -adapted Alul fragment of Thermus aquaticus DNA was labeled (Maniatis et al., supra) to generate a Taq polymerase- specific probe. The probe was used in Southern analyses and to screen a J_. aquaticus DNA random genomic library.

B. CONSTRUCTION AND SCREENING OF A THERMUS AQUATICUS RANDOM GENOMIC LIBRARY*

Lambda phage Charon 35 (Wilhelmine, A. M. et al . , supra) was annealed and ligated via its cohesive ends, digested to completion

with BamHI, and the annealed arms were purified from the "stuffer" fragnents by potassium acetate density gradient ul racsntri ugation (Maniatis, et al . , supra). T. aquati us DNA was partially digested with Sau 3 and the 15-20 kb size fraction purified by sucrose density gradient ul tracentrifugation. The random genomic library was constructed by ligating the target and vector DNA fragments at -a 1:1 molar ratio. The UNA was packaged and transf ected into . coli K-12 strains LE392 or K802. A library of ✓"20,000 initial phage containing ^99 recombi πants was amplified on ~ . col K-12 strain LE392.

The CH35 Taq genomic phage library was screened (Maniatis et al., supra) with the radiolabeled EcoRI insert of gtll:l. Specifically hybridizing candidate phage plaques were purified and further analyzed. One phage, designated Ch35::4-2, released > four J_. aquaticus DNA fragments upon digestion with Hindlll (* * °8.0, 4.5, 0.8, 0.58 kb)

The four Hindlll T. aquaticus DNA fragments were ligated with Hindlll digested plasmid BSM13 + (3.2 kb, Vector Cloning Systems, San Diego) and individually cloned following transformation of ~. col i K-12 strain DG98.

The 8.0 kb Hindlll DNA fragment from CH35::4-2 was isolated in plasmid pFC82 (11.2 kb), while the 4.5 kb Hindlll DNA fragment from CH35::4-2 was isolated in plasmid pFC83 (7.7 kb).

E. coli strain DG98 harboring pFC82 was shown to contain a thermostable, high temperature DNA polymerase activity (Table 1). In addition, these cells synthesize a new Λ -'δO kd molecular weight polypeptide which is i muno! ogi call related to Taq DNA polymerase.

The Taq polymerase coding region of the 8.0 kb Hindlll DNA fragment was further localized to the 1 ac -promoter proximal 2.68 kb Hindlll to Asp718 portion of the 8.0 kb Hindlll fragment. This region was subcloηed to yield plasmid pFC85 (6.0 kb). Upon induction with IPTG, ~. coli DG98 cells harboring plasmid pFC85 synthesize up to 100- fold more thermostable, Taq polymerase- related activity (Table 1) than the original parent clone (pFC82/DG98). While cells harboring pFC85

57 synthesize a significant amount of a thermostable DNA polymerase activity, only a portion of the Taq pol DNA sequence is translated, resulting in the accumulation of a -^60 d Taq polymerasa-rsl atad pol peptide.

TABLE 1 Expression of a Thermostable DNA Polymerase Activity in E. coli *

Sam pi e

BSM13/DG98 pFC82/DG98

PFC85/DG98

^Cells were grown to late log phase (+/- IPTG, 10 mM), harvested, sonicated, heated at 75 C for 20 minutes, centrifuged and the clarified supernatant assayed at 70 C for DNA polymerase activity. * 1 unit = 1 nMole dCTP incorporated in 30 minutes.

EXAMPLE V

Expression of Taq Polymerase

The thermostable gene of the present invention can be expressed in any of a variety of bacterial expression vectors including DG141 (ATCC ' 39588) and pP | _N RBS ATG, vectors disclosed in U.S. Patent No. 4,711,845, the disclosure of which is incorporated herein by reference. Both of these host vectors are pBR322 derivatives that have either a sequence containing a tryptophan promoter-operator and ribosome binding site with an operably linked ATG start codon (DG141) or a sequence containing the lambda P j _ promoter and gene N ribosome binding site operably linked to an ATG start codon (pP^N^ j ATG). Either one of these host vectors may be restricted with Sac I, and blunt ended with Klenow or SI nuclease to construct a convenient restriction site for subsequent insertion of the Taq polymerase gene.

The full -length Taq polymerase gene was constructed from the DNA insert fragments subcloned into plasmids ρFC33 and ρFCS5 as follows. Vector SSM13* (commercially available from Vector Cloning Systems, San Diego, CA) was digested at the unique Hindlll site, repaired with Klenow and dNTPs, and ligated with T4 DNA ligase to a Bglll octanucleo ide linker, 5'-CAGATCTG-3' (New England 3iolabs), and transformed into £^ coli stra n DG98. Plasmids were isolated from Amp R l_acZal ha + transformants. One of the clones was digested with Bgl II and Asp718 restriction enzymes, and the large vector fragment purified by gel electrophoresis.

Next, plasmid pFC83 was digested with Bgl II and Hindlll and the -^730 base pair fragment was isolated. Plasmid pFC85 was digested with Hindlll and Asp718 and the <*-**2.68 kb fragment isolated and joined in a three-piece ligation to the Λ 730 base pair Bgl II-HindIII fragment from pFC83 and the Bgl II-Asp718 vector fragment of BSM13 + . This ligation mixture was used to transform E^ coli strain DG98 (ATCC 39,768 deposited July 13, 1984) from which Amp R colonies were selected and an^δ.δδ kilobase plasmid (pLSGl) was isolated. Isopropyl-beta-D- thiogalactoside (IPTG)- induced DG98 cells harboring pLSGl synthesized Taq DNA polymerase indistinguishable in size from the native enzyme isolated from T. aquaticus.

Oligonucleoti de-directed mutagenesis (see Zoller and Smith, Nuc. Acids Res. (1982) 1^:6487-6500) was used to simultaneously 1) introduce an Sphl site within codons 3 to 5 of the Taq DNA polymerase gene sequence (see Figure 1, nt 8-13), 2) increase the A/T content of four of the first seven codons without effecting a change in the encoded amino acids (within codons 2-7 in Figure 1), 3) delete 170 nucleotides of the !acZ DNA and T^ aquaticus DNA 5' to the DNA polymerase gene initiation codon. Bacteriophage R408 (Russel, M., et a!., Gene, (1986) 45.:333-

338) was used to infect pLSGl/DG98 cells and direct the synthesis of the single-stranded DNA (ss) form (plus strand) of pLSGl. Purified pLSGl ssDNA was annealed with purified PvuII-digested BSM13 "1" Bglll vector fragments and the 47-mer mutagenic ol gonucleotide DG26 (5'-

CCCTTGGGCTCAAAAAGTGGAAGCATGCCTCTCATAGCTGTTTCCTS). Foil owing extens on with £. coli DNA polymerase I Klenow fragment, transformation of DG98 cells, and selection of Amp transformants, the colonies were screened with 5' 32 P-labεled DG26, ' - Hybridizing candidates were screened for loss of the Bgl II restriction site, deletion of approximately 170 base pairs of lacZ:T. aquaticus DNA, and introduction of a unique Sp l site. One candidate, designated pLSG2, was sequenced and shown to encode the desired sequence.

pLSGl sequence: S.D. 47bp BgJ_.II 105bp CAGGMACAGCT ATG ACC ATG A£ATCT

...AAC ATG AGG GGG ATG CTG CCC CTC TTT pLSG2 sequence:

S.D. Sphl CAGGAAACAGCTATG AGA. GGC ATG CTT CCA CTT TTT

Oligonucleotide-directed mutagenesis was used to introduce a unique Bgl II site in plasmid pLSG2 immediately following the TGA stop codon for the Taq polymerase gene (following nucleotide 2499 in Figure 1). As above, bacteriophage R408 was used to generate the single- stranded (plus) form of plasmid pLSG2. Purified pLSG2 ssDNA was annealed with purified _Pvu_I I -digested BSM13 + ~ Vl vector fragment and the 29-mer utagenic oligonucleotide SC107 (5 1 -

GCATGGGGTGGTAGATCTCACTCCTTGGC). Following extension with Klenow fragment (50 mM each dNTP), transformation of DG98 cells and selection for Amp R transformants, colonies were screened with 5 1 32 P-labeled SC107. Hybridizing candidates were screened for acquisition of a unique Bgl II site. One candidate, designated p5YC1578, was sequenced and shown to contain the desired sequence.

pLSG2 sequence:

... GCC AAG GAG TGA TAC CAC CCC ATG C pSYC1578 sequence:

,. GCC AAG GAG TGA GATC TAC CAC CCC ATG C

EXAMPLE VI

Construction of expression vectors pDGlδO and 0DGI6I

The Amp or Tet lambda P j _ promoter, gene N ribosome binding site, polyl inker, BT cry PRE (BT) (positive retro regulatory element, described in U.S. Patent No. 4,666,848, issued May 19, 1987), in a Col El cop vector were constructed from previously described plasmids and the duplex synthetic oligonucleotide linkers DG31 and DG32. The DG31/32 duplex linker encodes a 5' Hindlll cohesive end followed by Sad, Ncol, Kpnl/Asp718, Xmal/Smal recognition sites and a 3 1 BamHI cohesive end.

Asp718 Sac I Ncol Xmal

DG31 5' AGCTTATGAGCTCCATGGTACCCCGGG

ATACTCGAGGTACCATGGGGCCCCTAG-5 ' DG32

A. Construction of A p R plasmid pDGlδO

Plasmid pFC54.t, a 5.96 kb plasmid described in U.S. Patent 4,666,848, supra, was -digested with Hindlll and BamHI and the isolated vector fragment was ligated with a 5-fold molar excess of nonphosphorylated and annealed DG31/32 duplex. Following ligation, the DNA was digested with Xbal (to inactivate the parent vector IL-2 DNA fragment) and used to transform __ coli K12 strain DG116 to ampicilϋn resistance. Colonies were screened for loss of the des- ala-ser 1 " IL-2 mutein sequence and acquisition of the DG31/32 polyl inker sequence by restriction enzyme digestion. The polyl inker region in one candidate, designated pDGlδO, was sequenced and shown to encode the desired poly! inker DNA sequence.

B . Construction of Tet R plasmid oDGlδl

Plasmid pAW740CH8 (ATCC 67605), the source of a modified tetrac cline resistance gene wherein the SamHI and Hindlll restriction sites were eliminated, and which contains the lambdaP| promoter, gene. N ribosome binding site, cry PRE in a Col El cop ts vector, was digested to completion with Hindlll and -Sa HI and the 4.19 kb yector fragment purified by agarose gel electrophoresis. The purified vector DNA fragment was ligated with a 5-fold molar excess of nonphosphorylated annealed DG31/32 duplex. E. col K12 strain DG116 was transformed with a portion of the DNA, and Tet R colonies screened for presence of 4.2 kb plasmids. Several candidates were further screened by restriction enzyme digestion and the polyl inker region sequenced by the Sanger method. One of the candidates with the desired sequence was designated pDG161.

EXAMPLE VII

A. Construction- of an Amp R P L promoter, gene N ribosome binding site, (N RBS ) Taq polymerase (832) BT cry PRE, cop ts expression vector

To express the full-length (832 amino acid) mutated Taq polymerase sequence encoded by plasmid pSYC1578 under the control of the lambda P-_ promoter and gene N ribosome binding site, plasmids pSYC1578 and pFC54.t were used. Plasmid pSYC1578 was digested with Sphl and Bgl II and 'the resulting approximate 2.5 kb Taq polymerase gene fragment purified by agarose gel electrophoresis and electroelution. Plasmid pFC54.t was digested to completion with Hindlll and .BamHI and the vector fragπent purified by agarose gel electrophoresis. The synthetic oligonucleotides DG27 (5'-

AGCTTATGAGAGGCATG) and DG28 (5'-CCTCTCATA) were synthesized and annealed. Purified pFC54.t fragment (0.085 pmoles), purified Taq polymerase gene fragment (0.25 pmoles) and annealed nonphosphorylated DG27/28 duplex adaptor (0.43 pmoles) were combined in 30^ and ligated at*14 C. A portion of the ligated DNA was heated to 75*C (15 minutes) to inactivate the DNA ligase in the samples and treated with Xbal to linearize (inactivate) any IL-2 mutein containing ligation

products. The ligated and digested DNA (approximately 100 ng) was used to transform ]__ col K12 strain DGllδ to ampicilliπ resistance. a

Am colonies were screened for the presence of an approximate 8 kb plasmid which yielded the expected digestion products with Hindlll (521 bp + 7,410 bp), EcoRI (3,250 bp + 4,781 bp) and S ~ hl (3,031 bp),

Asp 713 (3,031 bp) 3 3amHI (8,031 bp) and PvuII (4,090 bp ÷ 3,477 bp + 464 bp). Several candidates were subjected to DNA sequence analysis at the 5' lambdaP L :TaqPol junction and the 3' Taq Pol :BT junction. One of the candidates was also screened with an anti -Taq polymerase antibody for the synthesis of an approximate 90 kd immunoreactive antigen. Single colonies were transferred from a SO^C culture plate to a 41°C culture plate for two hours. The colonies were scraped with a toothpick from both the 30°C and 41°C plates, boiled in SDS loading buffer, subjected to SDS-PAGE electrophoresis and the separated proteins transferred to a nitrocellulose membrane. The membranes were probed with a 1:6,000 dilution of a polyclonal anti-Taq antibody and developed with a goat anti -rabbit HRP conjugate. All of the candidates tested showed evidence of temperature inducible approximate 90 kd Taq pol erase- related protein. One of the several plasmid candidates which directed the synthesis of Taq polymerase in E. coli and contained the expected DNA sequence was designated pLSG5.-

B. Construction of a Tet P-_ promoter, gene N ribosome binding site, Taq polymerase (832) BT cry PRE cop ts expression vector To express 'the full length (832 amino acid) mutated Taq polymerase sequence encoded by plasmid pSYC1578 under control of the lambda P^ promoter and gene N ribosome binding site in a Tet R vector, we used plasmids pSYC1578 and pAW740CHB. Plasmid pSYC1578 was digested with Sphl and Bglll and the resulting approximate 2.5 kb Taq polymerase gene fragment was purified by agarose gel electrophoresis and electroelution. Plasmid pAW740CHB was digested to completion with Hindlll and BamHI and the resulting 4.19 kb vector fragment purified by agarose ' gel electrophoresis and electroelution. The synthetic ol gonucleotides DG27 and DG28 (described previously) were annealed. Purified pΛJ740CHB vector fragment (0.12 pmoles) was ligated with

purified Taq polymerase gene fragment (0.24 pmoles) and annealed nonphosphorylated DG27/ 8 duplex adaptor (0.24 pmoles) in SO^ul at 14 C. A portion of the l gated ONA (100 ng) was used to transform £. col K12 strain DGllδ to tetracycline resistance. Tet Λ candidates were screened for the presence of an approximate δ.7 kb plasmid which yielded the expected digestion products with Hindlll (621 bp + 6,074 bp), EcoRI (3,445 bp + 3,250 bp), As£718 (6,695 bp), Sohl (3,445 bp + 3,250 bp), B -amHI (6,695 bp) and jVuII (3,477 bp + 2,754 bp + 454 bp). Several candidates were subjected to DNA sequence analysis at the 5' lambdaP L :TaqPol junction and the 3' TaqPol :BT junction. Candidates were also screened by single colony im unoblot as described above for the temperature inducible synthesis of Taq polymerase. One of the plasmid candidates which directed the synthesis of Taq polymerase in _E__ coli and contained the expected DNA sequence was designated pLSGδ.

EXAMPLE VIII

Construction of a Met4 ( 3) 829 amino acid form of Taq polymerase

The predicted fourth codon of native Taq polymerase directs the incorporation of a methionine residue (see pLSGl and pLSG2 5 1 sequences above). To obtain a further mutated form of the Taq polymerase gene that would direct the synthesis of an 829 amino acid primary translation, product*, we used plasmids pSYC1578 and pDGlδl. Plasmid pSYC1578 was. digested with Sphl, treated with E. coli DNA polymerase I Klenow fragment in the presence of dGTP to remove the four-base 3 1 cohesive end and generate a CTT (leucine, 5th codon) blunt end. Following inactivation of the DNA polymerase and concentration of the sample, the DNA was digested with Bglll and the approximate 2.5 kb Taq polymerase gene fragnent purified by agarose gel electrophoresis and electroelution. Plasmid pDGlδl was digested to completion with Sac I, repaired with E. col DNA polymerase I Klenow fragment iα the presence of dGTP to remove the four base 3 1 cohesive end and generate an ATG terminated duplex blunt end. Following inactivation of the polymerase, the sample was digested with BamHI.

54

Digested pDG161 (0.146 pmole) and purified Taq polymerase fragment (0.295 pmole) were ligated at SO^g/ l under sticky end conditions overnight. The partially ligated DNA sample (3am HI /Bgl II ends) was diluted to and ligated for fiye hours under blunt end condi ions. The DNA ligase was inactivated (75 C, 10 minutes) and the sample digested with Ncol to linearize any ligation products containing the pDGlδl polyl nker sequence. Sixty nanograms of the ligated and digested DNA was used to transform j[__ coli K12 strain DG116 to tetracycϋne resistance. Tet R candidates were screened for the presence of an approximate 6.7 kb plasmid which yielded the expected digestion products when treated with Hindlll (512 bp + 6,074 bp), EcoRI (3,445 bp + 3,241 bp) and Sphl (6,586 bp). Colonies were screened as above by single colony immunoblot for the temperature inducible synthesis of an approximate 90 kd Taq polymerase-related polypeptide. One of the plasmids, designated pLSG7, that directed the synthesis of a Taq polymerase-related polypeptide was subjected to Sanger sequence determination at the 5' lambdaP | _ promoter:Taq polymerase junction and the 3' Taq polymerase :BT junction. Analysis of the DNA sequence at the 5' junction confirmed the restriction enzyme analysis (loss of one of the Sphl sites and a 612 bp Hindlll fragment, sl ghtly smaller than the 621 bp Hindlll fragment in pLSGδ) and indicated the derivation of a plasmid encoding an 829 amino acid form of Taq polymerase.

EXAMPLE IX Construction of Met289 ("7289) 544 amino acid form of Taq polymerase

During purification of native Taq polymerase (Example XIII) we obtained an altered form of Taq polymerase that catalyzed the template dependent incorporation of dNTP at 70 C. This altered form of Taq polymerase was immunologically related to the approximate 90 kd form described in Example XIII but was of lower molecular weight. Based on mobility, relative to BSA and ovalbumin following SDS-PAGE electrophoresis, the apparent molecular weight of this form is approximately 61 kd. This altered form of the enzyme is not present in carefully prepared crude extracts of Thermus aquaticus cells as

55 determined by SDS-PAGE Western blot analysis or in situ DNA polymerase activity determination (Spanos, A., and Hubscher, ϋ. (1983) Meth. £nz. _91_:253-277) following SDS-PAGE gel electrophoresis. This form appears to be proteolytic artifact that may arise during sample handling. This lower molecular weight form was purified to homogeneity and subjected to N-tεππinal sequence determination on an ABI automated -gas phase sequencer. Comparison of the obtained N-terminal sequence with the predicted amino acid sequence of the Taq polymerase gene (see Figure 1) indicates this shorter form arose as a. result of proteolytic cleavage between gl sg and se^gg.

To obtain a further truncated form of a Taq polymerase gene that would direct the synthesis of a 544 amino acid primary translation product we used plasmids pFC54.t, pSYC1578 and the complementary synthetic oligonucleotides DG29 (5'- AGCTTATGTCTCCAAAAGCT) and DG30 ( 5 -AGCTTTTGGAGACATA) . Plasmid pFC54.t was digested to completion with Hindlll and BamHI. Plasmid pSYC1578 was digested with BstXI and treated with jϊ^ col i DNA polymerase I Klenow fragment in the presence of all 4 dNTPs to remove the 4 nucleotide 3' cohesive end * and generate a CTG- terminated duplex blunt end encoding leu 2 g4 in the Taq polymerase sequence (see pLSGl, nucleotide 880). The DNA sample was digested to completion with Bgl II and the approximate 1.6 kb BstXI (repaired )/Bql II Taq DNA fragment was purified by agarose gel electrophoresis and electroelution. The pFC54.t plasmid digest (0.1 pmole) was ligated with the Taq polymerase gene fragment (0.3 pmole) and annealed nonphosphorylated DG29/DG30 duplex adaptor (0.5 pmole) under sticky ligase conditions at 30 tg/ml , 15°C overnight. The DNA was diluted to approximately 10 microgram per ml and ligation continued under blunt end conditions. The ligated DNA sample was digested with Xbal to linearize (inactivate) any IL-2 utein-encoding ligation products. 80 nanograms of the l gated and digested DNA was used to transform E^ coli K12 strain DG116 to ampicillin resistance. Amp candidates were screened for the presence of an approximate 7.17 kb plasmid which yielded the expected digestion products with EcoRI (4,781 bp + 2,386 bp), Pstl (4,138 bp + 3,029 bp), Agal (7,167 bp) and Hindi 11 /Pstl (3,400 bp + 3,029 bp + 738 bp). ___

δδ coli colonies harboring candidate plasmids were screened as above by single colony immuno blot for the ts peratura-induci ble synthesis of an approximate 61 kd Taq polymerase related polypeptide. In addition, candidate plasmids were subjected to DNA sεquεnca determination at the 5' lambda ? | _ promoter: Taq DNA junction and the 3" Taq DNA:BT cry PRE junction. One of the plasmids encoding thε intended DNA sequence and directing the synthesis of a temperature- nducible 61 kd Taq polymerase related polypeptide was designated pLSGδ.

Yet another truncated Taq pol merasε gene contained within the ^2.68 kb HindIII-Asp718 fragment of plasmid pFC85 can be expressed using, for example, plasmid pP^ ^g j ATG, by operably linking the ami no- terminal Hindlll restriction site encoding the Taq pol genε to an ATG initiation codon. The product of this fusion upon expression will yield an ^70, 000-72, 000 dalton truncated polymerase. This specific construction can be made by digesting plasmid pFC85 with Hindlll and treating with Klenow fragment in the presence of dATP and dGTP. The resulting fragment is. treatεd further with SI nuclease to remove any single-stranded extensions and the resulting DNA digested with Asp718 and treated with Klenow fragment in the presence of all four dNTPs. The recovered fragment can be ligated using T4 DNA ligase to dephosphorylated plasmid which had been digested with Sacl and treated with Klenow fragment in the presence of dGTP to construct an ATG blunt end. This ligation mixture can then be used to transform E. coli DG116 and the transformants screened for production of Taq polymerase. Expression can be confirmed by Western iπmunoblot analysis and activity analysis.

" EXAMPLE X

Construction of Amp R trp promoter operator, trpL ribosome binding site, Taq polymerase (832) BT cry PRE cop ts expression vector To substitute the E^ col trp operoπ promoter/operator and leader peptide ribosome binding site, we used plasmids pLSG5 and pFC52. pFC52 was the source of the trp promoter, cop* 5 and ampicillin resistant determinants. However, plasmid pCS4, described in U.S.

Patent No. 4,711,345, supra, the disclosure of which is incorporated herein by reference, may be used to provide the identical "fragment. Plasmid pLSG5 was digested to completion with Sp l. The Son I was inactiyated (70°C, 10 minutes) and the digested DNA was ligated overnight at 15*C with an εxcass of annealed nonphosphorylated DG27/23 duplex adaptor (see above). The T4 DNA ligase was inactiyated (7Q ό C, 10 minutes) and the DNA digested to completion with Mlul. The DNA sample was sequentially extracted with phenol and ether, ethanol precipitated and finally resuspended in 10 mM Tris chloride pH 8, 1 M EDTA. Plasmid pFC52 (or pCS4) was digested to completion with Mlul and extracted with phenol, ether and concentrated as above. The DNA sample was digested to completion with Hindlll and the Hindlll inactivated (75°C, 15 minutes). The pLSG5 and pFC52 samples werε ligated overnight in equal molar ratio and at 30 ig/ml under sticky end conditions. The T4 ligase was inactivated (70°C, 10 minutes) and the ligated DNA was digested with Xbal to linearize (inactivate) any IL-2 encoding ligation products (from the pFC52 unwanted, 1.65 kb Hindlll/Mlul DNA fragment). E coli K12 strain DG116 was transformed to ampicillin resistance with 30 nanogram of the ligated DNA. Amp colonies were screened for the presence of approximate 7.78 kb plasmids which yielded the expected digestion products with EcoRI (4,781 bp .+ 3,002 bp), Spjnl. (7,783 bp), Hindlll (7,162 bp + 621 bp), Clal (7,783 bp) and Clal/Mlul (3,905 bp + 3,878 bp). Candidate colonies were further screened for expression of an approximate 90 kd Taq polymerase related protein by single colony SDS-PAGE immuno blotting (as above). Plasmids from two of the candidates showing the intended properties were transformed into E coli K12 strain KB2 (ATCC No. 53075).

By Western im unoblot, both plasmids in both hosts were shown to direct the synthesis of an approximate 90 kd Taq polymerase- related polypeptide upon trp limitation. By Comassie staining of SDS- PAGE fractionated whole cell extract proteins, the trp promoter/Taq polymerase pi asmids ^ in E. coli K12 strain KB2 direct the accumulation of significantly more Taq polymerase than in J _ coli K12 strain DG116. One of the plasmids was designated pLSGlO.

EXAMPLE XI

Synthesis of Recombinant Taq DNA Polymerase Activity in E. coli

E. col K12 (DG116) strains harboring plasmids pDGlSO, or pLSG5, or pL5G6 were grown at 32*C in Bonner-Yogel minimal salts media containing 0.52 glucose, 10 _iιg/ml thiamine, 0.25Ϊ (w/v) Oifco casamino acids and ampicillin (100 jug/ml) or tεtracycline (10 Jig/ml) as appropriate. Cells were grown to Ag Q0 of about 0.8 and shifted to 37 C to simultaneously derepress the lambda P j _ promoter (inactivation of clg57 r 2P re sso«*) and increase the copy number of the Col El cop ts plasmid vector. After six-nine hours of growth at 37°C, aliquots of the cells were harvested, the cells centrifugεd and thε pεllεts storεd at -70°C.

Alternatively, JΞ^ coli K12 strain KB2 harboring plasmid pLSGlO was grown for eight hours at 32°C in Bonner-Vogel minimal salts mεdia containing 0.5? glucose, 5 ιg/ml tryptophan, 10Mj/ thiamine,

0.252 Difco casamino acids and 100 yug/ml ampicillin to an Ag 0 g of

3.0. Cells were harvested as above.

Cell pellets were * resuspεndεd to about 62.5 Aggg/ml (~ * 50- 160 yUg total protein/ml) in 50 mM Tris-Cl, pH 7.5, 1 M EDTA, 2.4 mM PMSF and 0.5 lg/ml leupeptin and lysεd by sonication. Aliquots of the sonicated extracts were subjected to SDS-PAGE and analyzed by Coomassie staining and Western immunoblottiπg with rabbit polyclonal anti -Taq polymerase antibody. In addition, portions of thε extracts were assayed in a high temperature (74°C) DNA polymerase assay (see Example XIII below).

Western immunoblotting showed significant induction and synthesis of an approximately 94 kd Taq DNA polymerase relatεd polypeptide in induced strains harboring plasmids pLSG5, 6, and 10. Coomassie blue staining of SDS-PAGE- separated total cell protein revaled the presence of a new predominant protein a -^94 kd in these induced strains. Finally, high temperature activity assays confirmed the significant level of recombinant Taq DNA polymerasε synthεsis in these E. coli strains (see table, below). '

* 1 unit = 10 nmole total nucleotide incorporated at 74 C/30 minutes. EXAMPLE XII

Purification of Recombinant Taq DNA Polymerase

E. coli strain DGllδ harboring plasmid pLSG5 was grown in a 10 L fermεntor. The medium was 10 mM (NH ) 2 S0 4 , 25 mM KH 2 P0 , 4 mM Na 3 Citrate, 400 ^ FeCl 3 , 28^ ZnCl 2 , 34 H CoC1 2 , 33 M NaMo0 4 , 27 tM CaCl 2 , 30 ^M CuCl 2 , and 32 ^M H3BO3. The medium was adjusted to pH 6.5 with NaOH, 15 mM, " and sterilized. The following sterile components wεrε addεd: 20 mg/1 thiaminε'HCl , 3 mM MgS0 4 , 10 g/1 glucosε and 12.5 mg/1 ampicillin. Thε pH was adjustεd to 6.8 and hεld thεre using NH 0H. Glucose was fed to the culture in conjunction with the alkali dεmand, to maintain a glucose concentration at 40% of air saturation, by automatic incrεasεs in rpm (350 to 1000) and airflow (2 to 5 1/min). Foaming was control lεd on dεmand using pol propylεnε glycol.

The fermεntor was inoculatεd with cεlls and grown to Agg = 5.0 (14.25 hours). Thε tεmperaturε was raisεd to 37 β C to inducε synthesis of rεcombinant Taq polymerasε and growth continued for five hours to Ag 80 of 16.5.

Unless otherwise indicated, all purification steps werε conductεd at 4 β C. Twenty grams (wεt wεight) of inducεd frozen E. coli K12 strain DGllδ harboring plasmid pLSG5 was thawed in 3 volumes of 50 mM Tris-Cl, pH 7.5, 1 mM EDTA, 3 mM PMSF, 0.64 /β/ml leupεptin and

disrupted in a French Press at 20,000 psi . The lysate was adjusted to 5.5X cell yolu e with additional buffer and soπϊcsted (4 x 30 seconds) to rεduce viscosity (Fraction I). The crude total call lysate was adjusted to 0.2 M (NH 4 ),S0 4 (25.43 g/1) and centrifuged for 15 minutes at 20,000XG. Thε supεrnatant (Fraction II) was heatad to 75°C (in a 100 C water bath) and maintainεd at 72-75 C for 15 minutes to dεnature E. coli host proteins. The sample was rapidly cooled to 4°C by swirling in an ice water bath. After 20 minutεs at O^C, thε s»amplε was centrifuged at 20,000XG for 15 minutεs to prεcipitate the denatured proteins. The supernatant (Fraction III) was appliεd at 4 ml/hr to a 6 ml Phenyl -Sepharosε CL-4B (Pharmacia) column equilibrated with 50 mM Tris-Cl, pH 7.5, 1 mM EDTA (Buffer A) containing 0.2 M (NH 4 ) 2 S0 4 . The column was sequentially washed with 3-10 column volumes of a) the samε buffεr, b) Buffεr A, c) Buffεr A containing 20% εthylεne glycol to remove nuclεic acids and non-Taq polymεrasε proteins. Taq DNA poly erasε activity was εlutεd with 60 ml linεar gradiεnt of 0-4 M urεa in Buffεr A containing 20% εthylεne glycol. The activε fractions ( Λ, 2 M urεa) wεrε poolεd (Fraction IV) and applied at 3 ml/hr to a 12 ml * (1.5 X 6.0 cm) Heparin-Sepharosε CL-δB (Pharmacia) column equilibrated in 50 mM Tris-Cl, pH 7.5, 0.1 mM EDTA, 0.2% Tween 20 (Buffεr B) containing 0.1 M KC1. Thε column was washεd with 2 column volumεs of Buffεr B containing 0.15 M KC1. The Taq polymεrasε was εlutεd with a 120 ml linear gradiεnt of 0.15-0.65 M KC1 in Buffεr B. Thε Taq polymεrasε εlutεd as a single A23 Q and activity peak af-O.29 M KC1. '

Purified recombinant and native Taq polymerasε protεins coraigratε following electrophorεsis on SDS-PAGE and staining with Coomassie blue. The purified Taq polymerasε protεins mi ratε slightly faster than purified Phosphorylasε B (Pharmacia), consistεnt with a mol cular we ght predicted from the DNA sequence (of pLSG5) of 93,920 dal ons. the peak activity fractions wεre pooled and a portion sub εcted to N-terminal amino acid sεquεncε dεtεr i nat on on an

Appϋεd Biosystems gas phasε sequεncεr. In contrast to native Taq polymerasε which has a blockεd amino terminus, the sεquεncε of thε

71 purified recombinant Taq polymerase and the individual cycle yields were consistent with the sequence predicted for the amino terminus of the Taq polymerase protein encoded by plasmid pLSG5.

The recombinant Taq pol merasa ancoded by plasmid pLSGS and purified as described could amplify a human "single copy' sequence. Using a low temperatura limit of 55°C, extension temperature of 72°C, upper tεmpεratura limit of 94° C and a 2-2.5 minute cycle time, comparable yields and efficiency were noted for native and recombinant Taq polymεrase using 1-2 units/100 ^1 PCR.

EXAMPLE XIII

Purification

The thermostable polymerase may be purifiεd directly from a culture of Thermus aquaticus following the example disclosed below or, alternatively, from a bacterial culture containing the recombinantly produced enzymε with only minor modifications necessary in the preparation of the crude extract.

After harvesting by centrifugation, 60 grams of cells were resuspended in 75 ml of a buffer consisting of 50 mM Tris-Cl pH 8, 1 mM EDTA. Cells were lysεd in a French Prεss at 14,000-16,000 PS I after which 4 volumes (300 ml) of additional Tris-EDTA were added. Buffer A (beta-mercaptoethanol to 5 mM and NP-40 and Tween 20 to 0.5% (v/v) each) was added and the solution was sonicated thoroughly while cooling. The resultant homogeneous suspension was diluted further with Buffer A such that thε final volumε was 7.5-8 timεs thε starting eel 1 wεight; this was dεsignated Fraction I.

Thε polymerasε activity in Fraction I and subsεquεnt fractions was dεtermined in a 50 ρ ~ ixturε containing 0.025 M TAPS- HC1 pH 9.4 (20 C), 0.002 M MgCl 2 , 0.05 M KC1, 1 M 2-mercaptoethanol , 0.2 M each dGTP, dATP, TTP, 0.1 M dCTP f l pha~ 32 P, .05 Ci/mM], 12.5 μq "activatεd" salmon spεππ DNA and 0.01-0.2 units of thε polymεrasε (dilutεd in 10 M Tris-HCl, pH 8, 50 M KC1, 1 mg/ml autoclavεd gεlatiπ, 0.5% NP-40, 0.5% Tween 20, and 1 mM 2- mεrcaptoethanol ). Onε unit corrεsponds to 10 nmoles of product

synthεsizεd in 30 minutes. "Activated" DNA is a native preparation of DNA after partial hydrolysis with DNasε I * until 5% of the DNA was transferred to the acid-soluble fraction. The reaction was conducted at 74° C for 10 minutes and then 0^ was transferred to 1.0 ml of 50 λ g/ml carrier DNA in 2 mM EDTA at 0 9 C. An equal volume (1.0 ml) of 20% TCA, 2% sodium pyro phosphate was added. After 15-20 minutes at 0 o. the samples wera filtered through Whatman GF/'C discs and extensively washed with cold 5% TCA-1% pyro phosphate, followεd by cold 95% εthanol , driεd and countεd. Fraction I was centrifugεd for two hours at 35,000 rpm in a

.Beckman TI 45 rotor at 2°C and the collected supernatant was designated Fraction II.

Thε Taq polymerase activity was precipitated with Polymin P

(BRL, Gaithersburg, MD) (10%, w/v, adjusted to pH 7.5 and autoclaved) after the minimum amount of Polymin P nεcεssary to precipitate 90-95% of the activit was dεtermined, which amount was generally found to be between 0.25% and 0.3% final volume.

An appropriate level of Polymin P was added slowly to Fraction II while stirring for 15 minutes at 0 β C. This solution was centrifugεd at 13,000 rpm for 20 minutεs in a Beckman JA 14 rotor at 2 C. The supεrnatant was assayεd for activity and the pεllet was rεsuspεndεd in 1/5 volume of 0.5X Buffer A (diluted 1:2 with H 0). This suspension was receπtrifuged and the pellet resuspended in 1/4

- volume of Buffεr A containing 0.4 M KC1. This suspension was homogenizεd thoroughly and left ovεrnight at 4°C. Thε homogεnate was cεntrifugεd as abovε and thε collεctεd supernatant dεsignatεd Fraction III.

The protein fraction was collected by "prεcipitation" at 75% saturation of ammonium sulfatε, cεntrifuged (at 27,000 rpm, SW27 rotor, 30 minutes) and the floating pellicle was resuspendεd in 50 M Tris-Cl pH 8, 1 mM EDTA. Thεsε steps werε rεpeated and thε protein suspension was dialyzed extεnsively with P-cell buffεr (20 mM KP0 4 pH 7.5, 0.5 mM EDTA, 5 mM beta-mercapto ethanol , 5% (w/v) glycerol, 0.5% (v/v) NP-40 and Tweεn 20) containing 80 M KC1.

The dialysata was transferred to a centrifuge bottle to which was added any recovered protein from sacks riπ-.εd with the P- cell buffεr containing 30 mM KC1. Centrifugation was performed at 20,000 x g and thε time was reduced to 15 minutes. The supernatant was saved and any pell at remaining was washed, sxtractad with P-cell buffer and 80 mM KC1 , and recentrifuged. The supernatants were than combined to form Fraction IV.

Fraction IV was applied to a 2.2 x 22-cm column of phosphocellulose, equilibrated with the P-cell buffεr containing 80 mM KC1. The column was washed (2.5-3 column volumes) with the same buffer and the protein eluted using a linear gradiεnt of SO to 400 mM KC1 in P-cell buffer. Fractions containing DNA polymerase activity (*• 0.18-0.20 M KC1 ) were pooled and concentrated 3-4 fold on an Amicon stirred cell and YM30 mεmbranε. Thε cell was rinsed with thε P-cell buffer without KC1 and added to the fraction concentrate (0.15 M KC1 adjusted final volume) to form Fraction V.

Fraction V was applied to a 5 ml Reparin Sepharose CL-6B column (Pharmacia) equilibrated with P-cell buffer and 0.15 M KC1. The column was washed with -0.15 M KC1 buffer (3-4 column volumes) and the protein eluted with a linear gradient from 0.15 to 0.65 M KC1 in P-cell buffεr. A 1:10 dilution into diluent without gelatin was made for SDS-PAGE analysis and a subsequent 1:20 dilution into diluent with 1 mg/ml gelatin was made for use in enzyme assays. The activity fractions (eluting -at -^0.3 M KC1 ) were assayed on supercoiled DNA template for specific and non-specific endonucl eases/ to po isomerase by elεctrophorεtically dεtεcting the change in molecular weight of supεrcoilεd plasmid DNA aftεr incubation with an εxcεss of DNA polymεrase. Exonucl ease contamination was detected following incubation with small linear DNA frag εnts. In pεak fractions, an ' 88-92 kd protεin was found to bε thε major band. Thε major pool, dεsignated Fraction VI, had the highest polymerase activity with minimal detεctablε εndonuclεasε activity whεn this pool was assayεd for 30 minutεs at 55*C with -^3-5 polymεrase units/600 ng DNA.

74

Fraction VI was dialyzed against 10 mM KP0 4 pH 7.5, 5 mM beta-mercaptcethanol, 5% ^ / carol. 0.2% NP-40, and 0.2% Tween 20 (HA buffer). Thε dial zεd sample was app ied to a 3 ml column of hydroxyapatite and the enzyme eluted with a linear gradient of 10 to 250 M KPOΛ pH 7.5, HA buffer. DNA polymerase activity began to al uta at 75 mM KP0 4 with the peak at 100 mM KP0 4 . Active peak fractions were assayed at 1:100-1:300 dilution. As in thε prior chromatography step, a 1:10 dilution in diluεnt was prεpared without gelatin for SDS- PAGE analysis. Fractions with no significant endonuclεase or doublε- strand exonucl easε whεn assayed at 55 C with 5 polymerasε units were pooled and dεsignatεd Fraction VII.

Fraction VII was dialyzed against a solution of 25 mM sodium acetate pH 5.2, 5% glycerol, 5 mM beta-mercaptoethanol , 0.1 mM EDTA, 0.1% NP-40, and 0.1% Tweεn 20, adjusted to pH 5 at room temperature. The dialyzed sa plε was appliεd to a 2 ml DEAE-Tris-Acryl-M (LKB) column prε-εquil ibratεd and subsεquεntly washεd with the same buffer. The fraction containing polymerase activity that did not adhere to the column was pooled and adjusted to 50 mM NaCl in thε same buffer to yield Fraction VIII. Fraction VIII was appliεd to a 2 ml CM-Tris-Acryl M (LKB) column εquilibrated with the same buffer (25 M sodium acetate, 50 mM NaCl, 5% glycεrol, 0.1 mM EDTA, 0.1% NP-40, and 0.1% Twεεn 20). Thε column was washεd with 4-5 column volumes of thε samε buffer and the εnzymε εlutεd with a. linεar gradiεnt from 50 to 400 mM NaCl in sodium acetate buffer. The polymεrasε activity pεak eluted - 0.15 -0.20 M NaCl. The polymεrasε activity was assayεd at 1:300 to 1:500 dilution with thε first dilution 1:10 into diluεnt without gεlatin for thε SDS- PAGE analysis. An assay across thε activity pεak on supercoilεd DNA tεmplates for specific and non-specific endonucleasε/topoisomεrasε using DNA polymerase assay salts (25 mM TAPS-HCl pH 9.4, 2.0 mM MgCl 2 and 50 mM KC1 ) at 74 C was perfor εd, as wεll as assays for nuclεasεs on M13 ss DNA and pBR322 frag εnts. Activε fractions with no dεtεctablε πuclease(s) werε pooled and run on a silvεr stainεd SDS- PAGE mini gεl . Thε rεsults show a singlε <"' 88-92 kd band with a specific activity of ^200,000 uπits/mg.

This specific activity is more than an order of magnitude higher than that claimed for the previously isolated Taq polymerase and is at least an order of magnitude higher than that for E. coli polymerasε I.

EXAMPLE XIV The Taq polymerase purifiad as described above in Example

XIII was found to be frεε of any contaminating Taq endonucleasε and exonuclease activities. In addition, the Taq polymerase is preferably stored in storage buffεr containing from about 0.1 to about 0.5% volume/volume of each non-ionic poly εric detergent employed. More preferably the storage buffer consists of 50% (v/v) glycerol, 100 mM KCl, 20 mM Tris-Cl pH 8.0, 0.1 mM ethyl enedi aminetetraacetic acid (EDTA), 1 M dithiothreitol, 0.5% v/v NP-40, 0.5% v/v Tween 20, and 200 ιg/ml gelatin, and is preferably stored at -20 α C.

The stored Taq polymerase was diluted in a buffer consisting of 25 mM Tris Cl pH 8.0, 20 M KCl, 1 M beta-mercaptoethanol , 0.5% NP-40, 0.5% Tween-20, and 500 yUg/ml gelatin. A rεaction buffer was then prepared containing 50 M KCl, 10 mM Tris-Cl, pH 3.3, 1.5 mM MgCl 2» 0.01% (w/v) gelatin,- 200 μW each dNTP, IμW εach of the primers that definε a 500 basε pair targεt sεquεncε on a control template from bacteriophage lambda, and 2.0-2.5 units Taq polymerase/assay in a final volume of 100 [ . Template was added to the rεaction buffεr, the sample placed in a 0.5 ml polypropylene tube, and the sample topped with 100^1 of heavy white mineral oil to prevent evaporation.

At least a 10^-fold amplification was achiεved when the following conditions werε εmployεd, using 1 ng of control tεmplate (bacteriophage lambda DNA) where the target sequencε rεpreseπtεd approximatεly 1% of the starting mass of DNA.

First the template mixture was denatured for one minute, 30 sεconds at 94 β C by placing thε tube in a hεat bath. Thεn thε tubε was placεd in a heat bath at 37 β C for two minutεs. Then the tube was placed in a heat bath at 72°C for three minutes, and then in the heat bath at 94 C for one minute. This cycle was repεatεd for a total of 25 cyclεs. At the εnd of thε 25th cyclε, thε hεat dεnaturation step

at 94°C was omitted and replaced by extending the 72°C incubation step by an additional three minutes. Following termination of the assay, the samples were allowed to cool to room temperature and analyzed as described in previous axamplas. Thε template ma be optimally amplified with a different concentration of dNTPs and a different amount of Taq polymerase. Also, the size of thε targεt sεquεncε in thε DNA sampl will directly impact thε minimum timε requirεd for proper extension (72°C incubation step). An optimization of the tεmpεraturε cycling profile should be perfor εd for εach individual tεmplate to bε amplified, to obtain maximum efficiency.

EXAMPLE XV

Taq polymerase purified as described above in Example I was formulated for storagε as dεscribεd in thε prεvious εxamplε, but without the non-ionic polymeric detεrgεnts. Whεn assayεd for activity as dεscribεd in that εxamplε, thε εnzy ε storagε ixturε was found to ' bε inactivε. Whεn thε NP-40 and Twεεn 20 were added to the storage buffer, thε full εnzymε activity was rεstorεd, indicating that thε presεncε of the non-ionic dεtergents is necεssary to thε stability of the eπzymε formulation.

EXAMPLE XVI

Sεvεral 1 ~ q samples of human genomic DNA were subjectεd to 20-35 cyclεs of amplification as dεscribεd in Exa plε II, with equivalent units of either Klεnow fragmεnt or Taq polymεrasε, and analyzεd by agarosε gεl εlectrophoresis and Southern blot. Thε pri εrs usεd in thεse rεactions, PC03 and PC04, direct the synthesis of a 110-bp segmεnt of thε human bεta-globin gεnε. Thε Klεnow polymerasε amplifications εxhibited the smεar of DNA typically obsεrvεd with this εnzymε, thε apparent causε of which is thε non- spεcific annealing and extension of primers to unrεlatεd gεnomic sεquεncεs undεr what wεrε εssεntially non-stringεnt hybridization conditions (lx Klenow salts at 37° C). Nεverthelεss, by Southεrn blot

a specific 110-bp beta-globin target fragment was detected in all lanes. A substantiall different electrophoretic pattern was seen in the amplifications done with Taq polymerase where the single major band is the 110-bp target sequence. This remarkable specificity was undoubtedly due to the temperature at which the primers wera extended.

Although, like Klenow fragment amplifications, the annealing stεp was pεrfor εd at 37 C, thε tεmpεrature of Taq-catal zed reactions had to be raised to about 70°C before thε εnzymε exhibitεd significant activity. During this transition from 37 to 70 °C poorly matched primεr-template hybrids (which formed at 37°C) disassociated so that by the time the reaction reached an enzyme-activating tamperaturε, only highly complementary substrate was available for extension. This specificity also results in a greater yield of target sequence than similar amplifications done with Klenow fragmεnt bεcause the non- specific extension products effectively compete for the polymerase, thereby reducing the amount of 110-mer that can be made by the Klenow fragment.

EXAMPLE XVII

Amplification was carried out of a sample containing I j /ug Molt 4 DNA, 50 mM KCl, 10 mM Tris pH 8.3, 10 mM MgCl 2 , 0.01% gelatin, M of each of the following primers (to amplify a 150 bp region):

5'-CATGCCTCTTTGCACCATTC-3'(RS79) and 5 ' -TGGTAGCTGGATTGTAGCTG-3 ' ( RS80 )

1.5 M of each dNTP, and 5.0 units of Taq polymεrasε pεr 100 jι\ rεaction volumε. Thrεε additional samplεs were prepared containing 2.5, 1.3, or 0.6 units of Taq polymerasε. Thε amplification was carried out in thε tεmpεrature cycling machinε dεscribεd abovε using thε following cyclε, for 30 cyclεs: from 70 to 98 β C for 1 inutε hold at 98 β C for 1 minutε from 98 β C to 35, 45 or 55°C for 1 minute hold at 35, 45 or 55°C for 1 minute from 35, 45 or 55°C to 70°C for 1 minute hold at 70°C for 30 seconds

At 35 σ C annealing tempεrature, the 2.5 units/100 ^L! Taq enzyme dilution gave the best-signal-to noise ratio by agarose gel electrophoresis over all other Taq polymerase concentrations. At 45*C, the 5 units/100 / 1 Taq enzymε gave the best s nal-to-noise ratio over the other concentrations. At 55 C, thε 5 un ts/100 ^u.1 Taς enzyme gave the best signal-to-noise ratio over thε other concentrations and over thε 45 d C annεaling and improvεd yield. Thε Taq polymerasε has more specificity and bettεr yiεld at 5S°C.

In a separatε εxperiment thε Molt 4 DNA was 10-fold serially dilutεd into thε cεll line GM2064 DNA, containing no beta- or dεlta- globin sequences, avail ab ε from thε Human -Gεnεtic Mutant Cεll Depository, Camden, New Jεrsεy, at various concentrations reprεsenting varying copies per cell, and amplification was carried out on these samples as describεd in this εxamplε at annεaling temperaturεs of 35 σ C and 55 °C. At 35°C, thε bεst that can bε seen by agarose gel electrophoresis is 1 copy in 50 cells. At 55°C, the best that can be sεen is 1/5,000 cεlls (a 100-fold improvεmεnt ovεr thε lower tεmpεraturε), illustrating the importance of increasεd annεaling tε pεraturε for Taq polymεra.sε spεcificity undεr thεsε conditions. In a third εxpεrimεnt, DNA from a cεll linε 368H containing

HIV-positivε DNA, availablε from B. Poiεsz, State Uπivεrsity of Nεw York, Syracusε, NY, was similarly dilutεd into thε DNA from the SCI cell linε (dεpositεd with ATCC on March 19, 1985; an EBV-transformed bεta cεll linε ho ozygous for thε sicklε cεll allεlε and lacking any HIV sequences) at various concεntrations rεprεsεnting varying copies pεr cell, and amplification was carried out as dεscribεd in this .Example at annealing tεmpεratures of 35°C and 55 C C, using thε pri εrs SK38 and SK39, which amplify a 115 bp rεgion of thε HIV sεquεncε:

5 ' -ATAATCCACCTATCCCAGTAGGAGAAAT-3 ' ( SK38) and 5 ' -TTTGGTCCTTGTCTTATGTCCAGAATGC-3 ' ( SK39)

Thε results by agarosε gεl εlεctrophorεsis showεd that only the undiluted 368H sample could be dεtεctεd with thε annεaling tεmperature at 35 β C, whεrεas at lεast a IO "2 dilution can bε dεtected with the annεaling tεmperature at 55° C, giving a 100-fold improvεmεnt in dεtection.

The following bacteriophage and bacterial strains were deposited with the Aπerican Type Culture Collection, 12301 Park! awn Drive, Rockvillε, Maryland, USA (ATCC). Thase deposits were made under the provisions of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for purposes of Patent Procedure and the Regulations thεrεundεr (Budapest Treaty),

cI 857 susP 80 )/pFC54.t

E. coli DG116/PA 740CHB 3291 67605 1/12/88