Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHOD OF DETECTING RESIDUAL GENOMIC DNA AND A KIT THEREOF
Document Type and Number:
WIPO Patent Application WO/2013/108083
Kind Code:
A1
Abstract:
The present disclosure relates to a highly specific and sensitive method of detecting host cell impurities in a biological sample by using quantitative real time polymerase chain reaction (q PCR).The present disclosure also provides novel designed primer and probe to amplify only the specific Alu family of dispersed repetitive sequences from Chinese hamster ovary cells used for expression of therapeutic proteins.

Inventors:
FADNIS RAHUL SHARAD (IN)
RAGHUNANDAN REENA NICHINMETLA (IN)
Application Number:
PCT/IB2012/051046
Publication Date:
July 25, 2013
Filing Date:
March 06, 2012
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
BIOCON LTD (IN)
FADNIS RAHUL SHARAD (IN)
RAGHUNANDAN REENA NICHINMETLA (IN)
International Classes:
C12Q1/68
Foreign References:
US20090325175A12009-12-31
Other References:
VENABLE, D. ET AL.: "High-throughput and quantitative detection of residual NSO and CHO host cell genomic DNA", BIOPROCESS INTERNATIONAL, vol. 5, no. 6, June 2007 (2007-06-01), pages 56 - 60, 62, XP055078843
NISSOM, P.M.: "Specific detection of residual CHO host cell DNA by real-time PCR", BIOLOGICALS, vol. 35, no. 3, June 2007 (2007-06-01), pages 211 - 215, XP022360759
See also references of EP 2804957A4
Attorney, Agent or Firm:
VIJAYAKRISHNAN, Sindhu et al. (4121/B 6th Cross,19A Main, HAL II Stage ,,Bangalore 8, Karnataka, IN)
Download PDF:
Claims:
We Claim:

1. A sensitive and quantitative method to detect the residual genomic DNA from a host cell of rodent family as low as 20 femtogram in a protein product by using primer sequence ID number 22 and sequence ID number 23 and corresponding probe sequence ID number 24 targeted to amplify Alu family of the dispersed sequences.

2. The method as claimed in claim 1, wherein the technique for detecting residual genomic DNA from a mammalian host organism in a protein product, said method comprising steps of:

a. extracting DNA from the sample;

b. designing suitable primers to amplify a region of the CHO genomic DNA and designing a specific probe to anneal to a target sequence located between two PCR amplification primers;

c. amplifying the targeted sequences with the pair of specific primers and a DNA-based probe with a fluorescent reporter at one end and a quencher of fluorescence at the opposite end of the probe;

d. breakdown of the probe by the 5' to 3' exonuclease activity of the Taq polymerase breaks the reporter-quencher proximity and thus allows unquenched emission of fluorescence; and

e. the increase in fluorescence and accumulation of the PCR product is continually monitored throughout the PCR reaction by real-time PCR instrument.

Wherein the change of fluorescence in each cycle is proportional to the identification of specific amplified sequences and indicates the presence of residual genomic DNA in the sample.

3. The method as claimed in claim 2, wherein the amplifying is carried out using quantitative real time PCR employing Taq DNA polymerase.

4. The method as claimed in claim 1, wherein the target for Rodent family DNA qPCR is the 74bp Alu-equivalent consensus sequence.

5. The method as claimed in claim 1, wherein the protein product is selected from group comprising a monoclonal antibody, a therapeutic protein product and a recombinant protein product, or any combination thereof.

6. The method as claimed in claim 1, wherein the protein product is preferably produced by Chinese Hamster Ovary cell or mouse myeloma cell line-NSO.

7. The method as claimed in claim 2 wherein the fluorescent reporter is 6- carboxyfluorescein (FAM), and the quencher is Black Hole Quencher l(BHQ-l).

8. The method as claimed in claim 2 wherein the amplified product is detected using assay selected from group comprising SYBR green assay and TAQMAN probe assay.

9. The method as claimed in claim 1, wherein concentration of the forward and reverse primer is ranging from about 100 to about 800nM, preferably 400nM.

10. The method as claimed in claim 1 , wherein the concentration of the probe is ranging from about from about 1000 to about ΙΟΟηΜ, preferably ΙΟΟηΜ.

11. The method as claimed in claim 2, wherein the annealing is carried out at a temperature 56°C for time duration of about 1 minute.

12. The method as claimed in claim 1 , wherein minimum amount of the residual genomic DNA in the biological sample detected and optionally quantified is less than about 5 femtograms.

13. The method as claimed in claim 1 , wherein the residual genomic DNA identified from the recombinant protein product comprises of Alu equivalent concensus sequences designated as SEQ ID: 1, SEQ ID: 2, SEQ ID:3,SEQ ID:4, SEQ ID:5, SEQ ID:6 and SEQ ID:9.

14. The method as claimed in claim 1 , wherein the residual genomic DNA using specific Alu equivalent consensus sequences comprises the nucleic acid sequence of Figure 1.

15. The method as claimed in claim 1, wherein the primers used in the current disclosure consist of forward and reverse primers.

16. The method as claimed in claim 15, wherein the forward primers comprises nucleic acid sequence selected from group consisting of SEQ ID NO: 7, SEQ ID NO: 11 , SEQ ID No: 14, SEQ ID NO: 19 and SEQ ID NO: 22.

17. The method as claimed in claim 15, wherein the reverse primers comprises nucleic acid sequence selected from group consisting of SEQ ID NO: 8, SEQ ID NO: 12,SEQ ID No: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20 and SEQ ID NO: 23.

18. The method as claimed in claim 15, wherein the forward and the reverse primers have a sequence as given in SEQ ID NO: 22 and SEQ ID NO: 23.

19. The method as claimed in claim 1, wherein the single degenerate designed probe comprises nucleic acid sequence selected from group consisting of SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID No: 16, SEQ ID NO: 21 , and SEQ ID NO: 24.

20. The method as claimed in claim 19, wherein the single degenerate designed probe has a sequence set forth in SEQ ID NO: 24.

21. The method as claimed in claim 18 or 20, wherein the primers and the probe are used for detecting and optionally quantifying residual genomic DNA having an Alu sequence, from a biological sample containing a protein product.

22. The method as claimed in claim 19, wherein the probe comprises a fluorescent reporter 6-carboxyfluorescein (FAM) at 5Λ end, or a quencher Black Hole Quencher 1 (BHQ-1) at 3Λ end, or a combination thereof.

23. A kit for detecting and optionally quantifying residual genomic DNA having an Alu sequence, from a biological sample containing a protein product, said kit comprising the primer as claimed in claim 18, the probe as claimed in claim 20 and quantitative real-time PCR reagents, optionally along with an instruction manual.

Description:
METHOD OF DETECTING RESIDUAL GENOMIC DNA AND A KIT

THEREOF"

TECHNICAL FIELD

The present disclosure describes a sensitive method of determining and detecting residual DNA in a biological sample wherein specific region of Alu equivalent sequences in Chinese Hamster Ovary (CHO) genomic DNA is detected with the help of designed primers and probe. The designed probe helps in increasing the sensitivity as well as the accuracy of the method. Hence the identification of specific regions within the CHO genomic DNA increases the overall sensitivity and efficiency of the detection process.

BACKGROUND AND PRIOR ART OF THE DISCLOSURE

Biopharmaceuticals are medical drugs produced using biotechnology. They include proteins (including antibodies), nucleic acids (DNA, RNA or antisense oligonucleotides) and living microorganisms like virus and bacteria where the virulence of viruses and bacteria is reduced by the process of attenuation. They can be used for therapeutic or in vivo diagnostic purposes, and are produced by means other than direct extraction from a native (non-engineered) biological source.

The removal of host cell impurities is a critical step in the production of biopharmaceutical products. One impurity targeted for clearance during the purification process is residual DNA arising from host cells. Traditional methods of quantitating residual host cell DNA have been limited by laborious sample preparation protocols, lack of sensitivity and specificity, and slow time to results.

In the process of manufacturing biopharmaceuticals, strict guidelines need to be followed to ensure the final pharmaceutical formulations are free of impurities and meet the regulatory guidelines. Among the common impurities are host cells which need to be removed before the final formulation. Majority of the therapeutic proteins are expressed using recombinant technology in the cell lines derived from the rodent family. The purified protein from the bulk harvest is likely to carry traces of CHO host cell genomic DNA into the final formulation thereby posing additional risk. Therefore, regulatory agencies like the world health organization (WHO), European Medicine Agency (EMA) or Food and Drug Administration (FDA) have documented the permissible levels of host cell DNA per dose of the drug administered to the patients in the guidelines. The present disclosure discloses a method of early detection and quantification of host cell genomic DNA contamination using a quantitative real time PCR.

In addition to potential safety issues stemming from the presence of extraneous host cell DNA, the regulatory guidance for products produced in cell culture is that DNA content in the final product should be as low as possible, as determined by a highly sensitive method. Traditional methods of quantifying levels of residual host cell DNA have been limited by lack of sensitivity and specificity, poor assay reliability, low sample capacity, slow time-to-results and high cost per sample tested.

Residual DNA in final bulk products should be generally lower than 100 pg per therapeutic dose {Points to consider in the manufacture and testing of monoclonal antibody products for human use, Centre for Biologies Evaluation and Research, US food and drug administration; 28th February, 1997; www.fda.gov/cber/gdlns/ptc_mab.pdf).The FDA also recommends that detection methods used be sensitive enough to detect residual amounts present in picogram levels. Three techniques have the required sensitivity to detect picogram levels of impurities: hybridization, methods based on DNA-binding protein and quantitative PCR (q-PCR). Goldman et al. (Clinical chemistry 37: 1523, 1991) describes a method of determining levels of DNA contamination using polymerase chain reaction. However, the methods described here are not sensitive enough to detect contamination in picogram levels. U.S. Pat. No. 5,393,657 discloses the use of primers and PCR amplification to detect the presence of host cell DNA contamination. However, conventional molecular biology techniques such as Southern blotting and gel electrophoresis are used for detection and quantification of the impurities. This reduces the overall ease of operation as well as the sensitivity.

U.S. Patent application No. 2009/0325175 discloses a method of identifying host cell genomic DNA contamination utilizing quantitative real time Polymerase chain reaction, wherein the qPCR primers are complementary to the highly repetitive host cell genomic DNA sequences, e.g. Alu-equivalent sequences. The cited document does not talk about the specific 74 bp Alu consensus sequence for which the instant disclosure become more sensitive. Therefore, a process which increases specificity and sensitivity in detecting the presence of host cell contamination is required. The present disclosure can detect the residual DNA impurities from the host cell of the rodent family as low as 20 femtogram in a recombinant protein product by using specific primers and probe.

STATEMENT OF DISCLOSURE

Accordingly, the present disclosure relates to a sensitive and quantitative method to detect the residual genomic DNA from a host cell of rodent family as low as 20 femtogram in a protein product by using primer sequence ID number 22 and sequence ID number 23 and corresponding probe sequence ID number 24 targeted to amplify Alu family of the dispersed sequences; and a kit for detecting and optionally quantifying residual genomic DNA having an Alu sequence, from a biological sample containing a protein product, said kit comprising the primer as claimed in claim 18, the probe as claimed in claim 20 and quantitative real-time PCR reagents, optionally along with an instruction manual.

BRIEF DESCRIPTION OF ACCOMPANYING FIGURES

The following description gives examples of embodiments of the present disclosure and is made with respect to the attached Figures in a purely exemplifying and non limiting manner, wherein: Fig 1: Represents the alignment of five cloned Alu-equivalent sequences designated as 2FG (Seq ID no 2), US 15 (SEQ ID no 3), 200PG (Seq ID no 4), US6 (SEQ ID no 5) and US8 (SEQ ID no 6) obtained from CHO genomic DNA used. The consensus sequence mentioned as BioAlu in the alignment is represented as SEQ ID no 1. The similarity in the aligned sequences is represented by asterisk sign at the bottom row of the alignment.

Fig 2a: Represents the amplification plots of the standard CHO DNA dilutions starting from 2 ng, 200 pg and 20 pg. Fig 2b: Represents the melting curve of the PCR product formed with the use of SEQ ID nos. 7 and 8 with the CHO genomic DNA.

Fig 3: Represents the melt curve analysis of the gradient qPCR to optimize the annealing temperature.

Fig 4a: Represents the amplification plots of standard CHO DNA dilutions- 2 ng, 200 pg, 20 pg, 2 pg, 200 fg and 20 fg obtained with optimized concentration of primers (400 nM) and at optimized annealing temperature (64°C). The plots look linear and sigmoid. Fig 4b: Represents the melting curve obtained from the PCR product amplified from the standard DNA dilutions using optimized primer and annealing temperature for the reaction. The melting curve obtained is tapering having a sharp peak which is indicative of specific amplification of the target. Fig 5: Represents the SEQ ID no 9 obtained from target amplified from the CHO genomic DNA using primers SEQ ID no 7 and 8. The length of the sequence is found to be 71 bp.

Fig 6a: Represents the amplification plots obtained by using the Taqman probe SEQ ID no 10. Fig 6b: Represents the forward and reverse primer binding regions to the SEQ ID no 9, designated as SEQ ID nos. 11 and 12 respectively. Fig 7: Represents the amplification plots obtained by using the qPCR primer SEQ ID no 11 and 12 and qPCR probe SEQ ID no 10. The plots are wavy indicating that the probe binding efficiency is poor.

Fig 8a: Represents the forward and reverse primers designated as SEQ ID nos. 14 and 15 respectively, binding regions to the Seq ID no. 9.

Fig 8b: Represents the inefficient binding of primers/probe to the target indicated by no rise in amplification plots. Fig 9: Represents the lineage 1 of the Alu-equivalent sequence. The highlighted region is the reverse primer designated as SEQ ID no 17 is designed using online software.

Fig 10: Represents the lineage 2 of the Alu-equivalent sequence. The highlighted region is the reverse primer designated as SEQ ID no 18 is designed using online software.

Fig 11: Represents the amplification plots and melting curve obtained by using the primers SEQ ID no 17 and 18.

Fig 12: Represents the redesigned forward and reverse primers designated as SEQ ID nos. 19 and 20 respectively, which are shown as highlighted on the SEQ no 9.

Fig 13a: Represents the amplification of CHO standard DNA dilutions from 2ng to 20fg using the SEQ ID no 19 and 20. Fig 13b: Represents the melting curve of the PCR product formed using the seq ID nos 19 and 20.

Fig 14: Represents the alignment of sequences obtained from residual CHO DNA detected from downstream purification process samples using the SEQ ID no 19 and 20.

Fig 15: Represents the amplification plots of 3 standard DNA dilutions using the degenerate probe seq ID no 21 and qPCR primer SEQ ID no 19 and 20. Fig 16: Represents the redesigned forward and reverse primers designated as SEQ ID nos. 22 and 23 respectively, which are shown as highlighted on the SEQ ID no 9.

Fig 17: Represents the amplification plots obtained by using optimized concentration of qPCR primers SEQ ID no 22 and 23 and degenerate probe SEQ ID no 24.

Fig 18a: Represents the amplification plots obtained by using optimized concentration of qPCR primers SEQ ID nos 22 and 23 for the CHO standard DNA dilutions from 2ng to 200fg. Fig 18b: Represents the melting curve of the PCR product formed from using the SEQ ID nos 22 and 23.

Fig 19: Represents the amplification plots obtained by using optimized concentration of qPCR primers SEQ ID nos 22 and 23 and degenerate probe SEQID no 24.

Fig 20: Represents the amplification plots of standard DNA dilutions - 2ng, 200pg and 20pg with optimized concentrations of the primers (SEQ ID nos 22 and 23) and probe (SEQ ID no 24) Fig 21a: Represents the alignment of the Alu-equivalent sequence designated SEQ ID 1 and the published Alu sequence designated as SEQ27. The similarity in the aligned sequences is represented by asterisk sign at the bottom row of the alignment. Fig 21b: Represents the alignment of the Alu-equivalent sequence designated as SEQ ID 1 with the sequence designated as SEQ 28. The similarity in the aligned sequences is represented by asterisk sign at the bottom row of the alignment.

Fig 22a: Represents the alignment of sequence ID 27 with the sequence that is identified using primers designated as SEQ ID 25 and SEQ ID 26 on CHO genomic DNA (represented as Sbjct). The alignment is done using BLAST online software.

Fig 22b: Represents the alignment of SEQ ID 28 with the sequence that is identified using primers designated as SEQ ID 29 and SEQ ID 30 on CHO genomic DNA (represented as Sbjct). The alignment is done using BLAST online software.

Fig 23a: Represents the melting curve of the PCR amplified product from the primers designated as SEQ ID 25 and SEQ ID 26 with CHO DNA. The irregular pattern of the melting curve is indicative of high degeneracy existing within the sequence.

Fig 23b: Represents the melting curve of the PCR amplified product from the primers designated as SEQ ID 29 and SEQ ID 30 with CHO genomic DNA. The irregular pattern of the melting curve indicates that the high degeneracy exists within the sequence. Fig 23c: Represents the melting curve of the PCR amplified product from the primers SEQ ID 22 and SEQ ID 23 and CHO genomic DNA.

Fig 24a: Represents the standard DNA dilution amplification plot obtained using the Taqman probe SEQ ID 24 and primers SEQ ID 22 and SEQ ID 23 and CHO genomic DNA. Fig 24b: Represents the standard curve of the CHO DNA dilutions from 2ng to 20fg using the SEQ ID 22 and SEQ ID 23 with its Taqman probe SEQ ID 24.

Fig 25: Represents the sequence comparison of target (Seq ID no 1) with the PCR product amplified from NSO genomic DNA using primers SEQ ID nos 22 and 23.

Fig 26: Represents dynamic range and sensitivity. The figure represents the amplification plots generated from running the assay with a tenfold serial dilution from 2ng to 20fg of CHO genomic DNA, purified from CHO cell line producing therapeutic protein.

Fig 27: Represents the standard curve of the CHO DNA dilutions from 2ng to 20fg along with the slope and correlation coefficient.

LIST OF PUBLISHED SEQUENCES AND CORRESPONDING PROBES AND PRIMERS

BRIEF DESCRIPTION OF ACCOMPANYING SEQUENCE LISTINGS

SEQ ID No 1 represents Alu equivalent consensus sequence

SEQ ID No 2 represents Alu equivalent sequence amplified from DNA obtained from downstream manufacturing step samples. SEQ ID No 3 represents Alu equivalent sequence amplified from DNA obtained from downstream manufacturing step samples.

SEQ ID No 4 represents Alu equivalent sequence amplified from DNA obtained from downstream manufacturing step samples.

SEQ ID No 5 represents Alu equivalent sequence amplified from DNA obtained from downstream manufacturing step samples.

SEQ ID No 6 represents Alu equivalent sequence amplified from DNA obtained from downstream manufacturing step samples.

SEQ ID No 7 represents the forward qPCR primer

SEQ ID No 8 represents reverse qPCR primer

SEQ ID No 9 represents the sequence of the target obtained from the positive clone plasmid

SEQ ID No 10 represents the Taqman qPCR probe.

SEQ ID No 11 represents redesigned forward primer

SEQ ID No 12 represents redesigned reverse primer

SEQ ID No 13 represents the sequence of the Taqman qPCR probe

SEQ ID No 14 represents the sequence of the new forward primer

SEQ ID No 15 represents the sequence of the new reverse primer

SEQ ID No 16 represents the sequence of the new Taqman probe

SEQ ID No 17 represents reverse primer

SEQ ID No 18 represents reverse primer

SEQ ID No 19 represents forward primer

SEQ ID No 20 represents reverse primer

SEQ ID No 21 represents the degenerate Taqman qPCR probe

SEQ ID No 22 represents the forward qPCR primer

SEQ ID No 23 represents the reverse qPCR primer

SEQ ID No 24 represents a new designed degenerate probe with 1 degenerate base.

SEQ ID No 25 represents the sequence of the published forward primer

SEQ ID No 26 represents the sequence of the published reverse primers

SEQ ID No 27 represents the sequence of the published Alu sequence SEQ ID No 28 represents the sequence of the published Alu sequence

SEQ ID No 29 represents the published forward primer

SEQ ID No 30 represents the published reverse primer.

DETAILED DESCRIPTION OF THE DISCLOSURE

The present disclosure relates to a sensitive and quantitative method to detect the residual genomic DNA from a host cell of rodent family as low as 20 femtogram in a protein product by using primer sequence ID number 22 and sequence ID number 23 and corresponding probe sequence ID number 24 targeted to amplify Alu family of the dispersed sequences.

In an embodiment of the present disclosure, the technique for detecting residual genomic DNA from a mammalian host organism in a protein product, said method comprising steps of:

a. extracting DNA from the sample;

b. designing suitable primers to amplify a region of the CHO genomic DNA and designing a specific probe to anneal to a target sequence located between two PCR amplification primers;

c. amplifying the targeted sequences with the pair of specific primers and a DNA-based probe with a fluorescent reporter at one end and a quencher of fluorescence at the opposite end of the probe;

d. breakdown of the probe by the 5' to 3' exonuclease activity of the Taq polymerase breaks the reporter-quencher proximity and thus allows unquenched emission of fluorescence; and

e. the increase in fluorescence and accumulation of the PCR product is continually monitored throughout the PCR reaction by real-time PCR instrument.

Wherein the change of fluorescence in each cycle is proportional to the identification of specific amplified sequences and indicates the presence of residual genomic DNA in the sample. In another embodiment of the present disclosure, the amplifying is carried out using quantitative real time PCR employing Taq DNA polymerase.

In yet another embodiment of the present disclosure, the target for Rodent family DNA qPCR is the 74bp Alu-equivalent consensus sequence.

In still another embodiment of the present disclosure, the protein product is selected from group comprising a monoclonal antibody, a therapeutic protein product and a recombinant protein product, or any combination thereof.

In still another embodiment of the present disclosure, the protein product is preferably produced by Chinese Hamster Ovary cell or mouse myeloma cell line-NSO.

In still another embodiment of the present disclosure, the fluorescent reporter is 6- carboxyfluorescein (FAM), and the quencher is Black Hole Quencher l(BHQ-l).

In still another embodiment of the present disclosure, the amplified product is detected using assay selected from group comprising SYBR green assay and TAQMAN probe assay.

In still another embodiment of the present disclosure, concentration of the forward and reverse primer is ranging from about 100 to about 800nM, preferably 400nM.

In still another embodiment of the present disclosure, the concentration of the probe is ranging from about from about 1000 to about ΙΟΟηΜ, preferably ΙΟΟηΜ.

In still another embodiment of the present disclosure, wherein the annealing is carried out at a temperature 56°C for time duration of about 1 minute. In still another embodiment of the present disclosure, minimum amount of the residual genomic DNA in the biological sample detected and optionally quantified is less than about 5 femtograms. In still another embodiment of the present disclosure, the residual genomic DNA identified from the recombinant protein product comprises of Alu equivalent concensus sequences designated as SEQ ID: 1, SEQ ID: 2, SEQ ID:3,SEQ ID:4, SEQ ID:5, SEQ ID:6 and SEQ ID:9. In still another embodiment of the present disclosure, the residual genomic DNA using specific Alu equivalent consensus sequences comprises the nucleic acid sequence of Figure 1.

In still another embodiment of the present disclosure, the primers used in the current disclosure consist of forward and reverse primers.

In still another embodiment of the present disclosure, the forward primer comprises nucleic acid sequence selected from group consisting of SEQ ID NO: 7, SEQ ID NO: 11, SEQ ID No: 14, SEQ ID NO: 19 and SEQ ID NO: 22.

In still another embodiment of the present disclosure, the reverse primers comprises nucleic acid sequence selected from group consisting of SEQ ID NO: 8, SEQ ID NO: 12,SEQ ID No: 15, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 20 and SEQ ID NO: 23.

In still another embodiment of the present disclosure, the forward and the reverse primers have a sequence as given in SEQ ID NO: 22 and SEQ ID NO: 23.

In still another embodiment of the present disclosure, the single degenerate designed probe comprises nucleic acid sequence selected from group consisting of SEQ ID NO: 10, SEQ ID NO: 13, SEQ ID No: 16, SEQ ID NO: 21, and SEQ ID NO: 24. In still another embodiment of the present disclosure, the single degenerate designed probe has a sequence set forth in SEQ ID NO: 24. In still another embodiment of the present disclosure, the primers and the probe are used for detecting and optionally quantifying residual genomic DNA having an Alu sequence, from a biological sample containing a protein product.

In still another embodiment of the present disclosure, the probe comprises a fluorescent reporter 6-carboxyfluorescein (FAM) at 5 Λ end, or a quencher Black Hole Quencher 1 (BHQ-1) at 3 Λ end, or a combination thereof.

The present disclosure further relates to a kit for detecting and optionally quantifying residual genomic DNA having an Alu sequence, from a biological sample containing a protein product, said kit comprising the primer as claimed in claim 18 the probe as claimed in claim 20 and quantitative real-time PCR reagents, optionally along with an instruction manual.

The objective of the disclosure is to design a sensitive method of detecting residual genomic DNA (gDNA) present in a given Biological sample. Such a sensitive method finds application for demonstrating the DNA clearance capacity of the manufacturing process steps (for e.g. affinity chromatography, ion exchange chromatography) that holds the potential to reduce the genomic DNA content present in biological sample to picogram and/or femtogram levels. For example, the concentration of the protein sample provided for DNA estimation from the optimized chromatography stages is typically in range of 5-15 mg/ml. It is assumed that residual DNA quantity in sample from these optimized steps ranges in femtograms (e.g. 50 fg). To facilitate the DNA quantification using a method having sensitivity in picogram levels (for e.g. 1 pg) one has to process high sample volumes during DNA extraction procedure, a limiting factor for both DNA extraction procedure as well as qPCR assays. Other most practical option is to have a method in place with sensitivities in the femtogram range (20 fg as in present disclosure) capable of precise DNA quantification obtained from low sample volumes.

Second application for a sensitive method is when purified biological proteins that are required to be administered at higher dosages. Regulatory agencies expect the DNA levels in such high dosage proteins to be below the specified standards of 10 ng/dose suitable for human use. Having a sensitive method with potential to quantitate DNA as low as 20 fg empowers one to measure and report the precise quantities of residual DNA present in the biological protein thus facilitating compliance for DNA levels. This can be clearly illustrated by an example where therapeutic proteins A, B and C whose single human dosage identified are 300 mg, 1200 mg and 2000 mg respectively. It is assumed that the chromatography process step is optimized and the DNA content in the purified protein is as low as 100 fg/mg. Using methods having sensitivity limits as 1 pg/ reaction (method 1 , a qPCR based method for residual CHO genomic DNA quantification where the target is other than Alu-equivalent consensus sequence) and 20 fg/ reaction (method 2), the DNA levels estimated for a dose would work out as mentioned in Table 1 below:

Table 1: Impact of assay sensitivities on DNA quantities estimated for varying doses of therapeutic proteins

The observations from the Table 1 clearly indicate that high sensitivity of the method 2 facilitates precise quantification of the DNA levels per dose of therapeutic protein and provides greater assurance even if high dosage biological proteins are to be analyzed for DNA compliance.

Accordingly, qPCR method is developed using specific primers and probe designed to detect the presence of low concentrations (as low as 20 or 2 femtograms) of residual CHO genomic DNA in the given biological sample.

Another objective of the present disclosure is to determine the presence of residual genomic DNA using specific Alu equivalent consensus sequences as specified in SEQ ID: 1 by using primer sequence ID number 22 and sequence ID number 23 and corresponding probe sequence ID number 24 targeted to amplify Alu family of the dispersed sequences. Therapeutic recombinant proteins are expressed mainly in CHO cells (Chinese Hamster Ovary cells) and NSO cells (Mouse Myeloma cell line), belonging to Rodent family. The purified proteins from the bulk harvest is likely to carry traces of host cell genomic DNA into the final formulation thereby posing additional risk. Therefore, regulatory agencies like World Health Organisation (WHO), European Medicine Agency (EMA) or Food and Drug Administration (FDA) have documented in the guidelines the permissible levels of host cell DNA per dose of the drug administered to the patients.

The present disclosure provides a sensitive method for the detection of residual genomic DNA from a mammalian host organism as low as 20 femtogram in a recombinant protein product by using specific primers and probe. The said method comprising of:

i) Extraction of DNA from the sample

ii) Designing suitable primers to amplify a region of the CHO genomic DNA and a specific probe designed to anneal to a target sequence located between two PCR amplification primers

iii) Amplifying the targeted sequences with the pair of specific primers and a DNA- based probe with a fluorescent reporter at one end and a quencher of fluorescence at the opposite end of the probe.

iv) Breakdown of the probe by the 5' to 3' exonuclease activity of the Taq polymerase which breaks the reporter-quencher proximity and thus allows unquenched emission of fluorescence

v) The increase in fluorescence and accumulation of the PCR product is continually monitored throughout the PCR reaction by real-time PCR instrument.

Wherein the change of fluorescence in each cycle is proportional to the identification of specific amplified sequences and indicates the presence of residual genomic DNA in the sample. One embodiment of the present disclosure provides a method of detecting host cell DNA in a given sample using sensitive assay techniques like real time PCR. The method involves extraction of DNA, designing suitable primers to amplify a region of the CHO genomic DNA and monitoring the accumulation of the PCR product with the help of a real time PCR instrument.

Another embodiment of the present disclosure involves detecting and amplifying genomic DNA with the help of specific primers and probes. The probe is designed to anneal to a target sequence located between two PCR amplification primers and is labeled with the fluorescent reporter dye FAM (6-carboxyfluorescein), at the 5' end and the quencher dye, BHQ-1 (Black Hole Quencher 1) at the 3' end. The close proximity of the two dyes leads to the quenching of the reporter dye fluorescence. During polymerization, the exonuclease activity of Taq polymerase cleaves the probe, separating the quencher and reporter dyes, thus resulting in increased reporter dye fluorescence. This increase in fluorescence and accumulation of the PCR product is continually monitored throughout the PCR reaction by real-time PCR instrument. Within the linear range of amplification established for known quantities of DNA, the amount of residual CHO DNA in the sample is extrapolated on standard curve generated from cycle threshold values versus known quantities of CHO genomic DNA.

In one embodiment of the present disclosure, the target for CHO cell DNA qPCR is the Alu-equivalent consensus sequence described by Haynes et al (1981 , Mol. Cell. Biology 1 :573-83).

In another embodiment of the present disclosure, the target sequence is a truncated region of the Haynes sequence as disclosed in SEQ ID NO: 1.

In another embodiment of the present disclosure the primers used in the current disclosure consist of forward and reverse primers. In another embodiment of the present disclosure the forward and reverse primers used in the current disclosure are disclosed in SEQ ID NO: 22 and SEQ ID NO: 23 respectively. In another embodiment of the present disclosure the PCR amplified products are detected using SYBR green assay and TAQMAN probe assay.

In still another embodiment of the present disclosure, the Taqman probe disclosed herein has the sequence as seen in SEQ ID NO: 24.

In another embodiment the Taqman probe disclosed in SEQ ID NO: 24 has a single degeneracy which increases the sensitivity and accuracy of the probe as against multiple degenerate bases

In yet another embodiment of the present disclosure, the specific sequence of primers and probe disclosed herein, increase the sensitivity and efficiency of the method.

In still another embodiment of the present disclosure, the sequence disclosed in SEQ ID NO: 1 is detected at lower concentrations and much faster when compared to sequences in existing disclosures. As illustrated in example no 4 & 13, the identification of 74bp sequence as well as the design of primers to amplify the target sequence demonstrates the efficacy of the method.

The term "protein" is meant to include a sequence of amino acids for which the chain length is sufficient to produce the higher levels of secondary and/or tertiary and/or quaternary structure. This is to distinguish from "peptides" or other small molecular weight drugs that do not have such structure. In one embodiment, the proteins used herein have a molecular weight of at least about 47 kD. Examples of proteins encompassed within the definition used herein include therapeutic proteins. A "therapeutically active protein" or "therapeutic protein" refers to a protein which may be used for therapeutic purposes, i.e., for the treatment of a disorder in a subject. It should be noted that while therapeutic proteins may be used for treatment purposes, the disclosure is not limited to such use, as said proteins may also be used for in vitro studies. In a preferred embodiment, the therapeutic protein is a fusion protein or an antibody, or antigen-binding portion thereof. In one embodiment, the methods and compositions of the disclosure comprise at least two distinct proteins, which are defined as two proteins having distinct amino acid sequences. Additional distinct proteins do not include degradation products of a protein.

As used herein the the term "monoclonal antibody" as used herein refers to an antibody obtained from a population of substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are identical except for possible naturally occurring mutations that may be present in minor amounts. Monoclonal antibodies are highly specific, being directed against a single antigenic site. Furthermore, in contrast to conventional (polyclonal) antibody preparation which typically includes different antibodies directed against different determinants (epitopes), each monclonal antibody is directed against a single determination on the antigen. The modifer "monoclonal" indicates the character of the antibody as being obtained from a substantially homogeneous population of antibodies, and is not to be construed as requiring production of the antibody by any particular method. For example, the monoclonal antibodies to be used in accordance with the present disclosure may be made by the hybridoma method first described by Kohler et al., Nature 256:495 (1975), or may be made by recombinant DNA methods (see, e.g., U.S. Pat. No. 4,816,567). The "monoclonal antibodies" may also be isolated from phage antibody libraries using the technique described in Clackson et al, Nature 352:624-626 (1991) and Marks et al, J. Mol. Biol. 222:581-597 (1991), for example.

The CHO cells described here may be used as host cells for expression of any protein of interest. This may be done by means known in the art. Protein expression in CHO cells and cell lines is well described in the literature, and the skilled person will have little difficulty in using the CHO cells and cell lines described here as hosts for protein expression. Thus, for example, the CHO cells and cell lines may be transfected by means known in the art with expression vectors capable of expressing the protein of interest.

As used herein the term "expression" refers to a process by which a polypeptide is produced based on the nucleic acid sequence of a gene. The process includes both transcription and translation.

As used herein, the term "primer" refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, which is capable of acting as a point of initiation of synthesis when placed under conditions in which synthesis of a primer extension product which is complementary to a nucleic acid strand is induced, (i.e., in the presence of nucleotides and an inducing agent such as DNA polymerase and at a suitable temperature and pH).

The term "pharmaceutical formulation" refers to preparations which are in such form as to permit the biological activity of the active ingredients to be unequivocally effective, and which contain no additional components which are toxic to the subjects to which the formulation would be administered.

The phrases "contaminant genomic DNA," "residual genomic DNA," and the like refer to any nucleic acid molecules remaining in the sample, e.g., purified protein sample, pharmaceutical formulation, etc. Nucleic acid molecules remaining in the sample can comprise apoptotic DNA fragments, i.e., fragments of DNA resulting from programmed cell death.

The term "Melting curve analysis" refers to an assessment of the dissociation- characteristics of double-stranded DNA during heating. The information gathered can be used to infer the presence and identity of single-nucleotide polymorphisms, to detect primer dimers, contaminating DNA, and PCR product from mis-annealed primer.

The term Femtogram refers as a unit of mass equal to 0.000 000 000 000 001 grams and with symbol fg. Picogram is a unit of mass equal to one trillionth of a gram (10 12 ), represented as pg.

The term "probe" refers to RNA or DNA fragment, radioactively or chemically labeled, that is used to detect specific nucleic acid sequences by hybridization TaqMan probes depend on the 5'-nuclease activity of the DNA polymerase used for PCR to hydrolyze an oligonucleotide that is hybridized to the target amplicon. TaqMan probes are oligonucleotides that have a fluorescent reporter dye attached to the 5' end and a quencher moiety coupled to the 3' end. These probes are designed to hybridize to an internal region of a PCR product. In the unhybridized state, the proximity of the fluorophore and the quench molecules prevents the detection of fluorescent signal from the probe. During PCR, when the polymerase replicates a template on which a TaqMan probe is bound, the 5'- nuclease activity of the polymerase cleaves the probe. This decouples the fluorophore and quenching dyes and FRET no longer occurs. Thus, fluorescence increases in each cycle, proportional to the amount of probe cleavage.

As used herein the term Quenching refers to any process which decreases the fluorescence intensity of a given substance.

As used herein the term Taqman degenerate probe is also called as double labeled oligonucleotides or double dye probe where one of the nucleotide is a degenerate base such as R = A or G nucleotide. A fluorophore is attached to the 5' end of the probe and a quencher to the 3' end. The fluorophore, FAM(6-carboxyfluorescein) is excited by the machine and passes its energy to the quencher, black hole quencher(BHQ). As used herein the term 5' - 3' exonuclease activity of the thermostable enzyme Thermus aquaticus DNA polymerase may be employed in a polymerase chain reaction product detection system to generate a specific detectable signal concomitantly with amplification.

The Ct (cycle threshold or cycle number) is defined as the number of cycles required for the fluorescent signal to cross the threshold (i.e. exceeds background level). Ct levels are inversely proportional to the amount of target nucleic acid in the sample (i.e. the lower the Ct level the greater the amount of target nucleic acid in the sample).

As used herein the term SYBR Green refers to a highly specific dye that binds to minor grooves of the double stranded DNA to detect the PCR product as it accumulates during the qPCR reaction. SYBR green assay provides the simplest and most economical format for detecting and quantitating PCR products in real-time reactions. SYBR Green upon excitation emits light. Thus, as a PCR product accumulates, fluorescence increases. The advantages of SYBR Green are that it is inexpensive, easy to use, and sensitive.

The disclosure is further elaborated with the help of following examples. However, these examples should not be construed to limit the scope of the disclosure. EXAMPLES:

Example: 1

Biological sample:

Any monoclonal antibodies or therapeutic proteins expressed in CHO-S cell line (obtained from Invitrogen cat#l 1619012).

Preparation of Standard DNA:

Genomic DNA from CHO cells is extracted from monoclonal antibody production strain by following phenol chloroform extraction method. DNA is resuspended in 10 mM Tris- 1 mM EDTA, pH 8.0, the concentration determined, and diluted to a storage concentration of 100 ng/μί and a working concentration of 10 ng/μΕ. This DNA is used as Standard in qPCR.

Extraction of DNA from Biological sample:

DNA from biological sample is extracted by using silica based DNA extraction columns where DNA binds to the silica membrane due to change in pH to acidic and DNA is eluted from the membrane with the addition of water which changes pH to neutral.

Example: 2

Materials and Method

The primers are obtained from Eurofins (Germany) and probe from Eurogentec (Belgium). Real time PCR machines Mx3000P from Stratagene and Realplex 2 from Eppendorf is used in this study. Reagents for Taqman qPCR are obtained from Thermo- Absolute qPCR probe master mix. SYBR green master mix is obtained from Bioline.

Example: 3

Determination of Alu-Equivalent Consensus Sequence:

The target for CHO cell DNA qPCR is the Alu- equivalent consensus sequence described by Haynes et al., Based on the 74 bp consensus sequences, forward and reverse primers are designed and used for qPCR. The template DNA for qPCR is extracted from the CHO cells routinely used for expression of biotherapeutic proteins. Six PCR reactions from different concentration of CHO genomic DNA starting from 2 ng are performed to confirm the presence of single band with the expected size of 74 bp. Reconfirmation is done using automated capillary based gel electrophoresis- Multi NA (Schimadzu). Each PCR product is subcloned into high copy number E.coli vector using blunt end cloning kit (Fermentas), transformed into chemically competent E.coli cells and plated on selection plates. From each transformation, 10 colonies are randomly selected (60 transformants) and screened for the presence of the insert by PCR.

95% of the transformants are positive for the presence of insert and among them 5 positive clones are selected for plasmid isolation and sequencing. The sequence of 5 clones is aligned using CLUSTAL W online software. An alignment of the six sequences including the consensus sequence is shown in Fig 1.

Example: 4

Primer and Probe Optimization:

Primers are designed to amplify a region of CHO Genomic DNA, and a specific probe designed to anneal to a target sequence located between two PCR amplification primers is labeled with the fluorescent reporter dye, FAM (6-carboxyflourescein), at the 5' end and the quencher dye, BHQ-1 (Black Hole Quencher 1), at the 3' end. The close proximity of the two dyes leads to quenching of the reporter dye fluorescence. During polymerization, the exonuclease activity of Taq polymerase cleaves the probe, separating the quencher and reporter dyes, thus resulting in increased reporter dye fluorescence. This increase in fluorescence and accumulation of the PCR product is continually monitored throughout the PCR reaction by real-time PCR instrument. Within the linear range of amplification established for known quantities of DNA, the amount of residual CHO DNA in the sample is calculated by extrapolating on standard curve generated from cycle threshold values versus known quantities of CHO genomic DNA.

The development of residual DNA detection assay started with the novel designing of primers based on the Haynes Sequence (Mol. Cell. Biology 1 :573-83). Seq ID nos 7 and 8 represent the forward qPCR primer and reverse qPCR primer respectively. Using these qPCR primers, SYBR green assay is performed where the concentration of the primers used is 100 nM. See Fig 2a for the amplification plots obtained. To improve the melting curve (Fig 2b) for obtaining tapering and sharp peaks, the annealing temperature and concentration of forward and reverse primers are optimized.

To optimize the annealing temperature, a gradient qPCR is performed in the temperature range of 55°C to 65°C in Realplex 2 qPCR machine from Eppendorf where the primer concentration is maintained at ΙΟΟηΜ. The melt curve analysis revealed that 64°C annealing temperature is optimal as at this temperature the melting curve showed a peak comparatively better than those observed at other temperatures within the temperature range (see Fig.3). All the subsequent experiments are further carried out at 64°C as annealing temperature.

To further improve the appearance of the melting curve or to specifically amplify the specific target, the primer concentration is varied. The concentration of forward and reverse primers is optimized to achieve early C T in amplification plot graph with a PCR product showing a single peak in a melting curve analysis in the SYBR green qPCR.

Both the primers are titrated at 100 nM, 200 nM, 400 nM and 800 nM concentrations and the C T values are subsequently compared. During these primer optimization trials, a single concentration of CHO genomic DNA (2 ng) is used. At 400 nM concentration both forward and reverse primers gave higher fluorescence and C T (Cycle threshold) at -15, and is used in subsequent experimentation where the primer efficiency is checked across DNA dilution range (Figs. 4a and 4b).

The PCR product is sequenced by cloning the fragment into E.coli high copy number vector and transformed into the E.coli competent cells. The transformants are screened for the presence of the insert. Seq ID no 9 represents the sequence of the target obtained from the positive clone plasmid (Fig 5).

Based on these results, a probe is designed to perform Taqman assay for specific amplification of the DNA where the probe binds to the target and emits fluorescence.

A Taqman qPCR assay is performed using the qPCR Probe SEQ ID no. 10 and primers forward and reverse SEQ ID no 7 and 8 with the CHO genomic DNA. The concentration of the probe used is 20 nM. The assay results revealed that the amplification plots are not sigmoid; instead drooping plots are obtained (Fig. 6a) indicating inefficient reaction between primer, probe and template DNA. Further, to improve the appearance of the amplification plots, the probe concentration is titrated against varied primer concentrations at 64°C as annealing temperature. No improvement is observed in the appearance of the amplification plots as observed in the Fig 6a.

Gradient PCR with temperature ranging from 53°C to 63°C is performed where the forward and reverse primer concentration of 400 nM and probe of 20 nM is used. 55°C temperature showed the plots with highest fluorescence intensity, however the C T obtained is at cycle number ~18 indicating delayed amplification of target DNA.

Hence the primers and the probe are redesigned based on the sequence of the specific target. SEQ ID 1 1 and 12 correspond to the redesigned primers as shown in the highlighted region on SEQ ID 9. The difference in redesign of forward primer is deletion of 2 bases at the 5' end and addition of 3 bases at the 3' end as compared to SEQ ID 7. Similarly for the redesign of the reverse primer (Seq ID no 12), 3 bases are deleted at the 3' end as compared to the seq ID no 8. The redesigned reverse primer is depicted by highlighted region on the complimentary strand of the seq ID no 9 (Fig 6b). The amplicon length using SEQ ID 1 1 andl2 primers are found to be 69 bp.

Taqman qPCR assay is performed using 400 nM concentration of the redesigned primers seq ID no 11 and 12 and 20 nM concentration of the Taqman qPCR probe Seq ID no 10. Three standard DNA dilutions-2 ng, 200 pg and 20 pg are used to perform the assay. The probe/primers binding efficiency to the template DNA is found to be inappropriate as shown by wavy amplification plots (Fig 7).

Based on these results, another probe is designed to bind to the target more efficiently resulting in better appearance of the amplification plots. Seq ID no 13 represents the sequence of the Taqman qPCR probe. On using Taqman probe (seq ID 13) with seq ID nos 11 and 12 primers on the template DNA, it is found that no amplification plots are observed indicating the poor performance of the probe as well as primers. The efficiency of the probe binding to the target is not good and hence the primers and probe are redesigned.

Seq ID nos 14 and 15 represent the sequence of the new forward and reverse primers and SEQ ID no 16 represents the sequence of the new Taqman probe.

The designing strategy is changed where the forward primer is designed on the complimentary strand of seq ID no 9 (shown by highlighted region in fig 8a) and similarly the reverse primer is designed on the sense strand (shown by highlighted region in fig 8 a) on seq ID no 9. The amplicon length is reduced to 66 bp with this design. The probe region (seq ID no 16) in the seq ID no 9 is shown in underlined font in Fig 8a. When the Taqman qPCR assay is performed using the forward and reverse primer SEQ ID nos 14 and 15 along with Taqman qPCR probe SEQ ID 16, the amplification of the standard DNA dilutions is not observed (Fig 8b). The reason could be the inefficient binding of primers and/or probe with the target DNA. After unsuccessful attempts of designing of optimum primer and probe combinations within 71 bp Alu-equivalent sequence, we explored sequence length flanking the 3' end of the SEQ ID 9. It is observed that the sequences downstream of 3' end of 71 bases are falling in 2 lineages. One of the lineages had sequence stretch of -293 bases designated as lineage 1 and another lineage (lineage 2) is having a sequence stretch of 420bases. The reverse primers are designed from both the lineages using online available software (PrimerQuest from Integrated DNA technologies). The best primer design results from the software for lineage 1 had amplicon length of 180 bases where as for lineage 2 it is 355 bases. The sequence stretches from lineage 1 and 2 are shown in Fig 9 and Fig 10. Using forward primer seq ID no 7 and reverse primer seq ID no 17 from lineage 1 , SYBR green qPCR assay is performed at 60°C annealing temperature. Delayed amplification plot for the standard DNA is observed at cycle number 25 and a blunt melting curve is observed for the standard DNA dilution.

Similar results are observed for qPCR results using forward primer SEQ ID 7 and reverse primer SEQ ID 18 from lineage 2. Amplification plot for standard DNA is observed at cycle number 32 and a sharp melting curve. The delayed cycle number made this primer inefficient (Fig 11).

These results indicate that the sequence stretch beyond 71 bases (like sequence stretch of 293 and 420 bases) are inefficient in providing targets for the residual DNA estimation assay as the DNA amplification is observed in the later cycles (beyond 25 cycles) and hence are not suitable. Therefore it is apparent that one should confine and optimize the target in and around the identified 71 base region in Alu-equivalent sequence.

Based on the above observations from the experiments, redesigning of primers is initiated considering and confining the design around 71 bp Alu equivalent sequence target. The forward primer SEQ ID no 19 is designed with the addition of 2 bases at the 5' end and deletion of 1 base at the 3' end of the seq ID no 7 forward primer. Similarly the reverse primer SEQ ID no 20 is designed with the addition of 5 bases at the 5' end and deletion of 3 bases at the 3' end of the seq ID no 8 reverse primer. The amplicon size is found to be 79bp (Fig 12). Upon performing SYBR green assay using the SEQ ID 19 and 20 primers with the CHO DNA, good amplification plots and sharp melt curve is obtained indicating the amplification of specific target (Fig 13a and 13b).

Several PCR reaction are set using the qPCR primer SEQ ID nos 19 and 20 and different concentrations of standard CHO genomic DNA. The PCR amplified products from downstream processing samples are sequenced and aligned using the online software- CLUSTAL W. The variance of Alu-equivalent sequences are observed on sequencing and aligning the PCR products (Fig 14). Based on the variations observed in the alignment (Fig 14), a degenerate probe is designed. The aim of designing a degenerate probe is to attain broad dynamic range so as to have reactivity with the sequence variants identified during alignment as well as provide high sensitivity and differential signal to background ratio. Seq ID no 21 represents the degenerate Taqman qPCR probe.

Degenerate qPCR probe seq ID no 21 along with the forward and reverse primer SEQ ID

19 and 20 is used to set up a Taqman qPCR. Standard DNA dilutions of 2 ng, 200 pg and

20 pg are analyzed in Mx3000P Stratagene machine. Inefficient amplification plots are obtained that is wavy instead of being sigmoidal (Fig 15).

Taking a clue from the inefficient binding of designed primer/probe, further modifications are made. To the existing forward primer SEQ ID no 19, 2 bases are added at the 3' end to modify the primer as represented by SEQ ID no 22. The reverse primer is also modified by addition of 3 bases at the 3' end and deletion of 5 bases at the 5' end. SEQ ID no 23 represents the reverse primer after modification. The amplicon length is found to be 74bp (Fig 16).

A novel degenerate probe with only 1 degenerate base is designed (represented by seq ID no 24) in order to reduce the degeneracy in the probe which might have an impact on the efficiency and probe binding to the target.

Using the primer SEQ ID 22 and 23 and probe seq ID no 24 Taqman assay is performed at 56°C annealing temperature. The CHO standard DNA dilutions from 2 ng to 20 fg are analysed and the amplification signals are detected as early as ~15 cycle number indicating that the binding efficiency of primer and probe are compatible with the template DNA (see Fig 17).

Both the primers are titrated at 100 nM, 200 nM, 400 nM and 800 nM concentrations in the SYBR green assay. During these primer optimization trials, a single concentration of CHO genomic DNA is used. At 400 nM concentration both forward and reverse primers gave higher fluorescence and early Gr at cycle no. ~12 is seen. The 400 nM concentration of forward and reverse primer (SEQ ID 22 and 23) is used in subsequent experimentation (See Fig 18a and 18b).

Degenerate qPCR probe SEQ ID 24 along with the forward and reverse primer SEQ ID no 22 and 23 is used to set up a Taqman qPCR. Standard DNA dilutions of 2 ng, 200 pg, 20 pg, 2 pg, 0.2 pg and 20 fg are analysed in Realplex 2 (Eppendorf) machine (see Fig 19). The DNA amplification is observed from cycle ~15 and is uniform across all the standard dilutions. The reaction had annealing temp of 56°C.

After primer concentration, Taqman probe concentration is titrated at 1000 nM, 500 nM, 400 nM, 200 nm and 100 nM concentrations and the C T values are subsequently compared. Significant improvements in the appearance of the amplification plots and sensitivity are noted with change in probe concentration. Concentration of probe is optimized to 100 nM based on the appearance of the amplification plots and early C T values and this is the concentration used for subsequent experimentation (see Fig 20).

Based on the above experiments, the forward/ reverse primer and probe design is finalized for assay development activity. The forward and reverse primers (SEQ ID 22 and 23) are designated as "forward qPCR primer" and "reverse qPCR primer" respectively. Similarly probe with SEQ ID 24 is designated as "Degenerate qPCR Probe".

Example: 5

Annealing Temperature optimization Annealing temperature optimization is done to get early CT, sigmoid amplification plots and a single peak in melting curve in SYBR green qPCR. Using gradient PCR (Realplex2), between 50° to 60°C, the annealing temperature is optimized to 56°C, at which the melting curve showed a single neat peak signifying amplification of a single product. At the same temperature the probe based Taqman qPCR is also optimized and is used for subsequent experimentation.

Example: 6

DNA preparation using QIAamp Viral RNA Kit

The extraction protocol is optimized to improve the recovery of residual genomic DNA from the samples received from the therapeutic proteins purification process. 200 μΐ of the sample spiked with genomic DNA (positive control) is treated with 560 μΐ Buffer AVL-lysis buffer in the presence of carrier RNA. This is allowed to react for 5 mins at room temperature. To this, 560 μΐ of absolute alcohol is added and mixed by pipetting. 640 μΐ of the mixture is loaded onto the column provided in the kit and centrifuged at 4500 rpm for 2 mins. Flow through is discarded and remaining mixture is loaded again and centrifuged at 4500 rpm for 2 mins. After discarding the flow through the empty column is centrifuged at 8000 rpm for 2 mins. 500 μΐ of wash buffer AW1 is added and centrifuged at 8000 rpm for 2 mins followed by 2 mins centrifugation at 12000 rpm. Flow through is discarded and 500 μΐ of wash buffer AW2 is added with changing the collection tube and centrifuged at 12000 rpm. Flow through is discarded, followed by dry spin for 2 mins at 13000 rpm. To elute the bound DNA, 40 μΐ of water preheated to 70°C is added, incubated at 70°C for 20 mins and centrifuged at 12000 rpm for 2 mins. The elution step is repeated to get a final elution volume of 80 μΐ. Similar protocol is followed for a test sample without spike DNA which serves as unspiked sample.

Example: 7

qPCR reaction.

The qPCR reaction is set up in a dedicated PCR preparation room to avoid contamination related issues.

The cocktail preparation, addition of extracted test sample to PCR tubes and standard DNA dilution (and addition) is performed in dedicated rooms. All the samples and standards are analyzed in triplicates. Each 50 μΐ reaction consisted of PCR grade water-4 μΐ (in case of SYBR green assay) or 3 μΐ (in case of Taqman assay), IX master mix-25 μΐ [Sensimix SYBR green (cat no QT625-05) or Absolute Taqman qPCR (cat no AB- 1136/A) master mix-composition not disclosed by the manufacturer], 400 nM of each forward and reverse primer and 100 nM of probe is used whenever Taqman qPCR assay is performed. The template DNA added to the standard DNA dilution tubes is 20 μΐ. In case of No template control, water is added instead of template DNA. Rest of the tubes are then moved to the dedicated sample preparation room. For spike and/or unspike samples, 10 μΐ of PCR grade water and 10 μΐ of extracted DNA from samples is added. The strips are sealed and centrifuged briefly. It is run on Stratagene Mx3000P instrument using the standard thermal cycler protocol which consisted of

1) Initial denaturation or Polymerase activation step at 95°C for 15 mins (Absolute Taqman qPCR master mix) or 10 mins (Sensimix SYBR green master mix).

2) 40 cycles of denaturing at 95°C for 15 seconds and annealing/extension at 56°C for lmin.

Example: 8

Standard curve preparation.

The stock CHO genomic DNA used for standard DNA dilutions is extracted and prepared from the CHO cells that are used for the production of biotherapeutic proteins. The extracted DNA is quantitated using Nanodrop spectrophotometer (Thermo scientific). The purity of DNA is assigned using absorbance ratio at 260/280nm. The purity as depicted from the ratio is found to be in the range of 1.8 to 2.0 (Molecular cloning, a laboratory manual by Sambrook & Russel, Third edition, Cold Spring Harbor laboratory press).

The starting stock of DNA is maintained at 100 ng/μΐ. Ten fold serial dilutions of the stock DNA is performed to obtain standards of 10 ng, 100 pg, 10 pg, 1 pg, 100 fg, 10 fg and 1 fg. These standard dilutions are used to achieve standard curve of 2 ng, 200 pg, 20 pg, 2 pg, 200 fg and 20 fg in qPCR.

Example: 9

Specificity of the assay

To check the specificity and reactivity of the primers and probe, genomic DNA from different sources are used as template. DNA from mammalian sources (other than CHO) like NSO and human, non-mammalian sources (E.coli and Pichia) are used. The results showed that the primers are reacting only to CHO and NSO as NSO also belongs to rodent family and it is well known in literature that the Alu-equivalent sequences are dispersed across rodent family. Other sources of DNA like human, E.coli and Pichia did not show reactivity to primers and probe, establishing the specificity of the primers.

Based on the primer reactivity result to NSO template DNA the qPCR primers seq ID nos 22 and 23 and degenerate probe seq ID no 24 are used to identify the presence of Alu equivalent sequences in NSO cell line also. The optimized concentrations of primers and probe are used for the DNA concentrations 2000 pg, 200 pg, 20 pg, 2 pg and 0.2 pg. Results are shown in the Table no. 2. Table 2 Comparison of C T s of standard DNA dilutions of CHO and NSO gDNA using SEQ ID nos 22 and 23.

Amount of C T s with C T s with NSO Standard CHO gDNA

DNA(pg) gDNA

2000 15.85 23.89 200 19.50 27.27

20 23.04 30.81

2 26.60 34.34

0.2 30.38 38.13

The results revealed that the primers react with NSO genomic DNA also, confirming the presence of Alu Equivalent sequences in NSO cell lines. Host cell DNA estimation in therapeutic proteins expressed from NSO cell line is done according to the Example 6. The experiment is set up to evaluate the NSO gDNA detection capability of the forward and reverse primers along with degenerate probe. The standard curve preparation and setting up qPCR reaction is exactly followed as described in Examples 7 and 8.

On analysis of samples, the assay is able to detect as little as 0.3 pg of residual NSO gDNA in the therapeutic protein expressed in NSO cell line (Table 3).

Table 3: Detection capability of primers Seq ID 22 & 23 and probe seq ID 24 with NSO cell line derived biological sample.

Amount of C T s with NSO Mean DNA

Standard gDNA quantity

DNA(pg)

2000 23.73 NA

200 27.61 NA

20 31.03 NA

2 33.38 NA

1 34.98 NA

Biological 36.57 0.3pg

sample The PCR amplified NSO gDNA product is cloned into E.coli high copy number plasmid and the sequence of the insert is verified and compared with the SEQ ID 1 (CHO gDNA). Nucleotide sequence revealed degeneracy in the probe binding region in NSO gDNA as compared to CHO gDNA (SEQ ID 1) as shown in Fig 25.

Example 10:

Sequence Comparison of the target sequence with other Alu targets.

The target sequence designated as SEQ 1 is compared with the Alu-equivalent sequences available in public domain. Since the rational of seq ID 27 & Seq ID 28 is being used as target Alu sequence for residual DNA quantification from biological proteins as described in patents US2009325175 and US5393657 respectively. Therefore the following sequence comparison of Seq ID 1 with Seq ID 27 & 28 was performed to identify the sequence variations between the target identified in present disclosure and those reported in public domain. The published sequences designated as SEQ ID 27and SEQ ID 28 are compared with the seq ID 1 using the CLUSTAL W online software. The percentage homology observed with the sequence ID 27 is 50% and hence the percentage variance is 50% (Fig 21a). The comparison of sequence ID 1 with the sequence 28 revealed percent homology as 71.6% and hence the percent variance is 28.4% (Fig 21b). These comparisons revealed the persisting difference in sequence between SEQ ID 1 and other published Alu target sequences.

Examplell:

Presence of other Alu targets in the genomic DNA.

The two PCR products amplified from CHO genomic DNA by using the primers SEQ ID 25 and SEQ 26 and SEQ ID 29 and SEQ ID 30 are sequenced. Sequencing results revealed the presence of similar sequences with 5 % variation with that of the SEQ ID 27 (Fig 22a) and 21% variance with that of the SEQ ID 28 (Fig 22b). Example 12:

Comparison of primer characteristics:

The important parameters to be considered when selecting or designing PCR primers are the ability of the primer to form a stable duplex with the specific site on the target DNA, and no duplex formation with another primer molecule or no hybridization at any other target site. The primer stability can be measured in the length (base pairs), hairpin loop and dimer formation, the %GC, kcal mol (duplex formation free energy, AG) or in melting temperature. The qPCR primers seq ID nos 22 and 23 are compared with the other primers SEQ ID 25 and 26. See Table 4.

Table 4: Represents the comparison of primers parameters.

On analyzing the primer parameters, it is found that the SEQ ID 22 has higher molecular weight, % GC, Tm and larger negative value for AG. It is known from publications that larger negative value for AG indicates stable, undesirable structures of the primers which helps in efficient binding to the target. The possibility of dimer and hairpin loop formation for SEQ ID 23 is lesser than SEQ ID 22, 25 and 26. Higher dimer formation results in inefficient primer binding to the template and hence reduces the product yield. Hairpin loop formation at its 3' end is troublesome, since this can cause internal primer extension, thus eliminating a given primer from the reaction. Hairpins near the 5' end, however, do not significantly affect the PCR.

The % GC of the SEQ ID no 1 is 55.4% and the % GC of the other target SEQ ID 27 is 57.6%. According to the published data, typically, primers should have a % GC similar to or higher than that of the amplified template. In this case, the % GC of the seq ID no 22 is close to its respective target (i.e. SEQ ID 1) while in case of SEQ ID no 25 having % GC 45.5 is lesser than its respective target which is 57.6%. As illustrated in example 13 and table 5, the use of primer sequences Seq ID no 22 & 23 for qPCR method facilitates early amplification by ~3 cycles for all standard DNA dilution points with respect to the use of primer seq IDs 25 & Seq ID 26 for same quantity of template DNA. The early amplification is due to the cumulative effect of lower dimer & hairpin loop characteristics of Seq ID 22 & Seq ID 23 primer combination used for the method (Table 4). As a result, the self reactivity of the primers is low & affinity towards the template DNA is enhanced leading to optimum qPCR reaction results (Selection of primers for Polymerase Chain reaction". Methods in Molecuar Biology, Vol 15, Humana Press Inc). Based on these results and observations, primer pairs i.e., SEQ ID 22 and 23 used in present disclosure are better in most of the essential parameters discussed above than the SEQ ID 25 and 26.

Example 13:

SYBR Green qPCR:

qPCR SYBR green assay is performed with the primers SEQ ID 25 and SEQ ID 26 with the CHO genomic DNA as template. The standard DNA dilutions used are 2 ng, 200 pg, 20 pg, 2 pg and 1 pg. The first standard DNA dilution amplified at cycle number 15.61 (Table 5). On analysis of melting curve, it is observed that there exists high proportion of degeneracy in Alu equivalent target sequences for the compared primers SEQ ID 25 and 26. This is reflected by the irregular pattern of the melt curves for the standard DNA dilutions used in the assay (Fig 23 a).

Similarly qPCR SYBR green assay is performed with the primers SEQ ID 29 and 30 with the present disclosure CHO genomic DNA as template. The standard DNA dilutions used are 2 ng, 200 pg, 20 pg, 2 pg, and 1 pg. The first standard DNA dilution amplified at cycle number 15.39 (Table 5). On analysis of melting curve, it is observed that there exists high proportion of degeneracy in Alu equivalent target sequences designated as SEQ ID 28. This is reflected by the irregular pattern of the melt curves for the standard DNA dilutions used in the assay (Fig 23b).

Similar analysis of the primers SEQ ID 22 and SEQ ID 23 with the CHO genomic DNA is performed in SYBR green assay.

At similar dilution pattern starting with 2 ng to 20 fg, the first DNA dilution amplified at cycle number 12.47 which is ~3 cycles earlier than the amplification observed with either SEQ ID 25 and 26 or SEQ 29 and SEQ 30 (Table 5). This clearly establishes that the target existing in the genomic DNA is in higher proportion. The reactivity or efficiency of the primers SEQ ID 22 and 23 and the template ID 1 is comparatively higher with respect to the primers from published data. In addition, the melt curve analysis depicted that the extent of degeneracy in the target sequence is comparatively less than those existing in targets amplified using primers SEQ ID 25 and 26 and SEQ ID 29 and SEQ ID 30. This is indicated by a single narrow peak of melt curve obtained from all the standard DNA dilutions used in the assay (as shown in Fig 23c).

Table 5: Represents the comparison of cycle numbers of amplification of each dilution of standard DNA in standard curve with SEQ ID no 22 and 23 and other primers

Example 14:

Taqman probe qPCR assay:

Taqman probe assay is more specific compared to SYBR green as probe detects the target specifically on reacting to the complementary template strand.

Taqman probe with single degenerate base is designed for the SEQ ID 1.

The DNA dilutions are performed similar to the one mentioned in example number 7. The thermal cycling conditions have been defined in the example number 6. The concentration of the probe added to the qPCR cocktail is optimized to 100 nM and primer concentration used is 400 nM Forward and Reverse primer (SEQ ID 22 and 23 respectively) per reaction. The first DNA dilution (2 ng) in the standard curve amplifies at ~15 cycle number and the last DNA dilution (20 fg) of the standard curve amplifies at cycle number 32 (see Fig 24a and Table 6). The PCR efficiency is found to be 100.8% with correlation coefficient (R 2 ) for reaction being 1. Slope of the standard curve is -3.3 (Fig 24 b). Table 6: Represents the cycle numbers of amplification of each dilution of standard DNA in standard curve obtained by using the primers SEQ ID no 22 and SEQ ID 23 with its respective Taqman probe SEQ ID 24.

To evaluate the performance of the Taqman assay, different downstream purification stage samples (200μ1) were analysed for the detection of the host cell DNA. Table 7 represents the Ct values & mean quantities of standard DNA dilutions along with the biological samples (CHT4 US, 098US, C1089US, C1098US & C2089 US). The PCR efficiency was found to be 0.9 with a slope of -3.57 & R 2 being 1.

Table 7: Represents the Ct values & mean quantities of standard DNA dilutions with the analysed biological samples.

Example 15:

CHO Residual DNA Quantification kit:

The developed method can be used as a kit to detect the residual CHO DNA in the biotherapeutic/biopharmaceutical products. Using the Taqman technology, the limitations of the traditional methods can be overcome by enabling rapid, specific quantification of femtogram levels of CHO host cell DNA. Reliable assay performance and quantitative results can be obtained in two hours, allowing high-confidence for testing across broad range of sample types, from in-process samples to final drug substance for lot release. Components of the Taqman residual DNA kit comprises of:

Forward primer.

Reverse primer.

Taqman Probe.

The DNA to be estimated is extracted from the therapeutic proteins to overcome the interference caused by the proteins present using commercial extraction kit. The purified DNA is then analysed in the presence of the Taqman master mix, primers and probe.

Taqman master mix such as Absolute qPCR probe master mix from Abgenes, UNG Taqman master mix from Eurogentec and Platinum® Quantitative PCR SuperMix-UDG from Invitrogen can be used.

The Taqman CHO residual DNA detection kit provides a highly sensitive detection of CHO DNA, allowing the use of small sample volumes to generate accurate results. The broad linear range provided by Taqman technology allows testing of samples containing variable levels of CHO DNA, such as, as high as nanogram levels in in-process samples to as low as femtogram levels in drug substance. Fig. 26 and 27 demonstrates the range and sensitivity of the assay. SEQUENCE LISTING

<1 10> Biocon Limited <120> METHOD OF DETECTING RESIDUAL GENOMIC DNA AND A KIT THEREOF

<130> IP19417 <160> 30

<170> Patentln version 3.5

<210> 1

<21 1> 74

<212> DNA

<213> Cricetulus griseus <220>

<221> gene

<222> (1)..(74)

<400> 1

gctggagaga tggctcgagg ttaagagcac tggctgctct tccagaggtc ctgagttcaa 60 ttcccagcaa ccac 74 <210> 2

<211> 74

<212> DNA

<213> Cricetulus griseus

<220>

<221> gene

<222> (1)..(74) <400> 2

gctggagaga tggctcgagg ttaagagcac tgactgctct tccagaagtc ctgagttcaa 60 ttcccagcaa ccac 74 <210> 3

<21 1> 74

<212> DNA

<213> Cricetulus griseus

<220>

<221> gene

<222> (1)..(74)

<400> 3

gctggagaga tggctcgagg ttaagagcac tgactgctct tccagaggtc ctgagttcaa 60 ttcccagcaa ccac 74

<210> 4

<21 1> 74

<212> DNA

<213> Cricetulus griseus

<220>

<221> gene

<222> (1)..(74)

<400> 4

gctggagaga tggctcgagg ttaagagcac cgactgctct tccagaggtc ctgagttcaa 60 ttcccagcaa ccac 74

<210> 5

<21 1> 74

<212> DNA

<213> Cricetulus griseus

<220>

<221> gene

<222> (1)..(74)

<400> 5

gctggagaga tggctcgagg ttaagagcac tgactgctct tccagaggtc ctgagttcaa 60 ttcccagcaa ccac

<210> 6

<21 1> 74

<212> DNA

<213> Cricetulus griseus <220>

<221> gene

<222> (1)..(74)

<400> 6

gctggagaga tggctcgagg ttgagaacac tggctgctct tccagaggtc ctgagttcaa 60 ttcccagcaa ccac 74 <210> 7

<21 1> 20

<212> DNA

<213> Artificial Sequence <220>

<223> 5'-3' Forward qPCR primer

<220>

<221> prim transcript

<222> (1)..(20)

<400> 7

tggagagatg gctcgaggtt 20

<210> 8

<21 1> 21

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'-3' Reverse qPCR primer <220>

<221> prim transcript

<222> (1)..(21)

<400> 8

tggttgctgg gaattgaact c 21

<210> 9

<21 1> 71

<212> DNA

<213> Cricetulus griseus <220>

<221> gene

<222> (1)..(71)

<223> Alu equivalent consensus sequence <400> 9

tggagagatg gctcgaggtt aagagcactg gctgctcttc cagaggtcct gagttcaatt 60 cccagcaacc a 71

<210> 10

<21 1> 22

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'FAM-3'BHQ qPCR Probe

<220>

<221> misc binding

<222> (1)..(22)

<223> 6-carboxyfluorescein (FAM) at the 5' end and 1 -Black hole quencher at the 3' end

<400> 10

tggcttgctc ttccagaggt cc 22

<210> 11 <21 1> 19

<212> D A

<213> Artificial Sequence <220>

<223> 5'-3' Forward qPCR primer

<220>

<221> prim transcript

<222> (1)..(19)

<400> 1 1

gagatggctc gaggttaag 19

<210> 12

<21 1> 18

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'-3' Reverse qPCR primer

<220>

<400> 12

tggttgctgg gaattgaa 18

<210> 13

<21 1> 18

<212> DNA

<213> Artificial Sequence <220>

<223> 5TAM-3'BHQ qPCR Probe

<220>

<221> misc binding

<222> (1)..(18) <223> 6-carboxyfluorescein (FAM) at the 5' end and 1 -Black hole quencher at the 3' end

<400> 13

actggctgct tgctcttc 18

<210> 14

<21 1> 19

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'-3' Forward qPCR primer

<220>

<221> prim transcript

<222> (1)..(19)

<400> 14

tctctaccga gctccaatt 19 <210> 15

<21 1> 19

<212> DNA

<213> Artificial Sequence <220>

<223> 5'-3' Reverse qPCR primer

<220>

<221> prim_transcript

<222> (1)..(19)

<400> 15

accaacgacc cttaacttg

<210> 16

<21 1> 15

<212> DNA

<213> Artificial Sequence <220>

<223> 5'FAM-3'BHQ qPCR Probe

<220>

<221> misc binding

<222> (1)..(15)

<223> 6-carboxyfluorescein (FAM) at the 5' end and 1 -Black hole quencher at the 3' end

<400> 16

cttctcgtcg gtcac 15

<210> 17

<21 1> 22

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'-3' Reverse qPCR primer

<220>

<221> prim transcript

<222> (1)..(22)

<400> 17

ctgcatgtat ccctgcaggc ca

<210> 18

<21 1> 27

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'-3' Reverse qPCR primer

<220>

<221> prim transcript

<222> (1)..(27) <400> 18

ggcacatact ttgaagggtt taaaggc 27

<210> 19

<21 1> 21

<212> DNA

<213> Artificial Sequence <220>

<223> 5'-3' Forward qPCR

<220>

<221> prim transcript

<222> (1)..(21)

<400> 19

gctggagaga tggctcgagg t

<210> 20

<21 1> 24

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'-3' Reverse qPCR primer

<220>

<400> 20

accatgtggt tgctgggaat tgaa 24

<210> 21

<21 1> 20

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'FAM-3'BHQ Degenerate qPCR Probe <220>

<221> misc binding

<222> (1)..(20)

<223> 6-carboxyfluorescein (FAM) at the 5' end and 1 -Black hole quencher at the 3' end

<400> 21

arcacygrct gctcttccag 20

<210> 22

<21 1> 23

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'-3' Forward qPCR primer

<220>

<221> prim transcript

<222> (1)..(23)

<400> 22

gctggagaga tggctcgagg tta 23 <210> 23

<21 1> 22

<212> DNA

<213> Artificial Sequence <220>

<223> 5'-3' Reverse qPCR primer

<220>

<221> prim transcript

<222> (1)..(22)

<400> 23

gtggttgctg ggaattgaac tc 22 <210> 24

<21 1> 21

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'FAM-3'BHQ Degenerate qPCR Probe

<220>

<221> misc binding

<222> (1)..(21)

<223> 6-carboxyfluorescein (FAM) at the 5' end and 1 -Black hole quencher at the 3' end

<400> 24

ctgctcttcc agargtcctg a 21

<210> 25

<21 1> 22

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'-3' Forward primer

<220>

<221> prim transcript

<222> (1)..(22)

<400> 25

ttggtggcac acacctttaa tc

<210> 26

<21 1> 23

<212> DNA

<213> Artificial Sequence <220>

<223> 5'-3' Reverse primer US 12/476101 <220>

<221> prim transcript

<222> (1)..(23)

<400> 26

gctggctctt gaactcacag ag <210> 27

<21 1> 80

<212> DNA

<213> Artificial Sequence <220>

<223> Alu sequence US 12/476101.

<220>

<221> gene

<222> (1)..(80)

<400> 27

cattggtggc acacaccttt aatcccagca ctcgggaggc agaggcaggt ggatctctgt gagttcaaga gccagcctgg 80

<210> 28

<21 1> 134

<212> DNA

<213> Artificial Sequence <220>

<223> Alu sequence US990300

<220>

<221> gene

<222> (1)..(134)

<400> 28

ccaggcattg gtggcacaca cctttagtcc cagcactcag gaggcagagg caggaggatc 60 acttgagttc aagagccagc ctggtctacc agagttcctg agttcaagcc aggctataca 120 gagaaaccct gtct 134 <210> 29

<21 1> 17

<212> DNA

<213> Artificial Sequence

<220>

<223> 5'-3' Forward primer US990300

<220>

<221> prim transcript

<222> (1)..(17)

<400> 29

ccaggcattg gtggcac

<210> 30

<21 1> 17

<212> DNA

<213> Artificial Sequence <220>

<223> 5'-3' Reverse primer US990300

<220>

<221> prim transcript

<222> (1)..(17)

<400> 30

agacagggtt tctctgt 17