Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHOD AND PRODUCTS FOR PRODUCING RNA MOLECULES
Document Type and Number:
WIPO Patent Application WO/2020/193610
Kind Code:
A1
Abstract:
The present invention relates to a method for producing RNA molecules that uses chimeric oligonucleotides to guide site-specific enzymatic (RNase H) cleavage of RNA transcripts (produced by run-off in vitro transcription) comprising a plurality of tandem repeats of the RNA molecule to be produced, to generate a plurality of copies of the RNA molecule.

Inventors:
FEYRER HANNES (SE)
PETZOLD KATJA (SE)
BARONTI LORENZO (SE)
Application Number:
PCT/EP2020/058310
Publication Date:
October 01, 2020
Filing Date:
March 25, 2020
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
FEYRER HANNES (SE)
PETZOLD KATJA (SE)
BARONTI LORENZO (SE)
International Classes:
C12Q1/6806; C12P19/34
Domestic Patent References:
WO2014134029A12014-09-04
WO2004058989A22004-07-15
Foreign References:
US4336173A1982-06-22
Other References:
J LAPHAM ET AL: "RNase H cleavage for processing of in vitro transcribed RNA for NMR studies and RNA ligation", RNA, vol. 2, no. 3, 1 March 1996 (1996-03-01), US, pages 289 - 296, XP055352700, ISSN: 1355-8382
ANGEW. CHEM. INT. ED., vol. 40, no. 11, 2001, pages 2004 - 2021
PHARM RES., vol. 25, no. 10, 2008, pages 2216 - 2230
CHEM. COMMUN., vol. 47, 2011, pages 6257 - 6259
NATURE, vol. 519, no. 7544, 2015, pages 486 - 90
Attorney, Agent or Firm:
DEHNS (GB)
Download PDF:
Claims:
Claims

1. A method for producing a plurality of copies of an RNA molecule, said method comprising:

(a) providing a DNA molecule comprising a promoter sequence for an RNA polymerase and a sequence encoding a plurality of tandem repeats of the RNA molecule to be produced under the control of the promoter sequence;

(b) transcribing the DNA molecule of (a) using an RNA polymerase to produce RNA transcripts containing a plurality of tandem repeats of the RNA molecule to be produced;

(c) hybridizing the RNA transcripts produced in (b) to DNA oligonucleotides comprising 2’-0-methylated ribonucleotides to produce partially double stranded RNA: DNA duplexes, wherein the DNA oligonucleotides guide enzymatic cleavage of the RNA transcripts at the ends of the tandem repeats of the RNA molecule to be produced; and

(d) cleaving the RNA strand of partially double stranded RNA: DNA duplexes of (c) with RNase H, to release the plurality of copies of the RNA molecule.

2. The method of claim 1 , wherein steps (a) to (d) are conducted in a single reaction vessel.

3. The method of claim 1 or 2, wherein the RNA polymerase is T7 RNA polymerase and the promoter sequence is a T7 promoter.

4. The method of any one of claims 1 to 3, wherein the RNaseH is E.coli RNase H.

5. The method of any one of claims 1 to 4, wherein the DNA molecule comprises a sequence encoding at least 4 tandem repeats of the RNA molecule to be produced.

6. The method of any one of claims 1 to 5, wherein the DNA molecule is at least a partially double stranded DNA molecule, wherein at least the promoter sequence is double stranded.

7. The method of any one of claims 1 to 6, wherein the DNA molecule is linear.

8. The method of any one of claims 1 to 7, wherein the DNA

oligonucleotides comprise at least eight nucleotides.

9. The method of any one of claims 1 to 8, wherein the DNA

oligonucleotides comprise four consecutive conventional nucleotides bordered by 2’-0-methylated ribonucleotides.

10. The method of any one of claims 1 to 9, wherein the DNA

oligonucleotides comprise at least four 2’-0-methylated ribonucleotides.

11. The method of any one of claims 1 to 10, wherein step (c) comprises rapidly heating the RNA transcripts to disrupt secondary structures in the RNA.

12. The method of claim 11 , wherein the step of rapidly heating the RNA transcripts uses microwaves.

13. The method of any one of claims 1 to 12, wherein the volume of the reaction mixture is at least 5ml_.

14. The method of any one of claim 1 to 13 further comprising a step of isolating the plurality of copies of the RNA molecule.

15. The method of claim 14, wherein the step of isolating the plurality of copies of the RNA molecule comprises HPLC purification.

16. The method of claim 15, wherein the HPLC fractions containing the plurality of copies of the RNA molecule are pooled and optionally concentrated.

17. The method of any one of claims 1 to 16 further comprising a step of conjugating or linking the RNA molecules to another molecule or component.

18. The method of any one of claims 1 to 17 further comprising a step of formulating the RNA molecules with a pharmacologically acceptable excipient to produce a pharmaceutical composition.

19. Use of: (i) a DNA molecule comprising a promoter sequence for an RNA polymerase and a sequence encoding a plurality of tandem repeats of an RNA molecule under the control of the promoter sequence; and (ii) DNA oligonucleotides comprising 2’-0-methylated ribonucleotides, wherein the DNA oligonucleotides guide enzymatic cleavage of RNA transcripts produced from the DNA molecule at the ends of the tandem repeats of the RNA molecule, in the production of a plurality of copies of the RNA molecule.

20. A kit for use in the method of any one of claims 1 to 18 comprising:

(i) a DNA molecule comprising a promoter sequence for an RNA

polymerase and a sequence encoding a plurality of tandem repeats of an RNA molecule under the control of the promoter sequence;

(ii) DNA oligonucleotides comprising 2’-0-methylated ribonucleotides, wherein the DNA oligonucleotides guide enzymatic cleavage of RNA transcripts produced from the DNA molecule at the ends of the tandem repeats of the RNA molecule,

and optionally

(iii) an RNA polymerase; and/or

(iv) an RNase H enzyme.

21. A reaction mixture comprising:

(i) a DNA molecule comprising a promoter sequence for an RNA

polymerase and a sequence encoding a plurality of tandem repeats of an RNA molecule under the control of the promoter sequence;

(ii) an RNA polymerase;

(iii) DNA oligonucleotides comprising 2’-0-methylated ribonucleotides, wherein the DNA oligonucleotides guide enzymatic cleavage of RNA transcripts produced from the DNA molecule at the ends of the tandem repeats of the RNA; and optionally

(iv) an RNase H enzyme.

22. An RNA molecule obtained by the method of any one of claims 1 to 17.

23. A library comprising a plurality of RNA molecules obtained by the method of any one of claims 1 to 17.

24. A pharmaceutical composition comprising the RNA molecule of claim 22 together with a pharmacologically acceptable excipient. 25. The method, use, kit, reaction mixture, RNA molecule, library or pharmaceutical composition of any preceding claim, wherein the RNA molecule encodes a protein, peptide, tRNA, rRNA, viral RNA, small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), microRNA (miRNA), small interfering RNA (siRNA), piwi-interacting RNA (piRNA), ribozymal RNA, antisense RNA, CRISPR/Cas9 guide RNA or non-coding RNA.

Description:
Method and products for producing RNA molecules

The present invention relates to a method for producing RNA molecules. In particular, the invention provides a method that utilises chimeric oligonucleotides to guide site-specific enzymatic (RNase H) cleavage of RNA transcripts (produced by run-off in vitro transcription) comprising plurality of tandem repeats of the RNA molecule to be produced, to generate a plurality of copies of the RNA molecule. Advantageously, the method can be worked on a large scale and in a single reaction vessel to facilitate large yields of the desired RNA molecule. Thus, the invention also provides a reaction mixture for producing the RNA molecules and a kit for use in the method of the invention. RNA molecules obtained by the method and a library comprising a plurality of RNA molecules obtained by the method are also provided. Uses of the RNA molecules obtained by the method, e.g. in therapy, and pharmaceutical compositions comprising the RNA molecules are also contemplated.

RNA molecules are used widely in research and are increasingly being used in therapy. At present, synthetic RNA molecules predominate in basic research and drug development and these molecules commonly are conjugated with extensive and various modifications. However, such modifications make them completely different from cellular RNAs transcribed from the genome that usually consist of unmodified ribonucleotides or just contain a few posttranscriptional modifications. Thus, the structure and characteristics of synthetic RNA molecules or mimics is not necessarily compatible with many of their proposed utilities in RNA research and drug development.

The industrial method of choice for RNA production is solid-phase synthesis. While this is at least possible for small molecules, e.g. oligonucleotides, it is expensive and requires labour intensive post-synthesis treatment, such as purification. Moreover, practical issues attributable to remaining impurities in so- called pure RNA molecules can still be observed. For instance, residual reagents from solid-phase synthesis can lead to immune responses in patients meaning that many synthesised RNA molecules cannot be used in their proposed therapeutic utilities. For instance, RNA molecules have been proposed for use in therapy by interacting or interfering with miRNA/siRNA pathways. However, in these pathways the RNA molecules are completely enveloped by the acting proteins and

modifications to the RNA molecules can prevent their activity. Enzymatic production of RNAs can be achieved on a small scale using an RNA polymerase, such as the T7 RNA polymerase, in so-called“run-off” transcription. Whilst the T7 RNA polymerase performs well in vitro its utility is hampered due to its tendency to produce heterogeneous transcript lengths. For most applications (e.g. siRNA/miRNA therapy, structural biology) alternative length products cannot be tolerated, thereby limiting the utility of current enzymatic production methods. Furthermore, the purity and yield of enzymatically-produced transcripts are sequence-dependent. Thus, current methods need to be optimized based on the sequence of the molecule to be produced, which can be time- consuming, labour intensive and expensive, especially for ill-performing sequences. Accordingly, enzymatic methods of in vitro RNA production have so far not been feasible for industrial application.

In view of the issues with current methods for the production of RNA molecules, there is a desire for alternative methods of producing RNA molecules, particularly cost-efficient methods capable of producing homogeneous molecules with greater accuracy and on a large scale.

The present inventors have surprisingly determined that it is possible to produce RNA molecules suitable for use as therapeutic RNA drugs (e.g.

homogeneous products) using commonly used and well-characterised enzymes, even enzymes that have a tendency to produce heterogeneous transcript lengths, such as T7 RNA polymerase. Moreover, the method described herein can readily be used for large-scale production of RNA molecules and is significantly cheaper than solid-phase synthesis, even for short molecules, such as those used in RNA inference applications, e.g. siRNAs or microRNAs consisting of 20-25 nucleotides. Furthermore, the method avoids problems associated with residual reagents associated with RNA molecules produced by solid-phase synthesis.

In a representative example, the method first involves the design and preparation of a DNA template comprising a promoter sequence for an RNA polymerase (e.g. T7 RNA polymerase) and a sequence encoding a plurality of tandem repeats of the desired RNA molecule, which is under the control of the promoter sequence (i.e. operably linked to the promoter sequence). The DNA template is used to synthesise RNA transcripts using the DNA template and an RNA polymerase that recognises the promoter sequence, such that each of the RNA transcripts contains a plurality of tandem repeats of the desired RNA molecule. Chimeric oligonucleotide splints (comprising a mixture of conventional deoxyribonucleotides and 2’-0-methyl ribonucleotides) that are complementary to portions of the RNA transcripts are hybridized to the RNA transcripts to produce duplexes (RNA:splint duplexes). The duplexes are formed at the junctions between the tandem repeats of the desired RNA molecule to create site-specific cleavage domains in the partially double stranded RNA transcripts. The chimeric

oligonucleotides guide RNase H to cleave the RNA transcripts at the ends of the tandem repeats of the desired RNA molecule, thereby releasing a plurality of copies of the desired RNA molecule from each transcript. The RNA molecules can be isolated from the uncleaved chimeric oligonucleotides, e.g. using HPLC purification, whereby the RNA molecules may be used in various utilities and the chimeric oligonucleotides may be re-used in the method. A schematic of this representative example is shown in Figure 1.

Accordingly, at its broadest, the invention can be seen to provide the use of (i) a DNA molecule comprising a promoter sequence for an RNA polymerase and a sequence encoding a plurality of tandem repeats of an RNA molecule under the control of the promoter sequence; and (ii) DNA oligonucleotides comprising 2’-0- methylated ribonucleotides, wherein the DNA oligonucleotides guide enzymatic cleavage (RNase H cleavage) of RNA transcripts produced from the DNA molecule at the ends of the tandem repeats of the RNA molecule, in the production of a plurality of copies of the RNA molecule.

Alternatively viewed, the invention provides a method for producing a plurality of copies of an RNA molecule, said method comprising:

(a) providing a DNA molecule comprising a promoter sequence for an RNA polymerase and a sequence encoding a plurality of tandem repeats of the RNA molecule to be produced under the control of the promoter sequence;

(b) transcribing the DNA molecule of (a) using an RNA polymerase to produce RNA transcripts containing a plurality of tandem repeats of the RNA molecule to be produced;

(c) hybridizing the RNA transcripts produced in (b) to DNA oligonucleotides comprising 2’-0-methylated ribonucleotides to produce partially double stranded RNA: DNA duplexes, wherein the DNA oligonucleotides guide enzymatic cleavage of the RNA transcripts at the ends of the tandem repeats of the RNA molecule to be produced; and

(d) cleaving the RNA strand of partially double stranded RNA: DNA duplexes of (c) with RNase H, to release the plurality of copies of the RNA molecule. The term“RNA” or“RNA molecule” refers to a polymer of“conventional” ribonucleotides, i.e. ribonucleotides found in nature: ATP, GTP, CTP and UTP. A “desired RNA molecule” or“RNA molecule to be produced” refers to an RNA molecule produced by the method of the invention.

The present invention may be used to produce RNA molecules of any desired length. The methods may be used to produce RNA molecules that far exceed the length of RNA molecules that can be accurately produced using solid- phase synthesis methods. For instance, an RNA molecule produced by the method of the invention may be between about 10 up to about 10000 nucleotides in length. Thus, in some embodiments, the method may be viewed as the production of RNA molecules that are oligonucleotides (oligoribonucleotides). In other embodiments, the method may be viewed as the production of RNA molecules that are

polynucleotides (polyribonucleotides). In this respect, the boundary between the size of an“oligonucleotide” and“polynucleotide” is not well-defined in the art. For instance, a sequence of more than 400 nucleotides may be termed a

polynucleotide. Accordingly, the terms oligonucleotide and polynucleotide are used interchangeably herein to refer to nucleotide sequences (RNA molecules) within the size range specified above. However, where the term“polynucleotide” is used, it will typically refer to RNA molecules containing more than 400 nucleotides.

In some embodiments, the RNA molecule may be from about 10 to about 750 nucleotides in length, including from about 10 to about 500 nucleotides in length, e.g., from about 10 to about 450 nucleotides in length, such as from about 12 to about 400 nucleotides in length, from about 12 to about 300 nucleotides in length, from about 12 to about 250 nucleotides in length, from about 15 to about 200 nucleotides in length, from about 15 to about 150 nucleotides in length, from about 18 to about 100 nucleotides in length, from about 18 to about 75 nucleotides in length, from about 20 to about 70 nucleotides in length, from about 20 to about 60 nucleotides in length, and so on. In some embodiments, the RNA molecule contains about 10-400, 11-390, 12-380, 13-370, 14-360 or 15-350 nucleotides.

In some embodiments, the method finds utility in the production of RNA molecules for use in RNA interference applications, e.g. siRNAs and microRNAs. Accordingly, in some embodiments, the RNA molecule may contain about 20-50 nucleotides, such as at least about 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29 or 30 nucleotides. The invention may find utility in the effective production of longer RNA molecules, e.g. comprising about 30 or more nucleotides, such as about 35, 40, 45, 50, 60, 70, 80, 90, 100 or more nucleotides. For instance, the RNA molecules produced by the invention may contain about 30-1000, 40-900, 50-800, 60-700, 70- 600, 80-500, 90-450 or 100-400 nucleotides. In some embodiments, the RNA molecules (e.g. polynucleotides) produced by the invention may contain about 400- 10000, 500-9000, 600-8000, 700-7000, 800-6000, 900-5000 or 1000-4000 nucleotides, e.g. comprising about 500, 1000, 1500, 2000, 2500, 3000, 3500 or more nucleotides.

The method of the invention can also be used to produce RNA molecules of any desired sequence. In this respect, the method of the invention overcomes problems associated with the need to optimise in vitro transcription conditions for a particular sequence. As the method utilises a DNA template encoding a plurality of tandem repeats of the desired RNA molecule, even templates that are poorly transcribed, i.e. transcribed at low rates or yields, will provide sufficient copies of the RNA molecule to yield a useful amount of RNA molecules when the transcripts are cleaved. Moreover, as the method readily can be worked at large scales using cheap and commonly available reagents, the scale of the reaction can be adjusted to increase the yield of the reaction to obtain useful amounts of the desired RNA molecule without the need to optimise the in vitro transcription reaction. The ability to conduct the method of the invention in a single reaction vessel further assists to maximise the yield of the desired RNA molecule as products are not lost in intermediate purification steps.

It will be understood that the length and sequence of the desired RNA molecule produced by the method of the invention is dependent on being able to design an oligonucleotide splint suitable to guide the site-specific cleavage of the RNA transcripts to yield the desired RNA molecule, i.e. without cleaving the RNA transcripts within the RNA molecule. Thus, it is required that the desired RNA molecule does not contain internal cleavage sites, i.e. sites that are complementary to the oligonucleotide splints used in the hybridizing and cleavage steps.

In particular, it is required that the combined sequence consisting of the 3’ end of the desired RNA molecule (e.g. the terminal 4-10 nucleotides, e.g. about 4,

5, 6, 7 or 8 nucleotides) directly adjacent to the 5’ end of the desired RNA molecule (e.g. the terminal 4-10 nucleotides, e.g. about 4, 5, 6, 7 or 8 nucleotides) is not repeated elsewhere in the desired RNA molecule. In some instances, it may be possible to modify or mutate the sequence of the RNA molecule (e.g. by modifying or mutating the DNA template encoding the RNA molecule) to generate a sequence that has the functional properties of the desired RNA molecule but does not contain internal sequences that would be cleaved in the cleavage step. For instance, where the RNA molecule encodes a protein or peptide, a codon may be changed to remove an internal cleavage site whilst maintaining the amino acid encoded by the codon due to codon degeneracy. Similarly, a codon may be changed to remove an internal cleavage site such that the encoded protein contains a conservative amino acid substitution that does not substantially alter the function of the protein.

Thus, the method of the invention may be used to produce RNA molecules encoding a protein, peptide, tRNA, rRNA, viral RNA, small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), microRNA (miRNA), small interfering RNA (siRNA), piwi-interacting RNA (piRNA), ribozymal RNA, antisense RNA, CRISPR/Cas9 guide RNA or non-coding RNA. In some embodiments, the RNA molecule is a miRNA. In some embodiments, the RNA encodes a protein or peptide. In some embodiments, the protein or peptide is a therapeutic protein or peptide.

The DNA molecule of the invention comprises: (i) a promoter sequence for an RNA polymerase; and (ii) a sequence encoding a plurality of tandem repeats of the RNA molecule to be produced (the desired RNA molecule) which is under the control of the promoter sequence of (i). Thus, the DNA molecule for use in the method of the invention may be viewed as, and must be suitable for use as, a“DNA template” for in vitro transcription by an RNA polymerase, i.e. an RNA transcription template. Thus, the terms“DNA molecule”,“DNA template” and“template” are used interchangeably herein.

The promoter sequence of (i) and encoding sequence of (ii) must be arranged such that contacting the DNA molecule with an RNA polymerase that recognises the promoter sequence under conditions suitable for transcription results in the template dependent production of an RNA molecule (“transcript”) comprising a plurality of tandem repeats of the RNA molecule to be produced.

Thus, the promoter sequence and encoding sequence may be viewed as being “operably linked”, which term refers to the association or arrangement of sequences in a nucleic acid molecule such that the function of one sequence (i.e. the encoding sequence) is regulated by the other (i.e. the promoter sequence). In other words, the promoter sequence promotes or induces RNA polymerase directed transcription of the encoding sequence of the DNA template. ln some embodiments, the DNA template may comprise an“optimal initiation site” located at the 3’ end of the promoter sequence (or alternatively viewed at the 5’ end of the encoding sequence). For instance, some RNA polymerases, such as T7 RNA polymerase, may be less efficient at initiating transcription using the 5’ end of the sequence encoding the desired RNA molecule. Accordingly, it may be advantageous to insert an additional sequence at the start of the encoding sequence that is an optimal initiation site for the RNA polymerase. Such optimal initiate sites are known in the art. For instance, T7 RNA polymerase is particularly effective at initiating transcription when the encoding sequence starts with a poly-dG sequence, e.g. GGG. Thus, in some embodiments, the DNA molecule comprises a poly-dG sequence, such as GGG, at the start of the encoding sequence (at the 5’ end of the encoding sequence). In some embodiments, the optimal initiate site is GGGAGA (SEQ ID NO: 1).

In some embodiments, the tandem repeats encoded by the DNA template may be bordered on one or both sides by“spacer” sequences. For instance, a spacer sequence may be included between the optimal initiation site and the first tandem repeat to facilitate the removal of the optimal initiation site following cleavage of the RNA transcripts. Additionally or alternatively, a spacer sequence may be included at 3’ end of the tandem repeats to avoid the production of heterogeneous RNA molecules from the final tandem repeat.

Spacer sequences may be of any suitable length and sequence. Preferably, the spacer sequences result in cleavage products that are a different size to the desired RNA molecules such that they can be readily separated from the RNA molecules in downstream purification steps. In a representative embodiment, the spacer sequences comprise at least about 4 nucleotides, e.g. 4-100, 5-90, 6-80, 7- 70, 8-60, 9-50 or 10-40 nucleotides.

The term“transcription” refers to the DNA template dependent production (synthesis) of an RNA molecule using an RNA polymerase enzyme to a polymerise ribonucleotides thereby synthesising an RNA molecule that is complementary to the DNA strand used to template the polymerisation. Thus, a“transcript” is an RNA molecule produced from a DNA templated RNA polymerase reaction. The term“in vitro" transcription refers to a transcription reaction that occurs in solution outside a cell, e.g. in a reaction vessel.

The DNA molecule used in the method of the invention may be in any suitable form to function efficiently as a template for an in vitro transcription reaction. In this respect, for in vitro transcription to be effective and efficient, the RNA polymerase must bind with high affinity to the promoter sequence in the DNA molecule (template). Typically, RNA polymerase enzymes bind more efficiently to double stranded promoter sequences. For instance, the T7 RNA polymerase has been shown to bind to double stranded DNA comprising its promoter sequence (T AAT ACG ACT CACT ATA, SEQ ID NO: 2) with an K d of about 3.4-20nM. Thus, in some embodiments, the DNA molecule used in the method of the invention is at least partially double stranded, wherein at least the promoter sequence is double stranded. In preferred embodiments, the DNA molecule is fully double stranded.

The term“high affinity” refers to RNA polymerases that bind to their respective promoter sequence with a K d of about 5x1 O 7 M or less, such about 1x10 7 , 5x1 O 8 , 4x1 O 8 , 3x1 O 8 , 2x1 O 8 or 1x10 8 M or less.

The DNA molecule (template) provided in step (a) of the present method may be produced by any suitable means and a variety of means are well-known in the art e.g. assembly PCR, solid-phase synthesis. Accordingly, the method of the present invention may comprise additional steps of producing the DNA molecule (template) before step (a) of providing the DNA molecule.

The means used to produce the DNA molecule may depend on the length of the molecule, e.g. the number of tandem repeats encoding the desired RNA molecule and the length of each repeat (i.e. the length of the RNA molecule to be produced). For instance, if the DNA molecule contains a relatively small number of repeats (e.g. 10 or less, such as about 9, 8, 7, 6, 5 or 4) of an oligonucleotide sequence (e.g. comprising about 30 or fewer nucleotides, such as about 10-30, 11- 29, 12-28, 13-27, 14-26, 15-25 nucleotides), it may be cost-effective to produce the DNA molecule using solid-phase synthesis, i.e. if the DNA molecule contains 200 or fewer nucleotides. However, if the DNA molecule contains a large number of repeats (e.g. 11 or more repeats, such as about 12, 13, 14, 15, 20, 25, 30 or more repeats) and/or each repeat is large (e.g. about 30 nucleotides or more, e.g. about 40, 50, 60, 70, 80, 90, 100 nucleotides or more), it may be more cost-effective to use other means of producing the DNA molecule, e.g. assembly PCR.

Once the DNA molecule is produced, it may be necessary or advantageous to make the molecule double stranded and/or to increase the number of copies of the molecule (i.e. amplify the molecule) to produce sufficient template for the in vitro transcription reaction. Suitable means for producing and amplifying double stranded DNA molecules are well-known in the art, e.g. PCR, and any such means may be used in the invention.

It is well-known in the art that PCR amplification can introduce errors into the amplified molecules, which may impact on the accuracy of the RNA molecule produced in the method of the invention. Thus, in embodiments where the DNA molecule is amplified by PCR in order to provide the DNA molecule of according to step (a), it is preferred that the PCR utilises a high fidelity DNA polymerase, such as Pfu DNA polymerase or Pwo DNA polymerase. Numerous high fidelity DNA polymerases are known in the art and may be used in the invention.

In some embodiments is may be advantageous to insert the DNA molecule (template) into a plasmid, which can be replicated in bacteria, such as E.coli.

Suitable plasmid sequences are well known in the art. This allows the sequence of the DNA molecule (template) to be checked (e.g. by sequencing) and for any errors in the sequence to be corrected, e.g. via iterations of sequencing and mutagenesis using any suitable methods.

The insertion of the DNA molecule (template) into a plasmid also facilitates the generation of a significant number of copies of the DNA molecule. For instance, a single bacterial colony containing a plasmid comprising the sequence-verified DNA molecule (template) may be grown to amplify the plasmid, which may be subsequently purified from the bacteria using any suitable means known in the art.

In some embodiments, the DNA molecule (template) may be excised from the plasmid, e.g. using cleavage enzymes. The excised linear DNA molecule may be purified, e.g. using PAGE and gel extraction, or other suitable methods, to provide the DNA molecule of step (a). Thus, in some embodiments, the DNA molecule in the plasmid is bordered by cleavage domains or sites (e.g. restriction endonuclease recognition sites) to enable the excision of the DNA molecule.

In some embodiments, the plasmid containing the DNA molecule (template) may be linearized, e.g. using a restriction enzyme. The linearized plasmid may be used directly in the in vitro transcription reaction, i.e. the DNA molecule provided in step (a) may be a linearized plasmid.

Thus, in some embodiments, the DNA molecule provided in step (a) of the method is a linear DNA molecule.

In embodiments that involve excision of the DNA molecule from the plasmid or linearization of the plasmid it is preferable to cleave the plasmid directly adjacent to the end of the final repeat of the sequence encoding the desired RNA molecule, such that the RNA transcripts produced in the in vitro transcription reaction of step (b) do not contain additional sequences at the 3’ end. However, in some

embodiments, the cleavage site or cleavage domain may be downstream of the encoding sequence. The distance between the cleavage site and the end of the encoding sequence may depend on the length of the desired RNA molecule and a suitable distance can be determined by the skilled person. In this respect, the distance must result in an RNA molecule that may readily be separated from the desired RNA molecule upon cleavage of the RNA transcripts in step (d). In some representative embodiments, cleavage may occur within 20 nucleotides or less of the 3’ end of the encoding sequence, e.g. 15, 12, 10, 9, 8, 7, 6, 5 nucleotides or less of the 3’ end of the encoding sequence, such as within 1 , 2, 3 or 4 nucleotides of the encoding sequence. It will be evident that the cleavage site upstream of the DNA molecule in the plasmid (i.e. upstream (5’ of) the RNA polymerase promoter sequence) may be any distance from the 5’ end of the promoter sequence as the intervening sequence will not be transcribed.

In some embodiments, the DNA molecule may be a circular molecule. For instance, the plasmid in which the DNA molecule is inserted may be used directly in the RNA transcription reaction. Alternatively, the purified linear DNA molecule excised from the plasmid could be re-circularised using a suitable ligase enzyme, such as T4 ligase, to form a circular DNA molecule to be provided in step (a) of the present method. In embodiments in which the DNA molecule is a circular molecule, it may be advantageous to include a transcription terminator sequence in the DNA molecule downstream of the encoding sequence, e.g. to prevent transcription of the whole plasmid. Suitable transcription terminator sequences are known in the art.

In addition to amplification, the process of transfecting the DNA molecule (template) containing plasmid into bacteria also allows a bacterial glycerol stock to be produced. Bacteria comprising the DNA molecule (template) plasmid can be prepared in glycerol, frozen and stored stably for long periods of time.

Accordingly, in some embodiments step (a) of the present method comprises:

(i) cloning into a DNA plasmid a linear DNA molecule comprising a promoter sequence for an RNA polymerase and a sequence encoding a plurality of tandem repeats of the desired RNA molecule under the control of the promoter sequence;

(ii) amplifying the plasmid; and (iii) excising part of the plasmid containing the DNA molecule or cleaving the plasmid downstream (3’ of) the sequence encoding a plurality of tandem repeats of the desired RNA molecule.

In some embodiments, step (ii) comprises transfecting said DNA plasmid into bacteria and growing the bacteria and optionally isolating the plasmid.

In some embodiments, step (iii) further comprises a step of isolating the excised DNA molecule.

Thus, in some embodiments, the DNA molecule (template) comprises a 5’ end region and a 3’ end region each comprising a cleavage domain and wherein step (iii) comprises cleaving the cleavage domains in the end regions with a cleavage enzyme.

In some embodiments, the method may comprise a further step of (iv) circularising the excised DNA molecule obtained in step (iii).

Any suitable cleavage enzyme as defined herein may be used to excise the DNA molecule (template) from the plasmid. The cleavage enzyme may be selected to ensure that it does not cleave within the DNA molecule, i.e. within the promoter sequence, optimal initiation sequence (if present) and encoding sequence.

The promoter sequence in the DNA molecule (template) provided in step (a) may be selected based on the RNA polymerase used in the method. Any suitable RNA polymerase and cognate promoter sequence may be used in the method. In some embodiments, the promoter sequence is a promoter sequence suitable for use with any one of T7 RNA polymerase, T3 RNA polymerase, SP6 RNA polymerase or E.coli RNA polymerase. In some embodiments, the promoter sequence is suitable for use with T7 RNA polymerase. In some preferred embodiments, the promoter sequence comprises a nucleotide sequence as set forth in SEQ ID NO: 2. In particularly preferred embodiments, DNA molecule comprises a nucleotide sequence as set forth in SEQ ID NO: 2 and a nucleotide sequence as set forth in SEQ ID NO: 1. For instance, in some embodiments, the promoter sequence of SEQ ID NO: 2 is operably linked to the optimal initiation sequence of SEQ ID NO: 1 , which is further operably linked to the sequence encoding a plurality of tandem repeats of the desired RNA molecule.

The DNA molecule (template) comprises a sequence encoding a plurality of tandem repeats of the RNA molecule to be produced (the desired RNA molecule). The term“tandem repeat” refers to repeated sequences within a DNA molecule in which the repeated sequences are directly adjacent to each other, i.e. there are no intervening nucleotides between the end of one repeat and the start of the next repeat.

As used herein, the term“plurality” means two or more, e.g. at least 2, 3, 4,

5, 6, 7, 8, 9, 10, 20 or 30 or more, such as 50, 100, 150, 200, 250, 500, 1000, 5000, 10000 or more depending on the context of the invention. For instance, the DNA molecule may contain at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 20 or 30 or more tandem repeats of the sequence encoding the RNA molecule, such as 2-100, 3-90, 4-80, 5- 70, 6-60, 7-50, 8-40, 9-30 or 10-25 tandem repeats of the sequence encoding the RNA molecule. In some embodiments, the DNA molecule contains at least 4, 5, 6,

7, 8, 9, 10, 15, 20, 25 or more tandem repeats of the sequence encoding the RNA molecule.

As the in vitro transcription reaction results in the production of a plurality of RNA transcripts, each containing a plurality of tandem repeats of the desired RNA molecule, the cleavage reaction in step (d) will result in the release of a plurality of copies of the desired RNA molecule, e.g. 10 3 , 10 4 , 10 s , 10 6 , 10 7 , 10 s , 10 9 , 10 10 or more copies of the desired RNA molecule.

In some embodiments, the method of the invention may be used to produce more than one desired RNA molecule, i.e. the DNA molecule (template) may comprise a sequence encoding tandem repeats of more than one desired RNA molecule, e.g. 2, 3, 4, 5, 6, 7, 8, 9, 10 or more RNA molecules, i.e. RNA molecules with different sequences.

It will be evident that the encoding sequence may contain tandem repeats of different RNA molecules in any order. For instance, encoding sequences for different RNA molecules may be directly adjacent to each other such that the plurality of tandem repeats refers to the repetition of the sequence encoding all of the RNA molecules, e.g. for RNA molecules A and B the plurality of tandem repeats may be sequence ABABAB. For RNA molecules A, B and C, the plurality of tandem repeats may be sequence ABCABCABC. Alternatively, the DNA molecule may encode more than one plurality of tandem repeats, e.g. AAAAABBBBB,

AAAABBBBCCCC etc. The DNA molecule (template) may be designed (e.g. the order of the plurality of tandem repeats) to avoid or minimise interactions between the repeated sequences in the RNA transcripts that may interfere with their production or cleavage.

Thus, the present invention may be used to generate a plurality of copies of RNA molecules in controlled stoichiometry in a single reaction based on the number of copies of each RNA molecule encoded by the DNA molecule. This may be particularly useful for the production of double stranded RNA molecules, e.g. for use in RNA interference applications, such as siRNAs. However, in embodiments where the DNA molecule encodes more than one plurality of tandem repeats of RNA molecules, the RNA molecules do not need to be produced in equal amounts. For example, a 1 :2:3 ratio of RNA molecules A, B and C could be achieved using a DNA molecule with an encoding sequence AAAABBBBBBBBCCCCCCCCCCCC.

It will be evident that where it is desirable to produce double stranded RNA molecules or other RNA structures in which RNA molecules hybridise to each other, each type of RNA molecule may be produced separately in a method of the invention and subsequently mixed in the appropriate ratio to produce the double strand RNA molecule or RNA structure. Thus, in some embodiments, the method involves a step of mixing RNA molecules obtained by the method, e.g. to produce double stranded RNA molecules. The products of the mixture form a further embodiment of the invention.

It will be evident that in embodiments in which the DNA molecule (template) encodes a plurality of RNA molecules, it may be necessary to use a plurality of DNA oligonucleotides (i.e. DNA oligonucleotides with different sequences) in the hybridisation and cleavage steps, i.e. to ensure each RNA molecule is released in the cleavage reaction.

The term“different” refers to RNA molecules, their encoding sequences or DNA oligonucleotide sequences comprising one or more different nucleotides.

Thus, different oligonucleotide sequences may differ by one or more, e.g. 2, 3, 4, 5, 6, 7, 8, 9, 10 or more, nucleotides, e.g. 20, 30, 40, 50, 60, 70, 80, 90 or more nucleotides. The differences may be in the length and/or composition of the sequences. Alternatively viewed, the different sequences (e.g. RNA molecules) have less than 100% sequence identity each other, such as less than 99%, 98%, 97%, 96%, 95%, 90%, 85%, 80%, 70%, 60%, 50% sequence identity to each other. Different oligonucleotides may comprise the same nucleotide sequences but differ in the position of the 2’-0-methyl-NTPs within the oligonucleotide.

Any RNA polymerase may be used in the in vitro transcription reaction of the invention (step (b)). Suitable RNA polymerase enzymes include T7 RNA

polymerase, T3 RNA polymerase, SP6 RNA polymerase or E.coii RNA polymerase. As used herein, the term "RNA polymerase" includes not only naturally occurring enzymes but also all such modified derivatives, including also derivatives of naturally occurring RNA polymerase enzymes. The RNA polymerase used in the method of the invention must be suitable for use with the promoter sequence in the DNA molecule (template).

Particularly preferred RNA polymerase enzymes for use in the invention may be selected from the group consisting of T7 RNA polymerase, T3 RNA polymerase, SP6 RNA polymerase, E.coli RNA polymerase and derivatives, e.g. sequence-modified derivatives, or mutants thereof. In some preferred

embodiments, the RNA polymerase enzyme used in the invention is T7 RNA polymerase or a derivative, e.g. sequence-modified derivatives, or mutant thereof.

Sequence-modified derivatives or mutants of RNA polymerase enzymes include mutants that retain at least some of the functional activity, e.g. RNA polymerase activity, of the wild-type sequence. Mutations may affect the activity profile of the enzymes, e.g. enhance or reduce the rate of polymerisation, under different reaction conditions, e.g. temperature, template concentration, etc.

Mutations or sequence-modifications may also affect the thermostability of the enzyme.

Conditions suitable for in vitro transcription include any conditions that result in the template dependent production of RNA molecules (“transcripts”).

Representative conditions are set out below.

A typical in vitro transcription reaction mixture includes the DNA molecule (template) described above, an RNA polymerase (e.g. at a concentration of about 0.1-1.0mg/ml_, such as about 0.3 mg/ml_), nucleotides (ribonucleotides, ATP, GTP, CTP and UTP, e.g. at a concentration of about 1-5mM, e.g. about 3mM) and other components required for a RNA polymerase reaction as described below. The desired polymerase activity may be provided by one or more distinct polymerase enzymes. The temperature of the reaction will depend on the RNA polymerase used, but typically may be in the range of 20-50°C, such as about 25-40°C, preferably about 37°C. The reaction may be incubated at a suitable temperature for a time sufficient to produce a useful amount of RNA transcripts, e.g. at least about 1 hour, such as at least about 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14 or 16 hours.

The in vitro transcription reaction mixture may further include an aqueous buffer medium that includes a source of divalent cations and a buffering agent. Any convenient source of divalent cations may be used, such magnesium, manganese, zinc and the like, where the cation will typically be magnesium. Any convenient source of magnesium cations may be employed, including MgCh, Mg-acetate, and the like. The amount of Mg 2+ present in the buffer may range from 0.5 to 40 mM, but will preferably range from about 5 to 15 mM, and will ideally be at about 10 mM. Representative buffering agents or salts that may be present in the buffer include Tris, Tricine, HEPES, MOPS and the like, where the amount of buffering agent will typically range from about 5 to 200 mM, usually from about 10 to 150 mM, and more usually from about 80 to 100 mM, where in certain preferred embodiments the buffering agent will be present in an amount sufficient to provide a pH ranging from about 6.0 to 9.5, typically about pH 8.0. Other agents which may be present in the buffer medium include a source of monovalent cations (e.g. sodium or potassium ions, e.g. provided in the form of NaCI or KCI), redox reagents, such as

dithiothreitol, agents to improve polymerase activity or initiation, such as UMP,

GMP, inorganic pyrophophatase and spermidine, crowding agents, such as DMSO, PEG or BSA, and the like.

The DNA oligonucleotides used in the methods of the invention guide or template the cleavage of the RNA transcripts produced in step (b) and enable the release of the plurality of copies of the desired RNA molecule. In particular, the DNA molecules guide RNase H cleavage of the RNA transcripts produced in step (b).

RNase H is present in, and may be isolated from, various organisms. It is characterised by its ability to hydrolyze only RNA: DNA hybrids (duplexes), wherein cleavage occurs in the RNA strand to yield a 3’-hydroxyl and 5’-phosphate at the hydrolysis site. Specifically, RNase H enzyme used in the method of the invention must be able to cleave the RNA strand opposite the 5’ end of the hybridised DNA oligonucleotide and this property is used to direct the site-specific cleavage of RNA molecules using RNase H. This is a property of E.coli RNase H.

For instance, RNase H can be used to cleave RNA molecules at specific sites using chimeric oligonucleotides comprising DNA nucleotides and 2’-0- methylated ribonucleotides. 2’-0-methylated oligoribonucleotides form stable duplexes with complementary RNA molecules but as RNase H does not recognise RNA duplexes as a substrate, the enzyme does not cleave such duplexes. Thus, when an oligonucleotide comprising a sequence of DNA nucleotides next to a sequence of 2’-0-methylated ribonucleotides is hybridized to its complementary sequence in an RNA molecule, RNase H will cleave the RNA molecule opposite the 5’ end of the sequence of DNA nucleotides. Thus, the DNA oligonucleotides for use in the method of the invention comprise a mixture of“conventional” deoxyribonucleotides (dNTPs) and 2’-0- methylated ribonucleotides (2’-0-methyl-NTPs). Accordingly, the DNA

oligonucleotides may be viewed as chimeric guide or splint oligonucleotides that hybridize to the RNA transcripts produced by the in vitro transcription reaction of step (b) to form cleavage domains in the RNA transcripts (RNA:DNA or RNA:splint duplexes) and enable the site-specific cleavage of the RNA transcripts by RNase H. The term DNA oligonucleotides is used herein on the basis that the molecules must contain a consecutive sequence of dNTPs and therefore comprise a DNA oligonucleotide. However, the term DNA oligonucleotide may be used

interchangeably herein with the term“oligonucleotide splint”,“oligonucleotide guide”,“oligonucleotide cleavage guide” or“chimeric splint”.

The design of the DNA oligonucleotides for use in the method of the invention will depend on the sequence of the desired RNA molecule. Any suitable design may be used as along as it is sufficient to enable specific cleavage of the RNA transcripts at the ends of the tandem repeats of the RNA molecule to be produced, i.e. the DNA oligonucleotides must hybridize (be complementary to) the regions of the RNA transcripts comprising the ends of tandem repeats such that cleavage occurs between the last nucleotide in one repeat and the first nucleotide in the adjacent repeat.

Thus, the DNA oligonucleotides may be of any suitable length to hybridize to the RNA transcripts produced in the in vitro transcription reaction of step (b). For instance, the DNA oligonucleotides may comprise at least 8 nucleotides (i.e. made up of dNTPs and 2’-0-methyl-NTPs), such as about 9, 10, 11 , 12, 13, 14, 15, 16,

17, 18, 19, 20 or more nucleotides. In some embodiments, the DNA

oligonucleotides may comprise about 20, 25, 30, 35, 40 or more nucleotides.

The DNA oligonucleotides may comprise any number, combination or design of dNTPs and 2’-0-methyl-NTPs that is sufficient to hybridize to the RNA transcripts and guide site-specific cleavage of the RNA transcripts to release the RNA molecules as defined above. In some embodiments, the DNA oligonucleotides may comprise a sequence of dNTPs bordered on one or both sides by 2’-0-methyl- NTPs. In some embodiments, the DNA oligonucleotides may comprise a sequence of 2’-0-methyl-NTPs bordered on one or both sides by dNTPs. When the dNTPs are bordered by 2’-0-methyl-NTPs only on one side, it is preferred that the 2’-0- methyl-NTPs are present at the 5’ end of the dNTP sequence. Alternatively viewed, when the 2’-0-methyl-NTPs are bordered by dNTPs only on one side, it is preferred that the dNTPs are present at the 3’ end of the 2’-0-methyl-NTP sequence.

In some embodiments, each sequence of dNTPs may comprise 3 or more consecutive nucleotides, such as 3, 4, 5, 6, 7, 8, 9, 10 or more consecutive nucleotides.

In some embodiments, each sequence of 2’-0-methyl-NTPs may comprise 3 or more consecutive nucleotides, such as 3, 4, 5, 6, 7, 8, 9, 10 or more consecutive nucleotides.

In a representative embodiment, the DNA oligonucleotides for use in the method of the invention comprise a sequence of 3 or more consecutive dNTPs bordered on both sides by sequences of 3 or more consecutive 2’-0-methyl-NTPs.

The term“conventional deoxyribonucleotides” as used herein refers to deoxyribonucleotides comprising one of the four bases found in DNA; adenine, guanine, cytosine and thymine. The term“conventional deoxyribonucleotides” thus encompasses, for example, dATP, dGTP, dCTP and dTTP. Whilst uracil is not typically found in DNA naturally, dUTP readily may be used instead of, or in addition to, dTTP. Thus, in the context of the present invention, dUTP may be viewed as a “conventional” deoxyribonucleotide.

The term“2’-0-methyl NTPs” as used herein refers to NTP analogues where a methyl group is added to the 2' hydroxyl of the ribose moiety of a nucleoside, producing a methoxy group.

Thus, the term“nucleotide” as used herein refers generally to a dNTP or NTP or an analogue thereof.

The step of hybridizing the RNA transcripts produced in the in vitro transcription reaction of step (b) to the DNA oligonucleotides may be performed under any suitable conditions, which may differ depending on the size and/or composition of the RNA transcripts and/or DNA oligonucleotides. Generally, the step of hybridizing comprises contacting the RNA transcripts with the DNA oligonucleotides, heating the mixture and subsequently cooling the mixture such that the RNA transcripts and DNA oligonucleotides form duplexes (RNA: DNA duplexes) at sites of complementarity, i.e. to produce a partially double stranded RNA: DNA duplex wherein only the regions at which the oligonucleotides hybridise are double stranded. Suitable conditions for hybridizing nucleic acid molecules are well-known in the art. In a representative embodiment, the mixture of RNA transcripts and DNA oligonucleotides is heated to a temperature suitable to denature any secondary structures present in the transcripts, such as about 85 °C or more, e.g. 85-100°C, such as about 95°C, and allowed to cool to a temperature to allow the

oligonucleotides to anneal to the RNA transcripts and that is suitable for the cleavage reaction to occur e.g. 40°C or less, such as about 20-37°C. It will be evident that the RNA transcripts may be heated before the addition of the DNA oligonucleotides.

Any suitable means for heating the RNA transcripts may be used, e.g. water bath, heat block etc. However, it is preferred that the RNA transcripts (the solution containing the RNA transcripts) is heated rapidly. Whilst conventional laboratory means of heating, such as a heat block, is suitable for small volumes, e.g. of 1ml_ or less, the heat transfer is too slow for large volumes, e.g. of 5ml_ or more, which can result in RNA transcript degradation. The inventors have found that for large volumes of RNA transcripts (i.e. for large scale production of the RNA molecules) rapid heating may be achieved using microwaves and, surprisingly, this does not have a deleterious effect on the RNA transcripts (e.g. degradation).

Thus, in some embodiments (particularly“large scale” embodiments), the step of hybridizing may comprise heating the RNA transcripts with microwaves, e.g. using a conventional microwave oven. The power and duration of microwaves will dependent on the volume of solution in which the RNA transcripts are present. As noted above, the solution should be rapidly heated to the required temperature. For instance, for a volume of 20 ml_, the microwaves used should be sufficient to heat the solution to the required temperature in about 30 seconds or less.

Representative conditions for a 20 ml_ sample are a first heating step at 450W for 10 seconds followed by a second heating step at 450W for 5 seconds. For larger volumes, such as 50 ml_ or more, rapid heating may be defined as heating to the required temperature with 90 seconds, preferably within 80, 70 or 60 seconds. Microwaves of a power of between about 300-1500W, such as about 450-1000W may be used in the invention.

Thus, in some embodiments, the rapid heating comprises heating the reaction mixture (solution) to the required temperature as defined above in about 5 minutes or less, e.g. about 4 or 3 minutes or less. For instance, rapid heating include heating the reaction mixture (solution) to the required temperature as defined above in about 180 seconds or less, e.g. 170, 160, 150, 140, 130, 120 seconds or less.

Similarly, cooling of the solution may be achieved by any suitable means and will dependent on the volume of the solution, the nature of the reaction vessel etc. For instance the solution may be left at room temperature until it reaches the required temperature for the cleavage reaction. In a representative embodiment, the solution may be cooled at a first temperature, e.g. 37°C for 15 minutes, followed by a second lower temperature, e.g. room temperature for 15 minutes.

The amount or concentration of DNA oligonucleotides used in the cleavage reaction will depend on the amount or concentration of RNA transcripts formed by the transcription step and/or the length of each RNA molecule repeat. The skilled person readily could determine the amount or concentration of DNA

oligonucleotides required. In this respect, it is preferred that an excess of DNA oligonucleotides is used in the reaction to ensure that the cleavage reaction is performed to completion or near completion, i.e. to maximise the number of RNA molecules released. In some embodiments, the DNA oligonucleotides may be added to a concentration of about 50-500 mM, such as about 75-300 pM, e.g. about 90-120 pM. As noted above, the DNA oligonucleotides are not cleaved by the RNase H enzyme and thus may be separated from cleaved RNA molecules and re used. Re-cycling of the DNA oligonucleotides reduces costs are represents a further advantage of the invention.

The term "hybridisation" or "hybridises" as used herein refers to the formation of a duplex between nucleotide molecules comprising sequences which are sufficiently complementary to form duplexes via Watson-Crick base pairing.

Two nucleotide sequences are "complementary" to one another when those molecules share base pair organization homology. "Complementary" nucleotide sequences will combine with specificity to form a stable duplex under appropriate hybridization conditions. For instance, two sequences of the same length may be viewed as complementary when the first sequence can bind to the second sequence in an anti-parallel sense wherein the sequence at the 3'-end of each strand binds to the sequence at the 5'-end of the other strand and each A, T(U), G and C of one sequence is then aligned with a T(U), A, C and G, respectively, of the other sequence. RNA sequences can also include complementary G=U or U=G base pairs. It will be understood that in the case of strands of unequal length, the ends of the strands may not be complementary, e.g. there may be an overhang at one or both ends of the longer sequence. Thus, two sequences need not have perfect homology to be "complementary" under the invention. Usually two sequences are sufficiently complementary when at least about 90% (preferably at least about 95%) of the nucleotides share base pair organization over a defined length of the molecule. However, in some preferred embodiments, the DNA oligonucleotides are fully complementary to the sequences in the RNA transcripts with which they hybridize.

The term“bordered” refers to domains (e.g. cleavage domains) or sequences that are directly or indirectly adjacent to other domains or sequence. For instance, in the context of a plasmid containing the DNA molecule (template) for use in the invention, the cleavage domains used to excise the DNA molecule (template) may be positioned at either end of the DNA sequence (template), i.e. the cleavage domains are upstream and downstream (at the 5’ and 3’ ends) of the DNA sequence (template). In some embodiments, the cleavage site of the cleavage domains (e.g. the site at which a cleavage enzyme cleaves a cleavage domain) is directly adjacent to the end of the DNA template it borders. In some embodiments, the DNA template and cleavage domain sequence may overlap, i.e. the end of the DNA template may form part of the cleavage domain, i.e. the cleavage domains may form the ends or part of the ends of the DNA template. Thus, in some embodiments, there may be one or more, e.g. 1 , 2, 3, 4, 5, 6, 7, 8, 9 or 10 or more, nucleotides between the cleavage domain and the DNA template (i.e. between the ends of the sequences). In some embodiments, the cleavage domain and the DNA template may overlap one or more, e.g. 1 , 2, 3, 4, 5, 6, 7, 8, 9 or 10 or more, nucleotides.

In the context of the DNA oligonucleotides (splints) of the invention, “bordered” refers to sequences of dNTPs that are directly adjacent to sequences of 2’-0-methyl-NTPs.

The step of cleaving the RNA strand of the partially double stranded

RNA: DNA duplexes formed in step (c) may be achieved using any conditions suitable to enable RNase H site-specific cleavage of the RNA strand to release the RNA molecules.

Suitable conditions to cleave the RNA transcripts will be dependent on the RNase H used to achieve cleavage. In this respect, RNase H may be obtained from any suitable source and the cleavage step may follow the manufacturer’s instructions for the particular RNase H enzyme. In a preferred embodiment, the RNase H enzyme is a naturally-occurring enzyme (i.e. isolated from its natural source, e.g. animal or bacteria), a recombinant enzyme (e.g. expressed in an, and obtained from, a recombinant organism producing the enzyme, e.g. from a microbial, e.g. bacterial, expression system) or a derivative, e.g. sequence-modified derivatives, or mutant thereof. Thus, any enzyme or combination of enzymes in the enzyme commission number 3.1.26.4 may be used in the present invention. However, as noted above, it is essential that the RNase H enzyme is able to cleave the RNA strand opposite the 5’ end of the hybridised DNA oligonucleotide.

Thus, in some embodiments, the RNase H is an E.coli RNase H. Thus, in some embodiments the RNase H is selected from the following group, which refers to the UniProtKB/Swiss-Prot accession numbers: POA7Y4/RNH_ECOLI;

A7ZHV1/RNH_EC024; B7UJB0/RNH_ECO27; B7MBJ0/RNH_ECO45;

B7LHC0/ RNH_EC055; P0A7Y6/ RNH_EC057; B5ZOI8/RNH_EC05E;

B7N KW4/RN H_EC07I ; B7MQ23/RNH_EC081 ; B7M213/RNH_EC08A;

C4ZRV1/RNH_ECOBW; B1XD78/RNH_ECODH; A7ZWF6/RNH_ECOHS;

QOTLC3/RNH_ECOL5; POA7Y5/RNH_ECOL6; B1 IPU4/RNH_ECOLC;

B7N876/RNH_ECOLU; B6HZS7/RNH_ECOSE; B1 LHM3, RNH_ECOSM and functional variants and derivatives thereof or a combination thereof.

In some embodiments, the RNase H is POA7Y4/RNH_ECOLI or a functional variant or derivative thereof.

The source of the RNase H for use in the present invention is not particularly important and both natural and recombinant enzymes are contemplated for use in the methods described herein.

Accordingly, a solution comprising an RNase H may contain a mixture of enzymes each contributing to the RNA cleavage activity of the solution. Thus, in some embodiments, the RNase H is a mixture of enzymes, e.g. a mixture of any two or more of the enzymes described above. In some embodiments, the mixture may comprise one of the enzymes defined above and one or more other undefined RNase H. In a preferred embodiment, a mixture of RNase H enzymes comprises one of the enzymes defined above as a substantial part of the mixture, i.e.

contributing to at least about 30%, e.g. at least about 40%, 50%, 60%, or 70% of the RNA cleavage activity of the mixture.

In a representative embodiment, the RNase H may specifically bind to the RNA: DNA duplexes and selectively (e.g. specifically) cleave the RNA (opposite the 5’ end of the dNTP sequence in the oligonucleotide splint as defined above) in a Tris-HCI buffer (e.g. about 20mM). Cleavage may occur in a pH range of about 7.0- 9.0, e.g. about 8.0 such as about pH 7.8, over a range of temperatures, e.g. about 20-40 °C, e.g. about 37 °C, for a period of about 1-10 hours, e.g. about 3-6 hours. The reaction mix may contain other reagents, such as monovalent and divalent cations (e.g. in the form of KCI and MgCh, respectively) and DTT. In some embodiments, it may be advantageous to include an RNase inhibitor that does not inhibit RNase H to prevent unwanted degradation of the transcripts and cleaved RNA molecules. The RNase H enzyme may be used at a concentration of about 50-200 Units/mL, e.g. about 100 Units/mL. One unit may be defined as the amount of enzyme that will hydrolyze 1 nmol of the RNA in [ 3 H]-labelled poly(rA) * poly(dT), to acid-soluble ribonucleotides in a total reaction volume of 50 pi in 20 minutes at 37°C. The skilled person would readily be able to determine other suitable conditions.

In some embodiments, it may be desirable to stop the cleavage reaction. This may be achieved by any suitable means, such as heating the reaction mixture to about 65°C for about 20 minutes. Other suitable conditions for stopping the cleavage reaction or inactivating the RNase H are well-known in the art. Thus, in some embodiments, the method comprises a step of inactivating the RNase H enzyme.

The term "cleavage" as used herein includes any means of breaking a covalent bond. Thus, in the context of the invention, cleavage involves cleavage of a covalent bond in a nucleotide chain (i.e. strand cleavage or strand scission), for example by cleavage of a phosphodiester bond.

The term“cleavage domain” as used herein typically refers to a domain within the plasmid comprising the DNA molecule (template) that can be cleaved specifically to linearize the plasmid to produce the DNA molecule (template) or excise the DNA molecule (template) from the plasmid. In some embodiments, the term“cleavage domain” may be used to describe the RNA:DNA duplex formed by hybridizing the DNA oligonucleotides to the RNA transcripts insofar as the DNA oligonucleotides function to create a cleavage domain that may be site-specifi cally cleaved by RNase H. Thus, the DNA oligonucleotides may be viewed as forming RNase H cleavage domains in the RNA transcripts.

Thus, in some embodiments, a cleavage domain may comprise a sequence (a cleavage recognition site) that is recognised by one or more enzymes capable of cleaving a nucleic acid molecule, i.e. capable of breaking the phosphodiester linkage between two or more nucleotides. For instance, in the context of a plasmid containing the DNA molecule (template) of the invention, a cleavage domain may comprise a restriction endonuclease (restriction enzyme) recognition sequence. Restriction enzymes cut double-stranded or single stranded DNA at specific recognition nucleotide sequences known as restriction sites and suitable enzymes are well-known in the art. For example, it may be particularly advantageous to use rare-cutting restriction enzymes, i.e. enzymes with a long recognition site (at least 8 base pairs in length), to facilitate the design of the DNA molecule (template), e.g. to avoid the inclusion of a cleavage recognition site within the DNA template.

In some embodiments, a cleavage domain may comprise a sequence that is recognised by a type II restriction endonuclease, more preferably a type I Is restriction endonuclease. Any suitable cleavage domain and cleavage enzyme may be used in the invention to linearize or excise the DNA molecule (template).

The cleavage domains that border the DNA molecule (template) in a plasmid may be the same or different from each other. Advantageously, when the DNA molecule (template) is excised from the plasmid, the cleavage domains that border the DNA molecule (template) are the same such that a single cleavage step is sufficient to excise the DNA molecule (template). For instance, in some embodiments, the step of excising the DNA molecule (template) comprises contacting the plasmid containing the DNA molecule (template) with a single cleavage enzyme under conditions suitable to cleave the cleavage domains in the plasmid.

For instance, a cleavage enzyme, e.g. a restriction endonuclease, may specifically bind to its cleavage recognition site and selectively (e.g. specifically) cleave the nucleic acid in a variety of buffers, such as phosphate buffered saline (PBS), 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES), HEPES buffered saline (HBS), and Tris buffered saline (TBS), both with and without EDTA. Cleavage may occur in a pH range of about 3.0-10.0, e.g. 4.0-9.0, 5.0-8.0, over a wide range of temperatures, e.g. 0-70 °C. The skilled person would readily be able to determine other suitable conditions.

The term“release” is used in the present context to refer to cleaving the RNA transcripts produced in the in vitro transcription reaction at the RNA:DNA duplexes formed in step (c) so as to detach, disconnect or separate the tandem repeats of the RNA molecule. It is desirable that the release of a given RNA molecule will involve cleavage at both ends of the RNA molecule, i.e. in both RNA:DNA duplexes bordering the RNA molecule.

It is not necessary for cleavage to occur at all of the RNA:DNA duplexes in the RNA transcripts in order to generate the RNA molecules. For instance, the DNA oligonucleotides (splints) may not hybridise to every complementary sequence in the RNA transcripts, which may reduce the efficiency of the cleavage step.

Nevertheless, cleavage of the RNA transcripts in a portion of the RNA:DNA duplexes will result in the release of a portion of RNA molecules. Thus, in some embodiments, the step of cleaving the RNA strand of the RNA:DNA duplexes results in cleavage of the RNA strand in at least about 50% of the RNA: DNA duplexes in the RNA transcripts, e.g. at least about 60%, 70% or 80%. In some embodiments, the step of cleaving the RNA strand of partially double stranded RNA:DNA duplexes results in cleavage of the RNA strand in at least about 90% of the RNA: DNA duplexes in the RNA transcripts, e.g. 95% or more.

Alternatively viewed, in some embodiments, the step of cleaving the RNA strand of partially double stranded RNA: DNA duplexes results in the release of at least about 50% of the RNA molecules contained in the RNA transcripts, e.g. at least about 60%, 70% or 80%. In some embodiments, the step of cleaving the RNA strand of partially double stranded RNA:DNA duplexes results in the release of at least about 90% of RNA molecules contained in the RNA transcripts, e.g. 95% or more.

Once the RNA molecules have been released, it may be desirable to isolate, separate or purify the RNA molecules from the cleavage reaction mixture (e.g. reaction components, such as RNase H and DNA oligonucleotides and/or degradation products, such as the 5’ ends of the transcripts which may contain an optimal initiation site and/or spacer sequence, uncleaved RNA transcripts, etc.) and for use in other applications.

Thus, in some embodiments, the method of the present invention further comprises a step of isolating, separating or purifying the RNA molecules. This isolation, separation or purification may be done by any suitable method known in the art. The isolated or purified RNA molecules form a further embodiment of the invention.

In some embodiments, following the isolation, separation or purification step the RNA molecules are preferably substantially free of any contaminating components derived from the materials or components used in the isolation procedure or in their preparation (e.g. reaction components and/or degradation products as described above). In some embodiments, the RNA molecules are purified to a degree of purity of more than about 50 or 60 %, e.g. more than about 70, 80 or 90%, such as more than about 95 or 99% purity as assessed w/w (dry weight). Such purity levels may include degradation products of the RNA molecules.

In some embodiments, it may be useful to prepare enriched preparations of the RNA molecules which have lower purity, e.g. contain less than about 50% of the RNA molecules of interest, e.g. less than about 40 or 30%.

As discussed above, the invention may result in a mixture or plurality (e.g. a library) of copies of RNA molecules. Thus, in some embodiments, it may be desirable to further separate the RNA molecules, e.g. by size, to obtain specific RNA molecules (i.e. to isolate specific RNA molecules) or to generate sub-groups or sub-libraries of RNA molecules. Any suitable means for separating the mixtures of RNA molecules to isolate the specific RNA molecules or sub-groups or sub libraries of RNA molecules may be employed.

Thus, in some embodiments, the method comprises a further step of separating RNA molecules from a mixture (e.g. library) of RNA molecules obtained by the method described above, to isolate copies of a specific RNA molecule or a sub-group of RNA molecules.

For example, the products of the cleavage reaction may be separated by size using gel electrophoresis using an agarose gel or a polyacrylamide gel. The desired RNA molecules can then be isolated from the gel and purified further, if necessary, according to methods known in the art. Other methods for purifying, isolating or separating the RNA molecules of the invention utilise chromatography (e.g. HPLC, FPLC, size-exclusion, ion-exchange, affinity, hydrophobic interaction, reverse-phase), organic extraction or capillary electrophoresis.

In some embodiments, the step of isolating the plurality of copies of the RNA molecule comprises HPLC purification, particularly ion-exchange HPLC. Thus, in some embodiments, the HPLC fractions containing the plurality of copies of the RNA molecule are pooled and optionally concentrated. Any suitable conditions for HPLC may be used to isolate the RNA molecules and exemplary conditions are set out in the Examples. However, the conditions may depend on the RNA molecules to be isolated and/or the molecules and/or reagents from which the RNA molecules are to be separated. As described above and shown in the Examples, the inventors have surprisingly determined that the method of the invention may be used effectively on a large scale to produce large amounts of RNA, e.g. hundreds of micrograms of RNA per millilitre of reaction mixture - an order of magnitude increase relative to conventional“run-off” transcription performed on the same scale. Thus, in some embodiments, the invention may be viewed as providing a large scale method for producing a plurality of copies of an RNA molecule.

In vitro transcription reactions typically are performed on a small scale, e.g. reaction volumes of about 1ml_ or less. Thus, the term“large scale” refers reaction volumes of about 5ml_ or more, such as about 10, 12, 15, 20, 25, 30, 40 or 50 ml_ or more, e.g. about 75 or 100ml_ or more. Accordingly, in some embodiments, the reaction volume of the reaction mixture(s) used the method of the invention is at least about 5ml_, such as about 10, 12, 15, 20, 25, 30, 40 or 50 ml_ or more, e.g. about 75 or 100ml_ or more. As discussed above, the determination that microwaves can be used to rapidly heat the completed in vitro transcription reaction mixture without degrading the RNA transcripts in step (c) of the method facilitates the use of such large volumes.

In some embodiments, the method of the invention results in, i.e. produces, the desired RNA molecules at a concentration of at least about 0.10 mg/ml_, e.g. at least about 0.15, 0.20, 0.25, 0.30, 0.35, 0.40, 0.45 or 0.50 mg/ml_. In some embodiments, the method of the invention produces the desired RNA molecules at a concentration of about 0.60, 0.70, 0.80, 0.90, 1.00 mg/ml_ or more.

The inventors have also determined that the efficiency of the method can be improved by performing all of the reactions in a single reaction vessel, a so-called “one-pot” reaction. For instance, the upon completion of the in vitro transcription reaction of step (b), the DNA oligonucleotides may be added to the reaction mixture which is subjected to conditions to facilitate hybridization of the DNA

oligonucleotides to the RNA transcripts (e.g. heating and cooling steps as described above). In some embodiments, the DNA oligonucleotides may be added to the reaction mixture after it is heated. The RNase H enzyme may then be added to the reaction mixture under suitable conditions (e.g. once the reaction mixture has cooled to a temperature that will not denature the RNase H but at which the enzyme is active) to cleave the RNA transcripts and release the RNA molecules.

In the“one-pot” embodiments of the invention, the reagents from previous reactions are not removed. Accordingly, the RNase H enzyme used in step (d) must be compatible with (i.e. functional in) the reagents (e.g. buffer) used in the previous steps, e.g. the in vitro transcription step, and/or the reaction mixture from the previous steps may be adjusted to conditions suitable for the next step, e.g. by adjusting the pH, addition of salts etc.

In some embodiments, the step of providing the DNA molecule (template) may comprise a step of cleaving a plasmid containing the DNA molecule (template). Thus, in some embodiments, the in vitro transcription reagents used in step (b) must be compatible with (i.e. functional in) the reagents (e.g. buffer) used in the previous steps, e.g. the cleavage step, and/or the reaction mixture from the previous step may be adjusted to conditions suitable for the next step, e.g. by adjusting the pH, addition of salts, buffers etc.

By“compatible with” or“functional in” is meant that the RNase H enzyme may show some reduced activity in cleaving the RNA strand of the RNA:DNA duplexes relative to the activity of the RNase H enzyme on the RNA: DNA duplexes in conditions that are optimum for the enzyme, e.g. in the buffer, salt and

temperature conditions recommended by the manufacturer. Thus, the RNase H enzyme may be considered to be compatible or functional if it has at least 50%, e.g. at least 60, 70, 80, 85, 90, 95, 96, 97, 98, 99 or 100%, activity relative to the activity of the RNase H enzyme in conditions that are optimum for the enzyme.

Similarly, the RNA polymerase enzyme may show some reduced activity in transcription of the DNA template relative to the activity of the RNA polymerase enzyme on the same DNA template in conditions that are optimum for the enzyme, e.g. in the buffer, salt and temperature conditions recommended by the

manufacturer. Thus, the polymerase enzyme may be considered to be compatible or functional if it has at least 50%, e.g. at least 60, 70, 80, 85, 90, 95, 96, 97, 98, 99 or 100%, activity relative to the activity of the RNA polymerase enzyme in conditions that are optimum for the enzyme.

As shown in the Examples, RNase H is compatible with the reagents used in a T7 polymerase in vitro transcription reaction. Thus, in some preferred embodiments, the RNA polymerase is T7 RNA polymerase and the RNase H is E.coli RNase H.

It will be understood that some of the method steps of the invention may be performed simultaneously. For instance, where the DNA template is provided by linearizing a plasmid containing the DNA template, the step of cleaving the plasmid may be performed simultaneously with the in vitro transcription reaction of step (b), i.e. steps (a) and (b) may be performed simultaneously or sequentially.

Similarly, the steps of hybridizing the DNA oligonucleotides to the RNA transcripts and cleaving the RNA transcripts in the RNA: DNA duplexes may be performed simultaneously, e.g. the reagents for the cleavage reaction may be provided during the hybridization step, e.g. during the cooling step. Thus, steps (c) and (d) may be performed simultaneously or sequentially.

Thus, in a further embodiment the invention may be seen as providing a reaction mixture comprising:

(i) a DNA molecule comprising a promoter sequence for an RNA

polymerase and a sequence encoding a plurality of tandem repeats of an RNA molecule under the control of the promoter sequence;

(ii) an RNA polymerase;

(iii) DNA oligonucleotides comprising 2’-0-methylated ribonucleotides, wherein the DNA oligonucleotides guide enzymatic cleavage of RNA transcripts produced from the DNA molecule at the ends of the tandem repeats of the RNA; and optionally

(iv) an RNase H enzyme.

The reaction mixture also contains reagents needed to perform the methods of the invention, e.g. NTPs, buffers, ions, etc. as described above. In some embodiments, the reaction mixture may further comprise a cleavage enzyme (and associated buffers etc.), e.g. to linearize or excise the DNA molecule from a plasmid containing said DNA molecule.

It will be evident that the method of the invention may also be performed in batches, e.g. each step is performed in a separate reaction vessel or at least two steps are performed in separate reaction vessels. Batch reactions may be particularly useful where the reagents used in one step are not compatible with the reagents in a subsequent step, e.g. the RNase H enzyme is not functional in the conditions used in the in vitro transcription step. Thus, in some embodiments, the method may comprise additional purification steps between the steps of the method described above, e.g. to switch buffers etc. Suitable methods for such purification steps are known in the art, e.g. dialysis, and any such method may be employed in the method of the invention.

In some embodiments, it may be desirable to modify the RNA molecules produced by the method of the invention to alter their properties (e.g. improve their stability or half-life) or increase their utility, e.g. for use as a label or in therapy. For instance, the RNA molecules produced by the invention may be conjugated to another molecule or component by a chemical linker or spacer.

Thus, in some embodiments, the method provides a further step of conjugating or linking the RNA molecules to another molecule or component.

The term“conjugation” in the context of the present invention with respect to linking or joining a molecule or component to an RNA molecule refers to joining said molecule or component to the RNA molecule via a covalent bond. For instance, the conjugation may occur via a click chemistry reaction.

As used herein, the term“click chemistry,” generally refers to reactions that are modular, wide in scope, give high yields, generate only inoffensive by-products, such as those that can be removed by non-chromatographic methods, and are stereospecific (but not necessarily enantioselective). See, e.g., Angew. Chem. Int. Ed., 2001 , 40(11):2004-2021 , which is entirely incorporated herein by reference. In some cases, click chemistry can describe pairs of functional groups that can selectively react with each other in mild, aqueous conditions. Accordingly, click chemistry groups are suitable for the conjugation of functional groups to the RNA molecules of the present method. Common click chemistry reactions include azide- alkyne cycloadditions, alkyne-nitrone cycloadditions, alkene-tetrazine reactions and alkene-tetrazole reactions.

A specific example of click chemistry reaction can be the Huisgen 1 ,3- dipolar cycloaddition of an azide and an alkyne, i.e., Copper-catalysed reaction of an azide with an alkyne to form a 5-membered heteroatom ring called 1 ,2,3-triazole. The reaction can also be known as a Cu(l)-Catalyzed Azide-Alkyne Cycloaddition (CuAAC), a Cu(l) click chemistry or a Cu+ click chemistry. Catalyst for the click chemistry can be Cu(l) salts, or Cu(l) salts made in situ by reducing Cu(ll) reagent to Cu(l) reagent with a reducing reagent (Pharm Res. 2008, 25(10): 2216-2230). Known Cu(ll) reagents for the click chemistry can include, but are not limited to, Cu(ll)D(TBTA) complex and Cu(ll) (THPTA) complex. TBTA, which is tris-[(1- benzyl-1 H-1 ,2,3-triazol-4-yl)methyl]amine, also known as tris- (benzyltriazolylmethyl)amine, can be a stabilizing ligand for Cu(l) salts. THPTA, which is tris-(hydroxypropyltriazolylmethyl)amine, can be another example of stabilizing agent for Cu(l). Other conditions can also be accomplished to construct the 1 ,2,3-triazole ring from an azide and an alkyne using copper-free click chemistry, such as by the Strain-promoted Azide-Alkyne Click chemistry reaction (SPAAC, see, e.g., Chem. Commun., 2011 , 47:6257-6259 and Nature, 2015, 519(7544):486-90), each of which is entirely incorporated herein by reference.

It will be evident that any desirable molecules or components (i.e. entities) may be conjugated to RNA molecules produced by the present method. In some embodiments, the molecule or component may be a nucleic acid molecule, protein (e.g. an antibody or antigen-binding fragment thereof), peptide, small-molecule organic compound (e.g. biotin), fluorophore, metal-ligand complex, polysaccharide, nanoparticle, nanotube, polymer, cell, organelle, vesicle, virus, virus-like particle or any combination of these.

The cell may be a prokaryotic or eukaryotic cell.

In some embodiments, the RNA molecules may be conjugated or linked to a compound or molecule which has a therapeutic or prophylactic effect, e.g. an antibiotic, antiviral, vaccine, antitumour agent, e.g. a radioactive compound or isotope, cytokine, toxin, oligonucleotide, nucleic acid encoding gene or nucleic acid vaccine.

In some embodiments, the RNA molecules (e.g. aptamers) may be conjugated or linked to a label, e.g. a radiolabel, a fluorescent label, luminescent label, a chromophore label as well as to substances and enzymes which generate a detectable substrate, e.g. horse radish peroxidase, luciferase or alkaline

phosphatase. Labels for magnetic resonance imaging, positron emission tomography probes and boron 10 for neutron capture therapy may also be conjugated to the RNA molecules described herein.

In some embodiments, the molecule or component may be selected from the group consisting of: a fluorophore, a sterol, a polyether, a metal complex, a thiol containing molecule, a molecule containing a group providing increased nuclease resistance and a molecule containing a group capable of participating in a click chemistry reaction.

It will be evident that the molecule or component conjugated to the RNA molecules may interact with other molecules and such interactions may be covalent or non-covalent interactions. For instance, a peptide conjugated to the RNA molecules may interact with its cognate binding partner, such as an antibody, non- covalently. In a further example, a molecule containing a group capable of participating in a click chemistry reaction may by conjugated to another molecule or component as defined above via reaction with a reactive group of said molecule or component to form a covalent complex. Thus, in some embodiments, the RNA molecules and other component may be joined together either directly through a bond or indirectly through a linking group. Where linking groups are employed, such groups may be chosen to provide for covalent attachment of the RNA molecules and other component through the linking group. Linking groups of interest may vary widely depending on the nature of the other component. The linking group, when present, is in many embodiments biologically inert.

Many linking groups are known to those of skill in the art and find use in the invention. In representative embodiments, the linking group is generally at least about 50 daltons, usually at least about 100 daltons and may be as large as 1000 daltons or larger, for example up to 1000000 daltons if the linking group contains a spacer, but generally will not exceed about 500 daltons and usually will not exceed about 300 daltons. Generally, such linkers will comprise a spacer group terminated at either end with a reactive functionality capable of covalently bonding to the RNA molecules and other molecule or component.

Spacer groups of interest may include aliphatic and unsaturated

hydrocarbon chains, spacers containing heteroatoms such as oxygen (ethers such as polyethylene glycol) or nitrogen (polyamines), peptides, carbohydrates, cyclic or acyclic systems that may possibly contain heteroatoms. Spacer groups may also be comprised of ligands that bind to metals such that the presence of a metal ion coordinates two or more ligands to form a complex. Specific spacer elements include: 1 ,4-diaminohexane, xylylenediamine, terephthalic acid, 3,6- dioxaoctanedioic acid, ethylenediamine-N,N-diacetic acid, 1 ,T-ethylenebis(5-oxo-3- pyrrolidinecarboxylic acid), 4,4'-ethylenedipiperidine, oligoethylene glycol and polyethylene glycol. Potential reactive functionalities include nucleophilic functional groups (amines, alcohols, thiols, hydrazides), electrophilic functional groups (aldehydes, esters, vinyl ketones, epoxides, isocyanates, maleimides), functional groups capable of cycloaddition reactions, forming disulfide bonds, or binding to metals. Specific examples include primary and secondary amines, hydroxamic acids, N-hydroxysuccinimidyl esters, N-hydroxysuccinimidyl carbonates, oxycarbonylimidazoles, nitrophenylesters, trifluoroethyl esters, glycidyl ethers, vinylsulfones, and maleimides. Specific linker groups that may find use in the RNA molecule conjugates include heterofunctional compounds, such as azidobenzoyl hydrazide, N-[4-(p-azidosalicylamino)butyl]-3'-[2'-pyridyldithio]propio namid), bis- sulfosuccinimidyl suberate, dimethyladipimidate, disuccinimidyltartrate, N- maleimidobutyryloxysuccinimide ester, N-hydroxy sulfosuccinimidyl-4- azidobenzoate, N-succinimidyl [4-azidophenyl]-1 ,3'-dithiopropionate, N-succinimidyl [4-iodoacetyl]aminobenzoate, glutaraldehyde, and succinimidyl-4-[N- maleimidomethyl]cyclohexane-1-carboxylate, 3-(2-pyridyldithio)propionic acid N- hydroxysuccinimide ester (SPDP), 4-(N-maleimidomethyl)-cyclohexane-1- carboxylic acid N-hydroxysuccinimide ester (SMCC), and the like. For instance, a spacer may be formed with an azide reacting with an alkyne or formed with a tetrazine reacting with a trans-cyclooctene or a norbornene.

In some embodiments, it may be useful to conjugate the RNA molecules of the invention to a solid substrate (i.e. a solid phase or solid support) and it will be evident that this may be achieved in any convenient way. Thus, the manner or means of immobilisation and the solid support may be selected, according to choice, from any number of immobilisation means and solid supports as are widely known in the art and described in the literature. Thus, the RNA molecules may be directly bound to the support, for example via a domain or moiety of the RNA molecules (e.g. chemically cross-linked). In some embodiments, the RNA molecules may be bound indirectly by means of a linker group, or by an

intermediary binding group(s) (e.g. by means of a biotin-streptavidin interaction). Thus, the RNA molecules may be covalently or non-covalently linked to the solid support. The linkage may be a reversible (e.g. cleavable) or irreversible linkage. Thus, in some embodiments, the linkage may be cleaved enzymatically, chemically, or with light, e.g. the linkage may be a light-sensitive linkage.

Thus, in some embodiments, a RNA molecule may be provided with means for immobilisation (e.g. an affinity binding partner, e.g. biotin or a hapten, capable of binding to its binding partner, i.e. a cognate binding partner, e.g. streptavidin or an antibody) provided on the support. In some embodiments, the interaction between the RNA molecule and the solid support must be robust enough to allow for washing steps, i.e. the interaction between the RNA molecules and solid support is not disrupted (significantly disrupted) by the washing steps. For instance, it is preferred that with each washing step, less than 5%, preferably less than 4, 3, 2, 1 , 0.5 or 0.1 % of the RNA molecules is removed or eluted from the solid phase.

The solid support (phase or substrate) may be any of the well-known supports or matrices which are currently widely used or proposed for use with nucleic acids. These may take the form of particles (e.g. beads which may be magnetic, para-magnetic or non-magnetic), sheets, gels, filters, membranes, fibres, capillaries, slides, arrays or microtitre strips, tubes, plates or wells etc.

The support may be made of glass, silica, latex or a polymeric material, e.g. a polysaccharide polymer material, such as agarose (e.g. sepharose). Suitable are materials presenting a high surface area for binding of the RNA molecules of the invention. Such supports may have an irregular surface and may be for example porous or particulate, e.g. particles, fibres, webs, sinters or sieves. Particulate materials, e.g. beads are useful due to their greater binding capacity, particularly polymeric beads.

Conveniently, a particulate solid support used according to the invention will comprise spherical beads. The size of the beads is not critical, but they may for example be of the order of diameter of at least about 1 pm and preferably at least about 2 pm, 5 pm, 10 pm or 20 pm and have a maximum diameter of preferably not more than about 500 pm, and e.g. not more than about 100 pm.

Monodisperse particles, that is those which are substantially uniform in size (e.g. size having a diameter standard deviation of less than 5%) have the advantage that they provide very uniform reproducibility of reaction. Representative monodisperse polymer particles may be produced by the technique described in US-A-4336173.

However, to aid manipulation and separation, magnetic beads are advantageous. The term "magnetic" as used herein means that the support is capable of having a magnetic moment imparted to it when placed in a magnetic field, and thus is displaceable under the action of that field. In other words, a support comprising magnetic particles may readily be removed by magnetic aggregation, which provides a quick, simple and efficient way of separating the particles following the interactions or reactions with the RNA molecules.

In a further aspect, there is provided a pharmaceutical composition comprising an RNA molecule obtained using the method of the invention, together with a pharmacologically (or pharmaceutically) acceptable excipient. Thus, the method may comprise a further step of formulating an RNA molecule obtained using the method of the invention with a pharmacologically (or pharmaceutically) acceptable excipient to produce a pharmaceutical composition.

The excipient may include any excipients known in the art, for example any carrier or diluent or any other ingredient or agent such as buffer, antioxidant, chelator, binder, coating, disintegrant, filler, flavour, colour, glidant, lubricant, preservative, sorbent and/or sweetener etc.

The excipient may be selected from, for example, lactic acid, dextrose, sodium metabisulfate, benzyl alcohol, polyethylene glycol, propylene glycol, microcrystalline cellulose, lactose, starch, chitosan, pregelatinized starch, calcium carbonate, calcium sulfate, dextrates, dextrin, dextrose, dibasic calcium phosphate dihydrate, tribasic calcium phosphate, magnesium carbonate, magnesium oxide, maltodextrin, mannitol, powdered cellulose, sodium chloride, sorbitol and/or talc.

The pharmaceutical composition may be provided in any form known in the art, for example as a tablet, capsule, coated tablet, liquid, suspension, tab, sachet, implant, inhalant, powder, pellet, emulsion, lyophilisate, effervescent, spray, salve, emulsion, balm, plaster or any mixtures thereof. It may be provided e.g. as a gastric fluid-resistant preparation and/or in sustained action form. It may be a form suitable for oral, parenteral, topical, rectal, genital, subcutaneous, transurethral,

transdermal, intranasal, intraperitoneal, intramuscular and/or intravenous administration and/or for administration by inhalation.

Thus, in a further embodiment, the invention provides an RNA molecule obtained by the method of the invention. In some embodiments, the RNA molecules obtained by the method of the invention may be conjugated to another molecule or component and/or immobilized on a solid substrate.

In a further embodiment, the invention provides RNA molecules obtained by the method of the invention (including conjugates thereof) or a pharmaceutical composition comprising the RNA molecules for use in therapy. Alternatively viewed, the invention provides the use of RNA molecules obtained by the method of the invention (including conjugates thereof) in the manufacture or preparation of a medicament (e.g. pharmaceutical composition) for use in therapy. In a still further embodiment, the invention provides a method of treating a subject in need of therapy comprising administering to the subject an effective amount of RNA molecules obtained by the method of the invention (including conjugates thereof) or a pharmaceutical composition comprising the RNA molecules.

In a further aspect, the present invention provides a kit, particularly a kit for use in a method for producing a plurality of copies of an RNA molecule as described herein, the kit comprising: (i) a DNA molecule comprising a promoter sequence for an RNA

polymerase and a sequence encoding a plurality of tandem repeats of an RNA molecule under the control of the promoter sequence;

(ii) DNA oligonucleotides comprising 2’-0-methylated ribonucleotides, wherein the DNA oligonucleotides guide enzymatic cleavage of RNA transcripts produced from the DNA molecule at the ends of the tandem repeats of the RNA molecule, and optionally

(iii) an RNA polymerase; and/or

(iv) an RNase H enzyme.

The DNA molecule, RNA polymerase, DNA oligonucleotides and RNase H are as described above.

In some embodiments, the kit may further comprise a cleavage enzyme, e.g. a restriction endonuclease, as defined herein.

As noted above, the method may be used to produce RNA molecules of different sequences in a single reaction. Moreover, RNA molecules produced by the method of the invention may be pooled to generate a mixture or library of RNA molecules. Thus, in a yet further aspect, the present invention provides a library comprising a plurality of RNA molecules obtained by the method of the invention.

In some embodiments, the RNA molecules produced by the method of the invention may be used to produce their encoded protein or peptide, e.g. in an in vitro translation system. Thus, in some embodiments, the method further comprises a step of translating the RNA molecules and optionally isolating the translation product, e.g. protein or peptide.

In some embodiments, the RNA molecules of the invention may be transfected into cells, e.g. prokaryotic or eukaryotic cells. Thus, in some

embodiments, the method further comprises a step of transfecting or transferring the RNA molecules into cells. The transfected cells form a further embodiment of the invention.

The invention will now be described in more detail in the following non limiting Examples with reference to the following drawings:

Figure 1 shows a schematic of the method of the invention in which a chimeric oligonucleotide splint guides the RNase H site-specific cleavage of an RNA transcript comprising a plurality of (e.g. 26) tandem repeats of a desired RNA molecule to yield a plurality of copies of the desired RNA molecule. Figure 2 shows a photograph of a 20% denaturing PAGE showing the production of miR-34a binding site in the HNF4a mRNA (22nt). The numbers on the left show the sizes of the sequences based on a size ladder (not shown). Lane 1 : shows the product of an in vitro transcription reaction from a linearized plasmid encoding 26 repeats following a T7 promoter. 1ml reaction scale, 0.1 pi loaded.

Lane 2: shows the RNase H cleavage reaction of sample from lane 1. 0.1 mI loaded. Lane 3: shows the product of an in vitro transcription reaction from conventional template (single repeat following a T7 promoter sequence). 1ml reaction scale,

0.1 mI loaded.

Figure 3 shows a photograph of a PAGE stained with ethidium bromide to visualise the nucleic acid. The“Reference” lane shows a size ladder. The “Transcription” lane shows the product of an in vitro transcription reaction from a linearized plasmid containing a DNA template encoding 26 repeats of miR-34-5p following a T7 promoter (SEQ ID NO: 6). The“Cleavage (incomplete)” lane shows the product of the oligonucleotide splint guided RNase H cleavage reaction. The “HLPC purified” lane shows the miR-34-5p product following ion-exchange HPLC purification.

Figure 4 shows photographs of PAGE gels stained with a nucleic acid stain (GelRed) to visualise the nucleic acids. (A) Lane 1 shows a size ladder (sizes of some bands marked on the left); Lane 2 shows the circular plasmid containing the DLL1-2 DNA template; Lane 3 shows the linearized plasmid containing the DLL1-2 DNA template; Lanes 4 and 6 show the different amounts of the product of a small scale in vitro transcription reaction using the linearized plasmid; Lanes 5 and 7 show different amount of the product of a large scale in vitro transcription reaction using the linearized plasmid; and (B) Lane 1 shows a reference RNA molecule of 22 nucleotides; Lane 2 shows the product of a small scale cleavage reaction (1 pL RNase H for 24 hours) performed on the large scale in vitro transcription reaction shown in Lanes 5 and 7 of A; Lanes 3 and 4 show the product of a large scale cleavage reaction (1 mL RNase H for 24 hours) performed on the large scale in vitro transcription reaction shown in Lanes 5 and 7 of A.

Figure 5 shows a photograph of PAGE gels stained with a nucleic acid stain (SYBR Gold) to visualise the nucleic acids. The numbers across the top refer to constructs 1-6 described in Example 3. The numbers on the left show the sizes of the sequences based on a size ladder (not shown). All lanes marked“A” show the in vitro transcription products from the linearised plasmid from the respective construct. All lanes marked“B” are the cleavage reaction from the in vitro transcription product of A. The arrows indicate the target band for the RNA sequence. RNA 1 is 20nt, RNAs 2-5 are 22nt, RNA 6 is 75nt.

Examples

Example 1 - Enzymatic production of miR-34a

MicroRNAs have been linked with numerous diseases and the production of large amounts of these molecules cost-effectively is desirable. For instance, miR- 34a is a tumor suppressor gene, which is down-regulated in the K-562 cells and chronic myeloid leukemia (CML) patients due to aberrant DNA hypermethylation. Thus, miR-34a may find utility in the treatment of such patients.

A DNA template comprising the T7 promoter (SEQ ID NO: 2), the T7 optimal initiation sequence (SEQ ID NO: 1) and 27 tandem repeats of a sequence encoding miR-34a-3p (SEQ ID NO: 3) was produced (SEQ ID NO: 4) and cloned into pUC19. An equivalent construct containing only one copy of the sequence encoding miR- 34a-3p was produced as a control.

A DNA template comprising the T7 promoter (SEQ ID NO: 2), the T7 optimal initiation sequence (SEQ ID NO: 1) and 26 tandem repeats of a sequence encoding miR-34a-5p (SEQ ID NO: 5) was produced (SEQ ID NO: 6) and cloned into pUC19. An equivalent construct containing only one copy of the sequence encoding miR- 34a-5p was produced as a control.

The plasmids were linearized by cleaving the plasmid with BamHI according to the manufacturer’s instructions and linearized plasmid was used as the DNA template in 10ml_ transcription reactions using T7 polymerase as described below.

RNA obtained from the templates comprising multiple repeats of the encoding sequences were subjected to cleavage with RNase H as described below. The oligonucleotide splint used to cleave the transcripts comprising repeats of miR- 34a-3p consisted of the sequence mAmUmUmGAGGGmCmAmGmU (SEQ ID NO: 7), wherein each letter preceded by an“m” represents a 2’-0-methyl-NTP. The other letters represent dNTPs. The oligonucleotide splint used to cleave the transcripts comprising repeats of miR-34a-5p consisted of the sequence

mGmCmCmAACAAmCmCmAmG (SEQ ID NO: 8).

Figure 2 shows a picture of denaturing PAGE comparing the amount of miR- 34a binding site in the HNF4a mRNA obtained from the transcription and

subsequent cleavage reactions and demonstrates that the method of the invention generates much higher concentrations of the molecule. The amount of miR-34a-5p and miR-34a-3p generated in reactions as described above was quantified for several independent 10ml_ reactions and the average is shown in the table below.

Table 1

The data in Table 1 demonstrates that the method of the invention consistently results in a significant increase in the amount of target RNA molecules produced relative to conventional“run-off” transcription, which uses only one copy of the target molecule in the DNA template.

77 transcription reaction

Transcription reactions were optimised using small scale 50mI reactions and scaled-up proportionately for large scale (volume) reactions. Each reaction contained 100mM Tris-CI pH 8.0, 10mM MgCL, 25mM Spermidine, 1mM

Dithiothreitol, 5mM GMP, 3mM NTPs, T7 RNA polymerase 0.3mg/ml, inorganic pyrophophatase (IPPase) 0.1 mg/ml and DNA template in varying concentrations. Linearized plasmid was used at 2ng/pl. The plasmid linearization reaction was used directly as the input for transcription reaction without prior purification.

Large scale transcription reactions were performed using a reaction volume of at least 10ml for 16h at 37°C. The transcription reaction was used directly as the input for the RNase H cleavage reaction without further processing.

RNase H cleavage reaction

Cleavage reactions were optimised using small scale 20 mI volume reactions and scaled-up proportionately for large scale (volume) reactions. Typical conditions are:

20% (v/v) of 100mM chimeric cleavage guide (DNA oligonucleotide splint) was added to the transcription reaction and heated in the microwave in a closed 50mL centrifuge tube for 10 seconds and a further 5 seconds at 450W. For slow annealing, the mixture was cooled at 37°C and room temperature for 15 minutes each. To start the reaction, E.Coli RNase H (NEB (catalog # M297L): 100U/mL final concentration) was added with IPPase to 0.1 mg/ml. The cleavage reaction was checked for completion on a 20% denaturing PAGE of 20x20cm (8M urea, 1X TBE buffer, 14W, 2h, Biorad, Protean II) or a similar setup. Reactions took typically between 3 and 6h for completion. In case of incomplete cleavage reactions, microwave annealing was repeated and more RNase H was added. The reaction was stopped by adding excess of EDTA

(100mM, pH 6.5).

The cleavage reactions were concentrated and washed with water using Amicon stirred cells with 1 or 3 kDa cut-off before injection into HPLC.

Example 2 - HPLC purification of RNA molecules produced by RNase H cleavage

The completed cleavage reaction for mi-R34-5p described in Example 1 was concentrated and washed with water using Amicon stirred cells with 1 or 3 kDa cut-off before injection into HPLC.

For ion-exchange purification, a DNAPACTM PA200 column of 22x250mm (Thermo Scientific catalog # 088781) was used. Running buffers were A: 20mM NaAc, 20mM NaCICL, 10% ACN, pH 6.5; and B: 20mM NaAc, 600mM NaCICL,

10% ACN, pH 6.5. The sequence was run at 5.5mL/min flow rate and 75°C to provide denaturing conditions. The gradient for elution was 20-26% buffer B over 30 minutes.

Figure 3 shows that HPLC purification results in a highly pure product.

Example 3 - RNA molecule production is independent of template sequence and production

DNA templates were designed for a variety of desired RNA molecules as shown below. Moreover, the templates were used in the in vitro transcription reactions in the form linearized plasmids as described in Example 1 or solid-phase synthesised DNA. All templates were able to generate RNA transcripts that could be cleaved with RNase H using the oligonucleotide splints set out below, to generate high yields of the desired RNA molecules. These results demonstrate that the method is independent of template sequence and the method used to produce it.

Figure 4 shows the product of the DLL1-2 template described below.

Figure 5 shows the products of constructs 1-6 described below. A ntisense-miR-34a :

A DNA template encoding 27 repeats of the RNA sequence:

AACCAGCUAAGACACUGCCAGG (SEQ ID NO: 9) was cloned into pUC19 as described in Example 1. The oligonucleotide splint used to cleave the transcripts comprising repeats of antisense- miR-34a consisted of the sequence

mGmGmUmUCCTGmGmCmAmG (SEQ ID NO: 10).

DLL 1-2:

A DNA template encoding 27 repeats of the RNA sequence:

CCGGCCGCCUGCGGCACUGCCU (SEQ ID NO: 11) was cloned into pUC19 as described in Example 1. The oligonucleotide splint used to cleave the transcripts comprising repeats of DLL1-2 consisted of the sequence

mGmGmCmCmGmGAGGCmAmGmUmGmCmC (SEQ ID NO: 12). CD44:

Synthetic double strand DNA encoding 6 repeats of the RNA sequence GGCCACUAUGUGUUGUUACUGCCA (SEQ ID NO: 13) was produced by solid- phase synthesis and used directly in the in vitro transcription reaction. The antisense strand sequence of the template DNA is provided in SEQ ID NO: 14. The oligonucleotide splint used to cleave the transcripts comprising repeats of CD44 consisted of the sequence mGmGmCmCTGGCmAmGmUmA (SEQ ID NO: 15).

HNF4a:

Synthetic single strand DNA encoding 2 repeats of the HNF4a RNA sequence was produced by solid-phase synthesis and used directly in the in vitro transcription reaction with a T7 promoter oligonucleotide to provide a double stranded promoter sequence. The antisense strand sequence of the template DNA is provided in SEQ ID NO: 16. The oligonucleotide splint used to cleave the transcripts comprising repeats of HNF4a consisted of the sequence

mGmUmGmACGGTmUmCmCmC (SEQ ID NO: 17).

Construct 1:

A DNA template encoding 27 repeats of the RNA sequence:

GGGCCACAUCCCACUGCCA (SEQ ID NO: 18) was cloned into pUC19 as described in Example 1. The oligonucleotide splint used to cleave the transcripts comprising repeats of SEQ ID NO: 18 consisted of the sequence

mGmUmGmACGGTmUmCmCmC (SEQ ID NO: 19).

Construct 2:

A DNA template encoding 26 repeats of the RNA sequence:

UGGCAGUGUCUUAGCUGGUUGU (SEQ ID NO: 20) was cloned into pUC19 as described in Example 1. The oligonucleotide splint used to cleave the transcripts comprising repeats of SEQ ID NO: 20 consisted of the sequence

mGmCmCmAACAAmCmCmAmG (SEQ ID NO: 21).

Construct 3:

A DNA template encoding 26 repeats of the RNA sequence:

CAAUCAGCAAGUAUACUGCCCU (SEQ ID NO: 22) was cloned into pUC19 as described in Example 1. The oligonucleotide splint used to cleave the transcripts comprising repeats of SEQ ID NO: 22 consisted of the sequence

mAmUmUmGAGGGmCmAmGmU (SEQ ID NO: 23).

Construct 4:

A DNA template encoding 26 repeats of the RNA sequence:

CCGGCCGCCUGCGGCACUGCCU (SEQ ID NO: 11) was cloned into pUC19 as described in Example 1. The oligonucleotide splint used to cleave the transcripts comprising repeats of SEQ ID NO: 11 consisted of the sequence

mGmGmCmCmGmGAGGCmAmGmUmGmCmC (SEQ ID NO: 12).

Construct 5:

A DNA template encoding 26 repeats of the RNA sequence:

AACCAGCUAAGACACUGCCAGG (SEQ ID NO: 9) was cloned into pUC19 as described in Example 1. The oligonucleotide splint used to cleave the transcripts comprising repeats of SEQ ID NO: 9 consisted of the sequence

mGmGmUmUCCTGmGmCmAmG (SEQ ID NO: 10).

Construct 6:

A DNA template encoding 5 repeats of the RNA sequence set forth in SEQ ID NO: 24 was cloned into pUC19 as described in Example 1. The oligonucleotide splint used to cleave the transcripts comprising repeats of SEQ ID NO: 24 consisted of the sequence mGmAmCmGCGACmUmUmGmU (SEQ ID NO: 25).