Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHOD OF DETECTION OF A TARGET NUCLEIC ACID SEQUENCE
Document Type and Number:
WIPO Patent Application WO/2023/170144
Kind Code:
A1
Abstract:
The present invention provides a method for detecting a target nucleic acid sequence in a target nucleic acid molecule using padlock probes and rolling circle amplification (RCA) in a 2-stage RCA reaction, a so-called SuperRCA (sRCA), which generates a second-generation RCA product, by means of which the target nucleic acid sequence may be detected and distinguished from other nucleic acid sequences. The method utilises asymmetric PCR technology, and may be performed in multiplex to detect different multiple target nucleic acid sequences in one or more target nucleic acid molecules. Also provided are kits for use in the method.

Inventors:
LANDEGREN ULF (SE)
CHEN LEI (SE)
Application Number:
PCT/EP2023/055887
Publication Date:
September 14, 2023
Filing Date:
March 08, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
RARITY BIOSCIENCE AB (SE)
International Classes:
C12Q1/6848
Domestic Patent References:
WO2015071445A12015-05-21
WO2016184902A12016-11-24
WO2022117769A12022-06-09
WO2016184902A12016-11-24
WO2004079620A12004-09-16
WO2015071445A12015-05-21
Foreign References:
US20120088684A12012-04-12
EP2021084061W2021-12-02
US6573051B22003-06-03
Other References:
PRINS THEO W ET AL: "Optimised padlock probe ligation and microarray detection of multiple (non-authorised) GMOs in a single reaction", BMC GENOMICS, BIOMED CENTRAL LTD, LONDON, UK, vol. 9, no. 1, 4 December 2008 (2008-12-04), pages 584, XP021048056, ISSN: 1471-2164, DOI: 10.1186/1471-2164-9-584
SANCHEZ J AQUILES ET AL: "Linear-After-The-Exponential (LATE)-PCR: An advanced method of asymmetric PCR and its uses in quantitative real-time analysis", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, NATIONAL ACADEMY OF SCIENCES, vol. 101, no. 7, 17 February 2004 (2004-02-17), pages 1933 - 1938, XP002397153, ISSN: 0027-8424, DOI: 10.1073/PNAS.0305476101
OKINO STEVEN T. ET AL: "Evaluation of bias associated with high-multiplex, target-specific pre-amplification", BIOMOLECULAR DETECTION AND QUANTIFICATION, vol. 6, 1 January 2016 (2016-01-01), pages 13 - 21, XP093051126, ISSN: 2214-7535, DOI: 10.1016/j.bdq.2015.12.001
CHEN LEI ET AL: "Ultra-sensitive monitoring of leukemia patients using superRCA mutation detection assays", NATURE COMMUNICATIONS, vol. 13, no. 1, 12 July 2022 (2022-07-12), XP093050079, Retrieved from the Internet DOI: 10.1038/s41467-022-31397-y
Attorney, Agent or Firm:
DZIEGLEWSKA, Hanna (GB)
Download PDF:
Claims:
Claims

1 . A method for detecting a target nucleic acid sequence in a target nucleic acid molecule in a sample, said method comprising:

(a) performing an asymmetric PCR reaction to generate amplicons of the target sequence using a set of primers comprising a first primer having a first melting temperature (Tm) and a second primer having a second Tm which is at least 10°C lower than the first Tm, wherein said asymmetric PCR reaction comprises in either order:

(i) an exponential PCR phase comprising no more than 12 cycles, in which primers are annealed at a first annealing temperature which allows annealing of both first and second primers; and

(ii) a linear amplification phase in which primers are annealed at a second higher annealing temperature which allows annealing only of the first primer, and only one strand is amplified, thereby preferentially accumulating single-stranded amplicons of the target nucleic acid sequence;

(b) contacting single-stranded amplicons from step (a) with a ligation mix comprising a ligase enzyme and ligating the 5’ and 3’ ends of the amplicons to circularise them;

(c) performing a first RCA reaction using the circularised amplicons as a first RCA template to generate a first RCA product (RCP) comprising multiple repeats of a complementary copy of the target nucleic acid sequence in the amplicons;

(d) contacting the first RCP with padlock probes specific for the target nucleic acid sequence and allowing the probes to hybridise to complementary copies in the multiple repeats;

(e) directly or indirectly ligating the hybridised padlock probes to circularise the hybridised padlock probes;

(f) performing a second RCA reaction using the circularised padlock probes as a second RCA template to generate second RCPs containing multiple repeat complementary copies of the circularised padlock probes; wherein steps (d) to (f) are optionally repeated one or more times; and

(g) detecting the second or final RCP to detect the circularised padlock probes, and thereby the target nucleic acid sequence. 2. The method of claim 1 , wherein the exponential reaction of step (a)(i) is performed first, followed by the linear reaction of step (a)(ii).

3. The method of claim 1 or claim 2, wherein the target nucleic acid sequence is an analyte for detection by the method, or a reporter for an analyte for detection by the method.

4. The method of any one of claims 1 to 3, wherein the method is performed in multiplex to detect multiple different target nucleic acid sequences, wherein in step (a), multiple asymmetric PCR reactions are performed using different primer sets to generate amplicons of multiple different target nucleic acid molecules or multiple different target nucleic acid sequences, and wherein said multiple PCR reactions are performed separately in parallel, or together in multiplex.

5. The method of claim 4, wherein multiple separate asymmetric PCR reactions are performed, and amplicons therefrom are pooled prior to step (b).

6. The method of any one of claims 1 to 5, wherein the method is for detecting a variant target nucleic acid sequence in a target nucleic acid molecule in a sample, and steps (d) to (g) comprise:

(d) contacting the first RCP with two or more padlock probes each comprising target binding regions specific for different variants of the target nucleic acid sequence and allowing the probes to hybridise to their target sequence complements in the multiple repeats;

(e) directly or indirectly ligating the padlock probes which have hybridised to their variant target sequence complements, to circularise the hybridised padlock probes;

(f) performing second RCA reactions using the circularised padlock probes as second RCA templates to generate second RCPs containing multiple repeat complementary copies of the circularised padlock probes;

(g) detecting the second or final RCP to identify the circularised padlock probes, and thereby the variant target nucleic acid sequence.

7. The method of any one of claims 1 to 6, wherein the target nucleic acid molecule is genomic DNA.

8. The method of any one of claims 1 to 6, wherein the target nucleic acid molecule is RNA, and the method comprises generating a cDNA copy of the target RNA before performing the asymmetric PCR reaction of step (a).

9. The method of any one of claims 6 to 8, wherein the variant target nucleic acid sequence is a mutant target nucleic acid sequence or a wild-type sequence that may be present at a given position in a target nucleic acid molecule, or an allelic variant at a target position in a target nucleic acid molecule, or a polymorphism that may be present in a target nucleic acid molecule.

10. The method of any one of claims 1 to 7, or 9. wherein the target nucleic acid molecules are cell-free DNA molecules.

11. The method of any one of claims 1 to 10, wherein the sample is a liquid biopsy sample.

12. The method of claim 11 , wherein the sample is blood plasma.

13. The method of any one of claims 1 to 10, wherein the variant nucleic acid sequence is detected in situ in a cell or tissue sample.

14. The method of any one of claims 1 to 13, wherein the sample is, or is prepared from, a clinical sample.

15. The method of any one of claims 1 to 14, wherein the method is performed in a single reaction vessel under temperature control, and said method comprises:

(i) providing a reaction mixture comprising single stranded amplicons from step (a), wherein said reaction mixture comprises the PCR polymerase, and an excess of PCR primers relative to the amplicons;

(ii) reducing the excess of primers from the reaction mixture of (i) and/or removing the primer sequence from the amplicons generated in (i);

(iii) contacting the reaction mixture with a ligation mix comprising a ligase enzyme and performing a ligation reaction to ligate the 5’ and 3’ ends of the amplicons to circularise the amplicons, wherein the ligation reaction is performed under conditions in which the ability of the PCR polymerase to extend the hybridised 3’ end of the amplicon on the ligation template is inhibited;

(iv) adding to the reaction mixture from (iii) an RCA mix comprising one or more RCA reagents including at least a RCA polymerase and performing a first RCA reaction using the circularised amplicons as a first RCA template to generate a first RCA product (RCP) comprising multiple repeats of a complementary copy of the target nucleic acid sequence in the amplicon;

(v) heating to inactivate the RCA polymerase;

(vi) contacting the first RCP with padlock probes specific for the target nucleic acid sequence and allowing the padlock probes to hybridise to the target sequence complements in the multiple repeats;

(vii) directly or indirectly ligating the hybridised padlock probes to circularise the hybridised padlock probes, wherein the ligation reaction is performed under conditions in which the ability of polymerase to extend the hybridised 3’ ends of the padlock probes is inhibited;

(viii) removing or rendering inert unligated padlock probes in the reaction mixture of (vii);

(ix) performing a second RCA reaction using the circularised padlock probes as second RCA templates to generate second RCPs containing multiple repeat complementary copies of the padlock probes; wherein steps (i) to (ix) are performed in a single reaction vessel and temperature is controlled during the steps; and wherein steps (vi) to (ix) are optionally repeated one or more times; and

(x) detecting the second or final RCPs to detect the circularised padlock probes, and thereby the target nucleic sequence. The method of any one of claims 1 to 15, wherein the padlock probe is:

(i) a 1-part padlock probe in the form of a single circularisable oligonucleotide comprising target-binding regions at its 5’ and 3’ ends; or

(ii) a 2-part padlock probe comprising a first oligonucleotide with a first, target- complementary, binding region at its 3’ end and a second binding region at its 5’ end which is complementary to a ligation template, and a second oligonucleotide with a first, target-complementary, binding region at its 5’ end and a second binding region at its 3’ end which is complementary to the ligation template; wherein the 3’ and 5’ ends of the first and second oligonucleotides are ligated together, templated respectively by the target sequence in the first RCP, and the ligation template.

17. The method of claim 16, wherein the ligation template serves as primer for the second RCA reaction.

18. The method of any one of claims 1 to 17, wherein the padlock probes each comprise a detection sequence which is specific to the padlock probe and the second or further RCPs are detected by detection probes which hybridise to the complementary copies in the second or further RCP of the detection sequence.

19. The method of claim 18, wherein the detection probes are labelled with detectable labels, preferably with a fluorescent label.

20. The method of any one of claims 1 to 19, wherein the second or final RCPs are detected by microscopy or by flow cytometry.

21 . The method of any one of claims 1 to 20, wherein down to step (f), the method is a homogenous method performed in solution or suspension.

22. The method of any one of claims 1 to 20, wherein the method is performed on a solid support.

23. The method of any one of claims 1 to 22, wherein the second or final RCPs are detected by imaging.

24. A kit for use in detecting a target nucleic acid sequence in a target nucleic acid molecule, said kit comprising:

(i) a set of primers for an asymmetric PCR reaction, wherein said primers are capable of amplifying the target nucleic acid sequence, and comprise a first primer having a first melting temperature (Tm) and a second primer having a second Tm which is at least 10°C lower than the first Tm; and

(ii) padlock probes which comprise target-binding regions which are specific for the target nucleic acid sequence; and optionally

(iii) ligation templates which comprise at or near their respective 5’ and 3‘ ends binding regions which are capable of hybridising to complementary binding sites in amplicons of the target nucleic acid sequence. The kit of claim 24, wherein said kit is for detection of multiple different target nucleic acid sequences in one or more target nucleic acid molecules and comprises multiple different primer sets (i) and padlock probes (ii) each specific for different target nucleic acid sequences, and optionally ligation templates (iii). The kit of claim 24 or claim 25, wherein said kit is for detection of different variants of a target nucleic acid sequence and comprises two or more different padlock probes, each specific for a different variant of the target sequence.

Description:
Method of detection of a target nucleic acid sequence

Field

The present disclosure and invention concerns the field of nucleic acid detection. Particularly, the present disclosure and invention relates to a method for detecting a target nucleic acid sequence in a target nucleic acid molecule using padlock probes and rolling circle amplification (RCA) in a 2-stage RCA reaction, a so-called SuperRCA (sRCA), which generates a second-generation RCA product, by means of which the target nucleic acid sequence may be detected and distinguished from other nucleic acid sequences. The method utilises asymmetric PCR technology, and may be performed in multiplex to detect different multiple target nucleic acid sequences in one or more target nucleic acid molecules. Also provided are kits for use in the method.

Background

The detection of target nucleic acid sequences has applications in many different fields, including notably clinically, for personalised medicine and in the diagnosis, prognosis and/or treatment of disease, such as cancer, infectious diseases and inherited or genetic disorders, as well as in research and biosecurity.

The target nucleic acid sequence may readily be detected using labelled hybridisation probes, but simple hybridisation probes have relatively high lower detection limits, and cannot readily be used to discriminate between similar nucleic acid sequences. To increase sensitivity, target nucleic acid molecules containing target sequences may typically be amplified, to increase the amount of target sequence available for detection. Any of a variety of techniques known in the art may be used for the amplification, including PCR and RCA.

PCR is of course a well-known nucleic acid detection technique for exponential amplification using two primers to produce a linear amplification product. Whilst widely used, in the context of nucleic acid detection, PCR suffers the drawback that amplicons comprise both the forward and reverse strands of the target nucleic acid molecule, of which only the forward strand comprises the desired target nucleic acid sequence, and thus the amplicons must be denatured to separate the strands and allow hybridisation of probes to the target nucleic acid sequence. Furthermore, during PCR, errors introduced by polymerase enzyme are amplified, and as the number of PCR cycles increases, so does the rate of polymerase errors. Whilst in many applications this can be tolerated, this is a particular problem when PCR is used for the detection of variant sequences, such as mutations, particularly those which differ in only a few nucleotides, and especially where these are low-frequency or low abundance variants.

Asymmetric PCR is a variant of PCR which is also well known in the art, and is typically used to preferentially amplify one strand of the nucleic acid molecule more than the other. Primers are added at unequal molar ratio, e.g. 10:1, to favour the synthesis of the desired strand. Primers for the desired strand are therefore in great excess in comparison to the primers for the undesired strand (the limiting primer), and this imbalance results in two phases of amplification. The first phase is exponential, and similarly to regular PCR, amplicons comprising both strands are generated. This initial phase ends upon consumption of the primers for the undesired strand and is followed by the second phase, which selectively amplifies the desired strand using the excess primers for said strand, constituting the linear phase of asymmetric PCR. This phase has lower reaction efficiency due to the melting temperature of the limiting primer decreasing below the optimal reaction annealing temperature as a result of its lower concentration, and requires extra cycles to generate a reasonable yield of amplicons.

Linear-After-The-Exponential (LATE)-PCR was developed to address the efficiency and yield problems faced by asymmetric PCR, and utilises a limiting primer with a higher melting temperature than the excess primer to maintain reaction efficiency as the limiting primer concentration decreases mid-reaction. In WO2016/184902, a method is described in which a linear asymmetric incremental polymerase reaction (Al PR) is followed by an exponential PCR. This was designed to improve the accuracy of detecting low abundance mutated DNA sequences by reducing the rate of polymerase error in early PCR cycles and therefore reducing the signal from false positives. The initial Al PR occurs at a high annealing temperature, and the latter exponential PCR occurs at a lower annealing temperature, with both stages performed within partitioned reactions such as droplet digital PCR.

RCA utilises a strand displacement polymerase enzyme, and requires a circular amplification template. Amplification of the circular template provides a concatenated RCA product, comprising multiple copies of a sequence complementary to that of the amplification template. Such a concatemer typically forms a ball or “blob”, which may readily be visualised and detected, and thus RCA- based assays have been adopted for the detection of nucleic acids, and indeed, more generally, as reporter systems for the detection of any target analyte. Both target nucleic acid analytes, which may themselves be circularised directly, or probes, or reporter nucleic acids more generally may provide template nucleic acid circles for RCA.

The specificity of a nucleic acid detection method may be improved by the use of probes which require dual recognition, or two binding sites for a target nucleic acid sequence, such as a padlock probe. Padlock probes are typically linear oligonucleotides with two separate target-complementary binding regions, connected by an intervening “backbone” region. When the probe has bound (hybridised) to its target nucleic acid sequence, the ends of the probe may be ligated together to circularise the probe with single nucleotide discrimination specificity within 6bp around the ligation site. The circularised padlock probe may then be used as the template for a RCA reaction, and the RCA product may be detected. This forms the basis of a number of detection assays in use today. Padlock probes thus provide an extra layer of specificity, since only probes correctly base-paired at the ligation site will be ligated to generate the template for the molecule which will be detected.

RCA-based assays have been described which rely on secondary amplification of the initial RCA product, to increase the amount of product which is detected, and thereby to provide amplification of the signal in the assay. These include for example hyberbranched RCA which generates many unclustered subsequent RCA products through the strand displacement activity. More recently, so-called “SuperRCA” (sRCA) reactions have been developed which comprise 2 or more rounds of RCA amplification, wherein the product of the second RCA reaction is linked to that of the first. One version of such a sRCA method is described in W02014/0796209. In this method, a probe capable of providing or functioning as a primer is hybridised to an initial RCA product and is used to prime the amplification of a second RCA template circle which hybridises to the “primer-probe”. The second RCA template may be generated by circularisation of a padlock probe which hybridises to the “primer-probe”. In WO 2015/071445, an alternative sRCA method termed “Padlock sRCA” is described, in which a padlock probe is used to bind directly to the initial RCA product.

In many applications, the nucleic acid sequences to be detected occur at low levels, for example in the case of rare mutations, or cell-free DNA (cfDNA) in plasma or other clinical samples, or where limited amounts of sample are available. In such cases, very sensitive methods of detection are required, particularly methods which allow a high amplification of the target nucleic acid to be achieved. PCT/EP2021/084061 describes a sRCA-based detection method of particular use in this regard, in which padlock probes bind to the target nucleic acid sequence and undergo gap-fill extension and ligation to generate many circularised copies of the target nucleic acid sequence, which serve as first RCA templates for the sRCA reaction.

Given the ubiquity and convenience of PCR as an amplification method, we have been working towards providing a robust and readily automatable detection method, suitable for use to detect rare variants, which combines PCR and sRCA. However, the polymerase error rate in the PCR step remains a problem to be addressed as it negatively impacts the accuracy of the assay as a whole, for example, when high numbers of exponential PCR cycles are performed to generate circular amplification templates for the RCA reaction. Furthermore, the undesired strand of the amplicon reduces efficiency of the assay by competing with the desired strand for generation of the RCA template.

There is a continuing need for further optimised assays with improved efficiency and sensitivity. The present disclosure, and invention, are directed to providing such a method.

In the present method, asymmetric PCR is used to generate amplicons of the target nucleic acid sequence for use in a subsequent superRCA (sRCA) reaction. An asymmetric PCR step has been developed, using primers with different melting temperatures, comprising an exponential PCR phase at a first annealing temperature and a linear amplification phase at a second higher annealing temperature to amplify the desired strand. The two phases can be performed in either order, exponential first, or exponential second. In order to improve accuracy of the method, the number of amplification cycles is kept low to reduce the incidence of polymerase errors, and in particular, the number of exponential cycles is kept low. The amplicons of the desired strand are ligated into circles and then amplified by RCA to generate a first RCA product containing multiple repeat complementary copies of the target sequence. The resulting first RCA product is then probed with a padlock probe, specific for the complementary copies of the target sequence. The circularised padlock probe is subjected to a further, second RCA reaction, which is used to generate a second (i.e. secondary) RCA product which is detected to detect the target sequence.

The use of low numbers of exponential PCR cycles, e.g. no more than 12, reduces the accumulation of polymerase errors in the amplicons and leads to increased accuracy of assay results by reducing the amount of false positives or false negatives. Furthermore, where the linear phase follows the exponential phase, the accumulation of single-stranded amplicons of the desired strand after the linear amplification phase renders optional the denaturation of double-stranded amplicons, to allow hybridisation to the ligation template or accessibility to ligase. In addition, the low yield of the undesired strand improves efficiency of the amplicon ligation step due to reduced competition with the desired strand for the ligation template or the ligase enzyme.

Accordingly, in a first aspect provided herein is a method for detecting a target nucleic acid sequence in a target nucleic acid molecule in a sample, said method comprising:

(a) performing an asymmetric PCR reaction to generate amplicons of the target sequence using a set of primers comprising a first primer having a first melting temperature (Tm) and a second primer having a second Tm which is at least 10°C lower than the first Tm, wherein said asymmetric PCR reaction comprises, in either order:

(i) an exponential PCR phase comprising no more than 12 cycles, in which primers are annealed at a first annealing temperature which allows annealing of both first and second primers; and

(ii) a linear amplification phase in which primers are annealed at a second higher annealing temperature which allows annealing only of the first primer, and only one strand is amplified, thereby preferentially accumulating (or in other words, amplifying) singlestranded amplicons of the target nucleic acid sequence;

(b) contacting single-stranded amplicons from step (a) with a ligation mix comprising a ligase enzyme and ligating the 5’ and 3’ ends of the amplicons to circularise them;

(c) performing a first RCA reaction using the circularised amplicons as a first RCA template to generate a first RCA product (RCP) comprising multiple repeats of a complementary copy of the target nucleic acid sequence in the amplicons;

(d) contacting the first RCP with padlock probes specific for the target nucleic acid sequence and allowing the probes to hybridise to the complementary copies in the multiple repeats;

(e) directly or indirectly ligating the hybridised padlock probes to circularise the hybridised padlock probes;

(f) performing a second RCA reaction using the circularised padlock probes as a second RCA template to generate second RCPs containing multiple repeat complementary copies of the circularised padlock probes; wherein steps (d) to (f) are optionally repeated one or more times; and

(g) detecting the second or final RCPs to detect the circularised padlock probes, and thereby the target nucleic acid sequence.

The second RCPs (which may also be referred to as secondary RCPs) together with the first RCP to which they are attached, form the superRCA (sRCA) product, which is the product of the assay which is detected. Analogously, if the padlock probing and RCA steps are repeated, then further generations of RCPs will be attached to the previous generation of RCP, up to the final generation of RCP (the “final RCP”), together forming the sRCA product.

In an embodiment, the exponential phase PCR reaction (i) is performed first and is followed by the linear amplification reaction (ii). In this embodiment, the asymmetric PCR reaction can be said to comprise (i) a first exponential phase and (ii) a second linear amplification phase, wherein the phases are as defined above.

It will be understood that the desired strand for preferential amplification and circularisation into the first RCA template can be selected depending on the respective strands to which the first and second primers bind. Thus, the target sequence may be preferentially amplified in either orientation according to choice. It follows, therefore, that any reference herein to a target nucleic acid sequence includes the sequence complementary to the target nucleic acid sequence.

Accordingly, in step (d), the padlock probes are allowed to hybridise to the complementary copies of the target sequence in the multiple repeats. In this context, the target sequence complements in the first RCP which are bound by the padlock probes are the complements of the target sequence as it appears in the amplicons. The term “complement”, in this regard, is synonymous with “complementary sequence” or “complementary copy”. It will further be understood that the single-stranded amplicons in step (b) are the preferentially amplified strands.

In an embodiment, step (b) is a template-directed ligation, and the contacting of step (b) comprises contacting the amplicons with a ligation template. Conveniently, a ligation template is included in the ligation reaction mix. However, this is not strictly necessary, and in an embodiment, a non-templated ligation may be performed. This uses a ligase which does not require a ligation template, such as a CircLigase™ enzyme.

The target nucleic acid sequence may be an analyte for detection by the method, or a reporter for an analyte for detection by the method.

The method presented above may be performed in multiplex to detect multiple different target nucleic acid sequences, which may be present in one or more nucleic acid molecules, wherein in step (a), multiple asymmetric PCR reactions are performed using different primer sets to generate amplicons of multiple different target nucleic acid molecules or multiple different target nucleic acid sequences, and wherein said multiple PCR reactions are performed separately in parallel, or together in multiplex.

In an embodiment, the amplicons from the multiple separate asymmetric PCR reactions are pooled prior to step (b). In another embodiment, multiple asymmetric PCR reactions are performed together in the same reaction mixture.

The methods presented above may be used to detect a variant target nucleic acid sequence in a target nucleic acid molecule. Target nucleic acid sequences may commonly occur in variant forms, for example, allelic variants, or mutant and wild-type sequences, and it may be desirable to detect which variant is present. It will thus be seen that the target nucleic acid sequence may be one of a number of different variants of the nucleic acid sequence which may occur.

Accordingly, in an embodiment, the method is for detecting a variant target nucleic acid sequence in a target nucleic acid molecule in a sample, and steps (d) to (g) comprise:

(d) contacting the first RCP with two or more padlock probes each comprising target binding regions specific for different variants of the target nucleic acid sequence and allowing the probes to hybridise to the target sequence complements in the multiple repeats;

(e) directly or indirectly ligating the padlock probes which have hybridised to their variant target sequence complements (i.e. to the variant sequence complements which are the targets of, or which correspond to the padlock probes), to circularise the hybridised padlock probes;

(f) performing second RCA reactions using the circularised padlock probes as second RCA templates to generate second RCPs containing multiple repeat complementary copies of the circularised padlock probe;

(g) detecting the final (e.g. second) RCPs to identify the circularised padlock probes, and thereby the variant target nucleic acid sequence.

In step (e), correctly hybridised padlock probes are ligated, that is those that have correctly and specifically hybridised to the complements of the variant target sequences that they are designed to detect (i.e. the padlock probes which have hybridised to their respective target sequence complements).

The method may be used for the detection of DNA or RNA, and may be performed in homogenous or heterogenous formats. It may be used to detect a target nucleic acid sequence in situ or in isolated form, or in a liquid sample.

Advantageously, the methods above may be performed in a single reaction vessel under temperature control, said method comprising:

(i) providing a reaction mixture comprising single-stranded amplicons from step (a), wherein said reaction mixture comprises the PCR polymerase, and an excess of PCR primers relative to the amplicons,

(ii) reducing the excess of primers from the reaction mixture of (i) and/or removing the primer sequence from the amplicons generated in (i);

(iii) contacting the reaction mixture with a ligation mix comprising a ligase enzyme and performing a ligation reaction to ligate the 5’ and 3’ ends of the amplicons to circularise the amplicons, wherein the ligation reaction is performed under conditions in which the ability of the PCR polymerase to extend the hybridised 3’ end of the amplicon on the ligation template is inhibited;

(iv) adding to the reaction mixture from (iii) an RCA mix comprising one or more RCA reagents including at least a RCA polymerase and performing a first RCA reaction using the circularised amplicons as a first RCA template to generate a first RCA product (RCP) comprising multiple repeats of a complementary copy of the target nucleic acid sequence in the amplicon;

(v) heating to inactivate the RCA polymerase; (vi) contacting the first RCP with padlock probes specific for the target nucleic acid sequence and allowing the padlock probes to hybridise to the target sequence complements in the multiple repeats;

(vii) directly or indirectly ligating said hybridised padlock probes to circularise the hybridised padlock probes, wherein the ligation reaction is performed under conditions in which the ability of polymerase to extend the hybridised 3’ ends of the padlock probes is inhibited;

(viii) removing or rendering inert unligated padlock probes in the reaction mixture of (vii);

(ix) performing a second RCA reaction using the circularised padlock probes as second RCA templates to generate second RCPs containing multiple repeat complementary copies of the circularised padlock probes; wherein steps (i) to (ix) are performed in a single reaction vessel and temperature is controlled during the steps; and wherein steps (vi) to (ix) are optionally repeated one or more times; and

(x) detecting the second or final RCPs to detect the circularised padlock probes, and thereby the target nucleic sequence.

In the above methods, the padlock probe may be:

(i) a 1-part padlock probe in the form of a single circularisable oligonucleotide comprising target-binding regions at its 5’ and 3’ ends; or

(ii) a 2-part padlock probe comprising a first oligonucleotide with a first, target-complementary, binding region at its 3’ end and a second binding region at its 5’ end which is complementary to a ligation template, and a second oligonucleotide with a first, target-complementary, binding region at its 5’ end and a second binding region at its 3’ end which is complementary to the ligation template; wherein the 3’ and 5’ ends of the first and second oligonucleotides are ligated together, templated respectively by the target sequence in the first RCP, and the ligation template.

In a second aspect, there is provided a kit for use in detecting a target nucleic acid sequence in a target nucleic acid molecule, said kit comprising:

(i) a set of primers for an asymmetric PCR reaction, wherein said primers are capable of amplifying the target nucleic acid sequence, and comprise a first primer having a first melting temperature (Tm) and a second primer having a second Tm which is at least 10°C lower than the first Tm; and (ii) padlock probes which comprise target-binding regions which are specific for the target nucleic acid sequence; and optionally

(iii) ligation templates which comprise at or near their respective 5’ and 3‘ ends binding regions which are capable of hybridising to complementary binding sites in amplicons of the target nucleic acid sequence;

In an embodiment, the kit is for detection of multiple different target nucleic acid sequences in one or more target nucleic acid molecules and comprises multiple different primer sets (i) and padlock probes (ii) each specific for different target nucleic acid sequences, and optionally ligation templates (iii).

In another embodiment, the kit is for detection of different variants of a target nucleic acid sequence and comprises two or more padlock probes, each specific for a different variant of the target sequence.

Detailed description

The present method provides a high-fidelity and highly-sensitive method for detecting a specific nucleotide sequence (the terms “nucleotide sequence” and “nucleic acid sequence” are used interchangeably herein). The method is particularly useful to detect variants of a target sequence, which may be present in a sample, for example, mutant sequences. Particularly, rare target sequences or sequence variants, or sequences or variants present in low abundance, or at low levels, may be detected.

The method combines a preliminary amplification of the target sequence by an asymmetric PCR step with a sRCA reaction to generate a sRCA product (sRCP), comprising at least a first and second RCPs, which is detected to detect the target sequence.

Thus, in a first step, amplicons of the target nucleic acid sequence, present in target nucleic acid molecules in a sample, are generated by asymmetric PCR. Such amplicons comprise double-stranded amplicons, generated during the exponential PCR reaction (e.g. in the first phase, or first reaction of the asymmetric PCR), and single-stranded amplicons of the desired strand comprising the target nucleic acid sequence, generated during the linear amplification reaction (e.g. in the second phase, or second reaction of the asymmetric PCR step. The number of cycles of the exponential PCR reaction is kept low, to no more than 12, and particularly no more than 11 or 10. Thus, the single-stranded amplicons of the desired strand predominate. If desired, the double-stranded amplicons may be denatured to release the desired single strands. The single-stranded amplicons are circularised by ligation, and subjected to RCA to create a RCA product comprising multiple repeated tandem complementary copies of the target sequence. These amplified target sequences are then probed with a target-sequence specific padlock probe, which, upon hybridisation to the target sequence-complementary sequences, is in turn circularised (to form a second RCA template, also referred to as a secondary RCA template) and amplified by RCA, generating a second (or secondary) RCA product, by means of which the target sequence may be detected.

The generation of a RCA product, specifically a sRCA product, provides a signal that can readily be detected, and counted. A digital counting readout can be implemented. This allows for digital detection of one reaction product for each detected target molecule. Further, the products of the second (or further) RCA reaction are large products which contain multiple (hundreds, and possibly up to the order of a thousand or so) copies of complements of the target sequence, and are of significant size. Such prominent reaction products may readily be collected, with minimal risk for mix-up with any other material in the reaction.

The present method affords a high level of amplification of the target sequence, by means of which the sensitivity of the detection method is improved, rendering it capable of detecting and identifying very rare target sequences. Thus, rare sequence variants, or sequences or variants present at low levels (“low abundance”), may be detected, or identified or discriminated. Further, the method permits accurate quantification of target nucleic acid sequences. For example, the ratios of different target nucleic acid sequences may be determined with high precision, e.g. in the context of determining copy numbers of chromosomes, where target sequences from 2 different chromosomes may be detected and compared. This may be of particular utility in detection of trisomy, for example, of chromosome 21, for example, in the context of non-invasive prenatal testing (NIPT). The basic principles of the detection method are shown schematically in Figure 1.

The use of asymmetric PCR to detect target nucleic acid sequences is, as noted above, known in the art, and various formats of the asymmetric PCR protocol are known. To optimise the SuperRCA protocol, the present method provides circularised RCA templates from a particular asymmetric PCR protocol for the subsequent 2-stage RCA reactions. Notably, by including the asymmetric PCR step, sensitivity of the method is increased by the provision of a high yield of amplicons of the target nucleic acid sequence, but limiting negative effects of polymerase error on the accuracy of the method.

In the asymmetric PCR protocol used in the method herein, the two PCR primers are designed to have different annealing temperatures I and temperature control is used during the method to control which primers anneal, and hence are able to prime the amplification reaction (i.e. there is a temperature “switch” in the method). Thus, the exponential reaction is performed at a lower temperature, at which both primers anneal, and the linear amplification is performed at a higher temperature at which only the higher Tm primer anneals. An increase or decrease in temperature is used to switch between the phases, depending on which phase is performed first. As noted above, the two phases (i) and (ii) may be performed in either order, that is, the exponential phase (i) first, followed by the linear phase (ii); or in an alternative embodiment, the linear phase (ii) is performed first, followed by the exponential phase (i).

In a preferred embodiment of the method, a first exponential reaction using both primers is performed first, followed by a second linear amplification reaction using the first primer only.

In reaction (i) of step (a), both primers are extended and both strands are amplified (e.g. the strand comprising the target sequence and its complement (complementary strand)). In reaction (ii) of step (a), only the first primer is extended, and hence only one strand is amplified (e.g. only the strand comprising the target sequence).

In other words, in the linear amplification reaction, there is mono-directional copying (in a single direction only), as opposed to bi-directional copying in the exponential reaction. In the linear amplification reaction, the synthesised copy is the complement to the sequence (or strand) to which the first primer anneals, and hence the synthesised copy cannot become a template for further amplification. Thus, for each thermal cycle, for each original single-stranded template only a single complementary copy is synthesised.

Specificity of the method is also increased by reducing the number of exponential PCR cycles. In the present method, a low number of exponential PCR cycles, specifically no more than 12, or particularly no more than 11 or 10 cycles, are performed in the exponential (e.g. initial) phase of the asymmetric PCR step, which has the effect of reducing the accumulation of polymerase errors in the amplicons, reducing the rate of false positives and false negatives in the assay results. In fact, in some embodiments, the number of exponential cycles may be no more than 9, 8 or 7 cycles. For example, 6 cycles may be performed.

Furthermore, efficiency of the method may be increased by removing the need to denature double-stranded amplicons to generate single strands for the ligation step. In an embodiment of the present method, the second phase of the asymmetric PCR step comprises a linear amplification reaction which generates single-stranded amplicons of the target nucleic acid sequence, which may be circularised in the ligation step without requiring a further denaturation step.

Efficiency of the method is also increased by reducing the competition of the complementary strand with the target strand during the ligation step. In the present method, when a denaturation step is not performed, the complementary strand of the double-stranded amplicons remains hybridised to the desired strand and is less likely to compete with the single-stranded amplicons for the ligation template or ligase enzyme. In an embodiment, the denaturation step may be performed. However, the accumulation of a high yield of single-stranded amplicons in the present method advantageously outcompetes the relatively low numbers of the complementary strand and thus the efficiency of the method is largely preserved.

In the present method, unlike a conventional asymmetric PCR (e.g. LATE- PCR), the asymmetric PCR step does not rely on use of a limiting concentration of one of the primers. Thus, the asymmetric PCR step may utilise substantially equal ratios of first and second primers, further increasing the efficiency and simplicity of the method. To allow this to happen, the first primer has a higher melting temperature (Tm) than the second primer, which binds to the complementary strand (i.e. to the strand complementary to that bound by the first primer, or alternatively, to the extension product of the first primer).

In one embodiment of the method, the annealing temperature of the first exponential PCR phase occurs at a temperature which is below the Tm of both primers, allowing annealing of both primers. After e.g. no more than 12 (or no more than 11 or 10) exponential PCR cycles, the annealing temperature is raised above the Tm of the second primer, but below the Tm of the first primer, to initiate the second linear amplification phase, wherein only the annealing of the first primer is allowed, thus generating single-stranded amplicons of only the desired strand.

Similarly, where the linear amplification is performed first, the annealing temperature in the first phase is above the Tm of the second primer and below the Tm of the first primer, so that only the first primer is able to anneal. After a selected number of linear cycles, the annealing temperature is lowered to below the Tm of both primers, allowing them both to anneal, and the exponential PCR reaction is performed, for no more than 12 cycles.

However, as discussed further below, the annealing temperature does not have to be below the Tm of a primer for that primer to anneal, as indeed is shown in Example 1 below. Thus, the annealing temperature may be a few degrees (e.g. 1- 7°C) above the Tm of a primer which is to be annealed. What is significant is the difference between the two annealing temperatures, to differentiate the annealing of the primers, and the two phases.

It will be understood that an exponential PCR reaction will generate doublestranded amplicons wherein the second strand comprises a complementary copy, in the opposite orientation (i.e. a reverse complement), to the sequence of the first strand. The second, linear, amplification reaction will generate copies of only one of these two strands, depending on which of the two strands the first higher Tm primer anneals to. Thus, it may be selected preferentially to amplify one or the other strand, e.g. the sense strand (or plus (+) strand) or the anti-sense strand (or minus (-) strand), depending on the primer design. Accordingly, as noted above, the term “target nucleic acid sequence” includes both orientations of the sequence, for example, both the sense and the anti-sense sequence. Thus, references to the target nucleic acid sequence may, depending on context, and as appropriate, include the complement of the target sequence, i.e. they may include sequences which are complementary to the target sequence.

In an embodiment, the first primer binds to the complement of the strand comprising the target nucleic acid sequence and the second primer binds to the strand comprising the target nucleic acid sequence. The extension product of the first primer thus contains a copy of the target nucleic acid sequence. Typically, but not necessarily, in the case of a genomic sequence, the target nucleic acid sequence will be the sequence as it appears in the sense strand, or the 5’ to 3’ strand of the target nucleic acid molecule. Thus, in an embodiment, the target nucleic acid sequence may be in the first strand (or, in other words, the 5’ to 3’ strand) of a double-stranded nucleic acid molecule, and the first primer may bind to the second strand (or, in other words, the 3’ to 5’ strand).

In one embodiment of the present method, unlike the Al PR method, the exponential PCR reaction is performed first, followed by the linear amplification reaction. In the method, irrespective of the order of the exponential and linear phases, the primers may advantageously be provided in similar or substantially the same concentrations in the amplification reaction mixture. In an embodiment, the PCR primers are provided in the same concentration range, or at the same concentration. In an embodiment, both primers are provided at a concentration at which they do not become depleted in the asymmetric PCR reactions. In other words, both primers remain in excess during the course of the asymmetric PCR step. In other words, neither of the primers are used in a limiting concentration. In still other words, both primers are used in non-limiting concentrations. Conveniently, the PCR primers may be used at concentrations typical for conventional, or exponential PCR reactions, e.g. at 100-1000 nM.

Advantageously, the asymmetric PCR step may be performed by thermal cycling to maximise efficiency. This may be done by procedures well known in the art. Thus, in an embodiment, the method may comprise a heating step to denature the target nucleic acid molecules and the temperature may then be reduced to allow the first and second primers to anneal (bind or hybridise) to their complementary binding sites in the target molecule, following which the annealed primers are subjected to extension by a PCR polymerase enzyme. This may involve a further temperature reduction and/or increase to optimize the conditions for primer extension, depending on the PCR polymerase enzyme used. The reaction is then heated again to denature the double-stranded molecules, and start the cycle again, that is to allow further primers to anneal, and be extended.

Where the order is reversed, and the linear amplification is performed first, following an initial denaturation, the temperature is reduced to a temperature which permits annealing only of the first primer, which is then extended, and the cycle is repeated, analogously to the method as described above.

The number of exponential cycles may be varied according to choice, and may depend on the nature of the target nucleic acid, sample, etc. as long as no more than 12 cycles are performed, e.g. 4, 5, 6, 7, 8, 9, 10, 11 or 12 cycles are performed, Thus, in embodiments, no more than 11, 10, 9, or 8 cycles may be performed, e.g. from any one of 4, 5 or 6 to any one of 8, 9, 10, 11 or 12. This is far fewer than is conventional in a PCR reaction. It has been observed that products accumulate in proportion to the repeated cycle numbers. This leads to an increased generation of double-stranded amplicons of the target sequence, and therefore fewer linear amplification cycles will need to be performed to generate a reasonable yield of single-stranded amplicons. Higher yields of double-stranded amplicons lead to an increase in the efficiency and sensitivity of the detection method, but a decrease in specificity due to accumulation of polymerase errors. If desired or appropriate, more or fewer exponential cycles can be performed, and the number of linear amplification cycles adjusted accordingly. It is within the capability of the skilled person in the art to decide how many exponential and linear cycles are required, taking into account the sample volume and impact of cycle number on efficiency of the method and accuracy of results. In an embodiment, at least 7 linear amplification cycles are performed, e.g. 7, 8, 9, 10, 11, or 12 cycles are performed. As discussed above, if desired or appropriate, more or fewer linear amplification cycles can be performed.

The present method may advantageously be automated. Thus, in an embodiment, the present method has been designed to allow a 2-stage RCA reaction (i.e. a sRCA) to be performed on single-stranded amplicons of the target nucleic acid sequence in a single reaction vessel, without any manual input and scalable for commercial use in robotic workflows.

At its most general, the method is for detecting a target nucleic acid sequence in a target nucleic acid molecule. The term “detecting” is used broadly herein to include any means of determining the presence of the target nucleic acid sequence in the target nucleic acid molecule. In the present method, the target nucleic acid is detected by detecting the presence or amount of the sRCA product, e.g. the second RCA product, which is generated, and can include detecting simply if it is present or not, or any form of measurement of the RCA product. It will be understood in this regard, that by detecting the second or further, or final RCP, since this is attached to the first or preceding RCP, the entire sRCA product is in effect detected. Thus, the RCA product of step (f) may be detected as the “signal” for the target nucleic acid sequence. Accordingly, detecting the second (or further) RCA product in step (g) includes determining, measuring, assessing or assaying the presence or absence or amount or location of the second (or further) RCA product in any way. The presence of a second (or further) RCA product (i.e. the confirmation of its presence or amount) is indicative or identificatory of the presence of the target nucleic acid sequence, as the successful generation of the RCA product is ultimately dependent on the presence of the target nucleic acid molecule, and more particularly of the target nucleic acid sequence therein. Quantitative and qualitative determinations, measurements or assessments are included, including semi-quantitative. Such determinations, measurements or assessments may be relative, for example, when two or more different target nucleic acid sequences, or target molecules, in a sample are being detected, or absolute. Accordingly, in an embodiment, the method may be for quantifying or determining the amount of target nucleic acid sequence which is present. The term "quantifying" when used in the context of quantifying a target nucleic acid sequence(s) in a sample can refer to absolute or to relative quantification. Absolute quantification may be accomplished by inclusion of known concentration(s) of one or more control nucleic acid molecules and/or referencing the detected level of the target nucleic acid sequence with known control nucleic acid molecules or sequences (e.g. through generation of a standard curve). Alternatively, relative quantification can be accomplished by comparison of detected levels or amounts between two or more different target nucleic acid molecules, or different target sequences, to provide a relative quantification of each of the two or more different nucleic acid molecules or sequences, i.e. , relative to each other. Thus, as noted above, ratios of target nucleic acid sequences present in a sample may be determined. Thus, copy numbers of target nucleic acid molecules, e.g. chromosomes, may be compared.

The target nucleic acid sequence is a sequence in any nucleic acid molecule that it is desired to detect or identify, or in other words the target of the assay. It may be DNA or RNA, or a modified variant thereof. Thus, the nucleic acid may be made up of ribonucleotides and/or deoxyribonucleotides as well as synthetic nucleotides that are capable of participating in Watson-Crick type or analogous base pair interactions. Thus, the nucleic acid may be or may comprise, e.g. bi-sulphite converted DNA, LNA, PNA or any other derivative containing a nonnucleotide backbone.

Typically, the target sequence will be an analyte it is desired to detect, for example, a nucleic acid present in a sample, e.g. in a cell or tissue sample or any biological sample etc. Thus, it may be a naturally occurring sequence, or a derivative or copy or amplicon thereof. However, this is not necessary, and the target sequence may instead be a reporter for an analyte of an assay. Reporter nucleic acids may be used or generated in the course of an assay for any analyte, for example, a protein or other biological molecule, or small molecule, in a sample. Thus, a reporter nucleic acid may be provided as a tag, or label, for a binding probe for an analyte, and may be detected in order to detect the analyte, for example in an immunoassay, e.g. as in an immunoPCR or immunoRCA reaction. A reporter nucleic acid may be generated in the course of an assay, for example, by a ligation reaction in a proximity ligation assay, or an extension reaction in a proximity extension assay, or by a cleavage reaction, or such like. Such a reporter target nucleic acid may therefore be a synthetic or artificial sequence.

In an embodiment, the target nucleic acid is a DNA molecule, natural or synthetic. The target nucleic acid molecule may be coding or non-coding DNA, for example genomic DNA or a sub-fraction thereof, or may be derived from genomic DNA, e.g. a copy or amplicon thereof, or it may be cDNA or a sub-fraction thereof, or an amplicon or copy thereof etc.

In another embodiment, the target nucleic acid molecule is a target RNA molecule. It may be an RNA molecule in a pool of RNA or other nucleic acid molecules, for example, genomic nucleic acids, whether human or from any source, from a transcriptome, or any other nucleic acid (e.g. organelle nucleic acids, i.e. mitochondrial or plastid nucleic acids), whether naturally occurring or synthetic. The target RNA molecule may thus be or may be derived from coding (i.e. pre-mRNA or mRNA) or non-coding RNA sequences (such as tRNA, rRNA, snoRNA, miRNA, siRNA, snRNA, exRNA, piRNA and long ncRNA). In one embodiment, the target nucleic acid molecule is a micro RNA (miRNA). In another embodiment, the target RNA molecule is 16S RNA, for example, wherein the 16S RNA is from and identificatory of a microorganism (e.g. a pathogenic microorganism) in a sample. Alternatively, the target RNA molecule may be genomic RNA, e.g. ssRNA or dsRNA of a virus having RNA as its genetic material. Notable such viruses include Ebola, HIV, SARS, SARS-CoV2, influenza, hepatitis C, West Nile fever, polio and measles. Accordingly, the target RNA molecule may be positive sense RNA, negative sense RNA, or double-stranded RNA from a viral genome, or positivesense RNA from a retroviral RNA genome.

Where the target molecule is an RNA molecule, the method may comprise a preliminary step of generating a cDNA copy of the target RNA molecule. The cDNA molecule is then amplified.

The asymmetric PCR reaction which generates amplicons of the target sequence is performed using a set of primers comprising a first primer having a first melting temperature (Tm) and a second primer having a second Tm which is at least 10°C lower than the first Tm. The term “amplicon” as used herein is defined as the amplification product of the asymmetric PCR reaction. References to amplicons as used herein usually refer to single-stranded amplicons generated during the linear amplification phase which are ligated into first RCA templates for use in the SuperRCA reaction, but may, depending on context, and as appropriate, also include the double-stranded amplicons generated during the exponential PCR phase.

A pair of PCR primers may be denoted as forward and reverse primers where the forward primer binds to the complementary strand of a target sequence which it is desired to amplify. Accordingly, in the present method, in an embodiment, the first primer (forward primer) has a sequence that is complementary to and therefore binds (anneals) to the second strand (minus strand), after which the first primer is extended to generate a copy of the desired first strand (plus strand). The second strand may also be referred to as the antisense, or template strand, and it is the complement of the first strand and vice versa. The second primer (reverse primer) has a sequence that is complementary to and therefore binds (anneals) to the first strand, after which it is extended to generate a copy of the second strand. The first strand may also be referred to as the desired, sense, or non-template strand. Alternatively put, the first primer anneals to and is extended on the second strand to generate an amplicon of the desired first strand, whilst the second primer anneals to and is extended on the first strand to generate a complement of the desired strand. The first primer also anneals to the extension product of the second primer and the complement of the second strand, whilst the second primer also anneals to the extension product of the first primer and the complement of the first strand. Accordingly, in such an embodiment, the first primer may be referred to as the minus strand primer and the second primer may be referred to as the plus strand primer. Analogously, in another embodiment, the first primer may be the reverse primer etc., (i.e. it may bind to the first stand (i.e. the plus strand/5’ to 3’ strand), and so on).

The melting temperature ™ of a primer is the temperature at which half of the primers are bound to the nucleic acid strand. In other words, it is the temperature at which an equilibrium is reached between bound and unbound primers. Primers may have different Tms depending on various factors of primer design. This may include notably the nucleotide composition and length of the primer. Thus, the primer may be designed with sequence mismatches, or it may comprise modified nucleotides such as locked DNA (LNA) bases, or it may be modified by the addition of minor groove binders.

The set of primers used in the method comprises a first primer having a higher Tm than the second primer, specifically at least 10°C higher than the second primer. Alternatively put, the first primer has a first Tm and the second primer has a second Tm that is at least 10°C lower than the first Tm. In an embodiment, the second Tm is at least 11 , 12, 13, 14, 15, 20, 25 or 30°C lower than the first Tm. The first primer may have a first Tm of 60-72°C, for example 65-70°C, e.g. 65, 66, 67, 68, 69 or 70°C. In an embodiment, the first primer may have a first Tm of 67°C. The second primer may have a second Tm of 50-62°C, e.g. 54-60°C, e.g. 54, 55, 56, 57, 58, 59, or 60°C. In an embodiment, the second primer may have a second Tm of 50 or 55°C.

The difference in primer Tms allows the use of different annealing temperatures to control the amplification of the target sequence. The annealing temperature of a PCR reaction is the temperature at which the primers are allowed to bind, or anneal, to the nucleic acid strand. Specifically, a first annealing temperature can be used, taking into account the second lower Tm, to allow both primers to anneal during the exponential PCR phase, which leads to extension of both primers on their respective strands, and generation of double-stranded amplicons. In other words, both strands of the target nucleic acid molecule are amplified. Alternatively put, the first strand and its complement, and the second strand and its complement, are amplified. A second annealing temperature can be used to allow only the primer with a higher first Tm (the first primer) to anneal to the desired strand, e.g. to the template (or anti-sense) strand, to generate singlestranded amplicons of the desired nucleic acid sequence. Thus, for example, during the linear amplification phase of the asymmetric PCR method, the first primer having a higher first Tm anneals to the template (second) strand to amplify the desired (first) strand. In other words, during the linear amplification phase, only the first strand is amplified.

The annealing temperature for a primer or primers may be the same as the Tm of the primer or of the primer with the lowest T m which is to be annealed. However, as will be understood in the art, it may be different. For example, the annealing temperature may be 4-10°C higher or lower than the primer Tm, for example 4-10°C higher, or 4-8°C higher, e.g. 4, 5, 6, 7 or 8°C higher. For example, the first annealing temperature may be 5-7°C higher than the second Tm of the second primer, e.g. 6°C higher, and the second annealing temperature may be 6- 8°C higher than the first Tm of the first primer, e.g. 7°C higher.

In order for the primers to anneal during each PCR cycle, any doublestranded nucleic acid molecules must first be denatured. Accordingly, where the target nucleic acid molecule occurs, or is provided, in double-stranded form at the beginning of each cycle, the temperature is raised to at least 90°C, for example 90- 100°C, or 94-98°C, e.g. to 98°C, although the appropriate denaturation temperature may vary depending on the Tm of the double-stranded nucleic acid molecules. The Tm of said molecules are affected by factors such as the size, complexity and GC content and the appropriate denaturation temperature can be determined by a person skilled in the art. Denaturation of the double-stranded nucleic acid molecules separates the strands into single-stranded nucleic acid molecules to which primers may anneal. The denaturation step may be performed for at least 10 seconds to 3 minutes, for example 10 seconds to 1 minute, e.g. 15 seconds to 30 seconds. In an embodiment, the initial denaturation step to separate the doublestranded target nucleic acid molecules is performed for 30 seconds, and the subsequent denaturation steps to separate the extension products from the original strands or to denature the amplicons is performed for 15 seconds.

After the denaturation step of a PCR cycle, the temperature may be sufficiently lowered as described above to allow annealing of the first and second primers for the exponential reaction, or for the first primer only for the linear amplification reaction. Once the primer(s) have annealed, the temperature may be maintained to allow the primer(s) to be extended by the PCR polymerase during the primer extension step of a PCR cycle. Alternatively, the extension temperature may be different to the annealing temperature, for example, higher than the annealing temperature. In an embodiment, the extension temperature is the same as the annealing temperature. In another embodiment, the extension temperature is different to the annealing temperature. The annealing and extension steps of the PCR cycle may be performed for a total of at least 60-180 seconds, for example 100-140 seconds, e.g. 120 seconds. In an embodiment, the total annealing and extension time in each cycle is 120 seconds.

Thus, the asymmetric PCR reaction of step (a) may comprise contacting the sample (or more particularly the target nucleic acid molecule) with the first and second PCR primers and PCR reagents (which will comprise at least a polymerase enzyme and dNTPs), and in one embodiment, the exponential reaction of step (a)(i) may comprise: denaturing the target nucleic acid molecule if it is double-stranded; incubating the PCR reaction mixture at the first annealing temperature to allow both primers to anneal; incubating the PCR reaction at the first extension temperature, which may be the same or different to the first annealing temperature, to allow the primers to be extended; and optionally repeating the denaturation, primer annealing and extension steps. Subsequently, the linear reaction of step (a)(ii) may comprise: denaturing the double-stranded amplicons from step (a)(i); incubating the PCR reaction mixture at the second annealing temperature to allow the first primer to anneal; incubating the PCR reaction at the second extension temperature, which may be the same or different to the second annealing temperature, to allow the primer to be extended; and optionally repeating the denaturation, primer annealing and extension steps.

In another embodiment, wherein the linear amplification is performed first, the linear reaction of step (a)(ii) may comprise: denaturing the target nucleic acid molecule if it is double-stranded; incubating the PCR reaction mixture at the second annealing temperature to allow only the first primer to anneal; incubating the PCR reaction at the second extension temperature, which may be the same or different to the second annealing temperature, to allow the first primer to be extended; and optionally repeating the denaturation, primer annealing and extension steps. Subsequently, the exponential reaction of step (a)(i) may comprise: subjecting the reaction mixture from step (a)(ii) to a denaturing step; incubating the PCR reaction mixture at the first annealing temperature to allow both the first and second primers to anneal; incubating the PCR reaction at the first extension temperature, which may be the same or different to the first annealing temperature, to allow the two primers to be extended; and optionally repeating the denaturation, primer annealing and extension steps.

The first primer may be defined as the “high” or “high temperature” primer and the second primer may be defined as the “low” or “low temperature” primer. The first annealing temperature, at which both primers anneal, may accordingly be defined as “low”, and the second annealing temperature, at which only the first (high) primer binds, may be accordingly be defined as “high”. Thus, the first annealing temperature is lower than the second annealing temperature.

Generally speaking, the second annealing temperature (the “high” annealing temperature) is generally at least about 8-12°C higher than the first annealing temperature (the “low” annealing temperature). In an embodiment, it is at least 10°C higher than the first annealing temperature. Ideally, the difference between the first (low) and second (high) annealing temperatures is about 10°C,

As discussed above, it is an advantage of the present method to reduce the number of exponential PCR cycles in order to reduce the accumulation of polymerase errors. Advantageously, to minimise polymerase errors, no more than 12, 11 , or 10 cycles are performed (i.e. 12, 11 or 10 or fewer). In an embodiment, no more than 5-8 cycles are performed, e.g. no more than 8, 7, 6, or 5 cycles are performed. In another embodiment, 6 cycles are performed.

In the present method, the number of linear amplification cycles performed is chosen taking into account the number of exponential cycles performed and the desired yield of single-stranded amplicons of the target sequence. Typically, no more than 40, 39, 35, 30, 25, or 20 cycles are performed. In an embodiment, at least 6-12 cycles are performed, e.g. at least 6, 7, 8, 9, 10, 11 or 12 cycles are performed. In another embodiment, 8 cycles are performed.

The PCR primers are designed to amplify the target sequence and hence to anneal within the target sequence, or at sites which flank the target sequence, according to principles well known in the art. This may depend on the nature of the target sequence, e.g. whether different target sequences are being detected, or different variants (e.g. mutants or alleles) of a gene are being detected. Generally speaking, the forward primer may anneal close to or at the 3’ end of the complementary target sequence. The reverse primer may anneal close to or at the 3’ end of the target sequence.

In the case of multiplex reactions where there is more than one target molecule or more than one target sequence, the amplification of each target may be performed separately, in parallel, i.e. individual amplification reactions may be performed, each using a single set of PCR primers. In this case, the resulting individual asymmetric PCR reaction mixtures, or more particularly, parts or aliquots thereof, may be pooled to provide the reaction mixture of step (a). Accordingly, in an embodiment, multiple separate asymmetric PCR reactions are performed, and amplicons therefrom are pooled prior to step (b).

Alternatively, the amplifications of different target sequences/molecules may be performed in multiplex in the same reaction mixture. Thus, the asymmetric PCR reaction may be performed using a plurality (or multiplicity) of primer sets (or in other words a pool of primers). It will be understood in this regard that where different variants of a target sequence are being investigated, the amplicons thereof may be generated using the same set of PCR primers (in other words, the amplicons may comprise one or more of a number of different variants, which may have been amplified using the same primer set; the primers may be designed to bind at sites which flank the variant position(s)). In the case of a multiplex amplification in the same reaction mixture, the number of sequences amplified may be 2 to 10,000 or more.

Thus, to detect target sequences in two or more different target molecules, a multiplicity of PCR primer sets (e.g. primer pairs) may be used, each specific for a different target sequence, that is, having target-binding regions which are complementary to primer binding sites in the target molecule which lie within or which flank the different target sequences. It will be understood in this respect that the flanking binding sites in different target molecules will be different for different target sequences, to allow for specific binding of the padlock probes. Alternatively or additionally, different and separate target sequences in the same target molecule may be detected, for example, different sequences in different genes on a chromosome, again using a multiplicity of different PCR primer sets, each specific for a different target sequence. However, as noted above, in a particular embodiment, the method is useful for detecting which of a number of possible different variant sequences is present in a given target molecule, for example whether a wild-type or mutant sequence is present, or which of a number of possible mutants, or different allelic variants, or polymorphisms etc. In such a protocol, a common primer set is used to generate the amplicons, and it is then determined which variant is present by using multiple different padlock probes, each specific for a different variant of the target sequence.

The term “multiple” or “multiplicity” as used herein means two or more, for example, 3, 4, 5, 6, 8, 9, 10, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, or 100 or more. Indeed, thousands or tens of thousands of primer sets (and subsequently ligation templates or padlock probes) may be used. The number that can be used is not restricted, and can be varied. This will depend on the purpose of the method, the nature of the sample, the target nucleic acid sequences to be detected, and the number of possible variants etc. Thus, to detect wild-type and mutant variants for example, the number of different padlocks will depend on the number of different mutants possible, and may be, for example, 2-6, 2-5, 2-4 or 2-3 different padlocks. It will be appreciated that different aspects may be combined, in order to increase the overall multiplex of the assay. For example, in a given sample, the method may be used to detect different variants of different target sequences.

Where amplicons have been generated by different primers, e.g. from different target molecules, or different target sequences amplified by different primers from the same molecule, they may have different end sequences. Different or selected end sequences may be introduced by the PCR primer. Thus, the PCR primers used to amplify different target sequences may be provided with 5’ end sequences which are non-complementary to the target sequence/molecule, and which can be used to provide a common, or universal, end sequence to all amplicons. In this way, amplicons can be obtained which all have the same end sequence. This allows the use of the same ligation template to circularise all amplicons in the reaction mixture (or generated from the same sample, for example). In certain cases, it may be possible if desired for all amplicons from a group or set to be provided with the same (i.e. common) end sequence, and for different groups or sets to have different end sequences. For example, all amplicons from a given sample may be provided with the same end sequence, but amplicons from different samples, which are pooled together, may be provided with a different end sequence, allowing the same target sequences from different samples to be distinguished. Accordingly, in an embodiment, the end sequences may be defined as being or comprising binding sites for a ligation template.

Accordingly, in an embodiment, the PCR primers (and consequently the ligation templates) may be viewed as common primers, or as generic to a group of target sequences, or a group of variant target sequences. The PCR primers may thus have target-binding regions which are complementary to binding sites (e.g. flanking regions) in the target molecule which are common to different target sequences (that is, common binding sites which flank different target sequences, or different variants of a target sequence). Alternatively put, the amplification primers may have target binding regions which are common, or generic, for different target sequences, or different variants, or common for a group of target sequences or group of variants. That is, the amplification primers may have target binding regions which are capable of hybridising to complementary binding sites in the target molecule which are common to different target sequences, or to different variants of target sequences. The padlock probe is, however, specific to the target sequence, or to different variants of a target sequence, as described further below. In a different embodiment, the PCR primers may be specific for a particular target nucleic acid sequence. Thus, for example, primer sets may be used, each specific for a different target sequence. This may have utility for diagnostic testing, e.g. NIPT, where different sequences may be detected, for example, to detect different chromosomes (and determine their copy number, for example, to detect a trisomy) or in any situation where it is desired to detect one or more specific target sequences.

In an embodiment, the PCR primers may be designed in a way which assists the subsequent ligation step (b) of the method. Thus, the primers may include one or more uracil bases, to allow the primer to be degraded. In particular, the amplicons may be subjected to a digestion step with uracil-DNA-glycosylase (UDG) to digest the primer part of the amplicons (extension products). This is particularly so for the first primer (“high Tm” primer). This can assist in the ligation efficiency; the ligation template may be designed to hybridise to the 5’ end sequence of the extended part of the amplicon sequence. The PCR primers will be present in excess in the amplification reaction mixture following the asymmetric PCR step, in many cases very considerable excess, and accordingly, such a modification assists in reducing or minimising competition by unextended primers for the ligation template. A similar effect may be achieved by including phosphorothioate bonds in the primer, in combination with a 5’ single strand exonuclease.

To perform the asymmetric PCR step, the target nucleic acid is contacted with the PCR primers and other reagents necessary to perform the amplification. A sample containing the target nucleic acid molecule, or a part or fraction or aliquot thereof, may be contacted with the reagents. For example, a cell or tissue sample, or any sample containing cells, or a cell extract, may be contacted with the reagents directly. The sample may be prior-treated or processed prior to the contact. In other embodiments, the target nucleic acid may first be separated, or removed from the sample. Procedures for extracting or purifying nucleic acids, e.g. DNA, from various types of sample are well known in the art. For example, the nucleic acids may be isolated from cells, or from cell-free samples, such as plasma. It may in some cases also be desirable to fragment nucleic acid molecules. Procedures for this are known in the art, and include specific digestion, e.g. using nucleases, including restriction enzymes, for example, or by non-specific means, such as shearing. As noted above, for ease of operation and automation, it is generally desirable to perform the method in a single vessel format for multiple assays (i.e. for a multiplex assay), particularly for the sRCA steps. To enable this, where separate parallel asymmetric PCR amplifications have been carried out, after the asymmetric PCR reaction of (a), a part or portion of the resulting reaction mixture(s) of asymmetric PCR reaction(s) which are used to generate the amplicons may be introduced, or added, to a reaction vessel. For example, a 1-5 pL volume of one or more amplification reaction mixture may added to the vessel. However, in another embodiment, the amplifications may be performed in the vessel.

Thus, the present method may advantageously be automated. Accordingly, in an embodiment, the present method has been designed to allow a 2-stage (or further stage) RCA reaction (i.e. a sRCA) to be performed on single-stranded amplicons of the target nucleic acid sequence in a single reaction vessel, without any manual input and scalable for commercial use in robotic workflows.

By “single reaction vessel” is meant a single physical, or structural, compartment, that is, a physical structure which holds a reaction mixture volume. The term “compartment” as used in this context does not include droplets or emulsions, etc. as such, but this does not preclude that droplets or emulsions are used in any of the steps of the “single vessel embodiment of the method (i.e. within a reaction vessel in steps (i) to (x)). It will further be understood that the second (or further) RCA product may be removed from the reaction vessel for the purpose of detection in step (x), for example, to transfer it to a detection instrument, such as a flow cytometer, or a microscope slide etc. The vessel may thus be any vessel suitable for performing the steps of the method. Typically, this will be a reaction tube, but it may be a vessel of any format or configuration, including, for example, a reaction well or part of a microfluidic circuit.

As noted above, a limited number of cycles of exponential PCR are performed, and thus, the resulting mixture contains an excess of PCR primers relative to the amplicons. While a low cycle number has the advantage of improving specificity of the assay, a consequence is that an excess of PCR primers are present in the reaction mixture. Even such a limited number of cycles may result in a very large primer excess, for example, a 1000-fold excess or more. Thus, the primer excess can be at least 1000, 2000, 5000, 7000, or 10,000 fold. In some cases, it can be much higher, in the order of tens or hundreds of thousand-fold excess. By way of example, in a representative amplification reaction the resulting reaction mixture may contain amplicons (e.g. full length PCR products) in the order of 100 fM versus primers in the order of 500 nM. Such a primer excess may interfere in downstream steps of the method.

Whilst in some embodiments, the amplicons may be separated from the PCR amplification reaction mixture, it may be convenient to address this by performing a clean-up step to remove excess primers. This is particularly the case in the single-vessel format of the method.

Thus, in an embodiment, it may be convenient to include a clean-up step to remove, or otherwise reduce, at least the PCR primers, and optionally also the PCR polymerase. This may be done by enzymatic digestion and/or dilution.

In one embodiment, the excess primers are reduced by enzymatic digestion. In another embodiment, a volume of diluent is added to the reaction mixture to dilute the reaction mixture.

The PCR polymerase, if present in the reaction mixture in subsequent steps, may inhibit ligation of the amplicon by extending the hybridised 3’ end of the amplicon on the ligation template (where this is used). Accordingly, one way of reducing such interference is to perform a non-templated ligation step of the singlestranded amplicons. However, in another embodiment, the polymerase may be inactivated by enzymatic digestion. In another embodiment, the hybridisation of the amplicon to the ligation template may be controlled by temperature such that the 5’ end is allowed to hybridise first, followed by the 3’ end. These solutions may also be used to prevent extension of the hybridised 3’ end of the padlock probe during the second ligation step (step (e)).

In the case of enzymatic digestion of primers, this may be achieved by contacting the reaction mixture from (a) with a digestion enzyme capable of degrading the primers. Conveniently, this may be an enzyme with exonuclease activity, particularly an enzyme with exonuclease activity as its primary catalytic activity (as opposed to a polymerase enzyme with exonuclease activity, or part thereof having exonuclease activity). Thus, an exonuclease enzyme may be used (which term in the context herein does not include a polymerase or part thereof).

The term “contacting” is used broadly herein to include bringing the reagents in question into contact. Thus, one may be added to the other and vice versa, or they may each be introduced to each other etc. However, in the context of the single vessel procedure which is performed in steps (i) to (ix), contacting conveniently involves adding the reagent in question to the reaction mixture which is present in the single vessel, or in other words, adding the reagent or reagents or reagent mix in question to the vessel. Thus, in the case of step (ii), reducing the primer excess may involve adding a reagent or reagent mix containing a digestion enzyme to the reaction mixture of (i) and in particular, to the vessel which contains the reaction mixture of (i).

The exonuclease enzyme may be any suitable exonuclease enzyme. Exonucleases for performing clean-up of undesired nucleic acids are known in the art, and include, for example, exonuclease I, III or lambda, or mixtures thereof. The exonuclease which is selected should have strict single-stranded specific activity, for example Exol or RecJf.

Alternative digestion systems are possible. For example, if amplification primers contain uracil, then as noted above they may conveniently be digested using uracil DNA glycosylase (UDG), another well-known digestion system.

In another embodiment, the amplicon may be generated using nucleotides which are resistant to exonuclease digestion (e.g. comprising modified bases that can inhibit exonuclease activity), and primers which are composed of nucleotides which are sensitive to exonuclease digestion (e.g. normal, or conventional nucleotides). In this way, the primer part of the amplicons may be digested, whilst sparing the newly synthesised sequences in the amplicons.

This clean-up step may further include enzymatic digestion of the amplification polymerase. However, this is not strictly necessary, as interference from the polymerase may be addressed by other means. Nonetheless, in one embodiment, a clean-up reagent mixture may include, in addition to the enzyme with exonuclease activity, an enzyme with proteinase activity. This may be any suitable proteinase including Serine proteases, Cysteine proteases, Threonine proteases, Aspartic Proteases, Glutamic proteases, Metal loproteases and asparagine peptide lyases, but typically this will be proteinase K. Advantageously, a thermolabile, or non-thermophilic, proteinase is used, since this may subsequently be inactivated by heat. A heating step may thus be included, to inactivate the proteinase after it has been used, as discussed further below. The use of proteinase in this step is advantageous as the proteinase will also degrade the exonuclease. In this regard, a balance will occur, between enzymatic action of the exonuclease and its digestion by proteinase. However, it has been found that a sufficient degradation of the amplification primers may occur before the exonuclease is inactivated, and it is well within the routine skill of the person skilled in the art to find and optimise reaction conditions.

The clean-up reagents, e.g. enzymes, may be provided in any suitable buffer or diluent. Advantageously, this may be a buffer suitable for the subsequent ligation and RCA reactions. The reaction mixture (e.g. the reaction vessel for the single vessel format) may be incubated in conditions suitable for the clean-up reactions to take place. By way of representative example, this may be incubation at 37°C for 10 minutes, but this can of course be varied, or optimised, depending on the precise nature of the enzymes used etc., and could include leaving the reaction mixture at room temperature for a period of time.

Where the method is for detecting variants, e.g. mutations, then generally speaking, it is optimal for the clean-up step to be performed by enzymatic digestion of the primers. Preferably, this also includes proteinase digestion of the amplification polymerase.

However, as an alternative to enzymatic digestion, dilution of the amplification reaction mixture may be employed, to dilute the primers. This may be sufficient to reduce their interference to acceptable levels. Generally speaking, this may require a significant dilution factor, which may be determined by routine experimentation. For example, a dilution factor of at least X200 may be employed. This may be achieved by contacting the reaction mixture with a diluent, for example, adding a diluent to the reaction mixture, or by adding a part or aliquot of the amplification reaction mixture(s) to a diluent. The diluent may conveniently be a suitable buffer, for example, a buffer suitable for subsequent steps of the method. The diluent may, in another embodiment, be a reagent mix for the ligation step. In the case of a dilution step, a smaller aliquot of the individual amplification reaction mixtures, or of a pool thereof, may be contacted with the diluent, for example, a 0.1- 0.2pL aliquot.

Thus, in an embodiment, steps (i) and (ii) of the single vessel protocol may be combined, and may be performed by adding a part or aliquot of one or more amplification reaction mixtures to a volume of clean-up reagent mix present in the vessel. This clean-up reagent mix may be a simple diluent, e.g. buffer.

After the asymmetric PCR step, and optionally after any clean up step (e.g. step (ii) of the single-vessel protocol, there may be an optional step of denaturing double-stranded amplicons into single strands. This is not necessary as the amplicons are prepared in substantially single-stranded form, but including such a step may be advantageous in some embodiments. If included, the method may comprise a heating step to denature the amplicons, for example, heating to 95°C for 1-2 minutes, or similar conditions. Such a heating step may also inactivate any proteinase included in the clean-up reaction. However, generally speaking, it has been found that it is beneficial to include a separate, more gentle, heating step, e.g. incubation at 50-60°C for 5-20 minutes, for example, 55°C for 10 minutes to inactivate the proteinase if it is used. The method may then proceed to the ligation step (b), or to a denaturation step, and then the ligation step.

In this regard, if a denaturation step is included, in step (b), it is convenient to add the ligation mix, heat to denature, and then reduce the temperature to perform the ligation. Thus, the denaturation and the contacting of step (b) may be performed simultaneously or in either order. Alternatively, if there is no denaturation step at this point, the ligation mixture may be added, and then the reaction mixture may be heated to inactivate the proteinase if this is used.

The performance of the actual ligation reaction may be controlled (i.e. the ligation reaction may be initiated) by controlling the temperature of the reaction. This may include the annealing of the ligation template (where it is used) and/or the activity of the ligase enzyme. In other words, the ligation reaction occurs after the denaturation, where that is included, but the reagents for the ligation, the ligase and, in some embodiments, also the ligation template, may be added to the reaction mixture prior to or simultaneously with the denaturation of step (b). As noted above, a ligation template is typically included, as this tends to give better results, but this is not necessary, and a non-templated ligation may alternatively be performed. This may depend on the particular method being used, and the amplicons and ligation conditions being employed etc. In one embodiment, a ligation template is included in the ligation reaction mix.

After denaturation, the temperature is reduced to allow the amplicon to hybridise (or in other words anneal) to the ligation template, where this is used, and for the ligase enzyme to catalyse the ligation reaction. Such a temperature reduction may not be necessary if there has not been a denaturation step, and the method may instead involve increasing the temperature for the ligation step (e.g. for annealing of the ligation template and/or for ligase enzyme activity).

The ligation templates comprise sequences, or regions, of complementarity to the end sequences of the single-stranded amplicons which have been preferentially amplified. The ends of the amplicon strand hybridise to the ligation template, thereby bringing the ends of the hybridised amplicon into juxtaposition for ligation (i.e. adjacent to one another so that they can be ligated). As noted above, a ligation template can be designed to be complementary to a particular individual amplicon (in other words an amplicon comprising a particular target sequence may have a cognate ligation template, which is designed to hybridise to it, but not to other amplicons). In other embodiments, the ligation template may hybridise to a group or sub-set of amplicons, or to all amplicons in the reaction mixture, depending on the end sequences provided to the amplicons.

In this regard, it is generally the case that all amplicons (present in a reaction mixture) are ligated. Thus, in the case of a reaction mixture containing multiple different amplicons (representing multiple different target sequences), all the different types of amplicon which may be present, or which are to be assayed, are ligated into circles. Thus, the step of circularising the amplicons by ligation can be viewed as a step of sample preparation, or of preparing a library of molecules to be amplified and probed by the padlock probes.

The ligation is performed by a ligase enzyme. This may be any suitable ligase as known in the art. Representative ligases of interest include, but are not limited to, temperature sensitive ligases such as bacteriophage T4 DNA ligase, bacteriophage T7 ligase, E. coli ligase, and thermostable ligases such as Taq ligase, Tth ligase, Ampligase®, Pfu ligase and 9°N™ DNA Ligase. Advantageously, a thermostable or thermophilic ligase is used, since this may be retained in active form in the reaction mixture for the subsequent ligation of padlock probes in step (e) (step (vii) of the single vessel method). Suitable conditions for ligation are known in the art, and any reagents that are necessary and/or desirable may be combined with the reaction mixture and maintained under conditions sufficient for ligation. It will be evident that the ligation conditions may depend on the ligase enzyme used in the methods of the invention. Thus, for example, Ampligase may be used, which allows the ligation reaction to be performed at e.g. 50-60°C (for example, at 55- 60°C, e.g. 58°C).

For performing the ligation without a ligation template, CircLigase™ ssDNA Ligase may be used. This is a thermostable ATP-dependent ligase that catalyses intramolecular ligation (i.e. circularisation) of ssDNA templates having a 5'- phosphate and a 3'-hydroxyl group. In an embodiment, it may be desirable to design the ligation reaction in a manner to achieve an asymmetric ligation structure, such that there is a stronger hybridisation between an amplicon (extended primer product) and the ligation template, than between an unextended primer and the ligation template. This may be achieved by primer design, as discussed above.

As also noted above, in certain embodiments, it may be desirable to perform the ligation reaction under conditions in which the ability of the amplification polymerase to extend the 3’ end of the amplicon strand, which is hybridised to the ligation template, is inhibited. It will be understood that such conditions will be achieved if the amplification polymerase has been digested by a proteinase in a preceding clean-up step. However, this may also be achieved by other means, most notably by controlling the hybridisation of the amplicon to the ligation template such that the 5’ end of the amplicon is hybridised first. This allows the ligation reaction to proceed, before any significant extension of the hybridised 3’ end can take place. Thus, a 2-step hybridisation may be performed, to anneal the 5’ end of the amplicon to the ligation template at a first temperature, and then reducing the temperature to allow the 3’ end of the amplicon to anneal to the ligation template.

For example, a first annealing temperature may be selected to facilitate or optimise the annealing of the 5’ end of the amplicon. For example, an annealing temperature of 50-75°C may be used, e.g. 53 to 60°C, or more particularly 55 to 60°C.

The annealing temperature may be reduced for annealing of the 3’ end. Again, the appropriate conditions can be selected according to what is known in the art, and the particular reagents, e.g. enzymes used. For example, after the first annealing step, the temperature may be reduced to 28-40°C, e.g. 28-35, 30-35, 28- 33, 30-33, 28-32, or 30-32°C etc., for the second annealing step.

Suitable ligase enzymes may be selected to allow such a 2-step annealing. In this regard, thermophilic ATP-dependent ligases are available which are active in the temperature range 37-95°C. Accordingly, this allows a second/annealing temperature of about 37-45°C to be used, which is 10°C lower than the first annealing temperature.

The skilled person will be aware how to design the ligation template and/or the amplicon ends to be able to achieve the differential annealing of the 5’ and 3’ amplicon ends, e.g. to achieve a different Tm for the annealing of the respective ends. Thus, by appropriate design and selecting appropriate conditions, including temperature, a balance may be obtained, which allows stable hybridisation of the 5’ end, followed by hybridisation of the 3’ end. For example, the sequence at the 3’ end of the amplicon which hybridises to the ligation template, may be shorter than the 5’ end sequence. By way of representative example, the 3’ end sequence may be at least 6 nucleotides long, e.g. at least 7, or 8 nucleotides long, e.g. 6-20. 6-18, 6-15, or 6-12 nucleotides long. The 5’ end sequence may be at least 12 nucleotides long, e.g. at least 13, 14, or 15 nucleotides long, for example 12-30, 12-25, 12-20, 12-18, or 12-15 nucleotides long.

Once the amplicons have been circularised by ligation, they are subjected to RCA in step (c). RCA reactions are well known in the art, and hence the conditions for this step may be designed or selected according to protocols and principles known and described in the literature. A strand-displacing polymerase enzyme such as Phi29 or derivatives thereof is used. Conveniently, the ligation template acts as the primer for this first RCA reaction (i.e. as the “RCA primer”). However, this is not essential, and a separate RCA primer may be used. Further, if a ligation template is not used, then a RCA primer will need to be provided.

To ensure that no unwanted polymerase-catalysed extension reactions occur, reagents required for the RCA may conveniently be added to the reaction mixture after the ligation step. In the single-vessel method, they will be added after the ligation step. Thus, a RCA mix containing at least the RCA polymerase is added. It is possible for other RCA reagents, e.g. dNTPs, to be added before this, but generally speaking it is convenient to add a RCA reagent mix comprising the RCA polymerase and dNTPs. The RCA reagent mix will generally include an appropriate buffer and other components to aid the RCA reaction, as is known in the art. By way of representative example, using Phi29 polymerase, the RCA reaction may proceed at 37°C, for example, for 20-40 minutes.

After the first RCA reaction, there may be a step to inactivate the RCA polymerase enzyme used, e.g. by heating, prior to the subsequent steps (e.g. prior to addition of the padlock probe, or prior to ligation thereof). Such a heat inactivation step may, for example, comprise heating at 55-65°C for a period of time, e.g. for 5-15 minutes, for example, 10 minutes. Such a step is included in the single-vessel protocol set out above.

Thus, after this heat inactivation step, the temperature may, if desired, be reduced for the subsequent steps of contacting the first RCP with the padlock probe, and allowing it to hybridise.

The RCA reaction produces a concatemeric first RCA product (RCP) comprising multiple repeat copies of the complement of the ligated amplicon. It therefore comprises multiple repeat complementary copies of the target sequence which was present in the amplicon. In this way, the target sequence is amplified. The target sequence provides a binding site for the padlock probe. The first RCP thus provides multiple binding sites for the padlock probe. Thus, more particularly, multiple copies of a given padlock probe may hybridise to the first RCP. It will however be understood that not every available binding site in the first RCP needs be occupied or bound by a padlock probe - it suffices that a number or multiplicity of binding sites in the first RCP are bound by the padlock probe. More particularly, a target sequence in at least one monomer of the concatemeric first RCP is bound by a padlock probe, but more particularly, in at least 2, 4, 6, 8, 10, 12, 15, 20, 25, 30, 40, 50, 80, 100, 150, 200, 300, 400, 500, 700, 1000 or more monomers.

The method relies upon multiple probes being able to hybridise to the first RCA product. Accordingly, it will be understood that the first RCA product needs to be available for probe hybridisation. This requirement is a feature of all RCA-based detection methods, where an RCA product is detected by hybridising a probe, e.g. a detection probe, to the product, and is well understood in the art. Thus, it may be advantageous for the first RCA product to have low secondary structure. However, this feature may be compensated for by performing the method in conditions which favour hybridisation, according to principles well known in the art.

The padlock probe is specific for the target sequence and thus is used to identify or detect the target sequence, or a variant thereof. The padlock probe thus comprises target-specific binding regions which are specific for a particular target sequence or for a particular variant. The target binding regions are thus designed to be complementary to a specific sequence in the target sequence, or to distinguish or discriminate between different target sequences or variants. It will be understood in this regard that the padlock probe binds to the complement of the target sequence as it appeared in the amplicon which is subjected to the RCA. The term “target-specific” is accordingly used broadly herein to refer to the target sequence as it appears in the target molecule and its complement, and whichever of these is actually bound by the padlock probe will depend on the precise nature of the method, and which strand is amplified etc. The design of variant-specific, or allelespecific, probes is known in the art, and thus the binding regions of the padlock probe can readily be designed to detect or identify a desired target sequence or variant. In the context of variant sequence detection, the padlock probe accordingly allows genotyping of the target sequence in the first RCP and may be referred to as a genotyping padlock probe.

As discussed above, in the case of detecting a variant target sequence, two or more padlock probes may be used, each specific for a different variant sequence, and it may be detected which of these produces a second RCP in order to detect, or identify, which variant is present.

A padlock probe may alternatively be defined as a circularisable probe. The use of padlock or circularisable probes is well known in the art, including in the context of RCA reactions. A circularisable probe comprises one or more linear oligonucleotides which may be ligated together to form a circle. Padlock probes are well known and widely used and are well-reported and described in the literature. Thus, the principles of padlock probing are well understood and the design and use of padlock probes is known and described in the art. A padlock probe is typically a linear circularisable oligonucleotide which hybridises to its target nucleic acid sequence or molecule in a manner which brings the 5’ and 3’ ligatable ends of the probe into juxtaposition for ligation together, either directly or indirectly, with a gap in between. By ligating the hybridised 5' and 3' ends of the probe, the probe is circularised. It is understood that for circularisation (ligation) to occur, the ligatable 5’ end of the padlock probe has a free 5' phosphate group.

To allow the juxtaposition of the ends of the padlock probe for ligation, the padlock probe is designed to have the target-binding sites at or near its 5' and 3' ends. That is, the regions of complementarity which allow binding of the padlock probe to its target lie at or near the ends of the padlock probe.

To allow ligation, the 3’ and 5’ ends which are to be ligated (the “ligatable” 3’ and 5’ ends) are hybridised to the target sequence in the first RCP, which acts as the ligation template. The ligatable ends of a padlock probe may be brought into juxtaposition for ligation in various ways, depending on the probe design. Where the target-binding sites are located at the ends of the padlock probe, the binding of the padlock probe may bring the ends into said juxtaposition. Where the complementary binding sites in the target molecule or sequence lie directly adjacent (or contiguous) to one another, the ends of the padlock probe will hybridise directly adjacent to each other (i.e. with no gap) and may be ligated to each other directly. Thus, in this case, the ligatable ends of the probe are provided by the actual ends of the probe. However, in an alternative configuration, the padlock probe is a gap-fill padlock probe, and hence the binding sites at the ends of the padlock probe do not hybridise to adjacent binding sites, but rather to non-adjacent (non-contiguous) binding sites in the target sequence. In such an arrangement, the 5’ ligatable end of the probe is provided by the actual 5’ end of the probe. However, the ligatable 3’ end of the probe is generated by extension of the hybridised 3’ end of the probe, using the target sequence as an extension template to fill the gap between the hybridised ends of the probe. The extension reaction brings the extended 3’ end of the probe into juxtaposition for ligation. In this case, the ligatable 3’ end of the probe is thus the extended 3’ end of the probe.

In other embodiments, the ligatable 3’ and/or 5’ ends may be created, or generated, by cleavage. Thus, where the target binding sites do not lie at the ends of the padlock probe, but rather are located internally of the ends, near (rather than at) the ends of the probe, the probe will hybridise to the target in a manner in which there are unhybridised nucleotides located at the probe ends. In other words, after probe hybridisation there is an overhang, or flap, or an unhybridised additional sequence at one or both ends of the probe. This will prevent ligation of the hybridised probe, or indeed extension, if the unhybridised sequence is at the 3’ end. These unhybridised regions or nucleotides may be removed by cleavage, particularly by enzymatic cleavage, when the probe is hybridised to its target (i.e. by cleavage of the hybridised probe).

Hybridisation of padlock probes with an internal 5’ target binding site will result in a structure with a so-called 5’ flap. Padlock probes designed in this manner are known in the art, as are procedures and enzymes for cleaving them. Any enzyme capable of performing a reaction which removes a 5’ flap may be used in this step, i.e. any enzyme capable of cleaving, degrading or digesting a 5’ singlestranded sequence which is not hybridised to a target nucleic acid molecule, but typically this will be an enzyme with 5’ nuclease and/or structure-specific cleavage activity.

A structure-specific cleavage enzyme is an enzyme capable of recognising the junction between a single-stranded 5’ overhang and a DNA duplex, and cleaving the single-stranded overhang. Such enzymes are known in the art and include flap endonucleases (FENS), which are a class of enzymes having endonucleolytic activity and being capable of catalysing the hydrolytic cleavage of the phosphodiester bond at the junction of single- and double-stranded DNA. For example, the enzyme may be a native or recombinant archaeal FEN1 enzyme from P. furiosus (Pfu), A. fulgidus (Afu), M. jannaschii Mja') or M. thermoautotrophicum Mth).

Enzymes having 5’ nuclease activity include enzymes with 5’ exonuclease and/or 5’ endonuclease activity, and again, such enzymes are known in the art, e.g. Taq DNA polymerase and the 5’ nuclease domain thereof, or Exonuclease VIII. Other examples are RecJf and T5 exonuclease.

For cleavage of an unhybridised 3’ end (or 3’ flap), an enzyme with 3’ nuclease activity may be used. This may be 3’ exonuclease or 3’ endonuclease activity. This may be provided by a polymerase with 3’ exonuclease activity, or the 3’ exonuclease domain thereof, or by a separate exonuclease enzyme, e.g. exonuclease I, or by an endonuclease. By way of representative example, the enzyme may be T4 DNA polymerase, T7 DNA polymerase, DNA polymerase I, Klenow fragment of DNA polymerase I, Pyrococcus furiosus (Pfu) DNA polymerase and/or Pyrococcus woesei (Pwo) DNA polymerase.

In particular, a polymerase with 3’ exonuclease activity may be used for the step of extending the hybridised 3’ end of a gap-fill padlock probe - such an enzyme will remove the unhybridised 3’ nucleotides to leave a hybridised 3’ end before the extension reaction takes place. A polymerase with 3’ exonuclease activity but without strand-displacing activity is desirable. In the case of a cycling protocol, a thermophilic polymerase should be used. These include: Q5/Q5LI DNA polymerase, Phusion/Phusion II DNA polymerase, Taq DNA polymerase, Stoffel DNA polymerase, Pwo DNA polymerase, Kappa DNA polymerase, and SuperFi DNA polymerase.

The positioning of the target binding site away from the 3’ or 5’ end of the padlock probe will determine the length of the 3’ of 5’ flap. Generally speaking, it is preferred for the 3’ target binding site to be reasonably close to the 3’ end, for example within 7 or 6, or fewer nucleotides of the 3 end, e.g. 5, 4, 3, 2 or 1 nucleotides of the end. For the 5’ target binding site, a longer distance may be tolerated. In an embodiment, “near” to a 5’ or 3’ end of the probe means within 12 nucleotides or less of the end, e.g. within 10, 9, 8, 7 or 6 nucleotides of the end. In the case of the 3 ‘end, this may, for example, be within 8, 7, 6, 5, 4, 3, 2 or 1 nucleotides of the end or less, e.g. within 6 nucleotides or less.

In another embodiment, the 3’ end of the first padlock probe may comprise a hairpin structure which comprises the 3’ target binding region. In other words, the 3’ target binding region may be at least partially comprised within a hairpin structure at the 3’ end of the padlock probe. When the probe hybridises to the target molecule, strand displacement will cause the hairpin to open, and for the 3’ end of the probe to hybridise to the target molecule.

Padlock probes may be provided in 2 or more parts that are ligated together. This may involve the provision of an additional ligation template, for example, in the case of a 2-part probe, where each part comprises only one target-binding region, and the other end of each part hybridises to a common ligation template. In another embodiment, a 2-part padlock may take the form of a “connector” oligonucleotide with two target-binding regions at or near the 5’ and 3’ ends respectively, which hybridise to the target with a gap in between them, and a gap oligonucleotide which hybridises in the gap between the ends. The gap oligonucleotide may partially or fully fill the gap.

In a typical embodiment, however, the padlock is provided as a single circularisable oligonucleotide, whether as a gap-fill padlock or not.

In particular embodiments, the padlock probe does not have secondary structure, and more particularly does not comprise intramolecular double-stranded regions or stem-loop structures. However, dumbbell probes, which do have secondary structure, are a particular sub-type of padlock probe. The dumbbell probe comprises two stem-loop structures, joined stem to stem, wherein one of the “loops” is not closed, but is open with free 5’ and 3’ ends available for ligation to each other. This “open loop” functions as the target-binding domain of the probe. The closed loop functions simply as a spacer to join the end of the duplex (stem). In other words, it can be seen as a padlock probe with a region of duplex formed between complementary sequences (regions) of the padlock. The region of duplex functions as a signalling domain to which an intercalating agent can bind. Thus, the “open loop” of a dumbbell probe may comprise the target-binding regions of complementarity.

In another embodiment, as noted above, the padlock probe may comprise a hairpin structure comprising the 3’ end of the probe.

Whilst padlock probes may be designed with target-specific binding regions which are specific for a particular target sequence, it will be understood that nonspecific hybridisation may occur, particularly in the case of variant target sequences which are similar in sequence (e.g. where the variants are single nucleotide variants, such as SNPs, and such like). However, the high specificity of padlock probes arises from the failure of such non-specifically hybridised padlock probes to be ligated (and hence any such non-ligated padlock probes would not be amplified by the subsequent RCA step). Thus, in the second ligation step of the method herein, the padlock probes which have hybridised to their target sequence complements are ligated (that is, the padlock probes which have correctly hybridised to the complement of the target sequence they are intended to detect, or in other words, their corresponding or cognate, or respective target sequence).

The second ligation (of the padlock probe) may take place under similar conditions to the first. Generally speaking, the padlock probes may be added to the reaction mixture in a ligation mix, that is, in a reagent mix comprising reagents for ligation. This may be similar or the same as the ligation mix/reagents used in the preceding ligation step. In an embodiment, this may involve adding a ligase enzyme. However, if a thermostable ligase is used for the first ligation, then a further addition of ligase may not be necessary. Clearly, a separate ligation template is not necessary in this step, since ligation is templated by the first RCP to which the padlock probe binds. The second ligation is thus a templated ligation.

As for the preceding amplicon ligation in step (b), the ligation in step (e) may in certain embodiments be performed under conditions in which the ability of polymerase to extend the hybridised 3’ end of the padlock probe is inhibited. This includes amplification polymerase from the original reaction mixture and the RCA polymerase. The RCA polymerase may be inactivated in a preceding heating step. In particular, this may ensure that any hybridised but unligated padlock probes do not undergo any unwanted extension of their hybridised 3’ ends. It will be understood that this applies to the intended ligatable 3’ end of the padlock probe. Thus, if the ligatable 3’ end is created by a gap-fill extension reaction, then the conditions for the ligation reaction are such that the ligatable 3’ end is not further extended. The strategies for ensuring such conditions are as above. Accordingly, this may involve enzymatic degradation of any polymerase added for the gap-fill extension step after the gap-fill extension has taken place. Alternatively, the extension polymerase may be inactivated by heating. A gap-fill padlock may be useful in circumstances where it is desired to incorporate a desired sequence into the second RCA template, for example, for a further amplification. In an embodiment, this may be used for NGS colony generation, for example.

In the case of a padlock probe which does not require gap-fill extension, any amplification polymerase may have been inactivated by enzymatic digestion in a preceding, first, clean-up step, if such is included in the method. If such an enzymatic digestion was not included in the first clean-up step, or a first clean-up step is not performed, then analogously, unwanted extension may be avoided by controlling the hybridisation of the padlock probe as discussed above, i.e. with a 2- step hybridisation of the padlock probe, using different annealing temperatures.

After the ligation step, it may be desirable or advantageous for any unligated probes in the reaction mixture to be removed or rendered inert. By rendered inert is meant that the padlock probe is not able to take part in any unwanted hybridisation or ligation reactions. In particular, that the 3’ end of the probe is not capable of hybridising to any nucleic acid present in the mixture and priming an extension reaction. In this regard, it may particularly be desired to ensure that unligated padlock probes remaining in the mixture after the ligation reaction are not able to hybridise to the second RCP when it is generated. Thus, in effect a clean-up step may be performed at this stage (this may be a first or a second clean-up depending upon whether a first clean-up was performed. In the case of a single-vessel method, both clean up steps will be included. A clean-up reagent mix may be added to achieve this.

The clean-up reagent may be a digestion enzyme to degrade unreacted probes, e.g. an enzyme having exonuclease activity as discussed above. Where such an enzyme is used, it is then inactivated by heating, prior to the subsequent second RCA step (step (f). For example, if lambda exonuclease is used, the reaction mixture may be incubated at a temperature of 30-37°C for 10-20 minutes, followed by heat inactivation at about 70-75°C for 10-30 minutes. This can be varied appropriately depending on the precise reagents used, and the digestion and inactivation conditions are within the routine skill of the person skilled in the art to determine and optimise.

Alternatively, or additionally, a dilution step may be performed to perform this clean-up. Thus, a volume of diluent may be added to the reaction mixture. In particular, the reaction mixture may be diluted at least 4X. As noted above, where the method is to be used for detection of a sequence variant, it is generally preferred for the clean-up to be performed by enzymatic digestion.

Alternatively, or additionally, hairpin forming padlock probes may be used. Such a probe comprises self-complementary sequences which may hybridise to one another to form a hairpin-structure comprising the 3’ end of the probe. The stem of the hairpin may be designed to have a thermal stability that does not prevent the formation of a hybrid between the loop of the hairpin structure and the target sequence in the first RCP, nor inhibit RCA. However, the stem is sufficiently stable to prime extension, and thus convert non-ligated padlock probes to full hairpins, which are unable to hybridise to or prime extension using the second RCA product as extension template. Such padlock probes which can be inactivated are termed suicide padlock probes. Suicide padlock probes are also described in US 6, 573, 051.

The second RCA reaction is performed using the ligated padlock probes as the second RCA template. To initiate this second RCA, a second RCA reagent mix is contacted with (e.g. added to) the reaction mixture which contains the ligated padlocks. Generally speaking, this will involve adding a RCA polymerase, which may be as discussed above, and which may be provided in a reagent mix as discussed above. Whilst all the reagents for the second RCA may be added together in one reagent mix, it may be advantageous to add them separately, or step-wise. The second RCA is primed by a separate RCA primer. This may be prehybridised to the padlock probe or it may be separately added to the reaction mixture, or included in the RCA reagent mix. The binding site for the RCA primer may be provided in a region of the padlock probe which is different to the target binding regions (e.g. in the backbone region of the padlock, between the targetbinding ends).

In one embodiment, the RCA primer for the second RCA may be included in the clean-up reagent mix. In another embodiment, it may be added separately at the same time as the clean-up reagent mix. In another embodiment, it may be added to the reaction mixture after the exonuclease for clean-up has been inactivated. The second RCA may then be initiated by adding a RCA reagent mix comprising the RCA polymerase and dNTPs. In still another embodiment, the RCA polymerase is added in a first step, and then dNTPs and the RCA primer are separately added.

Where the RCA primer is included in the clean-up mix, it may be protected from digestion by the presence of phosphorothioate bonds. Alternatively, an exonuclease may be used in the clean-up which degrades double-stranded DNA and single-stranded DNA with a 5’ phosphate group; padlock probes have a 5’ phosphate group and so are degraded by such an exonuclease, and the RCA primer may be provided without a 5’ phosphate group, and so may be spared from digestion by the exonuclease. An example of such an exonuclease is lambda exonuclease. In another embodiment, the clean-up of unligated padlock probes may be performed by the RCA polymerase. Thus, Phi29 has 3’ exonuclease activity, and this may be used to digest the unligated padlocks. In such an embodiment, an addition of an RCA polymerase enzyme with 3’ exonuclease may be made, and the enzyme may be allowed to perform the digestion (i.e. the reaction mixture may be incubated). Subsequently, the RCA reaction may be performed. In this case, this may be achieved by adding a primer and nucleotides to allow the RCA reaction to proceed. Thus, in this embodiment, the RCA polymerase may be added prior to the second RCA step, for a clean-up purpose, before the second RCA reaction is performed.

As indicated above, reagents, procedures and protocols for conducting RCA reactions are known in the art and can be employed here.

The method herein requires a first RCA step, templated by the circularised amplicon, and at least a second RCA step, templated by the circularised padlock probe. The method may comprise further RCA steps, to generate a third, or further generation RCA product, using second, third, or fourth padlock probes, and so on, each targeting the target nucleic acid sequence. In other words, if desired, steps (d) to (f) may be repeated. The final generation RCA product may be detected.

It will be understood that such further generations of RCA may act to increase the signal which is ultimately detected. This may accordingly result in increased signal amplification. However, it will also be understood that where the first and further padlock probes are gap-fill padlocks, this will result in the increased synthesis of copies of the target nucleotide sequence, or of a part thereof (depending on whether successive gap-fill reactions are fully or partially overlapping). In other words, the method may result in a clonal production of large numbers of the filled-in sequence. In still other words, the filled-in sequence (i.e. the target nucleic acid sequence, or a part thereof) may be amplified. This can be useful for preparative reactions. If multiple successive (i.e. two or more) first and further padlock probe ligation steps involve gap-fill of the same or overlapping sequences, then more copies of a specific target sequence can be generated.

The second, or as noted above, further RCP product is detected in order to detect the target nucleic acid sequence. In order to achieve this, the RCP may be rendered detectable. For example, it may be provided with a detectable label or a means by which it may be detected. Conveniently, in one embodiment, detection reagents may be used to achieve this. This is described in more detail further below. However, briefly, detection reagents for detecting the second or further RCP may be included with one or more RCA reagents added to the reaction mixture, or detection reagents may be added to the reagent mixture after the second or further RCP has been generated. Alternatively, a label may be incorporated into the RCP as it is generated, for example, using labelled nucleotides.

In one advantageous embodiment, the detection reagent is a detection oligonucleotide which is hybridised to the RCP, as described further below. The RCP is detected by detecting the detection oligonucleotide. In this regard, the detection oligonucleotide may be provided with a label, or some other detectable moiety.

To carry out the actual detection, the reaction mixture, or a part of aliquot thereof, may be removed from the vessel and moved to a detection instrument, or for further processing before detection. Thus, the contact with the detection reagents may, in an embodiment, take place outside the reaction vessel in which the preceding steps of the method were performed.

The method may be carried out in heterogenous or homogenous formats. That is, it may be performed on a solid phase (or support), or in solution or suspension (i.e. without a solid phase or support), or indeed both, since a solid phase may be introduced at a later stage, for example, at the step of detecting the second RCP, or at the stage of generating the second RCP, etc. In this regard, in the single-vessel embodiment of the method, up to the point of generating the second RCP, it will be understood that any solid phase will be a solid phase that can be added to the vessel, e.g. a particulate solid phase such as beads, or the solid phase will be provided by the vessel itself (e.g. the walls and/or base of the vessel).

The format of the method may be selected based on the nature of the sample, or the target nucleic acid molecule, or the desired readout or detection technology used. In an embodiment, the method is an in-solution method, that is, a method performed in a liquid phase contained in the vessel.

It will be noted from the above description that the method involves changes of temperature within and between different steps. Indeed, a feature of the method is that it is performed under temperature control, at least for the asymmetric PCR step. Conveniently, the method may be performed in a thermal cycling instrument. This permits a ready control of the temperature changes. As noted above, the methods involve a number of steps in which reagents are contacted with the target nucleic acid molecule, or a subsequent reaction mixture. Conveniently, they may be added to a vessel containing the initial reaction mixture. The reagents are conveniently added in a buffer. In other words, the various reagent mixes used in the steps of the method discussed above typically include a buffer. In the case of a single step method in particular, the steps of the method are performed in the same reaction vessel without intervening washing steps. In such a situation, or indeed even if not in a single-vessel format, it is desirable for the buffers used for the various reagent additions to be compatible. In an embodiment, the same buffer is used for all the various reagents or reagent mixes.

The target nucleic acid molecule may be present within a sample, or obtained from a sample. The sample may be any sample which contains any amount of nucleic acid, from any source or of any origin, in which it is desired to detect a target nucleic acid sequence in a target nucleic acid molecule. A sample may thus be any clinical or non-clinical sample, and may be any biological, clinical or environmental sample in which the target nucleic acid molecule may occur.

The sample may be any sample which contains a target nucleic acid molecule, and includes both natural and synthetic samples, that is, materials which occur naturally or preparations which have been made. Naturally occurring samples may be treated or processed before being subjected to the methods herein. All biological and clinical samples are included, e.g. any cell or tissue sample of an organism, or any body fluid or preparation derived therefrom, as well as samples such as cell cultures, cell preparations, cell lysates etc. Environmental samples, e.g. soil and water samples or food samples are also included. The samples may be freshly prepared or they may be prior-treated in any convenient way e.g. for storage.

Representative samples thus include any material which may contain a target nucleic acid molecule, including, for example, foods and allied products, clinical and environmental samples. The sample may contain any viral or cellular material, including all prokaryotic or eukaryotic cells, viruses, bacteriophages, mycoplasmas, protoplasts and organelles. Such biological material may thus comprise all types of mammalian and non-mammalian animal cells, plant cells, algae including blue green algae, fungi, bacteria, protozoa etc., or a virus. The cells may be, for example, human cells, avian cells, reptile cells etc., without limitation. Representative samples thus include whole blood and blood-derived products such as plasma, serum and buffy coat, blood cells, urine, faeces, cerebrospinal fluid or any other body fluids (e.g. respiratory secretions, saliva, milk, etc.), tissues, biopsies, cell cultures, cell suspensions, conditioned media or other samples of cell culture constituents, etc. The sample may be pre-treated in any convenient or desired way to prepare for use in the method, for example, by cell lysis or purification, isolation of the nucleic acid, etc.

In one embodiment, the sample comprises microbial cells or viruses which have been isolated from a clinical sample or from a culture of a clinical sample. In such a sample, the target nucleic acid molecule may be a nucleotide sequence present in a microbial cell, e.g. a nucleotide sequence which is characteristic for, or discriminatory or identificatory of a microbial cell or virus, at any level, e.g. at type, group, class, genus, species or strain level.

In another embodiment, the sample may contain cell-free DNA. The sample may be a sample such as plasma or serum which directly contains cell-free DNA, or the cell-free DNA may be isolated.

In another embodiment, the sample may contain exosomes.

Since the target nucleic acid molecule need not itself be the target analyte of the assay, but can, for example, be a reporter molecule used or generated in the course of an assay for any desired analyte, the sample need not be a sample which naturally contains nucleic acid, or a source of nucleic acid (e.g. a cell or virus, or biological or clinical material etc.). As indicated above, the sample may be a synthetic or artificial sample. It may accordingly be a sample which has been subjected to a detection assay for an analyte in which a target nucleic has been generated, or to which a target nucleic acid molecule has been added. It may be a reaction mixture, or a reaction product, for example, the product resulting from an immunoassay to detect a target analyte, e.g. an immunoPCR, immunoRCA, or proximity assay (e.g. proximity ligation assay (PLA) or proximity extension assay (PEA).

The target analyte may be any analyte it is desired to detect. As discussed above, in embodiments, the target nucleic acid molecule of the method herein is the target analyte. In other embodiments, where the target nucleic acid molecule is a reporter, the target analyte may be any analyte it is desired to detect. The analyte may be a nucleic acid, a protein (which term includes peptides and polypeptides), or any other chemical or biological molecule or moiety, including, for example, carbohydrates, e.g. such as may occur as glycosyl groups on proteins. The target analyte may thus be a modified protein, for example, with a post-translational modification which is detected in an assay for an analyte.

In an embodiment, the target analyte may be a protein or component of a proteinaceous molecule which is detected on the surface of a cell, or vesicle, or other cellular or sub-cellular compartment. For example, extracellular vesicles, or exosomes, may be detected and distinguished by virtue of different proteins present on their surface. Prostasomes have been proposed as biomarkers for prostate cancer, and a particular or selected prostasome or other extracellular vesicle may be detected and distinguished by detecting one or more surface proteins thereon.

The padlock probes may comprise one or more further sequences which may serve to introduce a sequence into the ligated product, and thereby into the RCP (as a complementary copy). This may be, for example, a tag or detection sequence, e.g. a barcode or identificatory motif, or a binding site for a detection probe or primer. This is particularly the case for the padlock probe. Such a further sequence may be found, for example, in a portion of the backbone region of the padlock probe, that is the region between the target-binding regions. In a dumbbell probe, it may be in the duplex region of the probe. Tags such as barcodes or probe/primer binding sites may be designed with different needs/purposes, for example, to introduce a universal or common sequence to enable different ligated probes in a multiplex setting to be processed or detected together, e.g. a sample index or such like, or to introduce a binding site for a universal or common amplification primer. This would enable different ligated probes to be amplified together, e.g. in a library amplification by PCR or RCA.

In particular, the padlock probe may contain a detection sequence by which it may be detected. A complement of the detection sequence will become incorporated into the second (or further) RCP, and may be detected, for example by the binding to it of a detection probe, or by sequencing. The detection sequence may be specific to the padlock probe, and thus to the target sequence, or sequence variant it is desired to detect. Thus, each padlock probe may have a different detection sequence. The detection sequence may be detected to detect or identify which padlock probe was amplified in the RCA, and hence which target sequence was present. Such a protocol may be applied, for instance, in the context of the method for detecting a target sequence variant, where each padlock is provided with a detection sequence specific to particular variant. The detection sequence may thus be seen as a marker or identification sequence. The term “detection sequence” as used herein includes both the detection sequence as it occurs in the padlock probe, and the complementary copy as it appears in the RCP.

Accordingly, a tag/barcode sequence, including particularly a detection sequence, may be used to “label” different padlock probes so that they, or their ligation or amplification products, may readily be distinguished from one another. Additionally, such a sequence may be used to tag different samples etc., for example, so that they may be pooled (i.e. a “sample” tag or marker). Thus, in a multiplex setting, different probes (i.e. probes for different target nucleic acid sequences or different variants) may be provided with different tag sequences (e.g. different marker or detection sequences) and/or they may be provided with the same tag sequence(s) e.g. for the introduction of a common or universal sequence. Such methods may be used in conjunction with particular detection methods, including the use of detection probes or sequencing methods such as sequencing by hybridisation, sequencing by ligation or other next generation sequencing chemistries, e.g. in the multiplexed detection of multiple target nucleic acids in a sample.

The term "hybridisation" or "hybridises" as used herein refers to the formation of a duplex between nucleotide sequences which are sufficiently complementary to form duplexes via Watson-Crick base pairing, or any analogous base-pair interactions. Two nucleotide sequences are "complementary" to one another when those molecules share base pair organization homology. Hence, a region of complementarity in a molecule or probe or sequence refers to a portion of that molecule or probe or sequence that is capable of forming a duplex. Hybridisation does not require 100% complementarity between the sequences, and hence regions of complementarity to one another do not require the sequences to be fully complementary, although this is not excluded. Thus, the regions of complementarity may contain one or more mismatches. Accordingly, "complementary", as used herein, means "functionally complementary", i.e. a level of complementarity sufficient to mediate a productive hybridisation, which encompasses degrees of complementarity less than 100%. The degree of mismatch tolerated can be controlled by suitable adjustment of the hybridisation conditions. Those skilled in the art of nucleic acid technology can determine duplex stability empirically considering a number of variables including, for example, the length and base pair composition of the respective molecules or probe oligonucleotides, ionic strength, and incidence of mismatched base pairs, following the guidance provided by the art. Thus, the design of appropriate probes or ligation templates or primers, and binding regions thereof, and the conditions under which they hybridise to their respective targets is well within the routine skill of the person skilled in the art.

A region of complementarity, such as, for example, to a target sequence in the binding region of a padlock probe, or between a detection sequence and a detection probe, or a RCA primer to the circularised padlock probe, or a PCR primer to the target nucleic acid molecule/sequence, or a ligation template to the amplicon etc., may be at least 6 nucleotides long, to ensure specificity of binding, or more particularly, at least 7, 8, 9 or 10 nucleotides long. The upper limit of length of the region is not critical, but may, for example, be up to 50, 40, 35, 30, 25, 20 or 15 nucleotides. A complementary region may thus have a length in a range between any one of the lower length limits and upper length limits set out above. In the case of a padlock probe, the length of an individual target-binding region may be in the lower ranges, so that the total length of the two binding regions when hybridised to their target is within the upper ranges. For example, an individual target binding region may be 8-15, e.g. 10-12 nucleotides, so that the total hybridised length is 16- 30 nucleotides long, e.g. 20-24. It may be desirable, within the constraints of conformation of the probes, and spacing of the domains, and desired or favoured hybridisations, to minimise the total length of a padlock probe to minimise the size of the circle which is subjected to RCA, and hence to minimise the lengths of the complementary regions where possible.

The second RCP (and/or any further generation RCP) may be detected using any convenient protocol. Depending on the target sequence to be detected, the purpose of the method, and/or the specific details of the procedures employed in the method, the detection protocol employed may detect the RCP non- specifically or specifically.

For instance, the RCP may be detected directly, e.g. the concatemer may be cleaved to generate monomers which may be detected using gel electrophoresis, or more typically by hybridising labelled detection oligonucleotides (which may alternatively be referred to as detection probes) that hybridise to the detection sequence in the RCP, as discussed above. The detection oligonucleotide need not, however, be directly labelled. For example, the detection oligonucleotide may be an unlabelled probe which functions as a sandwich probe. The concept of sandwich probes is well known in the art and may be applied according to any convenient protocol. The sandwich probes can bind to the RCP but are not directly labelled themselves; instead, they comprise a sequence to which labelled secondary oligonucleotides can bind, thus forming a “sandwich” between the RCP and the labelled secondary oligonucleotide.

A RCP may also be detected using non-sequence-specific nucleic acid labelling methods, e.g. DNA binding stains or dyes, which are widely known in the literature, or by using labelled nucleotides for incorporation into the RCP. Alternatively, the RCP may be detected indirectly, e.g. the product may be amplified by PCR and the amplification products may be detected.

The RCP may be detected using any of the well-established methods for analysis of nucleic acid molecules known from the literature including liquid chromatography, electrophoresis, mass spectrometry, including CyTOF, microscopy, real-time PCR, fluorescent probes, microarray, colorimetric analysis such as ELISA, flow cytometry, mass spectrometry, or by turbidometric, magnetic, particle counting, electric, surface sensing, or weight-based detection techniques. Generally speaking, such techniques are relevant for in-solution assays.

Labelled detection oligonucleotides may be labelled with any detectable label, which may be directly or indirectly signal-giving. For example, the label may be spectroscopically or microscopically detectable, e.g. it may be a fluorescent or colorimetric label, a particle or an enzymatic label. Any of the labels used in immunohistochemical techniques may be used.

In multiplex procedures for detecting different target sequences and/or variant target sequences, different second or further RCPs may be detected and distinguished by in situ sequencing, including for example sequencing-by-synthesis, sequencing-by-hybridisation and sequencing-by-ligation, next generation sequencing and/or sequential barcode decoding techniques, including by sequencing-by-synthesis, -ligation or -hybridisation, and/or by using detection probes. Depending on the level of multiplexing, combinatorial labelling methods may be used, according to techniques well known in the art. For example, the large number of repeated sequences in the sRCA products can enable distinction amongst large numbers of such products via ratio labelling with fluorescent or other spectrophotometrically detectable probes. Such ratio-labelled detection probes may be used during flow cytometry, or microscopic detection techniques, e.g. imaging, to detect large numbers of sequences, e.g. the combination of at least two fluorophores at different ratios can lead to generation of multiple populations of fluorescent labels. For example, it has been found that using combinations of two fluorophores at different ratios 7 different populations can be created. This may be expanded using 3- or 4-colour combinations.

In methods which involve the use of detection oligonucleotides, the detection oligonucleotide or any secondary labelling oligonucleotide may be labelled with a directly or indirectly detectable label. A directly detectable label is one that can be directly detected without the use of additional reagents, while an indirectly detectable label is one that is detectable by employing one or more additional reagents, e.g., where the label is a member of a signal producing system made up of two or more components. In many embodiments, the label is a directly detectable label, where directly detectable labels of interest include, but are not limited to: fluorescent labels, radioisotopic labels, chemiluminescent labels, and the like. In many embodiments, the label is a fluorescent label, where the labelling reagent employed in such embodiments is a fluorescently tagged nucleotide(s), e.g. fluorescently tagged CTP (such as Cy3-CTP, Cy5-CTP) etc. Fluorescent moieties which may be used to tag nucleotides for producing labelled probe nucleic acids (i.e. detection probes) include, but are not limited to: fluorescein, the cyanine dyes, such as Cy3, Cy5, Alexa 555, Bodipy 630/650, and the like. Other labels, such as those described above, may also be employed as are known in the art.

Although various detection modalities may be employed, conveniently the second or further RCPs may be detected by microscopy or flow cytometry. In both cases, directly or indirectly labelled detection oligonucleotides may be used, for example, with fluorescent labels which may readily be detected. In particular, in a microscopy-based method, the RCPs may be detected by imaging.

The use of such detection techniques advantageously allow the second or further RCPs to be digitally recorded. Indeed, since the degree of signal amplification afforded by the present method allows the second or further RCPs to be visualised, they may be detected by a camera or any device including a camera, such as a mobile phone.

To detect second or further RCPs generated in a homogenous format, they may be captured or brought down to a solid support, or surface, to facilitate imaging, or microscopic detection more generally. A second RCP, being a second generation RCA product, is larger and heavier and hence readily amenable to bringing down to a surface by centrifugation. Thus, for example, tubes or plates may readily be spun to bring second RCPs down to the bottom of the tube or of a well for detection by microscopy, and particularly imaging.

A variant sequence to be detected in the method may comprise one or more variant bases. Thus, it may be a single nucleotide variant, e.g. a single nucleotide polymorphism (SNP) or mutation, or it may comprise two or more bases. Thus, a variant sequence may comprise a stretch of nucleotides, 2 or more bases of which may be variant. The variant bases may be contiguous or non-contiguous.

The length of the target sequence is not critical, and may vary according to circumstance, and the nature of the target molecule, the target sequence, or the variant position or locus. A target sequence may thus be, by way of representative example only, 1 to 10, e.g., 1-15, 1-12. 1-10, 1-8, 1-7 or 1-6 nucleotides long. In certain embodiments however, a target sequence longer than a single nucleotide may be beneficial, to improve specificity, and in such embodiments, the target sequence may be from any one of 2, 3, 4, 5, or 6 to any one of 6, 7, 8, 9, 10, 12, 15 or 20 nucleotides long. Exemplary target sequences may thus be 4-10, 4-8, 4-7, 4- 6, 5-10, 5-8, 5-7, or 6-8 nucleotides long, for example.

As noted above, also provided herein are kits for performing the methods. The kits may include the PCR primers and padlock probes as discussed above, optionally together with one or more ligation templates, and/or one or more reagents and/or instructions for use of the kit. Such reagents include dNTPs, polymerase and ligase enzymes, as well as RCA primers for the first and/or second RCA reactions. Further, components may include buffers or other reaction components for one or more of the various reactions. Still further optional components may include means or reagents for detecting the second or further RCP. This may include, for example, detection oligonucleotides and any necessary secondary labelling reagents, including, for example, as discussed above. Further optional components may include a solid support and/or means for capture and/or immobilisation of a target nucleic acid molecule, or of a reaction component. Instructions may be, for example, in printed form, or on a computer-readable medium, or as a website address.

Advantages of the methods herein are discussed above. Such advantages are particularly beneficial in the context of detecting a target sequence or variant in complex samples, or where they are present in low abundance. As discussed above, the present method provides a high degree of signal amplification, rendering the method very sensitive. The method is thus particularly suited to detecting or identifying very rare sequence variants. The method can be used, for example, to find and detect tumour-derived mutant DNA sequences in patient samples, including notably cell-free DNA in plasma, e.g., circulating tumour DNA. The method may thus find utility in the diagnosis or monitoring of cancer, or e.g. to reveal recurrence of the cancer. The method may be used in the context of any cell- free DNA, and may also find application in prenatal testing, including particularly NIPT. The technique is rapid, has minimal instrument requirements, and allows multiplex analysis of sequence variants for enhanced sensitivity.

The detection of low frequency or rare sequence variants or mutations also requires high specificity of detection, and a minimised risk of introducing artificial sequence alterations that could be mistaken for variant sequences. This is also afforded by the combination in the present method of the use of limited PCR cycles combined with RCA amplification, and the targeting of the target sequence by a padlock probe requiring dual target recognition.

In the present method, the first RCP is generated in a highly specific manner, and the generation of the second RCP at high amplification is dependent on the presence of the first RCP.

Each RCA product typically contains several hundreds of, or in some cases in the order of thousands of, complements of the RCA template circle (e.g. circularised amplicon or padlock probe) that was used to generate it. The second generation RCA product, which is obtained by RCA of the circularised padlock probes on the first RCA product, therefore can contain, for example, 1000X1000 monomer sequences, and is thus of a large size, and can have molecular weights which reach tens of gigaDaltons. Further, as noted above, further copies of monomer sequences may be generated by further generations of RCA reactions. Such reaction products can readily be detected. The dimensions of a second RCA product can reach several micrometers, and they may readily be visualised as individual products, e.g. by microscopy. Each second, or further, RCA product can be detected as a clonal product, generated from a single target nucleic acid molecule, without the need for compartmentalisation of the RCA reactions. For example, when labelled with fluorescent detection probes, prominent brightly fluorescent reaction products allow counting and distinction of individual amplification products of single molecules across wide fields of view at low magnification (e.g. 20X). The second or further RCA reaction products are sufficiently large and bright to be recorded by standard flow cytometry. This allows ready counting, and digital scoring in a matter of minutes using generally available instrumentation, thus offering excellent quantitative precision over wide dynamic ranges. As mentioned above, the repeat sequences present in the concatemeric second or further RCA product allow analysis of products labelled with distinct combinations of fluorophores, by ratio-labelling techniques, thereby allowing increased multiplexing. Because of their large size and considerable molecular weight, sRCA products may also be enriched by centrifugation in an ordinary desktop centrifuge or similar. Indeed, lower speed centrifugation or unit gravity may suffice. The new method herein thus enables multiple advantages.

The strong signal amplification afforded by the second RCA reaction allows the ready and easy visualisation of the signal, as discussed above, for example microscopically at low magnification or on a digitally scanned image and hence may permit rapid and easy visual inspection of assay results in a clinical scenario, e.g. inspection of pathology results in routine use. Thus, the methods of the invention are particularly suited to clinical analysis procedures.

The methods can be helpful to identify rare integrated copies of viral genomes in human tissues or for otherwise detecting rare RCA products such as upon inefficient mutation detection in tissues. Another example when easy identification of a rare event may be helpful is when screening for the presence of circulating tumour cells (CTC) among a vast majority of non-CTC cells. The strong signal produced by the method allows fast and easy identification of events (detection of CTCs) at low magnification.

The methods allow the detection assay to be speeded up, which may be of value at point of care locations such as doctor's offices etc. In this regard, the second RCA can be performed in a relatively short time.

The ability to perform the method in a single reaction vessel is a significant advantage, allowing ready automation.

Furthermore, the increases in signal strength/speed may allow other means of detection beyond the conventional fluorescence based methods, for example using turbidometric, magnetic, particle counting, electric, surface sensing, and weight-based detection techniques. For example, one individual sRCA product from a second generation RCA after a 1-hour amplification has the potential weight of several femtograms. Such a weight increase may be detected by methods and means known in the art such as cantilevers, surface plasmon methods, and microbalances e.g. quartz crystal microbalances etc. Further, as noted above, the increased size and weight of the second RCP allows it to efficiently be localised to a surface by centrifugation. Conveniently, this may be performed at 3000X in 15 minutes, in contrast to first generation RCA products, which cannot efficiently be captured by centrifugation using a bench-top centrifuge.

The present method can enable the generation of an enhanced signal which is localised to the product of the first RCA, and it also confers the ability to count individual reaction products (second RCA products) using standard flow cytometers or distributed on a planar surface, etc. for highly precise digital detection. In particular the second RCPs may be stained with chromogenic reagents such as HRP, and imaged via a smart phone camera. Thus, the method may permit an equivalent reaction to digital PCR, but with no need for emulsions or microfabricated structures, or conditions where exactly one template is present per compartment.

The prominent amplification products derived from the present method will further permit cloning of individual RCPs, since the product obtained from an individual first RCA template may be visualised. An individual second RCA product may therefore be identified and isolated. For example, with the aid of the present amplification method, visualization can be achieved in low melt agarose for isolation with no need for magnification, and the product may then be isolated e.g. scoped out with a toothpick, analogously to the isolation of bacterial colonies.

The detection of rare mutations can be very important clinically for diagnosis. For example, mutations in certain genes (e.g. KRAS mutations) can be diagnostically important and may serve to identify the emergence of acquired resistance to particular therapies (e.g. anti-EGFR therapy). Much effort has focused in recent years on developing methods for detecting such mutations. The present method could provide a useful addition to such methods.

In addition to enabling the detection of point mutations present at low frequencies in DNA samples, the present method also provides a powerful means of screening DNA samples for the presence of any and all of a very large number of distinct target sequences in a manner that is not possible by PCR alone or any other current method. However, unlike in methods based on PCR alone, the use of padlock probes and RCA allows for several hundred thousands of probes to be applied in parallel with no deterioration of target selectivity.

Furthermore, the discrete nature of the sRCA products allows digital detection by production of one reaction product for each detected target sequence and collection of these prominent reaction products, with minimal risk for mix-up with any other material in the reaction. For example, a probe mix could be created for all types of bacteria or all species of insects or fungi. This could then be used to identify positive reaction products, for example, by amplifying tag sequences on the padlock probes by PCR, and hybridising the products to tag arrays or similar.

Still further, as noted above where second, and optionally further, ligation and RCA reactions are performed using gap-fill padlocks, then a preparative production of copies of the target nucleic acid sequence becomes possible.

The present sRCA method also increases the precision of genotyping by interrogation of the repeated sequences of individual RCA products rather than of individual target sequences. Thereby, occasional mistypings by padlock probes can be tolerated without resulting in erroneous results as long as they are considerably rarer than the correct results within an individual RCA product. This allows for genotyping via a majority-vote mechanism. This can also have consequences for how the padlock probe-based genotyping is done, where it may be possible to enhance sequence distinction by using conditions where neither variant is detected with 100% efficiency, as long as the ratio between correct and incorrect reactions is satisfactory, and a sufficient number of repeats are detected by the padlock probes.

The method will now be described in more detail with reference to the following Figures and non-limiting examples.

Description of Figures

Figure 1 : schematic illustration of the method for generation of SuperRCA amplification products. A) DNA sequences of interest in a sample are amplified by asymmetric PCR, comprising a first exponential PCR reaction (A1) followed by a second linear amplification reaction (A2), which results in preferential accumulation of the desired strand. B) Amplified strands are converted to single-stranded DNA circles via templated ligation of their 5’ and 3’ ends. C) Oligonucleotides that template the circularisation reactions next serve as primers for RCA reactions. D) The RCA products are then interrogated with padlock probes specific for mutant or wildtype sequences. E) Ligated padlock probes, wound around the RCA products, thereafter template secondary RCA reactions, primed by an added oligonucleotide. F) For each starting DNA circle, the reaction gives rise to large clusters of mainly single-stranded DNA objects, called SuperRCA products. Up to a million fluorescence-labelled hybridisation probes can bind each of the mutant- or wildtypespecific products, allowing efficient counting via e.g. standard flow cytometry. ic PCR-sRCA method

The following PCR primers were designed for detection of a KRAS sequence.

Primers:

KRAS Fwd: ATTATAAGGCCTGCTGAAAATGACTGAATATAAACTUG Tm=66.7C

KRAS Rev: TCGTCAAGGCACTCTT Tm=55.1C

The method is performed in single-tube format.

Materials and Methods

Extraction ic DNA. DNA was extracted from BM cells or of whole blood using the QIAamp DNA Blood mini kit (Qiagen cat.51104) and eluted in 50 pL elution buffer. :ion. Sequences of interest in genomic DNA were amplified with SuperFi DNA polymerase (Thermo Scientific) in 25 pL PCR reactions containing 1X SuperFi buffer, 0.2 mM dNTP, 100 nM Fwd/Rev PCR primers, 330 ng gDNA and 0.0005 U/pL SuperFi DNA polymerase. The PCR program was as follows: 98°C for 30 sec, 6 cycles of 98°C for 15 sec, 62°C for 120 sec, for the exponential phase reaction. Then 8 cycles of 98°C for 15 sec, 72°C for 120 sec were carried out to perform the linear amplification reaction, in which only the forward primer anneals and is extended.

1 pL per target from the PCR based library prep was mixed with 20 pL clean-up solution containing 1X SuperRCA buffer (Rarity Bioscience AB), 0.125 pL UNG (Thermo Fisher Inc.), and 0.0006 LI/pL Thermoliable Proteinase K. The mixtures were incubated at 37°C for 10 min, followed by 55°C for 10 min.

Liqase-mediated circularisation of one strand of PCR products. 20 pL ligation solution containing 1X SuperRCA buffer (Rarity Bioscience), 100 nM of ligation template, complementary to both ends of one strand of the amplification products, 0.5 mM NAD (Sigma) and 2 II Ampligase (Lucigen) were added into the clean-up solution containing amplified PCR products. The mixtures were incubated at 95°C for 1 min, followed by 58°C for 30 min.

Target sequence amplification by a first RCA. Circularised strands of PCR products containing target nucleotide positions were amplified by RCA. 5 pL of 1X SuperRCA buffer (Rarity Bioscience), 1.8 mM dNTP (Invitrogen), 2.5 II Phi29 polymerase (New England Biolabs) and 0.5ug/uL BSA were added to the circularized products. The reactions were incubated at 37°C for 30 min, then 65°C for 10 min.

Genotyping of RCA products via padlock probe ligation. Padlock probes were hybridised to first-generation RCA products and ligated in a sequence-specific manner, by adding 5 pL ligation mix containing 1X SuperRCA buffer (Rarity Bioscience), 3 mM NAD (Sigma), 2.5 II Ampligase (Lucigen), and 60 nM genotyping padlock probe pairs to the reaction mixtures, incubating at 55°C for 30 min.

Digestion of the non-reacted genotyping probes. 5 pL clean-up solution containing 1X SuperRCA buffer (Rarity Bioscience AB), 1.2pM primer and 1 U/pL thermolabile exol was added into the reaction mixture and incubated at 37°C for 15 min then at 75°C for 20 min.

Secondary RCA templated by padlock probes bound to primary RCA products. 5 pL RCA mixture containing 1X SuperRCA buffer (Rarity Bioscience), 0.6mM dNTPs and 6 II Phi29 DNA polymerase (New England Biolabs) was added to the mixture and the reactions were incubated at 37°C for 30 min.

Digital recording of SuperRCA products by flow cytometry. The final reaction mixtures containing SuperRCA products were diluted into hybridisation buffer containing 100 nM fluorophore-labeled oligonucleotide probes specific for the different SuperRCA products, in 1X SuperRCA buffer (Rarity Bioscience) to a final volume of 250 pL. The solutions were applied onto the CytoFlex flow cytometer (Beckman Coulter) and SuperRCA products were counted at ‘Medium’ speed (30 pL/minute) for 150 seconds per sample.