METHOD OF SIGNAL ENCODING OF ANALYTES IN A SAMPLE

Title:

METHOD OF SIGNAL ENCODING OF ANALYTES IN A SAMPLE

Document Type and Number:

WIPO Patent Application WO/2020/254519

Kind Code:

Abstract:

The present invention is directed to a method of sequential signal-encoding of analytes in a sample, a use of a set of decoding oligonucleotides to sequentially signal-encode analytes in a sample, and to a kit for sequentially signal-encoding of analytes in a sample.

More Like This:

JP7215694	Biliary tract cancer detection kit or device and detection method
JP2024503343	Compositions and methods for the detection of bacteria and fungi associated with bacterial and candidal vaginitis
WO/1999/043854	METHODS FOR CLEAVING SINGLE-STRANDED AND DOUBLE-STRANDED DNA SUBSTRATES WITH NUCLEOTIDE INTEGRASE

Inventors:

GEIPEL ANDREAS (DE)
REINECKE FRANK (DE)
KORFHAGE CHRISTIAN (DE)
JAGEMANN NADINE (DE)
MOELLERING VANESSA (DE)

Application Number:

PCT/EP2020/067010

Publication Date:

December 24, 2020

Filing Date:

June 18, 2020

Export Citation:

Click for automatic bibliography generation Help

Assignee:

APOLLO LIFE SCIENCES GMBH (DE)

International Classes:

C12Q1/6813

Domestic Patent References:

WO2017189525A1

2017-11-02

Foreign References:

EP0611828A1	1994-08-24
US20180142286A1	2018-05-24

Other References:

KARUNANAYAKE MUDIYANSELAGE ARUNI P K K ET AL: ""Second-generation" fluorogenic RNA-based sensors", METHODS, vol. 161, 17 January 2019 (2019-01-17), pages 24 - 34, XP085713798, ISSN: 1046-2023, DOI: 10.1016/J.YMETH.2019.01.008
HARRY M. T. CHOI ET AL: "Third-generation in situ hybridization chain reaction: multiplexed, quantitative, sensitive, versatile, robust", DEVELOPMENT (SPECIAL ISSUE ON DEVELOPMENT AT THE SINGLE CELL LEVEL), vol. 145, no. 12, 15 June 2018 (2018-06-15), GB, XP055590305, ISSN: 0950-1991, DOI: 10.1242/dev.165753
SUN Z, NGUYEN T, MCAULIFFE K, YOU M: "Intracellular Imaging with Genetically Encoded RNA-based Molecular Sensors.", NANOMATERIALS (BASEL), vol. 9, 8 February 2019 (2019-02-08), XP002795696
LUBECK ET AL.: "Single-cell in situ RNA profiling by sequential hybridization", NAT. METHODS, vol. 11, no. 4, 2014, pages 360 - 361, XP055444270, DOI: 10.1038/nmeth.2892
SHAH ET AL.: "In situ transcription profiling of single cells reveals spatial organization of cells in the mouse hippocampus", NEURON, vol. 92, no. 2, 2016, pages 342 - 357, XP029778129, DOI: 10.1016/j.neuron.2016.10.001
CHEN ET AL.: "RNA imaging. Spatially resolved, highly multiplexed RNA profiling in single cells", SCIENCE, vol. 348, no. 6233, 2015, pages aaa6090, XP055391215, DOI: 10.1126/science.aaa6090
MOFFITT ET AL.: "High-throughput single-cell gene-expression profiling with multiplexed error-robust fluorescence in situ hybridization", PROC. NATL. ACAD. SCI. USA., vol. 113, no. 39, 2016, pages 11046 - 11051, XP055438958, DOI: 10.1073/pnas.1612826113
SHAH ET AL.: "Dynamics and spatial genomics of the nascent transcriptome by intron seqFISH", CELL, vol. 117, no. 2, 2018, pages 363 - 376
PLAYER ET AL.: "Single-copy gene detection using branched DNA (bDNA) in situ hybridization", J. HISTOCHEM. CYTOCHEM., vol. 49, no. 5, 2001, pages 603 - 611, XP001041855
WANG ET AL.: "RNAscope: a novel in situ RNA analysis platform for formalin-fixed, paraffin-embedded tissues", J. MOL. DIAGN., vol. 14, no. 1, 2012, pages 22 - 29
CHOI ET AL.: "Programmable in situ amplification for multiplexed imaging of mRNA expression", NAT. BIOTECHNOL., vol. 28, no. 11, 2010, pages 1208 - 1212, XP055554132, DOI: 10.1038/nbt.1692
CHOI ET AL.: "Third-generation in situ hybridization chain reaction: multiplexed, quantitative, sensitive, versatile, robust", DEVELOPMENT, vol. 145, no. 12, 2018, XP055590305, DOI: 10.1242/dev.165753
MATEO ET AL.: "Visualizing DNA folding and RNA in embryos at single-cell resolution", NATURE, vol. 568, 2019, pages 49, XP036746433, DOI: 10.1038/s41586-019-1035-4
CHENGLONG XIA, HAZEN P. BABCOCK, JEFFREY R. MOFFITT, XIAOWEI ZHUANG: "Multiplexed detection of RNA using MERFISH and branched DNA amplification", SCIENTIFIC REPORTS, vol. 9, no. 1, 1 December 2019 (2019-12-01), pages 1 - 13, XP055769128, DOI: 10.1038/s41598-019-43943-8

Attorney, Agent or Firm:

WITTE, WELLER & PARTNERPATENTANWÄLTE MBB (DE)

Download PDF:

View/Download PDF PDF Help

Claims:

Claims

1. A method of sequential signal-encoding of analytes in a sample, the method

comprising the steps:

(1) providing a set of analyte-specific probes, each analyte-specific probe comprising:

a binding element (S) that specifically interacts with one of the ana lytes to be encoded, and

an identifier element (T) comprising a nucleotide sequence which is unique to said set of analyte-specific probes (unique identifier se quence);

(2) incubating the set of analyte-specific probes with the sample, thereby allow ing a specific binding of the analyte-specific probes to the analyte to be en coded;

(3) removing non-bound probes from the sample;

(4) providing a set of decoding oligonucleotides, each decoding oligonucleotide comprising:

a first connector element (t) comprising a nucleotide sequence which is essentially complementary to at least a section of the unique identifier sequence, and

a translator element (c) comprising a nucleotide sequence allowing a specific hybridization of a signal oligonucleotide;

(5) incubating the set of decoding oligonucleotides with the sample, thereby al lowing a specific hybridization of the decoding oligonucleotides to the unique identifier sequence;

(6) removing non-bound decoding oligonucleotides from the sample; (7) providing a set of signal oligonucleotides, each signal oligonucleotide com prising:

a second connector element (C) comprising a nucleotide sequence which is essentially complementary to at least a section of the nu cleotide sequence of the translator element (c), and

a signal element; and

(8) incubating the set of signal oligonucleotides with the sample, thereby allow ing a specific hybridization of the signal oligonucleotides to the translator element (c).

2. The method of claim 1 , wherein the sample is a biological sample, preferably

comprising biological tissue, further preferably comprising biological cells.

3. The method of claim 2, wherein prior to step (2) the biological tissue and/or

biological cells are fixed.

4. The method of any of claims 1-3, wherein within the set of analyte-specific probes the individual analyte-specific probes comprise binding elements (S1 , S2, S3, S4, S5) which specifically interact with different sub-structures of one of the analytes to be encoded.

5. The method of any of claims 1-4, further comprising:

(9) removing non-bound signal oligonucleotides from the sample; and

(10) detecting the signal, and, optionally, further comprising: (11) selectively removing the decoding oligonucleotides and signal oligonucleo tides from the sample, thereby essentially maintaining the specific binding of the analyte-specific probes to the analyte to be encoded, and, optionally, further comprising:

(12) repeating steps (4)-(11) at least once [(4₂)-(11₂)] to generate an encoding scheme consisting of at least two signals.

6. The method of claim 5, wherein said encoding scheme is predetermined and

allocated to the analyte to be encoded.

7. The method of claim 5 and 6, wherein decoding oligonucleotides are used in

repeated steps (4₂)-(11₂) comprising a translator element (c2) which is identical with or differs from the translator element (d) of the decoding oligonucleotides used in previous steps (4)-(11).

8. The method of claims 5-7, wherein signal oligonucleotides are used in repeated steps (4₂)-(11₂) comprising a signal element which is identical with or differs from the signal element of the decoding oligonucleotides used in previous steps (4)- (1 1).

9. The method of any of claims 1-8, wherein the binding element (S) comprise a nucleic acid comprising a nucleotide sequence allowing a specific binding to the analyte to be encoded, preferably a specific hybridization to the analyte to be en coded.

10. The method of any of claims 1-9, wherein the binding element (S) comprise an amino acid sequence allowing a specific binding to the analyte to be encoded, preferably the binding element is an antibody.

11. The method of any of claims 1-10, wherein the analyte to be encoded is a nucleic acid, preferably DNA or RNA, further preferably mRNA.

12. The method of any of claims 1-11 , wherein the analyte to be encoded is a peptide or a protein.

13. Use of a set of decoding oligonucleotides to sequentially signal-encode analytes in a sample, each decoding oligonucleotide comprising:

a first connector element (t) comprising a nucleotide sequence which is es sentially complementary to at least a section of a nucleotide sequence which is unique to a set of analyte-specific probes (unique identifier se quence), and

a translator element (c) comprising a nucleotide sequence allowing a spe cific hybridization of a signal oligonucleotide.

14. A kit for sequentially signal-encoding of analytes in a sample, comprising

a set of analyte-specific probes, each analyte-specific probe comprising: a binding element (S) that specifically interacts with one of the ana lytes to be encoded, and

an identifier element (T) comprising a nucleotide sequence which is unique to said set of analyte-specific probes (unique identifier se quence); and a set of decoding oligonucleotides, each decoding oligonucleotide compris ing:

a first connector element (t) comprising a nucleotide sequence which is essentially complementary to at least a section of the unique identifier sequence, and

a translator element (c) comprising a nucleotide sequence allowing a specific hybridization of a signal oligonucleotide.

15. The kit of claim 14, further comprising:

a set of signal oligonucleotides, each signal oligonucleotide comprising: a second connector element (C) comprising a nucleotide sequence which is essentially complementary to at least a section of the nu cleotide sequence of the translator element (c), and

a signal element.

Description:

Method of signal encoding of analytes in a sample

[0001] The present invention is directed to a method of sequential signal encoding of analytes in a sample, a use of a set of decoding oligonucleotides to sequen tially signal-encode analytes in a sample, and to a kit for sequentially signal-encoding of analytes in a sample.

FIELD OF THE INVENTION

[0002] The present invention relates to the field of molecular biology, more par ticularly to the detection of analytes in a sample, preferably the detection of biomolecules such as nucleic acid molecules and/or proteins in a biological sample. BACKGROUND OF THE INVENTION

[0003] The analysis and detection of small quantities of analytes in biological and non-biological samples has become a routine practice in the clinical and analytical environment. Numerous analytical methods have been established for this purpose. Some of them use encoding techniques assigning a particular readable code to a specific first analyte which differs from a code assigned to a specific second analyte.

[0004] One of the prior art techniques in this field is the so-called 'single mole cule fluorescence in situ hybridization' (smFISH) essentially developed to detect mRNA molecules in a sample. In Lubeck et al. (2014), Single-cell in situ RNA profiling by sequen tial hybridization, Nat. Methods11 (4), p. 360-361 , the mRNAs of interest are detected via specific directly labeled probe sets. After one round of hybridization and detection, the set of mRNA specific probes is eluted from the mRNAs and the same set of probes with other (or the same) fluorescent labels is used in the next round of hybridization and imaging to generate gene specific color-code schemes over several rounds. The technology needs several differently tagged probe sets per transcript and needs to denature these probe sets after every detection round.

[0005] A further development of this technology does not use directly labeled probe sets. Instead, the oligonucleotides of the probe sets provide nucleic acid sequences that serve as initiator for hybridization chain reactions (HCR), a technology that enables signal amplification; see Shah et al. (2016), In situ transcription profiling of single cells reveals spatial organization of cells in the mouse hippocampus, Neuron 92(2), p. 342-357.

[0006] Another technique referred to as 'multiplexed error robust fluorescence in situ hybridization' (merFISH) is described by Chen et al. (2015), RNA imaging. Spatially resolved, highly multiplexed RNA profiling in single cells, Science 348(6233) :aaa6090. There, the mRNAs of interest are detected via specific probe sets that provide additional sequence elements for the subsequent specific hybridization of fluorescently labeled oligonucleotides. Each probe set provides four different sequence elements out of a total of 16 sequence elements. After hybridization of the specific probe sets to the mRNAs of interest, the so-called readout hybridizations are performed. In each readout hybridization one out of the 16 fluorescently labeled oligonucleotides complementary to one of the sequence elements is hybridized. All readout oligonucleotides use the same fluorescent color. After imaging, the fluorescent signals are destroyed via illumination and the next round of readout hybridization takes place without a denaturing step. As a result, a binary code is generated for each mRNA species. A unique signal signature of 4 signals in 16 rounds is created using only a single hybridization round for binding of specific probe sets to the mRNAs of interest, followed by 16 rounds of hybridization of readout oligonucleo tides labeled by a single fluorescence color.

[0007] A further development of this technology improves the throughput by us ing two different fluorescent colors, eliminating the signals via disulfide cleavage between the readout-oligonucleotides and the fluorescent label and an alternative hybridization buffer; see Moffitt et al. (2016), High-throughput single-cell gene-expression profiling with multiplexed error-robust fluorescence in situ hybridization, Proc. Natl. Acad. Sci. U S A. 113(39), p. 11046-11051.

[0008] A technology referred to as 'intron seqFISH' is described in Shah et al. (2018), Dynamics and spatial genomics of the nascent transcriptome by intron seqFISH, Cell 117(2), p. 363-376. There, the mRNAs of interest are detected via specific probe sets that provide additional sequence elements for the subsequent specific hybridization of fluorescently labeled oligonucleotides. Each probe set provides one out of 12 possible sequence elements (representing the 12‘pseudocolors’ used) per color-coding round. Each color-coding round consists of four serial hybridizations. In each of these serial hybridizations, three readout probes, each labeled with a different fluorophore, are hybrid ized to the corresponding elements of the mRNA-specific probe sets. After imaging, the readout probes are stripped off by a 55% formamide buffer and the next hybridization follows. After 5 color-coding rounds with 4 serial hybridizations each, the color-codes are completed.

[0009] EP 0 611 828 discloses the use of a bridging element to recruit a signal generating element to probes that specifically bind to an analyte. A more specific state ment describes the detection of nucleic acids via specific probes that recruit a bridging nucleic acid molecule. This bridging nucleic acids eventually recruit signal generating nucleic acids. This document also describes the use of a bridging element with more than one binding site for the signal generating element for signal amplification like branched DNA.

[0010] Player et al. (2001), Single-copy gene detection using branched DNA (bDNA) in situ hybridization, J. Histochem. Cytochem. 49(5), p. 603-611 , describe a method where the nucleic acids of interest are detected via specific probe sets providing an additional sequence element. In a second step, a preamplifier oligonucleotide is hybridized to this sequence element. This preamplifier oligonucleotide comprises multiple binding sites for amplifier oligonucleotides that are hybridized in a subsequent step. These amplifier oligonucleotides provide multiple sequence elements for the labeled oligonucleo tides. This way a branched oligonucleotide tree is build up that leads to an amplification of the signal.

[0011] A further development of this method referred to as is described by Wang et al. (2012), RNAscope: a novel in situ RNA analysis platform for formalin-fixed, paraffin-embedded tissues, J. Mol. Diagn. 14(1), p.22-29, which uses another design of the mRNA-specific probes. Here two of the mRNA-specific oligonucleotides have to hybridize in close proximity to provide a sequence that can recruit the preamplifier oligo nucleotide. This way the specificity of the method is increased by reducing the number of false positive signals.

[0012] Choi et al. (2010), Programmable in situ amplification for multiplexed im aging of mRNA expression, Nat. Biotechnol. 28(11), p. 1208-1212, disclose a method known as 'HCR-hybridization chain reaction'. The mRNAs of interest are detected via specific probe sets that provide an additional sequence element. The additional sequence element is an initiator sequence to start the hybridization chain reaction. Basically, the hybridization chain reaction is based on metastable oligonucleotide hairpins that self- assemble into polymers after a first hairpin is opened via the initiator sequence. [0013] A further development of the technology uses so called split initiator probes that have to hybridize in close proximity to form the initiator sequence for HCR, similarly to the RNAscope technology, this reduces the number of false positive signals; see Choi et al. (2018), Third-generation in situ hybridization chain reaction: multiplexed, quantitative, sensitive, versatile, robust. Development 145(12).

[0014] Mateo et al. (2019), Visualizing DNA folding and RNA in embryos at sin gle-cell resolution, Nature Vol, 568, p. 49ff . , disclose a method called 'optical reconstruc tion of chromatin structure (ORCA). This method is intended to make the chromosome line visible.

[0015] The methods known in the art, however, have numerous disadvantages. In particular, they are inflexible, expensive, complex, time consuming and quite often provide non-accurate results. In particular, the encoding capacities of the existing meth ods are low and do not meet the requirements of modern molecular biology and medicine.

[0016] Against this background, it is an object underlying the present invention to provide a method by means of which the disadvantages of the prior art methods can be reduced or even avoided.

[0017] The present invention satisfies these and other needs.

SUMMARY OF THE INVENTION

[0018] The present invention provides a method of sequential signal-encoding of analytes in a sample, the method comprising the steps:

(1) providing a set of analyte-specific probes, each analyte-specific probe comprising: a binding element (S) that specifically interacts with one of the ana lytes to be encoded, and an identifier element (T) comprising a nucleotide sequence which is unique to said set of analyte-specific probes (unique identifier se quence);

(2) incubating the set of analyte-specific probes with the sample, thereby al lowing a specific binding of the analyte-specific probes to the analyte to be encoded;

(3) removing non-bound probes from the sample;

(4) providing a set of decoding oligonucleotides, each decoding oligonucleo tide comprising: a first connector element (t) comprising a nucleotide sequence which is essentially complementary to at least a section of the unique identifier sequence, and a translator element (c) comprising a nucleotide sequence allowing a specific hybridization of a signal oligonucleotide;

(5) incubating the set of decoding oligonucleotides with the sample, thereby allowing a specific hybridization of the decoding oligonucleotides to the unique identifier sequence;

(6) removing non-bound decoding oligonucleotides from the sample;

(7) providing a set of signal oligonucleotides, each signal oligonucleotide comprising: a second connector element (C) comprising a nucleotide sequence which is essentially complementary to at least a section of the nu cleotide sequence of the translator element (c), and a signal element; and (8) incubating the set of signal oligonucleotides with the sample, thereby al lowing a specific hybridization of the signal oligonucleotides to the trans lator element (c).

[0019] The inventors have realized that this novel method provides the essen tial steps required to set up a process allowing the specific quantitative and/or spatial detection or counting of different analytes or different single analyte molecules in a sample in parallel via specific hybridization. The technology allows distinguishing a higher number of analytes than different signals available. In contrast to other state-of-the-art methods the oligonucleotides providing the detectable signal are not directly interacting with sample-specific nucleic acid sequences but are mediated by so called 'decoding- oligonucleotides'. This mechanism decouples the dependency between the analyte- specific oligonucleotides and the signal oligonucleotides and, therefore, results in a dramatical increase of the encoding capacity.

[0020] Another subject-matter of the invention is the use of a set of decoding ol igonucleotides to sequentially signal-encode analytes in a sample, each decoding oligo nucleotide comprising: a first connector element (t) comprising a nucleotide sequence which is essentially complementary to at least a section of a nucleotide sequence which is unique to a set of analyte-specific probes (unique identifier se quence), and a translator element (c) comprising a nucleotide sequence allowing a specific hybridization of a signal oligonucleotide.

[0021] A still further subject-matter of the present invention is a kit for sequen tially signal-encoding of analytes in a sample, comprising a set of analyte-specific probes, each analyte-specific probe comprising: a binding element (S) that specifically interacts with one of the analytes to be encoded, and an identifier element (T) comprising a nucleotide sequence which is unique to said set of analyte-specific probes (unique identifier se quence); and a set of decoding oligonucleotides, each decoding oligonucleotide com prising: a first connector element (t) comprising a nucleotide sequence which is essentially complementary to at least a section of the unique identifier sequence, and a translator element (c) comprising a nucleotide sequence allowing a specific hybridization of a signal oligonucleotide; and, preferably, a set of signal oligonucleotides, each signal oligonucleotide comprising: a second connector element (C) comprising a nucleotide sequence which is essentially complementary to at least a section of the nucleotide sequence of the translator element (c), and a signal element.

[0022] The use of decoding-oligonucleotides allows a much higher flexibility while dramatically decreasing the number of different signal oligonucleotides needed which in turn increases the encoding capacity achieved. The utilization of decoding- oligonucleotides leads to a sequential signal-coding technology that is more flexible, cheaper, simpler, faster and/or more accurate than other methods. In particular, the invention results in a significant increase of the encoding capacity in comparison to the prior art methods. [0023] The use of decoding oligonucleotides breaks the dependencies between the target specific probes and the signal oligonucleotides. Without decoupling target specific probes and signal generation as in the methods of the state of the art, two differ ent signals can only be generated for a certain target if using two different molecular tags. Each of these molecular tags can only be used once. Multiple readouts of the same molecular tag do not increase the information about the target. In order to create an encoding scheme, a change of the target specific probe set after each round is required (SeqFISH) or multiple molecular tags must be present on the same probe set (like mer- FISH, intronSeqFISH). These restrictions in the art are very relevant and reduce the flexibility, coding capacity, accuracy, reproducibility and increase the costs of the experi ment.

[0024] According to the invention an "analyte" is the subject to be specifically detected as being present or absent in a sample and, in case of its presence, to encode it. It can be any kind of entity, including a protein or a nucleic acid molecule (RNA or DNA) of interest. The analyte provides at least one site for specific binding with analyte-specific probes. Sometimes herein the term "analyte" is replaced by "target". An "analyte" accord ing to the invention incudes a complex of subjects, e.g. at least two individual nucleic acid, protein or peptides molecules. In an embodiment of the invention an "analyte" excludes a chromosome. In another embodiment of the invention an "analyte" excludes DNA.

[0025] A "sample" as referred to herein is a composition in liquid or solid form suspected of comprising the analytes to be encoded.

[0026] An "oligonucleotide" as used herein, refers to s short nucleic acid mole cule, such as DNA, PNA, LNA or RNA. The length of the oligonucleotides is within the range 4-200 nucleotides (nt), preferably 6-80 nt, more preferably 8-60 nt, more preferably 10-50 nt, more preferably 12 to 35 depending on the number of consecutive sequence elements. The nucleic acid molecule can be fully or partially single-stranded. The oligonu cleotides may be linear or may comprise hairpin or loop structures. The oligonucleotides may comprise modifications such as biotin, labeling moieties, blocking moieties, or other modifications. [0027] The "analyte-specific probe" consists of at least two elements, namely the so-called binding element (S) which specifically interacts with one of the analytes, and a so-called identifier element (T) comprising the 'unique identifier sequence'. The binding element (S) may be a nucleic acid such as a hybridization sequence or an aptamer, or a peptidic structure such as an antibody. The "unique identifier sequence" as comprised by the analyte-specific probe is unique in its sequence compared to other unique identifiers. "Unique" in this context means that it specifically identifies only one analyte, such as Cyclin A, Cyclin D, Cyclin E etc., or, alternatively, it specifically identifies only a group of analytes, independently whether the group of analytes comprises a gene family or not. Therefore, the analyte or a group of analytes to be encoded by this unique identifier can be distinguished from all other analytes or groups of analytes that are to be encoded based on the unique identifier sequence of the identifier element (T). Or, in other words, there is only one 'unique identifier sequence' for a particular analyte or a group of ana lytes, but not more than one, i.e. not even two. Due to the uniqueness of the unique identifier sequence the identifier element (T) hybridizes to exactly one type of decoding oligonucleotides. The length of the unique identifier sequence is within the range 8-60 nt, preferably 12-40 nt, more preferably 14-20 nt, depending on the number of analytes encoded in parallel and the stability of interaction needed. A unique identifier may be a sequence element of the analyte-specific probe, attached directly or by a linker, a covalent bond or high affinity binding modes, e.g. antibody-antigen interaction, streptavidin-biotin interaction etc. It is understood that the term "analyte specific probe" includes a plurality of probes which may differ in their binding elements (S) in a way that each probe binds to the same analyte but possibly to different parts thereof, for instance to different (e.g. neighbor ing) or overlapping sections of the nucleotide sequence comprised by the nucleic acid molecule to be encoded. However, each of the plurality of the probes comprises the same identifier element (T).

[0028] A "decoding oligonucleotide" consists of at least two sequence ele ments. One sequence element that can specifically bind to a unique identifier sequence, referred to as "first connector element" (t), and a second sequence element specifically binding to a signal oligonucleotide, referred to as "translator element" (c). The length of the sequence elements is within the range 8-60 nt, preferably 12-40 nt, more preferably 14-20 nt, depending on the number of analytes to be encoded in parallel, the stability of interaction needed and the number of different signal oligonucleotides used. The length of the two sequence elements may or may not be the same.

[0029] A "signal oligonucleotide" as used herein comprises at least two ele ments, a so-called "second connector element" (C) having a nucleotide sequence specifi cally hybridizable to at least a section of the nucleotide sequence of the translator element (c) of the decoding oligonucleotide, and a "signal element" which provides a detectable signal. This element can either actively generate a detectable signal or provide such a signal via manipulation, e.g. fluorescent excitation. Typical signal elements are, for example, enzymes that catalyze a detectable reaction, fluorophores, radioactive elements or dyes.

[0030] A "set" refers to a plurality of moieties or subjects, e.g. analyte-specific probes or decoding oligonucleotides, whether the individual members of said plurality are identical or different from each other. In an embodiment of the invention a single set refers to a plurality of oligonucleotides

[0031] An "analyte specific probe set" refers to a plurality of moieties or sub jects, e.g. analyte-specific probes that are different from each other and bind to independ ent regions of the analyte. A single analyte specific probe set is further characterized by the same unique identifier.

[0032] A "decoding oligonucleotide set" refers to a plurality of decoding oligo nucleotides specific for a certain unique identifier needed to realize the encoding inde pendent of the length of the code word. Each and all of the decoding oligonucleotides included in a "decoding oligonucleotide set" bind to the same unique identifier element (T) of the analyte-specific probe.

[0033] "Essentially complementary" means, when referring to two nucleotide sequences, that both sequences can specifically hybridize to each other under stringent conditions, thereby forming a hybrid nucleic acid molecule with a sense and an antisense strand connected to each other via hydrogen bonds (Watson-and-Crick base pairs). "Essentially complementary" includes not only perfect base-pairing along the entire strands, i.e. perfect complementary sequences but also imperfect complementary se quences which, however, still have the capability to hybridize to each other under strin gent conditions. Among experts it is well accepted that an "essentially complementary" sequence has at least 88% sequence identity to a fully or perfectly complementary sequence.

[0034] "Percent sequence identity" or "percent identity" in turn means that a sequence is compared to a claimed or described sequence after alignment of the sequence to be compared (the "Compared Sequence") with the described or claimed sequence (the "Reference Sequence"). The percent identity is then determined according to the following formula: percent identity = 100 [1 -(C/R)] wherein C is the number of differences between the Reference Sequence and the Compared Sequence over the length of alignment between the Reference Sequence and the Compared Sequence, wherein

(i) each base or amino acid in the Reference Sequence that does not have a corresponding aligned base or amino acid in the Compared Sequence and

(ii) each gap in the Reference Sequence and

(iii) each aligned base or amino acid in the Reference Sequence that is different from an aligned base or amino acid in the Compared Sequence, constitutes a difference and (iiii) the alignment has to start at position 1 of the aligned sequences; and R is the number of bases or amino acids in the Reference Sequence over the length of the alignment with the Compared Sequence with any gap created in the Reference Sequence also being counted as a base or amino acid.

[0035] If an alignment exists between the Compared Sequence and the Reference Sequence for which the percent identity as calculated above is about equal to or greater than a specified minimum Percent Identity then the Compared Sequence has the specified minimum percent identity to the Reference Sequence even though alignments may exist in which the herein above calculated percent identity is less than the specified percent identity.

[0036] In the "incubation" steps as understood herein the respective moieties or subjects such as probes or oligonucleotide, are brought into contact with each other under conditions well known to the skilled person allowing a specific binding or hybridization reaction, e.g. pH, temperature, salt conditions etc. Such steps may therefore, be preferably carried out in a liquid environment such as a buffer system which is well known in the art.

[0037] The "removing" steps according to the invention may include the washing away of the moieties or subjects to be removed such as the probes or oligonucleotides by certain conditions, e.g. pH, temperature, salt conditions etc., as known in the art.

[0038] It is understood that in an embodiment of the method according to the invention a plurality of analytes can be encoded in parallel. This requires the use of different sets of analyte-specific probes in step (1). The analyte-specific probes of a particular set differ from the analyte-specific probes of another set. This means that the analyte-specific probes of set 1 bind to analyte 1 , the analyte-specific probes of set 2 bind to analyte 2, the analyte-specific probes of set 3 bind to analyte 3, etc. In this embodiment also the use of different sets of decoding oligonucleotides is required in step (4). The decoding oligonucleotides of a particular set differ from the decoding oligonucleotides of another set. This means, the decoding oligonucleotides of set 1 bind to the analyte- specific probes of above set 1 of analyte-specific probes, the decoding oligonucleotides of set 2 bind to the analyte-specific probes of above set 2 of analyte-specific probes, the decoding oligonucleotides of set 3 bind to the analyte-specific probes of above set 3 of analyte-specific probes, etc. In this embodiment where a plurality of analytes is to be encoded in parallel the different sets of analyte-specific probes may be provided in step (1) as a premixture of different sets of analyte-specific probes and/or the different sets of decoding oligonucleotides may be provided in step (4) as a premixture of different sets of decoding oligonucleotides. Each mixture may be contained in a single vial. Alternatively, the different sets of analyte-specific probes and/or the different sets of decoding oligonu cleotides may be provided in steps (1) and/or (4) singularly.

[0039] A "kit" is a combination of individual elements useful for carrying out the use and/or method of the invention, wherein the elements are optimized for use together in the methods. The kits may also contain additional reagents, chemicals, buffers, reaction vials etc. which may be useful for carrying out the method according to the invention.

Such kits unify all essential elements required to work the method according to the invention, thus minimizing the risk of errors. Therefore, such kits also allow semi-skilled laboratory staff to perform the method according to the invention.

[0040] The features, characteristics, advantages and embodiments specified herein apply to the method, use, and kit according to the invention, even if not specifically indicated.

[0041] In an embodiment of the invention the sample is a biological sample, preferably comprising biological tissue, further preferably comprising biological cells. A biological sample may be derived from an organ, organoids, cell cultures, stem cells, cell suspensions, primary cells, samples infected by viruses, bacteria or fungi, eukaryotic or prokaryotic samples, smears, disease samples, a tissue section.

[0042] The method is particularly qualified to encode, identify, detect, count or quantify analytes or single analytes molecules in a biological sample, i.e. such as a sample which contains nucleic acids or proteins as said analytes. It is understood that the biological sample may be in a form as it is in its natural environment (i.e. liquid, semi liquid, solid etc.), or processed, e.g. as a dried film on the surface of a device which may be re-liquefied before the method is carried out.

[0043] In another embodiment of the invention prior to step (2) the biological tissue and/or biological cells are fixed. [0044] This measure has the advantage that the analytes to be encoded, e.g. the nuclei acids or proteins, are immobilized and cannot escape. In doing so, the analytes then prepared for a better detection or encoding by the method according to the invention. The fixation of the sample can be, e.g., carried out by means of formaline, ethanol, methanol or other components well known to the skilled person.

[0045] In yet a further embodiment within the set of analyte-specific probes the individual analyte-specific probes comprise binding elements (S1 , S2, S3, S4, S5) which specifically interact with different sub-structures of one of the analytes to be encoded.

[0046] By this measure the method becomes even more robust and reliable because the signal intensity obtained at the end of the method or a cycle, respectively, is increased. It is understood, that the individual probes of a set while binding to the same analyte differ in their binding position or binding site at or on the analyte. The binding elements S1 , S2, S3, S4, S5 etc. of the first, second, third fourth, fifth etc. analyte-specific probes therefore bind to or at a different position which, however, may or may not overlap.

[0047] in another embodiment of the method according to the invention it comprises the following additional steps:

(9) removing non-bound signal oligonucleotides from the sample; and

(10) detecting the signal.

[0048] By this measure the method is further developed to such an extent that the encoded analytes can be detected by any means which is adapted to visualize the signal element. Examples of detectable physical features include e.g. light, chemical reactions, molecular mass, radioactivity, etc.

[0049] In a further embodiment of the method according to the invention the following additional step is carried out: (11) selectively removing the decoding oligonucleotides and signal oligonu cleotides from the sample, thereby essentially maintaining the specific binding of the analyte-specific probes to the analyte to be encoded.

[0050] By this measure the requirements for another round of binding further decoding oligonucleotides to the same analyte-specific probes are established, thus finally resulting in a code or encoding scheme comprising more than one signal. This step is realized by applying conditions and factors well known to the skilled person, e.g. pH, temperature, salt conditions, oligonucleotide concentration, polymers etc.

[0051] In another embodiment of the invention, the method comprises the following step:

(12) repeating steps (4)-(11) at least once [(4 ₂)-(112)] to generate an encod ing scheme consisting of at least two signals.

[0052] With this measure a code of more than one signal is set up, i.e. of two, three, four, five etc. signals in case of two [(4 ₂)-(112)], three [(4 ₃)-(11 ₃)], four [(4 ₄)-(11 ₄)], five [(4 ₅)-(115)] , etc. [(4 _n)-(11 _n)] rounds which are carried out by the user, where 'h' is an integer representing the number of rounds. The encoding capacity of the method accord ing to the invention is herewith increased depending on the nature of the analyte and the needs of the operator.

[0053] In an embodiment of the invention said encoding scheme is predeter mined and allocated to the analyte to be encoded.

[0054] This measure enables a precise experimental set-up by providing the appropriate sequential order of the employed decoding and signal oligonucleotides and, therefore, allows the correct allocation of a specific analyte to a respective encoding scheme. [0055] The decoding oligonucleotides which are used in repeated steps (4 ₂)- (11 ₂) may comprise a translator element (c2) which is identical with the translator element (c1) of the decoding oligonucleotides used in previous steps (4)-(1 1). In another embodi ment of the invention decoding oligonucleotides are used in repeated steps (4 ₂)-(1 1 ₂) comprising a translator element (c2) which differs from the translator element (d) of the decoding oligonucleotides used in previous steps (4)-(11).

[0056] It is understood that the decoding elements may or may not be changed from round to round, i.e. in the second round (4 ₂)-(11 ₂) comprising the translator element c2, in the third round (4 ₃)-(1 1 ₃) comprising the translator element c3, in the fourth round (4 ₄)-(11 ₄) comprising the translator element c4, in the fifths round comprising the translator element c5, in the 'h' round (4 _n)-(11 _n) comprising the translator element cn, etc., wherein 'h' is an integer representing the number of rounds.

[0057] The signal oligonucleotides which are used in repeated steps (4 ₂)-(1 1 ₂) may comprise a signal element which is identical with the signal element of the decoding oligonucleotides used in previous steps (4)-(1 1). In a further embodiment of the invention signal oligonucleotides are used in repeated steps (4 ₂)-(1 1 ₂) comprising a signal element which differs from the signal element of the decoding oligonucleotides used in previous steps (4)-(1 1).

[0058] By this measure each round the same or a different signal is provided resulting in an encoding scheme characterized by a signal sequence consisting of numer ous different signals. This measure allows the creation of a unique code or code word which differs from all other code words of the encoding scheme.

[0059] In another embodiment of the invention the binding element (S) of the analyte-specific probe comprises a nucleic acid comprising a nucleotide sequence allow ing a specific binding to the analyte to be encoded, preferably a specific hybridization to the analyte to be encoded.

[0060] This measure creates the condition for encoding a nucleic acid analyte, such as specific DNA molecules, e.g. genomic DNA, nuclear DNA, mitochondrial DNA, viral DNA, bacterial DNA, extra- or intracellular DNA etc., or specific mRNA molecules, e.g. hnRNA, miRNA, viral RNA, bacterial RNA, extra- or intracellular RNA, etc.

[0061] In an alternative embodiment of the invention the binding element (S) of the analyte-specific probe comprises an amino acid sequence allowing a specific binding to the analyte to be encoded, preferably the binding element is an antibody.

[0062] This measure creates the condition for encoding a nucleic acid analyte, such as an mRNA, e.g. such an mRNA coding for a particular protein.

[0063] In another embodiment the analyte to be encoded or detected is a nucle ic acid, preferably DNA or RNA, further preferably mRNA, and/or, alternatively the analyte to be decoded is a peptide or a protein.

[0064] By this measure the invention is adapted to the detection of such kinds of analytes which are of upmost importance in the clinical routine or the focus of biological questions.

[0065] It is to be understood that the before-mentioned features and those to be mentioned in the following cannot only be used in the combination indicated in the respec tive case, but also in other combinations or in an isolated manner without departing from the scope of the invention.

[0066] The invention is now further explained by means of embodiments result ing in additional features, characteristics and advantages of the invention. The embodi ments are of pure illustrative nature and do not limit the scope or range of the invention. The features mentioned in the specific embodiments are general features of the invention which are not only applicable in the specific embodiment but also in an isolated manner in the context of any embodiment of the invention.

[0067] The invention is now described and explained in further detail by refer ring to the following non-limiting examples and figures. Fig. 1 : Embodiment where the analyte is a nucleic acid and the probe set com prises oligonucleotides specifically binding to the analyte. The probes comprise a unique identifier sequence allowing hybridization of decoding oligonucleotides;

Fig. 2: Embodiment where the analyte is a protein and the probe set comprises proteins (here: antibodies) specifically binding to the analyte. The probes comprise a unique identifier sequence allowing hybridization of decoding oligonucleotides;

Fig. 3: Flowchart of the method according to the invention;

Fig. 4: Alternative options for the application of decoding and signal oligonucle otides.

Fig. 5: Example for signal encoding of three different nucleic acid sequences by two different signal types and three detection rounds; in this example, the encoding scheme includes error detection;

Fig. 6: Number of generated code words (logarithmic scale) against number of detection cycles;

Fig. 7: Calculated total efficiency of a 5-round encoding scheme based on sin gle step efficiencies;

Fig. 8: Comparison of relative transcript abundances between different experi ments;

Fig. 9: Correlation of relative transcript abundances between different experi ments;

Fig. 10 Comparison of intercellular distribution of signals; Fig. 11 : Comparison of intracellular distribution of signals;

Fig. 12: Distribution pattern of different cell cycle dependent transcripts.

EXAMPLES

1. Introduction

[0068] The method disclosed herein is used for specific detection of many dif ferent analytes in parallel. The technology allows distinguishing a higher number of analytes than different signals are available. The process preferably includes at least two consecutive rounds of specific binding, signal detection and selective denaturation (if a next round is required), eventually producing a signal code. To decouple the dependency between the analyte specific binding and the oligonucleotides providing the detectable signal, a so called“decoding” oligonucleotide is introduced. The decoding oligonucleotide transcribes the information of the analyte specific probe set to the signal oligonucleotides.

[0069] In a first application variant, the analyte or target is nucleic acid, e.g.

DNA or RNA, and the probe set comprises oligonucleotides that are partially or completely complementary to the whole sequence or a subsequence of the nucleic acid sequence to be detected (Figure 1). The nucleic acid sequence specific oligonucleotide probe sets comprising analyte-specific probes (1) including a binding element (S) that specifically hybridizes to the target nucleic acid sequence to be detected, and an identifier element (T) comprising a nucleotide sequence which is unique to said set of analyte-specific probes (unique identifier sequence).

[0070] In a second application variant, the analyte or target is a protein and the probe set comprises one or more proteins, e.g. antibodies (Figure 2). The protein specific probe set comprising analyte-specific probes (1) including a binding element (T) such as the (hyper-)variable region of an antibody, that specifically interacts with the target protein to be detected, and the identifier element (T). [0071] In a third application variant, at least one analyte is a nucleic acid and at least a second analyte is a protein and at least the first probe set binds to the nucleic acid sequence and at least the second probe set binds specifically to the protein analyte. Other combinations are possible as well.

2. General method according to the invention

[0072] In order to elucidate the workflow in more depth, the following workflow is restricted on the first application variant. A well-trained person can easily adapt the exemplary workflow to other applications. The method steps are depicted in the flowchart of Figure 3.

[0073] Step 1 : Applying the analyte- or target-specific probe set. The target nu cleic acid sequence is incubated with a probe set consisting of oligonucleotides with sequences complementary to the target nucleic acid. In this example, a probe set of 5 different probes is shown, each comprising a sequence element complementary to an individual subsequence of the target nucleic acid sequence (S1 to S5). In this example, the regions do not overlap. Each of the oligonucleotides targeting the same nucleic acid sequence comprises the identifier element or unique identifier sequence (T), respectively.

[0074] Step 2: Hybridization of the probe set. The probe set is hybridized to the target nucleic acid sequence under conditions allowing a specific hybridization. After the incubation, the probes are hybridized to their corresponding target sequences and provide the identifier element (T) for the next steps.

[0075] Step 3: Eliminating non-bound probes. After hybridization, the unbound oligonucleotides are eliminated, e.g. by washing steps.

[0076] Step 4: Applying the decoding oligonucleotides. The decoding oligonu cleotides consisting of at least two sequence elements (t) and (c) are applied. While sequence element (t) is complementary to the unique identifier sequence (T), the se quence element (c) provides a region for the subsequent hybridization of signal oligonu cleotides (translator element). [0077] Step 5: Hybridization of decoding oligonucleotides. The decoding oligo nucleotides are hybridized with the unique identifier sequences of the probes (T) via their complementary first sequence elements (t). After incubation, the decoding oligonucleo tides provide the translator sequence element (c) for a subsequent hybridization step.

[0078] Step 6: Eliminating the excess of decoding oligonucleotides. After hy bridization, the unbound decoding oligonucleotides are eliminated, e.g. by washing steps.

[0079] Step 7: Applying the signal oligonucleotide. The signal oligonucleotides are applied. The signal oligonucleotides comprise at least one second connector element (C) that is essentially complementary to the translator sequence element (c) and at least one signal element that provides a detectable signal (F).

[0080] Step 8: Hybridization of the signal oligonucleotides. The signal oligonu cleotides are hybridized via the complementary sequence connector element (C) to the translator element (c) of decoding oligonucleotide. After incubation, the signal oligonucleo tides are hybridized to their corresponding decoding oligonucleotides and provide a signal (F) that can be detected.

[0081] Step 9: Eliminating the excess of signal oligonucleotides. After hybridiza tion, the unbound signal oligonucleotides are eliminated, e.g. by washing steps.

[0082] Step 10: Signal detection. The signals provided by the signal oligonucle otides are detected.

[0083] The following steps (steps 11 and 12) are unnecessary for the last de tection round.

[0084] Step 11 : Selective denaturation. The hybridization between the unique identifier sequence (T) and the first sequence element (t) of the decoding oligonucleotides is dissolved. The destabilization can be achieved via different mechanisms well known to the trained person like for example: increased temperature, denaturing agents, etc. The target- or analyte-specific probes are not affected by this step. [0085] Step 12: Eliminating the denatured decoding oligonucleotides. The de natured decoding oligonucleotides and signal oligonucleotides are eliminated (e.g. by washing steps) leaving the specific probe sets with free unique identifier sequences, reusable in a next round of hybridization and detection (steps 4 to 10). This detection cycle (steps 4 to 12) is repeated 'h' times until the planed encoding scheme is completed.

[0086] Note that in every round of detection, the type of signal provided by a certain unique identifier is controlled by the use of a certain decoding oligonucleotide. As a result, the sequence of decoding oligonucleotides applied in the detection cycles tran scribes the binding specificity of the probe set into a unique signal sequence.

3. Alternative options for the application of decoding- and signal oligonucleotides

[0087] The steps of decoding oligonucleotide hybridization (steps 4 to 6) and signal oligonucleotide hybridization (steps 7 to 9) can also be combined in two alternative ways as shown in Figure 4.

[0088] Opt. 1 : Simultaneous hybridization. Instead of the steps 4 to 9 of Figure

3, specific hybridization of decoding oligonucleotides and signal oligonucleotides can also be done simultaneously leading to the same result as shown in step 9 of Figure 3, after eliminating the excess decoding- and signal oligonucleotides.

[0089] Opt. 2: Preincubation. Additionally to option 1 of Figure 3, decoding- and signal oligonucleotides can be preincubated in a separate reaction before being applied to the target nucleic acid with the already bound specific probe set.

4. Example for signal encoding of three different nucleic acid sequences by two

different signal types and three detection rounds

[0090] Figure 3 shows the general concept of generation and detection of spe cific signals mediated by decoding oligonucleotides. It does not show the general concept of encoding that can be achieved by this procedure. To illustrate the use of the process shown in Figure 3 for the generation of an encoding scheme, Figure 5 shows a general example for a multiple round encoding experiment with three different nucleic acid se quences. In this example, the encoding scheme includes error detection.

[0091] Step 1 : Target nucleic acids. In this example three different target nucle ic acids (A), (B) and (C) have to be detected and differentiated by using only two different types of signal. Before starting the experiment, a certain encoding scheme is set. In this example, the three different nucleic acid sequences are encoded by three rounds of detection with two different signals (1) and (2) and a resulting hamming distance of 2 to allow for error detection. The planed code words are: sequence A: (1) - (2) - (2); sequence B: (1) - (1) - (1); sequence C: (2) - (1) - (2).

[0092] Step 2: Hybridization of the probe sets. For each target nucleic acid, an own probe set is applied, specifically hybridizing to the corresponding nucleic acid se quence of interest. Each probe set provides a unique identifier sequence (T1), (T2) or (T3). This way each different target nucleic acid is uniquely labeled. In this example sequence (T) is labeled with (T1), sequence (B) with (T2) and sequence (C) with (T3). The illustration summarizes steps 1 to 3 of Figure 3.

[0093] Step 3: Hybridization of the decoding oligonucleotides. For each unique identifier present, a certain decoding oligonucleotide is applied specifically hybridizing to the corresponding unique identifier sequence by its first sequence element (here (t1) to (T1), (t2) to (T2) and (t3) to (T3)). Each of the decoding oligonucleotides provides a translator element that defines the signal that will be generated after hybridization of signal oligonucleotides. Here nucleic acid sequences (A) and (B) are labeled with the translator element (d) and sequence (C) is labeled with (c2). The illustration summarizes steps 4 to 6 of Figure 3. [0094] Step 4: Hybridization of signal oligonucleotides. For each type of transla tor element, a signal oligonucleotide with a certain signal (2), differentiable from signals of other signal oligonucleotides, is applied. This signal oligonucleotide can specifically hybridize to the corresponding translator element. The illustration summarizes steps 7 to 9 of Figure 3.

[0095] Step 5: Signal detection for the encoding scheme. The different signals are detected. Note that in this example the nucleic acid sequence (C) can be distin guished from the other sequences by the unique signal (2) it provides, while sequences (A) and (B) provide the same kind of signal (1) and cannot be distinguished after the first cycle of detection. This is due to the fact, that the number of different nucleic acid se quences to be detected exceeds the number of different signals available. The illustration corresponds to step 10 of Figure 3.

[0096] Step 6: Selective denaturation. The decoding (and signal) oligonucleo tides of all nucleic acid sequences to be detected are selectively denatured and eliminated as described in steps 11 and 12 of Figure 3. Afterwards the unique identifier sequences of the different probe sets can be used for the next round of hybridization and detection.

[0097] Step 7: Second round of detection. A next round of hybridization and de tection is done as described in steps 3 to 5. Note that in this new round the mix of different decoding oligonucleotides is changed. For example, decoding oligonucleotide of nucleic acid sequence (A) used in the first round comprised of sequence elements (t1) and (d) while the new decoding oligonucleotide comprises of the sequence elements (t1) and (c2). Note that now all three sequences can clearly be distinguished due to the unique combi nation of first and second round signals.

[0098] Step 8: Third round of detection. Again, a new combination of decoding oligonucleotides is used leading to new signal combinations. After signal detection, the resulting code words for the three different nucleic acid sequences are not only unique and therefore distinguishable but comprise a hamming distance of 2 to other code words. Due to the hamming distance, an error in the detection of the signals (signal exchange) would not result in a valid code word and therefore could be detected. By this way three different nucleic acids can be distinguished in three detection rounds with two different signals, allowing error detection.

5. Advantages over prior art technologies

Coding strategy

[0099] Compared to state-of-the-art methods, one particular advantage of the method according to the invention is the use of decoding oligonucleotides breaking the dependencies between the target specific probes and the signal oligonucleotides.

[00100] Without decoupling target specific probes and signal generation, two dif ferent signals can only be generated for a certain target if using two different molecular tags. Each of these molecular tags can only be used once. Multiple readouts of the same molecular tag do not increase the information about the target. In order to create an encoding scheme, a change of the target specific probe set after each round is required (SeqFISH) or multiple molecular tags must be present on the same probe set (like mer- FISH, intronSeqFISH).

[00101] Following the method according to the invention, different signals are achieved by using different decoding oligonucleotides reusing the same unique identifier (molecular tag) and a small number of different, mostly cost-intensive signal oligonucleo tides. This leads to several advantages in contrast to the other methods.

(1) The encoding scheme is not defined by the target specific probe set as it is the case for all other methods of prior art. Here the encoding scheme is transcribed by the decoding oligonucleotides. This leads to a much higher flexibility concerning the number of rounds and the freedom in signal choice for the codewords. Looking on the methods of prior art (e.g. merFISH or in tronSeqFISH), the encoding scheme (number, type and sequence of de tectable signals) for all target sequences is predefined by the presence of the different tag sequences on the specific probe sets (4 of 16 different tags per probe set in the case of merFISH and 5 of 60 different tags in the case of intron FISH). In order to produce a sufficient number of different tags per probe set, the methods use rather complex oligonucleotide designs with several tags present on one target specific oligonucleotide. In order to change the encoding scheme for a certain target nucleic acid, the specific probe set has to be replaced. The method according to the invention de scribes the use of a single unique tag sequence (unique identifier) per ana lyte, because it can be reused in every detection round to produce a new information. The encoding scheme is defined by the order of decoding oli gonucleotides that are used in the detection rounds. Therefore, the encod ing scheme is not predefined by the specific probes (or the unique tag se quence) but can be adjusted to different needs, even during the experi ment. This is achieved by simply changing the decoding oligonucleotides used in the detection rounds or adding additional detection rounds.

(2) The number of different signal oligonucleotides must match the number of different tag sequences with methods of prior art (16 in the case of mer FISH and 60 in the case of intronSeqFISH). Using the method according to the invention, the number of different signal oligonucleotides matches the number of different signals used. Due to this, the number of signal oligonu cleotides stays constant for the method described here and never exceeds the number of different signals but increases with the complexity of the en coding scheme in the methods of prior art (more detection rounds more dif ferent signal oligonucleotides needed). As a result, the method described here leads to a much lower complexity (unintended interactions of signal ol igonucleotides with environment or with each other) and dramatically re duces the cost of the assay since the major cost factor are the signal oligo nucleotides.

(3) In the methods of prior art, the number of different signals generated by a target specific probe set is restricted by the number of different tag se quences the probe set can provide. Since each additional tag sequence in creases the total size of the target specific probe, there is a limitation to the number of different tags a single probe can provide. This limitation is given by the size dependent increase of several problems (unintended inter- and intramolecular interactions, costs, diffusion rate, stability, errors during syn thesis etc.). Additionally, there is a limitation of the total number of target specific probes that can be applied to a certain analyte. In case of nucleic acids, this limitation is given by the length of the target sequence and the proportion of suitable binding sites. These factors lead to severe limitations in the number of different signals a probe set can provide (4 signals in the case of merFISH and 5 signals in the case of intronSeqFISH). This limita tion substantially affects the number of different code words that can be produced with a certain number of detection rounds. In the approach of the invention only one tag is needed and can be freely reused in every detec tion round. This leads to a low oligonucleotide complexity/length and at the same time to the maximum encoding efficiency possible (number of col- _ors ^{number of rounds}^ _{jhe vast} q _jff _{erences 0}f coding capacity of our method compared to the other methods is shown in Figures 1 and 5. Due to this in approach of the invention a much lower number of detection rounds is needed to produce the same amount of information. A lower number of de tection rounds is connected to lower cost, lower experimental time, lower complexity, higher stability and success rate, lower amount of data to be collected and analyzed and a higher accuracy of the results.

Coding capacity

[00102] All three methods compared in the Table 1 below use specific probe sets that are not denatured between different rounds of detection. For intronSeqFISH there are four detection rounds needed to produce the pseudo colors of one coding round, therefore data is only given for rounds 4, 8,12,16 and 20. The merFISH-method uses a constant number of 4 signals, therefore the data starts with the smallest number of rounds possible. After 8 detection rounds our method exceeds the maximum coding capacity reached with 20 rounds of merFISH (depicted with one asterisk) and after 12 rounds of detection the maximum coding capacity of intron FISH is exceeded (depicted with two asterisks). For the method according to the invention usage of 3 different signals is assumed (as is with intronSeqFISH).

Table 1 : Comparison of coding capacity [00103] As shown in Figure 6 the number of codewords for merFISH does not exponentially increase with the number of detection cycles but gets less effective with each added round. In contrast, the number of codewords for intronSeqFISH in the method according to the invention increases exponentially. The slope of the curve for the pro posed method is much higher than that of intron FISH, leading to more than 10,000 times more code words usable after 20 rounds of detection.

[00104] Note that this maximum efficiency of coding capacity is also reached in case of seqFISH, where specific probes are denatured after every detection round and a new probe set is specifically hybridized to the target sequence for each detection round. However, this method has major downsides to technologies using only one specific hybridization for their encoding scheme (all other methods):

(1) For the efficient denaturation of the specific probes, rather crude conditions must be used (high temperatures, high concentrations of denaturing agent, long incubation times) leading to much higher probability for the loss or the damage of the analyte.

(2) For each round of detection an own probe set has to be used for every tar get nucleic acid sequence. Therefore, the number of specific probes need ed for the experiment scales with the number of different signals needed for the encoding scheme. This dramatically increases the complexity and the cost of the assay.

(3) Because the hybridization efficiency of every target nucleic acid molecule is subject to some probabilistic effects, the fluctuations of signal intensity be tween the different detection rounds is much higher than in methods using only one specific hybridization event, reducing the proportion of complete codes.

(4) The time needed for the specific hybridization is much longer than for the hybridization of signal oligonucleotides or decoding oligonucleotides (as can be seen in the method parts of the intronSeqFISH, merFISH and se- qFISH publications), which dramatically increases the time needed to com plete an experiment.

[00105] Due to these reasons all other methods use a single specific hybridiza tion event and accept the major downside of lower code complexity and therefore the need of more detection rounds and a higher oligonucleotide design complexity.

[00106] The method according to the invention combines the advantages of se- qFISH (mainly complete freedom concerning the encoding scheme) with all advantages of methods using only one specific hybridization event while eliminating the major problems of such methods.

[00107] Note that the high numbers of code words produced after 20 rounds can also be used to introduce higher hamming distances (differences) between different codewords, allowing error detection of 1 , 2 or even more errors and even error correc tions. Therefore, even very high coding capacities are still practically relevant.

6. Selective denaturation, oligonucleotide assembly and reuse of unique identifiers are surprisingly efficient

[00108] A key factor of the method according to the invention is the consecutive process of decoding oligonucleotide binding, signal oligonucleotide binding, signal detec tion and selective denaturation. In order to generate an encoding scheme, this process has to be repeated several times (depending on the length of the code word). Because the same unique identifier is reused in every detection cycle, all events from the first to the last detection cycle are depending on each other. Additionally, the selective denaturation depends on two different events: While the decoding oligonucleotide has to be dissolved from the unique identifier with highest efficiency, specific probes have to stay hybridized with highest efficiency. [00109] Due to this the efficiency E of the whole encoding process can be de scribed by the following equation:

E = B _sp x (B _de x B _si x E _de x S _sp) ⁿ

E = total efficiency

B _sp= binding of specific probes

B _de= binding of decoding oligonucleotides

B _si= binding of signal oligonucleotides

E _de= elimination of decoding oligonucleotides

S _sp= stability of specific probes during elimination process

n = number of detection cycles

[00110] Based on this equation the efficiency of each single step can be esti mated for a given total efficiency of the method. The calculation is hereby based on the assumption, that each process has the same efficiency. The total efficiency describes the portion of successfully decodable signals of the total signals present.

[00111] The total efficiency of the method is dependent on the efficiency of each single step of the different factors described by the equation. Under the assumption of an equally distributed efficiency the total efficiency can be plotted against the single step efficiency as shown in Figure 7. As can be seen, a practically relevant total efficiency for an encoding scheme with 5 detection cycles can only be achieved with single step effi ciencies clearly above 90%. For example, to achieve a total efficiency of 50% an average efficiency within each single step of 97.8% is needed. These calculations are even based on the assumption of a 100% signal detection and analysis efficiency. Due to broad DNA melting curves of oligonucleotides of a variety of sequences, the inventors assumed prior to experiments that the selective denaturation would work less efficient for denaturation of decoding oligonucleotides and that sequence specific binding probes are not stable enough. In contrast to this assumption, we found a surprising effectiveness of all steps and a high stability of sequence specific probes during selective denaturation.

[00112] Experimentally, the inventors achieved a total decoding efficiency of about 30% to 65% based on 5 detection cycles. A calculation of the efficiency of each single step (Bsp, Bde, Bsi, Ede, Ssp) by the formula given above revealed an average efficiency of about 94.4% to 98%. These high efficiencies are very surprising and cannot easily be anticipated by a well-trained person in this field.

7. Experimental data

Background

100113] The experiment shows the specific detection of 10 to 50 different mRNAs species in parallel with single molecule resolution. It is based on 5 detection cycles, 3 different fluorescent signals and an encoding scheme without signal gaps and a hamming distance of 2 (error detection). The experiment proofs the enablement and functionality of the method according to the invention.

Oligonucleotides and their sequences

[00114] All oligonucleotide sequences used in the experiment (target specific probes, decoding oligonucleotides, signal oligonucleotides) are listed in the sequence listing of the appendix. The signal oligonucleotide R:ST05*O_Atto594 was ordered from biomers.net GmbH. All other oligonucleotides were ordered from Integrated DNA Tech nologies. Oligonucleotides were dissolved in water. The stock solutions (100 mM) were stored at -20°C.

Experimental overview

[00115] The 50 different target specific probe sets are divided into 5 groups. The name of the transcript to be detected and the name of the target specific probe set are the same (transcript variant names of www.ensemble.org). The term "new" indicates a revised probe design. All oligonucleotide sequences of the probe sets can be found in the se quence listing. The table lists the unique identifier name of the probe set as well as the names of the decoding oligonucleotides used in the different detection cycles. The result- ing code shows the sequence of fluorescent signals generated during the 5 detection cycles (G(reen)=Alexa Fluor 488, 0(range)= Atto 594, Y(ellow)= Alexa Fluor 546).

Table 2: Experimental overview

Variations of the experiment

[00116] Some variations of the experiment have been performed. Experiments 1 to 4 mainly differ in the number of transcripts detected in parallel. The groups listed as target specific probe sets refer to table 6. Experiments 5 to 8 are single round, single target controls for comparison with the decoded signals.

Table 3: Variations of the experiment

Experimental details

A. Seeding and cultivation of cells

HeLa cells were grown in HeLa cell culture medium to nearly 100% confluency.

The HeLa cell culture medium comprises DMEM (Thermo Fisher, Cat.: 31885) with 10% FCS (Biochrom, Cat.: S0415), 1 % Penicilliin-Streptomycin (Sigma-Adrich, Cat.: P0781) and 1 % MEM Non-Essential Amino Acids Solution (Thermo Fisher, Cat.: 11140035). After aspiration of cell culture medium, cells were trypsinized by incubation with trypsin EDTA solution (Sigma-Aldrich, Cat.: T3924) for 5 min at 37°C after a washing step with PBS (1 ,424 g/l Na ₂HP0 _4*2H ₂0, 0,276 g/l,

NaH ₂P0 _4*2H ₂0, 8,19 g/l NaCI in water, pH 7,4). Cells were then seeded on the wells of a m-Slide 8 Well ibidiT reat (Ibidi, Cat.: 80826). The number of cells per well was adjusted to reach about 50% confluency after adhesion of the cells. Cells were incubated over night with 200 pi HeLa cell culture medium per well. B. Fixation of cells

After aspiration of cell culture medium and two washing steps with 200 pi 37°C warm PBS per well, cells were fixed with 200 mI precooled methanol (-20°C, Roth, Cat.: 0082.1) for 10 min at -20°C.

C. Counterstaining with Sudan Black

Methanol was aspirated and 150 mI of 0.2% Sudan Black-solution diluted in 70% ethanol were added to each well. Wells were incubated for 5 min in the dark at room temperature. After incubation cells were washed three times with 400 mI 70% ethanol per well to eliminate the excess of Sudan Black-solution.

D. Hybridization of analyte/target-specific probes

Before hybridization, cells were equilibrated with 200 mI sm-wash-buffer. The sm- wash-buffer comprises 30 mM Na ₃Citrate, 300 mM NaCI, pH7, 10% formamide (Roth, Cat.:P040.1) and 5mM Ribonucleoside Vanadyl Complex (NEB, Cat.:

S1402S). For each target-specific probe set 1 mI of a 100 mM oligonucleotide stock solution was added to the mixture. The oligonucleotide stock solution comprises equimolar amounts of all target specific oligonucleotides of the corresponding tar get specific probe set. The total volume of the mixture was adjusted to 100 mI with water and mixed with 100 mI of a 2x concentrated hybridization buffer solution. The 2x concentrated hybridization buffer comprises 120 mM Na ₃Citrate, 1200 mM NaCI, pH7, 20% formamide and 20 mM Ribonucleoside Vanadyl Complex. The re sulting 200 mI hybridization mixture was added to the corresponding well and incu bated at 37°C for 2 h. Afterwards cells were washed three times with 200 mI per well for 10 min with target probe wash buffer at 37°C. The target probe wash buffer comprises 30 mM Na ₃Citrate, 300 mM NaCI, pH7, 20% formamide and 5 mM Ri bonucleoside Vanadyl Complex. E. Hybridization of decoding oligonucleotides

Before hybridization, cells were equilibrated with 200 mI sm-wash-buffer. For each decoding oligonucleotide 1 ,5 mI of a 5 mM stock solution were added to the mixture. The total volume of the mixture was adjusted to 75 mI with water and mixed with 75 mI of a 2x concentrated hybridization buffer solution. The resulting 150 mI decoding oligonucleotide hybridization mixture was added to the corresponding well and in cubated at room temperature for 45 min. Afterwards cells were washed three times with 200 mI per well for 2 min with sm-wash-buffer at room temperature.

F. Hybridization of signal oligonucleotides

Before hybridization, cells were equilibrated with 200 mI sm-wash-buffer. The signal oligonucleotide hybridization mixture was the same for all rounds of experiments 1 to 4 and comprised 0,3 mM of each signal oligonucleotide (see table A3) in 1x con centrated hybridization buffer solution. In each round 150 mI of this solution were added per well and incubated at room temperature for 45 min. The procedure was the same for experiments 5 to 8 with the exception that the final concentration of each signal oligonucleotide was 0,15 mM. Afterwards cells were washed three times with 200 mI per well for 2 min with sm-wash-buffer at room temperature.

G. Fluorescence and white light imaging

Cells were washed once with 200 mI of imaging buffer per well at room tempera ture. In experiments without Trolox (see table 7, last column) imaging buffer com prises 30 mM Na ₃Citrate, 300 mM NaCI, pH7 and 5mM Ribonucleoside Vanadyl Complex. In experiments with Trolox, imaging buffer additionally contains 10 % VectaCell Trolox Antifade Reagent (Vector laboratories, Cat.: CB-1000), resulting in a final Trolox concentration of 10 mM.

A Zeiss Axiovert 200M microscope with a 63x immersion oil objective (Zeiss, apo- chromat) with numerical aperture of 1.4, a pco.edge 4.2 CMOS camera (PCO AG) and an LED-light source (Zeiss, colibri 7) was used for imaging of the regions. Fil ter sets and LED-wavelengths were adjusted to the different optima of the fluoro- phores used. Illumination times per image were 1000 ms for Alexa Fluor 546 and Atto 594 and 400 ms for Alexa Fluor 488.

In each experiment, three regions were randomly chosen for imaging. For each re gion, a z-stack of 32 images was detected with a z-step size of 350 nm. Additional ly, one white light image was taken from the regions. In experiments with more than one detection cycle, the regions of the first detection round were found back and imaged in every subsequent round.

H. Selective denaturation

For selective denaturation, every well was incubated with 200 pi of sm-wash-buffer at 42°C for 6 min. This procedure was repeated six times.

[00117] Steps (E) to (H) were repeated 5 times in experiments 1 to 4. Step (H) was omitted for the 5 ^th detection cycle.

/. Analysis

Based on custom ImageJ-plugins a semi-automated analysis of the raw data was performed to distinguish the specific fluorescent signals from the background. The resulting 3D-point clouds of all three fluorescent channels were combined in silico with a custom VBA script. The resulting combined 3D-point clouds of the 5 detec tion cycles were aligned to each other on the basis of a VBA script. The resulting alignments revealed the code words for each unique signal detected. Successfully decoded signals were used for quantitative and spatial analysis of the experiments based on custom VBA-scripts and ImageJ-plugins. Results

1. Absolute numbers of decoded signals

The absolute numbers of successfully decoded signals for all transcripts are listed for each region of each experiment in the following Table 4. In summary, the sum of correct codes depicts the total number of decoded signals that were assigned to transcripts detectable in the corresponding experiment, while the sum of incorrect codes it the total number of decoded signals not detectable in the corresponding experiment. The total number of signals comprises successfully decoded as well as unsuccessfully decoded signals.

Table 4: Absolute num bers of decoded signals

[00118] Table 4 shows a very low number of incorrectly decoded signals com pared to the number of correctly decoded signals. The absolute values for decoded signals of a certain transcript are very similar between different regions of one experiment. The fraction of the total number of signals that can be successfully decoded is between 27.1 % and 64.5 %. This fraction depends on the number of transcripts and/or the total number of signals present in the respective region/experiment.

Conclusion

[00119] The method according to the invention produces a low amount of incor rectly assigned code words and can therefore be considered specific. The fraction of successfully decodable signals is very high, even with very high numbers of signals per region and very high numbers of transcripts detected in parallel. The high fraction of assignable signals and the high specificity make the method practically useful. 2. Comparison of relative transcript abundancies between different experiments

As shown in Figure 8 for both comparisons (A and B) the overlap of detected tran scripts between the experiments is used for the analysis. Each bar represents the mean abundance of all three regions of an experiment. The standard deviation be tween these regions is also indicated.

3. Correlation of relative transcript abundancies between different experiments

As can be seen in Figure 9 the mean relative abundances of transcripts from ex periment 1 are correlated to the abundances of the overlapping transcripts of ex periment 3, 4 and 2. The correlation coefficient as well as the formula for the linear regression are indicated for each correlation.

Figure 8 shows low standard deviations, indicating low variations of relative abun dances between different regions of one experiment. The differences of relative abundances between transcripts from different experiments are also very low. This is the case for the comparison of transcripts from group 1 (Fig. 8A), that were de tected in experiments 1 , 2 and 3. It is also the case for the comparison of the tran scripts from groups 2, 3 and 4 that were overlapping between experiments 1 , 2 and 4. The very high correlation of these abundances can also be seen in Figure 9. The abundances of transcripts from experiment 1 correlate very well with the abundances of the other multi round experiments. The correlation factors are be tween 0.88 and 0.91 , while the slope of the linear regressions is between 0.97 and 1.05.

Conclusion

[00120] The relative abundancies of transcripts correlate very well between dif ferent regions of one experiment but also between different experiments. This can be clearly seen by the comparisons of Figures 3 and 4. The main difference between the experiments is the number of different targets and hence the total number of signals detected. Therefore, the number of transcripts detected as well as the number and density of signals does not interfere with the ability of the method to accurately quantify the number of transcripts. The very good correlations further support the specificity and stability of the method, even with very high numbers of signals.

4. Comparison of intercellular distribution of signals

In Figure 10 the maximum projections of image stacks are shown. A: region 1 of experiment 7 (single round, single transcript experiment detecting SPOCK1), B: 2D-projection of all selected signals from experiment 1 , region 1 assigned to SPOCK1 , C: region 1 of experiment 8 (single round, single transcript experiment detecting THRAP3), D: 2D-projection of all selected signals from experiment 1 , re gion 1 assigned to THRAP3.

5. Comparison of intracellular distribution of signals

In Figure 11 the maximum projections of image stacks are shown. Magnified sub regions of the corresponding regions are shown. A: region 1 of experiment 8 (sin gle round, single transcript experiment detecting THRAP3), B: 2D-projection of se lected signals from experiment 1 , region 1 assigned to THRAP3, C: region 1 of ex periment 5 (single round, single transcript experiment detecting DDX5), D: 2D- projection of all selected signals from experiment 1 , region 1 assigned to DDX5.

Figure 10 shows huge differences of intercellular distributions between different transcripts. SPOCK1 seems to be highly abundant in some cells but nearly absent in other cells (Figure 10 A). THRAP3 shows a more uniform distribution over all cells of a region (Figure 10 C). These spatial distribution patterns can also clearly be observed with the point clouds assigned to the corresponding transcripts from experiment 1 (Figure 10 B and D).

Figure 11 shows huge differences of intracellular distributions between different transcripts. THRAP3 can be mainly observed in the periphery (cytoplasm) of the cells (Figure 11 A), while DDX5 shows a higher abundance in the center (nucleus) of the cells (Figure 11 C). These intracellular distributions can also be observed with the point clouds of experiment 1 assigned to THRAP3 and DDX5 (Figure 11 B and D).

Conclusion

[00121] Next to the reliability of quantification, the point clouds of multi round ex periments also show the same intracellular and intercellular distribution patterns of transcripts. This is clearly proven by the direct comparison of the assigned point clouds with signals from single round experiments detecting only one characteristic mRNA- species.

6. Distribution pattern of different cell cycle dependent transcripts

All images of Figure 12 show region 1 of experiment 1. In each image, a point cloud is shown, that is assigned to a certain transcript, A: CCNA2, B: CENPE, C: CCNE1 , D: all transcripts. Figure 12 shows the transcripts of three different cell cy cle dependent proteins. CENPE (Figure 12 B) is also known as Centromere pro tein E and accumulates during G2 phase. It is proposed to be responsible for spin dle elongation and for chromosome movement. It is not present during interphase. CCNA2 (Figure 12 A) is also known as Cyclin A2. It regulates the cell cycle pro gression by interacting with CDK1 during transition from G2 to M-phase. Interest ingly there is an obvious colocalization of both mRNA-species. They are mainly present in the three central cells of region 1. CCNE1 (Figure 12 C) is also known as Cyclin E1. This cyclin interacts with CDK2 and is responsible for the transition from G1 to S-phase. Figure 12 shows clearly, that the transcripts of this gene are not present in the three central cells, but quite equally distributed over the other cells. It therefore shows an anti-localization to the other two transcripts. The data for the corresponding point-clouds are derived from a point cloud with a very high number of points and a very high point density (Figure 12 D gives an impression). Conclusion

[00122] The three decoded point clouds of cell cycle dependent proteins shown in Figure 12, show distribution patterns that can be explained by their corresponding function. These data strongly suggest, that our method reliably produces biological relevant data, even with a low number of signals per cell (Figure 12 C) and with very high signal densities (Figure 12 D).

Sequence listing

[00123] In the accompanying sequence listing SEQ ID Nos. 1-1247 refer to nu cleotide sequences of exemplary target-specific oligonucleotides. The oligonucleotides listed consist of a target specific binding site (5'-end) a spacer/linker sequence (gtaac or tagac) and the unique identifier sequence, which is the same for all oligonucleotides of one probe set.

[00124] In the accompanying sequence listing SEQ ID Nos. 1248-1397 refer to nucleotide sequences of exemplary decoding oligonucleotides.

[00125] In the accompanying sequence listing SEQ ID Nos. 1398-1400 refer to the nucleotide sequences of exemplary signal oligonucleotides. For each signal oligonu cleotide the corresponding fluorophore is present twice. One fluorophore is covalently linked to the 5'-end and one fluorophore is covalently linked to the 3'-end. SEQ ID No. 1398 comprises at its 5' terminus "5Alex488N", and at its 3' terminus "3AlexF488N". SEQ ID No. 1399 comprises at its 5' terminus "5Alex546", and at its 3' terminus 3Alex546N. SEQ ID No. 1400 comprises at its 5' terminus and at its 3' terminus "Atto594".

Previous Patent: GASOLINE FUEL COMPOSITION

Next Patent: SYSTEM FOR SCREENING BALLAST WITH TWO SCREENS