Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
DNA-ENCODED CHEMICAL LIBRARY, USE THEREOF AND METHOD TO SYNTHESIZE THE LIBRARY
Document Type and Number:
WIPO Patent Application WO/2017/108741
Kind Code:
A1
Abstract:
The present invention relates to a compound library comprising a plurality of conjugate molecules, wherein said conjugates comprise a small organic molecule covalently coupled to a nucleic acid moiety, wherein the nucleic acid moiety consists of pyrimidine nucleotides and the plurality of conjugates differ from each other at least by comprising different organic molecules. Further, the present invention relates to the use of said library for screening compounds binding to a target molecule and methods of synthesizing said library.

Inventors:
BRUNSCHWEIGER ANDREAS (DE)
KRAUSE NORBERT (DE)
ANTONCHICK ANDREY (DE)
KLIKA SKOPIC MATEJA (DE)
SALAMON HAZEM (DE)
BUGAIN OLIVIA (DE)
JUNG KATHRIN (DE)
WAGNER BERND (DE)
Application Number:
PCT/EP2016/081845
Publication Date:
June 29, 2017
Filing Date:
December 20, 2016
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
TECHNISCHE UNIVERSITÄT DORTMUND (DE)
International Classes:
C40B40/06; C07H21/00
Domestic Patent References:
WO2012071428A22012-05-31
WO1993020242A11993-10-14
WO2003084982A22003-10-16
Other References:
PENG ZHAO ET AL: "Solid-phase synthesis and evaluation of TAR RNA targeted [beta]-carboline-nucleoside conjugates", ORGANIC & BIOMOLECULAR CHEMISTRY, vol. 6, no. 20, 1 January 2008 (2008-01-01), GB, pages 3741, XP055260950, ISSN: 1477-0520, DOI: 10.1039/b809598a
Attorney, Agent or Firm:
VIERING, JENTSCHURA & PARTNER MBB (DE)
Download PDF:
Claims:
CLAIMS

1. Compound library comprising a plurality of conjugate molecules, wherein said conjugates comprise a small organic molecule covalently coupled to a nucleic acid moiety, wherein the nucleic acid moiety consists of pyrimidine nucleotides and the plurality of conjugates differ from each other at least by comprising different organic molecules.

2. The compound library according to claim 1, wherein the plurality of conjugate molecules comprises at least ten conjugate molecules, preferably at least 50 conjugate molecules, more preferably at least 100 conjugate molecules.

3. The compound library according to claim 1 or 2, wherein

(a) the nucleic acid moiety consists of two to 20, preferably three to ten, more preferably six nucleotides;

(b) the nucleic acid moiety consists of thymidine and/or cytosine nucleosides; and/or

(c) the nucleic acid moiety comprises a sequence of at least six thymidine and/or cytosine nucleotides directly attached to the small organic molecule.

4. The compound library according to any one of claims 1 to 3, wherein the nucleic acid moiety is RNA, DNA or a mixture thereof.

5. The compound library according to claim 4, wherein the nucleic acid moiety is DNA.

6. The compound library according to any one of claims 1 to 5, wherein the nucleic acid moiety and the organic molecule are linked by amide bond.

7. The compound library according to any one of claims 1 to 6, wherein

a) the conjugates further comprise a linker between the nucleic acid moiety and the organic molecule, preferably this linker is polyethylene glycol (PEG);

b) the conjugates differ from each other by comprising different nucleic acid moieties;

c) the organic molecule consists of 2 or more carbon atoms, preferably up to 50 carbon atoms, more preferably up to 30 carbon atoms; and/or

d) the organic molecule has a molecular weight of at most 900 daltons, preferably at most 700 daltons and more preferably at most 500 daltons.

8. The compound library according to any one of claims 1 to 6, wherein the pyrimidine nucleotides are selected from the group consisting of thymine, cytosine, uracil, 4-acetylcytidine, 5- (carboxyhydroxymethyl)uridine, 2' -O-methylcytidine, 5-carboxymethylaminomethyl-2-thiouridine, 5- carboxymethylaminomethyluridine, dihydrouridine, 2'-0-methylpseudouridine, 1- methylpseudouridine, 3-methylcytidine, 5-methylcytidine, 5-methylaminomethyluridine, 5- methoxyaminomethyl-2-thiouridine, 5-methoxycarbonylmethyl-2-thiouridine, 5- methoxycarbonylmethyluridine, 5-methoxyuridine, uridine-5-oxyacetic acid-methylester, uridine-5- oxyacetic acid, pseudouridine, 2-thiocytidine, 5-methyl-2-thiouridine, 2-thiouridine, 4-thiouridine, 5- methyluridine, 2'-0-alkyluridine, 2'-0-alkylthymidine, 2'-0-alkylcytidine and 3-(3-amino-3-carboxy- propyl)uridine. 9. Method for synthesizing a compound library, wherein said library comprises a plurality of conjugate molecules, wherein said conjugates comprise a small organic molecule covalently coupled to a nucleic acid moiety, the synthesis of each conjugate molecule comprises:

- reacting a nucleic acid consisting of pyrimidine nucleotides with an organic molecule under conditions to allow the conjugation of said molecules,

wherein the plurality of conjugates differ from each other at least by comprising different organic molecules.

10. The method according to claim 9, wherein the conjugation step of claim 9 comprises:

a) reacting the nucleic acid consisting of pyrimidine nucleotides with a first organic intermediate molecule under conditions to form a conjugate of the nucleic acid and the first organic intermediate molecule; and

b) reacting the conjugate of the nucleic acid and the first organic intermediate molecule with a second organic intermediate molecule to form the conjugate molecule. 11. The method according to claim 9 or 10, wherein the conjugate molecule and/or the conjugate of the nucleic acid and the first organic intermediate molecule is further used in a reaction comprising a) a strong acid, preferably an acid having a pKa of less than 3; and/or

b) a catalyst selected from the group consisting of copper, silver, gold, ruthenium, iron, osmium, cobalt, rhodium, iridium, nickel, palladium, platinum and derivatives thereof, preferably copper, silver, gold, ruthenium and derivatives thereof.

12. The method according to any one of claims 9 to 11 , wherein the nucleic acid moiety a) consists of two to 20, preferably three to ten, more preferably six nucleotides;

b) is RNA, DNA or a mixture thereof, preferably DNA;

c) is elongated in an additional reaction step with pyrimidine and/or purine nucleotides after the conjugation step; and/or

d) consists of thymidine and/or cytosine nucleosides; and/or (e) comprises a sequence of at least six thymidine and/or cytosine nucleotides directly attached to the small organic molecule.

13. The method according to any one of claims 9 to 12, wherein the nucleic acid moiety and the organic molecule are linked by amide bond.

14. The method according to any one of claims 9 to 13, wherein

a) the conjugates further comprise a linker between the nucleic acid moiety and the organic molecule, preferably this linker portion is polyethylene glycol (PEG);

b) the conjugates differ from each other by comprising different nucleic acid moieties;

c) the organic molecule consists of 2 or more carbon atoms, preferably up to 50 carbon atoms, more preferably up to 30 carbon atoms;

d) the organic molecule has a molecular weight of at most 900 daltons, preferably at most 700 daltons and more preferably at most 500 daltons; and/or

e) the pyrimidine nucleotides are selected from the group consisting of thymine, cytosine, uracil, 4- acetylcytidine, 5-(carboxyhydroxymethyl)uridine, 2'-0-methylcytidine, 5- carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluridine, dihydrouridine, 2'- O-methylpseudouridine, 1 -methylpseudouridine, 3 -methyl cytidine, 5-methylcytidine, 5- methylaminomethyluridine, 5-methoxyaminomethyl-2-thiouridine, 5-methoxycarbonylmethyl-2- thiouridine, 5-methoxycarbonylmethyluridine, 5-methoxyuridine, uridine-5-oxyacetic acid- methylester, uridine-5-oxyacetic acid, pseudouridine, 2-thiocytidine, 5-methyl-2-thiouridine, 2- thiouridine, 4-thiouridine, 5-methyluridine, 2'-0-alkyluridine, 2'-0-alkylthymidine, 2'-0- alkylcytidine and 3-(3-amino-3-carboxy-propyl)uridine. 15. Use of a compound library according to any one of claims 1 to 8 for screening a compound binding to a target molecule, preferably the target molecule is a protein.

Description:
DNA-ENCODED CHEMICAL LIBRARY, USE THEREOF AND METHOD TO

SYNTHESIZE THE LIBRARY

FIELD OF THE INVENTION

The present invention lies in the field of medicinal chemistry and chemical biology and relates to a compound library comprising a plurality of conjugate molecules, wherein said conjugates comprise a small organic molecule covalently coupled to a nucleic acid moiety, wherein the nucleic acid moiety consists of pyrimidine nucleotides and the plurality of conjugates differ from each other at least by comprising different organic molecules. Further, the present invention relates to the use of said library for screening compounds binding to a target molecule and methods of synthesizing said library.

BACKGROUND OF THE INVENTION DNA-encoded chemical libraries (DECLs) represent a tool for drug discovery. DECL technology allows the synthesis and screening of chemical libraries of unprecedented size at moderate costs. DECLs feature the display of individual small organic chemical moieties on DNA fragments serving as amplifiable identification barcodes. The DNA-tag allows the simultaneous screening of a veiy large set of compounds (up to billions of molecules), because the hit compounds can easily be identified and quantified by PCR-amplification and sequencing of the DNA-barcode. Several approaches have been used to generate DECLs. Most common is the combinatorial mix-and-split synthesis strategy in which preparative organic synthesis and encoding steps are performed in iterative manner.

A prerequisite for the library synthesis is the compatibility of DNA with the synthesis methodology used for synthesis of the organic chemical moieties. Currently, reaction methodology meeting this requirement is veiy limited. Therefore, DECLs are biased towards a certain chemical space, contradicting the endeavor to design screening libraries to cover chemical space as broad as possible. A current challenge for DECL synthesis research is the development of novel synthetic schemes that furnish in DNA-compatible manner DNA-conjugates of small and geometrically defined (rigid) scaffolds which serve as starting points for subsequent combinatorial library synthesis.

The synthesis of such molecules would result in molecules having a drug-like structure. These druglike structures include, but are not limited to heterocyclic structures. Surprisingly, the present inventors found a synthesis strategy that enables the synthesis of several (hetero)cyclic structures from different starting materials attached to a specific DNA sequence by various catalytic methods. Fundamental to the present invention is the observation that DNA- sequences that contain only pyrimidine nucleotides are tolerant to a broad spectrum of reaction conditions, such as the use of strong acids or transition metals as catalysts, while, as commonly known in the field, DNA-molecules comprising purines are degraded under the same reaction conditions by depurination and subsequent strand fragmentation of the resulting abasic sites. The possibility to use a broad spectrum of reaction conditions allows the synthesis of several heterocyclic structures attached to DNA-sequences. In turn, this possibility to apply a broader spectrum of chemical reaction conditions results in the synthesis of molecules that have a drug-like structure.

SUMMARY OF THE INVENTION

In a first aspect, the present invention relates to a compound library comprising a plurality of conjugate molecules, wherein said conjugates comprise a small organic molecule covalently coupled to a nucleic acid moiety, wherein the nucleic acid moiety consists of pyrimidine nucleotides and the plurality of conjugates differ from each other at least by comprising different organic molecules.

In various embodiments, the plurality of conjugate molecules comprises at least ten molecules, preferably at least 50 different molecules, more preferably at least 100 different molecules.

In various other embodiments, the nucleic acid moiety consists of two to 30, preferably three to ten, more preferably six nucleotides. In preferred embodiments, the nucleic acid moiety consists of thymidine and/or cytosine nucleosides. In more preferred embodiments, the nucleic acid moiety consists of six thymidine nucleosides. It is called "hexT". Further, in alternative embodiments, the nucleic acid moiety comprises a sequence of at least six thymidine and/or cytosine nucleotides directly attached to the small organic molecule.

In still various other embodiments, the nucleic acid moiety is RNA, DNA or a mixture thereof. In more preferred embodiments, the nucleic acid portion is DNA.

In various embodiments, the nucleic acid moiety and the organic molecule are linked by amide bond.

In further embodiments, the present invention relates to a compound library wherein the conjugates differ from each other by comprising different nucleic acid moieties.

In various embodiments, the conjugate molecules of the library further comprise a linker portion between the nucleic acid moiety and the organic moiety molecule, preferably this linker portion is polyethylene glycol (PEG). In various other embodiments, the organic molecule consists of 2 or more carbon atoms, preferably up to 50 carbon atoms, more preferably up to 30 carbon atoms. In other various embodiments, the organic molecule has a molecular weight of at most 900 daltons, preferably at most 700 daltons and more preferably at most 500 daltons.

In still various other embodiments, the pyrimidine nucleotides are selected from the group consisting of thymidine, cytidine, uridine, 4-acetylcytidine, 5-(carboxyhydroxymethyl)uridine, 2'-0- methylcytidine, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluridine, dihydrouridine, 2'-0-methylpseudouridine, 1 -methylpseudouridine, 3-methylcytidine, 5- methylcytidine, 5-methylaminomethyluridine, 5-methoxyaminomethyl-2-thiouridine, 5- methoxycarbonylmethyl-2-thiouridine, 5-methoxycarbonylmethyluridine, 5-methoxyuridine, uridine- 5-oxyacetic acid-methylester, uridine-5-oxyacetic acid, pseudouridine, 2-thiocytidine, 5-methyl-2- thiouridine, 2-thiouridine, 4-thiouridine, 5-methyluridine, 2'-0-alkyluridine, 2'-0-alkylthymidine, 2'- O-alkylcytidine and 3-(3-amino-3-carboxy-propyl)uridine.

In a further aspect, the present invention relates to a method for synthesizing a compound library, wherein said library comprises a plurality of conjugate molecules, wherein said conjugates comprise a small organic molecule covalently coupled to a nucleic acid moiety, the synthesis of each conjugate molecule comprises: reacting a nucleic acid consisting of pyrimidine nucleotides with an organic molecule under conditions that allow the conjugation of said molecules, wherein the plurality of conjugate molecules differ from each other at least by comprising different organic molecules.

In various embodiments of the method, the conjugate molecule is further used in a reaction comprising a) a strong acid, preferably an acid having a pK a of less than 3; and/or b) a catalyst selected from the group consisting of copper, silver, gold, ruthenium, iron, osmium, cobalt, rhodium, iridium, nickel, palladium, platinum and derivatives thereof, preferably copper, silver, gold, ruthenium and derivatives thereof.

In still various embodiments of the method, the nucleic acid moiety is elongated in an additional reaction step with pyrimidine and/or purine nucleotides after the conjugation step.

In a third aspect, the present invention relates to the use of a compound library of the present invention for screening a compound binding to a target molecule.

In various embodiments, the target molecule is a protein. It is understood that all combinations of the above disclosed embodiments are also intended to fall within the scope of the present invention.

BRIEF DESCRIPTION OF THE DRAWINGS

The invention will be better understood with reference to the detailed description when considered in conjunction with the non-limiting examples and the accompanying drawings.

Figure 1 shows the synthesis of DNA-amide conjugates. Figure 2 shows the removal of the Fmoc-protective group from DNA-amide conjugates.

Figure 3 shows the synthesis of the 3' -5 '-hexathymidine-5' -tryptophane conjugate 2.

Figure 4 shows the synthesis of the 3 '-5' -hexathymidine-5 '-β-carboline conjugates 3a-3p.

Figure 5 (a) shows the HPLC chromatogram of compound 2; Figure 5 (b) shows the HPLC chromatogram of compound 4a; Figure 5 (c) shows the MALDI-MS analysis of compound 4a.

Figure 6 shows the ring-closing metathesis of a 3 '-5 '-hexathymidine-5 '-peptide conjugate catalyzed by the ruthenium catalyst bis(tricyclohexylphosphine)benzylidine ruthenium(IV) dichloride yielding 6a.

Figure 7 (a) shows the HPLC chromatogram of compound 5; Figure 8 (b) shows the HPLC chromatogram of compound 6a; Figure 8 (c) shows the MALDI-MS analysis of compound 6a.

Figure 8 shows the synthesis of the 3 '-5' -hexathymidine-5 '-PEG-linker conjugate 9.

Figure 9 shows the synthesis of the 3 '-5 '-hexathymidine-5 '-aldehyde conjugate 11.

Figure 10 shows the gold-catalyzed synthesis of the 3 '-5' -hexathymidine-5 '-6-oxa- 1,2- diazaspiro[4.4]nonane-conjugate 14. Figure 11 (a) shows the HPLC chromatogram of the 3'-5'-hexathymidine-5'-conjugate 11; Figure 13 (b) shows the HPLC chromatogram of DNA-conjugate 14; Figure 13 (c) shows the MALDI-TOF analysis of the 3'-5'-hexathymidine-5'-conjugate 14, expected mass= 2640,14; found mass= 2639,45. Figure 12 shows the synthesis of the 3'-5'-hexathymidine-5'-alkyne conjugate 16.

Figure 13 shows the gold-catalyzed synthesis of 3 '-5'-hexathymidine-5'-pyrazoline-conjugate 19. Figure 14 (a) shows the HPLC chromatogram of the 3 '-5 '-hexathymidine-5 '-conjugate 16; Figure 14 (b) shows the HPLC chromatogram of DNA-conjugate 19; Figure 14 (c) shows the MALDI-TOF analysis of hexathymidine-5 '-conjugate 19, expected mass= 239,71 ; found mass= 2392,256.

Figure 15 shows the synthesis of tert-butyl 4-(3-benzylidene-2-oxoindolin-l-yl)butanoate 25

Figure 16 shows the synthesis of the 3 '-5 '-hexathymidine-5 '-oxindole conjugate 27.

Figure 17 shows the Ag (I) or Cu(I)-catalyzed synthesis of the 3 '-5' -hexathymidine-5 '-spirooxindole conjugate 29a.

Figure 18 (a) shows the HPLC chromatogram of compound 27; Figure 20 (b) shows the HPLC chromatogram of compound 29a; Figure 20 (c) shows the MALDI-TOF analysis of the hexathymidine-5 '-conjugate 29a, expected mass= 2733,09; found mass= 2733,014.

Figure 19 shows the ligation ofa coding DNA to the adapter nucleic acid "hexT. Figure 20 shows the ligation of a second coding DNA sequence to the encoded hexT. Figure 21 shows the agarose gel electrophoresis of ligations 1 and 2 having the following arrangement: L: DNA ladder; lane 1: oligonucleotides I, II (phosphorylated), and III; lane 2: T4 DNA ligase reaction yielding a duplex of oligonucleotides III, IV; lane 3: oligonucleotides III (phosphorylated), IV, V (phosphorylated), and VI; lane 4: T4 DNA ligase reaction yielding a duplex of oligonucleotides VII, VIII 5,5% agarose gel; 75minutes; 150V; 25pmol per lane;gel stain: Midori Green Direct (Biozym).

Figure 22 shows an EcoRI digest and result of MALDI-TOF analysis.

Figure 23 shows the comparison of DNA-encoded chemistry with TiDEC (TiDEC, oligoThymidine- initiated DNA-Encoded Chemistry). A) Synthesis of a DNA-encoded compound by alternated synthesis and encoding steps; a) conjugation of a building block; b) encoding by e.g. DNA ligation. B) TiDEC; a) conjugation of starting materials to hexT; b) synthesis of target heterocycles by acid or coinage metal catalysis; c) isolation of the hexT conjugates; d) encoding by DNA ligation. Figure 24 shows exploration of reaction conditions for the synthesis of the hexT-p-carboline hexT- 3CR; a) see table; [a] b) aqueous NH 3 /aqueous MeNH 2 . Figure 25 shows synthesis and encoding of hexT-p-carboline conjugates hexT-3A-3DJ. a) HATU, DMF; b) piperidine, DMF; c) 1% TFA, CH 2 C1 2 ; d) aqueous NH 3 /aqueous MeNH 2 ; e) T4 ligase.

Figure 26 shows total yield, conversion rates, and MALDI MS data of hexT-conjugates 3A-3DJ. a) 1% TFA, CH2CI2; b) aqueous NH 3 /aqueous MeNH 2 .

Figure 27 shows exploration of reaction conditions for the synthesis of the hexT-pyrazol(in)es hexT- 5AJ and hexT-6AJ. a) see table; [a] b) aqueous NH 3 /aqueous MeNH 2 .

Figure 28 shows synthesis and encoding of hexT-pyrazolines hexT-12 and pyrazoles hexT-13. a) HATU, DMF; b) A (Figure 27), MeCN, 50°C for aliphatic aldehydes; B (Figure 27), AcOH, 50°C for aromatic aldehydes; c) aqueous NH 3 /aqueous MeNH 2 ; d) B (Figure 27), AcOH, 60°C; e) T4 ligase.

Figure 29 shows yield, conversion rates, and MALDI-MS data of compounds hexT-12F-12DI. a) either (Figure 27), MeCN, 50°C, or B (Figure 27), AcOH, 50°C; b) aqueous NH 3 /aqueous MeNH 2 .

Figure 30 shows yield, conversion rates, and MALDI MS data of compounds hexT-13F-13DI. a) B (Figure 27), AcOH, 60°C; b) aqueous NH 3 /aqueous MeNH 2 .

DETAILED DESCRIPTION OF THE INVENTION

The present inventors surprisingly found that only purine nucleotides, but not pyrimidine nucleotides, are degraded under several conditions used for the synthesis of heterocyclic chemical compounds (for example reactions involving the use of strong acids or transition metals such as Au(I), Ag(I), Ru(II), and Cu(I) as catalysts). Based on these results, the present inventors concluded that chemical compounds being labeled with DNA can be reacted to a broad spectrum of chemical structures, including but not limited to heterocycles, if the DNA-label only consists of pyrimidine nucleotides. At a later stage, after the organic moiety of the compound has been reacted to its final chemical structure, it is also possible to elongate the DNA-label with further nucleotides including purine nucleotides.

Thus, in a first aspect, the present invention relates to a compound libraiy comprising a plurality of conjugate molecules, wherein said conjugates comprise a small organic molecule covalently coupled to a nucleic acid moiety, wherein the nucleic acid moiety consists of pyrimidine nucleotides and the plurality of conjugates differ from each other at least by comprising different organic molecules.

The term "compound", as used herein, relates to a substance formed when two or more chemical elements are chemically bonded together. A typical type of chemical bond in the compounds of the present invention is a covalent bond. However, atoms within the compound may also be bound to each other by ionic bonds, metallic bonds and/or coordinate covalent bonds. "Compounds" refer generally to molecules potentially capable of structural interactions with cellular constituents through non- covalent interactions, such as, for example, through hydrogen bonds, ionic bonds, van der Waals attractions, or hydrophobic interactions. For example, compounds will most typically include molecules with moieties necessary for structural interaction with proteins, glycoproteins, and/or other macromolecules. "Compound library", as used herein, refers to any collection of agents/compounds that includes a plurality of molecular structures. Compound libraries can include, for example, combinatorial chemical libraries, natural products libraries, and peptide and cyclized peptide libraries. Compound libraries of the present invention contain compounds that comprise a small organic molecule covalently coupled to a nucleic acid moiety. Such libraries are known as DNA-encoded chemical libraries (DECLs).

"Plurality", as used herein, is defined as two or more than two, in particular 2, 3, 4, 5, 6, 7, 8, 9, 10 or more, more preferably 100 or more, 500 or more, 1000 or more, 1500 or more, 3000 or more, 5000 or more, 10000 or more or 50000 or more. In various embodiments, the library of the present invention comprises at least ten molecules, preferably at least 50 different molecules, more preferably at least 100 different molecules. The term "conjugate molecule", as used herein, refers to a compound comprising two or more molecules (e.g., proteins, carbohydrates, organic molecules or nucleic acid molecules) that are chemically linked. The two or more molecules desirably are chemically linked using any suitable chemical bond (e.g., covalent bond). Suitable chemical bonds are well known in the art and include amide bonds, disulfide bonds, thioester bonds, and ester bonds. In various embodiments, the compounds comprise a small organic molecule covalently coupled to a nucleic acid moiety. Preferably, the nucleic acid moiety and the organic molecule are linked by amide bond. The term "organic molecule" refers to carbon-based chemicals, including small organic molecules and macromolecules, as well as derivatives and analogues thereof. The organic molecules may display biological activity such as, but not limited to pharmaceutical, antibiotic, pesticidical, herbicidical, or fungicidical activity. Preferably, the organic molecule contains a heterocyclic structure. The term "heterocyclic," as used herein, means an aromatic or non-aromatic saturated mono-, bi-, or tricyclic ring system having 2 to 14 ring carbon atoms, containing 1-5 ring atoms chosen from NH, N— (CO)— Ci-6-alkyl, NCi-e-alkyl, O, S02, S, Br, CI, P, or PO4 "3 alone or in combination. Bi- and tricyclic heterocyclic groups are fused at 2 or 4 points or joined at one point via a bond or a heteroatom linker (O, S, NH, or N(Ci-C6 alkyl). The mono-, bi-, or tricyclic heterocyclic can be optionally substituted on the ring by replacing an available hydrogen on the ring by one or more substituents which may be the same or different, each being independently selected from the groups defined below. There are no adjacent oxygen and/or sulfur atoms present in the ring system. The nitrogen or sulfur atom of the heterocyclic ring system can be optionally oxidized to the corresponding N-oxide, S-oxide or S- dioxide. Non-limiting examples of suitable heterocyclic ring system include furanyl, imidazolyl, isoxazolyl, oxadiazolyl, oxazolyl, pyrazolyl, pyrrolyl, pyridyl, pyrimidyl, pyridazinyl, thiazolyl, triazolyl, tetrazolyl, thienyl, carbazolyl, benzimidazolyl, benzothienyl, benzofuranyl, indolyl, quinolinyl, benzotriazolyl, benzothiazolyl, benzooxazolyl, benzimidazolyl, isoquinolinyl, isoindolyl, acridinyl, or benzoisoxazolyl. Non-limiting examples of suitable heterocyclic rings include also aziridinyl, piperidinyl, pyrrolidinyl, piperazinyl, tetrahydropyranyl, tetrahydrofuranyl, tetrahydrothiophenyl, morpholinyl, thiomorpholinyl and the like. Also included within the scope of the term "heterocyclic" as it is used herein is a group in which the heterocyclic ring is fused at two points or joined at one point via a bond or a heteroatom linker (O, S, NH, or N(Ci-C6 alkyl), to one aromatic, or cycloalkyl ring, non-limiting examples include isoindoline-l,3-dione l-methyl-2 -phenyl- 1H- pyrazole-3(2H)-one, indoline, pyridoindole and the like. The "organic molecule", as used herein, may also refer to molecules of different classes such as small molecules conforming or not conforming to Lipinski's rule of five, (cyclic) peptides, proteins, nucleotides, lipids, sugars and derivatives thereof. In one embodiment, the organic molecule of the present invention can be selected from the group comprising small molecules conforming or not conforming to Lipinski's rule of five, (cyclic) peptides, proteins, nucleotides and mixtures thereof.

"Small" in the context of the term "organic molecule", as used herein, relates to compounds that consist of 2 or more carbon atoms, preferably up to 50 carbon atoms, more preferably up to 30 carbon atoms. In other various embodiments, the organic molecule has a molecular weight of at most 1500daltons, preferably at most 700 daltons and more preferably at most 500 daltons.

The term "nucleic acid moiety", as used herein, refers to a nucleic acid sequence that is covalently coupled to an organic moiety, as defined above, to form a conjugate molecule. The terms "nucleotide", "nucleic acid molecule" or "nucleic acid sequence", as interchangeably used herein, relate to DNA (deoxyribonucleic acid) molecules, RNA (ribonucleic acid) molecules or molecules comprising both, DNA and RNA. Said molecules may appear independent of their natural genetic context and/or background. The term "nucleic acid molecule/sequence" further refers to the phosphate ester polymeric fonn of ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA molecules") or deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or deoxycytidine; "DNA molecules"), or any phosphoester analogs thereof, such as phosphorothioates and thioesters, in either single stranded form, or a double-stranded helix. Double stranded DNA-DNA, DNA-RNA and RNA- RNA helices are possible. The term nucleic acid molecule, and in particular DNA or RNA molecule, refers only to the primary and secondary structure of the molecule, and does not limit it to any particular tertiary forms.

"RNA" or "ribonucleic acid", as interchangeably used herein, relates to a chain of nucleotides wherein the nucleotides contain the sugar ribose and bases selected from the group of adenine (A), cytosine (C), guanine (G), or uracil (U). "DNA" or "deoxyribonucleic acid", as interchangeably used herein, relates to a chain of nucleotides wherein the nucleotides contain the sugar 2'-deoxyribose and bases selected from adenine (A), guanine (G), cytosine (C) and thymine (T). The term "mRNA" refers to messenger RNA. The term "base" or "nucleobase", as interchangeably used herein, relates to nitrogen-containing biological compounds (nitrogenous bases) found linked to a sugar within nucleosides - the basic building blocks of deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). Their ability to form base pairs and to stack upon one another lead directly to the helical structure of DNA and RNA. The primary, or canonical, nucleobases are cytosine (DNA and RNA), guanine (DNA and RNA), adenine (DNA and RNA), thymine (DNA) and uracil (RNA), abbreviated as C, G, A, T, and U, respectively. Because A, G, C, and T appear in the DNA, these molecules are called DNA-bases; A, G, C, and U are called RNA-bases. Uracil and thymine are identical except that uracil lacks the 5' methyl group. Adenine and guanine belong to the double-ringed class of molecules called purines (abbreviated as R). Cytosine, thymine, and uracil are all pyrimidines (abbreviated as Y). Other bases that do not function as normal parts of the genetic code are termed non-canonical. Nucleobases that can be used to be coupled to the organic molecule to form a compound of the library of the present invention are thymine, cytosine, uracil, 4-acetylcytidine, 5-(carboxyhydroxymethyl)uridine, 2'-0-methylcytidine, 5- carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluridine, dihydrouridine, 2 '- O-methylpseudouridine, 1 -methylpseudouridine, 3-methylcytidine, 5-methylcytidine, 5- methylaminomethyluridine, 5-methoxyaminomethyl-2-thiouridine, 5-methoxycarbonylmethyl-2- thiouridine, 5-methoxycarbonylmethyluridine, 5-methoxyuridine, uridine-5-oxyacetic acid- methylester, uridine-5-oxyacetic acid, pseudouridine, 2-thiocytidine, 5-methyl-2-thiouridine, 2- thiouridine, 4-thiouridine, 5-methyluridine, 2'-0-alkyluridine, 2'-0-alkylthymidine, 2'-0- alkylcytidine and 3-(3-amino-3-carboxy-propyl)uridine.

The term "alkyl", as used in the context of 2'-0-alkyluridine, 2'-0-alkylthymidine, 2'-0- alkylcytidine, refers to a saturated or unsaturated hydrocarbon containing 1-20 carbon atoms including both acyclic and cyclic structures (such as methyl, ethyl, propyl, isopropyl, butyl, isobutyl, sec-butyl, pentyl, hexyl, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, propenyl, butenyl).

In various other embodiments, the nucleic acid moiety consists of two to 20, preferably three to ten, more preferably six nucleotides. In preferred embodiments, the nucleic acid moiety consists of thymidine and/or cytosine nucleosides. In more preferred embodiments, the nucleic acid moiety consists of six thymidine nucleosides. This nucleic acid moiety is called "hexT".

In still various other embodiments, the nucleic acid moiety is RNA, DNA or a mixture thereof. In more preferred embodiments, the nucleic acid portion is DNA.

In further embodiments, the present invention relates to a compound library wherein the conjugates differ from each other by comprising different nucleic acid moieties. Basically, there are two alternative methods to ensure that each molecule of the plurality of conjugate molecules comprises a nucleic acid moiety that is distinct from the nucleic acid moieties of the other conjugate molecules: first, all individual types of organic moieties are coupled to individual sequences of pyrimidine nucleotides. By this method, the different conjugate molecules can be discriminated from each other directly after the coupling to the individual nucleic acid sequences. In a second method that allows the discrimination of the different conjugate molecules, the different types of organic molecules are linked to one type of nucleic acid sequence that comprises pyrimidine nucleotides, for example six thymine nucleotides. This pyrimidine nucleic acid moiety (of six thymine nucleotides) is a so-called "adaptor nucleic acid", also called "hexT" which is elongated with further nucleotides after the synthesis steps have been applied on the organic molecule. The nucleotides used in this elongation step of the adaptor nucleic acid are not limited to pyrimidine nucleotides and can comprise purine nucleotides. The elongation step is used to individualize the nucleic acid moiety sequence attached to each individual organic molecule. A preferred method for the elongation of the nucleic acid moiety is DNA ligation with T4 ligase or with a chemical condensing agent using oligonucleotides that comprise a nucleic acid sequence that hybridizes with the adaptor nucleic acid and that additionally comprises a nucleic acid sequence which encodes for a sequence that allows the individualization of the conjugate molecule. Alternatively, a nucleic acid sequence which encodes for a sequence that allows the individualization of the conjugate molecule and is modified with a psoralen derivative is ligated to the adaptor nucleic acid by UV irradiation.

"Covalent" or "covalent bond", as used herein, refers to a chemical bond that involves the sharing of electron pairs between atoms. These electron pairs are known as shared pairs or bonding pairs and the stable balance of attractive and repulsive forces between atoms when they share electrons is known as covalent bonding. For many molecules, the sharing of electrons allows each atom to attain the equivalent of a full outer shell, corresponding to a stable electronic configuration. Covalent bonding includes many kinds of interactions, including σ-bonding, π-bonding, metal-to-metal bonding, agostic interactions, bent bonds, and three-center two-electron bonds. In various embodiments, the conjugate molecules of the library further comprise a linker portion between the nucleic acid moiety and the organic moiety molecule, preferably this linker portion is polyethylene glycol (PEG). Chemical linkers may also be incorporated into the assembly reactants. The chemical linkers may be included between any of the moieties. Chemical linkers may optionally connect two or more of the moieties to introduce additional functionality or facilitate synthesis. The chemical linker can be a bond between any of the moieties. The bond can include a physical interaction, such as chemical bonds (either directly linked or through intermediate structures), or a non-physical interaction or attractive force, such as electrostatic attraction, hydrogen bonding, and van der Waals/dispersion forces. The linker may be conjugated to any given nucleotide within the nucleic acid moiety. In addition, any position of the ribose or the nucleobase within a given nucleotide of the nucleic acid moiety may be used to the coupling of the linker. In preferred embodiments, the linker is coupled to the nucleic acid moiety by the free 5'-position, namely a phosphate group, of the most 5'- terminal nucleotide of the nucleic acid moiety.

The chemical linkers may aid in facilitating spatial separation of the moieties, increasing flexibility of the moieties relative to each other, introducing a cleavage site or modification site to the templated assembly reactant, facilitating synthesis of the templated assembly reactant, improving physical or functional characteristics (such as solubility, hydrophobicity, charge, cell-permeability, toxicity, biodistribution, or stability) of a templated assembly reactant, or any combination of the above. In various embodiments, the chemical linker is derived from a cross-linker that facilitates connecting the assembly reactant moieties via bioconjugation chemistry. "Bioconjugation chemistry", as used herein, refers to the chemical synthesis strategies and reagents that ligate common functional groups together under mild conditions, facilitating the modular construction of multi-moiety compounds. Due to the mild reaction conditions, bioconjugate chemistry approaches can be suitable for ligating biomolecules, such as nucleic acids, peptides, or polysaccharides. Some non-limiting examples can include chains of one or more of the following: alkyl groups, alkenyl groups, amides, esters, thioesters, ketones, ethers, thioethers, disulfides, ethylene glycol, cycloalkyl groups, benzyl groups, heterocyclic groups, maleimidyl groups, hydrazones, urethanes, azoles, imines, haloalkyl groups, carbamates, or combinations of any of these. In preferred embodiments, the linker is polyethylene glycol (PEG). In addition to chemical linkers between moieties, additional functionality may optionally be introduced to the conjugate molecule by the addition of accessory groups to the moieties. Some non- limiting examples of accessory groups can include appending a chemical tag or fluorophore to track the location of the conjugation molecule, or appending an agent that improves delivery of the conjugate molecule to a given target, stabilizing groups or groups that improve purification of the conjugate molecule. "At least one", as used herein, relates to one or more, in particular 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more.

In a further aspect, the present invention relates to a method for synthesizing a compound library, wherein said library comprises a plurality of conjugate molecules, wherein said conjugates comprise a small organic molecule covalently coupled to a nucleic acid moiety, the synthesis of each conjugate molecule comprises: reacting a nucleic acid consisting of pyrimidine nucleotides with an organic molecule under conditions that allow the conjugation of said molecules, wherein the plurality of conjugate molecules differ from each other at least by comprising different organic molecules.

The term "synthesizing" or "synthesis", as used herein, refers to a purposeful execution of chemical reactions to obtain a product, or several products. This happens by physical and chemical manipulations usually involving one or more reactions. This may imply that the process is reproducible, reliable, and established to work in multiple laboratories.

A chemical synthesis begins by selection of compounds that are known as reagents or reactants. Various reaction types can be applied to these to synthesize the product, or an intermediate product. This requires mixing the compounds in a reaction vessel such as a chemical reactor or a simple round- bottom flask. Many reactions require some form of work-up procedure before the final product is isolated. The amount of product in a chemical synthesis is the reaction yield. Typically, chemical yields are expressed as a weight in grams (in a laboratory setting) or as a percentage of the total theoretical quantity of product that could be produced. A side reaction is an unwanted chemical reaction taking place that diminishes the yield of the desired product.

"Reacting" as used with regard to the method for synthesizing a compound library refers to contacting the educts under conditions that allow formation of the product, namely the conjugate molecule. Exemplary reaction conditions are described in the examples.

In various embodiments of the method, the conjugate molecule is further used in a reaction comprising a) a strong acid, preferably an acid having a pK a of less than 3; and/or b) a catalyst selected from the group consisting of copper, silver, gold, ruthenium, iron, osmium, cobalt, rhodium, iridium, nickel, palladium, platinum and derivatives thereof, preferably copper, silver, gold, ruthenium and derivatives thereof. The strength of an acid refers to its ability or tendency to lose a proton (H + ). A strong acid is one that completely ionizes (dissociates) in a solution. In water, one mole of a strong acid HA dissolves yielding one mole of H + (as hydronium ion ¾0 + ) and one mole of the conjugate base, A " . Essentially, none of the non-ionized acid HA remains. Examples of strong acids are hydrochloric acid (HQ), hydroiodic acid (HI), hydrobromic acid (HBr), perchloric acid (HCIO4), nitric acid (HNO3), trifluoroacetic acid, trichloroacetic acid, and sulfuric acid (H2SO4). In aqueous solution, each of these essentially ionizes 100%. In contrast, a weak acid only partially dissociates. Examples in water include carbonic acid (H2CO3) and acetic acid (CH3COOH). At equilibrium, both the acid and the conjugate base are present in solution. Stronger acids have a larger acid dissociation constant (K a ) and a smaller logarithmic constant (pKa = -log K a ) than weaker acids.

The term "transition metal", as used herein, is a synonym for elements of the groups 3 to 12 of modern IUPAC numbering. Examples of transition metals are copper (Cu), silver (Ag), and gold (Au). "Catalyst", as used herein, refers to any substance that increases the rate of a reaction without itself being consumed. In general, catalytic action is a chemical reaction between the catalyst and a reactant, forming chemical intermediates that are able to react more readily with each other or with another reactant, to form the desired end product. During the reaction between the chemical intermediates and the reactants, the catalyst is regenerated. The modes of reactions between the catalysts and the reactants vary widely. Typical of these reactions are acid-base reactions, oxidation-reduction reactions, formation of coordination complexes, and formation of free radicals.

In still various embodiments of the method, the nucleic acid moiety is elongated in an additional reaction step with pyrimidine and/or purine nucleotides after the conjugation step. As described above, the elongation of the nucleic acid molecule may be put into practice by using PCR and a primer nucleic acid molecule, which comprises a nucleic acid sequence that hybridizes with the adaptor nucleic acid and a nucleic acid sequence which encodes for a sequence that allows the individualization of the conjugate molecule. It was surprisingly found that only purine nucleotides, but not pyrimidine nucleotides, are degraded under conditions used for the synthesis of heterocyclic chemical compounds (for example reactions involving the use of transition metals as catalysts or strong acids). However, the variety of chemical reactions carried out on the organic molecule is not limited to reactions that are only tolerated by pyrimidine nucleotides. This variety of chemical reactions may also comprise, but is not limited to amide synthesis, stepwise substitution of cyanuric chloride, protective group chemistry, amide bond formation, carbonyl reactions, such as synthesis of diamine-containing cyclic or linear compounds, aromatic substitutions, such as nucleophilic aromatic substitutions of reactive heteroaromatic halides, cross-coupling reactions, such as palladium-catalyzed synthesis of biaryl compounds, Diels-Alder reaction, benzimidazole formation, macrocycle synthesis, substitution with N-nucleophiles, nucleophilic substitution of reactive aliphatic halides, Cu(I)-catalyzed alkyne-azide cycloaddition, Homer-Wadsworth-Emmons reation with reactive phosphonium ylids, reduction of aromatic nitro groups and appendage of cyanuric chloride.

In a third aspect, the present invention relates to the use of a compound library of the present invention for screening a compound binding to a target molecule. The term "screening", as used herein, means that a plurality of candidate compounds (this can be an amount of 10 or less compounds or up to 10000 compounds or more) can be screened in a single experiment for their ability to bind to a given target molecule. In various embodiments, 10 or more, 100 or more, 1000 or more, 1500 or more, 2000 or more, 5000 or more or 10000 or more of the candidate compounds are pooled in a single vessel and tested for their ability to bind to a target molecule. Thus, said candidate compounds pooled in a single vessel and exposed to the target molecule compete for binding to said target molecule. Upon being brought into contact with the target molecule, the candidate compounds may bind to the target molecule or being washed out by one or more washing steps. Subsequently, the candidate compounds bound to the target molecule can be identified via their individual DNA label. Methods to identify candidate compounds based on their DNA label are described herein and are well-known in the art.

A "target molecule", as used herein, may be any molecule which can be bound by a binding molecule and may for example be a biological substance such as a biomolecule, complexes, cell fractions or cells. Preferably, a target molecule within the context of the present invention is a nucleic acid, e.g. DNA, or RNA molecule. Even more preferred are target molecules such as a peptide, a protein, a drug molecule, a small molecule, or a vitamin. In particular preferred embodiments of the present invention a "target molecule" may be a biomarker. A target molecule may also refer to a biological molecule that is either naturally present in a cell or subject or has been previously introduced into a cell or subject. A "target molecule" can be any polypeptide, lipid, nucleic acid, small molecule, saccharide or a vitamin of the cell. It may be located in the nucleus, nucleolus, cytoplasm, mitochondria, Golgi apparatus, endoplasmic reticulum or membrane of the cell. In various embodiments, the target molecules of the methods of the present invention are polypeptide molecules.

In various embodiments, the target molecule is a protein.

The term "protein", as used herein, relates to one or more associated polypeptides, wherein the polypeptides consist of amino acids coupled by peptide (amide) bonds. The term polypeptide refers to a polymeric compound comprised of covalently linked amino acid residues. The amino acids are preferably the 20 naturally occurring amino acids glycine, alanine, valine, leucine, isoleucine, phenylalanine, cysteine, methionine, proline, serine, threonine, glutamine, asparagine, aspartic acid, glutamic acid, histidine, lysine, arginine, tyrosine and tryptophan.

Suitable targets may include one or more of peptides, proteins (e.g., antibodies, affibodies, or aptamers), nucleic acids (e.g., polynucleotides, DNA, RNA, or aptamers); polysaccharides (e.g., lectins or sugars), lipids, enzymes, enzyme substrates, ligands, receptors, antigens, or haptens. One or more of the aforementioned targets may be characteristic of particular cells, while other targets may be associated with a particular disease or condition. In some embodiments, targets in a tissue sample that may be detected and analyzed using the methods disclosed herein may include, but are not limited to, prognostic targets, honnone or honnone receptor targets, lymphoid targets, tumor targets, hematopoietic targets, cell cycle associated targets, neural tissue and tumor targets, or cluster differentiation targets.

Suitable examples of prognostic targets may include enzymatic targets such as galactosyl transferase II, neuron specific enolase, proton ATPase-2, or acid phosphatase.

Suitable examples of hormone or honnone receptor targets may include human chorionic gonadotropin (HCG), adrenocorticotropic hormone, carcinoembryonic antigen (CEA), prostate- specific antigen (PSA), estrogen receptor, progesterone receptor, androgen receptor, gClq-R/p33 complement receptor, IL-2 receptor, p75 neurotrophin receptor, PTH receptor, thyroid honnone receptor, or insulin receptor. Suitable examples of hematopoietic targets may include CD45, CD34, CD133, HLA-DR, CD115, CD1 16, CD117, CD33, CD38, CD90, CD71, Ki67, Flt3, CD163, CD45RA, CD3, IgD, CD105, CD45, and also c-kit, - the receptor for stem cell factor.

Suitable examples of lymphoid targets may include alpha- 1- antichymotrypsin, alpha- 1 -antitrypsin, B cell target, bcl-2, bcl-6, B lymphocyte antigen 36 kD, BM1 (myeloid target), BM2 (myeloid target), galectin-3, granzyme B, HLA class I Antigen, HLA class II (DP) antigen, HLA class II (DQ) antigen, HLA class II (DR) antigen, human neutrophil defensins, immunoglobulin A, immunoglobulin D, immunoglobulin G, immunoglobulin M, kappa light chain, kappa light chain, lambda light chain, lymphocyte/histocyte antigen, macrophage target, muramidase (lysozyme), p80 anaplastic lymphoma kinase, plasma cell target, secretory leukocyte protease inhibitor, T cell antigen receptor (JOVI 1), T cell antigen receptor (JOVI 3), terminal deoxynucleotidyl transferase, or unclustered B cell target. Suitable examples of tumour targets may include alpha fetoprotein, apolipoprotein D, BAG-I (RAP46 protein), CAl 9-9 (sialyl lewisa), CA50 (carcinoma associated mucin antigen), CAl 25 (ovarian cancer antigen), CA242 (tumour associated mucin antigen), chromogranin A, clusterin (apolipoprotein J), epithelial membrane antigen, epithelial-related antigen, epithelial specific antigen, gross cystic disease fluid protein- 15, hepatocyte specific antigen, heregulin, human gastric mucin, human milk fat globule, MAGE-I, matrix metalloproteinases, melan A, melanoma target (HMB45), mesothelin, metallothionein, microphthalmia transcription factor (MITF), Muc-1 core glycoprotein. Muc-1 glycoprotein, Muc-2 glycoprotein, Muc- 5AC glycoprotein, Muc-6 glycoprotein, myeloperoxidase, Myf-3 (Rhabdomyosarcoma target), Myf-4 (Rhabdomyosarcoma target), MyoDl (Rliabdomyosarcoma target), myoglobin, nm23 protein, placental alkaline phosphatase, prealbumin, prostate specific antigen, prostatic acid phosphatase, prostatic inhibin peptide, PTEN, renal cell carcinoma target, small intestinal mucinous antigen, tetranectin, thyroid transcription factor- 1 , tissue inhibitor of matrix metalloproteinase 1, tissue inhibitor of matrix metalloproteinase 2, tyrosinase, tyrosinase-related protein- 1 , villin, or von Willebrand factor.

Suitable examples of cell cycle associated targets may include apoptosis protease activating factor- 1, bcl-w , bcl-x, bromodeoxyuridine, CAK (cdk-activating kinase), cellular apoptosis susceptibility protein (CAS), caspase 2, caspase 8, CPP32 (caspase-3), CPP32 (caspase-3), cyclin dependent kinases, cyclin A, cyclin Bl, cyclin Dl, cyclin D2, cyclin D3, cyclin E, cyclin G, DNA fragmentation factor (N- terminus), Fas (CD95), Fas-associated death domain protein, Fas ligand, Fen-1, IPO-38, Mcl-I, minichromosome maintenance proteins, mismatch repair protein (MSH2), poly (ADP- Ribose) polymerase, proliferating cell nuclear antigen, pl6 protein, p27 protein, p34cdc2, p57 protein (Kip2), pi 05 protein, Stat 1 alpha, topoisomerase I, topoisomerase II alpha, topoisomerase III alpha, or topoisomerase II beta.

Suitable examples of neural tissue and tumor targets may include alpha B crystallin, alpha-internexin, alpha synuclein, amyloid precursor protein, beta amyloid, calbindin, choline acetyltransferase, excitatory amino acid transporter 1 , GAP43, glial fibrillary acidic protein, glutamate receptor 2, myelin basic protein, nerve growth factor receptor (gp75), neuroblastoma target, neurofilament 68 kD, neurofilament 160 kD, neurofilament 200 kD, neuron specific enolase, nicotinic acetylcholine receptor alpha4, nicotinic acetylcholine receptor beta2, peripherin, protein gene product 9, S- 100 protein, serotonin, SNAP-25, synapsin I, synaptophysin, tau, tryptophan hydroxylase, tyrosine hydroxylase, or ubiquitin. Suitable examples of cluster differentiation targets may include CDIa, CDIb, CDIc, CDId, CDIe, CD2, CD3delta, CD3epsilon, CD3gamma, CD4, CD5, CD6, CD7, CD8alpha, CD8beta, CD9, CDIO, CD1 la, CD1 lb, CD1 Ic, CDwl2, CD13, CD14, CD15, CD15s, CD16a, CD16b, CDwl7, CD18, CD19, CD20, CD21, CD22, CD23, CD24, CD25, CD26, CD27, CD28, CD29, CD30, CD31 , CD32, CD33, CD34, CD35, CD36, CD37, CD38, CD39, CD40, CD41 , CD42a, CD42b, CD42c, CD42d, CD43, CD44, CD44R, CD45, CD46, CD47, CD48, CD49a, CD49b, CD49c, CD49d, CD49e, CD49f, CD50, CD51, CD52, CD53, CD54, CD55, CD56, CD57, CD58, CD59, CDw60, CD61, CD62E, CD62L, CD62P, CD63, CD64, CD65, CD65s, CD66a, CD66b, CD66c, CD66d, CD66e, CD66f, CD68, CD69, CD70, CD71, CD72, CD73, CD74, CDw75, CDw76, CD77, CD79a, CD79b, CD80, CD81, CD82, CD83, CD84, CD85, CD86, CD87, CD88, CD89, CD90, CD91, CDw92, CDw93, CD94, CD95, CD96, CD97, CD98, CD99, CDIOO, CD101, CD102, CD103, CD104, CD105, CD106, CD107a, CD107b, CDwl08, CD109, CD1 14, CD1 15, CD1 16, CD1 17, CDwI 19, CD120a, CD120b, CD121a, CDwl21b, CD122, CD123, CD124, CDwl25, CD126, CD127, CDwl28a, CDwl28b, CD130, CDwl31 , CD132, CD134, CD135, CDwl36, CDwl37, CD138, CD139, CD 140a, CD140b, CD141 , CD142, CD143, CD144, CDwl45, CD146, CD147, CD148, CDwl49, CDwl50, CD151, CD152, CD153, CD154, CD155, CD156, CD157, CD158a, CD158b, CD161, CD162, CD163, CD 164, CD 165, CD 166, and TCR-zeta.

Suitable examples of cell surface receptor targets may include 7-helices receptors (e.g. alpha- and beta-adrenergic receptors, dopamine receptors, serotonin receptors, gamma-aminobutyric acid (GABA) A receptors, neurokinin receptors, opioid receptors, cannabinoid receptors, chemokine receptors), receptor tyrosine kinases (RTKs) and interleukin receptors.

Other suitable prognostic targets hormone or hormone receptor targets lymphoid targets tumor targets cell cycle associated targets neural tissue and tumor targets include centromere protein-F (CENP-F), giantin, involucrin, lamin A&C (XB 10), LAP-70, mucin, nuclear pore complex proteins, pi 80 lamellar body protein, ran, cathepsin D, Ps2 protein.

The term "contacting", as used herein in the context of conjugate molecules and target molecules, refers generally to providing access of one component, reagent, analyte or sample to another. For example, contacting can involve mixing a solution comprising a conjugate molecule and a target molecule. The solution comprising one component, reagent, analyte or sample may also comprise another component or reagent, such as dimethyl sulfoxide (DMSO) or a detergent, which facilitates mixing, interaction, uptake, or other physical or chemical phenomenon advantageous to the contact between components, reagents, analytes and/or samples.

The term "identify", as used herein in the context of screening, relates to the recognition of a conjugate molecule that has the ability to bind to a target, preferably a peptide or protein target molecule and may completely or partly revert molecular changes of a cell induced by pathological conditions or to inhibit or activate enzymatic functions. Further, in alternative embodiments, the nucleic acid moiety comprises a sequence of at least six thymidine and/or cytosine nucleotides directly attached to the small organic molecule.

The term "sequence", as used herein, relates to the primary nucleotide sequence of nucleic acid molecules or the primary amino acid sequence of a protein.

EXAMPLES

Example 1: Coupling of (amino) acids to 5'-aminolinker-modified DNA bound to controlled pore glass (CPG) solid support

The MMt-protective group of 3'-5'-dTdTdTdTdTdT-5'-(6-aminohexyl)-hydrogen phosphate A bound to 1000 A controlled pore glass (CPG) solid support was removed by addition of 3 % trichloroacetic acid in dry DCM. A yellow color indicated successful removal of the protective group. After that, the CPG was washed three times with each 200 μΐ, of 1 % TEA in AcCN, DMF, MeOH, ACN and DCM. Then, the 3'-5'-hexathymidine-5'-aminolinker DNA B bound to 1000 A controlled pore glass (CPG) solid support, an (amino) acid C, and a coupling reagent were dried in vacuo for 15 min. Then, stock solutions of all reactants in DMF were prepared immediately before the reactions were started. An amino acid C (100 eq) was coupled to the aminolinker-modified DNA B using HATU (100 eq) as coupling reagent and DIPEA (250 eq) as base. To a 15 of a solution of a carboxylic acid C in dry DMF were added 15 μΐ, of HATU dissolved in dry DMF and DIPEA (250 eq.) was added. The carboxylic acid C was then activated for 20 min at room temperature in a shaker. Afterwards, the solid support-bound DNA B suspended in DMF (15 iL) was added. The reaction tube was sealed and the reaction mixture was shaken at room temperature for 2 hours. Then, the CPG containing the DNA-conjugate D was filtered off using a filter column and washed subsequently with each 3 x 200 μΐ. of DMF, MeOH, MeCN and DCM. The filter with the CPG-bound product was then dried in vacuo for 15 min. 20-30 nM of the DNA were deprotected and removed from the CPG by treatment with AMA (AMA= aqueous ammonia (30 %)/ aqueous methylamine (40 %) l : lvol/vol) and analyzed by HPLC to evaluate the coupling efficiency. Then, 20 μΐ. of 1M Tris-buffer was added, the mixture was dried in a SpeedVac, re-dissolved in 100 of distilled water and analyzed by HPLC on a Gemini 5u CI 8 1 10 A column; 100* 10.0 mm with a gradient of aqueous triethylammonium acetate buffer (100 mM, pH= 8) and methanol (20:80 - 70:30 of methanol over 20 min). For further reaction, the solid support containing compound D was reacted three times with each 200 μΐ- of a 1 : 1 solution of capping solutions A and B (A: 800 μΐ dry THF/ 100 μΐ, acetic acid anhydride/ 100 μΐ ^ pyridine; B: 900 μΐ, dry THF/ 100 μΐ ^ N-methylimidazole) for each 30 seconds. This step capped unreacted amines, and it was followed by washing of solid support three times with each of 200 μΐ, of DMF, MeOH, ACN and DCM. The CPGs were finally washed again with each 3 x 200 μΐ. of DMF, MeOH, MeCN and DCM (see Figure 1). Example 2: Removal of the Fmoc-protective group from DNA-amino acid conjugates bound to controlled pore glass (CPG) solid support

The CPG containing an Fmoc-protected 3'-5'-hexathymidine-5'-amino acid conjugate E was washed with each 3 x 200 μΐ, of DMF, MeOH, MeCN and DCM. Fmoc-deprotection was performed with 20 % piperidine in DMF (100 μί) for 15 min at room temperature. The CPG containing the deprotected conjugate F was filtered off and washed with each 3 x 200 μΐ, of DMF, MeOH, MeCN and DCM. The CPG was then dried in vacuo for 15 min (see Figure 2).

Example 3: Synthesis of 3'-5'-hexathymidme-5'-B-carboline-conjugates by Pictet-Spengler reaction

a) Synthesis of DNA-tryptophane conjugates

The amide coupling of Fmoc-tryptophane 1 to 3'-5'-hexathymidine-5'-aminolinker DNA B was performed as described above in Example 1 using 3'-5'-hexathymidine-5'-aminolinker DNA B on 1000 A CPG (60 nM), Fmoc-Trp-OH (6 μηιοΐ, 2.6 mg), HATU (6 μιηοΐ, 2.3 mg), and 2.6 μΐ. DIPEA. The removal of the Fmoc-protective group yielding 3'-5'-dTdTdTdTdTdT-5'-((S)-6-(2-amino-3-(lH- indol-3-yl)propanamido)hexyl)hydrogen phosphate 2 was done according to Example 2 (see Figure 3).

b) Pictet-Spengler reaction

To a suspension of the DNA-Tryp-NFh conjugate bound to CPG (ca. 30 nM, 1.1 mg) in a solution of a suitable catalyst (trichloroacetic acid (TCA), 0.03 M; trifluoroacetic acid (TFA), 0.03 M; diphenyl hydrogen phosphate (DHP) 0.03 M) in 45 μΐ of a suitable solvent (dry DCM, MeCN, DCE, PhMe) was added 100 eq (3 μιηοΐ) of an aldehyde or ketone of choice (e.g. biphenyl-4-carboxaldehyde a, 3 μιηοΐ, 0.55 mg) (see Figure 4). The reaction was shaken at room temperature for 16 h, then, the DNA was filtered off, washed successively with DMF (3 x 200 μΐ), MeCN (3 x 200 μΐ), MeOH (3 x 200 μΐ) and DCM (3 x 200 μΐ), and dried in vacuo for 15 min. The DNA was deprotected and removed by treatment with AMA (AMA= aqueous ammonia (30 %)/ aqueous methylamine (40 %) 1 : 1 vol/vol). Then, 20 \L of 1M Tris-buffer was added, the mixture was dried in a SpeedVac, re-dissolved in 100 μΐ. of distilled water and purified by HPLC on a Gemini Su C 18 1 10 A column; 100* 10.0 mm with a gradient of aqueous triethylammonium acetate buffer (100 mM, pH= 8) and methanol (20:80 - 70:30 of methanol over 19 min). HPLC purification and MALDI-MS analysis of 4a are shown in Figure 5.

Example 4: Ruthenium-catalyzed ring-closing metathesis to 3'-5'-hexathymidine-peptide conjugate 6a

The synthesis of peptide-DNA conjugates was performed by Fmoc-solid-phase peptide synthesis (SPPS) by iterative application of the protocols of Examples 1 and 2 making use of amino acids in the following order: 1. (S)-N-fmoc-alpha-4-pentenylalanine (25 μηιοΐ, 9.5 mg); 2. N-fmoc-L-leucine (22 μηιοΐ, 7.8 mg); 3. N-fmoc-L-alanine (19 μηιοΐ, 5.9 mg); 4. (R)-N-fmoc-alpha-4-pentenyl alanine (16 μιηοΐ, 6.1mg). Crosslinking of unnatural olefinic amino acids was performed by ring closing metathesis (RCM) using the Grubbs first generation catalyst bis(tricyclohexylphosphine)benzylidine ruthenium(IV) dichloride (4.9 μπιοΐ, 4 mg) dissolved in 1 ml of dry dichloromethane (DCM) three times for 2 h at room temperature (100 μΐ of the solution were used for each coupling) (Figure 6). After each coupling, the DNA was washed with DCM (3 x 200 μΐ), the CPG was then dried in vacuo for 5 min and the next coupling was performed. Subsequent to the three couplings, the DNA was washed successively with DMF (3 x 200 μΐ), ACN (3 x 200 μΐ), MeOH (3 x 200 μΐ) and DCM (3 x 200 μΐ). The filter carrying CPGs was then dried in vacuo for 15 min. The DNA-macrocycle conjugate was deprotected and removed from the CPG by treatment with AMA (AMA= aqueous ammonia (30 %)/ aqueous methylamine (40 %) 1 : 1 vol/vol) . Then, 20 μί of 1M Tris-buffer was added, the mixture was dried in a SpeedVac, re-dissolved in 100 μΐ. of distilled water and purified by HPLC on a Gemini Su CI 8 110 A column; 100* 10.0 mm with a gradient of aqueous triethylammonium acetate buffer (100 mM, pB=8) and methanol (20:80 - 70:30 of methanol over 20 min) (see Figure 7). HPLC purification and MALDI-MS analysis of 6a are shown in Figure 7.

Example 5a: Gold-catalyzed cycloaddition to 3'-5'-hexathymidine-6-oxa-l,2- diazaspiro [4.4] nonane-conj ugates

a) Synthesis of hexathymidine-5 '-PEG linker conjugate 9

The synthesis of the 3'-5'-hexathymidine-conjugate 9 was done according to general protocol of Example 1 making use of 400 nmol MMt-protected 3 '-5' -hexathymidine-5 '-aminolinker DNA B attached to solid support (ca. 14.4 mg of CPG), PEG-linker 7 (21.5 mg, 40 μιηοΐ, 100 eq.) dissolved in 120 μΐ, of dry DMF, HATU (15.2 mg,40 μιηοΐ, 100 eq.) dissolved in 120 \iL of dry DMF, and DIPEA (17.3 \L, 100 μηιοΐ, 250 eq.). To remove the MMt group of the PEG linker, the solid support was treated with 3 % trichloroacetic acid in diy DCM yielding 9 ,(Figure 8). For purpose of analysis, 20 nmol of DNA-conjugate 9 (ca. 0.72 mg) were deprotected and removed from the CPG with 500 μΐ, of a 1 : 1 solution of cone. aq. NH3 and cone. aq. MeN¾ for 30 minutes at room temperature. Then, 20 μΐ. of 1M Tris-buffer was added, the mixture was dried in a SpeedVac, re-dissolved in 100 μΕ of distilled water and analyzed by HPLC on a Gemini 5u C18 1 10A column; 100* 10.0 mm with a gradient of aqueous triethylammonium acetate buffer (100 mM, pH= 8) and methanol (20:80 - 70:30 of methanol over 20 min). b) Synthesis of 3'-5'-hexathymidine-aldehyde conjugate 11

The synthesis of the hexathymidine-5 '-conjugate 11 was done according to general protocol of Example 1 making use of 360 nmol of conjugate 9 attached to the solid support (ca. 13 mg of CPG), 2- (4-formylphenoxy)acetic acid 10 (6.5 mg, 36 μιηοΐ, 100 eq.) dissolved in 108 ih of dry DMF, HATU (13,7 mg, 36 μηιαΐ, 100 eq.) dissolved in 108 iL of dry DMF, and DIPEA (15,6 μί, 90 μνα , 250 eq.) (Figure 9). For purpose of analysis, 20 nmol of 11 (ca. 0,72 mg) were deprotected and removed from the CPG with 500 iL of a l : lsolution of cone. aq. N¾ and cone. aq. MeNH2 for 30 minutes at room temperature. Then, 20 μΐ-, of 1M Tris-buffer was added, the mixture was dried in a SpeedVac, re-dissolved in 100 \L of distilled water and analyzed by HPLC on Gemini 5u CI 8 1 10 A column; 100*10.0 mm with a gradient of aqueous triethylammonium acetate buffer (100 mM, pH= 8) and methanol (20:80 - 70:30 % of methanol over 19 min). c) Gold-catalyzed cycloaddition reaction to 3'-5'-hexathymidine-6-oxa-l ,2-diazaspiro[4.4]nonane- conjugate 14

ieri-Butyl-2-benzylhydrazine-l-carboxylate 12 (2,36 mg, 10 μιηοΐ, 500 eq) dissolved in 15 μΐ, of dry THF, pent-4-yn-l-ol 13 (1.98 μί, 20 μιηοΐ, 100 eq), and chloro[tris(2,4-di-teri- butylphenyl)phosphite]gold (4.4 mg, 5 μιηοΐ, 250 eq) mixed with silver hexafluoroantimonate (1.72 mg, 5 μιηοΐ, 250 eq) in 15 μί of dry THF, were added to 20 nmol of polypyrimidine ONA-conjugate 11 (ca. 0.72 mg) suspended in 15 iL of dry THF (Figure 10). The reaction mixture was shaken at room temperature for overnight to furnish 14. After that, the solid support was washed three times with each 200 iL of DMF, MeOH, ACN and DCM. Then it was treated with 500 μί of 1 : 1 solution of conc.aq. N¾ and cone. aq. MeNH2 for 30 minutes at room temperature in order to remove it from CPG. Then, 20 xL of 1M Tris-buffer was added, the mixture was dried in a SpeedVac, re-dissolved in 100 μΐ, of distilled water and purified by HPLC on Gemini 5u CI 8 1 10A column; 100* 10.0 mm with a gradient of aqueous triethylammonium acetate buffer (100 mM, pH= 8) and methanol (20:80 - 70:30 of methanol over 19 min) HPLC purification and MALDI-MS analysis of the synthesized compound 14 are shown in Figure 1 1. Example 5b: Synthesis of 3'-5'-hexathymidine-2,3-dihydro-lH-pyrazole-conjugates

a) Synthesis of hexathymidine-5'-alkyne conjugate 16

The synthesis of the DNA-alkyne conjugate 16 (Figure 12) was done according to general procedure 1 making use of 350 nmol 5'-aminolinker-modified 3'-5'-hexathymidine B attached to solid support (ca. 13 mg of CPG), 4-ethynylbenzoic acid 15 (5.1 mg, 35 μιηοΐ, 100 eq.) dissolved in 105 \xL of dry DMF, HATU (13.3 mg, 35 μιηοΐ, 100 eq.) dissolved in 105 μΐ, of dry DMF, and DIPEA (15.1 μΐ,, 87.5 μιηοΐ, 250 eq.). For purpose of analysis (Figure 14a), 30 nmol of 16 (ca. 1.08 mg) were deprotected and removed from the CPG with 500 μΕ of a 1 : 1 solution of cone. aq. N¾ and cone. aq. MeN¾ for 30 minutes at room temperature. Then, 20 μί of 1 M Tris-buffer was added, the mixture was dried in a SpeedVac, re-dissolved in 100 μΐ, of distilled water and analyzed by HPLC on Gemini 5u C18 1 10A column; 100* 10.0 mm with a gradient of aqueous triethylammonium acetate buffer (100 mM, pH= 8) and methanol (20:80 - 70:30 of methanol over 19 min). b) Gold-catalyzed synthesis of 3'-5'-hexathymidine-2,3-dihydro-lH-pyrazole conjugate 19

teri-Butyl 2-(2-nitrobenzyl)hydrazinecarboxylate 17 (10.7 mg, 40 μιηοΐ, 1000 eq) dissolved in 45 μΐ, of dry AcCN, isobutyraldehyde 18 (3.7 iL, 40 μηιοΐ, 1000 eq), and chloro[tris(2,4-di-ieri- butylphenyl)phosphite]gold (8.8 mg, 10 μιηοΐ, 250 eq) mixed with silver hexafluoroantimonate (3.4 mg, 10 μιηοΐ, 250 eq) in 45 ih of diy MeCN, were added to 40 nmol of DNA-conjugate 16 (ca. 1.4 mg). The reaction mixture was shaken at 50 °C for overnight to yield 19 (Figure 13). After that, the solid support was washed three times with each 200 iL of 0.1 M EDTA, DMF, MeOH, MeCN and DCM. Then, it was treated with 500 ih of 1 : 1 solution of cone. aq. N¾ and cone. aq. MeN¾ for 30 minutes at room temperature in order to remove it from CPG. 20 μΐ. of 1M Tris-buffer was added, the mixture was dried in a SpeedVac, re-dissolved in 100 μΙ_. of distilled water and purified by HPLC on Gemini 5u C18 1 10A column; 100* 10.0 mm with a gradient of aqueous triethylammonium acetate buffer (100 mM, pH= 8) and methanol (20:80 - 70:30 % of methanol over 19 min). HPLC purification and MALDI-MS analysis of the newly synthesized compound 19 are shown in Figures 14b and 14c. Example 6: Cycloaddition reactions yielding 3'-5'-hexathymidine-5'-spirooxindole-conjugates a) Synthesis of oxindole 20 (feri-butyl 4-(3-benzylidene-2-oxoindolin-l-yl)butanoate)

To triphenylphosphine (1.3 g, 5 mmol) in 10 ml of toluene was added benzyl bromide 20 (0.6 ml, 5 mmol) via syringe, the reaction was heated at 110 °C for 15 h, then the mixture was cooled to room temperature and the solid formed was isolated by suction filtration, washed with diethyl ether (3 x 5 ml) and dried in vacuo to give a white solid. Under an atmosphere of argon, 10 ml of anhydrous THF was added and the solution was cooled to 0 °C. A solution of butyl lithium (2.5 M in hexanes, 2 ml, 5 mmol) was added dropwise via syringe and the reaction was allowed to stir for lh in order to form 21. A solution of isatin 22 (0.75 g, 5 mmol) in dry THF (30 ml) was added dropwise via syringe over 20 min at 0 °C and the resulting solution was allowed to stir at room temperature for 15 h. The reaction mixture was then poured into a saturated solution of N¾C1 (35 ml) at 0 °C and extracted with dichloromethane (3 x 25 mL). The combined organic layers were dried over anhydrous MgS04, filtered and concentrated in vacuo to afford a residue containing a E/Z mixture (ca. 4: 1) of product 3- benzylideneindolin-2-one 23 that was separated by flash column chromatography, eluting with 3:7 EtOAc in hexanes, to give (E)-23 (605 mg, 62%) and (Z)-23 (155 mg, 15%) as yellow solids. To a solution of 3-benzylideneindolin-2-one 23 (130 mg, 0.6 mmol) in 10 ml of DMF was added potassium carbonate (207 mg, 1.5 mmol). The reaction was stirred at room temperature for 15 minutes, tert- Butyl-4-bromobutyrate 24 (212 iL, 1.2 mmol) was then added. The reaction was stirred at room temperature overnight, concentrated in vacuo, dissolved in EtOAc and washed with ¾0. The layers were separated and the aqueous layer was extracted with additional EtOAc. The combined organic phase was dried over Na2S04, filtered and concentrated in vacuo. The crude product was purified by silica gel column chromatography, eluting with 3:7 EtOAc in petroleum ether, to give the product teri-butyl 4-(3-benzylidene-2 -oxoindolin-l -yl)butanoate 25 as an orange solid (83%, 180 mg) (Figure 15). b) Synthesis of the 3'-5'-hexathymidine-5'-oxindole conjugate 22

After the deprotection of teri-butyl-4-(3-benzylidene-2-oxoindolin-l-yl)butanoate 25 with TFA, the synthesis of 27 was done according to general protocol of Example 1 making use of 400 nmol hexathymidine DNA-PEG linker conjugate 9 attached to the solid support, (£)-4-(3-benzylidene-2- oxoindolin-l-yl)butanoic acid 26 (12.3 mg, 40 μηιοΐ, 100 eq.) dissolved in 100 μΐ., of dry DMF, HATU (15.2 mg, 49 μιηοΐ, 100 eq.) dissolved in 100 μΐ, of dry DMF, and DIPEA (17.4 μΐ,, 100 μιηοΐ, 250 eq.) (Figure 16). For purpose of analysis 20 nmol of DNA-conjugate 27 were deprotected and removed from the CPG with 500 \L of a 1 : 1 solution of cone. aq. N¾ and cone. aq. MeN¾ for 30 minutes at room temperature. Then, 20 iL of 1M Tris-buffer was added, the mixture was dried in a SpeedVac, re-dissolved in 100 μΕ of distilled water and analyzed by HPLC on Gemini 5u CI 8 1 10A column; 100* 10.0 mm with a gradient of aqueous triethylammanium acetate buffer (100 mM, pH= 8) and methanol (20:80 - 70:30 of methanol over 19 min, Figure 18a). c) Cu(I)-catalyzed synthesis of hexathymidine-5'-spirooxindole [methyl l-(4-amino-4- oxobutyl)-2'- (4-bromophenyl)-2-oxo-4'-phenylspiro[indoline-3,3 '-pyrrolidine] -5 '-carboxylate] conjugate 29 via (3+2) cycloaddition

The catalyst [Cu(CH 3 CN) 4 ]PF 6 (0.05 mg, 0.12 μιηοΐ, 5 eq) and one of the ligands (RP,R'P)-1,1 '- bis[(S)-a-(dimethylamino)benzyl]-2,2'-bis(diphenylphosphino) ferrocene (0.4 mg, 0.5 μιηοΐ, 20 eq) or [tris[(l-benzyl-lH-l ,2,3-triazol-4-yl) methyljamine) (0.26 mg, 0.5 μηιοΐ, 20 eq) were dissolved in 60 μΐ of DCM and mixed with methyl 2-((4-bromobenzylidene)amino)acetate 28 (0.76 mg, 3 μπιοΐ, 100 eq) in 30 μΐ of DCM and triethylamine (0.37 μΐ, 2.5 μηιοΐ, 100 eq), the mixture was added to 30 nmol of DNA-conjugate 27 (ca. 1 mg) and shaken at room temperature for 4 h to yield 29 (Figure 17). After that, the solid support was washed three times with each 200 \xL of DMF, MeOH, ACN and DCM. Then it was treated with 500 μΐ, of a solution of K2CO3 in methanol for 30 minutes at room temperature. The solution was neutralized with aq. acetic acid, 20 \\L of 1M Tris-buffer was added, the mixture was dried in a SpeedVac, re-dissolved in 100 iL of distilled water and purified by HPLC on Gemini 5u CI 8 1 10 A column; 100* 10.0 mm with a gradient of aqueous triethylammonium acetate buffer (100 mM, pH= 8) and methanol (20:80 - 70:30 % of methanol over 19 min) and analyzed by MALD1-TOF analysis (Figure 18b and 18c). d) Ag-catalyzed (3+2) cycloaddition yielding hexathymidine-5'-spirooxindole conjugate 29

To a solution of catalyst AgOAc (0.04 mg, 0.25 μπιοΐ, 10 eq) and triethylamine (0.37 μΐ, 2.5 μιηοΐ, 100 eq) in 60 μΐ of DCM or THF was added methyl 2-((4-bromobenzylidene)amino)acetate 28 (0,76 mg, 3 μιηοΐ, 100 eq) in 30 μΐ of DCM or THF, the mixture was added to 30 nmol of DNA-conjugate 27 (ca. 1 mg) shaken at room temperature for 4 h to furnish 29 (Figure 17). After that, the solid support was washed three times with each 200 xL of DMF, MeOH, ACN and DCM. Then it was treated with 500 \i of a solution of K2CO3 in methanol for 30 minutes at room temperature. The solution was neutralized with aq. acetic acid. Then, 20 L of 1M Tris-buffer was added, the mixture was dried in a SpeedVac, re-dissolved in 100 iL of distilled water and purified by HPLC on Gemini Su C18 110A column; 100* 10.0 mm with a gradient of aqueous triethylammonium acetate buffer (100 mM, pH= 8) and methanol (20:80 - 70:30 % of methanol over 19 min, Figure 18b), and analyzed by MALD1-TOF analysis.

Example 7: Encoding of hexaT-adapter adaptor nucleic acid-small molecule conjugates by T4 ligase

Coding oligonucleotides were 5 '-phosphorylated with T4 polynucleotide kinase (T4 PNK, Thermo Scientific). The phosphorylation reactions were performed in a total volume of 20 μΐ using 280 pmol of the coding oligonucleotides (e.g. oligonucleotide II (Figure 19)).

Table 1 : reaction mixture of PNK reaction for the first ligation

The PNK reaction mixtures were mixed thoroughly, spinned briefly and incubated at 37°C for 20 minutes. PNK was inactivated at 75°C for 10 minutes. Oligonucleotides I (adapter nucleic acid-beta- carboline conjugate), II (phosphorylated) and III were hybridized at 65°C for 10 minutes and slowly cooled down to room temperature (25°C) in order to perform the first ligation reaction (cf. Figure 19).

Ligation protocol:

The ligation mixture contained oligonucleotides I, II and III (1.4fold excess of the phosphorylated oligonucleotide II) and 2x rapid ligation buffer (Biozym; 132 mM Tris-HCI, 20 mM MgC , 2 mM DTT, 2 mM ATP, 15% PEG 6000, pH 7.6 @ 25°C). For the ligation reactions 600 units of T4 DNA ligase (600 U/μΙ; T4 DNA ligase rapid; Biozym) were added for a total reaction volume of 20 μΐ. The reactions mixtures were incubated overnight at 25°C and T4 DNA ligase was inactivated by incubating the samples at 75°C for 10 minutes. To confirm the ligation process, agarose gel electrophoresis was used to identify the components in the ligation reaction mixtures, shown in Figure 21. Electrophoresis was carried out in 1 * TBE (pH 8.0) at 150 V constant voltage for 75 minutes. For staining, Midori Green Direct (Biozym) and as a reference the GeneRuler Ultra Low Range DNA Ladder (ThermoScientific) were used. The second ligation reaction with T4 DNA ligase was prepared analogously. The scheme of the second ligation reaction is shown in Figure 20.

MALDI MS analysis:

To confirm the ligation success for the first ligation reaction an EcoRI digest followed by MALDI- TOF analysis was performed (cf. Figure 22). For the EcoRI digest 25 pmol of the ligation product was incubated in 5 μΐ lOx Buffer H (commercially provided by Roche; 0,5 M Tris-HCl, 1 M NaCl, 100 lnM MgCb, 10 mM DTE, pH 7,5 at 37°C) and 1 unit EcoRI. The total volume of the digest reaction was 50 μΐ (distilled water was added). The digest was performed at 37°C for overnight. Before measuring MALDI the reaction mixture was precipitated with 75% EtOH. For the MALDI analysis 3- HPA was used as matrix. The MALDI-TOF result shows 3 defined masses which correlate well with the calculated masses (cf. Figure22).

In the following the DNA sequences, which involved in the ligation process, are depicted:

Table 2: Sequences of the oligonucleotides I-XII

VIII Α', Β', C\ 67mer 8 GAA GTC ACA CAG CAG TCG TGA ACA GTT D', Ε', F\ AGG CCT ATG TTA TGA CAG CGG AAT TCG G' GCA AGA CAA AAA A

IX A, H 15mer TTT Ί ΓΤ GTC TTG CCG

X C, D, I 22mer 10 AAT TCC GCT GTC ATA ACA TAG G

XI A', J 19mer 1 1 AAT TCG GCA AGA CAA AAA A

XII C, D « , E', 22mer 12 TAG GCC TAT GTT ATG ACA GCG G

K

Table 3: Functions of the partial sequences of oligonucleotides I-XII.

partial function of partial length SEQ ID sequence (5'-3')

sequence sequence NO:

A adaptor nucleic acid 6mcr 1

A' complementary sequence 6mer 13 AAA AAA

to adaptor nucleic acid

B primer 18mer 14 GTC TTG CCG AAT TCC GCT

B' complementary sequence 18mer 15 AGC GGA ATT CGG CAA GAC to primer

C scaffold code 5mer 16 GTC AT

C * complementary sequence 5mer 17 ATG AC

to scaffold code

D code for building block I 8mer 18 AAC ATA GG

D' complementary sequence 8mer 19 CCT ATG TT

to code for building

block I

E overhang for T4 DNA 4mer 20 CCT A

ligation

Example 8: Synthesis of compounds comprising a hexT (six thymidine nucleotide) nucleic acid moiety

The TiDEC strategy is based on a 5'-aminolinker modified hexathymidine oligonucleotide, named "hexT" (Figure 23B). It was bound to controlled pore glass (CPG) solid support which allowed us to use organic solvents in the synthesis of target molecules, and facilitated purification of the DNA-conjugates. The synthesis of hexT-P-carboline conjugates by Bronsted acid catalyzed Pictet-Spengler reaction and the synthesis of hexT-pyrazol(in)e conjugates through a Au(I)-catalyzed three component annulation reaction is demonstrated. As heterocycle formation and introduction of a first set of building blocks are performed simultaneously in both reactions, the hexT-heterocycle conjugates were ligated to two coding dsDNAs in one step (Figure 23B): the first dsDNA contained a hexa-adenine overhang, the primer, the code for the heterocycle, and a four-base overhang for ligation of a second dsDNA. The second sequence contained the code for the building block appended to the heterocycle, and a four-base overhang for a further ligation step. The coding dsDNA sequences were 5'-phosphorylated by polynucleotide kinase, and incubated with the hexT- fluorescein conjugate hexT-fluo and T4 ligase at 25°C.

Both the ligation of one dsDNA, and also of two dsDNA strands in one pot to hexT-fluo proceeded efficiently. The ligation products were identified both by staining of the DNA, and by detection of the fluorescein label. These experiments demonstrated the viability of TiDEC for DNA-encoded chemistry. An important heterocyclic structure in drug research is the β- carboline. It can be accessed through Bronsted acid catalyzed Pictet-Spengler reaction. It is the core scaffold of many natural products including the pharmacologically active alkaloids ajmalicine and reserpine, and also of synthetic drugs, and drug candidates such as tadalafil, AZD9496, and the antimalarial candidate NITD609. The CPG-bound tryptophane conjugate hexT-2 was synthesized (Figures 25), and screened several conditions (Figure 24) to react it with -biphenylcarboxaldehyde CR (Figure 26) to the β-carboline hexT-3CR. The product was confirmed by comparison with the reference ref-hexT-3CR. The target compound hexT- 3CR was obtained with 10 % of TFA in CH 2 C1 2 for 4 hours, 1 % of TFA, TCA, diphenylphosphate, and 50% of formic acid in CH 2 C1 2 for 18 hours at room temperature, prolonged treatment of th 10 % of TFA in CH 2 C1 2 for 18 hours degraded the DNA (entries 1- 6). A solvent screen revealed dry non-polar solvents, acetonitrile, and dry DMF to be suitable (entries 11, 14-16), whereas the starting material was recovered when the reaction was performed in polar solvents THF and 1,4-dioxane, DMSO, as well as DMF with trace amounts of water (entries 9,10,12,13). The hexT was degraded though by the Lewis acids BC1 3 and BF 3 in CH 2 C1 2 , and 10 % of TFA in in CH 2 C1 2 for 18 hours (entries 7,8,3).

The hexT-2 was then reacted with 1 13 aldehydes A-DJ, and isatin L with 1 % TFA in CH 2 C1 2 on the 20 nmol scale in sealed standard 96well deepwell plates for 16 hours at room temperature (Figure 25 and 26). The reaction revealed a broad scope as aliphatic, olefmic, heterocyclic, diverse aromatic aldehydes, and isatin furnished the target β-carbolines hexT- 3A-3DJ with yields of mostly 1-5 nmol after HPLC-purification, which is sufficient for DEL synthesis. Each compound was ligated with a dsDNA encoding the β-carboline 3 and an individual dsDNA sequence encoding the aldehyde building block A-DJ in one pot (Figures 25).

Many bioactive compounds, among them the drugs rimonabant and celecoxib, represent highly substituted pyrazol(in)es. A class of these heterocycles can be efficiently synthesized from alkynes, aldehydes, and hydrazides by a Au(I)-catalyzed cascade reaction. The CPG- bound hexT-alkyne conjugate hexT-4, isobutyric aldehyde AJ and the hydrazide 5 were employed to explore the accessibility of the hexT-pyrazoline conjugate hexT-6AJ by Au(I) catalysis (Figure 28).

Systematic exploration of reaction parameters yielded four distinct, isolatable products in different ratios (Figure 27): the pyrazoline 6 A J, the pyrazole 7AJ, the propargylamine 8 A J, and the hydration product 9AJ. Performing the reaction with a 250-fold excess of the Au(I)catalyst A versus the DNA at 50°C in THF yielded a 2: 1 :0.4 mixture of pyrazoline 6AJ, propargylamine 8AJ, and the hydration product 9AJ (entry 3). Reducing the excess of the catalyst to 50-fold essentially reversed the product ratio of the pyrazoline 6AJ versus the propargylamine 8AJ, the hydration product 9AJ was not observed; a further reduction of catalyst A to 5-fold excess did not yield any reaction product at all (entries 2,1). Changing the solvent had a profound effect on the product ratio: while MeCN yielded a 3: 1 mixture of 6AJ to 8AJ (entry 4), the hydration product 9AJ was obtained as the main product in 1 ,2- dichloroethane (entry 5), and the pyrazole 7AJ as the main product in glacial acetic acid with catalyst B (entry 6). Thus, a combination of high catalyst concentration with the acidic solvent effected oxidation of the heterocycle and cleavage of the Boc-group. This reaction was even more pronounced at 60°C (entry 9). The formation of the pyrazole was observed as side product also in MeCN with catalyst A upon raising the temperature to 60°C (entry 8), while lowering the temperature to 25°C returned only unreacted starting material (entry 7). The triphenylphosphine- and N-heterocyclic carbine-based gold catalysts B and C, respectively, yielded the pyrazoline as sole detectable reaction product in 1 ,2-dichloroethane (entries 10,1 1). Ag(I), which reportedly afforded Mannich-type products such as 8AJ, gave the annulation product 6AJ and 8AJ in a 1 : 1 ratio, likely due to the high catalyst concentration (entry 12). Reference compounds for both the annulation and the Mannich-type products confirmed these findings. Having established an access to both hexT-pyrazolines, and hexT- pyrazoles by selection of reaction conditions, next libraries of both heterocycles from aldehydes, a hydrazide, and hexT-4 were synthesized (Figure 28). While aldehydes are abundantly available to introduce diversity in the heterocycle, the hydrazide had to be synthesized de novo, and therefore lent itself to functionalization with an acylatable primary amine 11 for combinatorial library synthesis. In the course of the library synthesis it was found that aliphatic aldehydes related to isobutyric aldehyde AJ, i.e. branched in a-position, neatly yielded the pyrazoline conjugates hexT-12 in MeCN at 50°C (condition A, Table 27) whereas linear aliphatic and benzaldehydes were not reactive (Figure 29). Only a small number of benzaldehydes furnished the pyrazolines hexT- 12 in glacial acetic acid at 50°C (Figure 25). However, both branched aliphatic aldehydes, and benzaldehydes yielded pyrazoles hexT-13 (Figure 30) in glacial acetic acid at 60°C (condition C, Figure 27). Notably, isatin L gave rise to the spirooxindole hexT-13L under these conditions. Linear aliphatic aldehydes were again not reactive, also pyridine carboxaldehydes and related structures did not yield the corresponding hexT-13 conjugates, likely due to poisoning of the catalyst. In total 23 pyrazolines hexT-12 and 41 pyrazoles hexT-13 were obtained with yields between 0.5 and 3 nmol after purification by HPLC. Usually, branched aliphatic aldehydes gave higher yields than aromatic aldehydes. The pyrazolines hexT-12 and pyrazoles hexT-13 were ligated with each a duplex DNA encoding the heterocycles, and dsDNA sequences encoding the aldehyde building blocks. All documents cited herein, are hereby incorporated by reference in their entirety.

The inventions illustratively described herein may suitably be practiced in the absence of any element or elements, limitation or limitations, not specifically disclosed herein. Thus, for example, the terms "comprising", "including", "containing", etc. shall be read expansively and without limitation. Additionally, the terms and expressions employed herein have been used as terms of description and not of limitation, and there is no intention in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the invention claimed. Thus, it should be understood that although the present invention has been specifically disclosed by preferred embodiments and optional features, modification and variation of the inventions embodied therein herein disclosed may be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of this invention. The invention has been described broadly and generically herein. Each of the narrower species and subgeneric groupings falling within the generic disclosure also form part of the invention. This includes the generic description of the invention with a proviso or negative limitation removing any subject matter from the genus, regardless of whether or not the excised material is specifically recited herein. In addition, where features or aspects of the invention are described in terms of Markush groups, those skilled in the art will recognize that the invention is also thereby described in terms of any individual member or subgroup of members of the Markush group. Further embodiments of the invention will become apparent from the following claims.