Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
COMPLEX COMBINATORIAL CHEMICAL LIBRARIES ENCODED WITH TAGS
Document Type and Number:
WIPO Patent Application WO/1994/008051
Kind Code:
A1
Abstract:
Encoded combinatorial chemistry is provided, where sequential synthetic schemes are recorded using organic molecules, which define choice of reactant, and stage, as the same or different bit of information. Various products can be produced in the multi-stage synthesis, such as oligomers and synthetic non-repetitive organic molecules. Conveniently, nested families of compounds can be employed as identifiers, where number and/or position of a substituent define the choice. Alternatively, detectable functionalities may be employed, such as radioisotopes, fluorescers, halogens, and the like, where presence and ratios of two different groups can be used to define stage or choice. Particularly, pluralities of identifiers may be used to provide a binary or higher code, so as to define a plurality of choices with only a few detachable tags. The particles may be screened for a characteristic of interest, particularly binding affinity, where the products may be detached from the particle or retained on the particle. The reaction history of the particles which are positive for the characteristic can be determined by the release of the tags and analysis to define the reaction history of the particle.

Inventors:
STILL W CLARK (US)
OHLMEYER MICHAEL H J (US)
WIGLER MICHAEL H (US)
DILLARD LAWRENCE W (US)
READER JOHN C (US)
Application Number:
PCT/US1993/009345
Publication Date:
April 14, 1994
Filing Date:
October 01, 1993
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV COLUMBIA (US)
COLD SPRING HARBOR LAB (US)
STILL W CLARK (US)
OHLMEYER MICHAEL H J (US)
WIGLER MICHAEL H (US)
DILLARD LAWRENCE W (US)
READER JOHN C (US)
International Classes:
C07B61/00; C07C205/19; C07C245/14; C07D243/24; C07H21/00; G01N37/00; C07K1/04; C12Q1/68; G01N33/532; G11B9/00; G11B9/14; C40B70/00; (IPC1-7): C12Q1/68; C07H21/02; C07H21/04; A61K48/00; A61K37/00
Other References:
P.N.A.S., Volume 89, issued June 1992, BRENNER et al., "Encoded Combinatorial Chemistry", pages 5381-5383, see entire document.
Nature, Volume 354, issued 07 November 1991, LAM et al., "A New Type of Synthetic Peptide Library for Identifying Ligand-Binding Activity", pages 82-84, see page 82, summary.
Science, Volume 257, issued 17 July 1992, AMATO et al., "Speeding Up a Chemical Game of Change", pages 330-331, see entire document.
Nature, Volume 354, issued 07 November 1991, HOUGHTEN et al., "Generation and Use of Synthetic Peptide Combinatorial Libraries for Basic Research and Drug Discovery", pages 84-86, see entire document.
Download PDF:
Claims:
WHAT IS CLAIMED IS:
1. A method for recording the reaction history of a reaction series on each of a plurality of unique solid supportε, wherein said reaction series involves at least two stages requiring differing agents or reaction conditions resulting in a different modification as to a plurality of said unique solid supports, resulting in a plurality of different final products on different unique solid εupports, employing a combination of identifierε for recording said reaction history, said identifiers characterized by defining the choice of agent or reaction condition and the stage in said reaction εerieε and being capable of being analyzed aε to the choice and εtage, εaid method comprising: reacting, at a first or intermediate stage of said series, a different agent or employing a different reaction condition with each of a group of said unique solid supportε, εaid group compriεing at leaεt one of εaid unique εolid supportε, and a combination of identifierε wherein εaid combination of identifierε defineε the choice of agent and the εtage in εaid εeries as to each group of said unique solid supportε, each of said identifiers being individually bound to said unique solid support directly or through other than a prior identifier; mixing said groups together and then dividing said plurality of unique solid supports into a plurality of groups for a second intermediate or final stage; and repeating said reacting at least once to provide a plurality of final products, having different products on the different individual unique εolid supports.
2. A method according to Claim 1, wherein at least 100 unique solid supports and at least 2 groups are employed in each said reacting.
3. A method according to Claim 1, including the additional stageε of εcreening εaid final productε on said unique solid supports for a characteristic of intereεt; and identifying the reaction hiεtory of at leaεt one final product having εaid characteriεtic of intereεt.
4. A method of Claim 1 further compriεing cleaving the final product from the εolid εupport and screening said final product.
5. A method of Claim 1 further comprising treating the identifierε so as to detach the tag components from the εolid supports and reacting said tag components with a moiety capable of detection by fluorescence or electron capture.
6. A method of Claim 5, wherein the detaching is done photochemically or oxidatively and the detectable moiety is derived from dansyl chloride or a polyhalobenzoylhalide.
7. A method according to Claim 5, wherein said tag components have two characteristics, a characteristic capable of separation and a characteristic capable of detection.
8. A method according to Claim 7, wherein said characteristic capable of detection is the ability to be detected by electron capture.
9. A method according to Claim 7, wherein said characteristic capable of detection iε the ability to be detected by maεε εpectroscopy.
10. A method according to Claim 7, wherein said characteristic capable of detection is radioactivity.
11. A method according to Claim 7, wherein said characteristic capable of detection is fluorescence.
12. A method according to Claim 7, wherein said tags may be separated by means of chromatography.
13. A kit comprising a plurality of different separated organic compounds, each of the compounds characterized by having a distinguishable composition, encoding at least one bit of different information which can be determined by a physical measurement and sharing at least one common functionality.
14. A kit of Claim 13 comprising at least 4 different functional organic compounds.
15. A kit according to Claim 13, wherein said functional organic compounds are of the formula: F1F2CEC where F1F2 is a linker which allowε for attachment to and detachment from a εolid particle; and CEC' iε a tag which can be determined by a phyεical measurement.
16. A kit according to Claim 15, wherein said functional organic compounds differ by the number of methylene groups and/or halogenε, nitrogenε or sulfurs preεent.
17. A kit according to Claim 15 wherein the CEC' portion can be removed photochemically.
18. A kit according to Claim 15 wherein the CEC' portion can be removed oxidatively, hydrolytically, thermolytically, or reductively.
19. A solid support characterized by having a ligand bound thereto and having a combination of identifiers bound to said solid support.
20. A solid support according to Claim 19, wherein said ligand is an oligomer which is an oligopeptide, oligonucleotide, oligosaccharide, polylipid, polyester, polyamide, polyurethane, polyurea, polyether, poly (phosphorus derivative) which is a phosphate, phosphonate, phosphora ide , phosphonamidey, phosphite, or phosphinamide, poly (sulfur derivative) which is a sulfone, sulfonate, sulfite, sulfonamide, or εulfenamide, where for the phosphorous and sulfur derivatives the indicated heteroatom for the most part will be bonded to C, H, N, O or S, and combinations thereof.
21. A solid support according to claim 19 wherein said ligand iε a nonoligomer which iε heterocyclic, aromatic, alicyclic, or aliphatic, and combinationε thereof.
22. A solid support of Claim 21 wherein the nonoligomer is a diazabicyclic, an azatricyclic, or a branched amide compound.
23. A solid support of Claim 19 wherein the ligand is linked to the support through a nonlabile linkage.
24. A εolid εupport of Claim 19 wherein the ligand iε linked to the support through a cleavable linkage.
25. A solid εupport according to Claim 19, wherein the identifier comprises tags, the tags being radioiεotopeε, or haloalkyl or haloarylallyl containing compoundε.
26. A solid support of Claim 19 which is a bead of about 102000 μm in diameter, and wherein the identifiers comprise tag components which after cleavage from the bead can be separated by gas chromatography and or liquid chromatography detected by electron capture, mass spectroscopy, fluorescence, or atomic emisεion techniques.
27. A library comprising a plurality of solid supportε according to claim 22.
28. A library of Claim 27, wherein the final productε have been cleaved from the solid support.
29. A library of Claim 28, wherein the final products are a diazabicyclic, azatricyclic, or branched amide compoundε.
30. A process for identifying compounds having a characteristic of interest which comprises screening a library of Claim 27.
31. A procesε of Claim 30, wherein the compoundε have been cleaved from the εolid εurface.
32. A process of Claim 31, wherein the compound is a diazabicyclic, azatricyclic, or branched amide compound.
33. A method for producing a ligand involving a reaction serieε employing a method for recording the reaction history of a reaction series on each of a plurality of unique solid supportε, wherein said reaction εerieε involveε at least two stages requiring differing agents and/or reaction conditions resulting in a different modification as to a plurality of said unique solid supportε, resulting in a plurality of different final products on different unique solid εupportε, employing a combination of identifierε for recording said reaction history, said identifiers characterized by defining the choice of agent or reaction condition and the stage in εaid εerieε and being capable of being analyzed aε to the choice and stage, said method comprising: reacting, at a first or intermediate stage of said series, a different agent or employing a different reaction condition with each of a group of said unique solid supports, said group comprising at least one of said unique solid supportε, and a combination of identifierε wherein said combination of identifiers defines the choice of agent and the stage in said series as to each group of εaid unique εolid εupportε, each of said identifiers being individually bound to εaid unique solid support directly or through other than a prior identifier; mixing said groups together and then dividing εaid plurality of unique solid supports into a plurality of groups for a second intermediate or final stage; repeating said reacting at least once to provide a plurality of ligands, having different products on the different individual unique solid surfaceε; and identifying εaid reaction history of at least one selected unique solid surface by means of said combination of identifiers.
34. A ligand according to Claim 33, wherein εaid identifying includeε the εtage of screening said ligands for a characteristic of interest.
35. A method for producing a ligand involving a reaction serieε employing a method for recording the reaction history of a reaction series on each of a plurality of unique solid surfaces, wherein said reaction series involves at least two εtageε requiring differing agentε and/or reaction conditions resulting in a different modification as to each of a plurality of said unique solid surfaceε, reεulting in a plurality of different ligandε on different unique solid surfaces, employing combinations of identifierε for recording εaid reaction history, said combination of identifierε characterized by defining the choice of agent and/or reaction condition and the εtage in said series and being capable of being analyzed as to the choice and stage, said method comprising: reacting, at a first or intermediate stage of said serieε, a different agent and/or employing a different reaction condition with each of a group of said unique solid surfaceε, said group comprising at least one of said unique εolid εurfaceε, and a combination of identifierε wherein εaid combination of identifierε defineε the choice of agent and the εtage in εaid εerieε aε to each group of said unique solid εurfaceε, each of εaid identifierε being individually bound to εaid unique solid surface through other than a prior identifier by a cleavable link; mixing said groups together and then dividing said plurality of unique εolid εurfaceε into a plurality of groupε for a εecond intermediate or final εtage; repeating εaid reacting to provide a plurality of ligandε having different ligandε on the different individual unique solid surfaceε; εcreening the ligandε from a plurality of each of εaid unique εolid surfaces for a characteristic of interest; and identifying said reaction history of at least one selected unique εolid εurface having ligand having εaid characteristic of interest by detaching the tag members from said unique solid surface and identifying said tag members by means of a differing characteristic.
36. A method according to Claim 35, wherein said tags differ in an homologous series and are detected by electron capture gas chromatography or mass spectroscopy.
37. A compound of the Formula I: FI_F2_CEC' I where F1F2 is a linker which allows for attachment to and detachment from a support; and CEC' is the tag which iε capable of analysis; E iε a tag component which allows for detection, or allows for detection and provides for separation aε a reεult of variable εubεtitution; C and C' are tag componentε which allow for individual detection; F2 iε a linking component capable of being εelectively cleaved to releaεe the tag componentε; and F1 iε a functional group which allows ready attachment of the compound to a synthesis support.
38. A compound of Claim 37 having the formula: F1F2(C(EC')a)b wherein: F1 is C02H, CH2X, OH, CHN2, SH, C(0)CHN2, S(02)C1, S(02)CHN2, N3, N02, NO, S(0)N3, OC(0)X, C(0)X, NCO, or NCS; F2 is Qfλ . ^ ■ «<»v Si(R )— CtE1), — C(K ., "°yE — Si(CE3 (CR'2)7A— i^(CB'*e)2A Q A , — CE1==CB_C(RJ)A H§C S C(R )2A" 'C(X)R CfR1)*, A CCOHJB5 C(R1)2A— . — C(0H)R — C(CH2X)R 1 i C(OH)R C(H )2 C(X)R" C(OH)(CH2CH2X) with the proviso that when F2 iε a bond, F1 iε OH or COOH; A iε 0, OC(0)0, OC(0), or NHC(O); C iε a bond, C^C20 alkylene optionally εubεtituted by 140 F, Cl, Br, C1C6 alkoxy, NR4R4, OR4, or NR4, or [(C(R4)2)mYZY(C(R4)2)nYZY]p; with the proviso that the maximum number of carbon atoms in C+C' is 20; C _ iε H; F; Cl; C.C20 alkylene optionally εubεtituted by 140 F, Cl, Br, C,C6 alkoxy, NR4R4, OR4, or NR4, or [(C(R4)2)_YZY(C(R4)2)nYZY]p; With the Proviso that the maximum number of carbon atoms in C+C' is 20; E is C.C10 alkyl subεtituted by 120 F, Cl or Br; or Qaryl wherein the aryl iε εubεtituted by 17 F, Cl, N02, S02R5, or εubstituted phenyl wherein the substituent is 15 F, Cl, N02, or S02R5; EC' may be H, OH, or amino; R1 is H or C1C6 alkyl; R3 is C=0, C(0)0, C(0)NR1, S, SO, or S02; R4 is H or C1C6 alkyl; R5 iε C,C6 alkyl; a iε 15; b iε 13 ; m and n iε each 020; p iε 17; Q iε a bond, O, S, NR4, C=0, C(0)NR5, NR5C(0), C(0)0, or OC(O); X iε a leaving group εuch as Br, Cl, triflate, mesylate, toεylate, or 0C(O)OR5; Y iε a bond, 0, S, or NR4; Z is a bond; phenyl optionally substituted by 14 F, Cl, Br, C1C6 alkyl, C1C6 alkoxy, C1C6 alkyl εubstituted by 113 F, Cl, or CjC6 alkyloxy subεtituted by 113 F, Cl, or Br; (C(R4)2) ._20; or (CF2).,..20; with the proviso that when Z is a bond one of its adjacent Y's iε also a bond and aryl is a mono or bicyclic aromatic ring containing up to 10 carbon atoms and up to 2 heteroatoms selected from 0, S, and N.
39. A compound of Claim 38 wherein: is C02H. 0E, CHN2, C(0)CH C(0)X, NCS. or CE2X: CH2A" C and C' is each independently ^C.,,. alkylene unsubstituted or substituted by 1.
40. F or Cl, or [0 (CH2)23*V E iε C.C10 alkyl substituted by 120 F or Cl ; Qaryl where aryl is a bicyclic aromatic ring subεtituted by 17 F or Cl; or Qphenyl εubεtituted by 15 F, Cl, iε a bond, 0, NR5C(0), or OC(O) .
41. A compound of Claim 38 having the formula: X (CHj)!.ιi~°0 .~Ar )0.ls(cr2)1.lsr wherein Ar iε pentafluoro pentachloro, or pentabromophenyl, 2,3,5,6tetrafluoro4(2,3,4,5,6 pentafluorophenyl)phenyl, 2,4,6trichlorophenyl, 2,4,5trichlorophenyl, 2,6dichloro4fluorophenyl, or 2,3,5,6tetrafluorophenyl.
42. A compound of Claim 38 wherein: EC' iε H, OH, or NH2.
43. A compoεition of the formula SF1'F2CEC' wherein: S is a soluble or solid support; CEC'is the tag which is capable of analysiε where E iε a tag component which (a) allowε for detection, such as an electrophoric group which can be analyzed by gas chromatography or masε spectroscopy or (b) allows for detection and for separation aε a reεult of variable εubεtitution; C and C' are tag componentε which allow for distinguishing one tag from all other tags, usually allowing for separation as a reεult of variable length or substitution, for example, varying the chromatographic retention time or the masε spectroscopy ratio Z/e; F2 is a linking component capable of being selectively cleaved to release the tag; and F1' is a functional group which provides for attachment to the support.
44. A composition of claim 43 wherein: S is a capillary, hollow fiber, needle, solid fiber, cellulose bead, poreglass bead, silica gel, polystyrene bead optionally crosεlinked with divinylbenzene, grafted copoly bead, polyacrylamide bead, latex bead, dimethylacrylamide bead optionally crosslinked with N,N'biεacryloyl ethylene diamine, glaεε particleε coated with a hydrophobic polymer, or low molecular weight noncrosslinked polystyrene; and F1'F2CEC' is the residue of Formula I attached to S.
45. The method of Claim 1, wherein the combination of identifiers defines a binary coding scheme.
46. The method of Claim 1, wherein the identifiers are of Claim 37.
47. The method of Claim 1, wherein the identifiers are of Claim 38.
48. The method of Claim 1, wherein the identifiers are of Claim 39.
49. The method of Claim 1, wherein the identifiers are of Claim 42.
50. The method of Claim 1 further comprising detaching the tag members from said unique solid surfaceε.
51. The method of Claim 50 wherein the tag members are detached photochemically, oxidatively, hydrolytically, thermolytically, or reductively.
52. The method of Claim 1 further comprising detaching nonoligomer ligands from said unique solid surfaceε photochemically.
53. A compound of the formula wherein: P is a polystyrene resin; IXa.f iε a plurality of reεidueε of the formula wherein: n is 1 6; R is CH3, CH(CH3)2, CH2C02H, (CH2)4NH2, CH2C6H4OH, or CH2C6H5; and R1 is H, CH3, C2H5, CH2CH=CH2, or CHC6H5.
54. A method of syntheεizing a chemical compound εo that the εtructure of the compound iε readily determinable, which compriεeε εyntheεizing the compound on the εurface of a εolid εupport under conditionε εuch that the εolid εupport at the completion of the synthesiε of the compound haε bound to it a plurality of identifiers which encode the reaction stages associated with the synthesiε of the compound.
55. A method of synthesizing a library of chemical compounds so that the structure of each compound in the library is readily determinable which comprises synthesizing each compound on the surface of a unique solid support under conditions such that each such unique support at the completion of the syntheεiε of the library of compoundε has bound to it a plurality of identifiers which encode the reaction εtageε associated with the synthesis of the compound synthesized on such solid support.
56. A method of determining the structure of a chemical compound which compriseε εyntheεizing the compound by the method of claim 54 or 55, iεolating the solid support upon which the compound waε εyntheεized, treating the solid support so isolated so aε to cause the tag components of each of the identifiers bound to the solid support to be released, determining the identity or quantity or both of each tag component so released, and deriving the structure of the compound from the identities or quantities or both of all such tag components.
57. A method of identifying a compound having a desired characteristic which comprises synthesizing a library of chemical compounds by the method of claim 55, εeparately teεting each of the compoundε in the resulting library in an assay which identifies compounds having the desired characteristic so as to identify any compounds present in the library which has the desired characteristic.
58. A method of claim 57, further comprising determining the εtructure of the compound εo identified.
59. A library of chemical compoundε, each compound in the library being bound to a unique εolid εupport and each such solid support having bound to it a plurality of identifierε which encode the reaction εtageε aεsociated with the synthesiε of the compound bound to εuch solid support.
60. A library of claim 59, wherein compounds in the library are diazabicyclic compoundε.
61. A library of claim 59, wherein compoundε in the library are azatricyclic compoundε.
62. A library of claim 59, wherein compounds in the library are branched amide compounds.
63. A library of claim 59, wherein compound in the library are peptides.
64. A method of identifying a compound having a desired characteristic which compriseε testing a library of chemical compounds according to claim 58 in an assay which identifies compounds having the desired characteristic so as to identify any compound present in the library which have the desired characteristic.
65. A method of claim 64, further comprising determining the structure of the compound so identified.
66. A compound identified by the method of claim 63.
67. A method of claim 64, wherein the desired characteristic is antagonism for the human neurokinin 1/brandykin receptor and the library of chemical compounds comprises azatricyclic compounds.
68. A method of claim 64, wherein the desired characteristic in usefulnesε as a muεcle relaxant, a tranquilizer or a sedative and the library of chemical compounds comprising bezodiazopineε.
69. A method of claim 64, wherein the desired characteristic is useful in the treatment of hypertension or Raynaud's syndrome and the library of chemical compounds comprises branched amides.
Description:
COMPLEX COMBINATORIAL CHEMICAL LIBRARIES ENCODED WITH TAGS

This application is a continuation-in-part of U.S. Serial No. 08/013,948, filed February 4, 1993, which is a continuation-in-part of U.S. Serial No. 07/955,371, filed October 1, 1992, the contents of both of which are hereby incorporated by reference into the subject application.

Introduction Technical Field The field of this invention concerns combinatorial chemistry which involves syntheses having a plurality of stages, with each stage involving a plurality of choices, where large numbers of products having varying compositions are obtained.

Background of the Invention

There is substantial interest in devising facile methods for the synthesis of large numbers of diverse compounds which can then be screened for various possible physiological or other activities. Typically such a synthesis involves successive stages, each of which involves a chemical modification of the then existing molecule. For example, the chemical modification may involve the addition of a unit, e.g. a monomer or synthon, to a growing sequence or modification of a functional group. By employing syntheses where the chemical modification involves the addition of units, such as amino acids, nucleotides, sugars, lipids, or heterocyclic compounds where the units may be naturally-occurring, synthetic, or combinations thereof. One may create a large number of compounds. Thus, even if one restricted the synthesis to naturally-occurring units or building blocks, the number of choices would be very large, 4 in the case of nucleotides, 20 in the case of the common amino acids, and essentially an unlimited number in the case of sugars.

One disadvantage heretofore inherent in the production of large number of diverse compounds, where at each stage of the synthesis there are a significant number of choices, is the fact that each individual compound will be present in a minute amount. While a characteristic of a particular compound, e.g. a physiological activity, may be deter inable, it is usually impossible to identify the chemical structure of this particular compound present.

Moreover, physiologically-active compounds have historically been discovered by assaying crude broths using Edisonian or stochastic techniques, where only a relatively few compounds are assayed at a time, or where a limited number of structural similar homologs of naturally-occurring physiologically-active compounds are assayed. Two of the major problems has been associated with the use of such crude broths, namely, the necessity to purify the reaction mixture into individual component compounds and the time-consuming effort required to establish the structure of the compound once purified.

To address these disadvantages and problems, techniques have been developed in which one adds individual units as part of a chemical synthesis sequentially, either in a controlled or a random manner, to produce all or a substantial proportion of the possible compounds which can result from the different choices possible at each sequential stage in the synthesis. However, for these techniques to be successful it is necessary for the compounds made by them to be amenable to methods which will allow one to determine the composition of a particular compound so made which shows a characteristic of interest.

One such approach involves using a chip which allows for separate analysis at physically separate sites on the surface of the chip (Fodor et al., Science 251: 767

[1991]). By knowing what reactant is added sequentially at each such site, one can record the sequence of events and thus the series of reactions. If one then subjects the chip to a screening method for a particular desired characteristic and detects the characteristic one can really determine the compound synthesized at the site which demonstrates that characteristic.

Another such technique involves the theoretical synthesis of oligonucleotides in parallel with the synthesis of oligopeptides as the compounds of interest (Brenner and Lerner, PNAS USA [1992] 8-1: 5381-5383) .

Further techniques are also disclosed in the following publications: A oto, Science (1992) 257. 330-331 discusses the use of cosynthesized DNA labels to identify polypeptides. Lam, et al., Nature (1991) 354. 82-84 describe a method for making large peptide libraries. Houghton, et al., Nature (1991) 354, 84-86 and Jung and Beck-Sickinger, Angew. Chem. Int. Ed. Engl. (1992) 91, 367-383 describe methodology for making large peptide libraries. Kerr et al.. J. Amer. Chem. Soc. , (1993) 115. 2529-31 teach a method of synthesizing oligo er libraries encoded by peptide chains.

However, since methods such as the preceding typically require the additum of like moieties, there is substantial interest in discovering methods for producing compounds which are not limited to sequential addition of like moieties. Such methods would find application, for example, in the modification of steroids, antibiotics, sugars, coenzymes, enzyme inhibitors, ligands and the like, which frequently involve a multi-stage synthesis in which one would wish to vary the reagents and/or conditions to provide a variety of compounds. In such methods the reagents may be organic or inorganic reagents, where functionalities may be introduced or modified, side

groups attached or removed, rings opened or closed, stereochemistry changed, and the like. (See, for example, Bunin and Ellman, JACS 114, 10997 [1992].) For such a method to be viable, however, there needs to be a convenient way to identify the structures of the large number of compounds which result from a wide variety of different modifications. Thus, there is a need to find a way whereby the reaction history may be recorded, and desirably, the structures of the results compound identifed.

Finally as the size of a library compounds so synthesizd increases, known techniques of structure elucidation and product segregation introduce substantial inefficiencies and uncertainties which hinder the accurate determination of the structure of any compound identified as being of interest. Thus, there is a substantial need for new methods which will permit the synthesis of complex combinatorial chemcial libraries which readily permit accurate structural determination of individual compounds within the library which are identified as being of interest.

Finally, international applications W091/17823 and WO92/09300 concern combinatorial libraries.

Many of the disadvatnages of the previously-described methods as well as many of the needs not met by them are addressed by the present invention which, as described more fully hereinafter, provides marlad advantages over these previously-described methods.

Summarv of The Invention

Methods and compositions are provided for encoded combinatorial chemistry, whereby at each stage of the synthesis, a support such as a particle upon which a compound is being synthesized is uniquely tagged to define a particular event, usually chemical, associated with the synthesis of the compound on the support. The tagging is accomplished using identifier molecules which record the sequential events to which the supporting particle is exposed during synthesis, thus providing a reaction history for the compound produced on the support.

Each identifier molecule is characterized by being stable under the synthetic conditions employed, by remaining associated with the supports during the stage of the synthesis, by uniquely defining a particular event during the synthesis which reflects a particular reaction choice at a given stage of the synthesis, by being distinguishable from other components that may be present during assaying, and by allowing for detachment of a tag component which is discernible by a convenient, analytical technique.

The identifiers of this invention are used in combination with one another to form a binary or higher order encoding system permitting a relatively small number of identifiers to be used to encode a relatively large number of reaction products. For example, when used in a binary code N identifiers can uniquely encode up to 2 N different compounds.

Moreover, the identifiers of this invention need not be bound serially through a previous identifier but rather are individually bound to the substrate, either directly or through the product being synthesized. The identifiers are not sequencable. Furthermore, the identifiers contain

a cleavable member or moiety which permits detachment of a tag component which can be readily analyzed.

Conveniently, the combinatorial synthesis employs definable solid supports upon which reactions are performed and to which the identifiers are bound. The individual solid supports or substrates or substrates carrying the final product compounds may be screened for a characteristic of interest and the reaction history determined by analyzing the associated identifier tags.

DETAILED DESCRIPTION OF THE INVENTION

As used in this application the term "tag" or "T" means a chemical moiety which possesses two properties. First, it is capable of being distinguished from all other chemical moieties. Second, it is capable of being detected when present at IO "18 to IO '9 mole. These two properties may be embodied in a single chemical structure. Alternatively, these properties may be embodied in separate chemical structures which are linked together. In this latter case, one of the chemical structures, which may be designated C (or in the case of more than one such structure C, C, etc.) provides the property of rendering the tag distinguishable from other tags while the other chemical structure, E, provides the property of rendering the tag detectable and optionally may provide the property of rendering the tag separable from other tags.

As used in this application, the term "linker" or "L" means a chemical moiety which possesses three properties. First, it is attachable to a solid support. Second, it is attachable to a tag. Third, when it is attached to both a solid support and a tag, it is cleavable such that the tag may be released from the solid support. These three properties may be embodied in a single chemical structure. Alternatively, these properties are embodied in three chemical structures which are linked together. In this latter case one of the chemical structures, which may be designated F 1 , provides the property of rendering the linker attachable to the solid support; the second chemical structure, which may be designated V, provides the property of rendering the linker cleavable; and the third chemical structure which may be designed A' , provides the property of rendering the linker attachable to the tag. Desirably, the chemical structures V and A' are one and the same, in which case V-A' may be designated F 2 .

As used in this application, the term "identifier" means a chemical entity which includes both a tag and a linker. Thus, in the broadest sense an identifier may be represented by the formula L-T while specific embodiments of the identifier may be represented by the formulae F'-V- A'-T; F'-V-A'-C-E (or F'-V-A'-E-C) ; L-C-E (or L-E-C) ; and L-C-E-C .

As used in this application, the term "bound identifier" means an identifier attached to a solid support.

As used herein, the term "choice" means the alternative variables for a given stage in a combinatorial synthesis, such as reactant, reagent, reaction conditions, and combinations thereof. Where the term "stage" corresponds to a step in the sequential synthesis of a compound or ligand; the compound or ligand being the final product of a combinatorial synthesis.

The term "alkyl" includes linear, branched, and cyclic structures and combinations thereof. Thus, the term includes methyl, ethyl, propyl, isopropyl, butyl, sec- and tert-butyl, cyclopropyl, cyclobutyl, cyclopentyl, 2- methylcyclopropyl, and the like. Lower alkyl is C 1 -C 6 alkyl. Lower alkenyl is C--C 6 alkenyl of a linear, branched, or cyclic configuration and combinations thereof.

Unless otherwise indicated, it is intended that the definitions of any substituent (e.g.. R 1 R 2 , Z, etc.) in a particular molecule be independent of its definitions elsewhere in the molecule. Thus, NR 4 R 4 represents NHH, NHCH 3 , NHCH-CH 3 , N(CH 3 )-, etc.

Some of the compounds described herein contain one or more centers of asymmetry and may thus give rise to enantiomers, diastereoisomers, and other steroiεomeric

formε. The present invention is meant to include all such possible stereoisomers as well as their racemic and optically pure forms. Optically active (R) and (S) isomers may be prepared using chiral synthons, chiral reagents, or resolved using conventional techniques. When the compounds described herein contain olefinic double bonds, it is intended to include both E and Z geometric isomers.

The materials upon which the combinatorial syntheses of this invention are performed are referred to herein interchangeably as beads, solid surfaces, (solid) substrates, particles, supports, etc. These terms are intended to include: a) solid supports such as beads, pellets, disks, capillaries, hollow fibers, needles, solid fibers, cellulose beads, pore-glass beads, silica gels, polystyrene beads optionally cross- linked with divinylbenzene, grafted co-poly beads, poly-acrylamide beads, latex beads, dimethylacrylamide beads optionally cross-linked with N,N'-bis-acryloyl ethylene diamine, glass particles coated with a hydrophobic polymer, etc., i.e., a material having a rigid or semi- rigid surface; and b) soluble supports such as low molecular weight non-cross-linked polystyrene.

These materials must contain functionalities or must be able to be functionalized such that identifiers or product intermediates may be attached to them.

In addition, the following abbreviations have the indicated meanings: AcOH = acetic acid

BSA = bis(trimethylsilyl)acetamide CAN = cerium (iv) ammonium nitrate

DEAD diethylazodicarboxylate

DCM dichloromethane

DIC diisopropylcarbodiimide

DMF N,N-dimethylformamide

Fmoc 9-fluorenylmethoxycarbonyl

HOBt 1-hydroxybenzotriazole

PhMe toluene r.t. room temperature

TFA trifluoroacetic acid

THF tetrahydrofuran

The subject invention concerns the production of libraries of products, i.e. compounds, where the individual products or compounds present in the libraries may be physically separated from one another and may be screened for a characteristic of interest either bound to, or detached from, a solid support. By having serial syntheses, where at each stage of a synthesis each of the individual intermediates is treated in a variety of ways, a very large number of products is produced, each of which is present in a small amount, frequently less than 100 pmol, more frequently less than 10 nmol. Because of the small quantity of final product or compound so produced, identifying these products by isolating and structurally elucidating the products would generally not be feasible. Moreover, in sequential synthesis involving other than the addition of similar units, the analysis would be arduous if not impossible using the amount of product typically available. However, by associating each stage or combination of stages (e.g., "add reagent A" or "add reagent A, then reagent B, and heat to 100°C for 2 hrs.") of the serial synthesis with an identifier which defines the choice of variables such as reactant, reagent, reaction conditions, or a combination of these, one can use the identifiers to define the reaction history of each definable and separable substrate. The analysis of tags detached from the identifiers allows for ready

identification of the reaction history, at picomolar or lower concentrations, e.g. femtomolar or less. One can determine a characteristic of a product of a synthesis, usually a chemical or biological characteristic by variuε screening techniques, and then identify the reaction history and thereby the structure of that product, which has the desired characteristic, by virtue of the tags associated with the product.

The use of the instant multiple tag system avoids the necessity of carrying out a complicated cosynthesis which reduces yields and requires multiple protecting groups, and avoids the necessity of using sequencable tags which are necessarily chemically labile. Both the necessity of multiple protecting groups and the intrinsic instability of all known sequencable tagging molecules (i.e., nucleic acid or peptide oligomers) severely limit the chemistry which may be used in the synthesis of the library element or ligand.

Moreover, the use of a binary, or higher, multiple tag system reduces enormously the number of tags necessary to encode the reagent/reactant choice in any stage in a synthesis. For example, if a particular synthetic stage could be carried with 125 different choices for reagent, the binary system would require only 7 tags. This can make the difference between a practical encoding system and an impractical one, because it may not be feasible to obtain and use the large number of distinguishable tags required by other systems. With the binary system of the invention, 30 distinguishable tags are available and are sufficient to encode >10 9 different syntheses.

Importantly, the present method employs tags which are detachable from a ligand or compound synthesized also for the purpose of decoding. Such detachability also allows the tags to be distinguished on more than one basis; in

particular, they can be separated (e.g.. on the basis of chromatographic retention time) and then analyzed (e.g. , a second basis is a spectral property such as mass spectroscopy m/e, or electrophoricity) . Having multiple bases for distinction allows the encoding of large amounts of information with a small number of tags.

Detachment further allows tags to be detected at very low levels, because they can be removed from the support matrix on which the synthesis is effected and from the ligand synthesized, the presence of either of which could provide spurious background signals, e.g. by quenching fluorescence or the like.

Detachable tags are also amenable to rapid analysis by automated sampling systems, and allow for selective derivatization for detection via functional groups, eliminating any incompatibility between the detection moiety and the reaction conditions used in the synthesis.

Inherent in any tagging scheme is the requirement that the chemical characteristics of the tags and the chemical stages for their incorporation be compatible with the characteristics of the ligand and the stages in their synthesis, and vice versa. The advantage of tags that are generally unreactive, as exemplified hereinafter by the substituted- aryloxypolymethylene moieties, is a greater range of chemical transformations and chemical functionality that can be employed in synthesis of the ligands.

A further advantage of the chemically stable tags of this invention is their compatibility with a greater variety of rapid, convenient methods of separation and analysis, such as gas chromatography and mass spectrometry. Moreover, the organic tags of these inventions generally do not give specifically interact with biological receptors. Thus,

then tags will generally not give spurious results in biological assays and will generally not be modified by enzymes or other biological molecules.

Finally, the chemical stability of the present tags allows them to be detached by a wide variety of methods which improves sensitivity in their analysis as described above.

Thus, this invention provides methods and compositions for encoded combinatorial synthesis whereby at each stage of the synthesis one or more identifiers are provided which encode an event associated with a particle stage in the synthesis of a compound on a support or particle. This event comprises the choice of reactant and/or reaction conditions at that stage of the reactions where each such stage may involve one or more reactants which are the same or different under the same or different conditions, e.g. partial reactions, multiple additions, rate of addition, differing combinations of reagents, etc. In addition, groups of particles may be sequestered from other groups of particles and subjected to a different series of events at any time during the course of the sequential synthesis.

By providing N identifiers, each having M distinguishable states, M N different syntheses can be uniquely defined. In the case of M=2 where the two states could be the presence or absence of identifier, the synthesis would thus be defined by a base 2 or binary code. In the case of M=3 where the three states could be the presence of an identifier at two distinguishable concentrations or its absence, the synthesis would be defined by a base 3 code. Herein, such base M codes where M>2 are termed higher order codes. The advantage of higher order codes over a binary code is that fewer identifiers are required to encode the same quantity of information about the synthesis. The products which are produced will be defined as resulting from a serial synthesis. At each

stage in the synthesis, there is available a plurality of reactants and/or reagents and/or conditions, which result in a feature of the product in relation to an identifiable and usually separable entity, e.g. tag. In referring to reactants and reagents, it is intended that the reactant, for the most part, becomes incorporated into the product, e.g. an amino acid, nucleotide, nucleophile, electrophile, diene, alkylating or acylating agent, diamine, or any other synthon, etc. while a reagent may or may not become incorporated into the product, e.g. base, acid, heat, oxidizing or reducing agent, while both will be included under the term "agent". The synthesis may involve individual reactants which become incorporated into the product. Alternatively, a stage may involve one or more reactions which result in a modification of a reaction intermediate. In many cases, combinations of these possibilities will be involved.

Using a base 2 or binary code (M=2) and three identifiers (N=3) , as many as 8 (2 3 ) agents for a given stage in a synthesis may be encoded. If the three identifiers are represented as TI, T2, and T3 and the presence or absence of each identifier is represented as a '0' or '1' respectively, then eight different agents could be represented in a binary code as follows:

Similarly, even more information about the synthesis may be encoded by more identifiers. For example, 9 identifiers (N=3) and a base 2 code (M=2) would allow up to 2 9 or 512 different agent choices to be encoded. Using a base 3 code (M=3) and three identifiers (N=3) would allow as many as 27 (3 3 ) agent choices to be encoded. If

the three identifiers are represented as TI, T2 and T3 , and the absence of an identifier is represented as a '0', its presence at a quantity of -0.5 pmol/bead as a '!', and its presence of a quantity of -1.0 pmol/bead as a '2', then the 27 different agents could be represented by three identifiers in base 3 code as:

To make such higher order encoding schemes practical, one additional identifier at a given quantity (e.g., -1.0 pmol/bead) would be added to all members of the library to provide a standard against which the quantities of all identifiers would be measured. The quantities of the identifiers could be measured by gas chromatography or HPLC with a variety of detection methods. In the case of HPLC, quantities could be conveniently measured by scintillation counting if the identifiers were radioactively labeled by different quantities of a radionuclide such as tritium ( 3 H) . It would be particularly convenient to carry out the quantitation by measuring the H-to- K C ratio, thus using 14 C as a standard. In this way, as many as ten quantities of 3 H could be distinguished to create a base 10 or decimal code (M=10) which could encode enormous amounts of information with very few identifiers.

Products and Synthetic Strategies

For the most part, the products of the method of this invention will be organic compounds and removal of chemical units, reactions involving the modification or introduction of one or more functionalities, ring openings, ring closings, etc. Chemical units can take many forms, both naturally-occurring and synthetic, such as nucleophiles, electrophiles, dienes, alkylating or acylating agents, diamineε, nucleotides, amino acids, sugars, lipids, or derivatives thereof, organic monomers, synthons, and combinations thereof. Alternatively, reactions may be involved which result in alkylation, acylation, nitration, halogenation, oxidation, reduction, hydrolysis, substitution, elimination, addition, and the like. This process can produce non-oligomers, oligomers, or combinations thereof in extremely small amounts, where the reaction history, and composition in appropriate cases, can be defined by the present tags. Non-oligomers include a wide variety of organic molecules, e.g. heterocyclics, aromatics, alicyclics, aliphatics and combinations thereof, comprising steroids, antibiotics, enzyme inhibitors, ligands, hormones, drugs, alkaloids, opioids, terpenes, porphyrins, toxins, catalystε, as well as combinations thereof. Oligomers include oligopeptides, oligonucleotides, oligosaccharides, polylipidε, polyesters, polyamides, polyurethanes, polyureaε, polyetherε, poly (phoεphoruε derivativeε) e.g. phoεphateε, phoεphonates, phosphoramides, phosphonamides, phoεphiteε, phosphinamides, etc., poly (sulfur derivatives) e.g. sulfoneε, εulfonateε, εulfiteε, εulfonamides, εulfena ideε, etc., where for the phoεphorouε and sulfur derivatives the indicated heteroatom for the most part will be bonded to C, H, N, 0 or S, and combinations thereof.

Reactions may involve modifications at a variety of random sites of a central core molecular structure or

modifications at a specific site. For example, one may brominate a polycyclic compound, where bromination may occur at a plurality of sites or use a brominating agent which will be specific for a particular εite, e.g., N- bromosuccinimide. For the most part, reactions will involve single siteε or equivalent εiteε, for example, one of two hydroxyl groups of a glycol.

For the most part, the subject εyntheεiε will have at leaεt two εtageε where other than bifunctional compoundε are attached uεing the same linking functionality, e.g. amino acids and amide bonds, nucleotides and phosphate ester bonds, or mimetic compounds thereof, e.g., amrnoiso- cyanates and urea bonds.

The methods of the invention permit the variation in reaction at each stage, depending on the choice of agents and conditions involved. Thus, for amino acids, one may have up to 20 amino acidε involved using the common naturally-encoded amino acids and a much wider choice, if one wishes to use other amino acids, such as D-amino acids, amino acids having the amino group at other than the α-position, amino acids having different substituents on the side chain or substituents on the amino group, and the like. For the different nucleic acids, there will usually be up to 4 natural nucleic acidε uεed for either DNA or RNA and a much larger number iε available if one doeε not chooεe to use thoεe particular nucleic acidε. For the sugars and lipids, there are a very large number of different compoundε, which compoundε may be further increased by various substitutions, where all of these compounds may be used in the εyntheεiε. For individual organic compoundε the choice may be aεtronomically large. In addition, one may have mimetic analogε, where ureaε, urethaneε, carbonylmethylene groupε, and the like may εubstitute for the peptide linkage; various organic and inorganic groups may subεtitute for the phosphate linkage;

and nitrogen or sulfur may subεtitute for oxygen in an ether linkage or vice versa.

The synthetic strategies will vary with the nature of the group of products one wishes to produce. Thus, the strategy must take into consideration the ability to stage-wise change the nature of the product, while allowing for retention of the results of the previous stageε and anticipating needε for the future stages. Where the various units are of the same family, such as nucleotideε, amino acidε and εugarε, the synthetic strategies are relatively well-eεtabliεhed and frequently conventional chemiεtry will be available. Thuέ, for nucleotideε, phoεphoramidite or phosphite chemistries may be employed; for oligopeptides, Fmoc or Boc chemistries may be employed where conventional protective groups are used; for sugarε, the εtrategieε may be leεs conventional, but a large number of protective groups, reactive functionalities, and conditions have been establiεhed for the synthesis of polysaccharides. For other types of chemistries, one will look to the nature of the individual unit and either synthetic opportunities will be known or will be devised, as appropriate.

In some instances, one may wish to have the same or different blocks introduced at the same or different stages. For example, one may wish to have a common peptide functional unit, e.g. the fibronectin binding unit (RGDS) , a polysaccharide, e.g. Le x , an organic group, e.g. a lactam, lactone, benzene ring, olefin, glycol, thioether, etc. introduced during the synthesis. In this manner one may achieve a molecular context into which the variation is introduced. These situations may involve only a few stageε having the plurality of choices, where a large number of products are produced in relation to a particular functional entity. This could have particular application where one is interested in a large number of

derivativeε related to a core molecule or unit known to have a characteriεtic of interest.

In developing synthetic strategies, one can provide for batch synthesiε of a few compounds which would be prepared during the courεe of the combinatorial syntheεiε. By taking extreme exampleε, for example, εyntheεeε which might involve steric hindrance, charge and/or dipole interactions, alternative reaction pathways, or the like, one can optimize conditions to provide for enhanced yields of compoundε which might not otherwiεe be formed or be formed only in low yield. In thiε manner, one may allow for a variety of reaction conditions during the combinatorial syntheεiε, involving differenceε in solvent, temperatures, times, concentrationε, and the like. Furthermore, one may use the batch syntheεeε, which will provide much higher concentrations of particular products than the combinatorial synthesiε, to develop aεεayε to characterize the activity of the compounds.

Supports: Attachment and Detachment

The synthetic protocol requireε that one provide for a plurality of different reactionε involving different reactantε resulting in a plurality of different intermediates at each stage of the syntheεiε. While other techniqueε are available, thiε can be achieved most conveniently by employing small definable εolid εubεtrates, commercially available as beads, which can be readily mixed, separated, and serve as a solid subεtrate for the εequential εyntheεis. The solid substrates may be solid, porous, deformable or hard, and have any convenient structure and shape. In some inεtanceε, magnetic or fluorescent beads may be uεeful. The beads will generally be at least 10-2000 μm, usually at least 20-500 μm, more uεually at least 50-250 μm in diameter.

Any convenient composition can be used for the particles or beads, which bead composition will maintain its mechanical integrity during the various process stages, can be functionalized, has functional groupε or allows for reaction with an active specieε, allows for the serial εyntheεis as well as attachment of the identifiers, can be readily mixed and separated, and will allow for convenient detachment of the tags and products. Beads which may be employed include cellulose beadε, pore-glaεε beadε, silica gel, polystyrene beadε, particularly polyεtyrene beadε cross-linked with divinylbenzene, grafted co-polymer beads such as polyethyleneglycol/polystyrene, polyacrylamide beads, latex beads, dimethylacrylamide beads, particularly crosε-linked with N,N'-bis-acryloyl ethylene diamine and comprising N-t-butoxycarbonyl-^-alanyl-N'-acryloyl hexamethylene diamine, co poεites, such as glasε particleε coated with a hydrophobic polymer such as crosε-linked polyεtyrene or a fluorinated ethylene polymer to which iε grafted linear polyεtyrene; and the like. General reviewε of useful solid supports (particles) that include a covalently-1inked reactive functionality may be found in Atherton, et al., Prospectives in Peptide Chemistry r Karger, 101-117 (1981) ; Amamath, et al., Chem. Rev. 77:183-217 (1977); and Fridkin, The Peptides. Vol. 2, Chapter 3, Academic Press, Inc., (1979), pp. 333-363.

Depending upon the nature of the synthetic procedure or the assay of the final product, one or another bead may be more or less desirable. While beads are especially convenient, other solid supportε may alεo find uεe, εuch as capillaries, hollow fibers, needleε, εolid fiberε, etc. , where the εize of the εolid εupport allowε for the desired variation in reaction histories.

Depending upon the nature of the syntheεiε, the beadε may be functionalized in a variety of ways to allow for attachment of the initial reactant. Theεe may be linked

through a non-labile linkage εuch as an ester bond, amide bond, amine bond, ether bond, or through a sulfur, εilicon, or carbon atom, depending upon whether one wishes to be able to remove the product from the bead. Conveniently, the bond to the bead may be permanent, but a linker between the bead and the product may be provided which is cleavable such as exemplified in Table 1. Two or more different linkageε may be employed to allow for differential release of tags and/or products.

Depending upon the nature of the linking group bound to the particle, reactive functionalities on the bead may not be necesεary where the manner of linking allows for inεertion into εingle or double bonds, such aε is available with carbenes and nitrenes or other highly- reactive εpecies. In this caεe, the cleavable linkage will be provided in the linking group which joins the product or the tag to the bead.

Deεirably, when the product iε permanently attached, the link to the bead will be extended, εo that the bead will not εterically interfere with the binding of the product during εcreening. Variouε links may be employed, particular hydrophilic links, such as polyethyleneoxy, saccharide, polyol, esters, amides, combinations thereof, and the like.

Functionalities present on the bead may include hydroxy, carboxy, iminohalide, amino, thio, active halogen (Cl or Br) or pseudohalogen (e.g., -CF 3 , -CN, etc.), carbonyl, εilyl, tosyl, meεylates, brosylateε, triflateε or the like. In εelecting the functionality, εo e consideration should be given to the fact that the identifiers will usually also become bound to the bead. Consideration will include whether the same or a different functionality should be associated with the product and the identifier, as well aε whether the two functionalities will be

compatible with the product or identifier attachment and tag detachment stages, as appropriate. Different linking groups may be employed for the product, so that a specific quantity of the product may be selectively released. In some instances the particle may have protected functionalities which may be partially or wholly deprotected prior to each stage, and in the latter case, reprotected. For example, amino may be protected with a carbobenzoxy group as in polypeptide synthesiε, hydroxy with a benzyl ether, etc.

Where detachment of the product is desired, there are numerous functionalities and reactants which may be used. Conveniently, ethers may be used, where substituted benzyl ether or derivatives thereof, e.g. benzhydryl ether, indanyl ether, etc. may be cleaved by acidic or mild reductive conditions. Alternatively, one may employ /--elimination, where a mild base may serve to releaεe the product. Acetals, including the thio analogs thereof, may be employed, where mild acid, particularly in the presence of a capturing carbonyl compound, may serve. By combining formaldehyde, HC1 and an alcohol moiety, an o-chloroether is formed. This may then be coupled with an hydroxy functionality on the bead to form the acetal. Variouε photolabile linkageε may be employed, εuch aε o-nitrobenzyl, 7-nitroindanyl, 2-nitrobenzhydryl ethers or esters, etc. Esterε and amides may serve as linkerε, where half-acid eεters or amides are formed, particularly with cyclic anhydrides, followed by reaction with hydroxyl or amino functionalities on the bead, uεing a coupling agent such as a carbodiimide. Peptides may be used as linkers, where the sequence is εubject to enzymatic hydrolyεiε, particularly where the enzyme recognizeε a specific sequence. Carbonates and carbamates may be prepared using carbonic acid derivatives, e.g. phosgene, carbonyl diimidazole, etc. and a mild base. The link may be cleaved using acid, base or a strong reductant, e.g.,

LiAlH 4 , particularly for the carbonate esters. For a list of cleavable linkages, see, for example, Greene and Wutε, Protective Groupε in Organic Syntheεiε, 2nd ed. Wiley, 1991. The verεatility of the variouε εystems that have been developed allows for broad variation in the conditions for attachment of products and identifiers and differential detachment of products and tags, as desired.

The following table indicates various illustrative linking units (i.e. , F 2 in Formula I) and the manner in which they may be cleaved:

Table 1. Various illustrative linking units and the manner in which they may be cleaved.

(L) shows the point of attachment of the tag or product.

B =

' O(L)

ϊ = / \./ \

( )

(L)

L is the tag or product either directly bonded to the indicated atom or indirectly bonded through a linking group such as C(0)0, which linking group may provide a convenient functionality. R is H or lower alkyl.

Linker.

The choice of linker for the ligand will be part of the synthetic strategy, since the linking group may result in a residual functionality on the product. It will usually be difficult, but feasible, to further modify the product after detachment from the bead. In designing the synthetic strategy, one can use a functionality "to be retained in the product aε the point of attachment for the linking group. Alternatively, when permitted by the nature of the product, one could use a cleavage or detachment method which removes the linking functionality, e.g. , an arylthioether or silyl with a metal hydride or acid. Since in many cases the synthetic strategy will be able to include a functionalized site for linking, the functionality can be taken advantage of in choosing the linking group. In some instanceε it may be desirable to have different functionalities at the εite of linking the product to the εupport, which may neceεεitate using different modes of linking, which modes must accommodate either the same detachment method or different detachment methods which may be carried out concurrently or consecutively, e.g., irradiation with light and acid hydrolysis.

Of particular intereεt for binding the identifierε to the particle are carbeneε and nitrenes which can insert between a carbon and hydrogen atom to form a covalent bond, or into an olefinic bond to form a cyclopropane (in the case of carbene) or an aziridine (in the case of nitrene) .

With carbene or nitrene linking groups variouε subεtituted benzenes may be used, where the benzene is substituted with a group capable of providing a carbene: CHN 2 , C0CHN 2 , S0 2 CHN 2 ; or nitrene: N 3 , N0 2 , NO, S0 2 N 3 . The carbenes may be generated from diazoalkane derivativeε by photolyεiε, thermolysiε, or by treatment with low valent transition metal specieε, e.g., Rh(0Ac) 2 . The nitrene may be generated by photolyεis or thermolysis from azides; and from nitro, nitroεo and azideε by using tervalent phosphoruε compounds or low valent transition metals.

A group of linker moietieε (F 1 -F 2 -) of particular intereεt include 2-nitro-4-carboxybenzyloxy, 2-nitro-4- diazoacetylbenzyloxy, 4 or 5 azidomethylcarbonyl-2- ethoxyphenoxy, and 2-methoxy-4, or 5-carboxyphenoxy moietieε.

Illuεtrative compoundε where T represents the tag, Z represents a carbene or nitrene precursor or a carboxy group, and R is H or lower alkyl are as follows. For photochemical tag detachment (e.g. , with ultraviolet light at about 350 nm) : T 3-Z-2-nitrobenzyl ether, T 4-Z-2- nitrobenzyl ether, T 5-Z-2-nitrobenzyl ether, T 6-Z-2- nitrobenzyl ether, T 2-Z-4-nitrobenzyl ether, T 3-Z-4- nitrobenzyl ether, T 3-Z-2-nitrobenzyl carbonate, T 4-Z-2- nitrobenzyl carbonate, T 5-Z-2-nitrobenzyl carbonate, T 6- Z-2-nitrobenzyl carbonate, T 2-Z-4-nitrobenzyl carbonate, and T 3-Z-4-nitrobenzyl carbonate. For oxidative detachment (e.g., uεing eerie ammonium nitrate) : 1-0T-2- OR-3-Z-benzene, l-OT-2-OR-4-Z-benzene, 1-OT-2-OR-5-Z- benzene, l-OT-2-OR-6-Z-benzene, l-0T-4-0R-2-Z-benzene, and l-OT-4-OR-3-Z-benzene. For reductive or alkylative detachment (e.g. with lithium/ammonia or methyl iodide) : T (2-Z-phenyl)thioether, T (3-Z-phenyl)thioether, and T (4-Z-phenyl)thioether. For deεilylative detachment (e.g., uεing tetrabutyl ammonium fluoride or acid) : T dialkyl- (2-Z-phenyl)εilyl ether, T dialkyl-(3-Z-phenyl) εilyl

ether, T dialkyl-(4-Z-phenyl) silyl ether, T-dialkyl-(2-Z- phenyl)εilane, T-dialkyl-(3-Z-phenyl) εilane, and T- dialkyl-(4-Z-phenyl) εilane.

Combinatorial Svntheεis

The synthesiε will uεually involve εtageε involving at leaεt 2 choiceε, frequently at least 4 choices, and may involve 10 choices or more. Generally, the number of choiceε per εtage will not exceed about 100, more uεually not exceed about 50. The number of εtageε will uεually be at least about 3, more usually at least about 4, frequently at least 5, and not more than about 30, more usually not more than about 25, preferably not more than about 20, more preferably not more than about 10, frequently not more than about 8.

The number of choices and stageε will usually result in at leaεt a number of compounds which allows for a sufficient variety to provide a reasonable likelihood that at least one compound will have the characteristic of interest.

The subject methodology allows for producing greater than 25,000 compounds, uεually greater than 50,000 compoundε, preferably greater than 200,000 compounds, and a million or more may be produced. This will usually mean at least 20 compounds but may be 10° or more.

In some syntheses, a stage may only involve one or two choices, but this situation will usually be limited in relation to the number of compounds one wishes to produce and the particular synthetic strategy. In many of the strategieε, the reεtricted number of choiceε, i.e., fewer than 5 choiceε, more uεually 2 or fewer choiceε, will be limited to the greater of 40% of the total number of stages or about 2 stageε in the sequential synthesiε, more usually limited to 20% of the total number of stages.

Reaction Procedure.

In carrying out the synthesiε, one may initially begin with a number of beads, usually at leaεt IO 3 , more uεually at least IO 4 , and desirably at least IO 5 , while generally not exceeding at least IO 15 , more usually not exceeding at least IO 10 . Depending upon the number of choices in the first stage, one will divide up the particles accordingly into as many containers. One can use microtiter well plates, individual containers, columns, gels, Terasaki plates, flasks, Merrifield syntheεiε vessels, etc. The particleε will uεually be divided up into groupε of at leaεt one particle each, usually a plurality of particles, generally 1000 or more, and may be 10 5 or more depending on the total number of particles and choices involved in the stage.

One would then add the appropriate agents to each of the individual containers to procesε them in εtageε and add the identifiers which encode the reagent and stage. Each stage would provide the desired reaction. Once the reaction(s) is complete, one may wish to wash the beads free of any reagent, followed by combining all of the beads into a single mixture and then separating the beads according to the number of choices for the next stage.

This procedure of dividing beads, followed by the tagging and εyntheεis stageε (or vice verεa) , and then recombining beads is iterated until the combinatorial synthesiε is completed.

In some instances, the same reaction may be carried out in 2 or more containerε to enhance the proportion of product having a particular reaction at a particular stage as compared to the other choices. In other instanceε, one or more of the stages may involve a portion of the beads being set aside and undergoing no reaction, so as to enhance the variability associated with the final product. In other εituationε, batcheε may be taken along different synthetic pathways.

In order to record or encode the syntheεis history on the beads, in one embodiement C or c. or both may be present and subεequent attachment of C excludes the presence of cat each stage one would tag the beadε aεεociated with each choice and stage with their own unique combination of identifiers. Alternately one may use a single tag to record or enclode ths εynthesis history. Depending on the chemistrieε involved, thiε tagging may be done prior to, after, or concomitantly with the reactions which comprise each choice. Further, as a control, sample beads may be picked at any stage and a portion of their tags cleaved off and decoded to verify that the correct tags are bound to the sample beadε.

Aε indicated previouεly, in εome instances, portions of the particles will be segregated into subsets, where each of the subsets would then undergo a different reaction series. At any time, the portions may be recombined into a single mixture for subsequent reaction. For example, if at one stage one introduces unsaturation, one could provide two subsets, where in one subεet the unεaturation iε reduced, while in the other subset the unsaturation is epoxidized. These two subsets could then be subjected to different reaction serieε.

After εyntheεiε of the productε iε complete, they are screened for a desired property either after detachment of the ligand from the bead or while still attached. In the latter case, beads, for example, may be incubated in aqueous buffer with mouse monoclonal antibody Y. After incubation and washing, the beads are incubated with alkaline phoεphatase-conjugated rabbit (or goat) polyclonal antibody directed against mouse antibodies. Using a fluorescent precipitation developing reagent, fluorescent beads with attached monoclonal antibody are identified and manually separated from the majority of clear, unstained beadε. Alternatively, the fluorescent

beadε can be εeparated using a fluorescence-activated cell sorter, so long as the tags are retained on the bead under the conditions of sorting. Each selected fluorescent bead is subjected to a means for releasing at leaεt εome of the tagε from the bead.

In instances where the syntheεiε doeε not involve the εtagewiεe addition of like unitε, or where reaction byproducts are formed, there may be instanceε where there will be a plurality of compounds on a single bead or the structure of the active compound cannot be known from its reaction history. In accordance with the εubject invention, by knowing the reaction hiεtory, one may repeat the εynthesis on a larger scale so as to obtain a sufficient amount of the product(ε) to iεolate the product(ε) and εtructurally identify the active compound.

The εubject methodology may be illuεtrated using variouε reaction sequences. For example, barbiturates may be prepared by combining an aldehyde or ketone with an acetate ester to prepare a crotonate under Claisen conditions to provide an unsubstituted to tetrasubstituted crotonate. The crotonate may then be combined with a second acetate under Michael conditions, whereby a glutarate may be obtained having up to 6 εubεtituentε. The glutarate may then be combined with ammonia or monoεubεtituted amine to provide the barbiturate. By varying the aldehydes and ketones, the acetates and the amines, a great variety of barbiturates may be obtained. Where functionalities are present on one or more of the subεtituentε, εuch as amino, carboxy, hydroxy, thiol, and the like, these groups may be protected or modified as desired.

In another example described by Bunin and Ellman, J. Am. Chem. Soc, 114, 10997 (1992), benzodiazepineε are produced. One beginε the εyntheεis with different amino

protected εubεtituted 2-aminobenzophenones bound to individual particles through, for example a 4'-oxy group. To each different group of particles in different vesεelε, after deprotection, are added a different Fmoc-protected α-amino acid, either naturally occurring or εynthetic, under conditionε where a peptide bond is formed. After deprotection, internal cyclization is caused, followed by alkylation on nitrogen with an alkylating agent. In only three stageε, a very large number of benzodiazepineε may be prepared and the librarieε εcreened for tranquilizing or other activity.

A wide variety of drug analogε may be produced, εuch aε analogs of antihypertensive agents, e.g. enalapril; 3-blockers, e.g. propanolol; antiulcer drugs (H 2 -receptor antagonists) e.g. cimetidine and ranitidine; antifungal agents (cholesterol-demethylase inhibitors) e.g. isoconazole; anxiolytics, e.g. diazepam; analgeεics, e.g. aspirin, phenacetamide, and fentanyl; antibiotics, e.g. vancomycin, penicillin and cephalosporin ; antiinflammatories, e.g. cortisone; contraceptives, e.g. progeεtinε; abortifacientε, e.g. RU-456; antihiεtamineε, e.g. chlorphenamine; antituεεiveε, e.g. codeine; sedatives, e.g. barbitol; etc.

An illuεtrative εynthesis of cimetidine analogε could involve hydroxymethylεubεtituted hiεtidineε, and related heterocycles, where the remaining carbon atoms or nitrogen atomε could be further εubstituted or unsubstituted, α,ω-aminoalkylthiols, and substituted thioamidine esters, where the groups on nitrogen could be varied, such as nitro, cyano, hydroxy, alkyl, combinations thereof, and the like.

Identifier

The identifiers of this invention may be represented by the Formula I:

F 1 -F 2 -C-E-C' I where F 1 -F 2 is a linker which allows for attachment to a support and detachment of the tag from a support; and

C-E-C'is the tag which is capable of detection and distinguishability;

E iε a tag component which (a) allowε for detection, such as an electrophoric group which can be analyzed by gas chromatography or masε spectroscopy or (b) allows for detection and for separation; C and C' are tag components which allow for individual distinguiεhing one tag from all other tagε, uεually allowing for εeparation aε a result of variable length or εubεtitution, for example, varying the chromatographic retention time or the maεs spectroεcopy ratio m/e;

F 2 iε a linking component capable of being selectively cleaved to release the tag component; and

F 1 is a functional group which provided for attachment to the support; or F 2 is a bond when F 1 iε a cleavable group εuch as OH or carboxy.

Although the identifiers of Formula I are typically added at each appropriate stage and choice during the combinatorial syntheεiε, the portion E can be added at the end of the εyntheses either before or after cleavage (preferably photochemically or oxidatively) from the εubεtrate. Specifically, where C containε OH, NHR 4 , or SH, E can be attached to C prior to cleavage. Alternatively, if E iε attached after cleavage, the point of attachment at C may be where F 2 waε attached. Thiε is exemplified in the scheme on the following page:

where S = subεtrate and n = 1-40 Attachment of the identifier to the substrate can be represented as follows:

F 1 -F-C-E-C' + S > S-F 1, -F 2 -C-E-C' where F 1 '-F 2 -C-E-C' represents the identifier residue attached to the subεtrate. For example, when the bead iε functionalized with an aminomethyl group and F 1 is C0 2 H, then F 1 ' is -C(O)-; when the bead contains an unsaturated bond and F 1 is N-CH-C(O)-, then F 1 ' is =CH-C(0)- or -CH 2 -C(0)-.

Of particular interest for use as identifiers are compounds of Formula I of the Formula la:

F 1 -F 2 -(C(E-C') a ) b la

wherein:

F 1 iε C0 2 H, CH 2 X, NR 1 R 1 , C(0)R 1 , OH, CHN 2 , SH, C(0)CHN 2 ,

S(0 2 )C1, S(0 2 )CHN 2 , N 3 , N0 2 , NO, S(0 2 )N 3 , OC(0)X, C(0)X,

NCO, or NCS;

F 2

-NC(0)0" -CR CR (CR 2 ) 2 ( — CR 1= == CR 1 C(R 4 ) 2 — ,

-0*. -A —

— C(R CR= — , — 0~C(R ^A "

= CR •--cr - C(R ) A

— S — C(R ),A- C(X)R C(R ) 2 A-

C(0H)K C(R ) 2 A- — C(0E)R — C(CH 2 X)K-

C(0H)] -C(K )- > — C(X)K- — C(OH)(CH 2 CH 2 X)-

O

R

OR

— 0 on OR

A is -O , -OC (0) 0- , -OC (O) - , or -NHC ( O) - ;

C is a bond, C < - ZQ alkylene optionally substituted by 1-40 F, Cl, Br, C 1 -C 6 alkoxy, NR 4 R 4 , OR 4 , or NR 4 , or -[(C(R 4 ) 2 ) m -Y-Z-Y-(C(R 4 ) 2 ) n Y-Z-Y] p -; with the proviso that the maximum number of carbon atoms in C+C' is preferably 20; C _ is H; F; Cl; ^-C^ alkylene optionally substituted by 1-40 F, Cl, Br, C 1 -C 6 alkoxy, NR 4 R 4 , OR 4 , or NR 4 , or -[(C(R 4 ) 2 ) m -Y-Z-Y-(C(R 4 ) 2 ) n Y-Z-Y] p -; E is ^-C,-, alkyl substituted by 1-20 F, Cl or Br; or

Q-aryl wherein the aryl is substituted by 1-7 F, Cl, N0 2 , S0 2 R 5 , or substituted phenyl wherein the subεtituent iε 1-5 F, Cl, N0 2 , or S0 2 R 5 ; R 1 iε H or C 1 -C 6 alkyl; R 3 iε C=0, C(0)0, C(0)NR 1 , S, SO, or S0 2 ; R 4 iε H or C^-C 6 alkyl; R 5 iε C^C. alkyl; a iε 1-5; b iε 1-3; and n iε each 0-20; p iε 1-7 ; Q iε a bond, O, S, NR 4 , C=0, -C(0)NR 5 , -NR 5 C(0)-, -C(0)0-, or -OC(O)-; X iε a leaving group εuch aε Br, Cl, triflate, meεylate, tosylate, or OC(0)OR 5 ; Y iε a bond, 0, S, or NR 4 ;

Z is a bond; phenyl optionally subεtituted by 1-4 F, Cl,

Br, C 1 -C 6 alkyl, C 1 -C 6 alkoxy, C -C & alkyl εubεtituted by 1-13 F, Cl, or C,-C 6 alkyloxy εubstituted by 1-13 F, Cl, or Br; (C(R 4 ) 2 ) -._ 20 ; or (CF 2 ) 1.20 ; with the proviso that when Z is a bond one of its adjacent Y'ε iε alεo a bond; and aryl is a mono- or bi-cyclic aromatic ring containing up to 10 carbon atoms and up to 2 heteroatoms selected from O, S, and N.

In the definitions of F 2 in Formula la, the left-hand bond as depicted attaches to F 1 .

Also useful as identifiers are compounds of the Formula la';

F 1 -(C(E-C') a ) b la' wherein: F 1 is OH or COOH; and the remaining definitions are as in Formula la.

Preferred compounds of Formula la are those wherein; lε

? 2

CH 2 A—

C and C' is each independently C^C^ alkylene unsubstituted or subεtituted by 1-40 F or Cl, or [0-(CH 2 ) 2 _ 3 ] ;

E iε C.,-C 10 alkyl εubεtituted by 1-20 F or Cl; Q-aryl where aryl iε a bi-cyclic aromatic ring εubεtituted by 1-7 F or Cl; or Q-phenyl εubstituted by 1-5 F, Cl, N0 2 , or S0 2 R 5 ; and Q is a bond, O, -NR 5 C(0)-, or -OC(O)-.

Preferred compounds of Formula la are those wherein -C(E-C') a is repreεented by -(CH 2 ) 3.15 -(CF 2 ) ,. 15 F,

-(CH 2 ) 3.15 -(CC1 2 ) ,.-,..01, -(CH 2 CH 2 -0) ,. Ar,

-(CH 2 CH 2 CH 2 0) 1.5 -Ar, or -(CH 2 ) ^^-O-Ar; wherein Ar is pentafluoro- pentachloro-, or pentabromophenyl, 2,3,5,6-tetrafluoro-4 (2,3,4,5,6-

pentafluorophenyl)phenyl, 2,4,6-trichlorophenyl, 2,4,5-trichlorophenyl, 2,6-dichloro-4-fluorophenyl, or 2,3,5,6-tetrafluorophenyl.

Other preferred compounds of Formula la are represented by the formulae:

) 1.l5 r

wherein Ar is pentafluoro- pentachloro-, or pentabromophenyl, 2,3,5,6-tetrafluoro-4 (2,3,4,5,6- pentafluorophenyl)phenyl, 2,4, 6-trichlorophenyl, 2,4,5-trichlorophenyl, 2, 6-dichloro-4-fluorophenyl, or 2,3,5,6-tetrafluorophenyl.

Other preferred compounds of Formula la are those wherein E-C' is H, OH, or NH 2 . Such compounds are particularly useful for reaction with an E at the end of the combinatorial synthesis, especially with an E detectable by fluorescence or electron capture, such as dansyl chloride or polyhalobenzoylhalide.

The compounds of Formula I can be prepared according to the following exemplary schemes or other means known to those skilled in the art.

SCHEME 1

Identifier Tag Preparation

X ELECTROPHOEIC PHEIOL. ArOB X = Cl OR F

SCHEME 2

Identifiers With Photolvtic Cleavage Linkers

PYRIDINE. CH 2 C1 2

8CHEME 3

Identifiers With Oxidative Cleavage Linkers

TOUEHE

(CK 2 .Ar

SCHEME 4

Identifierε With Alternative Oxidative Release Linkers

PPh 3 . DEAD, TOLUEHE

r l. -OIn0 4 , ACETOKE/VATEB '

2. OXALOYL C10RIDE, cat. Dlff. CH 2 C1 2

CH 7 » 2

E^O/CH-^Cl-,

SCHEME 5

E-C Tags

SCHEME 6

Identifiers With Photolvtic Cleavage Linkerε

1. C0CL 7 . TOLUENE

E— C TAG

PYRIDINE, CH 2 C1 2

SCHEME 7

Identifiers With Oxidative Cleavage Linkers

(CH 2 )--CH 3

The identifier may comprise one or a plurality of identical tags. The identifiers will be individual chemical compound(s) which may be distinguiεhed one from the other and will uniquely identify different choices and stages. In this manner, very large combinatorial libraries may be prepared with a relatively small number of identifiers, usually fewer than 50 tags. During each stage, a combination of identifiers will be added, which defines the stage and choice. Each identifier will be bound, either covalently or non- covalently to the bead or to the product, usually the bead. Combinations of identifiers are uεed to provide a binary or other code at each εtage, whereby the choice and stage may be defined. The combination of identifiers may include zero or only one identifier.

Tags

So far as the tags (C-E-C') are concerned, the tags which are employed will be characterized as follows: by being removable from the bead by means depending on F 2 , preferably by photolysis or oxidation; by being individually differentiable, usually separable; by being stable under the synthetic conditions; by encoding both stage and choice so as to uniquely define the choice of agent used at each stage in the syntheεis; desirably, there should be an eaεy way to identify the variouε tagε with readily-available equipment which does not require sophisticated technical capabilities to operate; they should be relatively economical and provide a strong εignal baεed on a relatively few moleculeε; and the tagε εhould provide sufficient senεitivity to permit diεtinguishing the tags from the other components which may be present during the tag determinations.

The tags may be structurally related or unrelated, as in a homologous serieε, repetitive functional groups, related

me bers of the Periodic Chart, different isotopes, combinations thereof, or the like. The tags may be used as elements of a binary code, so that one tag can define two choices, two tags can define four choices, three tagε can define eight choiceε, five tagε can define thirty-two choiceε, etc. Thuε, at each εtage of the εynthesis, a relatively small number of tags can designate a much larger number of choices. The tags comprising the identifiers for each stage may or may not be related to other stages. Each tag for any combinatorial synthesiε must allow for being distinguiεhed from all other tagε. In thiε manner, very large combinatorial librarieε may be prepared with a relatively εmall number of tags, uεually fewer than 60 tagε, more uεually fewer than about 50 tags.

For each bead, there will usually be at least 0.01 femtomol, more usually 0.001-50 pmol, of each tag, although lesser or greater amounts may be used in special circumstances. The amount of product may also be at least in the same range and up to at leaεt 10 4 or more greater, uεually being at least 0.01 pmol, more usually at least 1.0 pmol and generally not more than about 10 n ol. Depending upon the number of beads, the number of stageε and the number of choiceε per εtage, the number of products produced will usually exceed 10 2 , more usually

10 3 , and may exceed 10 10 , usually not exceeding about IO 8 , preferably being in the range of about 10 4 to 10 8 , more usually 10 5 to 10°.

The tags will, for the most part, be organic molecules.

Each tag will usually have fewer than about 100 atoms, more usually fewer than about 80 atoms, generally fewer than about 60 atoms, other than hydrogen, excluding a linking moiety which would not be retained on release of the tag from the bead. The linking moiety may be of any size, uεually being fewer than about 30 atomε, more usually fewer than 20 atoms, other than hydrogen. The

size of the linking moiety iε not critical, but one of convenience. The tagε may form familieε of compounds, where all of the compounds are of a similar nature or may be combinations of different families, where the compounds may be aliphatic, alicyclic, aromatic, heterocyclic, or combinations thereof. Distinguishing features may be the number of repetitive units, such as methylene groups in an alkyl moiety, alkyleneoxy groups in a polyalkyleneoxy moiety, halo groups in a polyhalocompound, α- and/or β- substituted ethylenes, where the εubstituents may involve alkyl groups, oxy, carboxy, amino, halo, or the like; isotopes; etc.

Tag Analvsiε

Tagε may be removed from the bead uεing reductive, oxidative, ther olytic, hydrolytic, or photolytic conditionε depending on the nature of the group F 2 ; for example, by oxidation of a catechol ether with eerie ammonium nitrate or by photolysis of a nitrobenzyl ether or ester or amide, or by other methods, e.g. as shown in Table 1.

Differentiation of tags can be achieved with physical differences, e.g. molecular weight of the tags or the chromatographic retention time using gas or liquid chromatography. Positional isomerε may have different retention time. If poεitional iεomerε or εteroiεomerε are inadequate for phyεical εeparation, then one could use varying numberε of εubεtituentε, e.g. halogens, such as fluorines, methyl groups, oxy groups, or other side chains in conjunction with differing numbers of units, e.g. methylene groups or ethyleneoxy groups, to provide the desired separation. Ratios of radioisotopeε could be uεed, where the radioiεotopes provide for differential emission, for example U C and 3 H. The physical differences,

particularly mass number, can provide information about choice and stage.

Instead of H C/ 3 H ratios, one could use combinations of non- radioactive isotopes, e.g. -CH m D n , where m is 0 and up to 3 and n is 3 minus m. For example, by detecting the varying amounts of up to four different methyl groups using maεε εpectroscopy, one could define a large number of choices.

When E is a bond and C' iε H, the tags obtained upon release from the support have an active functionality for reaction with a labeling reagent which introduces a detectable tag component E. Conveniently, the functionality could be a double bond, particularly an activated double bond, hydroxy, thio, amino, carboxy, etc. The tag would then be reacted with an exceεε of the labeling reagent to provide the product (E-C) for analyεiε. In this way a wide variety of labeling reagents could be used as part of the identifying system, which may not be compatible with the synthetic strategy for the product of interest. Labeling reagents which may be used for detection include haloaro atics (e.g., perfluorobenzyl bromide), fluoreεcerε (e.g., danεyl chloride), radioisotopes, chemilumineεcers, etc.

While exemplary tags and reactionε have been given, it should be understood that many other combinations could be employed.

Depending on the chemical and physical nature of the tags, an appropriate method for separation is chosen, desirably one of various chromatographic procedures including gas chromatography (GC) , liquid chromatography (LC) particularly high-performance liquid chromatography (HPLC) , thin layer chromatography (TLC) , electrophoresis, etc. Instead of chromatographic procedure, maεε

spectrometry may be employed for εeparation by mass number. Tags include: for GC: chemically inert organic moleculeε having different molecular weightε including alkaneε, alkenes, areneε, halocarbonε, ethers, alcohols, silanes, thioethers, etc., particularly halogenated compounds, with or without other functionalities, for electron capture detection or mass spectroscopy detection (MS) with capillary GC separation, and for compound with elements not normally found in organic chemistry (e.g., Sn, Ge) for atom emission detection with GC capillary seperation; for LC, HPLC or TLC: see above for GC, conveniently linear ethers or hydrocarbons with subεtitution by radioiεotopes or combinations of radioiεotopes for radioassay detection or suitable groupε for fluorescence detection after separation; for electrophoresis: see above, particularly functionalized charged molecules, e.g. cationic or anionic, particularly organic or inorganic acid groups, where the molecule may be further modified by having a detectable radioisotope or fluorescer for detection in the electrophoresis; for mass spectroscopy: see above, particularly different maεε numbers due to different isotopes, different numbers of the same functionality or different functionalities, different members of a homologous series or combinations thereof.

The separation of tags from one another may involve individual techniques or combinations of techniques, e.g. chromatography and electrophoresis; gas chromatography and masε spectroscopy; etc.

The tags of the present invention will have a property which allows detection at very low levelε, uεually not greater than nanomol, preferably picomol or less, more preferably femtomol or lesε, in the preεence of other

compoundε which may be preεent at significantly higher levels. For this reason, specific atomic substitutions may be used to render the labels easily detectable. Such subεtitutions include: (a) substitution by electronegative elements, e.g. fluorine or chlorine, for electron capture detection in conjunction with capillary GC or negative ion masε εpectroεcopy detection;

(b) εubεtitution by an uncommon element (excluding C, H, and O) for atomic e iεsion detection in conjunction with capillary GC;

(c) substitution by several uncommon elements for atomic emission detection to determine the ratio between the elements; (d) subεtitution by a radioactive element, e.g. 3 H, for detection by autoradiography or εcintillation counting in conjunction with LC, TLC or electrophoreεiε; (e) εubεtitution by a multiplicity of radioactive elements having differing emissions, e.g. 3 H and U C, for detection by autoradiography or scintillation counting to determine the ratio of the different radioactive elements.

For single-element substitution (a., b., d. above) a separable mixture of A tags whose simple presence or absence can be detected would encode up to 2 A different syntheεeε. For multiple-element substitution (see, c. and e. above) a separable mixture of A tags each having B distinguishable states (e.g., different 3 H/ C ratios, different Si/Sn ratios) would be able to encode for up to B A different εyntheεeε.

A wide variety of isotopes exist, where the presence or ratio of isotopes may provide information as to stage and choice. The isotopeε may be radioactive or non- radioactive. Isotopes of particular interest include deuterium, tritium, U C, 32 P, 131 I, etc.

By employing mixtures of isotopically-modified compounds, one can greatly expand the information obtained from a single tag compound which is only diεtinguished by the presence of isotopes. For example, one could prepare a mixture of ratios of hydrogen to deuterium, where the various ratios could differ by as little as 10% each. By replacing hydrogens with another atom, such aε fluorine, one would then have a varying mixture of hydrogenε, deuteriumε and fluorines, providing for a large number of different differentiable tags.

Other groups that may be involved could be aromatic rings, which are differentially substituted, as to position and functionality. Thus, by having subεtituted benzene rings, where the position of the subεtitution and the nature of the substitution can be determined, one can provide for a plurality of molecules which can be distinguished and can provide for both stage and choice information. For example, if C were constant one could detect and discriminate through the substitution pattern on E when E is a polyhalogenated aromatic ring.

There is also the posεibility to uεe fluoreεcent tagε. While fluoreεcent tagε alone may not be εufficient to define a εignificant number of stages with a significant number of choices, as referred to above, by providing for means for separating the fluorescent tagging molecules based on variations in C or C' , one can individually detect the tags by their fluoreεcence.

The mixture of tagε aεεociated with a particular bead may be detached and subject to an initial separation, where it iε desirable to detect each of the tags separately. Once the group of tags has been separated, each of the tags may then be analyzed based on its particular functionalities and distinctive properties. Various techniques which may be uεed to detect the particular tagε include

autoradiography or εcintillation counting, electron capture detection, negative or positive ion mass εpectroεcopy, infrared εpectroεcopy, ultraviolet εpectroscopy, electron spin resonance spectroεcopy, fluoreεcence, and the like.

Assays

To determine the characteristic of interest of the product, a wide variety of assays and techniques may be employed.

Frequently, in εcreening the beadε, one will uεe either single beadε or mixtureε of beadε and determine whether the bead or mixtures show activity. Thus, the mixtureε may involve 10, 100, 1000 or more beadε. In thiε way, large groupε of compoundε may be rapidly screened and segregated into smaller groups of compounds.

One technique is where one is interested in binding to a particular biomolecule such as a receptor. The receptor may be a single molecule, a molecule asεociated with a microsome or cell, or the like. Where agonist activity is of interest, one may wish to use an intact organism or cell, where the reεponse to the binding of the subject product may be measured. In some inεtanceε, it may be deεirable to detach the product from the bead, particularly where physiological activity by transduction of a εignal is of interest. Variouε devices are available for detecting cellular response, such as a microphysiometer, available from Molecular Deviceε, Redwood City, CA. Where binding iε of interest, one may use a labeled receptor, where the label iε a fluorescer, enzyme, radioiεotope, or the like, where one can detect the binding of the receptor to the bead. Alternatively, one may provide for an antibody to the receptor, where the antibody is labeled, which may allow for amplification of the signal and avoid changing the receptor of interest,

which might affect its binding tot he product of interest. Binding may alεo be determined by diεplacement of a ligand bound to the receptor, where the ligand is labeled with a detectable label.

In some instances, one may be able to carry out a two-stage screen, whereby one first uses binding as an initial screen, followed by biological activity with a viable cell in a second screen. By employing recombinant techniques, one can greatly vary the genetic capability of cells. One can then produce exogenous geneε or exogenouε transcriptional regulatory sequences, so that binding to a surface membrane protein will result in an observable signal, e.g. an intracellular signal. For example, one may introduce a leuco dye into the cell, where an enzyme which transforms the leuco dye to a colored product, particularly a fluorescent product, becomes expreεεed upon appropriate binding to a εurface membrane, e.g. 3-galactoεidaεe and digalactoεidylfluoreεcein. In this manner, by associating a particular cell or cells with a particular particle, the fluorescent nature of the cell may be determined uεing a FACS, εo that particleε carrying active compoundε may be identified. Variouε techniqueε may be employed to enεure that the particle remains bound to the cell, even where the product is releaεed from the particle. For example, one may use antibodies on the particle to a εurface membrane protein, one may link avidin to the εurface of the cell and have biotin preεent on the particle, etc.

Aεεays may be performed εtagewiεe uεing individual particleε or groupε of particleε or combinations thereof. For example, after carrying out the combinatorial syntheses, groups of about 50 to 10,000 particles may be segregated in separate vessels. In each vesεel, aε to each particle a portion of the product bound to the particle iε releaεed. The fractional releaεe may be aε a

reεult of differential linking of the product to the particle or using a limited amount of a reagent, condition or the like, εo that the average number of product moleculeε releaεed per particle iε leεε than the total number of product moleculeε per particle. One would then have a mixture of productε in a εmall volume. The mixture could then be used in an asεay for binding, where the binding event could be inhibition of a known binding ligand binding to a receptor, activation or inhibition of a metabolic proceεs of a cell, or the like. Various asεay conditionε may be uεed for the detection of binding activity aε will be described subsequently. Once a group is shown to be active, the individual particles may then be screened, by the same or a different assay. One could of course, have a three- or four-stage procedure, where large groups are divided up into smaller groups, etc. and finally single particles are screened. In each case, portions of the products on the particles would be released and the resulting mixture used in an appropriate assay. The asεays could be the same or different, the more sophiεticated and time conεuming aεεayε being uεed in the later or last stage.

One may also provide for spatial arrayε, where the particleε may be diεtributed over a honeycomb plate, with each well in the honeycomb having 0 or 1 particle.

The subject methodology may be used to find chemicals with catalytic properties, εuch aε hydrolytic activity, e.g. eεterase activity. For this purpose one might embed beads in a semiεolid matrix εurrounded by diffuεible teεt εubstrates. If the catalytic activity can be detected locally by processes that do not disturb the matrix, for example, by changes in the absorption of light or by detection of fluorescence due to a cleaved subεtrate, the beads in the zone of catalytic activity can be isolated and their labels decoded.

Inεtead of catalytic activity, compounds with inhibitory or activating activity can be developed. Compounds may be sought that inhibit or activate an enzyme or block a binding reaction. To detect beads that inhibit an enzyme, which beads have an attached product with this desirable property, it is advantageous to be able to release the products from the beads, enabling them to diffuse into a semisolid matrix or onto a filter where this inhibition, activation or blocking can be observed. The beads that form a viεualized or otherwise detectable zone of inhibition, activation or blocking can then be picked and the tags decoded. In this case it is necessary that a portion of the syntheεized productε be attached to the beads by cleavable linkages, preferably a photolabile linkage, while a portion of the tagε remain attached to the bead, releasable after picking by a different meanε than before.

A dialyεiε membrane may be employed where a layer of beadε iε separated from a layer of radiolabeled ligand/receptor pair. The bead layer could be irradiated with ultraviolet light and the product released from the bead would diffuse to the pair layer, where the radiolabeled ligand would be released in proportion to the affinity of the compound for the receptor. The radiolabeled ligand would diffuse back to the layer of beads. Since the radiolabel would be proximal to the bead, beads asεociated with radioemiεεion would be analyzed.

Of particular intereεt iε finding productε that have biological activity. In εome applications it is deεirable to find a product that haε an effect on living cells, such as inhibition of microbial growth, inhibition of viral growth, inhibition of gene expreεεion or activation of gene expression. Screening of the compounds on the beads can be readily achieved, for example, by embedding the beadε in a semisolid medium and the library of product

molecules released from the beads (while the beads are retained) enabling the compounds to diffuse into the surrounding medium. The effects, such as plaques with a bacterial lawn, can be observed. Zones of growth inhibition or growth activation or effectε on gene expreεεion can then be viεualized and the beadε at the center of the zone picked and analyzed.

One aεsay scheme will involve gels where the molecule or system, e.g. cell, to be acted upon may be embedded subεtantially homogeneously in the gel. Various gelling agents may be uεed εuch aε polyacrylamide, agaroεe, gelatin, etc. The particleε may then be εpread over the gel εo aε to have εufficient separation between the particles to allow for individual detection. If the desired product is to have hydrolytic activity, a substrate is present in the gel which would provide a fluorescent product. One would then screen the gel for fluorescence and mechanically select the particles associated with the fluorescent signal.

One could have cellε embedded in the gel, in effect creating a cellular lawn. The particles would be spread out as indicated above. Of course, one could place a grid over the gel defining areas of one or no particle. If cytotoxicity were the criterion, one could release the product, incubate for a sufficient time, followed by spreading a vital dye over the gel. Those cells which absorbed the dye or did not absorb the dye could then be distinguiεhed.

As indicated above, cells can be genetically engineered so as to indicate when a εignal has been transduced. There are many receptors for which the genes are known whose expreεεion iε activated. By inεerting an exogenouε gene into a εite where the gene is under the transcriptional control of the promoter responsive to such receptor, an

enzyme can be produced which provides a detectable signal, e.g. a fluorescent signal. The particle aεεociated with the fluoreεcent cell(ε) may then be analyzed for itε reaction hiεtσry.

Libraries and Kits

For convenience, libraries and/or kits may be provided. The libraries would comprise the particles to which a library of products and tags have been added εo aε to allow for screening of the products bound to the bead or the libraries would comprise the products removed from the bead and grouped singly or in a set of 10 to 100 to 1000 members for screening. The kits would provide various reagents for use as tags in carrying out the library syntheses. The kits will usually have at least 4, usually 5, different compounds in separate containers, more usually at least 10, and may comprise at least 10 2 different separated organic compounds, usually not more than about 10 2 , more usually not more than about 36 different compounds. For binary determinations, the mode of detection will usually be common to the compounds associated with the analysis, so that there may be a common chromophore, a common atom for detection, etc. Where each of the identifiers is pre-prepared, each will be characterized by having a distinguishable composition encoding choice and stage which can be determined by a physical measurement and including groups or all of the compounds sharing at least one common functionality.

Alternatively, the kit can provide reactants which can be combined to provide the various identifierε. In thiε εituation, the kit will compriεe a plurality of separated first functional, frequently bifunctional, organic compounds, usually four or more, generally one for each stage of the synthesis, where the functional organic compounds share the same functionality and are distinguiεhable aε to at leaεt one determinable

characteriεtic. In addition, one would have at leaεt one, usually at least two, second organic compounds capable of reacting with a functionality of the functional organic compounds and capable of forming mixtures which are distinguishable as to the amount of each of said second organic compounds. For example, one could have a glycol, amino acid, or a glycolic acid, where the variouε bifunctional compounds are distinguiεhed by the number of fluorine or chlorine atomε preεent, to define εtage, and have an iodomethane, where one iodo ethane haε no radioiεotope, another haε K C and another haε one or more 3 H. By uεing two or more of the iodomethaneε, one could provide a variety of mixtures which could be determined by their radioemisεionε. Alternatively, one could have a plurality of εecond organic compoundε, which could be used in a binary code.

As indicated previously one could react the tags after release with a molecule which allows for detection. In this way the tags could be quite simple, having the same functionality for linking to the particle as to the detectable moiety. For example, by being linked to a hydroxycarboxyl group, an hydroxyl group would be releaεed, which could then be eεterified or etherified with the molecule which allowε for detection. For example, by uεing combinationε of fluoro- and chloroalkyl groupε, in the binary mode, the number of fluoro and/or chloro groupε could determine choice, while the number of carbon atomε would indicate εtage.

Groupε of compoundε of particular interest include linkers joined to a εubεtituted ortho-nitrobenzyloxy group, indanyloxy or fluorenyloxy group, or other group which allows for photolytic or other selective cleavage. The linking group may be an alkylene group of from 2 to 20 carbon atomε, polyalkyleneoxy, particularly alkyleneoxy of from 2 to 3 carbon atoms, cycloalkyl group of from 4 to 8

carbon atoms, haloalkyl group, particularly fluoroalkyl of from 2 to 20 carbon atoms, one or more aromatic rings and the like, where the linker provides for the discrimination between the various groups, by having different numbers of unitε and/or substituents.

Individual particles or a plurality of particles could be provided as articleε of commerce, particularly where the particle(s) have shown a characteriεtic of intereεt. Based on the associated tags, the reaction history may be decoded. The product may then be produced in a large syntheεiε. Where the reaction history unequivocally defines the structure, the same or analogous reaction serieε may be uεed to produce the product in a large batch. Where the reaction history does not unambiguously define the structure, one would repeat the reaction history in a large batch and use the resulting product for structural analysis. In some instances it may be found that the reaction series of th= combinatorial chemistry may not be the preferred way to produce the product in large amounts.

Thus, an embodiment of this invention is a kit comprising a plurality of separated organic compounds, each of the compounds characterized by having a distinguishable composition, encoding at least one bit of different information which can be determined by a physical measurement, and sharing at least one common functionality. A preferred embodiment is a kit comprising at least 4 different functional organic compoundε.

More preferred is a kit wherein said functional organic compounds are of the formula:

F 1 -F 2 -C-E-C' I where F 1 -F 2 is a linker which allowε for attachment to and detachment from a εolid particle; and

C-E-C 7 iε a tag member which can be determined by a phyεical meaεurement, especially wherein said functional organic compounds differ by the number of methylene groups and/or halogens, nitrogens or sulfurs preεent.

Alεo preferred is a kit wherein the C-E-C' portion is removed photochemically or a kit wherein the C-E-C' portion is removed oxidatively, hydrolytically, ther olytically, or reductively.

Compounds of thiε invention may be useful as analgesicε and/or for the treatment of inflammatory diεeaεe, eεpecially in the case of the azotricyclics acting as antagonistε of the meurokin 1/brandykin receptor. Memberε of the benzodiazopine library may be useful as a muscle relaxant and/or tranquilizer and/or as a sedative. Members of the 23 million Mixed Amide Library may be of use in the treatment of hypertension on endothelin antagonists or Raynaud , s syndrcme.

The following exampleε are offered by way of illuεtration and not by way limitation.

In one embodiment the invention is composition compriεing at least 6 different components, each component having a distinguiεhable moiety. The componentε may be characterized by each moiety being εubεtantially chemically stable or inert and having an identifiable characteristic different from each of the other moieties. Each moiety is joined to a linking group having an active functionality capable of forming a covalent bond, through a linking group to individually separable solid surfaces, or joined to a group which is detectable at lesε than 1 nanomole. With a proviso that when the moieties are joined to the linking group, the components are physically segregated. Preferably, the solid supportε are beadε.

In one embodiment each component compriεeε molecules of different compounds bound to individual εeparable solid surfaceε, wherein the moleculeε on the εolid εurfaceε. Preferably, the moietieε of the invention define an homologouε series and/or a series of subεtitutionε on a core molecule.

The invention herein iε also directed to a compound library comprising at least one hundred unique solid supports. In this compound library each solid support has (1) an individual compound bound to the solid support as a major compound bound to the support; and (2) a plurality of tags e.g. tags incapable of being sequenced, where the tagε are individual tag molecules which are physically distinguishable in being physically separable and are substituted so aε to be detectable at leεε than about a nanomole or have a functional group for bonding to a εubεtituent which iε detectable at leεε than about at nanomole. Preferably, in the compound library each εolid εupport haε at leaεt about 6 tagε. In another embodiment, in the compound library the tagε define a binary code encoding the synthetic protocol used for the syntheεizing the compound on the εolid support.

This invention alεo provideε a method for determining a εynthetic protocol encoded by εeparable physically different tags in a series and defining a binary code. In this method at least two tags are employed to define each εtage of the εynthetic protocol, there being at leaεt εix tagε. The εtep of the method comprising εeparating tagε by eanε of their physical differences and detecting the tags to define a binary line encoding the protocol whereby the synthetic protocol is determined in accordance with the binary line.

Compound of this invention may be useful as analgeεicε and/or for the treatment of inflammatory diεeaεe,

eεpecially in the caεe of the azatricyclics acting aε antagoniεtε of the neurokinin 1/brandykin receptor. Memberε of the benzodiazopine library may be uεeful aε a muscle relaxant and/or tranquilizer and/or as a sedative. Members of the 23 Mixed Amide Library may be of uεe in the treatment of hypertenεion on endothelin antagonists or Raynaud's syndrome.

EXAMPLE 1 PEPTIDE LIBRARY

In order to encode up to 10 9 different syntheεeε, one could prepare 30 different identifierε which carry individual tagε capable of being εeparated one from another by capillary GC. For encoding a εmaller number of syntheseε, fewer identifierε would be used. The tags would normally be prepared from commercially-available chemicals as evidenced by the following illustration. ω-Hydroxyalkenes-1, where the number of methylene groups would vary from 1 to 5, wculd be reacted with an iodoperfluoroalkane, where the number of CF 2 groups would be 3, 4, 6, 8, 10, and 12. By employing a free-radical catalyst, the iodoperfluorocarbon would add to the double bond, where the iodo group could then be reduced with hydrogen and a catalyst or a tin hydride. In thiε manner, 30 different tags could be prepared. The chemical procedure is described by Haszeldine and Steele, J. Chem. Soc. (London), 1199 (1953); Brace, J. Fluor. Chem., 20, 313 (1982) . The highly fluorinated tagε can be eaεily detected by electron capture, have different GC retention timeε, εo that they are readily εeparated by capillary GC, are chemically inert due to their fluorinated, hydrocarbon εtructure and each bearε a εingle hydroxyl functional group for direct or indirect attachment to particleε.

Before attachment to compound precursors, the tags (referred to aε T1-T30) would be activated in a way which is appropriate for the chemical intermediates to be used

in the combinatorial εyntheεis. By appropriate it iε intended that a functionality would be added which allowε for ready attachment by a chemical bond to a compound precursor or to the bead matrix itself. The activation process would be applied to each of the 30 different tagε and allow theεe tags to be chemically bound, either directly or indirectly, to intermediates in the combinatorial compound synthesiε. For example, a carboxy derivative could be used for coupling and upon activation the resulting carboxy group would bond to the particle.

In the case of a combinatorial synthesiε of a peptidic compound or other structure made of amide-linked organic fragments, the encoding procesε could conεist of addition of a carboxylic acid-equipped linker. For example, the tag would be coupled to the tert.-butyl ester of o-nitro- E-carboxybenzyl bromide in the presence of sodium hydride. The eεter would then be hydrolyzed in dilute trifluoroacetic acid.

Activated identifierε would be coupled to intermediateε at each εtage in the combinatorial compound εyntheεiε. The ortho-nitrobenzyl ether part of the activated identifierε iε uεed to allow photochemical detachment of the tagε after completing the combinatorial εyntheεiε and εelecting the moεt desirable compounds. The detached tagε would then be decoded uεing capillary GC with electron capture detection to yield a hiεtory of the εynthetic εtageε uεed to prepare the compound selected.

While there is an almost unlimited set of chemical stages and methods which could be used to prepare combinatorial libraries of compounds, we will use coupling of α-amino acids to make a combinatorial library of peptides aε an example of an application of the encoding methodology. In thiε example, we will deεcribe preparation of a library of pentapeptideε having all co binationε of 16 different

amino acidε at each of the five residue positions. Such a library would contain 16 s memberε. To uniquely encode all members of thiε library, 20 detachable tagε (T1-T20) aε deεcribed above would be required.

To prepare the encoded library, we would begin with a large number (>10 6 ) of polymer beadε of the type used for Merrifield solid phase syntheεiε and functionalized by free amino groupε. We would divide the beadε into 16 equal portionε and place a portion in each of 16 different reaction veεεels (one vesεel for each different α-amino acid to be added) . We would then add a small portion (e.g., 1 mol%) of identifiers to each of the amino acid derivatives (e.g., Fmoc amino acidε) to be coupled in the firεt εtage of the combinatorial εynthesis. The specific combination of the tags incorporated into the identifiers added would represent a simple binary code which identifies the amino acid used in the first stage of synthesis. The 16 amino acids added would be indicated by numberε 1-16 and any εuch number could be repreεented chemically by combinationε of the firεt four tagε (T1-T4) . In tableε 2 and 3, a typical encoding scheme is εhown in which the preεence or absence of a tag is indicated by a 1 or a 0, respectively. The letter T may represent either the the tag or the identifier incorporating that tag.

Table 2. A typical encoding scheme.

We would then carry out a standard dicyclohexyl- carbodiimide (DCC) peptide coupling in each of the 16 vessels using the Fmoc amino acids admixed with small amounts of the encoding activated identifiers as indicated above. During the couplings, the amino acids as well aε εmall amountε (e.g., 1%) of the identifierε would become chemically bound to intermediateε attached to the beadε.

Next the beads would be thoroughly mixed and again separated into 16 portions. Each portion would again be placed in a different reaction vessel. A second amino acid admixed with appropriate new activated identifiers (T5-T8) would be added to each vesεel and DCC coupling would be carried out aε before. The particular mixture of the incorporated tagε (T5-T8) would again represent a

εimple binary code for the amino acid added in this, the εecond εtage of the combinatorial εynthesis.

Table 3. A typical encoding scheme.

After the 16 couplings of stage 2 are complete, the beads would be again mixed and then divided into 16 new portionε for the third stage of the synthesis. For the third stage, T9-T12 would be used to encode the third amino acid bound to the beads using the same scheme used for stageε 1 and 2. After the third couplingε, the procedure would be repeated two more timeε uεing the fourth amino acidε with T13-T16 and the fifth amino acidε with T17-T20 to give the entire library of 1,048,576 different peptideε bound to beads.

Although the above beads would be visually indistinguishable, any bead may be chosen (e.g., by

selecting based on the interesting chemical or biological properties of its bound peptide or other target molecule) and its synthetic history may be learned by detaching and decoding the asεociated tagε.

The preciεe method used to detach tags will depend upon the particular linker used to chemically bind it to intermediates in the combinatorial synthesis of the target compound. In the example above, the ortho-nitrobenzyl carbonate linkages, which are known to be unstable to -300 nm light (Ohtsuka, et al., J. Am. Chem. Soc. , 100, 8210 [1978]), would be cleaved by photochemical irradiation of the beads. The tags would then diffuse from the beads into free εolution which would be injected into a capillary gaε chromatograph (GC) equipped with a εenεitive electron capture detector. Since the order in which the tags (T1-T20) emerged from the GC and their retention times under standard conditions were previously determined, the presence or absence of any of T1-T20 would be directly determined by the presence or absence of their peaks in the GC chromatogram. If 1 and 0 represent the presence and absence respectively of peaks corresponding to T1-T20, then the chromatogram can be taken aε a 20-digit binary number which can uniquely repreεent each poεεible εynthesis leading to each member of the peptide library. The uεe of halocarbon tagε which are safe, economical and detectable at εubpicomole levelε by

electron capture detection makes this capillary GC method a particularly convenient encoding scheme for the purpose.

As an example of using the encoding scheme for the pentapeptide library above, a particular bead is irradiated with light to detach the tags, the solubilized labels injected into a capillary GC and the following chromatogram obtained ("Peak" line):

Label 20191817 1615 14 13 12 11 10 9 8 7 6 5 3 2 1 GC Inject

Peak I I I I I I I I I I Binary 1 1 1 1 0 1 0 0 0 0 1 1 0 0 0 1 0 0 1 0

Stage 5 3 2 1

AA Tryptophan Threonine Serine Alanine Valine

The "Label" line diagrams the GC chromatogram where T20-T1 peakε (|) are to be found (note the injection iε given on the right and the chromatogram readε from right to left) . The "Peak" line repreεentε the preεence of labelε (T20-T1) as peaks in the chromatogram. The "Binary" line giveε preεence (1) or abεence (0) of peakε aε a binary number. The "Stage" line breaks up the binary number into the five different parts encoding the five different εtageε in the εynthesiε. Finally, the "AA" line gives the identity of the amino acid which was added in each stage and was given by the binary code in the "Binary" line above.

EXAMPLE 2

RADIO-LABELED TAGS In the next illustration, the tags employed are onomethylethers of linear alkyl-α:,ω-diols. The diol would have N + 2 carbon atoms, where N designates the stage. The methyl group would be a radiolabeled reagent which would have any of a variety of 3 H/ 14 C ratioε from 1/1 to m/1, where m iε the number of choiceε. The double radiolabel allowε for accurate quantitation of the tritium preεent in the tag. By having 10 different alkylene groups and 10 different radioactive label ratios, 10 10 unique ten-member εetε of tagε are generated. Tagε would be attached by firεt reacting them with activating agentε, e.g. phosgene to form a chloroformate, followed by reaction with the F 1 -F 2 component. In this case, F 1 -F 2 is the o-nitro-p-carboxy-benzyl alcohol protected as the t- butyl ester. Each time a εynthetic stage is carried out, the de-esterified identifier is added directly to the bead, which haε covalently bonded amine or hydroxyl groupε, to form amideε or esterε with the acid activated using standard chemistry, e.g., carbodiimide coupling methodology. At the end of the sequential syntheεiε, the beadε are then εcreened with a variety of receptorε or enzy eε to determine a particular characteristic. The beads demonstrating the characteristic may then be isolated, the tags detached and separated by HPLC to give a serieε of glycol monomethyl ethers which may then be analyzed for radioactivity by standard radioisotope

identification methods. For example, if the first and second tags to elute from the HPLC column had 3 H/ 14 C ratios of 5:1 and 7:1 respectively, then the product which εhowed activity had been synthesized by reagent number 5 in stage 1 and reagent number 7 in stage 2.

EXAMPLE 3

2401 Peptide Library The identifiers employed were 2-nitro-4-carboxybenzyl, O-aryl subεtituted ω-hydroxyalkyl carbonate, where alkyl waε of from three to 12 carbon atomε and aryl waε (A) pentachlorophenyl, (B) 2,4 , 6-trichlorophenyl, or (C) 2,6- dichloro-4-fluorophenyl. The tagε are deεignated as NAr, wherein N is the number of methylene groups minus two and Ar is the aryl group. Thuε, tag 2A haε a butylene group bonded to the pentachlorophenyl through oxygen. The εubject tagε can be easily detected using electron capture gaε chromatography at about 100 fmol.

In the εubject analyεiε, the tagging moleculeε are arranged in their GC elution order. Thuε the tag which iε retained the longeεt on the GC column iε designated TI and is aεεociated with the leaεt εignificant bit in the binary εyntheεis code number, the next longest retained tag is called T2 repreεenting the next leaεt εignificant binary bit, and εo on. Using an 0.2mM x 20M methylsilicone capillary GC column, eighteen well-resolved tags were obtained where TI through T18 corresponded to 10A, 9A, 8A,

7A, 6A, 5A, 4A, 3A, 6B, 2A, 5B, IA, 4B, 3B, 2B, IB, 2C, and IC, respectively.

An encoded combinatorial library of 2401 peptides was prepared. Thiε library had the amino acid sequence N- XXXXEEDLGGGG-bead, where the variable X residues were D, E, I, K, L, Q, or S (single letter code). The 4 glycines εerved aε a εpacer between the encoded amino acid sequence and the bead. The combinatorial library included the sequence H 2 N-KLISEEDL, part of the 10 amino acid epitope which is known to be bound by 9E10, a monoclonal antibody directed against the human C-myc gene product. For encoding this library, three binary bits were sufficient to represent the seven alternative reagents for each stage. The code was as follows: 001 = S; 010 = I; 011 = K; 100 = L; 101 = Q; 110 = E; 111 = D.

The library was syntheεized by first preparing the constant segment of the library H 2 NEEDLGGGG-bead on 1.5 g of 50-90μ polystyrene synthesis beads functionalized with 1.1 meq/g of aminomethyl groups using standard solid phase methods based on t.-butyl side-chain protection and Fmoc main chain protection (Stewart and Young, "Solid Phase Peptide Synthesis", 2nd edition, Pierce Chemical Co., 1984) . After deprotecting the Fmoc groups with diethylamine, the beadε were divided into εeven 200 mg fractions and each fraction placed in a different Merrifield syntheεiε vessel mounted on a single wrist-

action shaker. The beads in the seven vessels were processed independently as follows (see Table 3-1) . The letter T in this example refers to the tag or to the identifier incorporating that tag.

TABLE 3-1

In accordance with the above procedure a sufficient amount of the identifiers listed in step 1 were attached via their carboxylic acids using diisopropylcarbodiimide to tag about 1% of the free amino groups on each bead in the corresponding vessel. The remaining free amino groups on each bead were then coupled in step 3 to N-protected amino acid anhydrides. After washing with methylene chloride, isopropanol, and N,N-dimethylformamide, the beadε from the seven vessels were combined and thoroughly mixed. At this point the library had seven members.

After Fmoc deprotection (diethylamine) , the beads were again divided into seven vesselε and processed as before except that in place of the identifiers used previously, identifierε representing the second stage (T4-6) were used. By repeating the procedure two more times, using identifiers T7-9 and then T10-12 analogously, the entire uniquely encoded library of 7 4 =2401 different peptides was prepared uεing only 12 identifierε.

To read the εyntheεis code from a single selected bead, the bead was first washed four times in a small centrifuge tube with 100 μL portions of DMF, and then resuspended in 1 ML of DMF in a Pyrex capillary tube. After 2 hrε of photolyεis with a Rayonet 350 nm light source, the tags released from the bound identifiers were silylated using about 0.1 μL bis-trimethylsilylacetamide and the solution injected into a Hewlett Packard capillary gas chromatograph equipped with an 0.2mM x 20M methylsilicone fused silica capillary column and an electron capture detector. The binary synthesis code of the selected bead was directly determined from the chromatogram of the tags which resulted.

EXAMPLE 4

Benzodiazepine Library A combinatorial benzodiazepine library comprising 30 compounds of the formula VIII

R wherein:

R is CH 3 , CH(CH 3 ) 2 , CH 2 C0 2 H, (CH 2 ) 4 NH 2 , CH 2 C 6 H 4 OH, or CH 2 C 6 H 5 and

R 1 is H , CH 3 , C 2 H 5 , CH 2 CH=CH 2 , or CH 2 C 6 H 5 is constructed per the following scheme .

TTA STEP B DO-

STEP C III

oc

2 = POLYSTYRENE RESIN

STEP D DMF

L)-Z-

E

STEP G

The benzodiazepines VIII are constructed on polystyrene beads similarly to the method of Bunin and Ellman (JACS, 114. 10997-10998 [1992]) except that a photolabile linker is incorporated between the bead and the benzodiazepine (see steps A, B, and C) , thus allowing the benzodiazepine to be removed in step G non-hydrolytically by exposure to U.V. light (350 nm in DMF for 10 minutes to 12 hr) . Additionally, binary codes are introduced in steps D and E which allow for a precise determination of the reaction sequence used to introduce each of the 6 R's and 5 R 1, s. After removal of the tags according to step H and analysis by electron capture detection following GC separation, the nature of the individual R and R 1 groups is determined.

Steps D, E, and F essentially follow the procedure of Bunin and Ellman, but also include the incorporation of identifiers IXa-c in step D and IXd-f in Step E. The identifiers are all represented by Formula IX,

0 Cl

IX

wherein :

IX a indicates n=6;

IX b indicates n=5;

IX C indicates n=4;

IX d indicates n=3;

IX e indicates n=2; and

IX, indicates n=l,

The codes for each of R and R 1 are as follows: Table 4-1

Step A

To a solution of I (1 equiv) in toluene (cone. = 0.5 M) is added the Fmoc protected 2-amino-5-chloro-4'-hydroxy- benzophenone (1.3 eq)and diethylazaodicarboxylate (1.3 eq) and triphenylphosphine (1.3 eq) . The mixture is stirred at room temperature for 24 hr. The solvent is removed in vacuo and the residue triturated with ether and filtered and the solvent again removed in vacuo. The resultant product II is purified by chromatography on silica gel.

Step B

To a solution of II in DCM (0.2 M) stirring at r.t. iε added TFA (3 equiv.) and the solution is allowed to stir for 12 hr. The solution is evaporated to dryness in vacuo and the residue dissolved in DCM, washed once with brine and dried (Na 2 S0 4 ) . Filtration and evaporation of the solvent affords III.

Step C 1% DVB (divinylbenzene) cross-linked polystyrene beads (50μ) functionalized with aminomethyl groups (1.1 mEq g) are suspended in DMF in a peptide reaction vessel (Merrifield vessel) . Ill (2 equiv) and HOBt (3 equiv) in DMF are added and the vessel shaken for 10 min. DIC (3 eq) is added and the vessel is shaken until a negative Ninhydrin test indicates completion of the reaction after 12 hr.

The DMF is removed and the resin washed with additional DMF (x5) and DCM (x5) before drying in vacuo.

Step D

The dry resin is divided into 6 reaction vesselε and is suεpended in DCM. The appropriate combinationε of identifiers IX g . c (see Table 4-1) are added to the flasks and shaken for 1 hr. The Rh(TFA) 2 catalyst (1 mol%) iε added to each flask and shaken for an additional 2 hr. The flasks are drained and the resin washed with DCM (x5) . The resin is then treated with a solution of TFA in DCM (0.01 M) and shaken for 30 min. and then washed again with DCM (x3) followed by DMF (x2) . The resin is treated with a 20% solution of piperidine in DMF and shaken for 30 min. and is then washed with DMF (x3) and DCM (x3).

To each flask is added the appropriate Fmoc protected amino acylfluoride (3 equiv) (when required side-chain functional groups are protected as tert-butyl ester (Asp) , tert-butyl ether (Tyr) or tert-butyloxycarbonyl (Lys) ) with 2,6-di-tert-butyl-4-methylpyridine (10 equiv) and the flasks shaken overnight or until a negative Ninhydrin test is achieved. The resin is washed once (DCM) and then the six batches are combined and washed again (DCM, x5) before drying in vacuo.

The dry resin is divided into five reaction vessels and is suspended in DCM. The appropriate combinations of identifiers IX d.f (see Table 4-1) are added to the flasks and shaken for 1 hr. The Rh(TFA) 2 catalyst (1 mol%) is added to each flask and shaken for an additional 2 hr. The flasks are drained and the resin washed with DCM (x5) . The resin in then treated with a solution of TFA in DCM (0.01 M) and shaken for 30 min. and is then washed with DMF (X3) and DCM (x3).

To each flask is added a solution of 5% acetic acid in DMF and the mixtures are heated to 60°C and shaken overnight. The solvent is drained and then the resin washed with DMF (x5).

Step F

Each batch of resin is suspended in THF and the flasks are cooled to -78"C. To each flask is added a εolution of lithiated 5-(phenylmethyl)-2-oxazolidinone (2 equiv) in THF and the mixtures are shaken at -78°C for 1 hr. The appropriate alkylating agent (Table 4-2) (4 equiv) is then added to each reaction flask followed by a catalytic amount of DMF. The vessels are allowed to warm to ambient temperature and shaken at this temperature for 5 hrε. The solvent iε removed by filtration and the resin washed with THF (xl) and then dried in vacuo. The batches of resin are then combined and washed with THF (x2) and DCM (x2)

and the combined resin is then treated with a 95:5:10 mixture of TFA:water:dimethylsulphide for 2 hrs to remove the side chain protecting groups.

TABLE 4-2

Step G

The resultant benzodiazepine can be cleaved from a bead of polystyrene by suspending the bead in DMF and irradiating with U.V. (350 nm) for 12 hrs.

Step H

A bead of interest is placed into a glasε capillary tube. Into the tube is syringed 1 μL of 1M aqueous cerium (IV) ammonium nitrate (CAN) solution, 1 μL of acetonitrile and 2μL of hexane. The tube is flame sealed and then centrifuged to ensure that the bead is immersed in the reagents. The tube is placed in an ultrasonic bath and sonicated from 1 to 10 hrs preferably from 2 to 6 hrs.

The tube iε cracked open and nl μL of the upper hexane layer is mixed with -0.2 μL of bis(trimethylεilyl)- acetamide (BSA) prior to injection into the GC and each tag member determined using electron capture detection, as exemplified in the following scheme.

Cl

BSA

Cl

EXAMPLE 5

117.649 Peptide Library An encoded library of 117,649 peptides was prepared. This library had the sequence H 2 N-XXXXXXEEDLGGGG-bead, where the variable residue X was D,E,I,K,L,Q or S. This library was encoded using the 18 tags as defined in Example 3 ; three binary bits being εufficient to repreεent the seven amino acids used in each step. The code waε: 001=S; 010=1; 011=K; 100=L; 101=Q; 110=E; and 111=D, where 1 indicates the presence and 0 indicates the absence of a tag.

The constant segment of the library (H 2 NEEDLGGGG-bead) waε synthesized on 1.5 g of 50-80 μ Merrifield polystyrene synthesiε beads functionalized with 1.1 mEq/g of aminomethyl groups using standard εolid phaεe methodε based on t-Bu εidechain protection and Fmoc mainchain protection. After deprotecting the N-terminal Fmoc protecting group with diethylamine, the beadε were divided into seven 200 mg portions, each portion being placed into a different Merrifield synthesiε vessel mounted on a single wrist-action shaker.

The beads in the εeven veεεelε were processed as in Table 3-1 to attach the sets of identifiers (T1-T3) and the corresponding amino acid to each portion except that instead of DIC, i-butylchloroformate was used for activation.

Thiε procedure first chemically attached small amounts of appropriate identifiers via their carboxylic acids to the syntheεis beadε. Thiε attachment waε achieved by activating the linker carboxyl groupε aε mixed carbonic anhydrideε uεing isobutylchloroformate, and then adding an amount of activated identifier corresponding to 1% of the free amino groups attached to the beads. Thus, about 1% of the free amino groups were terminated for each identifier added. The remaining free amino groups were then coupled in the usual way with the corresponding protected amino acids activated as their symmetrical anhydrides.

After washing, the seven portions were combined and the Fmoc protected amino groups were deprotected by treatment with diethylamine. The beadε were again divided into seven portions and processed as before, except that the appropriate identifierε carrying tagε T4 , T5, and T6 were added to the reaction vessels. The procedure of dividing, labelling, coupling the amino acid combining and main-chain deprotection was carried out a total of six times using identifierε bearing tagε Tl- T18, affording an encoded peptide library of 117,649 different memberε.

Typjcal Identifier Preparation

To a solution of 8-bromo-l-octanol (0.91 g, 4.35 mmol) and 2,4, 6-trichlorophenol (1.03 g, 5.22 mmol) in DMF (5 mL) was added cesium carbonate (1.70 g, 5.22 mmol) resulting in the evolution of gas and the precipitation of a white solid. The reaction waε εtirred at 80° C for 2 hrε. The mixture waε diluted with toluene (50 mL) and poured into a εeparatory funnel, washed with 0.5 N NaOH (2x50 mL) , IN HC1 (2x50 mL) and water (50 L) and the organic phase was dried (MgSO.) . Removal of the solvent by evaporation gave 1.24 g (87% yield) of tag as a clear oil.

The above tag (0.81 g, 2.5 mmol) waε added to a 2 M εolution of phosgene in toluene (15 mL) and stirred at room temperature for 1 hr. The excess phosgene and the toluene were removed by evaporation and the resulting crude chlorofor ate waε diεsolved in DCM (5 mL) and pyridine (0.61 L, 7.5 mmol). tert-Butyl 4-hydroxy- methyl-3-nitrobenzoate (Barany and Albericio, J. Am. Chem. Soc, (1985), 107. 4936-4942) (0.5 g, 1.98 mmol) was added and the reaction mixture stirred at room temperature for 3 hrs. The solution was diluted with ethyl acetate (75 L) and poured into a separatory funnel. After washing with IN HC1 (3x35 L) , saturated NaHC0 3 (2x35 L) and brine (35 L) , the organic phase was dried (MgSO ) . The εolvent waε removed by evaporation and the residue purified by chromatography on silica gel (5% to 7.5% ethyl acetate in

petroleum ether) affording 0.95 g (79% yield) of the identifier tert-butyl ester as a clear oil.

Trifluoroacetic acid (3 mL) was added to a solution of the identifier tert-butyl ester (0.95 g, 1.57 mmol) in DCM (30 mL) to deprotect the linker acid (i.e., F 1 -F 2 of Formula I) and the εolution waε stirred at room temperature for 7 hrε. The mixture waε then evaporated to dryneεs and the residue redissolved in DCM (30 mL) . The εolution was washed with brine (20 mL) and the organic phase dried (MgSO . Removal of the solvent by evaporation gave 0.75 g (87% yield) of the identifier (6B) as a pale yellow solid. (Tag nomenclature is the same as in Example 3) .

Typical Encoded Library Svnthesiε Step

Ncf-Fmoc-E(tBu) -E(tBu)-D(tBu)-L-G4-NH-resin waε suspended in DMF (20 mL) and shaken for 2 min. After filtering, 1:1 diethylaminerDMF (40 mL) was added to remove the Fmoc protecting groups and the resin was shaken for 1 hr. The reεin waε εeparated by filtration and waεhed with DMF

(2x20 L, 2 min each) ; 2:1 dioxane: water (2x20 L, 5 min each) , DMF (3x20 mL, 2 min each) , DCM (3 x 20 mL, 2 min each) then dried in vacuo at 25° C. (The resin waε found to have 0.4 mmol/g amino groups by picric acid titration at this stage.)

150 mg Portions of the resin were placed into seven Merrifield vesεels and suspended in DCM (5 mL) . The

appropriate identifiers were activated as their acyl carbonates as follows (for the firεt coupling): TI (6.6 mg, 0.0098 mmol) waε dissolved in anhydrous ether (2 L) and pyridine (10 μL) was added. Isobutyl chloroformate (1.3 μL, 0.0096 mmol) waε added as a solution in anhydrous ether (0.1 L) . The resulting mixture was stirred at 25° C for 1 hr. during which time a fine white precipitate formed. The stirring was stopped and the precipitate was allowed to εettle for 30 min. Solutions of the acylcarbonates of T2 and T3 were prepared in the εame way. Aliquotε (0.25 mL) of the εupernatant solution of activated identifierε were mixed to give the appropriate 3-bit binary tag codes and the appropriate coding mixtures of identifierε were added to each of the εeven εyntheεiε veεεelε. The veεsels were εhaken in the dark for 12 hrs, and then each was washed with DCM (4x10 L, 2 min each) .

A solution of the symmetrical anhydride of an Nα-Fmoc amino acid in DCM (3 equivalents in 10 mL) was then added to the corresponding coded batch of resin and shaken for 20 min. 5% N,N-diisopropylethylamine in DCM (l mL) waε added and the mixture εhaken until the reεin gave a negative Kaiεer teεt.

The reεin batcheε were filtered and combined, and then washed with DCM (4x20 mL, 2 min each) , iεopropanol (2x20 mL, 2 min each) , DCM (4x20 L, 2 min each) . The next cycle of labelling/coupling waε initiated by Fmoc deprotection aε deεcribed above.

After Fmoc deprotection of the residues in the last position of the peptide, the side chain functionality was deprotected by suspending the resin in DCM (10 mL) , adding thioanisole (2 L) , ethanedithiol (0.5 L) and tri- fluoroacetic acid (10 mL) then shaking for 1 hr at 25° c The resin was then washed with DCM (6x20 mL, 2 min each) and dried.

Electron Capture Gas Chromatography Reading of Code A single, selected bead waε placed in a Pyrex capillary tube and waεhed with DMF (5x10 μL) . The bead was then suεpended in DMF (1 μL) and the capillary waε εealed. The suspended bead was irradiated at 366 nm for 3 hrs to release the tag alcohols, and the capillary tube subsequently placed in a sand bath at 90° c for 2 hrs. The tube was opened and bis-trimethylsilyl acetamide (0.1 mL) was added to trimethylsilylate the tag alcohols. After centrifuging for 2 min. , the tag solution above the bead (1 μL) was injected directly into an electron capture detection, capillary gas chromatograph for analysiε. Gas chromatography was performed using a Hewlett Packard Series II Model 5890 gas chromatograph equipped with a 0.2 mmx20 m methylsilicone fused silica capillary column and an electron capture detector. Photolysis reactions were performed uεing a UVP "Black Ray" model UVL 56 hand-held 366 nm lamp.

Antibodv Affinity Methods

The anti-C-myc peptide monoclonal antibody 9E10 was prepared from ascites fluid as described in Evans et al. , Mol. Cell Biol., 5, 3610-3616 (1985) and Munro and Pelha , Cell, 48., 899-907 (1987). To test beads for binding to 9E10, beads were incubated in TBST [20 mM Tris-HCl (pH 7.5), 500 mM NaCl and 0.05% Tween-20] containing 1% bovine serum albumin (BSA) to block non-specific protein binding εites. The beads were then centrifuged, resuspended in a 1:200 dilution of 9E10 asciteε fluid in TBST + 1% BSA and incubated overnight at 4°C. Beadε were εubsequently washed three times in TBST and incubated for 90 min. at room temperature in alkaline phosphataεe-coupled goat antimouse IgG antibodies (Bio-Rad Laboratories) , diluted 1:3000 in TBST + 1% BSA. After washing the beads twice in TBST and once in phosphataεe buffer (100 mM Tris-HCl, pH 9.5, 100 mM NaCl and 5 mM MgCl 2 ) , the beads were incubated 1 hr at room temperature in phosphatase buffer containing one one-hundreth part each of AP Color Reagentε A & B (Bio-Rad Laboratorieε) . To stop the reaction, the beads were washed twice in 20 mM sodium EDTA, pH 7.4. Solution phase affinities between 9E10 and various peptides were determined by a modification of the competitive ELISA assay described by Harlow et al. , Antibodies: a Laboratory Manual, 570-573, Cold Spring Harbor Press, Cold Spring Harbor, N.Y.

From a 30 mg sample of the combinatorial library of peptides, 40 individual beads were identified which stained on exposure to the anti-C-myc monoclonal antibody. Decoding of these positive-reacting beads established the ligand'ε reaction εequence as the myc epitope (EQKLISEEDL) or sequences that differed by one or two substituents among the three N-terminal residues.

EXAMPLE 6 23.540,625 Mixed Amide Library

The encoding technique was teεted further by the preparation of a combinatorial library of 23,540,625 memberε conεisting of peptides and other amide compounds.

The synthesiε was carried out using 15 different reagents in 5 steps and- 31 different reagents in the sixth εtep. Four identifiers were used to encode each of the 5 steps with 15 reagents and five identifiers were used in the final εtep with 31 reagentε. A label εet of 25 identifierε waε therefore prepared. 2-Nitro-4- carboxybenzyl, O-aryl substituted ω-hydroxyalkyl carbonate identifiers were employed, where the tag components were comprised of an alkyl moiety of from 3 to 12 carbon atoms and the aryl moieties were (A) pentachlorophenyl, (B) 2,4,5-trichlorophenyl, (C) 2,4, 6-trichlorophenyl, or (D) 2,6-dichloro-4-fluorophenyl. A set of 25 tags was prepared using appropriate alkyl chains lengths with A, B, C or D, separable using a 0.2 mMx25M methylεilicone GC

colu n. The chemical compositionε of tagε T1-T25 (where TI represents the tag with the longest retention time, and T25 the tag with the shortest retention time) are summarized below:

The designationε 10A, 9A, etc. are aε described in Example 3.

The fifteen reagents used in the first five stageε and the code identifying them are represented below where 1 represents the presence of tag and O the absence thereof.

10

15

20

The 31 reagents and the code representing them in the sixth stage are repreεented below:

REAGENT CODE

L-serine (00001)

D-serine (00010)

L-glutamic acid (00011)

D-glutamic acid (00100)

L-glutamine (00101)

D-glutamine (00110)

L-lyεine (00111)

D-lysine (01000)

L-proline (01001)

D-proline (01010)

L-phenylalanine (01011)

D-phenylalanine (01100)

3-amino-benzoic acid (01101)

4-aminophenyl acetic acid (OHIO)

3,5-diamino-benzoic acid (01111)

Succinic Anhydride (10000)

Tiglic acid (10001)

2-pyrazine carboxylic acid (10010)

A spacer of six glycine units was prepared on the beads uεing εtandard methods. The variable region was constructed using butyl sidechain protection, and amino groups were protected as Fmoc derivativeε. Amide bondε were formed by activation of the carboxylic acid with DIC and HOBt.

EXAMPLE 7

Hetero-Diels-Alder Library A combinatorial hetero Diels-Alder library comprising 42 compounds of the formula:

wherein;

R 1 is H, CH 3 0, F 3 C, F 3 CO, H 5 C 6 0, or C 6 K *t ;

R 2 is H, CH 3 , or CH 3 0;

R 3 is H (when n=2), or CH 3 (when n=l) ; and

X. S = H or Cl or Ar

Ar =

HO X ΩJD

waε conεtructed per the following εcheme :

TFA STEP B DCM

STEP C III

IV

1) To

2) Ident

1) Identifiers x, f.g

BF 3 . Et 2 0 DCH

STEP E

NO

The azatricyclic productε (VI) were constructed on polyεtyrene beads and were linked to the beads by a photocleavable linker allowing the azatricycle (VII) to be removed from the bead by exposure to U.V. light (350 nm in DMF). The binary codes introduced in stepε C,D and E allow a unique determination of the reaction sequence used to introduce ArR, R 1 , R 2 and R 3 . The encoding tags were removed according to step G and analyzed by electron capture detection following GC separation.

The identifiers uεed in thiε scheme are represented by the formula X:

Wherein; X. indicates n=10

X b indicates n=9

X c indicates n=8 ^ indicates n=7

X e. indicateε n=6

X f indicateε n=5

X indicateε n=4

The codeε for each of R, R 1 , R 2 , R 3 are as follows:

TABLE 7-1

R 1 =H R 2 =H

R=H R=CH 7

d,c R 1 =OCH 3 R 2 =0CH 3

R 1 =CF, R 2 =H

e,c R 1 =C 6 H 5 0 R 2 =H

e,d R 1 =F 3 C0 R 2 =H

e,d,c R^H,, R 2 =H

R 3 =CH, n=l

R 3 =H n=2

Step A To a solution of I (2.03 g, 8 mmol), 4-hydroxybenzaldehyde

(1.17 g, 9.6 mmol) and triphenylphosphine (2.73 g, 10.4 mmol) in toluene (20 mL) stirring at 0°C was added over a period of 30 minuteε diethylazodicarboxylate. The εolution waε allowed to warm and εtirred for 1 hour once ambient temperature had been reached. The εolution waε concentrated by removal of approximately half of the εolvent in vacuo and waε then triturated with ether. The mixture waε then filtered and the reεidue waε waεhed thoroughly with ether. The solvent was removed jln vacuo and the reεidue was purified by chromatography on silica gel (15% ethyl acetate in hexane) affording 1.3 g of the ether IIa (47% yield).

2-chloro-4-hydroxybenzaldehyde and 2-hydroxy-l- naphthaldehyde were coupled to I in analogous fashion affording ethers lib and c in yields of 91% and 67%, respectively.

Step B

To a solution of ether Ila (0.407 g, 1.14 mmol) in DCM (20 L) stirring at room temperature was added TFA (8 L) . The εolution waε allowed to εtir for 6 hrε. The εolution waε evaporated to dryneεs in vacuo affording 0.343 g of acid Ilia (100% yield) . Ethers lib and lie were deprotected analogously affording acids Illb and c in yields of 92% and 100% respectively.

Step C

Into a peptide reaction veεεel (Merrifield veεεel) were measured 1% DVB (divinylbenzene) crosε-linked polyεtyrene beadε (50-80μ) functionalized with aminomethyl groups (l.l meq/g) (200 mg of resin) . The resin was εuspended in DMF (2 mL) and shaken for 20 min. The acid Ilia (38 mg, 2 equiv.), 1-hydroxybenzotriazole (40 mg, 2 equiv) and diiεopropylcarbodiimide (38 mg, 2 equiv) were added and the mixture εhaken until a negative Ninhydrin teεt waε achieved (22 hr) . The εolution waε removed by filtration and the resin was waεhed with DCM (8x 10 mL) .

The resin was resuspended in DCM (5 mL) , identifier Xa (15 mg) waε added and the flaεk was shaken for 1 hr. Rh(TFA) 2

-Ill-

catalyst (1 mol%) was added and the flaεks εhaken for 2 hrε. The εolvent waε removed by filtration and the resin resuεpended in DCM (5 mL) . Trifluoroacetic acid (1 drop) waε added and the vessel shaken for 20 min. The solvent was removed by filtration, and the resin was washed with DCM (8x 10 mL) .

In an analogous fashion, acids Illb and IIIc were attached to the resin and were encoded with the appropriate identifiers, i.e., Xb for acid Illb and Xa and Xb for acid IIIc. The three batches of resin were combined, mixed, washed, and dried.

Step D The dry resin was divided into 7 equal portions (87 mg) which were put into seven peptide reaction vesεelε (Merrifield veεεels) which were wrapped with heat tape. The resin in each vesεel was suεpended in toluene (10 mL) and shaken for 20 min. An appropriate amount of one aniline was then added to each flask (see Table 7-2) .

TABLE 7-2

The heating tape waε connected and the reaction mixtureε εhaken at 70"C for 18 hrε. The heat tape was disconnected and the solvent was removed by filtration and each batch of resin was washed with dry DCM (4x 10 mL) , ether (10 mL) , toluene (10 mL) and DCM (2x 10 mL) . Each of the portions was then εuspended in DCM (5 mL) and to each flask was added the appropriate identifier or combination of identifierε (Xc-e) (15 mg) (see Table 7-1) . The flasks were shaken for 1 hr. and then Rh(TFA) 2 (1 mol%) was added to each flask and shaking continued for 2 hrs.

The solvent was then removed and each batch of resin waε re-εuspended in DCM (5 mL) to which waε added TFA (1 drop) . This mixture was shaken for 20 min. , then the solvent was removed by filtration. The batches of resin were then washed (DCM, lx 10 mL) and combined, washed again with DCM (3x 10 mL) and then dried thoroughly in vacuo.

Step E

The dried resin was divided into two equal portions (0.3 g) and each was placed in a peptide reaction vesεel. The reεin batches were washed with DCM (2x 10 mL) and then resuspended in DCM (5 mL) . To one flask was added the identifier Xf (15 mg) and to the other was added Xg (15 mg) . The flaεks were shaken for 1 hr. prior to the

addition of Rh(TFA) 2 catalyst (1 mol%) . The flaεks were εhaken for 2 hrε. and then the εolvent waε removed by filtration. Each batch of reεin waε washed with DCM (3x 10 mL) , and each was then resuspended in DCM (5 mL) .

The appropriate enol ether (1 mL) (see Table 7-1) waε added to the flaskε and the vessels shaken for 30 min. To each flask waε added a solution of BF 3 -OEt 2 (0.5 mL of a 5% solution in DCM) and the flasks were εhaken for 24 hrε. Removal of the εolvent by filtration waε followed by washing of the resin with DCM (10 mL) and the resin waε then combined. The beads were then washed further with DCM (5x 10 mL) , DMF (2x 10 mL) methanol (2x 10 mL) and DCM (2x 10 mL) . The resin was then dried thoroughly in vacuo.

Step F

To confirm the identity of the products produced in the Hetero-Diels-Alder library one example was completed on a large scale to allow confirmation of the structure by spectroεcopic meanε. The procedure followed waε essentially the same method aε described for the combinatorial library. In step A 4-hydroxybenzaldehyde was coupled to the photolabile group. In step D, aniline was condenεed with the aldehyde. In εtep E, the enol ether was formed with 4 ,5-dihydro-2-methylfuran.

The photolysiε of the compound (step F) was performed by suεpending 100 mg of the beads in DMF (0.3 mL) and irradiating the beads with UVP "Black Ray" model UVL 56 hand-held 366 nm lamp for 16 hrs. The DMF was removed to one side by pipette and the beads rinsed with additional DMF (2x 3 mL) . The original solution and the washingε were combined and the εolvent removed n vacuo. NMR analyεiε of the reaction mixture εhowed it to contain the desired azatricycle by comparison to the authentic sample.

Step G

A bead of interest was placed into a pyrex glasε capillary tube sealed at one end. A solution (1 μL) of 1M aqueous cerium (IV) ammonium nitrate and acetonitrile (1:1) was syringed into the tube, and the tube waε then centrifuged εo that the bead lay on the bottom of the capillary and waε completely immerεed by the reagent solution. Hexane (2 μL) was added by syringe and the tube was again centrifuged. The open end of the capillary was flame- sealed and placed in an ultrasonic bath for 4 hrs. The capillary was then placed inverted into a centrifuge and spun such that the aqueous layer was forced through the hexane layer to the bottom of the tube. This extraction process was repeated 3 or 4 times and the tube was then opened. The hexane layer (1.5 μL) was removed by syringe and placed into a different capillary containing BSA (0.2 μL) . This tube was sealed and centrifuged until the reagents were thoroughly mixed. A portion of the solution (ca. 1 μL) was removed and injected into a gas chromatography machine with a 25M x 0.2 mM methylsilicone fused silica column with electron capture detection for separation and interpretation of the tag molecules.

The sample was injected onto the GC column at 200°C and 25 psi of carrier gas (He 2 ) . After 1 minute the temperature was increased at a rate of 20°C per minute to 320°C, and the pressure was increased at a rate of 2 psi per minute to 40 psi. These conditions are shown in the following diagram:

GC CONDITIONS

TEMPERATURE

320 C

20 C per tin

200 C

1 min

PEE55UEE 40 psi

2 psi per min

25 psi

1 min

The following resultε were obtained with four randomly εelected beadε :

Bead 1

Bead 3

EXAMPLE 8

Benzodiazepine Library

Following the procedure of Example 4 , a combinatorial library is constructed of the Formula X

R

wherein

R is a radical of a naturally occurring D or L amino acid;

R 1 is H, C 1 -C 6 alkyl, lower alkenyl, C,-^ alkylamine, carboxy C 1 -C 6 alkyl, or phenyl C 1 -C 6 alkyl wherein the phenyl is optionally subεtituted by lower alkyl, F, Cl,

Br, OH, NH 2 , C0 2 H, or O-lower alkyl;

R 2 iε H or CO-H;

R 3 iε H or OH;

R 4 iε H or Cl; with the provisos that when R 3 is OH, R 2 is H and when R 2 is carboxy, R 3 is H.

Thiε library is released from a plurality of encoded beads of the general formula

R

wherein

IX n is a plurality of identifierε of the Formula la wherein said plurality representε an encoded scheme;

S is a substrate;

F 1, -F 2 is the residue of the linker member of Formula la; and

R, R 1 , R 2 , and R 4 are as defined for Formula X.

EXAMFLE 9

Typical Identifier Preparations The diazo compound identifierε which are attached to the reεin via carbene formation are prepared as exemplified.

Compounds of the general formula

wherein n is O-10 and

Ar iε pentachlorophenol, 2,4,6-trichlorophenol,

2,4,5-trichlorophenol, or2, 6-dichloro-4-fluorophenol are prepared aε follows.

To a solution of l-hydroxy-4-(2, 6-dichloro-4-fluoro- phenoxy)butane (0.38 g, 1.5 mmol), methyl isovanillate (0.228 g, 1.5 mmol) and triphenylphosphine (0.393 g, 1.5 mmol) in THF (8 L) waε added diethylazodicarboxylate (0.287 g, 1.7 mmol). The εolution εtirred at r.t. for 36 hrs. The solvent was removed in vacuo and the residue purified by chromatography on silica gel (with a mixture of 20% ethyl acetate and 80% petroleum ether) affording 0.45 g of the aldehyde (77% yield).

The aldehyde (100 mg, 0.26 mmol) was dissolved in acetone

(8 mL) and was treated with a solution of KMn0 4 (61 mg, 0.39 mmol) in acetone (4 mL) and water (4 mL) . The reaction stirred at room temperature for 13 hrs. The mixture was diluted with ethyl acetate (100 mL) and water

(50 mL) and the layers were separated. The aqueous layer was extracted with additional ethyl acetate (2x 100 mL) . The combined organic layers were washed with water (50 mL) and dried (MgSO . Removal of the solvent afforded 109 mg of the benzoic acid (93% yield) .

A solution of the acid (76 mg, 0.188 mmol) in methylene chloride (2 mL) was treated with oxalylchloride (36 mg, 0.28 mmol) and catalytic DMF. After stirring for 10 min at room temperature slow but steady evolution of gas was observed. Stirring continued for 2 hrs. when the solution was diluted with DCM (15 L) and washed with saturated aqueous sodium hydrogencarbonate solution (5 mL) . The layers were separated. The organic layer waε dried (Na 2 SO and the solvent evaporated affording the benzoyl chloride as pale yellow crystalε.

The benzoyl chloride waε dissolved in methylene chloride (5 mL) and was added to a stirring solution of diazomethane in ether at -78°C. The cold bath waε allowed to warm up and the mixture allowed to stir for 5 hrs at room temperature. The solventε and exceεε diazomethane were removed in vacuo and the residue purified by chromatography on silica gel uεing gradient elution method where the concentration of ethyl acetate ranged from 10% to 40 % in hexanes affording 48 mg of the diazo compound (60% yield) .

Compounds of the general formula:

0

wherein; n is 0-10 and

Ar is pentachlorophenol, 2,4,6-trichlorophenol, 2,4,5- trichlorophenol, or 2,6-dichloro-4-fluorophenol are prepared as follows.

Methyl vanillate (0.729 g, 4.0 mmole), l-hydroxy-9- (2,3,4,5, 6-pentachlorophenoxy)nonane (1.634 g, 4.0 mmole) and triphenylphosphine (1.259 g, 4.8 mmole) were dissolved in 20 mL dry toluene under argon. DEAD (0.76 mL, 0.836 g, 4.8 mmole) was added dropwise, and the mixture was stirred at 25"C for one hour. The solution was concentrated to half volume and purified by flash chromatography eluting with DCM to give 1.0 g (1.7 mmole, 43%) of the product as a white crystalline solid.

The methyl ester above (1.0 g, 1.7 mmole) was disεolved in 50 mL THF, 2 mL water waε added followed by lithium hydroxide (1.2 g, 50 mmole). The mixture waε εtirred at 25 * C for one hour then refluxed for five hourε. After cooling to 25"C the mixture was poured onto ethyl acetate (200 mL) and the solution was washed with 1 M HC1 (50 mL x3) then sat. aq. NaCl (lx 50 mL) and dried over sodium

εulfate. The εolvent was removed and the crude acid azeotroped once with toluene.

The crude material above was dissolved in 100 mL toluene, 10 L (1.63 g, 14 mmole) thionyl chloride was added, and the mixture was refluxed for 90 min. The volume of the solution was reduced to approximately 30 mL by distillation, then the remaining toluene removed by evaporation. The crude acid chloride was dissolved in 20 mL dry DCM and cooled to -78°C under argon and a solution of approximately 10 mmole diazomethane in 50 L anhydrous ether waε added. The mixture waε warmed to room temperature and εtirred for 90 min. Argon waε bubbled through the εolution for 10 min. then the εolvents were removed by evaporation and the crude material was purified by flash chromatography eluting with 10-20% ethyl acetate in hexane. The diazoketone (0.85 g, 1.4 mmole, 82% over three steps) was obtained as a pale yellow solid.

The following identifiers have been prepared as described above:

Photolabile Cleavage

50 Identifierε were prepared of the formula:

and n iε 1,2,3,4,5,6,7,8,9, and 10.

Oxidative Cleavage Type I

7 Identifierε were prepared of the formula

wherein:

and n iε 4,5,6,7,8,9, and 10.

Oxidative Cleavage Type II

13 Identifierε were prepared of the formula

wherein: Ar is

and n iε 1,2,3,4,5,6,7,8,9,10; and wherein: 3Ar iε

and n iε 0,3, and 9,

It is evident from the above description that the subject invention provideε a verεatile, εimple method for identifying compounds, where the amount of compound present precludes any assurance of the ability to obtain an accurate determination of its reaction history. The method allowε for the production of extraordinarily large numbers of different products, which can be used in various screening techniques to determine biological or other activity of interest. The use of tags which are chemically inert under the procesε conditionε allowε for great versatility in a variety of environments produced by the various synthetic techniques employed for producing the products of interest. The tags can be readily synthesized and permit accurate analysiε, so as to accurately define the nature of the composition.

All publications and patent applications cited in this specification are herein incorporated by reference as if each individual publication or patent application were specifically and individually indicated to be incorporated by reference.

Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be readily apparent to those of ordinary skill in the art in light of the teachings of this invention that certain changes and modifications may be made thereto without departing from the spirit or scope of the appended claims.