GENETICALLY ENCODED BIOSENSORS FOR DETECTION OF POLYKETIDES

Title:

GENETICALLY ENCODED BIOSENSORS FOR DETECTION OF POLYKETIDES

Document Type and Number:

WIPO Patent Application WO/2017/196983

Kind Code:

A1

Abstract:

The present disclosure relates to high-throughput detection of polyketides using genetically encoded biosensors.

Inventors:

WILLIAMS GAVIN J (US)
KASEY CHRISTIAN (US)
LI YIWEI (US)

Application Number:

PCT/US2017/031962

Publication Date:

November 16, 2017

Filing Date:

May 10, 2017

Export Citation:

Click for automatic bibliography generation Help

Assignee:

UNIV NORTH CAROLINA STATE (US)

International Classes:

C12N1/21; C12N15/09; C12N15/63; C12Q1/68

Foreign References:

US20040209270A1

2004-10-21

Other References:

FENG T. ET AL.: "Insights into resistance mechanism of the macrolide biosensor protein MphR(A) binding to macrolide antibiotic erythromycin by molecular dynamics simulation", J COMPUT AIDED MOL DES, 2015, pages 1 - 14, XP035913014
BRAKHAGE A. A. ET AL.: "Use of Reporter Genes to identify recessive trans-acting mutations specifically involved in the regulation of Aspergillus nidulans penicillin biosynthesis genes", JOURNAL OF BACTERIOLOGY, vol. 177, no. 10, 1995, pages 2781 - 2788, XP002956425
FU YU ET AL.: "Study of Transcriptional Regulation Using a Reporter Gene Assay", METHODS IN MOLECULAR BIOLOGY, vol. 313, no. 22, 2006, pages 257 - 264

Attorney, Agent or Firm:

PRATHER, Donald M. et al. (US)

Download PDF:

View/Download PDF PDF Help

Claims:

CLAIMS

We claim:

1. A biosensor system comprising:

a nucleic acid encoding a genetically modified MphR gene sequence, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and

a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor.

2. The biosensor system of claim 1, wherein the nucleic acid encoding the genetically modified MphR gene sequence and the reporter gene are located on one recombinant DNA vector.

3. The biosensor system of claim 1 or 2, wherein the reporter gene is a gene coding for chloramphenicol acetyltransferase, beta-galactosidase, luciferase or green fluorescent protein (GFP).

4. The biosensor system of claim 3, wherein the reporter gene is a gene coding for green fluorescent protein (GFP).

5. The biosensor system of any one of claims 1 to 4, wherein the mutation confers improved sensitivity for detecting erythromycin A.

6. The biosensor system of any one of claims 1 to 4, wherein the mutation confers improved selectivity for detecting erythromycin A in comparison to other polyketides.

7. The biosensor system of any one of claims 1 to 4, wherein the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence.

8. The biosensor system of any one of claims 1 to 4, wherein the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from A1G, AIT, AIC, G2T, G2A, A3C, A3G, A4T, G5T, G6T, or a combination thereof.

9. The biosensor system of any one of claims 1 to 4, wherein the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from A1G, A4T, or a combination thereof.

10. A genetically modified host cell comprising:

a nucleic acid encoding a genetically modified MphR gene sequence, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor.

11. The cell of claim 10, wherein the nucleic acid encoding the genetically modified MphR gene sequence and the reporter gene are located on one recombinant DNA vector.

12. The cell of claim 10 or 1 1, wherein the reporter gene is a gene coding for chloramphenicol acetyltransferase, beta-galactosidase, luciferase or green fluorescent protein (GFP).

13. The cell of claim 12, wherein the reporter gene is a gene coding for green fluorescent protein (GFP).

14. The cell of any one of claims 10 to 13, wherein the mutation confers improved sensitivity for detecting erythromycin A.

15. The cell of any one of claims 10 to 13, wherein the mutation confers improved selectivity for detecting erythromycin A in comparison to other polyketides.

16. The cell of any one of claims 10 to 13, wherein the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence.

17. The cell of any one of claims 10 to 13, wherein the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from A1G, AIT, AIC, G2T, G2A, A3C, A3G, A4T, G5T, G6T, or a combination thereof.

18. The cell of any one of claims 10 to 13, wherein the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from A1G, A4T, or a combination thereof.

19. The cell of any one of claims 10 to 18, wherein the cell is E. coli.

20. The cell of any one of claims 10 to 18, wherein the cell is Streptomyces.

21. A method for detecting a polyketide, comprising:

introducing into a cell:

i . a nucleic acid encoding a genetically modified MphR gene sequence, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild- type MphR gene sequence; and

is . a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor;

and

detecting the polyketide based on the differential expression of the reporter gene in comparison to a cell comprising a wild-type MphR gene sequence.

22. The method of claim 21, wherein the nucleic acid encoding the genetically modified MphR gene sequence and the reporter gene are located on one recombinant DNA vector.

23. The method of claim 21 or 22, wherein the reporter gene is a gene coding for chloramphenicol acetyltransferase, beta-galactosidase, luciferase or green fluorescent protein (GFP).

24. The method of claim 23, wherein the reporter gene is a gene coding for green fluorescent protein (GFP).

25. The method of any one of claims 21 to 24, wherein the polyketide is a 12-membered or 14- membered macrolide.

26. The method of any one of claims 21 to 24, wherein the mutation confers improved sensitivity for detecting erythromycin A.

27. The method of any one of claims 21 to 24, wherein the mutation confers improved selectivity for detecting erythromycin A in comparison to other polyketides.

28. The method of any one of claims 21 to 24, wherein the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence.

29. The method of any one of claims 21 to 24, wherein the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from A1G, AIT, AIC, G2T, G2A, A3C, A3G, A4T, G5T, G6T, or a combination thereof.

30. The method of any one of claims 21 to 24, wherein the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from A1G, A4T, or a combination thereof.

31. The method of any one of claims 21 to 30, wherein the cell is E. coli.

32. The method of any one of claims 21 to 30, wherein the cell is Streptomyces.

33. A method of screening for genetic mutations in a target gene, comprising:

introducing into a cell: i . a nucleic acid encoding a genetically modified MphR gene sequence, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild- type MphR gene sequence; and

ii. a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor;

introducing at least one mutation into a target gene; and

identifying a cell comprising the target gene mutation based on the differential expression of the reporter gene in comparison to a cell comprising the wild-type target gene.

34. The method of claim 33, wherein the nucleic acid encoding the genetically modified MphR gene sequence and the reporter gene are located on one recombinant DNA vector.

35. The method of claim 33 or 34, wherein the reporter gene is a gene coding for chloramphenicol acetyltransferase, beta-galactosidase, luciferase or green fluorescent protein (GFP).

36. The method of claim 35, wherein the reporter gene is a gene coding for green fluorescent protein (GFP).

37. The method of any one of claims 33 to 36, wherein the mutation confers improved sensitivity for detecting erythromycin A.

38. The method of any one of claims 33 to 36, wherein the mutation confers improved selectivity for detecting erythromycin A in comparison to other polyketides.

39. The method of any one of claims 33 to 36, wherein the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence.

40. The method of any one of claims 33 to 36, wherein the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from A1G, AIT, AIC, G2T, G2A, A3C, A3G, A4T, G5T, G6T, or a combination thereof.

41. The method of any one of claims 33 to 36, wherein the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from A1G, A4T, or a combination thereof.

42. The method of any one of claims 33 to 41, wherein the cell is E. coli.

43. The method of any one of claims 33 to 41, wherein the cell is Streptomyces.

44. The method of any one of claims 33 to 43, wherein the target gene encodes an O- methyltransferase.

45. The method of any one of claims 33 to 43, wherein the target gene encodes a glycosyltransferase.

Description:

GENETICALLY ENCODED BIOSENSORS FOR DETECTION OF POL YKE TIDE S

CROSS REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Patent Application Serial No. 62/334,204 filed May 10, 2016, the disclosure of which is expressly incorporated herein by reference.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH

This invention was made with Government Support under Grant No. GM104258 awarded by the National Institutes of Health. The Government has certain rights to the invention.

FIELD

The present disclosure relates to high-throughput detection of polyketides using genetically encoded biosensors. BACKGROUND

Polyketides are a large group of diverse molecules that display broad and potent biological activities. Access to large quantities of polyketides and analogues thereof is critical for the discovery of new biological activities, optimization of pharmacological properties, and to probe discovery and development. Biosynthetic approaches to polyketide production offer enormous potential and several benefits compared to traditional chemical approaches. The scaffolds of many polyketides are constructed by type I polyketide synthases (PKSs). These are large multifunctional protein complexes organized in a modular fashion. Each module is responsible for the selection and installation of a ketide into the polyketide. The number, identity, and order of modules describe the structure of the corresponding polyketide. These scaffolds are often further elaborated by tailoring enzymes to afford the mature, biologically active natural product. Accordingly, these systems offer the potential for the synthesis of large quantities of polyketides via microbial fermentation and combinatorial synthesis of analogues by mixing and matching modules and tailoring enzymes. However, the sheer size, mechanistic diversity, and poor understanding of how specificity and catalysis are controlled by type I PKSs render rational design of new pathways difficult. For example, many hybrid PKSs designed to produce polyketide analogues fail or are less active than wild-type machinery. Consequently, the full synthetic potential of type I PKSs has yet to be realized. Synthetic biology and directed evolution offer an opportunity to overcome these challenges by testing the functions of large libraries of variants. Yet, the ability of synthetic biology and directed evolution approaches to be applied to polyketides is extremely limited because there are no generally applicable high-throughput tools available for screening polyketides, particularly those encoded by type I PKSs. Regulatory proteins such as transcription factors have been used as effective devices for sensitive and specific detection of various small molecules. Engineered transcription factors have been described for sensing several small molecules, including dicarboxylic acids, alcohols, and a lactone, but none have been reported for the complex products of type l PKSs.

The biosensor systems, cells, and methods disclosed herein address these and other needs.

SUMMARY

Described herein is a platform technology that comprises genetically-encoded biosensors and methods for detection of polyketides using mutated MphR gene sequences. Such biosensors provide a scalable, economic, high-throughput, and broadly applicable means to specifically identify a target polyketide of interest from a complex mixture of molecules.

In one aspect, disclosed herein is a biosensor system comprising:

a nucleic acid encoding a genetically modified MphR gene sequence, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and

a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor.

In one aspect, disclosed herein is a genetically modified host cell comprising:

a nucleic acid encoding a genetically modified MphR gene sequence, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and

a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor.

In one aspect, provided herein is a method for detecting a polyketide, comprising: introducing into a cell:

i . a nucleic acid encoding a genetically modified MphR gene sequence, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and ii. a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor;

and

detecting the polyketide based on the differential expression of the reporter gene in comparison to a cell comprising a wild-type MphR gene sequence.

In one aspect, provided herein is a method of screening for genetic mutations in a target gene, comprising:

introducing into a cell: i. a nucleic acid encoding a genetically modified MphR gene sequence, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and

ii . a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor;

introducing at least one mutation into a target gene; and

identifying a cell comprising the target gene mutation based on the differential expression of the reporter gene in comparison to a cell comprising the wild-type target gene.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying figures, which are incorporated in and constitute a part of this specification, illustrate several aspects described below.

FIGS. 1 A-1B. The MphR biosensor. (FIG. 1 A) Structures of selected polyketides that are detected by wild-type (WT) MphR. Erythromycin A (ErA) is the natural ligand. (FIG. IB) Artificial MphR-GFP reporter system. In the presence of ErA, MphR changes conformation and stops inhibiting transcription from the PmphR operator, thus turning on reporter expression.

FIGS. 2A-2C. Engineered MphR variants with improved sensitivity towards erythromycin

A (ErA) and sensitivity of amino acid changes compared to ribosome binding site mutations. (FIG. 2 A) Sensitivity of original clones A3, E7, and H4 towards erythromycin A. (FIG. 2B) Sensitivity of wild-type MphR and amino-acid change-only mutations towards erythromycin A. (FIG. 2C) Sensitivity of wild-type MphR and RBS-only mutations towards erythromycin A.

FIG. 3A. Erythromycin, clarithromycin, azithromycin, roxithromycin sensitivity with wild-type (WT) MphR.

FIG. 3B. Erythromycin, clarithromycin, azithromycin, roxithromycin sensitivity with M2D6-E7RBS MphR. FIG. 3C. Erythromycin, clarithromycin, azithromycin, roxithromycin sensitivity with M2D6 MphR.

FIGS. 4A-4C. MphR is a robust macrolide glycosylation sensor. (FIG. 4A) WT MphR detects erythromycin A (ErA) but not the aglycone, 6dEB. (FIG. 4B) Structures of the 12- membered macrolide YC-17 and macrolactone (aglycone) 10-DML. (FIG. 4C) Left, the MphR variant D3 detects YC-17 at concentrations ~ 100-fold lower than WT MphR; Right, neither WT or D3 MphR is activated by the aglycone 10-DML.

FIGS. 5A-5B. Biosynthesis of clarithromycin via an engineered O-methyltransferase (OMT). (FIG. 5A) An OMT with the requisite regioselectivity allows the single-step preparation of clarithromycin from ErA. (FIG. 5B) Role of naturally occurring OMTs that target polyketide sugar residues.

FIGS. 6A-6B. Clarithromycin selective MphR sensor. (FIG. 6 A) Wild-type (WT) MphR does not discriminate ErA/clarithromycin across a 1000-fold concentration range. (FIG. 6B) MphR M1B10 provides higher GFP signal with clarithromycin vs. erythromycin A (ErA) across entire range of concentrations.

FIG. 7. Existing 18-step route to solithromycin compared to a biosynthetic route.

FIGS. 8A-8B. Biosensor-guided engineering of a solithromycin precursor. (FIG. 8 A) Two genetic changes afford I, in low yield. (FIG. 8B) Biosensor-guided screening of large libraries of variants identify prototype pathway s/strains with improved product titers.

FIGS. 9A-9D. O-methyltransferase (OMT) scaffolds for directed evolution. (FIG. 9A)

Phyr2 generated homology model for EryG, 93% of residues were modeled at >90% confidence. Residues involved in the SAM binding site (V88, G89, F90, G91, L92, G93, A94, D112, LI 13, G139, S140, A141, L157). Sticks: putative macrolide (ErA) binding residues (1188, G215, W221, W252, W256, K278, R279, L281, T282, S285, G286, K288, F296), determined by comparison to known acceptor binding sites for related OMTs. (FIG. 9B) Computationally predicted internal cavities of EryG using CAVER Analyst 1.0 (Outer probe 3.00 A, Inner probe 1.90 A). SAM binding site and putative erythromycin A (ErA) binding site are shown. (FIG. 9C) DnrK (PDB: 1TW3) acceptor binding site shown as sticks (E298, L299, R302, M303, F306, L307, Y341). Macrolide ligand shown space filled. (FIG. 9D) MycF (PDB: 4X7U) acceptor binding site shown as sticks (L32, Y49, M132, L134, Y137, V141).

FIGS. 10A-10D. Glycosylation pathways and combinatorial biosynthesis. (FIG. 10A) Reactions catalyzed by glycosyltransferases (GTs). (FIG. 10B) Genes responsible for the biosynthesis of a given polyketide are usually clustered on microbial genomes. (FIG. IOC) Feeding non-native aglycones into heterologous host with non-native DP-sugar and GT genes. (FIG. 10D) Overall reaction catalyzed by DesVII/VIII is shown in the grey box, along with the natural aglycone substrates for this enzyme.

FIGS. 11A-11B. Dose-response curves of several selected clones compared to the wild- type biosensor. Multiple MphR mutants displayed increased sensitivity to erythromycin A versus MphR-WT. Clones generated by error prone PCR (epPCR) (FIG. 11 A) typically performed better than clones generated by multi-site mutagenesis (FIG. 1 IB).

FIGS. 12A-12C. Dose-response curves of MphR-A16T/T154M/M155K compared to the wild-type biosensor induced by erythromycin A, clarithromycin, azithromycin and roxithromycin. (FIG. 12A) MphR-WT responses to erythromycin A and semi-synthetic analogs. (FIG. 12B) MphR- A16T/T154M/M155K responses to erythromycin A and semi-synthetic analogs. Coding of macrolides show potential or actual points of semi -synthetic modification. (FIG. 12C) Structures for erythromycin A (compound 1), clarithromycin (compound 2), azithromycin (compound 3), and roxithromycin (compound 4).

FIG. 13. Late-stage erythromycin A biosynthesis. 6dEB, produced by DEBS1-3, is modified by a suite of enzymes to yield erythromycin D. Biosynthesis from erythromycin D to erythromycin A proceeds via biosynthetic intermediate erythromycin C (filled arrows), or by the shunt pathway via intermediate erythromycin B (dashed arrows). The eryK-catalyzed C-12 hydroxylations and eryG-catalyzed mycarosyl O'-methylations are shown in the figure.

FIGS. 14A-14B. Dose-response curves of the wild-type sensor (FIG. 14A) and the erythromycin A specific sensor MphR-P4L/W107L/H193R (FIG. 14B) in the context of discriminating between erythromycins A (compound 1) and B (compound 5). Clone MphR- P4LAV107L/H193R is capable of significant activation by erythromycin A solely, unlike the general wild-type macrolide biosensor.

FIG. 15. Plasmid map for pMLGFP.

FIG. 16. Plasmid map for pJZ12.

FIG. 17. Sensitivity of the smRBSl Al clone versus the wild-type (WT) biosensor with erythromycin A.

FIG. 18. Sensitivity of clones E7-RBS, smRBS lAl, pikBl, and wild-type (WT) with pikromycin.

FIG. 19 A. Clarithromycin/erythromycin A selectivity with R122T MphR.

FIG. 19B. Clarithromycin/erythromycin A selectivity with the M9C4 clone.

FIG. 19C. Clarithromycin/erythromycin A selectivity with wild-type (WT) MphR.

FIG. 19D. Clarithromycin/erythromycin A selectivity with the E7-M9C4 clone.

FIG. 20. MphR clone "PikBl" can detect a solithromycin biosynthetic intermediate. FIGS. 21 A-21C. Characterization of YC-17, narbomycin, and pikromycin selective MphR Clones. (FIG. 21A) YC-17 sensitivity of Bl clone vs. WT. (FIG. 21B) Narbomycin sensitivity of G7 clone vs. WT. (FIG. 21C) Pikromycin sensitivity of Bl clone vs. WT.

FIG. 22 A. The E7-RBS clone shows increased detection of the erythromycin producing strain, Aeromicrobium erythreum, compared to the wild-type (WT) biosensor.

FIG. 22B. Agar plate detection of the E7-RBS clone shows increased detection of the erythromycin producing strain, Aeromicrobium erythreum, compared to the WT biosensor.

FIG. 23. Plasmid map for WT-pMLCmR.

FIG. 24. Analysis of the control of expression of the chloramphenicol (Cm) resistance gene using pMLCmR.

FIG. 25. Analysis of antibiotic sensitivities of the E7-M9C4 pMLCmR clone.

FIG. 26A. Analysis of wild-type (WT) MphR using a range of ErA/Clarithromycin concentrations. This shows that the WT biosensor does not discriminate between these two polyketides and cannot be used to determine the concentration of clarithromycin in the presence of ErA.

FIG. 26B. Analysis of MphR mutant M9C4 using a range of ErA/Clarithromycin concentrations. This shows that the WT biosensor does discriminate between these two polyketides and can be used to determine the concentration of clarithromycin in the presence of ErA. DETAILED DESCRIPTION OF THE INVENTION

Described herein is a platform technology that comprises genetically-encoded biosensors and methods for detection of polyketides using mutated MphR gene sequences. Such biosensors provide a scalable, economic, high-throughput, and broadly applicable means to specifically identify a target polyketide of interest from a complex mixture of molecules.

Reference will now be made in detail to the embodiments of the invention, examples of which are illustrated in the drawings and the examples. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood to one of ordinary skill in the art to which this invention belongs. The following definitions are provided for the full understanding of terms used in this specification. Terminology

Terms used throughout this application are to be construed with ordinary and typical meaning to those of ordinary skill in the art. However, Applicant desires that the following terms be given the particular definition as defined below.

As used in the specification and claims, the singular form "a," "an," and "the" include plural references unless the context clearly dictates otherwise. For example, the term "a cell" includes a plurality of cells, including mixtures thereof.

As used herein, the terms "may," "optionally," and "may optionally" are used interchangeably and are meant to include cases in which the condition occurs as well as cases in which the condition does not occur.

The terms "about" and "approximately" are defined as being "close to" as understood by one of ordinary skill in the art. In one non-limiting embodiment the terms are defined to be within 10%. In another non-limiting embodiment, the terms are defined to be within 5%. In still another non-limiting embodiment, the terms are defined to be within 1%.

The term "nucleic acid" as used herein means a polymer composed of nucleotides, e.g. deoxyribonucleotides or ribonucleotides.

The terms "ribonucleic acid" and "RNA" as used herein mean a polymer composed of ribonucleotides.

The terms "deoxyribonucleic acid" and "DNA" as used herein mean a polymer composed of deoxyribonucleotides.

The term "oligonucleotide" denotes single- or double-stranded nucleotide multimers of from about 2 to up to about 100 nucleotides in length. Suitable oligonucleotides may be prepared by the phosphoramidite method described by Beaucage and Carruthers, Tetrahedron Lett, 22: 1859-1862 (1981), or by the triester method according to Matteucci, et al., J. Am. Chem. Soc, 103 :3185 (1981), both incorporated herein by reference, or by other chemical methods using either a commercial automated oligonucleotide synthesizer or VLSIPS™ technology. When oligonucleotides are referred to as "double-stranded," it is understood by those of skill in the art that a pair of oligonucleotides exist in a hydrogen-bonded, helical array typically associated with, for example, DNA. In addition to the 100% complementary form of double-stranded oligonucleotides, the term "double-stranded," as used herein is also meant to refer to those forms which include such structural features as bulges and loops, described more fully in such biochemistry texts as Stryer, Biochemistry, Third Ed., (1988), incorporated herein by reference for all purposes.

The term "polynucleotide" refers to a single or double stranded polymer composed of nucleotide monomers. In some embodiments, the polynucleotide is composed of nucleotide monomers of generally greater than 100 nucleotides in length and up to about 8,000 or more nucleotides in length.

The term "polypeptide" refers to a compound made up of a single chain of D- or L-amino acids or a mixture of D- and L-amino acids joined by peptide bonds.

The term "promoter" or "regulatory element" refers to a region or sequence determinants located upstream or downstream from the start of transcription and which are involved in recognition and binding of RNA polymerase and other proteins to initiate transcription. Promoters need not be of bacterial origin, for example, promoters derived from viruses or from other organisms can be used in the compositions, systems, or methods described herein

The term "recombinant" refers to a human manipulated nucleic acid (e.g. polynucleotide) or a copy or complement of a human manipulated nucleic acid (e.g. polynucleotide), or if in reference to a protein (i.e, a "recombinant protein"), a protein encoded by a recombinant nucleic acid (e.g. polynucleotide). In embodiments, a recombinant expression cassette comprising a promoter operably linked to a second nucleic acid (e.g. polynucleotide) may include a promoter that is heterologous to the second nucleic acid (e.g. polynucleotide) as the result of human manipulation (e.g., by methods described in Sambrook et al., Molecular Cloning— A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., (1989) or Current Protocols in Molecular Biology Volumes 1-3, John Wiley & Sons, Inc. (1994-1998)). In another example, a recombinant expression cassette may comprise nucleic acids (e.g. polynucleotides) combined in such a way that the nucleic acids (e.g. polynucleotides) are extremely unlikely to be found in nature. For instance, human manipulated restriction sites or plasmid vector sequences may flank or separate the promoter from the second nucleic acid (e.g. polynucleotide). One of skill will recognize that nucleic acids (e.g. polynucleotides) can be manipulated in many ways and are not limited to the examples above.

The terms "identical" or percent "identity," in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same (i.e., about 60% identity, preferably 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%,94%, 95%, 96%, 97%, 98%, 99% or higher identity over a specified region when compared and aligned for maximum correspondence over a comparison window or designated region) as measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI web site or the like). Such sequences are then said to be "substantially identical." This definition also refers to, or may be applied to, the compliment of a test sequence. The definition also includes sequences that have deletions and/or additions, as well as those that have substitutions. As described below, the preferred algorithms can account for gaps and the like. Preferably, identity exists over a region that is at least about 10 amino acids or 20 nucleotides in length, or more preferably over a region that is 10-50 amino acids or 20-50 nucleotides in length. As used herein, percent (%) amino acid sequence identity is defined as the percentage of amino acids in a candidate sequence that are identical to the amino acids in a reference sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN, ALIGN-2 or Megalign (DNASTAR) software. Appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full-length of the sequences being compared can be determined by known methods.

For sequence comparisons, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Preferably, default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm then calculates the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters.

One example of an algorithm that is suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al. (1977) Nuc. Acids Res. 25:3389-3402, and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al. (1990) J. Mol. Biol. 215:403-410). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always <0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) or 10, M=5, N=-4 and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl. Acad. Sci. USA 89: 10915) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands.

The BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5787). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably less than about 0.01.

The phrase "codon optimized" as it refers to genes or coding regions of nucleic acid molecules for the transformation of various hosts, refers to the alteration of codons in the gene or coding regions of polynucleic acid molecules to reflect the typical codon usage of a selected organism without altering the polypeptide encoded by the DNA. Such optimization includes replacing at least one, or more than one, or a significant number, of codons with one or more codons that are more frequently used in the genes of that selected organism.

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, "operably linked" means that the DNA sequences being linked are near each other, and, in the case of a secretory leader, contiguous and in reading phase. However, operably linked nucleic acids (e.g. enhancers and coding sequences) do not have to be contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide adaptors or linkers are used in accordance with conventional practice. In embodiments, a promoter is operably linked with a coding sequence when it is capable of affecting (e.g. modulating relative to the absence of the promoter) the expression of a protein from that coding sequence (i.e., the coding sequence is under the transcriptional control of the promoter).

"Ribosome binding site ^"' or "RBS" is also called the Shine Dalgarno sequence and generally has a sequence complementary to the 3' terminal of 16S rR A. The ribosomal binding site is found in bacterial and archaeal messenger RNA, and is generally located about 8 bases upstream of the start codon AUG. In particular, the RBS sequence which appears at high frequency is AGGAGG or AAGGAGG (hereinafter these sequences are referred to as ''consensus RBS sequences"), or a sequence homologous with "consensus RBS sequence". Although these sequences appear at various sites of genes, it is understood that the J BS sequences appear at high frequency in regions upstream of start codons. Also included in the term ^"'RBS' ^"' is the RBS sequence from the MphR gene as disclosed herein ("AGAAGG"). Other functional RBS sequences can also be used in place of the specific sequences disclosed herein. When discussing nucleotide mutations in the R BS. the first A is labeled as nucleotide "1" and the final G is label led as nucleotide "6". Alternatively, the mutations may sometimes referred to by their relative position to the ATG start codon. The basic structure of a prokaryote gene consists of a promoter which starts the synthesis of mRNA, a ribosome binding site which participates in the binding between mRNA. and ribosomes and in the translation initiation, a start, codon, a translation stop codon and a terminator which terminates the synthesis of mRNA. AUG codon is the most appropriate as a start codon. Since the start, codons and coding regions are determined usually based upon a DNA. sequence, in the present specification, the sequences of start codons and stop codons and sequences involved in the binding of ribosomes and mRNA are expressed as DNA sequences appropriately as well as RNA sequences, unless mentioned specifically.

The term "gene" or "gene sequence" refers to the coding sequence or control sequence, or fragments thereof. A gene may include any combination of coding sequence and control sequence, or fragments thereof. Thus, a "gene" as referred to herein may be all or part of a native gene. A polynucleotide sequence as referred to herein may be used interchangeably with the term "gene", or may include any coding sequence, non-coding sequence or control sequence, fragments thereof, and combinations thereof. The term "gene" or "gene sequence" includes, for example, control sequences upstream of the coding sequence (for example, the ribosome binding site). MphR Biosensors

Described herein is a platform technology that comprises genetically-encoded biosensors and methods to create them for detection of a class of small molecules called polyketides. Such biosensors provide a scalable, economic, high-throughput, and broadly applicable means to specifically identify a target polyketide of interest from complex mixtures of molecules. Polyketides are used extensively as drugs to treat human, animal, and plant diseases.

Examples of polyketides include, but are not limited to, macrolides, polyenes, enediynes, and aromatic polyketides. In some embodiments, the polyketide is a macrolide. In some embodiments, the polyketide is a 12-membered macrolide. In some embodiments, the polyketide is a 14-membered macrolide.

Due to their widespread use, polyketides are often produced in bacteria via genetic engineering. Detection of polyketides in microbial hosts remains a significant challenge however, and this limits the throughput and success of engineering approaches aimed at improving yields of polyketide and accessing new molecules. Thus, the main application of the present invention relates to the production of antibiotics, anticancer drugs, insecticides, anti-parasitics, anti-fungals, anti-cholesterol, and immunosuppressants in microbial hosts. Because the biosensors can be employed in a wide variety of contexts, other commercial applications include but are not limited to: (/) discovery of polyketide producing genes from collections of genomes; (2) identification and quantification of polyketide-based drugs, contaminants, and other molecules in environmental, clinical, and other research samples; and (3) isolation or removal of target polyketide compounds from complex mixtures.

The sensor is based on the MphR gene, which encodes a transcription factor. The natural role of wild-type (WT) MphR is to activate the expression of resistance genes in response to binding the polyketide antibiotic, erythromycin A (ErA, Figure 1). Upon binding ErA, the MphR protein undergoes a conformational change that causes it to leave its cognate operator DNA sequence, thereby allowing RNA polymerase to transcribe the gene and produce the gene product. By placing the MphR gene sequence and its operator DNA into an artificial vector, MphR can be used to drive the expression of reporter proteins that produce fluorescent, luminescent, or chromogenic signals in the presence of erythromycin A (ErA) (Figure 1(b)). However, compared to ErA, much higher concentrations of other polyketides, even those structurally related to ErA, are required to elicit strong reporter signals using WT MphR (Figure 3(a)). Moreover, most polyketides are not detected by WT MphR at all. These features have severely restricted the utility of MphR as a biosensor for high-throughput analysis of polyketides. Disclosed herein is a panel of MphR variants that are utilized for the detection of specific, target polyketides. Such tailored biosensors enable a suite of high-throughput approaches to be applied to the engineering of polyketide biosynthesis in microbes.

In one embodiment, the operator DNA sequence is 5'- AATATAACCGACGTGACTGTTACATTTAGG-3 (SEQ ID NO:27).

The genetically-encoded biosensors described here are unique in several aspects: (/) biosensors that respond to a broad variety of polyketides are not currently known; (2) biosensors that can discriminate between very closely related polyketide structures have not been described, (3) a strategy to engineer the ligand specificity and/or amount of MphR was developed that is efficient, novel, and non-obvious; and (4) other high-throughput analytical methods/tools to detect most polyketides are not available. Accordingly, high-throughput engineering approaches such as directed gene or enzyme evolution and synthetic biology have not been applied to the vast majority of polyketides due to the lack of suitable screening tools. Such strategies are critical to overcome the poor understanding of how to design and construct biosynthetic or chemical routes to new and existing antibiotics. In contrast, the biosensor-guided approach described herein can be applied to engineering the biosynthesis of a broad range of polyketides in potentially any microbial host, and could be generalized to other classes of natural products such as peptides, alkaloids, and terpenes. The invention disclosed herein can enable production of polyketide products rapidly and at lower cost than existing manufacturing routes, thus maximizing the return on investment and providing incentive to develop new antibiotics.

The biosensor platform is simple (consisting of two genes - one encodes the genetically modified MphR gene sequence and the other encodes a marker/reporter gene (for example, GFP) under the control of the MphR responsive promoter), scalable (genetically encoded so that the host microbe synthesizes all the parts), economic, ultra-high-throughput (millions of potential polyketide producing strains can be assayed using the biosensor), and can be easily adapted to target polyketides of interest (directed evolution is a powerful strategy to engineer the ligand specificity of proteins).

MphR is a repressor protein that controls the transcription of a gene cassette responsible for resistance to macrolide antibiotics via phosphorylation of the desosamine 2'-hydroxy group of ErA. Interestingly, MphR is also de-repressed by other macrolide antibiotics, including josamycin, oleandomycin, narbomycin, methymycin and pikromycin. This promiscuity provides a platform for creating tailored MphR variants for applications related to polyketide synthetic biology and directed evolution beyond those offered by the wild-type biosensor. For example, sensors may recognize a wide variety of polyketides, sensors may distinguish biosynthetic intermediates to allow specific detection of the desired mature product, and the binding affinity and dynamic range of a given biosensor can be tailored for specific applications.

In one aspect, disclosed herein is a biosensor system comprising:

a nucleic acid encoding a genetically modified MphR gene sequence, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and

a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor.

In some embodiments, the biosensor system further comprises a nucleic acid encoding an

MphA gene sequence. In some embodiments, the biosensor system further comprises a nucleic acid encoding a portion of the mrx gene. In some embodiments, the biosensor system further comprises a nucleic acid encoding an MphA gene sequence and a portion of the mrx gene.

In one embodiment, the nucleic acid encoding the genetically modified MphR gene sequence and the reporter gene are located on one recombinant DNA vector. In one embodiment, the nucleic acid encoding the genetically modified MphR gene sequence and the reporter gene are located on one recombinant DNA vector.

In one embodiment, the reporter gene is a gene coding for chloramphenicol acetyltransferase, beta-galactosidase, luciferase or green fluorescent protein (GFP). In one embodiment, the reporter gene is a gene coding for green fluorescent protein (GFP). In one embodiment, the reporter gene is a gene coding for chloramphenicol acetyltransferase.

In some embodiments, the MphR mutation confers improved sensitivity for detecting erythromycin A. In one embodiment, the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence. In one embodiment, the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from A1G, AIT, AIC, G2T, G2A, A3C, A3G, A4T, G5T, G6T, or a combination thereof. In one embodiment, the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from A1G, A4T, or a combination thereof. In one embodiment, the MphR genetic mutation encodes an A1G nucleotide change in the ribosome binding site sequence. In one embodiment, the MphR genetic mutation encodes an A4T nucleotide change in the ribosome binding site sequence.

In one embodiment, the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from AIT, G2T, A3C, or a combination thereof. In one embodiment, the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from AIC, G2T, A3G, or a combination thereof. In one embodiment, the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from G2A, G5T, or a combination thereof.

In one embodiment, the MphR genetic mutation encodes the amino acid change selected from T17R, T27G, Q65M, T27A, M59E, M59S, R22H, K35N, T49I, L89V, D98N, E109D, R122T, K132N, A151T, H184Q, T49I, L89V, D98N, E109D, or a combination thereof. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from T17R, T27G, Q65M, T27A, M59E, M59S, R22H, K35N, or a combination thereof. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from T49I, L89V, D98N, E109D, R122T, K132N, A151T, H184Q, T49I, L89V, D98N, E109D, or a combination thereof.

In some embodiments, the MphR mutation confers improved selectivity for detecting erythromycin A in comparison to other polyketides. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from A16T, T154M, M155K, or a combination thereof. In one embodiment, the MphR genetic mutation encodes an A4T nucleotide change in the ribosome binding site sequence and an amino acid change selected from A16T, T154M, M155K, or a combination thereof.

In some embodiments, the MphR mutation confers improved selectivity for detecting erythromycin A in comparison to structurally similar precursors. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from P4L, W107L, H193R, or a combination thereof.

In some embodiments, the MphR mutation confers improved sensitivity for detecting pikromycin. In one embodiment, the MphR genetic mutation encodes the amino acid change S106F.

In some embodiments, the MphR mutation confers improved sensitivity for detecting narbomycin. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from V33L, A34S, R51C, or a combination thereof.

In some embodiments, the MphR mutation confers improved sensitivity for detecting clarithromycin. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from T49I, L89V, D98N, E109D or a combination thereof. In one embodiment, the MphR genetic mutation encodes the amino acid change R122T. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from R122T, K132N, A151T, H184Q, or a combination thereof. In one embodiment, the MphR genetic mutation encodes an A4T nucleotide change in the ribosome binding site sequence and an amino acid change selected from R122T, K132N, A151T, H184Q, or a combination thereof. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from T49I, L89V, D98N, E109D, or a combination thereof.

In one aspect, disclosed herein is a genetically modified host cell comprising:

a nucleic acid encoding a genetically modified MphR gene sequence, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and

a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor.

In one embodiment, the nucleic acid encoding the genetically modified MphR gene sequence and the reporter gene are located on one recombinant DNA vector.

In one embodiment, the reporter gene is a gene coding for chloramphenicol acetyltransferase, beta-galactosidase, luciferase or green fluorescent protein (GFP). In one embodiment, the reporter gene is a gene coding for green fluorescent protein (GFP). In one embodiment, the reporter gene is a gene coding for chloramphenicol acetyltransferase.

In one embodiment, the cell is E. coli. In one embodiment, the cell is Streptomyces. In one embodiment, the cell is Streptomyces venezuelae. In one embodiment, the cell is Saccharopolyspora erythraea. In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the nucleotide sequence upstream of the ATG start codon of the MphR gene sequence, wherein the mutation confers increased sensitivity for detection of erythromycin A in comparison to the wild type MphR transcription factor.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the ribosome binding site sequence of the MphR gene sequence, wherein the mutation confers increased sensitivity for detection of erythromycin A in comparison to the wild type MphR transcription factor.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the MphR protein sequence, wherein the mutation confers increased sensitivity for detection of erythromycin A in comparison to the wild type MphR transcription factor.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the nucleotide sequence upstream of the ATG start codon of the MphR gene sequence, wherein the mutation confers increased sensitivity for detection of erythromycin A in comparison to the wild type MphR transcription factor.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the ribosome binding site sequence of the MphR gene sequence, wherein the mutation confers increased sensitivity for detection of erythromycin A in comparison to the wild type MphR transcription factor.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the MphR protein sequence, wherein the mutation confers increased selectivity for detection of erythromycin A in comparison to other polyketides.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the MphR protein sequence, wherein the mutation confers increased selectivity for detection of erythromycin A in comparison to structurally similar precursors.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the MphR protein sequence, wherein the mutation confers increased sensitivity for detection of pikromycin in comparison to the wild type MphR transcription factor.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the nucleotide sequence upstream of the ATG start codon of the MphR gene sequence, wherein the mutation confers increased sensitivity for detection of pikromycin in comparison to the wild type MphR transcription factor.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the ribosome binding site sequence of the MphR gene sequence, wherein the mutation confers increased sensitivity for detection of pikromycin in comparison to the wild type MphR transcription factor.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the MphR protein sequence, wherein the mutation confers increased sensitivity for detection of narbomycin in comparison to the wild type MphR transcription factor.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the nucleotide sequence upstream of the ATG start codon of the MphR gene sequence, wherein the mutation confers increased sensitivity for detection of narbomycin in comparison to the wild type MphR transcription factor. In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the ribosome binding site sequence of the MphR gene sequence, wherein the mutation confers increased sensitivity for detection of narbomycin in comparison to the wild type MphR transcription factor.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the MphR protein sequence, wherein the mutation confers increased sensitivity for detection of YC-17 in comparison to the wild type MphR transcription factor.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the nucleotide sequence upstream of the ATG start codon of the MphR gene sequence, wherein the mutation confers increased sensitivity for detection of YC- 17 in comparison to the wild type MphR transcription factor.

In some embodiments, disclosed herein is a genetically modified MphR gene sequence comprising at least one mutation in the ribosome binding site sequence of the MphR gene sequence, wherein the mutation confers increased sensitivity for detection of YC-17 in comparison to the wild type MphR transcription factor.

In one aspect, disclosed herein is a biosensor system comprising:

a nucleic acid encoding a genetically modified MphR transcription factor, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and

a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor.

In one aspect, disclosed herein is a genetically modified host cell comprising:

a nucleic acid encoding a genetically modified MphR transcription factor, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and

a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor.

In one aspect, provided herein is a method for detecting a polyketide, comprising: introducing into a cell:

i. a nucleic acid encoding a genetically modified MphR transcription factor, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and ii. a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor;

and

detecting the polyketide based on the differential expression of the reporter gene in comparison to a cell comprising a wild-type MphR transcription factor.

In one aspect, provided herein is a method of screening for genetic mutations in a target gene, comprising:

introducing into a cell: i. a nucleic acid encoding a genetically modified MphR transcription factor, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and

ii . a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor;

introducing at least one mutation into a target gene; and

identifying a cell comprising the target gene mutation based on the differential expression of the reporter gene in comparison to a cell comprising the wild-type target gene.

MphR Biosensors: Methods

In one aspect, provided herein is a method for detecting a polyketide, comprising: introducing into a cell:

i . a nucleic acid encoding a genetically modified MphR gene sequence, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and

ii. a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor;

and

detecting the polyketide based on the differential expression of the reporter gene in comparison to a cell comprising a wild-type MphR gene sequence.

In one embodiment, the nucleic acid encoding the genetically modified MphR gene sequence and the reporter gene are located on one recombinant DNA vector.

In one embodiment, the reporter gene is a gene coding for chloramphenicol acetyltransferase, beta-galactosidase, luciferase or green fluorescent protein (GFP). In one embodiment, the reporter gene is a gene coding for green fluorescent protein (GFP). In some embodiments, the MphR mutation confers improved sensitivity for detecting erythromycin A. In one embodiment, the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence. In one embodiment, the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from A1G, AIT, AIC, G2T, G2A, A3C, A3G, A4T, G5T, G6T, or a combination thereof. In one embodiment, the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from A1G, A4T, or a combination thereof. In one embodiment, the MphR genetic mutation encodes an A1G nucleotide change in the ribosome binding site sequence. In one embodiment, the MphR genetic mutation encodes an A4T nucleotide change in the ribosome binding site sequence.

In one embodiment, the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from AIT, G2T, A3C, or a combination thereof. In one embodiment, the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from AIC, G2T, A3G, or a combination thereof. In one embodiment, the MphR genetic mutation encodes a nucleotide change in the ribosome binding site sequence selected from G2A, G5T, or a combination thereof.

In one embodiment, the MphR genetic mutation encodes the amino acid change selected from T17R, T27G, Q65M, T27A, M59E, M59S, R22H, K35N, T49I, L89V, D98N, E109D, R122T, K132N, A151T, H184Q, T49I, L89V, D98N, E109D, or a combination thereof. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from T17R, T27G, Q65M, T27A, M59E, M59S, R22H, K35N, or a combination thereof. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from T49I, L89V, D98N, E109D, R122T, K132N, A151T, H184Q, T49I, L89V, D98N, E109D, or a combination thereof.

In some embodiments, the MphR mutation confers improved selectivity for detecting erythromycin A in comparison to other polyketides. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from A16T, T154M, M155K, or a combination thereof. In one embodiment, the MphR genetic mutation encodes an A4T nucleotide change in the ribosome binding site sequence and an amino acid change selected from A16T, T154M, M155K, or a combination thereof.

In some embodiments, the MphR mutation confers improved selectivity for detecting erythromycin A in comparison to structurally similar precursors. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from P4L, W107L, H193R, or a combination thereof. In some embodiments, the MphR mutation confers improved sensitivity for detecting pikromycin. In one embodiment, the MphR genetic mutation encodes the amino acid change S106F.

In some embodiments, the MphR mutation confers improved sensitivity for detecting narbomycin. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from V33L, A34S, R51C, or a combination thereof.

In some embodiments, the MphR mutation confers improved sensitivity for detecting clarithromycin. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from T49I, L89V, D98N, E109D or a combination thereof. In one embodiment, the MphR genetic mutation encodes the amino acid change R122T. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from R122T, K132N, A151T, H184Q, or a combination thereof. In one embodiment, the MphR genetic mutation encodes an A4T nucleotide change in the ribosome binding site sequence and an amino acid change selected from R122T, K132N, A151T, H184Q, or a combination thereof. In one embodiment, the MphR genetic mutation encodes the amino acid change selected from T49I, L89V, D98N, E109D, or a combination thereof.

In one embodiment, the cell is E. coli. In one embodiment, the cell is Streptomyces. In one embodiment, the cell is Streptomyces venezuelae.

In one aspect, provided herein is a method of screening for genetic mutations in a target gene, comprising:

introducing into a cell: i. a nucleic acid encoding a genetically modified MphR gene sequence, wherein the nucleic acid comprises at least one genetic mutation when compared to the wild-type MphR gene sequence; and

it. a reporter gene whose transcription is under the control of a promoter region which is regulated by the MphR transcription factor;