BENČINA MOJCA (SI)
MAJERLE ANDREJA (SI)
OBLAK ALJA (SI)
LEBAR TINA (SI)
FORSTNERIČ VIDA (SI)
LONZARIĆ JAN (SI)
SMOLE ANŽE (SI)
GABER ROK (SI)
BEZELJAK URBAN (SI)
GOLOB ANJA (SI)
KADUNC LUCIJA (SI)
VUČKO DUŠAN (SI)
STRAŽAR MARTIN (SI)
PIRŠ BOŠTJAN (SI)
JERALA MIHA (SI)
ZUPANČIČ UROŠ (SI)
SOMRAK MAJA (SI)
LUŽNIK ZALA (SI)
EN FIST CT ODLI & CCARON NOSTI ZD (SI)
WO2000032748A1 | 2000-06-08 |
US6841376B2 | 2005-01-11 | |||
US20120110685A1 | 2012-05-03 | |||
US6841376B2 | 2005-01-11 |
PTASHNE, M.: "A Genetic Switch: Phage a and Higher Organisms", CELL, 1992
ISHIURA, M. ET AL.: "Expression of a gene cluster kaiABC as a circadian feedback process in cyanobacteria", SCIENCE, vol. 281, pages 1519 - 1523, XP002216759, DOI: doi:10.1126/science.281.5382.1519
GARDNER, T. S.; CANTOR, C. R.; COLLINS, J. J.: "Construction of a genetic toggle switch in Escherichia coli", NATURE, vol. 403, 2000, pages 339 - 42, XP002216760, DOI: doi:10.1038/35002131
KRAMER, B. P. ET AL.: "An engineered epigenetic transgene switch in mammalian cells", NATURE BIOTECHNOLOGY, vol. 22, 2004, pages 867 - 70, XP002718464, DOI: doi:10.1038/NBT980
ELOWITZ, M. B.; LEIBLER, S.: "A synthetic oscillatory network of transcriptional regulators", NATURE, vol. 403, 2000, pages 335 - 8, XP002223673, DOI: doi:10.1038/35002125
FUNG, E. ET AL.: "A synthetic gene-metabolic oscillator", NATURE, vol. 435, 2005, pages 118 - 122
TAMSIR, A.; TABOR, J. J.; VOIGT, C.: "a. Robust multicellular computing using genetically encoded NOR gates and chemical ''wires.", NATURE, vol. 1-4, 2010
Claims 1. A bistable genetic toggle switch comprising a pair of mutual repressors and a positive feedback loop based on modular DNA-binding proteins. 2. A bistable genetic toggle switch according to claim 1 that comprises a pair of repressors which mutually repress each other and a pair of activators which are reinforced in a positive feedback loop manner and also activate the repressor of the opposing element in the pair of repressors. 3. A bistable genetic toggle switch according to claim 2 wherein the repressors comprise a DNA-binding domain and a repression domain, whereas the activators comprise a DNA- binding domain and an activation domain. 4. A bistable genetic toggle switch according to claim 3, wherein the repression domain is the KRAB domain and the activation domain is either the VP 16 or the VP 64 domain. 5. A bistable genetic toggle switch according to claims 1 to 4 where the DNA-binding domain binds DNA as a monomer. 6. A bistable genetic toggle switch according to any of the claims 1 to 5 which can occupy stable states 1 or 2 and comprises: a) An operon for maintaining state 1 by a positive feedback loop that includes a structural gene for an activator A, which forms a positive feedback loop and where transcription of the structural gene for the activator A is regulated by an operator that comprises the DNA binding element A and a promoter; b) An operon for maintaining state 1 with a repressor B, which inhibits state 2 by repressing the gene, regulated by the operator B and where transcription of the structural gene for repressor B is regulated by an operator that comprises the DNA binding element A and a promoter; c) An operon exhibiting an activation of state 1 with an effector gene 1 that can be any gene or a group of genes e.g. different enzymes, fluorescent proteins, signaling proteins etc. and where transcription of this effector gene 1 is regulated by an operator that comprises the DNA binding element A and a promoter; d) An operon for maintaining state 2 by a positive feedback loop that includes a structural gene for an activator B, which forms a positive feedback loop and where transcription of the structural gene for the activator B is regulated by an operator that comprises the DNA binding element B and a promoter; e) An operon for maintaining state 2 with a repressor A, which inhibits state 1 by repressing the gene, regulated by the operator A and where transcription of the structural gene for repressor A is regulated by an operator that comprises the DNA binding element B and a promoter; f) An operon exhibiting an activation of state 2 with an effector gene 2 that can be any gene or a group of genes e.g. different enzymes, fluorescent proteins, signaling proteins etc. and where transcription of this effector gene 2 is regulated by an operator that comprises the DNA binding element B and a promoter; g) Operons of inducer-dependent activators or repressors that include structural genes for inducer-dependent proteins regulated by a promoter, preferentially constitutive or minimal, and optionally an operator. where A represents a selected recognition site for the DNA-binding domain A, that is included in the activator A and the repressor A, whereas B represents a selected recognition site for the DNA-binding domain B, that is included in the activator B and the repressor B. 7. A bistable genetic toggle switch according to any of the claims 1 to 6 wherein two or more structural genes are regulated by the same operator: activator A, repressor B, effector 1 and activator B, repressor A, effector 2; joined together so that they are regulated by the same operator and linked together by a self-cleaving 2A peptide. They are transcribed from a structural gene as a single RNA molecule and are translated into a polypeptide chain that autocatalitically cleaves at the positions of self-cleaving peptides, leading to synthesis of separate individual proteins in an equal stoichiometric ratio. 8. Bistable genetic toggle switch according to any of the claims 1 to 7 that comprises a) operons of repressor B, activator A and effector 1 for expressing state 1 by a positive feedback loop, that comprises structural genes for the repressor B, activator A and effector 1 at aseparate operon or in any combination the repressor B, activator A and effector 1 separated by a self-cleaving peptide and where the operons for state 1 are regulated by the same operator A that comprises the DNA binding element A and a promoter. b) operons of repressor A, activator B and effector 2 for expressing state 2 by a positive feedback loop, that comprises structural genes for the repressor A, activator B and effector 2 at aseparate operon or in any combination the repressor A, activator B and effector 2 separated by a self-cleaving peptide and where the operons for state 2 are regulated by the same operator B that comprises the DNA binding element B and a promoter. c) operon for switching to state 1 , which comprises a structural gene for the activator A for activation of state 1 and an operon for switching to state 1 , that comprises a structural gene for the repressor B for inhibition of state 2, where transcription of these structural genes on separate operons is under control of same operator, which activates in the presence if inducer A, and a promoter, which can be either constitutive or minimal. d) operon for switching to state 2, which comprises a structural gene for the activator B for activation of state 2 and an operon for switching to state 2, that comprises a structural gene for the repressor A for inhibition of state 1, where transcription of these structural genes on separate operons is under control of same operator, which activates in the presence if inducer B, and a promoter, which can be either constitutive or minimal. e) operons of the inducer-dependent activators and repressors, which comprise structural genes for inducer-dependent proteins under regulation of a promoter, preferably constitutive or minimal, and optionally an operator. 9. Bistable genetic toggle switch according to any of the claims from 1 to 8, where the operator of the operon includes any number of equal or different type of DNA binding elements, which can be joined into clusters or distributed in alternating order, the operator preferably includes 1 to 20 DNA-binding elements, which are separated with nucleotide sequence of any length, and where operators A of operons for expression of state 1 differ from operators B of operons for expression of state 2 and differ from operators A of operons for switching to state 1 and operators B of operons for switching to state 2 and a) DNA binding elements in operators A are designed to bind the DNA-binding domain of repressor A and activator A. b) DNA binding elements in operators B are designed to bind the DNA-binding domain of repressor B and activator B. 10. A bistable genetic toggle switch according to any claim from 1 to 9 which can be used for controlled synthesis of effectors 1 or effectors 2, depending of the state of the switch and the presence of inducers for activation of a specific state, where the effectors can be any type of protein or a group of proteins. 11. A bistable genetic toggle switch according to any claim from 1 to 10 which can be used as indicator which through synthesis of effector 1 or effector 2 reports the presence of inducer for activation of a specific state where the effectors can be any type of protein or a group of proteins. 12. A bistable genetic toggle switch, where the DNA with the sequence SEQ ID NO.: 1, 2, 4, 5, 7, 8, 9, 10 and pCMV-tTR-KRAB, pCMV-ECD-KRAB is introduced to mammalian cells transiently or permanently. 13. Prokaryotic or eukaryotic host cells, that contain a bistable genetic toggle switch according to any of the claims from 1 to 12. 14. Method for effector gene expression, that comprises (a) host cells according to claim 13 containing a bistable genetic toggle switch according to any of the claims from 1 to 12 and (b) cultivation of the cells in such a way that they express an effector protein when the switch is switched to state 1 in the presence of inducer 1 or when switch is switched to state 2 in the presence of inducer 2. 1/65 SEQUENCE LISTING <110> Kemijski institut, Ljubljana, Slovenia and EN-FIST Center odlicnosti, Ljubljana, Slovenia <120> Bistable genetic toggle switch comprising a pair of reciprocal Repressors and a positive feedback loop based on DNA-binding Proteins <130> 301-P31PC/13 <140> - <141> 2013-09-11 <150> SI P-201200285 <151> 2012-09-20 <160> 22 <170> Patentln version 3.5 <210> 1 <211> 3465 <212> DNA <213> Unknown <220> <223> Operon composed of operator DNA-binding elements, minimal or constitutive promoter, structural genes <400> 1 taaagagcag aaactcatct tccgtttcca catcttgctg gcagccgggg taccgcaggc 60 aagtcgctaa agagcagaaa ctcatcttcc gtttccacat ctatggaatc acggcggccg 120 cccgtatctc atcctaaaga gcagaaactc atcttccgtt tccacatcta atgcgtcggc 180 gtgcggccgc tgccagttgc ataaagagca gaaactcatc ttccgtttcc acatctttgg 240 ctgtcctagg gtacctggac gctccttgta aagagcagaa actcatcttc cgtttccaca 300 tcttggctgg tgccgcaccg gtaagcagaa ctagaggtgg caccggtttg attgcacagc 360 taaagagcag aaactcatct tccgtttcca catcttgctg gcagccgggg taccgcaggc 420 aagtcgctaa agagcagaaa ctcatcttcc gtttccacat ctatggaatc acggcggccg 480 cccgtatctc atcctaaaga gcagaaactc atcttccgtt tccacatcta atgcgtcggc 540 gtgcggccgc tgccagttgc ataaagagca gaaactcatc ttccgtttcc acatctttgg 600 ctgtcctagg gtacctggac gctccttgta aagagcagaa actcatcttc cgtttccaca 660 tcttggctgg tgccgcaccg gtaagcagag gatcttagag ggtatataat ggaagctcga 720 cttccagctc gagggcaatc cggtactgtt ggtaaagcca ccgaattcgc caccatgcac 780 caccaccacc accacgacta caaagaccat gacggtgatt ataaagatca tgacatcgat 840 tacaaggatg acgatgacaa gatggccccc aagaagaaga ggaaggtggg cattcaccgc 900 ggggtaccta tggtggactt gaggacactc ggttattcgc aacagcaaca ggagaaaatc 960 aagcctaagg tcaggagcac cgtcgcgcaa caccacgagg cgcttgtggg gcatggcttc 1020 actcatgcgc atattgtcgc gctttcacag caccctgcgg cgcttgggac ggtggctgtc 1080 aaataccaag atatgattgc ggccctgccc gaagccacgc acgaggcaat tgtaggggtc 1140 ggtaaacagt ggtcgggagc gcgagcactt gaggcgctgc tgactgtggc gggtgagctt 1200 agggggcctc cgctccagct cgacaccggg cagctgctga agatcgcgaa gagaggggga 1260 gtaacagcgg tagaggcagt gcacgcctgg cgcaatgcgc tcaccggggc ccccttgaac 1320 ctgaccccag accaggtagt cgcaatcgcg tcacatgacg ggggaaagca agccctggaa 1380 accgtgcaaa ggttgttgcc ggtcctttgt caagaccacg gccttacacc ggagcaagtc 1440 gtggccattg caagcaatgg gggtggcaaa caggctcttg agacggttca gagacttctc 1500 ccagttctct gtcaagccca cgggctgact cccgatcaag ttgtagcgat tgcgtccaac 1560 ggtggaggga aacaagcatt ggagactgtc caacggctcc ttcccgtgtt gtgtcaagcc 1620 cacggtttga cgcctgcaca agtggtcgcc atcgccagcc atgatggcgg taagcaggcg 1680 ctggaaacag tacagcgcct gctgcctgta ctgtgccagg atcatggact gaccccagac 1740 caggtagtcg caatcgcgtc acatgacggg ggaaagcaag ccctggaaac cgtgcaaagg 1800 ttgttgccgg tcctttgtca agaccacggc cttacaccgg agcaagtcgt ggccattgca 1860 aataataacg gtggcaaaca ggctcttgag acggttcaga gacttctccc agttctctgt 1920 caagcccacg ggctgactcc cgatcaagtt gtagcgattg cgtccaacgg tggagggaaa 1980 caagcattgg agactgtcca acggctcctt cccgtgttgt gtcaagccca cggtttgacg 2040 cctgcacaag tggtcgccat cgcctcgaat ggcggcggta agcaggcgct ggaaacagta 2100 cagcgcctgc tgcctgtact gtgccaggat catggactga ccccagacca ggtagtcgca 2160 atcgcgtcaa acggaggggg aaagcaagcc ctggaaaccg tgcaaaggtt gttgccggtc 2220 ctttgtcaag accacggcct tacaccggag caagtcgtgg ccattgcatc ccacgacggt 2280 ggcaaacagg ctcttgagac ggttcagaga cttctcccag ttctctgtca agcccacggg 2340 ctgactcccg atcaagttgt agcgattgcg tcgcatgacg gagggaaaca agcattggag 2400 actgtccaac ggctccttcc cgtgttgtgt caagcccacg gtttgacgcc tgcacaagtg 2460 gtcgccatcg cctccaatat tggcggtaag caggcgctgg aaacagtaca gcgcctgctg 2520 cctgtactgt gccaggatca tggactgacc ccagaccagg tagtcgcaat cgcgtcacat 2580 gacgggggaa agcaagccct ggaaaccgtg caaaggttgt tgccggtcct ttgtcaagac 2640 cacggcctta caccggagca agtcgtggcc attgcaagca acatcggtgg caaacaggct 2700 cttgagacgg ttcagagact tctcccagtt ctctgtcaag cccacgggct gactcccgat 2760 caagttgtag cgattgcgtc caacggtgga gggaaacaag cattggagac tgtccaacgg 2820 ctccttcccg tgttgtgtca agcccacggt ttgacgcctg cacaagtggt cgccatcgcc 2880 agccatgatg gcggtaagca ggcgctggaa acagtacagc gcctgctgcc tgtactgtgc 2940 caggatcatg gactgacacc cgaacaggtg gtcgccattg cttctaatgg gggaggacgg 3000 ccagccttgg agtccatcgt agcccaattg tccaggcccg atcccgcgtt ggctgcgtta 3060 acgaatgacc atctggtggc gttggcatgt cttggtggac gacccgcgct cgatgcagtc 3120 aaaaagggtc tgcctcatgc tcccgcattg atcaaaagaa ccaaccggcg gattcccgag 3180 agaacttccc atcgagtcgc gggatccgat ccaaaaaaga agagaaaggt agcccccccg 3240 accgatgtca gcctggggga cgagctccac ttagacggcg aggacgtggc gatggcgcat 3300 gccgacgcgc tagacgattt cgatctggac atgttggggg acggggattc cccgggtccg 3360 ggatttaccc cccacgactc cgccccctac ggcgctctgg atatggccga cttcgagttt 3420 gagcagatgt ttaccgatgc ccttggaatt gacgagtacg gtggg 3465 <210> 2 <211> 4395 <212> DNA <213> Unknown <220> <223> Synthetic genetic construct <400> 2 taaagagcag aaactcatct tccgtttcca catcttgctg gcagccgggg taccgcaggc 60 aagtcgctaa agagcagaaa ctcatcttcc gtttccacat ctatggaatc acggcggccg 120 cccgtatctc atcctaaaga gcagaaactc atcttccgtt tccacatcta atgcgtcggc 180 gtgcggccgc tgccagttgc ataaagagca gaaactcatc ttccgtttcc acatctttgg 240 ctgtcctagg gtacctggac gctccttgta aagagcagaa actcatcttc cgtttccaca 300 tcttggctgg tgccgcaccg gtaagcagaa ctagaggtgg caccggtttg attgcacagc 360 taaagagcag aaactcatct tccgtttcca catcttgctg gcagccgggg taccgcaggc 420 aagtcgctaa agagcagaaa ctcatcttcc gtttccacat ctatggaatc acggcggccg 480 cccgtatctc atcctaaaga gcagaaactc atcttccgtt tccacatcta atgcgtcggc 540 gtgcggccgc tgccagttgc ataaagagca gaaactcatc ttccgtttcc acatctttgg 600 ctgtcctagg gtacctggac gctccttgta aagagcagaa actcatcttc cgtttccaca 660 tcttggctgg tgccgcaccg gtaagcagag gatcttagag ggtatataat ggaagctcga 720 cttccagctc gagggcaatc cggtactgtt ggtaaagcca ccgaattcgc caccatgcac 780 caccaccacc accacgacta caaagaccat gacggtgatt ataaagatca tgacatcgat 840 tacaaggatg acgatgacaa gatggccccc aagaagaaga ggaaggtggg cattcaccgc 900 ggggtaccta tggtggactt gaggacactc ggttattcgc aacagcaaca ggagaaaatc 960 aagcctaagg tcaggagcac cgtcgcgcaa caccacgagg cgcttgtggg gcatggcttc 1020 actcatgcgc atattgtcgc gctttcacag caccctgcgg cgcttgggac ggtggctgtc 1080 aaataccaag atatgattgc ggccctgccc gaagccacgc acgaggcaat tgtaggggtc 1140 ggtaaacagt ggtcgggagc gcgagcactt gaggcgctgc tgactgtggc gggtgagctt 1200 agggggcctc cgctccagct cgacaccggg cagctgctga agatcgcgaa gagaggggga 1260 gtaacagcgg tagaggcagt gcacgcctgg cgcaatgcgc tcaccggggc ccccttgaac 1320 ctgaccccag accaggtagt cgcaatcgcg tcaaacggag ggggaaagca agccctggaa 1380 accgtgcaaa ggttgttgcc ggtcctttgt caagaccacg gccttacacc ggagcaagtc 1440 gtggccattg caagcaatgg gggtggcaaa caggctcttg agacggttca gagacttctc 1500 ccagttctct gtcaagccca cgggctgact cccgatcaag ttgtagcgat tgcgtcgaac 1560 attggaggga aacaagcatt ggagactgtc caacggctcc ttcccgtgtt gtgtcaagcc 1620 cacggtttga cgcctgcaca agtggtcgcc atcgccagcc atgatggcgg taagcaggcg 1680 ctggaaacag tacagcgcct gctgcctgta ctgtgccagg atcatggact gaccccagac 1740 caggtagtcg caatcgcgtc aaacggaggg ggaaagcaag ccctggaaac cgtgcaaagg 1800 ttgttgccgg tcctttgtca agaccacggc cttacaccgg agcaagtcgt ggccattgca 1860 aataataacg gtggcaaaca ggctcttgag acggttcaga gacttctccc agttctctgt 1920 caagcccacg ggctgactcc cgatcaagtt gtagcgattg cgtcgcatga cggagggaaa 1980 caagcattgg agactgtcca acggctcctt cccgtgttgt gtcaagccca cggtttgacg 2040 cctgcacaag tggtcgccat cgcctcgaat ggcggcggta agcaggcgct ggaaacagta 2100 cagcgcctgc tgcctgtact gtgccaggat catggactga ccccagacca ggtagtcgca 2160 atcgcgaaca ataatggggg aaagcaagcc ctggaaaccg tgcaaaggtt gttgccggtc 2220 3/65 ctttgtcaag accacggcct tacaccggag caagtcgtgg ccattgcatc ccacgacggt 2280 ggcaaacagg ctcttgagac ggttcagaga cttctcccag ttctctgtca agcccacggg 2340 ctgactcccg atcaagttgt agcgattgcg tccaacggtg gagggaaaca agcattggag 2400 actgtccaac ggctccttcc cgtgttgtgt caagcccacg gtttgacgcc tgcacaagtg 2460 gtcgccatcg ccagccatga tggcggtaag caggcgctgg aaacagtaca gcgcctgctg 2520 cctgtactgt gccaggatca tggactgacc ccagaccagg tagtcgcaat cgcgtcacat 2580 gacgggggaa agcaagccct ggaaaccgtg caaaggttgt tgccggtcct ttgtcaagac 2640 cacggcctta caccggagca agtcgtggcc attgcatccc acgacggtgg caaacaggct 2700 cttgagacgg ttcagagact tctcccagtt ctctgtcaag cccacgggct gactcccgat 2760 caagttgtag cgattgcgaa taacaatgga gggaaacaag cattggagac tgtccaacgg 2820 ctccttcccg tgttgtgtca agcccacggt ttgacgcctg cacaagtggt cgccatcgcc 2880 agccatgatg gcggtaagca ggcgctggaa acagtacagc gcctgctgcc tgtactgtgc 2940 caggatcatg gactgacacc cgaacaggtg gtcgccattg cttctaatgg gggaggacgg 3000 ccagccttgg agtccatcgt agcccaattg tccaggcccg atcccgcgtt ggctgcgtta 3060 acgaatgacc atctggtggc gttggcatgt cttggtggac gacccgcgct cgatgcagtc 3120 aaaaagggtc tgcctcatgc tcccgcattg atcaaaagaa ccaaccggcg gattcccgag 3180 agaacttccc atcgagtcgc gggatccgat ccaaaaaaga agagaaaggt accaaaaaag 3240 aagagaaagg tcgacggcgg tggtgctttg tctcctcagc actctgctgt cactcaagga 3300 agtatcatca agaacaagga gggcatggat gctaagtcac taactgcctg gtcccggaca 3360 ctggtgacct tcaaggatgt atttgtggac ttcaccaggg aggagtggaa gctgctggac 3420 actgctcagc agatcgtgta cagaaatgtg atgctggaga actataagaa cctggtttcc 3480 ttgggttatc agcttactaa gccagatgtg atcctccggt tggagaaggg agaagagccc 3540 tggctggtgg agagagaaat tcaccaagag acccatcctg attcagagac tgcatttgaa 3600 atcaaatcat cagttagagc cgaggggaga ggaagtcttc tgacctgcgg agacgtcgaa 3660 gagaatcctg gacccatggt gagcaagggc gaggagctgt tcaccggggt ggtgcccatc 3720 ctggtcgagc tggacggcga cgtaaacggc cacaagttca gcgtgtccgg cgagggcgag 3780 ggcgatgcca cctacggcaa gctgaccctg aagttcatct gcaccaccgg caagctgccc 3840 gtgccctggc ccaccctcgt gaccaccttc ggctacggcc tgatgtgctt cgcccgctac 3900 cccgaccaca tgaagcagca cgacttcttc aagtccgcca tgcccgaagg ctacgtccag 3960 gagcgcacca tcttcttcaa ggacgacggc aactacaaga cccgcgccga ggtgaagttc 4020 gagggcgaca ccctggtgaa ccgcatcgag ctgaagggca tcgacttcaa ggaggacggc 4080 aacatcctgg ggcacaagct ggagtacaac tacaacagcc acaacgtcta tatcatggcc 4140 gacaagcaga agaacggcat caaggtgaac ttcaagatcc gccacaacat cgaggacggc 4200 agcgtgcagc tcgccgacca ctaccagcag aacaccccca tcggcgacgg ccccgtgctg 4260 ctgcccgaca accactacct gagctaccag tccgccctga gcaaagaccc caacgagaag 4320 cgcgatcaca tggtcctgct ggagttcgtg accgccgccg ggatcactct cggcatggac 4380 gagctgtaca agtaa 4395 <210> 3 <211> 7125 <212> DNA <213> Unknown <220> <223> Operon composed of operator DNA-binding elements, minimal or constitutive promoter, structural genes <400> 3 taaagagcag aaactcatct tccgtttcca catcttgctg gcagccgggg taccgcaggc 60 aagtcgctaa agagcagaaa ctcatcttcc gtttccacat ctatggaatc acggcggccg 120 cccgtatctc atcctaaaga gcagaaactc atcttccgtt tccacatcta atgcgtcggc 180 gtgcggccgc tgccagttgc ataaagagca gaaactcatc ttccgtttcc acatctttgg 240 ctgtcctagg gtacctggac gctccttgta aagagcagaa actcatcttc cgtttccaca 300 tcttggctgg tgccgcaccg gtaagcagaa ctagaggtgg caccggtttg attgcacagc 360 taaagagcag aaactcatct tccgtttcca catcttgctg gcagccgggg taccgcaggc 420 aagtcgctaa agagcagaaa ctcatcttcc gtttccacat ctatggaatc acggcggccg 480 cccgtatctc atcctaaaga gcagaaactc atcttccgtt tccacatcta atgcgtcggc 540 gtgcggccgc tgccagttgc ataaagagca gaaactcatc ttccgtttcc acatctttgg 600 ctgtcctagg gtacctggac gctccttgta aagagcagaa actcatcttc cgtttccaca 660 tcttggctgg tgccgcaccg gtaagcagag gatcttagag ggtatataat ggaagctcga 720 cttccagctc gagggcaatc cggtactgtt ggtaaagcca ccgaattcgc caccatgcac 780 caccaccacc accacgacta caaagaccat gacggtgatt ataaagatca tgacatcgat 840 tacaaggatg acgatgacaa gatggccccc aagaagaaga ggaaggtggg cattcaccgc 900 ggggtaccta tggtggactt gaggacactc ggttattcgc aacagcaaca ggagaaaatc 960 aagcctaagg tcaggagcac cgtcgcgcaa caccacgagg cgcttgtggg gcatggcttc 1020 actcatgcgc atattgtcgc gctttcacag caccctgcgg cgcttgggac ggtggctgtc 1080 aaataccaag atatgattgc ggccctgccc gaagccacgc acgaggcaat tgtaggggtc 1140 ggtaaacagt ggtcgggagc gcgagcactt gaggcgctgc tgactgtggc gggtgagctt 1200 agggggcctc cgctccagct cgacaccggg cagctgctga agatcgcgaa gagaggggga 1260 4/65 gtaacagcgg tagaggcagt gcacgcctgg cgcaatgcgc tcaccggggc ccccttgaac 1320 ctgaccccag accaggtagt cgcaatcgcg tcaaacggag ggggaaagca agccctggaa 1380 accgtgcaaa ggttgttgcc ggtcctttgt caagaccacg gccttacacc ggagcaagtc 1440 gtggccattg caagcaatgg gggtggcaaa caggctcttg agacggttca gagacttctc 1500 ccagttctct gtcaagccca cgggctgact cccgatcaag ttgtagcgat tgcgtcgaac 1560 attggaggga aacaagcatt ggagactgtc caacggctcc ttcccgtgtt gtgtcaagcc 1620 cacggtttga cgcctgcaca agtggtcgcc atcgccagcc atgatggcgg taagcaggcg 1680 ctggaaacag tacagcgcct gctgcctgta ctgtgccagg atcatggact gaccccagac 1740 caggtagtcg caatcgcgtc aaacggaggg ggaaagcaag ccctggaaac cgtgcaaagg 1800 ttgttgccgg tcctttgtca agaccacggc cttacaccgg agcaagtcgt ggccattgca 1860 aataataacg gtggcaaaca ggctcttgag acggttcaga gacttctccc agttctctgt 1920 caagcccacg ggctgactcc cgatcaagtt gtagcgattg cgtcgcatga cggagggaaa 1980 caagcattgg agactgtcca acggctcctt cccgtgttgt gtcaagccca cggtttgacg 2040 cctgcacaag tggtcgccat cgcctcgaat ggcggcggta agcaggcgct ggaaacagta 2100 cagcgcctgc tgcctgtact gtgccaggat catggactga ccccagacca ggtagtcgca 2160 atcgcgaaca ataatggggg aaagcaagcc ctggaaaccg tgcaaaggtt gttgccggtc 2220 ctttgtcaag accacggcct tacaccggag caagtcgtgg ccattgcatc ccacgacggt 2280 ggcaaacagg ctcttgagac ggttcagaga cttctcccag ttctctgtca agcccacggg 2340 ctgactcccg atcaagttgt agcgattgcg tccaacggtg gagggaaaca agcattggag 2400 actgtccaac ggctccttcc cgtgttgtgt caagcccacg gtttgacgcc tgcacaagtg 2460 gtcgccatcg ccagccatga tggcggtaag caggcgctgg aaacagtaca gcgcctgctg 2520 cctgtactgt gccaggatca tggactgacc ccagaccagg tagtcgcaat cgcgtcacat 2580 gacgggggaa agcaagccct ggaaaccgtg caaaggttgt tgccggtcct ttgtcaagac 2640 cacggcctta caccggagca agtcgtggcc attgcatccc acgacggtgg caaacaggct 2700 cttgagacgg ttcagagact tctcccagtt ctctgtcaag cccacgggct gactcccgat 2760 caagttgtag cgattgcgaa taacaatgga gggaaacaag cattggagac tgtccaacgg 2820 ctccttcccg tgttgtgtca agcccacggt ttgacgcctg cacaagtggt cgccatcgcc 2880 agccatgatg gcggtaagca ggcgctggaa acagtacagc gcctgctgcc tgtactgtgc 2940 caggatcatg gactgacacc cgaacaggtg gtcgccattg cttctaatgg gggaggacgg 3000 ccagccttgg agtccatcgt agcccaattg tccaggcccg atcccgcgtt ggctgcgtta 3060 acgaatgacc atctggtggc gttggcatgt cttggtggac gacccgcgct cgatgcagtc 3120 aaaaagggtc tgcctcatgc tcccgcattg atcaaaagaa ccaaccggcg gattcccgag 3180 agaacttccc atcgagtcgc gggatccgat ccaaaaaaga agagaaaggt accaaaaaag 3240 aagagaaagg tcgacggcgg tggtgctttg tctcctcagc actctgctgt cactcaagga 3300 agtatcatca agaacaagga gggcatggat gctaagtcac taactgcctg gtcccggaca 3360 ctggtgacct tcaaggatgt atttgtggac ttcaccaggg aggagtggaa gctgctggac 3420 actgctcagc agatcgtgta cagaaatgtg atgctggaga actataagaa cctggtttcc 3480 ttgggttatc agcttactaa gccagatgtg atcctccggt tggagaaggg agaagagccc 3540 tggctggtgg agagagaaat tcaccaagag acccatcctg attcagagac tgcatttgaa 3600 atcaaatcat cagttagagc cgaggggaga ggaagtcttc tgacctgcgg agacgtcgaa 3660 gagaatcctg gacccgacta caaagaccat gacggtgatt ataaagatca tgacatcgat 3720 tacaaggatg acgatgacaa gatggccccc aagaagaaga ggaaggtggg cattcaccgc 3780 ggggtaccta tggtggactt gaggacactc ggttattcgc aacagcaaca ggagaaaatc 3840 aagcctaagg tcaggagcac cgtcgcgcaa caccacgagg cgcttgtggg gcatggcttc 3900 actcatgcgc atattgtcgc gctttcacag caccctgcgg cgcttgggac ggtggctgtc 3960 aaataccaag atatgattgc ggccctgccc gaagccacgc acgaggcaat tgtaggggtc 4020 ggtaaacagt ggtcgggagc gcgagcactt gaggcgctgc tgactgtggc gggtgagctt 4080 agggggcctc cgctccagct cgacaccggg cagctgctga agatcgcgaa gagaggggga 4140 gtaacagcgg tagaggcagt gcacgcctgg cgcaatgcgc tcaccggggc ccccttgaac 4200 ctgaccccag accaggtagt cgcaatcgcg tcacatgacg ggggaaagca agccctggaa 4260 accgtgcaaa ggttgttgcc ggtcctttgt caagaccacg gccttacacc ggagcaagtc 4320 gtggccattg caagcaatgg gggtggcaaa caggctcttg agacggttca gagacttctc 4380 ccagttctct gtcaagccca cgggctgact cccgatcaag ttgtagcgat tgcgtccaac 4440 ggtggaggga aacaagcatt ggagactgtc caacggctcc ttcccgtgtt gtgtcaagcc 4500 cacggtttga cgcctgcaca agtggtcgcc atcgccagcc atgatggcgg taagcaggcg 4560 ctggaaacag tacagcgcct gctgcctgta ctgtgccagg atcatggact gaccccagac 4620 caggtagtcg caatcgcgtc acatgacggg ggaaagcaag ccctggaaac cgtgcaaagg 4680 ttgttgccgg tcctttgtca agaccacggc cttacaccgg agcaagtcgt ggccattgca 4740 aataataacg gtggcaaaca ggctcttgag acggttcaga gacttctccc agttctctgt 4800 caagcccacg ggctgactcc cgatcaagtt gtagcgattg cgtccaacgg tggagggaaa 4860 caagcattgg agactgtcca acggctcctt cccgtgttgt gtcaagccca cggtttgacg 4920 cctgcacaag tggtcgccat cgcctcgaat ggcggcggta agcaggcgct ggaaacagta 4980 cagcgcctgc tgcctgtact gtgccaggat catggactga ccccagacca ggtagtcgca 5040 atcgcgtcaa acggaggggg aaagcaagcc ctggaaaccg tgcaaaggtt gttgccggtc 5100 ctttgtcaag accacggcct tacaccggag caagtcgtgg ccattgcatc ccacgacggt 5160 ggcaaacagg ctcttgagac ggttcagaga cttctcccag ttctctgtca agcccacggg 5220 ctgactcccg atcaagttgt agcgattgcg tcgcatgacg gagggaaaca agcattggag 5280 actgtccaac ggctccttcc cgtgttgtgt caagcccacg gtttgacgcc tgcacaagtg 5340 5/65 gtcgccatcg cctccaatat tggcggtaag caggcgctgg aaacagtaca gcgcctgctg 5400 cctgtactgt gccaggatca tggactgacc ccagaccagg tagtcgcaat cgcgtcacat 5460 gacgggggaa agcaagccct ggaaaccgtg caaaggttgt tgccggtcct ttgtcaagac 5520 cacggcctta caccggagca agtcgtggcc attgcaagca acatcggtgg caaacaggct 5580 cttgagacgg ttcagagact tctcccagtt ctctgtcaag cccacgggct gactcccgat 5640 caagttgtag cgattgcgtc caacggtgga gggaaacaag cattggagac tgtccaacgg 5700 ctccttcccg tgttgtgtca agcccacggt ttgacgcctg cacaagtggt cgccatcgcc 5760 agccatgatg gcggtaagca ggcgctggaa acagtacagc gcctgctgcc tgtactgtgc 5820 caggatcatg gactgacacc cgaacaggtg gtcgccattg cttctaatgg gggaggacgg 5880 ccagccttgg agtccatcgt agcccaattg tccaggcccg atcccgcgtt ggctgcgtta 5940 acgaatgacc atctggtggc gttggcatgt cttggtggac gacccgcgct cgatgcagtc 6000 aaaaagggtc tgcctcatgc tcccgcattg atcaaaagaa ccaaccggcg gattcccgag 6060 agaacttccc atcgagtcgc gggatccgat ccaaaaaaga agagaaaggt agcccccccg 6120 accgatgtca gcctggggga cgagctccac ttagacggcg aggacgtggc gatggcgcat 6180 gccgacgcgc tagacgattt cgatctggac atgttggggg acggggattc cccgggtccg 6240 ggatttaccc cccacgactc cgccccctac ggcgctctgg atatggccga cttcgagttt 6300 gagcagatgt ttaccgatgc ccttggaatt gacgagtacg gtgggagagc cgaggggaga 6360 ggaagtcttc tgacctgcgg agacgtcgaa gagaatcctg gacccatggt gagcaagggc 6420 gaggagctgt tcaccggggt ggtgcccatc ctggtcgagc tggacggcga cgtaaacggc 6480 cacaagttca gcgtgtccgg cgagggcgag ggcgatgcca cctacggcaa gctgaccctg 6540 aagttcatct gcaccaccgg caagctgccc gtgccctggc ccaccctcgt gaccaccttc 6600 ggctacggcc tgatgtgctt cgcccgctac cccgaccaca tgaagcagca cgacttcttc 6660 aagtccgcca tgcccgaagg ctacgtccag gagcgcacca tcttcttcaa ggacgacggc 6720 aactacaaga cccgcgccga ggtgaagttc gagggcgaca■■ ccctggtgaa ccgcatcgag 6780 ctgaagggca tcgacttcaa ggaggacggc aacatcctgg ggcacaagct ggagtacaac 6840 tacaacagcc acaacgtcta tatcatggcc gacaagcaga agaacggcat caaggtgaac 6900 ttcaagatcc gccacaacat cgaggacggc agcgtgcagc tcgccgacca ctaccagcag 6960 aacaccccca tcggcgacgg ccccgtgctg ctgcccgaca accactacct gagctaccag 7020 tccgccctga gcaaagaccc caacgagaag cgcgatcaca tggtcctgct ggagttcgtg 7080 accgccgccg ggatcactct cggcatggac gagctgtaca agtaa 7125 <210> 4 <211> 3615 <212> DNA <213> Unknown <220> <223> Operon composed of operator DNA-binding elements, minimal or constitutive promoter, structural genes <400> 4 ■ tttactgctg ctcccgctta aagagcagaa actcatgctg gcagccgggg taccgcaggc 60 aagtcgcttt actgctgctc ccgcttaaag agcagaaact caatggaatc acggcggccg 120 cccgtatctc atcctttact gctgctcccg cttaaagagc agaaactcaa atgcgtcggc 180 gtgcggccgc tgccagttgc atttactgct gctcccgctt aaagagcaga aactcattgg 240 ctgtcctagg gtacctggac gctccttgtt tactgctgct cccgcttaaa gagcagaaac 300 tcatggctgg tgccgcaccg gtaagcagaa ctagaggtgg caccggtttg attgcacagc 360 tttactgctg ctcccgctta aagagcagaa actcatgctg gcagccgggg taccgcaggc 420 aagtcgcttt actgctgctc ccgcttaaag agcagaaact caatggaatc acggcggccg 480 cccgtatctc atcctttact gctgctcccg cttaaagagc agaaactcaa atgcgtcggc 540 gtgcggccgc tgccagttgc atttactgct gctcccgctt aaagagcaga aactcattgg 600 ctgtcctagg gtacctggac gctccttgtt tactgctgct cccgcttaaa gagcagaaac 660 tcatggctgg tgccgcaccg gtaagcagag gatcttagag ggtatataat ggaagctcga 720 cttccagctc gagggcaatc cggtactgtt ggtaaagcca ccgaattcgc caccatgcac 780 caccaccacc accacgacta caaagaccat gacggtgatt ataaagatca tgacatcgat 840 tacaaggatg acgatgacaa gatggccccc aagaagaaga ggaaggtggg cattcaccgc 900 ggggtaccta tggtggactt gaggacactc ggttattcgc aacagcaaca ggagaaaatc 960 aagcctaagg tcaggagcac cgtcgcgcaa caccacgagg cgcttgtggg gcatggcttc 1020 actcatgcgc atattgtcgc gctttcacag caccctgcgg cgcttgggac ggtggctgtc 1080 aaataccaag atatgattgc ggccctgccc gaagccacgc acgaggcaat tgtaggggtc 1140 ggtaaacagt ggtcgggagc gcgagcactt gaggcgctgc tgactgtggc gggtgagctt 1200 agggggcctc cgctccagct cgacaccggg cagctgctga agatcgcgaa gagaggggga 1260 gtaacagcgg tagaggcagt gcacgcctgg cgcaatgcgc tcaccggggc ccccttgaac 1320 ctgaccccag accaggtagt cgcaatcgcg tcacatgacg ggggaaagca agccctggaa 1380 accgtgcaaa ggttgttgcc ggtcctttgt caagaccacg gccttacacc ggagcaagtc 1440 gtggccattg caagcaatgg gggtggcaaa caggctcttg agacggttca gagacttctc 1500 ccagttctct gtcaagccca cgggctgact cccgatcaag ttgtagcgat tgcgtccaac 1560 ggtggaggga aacaagcatt ggagactgtc caacggctcc ttcccgtgtt gtgtcaagcc 1620 cacggtttga cgcctgcaca agtggtcgcc atcgccagcc atgatggcgg taagcaggcg 1680 6/65 ctggaaacag tacagcgcct gctgcctgta ctgtgccagg atcatggact gaccccagac 1740 caggtagtcg caatcgcgtc acatgacggg ggaaagcaag ccctggaaac cgtgcaaagg 1800 ttgttgccgg tcctttgtca agaccacggc cttacaccgg agcaagtcgt ggccattgca 1860 aataataacg gtggcaaaca ggctcttgag acggttcaga gacttctccc agttctctgt 1920 caagcccacg ggctgactcc cgatcaagtt gtagcgattg cgtccaacgg tggagggaaa 1980 caagcattgg agactgtcca acggctcctt cccgtgttgt gtcaagccca cggtttgacg 2040 cctgcacaag tggtcgccat cgcctcgaat ggcggcggta agcaggcgct ggaaacagta 2100 cagcgcctgc tgcctgtact gtgccaggat catggactga ccccagacca ggtagtcgca 2160 atcgcgtcaa acggaggggg aaagcaagcc ctggaaaccg tgcaaaggtt gttgccggtc 2220 ctttgtcaag accacggcct tacaccggag caagtcgtgg ccattgcatc ccacgacggt 2280 ggcaaacagg ctcttgagac ggttcagaga cttctcccag ttctctgtca agcccacggg 2340 ctgactcccg atcaagttgt agcgattgcg tcgcatgacg gagggaaaca agcattggag 2400 actgtccaac ggctccttcc cgtgttgtgt caagcccacg gtttgacgcc tgcacaagtg 2460 gtcgccatcg cctccaatat tggcggtaag caggcgctgg aaacagtaca gcgcctgctg 2520 cctgtactgt gccaggatca tggactgacc ccagaccagg tagtcgcaat cgcgtcacat 2580 gacgggggaa agcaagccct ggaaaccgtg caaaggttgt tgccggtcct ttgtcaagac 2640 cacggcctta caccggagca agtcgtggcc attgcaagca acatcggtgg caaacaggct 2700 cttgagacgg ttcagagact tctcccagtt ctctgtcaag cccacgggct gactcccgat 2760 caagttgtag cgattgcgtc caacggtgga gggaaacaag cattggagac tgtccaacgg 2820 ctccttcccg tgttgtgtca agcccacggt ttgacgcctg cacaagtggt cgccatcgcc 2880 agccatgatg gcggtaagca ggcgctggaa acagtacagc gcctgctgcc tgtactgtgc 2940 caggatcatg gactgacacc cgaacaggtg gtcgccattg cttctaatgg gggaggacgg 3000 ccagccttgg agtccatcgt agcccaattg tccaggcccg atcccgcgtt ggctgcgtta 3060 acgaatgacc atctggtggc gttggcatgt cttggtggac gacccgcgct cgatgcagtc 3120 aaaaagggtc tgcctcatgc tcccgcattg atcaaaagaa ccaaccggcg gattcccgag 3180 agaacttccc atcgagtcgc gggatccgat ccaaaaaaga agagaaaggt accaaaaaag 3240 aagagaaagg tcgacggcgg tggtgctttg tctcctcagc actctgctgt cactcaagga 3300 agtatcatca agaacaagga gggcatggat gctaagtcac taactgcctg gtcccggaca 3360 ctggtgacct tcaaggatgt atttgtggac ttcaccaggg aggagtggaa gctgctggac 3420 actgctcagc agatcgtgta cagaaatgtg atgctggaga actataagaa cctggtttcc 3480 ttgggttatc agcttactaa gccagatgtg atcctccggt tggagaaggg agaagagccc 3540 tggctggtgg agagagaaat tcaccaagag acccatcctg attcagagac tgcatttgaa 3600 atcaaatcat cagtt 3615 <210> 5 <211> 4224 <212> DNA <213> Unknown <220> <223> Operon composed of operator DNA-binding elements, minimal or constitutive promoter, structural genes <400> 5 tttactgctg ctcccgctta aagagcagaa actcatgctg gcagccgggg taccgcaggc 60 aagtcgcttt actgctgctc ccgcttaaag agcagaaact caatggaatc acggcggccg 120 cccgtatctc atcctttact gctgctcccg cttaaagagc agaaactcaa atgcgtcggc 180 gtgcggccgc tgccagttgc atttactgct gctcccgctt aaagagcaga aactcattgg 240 ctgtcctagg gtacctggac gctccttgtt tactgctgct cccgcttaaa gagcagaaac 300 tcatggctgg tgccgcaccg gtaagcagaa ctagaggtgg caccggtttg attgcacagc 360 tttactgctg ctcccgctta aagagcagaa actcatgctg gcagccgggg taccgcaggc 420 aagtcgcttt actgctgctc ccgcttaaag agcagaaact caatggaatc acggcggccg 480 cccgtatctc atcctttact gctgctcccg cttaaagagc agaaactcaa atgcgtcggc 540 gtgcggccgc tgccagttgc atttactgct gctcccgctt aaagagcaga aactcattgg 600 ctgtcctagg gtacctggac gctccttgtt tactgctgct cccgcttaaa gagcagaaac 660 tcatggctgg tgccgcaccg gtaagcagag gatcttagag ggtatataat ggaagctcga 720 cttccagctc gagggcaatc cggtactgtt ggtaaagcca ccgaattcgc caccatgcac 780 caccaccacc accacgacta caaagaccat gacggtgatt ataaagatca tgacatcgat 840 tacaaggatg acgatgacaa gatggccccc aagaagaaga ggaaggtggg cattcaccgc 900 ggggtaccta tggtggactt gaggacactc ggttattcgc aacagcaaca ggagaaaatc 960 aagcctaagg tcaggagcac cgtcgcgcaa caccacgagg cgcttgtggg gcatggcttc 1020 actcatgcgc atattgtcgc gctttcacag caccctgcgg cgcttgggac ggtggctgtc 1080 aaataccaag atatgattgc ggccctgccc gaagccacgc acgaggcaat tgtaggggtc 1140 ggtaaacagt ggtcgggagc gcgagcactt gaggcgctgc tgactgtggc gggtgagctt 1200 agggggcctc cgctccagct cgacaccggg cagctgctga agatcgcgaa gagaggggga 1260 gtaacagcgg tagaggcagt gcacgcctgg cgcaatgcgc tcaccggggc ccccttgaac 1320 ctgaccccag accaggtagt cgcaatcgcg tcaaacggag ggggaaagca agccctggaa 1380 accgtgcaaa ggttgttgcc ggtcctttgt caagaccacg gccttacacc ggagcaagtc 1440 gtggccattg caagcaatgg gggtggcaaa caggctcttg agacggttca gagacttctc 1500 7/65 ccagttctct gtcaagccca cgggctgact cccgatcaag ttgtagcgat tgcgtcgaac 1560 attggaggga aacaagcatt ggagactgtc caacggctcc ttcccgtgtt gtgtcaagcc 1620 cacggtttga cgcctgcaca agtggtcgcc atcgccagcc atgatggcgg taagcaggcg 1680 ctggaaacag tacagcgcct gctgcctgta ctgtgccagg atcatggact gaccccagac 1740 caggtagtcg caatcgcgtc aaacggaggg ggaaagcaag ccctggaaac cgtgcaaagg 1800 ttgttgccgg tcctttgtca agaccacggc cttacaccgg agcaagtcgt ggccattgca 1860 aataataacg gtggcaaaca ggctcttgag acggttcaga gacttctccc agttctctgt 1920 caagcccacg ggctgactcc cgatcaagtt gtagcgattg cgtcgcatga cggagggaaa 1980 caagcattgg agactgtcca acggctcctt cccgtgttgt gtcaagccca cggtttgacg 2040 cctgcacaag tggtcgccat cgcctcgaat ggcggcggta agcaggcgct ggaaacagta 2100 cagcgcctgc tgcctgtact gtgccaggat catggactga ccccagacca ggtagtcgca 2160 atcgcgaaca ataatggggg aaagcaagcc ctggaaaccg tgcaaaggtt gttgccggtc 2220 ctttgtcaag accacggcct tacaccggag caagtcgtgg ccattgcatc ccacgacggt 2280 ggcaaacagg ctcttgagac ggttcagaga cttctcccag ttctctgtca agcccacggg 2340 ctgactcccg atcaagttgt agcgattgcg tccaacggtg gagggaaaca agcattggag 2400 actgtccaac ggctccttcc cgtgttgtgt caagcccacg gtttgacgcc tgcacaagtg 2460 gtcgccatcg ccagccatga tggcggtaag caggcgctgg aaacagtaca gcgcctgctg 2520 cctgtactgt gccaggatca tggactgacc ccagaccagg tagtcgcaat cgcgtcacat 2580 gacgggggaa agcaagccct ggaaaccgtg caaaggttgt tgccggtcct ttgtcaagac 2640 cacggcctta caccggagca agtcgtggcc attgcatccc acgacggtgg caaacaggct 2700 cttgagacgg ttcagagact tctcccagtt ctctgtcaag cccacgggct gactcccgat 2760 caagttgtag cgattgcgaa taacaatgga gggaaacaag cattggagac tgtccaacgg 2820 ctccttcccg tgttgtgtca agcccacggt ttgacgcctg cacaagtggt cgccatcgcc 2880 agccatgatg gcggtaagca ggcgctggaa acagtacagc gcctgctgcc tgtactgtgc 2940 caggatcatg gactgacacc cgaacaggtg gtcgccattg cttctaatgg gggaggacgg 3000 ccagccttgg agtccatcgt agcccaattg tccaggcccg atcccgcgtt ggctgcgtta 3060 acgaatgacc atctggtggc gttggcatgt cttggtggac gacccgcgct cgatgcagtc 3120 aaaaagggtc tgcctcatgc tcccgcattg atcaaaagaa ccaaccggcg gattcccgag 3180 agaacttccc atcgagtcgc gggatccgat ccaaaaaaga agagaaaggt agcccccccg 3240 accgatgtca gcctggggga cgagctccac ttagacggcg aggacgtggc gatggcgcat 3300 gccgacgcgc tagacgattt cgatctggac atgttggggg acggggattc cccgggtccg 3360 ggatttaccc cccacgactc cgccccctac ggcgctctgg atatggccga cttcgagttt 3420 gagcagatgt ttaccgatgc ccttggaatt gacgagtacg gtgggagagc cgaggggaga 3480 ggaagtcttc tgacctgcgg agacgtcgaa gagaatcctg gacccatgag cgagctgatt 3540 aaggagaaca tgcacatgaa gctgtacatg gagggcaccg tggacaacca tcacttcaag 3600 tgcacatccg agggcgaagg caagccctac gagggcaccc agaccatgag aatcaaggtg 3660 gtcgagggcg gccctctccc cttcgccttc gacatcctgg ctactagctt cctctacggc 3720 agcaagacct tcatcaacca cacccagggc atccccgact tcttcaagca gtccttccct 3780 gagggcttca catgggagag agtcaccaca tacgaagacg ggggcgtgct gaccgctacc 3840 caggacacca gcctccagga cggctgcctc atctacaacg tcaagatcag aggggtgaac 3900 ttcacatcca acggccctgt gatgcagaag aaaacactcg gctgggaggc cttcaccgag 3960 acgctgtacc ccgctgacgg cggcctggaa ggcagaaacg acatggccct gaagctcgtg 4020 ggcgggagcc atctgatcgc aaacatcaag accacatata gatccaagaa acccgctaag 4080 aacctcaaga tgcctggcgt ctactatgtg gactacagac tggaaagaat caaggaggcc 4140 aacaacgaga cctacgtcga gcagcacgag gtggcagtgg ccagatactg cgacctccct 4200 agcaaactgg ggcacaagct caat 4224 <210> 6 <211> 7104 <212> DNA <213> Unknown <220> <223> Operon composed of operator DNA-binding elements, minimal or constitutive promoter, structural genes <400> 6 tttactgctg ctcccgctta aagagcagaa actcatgctg gcagccgggg taccgcaggc 60 aagtcgcttt actgctgctc ccgcttaaag agcagaaact caatggaatc acggcggccg 120 cccgtatctc atcctttact gctgctcccg cttaaagagc agaaactcaa atgcgtcggc 180 gtgcggccgc tgccagttgc atttactgct gctcccgctt aaagagcaga aactcattgg 240 ctgtcctagg gtacctggac gctccttgtt tactgctgct cccgcttaaa gagcagaaac 300 tcatggctgg tgccgcaccg gtaagcagaa ctagaggtgg caccggtttg attgcacagc 360 tttactgctg ctcccgctta aagagcagaa actcatgctg gcagccgggg taccgcaggc 420 aagtcgcttt actgctgctc ccgcttaaag agcagaaact caatggaatc acggcggccg 480 cccgtatctc atcctttact gctgctcccg cttaaagagc agaaactcaa atgcgtcggc 540 gtgcggccgc tgccagttgc atttactgct gctcccgctt aaagagcaga aactcattgg 600 ctgtcctagg gtacctggac gctccttgtt tactgctgct cccgcttaaa gagcagaaac 660 tcatggctgg tgccgcaccg gtaagcagag gatcttagag ggtatataat ggaagctcga 720 8/65 cttccagctc gagggcaatc cggtactgtt ggtaaagcca ccgaattcgc caccatgcac 780 caccaccacc accacgacta caaagaccat gacggtgatt ataaagatca tgacatcgat 840 tacaaggatg acgatgacaa gatggccccc aagaagaaga ggaaggtggg cattcaccgc 900 ggggtaccta tggtggactt gaggacactc ggttattcgc aacagcaaca ggagaaaatc 960 aagcctaagg tcaggagcac cgtcgcgcaa caccacgagg cgcttgtggg gcatggcttc 1020 actcatgcgc atattgtcgc gctttcacag caccctgcgg cgcttgggac ggtggctgtc 1080 aaataccaag atatgattgc ggccctgccc gaagccacgc acgaggcaat tgtaggggtc 1140 ggtaaacagt ggtcgggagc gcgagcactt gaggcgctgc tgactgtggc gggtgagctt 1200 agggggcctc cgctccagct cgacaccggg cagctgctga agatcgcgaa gagaggggga 1260 gtaacagcgg tagaggcagt gcacgcctgg cgcaatgcgc tcaccggggc ccccttgaac 1320 ctgaccccag accaggtagt cgcaatcgcg tcacatgacg ggggaaagca agccctggaa 1380 accgtgcaaa ggttgttgcc ggtcctttgt caagaccacg gccttacacc ggagcaagtc 1440 gtggccattg caagcaatgg gggtggcaaa caggctcttg agacggttca gagacttctc 1500 ccagttctct gtcaagccca cgggctgact cccgatcaag ttgtagcgat tgcgtccaac 1560 ggtggaggga aacaagcatt ggagactgtc caacggctcc ttcccgtgtt gtgtcaagcc 1620 cacggtttga cgcctgcaca agtggtcgcc atcgccagcc atgatggcgg taagcaggcg 1680 ctggaaacag tacagcgcct gctgcctgta ctgtgccagg atcatggact gaccccagac 1740 caggtagtcg caatcgcgtc acatgacggg ggaaagcaag ccctggaaac cgtgcaaagg 1800 ttgttgccgg tcctttgtca agaccacggc cttacaccgg agcaagtcgt ggccattgca 1860 aataataacg gtggcaaaca ggctcttgag acggttcaga gacttctccc agttctctgt 1920 caagcccacg ggctgactcc cgatcaagtt gtagcgattg cgtccaacgg tggagggaaa 1980 caagcattgg agactgtcca acggctcctt cccgtgttgt gtcaagccca cggtttgacg 2040 cctgcacaag tggtcgccat cgcctcgaat ggcggcggta agcaggcgct ggaaacagta 2100 cagcgcctgc tgcctgtact gtgccaggat catggactga ccccagacca ggtagtcgca 2160 atcgcgtcaa acggaggggg aaagcaagcc ctggaaaccg tgcaaaggtt gttgccggtc 2220 ctttgtcaag accacggcct tacaccggag caagtcgtgg ccattgcatc ccacgacggt 2280 ggcaaacagg ctcttgagac ggttcagaga cttctcccag ttctctgtca agcccacggg 2340 ctgactcccg atcaagttgt agcgattgcg tcgcatgacg gagggaaaca agcattggag 2400 actgtccaac ggctccttcc cgtgttgtgt caagcccacg gtttgacgcc tgcacaagtg 2460 gtcgccatcg cctccaatat tggcggtaag caggcgctgg aaacagtaca gcgcctgctg 2520 cctgtactgt gccaggatca tggactgacc ccagaccagg tagtcgcaat cgcgtcacat 2580 gacgggggaa agcaagccct ggaaaccgtg caaaggttgt tgccggtcct ttgtcaagac 2640 cacggcctta caccggagca agtcgtggcc attgcaagca acatcggtgg caaacaggct 2700 cttgagacgg ttcagagact tctcccagtt ctctgtcaag cccacgggct gactcccgat 2760 caagttgtag cgattgcgtc caacggtgga gggaaacaag cattggagac tgtccaacgg 2820 ctccttcccg tgttgtgtca agcccacggt ttgacgcctg cacaagtggt cgccatcgcc 2880 agccatgatg gcggtaagca ggcgctggaa acagtacagc gcctgctgcc tgtactgtgc 2940 caggatcatg gactgacacc cgaacaggtg gtcgccattg cttctaatgg gggaggacgg 3000 ccagccttgg agtccatcgt agcccaattg tccaggcccg atcccgcgtt ggctgcgtta 3060 acgaatgacc atctggtggc gttggcatgt cttggtggac gacccgcgct cgatgcagtc 3120 aaaaagggtc tgcctcatgc tcccgcattg atcaaaagaa ccaaccggcg gattcccgag 3180 agaacttccc atcgagtcgc gggatccgat ccaaaaaaga agagaaaggt accaaaaaag 3240 aagagaaagg tcgacggcgg tggtgctttg tctcctcagc actctgctgt cactcaagga 3300 agtatcatca agaacaagga gggcatggat gctaagtcac taactgcctg gtcccggaca 3360 ctggtgacct tcaaggatgt atttgtggac ttcaccaggg aggagtggaa gctgctggac 3420 actgctcagc agatcgtgta cagaaatgtg atgctggaga actataagaa cctggtttcc 3480 ttgggttatc agcttactaa gccagatgtg atcctccggt tggagaaggg agaagagccc 3540 tggctggtgg agagagaaat tcaccaagag acccatcctg attcagagac tgcatttgaa 3600 atcaaatcat cagttagagc cgaggggaga ggaagtcttc tgacctgcgg agacgtcgaa 3660 gagaatcctg gacccgacta caaagaccat gacggtgatt ataaagatca tgacatcgat 3720 tacaaggatg acgatgacaa gatggccccc aagaagaaga ggaaggtggg cattcaccgc 3780 ggggtaccta tggtggactt gaggacactc ggttattcgc aacagcaaca ggagaaaatc 3840 aagcctaagg tcaggagcac cgtcgcgcaa caccacgagg cgcttgtggg gcatggcttc 3900 actcatgcgc atattgtcgc gctttcacag caccctgcgg cgcttgggac ggtggctgtc 3960 aaataccaag atatgattgc ggccctgccc gaagccacgc acgaggcaat tgtaggggtc 4020 ggtaaacagt ggtcgggagc gcgagcactt gaggcgctgc tgactgtggc gggtgagctt 4080 agggggcctc cgctccagct cgacaccggg cagctgctga agatcgcgaa gagaggggga 4140 gtaacagcgg tagaggcagt gcacgcctgg cgcaatgcgc tcaccggggc ccccttgaac 4200 ctgaccccag accaggtagt cgcaatcgcg tcaaacggag ggggaaagca agccctggaa 4260 accgtgcaaa ggttgttgcc ggtcctttgt caagaccacg gccttacacc ggagcaagtc 4320 gtggccattg caagcaatgg gggtggcaaa caggctcttg agacggttca gagacttctc 4380 ccagttctct gtcaagccca cgggctgact cccgatcaag ttgtagcgat tgcgtcgaac 4440 attggaggga aacaagcatt ggagactgtc caacggctcc ttcccgtgtt gtgtcaagcc 4500 cacggtttga cgcctgcaca agtggtcgcc atcgccagcc atgatggcgg taagcaggcg 4560 ctggaaacag tacagcgcct gctgcctgta ctgtgccagg atcatggact gaccccagac 4620 caggtagtcg caatcgcgtc aaacggaggg ggaaagcaag ccctggaaac cgtgcaaagg 4680 ttgttgccgg tcctttgtca agaccacggc cttacaccgg agcaagtcgt ggccattgca 4740 aataataacg- gtggcaaaca ggctcttgag acggttcaga gacttctccc agttctctgt 4800 9/65 caagcccacg ggctgactcc cgatcaagtt gtagcgattg cgtcgcatga cggagggaaa 4860 caagcattgg agactgtcca acggctcctt cccgtgttgt gtcaagccca cggtttgacg 4920 cctgcacaag tggtcgccat cgcctcgaat ggcggcggta agcaggcgct ggaaacagta 4980 cagcgcctgc tgcctgtact gtgccaggat catggactga ccccagacca ggtagtcgca 5040 atcgcgaaca ataatggggg aaagcaagcc ctggaaaccg tgcaaaggtt gttgccggtc 5100 ctttgtcaag accacggcct tacaccggag caagtcgtgg ccattgcatc ccacgacggt 5160 ggcaaacagg ctcttgagac ggttcagaga cttctcccag ttctctgtca agcccacggg 5220 ctgactcccg atcaagttgt agcgattgcg tccaacggtg gagggaaaca agcattggag 5280 actgtccaac ggctccttcc cgtgttgtgt caagcccacg gtttgacgcc tgcacaagtg 5340 gtcgccatcg ccagccatga tggcggtaag caggcgctgg aaacagtaca gcgcctgctg 5400 cctgtactgt gccaggatca tggactgacc ccagaccagg tagtcgcaat cgcgtcacat 5460 gacgggggaa agcaagccct ggaaaccgtg caaaggttgt tgccggtcct ttgtcaagac 5520 cacggcctta caccggagca agtcgtggcc attgcatccc acgacggtgg caaacaggct 5580 cttgagacgg ttcagagact tctcccagtt ctctgtcaag cccacgggct gactcccgat 5640 caagttgtag cgattgcgaa taacaatgga gggaaacaag cattggagac tgtccaacgg 5700 ctccttcccg tgttgtgtca agcccacggt ttgacgcctg cacaagtggt cgccatcgcc 5760 agccatgatg gcggtaagca ggcgctggaa acagtacagc gcctgctgcc tgtactgtgc 5820 caggatcatg gactgacacc cgaacaggtg gtcgccattg cttctaatgg gggaggacgg 5880 ccagccttgg agtccatcgt agcccaattg tccaggcccg atcccgcgtt ggctgcgtta 5940 acgaatgacc atctggtggc gttggcatgt cttggtggac gacccgcgct cgatgcagtc 6000 aaaaagggtc tgcctcatgc tcccgcattg atcaaaagaa ccaaccggcg gattcccgag 6060 agaacttccc atcgagtcgc gggatccgat ccaaaaaaga agagaaaggt agcccccccg 6120 accgatgtca gcctggggga cgagctccac ttagacggcg aggacgtggc gatggcgcat 6180 gccgacgcgc tagacgattt cgatctggac atgttggggg acggggattc cccgggtccg 6240 ggatttaccc cccacgactc cgccccctac ggcgctctgg atatggccga cttcgagttt 6300 gagcagatgt ttaccgatgc ccttggaatt gacgagtacg gtgggagagc cgaggggaga 6360 ggaagtcttc tgacctgcgg agacgtcgaa gagaatcctg gacccatgag cgagctgatt 6420 aaggagaaca tgcacatgaa gctgtacatg gagggcaccg tggacaacca tcacttcaag 6480 tgcacatccg agggcgaagg caagccctac gagggcaccc agaccatgag aatcaaggtg 6540 gtcgagggcg gccctctccc cttcgccttc gacatcctgg ctactagctt cctctacggc 6600 agcaagacct tcatcaacca cacccagggc atccccgact tcttcaagca gtccttccct 6660 gagggcttca catgggagag agtcaccaca tacgaagacg ggggcgtgct gaccgctacc 6720 caggacacca gcctccagga cggctgcctc atctacaacg tcaagatcag aggggtgaac 6780 ttcacatcca acggccctgt gatgcagaag aaaacactcg gctgggaggc cttcaccgag 6840 acgctgtacc ccgctgacgg cggcctggaa ggcagaaacg acatggccct gaagctcgtg 6900 ggcgggagcc atctgatcgc aaacatcaag accacatata gatccaagaa acccgctaag 6960 aacctcaaga tgcctggcgt ctactatgtg gactacagac tggaaagaat caaggaggcc 7020 aacaacgaga cctacgtcga gcagcacgag gtggcagtgg ccagatactg cgacctccct 7080 agcaaactgg ggcacaagct caat 7104 <210> 7 <211> 3645 <212> DNA <213> Unknown <220> <223> Operon composed of operator DNA-binding elements, minimal or constitutive promoter, structural genes <400> 7 acattgatta ttgactagtt attaatagta atcaattacg gggtcattag ttcatagccc 60 atatatggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct gaccgcccaa 120 cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc caatagggac 180 tttccattga cgtcaatggg tggactattt acggtaaact gcccacttgg cagtacatca 240 agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat ggcccgcctg 300 gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca tctacgtatt 360 agtcatcgct attaccatgg tgatgcggtt ttggcagtac atcaatgggc gtggatagcg 420 gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga gtttgttttg 480 gcaccaaaat caacgggact ttccaaaatg tcgtaacaac tccgccccat tgacgcaaat 540 gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctctctggc taactagaga 600 acccactgct tactggctta tcgaaattaa tacgactcac tatagggaga cccaagctta 660 tcgataggta ccctcgagtt taccactccc tatcagtgat agagaaaagt gaaagtcgag 720 tttaccactc cctatcagtg atagagaaaa gtgaaagtcg agtttaccac tccctatcag 780 tgatagagaa aagtgaaagt cgagtttacc actccctatc agtgatagag aaaagtgaaa 840 gtcgagttta ccactcccta tcagtgatag agaaaagtga aagtcgagtt taccactccc 900 tatcagtgat agagaaaagt gaaagtcgag ctcgctagcg aattcgccac catgcaccac 960 caccaccacc acgactacaa agaccatgac ggtgattata aagatcatga catcgattac 1020 aaggatgacg atgacaagat ggcccccaag aagaagagga aggtgggcat tcaccgcggg 1080 gtacctatgg tggacttgag gacactcggt tattcgcaac agcaacagga gaaaatcaag 1140 10/65 cctaaggtca ggagcaccgt cgcgcaacac cacgaggcgc ttgtggggc'a tggcttcact 1200 catgcgcata ttgtcgcgct ttcacagcac cctgcggcgc ttgggacggt ggctgtcaaa 1260 taccaagata tgattgcggc cctgcccgaa gccacgcacg aggcaattgt aggggtcggt 1320 aaacagtggt cgggagcgcg agcacttgag gcgctgctga ctgtggcggg tgagcttagg 1380 gggcctccgc tccagctcga caccgggcag ctgctgaaga tcgcgaagag agggggagta 1440 acagcggtag aggcagtgca cgcctggcgc aatgcgctca ccggggcccc cttgaacctg 1500 accccagacc aggtagtcgc aatcgcgtca catgacgggg gaaagcaagc cctggaaacc 1560 gtgcaaaggt tgttgccggt cctttgtcaa gaccacggcc ttacaccgga gcaagtcgtg 1620 gccattgcaa gcaatggggg tggcaaacag gctcttgaga cggttcagag acttctccca 1680 gttctctgtc aagcccacgg gctgactccc gatcaagttg tagcgattgc gtccaacggt 1740 ggagggaaac aagcattgga gactgtccaa cggctccttc ccgtgttgtg tcaagcccac 1800 ggtttgacgc ctgcacaagt ggtcgccatc gccagccatg atggcggtaa gcaggcgctg 1860 gaaacagtac agcgcctgct gcctgtactg tgccaggatc atggactgac cccagaccag 1920 gtagtcgcaa tcgcgtcaca tgacggggga aagcaagccc tggaaaccgt gcaaaggttg 1980 ttgccggtcc tttgtcaaga ccacggcctt acaccggagc aagtcgtggc cattgcaaat 2040 aataacggtg gcaaacaggc tcttgagacg gttcagagac ttctcccagt tctctgtcaa 2100 gcccacgggc tgactcccga tcaagttgta gcgattgcgt ccaacggtgg agggaaacaa 2160 gcattggaga ctgtccaacg gctccttccc gtgttgtgtc aagcccacgg tttgacgcct 2220 gcacaagtgg tcgccatcgc ctcgaatggc ggcggtaagc aggcgctgga aacagtacag 2280 cgcctgctgc ctgtactgtg ccaggatcat ggactgaccc cagaccaggt agtcgcaatc 2340 gcgtcaaacg gagggggaaa gcaagccctg gaaaccgtgc aaaggttgtt gccggtcctt 2400 tgtcaagacc acggccttac accggagcaa gtcgtggcca ttgcatccca cgacggtggc 2460 aaacaggctc ttgagacggt tcagagactt ctcccagttc tctgtcaagc ccacgggctg 2520 actcccgatc aagttgtagc gattgcgtcg catgacggag ggaaacaagc attggagact 2580 gtccaacggc tccttcccgt gttgtgtcaa gcccacggtt tgacgcctgc acaagtggtc 2640 gccatcgcct ccaatattgg cggtaagcag gcgctggaaa cagtacagcg cctgctgcct 2700 gtactgtgcc aggatcatgg actgacccca gaccaggtag tcgcaatcgc gtcacatgac 2760 gggggaaagc aagccctgga aaccgtgcaa aggttgttgc cggtcctttg tcaagaccac 2820 ggccttacac cggagcaagt cgtggccatt gcaagcaaca tcggtggcaa acaggctctt 2880 gagacggttc agagacttct cccagttctc tgtcaagccc acgggctgac tcccgatcaa 2940 gttgtagcga ttgcgtccaa cggtggaggg aaacaagcat tggagactgt ccaacggctc 3000 cttcccgtgt tgtgtcaagc ccacggtttg acgcctgcac aagtggtcgc catcgccagc 3060 catgatggcg gtaagcaggc gctggaaaca gtacagcgcc tgctgcctgt actgtgccag 3120 gatcatggac tgacacccga acaggtggtc gccattgctt ctaatggggg aggacggcca 3180 gccttggagt ccatcgtagc ccaattgtcc aggcccgatc ccgcgttggc tgcgttaacg 3240 aatgaccatc tggtggcgtt ggcatgtctt ggtggacgac ccgcgctcga tgcagtcaaa 3300 aagggtctgc ctcatgctcc cgcattgatc aaaagaacca accggcggat tcccgagaga 3360 acttcccatc gagtcgcggg atccgatcca aaaaagaaga gaaaggtagc acccccgacc 3420 gatgtcagcc tgggggacga gctccactta gacggcgagg acgtggcgat ggcgcatgcc 3480 gacgcgctag acgatttcga tctggacatg ttgggggacg gggattcccc gggtccggga 3540 tttacccccc acgactccgc cccctacggc gctctggata tggccgactt cgagtttgag 3600 cagatgttta ccgatgccct tggaattgac gagtacggtg ggtag 3645 <210> 8 <211> 3792 <212> DNA <213> Unknown <220> <223> Operon composed of operator DNA-binding elements, minimal or constitutive promoter, structural genes <400> 8 acattgatta ttgactagtt attaatagta atcaattacg gggtcattag ttcatagccc 60 atatatggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct gaccgcccaa 120 cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc caatagggac 180 tttccattga cgtcaatggg tggactattt acggtaaact gcccacttgg cagtacatca 240 agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat ggcccgcctg 300 gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca tctacgtatt 360 agtcatcgct attaccatgg tgatgcggtt ttggcagtac atcaatgggc gtggatagcg 420 gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga gtttgttttg 480 gcaccaaaat caacgggact ttccaaaatg tcgtaacaac tccgccccat tgacgcaaat 540 gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctctctggc taactagaga 600 acccactgct tactggctta tcgaaattaa tacgactcac tatagggaga cccaagctta 660 tcgataggta ccctcgagtt taccactccc tatcagtgat agagaaaagt gaaagtcgag 720 tttaccactc cctatcagtg atagagaaaa gtgaaagtcg agtttaccac tccctatcag 780 tgatagagaa aagtgaaagt cgagtttacc actccctatc agtgatagag aaaagtgaaa 840 gtcgagttta ccactcccta tcagtgatag agaaaagtga aagtcgagtt taccactccc 900 tatcagtgat agagaaaagt gaaagtcgag ctcgctagcg aattcgccac catgcaccac 960 11/65 caccaccacc acgactacaa agaccatgac ggtgattata aagatcatga catcgattac 1020 aaggatgacg atgacaagat ggcccccaag aagaagagga aggtgggcat tcaccgcggg 1080 gtacctatgg tggacttgag gacactcggt tattcgcaac agcaacagga gaaaatcaag 1140 cctaaggtca ggagcaccgt cgcgcaacac cacgaggcgc ttgtggggca tggcttcact 1200 catgcgcata ttgtcgcgct ttcacagcac cctgcggcgc ttgggacggt ggctgtcaaa 1260 taccaagata tgattgcggc cctgcccgaa gccacgcacg aggcaattgt aggggtcggt 1320 aaacagtggt cgggagcgcg agcacttgag gcgctgctga ctgtggcggg tgagcttagg 1380 gggcctccgc tccagctcga caccgggcag ctgctgaaga tcgcgaagag agggggagta 1440 acagcggtag aggcagtgca cgcctggcgc aatgcgctca ccggggcccc cttgaacctg 1500 accccagacc aggtagtcgc aatcgcgtca aacggagggg gaaagcaagc cctggaaacc 1560 gtgcaaaggt tgttgccggt cctttgtcaa gaccacggcc ttacaccgga gcaagtcgtg 1620 gccattgcaa gcaatggggg tggcaaacag gctcttgaga cggttcagag acttctccca 1680 gttctctgtc aagcccacgg gctgactccc gatcaagttg tagcgattgc gtcgaacatt 1740 ggagggaaac aagcattgga gactgtccaa cggctccttc ccgtgttgtg tcaagcccac 1800 ggtttgacgc ctgcacaagt ggtcgccatc gccagccatg atggcggtaa gcaggcgctg 1860 gaaacagtac agcgcctgct gcctgtactg tgccaggatc atggactgac cccagaccag 1920 gtagtcgcaa tcgcgtcaaa cggaggggga aagcaagccc tggaaaccgt gcaaaggttg 1980 ttgccggtcc tttgtcaaga ccacggcctt acaccggagc aagtcgtggc cattgcaaat 2040 aataacggtg gcaaacaggc tcttgagacg gttcagagac ttctcccagt tctctgtcaa 2100 gcccacgggc tgactcccga tcaagttgta gcgattgcgt cgcatgacgg agggaaacaa 2160 gcattggaga ctgtccaacg gctccttccc gtgttgtgtc aagcccacgg tttgacgcct 2220 gcacaagtgg tcgccatcgc ctcgaatggc ggcggtaagc aggcgctgga aacagtacag 2280 cgcctgctgc ctgtactgtg ccaggatcat ggactgaccc cagaccaggt agtcgcaatc 2340 gcgaacaata atgggggaaa gcaagccctg gaaaccgtgc aaaggttgtt gccggtcctt 2400 tgtcaagacc acggccttac accggagcaa gtcgtggcca ttgcatccca cgacggtggc 2460 aaacaggctc ttgagacggt tcagagactt ctcccagttc tctgtcaagc ccacgggctg 2520 actcccgatc aagttgtagc gattgcgtcc aacggtggag ggaaacaagc attggagact 2580 gtccaacggc tccttcccgt gttgtgtcaa gcccacggtt tgacgcctgc acaagtggtc 2640 gccatcgcca gccatgatgg cggtaagcag gcgctggaaa cagtacagcg cctgctgcct 2700 gtactgtgcc aggatcatgg actgacccca gaccaggtag tcgcaatcgc gtcacatgac 2760 gggggaaagc aagccctgga aaccgtgcaa aggttgttgc cggtcctttg tcaagaccac 2820 ggccttacac cggagcaagt cgtggccatt gcatcccacg acggtggcaa acaggctctt 2880 gagacggttc agagacttct cccagttctc tgtcaagccc acgggctgac tcccgatcaa 2940 gttgtagcga ttgcgaataa caatggaggg aaacaagcat tggagactgt ccaacggctc 3000 cttcccgtgt tgtgtcaagc ccacggtttg acgcctgcac aagtggtcgc catcgccagc 3060 catgatggcg gtaagcaggc gctggaaaca gtacagcgcc tgctgcctgt actgtgccag 3120 gatcatggac tgacacccga acaggtggtc gccattgctt ctaatggggg aggacggcca 3180 gccttggagt ccatcgtagc ccaattgtcc aggcccgatc ccgcgttggc tgcgttaacg 3240 aatgaccatc tggtggcgtt ggcatgtctt ggtggacgac ccgcgctcga tgcagtcaaa 3300 aagggtctgc ctcatgctcc cgcattgatc aaaagaacca accggcggat tcccgagaga 3360 acttcccatc gagtcgcggg atccgatcca aaaaagaaga gaaaggtacc aaaaaagaag 3420 agaaaggtcg acggcggtgg tgctttgtct cctcagcact ctgctgtcac tcaaggaagt 3480 atcatcaaga acaaggaggg catggatgct aagtcactaa ctgcctggtc ccggacactg 3540 gtgaccttca aggatgtatt tgtggacttc accagggagg agtggaagct gctggacact 3600 gctcagcaga tcgtgtacag aaatgtgatg ctggagaact ataagaacct ggtttccttg 3660 ggttatcagc ttactaagcc agatgtgatc ctccggttgg agaagggaga agagccctgg 3720 ctggtggaga gagaaattca ccaagagacc catcctgatt cagagactgc atttgaaatc 3780 aaatcatcag tt 3792 <210> 9 <211> 3170 <212> DNA <213> Unknown <220> <223> Synthetic construct/combination of sequences <400> 9 gatctcggcc gcatattaag tgcattgttc tcgataccgc taagtgcatt gttctcgtta 60 gctcgatgga caagtgcatt gttctcttgc tgaaagctcg atggacaagt gcattgttct 120 cttgctgaaa gctcgatgga caagtgcatt gttctcttgc tgaaagctca gtacccgggt 180 cggagtactg ccccgcccct agcgattagc cccggccccg catagctccg ccccgggagt 240 accttcgacc tagagggtat ataatggaag ctcgacttcc agctcgaggg caatccggta 300 ctgttggtaa agccaccgaa ttcgccacca tgcaccacca ccaccaccac gactacaaag 360 accatgacgg tgattataaa gatcatgaca tcgattacaa ggatgacgat gacaagatgg 420 cccccaagaa gaagaggaag gtgggcattc accgcggggt acctatggtg gacttgagga 480 cactcggtta ttcgcaacag caacaggaga aaatcaagcc taaggtcagg agcaccgtcg 540 cgcaacacca cgaggcgctt gtggggcatg gcttcactca tgcgcatatt gtcgcgcttt 600 cacagcaccc tgcggcgctt gggacggtgg ctgtcaaata ccaagatatg attgcggccc 660 12/65 tgcccgaagc cacgcacgag gcaattgtag gggtcggtaa acagtggtcg ggagcgcgag 720 cacttgaggc gctgctgact gtggcgggtg agcttagggg gcctccgctc cagctcgaca 780 ccgggcagct gctgaagatc gcgaagagag ggggagtaac agcggtagag gcagtgcacg 840 cctggcgcaa tgcgctcacc ggggccccct tgaacctgac cccagaccag gtagtcgcaa 900 tcgcgtcaca tgacggggga aagcaagccc tggaaaccgt gcaaaggttg ttgccggtcc 960 tttgtcaaga ccacggcctt acaccggagc aagtcgtggc cattgcaagc aatgggggtg 1020 gcaaacaggc tcttgagacg gttcagagac ttctcccagt tctct'gtcaa gcccacgggc 1080 tgactcccga tcaagttgta gcgattgcgt ccaacggtgg agggaaacaa gcattggaga 1140 ctgtccaacg gctccttccc gtgttgtgtc aagcccacgg tttgacgcct gcacaagtgg 1200 tcgccatcgc cagccatgat ggcggtaagc aggcgctgga aacagtacag cgcctgctgc 1260 ctgtactgtg ccaggatcat ggactgaccc cagaccaggt agtcgcaatc gcgtcacatg 1320 acgggggaaa gcaagccctg gaaaccgtgc aaaggttgtt gccggtcctt tg'tcaagacc 1380 acggccttac accggagcaa gtcgtggcca ttgcaaataa taacggtggc aaacaggctc 1440 ttgagacggt tcagagactt ctcccagttc tctgtcaagc ccacgggctg actcccgatc 1500 aagttgtagc gattgcgtcc aacggtggag ggaaacaagc attggagact gtccaacggc 1560 tccttcccgt gttgtgtcaa gcccacggtt tgacgcctgc acaagtggtc gccatcgcct 1620 cgaatggcgg cggtaagcag gcgctggaaa cagtacagcg cctgctgcct gtactgtgcc 1680 aggatcatgg actgacccca gaccaggtag tcgcaatcgc gtcaaacgga gggggaaagc 1740 aagccctgga aaccgtgcaa aggttgttgc cggtcctttg tcaagaccac ggccttacac 1800 cggagcaagt cgtggccatt gcatcccacg acggtggcaa acaggctctt gagacggttc 1860 agagacttct cccagttctc tgtcaagccc acgggctgac tcccgatcaa gttgtagcga 1920 ttgcgtcgca tgacggaggg aaacaagcat tggagactgt ccaacggctc cttcccgtgt 1980 tgtgtcaagc ccacggtttg acgcctgcac aagtggtcgc catcgcctcc aatattggcg 2040 gtaagcaggc gctggaaaca gtacagcgcc tgctgcctgt actgtgccag gatcatggac 2100 tgaccccaga ccaggtagtc gcaatcgcgt cacatgacgg gggaaagcaa gccctggaaa 2160 ccgtgcaaag gttgttgccg gtcctttgtc aagaccacgg ccttacaccg gagcaagtcg 2220 tggccattgc aagcaacatc ggtggcaaac aggctcttga gacggttcag agacttctcc 2280 cagttctctg tcaagcccac gggctgactc ccgatcaagt tgtagcgatt gcgtccaacg 2340 gtggagggaa acaagcattg gagactgtcc aacggctcct tcccgrtgttg tgtcaagccc 2400 acggtttgac gcctgcacaa gtggtcgcca tcgccagcca tgatggcggt aagcaggcgc 2460 tggaaacagt acagcgcctg ctgcctgtac tgtgccagga tcatggactg acacccgaac 2520 aggtggtcgc cattgcttct aatgggggag gacggccagc cttggagtcc atcgtagccc 2580 aattgtccag gcccgatccc gcgttggctg cgttaacgaa tgaccatctg gtggcgttgg 2640 catgtcttgg tggacgaccc gcgctcgatg cagtcaaaaa gggtctgcct catgctcccg 2700 cattgatcaa aagaaccaac cggcggattc ccgagagaac ttcccatcga gtcgcgggat 2760 ccgatccaaa aaagaagaga aaggtaccaa aaaagaagag aaaggtcgac ggcggtggtg 2820 ctttgtctcc tcagcactct gctgtcactc aaggaagtat catcaagaac aaggagggca 2880 tggatgctaa gtcactaact gcctggtccc ggacactggt gaccttcaag gatgtatttg 2940 tggacttcac cagggaggag tggaagctgc tggacactgc tcagcagatc gtgtacagaa 3000 atgtgatgct ggagaactat aagaacctgg tttccttggg ttatcagctt actaagccag 3060 atgtgatcct ccggttggag aagggagaag agccctggct ggtggagaga gaaattcacc 3120 aagagaccca tcctgattca gagactgcat ttgaaatcaa atcatcagtt 3170 <210> 10 <211> 3020 <212> DNA <213> Unknown <220> <223> Operon composed of operator DNA-binding elements, minimal or constitutive promoter, structural genes <400> 10 gatctcggcc gcatattaag tgcattgttc tcgataccgc taagtgcatt gttctcgtta 60 gctcgatgga caagtgcatt gttctcttgc tgaaagctcg atggacaagt gcattgttct 120 cttgctgaaa gctcgatgga caagtgcatt gttctcttgc tgaaagctca gtacccgggt 180 cggagtactg ccccgcccct agcgattagc cccggccccg catagctccg ccccgggagt 240 accttcgacc tagagggtat ataatggaag ctcgacttcc agctcgaggg caatccggta 300 ctgttggtaa agccaccgaa ttcgccacca tgcaccacca ccaccaccac gactacaaag 360 accatgacgg tgattataaa gatcatgaca tcgattacaa ggatgacgat gacaagatgg 420 cccccaagaa gaagaggaag gtgggcattc accgcggggt acctatggtg gacttgagga 480 cactcggtta ttcgcaacag caacaggaga aaatcaagcc taaggtcagg agcaccgtcg 540 cgcaacacca cgaggcgctt gtggggcatg gcttcactca tgcgcatatt gtcgcgcttt 600 cacagcaccc tgcggcgctt gggacggtgg ctgtcaaata ccaagatatg attgcggccc 660 tgcccgaagc cacgcacgag gcaattgtag gggtcggtaa acagtggtcg ggagcgcgag 720 cacttgaggc gctgctgact gtggcgggtg agcttagggg gcctccgctc cagctcgaca 780 ccgggcagct gctgaagatc gcgaagagag ggggagtaac agcggtagag gcagtgcacg 840 cctggcgcaa tgcgctcacc ggggccccct tgaacctgac cccagaccag gtagtcgcaa 900 tcgcgtcaaa cggaggggga aagcaagccc tggaaaccgt gcaaaggttg ttgccggtcc 960 tttgtcaaga ccacggcctt acaccggagc aagtcgtggc cattgcaagc aatgggggtg 1020 gcaaacaggc tcttgagacg gttcagagac ttctcccagt tctctgtcaa gcccacgggc 1080 tgactcccga tcaagttgta gcgattgcgt cgaacattgg agggaaacaa gcattggaga 1140 ctgtccaacg gctccttccc gtgttgtgtc aagcccacgg tttgacgcct gcacaagtgg 1200 tcgccatcgc cagccatgat ggcggtaagc aggcgctgga aacagtacag cgcctgctgc 1260 ctgtactgtg ccaggatcat ggactgaccc cagaccaggt agtcgcaatc gcgtcaaacg 1320 gagggggaaa gcaagccctg gaaaccgtgc aaaggttgtt gccggtcctt tgtcaagacc 1380 acggccttac accggagcaa gtcgtggcca ttgcaaataa taacggtggc aaacaggctc 1440 ttgagacggt tcagagactt ctcccagttc tctgtcaagc ccacgggctg actcccgatc 1500 aagttgtagc gattgcgtcg catgacggag ggaaacaagc attggagact gtccaacggc 1560 tccttcccgt gttgtgtcaa gcccacggtt tgacgcctgc acaagtggtc gccatcgcct 1620 cgaatggcgg cggtaagcag gcgctggaaa cagtacagcg cctgctgcct gtactgtgcc 1680 aggatcatgg actgacccca gaccaggtag tcgcaatcgc gaacaataat gggggaaagc 1740 aagccctgga aaccgtgcaa aggttgttgc cggtcctttg tcaagaccac ggccttacac 1800 cggagcaagt cgtggccatt gcatcccacg acggtggcaa acaggctctt gagacggttc 1860 agagacttct cccagttctc tgtcaagccc acgggctgac tcccgatcaa gttgtagcga 1920 ttgcgtccaa cggtggaggg aaacaagcat tggagactgt ccaacggctc cttcccgtgt 1980 tgtgtcaagc ccacggtttg acgcctgcac aagtggtcgc catcgccagc catgatggcg 2040 gtaagcaggc gctggaaaca gtacagcgcc tgctgcctgt actgtgccag gatcatggac 2100 tgaccccaga ccaggtagtc gcaatcgcgt cacatgacgg gggaaagcaa gccctggaaa 2160 ccgtgcaaag gttgttgccg gtcctttgtc aagaccacgg ccttacaccg gagcaagtcg 2220 tggccattgc atcccacgac ggtggcaaac aggctcttga gacggttcag agacttctcc 2280 cagttctctg tcaagcccac gggctgactc ccgatcaagt tgtagcgatt gcgaataaca 2340 atggagggaa acaagcattg gagactgtcc aacggctcct tcccgtgttg tgtcaagccc 2400 acggtttgac gcctgcacaa gtggtcgcca tcgccagcca tgatggcggt aagcaggcgc 2460 tggaaacagt acagcgcctg ctgcctgtac tgtgccagga tcatggactg acacccgaac 2520 aggtggtcgc cattgcttct aatgggggag gacggccagc cttggagtcc atcgtagccc 2580 aattgtccag gcccgatccc gcgttggctg cgttaacgaa tgaccatctg gtggcgttgg 2640 catgtcttgg tggacgaccc gcgctcgatg cagtcaaaaa gggtctgcct catgctcccg 2700 cattgatcaa aagaaccaac cggcggattc ccgagagaac ttcccatcga gtcgcgggat 2760 ccgatccaaa aaagaagaga aaggtagccc ccccgaccga tgtcagcctg ggggacgagc 2820 tccacttaga cggcgaggac gtggcgatgg cgcatgccga cgcgctagac gatttcgatc 2880 tggacatgtt gggggacggg gattccccgg gtccgggatt taccccccac gactccgccc 2940 cctacggcgc tctggatatg gccgacttcg agtttgagca gatgtttacc gatgcccttg 3000 gaattgacga gtacggtggg 3020 <210> 11 <211> 2670 <212> PRT <213> Unknown <220> <223> Synthetic genetic construct <400> 11 Gly Ala Cys Thr Ala Cys Ala Ala Ala Gly Ala Cys Cys Ala Thr Gly 1 5 10 15 Ala Cys Gly Gly Thr Gly Ala Thr Thr Ala Thr Ala Ala Ala Gly Ala 20 25 30 Thr Cys Ala Thr Gly Ala Cys Ala Thr Cys Gly Ala Thr Thr Ala Cys 35 40 45 Ala Ala Gly Gly Ala Thr Gly Ala Cys Gly Ala Thr Gly Ala Cys Ala 50 55 60 Ala Gly Ala Thr Gly Gly Cys Cys Cys Cys Cys Ala Ala Gly Ala Ala 65 70 75 80 Gly Ala Ala Gly Ala Gly Gly Ala Ala Gly Gly Thr Gly Gly Gly Cys 85 90 95 Ala Thr Thr Cys Ala Cys Cys Gly Cys Gly Gl Gly Gly Thr Ala Cys 100 105 110 Cys Thr Ala Thr Gly Gly Thr Gly Gly Ala Cys Thr Thr Gly Ala Gly 115 120 125 Gly Ala Cys Ala Cys Thr Cys Gly Gly Thr Thr Ala Thr Thr Cys Gly 130 135 140 Cys Ala Ala Cys Ala Gly Cys Ala Ala Cys Ala Gly Gly Ala Gly Ala 145 150 155 160 Ala Ala Ala Thr Cys Ala Ala Gly Cys Cys Thr Ala Ala Gl Gly Thr 165 170 175 Cys Ala Gly Gly Ala Gly Cys Ala Cys Cys Gly Thr Cys Gly Cys Gly 180 185 190 14/65 Cys Ala Ala Cys Ala Cys Cys Ala Cys Gly Ala Gly Gly Cys Gly Cys 195 200 205 Thr Thr Gly Thr Gly Gly Gly Gly Cys Ala Thr Gly Gly Cys Thr Thr 210 215 220 Cys Ala Cys Thr Cys Ala Thr Gly Cys Gly Cys Ala Thr Ala Thr Thr 225 230 235 240 Gly Thr Cys Gl Cys Gly Cys Thr Thr Thr Cys Ala Cys Ala Gly Cys 245 250 255 Ala Cys Cys Cys Thr Gly Cys Gly Gly Cys Gly Cys Thr Thr Gly Gly 260 265 270 Gly Ala Cys Gly Gly Thr Gly Gly Cys Thr Gly Thr Cys Ala Ala Ala 275 280 285 Thr Ala Cys Cys Ala Ala Gly Ala Thr Ala Thr Gly Ala Thr Thr Gly 290 295 300 Cys Gly Gly Cys Cys Cys Thr Gly Cys Cys Cys Gly Ala Ala Gly Cys 305 310 315 320 Cys Ala Cys Gly Cys Ala Cys Gly Ala Gly Gly Cys Ala Ala Thr Thr 325 330 335 Gly Thr Ala Gly Gly Gly Gly Thr Cys Gly Gly Thr Ala Ala Ala Cys 340 345 350 Ala Gly Thr Gly Gly Thr Cys Gly Gly Gly Ala Gly Cys Gly Cys Gly 355 360 365 Ala Gly Cys Ala Cys Thr Thr Gly Ala Gly Gly Cys Gly Cys Thr Gly 370 375 380 Cys Thr Gly Ala Cys Thr Gly Thr Gly Gly Cys Gly Gly Gly Thr Gly 385 390 395 400 Ala Gly Cys Thr Thr Ala Gly Gly Gly Gly Gly Cys Cys Thr Cys Cys 405 410 415 Gly Cys Thr Cys Cys Ala Gly Cys Thr Cys Gly Ala Cys Ala Cys Cys 420 425 430 Gly Gly Gly Cys Ala Gly Cys Thr Gly Cys Thr Gly Ala Ala Gly Ala 435 440 445 Thr Cys Gly Cys Gly Ala Ala Gly Ala Gly Ala Gly Gly Gly Gly Gly 450 455 460 Ala Gly Thr Ala Ala Cys Ala Gly Cys Gly Gly Thr Ala Gly Ala Gly 465 470 475 480 Gly Cys Ala Gly Thr Gly Cys Ala Cys Gly Cys Cys Thr Gly Gly Cys 485 490 495 Gly Cys Ala Ala Thr Gly Cys Gly Cys Thr Cys Ala Cys Cys Gly Gly 500 505 510 Gly Gly Cys Cys Cys Cys Cys Thr Thr Gly Ala Ala Cys Cys Thr Gly 515 520 525 Ala Cys Cys Cys Cys Ala Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly 530 535 540 Thr Cys Gly Cys Ala Ala Thr Cys Gly Cys Gly Thr Cys Ala Ala Ala 545 550 555 560 Cys Gly Gly Ala Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala 565 570 575 Gly Cys Cys Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys 580 585 590 Ala Ala Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr 595 600 605 Cys Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 610 615 620 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys 625 630 635 640 Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys 645 650 655 Ala Ala Gly Cys Ala Ala Thr Gly Gly Gly Gly Gly Thr Gly Gly Cys 660 665 670 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala 675 680 685 Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr 690 695 700 Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 705 710 715 720 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr Cys 725 730 735 15/65 Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala Gly Cys 740 745 750 Gly Ala Thr Thr Gly Cys Gly Thr Cys Gly Ala Ala Cys Ala Thr Thr 755 760 765 Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr 770 775 780 Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly 785 790 795 800 Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly 805 810 815 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr 820 825 830 Thr Gly Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr 835 840 845 Gly Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 850 855 860 Cys Ala Thr Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys 865 870 875 880 Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr 885 890 895 Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 900 905 910 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys 915 920 925 Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala 930 935 940 Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 945 950 955 960 Gly Cys Gly Thr Cys Ala Ala Ala Cys Gly Gly Ala Gly Gly Gly Gly 965 970 975 Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly Gly Ala 980 985 990 Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly Thr Thr Gly 995 1000 1005 Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr Thr Gly Thr 1010 1015 1020 Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys Cys Thr Thr 1025 1030 1035 Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala Gly Thr Cys 1040 1045 1050 Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala Ala Ala Thr 1055 1060 1065 Ala Ala Thr Ala Ala Cys Gly Gly Thr Gly Gly Cys Ala Ala Ala 1070 1075 1080 Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala Cys Gly 1085 1090 1095 Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr Cys 1100 1105 1110 Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 1115 1120 1125 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr 1130 1135 1140 Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala 1145 1150 1155 Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Gly Cys Ala Thr 1160 1165 1170 Gly Ala Cys Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala 1175 1180 1185 Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys 1190 1195 1200 Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys 1205 1210 1215 Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala Gly Cys Cys 1220 1225 1230 Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly Cys Cys Thr 1235 1240 1245 Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys Gly Cys Cys 1250 1255 1260 16/65 Ala Thr Cys Gly Cys Cys Thr Cys Gly Ala Ala Thr Gly Gly Cys 1265 1270 1275 Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly Gly Cys Gly 1280 1285 1290 Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala Cys Ala Gly 1295 1300 1305 Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr Gly Thr Ala 1310 1315 1320 Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys Ala Thr 1325 1330 1335 Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala Cys 1340 1345 1350 Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 1355 1360 1365 Gly Cys Gly Ala Ala Cys Ala Ala Thr Ala Ala Thr Gly Gly Gly 1370 1375 1380 Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly 1385 1390 1395 Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly 1400 1405 1410 Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr 1415 1420 1425 Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys 1430 1435 1440 Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala 1445 1450 1455 Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala 1460 1465 1470 Thr Cys Cys Cys Ala Cys Gly Ala Cys Gly Gly Thr Gly Gly Cys 1475 1480 1485 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly 1490 1495 1500 Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr 1505 1510 1515 Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr 1520 1525 1530 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly 1535 1540 1545 Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr 1550 1555 1560 Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Cys 1565 1570 1575 Ala Ala Cys Gly Gly Thr Gly Gly Ala Gly Gly Gly Ala Ala Ala 1580 1585 1590 Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr 1595 1600 1605 Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr 1610 1615 1620 Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala 1625 1630 1635 Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly 1640 1645 1650 Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys 1655 1660 1665 Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys Cys Ala Thr 1670 1675 1680 Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly 1685 1690 1695 Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala 1700 1705 1710 Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 1715 1720 1725 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr 1730 1735 1740 Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala 1745 1750 1755 Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala 1760 1765 1770 17/65 Ala Thr Cys Gly Cys Gly Thr Cys Ala Cys Ala Thr Gly Ala Cys 1775 1780 1785 Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys 1790 1795 1800 Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala 1805 1810 1815 Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys 1820 1825 1830 Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 1835 1840 1845 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly 1850 1855 1860 Cys Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr 1865 1870 1875 Gly Cys Ala Thr Cys Cys Cys Ala Cys Gly Ala Cys Gly Gly Thr 1880 1885 1890 Gly Gly Cys Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr 1895 1900 1905 Gly Ala Gly Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala 1910 1915 1920 Cys Thr Thr Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys 1925 1930 1935 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly 1940 1945 1950 Cys Thr Gly Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala 1955 1960 1965 Gly Thr Thr Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly 1970 1975 1980 Ala Ala Thr Ala Ala Cys Ala Ala Thr Gly Gly Ala Gly Gly Gly 1985 1990 1995 Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly 2000 2005 2010 Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys 2015 2020 2025 Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr 2030 2035 2040 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly 2045 2050 2055 Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly 2060 2065 2070 Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 2075 2080 2085 Cys Ala Thr Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly 2090 2095 2100 Cys Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala 2105 2110 2115 Gly Thr Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly 2120 2125 2130 Cys Cys Thr Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly 2135 2140 2145 Gly Ala Thr Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Ala 2150 2155 2160 Cys Cys Cys Gly Ala Ala Cys Ala Gly Gly Thr Gly Gly Thr Cys 2165 2170 2175 Gly Cys Cys Ala Thr Thr Gly Cys Thr Thr Cys Thr Ala Ala Thr 2180 2185 2190 Gly Gly Gly Gly Gly Ala Gly Gly Ala Cys Gly Gly Cys Cys Ala 2195 2200 2205 Gly Cys Cys Thr Thr Gly Gly Ala Gly Thr Cys Cys Ala Thr Cys 2210 2215 2220 Gly Thr Ala Gly Cys Cys Cys Ala Ala Thr Thr Gly Thr Cys Cys 2225 2230 2235 Ala Gly Gly Cys Cys Cys Gly Ala Thr Cys Cys Cys Gly Cys Gly 2240 2245 2250 Thr Thr Gly Gly Cys Thr Gly Cys Gly Thr Thr Ala Ala Cys Gly 2255 2260 2265 Ala Ala Thr Gly Ala Cys Cys Ala Thr Cys Thr Gly Gly Thr Gly 2270 2275 2280 18/65 Gly Cys Gly Thr Thr Gly Gly Cys Ala Thr Gly Thr Cys Thr Thr 2285 2290 2295 Gly Gly Thr Gly Gly Ala Cys Gly Ala Cys Cys Cys Gly Cys Gl 2300 2305 2310 Cys Thr Cys Gly Ala Thr Gly Cys Ala Gly Thr Cys Ala Ala Ala 2315 2320 2325 Ala Ala Gly Gly Gly Thr Cys Thr Gly Cys Cys Thr Cys Ala Thr 2330 2335 2340 Gly Cys Thr Cys Cys Cys Gly Cys Ala Thr Thr Gly Ala Thr Cys 2345 2350 2355 Ala Ala Ala Ala Gly Ala Ala Cys Cys Ala Ala Cys Cys Gly Gly 2360 2365 2370 Cys Gly Gly Ala Thr Thr Cys Cys Cys Gly Ala Gly Ala Gly Ala 2375 2380 2385 Ala Cys Thr Thr Cys Cys Cys Ala Thr Cys Gly Ala Gly Thr Cys 2390 2395 2400 Gly Cys Gly Gly Gly Ala Thr Cys Cys Gly Ala Thr Cys Cys Ala 2405 2410 2415 Ala Ala Ala Ala Ala Gly Ala Ala Gly Ala Gly Ala Ala Ala Gly 2420 2425 2430 Gly Thr Ala Gly Cys Cys Cys Cys Cys Cys Cys Gly Ala Cys Cys 2435 2440 2445 Gly Ala Thr Gly Thr Cys Ala Gly Cys Cys Thr Gly Gly Gly Gly 2450 2455 2460 Gly Ala Cys Gly Ala Gly Cys Thr Cys Cys Ala Cys Thr Thr Ala 2465 2470 2475 Gly Ala ■ Cys Gly Gly Cys Gly Ala Gly Gly Ala Cys Gly Thr Gly 2480 2485 2490 Gly Cys Gly Ala Thr Gly Gly Cys Gly Cys Ala Thr Gly Cys Cys 2495 2500 2505 Gly Ala Cys Gly Cys Gly Cys Thr Ala Gly Ala Cys Gly Ala Thr 2510 2515 2520 Thr Thr Cys Gly Ala Thr Cys Thr Gly Gly Ala Cys Ala Thr Gly 2525 2530 2535 Thr Thr Gly Gly Gly Gly Gly Ala Cys Gly Gly Gly Gly Ala Thr 2540 2545 2550 Thr Cys Cys Cys Cys Gly Gly Gly Thr Cys Cys Gly Gly Gly Ala 2555 2560 2565 Thr Thr Thr Ala Cys Cys Cys Cys Cys Cys Ala Cys Gly Ala Cys 2570 2575 2580 Thr Cys Cys Gly Cys Cys Cys Cys Cys Thr Ala Cys Gly Gly Cys 2585 2590 2595 Gly Cys Thr Cys Thr Gly Gly Ala Thr Ala Thr Gly Gly Cys Cys 2600 2605 2610 Gly Ala Cys Thr Thr Cys Gly Ala Gly Thr Thr Thr Gly Ala Gly 2615 2620 2625 Cys Ala Gly Ala Thr Gly Thr Thr Thr Ala Cys Cys Gly Ala Thr 2630 2635 2640 Gly Cys Cys Cys Thr Thr Gly Gly Ala Ala Thr Thr Gly Ala Cys 2645 2650 2655 Gly Ala Gly Thr Ala Cys Gly Gly Thr Gly Gly Gly 2660 2665 2670 <210> 12 <211> 2670 <212> PRT <213> Unknown <220> <223> Synthetic genetic construct <400> 12 Gly Ala Cys Thr Ala Cys Ala Ala Ala Gly Ala Cys Cys Ala Thr Gly 1 5 10 15 Ala Cys Gly Gly Thr Gly Ala Thr Thr Ala Thr Ala Ala Ala Gly Ala 20 25 30 Thr Cys Ala Thr Gly Ala Cys Ala Thr Cys Gly Ala Thr Thr Ala Cys 35 40 45 Ala Ala Gly Gly Ala Thr Gly Ala Cys Gly Ala Thr Gly Ala Cys Ala 19/65 50 55 60 Ala Gly Ala Thr Gly Gly Cys Cys Cys Cys Cys Ala Ala Gly Ala Ala 65 70 75 80 Gly Ala Ala Gly Ala Gly Gly Ala Ala Gly Gly Thr Gly Gly Gly Cys 85 90 95 Ala Thr Thr Cys Ala Cys Cys Gly Cys Gly Gly Gly Gly Thr Ala Cys 100 105 110 Cys Thr Ala Thr Gly Gly Thr Gly Gly Ala Cys Thr Thr Gly Ala Gly 115 120 125 Gly Ala Cys Ala Cys Thr Cys Gly Gly Thr Thr Ala Thr Thr Cys Gly 130 135 140 Cys Ala Ala Cys Ala Gly Cys Ala Ala Cys Ala Gly Gly Ala Gly Ala 145 150 155 160 Ala Ala Ala Thr Cys Ala Ala Gly Cys Cys Thr Ala Ala Gly Gly Thr 165 170 175 Cys Ala Gly Gly Ala Gly Cys Ala Cys Cys Gly Thr Cys Gly Cys Gly 180 185 190 Cys Ala Ala Cys Ala Cys Cys Ala Cys Gly Ala Gly Gly C s Gly Cys 195 200 205 Thr Thr Gly Thr Gly Gly Gly Gly Cys Ala Thr Gly Gly Cys Thr Thr 210 215 220 Cys Ala Cys Thr Cys Ala Thr Gly Cys Gly Cys Ala Thr Ala Thr Thr 225 230 235 240 Gly Thr Cys Gly Cys Gly Cys Thr Thr Thr Cys Ala Cys Ala Gly Cys 245 250 255 Ala Cys Cys Cys Thr Gly Cys Gly Gly Cys Gly Cys Thr Thr Gly Gly 260 265 270 Gly Ala Cys Gly Gly Thr Gly Gly Cys Thr Gly Thr Cys Ala Ala Ala 275 280 285 Thr Ala Cys Cys Ala Ala Gly Ala Thr Ala Thr Gly Ala Thr Thr Gly 290 295 300 Cys Gly Gly Cys Cys Cys Thr Gly Cys Cys Cys Gly Ala Ala Gly Cys 305 310 315 320 Cys Ala Cys Gly Cys Ala Cys Gly Ala Gly Gly Cys Ala Ala Thr Thr 325 330 335 Gly Thr Ala Gly Gly Gly Gly Thr Cys Gly Gly Thr Ala Ala Ala Cys 340 345 350 Ala Gly Thr Gly Gly Thr Cys Gly Gly Gly Ala Gly Cys Gly Cys Gly 355 360 365 Ala Gly Cys Ala Cys Thr Thr Gly Ala Gly Gly Cys Gly Cys Thr Gly 370 375 380 Cys Thr Gly Ala Cys Thr Gly Thr Gly Gly Cys Gly Gly Gly Thr Gly 385 390 395 400 Ala Gly Cys Thr Thr Ala Gly Gly Gly Gly Gly Cys Cys Thr Cys Cys 405 410 415 Gly Cys Thr Cys Cys Ala Gly Cys Thr Cys Gly Ala Cys Ala Cys Cys 420 425 430 Gly Gly Gly Cys Ala Gly Cys Thr Gly Cys Thr Gly Ala Ala Gly Ala 435 440 445 Thr Cys Gly Cys Gly Ala Ala Gly Ala Gly Ala Gly Gly Gly Gly Gly 450 455 460 Ala Gly Thr Ala Ala Cys Ala Gly Cys Gly Gly Thr Ala Gly Ala Gly 465 470 475 480 Gly Cys Ala Gly Thr Gly Cys Ala Cys Gly Cys Cys Thr Gly Gly Cys 485 490 495 Gly Cys Ala Ala Thr Gly Cys Gly Cys Thr Cys Ala Cys Cys Gly Gly 500 505 510 Gly Gly Cys Cys Cys Cys Cys Thr Thr Gly Ala Ala Cys Cys Thr Gly 515 520 525 Ala Cys Cys Cys Cys Ala Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly 530 535 540 Thr Cys Gly Cys Ala Al Thr Cys Gly Cys Gly Thr Cys Ala Cys Ala 545 550 555 560 Thr Gly Ala Cys Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala 565 570 575 Gly Cys Cys Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys 580 585 590 Ala Ala Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr 20/65 595 600 605 Cys Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 610 615 620 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys■ 625 630 635 640 Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys 645 650 655 Ala Ala Gly Cys Ala Ala Thr Gly Gly Gly Gly Gly Thr Gly Gly Cys 660 665 670 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala 675 680 685 Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr 690 695 700 Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 705 710 715 720 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr Cys 725 730 735 Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala Gly Cys 740 745 750 Gly Ala Thr Thr Gly Cys Gly Thr Cys Cys Ala Ala Cys Gly Gly Thr 755 1 760 765 Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr 770 775 780 Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly 785 790 795 800 Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly 805 810 815 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr 820 825 830 Thr Gly Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr 835 840 845 Gly Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 850 855 860 Cys Ala Thr Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys '865 870 875 880 Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr 885 890 895 Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 900 905 910 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys 915 920 925 Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala 930 935 940 Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 945 950 955 960 Gly Cys Gly Thr Cys Ala Cys Ala Thr Gly Ala Cys Gly Gly Gly Gly 965 970 975 Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly Gly Ala 980 985 990 Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly ' Gly Thr Thr Gly 995 1000 1005 Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr Thr Gly Thr 1010 1015 1020 Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys Cys Thr Thr 1025 1030 1035 Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala Gly Thr Cys 1040 1045 1050 Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala Ala Ala Thr 1055 1060 1065 Ala Ala Thr Ala Ala Cys Gly Gly Thr Gly Gly Cys Ala Ala Ala 1070 1075 1080 Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala Cys Gly 1085 1090 1095 Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr Cys 1100 1105 1110 Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 1115 1120 1125 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr 21/65 1130 1135 1140 Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala 1145 1150 1155 Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Cys Ala Ala Cys 1160 1165 1170 Gly Gly Thr Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala 1175 1180 1185 Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys 1190 1195 1200 Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys 1205 1210 1215 Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala Gly Cys Cys 1220 1225 1230 Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly Cys Cys Thr 1235 1240 1245 Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys Gly Cys Cys 1250 1255 1260 Ala Thr Cys Gly Cys Cys Thr Cys Gly Ala Ala Thr Gly Gly Cys 1265 1270 1275 Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly Gly Cys Gly 1280 1285 1290 Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala Cys Ala Gly 1295 1300 1305 Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr Gly Thr Ala 1310 1315 1320 Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys Ala Thr 1325 1330 1335 Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala Cys 1340 1345 1350 Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 1355 1360 1365 Gly Cys Gly Thr Cys Ala Ala Ala Cys Gly Gly Ala Gly Gly Gly 1370 1375 1380 Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly 1385 1390 1395 Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly 1400 1405 1410 Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr 1415 1420 1425 Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys 1430 1435 1440 Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala 1445 1450 1455 Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala 1460 1465 1470 Thr Cys Cys Cys Ala Cys Gly Ala Cys Gly Gly Thr Gly Gly Cys 1475 1480 1485 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly 1490 1495 1500 Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr 1505 1510 1515 Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr 1520 1525 1530 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly 1535 1540 1545 Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr 1550 1555 1560 Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Gly 1565 1570 1575 Cys Ala Thr Gly Ala Cys Gly Gly Ala Gly Gly Gly Ala Ala Ala 1580 1585 1590 Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr 1595 1600 1605 Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr 1610 1615 1620 Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala 1625 1630 1635 Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly 22/65 1640 1645 1650 Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys 1655 1660 1665 Gly Cys Cys Ala Thr Cys Gly Cys Cys Thr Cys Cys Ala Ala Thr 1670 1675 1680 Ala Thr Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly 1685 1690 1695 Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala 1700 1705 1710 Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 1715 1720 1725 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr 1730 1735 1740 Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala 1745 1750 1755 Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala 1760 1765 1770 Ala Thr Cys Gly Cys Gly Thr Cys Ala Cys Ala Thr Gly Ala Cys 1775 1780 1785 Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys 1790 1795 1800 Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala 1805 1810 1815 Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys 1820 1825 1830 Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 1835 1840 1845 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly 1850 1855 1860 Cys Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr 1865 1870 1875 Gly Cys Ala Ala Gly Cys Ala Ala Cys Ala Thr Cys Gly Gly Thr 1880 1885 1890 Gly Gly Cys Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr 1895 1900 1905 Gly Ala Gly Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala 1910 1915 1920 Cys Thr Thr Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys 1925 1930 1935 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly 1940 1945 1950 Cys Thr Gly Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala 1955 1960 1965 Gly Thr Thr Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly 1970 1975 1980 Thr Cys Cys Al Ala Cys Gly Gly Thr Gly Gly Ala Gly Gly Gly 1985 1990 1995 Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly 2000 2005 2010 Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys 2015 2020 2025 Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr 2030 2035 2040 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly 2045 2050 2055 Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly 2060 2065 2070 Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 2075 2080 2085 Cys Ala Thr Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly 2090 2095 2100 Cys Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala 2105 2110 2115 Gly Thr Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly 2120 2125 2130 Cys Cys Thr Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly 2135 2140 2145 Gly Ala Thr Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Ala 23/65 2150 2155 2160 Cys Cys Cys Gly Ala Ala Cys Ala Gly Gly Thr Gly Gly Thr Cys 2165 2170 2175 Gly Cys Cys Ala Thr Thr Gly Cys Thr Thr Cys Thr Ala Ala Thr 2180 2185 2190 Gly Gly Gly Gly Gly Ala Gly Gly Ala Cys Gly Gly Cys Cys Ala 2195 2200 2205 Gly Cys Cys Thr Thr Gly Gly Ala Gly Thr Cys Cys Ala Thr Cys 2210 2215 2220 Gly Thr Ala Gly Cys Cys Cys Ala Ala Thr Thr Gly Thr Cys Cys 2225 2230 2235 Ala Gly Gly Cys Cys Cys Gly Ala Thr Cys Cys Cys Gly Cys Gly 2240 2245 2250 Thr Thr Gly Gly Cys Thr Gly Cys Gly Thr Thr Ala Ala Cys Gly 2255 2260 2265 Ala Ala Thr Gly Ala Cys Cys Ala Thr Cys Thr Gly Gly Thr Gly 2270 2275 2280 Gly Cys Gly Thr Thr Gly Gly Cys Ala Thr Gly Thr Cys Thr Thr 2285 2290 2295 Gly Gly Thr Gly Gly Ala Cys Gly Ala Cys Cys Cys Gly Cys Gly 2300 2305 2310 Cys Thr Cys Gly Ala Thr Gly Cys Ala Gly Thr Cys Ala Ala Ala 2315 2320 2325 Ala Ala Gly Gly Gly Thr Cys Thr Gly Cys Cys Thr Cys Ala Thr 2330 2335 2340 Gly Cys Thr Cys Cys Cys Gly Cys Ala Thr Thr Gly Ala Thr Cys 2345 2350 2355 Ala Ala Ala Ala Gly Ala Ala Cys Cys Ala Ala Cys Cys Gly Gly 2360 2365 2370 Cys Gly Gly Ala Thr Thr Cys Cys Cys Gly Ala Gly Ala Gly Ala 2375 2380 2385 Ala Cys Thr Thr Cys Cys Cys Ala Thr Cys Gly Ala Gly Thr Cys 2390 2395 2400 Gly Cys Gly Gly Gly Ala Thr Cys Cys Gly Ala Thr Cys Cys Ala 2405 2410 2415 Ala Ala Ala Ala Ala Gly Ala Ala Gly Ala Gly Ala Ala Ala Gly 2420 2425 2430 Gly Thr Ala Gly Cys Cys Cys Cys Cys Cys Cys Gly Ala Cys Cys 2435 2440 2445 Gly Ala Thr Gly Thr C s Ala Gly Cys Cys Thr Gly Gly Gly Gly 2450 2455 2460 Gly Ala Cys Gly Ala Gly Cys Thr Cys Cys Ala Cys Thr Thr Ala 2465 2470 2475 Gly Ala Cys Gly Gly Cys Gly Ala Gly Gly Ala Cys Gly Thr Gly 2480 2485 2490 Gly Cys Gly Ala Thr Gly Gly Cys Gly Cys Ala Thr Gly Cys Cys 2495 2500 2505 Gly Ala Cys Gly Cys Gly Cys Thr Ala Gly Ala Cys Gly Ala Thr 2510 2515 2520 Thr Thr Cys Gly Ala Thr Cys Thr' Gly Gly Ala Cys Ala Thr Gly 2525 2530 2535 Thr Thr Gly Gly Gly Gly Gly Ala Cys Gly Gly Gly Gly Ala Thr 2540 2545 2550 Thr Cys Cys Cys Cys Gly Gly Gly Thr Cys Cys Gly Gly Gly Ala 2555 2560 2565 Thr Thr Thr Ala Cys Cys Cys Cys Cys Cys Ala Cys Gly Ala Cys 2570 2575 2580 Thr Cys Cys Gly Cys Cys Cys Cys Cys Thr Ala Cys Gly Gly Cys 2585 2590 2595 Gly Cys Thr Cys Thr Gly Gly Ala Thr Ala Thr Gly Gly Cys Cys 2600 2605 2610 Gly Ala Cys Thr Thr Cys Gly Ala Gly Thr Thr Thr Gly Ala Gly 2615 2620 2625 Cys Ala Gly Ala Thr Gly Thr Thr Thr Ala Cys Cys Gly Ala Thr 2630 2635 2640 Gly Cys Cys Cys Thr Thr Gly Gly Ala Ala Thr Thr Gly Ala Cys 2645 2650 2655 Gly Ala Gly Thr Ala Cys Gly Gly Thr Gly Gly Gly 24/65 2660 2665 <210> 13 <211> 2820 <212> PRT <213> Unknown <220> <223> Synthetic genetic construct <400> 13 Gly Ala Cys Thr Ala Cys Ala Ala Ala Gly Ala Cys Cys Ala Thr Gly 1 5 10 15 Ala Cys Gly Gly Thr Gly Ala Thr Thr Ala Thr Ala Ala Ala Gly Ala 20 25 30 Thr Cys Ala Thr Gly Ala Cys Ala Thr Cys Gly Ala Thr Thr Ala Cys 35 40 45 Ala Ala Gly Gly Ala Thr Gly Ala Cys Gly Ala Thr Gly Ala Cys Ala 50 55 60 Ala Gly Ala Thr Gly Gly Cys Cys Cys Cys Cys Ala Ala Gly Ala Ala 65 70 75 80 Gly Ala Ala Gly Ala Gly Gly Ala Ala Gly Gly Thr Gly Gly Gly Cys 85 90 95 Ala Thr Thr Cys Ala Cys Cys Gly Cys Gly Gly Gly Gly Thr Ala Cys 100 105 110 Cys Thr Ala Thr Gly Gly Thr Gly Gly Ala Cys Thr Thr Gly Ala Gly 115 120 125 Gly Ala Cys Ala Cys Thr Cys Gly Gly Thr Thr Ala Thr Thr Cys Gly 130 135 140 Cys Ala Ala Cys Ala Gly Cys Ala Ala Cys Ala Gly Gly Ala Gly Ala 145 150 155 160 Ala Ala Ala Thr Cys Ala Ala Gly Cys Cys Thr Ala Ala Gly Gly Thr 165 170 175 Cys Ala Gly Gly Ala Gly Cys Ala Cys Cys Gly Thr Cys Gly Cys Gly 180 185 190 Cys Ala Ala Cys Ala Cys Cys Ala Cys Gly Ala Gly Gly Cys Gly Cys 195 200 205 Thr Thr Gly Thr Gly Gly Gly Gly Cys Ala Thr Gly Gly Cys Thr Thr 210 215 220 Cys Ala Cys Thr Cys Ala Thr Gly Cys Gly Cys Ala Thr Ala Thr Thr 225 230 235 240 Gly Thr Cys Gly Cys Gly Cys Thr Thr Thr Cys Ala Cys Ala Gly Cys 245 250 255 Ala Cys Cys Cys Thr Gly Cys Gly Gly Cys Gly Cys Thr Thr Gly Gly 260 265 270 Gly Ala Cys Gly Gly Thr Gly Gly Cys Thr Gly Thr Cys Ala Ala Ala 275 280 285 Thr Ala Cys Cys Ala Ala Gly Ala Thr Ala Thr Gly Ala Thr Thr Gly 290 295 300 Cys Gly Gly Cys Cys Cys Thr Gly Cys Cys Cys Gly Ala Ala Gly Cys 305 310 315 320 Cys Ala Cys Gly Cys Ala Cys Gly Ala Gly Gly Cys Ala Ala Thr Thr 325 330 335 Gly Thr Ala Gly Gly Gly Gly Thr Cys Gly Gly Thr Ala Ala Ala Cys 340 345 350 Ala Gly Thr Gly Gly Thr Cys Gly Gl Gly Ala Gly Cys Gly Cys Gly 355 360 365 Ala Gly Cys Ala Cys Thr Thr Gly Ala Gly Gly Cys Gly Cys Thr Gly 370 375 380 Cys Thr Gly Ala Cys Thr Gly Thr Gly Gly Cys Gly Gly Gly Thr Gly 385 390 395 400 Ala Gly Cys Thr Thr Ala Gly Gly Gly Gly Gly Cys Cys Thr Cys Cys 405 410 415 Gly Cys Thr Cys Cys Ala Gly Cys Thr Cys Gly Ala Cys Ala Cys Cys 420 425 430 Gly Gly Gly Cys Ala Gly Cys Thr Gly Cys Thr Gly Ala Ala Gly Ala 435 440 445 Thr Cys Gly Cys Gly Ala Ala Gly Ala Gly Ala Gly Gly Gly Gly Gly 450 455 460 25/65 Ala Gly Thr Ala Ala Cys Ala Gly Cys Gly Gly Thr Ala Gly Ala Gly 465 470 475 480 Gly Cys Ala Gly Thr Gly Cys Ala Cys Gly Cys Cys Thr Gly Gly Cys 485 490 495 Gly Cys Ala Ala Thr Gly Cys Gly Cys Thr Cys Ala Cys Cys Gly Gly 500 505 510 Gly Gly Cys Cys Cys Cys Cys Thr Thr Gly Ala Ala Cys Cys Thr Gly 515 520 525 Ala Cys Cys Cys Cys Ala Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly 530 535 540 Thr Cys Gly Cys Ala Ala Thr Cys Gly Cys Gly Thr Cys Ala Ala Ala 545 550 555 560 Cys Gly Gly Ala Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala 565 570 575 Gly Cys Cys Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys 580 585 590 Ala Ala Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr 595 600 . 605 Cys Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 610 615 620 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys 625 630 635 640 Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys 645 650 655 Ala Ala Gly Cys Ala Ala Thr Gly Gly Gly Gly Gly Thr Gly Gly Cys 660 665 670 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala 675 680 685 Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr 690 695 700 Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 705 710 715 720 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr Cys 725 730 735 Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala Gly Cys 740 745 750 Gly Ala Thr Thr Gly Cys Gly Thr Cys Gly Ala Ala Cys Ala Thr Thr 755 760 765 Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr 770 775 780 Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly 785 790 795 800 Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly 805 810 815 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr 820 825 830 Thr Gly Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr 835 840 845 Gly Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 850 855 860 Cys Ala Thr Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys 865 870 875 880 Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr 885 890 895 Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 900 905 910 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys 915 920 925 Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala 930 935 940 Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 945 950 955 960 Gly Cys Gly Thr Cys Ala Ala Ala Cys Gly Gly Ala Gly Gly Gly Gly 965 970 975 Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly Gly Ala 980 985 990 Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly Thr Thr Gly 995 1000 1005 26/65 Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr Thr Gly Thr 1010 1015 1020 Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys Cys Thr Thr 1025 1030 1035 Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala Gly Thr Cys 1040 1045 1050 Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala Ala Ala Thr 1055 1060 1065 Ala Ala Thr Ala Ala Cys Gly Gly Thr Gly Gly Cys Ala Ala Ala 1070 1075 1080 Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala Cys Gly 1085 1090 1095 Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr Cys 1100 1105 1110 Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 1115 1120 1125 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr 1130 1135 1140 Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala 1145 1150 1155 Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Gly Cys Ala Thr 1160 1165 1170 Gly Ala Cys Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala 1175 1180 1185 Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys 1190 1195 1200 Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys 1205 1210 1215 Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala Gly Cys Cys 1220 1225 1230 Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly Cys Cys Thr 1235 1240 1245 Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys Gly Cys Cys 1250 1255 1260 Ala Thr Cys Gly Cys Cys Thr Cys Gly Ala Ala Thr Gly Gly Cys 1265 1270 1275 Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly Gly Cys Gly 1280 1285 1290 Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala Cys Ala Gly 1295 1300 1305 Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr Gly Thr Ala 1310 1315 1320 Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys Ala Thr 1325 1330 1335 Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala Cys 1340 1345 1350 Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 1355 1360 1365 Gly Cys Gly Ala Ala Cys Ala Ala Thr Ala Ala Thr Gly Gly Gly 1370 1375 1380 Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly 1385 1390 1395 Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly 1400 1405 1410 Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr 1415 1420 1425 Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys 1430 1435 1440 Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala 1445 1450 1455 Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala 1460 1465 1470 Thr Cys Cys Cys Ala Cys Gly Ala Cys Gly Gly Thr Gly Gly Cys 1475 1480 1485 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly 1490 1495 1500 Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr 1505 1510 1515 27/65 Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr 1520 1525 1530 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly 1535 1540 1545 Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr 1550 1555 1560 Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Cys 1565 1570 1575 Ala Ala Cys Gly Gly Thr Gly Gly Ala Gly Gly Gly Ala Ala Ala 1580 1585 1590 Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr 1595 1600 1605 Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr 1610 1615 1620 Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala 1625 1630 1635 Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly 1640 1645 1650 Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys 1655 1660 1665 Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys Cys Ala Thr 1670 1675 1680 Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly 1685 1690 1695 Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala 1700 1705 1710 Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 1715 1720 1725 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr 1730 1735 1740 Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala 1745 1750 1755 Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala 1760 1765 1770 Ala Thr Cys Gly Cys Gly Thr Cys Ala Cys Ala Thr Gly Ala Cys 1775 1780 1785 Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys 1790 1795 1800 Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala 1805 1810 1815 Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys 1820 1825 1830 Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 1835 1840 1845 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly 1850 1855 1860 Cys Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr 1865 1870 1875 Gly Cys Ala Thr Cys Cys Cys Ala Cys Gly Ala Cys Gly Gly Thr 1880 1885 1890 Gly Gly Cys Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr 1895 1900 1905 Gly Ala Gly Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala 1910 1915 1920 Cys Thr Thr Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys 1925 1930 1935 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly 1940 1945 1950 Cys Thr Gly Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala 1955 1960 1965 Gly Thr Thr Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly 1970 1975 1980 Ala Ala Thr Ala Ala Cys Ala Ala Thr Gly Gly Ala Gly Gly Gly 1985 1990 1995 Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly 2000 2005 2010 Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys 2015 2020 2025 28/65 Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr 2030 2035 2040 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly 2045 2050 2055 Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly 2060 2065 2070 Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 2075 2080 2085 Cys Ala Thr Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly 2090 2095 2100 Cys Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala 2105 2110 2115 Gly Thr Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly 2120 2125 2130 Cys Cys Thr Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly 2135 2140 2145 Gly Ala Thr Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Ala 2150 2155 2160 Cys Cys Cys Gly Ala Ala Cys Ala Gly Gly Thr Gly Gly Thr Cys 2165 2170 2175 Gly Cys Cys Ala Thr Thr Gly Cys Thr Thr Cys Thr Ala Ala Thr 2180 2185 2190 Gly Gly Gly Gly Gly Ala Gly Gly Ala Cys Gly Gly Cys Cys Ala 2195 2200 2205 Gly Cys Cys Thr Thr Gly Gly Ala Gly Thr Cys Cys Ala Thr Cys 2210 2215 2220 Gly Thr Ala Gly Cys Cys Cys Ala Ala Thr Thr Gly Thr Cys Cys 2225 2230 2235 Ala Gly Gly Cys Cys Cys Gly Ala Thr Cys • Cys Cys Gly Cys Gly 2240 2245 2250 Thr Thr Gly Gly Cys Thr Gly Cys Gly Thr Thr Ala Ala Cys Gly 2255 2260 2265 Ala Ala Thr Gly Ala Cys Cys Ala Thr Cys Thr Gly Gly Thr Gly 2270 2275 2280 Gly Cys Gly Thr Thr Gly Gly Cys Ala Thr Gly Thr Cys Thr Thr 2285 2290 2295 Gly Gly Thr Gly Gly Ala Cys Gly Ala Cys Cys Cys Gly Cys Gly 2300 2305 2310 Cys Thr Cys Gly Ala Thr Gly Cys Ala Gly Thr Cys Ala Ala Ala 2315 2320 2325 Ala Ala Gly Gly Gly Thr Cys Thr Gly Cys Cys Thr Cys Ala Thr 2330 2335 2340 Gly Cys Thr Cys Cys Cys Gly Cys Ala Thr Thr Gly Ala Thr Cys 2345 2350 2355 Ala Ala Ala Ala Gly Ala Ala Cys Cys Ala Ala Cys Cys Gly Gly 2360 2365 2370 Cys Gly Gly Ala Thr Thr Cys Cys Cys Gly Ala Gly Ala Gly Ala 2375 2380 2385 Ala Cys Thr Thr Cys Cys Cys Ala Thr Cys Gly Ala Gly Thr Cys 2390 2395 2400 Gly Cys Gly Gly Gly Ala Thr Cys Cys Gly Ala Thr Cys Cys Ala 2405 2410 2415 Ala Ala Ala Ala Ala Gly Ala Ala Gly Ala Gly Ala Ala Ala Gly 2420 2425 2430 Gly Thr Ala Cys Cys Ala Ala Ala Ala Ala Ala Gly Ala Ala Gly 2435 2440 2445 Ala Gly Ala Ala Ala Gly Gly Thr Cys Gly Ala Cys Gly Gly Cys 2450 2455 2460 Gly Gly Thr Gly Gly Thr Gly Cys Thr Thr Thr Gly Thr Cys Thr 2465 2470 2475 Cys Cys Thr Cys Ala Gly Cys Ala Cys Thr Cys Thr Gly Cys Thr 2480 2485 2490 Gly Thr Cys Ala Cys Thr Cys Ala Ala Gly Gly Ala Ala Gly Thr 2495 2500 2505 Ala Thr Cys Ala Thr Cys Ala Ala Gly Ala Ala Cys Ala Ala Gly 2510 2515 2520 Gly Ala Gly Gly Gly Cys Ala Thr Gly Gly Ala Thr Gly Cys Thr 2525 2530 2535 29/65 Ala Ala Gly Thr Cys Ala Cys Thr Ala Ala Cys Thr Gly Cys Cys 2540 2545 2550 Thr Gly Gly Thr Cys Cys Cys Gly Gly Ala Cys Ala Cys Thr Gly 2555 2560 2565 Gly Thr Gly Ala Cys Cys Thr Thr Cys Ala Ala Gly Gly Ala Thr 2570 2575 2580 Gly Thr Ala Thr Thr Thr Gly Thr Gly Gly Ala Cys Thr Thr Cys 2585 2590 2595 Ala Cys Cys Ala Gly Gly Gly Ala Gly Gly Ala Gly Thr Gly Gly 2600 2605 2610 Ala Ala Gly Cys Thr Gly Cys Thr Gly Gly Ala Cys Ala Cys Thr 2615 2620 2625 Gly Cys Thr Cys Ala Gly Cys Ala Gly Ala Thr Cys Gly Thr Gly 2630 2635 2640 Thr Ala Cys Ala Gly Ala Ala Ala Thr Gly Thr Gly Ala Thr Gly 2645 2650 2655 Cys Thr Gly Gly Ala Gly Ala Ala Cys Thr Ala Thr Ala Ala Gly 2660 2665 2670 Ala Ala Cys Cys Thr Gly Gly Thr Thr Thr Cys Cys Thr Thr Gly 2675 2680 2685 Gly Gl Thr Thr Ala Thr Cys Ala Gly Cys Thr Thr Ala Cys Thr 2690 2695 2700 Ala Ala Gly Cys Cys Ala Gly Ala Thr Gly Thr Gly Ala Thr Cys 2705 2710 2715 Cys Thr Cys Cys Gly Gly Thr Thr Gly Gly Ala Gly Ala Ala Gly 2720 2725 2730 Gly Gly Ala Gly Ala Ala Gly Ala Gly Cys Cys Cys Thr Gly Gly 2735 2740 2745 Cys Thr Gly Gly Thr Gly Gly Ala Gly Ala Gly Ala Gly Ala Ala 2750 2755 2760 Ala Thr Thr Cys Ala Cys Cys Ala Ala Gly Ala Gly Ala Cys Cys 2765 2770 2775 Cys Ala Thr Cys Cys Thr Gly Ala Thr Thr Cys Ala Gly Ala Gly 2780 2785 2790 Ala Cys Thr Gly Cys Ala Thr Thr Thr Gly Ala Ala Ala Thr Cys 2795 2800 2805 Ala Ala Ala Thr Cys Ala Thr Cys Ala Gly Thr Thr 2810 2815 2820 <210> 14 <211> 2820 <212> PRT <213> Unknown <220> <223> Synthetic genetic construct <400> 14 Gly Ala Cys Thr Ala Cys Ala Ala Ala Gly Ala Cys Cys Ala Thr Gly 1 5 10 15 Ala Cys Gly Gly Thr Gly Ala Thr Thr Ala Thr Ala Ala Ala Gly Ala 20 25 30 Thr Cys Ala Thr Gly Ala Cys Ala Thr Cys Gly Ala Thr Thr Ala Cys 35 40 45 Ala Ala Gly Gly Ala Thr Gly Ala Cys Gly Ala Thr Gly Ala Cys Ala 50 55 60 Ala Gly Ala Thr Gly Gly Cys Cys Cys Cys Cys Ala Ala Gly Ala Ala 65 70 75 80 Gly Ala Ala Gly Ala Gly Gly Ala Ala Gly Gly Thr Gly Gly Gly Cys 85 90 95 Ala Thr Thr Cys Ala Cys Cys Gly Cys Gly Gly Gly Gly Thr Ala Cys 100 105 110 Cys Thr Ala Thr Gly Gly Thr Gly Gly Ala Cys Thr Thr Gly Ala Gly 115 120 125 Gly Ala Cys Ala Cys Thr Cys Gly Gly Thr Thr Ala Thr Thr Cys Gly 130 135 140 Cys Ala Ala Cys Ala Gly Cys Ala Ala Cys Ala Gly Gly Ala Gly Ala 145 150 155 160 Ala Ala Ala Thr Cys Ala Ala Gly Cys Cys Thr Ala Ala Gly Gly Thr 30/65 165 170 175 Cys Ala Gly Gly Ala Gly Cys Ala Cys Cys Gly Thr Cys Gly Cys Gly 180 185 190 Cys Ala Ala Cys Ala Cys Cys Ala Cys Gly Ala Gly Gly Cys Gly Cys 195 200 205 Thr Thr Gly Thr Gly Gly Gly Gly Cys Ala Thr Gly Gly Cys Thr Thr 210 215 220 Cys Ala Cys Thr Cys Ala Thr Gly Cys Gly Cys Ala Thr Ala Thr Thr 225 230 235 240 Gly Thr Cys Gly Cys Gly Cys Thr Thr Thr Cys Ala Cys Ala Gly Cys 245 250 255 Ala Cys Cys Cys Thr Gly Cys Gly Gly Cys Gly Cys Thr Thr Gly Gly 260 265 270 Gly Ala Cys Gly Gly Thr Gly Gly Cys Thr Gly Thr Cys Ala Ala Ala 275 280 285 Thr Ala Cys Cys Ala Ala Gly Ala Thr Ala Thr Gly Ala Thr Thr Gly 290 295 300 Cys Gly Gly Cys Cys Cys Thr Gly Cys Cys Cys Gly Ala Ala Gly Cys 305 310 315 320 Cys Ala Cys Gly Cys Ala Cys Gly Ala Gly Gly Cys Ala Ala Thr Thr 325 330 335 Gly Thr Ala Gly Gly Gly Gly Thr Cys Gly Gly Thr Ala Ala Ala Cys 340 345 350 Ala Gly Thr Gly Gly Thr Cys Gly Gly Gly Ala Gly Cys Gly Cys Gly 355 360 365 Ala Gly Cys Ala Cys Thr Thr Gly Ala Gly Gly Cys Gly Cys Thr Gly 370 375 380 Cys Thr Gly Ala Cys Thr Gly Thr Gly Gly Cys Gly Gly Gly Thr Gly 385 390 395 400 Ala Gly Cys Thr Thr Ala Gly Gly Gly Gly Gly Cys Cys Thr Cys Cys 405 410 415 Gly Cys Thr Cys Cys Ala Gly Cys Thr Cys Gly Ala Cys Ala Cys Cys 420 425 430 Gly Gly Gly Cys Ala Gly Cys Thr Gly Cys Thr Gly Ala Ala Gly Ala 435 440 445 Thr Cys Gly Cys Gly Ala Ala Gly Ala Gly Ala Gly Gly Gly Gly Gly 450 455 460 Ala Gly Thr Ala Ala Cys Ala Gly Cys Gly Gly Thr Ala Gly Ala Gly 455 470 475 480 Gly Cys Ala Gly Thr Gly Cys Ala Cys Gly Cys Cys Thr Gly Gly Cys 485 490 495 Gly Cys Ala Ala Thr Gly Cys Gly Cys Thr Cys Ala Cys Cys Gly Gly 500 505 510 Gly Gly Cys Cys Cys Cys Cys Thr Thr Gly Ala Ala Cys Cys Thr Gly 515 520 525 Ala Cys Cys Cys Cys Ala Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly 530 535 540 Thr Cys Gly Cys Ala Ala Thr Cys Gly Cys Gly Thr Cys Ala Cys Ala 545 550 555 560 Thr Gly Ala Cys Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala 565 570 575 Gly Cys Cys Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys 580 585 590 Ala Ala Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr 595 600 605 Cys Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 610 615 620 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys 625 630 635 640 Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gl Cys 645 650 655 Ala Ala Gly Cys Ala Ala Thr Gly Gly Gly Gly Gly Thr Gly Gly Cys 660 665 670 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala 675 680 685 Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr 690 695 700 Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 31/65 705 710 715 720 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr Cys 725 730 735 Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala Gly Cys 740 745 750 Gly Ala Thr Thr Gly Cys Gly Thr Cys Cys Ala Ala Cys Gly Gly Thr 755 760 765 Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr 770 775 780 Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly 785 790 795 800 Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly 805 810 815 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr 820 825 830 Thr Gly Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr 835 840 845 Gly Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 850 855 860 Cys Ala Thr Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys 865 870 875 880 Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr 885 890 895 Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 900 905 910 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys 915 920 925 Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala 930 935 940 Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 945 950 955 960 Gly Cys Gly Thr Cys Ala Cys Ala Thr Gly Ala Cys Gly Gly Gly Gly 965 970 975 Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly Gly Ala 980 985 990 Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly Thr Thr Gly 995 1000 1005 Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr Thr Gly Thr 1010 1015 1020 Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys Cys Thr Thr 1025 1030 1035 Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala Gly Thr Cys 1040 1045 1050 Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala Ala Ala Thr 1055 1060 1065 Ala Ala Thr Ala Ala Cys Gly Gly Thr Gly Gly Cys Ala Ala Ala 1070 1075 1080 Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala Cys Gly 1085 1090 1095 Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr Cys 1100 1105 1110 Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 1115 1120 1125 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr 1130 1135 1140 Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala 1145 1150 1155 Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Cys Ala Ala Cys 1160 1165 1170 Gly Gly Thr Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala 1175 1180 1185 Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys 1190 1195 1200 Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys 1205 1210 1215 Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala Gly Cys Cys 1220 1225 1230 Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly Cys Cys Thr 32/65 1235 1240 1245 Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys Gly Cys Cys 1250 1255 1260 Ala Thr Cys Gly Cys Cys Thr Cys Gly Ala Ala Thr Gly Gly Cys 1265 1270 1275 Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly Gly Cys Gly 1280 1285 1290 Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala Cys Ala Gly 1295 1300 1305 Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr Gly Thr Ala 1310 1315 1320 Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys Ala Thr 1325 1330 1335 Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala Cys 1340 1345 1350 Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 1355 1360 1365 Gly Cys Gly Thr Cys Ala Ala Ala Cys Gly Gly Ala Gly Gly Gly 1370 1375 1380 Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly 1385 1390 1395 Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gl Gly 1400 1405 1410 Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr 1415 1420 1425 Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys 1430 1435 1440 Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala 1445 1450 1455 Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala 1460 1465 1470 Thr Cys Cys Cys Ala Cys Gly Ala Cys Gly Gly Thr Gly Gly Cys 1475 1480 1485 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly 1490 1495 1500 Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr 1505 1510 1515 Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr 1520 1525 1530 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly 1535 1540 1545 Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr 1550 1555 1560 Gly Th Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Gly 1565 1570 1575 Cys Ala Thr Gly Ala Cys Gly Gly Ala Gly Gly Gly Ala Ala Ala 1580 1585 1590 Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr 1595 1600 1605 Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr 1610 1615 1620 Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala 1625 1630 1635 Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly 1640 1645 1650 Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys 1655 1660 1665 Gly Cys Cys Ala Thr Cys Gly Cys Cys Thr Cys Cys Ala Ala Thr 1670 1675 1680 Ala Thr Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly 1685 1690 1695 Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala 1700 1705 1710 Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 1715 1720 1725 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr 1730 1735 1740 Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala 33/65 17 5 1750 1755 Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala 1760 1765 1770 Ala Thr Cys Gly Cys Gly Thr Cys Ala Cys Ala Thr Gly Ala Cys 1775 1780 1785 Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys 1790 1795 1800 Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala 1805 1810 1815 Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys 1820 1825 1830 Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 1835 1840 1845 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly 1850 1855 1860 Cys Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr 1865 1870 1875 Gly Cys Ala Ala Gly Cys Ala Ala Cys Ala Thr Cys Gly Gly Thr 1880 1885 1890 Gly Gly Cys Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr 1895 1900 1905 Gly Ala Gly Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala 1910 1915 1920 Cys Thr Thr Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys 1925 1930 1935 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly 1940 1945 1950 Cys Thr Gly Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala 1955 1960 1965 Gly Thr Thr Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly 1970 1975 1980 Thr Cys Cys Ala Ala Cys Gly Gly Thr Gly Gly Ala Gly Gly Gly 1985 1990 1995 Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly 2000 2005 2010 Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys 2015 2020 2025 Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr 2030 2035 2040 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly 2045 2050 2055 Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly 2060 2065 2070 Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 2075 2080 2085 Cys Ala Thr Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly 2090 2095 2100 Cys Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala 2105 2110 2115 Gly Thr Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly 2120 2125 2130 Cys Cys Thr Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly 2135 2140 2145 Gly Ala Thr Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Ala 2150 2155 2160 Cys Cys Cys Gly Ala Ala Cys Ala Gly Gly Thr Gly Gly Thr Cys 2165 2170 2175 Gly Cys Cys Ala Thr Thr Gly Cys Thr Thr Cys Thr Ala Ala Thr 2180 2185 2190 Gly Gly Gly Gly Gly Ala Gly Gly Ala Cys Gly Gly Cys Cys Ala 2195 2200 2205 Gly Cys Cys Thr Thr Gly Gly Ala Gly Thr Cys Cys Ala Thr Cys 2210 2215 2220 Gly Thr Ala Gly Cys Cys Cys Ala Ala Thr Thr Gly Thr Cys Cys 2225 2230 2235 Ala Gly Gly Cys Cys Cys Gly Ala Thr Cys Cys Cys Gly Cys Gly 2240 2245 2250 Thr Thr Gly Gly Cys Thr Gly Cys Gly Thr Thr Ala Ala Cys Gly 34/65 2255 2260 2265 Ala Ala Thr Gly Ala Cys Cys Ala Thr Cys Thr Gly Gly Thr Gly 2270 2275 2280 Gly Cys Gly Thr Thr Gly Gly Cys Ala Thr Gly Thr Cys Thr Thr 2285 2290 2295 Gly Gly Thr Gly Gly Ala Cys Gly Ala Cys Cys Cys Gly Cys Gly 2300 2305 2310 Cys Thr Cys Gly Ala Thr Gly Cys Ala Gly Thr Cys Ala Ala Ala 2315 2320 2325 Ala Ala Gly Gly Gly Thr Cys Thr Gly Cys Cys Thr Cys Ala Thr 2330 2335 2340 Gly Cys Thr Cys Cys Cys Gly Cys Ala Thr Thr Gly Ala Thr Cys 2345 2350 2355 Ala Ala Ala Ala Gly Ala Ala Cys Cys Ala Ala Cys Cys Gly Gly 2360 2365 2370 Cys Gly Gly Ala Thr Thr Cys Cys Cys Gly Ala Gly Ala Gly Ala 2375 2380 2385 Ala Cys Thr Thr Cys Cys Cys Ala Thr Cys Gly Ala Gly Thr Cys 2390 2395 2400 Gly Cys Gly Gl Gly Ala Thr Cys Cys Gly Ala Thr Cys Cys Ala 2405 2410 2415 Ala Ala Ala Ala Ala Gly Ala Ala Gly Ala Gly Ala Ala Ala Gly 2420 2425 2430 Gly Thr Ala Cys Cys Ala Ala Ala Ala Ala Ala Gly Ala Ala Gly 2435 2440 2445 Ala Gly Ala Ala Ala Gly Gl Thr Cys Gly Ala Cys Gly Gly Cys 2450 2455 2460 Gly ■Gly Thr Gly Gly Thr Gly Cys Thr Thr Thr Gly Thr Cys Thr 2465 2470 2475 Cys Cys Thr Cys Ala Gly Cys Ala Cys Thr Cys Thr Gly Cys Thr 2480 2485 2490 Gly Thr Cys Ala Cys Thr Cys Ala Ala Gly Gly Ala Ala Gly Thr 2495 2500 2505 Ala Thr Cys Ala Thr Cys Ala Ala Gly Ala Ala Cys Ala Ala Gly 2510 2515 2520 Gly Ala Gly Gly Gly Cys Ala Thr Gly Gly Ala Thr Gly Cys Thr 2525 2530 2535 Ala Ala Gly Thr Cys Ala Cys Thr Ala Ala Cys Thr Gly Cys Cys 2540 2545 2550 Thr Gly Gly Thr Cys Cys Cys Gly Gly Ala Cys Ala Cys Thr Gly 2555 2560 2565 Gly Thr Gly Ala Cys Cys Thr Thr Cys Ala Ala Gly Gly Ala Thr 2570 2575 2580 Gly Thr Ala Thr Thr Thr Gly Thr Gly Gly Ala Cys Thr Thr Cys 2585 2590 2595 Ala Cys Cys Ala Gly Gly Gly Ala Gly Gly Ala Gly Thr Gly Gly 2600 2605 2610 Ala Ala Gly Cys Thr Gly Cys Thr Gly Gly Ala Cys Ala Cys Thr 2615 2620 2625 Gly Cys Thr Cys Ala Gly Cys Ala Gly Ala Thr Cys Gly Thr Gly 2630 2635 2640 Thr Ala Cys Ala Gly Ala Ala Ala Thr Gly Thr Gly Ala Thr Gly 2645 2650 2655 Cys Thr Gly Gly Ala Gly Ala Ala Cys Thr Ala Thr Ala Ala Gly 2660 2665 2670 Ala Ala Cys Cys Thr Gly Gly Thr Thr Thr Cys Cys Thr Thr Gly 2675 2680 2685 Gly Gly Thr Thr Ala Thr Cys Ala Gly Cys Thr Thr Ala Cys Thr 2690 2695 2700 Ala Ala Gly Cys Cys Ala Gly Ala Thr Gly Thr Gly Ala Thr Cys 2705 2710 2715 Cys Thr Cys Cys Gly Gly Thr Thr Gly Gly Ala Gly Ala Ala Gly 2720 2725 2730 Gly Gly Ala Gly Ala Ala Gly Ala Gly Cys Cys Cys Thr Gly Gly 2735 2740 2745 Cys Thr Gly Gly Thr Gly Gly Ala Gly Ala Gly Ala Gly Ala Ala 2750 2755 2760 Ala Thr Thr Cys Ala Cys Cys Ala Ala Gly Ala Gly Ala Cys Cys 35/65 2765 2770 2775 Cys Ala Thr Cys Cys Thr Gly Ala Thr Thr Cys Ala Gly Ala Gly 2780 ' 2785 2790 Ala Cys Thr Gly Cys Ala Thr Thr Thr Gly Ala Ala Ala Thr Cys 2795 2800 2805 Ala Ala Ala Thr Cys Ala Thr Cys Ala Gly Thr Thr 2810 2815 2820 <210> 15 <211> ' 720 <212> PRT <213> Unknown <220> <223> Synthetic genetic construct <400> 15 Ala Thr Gly Gly Thr Gly Ala Gly Cys Ala Ala Gly Gly Gly Cys Gly 1 5 10 15 Ala Gly Gly Ala Gly Cys Thr Gly Thr Thr Cys Ala Cys Cys Gly Gly 20 25 30 Gly Gly Thr Gly Gly Thr Gly Cys Cys Cys Ala Thr Cys Cys Thr Gly 35 40 45 Gly Thr Cys Gly Ala Gly Cys Thr Gly Gly Ala Cys Gly Gly Cys Gly 50 55 60 Ala Cys Gly Thr Ala Ala Ala Cys Gly Gly Cys Cys Ala Cys Ala Ala 65 70 75 80 Gly Thr Thr Cys Ala Gly Cys Gly Thr Gly Thr Cys Cys Gly Gly Cys 85 90 95 Gly Ala Gly Gly Gly Cys Gly Ala Gly Gly Gly Cys Gly Ala Thr Gly 100 105 110 Cys Cys Ala Cys Cys Thr Ala Cys Gly Gly Cys Ala Ala Gly Cys Thr 115 120 125 Gly Ala Cys Cys Cys Thr Gly Ala Ala Gly Thr Thr Cys Ala Thr Cys 130 135 140 Thr Gly Cys Ala Cys Cys Ala Cys Cys Gly Gly Cys Ala Ala Gly Cys 145 150 155 160 Thr Gly Cys Cys Cys Gly Thr Gly Cys Cys Cys Thr Gly Gly Cys Cys 165 170 175 Cys Ala Cys Cys Cys Thr Cys Gly Thr Gly Ala Cys Cys Ala Cys Cys 180 185 190 Thr Thr Cys Gly Gly Cys Thr Ala Cys Gly Gly Cys Cys Thr Gly Ala 195 200 205 Thr Gly Thr Gly Cys Thr Thr Cys Gly Cys Cys Cys Gly Cys Thr Ala 210 215 220 Cys Cys Cys Cys Gly Ala Cys Cys Ala Cys Ala Thr Gly Ala Ala Gly 225 230 235 240 Cys Ala Gly Cys Ala Cys Gly Ala Cys Thr Thr Cys Thr Thr Cys Ala 245 250 255 Ala Gly Thr Cys Cys Gly Cys Cys Ala Thr Gly Cys Cys Cys Gly Ala 260 265 270 Ala Gly Gly Cys Thr Ala Cys Gly Thr Cys Cys Ala Gly Gly Ala Gly 275 280 285 Cys Gly Cys Ala- Cys Cys Ala Thr Cys Thr Thr Cys Thr Thr Cys Ala 290 295 300 Ala Gly Gly Ala Cys Gly Ala Cys Gly Gly Cys Ala Ala Cys Thr Ala 305 310 315 320 Cys Ala Ala Gly Ala Cys Cys Cys Gly Cys Gly Cys Cys Gly Ala Gly 325 330 335 Gly Thr Gly Ala Ala Gly Thr Thr Cys Gly Ala Gly Gly Gly Cys Gly 340 345 350 Ala Cys Ala Cys Cys Cys Thr Gly Gly Thr Gly Ala Ala Cys Cys Gly 355 360 365 Cys Ala Thr Cys Gly Ala Gly Cys Thr Gly Ala Ala Gly Gly Gly Cys 370 375 380 Ala Thr Cys Gly Ala Cys Thr Thr Cys Ala Ala Gly Gly Ala Gly Gly 385 390 395 400 Ala Cys Gly Gly Cys Ala Ala Cys Ala Thr Cys Cys Thr Gly Gly Gly 405 410 415 36/65 Gly Cys Ala Cys Ala Ala Gly Cys Thr Gly Gly Ala Gly Thr Ala Cys 420 425 430 Ala Ala Cys Thr Ala Cys Ala Ala Cys Ala Gly Cys Cys Ala Cys Ala 435 440 445 Ala Cys Gly Thr Cys Thr Ala Thr Ala Thr Cys Ala Thr Gly Gly Cys 450 455 460 Cys Gly Ala Cys Ala Ala Gly Cys Ala Gly Ala Ala Gly Ala Ala Cys 465 470 475 480 Gly Gly Cys Ala Thr Cys Ala Ala Gly Gly Thr Gly Ala Ala Cys Thr 485 490 495 Thr Cys Ala Ala Gly Ala Thr Cys Cys Gly Cys Cys Ala Cys Ala Ala 500 505 510 Cys Ala Thr Cys Gly Ala Gly Gly Ala Cys Gly Gly Cys Ala Gly Cys 515 520 525 Gly Thr Gly Cys Ala Gly Cys Thr Cys Gly Cys Cys Gly Ala Cys Cys 530 535 540 Ala Cys Thr Ala Cys Cys Ala Gly Cys Ala Gly Ala Ala Cys Ala Cys 545 550 555 560 Cys Cys Cys Cys Ala Thr Cys Gly Gly Cys Gly Ala Cys Gly Gly Cys 565 570 575 Cys Cys Cys Gly Thr Gly Cys Thr Gly Cys Thr Gly Cys Cys Cys Gly 580 585 590 Ala Cys Ala Ala Cys Cys Ala Cys Thr Ala Cys Cys Thr Gly Ala Gly 595 600 605 Cys Thr Ala Cys Cys Ala Gly Thr Cys Cys Gly Cys Cys Cys Thr Gly 610 615 620 Ala Gly Cys Ala Ala Ala Gly Ala Cys Cys Cys Cys Ala Ala Cys Gly 625 630 635 640 Ala Gly Ala Ala Gly Cys Gly Cys Gly Ala Thr Cys Ala Cys Ala Thr 645 650 655 Gly Gly Thr Cys Cys Thr Gly Cys Thr Gly Gly Ala Gly Thr Thr Cys 660 665 670 Gly Thr Gly Ala Cys Cys Gly Cys Cys Gly Cys Cys Gly Gly Gly Ala 675 680 685 Thr Cys Ala Cys Thr Cys Thr Cys Gly Gly Cys Ala Thr Gly Gly Ala 690 695 700 Cys Gly Ala Gly Cys Thr Gly Thr Ala Cys Ala Ala Gly Thr Ala Ala 705 710 715 720 <210> 16 <211> 699 <212> PRT <213> Unknown <220> <223> Synthetic genetic construct <400> : 16 Ala Thr Gly Ala Gly Cys Gly Ala Gly Cys Thr Gly Ala Thr Thr Ala 1 5 10 15 Ala Gly Gly Ala Gly Ala Ala Cys Ala Thr Gly Cys Ala Cys Ala Thr 20 25 30 Gly Ala Ala Gly Cys Thr Gly Thr Ala Cys Ala Thr Gly Gly Ala Gly 35 40 45 Gly Gly Cys Ala Cys Cys Gly Thr Gly Gly Ala Cys Ala Ala Cys Cys 50 55 60 Ala Thr Cys Ala Cys Thr Thr Cys Ala Ala Gly Thr Gly Cys Ala Cys 65 70 75 80 Ala Thr Cys Cys Gly Ala Gly Gly Gly Cys Gly Ala Ala Gly Gly Cys 85 90 95 Ala Ala Gly Cys Cys Cys Thr Ala Cys Gly Ala Gly Gly Gly Cys Ala 100 105 110 Cys Cys Cys Ala Gly Ala Cys Cys Ala Thr Gly Ala Gly Ala Ala Thr 115 120 125 Cys Ala Ala Gly Gly Thr Gly Gly Thr Cys Gly Ala Gly Gly Gly Cys 130 135 140 Gly Gly Cys Cys Cys Thr Cys Thr Cys Cys Cys Cys Thr Thr Cys Gly 145 150 155 160 Cys Cys Thr Thr Cys Gly Ala Cys Ala Thr Cys Cys Thr Gly Gly Cys 37/65 165 170 175 Thr Ala Cys Thr Ala Gly Cys Thr Thr Cys Cys Thr Cys Thr Ala Cys 130 185 190 Gly Gly Cys Ala Gly Cys Ala Ala Gly Ala Cys Cys Thr Thr Cys Ala 195 200 205 Thr Cys Ala Ala Cys Cys Ala Cys Ala Cys Cys Cys Ala Gly Gly Gly 210 215 220 Cys Ala Thr Cys Cys Cys Cys Gly Ala Cys Thr Thr Cys Thr Thr Cys 225 230 235 240 Ala Ala Gly Cys Ala Gly Thr Cys Cys Thr Thr Cys Cys Cys Thr Gly 245 250 255 Ala Gly Gly Gly Cys Thr Thr Cys Ala Cys Ala Thr Gly Gly Gly Ala 260 265 270 Gly Ala Gly Ala Gly Thr Cys Ala Cys Cys Ala Cys Ala Thr Ala Cys 275 280 285 Gly Ala Ala Gly Ala Cys Gly Gly Gly Gly Gly Cys Gly Thr Gly Cys 290 295 300 Thr Gly Ala Cys Cys Gly Cys Thr Ala Cys Cys Cys Ala Gly Gly Ala 305 310 315 320 Cys Ala Cys Cys Ala Gly Cys Cys Thr Cys Cys Ala Gly Gly Ala Cys 325 330 335 Gly Gly Cys Thr Gly Cys Cys Thr Cys Ala Thr Cys Thr Ala Cys Ala 340 345 350 Ala Cys Gly Thr Cys Ala Ala Gly Ala Thr Cys Ala Gly Ala Gly Gly 355 360 365 Gly Gly Thr Gly Ala Ala Cys Thr Thr Cys Ala Cys Ala Thr Cys Cys 370 375 380 Ala Ala Cys Gly Gly Cys Cys Cys Thr Gly Thr Gly Ala Thr Gly Cys 385 390 395 400 Ala Gly Ala Ala Gly Ala Ala Ala Ala Cys Ala Cys Thr Cys Gly Gly 405 410 415 Cys Thr Gly Gly Gly Ala Gly Gly Cys Cys Thr Thr Cys Ala Cys Cys 420 425 430 Gly Ala Gly Ala Cys Gly Cys Thr Gly Thr Ala Cys Cys Cys Cys Gly 435 440 445 Cys Thr Gly Ala Cys Gly Gly Cys Gly Gly Cys Cys Thr Gly Gly Ala 450 455 460 Ala Gly Gly Cys Ala Gly Ala Ala Ala Cys Gly Ala Cys Ala Thr Gly 465 470 475 480 Gly Cys Cys Cys Thr Gly Ala Ala Gly Cys Thr Cys Gly Thr Gly Gly 485 490 495 Gly Cys Gly Gly Gly Ala Gly Cys Cys Ala Thr Cys Thr Gly Ala Thr 500 505 510 Cys Gly Cys Ala Ala Ala Cys Ala Thr Cys Ala Ala Gly Ala Cys Cys 515 520 525 Ala Cys Ala Thr Ala Thr Ala Gly Ala Thr Cys Cys Ala Ala Gly Ala 530 535 540 Ala Ala Cys Cys Cys Gly Cys Thr Ala Ala Gly Ala Ala Cys Cys Thr 545 550 555 560 Cys Ala Ala Gly Ala Thr Gly Cys Cys Thr Gly Gly Cys Gly Thr Cys 565 570 575 Thr Ala Cys Thr Ala Thr Gly Thr Gly Gly Ala Cys Thr Ala Cys Ala 580 585 590 Gly Ala Cys Thr Gly Gly Ala Ala Ala Gly Ala Ala Thr Cys Ala Ala 595 600 605 Gly Gly Ala Gly Gly Cys Cys Ala Ala Cys Ala Ala Cys Gly Ala Gly 610 615 620 Ala Cys Cys Thr Ala Cys Gly Thr Cys Gly Ala Gly Cys Ala Gly Cys 625 630 635 640 Ala Cys Gly Ala Gly Gly Thr Gly Gly Cys Ala Gly Thr Gly Gly Cys 645 650 655 Cys Ala Gly Ala Thr Ala Cys Thr Gly Cys Gly Ala Cys Cys Thr Cys 660 665 670 Cys Cys Thr Ala Gly Cys Ala Ala Ala Cys Thr Gly Gly Gly Gly Cys 675 680 685 Ala Cys Ala Ala Gly Cys Thr Cys Ala Ala Thr 690 695 38/65 <210> 17 <211> 6309 <212> PRT <213> Unknown <220> <223> Synthetic genetic construct <400> 17 Gly Ala Cys Thr Ala ■ Cys Ala Ala Ala Gly Ala Cys Cys Ala Thr Gly 1 5 10 15 Ala Cys Gly Gly Thr Gly Ala Thr Thr Ala Thr Ala Ala Ala Gly Ala 20 25 30 Thr Cys Ala Thr Gly Ala Cys Ala Thr Cys Gly Ala Thr Thr Ala Cys 35 40 45 Ala Ala Gly Gly Ala Thr Gly Ala Cys Gly Ala Thr Gly Ala Cys Ala 50 55 60 Ala Gly Ala Thr Gly Gly Cys Cys Cys Cys Cys Ala Ala Gly Ala Ala 65 70 75 80 Gly Ala Ala Gly Ala Gly Gly Ala Ala Gly Gly Thr Gly Gly Gly Cys 85 90 95 Ala Thr Thr Cys Ala Cys Cys Gly Cys Gly Gly Gly Gly Thr Ala Cys 100 105 110 Cys Thr Ala Thr Gly Gly Thr Gly Gly Ala Cys Thr Thr Gly Ala Gly 115 120 125 Gly Ala Cys Ala Cys Thr Cys Gly Gly Thr Thr Ala Thr Thr Cys Gly 130 135 140 Cys Ala Ala Cys Ala Gly Cys Ala Ala Cys Ala Gly Gly Ala Gly Ala 145 150 155 160 Ala Ala Ala Thr Cys Ala Ala Gly Cys Cys Thr Ala Ala Gly Gly Thr 165 170 175 Cys Ala Gly Gly Ala Gly Cys Ala Cys Cys Gly Thr Cys Gly Cys Gly 180 185 190 Cys Ala Ala Cys Ala Cys Cys Ala Cys Gly Ala Gly Gly Cys Gly Cys 195 200 205 Thr Thr Gly Thr Gly Gly Gly Gly Cys Ala Thr Gly Gly Cys Thr Thr 210 215 220 Cys Ala Cys Thr Cys Ala Thr Gly Cys Gly Cys Ala Thr Ala Thr Thr 225 230 235 240 Gly Thr Cys Gly Cys Gly Cys Thr Thr Thr Cys Ala Cys Ala Gly Cys 245 250 255 A-la Cys Cys Cys Thr Gly Cys Gly Gly Cys Gly Cys Thr Thr Gly Gl 260 265 270 Gly Ala Cys Gly Gly Thr Gly Gly Cys Thr Gly Thr Cys Ala Ala Ala 275 280 285 Thr Ala Cys Cys Ala Ala Gly Ala Thr Ala Thr Gly Ala Thr Thr Gly 290 295 300 Cys Gly Gly Cys Cys Cys Thr Gly Cys Cys Cys Gly Ala Ala Gly Cys 305 310 315 320 Cys Ala Cys Gly Cys Ala Cys Gly Ala Gly Gly Cys Ala Ala Thr Thr 325 330 335 Gly Thr Ala Gly Gly Gly Gly Thr Cys Gly Gly Thr Ala Ala Ala Cys 340 345 350 Ala Gly Thr Gly Gly Thr Cys Gly Gly Gly Ala Gly Cys Gl Cys Gly 355 360 365 Ala Gly Cys Ala Cys Thr Thr Gly Ala Gly Gly Cys Gly Cys Thr Gly 370 375 380 Cys Thr Gly Ala Cys Thr Gly Thr Gly Gly Cys Gly Gly Gly Thr Gly 385 390 395 400 Ala Gly Cys Thr Thr Ala Gly Gly Gly Gly Gly Cys Cys Thr Cys Cys 405 410 415 Gly Cys Thr Cys Cys Ala Gly Cys Thr Cys Gly Ala Cys Ala Cys Cys 420 425 430 Gly Gly Gly Cys Ala Gly Cys Thr Gly Cys Thr Gly Ala Ala Gly Ala 435 440 445 Thr Cys Gly Cys Gly Ala Ala Gly Ala Gly Ala Gly Gly Gly Gly Gly 450 455 460 Ala Gly Thr Ala Ala Cys Ala Gly Cys Gly Gly Thr Ala Gly Ala Gly 465 470 475 480 39/65 Gly Cys Ala Gly Thr Gly Cys Ala Cys Gly Cys Cys Thr Gly Gly Cys 485 490 495 Gly Cys Ala Ala Thr Gly Cys Gly Cys Thr Cys Ala Cys Cys Gly Gly 500 505 510 Gly Gly Cys Cys Cys Cys Cys Thr Thr Gly Ala Ala Cys Cys Thr Gly 515 520 525 Ala Cys Cys Cys Cys Ala Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly 530 535 540 Thr Cys Gly Cys Ala Ala Thr Cys Gly Cys Gly Thr Cys Ala Cys Ala 545 550 555 560 Thr Gly Ala Cys Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala 565 570 575 Gly Cys Cys Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys 580 585 590 Ala Ala Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr 595 600 605 Cys Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 610 615 620 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys 625 630 635 640 Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys 645 650 655 Ala Ala Gly Cys Ala Ala Thr Gly Gly Gly Gly Gly Thr Gly Gly Cys 660 665 670 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala 675 680 685 Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr 690 695 700 Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 705 710 715 720 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr Cys 725 730 735 Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala Gly Cys 740 745 750 Gly Ala Thr Thr Gly Cys Gly Thr Cys Cys Ala Ala Cys Gly Gly Thr 755 760 765 Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr 770 775 780 Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly 785 790 795 800 Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly 805 810 815 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr 820 825 830 Thr Gly Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr 835 840 845 Gly Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 850 855 860 Cys Ala Thr Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys 865 870 875 880 Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr 885 890 895 Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 900 905 910 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys 915 920 925 Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala 930 935 940 Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 945 950 955 960 Gly Cys Gly Thr Cys Ala Cys Ala Thr Gly Ala Cys Gly Gly Gly Gly 965 970 975 Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly Gly Ala 980 985 990 Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly Thr Thr Gly 995 1000 1005 Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr Thr Gly Thr 1010 1015 1020 Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys Cys Thr Thr 1025 1030 1035 Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala Gly Thr Cys 1040 1045 1050 Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala Ala Ala Thr 1055 1060 1065 Ala Ala Thr Ala Ala Cys Gly Gly Thr Gly Gly Cys Ala Al Ala 1070 1075 1080 Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala Cys Gly 1085 1090 1095 Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr Cys 1100 1105 1110 Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 1115 1120 1125 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr 1130 1135 1140 Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala 1145 1150 1155 Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Cys Ala Ala Cys 1160 1165 1170 Gly Gly Thr Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala 1175 1180 1185 Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys 1190 1195 1200 Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys 1205 1210 1215 Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala Gly Cys Cys 1220 1225 1230 Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly Cys Cys Thr 1235 1240 1245 Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys Gly Cys Cys 1250 1255 1260 Ala Thr Cys Gly Cys Cys Thr Cys Gly Ala Ala Thr Gly Gly Cys 1265 1270 1275 Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly Gly Cys Gly 1280 1285 1290 Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala Cys Ala Gly 1295 1300" 1305 Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr Gly Thr Ala 1310 1315 1320 Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys Ala Thr 1325 1330 1335 Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala Cys 1340 1345 1350 Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 1355 1360 1365 Gly Cys Gly Thr Cys Ala Ala Ala Cys Gly Gly Ala Gly Gly Gly 1370 1375 1380 Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly 1385 1390 1395 Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly 1400 1405 1410 Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr 1415 1420 1425 Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys 1430 1435 1440 Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala 1445 1450 1455 Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala 1460 1465 1470 Thr Cys Cys Cys Ala Cys Gly Ala Cys Gly Gly Thr Gly Gly Cys 1475 1480 1485 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly 1490 1495 1500 Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr 1505 1510 1515 Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr 1520 1525 1530 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly 1535 1540 1545 Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr 1550 1555 1560 Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Gly 1565 1570 1575 Cys Ala Thr Gly Ala Cys Gly Gly Ala Gly Gly Gly Ala Ala Ala 1580 1585 1590 Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr 1595 1600 1605 Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr 1610 1615 1620 Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala 1625 1630 1635 Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly 1640 1645 1650 Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys 1655 1660 1665 Gly Cys Cys Ala Thr Cys Gly Cys Cys Thr Cys Cys Ala Ala Thr 1670 1675 1680 Ala Thr Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly 1685 1690 1695 Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala 1700 1705 1710 Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 1715 1720 1725 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr 1730 1735 1740 Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala 1745 1750 1755 Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala 1760 1765 1770 Ala Thr Cys Gly Cys Gly Thr Cys Ala Cys Ala Thr Gly Ala Cys 1775 1780 1785 Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys 1790 1795 1800 Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala 1805 1810 1815 Ala Gly Gly Thr Thr Gly Thr Thr Gl Cys Cys Gly Gly Thr Cys 1820 1825 1830 Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 1835 1840 1845 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly 1850 1855 1860 Cys Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr 1865 1870 1875 Gly Cys Ala Ala Gly Cys Ala Ala Cys Ala Thr Cys Gly Gly Thr 1880 1885 1890 Gly Gly Cys Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr 1895 1900 1905 Gly Ala Gly Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala 1910 1915 1920 Cys Thr Thr Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys 1925 1930 1935 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly 1940 1945 1950 Cys Thr Gly Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala 1955 1960 1965 Gly Thr Thr Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly 1970 1975 1980 Thr Cys Cys Ala Ala Cys Gly Gly Thr Gly Gly Ala Gly Gly Gly 1985 1990 1995 Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly 2000 2005 2010 Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys 2015 2020 2025 Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr 2030 2035 2040 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly 2045 2050 2055 Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly 2060 2065 2070 Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 2075 2080 2085 Cys Ala Thr Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly 2090 2095 210Ό Cys Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala 2105 2110 2115 Gly Thr Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly 2120 2125 2130 Cys Cys Thr Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly 2135 2140 2145 Gly Ala Thr Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Ala 2150 2155 2160 C s Cys Cys Gly Ala Ala Cys Ala Gly Gly Thr Gly Gly Thr Cys 2165 2170 2175 Gly Cys Cys Ala Thr Thr Gly Cys Thr Thr Cys Thr Ala Ala Thr 2180 2185 2190 Gly Gly Gly Gly Gly Ala Gly Gly Ala Cys Gly Gly Cys Cys Ala 2195 2200 2205 Gly Cys Cys Thr Thr Gly Gly Ala Gly Thr Cys Cys Ala Thr Cys 2210 2215 2220 Gly Thr Ala Gly Cys Cys Cys Ala Ala Thr Thr Gly Thr Cys Cys 2225 2230 2235 Ala Gly Gly Cys Cys Cys Gly Ala Thr Cys Cys Cys Gly Cys Gly 2240 2245 2250 Thr Thr Gly Gly Cys Thr Gly Cys Gly Thr Thr Ala Ala Cys Gly 2255 2260 2265 Ala Ala Thr Gly Ala Cys Cys Ala Thr Cys Thr Gly Gly Thr Gly 2270 2275 2280 Gly Cys Gly Thr Thr Gly Gly Cys Ala Thr Gly Thr Cys Thr Thr 2285 2290 2295 Gly Gly Thr Gly Gly Ala Cys Gly Ala Cys Cys Cys Gly Cys Gly 2300 2305 2310 Cys Thr Cys Gly Ala Thr Gly Cys Ala Gly Thr Cys Ala Ala Ala 2315 2320 2325 Ala Ala Gly Gly Gly Thr Cys Thr Gly Cys Cys Thr Cys Ala Thr 2330 2335 2340 Gly Cys Thr Cys Cys Cys Gly Cys Ala Thr Thr Gly Ala Thr Cys 2345 2350 2355 Ala Ala Ala Ala Gly Ala Ala Cys Cys Ala Ala Cys Cys Gly Gly 2360 2365 2370 Cys Gly Gly Ala Thr Thr Cys Cys Cys Gly Ala Gly Ala Gly Ala 2375 2380 2385 Ala Cys Thr Thr Cys Cys Cys Ala Thr Cys Gly Ala Gly Thr Cys 2390 2395 2400 Gly Cys Gly Gly Gly Ala Thr Cys Cys Gly Ala Thr Cys Cys Ala 2405 2410 2415 Ala Ala Ala Ala Ala Gly Ala Ala Gly Ala Gly Ala Ala Ala Gly 2420 2425 2430 Gly Thr Ala Cys Cys Ala Ala Ala Ala Ala Ala Gly Ala Ala Gly 2435 2440 2445 Ala Gly Ala Ala Ala Gly Gly Thr Cys Gly Ala Cys Gly Gly Cys 2450 2455 2460 Gly Gly Thr Gly Gly Thr Gly Cys Thr Thr Thr Gly Thr Cys Thr 2465 2470 2475 Cys Cys Thr Cys Ala Gly Cys Ala Cys Thr Cys Thr Gly Cys Thr 2480 2485 2490 Gly Thr Cys Ala Cys Thr Cys Ala Ala Gly Gly Ala Ala Gly Thr 2495 2500 2505 Ala Thr Cys Ala Thr Cys Ala Ala Gly Ala Ala Cys Ala Ala Gly 2510 2515 2520 Gly Ala Gly Gly Gly Cys Ala Thr Gly Gly Ala Thr Gly Cys Thr 2525 2530 2535 Ala Ala Gly Thr Cys Ala Cys Thr Ala Ala Cys Thr Gly Cys Cys 2540 2545 2550 43/65 Thr Gly Gly Thr Cys Cys Cys Gly Gly Ala Cys Ala Cys Thr Gly 2555 2560 2565 Gly Thr Gly Ala Cys Cys Thr Thr Cys Ala Ala Gly Gly Ala Thr 2570 2575 2580 Gly Thr Ala Thr Thr Thr Gly Thr Gly Gly Ala Cys Thr Thr Cys 2585 2590 2595 Ala Cys Cys Ala Gly Gly Gly Ala Gly Gly Ala Gly Thr Gly Gly 2600 2605 2610 Ala Ala Gly Cys Thr Gly Cys Thr Gly Gly Ala Cys Ala Cys Thr 2615 2620 2625 Gly Cys Thr Cys Ala Gly Cys Ala Gly Ala Thr Cys Gly Thr Gly 2630 ■ 2635 2640 Thr Ala Cys Ala Gly Ala Ala Ala Thr Gly Thr Gly Ala Thr Gly 2645 2650 2655 Cys Thr Gly Gly Ala Gly Ala Ala Cys Thr Ala Thr Ala Ala Gly 2660 2665 2670 Ala Ala Cys Cys Thr Gly Gly Thr Thr Thr Cys Cys Thr Thr Gly 2675 2680 2685 Gly Gly Thr Thr Ala Thr Cys Ala Gly Cys Thr Thr Ala Cys Thr 2690 2695 2700 Ala Ala Gly Cys Cys Ala Gly Ala Thr Gly Thr Gly Ala Thr Cys 2705 2710 2715 Cys Thr Cys Cys Gly Gly Thr Thr Gly Gly Ala Gly Ala Ala Gly 2720 2725 2730 Gly Gly Ala Gly Ala Ala Gly Ala Gly Cys Cys Cys Thr Gly Gly 2735 2740 2745 Cys Thr Gly Gly Thr Gly Gly Ala Gly Ala Gly Ala Gly Ala Ala 2750 2755 2760 Ala Thr Thr Cys Ala Cys Cys Ala Ala Gly Ala Gly Ala Cys Cys 2765 2770 2775 Cys Ala Thr Cys Cys Thr Gly Ala Thr Thr Cys Ala Gly Ala Gly 2780 2785 2790 Ala Cys Thr Gly Cys Ala Thr Thr Thr Gly Ala Ala Ala Thr Cys 2795 2800 2805 Ala Ala Ala Thr Cys Ala Thr Cys Ala Gly Thr Thr Ala Gly Ala 2810 2815 2820 Gly Cys Cys Gly Ala Gly Gly Gly Gly Ala Gly Ala Gly Gly Ala 2825 2830 2835 Ala Gly Thr Cys Thr Thr Cys Thr Gly Ala Cys Cys Thr Gly Cys 2840 2845 2850 Gly Gly Ala Gly Ala Cys Gly Thr Cys Gly Ala Ala Gly Ala Gly 2855 2860 2865 Ala Ala Thr Cys Cys Thr Gly Gly Ala Cys Cys Cys Gly Ala Cys 2870 2875 2880 Thr Ala Cys Ala Ala Ala Gly Ala Cys Cys Ala Thr Gly Ala Cys 2885 2890 2895 Gly Gly Thr Gly Ala Thr Thr Ala Thr Ala Ala Ala Gly Ala Thr 2900 2905 2910 Cys Ala Thr Gly Ala C s Ala Thr Cys Gly Ala Thr Thr Ala Cys 2915 2920 2925 Ala Ala Gly Gly Ala Thr Gly Ala Cys Gly Ala Thr Gly Ala Cys 2930 2935 2940 Ala Ala Gly Ala Thr Gly Gly Cys Cys Cys Cys Cys Ala Ala Gly 2945 2950 2955 Ala Ala Gly Ala Ala Gly Ala Gly Gly Ala Ala Gly Gly Thr Gly 2960 2965 2970 Gly Gly Cys Ala Thr Thr Cys Ala Cys Cys Gly Cys Gly Gly Gly 2975 2980 2985 Gly Thr Ala Cys Cys Thr Ala Thr Gly Gly Thr Gly Gly Ala Cys 2990 2995 3000 Thr Thr Gly Ala Gly Gly Ala Cys Ala Cys Thr Cys Gly Gly Thr 3005 3010 3015 Thr Ala Thr Thr Cys Gly Cys Ala Ala Cys Ala Gly Cys Ala Ala 3020 3025 3030 Cys Ala Gly Gly Ala Gly Ala Ala Ala Ala Thr Cys Ala Ala Gly 3035 3040 3045 Cys Cys Thr Ala Ala Gly Gly Thr Cys Ala Gly Gly Ala Gly Cys 3050 3055 3060 44/65 Ala Cys Cys Gly Thr Cys Gly Cys Gly Cys Ala Ala Cys Ala Cys 3065 3070 3075 Cys Ala Cys Gly Ala Gly Gly Cys Gly Cys Thr Thr Gly Thr Gly 3080 3085 3090 Gly Gly Gly Cys Ala Thr Gly Gly Cys Thr Thr Cys Ala Cys Thr 3095 3100 3105 Cys Ala Thr Gly Cys Gly Cys Ala Thr Ala Thr Thr Gly Thr Cys 3110 3115 3120 Gly Cys Gly Cys Thr Thr Thr Cys Ala Cys Ala Gly Cys Ala Cys 3125 3130 3135 Cys Cys Thr Gly Cys Gly Gly Cys Gly Cys Thr Thr Gly Gly Gly 3140 3145 3150 Ala Cys Gly Gly Thr Gly Gly Cys Thr Gly Thr Cys Ala Ala Ala 3155 3160 3165 Thr Ala Cys Cys Ala Ala Gly Ala Thr Ala Thr Gly Ala Thr Thr 3170 3175 3180 Gly Cys Gly Gly Cys Cys Cys Thr Gly Cys Cys Cys Gly Ala Ala 3185 3190 3195 Gly Cys Cys Ala Cys Gly Cys Ala Cys Gly Ala Gly Gly Cys Ala 3200 3205 3210 Ala Thr Thr Gly Thr Ala Gly Gly Gly Gly Thr Cys Gly Gly Thr 3215 3220 3225 Ala Ala Ala Cys Ala Gly Thr Gly Gly Thr Cys Gly Gly Gly Ala 3230 3235 3240 Gly Cys Gly Cys Gly Ala Gly Cys Ala Cys Thr Thr Gly Ala Gly 3245 3250 3255 Gly Cys Gly Cys Thr Gly Cys Thr Gly Ala Cys Thr Gly Thr Gly 3260 3265 3270 Gly Cys Gly Gly Gly Thr Gly Ala Gly Cys Thr Thr Ala Gly Gly 3275 3280 3285 Gly Gly Gly Cys Cys Thr Cys Cys Gly Cys Thr Cys Cys Ala Gly 3290 3295 3300 Cys Thr Cys Gly Ala Cys Ala Cys Cys Gly Gly Gly Cys Ala Gly 3305 3310 3315 Cys Thr Gly Cys Thr Gly Ala Ala Gly Ala Thr Cys Gly Cys Gly 3320 3325 3330 Ala Ala Gly Ala Gly Ala Gly Gly Gly Gly Gly Ala Gly Thr Ala 3335 3340 3345 Ala Cys Ala Gly Cys Gly Gly Thr Ala Gly Ala Gly Gly Cys Ala 3350 3355 3360 Gly Thr Gly Cys Ala Cys Gly Cys Cys Thr Gly Gly Cys Gly Cys 3365 3370 3375 Ala Ala Thr Gly Cys Gly Cys Thr Cys Ala Cys Cys Gly Gly Gly 3380 3385 3390 Gly Cys Cys Cys Cys Cys Thr Thr Gly Ala Ala Cys Cys Thr Gly 3395 3400 3405 Ala Cys Cys Cys Cys Ala Gly Ala Cys Cys Ala Gly Gly Thr Ala 3410 3415 3420 Gly Thr Cys Gly Cys Ala Ala Thr Cys Gly Cys Gly Thr Cys Ala 3425 3430 3435 Ala Ala Cys Gly Gly Ala Gly Gly Gly Gly Gly Ala Ala Ala Gly 3440 3445 3450 Cys Ala Ala Gly Cys Cys Cys Thr Gly Gly Ala Ala Ala Cys Cys 3455 3460 3465 Gly Thr Gly Cys Ala Ala Ala Gly Gly Thr Thr Gly Thr Thr Gly 3470 3475 3480 Cys Cys Gly Gly Thr Cys Cys Thr Thr Thr Gly Thr Cys Ala Ala 3485 3490 3495 Gly Ala Cys Cys Ala Cys Gly Gly Cys Cys Thr Thr Ala Cys Ala 3500 3505 3510 Cys Cys Gl Gly Ala Gly Cys Ala Ala Gly Thr Cys Gly Thr Gly 3515 3520 3525 Gly Cys Cys Ala Thr Thr Gly Cys Ala Ala Gly Cys Ala Ala Thr 3530 3535 3540 Gly Gly Gly Gly Gly Thr Gly Gly Cys Ala Ala Ala Cys Ala Gly 3545 3550 3555 Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala Cys Gly Gly Thr Thr 3560 3565 3570 45/65 Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr Cys Cys Cys Ala 3575 3580 3585 Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala Gly Cys Cys 3590 3595 3600 Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr Cys Cys Cys 3605 3610 3615 Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala Gly Cys Gly 3620 3625 3630 Ala Thr Thr Gly Cys Gly Thr Cys Gly Ala Ala Cys Ala Thr Thr 3635 3640 3645 Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala Gly Cys Ala 3650 3655 3660 Thr Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys Cys Ala Ala 3665 3670 3675 Cys Gly Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys Gly Thr Gly 3680 3685 3690 Thr Thr Gly Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys 3695 3700 3705 Gly Gly Thr Thr Thr Gly Ala Cys Gly Cys Cys Thr Gly Cys Ala 3710 3715 3720 Cys Ala Ala Gly Thr Gly Gly Thr Cys Gly Cys Cys Ala Thr Cys 3725 3730 3735 Gly Cys Cys Ala Gly Cys Cys Ala Thr Gly Ala Thr Gly Gly Cys 3740 3745 3750 Gly Gly Thr Ala Ala Gly Cys Ala Gly Gly Cys Gly Cys Thr Gly 3755 3760 3765 Gly Ala Ala Ala Cys Ala Gly Thr Ala Cys Ala Gly Cys Gly Cys 3770 3775 3780 Cys Thr Gly Cys Thr Gly Cys Cys Thr Gly Thr Ala Cys Thr Gly 3785 3790 3795 Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys Ala Thr Gly Gly Ala 3800 3805 3810 Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala Cys Cys Ala Gly 3815 3820 3825 Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys Gly Cys Gly 3830 3835 3840 Thr Cys Ala Ala Ala Cys Gly Gly Ala Gly Gly Gly Gly Gly Ala 3845 3850 3855 Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly Gly Ala Ala 3860 3865 3870 Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly Thr Thr Gly 3875 3880 3885 Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys' Thr Thr Thr Gly Thr 3890 3895 3900 Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys Cys Thr Thr 3905 3910 3915 Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala Gly Thr Cys 3920 3925 3930 Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala Ala Ala Thr 3935 3940 3945 Ala Ala Thr Ala Ala Cys Gly Gly Thr Gly Gly Cys Ala Ala Ala 3950 3955 3960 Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala Cys Gly 3965 3970 3975 Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr Cys 3980 3985 3990 Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 3995 4000 4005 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr 4010 4015 4020 Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala 4025 4030 4035 Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Gly Cys Ala Thr 4040 4045 4050 Gly Ala Cys Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala 4055 4060 4065 Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys 4070 4075 4080 46/65 Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys 4085 4090 4095 Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala Gly Cys Cys 4100 4105 4110 Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly Cys Cys Thr 4115 4120 4125 Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys Gly Cys Cys 4130 4135 4140 Ala Thr Cys Gly Cys Cys Thr Cys Gly Ala Ala Thr Gly Gly Cys 4145 4150 4155 Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly Gly Cys Gly 4160 4165 4170 Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala Cys Ala Gly 4175 4180 4185 Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr Gly Thr Ala 4190 4195 4200 Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys Ala Thr 4205 4210 4215 Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala Cys 4220 4225 4230 Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 4235 4240 4245 Gly Cys Gly Ala Ala Cys Ala Ala Thr Ala Ala Thr Gly Gly Gly 4250 4255 4260 Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly 4265 4270 4275 Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly 4280 4285 4290 Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr 4295 4300 4305 Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys 4310 4315 4320 Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala 4325 4330 4335 Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala 4340 4345 4350 Thr Cys Cys Cys Ala Cys Gly Ala Cys Gly Gly Thr Gly Gly Cys 4355 4360 4365 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly 4370 4375 4380 Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr 4385 4390 4395 Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr 4400 4405 4410 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly 4415 4420 4425 Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr 4430 4435 4440 Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Cys 4445 4450 4455 Ala Ala Cys Gly Gly Thr Gly Gly Ala Gly Gly Gly Ala Ala Ala 4460 4465 4470 Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr 4475 4480 4485 Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr 4490 4495 4500 Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala 4505 4510 4515 Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly 4520 4525 4530 Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys 4535 4540 4545 Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys Cys Ala Thr 4550 4555 4560 Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly 4565 4570 4575 Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala 4580 4585 4590 47/65 Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 4595 4600 4605 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr 4610 4615 4620 Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala 4625 4630 4635 Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala 4640 4645 4650 Ala Thr Cys Gly Cys Gly Thr Cys Ala Cys Ala Thr Gly Ala Cys 4655 4660 4665 Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys 4670 4675 4680 Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala 4685 4690 4695 Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys 4700 4705 4710 Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 4715 4720 4725 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly 4730 4735 4740 Cys Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr 4745 4750 4755 Gly Cys Ala Thr Cys Cys Cys Ala Cys Gly Ala Cys Gly Gly Thr 4760 4765 4770 Gly Gly Cys Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr 4775 4780 4785 Gly Ala Gly Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala 4790 4795 4800 Cys Thr Thr Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys 4805 4810 4815 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly 4820 4825 4830 Cys Thr Gly Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala 4835 4840 4845 Gly Thr Thr Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly 4850 4855 4860 Ala Ala Thr Ala Ala Cys Ala Ala Thr Gly Gly Ala Gly Gly Gly 4865 4870 4875 Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly 4880 4885 4890 Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys 4895 4900 4905 Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr 4910 4915 4920 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly 4925 4930 4935 Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly 4940 4945 4950 Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 4955 4960 4965 Cys Ala Thr Gly Ala Thr Gly ' Gly Cys Gly Gly Thr Ala Ala Gly 4970 4975 4980 Cys Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala 4985 4990 4995 Gly Thr Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly 5000 5005 5010 Cys Cys Thr Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly 5015 5020 5025 Gly Ala Thr Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Ala 5030 5035 5040 Cys Cys Cys Gly Ala Ala Cys Ala Gly Gly Thr Gly Gly Thr Cys 5045 5050 5055 Gly Cys Cys Ala Thr Thr Gly Cys Thr Thr Cys Thr Ala Ala Thr 5060 5065 5070 Gly Gly Gly Gly Gly Ala Gly Gly Ala Cys Gly Gly Cys Cys Ala 5075 5080 5085 Gly Cys Cys Thr Thr Gly Gly Ala Gly Thr Cys Cys Ala Thr Cys 5090 5095 5100 48/65 Gly Thr Ala Gly Cys Cys Cys Ala Ala Thr Thr Gly Thr Cys Cys 5105 5110 5115 Ala Gly Gly Cys Cys Cys Gly Ala Thr Cys Cys Cys Gly Cys Gly 5120 5125 5130 Thr Thr Gly Gly Cys Thr Gly Cys Gly Thr Thr Ala Ala Cys Gly 5135 5140 5145 Ala Ala Thr Gly Ala Cys Cys Ala Thr Cys Thr Gly Gly Thr Gly 5150 5155 5160 Gly Cys Gly Thr Thr Gly Gly Cys Ala Thr Gly Thr Cys Thr Thr 5165 5170 5175 Gly Gly Thr Gly Gly Ala Cys Gly Ala Cys Cys Cys Gly Cys Gly 5180 5185 5190 Cys Thr Cys Gly Ala Thr Gly Cys Ala Gly Thr Cys Ala Ala Ala 5195 5200 5205 Ala Ala Gly Gly Gly Thr Cys Thr Gly Cys Cys Thr Cys Ala Thr 5210 5215 5220 Gly Cys Thr Cys Cys Cys Gly Cys Ala Thr Thr Gly Ala Thr Cys 5225 5230 5235 Ala Ala Ala Ala Gly Ala Ala Cys Cys Ala Ala Cys Cys Gly Gly 5240 5245 5250 Cys Gly Gly Ala Thr Thr Cys Cys Cys Gly Ala Gly Ala Gly Ala 5255 5260 5265 Ala Cys Thr Thr Cys Cys Cys Ala Thr Cys Gly Ala Gly Thr Cys 5270 5275 5280 Gly Cys Gly Gly Gly Ala Thr Cys Cys Gly Ala Thr Cys Cys Ala 5285 5290 5295 Ala Ala Ala Ala Ala Gly Ala Ala Gly Ala Gly Ala Ala Ala Gly 5300 5305 5310 Gly Thr Ala Gly Cys Cys Cys Cys Cys Cys Cys Gly Ala Cys Cys 5315 5320 5325 Gly Ala Thr Gly Thr Cys Ala' Gly Cys Cys Thr Gly Gly Gly Gly 5330 5335 5340 Gly Ala Cys Gly Ala Gly Cys Thr Cys Cys Ala Cys Thr Thr Ala 5345 5350 5355 Gly Ala Cys Gly Gly Cys Gly Ala Gly Gly Ala Cys Gly Thr Gly 5360 5365 5370 Gly Cys Gly Ala Thr Gly Gly Cys Gly Cys Ala Thr Gly Cys Cys 5375 5380 5385 Gly Ala Cys Gly Cys Gly Cys Thr Ala. Gly Ala Cys Gly Ala Thr 5390 5395 5400 Thr Thr Cys Gly Ala Thr Cys Thr Gly Gly Ala Cys Ala Thr Gly 5405 5410 5415 Thr Thr Gly Gly Gly Gly Gly Ala Cys Gly Gly Gly Gly Ala Thr 5420 5425 5430 Thr Cys Cys Cys Cys Gly Gly Gly Thr Cys Cys Gly Gly Gly Ala 5435 5440 5445 Thr Thr Thr Ala Cys Cys Cys Cys Cys Cys Ala Cys Gly Ala Cys 5450 5455 5460 Thr Cys Cys Gly Cys Cys Cys Cys Cys Thr Ala Cys Gly Gly Cys 5465 5470 5475 Gly Cys Thr Cys Thr Gly Gly Ala Thr Ala Thr Gly Gly Cys Cys 5480 5485 5490 Gly Ala Cys Thr Thr Cys Gly Ala Gly Thr Thr Thr Gly Ala Gly 5495 5500 5505 Cys Ala Gly Ala Thr Gly Thr Thr Thr Ala Cys Cys Gly Ala Thr 5510 5515 5520 Gly Cys Cys Cys Thr Thr Gly Gly Ala Ala Thr Thr Gly Ala Cys 5525 5530 5535 Gly Ala Gly Thr Ala Cys Gly Gly Thr Gly Gly Gly Ala Gly Ala 5540 5545 5550 Gly Cys Cys Gly Ala Gly Gly Gly Gly Ala Gly Ala Gly Gly Ala 5555 5560 5565 Ala Gly Thr Cys Thr Thr Cys Thr Gly Ala Cys Cys Thr Gly Cys 5570 5575 5580 Gly Gly Ala Gly Ala Cys Gly Thr Cys Gly Ala Ala Gly Ala Gly 5585 5590 5595 Ala Ala Thr Cys Cys Thr Gly Gly Ala Cys Cys Cys Ala Thr Gly 5600 5605 5610 49/65 Ala Gly Cys Gly Ala Gly Cys Thr Gly Ala Thr Thr Ala Ala Gly 5615 5620 5625 Gly Ala Gly Ala Ala Cys Ala Thr Gly Cys Ala Cys Ala Thr Gly 5630 5635 5640 Ala Ala Gly Cys Thr Gly Thr Ala Cys Ala Thr Gly Gly Ala Gly 5645 5650 5655 Gly Gly Cys Ala Cys Cys Gly Thr Gly Gly Ala Cys Ala Ala Cys 5660 5665 5670 Cys Ala Thr Cys Ala Cys Thr Thr Cys Ala Ala Gly Thr Gly Cys 5675 5680 5685 Ala Cys Ala Thr Cys Cys Gly Ala Gly Gly Gly Cys Gly Ala Ala 5690 5695 5700 Gly Gly Cys Ala Ala Gly Cys Cys Cys Thr Ala Cys Gly Ala Gly 5705 5710 5715 Gly Gly Cys Ala Cys Cys Cys Ala Gly Ala Cys Cys Ala Thr Gly 5720 5725 5730 Ala Gly Ala Ala Thr Cys Ala Ala Gly Gly Thr Gly Gly Thr Cys 5735 5740 5745 Gly Ala Gly Gly Gly Cys Gly Gly Cys Cys Cys Thr Cys Thr Cys 5750 5755 5760 Cys Cys Cys Thr Thr Cys Gly Cys Cys Thr Thr Cys Gly Ala Cys 5765 5770 5775 Ala Thr Cys Cys Thr Gly Gly Cys Thr Ala Cys Thr Ala Gly Cys 5780 5785 5790 Thr Thr Cys Cys Thr Cys Thr Ala Cys Gly Gly Cys Ala Gly Cys 5795 5800 5805 Ala Ala Gly Ala Cys Cys Thr Thr Cys Ala Thr Cys Ala Ala Cys 5810 5815 5820 Cys Ala Cys Ala Cys Cys Cys Ala Gly Gly Gly Cys Ala Thr Cys 5825 5830 5835 Cys Cys Cys Gly Ala Cys Thr Thr Cys Thr Thr Cys Ala Ala Gly 5840 5845 5850 Cys Ala Gly Thr Cys Cys Thr Thr Cys Cys Cys Thr Gly Ala Gly 5855 5860 5865 Gly Gly Cys Thr Thr Cys Ala Cys Ala Thr Gly Gly Gly Ala Gly 5870 5875 5880 Ala Gly Ala Gly Thr Cys Ala Cys Cys Ala Cys Ala Thr Ala Cys 5885 5890 5895 Gly Ala Ala Gly Ala Cys Gly Gly Gly Gly Gly Cys Gly Thr Gly 5900 5905 5910 Cys Thr Gly Ala Cys Cys Gly Cys Thr Ala Cys Cys Cys Ala Gly 5915 5920 5925 Gly Ala Cys Ala Cys Cys Ala Gly Cys Cys Thr Cys Cys Ala Gly 5930 5935 5940 Gly Ala Cys Gly Gly Cys Thr Gly Cys Cys Thr Cys Ala Thr Cys 5945 5950 5955 Thr Ala Cys Ala Ala Cys Gly Thr Cys Ala Ala Gly Ala Thr Cys 5960 5965 5970 Ala Gly Ala Gly Gly Gly Gly Thr Gly Ala Ala Cys Thr Thr Cys 5975 5980 5985 Ala Cys Ala Thr Cys Cys Ala Ala Cys Gly Gly Cys Cys Cys Thr 5990 5995 6000 Gly Thr Gly Ala Thr Gly Cys Ala Gly Ala Ala Gly Ala Ala Ala 6005 6010 6015 Ala Cys Ala Cys Thr Cys Gly Gly Cys Thr Gly Gly Gly Ala Gly 6020 6025 6030 Gly Cys Cys Thr Thr Cys Ala Cys Cys Gly Ala Gly Ala Cys Gly 6035 6040 6045 Cys Thr Gly Thr Ala Cys Cys Cys Cys Gly Cys Thr Gly Ala Cys 6050 6055 6060 Gly Gly Cys Gly Gly Cys Cys Thr Gly Gly Ala Ala Gly Gly Cys 6065 6070 6075 Ala Gly Ala Ala Ala Cys Gly Ala Cys Ala Thr Gly Gly Cys Cys 6080 6085 6090 Cys Thr Gly Ala Ala Gly Cys Thr Cys Gly Thr Gly Gly Gly Cys 6095 6100 6105 Gly Gly Gly Ala Gly Cys Cys Ala Thr Cys Thr Gly Ala Thr Cys 6110 6115 6120 50/65 Gly Cys Ala Ala Ala Cys Ala Thr Cys Ala Ala Gly Ala Cys Cys 6125 6130 6135 Ala C s Ala Thr Ala Thr Ala Gly Ala Thr Cys Cys Ala Ala Gly 6140 6145 6150 Ala Ala Ala Cys Cys Cys Gly Cys Thr Ala Ala Gly Ala Ala Cys 6155 6160 6165 Cys Thr Cys Ala Ala Gly Ala Thr Gly Cys Cys Thr Gly Gly Cys 6170 6175 6180 Gly Thr Cys Thr Ala Cys Thr Ala Thr Gly Thr Gly Gly Ala Cys 6185 6190 6195 Thr Ala Cys Ala Gly Ala Cys Thr Gly Gly Ala Ala Ala Gly Ala 6200 6205 6210 Ala Thr Cys Ala Ala Gly Gly Ala Gly Gly Cys Cys Ala Ala Cys 6215 6220 6225 Ala Ala Cys Gly Ala Gly Ala Cys Cys Thr Ala Cys Gly Thr Cys 6230 6235 6240 Gly Ala Gly Cys Ala Gly Cys Ala Cys Gly Ala Gly Gly Thr Gly 6245 6250 6255 Gly Cys Ala Gly Thr Gly Gly Cys Cys Ala Gly Ala Thr Ala Cys 6260 6265 6270 Thr Gly Cys Gly Ala Cys Cys Thr Cys Cys Cys Thr Ala Gly Cys 6275 6280 6285 Ala Ala Ala Cys Thr Gly Gly Gly Gly Cys Ala Cys Ala Ala Gly 6290 6295 6300 Cys Thr Cys Ala Ala Thr 6305 <210> 18 <211> 6330 <212> PRT <213> Unknown <220> <223> Synthetic genetic construct <400> 18 Gly Ala Cys Thr Ala Cys Ala Ala Ala Gly Ala Cys Cys Ala Thr Gly 1 5 10 15 Ala Cys Gly Gly Thr Gly Ala Thr Thr Ala Thr Ala Ala Ala Gly Ala 20 25 30 Thr Cys Ala Thr Gly Ala Cys Ala Thr Cys Gly Ala Thr Thr Ala Cys 35 40 45 Ala Ala Gly Gly Ala Thr Gly Ala Cys Gly Ala Thr Gly Ala Cys Ala 50 55 60 Ala Gly Ala Thr Gly Gly Cys Cys Cys Cys Cys Ala Ala Gly Ala Ala 65 70 75 80 Gly Ala Ala Gly Ala Gly Gly Ala Ala Gly Gly Thr Gly Gly Gly Cys 85 90 95 Ala Thr Thr Cys Ala Cys Cys Gly Cys Gly Gly Gly Gly Thr Ala Cys 100 105 110 Cys Thr Ala Thr Gly Gly Thr Gly Gly Ala Cys Thr Thr Gly Ala Gly 115 120 125 Gly Ala Cys Ala Cys Thr Cys Gly Gly Thr Thr Ala Thr Thr Cys Gly 130 135 140 Cys Ala Ala Cys Ala Gly Cys Ala Ala Cys Ala Gly Gly Ala Gly Ala 145 150 155 160 Ala Ala Ala Thr Cys Ala Ala Gly Cys Cys Thr Ala Ala Gly Gly Thr 165 170 175 Cys Ala Gly Gly Ala Gly Cys Ala Cys Cys Gly Thr Cys Gly Cys Gly 180 185 190 Cys Ala Ala Cys Ala Cys Cys Ala Cys Gly Ala Gly Gly Cys Gly Cys 195 200 205 Thr Thr Gly Thr Gly Gly Gly Gly Cys Ala Thr Gly Gly Cys Thr Thr 210 215 220 Cys Ala Cys Thr Cys Ala Thr Gly Cys Gly Cys Ala Thr Ala Thr Thr 225 230 235 240 Gly Thr Cys Gly Cys Gly Cys Thr Thr Thr Cys Ala Cys Ala Gly Cys 245 250 255 Ala Cys Cys Cys Thr Gly Cys Gly Gly Cys Gly Cys Thr Thr Gly Gly 51/65 260 265 270 Gly Ala Cys Gly Gly Thr Gly Gly Cys Thr Gly Thr Cys Ala Ala Ala 275 280 285 Thr Ala Cys Cys Ala Ala Gly Ala Thr Ala Thr Gly Ala Thr Thr Gly 290 295 300 Cys Gly Gly Cys Cys Cys Thr Gly Cys Cys Cys Gly Ala Ala Gly Cys 305 310 315 320 Cys Ala Cys Gly Cys Ala Cys Gly Ala Gly Gly Cys Ala Ala Thr Thr 325 330 335 Gly Thr Ala Gly Gly Gly Gly Thr Cys Gly Gly Thr Ala Ala Ala Cys 340 345 350 Ala Gly Thr Gly Gly Thr Cys Gly Gly Gly Ala Gly Cys Gly Cys Gly 355 360 365 Ala Gly Cys Ala Cys Thr Thr Gly Ala Gly Gly Cys Gly Cys Thr Gly 370 375 380 Cys Thr Gly Ala Cys Thr Gly Thr Gly Gly Cys Gly Gly Gly Thr Gly 385 390 395 400 Ala Gly Cys Thr Thr Ala Gly Gly Gly Gly Gly Cys Cys Thr Cys Cys 405 410 415 Gly Cys Thr Cys Cys Ala Gly Cys Thr Cys Gly Ala Cys Ala Cys Cys 420 425 430 Gly Gly Gly Cys Ala Gly Cys Thr Gly Cys Thr Gly Ala Ala Gly Ala 435 440 445 Thr Cys Gly Cys Gly Ala Ala Gly Ala Gly Ala Gly Gly Gly Gly Gly 450 455 460 Ala Gly Thr Ala Ala Cys Ala Gly Cys Gly Gly Thr Ala Gly Ala Gly 465 470 475 480 Gly Cys Ala Gly Thr Gly Cys Ala Cys Gly Cys Cys Thr Gly Gly Cys 485 490 495 Gly Cys Ala Ala Thr Gly Cys Gly Cys Thr Cys Ala Cys Cys Gly Gly 500 505 510 Gly Gly Cys Cys Cys Cys Cys Thr Thr Gly Ala Ala Cys Cys Thr Gly 515 520 525 Ala Cys Cys Cys Cys Ala Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly 530 535 540 Thr Cys Gly Cys Ala Ala Thr Cys Gly Cys Gly Thr Cys Ala Ala Ala 545 550 555 560 Cys Gly Gly Ala Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala 565 570 575 Gly Cys Cys Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys 580 585 590 Ala Ala Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr 595 600 605 Cys Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 610 615 620 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys 625 630 635 640 Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys 645 650 655 Ala Ala Gly Cys Ala Ala Thr Gly Gly Gly Gly Gly Thr Gly Gly Cys 660 665 670 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala 675 680 685 Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr 690 695 700 Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 705 710 715 720 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr Cys 725 730 735 Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala Gly Cys 740 745 750 Gly Ala Thr Thr Gly Cys Gly Thr Cys Gly Ala Ala Cys Ala Thr Thr 755 760 765 Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr 770 775 780 Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly 785 790 795 800 Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly 52/65 805 810 815 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr 820 825 830 Thr Gly Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr 835 840 845 Gly Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 850 855 860 Cys Ala Thr Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys 865 870 875 880 Ala Gly Gly Cys Gly .Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr 885 890 895 Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 900 905 910 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys 915 920 925 Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala 930 935 940 Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 945 950 955 960 Gly Cys Gly Thr Cys Ala Ala Ala Cys Gly Gly Ala Gly Gly Gly Gly . 965 970 975 Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly Gly Ala 980 985 990 Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly Thr Thr Gly 995 1000 1005 Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr Thr Gly Thr 1010 1015 1020 Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys Cys Thr Thr 1025 1030 1035 Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala Gly Thr Cys 1040 1045 1050 Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala Ala Ala Thr 1055 1060 1065 Ala Ala Thr Ala Ala Cys Gly Gly Thr Gly Gly Cys Ala Ala Ala 1070 1075 1080 Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala Cys Gly 1085 1090 1095 Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr Cys 1100 1105 1110 Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 1115 1120 1125 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr 1130 1135 1140 Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala 1145 1150 1155 Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Gly Cys Ala Thr 1160 1165 1170 Gly Ala Cys Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala 1175 1180 1185 Gly Cys Ala Thr Th'r Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys 1190 1195 1200 Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys 1205 1210 1215 Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala Gly Cys Cys 1220 1225 1230 Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly Cys Cys Thr 1235 1240 1245 Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys Gly Cys Cys 1250 1255 1260 Ala Thr Cys Gly Cys Cys Thr Cys Gly Ala Ala Thr Gly Gly Cys 1265 1270 1275 Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly Gly Cys Gly 1280 1285 1290 Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala Cys Ala Gly 1295 1300 1305 Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr Gly Thr Ala 1310 1315 1320 Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys Ala Thr 53/65 1325 1330 1335 Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala Cys 1340 1345 1350 Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 1355 1360 1365 Gly Cys Gly Ala Ala Cys Ala Ala Thr Ala Ala Thr Gly Gly Gly 1370 1375 1380 Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly 1385 1390 1395 Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly 1400 1405 1410 Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr 1415 1420 1425 Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys 1430 1435 1440 Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala 1445 1450 1455 Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala 1460 1465 1470 Thr Cys Cys Cys Ala Cys Gly Ala Cys Gly Gly Thr Gly Gly Cys 1475 1480 1485 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly 1490 1495 1500 Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr 1505 1510 1515 Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr 1520 1525 1530 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly' Cys Thr Gl 1535 1540 1545 Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr 1550 1555 1560 Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Cys 1565 1570 1575 Ala Ala Cys Gly Gly Thr Gly Gly Ala Gly Gly Gly Ala Ala Ala 1580 1585 1590 Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr 1595 1600 1605 Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr 1610 1615 1620 Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala 1625 1630 1635 Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly 1640 1645 1650 Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys 1655 1660 1665 Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys Cys Ala Thr 1670 1675 1680 Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly 1685 1690 1695 Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala 1700 1705 1710 Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 1715 1720 1725 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr 1730 1735 1740 Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala 1745 1750 1755 Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala 1760 1765 1770 Ala Thr Cys Gly Cys Gly Thr Cys Ala Cys Ala Thr Gly Ala Cys 1775 1780 1785 Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys 1790 1795 1800 Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala 1805 1810 1815 Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys 1820 1825 1830 Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 54/65 1835 1840 1845 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly 1850 1855 1860 Cys Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr 1865 1870 1875 Gly Cys Ala Thr Cys Cys Cys Ala Cys Gly Ala Cys Gly Gly Thr 1880 1885 1890 Gly Gly Cys Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr 1895 1900 1905 Gly Ala Gly Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala 1910 1915 1920 Cys Thr Thr Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys 1925 1930 1935 thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly 1940 1945 1950 Cys Thr Gly Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala 1955 1960 1965 Gly Thr Thr Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly 1970 1975 1980 Ala Ala Thr Ala Ala Cys Ala Ala Thr Gly Gly Ala Gly Gly Gly 1985 1990 1995 Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly 2000 2005 2010 Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys 2015 2020 2025 Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr 2030 2035 2040 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly 2045 2050 2055 Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly 2060 2065 2070 Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 2075 2080 2085 Cys Ala Thr Gly Ala Thr Gly Gl Cys Gly Gly Thr Ala Ala Gly 2090 2095 2100 Cys Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala 2105 2110 2115 Gly Thr Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly 2120 2125 2130 Cys Cys Thr Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly 2135 2140 2145 Gly Ala Thr Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Ala 2150 2155 2160 Cys Cys Cys Gly Ala Ala Cys Ala Gly Gly Thr Gly Gly Thr Cys 2165 2170 2175 Gly Cys Cys Ala Thr Thr Gly Cys Thr Thr Cys Thr Ala Ala Thr 2180 2185 2190 Gly Gly Gly Gly Gly Ala Gly Gly Ala Cys Gly Gly Cys Cys Ala 2195 2200 2205 Gly Cys Cys Thr Thr Gly Gly Ala Gly Thr Cys Cys Ala Thr Cys 2210 2215 2220 Gly Thr Ala Gly Cys Cys Cys Ala Ala Thr Thr Gly Thr Cys Cys 2225 2230 2235 Ala Gly Gly Cys Cys Cys Gly Ala Thr Cys Cys Cys Gly Cys Gly 2240 2245 2250 Thr Thr Gly Gly Cys Thr Gly Cys Gly Thr Thr Ala Ala Cys Gly 2255 2260 2265 Ala Ala Thr Gly Ala Cys Cys Ala Thr Cys Thr Gly Gly Thr Gly 2270 2275 2280 Gly Cys Gly Thr Thr Gly Gly Cys Ala Thr Gly Thr Cys Thr Thr 2285 2290 2295 Gly Gly Thr Gly Gly Ala Cys Gly Ala Cys Cys Cys Gly Cys Gly 2300 2305 2310 Cys Thr Cys Gly Ala Thr Gly Cys Ala Gly Thr Cys Ala Ala Ala 2315 2320 2325 Ala Ala Gly Gly Gly Thr Cys Thr Gly Cys Cys Thr Cys Ala Thr 2330 2335 2340 Gly Cys Thr Cys Cys Cys Gly Cys Ala Thr Thr Gly Ala Thr Cys 55/65 2350 2355 Ala Ala Ala Ala Gly Ala Ala Cys Cys Ala Ala Cys Cys Gly Gly 2360 2365 2370 Cys Gly Gly Ala Thr Thr Cys Cys Cys Gly Ala Gly Ala Gly Ala 2375 2380 2385 Ala Cys Thr Thr Cys Cys Cys Ala Thr Cys Gly Ala Gly Thr Cys 2390 2395 2400 Gly Cys Gly Gly Gly Ala Thr Cys Cys Gly Ala Thr Cys Cys Ala 2405 2410 2415 Ala Ala Ala Ala Ala Gly Ala Ala Gly Ala Gly Ala Ala Ala Gly 2420 2425 2430 Gly Thr Ala Cys Cys Ala Ala Ala Ala Ala Ala Gly Ala Ala Gly 2435 2440 2445 Ala Gly Ala Ala Ala Gly Gly Thr Cys Gly Ala Cys Gly Gly Cys 2450 2455 2460 Gly Gly Thr Gly Gly Thr Gly Cys Thr Thr Thr Gly Thr Cys Thr 2465 2470 2475 Cys Cys Thr Cys Ala Gly Cys Ala Cys Thr Cys Thr Gly Cys Thr 2480 2485 2490 Gly Thr Cys Ala Cys Thr Cys Ala Ala Gly Gly Ala Ala Gly Thr 2495 2500 2505 Ala Thr Cys Ala Thr Cys Ala Ala Gly Ala Ala Cys Ala Ala Gly 2510 2515 2520 Gly Ala Gly Gly Gly Cys Ala Thr Gly Gly Ala Thr Gly Cys Thr 2525 2530 2535 Ala Ala' Gly Thr Cys Ala Cys Thr Ala Ala Cys Thr Gly Cys Cys 2540 2545 2550 Thr Gly Gly Thr Cys Cys Cys Gly Gly Ala Cys Ala Cys Thr Gly 2555 2560 2565 Gly Thr Gly Ala Cys Cys Thr Thr Cys Ala Ala Gly Gly Ala Thr 2570 2575 2580 Gly Thr Ala Thr Thr Thr Gly Thr Gly Gly Ala Cys Thr Thr Cys 2585 2590 2595 Ala Cys Cys Ala Gly Gly Gly Ala Gly Gly Ala Gly Thr Gly Gly 2600 2605 2610 Ala Ala Gly Cys Thr Gly Cys Thr Gly Gly Ala Cys Ala Cys Thr 2615 2620 2625 Gly Cys Thr Cys Ala Gly Cys Ala Gly Ala Thr Cys Gly Thr Gly 2630 2635 2640 Thr Ala Cys Ala Gly Ala Ala Ala Thr Gly Thr Gly Ala Thr Gly 2645 2650 2655 Cys Thr Gly Gly Ala Gly Ala Ala Cys Thr Ala Thr Ala Ala Gly 2660 2665 2670 Ala Ala Cys Cys Thr Gly Gly Thr Thr Thr Cys Cys Thr Thr Gly 2675 2680 2685 Gly Gly Thr Thr Ala Thr Cys Ala Gly Cys Thr Thr Ala Cys Thr 2690 2695 2700 Ala Ala Gly Cys Cys Ala Gly Ala Thr Gly Thr Gly Ala Thr Cys 2705 2710 2715 Cys Thr Cys Cys Gly Gly Thr Thr Gly Gly Ala Gly Ala Ala Gly 2720 2725 2730 Gly Gly Ala Gly Ala Ala Gly Ala Gly Cys Cys Cys Thr Gly Gly 2735 2740 2745 Cys Thr Gly Gly Thr Gly Gly Ala Gly Ala Gly Ala Gly Ala Ala 2750 2755 2760 Ala Thr Thr Cys Ala Cys Cys Ala Ala Gly Ala Gly Ala Cys Cys 2765 2770 2775 Cys Ala Thr Cys Cys Thr Gly Ala Thr Thr Cys Ala Gly Ala Gly 2780 2785 2790 Ala Cys Thr Gly Cys Ala Thr Thr Thr Gly Ala Ala Ala Thr Cys 2795 2800 2805 Ala Ala Ala Thr Cys Ala Thr Cys Ala Gly Thr Thr Ala Gly Ala 2810 2815 2820 Gly Cys Cys Gly Ala Gly Gly Gly Gly Ala Gly Ala Gly Gly Ala 2825 2830 2835 Ala Gly Thr Cys Thr Thr Cys Thr Gly Ala Cys Cys Thr Gly Cys 2840 2845 2850 Gly Gly Ala Gly Ala Cys Gly Thr Cys Gly Ala Ala Gly Ala Gly 56/65 2855 2860 2865 Ala Ala Thr Cys Cys Thr Gly Gly Ala Cys Cys Cys Gly Ala Cys 2870 2875 2880 Thr Ala Cys Ala Ala Ala Gly Ala Cys Cys Ala Thr Gly Ala Cys 2885 2890 2895 Gly Gly Thr Gly Ala Thr Thr Ala Thr Ala Ala Ala Gly Ala Thr 2900 2905 2910 Cys Ala Thr Gly Ala Cys Ala Thr Cys Gly Ala Thr Thr Ala Cys 2915 2920 2925 Ala Ala Gly Gly Ala Thr Gly Ala Cys Gly Ala Thr Gly Ala Cys 2930 2935 2940 Ala Ala Gly Ala Thr Gly Gly Cys Cys Cys Cys Cys Ala Ala Gly 2945 2950 2955 Ala Ala Gly Ala Ala Gly Ala Gly Gly Ala Ala Gly Gly Thr Gly 2960 2965 2970 Gly Gly Cys Ala Thr Thr Cys Ala Cys Cys Gly Cys Gly Gly Gly 2975 2980 2985 Gly Thr Ala Cys Cys Thr Ala Thr Gly Gly Thr Gly Gly Ala Cys 2990 2995 3000 Thr Thr Gly Ala Gly Gly Ala Cys Ala Cys Thr Cys Gly Gly Thr 3005 3010 3015 Thr Ala Thr Thr Cys Gly Cys Ala Ala Cys Ala Gly Cys Ala Ala 3020 3025 3030 Cys Ala Gly Gly Ala Gly Ala Ala Ala Ala Thr Cys Ala Ala Gly 3035 3040 3045 Cys Cys Thr Ala Ala Gly Gly Thr Cys Ala Gly Gly Ala Gly Cys 3050 3055 3060 Ala Cys Cys Gly Thr Cys Gly Cys Gly Cys Ala Ala Cys Ala Cys 3065 3070 3075 Cys Ala Cys Gly Ala Gly Gly Cys Gly Cys Thr Thr Gly Thr Gly 3080 3085 3090 Gly Gly Gly Cys Ala Thr Gly Gly Cys Thr Thr Cys Ala Cys Thr 3095 3100 3105 Cys Ala Thr Gly Cys Gly Cys Ala Thr Ala Thr Thr Gly Thr Cys 3110 3115 3120 Gly Cys Gly Cys Thr Thr Thr Cys Ala Cys Ala Gly Cys Ala Cys 3125 3130 3135 Cys Cys Thr Gly Cys Gly Gly Cys Gly Cys Thr Thr Gly Gly Gly 3140 3145 3150 Ala Cys Gly Gly Thr Gly Gly Cys Thr Gly Thr Cys Ala Ala Ala 3155 3160 3165 Thr Ala Cys Cys Ala Ala Gly Ala Thr Ala Thr Gly Ala Thr Thr 3170 3175 3180 Gly Cys Gly Gly Cys Cys Cys Thr Gly Cys Cys Cys Gly Ala Ala 3185 3190 3195 Gly Cys Cys Ala Cys Gly Cys Ala Cys Gly Ala Gly Gly Cys Ala 3200 3205 3210 Ala Thr Thr Gly Thr Ala Gly Gly Gly Gly Thr Cys Gly Gly Thr 3215 3220 3225 Ala Ala Ala Cys Ala Gly Thr Gly Gly Thr Cys Gly Gly Gly Ala 3230 3235 3240 Gly Cys Gly Cys Gly Ala Gly Cys Ala Cys Thr Thr Gly Ala Gly 3245 3250 3255 Gly Cys Gly Cys Thr Gly Cys Thr Gly Ala Cys Thr Gly Thr Gly 3260 3265 3270 Gly Cys Gly Gly Gly Thr Gly Ala Gly Cys Thr Thr Ala Gly Gly 3275 3280 3285 Gly Gly Gly Cys Cys Thr Cys Cys Gly Cys Thr Cys Cys Ala Gly 3290 3295 3300 Cys Thr Cys Gly Ala Cys Ala Cys Cys Gly Gly Gly Cys Ala Gly 3305 3310 3315 Cys Thr Gly Cys Thr Gly Ala Ala Gly Ala Thr Cys Gly Cys Gly 3320 3325 3330 Ala Ala Gly Ala Gly Ala Gly Gly Gly Gly Gly Ala Gly Thr Ala 3335 3340 3345 Ala Cys Ala Gly Cys Gly Gly Thr Ala Gly Ala Gly Gly Cys Ala 3350 3355 3360 Gly Thr Gly Cys Ala Cys Gly Cys Cys Thr Gly Gly Cys Gly Cys 57/65 3365 3370 3375 Ala Ala Thr Gly Cys Gly Cys Thr Cys Ala Cys Cys Gly Gl Gly 3380 3385 3390 Gly Cys Cys Cys Cys Cys Thr Thr Gly Ala Ala Cys Cys Thr Gly 3395 3400 3405 Ala Cys Cys Cys Cys Ala Gly Ala Cys Cys Ala Gly Gly Thr Ala 3410 3415 3420 Gly Thr Cys Gly Cys Ala Ala Thr Cys Gly Cys Gly Thr Cys Ala 3425 3430 3435 Cys Ala Thr Gly Ala Cys Gly Gly Gly Gly Gly Ala Ala Ala Gly 3440 3445 3450 Cys Ala Ala Gly Cys Cys Cys Thr Gly Gly Ala Ala Ala Cys Cys 3455 3460 3465 Gly Thr Gly Cys Ala Ala Ala Gly Gly Thr Thr Gly Thr Thr Gly 3470 3475 3480 Cys Cys Gly Gly Thr Cys Cys Thr Thr Thr Gly Thr Cys Ala Ala 3485 3490 3495 Gly Ala Cys Cys Ala Cys Gly Gly Cys Cys Thr Thr Ala Cys Ala 3500 3505 3510 Cys Cys Gly Gly Ala Gly Cys Ala Ala Gly Thr Cys Gly Thr Gly 3515 3520 3525 Gly Cys Cys Ala Thr Thr Gly Cys Ala Ala Gly Cys Ala Ala Thr 3530 3535 3540 Gly Gly Gly Gly Gly Thr Gly Gly Cys Ala Ala Ala Cys Ala Gly 3545 3550 3555 Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala Cys Gly Gly Thr Thr 3560 3565 3570 Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr Cys Cys Cys Ala 3575 3580 3585 Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala Gly Cys Cys 3590 3595 3600 Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr Cys Cys Cys 3605 3610 3615 Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala Gly Cys Gly 3620 3625 3630 Ala Thr Thr Gly Cys Gly Thr Cys Cys Ala Ala Cys Gly Gly Thr 3635 3640 3645 Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala Gly Cys Ala 3650 3655 3660 Thr Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys Cys Ala Ala 3665 3670 3675 Cys Gly Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys Gly Thr Gly 3680 3685 3690 Thr Thr Gly Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys 3695 3700 3705 Gly Gly Thr Thr Thr Gly Ala Cys Gly Cys Cys Thr Gly Cys Ala 3710 3715 3720 Cys Ala Ala Gly Thr Gly Gly Thr Cys Gly Cys Cys Ala Thr Cys 3725 3730 3735 Gly Cys Cys Ala Gly Cys Cys Ala Thr Gly Ala Thr Gly Gly Cys 3740 3745 3750 Gly Gly Thr Ala Ala Gly Cys Ala Gly Gly Cys Gly Cys Thr Gly 3755 3760 3765 Gly Ala Ala Ala Cys Ala Gly Thr Ala Cys Ala Gly Cys Gly Cys 3770 3775 3780 Cys Thr Gly Cys Thr Gly Cys Cys Thr Gly Thr Ala Cys Thr Gly 3785 3790 3795 Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys Ala Thr Gly Gly Ala 3800 3805 3810 Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala Cys Cys Ala Gly 3815 3820 3825 Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys Gly Cys Gly 3830 3835 3840 Thr Cys Ala Cys Ala Thr Gly Ala Cys Gly Gly Gly Gly Gly Ala 3845 3850 3855 Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly Gly Ala Ala 3860 3865 3870 Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly Thr Thr Gly 58/65 3875 3880 3885 Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr Thr Gly Thr 3890 3895 3900 Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys Cys Thr Thr 3905 3910 3915 Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala Gly Thr Cys 3920 3925 3930 Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala Ala Ala Thr 3935 3940 3945 Ala Ala Thr Ala Ala Cys Gly Gly Thr Gly Gly Cys Ala Ala Ala 3950 3955 3960 Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly Ala Cys Gly 3965 3970 3975 Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr Cys Thr Cys 3980 3985 3990 Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr Cys Ala Ala 3995 4000 4005 Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly Ala Cys Thr 4010 4015 4020 Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr Gly Thr Ala 4025 4030 4035 Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Cys Ala Ala Cys 4040 4045 4050 Gly Gly Thr Gly Gly Ala Gly Gly Gly Ala Ala Ala Cys Ala Ala 4055 4060 4065 Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr Gly Thr Cys 4070 4075 4080 Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr Cys Cys Cys 4085 4090 4095 Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala Gly Cys Cys 4100 4105 4110 Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly Cys Cys Thr 4115 4120 4125 Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys Gly Cys Cys 4130 4135 4140 Ala Thr Cys Gly Cys Cys Thr Cys Gly Ala Ala Thr Gly Gly Cys 4145 4150 4155 Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly Gly Cys Gly 4160 4165 4170 Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala Cys Ala Gly 4175 4180 4185 Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr Gly Thr Ala 4190 4195 4200 Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr Cys Ala Thr 4205 4210 4215 Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala Gly Ala Cys 4220 4225 4230 Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala Ala Thr Cys 4235 4240 4245 Gly Cys Gly Thr Cys Ala Ala Ala Cys Gly Gly Ala Gly Gly Gly 4250 4255 4260 Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys Cys Thr Gly 4265 4270 4275 Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala Ala Gly Gly 4280 4285 4290 Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys Cys Thr Thr 4295 4300 4305 Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys Gly Gly Cys 4310 4315 4320 Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly Cys Ala Ala 4325 4330 4335 Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr Gly Cys Ala 4340 4345 4350 Thr Cys Cys Cys Ala Cys Gly Ala Cys Gly Gly Thr Gly Gly Cys 4355 4360 4365 Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr Gly Ala Gly 4370 4375 4380 Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala Cys Thr Thr 59/65 4385 4390 4395 Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys Thr Gly Thr 4400 4405 4410 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly Cys Thr Gly 4415 4420 4425 Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala Gly Thr Thr 4430 4435 4440 Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly Thr Cys Gly 4445 4450 4455 Cys Ala Thr Gly Ala Cys Gly Gly Ala Gly Gly Gly Ala Ala Ala 4460 4465 4470 Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly Ala Cys Thr 4475 4480 4485 Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys Cys Thr Thr 4490 4495 4500 Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr Cys Ala Ala 4505 4510 4515 Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly Ala Cys Gly 4520 4525 4530 Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly Gly Thr Cys , 4535 4540 4545 Gly Cys Cys Ala Thr Cys Gly Cys Cys Thr Cys Cys Ala Ala Thr 4550 4555 4560 Ala Thr Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly Cys Ala Gly 4565 4570 4575 Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala Gly Thr Ala 4580 4585 4590 Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly Cys Cys Thr 45.95 4600 4605 Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly Gly Ala Thr 4610 4615 4620 Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Cys Cys Cys Ala 4625 4630 4635 Gly Ala Cys Cys Ala Gly Gly Thr Ala Gly Thr Cys Gly Cys Ala 4640 4645 4650 Ala Thr Cys Gly Cys Gly Thr Cys Ala Cys Ala Thr Gly Ala Cys 4655 4660 4665 Gly Gly Gly Gly Gly Ala Ala Ala Gly Cys Ala Ala Gly Cys Cys 4670 4675 4680 Cys Thr Gly Gly Ala Ala Ala Cys Cys Gly Thr Gly Cys Ala Ala 4685 4690 4695 Ala Gly Gly Thr Thr Gly Thr Thr Gly Cys Cys Gly Gly Thr Cys 4700 4705 4710 Cys Thr Thr Thr Gly Thr Cys Ala Ala Gly Ala Cys Cys Ala Cys 4715 4720 4725 Gly Gly Cys Cys Thr Thr Ala Cys Ala Cys Cys Gly Gly Ala Gly 4730 4735 4740 Cys Ala Ala Gly Thr Cys Gly Thr Gly Gly Cys Cys Ala Thr Thr 4745 4750 4755 Gly Cys Ala Ala Gly Cys Ala Ala Cys Ala Thr Cys Gly Gly Thr 4760 4765 4770 Gly Gly Cys Ala Ala Ala Cys Ala Gly Gly Cys Thr Cys Thr Thr 4775 4780 4785 Gly Ala Gly Ala Cys Gly Gly Thr Thr Cys Ala Gly Ala Gly Ala 4790 4795 4800 Cys Thr Thr Cys Thr Cys Cys Cys Ala Gly Thr Thr Cys Thr Cys 4805 4810 4815 Thr Gly Thr Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Gly 4820 4825 4830 Cys Thr Gly Ala Cys Thr Cys Cys Cys Gly Ala Thr Cys Ala Ala 4835 4840 4845 Gly Thr Thr Gly Thr Ala Gly Cys Gly Ala Thr Thr Gly Cys Gly 4850 4855 4860 Thr Cys Cys Ala Ala Cys Gly Gly Thr Gly Gly Ala Gly Gly Gly 4865 4870 4875 Ala Ala Ala Cys Ala Ala Gly Cys Ala Thr Thr Gly Gly Ala Gly 4880 4885 4890 Ala Cys Thr Gly Thr Cys Cys Ala Ala Cys Gly Gly Cys Thr Cys 60/65 4895 4900 4905 Cys Thr Thr Cys Cys Cys Gly Thr Gly Thr Thr Gly Thr Gly Thr 4910 4915 4920 Cys Ala Ala Gly Cys Cys Cys Ala Cys Gly Gly Thr Thr Thr Gly 4925 4930 4935 Ala Cys Gly Cys Cys Thr Gly Cys Ala Cys Ala Ala Gly Thr Gly 4940 4945 4950 Gly Thr Cys Gly Cys Cys Ala Thr Cys Gly Cys Cys Ala Gly Cys 4955 4960 4965 Cys Ala Thr Gly Ala Thr Gly Gly Cys Gly Gly Thr Ala Ala Gly 4970 4975 4980 Cys Ala Gly Gly Cys Gly Cys Thr Gly Gly Ala Ala Ala Cys Ala 4985 4990 4995 Gly Thr Ala Cys Ala Gly Cys Gly Cys Cys Thr Gly Cys Thr Gly 5000 5005 5010 Cys Cys Thr Gly Thr Ala Cys Thr Gly Thr Gly Cys Cys Ala Gly 5015 5020 5025 Gly Ala Thr Cys Ala Thr Gly Gly Ala Cys Thr Gly Ala Cys Ala 5030 5035 5040 Cys Cys Cys Gly Ala Ala Cys Ala Gly Gly Thr Gly Gly Thr Cys 5045 5050 5055 Gly Cys Cys Ala Thr Thr Gly Cys Thr Thr Cys Thr Ala Ala Thr 5060 5065 5070 Gly Gly Gly Gly Gly Ala Gly Gly Ala Cys Gly Gly Cys Cys Ala 5075 5080 5085 Gly Cys Cys Thr Thr Gly Gly Ala Gly Thr Cys Cys Ala Thr Cys 5090 5095 5100 Gly Thr Ala Gly Cys Cys Cys Ala Ala Thr Thr Gly Thr Cys Cys 5105 5110 5115 Ala Gly Gly Cys Cys Cys Gly Ala Thr Cys Cys Cys Gly Cys Gly 5120 5125 5130 Thr Thr Gly Gly Cys Thr Gly Cys Gly Thr Thr Ala Ala Cys Gly 5135 5140 5145 Ala Ala Thr Gly Ala Cys Cys Ala Thr Cys Thr Gly Gly Thr Gly 5150 5155 5160 Gly Cys Gly Thr Thr Gly Gly Cys Ala Thr Gly Thr Cys Thr Thr 5165 5170 5175 Gly Gly Thr Gly Gly Ala Cys Gly Ala Cys Cys Cys Gly Cys Gly 5180 5185 5190 Cys Thr Cys Gly Ala Thr Gly Cys Ala Gly Thr Cys Ala Ala Ala 5195 5200 5205 Ala Ala Gly Gly Gly Thr Cys Thr Gly Cys Cys Thr Cys Ala Thr 5210 5215 5220 Gly Cys Thr Cys Cys Cys Gly Cys Ala Thr Thr Gly Ala Thr Cys 5225 5230 5235 Ala Ala Ala Ala Gly Ala Ala Cys Cys Ala Ala Cys Cys Gly Gly 5240 5245 5250 Cys Gly Gly Ala Thr Thr Cys Cys Cys Gly Ala Gly Ala Gly Ala 5255 5260 5265 Ala Cys Thr Thr Cys Cys Cys Ala Thr Cys Gly Ala Gly Thr Cys 5270 5275 5280 Gly Cys Gly Gly Gly Ala Thr Cys Cys Gly Ala Thr Cys Cys Ala 5285 5290 5295 Ala Ala Ala Ala Ala Gly Ala Ala Gly Ala Gly Ala Ala Ala Gly 5300 5305 5310 Gly Thr Ala Gly Cys Cys Cys Cys Cys Cys Cys Gly Ala Cys Cys 5315 5320 5325 Gly Ala Thr Gly Thr Cys Ala Gly Cys Cys Thr Gly Gly Gly Gly 5330 5335 5340 Gly Ala Cys Gly Ala Gly Cys Thr Cys Cys Ala Cys Thr Thr Ala 5345 5350 5355 Gly Ala Cys Gly Gly Cys Gly Ala Gly Gly Ala Cys Gly Thr Gly 5360 5365 5370 Gly Cys Gly Ala Thr Gly Gly Cys Gly Cys Ala Thr Gly Cys Cys 5375 5380 5385 Gly Ala Cys Gly Cys Gly Cys Thr Ala Gly Ala Cys Gly Ala Thr 5390 5395 5400 Thr Thr Cys Gly Ala Thr Cys Thr Gly Gly Ala Cys Ala Thr Gly 61/65 5405 5410 5415 Thr Thr Gly Gly Gly Gly Gly Ala Cys Gly Gly Gly Gly Ala Thr 5420 5425 5430 Thr Cys Cys Cys Cys Gly Gly Gly Thr Cys Cys Gly Gly Gly Ala 5435 5440 5445 Thr Thr Thr Ala Cys Cys Cys Cys Cys Cys Ala Cys Gly Ala Cys 5450 5455 5460 Thr Cys Cys Gly Cys Cys Cys Cys Cys Thr Ala Cys Gly Gly Cys 5465 5470 5475 Gly Cys Thr Cys Thr Gly Gly Ala Thr Ala Thr Gly Gly Cys Cys 5480 5485 5490 Gly Ala Cys Thr Thr Cys Gly Ala Gly Thr Thr Thr Gly Ala Gly 5495 5500 5505 Cys Ala Gly Ala Thr Gly Thr Thr Thr Ala Cys Cys Gly Ala Thr 5510 5515 5520 Gly Cys Cys Cys Thr Thr Gly Gly Ala Ala Thr Thr Gly Ala Cys 5525 5530 5535 Gly Ala Gly Thr Ala Cys Gly Gly Thr Gly Gly Gly Ala Gly Ala 5540 5545 5550 Gly Cys Cys Gly Ala Gly Gly Gly Gly Ala Gly Ala Gly Gly Ala 5555 5560 5565 Ala Gly Thr Cys Thr Thr Cys Thr Gly Ala Cys Cys Thr Gly Cys 5570 5575 5580 Gly Gly Ala Gly Ala Cys Gly Thr Cys Gly Ala Ala Gly Ala Gly 5585 5590 5595 Ala Ala Thr Cys Cys Thr Gly Gly Ala Cys Cys Cys Ala Thr Gly 5600 5605 5610 Gly Thr Gly Ala Gly Cys Ala Ala Gly Gly Gly Cys Gly Ala Gly 5615 5620 5625 Gly Ala Gly Cys Thr Gly Thr Thr Cys Ala Cys Cys Gly Gly Gly 5630 5635 5640 Gly Thr Gly Gly Thr Gly Cys Cys Cys Ala Thr Cys Cys Thr Gly 5645 5650 5655 Gly Thr Cys Gly Ala Gly Cys Thr Gly Gly Ala Cys Gly Gly Cys 5660 5665 5670 Gly Ala Cys Gly Thr Ala Ala Ala Cys Gly Gly Cys Cys Ala Cys 5675 5680 5685 Ala Ala Gly Thr Thr Cys Ala Gly Cys Gly Thr Gly Thr Cys Cys 5690 5695 5700 Gly Gly Cys Gly Ala Gly Gly Gly Cys Gly Ala Gly Gly Gly Cys 5705 5710 5715 Gly Ala Thr Gly Cys Cys Ala Cys Cys Thr Ala Cys Gly Gly Cys 5720 5725 5730 Ala Ala Gly Cys Thr Gly Ala Cys Cys Cys Thr Gly Ala Ala Gly 5735 5740 5745 Thr Thr Cys Ala Thr Cys Thr Gly Cys Ala Cys Cys Ala Cys Cys 5750 5755 5760 Gly Gly Cys Ala Ala Gly Cys Thr Gly Cys Cys Cys Gly Thr Gly 5765 5770 5775 Cys Cys Cys Thr Gly Gly Cys Cys Cys Ala Cys Cys Cys Thr Cys 5780 5785 5790 Gly Thr Gly Ala Cys Cys Ala Cys Cys Thr Thr Cys Gly Gly Cys 5795 5800 5805 Thr Ala Cys Gly Gly Cys Cys Thr Gly Ala Thr Gly Thr Gly Cys 5810 5815 5820 Thr Thr Cys Gly ■Cys Cys Cys Gly Cys Thr Ala Cys Cys Cys Cys 5825 5830 5835 Gly Ala Cys Cys Ala Cys Ala Thr Gly Ala Ala Gly Cys Ala Gly 5840 5845 5850 Cys Ala Cys Gly Ala Cys Thr Thr Cys Thr Thr Cys Ala Ala Gly 5855 5860 5865 Thr Cys Cys Gly Cys Cys Ala Thr Gly Cys Cys Cys Gly Ala Ala 5870 5875 5880 Gly Gly Cys Thr Ala Cys Gly Thr Cys Cys Ala Gly Gly Ala Gly 5885 5890 5895 Cys Gly Cys Ala Cys Cys Ala Thr Cys Thr Thr Cys Thr Thr Cys 5900 5905 5910 Ala Ala Gly Gly Ala Cys Gly Ala Cys Gly Gly Cys Ala Ala Cys 62/65 5915 5920 5925 Thr Ala Cys Ala Ala Gly Ala Cys Cys Cys Gly Cys Gly Cys Cys 5930 5935 5940 Gly Ala Gly Gly Thr Gly Ala Ala Gly Thr Thr Cys Gly Ala Gly 5945 5950 5955 Gly Gly Cys Gly Ala Cys Ala Cys Cys Cys Thr Gly Gly Thr Gly 5960 5965 5970 Ala Ala Cys Cys Gly Cys Ala Thr Cys Gly Ala Gly Cys Thr Gly 5975 5980 5985 Ala Ala Gly Gly Gly Cys Ala Thr Cys Gly Ala Cys Thr Thr Cys 5990 5995 6000 Ala Ala Gly Gly Ala Gly Gly Ala Cys Gly Gly Cys Ala Ala Cys 6005 6010 6015 Ala Thr Cys Cys Thr Gly Gly Gly Gly Cys Ala Cys Ala Ala Gly 6020 6025 6030 Cys Thr Gly Gly Ala Gly Thr Ala Cys Ala Ala Cys Thr Ala Cys 6035 6040 6045 Ala Ala Cys Ala Gly Cys Cys Ala Cys Ala Ala Cys Gly Thr Cys 6050 6055 6060 Thr Ala Thr Ala Thr Cys Ala Thr Gly Gly Cys Cys Gly Ala Cys 6065 6070 6075 Ala Ala Gly Cys Ala Gly Ala Ala Gly Ala Ala Cys Gly Gly Cys 6080 6085 6090 Ala Thr Cys Ala Ala Gly Gly Thr Gly Ala Ala Cys Thr Thr Cys 6095 6100 6105 Ala Ala Gly Ala Thr Cys Cys Gly Cys Cys Ala Cys Ala Ala Cys 6110 6115 6120 Ala Thr Cys Gly Ala Gly Gly Ala Cys Gly Gly Cys Ala Gly Cys 6125 6130 6135 Gly Thr Gly Cys Ala Gly Cys Thr Cys Gly Cys Cys Gly Ala Cys 6140 6145 6150 Cys Ala Cys Thr Ala Cys Cys Ala Gly Cys Ala Gly Ala Ala Cys 6155 6160 6165 Ala Cys Cys Cys Cys Cys Ala Thr Cys Gly Gly Cys Gly Ala Cys 6170 6175 6180 Gly Gl Cys Cys Cys Cys Gly Thr Gly Cys Thr Gly Cys Thr Gly 6185 6190 6195 Cys Cys Cys Gly Ala Cys Ala Ala Cys Cys Ala Cys Thr Ala Cys 6200 6205 6210 Cys Thr Gly Ala Gly Cys Thr Ala Cys Cys Ala Gly Thr Cys Cys 6215 6220 6225 Gly Cys Cys Cys Thr Gly Ala Gly Cys Ala Ala Ala Gly Ala Cys 6230 6235 6240 Cys Cys Cys Ala Ala Cys Gly Ala Gly Ala Ala Gly Cys Gly Cys 6245 6250 6255 Gly Ala Thr Cys Ala Cys Ala Thr Gly Gly Thr Cys Cys Thr Gly 6260 6265 6270 Cys Thr Gly Gly Ala Gly Thr Thr Cys Gly Thr Gly Ala Cys Cys 6275 6280 6285 Gly Cys Cys Gly Cys Cys Gly Gly Gly Ala Thr Cys Ala Cys Thr 6290 6295 6300 Cys Thr Cys Gly Gly Cys Ala Thr Gly Gly Ala Cys Gly Ala Gly 6305 6310 6315 Cys Thr Gly Thr Ala Cys Ala Ala Gly Thr Ala Ala 6320 6325 6330 <210> 19 <211> 17 <212> DNA <213> Unknown <220> <223> DNA-binding domain <400> 19 taaagagcag aaactca 17 <210> 20 <211> 18 63/65 <212> DNA <213> Unknown <220> <223> DNA-binding domain <400> 20 tttactgctg ctcccgct <210> 21 <211> 1762 <212> DNA <213> Unknown <220> <223> pCMV-tTR: KRAB <400> 21 gagcttggcc cattgcatac gttgtatcca tatcataata tgtacattta tattggctca 60 tgtccaacat taccgccatg ttgacattga ttattgacta gttattaata gtaatcaatt 120 acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact tacggtaaat 180 ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat gacgtatgtt 240 cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggagta tttacgctaa 300 actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc tattgacgtc 360 aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg ggactttcct 420 acttggcagt acatctacgt attagtcatc gctattacca tggtgatgcg gttttggcag 480 tacatcaatg ggcgtggata gcggtttgac tcacggggat ttccaagtct ccaccccatt 540 gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg actttccaaa atgtcgtaac 600 aactccgccc cattgacgca aatgggcggt aggcgtgtac ggtgggaggt ctatataagc 660 agagctcgtt tagtgaaccg tcagatcgcc tggagacgcc atccacgctg ttttgacctc 720 catagaagac accgggaccg atccagcctc cgcggaattc gccaccatgg ctagattaga 780 taaaagtaaa gtgattaaca gcgcattaga gctgcttaat gaggtcggaa tcgaaggttt 840 aacaacccgt aaactcgccc agaagctagg tgtagagcag cctacattgt attggcatgt 900 aaaaaataag cgggctttgc tcgacgcctt agccattgag atgttagata ggcaccatac 960 tcacttttgc cctttagaag gggaaagctg gcaagatttt ttacgtaata acgctaaaag 1020 ttttagatgt gctttactaa gtcatcgcga tggagcaaaa gtacatttag gtacacggcc 1080 tacagaaaaa cagtatgaaa ctctcgaaaa tcaattagcc tttttatgcc aacaaggttt 1140 ttcactagag aatgcattat atgcactcag cgctgtgggg cattttactt taggttgcgt 1200 attggaagat caagagcatc aagtcgctaa agaagaaagg gaaacaccta ctactgatag 1260 tatgccgcca ttattacgac aagctatcga attatttgat caccaaggtg cagagccagc 1320 cttcttattc ggccttgaat tgatcatatg cggattagaa aaacaactta aatgtgaaag 1380 tgggtcccgc gcgccagatc caaaaaagaa gagaaaggta gatccaaaaa agaagagaaa 1440 ggtagatcca aaaaagaaga gaaaggtaat ggatgctaag tcactaactg cctggtcccg 1500 gacactggtg accttcaagg atgtatttgt ggacttcacc agggaggagt ggaagctgct 1560 ggacactgct cagcagatcg tgtacagaaa tgtgatgctg gagaactata agaacctggt 1620 ttccttgggt tatcagctta ctaagccaga tgtgatcctc cggttggaga agggagaaga 1680 gccctggctg gtggagagag aaattcacca agagacccat cctgattcag agactgcatt 1740 tgaaatcaaa tcatcagttt aa 1762 <210> 22 <211> 8363 <212> DNA <213> Unknown <220> <223> plasmid ekdison induktor <400> 22 cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg 60 tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc 120 tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct ttagggttcc 180 gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta 240 gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta 300 atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg 360 atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa 420 aatttaacgc gaattttaac aaaatattaa cgcttacaat ttacgcgtcc agacatgata 480 agatacattg atgagtttgg acaaaccaca actagaatgc agtgaaaaaa atgctttatt 540 tgtgaaattt gtgatgctat tgctttattt gtaaccatta taagctgcaa taaacaagtt 600 aacaacaaca attgcattca ttttatgttt caggttcagg gggaggtgtg ggaggttttt 660 taaagcaagt aaaacctcta caaatgtggt atggctgatt atgatcatga acagactgtg 720 aggactgagg ggcctgaaat gagccttggg actgtgatct aaaatacaca aacaattaga 780 64/65 atcagtagtt taacacatta tacacttaaa aattttatat ttaccttaga gctttaaatc 840 tctgtaggta gtttgtccaa ttatgtcaca ccacagaagt aaggttcctt cacaaagatc 900 gcggccgcct aagtcatttg gtgcggcgcc tccagcatct ccataaggaa ggtgtcaatg 960 ggtgtgtccc cgatgagctt gaagaagaag agatgttcca ggcatttgag cccgatggag 1020 cgcagagccg gcaggcggag caagagctta gcgaaccttc ccggctgctc tgggtacttg 1080 tgcttgcagt aggcctccaa ggacgcatag accttctccc tcagcgcctc cacctcggcc 1140 gggttcgaga gccccttgga gtcagggtta aagaggacga tggcgcgcag gcagcccagc 1200 tccgtcttgt ccatctgcat gtcccgcatc ttggacacaa gctccgtcag caccctgtca 1260 aagatggcgc ccacccctgc gctgtgggcg ctgttccggt ggacgtgcag cccggtggcc 1320 aggaggatcc cgtccttcac ggcgatggag cggtgggaga aggaggcgat gagcagctca 1380 ttccagcctg cccgcagcag gatgacctgg tcgtccaggg gcagctctga gaagtgtggg 1440 atccgcttgg cccactccac cagggtgaaa agctgtttgt cggctgcttg gcaaatgttg 1500 gtgacagggt cgttcggcga gctggggttc agccccatgt ttgcctccac gtaggtctcg 1560 gtcttgggct ccacggccag ctcagcctcc aggatcctct ccaccggcat gtcctcgttg 1620 gcgctgctgg tcgactccac ctcattctcg ttccggtcct tgccacgctg ccgctcctcc 1680 tgcacggctt cccgcttcat gcccatggcc aggcacttct ggtagcggca gtactggcac 1740 cggttccgct gccgcttgtc aatcaggcag tccttgttgt cgcggcaggt gtaggtcagg 1800 tccttgcgca ccgtccgctt gaagaagccc ttgcacccct cgcagctgta cactccatag i860 tgcttgcctg aggagcggtc cccgcagatg gcgcagatgt gcttggtgaa ggaagccatg 1920 tttcctgagg ggtgggcggg gaccttgagg acgccattga ggcccagggg gggcttgatg 1980 tcctcgctgc tgctgacggg gttcataggt gagctgagct gggggctgcc agtgctgaag 2040 cccagggtgg gtgtggtggg caccgacatg gagtgggggc ccatgg'ggga gctgatgacc 2100 gagaaaggcg ggcccatgcc gttgatgggg gagctcaggg tgctgatggg agaatgcagc 2160 tgtcccgggg agccgatgcc aggccccagg gacgggtgca gcgagggggc agccatggag 2220 cctcgccccg tcggggaggt gagggaggag ttcacctggg tggagaaatc gagcggcagg 2280 aaatgtttgg tgtccaagtc ttcttcagaa ataagttttt gttccatatt atcgatcgtg 2340 tttttcaaag gaaaaccacg tccccgtggt tcggggggcc tagacgtttt ttaacctcga 2400 ctaaacacat gtaaagcatg tgcaccgagg ccccagatca gatcccatac aatggggtac 2460 cttctgggca tccttcagcc ccttgttgaa tacgcttgag gagagccatt tgactctttc 2520 cacaactatc caactcacaa cgtggcactg gggttgtgcc gcctttgcag gtgtatctta 2580 tacacgtggc ttttggccgc agaggcacct gtcgccaggt ggggggttcc gctgcctgca 2640 aagggtcgct acagacgttg tttgtcttca agaagcttcc agaggaactg cttccttcac 2700 gacattcaac agaccttgca ttcctttggc gagaggggaa agacccctag gaatgctcgt 2760 caagaagaca gggccaggtt tccgggccct cacattgcca aaagacggca atatggtgga 2820 aaataacata tagacaaacg cacaccggcc ttattccaag cggcttcggc cagtaacgtt 2880 aggggggggg gagggagagg ggggtaaccc tatgcagtcg tcgagtgctc cgacttaacc 2940 gccactcccc cgataagctg ctcttggtgc gaatgcaggg ctacacccat caaggccatc 3000 gccgtctggg cgttcgcata catgctgacg ttgccgccca ccccaacacc gactccaact 3060 ccgttgccca tcggtaccgc tgatgtggtg gagctagcgg taacggcagc cgtgatactg 3120 ctggtggttg ccggcgtgat gggtcctatg gccgcacttc cgcccatgta ttcgctgctc 3180 gtactgaccg cggacaagga accaggtgcg gttacggagg cgggcacggg agcggagacg 3240 ggaaggagct gtggctgtgg ttgaatctgt ggctggagtt gcgtctgaag ctgtgggtgg 3300 agctggggtt gcagttgacc ttgcagctga ggtggtagct gaggttgtag ctgcggctgt 3360 gtctggtgct gggaatcgtt ctgggtcagg gaggagggtt ggggctgggg ctgaggctga 3420 ggctgatgct gggccgcggc tgccgccgcc gaagtggagg cagagtcgca atcaatgccg 3480 gcggtaatgg cgcccccaac cgatgcccgc atacgctcag cccgctcgag acgctcgttc 3540 tcctcctggg taatctgaag gtgcgactgg accgatggcg ggatggcatg aacgtcccag 3600 atctcctcga ggaacttggg cagtttgcgg tttttgagct ttagtgagaa acacatctcg 3660 gcgttctggt tgcccagcgt acgcagctcg gtgaggatcg agagcagctt tgcgtagaag 3720 acgaggctca ttgagtcgcc gcagtggcgg ttgagtatat aaatgcgtag cgtgtcgatg 3780 tagtagctct ggatcgcttc gactagctgg gccttctcca ggcccggccg gtccgagaag 3840 atcacaatgg cagtgagaag cgcgtattcg acgttgtcca ccttcatcga gaacatttgg 3900 cggcagaaat gcagcaggtc ttcaatgtta tcagccattc cggccatttt gtaagaatcc 3960 cgcgtatatg atctattatt cgcgaagaat attgagtccg agctgtggtc atagcgtcgt 4020 gccatacgca gcatcatcac ctccgacgag caggccttta gtaacgtgat ctggtcctcc 4080 tggggtatct ttgtaaacgc tggtagacct ttagcaaact caacaatcaa ctggaccgtg 4140 agtatggtta tctcggttat atgccgaaag ctgacgtccg tttggctctc gttctcatcg 4200 ggttgactca ttatacgcct gagatcctct tcagatggct gctcatagcc atcctggtac 4260 caaattaact tgtatataac ggccaactga ttgtacgtta aggaaggtat attgcgcgct 4320 tgacacttgg ccaatatttc atcaggtagt agcggaatag tggcatgctg gggcggctcg 4380 catgtcataa ggtcaagaat ctccttctta acaaagtctt ggccgccacc agaggccaag 4440 ctgccattgc cgccatgctg agagctcggc gaagtggtca ttttgtcctt ctccttctgg 4500 gccttctttt cgcgccgctt catcgcacat tggttctccg ggacgacgca ttccggccgc 4560 atacccacgg ccaggcactt tttcaggcgg cactcctgac actttcgcct catgtacatg 4620 tccatttcgc aggcgcgccc gaacttgcag cagtagacgg cgctcttcgt aacgctgcgt 4680 cgaaagaaca ccttgcagga tccacaggtg agggcgttgt agtggtagcc ggaggccctg 4740 tcctttcgtt cgccgagtat ccgttcaagc tgctcgaagg cgagagatca tcgcgacctg 4800 aagatataga atttgatatt cttctagatg tacctagaag cttcccaccg tactcgtcaa 4860 65/65 ttccaagggc atcggtaaac atctgctcaa actcgaagtc ggccatatcc agagcgccgt 4920 agggggcgga gtcgtggggg gtaaatcccg gacctgggga atccccgtcc cccaacatgt 4980 ccagatcgaa atcgtctagc gcgtcggcat gcgccatcgc cacgtcctcg ccgtctaagt 5040 ggagttcgtc ccccaggctg acatcggtcg ggggggccaa gtcttcttca gaaataagtt 5100 tttgttccat ggtggcggcc ggccactagc ggatctgacg gttcactaaa ccagctctgc 5160 ttatatagac ctcccaccgt acacgcctac cgcccatttg cgtcaatggg gcggagttgt 5220 tacgacattt tggaaagtcc cgttgatttt ggtgccaaaa caaactccca ttgacgtcaa 5280 tggggtggag acttggaaat ccccgtgagt caaaccgcta tccacgccca ttgatgtact 5340 gccaaaaccg catcaccatg gtaatagcga tgactaatac gtagatgtac tgccaagtag 5400 gaaagtccca taaggtcatg tactgggcat aatgccaggc gggccattta ccgtcattga 5460 cgtcaatagg gggcgtactt ggcatatgat acacttgatg tactgccaag tgggcagttt 5520 accgtaaata ctccacccat tgacgtcaat ggaaagtccc tattggcgtt actatgggaa 5580 catacgtcat tattgacgtc aatgggcggg ggtcgttggg cggtcagcca ggcgggccat 5640 ttaccgtaag ttatgtaacg cggaactcca tatatgggct atgaactaat gaccccgtaa 5700 ttgattacta ttaataacta atgcaacggc gctgcagcca ctgcatggcg gtaatacggt 5760 tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 5820 ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 5880 agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 5940 accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 6000 ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 6060 m gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 6120 ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 6180 1 gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 6240 ¾ Θ taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag 6300 ; tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 6360 gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 6420 cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 6480 ¾ agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 6540 cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaac 6600 ctgaggctat ggcagggcct gccgccccga cgttggctgc gagccctggg ccttcacccg 6660 aacttggggg gtggggtggg gaaaaggaag aaacgcgggc gtattggccc caatggggtc 6720 tcggtggggt atcgacagag tgccagccct gggaccgaac cccgcgttta tgaacaaacg 6780 acccaacacc gtgcgtttta ttctgtcttt ttattgccgt catagcgcgg gttccttccg 6840 gtattgtctc cttccgtgtt tcagttagcc tccccctagg gtgggcgaag aactccagca 6900 tgagatcccc gcgctggagg atcatccagc cggcgtcccg gaaaacgatt ccgaagccca 6960 acctttcata gaaggcggcg gtggaatcga aatctcgtga tggcaggttg ggcgtcgctt 7020 ggtcggtcat ttcgaacccc agagtcccgc tcagaagaac tcgtcaagaa ggcgatagaa 7080 ■ ggcgatgcgc tgcgaatcgg gagcggcgat accgtaaagc acgaggaagc ggtcagccca 7140 ttcgccgcca agctcttcag caatatcacg ggtagccaac gctatgtcct gatagcggtc 7200 cgccacaccc agccggccac agtcgatgaa tccagaaaag cggccatttt ccaccatgat 7260 attcggcaag caggcatcgc catgggtcac gacgagatcc tcgccgtcgg gcatgctcgc 7320 cttgagcctg gcgaacagtt cggctggcgc gagcccctga tgctcttcgt ccagatcatc 7380 ctgatcgaca agaccggctt ccatccgagt acgtgctcgc tcgatgcgat gtttcgcttg 7440 gtggtcgaat gggcaggtag ccggatcaag cgtatgcagc cgccgcattg catcagccat 7500 gatggatact ttctcggcag gagcaaggtg agatgacagg agatcctgcc ccggcacttc 7560 gcccaatagc agccagtccc ttcccgcttc agtgacaacg tcgagcacag ctgcgcaagg 7620 aacgcccgtc gtggccagcc acgatagccg cgctgcctcg tcttgcagtt cattcagggc 7680 accggacagg tcggtcttga caaaaagaac cgggcgcccc tgcgctgaca gccggaacac 7740 ggcggcatca gagcagccga ttgtctgttg tgcccagtca tagccgaata gcctctccac 7800 ccaagcggcc ggagaacctg cgtgcaatcc atcttgttca atcatgcgaa acgatcctca 7860 tcctgtctct tgatcgatct ttgcaaaagc ctaggcctcc aaaaaagcct cctcactact 7920 tctggaatag ctcagaggcc gaggcggcct cggcctctgc ataaataaaa aaaattagtc 7980 agccatgggg cggagaatgg gcggaactgg gcggagttag gggcgggatg ggcggagtta 8040 ggggcgggac tatggttgct gactaattga gatgcatgct ttgcatactt ctgcctgctg 8100 gggagcctgg ggactttcca cacctggttg ctgactaatt gagatgcatg ctttgcatac 8160 ttctgcctgc tggggagcct ggggactttc cacaccctaa ctgacacaca ttccacagct 8220 ggttctttcc gcctcaggac tcttcctttt tcaatattat tgaagcattt atcagggtta 8280 ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc 8340 gcgcacattt ccccgaaaag tgc 8363 |
Field of invention
The invention refers to a bistable genetic toggle switch comprising a pair of mutual repressors and a positive feedback loop based on DNA-binding proteins. Each repressor and activator pair binds to its corresponding binding site on DNA. This mechanism constitutes switch functionality.
State of the art
Genetic regulatory networks are hirearchically organised networks of genes that act on each other in order to perform a function of some kind.
Genetic arrangements of this kind can be found in nature, such as a switch in the bacteriophage lambda and a circadian oscillator in cianobacteria (2).
In the context of recent developments in the fields of biotechnology and synthetic biology, there arose a need to construct synthetic genetic regulatory networks with well-defined and complex functions.
Bistable and multistable toggle swithces are the most useful representatives, as they can function as a form of memory in biological systems as well as in the control of expression of endogenous and heterologous genes. The development started with the construction of a bistable switch using endogenous transcription factors in Escherichia coli (3). Kramer et al. constructed a bistable switch out of similiar elements in mammalian cells (4). This switch consists of two constitutive promoters and their corresponding opposing repressors. A genetic network of this kind can assume two stable states, since the activation of the promoter of the first construct activates expression of the repressor that represses the promoter of the second construct. For a practical application of such a switch, an external signal able to push the switch into the desired state is needed. This kind of change should be possible with the addition of small molecules that act as inducers, notably anhydrotetracyclin (aTc) and isopropyl-beta-D-l-thiogalactopyranoside (IPTG) (3). Each of the inducers binds to its corresponding repressor and inactivates it. For example, inducer 2 binds to repressor 2 and inactivates it. Lack of repression of promotor 2 results in expression of repressor 1 , which binds to promotor 1 and blocks the transcription of repressor 2. Similiarly, when inducer 1 is added, it inactivates repressor 1 and promoter 1 activates transciption of repressor 2, which represses promoter 2 and therefore blocks the transcription of repressor 2. It is important that the system exhibits stability -a set state must persists after the removal of the inducer. Kramer et al. prepared a bistable switch in Chinese hamster ovary (CHO) cells (4). This switch functions very similiarly to the switch constructed in Escherichia coli. The switch is made of the same two opposing promoters, followed by binding sites for repressors. Promoter 1 is followed by the repressor 1 binding site, whereas promoter 2 is followed by the repressor 2 binding site. Both promoters and corresponding binding sites are followed by a gene encoding a DNA-binding protein (promoter 1 followed by DNA-binding protein 2 and vice versa for promoter 2). Mammalian repressors are constructed modularly - the repressor and the DNA- binding function are exerted by separate domains of the protein. This provides the oppurtunity to use specific DNA-binding domains, such as zinc-finger domains or TAL (Transcriptional activator like) proteins and combine them with repressor domains (for instance KRAB), or with activator domains (such as VP 16), which results in functional transcriptional repressors or activators, respectively. In the case of the switch designed by Kramer et al. (4), the DNA- binding domains A and B are fused with the repression domain KRAB which results in two repressors: A and B.
Drawbacks of current state of the art
Development in the field continued with the design of gene regulatory networks of higher complexity consisting of several bistable switches. Networks with an odd number of repressors were constructed, exhibiting oscillatory behaviour (5,6). In addition, networks of repressors that were capable of boolean logic operations were constructed (7).
Up to now, all of the designed genetic switches were based on natural bacterial DNA-binding proteins as building blocks of repressors. The number of well characterized natural repressor proteins is limited, which limits the number of independent bistable switches able to operate inside a single cell. Several biochemical properties such as stability, oligomeric state and affinity for DNA vary between different natural repressor proteins, contrary to the the desired balance between the regulatory elements which is needed for a bistable switch to function properly and robustly. The second drawback of bistable switches described to date is the use of only repressors, which results in relatively small differences between the two states. In the study by Kramer et al. (4), it was found that the state to state difference in the expression of a reporter gene under the control of a bistable switch was relatively small.
Literature:
1. Ptashne, M. A Genetic Switch: Phage λ and Higher Organisms (Cell, Cambridge, Massachusetts, 1992).
2. Ishiura, M. et al. Expression of a gene cluster kaiABC as a circadian feedback process in cyanobacteria. Science 281, 1519-1523
3. Gardner, T. S., Cantor, C. R. & Collins, J. J. Construction of a genetic toggle switch in Escherichia coli. Nature 403, 339-42 (2000).
4. Kramer, B. P. et al. An engineered epi genetic transgene switch in mammalian cells. Nature biotechnology 22, 867-70 (2004).
5. Elowitz, M. B. & Leibler, S. A synthetic oscillatory network of transcriptional regulators. Nature 403, 335-8 (2000).
6. Fung, E. et al. A synthetic gene-metabolic oscillator. Nature 435, 1 18-122 (2005).
7. Tamsir, A., Tabor, J. J. & Voigt, C. a. Robust multicellular computing using genetically encoded NOR gates and chemical "wires. 'Nature 1-4 (2010). doi:10.1038/nature09565
Summary of invention.
Artificially designed DNA-binding proteins, e.g. zinc-finger proteins (ZFP) and TAL effectors (US patent application: US201201 10685), which can be prepared in any desired numbers, seem to be the best solution to the problem of limited numbers of natural repressor proteins. However, computer simulations and laboratory experiments have shown that a bistable switch based on synthetic DNA-binding proteins is difficult to implement. Cooperativity of repressors is a prerequisite to for the preparation of a functional bistable toggle switch (US patent US6841376) (Figure 2). Synthetic modular DNA-binding proteins (e.g. TAL effectors, ZFP) bind DNA as monomers and therefore lack binding cooperativity. To solve this problem we propose a designed bistable toggle switch composed of two mutual repressors based on DNA-binding protein domains, and an additional positive feedback loop based on the same DNA-binding protein domains fused with an activator domain (Figure 1). This switch is significantly more stable than a switch based on synthetic DNA repressor proteins without an integrated positive feedback loop. Moreover, the difference in reporter expression in both toggle states of this switch is far greater than in the case of a switch without a positive feedback loopand absolute repression is not necessary for bistability.
Figure 1 represents a scheme of the positive feedback loop switch design. Transcription of both the repressors and the activators is controlled by a minimal promoter that ensures very low expression of the downstream gene. The expression of the downstream gene is greatly increased if a DNA-binding protein fused with an activatory domain binds upstream of the minimal promoter. The switch has binding sites for the appropriate artificial transcription factors, which are composed of designed DNA-binding proteins and located upstream of promoters. The binding sites for the artificial DNA-binding proteins A and B are located upstram of the promoters of the first or of the second group of genes, respectively.
Each state requires transcription of two genes, therefore the switch is composed of four genes. An optional number of genes (effectors or reporters) expressed in either of the two toggle states can be added. (Figure 3).
The following are active in state 1 : repressor B, which is under the control of operator A located upstream of either a minimal or a constitutive promoter. Repressor B represses the expression of all genes characteristic of state 2 as well as the effectors or the reporters of state 2;
activator A, which is under the control of operator A. Activator A activates the expression of all genes characteristic of state 1, i.e. the repressor B and the activator A as well as the effectors of state 1. The activator A's operon is auto-activatory and represents a positive feedback loop that can be inhibited by repressor A.
The following are active in state 2:
- repressor A, which is under the control of operator B located upstream of either a minimal or a constitutive promoter. Repressor A represses the expression of all genes characteristic of state 1 as well as the effectors or the reporters of state 1 ; activator B, which is under the control of operator B. Activator B activates the expression of all genes characteristic of state 2, i.e. the repressor A and the activator B as well as the effectors of state 2. The activator B's operon is auto-activatory and represents a positive feedback loop that can be inhibited by repressor B.
The activator consists of a DNA-binding domain that is fused to an activation domain, such as the VP 16 or the VP64 domain. The repressor consists of a DNA-binding domain that is fused to a transcriptional repression domain, such as the KRAB domain.
Because the repressor B, activator A and the state 1 effector are under the joint transcriptional control of operator A, we can combine them under the same operator and promoter in single or multiple DNA sequences linked with 2A sequences between structural genes, which enable co-translational cleavage of proteins, e.g. the t2A sequence.
We can combine repressor A, activator B and the state 2 effectors under the joint control of a single operator and promoter in the same manner.
Figure descriptions
Figure 1 : A scheme of a mutual repressor switch with a positive feedback loop based on DNA-binding proteins. A) A multiple-operon implementation allowing toggle control with the same operator for each state. B) A single-operon implementation allowing toggle control with the same operator for each state.
Figure 2: A scheme of a classic bistable toggle switch, as implemented in bacterial and mammalian cells, based on bacterial cooperative DNA-binding domains (Gardner et al. 2000, Kramer et al. 2004).
Figure 3: A scheme of a bistable genetic toggle switch comprising a pair of mutual repressors and a positive feedback loop, based on DNA-binding proteins. A) Activator A is expressed in state 1 and binds to DNA binding element A, activating transcription of the structural genes of repressor B, activator A (itself) and effector 1. The expression of repressor B inhibits the expression of repressor A and of activator B. B) Activator B is expressed in state 2 and it binds to DNA binding element B, activating the transcription of the structural genes of repressor A, activator B (itself) and effector 2. The expression of repressor A inhibits the expression of repressor B and of activator A. Figure 4: Implementation of a bistable toggle switch with a pair of mutual repressors and a positive feedback loop, based on TAL DNA-binding proteins, applicable for expression in mammalian cells. Legend: the arrows depict the structural genes coding for repressor (TAL:KRAB), activator (TAL:VP16), effectors (fluorescent proteins) and inducer proteins; 2A represents the position of the self-cleaving peptide; a round symbol represents the position of an operator withDNA binding elements, which along with the promoter controls the expression of structural genes.
Figure 5: Implementation of a bistable toggle switch with a pair of mutual repressors and a positive feedback loop, based on TAL DNA-binding proteins, with joint operons for expression in mammalian cells. Legend: the arrows represent structural genes coding for repressor (TAL:KRAB), activator (TAL:VP16), effectors (fluorescent proteins) and inducer proteins; 2A represents the position of the self-cleaving peptide; a round symbol represents the position of the operator with the DNA binding elements, which along with the promoter controls the expression of structural genes.
Figure 6: Detection of bistability in mammalian cells, which have been transfected with plasmids encoding the neccessary genes for the switch.
Figure 7: Detection of the TAL-repressor efficiency.
Figure 8: Detection of the effect of the DNA binding elements in the operon's operator. Figure 9: Detection of TAL activator efficiency.
Description of the invention
The present invention relates to a bistable switch comprising a pair of mutual repressors and a positive feedback loop based on modular DNA-binding proteins. The bistable switch regulates switching from stable state 1 to stable state 2 and vice-versa. The switching is directed by signals from the environment. The positive feedback loop brings non- linearity to the system, enabling the switch to function with non-oligomeric and non- cooperative DNA-binding domains.
The bistable switch includes: a) An operon for maintaining state 1 with a positive feedback loop, which includes the structural gene for activator A, which forms the positive feedback loop. Transcription of the structural gene for activator A is under regulation of an operator, which includes the DNA binding element A and a promoter. b) An operon for maintaining state 1 with repressor B which inhibits state 2 by repressing the genes under regulation of operator B. Transcription of the structural gene for repressor B is under regulation of an operator, which includes the DNA binding element A and a promoter. c) An operon representing the activation of state 1, which includes the gene for effector 1. Effector 1 can be any gene or a group of genes e.g. different enzymes, fluorescent proteins, signaling proteins etc. Transcription of the structural gene for effector 1 is under regulation of an operator, which includes the DNA binding element A and a promoter. d) An operon for maintaining state 2 with a positive feedback loop, which includes the structural gene for activator B, which forms the positive feedback loop. Transcription of the structural gene for activator B is under regulation of an operator, which includes the DNA binding element B and a promoter. e) An operon for maintaining state 2 with repressor A which inhibits state 1 by repressing the genes under regulation of operator A. Transcription of the structural gene for repressor A is under regulation of an operator, which includes the DNA binding element B and a promoter. f) An operon representing the activation of state 2, which includes the gene for effector 2. Effector 2 can be any gene or a group of genes e.g. different enzymes, fluorescent proteins, signaling proteins etc. Transcription of the structural gene for effector 2 is under regulation of an operator, which includes the DNA binding element B and a promoter.
State 1 or state 2 are activated with the following switch-ON operons: a) An operon for switching to state 1 , which includes a structural gene for activator A to activate state 1, and a structural gene for repressor B which inhibits state 2. The transcription of these structural genes is under regulation of a promoter and an operator, which is active in the presence of inducer A. The promoter can be constitutive or minimal. b) An operon for switching to state 2, which includes a structural gene for activator B to activate state 2, and a structural gene for repressor A to inhibit state 2. The transcription of these structural genes is under regulation of a promoter and an operator, which is active in the presence of inducer B. The promoter can be constitutive or minimal.
Switching operons are activated by inducers: a) Operons of the inducers, which include structural genes for inducer proteins under regulation of a promoter, preferably constitutive or minimal, and optionally an operator.
Operons of the structural genes, which are under regulation of the same operator (activator A, repressor B and effector 1 or activator B, repressor A and effector 2) can be joined in groups of two or more so that they are under regulation of the same operator and are linked with self-cleaving 2A peptides. They are transcribed from structural genes in the form of a single RNA and are translated into polypeptide chains, leading to synthesis of individual proteins in an equal stoichiometric ratio.
The number of DNA binding elements for DNA-binding domains of the activator or repressor ranges from one to several, preferably from one to 20, more preferably from one to 12. The position of the DNA binding elements for transcription factors in regard to the promoter can be upstream or downstream of the promoter, or both. In a preferred embodiment, the DNA binding elements are upstream of the promoter, which is adjacent to transcription initialization site and the structural gene.
The operons composing the bistable switch include a structural gene, which encodes several proteins adjacently linked with a self-cleaving 2 A peptide. Optionally, the above- mentioned polycistronic operons may be divided into seperate operons encoding a single protein or two proteins liked via a self-cleaving 2A peptide with the same operators upstream of the structural gene.
Definitions:
The term »DNA-binding domain« refers to DNA-binding domains of protein families such as TAL effectors, zinc fingers and other transcriptional regulators, their homologues, orthologues and mutants with preserved or enhanced basic functions of DNA-binding proteins.
The term »TAL« refers to synthetic or natural TAL proteins, preferably their central DNA- binding domain with an additional nuclear localization signal. The central domain of a TAL protein is composed of a variable number of TAL repeats. The term »TAL« may also refer to homologues, orthologues and mutants with preserved or enhanced basic function of TAL proteins. The term »TAL« may refer to synthetic TAL domains with any number of TAL repeats in any order, additionally containing a nuclear localization signal.
The term »operator« refers to a DNA sequence containing DNA binding elements located near a promoter. An operator can be located upstream or downstream of a promoter, preferably upstream. For the purposes of the present invention, the operator can contain one or more sequential, either identical or different DNA binding elements. Repeats of different DNA binding elements can either alternate or cluster. The number of DNA binding elements is not limited. The term »DNA binding element« refers to a specific nucleotide sequence on a DNA molecule, which binds to the DNA-binding domain. The nucleotide sequence of the DNA binding element depends on the DNA-binding domain specificity of repressors and activators of the switch. DNA binding elements composing an operator may be separated with a variable number of nucleotides. The number of nucleotides separating the DNA binding elements is between 2 and 100, preferably between 2 and 50.
The term »transcription repression domain« refers to a protein domain, which ensures the inhibition of structural gene transcription, if linked to a DNA-binding domain. The function of a transcription repressor domain, linked to a DNA-binding domain is the inhibition of structural gene transcription by preventing the binding of RNA polymerase to the corresponding promoter. Transcription repressor domains can be chosen from a range of repressors known to persons skilled in the art, preferably from the family of KRAB repressors. The term »KRAB« refers to »Kriippel-associated box« and may also refer to homologues, orthologues and mutants with preserved or enhanced basic function of inhibiting structural gene transcription.
The term »transcription activation domain« refers to a protein domain, which ensures activation of structural gene transcription, if linked to a DNA-binding domain. The function of a transcription activation domain linked to a DNA-binding domain is transcriptional activation of structural genes. Transcription activation domains can be chosen from a range of activators known to persons skilled in the art, preferably from the family of VP 16 and VP64 activators. The term »VP16« refers to a transcription activation domain of viral origin. VP 16 induces formation of a protein complex, which enhances expression of structural genes. The term »VP64« refers to four tandem repeats of the activation region of the VP 16 domain.
The term »repressor« refers to proteins, comprising a DNA-binding domain and a transcription repression domain, preferably KRAB. The term »repressor« refers to proteins with the function of inhibiting structural gene transcription when bound to their respective DNA binding elements.
The term »activator« refers to proteins, comprising a DNA-binding domain and a transcription activation domain, preferably VP 16 or VP64. The term »activator« refers to proteins with the function of activating structural gene transcription when bound to their corresponding DNA binding elements (Garg et al. 2012).
The DNA-binding domain and the transcription repression/activation domain are linked together by a linker peptide, which is any polypeptide of any length and any aminoacid sequence. The term »linker peptide« refers to aminoacid sequences with the function of separating individual domains of a chimeric protein. Optional functions of a linker peptide in a chimeric protein can also be cleavage or posttranslational modification site introduction.
The ratio of the DNA-binding domain and the transcriptional repression/activation domain can be 1 :1 or 1 :2. One or more transcriptional repression/activation domains can be linked to the DNA-binding domain at the N or C terminal end or at the N and C terminal end.
The term »minimal promoter« refers to a DNA sequence of a few nucleotides in length located upstream of a transcription initiation site and is a minimal requirement for the binding of transcription factors and gene transcription. Nucleotide sequences of minimal promoters are known to persons skilled in the art and have been extensively described elsewhere. The term »constituitive promoter« refers to a DNA sequence, which ensures continuous transcription of structural genes. Its location and sequence is known to persons skilled in the art and has been previously described elsewhere. The term »constituitive promoter« refers to an unregulated promoter, enabling continuous expression of the corresponding gene.
The term »inducer« refers to molecules, able to regulate gene expression by binding to proteins, e.g. repressors or activators. The term »inducer« refers to antibiotics and their analogues, natural compounds and their analogues, metalothionines, steroids and analogues; e.g. tetracyclin, doxycyclin, anhidrotetracyclin, rapamycin and analogues, ecdysone and analogues (e.g. ponasteronA, muristeronA), alolactose (lac operon), arabinose (ara operon), cumermycin and novobiocin, RU486 (mifepriston), estrogens and analogues (e.g. 4-hydroxi- tamoxifene), streptogramines (e.g. pristinamycin), macrolides (e.g. erythromycin), vanilinic acid, cumate, phloretin, biotin, arginine, metal ions, polymeric substrates (e.g. pectin, xylan, arabinan) or monomelic units of degraded polymers (e.g. arabinose, xylose, metals and metal ion-protein complexes, cAMP, cyanate, CRP, formate, maltose, acetolactate, urea) and other compounds known to persons skilled in the art. The term »inducer« may also refer to temperature, pH or redox potential, if the change in temperature, pH or redox potential effects the activity of repressors or activators.
The term »self-cleaving peptide« refers to aminoacid peptide sequences, that ensure autocatalytic cleavage of the peptide, such as 2 A sequences (e.g. t2A, e2A, f2A etc). Self- cleaving peptides enable synthesis of a polycistronic mRNA chain, from which individual proteins are synthesised. The polycistronic chain can contain two or more sequentially linked proteins, separated by the self-cleavable peptide. The described composition ensures synthesis of individual proteins in equal stoichiometric ratio.
The term »signal sequence« or »signal peptide« refers to an aminoacid sequence, important for directing the protein to a certain location in the cell. Signal sequences differ depending on the host organism for protein expression. Aminoacid sequences and functions of signal peptides are known to persons skilled in the art and are available in the literature.
The term »tag peptide« refers to aminoacid sequences, added to a protein for simplified purification, isolation or detection. The position of signal sequences, linker peptides and tag peptides can be arbitrary, although they should allow functional expression of the protein, while also preserving the function for which these sequences were selected.
The terms »homologue« and »orthologue« refer to polypeptides, originating from the same or different organism. The term »homologous« also refers to mutated protein segments, where the mutations have a minimal effect on the structure or function of the polypeptide. The term »mutant« refers to a polypeptide, differing from the native protein polypetide in at least one aminoacid.
The term »effector« refers to any protein.
The terms »promoter«, »teminator«, »protein«, »DNA« are generally known to persons skilled in the art and are used as expected.
Embodiments of the invention can contain one or more switches. The switches can function independently or can be interconnected.
An embodiment of the invented switch enables controlled expression of state 1 or state 2, maintains a stable state even when the inducer molecule is removed, and is capable of switching between the two states, depending on the presence of a corresponding inducer. States are defined by expression of one or more effectors, e.g. therapeutic molecules, signal molecules, regulators or any other protein molecules.
The switch can be used for state signalization as a reporter system, e.g. reporting the presence of an inducer. Such inducers might include but are not limited to metal ions, pH, glucose and others.
The term »expression vector« refers to circular or linear DNA plasmids or viral DNA, containing operons listed in the invention and the necessary elements for expression in prokaryotic or eukaryotic cells, which are known to persons skilled in the art. Bacterial vectors contain bacterial control elements, a bacterial replication origin and an antibiotic resistance operon for selection of successfully transformed bacteria. Eukaryotic vectors contain, in addition to a bacterial replication origin, appropriate eukaryotic control elements, and appropriate antibiotic resistance operons for selection of successful bacterial transformation and/or successful eukaryotic transfection.
Embodiements of the invention can be used in prokaryotic as well as in eukaryotic organisms and cell lines. The basic difference is the use of transcription and translation ensuring nucleotide sequences in promoters and terminators, known to persons skilled in the art.
The invention further includes host cells and organisms, which either transiently or stably incorporate the nucleic acids described herein. The appropriate host cells are known to persons skilled in the art and include bacterial and eukaryotic cells. One skilled in the art will appreciate that a protein can be expressed in mammalians cells of the following origins: human, rodent, bovine, pork, poultry, rabbit and similar. Mammalian host cells include cultivated primary cell lines or immortalized cell lines.
Transfer of DNA into host cells is performed with conventional methods well known to persons skilled in the arts, such as transformation or transfection, including: chemical transfer, electroporation, microinjection, DNA lipofection, cell soni cation, gene bombarding, viral DNA transfer etc.
DNA transfer can be either transient or stable. »Transient transfer« refers to transfer of DNA in a vector, that does not undergo cromosomal insertion. »Stable transfer« refers to insertion of DNA into the host genome. DNA transfer to a cell line with a previous stable insertion can be controlled with the presence of markers. »Markers« refer to antibiotic or chemical resistance and can be included in the vector.
Examples of implementations described in detail below are conceived to best describe the invention. These descriptions are not intended to limit the field of the invention or its applicability, but serve to better demonstrate the invention and its appplicability.
Exemplification
Example 1. Preparation of DNA constructs for the switch according to the invention
For the preparation of DNA constructs the inventors used methods of molecular biology, such as: chemical transformation of competent E. coli cells, DNA plasmid isolation, polymerase chain reaction (PCR), reverse transcription - PCR, PCR ligation, determination of nucleic acid concentration, agarose gel electrophoresis of DNA, isolation of DNA fragments from agarose gels, chemical synthesis of DNA, DNA digestion with restriction enzymes, digestion of plasmid vectors, ligation of DNA fragments, purification of plasmid DNA in larger quantities. The protocols of the experimental techniques and methods are well known to person skilled in the art and are described in the manuals of molecular biology.
All work was performed with sterile techniques, which are well known to persons skilled in the art. All plasmids, completed constructs and partial constructs were transformed into bacteria E. coli with chemical transformation. Plasmids and constructs were transfected into cell lines HEK293 and HE 293T using comercially available transfection reagents.
The final gene constructs comprising the operons for the switch as described in the present invention are listed in Table 1 and the proteins transcribed from structural genes are listed in Table 2. All operons have been prepared using techniques and methods known in the art. Operons were inserted into appropriate plasmids suitable for eukaryotic systems. The inventors confirmed adequacy of nucleotide sequences by sequencing and restriction analysis.
The label [A] represents the sequence of the transcription factor-binding DNA element that is the recognition sequence of the DNA-binding protein TALA, which is a TAL effector designed to recognize the chosen DNA sequence. The DNA sequence that is the recognition sequence of the Tt represorrepressor is labeled [TRE] and the DNA sequence that is the recognition sequence of ecdysone is labeled [ECD].
The operon for the maintenance of stable state 1 is comprised of the DNA binding element [A] for the repressor TALArKRAB or the activator TALA:VP16, a minimal promoter and structural genes of a feedback loop, which in turn is comprised of a repressor for the inhibition of expression of structural genes from the operon formaintenance of stable state 2, specifically it comprises TALBrKRAB. Separately, the operon for the maintenance of stable state 1 comprises also a DNA-binding site [A] for the repressor TALA:KRAB or the activator TALA:VP16, a minimal promoter and structural genes of a positive feedback loop to maintain state 1, specifically TALA:VP16, and any effectors (Figure 1, Figure 3, Figure 4, Figure 5). Both operons can be combined into a single operon in which structural genes are separated by a self-cleaving peptide 2A (Figure 2, Figure 3, Figure 4, Figure 5). The operon for the maintenance of stable state 2 is comprised of the DNA binding site [B] for the repressor TALB:KRAB or the activator TALB:VP16, a minimal promoter and structural genes of a feedback loop, which in turn is comprised of a repressor for the inhibition of expression of structural genes from the operon formaintenance of stable state 1 , specifically it comprises TALA:KRAB. Separately, the operon for the maintenance of stable state 2 comprises also a DNA-binding site [B] for the repressor TALB:KRAB or the activator TALB:VP16, a minimal promoter and structural genes of a positive feedback loop to maintain state 2, specifically TALB:VP16, and any effectors (Figure 1 , Figure 3, Figure 4, Figure 5). Both operons can be combined into a single operon in which structural genes are separated by a self-cleaving peptide 2A (Figure 1, Figure 3, Figure 4, Figure 5).
The operator contains 10 DNA binding elements for either the DNA-binding protein TALA or TALB.
The operons for switching into stable state 1 are comprised of a constitutive or minimal promoter, DNA binding elements for an inducer- dependent activator or repressor, and structural genes encoding TALA:VP16 for state 1 activation and (on a separate operon) TALB:KRAB for state 2 inhibition. Operons for switching into state 1 can be combined into a single operon, which contains the DNA-binding elements for an inducer-dependent activator or repressor, minimal or constitutive promoter and structural genes separated by a self- cleaving peptide 2A.
The operons for switching into stable state 2 are comprised of a constitutive or minimal promoter, DNA binding elements for an inducer-dependent activator or repressor, and structural genes encoding TALB:VP16 for state 2 activation and (on a separate operon) TALA:KRAB for state 1 inhibition. Operons for switching into state 2 can be combined into a single operon, which contains the DNA-binding elements for an inducer-dependent activator or repressor, minimal or constitutive promoter and structural genes separated by a self- cleaving peptide 2A.
Inducers for state 1 or state 2 induction are different. Inducers are well known to persons skilled in the art and described in detail in the state of the art. Inducers can be arbitrarily selected, provided they specifically activate one of the two/several states. The TAL DNA-binding domain was obtained from TAL effectors by PCR amplification of the central DNA-binding domain to which a signal sequence for nuclear localization was added. Appropriate DNA binding elements were prepared synthetically so that they match the DNA-binding domains of selected TAL proteins. It is evident from the state of the art that DNA-binding domains based either on TAL or zinc fingers are naturally or synthetically prepared and each have their own recognition binding sites on DNA; the DNA binding elements. The number of DNA binding elements can be varied. It is evident from the state of the art that the effect of a repressor or an activator is improved by increasing the number of its DNA binding elements up to 12. We selected VP16 as the activation domain and KRAB as the repression domain. Both domains alike were fused with a TAL DNA- binding domain.
Table 1 : List of operons, DNA-binding elements and proteins.
operon structure SEQ ID NO.
10x[B]-p MIN -TALB:VP16 Ϊ
10x[B]-p m in-TALA:KPvAB-T2 A-mCit 2
10x[A]-p MIN -TALB:KRAB-T2A-TALA:VP16-T2A-BFP 3
10x[A]-p MIN -TALB:KRAB 4
10x[A]-p MIN -TALA:VP16-T2A-BFP 5
10x[B]-p MIN -TALA:KRAB-T2A-TALB:VP16-T2A-mCit 6
PCMV-[TRE]-TALB:VP16 7
PcMV [TRE] -TAL A: KRAB 8
[ECD] -p m in-TALB :KRAB 9
[ECD]-p min -TALA:VP16 10
pCMv-tTR-KRAB
PCMV-ECD-KRAB
[B] 19
[A] 20
pCMV-tTR:KRAB 21
pERV3 22 protein SEQ ID NO.
TALA:VP16 11
TALB:VP16 12
TALA-.KRAB 13
TALB:KRAB 14 mCit 15
BFP 16
TALB:KRAB-T2A-TALA: VP 16-T2A-BFP 17
TALA:KRAB-T2A-TALB : VP 16-T2A-mCit 18
Example 2, Bistable switch in mammalian cells
The methods and techniques of culturing cell cultures are well known to persons skilled in the art, therefore they are only briefly described with the intention of illustrating the implementation example. Cell lines of HEK293 and HEK293T cells were cultured at 37 °C and 5% C0 2 . DMEM medium supplemented with 10% FBS, which contains all the necessary nutrients and growth factors was used for cell culturing. Once the cell culture reached an appropriate density, cells were subcultured into a new culture vessel and/or diluted. For the application of cells in experiments the number of cells was determined with a hemocytometer and seeded at 2,5 x lO 4 cells per well on a 12 well microtiter plate 24 hours prior to transfection. Seeded plates were incubated at 37 °C and 5% C0 2 , until cells were 50-70% confluent for transfection with JetPei transfection reagent (Polyplus transfection). Transfection was performed according to the manufacturer's protocol, modified to a 12 well microtiter plate.
The HEK293 and HEK293T cell lines were transfected with plasmids described in the present invention and listed in Table 1.
For determination of effector expression and the system's state the cell medium was changed 2 hours post transfection and again 2 days post transfection, after which cells were incubated an additional 2 days. At the first medium change, an inducer for state 1 or state 2, tetracycline or ecdysone, was added. The expression of the effectors, blue fluorescent protein and yellow fluorescent protein, was monitored on a flow cytometer. A laser with a wavelength of 405 nm (blue fluorescent protein - BFP) and 488 nm (yellow fluorescent protein - mCit) was used. Emission was measured in the 430-480 nm (BFP) and 520-550 nm (mCit) range.
Results shown in figure 6A and 6B demonstrate that the switches function as described in the present invention. Results shown in figure 6A prove the switch enters state 1 or state 2 accordingly in the presence of plasmids PCMV-TALA: VP 16 (column 1), PCMV-TALB:VP16 (column 2), p CM v-TALA:VP16 and pc M v- ALB : KRAB (column 3), p CM v-TALB:VP 16 and PCMV-TALA :KRAB (column 4). All cells were, in addition to the plasmids listed earlier, also transfected with plasmids SEQ ID NO. 1, 2, 4, 5 of the switch as described in the present invention.
Results shown in figure 6B prove the switch enters state 1 or state 2 accordingly with the addition of inducers for state 1 or 2 and switching of the switch. All cell were in addition to the plasmids listed earlier also transfected with plasmids SEQ ID NO. 1,2,4,5,7,8,9,10 of the switch by the invention. Without the addition of an inducer the cells enter either state 1 or state 2. Inducer 1 causes the cells to enter state 1 exclusively, inducer 2 causes the cells to enter state 2 exclusively.
Example 3. TAL-repressors
A repressor was prepared by fusing a TAL DNA-binding domain and a KRAB repressor domain, where the KRAB repressor domain was fused to the C- and N-terminal end of the TAL DNA-binding domain. Two different DNA-binding domains were used: TALA and TALB. Operons with operators containing DNA binding elements specific for either TALA or TALB DNA-binding domain were prepared (labeled [A] and [B] respectively). The operator was followed by a CMV constitutive promoter and an effector, which is a luciferase gene in this example. HEK293 cells, whose culturing and transfection are described in implementation example 2, were transfected with repressors TAL:KRAB, KRAB:TAL or KRAB:TAL:KRAB and reporter, either pcMV-[A]-effector (luciferase gene) or PCMV-[B]- effector (luciferase gene). After 72 hours of culturing luciferase expression was assayed. Reduced luciferase activity is a direct proof of TAL-repressor function (figure 7). Results showed that the TAL-repressor function is independent of the KRAB repressor domain position, since the effector (luciferase) expression is sufficiently repressed.
Example 4. DNA binding elements for TAL DNA-binding domains We prepared operons with an effector (luciferase) whose operons contained one, two, four or seven DNA binding elements lx[D], 2x[D], 4x[D], 7x[D] for the TALD DNA-binding domain fused to a KRAB repressor domain. The operons also contained a CMV constitutive promoter, placed between the operator and structural genes. HEK293 cells, whose culturing and transfection are described in implementation example 2, were transfected with the TALD: KRAB repressor and a reporter, either pcMV-lx[D]-effector (luciferase gene), PCMV 2x[D]-effector (luciferase gene), PCMV-4X[D] -effector (luciferase gene) or PCMV-7X[D]- effector (luciferase gene). After 72 hours of culturing luciferase expression was assayed. Reduced luciferase activity is a direct proof of the TAL-repressor function (figure 8) on reporters with a different number of DNA-binding elements. Results showed that the function of a TAL-repressor is effective and dependent on the number of DNA-binding elements in the operons operator.
Example 5. TAL-activators
Activators were prepared by fusing a TAL DNA-binding domain and a VP 16 activation domain, where the VP 16 activation domain was fused to the C-terminal end of a TAL DNA- binding domain. Here we used two different DNA-binding domains TALA and TALB. We prepared operons with operators, which contained DNA binding elements specific for either TALA or TALB DNA-binding domain (labeled [A] and [B] respectively). The operator was followed by CMV or minimal constitutive promoter and an effector, which is a luciferase gene in this example. HEK293 cells, whose culturing and transfection are described in implementation example 2, were transfected with a TAL:VP16 activator (different amounts) and a reporter, either pcMv-[A]-effector (luciferase gene) or PC V-[B] -effector (luciferase gene). After 72 hours of culturing luciferase expression was assayed. Luciferase activity is a direct proof of the TAL-activator function, which depends on the dose of the activator (figure 9)·
Next Patent: PISTON RING, PISTON COMPRISING SUCH RING AND PISTON MACHINE COMPRISING AT LEAST ONE SUCH PISTON