Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
SYSTEMS FOR EXPRESSION OF HETEROLOGOUS PROTEINS IN M. CAPSULATUS
Document Type and Number:
WIPO Patent Application WO/2009/038475
Kind Code:
A1
Abstract:
The present invention relates to an expression system for the expression of proteins and peptides in a methanotrophic bacterium, preferably M. capsulatus. Further, the invention relates to the exportation and display of said peptides and proteins on the surface of said bacteria. The invention also describes a method for the production of a desired protein in M. capsulatus.

Inventors:
LILLEHAUG JOHAN R (NO)
FJELLBIRKELAND ANNE (NO)
JENSEN HARALD B (NO)
Application Number:
PCT/NO2008/000337
Publication Date:
March 26, 2009
Filing Date:
September 19, 2008
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
STIFTELSEN UNIVERSITETSFORSKNI (NO)
LILLEHAUG JOHAN R (NO)
FJELLBIRKELAND ANNE (NO)
JENSEN HARALD B (NO)
International Classes:
C07K14/195; A61K39/00; C07K19/00; C12N15/31; C12N15/74
Domestic Patent References:
WO2002055549A22002-07-18
Other References:
FJELLBIRKELAND A ET AL: "THE C-TERMINAL PART OF THE SURFACE-ASSOCIATED PROTEIN MOPE OF THE METHANOTROPH METHYLOCOCCUS CAPSULATUS (BATH) IS SECRETED INTO THE GROWTH MEDIUM", ARCHIVES OF MICROBIOLOGY, BERLIN, DE, vol. 176, 1 January 2001 (2001-01-01), pages 197 - 203, XP002950293, ISSN: 0302-8933
KARLSEN ODD A ET AL: "The surface-associated and secreted MopE protein of Methylococcus capsulatus (Bath) responds to changes in the concentration of copper in the growth medium.", APPLIED AND ENVIRONMENTAL MICROBIOLOGY APR 2003, vol. 69, no. 4, April 2003 (2003-04-01), pages 2386 - 2388, XP002513613, ISSN: 0099-2240
Attorney, Agent or Firm:
ACAPO AS (Bergen, NO)
Download PDF:
Claims:

C L A I M S

1 Recombinant vector comprising a first nucleotide sequence Seq. ID no. 4, or sequences homologous thereto, excluding sequences comprising the N-terminal sequence present in Seq. ID no 3 but not in Seq. ID no 4.

2. Recombinant vector according to claim 1, wherein the nucleotide sequence further comprises a second nucleotide sequence.

3. Recombinant vector according to claim 2, wherein said second nucleotide sequence having multiple cloning sites, said multiple cloning sites being positioned such that insertion of a third nucleotide sequence into said cloning site operable links said third nucleotide sequence to said first nucleotide sequence.

4. Recombinant vector according to claim 3, wherein said third nucleotide sequence codes for a desired protein or peptide.

5. Recombinant vector according to claim 4, wherein said protein or peptide is a drug, an antigen or an antibody.

6. Recombinant vector according to claim 1, wherein said nucleotide sequence further comprises a gene encoding a selection marker.

7. Recombinant vector according to claim 6, wherein said selection marker is an antibiotic selection marker.

8. Recombinant vector according to claim 1, wherein said nucleic acid further comprises a replication origin that function in the host M. capsulatus.

9. Recombinant vector according to claim 8, wherein said replication origin is smmo or pmmo.

10. Recombinant vector according to one of the previous claims, wherein the desired protein is expressed in the host M. capsulatus.

11. Recombinant vector according to claim 10, wherein the desired protein is expressed on the surface of the outer membrane of M capsulatus.

12. Recombinant vector according to one of the previous claims, wherein the nucleotide coding for the desired protein contains a region which codes for a peptide stretch functioning as a substrate for a hydrolyzing enzyme capable of cleaving the desired protein from the remaining of the membrane anchored protein, such that the desired protein is excreted to the culture medium.

13. Recombinant vector according to claim 1 , wherein the recombinant vector is a plasmid.

14. Recombinant vector according to claim 13, wherein the plasmid is pBBmopE H* , with Seq. ID no. 9 or sequences homologous thereto.

15. Recombinant vector according to any of the previous claims, where sequences homologous to the Seq. ID's have a sequence similarity at least 80 %, preferably 85 %, more preferably 90 % and most preferably 95 %.

16. Recombinant vector according to claim 1, comprising a sequence chosen from the group comprising Seq. ID no. 10, 12 or 14.

17. Bacterial host cell transformed with the recombinant vector according to one of the previous claims.

18. Bacterial host cell according to claim 17, wherein the bacterial cell is M capsulatus.

19. A method for producing a desired protein in a bacterial host cell, said method comprising transforming a bacterial host cell with a recombinant vector comprising a first nucleotide sequence Seq. ID no. 4 or sequences homologous thereto, excluding sequences comprising the N-terminal sequence present in Seq. ID no 3 but not in Seq. ID no 4, and said vector comprising a further nucleotide sequence encoding said protein, said further nucleotide sequence being operably linked in frame to said first nucleotide sequence, and culturing said transformed host cell in a suitable medium under conditions allowing expression of said protein.

20. A method according to claim 19, wherein the method further comprises the step of recovering the expressed protein or peptide from the medium.

21. A method according to claim 19, wherein the host cell is M. capsulatus.

22. A method according to claim 1, wherein the desired expressed protein is a drug, where said drug is extracted from the host cell, or used together with the host cell for the manufacturing of a vaccine, where said vaccine optionally is for oral administration.

23. A method according to any of claims 19-22, where sequences homologous to the Seq. ID's are at least 80 %, preferably 85 %, more preferably 90 % and most preferably 95 % identical.

24. Protein capable of being exposed on the surface of a methanotrophic bacterium, wherein the protein is encoded for by Seq. ID no. 4 or sequences homologous thereto, excluding sequences comprising the N-terminal sequence present in Seq. ID no 3 but not in Seq. ID no 4.

25. Protein according to claim 24, wherein the bacterium is M. capsulatus.

26. Fusion protein comprising a protein or peptide sequence encoded for by a nucleotide sequence Seq. ID no. 4 or sequences homologous thereto, excluding sequences comprising the N-terminal sequence present in Seq. ID no 3 but not in Seq. ID no 4, and a desired protein or peptide, capable of being translocated.

27. Fusion protein according to claim 26, where the amino acid sequence of the protein is Seq. ID no 11, 13 or 15.

28. Protein according to any of claims 24-26, where sequences homologous to the Seq. ID's have a sequence similarity of at least 80 %, preferably 85 %, more preferably 90 % and most preferably 95 % .

Description:

Systems for expression of heterologous proteins in M. capsulatus.

The present invention relates to the expression of heterologous proteins in the bacteria M. capsulatus. More specifically, the present invention relates to the exportation and display of polypeptides and proteins on the surface of said bacteria.

Thus, the present application relates to a recombinant vector, a bacterial host cell transformed with said vector, a method for producing a desired protein in a bacterial host cell, a protein capable of being exposed on the surface of a methanotrophic bacterium, and a fusion protein.

The expression of polypeptides on the surface of bacteria and bacteriophages has been pursued for several years, in part because of interest in recombinant antibody production. Many other potential applications exist, including the production of genetically-engineered whole cell adsorbents, construction of "peptide libraries", cell bound enzymes, and use as live vaccines or immunogens to generate antibodies.

In bacteria, one approach to obtaining surface expressed foreign proteins has been the use of native membrane proteins as a carrier for a foreign protein. In general, most attempts to develop methods of anchoring proteins on a bacterial surface have focused on fusion of the desired recombinant polypeptide to a native protein that is normally exposed on the cell's exterior with the hope that the resulting hybrid will also be localized on the surface.

In a prior invention (Norwegian patent application no. 20033176) the present inventors also provided an expression system where a heterologous polypeptide (termed "desired" protein) was expressed in the bacteria Methylococcus capsulatus. The heterologous protein is preferably linked to an outer membrane protein in M. capsulatus termed MopE.

MopE has a 540 amino acid protein sequence, with a short (29 amino acids) N-terminal sequence dependent signal sequence, followed by a N-terminal domain (176 amino acids) and a C-terminal domain (335 amino acids). The N-terminal domain is not secreted, while the C-terminal domain is secreted and expressed on the cell surface. The MopE amino acid sequence is shown in the sequence listing, as Seq. ID. no. 1

The method of secretion is not known. MopE does not show high sequence similarity to other known secreted proteins. The secretion is host specific, although inserted in the

IM and released to the periplasm, the MopE protein is not secreted by E. coli hosts, only by M. capsulatus. MopE has been proposed to be secreted either by the Type 2 Secretion system (T2S) or the Type 5 Secretion system (T5S) based upon its primary sequence (Fjellbirkeland et al, 2001). Proteins secreted by the T2S or T5S systems can be translocated across the IM either by the Tat or the Sec machinery, which are shown in figure 1.

Secretion of T2S substrates is very host specific. Secretion by the T2S system (Fig. no. 1) is a two-step process in which the secreted protein initially is exported across the IM by the Sec or Tat export systems. The periplasmatic intermediate is subsequently translocated across the OM by the T2S secretion through the channel formed by the secretin that is large enough to translocate folded or close to folded substrates. T2S is energized by ATP, and although not required for secretion, the proton motive force increases the rate of secretion. Substrates of the T2S system share no obvious similarities in primary sequence. Although no recognition signal that confine proteins to secretion by the T2S pathway has been identified, a potential common feature for T2S substrates is a medium to high content of β-sheet (de Vries et al, 1990 and Sandkvist, 2001). β-strands were predicted in the N-terminal domain of MopE by the computer program PRED-TMBB (http://bioinfromatics.biol.uoa.gr/PRED-TMBB). and this program also predicted an N-terminal β-barrel in MopE. The predicted β-strands may form the tertiary structures of β-sheets required for translocation by the T2S pathway. In addition to the prediction of such structures in N-terminal domain of Mop-E, MopE 0 is heat-modifiable while MopE* is not (Fjellbirkeland et al, 2001), indicating that the N- terminal domain of MopE indeed contains stabile β-structures, and thus is a candidate T2S substrate. Since a Sec-compatible signal sequence has been predicted in MopE it has been considered likely that the Sec machinery export MopE across the IM.

However, the presence of β-structures does not confine secreted proteins to the T2S route. Substrates of T5S, autotransporters, require a β-domain that forms a β-barrel that allows for translocation of the passenger domain across the OM. Since the N-terminal domain of MopE has the potential of containing β-structures, the domain could function as an autotransporter translocation unit in T5S. No autoproteolytic activity could be demonstrated for MopE using the substrate azocasein, however, such activity is not a widely distributed feature among T5S substrates, and thus not required. And while autotransporters inherently contain all information and accessory factors necessary for translocation to the OM, release of the protein to the extracellular space most often require cleavage by an external protease. Thus secretion by T5S is somewhat dependent

on the host cell, and the host specificity observed for the secretion of MopE does not exclude secretion by the T5S route.

Thus, it was thought that the most likely mechanism for translocation of MopE was either by the T2S or T5S routes, by the Sec or Tat transport systems. This translocation would then require the β-structures found in the N-terminal domain of MopE. Thus it was expected that the removal of the N-terminal domain of MopE would abolish the ability of the truncated protein, here termed MopE H *, to translocate. The MopE H * amino acid sequence is shown in the sequence listing, as Seq. ID. no. 2

The inventors constructed a mutated MopE protein, MopE H *, that consisted of the secreted domain of MopE alone, and did not contain the N-terminal MopE domain. (Please see Fig. no. 2. for a comparison of MopE and MopE H *) Quite surprisingly, they found that MopE H * fully retained its' ability to translocate in M. capsulatus.

Experiments were also performed to acertain the usefulness of MopE H * as part of a fusion protein acpable of translocation, by expressing Atlantic halibut nodavirus capsid protein fused to the surface protein MopE. As shown below, these fusion proteins were indeed capable of translocation.

As stated, this translocation of MopE H *, with or without a fused protein, is very surprising because it was thought that the N-terminal of MopE was required for translocation by the T2S or T5S routes. In addition, even though a protein in its entirety, here MopE, is known to be translocated, there is no reason to believe that only parts of the protein also may be able to do so.

The importance of this invention is fairly self evident. If one is to use MopE as a translocation system, there is an advantage to narrow it down to the smallest possible protein. There are always limits to how large of a protein (i.e. how long of a amino acid chain) may be transported. By removing a part of the native protein (here the N-terminal domain) one may, as shown, fuse a proportionally larger protein to the truncated MopE protein, and still have a reasonable expectation of successful translocation. In addition, the discovery that the N-terminal is not involved in translocation, changes the understanding of how MopE is translocated, and points research in new directions in trying to ascertain the mechanisms behind the translocation.

The invention can be better understood with reference to the following figures:

Fig. no. 1. Overview of secretion by the Type 2 and Type 5 secretion systems. (A)

The T2S system secretion consists of 12-16 proteins depending of species. The majority of the T2S components are IM proteins, many with large periplasmatic domains. Two of these GspE and GspL interact with ATP and are involved in energizing of the secretion. In the OM 12-14 GspD and GspS units form a secretin. The secretin is a transmembrane complex with a channel 5-10 ran in diameter, thus large enough to translocate folded or close to folded polypeptides. (B) Proteins secreted by T5S are pre-pro-proteins consisting of three domains: an N-terminal signal sequence for export across the IM, an internal passenger/functional domain and a C-terminal β-domain. The figures (A) and (B) are reproduced from (Voulhoux et al, 2001) and ( Desvaux et al, 2004), respectively.

Fig. no. 2. Overview of the structure of MopE and MopE H \

MopE consists of two domains, the non-secreted N-terminal domain and the secreted C- terminal domain, in addition to a Sec-dependent signal sequence. In MopE H* the histidine originating from the cloning strategy is shown.

Fig. no. 3. Amplification of DNA molecules containing the mopE H * gene or the mmoX promoter linked to the sequence encoding the MopE signal sequence from pAFpglO or pJBrp2, respectively. The mopE H * gene was constructed by amplification of a region of pAFpglO using the primers MopEXhoR and MopE*NcoI. The DNA containing the copper sensitive mmoX promoter and the sequence encoding the MopE signal sequence (MopE ss) was amplified from pJBrp2 using the primers sMMOSacI and spNcoI.

Fig. no. 4. Amplification DNAs containing the mopE H * gene or the mmoX promoter linked to the sequence encoding the MopE signal sequence. Negative PCR controls used had composition identical to the PCR sample except that no template was added. (A) PCR amplification of the -0.5 kb DNA containing the mmoX promoter linked to the sequence encoding the MopE signal sequence (lane 2) and PCR negative control (lane 1). (B) PCR amplification of a -1,2 kb DNA containing the mopE H * gene (lane 1) and PCR negative control (lane 2). (C) The amplified DNAs were subcloned in pCR2.1 -TOPO vectors, and resulting plasmids were controlled by RE-analysis. NcoVXhol digested pCR2.1 -TOPOl (lane 1) and pCR2.1-TOPO2 (lane 2).

Fig. no. 5. Subcloning of the amplification products in pCR ® 2.1-TOPO vectors by TOPO ® TA cloning. The DNA containing the mmoX promoter linked to the sequence encoding the MopE signal sequence was inserted to pCR2.1-TOPO to produce pCR2.1- TOPOl, while the DNA containing the mopE H * gene was inserted to pCR2.1-TOPO to produce pCR2.1 -TOPO2.

Fig. no. 6. Subcloning of the DNA containing the mmoX promoter linked to the sequence encoding the MopE signal sequence in pETlld. The DNA containing the m moX promoter and the sequence encoding the MopE signal sequence was inserted to pET 11 d, producing pET 1.

Fig. no. 7. Construction of and RE-analysis of pETl (A) The -0.5 kb DNA containing the mmoX promoter linked to the sequence encoding the MopE signal sequence (lane 1) and the -5.9 kb pETl Id vector fragment (lane 2) were purified from an agarose gel. (B) pETl was digested by BamHl (lane 1) and double digested with Ncol and Xbal (lane 2).

Fig. no. 8. Subcloning of the DNA containing the mopE H * gene in pETl. To produce pET2 a DNA containing the mopE H * gene was inserted to pETl .

Fig. no. 9. Construction of and RE-analysis of pET2. (A) A -1.2 kb DNA containing the mopE * gene (lane 2) and a -6.3 kb pETl fragment (lane 1) were purified from an agarose gel. (B) The pET2 plasmid was digested by BamHl (lane 1) and double digested by BamHl and Ncol (lane 2).

Fig. no. 10. Construction of pBBmopE H *. A pET2-DνA containing the mopE H * gene, the mmoX promoter and the sequence encoding the MopE signal sequence was excised from pET2 an inserted to pBBRlMCS-2 to produce pBBmopE H *.

Fig. no. 11. Construction and RE-analysis of pBBmopE H *. (A) The -5.1 kb pBBRlMCS-2 fragment (lane 1) and -2.0 kb pET2 (lane 2) were purified from a preparative agarose gel. (B) The pBBmopE H *plasmid was double digested with BamHl and Xbal (lane 1) and single digested by Ndel (lane 2).

Fig. no. 12. ECL developed blot of spent media of a copper-depleted M. capsulatus (Bath) wild type culture (lane 1) and of copper-depleted cultures of M. capsulatus δmopE harbouring either pBBRlMCS-2 (lane 2) or pBBmopE H * (lane 3).

Fig. no. 13. Diagrams of MopEH*, and MopEH* modified to contain restriction sites for BspHI and Nhel, in order to facilitate cloning of fusion proteins.

Fig. no. 14. The three MopEH* -nodavirus constructs developed: AHNVC-MopEH*, MopEH*-AHNVCc and MopEH*-AHNVC-20.

Fig. no. 15 .Schematic of the determination of the localization of recombinant MopEH*

Fig. no. 16. Gel results of MopEH*-AHNVCc (16a) and MopEH* -AHNVC-20 ( 16b) and AHNVC-MopEH* ( 16c) using anti-AHNVC and anti-MopEH* .

Fig. no. 17. Use of antibodies against MopEH*-AHNVC 20 aa peptide shows that the construct is antigenic.

Cultured Methylococcus capsulatus with pBBmopE H * plasmid was deposited with DDZ access identification Methylococcus capsulatus McdeltamopEpBBmopEH = DSM 19108 on March 1 st 2007 with DSMZ-Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (The German National Resource Centre for Biological Material), Inhoffenstr. 7 B, D-38124 Braunschweig, Germany.

A sequence listing was prepared using Patentln3.4 for the amino acid sequences of MopE (Seq. ID no. 1) and MopEH* (Seq. ID no. 2), AHNVC-MopEH* fusion protein (Seq. ID no 11), MopEH*-AHNVCc fusion protein(Seq. ID no 13), MopEH*-AHNVC- 20 aa fusion protein (Seq. ID no 15), and the DNA sequences of MopE (Seq. ID no. 3), MopEH* (Seq. ID no. 4), pETl (Seq. ID no. 5), pET2 (Seq. ID no. 6), pCR2.1-TOPOl (Seq. ID no. 7), pCR2.1-TOPO2 (Seq. ID no. 8), and pBBmopEH* (Seq. ID no. 9), AHNVC-MopEH* (Seq. ID no 10), MopEH*-AHNVCc (Seq. ID no 12), MopEH*- AHNVC-20 aa (Seq. ID no 14), MopEH* with BspHI mutation in stop codon (Seq. ID no 16), MopEH* with Ndel mutation (Seq. ID no 17), Atlantic halibut Nodavirus capsid protein 2 (Seq. ID no 18) and pBBRl-mopEH*(Seq. ID no 19/ The primers given in tables 4 and 6 are also given as Patentln sequences ID no 20-33. MopEH ST25.txt Patentln utskrift

Thus, the recombinant vector according to the present invention comprise a first nucleotide sequence Seq. ID no. 4, or sequences homologous thereto, capable of translocation through the outer membrane of Methylococcus capsulatus.

Also, the bacterial host cell according to the present invention is transformed with said recombinant vector.

And, the method for producing a desired protein in a bacterial host cell according to the present invention comprise transforming a bacterial host cell with a recombinant vector comprising a first nucleotide sequence Seq. ID no. 4 or sequences homologous thereto, and said vector comprising a further nucleotide sequence encoding said protein, said further nucleotide sequence being operably linked in frame to said first nucleotide sequence, and culturing said transformed host cell in a suitable medium under conditions allowing expression of said protein.

And, the protein capable of being exposed on the surface of a methanotrophic bacterium according to the present invention is encoded for by Seq. ID no. 2 or sequences homologous thereto.

And, the fusion protein according to the present invention comprise a protein or peptide sequence encoded for by a nucleotide sequence Seq. ID no. 4 or sequences homologous thereto, and a desired protein or peptide, capable of being translocated.

The fusion protein according to the invention is preferably expressed from a chimeric DNA having a DNA segment encoding a leader amino acid sequence capable of mediating secretion of the fusion protein, a DNA segment encoding for subunits of the surface protein, and a DNA segment encoding the desired target protein. The DNA segments are positioned such that expression of the fusion protein results in display of the target protein on the surface of the cells. The fusion proteins are preferably anchored to the cell surface of the bacteria forming what is referred to as a "display bacteria."

The present invention thus provides for a system for the expression of heterologous proteins, where the heterologous proteins are expressed on the surface of the bacterial cells.

The chimeric DNA may be integrated into the bacterial cell chromosome or be carried by a vector, where said vector preferably dose not comprise the entire nucleotide sequense of MopE (Seq. ID no 3), but rather only the truncated sequence comprising

MopE H * (Seq. ID no 4). In certain preferred embodiments, expression of the fusion protein may be regulated by an inducible promoter. Bacteria displaying a particular protein may be selected, for example, using antibody affinity. The fusion protein can be detached from selected cells. If desired, the target protein may be separated from the surface protein and further purified.

Target proteins useful in the present invention include peptides, proteins, e.g., hormones, enzymes, inhibitors, and receptors, antigens, antibodies including antibody fragments and single-chain antibodies.

The present invention thus provides a system for the expression of heterologous proteins in the membrane fraction, and preferable on the cell surface of the M. capsulatus.

The bacterium M. capsulatus is able to utilise methane as a single carbon and energy source. Bacteria capable of oxidising methane are collectively referred to as methanotrophs. They belong to different families and groups of the eubacteria but have in common the possession of the unusual enzyme methane mono oxygenase, which catalyses the oxidation of methane to methanol.

The bacterium has an obligate requirement for methane or methanol and an optimum growth temperature of 45 °C. Methane is oxidized via methanol to formaldehyde which is either assimilated into cellular biomass or dissimilated to carbon dioxide to release cellular energy.

M. capsulatus has a gram-negative cell envelope. Much of the intracellular space is occupied by an extensive intracytoplasmic membrane system. The genome of M. capsulatus (Bath) has a molecular weight of 2.8 x K)" Da and a G + C content of 62.5 %.

Commercial interests involving M. capsulatus and other methanotrophs could roughly be divided into two categories: Those taking advantage of the inexpensive growth requirements of the bacteria and those taking advantage of unique catalytic activities possessed by the bacteria.

The development of high-cell density fermentation technology for M. capsulatus has created the possibility of producing large quantities of specialised compounds like for instance amino acids, cofactors, vitamins, metabolic end products, and various high value proteins, at reasonable costs.

The present invention thus provides a system for the manufacturing of such product.

Other uses for the protein display methods of the present invention include, for example, epitope mapping, screening of antibody libraries and live bacterial vaccines.

The invention is especially suited for production of vaccines that can be administered orally for use in animals, fish and humans. The technique can also potentially be used for display of vaccines, especially for oral administration.

The invention relates to the use of the genes and the proteins encoded by them, as given in the accompanying sequences list, fragments thereof, or functionally equivalent substantially similar genes, for construction of fusion proteins carrying foreign peptide sequences for display in the M. capsulatus, and preferable on the surface of said bacterium. The term "homology" or "homologous", as used in the present application, does not necessary infer a common evolutionary ancestor/relationship, as homologous sequences may be artificially created. Rather, it is meant to encompass sequences that are similar and have a similar function, that is, sequences likely to be able to perform the same/similar function due to having a degree of sequence similarity (that may be defined as a percentage sequence similarity/identity).

M. capsulatus is a bacterium licensed for use in animal and fish feed. It has no virulent or pathogenic properties, and contains very low amounts of endotoxin (LPS). It is thus well suited as a carrier organism for recombinant oral vaccines, with a potential also for use in humans. Vaccines could be constructed by insertion of fragments of Dl 5 genes from pathogens into the M. capsulatus D 15 gene in order to display a fusion-protein containing parts of the two Dl 5 antigens on the surface of M. capsulatus. The part of the Dl 5 protein originating from the pathogen should trigger an immune response to the respective pathogenic bacterium. If replacement of M. capsulatus -specific Dl 5 sequences with corresponding sequences from the pathogens is well tolerated by the host, larger regions of Dl 5 could be replaced, and if possible, the entire Dl 5 protein could be replaced by the corresponding protein from a pathogen.

Due to the sequence conservation of Dl 5 among distantly related bacteria, exchange of parts of the gene (or the entire gene) without seriously affecting the survival and growth of M. capsulatus is plausible. The specific function of the Dl 5 antigen on the surface of the bacteria is not known, but it possibly plays a structural role and is most probably not involved in any important biochemical processes.

Successful display of the target protein on the cell surface can be detected using a number of methods, for example, if the target peptide can be specifically labeled by a procedure that does not operate through the membrane, its cell surface display can be readily demonstrated.

If the target polypeptide displays enzymatic activity, one may use such activity to demonstrate cell surface display. Antibodies against the target protein may also be used.

The chimeric DNA may be integrated into the host cell chromosome or be carried within a vector. Methods of integrating DNA into a host cell chromosome are well known in the art. The chimeric DNA may also be carried within a recombinant vector, e.g., a plasmid.

Plasmids useful as the vector backbone include plasmids containing replicon and control sequences which are derived from species compatible with the host cell. The vector may also contain an inducible promoter and marker gene, e.g., antibiotic resistance.

Introduction of the chimeric DNA to the host cell may be effected by any method known to those skilled in the art. For example, if a recombinant vector carries the DNA, the vector can be introduced, for example, by transformation, electroporation, or phage transfection.

The detection techniques noted above can be used initially to verify that the method of the present invention is working, i.e., that the fusion surface protein has been expressed and transported to the bacterial cell surface and is orientated so that the target protein is accessible i.e., displayed.

Cells that display the target may be separated from those that do not, using, for example, affinity separation techniques. Such techniques include affinity column chromatography, batch elution from affinity matrix material and fluorescent-activated cell sorting.

MopE is a major outer membrane protein of M capsulatus. It contains surface-exposed regions but its exact folding and association with the cell surface is not known. Under copper limitations, the C-terminal part of the protein is secreted into the growth medium, but considerable amounts of the full-length protein remains associated with the cell surface. By using this protein as an anchor it is possible to mediate translocation of passenger proteins to the cell surface or to the extracellular environment. .

Experimental section

Bacterial strains M. capsulatus Table 1. Strains of M. capsulatus used.

Strain Description and use Reference

M. capsulatus (Bath) wild Used as control and reference. Whittenbury type NCIMB 11132 α/., 1970

M. capsulatus δmopE Contains an inactivated mopE gene and is Fjellbirkelan gentamycin resistant. Used as host for , .. , , p ilasmi -d λ s expressi ■ng mu ,ta * ted λ x M*op iE- pro + tei •n unpublished and as mating-pair recipient in conjugation with E. coli SIl A

E. coli Table 2. Stains of E. coli used.

Strain Description and use Reference

One Shot Genotype: F' [ lacPTnlO (Tet R )] mcrA l{mrr- hsdKMS- Invitrogen TOPlOF' mcrBC) φ80/αcZ?M15 ?/αcX74 recAlaraO\39 l{ara- leu)7697 gα/U galK rpsL (Str R ) endAl nupG

Used as intermediate host for constructed plasmids, and as host for TOPO TA cloning.

S17-1 Genotype: Tp R Sm R recA, thi, pro, /zsdR-M+RP4: 2-Tc:Mu: Simon et a Km Tn7 1 pir 1983

Has genomically inserted tra genes and were used as mating-pair donor in conjugation with M. capsulatus δmopE.

DH5α Genotype: supE44 DlacU169 (F80 lacZDM15) hsdR17 Invitrogen recAl endAl gyrA96 thi-1 relAl

Host for plasmids pAFpglO and pJBrp2.

Plasmids

Table 3. List of plasmids used.

Name Description and use Reference pAFpglO Contained both mopE and Amp R . Used as template for PCR Fjellbirkeland amplification of the truncated mopE gene, mopE H *, and for site-directed ai, 2001 mutagenesis to create mopE genes mutated by substitution, the mopE 1 " genes. pJBrp2 Contained Km R , the copper-sensitive mmoX promoter and the sequence Haugland, encoding the MopE signal sequence. unpublished

Used for PCR amplification of the mmoX promoter and the sequence encoding the MopE signal sequence. pCR ® 2.1 INVITROGEN pCR ® 2.1 TOPOl Cloning intermediate based on pCR ® 2.1 that contain the mmoX promoter This applicatio linked to the sequence encoding the MopE signal sequence. Also contained Km R , Amp R and LacZa disrupted by insertion of the DNA fragment. pCR ® 2.1 TOPO2 Cloning intermediate based on pCR ® 2.1 that contain mopE H *. Also This applicatio contained Km R , Amp R and LacZa disrupted by insertion of the mopE H * gene. pETl ld Intermediate vector used to connect the mmoX promoter, the sequence Stratagene encoding the MopE signal sequence and the mopE H * gene. Also contained Amp R , pBR322 ori, laclq and lac operator. pETl Cloning intermediate based on pETl Id that contain the mmoX promoter This applicatio linked to the sequence encoding the MopE signal sequence. Also contained Amp R . pET2 Cloning intermediate based on pETl Id that contain the mmoX promoter This applicatio linked to the sequence encoding the MopE signal sequence and the mopE H * gene. Also contained Amp R . pBBRl MCS-2 Used as vector for the mopE H * gene. Contained mob genes, thus were Kovach et , mobilizable when tra genes were provided by E. coli S 17-1. Also 1995 contained lacZa, Km R and rep. pBBmopE H Used to express mopE?* in E. coli S 17-1 and to transfer mopE H * to M. This applicatio capsulatus δmopE. Vector based on pBBRlMCS-2 containing a truncated mopE, mopE H *, the mmoX promoter and the sequence encoding the MopE signal sequence. Also contained Km R .

Primers

Table 4. List of primers used.

Name Sequence Use

sMMOprSacI 5'-GTGGAGCCGTTGCCGTTCCGGTTCAGC PCR amplification of GTGTCC- 3' mmoX promoter linked to the sequence encoding the MopE signal sequence

MopEλTjoR 5 ' - TGGCGGTGATCTCGAGCCTGC- 3 ' PCR amplification of mmoX promoter linked to the sequence encoding the MopE signal sequence

spNcoI 5' - AGTGCCTCCATGGGCGGCTG- 3' PCR amplification of the mopE H * gene.

MopE*NcoI 5 ' - CAGCGAACTCCCATGGCCTGGAC- 3 ' PCR amplification of the mopE H * gene.

Eurogentec supplied all primers, except from MopEλTzøR supplied by TAGN Ltd and Ml 3 forward supplied by Invitrogen.

Kits

Table 5. List of kits used.

Kit Use Supplier

QIAQuick Miniprep Purification of plasmid DNA QIAGEN

QIAGEN HiSpeed Midi Plasmid Large scale purification of QIAGEN Purification Kit plasmid

PCR purification Kit Purification of PCR products QIAGEN

TOPO TA cloning Kit Subcloning of PCR products Invitrogen

QIAQuick Gel Extraction Kit Extraction of DNA from QIAGEN agarose gels

ECL western blotting detection system Development of immunoblot Amersham Bioscience

Transfer of plasmid DNA to M capsulatus by conjugation

Presently conjugation is the only method available for transfer of genetic information to Methylococcus. Conjugative transfer require establishment of physical contact between the cells of the mating-pair, the DNA donor and the DNA recipient. Additionally, the donated plasmids must hold mob or tra genes. The plasmid used, pBBRlMCS-2 (Table 3-3), contained mob genes, while E. coli S 17-1 contained tra genes.

Conjugation was performed as described by Lloyd et al (1999) using the plasmids derived from the mobilizable plasmid pBBRlMCS-2 and the mating-pair donor E. coli S17-1.

M. capsulatus and E. coli whole cells were then separated from the spent medium by centrifugation.

Concentration of spent medium proteins by cellulose ultrafiltration

MopE* is the major protein detectable in unconcentrated spent medium of M. capsulatus cultures. Spent medium proteins were concentrated by cellulose ultrafiltration ncinσ th<=» λmirnn(R) T Iltrn- 1 ^ PT - 1 ftO rpntrifiiσntion filtpr rlpvirp Thp filter

used had a nominal molecular weight limit of 10 kDa and maximum sample volume of 15 ml. Spent medium from a 150 ml cultures was concentrated to a final volume of about 200 μl by repeated centrifugations.

Strategy for cloning

Genetic manipulation in M. capsulatus imposes several constraints regarding systems available for genetic transfer. Conjugation is the only method known to be effective in transferring genes to M. capsulatus, thus the conjugative vector, pBBRlMCS-2, was chosen as carrier of the mutated mopE genes. Based on its successful use in prior conjugations to M. capsulatus, E. coli S 17-1 was chosen as plasmid DNA donor. When this study was initiated expression vectors compatible with M. capsulatus were not available. Thus, to enable initiation of transcription in M. capsulatus, a promoter recognizable by this bacterium was connected to the mutated mopE genes in the conjugative plasmids. Moreover, it was desirable that the transcription should be regulated in a relatively easy manner and this led to use of the mmoX promoter. The mmoX promoter initiate transcription of the M. capsulatus (Bath) operon mmoXYBZYC, and its activity is affected by the concentration of copper. The promoter region used was the 335 bp region located immediately upstream of the start-codon of mmoXYBZYC. This region has been shown to be sufficient for the copper-dependent activity of the promoter.

Construction of the mopE H * gene that encodes the MopE H * mutant protein

The mopE gene was contained in pAFpglO (Table 3.), and this plasmid was purified from cells from an E. coli DH5α culture. The deletion mutant of the mopE gene was constructed by PCR amplification (Fig. no. 3.) using the forward primer MopE*NcoI (Table 4.) and the reverse primer MopEXhoR (Table 4.). The resulting amplified fragment (Fig. no. 3. and 4.) was -1.2 kb and consisted of a DNA encoding MopE* (GIy 205 - PrO 540 ), as well as a downstream region containing a Rho-independent transcription terminator. By using primers slightly non-complimentary to their target sequences, flanking Ncol and Xhol recognition sites were introduced in this amplified product. The introduction of the 5' Ncol site resulted in addition of an additional histidine codon to the 5' -end of the mopE* gene. Thus, the amplified gene constructed and the protein encoded by it, was designated mopE H * and MopE H *, respectively.

The mmoX promoter was present in the plasmid pJBrp2 (Table 3.) linked to the sequence encoding the MopE signal sequence. This plasmid was purified from E. coli DH5α cells (Table 2.), and a fragment containing the mmoX promoter linked to the sequence encoding the MopE signal sequence was amplified from pJBrp2 by the PCR using the primers spNcoI (Table 4.) and sMMOprSacI (Table 4. and Fig. no. 3.). The resulting -0.5 kb DNA product (Fig. no. 4.) was flanked by Sad and Ncol restriction sites.

To simplify handling of the amplified fragments, the two amplification products were individually cloned into pCR ® 2.1 -TOPO vectors (Table 3.) by TOPO ® TA cloning producing pCR2.1 -TOPOl and pCR2.1-TOPO2 (Fig. no. 5.). Transformants from both the TOPO ® TA reactions were selected based on their resistance to ampicillin and impaired production of β-galactosidase. A high yield of transformants was obtained from both transformation reactions.

A few single colonies of transformants were picked and cultivated in liquid LB. Cells from the E. coli TOlOF' cultures were harvested, plasmids were purified and analyzed by Ncol and Sad digestion (Fig. no 4. C). All plasmids purified from the TOPO ® TA cloning reaction with the -0.5 kb PCR product were digested into three fragments of lengths -0.5 kb, -1.5 kb and -2.5 kb, while all the plasmids purified from the TOPO ® TA cloning reaction with the -1.2 kb PCR product were digested into three fragments of lengths -1.2 kb, -1.5 kb and -2.5 kb (Fig. no. 4. C lane 1-2, respectively). Thus, the results of the RE-analyses were in agreement with theoretical predictions. One E. coli TOlOF' colony containing pCR2.1-TOPOl and one colony containing pCR2.1-TOPO2 were selected for further analysis. Plasmids from these colonies were purified and sequenced. This sequencing confirmed that the fragment containing the mmoX promoter linked to the MopE signal sequence fragment was contained in pCR ® 2.1 -TOPOl, and that the fragment containing the mopE H * gene was contained in pCR ® 2.1-TOPO2.

Because of incompatibility of RE-sites in plasmid and fragments, the mopE H * gene could not be connected to the mmoX promoter and the sequence encoding the MopE signal sequence directly in the coηjugative vector pBBRlMCS-2. Thus, the fragments should be subcloned in pETl Id (Table 4.). First, the DνA containing the mmoX promoter linked to the sequence encoding the MopE signal sequence should be inserted to pETl Id (Fig. no. 6.) to produce pETl (Table 3.). The DνA containing the mopE H * gene then should be inserted to pETl to produce pET2 (Table 3.). Thus, in pET2 the mmoX promoter should precede the mopE H * gene connected with an upstream sequence encoding the MopE signal sequence (Fig. no. 2.).

As a first step to construct pETl, both pCR ® 2.1 -TOPOl and pETl Id were digested by Ncol and Xbal. The restriction of pCR2.1 -TOPOl produced three fragments with lengths -0.5 kb, ~1.7 kb and 2.3 kb, while restriction of pETl Id produced a -5.8 kb fragment, all in agreement with the theoretical predictions. The -0.5 kb DNA containing the mmoX promoter linked to the sequence encoding the MopE signal sequence and the ~5.9 kb vector fragment were purified from the preparative agarose gel (Fig. no. 7. A lane 1-2), and used in a subsequent ligation reaction. The ligation solution was used to transform E. coli TOPlOF' cells and the resulting transformants were selected based on their resistance to ampicillin. One colony of transformed E. coli TOPlOF' cells was obtained. The transformed colony was cultivated in a 5 ml LB culture. A plasmid, designated pETl, was purified from the cells and analysed by RE digestion. The length of pETl was estimated to be about 6.3 kb by agarose gel electrophoresis (Fig. no. 7. B lane 1). In agreement with theoretical predictions pETl to produced two bands, one of length -5.8 kb and one of -0.5 kb after NcoIIXbal double digestion (Fig. no. 7. B lane 2). Insertion of the mmoX promoter and the sequence encoding the MopE signal sequence were verified by sequencing.

To produce pET2 the DNA containing the mopE H * gene was inserted to pETl (Fig. no. 8. ). The DNA containing the mopE H * gene was excised from pCR ® 2.1-TOPO2 by digestion with Bamlil and Ncol and this resulted in three fragments with apparent lengths of -1.2 kb, ~1.6 kb and -2.3 kb, as theoretically predicted. The plasmid pETl was opened by digestion with BamHl and Ncol and this resulted in a linear vector fragment of about 6.3 kb.

The -1.2 kb DνA containing the mopE H * gene and the -6.3 kb pETl fragment were purified from a preparative agarose gel (Fig. no. 9. A lane 1 and 2, respectively) and used in a subsequent ligation reaction. The ligation solution was used to transform E. coli TOPlOF' cells and transformants were selected based on resistance to ampicillin. A generous number of colonies of transformed E. coli TOPlOF' cells were obtained. A few colonies were picked and cultivated in liquid media for further analysis. Plasmids, designated pET2, were purified from the selected transformed cells and analysed by RE digestion. As predicted theoretically the length of the pET2 was estimated to be -7.4 kb by agarose gel electrophoresis (Fig. no. 9. B lane 1). Double digestion of the plasmid with BamHl and Xbal produced two DνAs, one -1.7 kb and one -5.7 kb fragment (Fig. no. 9. B lane 2), as theoretically predicted. Sequencing confirmed that the mopE H * gene, in pET2, was preceded by the mmoX promoter and the sequence encoding the MopE signal sequence.

From pET2 a DNA fragment containing the mopE H * gene proceeded by the sequence encoding the MopE signal sequence and the mmoX promoter could be excised and transferred to the mobilizable vector pBBRlMCS-2 (Table 3.). This would produce pBBmopE H * (Fig. no. 10.). The pBBmopE H * amino acid sequence is shown in the sequence listing, as Seq. ID. no. 9.

This pET2-DNA fragment was excised from the plasmid by restriction with Hindlll and Xbal. As theoretically predicted this resulted in two DNAs, one of ~2.0 kb and one of -5.5 kb. Also as expected, opening of pBBRlMCS-2 by digestion with Hindlll and Xbal produced a ~5.1 kb vector-DNA. The ~5.1 kb vector-DNA was purified from a preparative agarose gel along with the ~2.0 kb pET2-DNA containing the mmoX promoter linked to the sequence encoding the MopE signal sequence and the mopE H * gene (Fig. no. 11. A lane 1 and 2) and were ligated. The ligation solution was subsequently used to transform E. coli Top 1OF' cells, and transformed cells were selected based on their resistance to kanamycin. A total of 26 colonies of transformed E. coli Top 1OF' cells were obtained. A few colonies of transformants were cultivated in 5 ml cultures for further analysis.

Plasmids, designated pBBmopE H *, were purified from the selected transformed cells and analysed by RE digestion. As theoretically predicted the length of pBBmopE H * was estimated to be about 7.2 kb by agarose gel electrophoresis (Fig. no. 11. B lane 2), while double digestion of pBBmopE H * with BamHl and Xbal produced two DNAs, a -2.0 kb and a -5.1 kb DNA (Fig. no 11. B lane 1). That pBBmopE H * contained a mopE H * gene preceded by the mmoX promoter and the sequence encoding the MopE signal sequence was confirmed by sequencing.

Production of MopE H * in E. coli S 17-1

A previous study in our laboratory has shown that the mmoX promoter is functional in E. coli. The expression of the mutated mopE H * gene was studied in E. coli prior to transfer of the gene to M. capsulatus. E. coli whole cells and spent media were analysed by immunoblotting. E. coli S 17-1 cells harbouring pBBmopE H * were harvested from 50 ml cultures and By immunoblotting one immunoreactive protein migrating according to an apparent molecular mass of about 50 kDa was detected (not shown), thus the protein migrated shorter than wild type MopE* in the gel. Thus the E. coli host cell apparently produced MopE H *, but the host was not able to cleave off the signal peptide. As

expected no immunoreactive proteins were detected in the E. coli S 17-1 cells harbouring pBBRlMCS-2 (not shown).

Spent medium from a culture of E. coli S 17-1 harbouring pBBmopε H * was concentrated. No immunoreactive proteins were detected in the concentrated spent medium. Thus, MopE H * were seemingly not secreted from E. coli S 17-1 in detectable amounts. As expected, the E. coli S 17-1 pBBlMCS-2 did not either secrete immunoreactive proteins.

Production of MopE H * in M. capsulatus δmopE

The pBBmopE H * plasmid was transferred to M. capsulatus:

The plasmid was transferred from E. coli S 17-1 to M capsulatus δmopE by conjugation and M. capsulatus cells transformed by pBBmopE H * were selected by their resistance to kanamycin and gentamycin. A total of four coηjugants were obtained. A few were selected for further analysis. The pBBmopE H * plasmid was purified from the selected transformants and re-sequencing confirmed that no deletions had occurred during the conjugation process.

To study the expression of MopE H * in M. capsulatus the copper-sensitive mmoX promoter was induced by cultivation of M. capsulatus δmopE containing pBBmopE H * in a copper-depleted medium. Cells were separated from the spent medium by centrifugation, No MopE proteins could be detected in the M. capsulatus δmopE cells harbouring the empty conjugative plasmid, and no MopE protein was secreted from these cells.

The spent medium was isolated from the M. capsulatus δmopE pBBmopE H * cell, concentrated and subjected to SDS-PAGE and immunoblotted. One immunoreactive protein was detected (Fig. no. 12. lane 3). This protein migrated as wild type MopE* (Fig. no. 12. lane 1), demonstrating that MopE H * was secreted from M. capsulatus δmopE. Thus, the protein was properly processed in M. capsulatus and was able to cross the OM even though the N-terminal domain had been removed.

In conclusion, MopE H * was expressed both in E. coli S 17-1 and in M. capsulatus

δmopE, but the secretion of MopE * was host specific, as MopE * was detected in the

spent medium of the M. capsulatus culture only. This shows conclusively the abillety of MopE H * to translocate across the outer membrane of M. capsulatus.

The inventors have in a previous application (Norwegian patent application no. 20033176) established a fusion protein of the complete MopE from M. capsulatus and the VP2 protein in of the infectious pancreatic necrosis (IPN) virus. They have also demonstrated that it is possible to express heterologous peptides in M. capsulatus by using the native protein MopE as a fusion partner. These fusion proteins did translocate, and produced immunological active antibodies.

Cloning of nodavirus capsid - MopEH* fusion proteins

In order to ascertain wether MopE * is able not only to translocate itself, but to do so as a functional fusion protein, several constructs with Nodavirus capsid protein were constructed, and the expression thereof was tested.

Atlantic halibut nodavirus is a RNA virus infecting mitochondria of insects or fish. It infect halibut at the larvae or juvenile stage, and mortality rates are up to 100%. Antibodies against AHNV have been previously developed.

The cloning of the fusion protein constructs were achived by conventional methods, including standard lab methods and commercial kits for cloning, mutagenese, immunoblotting etc, using pBBRlMCS2, as described above, as the starting plasmid. This time, instead of cloning in the MopE H * nucleotide sequence alone, the sequence was first modified to comprise capsid protein from Atlantic halibut Nodavirus. In order for the capsid protein DNA sequence to be inserted into the MopEH* sequence, the MopEH* sequence was modified to comprise restriction enzyme sites. One such modified MopEH* sequence contains a BspHI site at the stop codon of MopEH*, another one a Ndel site internally in MopEH*, causing some minor changes in the

MopEH* sequence. The DNA sequences of these two specific modified MopEH* sequences are fiven as Seq. ID no. 16 and 17, respectively. Diagrams thereof are shown in figure 13. Three MopEH* -nodavirus capsid constructs were made, as shown in figure 14.

DNA from Atlantic halibut Nodavirus was a gift from Audun Nerland. Based on the published capsid sequence (Accesion number AJ245641, Seq. ID no 18 in the attatched patentln file) the primers listed in table 6 were ordered from Sigma- Aldrich.

Tableό. List of primers used to create the Atlantic halibut Nodavirus capsid and Mo EH* fusion proteins

Also two new versions of the mopEH* expressions system where made. One version with a BspHl restriction site replacing the stop codon of mopEH* (Seq. ID no 16) and a second version where a Nhel restriction site where mutated into a predicted surface loop of MopEH* (Seq. ID no. 17). The mopEH* plasmids are base don the pBBRl-MCS2 plasmid, and Seq. ID no 19 shows the original, unmodefied pBBRl-mopEH* .

In the first construct, AHNVC-MopEH*, the known sequence of Atlantic halibut

Nodavirus capsid protein (AHNVC) were fused to MopEH* using the Ncol restriction enzyme site at the start of MopEH* giving a protein of 72.4 kDa. The leader sequence should be cleaved off when exported from the cytoplasm giving a protein of 72.5 kDa. The DνA and protein sequences of AHνVC-MopEH* are given as Seq. ID 10 and 11, respectively.

In the second construct, MopEH*-AHνVCc, the predicted surface part of Atlantic halibut Nodavirus capsid protein (AHNVCc) were fused to MopEH* using the BspHl restriction enzyme site at the end of MopEH* giving a protein of 52.5 kDa. The leader sequence should be cleaved off when exported from the cytoplasm giving a protein of 49.5 kDa. The DNA and protein sequences of MopE*-AHNVCc are given as Seq. ID no 12 and 13, respectively.

In the third construct, MopEH*-AHNVC-20, aa a 20 amino acid fragment of Atlantic halibut Nodavirus capsid protein (AHNVC-20aa) were inserted into a predicted surface loop of MopEH* using the Ndel restriction enzyme site giving a protein of 41.5 kDa. The leader sequence should be cleaved off when exported from the cytoplasm giving a protein of 38.5 kDa. The DNA and protein sequences of MopEH*-AHNVC-20aa are given as Seq. ID no 14 and 15, respectively.

Expression of Atlantic halibut Nodavirus capsid MopEH* recombinant proteins

The tree recombinant proteins where conjugated into M. capsulatus AmopEH*. To express the recombinant protein M. capsulatus AmopEH were grown in a low copper medium. At late log phase growth the cultures were harvest and fractionated into:

1 : Spent medium (S)

2: Periplasmic fraction (P)

3: Cytoplasmic fraction (C)

4: Inner membrane (I) 5 : Outer membrane (O)

Figure 15 shows the scheme for localization of the recombinant MopEH*.

The presence of recombinant protein where checked with protein- immunoblot using either AHNV antibodies or MopEH antibodies. The results are given in figure 16. As can bee seen in figure 16 a, for the MopEH*-AHNVCc construct the majority of recombinant MopEH* -AHNVcc was degraded to MopEH*, although some intact fusion protein is left.Fig 16b shows that for the MopEH*-AHNVC 20 aa peptid the MopEH*-AHNVC seems to be misfolded. Figure 16c shows that the AHNVC- MopEH* seems to be intact, allthough the plasmid is a bit unstable.

Figure 17 shows the results of using antibodies against the MopEH* -AHNVC 20 aa peptid. This clearly shows that the translocated MopEH*-AHNVC is antigenic.

In conclusion, the above results thus show that MopEH* fusion proteins can be successfully constructed, and successfully translocated thrugh the outer membrane of M. capsulatus and there expressed.

SEQUENCE LISTING

<110> Stiftelsen Universitetsforskning Bergen

<120> Systems for expression of heterologous proteins in M. capsulatus

<130> P10224PC00

<160> 33

<170> Patentln version 3.4

<210> 1 <211> 540 <212> PRT <213> Methylococcus capsulatus

<400> 1

Met Arg Asp Thr Met Asn GIu Lys His Cys Tyr Ser Leu Leu Ala Ala 1 5 10 15

GIy Leu He Ala Ala VaI Pro GIn Leu Ala Ala Ala His GIy GIy Thr 20 25 30

His Asp VaI Thr Ala VaI Ala His Leu Ser Tyr Ser GIu Ala Tyr Ser 35 40 45

GIu Lys Leu Lys Lys GIy GIn GIu VaI GIy Thr GIu Leu Leu VaI Leu 50 55 60

Asp GIy Arg Phe GIu Phe Asn GIu His VaI GIy Met GIy Asp He Thr 65 70 75 80

Pro Asp Thr Thr Tip Ser Ala VaI VaI GIn GIy GIn Thr Leu Ala Thr 85 90 95

GIy Thr Leu GIy Asp Ala Thr Lys Lys Lys Phe GIy Ala Lys GIy GIy 100 105 110

Met Ala VaI He Asn VaI Pro GIy GIy GIy Thr Leu Lys Phe Thr Trp

115 120 125

Asn Ala Lys Ala He Met Leu Lys Leu Lys Tip Thr GIy GIu Pro Ala 130 135 140

Leu Ala Arg Leu Tyr Lys Asp GIn Asn Thr Ser He Asn Leu Pro GIn 145 150 155 160

Phe Pro VaI Asp He Ala He GIy Ser Leu His GIy Tyr Phe Asn VaI 165 170 175

Pro VaI Thr GIy GIn Ala Lys Ala Thr Thr Lys Asn GIy Thr Met Leu 180 185 190

Ser Lys He Ala Leu Lys GIy Thr Ala Asn Ser Ala GIy Leu Asp Thr 195 200 205

Leu Asp Arg Asp GIy Asp GIy Ser Thr Ala Asp Ala Asp Cys Asn Asp 210 215 220

Phe Ala Pro Thr He His Pro GIy Ala Ala GIu Ala Thr Leu Asp GIy

225 230 235 240

VaI Asp Ser Asn Cys Asp GIy Arg Asp Ser GIy VaI Ala GIu VaI VaI

245 250 255

GIu Thr Phe Lys Asn Pro GIy Thr Tyr Ser Ser Pro VaI He Asn Phe 260 265 270

Lys He Ala Ser Pro Pro GIy Pro GIy Thr Pro He Tyr GIy Pro Pro 275 280 285

Arg Asp Phe Ser GIy Tyr Asn Lys Ser Tyr Ser Leu Ala He GIy Lys 290 295 300

Thr Ser Tyr Tyr Asp Pro Thr Thr GIy Thr Lys Trp Asn Asp Asp Thr 305 310 315 320

He Thr Pro VaI Ser Asp GIy GIn Asp He Trp Arg GIy Trp Thr His

325 330 335

Thr GIy Lys Trp Ser Phe Phe Asn GIy Lys Ala GIy Asp Lys He Thr 340 345 350

Leu Ser VaI GIn Arg Asp Ala GIn GIu Ala Ser Leu Lys GIy Ala His 355 360 365

Pro GIy Phe He Leu Phe Trp Arg Pro GIu GIy GIy Pro Leu Phe Trp 370 375 380

Ala GIy Thr GIn Asp Leu Asp GIu GIy GIn Thr Ala Leu Pro Ala Asp 385 390 395 400

Ser Asp Thr VaI He GIy His VaI He VaI GIn His Ala Asp Trp Thr 405 410 415

Leu GIn GIy Leu Pro Pro Lys Ala Asp His Thr Ala Pro Ala GIy VaI

420 425 430

Asp Thr GIu Leu Tyr Pro Met Lys Pro Asp Ser Tyr Thr Met Tyr Tyr 435 440 445

VaI Asp Ser GIy Tyr Asp Ala Asp Lys Tyr VaI Ala Ser Lys Lys Leu 450 455 460

He Met His Pro Thr Ala Phe Lys GIy Leu Ala Leu Asn Asp GIy Thr 465 470 475 480

Ala GIy Ala Phe Thr Lys Ser He Thr Leu Pro Lys Thr GIy Tyr Tyr 485 490 495

Met Leu Tyr VaI Ala Asn VaI Leu GIu VaI Asp Asp Trp Ser VaI Asp 500 505 510

Ala Asp GIy Lys Leu Thr Thr Thr GIy GIu VaI Trp GIu VaI Pro Ala 515 520 525

Lys GIy Cys Trp VaI Asn lie Thr He Ser Lys Pro 530 535 540

<210> 2 <211> 337 <212> PRT <213> Methylococcus capsulatus

<400> 2

His GIy Leu Asp Thr Leu Asp Arg Asp GIy Asp GIy Ser Thr Ala Asp 1 5 10 15

Ala Asp Cys Asn Asp Phe Ala Pro Thr lie His Pro GIy Ala Ala GIu 20 25 30

Ala Thr Leu Asp GIy VaI Asp Ser Asn Cys Asp GIy Arg Asp Ser GIy 35 40 45

VaI Ala GIu VaI VaI GIu Thr Phe Lys Asn Pro GIy Thr Tyr Ser Ser 50 55 60

Pro VaI He Asn Phe Lys He Ala Ser Pro Pro GIy Pro GIy Thr Pro 65 70 75 80

He Tyr GIy Pro Pro Arg Asp Phe Ser GIy Tyr Asn Lys Ser Tyr Ser 85 90 95

Leu Ala He GIy Lys Thr Ser Tyr Tyr Asp Pro Thr Thr GIy Thr Lys 100 105 110

Trp Asn Asp Asp Thr He Thr Pro VaI Ser Asp GIy GIn Asp He Trp 115 120 125

Arg GIy Trp Thr His Thr GIy Lys Trp Ser Phe Phe Asn GIy Lys Ala

130 135 140

GIy Asp Lys He Thr Leu Ser VaI GIn Arg Asp Ala GIn GIu Ala Ser 145 150 155 160

Leu Lys GIy Ala His Pro GIy Phe He Leu Phe Trp Arg Pro GIu GIy 165 170 175

GIy Pro Leu Phe Trp Ala GIy Thr GIn Asp Leu Asp GIu GIy GIn Thr 180 185 190

Ala Leu Pro Ala Asp Ser Asp Thr VaI He GIy His VaI He VaI GIn 195 200 205

His Ala Asp Trp Thr Leu GIn GIy Leu Pro Pro Lys Ala Asp His Thr 210 215 220

Ala Pro Ala GIy VaI Asp Thr GIu Leu Tyr Pro Met Lys Pro Asp Ser

225 230 235 240

Tyr Thr Met Tyr Tyr VaI Asp Ser GIy Tyr Asp Ala Asp Lys Tyr VaI

245 250 255

Ala Ser Lys Lys Leu He Met His Pro Thr Ala Phe Lys GIy Leu Ala 260 265 270

Leu Asn Asp GIy Thr Ala GIy Ala Phe Thr Lys Ser He Thr Leu Pro 275 280 285

Lys Thr GIy Tyr Tyr Met Leu Tyr VaI Ala Asn VaI Leu GIu VaI Asp 290 295 300

Asp Trp Ser VaI Asp Ala Asp GIy Lys Leu Thr Thr Thr GIy GIu VaI 305 310 315 320

Trp GIu VaI Pro Ala Lys GIy Cys Trp VaI Asn He Thr He Ser Lys

325 330 335

Pro

<210> 3

<211> 1620

<212> DNA <213> Methylococcus capsulatus

<400> 3 atgagagaca ccatgaacga aaagcattgc tactccttac tggccgccgg cctcatcgcc 60

gccgtgccgc aactcgcagc cgcccacgga ggcactcacg acgtcaccgc ggtcgcccac 120

ctgagctatt cggaagccta ttcggaaaag ctcaagaagg gtcaggaagt gggcaccgag 180

ctgctggtgc tggacggacg cttcgaattc aacgaacacg tcggcatggg tgacatcacc 240

ccggacacga cctggtcggc cgtcgtgcag ggccagaccc tggcaacggg taccctgggc 300

gacgccacca agaagaagtt cggcgccaag ggcggcatgg cggtgatcaa cgtgcccggc 360

ggcggtacgc tcaaattcac ctggaacgcc aaggccatca tgctcaagct gaaatggacc 420

ggcgagccgg ccttggcccg gctgtacaag gaccagaaca ccagcatcaa tctgccgcaa 480

ttcccggtcg acatcgcgat cggcagtctg cacggctact tcaacgttcc ggtcaccggc 540

caggcgaagg ccacgaccaa gaacggcacc atgctgtcca agatcgcctt gaaaggcaca 600

gcgaactccg cgggcctgga cacgctggac cgggacggcg acggctccac ggccgacgcc 660

gattgcaacg acttcgcgcc caccatccat ccgggcgccg ccgaagcgac gctggacggc 720

gtggattcca actgcgacgg gcgcgactcc ggcgtggcgg aagtcgtcga gaccttcaag 780

aatccgggca cctactccag cccggtcatc aacttcaaga tcgcttcgcc gccggggccg 840

ggaacgccca tctacgggcc gccgcgtgat ttctccggtt acaacaagag ctactcgctg 900

gcgatcggca agacctcgta ctacgatccg accaccggca ccaagtggaa cgacgacacc 960

atcacgccgg tcagtgatgg tcaggacatc tggcgcggct ggacccatac cggcaagtgg 1020

tcgttcttca acggcaaggc cggcgacaag atcaccctca gcgtacagcg tgatgcgcag 1080

gaagccagcc tgaaaggcgc ccatccgggc ttcatcctgt tctggcggcc cgagggcggt 1140

ccgctgttct gggccggcac ccaggatctc gacgagggcc agaccgcgct gcccgccgac 1200

tccgacaccg ttatcggcca cgtgatcgtt cagcacgccg actggaccct gcagggcttg 1260

ccgcccaagg ccgaccatac cgcacccgcg ggcgtggata ccgagctcta tcccatgaag 1320

ccggacagct acaccatgta ctacgtcgac tccggctacg atgccgacaa gtacgtggca 1380

tcgaagaagc tcatcatgca ccccacggcg ttcaaagggc tggccctgaa cgacggcacc 1440

gccggggcgt tcaccaagtc catcaccctg ccgaagacgg gctattacat gctgtacgtc 1500

gccaacgtcc tggaagtgga cgactggagc gtcgacgcgg acggcaagct caccaccacc 1560

ggcgaagtct gggaagtgcc ggccaagggc tgctgggtca acatcacgat ctccaagccg 1620

<210> 4 <211> 1011 <212> DNA <213> Methylococcus capsulatus

<400> 4 catggcctgg acacgctgga ccgggacggc gacggctcca cggccgacgc cgattgcaac 60

gacttcgcgc ccaccatcca tccgggcgcc gccgaagcga cgctggacgg cgtggattcc 120

aactgcgacg ggcgcgactc cggcgtggcg gaagtcgtcg agaccttcaa gaatccgggc 180

acctactcca gcccggtcat caacttcaag atcgcttcgc cgccggggcc gggaacgccc 240

atctacgggc cgccgcgtga tttctccggt tacaacaaga gctactcgct ggcgatcggc 300

aagacctcgt actacgatcc gaccaccggc accaagtgga acgacgacac catcacgccg 360

gtcagtgatg gtcaggacat ctggcgcggc tggacccata ccggcaagtg gtcgttcttc 420

aacggcaagg ccggcgacaa gatcaccctc agcgtacagc gtgatgcgca ggaagccagc 480

ctgaaaggcg cccatccggg cttcatcctg ttctggcggc ccgagggcgg tccgctgttc 540

tgggccggca cccaggatct cgacgagggc cagaccgcgc tgcccgccga ctccgacacc 600

gttatcggcc acgtgatcgt tcagcacgcc gactggaccc tgcagggctt gccgcccaag 660

gccgaccata ccgcacccgc gggcgtggat accgagctct atcccatgaa gccggacagc 720

tacaccatgt actacgtcga ctccggctac gatgccgaca agtacgtggc atcgaagaag 780

ctcatcatgc accccacggc gttcaaaggg ctggccctga acgacggcac cgccggggcg 840

ttcaccaagt ccatcaccct gccgaagacg ggctattaca tgctgtacgt cgccaacgtc 900

ctggaagtgg acgactggag cgtcgacgcg gacggcaagc tcaccaccac cggcgaagtc 960

tgggaagtgc cggccaaggg ctgctgggtc aacatcacga tctccaagcc g 1011

<210> 5

<211> 6165

<212> DNA

<213> Methylococcus capsulatus

<400> 5 taatacgact cactataggg gaattgtgag cggataacaa ttcccctcta gatgcatgct 60

cgagcggccg ccagtgtgat ggatatctgc agaattcgcc cttggcttgt attgagctca 120

tccgaagata agcgctttcc gcgcagcccg attctttcat ggatcacgat tccattgaat 180

gcggcgaaag tctcagggtc cggtcatgaa tgaagagtta tggcggccca gtacgtcacc 240

gttatgtccg atggctgtat caaacaaaga cacgtgtagt gatatcggac aactcgtcca 300

tccccgtcgg agcattcgga taacgtgctc atcgttccaa aatattgata tacggtatac 360

gtatccgaag aataaagttg gcacgatccc tgtaactagg ttgtcacgac ctcgtcggag 420

gttgtatgtc cggtgttccg tgacgtcatc gggcattcat cattcataga atgtgttacg 480

gaggaaacaa catatgagag acaccatgaa cgaaaagcat tgctactcct tactggccgc 540

cggcctcatc gccgccgtgc cgcaactcgc agccgcccat ggctagcatg actggtggac 600

agcaaatggg tcggatccgg ctgctaacaa agcccgaaag gaagctgagt tggctgctgc 660

caccgctgag caataactag cataacccct tggggcctct aaacgggtct tgaggggttt 720

tttgctgaaa ggaggaacta tatccggata tcccgcaaga ggcccggcag taccggcata 780

accaagccta tgcctacagc atccagggtg acggtgccga ggatgacgat gagcgcattg 840

ttagatttca tacacggtgc ctgactgcgt tagcaattta actgtgataa actaccgcat 900

taaagcttat cgatgataag ctgtcaaaca tgagaattct tgaagacgaa agggcctcgt 960

gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga cgtcaggtgg 1020

cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 1080 tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 1140 gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 1200 tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 1260 tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 1320 ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 1380 atcccgtgtt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 1440 cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 1500 attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 1560 gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 1620 ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 1680 gatgcctgca gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 1740 agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 1800 gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 1860 gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 1920 ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 1980 tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 2040 tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 2100 catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 2160 gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 2220 aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 2280 gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta 2340 gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 2400 gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 2460

atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 2520 cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 2580 cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 2640 agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 2700 tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 2760 gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 2820 catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 2880 agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 2940 ggaagagcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat 3000 atatggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca gtatacactc 3060 cgctatcgct acgtgactgg gtcatggctg cgccccgaca cccgccaaca cccgctgacg 3120 cgccctgacg ggcttgtctg ctcccggcat ccgcttacag acaagctgtg accgtctccg 3180 ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa acgcgcgagg cagctgcggt 3240 aaagctcatc agcgtggtcg tgaagcgatt cacagatgtc tgcctgttca tccgcgtcca 3300 gctcgttgag tttctccaga agcgttaatg tctggcttct gataaagcgg gccatgttaa 3360 gggcggtttt ttcctgtttg gtcactgatg cctccgtgta agggggattt ctgttcatgg 3420 gggtaatgat accgatgaaa cgagagagga tgctcacgat acgggttact gatgatgaac 3480 atgcccggtt actggaacgt tgtgagggta aacaactggc ggtatggatg cggcgggacc 3540 agagaaaaat cactcagggt caatgccagc gcttcgttaa tacagatgta ggtgttccac 3600 agggtagcca gcagcatcct gcgatgcaga tccggaacat aatggtgcag ggcgctgact 3660 tccgcgtttc cagactttac gaaacacgga aaccgaagac cattcatgtt gttgctcagg 3720 tcgcagacgt tttgcagcag cagtcgcttc acgttcgctc gcgtatcggt gattcattct 3780 gctaaccagt aaggcaaccc cgccagccta gccgggtcct caacgacagg agcacgatca 3840 tgcgcacccg tggccaggac ccaacgctgc ccgagatgcg ccgcgtgcgg ctgctggaga 3900 tggcggacgc gatggatatg ttctgccaag ggttggtttg cgcattcaca gttctccgca 3960

agaattgatt ggctccaatt cttggagtgg tgaatccgtt agcgaggtgc cgccggcttc 4020 cattcaggtc gaggtggccc ggctccatgc accgcgacgc aacgcgggga ggcagacaag 4080 gtatagggcg gcgcctacaa tccatgccaa cccgttccat gtgctcgccg aggcggcata 4140 aatcgccgtg acgatcagcg gtccagtgat cgaagttagg ctggtaagag ccgcgagcga 4200 tccttgaagc tgtccctgat ggtcgtcatc tacctgcctg gacagcatgg cctgcaacgc 4260 gggcatcccg atgccgccgg aagcgagaag aatcataatg gggaaggcca tccagcctcg 4320 cgtcgcgaac gccagcaaga cgtagcccag cgcgtcggcc gccatgccgg cgataatggc 4380 ctgcttctcg ccgaaacgtt tggtggcggg accagtgacg aaggcttgag cgagggcgtg 4440 caagattccg aataccgcaa gcgacaggcc gatcatcgtc gcgctccagc gaaagcggtc 4500 ctcgccgaaa atgacccaga gcgctgccgg cacctgtcct acgagttgca tgataaagaa 4560 gacagtcata agtgcggcga cgatagtcat gccccgcgcc caccggaagg agctgactgg 4620 gttgaaggct ctcaagggca tcggtcgaga tcccggtgcc taatgagtga gctaacttac 4680 attaattgcg ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca 4740 ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt attgggcgcc agggtggttt 4800 ttcttttcac cagtgagacg ggcaacagct gattgccctt caccgcctgg ccctgagaga 4860 gttgcagcaa gcggtccacg ctggtttgcc ccagcaggcg aaaatcctgt ttgatggtgg 4920 ttaacggcgg gatataacat gagctgtctt cggtatcgtc gtatcccact accgagatat 4980 ccgcaccaac gcgcagcccg gactcggtaa tggcgcgcat tgcgcccagc gccatctgat 5040 cgttggcaac cagcatcgca gtgggaacga tgccctcatt cagcatttgc atggtttgtt 5100 gaaaaccgga catggcactc cagtcgcctt cccgttccgc tatcggctga atttgattgc 5160 gagtgagata tttatgccag ccagccagac gcagacgcgc cgagacagaa cttaatgggc 5220 ccgctaacag cgcgatttgc tggtgaccca atgcgaccag atgctccacg cccagtcgcg 5280 taccgtcttc atgggagaaa ataatactgt tgatgggtgt ctggtcagag acatcaagaa 5340 ataacgccgg aacattagtg caggcagctt ccacagcaat ggcatcctgg tcatccagcg 5400

gatagttaat gatcagccca ctgacgcgtt gcgcgagaag attgtgcacc gccgctttac 5460

aggcttcgac gccgcttcgt tctaccatcg acaccaccac gctggcaccc agttgatcgg 5520

cgcgagattt aatcgccgcg acaatttgcg acggcgcgtg cagggccaga ctggaggtgg 5580

caacgccaat cagcaacgac tgtttgcccg ccagttgttg tgccacgcgg ttgggaatgt 5640

aattcagctc cgccatcgcc gcttccactt tttcccgcgt tttcgcagaa acgtggctgg 5700

cctggttcac cacgcgggaa acggtctgat aagagacacc ggcatactct gcgacatcgt 5760

ataacgttac tggtttcaca ttcaccaccc tgaattgact ctcttccggg cgctatcatg 5820

ccataccgcg aaaggttttg cgccattcga tggtgtccgg gatctcgacg ctctccctta 5880

tgcgactcct gcattaggaa gcagcccagt agtaggttga ggccgttgag caccgccgcc 5940

gcaaggaatg gtgcatgcaa ggagatggcg cccaacagtc ccccggccac ggggcctgcc 6000

accataccca cgccgaaaca agcgctcatg agcccgaagt ggcgagcccg atcttcccca 6060

tcggtgatgt cggcgatata ggcgccagca accgcacctg tggcgccggt gatgccggcc 6120

acgatgcgtc cggcgtagag gatcgagatc tcgatcccgc gaaat 6165

<210> 6 <211> 7358 <212> DNA

<213> Methylococcus capsulatus

<400> 6 taatacgact cactataggg gaattgtgag cggataacaa ttcccctcta gatgcatgct 60

cgagcggccg ccagtgtgat ggatatctgc agaattcgcc ctttaagagc tcatccgaag 120

ataagcgctt tccgcgcagc ccgattcttt catggatcac gattccattg aatgcggcga 180

aagtctcagg gtccggtcat gaatgaagag ttatggcggc ccagtacgtc accgttatgt 240

ccgatggctg tatcaaacaa agacacgtgt agtgatatcg gacaactcgt ccatccccgt 300

cggagcattc ggataacgtg ctcatcgttc caaaatattg atatacggta tacgtatccg 360

aagaataaag ttggcacgat ccctgtaact aggttgtcac gacctcgtcg gaggttgtat 420

gtccggtgtt ccgtgacgtc atcgggcatt catcattcat agaatgtgtt acggaggaaa 480

caacatatga gagacaccat gaacgaaaag cattgctact ccttactggc cgccggcctc 540 atcgccgccg tgccgcaact cgcagccgcc catggcctgg acacgctgga ccgggacggc 600 gacggctcca cggccgacgc cgattgcaac gacttcgcgc ccaccatcca tccgggcgcc 660 gccgaagcga cgctggacgg cgtggattcc aactgcgacg ggcgcgactc cggcgtggcg 720 gaagtcgtcg agaccttcaa gaatccgggc acctactcca gcccggtcat caacttcaag 780 atcgcttcgc cgccggggcc gggaacgccc atctacgggc cgccgcgtga tttctccggt 840 tacaacaaga gctactcgct ggcgatcggc aagacctcgt actacgatcc gaccaccggc 900 accaagtgga acgacgacac catcacgccg gtcagtgatg gtcaggacat ctggcgcggc 960 tggacccata ccggcaagtg gtcgttcttc aacggcaagg ccggcgacaa gatcaccctc 1020 igcgtacagc gtgatgcgca ggaagccagc ctgaaaggcg cccatccggg cttcatcctg 1080 ttctggcggc ccgagggcgg tccgctgttc tgggccggca cccaggatct cgacgagggc 1140 cagaccgcgc tgcccgccga ctccgacacc gttatcggcc acgtgatcgt tcagcacgcc 1200 gactggaccc tgcagggctt gccgcccaag gccgaccata ccgcacccgc gggcgtggat 1260 accgagctct atcccatgaa gccggacagc tacaccatgt actacgtcga ctccggctac 1320 gatgccgaca agtacgtggc atcgaagaag ctcatcatgc accccacggc gttcaaaggg 1380 ctggccctga acgacggcac cgccggggcg ttcaccaagt ccatcaccct gccgaagacg 1440 ggctattaca tgctgtacgt cgccaacgtc ctggaagtgg acgactggag cgtcgacgcg 1500 gacggcaagc tcaccaccac cggcgaagtc tgggaagtgc cggccaaggg ctgctgggtc 1560 aacatcacga tctccaagcc gtaaatccgg cttgatgtgc gtgatccttc ggagcggcca 1620 gtcgaccgtt ccgcggggaa aacttccgga gccaatcccc gtcctctgcg gcggggattt 1680 ttttattcgg cggaaccgag ttgggcgacg gctttttcca gcttcttcaa ccggctccag 1740 atttcgtgca ggctcgagat caccgccaaa gggcgaattc cagcacactg gcggccgtta 1800 ctagtggatc cggctgctaa caaagcccga aaggaagctg agttggctgc tgccaccgct 1860 gagcaataac tagcataacc ccttggggcc tctaaacggg tcttgagggg ttttttgctg 1920 aaaggaggaa ctatatccgg atatcccgca agaggcccgg cagtaccggc ataaccaagc 1980

ctatgcctac agcatccagg gtgacggtgc cgaggatgac gatgagcgca ttgttagatt 2040 tcatacacgg tgcctgactg cgttagcaat ttaactgtga taaactaccg cattaaagct 2100 tatcgatgat aagctgtcaa acatgagaat tcttgaagac gaaagggcct cgtgatacgc 2160 ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg tggcactttt 2220 cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat 2280 ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg 2340 agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt 2400 tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga 2460 gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa 2520 gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt 2580 gttgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt 2640 gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc 2700 agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga 2760 ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat 2820 cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct 2880 gcagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc 2940 cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg 3000 gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc 3060 ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg 3120 acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca 3180 ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta gattgattta 3240 aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc 3300 aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa 3360 ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca 3420

ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta 3480 actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc 3540 caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca 3600 gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta 3660 ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag 3720 cgaacgacct acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt 3780 cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc 3840 acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac 3900 ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac 3960 gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc 4020 tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat 4080 accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag 4140 cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg catatatggt 4200 gcactctcag tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc 4260 gctacgtgac tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg 4320 acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg 4380 catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg aggcagctgc ggtaaagctc 4440 atcagcgtgg tcgtgaagcg attcacagat gtctgcctgt tcatccgcgt ccagctcgtt 4500 gagtttctcc agaagcgtta atgtctggct tctgataaag cgggccatgt taagggcggt 4560 tttttcctgt ttggtcactg atgcctccgt gtaaggggga tttctgttca tgggggtaat 4620 gataccgatg aaacgagaga ggatgctcac gatacgggtt actgatgatg aacatgcccg 4680 gttactggaa cgttgtgagg gtaaacaact ggcggtatgg atgcggcggg accagagaaa 4740 aatcactcag ggtcaatgcc agcgcttcgt taatacagat gtaggtgttc cacagggtag 4800 ccagcagcat cctgcgatgc agatccggaa cataatggtg cagggcgctg acttccgcgt 4860 ttccagactt tacgaaacac ggaaaccgaa gaccattcat ettεttεctc aggtcgcaga 4920

cgttttgcag cagcagtcgc ttcacgttcg ctcgcgtatc ggtgattcat tctgctaacc 4980 agtaaggcaa ccccgccagc ctagccgggt cctcaacgac aggagcacga tcatgcgcac 5040 ccgtggccag gacccaacgc tgcccgagat gcgccgcgtg cggctgctgg agatggcgga 5100 cgcgatggat atgttctgcc aagggttggt ttgcgcattc acagttctcc gcaagaattg 5160 attggctcca attcttggag tggtgaatcc gttagcgagg tgccgccggc ttccattcag 5220 gtcgaggtgg cccggctcca tgcaccgcga cgcaacgcgg ggaggcagac aaggtatagg 5280 gcggcgccta caatccatgc caacccgttc catgtgctcg ccgaggcggc ataaatcgcc 5340 gtgacgatca gcggtccagt gatcgaagtt aggctggtaa gagccgcgag cgatccttga 5400 agctgtccct gatggtcgtc atctacctgc ctggacagca tggcctgcaa cgcgggcatc 5460 ccgatgccgc cggaagcgag aagaatcata atggggaagg ccatccagcc tcgcgtcgcg 5520 aacgccagca agacgtagcc cagcgcgtcg gccgccatgc cggcgataat ggcctgcttc 5580 tcgccgaaac gtttggtggc gggaccagtg acgaaggctt gagcgagggc gtgcaagatt 5640 ccgaataccg caagcgacag gccgatcatc gtcgcgctcc agcgaaagcg gtcctcgccg 5700 aaaatgaccc agagcgctgc cggcacctgt cctacgagtt gcatgataaa gaagacagtc 5760 ataagtgcgg cgacgatagt catgccccgc gcccaccgga aggagctgac tgggttgaag 5820 gctctcaagg gcatcggtcg agatcccggt gcctaatgag tgagctaact tacattaatt 5880 gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga 5940 atcggccaac gcgcggggag aggcggtttg cgtattgggc gccagggtgg tttttctttt 6000 caccagtgag acgggcaaca gctgattgcc cttcaccgcc tggccctgag agagttgcag 6060 caagcggtcc acgctggttt gccccagcag gcgaaaatcc tgtttgatgg tggttaacgg 6120 cgggatataa catgagctgt cttcggtatc gtcgtatccc actaccgaga tatccgcacc 6180 aacgcgcagc ccggactcgg taatggcgcg cattgcgccc agcgccatct gatcgttggc 6240 aaccagcatc gcagtgggaa cgatgccctc attcagcatt tgcatggttt gttgaaaacc 6300 ggacatggca ctccagtcgc cttcccgttc cgctatcggc tgaatttgat tgcgagtgag 6360

atatttatgc cagccagcca gacgcagacg cgccgagaca gaacttaatg ggcccgctaa 6420

cagcgcgatt tgctggtgac ccaatgcgac cagatgctcc acgcccagtc gcgtaccgtc 6480

ttcatgggag aaaataatac tgttgatggg tgtctggtca gagacatcaa gaaataacgc 6540

cggaacatta gtgcaggcag cttccacagc aatggcatcc tggtcatcca gcggatagtt 6600

aatgatcagc ccactgacgc gttgcgcgag aagattgtgc accgccgctt tacaggcttc 6660

gacgccgctt cgttctacca tcgacaccac cacgctggca cccagttgat cggcgcgaga 6720

tttaatcgcc gcgacaattt gcgacggcgc gtgcagggcc agactggagg tggcaacgcc 6780

aatcagcaac gactgtttgc ccgccagttg ttgtgccacg cggttgggaa tgtaattcag 6840

ctccgccatc gccgcttcca ctttttcccg cgttttcgca gaaacgtggc tggcctggtt 6900

caccacgcgg gaaacggtct gataagagac accggcatac tctgcgacat cgtataacgt 6960

tactggtttc acattcacca ccctgaattg actctcttcc gggcgctatc atgccatacc 7020

gcgaaaggtt ttgcgccatt cgatggtgtc cgggatctcg acgctctccc ttatgcgact 7080

cctgcattag gaagcagccc agtagtaggt tgaggccgtt gagcaccgcc gccgcaagga 7140

atggtgcatg caaggagatg gcgcccaaca gtcccccggc cacggggcct gccaccatac 7200

ccacgccgaa acaagcgctc atgagcccga agtggcgagc ccgatcttcc ccatcggtga 7260

tgtcggcgat ataggcgcca gcaaccgcac ctgtggcgcc ggtgatgccg gccacgatgc 7320

gtccggcgta gaggatcgag atctcgatcc cgcgaaat 7358

<210> 7

<211> 4409

<212> DNA

<213> Methylococcus capsulatus

<400> 7 agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 60

acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc 120

tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 180

ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac gccaagcttg 240

gtaccgagct cggatccact agtaacggcc gccagtgtgc tggaattcgc ccttggagct 300 catccgaaga taagcgcttt ccgcgcagcc cgattctttc atggatcacg attccattga 360 atgcggcgaa agtctcaggg tccggtcatg aatgaagagt tatggcggcc cagtacgtca 420 ccgttatgtc cgatggctgt atcaaacaaa gacacgtgta gtgatatcgg acaactcgtc 480 catccccgtc ggagcattcg gataacgtgc tcatcgttcc aaaatattga tatacggtat 540 acgtatccga agaataaagt tggcacgatc cctgtaacta ggttgtcacg acctcgtcgg 600 aggttgtatg tccggtgttc cgtgacgtca tcgggcattc atcattcata gaatgtgtta 660 cggaggaaac aacatatgag agacaccatg aacgaaaagc attgctactc cttactggcc 720 gccggcctca tcgccgccgt gccgcaactc gcagccgccc atggcctgga ttaagggcga 780 attctgcaga tatccatcac actggcggcc gctcgagcat gcatctagag ggcccaattc 840 gccctatagt gagtcgtatt acaattcact ggccgtcgtt ttacaacgtc gtgactggga 900 aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 960 taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 1020 atggacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg 1080 accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc 1140 gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga 1200 tttagtgctt tacggcacct cgaccccaaa aaacttgatt agggtgatgg ttcacgtagt 1260 gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat 1320 agtggactct tgttccaaac tggaacaaca ctcaacccta tctcggtcta ttcttttgat 1380 ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa 1440 tttaacgcga attttaacaa aattcagggc gcaagggctg ctaaaggaag cggaacacgt 1500 agaaagccag tccgcagaaa cggtgctgac cccggatgaa tgtcagctac tgggctatct 1560 ggacaaggga aaacgcaagc gcaaagagaa agcaggtagc ttgcagtggg cttacatggc 1620 gatagctaga ctgggcggtt ttatggacag caagcgaacc ggaattgcca gctggggcgc 1680 cctctggtaa ggttgggaag ccctgcaaag taaactggat ggctttcttg ccgccaagga 1740

tctgatggcg caggggatca agatctgatc aagagacagg atgaggatcg tttcgcatga 1800 ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg ctattcggct 1860 atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg ctgtcagcgc 1920 aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat gaactgcagg 1980 acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca gctgtgctcg 2040 acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg gggcaggatc 2100 tcctgtcatc ccaccttgct cctgccgaga aagtatccat catggctgat gcaatgcggc 2160 ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa catcgcatcg 2220 agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg gacgaagagc 2280 atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg cccgacggcg 2340 aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg gaaaatggcc 2400 gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat caggacatag 2460 cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac cgcttcctcg 2520 tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc cttcttgacg 2580 agttcttctg aattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat 2640 tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt 2700 aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag 2760 cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa 2820 agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg 2880 ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct 2940 tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac 3000 tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca 3060 caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat 3120 accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact 3180

attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc 3240

ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga 3300

taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg 3360

taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg 3420

aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca 3480

agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta 3540

ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca 3600

ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg 3660

cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga 3720

tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa 3780

tactgttctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc 3840

tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg 3900

tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac 3960

ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct 4020

acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc 4080

ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg 4140

gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg 4200

ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct 4260

ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga 4320

taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg 4380

cagcgagtca gtgagcgagg aagcggaag 4409

<210> 8 <211> 5132

<212> DNA

<213> Methylococcus capsulatus

<400> 8

agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 60 acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc 120 tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 180 ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac gccaagcttg 240 gtaccgagct cggatccact agtaacggcc gccagtgtgc tggaattcgc ccgcagccgc 300 ccatggcctg gacacgctgg accgggacgg cgacggctcc acggccgacg ccgattgcaa 360 cgacttcgcg cccaccatcc atccgggcgc cgccgaagcg acgctggacg gcgtggattc 420 caactgcgac gggcgcgact ccggcgtggc ggaagtcgtc gagaccttca agaatccggg 480 cacctactcc agcccggtca tcaacttcaa gatcgcttcg ccgccggggc cgggaacgcc 540 catctacggg ccgccgcgtg atttctccgg ttacaacaag agctactcgc tggcgatcgg 600 caagacctcg tactacgatc cgaccaccgg caccaagtgg aacgacgaca ccatcacgcc 660 ggtcagtgat ggtcaggaca tctggcgcgg ctggacccat accggcaagt ggtcgttctt 720 caacggcaag gccggcgaca agatcaccct cagcgtacag cgtgatgcgc aggaagccag 780 cctgaaaggc gcccatccgg gcttcatcct gttctggcgg cccgagggcg gtccgctgtt 840 ctgggccggc acccaggatc tcgacgaggg ccagaccgcg ctgcccgccg actccgacac 900 cgttatcggc cacgtgatcg ttcagcacgc cgactggacc ctgcagggct tgccgcccaa 960 ggccgaccat accgcacccg cgggcgtgga taccgagctc tatcccatga agccggacag 1020 ctacaccatg tactacgtcg actccggcta cgatgccgac aagtacgtgg catcgaagaa 1080 gctcatcatg caccccacgg cgttcaaagg gctggccctg aacgacggca ccgccggggc 1140 gttcaccaag tccatcaccc tgccgaagac gggctattac atgctgtacg tcgccaacgt 1200 cctggaagtg gacgactgga gcgtcgacgc ggacggcaag ctcaccacca ccggcgaagt 1260 ctgggaagtg ccggccaagg gctgctgggt caacatcacg atctccaagc cgtaaatccg 1320 gcttgatgtg cgtgatcctt cggagcggcc agtcgaccgt tccgcgggga aaacttccgg 1380 agccaatccc cgtcctctgc ggcggggatt tttttattcg gcggaaccga gttgggcgac 1440 ggctttttcc agcttcttca accggctcca gatttcgtgc aggctcgagg gggttaaggg 1500

cgaattctgc agatatccat cacactggcg gccgctcgag catgcatcta gagggcccaa 1560 ttcgccctat agtgagtcgt attacaattc actggccgtc gttttacaac gtcgtgactg 1620 ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg 1680 gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg 1740 cgaatggacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc 1800 gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt 1860 ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc 1920 cgatttagtg ctttacggca cctcgacccc aaaaaacttg attagggtga tggttcacgt 1980 agtgggccat cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt 2040 aatagtggac tcttgttcca aactggaaca acactcaacc ctatctcggt ctattctttt 2100 gatttataag ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa 2160 aaatttaacg cgaattttaa caaaattcag ggcgcaaggg ctgctaaagg aagcggaaca 2220 cgtagaaagc cagtccgcag aaacggtgct gaccccggat gaatgtcagc tactgggcta 2280 tctggacaag ggaaaacgca agcgcaaaga gaaagcaggt agcttgcagt gggcttacat 2340 ggcgatagct agactgggcg gttttatgga cagcaagcga accggaattg ccagctgggg 2400 cgccctctgg taaggttggg aagccctgca aagtaaactg gatggctttc ttgccgccaa 2460 ggatctgatg gcgcagggga tcaagatctg atcaagagac aggatgagga tcgtttcgca 2520 tgattgaaca agatggattg cacgcaggtt ctccggccgc ttgggtggag aggctattcg 2580 gctatgactg ggcacaacag acaatcggct gctctgatgc cgccgtgttc cggctgtcag 2640 cgcaggggcg cccggttctt tttgtcaaga ccgacctgtc cggtgccctg aatgaactgc 2700 aggacgaggc agcgcggcta tcgtggctgg ccacgacggg cgttccttgc gcagctgtgc 2760 tcgacgttgt cactgaagcg ggaagggact ggctgctatt gggcgaagtg ccggggcagg 2820 atctcctgtc atcccacctt gctcctgccg agaaagtatc catcatggct gatgcaatgc 2880 ggcggctgca tacgcttgat ccggctacct gcccattcga ccaccaagcg aaacatcgca 2940

tcgagcgagc acgtactcgg atggaagccg gtcttgtcga tcaggatgat ctggacgaag 3000 agcatcaggg gctcgcgcca gccgaactgt tcgccaggct caaggcgcgc atgcccgacg 3060 gcgaggatct cgtcgtgacc catggcgatg cctgcttgcc gaatatcatg gtggaaaatg 3120 gccgcttttc tggattcatc gactgtggcc ggctgggtgt ggcggaccgc tatcaggaca 3180 tagcgttggc tacccgtgat attgctgaag agcttggcgg cgaatgggct gaccgcttcc 3240 tcgtgcttta cggtatcgcc gctcccgatt cgcagcgcat cgccttctat cgccttcttg 3300 acgagttctt ctgaattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 3360 tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 3420 agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 3480 cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 3540 taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 3600 tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 3660 tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 3720 cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 3780 gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 3840 cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 3900 actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 3960 ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 4020 tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 4080 tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 4140 acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 4200 ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 4260 ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 4320 ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 4380 gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 4440

ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 4500

aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 4560

gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 4620

gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 4680

aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 4740

cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 4800

tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 4860

ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 4920

atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 4980

cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 5040

ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 5100

gcgcagcgag tcagtgagcg aggaagcgga ag 5132

<210> 9 <21 1> 7138 <212> DNA <213> Methylococcus capsulatus

<400> 9 ccggtaaacc agcaatagac ataagcggct atttaacgac cctgccctga accgacgacc 60

gggtcgaatt tgctttcgaa tttctgccat tcatccgctt attatcactt attcaggcgt 120

agcaccaggc gtttaagggc accaataact gccttaaaaa aattacgccc cgccctgcca 180

ctcatcgcag tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa 240

ttttaacaaa atattaacgc ttacaatttc cattcgccat tcaggctgcg caactgttgg 300

gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct 360

gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg taaaacgacg 420

gccagtgagc gcgcgtaata cgactcacta tagggcgaat tggagctcca ccgcggtggc 480

ggccgctcta gatgcatgct cgagcggccg ccagtgtgat ggatatctgc agaattcgcc 540

cttaagagct catccgaaga taagcgcttt ccgcgcagcc cgattctttc atggatcacg 600 attccattga atgcggcgaa agtctcaggg tccggtcatg aatgaagagt tatggcggcc 660 cagtacgtca ccgttatgtc cgatggctgt atcaaacaaa gacacgtgta gtgatatcgg 720 acaactcgtc catccccgtc ggagcattcg gataacgtgc tcatcgttcc aaaatattga 780 tatacggtat acgtatccga agaataaagt tggcacgatc cctgtaacta ggttgtcacg 840 acctcgtcgg aggttgtatg tccggtgttc cgtgacgtca tcgggcattc atcattcata 900 gaatgtgtta cggaggaaac aacatatgag agacaccatg aacgaaaagc attgctactc 960 cttactggcc gccggcctca tcgccgccgt gccgcaactc gcagccgccc atggcctgga 1020 cacgctggac cgggacggcg acggctccac ggccgacgcc gattgcaacg acttcgcgcc 1080 caccatccat ccgggcgccg ccgaagcgac gctggacggc gtggattcca actgcgacgg 1140 gcgcgactcc ggcgtggcgg aagtcgtcga gaccttcaag aatccgggca cctactccag 1200 cccggtcatc aacttcaaga tcgcttcgcc gccggggccg ggaacgccca tctacgggcc 1260 gccgcgtgat ttctccggtt acaacaagag ctactcgctg gcgatcggca agacctcgta 1320 ctacgatccg accaccggca ccaagtggaa cgacgacacc atcacgccgg tcagtgatgg 1380 tcaggacatc tggcgcggct ggacccatac cggcaagtgg tcgttcttca acggcaaggc 1440 cggcgacaag atcaccctca gcgtacagcg tgatgcgcag gaagccagcc tgaaaggcgc 1500 ccatccgggc ttcatcctgt tctggcggcc cgagggcggt ccgctgttct gggccggcac 1560 ccaggatctc gacgagggcc agaccgcgct gcccgccgac tccgacaccg ttatcggcca 1620 cgtgatcgtt cagcacgccg actggaccct gcagggcttg ccgcccaagg ccgaccatac 1680 cgcacccgcg ggcgtggata ccgagctcta tcccatgaag ccggacagct acaccatgta 1740 ctacgtcgac tccggctacg atgccgacaa gtacgtggca tcgaagaagc tcatcatgca 1800 ccccacggcg ttcaaagggc tggccctgaa cgacggcacc gccggggcgt tcaccaagtc 1860 catcaccctg ccgaagacgg gctattacat gctgtacgtc gccaacgtcc tggaagtgga 1920 cgactggagc gtcgacgcgg acggcaagct caccaccacc ggcgaagtct gggaagtgcc 1980

ggccaagggc tgctgggtca acatcacgat ctccaagccg taaatccggc ttgatgtgcg 2040 tgatccttcg gagcggccag tcgaccgttc cgcggggaaa acttccggag ccaatccccg 2100 tcctctgcgg cggggatttt tttattcggc ggaaccgagt tgggcgacgg ctttttccag 2160 cttcttcaac cggctccaga tttcgtgcag gctcgagggg cgaattccag cacactggcg 2220 gccgttacta gtggatccgg ctgctaacaa agcccgaaag gaagctgagt tggctgctgc 2280 caccgctgag caataactag cataacccct tggggcctct aaacgggtct tgaggggttt 2340 tttgctgaaa ggaggaacta tatccggata tcccgcaaga ggcccggcag taccggcata 2400 accaagccta tgcctacagc atccagggtg acggtgccga ggatgacgat gagcgcattg 2460 ttagatttca tacacggtgc ctgactgcgt tagcaattta actgtgataa actaccgcat 2520 taaagcttat cgataccgtc gacctcgagg gggggcccgg tacccagctt ttgttccctt 2580 tagtgagggt taattgcgcg cttggcgtaa tcatggtcat agctgtttcc tgtgtgaaat 2640 tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg taaagcctgg 2700 ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag 2760 tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 2820 ttgcgtattg ggcgcatgca taaaaactgt tgtaattcat taagcattct gccgacatgg 2880 aagccatcac aaacggcatg atgaacctga atcgccagcg gcatcagcac cttgtcgcct 2940 tgcgtataat atttgcccat gggggtgggc gaagaactcc agcatgagat ccccgcgctg 3000 gaggatcatc cagccggcgt cccggaaaac gattccgaag cccaaccttt catagaaggc 3060 ggcggtggaa tcgaaatctc gtgatggcag gttgggcgtc gcttggtcgg tcatttcgaa 3120 ccccagagtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 3180 tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 3240 tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 3300 ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 3360 tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 3420 agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 3480

gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 3540 gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 3600 gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 3660 tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 3720 agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 3780 ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 3840 ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 3900 cctgcgtgca atccatcttg ttcaatcatg cgaaacgatc ctcatcctgt ctcttgatca 3960 gatcttgatc ccctgcgcca tcagatcctt ggcggcaaga aagccatcca gtttactttg 4020 cagggcttcc caaccttacc agagggcgcc ccagctggca attccggttc gcttgctgtc 4080 cataaaaccg cccagtctag ctatcgccat gtaagcccac tgcaagctac ctgctttctc 4140 tttgcgcttg cgttttccct tgtccagata gcccagtagc tgacattcat cccaggtggc 4200 acttttcggg gaaatgtgcg cgcccgcgtt cctgctggcg ctgggcctgt ttctggcgct 4260 ggacttcccg ctgttccgtc agcagctttt cgcccacggc cttgatgatc gcggcggcct 4320 tggcctgcat atcccgattc aacggcccca gggcgtccag aacgggcttc aggcgctccc 4380 gaaggtctcg ggccgtctct tgggcttgat cggccttctt gcgcatctca cgcgctcctg 4440 cggcggcctg tagggcaggc tcatacccct gccgaaccgc ttttgtcagc cggtcggcca 4500 cggcttccgg cgtctcaacg cgctttgaga ttcccagctt ttcggccaat ccctgcggtg 4560 cataggcgcg tggctcgacc gcttgcgggc tgatggtgac gtggcccact ggtggccgct 4620 ccagggcctc gtagaacgcc tgaatgcgcg tgtgacgtgc cttgctgccc tcgatgcccc 4680 gttgcagccc tagatcggcc acagcggccg caaacgtggt ctggtcgcgg gtcatctgcg 4740 ctttgttgcc gatgaactcc ttggccgaca gcctgccgtc ctgcgtcagc ggcaccacga 4800 acgcggtcat gtgcgggctg gtttcgtcac ggtggatgct ggccgtcacg atgcgatccg 4860 ccccgtactt gtccgccagc cacttgtgcg ccttctcgaa gaacgccgcc tgctgttctt 4920

ggctggccga cttccaccat tccgggctgg ccgtcatgac gtactcgacc gccaacacag 4980 cgtccttgcg ccgcttctct ggcagcaact cgcgcagtcg gcccatcgct tcatcggtgc 5040 tgctggccgc ccagtgctcg ttctctggcg tcctgctggc gtcagcgttg ggcgtctcgc 5100 gctcgcggta ggcgtgcttg agactggccg ccacgttgcc cattttcgcc agcttcttgc 5160 atcgcatgat cgcgtatgcc gccatgcctg cccctccctt ttggtgtcca accggctcga 5220 cgggggcagc gcaaggcggt gcctccggcg ggccactcaa tgcttgagta tactcactag 5280 actttgcttc gcaaagtcgt gaccgcctac ggcggctgcg gcgccctacg ggcttgctct 5340 ccgggcttcg ccctgcgcgg tcgctgcgct cccttgccag cccgtggata tgtggacgat 5400 ggccgcgagc ggccaccggc tggctcgctt cgctcggccc gtggacaacc ctgctggaca 5460 agctgatgga caggctgcgc ctgcccacga gcttgaccac agggattgcc caccggctac 5520 ccagccttcg accacatacc caccggctcc aactgcgcgg cctgcggcct tgccccatca 5580 atttttttaa ttttctctgg ggaaaagcct ccggcctgcg gcctgcgcgc ttcgcttgcc 5640 ggttggacac caagtggaag gcgggtcaag gctcgcgcag cgaccgcgca gcggcttggc 5700 cttgacgcgc ctggaacgac ccaagcctat gcgagtgggg gcagtcgaag gcgaagcccg 5760 cccgcctgcc ccccgagcct cacggcggcg agtgcggggg ttccaagggg gcagcgccac 5820 cttgggcaag gccgaaggcc gcgcagtcga tcaacaagcc ccggaggggc cactttttgc 5880 cggaggggga gccgcgccga aggcgtgggg gaaccccgca ggggtgccct tctttgggca 5940 ccaaagaact agatataggg cgaaatgcga aagacttaaa aatcaacaac ttaaaaaagg 6000 ggggtacgca acagctcatt gcggcacccc ccgcaatagc tcattgcgta ggttaaagaa 6060 aatctgtaat tgactgccac ttttacgcaa cgcataattg ttgtcgcgct gccgaaaagt 6120 tgcagctgat tgcgcatggt gccgcaaccg tgcggcaccc taccgcatgg agataagcat 6180 ggccacgcag tccagagaaa tcggcattca agccaagaac aagcccggtc actgggtgca 6240 aacggaacgc aaagcgcatg aggcgtgggc cgggcttatt gcgaggaaac ccacggcggc 6300 aatgctgctg catcacctcg tggcgcagat gggccaccag aacgccgtgg tggtcagcca 6360 gaagacactt tccaagctca tcggacgttc tttgcggacg gtccaatacg cagtcaagga 6420

cttggtggcc gagcgctgga tctccgtcgt gaagctcaac ggccccggca ccgtgtcggc 6480

ctacgtggtc aatgaccgcg tggcgtgggg ccagccccgc gaccagttgc gcctgtcggt 6540

gttcagtgcc gccgtggtgg ttgatcacga cgaccaggac gaatcgctgt tggggcatgg 6600

cgacctgcgc cgcatcccga ccctgtatcc gggcgagcag caactaccga ccggccccgg 6660

cgaggagccg cccagccagc ccggcattcc gggcatggaa ccagacctgc cagccttgac 6720

cgaaacggag gaatgggaac ggcgcgggca gcagcgcctg ccgatgcccg atgagccgtg 6780

ttttctggac gatggcgagc cgttggagcc gccgacacgg gtcacgctgc cgcgccggta 6840

gcacttgggt tgcgcagcaa cccgtaagtg cgctgttcca gactatcggc tgtagccgcc 6900

tcgccgccct ataccttgtc tgcctccccg cgttgcgtcg cggtgcatgg agccgggcca 6960

cctcgacctg aatggaagcc ggcggcacct cgctaacgga ttcaccgttt ttatcaggct 7020

ctgggaggca gaataaatga tcatatcgtc aattattacc tccacgggga gagcctgagc 7080

aaactggcct caggcatttg agaagcacac ggtcacactg cttccggtag tcaataaa 7138

<210> 10

<211> 2940

<212> DNA <213> Methylococcus capsulatus

<400> 10 caaaatatta acgcttacaa tttccattcg ccattcaggc tgcgcaactg ttgggaaggg 60

cgatcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg 120

cgattaagtt gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt 180

gagcgcgcgt aatacgactc actatagggc gaattggagc tcttaattaa tttccgcgca 240

gcccgattct ttcatggatc acgattccat tgaatgcggc gaaatgtctc agggtccggt 300

cttgaatgaa gagttatggc ggcccagtac gtcaccgtta tgtccgatgg ctgtatcaaa 360

caaagacacg tgtagtgata tcggacaact cgtccatccc cgtcgggagc attcggataa 420

tcgtgtcatc gttccaaaat attgatatat cggtatacgt atccgaagaa taaagttggc 480

acgatccctg taactaggtt gtcacgacct cgtcggaggt tgtatgtccg gtgttccgtg 540

acgtcatcgg gcattcatca ttcatagaat gtgttacgga ggaaacaaca tatgagagac 600 accatgaacg aaaagcattg ctactcctta ctggccgccg gcctcatcgc cgccgtgccg 660 caactcgcag ccgcccatgg taagaaattg gctaaaccag cgaccacaaa ggccgttaat 720 ccccagcccc gtcgacgcaa caacaaccgt cggcgtggca tgagagcgga tgcaccttta 780 gctaaggcct cgactatcac gggatttgga cgtgggacca atgacgtcca tctcacgggt 840 atgtcgagaa tcgcccaagc ggttatccca gctggcaccg gcacggacgg atacatcgtg 900 gttgacgaaa ccatcgtccc cgagctcttg ccaagactgg gatttgctgc tagaatcttc 960 cagcgatacg ctgttgagac actggagttc gaaattcagc caatgtgccc cgcaaacacg 1020 ggcggtggtt acgtggctgg cttcctgcct gatccaactg acagcgacca caccttcgac 1080 gcaattcaag cgactcgcgg tgcggtcgtt gccaaatggt gggaaagcag aacaatccga 1140 ccccagcatg cccgcgcact cctctggacc tcggtcggga aggagcagcg tttgacatcc 1200 ccgggccggt tggtactcct gtgtgccggc aacaacactg acgtcgtcaa cgtgtcagtg 1260 ctgtgtcgct ggagtgtacg tctcagtgtt ccatctctcg agacacctga agatacattc 1320 gctccaatcc taaccctggg accactctac aacgactccc ttgcacccaa cgatttcaaa 1380 tcaatacttc ttggctctac ccagcttgac atcgcccctg acggagccgt ctattcatta 1440 gatcggccgc tgtccattga ctacagtctg ggcactggtg atgtcgaccg tgccgtttac 1500 tggcatgtga agaaagttgc tggcaatgcg ggaacacctg cggggtggtt ccactggggg 1560 ctatgggata atttcaacaa aacattcaca cagggcgctg cctactattc tgatgcgcag 1620 cctcgacaga tcttgctgcc agtgggcacg ctcaacaccc gagctgactc ccatggcctg 1680 gacacgctgg accgggacgg cgacggctcc acggccgacg ccgattgcaa cgacttcgcg 1740 cccaccatcc atccgggcgc cgccgaagcg acgctggacg gcgtggattc caactgcgac 1800 gggcgcgact ccggcgtggc ggaagtcgtc gagaccttca agaatccggg cacctactcc 1860 agcccggtca tcaacttcaa gatcgcttcg ccgccggggc cgggaacgcc catctacggg 1920 ccgccgcgtg atttctccgg ttacaacaag agctactcgc tggcgatcgg caagacctcg 1980

tactacgatc cgaccaccgg caccaagtgg aacgacgaca ccatcacgcc ggtcagtgat 2040

ggtcaggaca tctggcgcgg ctggacccat accggcaagt ggtcgttctt caacggcaag 2100

gccggcgaca agatcaccct cagcgtacag cgtgatgcgc aggaagccag cctgaaaggc 2160

gcccatccgg gcttcatcct gttctggcgg cccgagggcg gtccgctgtt ctgggccggc 2220

acccaggatc tcgacgaggg ccagaccgcg ctgcccgccg actccgacac cgttatcggc 2280

cacgtgatcg ttcagcacgc cgactggacc ctgcagggct tgccgcccaa ggccgaccat 2340

accgcacccg cgggcgtgga taccgagctc tatcccatga agccggacag ctacaccatg 2400

tactacgtcg actccggcta cgatgccgac aagtacgtgg catcgaagaa gctcatcatg 2460

caccccacgg cgttcaaagg gctggccctg aacgacggca ccgccggggc gttcaccaag 2520

tccatcaccc tgccgaagac gggctattac atgctgtacg tcgccaacgt cctggaagtg 2580

gacgactgga gcgtcgacgc ggacggcaag ctcaccacca ccggcgaagt ctgggaagtg 2640

ccggccaagg gctgctgggt caacatcacg atctccaagc cgtaaatccg gcttgatgtg 2700

cgtgatcctt cggagcggcc agtcgaccgt tccgcgggga aaacttccgg agccaatccc 2760

cgtcctctgc ggcggggatt tttttattcg gcggaaccga gttgggcgac ggctttttcc 2820

agcttcttca accggctcca gatttcgtgc aggctcgagg gggggcccgg tacccagctt 2880

ttgttccctt tagtgagggt taattgcgcg cttggcgtaa tcatggtcat agctgtttcc 2940

<210> 11 <211> 669

<212> PRT

<213> Methylococcus capsulatus

<400> 11

Lys Lys Leu Ala Lys Pro Ala Thr Thr Lys Ala VaI Asn Pro GIn Pro 1 5 10 15

Arg Arg Arg Asn Asn Asn Arg Arg Arg GIy Met Arg Ala Asp Ala Pro 20 25 30

Leu Ala Lys Ala Ser Thr He Thr GIy Phe GIy Arg GIy Thr Asn Asp

VaI His Leu Thr GIy Met Ser Arg lie Ala GIn Ala VaI lie Pro Ala 50 55 60

GIy Thr GIy Thr Asp GIy Tyr He VaI VaI Asp GIu Thr lie VaI Pro 65 70 75 80

GIu Leu Leu Pro Arg Leu GIy Phe Ala Ala Arg He Phe GIn Arg Tyr 85 90 95

Ala VaI GIu Thr Leu GIu Phe GIu He GIn Pro Met Cys Pro Ala Asn 100 105 . 110

Thr GIy GIy GIy Tyr VaI Ala GIy Phe Leu Pro Asp Pro Thr Asp Ser 115 120 125

Asp His Thr Phe Asp Ala He GIn Ala Thr Arg GIy Ala VaI VaI Ala 130 135 140

Lys Trp Trp GIu Ser Arg Thr He Arg Pro GIn His Ala Arg Ala Leu 145 150 155 160

Leu Trp Thr Ser VaI GIy Lys GIu GIn Arg Leu Thr Ser Pro GIy Arg 165 170 175

Leu VaI Leu Leu Cys Ala GIy Asn Asn Thr Asp VaI VaI Asn VaI Ser 180 185 190

VaI Leu Cys Arg Trp Ser VaI Arg Leu Ser VaI Pro Ser Leu GIu Thr 195 200 205

Pro GIu Asp Thr Phe Ala Pro He Leu Thr Leu GIy Pro Leu Tyr Asn 210 215 220

Asp Ser Leu Ala Pro Asn Asp Phe Lys Ser He Leu Leu GIy Ser Thr

225 230 235 240

GIn Leu Asp He Ala Pro Asp GIy Ala VaI Tyr Ser Leu Asp Arg Pro

245 250 255

Leu Ser lie Asp Tyr Ser Leu GIy Thr GIy Asp VaI Asp Arg Ala VaI 260 265 270

Tyr Trp His VaI Lys Lys VaI Ala GIy Asn Ala GIy Thr Pro Ala GIy 275 280 285

Trp Phe His Trp GIy Leu Trp Asp Asn Phe Asn Lys Thr Phe Thr GIn 290 295 300

GIy Ala Ala Tyr Tyr Ser Asp Ala GIn Pro Arg GIn He Leu Leu Pro 305 310 315 320

VaI GIy Thr Leu Phe Thr Arg Ala Asp Ser GIy Asn His GIy Leu Asp

325 330 335

Thr Leu Asp Arg Asp GIy Asp GIy Ser Thr Ala Asp Ala Asp Cys Asn 340 345 350

Asp Phe Ala Pro Thr lie His Pro GIy Ala Ala GIu Ala Thr Leu Asp 355 360 365

GIy VaI Asp Ser Asn Cys Asp GIy Arg Asp Ser GIy VaI Ala GIu VaI 370 375 380

VaI GIu Thr Phe Lys Asn Pro GIy Thr Tyr Ser Ser Pro VaI He Asn 385 390 395 400

Phe Lys He Ala Ser Pro Pro GIy Pro GIy Thr Pro He Tyr GIy Pro 405 410 415

Pro Arg Asp Phe Ser GIy Tyr Asn Lys Ser Tyr Ser Leu Ala He GIy 420 425 430

Lys Thr Ser Tyr Tyr Asp Pro Thr Thr GIy Thr Lys Trp Asn Asp Asp 435 440 445

Thr lie Thr Pro VaI Ser Asp GIy GIn Asp lie Trp Arg GIy Trp Thr 450 455 460

His Thr GIy Lys Trp Ser Phe Phe Asn GIy Lys Ala GIy Asp Lys He 465 470 475 480

Thr Leu Ser VaI GIn Arg Asp Ala GIn GIu Ala Ser Leu Lys GIy Ala 485 490 495

His Pro GIy Phe He Leu Phe Trp Arg Pro GIu GIy GIy Pro Leu Phe 500 505 510

Trp Ala GIy Thr GIn Asp Leu Asp GIu GIy GIn Thr Ala Leu Pro Ala

515 520 525

Asp Ser Asp Thr VaI He GIy His VaI He VaI GIn His Ala Asp Trp 530 535 540

Thr Leu GIn GIy Leu Pro Pro Lys Ala Asp His Thr Ala Pro Ala GIy

545 550 555 560

VaI Asp Thr GIu Leu Tyr Pro Met Lys Pro Asp Ser Tyr Thr Met Tyr 565 570 575

Tyr VaI Asp Ser GIy Tyr Asp Ala Asp Lys Tyr VaI Ala Ser Lys Lys 580 585 590

Leu He Met His Pro Thr Ala Phe Lys GIy Leu Ala Leu Asn Asp GIy 595 600 605

Thr Ala GIy Ala Phe Thr Lys Ser He Thr Leu Pro Lys Thr GIy Tyr 610 615 620

Tyr Met Leu Tyr VaI Ala Asn VaI Leu GIu VaI Asp Asp Trp Ser VaI 625 630 635 640

Asp Ala Asp GIy Lys Leu Thr Thr Thr GIy GIu VaI Trp GIu VaI Pro 645 650 655

Ala Lys GIy Cys Trp VaI Asn He Thr He Ser Lys Pro 660 665

<210> 12 <211> 2299 <212> DNA

<213> Methylococcus capsulatus

<400> 12 caaaatatta acgcttacaa tttccattcg ccattcaggc tgcgcaactg ttgggaaggg 60

cgatcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg 120

cgattaagtt gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt 180

gagcgcgcgt aatacgactc actatagggc gaattggagc tcttaattaa tttccgcgca 240

gcccgattct ttcatggatc acgattccat tgaatgcggc gaaatgtctc agggtccggt 300

cttgaatgaa gagttatggc ggcccagtac gtcaccgtta tgtccgatgg ctgtatcaaa 360

caaagacacg tgtagtgata tcggacaact cgtccatccc cgtcgggagc attcggataa 420

tcgtgtcatc gttccaaaat attgatatat cggtatacgt atccgaagaa taaagttggc 480

acgatccctg taactaggtt gtcacgacct cgtcggaggt tgtatgtccg gtgttccgtg 540

acgtcatcgg gcattcatca ttcatagaat gtgttacgga ggaaacaaca tatgagagac 600

accatgaacg aaaagcattg ctactcctta ctggccgccg gcctcatcgc cgccgtgccg 660

caactcgcag ccgcccatgg cctggacacg ctggaccggg acggcgacgg ctccacggcc 720

gacgccgatt gcaacgactt cgcgcccacc atccatccgg gcgccgccga agcgacgctg 780

gacggcgtgg attccaactg cgacgggcgc gactccggcg tggcggaagt cgtcgagacc 840

ttcaagaatc cgggcaccta ctccagcccg gtcatcaact tcaagatcgc ttcgccgccg 900

gggccgggaa cgcccatcta cgggccgccg cgtgatttct ccεettacaa caagagctac 960

tcgctggcga tcggcaagac ctcgtactac gatccgacca ccggcaccaa gtggaacgac 1020 gacaccatca cgccggtcag tgatggtcag gacatctggc gcggctggac ccataccggc 1080 aagtggtcgt tcttcaacgg caaggccggc gacaagatca ccctcagcgt acagcgtgat 1140 gcgcaggaag ccagcctgaa aggcgcccat ccgggcttca tcctgttctg gcggcccgag 1200 ggcggtccgc tgttctgggc cggcacccag gatctcgacg agggccagac cgcgctgccc 1260 gccgactccg acaccgttat cggccacgtg atcgttcagc acgccgactg gaccctgcag 1320 ggcttgccgc ccaaggccga ccataccgca cccgcgggcg tggataccga gctctatccc 1380 atgaagccgg acagctacac catgtactac gtcgactccg gctacgatgc cgacaagtac 1440 gtggcatcga agaagctcat catgcacccc acggcgttca aagggctggc cctgaacgac 1500 ggcaccgccg gggcgttcac caagtccatc accctgccga agacgggcta ttacatgctg 1560 tacgtcgcca acgtcctgga agtggacgac tggagcgtcg acgcggacgg caagctcacc 1620 accaccggcg aagtctggga agtgccggcc aagggctgct gggtcaacat cacgatctcc 1680 aagcctacat tcgctccaat cctaaccctg ggaccactct acaacgactc ccttgcaccc 1740 aacgatttca aatcaatact tcttggctct acccagcttg acatcgcccc tgacggagcc 1800 gtctattcat tagatcggcc gctgtccatt gactacagtc tgggcactgg tgatgtcgac 1860 cgtgccgttt actggcatgt gaagaaagtt gctggcaatg cgggaacacc tgcggggtgg 1920 ttccactggg ggctatggga taatttcaac aaaacattca cacagggcgc tgcctactat 1980 tctgatgcgc agcctcgaca gatcttgctg ccagtgggca cgctcaacac ccgagctgac 2040 tcatgaccgg cttgatgtgc gtgatccttc ggagcggcca gtcgaccgtt ccgcggggaa 2100 aacttccgga gccaatcccc gtcctctgcg gcggggattt ttttattcgg cggaaccgag 2160 ttgggcgacg gctttttcca gcttcttcaa ccggctccag atttcgtgca ggctcgaggg 2220 ggggcccggt acccagcttt tgttcccttt agtgagggtt aattgcgcgc ttggcgtaat 2280 catggtcata gctgtttcc 2299

<210> 13

<211> 457

<212> PRT

<213> Methylococcus capsulatus

<400> 13

His GIy Leu Asp Thr Leu Asp Arg Asp GIy Asp GIy Ser Thr Ala Asp 1 5 10 15

Ala Asp Cys Asn Asp Phe Ala Pro Thr He His Pro GIy Ala Ala GIu 20 25 30

Ala Thr Leu Asp GIy VaI Asp Ser Asn Cys Asp GIy Arg Asp Ser GIy 35 40 45

VaI Ala GIu VaI VaI GIu Thr Phe Lys Asn Pro GIy Thr Tyr Ser Ser 50 55 60

Pro VaI He Asn Phe Lys lie Ala Ser Pro Pro GIy Pro GIy Thr Pro 65 70 75 80

He Tyr GIy Pro Pro Arg Asp Phe Ser GIy Tyr Asn Lys Ser Tyr Ser 85 90 95

Leu Ala He GIy Lys Thr Ser Tyr Tyr Asp Pro Thr Thr GIy Thr Lys 100 105 110

Tip Asn Asp Asp Thr He Thr Pro VaI Ser Asp GIy GIn Asp He Tip 115 120 125

Arg GIy Trp Thr His Thr GIy Lys Tip Ser Phe Phe Asn GIy Lys Ala 130 135 140

GIy Asp Lys He Thr Leu Ser VaI GIn Arg Asp Ala GIn GIu Ala Ser 145 150 155 160

Leu Lys GIy Ala His Pro GIy Phe He Leu Phe Trp Arg Pro GIu GIy 165 170 175

GIy Pro Leu Phe Tip Ala GIy Thr GIn Asp Leu Asp GIu GIy GIn Thr 180 185 190

Ala Leu Pro Ala Asp Ser Asp Thr VaI He GIy His VaI He VaI GIn 195 200 205

His Ala Asp Trp Thr Leu GIn GIy Leu Pro Pro Lys Ala Asp His Thr 210 215 220

Ala Pro Ala GIy VaI Asp Thr GIu Leu Tyr Pro Met Lys Pro Asp Ser

225 230 235 240

Tyr Thr Met Tyr Tyr VaI Asp Ser GIy Tyr Asp Ala Asp Lys Tyr VaI

245 250 255

Ala Ser Lys Lys Leu He Met His Pro Thr Ala Phe Lys GIy Leu Ala 260 265 270

Leu Asn Asp GIy Thr Ala GIy Ala Phe Thr Lys Ser He Thr Leu Pro 275 280 285

Lys Thr GIy Tyr Tyr Met Leu Tyr VaI Ala Asn VaI Leu GIu VaI Asp 290 295 300

Asp Trp Ser VaI Asp Ala Asp GIy Lys Leu Thr Thr Thr GIy GIu VaI 305 310 315 320

Trp GIu VaI Pro Ala Lys GIy Cys Trp VaI Asn He Thr He Ser Lys

325 330 335

Pro Phe Ala Pro He Leu Thr Leu GIy Pro Leu Tyr Asn Asp Ser Leu 340 345 350

Ala Pro Asn Asp Phe Lys Ser He Leu Leu GIy Ser Thr GIn Leu Asp 355 360 365

He Ala Pro Asp GIy Ala VaI Tyr Ser Leu Asp Arg Pro Leu Ser He 370 375 380

Asp Tyr Ser Leu GIy Thr GIy Asp VaI Asp Arg Ala VaI Tyr Trp His 385 390 395 400

VaI Lys Lys VaI Ala GIy Asn Ala GIy Thr Pro Ala GIy Trp Phe His 405 410 415

Trp GIy Leu Trp Asp Asn Phe Asn Lys Thr Phe Thr GIn GIy Ala Ala 420 425 430

Tyr Tyr Ser Asp Ala GIn Pro Arg GIn He Leu Leu Pro VaI GIy Thr 435 440 445

Leu Phe Thr Arg Ala Asp Ser GIy Asn 450 455

<210> 14

<211> 2010 <212> DNA <213> Methylococcus capsulatus

<400> 14 caaaatatta acgcttacaa tttccattcg ccattcaggc tgcgcaactg ttgggaaggg 60

cgatcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg 120

cgattaagtt gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt 180

gagcgcgcgt aatacgactc actatagggc gaattggagc tcttaattaa tttccgcgca 240

gcccgattct ttcatggatc acgattccat tgaatgcggc gaaatgtctc agggtccggt 300

cttgaatgaa gagttatggc ggcccagtac gtcaccgtta tgtccgatgg ctgtatcaaa 360

caaagacacg tgtagtgata tcggacaact cgtccatccc cgtcgggagc attcggataa 420

tcgtgtcatc gttccaaaat attgatatat cggtatacgt atccgaagaa taaagttggc 480

acgatccctg taactaggtt gtcacgacct cgtcggaggt tgtatgtccg gtgttccgtg 540

acgtcatcgg gcattcatca ttcatagaat gtgttacgga ggaaacaaca tatgagagac 600

accatgaacg aaaagcattg ctactcctta ctggccgccg gcctcatcgc cgccgtgccg 660 caactcgcag ccgcccatgg cctggacacg ctggaccggg acggcgacgg ctccacggcc 720 gacgccgatt gcaacgactt cgcgcccacc atccatccgg gcgccgccga agcgacgctg 780 gacggcgtgg attccaactg cgacgggcgc gactccggcg tggcggaagt cgtcgagacc 840 ttcaagaatc cgggcaccta ctccagcccg gtcatcaact tcaagatcgc ttcgccgccg 900 gggccgggaa cgcccatcta cgggccgccg cgtgatttct ccggttacaa caagagctac 960 tcgctggcga tcggcaagac ctcgtactac gatccgacca ccggcaccaa gtggaacgac 1020 gacaccatca cgccggtcag tgatggtcag gacatctggc gcggctggac ccataccggc 1080 aagtggtcgt tcttcaacgg caaggccggc gacaagatca ccctcagcgt acagcgtgat 1140 gcgcaggaag ccagcctgaa aggcgcccat ccgggcttca tcctgttctg gcggcccgag 1200 ggcggtccgc tgttctgggc cggcacccag gatctcgacg agggccagac cgcgctgccc 1260 gccgactccg acaccgttat cggccacgtg atcgttcagc acgccgactg gaccctgcag 1320 ggcttgccgc ccaaggccga ccataccgca cccgcgggcg tggctagctc attagatcgg 1380 ccgctgtcca ttgactacag tctgggcact ggtgatgtcg accgtgccgc tagcgagctc 1440 tatcccatga agccggacag ctacaccatg tactacgtcg actccggcta cgatgccgac 1500 aagtacgtgg catcgaagaa gctcatcatg caccccacgg cgttcaaagg gctggccctg 1560 aacgacggca ccgccggggc gttcaccaag tccatcaccc tgccgaagac gggctattac 1620 atgctgtacg tcgccaacgt cctggaagtg gacgactgga gcgtcgacgc ggacggcaag 1680 ctcaccacca ccggcgaagt ctgggaagtg ccggccaagg gctgctgggt caacatcacg 1740 atctccaagc cgtaaatccg gcttgatgtg cgtgatcctt cggagcggcc agtcgaccgt 1800 tccgcgggga aaacttccgg agccaatccc cgtcctctgc ggcggggatt tttttattcg 1860 gcggaaccga gttgggcgac ggctttttcc agcttcttca accggctcca gatttcgtgc 1920 aggctcgagg gggggcccgg tacccagctt ttgttccctt tagtgagggt taattgcgcg 1980 cttggcgtaa tcatggtcat agctgtttcc 2010

<210> 15 <211> 359 <212> PRT <213> Methylococcus capsulatus

<400> 15

His GIy Leu Asp Thr Leu Asp Arg Asp GIy Asp GIy Ser Thr Ala Asp 1 5 10 15

Ala Asp Cys Asn Asp Phe Ala Pro Thr He His Pro GIy Ala Ala GIu 20 25 30

Ala Thr Leu Asp GIy VaI Asp Ser Asn Cys Asp GIy Arg Asp Ser GIy 35 40 45

VaI Ala GIu VaI VaI GIu Thr Phe Lys Asn Pro GIy Thr Tyr Ser Ser 50 55 60

Pro VaI He Asn Phe Lys He Ala Ser Pro Pro GIy Pro GIy Thr Pro 65 70 75 80

He Tyr GIy Pro Pro Arg Asp Phe Ser GIy Tyr Asn Lys Ser Tyr Ser 85 90 95

Leu Ala He GIy Lys Thr Ser Tyr Tyr Asp Pro Thr Thr GIy Thr Lys 100 105 110

Trp Asn Asp Asp Thr He Thr Pro VaI Ser Asp GIy GIn Asp He Trp 115 120 125

Arg GIy Trp Thr His Thr GIy Lys Trp Ser Phe Phe Asn GIy Lys Ala 130 135 140

GIy Asp Lys He Thr Leu Ser VaI GIn Arg Asp Ala GIn GIu Ala Ser 145 150 155 160

Leu Lys GIy Ala His Pro GIy Phe He Leu Phe Trp Arg Pro GIu GIy

165 170 175

GIy Pro Leu Phe Trp Ala GIy Thr GIn Asp Leu Asp GIu GIy GIn Thr 180 185 190

Ala Leu Pro Ala Asp Ser Asp Thr VaI He GIy His VaI He VaI GIn 195 200 205

His Ala Asp Trp Thr Leu GIn GIy Leu Pro Pro Lys Ala Asp His Thr 210 215 220

Ala Pro Ala GIy VaI Ala Ser Ser Leu Asp Arg Pro Leu Ser He Asp

225 230 235 240

Tyr Ser Leu GIy Thr GIy Asp VaI Asp Arg Ala Ala Ser GIu Leu Tyr

245 250 255

Pro Met Lys Pro Asp Ser Tyr Thr Met Tyr Tyr VaI Asp Ser GIy Tyr 260 265 270

Asp Ala Asp Lys Tyr VaI Ala Ser Lys Lys Leu He Met His Pro Thr 275 280 285

Ala Phe Lys GIy Leu Ala Leu Asn Asp GIy Thr Ala GIy Ala Phe Thr 290 295 300

Lys Ser He Thr Leu Pro Lys Thr GIy Tyr Tyr Met Leu Tyr VaI Ala 305 310 315 320

Asn VaI Leu GIu VaI Asp Asp Trp Ser VaI Asp Ala Asp GIy Lys Leu 325 330 335

Thr Thr Thr GIy GIu VaI Trp GIu VaI Pro Ala Lys GIy Cys Trp VaI 340 345 350

Asn He Thr He Ser Lys Pro

355

<210> 16 <211> 1954 <212> DNA

<213> Methylococcus capsulatus

caaaatatta acgcttacaa tttccattcg ccattcaggc tgcgcaactg ttgggaaggg 60

cgatcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg 120

cgattaagtt gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt 180

gagcgcgcgt aatacgactc actatagggc gaattggagc tcttaattaa tttccgcgca 240

gcccgattct ttcatggatc acgattccat tgaatgcggc gaaatgtctc agggtccggt 300

cttgaatgaa gagttatggc ggcccagtac gtcaccgtta tgtccgatgg ctgtatcaaa 360

caaagacacg tgtagtgata tcggacaact cgtccatccc cgtcgggagc attcggataa 420

tcgtgtcatc gttccaaaat attgatatat cggtatacgt atccgaagaa taaagttggc 480

acgatccctg taactaggtt gtcacgacct cgtcggaggt tgtatgtccg gtgttccgtg 540

acgtcatcgg gcattcatca ttcatagaat gtgttacgga ggaaacaaca tatgagagac 600

accatgaacg aaaagcattg ctactcctta ctggccgccg gcctcatcgc cgccgtgccg 660

caactcgcag ccgcccatgg cctggacacg ctggaccggg acggcgacgg ctccacggcc 720

gacgccgatt gcaacgactt cgcgcccacc atccatccgg gcgccgccga agcgacgctg 780

gacggcgtgg attccaactg cgacgggcgc gactccggcg tggcggaagt cgtcgagacc 840

ttcaagaatc cgggcaccta ctccagcccg gtcatcaact tcaagatcgc ttcgccgccg 900

gggccgggaa cgcccatcta cgggccgccg cgtgatttct ccggttacaa caagagctac 960

tcgctggcga tcggcaagac ctcgtactac gatccgacca ccggcaccaa gtggaacgac 1020

gacaccatca cgccggtcag tgatggtcag gacatctggc gcggctggac ccataccggc 1080

aagtggtcgt tcttcaacgg caaggccggc gacaagatca ccctcagcgt acagcgtgat 1140

gcgcaggaag ccagcctgaa aggcgcccat ccgggcttca tcctgttctg gcggcccgag 1200

ggcggtccgc tgttctgggc cggcacccag gatctcgacg agggccagac cgcgctgccc 1260

gccgactccg acaccgttat cggccacgtg atcgttcagc acgccgactg gaccctgcag 1320

ggcttgccgc ccaaggccga ccataccgca cccgcgggcg tggataccga gctctatccc 1380

atgaagccgg acagctacac catgtactac gtcgactccg gctacgatgc cgacaagtac 1440

gtggcatcga agaagctcat catgcacccc acggcgttca aagggctggc cctgaacgac 1500

ggcaccgccg gggcgttcac caagtccatc accctgccga agacgggcta ttacatgctg 1560

tacgtcgcca acgtcctgga agtggacgac tggagcgtcg acgcggacgg caagctcacc 1620

accaccggcg aagtctggga agtgccggcc aagggctgct gggtcaacat cacgatctcc 1680

aagcctcatg accggcttga tgtgcgtgat ccttcggagc ggccagtcga ccgttccgcg 1740

gggaaaactt ccggagccaa tccccgtcct ctgcggcggg gattttttta ttcggcggaa 1800

ccgagttggg cgacggcttt ttccagcttc ttcaaccggc tccagatttc gtgcaggctc 1860

gagggggggc ccggtaccca gcttttgttc cctttagtga gggttaattg cgcgcttggc 1920

gtaatcatgg tcatagctgt ttcctgtgtg aaat 1954

<210> 17

<211> 1944

<212> DNA <213> Methylococcus capsulatus

<400> 17 caaaatatta acgcttacaa tttccattcg ccattcaggc tgcgcaactg ttgggaaggg 60

cgatcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg 120

cgattaagtt gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt 180

gagcgcgcgt aatacgactc actatagggc gaattggagc tcttaattaa tttccgcgca 240

gcccgattct ttcatggatc acgattccat tgaatgcggc gaaatgtctc agggtccggt 300

cttgaatgaa gagttatggc ggcccagtac gtcaccgtta tgtccgatgg ctgtatcaaa 360

caaagacacg tgtagtgata tcggacaact cgtccatccc cgtcgggagc attcggataa 420

tcgtgtcatc gttccaaaat attgatatat cggtatacgt atccgaagaa taaagttggc 480

acgatccctg taactaggtt gtcacgacct cgtcggaggt tgtatgtccg gtgttccgtg 540

acgtcatcgg gcattcatca ttcatagaat gtgttacgga ggaaacaaca tatgagagac 600 accatgaacg aaaagcattg ctactcctta ctggccgccg gcctcatcgc cgccgtgccg 660 caactcgcag ccgcccatgg cctggacacg ctggaccggg acggcgacgg ctccacggcc 720 gacgccgatt gcaacgactt cgcgcccacc atccatccgg gcgccgccga agcgacgctg 780 gacggcgtgg attccaactg cgacgggcgc gactccggcg tggcggaagt cgtcgagacc 840 ttcaagaatc cgggcaccta ctccagcccg gtcatcaact tcaagatcgc ttcgccgccg 900 gggccgggaa cgcccatcta cgggccgccg cgtgatttct ccggttacaa caagagctac 960 tcgctggcga tcggcaagac ctcgtactac gatccgacca ccggcaccaa gtggaacgac 1020 gacaccatca cgccggtcag tgatggtcag gacatctggc gcggctggac ccataccggc 1080 aagtggtcgt tcttcaacgg caaggccggc gacaagatca ccctcagcgt acagcgtgat 1140 gcgcaggaag ccagcctgaa aggcgcccat ccgggcttca tcctgttctg gcggcccgag 1200 ggcggtccgc tgttctgggc cggcacccag gatctcgacg agggccagac cgcgctgccc 1260 gccgactccg acaccgttat cggccacgtg atcgttcagc acgccgactg gaccctgcag 1320 ggcttgccgc ccaaggccga ccataccgca cccgcgggcg tggctagcga gctctatccc 1380 atgaagccgg acagctacac catgtactac gtcgactccg gctacgatgc cgacaagtac 1440 gtggcatcga agaagctcat catgcacccc acggcgttca aagggctggc cctgaacgac 1500 ggcaccgccg gggcgttcac caagtccatc accctgccga agacgggcta ttacatgctg 1560 tacgtcgcca acgtcctgga agtggacgac tggagcgtcg acgcggacgg caagctcacc 1620 accaccggcg aagtctggga agtgccggcc aagggctgct gggtcaacat cacgatctcc 1680 aagccgtaaa tccggcttga tgtgcgtgat ccttcggagc ggccagtcga ccgttccgcg 1740 gggaaaactt ccggagccaa tccccgtcct ctgcggcggg gattttttta ttcggcggaa 1800 ccgagttggg cgacggcttt ttccagcttc ttcaaccggc tccagatttc gtgcaggctc 1860 gagggggggc ccggtaccca gcttttgttc cctttagtga gggttaattg cgcgcttggc 1920 gtaatcatgg tcatagctgt ttcc 1944

<210> 18

<211> 1000 <212> DNA <213> Methylococcus capsulatus

<400> 18 taagaaattg gctaaaccag cgaccacaaa ggccgttaat ccccagcccc gtcgacgcaa 60

caacaaccgt cggcgtggca tgagagcgga tgcaccttta gctaaggcct cgactatcac 120

gggatttgga cgtgggacca atgacgtcca tctcacgggt atgtcgagaa tcgcccaagc 180

ggttatccca gctggcaccg gcacggacgg atacatcgtg gttgacgaaa ccatcgtccc 240

cgagctcttg ccaagactgg gatttgctgc tagaatcttc cagcgatacg ctgttgagac 300

actggagttc gaaattcagc caatgtgccc cgcaaacacg ggcggtggtt acgtggctgg 360

cttcctgcct gatccaactg acagcgacca caccttcgac gcaattcaag cgactcgcgg 420

tgcggtcgtt gccaaatggt gggaaagcag aacaatccga ccccagcatg cccgcgcact 480

cctctggacc tcggtcggga aggagcagcg tttgacatcc ccgggccggt tggtactcct 540

gtgtgccggc aacaacactg acgtcgtcaa cgtgtcagtg ctgtgtcgct ggagtgtacg 600

tctcagtgtt ccatctctcg agacacctga agatacattc gctccaatcc taaccctggg 660

accactctac aacgactccc ttgcacccaa cgatttcaaa tcaatacttc ttggctctac 720

ccagcttgac atcgcccctg acggagccgt ctattcatta gatcggccgc tgtccattga 780

ctacagtctg ggcactggtg atgtcgaccg tgccgtttac tggcatgtga agaaagttgc 840

tggcaatgcg ggaacacctg cggggtggtt ccactggggg ctatgggata atttcaacaa 900

aacattcaca cagggcgctg cctactattc tgatgcgcag cctcgacaga tcttgctgcc 960

agtgggcacg ctcttcaccc gagctgactc gggaaactaa 1000

<210> 19 <211> 6700 <212> DNA

<213> Methylococcus capsulatus

<400> 19 ctcgggccgt ctcttgggct tgatcggcct tcttgcgcat ctcacgcgct cctgcggcgg 60

cctgtagggc aggctcatac ccctgccgaa ccgcttttgt cagccggtcg gccacggctt 120 ccggcgtctc aacgcgcttt gagattccca gcttttcggc caatccctgc ggtgcatagg 180 cgcgtggctc gaccgcttgc gggctgatgg tgacgtggcc cactggtggc cgctccaggg 240 cctcgtagaa cgcctgaatg cgcgtgtgac gtgccttgct gccctcgatg ccccgttgca 300 gccctagatc ggccacagcg gccgcaaacg tggtctggtc gcgggtcatc tgcgctttgt 360 tgccgatgaa ctccttggcc gacagcctgc cgtcctgcgt cagcggcacc acgaacgcgg 420 tcatgtgcgg gctggtttcg tcacggtgga tgctggccgt cacgatgcga tccgccccgt 480 acttgtccgc cagccacttg tgcgccttct cgaagaacgc cgcctgctgt tcttggctgg 540 ccgacttcca ccattccggg ctggccgtca tgacgtactc gaccgccaac acagcgtcct 600 tgcgccgctt ctctggcagc aactcgcgca gtcggcccat cgcttcatcg gtgctgctgg 660 ccgcccagtg ctcgttctct ggcgtcctgc tggcgtcagc gttgggcgtc tcgcgctcgc 720 ggtaggcgtg cttgagactg gccgccacgt tgcccatttt cgccagcttc ttgcatcgca 780 tgatcgcgta tgccgccatg cctgcccctc ccttttggtg tccaaccggc tcgacggggg 840 cagcgcaagg cggtgcctcc ggcgggccac tcaatgcttg agtatactca ctagactttg 900 cttcgcaaag tcgtgaccgc ctacggcggc tgcggcgccc tacgggcttg ctctccgggc 960 ttcgccctgc gcggtcgctg cgctcccttg ccagcccgtg gatatgtgga cgatggccgc 1020 gagcggccac cggctggctc gcttcgctcg gcccgtggac aaccctgctg gacaagctga 1080 tggacaggct gcgcctgccc acgagcttga ccacagggat tgcccaccgg ctacccagcc 1140 ttcgaccaca tacccaccgg ctccaactgc gcggcctgcg gccttgcccc atcaattttt 1200 ttaattttct ctggggaaaa gcctccggcc tgcggcctgc gcgcttcgct tgccggttgg 1260 acaccaagtg gaaggcgggt caaggctcgc gcagcgaccg cgcagcggct tggccttgac 1320 gcgcctggaa cgacccaagc ctatgcgagt gggggcagtc gaaggcgaag cccgcccgcc 1380 tgccccccga gcctcacggc ggcgagtgcg ggggttccaa gggggcagcg ccaccttggg 1440 caaggccgaa ggccgcgcag tcgatcaaca agccccggag gggccacttt ttgccggagg 1500

gggagccgcg ccgaaggcgt gggggaaccc cgcaggggtg cccttctttg ggcaccaaag 1560 aactagatat agggcgaaat gcgaaagact taaaaatcaa caacttaaaa aaggggggta 1620 cgcaacagct cattgcggca ccccccgcaa tagctcattg cgtaggttaa agaaaatctg 1680 taattgactg ccacttttac gcaacgcata attgttgtcg cgctgccgaa aagttgcagc 1740 tgattgcgca tggtgccgca accgtgcggc accctaccgc atggagataa gcatggccac 1800 gcagtccaga gaaatcggca ttcaagccaa gaacaagccc ggtcactggg tgcaaacgga 1860 acgcaaagcg catgaggcgt gggccgggct tattgcgagg aaacccacgg cggcaatgct 1920 gctgcatcac ctcgtggcgc agatgggcca ccagaacgcc gtggtggtca gccagaagac 1980 actttccaag ctcatcggac gttctttgcg gacggtccaa tacgcagtca aggacttggt 2040 ggccgagcgc tggatctccg tcgtgaagct caacggcccc ggcaccgtgt cggcctacgt 2100 ggtcaatgac cgcgtggcgt ggggccagcc ccgcgaccag ttgcgcctgt cggtgttcag 2160 tgccgccgtg gtggttgatc acgacgacca ggacgaatcg ctgttggggc atggcgacct 2220 gcgccgcatc ccgaccctgt atccgggcga gcagcaacta ccgaccggcc ccggcgagga 2280 gccgcccagc cagcccggca ttccgggcat ggaaccagac ctgccagcct tgaccgaaac 2340 ggaggaatgg gaacggcgcg ggcagcagcg cctgccgatg cccgatgagc cgtgttttct 2400 ggacgatggc gagccgttgg agccgccgac acgggtcacg ctgccgcgcc ggtagcactt 2460 gggttgcgca gcaacccgta agtgcgctgt tccagactat cggctgtagc cgcctcgccg 2520 ccctatacct tgtctgcctc cccgcgttgc gtcgcggtgc atggagccgg gccacctcga 2580 cctgaatgga agccggcggc acctcgctaa cggattcacc gtttttatca ggctctggga 2640 ggcagaataa atgatcatat cgtcaattat tacctccacg gggagagcct gagcaaactg 2700 gcctcaggca tttgagaagc acacggtcac actgcttccg gtagtcaata aaccggtaaa 2760 ccagcaatag acataagcgg ctatttaacg accctgccct gaaccgacga ccgggtcgaa 2820 tttgctttcg aatttctgcc attcatccgc ttattatcac ttattcaggc gtagcaacca 2880 ggcgtttaag ggcaccaata actgccttaa aaaaattacg ccccgccctg ccactcatcg 2940 cagtacggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattttaa 3000

caaaatatta acgcttacaa tttccattcg ccattcaggc tgcgcaactg ttgggaaggg 3060 cgatcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg 3120 cgattaagtt gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt 3180 gagcgcgcgt aatacgactc actatagggc gaattggagc tcttaattaa tttccgcgca 3240 gcccgattct ttcatggatc acgattccat tgaatgcggc gaaatgtctc agggtccggt 3300 cttgaatgaa gagttatggc ggcccagtac gtcaccgtta tgtccgatgg ctgtatcaaa 3360 caaagacacg tgtagtgata tcggacaact cgtccatccc cgtcgggagc attcggataa 3420 tcgtgtcatc gttccaaaat attgatatat cggtatacgt atccgaagaa taaagttggc 3480 acgatccctg taactaggtt gtcacgacct cgtcggaggt tgtatgtccg gtgttccgtg 3540 acgtcatcgg gcattcatca ttcatagaat gtgttacgga ggaaacaaca tatgagagac 3600 accatgaacg aaaagcattg ctactcctta ctggccgccg gcctcatcgc cgccgtgccg 3660 caactcgcag ccgcccatgg cctggacacg ctggaccggg acggcgacgg ctccacggcc 3720 gacgccgatt gcaacgactt cgcgcccacc atccatccgg gcgccgccga agcgacgctg 3780 gacggcgtgg attccaactg cgacgggcgc gactccggcg tggcggaagt cgtcgagacc 3840 ttcaagaatc cgggcaccta ctccagcccg gtcatcaact tcaagatcgc ttcgccgccg 3900 gggccgggaa cgcccatcta cgggccgccg cgtgatttct ccggttacaa caagagctac 3960 tcgctggcga tcggcaagac ctcgtactac gatccgacca ccggcaccaa gtggaacgac 4020 gacaccatca cgccggtcag tgatggtcag gacatctggc gcggctggac ccataccggc 4080 aagtggtcgt tcttcaacgg caaggccggc gacaagatca ccctcagcgt acagcgtgat 4140 gcgcaggaag ccagcctgaa aggcgcccat ccgggcttca tcctgttctg gcggcccgag 4200 ggcggtccgc tgttctgggc cggcacccag gatctcgacg agggccagac cgcgctgccc 4260 gccgactccg acaccgttat cggccacgtg atcgttcagc acgccgactg gaccctgcag 4320 ggcttgccgc ccaaggccga ccataccgca cccgcgggcg tggataccga gctctatccc 4380 atgaagccgg acagctacac catgtactac gtcgactccg gctacgatgc cgacaagtac 4440

gtggcatcga agaagctcat catgcacccc acggcgttca aagggctggc cctgaacgac 4500 ggcaccgccg gggcgttcac caagtccatc accctgccga agacgggcta ttacatgctg 4560 tacgtcgcca acgtcctgga agtggacgac tggagcgtcg acgcggacgg caagctcacc 4620 accaccggcg aagtctggga agtgccggcc aagggctgct gggtcaacat cacgatctcc 4680 aagccgtaaa tccggcttga tgtgcgtgat ccttcggagc ggccagtcga ccgttccgcg 4740 gggaaaactt ccggagccaa tccccgtcct ctgcggcggg gattttttta ttcggcggaa 4800 ccgagttggg cgacggcttt ttccagcttc ttcaaccggc tccagatttc gtgcaggctc 4860 gagggggggc ccggtaccca gcttttgttc cctttagtga gggttaattg cgcgcttggc 4920 gtaatcatgg tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa 4980 catacgagcc ggaagcataa agtgtaaagc ctggggtgcc taatgagtga gctaactcac 5040 attaattgcg ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca 5100 ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt attgggcgca tgcataaaaa 5160 ctgttgtaat tcattaagca ttctgccgac atggaagcca tcacaaacgg catgatgaac 5220 ctgaatcgcc agcggcatca gcaccttgtc gccttgcgta taatatttgc ccttgggggt 5280 gggcgaagaa ctccagcatg agatccccgc gctggaggat catccagccg gcgtcccgga 5340 aaacgattcc gaagcccaac ctttcataga aggcggcggt ggaatcgaaa tctcgtgatg 5400 gcaggttggg cgtcgcttgg tcggtcattt cgaaccccag agtcccgctc agaagaactc 5460 gtcaagaagg cgatagaagg cgatgcgctg cgaatcggga gcggcgatac cgtaaagcac 5520 gaggaagcgg tcagcccatt cgccgccaag ctcttcagca atatcacggg tagccaacgc 5580 tatgtcctga tagcggtccg ccacacccag ccggccacag tcgatgaatc cagaaaagcg 5640 gccattttcc accatgatat tcggcaagca ggcatcgcca tgggtcacga cgagatcctc 5700 gccgtcgggc atgcgcgcct tgagcctggc gaacagttcg gctggcgcga gcccctgatg 5760 ctcttcgtcc agatcatcct gatcgacaag accggcttcc atccgagtac gtgctcgctc 5820 gatgcgatgt ttcgcttggt ggtcgaatgg gcaggtagcc ggatcaagcg tatgcagccg 5880 ccgcattgca tcagccatga tggatacttt ctcggcagga gcaaggtgag atgacaggag 5940

atcctgcccc ggcacttcgc ccaatagcag ccagtccctt cccgcttcag tgacaacgtc 6000

gagcacagct gcgcaaggaa cgcccgtcgt ggccagccac gatagccgcg ctgcctcgtc 6060

ctgcagttca ttcagggcac cggacaggtc ggtcttgaca aaaagaaccg ggcgcccctg 6120

cgctgacagc cggaacacgg cggcatcaga gcagccgatt gtctgttgtg cccagtcata 6180

gccgaatagc ctctccaccc aagcggccgg agaacctgcg tgcaatccat cttgttcaat 6240

catgcgaaac gatcctcatc ctgtctcttg atcagatctt gatcccctgc gccatcagat 6300

ccttggcggc aagaaagcca tccagtttac tttgcagggc ttcccaacct taccagaggg 6360

cgccccagct ggcaattccg gttcgcttgc tgtccataaa accgcccagt ctagctatcg 6420

ccatgtaagc ccactgcaag ctacctgctt tctctttgcg cttgcgtttt cccttgtcca 6480

gatagcccag tagctgacat tcatcccagg tggcactttt cggggaaatg tgcgcgcccg 6540

cgttcctgct ggcgctgggc ctgtttctgg cgctggactt cccgctgttc cgtcagcagc 6600

ttttcgccca cggccttgat gatcgcggcg gccttggcct gcatatcccg attcaacggc 6660

cccagggcgt ccagaacggg cttcaggcgc tcccgaaggt 6700

<210> 20 <211> 33

<212> DNA

<213> Methylococcus capsulatus

<400> 20 gtggagccgt tgccgttccg gttcagcgtg tec 33

<210> 21 <211> 21 <212> DNA

<213> Methylococcus capsulatus

<400> 21 tggcggtgat ctcgagcctg c 21

<210> 22 <211> 20 <212> DNA

<213> Methylococcus capsulatus

<400> 22 agtgcctcca tgggcggctg 20

<210> 23 <211> 23 <212> DNA <213> Methylococcus capsulatus

<400> 23 cagcgaactc ccatggcctg gac 23

<210> 24

<211> 38

<212> DNA

<213> Methylococcus capsulatus

<400> 24 gcaaaccatg gtaagaaatt ggctaaacca gcgaccac 38

<210> 25 <211> 30 <212> DNA <213> Methylococcus capsulatus

<400> 25 ttagtccatg gagtcagctc gggtgttgag 30

<210> 26 <211> 27

<212> DNA

<213> Methylococcus capsulatus

<400> 26 gccatgggag tcagctcggg tgttgag 27

<210> 27 <211> 27 <212> DNA

<213> Methylococcus capsulatus

<400> 27 tcatgataca ttcgctccaa tcctaac 27

<210> 28 <211> 30 <212> DNA

<213> Methylococcus capsulatus

<400> 28 ttagtctcat gagtcagctc gggtgttgag 30

<210> 29 <211> 21 <212> DNA <213> Methylococcus capsulatus

<400> 29 ctccaagcct acattcgctc c 21

<210> 30

<211> 66

<212> DNA

<213> Methylococcus capsulatus

<400> 30 ctagctcatt agatcggccg ctgtccattg actacagtct gggcactggt gatgtcgacc 60

gtgccg 66

<210> 31 <211> 66 <212> DNA <213> Methylococcus capsulatus

<400> 31 ctagcggcac ggtcgacatc accagtgccc agactgtagt caatggacag cggccgatct 60

aatgag 66

<210> 32 <211> 22 <212> DNA

<213> Methylococcus capsulatus

<400> 32 gcgtggctag cacattcgct cc 22

<210> 33 <211> 22 <212> DNA

<213> Methylococcus capsulatus

<400> 33 ccgagctgac gctagcgagc tc 22