SIPSEY SANDY (US)
MADEIROS AMY (US)
GURNANI SRISHTI (US)
WO2019126041A1 | 2019-06-27 |
US20160153020A1 | 2016-06-02 | |||
US20160102298A1 | 2016-04-14 | |||
US20170320959A1 | 2017-11-09 | |||
US20190271021A1 | 2019-09-05 | |||
US20100143969A1 | 2010-06-10 | |||
US20110224102A1 | 2011-09-15 | |||
US20150252108A1 | 2015-09-10 |
WHAT IS CLAIMED IS: 1. A fusion protein comprising: an N-terminal signal sequence; and an enzymatically active portion of human Alpha-2,6-sialyltransferase 1 (ST6Gal1). 2. The fusion protein of claim 1, wherein the enzymatically active portion of ST6Gal1 comprises SEQ ID NO:4. 3. The fusion protein of claim 2, wherein the enzymatically active portion of ST6Gal1 consists of SEQ ID NO:4. 4. The fusion protein of any one of claims 1–3, wherein the signal sequence is an N- terminal azurocidin signal sequence. 5. The fusion protein of claim 4, wherein the azurocidin signal sequence comprises MTRLTVLALLAGLLASSRA (SEQ ID NO:30). 6. The fusion protein of claim 4, wherein the azurocidin signal sequence consists of MTRLTVLALLAGLLASSRA (SEQ ID NO:30). 7. The fusion protein of any one of claims 1–6, further comprising an affinity tag. 8. The fusion protein of claim 7, wherein the affinity tag is selected from the group consisting of polyhistidine, glutathione S-transferase (GST), maltose-binding protein (MBP), chitin binding protein, a streptavidin tag (e.g., Trp-Ser-His-Pro-Gln-Phe-Glu-Lys (SEQ ID NO:31)), FLAG-tag (e.g., DYKDDDDK (SEQ ID NO:32)), a biotin tag, and combinations thereof. 9. The fusion protein of claim 8, wherein the polyhistidine tag is selected from the group consisting of HHHH (SEQ ID NO:11), HHHHH (SEQ ID NO:12), HHHHHH, (SEQ ID NO:13), HHHHHHH (SEQ ID NO:14), HHHHHHHH (SEQ ID NO:15), HHHHHHHHH (SEQ ID NO:16), and HHHHHHHHHH (SEQ ID NO:17). 10. The fusion protein of any one of claims 7–9, wherein the affinity tag is situated towards the N-terminal side of the enzymatically active portion of ST6Gal1. 11. The fusion protein of claim 1, wherein the N-terminal signal sequence comprises MTRLTVLALLAGLLASSRA (SEQ ID NO:30), and the enzymatically active portion of ST6Gal1 comprises SEQ ID NO:4. 12. The fusion protein of claim 11, further comprising a hexahistidine tag. 13. The fusion protein of claim 12, wherein the hexahistidine tag is between the N- terminal signal sequence and the enzymatically active portion of ST6Gal1. 14. The fusion protein of claim 13, consisting of SEQ ID NO:6. 15. A nucleic acid molecule encoding the fusion protein of any one of claims 1–14. 16. A vector comprising the nucleic acid molecule of claim 15. 17. The vector of claim 16 further comprising a promoter operably linked to the nucleic acid encoding the fusion protein. 18. The vector of claim 16, wherein the promoter is a cytomegalovirus (CMV) promoter. 19. A host cell stably transformed the vector of claim 16. 20. The host cell of claim 17, wherein the cell is a human embryonic kidney (HEK) cell, or derivative thereof. 21. The host cell of claim 20, wherein the cell is the HEK derivative HEK293. 22. A method for producing a polypeptide comprising: culturing the host cell of any one of claims 19–21 in a culture medium under conditions permissive for expression of the fusion protein; and isolating the fusion protein from the culture medium. 23. A fusion protein comprising: an N-terminal signal sequence; and an enzymatically active portion of human beta-1,4-galactosyltransferase (B4GalT1). 24. The fusion protein of claim 23, wherein the enzymatically active portion of B4GalT1 comprises SEQ ID NO:43. 25. The fusion protein of claim 23, wherein the enzymatically active portion of B4GalT1 consists of SEQ ID NO:43. 26. The fusion protein of any one of claims 23–25, wherein the signal sequence is an N-terminal azurocidin signal sequence. 27. The fusion protein of claim 23, wherein the azurocidin signal sequence comprises MTRLTVLALLAGLLASSRA (SEQ ID NO:30). 28. The fusion protein of claim 23, wherein the azurocidin signal sequence consists of MTRLTVLALLAGLLASSRA (SEQ ID NO:30). 29. The fusion protein of any one of claims 23–28, further comprising an affinity tag. 30. The fusion protein of claim 29, wherein the affinity tag is selected from the group consisting of polyhistidine, glutathione S-transferase (GST), maltose-binding protein (MBP), chitin binding protein, a streptavidin tag (e.g., Trp-Ser-His-Pro-Gln-Phe-Glu-Lys (SEQ ID NO:31)), FLAG-tag (e.g., DYKDDDDK (SEQ ID NO:32)), a biotin tag, and combinations thereof. 31. The fusion protein of claim 30, wherein the polyhistidine tag is selected from the group consisting of HHHH (SEQ ID NO:11), HHHHH (SEQ ID NO:12), HHHHHH, (SEQ ID NO:13), HHHHHHH (SEQ ID NO:14), HHHHHHHH (SEQ ID NO:15), HHHHHHHHH (SEQ ID NO:16), and HHHHHHHHHH (SEQ ID NO:17). 32. The fusion protein of any one of claims 23–31, wherein the affinity tag is situated towards the C-terminal side of the enzymatically active portion of B4GalT1. 33. The fusion protein of claim 23, wherein the N-terminal signal sequence comprises MTRLTVLALLAGLLASSRA (SEQ ID NO:30), and the enzymatically active portion of B4GalT1comprises SEQ ID NO:43. 34. The fusion protein of claim 33, further comprising a septahistidine tag. 35. The fusion protein of claim 34, wherein the septahistidine tag is C-terminal. 36. The fusion protein of claim 35, consisting of SEQ ID NO:45. 37. A nucleic acid molecule encoding the fusion protein of any one of claims 23–36. 38. A vector comprising the nucleic acid molecule of claim 37. 39. The vector of claim 38 further comprising a promoter operably linked to the nucleic acid encoding the fusion protein. 40. The vector of claim 39, wherein the promoter is a cytomegalovirus (CMV) promoter. 41. A host cell stably transformed with the vector of claim 16. 42. The host cell of claim 41, wherein the cell is a human embryonic kidney (HEK) cell, or derivative thereof. 43. The host cell of claim 42, wherein the cell is the HEK derivative HEK293. 44. A method for producing a polypeptide comprising: culturing the host cell of any one of claims 41–43 in a culture medium under conditions permissive for expression of the fusion protein; and isolating the fusion protein from the culture medium. 45. A method for sialyating immunoglobulin G (IgG) antibodies, the method comprising: a) providing a composition comprising IgG antibodies; b) exposing the composition to a α1,4-galactosyltransferase 1 and an enzymatically active portion of ST6Gal1 comprising SEQ ID NO:4 in the presence of UDP-Gal and CMP-NANA, thereby producing a composition comprising sialyated IgG (sIgG). 46. A method for sialyating immunoglobulin G (IgG) antibodies, the method comprising: a) providing a composition comprising IgG antibodies; b) exposing the IgG antibodies to a α1,4-galactosyltransferase 1 in the presence of UDP- Gal, thereby producing a composition comprising galactosylated IgG antibodies; and c) exposing the composition comprising galactosylated IgG antibodies to an enzymatically active portion of ST6Gal1 comprising SEQ ID NO:4 in the presence of CMP- NANA, thereby producing a composition comprising sialyated IgG (sIgG). 47. The method of claim 46, wherein neither the composition comprising galactosylated IgG antibodies is not purified prior to step (c). 48. The method of any one of claims 45–47, further comprising supplementing one or more of the compositions with CMP-NANA. 49. The method of any one of claims 45–48, where the mixture of IgG antibodies are selected from the group consisting of IgG1, IgG2, IgG3, IgG4, and combinations thereof. 50. The method of claim 45 or 46, wherein at least 60% of branched glycans on the Fc region of the antibodies in the composition comprising sIgG are di-sialylated. 51. The method of claim 45 or 46, wherein less than 50% of branched glycans on the Fc regions of the antibodies in the composition comprising sIgG are mono-sialylated. |
Table 4. Post Translational Amino Acid Modifications of B4GALT1 isoform 1 (SEQ ID NO:37) The soluble form of B4GalT1 derives from the membrane form by proteolytic processing. The cleavage site is at positions 77–78 of B4GALT1 isoform 1 (SEQ ID NO:37). In some embodiments, one or more of the amino acids of the B4GalT1 corresponding to amino acids 113, 130, 172, 243, 250, 262, 310, 343, or 355 of B4GALT1 isoform 1 (SEQ ID NO:37) is conserved as compared to (SEQ ID NO:37). Provided herein are enzymatically active portions of, e.g., B4GalT1. In some embodiments, the enzyme is an enzymatically active portion of B4GALT1 isoform 1 (SEQ ID NO:37), or an ortholog, mutant, or variant of SEQ ID NO:37. In some embodiments, the enzyme is an enzymatically active portion of B4GALT1 isoform 2 (SEQ ID NO:38), or an ortholog, mutant, or variant of SEQ ID NO:38. In some embodiments, the enzyme is an enzymatically active portion of B4GALT1 isoform 3 (SEQ ID NO:39), or an ortholog, mutant, or variant of SEQ ID NO:39. In some embodiments, the enzyme is an enzymatically active portion of B4GALT1 isoform 4 (SEQ ID NO:40), or an ortholog, mutant, or variant of SEQ ID NO:40. In some embodiments, the enzymatically active portion of B4GalT1 does not comprise a cytoplasmic domain, e.g., SEQ ID NO:41. In some embodiments, the enzymatically active portion of B4GalT1 does not comprise a transmembrane domain, e.g., SEQ ID NO:42. In some embodiments, the enzymatically active portion of B4GalT1 does not comprise a cytoplasmic domain, e.g., SEQ ID NO:41 or a transmembrane domain, e.g., SEQ ID NO:42. In some embodiments, the enzymatically active portion of B4GalT1 comprises all or a portion of a luminal domain, e.g., SEQ ID NO:43, or an ortholog, mutants, or variants thereof. In some embodiments, the enzymatically active portion of B4GalT1 comprises amino acids 109–398 of SEQ ID NO:37, or an ortholog, mutants, or variants thereof. In some embodiments, the enzymatically active portion of B4GalT1 consists of SEQ ID NO:37, or an ortholog, mutant, or variant of SEQ ID NO:37. A suitable functional portion of an B4GalT1 can comprise or consist of an amino acid sequence that is at least 80% (85%, 90%, 95%, 98% or 100%) identical to SEQ ID NO:43. SEQ ID NO:43 GPASNLTSVPVPHTTALSLPACPEESPLLVGPMLIEFNMPVDLELVAKQNPNVKMGGRYA PRDC VSPHKVAIIIPFRNRQEHLKYWLYYLHPVLQRQQLDYGIYVINQAGDTIFNRAKLLNVGF QEAL KDYDYTCFVFSDVDLIPMNDHNAYRCFSQPRHISVAMDKFGFSLPYVQYFGGVSALSKQQ FLTI NGFPNNYWGWGGEDDDIFNRLVFRGMSISRPNAVVGRCRMIRHSRDKKNEPNPQRFDRIA HTKE TMLSDGLNSLTYQVLDVQRYPLYTQITVDIGTPS ST6Gal1, e.g., human ST6Gal1, as well as orthologs, mutants, and variants thereof, including enzymatically active portions of ST6Gal1, e.g., human ST6Gal1, as well as orthologs, mutants, and variants thereof, and fusion proteins comprising the same, are suitable for use in the methods described herein. ST6GAL1, α-galactoside α-2,6-sialyltransferase 1, transfers sialic acid from CMP-sialic acid to the Galαα?4GlcNAc structure on glycoproteins, such as asialofetuin and asialo-a1-acid glycoprotein. ST6Gal1 is also called as ST6N or SIAT1. Four alternative transcripts encoding two isoforms of ST6GAL1 (NCBI Gene ID 6480) are described in Table 1. Table 1. Human ST6GAL1 isoforms _ _ >NP_001340845.1 (NP_00302 [organism=Homo sapiens] [ MIHTNLKKKFSCCVLVFLLFAVICVWKEKKKGSYYDSFKLQTKEFQVLKSLGKLAMGSDS QSVS SSSTQDPHRGRQTLGSLRGLAKAKPEASFQVWNKDSSSKNLIPRLQKIWKNYLSMNKYKV SYKG PGPGIKFSAEALRCHLRDHVNVSMVEVTDFPFNTSEWEGYLPKESIRTKAGPWGRCAVVS SAGS LKSSQLGREIDDHDAVLRFNGAPTANFQQDVGTKTTIRLMNSQLVTTEKRFLKDSLYNEG ILIV WDPSVYHSDIPKWYQNPDYNFFNNYKTYRKLHPNQPFYILKPQMPWELWDILQEISPEEI QPNP PSSGMLGIIIMMTLCDQVDIYEFLPSKRKTDVCYYYQKFFDSACTMGAYHPLLYEKNLVK HLNQ GTDEDIYLLGKATLPGFRTIHC >NP_775324.1 ST6GAL1 [organism=Homo sapiens] [GeneID=6480] [isoform=b](SEQ ID NO:29) MNSQLVTTEKRFLKDSLYNEGILIVWDPSVYHSDIPKWYQNPDYNFFNNYKTYRKLHPNQ PFYI LKPQMPWELWDILQEISPEEIQPNPPSSGMLGIIIMMTLCDQVDIYEFLPSKRKTDVCYY YQKF FDSACTMGAYHPLLYEKNLVKHLNQGTDEDIYLLGKATLPGFRTIHC Table 2. Topology of ST6Gal1 isoform a (SEQ ID NO:28)
Table 3. Binding sites of ST6Gal1 isoform a (SEQ ID NO:28) Table 4. Post Translational Amino Acid Modifications of ST6Gal1 isoform a (SEQ ID NO:28)
The soluble form of ST6Gal1 derives from the membrane form by proteolytic processing. In some embodiments, one or more of the amino acids of the ST6Gal1 corresponding to amino acids 142, 149, 161, 184, 189, 212, 233, 335, 353, 354, 364, 365, 369, 370, 376, or 406 of ST6Gal1 isoform a (SEQ ID NO:28) is conserved as compared to SEQ ID NO:28. Also provided herein is an enzymatically active portion of, e.g., ST6Gal1. In some embodiments, the enzyme is an enzymatically active portion of STG6Gal1 isoform a (SEQ ID NO:28), or an ortholog, mutant, or variant of SEQ ID NO:28. In some embodiments, the enzyme is an enzymatically active portion of STG6Gal1 isoform b (SEQ ID NO:29), or an ortholog, mutant, or variant of SEQ ID NO:29. In some embodiments, the enzymatically active portion of ST6Gal1 does not comprise a cytoplasmic domain, e.g., SEQ ID NO:34. In some embodiments, the enzymatically active portion of ST6Gal1 does not comprise a transmembrane domain, e.g., SEQ ID NO:35. In some embodiments, the enzymatically active portion of ST6Gal1 does not comprise a cytoplasmic domain, e.g., SEQ ID NO:34 or a transmembrane domain, e.g., SEQ ID NO:35. In some embodiments, the enzymatically active portion of ST6Gal1 comprises all or a portion of a luminal domain, e.g., SEQ ID NO:36, or an ortholog, mutants, or variants thereof. In some embodiments, the enzymatically active portion of ST6Gal1 comprises amino acids 87–406 of SEQ ID NO:28 (SEQ ID NO:4), or an ortholog, mutants, or variants thereof. In some embodiments, the enzymatically active portion of ST6Gal1 consists of SEQ ID NO:4, or an ortholog, mutant, or variant of SEQ ID NO:4. A suitable functional portion of an ST6Gal1 can comprise or consist of an amino acid sequence that is at least 80% (85%, 90%, 95%, 98% or 100%) identical to SEQ ID NO:3 or SEQ ID NO:4. SEQ ID NO:4 AKPEASFQVWNKDSSSKNLIPRLQKIWKNYLSMNKYKVSYKGPGPGIKFSAEALRCHLRD HVNV SMVEVTDFPFNTSEWEGYLPKESIRTKAGPWGRCAVVSSAGSLKSSQLGREIDDHDAVLR FNGA PTANFQQDVGTKTTIRLMNSQLVTTEKRFLKDSLYNEGILIVWDPSVYHSDIPKWYQNPD YNFF NNYKTYRKLHPNQPFYILKPQMPWELWDILQEISPEEIQPNPPSSGMLGIIIMMTLCDQV DIYE FLPSKRKTDVCYYYQKFFDSACTMGAYHPLLYEKNLVKHLNQGTDEDIYLLGKATLPGFR TIHC Variants In some embodiments, the enzyme(s) described herein are at least 80%, e.g., at least 85%, 90%, 95%, 98%, or 100% identical to the amino acid sequence of an exemplary sequence (e.g., as provided herein), e.g., have differences at up to 1%, 2%, 5%, 10%, 15%, or 20% of the residues of the exemplary sequence replaced, e.g., with conservative mutations, e.g., including or in addition to the mutations described herein. In preferred embodiments, the variant retains desired activity of the parent, e.g., α-galactoside α-2,6-sialyltransferase activity or α-1,4- galactosyltransferase activity. To determine the percent identity of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). The length of a reference sequence aligned for comparison purposes is at least 80% of the length of the reference sequence, and in some embodiments is at least 90% or 100%. The nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position (as used herein nucleic acid “identity” is equivalent to nucleic acid “homology”). The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences. Percent identity between a subject polypeptide or nucleic acid sequence (i.e. a query) and a second polypeptide or nucleic acid sequence (i.e. target) is determined in various ways that are within the skill in the art, for instance, using publicly available computer software such as Smith Waterman Alignment (Smith, T. F. and M. S. Waterman (1981) J Mol Biol 147:195-7); “BestFit” (Smith and Waterman, Advances in Applied Mathematics, 482-489 (1981)) as incorporated into GeneMatcher Plus TM , Schwarz and Dayhof (1979) Atlas of Protein Sequence and Structure, Dayhof, M.O., Ed, pp 353-358; BLAST program (Basic Local Alignment Search Tool; (Altschul, S. F., W. Gish, et al. (1990) J Mol Biol 215: 403-10), BLAST-2, BLAST-P, BLAST-N, BLAST-X, WU-BLAST-2, ALIGN, ALIGN-2, CLUSTAL, or Megalign (DNASTAR) software. In addition, those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the length of the sequences being compared. In general, for target proteins or nucleic acids, the length of comparison can be any length, up to and including full length of the target (e.g., 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or 100%). For the purposes of the present disclosure, percent identity is relative to the full length of the query sequence. For purposes of the present disclosure, the comparison of sequences and determination of percent identity between two sequences can be accomplished using a Blossum 62 scoring matrix with a gap penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5. Conservative substitutions typically include substitutions within the following groups: glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. Fusion Proteins Also provided herein are fusion protein(s) comprising enzyme(s) or portions thereof as described herein. In one embodiment, the fusion protein comprises a signal sequence. In some embodiments, the signal sequence is about 15 to about 20 amino acid, e.g., about 15, 16, 17, 18, 19, or 20 amino acids long. In some embodiments, the signal sequence comprises a hydrophobic core region (h-region) flanked by an n-region and a c-region. In some embodiments, the c- region comprises a signal peptidase consensus cleavage site. In some embodiments, signal sequence is an N-terminal signal sequence. In some embodiments, the signal sequence is an azurocidin signal sequence. In some embodiments, the azurocidin signal sequence comprises or consists of MTRLTVLALLAGLLASSRA (SEQ ID NO:30). In some embodiments, the signal sequence is a serum albumin signal sequence. In some embodiments, the serum albumin signal sequence comprises or consists of MKWVTFISLLFLFSSAYS (SEQ ID NO:46). In some embodiments, the signal sequence is an immunoglobulin heavy chain signal sequence. In some embodiments, the immunoglobulin heavy chain signal sequence comprises or consists of MDWTWRVFCLLAVTPGAHP (SEQ ID NO:47). In some embodiments, the signal sequence is an immunoglobulin light chain signal sequence. In some embodiments, the immunoglobulin light chain signal sequence comprises or consists of MDWTWRVFCLLAVTPGAHP (SEQ ID NO:48). In some embodiments, the signal sequence is a cystatin-S signal sequence. In some embodiments, the cystatin-S signal sequence comprises or consists of MARPLCTLLLLMATLAGALA (SEQ ID NO:49). In some embodiments, the signal sequence is an IgKappa signal sequence. In some embodiments, the IgKappa signal sequence comprises or consists of MDMRAPAGIFGFLLVLFPGYRS (SEQ ID NO:50). In some embodiments, the signal sequence is a trypsonigen 2 signal sequence. In some embodiments, the trysonigen 2 signal sequence comprises or consists of MRSLVFVLLIGAAFA (SEQ ID NO:51). In some embodiments, the signal sequence is potassium channel blocker signal sequence. In some embodiments, the potassium channel blocker sequence comprises or consists of MSRLFVFILIALFLSAIIDVMS (SEQ ID NO:52). In some embodiments, the signal sequence is an alpha conotoxin Ip 1.3 signal sequence. In some embodiments, the alpha conotoxin Ip1.3 signal sequence comprises or consists of MGMRMMFIMFMLVVLATTVVS (SEQ ID NO:53). In some embodiments, the signal sequence is an alfa-galactosidase signal sequence. In some embodiments, the alfa-galactosidase signal sequence comprises or consists of MRAFLFLTACISLPGVFG (SEQ ID NO:54). In some embodiments, the signal sequence is a cellulase signal sequence. In some embodiments, the cellulase signal sequence comprises or consists of MKFQSTLLLAAAAGSALA (SEQ ID NO:55). In some embodiments, signal sequence is an aspartic proteinase nepenthesin-1 signal sequence. In some embodiments, the aspartic proteinase nepenthesin-1 signal sequence comprises or consists of MASSLYSFLLALSIVYIFVAPTHS (SEQ ID NO:56). In some embodiments, the signal sequence is an acid chitinase signal sequence. In some embodiments, the acid chitinase signal sequence comprises or consists of MKTHYSSAILPILTLFVFLSINPSHG (SEQ ID NO:57). In some embodiments, the signal sequence is a K28 prepro-toxin signal sequence. In some embodiments, the K28 prepro-toxin signal sequence comprises or consists of MESVSSLFNIFSTIMVNYKSLVLALLSVSNLKYARG (SEQ ID NO:58). In some embodiments, the signal sequence is a killer toxin zygocin precursor signal sequence. In some embodiments, the killer toxin zygocin precursor signal sequence comprises or consists of MKAAQILTASIVSLLPIYTSA (SEQ ID NO:59). In some embodiments, the signal sequence is a cholera toxin signal sequence. In some embodiments, the cholera toxin signal sequence comprises or consists of MIKLKFGVFFTVLLSSAYA (SEQ ID NO:60). In some embodiments, the signal sequence is a human growth hormone signal sequence. In some embodiments, the human growth hormone signal sequence comprises or consists of MATGSRTSLLLAFGLLCLPWLQEGSA (SEQ ID NO:61) In some embodiments, the fusion protein comprises one or more affinity tag(s). In some embodiments, the purification tag is selected from the group consisting of polyhistidine, glutathione S-transferase (GST), maltose-binding protein (MBP), chitin binding protein, a streptavidin tag (e.g., Strep-Tag®, e.g., Trp-Ser-His-Pro-Gln-Phe-Glu-Lys (SEQ ID NO:31)), FLAG-tag (e.g., DYKDDDDK (SEQ ID NO:32)), a biotin tag (e.g., AviTag TM ) and combinations thereof. In some embodiments, the affinity tag is situated towards the N-terminal side of the enzyme or portion thereof. In some embodiments, the affinity tag is N-terminal. In some embodiments, the affinity tag is situated towards the C-terminal side of the enzyme or portion thereof. In some embodiments, the affinity tag is C-terminal. In some embodiments, the affinity tag is a polyhistidine tag. In some embodiments, the polyhistidine tag is selected from the group consisting of HHHH (SEQ ID NO:11), HHHHH (SEQ ID NO:12), HHHHHH, (SEQ ID NO:13), HHHHHHH (SEQ ID NO:14), HHHHHHHH (SEQ ID NO:15), HHHHHHHHH (SEQ ID NO:16), and HHHHHHHHHH (SEQ ID NO:17). In some embodiments, the polyhistidine tag is a hexahistidine tag (e.g., HHHHHH (SEQ ID NO:13)). In some embodiments, the fusion protein comprises or consists of SEQ ID NO:43, SEQ ID NO:44, or SEQ ID NO:45. SEQ ID NO:44 GPASNLTSVPVPHTTALSLPACPEESPLLVGPMLIEFNMPVDLELVAKQNPNVKMGGRYA PRDC VSPHKVAIIIPFRNRQEHLKYWLYYLHPVLQRQQLDYGIYVINQAGDTIFNRAKLLNVGF QEAL KDYDYTCFVFSDVDLIPMNDHNAYRCFSQPRHISVAMDKFGFSLPYVQYFGGVSALSKQQ FLTI NGFPNNYWGWGGEDDDIFNRLVFRGMSISRPNAVVGRCRMIRHSRDKKNEPNPQRFDRIA HTKE TMLSDGLNSLTYQVLDVQRYPLYTQITVDIGTPSPRD SEQ ID NO:45 gssplldmGPASNLTSVPVPHTTALSLPACPEESPLLVGPMLIEFNMPVDLELVAKQNPN VKMG GRYAPRDCVSPHKVAIIIPFRNRQEHLKYWLYYLHPVLQRQQLDYGIYVINQAGDTIFNR AKLL NVGFQEALKDYDYTCFVFSDVDLIPMNDHNAYRCFSQPRHISVAMDKFGFSLPYVQYFGG VSAL SKQQFLTINGFPNNYWGWGGEDDDIFNRLVFRGMSISRPNAVVGRCRMIRHSRDKKNEPN PQRF DRIAHTKETMLSDGLNSLTYQVLDVQRYPLYTQITVDIGTPSprdhhhhhhh In some embodiments, the fusion protein comprises or consists of SEQ ID NO:3 or SEQ ID NO:5. SEQ ID NO:3 gssplldmlehhhhhhhhmAKPEASFQVWNKDSSSKNLIPRLQKIWKNYLSMNKYKVSYK GPGP GIKFSAEALRCHLRDHVNVSMVEVTDFPFNTSEWEGYLPKESIRTKAGPWGRCAVVSSAG SLKS SQLGREIDDHDAVLRFNGAPTANFQQDVGTKTTIRLMNSQLVTTEKRFLKDSLYNEGILI VWDP SVYHSDIPKWYQNPDYNFFNNYKTYRKLHPNQPFYILKPQMPWELWDILQEISPEEIQPN PPSS GMLGIIIMMTLCDQVDIYEFLPSKRKTDVCYYYQKFFDSACTMGAYHPLLYEKNLVKHLN QGTD EDIYLLGKATLPGFRTIHC SEQ ID NO:5 hhhhhhhhmAKPEASFQVWNKDSSSKNLIPRLQKIWKNYLSMNKYKVSYKGPGPGIKFSA EALR CHLRDHVNVSMVEVTDFPFNTSEWEGYLPKESIRTKAGPWGRCAVVSSAGSLKSSQLGRE IDDH DAVLRFNGAPTANFQQDVGTKTTIRLMNSQLVTTEKRFLKDSLYNEGILIVWDPSVYHSD IPKW YQNPDYNFFNNYKTYRKLHPNQPFYILKPQMPWELWDILQEISPEEIQPNPPSSGMLGII IMMT LCDQVDIYEFLPSKRKTDVCYYYQKFFDSACTMGAYHPLLYEKNLVKHLNQGTDEDIYLL GKAT LPGFRTIHC Expression Systems To use the enzyme(s) and/or fusion protein(s) described herein, it may be desirable to express them from a nucleic acid that encodes them. This can be performed in a variety of ways. For example, the nucleic acid encoding the enzyme(s) and/or fusion protein(s) can be cloned into an intermediate vector for transformation into prokaryotic or eukaryotic cells for replication and/or expression. Intermediate vectors are typically prokaryote vectors, e.g., plasmids, or shuttle vectors, or insect vectors, for storage or manipulation of the nucleic acid encoding the enzyme(s) and/or fusion protein(s). The nucleic acid encoding the enzyme(s) and/or fusion protein(s) can also be cloned into an expression vector, for administration to a plant cell, animal cell, preferably a mammalian cell or a human cell, fungal cell, bacterial cell, or protozoan cell. To obtain expression, a sequence encoding the enzyme(s) and/or fusion protein(s) is typically subcloned into an expression vector that contains a promoter to direct transcription. Suitable bacterial and eukaryotic promoters are well known in the art and described, e.g., in Sambrook et al., Molecular Cloning, A Laboratory Manual (3d ed.2001); Kriegler, Gene Transfer and Expression: A Laboratory Manual (1990); and Current Protocols in Molecular Biology (Ausubel et al., eds., 2010). Bacterial expression systems for expressing engineered protein are available in, e.g., E. coli, Bacillus sp., and Salmonella (Palva et al., 1983, Gene 22:229-235). Kits for such expression systems are commercially available. Eukaryotic expression systems for mammalian cells, yeast, and insect cells are well known in the art and are also commercially available. The promoter used to direct expression of a nucleic acid depends on the particular application. For example, a strong constitutive promoter is typically used for expression and purification of fusion proteins. In some embodiments, the promoter is selected from the group consisting of human cytomegalovirus (CMV), EF-1 α (EF1A), elongation factor 1α short (EFS), CMV enhancer chicken α-Actin promoter and rabbit α-Globin splice acceptor site (CAG), hybrid CBA (CBh), spleen focus-forming virus (SFFV), murine stem cell virus (MSCV), simian virus 40 (SV40), mouse phosphoglycerate kinase 1 (mPGK), human phosphoglycerate kinase 1 (hPGK), and ubiquitin C (UBC) promoters. In some embodiments, the promoter is a human cytomegalovirus promoter (CMV). In addition to the promoter, the expression vector typically contains a transcription unit or expression cassette that contains all the additional elements required for the expression of the nucleic acid in host cells, either prokaryotic or eukaryotic. A typical expression cassette thus contains a promoter operably linked, e.g., to the nucleic acid sequence encoding the enzyme(s) and/or fusion protein(s) and any signals required, e.g., for efficient polyadenylation of the transcript, transcriptional termination, ribosome binding sites, or translation termination. Additional elements of the cassette may include, e.g., enhancers, and heterologous spliced intronic signals. In some embodiments, the expression vector comprises a woodchuck hepatitis virus post- transcriptional regulatory element (WPRE). See, e.g., Zufferey et al., “Woodchuck Hepatitis Virus Posttranscriptional Regulatory Element Enhances Expression of Transgenes Delivered by Retroviral Vectors,” Journal of Virology 73(4):2886–92 (1999). The particular expression vector used to transport the genetic information into the cell is selected with regard to the intended use of the enzyme(s) and/or fusion protein(s), e.g., expression in plants, animals, bacteria, fungus, protozoa, etc. Standard transfection methods are used to produce bacterial, mammalian, yeast or insect cell lines that express large quantities of protein, which are then purified using standard techniques (see, e.g., Colley et al., 1989, J. Biol. Chem., 264:17619-22; Guide to Protein Purification, in Methods in Enzymology, vol.182 (Deutscher, ed., 1990)). Transformation of eukaryotic and prokaryotic cells are performed according to standard techniques (see, e.g., Morrison, 1977, J. Bacteriol.132:349-351; Clark-Curtiss & Curtiss, Methods in Enzymology 101:347-362 (Wu et al., eds, 1983). Any of the known procedures for introducing foreign nucleotide sequences into host cells may be used. These include the use of calcium phosphate transfection, polybrene, protoplast fusion, electroporation, nucleofection, liposomes, microinjection, naked DNA, plasmid vectors, viral vectors, both episomal and integrative, and any of the other well-known methods for introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, e.g., Sambrook et al., supra). It is only necessary that the particular genetic engineering procedure used be capable of successfully introducing at least one gene into the host cell capable of expressing the enzyme(s) and/or fusion protein(s). In some embodiments, the host cells are stably transformed. In some embodiments, the host cells are grow under non-hypoxic conditions. The enzyme(s) and/or fusion protein(s) described herein can be produced by any protein production system known in the art, such as host cell based expression systems, synthetic biology platforms, or cell-free protein production platforms. In some embodiments, the protein production system is capable of post-translational modification(s), including, but not limited to one or more of glycosylation, e.g., N-glycosylated proteins, disulfide bond formation, and tyrosine phosphorylation. See, e.g., Boh and Ng, “Impact of Host Cell Line Choice on Glycan Profile,” Critical Reviews in Biotechnology 38(6):851–67 (2018). In some embodiments, the host cell is a mammalian host cell. In some embodiments, the mammalian cell is selected from the group consisting of Chinese hamster ovary (CHO) cells, baby hamster kidney (BHK) cells, NS0 myeloma cells, Sp2/0 hybridoma mouse cells, human embryonic kidney cells (HEK), HT-1080 human cells, and derivatives thereof. In some embodiments, the host cell is a non-human mammalian host cell. In some embodiments, the non-human mammalian host cell is selected from CHO cells, BHK-21 cells, murine NS0 myeloma cells, Sp2/0 hybridoma cells, and derivatives thereof. In some embodiments, the host cell is a human mammalian host cell. In some embodiments, the human cell is selected from the group consisting of HEK, PER.C6, CEVEC’s amniocyte production (CAP), AGE1.HM, HKB-11, HT-1080 cells, and derivatives thereof. In some embodiments, the host cell is a human embryonic kidney cell (HEK, ATCC® CRL-1573 TM ) or derivative thereof. In some embodiments, the HEK cell expresses a temperature sensitive allele of the SV40 T antigen. In some embodiments, the HEK cell is resistant against the Ricin toxin after ethymethanesulfonate (EMS) mutagenesis and lack N-acetylglucosaminyltransferase I activity, e.g., encoded by the MGAT1 gene. In some embodiments, the HEK cell predominantly modifies glycoproteins with the Man5GlcNAc2 N-glycan. In some embodiments, the HEK cell expresses the tetR repressor, enabling tetracycline-inducible protein expression. In some embodiments, the HEK derivative is selected from the group consisting of HEK293, HEK293T (293tsA1609neo, ATCC® CRL-3216 TM ), HEK293T/17 (ATCC® CRL- 11268 TM ), HEK293T/17 SF (ATCC® ACS-4500 TM ), HEK293S, HEK293SG, HEK293FTM, HEK293SGGD, HEK293FTM, HEK293E, and HKB-11. Synthetic biology platforms, such as those described in Kightlinger et al., “Synthetic Glycobiology: Parts, Systems, and Applications,” ACS Synth. Biol.9:1534–62 (2020) are also suitable for producing the enzyme(s) and/or fusion protein(s) described herein. Also provided herein are vectors and cells comprising the vectors, as well as kits comprising the proteins and nucleic acids described herein, e.g., for use in a method described herein. EXAMPLES The invention is further described in the following examples, which do not limit the scope of the invention described in the claims. Example 1: Hypersialylated IgG Preparation IgG in which more than 60% of the overall branched glycans are disialylated can be prepared as follows. Briefly, a mixture of IgG antibodies is exposed to a sequential enzymatic reaction using α1,4 galactosyltransferase 1 (B4-GalT) and α2,6-sialyltransferase (ST6-Gal1) enzymes. The B4- GalT does not need to be removed from the reaction before addition of ST6-Gal1 and no partial or complete purification of the product is needed between the enzymatic reactions. The galactosyltransferase enzyme selectively adds galactose residues to pre-existing asparagine-linked glycans. The resulting galactosylated glycans serve as substrates to the sialic acid transferase enzyme which selectively adds sialic acid residues to cap the asparagine-linked glycan structures attached to. Thus, the overall sialylation reaction employed two sugar nucleotides (uridine 5?-diphosphogalactose (UDPGal) and cytidine-5?-monophospho-N- acetylneuraminic acid (CMP-NANA)). The latter is replenished periodically to increase disialylated product relative to monosialylated product. The reaction includes the co-factor manganese chloride. A representative example of the IgG-Fc glycan profile for such a reaction starting with IVIg and the reaction product is shown in FIG.4. In FIG.4, on the left is a schematic representation of enzymatic sialylation reaction to transform IgG to hsIgG; on the right is the IgG Fc glycan profile for the starting IVIg and hsIgG. In this study, glycan profiles for the different IgG subclasses are derived via glycopeptide mass spectrometry analysis. The peptide sequences used to quantify glycopeptides for different IgG subclasses were: IgG1 = EEQYNSTYR (SEQ ID NO:7), IgG2/3 EEQFNSTFR (SEQ ID NO:8), IgG3/4 EEQYNSTFR (SEQ ID NO:9) and EEQFNSTYR (SEQ ID NO:10). The glycan data is shown per IgG subclass. Glycans from IgG3 and IgG4 subclasses cannot be quantified separately. As shown, for IVIg the sum of all the nonsialylated glycans is more than 80% and the sum of all sialylated glycans is < 20%. For the reaction product, the sum for all nonsialylated glycans is < 20% and the sum for all sialylated glycans is more than 80%. Nomenclature for different glycans listed in the glycoprofile use the Oxford notation for N linked glycans. Example 2: Alternative Sialylation Condition Alternative suitable reaction conditions for galactosylation and sialylation to create hsIgG in, e.g., 50 mM BIS-TRIS pH 6.9 include: galactosylation of IgG antibodies (e.g., pooled IgG antibodies, pooled immunoglobulins or IVIg) as follows: 7.4 mM MnCl 2 ; 38 αmol UDP-Gal/g IgG antibody; and 7.5 units B4GalT/g IgG antibody with 16-24 hours of incubation at 37°C followed by sialylation in 7.4 mM MnCl 2 ; 220 αmol CMP-NANA/g IgG antibody (added twice: once at the start of the reaction and again after 9-10 hrs); and 15 units ST6-Gal1/g IgG antibody with 30-33 hours of incubation at 37°. The reaction can be carried out by adding the ST6-Gal1 and CMP-NANA to the galactosylation reaction. Alternatively, all of the reactants can be combined at the outset and the CMP-NANA supplemented. Example 3: Production of ST6Gal A fusion protein that includes an enzymatically active portion of ST6Gal was designed for high level expression in HEK cells and ease of purification. SEQ ID NO:6 is the immature fusion protein which includes a portion of human ST6Gal (SEQ ID NO:4), a 6 HIS tag, a signal sequence from azurocidin (MTRLTVLALL AGLLASSRAGSSPLLD (SEQ ID NO:31); 19 aa is signal is underlined) and amino acids resulting from the cloning process. SEQ ID NO:3 is the secreted form, and SEQ ID NO:5 includes the 6 HIS tag and the ST6GalT portion. SEQ ID NO: 6 MTRLTVLALLAGLLASSRAGSSPLLDMLEHHHHHHHHMAKPEASFQVWNKDSSSKNLI PRLQKIWKNYLSMNKYKVSYKGPGPGIKFSAEALRCHLRDHVNVSMVEVTDFPFNTSE WEGYLPKESIRTKAGPWGRCAVVSSGSLKSSQLGREIDDHDAVLRFNGAPTANFQQDV GTKTTIRLMNSQLVTTEKRFLKDSLYNEGILIVWDPSVYHSDIPKWYQNPDYNFFNNYK TYRKLHPNQPFYILKPQMPWELWDILQEISPEEIQPNPPSSGMLGIIIMMTLCDQVDIYE FL PSKRKTDVCYYYQKFFDSACTMGAYHPLLYEKNLVKHLNQGTDEDIYLLGKATLPGFR TIHC SEQ ID NO: 3 gssplldmlehhhhhhhhmAKPEASFQVWNKDSSSKNLIPRLQKIWKNYLSMNKYKVSYK GPG PGIKFSAEALRCHLRDHVNVSMVEVTDFPFNTSEWEGYLPKESIRTKAGPWGRCAVVSS AGSLKSSQLGREIDDHDAVLRFNGAPTANFQQDVGTKTTIRLMNSQLVTTEKRFLKDSL YNEGILIVWDPSVYHSDIPKWYQNPDYNFFNNYKTYRKLHPNQPFYILKPQMPWELWDI LQEISPEEIQPNPPSSGMLGIIIMMTLCDQVDIYEFLPSKRKTDVCYYYQKFFDSACTMG A YHPLLYEKNLVKHLNQGTDEDIYLLGKATLPGFRTIHC SEQ ID NO: 4 AKPEASFQVWNKDSSSKNLIPRLQKIWKNYLSMNKYKVSYKGPGPGIKFSAEALRCHLR DHVNVSMVEVTDFPFNTSEWEGYLPKESIRTKAGPWGRCAVVSSAGSLKSSQLGREIDD HDAVLRFNGAPTANFQQDVGTKTTIRLMNSQLVTTEKRFLKDSLYNEGILIVWDPSVYH SDIPKWYQNPDYNFFNNYKTYRKLHPNQPFYILKPQMPWELWDILQEISPEEIQPNPPSS GMLGIIIMMTLCDQVDIYEFLPSKRKTDVCYYYQKFFDSACTMGAYHPLLYEKNLVKH LNQGTDEDIYLLGKATLPGFRTIHC SEQ ID NO: 5 hhhhhhhhmAKPEASFQVWNKDSSSKNLIPRLQKIWKNYLSMNKYKVSYKGPGPGIKFSA EALRCHLRDHVNVSMVEVTDFPFNTSEWEGYLPKESIRTKAGPWGRCAVVSSAGSLKS SQLGREIDDHDAVLRFNGAPTANFQQDVGTKTTIRLMNSQLVTTEKRFLKDSLYNEGILI VWDPSVYHSDIPKWYQNPDYNFFNNYKTYRKLHPNQPFYILKPQMPWELWDILQEISPE EIQPNPPSSGMLGIIIMMTLCDQVDIYEFLPSKRKTDVCYYYQKFFDSACTMGAYHPLLY EKNLVKHLNQGTDEDIYLLGKATLPGFRTIHC HEK293 cells (Expi293F®cells; Life Technologies) were stably transfected with a vector expressing a polypeptide having SEQ ID NO: 6 under the control of a CMV promotor. To produce ST6GalT fusion protein, the stably transfected, and clonally selected cells were counted and seeded on Day 0 at a cell density of 0.4E6 cells/mL, grown at 37°C, 5% CO2, 130-150rpm. On Day 4, a 10% glucose/media feed was added to the cells. Growth was monitored by daily. On Day 7 cell supernatants were harvested, sterile filtered through a 0.45 micron filter and then a 0.2 micron filter. OTHER EMBODIMENTS It is to be understood that while the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.
Next Patent: SEPARATION PROCESS AND APPARATUS