Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
ENGINEERED PARASITES FOR DELIVERING PROTEIN TO THE CENTRAL NERVOUS SYSTEM (CNS)
Document Type and Number:
WIPO Patent Application WO/2018/002938
Kind Code:
A1
Abstract:
Provided are nucleic acid constructs, Toxoplasma comprising same, pharmaceutical compositions comprising same and methods using same for delivering a protein-of-interest to a tissue-of-interest of a subject, such as the CNS and further treating a pathology which is treatable by administration of a therapeutic polypeptide in a central nervous system of the subject.

Inventors:
RECHAVI ODED (IL)
BRACHA SHAHAR (IL)
SHEINER LILACH (GB)
Application Number:
PCT/IL2017/050731
Publication Date:
January 04, 2018
Filing Date:
June 29, 2017
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
RAMOT AT TEL-AVIV UNIV LTD (IL)
UNIV GLASGOW COURT (GB)
International Classes:
A61K35/68; A61P25/28; C12N15/79
Domestic Patent References:
WO2016046321A12016-03-31
Other References:
KOSHY, A. A. ET AL.: "Toxoplasma secreting Cre recombinase for analysis of host-parasite interactions", NATURE METHODS, vol. 7, no. 4, 7 March 2010 (2010-03-07), pages 307 - 309, XP055601210, DOI: 10.1038/nmeth.1438
SIBLEY, L. D.: "Invasion and Intracellular Survival by Protozoan Parasites", IMMUNOLOGICAL REVIEWS, vol. 240, no. 1, March 2011 (2011-03-01), pages 72 - 91, XP055601212, DOI: 10.1111/j.1600-065X.2010.00990.x
STANISLAUS, M. A. ET AL.: "Genetically engineered self-destruction: an alternative to herbicides for cover crop systems", WEED SCIENCE, vol. 50, no. 6, 30 November 2002 (2002-11-30), pages 794 - 801
WEISS, LOUIS M. ET AL.: "Toxoplasma gondii: the model apicomplexan: Perspectives and methods", ACADEMIC PRESS, 31 December 2011 (2011-12-31), Retrieved from the Internet
Attorney, Agent or Firm:
EHRLICH, Gal et al. (IL)
Download PDF:
Claims:
WHAT IS CLAIMED IS:

1. A nucleic acid construct comprising a heterologous polynucleotide comprising a first nucleic acid sequence encoding a Toxoplasma secreted protein in frame fused upstream to a second nucleic acid sequence encoding a pharmaceutical polypeptide, wherein said heterologous polynucleotide is operably linked to a promoter for directing transcription of said heterologous polynucleotide in a Toxoplasma, wherein said promoter is selected from the group consisting of: a constitutive promoter, an inducible promoter, a latent period- specific promoter, and a Toxoplasma endogenous promoter with the proviso that said promoter is not a Toxofilin promoter.

2. The nucleic acid construct of claim 1, wherein said endogenous promoter is not an endogenous promoter of a rhoptry protein.

3. The nucleic acid construct of claim 1, wherein said Toxoplasma secreted protein is secreted from a rhoptry of said Toxoplasma.

4. The nucleic acid construct of claim 3, wherein said Toxoplasma secreted protein which is secreted from said rhoptry comprises Toxofilin and/or ROP1.

5. The nucleic acid construct of claim 1, wherein said Toxoplasma secreted protein is a non-rhoptry protein.

6. The nucleic acid construct of claim 5, wherein said Toxoplasma secreted protein is selected from the group consisting of a microneme protein and a dense granule protein.

7. The nucleic acid construct of claim 1, wherein said Toxoplasma secreted protein is secreted from a dense granule of said Toxoplasma.

8. The nucleic acid construct of claim 7, wherein said protein secreted from said dense granule comprises the amino acid sequence selected from the group consisting of SEQ ID NOs: 234-279.

9. The nucleic acid construct of claim 7, wherein said Toxoplasma secreted protein which is secreted from said dense granule comprises a GRA16, and/or GRA24.

10. The nucleic acid construct of claim 1, wherein said Toxoplasma secreted protein is secreted from a microneme of said Toxoplasma.

11. The nucleic acid construct of claim 10, wherein said protein secreted from said microneme comprises the amino acid sequence selected from the group consisting of SEQ ID NOs: 280-322.

12. The nucleic acid construct of claim 1, wherein said Toxoplasma secreted protein comprises Toxoplasma gondii macrophage migration inhibitory factor (TgMIF).

13. The nucleic acid construct of any one of claims 1-12, wherein said heterologous polynucleotide further comprises a Toxoplasma untranslated region (UTR) nucleic acid sequence upstream and/or downstream of said heterologous polynucleotide.

14. The nucleic acid construct of any one of claims 1-13, further comprises a third nucleic acid sequence encoding an inducible self-destruction element.

15. The nucleic acid construct of any one of claims 1-14, with the proviso that said nucleic acid construct does not comprise a Cre-recombinase coding sequence.

16. The nucleic acid construct of any one of claims 1-15, suitable for integration into a genome of Toxoplasma.

17. The nucleic acid construct of claim 16, wherein said inducible self- destruction element is active in response to a drug.

18. The nucleic acid construct of claim 17, wherein said drug comprises an antibiotic.

19. The nucleic acid construct of any one of claims 1-18, further comprises at least one in frame cleavage site which allows detachment of said pharmaceutical polypeptide from said Toxoplasma secreted protein.

20. The nucleic acid construct of any one of claims 1-19, wherein said heterologous polynucleotide further comprises a nucleic acid sequence encoding a selectable marker.

21. A nucleic acid construct system comprising at least two nucleic acid constructs, wherein a first nucleic acid construct of said at least two nucleic acid constructs is the nucleic acid construct of any one of claims 1-19, and a second nucleic construct of said at least two nucleic acid constructs comprises a polynucleotide encoding a selectable marker.

22. The nucleic acid construct of claim 20 or 21, wherein said selectable marker comprises chloramphenicol acetyltransferase (CAT), DHFR-TS, BLE, HXGPRT, UPRT, TK, CD, a fluorescent protein or an epitope tag.

23. A toxoplama transformed with the nucleic acid construct of any one of claims 1-20 and 22, or with the nucleic acid construct system of claim 21 or 22.

24. The Toxoplasma of claim 23, being not attenuated.

25. The Toxoplasma of claim 23 or 24, being devoid of Toxoplasma elements which facilitate propagation of said Toxoplasma in a host cell.

26. The Toxoplasma of claim 25, being devoid of virulence genes which are not necessary for delivery of the protein-of-interest into a central nervous system (CNS) of a subject.

27. A pharmaceutical composition comprising the Toxoplasma of any one of claims 23-26, and a pharmaceutically acceptable carrier.

28. The pharmaceutical composition of claim 27, for treating a subject diagnosed with a pathology characterized by a deficient endogenous protein in a central nervous system of a subject.

29. The pharmaceutical composition of claim 27, for treating a subject diagnosed with a pathology which is treatable by administration of said pharmaceutical polypeptide in a central nervous system of a subject.

30. The pharmaceutical composition of claim 27, 28 or 29, further comprises an immunosuppression agent.

31. A method of administering a protein-of-interest into a central nervous system of a subject, the method comprising:

administering to the subject the Toxoplasma of any one of claims 23-26 or the pharmaceutical composition of claim 27, 28, 29 or 30, thereby administering the protein-of-interest to the central nervous system of the subject.

32. A method of treating a subject in need thereof, comprising administering to the subject the Toxoplasma of any one of claims 24-26 or the pharmaceutical composition of claim 27, 28, 29 or 30, wherein said subject is diagnosed with a pathology treatable by administration of said pharmaceutical polypeptide in a central nervous system of the subject, thereby treating the subject in need thereof.

33. A method of treating a subject in need thereof, comprising administering to the subject a Toxoplasma comprising a nucleic acid construct which comprises a heterologous polynucleotide comprising a first nucleic acid sequence encoding a Toxoplasma secreted protein in frame fused upstream to a second nucleic acid sequence encoding a pharmaceutical polypeptide, wherein said heterologous polynucleotide is operably linked to a promoter for directing transcription of said heterologous polynucleotide in a Toxoplasma, wherein said subject is diagnosed with a pathology treatable by administration of said pharmaceutical polypeptide in a central nervous system of the subject, thereby treating the subject in need thereof.

34. The method of claim 33, wherein said promoter is selected from the group consisting of: a constitutive promoter, an inducible promoter, a latent period- specific promoter, and a Toxoplasma endogenous promoter with the proviso that said promoter is not a toxofilin promoter.

35. A method of administering a polypeptide-of-interest to a subject, comprising administering to the subject a Toxoplasma comprising a nucleic acid construct which comprises a heterologous polynucleotide comprising a first nucleic acid sequence encoding a Toxoplasma secreted protein in frame fused upstream to a second nucleic acid sequence encoding the polypeptide-of-interest, wherein said heterologous polynucleotide is operably linked to a promoter for directing transcription of said heterologous polynucleotide in a Toxoplasma, wherein said promoter is selected from the group consisting of: a constitutive promoter, an inducible promoter, a latent period- specific promoter, and a Toxoplasma endogenous promoter with the proviso that said promoter is not a toxofilin promoter, thereby administering the polypeptide-of-interest to the subject.

36. The method of any one of claims 31-35, further comprising administering to the subject an immunosuppression drug prior to administration of said Toxoplasma and/or subsequent to administration of said Toxoplasma and/or concomitantly with administration of said Toxoplasma to the subject.

37. The method of any one of claims 31-34 and 36, wherein said pharmaceutical polypeptide comprises a wild type amino acid sequence corresponding to said endogenous protein capable of treating the pathology.

38. The method of any one of claims 31-34 and 36, wherein said pharmaceutical polypeptide comprises an antibody capable of treating the pathology.

39. The method of any one of claims 31-34 and 36, wherein said pharmaceutical polypeptide comprises a toxin capable of treating the pathology.

40. The method of any one of claims 31-34 and 36-39, further comprising administering to the subject a drug capable of inducing said self-destruction element.

Description:
ENGINEERED PARASITES FOR DELIVERING PROTEIN TO THE CENTRAL

NERVOUS SYSTEM (CNS)

FIELD AND BACKGROUND OF THE INVENTION

The present invention, in some embodiments thereof, relates to a nucleic acid construct for the secretion of a pharmaceutical polypeptide fused to a Toxoplasma gondii secreted polypeptide, and Toxoplasma comprising same, and, more particularly, but not exclusively, to a pharmaceutical compositions and methods of using same for treating a subject.

The lack of robust methods for the delivery of protein therapeutics is to a great degree the current bottleneck in their translation into clinical treatment. Proteins often serve a highly specific and complex set of functions that cannot be mimicked by simple chemical compounds (Nat Rev Drug Discov. 2008; 7(l):21-39. Protein therapeutics: a summary and pharmacological classification. Leader B, Baca QJ, and Golan DE); however, owing to their macromolecular nature, the delivery of therapeutic protein to target sites in the body is extremely challenging. Low functional stability and rapid loss of activity following administration or during storage, along with low permeability through biological barriers, limit the delivery of active proteins. The ongoing development of chemical modifications, conjugates and carrier systems aided the delivery of some therapeutic proteins, but these are mostly still limited to targets within the "accessible target space" - in the vascular compartment or on the surface of cells (Mitragotri 2014, Nat. Rev. Drug. Discov. 13(9):655-72). Efficient delivery is particularly challenging in the field of neurological diseases due to the blood-brain barrier (BBB), which tightly regulates the transport of molecules to the brain, while further complexity is added when the target is intracellular.

In order to deliver supplementary proteins to cells within the CNS, it is necessary to develop mechanisms that penetrate the BBB and may target specific cells within the CNS.

Currently, the major approaches for treating conditions caused by a deficiency of a specific protein are gene therapy, stem cell therapy and enzyme replacement therapy.

Gene therapy is the insertion of a functional copy of a gene that encodes a therapeutic protein (Cox, D. B. T., et al., 2015) which aims to increase the expression of a functional copy of the protein. While gene therapy provides the correct sequence of the gene, it does not address loss of protein function that is caused by or accompanied by defects in translational mechanisms or post-translational modifications. In addition, it can bear the risk of cancer development due to insertional mutagenesis (Persons, D. A. & Baum, C. 2011), mobilization, germline transmission, immunogenicity and limited transgene capacity (Schambach, A., et al., 2013; Al-Dosari et al., 2009). To target the CNS, gene therapy is most commonly mediated by the use of viral vectors, including those derived from herpes simplex virus type 1 (HSV-1), adenovirus, AAV, lentiviruses such as HIV-1, feline immunodeficiency virus or equine infectious anemia virus, and more recently SV40- AAV and lentivirus which are currently the most common of those. While some viral vectors are safer than the others, they do not address protein processing defects, and clinical efficiency remains limited.

Stem cell therapy relies on the protein synthesis mechanisms of implanted cells which can supplement the defective endogenous synthesis of the protein. Despite numerous efforts to implement this approach clinically, the efficiency of such treatments is reduced by rejection of implanted cells, and the low cost-efficiency of using patient- specific stem cells. In addition, the risks involved in stem cell treatments include complications associated with immunosuppression required to reduce rejection of grafted cells, and the development of cancer (Dimmeler, S., et al., 2014).

Protein- or Enzyme replacement therapy (ERT), in which the supplementary protein is synthesized externally and then delivered into the CNS, must overcome the impermeability of the BBB, and develop precise targeting in order to be clinically relevant. One such approach to protein delivery is injection directly into the brain or into adjacent organs from which the protein may diffuse at a low efficiency. Thus, in addition to the risk of such approaches, the ability to target specific regions within the CNS with the necessary amount is limited. Another problem of such approaches is the requirement of repeated dosing, which is often clinically impractical (Abbott, N. J. 2013).

ERT can be combined with an increase in BBB permeability to allow the therapeutic protein to diffuse into the CNS with greater efficiency (Malhotra, M. & Prakash, S. 2011). This is achieved through the shrinking of the cells that compose the BBB, or manipulation of the transport mechanisms through the BBB. However, this approach is not efficient due to opsonization, and bears the risk of unwanted substances diffusing into the CNS along with the supplemented protein of interest (Bradbury, M. W. B. 2012).

An alternative approach is the attachment of the therapeutic protein to elements that enter the CNS to utilize their inherent ability to cross the BBB. Agents that may mediate trafficking across the BBB include fusion proteins, dendrimers, solid lipid nanoparticles, liposomes and nanoparticles (Solaro, R., 2010). However, these approaches require further development and have not reached clinical trials.

Toxoplasma gondii is a single-cell intracellular protozoan parasite of the phylum apicomplexa. Its primary hosts are felines, and only in them it can go through the sexual stages of its life cycle. However, as secondary hosts it can infect many warm-blooded organisms, including humans. In humans as in rodents, after the host contracts the parasite (typically after ingestion of infectious tissue cysts or oocysts), the parasite differentiates into the fast-replicating tachyzoite stage. Toxoplasma gondii tachyzoites invade nucleated host cells by active penetration and establish a vacuole within which they replicate by endodyogeny. The tachyzoites travel through the intestinal epithelium, and by both "hitch-hiking" on immune cells and by mobilizing itself autonomously, they reach distal tissues, breach the blood brain barrier and spread in the brain [Harker 2015, Parasite Immunol. 37(3): 141-9]. Thus, the tachyzoites enter the circulation and disseminate to secondary tissues. Replication is critical for dissemination and for reaching distal tissues during the acute stage of infection. Once inside the unique environment of the brain, T. gondii penetrates cells in the brain (primarily neurons, but also glial cells in lower proportions) and following immune pressure, it differentiates to the latent bradyzoite stage, and resides in cytoplasmic intercellular cysts. This characterizes a chronic infection (Cabral et al. 2016, PLoS Pathog. 12(2):el005447). Tissue cysts harbouring bradyzoites persist for the lifetime of the host, while multiplying very slowly and having a quiescent metabolic program (Parasite Immunol. 2015, 37(3): 141-9. "Toxoplasma gondii dissemination: a parasite's journey through the infected host").

Although T. gondii infects an estimated one third of the world population, infection remains asymptomatic in healthy humans (Montoya, J. G. & Liesenfeld, O. 2004) and is primarily a risk only for individuals with irregularly compromised immune systems. Most importantly, during infection T. gondii secretes proteins into the host's cells, both during cell invasion and while intracellular. T. gondii has three types of electrodense secretory organelles: micronemes, rhoptries, and dense granules. Host cell invasion is mediated by the sequential secretion of the contents of all three organelles, which are exocytosed from the apical region when the parasite invades the host cell and when it resides inside it (Dlugonska, H. 2008). Micronemes are involved in the attachment and penetration of T. gondii, while rhoptries are required for creating a transient structure, moving junction and then the establishment of the PV. Rhoptry proteins are secreted upon initial contact with the host in a process termed "kiss and spit" (Boothroyd, J. C. et al., 2008). Dense granules secrete proteins throughout most of the parasite stages. The secretion process coincides with the formation of intravacuolar network and continues during the intracellular residence of T. gondii (Dlugonska, H. 2008; Carruthers, V. B. & Sibley, L. D. 1997).

Additional background art includes Koshy, A. A. et al. 2010; US8673289; US20120045477 Al; Lodoen M.B., et al. 2010. Cellular Microbiology, 12: 55-66.

SUMMARY OF THE INVENTION

According to an aspect of some embodiments of the present invention there is provided a nucleic acid construct comprising a heterologous polynucleotide comprising a first nucleic acid sequence encoding a Toxoplasma secreted protein in frame fused upstream to a second nucleic acid sequence encoding a pharmaceutical polypeptide, wherein the heterologous polynucleotide is operably linked to a promoter for directing transcription of the heterologous polynucleotide in a Toxoplasma, wherein the promoter is selected from the group consisting of: a constitutive promoter, an inducible promoter, a latent period- specific promoter, and a Toxoplasma endogenous promoter with the proviso that the promoter is not a Toxofilin promoter.

According to an aspect of some embodiments of the present invention there is provided a nucleic acid construct system comprising at least two nucleic acid constructs, wherein a first nucleic acid construct of the at least two nucleic acid constructs is the nucleic acid construct of some embodiments of the invention, and a second nucleic construct of the at least two nucleic acid constructs comprises a polynucleotide encoding a selectable marker. According to an aspect of some embodiments of the present invention there is provided a Toxoplasma transformed with the nucleic acid construct of some embodiments of the invention or with the nucleic acid construct system of some embodiments of the invention.

According to an aspect of some embodiments of the present invention there is provided a pharmaceutical composition comprising the Toxoplasma of some embodiments of the invention, and a pharmaceutically acceptable carrier.

According to an aspect of some embodiments of the present invention there is provided a method of administering a protein-of-interest into a central nervous system of a subject, the method comprising: administering to the subject the Toxoplasma of some embodiments of the invention or the pharmaceutical composition of some embodiments of the invention, thereby administering the protein-of-interest to the central nervous system of the subject.

According to an aspect of some embodiments of the present invention there is provided a method of treating a subject in need thereof, comprising administering to the subject the Toxoplasma of some embodiments of the invention or the pharmaceutical composition of some embodiments of the invention, wherein the subject is diagnosed with a pathology treatable by administration of the pharmaceutical polypeptide in a central nervous system of the subject, thereby treating the subject in need thereof.

According to an aspect of some embodiments of the present invention there is provided a method of treating a subject in need thereof, comprising administering to the subject a Toxoplasma comprising a nucleic acid construct which comprises a heterologous polynucleotide comprising a first nucleic acid sequence encoding a Toxoplasma secreted protein in frame fused upstream to a second nucleic acid sequence encoding a pharmaceutical polypeptide, wherein the heterologous polynucleotide is operably linked to a promoter for directing transcription of the heterologous polynucleotide in a Toxoplasma, wherein the subject is diagnosed with a pathology treatable by administration of the pharmaceutical polypeptide in a central nervous system of the subject, thereby treating the subject in need thereof.

According to some embodiments of the invention, the promoter is selected from the group consisting of: a constitutive promoter, an inducible promoter, a latent period- specific promoter, and a Toxoplasma endogenous promoter with the proviso that the promoter is not a Toxofilin promoter.

According to some embodiments of the invention, the endogenous promoter is not of a rhoptry protein.

According to some embodiments of the invention, the Toxoplasma being not attenuated.

According to some embodiments of the invention, the Toxoplasma secreted protein is secreted from a rhoptry of the Toxoplasma.

According to some embodiments of the invention, the Toxoplasma secreted protein which is secreted from the rhoptry comprises Toxofilin and/or ROP1.

According to some embodiments of the invention, the Toxoplasma secreted protein which is secreted from the rhoptry comprises the amino acid sequence selected from the group consisting of SEQ ID NOs:344-465.

According to some embodiments of the invention, the Toxoplasma secreted protein is a non-rhoptry protein.

According to some embodiments of the invention, the Toxoplasma secreted protein is selected from the group consisting of a microneme protein and a dense granule protein.

According to some embodiments of the invention, the Toxoplasma secreted protein is secreted from a dense granule of the Toxoplasma.

According to some embodiments of the invention, the Toxoplasma secreted protein which is secreted from the dense granule comprises a GRA16, and/or GRA24.

According to some embodiments of the invention, the Toxoplasma secreted protein is secreted from a microneme of the Toxoplasma.

According to some embodiments of the invention, the protein secreted from the microneme comprises the amino acid sequence selected from the group consisting of SEQ ID NOs:280-322.

According to some embodiments of the invention, the protein secreted from the dense granule comprises the amino acid sequence selected from the group consisting of SEQ ID NOs:234-279.

According to some embodiments of the invention, the Toxoplasma secreted protein comprises Toxoplasma gondii macrophage migration inhibitory factor (TgMIF). According to some embodiments of the invention, the heterologous polynucleotide further comprises a Toxoplasma untranslated region (UTR) nucleic acid sequence upstream and/or downstream of the Toxoplasma secreted protein open reading frame.

According to some embodiments of the invention, the Toxoplasma 5'- untranslated region (5'-UTR) is placed upstream of the Toxoplasma secreted protein open reading frame.

According to some embodiments of the invention, the Toxoplasma 3'- untranslated region (3'-UTR) is placed downstream of the Toxoplasma secreted protein open reading frame.

According to some embodiments of the invention, the Toxoplasma 3' untranslated region (3'-UTR) nucleic acid sequence is the GRA2 3'-UTR, GRA16 3'- UTR, GRA24 3'-UTR, SAG1 3'-UTR, or the DHFR 3'-UTR.

According to some embodiments of the invention, the Toxoplasma 5' untranslated region (5'-UTR) nucleic acid sequence is the GRA2 5'-UTR, GRA16 5'- UTR, GRA24 5'-UTR, SAG1 5'-UTR, or the DHFR 5'-UTR.

According to some embodiments of the invention, the Toxoplasma endogenous promoter is the GRA2 promoter, GRA16 promoter, GRA24 promoter, SAG1 promoter, or the DHFR promoter.

According to some embodiments of the invention, the Toxoplasma untranslated region (UTR) nucleic acid sequence is the Toxofilin 3'-UTR.

According to some embodiments of the invention, the nucleic acid construct further comprises a third nucleic acid sequence encoding an inducible self-destruction element.

According to some embodiments of the invention, the third nucleic acid sequence encoding the inducible self-destruction element is comprised in the same nucleic acid construct which comprises the heterologous polynucleotide comprising the first nucleic acid sequence encoding the Toxoplasma secreted protein in frame fused upstream to the second nucleic acid sequence encoding the pharmaceutical polypeptide.

According to some embodiments of the invention, the third nucleic acid sequence encoding the inducible self-destruction element is comprised in a separate nucleic acid construct with respect to the nucleic acid construct which comprises the heterologous polynucleotide comprising the first nucleic acid sequence encoding the Toxoplasma secreted protein in frame fused upstream to the second nucleic acid sequence encoding the pharmaceutical polypeptide.

According to some embodiments of the invention, the nucleic acid construct does not comprise a Cre-recombinase coding sequence.

According to some embodiments of the invention, the nucleic acid construct does not comprise a beta ( )-lactamase (BLA) coding sequence.

According to some embodiments of the invention, the nucleic acid construct is suitable for integration into a genome of Toxoplasma.

According to some embodiments of the invention, the inducible self-destruction element is active in response to a drug.

According to some embodiments of the invention, the drug comprises an antibiotic.

According to some embodiments of the invention, the nucleic acid construct further comprises at least one in frame cleavage site which allows detachment of the pharmaceutical polypeptide from the Toxoplasma secreted protein.

According to some embodiments of the invention, the Toxoplasma is devoid of elements which facilitate propagation of the Toxoplasma in a host cell.

According to some embodiments of the invention, the Toxoplasma is devoid of virulence genes which are not necessary for delivery of the protein-of-interest into a CNS of a subject.

According to some embodiments of the invention, the heterologous polynucleotide further comprises a nucleic acid sequence encoding a selectable marker.

According to some embodiments of the invention, the selectable marker comprises chloramphenicol acetyltransferase (CAT), DHFR-TS, BLE, HXGPRT, UPRT, TK, CD, a fluorescent protein (such as GFP, YFP, RFP, mCherry or other) or an epitope tag (such as HA, Myc, Ty-1, FLAG or other).

According to some embodiments of the invention, the pharmaceutical composition is for treating a subject diagnosed with a pathology characterized by a deficient endogenous protein in a central nervous system of a subject.

According to some embodiments of the invention, the pharmaceutical composition is for treating a subject diagnosed with a pathology which is treatable by administration of the pharmaceutical polypeptide in a central nervous system of a subject.

According to some embodiments of the invention, the pharmaceutical composition further comprises an immunosuppression agent.

According to some embodiments of the invention, the method further comprises administering to the subject an immunosuppression drug prior to administration of the Toxoplasma and/or subsequent to administration of the Toxoplasma and/or concomitantly with administration of the Toxoplasma to the subject.

According to some embodiments of the invention, the method further comprises administering to the subject an immunosuppression drug prior to administration of the Toxoplasma to the subject.

According to some embodiments of the invention, the method further comprises administering to the subject an immunosuppression drug subsequent to administration of the Toxoplasma to the subject.

According to some embodiments of the invention, the method further comprises administering to the subject an immunosuppression drug concomitantly with administration of the Toxoplasma to the subject.

According to some embodiments of the invention, the pharmaceutical polypeptide comprises a wild type amino acid sequence corresponding to the endogenous protein capable of treating the pathology.

According to some embodiments of the invention, the pharmaceutical polypeptide comprises an antibody capable of treating the pathology.

According to some embodiments of the invention, the pharmaceutical polypeptide comprises an antigen capable of treating the pathology.

According to some embodiments of the invention, the pharmaceutical polypeptide comprises a toxin capable of treating the pathology.

According to some embodiments of the invention, the pharmaceutical polypeptide comprises an enzyme, a structural polypeptide, a motility polypeptide, a regulatory polypeptide, a storage polypeptide, a signaling/ligand polypeptide, a receptor polypeptide, a sensory polypeptide, an antibody, a protein channel and/or a transport polypeptide. According to some embodiments of the invention, administering is performed by peripheral administration.

According to some embodiments of the invention, the peripheral administration comprises intravenous administration.

According to some embodiments of the invention, the peripheral administration comprises oral administration.

According to some embodiments of the invention, administering is performed by direct administration to the central nervous system.

According to some embodiments of the invention, the deficient endogenous protein comprises a deletion, insertion, and/or substitution of at least one amino acid of the endogenous protein as compared to a wild type amino acid sequence of the endogenous protein.

According to some embodiments of the invention, the deficient endogenous protein comprises a reduced level of the endogenous protein as compared to a level of the endogenous protein in a healthy subject devoid of the pathology.

According to some embodiments of the invention, the deficient endogenous protein is absence of the endogenous protein in the subject diagnosed with the pathology.

According to some embodiments of the invention, the pharmaceutical polypeptide is Galactocerebrosidase (GALC).

According to some embodiments of the invention, the pharmaceutical polypeptide is Galactocerebrosidase (GALC) isoform 1, isoform 2, isoform 3, isoform 4 or isoform 5.

According to some embodiments of the invention, the pharmaceutical polypeptide is Methyl-CpG Binding Protein 2 (MECP2) isoform 1 or MECP2 isoform 2.

According to some embodiments of the invention, the pharmaceutical polypeptide is Glial Cell Derived Neurotrophic Factor (GDNF).

According to some embodiments of the invention, the pharmaceutical polypeptide is Glial Cell Derived Neurotrophic Factor (GDNF) isoform 1, isoform 2, isoform 3, isoform 4 or isoform 5. According to some embodiments of the invention, the pharmaceutical polypeptide is Aspartoacylase (ASPA).

According to some embodiments of the invention, the pharmaceutical polypeptide is Survival Motor Neuron Protein (SMN1).

According to some embodiments of the invention, the pharmaceutical polypeptide is Survival Motor Neuron Protein isoform SMN, isoform SMN-delta5, isoform SMN-delta7, or isoform SMN-delta57.

According to some embodiments of the invention, the pharmaceutical polypeptide is Parkin (PARK2).

According to some embodiments of the invention, the pharmaceutical polypeptide is Parkin (PARK2) isoform 1, isoform 2, isoform 3, isoform 4, isoform 5, isoform 6, isoform 7, or isoform 8.

According to some embodiments of the invention, the pharmaceutical polypeptide is Transcription Factor EB (TFEB).

According to some embodiments of the invention, the pharmaceutical polypeptide is Transcription Factor EB (TFEB) isoform 1 or isoform 2.

According to some embodiments of the invention, the pharmaceutical polypeptide is a TALEN (TALE nuclease).

According to some embodiments of the invention, the pharmaceutical polypeptide is a TALE TF (TALE transcription factor).

According to some embodiments of the invention, the subject is diagnosed with Krabbe disease.

According to some embodiments of the invention, the subject is diagnosed with Rett syndrome.

According to some embodiments of the invention, the subject is diagnosed with

Canavan disease.

According to some embodiments of the invention, the subject is diagnosed with Spinal Muscular Atrophy.

According to some embodiments of the invention, the subject is diagnosed with Parkinson's disease.

According to some embodiments of the invention, the subject is diagnosed with hypoxic/ischemic or neuroinflammatory CNS disorder. According to some embodiments of the invention, the subject is diagnosed with Alzheimer's disease.

According to some embodiments of the invention, the subject is diagnosed with Amyotrophic Lateral Sclerosis.

According to some embodiments of the invention, the subject is diagnosed with

Huntington's disease.

According to some embodiments of the invention, the subject is diagnosed with a lysosomal storage disease .

According to some embodiments of the invention, the subject is diagnosed with MECP2-duplication syndrome.

According to some embodiments of the invention, the method further comprising administering to the subject a drug capable of inducing the self-destruction element.

According to some embodiments of the invention, the method further comprising administering to the subject a molecule necessary for sustaining the Toxoplasma inside the host cell and/or body.

According to some embodiments of the invention, the molecule necessary for sustaining the Toxoplasma is an antibiotic.

According to some embodiments of the invention, the molecule necessary for sustaining the Toxoplasma is a small-molecule.

According to some embodiments of the invention, the molecule necessary for sustaining the Toxoplasma is a metabolite.

Unless otherwise defined, all technical and/or scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of embodiments of the invention, exemplary methods and/or materials are described below. In case of conflict, the patent specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and are not intended to be necessarily limiting. BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

Some embodiments of the invention are herein described, by way of example only, with reference to the accompanying drawings. With specific reference now to the drawings in detail, it is stressed that the particulars shown are by way of example and for purposes of illustrative discussion of embodiments of the invention. In this regard, the description taken with the drawings makes apparent to those skilled in the art how embodiments of the invention may be practiced.

In the drawings:

Fig. 1 is a schematic representation of the clinical concept of some embodiments of the invention. (1) The pharmaceutical polypeptide coding sequence of choice is fused to a Toxoplasma gondii secretable polypeptide coding sequence; (2) The nucleic acid construct is introduced into the Toxoplasma; (3) The fusion protein is expressed inside the parasite and localizes to the parasite's secretory organelle (here shown as an example, the rhoptries); (4) The parasite enters the CNS of the subject and reaches the site of pathology; (5) The protein is secreted into the subject's cells and rescues the pathological phenotype.

Fig. 2 depicts a schematic map of the construct used for the generation of therapeutic transgenic T. gondii lines based on the pGRA backbone. The nucleic acid construct comprises an open reading frame (ORF) consisting of a fusion of a therapeutic polypeptide coding sequence in frame with the T. gondii Toxofilin coding sequence and an "HA" tag. Upstream of the ORF is the endogenous 5' UTR (untranslated region) of the Toxofilin gene (Toxofilin 5'-UTR which acts as a promoter), and downstream of the ORF is the 3' UTR of the abundant dense granule protein GRA2 (GRA2 3'-UTR). The construct also contains a selectable marker cassette consisting of the HXGPRT gene, nested between the endogenous 5' UTR and the 3' UTR of DHFR-TS. Also included in the construct is a bacterial expression cassette including selectable antibiotic resistance.

Fig. 3 depicts a schematic map of the construct used for the generation of therapeutic transgenic T. gondii lines based on the pGRA backbone. The nucleic acid construct comprises an ORF consisting of a fusion of a therapeutic polypeptide coding sequence in frame with the T. gondii GRA16 coding sequence and an HA tag. Upstream of the ORF is the endogenous 5' UTR of the GRA16 gene (GRA16 5'-UTR which acts as a promoter), and downstream of the ORF is the 3' UTR of the abundant dense granule protein GRA2 (GRA2 3'-UTR). The construct also contains a selectable marker cassette consisted of the HXGPRT gene, nested between the endogenous 5'-UTR and the 3'- UTR of DHFR-TS. Also included in the construct is a bacterial expression cassette including selectable antibiotic resistance.

Figs. 4A-I depict novel parasite strains within mammalian cells (HFF), wherein the parasite strains expressing the Toxofilin-fused therapeutic proteins Aspartoacylase (ASPA), Survival motor neuron protein (SMN1), Methyl-CpG Binding Protein 2 (MECP2) and Galactocerebrosidase-TAT Δ43 (also referred to as "mutated GALC- TAT" in the GENERAL MATERIALS AND EXPERIMENTAL METHODS section), presenting specific localization to the parasites' secretory rhoptry organelles. Figure 4 A - Schematic structure of an intracellular T. gondii, highlighting the rhoptries. Red: inner membrane complex (IMC), Blue: DNA (host cell nucleus and T. gondii nucleus), Green: rhoptry proteins; Figures 4B-I - Fluorescence microscopy analysis of parasites grown on HFF cells, expressing endogenously HA-tagged Toxofilin-ASPA (Figures 4B and AC), HA-tagged Toxofilin-SMNl (Figures 4D and 4E), HA-tagged Toxofilin- MECP2 (Figures 4F and 4G) and HA-tagged Toxofilin-GALC-TAT mutated (Figures 4H and 41) using anti-HA antibody (green in all panels), co-stained with the inner membrane complex EVICl using anti-EVICl antibody (red) and DAPI (blue), or shown on top of a polarized light image of the cells (grayscale). Parasites are shown in a mixed population, whereas only parasites showing green staining express the transgenic proteins. Scale bar = 5 μΜ in all of the images shown in Figures 4B-I.

Figs. 5A-I depict novel parasite strains within mammalian cells (HFF), wherein the parasite strains expressing the GRA16-fused therapeutic proteins Aspartoacylase (ASPA), Survival motor neuron protein (SMN1) and Methyl-CpG Binding Protein 2 (MECP2), presenting specific localization to the parasitophorous vacuole (PV) space, as well as to the host cell nucleus. Figure 5 A - Schematic representation of intracellular T. gondii parasites in a parasitophorous vacuole inside a host cell (fibroblast), highlighting the distribution of secreted dense granule effector proteins. Red: inner membrane complex (IMC), blue: DNA (host cell nucleus and T. gondii nuclei), yellow: dense granule secreted effector proteins, orange: parasitophorous vacuole. Figures 5B-I - Fluorescence microscopy analysis of parasites grown on HFF cells, expressing endogenously HA-tagged GRA16-ASPA (Figures 5B and 5C), HA-tagged GRA16- SMN1 (Figures 5D and 5E) and HA-tagged GRA16-MECP2 (Figures 5B-G in lOOx magnification and Figures 5H-I in 40x magnification) using anti-HA antibody (green in all panels), co-stained with the inner membrane complex IMC1 using anti-IMCl antibody (red) and DAPI (blue), or shown on top of a polarized light image of the cells. Parasites are shown in a mixed population, whereas only parasites showing green staining express the transgenic proteins. Scale bar = 5 μΜ (for Figures 5B-G). Scale bar = 20 μΜ (for Figures 5H-I).

Figs. 6A-B are schematic illustrations of a TALE Nuclease (Figure 6A) and TALE transcription factor (Figure 6B) which are designed, as example, to the "TACGTACG" (SEQ ID NO: 4505) target sequence. The TALE repeats are presented according to the order of the nucleotides in the target sequence. It is noted that because of the nature of TALEs, the first nucleotide always has to be T (so it is integrated in the TAL N-Terminus and is not annotated), and the last nucleotide (in this case G) is represented in a half-monomer (so it is annotated as "Half repeat"). In order to be expressed and secreted by the Toxoplasma of some embodiments of the invention, the open reading frame (ORF) of these constructs is inserted to the nucleic acid construct of some embodiments of the invention. For the TALE_Nuc (TALE nuclease) the ORF is from the NLS until after the Fokl (e.g., nucleotides 2113-4104 of the polynucleotide set forth by SEQ ID NO: 4506). For the TALE-TF (TALE Transcription Factor) the ORF is from the TALE N terminus (N-term) after the EGFP (e.g., nucleotides 2120-5005 of the polynucleotide set forth by SEQ ID NO: 4507). "TALE-TF" = TALE transcription factor; "TALEN" = TALE nuclease. "NI", "NG", "NN" and "HD" = monomers as described in Example 5 hereinbelow.

Fig. 7 depicts a schematic map of the constructs used for the generation of the therapeutic transgenic T. gondii lines. The nucleic acid construct comprises an open reading frame (ORF) consisting of a fusion of a therapeutic polypeptide coding sequence in frame with the T. gondii Toxofilin coding sequence and an "HA" tag. Upstream of the ORF is the endogenous 5' UTR (untranslated region) of the Toxofilin gene (the "Toxofilin 5'-UTR"), and downstream of the ORF is the 3' UTR of the abundant dense granule protein GRA2 ("GRA2 3'-UTR"). The construct also contains a selectable marker cassette consisting of the HXGPRT gene, DHFR-TS gene or mCherry gene, nested between the endogenous 5' UTR and the 3' UTR of DHFR-TS. Also included in the construct is a bacterial expression cassette including selectable antibiotic resistance, used for molecular cloning.

Fig. 8 depicts a schematic map of the construct used for the generation of the therapeutic transgenic T. gondii lines. The nucleic acid construct comprises an ORF consisting of a fusion of a therapeutic polypeptide coding sequence in frame with the T. gondii GRA16 coding sequence and an HA tag. Upstream of the ORF is the endogenous 5' UTR of the GRA16 gene (the "GRA16 5'-UTR"), and downstream of the ORF is the 3' UTR of the abundant dense granule protein GRA2 ("GRA2 3 -UTR"). The construct also contains a selectable marker cassette consisted of the HXGPRT gene, DHFR-TS gene or mCherry gene, nested between the endogenous 5' UTR and the 3' UTR of DHFR-TS. Also included in the construct is a bacterial expression cassette including selectable antibiotic resistance, used for molecular cloning.

Figs. 9A-N depict images of transgenic parasites within mammalian cells (HFF), wherein the parasite strains expressing 12 novel Toxofilin-fused therapeutic proteins. Figure 9A- Schematic structure of an intracellular T. gondii, highlighting the rhoptries. Magenta: inner membrane complex (IMC), cyan: DNA (host cell nucleus and T. gondii nucleus), yellow: rhoptry proteins; Figure 9B- Representative fluorescence microscopy image of T. gondii grown in HFF cells, immunostained for the rhoptry marker ROP2/4 (yellow). Left: co-stained with DAPI (cyan), right: overlaid on a polarized-light image (grayscale). Figures 9C-N- images of transgenic parasites expressing the HA-tagged Toxofilin-fused therapeutic proteins: Figure 9C - Aspartoacylase (ASPA), Figure 9D - Aspartoacylase codon-optimized (ASPAopt), Figure 9E - Galactocerebrosidase (GALC), Figure 9F - Galactocerebrosidase codon-optimized (GALCopt), Figure 9G - Galactocerebrosidase-TAT (GALC-TAT), Figure 9H - Galactocerebrosidase-TAT Δ43 (GALC-TAT Δ43- also referred to as "mutated GALC-TAT" in the GENERAL MATERIALS AND EXPERIMENTAL METHODS section), Figure 91 - Glial Cell Derived Neurotrophic Factor (GDNF), Figure 9J - Methyl-CpG Binding Protein 2 (MECP2), Figure 9K - Methyl-CpG Binding Protein 2 codon-optimized (MECP2opt), Figure 9L - Parkin (PARK2), Figure 9M - Survival motor neuron protein (SMN1) and Figure 9N - Transcription Factor EB codon-optimized (TFEBopt). Example images of the parasites expressing Toxofilin- ASPAopt, Toxofilin-GALC-TAT Δ43, Toxofilin- GDNF, Toxofilin-MeCP2opt, Toxofilin-PARK2, Toxofilin- SMN1 and Toxofilin-TFEB show localization to the parasites' secretory rhoptry organelles. Toxofilin fusion protein are immunostained using anti-HA antibody (yellow). Left: co-stained with DAPI (cyan) and the parasite marker anti-EVIC-l (magenta), right: overlaid on a polarized-light image (grayscale). Parasites are shown in a mixed population, whereas only parasites showing yellow staining express the transgenic proteins. Scale bars = 5 μΜ.

Figs. 10A-J depict images of transgenic parasites within mammalian cells (HFF), wherein the parasite strains expressing 8 novel GRA16-fused therapeutic proteins. Figure 10A- Schematic representation of intracellular T. gondii parasites in a parasitophorous vacuole inside a host cell, highlighting the distribution of secreted dense granule effector proteins. Magenta: inner membrane complex (IMC), cyan: DNA (host cell nucleus and T. gondii nuclei), yellow: dense granule secreted effector proteins, orange: parasitophorous vacuole; Figure 10B- Representative fluorescence microscopy image of T. gondii grown in HFF cells, expressing an HA-tagged GRA16 protein and immunostained with anti-HA (yellow). Left: co-stained with DAPI (cyan), right: overlaid on a polarized-light image (grayscale). Figures 10C-J - Example images of transgenic parasites expressing the HA-tagged GRA16-fused therapeutic proteins: Figure IOC- Aspartoacylase (ASPA), Figure 10D - Survival motor neuron protein (SMN1), Figure 10E- Galactocerebrosidase (GALC), Figure 10F- Galactocerebrosidase-TAT (GALC-TAT), Figure 10G- Aspartoacylase codon- optimized (ASPAopt). Figure 10H- Galactocerebrosidase codon-optimized (GALCopt), Figure 101 - Methyl-CpG Binding Protein 2 codon-optimized (MECP2opt) and Figure 10J - Transcription Factor EB codon-optimized (TFEBopt). GRA16-ASPA, GRA16- SMN1, GRA16-ASPAopt, GRA16-MECP2opt and GRA16-TFEBopt show localization to the parasitophorous vacuole. GRA16-MECP2opt and GRA16-TFEBopt show also localization to the host cell nucleus. GRA16 fusion protein are immunostained using anti-HA antibody (yellow). Left: co- stained with DAPI (cyan) and the parasite marker anti-IMC-1 (magenta), right: overlaid on a polarized-light image (grayscale). Parasites are shown in a mixed population, whereas only parasites showing yellow staining express the transgenic proteins. Scale bars = 5 μΜ.

Fig. 11A-C- Dynamics of protein delivery to the nuclei of HFF cells by tachyzoites of the strains RH GRA16-HAstop, GRA16-MECP2opt and GRA16- TFEBopt, over time and multiplicity of infection (MOI). Infected cells and nuclei delivery were counted by automated image analysis using GE IN Cell Investigator software (GE healthcare, Chicago, IL, USA). Graphs represent aggregated results from the 3 strains together.

Figs. 12A-C- Representative images of in vitro -differentiated LUHMES human neuronal cells 16-22 hours after infection with tachyzoites of the T. gondii strains RH GRA16-HAstop (Figure 12A), GRA16-MECP2opt (Figure 12B) and GRA16-TFEBopt (Figure 12C). GRA16 fusion protein are immunostained using an anti-HA antibody (yellow) and co-stained with DAPI (cyan) and the marker for mature neurons anti-NeuN (magenta). Insets on the left of each figure show the infected cells visualized with anti- HA only (top, yellow), anti-HA and DAPI (middle, yellow and cyan) and anti-HA and NeuN (bottom, yellow and magenta). All strains show clear secretion and targeting of the fusion proteins to the nuclei of the human neurons.

Figs. 13A-D- Representative images of immunohistochemically stained neuron- enriched primary cultures from the cortices and hippocampi of PI mice pups, 12 hours after infection with tachyzoites of the transgenic line RH GRA16-MECP2opt. The GRA16-MeCP2 fusion protein is immunostained using an anti-HA antibody (yellow). Figure 13A - GRA16-MeCP2 alone (yellow). Figure 13B - DAPI alone (cyan). Figure 13C - merge of GRA16-MeCP2 and DAPI staining. Figure 13D - merge of GRA16- MeCP2 (yellow), DAPI (cyan) and NeuN (magenta) overlaid on a polarized-light image (grayscale). The delivered GRA16-MeCP2opt demonstrates a characteristic pattern of co-localization with areas of condensed, heterochromatic DNA in the nuclei of the neuron, suggesting effective binding of the delivered MeCP2 to heterochromatin.

Fig. 14 is a Western blot of nuclear extracts of R306C MeCP2-mutant human LUHMES neurons infected with an MOI=l of the RH GRA16-MECP2opt transgenic T. gondii strain, immunoprecipitated and blotted with a MeCP2- specific antibody. The blot shows two bands corresponding to the endogenous mutated MeCP2 and the T. gondii- delivered MeCP2. The T. gondii-delivered MeCP2 has higher molecular weight due to the fusion to GRA16.

Figs. 15A-C show example images of in vitro differentiated bradyzoites of the transgenic T. gondii strain Pru GRA16-MECP2opt in HFF cells. The GRA16-MeCP2 fusion protein is immunostained using an anti-HA antibody (red). Figure 15A - GRA16-MeCP2 alone. Figure 15B - co-stained with the bradyzoite cyst-wall marker Dolichos Biflorus Agglutinin (DBA) (green) and DAPI (blue) showing in green the parasite cyst and blue the nuclei of the host fibroblast cells (HFF) with the HA-stained (red) GRA16-MeCP2 fusion protein within the nucleus of the host fibroblast cell containing the cyst. Figure 15C - co-stained with DBA and overlaid on a polarized-light image (grayscale). The presented cyst shows continuous expression of GRA16-MeCP2, secretion and targeting of the fusion protein to the nucleus of the host cell at the bradyzoites stage. White arrowheads point on the nucleus of the cell containing the bradyzoite cyst, which contains the delivered protein GRA16-MeCP2.

DESCRIPTION OF SPECIFIC EMBODIMENTS OF THE INVENTION

The present invention, in some embodiments thereof, relates to a nucleic acid construct for secretion of Toxoplasma secreted protein fused to a pharmaceutical polypeptide, and Toxoplasma comprising same, and, more particularly, but not exclusively, to a pharmaceutical compositions and methods of using same for treating a subject.

Before explaining at least one embodiment of the invention in detail, it is to be understood that the invention is not necessarily limited in its application to the details set forth in the following description or exemplified by the Examples. The invention is capable of other embodiments or of being practiced or carried out in various ways.

The present inventors have uncovered that Toxoplasma parasites transformed with genetic constructs encoding for a heterologous polypeptide synthesize and deliver the heterologous polypeptide to mammalian cells (Figures 4B-I, 5B-I, 9B-N, 10B-J, 11A-C, 12A-C, 13A-D, 14 and 15A-C and the Examples section which follows). Inside the mammalian cells, the heterologous polypeptide shuttles to its region of activity in the cell where it is able to augment cellular processes related to the known functions of the endogenous pharmaceutical protein. For example, the present inventors generated Type I and Type II Toxoplasma strains which express and deliver the mammalian proteins MeCP2 and TFEB into mice and human cells, translationally fused to the Toxoplasma secreted protein GRA16, and secreting the fused therapeutic protein into the desired cellular localization (nuclei) within mammalian cells treated ex-vivo with the Toxoplasma (Figures 13A-D), thus proving the feasibility of the constructs, Toxoplasma and methods for specific delivery of therapeutic polypeptides into a subject and treating the subject.

According to an aspect to some embodiments of the invention, there is provided a nucleic acid construct comprising a heterologous polynucleotide comprising a first nucleic acid sequence encoding a Toxoplasma secreted protein in frame fused upstream to a second nucleic acid sequence encoding a pharmaceutical polypeptide, wherein the heterologous polynucleotide is operably linked to a promoter for directing transcription of the heterologous polynucleotide in a Toxoplasma, wherein the promoter is selected from the group consisting of: a constitutive promoter, an inducible promoter, a latent period- specific promoter, and a Toxoplasma endogenous promoter with the proviso that the promoter is not a Toxofilin promoter.

According to some embodiments of the invention, the nucleic acid construct is suitable for expression in a Toxoplasma.

According to some embodiments of the invention, the nucleic acid construct does not comprise a Cre-recombinase coding sequence.

According to some embodiments of the invention, the nucleic acid construct does not comprise a beta ( )-lactamase (BLA) coding sequence.

According to some embodiments of the invention, the nucleic acid construct is suitable for integration into a genome of Toxoplasma.

According to some embodiments of the invention, the Toxoplasma is devoid of elements which facilitate propagation of the Toxoplasma in a host cell (e.g., elements important for evading the immune system of the host, elements essential for the production or utilization of certain metabolites, elements important for counteracting certain toxins or antibiotics), yet including an endogenous functional CPSII.

According to some embodiments of the invention, the Toxoplasma is devoid of virulence genes which are not necessary for delivery of the protein-of-interest into a CNS of a subject.

The nucleic acid construct includes a promoter sequence for directing transcription of the polynucleotide sequence in the Toxoplasma cell in a constitutive, transient, regulated or inducible manner.

As mentioned, the heterologous polynucleotide is operably linked to a promoter for directing transcription of the heterologous polynucleotide in a Toxoplasma. A coding nucleic acid sequence is "operably linked" to a regulatory sequence (e.g., promoter) if the regulatory sequence is capable of exerting a regulatory effect on the coding sequence linked thereto.

As used herein, the term "promoter" refers to a region of DNA which lies upstream of the transcriptional initiation site of a gene to which RNA polymerase binds to initiate transcription of RNA. The promoter controls the timing and the intensity of expression of the gene, e.g., at which stage or condition in the lifetime of a parasite and/or of a host cell the gene is expressed.

According to some embodiments of the invention, the promoter is heterologous to the Toxoplasma used for expression of the nucleic acid construct.

According to some embodiments of the invention, the promoter is a constitutive promoter.

According to some embodiments of the invention, the promoter is an inducible promoter.

According to some embodiments of the invention, the promoter is a latent period- specific promoter.

According to some embodiments of the invention, the promoter is a Toxoplasma endogenous promoter with the proviso that the promoter is not a Toxofilin promoter.

According to some embodiments of the invention the promoter is not a ToxofiUin endogenous promoter. It is noted that the predicted promoter of ToxofiUin (predicted promoter of gene ID=TGME49_214080; SEQ ID NO:3689) contains the Toxofilin promoter used in some of the experiments of the Examples section (SEQ ID NO: 4482) and additional nucleotides upstream.

According to some embodiments the promoter does not comprise the nucleic acid sequence set forth by SEQ ID NO: 3689 and/or SEQ ID NO: 4482.

According to some embodiments of the invention, the endogenous promoter is not of a rhoptry protein.

According to some embodiments of the invention, the Toxoplasma endogenous promoter is the GRA2 promoter, GRA16 promoter, GRA24 promoter, SAG1 promoter, or the DHFR promoter.

According to some embodiments of the invention, the Toxoplasma endogenous promoter is the GRA2 promoter. According to some embodiments of the invention, the Toxoplasma endogenous promoter is the GRA16 promoter.

According to some embodiments of the invention, the Toxoplasma endogenous promoter is the GRA24 promoter.

According to some embodiments of the invention, the Toxoplasma endogenous promoter is the SAG1 promoter.

According to some embodiments of the invention, the Toxoplasma endogenous promoter is the DHFR promoter.

Table 1 herein below lists suitable endogenous promoters which can be cloned into the nucleic acid construct of some embodiments of the invention in order to drive expression of the heterologous polynucleotide in the Toxoplasma.

Table 1

Suitable endogenous promoters

starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49 . _chr 601871 ID=TGME49_278580;length=32

XII 4 6021975 - 61 467

TGME49 . _chr 464581 ID=TGME49_315320;length= 18

XI 6 4647664 + 48 468

TGME49 . _chr 597572 ID=TGME49_216670;length=41

XI 5 5979863 - 38 469

TGME49 . _chr ID=TGME49_293850;length=50

Ia 602877 607895 + 18 470

TGME49 . _chr 155162 ID=TGME49_205190;length=33

Vila 6 1554994 + 68 471

TGME49 . _chr ID=TGME49_263420;length=20

Vllb 561320 563366 + 46 472

TGME49 . _chr 279516 ID=TGME49_246485;length=39

XII 2 2799102 + 40 473

TGME49 . _chr ID=TGME49_208850;length=39

Ib 904562 908487 + 25 474

TGME49 . _chr 444932 ID=TGME49_271610;length= 19

VIII 9 4451277 - 48 475

TGME49 . _chr 304478 ID=TGME49_274060;length=28

VIII 8 3047633 + 45 476

TGME49 . _chr 361219 ID=TGME49_258060;length= 19

Vllb 5 3614194 + 99 477

TGME49 . _chr 654961 ID=TGME49_277730;length=20

XII 3 6551657 + 44 478

TGME49 . _chr ID=TGME49_229350;length=19

VIII 197364 199295 - 31 479 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 514331 ID=TGME49_316130;length=25

XI 9 5145850 + 31 480

TGME49_chr 247130 ID=TGME49_203490;length= 16

Vila 2 2472989 + 87 481

TGME49_chr 111776 ID=TGME49_213530;length=37

V 1 1121481 + 20 482

TGME49_chr 564098 ID=TGME49_251880;length= 14

XII 3 5642416 - 33 483

TGME49_chr 112092 ID=TGME49_209210;length=29

lb 6 1123894 + 68 484

TGME49_chr 499601 ID=TGME49_270790;length= 19

VIII 8 4997934 - 16 485

TGME49_chr 426647 ID=TGME49_271960;length=44

VIII 7 4270885 - 08 486

TGME49_chr 725098 ID=TGME49_275430;length=32

X 6 7254222 + 36 487

TGME49_chr 393313 ID=TGME49_257640;length= 14

Vllb 2 3934570 + 38 488

TGME49_chr 120704 ID=TGME49_209410;length=42

lb 4 1211342 - 98 489

TGME49_chr 384933 ID=TGME49_257750;length=41

Vllb 7 3853482 + 45 490

TGME49_chr ID=TGME49_230230;length= 17

VIII 706425 708192 + 67 491

TGME49_chr 210869 ID=TGME49_297920;length=41

II 7 2112820 + 23 492

TGME49_chr 367810 ID=TGME49_290270;length= 12

IX 5 3679325 - 20 493

TGME49_chr 246954 ID=TGME49_285272;length=19

V 9 2471470 - 21 494

TGME49_chr 219752 ID=TGME49_260500;length=29

Vllb 2 2200500 + 78 495

TGME49_chr 450076 ID=TGME49_249370;length= 17

XII 6 4502482 - 16 496

TGME49_chr 231352 ID=TGME49_203705 ;length=24

Vila 1 2315991 - 70 497

TGME49_chr 499361 ID=TGME49_255215 ;length=50

Vllb 8 4998625 + 07 498

TGME49_chr 210025 ID=TGME49_297900;length=35

II 7 2103772 + 15 499

TGME49_chr 279911 ID=TGME49_203135;length=42

Vila 3 2803405 - 92 500

TGME49_chr 138964 ID=TGME49_265990;length= 18

IX 4 1391469 - 25 501

TGME49_chr 518297 ID=TGME49_305030;length=24

IX 4 5185461 - 87 502 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_229140;length=22

VIII 50367 52586 - 19 503

TGME49_chr 307200 ID=TGME49_224350;length=21

X 6 3074113 - 07 504

TGME49_chr 320314 ID=TGME49_247260;length=27

XII 0 3205868 - 28 505

TGME49_chr 131650 ID=TGME49_240550;length=17

VI 2 1318224 + 22 506

TGME49_chr 490383 ID=TGME49_255310;length=49

Vllb 8 4908769 + 31 507

TGME49_chr ID=TGME49_304760;length=40

Vila 689060 693095 + 35 508

TGME49_chr 150896 ID=TGME49_231625 ;length=47

VIII 0 1513678 - 18 509

TGME49_chr 190321 ID=TGME49_217555 ;length=47

XII 6 1907934 + 18 510

TGME49_chr 609657 ID=TGME49_216530;length=21

XI 6 6098700 - 24 511

TGME49_chr 430075 ID=TGME49_249010;length=22

XII 3 4303035 - 82 512

TGME49_chr 529885 ID=TGME49_270270;length=20

VIII 4 5300947 + 93 513

TGME49_chr 218402 ID=TGME49_232600;length=26

VIII 3 2186702 + 79 514

TGME49_chr 143298 ID=TGME49_318300;length=20

IV 4 1435042 - 58 515

TGME49_chr 570824 ID=TGME49_305990;length=27

IX 3 5710982 + 39 516

TGME49_chr 268926 ID=TGME49_224910;length=27

X 2 2692060 + 98 517

TGME49_chr ID=TGME49_266960;length=33

IX 871380 874729 - 49 518

TGME49_chr 158049 ID=TGME49_265390;length=34

IX 8 1583961 + 63 519

TGME49_chr 110743 ID=TGME49_240250;length=50

VI 3 1112442 + 09 520

TGME49_chr 461860 ID=TGME49_315270;length=24

XI 2 4621016 - 14 521

TGME49_chr 123894 ID=TGME49_266290;length=65

IX 1 1239596 - 5 522

TGME49_chr ID=TGME49_207440;length=42

lb 65733 70002 - 69 523

TGME49_chr 107838 ID=TGME49_206300;length=32

Vila 9 1081603 - 14 524

TGME49_chr 162884 ID=TGME49_231910;length= 17

VIII 8 1630628 + 80 525

TGME49_chr 556012 557885 + ID=TGME49_293780;length=18 526 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO: la 73

TGME49_chr 128648 ID=TGME49_294960;length=23

la 1 1288807 + 26 527

TGME49_chr 209872 ID=TGME49_217760;length=23

XII 0 2101108 - 88 528

TGME49_chr 335658 ID=TGME49_289620;length=22

IX 0 3358813 - 33 529

TGME49_chr 643221 ID=TGME49_277920;length= 14

XII 0 6433693 - 83 530

TGME49_chr 144083 ID=TGME49_265810;length=25

IX 1 1443406 + 75 531

TGME49_chr 250506 ID=TGME49_260150;length= 15

Vllb 8 2506652 - 84 532

TGME49_chr 251349 ID=TGME49_203400;length= 17

Vila 0 2515244 + 54 533

TGME49_chr 433878 ID=TGME49_291680;length=30

IX 6 4341883 - 97 534

TGME49_chr 184420 ID=TGME49_232085 ;length= 15

VIII 3 1845704 + 01 535

TGME49_chr 430329 ID=TGME49_314800;length=38

XI 3 4303677 + 4 536

TGME49_chr 394995 ID=TGME49_201240;length=29

Vila 7 3952927 - 70 537

TGME49_chr 106138 ID=TGME49_294680;length=64

la 3 1062031 - 8 538

TGME49_chr 509059 ID=TGME49_315950;length=40

XI 3 5094613 + 20 539

TGME49_chr 244116 ID=TGME49_203540;length=50

Vila 6 2446197 - 31 540

TGME49_chr 501264 ID=TGME49_307605 ;length=27

IX 8 5015438 - 90 541

TGME49_chr 390630 ID=TGME49_257690;length=41

Vllb 4 3910432 - 28 542

TGME49_chr 365273 ID=TGME49_247980;length=27

XII 0 3655440 - 10 543

TGME49_chr 673256 ID=TGME49_267990;length=49

VIII 8 6737516 + 48 544

TGME49_chr ID=TGME49_267790;length= 14

IX 362887 364364 - 77 545

TGME49_chr 219744 ID=TGME49_285810;length=47

V 0 2202168 + 28 546

TGME49_chr 736001 ID=TGME49_207050;length=29

X 5 7363002 - 87 547

TGME49_chr 107385 ID=TGME49_209150;length=25

lb 6 1076422 + 66 548

TGME49_chr 274709 ID=TGME49_259630;length=17

Vllb 4 2748845 + 51 549 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_207350;length=20

lb 4351 6355 - 04 550

TGME49_chr 289909 ID=TGME49_224660;length=25

X 2 2901603 + 11 551

TGME49_chr ID=TGME49_227970;length=25

X 697594 700161 + 67 552

TGME49_chr 206962 ID=TGME49_311430;length= 12

XI 2 2070833 - 11 553

TGME49_chr 297580 ID=TGME49_274150;length=21

VIII 8 2977909 + 01 554

TGME49_chr 163376 ID=TGME49_261600;length=21

Vllb 4 1635885 + 21 555

TGME49_chr 477892 ID=TGME49_210450;length=47

IX 3 4783666 - 43 556

TGME49_chr ID=TGME49_300180;length= 18

XII 170916 172799 - 83 557

TGME49_chr ID=TGME49_239760;length=48

VI 903242 908079 - 37 558

TGME49_chr 525691 ID=TGME49_250840;length=37

XII 3 5260639 + 26 559

TGME49_chr 260946 ID=TGME49_224980;length=28

X 7 2612329 + 62 560

TGME49_chr 580629 ID=TGME49_237250;length=20

X 5 5808296 + 01 561

TGME49_chr ID=TGME49_239320;length=42

VI 548569 552854 - 85 562

TGME49_chr 145601 ID=TGME49_261960;length=21

Vllb 4 1458169 + 55 563

TGME49_chr ID=TGME49_219590;length= 16

XII 838987 840604 + 17 564

TGME49_chr ID=TGME49_267530;length=24

IX 600361 602779 - 18 565

TGME49_chr 307235 ID=TGME49_283550;length=23

V 6 3074734 + 78 566

TGME49_chr 165370 ID=TGME49_211630;length=36

IV 2 1657375 + 73 567

TGME49_chr 634357 ID=TGME49_216190;length=46

XI 5 6348246 + 71 568

TGME49_chr 383564 ID=TGME49_248400;length= 16

XII 7 3837281 - 34 569

TGME49_chr ID=TGME49_239400;length=33

VI 613254 616583 + 29 570

TGME49_chr 286419 ID=TGME49_224710;length= 17

X 2 2865912 - 20 571

TGME49_chr ID=TGME49_221500;length= 19

II 404894 406860 - 66 572

TGME49_chr 636407 6366025 + ID=TGME49_268685;length=19 573 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 9 46

TGME49_chr 341308 ID=TGME49_289730;length=47

IX 2 3417820 - 38 574

TGME49_chr 441154 ID=TGME49_291940;length= 19

IX 8 4413465 + 17 575

TGME49_chr 739515 ID=TGME49_207100;length=30

X 7 7398234 - 77 576

TGME49_chr 424518 ID=TGME49_291330;length= 16

IX 7 4246860 + 73 577

TGME49_chr 142207 ID=TGME49_213910;length=33

V 0 1425388 + 18 578

TGME49_chr 376679 ID=TGME49_290605 ;length= 12

IX 4 3768071 - 77 579

TGME49_chr 428335 ID=TGME49_234280;length= 19

X 2 4285324 - 72 580

TGME49_chr 695646 ID=TGME49_215430;length= 18

X 2 6958294 - 32 581

TGME49_chr ID=TGME49_220175 ;length= 18

V 186946 188803 + 57 582

TGME49_chr 271977 ID=TGME49_246182;length=46

XII 0 2724408 - 38 583

TGME49_chr 727312 ID=TGME49_275440;length=23

X 7 7275437 - 10 584

TGME49_chr 225013 ID=TGME49_232760;length= 14

VIII 3 2251577 + 44 585

TGME49_chr 211323 ID=TGME49_264680;length=14

IX 5 2114688 - 53 586

TGME49_chr 150362 ID=TGME49_209880;length=43

lb 4 1507993 + 69 587

TGME49_chr 165392 ID=TGME49_205050;length=29

Vila 0 1656907 - 87 588

TGME49_chr 161412 ID=TGME49_231880;length=35

VIII 2 1617638 - 16 589

TGME49_chr 682303 ID=TGME49_200360;length= 15

VIII 5 6824629 + 94 590

TGME49_chr ID=TGME49_320290;length=39

IV 392225 396143 + 18 591

TGME49_chr 172505 ID=TGME49_311010;length= 19

XI 1 1727020 - 69 592

TGME49_chr 157081 ID=TGME49_265410;length=24

IX 3 1573290 + 77 593

TGME49_chr ID=TGME49_320550;length=49

IV 258575 263510 + 35 594

TGME49_chr 343262 ID=TGME49_258410;length=24

Vllb 6 3435118 + 92 595

TGME49_chr 443347 ID=TGME49_234510;length=24

X 1 4435962 - 91 596 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 469568 ID=TGME49_315410;length=36

XI 7 4699361 - 74 597

TGME49_chr 194415 ID=TGME49_225905 ;length=30

X 3 1947194 + 41 598

TGME49_chr 207287 ID=TGME49_260670;length=20

Vllb 0 2074965 - 95 599

TGME49_chr 265195 ID=TGME49_246090;length=37

XII 8 2655687 + 29 600

TGME49_chr 122549 ID=TGME49_205662;length=14

Vila 0 1226911 + 21 601

TGME49_chr ID=TGME49_304500;length=34

Vila 524618 528076 + 58 602

TGME49_chr 290167 ID=TGME49_233925 ;length=21

VIII 1 2903828 - 57 603

TGME49_chr 471537 ID=TGME49_271100;length= 18

VIII 3 4717272 + 99 604

TGME49_chr 475083 ID=TGME49_315480;length= 19

XI 6 4752746 - 10 605

TGME49_chr 342895 ID=TGME49_223960;length=25

X 4 3431481 - 27 606

TGME49_chr 119535 ID=TGME49_266368;length=21

IX 2 1197540 - 88 607

TGME49_chr 179874 ID=TGME49_297465 ;length= 15

II 2 1800245 + 03 608

TGME49_chr 607760 ID=TGME49_306430;length=33

IX 2 6080932 - 30 609

TGME49_chr ID=TGME49_206660;length=40

Vila 740498 744501 + 03 610

TGME49_chr 313567 ID=TGME49_224280;length=29

X 1 3138592 - 21 611

TGME49_chr 355374 ID=TGME49_258140;length= 11

Vllb 1 3554846 - 05 612

TGME49_chr 535274 ID=TGME49_305315;length=98

IX 8 5353735 - 7 613

TGME49_chr 469808 ID=TGME49_271115;length=39

VIII 7 4702028 - 41 614

TGME49_chr ID=TGME49_267060;length=43

IX 793986 798309 + 23 615

TGME49_chr 320259 ID=TGME49_258760;length= 13

Vllb 0 3203934 + 44 616

TGME49_chr ID=TGME49_228350;length=19

X 379796 381741 + 45 617

TGME49_chr 463958 ID=TGME49_235150;length=22

X 0 4641831 + 51 618

TGME49_chr 345871 ID=TGME49_289800;length=25

IX 3 3461263 + 50 619

TGME49_chr 250025 2501767 - ID=TGME49_203450;length= 15 620 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

Vila 5 12

TGME49_chr 469211 ID=TGME49_249620;length=24

XII 2 4694534 + 22 621

TGME49_chr 435112 ID=TGME49_271810;length=30

VIII 6 4354140 - 14 622

TGME49_chr 130517 ID=TGME49_205570;length=26

Vila 3 1307774 - 01 623

TGME49_chr 571006 ID=TGME49_217040;length= 12

XI 2 5711311 + 49 624

TGME49_chr ID=TGME49_294390;length=30

la 876889 879930 - 41 625

TGME49_chr 397518 ID=TGME49_201200;length=20

Vila 2 3977201 + 19 626

TGME49_chr 287008 ID=TGME49_259220;length= 14

Vllb 4 2871524 - 40 627

TGME49_chr 108434 ID=TGME49_240230;length=42

VI 8 1088618 + 70 628

TGME49_chr 338572 ID=TGME49_258490;length= 17

Vllb 1 3387496 + 75 629

TGME49_chr ID=TGME49_263480;length=34

Vllb 528457 531919 + 62 630

TGME49_chr ID=TGME49_222020;length=43

II 757384 761699 + 15 631

TGME49_chr 135828 ID=TGME49_231350;length=40

VIII 9 1362306 - 17 632

TGME49_chr 280216 ID=TGME49_288900;length=81

IX 2 2802980 - 8 633

TGME49_chr 246758 ID=TGME49_225130;length=22

X 9 2469875 - 86 634

TGME49_chr 225166 ID=TGME49_260460;length=90

Vllb 6 2252571 - 5 635

TGME49_chr 209148 ID=TGME49_203950;length=42

Vila 6 2095749 + 63 636

TGME49_chr 605865 ID=TGME49_306410;length=23

IX 7 6060999 + 42 637

TGME49_chr ID=TGME49_220200;length= 19

V 198241 200161 + 20 638

TGME49_chr 135674 ID=TGME49_222980;length=31

II 1 1359871 - 30 639

TGME49_chr 220650 ID=TGME49_225490;length=30

X 5 2209526 + 21 640

TGME49_chr 209682 ID=TGME49_217770;length=26

XII 8 2099495 + 67 641

TGME49_chr 297001 ID=TGME49_202870;length= 15

Vila 4 2971521 + 07 642

TGME49_chr ID=TGME49_263800;length= 17

Vllb 322918 324659 - 41 643 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 295399 ID=TGME49_259140;length= 13

Vllb 1 2955320 - 29 644

TGME49_chr 146944 ID=TGME49_205320;length=41

Vila 9 1473570 + 21 645

TGME49_chr ID=TGME49_309060;length=25

XI 461912 464423 + 11 646

TGME49_chr 122409 ID=TGME49_266320;length= 14

IX 7 1225515 - 18 647

TGME49_chr 214203 ID=TGME49_297970;length=42

II 7 2146242 + 05 648

TGME49_chr 104259 ID=TGME49_240070;length=25

VI 4 1045138 - 44 649

TGME49_chr 130908 ID=TGME49_213770;length=22

V 6 1311333 - 47 650

TGME49_chr 102344 ID=TGME49_222305 ;length= 17

II 7 1025166 - 19 651

TGME49_chr 150041 ID=TGME49_310640;length=38

XI 7 1504291 - 74 652

TGME49_chr 437837 ID=TGME49_314890;length=65

XI 8 4379028 + 0 653

TGME49_chr ID=TGME49_227910;length=20

X 738524 740600 + 76 654

TGME49_chr 604551 ID=TGME49_306390;length= 17

IX 2 6047246 + 34 655

TGME49_chr 193523 ID=TGME49_225920;length=27

X 2 1938008 - 76 656

TGME49_chr 144126 ID=TGME49_240770;length=24

VI 7 1443761 - 94 657

TGME49_chr 200150 ID=TGME49_204090;length=38

Vila 8 2005367 - 59 658

TGME49_chr ID=TGME49_294025 ;length=27

la 654666 657455 - 89 659

TGME49_chr 217469 ID=TGME49_260530;length=22

Vllb 2 2176915 + 23 660

TGME49_chr ID=TGME49_219700;length=21

XII 711702 713805 + 03 661

TGME49_chr 130470 ID=TGME49_295000;length=22

la 1 1306918 + 17 662

TGME49_chr ID=TGME49_267560;length=10

IX 568139 569234 + 95 663

TGME49_chr 539782 ID=TGME49_270110;length= 14

VIII 9 5399254 - 25 664

TGME49_chr 284410 ID=TGME49_246550;length=21

XII 2 2846241 + 39 665

TGME49_chr 493492 ID=TGME49_249900;length=25

XII 6 4937491 + 65 666

TGME49_chr 488005 4884161 + ID=TGME49_235470;length=41 667 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

X 3 08

TGME49_chr 586859 ID=TGME49_216820;length=30

XI 5 5871618 - 23 668

TGME49_chr 640425 ID=TGME49_277940;length=29

XII 1 6407186 - 35 669

TGME49_chr ID=TGME49_266800;length=20

IX 957423 959519 + 96 670

TGME49_chr 577498 ID=TGME49_278900;length=24

XII 9 5777401 - 12 671

TGME49_chr 395975 ID=TGME49_272450;length=24

VIII 1 3962176 + 25 672

TGME49_chr 370067 ID=TGME49_273030;length=37

VIII 5 3704469 - 94 673

TGME49_chr 353255 ID=TGME49_258170;length= 19

Vllb 5 3534514 + 59 674

TGME49_chr 533864 ID=TGME49_305290;length= 18

IX 5 5340540 - 95 675

TGME49_chr 380735 ID=TGME49_290645;length=33

IX 1 3810696 + 45 676

TGME49_chr 562578 ID=TGME49_251855;length=24

XII 6 5628283 + 97 677

TGME49_chr 300420 ID=TGME49_244070;length=24

VI 4 3006660 + 56 678

TGME49_chr 394860 ID=TGME49_201230;length=35

Vila 2 3952191 + 89 679

TGME49_chr ID=TGME49_229005 ;length= 19

VIII 15020 16940 + 20 680

TGME49_chr ID=TGME49_293570;length=34

la 429247 432726 - 79 681

TGME49_chr 192728 ID=TGME49_297710;length=48

II 1 1932126 + 45 682

TGME49_chr 263701 ID=TGME49_312370;length= 17

XI 0 2638757 + 47 683

TGME49_chr 346902 ID=TGME49_247660;length=43

XII 0 3473396 - 76 684

TGME49_chr 491615 ID=TGME49_315700;length=40

XI 7 4920214 - 57 685

TGME49_chr ID=TGME49_208360;length=18

lb 629071 630918 + 47 686

TGME49_chr 675582 ID=TGME49_277050;length= 15

XII 7 6757330 + 03 687

TGME49_chr 441130 ID=TGME49_256840;length=24

Vllb 0 4413757 - 57 688

TGME49_chr 704922 ID=TGME49_215610;length= 18

X 1 7051104 - 83 689

TGME49_chr 652853 ID=TGME49_277760;length=50

XII 8 6533549 + 11 690 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 342792 ID=TGME49_202310;length=21

Vila 1 3430094 + 73 691

TGME49_chr 668939 ID=TGME49_215030;length= 19

X 9 6691360 + 61 692

TGME49_chr 287448 ID=TGME49_243900;length= 18

VI 3 2876334 + 51 693

TGME49_chr ID=TGME49_229180;length= 19

VIII 83739 85719 - 80 694

TGME49_chr 211901 ID=TGME49_210715 ;length=30

IV 5 2122018 - 03 695

TGME49_chr 325907 ID=TGME49_258680;length=31

Vllb 5 3262189 + 14 696

TGME49_chr ID=TGME49_220100;length=20

V 125975 128008 + 33 697

TGME49_chr 335439 ID=TGME49_244550;length=43

VI 2 3358777 - 85 698

TGME49_chr 181049 ID=TGME49_286490;length=32

V 7 1813699 + 02 699

TGME49_chr 130631 ID=TGME49_209530;length=20

lb 7 1308410 + 93 700

TGME49_chr 206270 ID=TGME49_286000;length=30

V 0 2065738 + 38 701

TGME49_chr 276254 ID=TGME49_288840;length=23

IX 5 2764943 + 98 702

TGME49_chr ID=TGME49_227350;length=44

X 995784 1000191 + 07 703

TGME49_chr 153721 ID=TGME49_209890;length=49

lb 5 1542214 - 99 704

TGME49_chr 298595 ID=TGME49_244050;length=26

VI 4 2988613 + 59 705

TGME49_chr 518708 ID=TGME49_316200;length=26

XI 5 5189714 - 29 706

TGME49_chr 256167 ID=TGME49_203340;length=43

Vila 0 2565986 + 16 707

TGME49_chr 222631 ID=TGME49_285790;length=48

V 5 2231159 + 44 708

TGME49_chr ID=TGME49_267725 ;length= 14

IX 404524 405991 + 67 709

TGME49_chr ID=TGME49_267360;length=22

IX 694704 696929 + 25 710

TGME49_chr 377734 ID=TGME49_248290;length=32

XII 5 3780569 + 24 711

TGME49_chr 309240 ID=TGME49_202720;length=20

Vila 6 3094459 - 53 712

TGME49_chr 245560 ID=TGME49_312150;length=43

XI 0 2459938 + 38 713

TGME49_chr 220920 223578 + ID=TGME49_221295 ;length=26 714 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

II 58

TGME49_chr 208416 ID=TGME49_217730;length=29

XII 9 2087144 - 75 715

TGME49_chr 100779 ID=TGME49_209045 ;length=25

lb 1 1010351 - 60 716

TGME49_chr 526570 ID=TGME49_236070;length=85

X 2 5266555 - 3 717

TGME49_chr 345746 ID=TGME49_258360;length=33

Vllb 5 3460841 - 76 718

TGME49_chr 219290 ID=TGME49_232610;length=30

VIII 3 2195917 - 14 719

TGME49_chr 110948 ID=TGME49_262640;length= 16

Vllb 3 1111172 + 89 720

TGME49_chr 385365 ID=TGME49_223430;length=21

X 7 3855793 - 36 721

TGME49_chr 423919 ID=TGME49_314720;length=46

XI 8 4243862 + 64 722

TGME49_chr 269786 ID=TGME49_259710;length=24

Vllb 9 2700279 + 10 723

TGME49_chr 147173 ID=TGME49_205330;length=21

Vila 0 1473919 - 89 724

TGME49_chr 271187 ID=TGME49_284570;length=22

V 8 2714132 - 54 725

TGME49_chr ID=TGME49_229700;length=25

VIII 388717 391218 - 01 726

TGME49_chr 153897 ID=TGME49_265455;length=32

IX 3 1542213 + 40 727

TGME49_chr ID=TGME49_309610;length=38

XI 763730 767625 - 95 728

TGME49_chr 254436 ID=TGME49_312250;length=20

XI 5 2546425 + 60 729

TGME49_chr 564359 ID=TGME49_269730;length=36

VIII 0 5647253 - 63 730

TGME49_chr 358401 ID=TGME49_290005 ;length= 12

IX 2 3585263 - 51 731

TGME49_chr 397110 ID=TGME49_201215 ;length= 11

Vila 8 3972217 - 09 732

TGME49_chr 566685 ID=TGME49_305910;length=25

IX 2 5669401 - 49 733

TGME49_chr 172677 ID=TGME49_204540;length= 15

Vila 8 1728368 + 90 734

TGME49_chr 576361 ID=TGME49_306030;length=30

IX 9 5766671 - 52 735

TGME49_chr 521153 ID=TGME49_305080;length=24

IX 2 5213996 - 64 736

TGME49_chr 100244 ID=TGME49_222270;length=30

II 1 1005482 + 41 737 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_300320;length= 16

XII 82314 83922 + 08 738

TGME49_chr 411281 ID=TGME49_314470;length=47

XI 3 4117560 + 47 739

TGME49_chr 633864 ID=TGME49_268740;length=20

VIII 9 6340670 - 21 740

TGME49_chr 495534 ID=TGME49_210220;length=20

IX 1 4957346 - 05 741

TGME49_chr 117669 ID=TGME49_226980;length=10

X 3 1177745 - 52 742

TGME49_chr 165641 ID=TGME49_218240;length=34

XII 2 1659890 - 78 743

TGME49_chr 603391 ID=TGME49_278550;length=17

XII 9 6035646 - 27 744

TGME49_chr 296097 ID=TGME49_202890;length=26

Vila 8 2963669 - 91 745

TGME49_chr 608758 ID=TGME49_278450;length=47

XII 5 6092290 + 05 746

TGME49_chr ID=TGME49_229230;length=32

VIII 101229 104465 + 36 747

TGME49_chr 102717 ID=TGME49_319330;length=21

IV 6 1029320 - 44 748

TGME49_chr ID=TGME49_213325 ;length=26

V 927001 929683 + 82 749

TGME49_chr 197927 ID=TGME49_204120;length=20

Vila 8 1981377 - 99 750

TGME49_chr 216840 ID=TGME49_203875 ;length= 16

Vila 0 2170024 - 24 751

TGME49_chr ID=TGME49_213040;length=28

V 805784 808589 + 05 752

TGME49_chr 325766 ID=TGME49_313310;length=23

XI 4 3259999 + 35 753

TGME49_chr 422592 ID=TGME49_314690;length= 17

XI 1 4227690 - 69 754

TGME49_chr 246924 ID=TGME49_312160;length= 19

XI 9 2471186 - 37 755

TGME49_chr 306939 ID=TGME49_202740;length= 18

Vila 6 3071264 - 68 756

TGME49_chr 398250 ID=TGME49_257550;length=20

Vllb 5 3984582 - 77 757

TGME49_chr 235240 ID=TGME49_242850;length=40

VI 5 2356464 - 59 758

TGME49_chr 136468 ID=TGME49_205510;length=23

Vila 2 1367044 - 62 759

TGME49_chr 329028 ID=TGME49_289530;length=38

IX 0 3294112 - 32 760

TGME49_chr 650976 6512159 + ID=TGME49_214820;length=23 761 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

X 4 95

TGME49_chr ID=TGME49_228110;length= 16

X 581261 582904 - 43 762

TGME49_chr 299006 ID=TGME49_259040;length=45

Vllb 5 2994627 + 62 763

TGME49_chr 207206 ID=TGME49_311450;length=49

XI 0 2077000 + 40 764

TGME49_chr 287623 ID=TGME49_233880;length=25

VIII 1 2878765 - 34 765

TGME49_chr 194085 ID=TGME49_286160;length=23

V 2 1943237 + 85 766

TGME49_chr ID=TGME49_208510;length= 18

lb 737486 739366 + 80 767

TGME49_chr 742744 ID=TGME49_207180;length=20

X 2 7429460 + 18 768

TGME49_chr 451767 ID=TGME49_315150;length=26

XI 2 4520283 + 11 769

TGME49_chr 207561 ID=TGME49_242570;length=20

VI 3 2077679 - 66 770

TGME49_chr 579277 ID=TGME49_278878;length=23

XII 9 5795081 - 02 771

TGME49_chr 660706 ID=TGME49_298630;length=27

XI 8 6609847 + 79 772

TGME49_chr 130983 ID=TGME49_222920;length=21

II 6 1311957 + 21 773

TGME49_chr 325405 ID=TGME49_244420;length= 17

VI 9 3255822 - 63 774

TGME49_chr 173333 ID=TGME49_241840;length=50

VI 5 1738338 + 03 775

TGME49_chr ID=TGME49_300350;length=39

XII 51579 55519 + 40 776

TGME49_chr 427850 ID=TGME49_291580;length=83

IX 7 4279338 - 1 111

TGME49_chr ID=TGME49_228160;length=31

X 537261 540449 - 88 778

TGME49_chr 263789 ID=TGME49_233430;length= 16

VIII 7 2639548 + 51 779

TGME49_chr 632046 ID=TGME49_214490;length= 16

X 1 6322071 - 10 780

TGME49_chr 143896 ID=TGME49_205360;length=40

Vila 9 1442993 + 24 781

TGME49_chr ID=TGME49_267710;length=41

IX 423314 427457 + 43 782

TGME49_chr 330533 ID=TGME49_247380;length=48

XII 8 3310175 - 37 783

TGME49_chr 110712 ID=TGME49_213510;length=26

V 1 1109784 + 63 784 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 392824 ID=TGME49_290900;length=23

IX 7 3930598 - 51 785

TGME49_chr 173725 ID=TGME49_218070;length= 15

XII 4 1738814 - 60 786

TGME49_chr 129759 ID=TGME49_226840;length=26

X 3 1300223 + 30 787

TGME49_chr 329891 ID=TGME49_244480;length=47

VI 7 3303680 + 63 788

TGME49_chr ID=TGME49_219730;length=47

XII 681432 686187 + 55 789

TGME49_chr 115439 ID=TGME49_262564;length=48

Vllb 8 1159261 + 63 790

TGME49_chr 252091 ID=TGME49_260020;length=25

Vllb 8 2523507 + 89 791

TGME49_chr 438343 ID=TGME49_282130;length=50

Vila 9 4388493 + 54 792

TGME49_chr 635787 ID=TGME49_268710;length=43

VIII 8 6362199 - 21 793

TGME49_chr 306285 ID=TGME49_283698;length=17

V 8 3064646 - 88 794

TGME49_chr ID=TGME49_207650;length=47

lb 185665 190405 + 40 795

TGME49_chr 311395 ID=TGME49_247025;length=36

XII 3 3117617 - 64 796

TGME49_chr 266213 ID=TGME49_317705 ;length=21

IV 6 2664273 + 37 797

TGME49_chr 620410 ID=TGME49_216375 ;length=22

XI 2 6206390 - 88 798

TGME49_chr 197145 ID=TGME49_204130;length=31

Vila 1 1974609 - 58 799

TGME49_chr 137864 ID=TGME49_223030;length= 18

II 5 1380471 - 26 800

TGME49_chr ID=TGME49_228480;length= 13

X 294956 296317 - 61 801

TGME49_chr 332591 ID=TGME49_202445 ;length= 17

Vila 2 3327698 - 86 802

TGME49_chr ID=TGME49_304520;length=28

Vila 530623 533507 + 84 803

TGME49_chr 187853 ID=TGME49_286265 ;length= 16

V 0 1880133 - 03 804

TGME49_chr 595557 ID=TGME49_216700;length= 13

XI 0 5956873 - 03 805

TGME49_chr 209454 ID=TGME49_297880;length=30

II 2 2097582 - 40 806

TGME49_chr 682194 ID=TGME49_215220;length=34

X 9 6825368 + 19 807

TGME49_chr 658617 6591203 - ID=TGME49_268280;length=50 808 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 4 29

TGME49_chr 234687 ID=TGME49_285520;length=21

V 0 2349052 + 82 809

TGME49_chr 318295 ID=TGME49_202600;length= 15

Vila 6 3184488 - 32 810

TGME49_chr 195987 ID=TGME49_297730;length=20

II 3 1961931 - 58 811

TGME49_chr 221336 ID=TGME49_232660;length=26

VIII 7 2216028 - 61 812

TGME49_chr 147259 ID=TGME49_231590;length= 19

VIII 3 1474572 + 79 813

TGME49_chr ID=TGME49_263650;length=49

Vllb 407034 411993 + 59 814

TGME49_chr 604041 ID=TGME49_278522;length=14

XII 0 6041868 + 58 815

TGME49_chr ID=TGME49_238210;length=42

VI 128832 133066 + 34 816

TGME49_chr 235131 ID=TGME49_233020;length=21

VIII 2 2353449 - 37 817

TGME49_chr 367567 ID=TGME49_248130;length=24

XII 0 3678134 + 64 818

TGME49_chr 175130 ID=TGME49_311060;length=28

XI 5 1754114 + 09 819

TGME49_chr 382092 ID=TGME49_272650;length=31

VIII 0 3824021 - 01 820

TGME49_chr 395819 ID=TGME49_290940;length=28

IX 7 3961023 - 26 821

TGME49_chr 392730 ID=TGME49_272500;length=41

VIII 6 3931495 + 89 822

TGME49_chr 455736 ID=TGME49_256030;length=20

Vllb 5 4559435 - 70 823

TGME49_chr 327620 ID=TGME49_202500;length=29

Vila 4 3279186 - 82 824

TGME49_chr 631889 ID=TGME49_268780;length=24

VIII 6 6321304 - 08 825

TGME49_chr ID=TGME49_263550;length=35

Vllb 479812 483399 + 87 826

TGME49_chr 429028 ID=TGME49_291620;length=29

IX 3 4293221 + 38 827

TGME49_chr ID=TGME49_238050;length=30

VI 28927 31927 + 00 828

TGME49_chr ID=TGME49_320140;length= 18

IV 478540 480420 - 80 829

TGME49_chr ID=TGME49_304690;length=37

Vila 625928 629635 - 07 830

TGME49_chr 548533 ID=TGME49_316670;length=42

XI 8 5489618 - 80 831 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 153984 ID=TGME49_205210;length=22

Vila 1 1542075 + 34 832

TGME49_chr ID=TGME49_219800;length=41

XII 634097 638220 + 23 833

TGME49_chr 182285 ID=TGME49_286465;length=47

V 1 1827597 - 46 834

TGME49_chr 557302 ID=TGME49_269850;length=44

VIII 3 5577518 + 95 835

TGME49_chr ID=TGME49_212880;length=25

V 688702 691280 - 78 836

TGME49_chr ID=TGME49_206700;length=48

Vila 718101 722948 - 47 837

TGME49_chr 107862 ID=TGME49_262690;length= 15

Vllb 3 1080134 + 11 838

TGME49_chr ID=TGME49_229270;length=49

VIII 122484 127476 + 92 839

TGME49_chr 198445 ID=TGME49_232270;length=42

VIII 5 1988743 + 88 840

TGME49_chr 163556 ID=TGME49_321685 ;length= 14

lb 0 1637014 + 54 841

TGME49_chr 493610 ID=TGME49_249890;length=42

XII 0 4940316 - 16 842

TGME49_chr ID=TGME49_309185;length=25

XI 542439 545032 - 93 843

TGME49_chr 442190 ID=TGME49_234505 ;length= 16

X 9 4423573 + 64 844

TGME49_chr 617218 ID=TGME49_216400;length=39

XI 1 6176170 - 89 845

TGME49_chr 229151 ID=TGME49_225350;length=15

X 0 2293010 - 00 846

TGME49_chr 174954 ID=TGME49_286630;length=22

V 2 1751790 + 48 847

TGME49_chr 131981 ID=TGME49_240570;length=27

VI 1 1322525 + 14 848

TGME49_chr ID=TGME49_264030;length=28

Vllb 229491 232296 - 05 849

TGME49_chr 231085 ID=TGME49_242810;length=20

VI 4 2312930 + 76 850

TGME49_chr 650267 ID=TGME49_214800;length=49

X 8 6507667 - 89 851

TGME49_chr 206033 ID=TGME49_311400;length=23

XI 9 2062705 - 66 852

TGME49_chr ID=TGME49_230650;length=27

VIII 926825 929619 + 94 853

TGME49_chr ID=TGME49_300300;length= 12

XII 105272 106503 - 31 854

TGME49_chr 453277 455115 - ID=TGME49_229800;length=18 855 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 38

TGME49_chr 164456 ID=TGME49_226420;length=35

X 2 1648123 + 61 856

TGME49_chr 151743 ID=TGME49_297080;length=44

II 8 1521865 + 27 857

TGME49_chr 404650 ID=TGME49_281360;length= 12

Vila 9 4047748 - 39 858

TGME49_chr 284781 ID=TGME49_224740;length= 18

X 2 2849682 - 70 859

TGME49_chr 240400 ID=TGME49_288240;length=21

IX 1 2406112 + 11 860

TGME49_chr 205967 ID=TGME49_297830;length=28

II 9 2062541 - 62 861

TGME49_chr 148376 ID=TGME49_240870;length=23

VI 6 1486135 + 69 862

TGME49_chr 570119 ID=TGME49_269670;length=22

VIII 2 5703410 - 18 863

TGME49_chr 301102 ID=TGME49_283720;length= 15

V 2 3012612 + 90 864

TGME49_chr 287381 ID=TGME49_283900;length=14

V 4 2875242 - 28 865

TGME49_chr 267763 ID=TGME49_224928 ;length= 13

X 8 2679002 - 64 866

TGME49_chr ID=TGME49_239090;length=44

VI 459221 463664 - 43 867

TGME49_chr 248169 ID=TGME49_288380;length= 17

IX 1 2483404 + 13 868

TGME49_chr 633646 ID=TGME49_278050;length= 16

XII 6 6338146 - 80 869

TGME49_chr 346172 ID=TGME49_313530;length=27

XI 6 3464489 - 63 870

TGME49_chr ID=TGME49_320105 ;length=44

IV 494931 499385 - 54 871

TGME49_chr 321678 ID=TGME49_258730;length=42

Vllb 5 3221009 + 24 872

TGME49_chr 238756 ID=TGME49_301330;length= 16

IV 9 2389243 - 74 873

TGME49_chr 458199 ID=TGME49_249510;length=29

XII 2 4584975 + 83 874

TGME49_chr ID=TGME49_227870;length=22

X 764443 766659 - 16 875

TGME49_chr 527243 ID=TGME49_316400;length=21

XI 4 5274572 + 38 876

TGME49_chr ID=TGME49_266910;length=46

IX 907526 912198 - 72 877

TGME49_chr 334873 ID=TGME49_273620;length= 15

VIII 8 3350248 - 10 878 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 497796 ID=TGME49_249990;length=21

XII 6 4980165 - 99 879

TGME49_chr 297103 ID=TGME49_283790;length=36

V 6 2974678 - 42 880

TGME49_chr 619450 ID=TGME49_278250;length=21

XII 1 6196639 + 38 881

TGME49_chr 488643 ID=TGME49_255350;length=41

Vllb 4 4890628 - 94 882

TGME49_chr 153746 ID=TGME49_261770;length= 12

Vllb 2 1538674 - 12 883

TGME49_chr 267344 ID=TGME49_288680;length=29

IX 1 2676363 - 22 884

TGME49_chr 378794 ID=TGME49_223480;length=25

X 1 3790472 + 31 885

TGME49_chr 363588 ID=TGME49_247940;length=50

XII 4 3640897 - 13 886

TGME49_chr 349133 ID=TGME49_244726;length=40

VI 7 3495343 + 06 887

TGME49_chr 342590 ID=TGME49_247580;length=27

XII 1 3428625 - 24 888

TGME49_chr 166003 ID=TGME49_261540;length=30

Vllb 9 1663128 + 89 889

TGME49_chr ID=TGME49_238180;length=21

VI 110931 113077 - 46 890

TGME49_chr 156901 ID=TGME49_218362;length=43

XII 1 1573385 - 74 891

TGME49_chr 304883 ID=TGME49_258990;length= 16

Vllb 7 3050490 - 53 892

TGME49_chr ID=TGME49_263820;length=47

Vllb 301194 305916 + 22 893

TGME49_chr 438672 ID=TGME49_282120;length=38

Vila 7 4390549 - 22 894

TGME49_chr ID=TGME49_320600;length=38

IV 205415 209239 + 24 895

TGME49_chr ID=TGME49_279315 ;length=31

IX 281401 284597 - 96 896

TGME49_chr 424794 ID=TGME49_257060;length=23

Vllb 7 4250288 + 41 897

TGME49_chr 434759 ID=TGME49_314850;length=34

XI 3 4351033 - 40 898

TGME49_chr 167129 ID=TGME49_226385 ;length=25

X 5 1673894 - 99 899

TGME49_chr 189850 ID=TGME49_217540;length=22

XII 4 1900791 - 87 900

TGME49_chr 162350 ID=TGME49_241130;length=26

VI 1 1626197 + 96 901

TGME49_chr 508311 5085552 + ID=TGME49_315940;length=24 902 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XI 8 34

TGME49_chr 110864 ID=TGME49_219150;length=20

XII 8 1110707 - 59 903

TGME49_chr 369175 ID=TGME49_313780;length=42

XI 4 3696010 - 56 904

TGME49_chr 120105 ID=TGME49_266360;length=23

IX 5 1203450 + 95 905

TGME49_chr 162875 ID=TGME49_265290;length= 14

IX 3 1630162 + 09 906

TGME49_chr 580216 ID=TGME49_306080;length=31

IX 6 5805344 + 78 907

TGME49_chr 126932 ID=TGME49_213730;length=25

V 4 1271922 - 98 908

TGME49_chr 654494 ID=TGME49_268320;length=16

VIII 3 6546577 + 34 909

TGME49_chr 256589 ID=TGME49_243350;length=47

VI 9 2570600 + 01 910

TGME49_chr ID=TGME49_220250;length=26

V 244725 247396 - 71 911

TGME49_chr 425358 ID=TGME49_234260;length= 12

X 1 4254822 - 41 912

TGME49_chr ID=TGME49_300230;length=21

XII 141229 143333 + 04 913

TGME49_chr 524875 ID=TGME49_236050;length=44

X 5 5253245 + 90 914

TGME49_chr 125597 ID=TGME49_222840;length= 19

II 4 1257956 - 82 915

TGME49_chr 142126 ID=TGME49_231430;length= 19

VIII 5 1423169 - 04 916

TGME49_chr ID=TGME49_206610;length=50

Vila 789026 794038 - 12 917

TGME49_chr 145469 ID=TGME49_240810;length=27

VI 6 1457473 + 77 918

TGME49_chr 443271 ID=TGME49_234520;length= 17

X 9 4434480 + 61 919

TGME49_chr ID=TGME49_294610;length=38

la 999746 1003574 - 28 920

TGME49_chr ID=TGME49_280480;length=26

Vila 374254 376921 - 67 921

TGME49_chr ID=TGME49_320670;length=29

IV 146510 149451 + 41 922

TGME49_chr 295565 ID=TGME49_289060;length=26

IX 0 2958322 - 72 923

TGME49_chr 228328 ID=TGME49_285700;length=19

V 8 2285190 - 02 924

TGME49_chr 261928 ID=TGME49_233390;length=20

VIII 7 2621303 + 16 925 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 224621 ID=TGME49_311740;length=20

XI 8 2248260 - 42 926

TGME49_chr 330442 ID=TGME49_247390;length=50

XII 5 3309462 + 37 927

TGME49_chr 682255 ID=TGME49_215210;length=41

X 9 6826683 - 24 928

TGME49_chr 104859 ID=TGME49_310100;length=47

XI 0 1053342 + 52 929

TGME49_chr 142915 ID=TGME49_226690;length=42

X 3 1433385 + 32 930

TGME49_chr 260277 ID=TGME49_288540;length=39

IX 1 2606731 + 60 931

TGME49_chr ID=TGME49_200590;length=23

X 76000 78305 - 05 932

TGME49_chr 313147 ID=TGME49_244240;length= 13

VI 4 3132802 + 28 933

TGME49_chr 274696 ID=TGME49_243600;length=39

VI 2 2750934 - 72 934

TGME49_chr 486970 ID=TGME49_235440;length=23

X 3 4872065 + 62 935

TGME49_chr ID=TGME49_293310;length=27

la 245706 248432 - 26 936

TGME49_chr ID=TGME49_238030;length= 15

VI 18291 19862 - 71 937

TGME49_chr 102588 ID=TGME49_294640;length=25

la 3 1028407 + 24 938

TGME49_chr 632904 ID=TGME49_278060;length=17

XII 5 6330752 - 07 939

TGME49_chr 634357 ID=TGME49_216200;length=24

XI 5 6346044 + 69 940

TGME49_chr 552801 ID=TGME49_236830;length=42

X 6 5528444 - 8 941

TGME49_chr 400612 ID=TGME49_201140;length=23

Vila 2 4008450 + 28 942

TGME49_chr ID=TGME49_309590;length= 12

XI 744895 746156 + 61 943

TGME49_chr ID=TGME49_319700;length= 16

IV 777767 779444 + 77 944

TGME49_chr 170499 ID=TGME49_295600;length=27

la 3 1707763 - 70 945

TGME49_chr ID=TGME49_320580;length= 18

IV 251460 253297 - 37 946

TGME49_chr 454334 ID=TGME49_256050;length=19

Vllb 5 4545276 + 31 947

TGME49_chr 288006 ID=TGME49_243920;length=26

VI 3 2882722 + 59 948

TGME49_chr 229700 2301830 - ID=TGME49_285670;length=48 949 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

V 9 21

TGME49_chr ID=TGME49_319850;length=29

IV 742966 745952 + 86 950

TGME49_chr ID=TGME49_221670;length=38

II 551419 555255 - 36 951

TGME49_chr ID=TGME49_266900;length=45

IX 903708 908281 + 73 952

TGME49_chr 657189 ID=TGME49_277685;length=26

XII 0 6574505 + 15 953

TGME49_chr 251100 ID=TGME49_301470;length=32

IV 2 2514234 + 32 954

TGME49_chr 288188 ID=TGME49_224680;length=41

X 2 2886012 - 30 955

TGME49_chr 118994 ID=TGME49_218955 ;length=26

XII 3 1192566 - 23 956

TGME49_chr 163671 ID=TGME49_310870;length= 16

XI 5 1638328 + 13 957

TGME49_chr 174083 ID=TGME49_286650;length=30

V 8 1743872 - 34 958

TGME49_chr 203802 ID=TGME49_210815 ;length= 14

IV 8 2039463 - 35 959

TGME49_chr 624617 ID=TGME49_214320;length=28

X 0 6248972 + 02 960

TGME49_chr ID=TGME49_263590;length=24

Vllb 460354 462837 - 83 961

TGME49_chr 423919 ID=TGME49_314715 ;length= 19

XI 8 4241167 + 69 962

TGME49_chr 174788 ID=TGME49_226280;length= 19

X 6 1749867 - 81 963

TGME49_chr 487295 ID=TGME49_270920;length=46

VIII 3 4877607 + 54 964

TGME49_chr ID=TGME49_221530;length=31

II 420859 424002 - 43 965

TGME49_chr 490263 ID=TGME49_315690;length=22

XI 9 4904888 + 49 966

TGME49_chr 240452 ID=TGME49_288230;length=22

IX 6 2406796 - 70 967

TGME49_chr 500909 ID=TGME49_315810;length= 16

XI 3 5010699 - 06 968

TGME49_chr 680985 ID=TGME49_200330;length=23

VIII 8 6812222 - 64 969

TGME49_chr 197391 ID=TGME49_242340;length=75

VI 1 1974670 + 9 970

TGME49_chr ID=TGME49_299825 ;length=41

XII 316928 321044 - 16 971

TGME49_chr ID=TGME49_221910;length=48

II 683949 688778 + 29 972 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_208310;length=21

lb 607119 609291 - 72 973

TGME49_chr 171749 ID=TGME49_311000;length=35

XI 0 1721072 + 82 974

TGME49_chr 288726 ID=TGME49_312650;length=20

XI 2 2889295 - 33 975

TGME49_chr ID=TGME49_280522;length= 15

Vila 332983 334493 + 10 976

TGME49_chr 153192 ID=TGME49_226560;length=47

X 3 1536680 + 57 977

TGME49_chr 262605 ID=TGME49_233405 ;length= 16

VIII 7 2627682 + 25 978

TGME49_chr 117015 ID=TGME49_262550;length=36

Vllb 2 1173763 - 11 979

TGME49_chr 194144 ID=TGME49_211060;length=25

IV 1 1943945 + 04 980

TGME49_chr 444752 ID=TGME49_271600;length=30

VIII 2 4450597 + 75 981

TGME49_chr 196522 ID=TGME49_232240;length=24

VIII 5 1967627 + 02 982

TGME49_chr 359664 ID=TGME49_290040;length=22

IX 8 3598874 + 26 983

TGME49_chr 396833 ID=TGME49_257572;length=46

Vllb 6 3973027 - 91 984

TGME49_chr ID=TGME49_220170;length=32

V 187252 190551 - 99 985

TGME49_chr 127411 ID=TGME49_231160;length=27

VIII 4 1276871 + 57 986

TGME49_chr 463323 ID=TGME49_255910;length=75

Vllb 9 4633990 - 1 987

TGME49_chr 656774 ID=TGME49_277690;length= 13

XII 0 6569055 + 15 988

TGME49_chr ID=TGME49_221710;length=22

II 605002 607207 - 05 989

TGME49_chr 139120 ID=TGME49_209680;length=30

lb 8 1394231 + 23 990

TGME49_chr ID=TGME49_239120;length= 17

VI 468532 470265 + 33 991

TGME49_chr 292163 ID=TGME49_224630;length=39

X 0 2925549 - 19 992

TGME49_chr 279900 ID=TGME49_243690;length=48

VI 7 2803879 - 72 993

TGME49_chr 290492 ID=TGME49_243940;length=20

VI 7 2906963 - 36 994

TGME49_chr ID=TGME49_221585 ;length=23

II 466401 468740 - 39 995

TGME49_chr 661628 6618113 - ID=TGME49_268240;length=18 996 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 3 30

TGME49_chr 556597 ID=TGME49_236890;length=29

X 0 5568898 - 28 997

TGME49_chr ID=TGME49_219770;length= 15

XII 651262 652841 + 79 998

TGME49_chr 243209 ID=TGME49_288280;length=20

IX 6 2434121 + 25 999

TGME49_chr 163621 ID=TGME49_218270;length=42

XII 7 1640467 - 50 1000

TGME49_chr 187250 ID=TGME49_204330;length= 15

Vila 8 1874024 + 16 1001

TGME49_chr 373730 ID=TGME49_248200;length=21

XII 0 3739417 + 17 1002

TGME49_chr 385851 ID=TGME49_313990;length= 16

XI 0 3860201 + 91 1003

TGME49_chr 253560 ID=TGME49_243302;length=22

VI 9 2537835 - 26 1004

TGME49_chr ID=TGME49_229380;length= 18

VIII 218635 220518 + 83 1005

TGME49_chr 589922 ID=TGME49_278750;length=49

XII 6 5904153 + 27 1006

TGME49_chr ID=TGME49_263120;length= 16

Vllb 791575 793258 - 83 1007

TGME49_chr ID=TGME49_219370;length=35

XII 965702 969227 - 25 1008

TGME49_chr 224792 ID=TGME49_260470;length=46

Vllb 1 2252571 - 50 1009

TGME49_chr ID=TGME49_222160;length=48

II 925169 930001 - 32 1010

TGME49_chr 432812 ID=TGME49_271870;length=29

VIII 2 4331040 - 18 1011

TGME49_chr 456580 ID=TGME49_256020;length=23

Vllb 3 4568166 - 63 1012

TGME49_chr ID=TGME49_267730;length=23

IX 401868 404259 - 91 1013

TGME49_chr ID=TGME49_319580;length=22

IV 883987 886188 + 01 1014

TGME49_chr 327121 ID=TGME49_247330;length=22

XII 2 3273508 - 96 1015

TGME49_chr 117918 ID=TGME49_318632;length=94

IV 0 1180121 - 1 1016

TGME49_chr 590474 ID=TGME49_306270;length= 18

IX 8 5906644 - 96 1017

TGME49_chr 288287 ID=TGME49_233905;length=23

VIII 8 2885243 + 65 1018

TGME49_chr ID=TGME49_207830;length=28

lb 319440 322320 + 80 1019 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 527606 ID=TGME49_316410;length=33

XI 4 5279392 + 28 1020

TGME49_chr 432783 ID=TGME49_256965;length=20

Vllb 5 4329892 - 57 1021

TGME49_chr 138366 ID=TGME49_205480;length=33

Vila 4 1387009 - 45 1022

TGME49_chr 118327 ID=TGME49_226970;length= 13

X 7 1184615 - 38 1023

TGME49_chr 301169 ID=TGME49_283730;length=19

V 6 3013612 - 16 1024

TGME49_chr 153192 ID=TGME49_226565;length=20

X 3 1533938 + 15 1025

TGME49_chr 224839 ID=TGME49_225420;length= 13

X 0 2249697 + 07 1026

TGME49_chr ID=TGME49_230450;length=43

VIII 792105 796468 + 63 1027

TGME49_chr 347147 ID=TGME49_289830;length=21

IX 3 3473649 + 76 1028

TGME49_chr 608623 ID=TGME49_216560;length=30

XI 2 6089255 - 23 1029

TGME49_chr 598005 ID=TGME49_269232;length=27

VIII 9 5982851 - 92 1030

TGME49_chr 202367 ID=TGME49_232350;length=45

VIII 0 2028207 + 37 1031

TGME49_chr 364110 ID=TGME49_247970;length=40

XII 4 3645165 + 61 1032

TGME49_chr ID=TGME49_262990;length=27

Vllb 883251 885953 - 02 1033

TGME49_chr 262664 ID=TGME49_237880;length=45

IV 4 2631162 - 18 1034

TGME49_chr ID=TGME49_293060;length=24

la 111397 113812 + 15 1035

TGME49_chr ID=TGME49_239030;length=36

VI 428224 431908 - 84 1036

TGME49_chr 254040 ID=TGME49_233245 ;length= 17

VIII 9 2542122 - 13 1037

TGME49_chr 572898 ID=TGME49_217000;length= 16

XI 7 5730625 + 38 1038

TGME49_chr 439411 ID=TGME49_249240;length=43

XII 1 4398427 - 16 1039

TGME49_chr 397620 ID=TGME49_201210;length= 16

Vila 4 3977892 - 88 1040

TGME49_chr ID=TGME49_293170;length=30

la 135999 139033 - 34 1041

TGME49_chr 244632 ID=TGME49_260220;length=38

Vllb 2 2450162 - 40 1042

TGME49_chr 400506 4007884 + ID=TGME49_314330;length=28 1043 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XI 5 19

TGME49_chr 172185 ID=TGME49_211430;length=37

IV 8 1725651 + 93 1044

TGME49_chr 144025 ID=TGME49_240780;length=23

VI 6 1442654 + 98 1045

TGME49_chr 230943 ID=TGME49_260400;length=20

Vllb 7 2311494 + 57 1046

TGME49_chr 105731 ID=TGME49_230850;length=43

VIII 4 1061637 + 23 1047

TGME49_chr 310459 ID=TGME49_258880;length=20

Vllb 5 3106681 + 86 1048

TGME49_chr 355389 ID=TGME49_289970;length=41

IX 8 3558050 + 52 1049

TGME49_chr ID=TGME49_263180;length=35

Vllb 747456 750990 - 34 1050

TGME49_chr ID=TGME49_208010;length=23

lb 508499 510809 + 10 1051

TGME49_chr 340393 ID=TGME49_273560;length=37

VIII 9 3407719 - 80 1052

TGME49_chr 232825 ID=TGME49_260380;length=27

Vllb 8 2331030 - 72 1053

TGME49_chr 544756 ID=TGME49_305520;length=19

IX 9 5449517 + 48 1054

TGME49_chr 454089 ID=TGME49_249440;length=37

XII 8 4544602 - 04 1055

TGME49_chr 118066 ID=TGME49_266380;length=50

IX 2 1185700 - 38 1056

TGME49_chr ID=TGME49_221990;length=20

II 741431 743464 - 33 1057

TGME49_chr 330242 ID=TGME49_289550;length=17

IX 3 3304195 - 72 1058

TGME49_chr 516334 ID=TGME49_250740;length=36

XII 4 5167027 - 83 1059

TGME49_chr 136487 ID=TGME49_209620;length= 12

lb 7 1366149 - 72 1060

TGME49_chr ID=TGME49_263020;length= 18

Vllb 858403 860259 + 56 1061

TGME49_chr 208086 ID=TGME49_242580;length= 17

VI 1 2082634 - 73 1062

TGME49_chr 512650 ID=TGME49_316100;length= 16

XI 9 5128203 + 94 1063

TGME49_chr 214207 ID=TGME49_260580;length=29

Vllb 2 2145018 + 46 1064

TGME49_chr 268528 ID=TGME49_246150;length= 19

XII 3 2687256 + 73 1065

TGME49_chr ID=TGME49_238190;length=22

VI 110307 112590 + 83 1066 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 283747 ID=TGME49_233830;length=19

VIII 8 2839433 - 55 1067

TGME49_chr ID=TGME49_227615 ;length=40

X 867923 871948 + 25 1068

TGME49_chr 507122 ID=TGME49_304880;length=27

IX 3 5073998 - 75 1069

TGME49_chr 209089 ID=TGME49_232430;length=49

VIII 0 2095794 - 04 1070

TGME49_chr 278163 ID=TGME49_259550;length=25

Vllb 1 2784134 - 03 1071

TGME49_chr ID=TGME49_228690;length=20

X 193904 195964 + 60 1072

TGME49_chr 105971 ID=TGME49_294670;length=23

la 2 1062031 - 19 1073

TGME49_chr 389034 ID=TGME49_201520;length=26

Vila 4 3893030 + 86 1074

TGME49_chr 521467 ID=TGME49_305100;length=23

IX 6 5217074 + 98 1075

TGME49_chr ID=TGME49_206520;length=24

Vila 899877 902361 + 84 1076

TGME49_chr 353934 ID=TGME49_289940;length=24

IX 7 3541778 - 31 1077

TGME49_chr 452881 ID=TGME49_256080;length=19

Vllb 1 4530770 - 59 1078

TGME49_chr ID=TGME49_229250;length= 17

VIII 110877 112644 - 67 1079

TGME49_chr 229676 ID=TGME49_311810;length=40

XI 5 2300817 + 52 1080

TGME49_chr 274295 ID=TGME49_312500;length=23

XI 8 2745345 - 87 1081

TGME49_chr ID=TGME49_220500;length=22

V 417057 419288 + 31 1082

TGME49_chr 205752 ID=TGME49_260680;length=29

Vllb 5 2060480 + 55 1083

TGME49_chr ID=TGME49_208530;length=36

lb 746521 750200 + 79 1084

TGME49_chr 547496 ID=TGME49_236650;length=31

X 8 5478116 - 48 1085

TGME49_chr 507493 ID=TGME49_235710;length= 17

X 7 5076733 - 96 1086

TGME49_chr 128207 ID=TGME49_209510;length= 14

lb 7 1283563 + 86 1087

TGME49_chr ID=TGME49_293870;length= 18

la 617372 619206 + 34 1088

TGME49_chr 524875 ID=TGME49_236040;length=17

X 5 5250532 + 77 1089

TGME49_chr 528804 5290319 - ID=TGME49_250860;length=22 1090 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XII 6 73

TGME49_chr 272022 ID=TGME49_288750;length=47

IX 5 2724939 + 14 1091

TGME49_chr ID=TGME49_267585 ;length=28

IX 544999 547818 + 19 1092

TGME49_chr 106802 ID=TGME49_206310;length= 19

Vila 3 1069996 + 73 1093

TGME49_chr ID=TGME49_239690;length=23

VI 830758 833127 + 69 1094

TGME49_chr 226505 ID=TGME49_287960;length=39

IX 5 2268965 - 10 1095

TGME49_chr 608718 ID=TGME49_216550;length=20

XI 3 6089255 - 72 1096

TGME49_chr 257825 ID=TGME49_237840;length=28

IV 7 2581065 + 08 1097

TGME49_chr ID=TGME49_230705;length=36

VIII 991782 995395 - 13 1098

TGME49_chr 188418 ID=TGME49_297643 ;length= 18

II 6 1886034 + 48 1099

TGME49_chr 231205 ID=TGME49_311830;length=12

XI 0 2313267 + 17 1100

TGME49_chr 125450 ID=TGME49_222850;length=23

II 6 1256879 + 73 1101

TGME49_chr ID=TGME49_222030;length=22

II 766484 768700 + 16 1102

TGME49_chr 196108 ID=TGME49_264850;length=68

IX 8 1961769 + 1 1103

TGME49_chr 534426 ID=TGME49_270200;length=38

VIII 9 5348104 - 35 1104

TGME49_chr 138077 ID=TGME49_240660;length= 18

VI 5 1382606 - 31 1105

TGME49_chr 180471 ID=TGME49_261240;length=32

Vllb 6 1807943 + 27 1106

TGME49_chr 338228 ID=TGME49_224020;length=43

X 6 3386675 + 89 1107

TGME49_chr 190834 ID=TGME49_311240;length=40

XI 2 1912400 - 58 1108

TGME49_chr 443861 ID=TGME49_282200;length=40

Vila 0 4442652 + 42 1109

TGME49_chr 584471 ID=TGME49_306230;length=33

IX 2 5848045 + 33 1110

TGME49_chr 150281 ID=TGME49_310650;length= 14

XI 5 1504291 - 76 1111

TGME49_chr 197035 ID=TGME49_311320;length=35

XI 5 1973898 + 43 1112

TGME49_chr 230569 ID=TGME49_225330;length=24

X 5 2308125 + 30 1113 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 173656 ID=TGME49_261460;length= 19

Vllb 2 1738490 - 28 1114

TGME49_chr 228884 ID=TGME49_311790;length=20

XI 0 2290926 + 86 1115

TGME49_chr 500070 ID=TGME49_235635;length=13

X 3 5002026 + 23 1116

TGME49_chr 305180 ID=TGME49_313025 ;length= 12

XI 9 3053089 + 80 1117

TGME49_chr 122681 ID=TGME49_213670;length=24

V 9 1229241 - 22 1118

TGME49_chr 184693 ID=TGME49_242040;length= 12

VI 0 1848152 + 22 1119

TGME49_chr 512710 ID=TGME49_316090;length=20

XI 1 5129194 - 93 1120

TGME49_chr ID=TGME49_280370;length=29

Vila 489530 492510 + 80 1121

TGME49_chr 273311 ID=TGME49_233702;length=30

VIII 8 2736128 - 10 1122

TGME49_chr 639877 ID=TGME49_216120;length=31

XI 1 6401970 - 99 1123

TGME49_chr ID=TGME49_309160;length=35

XI 530082 533588 - 06 1124

TGME49_chr ID=TGME49_229750;length=39

VIII 428448 432358 - 10 1125

TGME49_chr 207783 ID=TGME49_297850;length=48

II 2 2082637 + 05 1126

TGME49_chr 208847 ID=TGME49_210770;length=21

IV 0 2090595 - 25 1127

TGME49_chr 111280 ID=TGME49_318742;length=21

IV 1 1114995 - 94 1128

TGME49_chr 269451 ID=TGME49_233530;length=19

VIII 2 2696420 + 08 1129

TGME49_chr 139218 ID=TGME49_209650;length=25

lb 8 1394786 - 98 1130

TGME49_chr ID=TGME49_220580;length=84

V 469162 470010 + 8 1131

TGME49_chr 203409 ID=TGME49_264770;length=28

IX 8 2036991 + 93 1132

TGME49_chr 285596 ID=TGME49_233850;length=20

VIII 6 2857966 - 00 1133

TGME49_chr 159535 ID=TGME49_310798;length=23

XI 0 1597733 + 83 1134

TGME49_chr 554150 ID=TGME49_305750;length= 19

IX 0 5543417 + 17 1135

TGME49_chr 414252 ID=TGME49_291130;length=20

IX 3 4144608 + 85 1136

TGME49_chr 790933 792672 + ID=TGME49_263110;length= 17 1137 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

Vllb 39

TGME49_chr 221047 ID=TGME49_203815 ;length= 10

Vila 7 2211546 - 69 1138

TGME49_chr 406376 ID=TGME49_272340;length= 18

VIII 4 4065564 + 00 1139

TGME49_chr ID=TGME49_266970;length=25

IX 862193 864770 + 77 1140

TGME49_chr 600535 ID=TGME49_237585;length=36

X 0 6009045 - 95 1141

TGME49_chr ID=TGME49_319650;length=38

IV 812682 816552 - 70 1142

TGME49_chr 135102 ID=TGME49_318410;length=21

IV 5 1353144 + 19 1143

TGME49_chr ID=TGME49_263323 ;length=41

Vllb 628035 632181 + 46 1144

TGME49_chr ID=TGME49_221590;length=32

II 464340 467546 + 06 1145

TGME49_chr 235318 ID=TGME49_203682;length=25

Vila 9 2355787 - 98 1146

TGME49_chr 136533 ID=TGME49_295060;length=26

la 1 1367979 + 48 1147

TGME49_chr 308637 ID=TGME49_313090;length=40

XI 9 3090390 + 11 1148

TGME49_chr ID=TGME49_221870;length=43

II 639786 644118 + 32 1149

TGME49_chr 128354 ID=TGME49_318490;length=39

IV 1 1287463 - 22 1150

TGME49_chr 254494 ID=TGME49_225070;length=35

X 8 2548532 - 84 1151

TGME49_chr ID=TGME49_263470;length=35

Vllb 541912 545460 - 48 1152

TGME49_chr 359156 ID=TGME49_290020;length= 15

IX 0 3593074 - 14 1153

TGME49_chr 392816 ID=TGME49_212310;length=43

X 3 3932517 - 54 1154

TGME49_chr 470129 ID=TGME49_255720;length= 16

Vllb 2 4702972 - 80 1155

TGME49_chr 550659 ID=TGME49_269940;length=38

VIII 3 5510411 + 18 1156

TGME49_chr ID=TGME49_320005 ;length=23

IV 592700 595047 + 47 1157

TGME49_chr 451962 ID=TGME49_256100;length= 18

Vllb 9 4521475 - 46 1158

TGME49_chr 184853 ID=TGME49_295320;length=22

la 3 1850781 - 48 1159

TGME49_chr 376111 ID=TGME49_223530;length=18

X 2 3762929 - 17 1160 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 383925 ID=TGME49_272620;length= 17

VIII 0 3840966 - 16 1161

TGME49_chr 553332 ID=TGME49_236850;length=32

X 4 5536592 + 68 1162

TGME49_chr 523329 ID=TGME49_235990;length=20

X 1 5235370 - 79 1163

TGME49_chr 181656 ID=TGME49_321430;length= 18

lb 4 1818369 + 05 1164

TGME49_chr 332700 ID=TGME49_313380;length=26

XI 9 3329669 + 60 1165

TGME49_chr 336348 ID=TGME49_289650;length= 16

IX 8 3365112 + 24 1166

TGME49_chr ID=TGME49_220310;length=62

V 287641 288269 - 8 1167

TGME49_chr 207783 ID=TGME49_297845 ;length= 19

II 2 2079808 + 76 1168

TGME49_chr 365773 ID=TGME49_258000;length= 17

Vllb 5 3659476 + 41 1169

TGME49_chr 203901 ID=TGME49_264760;length=41

IX 3 2043206 + 93 1170

TGME49_chr 698884 ID=TGME49_276810;length=21

XII 0 6990981 + 41 1171

TGME49_chr 361236 ID=TGME49_290160;length=26

IX 5 3615024 - 59 1172

TGME49_chr 125954 ID=TGME49_294898;length=35

la 5 1263126 - 81 1173

TGME49_chr 616571 ID=TGME49_216415 ;length= 11

XI 5 6166848 + 33 1174

TGME49_chr 351148 ID=TGME49_289890;length=14

IX 9 3512915 - 26 1175

TGME49_chr 174828 ID=TGME49_265130;length=41

IX 3 1752414 + 31 1176

TGME49_chr 101527 ID=TGME49_319340;length=37

IV 8 1019035 + 57 1177

TGME49_chr ID=TGME49_239770;length=26

VI 905412 908079 - 67 1178

TGME49_chr ID=TGME49_309220;length=49

XI 570018 574932 - 14 1179

TGME49_chr 222898 ID=TGME49_232690;length=24

VIII 3 2231436 + 53 1180

TGME49_chr 375637 ID=TGME49_313870;length=24

XI 5 3758818 + 43 1181

TGME49_chr 168333 ID=TGME49_310940;length= 15

XI 0 1684845 - 15 1182

TGME49_chr ID=TGME49_208580;length=21

lb 802516 804677 - 61 1183

TGME49_chr 307335 3075701 - ID=TGME49_283580;length=23 1184 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

V 0 51

TGME49_chr 243319 ID=TGME49_288270;length= 16

IX 9 2434876 - 77 1185

TGME49_chr 524749 ID=TGME49_270360;length= 16

VIII 9 5249101 + 02 1186

TGME49_chr ID=TGME49_293740;length=28

la 538347 541224 + 77 1187

TGME49_chr 103440 ID=TGME49_310055;length=35

XI 2 1037980 - 78 1188

TGME49_chr 427376 ID=TGME49_271930;length=36

VIII 5 4277392 + 27 1189

TGME49_chr 394221 ID=TGME49_212280;length=86

X 0 3943072 + 2 1190

TGME49_chr 642721 ID=TGME49_216060;length=21

XI 3 6429335 - 22 1191

TGME49_chr 577698 ID=TGME49_269470;length=38

VIII 4 5780816 - 32 1192

TGME49_chr 133406 ID=TGME49_240600;length= 13

VI 6 1335457 + 91 1193

TGME49_chr 423495 ID=TGME49_281640;length=24

Vila 9 4237402 + 43 1194

TGME49_chr 124054 ID=TGME49_213700;length= 19

V 1 1242474 + 33 1195

TGME49_chr 107905 ID=TGME49_227140;length=23

X 4 1081448 - 94 1196

TGME49_chr 236234 ID=TGME49_245540;length= 15

XII 1 2363912 - 71 1197

TGME49_chr 473595 ID=TGME49_255690;length=25

Vllb 0 4738492 - 42 1198

TGME49_chr ID=TGME49_264060;length=20

Vllb 215535 217628 - 93 1199

TGME49_chr 376768 ID=TGME49_313880;length=26

XI 5 3770308 + 23 1200

TGME49_chr ID=TGME49_263710;length=44

Vllb 382525 387001 - 76 1201

TGME49_chr 376414 ID=TGME49_272730;length=48

VIII 6 3769011 - 65 1202

TGME49_chr 251824 ID=TGME49_233230;length=49

VIII 2 2523170 + 28 1203

TGME49_chr 251298 ID=TGME49_225105 ;length= 17

X 1 2514745 + 64 1204

TGME49_chr 174733 ID=TGME49_226270;length=21

X 7 1749496 + 59 1205

TGME49_chr 255645 ID=TGME49_312280;length=49

XI 0 2561384 + 34 1206

TGME49_chr 150851 ID=TGME49_310670;length=26

XI 1 1511130 + 19 1207 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 135707 ID=TGME49_226800;length= 15

X 3 1358633 - 60 1208

TGME49_chr 185601 ID=TGME49_217460;length=39

XII 8 1859951 - 33 1209

TGME49_chr 103211 ID=TGME49_240060;length=31

VI 3 1035230 + 17 1210

TGME49_chr 475024 ID=TGME49_255660;length=29

Vllb 0 4753174 - 34 1211

TGME49_chr 248832 ID=TGME49_301450;length=26

IV 2 2491013 + 91 1212

TGME49_chr 239943 ID=TGME49_203610;length= 17

Vila 7 2401178 + 41 1213

TGME49_chr ID=TGME49_319870;length=31

IV 717427 720591 - 64 1214

TGME49_chr ID=TGME49_267450;length=25

IX 633939 636510 + 71 1215

TGME49_chr 106427 ID=TGME49_222350;length=28

II 7 1067130 - 53 1216

TGME49_chr 658751 ID=TGME49_214910;length=21

X 2 6589629 - 17 1217

TGME49_chr 110027 ID=TGME49_262650;length=28

Vllb 3 1103123 + 50 1218

TGME49_chr 401969 ID=TGME49_212150;length=20

X 3 4021700 - 07 1219

TGME49_chr ID=TGME49_319560;length=25

IV 910057 912578 + 21 1220

TGME49_chr 595462 ID=TGME49_216680;length=20

XI 3 5956672 + 49 1221

TGME49_chr ID=TGME49_262900;length=24

Vllb 936733 939153 + 20 1222

TGME49_chr 262448 ID=TGME49_288580;length= 16

IX 1 2626169 + 88 1223

TGME49_chr 611798 ID=TGME49_214220;length=22

X 5 6120211 + 26 1224

TGME49_chr 467057 ID=TGME49_292310;length= 11

IX 5 4671677 - 02 1225

TGME49_chr 158670 ID=TGME49_205140;length= 13

Vila 1 1588039 + 38 1226

TGME49_chr ID=TGME49_220380;length=35

V 343813 347336 - 23 1227

TGME49_chr 282454 ID=TGME49_203060;length=35

Vila 3 2828105 - 62 1228

TGME49_chr 284207 ID=TGME49_243750;length= 16

VI 2 2843767 - 95 1229

TGME49_chr 590166 ID=TGME49_278757;length=42

XII 4 5905896 - 32 1230

TGME49_chr 864169 866128 - ID=TGME49_266980;length=19 1231 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

IX 59

TGME49_chr ID=TGME49_220470;length=35

V 405382 408887 - 05 1232

TGME49_chr 251207 ID=TGME49_260140;length=23

Vllb 6 2514420 - 44 1233

TGME49_chr 209833 ID=TGME49_242600;length=23

VI 6 2100722 - 86 1234

TGME49_chr 150888 ID=TGME49_226610;length=21

X 4 1511053 - 69 1235

TGME49_chr 204366 ID=TGME49_204020;length=24

Vila 6 2046086 + 20 1236

TGME49_chr 103052 ID=TGME49_262760;length=49

Vllb 8 1035441 + 13 1237

TGME49_chr ID=TGME49_294420;length= 14

la 906891 908355 - 64 1238

TGME49_chr 394909 ID=TGME49_290925 ;length= 13

IX 6 3950443 - 47 1239

TGME49_chr 202116 ID=TGME49_286060;length=38

V 0 2025024 + 64 1240

TGME49_chr 333428 ID=TGME49_313385 ;length= 13

XI 6 3335654 - 68 1241

TGME49_chr 459233 ID=TGME49_292170;length=31

IX 7 4595459 - 22 1242

TGME49_chr ID=TGME49_219530;length= 17

XII 883327 885042 - 15 1243

TGME49_chr ID=TGME49_304720;length=44

Vila 646040 650529 + 89 1244

TGME49_chr 258296 ID=TGME49_245995;length=81

XII 8 2583778 + 0 1245

TGME49_chr 149253 ID=TGME49_205280;length=45

Vila 4 1497097 + 63 1246

TGME49_chr 652929 ID=TGME49_268340;length=28

VIII 2 6532178 - 86 1247

TGME49_chr 177547 ID=TGME49_226240;length=l 1

X 3 1776642 - 69 1248

TGME49_chr 328288 ID=TGME49_224160;length=46

X 6 3287547 + 61 1249

TGME49_chr 639792 ID=TGME49_216090;length=21

XI 0 6400037 + 17 1250

TGME49_chr 270652 ID=TGME49_243545;length=46

VI 9 2711143 + 14 1251

TGME49_chr ID=TGME49_308980;length=23

XI 416696 419057 - 61 1252

TGME49_chr 134225 ID=TGME49_209580;length=46

lb 7 1346867 + 10 1253

TGME49_chr 550975 ID=TGME49_236790;length=17

X 5 5511547 - 92 1254 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_309050;length=20

XI 463158 465197 - 39 1255

TGME49_chr ID=TGME49_238505 ;length= 13

VI 256977 258317 - 40 1256

TGME49_chr 691681 ID=TGME49_215343 ;length=39

X 3 6920734 - 21 1257

TGME49_chr 478934 ID=TGME49_249770;length= 14

XII 7 4790800 + 53 1258

TGME49_chr 222150 ID=TGME49_217910;length=39

XII 9 2225506 - 97 1259

TGME49_chr ID=TGME49_219810;length=37

XII 635109 638817 - 08 1260

TGME49_chr ID=TGME49_227360;length=47

X 996966 1001699 - 33 1261

TGME49_chr 475935 ID=TGME49_249720;length=21

XII 7 4761502 - 45 1262

TGME49_chr ID=TGME49_230580;length= 19

VIII 880481 882435 - 54 1263

TGME49_chr 239382 ID=TGME49_203630;length= 16

Vila 0 2395507 - 87 1264

TGME49_chr 396967 ID=TGME49_212230;length=28

X 3 3972540 - 67 1265

TGME49_chr 373071 ID=TGME49_201870;length=20

Vila 8 3732764 + 46 1266

TGME49_chr 100116 ID=TGME49_230820;length=22

VIII 9 1003387 + 18 1267

TGME49_chr 460067 ID=TGME49_292190;length= 13

IX 4 4601980 - 06 1268

TGME49_chr 131879 ID=TGME49_318460;length=22

IV 3 1321000 - 07 1269

TGME49_chr 500864 ID=TGME49_250050;length= 19

XII 5 5010570 - 25 1270

TGME49_chr 282110 ID=TGME49_233820;length=34

VIII 8 2824516 + 08 1271

TGME49_chr ID=TGME49_206470;length= 17

Vila 945242 947003 + 61 1272

TGME49_chr ID=TGME49_304670;length= 17

Vila 618512 620275 - 63 1273

TGME49_chr 543108 ID=TGME49_316610;length=20

XI 3 5433154 - 71 1274

TGME49_chr 389491 ID=TGME49_314042;length=23

XI 5 3897284 + 69 1275

TGME49_chr ID=TGME49_239885;length=35

VI 613785 617334 - 49 1276

TGME49_chr 608064 ID=TGME49_278470;length=14

XII 5 6082084 + 39 1277

TGME49_chr 453005 4533822 - ID=TGME49_249425 ;length=37 1278 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XII 7 65

TGME49_chr 498355 ID=TGME49_255240;length=31

Vllb 2 4986683 - 31 1279

TGME49_chr 616250 ID=TGME49_306540;length=37

IX 9 6166258 + 49 1280

TGME49_chr 119179 ID=TGME49_231000;length=35

VIII 0 1195369 - 79 1281

TGME49_chr 253967 ID=TGME49_260000;length=42

Vllb 1 2543923 - 52 1282

TGME49_chr ID=TGME49_263060;length=28

Vllb 849810 852648 - 38 1283

TGME49_chr ID=TGME49_263300;length=30

Vllb 648112 651155 - 43 1284

TGME49_chr 338476 ID=TGME49_289710;length=49

IX 3 3389725 + 62 1285

TGME49_chr 166770 ID=TGME49_211610;length=39

IV 4 1671687 - 83 1286

TGME49_chr 337067 ID=TGME49_244570;length=21

VI 0 3372867 - 97 1287

TGME49_chr 138453 ID=TGME49_295070;length=26

la 1 1387176 - 45 1288

TGME49_chr 138073 ID=TGME49_223045 ;length= 19

II 6 1382666 + 30 1289

TGME49_chr 405129 ID=TGME49_281370;length= 18

Vila 3 4053186 - 93 1290

TGME49_chr 395598 ID=TGME49_212250;length=23

X 5 3958323 + 38 1291

TGME49_chr ID=TGME49_238930;length= 18

VI 320087 321893 - 06 1292

TGME49_chr 182141 ID=TGME49_286450;length=40

V 2 1825437 + 25 1293

TGME49_chr 455866 ID=TGME49_234950;length=29

X 8 4561661 - 93 1294

TGME49_chr 151283 ID=TGME49_265510;length=34

IX 3 1516330 + 97 1295

TGME49_chr 679404 ID=TGME49_200320;length=26

VIII 5 6796697 + 52 1296

TGME49_chr 697826 ID=TGME49_215490;length=27

X 4 6981039 + 75 1297

TGME49_chr 217421 ID=TGME49_225540;length=19

X 1 2176205 - 94 1298

TGME49_chr 129822 ID=TGME49_231180;length=45

VIII 4 1302758 + 34 1299

TGME49_chr 214288 ID=TGME49_297960;length=47

II 2 2147668 - 86 1300

TGME49_chr 343369 ID=TGME49_247600;length=22

XII 0 3435922 + 32 1301 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 231750 ID=TGME49_203700;length=29

Vila 1 2320428 - 27 1302

TGME49_chr 667460 ID=TGME49_277490;length=32

XII 7 6677822 - 15 1303

TGME49_chr 423372 ID=TGME49_314700;length=42

XI 1 4237966 - 45 1304

TGME49_chr ID=TGME49_320230;length= 18

IV 425592 427430 - 38 1305

TGME49_chr 251486 ID=TGME49_203420;length=24

Vila 2 2517330 - 68 1306

TGME49_chr 457946 ID=TGME49_292140;length=45

IX 1 4584034 - 73 1307

TGME49_chr 560933 ID=TGME49_251850;length=28

XII 6 5612206 + 70 1308

TGME49_chr 234889 ID=TGME49_260330;length=57

Vllb 2 2349464 + 2 1309

TGME49_chr 182904 ID=TGME49_242010;length= 19

VI 1 1831040 - 99 1310

TGME49_chr 229890 ID=TGME49_232970;length=21

VIII 8 2301088 + 80 1311

TGME49_chr 505158 ID=TGME49_255175;length=81

Vllb 8 5052399 + 1 1312

TGME49_chr 183501 ID=TGME49_226068;length=22

X 7 1837256 - 39 1313

TGME49_chr 384247 ID=TGME49_313960;length= 16

XI 7 3844120 - 43 1314

TGME49_chr ID=TGME49_263085;length=24

Vllb 821627 824086 - 59 1315

TGME49_chr ID=TGME49_320570;length= 16

IV 250795 252483 + 88 1316

TGME49_chr 216446 ID=TGME49_225555;length=16

X 8 2166075 - 07 1317

TGME49_chr 110507 ID=TGME49_227070;length= 18

X 4 1106948 - 74 1318

TGME49_chr 140426 ID=TGME49_262050;length=29

Vllb 9 1407193 + 24 1319

TGME49_chr ID=TGME49_267520;length=32

IX 599004 602300 + 96 1320

TGME49_chr 327041 ID=TGME49_202510;length=39

Vila 0 3274311 - 01 1321

TGME49_chr 351669 ID=TGME49_273410;length=46

VIII 3 3521323 + 30 1322

TGME49_chr 262667 ID=TGME49_284670;length= 18

V 5 2628503 + 28 1323

TGME49_chr 734484 ID=TGME49_207020;length=30

X 9 7347942 - 93 1324

TGME49_chr 174135 1745785 + ID=TGME49_204500;length=44 1325 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

Vila 8 27

TGME49_chr 464624 ID=TGME49_315310;length=24

XI 3 4648666 - 23 1326

TGME49_chr 428128 ID=TGME49_257010;length=20

Vllb 2 4283353 + 71 1327

TGME49_chr 175572 ID=TGME49_321520;length=22

lb 0 1757922 + 02 1328

TGME49_chr 694038 ID=TGME49_215400;length= 16

X 4 6941995 + 11 1329

TGME49_chr 110807 ID=TGME49_240243 ;length=48

VI 2 1112890 - 18 1330

TGME49_chr 447249 ID=TGME49_249340;length=23

XII 0 4474867 - 77 1331

TGME49_chr ID=TGME49_206450;length=28

Vila 970566 973439 - 73 1332

TGME49_chr ID=TGME49_208300;length=22

lb 601001 603218 + 17 1333

TGME49_chr 576283 ID=TGME49_306035 ;length= 17

IX 8 5764549 + 11 1334

TGME49_chr ID=TGME49_267440;length=37

IX 653845 657554 - 09 1335

TGME49_chr 296749 ID=TGME49_224570;length=44

X 4 2971910 + 16 1336

TGME49_chr 528692 ID=TGME49_270310;length=28

VIII 6 5289744 - 18 1337

TGME49_chr 187793 ID=TGME49_232130;length=34

VIII 0 1881330 + 00 1338

TGME49_chr ID=TGME49_309865;length=27

XI 852387 855185 - 98 1339

TGME49_chr 626174 ID=TGME49_216280;length=35

XI 1 6265340 - 99 1340

TGME49_chr 271211 ID=TGME49_224890;length=21

X 4 2714287 - 73 1341

TGME49_chr 259270 ID=TGME49_246010;length=33

XII 7 2596028 - 21 1342

TGME49_chr 167383 ID=TGME49_226380;length= 16

X 7 1675489 + 52 1343

TGME49_chr 478412 ID=TGME49_249750;length= 16

XII 5 4785752 - 27 1344

TGME49_chr 137964 ID=TGME49_262090;length=29

Vllb 9 1382568 + 19 1345

TGME49_chr ID=TGME49_263850;length=20

Vllb 292075 294154 - 79 1346

TGME49_chr 258296 ID=TGME49_246000;length=20

XII 8 2585062 + 94 1347

TGME49_chr 548089 ID=TGME49_305580;length=21

IX 7 5483034 - 37 1348 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_267370;length=34

IX 678959 682427 + 68 1349

TGME49_chr 128334 ID=TGME49_266130;length=49

IX 8 1288325 - 77 1350

TGME49_chr 652819 ID=TGME49_215910;length=37

XI 1 6531898 + 07 1351

TGME49_chr 453672 ID=TGME49_256060;length=22

Vllb 5 4538969 + 44 1352

TGME49_chr ID=TGME49_267050;length= 17

IX 800121 801864 + 43 1353

TGME49_chr 285782 ID=TGME49_312635 ;length=49

XI 0 2862733 + 13 1354

TGME49_chr 217378 ID=TGME49_203870;length=23

Vila 0 2176161 - 81 1355

TGME49_chr 176137 ID=TGME49_286610;length=21

V 2 1763556 + 84 1356

TGME49_chr 265791 ID=TGME49_233450;length=30

VIII 2 2660930 + 18 1357

TGME49_chr 265617 ID=TGME49_203230;length=33

Vila 5 2659506 + 31 1358

TGME49_chr 392731 ID=TGME49_257670;length=25

Vllb 6 3929847 - 31 1359

TGME49_chr 242926 ID=TGME49_288265 ;length= 19

IX 6 2431241 - 75 1360

TGME49_chr 220746 ID=TGME49_232650;length=21

VIII 8 2209626 - 58 1361

TGME49_chr ID=TGME49_320130;length=37

IV 482747 486460 - 13 1362

TGME49_chr 179251 ID=TGME49_261300;length=40

Vllb 3 1796589 - 76 1363

TGME49_chr 608341 ID=TGME49_214180;length=28

X 0 6086247 + 37 1364

TGME49_chr 674800 ID=TGME49_215080;length=20

X 4 6750095 - 91 1365

TGME49_chr 491979 ID=TGME49_235500;length=46

X 3 4924423 - 30 1366

TGME49_chr 336574 ID=TGME49_202390;length=37

Vila 5 3369481 - 36 1367

TGME49_chr ID=TGME49_280780;length=29

Vila 88266 91194 + 28 1368

TGME49_chr 672550 ID=TGME49_277080;length=31

XII 4 6728703 - 99 1369

TGME49_chr 569977 ID=TGME49_269660;length=29

VIII 8 5702719 + 41 1370

TGME49_chr ID=TGME49_320740;length=26

IV 101470 104091 + 21 1371

TGME49_chr 431779 4319648 - ID=TGME49_314820;length= 18 1372 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XI 9 49

TGME49_chr 172739 ID=TGME49_321570;length= 17

lb 6 1729180 - 84 1373

TGME49_chr 384173 ID=TGME49_313970;length= 17

XI 5 3843470 + 35 1374

TGME49_chr 328643 ID=TGME49_258650;length=24

Vllb 1 3288841 - 10 1375

TGME49_chr 270638 ID=TGME49_288710;length= 16

IX 9 2708077 - 88 1376

TGME49_chr ID=TGME49_227570;length=38

X 912881 916723 - 42 1377

TGME49_chr 176220 ID=TGME49_261410;length=20

Vllb 9 1764225 + 16 1378

TGME49_chr 272324 ID=TGME49_312490;length=21

XI 0 2725414 + 74 1379

TGME49_chr ID=TGME49_266840;length=18

IX 932063 933944 - 81 1380

TGME49_chr ID=TGME49_293390;length=29

la 296552 299464 + 12 1381

TGME49_chr 312560 ID=TGME49_244210;length=22

VI 5 3127814 - 09 1382

TGME49_chr 206793 ID=TGME49_232380;length= 16

VIII 0 2069608 - 78 1383

TGME49_chr ID=TGME49_309390;length=44

XI 656827 661285 + 58 1384

TGME49_chr 184684 ID=TGME49_217450;length=l 1

XII 1 1847991 - 50 1385

TGME49_chr ID=TGME49_263000;length= 18

Vllb 863548 865419 + 71 1386

TGME49_chr ID=TGME49_309920;length=22

XI 900050 902305 + 55 1387

TGME49_chr ID=TGME49_263490;length=21

Vllb 523777 525912 + 35 1388

TGME49_chr ID=TGME49_320170;length=48

IV 452586 457475 - 89 1389

TGME49_chr 434515 ID=TGME49_291690;length=32

IX 5 4348378 - 23 1390

TGME49_chr 573532 ID=TGME49_237140;length= 11

X 1 5736518 + 97 1391

TGME49_chr ID=TGME49_213380;length=22

V 971313 973576 - 63 1392

TGME49_chr 366801 ID=TGME49_223660;length=19

X 1 3669970 + 59 1393

TGME49_chr 295652 ID=TGME49_274170;length=49

VIII 9 2961446 - 17 1394

TGME49_chr ID=TGME49_208470;length=49

lb 714291 719214 + 23 1395 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 400940 ID=TGME49_248580;length= 14

XII 0 4010849 + 49 1396

TGME49_chr 254971 ID=TGME49_285190;length=26

V 2 2552335 - 23 1397

TGME49_chr 192728 ID=TGME49_297700;length=81

II 1 1928097 + 6 1398

TGME49_chr 268952 ID=TGME49_233510;length=88

VIII 1 2690407 - 6 1399

TGME49_chr 222073 ID=TGME49_203800;length=43

Vila 1 2225084 + 53 1400

TGME49_chr 279818 ID=TGME49_243700;length=35

VI 9 2801702 + 13 1401

TGME49_chr ID=TGME49_309790;length= 19

XI 810675 812617 + 42 1402

TGME49_chr ID=TGME49_293720;length=36

la 521377 525024 + 47 1403

TGME49_chr 427582 ID=TGME49_248970;length=40

XII 5 4279900 + 75 1404

TGME49_chr ID=TGME49_222000;length= 18

II 748920 750747 - 27 1405

TGME49_chr 440080 ID=TGME49_291920;length=39

IX 3 4404708 + 05 1406

TGME49_chr 550725 ID=TGME49_269950;length=38

VIII 3 5511071 - 18 1407

TGME49_chr 139843 ID=TGME49_218590;length=29

XII 5 1401335 - 00 1408

TGME49_chr 115439 ID=TGME49_262568;length=46

Vllb 8 1159024 + 26 1409

TGME49_chr 283468 ID=TGME49_259260;length=34

Vllb 4 2838145 - 61 1410

TGME49_chr 102039 ID=TGME49_262800;length=23

Vllb 3 1022780 - 87 1411

TGME49_chr ID=TGME49_227620;length=36

X 871374 874994 - 20 1412

TGME49_chr ID=TGME49_293350;length=22

la 270952 273178 + 26 1413

TGME49_chr 457952 ID=TGME49_315210;length=44

XI 1 4583971 - 50 1414

TGME49_chr 211473 ID=TGME49_225710;length=35

X 7 2118243 - 06 1415

TGME49_chr ID=TGME49_309930;length=42

XI 915530 919805 - 75 1416

TGME49_chr 487465 ID=TGME49_270925 ;length=46

VIII 8 4879348 - 90 1417

TGME49_chr 239690 ID=TGME49_260280;length= 10

Vllb 4 2397972 - 68 1418

TGME49_chr 433993 4342675 - ID=TGME49_271840;length=27 1419 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 3 42

TGME49_chr 197882 ID=TGME49_297760;length=30

II 6 1981903 + 77 1420

TGME49_chr ID=TGME49_239510;length=28

VI 724098 726949 + 51 1421

TGME49_chr ID=TGME49_229710;length=23

VIII 394955 397268 - 13 1422

TGME49_chr 253834 ID=TGME49_225080;length= 18

X 6 2540172 + 26 1423

TGME49_chr 395703 ID=TGME49_212260;length=21

X 3 3959160 - 27 1424

TGME49_chr 603620 ID=TGME49_269160;length=21

VIII 4 6038366 - 62 1425

TGME49_chr 150704 ID=TGME49_295770;length=41

la 1 1511154 + 13 1426

TGME49_chr ID=TGME49_280660;length= 17

Vila 256069 257838 - 69 1427

TGME49_chr ID=TGME49_219540;length= 17

XII 876562 878297 - 35 1428

TGME49_chr 438632 ID=TGME49_282100;length=42

Vila 1 4390549 - 28 1429

TGME49_chr 734443 ID=TGME49_207040;length=21

X 2 7346618 + 86 1430

TGME49_chr 456050 ID=TGME49_271335;length=20

VIII 5 4562544 + 39 1431

TGME49_chr 431686 ID=TGME49_314830;length=24

XI 3 4319362 + 99 1432

TGME49_chr 142280 ID=TGME49_262030;length= 18

Vllb 8 1424669 - 61 1433

TGME49_chr 228521 ID=TGME49_232830;length=22

VIII 2 2287451 - 39 1434

TGME49_chr 399253 ID=TGME49_201170;length=37

Vila 3 3996332 - 99 1435

TGME49_chr 100400 ID=TGME49_310010;length=28

XI 0 1006816 + 16 1436

TGME49_chr 595729 ID=TGME49_306310;length=24

IX 9 5959752 - 53 1437

TGME49_chr 734443 ID=TGME49_207030;length= 12

X 2 7345679 + 47 1438

TGME49_chr 425360 ID=TGME49_257050;length=25

Vllb 2 4256192 + 90 1439

TGME49_chr 145231 ID=TGME49_226670;length=24

X 2 1454768 - 56 1440

TGME49_chr ID=TGME49_220570;length=22

V 469797 472077 - 80 1441

TGME49_chr 209440 ID=TGME49_264710;length=23

IX 8 2096713 + 05 1442 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 366260 ID=TGME49_202060;length=34

Vila 2 3666008 + 06 1443

TGME49_chr 325384 ID=TGME49_247320;length= 18

XII 1 3255658 - 17 1444

TGME49_chr ID=TGME49_208040;length=30

lb 547531 550624 - 93 1445

TGME49_chr 213312 ID=TGME49_264660;length=25

IX 8 2135664 + 36 1446

TGME49_chr 129245 ID=TGME49_266120;length=28

IX 8 1295329 - 71 1447

TGME49_chr 132898 ID=TGME49_310430;length=36

XI 0 1332607 + 27 1448

TGME49_chr 627078 ID=TGME49_214370;length=20

X 5 6272853 - 68 1449

TGME49_chr ID=TGME49_264000;length=24

Vllb 251585 254066 - 81 1450

TGME49_chr 161272 ID=TGME49_240990;length= 14

VI 6 1614168 + 42 1451

TGME49_chr 153014 ID=TGME49_265500;length=19

IX 2 1532118 - 76 1452

TGME49_chr 623274 ID=TGME49_306630;length= 15

IX 9 6234329 - 80 1453

TGME49_chr 475370 ID=TGME49_315495 ;length= 15

XI 1 4755221 + 20 1454

TGME49_chr 384433 ID=TGME49_201700;length=24

Vila 8 3846794 - 56 1455

TGME49_chr 105843 ID=TGME49_219222;length=35

XII 8 1061957 - 19 1456

TGME49_chr 363265 ID=TGME49_273140;length=20

VIII 4 3634741 + 87 1457

TGME49_chr 503287 ID=TGME49_270740;length=21

VIII 1 5035049 - 78 1458

TGME49_chr ID=TGME49_293010;length=35

la 89272 92865 + 93 1459

TGME49_chr 268641 ID=TGME49_246140;length=43

XII 9 2690798 - 79 1460

TGME49_chr 219708 ID=TGME49_217880;length=34

XII 1 2200494 + 13 1461

TGME49_chr 453678 ID=TGME49_315165 ;length=36

XI 2 4540443 + 61 1462

TGME49_chr ID=TGME49_219690;length= 14

XII 727484 728939 - 55 1463

TGME49_chr 469993 ID=TGME49_255715 ;length=21

Vllb 7 4702102 + 65 1464

TGME49_chr 457370 ID=TGME49_255980;length=30

Vllb 0 4576785 + 85 1465

TGME49_chr 475047 477267 - ID=TGME49_263570;length=22 1466 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

Vllb 20

TGME49_chr 170580 ID=TGME49_286720;length=46

V 2 1710404 + 02 1467

TGME49_chr 232667 ID=TGME49_285650;length=25

V 7 2329241 - 64 1468

TGME49_chr 164165 ID=TGME49_321660;length=34

lb 0 1645144 + 94 1469

TGME49_chr 219884 ID=TGME49_232640;length=44

VIII 3 2203273 + 30 1470

TGME49_chr 153288 ID=TGME49_287180;length=32

V 3 1536115 + 32 1471

TGME49_chr 190895 ID=TGME49_286230;length=38

V 0 1912837 - 87 1472

TGME49_chr 441345 ID=TGME49_249270;length=36

XII 7 4417115 + 58 1473

TGME49_chr 589883 ID=TGME49_306250;length= 19

IX 2 5900782 - 50 1474

TGME49_chr 415476 ID=TGME49_257290;length=21

Vllb 5 4156893 - 28 1475

TGME49_chr 126891 ID=TGME49_231150;length=20

VIII 4 1270935 + 21 1476

TGME49_chr ID=TGME49_262870;length= 19

Vllb 971154 973119 - 65 1477

TGME49_chr ID=TGME49_320515 ;length=28

IV 282678 285486 + 08 1478

TGME49_chr 688077 ID=TGME49_276880;length=29

XII 2 6883692 + 20 1479

TGME49_chr 603275 ID=TGME49_214110;length=32

X 6 6036014 + 58 1480

TGME49_chr 406111 ID=TGME49_212100;length=25

X 5 4063643 + 28 1481

TGME49_chr 249899 ID=TGME49_285240;length=25

V 9 2501500 - 01 1482

TGME49_chr 172414 ID=TGME49_218188 ;length=46

XII 4 1728800 - 56 1483

TGME49_chr 543718 ID=TGME49_305485 ;length= 15

IX 6 5438779 - 93 1484

TGME49_chr 679558 ID=TGME49_200310;length= 18

VIII 9 6797461 - 72 1485

TGME49_chr 182941 ID=TGME49_265030;length= 12

IX 5 1830688 - 73 1486

TGME49_chr 182826 ID=TGME49_242020;length=20

VI 5 1830308 + 43 1487

TGME49_chr 201646 ID=TGME49_232320;length=23

VIII 3 2018840 - 77 1488

TGME49_chr 374085 ID=TGME49_313852;length=30

XI 5 3743951 - 96 1489 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 427510 ID=TGME49_257030;length=20

Vllb 9 4277123 - 14 1490

TGME49_chr 676798 ID=TGME49_215120;length= 17

X 1 6769751 - 70 1491

TGME49_chr 207575 ID=TGME49_203970;length=l 1

Vila 6 2076930 + 74 1492

TGME49_chr 241685 ID=TGME49_260260;length= 15

Vllb 6 2418383 - 27 1493

TGME49_chr 377993 ID=TGME49_290630;length=50

IX 1 3784973 + 42 1494

TGME49_chr ID=TGME49_263390;length=33

Vllb 581365 584740 + 75 1495

TGME49_chr 446646 ID=TGME49_249330;length=20

XII 6 4468532 - 66 1496

TGME49_chr 185898 ID=TGME49_297530;length=17

II 9 1860788 + 99 1497

TGME49_chr ID=TGME49_220840;length=30

II 22256 25319 - 63 1498

TGME49_chr 386864 ID=TGME49_290730;length= 14

IX 3 3870136 + 93 1499

TGME49_chr ID=TGME49_309200;length=32

XI 551152 554422 + 70 1500

TGME49_chr 142227 ID=TGME49_209730;length=22

lb 7 1424478 - 01 1501

TGME49_chr 183090 ID=TGME49_265020;length=20

IX 8 1832938 + 30 1502

TGME49_chr 283946 ID=TGME49_246535;length=19

XII 3 2841372 - 09 1503

TGME49_chr 583051 ID=TGME49_269413 ;length= 15

VIII 0 5832051 - 41 1504

TGME49_chr 316069 ID=TGME49_258826;length= 18

Vllb 0 3162547 + 57 1505

TGME49_chr 430731 ID=TGME49_314810;length=25

XI 3 4309840 + 27 1506

TGME49_chr ID=TGME49_229730;length=29

VIII 406789 409738 - 49 1507

TGME49_chr 245614 ID=TGME49_245670;length=29

XII 0 2459097 + 57 1508

TGME49_chr 261948 ID=TGME49_203290;length=21

Vila 7 2621645 - 58 1509

TGME49_chr 192428 ID=TGME49_286190;length=28

V 2 1927117 + 35 1510

TGME49_chr ID=TGME49_239500;length=45

VI 716743 721319 + 76 1511

TGME49_chr 577009 ID=TGME49_237180;length= 19

X 0 5772025 - 35 1512

TGME49_chr 283297 2834932 - ID=TGME49_246520;length= 19 1513 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XII 0 62

TGME49_chr 549979 ID=TGME49_251620;length=22

XII 5 5502016 - 21 1514

TGME49_chr 399607 ID=TGME49_272400;length=30

VIII 4 3999088 + 14 1515

TGME49_chr 196156 ID=TGME49_264860;length=20

IX 7 1963588 - 21 1516

TGME49_chr 102839 ID=TGME49_206380;length=41

Vila 3 1032556 + 63 1517

TGME49_chr 300474 ID=TGME49_224530;length=33

X 1 3008120 - 79 1518

TGME49_chr 171095 ID=TGME49_297350;length=35

II 1 1714500 - 49 1519

TGME49_chr ID=TGME49_220260;length= 19

V 244391 246292 + 01 1520

TGME49_chr 444690 ID=TGME49_234540;length=25

X 2 4449402 - 00 1521

TGME49_chr 341079 ID=TGME49_273540;length=21

VIII 3 3412947 + 54 1522

TGME49_chr 368619 ID=TGME49_313775;length=87

XI 0 3687061 + 1 1523

TGME49_chr 196199 ID=TGME49_211030;length=27

IV 9 1964761 - 62 1524

TGME49_chr 421752 ID=TGME49_257100;length=32

Vllb 7 4220783 + 56 1525

TGME49_chr 235526 ID=TGME49_301260;length=l 1

IV 1 2356404 - 43 1526

TGME49_chr 197865 ID=TGME49_261000;length=20

Vllb 5 1980752 + 97 1527

TGME49_chr 170743 ID=TGME49_218195 ;length=39

XII 4 1711430 - 96 1528

TGME49_chr 262670 ID=TGME49_243430;length=35

VI 7 2630269 - 62 1529

TGME49_chr 308652 ID=TGME49_258930;length= 15

Vllb 3 3088120 - 97 1530

TGME49_chr ID=TGME49_208710;length=20

lb 820471 822527 - 56 1531

TGME49_chr 609719 ID=TGME49_214190;length=22

X 7 6099437 - 40 1532

TGME49_chr 574772 ID=TGME49_237150;length=24

X 4 5750210 - 86 1533

TGME49_chr 105214 ID=TGME49_209120;length= 16

lb 6 1053746 + 00 1534

TGME49_chr ID=TGME49_263080;length=19

Vllb 821098 823077 + 79 1535

TGME49_chr 127652 ID=TGME49_218830;length=42

XII 1 1280802 - 81 1536 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 131425 ID=TGME49_218780;length= 16

XII 4 1315857 - 03 1537

TGME49_chr 380831 ID=TGME49_272660;length=32

VIII 6 3811592 + 76 1538

TGME49_chr 333992 ID=TGME49_247440;length=40

XII 7 3343970 - 43 1539

TGME49_chr ID=TGME49_228020;length=76

X 669041 669809 - 8 1540

TGME49_chr 180142 ID=TGME49_295380;length=24

la 6 1803846 - 20 1541

TGME49_chr 115243 ID=TGME49_266400;length=35

IX 5 1155997 + 62 1542

TGME49_chr 473617 ID=TGME49_249685;length=19

XII 6 4738097 - 21 1543

TGME49_chr 646264 ID=TGME49_268400;length=21

VIII 6 6464783 - 37 1544

TGME49_chr 240592 ID=TGME49_225200;length=27

X 5 2408633 + 08 1545

TGME49_chr 425066 ID=TGME49_234270;length=36

X 7 4254283 + 16 1546

TGME49_chr 258067 ID=TGME49_237835;length=25

IV 6 2583197 - 21 1547

TGME49_chr 283771 ID=TGME49_224760;length=44

X 7 2842141 - 24 1548

TGME49_chr 122549 ID=TGME49_205658 ;length=26

Vila 0 1228154 + 64 1549

TGME49_chr ID=TGME49_320150;length=35

IV 472782 476325 - 43 1550

TGME49_chr 249532 ID=TGME49_233200;length=19

VIII 6 2497228 - 02 1551

TGME49_chr 139824 ID=TGME49_265870;length=26

IX 6 1400896 - 50 1552

TGME49_chr ID=TGME49_263135;length=32

Vllb 779047 782310 - 63 1553

TGME49_chr 249938 ID=TGME49_203440;length= 19

Vila 7 2501343 + 56 1554

TGME49_chr 317934 ID=TGME49_247230;length=22

XII 3 3181561 + 18 1555

TGME49_chr ID=TGME49_262940;length=94

Vllb 906498 907438 + 0 1556

TGME49_chr 310608 ID=TGME49_224290;length=30

X 6 3109088 + 02 1557

TGME49_chr 297253 ID=TGME49_259100;length= 16

Vllb 6 2974185 - 49 1558

TGME49_chr ID=TGME49_229620;length=49

VIII 322273 327193 - 20 1559

TGME49_chr 272641 2728299 - ID=TGME49_243570;length=18 1560 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VI 9 80

TGME49_chr 176347 ID=TGME49_321510;length= 13

lb 6 1764810 - 34 1561

TGME49_chr 191593 ID=TGME49_232170;length= 14

VIII 6 1917426 - 90 1562

TGME49_chr 119838 ID=TGME49_310260;length=48

XI 1 1203275 - 94 1563

TGME49_chr 116497 ID=TGME49_213600;length= 18

V 0 1166774 - 04 1564

TGME49_chr 112045 ID=TGME49_219125 ;length=35

XII 9 1123972 - 13 1565

TGME49_chr 308796 ID=TGME49_313080;length=42

XI 0 3092164 - 04 1566

TGME49_chr 197658 ID=TGME49_311335;length=33

XI 5 1979964 + 79 1567

TGME49_chr 505955 ID=TGME49_250340;length=23

XII 4 5061900 - 46 1568

TGME49_chr ID=TGME49_267330;length= 16

IX 708789 710487 + 98 1569

TGME49_chr 287448 ID=TGME49_243910;length=43

VI 3 2878854 + 71 1570

TGME49_chr 385749 ID=TGME49_223420;length= 19

X 6 3859431 - 35 1571

TGME49_chr ID=TGME49_280690;length=20

Vila 176036 178132 + 96 1572

TGME49_chr 563791 ID=TGME49_236980;length=25

X 9 5640447 + 28 1573

TGME49_chr 598127 ID=TGME49_269225 ;length= 15

VIII 3 5982851 - 78 1574

TGME49_chr 182111 ID=TGME49_211330;length=41

IV 8 1825235 + 17 1575

TGME49_chr 254869 ID=TGME49_285180;length=29

V 9 2551643 + 44 1576

TGME49_chr 198929 ID=TGME49_260990;length= 16

Vllb 0 1990938 + 48 1577

TGME49_chr 183466 ID=TGME49_226060;length= 16

X 9 1836342 + 73 1578

TGME49_chr ID=TGME49_213050;length=25

V 817024 819593 + 69 1579

TGME49_chr 223431 ID=TGME49_232710;length= 15

VIII 8 2235909 + 91 1580

TGME49_chr 314787 ID=TGME49_258832;length=25

Vllb 5 3150412 + 37 1581

TGME49_chr 474004 ID=TGME49_255680;length=19

Vllb 3 4742016 - 73 1582

TGME49_chr 358534 ID=TGME49_273340;length=23

VIII 9 3587718 - 69 1583 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 161949 ID=TGME49_241000;length=29

VI 3 1622431 - 38 1584

TGME49_chr 323548 ID=TGME49_244400;length=45

VI 2 3240036 + 54 1585

TGME49_chr 462431 ID=TGME49_271250;length=20

VIII 2 4626375 - 63 1586

TGME49_chr 126040 ID=TGME49_318525;length=36

IV 0 1264074 - 74 1587

TGME49_chr 294621 ID=TGME49_246730;length= 19

XII 7 2948125 + 08 1588

TGME49_chr 110255 ID=TGME49_213490;length= 18

V 6 1104367 + 11 1589

TGME49_chr 106452 ID=TGME49_262715 ;length=25

Vllb 8 1067088 - 60 1590

TGME49_chr ID=TGME49_320040;length= 13

IV 564447 565815 - 68 1591

TGME49_chr 129591 ID=TGME49_213760;length=22

V 8 1298121 - 03 1592

TGME49_chr 185918 ID=TGME49_232100;length= 17

VIII 1 1860943 - 62 1593

TGME49_chr 585213 ID=TGME49_216840;length=22

XI 7 5854406 + 69 1594

TGME49_chr 604718 ID=TGME49_214130;length=31

X 1 6050358 + 77 1595

TGME49_chr ID=TGME49_212937;length=34

V 702364 705795 + 31 1596

TGME49_chr ID=TGME49_228065 ;length= 14

X 644166 645619 - 53 1597

TGME49_chr ID=TGME49_208440;length=27

lb 707596 710331 + 35 1598

TGME49_chr 425402 ID=TGME49_271970;length=28

VIII 5 4256840 + 15 1599

TGME49_chr 175267 ID=TGME49_204490;length=20

Vila 7 1754734 - 57 1600

TGME49_chr 303126 ID=TGME49_289160;length= 13

IX 0 3032617 + 57 1601

TGME49_chr 161058 ID=TGME49_295690;length=19

la 9 1612536 - 47 1602

TGME49_chr 365228 ID=TGME49_248100;length=27

XII 4 3655006 + 22 1603

TGME49_chr 225328 ID=TGME49_285760;length= 16

V 5 2254950 - 65 1604

TGME49_chr ID=TGME49_294010;length=20

la 644712 646745 - 33 1605

TGME49_chr 263484 ID=TGME49_259890;length=24

Vllb 2 2637275 - 33 1606

TGME49_chr 335759 3360110 - ID=TGME49_247460;length=25 1607 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XII 4 16

TGME49_chr ID=TGME49_207950;length=33

lb 458487 461835 - 48 1608

TGME49_chr ID=TGME49_221390;length=29

II 309176 312120 - 44 1609

TGME49_chr 524097 ID=TGME49_250810;length=20

XII 5 5243070 - 95 1610

TGME49_chr 150106 ID=TGME49_265530;length=36

IX 4 1504713 - 49 1611

TGME49_chr ID=TGME49_207420;length= 15

lb 48324 49906 - 82 1612

TGME49_chr 296070 ID=TGME49_202880;length=23

Vila 9 2963102 + 93 1613

TGME49_chr 275930 ID=TGME49_246230;length=32

XII 3 2762582 - 79 1614

TGME49_chr 336429 ID=TGME49_202380;length=39

Vila 1 3368281 + 90 1615

TGME49_chr 584747 ID=TGME49_269380;length=39

VIII 1 5851413 + 42 1616

TGME49_chr 280391 ID=TGME49_312590;length=46

XI 5 2808538 + 23 1617

TGME49_chr 358727 ID=TGME49_313665;length=23

XI 5 3589602 + 27 1618

TGME49_chr 416832 ID=TGME49_248780;length=14

XII 0 4169758 + 38 1619

TGME49_chr 507445 ID=TGME49_235720;length= 15

X 3 5075975 + 22 1620

TGME49_chr 212756 ID=TGME49_297950;length=24

II 2 2129984 + 22 1621

TGME49_chr ID=TGME49_262970;length=36

Vllb 894167 897847 - 80 1622

TGME49_chr 242077 ID=TGME49_233110;length=18

VIII 8 2422638 + 60 1623

TGME49_chr 543063 ID=TGME49_251490;length=23

XII 4 5433016 - 82 1624

TGME49_chr ID=TGME49_228280;length=38

X 438371 442269 + 98 1625

TGME49_chr 205334 ID=TGME49_286020;length=47

V 2 2058060 - 18 1626

TGME49_chr 154837 ID=TGME49_265450;length=49

IX 0 1553308 - 38 1627

TGME49_chr 136166 ID=TGME49_209610;length=44

lb 5 1366149 - 84 1628

TGME49_chr 116050 ID=TGME49_222440;length= 18

II 6 1162386 - 80 1629

TGME49_chr 382562 ID=TGME49_248370;length=17

XII 5 3827372 + 47 1630 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_294200;length=31

la 695084 698240 + 56 1631

TGME49_chr 639121 ID=TGME49_216130;length= 16

XI 6 6392903 - 87 1632

TGME49_chr ID=TGME49_309095;length=l 1

XI 496472 497645 + 73 1633

TGME49_chr 210503 ID=TGME49_242610;length= 19

VI 6 2106954 - 18 1634

TGME49_chr ID=TGME49_279550;length=17

Vila 20277 22070 - 93 1635

TGME49_chr 558816 ID=TGME49_269810;length=29

VIII 3 5591148 + 85 1636

TGME49_chr 164377 ID=TGME49_297245 ;length=27

II 6 1646520 - 44 1637

TGME49_chr 166131 ID=TGME49_218230;length=24

XII 4 1663812 - 98 1638

TGME49_chr 195874 ID=TGME49_204140;length=24

Vila 7 1961164 - 17 1639

TGME49_chr 131776 ID=TGME49_318450;length=26

IV 5 1320416 + 51 1640

TGME49_chr 455187 ID=TGME49_234930;length=42

X 1 4556156 - 85 1641

TGME49_chr 136116 ID=TGME49_213830;length=22

V 7 1363437 - 70 1642

TGME49_chr 213539 ID=TGME49_285940;length=27

V 8 2138100 - 02 1643

TGME49_chr 428827 ID=TGME49_271910;length=26

VIII 0 4290925 + 55 1644

TGME49_chr 544889 ID=TGME49_236620;length=19

X 6 5450860 + 64 1645

TGME49_chr 717048 ID=TGME49_275310;length=46

X 2 7175111 - 29 1646

TGME49_chr 309234 ID=TGME49_289218 ;length=42

IX 4 3096555 - 11 1647

TGME49_chr ID=TGME49_319590;length=23

IV 866827 869130 + 03 1648

TGME49_chr 610965 ID=TGME49_214200;length= 18

X 6 6111516 - 60 1649

TGME49_chr ID=TGME49_294210;length=34

la 716367 719854 - 87 1650

TGME49_chr 252074 ID=TGME49_288420;length=27

IX 2 2523456 - 14 1651

TGME49_chr 336988 ID=TGME49_289660;length=14

IX 0 3371358 - 78 1652

TGME49_chr ID=TGME49_228240;length=21

X 456771 458903 + 32 1653

TGME49_chr 296950 2972803 + ID=TGME49_259090;length=33 1654 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

Vllb 0 03

TGME49_chr ID=TGME49_263210;length=48

Vllb 711535 716415 - 80 1655

TGME49_chr 212253 ID=TGME49_232500;length=37

VIII 8 2126263 - 25 1656

TGME49_chr 209693 ID=TGME49_210750;length=21

IV 4 2099104 - 70 1657

TGME49_chr ID=TGME49_208800;length=33

lb 863212 866544 + 32 1658

TGME49_chr 521978 ID=TGME49_270520;length=22

VIII 5 5222055 - 70 1659

TGME49_chr 128526 ID=TGME49_240450;length=28

VI 1 1288152 - 91 1660

TGME49_chr 303176 ID=TGME49_202790;length= 12

Vila 3 3033039 - 76 1661

TGME49_chr 323540 ID=TGME49_313277;length= 16

XI 7 3237049 - 42 1662

TGME49_chr 446034 ID=TGME49_249325 ;length= 16

XII 9 4461980 - 31 1663

TGME49_chr 106209 ID=TGME49_262710;length=38

Vllb 8 1065932 + 34 1664

TGME49_chr 215542 ID=TGME49_232550;length=26

VIII 8 2158102 + 74 1665

TGME49_chr ID=TGME49_267820;length=24

IX 325672 328132 + 60 1666

TGME49_chr 515729 ID=TGME49_316150;length= 14

XI 9 5158701 + 02 1667

TGME49_chr 484234 ID=TGME49_255400;length=49

Vllb 7 4847324 + 77 1668

TGME49_chr 228767 ID=TGME49_225360;length=17

X 8 2289457 - 79 1669

TGME49_chr 381905 ID=TGME49_201750;length=37

Vila 8 3822835 + 77 1670

TGME49_chr 484393 ID=TGME49_235398;length=18

X 2 4845765 - 33 1671

TGME49_chr 114500 ID=TGME49_318690;length=20

IV 0 1147010 + 10 1672

TGME49_chr 558864 ID=TGME49_251780;length=22

XII 6 5590856 - 10 1673

TGME49_chr 612911 ID=TGME49_216470;length=49

XI 4 6134065 - 51 1674

TGME49_chr ID=TGME49_239083 ;length= 17

VI 455338 457067 - 29 1675

TGME49_chr 328988 ID=TGME49_273750;length=48

VIII 3 3294737 + 54 1676

TGME49_chr 368921 ID=TGME49_273060;length= 18

VIII 9 3691099 + 80 1677 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 332658 ID=TGME49_224100;length= 16

X 9 3328196 - 07 1678

TGME49_chr 106083 ID=TGME49_206320;length=23

Vila 3 1063141 + 08 1679

TGME49_chr 130177 ID=TGME49_222910;length=35

II 6 1305290 + 14 1680

TGME49_chr 151226 ID=TGME49_297065 ;length=23

II 3 1514584 + 21 1681

TGME49_chr 303073 ID=TGME49_274090;length=31

VIII 3 3033855 + 22 1682

TGME49_chr ID=TGME49_221430;length= 17

II 342210 344003 + 93 1683

TGME49_chr 132068 ID=TGME49_240560;length=21

VI 9 1322846 - 57 1684

TGME49_chr 485179 ID=TGME49_315610;length=40

XI 6 4855798 + 02 1685

TGME49_chr 286621 ID=TGME49_246590;length=24

XII 7 2868700 + 83 1686

TGME49_chr 239506 ID=TGME49_260270;length=24

Vllb 7 2397478 + 11 1687

TGME49_chr ID=TGME49_213000;length=26

V 766839 769527 + 88 1688

TGME49_chr 192713 ID=TGME49_217580;length=20

XII 6 1929197 - 61 1689

TGME49_chr 659783 ID=TGME49_275568;length=23

XII 6 6600162 - 26 1690

TGME49_chr 533074 ID=TGME49_305280;length= 14

IX 0 5332155 + 15 1691

TGME49_chr 625914 ID=TGME49_214350;length= 15

X 5 6260711 + 66 1692

TGME49_chr 273024 ID=TGME49_288765 ;length=22

IX 3 2732515 - 72 1693

TGME49_chr 416922 ID=TGME49_314515 ;length=44

XI 7 4173719 - 92 1694

TGME49_chr 172289 ID=TGME49_311020;length=36

XI 5 1726565 + 70 1695

TGME49_chr 205978 ID=TGME49_311410;length=26

XI 0 2062470 + 90 1696

TGME49_chr 420201 ID=TGME49_281580;length=43

Vila 1 4206408 + 97 1697

TGME49_chr 294635 ID=TGME49_202900;length=43

Vila 9 2950755 + 96 1698

TGME49_chr 267808 ID=TGME49_259830;length=49

Vllb 8 2682997 + 09 1699

TGME49_chr ID=TGME49_206510;length=35

Vila 899877 903405 + 28 1700

TGME49_chr 399607 3999430 - ID=TGME49_314310;length=33 1701 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XI 7 53

TGME49_chr 535341 ID=TGME49_251400;length=23

XII 1 5355727 + 16 1702

TGME49_chr 217686 ID=TGME49_217860;length=39

XII 8 2180818 + 50 1703

TGME49_chr 579267 ID=TGME49_269450;length=42

VIII 3 5796876 - 03 1704

TGME49_chr 686350 ID=TGME49_200400;length=44

VIII 0 6867930 - 30 1705

TGME49_chr 164201 ID=TGME49_265270;length=48

IX 1 1646868 - 57 1706

TGME49_chr ID=TGME49_293610;length= 19

la 451232 453213 + 81 1707

TGME49_chr 144574 ID=TGME49_240800;length=34

VI 1 1449188 + 47 1708

TGME49_chr ID=TGME49_262950;length=23

Vllb 899808 902193 + 85 1709

TGME49_chr 552655 ID=TGME49_251690;length=49

XII 1 5531499 + 48 1710

TGME49_chr 215487 ID=TGME49_210690;length=24

IV 8 2157331 + 53 1711

TGME49_chr 224470 ID=TGME49_203770;length=32

Vila 1 2247997 + 96 1712

TGME49_chr 177396 ID=TGME49_261400;length=35

Vllb 9 1777524 - 55 1713

TGME49_chr 208764 ID=TGME49_264730;length= 18

IX 4 2089518 - 74 1714

TGME49_chr 532609 ID=TGME49_236200;length=20

X 6 5328125 - 29 1715

TGME49_chr 528018 ID=TGME49_305200;length=23

IX 0 5282539 + 59 1716

TGME49_chr 181504 ID=TGME49_286480;length=33

V 7 1818388 + 41 1717

TGME49_chr ID=TGME49_304700;length=49

Vila 624310 629301 + 91 1718

TGME49_chr ID=TGME49_221630;length=43

II 505138 509495 - 57 1719

TGME49_chr 315301 ID=TGME49_244280;length=33

VI 4 3156401 + 87 1720

TGME49_chr 399709 ID=TGME49_272410;length=22

VIII 7 3999384 - 87 1721

TGME49_chr 620537 ID=TGME49_278245;length=26

XII 6 6207991 + 15 1722

TGME49_chr 510823 ID=TGME49_270650;length=23

VIII 6 5110579 - 43 1723

TGME49_chr 141588 ID=TGME49_265840;length=23

IX 1 1418195 + 14 1724 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 386309 ID=TGME49_257740;length= 15

Vllb 7 3864612 - 15 1725

TGME49_chr 421582 ID=TGME49_272020;length=29

VIII 5 4218765 - 40 1726

TGME49_chr 162934 ID=TGME49_205120;length=28

Vila 2 1632196 - 54 1727

TGME49_chr ID=TGME49_280390;length=31

Vila 463960 467129 + 69 1728

TGME49_chr 142654 ID=TGME49_205390;length=46

Vila 2 1431194 - 52 1729

TGME49_chr ID=TGME49_221275 ;length=28

II 201437 204275 - 38 1730

TGME49_chr 243762 ID=TGME49_245620;length=14

XII 7 2439102 - 75 1731

TGME49_chr 233216 ID=TGME49_260370;length=25

Vllb 2 2334662 - 00 1732

TGME49_chr ID=TGME49_206600;length=45

Vila 788333 792905 + 72 1733

TGME49_chr 410878 ID=TGME49_291090;length=47

IX 8 4113576 + 88 1734

TGME49_chr 468242 ID=TGME49_292325 ;length= 14

IX 7 4683833 + 06 1735

TGME49_chr 241744 ID=TGME49_312075 ;length= 10

XI 5 2418466 - 21 1736

TGME49_chr ID=TGME49_293790;length=45

la 556012 560537 + 25 1737

TGME49_chr ID=TGME49_267090;length=49

IX 779646 784608 + 62 1738

TGME49_chr 259295 ID=TGME49_225005 ;length= 12

X 7 2594193 + 36 1739

TGME49_chr 241059 ID=TGME49_288245 ;length=43

IX 5 2414911 - 16 1740

TGME49_chr 260230 ID=TGME49_259940;length=27

Vllb 3 2605077 + 74 1741

TGME49_chr 539587 ID=TGME49_236520;length=18

X 1 5397679 + 08 1742

TGME49_chr 110727 ID=TGME49_318750;length=22

IV 5 1109514 + 39 1743

TGME49_chr ID=TGME49_238400;length=26

VI 191582 194227 + 45 1744

TGME49_chr 658678 ID=TGME49_214920;length=21

X 9 6588930 + 41 1745

TGME49_chr 637109 ID=TGME49_277990;length=23

XII 4 6373426 - 32 1746

TGME49_chr 181250 ID=TGME49_295350;length=20

la 6 1814521 - 15 1747

TGME49_chr 665698 6660694 + ID=TGME49_277500;length=37 1748 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XII 2 12

TGME49_chr 266364 ID=TGME49_246100;length=21

XII 3 2665815 + 72 1749

TGME49_chr ID=TGME49_309010;length= 16

XI 436997 438672 - 75 1750

TGME49_chr 544282 ID=TGME49_305498;length=29

IX 9 5445816 - 87 1751

TGME49_chr ID=TGME49_238060;length= 19

VI 41637 43605 + 68 1752

TGME49_chr ID=TGME49_219828 ;length=49

XII 613045 617971 + 26 1753

TGME49_chr 202602 ID=TGME49_225820;length=26

X 0 2028673 - 53 1754

TGME49_chr 369096 ID=TGME49_290300;length= 15

IX 4 3692526 + 62 1755

TGME49_chr ID=TGME49_267435 ;length= 16

IX 652699 654379 + 80 1756

TGME49_chr ID=TGME49_220610;length=30

V 489825 492876 - 51 1757

TGME49_chr ID=TGME49_263540;length=24

Vllb 486067 488472 + 05 1758

TGME49_chr 328373 ID=TGME49_273770;length=22

VIII 3 3286012 + 79 1759

TGME49_chr ID=TGME49_209010;length=44

lb 978776 983209 + 33 1760

TGME49_chr 177511 ID=TGME49_241860;length= 17

VI 9 1776850 - 31 1761

TGME49_chr 114695 ID=TGME49_294790;length= 17

la 1 1148726 - 75 1762

TGME49_chr 104173 ID=TGME49_219250;length= 17

XII 1 1043438 - 07 1763

TGME49_chr ID=TGME49_230350;length=38

VIII 733619 737486 - 67 1764

TGME49_chr 160369 ID=TGME49_297190;length=20

II 4 1605698 - 04 1765

TGME49_chr 271119 ID=TGME49_288720;length=21

IX 1 2713335 - 44 1766

TGME49_chr 119654 ID=TGME49_226950;length=30

X 3 1199550 + 07 1767

TGME49_chr 539354 ID=TGME49_305455;length=22

IX 4 5395750 + 06 1768

TGME49_chr 149916 ID=TGME49_310660;length=40

XI 1 1503181 + 20 1769

TGME49_chr 500825 ID=TGME49_315820;length= 19

XI 4 5010201 + 47 1770

TGME49_chr 139036 ID=TGME49_240690;length=24

VI 8 1392791 - 23 1771 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 112039 ID=TGME49_222410;length=27

II 3 1123132 + 39 1772

TGME49_chr ID=TGME49_227920;length=22

X 739417 741694 - 77 1773

TGME49_chr 554828 ID=TGME49_269910;length=47

VIII 2 5553068 - 86 1774

TGME49_chr ID=TGME49_229990;length=21

VIII 503704 505886 - 82 1775

TGME49_chr 252189 ID=TGME49_225102;length=22

X 1 2524155 + 64 1776

TGME49_chr 246923 ID=TGME49_260190;length= 15

Vllb 2 2470805 + 73 1777

TGME49_chr ID=TGME49_267750;length=30

IX 381279 384341 + 62 1778

TGME49_chr 260311 ID=TGME49_246030;length=24

XII 8 2605598 + 80 1779

TGME49_chr 137385 ID=TGME49_223025 ;length= 17

II 6 1375582 + 26 1780

TGME49_chr 216776 ID=TGME49_260550;length=27

Vllb 1 2170472 - 11 1781

TGME49_chr 424019 ID=TGME49_314710;length=43

XI 4 4244498 - 04 1782

TGME49_chr ID=TGME49_280570;length=21

Vila 293961 296113 - 52 1783

TGME49_chr 132348 ID=TGME49_266065 ;length=24

IX 7 1325964 - 77 1784

TGME49_chr 151380 ID=TGME49_231640;length=30

VIII 3 1516857 + 54 1785

TGME49_chr ID=TGME49_220390;length=38

V 342801 346665 + 64 1786

TGME49_chr 166310 ID=TGME49_205030;length=30

Vila 1 1666170 + 69 1787

TGME49_chr 284797 ID=TGME49_243760;length=49

VI 1 2852954 - 83 1788

TGME49_chr 535115 ID=TGME49_305320;length=21

IX 0 5353348 + 98 1789

TGME49_chr 413189 ID=TGME49_248730;length= 15

XII 1 4133450 - 59 1790

TGME49_chr ID=TGME49_306990;length= 10

XI 133543 134623 + 80 1791

TGME49_chr 635324 ID=TGME49_214550;length= 14

X 8 6354650 - 02 1792

TGME49_chr 604241 ID=TGME49_269140;length=22

VIII 1 6044655 + 44 1793

TGME49_chr 168883 ID=TGME49_265230;length=21

IX 3 1690933 - 00 1794

TGME49_chr 306442 309887 + ID=TGME49_293420;length=34 1795 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO: la 45

TGME49_chr 182492 ID=TGME49_204390;length=26

Vila 8 1827582 + 54 1796

TGME49_chr ID=TGME49_219430;length= 16

XII 951031 952710 + 79 1797

TGME49_chr ID=TGME49_239350;length=17

VI 581992 583744 - 52 1798

TGME49_chr 344165 ID=TGME49_223940;length=30

X 7 3444661 - 04 1799

TGME49_chr ID=TGME49_230520;length=23

VIII 846765 849139 + 74 1800

TGME49_chr 332379 ID=TGME49_247400;length=29

XII 0 3326774 + 84 1801

TGME49_chr 292072 ID=TGME49_202960;length=29

Vila 4 2923694 - 70 1802

TGME49_chr 457854 ID=TGME49_315220;length=42

XI 1 4582751 + 10 1803

TGME49_chr 152877 ID=TGME49_265470;length=30

IX 8 1531783 + 05 1804

TGME49_chr 123994 ID=TGME49_310290;length=23

XI 7 1242270 - 23 1805

TGME49_chr ID=TGME49_264040;length= 19

Vllb 222702 224691 - 89 1806

TGME49_chr ID=TGME49_229720;length=45

VIII 405193 409738 - 45 1807

TGME49_chr ID=TGME49_304680;length=45

Vila 625071 629635 - 64 1808

TGME49_chr 250728 ID=TGME49_288400;length=21

IX 7 2509481 - 94 1809

TGME49_chr 326093 ID=TGME49_202530;length=27

Vila 7 3263671 - 34 1810

TGME49_chr ID=TGME49_279360;length=22

IX 227007 229288 + 81 1811

TGME49_chr 402165 ID=TGME49_212140;length=26

X 5 4024338 + 83 1812

TGME49_chr 121590 ID=TGME49_262460;length=40

Vllb 8 1219966 + 58 1813

TGME49_chr 745793 ID=TGME49_207230;length=26

X 7 7460592 - 55 1814

TGME49_chr 107151 ID=TGME49_230870;length=41

VIII 4 1075639 - 25 1815

TGME49_chr 502920 ID=TGME49_235660;length=14

X 2 5030634 - 32 1816

TGME49_chr 483829 ID=TGME49_255420;length=26

Vllb 7 4840986 - 89 1817

TGME49_chr ID=TGME49_230070;length= 17

VIII 574328 576120 - 92 1818 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 143452 ID=TGME49_205370;length=26

Vila 9 1437210 + 81 1819

TGME49_chr 318471 ID=TGME49_258790;length=32

Vllb 4 3187955 - 41 1820

TGME49_chr ID=TGME49_263595 ;length= 15

Vllb 452002 453575 + 73 1821

TGME49_chr 179917 ID=TGME49_297460;length= 18

II 7 1801013 - 36 1822

TGME49_chr 134801 ID=TGME49_266010;length=23

IX 9 1350404 + 85 1823

TGME49_chr ID=TGME49_308990;length=32

XI 422982 426252 - 70 1824

TGME49_chr 263864 ID=TGME49_203250;length=47

Vila 0 2643428 + 88 1825

TGME49_chr 311210 ID=TGME49_247030;length=49

XII 6 3117089 + 83 1826

TGME49_chr 178611 ID=TGME49_286550;length=16

V 7 1787739 + 22 1827

TGME49_chr 163636 ID=TGME49_321690;length=31

lb 9 1639495 - 26 1828

TGME49_chr 474406 ID=TGME49_249698 ;length= 16

XII 2 4745703 - 41 1829

TGME49_chr 104928 ID=TGME49_310080;length=50

XI 8 1054295 - 07 1830

TGME49_chr 299736 ID=TGME49_246800;length=42

XII 4 3001613 + 49 1831

TGME49_chr ID=TGME49_280600;length=40

Vila 276480 280573 - 93 1832

TGME49_chr 204483 ID=TGME49_204040;length=23

Vila 4 2047221 - 87 1833

TGME49_chr 144357 ID=TGME49_318290;length= 16

IV 5 1445262 - 87 1834

TGME49_chr 135854 ID=TGME49_262125 ;length=50

Vllb 3 1363574 - 31 1835

TGME49_chr 188973 ID=TGME49_217520;length=39

XII 9 1893641 - 02 1836

TGME49_chr 621815 ID=TGME49_268890;length=46

VIII 6 6222777 + 21 1837

TGME49_chr ID=TGME49_279390;length=31

IX 212823 215942 - 19 1838

TGME49_chr 307466 ID=TGME49_283570;length=10

V 8 3075701 - 33 1839

TGME49_chr ID=TGME49_280540;length= 19

Vila 329088 331013 - 25 1840

TGME49_chr 113585 ID=TGME49_219090;length=84

XII 9 1136707 - 8 1841

TGME49_chr 562038 563532 - ID=TGME49_309210;length= 14 1842 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XI 94

TGME49_chr 507976 ID=TGME49_315930;length=22

XI 3 5082015 + 52 1843

TGME49_chr 423001 ID=TGME49_248870;length= 17

XII 0 4231788 + 78 1844

TGME49_chr ID=TGME49_229640;length=37

VIII 347018 350776 + 58 1845

TGME49_chr 336895 ID=TGME49_289670;length= 14

IX 8 3370369 + 11 1846

TGME49_chr 268743 ID=TGME49_233500;length=29

VIII 6 2690407 - 71 1847

TGME49_chr ID=TGME49_263360;length=19

Vllb 611076 613065 - 89 1848

TGME49_chr 397000 ID=TGME49_257568;length=30

Vllb 6 3973027 - 21 1849

TGME49_chr 132523 ID=TGME49_262160;length=36

Vllb 9 1328902 + 63 1850

TGME49_chr 441311 ID=TGME49_271760;length=23

VIII 3 4415505 - 92 1851

TGME49_chr 381450 ID=TGME49_201760;length=23

Vila 3 3816891 + 88 1852

TGME49_chr 405853 ID=TGME49_314400;length=28

XI 5 4061396 + 61 1853

TGME49_chr 185439 ID=TGME49_311210;length=46

XI 5 1859046 + 51 1854

TGME49_chr 463211 ID=TGME49_235140;length= 19

X 0 4634094 + 84 1855

TGME49_chr 581529 ID=TGME49_306190;length=39

IX 6 5819202 + 06 1856

TGME49_chr 290895 ID=TGME49_259180;length= 18

Vllb 5 2910810 - 55 1857

TGME49_chr 242121 ID=TGME49_233100;length=26

VIII 2 2423837 - 25 1858

TGME49_chr ID=TGME49_309000;length=23

XI 431000 433353 - 53 1859

TGME49_chr 227103 ID=TGME49_311770;length=46

XI 0 2275657 + 27 1860

TGME49_chr 433207 ID=TGME49_271850;length=29

VIII 0 4335007 + 37 1861

TGME49_chr 498916 ID=TGME49_250020;length= 14

XII 0 4990606 + 46 1862

TGME49_chr 267735 ID=TGME49_246130;length=21

XII 8 2679539 + 81 1863

TGME49_chr ID=TGME49_280550;length=23

Vila 305043 307438 + 95 1864

TGME49_chr 466818 ID=TGME49_315360;length=30

XI 1 4671233 + 52 1865 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 203508 ID=TGME49_225790;length=35

X 8 2038686 + 98 1866

TGME49_chr 325987 ID=TGME49_258690;length=27

Vllb 2 3262590 - 18 1867

TGME49_chr 491282 ID=TGME49_249860;length=18

XII 9 4914644 - 15 1868

TGME49_chr 244008 ID=TGME49_243200;length=22

VI 6 2442352 - 66 1869

TGME49_chr 388033 ID=TGME49_272560;length=44

VIII 6 3884798 + 62 1870

TGME49_chr 383160 ID=TGME49_248390;length=14

XII 7 3833051 - 44 1871

TGME49_chr 226279 ID=TGME49_311760;length=27

XI 6 2265505 - 09 1872

TGME49_chr ID=TGME49_294240;length=42

la 741460 745719 - 59 1873

TGME49_chr 178261 ID=TGME49_321470;length=47

lb 7 1787380 - 63 1874

TGME49_chr 166993 ID=TGME49_205020;length=94

Vila 1 1670876 - 5 1875

TGME49_chr 632649 ID=TGME49_268770;length=26

VIII 1 6329179 - 88 1876

TGME49_chr ID=TGME49_220145 ;length= 19

V 166266 168190 - 24 1877

TGME49_chr ID=TGME49_320020;length=47

IV 576195 580895 - 00 1878

TGME49_chr 129822 ID=TGME49_231175;length=34

VIII 4 1301679 + 55 1879

TGME49_chr 496442 ID=TGME49_255250;length=41

Vllb 4 4968613 - 89 1880

TGME49_chr ID=TGME49_267340;length= 16

IX 709923 711569 - 46 1881

TGME49_chr 455693 ID=TGME49_234970;length=38

X 6 4560782 + 46 1882

TGME49_chr ID=TGME49_263690;length=36

Vllb 393977 397643 - 66 1883

TGME49_chr 542293 ID=TGME49_251480;length= 19

XII 1 5424831 - 00 1884

TGME49_chr ID=TGME49_266880;length= 10

IX 914298 915352 + 54 1885

TGME49_chr ID=TGME49_220520;length=40

V 428812 432815 + 03 1886

TGME49_chr 640495 ID=TGME49_216080;length=49

XI 4 6409920 - 66 1887

TGME49_chr ID=TGME49_222080;length=48

II 861618 866482 - 64 1888

TGME49_chr 566986 5671445 + ID=TGME49_251930;length= 15 1889 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XII 7 78

TGME49_chr 253292 ID=TGME49_243310;length=42

VI 9 2537138 + 09 1890

TGME49_chr 380782 ID=TGME49_290640;length=29

IX 1 3810806 - 85 1891

TGME49_chr ID=TGME49_206540;length=31

Vila 901698 904817 - 19 1892

TGME49_chr 184746 ID=TGME49_265000;length=46

IX 3 1852082 + 19 1893

TGME49_chr 245453 ID=TGME49_203510;length=24

Vila 2 2456953 + 21 1894

TGME49_chr 360469 ID=TGME49_273290;length= 19

VIII 8 3606638 + 40 1895

TGME49_chr 223475 ID=TGME49_232700;length=21

VIII 2 2236893 - 41 1896

TGME49_chr 244932 ID=TGME49_243220;length= 10

VI 1 2450374 + 53 1897

TGME49_chr 693718 ID=TGME49_215385 ;length= 17

X 4 6938923 + 39 1898

TGME49_chr ID=TGME49_267380;length=38

IX 680988 684814 - 26 1899

TGME49_chr 226102 ID=TGME49_203760;length=21

Vila 7 2263156 - 29 1900

TGME49_chr 314527 ID=TGME49_289280;length=31

IX 5 3148375 - 00 1901

TGME49_chr 248972 ID=TGME49_203470;length= 12

Vila 2 2491017 + 95 1902

TGME49_chr 176988 ID=TGME49_232020;length=44

VIII 5 1774294 - 09 1903

TGME49_chr 399353 ID=TGME49_201160;length=27

Vila 3 3996332 - 99 1904

TGME49_chr 451479 ID=TGME49_234660;length= 15

X 9 4516300 + 01 1905

TGME49_chr ID=TGME49_213100;length=25

V 860484 863029 - 45 1906

TGME49_chr 359128 ID=TGME49_258085 ;length= 19

Vllb 1 3593195 - 14 1907

TGME49_chr 151486 ID=TGME49_318160;length=40

IV 5 1518877 + 12 1908

TGME49_chr 327966 ID=TGME49_224180;length=30

X 6 3282705 - 39 1909

TGME49_chr ID=TGME49_230700;length= 19

VIII 975775 977748 + 73 1910

TGME49_chr 308572 ID=TGME49_258940;length=23

Vllb 9 3088120 - 91 1911

TGME49_chr ID=TGME49_308050;length=22

XII 529861 532149 + 88 1912 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_220240;length=37

V 239790 243541 - 51 1913

TGME49_chr 146151 ID=TGME49_223125 ;length= 19

II 3 1463416 - 03 1914

TGME49_chr 522211 ID=TGME49_316280;length=20

XI 7 5224159 - 42 1915

TGME49_chr 146830 ID=TGME49_223140;length= 17

II 6 1470065 + 59 1916

TGME49_chr 235371 ID=TGME49_311890;length=47

XI 5 2358433 + 18 1917

TGME49_chr 158212 ID=TGME49_297170;length=46

II 3 1586805 + 82 1918

TGME49_chr ID=TGME49_279400;length=41

IX 199072 203223 + 51 1919

TGME49_chr ID=TGME49_320525 ;length= 18

IV 284392 286223 - 31 1920

TGME49_chr 484293 ID=TGME49_210370;length= 18

IX 4 4844771 - 37 1921

TGME49_chr 445216 ID=TGME49_314970;length=44

XI 7 4456567 - 00 1922

TGME49_chr 249977 ID=TGME49_312200;length=25

XI 2 2502355 - 83 1923

TGME49_chr ID=TGME49_208830;length=25

lb 890079 892589 - 10 1924

TGME49_chr 196636 ID=TGME49_232230;length= 15

VIII 2 1967945 - 83 1925

TGME49_chr ID=TGME49_239880;length=18

VI 989945 991803 - 58 1926

TGME49_chr 351889 ID=TGME49_258200;length=26

Vllb 5 3521575 + 80 1927

TGME49_chr ID=TGME49_220208 ;length= 14

V 216186 217667 - 81 1928

TGME49_chr 564027 ID=TGME49_251885 ;length= 15

XII 6 5641811 + 35 1929

TGME49_chr 382653 ID=TGME49_248360;length=18

XII 8 3828410 - 72 1930

TGME49_chr ID=TGME49_267290;length=77

IX 736629 737403 + 4 1931

TGME49_chr ID=TGME49_212740;length=42

V 578793 583050 - 57 1932

TGME49_chr 288673 ID=TGME49_246620;length=23

XII 2 2889046 + 14 1933

TGME49_chr 417636 ID=TGME49_248800;length= 18

XII 5 4178211 + 46 1934

TGME49_chr 506374 ID=TGME49_270710;length=28

VIII 7 5066564 - 17 1935

TGME49_chr 190733 1909721 + ID=TGME49_286220;length=23 1936 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

V 2 89

TGME49_chr 339747 ID=TGME49_313440;length=42

XI 4 3401749 - 75 1937

TGME49_chr 386182 ID=TGME49_201670;length=32

Vila 4 3865091 + 67 1938

TGME49_chr 556968 ID=TGME49_251760;length= 18

XII 4 5571506 - 22 1939

TGME49_chr 222592 ID=TGME49_217920;length=37

XII 1 2229690 + 69 1940

TGME49_chr 330787 ID=TGME49_289570;length=27

IX 2 3310644 + 72 1941

TGME49_chr ID=TGME49_293200;length=23

la 156682 159073 + 91 1942

TGME49_chr 388133 ID=TGME49_248470;length=42

XII 2 3885535 - 03 1943

TGME49_chr ID=TGME49_228330;length=22

X 418777 421046 - 69 1944

TGME49_chr 227402 ID=TGME49_285720;length=23

V 5 2276420 - 95 1945

TGME49_chr 299221 ID=TGME49_312940;length=28

XI 7 2995109 - 92 1946

TGME49_chr ID=TGME49_293520;length=17

la 390048 391767 - 19 1947

TGME49_chr 644570 ID=TGME49_268450;length=50

VIII 9 6450718 - 09 1948

TGME49_chr 392439 ID=TGME49_201270;length=35

Vila 8 3927902 - 04 1949

TGME49_chr 261252 ID=TGME49_288560;length=28

IX 6 2615415 + 89 1950

TGME49_chr 539402 ID=TGME49_305450;length=21

IX 5 5396129 - 04 1951

TGME49_chr 244932 ID=TGME49_243240;length=50

VI 1 2454332 + 11 1952

TGME49_chr 114167 ID=TGME49_227000;length=33

X 8 1144996 + 18 1953

TGME49_chr 358063 ID=TGME49_202140;length=21

Vila 3 3582769 - 36 1954

TGME49_chr ID=TGME49_230340;length=23

VIII 712149 714535 + 86 1955

TGME49_chr ID=TGME49_228050;length=39

X 653775 657724 - 49 1956

TGME49_chr 152907 ID=TGME49_226580;length= 17

X 6 1530823 - 47 1957

TGME49_chr 333465 ID=TGME49_202430;length=37

Vila 1 3338417 - 66 1958

TGME49_chr 216320 ID=TGME49_260570;length=21

Vllb 1 2165337 - 36 1959 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 104384 ID=TGME49_262740;length= 19

Vllb 8 1045795 + 47 1960

TGME49_chr 131105 ID=TGME49_226825;length=50

X 3 1316055 + 02 1961

TGME49_chr 397749 ID=TGME49_272430;length=24

VIII 6 3979941 - 45 1962

TGME49_chr 178919 ID=TGME49_286530;length=21

V 2 1791334 + 42 1963

TGME49_chr 417392 ID=TGME49_257170;length= 13

Vllb 2 4175290 + 68 1964

TGME49_chr 459139 ID=TGME49_315230;length= 17

XI 6 4593178 + 82 1965

TGME49_chr ID=TGME49_229420;length=20

VIII 249237 251305 - 68 1966

TGME49_chr ID=TGME49_212840;length=21

V 646486 648649 - 63 1967

TGME49_chr 112033 ID=TGME49_310180;length= 19

XI 0 1122240 + 10 1968

TGME49_chr 557532 ID=TGME49_316760;length= 19

XI 1 5577263 - 42 1969

TGME49_chr 287461 ID=TGME49_233870;length=41

VIII 3 2878765 - 52 1970

TGME49_chr 551151 ID=TGME49_251650;length= 12

XII 6 5512810 + 94 1971

TGME49_chr 162815 ID=TGME49_211675;length=89

IV 7 1629047 + 0 1972

TGME49_chr 142262 ID=TGME49_213900;length=32

V 9 1425901 - 72 1973

TGME49_chr ID=TGME49_280670;length=37

Vila 186501 190276 + 75 1974

TGME49_chr ID=TGME49_206500;length=45

Vila 929557 930012 - 5 1975

TGME49_chr 297127 ID=TGME49_289080;length=22

IX 0 2973519 + 49 1976

TGME49_chr 393034 ID=TGME49_272510;length= 16

VIII 0 3931981 - 41 1977

TGME49_chr ID=TGME49_306930;length= 19

XI 181107 183028 - 21 1978

TGME49_chr 164966 ID=TGME49_265255;length=31

IX 3 1652849 + 86 1979

TGME49_chr 107392 ID=TGME49_219200;length=22

XII 7 1076130 + 03 1980

TGME49_chr 247103 ID=TGME49_301430;length=28

IV 7 2473860 + 23 1981

TGME49_chr 636486 ID=TGME49_278005 ;length= 14

XII 6 6366334 + 68 1982

TGME49_chr 842071 845707 - ID=TGME49_208740;length=36 1983 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO: lb 36

TGME49_chr ID=TGME49_239600;length=49

VI 790118 795087 + 69 1984

TGME49_chr 142429 ID=TGME49_226700;length=25

X 8 1426804 + 06 1985

TGME49_chr ID=TGME49_287490;length=44

V 560109 564542 - 33 1986

TGME49_chr 248367 ID=TGME49_225120;length=26

X 0 2486271 + 01 1987

TGME49_chr 139372 ID=TGME49_205460;length=45

Vila 5 1398244 - 19 1988

TGME49_chr 109259 ID=TGME49_219175 ;length= 16

XII 2 1094200 + 08 1989

TGME49_chr ID=TGME49_227960;length= 18

X 705828 707714 - 86 1990

TGME49_chr 440449 ID=TGME49_291910;length=24

IX 6 4406990 - 94 1991

TGME49_chr 699818 ID=TGME49_215510;length= 14

X 5 6999653 - 68 1992

TGME49_chr 452551 ID=TGME49_234680;length=18

X 9 4527356 + 37 1993

TGME49_chr 136441 ID=TGME49_226780;length=18

X 3 1366308 + 95 1994

TGME49_chr 143448 ID=TGME49_262000;length=97

Vllb 2 1435459 - 7 1995

TGME49_chr 319832 ID=TGME49_273885 ;length=46

VIII 8 3202978 - 50 1996

TGME49_chr 589194 ID=TGME49_269320;length=22

VIII 6 5894191 + 45 1997

TGME49_chr 180127 ID=TGME49_297470;length=43

II 3 1805605 + 32 1998

TGME49_chr 158336 ID=TGME49_297160;length=40

II 6 1587369 - 03 1999

TGME49_chr 101569 ID=TGME49_310030;length= 18

XI 0 1017565 - 75 2000

TGME49_chr ID=TGME49_229470;length=21

VIII 284128 286310 + 82 2001

TGME49_chr ID=TGME49_320610;length=49

IV 206127 211040 - 13 2002

TGME49_chr ID=TGME49_319510;length=25

IV 954032 956564 - 32 2003

TGME49_chr 110247 ID=TGME49_227080;length= 16

X 7 1104098 - 21 2004

TGME49_chr 366798 ID=TGME49_290250;length=21

IX 1 3670156 - 75 2005

TGME49_chr 651963 ID=TGME49_268360;length=32

VIII 8 6522878 + 40 2006 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 544011 ID=TGME49_316620;length=38

XI 1 5443988 - 77 2007

TGME49_chr 217741 ID=TGME49_203840;length=25

Vila 6 2179962 + 46 2008

TGME49_chr 508420 ID=TGME49_270680;length= 16

VIII 9 5085825 + 16 2009

TGME49_chr 299546 ID=TGME49_312960;length= 19

XI 4 2997405 + 41 2010

TGME49_chr 162592 ID=TGME49_297230;length=27

II 3 1628671 + 48 2011

TGME49_chr 161890 ID=TGME49_310830;length= 10

XI 0 1619991 + 91 2012

TGME49_chr 675150 ID=TGME49_277060;length=33

XII 4 6754864 - 60 2013

TGME49_chr 616571 ID=TGME49_216410;length=29

XI 5 6168615 + 00 2014

TGME49_chr 263664 ID=TGME49_284645 ;length=24

V 7 2639120 + 73 2015

TGME49_chr 447349 ID=TGME49_256770;length=25

Vllb 7 4476055 + 58 2016

TGME49_chr 525383 ID=TGME49_270350;length=19

VIII 8 5255754 - 16 2017

TGME49_chr ID=TGME49_239890;length=38

VI 995195 999093 - 98 2018

TGME49_chr 154674 ID=TGME49_240900;length=49

VI 3 1551741 - 98 2019

TGME49_chr 357082 ID=TGME49_258110;length=20

Vllb 3 3572905 - 82 2020

TGME49_chr 168738 ID=TGME49_265220;length=30

IX 8 1690438 + 50 2021

TGME49_chr 186192 ID=TGME49_232120;length=24

VIII 8 1864368 + 40 2022

TGME49_chr ID=TGME49_220620;length=22

V 488616 490907 + 91 2023

TGME49_chr 340246 ID=TGME49_244620;length=33

VI 4 3405837 - 73 2024

TGME49_chr ID=TGME49_220585 ;length=21

V 469162 471286 + 24 2025

TGME49_chr ID=TGME49_228270;length=26

X 448104 450711 - 07 2026

TGME49_chr ID=TGME49_294350;length=99

la 840385 841380 - 5 2027

TGME49_chr 164217 ID=TGME49_310875 ;length=92

XI 0 1643093 + 3 2028

TGME49_chr 499571 ID=TGME49_270780;length=17

VIII 9 4997442 + 23 2029

TGME49_chr 927570 932508 - ID=TGME49_239800;length=49 2030 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VI 38

TGME49_chr 184331 ID=TGME49_204370;length=26

Vila 3 1845988 + 75 2031

TGME49_chr 454960 ID=TGME49_234940;length=43

X 6 4553931 + 25 2032

TGME49_chr 170878 ID=TGME49_241610;length=22

VI 5 1711030 + 45 2033

TGME49_chr 493897 ID=TGME49_235540;length=14

X 2 4940429 - 57 2034

TGME49_chr ID=TGME49_264120;length=45

Vllb 164856 169420 + 64 2035

TGME49_chr 402759 ID=TGME49_291000;length=21

IX 8 4029731 + 33 2036

TGME49_chr 141959 ID=TGME49_205402;length=20

Vila 9 1421687 + 88 2037

TGME49_chr ID=TGME49_309020;length= 13

XI 436362 437746 + 84 2038

TGME49_chr 517629 ID=TGME49_316190;length=28

XI 9 5179174 + 75 2039

TGME49_chr 271212 ID=TGME49_259690;length= 19

Vllb 0 2714104 - 84 2040

TGME49_chr 145031 ID=TGME49_209780;length=65

lb 4 1450964 + 0 2041

TGME49_chr 593935 ID=TGME49_278680;length=19

XII 9 5941323 - 64 2042

TGME49_chr ID=TGME49_219850;length= 19

XII 602054 603985 - 31 2043

TGME49_chr 443187 ID=TGME49_291960;length=31

IX 8 4435031 + 53 2044

TGME49_chr 216750 ID=TGME49_285860;length=48

V 0 2172395 - 95 2045

TGME49_chr 604975 ID=TGME49_306400;length=35

IX 2 6053318 + 66 2046

TGME49_chr 166256 ID=TGME49_261550;length= 12

Vllb 8 1663864 - 96 2047

TGME49_chr 280719 ID=TGME49_224820;length= 10

X 7 2808264 + 67 2048

TGME49_chr 194182 ID=TGME49_264875 ;length= 19

IX 0 1943728 - 08 2049

TGME49_chr ID=TGME49_238070;length=50

VI 41637 46647 + 10 2050

TGME49_chr 642083 ID=TGME49_216070;length=45

XI 5 6425390 - 55 2051

TGME49_chr ID=TGME49_267670;length=28

IX 469404 472283 + 79 2052

TGME49_chr 600717 ID=TGME49_237595 ;length= 18

X 6 6009045 - 69 2053 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 560847 ID=TGME49_316900;length= 17

XI 2 5610173 - 01 2054

TGME49_chr 617199 ID=TGME49_214250;length=40

X 5 6176026 + 31 2055

TGME49_chr 278634 ID=TGME49_288870;length=l 1

IX 8 2787500 - 52 2056

TGME49_chr 407134 ID=TGME49_291020;length=41

IX 0 4075503 - 63 2057

TGME49_chr ID=TGME49_308920;length=25

XI 362611 365152 - 41 2058

TGME49_chr 603810 ID=TGME49_278540;length=14

XII 2 6039598 - 96 2059

TGME49_chr 138386 ID=TGME49_295080;length=23

la 6 1386224 + 58 2060

TGME49_chr 125521 ID=TGME49_310320;length= 18

XI 0 1257092 + 82 2061

TGME49_chr 580093 ID=TGME49_237230;length=22

X 0 5803170 - 40 2062

TGME49_chr 372593 ID=TGME49_290460;length= 16

IX 4 3727613 + 79 2063

TGME49_chr ID=TGME49_239680;length=20

VI 831846 833871 - 25 2064

TGME49_chr ID=TGME49_228730;length=26

X 184156 186759 - 03 2065

TGME49_chr ID=TGME49_267510;length=31

IX 608936 612060 - 24 2066

TGME49_chr 260404 ID=TGME49_233350;length=17

VIII 2 2605811 + 69 2067

TGME49_chr 337029 ID=TGME49_244580;length= 16

VI 2 3371967 + 75 2068

TGME49_chr 431302 ID=TGME49_256975;length=75

Vllb 8 4313787 - 9 2069

TGME49_chr 577542 ID=TGME49_216960;length= 18

XI 7 5777232 - 05 2070

TGME49_chr 210373 ID=TGME49_260640;length=24

Vllb 7 2106170 - 33 2071

TGME49_chr 179560 ID=TGME49_226100;length= 19

X 4 1797542 + 38 2072

TGME49_chr ID=TGME49_228320;length=18

X 418101 419981 + 80 2073

TGME49_chr 134061 ID=TGME49_310440;length=23

XI 6 1343000 + 84 2074

TGME49_chr 335530 ID=TGME49_289630;length=25

IX 8 3357831 + 23 2075

TGME49_chr 219258 ID=TGME49_232620;length=21

VIII 3 2194733 + 50 2076

TGME49_chr 463656 4638383 - ID=TGME49_255900;length=18 2077 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

Vllb 1 22

TGME49_chr 558816 ID=TGME49_269820;length=19

VIII 3 5590158 + 95 2078

TGME49_chr 324750 ID=TGME49_244408 ;length=27

VI 6 3250212 - 06 2079

TGME49_chr 205149 ID=TGME49_204010;length=47

Vila 6 2056236 - 40 2080

TGME49_chr ID=TGME49_229980;length=25

VIII 491985 494558 + 73 2081

TGME49_chr 205653 ID=TGME49_203985;length= 15

Vila 6 2058106 + 70 2082

TGME49_chr ID=TGME49_280725 ;length=41

Vila 140276 144404 + 28 2083

TGME49_chr 225069 ID=TGME49_232750;length=20

VIII 8 2252771 - 73 2084

TGME49_chr 588704 ID=TGME49_269340;length=23

VIII 7 5889397 - 50 2085

TGME49_chr 105280 ID=TGME49_310090;length= 14

XI 3 1054295 - 92 2086

TGME49_chr 127977 ID=TGME49_240440;length= 18

VI 2 1281594 + 22 2087

TGME49_chr ID=TGME49_263160;length= 15

Vllb 756456 758017 - 61 2088

TGME49_chr 181199 ID=TGME49_297480;length=31

II 3 1815135 + 42 2089

TGME49_chr ID=TGME49_206430;length=44

Vila 990948 995424 - 76 2090

TGME49_chr 298611 ID=TGME49_283770;length=47

V 3 2990861 - 48 2091

TGME49_chr ID=TGME49_219560;length= 17

XII 850493 852277 - 84 2092

TGME49_chr 134416 ID=TGME49_318430;length=48

IV 2 1349005 - 43 2093

TGME49_chr 105633 ID=TGME49_262730;length=27

Vllb 3 1059103 - 70 2094

TGME49_chr 466171 ID=TGME49_271170;length= 16

VIII 3 4663360 - 47 2095

TGME49_chr 594276 ID=TGME49_216720;length=24

XI 8 5945248 + 80 2096

TGME49_chr 177952 ID=TGME49_311075;length=35

XI 0 1783053 + 33 2097

TGME49_chr 182066 ID=TGME49_321420;length= 17

lb 4 1822422 + 58 2098

TGME49_chr 194415 ID=TGME49_225900;length=43

X 3 1948496 + 43 2099

TGME49_chr 120133 ID=TGME49_205710;length= 16

Vila 5 1202972 + 37 2100 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 487233 ID=TGME49_315640;length=23

XI 4 4874729 - 95 2101

TGME49_chr 155463 ID=TGME49_231815 ;length=29

VIII 3 1557549 + 16 2102

TGME49_chr 305180 ID=TGME49_313030;length=49

XI 9 3056712 + 03 2103

TGME49_chr 284025 ID=TGME49_224750;length= 18

X 0 2842141 - 91 2104

TGME49_chr 241922 ID=TGME49_288260;length=38

IX 1 2423061 + 40 2105

TGME49_chr ID=TGME49_221560;length=22

II 443749 446045 - 96 2106

TGME49_chr 196919 ID=TGME49_261018 ;length=38

Vllb 2 1973085 - 93 2107

TGME49_chr 297138 ID=TGME49_224575 ;length=24

X 9 2973789 - 00 2108

TGME49_chr ID=TGME49_267642;length=32

IX 509919 513149 - 30 2109

TGME49_chr 578743 ID=TGME49_216930;length=23

XI 2 5789807 - 75 2110

TGME49_chr 446902 ID=TGME49_292000;length=22

IX 8 4471277 - 49 2111

TGME49_chr 427623 ID=TGME49_291360;length=31

IX 7 4279338 - 01 2112

TGME49_chr 234217 ID=TGME49_260360;length= 15

Vllb 7 2343737 - 60 2113

TGME49_chr 128744 ID=TGME49_310360;length=48

XI 5 1292319 + 74 2114

TGME49_chr 507069 ID=TGME49_250360;length=36

XII 2 5074348 - 56 2115

TGME49_chr 156955 ID=TGME49_261710;length=31

Vllb 9 1572751 + 92 2116

TGME49_chr 258337 ID=TGME49_225020;length=27

X 3 2586162 - 89 2117

TGME49_chr 138633 ID=TGME49_231370;length=42

VIII 9 1390571 - 32 2118

TGME49_chr 334174 ID=TGME49_289600;length= 18

IX 9 3343589 - 40 2119

TGME49_chr 485250 ID=TGME49_210345 ;length=27

IX 0 4855230 + 30 2120

TGME49_chr 199923 ID=TGME49_232300;length=40

VIII 2 2003285 - 53 2121

TGME49_chr 121791 ID=TGME49_226930;length=92

X 8 1218840 + 2 2122

TGME49_chr ID=TGME49_267740;length=23

IX 393414 395802 + 88 2123

TGME49_chr 223983 2241664 - ID=TGME49_232720;length= 18 2124 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 1 33

TGME49_chr 177808 ID=TGME49_226220;length=27

X 2 1780845 + 63 2125

TGME49_chr 432518 ID=TGME49_256970;length=47

Vllb 0 4329892 - 12 2126

TGME49_chr ID=TGME49_320630;length=20

IV 185525 187612 + 87 2127

TGME49_chr 501645 ID=TGME49_250090;length=20

XII 0 5018497 + 47 2128

TGME49_chr 597155 ID=TGME49_306334;length=10

IX 2 5972595 + 43 2129

TGME49_chr 165490 ID=TGME49_211640;length=39

IV 4 1658845 - 41 2130

TGME49_chr 169110 ID=TGME49_211470;length=21

IV 2 1693242 + 40 2131

TGME49_chr 144072 ID=TGME49_231460;length=20

VIII 1 1442757 + 36 2132

TGME49_chr 217306 ID=TGME49_311640;length=48

XI 6 2177892 - 26 2133

TGME49_chr 523272 ID=TGME49_236000;length=19

X 0 5234701 + 81 2134

TGME49_chr 236132 ID=TGME49_311905;length=40

XI 5 2365388 + 63 2135

TGME49_chr 248247 ID=TGME49_233180;length=41

VIII 9 2486633 - 54 2136

TGME49_chr ID=TGME49_300120;length=36

XII 205121 208744 - 23 2137

TGME49_chr 529337 ID=TGME49_250870;length=35

XII 8 5296947 - 69 2138

TGME49_chr 458277 ID=TGME49_292150;length= 12

IX 3 4584034 - 61 2139

TGME49_chr 211865 ID=TGME49_260620;length=48

Vllb 7 2123522 - 65 2140

TGME49_chr 363282 ID=TGME49_247950;length=46

XII 9 3637477 + 48 2141

TGME49_chr 341650 ID=TGME49_273520;length=34

VIII 8 3419960 + 52 2142

TGME49_chr 700708 ID=TGME49_215540;length=35

X 5 7010674 - 89 2143

TGME49_chr 261083 ID=TGME49_224990;length= 19

X 6 2612776 - 40 2144

TGME49_chr ID=TGME49_300000;length=42

XII 286679 290951 + 72 2145

TGME49_chr 106758 ID=TGME49_310122;length= 14

XI 1 1069045 - 64 2146

TGME49_chr 252372 ID=TGME49_203390;length=47

Vila 4 2528425 - 01 2147 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 636858 ID=TGME49_268670;length=36

VIII 4 6372245 + 61 2148

TGME49_chr 237917 ID=TGME49_288210;length=41

IX 1 2383345 + 74 2149

TGME49_chr 624706 ID=TGME49_214310;length=27

X 0 6249835 - 75 2150

TGME49_chr 441072 ID=TGME49_234470;length=46

X 4 4415344 - 20 2151

TGME49_chr ID=TGME49_229950;length=20

VIII 483790 485820 + 30 2152

TGME49_chr 120794 ID=TGME49_205690;length=49

Vila 0 1212921 + 81 2153

TGME49_chr ID=TGME49_320050;length=21

IV 548750 550949 + 99 2154

TGME49_chr 333571 ID=TGME49_313390;length= 16

XI 9 3337418 + 99 2155

TGME49_chr 449972 ID=TGME49_234625 ;length=46

X 9 4504428 + 99 2156

TGME49_chr 130617 ID=TGME49_266100;length= 19

IX 1 1308162 - 91 2157

TGME49_chr 498737 ID=TGME49_235620;length=12

X 3 4988579 + 06 2158

TGME49_chr 204915 ID=TGME49_297820;length=43

II 3 2053496 - 43 2159

TGME49_chr 100539 ID=TGME49_209040;length=49

lb 2 1010351 - 59 2160

TGME49_chr 141691 ID=TGME49_318320;length=21

IV 0 1419099 + 89 2161

TGME49_chr 182371 ID=TGME49_286460;length=38

V 3 1827597 - 84 2162

TGME49_chr 650588 ID=TGME49_277800;length=14

XII 8 6507354 - 66 2163

TGME49_chr 388967 ID=TGME49_248490;length=22

XII 1 3891930 - 59 2164

TGME49_chr 151611 ID=TGME49_205250;length= 17

Vila 3 1517866 - 53 2165

TGME49_chr 414725 ID=TGME49_281480;length=22

Vila 7 4149471 - 14 2166

TGME49_chr 266858 ID=TGME49_233480;length=27

VIII 7 2671322 + 35 2167

TGME49_chr 560831 ID=TGME49_305830;length= 14

IX 5 5609770 + 55 2168

TGME49_chr 546061 ID=TGME49_236630;length=45

X 5 5465206 - 91 2169

TGME49_chr ID=TGME49_207930;length=23

lb 446581 448900 - 19 2170

TGME49_chr 295820 2962910 + ID=TGME49_244020;length=47 2171 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VI 4 06

TGME49_chr 255363 ID=TGME49_312260;length=20

XI 5 2555730 - 95 2172

TGME49_chr 535366 ID=TGME49_270170;length=39

VIII 9 5357579 + 10 2173

TGME49_chr ID=TGME49_300090;length= 16

XII 224079 225774 + 95 2174

TGME49_chr 461492 ID=TGME49_255960;length=49

Vllb 8 4619845 - 17 2175

TGME49_chr 147327 ID=TGME49_231480;length= 19

VIII 1 1475198 - 27 2176

TGME49_chr 270767 ID=TGME49_203170;length=33

Vila 5 2711017 + 42 2177

TGME49_chr 432539 ID=TGME49_234375 ;length=32

X 4 4328594 + 00 2178

TGME49_chr ID=TGME49_264075 ;length=46

Vllb 201826 206450 + 24 2179

TGME49_chr 398770 ID=TGME49_314280;length= 19

XI 4 3989680 - 76 2180

TGME49_chr 243110 ID=TGME49_301390;length= 16

IV 7 2432795 + 88 2181

TGME49_chr 155404 ID=TGME49_261740;length=32

Vllb 7 1557249 - 02 2182

TGME49_chr 607789 ID=TGME49_216570;length=22

XI 8 6080164 + 66 2183

TGME49_chr 590787 ID=TGME49_306280;length= 15

IX 7 5909447 - 70 2184

TGME49_chr 129127 ID=TGME49_266110;length=24

IX 4 1293712 + 38 2185

TGME49_chr ID=TGME49_207360;length= 15

lb 3431 5000 + 69 2186

TGME49_chr ID=TGME49_319740;length=24

IV 749326 751746 + 20 2187

TGME49_chr 157448 ID=TGME49_261700;length= 12

Vllb 4 1575737 + 53 2188

TGME49_chr 133892 ID=TGME49_240630;length=42

VI 1 1343149 + 28 2189

TGME49_chr 553953 ID=TGME49_251720;length= 18

XII 4 5541403 - 69 2190

TGME49_chr 489662 ID=TGME49_255330;length=21

Vllb 3 4898784 + 61 2191

TGME49_chr 473381 ID=TGME49_249690;length=32

XII 7 4737060 + 43 2192

TGME49_chr 138387 ID=TGME49_223050;length= 13

II 1 1385210 + 39 2193

TGME49_chr 461734 ID=TGME49_315280;length=32

XI 6 4620622 + 76 2194 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 187012 ID=TGME49_311220;length= 16

XI 5 1871822 + 97 2195

TGME49_chr 384324 ID=TGME49_248425 ;length=24

XII 5 3845735 + 90 2196

TGME49_chr 538950 ID=TGME49_251450;length=37

XII 6 5393276 + 70 2197

TGME49_chr 598049 ID=TGME49_269228 ;length=23

VIII 5 5982851 - 56 2198

TGME49_chr 147971 ID=TGME49_318220;length=28

IV 8 1482540 - 22 2199

TGME49_chr 391075 ID=TGME49_201390;length=21

Vila 2 3912895 - 43 2200

TGME49_chr 286019 ID=TGME49_243790;length=27

VI 9 2862974 - 75 2201

TGME49_chr 217851 ID=TGME49_203860;length= 17

Vila 3 2180276 - 63 2202

TGME49_chr 134193 ID=TGME49_222970;length=26

II 1 1344535 + 04 2203

TGME49_chr 434870 ID=TGME49_249160;length= 19

XII 0 4350687 - 87 2204

TGME49_chr 181725 ID=TGME49_286485;length=23

V 2 1819572 - 20 2205

TGME49_chr 409656 ID=TGME49_291070;length=28

IX 8 4099371 + 03 2206

TGME49_chr 145637 ID=TGME49_213940;length=27

V 7 1459175 - 98 2207

TGME49_chr 112826 ID=TGME49_294760;length=24

la 8 1130676 - 08 2208

TGME49_chr ID=TGME49_300270;length=38

XII 112149 115996 + 47 2209

TGME49_chr 114118 ID=TGME49_294780;length=45

la 2 1145699 - 17 2210

TGME49_chr ID=TGME49_294180;length= 19

la 684768 686694 - 26 2211

TGME49_chr 198253 ID=TGME49_217665 ;length=20

XII 3 1984559 - 26 2212

TGME49_chr ID=TGME49_264150;length=42

Vllb 136836 141095 - 59 2213

TGME49_chr 364488 ID=TGME49_290200;length= 19

IX 9 3646816 - 27 2214

TGME49_chr 341684 ID=TGME49_313475 ;length=38

XI 4 3420688 + 44 2215

TGME49_chr 555831 ID=TGME49_251750;length=47

XII 1 5563031 + 20 2216

TGME49_chr 359112 ID=TGME49_273330;length=30

VIII 6 3594224 - 98 2217

TGME49_chr 190716 1911891 + ID=TGME49_311250;length=47 2218 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XI 5 26

TGME49_chr ID=TGME49_208320;length=18

lb 606581 608427 + 46 2219

TGME49_chr ID=TGME49_320470;length=31

IV 317723 320840 + 17 2220

TGME49_chr 123136 ID=TGME49_294870;length=24

la 6 1233810 + 44 2221

TGME49_chr ID=TGME49_213030;length=41

V 799204 803341 + 37 2222

TGME49_chr 414126 ID=TGME49_248740;length=37

XII 7 4145015 - 48 2223

TGME49_chr 683103 ID=TGME49_200370;length= 19

VIII 2 6832955 - 23 2224

TGME49_chr 581418 ID=TGME49_269417;length=35

VIII 1 5817773 + 92 2225

TGME49_chr ID=TGME49_293380;length=24

la 288432 290836 + 04 2226

TGME49_chr ID=TGME49_228360;length=15

X 375771 377308 + 37 2227

TGME49_chr 281707 ID=TGME49_203070;length=29

Vila 5 2820018 + 43 2228

TGME49_chr 264242 ID=TGME49_288630;length=23

IX 6 2644741 + 15 2229

TGME49_chr 377832 ID=TGME49_201810;length=23

Vila 4 3780703 - 79 2230

TGME49_chr 213696 ID=TGME49_311500;length= 14

XI 8 2138370 - 02 2231

TGME49_chr 674660 ID=TGME49_215090;length=26

X 4 6749287 + 83 2232

TGME49_chr ID=TGME49_280420;length=41

Vila 428379 432545 - 66 2233

TGME49_chr 318713 ID=TGME49_244320;length=l 1

VI 9 3188303 + 64 2234

TGME49_chr 146360 ID=TGME49_240830;length=18

VI 1 1465416 - 15 2235

TGME49_chr 627386 ID=TGME49_214400;length=21

X 0 6275989 + 29 2236

TGME49_chr 222030 ID=TGME49_242740;length=22

VI 9 2222591 + 82 2237

TGME49_chr 135187 ID=TGME49_209590;length=32

lb 2 1355146 + 74 2238

TGME49_chr ID=TGME49_229490;length=49

VIII 294473 299452 + 79 2239

TGME49_chr 199645 ID=TGME49_210970;length= 16

IV 3 1998125 + 72 2240

TGME49_chr 260800 ID=TGME49_233370;length=32

VIII 7 2611227 + 20 2241 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 379539 ID=TGME49_201775 ;length=33

Vila 3 3798765 + 72 2242

TGME49_chr 129382 ID=TGME49_294970;length=20

la 6 1295850 + 24 2243

TGME49_chr 453575 ID=TGME49_292070;length= 15

IX 0 4537308 - 58 2244

TGME49_chr 227995 ID=TGME49_203740;length=37

Vila 2 2283671 - 19 2245

TGME49_chr ID=TGME49_280380;length=20

Vila 485370 487434 + 64 2246

TGME49_chr 109558 ID=TGME49_310160;length=43

XI 0 1099919 + 39 2247

TGME49_chr 216676 ID=TGME49_260540;length=31

Vllb 2 2169888 + 26 2248

TGME49_chr 549238 ID=TGME49_236780;length=24

X 1 5494853 + 72 2249

TGME49_chr ID=TGME49_304490;length=35

Vila 525141 528720 - 79 2250

TGME49_chr 125146 ID=TGME49_218840;length= 17

XII 2 1253195 + 33 2251

TGME49_chr 104456 ID=TGME49_262750;length= 16

Vllb 0 1046178 - 18 2252

TGME49_chr 494142 ID=TGME49_249920;length= 15

XII 6 4942964 + 38 2253

TGME49_chr 199587 ID=TGME49_260870;length= 16

Vllb 9 1997531 - 52 2254

TGME49_chr 429925 ID=TGME49_314780;length= 19

XI 8 4301184 - 26 2255

TGME49_chr 176990 ID=TGME49_297420;length=48

II 5 1774752 - 47 2256

TGME49_chr 132004 ID=TGME49_209540;length= 19

lb 0 1321949 - 09 2257

TGME49_chr 617711 ID=TGME49_306550;length=43

IX 7 6181474 - 57 2258

TGME49_chr 172509 ID=TGME49_286680;length=23

V 2 1727430 + 38 2259

TGME49_chr ID=TGME49_239630;length=28

VI 811913 814755 + 42 2260

TGME49_chr 387628 ID=TGME49_290840;length=21

IX 2 3878405 + 23 2261

TGME49_chr 169302 ID=TGME49_297330;length=23

II 9 1695406 - 77 2262

TGME49_chr 121402 ID=TGME49_209440;length=41

lb 2 1218129 + 07 2263

TGME49_chr 157430 ID=TGME49_231840;length=46

VIII 3 1578976 + 73 2264

TGME49_chr 491424 4916993 + ID=TGME49_255300;length=27 2265 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

Vllb 3 50

TGME49_chr 174501 ID=TGME49_311030;length=48

XI 4 1749860 - 46 2266

TGME49_chr 413189 ID=TGME49_272230;length=30

VIII 6 4134904 - 08 2267

TGME49_chr ID=TGME49_306910;length=42

XI 186373 190635 + 62 2268

TGME49_chr 496945 ID=TGME49_270820;length=32

VIII 9 4972682 + 23 2269

TGME49_chr ID=TGME49_221320;length=29

II 252286 255240 - 54 2270

TGME49_chr 466530 ID=TGME49_249590;length= 19

XII 7 4667213 + 06 2271

TGME49_chr ID=TGME49_300140;length=28

XII 183074 185954 + 80 2272

TGME49_chr 210764 ID=TGME49_225730;length= 18

X 2 2109483 - 41 2273

TGME49_chr 508420 ID=TGME49_270670;length=36

VIII 9 5087859 + 50 2274

TGME49_chr 314133 ID=TGME49_244250;length=48

VI 8 3146169 - 31 2275

TGME49_chr 164172 ID=TGME49_265260;length=43

IX 8 1646063 + 35 2276

TGME49_chr 162508 ID=TGME49_321700;length=25

lb 7 1627665 - 78 2277

TGME49_chr 170859 ID=TGME49_297360;length=27

II 5 1711391 + 96 2278

TGME49_chr 256157 ID=TGME49_245770;length=23

XII 0 2563897 + 27 2279

TGME49_chr 366208 ID=TGME49_313750;length=23

XI 3 3664424 - 41 2280

TGME49_chr 267897 ID=TGME49_288700;length=44

IX 6 2683389 + 13 2281

TGME49_chr 245253 ID=TGME49_203520;length=18

Vila 3 2454348 - 15 2282

TGME49_chr ID=TGME49_229660;length= 13

VIII 371670 372973 - 03 2283

TGME49_chr 133833 ID=TGME49_231215 ;length= 13

VIII 3 1339664 - 31 2284

TGME49_chr 239343 ID=TGME49_312050;length=33

XI 5 2396761 + 26 2285

TGME49_chr 483404 ID=TGME49_255430;length=25

Vllb 4 4836547 - 03 2286

TGME49_chr ID=TGME49_207940;length= 18

lb 454627 456429 - 02 2287

TGME49_chr 236457 ID=TGME49_245560;length=44

XII 5 2368999 + 24 2288 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_320500;length=48

IV 290608 295448 + 40 2289

TGME49_chr 264779 ID=TGME49_224940;length=28

X 2 2650609 + 17 2290

TGME49_chr ID=TGME49_221720;length= 14

II 595002 596475 - 73 2291

TGME49_chr ID=TGME49_296121 ;length=35

V 6880 10388 - 08 2292

TGME49_chr 493856 ID=TGME49_235550;length=12

X 3 4939819 + 56 2293

TGME49_chr ID=TGME49_220850;length=62

II 24693 25319 - 6 2294

TGME49_chr 656819 ID=TGME49_277700;length=23

XII 0 6570515 - 25 2295

TGME49_chr 205860 ID=TGME49_210787;length=31

IV 3 2061707 + 04 2296

TGME49_chr 445107 ID=TGME49_234550;length=36

X 9 4454735 - 56 2297

TGME49_chr ID=TGME49_228760;length=48

X 155871 160694 - 23 2298

TGME49_chr 704198 ID=TGME49_215590;length=23

X 1 7044359 - 78 2299

TGME49_chr 197916 ID=TGME49_261010;length=21

Vllb 6 1981300 - 34 2300

TGME49_chr ID=TGME49_280605 ;length= 16

Vila 266318 267937 + 19 2301

TGME49_chr 285557 ID=TGME49_233860;length=10

VIII 3 2856668 + 95 2302

TGME49_chr ID=TGME49_208340;length=48

lb 609246 614107 + 61 2303

TGME49_chr 638413 ID=TGME49_214580;length=41

X 4 6388240 - 06 2304

TGME49_chr ID=TGME49_238880;length= 17

VI 280442 282228 + 86 2305

TGME49_chr 184894 ID=TGME49_265005 ;length=37

IX 2 1852708 - 66 2306

TGME49_chr 541898 ID=TGME49_236550;length=33

X 1 5422356 - 75 2307

TGME49_chr 127468 ID=TGME49_294930;length=22

la 1 1276904 + 23 2308

TGME49_chr ID=TGME49_294050;length=21

la 665674 667870 - 96 2309

TGME49_chr 276184 ID=TGME49_259580;length= 16

Vllb 4 2763517 + 73 2310

TGME49_chr 159649 ID=TGME49_211695 ;length= 17

IV 7 1598239 + 42 2311

TGME49_chr 233225 2334583 - ID=TGME49_233000;length=23 2312 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 8 25

TGME49_chr 345118 ID=TGME49_258370;length=20

Vllb 8 3453211 - 23 2313

TGME49_chr 525787 ID=TGME49_305160;length= 18

IX 5 5259734 + 59 2314

TGME49_chr 539273 ID=TGME49_270130;length= 11

VIII 3 5393889 - 56 2315

TGME49_chr 568855 ID=TGME49_217178 ;length=73

XI 1 5689284 - 3 2316

TGME49_chr 115005 ID=TGME49_294800;length= 19

la 1 1152041 - 90 2317

TGME49_chr ID=TGME49_304755;length=22

Vila 691948 694230 - 82 2318

TGME49_chr 419670 ID=TGME49_234220;length=20

X 7 4198737 + 30 2319

TGME49_chr ID=TGME49_239480;length=20

VI 709612 711661 - 49 2320

TGME49_chr 264566 ID=TGME49_246070;length=24

XII 8 2648078 - 10 2321

TGME49_chr ID=TGME49_263700;length=44

Vllb 381484 385908 + 24 2322

TGME49_chr 382280 ID=TGME49_313940;length= 11

XI 5 3823979 + 74 2323

TGME49_chr 549297 ID=TGME49_269970;length=20

VIII 3 5494998 - 25 2324

TGME49_chr 195536 ID=TGME49_297725 ;length=38

II 4 1959181 - 17 2325

TGME49_chr 440245 ID=TGME49_282150;length= 18

Vila 9 4404312 + 53 2326

TGME49_chr 372363 ID=TGME49_223590;length=10

X 4 3724688 - 54 2327

TGME49_chr 302344 ID=TGME49_224510;length= 19

X 2 3025405 - 63 2328

TGME49_chr 195667 ID=TGME49_232210;length=21

VIII 4 1958845 - 71 2329

TGME49_chr 217788 ID=TGME49_264600;length=47

IX 1 2182679 + 98 2330

TGME49_chr 322813 ID=TGME49_244380;length=29

VI 4 3231037 - 03 2331

TGME49_chr ID=TGME49_300190;length=26

XII 161554 164207 + 53 2332

TGME49_chr 238695 ID=TGME49_285490;length=19

V 5 2388910 - 55 2333

TGME49_chr 592380 ID=TGME49_216750;length=45

XI 6 5928393 - 87 2334

TGME49_chr 521860 ID=TGME49_270510;length=22

VIII 4 5220893 + 89 2335 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_307850;length=36

XII 448136 451773 - 37 2336

TGME49_chr ID=TGME49_308890;length=33

XI 329830 333208 + 78 2337

TGME49_chr 498222 ID=TGME49_255230;length=27

Vllb 3 4984936 + 13 2338

TGME49_chr 133892 ID=TGME49_240620;length= 17

VI 1 1340659 + 38 2339

TGME49_chr 271067 ID=TGME49_203175 ;length= 18

Vila 2 2712535 - 63 2340

TGME49_chr ID=TGME49_294300;length=26

la 783634 786312 + 78 2341

TGME49_chr 220706 ID=TGME49_203820;length=44

Vila 7 2211546 - 79 2342

TGME49_chr 141012 ID=TGME49_262040;length= 18

Vllb 4 1411940 + 16 2343

TGME49_chr 176212 ID=TGME49_286620;length=20

V 3 1764217 - 94 2344

TGME49_chr 244417 ID=TGME49_312120;length= 13

XI 1 2445480 - 09 2345

TGME49_chr 230420 ID=TGME49_288030;length=48

IX 0 2309017 + 17 2346

TGME49_chr 310704 ID=TGME49_247020;length= 19

XII 6 3108961 + 15 2347

TGME49_chr 108840 ID=TGME49_219190;length=23

XII 8 1090747 - 39 2348

TGME49_chr 524310 ID=TGME49_270380;length= 14

VIII 1 5244559 - 58 2349

TGME49_chr 366138 ID=TGME49_290240;length= 15

IX 9 3662897 + 08 2350

TGME49_chr 146524 ID=TGME49_209810;length= 15

lb 0 1466759 - 19 2351

TGME49_chr 409399 ID=TGME49_257360;length=31

Vllb 7 4097159 + 62 2352

TGME49_chr 273001 ID=TGME49_288760;length=24

IX 9 2732515 - 96 2353

TGME49_chr 180471 ID=TGME49_311100;length=33

XI 5 1808023 + 08 2354

TGME49_chr 425211 ID=TGME49_234250;length=27

X 3 4254822 - 09 2355

TGME49_chr 360235 ID=TGME49_313677;length=20

XI 3 3604400 - 47 2356

TGME49_chr 291281 ID=TGME49_233940;length= 16

VIII 8 2914466 + 48 2357

TGME49_chr 700063 ID=TGME49_215530;length=38

X 9 7004503 + 64 2358

TGME49_chr 228976 2291993 - ID=TGME49_232940;length=22 2359 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 8 25

TGME49_chr 525819 ID=TGME49_236060;length= 18

X 8 5260065 - 67 2360

TGME49_chr 188448 ID=TGME49_232150;length=49

VIII 5 1889437 + 52 2361

TGME49_chr 188297 ID=TGME49_217510;length=27

XII 6 1885676 + 00 2362

TGME49_chr 104990 ID=TGME49_227280;length=44

X 5 1054351 - 46 2363

TGME49_chr 124626 ID=TGME49_294880;length= 14

la 3 1247746 - 83 2364

TGME49_chr 589219 ID=TGME49_278770;length=27

XII 8 5894986 - 88 2365

TGME49_chr ID=TGME49_220132;length=23

V 160407 162730 - 23 2366

TGME49_chr 375707 ID=TGME49_223520;length=46

X 6 3761771 + 95 2367

TGME49_chr 380916 ID=TGME49_272670;length=29

VIII 6 3812131 - 65 2368

TGME49_chr 281631 ID=TGME49_243720;length=45

VI 4 2820874 - 60 2369

TGME49_chr 186691 ID=TGME49_226010;length=36

X 9 1870597 + 78 2370

TGME49_chr 165590 ID=TGME49_295640;length=48

la 6 1660726 + 20 2371

TGME49_chr 249704 ID=TGME49_245730;length=26

XII 5 2499685 + 40 2372

TGME49_chr 136890 ID=TGME49_209640;length= 16

lb 7 1370576 + 69 2373

TGME49_chr 105516 ID=TGME49_206340;length=31

Vila 3 1058273 - 10 2374

TGME49_chr 210696 ID=TGME49_225720;length= 18

X 8 2108843 + 75 2375

TGME49_chr ID=TGME49_304710;length=35

Vila 647552 651135 - 83 2376

TGME49_chr 243397 ID=TGME49_260230;length= 12

Vllb 7 2435254 + 77 2377

TGME49_chr 509374 ID=TGME49_270660;length=29

VIII 9 5096679 + 30 2378

TGME49_chr 471251 ID=TGME49_292375 ;length=41

IX 2 4716695 + 83 2379

TGME49_chr 382464 ID=TGME49_290670;length=46

IX 8 3829311 + 63 2380

TGME49_chr 293718 ID=TGME49_202930;length=47

Vila 4 2941974 + 90 2381

TGME49_chr ID=TGME49_207760;length=25

lb 264375 266961 + 86 2382 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 106409 ID=TGME49_227150;length=22

X 5 1066354 + 59 2383

TGME49_chr 223865 ID=TGME49_203780;length=21

Vila 1 2240842 + 91 2384

TGME49_chr 551151 ID=TGME49_251660;length=25

XII 6 5514022 + 06 2385

TGME49_chr 272952 ID=TGME49_288755 ;length=29

IX 3 2732515 - 92 2386

TGME49_chr 652363 ID=TGME49_215930;length=88

XI 1 6524512 - 1 2387

TGME49_chr 147705 ID=TGME49_205310;length= 13

Vila 4 1478405 + 51 2388

TGME49_chr 129989 ID=TGME49_294980;length=17

la 6 1301681 + 85 2389

TGME49_chr 547882 ID=TGME49_251600;length=23

XII 6 5481198 + 72 2390

TGME49_chr ID=TGME49_279410;length=23

IX 195627 197977 + 50 2391

TGME49_chr 258525 ID=TGME49_288500;length=24

IX 5 2587746 - 91 2392

TGME49_chr ID=TGME49_228130;length=22

X 551024 553273 + 49 2393

TGME49_chr 124629 ID=TGME49_231110;length= 14

VIII 9 1247749 - 50 2394

TGME49_chr 607677 ID=TGME49_306440;length=31

IX 2 6079949 + 77 2395

TGME49_chr ID=TGME49_307030;length= 13

XI 112403 113739 + 36 2396

TGME49_chr 148672 ID=TGME49_205300;length=24

Vila 4 1489206 - 82 2397

TGME49_chr 390172 ID=TGME49_290870;length=25

IX 3 3904302 + 79 2398

TGME49_chr 300087 ID=TGME49_289110;length= 16

IX 1 3002525 - 54 2399

TGME49_chr 645740 ID=TGME49_277880;length=22

XII 3 6459638 - 35 2400

TGME49_chr 138311 ID=TGME49_310490;length=20

XI 3 1385121 - 08 2401

TGME49_chr 358677 ID=TGME49_247790;length=24

XII 5 3589201 + 26 2402

TGME49_chr 251547 ID=TGME49_243290;length=50

VI 9 2520512 - 33 2403

TGME49_chr 219744 ID=TGME49_285825 ;length=20

V 0 2199492 + 52 2404

TGME49_chr 263966 ID=TGME49_259880;length=24

Vllb 5 2642073 - 08 2405

TGME49_chr 431416 433165 - ID=TGME49_320220;length= 17 2406 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

IV 49

TGME49_chr 520247 ID=TGME49_235960;length=14

X 5 5203919 + 44 2407

TGME49_chr 171675 ID=TGME49_295590;length=19

la 8 1718720 - 62 2408

TGME49_chr 515548 ID=TGME49_316140;length= 16

XI 6 5157169 - 83 2409

TGME49_chr 315670 ID=TGME49_289300;length=19

IX 5 3158620 + 15 2410

TGME49_chr 143303 ID=TGME49_310530;length=42

XI 1 1437316 - 85 2411

TGME49_chr 686030 ID=TGME49_276900;length=25

XII 2 6862811 + 09 2412

TGME49_chr 497621 ID=TGME49_315780;length=40

XI 9 4980235 + 16 2413

TGME49_chr 234089 ID=TGME49_285530;length=31

V 0 2344008 + 18 2414

TGME49_chr 153988 ID=TGME49_265460;length=34

IX 8 1543369 - 81 2415

TGME49_chr ID=TGME49_228250;length= 14

X 447801 449204 + 03 2416

TGME49_chr 314005 ID=TGME49_244260;length=40

VI 4 3144151 + 97 2417

TGME49_chr 158400 ID=TGME49_231850;length=41

VIII 8 1588110 + 02 2418

TGME49_chr 286200 ID=TGME49_246570;length= 19

XII 4 2863907 - 03 2419

TGME49_chr 266441 ID=TGME49_312410;length= 19

XI 5 2666362 + 47 2420

TGME49_chr 421947 ID=TGME49_257110;length= 18

Vllb 8 4221342 - 64 2421

TGME49_chr ID=TGME49_206620;length=45

Vila 771238 775785 + 47 2422

TGME49_chr 198867 ID=TGME49_297780;length= 15

II 1 1990219 + 48 2423

TGME49_chr 220586 ID=TGME49_264468;length=96

IX 0 2206820 + 0 2424

TGME49_chr 357578 ID=TGME49_258100;length= 17

Vllb 6 3577531 - 45 2425

TGME49_chr 232473 ID=TGME49_301218 ;length= 10

IV 2 2325789 - 57 2426

TGME49_chr 303307 ID=TGME49_246960;length=43

XII 6 3037430 + 54 2427

TGME49_chr 636074 ID=TGME49_216180;length=45

XI 3 6365335 - 92 2428

TGME49_chr 436474 ID=TGME49_282055;length=36

Vila 3 4368384 - 41 2429 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 426557 ID=TGME49_271950;length=42

VIII 5 4269776 + 01 2430

TGME49_chr ID=TGME49_267830;length=91

IX 327730 328648 - 8 2431

TGME49_chr 138188 ID=TGME49_223040;length= 19

II 1 1383797 - 16 2432

TGME49_chr 318912 ID=TGME49_258780;length=47

Vllb 8 3193873 - 45 2433

TGME49_chr 660251 ID=TGME49_214940;length=24

X 9 6604937 + 18 2434

TGME49_chr 622111 ID=TGME49_278220;length= 16

XII 1 6222788 + 77 2435

TGME49_chr 239457 ID=TGME49_288220;length= 17

IX 6 2396348 + 72 2436

TGME49_chr 169684 ID=TGME49_310960;length=37

XI 9 1700575 - 26 2437

TGME49_chr 178817 ID=TGME49_226110;length=29

X 2 1791138 + 66 2438

TGME49_chr 244228 ID=TGME49_312110;length=31

XI 2 2445480 - 98 2439

TGME49_chr ID=TGME49_279370;length=29

IX 219922 222865 + 43 2440

TGME49_chr 487335 ID=TGME49_315650;length= 13

XI 8 4874729 - 71 2441

TGME49_chr 744639 ID=TGME49_207200;length= 17

X 8 7448098 - 00 2442

TGME49_chr 224509 ID=TGME49_285780;length=15

V 7 2246666 - 69 2443

TGME49_chr ID=TGME49_304480;length=30

Vila 515933 519026 + 93 2444

TGME49_chr 174008 ID=TGME49_321540;length=24

lb 0 1742565 + 85 2445

TGME49_chr 425793 ID=TGME49_314740;length= 16

XI 3 4259598 + 65 2446

TGME49_chr ID=TGME49_208540;length=32

lb 761021 764259 + 38 2447

TGME49_chr 345096 ID=TGME49_244680;length=22

VI 6 3453208 - 42 2448

TGME49_chr 475504 ID=TGME49_315490;length=42

XI 2 4759341 - 99 2449

TGME49_chr 737277 ID=TGME49_207070;length=24

X 3 7375242 - 69 2450

TGME49_chr 158713 ID=TGME49_310770;length= 17

XI 9 1588848 - 09 2451

TGME49_chr ID=TGME49_230660;length= 18

VIII 943428 945265 - 37 2452

TGME49_chr 552123 553994 - ID=TGME49_228145 ;length= 18 2453 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

X 71

TGME49_chr 345971 ID=TGME49_289790;length=26

IX 4 3462359 - 45 2454

TGME49_chr 333906 ID=TGME49_202410;length=37

Vila 6 3342796 + 30 2455

TGME49_chr 649150 ID=TGME49_215980;length=29

XI 4 6494435 + 31 2456

TGME49_chr 258371 ID=TGME49_245990;length= 19

XII 2 2585703 - 91 2457

TGME49_chr 263743 ID=TGME49_243440;length=26

VI 9 2640089 - 50 2458

TGME49_chr 238613 ID=TGME49_285480;length=23

V 0 2388506 + 76 2459

TGME49_chr 301562 ID=TGME49_246910;length=21

XII 0 3017791 - 71 2460

TGME49_chr 430270 ID=TGME49_271888 ;length=30

VIII 0 4305790 - 90 2461

TGME49_chr 186625 ID=TGME49_204350;length=35

Vila 1 1869833 - 82 2462

TGME49_chr 557001 ID=TGME49_236910;length=27

X 3 5572759 - 46 2463

TGME49_chr 317957 ID=TGME49_273905 ;length=23

VIII 8 3181888 + 10 2464

TGME49_chr 622031 ID=TGME49_214295 ;length= 19

X 4 6222244 - 30 2465

TGME49_chr 232316 ID=TGME49_288040;length=20

IX 4 2325191 - 27 2466

TGME49_chr 560387 ID=TGME49_236950;length=22

X 7 5606138 - 61 2467

TGME49_chr 302787 ID=TGME49_244105 ;length=54

VI 4 3028418 + 4 2468

TGME49_chr 289433 ID=TGME49_259200;length=32

Vllb 7 2897600 - 63 2469

TGME49_chr 558816 ID=TGME49_269800;length=43

VIII 3 5592490 + 27 2470

TGME49_chr 211429 ID=TGME49_225690;length=30

X 8 2117359 + 61 2471

TGME49_chr 480677 ID=TGME49_249810;length= 17

XII 7 4808523 + 46 2472

TGME49_chr ID=TGME49_279405 ;length=59

IX 199072 199671 + 9 2473

TGME49_chr 276832 ID=TGME49_224860;length=49

X 6 2773227 - 01 2474

TGME49_chr 148738 ID=TGME49_295920;length=36

la 8 1491082 - 94 2475

TGME49_chr 222675 ID=TGME49_225460;length=87

X 5 2227625 - 0 2476 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 420357 ID=TGME49_281575;length=39

Vila 4 4207561 - 87 2477

TGME49_chr 514360 ID=TGME49_304990;length=28

IX 8 5146410 + 02 2478

TGME49_chr ID=TGME49_221930;length=33

II 700644 704009 + 65 2479

TGME49_chr 488775 ID=TGME49_270900;length= 17

VIII 3 4889510 + 57 2480

TGME49_chr ID=TGME49_264010;length=21

Vllb 238808 241001 - 93 2481

TGME49_chr 649445 ID=TGME49_214780;length=30

X 8 6497510 - 52 2482

TGME49_chr 394502 ID=TGME49_212275 ;length=24

X 5 3947454 + 29 2483

TGME49_chr ID=TGME49_309752;length=24

XI 784810 787257 + 47 2484

TGME49_chr ID=TGME49_263280;length=48

Vllb 654090 658976 + 86 2485

TGME49_chr 111653 ID=TGME49_240255;length=26

VI 3 1119178 + 45 2486

TGME49_chr 323292 ID=TGME49_273840;length=20

VIII 0 3235018 - 98 2487

TGME49_chr 449972 ID=TGME49_234622;length=28

X 9 4502574 + 45 2488

TGME49_chr 143972 ID=TGME49_213920;length=27

V 1 1442480 - 59 2489

TGME49_chr 340936 ID=TGME49_258450;length=44

Vllb 6 3413789 + 23 2490

TGME49_chr 142475 ID=TGME49_262010;length= 16

Vllb 4 1426396 + 42 2491

TGME49_chr ID=TGME49_267760;length=23

IX 371574 373882 + 08 2492

TGME49_chr 558432 ID=TGME49_269830;length=20

VIII 8 5586338 + 10 2493

TGME49_chr ID=TGME49_239110;length= 17

VI 465237 466979 + 42 2494

TGME49_chr 266989 ID=TGME49_224932;length=20

X 4 2671902 - 08 2495

TGME49_chr 442220 ID=TGME49_282170;length=35

Vila 1 4425705 - 04 2496

TGME49_chr ID=TGME49_309280;length=29

XI 611994 614956 - 62 2497

TGME49_chr ID=TGME49_208460;length=37

lb 717976 721681 - 05 2498

TGME49_chr 142348 ID=TGME49_218570;length=26

XII 5 1426090 - 05 2499

TGME49_chr 230042 234619 - ID=TGME49_300080;length=45 2500 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XII 77

TGME49_chr 245764 ID=TGME49_245660;length=23

XII 7 2459970 - 23 2501

TGME49_chr 676155 ID=TGME49_200250;length=49

VIII 2 6766520 - 68 2502

TGME49_chr 228799 ID=TGME49_245460;length= 17

XII 9 2289797 - 98 2503

TGME49_chr 376732 ID=TGME49_248270;length=36

XII 5 3771010 + 85 2504

TGME49_chr 426706 ID=TGME49_257040;length= 14

Vllb 2 4268545 + 83 2505

TGME49_chr 573974 ID=TGME49_216980;length= 16

XI 7 5741353 + 06 2506

TGME49_chr 356600 ID=TGME49_202160;length= 14

Vila 6 3567436 + 30 2507

TGME49_chr ID=TGME49_229630;length=48

VIII 321618 326501 + 83 2508

TGME49_chr 451846 ID=TGME49_315140;length=23

XI 1 4520837 - 76 2509

TGME49_chr 594950 ID=TGME49_237530;length=41

X 4 5953689 + 85 2510

TGME49_chr 378040 ID=TGME49_223490;length=20

X 6 3782419 + 13 2511

TGME49_chr 115623 ID=TGME49_222660;length=45

II 1 1160734 + 03 2512

TGME49_chr 285174 ID=TGME49_243770;length= 12

VI 8 2852954 - 06 2513

TGME49_chr ID=TGME49_239310;length=21

VI 539374 541507 + 33 2514

TGME49_chr 299776 ID=TGME49_246780;length=47

XII 4 3002544 - 80 2515

TGME49_chr ID=TGME49_219570;length=26

XII 849594 852277 - 83 2516

TGME49_chr 224072 ID=TGME49_225435 ;length= 17

X 0 2242446 + 26 2517

TGME49_chr 200550 ID=TGME49_232310;length=24

VIII 4 2007905 - 01 2518

TGME49_chr ID=TGME49_264210;length=23

Vllb 82870 85185 - 15 2519

TGME49_chr ID=TGME49_221340;length=49

II 267810 272793 + 83 2520

TGME49_chr 246881 ID=TGME49_285268;length=23

V 0 2471121 + 11 2521

TGME49_chr 244529 ID=TGME49_288320;length= 16

IX 6 2446946 - 50 2522

TGME49_chr ID=TGME49_280740;length=21

Vila 133327 135436 - 09 2523 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_208790;length=30

lb 864047 867064 - 17 2524

TGME49_chr 676467 ID=TGME49_215130;length=44

X 4 6769143 + 69 2525

TGME49_chr 230633 ID=TGME49_225340;length=30

X 0 2309400 - 70 2526

TGME49_chr 330001 ID=TGME49_273740;length=20

VIII 3 3302035 - 22 2527

TGME49_chr 210221 ID=TGME49_264700;length= 15

IX 3 2103786 + 73 2528

TGME49_chr ID=TGME49_308820;length= 13

XI 288223 289585 - 62 2529

TGME49_chr 719188 ID=TGME49_275340;length=38

X 5 7195726 - 41 2530

TGME49_chr 348767 ID=TGME49_289850;length=50

IX 2 3492693 - 21 2531

TGME49_chr 702812 ID=TGME49_215570;length=40

X 4 7032199 - 75 2532

TGME49_chr 218671 ID=TGME49_203830;length=40

Vila 6 2190767 + 51 2533

TGME49_chr 425163 ID=TGME49_271980;length= 18

VIII 9 4253470 - 31 2534

TGME49_chr 164306 ID=TGME49_321680;length=30

lb 9 1646070 - 01 2535

TGME49_chr 560245 ID=TGME49_305820;length= 17

IX 0 5604206 + 56 2536

TGME49_chr 299196 ID=TGME49_274140;length= 13

VIII 8 2993305 - 37 2537

TGME49_chr 529929 ID=TGME49_270273 ;length=24

VIII 7 5301755 - 58 2538

TGME49_chr 215024 ID=TGME49_225570;length=20

X 5 2152288 - 43 2539

TGME49_chr ID=TGME49_239010;length=48

VI 394289 399092 - 03 2540

TGME49_chr 187562 ID=TGME49_286270;length=45

V 0 1880133 - 13 2541

TGME49_chr 136638 ID=TGME49_310455;length=20

XI 1 1368421 - 40 2542

TGME49_chr 411371 ID=TGME49_248720;length=37

XII 0 4117497 + 87 2543

TGME49_chr 265367 ID=TGME49_246080;length=24

XII 6 2656148 - 72 2544

TGME49_chr 134947 ID=TGME49_213810;length=39

V 9 1353386 - 07 2545

TGME49_chr 229051 ID=TGME49_203720;length= 17

Vila 6 2292314 + 98 2546

TGME49_chr 110162 1104332 - ID=TGME49_318770;length=27 2547 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

IV 7 05

TGME49_chr ID=TGME49_263610;length=32

Vllb 428872 432171 + 99 2548

TGME49_chr 609434 ID=TGME49_306450;length=42

IX 3 6098588 - 45 2549

TGME49_chr ID=TGME49_206480;length=22

Vila 945669 947912 - 43 2550

TGME49_chr 106138 ID=TGME49_240200;length= 15

VI 9 1062893 + 04 2551

TGME49_chr 527649 ID=TGME49_236080;length=18

X 3 5278337 - 44 2552

TGME49_chr 440080 ID=TGME49_291930;length=48

IX 3 4405615 + 12 2553

TGME49_chr 183045 ID=TGME49_211310;length=43

IV 8 1834833 + 75 2554

TGME49_chr ID=TGME49_319658 ;length=22

IV 800777 803054 + 77 2555

TGME49_chr ID=TGME49_221440;length= 18

II 355915 357775 - 60 2556

TGME49_chr 185770 ID=TGME49_311200;length=20

XI 6 1859770 - 64 2557

TGME49_chr ID=TGME49_320640;length= 18

IV 173575 175464 + 89 2558

TGME49_chr 498112 ID=TGME49_307575 ;length=22

IX 0 4983371 - 51 2559

TGME49_chr 151451 ID=TGME49_297060;length= 19

II 5 1516458 - 43 2560

TGME49_chr 375536 ID=TGME49_201830;length= 14

Vila 8 3756835 + 67 2561

TGME49_chr 148470 ID=TGME49_240860;length=22

VI 1 1486911 - 10 2562

TGME49_chr ID=TGME49_309030;length= 16

XI 451688 453382 - 94 2563

TGME49_chr ID=TGME49_263290;length=43

Vllb 655477 659865 - 88 2564

TGME49_chr ID=TGME49_228440;length= 19

X 309061 310991 + 30 2565

TGME49_chr 436328 ID=TGME49_282070;length=44

Vila 4 4367705 + 21 2566

TGME49_chr 339917 ID=TGME49_224000;length=27

X 7 3401893 - 16 2567

TGME49_chr 311090 ID=TGME49_289240;length=37

IX 2 3114645 - 43 2568

TGME49_chr 714920 ID=TGME49_215775 ;length=36

X 9 7152824 + 15 2569

TGME49_chr ID=TGME49_320030;length=23

IV 566220 568593 + 73 2570 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 436027 ID=TGME49_271800;length=27

VIII 3 4363025 - 52 2571

TGME49_chr 214979 ID=TGME49_217810;length=66

XII 3 2150458 - 5 2572

TGME49_chr 167741 ID=TGME49_226370;length=31

X 2 1680531 + 19 2573

TGME49_chr 200953 ID=TGME49_264805 ;length=28

IX 7 2012433 - 96 2574

TGME49_chr ID=TGME49_219500;length=26

XII 898597 901257 + 60 2575

TGME49_chr 506870 ID=TGME49_270700;length=24

VIII 4 5071169 - 65 2576

TGME49_chr 202767 ID=TGME49_204050;length=30

Vila 1 2030677 + 06 2577

TGME49_chr 164942 ID=TGME49_231920;length= 18

VIII 6 1651273 - 47 2578

TGME49_chr 500812 ID=TGME49_250060;length= 17

XII 0 5009885 + 65 2579

TGME49_chr ID=TGME49_229690;length=31

VIII 388039 391218 - 79 2580

TGME49_chr 245238 ID=TGME49_288340;length=43

IX 6 2456727 + 41 2581

TGME49_chr 382189 ID=TGME49_272645 ;length=21

VIII 5 3824021 - 26 2582

TGME49_chr 139023 ID=TGME49_310515 ;length= 11

XI 1 1391395 + 64 2583

TGME49_chr 115147 ID=TGME49_310200;length= 14

XI 8 1152895 - 17 2584

TGME49_chr 345004 ID=TGME49_244690;length= 19

VI 2 3452022 + 80 2585

TGME49_chr ID=TGME49_238010;length= 14

VI 8252 9653 + 01 2586

TGME49_chr 105445 ID=TGME49_206330;length=26

Vila 3 1057137 + 84 2587

TGME49_chr 398583 ID=TGME49_257540;length= 13

Vllb 2 3987219 + 87 2588

TGME49_chr ID=TGME49_267070;length=20

IX 797312 799366 - 54 2589

TGME49_chr 311326 ID=TGME49_202670;length=39

Vila 9 3117191 + 22 2590

TGME49_chr 110892 ID=TGME49_266480;length=20

IX 2 1111021 - 99 2591

TGME49_chr 600755 ID=TGME49_269190;length= 16

VIII 7 6009244 - 87 2592

TGME49_chr 188862 ID=TGME49_297647;length=43

II 3 1893021 - 98 2593

TGME49_chr 871531 873751 - ID=TGME49_213240;length=22 2594 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

V 20

TGME49_chr ID=TGME49_229170;length=37

VIII 63466 67252 + 86 2595

TGME49_chr 177782 ID=TGME49_286570;length=22

V 2 1780056 + 34 2596

TGME49_chr 222510 ID=TGME49_225450;length= 17

X 0 2226843 + 43 2597

TGME49_chr ID=TGME49_230030;length=24

VIII 561740 564177 - 37 2598

TGME49_chr 193899 ID=TGME49_225910;length= 19

X 3 1940952 + 59 2599

TGME49_chr ID=TGME49_207965 ;length=47

lb 471344 476049 + 05 2600

TGME49_chr 497891 ID=TGME49_307580;length=34

IX 6 4982404 + 88 2601

TGME49_chr ID=TGME49_280710;length=22

Vila 160728 162946 - 18 2602

TGME49_chr ID=TGME49_280590;length=49

Vila 275203 280157 + 54 2603

TGME49_chr ID=TGME49_263030;length=l 1

Vllb 858403 859587 + 84 2604

TGME49_chr 660043 ID=TGME49_268250;length=17

VIII 8 6602216 + 78 2605

TGME49_chr 439589 ID=TGME49_282140;length=26

Vila 4 4398590 + 96 2606

TGME49_chr ID=TGME49_293250;length=37

la 192097 195881 + 84 2607

TGME49_chr 225290 ID=TGME49_285750;length=14

V 0 2254312 + 12 2608

TGME49_chr 131716 ID=TGME49_310400;length=20

XI 0 1319239 + 79 2609

TGME49_chr 166471 ID=TGME49_231950;length=22

VIII 8 1667016 + 98 2610

TGME49_chr 289972 ID=TGME49_288990;length=20

IX 9 2901760 - 31 2611

TGME49_chr 697253 ID=TGME49_215470;length= 13

X 9 6973901 + 62 2612

TGME49_chr 439677 ID=TGME49_291890;length=35

IX 6 4400328 - 52 2613

TGME49_chr ID=TGME49_267120;length=36

IX 772170 775776 - 06 2614

TGME49_chr 111045 ID=TGME49_310173 ;length=36

XI 5 1114100 + 45 2615

TGME49_chr 571105 ID=TGME49_278990;length=24

XII 1 5713503 - 52 2616

TGME49_chr 111855 ID=TGME49_318730;length=38

IV 9 1122425 - 66 2617 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 125919 ID=TGME49_205625;length=26

Vila 2 1261805 - 13 2618

TGME49_chr 694154 ID=TGME49_215390;length= 14

X 3 6942949 - 06 2619

TGME49_chr 438564 ID=TGME49_234440;length=47

X 4 4390371 + 27 2620

TGME49_chr 452551 ID=TGME49_234690;length=26

X 9 4528145 + 26 2621

TGME49_chr 391709 ID=TGME49_314090;length=21

XI 0 3919270 - 80 2622

TGME49_chr 637074 ID=TGME49_216155 ;length=50

XI 0 6375773 + 33 2623

TGME49_chr 192567 ID=TGME49_286200;length=23

V 3 1927982 - 09 2624

TGME49_chr 638184 ID=TGME49_277950;length=45

XII 1 6386433 + 92 2625

TGME49_chr ID=TGME49_227810;length= 18

X 804068 805942 + 74 2626

TGME49_chr 466846 ID=TGME49_255870;length=21

Vllb 0 4670573 + 13 2627

TGME49_chr 229222 ID=TGME49_232960;length=30

VIII 3 2295223 + 00 2628

TGME49_chr 531086 ID=TGME49_270250;length=23

VIII 0 5313259 - 99 2629

TGME49_chr ID=TGME49_219630;length= 14

XII 802072 803527 + 55 2630

TGME49_chr 454670 ID=TGME49_271360;length= 19

VIII 0 4548626 + 26 2631

TGME49_chr ID=TGME49_309265 ;length= 16

XI 599056 600665 + 09 2632

TGME49_chr 392454 ID=TGME49_272530;length=23

VIII 9 3926856 - 07 2633

TGME49_chr 125576 ID=TGME49_310310;length= 18

XI 6 1257660 - 94 2634

TGME49_chr 276283 ID=TGME49_259590;length= 17

Vllb 2 2764576 - 44 2635

TGME49_chr 245605 ID=TGME49_225140;length=31

X 6 2459220 - 64 2636

TGME49_chr 577592 ID=TGME49_269460;length=21

VIII 1 5778051 + 30 2637

TGME49_chr 301653 ID=TGME49_244090;length=24

VI 2 3018964 - 32 2638

TGME49_chr 167784 ID=TGME49_205000;length=36

Vila 1 1681517 + 76 2639

TGME49_chr 358235 ID=TGME49_313650;length=31

XI 1 3585505 - 54 2640

TGME49_chr 127110 1273811 - ID=TGME49_222860;length=27 2641 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

II 9 02

TGME49_chr ID=TGME49_308096;length=18

XII 577408 579217 - 09 2642

TGME49_chr 317551 ID=TGME49_247195 ;length=21

XII 4 3177639 - 25 2643

TGME49_chr 257184 ID=TGME49_203330;length=24

Vila 0 2574320 - 80 2644

TGME49_chr 252324 ID=TGME49_203380;length=46

Vila 0 2527928 + 88 2645

TGME49_chr 194144 ID=TGME49_211080;length=l 1

IV 1 1942608 + 67 2646

TGME49_chr 608943 ID=TGME49_278460;length=38

XII 4 6093280 - 46 2647

TGME49_chr 608105 ID=TGME49_278490;length=19

XII 8 6083036 - 78 2648

TGME49_chr 294113 ID=TGME49_246720;length=26

XII 7 2943790 + 53 2649

TGME49_chr 319629 ID=TGME49_244335;length=33

VI 3 3199662 - 69 2650

TGME49_chr 103157 ID=TGME49_262780;length=45

Vllb 1 1036168 - 97 2651

TGME49_chr 130318 ID=TGME49_240510;length= 19

VI 1 1305102 - 21 2652

TGME49_chr 202165 ID=TGME49_210840;length= 17

IV 8 2023438 - 80 2653

TGME49_chr 322642 ID=TGME49_273850;length= 16

VIII 2 3228068 - 46 2654

TGME49_chr 310147 ID=TGME49_258900;length=24

Vllb 2 3103925 - 53 2655

TGME49_chr 116083 ID=TGME49_310220;length=33

XI 5 1164143 + 08 2656

TGME49_chr ID=TGME49_309195 ;length= 16

XI 551152 552779 + 27 2657

TGME49_chr 283782 ID=TGME49_312622;length=22

XI 1 2840040 - 19 2658

TGME49_chr 138704 ID=TGME49_223060;length=35

II 1 1390571 + 30 2659

TGME49_chr 194144 ID=TGME49_211070;length= 18

IV 1 1943297 + 56 2660

TGME49_chr ID=TGME49_309740;length=26

XI 785847 788515 - 68 2661

TGME49_chr 305224 ID=TGME49_224470;length=23

X 3 3054585 - 42 2662

TGME49_chr 427765 ID=TGME49_291370;length= 16

IX 9 4279338 - 79 2663

TGME49_chr ID=TGME49_309800;length=20

XI 826261 828335 - 74 2664 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 684026 ID=TGME49_276930;length=24

XII 2 6842703 + 41 2665

TGME49_chr ID=TGME49_320450;length=23

IV 346077 348428 - 51 2666

TGME49_chr 164998 ID=TGME49_218250;length= 13

XII 8 1651365 - 77 2667

TGME49_chr 515101 ID=TGME49_250730;length= 17

XII 0 5152773 + 63 2668

TGME49_chr 400247 ID=TGME49_212160;length=30

X 0 4005528 + 58 2669

TGME49_chr 212693 ID=TGME49_260600;length=28

Vllb 2 2129789 + 57 2670

TGME49_chr 440680 ID=TGME49_314910;length=41

XI 5 4410963 - 58 2671

TGME49_chr 625755 ID=TGME49_268835;length=18

VIII 6 6259358 + 02 2672

TGME49_chr 187538 ID=TGME49_217490;length= 12

XII 1 1876656 + 75 2673

TGME49_chr 385675 ID=TGME49_290710;length= 11

IX 6 3857919 - 63 2674

TGME49_chr ID=TGME49_229930;length=42

VIII 470060 474349 + 89 2675

TGME49_chr 402592 ID=TGME49_314358 ;length=48

XI 0 4030724 + 04 2676

TGME49_chr ID=TGME49_207955 ;length= 17

lb 460087 461835 - 48 2677

TGME49_chr 249073 ID=TGME49_203480;length=23

Vila 1 2493062 - 31 2678

TGME49_chr ID=TGME49_221685;length=32

II 562040 565277 + 37 2679

TGME49_chr 246215 ID=TGME49_285290;length=32

V 1 2465375 - 24 2680

TGME49_chr ID=TGME49_212850;length=20

V 645279 647282 + 03 2681

TGME49_chr 533141 ID=TGME49_270210;length= 18

VIII 4 5333301 + 87 2682

TGME49_chr ID=TGME49_262935 ;length= 19

Vllb 912343 914333 - 90 2683

TGME49_chr 412687 ID=TGME49_281450;length=20

Vila 1 4128947 + 76 2684

TGME49_chr 215890 ID=TGME49_242650;length= 19

VI 9 2160846 + 37 2685

TGME49_chr 412989 ID=TGME49_257330;length=14

Vllb 4 4131374 - 80 2686

TGME49_chr 155500 ID=TGME49_218380;length=50

XII 9 1560065 - 56 2687

TGME49_chr 379805 3802342 - ID=TGME49_248300;length=42 2688 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XII 8 84

TGME49_chr 277365 ID=TGME49_224850;length=26

X 7 2776323 + 66 2689

TGME49_chr ID=TGME49_219550;length= 15

XII 859876 861462 - 86 2690

TGME49_chr 313382 ID=TGME49_258840;length=38

Vllb 0 3137682 + 62 2691

TGME49_chr 456077 ID=TGME49_292120;length=21

IX 8 4562922 + 44 2692

TGME49_chr 131779 ID=TGME49_266070;length=21

IX 9 1319959 + 60 2693

TGME49_chr ID=TGME49_280810;length=30

Vila 58368 61406 - 38 2694

TGME49_chr 636518 ID=TGME49_268690;length= 13

VIII 9 6366539 - 50 2695

TGME49_chr 605897 ID=TGME49_216600;length= 16

XI 8 6060618 - 40 2696

TGME49_chr 133479 ID=TGME49_240590;length=20

VI 9 1336826 - 27 2697

TGME49_chr 197348 ID=TGME49_286130;length=21

V 9 1975650 - 61 2698

TGME49_chr 110119 ID=TGME49_262655;length=48

Vllb 3 1106043 - 50 2699

TGME49_chr ID=TGME49_221510;length= 16

II 403605 405254 + 49 2700

TGME49_chr 173532 ID=TGME49_211420;length=24

IV 2 1737748 - 26 2701

TGME49_chr 465098 ID=TGME49_271182;length=32

VIII 5 4654271 + 86 2702

TGME49_chr ID=TGME49_262930;length=36

Vllb 921322 925018 - 96 2703

TGME49_chr 231179 ID=TGME49_242800;length=20

VI 7 2313889 - 92 2704

TGME49_chr 482013 ID=TGME49_210390;length= 18

IX 1 4821972 + 41 2705

TGME49_chr 162667 ID=TGME49_297220;length=30

II 5 1629738 - 63 2706

TGME49_chr 344546 ID=TGME49_247630;length=50

XII 0 3450482 + 22 2707

TGME49_chr 480009 ID=TGME49_235350;length=17

X 8 4801840 - 42 2708

TGME49_chr ID=TGME49_230160;length= 19

VIII 651925 653885 - 60 2709

TGME49_chr 667493 ID=TGME49_268035;length= 16

VIII 3 6676585 + 52 2710

TGME49_chr ID=TGME49_307260;length=38

XI 26464 30284 - 20 2711 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 311517 ID=TGME49_244200;length=21

VI 0 3117323 - 53 2712

TGME49_chr ID=TGME49_219510;length=28

XII 899075 901898 - 23 2713

TGME49_chr 716937 ID=TGME49_275320;length=34

X 1 7172828 + 57 2714

TGME49_chr 295344 ID=TGME49_246740;length= 18

XII 2 2955259 + 17 2715

TGME49_chr 144546 ID=TGME49_295990;length=38

la 8 1449325 + 57 2716

TGME49_chr 461405 ID=TGME49_255940;length=39

Vllb 5 4617990 + 35 2717

TGME49_chr 147167 ID=TGME49_209820;length=33

lb 3 1474994 - 21 2718

TGME49_chr ID=TGME49_239795 ;length=30

VI 918303 921331 + 28 2719

TGME49_chr 134373 ID=TGME49_205540;length=26

Vila 9 1346394 + 55 2720

TGME49_chr 325915 ID=TGME49_313305 ;length= 13

XI 3 3260521 - 68 2721

TGME49_chr 546161 ID=TGME49_305560;length=48

IX 9 5466485 + 66 2722

TGME49_chr 299743 ID=TGME49_283740;length=20

V 7 2999458 + 21 2723

TGME49_chr 490373 ID=TGME49_210270;length=37

IX 6 4907468 + 32 2724

TGME49_chr 332321 ID=TGME49_202450;length=44

Vila 7 3327698 - 81 2725

TGME49_chr 530604 ID=TGME49_236153;length=93

X 5 5306976 + 1 2726

TGME49_chr 116682 ID=TGME49_318650;length=20

IV 5 1168908 + 83 2727

TGME49_chr ID=TGME49_262840;length=20

Vllb 982149 984168 - 19 2728

TGME49_chr 543533 ID=TGME49_305490;length=27

IX 4 5438062 + 28 2729

TGME49_chr 298915 ID=TGME49_312930;length= 12

XI 2 2990445 - 93 2730

TGME49_chr 363241 ID=TGME49_290180;length= 19

IX 8 3634414 - 96 2731

TGME49_chr 184398 ID=TGME49_204380;length=27

Vila 2 1846698 - 16 2732

TGME49_chr 188718 ID=TGME49_232140;length=33

VIII 6 1890535 - 49 2733

TGME49_chr ID=TGME49_319630;length=27

IV 823132 825847 - 15 2734

TGME49_chr 263672 2639733 + ID=TGME49_243450;length=30 2735 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VI 9 04

TGME49_chr 385673 ID=TGME49_223410;length=22

X 1 3858953 + 22 2736

TGME49_chr 467842 ID=TGME49_249600;length=44

XII 1 4682858 - 37 2737

TGME49_chr 450992 ID=TGME49_315130;length= 16

XI 2 4511567 - 45 2738

TGME49_chr 115306 ID=TGME49_294812;length= 15

la 6 1154657 + 91 2739

TGME49_chr ID=TGME49_267350;length=21

IX 706165 708319 - 54 2740

TGME49_chr 439256 ID=TGME49_249250;length=47

XII 7 4397271 + 04 2741

TGME49_chr 362047 ID=TGME49_273200;length=21

VIII 8 3622646 + 68 2742

TGME49_chr 304877 ID=TGME49_289180;length=36

IX 5 3052445 - 70 2743

TGME49_chr 578643 ID=TGME49_237220;length=29

X 5 5789375 + 40 2744

TGME49_chr 344859 ID=TGME49_223920;length=49

X 6 3453595 + 99 2745

TGME49_chr 161881 ID=TGME49_241010;length=32

VI 1 1622040 + 29 2746

TGME49_chr 280869 ID=TGME49_246500;length=44

XII 4 2813152 - 58 2747

TGME49_chr ID=TGME49_266820;length=18

IX 952354 954209 - 55 2748

TGME49_chr ID=TGME49_228770;length=20

X 152090 154165 - 75 2749

TGME49_chr 324344 ID=TGME49_247305 ;length= 16

XII 8 3245120 + 72 2750

TGME49_chr 102868 ID=TGME49_227320;length= 18

X 4 1030566 - 82 2751

TGME49_chr ID=TGME49_239087;length=22

VI 453972 456175 + 03 2752

TGME49_chr ID=TGME49_229670;length= 15

VIII 370373 371876 + 03 2753

TGME49_chr 220509 ID=TGME49_217890;length=22

XII 5 2207381 - 86 2754

TGME49_chr 117171 ID=TGME49_230990;length=46

VIII 9 1176344 + 25 2755

TGME49_chr 671969 ID=TGME49_215040;length=22

X 7 6721912 - 15 2756

TGME49_chr 362359 ID=TGME49_247920;length= 17

XII 4 3625340 - 46 2757

TGME49_chr 592120 ID=TGME49_306290;length=38

IX 8 5925102 - 94 2758 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_262890;length= 18

Vllb 945683 947561 + 78 2759

TGME49_chr 379335 ID=TGME49_272695 ;length=47

VIII 1 3798091 - 40 2760

TGME49_chr ID=TGME49_263870;length= 12

Vllb 283563 284791 - 28 2761

TGME49_chr 289867 ID=TGME49_312680;length=23

XI 5 2900997 + 22 2762

TGME49_chr 426949 ID=TGME49_314760;length= 17

XI 3 4271271 + 78 2763

TGME49_chr ID=TGME49_319860;length=25

IV 743496 746016 - 20 2764

TGME49_chr 594475 ID=TGME49_278670;length=20

XII 2 5946781 - 29 2765

TGME49_chr ID=TGME49_263760;length= 16

Vllb 329609 331237 + 28 2766

TGME49_chr 361269 ID=TGME49_258070;length=20

Vllb 6 3614760 - 64 2767

TGME49_chr ID=TGME49_221580;length=48

II 446960 451815 + 55 2768

TGME49_chr 514200 ID=TGME49_250710;length= 19

XII 0 5143982 + 82 2769

TGME49_chr ID=TGME49_280500;length=24

Vila 348808 351228 - 20 2770

TGME49_chr 162631 ID=TGME49_218280;length=20

XII 5 1628343 + 28 2771

TGME49_chr 498217 ID=TGME49_270810;length=48

VIII 0 4987009 - 39 2772

TGME49_chr 521198 ID=TGME49_316250;length= 17

XI 6 5213771 - 85 2773

TGME49_chr 141959 ID=TGME49_205398;length=46

Vila 9 1424205 + 06 2774

TGME49_chr ID=TGME49_219678 ;length= 19

XII 736337 738273 + 36 2775

TGME49_chr 196947 ID=TGME49_217640;length=20

XII 2 1971512 - 40 2776

TGME49_chr 215885 ID=TGME49_225560;length=41

X 7 2163020 - 63 2777

TGME49_chr 152708 ID=TGME49_287210;length=22

V 4 1529358 - 74 2778

TGME49_chr 298568 ID=TGME49_246763 ;length= 13

XII 1 2987023 - 42 2779

TGME49_chr 118846 ID=TGME49_318610;length=36

IV 0 1192145 + 85 2780

TGME49_chr 311862 ID=TGME49_258870;length=38

Vllb 3 3122518 - 95 2781

TGME49_chr 433889 4340907 + ID=TGME49_271830;length=20 2782 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 5 12

TGME49_chr ID=TGME49_307015 ;length= 12

XI 120398 121691 + 93 2783

TGME49_chr 465223 ID=TGME49_255880;length=31

Vllb 0 4655422 + 92 2784

TGME49_chr 222975 ID=TGME49_232680;length=23

VIII 3 2232098 - 45 2785

TGME49_chr 320194 ID=TGME49_247250;length=39

XII 6 3205868 - 22 2786

TGME49_chr 199532 ID=TGME49_232290;length=l 1

VIII 5 1996498 + 73 2787

TGME49_chr 234949 ID=TGME49_245530;length=24

XII 5 2351982 + 87 2788

TGME49_chr 529682 ID=TGME49_270277;length=20

VIII 0 5298844 - 24 2789

TGME49_chr 647409 ID=TGME49_277860;length=21

XII 1 6476198 - 07 2790

TGME49_chr 133178 ID=TGME49_310420;length=21

XI 5 1333925 - 40 2791

TGME49_chr 660659 ID=TGME49_214950;length=45

X 9 6611152 + 53 2792

TGME49_chr 183802 ID=TGME49_321400;length=25

lb 4 1840558 + 34 2793

TGME49_chr 186767 ID=TGME49_226020;length=42

X 5 1871933 - 58 2794

TGME49_chr 681217 ID=TGME49_200350;length=38

VIII 0 6815991 + 21 2795

TGME49_chr 479739 ID=TGME49_249780;length=19

XII 5 4799343 - 48 2796

TGME49_chr 510689 ID=TGME49_270640;length=32

VIII 4 5110114 + 20 2797

TGME49_chr 185692 ID=TGME49_311190;length=28

XI 9 1859770 - 41 2798

TGME49_chr 342913 ID=TGME49_202320;length= 17

Vila 0 3430859 - 29 2799

TGME49_chr ID=TGME49_264100;length=37

Vllb 181366 185076 + 10 2800

TGME49_chr 364658 ID=TGME49_258020;length=20

Vllb 2 3648630 - 48 2801

TGME49_chr 279844 ID=TGME49_284160;length= 15

V 8 2800018 + 70 2802

TGME49_chr 543018 ID=TGME49_251500;length= 13

XII 1 5431564 + 83 2803

TGME49_chr ID=TGME49_294190;length= 16

la 684229 685834 + 05 2804

TGME49_chr 451479 ID=TGME49_234670;length=28

X 9 4517679 + 80 2805 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_207380;length=22

lb 19922 22201 - 79 2806

TGME49_chr 283785 ID=TGME49_246540;length=23

XII 2 2840190 + 38 2807

TGME49_chr 325989 ID=TGME49_202520;length=27

Vila 6 3262598 + 02 2808

TGME49_chr 266552 ID=TGME49_312400;length= 12

XI 5 2666765 - 40 2809

TGME49_chr ID=TGME49_221450;length=20

II 355180 357222 + 42 2810

TGME49_chr 243323 ID=TGME49_225190;length=24

X 0 2435647 + 17 2811

TGME49_chr ID=TGME49_229150;length=23

VIII 57748 60111 - 63 2812

TGME49_chr 488564 ID=TGME49_255340;length=28

Vllb 3 4888514 + 71 2813

TGME49_chr ID=TGME49_212955;length=l 1

V 725744 726888 + 44 2814

TGME49_chr 158074 ID=TGME49_240940;length=67

VI 4 1581416 - 2 2815

TGME49_chr ID=TGME49_230640;length= 19

VIII 928298 930205 - 07 2816

TGME49_chr 283274 ID=TGME49_246530;length=20

XII 2 2834747 + 05 2817

TGME49_chr ID=TGME49_229160;length=26

VIII 56761 59366 + 05 2818

TGME49_chr 366672 ID=TGME49_290260;length=22

IX 9 3669016 + 87 2819

TGME49_chr 637409 ID=TGME49_268660;length=22

VIII 4 6376344 - 50 2820

TGME49_chr 169211 ID=TGME49_211480;length=21

IV 0 1694217 - 07 2821

TGME49_chr 227297 ID=TGME49_260440;length=23

Vllb 4 2275367 - 93 2822

TGME49_chr 488029 ID=TGME49_270910;length=23

VIII 8 4882600 + 02 2823

TGME49_chr ID=TGME49_219300;length= 19

XII 985332 987309 + 77 2824

TGME49_chr 543004 ID=TGME49_236570;length=19

X 1 5431958 + 17 2825

TGME49_chr 184782 ID=TGME49_204360;length=26

Vila 3 1850467 + 44 2826

TGME49_chr 374368 ID=TGME49_248225 ;length=42

XII 0 3747903 + 23 2827

TGME49_chr 378378 ID=TGME49_223485 ;length= 16

X 1 3785405 + 24 2828

TGME49_chr 427024 4271815 - ID=TGME49_314750;length= 15 2829 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XI 3 72

TGME49_chr 107854 ID=TGME49_227115 ;length=22

X 3 1080826 + 83 2830

TGME49_chr 465961 ID=TGME49_249575 ;length=26

XII 9 4662247 - 28 2831

TGME49_chr 202396 ID=TGME49_260825 ;length=26

Vllb 4 2026656 - 92 2832

TGME49_chr 463941 ID=TGME49_292245 ;length=24

IX 8 4641832 + 14 2833

TGME49_chr 579808 ID=TGME49_269430;length=34

VIII 6 5801508 + 22 2834

TGME49_chr 508089 ID=TGME49_235730;length=23

X 2 5083213 - 21 2835

TGME49_chr ID=TGME49_279310;length= 12

IX 283315 284597 - 82 2836

TGME49_chr ID=TGME49_219485 ;length=46

XII 919752 924441 - 89 2837

TGME49_chr 254492 ID=TGME49_288450;length=31

IX 1 2548040 + 19 2838

TGME49_chr 534332 ID=TGME49_316480;length=23

XI 3 5345659 - 36 2839

TGME49_chr ID=TGME49_238410;length=27

VI 207532 210236 + 04 2840

TGME49_chr ID=TGME49_306880;length=20

XI 216523 218588 + 65 2841

TGME49_chr ID=TGME49_309130;length=35

XI 513332 516847 + 15 2842

TGME49_chr 535194 ID=TGME49_305310;length= 17

IX 5 5353735 - 90 2843

TGME49_chr 126094 ID=TGME49_266260;length=24

IX 0 1263377 + 37 2844

TGME49_chr 677308 ID=TGME49_200290;length=24

VIII 5 6775554 + 69 2845

TGME49_chr 185931 ID=TGME49_297520;length=29

II 3 1862244 - 31 2846

TGME49_chr 422173 ID=TGME49_257090;length=26

Vllb 7 4224344 + 07 2847

TGME49_chr 207469 ID=TGME49_232400;length= 18

VIII 4 2076558 - 64 2848

TGME49_chr 518456 ID=TGME49_235920;length=27

X 5 5187267 - 02 2849

TGME49_chr 386221 ID=TGME49_257730;length= 17

Vllb 2 3863945 + 33 2850

TGME49_chr 393562 ID=TGME49_290920;length=30

IX 3 3938663 + 40 2851

TGME49_chr ID=TGME49_208330;length=26

lb 609246 611891 + 45 2852 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 556909 ID=TGME49_236920;length=30

X 4 5572139 + 45 2853

TGME49_chr 455122 ID=TGME49_256040;length=38

Vllb 7 4555100 - 73 2854

TGME49_chr ID=TGME49_227890;length=19

X 753755 755679 - 24 2855

TGME49_chr ID=TGME49_239475 ;length=29

VI 686164 689115 - 51 2856

TGME49_chr 394134 ID=TGME49_314240;length=44

XI 0 3945803 + 63 2857

TGME49_chr 317583 ID=TGME49_258810;length=45

Vllb 2 3180410 - 78 2858

TGME49_chr 408176 ID=TGME49_272300;length=l 1

VIII 9 4082967 + 98 2859

TGME49_chr 153505 ID=TGME49_261780;length=36

Vllb 7 1538674 - 17 2860

TGME49_chr 334541 ID=TGME49_313400;length= 18

XI 2 3347300 - 88 2861

TGME49_chr 223907 ID=TGME49_232730;length=20

VIII 8 2241114 + 36 2862

TGME49_chr 398543 ID=TGME49_212190;length=22

X 0 3987663 + 33 2863

TGME49_chr 626186 ID=TGME49_278160;length= 19

XII 1 6263784 + 23 2864

TGME49_chr 113413 ID=TGME49_227015 ;length=24

X 3 1136564 + 31 2865

TGME49_chr 492738 ID=TGME49_210255 ;length= 19

IX 0 4929331 - 51 2866

TGME49_chr ID=TGME49_229970;length=31

VIII 494173 497365 - 92 2867

TGME49_chr 460367 ID=TGME49_315250;length= 14

XI 4 4605166 - 92 2868

TGME49_chr ID=TGME49_280465 ;length=49

Vila 385701 390631 - 30 2869

TGME49_chr 518667 ID=TGME49_316210;length= 17

XI 4 5188422 + 48 2870

TGME49_chr 272113 ID=TGME49_233610;length=48

VIII 8 2725943 - 05 2871

TGME49_chr 307645 ID=TGME49_224340;length= 19

X 0 3078386 - 36 2872

TGME49_chr 107071 ID=TGME49_230880;length=41

VIII 4 1074839 + 25 2873

TGME49_chr 612081 ID=TGME49_216480;length= 16

XI 0 6122479 + 69 2874

TGME49_chr 126835 ID=TGME49_310330;length= 19

XI 0 1270276 - 26 2875

TGME49_chr 374985 3752478 + ID=TGME49_272740;length=26 2876 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 6 22

TGME49_chr 147998 ID=TGME49_226620;length= 19

X 8 1481931 + 43 2877

TGME49_chr 588699 ID=TGME49_216780;length=22

XI 2 5889236 + 44 2878

TGME49_chr ID=TGME49_227420;length=21

X 952154 954291 - 37 2879

TGME49_chr 102891 ID=TGME49_206390;length=47

Vila 1 1033633 - 22 2880

TGME49_chr 447427 ID=TGME49_234570;length=28

X 7 4477145 - 68 2881

TGME49_chr 140470 ID=TGME49_231410;length= 18

VIII 1 1406518 + 17 2882

TGME49_chr ID=TGME49_220430;length=27

V 374249 377042 - 93 2883

TGME49_chr 336095 ID=TGME49_224060;length=34

X 8 3364401 - 43 2884

TGME49_chr 345449 ID=TGME49_247648 ;length=20

XII 0 3456588 + 98 2885

TGME49_chr 249854 ID=TGME49_243280;length= 15

VI 9 2500123 + 74 2886

TGME49_chr 244023 ID=TGME49_312130;length=43

XI 0 2444604 + 74 2887

TGME49_chr ID=TGME49_267430;length=38

IX 652699 656532 + 33 2888

TGME49_chr 551685 ID=TGME49_269930;length=36

VIII 8 5520480 + 22 2889

TGME49_chr 513069 ID=TGME49_250700;length=32

XII 0 5133986 + 96 2890

TGME49_chr 694315 ID=TGME49_215410;length= 18

X 9 6945000 + 41 2891

TGME49_chr 208699 ID=TGME49_210760;length=24

IV 8 2089481 + 83 2892

TGME49_chr 532300 ID=TGME49_305270;length=32

IX 5 5326221 + 16 2893

TGME49_chr 623527 ID=TGME49_268870;length=41

VIII 6 6239397 + 21 2894

TGME49_chr 657188 ID=TGME49_268290;length=18

VIII 4 6573723 - 39 2895

TGME49_chr 706425 ID=TGME49_215650;length=28

X 9 7067143 + 84 2896

TGME49_chr ID=TGME49_263190;length= 15

Vllb 736866 738425 - 59 2897

TGME49_chr 331808 ID=TGME49_224120;length= 15

X 0 3319616 - 36 2898

TGME49_chr 111356 ID=TGME49_318738 ;length= 14

IV 8 1114995 - 27 2899 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 341797 ID=TGME49_223970;length= 16

X 1 3419580 + 09 2900

TGME49_chr 164314 ID=TGME49_297270;length=29

II 0 1646065 + 25 2901

TGME49_chr 224195 ID=TGME49_264430;length=45

IX 0 2246458 + 08 2902

TGME49_chr ID=TGME49_222130;length= 19

II 892236 894195 + 59 2903

TGME49_chr ID=TGME49_222040;length=35

II 775384 778953 + 69 2904

TGME49_chr 623600 ID=TGME49_268880;length=40

VIII 0 6240061 - 61 2905

TGME49_chr 148751 ID=TGME49_310630;length=24

XI 6 1489916 + 00 2906

TGME49_chr 448029 ID=TGME49_315120;length=20

XI 7 4482368 + 71 2907

TGME49_chr ID=TGME49_212860;length=27

V 651209 653981 + 72 2908

TGME49_chr 563461 ID=TGME49_251860;length=21

XII 5 5636719 - 04 2909

TGME49_chr 582670 ID=TGME49_306195 ;length=25

IX 9 5829287 - 78 2910

TGME49_chr 402561 ID=TGME49_257470;length=50

Vllb 1 4030642 + 31 2911

TGME49_chr ID=TGME49_208490;length= 17

lb 726992 728777 - 85 2912

TGME49_chr 363982 ID=TGME49_258030;length=34

Vllb 3 3643240 - 17 2913

TGME49_chr ID=TGME49_228720;length= 18

X 183654 185507 + 53 2914

TGME49_chr 110441 ID=TGME49_227060;length= 17

X 8 1106139 + 21 2915

TGME49_chr ID=TGME49_229450;length=20

VIII 265034 267090 - 56 2916

TGME49_chr 268528 ID=TGME49_246160;length=34

XII 3 2688777 + 94 2917

TGME49_chr 149346 ID=TGME49_218500;length=26

XII 9 1496095 - 26 2918

TGME49_chr ID=TGME49_228470;length= 17

X 293471 295200 + 29 2919

TGME49_chr 114868 ID=TGME49_294805 ;length=23

la 1 1150982 + 01 2920

TGME49_chr 514938 ID=TGME49_250720;length= 14

XII 8 5150805 - 17 2921

TGME49_chr 246530 ID=TGME49_301420;length=25

IV 2 2467886 + 84 2922

TGME49_chr 902188 906938 + ID=TGME49_239780;length=47 2923 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VI 50

TGME49_chr ID=TGME49_263505;length=37

Vllb 511337 515116 + 79 2924

TGME49_chr ID=TGME49_292960;length=22

la 50386 52683 - 97 2925

TGME49_chr 141039 ID=TGME49_209710;length= 19

lb 8 1412319 - 21 2926

TGME49_chr ID=TGME49_220910;length=25

II 60741 63309 - 68 2927

TGME49_chr 250297 ID=TGME49_312210;length= 11

XI 0 2504148 + 78 2928

TGME49_chr ID=TGME49_266760;length=41

IX 979074 983263 - 89 2929

TGME49_chr 580027 ID=TGME49_269438 ;length= 15

VIII 7 5801876 - 99 2930

TGME49_chr 643499 ID=TGME49_216053 ;length=21

XI 2 6437143 - 51 2931

TGME49_chr 128101 ID=TGME49_218810;length=24

XII 7 1283487 + 70 2932

TGME49_chr 663241 ID=TGME49_214970;length=39

X 9 6636375 + 56 2933

TGME49_chr 222418 ID=TGME49_298050;length= 15

II 2 2225712 + 30 2934

TGME49_chr 125252 ID=TGME49_218850;length= 16

XII 0 1254167 - 47 2935

TGME49_chr ID=TGME49_306900;length= 17

XI 201833 203587 + 54 2936

TGME49_chr 193775 ID=TGME49_297720;length=23

II 1 1940101 + 50 2937

TGME49_chr 164349 ID=TGME49_310880;length= 15

XI 4 1645015 - 21 2938

TGME49_chr 162600 ID=TGME49_295662;length=28

la 0 1628841 - 41 2939

TGME49_chr ID=TGME49_208380;length=32

lb 643892 647186 - 94 2940

TGME49_chr 649593 ID=TGME49_214783 ;length= 15

X 9 6497510 - 71 2941

TGME49_chr 332429 ID=TGME49_224090;length=27

X 1 3327072 + 81 2942

TGME49_chr 101066 ID=TGME49_222280;length= 18

II 6 1012466 - 00 2943

TGME49_chr 237763 ID=TGME49_301300;length= 18

IV 8 2379502 + 64 2944

TGME49_chr ID=TGME49_319500;length= 17

IV 953527 955274 + 47 2945

TGME49_chr 676741 ID=TGME49_215110;length=23

X 5 6769751 - 36 2946 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 202927 ID=TGME49_260820;length=25

Vllb 8 2031836 - 58 2947

TGME49_chr ID=TGME49_279340;length= 17

IX 254786 256497 - 11 2948

TGME49_chr 175694 ID=TGME49_217340;length=26

XII 1 1759557 + 16 2949

TGME49_chr 412126 ID=TGME49_314480;length= 17

XI 8 4122993 + 25 2950

TGME49_chr 384719 ID=TGME49_290700;length=29

IX 8 3850189 + 91 2951

TGME49_chr 519976 ID=TGME49_305050;length=38

IX 6 5203644 + 78 2952

TGME49_chr ID=TGME49_206440;length=26

Vila 969639 972289 + 50 2953

TGME49_chr ID=TGME49_207590;length=21

lb 140290 142474 + 84 2954

TGME49_chr 110916 ID=TGME49_209190;length=41

lb 4 1113311 - 47 2955

TGME49_chr 237416 ID=TGME49_225250;length=20

X 5 2376239 + 74 2956

TGME49_chr 234504 ID=TGME49_242840;length=20

VI 4 2347093 + 49 2957

TGME49_chr 161378 ID=TGME49_265320;length=47

IX 3 1618496 + 13 2958

TGME49_chr 399440 ID=TGME49_314320;length=44

XI 5 3998830 + 25 2959

TGME49_chr 168635 ID=TGME49_286760;length=36

V 8 1689982 - 24 2960

TGME49_chr 290748 ID=TGME49_312690;length=42

XI 2 2911745 - 63 2961

TGME49_chr ID=TGME49_239748 ;length=42

VI 883138 887371 + 33 2962

TGME49_chr 218330 ID=TGME49_225520;length=41

X 0 2187485 + 85 2963

TGME49_chr 557472 ID=TGME49_316770;length=23

XI 3 5577030 + 07 2964

TGME49_chr ID=TGME49_221665 ;length= 16

II 531485 533086 + 01 2965

TGME49_chr 351021 ID=TGME49_289900;length= 19

IX 8 3512191 + 73 2966

TGME49_chr 621494 ID=TGME49_214290;length=21

X 5 6217062 + 17 2967

TGME49_chr 298361 ID=TGME49_259080;length=l 1

Vllb 8 2984755 - 37 2968

TGME49_chr ID=TGME49_264140;length=34

Vllb 157650 161052 - 02 2969

TGME49_chr 353508 355474 - ID=TGME49_299780;length=19 2970 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XII 66

TGME49_chr 319333 ID=TGME49_224235 ;length=38

X 6 3197198 + 62 2971

TGME49_chr ID=TGME49_299670;length=23

XII 351854 354229 + 75 2972

TGME49_chr 469174 ID=TGME49_255730;length=43

Vllb 2 4696076 + 34 2973

TGME49_chr 231475 ID=TGME49_232980;length=18

VIII 0 2316635 - 85 2974

TGME49_chr 105374 ID=TGME49_209112;length= 10

lb 7 1054786 - 39 2975

TGME49_chr 127108 ID=TGME49_240410;length=43

VI 8 1275407 - 19 2976

TGME49_chr 191375 ID=TGME49_264900;length=22

IX 0 1916033 - 83 2977

TGME49_chr ID=TGME49_227980;length=30

X 691619 694702 + 83 2978

TGME49_chr 562904 ID=TGME49_269740;length= 18

VIII 8 5630894 + 46 2979

TGME49_chr 341028 ID=TGME49_258458 ;length=40

Vllb 9 3414291 - 02 2980

TGME49_chr 643003 ID=TGME49_268570;length=22

VIII 0 6432264 - 34 2981

TGME49_chr 122113 ID=TGME49_318570;length=26

IV 2 1223825 - 93 2982

TGME49_chr 213715 ID=TGME49_232520;length=44

VIII 6 2141623 - 67 2983

TGME49_chr ID=TGME49_208970;length=48

lb 928521 933382 - 61 2984

TGME49_chr ID=TGME49_280790;length= 16

Vila 89778 91426 - 48 2985

TGME49_chr 280024 ID=TGME49_288910;length=20

IX 0 2802305 + 65 2986

TGME49_chr ID=TGME49_212760;length=31

V 584166 587344 - 78 2987

TGME49_chr 560227 ID=TGME49_269770;length=23

VIII 3 5604583 + 10 2988

TGME49_chr 442874 ID=TGME49_234507;length=l 1

X 9 4429928 + 79 2989

TGME49_chr 119838 ID=TGME49_218950;length= 18

XII 2 1200277 - 95 2990

TGME49_chr ID=TGME49_228620;length=42

X 274179 278462 - 83 2991

TGME49_chr 117776 ID=TGME49_318620;length=20

IV 0 1179793 + 33 2992

TGME49_chr ID=TGME49_293770;length=26

la 547592 550266 + 74 2993 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 651808 ID=TGME49_215940;length= 18

XI 5 6519956 - 71 2994

TGME49_chr 140023 ID=TGME49_240710;length= 18

VI 6 1402037 + 01 2995

TGME49_chr 370584 ID=TGME49_248150;length= 18

XII 1 3707649 - 08 2996

TGME49_chr 686896 ID=TGME49_215260;length=33

X 5 6872314 - 49 2997

TGME49_chr 498008 ID=TGME49_307570;length=32

IX 5 4983371 - 86 2998

TGME49_chr 624176 ID=TGME49_306650;length= 19

IX 2 6243674 - 12 2999

TGME49_chr 701460 ID=TGME49_215560;length=28

X 9 7017444 + 35 3000

TGME49_chr 368686 ID=TGME49_313770;length=27

XI 0 3689632 - 72 3001

TGME49_chr 263222 ID=TGME49_288600;length=45

IX 0 2636771 - 51 3002

TGME49_chr ID=TGME49_220950;length=26

II 83969 86643 + 74 3003

TGME49_chr 448383 ID=TGME49_256700;length=47

Vllb 2 4488559 + 27 3004

TGME49_chr ID=TGME49_262920;length=24

Vllb 920403 922803 + 00 3005

TGME49_chr 372686 ID=TGME49_290340;length= 17

IX 3 3728579 - 16 3006

TGME49_chr 226551 ID=TGME49_301000;length=24

IV 3 2268004 - 91 3007

TGME49_chr 461744 ID=TGME49_271270;length=24

VIII 2 4619870 - 28 3008

TGME49_chr 491618 ID=TGME49_270880;length=24

VIII 4 4918619 - 35 3009

TGME49_chr ID=TGME49_309180;length=39

XI 541082 545032 - 50 3010

TGME49_chr 142117 ID=TGME49_240730;length=21

VI 0 1423356 - 86 3011

TGME49_chr 531123 ID=TGME49_236190;length= 15

X 5 5312787 + 52 3012

TGME49_chr 193409 ID=TGME49_204270;length=20

Vila 0 1936122 - 32 3013

TGME49_chr 368992 ID=TGME49_223620;length=26

X 1 3692588 + 67 3014

TGME49_chr ID=TGME49_319920;length=27

IV 666975 669700 - 25 3015

TGME49_chr 270231 ID=TGME49_233540;length=25

VIII 7 2704841 + 24 3016

TGME49_chr 423031 4232511 - ID=TGME49_248860;length=21 3017 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XII 4 97

TGME49_chr 267568 ID=TGME49_312430;length=28

XI 5 2678516 + 31 3018

TGME49_chr 424232 ID=TGME49_248880;length= 13

XII 7 4243685 - 58 3019

TGME49_chr 347864 ID=TGME49_202255;length=18

Vila 9 3480506 - 57 3020

TGME49_chr ID=TGME49_300220;length=31

XII 157841 160945 - 04 3021

TGME49_chr 185890 ID=TGME49_264990;length=31

IX 8 1862051 + 43 3022

TGME49_chr 112677 ID=TGME49_318720;length= 19

IV 4 1128732 - 58 3023

TGME49_chr ID=TGME49_263750;length=47

Vllb 342230 347024 - 94 3024

TGME49_chr 343171 ID=TGME49_202300;length=28

Vila 6 3434599 + 83 3025

TGME49_chr 336402 ID=TGME49_289640;length=20

IX 9 3366093 - 64 3026

TGME49_chr 184611 ID=TGME49_211300;length= 13

IV 8 1847509 + 91 3027

TGME49_chr 193932 ID=TGME49_242290;length=26

VI 1 1941969 + 48 3028

TGME49_chr 183802 ID=TGME49_321390;length=37

lb 4 1841751 + 27 3029

TGME49_chr 302544 ID=TGME49_246930;length=28

XII 3 3028262 - 19 3030

TGME49_chr ID=TGME49_209005 ;length=21

lb 982520 984651 - 31 3031

TGME49_chr 544001 ID=TGME49_305510;length=45

IX 4 5444552 + 38 3032

TGME49_chr 178563 ID=TGME49_321460;length= 17

lb 4 1787380 - 46 3033

TGME49_chr 126952 ID=TGME49_266150;length=23

IX 9 1271869 - 40 3034

TGME49_chr ID=TGME49_320680;length=45

IV 136530 141115 + 85 3035

TGME49_chr 212914 ID=TGME49_297940;length=24

II 7 2131563 - 16 3036

TGME49_chr 531170 ID=TGME49_236160;length=21

X 6 5313875 - 69 3037

TGME49_chr 117514 ID=TGME49_266390;length= 17

IX 2 1176935 - 93 3038

TGME49_chr 327042 ID=TGME49_247340;length=22

XII 8 3272692 + 64 3039

TGME49_chr 438652 ID=TGME49_282110;length=40

Vila 3 4390549 - 26 3040 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 571254 ID=TGME49_217020;length= 18

XI 2 5714427 + 85 3041

TGME49_chr ID=TGME49_207990;length=26

lb 508930 511554 - 24 3042

TGME49_chr ID=TGME49_320090;length=49

IV 509090 514052 + 62 3043

TGME49_chr 232359 ID=TGME49_245500;length=20

XII 6 2325670 - 74 3044

TGME49_chr 183879 ID=TGME49_297500;length= 19

II 3 1840776 - 83 3045

TGME49_chr 184807 ID=TGME49_295310;length= 17

la 6 1849826 + 50 3046

TGME49_chr ID=TGME49_280750;length=50

Vila 118366 123372 + 06 3047

TGME49_chr 232361 ID=TGME49_301222;length= 13

IV 9 2324963 + 44 3048

TGME49_chr 252580 ID=TGME49_260010;length= 17

Vllb 3 2527506 + 03 3049

TGME49_chr 260478 ID=TGME49_233340;length=22

VIII 9 2607085 - 96 3050

TGME49_chr ID=TGME49_263327;length=33

Vllb 629441 632828 - 87 3051

TGME49_chr ID=TGME49_239490;length=38

VI 718257 722058 - 01 3052

TGME49_chr 121917 ID=TGME49_231060;length=50

VIII 1 1224214 - 43 3053

TGME49_chr 736648 ID=TGME49_207060;length=26

X 5 7369133 - 48 3054

TGME49_chr ID=TGME49_267840;length=21

IX 326465 328648 - 83 3055

TGME49_chr 340084 ID=TGME49_258470;length=37

Vllb 1 3404561 + 20 3056

TGME49_chr 367401 ID=TGME49_273075 ;length=32

VIII 4 3677250 + 36 3057

TGME49_chr 212133 ID=TGME49_203940;length=39

Vila 9 2125267 - 28 3058

TGME49_chr 219708 ID=TGME49_217875 ;length=26

XII 1 2199723 + 42 3059

TGME49_chr ID=TGME49_309230;length= 19

XI 569487 571481 + 94 3060

TGME49_chr ID=TGME49_228040;length=22

X 660185 662395 - 10 3061

TGME49_chr 229408 ID=TGME49_232955 ;length= 16

VIII 9 2295780 - 91 3062

TGME49_chr 547668 ID=TGME49_251590;length=21

XII 5 5478806 - 21 3063

TGME49_chr 633161 636374 + ID=TGME49_208370;length=32 3064 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO: lb 13

TGME49_chr 626259 ID=TGME49_278170;length= 16

XII 2 6264233 - 41 3065

TGME49_chr 625932 ID=TGME49_278180;length= 15

XII 7 6260911 - 84 3066

TGME49_chr 284829 ID=TGME49_259230;length=21

Vllb 9 2850481 + 82 3067

TGME49_chr 176800 ID=TGME49_211385;length=32

IV 3 1771226 + 23 3068

TGME49_chr 172065 ID=TGME49_231994;length=28

VIII 5 1723473 - 18 3069

TGME49_chr 408007 ID=TGME49_291040;length=20

IX 8 4082079 + 01 3070

TGME49_chr 131741 ID=TGME49_240540;length= 18

VI 4 1319297 - 83 3071

TGME49_chr 165286 ID=TGME49_286810;length=19

V 5 1654841 + 76 3072

TGME49_chr 325161 ID=TGME49_258700;length=20

Vllb 9 3253665 - 46 3073

TGME49_chr 137035 ID=TGME49_262110;length=21

Vllb 4 1372539 + 85 3074

TGME49_chr 144950 ID=TGME49_261970;length= 17

Vllb 9 1451279 + 70 3075

TGME49_chr ID=TGME49_309730;length=23

XI 781501 783875 - 74 3076

TGME49_chr 164934 ID=TGME49_261580;length=48

Vllb 6 1654154 - 08 3077

TGME49_chr ID=TGME49_229340;length= 17

VIII 181487 183193 - 06 3078

TGME49_chr 291164 ID=TGME49_224650;length= 17

X 9 2913447 - 98 3079

TGME49_chr ID=TGME49_207810;length= 14

lb 306500 307968 + 68 3080

TGME49_chr ID=TGME49_219440;length= 16

XII 951504 953111 - 07 3081

TGME49_chr 692138 ID=TGME49_215360;length=35

X 4 6924980 + 96 3082

TGME49_chr ID=TGME49_293510;length=40

la 387758 391767 - 09 3083

TGME49_chr 272936 ID=TGME49_259650;length=22

Vllb 4 2731606 + 42 3084

TGME49_chr ID=TGME49_228100;length=29

X 592859 595829 - 70 3085

TGME49_chr 702993 ID=TGME49_215580;length=22

X 9 7032199 - 60 3086

TGME49_chr 119160 ID=TGME49_209290;length= 14

lb 1 1193002 - 01 3087 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 302726 ID=TGME49_224500;length=25

X 8 3029865 - 97 3088

TGME49_chr 689773 ID=TGME49_276830;length=23

XII 2 6900123 + 91 3089

TGME49_chr 680980 ID=TGME49_276980;length=35

XII 2 6813303 + 01 3090

TGME49_chr 185135 ID=TGME49_286410;length=21

V 0 1853522 - 72 3091

TGME49_chr 494616 ID=TGME49_315760;length=45

XI 4 4950718 + 54 3092

TGME49_chr 199645 ID=TGME49_210960;length=44

IV 3 2000862 + 09 3093

TGME49_chr 708747 ID=TGME49_215690;length= 18

X 4 7089312 + 38 3094

TGME49_chr 228963 ID=TGME49_311780;length=17

XI 7 2291430 - 93 3095

TGME49_chr 262734 ID=TGME49_233400;length=19

VIII 1 2629242 - 01 3096

TGME49_chr 683642 ID=TGME49_215250;length= 18

X 9 6838321 + 92 3097

TGME49_chr 540048 ID=TGME49_236530;length=27

X 1 5403238 + 57 3098

TGME49_chr 293862 ID=TGME49_283820;length=24

V 7 2941078 + 51 3099

TGME49_chr 181056 ID=TGME49_204400;length=47

Vila 3 1815332 + 69 3100

TGME49_chr 574770 ID=TGME49_278930;length=36

XII 6 5751318 + 12 3101

TGME49_chr 616826 ID=TGME49_278290;length=34

XII 5 6171686 - 21 3102

TGME49_chr 134446 ID=TGME49_213800;length=30

V 0 1347471 - 11 3103

TGME49_chr ID=TGME49_220890;length=34

II 41471 44914 - 43 3104

TGME49_chr ID=TGME49_208430;length=20

lb 709441 711471 - 30 3105

TGME49_chr 563313 ID=TGME49_251870;length= 16

XII 1 5634774 + 43 3106

TGME49_chr 136419 ID=TGME49_205500;length=21

Vila 9 1366359 + 60 3107

TGME49_chr 220452 ID=TGME49_260490;length=23

Vllb 7 2206884 + 57 3108

TGME49_chr 437403 ID=TGME49_234420;length=21

X 2 4376202 - 70 3109

TGME49_chr 500876 ID=TGME49_270770;length=27

VIII 0 5011510 - 50 3110

TGME49_chr 123722 1239000 + ID=TGME49_213690;length= 17 3111 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

V 6 74

TGME49_chr ID=TGME49_213360;length=36

V 955818 959487 - 69 3112

TGME49_chr 220681 ID=TGME49_232655;length=23

VIII 8 2209210 + 92 3113

TGME49_chr 363568 ID=TGME49_223725 ;length=25

X 0 3638226 - 46 3114

TGME49_chr 173458 ID=TGME49_241830;length=46

VI 1 1739230 - 49 3115

TGME49_chr 183247 ID=TGME49_211320;length=34

IV 6 1835918 - 42 3116

TGME49_chr 119702 ID=TGME49_226960;length=37

X 6 1200743 - 17 3117

TGME49_chr 560329 ID=TGME49_269775 ;length=26

VIII 8 5605973 - 75 3118

TGME49_chr 358769 ID=TGME49_258090;length=23

Vllb 3 3590070 - 77 3119

TGME49_chr 630533 ID=TGME49_214470;length=22

X 7 6307575 - 38 3120

TGME49_chr 238950 ID=TGME49_225230;length=44

X 0 2393945 - 45 3121

TGME49_chr ID=TGME49_227380;length=49

X 968991 973953 + 62 3122

TGME49_chr 119430 ID=TGME49_222700;length=40

II 0 1198358 + 58 3123

TGME49_chr 559432 ID=TGME49_251800;length=21

XII 6 5596456 - 30 3124

TGME49_chr 270021 ID=TGME49_284590;length=26

V 4 2702822 - 08 3125

TGME49_chr 638591 ID=TGME49_268630;length=15

VIII 9 6387483 + 64 3126

TGME49_chr 254215 ID=TGME49_203370;length=27

Vila 9 2544885 - 26 3127

TGME49_chr 495497 ID=TGME49_210230;length=23

IX 4 4957346 - 72 3128

TGME49_chr 101131 ID=TGME49_294620;length=46

la 2 1015928 - 16 3129

TGME49_chr 331933 ID=TGME49_273690;length=39

VIII 3 3323273 - 40 3130

TGME49_chr 108975 ID=TGME49_294710;length=40

la 5 1093766 - 11 3131

TGME49_chr ID=TGME49_221905 ;length= 13

II 683949 685295 + 46 3132

TGME49_chr 534422 ID=TGME49_236230;length=65

X 4 5344882 - 8 3133

TGME49_chr 613714 ID=TGME49_269015 ;length=44

VIII 1 6141591 - 50 3134 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 385503 ID=TGME49_272593 ;length=69

VIII 6 3855734 + 8 3135

TGME49_chr 249899 ID=TGME49_243265 ;length= 18

VI 0 2500814 - 24 3136

TGME49_chr 138391 ID=TGME49_240680;length=48

VI 6 1388756 - 40 3137

TGME49_chr 182675 ID=TGME49_226072;length= 12

X 1 1828019 - 68 3138

TGME49_chr 426360 ID=TGME49_248940;length=21

XII 5 4265733 - 28 3139

TGME49_chr 483370 ID=TGME49_235390;length=15

X 4 4835258 - 54 3140

TGME49_chr 549273 ID=TGME49_305600;length=27

IX 4 5495529 + 95 3141

TGME49_chr 159720 ID=TGME49_211700;length= 19

IV 7 1599157 - 50 3142

TGME49_chr 220039 ID=TGME49_260510;length= 12

Vllb 4 2201672 - 78 3143

TGME49_chr 649193 ID=TGME49_214790;length=45

X 9 6496460 + 21 3144

TGME49_chr ID=TGME49_219450;length=22

XII 929521 931802 + 81 3145

TGME49_chr 147419 ID=TGME49_218540;length= 16

XII 7 1475820 - 23 3146

TGME49_chr 240686 ID=TGME49_225220;length=29

X 1 2409830 - 69 3147

TGME49_chr 165816 ID=TGME49_226410;length=25

X 6 1660752 - 86 3148

TGME49_chr 211770 ID=TGME49_210730;length=43

IV 2 2122018 - 16 3149

TGME49_chr 232235 ID=TGME49_242820;length=43

VI 9 2326694 - 35 3150

TGME49_chr 672039 ID=TGME49_215050;length= 15

X 0 6721912 - 22 3151

TGME49_chr 333033 ID=TGME49_247410;length=24

XII 6 3332740 - 04 3152

TGME49_chr 479982 ID=TGME49_235360;length=15

X 8 4801328 + 00 3153

TGME49_chr 108020 ID=TGME49_227130;length= 12

X 6 1081448 - 42 3154

TGME49_chr 152903 ID=TGME49_218420;length=28

XII 4 1531895 - 61 3155

TGME49_chr 476195 ID=TGME49_315510;length=24

XI 6 4764431 + 75 3156

TGME49_chr 450638 ID=TGME49_249390;length=32

XII 2 4509664 + 82 3157

TGME49_chr 153901 1541043 + ID=TGME49_226550;length=20 3158 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

X 8 25

TGME49_chr ID=TGME49_228000;length= 18

X 676709 678567 + 58 3159

TGME49_chr 604041 ID=TGME49_278518;length=38

XII 0 6044218 + 08 3160

TGME49_chr 205346 ID=TGME49_204000;length=27

Vila 7 2056236 - 69 3161

TGME49_chr ID=TGME49_267400;length= 16

IX 674942 676609 - 67 3162

TGME49_chr 117425 ID=TGME49_213620;length=50

V 1 1179266 + 15 3163

TGME49_chr 107448 ID=TGME49_209140;length=27

lb 6 1077281 - 95 3164

TGME49_chr 347581 ID=TGME49_313560;length= 18

XI 6 3477674 - 58 3165

TGME49_chr ID=TGME49_308840;length= 13

XI 293500 294822 + 22 3166

TGME49_chr 255757 ID=TGME49_312270;length=42

XI 1 2561791 - 20 3167

TGME49_chr 453196 ID=TGME49_271375 ;length= 18

VIII 0 4533762 + 02 3168

TGME49_chr 387515 ID=TGME49_314000;length= 19

XI 3 3877145 - 92 3169

TGME49_chr ID=TGME49_264050;length= 19

Vllb 214816 216778 + 62 3170

TGME49_chr 348881 ID=TGME49_247690;length=23

XII 0 3491174 + 64 3171

TGME49_chr ID=TGME49_306960;length=23

XI 156674 158998 - 24 3172

TGME49_chr 129514 ID=TGME49_240490;length=24

VI 7 1297637 - 90 3173

TGME49_chr 534295 ID=TGME49_316490;length=21

XI 4 5345109 + 55 3174

TGME49_chr 359026 ID=TGME49_258080;length=21

Vllb 5 3592441 + 76 3175

TGME49_chr 159020 ID=TGME49_310780;length=37

XI 7 1593920 - 13 3176

TGME49_chr 524080 ID=TGME49_236010;length=24

X 8 5243220 - 12 3177

TGME49_chr 209510 ID=TGME49_264720;length= 19

IX 4 2097103 - 99 3178

TGME49_chr 234307 ID=TGME49_311870;length= 17

XI 1 2344870 - 99 3179

TGME49_chr 626852 ID=TGME49_268820;length=47

VIII 9 6273255 + 26 3180

TGME49_chr 500444 ID=TGME49_235640;length=21

X 8 5006565 + 17 3181 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 137383 ID=TGME49_310470;length= 15

XI 1 1375397 + 66 3182

TGME49_chr 525787 ID=TGME49_305170;length=36

IX 5 5261530 + 55 3183

TGME49_chr 189710 ID=TGME49_217550;length=26

XII 6 1899724 + 18 3184

TGME49_chr 603670 ID=TGME49_278530;length=18

XII 0 6038574 + 74 3185

TGME49_chr 379301 ID=TGME49_201785 ;length= 14

Vila 3 3794469 - 56 3186

TGME49_chr 402542 ID=TGME49_201120;length=39

Vila 2 4029337 + 15 3187

TGME49_chr ID=TGME49_212750;length= 14

V 581559 583050 - 91 3188

TGME49_chr 599215 ID=TGME49_278620;length=21

XII 6 5994337 + 81 3189

TGME49_chr ID=TGME49_239340;length=30

VI 554234 557286 + 52 3190

TGME49_chr 647286 ID=TGME49_216000;length=26

XI 9 6475497 + 28 3191

TGME49_chr 363778 ID=TGME49_290190;length=24

IX 3 3640204 - 21 3192

TGME49_chr 266936 ID=TGME49_243490;length= 10

VI 9 2670449 + 80 3193

TGME49_chr 382058 ID=TGME49_248350;length=17

XII 0 3822334 + 54 3194

TGME49_chr ID=TGME49_212900;length=49

V 695402 700329 - 27 3195

TGME49_chr 396071 ID=TGME49_314250;length=21

XI 5 3962825 + 10 3196

TGME49_chr 361919 ID=TGME49_258050;length=20

Vllb 0 3621196 + 06 3197

TGME49_chr 249313 ID=TGME49_288390;length= 15

IX 3 2494671 - 38 3198

TGME49_chr 369146 ID=TGME49_223630;length=17

X 3 3693211 - 48 3199

TGME49_chr 240774 ID=TGME49_225210;length=20

X 6 2409830 - 84 3200

TGME49_chr 117193 ID=TGME49_205760;length=22

Vila 5 1174196 - 61 3201

TGME49_chr 566188 ID=TGME49_237000;length= 14

X 1 5663299 - 18 3202

TGME49_chr 651560 ID=TGME49_214830;length=22

X 4 6517875 + 71 3203

TGME49_chr ID=TGME49_221460;length=25

II 364035 366572 + 37 3204

TGME49_chr 318145 3182748 - ID=TGME49_273915 ;length= 12 3205 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 1 97

TGME49_chr 299096 ID=TGME49_312950;length=31

XI 9 2994121 + 52 3206

TGME49_chr 380755 ID=TGME49_313930;length=35

XI 5 3811116 + 61 3207

TGME49_chr 395152 ID=TGME49_257595 ;length=21

Vllb 2 3953639 + 17 3208

TGME49_chr 432539 ID=TGME49_234380;length=44

X 4 4329853 + 59 3209

TGME49_chr 638681 ID=TGME49_268640;length= 13

VIII 0 6388130 - 20 3210

TGME49_chr 107686 ID=TGME49_206290;length=27

Vila 3 1079600 + 37 3211

TGME49_chr 126168 ID=TGME49_266270;length=20

IX 7 1263721 - 34 3212

TGME49_chr ID=TGME49_220140;length= 15

V 160175 161766 + 91 3213

TGME49_chr 594567 ID=TGME49_269280;length=24

VIII 6 5948163 + 87 3214

TGME49_chr 396071 ID=TGME49_314260;length=50

XI 5 3965719 + 04 3215

TGME49_chr 276530 ID=TGME49_246340;length=35

XII 7 2768904 + 97 3216

TGME49_chr 207973 ID=TGME49_242590;length= 19

VI 4 2081710 + 76 3217

TGME49_chr 207883 ID=TGME49_210778 ;length= 18

IV 8 2080697 + 59 3218

TGME49_chr ID=TGME49_206695 ;length=39

Vila 717178 721171 + 93 3219

TGME49_chr ID=TGME49_307840;length=29

XII 440948 443883 - 35 3220

TGME49_chr 369843 ID=TGME49_313810;length=35

XI 5 3702023 + 88 3221

TGME49_chr 394277 ID=TGME49_272490;length=24

VIII 5 3945251 - 76 3222

TGME49_chr ID=TGME49_304650;length=24

Vila 609399 611880 - 81 3223

TGME49_chr ID=TGME49_230410;length= 18

VIII 759334 761144 - 10 3224

TGME49_chr 680145 ID=TGME49_215180;length=29

X 9 6804369 + 10 3225

TGME49_chr 203962 ID=TGME49_311390;length=42

XI 5 2043830 + 05 3226

TGME49_chr 454800 ID=TGME49_249450;length=25

XII 8 4550577 - 69 3227

TGME49_chr 307174 ID=TGME49_246990;length=38

XII 1 3075581 + 40 3228 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_230020;length=26

VIII 555529 558130 - 01 3229

TGME49_chr 262521 ID=TGME49_288570;length=42

IX 7 2629487 - 70 3230

TGME49_chr 433889 ID=TGME49_271820;length=29

VIII 5 4341797 + 02 3231

TGME49_chr 502312 ID=TGME49_315845 ;length=44

XI 3 5027594 - 71 3232

TGME49_chr 389491 ID=TGME49_314038 ;length=55

XI 5 3895467 + 2 3233

TGME49_chr ID=TGME49_228080;length=26

X 592274 594953 + 79 3234

TGME49_chr ID=TGME49_206605 ;length=31

Vila 790922 794038 - 16 3235

TGME49_chr ID=TGME49_200700;length=48

X 12618 17436 - 18 3236

TGME49_chr ID=TGME49_221190;length=28

II 113565 116395 + 30 3237

TGME49_chr 160362 ID=TGME49_310810;length=29

XI 0 1606558 + 38 3238

TGME49_chr 572854 ID=TGME49_278960;length=19

XII 0 5730511 - 71 3239

TGME49_chr 211326 ID=TGME49_232450;length=45

VIII 5 2117768 - 03 3240

TGME49_chr 269301 ID=TGME49_243510;length= 17

VI 0 2694809 - 99 3241

TGME49_chr 213354 ID=TGME49_217790;length= 16

XII 7 2135173 - 26 3242

TGME49_chr ID=TGME49_228290;length=41

X 439688 443866 - 78 3243

TGME49_chr ID=TGME49_309840;length=18

XI 838551 840380 - 29 3244

TGME49_chr ID=TGME49_239790;length= 17

VI 910173 911950 + 77 3245

TGME49_chr 586103 ID=TGME49_216830;length=20

XI 5 5863057 - 22 3246

TGME49_chr 208515 ID=TGME49_232420;length=21

VIII 2 2087343 - 91 3247

TGME49_chr 645712 ID=TGME49_216020;length=49

XI 0 6462057 + 37 3248

TGME49_chr 483761 ID=TGME49_315600;length=35

XI 6 4841215 + 99 3249

TGME49_chr ID=TGME49_263220;length=22

Vllb 696220 698492 + 72 3250

TGME49_chr 525301 ID=TGME49_316350;length=21

XI 1 5255174 - 63 3251

TGME49_chr 620376 6205467 + ID=TGME49_216370;length= 17 3252 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XI 0 07

TGME49_chr 558474 ID=TGME49_269840;length=22

VIII 5 5587000 - 55 3253

TGME49_chr ID=TGME49_304630;length=31

Vila 546593 549758 + 65 3254

TGME49_chr 156371 ID=TGME49_209945 ;length=30

lb 6 1566724 + 08 3255

TGME49_chr ID=TGME49_238020;length=29

VI 16939 19862 - 23 3256

TGME49_chr 444495 ID=TGME49_271625 ;length= 15

VIII 0 4446452 - 02 3257

TGME49_chr 427699 ID=TGME49_281920;length=42

Vila 8 4281215 - 17 3258

TGME49_chr ID=TGME49_293690;length=20

la 512763 514787 - 24 3259

TGME49_chr ID=TGME49_300360;length=24

XII 54016 56494 - 78 3260

TGME49_chr 523811 ID=TGME49_270450;length=20

VIII 9 5240134 - 15 3261

TGME49_chr 366009 ID=TGME49_223672;length=24

X 6 3662573 + 77 3262

TGME49_chr 113783 ID=TGME49_318700;length=22

IV 0 1140106 + 76 3263

TGME49_chr 528585 ID=TGME49_270300;length=29

VIII 9 5288814 + 55 3264

TGME49_chr ID=TGME49_239330;length=45

VI 546954 551539 + 85 3265

TGME49_chr ID=TGME49_238165 ;length=29

VI 94162 97062 - 00 3266

TGME49_chr 567518 ID=TGME49_251940;length=27

XII 7 5677936 + 49 3267

TGME49_chr 656009 ID=TGME49_268300;length=14

VIII 3 6561570 - 77 3268

TGME49_chr 599298 ID=TGME49_278630;length=20

XII 2 5995016 - 34 3269

TGME49_chr 220397 ID=TGME49_311680;length=23

XI 5 2206339 - 64 3270

TGME49_chr 285503 ID=TGME49_246555;length=24

XII 6 2857439 - 03 3271

TGME49_chr 571335 ID=TGME49_269650;length=35

VIII 8 5716949 + 91 3272

TGME49_chr 106316 ID=TGME49_262720;length=39

Vllb 8 1067088 - 20 3273

TGME49_chr 460357 ID=TGME49_235010;length= 19

X 4 4605486 - 12 3274

TGME49_chr 107656 ID=TGME49_222370;length=25

II 8 1079158 + 90 3275 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_229650;length=21

VIII 367389 369573 - 84 3276

TGME49_chr 309988 ID=TGME49_247010;length=21

XII 6 3101996 + 10 3277

TGME49_chr 112704 ID=TGME49_262610;length=26

Vllb 8 1129656 + 08 3278

TGME49_chr 323235 ID=TGME49_273830;length=20

VIII 8 3234398 + 40 3279

TGME49_chr 392361 ID=TGME49_272520;length=22

VIII 1 3925862 + 51 3280

TGME49_chr ID=TGME49_320540;length=38

IV 278199 282008 - 09 3281

TGME49_chr ID=TGME49_228210;length= 17

X 493321 495081 - 60 3282

TGME49_chr 176852 ID=TGME49_232030;length=44

VIII 8 1773014 + 86 3283

TGME49_chr 527215 ID=TGME49_270330;length=39

VIII 6 5276079 - 23 3284

TGME49_chr 618311 ID=TGME49_268960;length=22

VIII 8 6185406 - 88 3285

TGME49_chr ID=TGME49_319730;length= 17

IV 758992 760757 - 65 3286

TGME49_chr ID=TGME49_319520;length=35

IV 948350 951893 - 43 3287

TGME49_chr 243886 ID=TGME49_288300;length=32

IX 6 2442135 + 69 3288

TGME49_chr 464046 ID=TGME49_255895 ;length=28

Vllb 3 4643361 - 98 3289

TGME49_chr 520651 ID=TGME49_235970;length=23

X 0 5208863 + 53 3290

TGME49_chr 457066 ID=TGME49_256010;length= 10

Vllb 1 4571741 - 80 3291

TGME49_chr 162798 ID=TGME49_226450;length=46

X 1 1632602 - 21 3292

TGME49_chr ID=TGME49_309300;length=26

XI 615166 617797 + 31 3293

TGME49_chr 331882 ID=TGME49_273670;length=39

VIII 8 3322791 + 63 3294

TGME49_chr ID=TGME49_263050;length= 15

Vllb 856438 857983 - 45 3295

TGME49_chr 400505 ID=TGME49_290970;length=22

IX 2 4007328 - 76 3296

TGME49_chr 264504 ID=TGME49_312380;length=23

XI 5 2647433 + 88 3297

TGME49_chr ID=TGME49_208980;length=47

lb 927952 932691 + 39 3298

TGME49_chr 151579 1517465 + ID=TGME49_226590;length= 16 3299 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

X 8 67

TGME49_chr 128574 ID=TGME49_226860;length=45

X 3 1290308 - 65 3300

TGME49_chr 616738 ID=TGME49_268985;length=19

VIII 3 6169354 - 71 3301

TGME49_chr 302548 ID=TGME49_224490;length=35

X 5 3029064 + 79 3302

TGME49_chr 204762 ID=TGME49_210800;length= 13

IV 7 2048983 - 56 3303

TGME49_chr ID=TGME49_266710;length=46

IX 998733 1003398 + 65 3304

TGME49_chr 439619 ID=TGME49_234450;length= 10

X 4 4397204 + 10 3305

TGME49_chr 111277 ID=TGME49_222390;length=27

II 7 1115543 - 66 3306

TGME49_chr ID=TGME49_319640;length=24

IV 812222 814651 + 29 3307

TGME49_chr 146017 ID=TGME49_213960;length= 13

V 0 1461527 + 57 3308

TGME49_chr ID=TGME49_293230;length=20

la 182514 184547 - 33 3309

TGME49_chr 227154 ID=TGME49_260430;length=30

Vllb 7 2274550 + 03 3310

TGME49_chr ID=TGME49_222170;length=43

II 924486 928859 + 73 3311

TGME49_chr 187466 ID=TGME49_226000;length=31

X 3 1877819 - 56 3312

TGME49_chr 328901 ID=TGME49_289540;length=36

IX 2 3292661 + 49 3313

TGME49_chr 492369 ID=TGME49_315720;length=25

XI 9 4926214 + 15 3314

TGME49_chr 263335 ID=TGME49_246050;length=26

XII 8 2636013 - 55 3315

TGME49_chr 344264 ID=TGME49_258390;length=22

Vllb 3 3444853 - 10 3316

TGME49_chr 147111 ID=TGME49_209830;length=32

lb 4 1474353 + 39 3317

TGME49_chr 652905 ID=TGME49_277770;length=50

XII 8 6534063 - 05 3318

TGME49_chr ID=TGME49_220510;length=27

V 423922 426714 + 92 3319

TGME49_chr 174639 ID=TGME49_261450;length=48

Vllb 1 1751219 - 28 3320

TGME49_chr 163269 ID=TGME49_211670;length= 19

IV 3 1634603 - 10 3321

TGME49_chr 396351 ID=TGME49_272440;length= 19

VIII 1 3965457 + 46 3322 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 566623 ID=TGME49_305920;length= 19

IX 6 5668173 + 37 3323

TGME49_chr 590265 ID=TGME49_278753;length=32

XII 5 5905896 - 41 3324

TGME49_chr 157411 ID=TGME49_310760;length= 13

XI 6 1575491 + 75 3325

TGME49_chr 169290 ID=TGME49_241300;length=25

VI 9 1695468 + 59 3326

TGME49_chr ID=TGME49_208200;length=38

lb 601374 605206 - 32 3327

TGME49_chr 552084 ID=TGME49_316730;length=25

XI 8 5523439 + 91 3328

TGME49_chr 312717 ID=TGME49_247040;length=28

XII 2 3130041 - 69 3329

TGME49_chr 139983 ID=TGME49_209690;length=25

lb 8 1402434 + 96 3330

TGME49_chr 577995 ID=TGME49_237200;length=25

X 1 5782510 - 59 3331

TGME49_chr 572423 ID=TGME49_278965 ;length=21

XII 7 5726426 - 89 3332

TGME49_chr 346637 ID=TGME49_313550;length=40

XI 4 3470440 + 66 3333

TGME49_chr 474972 ID=TGME49_255650;length=28

Vllb 2 4752553 + 31 3334

TGME49_chr 561149 ID=TGME49_305840;length=22

IX 0 5613721 + 31 3335

TGME49_chr 429130 ID=TGME49_291600;length=26

IX 4 4293908 - 04 3336

TGME49_chr 345384 ID=TGME49_313520;length=23

XI 9 3456185 + 36 3337

TGME49_chr 474994 ID=TGME49_249710;length=27

XII 2 4752741 + 99 3338

TGME49_chr 178276 ID=TGME49_295390;length=31

la 1 1785919 + 58 3339

TGME49_chr ID=TGME49_309580;length=25

XI 732325 734891 + 66 3340

TGME49_chr 649629 ID=TGME49_214787 ;length= 12

X 4 6497510 - 16 3341

TGME49_chr 365554 ID=TGME49_223690;length= 13

X 0 3656902 - 62 3342

TGME49_chr ID=TGME49_263400;length=26

Vllb 583138 585745 - 07 3343

TGME49_chr ID=TGME49_306940;length=22

XI 175012 177263 - 51 3344

TGME49_chr 625350 ID=TGME49_214330;length= 16

X 3 6255193 - 90 3345

TGME49_chr 217454 2176594 - ID=TGME49_232580;length=20 3346 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 6 48

TGME49_chr 584058 ID=TGME49_216860;length=40

XI 7 5844665 + 78 3347

TGME49_chr 154351 ID=TGME49_209910;length=33

lb 9 1546904 + 85 3348

TGME49_chr 376923 ID=TGME49_223500;length=44

X 6 3773701 + 65 3349

TGME49_chr 158597 ID=TGME49_240960;length=47

VI 3 1590736 - 63 3350

TGME49_chr ID=TGME49_212810;length=39

V 609849 613843 + 94 3351

TGME49_chr 131554 ID=TGME49_222940;length=42

II 6 1319755 + 09 3352

TGME49_chr 293246 ID=TGME49_312830;length= 19

XI 9 2934408 + 39 3353

TGME49_chr 519850 ID=TGME49_250770;length=34

XII 2 5201968 - 66 3354

TGME49_chr 344477 ID=TGME49_273490;length= 14

VIII 8 3446250 + 72 3355

TGME49_chr 139903 ID=TGME49_213870;length=36

V 4 1402700 - 66 3356

TGME49_chr 225696 ID=TGME49_245432;length=22

XII 8 2259255 - 87 3357

TGME49_chr 142394 ID=TGME49_296015 ;length=29

la 9 1426873 - 24 3358

TGME49_chr 437934 ID=TGME49_256900;length=19

Vllb 7 4381247 - 00 3359

TGME49_chr 319382 ID=TGME49_289330;length=23

IX 4 3196200 + 76 3360

TGME49_chr 246387 ID=TGME49_245680;length=17

XII 5 2465663 + 88 3361

TGME49_chr ID=TGME49_308860;length=28

XI 308195 311017 - 22 3362

TGME49_chr 607872 ID=TGME49_216580;length=28

XI 3 6081550 - 27 3363

TGME49_chr ID=TGME49_230110;length= 15

VIII 609952 611492 - 40 3364

TGME49_chr ID=TGME49_308950;length=35

XI 378842 382347 + 05 3365

TGME49_chr 549935 ID=TGME49_251630;length= 12

XII 6 5500559 + 03 3366

TGME49_chr 364142 ID=TGME49_292610;length=20

VI 0 3643484 + 64 3367

TGME49_chr 146874 ID=TGME49_223130;length=20

II 6 1470751 - 05 3368

TGME49_chr 139777 ID=TGME49_213880;length=26

V 0 1400442 + 72 3369 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 104119 ID=TGME49_310070;length=25

XI 2 1043750 - 58 3370

TGME49_chr ID=TGME49_294360;length=30

la 837889 840906 + 17 3371

TGME49_chr 217373 ID=TGME49_285850;length=19

V 8 2175720 - 82 3372

TGME49_chr 641401 ID=TGME49_214600;length=20

X 4 6416110 - 96 3373

TGME49_chr 625286 ID=TGME49_268850;length=27

VIII 0 6255582 - 22 3374

TGME49_chr 171380 ID=TGME49_226330;length=25

X 7 1716377 - 70 3375

TGME49_chr 620636 ID=TGME49_214270;length=47

X 3 6211100 - 37 3376

TGME49_chr 557956 ID=TGME49_251770;length=27

XII 2 5582287 - 25 3377

TGME49_chr 206525 ID=TGME49_210783 ;length= 19

IV 3 2067243 - 90 3378

TGME49_chr 400859 ID=TGME49_257500;length=26

Vllb 6 4011217 + 21 3379

TGME49_chr 154534 ID=TGME49_297120;length=44

II 8 1549822 + 74 3380

TGME49_chr 367637 ID=TGME49_248120;length=38

XII 2 3680205 - 33 3381

TGME49_chr 133138 ID=TGME49_205558;length=31

Vila 4 1334559 + 75 3382

TGME49_chr 259647 ID=TGME49_288530;length=20

IX 1 2598483 + 12 3383

TGME49_chr 210377 ID=TGME49_311470;length=40

XI 0 2107867 - 97 3384

TGME49_chr 191160 ID=TGME49_217570;length= 12

XII 1 1912822 + 21 3385

TGME49_chr 321636 ID=TGME49_273860;length=31

VIII 8 3219488 + 20 3386

TGME49_chr 461647 ID=TGME49_271260;length=26

VIII 0 4619161 + 91 3387

TGME49_chr 620311 ID=TGME49_306590;length= 18

IX 9 6204949 - 30 3388

TGME49_chr ID=TGME49_227640;length= 18

X 865963 867849 - 86 3389

TGME49_chr 301656 ID=TGME49_289130;length=34

IX 5 3020010 + 45 3390

TGME49_chr ID=TGME49_207980;length= 13

lb 498140 499444 - 04 3391

TGME49_chr 567474 ID=TGME49_237015 ;length= 18

X 2 5676571 - 29 3392

TGME49_chr 164534 1649764 - ID=TGME49_226430;length=44 3393 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

X 6 18

TGME49_chr 318164 ID=TGME49_244300;length= 19

VI 0 3183599 - 59 3394

TGME49_chr ID=TGME49_320700;length=21

IV 132847 135011 - 64 3395

TGME49_chr 453406 ID=TGME49_292080;length=25

IX 8 4536660 + 92 3396

TGME49_chr 263117 ID=TGME49_288610;length=50

IX 6 2636201 + 25 3397

TGME49_chr 330994 ID=TGME49_273720;length=47

VIII 9 3314650 - 01 3398

TGME49_chr 637331 ID=TGME49_216160;length=31

XI 2 6376490 - 78 3399

TGME49_chr 284727 ID=TGME49_243780;length=48

VI 9 2852109 + 30 3400

TGME49_chr 693787 ID=TGME49_215380;length=22

X 6 6940165 - 89 3401

TGME49_chr 272200 ID=TGME49_288740;length=36

IX 0 2725631 - 31 3402

TGME49_chr 579207 ID=TGME49_216910;length=20

XI 2 5794118 + 46 3403

TGME49_chr 138521 ID=TGME49_231380;length=47

VIII 1 1389924 + 13 3404

TGME49_chr 398651 ID=TGME49_314295 ;length= 18

XI 5 3988412 + 97 3405

TGME49_chr 200389 ID=TGME49_217680;length=47

XII 8 2008672 - 74 3406

TGME49_chr ID=TGME49_267600;length= 18

IX 530467 532341 - 74 3407

TGME49_chr 460604 ID=TGME49_315260;length= 13

XI 1 4607359 + 18 3408

TGME49_chr 357686 ID=TGME49_273350;length=33

VIII 3 3580229 + 66 3409

TGME49_chr ID=TGME49_221310;length= 18

II 233786 235666 - 80 3410

TGME49_chr 428701 ID=TGME49_248990;length= 19

XII 0 4288915 + 05 3411

TGME49_chr ID=TGME49_221818 ;length=22

II 695953 698239 - 86 3412

TGME49_chr 698314 ID=TGME49_215500;length= 18

X 9 6985034 + 85 3413

TGME49_chr ID=TGME49_213020;length=43

V 799775 804114 - 39 3414

TGME49_chr 506139 ID=TGME49_255160;length=31

Vllb 8 5064594 + 96 3415

TGME49_chr 341788 ID=TGME49_244645 ;length=26

VI 2 3420561 + 79 3416 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 307306 ID=TGME49_246982;length=30

XII 4 3076076 - 12 3417

TGME49_chr 150026 ID=TGME49_205265 ;length=47

Vila 9 1505002 + 33 3418

TGME49_chr 211244 ID=TGME49_232475 ;length= 19

VIII 8 2114445 + 97 3419

TGME49_chr 118776 ID=TGME49_218960;length=47

XII 9 1192566 - 97 3420

TGME49_chr 180004 ID=TGME49_241880;length=46

VI 5 1804730 - 85 3421

TGME49_chr 550593 ID=TGME49_305610;length=30

IX 7 5508990 - 53 3422

TGME49_chr ID=TGME49_238500;length=48

VI 253436 258317 - 81 3423

TGME49_chr 393380 ID=TGME49_257650;length=14

Vllb 2 3935262 - 60 3424

TGME49_chr 633073 ID=TGME49_268760;length=38

VIII 9 6334570 - 31 3425

TGME49_chr ID=TGME49_230440;length=40

VIII 793083 797090 - 07 3426

TGME49_chr ID=TGME49_280680;length=10

Vila 186501 187589 + 88 3427

TGME49_chr 670570 ID=TGME49_277220;length= 18

XII 1 6707565 - 64 3428

TGME49_chr 200385 ID=TGME49_297790;length= 18

II 4 2005661 - 07 3429

TGME49_chr 274226 ID=TGME49_243590;length=17

VI 5 2744014 - 49 3430

TGME49_chr 706704 ID=TGME49_215640;length=48

X 4 7071859 - 15 3431

TGME49_chr 174135 ID=TGME49_204505 ;length= 15

Vila 8 1742899 + 41 3432

TGME49_chr 184315 ID=TGME49_311180;length=32

XI 5 1846445 + 90 3433

TGME49_chr ID=TGME49_239820;length=45

VI 948026 952595 - 69 3434

TGME49_chr 183493 ID=TGME49_217430;length=21

XII 4 1837076 - 42 3435

TGME49_chr 314247 ID=TGME49_313160;length=50

XI 9 3147503 + 24 3436

TGME49_chr 713035 ID=TGME49_215730;length=21

X 9 7132487 + 28 3437

TGME49_chr 228666 ID=TGME49_232840;length=78

VIII 6 2287451 - 5 3438

TGME49_chr 163689 ID=TGME49_310860;length=20

XI 2 1638984 - 92 3439

TGME49_chr 427622 4277993 + ID=TGME49_281930;length= 17 3440 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

Vila 9 64

TGME49_chr 632951 ID=TGME49_268750;length=45

VIII 9 6334023 + 04 3441

TGME49_chr ID=TGME49_239290;length=32

VI 528450 531729 - 79 3442

TGME49_chr 135506 ID=TGME49_205530;length=42

Vila 7 1359343 - 76 3443

TGME49_chr 587790 ID=TGME49_216800;length=23

XI 7 5880229 - 22 3444

TGME49_chr 267966 ID=TGME49_259835;length=41

Vllb 3 2683799 - 36 3445

TGME49_chr 559844 ID=TGME49_305800;length=27

IX 2 5601226 - 84 3446

TGME49_chr ID=TGME49_294100;length=23

la 676197 678509 - 12 3447

TGME49_chr 480745 ID=TGME49_249800;length=30

XII 2 4810501 - 49 3448

TGME49_chr 295233 ID=TGME49_312850;length=90

XI 4 2953239 - 5 3449

TGME49_chr ID=TGME49_238150;length=24

VI 85967 88465 + 98 3450

TGME49_chr 177487 ID=TGME49_241870;length= 13

VI 0 1776217 + 47 3451

TGME49_chr 244382 ID=TGME49_225160;length= 15

X 0 2445343 + 23 3452

TGME49_chr 225686 ID=TGME49_285740;length= 16

V 5 2258465 + 00 3453

TGME49_chr 305508 ID=TGME49_202760;length=22

Vila 7 3057294 - 07 3454

TGME49_chr 521599 ID=TGME49_316270;length=27

XI 9 5218760 + 61 3455

TGME49_chr 164128 ID=TGME49_211650;length=32

IV 9 1644518 - 29 3456

TGME49_chr 572012 ID=TGME49_237110;length=29

X 6 5723099 + 73 3457

TGME49_chr 274845 ID=TGME49_246210;length= 16

XII 8 2750074 + 16 3458

TGME49_chr 532289 ID=TGME49_270220;length=37

VIII 9 5326644 + 45 3459

TGME49_chr ID=TGME49_219520;length=25

XII 889364 891927 - 63 3460

TGME49_chr 137994 ID=TGME49_240670;length= 14

VI 1 1381383 + 42 3461

TGME49_chr 137383 ID=TGME49_310480;length=46

XI 1 1378479 + 48 3462

TGME49_chr 120818 ID=TGME49_218930;length=45

XII 9 1212694 - 05 3463 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 361644 ID=TGME49_247850;length=14

XII 4 3617866 - 22 3464

TGME49_chr 148594 ID=TGME49_287270;length= 13

V 8 1487275 + 27 3465

TGME49_chr 132180 ID=TGME49_213780;length=39

V 0 1325747 - 47 3466

TGME49_chr 121684 ID=TGME49_318580;length= 15

IV 0 1218366 - 26 3467

TGME49_chr 396964 ID=TGME49_201220;length=25

Vila 2 3972217 - 75 3468

TGME49_chr 493096 ID=TGME49_235515;length=36

X 0 4934578 - 18 3469

TGME49_chr 184851 ID=TGME49_226040;length=32

X 4 1851776 + 62 3470

TGME49_chr 120166 ID=TGME49_294830;length=27

la 4 1204376 - 12 3471

TGME49_chr 421000 ID=TGME49_272030;length=24

VIII 0 4212465 - 65 3472

TGME49_chr ID=TGME49_308020;length=30

XII 511743 514831 - 88 3473

TGME49_chr 260854 ID=TGME49_259920;length=23

Vllb 8 2610923 + 75 3474

TGME49_chr 529061 ID=TGME49_236120;length=34

X 1 5294071 - 60 3475

TGME49_chr 384934 ID=TGME49_201680;length=48

Vila 4 3854159 + 15 3476

TGME49_chr 655751 ID=TGME49_268310;length=40

VIII 7 6561570 - 53 3477

TGME49_chr 580692 ID=TGME49_237240;length=20

X 6 5808940 - 14 3478

TGME49_chr ID=TGME49_227450;length=25

X 922178 924766 + 88 3479

TGME49_chr 265901 ID=TGME49_288650;length=29

IX 8 2661986 - 68 3480

TGME49_chr 275901 ID=TGME49_243620;length=34

VI 9 2762419 - 00 3481

TGME49_chr 647478 ID=TGME49_214760;length=44

X 4 6479194 + 10 3482

TGME49_chr 167016 ID=TGME49_261530;length=22

Vllb 3 1672456 - 93 3483

TGME49_chr 580156 ID=TGME49_216900;length= 13

XI 5 5802917 - 52 3484

TGME49_chr 697992 ID=TGME49_215480;length= 15

X 7 6981500 - 73 3485

TGME49_chr 407848 ID=TGME49_248677;length=43

XII 5 4082874 + 89 3486

TGME49_chr 153327 1537059 - ID=TGME49_226570;length=37 3487 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

X 7 82

TGME49_chr 167065 ID=TGME49_265240;length=27

IX 8 1673385 + 27 3488

TGME49_chr 535113 ID=TGME49_236250;length=l 1

X 0 5352320 + 90 3489

TGME49_chr ID=TGME49_299980;length=14

XII 302852 304279 - 27 3490

TGME49_chr 140395 ID=TGME49_209700;length= 12

lb 3 1405251 + 98 3491

TGME49_chr 300370 ID=TGME49_224520;length=36

X 5 3007325 + 20 3492

TGME49_chr ID=TGME49_320620;length= 17

IV 194285 196070 + 85 3493

TGME49_chr 518932 ID=TGME49_270570;length= 17

VIII 3 5191091 - 68 3494

TGME49_chr ID=TGME49_227390;length=46

X 971149 975816 - 67 3495

TGME49_chr 114538 ID=TGME49_213560;length=l 1

V 6 1146565 + 79 3496

TGME49_chr 566119 ID=TGME49_251910;length=21

XII 0 5663297 - 07 3497

TGME49_chr 168598 ID=TGME49_321630;length=49

lb 9 1690920 - 31 3498

TGME49_chr ID=TGME49_294060;length=39

la 662350 666276 + 26 3499

TGME49_chr 334749 ID=TGME49_313410;length=27

XI 9 3350293 + 94 3500

TGME49_chr ID=TGME49_309290;length=28

XI 611336 614210 + 74 3501

TGME49_chr 438590 ID=TGME49_291860;length= 16

IX 9 4387593 - 84 3502

TGME49_chr 267117 ID=TGME49_246110;length=40

XII 6 2675189 - 13 3503

TGME49_chr ID=TGME49_308075 ;length=75

XII 565309 566062 - 3 3504

TGME49_chr 472715 ID=TGME49_249670;length=26

XII 5 4729832 - 77 3505

TGME49_chr ID=TGME49_239070;length= 19

VI 447757 449748 - 91 3506

TGME49_chr ID=TGME49_221360;length=31

II 287426 290540 - 14 3507

TGME49_chr 187335 ID=TGME49_261070;length=30

Vllb 0 1876420 + 70 3508

TGME49_chr 401155 ID=TGME49_272390;length=44

VIII 0 4016029 - 79 3509

TGME49_chr 261052 ID=TGME49_233360;length=50

VIII 2 2615562 - 40 3510 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 543706 ID=TGME49_270070;length= 19

VIII 5 5439034 - 69 3511

TGME49_chr ID=TGME49_238130;length=21

VI 75690 77850 - 60 3512

TGME49_chr 153090 ID=TGME49_205230;length= 17

Vila 3 1532616 - 13 3513

TGME49_chr 275460 ID=TGME49_259610;length=20

Vllb 2 2756634 - 32 3514

TGME49_chr 115807 ID=TGME49_310210;length=27

XI 9 1160818 - 39 3515

TGME49_chr ID=TGME49_230000;length=27

VIII 530520 533240 - 20 3516

TGME49_chr 552856 ID=TGME49_305635;length=14

IX 9 5530025 + 56 3517

TGME49_chr 163337 ID=TGME49_218260;length=46

XII 5 1637998 + 23 3518

TGME49_chr 353033 ID=TGME49_273400;length=22

VIII 8 3532583 - 45 3519

TGME49_chr 220048 ID=TGME49_311670;length=23

XI 0 2202879 - 99 3520

TGME49_chr 430572 ID=TGME49_291630;length=43

IX 6 4310108 - 82 3521

TGME49_chr 126614 ID=TGME49_294902;length= 12

la 5 1267436 - 91 3522

TGME49_chr ID=TGME49_207690;length=42

lb 209955 214235 + 80 3523

TGME49_chr 285884 ID=TGME49_246580;length=44

XII 7 2863310 + 63 3524

TGME49_chr ID=TGME49_219610;length=34

XII 823067 826469 + 02 3525

TGME49_chr 383349 ID=TGME49_201710;length= 11

Vila 3 3834672 + 79 3526

TGME49_chr 329606 ID=TGME49_258625 ;length= 18

Vllb 0 3297939 + 79 3527

TGME49_chr ID=TGME49_239710;length=22

VI 867209 869418 - 09 3528

TGME49_chr 127542 ID=TGME49_205615 ;length= 10

Vila 3 1276455 - 32 3529

TGME49_chr 470870 ID=TGME49_249630;length= 16

XII 2 4710344 - 42 3530

TGME49_chr 417174 ID=TGME49_234165 ;length=35

X 8 4175262 - 14 3531

TGME49_chr ID=TGME49_300260;length=20

XII 120499 122563 + 64 3532

TGME49_chr ID=TGME49_239260;length=32

VI 498397 501660 - 63 3533

TGME49_chr 493816 497365 - ID=TGME49_229960;length=35 3534 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 49

TGME49_chr 361628 ID=TGME49_202090;length=44

Vila 0 3620711 - 31 3535

TGME49_chr 697368 ID=TGME49_215460;length= 11

X 4 6974864 - 80 3536

TGME49_chr 672589 ID=TGME49_215060;length=35

X 0 6729429 - 39 3537

TGME49_chr 276280 ID=TGME49_312520;length=26

XI 5 2765415 - 10 3538

TGME49_chr ID=TGME49_206415 ;length=47

Vila 989384 994150 + 66 3539

TGME49_chr 103305 ID=TGME49_209080;length=28

lb 7 1035956 - 99 3540

TGME49_chr ID=TGME49_263720;length=38

Vllb 363934 367739 + 05 3541

TGME49_chr 367932 ID=TGME49_290290;length=20

IX 9 3681333 + 04 3542

TGME49_chr 132724 ID=TGME49_310410;length= 14

XI 1 1328715 - 74 3543

TGME49_chr ID=TGME49_207630;length=20

lb 162965 165017 + 52 3544

TGME49_chr 341163 ID=TGME49_273550;length=23

VIII 0 3413948 - 18 3545

TGME49_chr 423761 ID=TGME49_291310;length= 19

IX 3 4239542 - 29 3546

TGME49_chr 271127 ID=TGME49_284560;length=17

V 7 2713061 + 84 3547

TGME49_chr ID=TGME49_238100;length=44

VI 66661 71093 - 32 3548

TGME49_chr 110818 ID=TGME49_209200;length=40

lb 1 1112225 + 44 3549

TGME49_chr 112501 ID=TGME49_227030;length=40

X 8 1129103 + 85 3550

TGME49_chr 147672 ID=TGME49_218510;length=40

XII 0 1480771 + 51 3551

TGME49_chr 302829 ID=TGME49_274110;length=22

VIII 4 3030583 - 89 3552

TGME49_chr ID=TGME49_267312;length=44

IX 727730 732199 - 69 3553

TGME49_chr 172377 ID=TGME49_241820;length=39

VI 5 1727714 + 39 3554

TGME49_chr ID=TGME49_230040;length= 13

VIII 560930 562293 + 63 3555

TGME49_chr 141962 ID=TGME49_223070;length= 17

II 6 1421332 + 06 3556

TGME49_chr 281672 ID=TGME49_312600;length=20

XI 5 2818748 + 23 3557 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 708587 ID=TGME49_215680;length=14

X 3 7087326 - 53 3558

TGME49_chr 116295 ID=TGME49_318675;length=27

IV 8 1165755 - 97 3559

TGME49_chr 193260 ID=TGME49_211150;length=27

IV 8 1935398 - 90 3560

TGME49_chr 302417 ID=TGME49_246940;length=29

XII 6 3027111 + 35 3561

TGME49_chr 171839 ID=TGME49_261475 ;length= 17

Vllb 4 1720099 + 05 3562

TGME49_chr 518858 ID=TGME49_270560;length=20

VIII 9 5190591 + 02 3563

TGME49_chr 692247 ID=TGME49_215350;length=44

X 4 6926889 - 15 3564

TGME49_chr 211463 ID=TGME49_264670;length=20

IX 3 2116655 + 22 3565

TGME49_chr 530262 ID=TGME49_305240;length=31

IX 0 5305770 + 50 3566

TGME49_chr 372226 ID=TGME49_313840;length= 19

XI 5 3724229 + 64 3567

TGME49_chr 367939 ID=TGME49_223650;length=20

X 6 3681487 + 91 3568

TGME49_chr 161225 ID=TGME49_310820;length= 18

XI 5 1614074 + 19 3569

TGME49_chr 430366 ID=TGME49_314790;length=20

XI 3 4305739 - 76 3570

TGME49_chr 303796 ID=TGME49_202780;length=47

Vila 0 3042664 - 04 3571

TGME49_chr 535366 ID=TGME49_270180;length= 11

VIII 9 5354773 + 04 3572

TGME49_chr 518667 ID=TGME49_316220;length=22

XI 4 5188879 + 05 3573

TGME49_chr 649143 ID=TGME49_277830;length=42

XII 0 6491855 - 5 3574

TGME49_chr 117836 ID=TGME49_318640;length= 17

IV 0 1180121 - 61 3575

TGME49_chr 139120 ID=TGME49_209670;length= 18

lb 8 1393087 + 79 3576

TGME49_chr ID=TGME49_208590;length=28

lb 801471 804370 + 99 3577

TGME49_chr ID=TGME49_229740;length=20

VIII 415461 417463 - 02 3578

TGME49_chr 127498 ID=TGME49_222880;length= 19

II 6 1276921 + 35 3579

TGME49_chr 246664 ID=TGME49_225125 ;length=24

X 0 2469050 + 10 3580

TGME49_chr 327759 3280720 - ID=TGME49_258660;length=31 3581 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

Vllb 6 24

TGME49_chr 139578 ID=TGME49_262070;length=33

Vllb 5 1399179 - 94 3582

TGME49_chr 659704 ID=TGME49_277560;length=23

XII 7 6599362 + 15 3583

TGME49_chr 721585 ID=TGME49_275380;length= 19

X 3 7217846 - 93 3584

TGME49_chr 676616 ID=TGME49_215100;length=35

X 6 6769751 - 85 3585

TGME49_chr 538683 ID=TGME49_236400;length=22

X 4 5389094 - 60 3586

TGME49_chr 153154 ID=TGME49_297100;length=35

II 4 1535065 - 21 3587

TGME49_chr 207309 ID=TGME49_311440;length=45

XI 1 2077650 - 59 3588

TGME49_chr 321612 ID=TGME49_202572;length= 19

Vila 4 3218042 - 18 3589

TGME49_chr 138804 ID=TGME49_295090;length=34

la 1 1391447 + 06 3590

TGME49_chr ID=TGME49_227600;length= 13

X 896541 897868 - 27 3591

TGME49_chr 622202 ID=TGME49_278230;length= 16

XII 1 6223691 - 70 3592

TGME49_chr 582599 ID=TGME49_306210;length=23

IX 7 5828317 + 20 3593

TGME49_chr 328288 ID=TGME49_224170;length=l 1

X 6 3284073 + 87 3594

TGME49_chr ID=TGME49_229680;length=30

VIII 377418 380433 - 15 3595

TGME49_chr 310419 ID=TGME49_283510;length=28

V 1 3107086 + 95 3596

TGME49_chr 220200 ID=TGME49_285820;length= 14

V 9 2203415 - 06 3597

TGME49_chr 197043 ID=TGME49_211015 ;length= 14

IV 0 1971908 - 78 3598

TGME49_chr 105291 ID=TGME49_240090;length=47

VI 9 1057698 + 79 3599

TGME49_chr 133053 ID=TGME49_240580;length=21

VI 3 1332686 - 53 3600

TGME49_chr 529636 ID=TGME49_236140;length= 18

X 0 5298183 + 23 3601

TGME49_chr 157585 ID=TGME49_205150;length=22

Vila 1 1578088 + 37 3602

TGME49_chr 366899 ID=TGME49_257990;length= 17

Vllb 5 3670754 + 59 3603

TGME49_chr ID=TGME49_207400;length=20

lb 27811 29823 + 12 3604 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 367401 ID=TGME49_273070;length=45

VIII 4 3678518 + 04 3605

TGME49_chr 202072 ID=TGME49_210830;length=20

IV 8 2022776 + 48 3606

TGME49_chr ID=TGME49_293680;length=30

la 501242 504308 + 66 3607

TGME49_chr 418829 ID=TGME49_272040;length= 19

VIII 4 4190240 + 46 3608

TGME49_chr 159327 ID=TGME49_261670;length= 17

Vllb 4 1595045 + 71 3609

TGME49_chr 234504 ID=TGME49_242845 ;length=27

VI 4 2347765 + 21 3610

TGME49_chr ID=TGME49_262960;length=40

Vllb 892798 896825 + 27 3611

TGME49_chr 457653 ID=TGME49_255990;length= 15

Vllb 5 4578100 - 65 3612

TGME49_chr 356426 ID=TGME49_258120;length=31

Vllb 9 3567431 - 62 3613

TGME49_chr ID=TGME49_213067;length=28

V 841156 844040 - 84 3614

TGME49_chr ID=TGME49_263680;length=19

Vllb 400077 401984 - 07 3615

TGME49_chr 565125 ID=TGME49_269720;length=30

VIII 7 5654273 - 16 3616

TGME49_chr 582451 ID=TGME49_216875 ;length=92

XI 9 5825439 - 0 3617

TGME49_chr ID=TGME49_293370;length=18

la 289525 291377 - 52 3618

TGME49_chr ID=TGME49_263740;length=46

Vllb 340984 345618 + 34 3619

TGME49_chr 333462 ID=TGME49_244515 ;length=23

VI 4 3336932 - 08 3620

TGME49_chr 646217 ID=TGME49_268390;length=18

VIII 8 6464044 + 66 3621

TGME49_chr 182553 ID=TGME49_297495 ;length=49

II 5 1830439 - 04 3622

TGME49_chr ID=TGME49_238250;length=35

VI 151812 155358 + 46 3623

TGME49_chr 415323 ID=TGME49_211870;length=26

X 4 4155867 + 33 3624

TGME49_chr 311836 ID=TGME49_202660;length=35

Vila 8 3121954 - 86 3625

TGME49_chr 368953 ID=TGME49_248140;length=27

XII 7 3692256 - 19 3626

TGME49_chr 327996 ID=TGME49_313330;length=31

XI 7 3283134 - 67 3627

TGME49_chr 396066 3963072 - ID=TGME49_272460;length=24 3628 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 9 03

TGME49_chr 546346 ID=TGME49_236635;length=17

X 9 5465206 - 37 3629

TGME49_chr 140265 ID=TGME49_213890;length= 18

V 0 1404500 + 50 3630

TGME49_chr 415323 ID=TGME49_211880;length=20

X 4 4155275 + 41 3631

TGME49_chr 126699 ID=TGME49_226880;length=26

X 3 1269615 + 22 3632

TGME49_chr 152877 ID=TGME49_265480;length=20

IX 8 1530786 + 08 3633

TGME49_chr 174584 ID=TGME49_265150;length= 14

IX 5 1747343 - 98 3634

TGME49_chr ID=TGME49_239740;length=37

VI 884386 888183 - 97 3635

TGME49_chr 531719 ID=TGME49_316460;length=37

XI 4 5320938 + 44 3636

TGME49_chr ID=TGME49_263130;length=37

Vllb 776365 780112 + 47 3637

TGME49_chr 644282 ID=TGME49_216050;length= 17

XI 5 6444555 - 30 3638

TGME49_chr 231852 ID=TGME49_225310;length= 19

X 5 2320515 + 90 3639

TGME49_chr 429630 ID=TGME49_271892;length=22

VIII 4 4298540 - 36 3640

TGME49_chr 361143 ID=TGME49_273280;length=17

VIII 8 3613139 + 01 3641

TGME49_chr 217283 ID=TGME49_285840;length= 19

V 0 2174806 + 76 3642

TGME49_chr ID=TGME49_293340;length=22

la 271448 273727 - 79 3643

TGME49_chr 266046 ID=TGME49_259850;length=21

Vllb 3 2662578 + 15 3644

TGME49_chr 292475 ID=TGME49_259168;length=l 1

Vllb 5 2925875 - 20 3645

TGME49_chr 609433 ID=TGME49_278440;length=25

XII 5 6096858 + 23 3646

TGME49_chr 514613 ID=TGME49_304980;length=78

IX 0 5146913 - 3 3647

TGME49_chr 360068 ID=TGME49_313673;length=37

XI 8 3604400 - 12 3648

TGME49_chr 180220 ID=TGME49_261260;length=24

Vllb 6 1804702 - 96 3649

TGME49_chr 611665 ID=TGME49_216500;length= 19

XI 9 6118655 - 96 3650

TGME49_chr 185497 ID=TGME49_217470;length=45

XII 1 1859523 + 52 3651 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_228490;length=23

X 293996 296317 - 21 3652

TGME49_chr 531834 ID=TGME49_305260;length=24

IX 5 5320794 + 49 3653

TGME49_chr 239901 ID=TGME49_242900;length= 17

VI 9 2400763 + 44 3654

TGME49_chr 164247 ID=TGME49_205080;length=29

Vila 4 1645461 - 87 3655

TGME49_chr 612397 ID=TGME49_269035;length=39

VIII 1 6127883 - 12 3656

TGME49_chr 254318 ID=TGME49_285200;length=21

V 4 2545309 - 25 3657

TGME49_chr 449269 ID=TGME49_249365;length=39

XII 7 4496623 + 26 3658

TGME49_chr ID=TGME49_238915 ;length=48

VI 303372 308255 + 83 3659

TGME49_chr ID=TGME49_221260;length=32

II 179710 182938 + 28 3660

TGME49_chr 429488 ID=TGME49_249000;length=20

XII 8 4296930 - 42 3661

TGME49_chr 350524 ID=TGME49_289880;length=33

IX 5 3508638 - 93 3662

TGME49_chr 286929 ID=TGME49_259210;length= 18

Vllb 9 2871159 + 60 3663

TGME49_chr 377764 ID=TGME49_201800;length=20

Vila 8 3779712 + 64 3664

TGME49_chr 307207 ID=TGME49_313055;length=22

XI 4 3074339 - 65 3665

TGME49_chr 415722 ID=TGME49_281500;length= 19

Vila 0 4159151 - 31 3666

TGME49_chr ID=TGME49_264225 ;length=38

Vllb 57559 61364 - 05 3667

TGME49_chr 187673 ID=TGME49_311230;length=41

XI 5 1880862 + 27 3668

TGME49_chr 444059 ID=TGME49_256810;length= 15

Vllb 7 4442145 + 48 3669

TGME49_chr 307230 ID=TGME49_202730;length=23

Vila 9 3074641 + 32 3670

TGME49_chr ID=TGME49_229480;length=42

VIII 295302 299523 - 21 3671

TGME49_chr 370084 ID=TGME49_313800;length=27

XI 2 3703550 - 08 3672

TGME49_chr ID=TGME49_208560;length=38

lb 783599 787408 - 09 3673

TGME49_chr 633795 ID=TGME49_268730;length= 15

VIII 4 6339540 + 86 3674

TGME49_chr 263004 2632405 + ID=TGME49_312360;length=23 3675 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XI 5 60

TGME49_chr 123527 ID=TGME49_266300;length=43

IX 7 1239596 - 19 3676

TGME49_chr ID=TGME49_266920;length=39

IX 894353 898324 + 71 3677

TGME49_chr 415896 ID=TGME49_272180;length= 18

VIII 4 4160790 + 26 3678

TGME49_chr 244382 ID=TGME49_225150;length=44

X 0 2448305 + 85 3679

TGME49_chr ID=TGME49_293290;length= 17

la 236822 238568 + 46 3680

TGME49_chr 582599 ID=TGME49_306200;length= 16

IX 7 5827653 + 56 3681

TGME49_chr 235479 ID=TGME49_288050;length= 13

IX 6 2356195 - 99 3682

TGME49_chr 312835 ID=TGME49_273995 ;length= 15

VIII 3 3129861 - 08 3683

TGME49_chr ID=TGME49_263860;length= 14

Vllb 282534 283983 + 49 3684

TGME49_chr 128544 ID=TGME49_213748 ;length=21

V 9 1287602 - 53 3685

TGME49_chr 412707 ID=TGME49_291120;length=31

IX 8 4130241 + 63 3686

TGME49_chr 274726 ID=TGME49_233720;length=20

VIII 3 2749263 + 00 3687

TGME49_chr 341463 ID=TGME49_247550;length=30

XII 9 3417680 - 41 3688

TGME49_chr 601034 ID=TGME49_214080;length=38

X 4 6014213 + 69 3689

TGME49_chr ID=TGME49_309720;length=39

XI 763150 767129 + 79 3690

TGME49_chr ID=TGME49_238200;length=29

VI 118877 121839 + 62 3691

TGME49_chr ID=TGME49_309810;length= 16

XI 829867 831550 - 83 3692

TGME49_chr 236022 ID=TGME49_242860;length=48

VI 8 2365079 - 51 3693

TGME49_chr 397854 ID=TGME49_212200;length=28

X 5 3981345 + 00 3694

TGME49_chr 685241 ID=TGME49_276920;length=32

XII 9 6855685 - 66 3695

TGME49_chr 494394 ID=TGME49_255260;length=45

Vllb 3 4948498 + 55 3696

TGME49_chr 470757 ID=TGME49_249650;length=21

XII 7 4709680 + 03 3697

TGME49_chr 470997 ID=TGME49_255710;length= 14

Vllb 2 4711447 - 75 3698 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 208344 ID=TGME49_264740;length=26

IX 0 2086073 - 33 3699

TGME49_chr 285090 ID=TGME49_284040;length=30

V 6 2853929 - 23 3700

TGME49_chr 595151 ID=TGME49_237520;length=29

X 3 5954484 - 71 3701

TGME49_chr 511389 ID=TGME49_270630;length= 16

VIII 9 5115543 + 44 3702

TGME49_chr ID=TGME49_208570;length=44

lb 781751 786180 + 29 3703

TGME49_chr 237739 ID=TGME49_225240;length= 18

X 5 2379215 + 20 3704

TGME49_chr 313455 ID=TGME49_224275 ;length=25

X 6 3137082 + 26 3705

TGME49_chr 100583 ID=TGME49_213410;length= 19

V 6 1007767 + 31 3706

TGME49_chr 346904 ID=TGME49_313540;length= 18

XI 8 3470899 - 51 3707

TGME49_chr 104947 ID=TGME49_230830;length=39

VIII 3 1053414 - 41 3708

TGME49_chr 330076 ID=TGME49_244470;length=32

VI 9 3304052 - 83 3709

TGME49_chr ID=TGME49_309760;length=45

XI 793193 797745 - 52 3710

TGME49_chr ID=TGME49_219682;length=21

XII 736911 739042 - 31 3711

TGME49_chr 176429 ID=TGME49_286595;length=39

V 2 1768196 + 04 3712

TGME49_chr 267854 ID=TGME49_243500;length=23

VI 4 2680887 + 43 3713

TGME49_chr 294441 ID=TGME49_244000;length= 18

VI 4 2946280 + 66 3714

TGME49_chr 325328 ID=TGME49_244430;length=20

VI 7 3255288 + 01 3715

TGME49_chr 620559 ID=TGME49_214280;length=47

X 5 6210365 + 70 3716

TGME49_chr 403728 ID=TGME49_291005 ;length=43

IX 6 4041660 - 74 3717

TGME49_chr 458528 ID=TGME49_292160;length= 16

IX 4 4586943 - 59 3718

TGME49_chr 108836 ID=TGME49_294720;length=49

la 1 1093316 + 55 3719

TGME49_chr 280589 ID=TGME49_233810;length=25

VIII 8 2808434 + 36 3720

TGME49_chr ID=TGME49_213115 ;length= 13

V 860119 861470 + 51 3721

TGME49_chr 156022 1563486 + ID=TGME49_295720;length=32 3722 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO: la 1 65

TGME49_chr 266713 ID=TGME49_317720;length=24

IV 6 2669610 + 74 3723

TGME49_chr 165534 ID=TGME49_241155;length=36

VI 1 1658987 + 46 3724

TGME49_chr ID=TGME49_263630;length=35

Vllb 430148 433668 - 20 3725

TGME49_chr 467769 ID=TGME49_292300;length= 19

IX 7 4679657 - 60 3726

TGME49_chr 353173 ID=TGME49_223845 ;length=31

X 1 3534835 + 04 3727

TGME49_chr 655214 ID=TGME49_214860;length=71

X 4 6552859 + 5 3728

TGME49_chr 192863 ID=TGME49_232190;length=46

VIII 5 1933235 + 00 3729

TGME49_chr 496384 ID=TGME49_255245 ;length= 17

Vllb 8 4965565 + 17 3730

TGME49_chr 227226 ID=TGME49_311765;length=42

XI 7 2276540 - 73 3731

TGME49_chr 320638 ID=TGME49_313235;length=44

XI 6 3210859 - 73 3732

TGME49_chr 114915 ID=TGME49_230970;length=36

VIII 9 1152853 + 94 3733

TGME49_chr ID=TGME49_221570;length=47

II 447679 452475 - 96 3734

TGME49_chr ID=TGME49_307760;length=23

XII 386080 388449 - 69 3735

TGME49_chr ID=TGME49_230200;length=22

VIII 676375 678638 + 63 3736

TGME49_chr 318052 ID=TGME49_247220;length=20

XII 7 3182528 - 01 3737

TGME49_chr 594567 ID=TGME49_269270;length=46

VIII 6 5950322 + 46 3738

TGME49_chr ID=TGME49_229900;length=23

VIII 451670 454022 + 52 3739

TGME49_chr 263270 ID=TGME49_246060;length= 12

XII 3 2633950 + 47 3740

TGME49_chr 293014 ID=TGME49_312820;length= 19

XI 7 2932129 - 82 3741

TGME49_chr 459067 ID=TGME49_292180;length=31

IX 8 4593804 + 26 3742

TGME49_chr 265078 ID=TGME49_284630;length=36

V 4 2654467 - 83 3743

TGME49_chr 202414 ID=TGME49_225810;length=37

X 8 2027899 + 51 3744

TGME49_chr 422618 ID=TGME49_281630;length=28

Vila 4 4229009 + 25 3745 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 496061 ID=TGME49_249960;length=43

XII 8 4964965 - 47 3746

TGME49_chr 113463 ID=TGME49_219100;length=20

XII 5 1136707 - 72 3747

TGME49_chr ID=TGME49_230205 ;length=23

VIII 684588 686890 - 02 3748

TGME49_chr 120944 ID=TGME49_318590;length=l 1

IV 1 1210630 - 89 3749

TGME49_chr 269244 ID=TGME49_243520;length=81

VI 9 2693266 + 7 3750

TGME49_chr 183828 ID=TGME49_286420;length=28

V 2 1841112 - 30 3751

TGME49_chr ID=TGME49_230610;length=20

VIII 920039 922135 - 96 3752

TGME49_chr 269803 ID=TGME49_224905 ;length=38

X 3 2701877 - 44 3753

TGME49_chr 211457 ID=TGME49_311485;length=50

XI 5 2119619 + 44 3754

TGME49_chr 140128 ID=TGME49_218580;length= 18

XII 5 1403127 + 42 3755

TGME49_chr 487413 ID=TGME49_255370;length=49

Vllb 8 4879043 - 05 3756

TGME49_chr 433411 ID=TGME49_271860;length=21

VIII 0 4336235 - 25 3757

TGME49_chr 195586 ID=TGME49_232220;length=23

VIII 5 1958248 + 83 3758

TGME49_chr 405823 ID=TGME49_281400;length=28

Vila 1 4061062 + 31 3759

TGME49_chr 560574 ID=TGME49_251830;length=26

XII 8 5608371 - 23 3760

TGME49_chr 714518 ID=TGME49_215750;length= 17

X 9 7146960 + 71 3761

TGME49_chr 530427 ID=TGME49_305230;length= 16

IX 1 5305970 - 99 3762

TGME49_chr 170989 ID=TGME49_321600;length=35

lb 5 1713439 + 44 3763

TGME49_chr 566781 ID=TGME49_251920;length=20

XII 9 5669852 - 33 3764

TGME49_chr 103294 ID=TGME49_310050;length=50

XI 5 1037980 - 35 3765

TGME49_chr 510653 ID=TGME49_315958 ;length= 17

XI 7 5108324 - 87 3766

TGME49_chr 111776 ID=TGME49_262630;length=32

Vllb 6 1121043 - 77 3767

TGME49_chr ID=TGME49_208550;length=26

lb 772266 774883 + 17 3768

TGME49_chr 627070 629071 - ID=TGME49_208350;length=20 3769 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO: lb 01

TGME49_chr 631966 ID=TGME49_214500;length= 15

X 0 6321243 + 83 3770

TGME49_chr 219263 ID=TGME49_260520;length=38

Vllb 9 2196472 - 33 3771

TGME49_chr ID=TGME49_239755;length=28

VI 898593 901403 - 10 3772

TGME49_chr 416864 ID=TGME49_314530;length=45

XI 2 4173147 + 05 3773

TGME49_chr ID=TGME49_293050;length=25

la 111718 114297 - 79 3774

TGME49_chr 278369 ID=TGME49_246450;length=22

XII 1 2785982 - 91 3775

TGME49_chr 128813 ID=TGME49_310350;length=47

XI 3 1292845 - 12 3776

TGME49_chr ID=TGME49_300200;length=25

XII 162730 165254 - 24 3777

TGME49_chr 357596 ID=TGME49_290000;length=31

IX 0 3579143 - 83 3778

TGME49_chr 335648 ID=TGME49_247470;length=30

XII 5 3359532 + 47 3779

TGME49_chr 211244 ID=TGME49_232460;length= 12

VIII 8 2113740 + 92 3780

TGME49_chr 352033 ID=TGME49_202210;length=24

Vila 1 3522747 - 16 3781

TGME49_chr 180750 ID=TGME49_261250;length=42

Vllb 7 1811746 - 39 3782

TGME49_chr 132648 ID=TGME49_262170;length=45

Vllb 6 1331064 - 78 3783

TGME49_chr 474298 ID=TGME49_249702;length=24

XII 2 4745450 + 68 3784

TGME49_chr 269976 ID=TGME49_284580;length=20

V 7 2701801 + 34 3785

TGME49_chr 271680 ID=TGME49_312480;length= 18

XI 5 2718646 + 41 3786

TGME49_chr ID=TGME49_267110;length=29

IX 772823 775776 - 53 3787

TGME49_chr 168355 ID=TGME49_241240;length=37

VI 1 1687309 + 58 3788

TGME49_chr 170886 ID=TGME49_211440;length= 19

IV 2 1710770 + 08 3789

TGME49_chr 234889 ID=TGME49_260325 ;length= 15

Vllb 2 2350447 + 55 3790

TGME49_chr 101235 ID=TGME49_227335;length=32

X 9 1015635 + 76 3791

TGME49_chr 660326 ID=TGME49_214930;length=31

X 0 6606386 - 26 3792 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr ID=TGME49_229780;length=20

VIII 434978 437027 + 49 3793

TGME49_chr 153636 ID=TGME49_231780;length=24

VIII 1 1538858 - 97 3794

TGME49_chr ID=TGME49_239752;length=21

VI 890233 892412 + 79 3795

TGME49_chr ID=TGME49_227840;length=37

X 775064 778810 + 46 3796

TGME49_chr 512857 ID=TGME49_270620;length=47

VIII 6 5133324 - 48 3797

TGME49_chr 169635 ID=TGME49_286740;length=34

V 2 1699779 + 27 3798

TGME49_chr 457016 ID=TGME49_292130;length= 13

IX 7 4571563 - 96 3799

TGME49_chr 150699 ID=TGME49_209870;length= 16

lb 7 1508664 - 67 3800

TGME49_chr 588868 ID=TGME49_216785 ;length= 17

XI 2 5890417 - 35 3801

TGME49_chr ID=TGME49_306870;length=33

XI 216523 219848 + 25 3802

TGME49_chr 374595 ID=TGME49_201850;length=45

Vila 1 3750483 - 32 3803

TGME49_chr 591184 ID=TGME49_278740;length=48

XII 6 5916676 - 30 3804

TGME49_chr 302542 ID=TGME49_259020;length= 14

Vllb 3 3026890 - 67 3805

TGME49_chr 147565 ID=TGME49_310610;length=44

XI 4 1480111 - 57 3806

TGME49_chr 576082 ID=TGME49_278910;length=38

XII 1 5764702 + 81 3807

TGME49_chr 516306 ID=TGME49_305010;length=25

IX 3 5165571 - 08 3808

TGME49_chr ID=TGME49_294340;length=36

la 808274 811964 + 90 3809

TGME49_chr ID=TGME49_307810;length=47

XII 425609 430368 - 59 3810

TGME49_chr ID=TGME49_279430;length=20

IX 189607 191648 - 41 3811

TGME49_chr 669940 ID=TGME49_277240;length=46

XII 6 6704060 - 54 3812

TGME49_chr 386211 ID=TGME49_223390;length=43

X 6 3866423 + 07 3813

TGME49_chr ID=TGME49_207620;length= 16

lb 163682 165355 - 73 3814

TGME49_chr ID=TGME49_263330;length=42

Vllb 628606 632828 - 22 3815

TGME49_chr 427667 4280983 - ID=TGME49_271935 ;length=43 3816 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

VIII 6 07

TGME49_chr 115182 ID=TGME49_213580;length= 16

V 6 1153503 + 77 3817

TGME49_chr 359734 ID=TGME49_290030;length=30

IX 4 3600415 - 71 3818

TGME49_chr 120794 ID=TGME49_205700;length= 17

Vila 0 1209729 + 89 3819

TGME49_chr 171609 ID=TGME49_218192;length=20

XII 1 1718180 - 89 3820

TGME49_chr 107220 ID=TGME49_240220;length=27

VI 8 1074913 + 05 3821

TGME49_chr 334489 ID=TGME49_289615 ;length= 19

IX 8 3346813 + 15 3822

TGME49_chr ID=TGME49_300130;length=43

XII 193234 197614 - 80 3823

TGME49_chr 567874 ID=TGME49_305930;length= 19

IX 2 5680658 - 16 3824

TGME49_chr ID=TGME49_293700;length=28

la 511367 514250 + 83 3825

TGME49_chr 323464 ID=TGME49_313280;length= 18

XI 4 3236534 + 90 3826

TGME49_chr 134288 ID=TGME49_295035;length=29

la 5 1345816 - 31 3827

TGME49_chr ID=TGME49_267660;length=29

IX 476754 479709 + 55 3828

TGME49_chr 545486 ID=TGME49_251550;length=24

XII 0 5457281 - 21 3829

TGME49_chr 655214 ID=TGME49_214870;length=47

X 4 6556896 + 52 3830

TGME49_chr 318034 ID=TGME49_202590;length=37

Vila 8 3184059 + 11 3831

TGME49_chr 528088 ID=TGME49_305190;length=27

IX 3 5283665 - 82 3832

TGME49_chr 370180 ID=TGME49_223610;length=50

X 7 3706821 - 14 3833

TGME49_chr 474291 ID=TGME49_271070;length= 15

VIII 8 4744434 - 16 3834

TGME49_chr 632925 ID=TGME49_214510;length= 15

X 7 6330831 - 74 3835

TGME49_chr 413527 ID=TGME49_257300;length=25

Vllb 7 4137872 + 95 3836

TGME49_chr 329660 ID=TGME49_258630;length=21

Vllb 4 3298791 - 87 3837

TGME49_chr 259757 ID=TGME49_233330;length=18

VIII 9 2599382 - 03 3838

TGME49_chr ID=TGME49_228140;length= 12

X 551024 552313 + 89 3839 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 461473 ID=TGME49_292220;length=24

IX 3 4617149 + 16 3840

TGME49_chr ID=TGME49_320720;length=48

IV 110065 114937 + 72 3841

TGME49_chr 338310 ID=TGME49_289690;length= 17

IX 2 3384808 - 06 3842

TGME49_chr 316684 ID=TGME49_313200;length=21

XI 9 3168996 + 47 3843

TGME49_chr 165829 ID=TGME49_231940;length= 18

VIII 3 1660173 + 80 3844

TGME49_chr ID=TGME49_309600;length=21

XI 749160 751293 + 33 3845

TGME49_chr 534057 ID=TGME49_236220;length= 18

X 1 5342425 - 54 3846

TGME49_chr 410988 ID=TGME49_291080;length=42

IX 1 4114158 - 77 3847

TGME49_chr ID=TGME49_263410;length= 16

Vllb 579048 580735 - 87 3848

TGME49_chr 135475 ID=TGME49_209585 ;length=20

lb 8 1356812 - 54 3849

TGME49_chr ID=TGME49_221180;length=22

II 114763 116965 - 02 3850

TGME49_chr 745735 ID=TGME49_207240;length=26

X 2 7459980 + 28 3851

TGME49_chr 362216 ID=TGME49_313700;length= 19

XI 5 3624128 + 63 3852

TGME49_chr 459491 ID=TGME49_315240;length= 15

XI 1 4596494 + 83 3853

TGME49_chr 366787 ID=TGME49_202050;length=24

Vila 0 3670309 - 39 3854

TGME49_chr 375097 ID=TGME49_272760;length=27

VIII 7 3753691 - 14 3855

TGME49_chr 273192 ID=TGME49_284540;length=30

V 8 2734972 - 44 3856

TGME49_chr 531295 ID=TGME49_305250;length=24

IX 5 5315417 + 62 3857

TGME49_chr 124218 ID=TGME49_231120;length=47

VIII 4 1246962 + 78 3858

TGME49_chr 127056 ID=TGME49_222870;length=20

II 1 1272652 + 91 3859

TGME49_chr 139458 ID=TGME49_295100;length=40

la 1 1398592 + 11 3860

TGME49_chr ID=TGME49_221922;length= 19

II 695109 697038 + 29 3861

TGME49_chr ID=TGME49_320588 ;length=26

IV 231725 234333 + 08 3862

TGME49_chr 473177 474441 + ID=TGME49_307980;length=12 3863 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XII 64

TGME49_chr 384933 ID=TGME49_257755;length=28

Vllb 7 3852143 + 06 3864

TGME49_chr 166462 ID=TGME49_205040;length=21

Vila 8 1666761 - 33 3865

TGME49_chr 368595 ID=TGME49_257970;length=38

Vllb 0 3689788 + 38 3866

TGME49_chr 192152 ID=TGME49_225930;length=43

X 4 1925824 + 00 3867

TGME49_chr 730268 ID=TGME49_275480;length=22

X 3 7304952 - 69 3868

TGME49_chr 271022 ID=TGME49_288730;length=25

IX 0 2712817 + 97 3869

TGME49_chr 355191 ID=TGME49_258150;length=29

Vllb 6 3554846 - 30 3870

TGME49_chr 339540 ID=TGME49_247530;length=23

XII 4 3397801 - 97 3871

TGME49_chr 203631 ID=TGME49_217710;length=49

XII 8 2041237 - 19 3872

TGME49_chr 660108 ID=TGME49_268260;length=18

VIII 5 6602909 - 24 3873

TGME49_chr 247751 ID=TGME49_260180;length= 19

Vllb 1 2479458 - 47 3874

TGME49_chr ID=TGME49_219580;length= 11

XII 847627 848796 + 69 3875

TGME49_chr 524532 ID=TGME49_316340;length=26

XI 9 5247973 + 44 3876

TGME49_chr 159595 ID=TGME49_231865 ;length= 14

VIII 2 1597448 - 96 3877

TGME49_chr 488239 ID=TGME49_235460;length=29

X 9 4885373 - 74 3878

TGME49_chr 151040 ID=TGME49_218470;length=33

XII 0 1513768 + 68 3879

TGME49_chr 127537 ID=TGME49_218820;length=35

XII 7 1278904 + 27 3880

TGME49_chr 432186 ID=TGME49_291670;length= 17

IX 9 4323613 - 44 3881

TGME49_chr 160060 ID=TGME49_265370;length=30

IX 3 1603702 + 99 3882

TGME49_chr 216621 ID=TGME49_217830;length= 17

XII 5 2167918 - 03 3883

TGME49_chr 169429 ID=TGME49_211460;length=28

IV 2 1697186 + 94 3884

TGME49_chr 688460 ID=TGME49_215300;length= 14

X 4 6886087 + 83 3885

TGME49_chr 403064 ID=TGME49_314350;length=50

XI 4 4031150 - 6 3886 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 571368 ID=TGME49_217030;length= 16

XI 5 5715292 - 07 3887

TGME49_chr 318754 ID=TGME49_202580;length=32

Vila 8 3190806 + 58 3888

TGME49_chr 147951 ID=TGME49_265650;length=42

IX 6 1483804 + 88 3889

TGME49_chr 110804 ID=TGME49_219140;length= 16

XII 2 1109700 + 58 3890

TGME49_chr ID=TGME49_293180;length=46

la 144640 149293 - 53 3891

TGME49_chr 340277 ID=TGME49_202350;length=17

Vila 1 3404549 + 78 3892

TGME49_chr 110712 ID=TGME49_213500;length= 19

V 1 1109027 + 06 3893

TGME49_chr 165286 ID=TGME49_286800;length=26

V 5 1655496 + 31 3894

TGME49_chr 673203 ID=TGME49_215070;length= 18

X 5 6733904 - 69 3895

TGME49_chr 333194 ID=TGME49_273650;length=49

VIII 3 3336912 + 69 3896

TGME49_chr 675720 ID=TGME49_277055;length=70

XII 4 6757912 - 8 3897

TGME49_chr 408674 ID=TGME49_314415 ;length=23

XI 5 4089078 - 33 3898

TGME49_chr ID=TGME49_264020;length=33

Vllb 228036 231382 + 46 3899

TGME49_chr 422742 ID=TGME49_281620;length=27

Vila 7 4230194 - 67 3900

TGME49_chr 187480 ID=TGME49_286260;length=43

V 7 1879170 + 63 3901

TGME49_chr 302787 ID=TGME49_244110;length= 15

VI 4 3029432 + 58 3902

TGME49_chr 170960 ID=TGME49_211450;length= 19

IV 1 1711557 - 56 3903

TGME49_chr 222086 ID=TGME49_217915 ;length=41

XII 1 2225036 + 75 3904

TGME49_chr ID=TGME49_309820;length=18

XI 828620 830503 + 83 3905

TGME49_chr 519253 ID=TGME49_270550;length=26

VIII 9 5195213 + 74 3906

TGME49_chr 692762 ID=TGME49_215370;length= 15

X 9 6929143 + 14 3907

TGME49_chr 522438 ID=TGME49_316300;length=23

XI 4 5226744 + 60 3908

TGME49_chr 295920 ID=TGME49_244010;length=40

VI 3 2963259 - 56 3909

TGME49_chr 577465 5776514 + ID=TGME49_216950;length= 18 3910 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XI 2 62

TGME49_chr 180441 ID=TGME49_295360;length=40

la 1 1808455 + 44 3911

TGME49_chr 102118 ID=TGME49_240050;length=26

VI 8 1023814 + 26 3912

TGME49_chr ID=TGME49_293240;length=22

la 184672 186967 + 95 3913

TGME49_chr 157855 ID=TGME49_261690;length=20

Vllb 3 1580579 - 26 3914

TGME49_chr 155785 ID=TGME49_310740;length=46

XI 1 1562530 + 79 3915

TGME49_chr 331481 ID=TGME49_224130;length=48

X 5 3319616 - 01 3916

TGME49_chr 456365 ID=TGME49_249480;length=47

XII 7 4568429 + 72 3917

TGME49_chr 103246 ID=TGME49_209090;length=23

lb 6 1034841 + 75 3918

TGME49_chr 253415 ID=TGME49_243298;length=36

VI 5 2537835 - 80 3919

TGME49_chr 359751 ID=TGME49_223770;length=27

X 1 3600273 + 62 3920

TGME49_chr 267820 ID=TGME49_246120;length=42

XII 6 2682410 - 04 3921

TGME49_chr 481857 ID=TGME49_210400;length=90

IX 8 4819481 - 3 3922

TGME49_chr 231652 ID=TGME49_311840;length=48

XI 3 2321340 - 17 3923

TGME49_chr 242008 ID=TGME49_301370;length= 12

IV 3 2421297 - 14 3924

TGME49_chr 182218 ID=TGME49_211340;length=39

IV 2 1826088 - 06 3925

TGME49_chr 376447 ID=TGME49_248260;length=20

XII 5 3766489 + 14 3926

TGME49_chr 120766 ID=TGME49_231030;length=23

VIII 9 1210044 - 75 3927

TGME49_chr 686854 ID=TGME49_215280;length=30

X 9 6871573 + 24 3928

TGME49_chr 164372 ID=TGME49_261590;length=20

Vllb 9 1645825 + 96 3929

TGME49_chr 162485 ID=TGME49_310840;length=96

XI 2 1625815 - 3 3930

TGME49_chr ID=TGME49_300030;length=25

XII 269284 271834 - 50 3931

TGME49_chr 391432 ID=TGME49_290890;length=48

IX 3 3919217 + 94 3932

TGME49_chr 349883 ID=TGME49_273460;length=21

VIII 5 3500944 - 09 3933 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 326747 ID=TGME49_313322;length=38

XI 4 3271294 + 20 3934

TGME49_chr 113530 ID=TGME49_227020;length=39

X 3 1139273 - 70 3935

TGME49_chr 200044 ID=TGME49_204100;length=49

Vila 0 2005367 - 27 3936

TGME49_chr 213674 ID=TGME49_225590;length=48

X 5 2141573 - 28 3937

TGME49_chr ID=TGME49_293670;length=22

la 497577 499864 + 87 3938

TGME49_chr 273046 ID=TGME49_288770;length=20

IX 5 2732515 - 50 3939

TGME49_chr 609564 ID=TGME49_216510;length=21

XI 5 6097770 + 25 3940

TGME49_chr 611063 ID=TGME49_269050;length=33

VIII 7 6114023 - 86 3941

TGME49_chr 229630 ID=TGME49_285660;length=47

V 5 2301022 + 17 3942

TGME49_chr 159348 ID=TGME49_209970;length=23

lb 1 1595867 + 86 3943

TGME49_chr ID=TGME49_262933;length=24

Vllb 917523 919993 - 70 3944

TGME49_chr 356709 ID=TGME49_202170;length=l 1

Vila 4 3568251 - 57 3945

TGME49_chr 130226 ID=TGME49_240520;length= 17

VI 2 1303984 + 22 3946

TGME49_chr 637644 ID=TGME49_216150;length=50

XI 5 6376951 + 6 3947

TGME49_chr 138751 ID=TGME49_318370;length=28

IV 2 1390320 - 08 3948

TGME49_chr 124868 ID=TGME49_231125 ;length= 16

VIII 4 1250326 + 42 3949

TGME49_chr 180540 ID=TGME49_265080;length= 18

IX 0 1807228 - 28 3950

TGME49_chr 367751 ID=TGME49_290280;length=70

IX 9 3678219 + 0 3951

TGME49_chr 102044 ID=TGME49_222300;length=47

II 9 1025166 - 17 3952

TGME49_chr ID=TGME49_227560;length=39

X 912303 916274 + 71 3953

TGME49_chr 569910 ID=TGME49_217050;length=24

XI 2 5701538 + 36 3954

TGME49_chr ID=TGME49_279380;length=34

IX 211482 214977 + 95 3955

TGME49_chr 596021 ID=TGME49_269260;length=29

VIII 9 5963166 - 47 3956

TGME49_chr 134103 1345114 - ID=TGME49_218740;length=40 3957 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

XII 6 78

TGME49_chr 598938 ID=TGME49_237560;length=17

X 0 5991089 - 09 3958

TGME49_chr ID=TGME49_320120;length= 17

IV 481795 483580 + 85 3959

TGME49_chr 359801 ID=TGME49_273320;length=28

VIII 0 3600888 - 78 3960

TGME49_chr 306593 ID=TGME49_289190;length=22

IX 7 3068225 - 88 3961

TGME49_chr 187392 ID=TGME49_225990;length=25

X 9 1876445 + 16 3962

TGME49_chr 388920 ID=TGME49_248500;length=21

XII 5 3891343 + 38 3963

TGME49_chr 344546 ID=TGME49_247620;length=98

XII 0 3446440 + 0 3964

TGME49_chr ID=TGME49_227948 ;length=24

X 713964 716364 + 00 3965

TGME49_chr 742356 ID=TGME49_207170;length= 17

X 2 7425356 + 94 3966

TGME49_chr 171615 ID=TGME49_286710;length=24

V 6 1718652 - 96 3967

TGME49_chr 123809 ID=TGME49_213680;length=20

V 4 1240156 - 62 3968

TGME49_chr 113203 ID=TGME49_209230;length=29

lb 6 1134997 + 61 3969

TGME49_chr 589285 ID=TGME49_237460;length=27

X 0 5895594 - 44 3970

TGME49_chr 163579 ID=TGME49_265280;length=30

IX 2 1638798 - 06 3971

TGME49_chr 219363 ID=TGME49_298020;length=44

II 2 2198046 + 14 3972

TGME49_chr 458430 ID=TGME49_255970;length=38

Vllb 5 4588201 + 96 3973

TGME49_chr 441386 ID=TGME49_249260;length=43

XII 9 4418183 - 14 3974

TGME49_chr 137529 ID=TGME49_218610;length= 19

XII 5 1377205 - 10 3975

TGME49_chr 215973 ID=TGME49_297990;length=21

II 5 2161907 - 72 3976

TGME49_chr 339001 ID=TGME49_244600;length=25

VI 2 3392557 - 45 3977

TGME49_chr 237967 ID=TGME49_288190;length=47

IX 5 2384376 - 01 3978

TGME49_chr 572125 ID=TGME49_306010;length=20

IX 3 5723275 + 22 3979

TGME49_chr 158899 ID=TGME49_310790;length=39

XI 5 1592976 + 81 3980 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 152694 ID=TGME49_297090;length=34

II 2 1530423 - 81 3981

TGME49_chr ID=TGME49_320592;length=21

IV 212060 214189 + 29 3982

TGME49_chr ID=TGME49_267875;length=30

IX 302614 305702 - 88 3983

TGME49_chr 696419 ID=TGME49_215440;length=45

X 1 6968709 - 18 3984

TGME49_chr 313837 ID=TGME49_313140;length=20

XI 2 3140384 - 12 3985

TGME49_chr ID=TGME49_263830;length= 13

Vllb 305794 307179 - 85 3986

TGME49_chr ID=TGME49_300048 ;length=34

XII 249631 253074 - 43 3987

TGME49_chr 465347 ID=TGME49_255890;length=21

Vllb 9 4655650 - 71 3988

TGME49_chr 187335 ID=TGME49_261075 ;length=20

Vllb 0 1875365 + 15 3989

TGME49_chr 530604 ID=TGME49_236158 ;length=23

X 5 5308393 + 48 3990

TGME49_chr ID=TGME49_294285 ;length= 13

la 780454 781800 + 46 3991

TGME49_chr 294635 ID=TGME49_202910;length= 14

Vila 9 2947823 + 64 3992

TGME49_chr 385736 ID=TGME49_248445 ;length=46

XII 7 3862038 - 71 3993

TGME49_chr 145031 ID=TGME49_209790;length= 18

lb 4 1452140 + 26 3994

TGME49_chr 208265 ID=TGME49_217740;length=33

XII 8 2086018 + 60 3995

TGME49_chr 302835 ID=TGME49_244100;length=25

VI 3 3030855 - 02 3996

TGME49_chr 297074 ID=TGME49_312905 ;length=25

XI 4 2973324 + 80 3997

TGME49_chr 121014 ID=TGME49_209420;length=l 1

lb 3 1211342 - 99 3998

TGME49_chr ID=TGME49_229390;length=24

VIII 225120 227610 + 90 3999

TGME49_chr 235486 ID=TGME49_233030;length=31

VIII 0 2357994 - 34 4000

TGME49_chr 325941 ID=TGME49_273790;length=20

VIII 8 3261475 + 57 4001

TGME49_chr 175227 ID=TGME49_226260;length=37

X 2 1756039 + 67 4002

TGME49_chr 592322 ID=TGME49_216740;length=35

XI 8 5926772 + 44 4003

TGME49_chr 413170 4133633 + ID=TGME49_257310;length= 19 4004 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

Vllb 2 31

TGME49_chr 116948 ID=TGME49_205750;length=37

Vila 5 1173218 + 33 4005

TGME49_chr ID=TGME49_263990;length=39

Vllb 255202 259201 - 99 4006

TGME49_chr 123051 ID=TGME49_318560;length=25

IV 0 1233030 - 20 4007

TGME49_chr 161871 ID=TGME49_261650;length=27

Vllb 7 1621489 - 72 4008

TGME49_chr 315111 ID=TGME49_283450;length=30

V 1 3154171 + 60 4009

TGME49_chr 574599 ID=TGME49_269600;length=20

VIII 3 5747995 + 02 4010

TGME49_chr 171966 ID=TGME49_261480;length= 11

Vllb 7 1720796 - 29 4011

TGME49_chr ID=TGME49_280375;length=85

Vila 489530 490382 + 2 4012

TGME49_chr 314167 ID=TGME49_273970;length=39

VIII 5 3145631 - 56 4013

TGME49_chr 551687 ID=TGME49_316710;length=20

XI 3 5518970 + 97 4014

TGME49_chr 150832 ID=TGME49_295850;length=39

la 3 1512231 - 08 4015

TGME49_chr 433872 ID=TGME49_234400;length=45

X 1 4343260 - 39 4016

TGME49_chr ID=TGME49_228430;length=37

X 309061 312804 + 43 4017

TGME49_chr ID=TGME49_207365 ;length=38

lb 8721 12586 - 65 4018

TGME49_chr 625195 ID=TGME49_214340;length=26

X 0 6254576 + 26 4019

TGME49_chr ID=TGME49_230380;length=30

VIII 739200 742279 - 79 4020

TGME49_chr 157027 ID=TGME49_226500;length=40

X 8 1574363 + 85 4021

TGME49_chr ID=TGME49_227990;length= 14

X 681974 683378 + 04 4022

TGME49_chr 646760 ID=TGME49_216010;length= 13

XI 4 6468954 + 50 4023

TGME49_chr 393362 ID=TGME49_201250;length=50

Vila 7 3938631 + 04 4024

TGME49_chr 300757 ID=TGME49_202830;length=29

Vila 4 3010494 - 20 4025

TGME49_chr 325680 ID=TGME49_224200;length= 15

X 0 3258378 - 78 4026

TGME49_chr 463918 ID=TGME49_315300;length=23

XI 3 4641514 - 31 4027 starting

position end

on position on Endogenous promoter of gene

Chromosome chromo chromoso stra name (ID), and length of SEQ ID some me nd promoter sequence NO:

TGME49_chr 194745 ID=TGME49_242300;length=23

VI 6 1949773 + 17 4028

TGME49_chr 472634 ID=TGME49_249680;length=25

XII 2 4728931 + 89 4029

TGME49_chr 248861 ID=TGME49_260170;length= 13

Vllb 7 2490012 - 95 4030

Table 1. Provided are non-limiting examples of endogenous promoters which can be used along with the construct(s) of some embodiments of the invention. The predicted promoter sequences were obtained based on histone marks data around the genes.

Table 2 lists suitable constitutive and inducible promoters which can be cloned into the nucleic acid construct of some embodiments of the invention.

Table 2, constitutive and inducible promoters

Table 2.

As mentioned, the heterologous polynucleotide comprising a first nucleic acid sequence encoding a Toxoplasma secreted protein.

As used herein the phrase "Toxoplasma secreted protein " refers to at least a functional fragment (amino acid sequence) of a Toxoplasma polypeptide which is sufficient in order to be secreted to a host cell when infected in-vitro by the Toxoplasma.

According to some embodiments of the invention, the Toxoplasma secreted protein or the functional fragment thereof is capable of being secreted to a host cell when infected in-vivo by the Toxoplasma.

According to some embodiments of the invention, the first nucleic acid sequence encoding a functional fragment of the Toxoplasma secreted protein. According to some embodiments of the invention, the first nucleic acid sequence encoding the full-length open reading frame (ORF) of the Toxoplasma secreted protein.

According to some embodiments of the invention, the Toxoplasma secreted protein is secreted from a rhoptry of the Toxoplasma.

According to some embodiments of the invention, the Toxoplasma secreted protein which is secreted from the rhoptry comprises Toxofilin and/or ROP1.

According to some embodiments of the invention, the Toxoplasma secreted protein which is secreted from the rhoptry comprises the amino acid sequence selected from the group consisting of SEQ ID NOs:344-465.

According to some embodiments of the invention, the Toxoplasma secreted protein is a non-rhoptry protein.

According to some embodiments of the invention, the Toxoplasma secreted protein is selected from the group consisting of a microneme protein and a dense granule protein.

According to some embodiments of the invention, the Toxoplasma secreted protein is secreted from a dense granule of the Toxoplasma.

According to some embodiments of the invention, the Toxoplasma secreted protein which is secreted from the dense granule comprises a GRA16, and/or GRA24.

According to some embodiments of the invention, the Toxoplasma secreted protein is secreted from a microneme of the Toxoplasma.

According to some embodiments of the invention, the protein secreted from the microneme comprises the amino acid sequence selected from the group consisting of SEQ ID NOs:280-322.

According to some embodiments of the invention, the protein secreted from the dense granule comprises the amino acid sequence selected from the group consisting of SEQ ID NOs:234-279.

According to some embodiments of the invention, the Toxoplasma secreted protein comprises Toxoplasma gondii macrophage migration inhibitory factor (TgMIF).

Table 3 below lists Toxoplasma-secreted proteins which can be cloned into the nucleic acid construct of some embodiments of the invention, organized according to the organelle to which the protein is localized or predicted to be localized to. Table 3, Toxoplasma proteins

Polyn. Polyp.

Gene ID Gene Product Description Organism organelle SEQ SEQ

ID ID

TGME49_ T. gondii Dense

cathepsin CPC1 (CPC1) 1 234 289620 ME49 Granules

TGME49_ T. gondii Dense

cathepsin CPC2 (CPC2) 2 235 276130 ME49 Granules

TGME49_ dense granular protein GRA10 T. gondii Dense

3 236 268900 (GRA10) ME49 Granules

TGME49_ T. gondii Dense

dense granule protein DG32 4 237 297880 ME49 Granules

TGME49_ dense granule protein GRAl T. gondii Dense

5 238 270250 (GRAl) ME49 Granules

TGME49_ T. gondii Dense

dense granule protein GRAl 1 6 239 212410 ME49 Granules

TGME49_ T. gondii Dense

dense granule protein GRAl 1 7 240 237800 ME49 Granules

TGME49_ dense granule protein GRAl 2 T. gondii Dense

8 241 275850 (GRAl 2) ME49 Granules

TGME49_ dense granule protein GRAl 2 T. gondii Dense

9 242 288650 (GRAl 2) ME49 Granules

TGME49_ dense granule protein GRAl 4 T. gondii Dense

10 243 239740 (GRAl 4) ME49 Granules

TGME49_ dense granule protein GRAl 5 T. gondii Dense

11 244 275470 (GRAl 5) ME49 Granules

TGME49_ dense granule protein GRA2 T. gondii Dense

12 245 227620 (GRA2) ME49 Granules

TGME49_ dense granule protein GRA3 T. gondii Dense

13 246 227280 (GRA3) ME49 Granules

TGME49_ dense granule protein GRA4 T. gondii Dense

14 247 310780 (GRA4) ME49 Granules

TGME49_ dense granule protein GRA5 T. gondii Dense

15 248 286450 (GRA5) ME49 Granules

TGME49_ dense granule protein GRA6 T. gondii Dense

16 249 275440 (GRA6) ME49 Granules

TGME49_ dense granule protein GRA7 T. gondii Dense

17 250 203310 (GRA7) ME49 Granules

TGME49_ dense granule protein GRA8 T. gondii Dense

18 251 254720 (GRA8) ME49 Granules

TGME49_ dense granule protein GRA9 T. gondii Dense

19 252 251540 (GRA9) ME49 Granules

TGME49_ T. gondii Dense

dense-granule antigen DG32 20 253 222170 ME49 Granules

TGME49_ T. gondii Dense

GRAl 6 21 254 208830 ME49 Granules

TGME49_ T. gondii Dense

GRA20 22 255 200010 ME49 Granules

TGME49_ T. gondii Dense

GRA21 23 256 241610 ME49 Granules Polyn. Polyp.

Gene ID Gene Product Description Organism organelle SEQ SEQ

ID ID

TGME49_ T. gondii Dense

GRA24 24 257 230180 ME49 Granules

TGME49_ T. gondii Dense

GRA25 25 258 290700 ME49 Granules

TGME49_ T. gondii Dense

GRA28 26 259 231960 ME49 Granules

TGME49_ T. gondii Dense

GRA29 27 260 269690 ME49 Granules

TGME49_ T. gondii Dense

GRA30 28 261 232000 ME49 Granules

TGME49_ T. gondii Dense

GRA31 29 262 220240 ME49 Granules

TGME49_ T. gondii Dense

GRA32 30 263 212300 ME49 Granules

TGME49_ T. gondii Dense

GRA33 31 264 247440 ME49 Granules

TGME49_ T. gondii Dense

GRA35 32 265 226380 ME49 Granules

TGME49_ T. gondii Dense

GRA36 33 266 213067 ME49 Granules

TGME49_ T. gondii Dense

GRA37 34 267 236890 ME49 Granules

TGME49_ T. gondii Dense

GRA38 35 268 312420 ME49 Granules

TGME49_ T. gondii Dense

GRA39 36 269 289380 ME49 Granules

TGME49_ T. gondii Dense

GRA40 37 270 219810 ME49 Granules

TGME49_ T. gondii Dense

hypothetical protein 38 271 200360 ME49 Granules

TGME49_ T. gondii Dense

hypothetical protein 39 272 203290 ME49 Granules

TGME49_ T. gondii Dense

hypothetical protein 40 273 310790 ME49 Granules

TGME49_ T. gondii Dense

NTPase I 41 274 277240 ME49 Granules

TGME49_ T. gondii Dense

NTPase II 42 275 277270 ME49 Granules

TGME49_ T. gondii Dense

protease inhibitor PIl (PIl) 43 276 217430 ME49 Granules

TGME49_ T. gondii Dense

protease inhibitor PI2 (PI2) 44 277 208450 ME49 Granules

TGGT1_4 putative dense granule protein T. gondii Dense

45 278 11330 GRA11 GT1 Granules

TGME49_ putative dense granule protein T. gondii Dense

46 279 275860 GRA12 VEG Granules

TGME49_ T. gondii Microne

ankyrin repeat-containing protein 47 280 261710 ME49 mes Polyn. Polyp.

Gene ID Gene Product Description Organism organelle SEQ SEQ

ID ID

TGME49_ apical membrane antigen 1 domain- T. gondii Microne

48 281 300130 containing protein ME49 mes

TGME49_ T. gondii Microne

apical membrane antigen AMA1 49 282 255260 ME49 mes

TGME49_ T. gondii Microne

chitinase-like protein CLP1 (CLP1) 50 283 293770 ME49 mes

TGME49_ T. gondii Microne

GRA34 51 284 204340 ME49 mes

TGME49_ T. gondii Microne

hypothetical protein 52 285 221480 ME49 mes

TGME49_ T. gondii Microne

MIC 18 53 286 289100 ME49 mes

TGME49_ T. gondii Microne

MIC 19 54 287 294790 ME49 mes

TGME49_ T. gondii Microne

MIC2-associated protein M2AP 55 288 214940 ME49 mes

TGME49_ T. gondii Microne

MIC20 56 289 283540 ME49 mes

TGME49_ T. gondii Microne

microneme protein MICl (MICl) 57 290 291890 ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC 10 (MIC 10) 58 291 250710 ME49 mes

TGME49_ T. gondii Microne

microneme protein MICl 1 (MICl 1) 59 292 204530 ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC 12 (MIC 12) 60 293 267680 ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC 13 (MIC 13) 61 294 260190 ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC 14 (MIC 14) 62 295 218310 ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC 15 (MIC 15) 63 296 247195 ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC 16 (MIC 16) 64 297 289630 ME49 mes

TGME49_ microneme protein MIC17A T. gondii Microne

65 298 200250 (MIC 17 A) ME49 mes

TGME49_ microneme protein MIC17B T. gondii Microne

66 299 200240 (MIC17B) ME49 mes

TGME49_ microneme protein MIC17C T. gondii Microne

67 300 200230 (MIC17C) ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC2 (MIC2) 68 301 201780 ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC3 (MIC3) 69 302 319560 ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC4 (MIC4) 70 303 208030 ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC5 (MIC5) 71 304 277080 ME49 mes Polyn. Polyp.

Gene ID Gene Product Description Organism organelle SEQ SEQ

ID ID

TGME49_ T. gondii Microne

microneme protein MIC6 (MIC6) 72 305 218520 ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC7 (MIC7) 73 306 261780 ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC8 (MIC8) 74 307 245490 ME49 mes

TGME49_ T. gondii Microne

microneme protein MIC9 (MIC9) 75 308 245485 ME49 mes

TGME49_ T. gondii Microne

microneme protein, putative 76 309 208730 ME49 mes

TGME49_ T. gondii Microne

microneme protein, putative 77 310 208740 ME49 mes

TGME49_ T. gondii Microne

microneme protein, putative 78 311 254430 ME49 mes

TGME49_ T. gondii Microne

microneme protein, putative 79 312 275798 ME49 mes

TGME49_ T. gondii Microne

microneme-like protein 80 313 244180 ME49 mes

TGME49_ T. gondii Microne

microneme-like protein 81 314 286740 ME49 mes

TGME49_ PAN/Apple domain-containing T. gondii Microne

82 315 200270 protein ME49 mes

TGME49_ PAN/Apple domain-containing T. gondii Microne

83 316 286150 protein ME49 mes

TGME49_ T. gondii Microne

perforin-like protein PLPl (PLPl) 84 317 204130 ME49 mes

TGVEG_4 T. gondii Microne

putative microneme protein 85 318 39620 VEG mes

TGME49_ T. gondii Microne

rhomboid protease ROMl (ROMl) 86 319 200290 ME49 mes

TGME49_ sporozoite protein with an altered T. gondii Microne

87 320 293900 thrombospondin repeat SPATR ME49 mes

TGME49_ T. gondii Microne

subtilisin SUB 1 (SUB 1) 88 321 204050 ME49 mes

TGME49_ T. gondii Microne

toxolysin TLN4 (TLN4) 89 322 206510 ME49 mes

TGME49_ T. gondii

actin ACT1 (ACT1) Other 90 323 209030 ME49

TGME49_ T. gondii

DJ-1 family protein Other 91 324 214290 ME49

TGME49_ T. gondii

enolase 2 Other 92 325 268850 ME49

TGME49_ T. gondii

fructose-1 ,6-bisphosphate aldolase Other 93 326 236040 ME49

glyceraldehyde-3-phosphate

TGME49_ T. gondii

dehydrogenase GAPDH1 Other 94 327 289690 ME49

(GAPDH1)

TGME49_ macrophage migration inhibitory T. gondii

Other 95 328 290040 factor, putative ME49 Polyn. Polyp.

Gene ID Gene Product Description Organism organelle SEQ SEQ

ID ID

TGME49_ T. gondii

protein disulfide isomerase Other 96 329 211680 ME49

TGME49_ T. gondii

Secretory phospholipase A2 Other 97 330 273610 ME49

TGME49_ T. gondii

Secretory phospholipase A2 Other 98 331 273620 GT1

TGME49_ T. gondii

Toxoplasma gondii family D protein Other 99 332 271570 ME49

TGME49_ translation elongation factor 2 T. gondii

Other 100 333 205470 family protein, putative ME49

TGME49_ tryptophanyl-tRNA synthetase T. gondii

Other 101 334 288360 (TrpRS2) ME49

TGME49_ vacuolar ATP synthase subunit T. gondii

Other 102 335 208590 54kD, putative ME49

TGME49_ vacuolar ATP synthase subunit A, T. gondii

Other 103 336 256970 putative ME49

TGME49_ vacuolar ATP synthase subunit b, T. gondii

Other 104 337 219800 putative ME49

TGME49_ vacuolar ATP synthase subunit C, T. gondii

Other 105 338 315620 putative ME49

TGME49_ vacuolar ATP synthase subunit d, T. gondii

Other 106 339 259010 putative ME49

TGME49_ vacuolar atp synthase subunit e, T. gondii

Other 107 340 305290 putative ME49

TGME49_ vacuolar atp synthase subunit f, T. gondii

Other 108 341 310960 putative ME49

TGME49_ vacuolar ATP synthase subunit g, T. gondii

Other 109 342 246560 putative ME49

TGME49_ T. gondii

vacuolar ATP synthetase Other 110 343 212310 ME49

TGME49_ bradyzoite rhoptry protein BRPl T. gondii

Rhoptries 111 344 314250 (BRPl) ME49

TGME49_ T. gondii

cAMP-dependent protein kinase Rhoptries 112 345 209985 ME49

TGME49_ T. gondii

cathepsin B Rhoptries 113 346 249670 ME49

TGME49_ cell cycle regulator with zn-finger T. gondii

Rhoptries 114 347 252200 domain-containing protein ME49

TGME49_ T. gondii

hypothetical protein Rhoptries 115 348 210095 ME49

TGME49_ T. gondii

hypothetical protein Rhoptries 116 349 296015 ME49

TGME49_ T. gondii

hypothetical protein Rhoptries 117 350 297070 ME49

TGME49_ T. gondii

hypothetical protein Rhoptries 118 351 311320 ME49

TGME49_ T. gondii

hypothetical protein Rhoptries 119 352 321700 ME49 Polyn. Polyp.

Gene ID Gene Product Description Organism organelle SEQ SEQ

ID ID

TGVEG_2 T. gondii

rhoptry neck protein RON6 Rhoptries 188 All 97960B VEG

TGME49_ T. gondii

rhoptry neck protein RON6 (RON6) Rhoptries 189 All 297960 ME49

TGME49_ T. gondii

rhoptry neck protein RON8 (RON8) Rhoptries 190 A13 306060 ME49

TGVEG_3 T. gondii

rhoptry neck protein RON9 Rhoptries 191 A1A 08810A VEG

TGVEG_3 T. gondii

rhoptry neck protein RON9 Rhoptries 192 425 08810B VEG

TGME49_ T. gondii

rhoptry neck protein RON9 (RON9) Rhoptries 193 426 308810 ME49

TGME49_ T. gondii

rhoptry neck protein, putative Rhoptries 194 427 265120 ME49

TGME49_ T. gondii

rhoptry neck protein, putative Rhoptries 195 428 327200 ME49

TGME49_ T. gondii

rhoptry protein ROP1 (ROP1) Rhoptries 196 429 309590 ME49

TGME49_ T. gondii

rhoptry protein ROP10 (ROP10) Rhoptries 197 430 315490 ME49

TGME49_ T. gondii

rhoptry protein ROP12 (ROP12) Rhoptries 198 431 203990 ME49

TGME49_ T. gondii

rhoptry protein ROP13 (ROP13) Rhoptries 199 432 312270 ME49

TGME49_ T. gondii

rhoptry protein ROP14 (ROP14) Rhoptries 200 433 315220 ME49

TGME49_ T. gondii

rhoptry protein ROP15 (ROP15) Rhoptries 201 434 211290 ME49

TGME49_ T. gondii

rhoptry protein ROP16 (ROP16) Rhoptries 202 435 262730 ME49

TGME49_ T. gondii

rhoptry protein ROP17 (ROP17) Rhoptries 203 436 258580 ME49

TGME49_ T. gondii

rhoptry protein ROP18 (ROP18) Rhoptries 204 437 205250 ME49

TGME49_ T. gondii

rhoptry protein ROP2A (ROP2A) Rhoptries 205 438 215785 ME49

TGME49_ T. gondii

rhoptry protein ROP4 (ROP4) Rhoptries 206 439 295125 ME49

TGGT1_4 T. gondii

rhoptry protein ROP5 Rhoptries 207 440 11430 GT1

TGVEG_4 T. gondii

rhoptry protein ROP5 Rhoptries 208 441 42220 VEG

TGME49_ T. gondii

rhoptry protein ROP5 (ROP5) Rhoptries 209 442 308090 ME49

TGME49_ T. gondii

rhoptry protein ROP6 (ROP6) Rhoptries 210 443 258660 ME49

TGGT1_3 T. gondii

rhoptry protein ROP7 Rhoptries 211 AAA 65080 GT1 Polyn. Polyp.

Gene ID Gene Product Description Organism organelle SEQ SEQ

ID ID

TGME49_ T. gondii

rhoptry protein ROP7 (ROP7) Rhoptries 212 445 295110 ME49

TGVEG_3 T. gondii

rhoptry protein ROP8 Rhoptries 213 446 63030 VEG

TGME49_ T. gondii

rhoptry protein ROP8 (ROP8) Rhoptries 214 447 215775 ME49

TGME49_ T. gondii

rhoptry protein ROP9 (ROP9) Rhoptries 215 448 243730 ME49

TGME49_ T. gondii

rhoptry protein, putative Rhoptries 216 449 295105 ME49

TGME49_ T. gondii

rhoptry protein, putative Rhoptries 217 450 315210 ME49

TGME49_ T. gondii

rhoptry protein, putative Rhoptries 218 451 315940 ME49

TGME49_ T. gondii

RON 11 Rhoptries 219 452 230350 ME49

TGME49_ T. gondii

RON 12 Rhoptries 220 453 232020 ME49

TGME49_ T. gondii

RON2L1 Rhoptries 221 454 294400 ME49

TGME49_ T. gondii

ROP2L11 Rhoptries 222 455 296020 VEG

TGME49_ T. gondii

ROP2L12 Rhoptries 223 456 296000 ME49

TGME49_ T. gondii

ROP38 Rhoptries 224 457 242100 ME49

TGME49_ T. gondii

ROP47 Rhoptries 225 458 261740 ME49

TGME49_ T. gondii

ROP48 Rhoptries 226 459 218270 ME49

TGME49_ T. gondii

ROP5 Rhoptries 227 460 308075 VEG

TGME49_ T. gondii

ROP51 Rhoptries 228 461 241000 ME49

TGME49_ T. gondii

ROP54 Rhoptries 229 462 210370 ME49

TGME49_ T. gondii

SNARE domain-containing protein Rhoptries 230 463 300290 ME49

TGME49_ T. gondii

sodium/hydrogen exchanger NHE2 Rhoptries 231 464 299060 ME49

TGME49_ T. gondii

subtilisin SUB 2 (SUB2) Rhoptries 232 465 314500 ME49

TGME49_ T. gondii

toxofilin Rhoptries 233 466 214080 ME49

Table 3. "polyn." = polynucleotide; "polyp" = polypeptide. According to some embodiments of the invention, the heterologous polynucleotide further comprises a Toxoplasma untranslated region (UTR) nucleic acid sequence.

According to some embodiments of the invention, the Toxoplasma untranslated region (UTR) nucleic acid sequence is upstream and/or downstream of the open reading frame of the heterologous polynucleotide (e.g., which encodes the Toxoplasma secreted protein fused in frame with the pharmaceutical polypeptide).

According to some embodiments of the invention, the Toxoplasma 5'- untranslated region (5'-UTR) is placed upstream of the Toxoplasma secreted protein open reading frame.

According to some embodiments of the invention, the Toxoplasma 3'- untranslated region (3'-UTR) is placed downstream of the open reading frame encoding the Toxoplasma secreted protein fused in frame with the pharmaceutical polypeptide.

According to some embodiments of the invention, the Toxoplasma 3' untranslated region (3'-UTR) nucleic acid sequence is the GRA2 3'-UTR, GRA16 3'- UTR, GRA24 3'-UTR, SAG1 3'-UTR, or the DHFR 3'-UTR.

According to some embodiments of the invention, the Toxoplasma 5' untranslated region (5'-UTR) nucleic acid sequence is the GRA2 5'-UTR, GRA16 5'- UTR, GRA24 5'-UTR, SAG1 5'-UTR, or the DHFR 5'-UTR.

According to some embodiments of the invention, the Toxoplasma untranslated region (UTR) nucleic acid sequence is the Toxofilin 3'-UTR.

As mentioned, the heterologous polynucleotide comprises a second nucleic acid sequence encoding a pharmaceutical polypeptide being fused in frame downstream of the Toxoplasma secreted protein.

As used herein the phrase "pharmaceutical polypeptide" refers to a polypeptide having a therapeutic effect, e.g., capable of treating a pathology, when introduced into cell(s) of a subject in need thereof.

It should be noted that for certain diseases, the pharmaceutical polypeptide may be effective when reaching a certain organ, tissue, cell, cell compartment or cellular localization within the subject. For example, for treating a neurological disease, the pharmaceutical composition is preferably targeted to the nervous system, e.g., neurons, glia or other cells. Additionally or alternatively, for certain diseases, the pharmaceutical polypeptide should reach a certain cellular localization such as the cell nucleus in case the target of the pharmaceutical composition is situated (positioned) within the cell nucleus. For example, in the case of MECP2, the active protein binds to DNA within the nucleus and regulates gene expression.

According to some embodiments of the invention, the nucleic acid construct further comprises at least one in frame cleavage site, which allows detachment of the pharmaceutical polypeptide from the Toxoplasma secreted protein.

According to some embodiments of the invention, the pharmaceutical polypeptide comprises a wild type amino acid sequence corresponding to the endogenous protein capable of treating the pathology.

According to some embodiments of the invention, the pharmaceutical polypeptide comprises an antibody capable of treating the pathology.

According to some embodiments of the invention, the pharmaceutical polypeptide comprises an antigen capable of treating the pathology.

According to some embodiments of the invention, the pharmaceutical polypeptide comprises a toxin capable of treating the pathology.

According to some embodiments of the invention, the pharmaceutical polypeptide comprises an enzyme, a structural polypeptide, a motility polypeptide, a regulatory polypeptide, a storage polypeptide, a signaling/ligand polypeptide, a receptor polypeptide, a sensory polypeptide, an antibody, a protein channel and/or a transport polypeptide.

According to some embodiments of the invention, the pharmaceutical polypeptide is Galactocerebrosidase (GALC).

According to some embodiments of the invention, the pharmaceutical polypeptide is Galactocerebrosidase (GALC) isoform 1, isoform 2, isoform 3, isoform 4 or isoform 5.

According to some embodiments of the invention, the pharmaceutical polypeptide is Methyl-CpG Binding Protein 2 (MECP2).

According to some embodiments of the invention, the pharmaceutical polypeptide is Methyl-CpG Binding Protein 2 (MECP2) isoform 1 or MECP2 isoform 2. According to some embodiments of the invention, the pharmaceutical polypeptide is Glial Cell Derived Neurotrophic Factor (GDNF).

According to some embodiments of the invention, the pharmaceutical polypeptide is Glial Cell Derived Neurotrophic Factor (GDNF) isoform 1, isoform 2, isoform 3, isoform 4 or isoform 5.

According to some embodiments of the invention, the pharmaceutical polypeptide is Aspartoacylase (ASPA).

According to some embodiments of the invention, the pharmaceutical polypeptide is Survival Motor Neuron Protein (SMN1).

According to some embodiments of the invention, the pharmaceutical polypeptide is Survival Motor Neuron Protein isoform SMN, isoform SMN-delta5, isoform SMN-delta7, or isoform SMN-delta57.

According to some embodiments of the invention, the pharmaceutical polypeptide is Parkin (PARK2).

According to some embodiments of the invention, the pharmaceutical polypeptide is Parkin (PARK2) isoform 1, isoform 2, isoform 3, isoform 4, isoform 5, isoform 6, isoform 7, or isoform 8.

According to some embodiments of the invention, the pharmaceutical polypeptide is Transcription Factor EB (TFEB).

According to some embodiments of the invention, the pharmaceutical polypeptide is Transcription Factor EB (TFEB) isoform 1 or isoform 2.

According to some embodiments of the invention, the pharmaceutical polypeptide is a TALEN (TALE nuclease).

According to some embodiments of the invention, the pharmaceutical polypeptide is a TALE TF (TALE transcription factor).

Table 4 hereinbelow, lists exemplary therapeutic proteins, which can be secreted to the subject by the Toxoplasma of some embodiments of the invention. Table 4, exemplary pharmaceutical proteins

Polynucleotide

Polypeptide

Gene GenBank

GenBank Accession SEQ ID NO: SEQ ID NO:

Name/Symbol Accession Number

Number

(coding sequence)

TFEB NP_001258874.1 4610 NM_001271945.1 4600

TFEB NP_001161299.2 4618 NM_001167827.2 4619

TFEB NP_001258873.1 4620 NM_001271944.1 4621

TFEB NP_009093.1 4622 NM_007162.2 4623

TFEB NP_001258872.1 4624 NM_001271943.1 4625

HGSNAT NP_689632 4301 NM_152419 4040

MEF2C NP_001124477 4302 NM_001131005 4041

MEF2C NP_002388 4303 NM_002397 4042

NRTN NP_004549 4304 NM_004558 4043

PGF NP_002623 4305 NM_002632 4044

RPE65 NP_000320 4306 NM_000329 4045

GDNF NP_000505 4307 NM_000514 4046

GDNF NP_954701 4308 NM_199231 4047

CNGB3 NP_061971 4309 NM_019098 4048

BDNF NP_001137277 4310 NM_001143816 4049

BDNF NP_001137279 4311 NM_001143814 4050

BDNF NP_001137288 4312 NM_001143813 4051

BDNF NP_001137278 4313 NM_001143812 4052

BDNF NP_733931 4314 NM_001143806 4053

BDNF NP_733930 4315 NM_001143811 4054

BDNF NP_733929 4316 NM_001709 4055

BDNF NP_733927 4317 NM_001143805 4056

BDNF NP_733928 4318 NM_001143810 4057

BDNF NP_001700 4319 NM_170735 4058

BDNF NP_001137281 4320 NM_170734 4059

BDNF NP_001137282 4321 NM_170733 4060

BDNF NP_001137280 4322 NM_170732 4061

BDNF NP_001137285 4323 NM_170731 4062

BDNF NP_001137286 4324 NM_001143809 4063

BDNF NP_001137283 4325 NM_001143807 4064

BDNF NP_001137284 4326 NM_001143808 4065

APP NP_001129601 4327 NM_000484 4066

APP NP_958817 4328 NM_201414 4067

APP NP_958816 4329 NM_001136131 4068

APP NP_001129602 4330 NM_201413 4069

APP NP_001129488 4331 NM_001136130 4070

APP NP_000475 4332 NM_001136016 4071

APP NP_001129603 4333 NM_001136129 4072

ASPA NP_000040 4334 NM_001128085 4073

ASPA NP_001121557 4335 NM_000049 4074

S1PR1 NP_001391 4336 NM_001400 4075

IDS NP_001160022 4337 NM_001166550 4076

IDS NP_006114 4338 NM_006123 4077

IDS NP_000193 4339 NM_000202 4078 Polynucleotide

Polypeptide

Gene GenBank

GenBank Accession SEQ ID NO: SEQ ID NO:

Name/Symbol Accession Number

Number

(coding sequence)

HTRA2 NP_659540 4340 NM_013247 4079

HTRA2 NP_037379 4341 NM_145074 4080

CDNF NP_001025125 4342 NM_001029954 4081

DNAJC5 NP_079495 4343 NM_025219 4082

IDUA NP_000194 4344 NM_000203 4083

SPON1 NP_006099 4345 NM_006108 4084

NMNAT2 NP_733820 4346 NM_015039 4085

NMNAT2 NP_055854 4347 NM_170706 4086

FMR1 NP_002015 4348 NM_002024 4087

MECP2 NP_004983 4349 NM_001110792 4088

MECP2 NP_001104262 4350 NM_004992 4089

GNS NP_002067 4351 NM_002076 4090

VEGFB NP_003368 4352 NM_003377 4091

HIF1A NP_851397 4353 NM_181054 4092

HIF1A NP_001521 4354 NM_001530 4093

SERPINF1 NP_002606 4355 NM_002615 4094

VEGFA NP_001165093 4356 NM_001025369 4095

VEGFA NP_001165094 4357 NM_001025366 4096

VEGFA NP_001020540 4358 NM_001025367 4097

VEGFA NP_001020541 4359 NM_001025368 4098

VEGFA NP_003367 4360 NM_003376 4099

VEGFA NP_001165101 4361 NM_001171627 4100

VEGFA NP_001020537 4362 NM_001171626 4101

VEGFA NP_001165100 4363 NM_001171629 4102

VEGFA NP_001020539 4364 NM_001171628 4103

VEGFA NP_001020538 4365 NM_001171624 4104

VEGFA NP_001028928 4366 NM_001171630 4105

VEGFA NP_001165098 4367 NM_001171625 4106

VEGFA NP_001165097 4368 NM_001171622 4107

VEGFA NP_001165096 4369 NM_001171623 4108

VEGFA NP_001165095 4370 NM_001033756 4109

VEGFA NP_001165099 4371 NM_001025370 4110

STC1 NP_003146 4372 NM_003155 4111

NGB NP_067080 4373 NM_021257 4112

NFE2L2 NP_001138884 4374 NM_006164 4113

NFE2L2 NP_006155 4375 NM_001145412 4114

NFE2L2 NP_001138885 4376 NM_001145413 4115

MAPK7 NP_620603 4377 NM_002749 4116

MAPK7 NP_620602 4378 NM_139034 4117

MAPK7 NP_620601 4379 NM_139033 4118

MAPK7 NP_002740 4380 NM_139032 4119

NMNAT1 NP_073624 4381 NM_022787 4120

NGF NP_002497 4382 NM_002506 4121

NAGLU NP_000254 4383 NM_000263 4122

CLU NP_001822 4384 NM_001831 4123

MME NP_009218 4385 NM_000902 4124

MME NP_009219 4386 NM_007289 4125 Polynucleotide

Polypeptide

Gene GenBank

GenBank Accession SEQ ID NO: SEQ ID NO:

Name/Symbol Accession Number

Number

(coding sequence)

MME NP_000893 4387 NM_007287 4126

MME NP_009220 4388 NM_007288 4127

CTSA NP_001121167 4389 NM_001167594 4128

CTSA NP_000299 4390 NM_000308 4129

CTSA NP_001161066 4391 NM_001127695 4130

PPT1 NP_000301 4392 NM_000310 4131

PPT1 NP_001136076 4393 NM_001142604 4132

TIMP2 NP_003246 4394 NM_003255 4133

MANF NP_006001 4395 NM_006010 4134

TIMP1 NP_003245 4396 NM_003254 4135

GAD2 NP_000809 4397 NM_001134366 4136

GAD2 NP_001127838 4398 NM_000818 4137

GALC NP_000144 4399 NM_000153 4138

GAD1 NP_000808 4400 NM_013445 4139

GAD1 NP_038473 4401 NM_000817 4140

NTF3 NP_002518 4402 NM_002527 4141

NTF3 NP_001096124 4403 NM_001102654 4142

GUSB NP_000172 4404 NM_000181 4143

HGF NP_001010932 4405 NM_001010933 4144

HGF NP_001010931 4406 NM_001010932 4145

HGF NP_001010934 4407 NM_001010931 4146

HGF NP_000592 4408 NM_000601 4147

HGF NP_001010933 4409 NM_001010934 4148

SERPINI1 NP_005016 4410 NM_005025 4149

SERPINI1 NP_001116224 4411 NM_001122752 4150

RGS2 NP_002914 4412 NM_002923 4151

NPY NP_000896 4413 NM_000905 4152

UCP2 NP_003346 4414 NM_003355 4153

SST NP_001039 4415 NM_001048 4154

SGSH NP_000190 4416 NM_000199 4155

CYP2J2 NP_000766 4417 NM_000775 4156

GM2A NP_000396 4418 NM_000405 4157

GM2A NP_001161079 4419 NM_001167607 4158

HEXA NP_000511 4420 NM_000520 4159

UCHL1 NP_004172 4421 NM_004181 4160

HEXB NP_000512 4422 NM_000521 4161

PINK1 NP_115785 4423 NM_032409 4162

PRDX3 NP_006784 4424 NM_006793 4163

PRDX1 NP_859048 4425 NM_002574 4164

PRDX1 NP_859047 4426 NM_181696 4165

PRDX1 NP_002565 4427 NM_181697 4166

GLB1 NP_001129074 4428 NM_001135602 4167

GLB1 NP_001073279 4429 NM_000404 4168

GLB1 NP_000395 4430 NM_001079811 4169

TPP1 NP_000382 4431 NM_000391 4170

ANG NP_001136 4432 NM_001097577 4171

ANG NP_001091046 4433 NM_001145 4172 Polynucleotide

Polypeptide

Gene GenBank

GenBank Accession SEQ ID NO: SEQ ID NO:

Name/Symbol Accession Number

Number

(coding sequence)

HMOX1 NP_002124 4434 NM_002133 4173

NRG1 NP_039258 4435 NM_001160001 4174

NRG1 NP_001153479 4436 NM_001159995 4175

NRG1 NP_001153476 4437 NM_001160007 4176

NRG1 NP_001153477 4438 NM_001160008 4177

NRG1 NP_039254 4439 NM_001159996 4178

NRG1 NP_039256 4440 NM_001159999 4179

NRG1 NP_001153467 4441 NM_001160002 4180

NRG1 NP_001153468 4442 NM_001160004 4181

NRG1 NP_039250 4443 NM_004495 4182

NRG1 NP_001153471 4444 NM_001160005 4183

NRG1 NP_001153480 4445 NM_013964 4184

NRG1 NP_039251 4446 NM_013960 4185

NRG1 NP_039252 4447 NM_013962 4186

NRG1 NP_039253 4448 NM_013959 4187

NRG1 NP_001153474 4449 NM_013958 4188

NRG1 NP_004486 4450 NM_013957 4189

NRG1 NP_001153473 4451 NM_013956 4190

NFIL3 NP_005375 4452 NM_005384 4191

MAN2B1 NP_000519 4453 NM_000528 4192

DHCR24 NP_055577 4454 NM_014762 4193

DDC NP_001076440 4455 NM_001082971 4194

DDC NP_000781 4456 NM_000790 4195

TP53 NP_001119588 4457 NM_001126113 4196

TP53 NP_001119589 4458 NM_001126112 4197

TP53 NP_001119585 4459 NM_000546 4198

TP53 NP_001119584 4460 NM_001126116 4199

TP53 NP_001119587 4461 NM_001126117 4200

TP53 NP_001119586 4462 NM_001126114 4201

TP53 NP_000537 4463 NM_001126115 4202

ADNP NP_056154 4464 NM_181442 4203

ADNP NP_852107 4465 NM_015339 4204

GAN NP_071324 4466 NM_022041 4205

GAL NP_057057 4467 NM_015973 4206

SMN1 NP_000335 4468 NM_022874 4207

SMN1 NP_075015 4469 NM_017411 4208

SMN1 NP_075014 4470 NM_000344 4209

SMN1 NP_059107 4471 NM_022876 4210

SMN1 NP_075013 4510 NM_022875 4211

SMN1 NP_075012 4511 NM_022877 4212

PROC NP_000303 4512 NM_000312 4213

TRAP1 NP_057376 4513 NM_016292 4214

IL18BP NP_001138527 4514 NM_173042 4215

IL18BP NP_001138529 4515 NM_001145057 4216

IL18BP NP_766630 4516 NM_001145055 4217

IL18BP NP_001034749 4517 NM_001039660 4218

IL18BP NP_001034748 4518 NM_001039659 4219 Polynucleotide

Polypeptide

Gene GenBank

GenBank Accession SEQ ID NO: SEQ ID NO:

Name/Symbol Accession Number

Number

(coding sequence)

PRDX6 NP_004896 4519 NM_004905 4220

TXN NP_003320 4520 NM_003329 4221

GAA NP_000143 4521 NM_000152 4222

GAA NP_001073271 4522 NM_001079803 4223

GAA NP_001073272 4523 NM_001079804 4224

GHRL NP_001128416 4524 NM_001134945 4225

GHRL NP_001128417 4525 NM_001134944 4226

GHRL NP_001128413 4526 NM_001134946 4227

GHRL NP_057446 4527 NM_016362 4228

GHRL NP_001128418 4528 NM_001134941 4229

CSF3 NP_757373 4529 NM_000759 4230

CSF3 NP_757374 4530 NM_172220 4231

CSF3 NP_000750 4531 NM_172219 4232

UBE3A NP_000453 4532 NM_130838 4233

UBE3A NP_570854 4533 NM_130839 4234

UBE3A NP_570853 4534 NM_000462 4235

ABCD1 NP_000024 4535 NM_000033 4236

NDP NP_000257 4536 NM_000266 4237

TH NP_954986 4537 NM_199292 4238

TH NP_954987 4538 NM_199293 4239

TH NP_000351 4539 NM_000360 4240

GCH1 NP_000152 4540 NM_000161 4241

GCH1 NP_001019241 4541 NM_001024070 4242

GCH1 NP_001019242 4542 NM_001024071 4243

GCH1 NP_001019195 4543 NM_001024024 4244

BCL2 NP_000648 4544 NM_000633 4245

BCL2 NP_000624 4545 NM_000657 4246

EN02 NP_001966 4546 NM_001975 4247

EPO NP_000790 4547 NM_000799 4248

GBA NP_001005741 4548 NM_001005742 4249

GBA NP_001165283 4549 NM_001005741 4250

GBA NP_001005742 4550 NM_001171811 4251

GBA NP_001165282 4551 NM_001171812 4252

GBA NP_000148 4552 NM_000157 4253

AGA NP_000018 4553 NM_000027 4254

AGA NP_001165459 4554 NM_001171988 4255

UPF1 NP_002902 4555 NM_002911 4256

PSAP NP_001035931 4556 NM_001042465 4257

PSAP NP_002769 4557 NM_001042466 4258

PSAP NP_001035930 4558 NM_002778 4259

ANXA1 NP_000691 4559 NM_000700 4260

IGF1 NP_001104753 4560 NM_001111284 4261

IGF1 NP_001104754 4561 NM_001111283 4262

IGF1 NP_000609 4562 NM_000618 4263

IGF2 NP_001035835 4563 NM_001007139 4264

IGF2 NP_000603 4564 NM_000612 4265

IGF2 NP_001007140 4565 NM_000207 4266 Polynucleotide

Polypeptide

Gene GenBank

GenBank Accession SEQ ID NO: SEQ ID NO:

Name/Symbol Accession Number

Number

(coding sequence)

IGF2 NP_001121070 4566 NM_001127598 4267

IGF2 NP_000198 4567 NM_001042376 4268

PARK2 NP_054642 4568 NM_013988 4269

PARK2 NP_054643 4569 NM_013987 4270

PARK2 NP_004553 4570 NM_004562 4271

FUCA1 NP_000138 4571 NM_000147 4272

SIRT1 NP_001135970 4572 NM_012238 4273

SIRT1 NP_036370 4573 NM_001142498 4274

PARK7 NP_001116849 4574 NM_007262 4275

PARK7 NP_009193 4575 NM_001123377 4276

ANXA2 NP_004030 4576 NM_004039 4277

ANXA2 NP_001002857 4577 NM_001136015 4278

ANXA2 NP_001002858 4578 NM_001002858 4279

ANXA2 NP_001129487 4579 NM_001002857 4280

GCG NP_002045 4580 NM_002054 4281

LEP NP_000221 4581 NM_000230 4282

SH3BP5 NP_004835 4582 NM_001018009 4283

SH3BP5 NP_001018009 4583 NM_004844 4284

CNTF NP_000605 4584 NM_053023 4285

CNTF NP_444251 4585 NM_000614 4286

GLA NP_000160 4586 NM_000169 4287

KCNN3 NP_740752 4587 NM_170782 4288

KCNN3 NP_002240 4588 NM_002249 4289

ARSA NP_001078896 4589 NM_000487 4290

ARSA NP_001078897 4590 NM_001085428 4291

ARSA NP_000478 4591 NM_001085426 4292

ARSA NP_001078894 4592 NM_001085427 4293

ARSA NP_001078895 4593 NM_001085425 4294

SUMF1 NP_001158147 4594 NM_001164674 4295

SUMF1 NP_001158146 4595 NM_001164675 4296

SUMF1 NP_877437 4596 NM_182760 4297

CRH NP_000747 4597 NM_000756 4298

SMPD1 NP_000534 4598 NM_000543 4299

SMPD1 NP_001007594 4599 NM_001007593 4300

Table 4.

It should be noted that any of the sequences described herein (e.g., the Toxoplasma secreted protein and/or the therapeutic polypeptides) can be codon optimized for expression in a target cell (e.g., Toxoplasma). Examples of such sequence modifications include, but are not limited to, an altered G/C content to more closely approach that typically found in the target cell species of interest, and the removal of codons atypically found in the target cell a species commonly referred to as codon optimization. The phrase "codon optimization" refers to the selection of appropriate DNA nucleotides for use within a structural gene or fragment thereof that approaches codon usage within the target cell of interest. Therefore, an optimized gene or nucleic acid sequence refers to a gene in which the nucleotide sequence of a native or naturally occurring gene has been modified in order to utilize statistically-preferred or statistically-favored codons within the target cell. The nucleotide sequence typically is examined at the DNA level and the coding region optimized for expression in the target cell species determined using any suitable procedure. In this method, the standard deviation of codon usage, a measure of codon usage bias, may be calculated by first finding the squared proportional deviation of usage of each codon of the native gene relative to that of highly expressed target cell genes, followed by a calculation of the average squared deviation. The formula used is: 1 SDCU = n = 1 N [ ( Xn - Yn ) / Yn ] 2 / N, where Xn refers to the frequency of usage of codon n in highly expressed target cell genes, where Yn to the frequency of usage of codon n in the gene of interest and N refers to the total number of codons in the gene of interest.

One method of optimizing the nucleic acid sequence in accordance with the preferred codon usage for a particular target cell type is based on the direct use, without performing any extra statistical calculations, of codon optimization Tables such as those provided on-line at the Codon Usage Database through the Codon usage tabulated from international DNA sequence databases: status for the year 2000. Y Nakamura, T Gojobori, T Ikemura - Nucleic acids research, 2000 - Oxford Univ Press. The Codon Usage Database contains codon usage tables for a number of different species, with each codon usage Table having been statistically determined based on the data present in Genbank.

By using the above Tables to determine the most preferred or most favored codons for each amino acid in a particular species (for example, Toxoplasma gondii), a naturally-occurring nucleotide sequence encoding a protein of interest can be codon optimized for that particular target cell species. This is effected by replacing codons that may have a low statistical incidence in the particular species genome with corresponding codons, in regard to an amino acid, that are statistically more favored. However, one or more less-favored codons may be selected to delete existing restriction sites, to create new ones at potentially useful junctions (5' and 3' ends to add signal peptide or termination cassettes, internal sites that might be used to cut and splice segments together to produce a correct full-length sequence), or to eliminate nucleotide sequences that may negatively effect mRNA stability or expression.

The naturally-occurring encoding nucleotide sequence may already, in advance of any modification, contain a number of codons that correspond to a statistically- favored codon in a particular target cell species. Therefore, codon optimization of the native nucleotide sequence may comprise determining which codons, within the native nucleotide sequence, are not statistically-favored with regards to a particular target cell, and modifying these codons in accordance with a codon usage table of the particular target cell to produce a codon optimized derivative. A modified nucleotide sequence may be fully or partially optimized for target cell codon usage provided that the protein encoded by the modified nucleotide sequence is produced at a level higher than the protein encoded by the corresponding naturally occurring or native gene. Additionally or alternatively, codon optimized/usage sequence can fold better than the protein encoded by the corresponding naturally occurring or native gene. Additionally or alternatively, codon optimized/usage sequence is targeted better to the target organelle(s) than the protein encoded by the corresponding naturally occurring or native gene. Additionally or alternatively, codon optimized/usage sequence is less degraded than the protein encoded by the corresponding naturally occurring or native gene.

For example, the following Table 5 of Tosoplasma codon usage can be used.

Table 5

Table 5.

According to some embodiments of the invention, the heterologous polynucleotide further comprises a nucleic acid sequence encoding a selectable marker.

According to some embodiments of the invention, the selectable marker comprises chloramphenicol acetyltransferase (CAT), DHFR-TS, BLE, HXGPRT, UPRT, TK, CD, a fluorescent protein (such as GFP, YFP, RFP, mCherry or other) or an epitope tag (such as HA, Myc, Ty- 1, FLAG or other).

According to some embodiments of the invention, the selectable marker comprises HXGPRT.

Toxoplasma tachyzoites deficient in HXGPRT activity can be selected in the presence of 6-thioxanthine (6-TX), whereas strains expressing HXGPRT activity can be selected in mycophenolic acid (MPA) and/or xanthine (Pfefferkom and Borotz, 1994, Pfefferkom E. R., Borotz S. (1994) Exp. Parasitol. 79, 374-382). According to some embodiments of the invention, the selectable marker comprises chloramphenicol acetyltransferase (CAT) (for positive selection using Chloramphenicol), DHFR-TS (for positive selection using Pyrimethamine), BLE (for positive selection using Phleomycin), HXGPRT (for positive selection using Mycophenolic acid+xanthine or negative selection using 6-thioxanthine, UPRT (for negative selection using 5'-fluo-2'-deoxyuridine), TK (for positive selection using Ganciclovir), CD (for positive selection using 5-fluorocytosine), a fluorescent protein (such as GFP, YFP, RFP, mCherry or other) or an epitope tag (such as HA, Myc, Ty-1, FLAG or other).

According to some embodiments of the invention, the nucleic acid construct further comprises a third nucleic acid sequence encoding an inducible self-destruction element.

According to some embodiments of the invention, the inducible self destruction element is in the same frame as the therapeutic cassette.

According to some embodiments of the invention, the inducible self destruction element is in a separate reading frame, regulated by different regulatory elements. For instance, the therapeutic cassette can be driven by a latent- stage promoter, while the self-destruction cassette will be driven by a drug-inducible promoter.

According to some embodiments of the invention, the inducible self-destruction element is active in response to a drug.

According to some embodiments of the invention, the drug comprises an antibiotic.

According to some embodiments of the invention, the third nucleic acid sequence encoding the inducible self-destruction element is comprised in the same nucleic acid construct which comprises the heterologous polynucleotide comprising the first nucleic acid sequence encoding the Toxoplasma secreted protein in frame fused upstream to the second nucleic acid sequence encoding the pharmaceutical polypeptide.

According to some embodiments of the invention, the third nucleic acid sequence encoding the inducible self-destruction element is comprised in a separate nucleic acid construct with respect to the nucleic acid construct which comprises the heterologous polynucleotide comprising the first nucleic acid sequence encoding the Toxoplasma secreted protein in frame fused upstream to the second nucleic acid sequence encoding the pharmaceutical polypeptide.

According to some embodiments of the invention, the Toxoplasma untranslated region (UTR) nucleic acid sequence is upstream and/or downstream of the open reading frame of the selectable marker and/or of the self-destruction element.

According to an aspect of some embodiments of the present invention there is provided a nucleic acid construct system comprising at least two nucleic acid constructs, wherein a first nucleic acid construct of the at least two nucleic acid constructs is the nucleic acid construct of some embodiments of the invention, and a second nucleic construct of the at least two nucleic acid constructs comprises a polynucleotide encoding a selectable marker.

Typical cloning vectors may also contain a transcription and translation initiation sequence, transcription and translation terminator and a polyadenylation signal.

Following is a non-limiting list of Toxoplasma nucleic acid constructs which can be used as a backbone to generate the nucleic acid construct of some embodiments of the invention: pGRA, pUPRT, pROPl, pTUB l, pTUB8, pLIC, pTOXO, pMIC2, pHX, pCAT, pDHFR, pBlueScript, pTet07SAGl, pTet07SAG4, pSAGl, pSAG4, pHTU, pTKO, pLoxP-DHFR, pminCAT/HXGPRT+, pminCAT/HXGPRT-, pDHFR-TSc3/M3, pDHFR-TSc3/M2M3, pminiHXGPRT, and pUC19. These vectors can be obtained from various sources, such as the nonprofit plasmid repository "addgene"; the NIH AIDS Reagent Program; Agilent Technologies; NEB; Thermo Fisher Scientific; Sigma Aldrich; GenScript; and MoBiTec GmbH.

Table 6 hereinbelow provides non-limiting exemplary vectors and catalogue numbers.

Table 6, Vector Backbones

Construct name Manufacturer Catalogue number

pLIC Addgene

pCAT Addgene 59018

pTet07SAGl Addgene 59017

pSAGl Addgene 54467

Table 6.

According to an aspect of some embodiments of the present invention there is provided a Toxoplasma transformed with the nucleic acid construct of some embodiments of the invention or with the nucleic acid construct system of some embodiments of the invention.

As used herein the term "Toxoplasma" refers to the intracellular, parasitic protozoan Toxoplasma gondii.

Non-limiting examples of a Toxoplasma gondii strains which can be used along with the construct and methods of some embodiments of the invention include:

(i) Type I- including but not limited to: GTl, RH, ENT, VEL, TgCatCol and CAST.

(ii) Type II- including but not limited to: ME49, Beverly, PDS, PLK, PTG, DEG, PIH, TgNmBrl and PRU.

(iii) Type III- including but not limited to: VEG, C56, CTG, CEP, TgGoatUS4 and STRL.

(iv) Type Atypical- including but not limited to: TgCTPrC3, TgBbUS l, TgRabbitBrl and TgPigUS 15.

According to some embodiments of the invention, the Toxoplasma is not attenuated. As is conventional in the art, the term "attenuated" refers to a weakened and/or less vigorous strain of T. gondii as compared to a native (wild type) T. gondii which is non-genetically modified. Usually, an attenuated mutant of T. gondii is capable of stimulating an immune response and creating immunity but not causing illness. Attenuation can be achieved by conventional methods including, but not limited, to γ- irradiation or the generation of a pyrimidine auxotroph.

Non-limiting examples of attenuated Toxoplasma are described in US 20120045477 Al and US 8673289 B2, which are fully incorporated herein by reference.

It should be noted that T. gondii replication is essential for its ability to disseminate in the body and reach distal tissues during the acute stage of infection, its ability to persist in the infected cells and its ability to establish chronic cysts in the host.

According to some embodiments of the invention, the Toxoplasma of some embodiments of the invention is characterized by a higher proliferation rate as compared to the proliferation rate of an attenuated Toxoplasma.

According to some embodiments of the invention, the Toxoplasma of some embodiments of the invention is not an auxotroph.

According to some embodiments of the invention, the Toxoplasma of some embodiments of the invention comprises a functional endogenous pathway suitable for de-novo biosynthesis of pyrimidines.

According to some embodiments of the invention, the Toxoplasma of some embodiments of the invention is capable of differentiation into the Bradizoite stage.

According to some embodiments of the invention, the Toxoplasma of some embodiments of the invention is capable of RNA synthesis and/or DNA replication.

According to some embodiments of the invention, the Toxoplasma of some embodiments of the invention is capable of RNA synthesis.

According to some embodiments of the invention, the Toxoplasma of some embodiments of the invention is capable of DNA replication.

According to some embodiments of the invention, the Toxoplasma of some embodiments of the invention comprises an endogenous functional Carbamoyl phosphate synthetase II (CPSII) enzyme (as described in US8673289), e.g., as provided in any of SEQ ID NOs: 4612-4617. The constructs and Toxoplasma of some embodiments of the invention can be used to deliver a protein-of-interest into the subject, e.g., into a specific tissue or cell type of the subject.

According to an aspect of some embodiments of the present invention there is provided a method of administering a protein-of-interest into a tissue-of-interest of a subject, the method comprising administering to the subject the Toxoplasma of some embodiments of the invention or the pharmaceutical composition of some embodiments of the invention, thereby administering the protein-of-interest to the tissue-of-interest of the subject.

Examples of tissue-of-interest include, but are not limited to, a central nervous system, muscles, parts of the eye, blood, lymph nodes, spleen, white blood cells, digestive system, and lamina propria.

It should be noted that since the Toxoplasma can cross the blood brain barrier it can deliver the protein-of-interest into the central nervous system of the subject in need thereof.

According to an aspect of some embodiments of the present invention there is provided a method of administering a protein-of-interest into a central nervous system of a subject, the method comprising administering to the subject the Toxoplasma of some embodiments of the invention or the pharmaceutical composition of some embodiments of the invention, thereby administering the protein-of-interest to the central nervous system of the subject.

Thus, the Toxoplasma of some embodiments of the invention can be used to treat a subject in need thereof by delivering a protein-of-interest (e.g., a therapeutic polypeptide) which is capable of treating the subject.

According to an aspect of some embodiments of the present invention there is provided a method of treating a subject in need thereof, comprising administering to the subject the Toxoplasma of some embodiments of the invention or the pharmaceutical composition of some embodiments of the invention, wherein the subject is diagnosed with a pathology treatable by administration of the pharmaceutical polypeptide in a central nervous system of the subject, thereby treating the subject in need thereof.

According to an aspect of some embodiments of the present invention there is provided a method of treating a subject in need thereof, comprising administering to the subject a Toxoplasma comprising a nucleic acid construct which comprises a heterologous polynucleotide comprising a first nucleic acid sequence encoding a Toxoplasma secreted protein in frame fused upstream to a second nucleic acid sequence encoding a pharmaceutical polypeptide, wherein the heterologous polynucleotide is operably linked to a promoter for directing transcription of the heterologous polynucleotide in a Toxoplasma, wherein the subject is diagnosed with a pathology treatable by administration of the pharmaceutical polypeptide in a central nervous system of the subject, thereby treating the subject in need thereof.

The term "treating" refers to inhibiting, preventing or arresting the development of a pathology (disease, disorder or condition) and/or causing the reduction, remission, or regression of a pathology. Those of skill in the art will understand that various methodologies and assays can be used to assess the development of a pathology, and similarly, various methodologies and assays may be used to assess the reduction, remission or regression of a pathology.

As used herein, the term "preventing" refers to keeping a disease, disorder or condition from occurring in a subject who may be at risk for the disease, but has not yet been diagnosed as having the disease.

As used herein, the term "subject" includes mammals, preferably human beings at any age which suffer from the pathology. Preferably, this term encompasses individuals who are at risk to develop the pathology.

According to some embodiments of the invention, the subject is diagnosed with a pathology characterized by a deficient endogenous protein in a central nervous system of a subject.

According to some embodiments of the invention, the deficient endogenous protein comprises a deletion, insertion, and/or substitution of at least one amino acid of the endogenous protein as compared to a wild type amino acid sequence of the endogenous protein.

According to some embodiments of the invention, the deficient endogenous protein comprises a reduced level of the endogenous protein as compared to a level of the endogenous protein in a healthy subject devoid of the pathology. According to some embodiments of the invention, the deficient endogenous protein is absence of the endogenous protein in the subject diagnosed with the pathology.

According to some embodiments of the invention, the subject is diagnosed with Krabbe disease.

According to some embodiments of the invention, the subject is diagnosed with Rett syndrome.

According to some embodiments of the invention, the subject is diagnosed with Canavan disease.

According to some embodiments of the invention, the subject is diagnosed with

Spinal Muscular Atrophy.

According to some embodiments of the invention, the subject is diagnosed with Parkinson's disease.

According to some embodiments of the invention, the subject is diagnosed with hypoxic/ischemic or neuroinflammatory CNS disorder.

According to some embodiments of the invention, the subject is diagnosed with Alzheimer's disease.

According to some embodiments of the invention, the subject is diagnosed with Amyotrophic Lateral Sclerosis.

According to some embodiments of the invention, the subject is diagnosed with

Huntington's disease.

According to some embodiments of the invention, the subject is diagnosed with a lysosomal storage disease .

According to some embodiments of the invention, the subject is diagnosed with MECP2-duplication syndrome.

According to some embodiments of the invention the subject does not a have a compromised immune system. Examples of subjects with compromised immune system include, but are not limited to AIDS patients, subjects receiving chemotherapy, subjects receiving immunosuppressant drugs such as for cell, tissue and/or organ transplantation.

According to some embodiments of the invention, the method further comprising administering to the subject a drug capable of inducing the self-destruction element. According to some embodiments of the invention, the method further comprising administering to the subject a molecule necessary for sustaining the Toxoplasma inside the host cell and/or body.

According to some embodiments of the invention, the molecule necessary for sustaining the Toxoplasma is an antibiotic.

According to some embodiments of the invention, the molecule necessary for sustaining the Toxoplasma is a small-molecule.

According to some embodiments of the invention, the molecule necessary for sustaining the Toxoplasma is a metabolite.

According to some embodiments of the invention, the method further comprises administering to the subject an immunosuppression drug prior to administration of the Toxoplasma and/or subsequent to administration of the Toxoplasma and/or concomitantly with administration of the Toxoplasma to the subject.

According to some embodiments of the invention, the method further comprises administering to the subject an immunosuppression drug prior to administration of the Toxoplasma to the subject.

According to some embodiments of the invention, the method further comprises administering to the subject an immunosuppression drug subsequent (or following) to administration of the Toxoplasma to the subject.

According to some embodiments of the invention, the method further comprises administering to the subject an immunosuppression drug concomitantly with administration of the Toxoplasma to the subject.

According to some embodiments of the invention, administering is performed by peripheral administration.

According to some embodiments of the invention, the peripheral administration comprises intravenous administration.

According to some embodiments of the invention, the peripheral administration comprises oral administration.

According to some embodiments of the invention, administering is performed by intraperitoneal injection.

According to some embodiments of the invention, administering is performed by intramuscular injection. According to some embodiments of the invention, administering is performed by aerosols.

According to some embodiments of the invention, administering is performed by direct administration to the central nervous system or an adjacent tissue thereof.

According to some embodiments of the invention, the disease which is treated by the methods of some embodiments of the invention, e.g., using the Toxoplasma and/or the pharmaceutical composition comprising same is any of the diseases listed in Table 7 hereinabove.

Table 7 lists nonb-limiting diseases associated with deficient or abnormal expression or function of the endogenous proteins, which can be treated by administration of the therapeutic polypeptide of some embodiments of the invention.

Table 7

Gene Name/Symbol Associated disease

TFEB Lysosomal storage diseases and neurodegenerative diseases

HGSNAT Mucopolysaccharidosis 3C (Sanfilippo syndrome C)

MEF2C Parkinson's disease

NRTN Parkinson's disease

Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

PGF neuroinfiammatory diseases, including stroke and neurodegenerative conditions.

RPE65 Leber congenital amaurosis, retinitis pigmentosa

Parkinson's disease, Supranuclear Palsy, Multiple sclerosis, Alzheimers disease, Amyotrophic Lateral Sclerosis,

GDNF Huntington's disease, retinal degenerations, Traumatic Brain

Injury, hypoxic/ischaemic CNS disorders, neuroinfiammatory diseases, including stroke and neurodegenerative conditions.

CNGB3 Achromatopsia

Amyotrophic Lateral Sclerosis, Huntington's disease,

BDNF Alzheimer's disease, Spinal Cord Injury, Obesity, Lysosomal storage disorders.

Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

APP neuroinfiammatory diseases, including stroke and neurodegenerative conditions.

ASPA Canavan disease

S1PR1 Multiple sclerosis

Mucopolysaccharidosis type II (Hunter Syndrome, iduronate

IDS

sulfatase deficiency)

HTRA2 Parkinson's disease, Alzheimer's disease

Parkinson's Disease, Alzheimers Disease, Amyotrophic Lateral

CDNF

Sclerosis Gene Name/Symbol Associated disease

DNAJC5 neurodegenerative disorders

Mucopolysaccharidosis type I (Hurler syndrome, Hurler-Scheie

IDUA

syndrome, Scheie syndrome)

SPON1 Alzheimer's disease

disorders involving axonal degenerations, such as Taxol-induced neuropathy, multiple sclerosis, some viral disorders, Traumatic

NMNAT2 Brain Injury, hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

FMR1 Fragile X syndrome

MECP2 Rett syndrome

GNS Mucopolysaccharidosis 3D (Sanfilippo syndrome D)

Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

VEGFB neuroinflammatory diseases, including stroke and neurodegenerative conditions.

HIF1A Alzheimer's disease

Parkinson's Disease, Amyotrophic Lateral Sclerosis, Multiple Sclerosis, Traumatic Brain Injury, hypoxic/ischaemic CNS

SERPINF1

disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

VEGFA neuroinflammatory diseases, including stroke and neurodegenerative conditions.

retinal degeneration, optic nerve diseases, Traumatic Brain

STC1 Injury, hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions. optic atrophy, Traumatic Brain Injury, hypoxic/ischaemic CNS

NGB disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

NFE2L2

MAPK7 Alzheimer's disease

Parkinsons disease, Leber Congential Amaurosis, Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

NMNAT1

neuroinflammatory diseases, including stroke and neurodegenerative conditions.

Amyotrophic Lateral Sclerosis, Alzheimer's disease, down

NGF

syndrome, sensory neuropathies

NAGLU Mucopolysaccharidosis 3B (Sanfilippo syndrome B)

Alzheimer's disease, Traumatic Brain Injury, hypoxic/ischaemic

CLU CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

MME Alzheimer's disease

CTSA Galactosialidosis

PPT1 ceroid lipofuscinosis neuronal type 1 (Batten disease)

Multiple sclerosis, Alzheimer's disease, and Huntington's disease, Parkinson's disease, Traumatic Brain Injury,

TIMP2

hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions. Gene Name/Symbol Associated disease

Parkinson's disease, retinitis pigmentosa, retinal artery occlusion, diabetes, Wolfram's syndrome, Alzheimer's disease, Traumatic

MANF Brain Injury, hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

TIMP1 neuroinflammatory diseases, including stroke and neurodegenerative conditions.

GAD2 Parkinson's disease

GALC Krabbe disease (globoid cell leukodystrophy)

GAD1 Parkinson's disease

Charcot-Marie -Tooth disease, multiple sclerosis, Spinal Cord Injury, X-linked adrenoleukodystrophy, diabetic neuropathies,

NTF3 axonal neuropathies, Traumatic Brain Injury, hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

GUSB Sly syndrome

HGF amyotrophic lateral sclerosis

Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

SERPINI1 neuroinflammatory diseases, including stroke and neurodegenerative conditions.

RGS2 anxiety disorders

NPY epilepsy, Post-Traumatic Stress Disorder

UCP2 acute and excitotoxic brain injury, Parkinsons disease,

SST epilepsy, alzheimer's disease,

SGSH Mucopolysaccharidosis 3A (Sanfilippo syndrome A)

Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

CYP2J2 neuroinflammatory diseases, including stroke and neurodegenerative conditions.

GM2 gangliosidoses (Tay-Sachs disease (TSD), Sandhoff-

GM2A

Jatzkewitz disease, GM2 activator deficiency)

GM2 gangliosidoses (Tay-Sachs disease (TSD), Sandhoff-

HEXA

Jatzkewitz disease, GM2 activator deficiency)

UCHL1 Alzheimer's disease

GM2 gangliosidoses (Tay-Sachs disease (TSD), Sandhoff-

HEXB

Jatzkewitz disease, GM2 activator deficiency)

PINK1 Parkinson's disease

Amyotrophic Lateral Sclerosis, parkinsons disease, alzheimers

PRDX3

disease

Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

PRDX1 neuroinflammatory diseases, including stroke and neurodegenerative conditions.

GLB1 GM1 gangliosidosis

TPP1 ceroid lipofuscinosis neuronal type 2 (Batten disease)

ANG Amyotrophic Lateral Sclerosis, Parkinson's disease

Alzheimer's (AD), Parkinson's (PD), Traumatic Brain Injury,

HMOX1 hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions. Gene Name/Symbol Associated disease

Charcot-Marie -Tooth disease, Traumatic Brain Injury,

NRG1 hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

NFIL3 Amyotrophic Lateral Sclerosis

MAN2B1 alpha-Mannosidosis

DHCR24 Alzheimer's disease.

Parkinson's disease, Aromatic L-amino-acid decarboxylase

DDC

deficiency

Brain tumors (Astrocytic tumours, Oligodendroglial tumours, Oligoastrocytic tumours, Ependymal tumours, Choroid plexus tumours, Other neuroepithelial tumours (Astroblastoma, Chordoid glioma of the third ventricle, Angiocentric glioma), Neuronal and mixed neuronal-glial tumours, Tumours of the pineal region, Embryonal tumours, Tumours of cranial and paraspinal nerves (Schwannoma, Neurofibroma, Perineurioma, Malignant peripheral nerve sheath tumour), Tumours of meningothelial cells, Mesenchymal tumours, Primary

TP53

melanocytic lesions, Other neoplasms related to the meninges (Haemangioblastoma), Tumors of the haematopoietic system (Malignant Lymphomas, Plasmocytoma, Granulocytic sarcoma), Germ cell tumours (Germinoma, Embryonal carcinoma, Yolk sac tumour, Choriocarcinoma, Teratoma, Mixed germ cell tumours), Tumours of the sellar region (craniopharyngioma, Granular cell tumour, Pituicytoma, Spindle cell oncocytoma of the adenohypophysis), Metastatic Tumours).

Alzheimer's disease, Amyotrophic Lateral Sclerosis, frontotemportal degeneration/dementia (FTD), Traumatic

ADNP Brain Injury, hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

GAN Giant Axonal Neuropathy

GAL epilepsy, Alzheimer's disease, opiate addiction/withdrawel

SMN1 Spinal muscular atrophy

Amyotrophic Lateral Sclerosis, Alzheimer's disease, multiple sclerosis, Traumatic Brain Injury, hypoxic/ischaemic CNS

PROC

disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

Parkinson's disease, Traumatic Brain Injury, hypoxic/ischaemic

TRAP1 CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

IL18BP neuroinflammatory diseases, including stroke and neurodegenerative conditions.

Alzheimer's disease, Parkinson's disease, Traumatic Brain Injury,

PRDX6 hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

TXN neuroinflammatory diseases, including stroke and neurodegenerative conditions. Gene Name/Symbol Associated disease

GAA Pompe disease

Alzheimer's disease, Parkinson's disease, Traumatic Brain Injury,

GHRL hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

Parkinson's Disease, Amyotrophic Lateral Sclerosis, Cerebral Palsy, Alzheimer's disease, Traumatic Brain Injury,

CSF3

hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

UBE3A Angelman syndrome

ABCD1 Adrenoleukodystrophy

Norrie disease, familial exudative vitreoretinopathy (FEVR),

NDP

inner ear diseases

TH Parkinson's disease

GCH1 Parkinson's disease

Amyotrophic lateral sclerosis, Parkinson's disease, Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

BCL2

neuroinflammatory diseases, including stroke and neurodegenerative conditions.

EN02 Alzheimers disease

Parkinson's disease, multiple sclerosis, Traumatic Brain Injury,

EPO hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

GBA Gaucher disease

AGA Aspartylglucosaminuria

UPF1 Amyotrophic Lateral Sclerosis

Parkinson's disease, demyelination disorders, Traumatic Brain

PSAP Injury, hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

ANXA1 neuroinflammatory diseases, including stroke and neurodegenerative conditions.

Amyotrophic Lateral Sclerosis, Traumatic Brain Injury,

IGF1 hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

IGF2 Alzheimer's disease

Parkinson's disease, Traumatic Brain Injury, hypoxic/ischaemic

PARK2 CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

FUCA1 Fucosidosis

Huntington's disease, amyotrophic lateral sclerosis, Parkinson's disease, Alzheimer disease, Frontotemporal dementia,

SIRT1 Traumatic Brain Injury, hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

PARK7 Amyotrophic Lateral Sclerosis, Parkinsons disease

Traumatic Brain Injury, hypoxic/ischaemic CNS disorders,

ANXA2 neuroinflammatory diseases, including stroke and neurodegenerative conditions. Gene Name/Symbol Associated disease

Brain insulin resistance in Alzheimer's disease, Parkinson's

GCG

disease, Amyotrophic Lateral Sclerosis

Alzheimer's disease, Obesity, Traumatic Brain Injury,

LEP hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

SH3BP5 Alzheimer's disease.

Parkinson's disease, Supranuclear Palsy, Multiple sclerosis, Alzheimers disease, Amyotrophic Lateral Sclerosis,

CNTF Huntington's disease, retinal degenerations, Traumatic Brain

Injury, hypoxic/ischaemic CNS disorders, neuroinflammatory diseases, including stroke and neurodegenerative conditions.

GLA Fabry disease

KCNN3 Alzheimer's disease

ARSA Metachromatic leukodystrophy, Multiple sulfatase deficiency

Multiple sulfatase deficiency, Mucopolysaccharidosis Type III A

SUMF1

(Sanfilippo Disease Type A)

CRH cerebral oedema

SMPD1 Niemann-Pick disease

Table 7.

According to some embodiments of the invention, the disease (pathology) is not cancer.

The nucleic acid construct, the nucleic acid construct system or the Toxoplasma transformed with the nucleic acid construct or with the nucleic acid construct system of some embodiments of the invention can be administered to an organism per se, or in a pharmaceutical composition where it is mixed with suitable carriers or excipients.

According to an aspect of some embodiments of the present invention there is provided a pharmaceutical composition comprising the Toxoplasma of some embodiments of the invention, and a pharmaceutically acceptable carrier.

According to some embodiments of the invention, the pharmaceutical composition is for treating a subject diagnosed with any of the pathologies described herein.

According to some embodiments of the invention, the pharmaceutical composition is for treating a subject diagnosed with a pathology characterized by a deficient endogenous protein in a central nervous system of a subject.

According to some embodiments of the invention, the pharmaceutical composition is for treating a subject diagnosed with a pathology which is treatable by administration of the pharmaceutical polypeptide in a central nervous system of a subject. According to some embodiments of the invention, the pharmaceutical composition further comprises an immunosuppression agent.

As used herein a "pharmaceutical composition" refers to a preparation of one or more of the active ingredients described herein with other chemical components such as physiologically suitable carriers and excipients. The purpose of a pharmaceutical composition is to facilitate administration of a compound to an organism.

Herein the term "active ingredient" refers to the nucleic acid construct, the nucleic acid construct system or the toxoplama transformed with the nucleic acid construct or with the nucleic acid construct system of some embodiments of the invention accountable for the biological effect.

Hereinafter, the phrases "physiologically acceptable carrier" and "pharmaceutically acceptable carrier" which may be interchangeably used refer to a carrier or a diluent that does not cause significant irritation to an organism and does not abrogate the biological activity and properties of the administered compound. An adjuvant is included under these phrases.

Herein the term "excipient" refers to an inert substance added to a pharmaceutical composition to further facilitate administration of an active ingredient. Examples, without limitation, of excipients include calcium carbonate, calcium phosphate, various sugars and types of starch, cellulose derivatives, gelatin, vegetable oils and polyethylene glycols.

Techniques for formulation and administration of drugs may be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest edition, which is incorporated herein by reference.

Suitable routes of administration may, for example, include oral, rectal, transmucosal, especially transnasal, intestinal or parenteral delivery, including intramuscular, subcutaneous and intramedullary injections as well as intrathecal, direct intraventricular, intracardiac, e.g., into the right or left ventricular cavity, into the common coronary artery, intravenous, intraperitoneal, intranasal, or intraocular injections.

Conventional approaches for drug delivery to the central nervous system (CNS) include: neurosurgical strategies (e.g., intracerebral injection or intracerebroventricular infusion); molecular manipulation of the agent (e.g., production of a chimeric fusion protein that comprises a transport peptide that has an affinity for an endothelial cell surface molecule in combination with an agent that is itself incapable of crossing the BBB) in an attempt to exploit one of the endogenous transport pathways of the BBB; pharmacological strategies designed to increase the lipid solubility of an agent (e.g., conjugation of water-soluble agents to lipid or cholesterol carriers); and the transitory disruption of the integrity of the BBB by hyperosmotic disruption (resulting from the infusion of a mannitol solution into the carotid artery or the use of a biologically active agent such as an angiotensin peptide).

Alternately, one may administer the pharmaceutical composition in a local rather than systemic manner, for example, via injection of the pharmaceutical composition directly into a tissue region of a patient.

The term "tissue" refers to part of an organism consisting of cells designed to perform a function or functions. Examples include, but are not limited to, brain tissue, retina, skin tissue, hepatic tissue, pancreatic tissue, bone, cartilage, connective tissue, blood tissue, muscle tissue, cardiac tissue, vascular tissue, renal tissue, pulmonary tissue, gonadal tissue, hematopoietic tissue.

Pharmaceutical compositions of some embodiments of the invention may be manufactured by processes well known in the art, e.g., by means of conventional mixing, dissolving, culturing, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or lyophilizing processes. For example, manufacturing of toxoplasma can be by culturing the toxoplasma in cell cultures.

Pharmaceutical compositions for use in accordance with some embodiments of the invention thus may be formulated in conventional manner using one or more physiologically acceptable carriers comprising excipients and auxiliaries, which facilitate processing of the active ingredients into preparations which, can be used pharmaceutically. Proper formulation is dependent upon the route of administration chosen.

For injection, the active ingredients of the pharmaceutical composition may be formulated in aqueous solutions, preferably in physiologically compatible buffers such as Hank's solution, Ringer's solution, or physiological salt buffer. For transmucosal administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art. For oral administration, the pharmaceutical composition can be formulated readily by combining the active compounds with pharmaceutically acceptable carriers well known in the art. Such carriers enable the pharmaceutical composition to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, foodstuff, slurries, suspensions, and the like, for oral ingestion by a patient. Pharmacological preparations for oral use can be made using a solid excipient, optionally grinding the resulting mixture, and processing the mixture of granules, after adding suitable auxiliaries if desired, to obtain tablets or dragee cores. Suitable excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium carbomethylcellulose; and/or physiologically acceptable polymers such as polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, such as cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as sodium alginate.

Dragee cores are provided with suitable coatings. For this purpose, concentrated sugar solutions may be used which may optionally contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, titanium dioxide, lacquer solutions and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for identification or to characterize different combinations of active compound doses.

Pharmaceutical compositions which can be used orally, include push-fit capsules made of gelatin as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or sorbitol. The push-fit capsules may contain the active ingredients in admixture with filler such as lactose, binders such as starches, lubricants such as talc or magnesium stearate and, optionally, stabilizers. In soft capsules, the active ingredients may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, stabilizers may be added. All formulations for oral administration should be in dosages suitable for the chosen route of administration.

For buccal administration, the compositions may take the form of tablets or lozenges formulated in conventional manner. For administration by nasal inhalation, the active ingredients for use according to some embodiments of the invention are conveniently delivered in the form of an aerosol spray presentation from a pressurized pack or a nebulizer with the use of a suitable propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, dichloro- tetrafluoroethane or carbon dioxide. In the case of a pressurized aerosol, the dosage unit may be determined by providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin for use in a dispenser may be formulated containing a powder mix of the compound and a suitable powder base such as lactose or starch.

The pharmaceutical composition described herein may be formulated for parenteral administration, e.g., by bolus injection or continuous infusion. Formulations for injection may be presented in unit dosage form, e.g., in ampoules or in multidose containers with optionally, an added preservative. The compositions may be suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents.

Pharmaceutical compositions for parenteral administration include aqueous solutions of the active preparation in water-soluble form. Additionally, suspensions of the active ingredients may be prepared as appropriate oily or water based injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acids esters such as ethyl oleate, triglycerides or liposomes. Aqueous injection suspensions may contain substances, which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol or dextran. Optionally, the suspension may also contain suitable stabilizers or agents which increase the solubility of the active ingredients to allow for the preparation of highly concentrated solutions.

Alternatively, the active ingredient may be in powder form for constitution with a suitable vehicle, e.g., sterile, pyrogen-free water based solution, before use.

The pharmaceutical composition of some embodiments of the invention may also be formulated in rectal compositions such as suppositories or retention enemas, using, e.g., conventional suppository bases such as cocoa butter or other glycerides.

Pharmaceutical compositions suitable for use in context of some embodiments of the invention include compositions wherein the active ingredients are contained in an amount effective to achieve the intended purpose. More specifically, a therapeutically effective amount means an amount of active ingredients (e.g., the nucleic acid construct, the nucleic acid construct system or the toxoplasma transformed with the nucleic acid construct or with the nucleic acid construct system of some embodiments of the invention) effective to prevent, alleviate or ameliorate symptoms of a disorder (e.g., a disorder affecting the central nervous system of the subject) or prolong the survival of the subject being treated.

Determination of a therapeutically effective amount is well within the capability of those skilled in the art, especially in light of the detailed disclosure provided herein.

For any preparation used in the methods of the invention, the therapeutically effective amount or dose can be estimated initially from in-vitro assays, ex-vivo assays, cell culture assays, and/or animal models. For example, a dose can be formulated in animal models to achieve a desired concentration or titer. Such information can be used to more accurately determine useful doses in humans.

Toxicity and therapeutic efficacy of the active ingredients described herein can be determined by standard pharmaceutical procedures in vitro, in cell cultures or experimental animals. The data obtained from these in vitro and cell culture assays and animal studies can be used in formulating a range of dosage for use in human. The dosage may vary depending upon the dosage form employed and the route of administration utilized. The exact formulation, route of administration and dosage can be chosen by the individual physician in view of the patient's condition. (See e.g., Fingl, et al., 1975, in "The Pharmacological Basis of Therapeutics", Ch. 1 p. l).

Dosage amount and interval may be adjusted individually to provide tissue levels (e.g., brain levels) of the active ingredient which are sufficient to induce or suppress the biological effect (minimal effective concentration, MEC). The MEC may vary for each preparation, but can be estimated from in vitro data. Dosages necessary to achieve the MEC will depend on individual characteristics and route of administration. Detection assays can be used to determine plasma, tissue or CSF (cerebro- spinal fluid) concentrations.

Depending on the severity and responsiveness of the condition to be treated, dosing can be of a single or a plurality of administrations, with course of treatment lasting from several days to several weeks, or e.g., a lifetime in the case of chronic diseases, or until cure is effected or diminution of the disease state is achieved. For example, for chronic diseases, the dosing can be adjusted until symptoms are managed, stabilized and/or ameliorated.

The amount of a composition to be administered will, of course, be dependent on the subject being treated, the severity of the affliction, the manner of administration, the immune state of the subject, the judgment of the prescribing physician, etc.

Compositions of some embodiments of the invention may, if desired, be presented in a pack or dispenser device, such as an FDA approved kit, which may contain one or more unit dosage forms containing the active ingredient. The pack may, for example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may be accompanied by instructions for administration. The pack or dispenser may also be accommodated by a notice associated with the container in a form prescribed by a governmental agency regulating the manufacture, use or sale of pharmaceuticals, which notice is reflective of approval by the agency of the form of the compositions or human or veterinary administration. Such notice, for example, may be of labeling approved by the U.S. Food and Drug Administration for prescription drugs or of an approved product insert. Compositions comprising a preparation of the invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an appropriate container, and labeled for treatment of an indicated condition, as is further detailed above.

As used herein the term "about" refers to + 10 %

The terms "comprises", "comprising", "includes", "including", "having" and their conjugates mean "including but not limited to".

The term "consisting of means "including and limited to".

The term "consisting essentially of" means that the composition, method or structure may include additional ingredients, steps and/or parts, but only if the additional ingredients, steps and/or parts do not materially alter the basic and novel characteristics of the claimed composition, method or structure.

As used herein, the singular form "a", "an" and "the" include plural references unless the context clearly dictates otherwise. For example, the term "a compound" or "at least one compound" may include a plurality of compounds, including mixtures thereof. Throughout this application, various embodiments of this invention may be presented in a range format. It should be understood that the description in range format is merely for convenience and brevity and should not be construed as an inflexible limitation on the scope of the invention. Accordingly, the description of a range should be considered to have specifically disclosed all the possible subranges as well as individual numerical values within that range. For example, description of a range such as from 1 to 6 should be considered to have specifically disclosed subranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual numbers within that range, for example, 1, 2, 3, 4, 5, and 6. This applies regardless of the breadth of the range.

Whenever a numerical range is indicated herein, it is meant to include any cited numeral (fractional or integral) within the indicated range. The phrases "ranging/ranges between" a first indicate number and a second indicate number and "ranging/ranges from" a first indicate number "to" a second indicate number are used herein interchangeably and are meant to include the first and second indicated numbers and all the fractional and integral numerals therebetween.

As used herein the term "method" refers to manners, means, techniques and procedures for accomplishing a given task including, but not limited to, those manners, means, techniques and procedures either known to, or readily developed from known manners, means, techniques and procedures by practitioners of the chemical, pharmacological, biological, biochemical and medical arts.

As used herein, the term "treating" includes abrogating, substantially inhibiting, slowing or reversing the progression of a condition, substantially ameliorating clinical or aesthetical symptoms of a condition or substantially preventing the appearance of clinical or aesthetical symptoms of a condition.

When reference is made to particular sequence listings, such reference is to be understood to also encompass sequences that substantially correspond to its complementary sequence as including minor sequence variations, resulting from, e.g., allelic variation, sequencing errors, cloning errors, or other alterations resulting in base substitution, base deletion or base addition, provided that the frequency of such variations is less than 1 in 50 nucleotides, alternatively, less than 1 in 100 nucleotides, alternatively, less than 1 in 200 nucleotides, alternatively, less than 1 in 500 nucleotides, alternatively, less than 1 in 1000 nucleotides, alternatively, less than 1 in 5,000 nucleotides, alternatively, less than 1 in 10,000 nucleotides. It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable subcombination or as suitable in any other described embodiment of the invention. Certain features described in the context of various embodiments are not to be considered essential features of those embodiments, unless the embodiment is inoperative without those elements.

Various embodiments and aspects of the present invention as delineated hereinabove and as claimed in the claims section below find experimental support in the following examples.

EXAMPLES

Reference is now made to the following examples, which together with the above descriptions illustrate some embodiments of the invention in a non limiting fashion.

Generally, the nomenclature used herein and the laboratory procedures utilized in the present invention include molecular, biochemical, microbiological and recombinant DNA techniques. Such techniques are thoroughly explained in the literature. See, for example, "Molecular Cloning: A laboratory Manual" Sambrook et al., (1989); "Current Protocols in Molecular Biology" Volumes I-III Ausubel, R. M., ed. (1994); Ausubel et al., "Current Protocols in Molecular Biology", John Wiley and Sons, Baltimore, Maryland (1989); Perbal, "A Practical Guide to Molecular Cloning", John Wiley & Sons, New York (1988); Watson et al., "Recombinant DNA", Scientific American Books, New York; Birren et al. (eds) "Genome Analysis: A Laboratory Manual Series", Vols. 1-4, Cold Spring Harbor Laboratory Press, New York (1998); methodologies as set forth in U.S. Pat. Nos. 4,666,828; 4,683,202; 4,801,531; 5,192,659 and 5,272,057; "Cell Biology: A Laboratory Handbook", Volumes I-III Cellis, J. E., ed. (1994); "Current Protocols in Immunology" Volumes I-III Coligan J. E., ed. (1994); Stites et al. (eds), "Basic and Clinical Immunology" (8th Edition), Appleton & Lange, Norwalk, CT (1994); Mishell and Shiigi (eds), "Selected Methods in Cellular Immunology", W. H. Freeman and Co., New York (1980); available immunoassays are extensively described in the patent and scientific literature, see, for example, U.S. Pat. Nos. 3,791,932; 3,839,153; 3,850,752; 3,850,578; 3,853,987; 3,867,517; 3,879,262; 3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; 4,098,876; 4,879,219; 5,011,771 and 5,281,521 ; "Oligonucleotide Synthesis" Gait, M. J., ed. (1984); "Nucleic Acid Hybridization" Hames, B. D., and Higgins S. J., eds. (1985); "Transcription and Translation" Hames, B. D., and Higgins S. J., Eds. (1984); "Animal Cell Culture" Freshney, R. I., ed. (1986); "Immobilized Cells and Enzymes" IRL Press, (1986); "A Practical Guide to Molecular Cloning" Perbal, B., (1984) and "Methods in Enzymology" Vol. 1-317, Academic Press; "PCR Protocols: A Guide To Methods And Applications", Academic Press, San Diego, CA (1990); Marshak et al., "Strategies for Protein Purification and Characterization - A Laboratory Course Manual" CSHL Press (1996); all of which are incorporated by reference as if fully set forth herein. Toxoplasma gondii: the model apicomplexan - Perspectives and methods. LM Weiss, K Kim, 2011. Other general references are provided throughout this document. The procedures therein are believed to be well known in the art and are provided for the convenience of the reader. All the information contained therein is incorporated herein by reference.

GENERAL MATERIALS AND EXPERIMENTAL METHODS

Toxoplasma gondii culture

Parasites were grown on human foreskin fibroblasts (HFF) in Dulbecco's modified Eagle's medium (DMEM) supplemented with 2 mM L-Glutamine, Penicillin and Streptomycin or Gentamicin antibiotic mix and 10% Fetal Bovine Serum (FBS), thereby referred to as "complete DMEM media". Cultures were monitored daily, and the parasites were passaged by transferring a drop of the supernatant of a lysed dish or a drop of syringe-released intracellular parasites into a fresh dish with HFF.

Generation of T. gondii expression vectors for expression of heterologous polypeptides comprised of a Toxoplasma secreted protein fused to a pharmaceutical polypeptide -of-interest

In order to generate stable transgenic lines of T. gondii that express specific therapeutic proteins, the present inventors first generated T. gondii expression constructs by molecular cloning. The constructs include a long open reading frame (ORF), which consists of Toxofilin cDNA [SEQ ID NO: 4481] or GRA16 cDNA [SEQ ID NO: 4472] followed by an HA tag coding sequence (SEQ ID NO: 4474) followed by the therapeutic mammalian cDNA of interest, such as the murine MeCP2 isoform 1 [SEQ ID NO: 4476]; the human ASPA [SEQ ID NO: 4483], the human SMN1 [SEQ ID NO: 4479]; the human PARK2 [SEQ ID NO: 4478]; the human GDNF [SEQ ID NO: 4489]; the human TFEB (Transcription factor EB) isoform 1 [SEQ ID NO: 4600] sequence; the human GALC isoform 1 [SEQ ID NO:4486 or SEQ ID NO:4601 (GALC version 2); the GALC-TAT that includes a Glycine flexible linker (Gly) [SEQ ID NO:4490] followed by a protein transduction domain (TAT) that facilitates nonclassical trafficking across membranes in order to aid the targeting of GALC to the lysosome of the host cell (SEQ ID NO: 4488) or the "mutated GALC-TAT" (GALC-TAT that harbors a deletion) (SEQ ID NO: 4487). The cDNA of interest can also be codon- optimized according to the codon usage of T. gondii in order to increase efficiency of expression, stability and localization of the heterologous polypeptide in T. gondii (such as with the codon optimized murine MeCP2 isoform 1 [SEQ ID NO: 4477], codon optimized human ASPA [SEQ ID NO: 4602], codon optimized human GALC isoform 1 [SEQ ID NO: 4603] and codon optimized human TFEB isoform 1 [SEQ ID NO: 4604] codon optimized for expression in T. gondii). The T. gondii expression vector also includes regulatory elements from the same or other Toxoplasma genes that guarantee its correct expression and targeting. Upstream of the ORF is the endogenous 5' regulatory sequence which can be either the endogenous 5' regulatory sequence of the Toxofilin gene, here referred to as the "Toxofilin promoter" or "Toxofilin 5'-UTR" (SEQ ID NO: 4482) or the endogenous 5' regulatory sequence of the GRA16 gene, here referred to as the "GRA16 promoter" or "GRA16 5'-UTR" (SEQ ID NO: 4473). Downstream of the ORF is the 3'-UTR of the abundant dense granule protein GRA2 (SEQ ID NO: 4491; Figure 3). The expression vector also included a separate ORF containing the selectable markers HXGPRT [SEQ ID NO: 4475] or DHFR-TS [SEQ ID NO: 4484 or 4606] or mCherry [SEQ ID NO: 4608], surrounded by the 5' UTR of DHFR-TS (upstream) [SEQ ID NO:4492] and 3' UTR of DHFR-TS (downstream) [SEQ ID NO: 4493 or 4609] . Generation of transgenic T. gondii

T. gondii of the type I strains RH or RHAHX or the type II strains Prugniaud, Prugniaud-GFP-Luciferase or Prugniaud ΔΗΡΤ (also known as Prugniaud ΔΗΧ) were used for generating transgenic strains. Extracellular Tachyzoites were collected or mechanically egressed using a 22-26 gauge needle, filtered from cellular debris, pelleted and resuspended in 300 μΐ cytomix buffer (120 mM KC1, 0.15 mM CaCl 2 10 mM K 2 HP0 4 /KH 2 P0 4 pH 7.6, 25 mM HEPES pH 7.6, 2 mM EGTA, 5 mM MgCl 2 ) with freshly added 2 mM ATP and 5 mM GSH in 30 μΐ each (360 μΐ total). 20-60 μg plasmid DNA, or DNA linearized with the enzyme Seal was transfected into the tachyzoites by electroporation with a BTX ECM electroporator. A few drops of the transfected cells were transferred onto IFA wells (immunofluorescence assay wells of a 24-well plate with HFF cells seeded on glass coverslips) and immunofluorescently stained for assessment of transfection efficiency and transient protein expression (expression from plasmid DNA). The rest of the transfected parasites were transferred onto a fresh flask with HFF in complete DMEM. If a drug-resistance selectable marker was used, the next day media was changed to fresh media containing the drug used for selection (1 μΜ pyrimethamine for DHFR-TS selection or 25 μg/ml mycophenolic acid + 50 μg/ml xanthine for HXGPRT selection). Starting from the day the parasites started egressing from the HFF, drops of the supernatant containing extracellular parasites were passed every 1-2 days into a second flask of HFF in selective media containing the selection drug. When the parasites in the second flask started egressing from the HFF, drops of the supernatant were passed every 1-2 days from the second flask into a third flask, and so on. If a fluorescent protein selectable marker was used, when the parasites in the second flask egressed out of about 50%-90% of the HFF cells, the media with extracellular parasites was collected, filtered from cell debris and sorted using a FACS machine. Parasites emitting fluorescence corresponding to the fluorescent protein used for selection were collected and passed into the next flask of HFF in complete DMEM media. After the 3rd-5th flask started lysing (about 2-5 weeks), the parasites were considered a "stable pool" containing parasites that integrated the DNA construct into their genome. A drop of the stable pool was passed onto IFA (immuno fluorescent assay) wells and immunofluorescently stained to assess percentage of construct-positive parasites in the pool, protein expression and protein localization in the genome- integrated ("stable") parasites (genomic expression). Clonal lines were established by manual sorting by limiting dilutions or by FACS sorting one parasite per well in 96 well plates. Plates were kept undisturbed for 5-10 days, after which they were screened by eye using a bright field microscope to detect wells with a single plaque which is assumed to originate from a single parasite. Wells with a single plaque were mixed vigorously with a pipette and 50 μΐ of the mixture was passed to a new dish and to a PCR tube. Lysates were made by pelleting the collected cells in the PCR tubes, resuspending them in 10 μΐ lysis buffer consisted of 10% proteinase K in TE buffer and incubation in 60°C for 60 minutes followed by 95°C for 10 minutes in a thermocycler. One micorliter (1 μΐ) of the lysates were used as template for PCR screening using primers specific for the genetic construct. PCR-positive clones were passed onto IFA wells, fixed, immunofluorescently stained and analysed using a fluorescent microscope and polarized light microscope for protein expression and protein localization in each clone.

Antibodies

Antibodies and the respective concentrations they were used in, are as follows: anti-HA (Roche, IFA: 1: 1000, WB: 1: 1000), anti-IMC-1 (gift from Prof. Dominique Soldati-Favre, IFA: 1: 1000-1:2000), anti-MeCP2 (Cell Signaling, IFA: 1:200, WB: 1: 1000, IP: 1:26), anti-NeuN (Abeam, IFA: 1:500), anti-NCoRl (Bethyl Laboratories, WB: 1: 1000), anti-TBLl (Abeam, WB: 1: 1000), Alexa Fluor anti-rat 488 and 594 (Invitrogen, IFA: 1: 1000), Alexa Fluor anti-rabbit 488 and 594 (Invitrogen, IFA: 1: 1000), Alexa Fluor anti-mouse 488 and 594 (Invitrogen, IFA: 1: 1000).

EXAMPLE 1

GENERATION OF T GONDII CONSTRUCT WHICH LEADS TO THE

EXPRESSION, LOCALIZATION TO PARASITE'S SECRETORY ORGANELLES AND SECRETION OF A THERAPEUIC PROTEIN -OF -INTEREST INTO THE

HOST CELL

Experimental Procedures

The constructs were transfected into RH (RHhxgprt) or Pru (Pruhxgprt) strain T. gondii parasites by electroporation. This is followed by a process of selection for parasites expressing the selection markers (either Pyrimethamine selection for a construct containing DHFR-TS (SEQ ID NO: 4484), which was co-transfected with the construct-of-interest), or using MPA+Xanthine for selection based on expression of HXGPRT (SEQ ID NO:4475), cloning by limiting dilutions or flow cytometry, and PCR-based screening. The positive clones were verified by Western Blot analysis and by immunofluorescence staining to characterize the expression and specific localization of the HA-tagged therapeutic protein in the secretory rhoptry organelles, dense granule organelles, parasitophorous vacuole space, parasitophorous vacuole membrane, or in the host cells. The verified positive lines were further maintained on HFF (human foreskin fibroblast) cells.

Experimental Results

Generation of constructs and T. gondii strains for delivery of the therapeutic proteins Galactocerebrosidase, Galactocerebrosidase-TAT, GDNF, Aspartoacylase, MeCP2, Survival motor neuron protein and E3 ubiquitin-protein ligase parkin - The results demonstrate that T. gondii can be engineered to express heterologous (mammalian, e.g., human) therapeutic proteins fused to an endogenous protein of the parasite. Using this approach, the present inventors have generated 3 novel stable T. gondii transgenic lines expressing the 3 human proteins associated with common neuropathologies, Galactocerebrosidase-TAT (the mutated GALC-TAT SEQ ID NO: 4487), MeCP2 (codon optimized SEQ ID NO: 4477) and GDNF (SEQ ID NO: 4489), and additional 4 mixed-population pools containing stable T. gondii transgenic parasites expressing the 3 human proteins associated with common neuropathologies Galactocerebrosidase (GALC, isoform 1, SEQ ID NO:4486), Galactocerebrosidase- TAT (the WT GALC-TAT SEQ ID NO: 4488), Aspartoacylase (ASPA SEQ ID NO:4483) and Survival Motor Neuron Protein (SMN1, SEQ ID NO: 4479).

Improving secretion and uptake of the therapeutic protein by a viral protein transduction domain - In order to improve secretion of the Galactocerebrosidase protein from the host cell to the extracellular space, as well as the uptake of the protein by neighboring cells, the present inventors have fused Galactocerebrosidase to a viral protein transduction domain (PTD). Tat-PTD has been previously shown to significantly increase (by 6 fold) the cross -correction efficiency of Galactocerebrosidase through enhancing both the secretion and uptake of the enzyme in a cell culture model system (Meng, X.-L., et al., 2013). Localization of the therapeutic proteins Galactocerebrosidase-TAT, Aspartoacylase, MeCP2 and Survival Motor Neuron Protein to T. gondii rhoptries secretory organelles - The results demonstrate that T. gondii can localize heterologous (mammalian, e.g., human) therapeutic proteins to the parasite's rhoptries secretory organelles (Figure 4A), from which they could potentially be released into the host cell cytoplasm and localize to target sites within the infected cell. The fusion proteins containing Galactocerebrosidase-TAT, Aspartoacylase, MeCP2 and Survival Motor Neuron Protein localized to the parasite rhoptries as reported for previous Toxofilin fusions that evidently made their way into the host cell. This was proven by immunofluorescence staining (IFAs) (Figures 4B-I).

Thus, the present inventors have succeeded in demonstrating that T. gondii may be engineered to express heterologous (mammalian, e.g., human) proteins fused to an endogenous protein of the parasite, and that these proteins localize to the parasite's rhoptries secretory organelles. The present inventors have engineered the human proteins to be fused to Toxofilin, an endogenous protein that is naturally secreted into host cells by the parasites, which allows to "ride" on the parasite's protein secretion machinery for the targeting and secretion of proteins of interest (e.g., a therapeutic protein). This is an important milestone on the path to suggesting T. gondii as a feasible basis for this technology.

Using this approach, the present inventors have generated a novel stable T. gondii transgenic line, expressing and localizing to the rhoptries the human protein associated with Krabbe disease Galactocerebrosidase-TAT (the mutated GALC-TAT SEQ ID NO: 4487), and additional 4 mixed-population pools containing stable T. gondii transgenic parasites expressing and localizing to the rhoptries the 3 human proteins associated with common neuropathologies Galactocerebrosidase-TAT (the WT GALC- TAT SEQ ID NO: 4488), Aspartoacylase (ASPA SEQ ID NO:4483), MeCP2 (SEQ ID NO:4477) and Survival Motor Neuron Protein (SMN1, SEQ ID NO: 4479).

Localization of the therapeutic proteins Aspartoacylase, MeCPl and Survival Motor Neuron Protein to T. gondii Dense granules secretory organelles and parasitophorous vacuolar space - The results demonstrate that T. gondii can localize the heterologous (mammalian) human therapeutic proteins to the parasite's Dense granules secretory organelles and parasitophorous vacuolar space (Figure 5A), from which they could be released into the host cell cytoplasm and localize to target sites within the infected cell (Figures 5B-I). As this approach of using sequences from dense granule proteins to drive the secretion of a protein of interest from the parasite dense granules is unprecedented, this is the first time mammalian proteins have been found to be secreted into the parasitophorous vacuolar space by fusion to a dense granule protein (GRA16). This was proven by immunofluorescence staining (IFAs) (Figures 5B-I). The fusion proteins are then expected to pass through the parasitophorous vacuolar membrane into the host cell, based on the known localization of the native GRA16 protein they are fused to (Bougdour et al. 2013).

Thus, the present inventors have succeeded in demonstrating that T. gondii may be engineered to express heterologous (mammalian) proteins fused to an endogenous protein of the parasite, and that these proteins localize to the parasite's Dense granules secretory organelles and parasitophorous vacuolar space. The present inventors have engineered the human proteins to be fused to GRA16, an endogenous protein that is naturally secreted into host cells by the parasites, which allows to "ride" on the parasite's protein secretion machinery for the targeting and secretion of proteins of interest (e.g., a therapeutic protein). This is an important milestone on the path to suggesting T. gondii as a feasible basis for this technology.

Using this approach, the present inventors have generated a novel stable T. gondii transgenic line, expressing and localizing to the Dense granule secretory organelles and parasitophorous vacuolar space the human protein associated with Rett Syndrome MeCP2, and additional 2 mixed-population pools containing stable T. gondii transgenic parasites expressing and localizing to the Dense granule secretory organelles and parasitophorous vacuolar space the two human proteins associated with common neuropathologies Aspartoacylase and Survival Motor Neuron Protein.

Secretion of the therapeutic protein MeCP2 from the transgenic parasite into the host cell, and localization in the region of activity in the host cell, the nucleus - The results demonstrate that protein fusions consisting of a pharmaceutical polypeptide fused to a Toxoplasma gondii secreted polypeptide (and specifically here, the therapeutic protein MeCP2 fused to the T. gondii dense granule protein GRA16), can be released from the parasite into the host cell and translocate to the therapeutic protein's target site in the host cell (here, the nucleus). The fusion protein containing MeCP2 localized to the host cell nucleus as shown by immunofluorescence staining (IFAs) (Figures 5F-I). This is an important milestone on the path to suggesting T. gondii as a feasible basis for this technology.

Using this approach, the present inventors have generated novel stable T. gondii transgenic lines, expressing and localizing to the host cell nucleus the human protein associated with Rett Syndrome, MeCP2 (Figures 5F-I).

EXAMPLE 2

RESCUE OF GALACTOCEREBROSIDASE DEFICIENCY THROUGH PARASITIC DELIVERY

The Galactocerebrosidase enzymatic activity assay which is based on the enzymatic breakup of a synthetic fluorescent substrate of Galactocerebrosidase named 6-Hexadecanoylamino-4-methylumbelliferyl-beta-D-galactoside (6HMU-beta-D- galactoside) is based on the manufacturer's protocol [Otto P. van Diggelen. MOSCERDAM SUBSTRATES, Laboratory protocol for enzyme analysis for Krabbe disease; Galactocerebrosidase]. Using Krabbe patient-derived fibroblasts, the assay is used to evaluate the ability of the GALC- and GALC-TAT-expressing parasite's to rescue Krabbe disease phenotypes in culture. Untreated Krabbe disease cells (Krabbe patient-derived fibroblasts), Krabbe disease cells transfected with sham vector (empty human expression vector) and/or Krabbe disease cells infected with T. gondii parasites not expressing GALC are used as negative controls. Positive controls are wild type (WT) cell lines, or the Krabbe disease cells transfected with human expression vectors driving the expression of either the standalone (HA-tagged) GALC or GALC-TAT gene or a Toxofilin/GRA16-fused GALC or GALC-TAT gene, identical to the fusion expressed by the transgenic parasites tested. This allows ruling out the possibility that any negative results could be caused by a lack of sufficient effect of the protein itself, or from disruption of the therapeutic protein's activity by the fusion of Toxofilin or GRA16 to the N-terminus of the therapeutic protein, and thus allows quantifiable and direct inference of the efficiency of the parasitic delivery system.

The assay is used to characterize the Galactocerebrosidase-expressing parasites' ability to rescue the Galactocerebrosidase-deficiency shown in Krabbe disease patient- derived fibroblasts. This experiment demonstrates the transgenic parasites' ability to rescue Krabbe disease by delivering the supplementary protein to cells in culture.

EXAMPLE 3

RESCUE OF MECP2 DEFICIENCY THROUGH PARASITIC DELIVERY

Neural and glial cells derived from mouse models of Rett syndrome that harbor a mutation or deletion of the MECP2 gene present an array of abnormal morphological and molecular characteristics. [Percy, A. Rett syndrome: coming to terms with treatment. Adv. Neurosci. (2014), [de la Torre-Ubieta et al. Advancing the understanding of autism disease mechanisms through genetics. Nat Med. 2016].

Using a Rett Syndrome cell culture model, the quantitative scoring of the molecular and morphological characteristic phenotypes of Rett syndrome is used to evaluate the ability of the MeCP2-expressing parasite's to rescue Rett Syndrome phenotypes in culture. Untreated Rett Syndrome model cells, Rett Syndrome model cells transfected with sham vector and/or Rett Syndrome model cells infected with T. gondii parasites not expressing MeCP2 are used as negative controls. Positive controls are WT cell lines and Rett Syndrome model cells transfected with human expression vectors driving the expression of either the standalone (HA-tagged) MECP2 coding sequence or a Toxofilin/GRA16-fused MECP2 coding sequence identical to the fusion expressed by the transgenic parasites tested. This allows ruling out the possibility that any negative results could be caused by a lack of sufficient effect of the protein itself, or from disruption of the therapeutic protein's activity by the fusion of Toxofilin or GRA16 to the N-terminus of the therapeutic protein, and thus allows quantifiable and direct inference of the efficiency of the parasitic delivery system.

The assay is used to characterize the MeCP2-expressing parasites' ability to rescue the MeCP2-deficiency shown in a Rett Syndrome cell culture model. This experiment demonstrates the transgenic parasites' ability to rescue a Rett Syndrome by delivering the supplementary protein to cells in culture. EXAMPLE 4

TESTING THERAPEUTIC POTENTIAL OF THE TRANSGENIC PARASITE

LINES IN ANIMAL MODELS

The transgenic lines of the parasite are tested for their ability to affect phenotypes in whole organism. Testing the transgenic parasites in mice allows the assessment of the system in the presence of an immune system and a functional blood brain barrier (BBB). The transgenic T. gondii parasites are administered to the respective animal model of the condition being tested intranasally, enterally (such as orally), subcutaneously, intra-muscularly, intravenously, intradermally, intraperitoneally, intracranially or intracerebrally. The transgenic T. gondii parasites are administered in the form of tachyzoites, bradyzoite tissue cysts or oocysts.

Following inoculation with the parasites, the animal model are characterized and assessed for the manifestation of phenotypes related to the condition studied (such as pathological phenotypes of the disease). Phenotypes include behavioral, morphological and molecular phenotypes. The effects of the transgenic parasites are revealed by comparison to animals treated with parasites not expressing the therapeutic polypeptide and to sham-treated animals.

For instance, mice model of Rett Syndrome treated with T. gondii parasites expressing the therapeutic protein MeCP2 are assessed by known phenotypes of the disease which include impaired social interaction, increased scent marking, hindlimb clasping, reduced lifspan, anxiety, respiratory problem, hypoactivity, impaired motor coordination, miRNA-mediated IGF1 deficit, decreased level of BDNF, impaired cortical plasticity, glial dysfunction, GABAergic neuronal dysfunction, decreased soma size, decreased dendritic spine density, decreased synapse number, decreased activity (Ca2+ imaging), decreased sEPSC and sIPSC frequency and amplitude, decreased Tuj l+ cell number, decreased PAX6, SCN1A/1B expression, decreased dendrite complexity in cortical neurons, decreased AP frequency in cortical neurons, decreased total transcription in cortical neurons, decreased expression of neuronal, synaptic, highly-expressed genes, immediate early genes and mitochondrial genes in cortical neurons, decreased mRNA translation in cortical neurons, decreased BDNF secretion from cortical neurons, decreased AKT/mTOR pathway activity in cortical neurons and decreased oxygen consumption and maximal respiration in cortical neurons [de la Torre-Ubieta et al. Advancing the understanding of autism disease mechanisms through genetics. Nat Med. 2016.]

EXAMPLE 5

GENERATION OF CONSTRUCTS AND T GONDII STRAINS FOR DELIVERY OF CUSTOMIZED TRANSCRIPTION ACTIVATOR-LIKE EFFECTORS (TALEs)

T. gondii are engineered to express and secrete customized Transcription activator-like effectors (TALE). Customized TALEs can be used for a wide variety of genome engineering applications, including transcriptional modulation and genome editing. The DNA recognition and binding region of TALEs are made up of tandem repeats of 34-aa sequences (monomers) which determine the sequence of the recognized DNA sequence. There are 4 types of monomers which differ in their repeat variable diresidues (RVDs) which determines the nucleotide they recognize. The monomer termed "NI" (SEQ ID NO: 4508) is specific for the "A" nucleotide; the monomer termed "HD" (SEQ ID NO: 4495) is specific to the "C" nucleotide"; the monomer termed "NG" (SEQ ID NO: 4509) is specific to the "T" nucleotide and the monomer termed "NN" (SEQ ID NO: 4494) is specific the "G" or "A" nucleotide. More recently, the additional RVD NH (SEQ ID NO:4504) was shown to provide higher G- specificity. The tandem repeat DNA-binding domain always ends with a half-length repeat (0.5 repeat). Customized TALE DNA-binding domains can be used to generate custom TALE transcription factors (TALE-TFs) and modulate the transcription of endogenous genes from the genome, by fusion to the synthetic VP64 transcriptional activator. Customized TALE DNA-binding domains can also be used to generate custom TALE nucleases (TALENs) and can be used to generate site-specific double-strand breaks to facilitate genome editing through nonhomologous repair or homology directed repair, by fusion to the catalytic domain of Fokl endonuclease. The TALEs are made by constructing a DNA-binding domain from the monomers (SEQ IDs NO:4508, 4509, 4494, 4495 and 4504), which is inserted into a TALEN backbone or TALE-TF backbone. Because the backbones also include the 0.5 repeat at the end of the DNA binding domain, there are 4 types of TALEN backbones (e.g., SEQ ID NOs:4500, 4501, 4502, and 4503), as well as 4 TALE-TF backbones (e.g., SEQ ID NO:4496, 4497, 4498, and 4499). The backbones also contain cytomegalovirus promoter (CMV), nonrepetitive N terminus from the Hax3 TALE (N-term), nonrepetitive C terminus from the Hax3 TALE (C-term), type lis restriction sites used for the insertion of custom TALE DNA- binding domains (such as Bsal), negative selection cassette containing the ccdB negative selection gene and chloramphenicol resistance gene (ccdB + CmR), nuclear localization signal (NLS), and either a catalytic domain from the Fokl endonuclease (Fokl) or synthetic transcriptional activator derived from VP 16 protein of herpes simplex virus (VP64) followed by 2A self-cleavage linker (2A) and enhanced green fluorescent protein (EGFP). All are followed by a polyadenylation signal (polyA signal) (Sanjana et al., A transcription activator-like effector toolbox for genome engineering, 2012). The resulting TALE sequences are inserted downstream of the coding sequence of a T. gondii endogenous secreted polypeptide as described in the GENERAL MATERIALS AND EXPERIMENTAL METHODS hereinabove, and the resulting constructs are transfected into T. gondii to produce transgenic T. gondii strains that express and secreted the TALE.

For example, in order to target the TACGTACG (SEQ ID NO: 4505) sequence in a genome of a subject in need thereof, the TALE nuclease construct or the TALE transcription factor construct can be used. In the open reading frame of these constructs the monomers which are specific for targeting the exemplary "target sequence" (SEQ ID NO: 4505) are arranged one after the other, as shown schematically in Figure 6A (for TALE Nuclease) and Figure 6B (for TALE transcription factor). Sequence information of these constructs is available in the sequence listing (SEQ ID NOs: 4506 and 4507).

EXAMPLE 6

GENERATION OF T GONDII PARASITES THAT EXPRESS MAMMALIAN THERAPEUTIC PROTEINS AND LOCALIZE THEM TO THEIR RHOPTRY

SECRETORY ORGANELLES

In order to generate T. gondii parasites that express mammalian proteins and target them to their rhoptry secretory organelles the present inventors transfected type I RHAHX tachyzoites or RH (co-transfected with the pDHFR selection plasmid) T. gondii with plasmids containing sequences coding for the protein of interest fused to the rhoptry protein Toxofilin. The mammalian proteins tested in the Toxofilin fusion system were: glial derived neurotrophic factor (GDNF), parkin (PARK2), galactocerebrosidase (GALC), galactocerebrosidase fused to a TAT protein transduction domain (GALC-TAT; SEQ ID NO: 4487), methyl CpG binding protein (MeCP2), aspartoacylase (ASPA) and Survival of motor neuron (SMNl). In an attempt to improve protein expression and targeting, the present invenotrs also tested codon-optimized versions of some of the proteins, including GALC, MECP2, ASPA and the additional gene TFEB which was tested in the codon-optimized form (SEQ ID NO: 4604).

ASPA (SEQ ID NO: 4483), ASPA codon optimized (SEQ ID NO: 4602), GALC (SEQ ID NOs: 4486 and 4601), GALC-TAT (SEQ ID NO: 4488), PARK2 (SEQ ID NO: 4478), SMNl (SEQ ID NO: 4479), GDNF (SEQ ID NO:4489), MeCP2 (SEQ ID NO: 4476) and MeCP2 codon optimized (SEQ ID NO:4477) showed variable targeting of the fusion proteins in the parasites, which included rhoptry targeting as well as other patterns of targeting, resembling endoplasmic reticulum, golgi, nucleus, cytoplasm, apicoplast, microneme and other localizations in the parasites (Figures 9A- N).

GALC-TAT mutated form generated primarily rhoptry-like fusion protein localization. From GALC-TAT mutated the present invenotrs also generated clonal lines that strongly expressed the fusion protein and targeted it to a rhoptries (Figure 9H).

EXAMPLE 7

GENERATION OF T GONDII PARASITES THAT EXPRESS MAMMALIAN THERAPEUTIC PROTEINS, LOCALIZE THEM TO THEIR DENSE GRANULE SECRETORY ORGANELLES AND SECRETE THEM INTO THE PARASITOPHOROUS VACUOLE

In order to generate T. gondii parasites that express mammalian proteins and target them to their dense granule secretory organelles type I RHAHX T. gondii were transfected with plasmids containing sequences coding for the protein of interest fused to the dense granule protein GRA16. The mammalian proteins-coding genes tested in the GRA16 fusion system were GALC [human forms (SEQ ID NOs: 4486 and 4601) and codon-optimized (SEQ ID NO: 4603), GALC-TAT (human form SEQ ID NO: 4488), MECP2 (codon-optimized, SEQ ID NO:4477), ASPA [human form (SEQ ID NO:4483) and codon-optimized (SEQ ID NO:4602)], SMNl (human form, SEQ ID NO: 4479) and TFEB (codon-optimized, SEQ ID NO:4604). GALC, GALC codon optimized and GALC-TAT were expressed in the transfected parasites (Figures 10E, 10F and 10H). ASPA, ASPA codon optimized, SMN1, MeCP2 codon optimized and TFEB codon optimized were expressed and secreted by the dense granules to the parasitophorous vacuole (Figures IOC, 10G, 10D, 101, 10J).

EXAMPLE 8

DELIVERY OF MAMMALIAN THERAPEUTIC PROTEINS TO HUMAN FIBROBLASTS IN CULTURE USING TRANSGENIC T GONDII

The present inventors have engineered six Type I and Type II Toxoplasma clonal lines which in addition to secreting the fusion protein into the parasitophorous vacuole, were also able to release it from the parasitophorous vacuole and target it to the nucleus of the host cell in amounts detectable by immunofluorescence staining. These clonal lines are: RH GRA16-MECP2 codon optimized (RH GRA16-MECP2opt), RH GRA16-TFEB codon optimized (RH GRA16-TFEBopt), Prugniaud-GFP-Luciferase DHFR-TS GRA16-MECP2 codon optimized (Pru-GFP-LUC GRA16-MECP2opt), Prugniaud-GFP-Luciferase DHFR-TS GRA16-TFEB codon optimized (Pru-GFP-LUC GRA16-TFEBopt), Prugniaud GRA16-MECP2 codon optimized (Pru GRA16- MECP2opt), and Prugniaud GRA16-TFEB codon optimized (Pru GRA16-TFEBopt). In addition to the transgenic lines expressing the fusion proteins, the present inventors also generated 2 control lines expressing the carrier protein GRA16 fused to the HA epitope tag alone: RH GRA16-HA (RH GRA16-HAstop) and Prugniaud GRA16-HA (Pru GRA16-HAstop). As MeCP2 and TFEB are both nuclear proteins, host cell nuclear targeting means they were targeted to their site of activity in the cell. Secretion and host cell nuclear targeting of the proteins in all the lines described above was detected in HFF human fibroblasts by infection of HFF cells in complete DMEM with tachyzoites of the respective line, fixation, immunofluorescence staining and analysis by fluorescence and polarized light microscopy (Figures 10B, 101 and 10J).

In order to quantify the efficiency and the dynamics of GRA16-mediated protein delivery in vitro, the present inventors quantified delivery over time and multiplicity of infection (MOI) for tachyzoites of the three type I lines RH GRA16-HAstop, RH GRA16-MECP2opt and RH GRA16-TFEBopt, as summarized in Figures 11A-C. EXAMPLE 9

DELIVERY OF MAMMALIAN THERAPEUTIC PROTEINS TO HUMAN NEURONS IN CULTURE USING TRANSGENIC T GONDII

Immortalized Human Dopaminergic Neuronal Precursor Cells, also known as Lund Human Mesencephalic (LUHMES) cells, were grown in media containing F12 advanced DMEM media supplemented with L-glutamine, N-2 serum-free supplement and PFGF (beta-fibroblast growth factor). The cells were differentiated in culture into morphologically and biochemically mature dopamine-like neurons using F12 advanced DMEM media supplemented with L-glutamine, N-2 serum-free supplement, tetracycline, GDNF and cAMP (Cyclic adenosine monophosphate). On day 6-9 of differentiation (at which point the cells are mature neurons), tachyzoites of the clonal lines RH GRA16-MECP2opt, RH GRA16-TFEBopt and RH GRA16-HAstop were used to infect the neurons. 16-22 hours post infection the neurons were fixed and immunofluorescently stained and analysed by fluorescence and polarized light microscopy. All lines tested presented clear secretion of the fusion proteins, which were targeted to the nuclei of the human neurons (Figures 12A-C).

In addition, human neurons were infected with RH GRA16-MECP2opt tachyzoites and collected 24 hours post infection. Infected neurons were washed with PBS, scraped using a sterile cell scraper and pelleted by centrifugation, residual PBS was removed and the pellets were snap-frozen in liquid nitrogen. Nuclear proteins extracted from the infected neurons were immunoprecipitated using magnetic beads bound to an MeCP2 antibody. Immunoprecipitated proteins were immunoblotted with the same MeCP2 antibody following a standard Western blot protocol. The resulting blot presented 2 strong bands, of approximately 75 kDa and approximately 130 kDa. The -75 kDa band represents the endogenous untruncated MeCP2 from the human neurons (predicted size: 75 kDa) and the -130 kDa band represents the parasite- delivered GRA16-MeCP2 fusion protein (GRA16 predicted size: 55 kDa. Fusion protein predicted size: 75+55 = 130 kDa) (Figure 14). EXAMPLE 10

DELIVERY OF MAMMALIAN MECP2 TO MOUSE PRIMARY CORTICAL AND HIPPOCAMPAL CULTURES USING TRANSGENIC T GONDII AND BINDING OF THE DELIVERED MECP2 TO HETEROCHROMATIC DNA

Neuron-enriched primary cultures from the cortices and hippocampi of PI (day

1) mice pups were cultured for 5 days and infected with tachyzoites of the transgenic line RH GRA16-MECP2opt. Twelve (12), 24 and 48 hours post infection, the neurons were fixed, immunofluorescently stained and analysed by fluorescence and polarized light microscopy. In all listed time points, the GRA16-MECP2 fusion protein was secreted and targeted to the nuclei of the neurons, and it also appeared in foci corresponding to areas of dense heterochromatic DNA, recognizes by intense DAPI staining (Figures 13A-D). This is a classical marker for MeCP2 functionality as it suggests that it successfully binds on the heterochromatin. Notably, in culture systems T. gondii can infect both neurons, glial and other cells. Because the primary cultures were not purely neuronal (according to NeuN staining), the present inventors could see that non neuronal cells (presumably mostly glia) infected with the transgenic T. gondii also received the MeCP2 fusion protein and presented the same characteristic foci pattern. EXAMPLE 11

EXPRESSION OF GRA16 AND GRA16-MECP2 FUSION PROTEIN IN T

GONDII BRADYZOITE CYSTS

During the normal course of infection, after the initial acute phase in which the parasite replicates and disseminates in the body as tachyzoites, immune pressure causes T. gondii tachyzoites to differentiate into bradyzoites which resides in the long-lasting quiescent cysts in tissues, and mostly in the brain (Carruthers VB, Suzuki Y, 2007. Schizophr Bull. 33(3):745-51. Effects of Toxoplasma gondii infection on the brain). Therefore the present inventors investigated whether the transgenic T. gondii continue to express the GRA16-fused therapeutic proteins after they differentiate into the bradyzoite form. An in vitro differentiation of the transgenic T. gondii into bradyzoites was induced by stress with alkaline media for 3 to 5 days. Alkaline media contained DMEM medium adjusted to pH 8.1 with 10 mM HEPES and 1% fetal bovine serum supplemented with penicillin-streptomycin (Tomita T, Bzik DJ, et al. 2013. PLoS Pathog. 9(12):el003823. The Toxoplasma gondii cyst wall protein CSTl is critical for cyst wall integrity and promotes bradyzoite persistence). The present inventors tested differentiation of the strains Pru GRA16-HAstop and Pru GRA16-MeCP2. Bradyzoite cysts, identified by DBA (Dolichos Biflorus Agglutinin) staining of the cyst wall, continue to express both the GRA16-HAstop (Data not shown) and GRA16-MeCP2 proteins (Figures 15A-C). This finding shows that the transgenic parasites can continue to express GRA16 and the GRA16 fusion proteins after differentiation into the bradyzoite stage, which is important for continuous secretion and protein delivery from cysts in chronic infections.

EXAMPLE 12

PROBING THE MOLECULAR INTERACTIONS BETWEEN THE HETEROLOGOUS POLYPEPTIDES DELIVERED BY T GONDII AND ENDOGENOUS PROTEINS IN THE RECIPIENT CELL

By performing co-immunoprecipitation on cell lysates from infected cultures and from infected animals, the present inventors can decipher molecular interactions between the polypeptides delivered by the transgenic T. gondii and endogenous proteins and nucleic acids.

For example, in mammalian cells MeCP2 binds both DNA and proteins from the

SMRT/NCoR and SIN3A co-repressor complexes. Binding to DNA can be probed by immunofluorescent stainings and by chromatin-immunoprecipitation, and binding to other proteins can be probed by immunofluorescent stainings and by protein co- immunoprecipitation. The added epitope tags on the delivered fusion proteins can be used to distinguish between copies of the protein produced by the host cell and copies of the protein delivered by the parasites, while antibodies against the protein itself can be used to probe both the endogenous and the parasite-delivered protein (e.g. for comparative assays). EXAMPLE 13

TESTING THE ABILITY OF TRANSGENIC T GONDII TO DELIVER HETEROLOGOUS PROTEIN S-OF -INTEREST CONTINUOUSLY

(/CHRONICALLY) IN ANIMAL MODELS

The transgenic lines of the parasite are tested for their ability to affect phenotypes in whole organism. Testing the transgenic parasites in animal models allows the assessment of the system in the presence of an immune system and a functional blood brain barrier (BBB). This is done by following the standard infection protocols established in the field to infect mice with the transgenic T. gondii (Cabral CM et al., 2016, PLoS Pathog. 12(2):el005447. Neurons are the Primary Target Cell for the Brain- Tropic Intracellular Parasite Toxoplasma gondii; Berenreiterova M et al., 2011. PLoS One. 6(12):e28925. The distribution of Toxoplasma gondii cysts in the brain of a mouse with latent toxoplasmosis: implications for the behavioral manipulation hypothesis; Tait ED et al. 2010. J. Immunol. 185(3): 1502-12. Virulence of Toxoplasma gondii is associated with distinct dendritic cell responses and reduced numbers of activated CD8+ T cells; Koshy AA et al. 2012. PLoS Pathog. 8(7):el002825. Toxoplasma co-opts host cells it does not invade; Jensen KD et al. 2015. MBio. 6(2):e02280. Toxoplasma gondii superinfection and virulence during secondary infection correlate with the exact ROP5/ROP18 allelic combination). The transgenic T. gondii parasites are administered to the animal model intranasally, enterally (such as orally), subcutaneously, intramuscularly, intravenously, intradermally, intraperitoneally, intracranially or intracerebrally. The transgenic T. gondii parasites are administered in the form of tachyzoites, bradyzoite tissue cysts or oocysts. Animal models can be wild type animals or model animals bearing a condition treatable or otherwise affected by delivery of the protein of interest expressed in the transgenic parasites. After the initial stage of acute infection, the animal develops chronic bradyzoite tissue cysts, primarily in the brain. This mode of delivery relies on continuous expression and secretion the protein of interest from the transgenic T. gondii at the chronic stage of infection.

At different stages during development of the chronic infection (first 3 weeks to 2 months after infection, depending on strain and route of infection), as well as in different time points after the chronic infection has been established and stabilized (time-scale of months to years after infection), brain histological stainings are used to assess the percentage of cells infected, distribution of parasites and the delivered protein in the brain and in other tissues and to quantify the levels of delivered protein. Co- immunoprecipitation and cellular morphologies can be used to confirm molecular interactions between the delivered proteins and endogenous counterparts in the receiving cells and to provide evidence for their function.

The condition of the treated animal model is monitored and characterized at different stages of the chronic infection and at different stages of the disease progression using physiological, behavioural, cellular and molecular measures in order to assess manifestations of disease phenotypes and efficacy of the treatment in alleviating them as well as any potential toxicity arising from the treatment. The effects of the transgenic parasites are assessed by comparison to animals treated with parasites not expressing the pharmaceutical polypeptide and/or to sham-treated animals.

For instance, mice model of Rett Syndrome treated with T. gondii parasites expressing the therapeutic protein MeCP2 are assessed by known phenotypes of the disease which include impaired social interaction, increased scent marking, hindlimb clasping, reduced lifspan, anxiety, respiratory problem, hypoactivity, impaired motor coordination, miRNA-mediated IGF1 deficit, decreased level of BDNF, impaired cortical plasticity, glial dysfunction, GABAergic neuronal dysfunction, decreased soma size, decreased dendritic spine density, decreased synapse number, decreased activity (Ca2+ imaging), decreased sEPSC and sIPSC frequency and amplitude, decreased Tuj l+ cell number, decreased PAX6, SCN1A/1B expression, decreased dendrite complexity in cortical neurons, decreased AP frequency in cortical neurons, decreased total transcription in cortical neurons, decreased expression of neuronal, synaptic, highly-expressed genes, immediate early genes and mitochondrial genes in cortical neurons, decreased mRNA translation in cortical neurons, decreased BDNF secretion from cortical neurons, decreased AKT/mTOR pathway activity in cortical neurons and decreased oxygen consumption and maximal respiration in cortical neurons (de la Torre-Ubieta et al. Advancing the understanding of autism disease mechanisms through genetics. Nat Med. 2016, 22:345-61). EXAMPLE 14

TESTING THE ABILITY OF TRANSGENIC T GONDII TO DELIVER HETEROLOGOUS PROTEINS-OF -INTEREST TRANSIENTLY (/ACUTELY) IN

ANIMAL MODELS

Similarly to Example 13 hereinabove, the transgenic lines of the parasite are tested for their ability to affect phenotypes in whole organism. Testing the transgenic parasites in animal models allows the assessment of the system in the presence of an immune system and a functional blood brain barrier (BBB), and can be conducted by intranasal, enteral (such as oral), subcutaneous, intra-muscular, intravenous, intradermal, intraperitoneal, intracranial or intracerebral administration of the transgenic T. gondii in the form of tachyzoites, bradyzoite tissue cysts or oocysts. Animal models can be wild type animals or model animals bearing a condition treatable or otherwise affected by delivery of the protein of interest expressed in the transgenic parasites. This experimental design focuses on assessment of transient delivery that does not rely on differentiation into bradyzoites and establishment of chronic infection and/or chronic cysts. Some examples include: delivery by attenuated parasites that cannot differentiate into bradyzoites and/or cannot replicate and/or cannot persist in vivo longer than a few weeks, or otherwise cannot establish a chronic infection; delivery by local administration to the target tissue or to an area close to the target tissues followed by clearance or inactivation of the parasites. These can also be repeated multiple times for repeated dosing. Immunosupressive agents can be administered before, concomitantly or after infection with the parasites to enhance infection and to aid re-infection of animals that have already been infected with T. gondii before. This experimental design also includes delivery by transgenic T. gondii capable of establishing chronic infection, but in which the collection of samples or analysis are performed at the acute stage of infection before establishment of chronic infection. This form of delivery relies mostly on secretion of the protein of interest by tachyzoites of the transgenic lines. At different stages during the acute infection (first days to about 2 months after infection, depending on strain and route of infection) brain histological stainings are used to assess the percentage of cells infected, characterize the distribution of parasites and the delivered protein in the brain and in other tissues and quantify the levels of delivered protein. Co- immunoprecipitation and cellular morphologies are used to confirm molecular interactions between the delivered proteins and endogenous counterparts in the receiving cells and to provide evidence for their function.

The condition of the treated animal model is monitored and characterized at different time points after initiation of treatment and at different stages of the disease progression using physiological, behavioural, cellular and molecular measures in order to assess manifestations of disease phenotypes and efficacy of the treatment in alleviating them as well as any potential toxicity arising from the treatment. The effects of the transgenic parasites are assessed by comparison to animals treated with parasites not expressing the therapeutic polypeptide and/or to sham-treated animals.

EXAMPLE 15

INTRODUCTION OF FURTHER MODIFICATIONS THAT WILL INCREASE THE SAFETY AND CONTROLLABILITY OF THE TRANSGENIC T GONDII

Although the parasite is generally considered harmless in healthy humans, additional modifications to the parasites may be incorporated in the therapeutic strains in order to further attenuate their virulence. As the model organism of its phylum, there is a wide range of highly developed genetic tools for modifying the parasite (Jimenez- Ruiz E, Wong EH, Pall GS, Meissner M. Parasitology. 2014, 141: 1390-8. "Advantages and disadvantages of conditional systems for characterization of essential genes in Toxoplasma gondii") which may be used to generate attenuated and exogenously- controllable parasites (Moe-Behrens GH, Davis R, Haynes KA. Front Microbiol. 2013 4:5. "Preparing synthetic biology for the world"). One approach may include engineered auxotrophy. Secondly, attenuated parasites may be developed by targeting and disrupting specific virulence mechanisms of the parasite which are not essential to its ability to deliver the protein drugs. Another approach may include active control through engineered induced lethality such as inducible "suicide switches" (self- destruction element) that eliminate the parasite upon administration of an activating molecule. Some examples may include utilizing the widely-used tetracyclin-inducible promoter for drug-inducible activation of lethal or apoptotic pathways in the parasites in vivo, or the rapamycin-dimerizing diCre system for inducible knock-out of essential genes or for inducible activation of lethal or apoptotic pathways. Such drug-inducible control may also allow differential clearance of the parasite from specific tissues, regions or cell types due to differential permeability of the tissues or cells to the drug and its different distribution in the body.

Analysis and Discussion

The invention is a protein delivery platform that utilizes the machinery of the brain parasite T. gondii, which allows it to penetrate the blood-brain barrier and delivery proteins to cells, and especially neurons, within the CNS.

To enable delivery of therapeutic proteins to the CNS, the present inventors reappropriate the parasite Toxoplasma gondii for the synthesis and delivery of proteins of interest to cells in the CNS. The present inventors thus establish a versatile platform for specific and controlled delivery of proteins to cells within the CNS.

The engineered parasite has the following novel capabilities:

(1) It can be easily modified to synthesize a range of therapeutic heterologous proteins and secrete them with specificity into target cells within the CNS.

(2) It can contain modification that reduce or exclude elements of the parasite that can mediate deleterious effects on the infected host.

(3) It can contain regulatory and/or self-destruction elements that allow control of its distribution and spatiotemporal activity once in the host.

Following is a non-limiting summary of alterations in the parasite's genome that are described herein:

(i) Fusion of the selected therapeutic protein to a specific endogenous secreted protein of the parasite. Additional accessory elements and modifications may be added to the fusion gene/protein that may increase the effectiveness of the therapeutic fusion protein, such as cleavage sites that may allow detachment of the therapeutic protein from the endogenous parasite protein, after it is no longer needed for its targeting or activity, or mediate detection and assessment of efficiency and distribution of the Toxoplasma and of the delivered protein, such as epitope tags and fluorescent proteins.

(ii) Removal of virulence genes that are not necessary for the therapeutic role of the parasite. These modifications reduce the potential deleterious effects of the parasite on the patient, and makes it relevant for safe clinical use. (iii) Insertion of regulatory elements that can be induced to cause the parasite to self-destruct in response to an externally administered drug, or to manipulate the parasite's activity. These elements augment the safety features of the technology, and also provide increased control of its spatiotemporal activity.

The initial strains which were developed with the method of some embodiments of the invention were based on the Galactocerebrosidase and Galactocerebrosidase-TAT proteins for Krabbe disease, GDNF protein for a range of neurodegenerative diseases and other conditions (among them Parkinson's disease, Supranuclear Palsy, Multiple sclerosis, Alzheimers disease, Amyotrophic Lateral Sclerosis, Huntington's disease, retinal degenerations, Traumatic Brain Injury and hypoxic/ischaemic CNS disorders), Aspartoacylase protein for Canavan disease, MeCP2 protein for Rett Syndrome, SMN protein for Spinal Mascular Atrophy, E3 ubiquitin-protein ligase parkin for Parkinson's Disease and TFEB for lysosomal storage diseases and neurodegenerative diseases. Though the present inventors have selected a number of therapeutic proteins that are currently being tested clinically and have known supplementary functions, the platform may be adapted to a wide range of proteins.

The possible applications of the technology include, but are not limited to:

1. The treatment of conditions through the delivery of therapeutic proteins;

2. The supply of proteins to the brain in healthy individuals for the purpose of augmenting brain functions or promoting neuro -regeneration;

3. A general solution for highly specific and efficient targeted protein delivery for supplementing protein synthesis.

Although the invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications and variations will be apparent to those skilled in the art. Accordingly, it is intended to embrace all such alternatives, modifications and variations that fall within the spirit and broad scope of the appended claims.

All publications, patents and patent applications mentioned in this specification are herein incorporated in their entirety by reference into the specification, to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated herein by reference. In addition, citation or identification of any reference in this application shall not be construed as an admission that such reference is available as prior art to the present invention. To the extent that section headings are used, they should not be construed as necessarily limiting.

REFERENCES

1. Pardridge, W. M. Targeted delivery of protein and gene medicines through the blood-brain barrier. Clin. Pharmacol. Ther. 97, 347-61 (2015).

2. Cox, D. B. T., Piatt, R. J. & Zhang, F. Therapeutic genome editing: prospects and challenges. Nat. Med. 21, 121-131 (2015).

3. Persons, D. A. & Baum, C. Solving the problem of γ-retroviral vectors containing long terminal repeats. Mol. Ther. 19, 229-31 (2011).

4. Schambach, A., Zychlinski, D., Ehrnstroem, B. & Baum, C. Biosafety features of lentiviral vectors. Hum. Gene Ther. 24, 132-42 (2013).

5. Gray, S. J., Nagabhushan Kalburgi, S., McCown, T. J. & Jude Samulski, R. Global CNS gene delivery and evasion of anti-AAV-neutralizing antibodies by intrathecal AAV administration in non-human primates. Gene Ther. 20, 450-9 (2013).

6. Templeton, N. Gene and cell therapy: therapeutic mechanisms and strategies. (2008).

7. Dimmeler, S., Ding, S., Rando, T. A. & Trounson, A. Translational strategies and challenges in regenerative medicine. Nat. Med. 20, 814-21 (2014).

8. Abbott, N. J. Blood-brain barrier structure and function and the challenges for CNS drug delivery. J. Inherit. Metab. Dis. 36, 437-49 (2013).

9. Malhotra, M. & Prakash, S. Targeted Drug Delivery Across Blood-Brain-Barrier Using Cell Penetrating Peptides Tagged Nanoparticles. Curr. Nanosci. 7, 81-93 (2011).

10. Bradbury, M. W. B. Physiology and Pharmacology of the Blood-Brain Barrier. (Springer Science & Business Media, 2012).

11. Solaro, R., Chiellini, F. & Battisti, A. Targeted Delivery of Protein Drugs by Nanocarriers. Materials (Basel). 3, 1928-1980 (2010).

12. Sanecka, A. & Frickel, E.-M. Use and abuse of dendritic cells by Toxoplasma gondii. Virulence 3, 678-89 (2012).

13. Carruthers, V. B. & Suzuki, Y. Effects of Toxoplasma gondii infection on the brain. Schizophr. Bull. 33, 745-51 (2007).

14. Feustel, S. M., Meissner, M. & Liesenfeld, O. Toxoplasma gondii and the blood- brain barrier. Virulence 3, 182-92

15. Montoya, J. G. & Liesenfeld, O. Toxoplasmosis. Lancet 363, 1965-76 (2004).

16. Dlugonska, H. Toxoplasma rhoptries: unique secretory organelles and source of promising vaccine proteins for immunoprevention of toxoplasmosis. J. Biomed. Biotechnol. 2008, 632424 (2008).

17. Carruthers, V. B. & Sibley, L. D. Sequential protein secretion from three distinct organelles of Toxoplasma gondii accompanies invasion of human fibroblasts. Eur. J. Cell Biol. 73, 114-23 (1997).

18. Boothroyd, J. C. & Dubremetz, J.-F. Kiss and spit: the dual roles of Toxoplasma rhoptries. Nat. Rev. Microbiol. 6, 79-88 (2008).

19. Koshy, A. A. et al. Toxoplasma secreting Cre recombinase for analysis of host- parasite interactions. Nat. Methods 7, 307-9 (2010).

20. Fox, B. A., Sanders, K. L. & Bzik, D. J. Non-replicating Toxoplasma gondii reverses tumor-associated immunosuppression. Oncoimmunology 2, e26296 (2013).

21. Fox, B. A., Sanders, K. L., Chen, S. & Bzik, D. J. Targeting tumors with nonreplicating Toxoplasma gondii uracil auxotroph vaccines. Trends Parasitol. 29, 431-7 (2013).

22. Elliott, D. E. & Weinstock, J. V. Helminth-host immunological interactions: prevention and control of immune-mediated diseases. Ann. N. Y. Acad. Sci. 1247, 83-96 (2012).

23. Rothman, J. & Paterson, Y. Live-attenuated Listeria-based immunotherapy. Expert Rev. Vaccines 12, 493-504 (2013).

24. Reeves, A. Z. et al. Engineering Escherichia coli into a protein delivery system for mammalian cells. ACS Synth. Biol. 4, 644-54 (2015).

25. Meng, X.-L., Eto, Y., Schiffmann, R. & Shen, J.-S. HIV Tat Domain Improves Cross-correction of Human Galactocerebrosidase in a Gene- and Flanking Sequence- dependent Manner. Mol. Ther. Nucleic Acids 2, el30 (2013).

26. Otto P. van Diggelen. MOSCERDAM SUBSTRATES, Laboratory protocol for enzyme analysis fir Krabbe disease; Galactocerebrosidase. MOSCERDAM SUBSTRATES, Laboratory protocol for enzyme analysis fir Krabbe disease; Galactocerebrosidase.