HUANG JIAN (US)
WO2013138363A2 | 2013-09-19
JP4814686B2 | 2011-11-16
US20020129407A1 | 2002-09-12
US20140215652A1 | 2014-07-31
HUANG, JIAN ET AL.: "Creating completely both male and female sterile plants by specifically ablating microspore and megaspore mother cells", FRONTIERS IN PLANT SCIENCE, vol. 7, February 2016 (2016-02-01), pages 1 - 12, XP055350565
CLAIMS
What is claimed is:
1. An isolated polynucleotide construct comprising a first polynucleotide and a second polynucleotide, the first polynucleotide comprising a SOLO-DANCERS (SDS) gene or fragment thereof, the second polynucleotide comprising a Barnase gene or fragment thereof, wherein the SDS gene comprises the SDS promoter.
2. The isolated polynucleotide construct of claim 1, wherein the isolated polynucleotide construct is operably linked to the SDS promoter.
3. The isolated polynucleotide construct of claim 1, wherein the SDS gene comprises at least one regulatory intron.
4. The isolated polynucleotide construct of claim 3, wherein the at least one regulatory intron comprises a sequence of any one of SEQ ID NO: 22-26 or 47-51.
5. The isolated polynucleotide construct of claim 1, wherein the SDS gene comprises a polynucleotide sequence of any one of SEQ ID NO: 1-21 or 29-46.
6. The isolated polynucleotide construct of claim 1, wherein the Barnase gene comprises a polynucleotide sequence of any one of SEQ ID NO:27.
7. A vector comprising the isolated polynucleotide construct of claim 1.
8. A plant cell comprising the vector of claim 7.
9. A plant comprising the plant cell of claim 8.
10. The plant of claim 9, wherein the plant is completely male sterile and female sterile.
11. The plant of claim 10, wherein the plant is a gymnosperm or angiosperm.
12. The plant of claim 11, wherein the plant is a grass, tree, or ornamental plant.
13. The plant of claim 11, wherein the plant is Arabidopsis, tobacco, alfalfa, soybean, maize, rice, Brachypodium, switchgrass, Miscanthus, poplars, cherry, or Eucalyptus.
14. A composition for generating a complete male sterile and female sterile transgenic plant, the composition comprising the isolated polynucleotide construct of claim 1.
15. The composition of claim 14, further comprising a second isolated polynucleotide construct, wherein the second isolated polynucleotide construct comprises an inducible promoter operably linked to an artificial microRNA (amiRNA) targeted to the Barnase gene or fragment thereof, wherein the fertility of the plant is restored by inducing the expression of the amiRNA.
16. The composition of claim 15, wherein the amiRNA comprises a polynucleotide sequence of SEQ ID NO: 28.
17. The composition of claim 15, wherein the inducible promoter is an estradiol inducible promoter, an ethanol inducible promoter, a dexamethasone inducible promoter, a methoxyfenozide inducible promoter, or a temperature inducible promoter.
18. The composition of claim 17, wherein the temperature inducible promoter is a heat shock inducible promoter or a heat inducible promoter.
19. The composition of claim 14, wherein the isolated polynucleotide construction of claim 1 and the second isolated polynucleotide are encoded on the same vector.
20. The composition of claim 14, wherein the isolated polynucleotide construction of claim 1 and the second isolated polynucleotide are encoded on separate vectors.
21. A vector comprising the composition of claim 14.
22. A plant cell comprising the vector of claim 21.
23. A plant comprising the plant cell of claim 22.
24. The plant of claim 23, wherein the plant becomes male fertile and female fertile after the induction of amiRNA.
25. The plant of claim 24, wherein the plant is a gymnosperm or angiosperm.
26. The plant of claim 25, wherein the plant is a grass, tree, or ornamental plant.
27. The plant of claim 25, wherein the plant is Arabidopsis, tobacco, alfalfa, soybean, maize, rice, Brachypodium, switchgrass, Miscanthus, poplars, cherry, or Eucalyptus.
28. A method for generating a complete male sterile and female sterile plant, the method comprising introducing into a target plant an isolated polynucleotide construct of claim 1 to generate a transgenic plant.
29. A method for ablating microspore and megaspore mother cells in a plant, the method comprising introducing into a target plant an isolated polynucleotide construct of claim 1 to generate a transgenic plant, wherein the microspore and megaspore mother cells are ablated.
30. A method for restoring fertility in a male sterile and female sterile transgenic plant, the method comprising: (a) introducing into a target plant a second isolated polynucleotide construct, wherein the second isolated polynucleotide construct comprises an inducible promoter operably linked to an artificial microRNA (amiRNA) targeted to the Barnase gene or fragment thereof to generate a transgenic plant; (b) introducing into the transgenic plant generated in (a) the isolated polynucleotide construct of claim 1 to generate a double transgenic plant; and (c) inducing the expression of the amiRNA, thereby restoring fertility in a complete male sterile and female sterile transgenic sterile plant.
31. The method of claim 30, wherein the isolated polynucleotide construct and the second polynucleotide construct are encoded on the same vector.
32. The method of claim 30, wherein the isolated polynucleotide construct and the second polynucleotide construct are encoded on different vectors.
33. The method of any one of claims 30-32, wherein inducing the expression of the amiRNA comprises contacting the transgenic plant with estradiol, ethanol, dexamethasone, methoxyfenozide, or temperature.
34. The method of any one of claims 30-33, wherein the target plant is a gymnosperm or angiosperm.
35. The method of claim 34, wherein the target plant is a grass, tree, or ornamental plant.
36. The method of claim 34, wherein the target plant is Arabidopsis, tobacco, alfalfa, soybean, maize, rice, Brachypodium, switchgrass, Miscanthus, poplars, cherry, or Eucalyptus.
37. The method of any one of claims 28-36, wherein the SDS gene is an endogenous gene of target plant.
38. The method of any one of claims 28-36, wherein the SDS gene is a transgene to the target plant.
39. The plant of any one of claims 8-13 or 23-27, wherein the SDS gene is an endogenous gene of target plant.
40. The plant of any one of claims 8-13 or 23-27, wherein the SDS gene is a transgene to the target plant.
41. A transgenic plant produced by the method of claim 28.
RESTORATION OF FERTILITY
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority to U.S. Provisional Application No. 62/198,979, filed July 30, 2015, which is incorporated herein by reference in its entirety.
TECHNICAL FIELD
[0002] The present invention relates to compositions and methods for creating sterile plants by genetically ablating microspore and megaspore mother cells.
BACKGROUND
[0003] Genetically modified (GM) plants, including GM trees, turf grasses, biofuel and forage crops, and ornamentals, improve commercially important traits, such as biomass and biofuel production, digestibility, bioremediation, ornamental value, and tolerance to stresses. However, commercial uses of GM plants are severely limited by stringent government regulations due to concerns over potential ecological effects of transgene flow and floral-modified plantations. Transgene flow from GM plants to non-GM plants and wild populations is mainly mediated by dispersal of pollen and seeds. Early studies found that pollen-mediated gene flow from GM Roundup Ready creeping bentgrass (a turfgrass) occurred within 2 to 21 km. The non-GM rabbit food grass could pollinate the GM creeping bentgrass to produce transgenic intergeneric hybrid offspring, suggesting that transgene escape can also be mediated by the female part of GM plants. Long-distance pollen-mediated gene flow occurred between weed beets as far as 9.6 km apart, and the resulting interfield gene flow is unavoidable. Pollen migration from poplars often goes beyond 10 km, indicating that similar issues arise with GM trees. Moreover, gene flow from GM crops to native populations was detected in maize, soybean, wheat, and canola. To overcome regulatory hurdles to field research and, ultimately, commercial uses of GM plants, a practical solution is to create sterile plants by ablating floral organs/tissues using toxic genes under control of specific promoters or by altering flowering time and floral organs via manipulating genes critical for flower development.
[0004] Strategies for creating male sterility have been employed to prevent pollen-mediated transgene flow. This approach has also been applied to asexually propagated GM perennial grasses and trees.
In addition, manipulating genes regulating flowering time, floral meristem identity, floral organ identity, and floral organ establishment is used to abolish plant fertility. Although male sterility has been successfully achieved via different approaches in various plant species, it cannot completely prevent transgene flow. Seed development in male sterile GM plants can be rescued by the long-distance transfer of pollen from non-GM plants. The same is also true for female sterile GM plants, which disperse pollen to non-GM or male sterile GM plants. Thus, completely abolishing male and female fertility is the only fail-safe way to prevent transgene flow. Moreover, existing strategies for creating male sterility, female sterility, or both lead to loss or alterations of entire flowers or floral organs, which may cause potential ecological effects on biodiversity of species associated with flowers, such as insects. In addition, genetically engineered ornamental plants that do not produce flowers or that exhibit floral organ alterations have reduced ornamental value. The remaining toxicity of BARNASE in non-target organs due to unspecific basal activities of the employed promoters inhibits plant survival and growth. In addition, the male fertility restoring system BARNASE-BARSTAR has been used to restore male fertility by suppressing the BARNASE enzyme activity with its protein inhibitor BARSTAR. Seed production of BARNASE-created male sterile plants is restored by introducing BARSTAR, a BARNASE inhibitor. However, the BARNASE:BARSTAR protein complex may pose a potential health risk, and no restoration system has been tested to restore female fertility.
[0005] Biotechnologies for engineering sterility without altering either growth or floral structure are needed to prevent dispersal of transgenes and to reduce concerns regarding ecological impacts from genetically modified (GM) plants, such as GM trees, turf grasses, biofuel and forage crops, and ornamentals. There is a need to generate sterility in both male and female reproductive organs without affecting plant growth or altering flower structure. In addition, there is a need for a system that restores both male and female fertility by directly down-regulating the expression of BARNASE.
SUMMARY
[0006] The present invention is directed to an isolated polynucleotide construct comprising a first polynucleotide and a second polynucleotide, the first polynucleotide comprising a SOLO-DANCERS (SDS) gene or fragment thereof, the second polynucleotide comprising a Barnase gene or fragment thereof, wherein the SDS gene comprises the SDS promoter. The present invention is directed to a vector comprising said isolated polynucleotide construct. The present invention is directed to a plant cell comprising said vector. The present invention is directed to a plant comprising said plant cell.
[0007] The present invention is also directed to a composition for generating a complete male sterile and female sterile transgenic plant. The composition comprises said isolated polynucleotide construct. The present invention is directed to a vector comprising said composition. The present invention is directed to a plant cell comprising said vector or said composition. The present invention is directed to a plant comprising said plant cell.
[0008] The present invention is also directed to a method for generating a complete male sterile and female sterile plant. The method comprises introducing into a target plant said isolated polynucleotide construct to generate a transgenic plant. The present invention is directed to a transgenic plant produced by said method.
[0009] The present invention is also directed to a method for ablating microspore and megaspore mother cells in a plant. The method comprises introducing into a target plant said isolated polynucleotide construct to generate a transgenic plant, wherein the microspore and megaspore mother cells are ablated.
[0010] The present invention is also directed to a method for restoring fertility in a male sterile and female sterile transgenic plant. The method comprises (a) introducing into a target plant said composition to generate a transgenic plant; (b) introducing into the transgenic plant generated in (a) said isolated polynucleotide construct to generate a double transgenic plant; and (c) inducing the expression of the amiRNA, thereby restoring fertility in a complete male sterile and female sterile transgenic sterile plant.
BRIEF DESCRIPTION OF THE DRAWINGS
[0011] FIGS. 1A-1D show schematic diagrams of constructs. FIG. 1A shows the SDS::BARNASE construct. FIG. 1B shows the SDS::GUS construct. FIG. 1C shows the SDS::SDS-GFP construct. FIG. 1D shows the SDS::SDS-BARNASE construct. LB and RB, the T-DNA left and right border, respectively; BAR, the gene conferring resistance to the herbicide Basta; SDS::, the 1.5-kb promoter of the SDS gene; BARNASE, the bacterial ribonuclease; KAN, the kanamycin resistance gene; GUS, the gene encoding β-glucuronidase; GFP, the gene encoding green fluorescent protein; HPT, the hygromycin phosphotransferase gene; and SDS::SDS, the SDS genomic fragment containing a 1.5-kb promoter followed by a DNA fragment consisting of seven exons and six introns.
[0012] FIGS. 2A-2I show that the SDS::BARNASE Arabidopsis plants were abnormal in growth and development. FIGS. 2A-2C show that, compared to wild type (FIG. 2A), three-week old SDS::BARNASE plants (FIGS. 2B and 2C) produced fewer rosette leaves with irregular shapes. Bars = 0.5 cm. FIGS. 2D-2G show six-week old wild-type (WT, FIG. 2D) and SDS::BARNASE plants showing fertile but dwarf (FIG. 2E), dwarf and sterile (FIG. 2F), and no inflorescence (FIG. 2G) phenotypes. Bars = 1 cm. FIG. 2H shows that six-week old SDS::BARNASE plants were significantly shorter than the wild type. FIG. 2I shows that the rosette leaf number of SDS::BARNASE adult plants was significantly reduced. "n" indicates the number of examined plants. Stars indicate significant difference (P < 0.01).
[0013] FIGS. 3A-3F show that the entire SDS gene but not the SDS 1.5-kb promoter confers the SDS meiocyte-specific expression. FIGS. 3A-3D show GUS staining of SDS::GUS plants showing GUS signals in cotyledons, true leaves, and shoot apical meristem of a young seedling (FIG. 3A), as well as in carpels and stigmas of young buds (FIGS. 3B-3D). FIG. 3E shows a confocal image from an SDS::SDS-GFP stage-5 anther showing the GFP signal (green color) only in microspore mother cells (arrows). Red and yellow colors show merged autofluorescence. FIG. 3F shows a confocal image from an SDS::SDS-GFP stage 2-IV ovule showing the GFP signal only in the megaspore mother cell (arrow). Bars = 0.1 cm (FIGS. 3A and 3B), 0.5 mm (FIGS. 3C and 3D), 50 μm (FIG. 3E), and 10 μm (FIG. 3F).
[0014] FIGS. 4A-4H show that the SDS::SDS-BARNASE Arabidopsis plants showed normal growth and development. FIGS. 4A and 4B show three-week old WT (FIG. 4A) and SDS::SDS-BARNASE (FIG. 4B) plants. Bars = 0.5 cm. FIGS. 4C and 4D show five-week old WT (FIG. 4C) and SDS::SDS-BARNASE (FIG. 4D) inflorescences. Bars = 0.5 cm. FIGS. 4E and 4F show six-week old WT (FIG. 4E) and SDS::SDS-BARNASE (FIG. 4F) plants. Bars = 1 cm. FIG. 4G shows no difference in average height between six-week old WT and SDS::SDS-BARNASE plants. FIG. 4H shows similar rosette leaf numbers, indicating no difference in flowering time between WT and SDS::SDS-BARNASE plants. "n" in FIGS. 4G and 4H indicates the number of examined plants.
[0015] FIGS. 5A-5J show that the SDS::SDS-BARNASE Arabidopsis plants were completely both male and female sterile. FIGS. 5A-5C show primary branches with normal siliques in wild type (FIG. 5A) and short siliques indicating no developing seeds in SDS::SDS-BARNASE plants without (FIG. 5B) and with (FIG. 5C) pollination. Bars = 1 cm. FIGS. 5D and 5E show side views of mature flowers (one sepal was removed from each), showing that the SDS::SDS-BARNASE flower (FIG. 5E) is similar to the wild type (FIG. 5D) except for its short filaments. Pollen grains were released from WT anthers (FIG. 5D, inset), while no pollen grains were released from SDS::SDS-BARNASE anthers (FIG. 5E, inset). Bars = 0.5 mm. FIGS. 5F and 5G show pollen staining: the WT anther is full of viable pollen grains (FIG. 5F), but the SDS::SDS-BARNASE anther has no pollen grains (FIG. 5G). Bars = 30 μm. FIGS. 5H-5J show that dissected individual siliques from primary inflorescences (positions 1-9) were long in wild type (FIG. 5H), but short in SDS::SDS-BARNASE plants (FIG. 5I, without pollination; FIG. 5J, pollinated with WT pollen). Bars = 1 cm.
[0016] FIGS. 6A-6F show that the formation of male gametes was arrested in SDS::SDS-BARNASE Arabidopsis plants. FIGS. 6A-6C show WT anthers showing microsporocytes (microspore mother cells) and surrounding tapetal cells at stage 5 (FIG. 6A), tetrads and tapetal cells at stage 7 (FIG. 6B), and developing pollen grains at stage 9 (FIG. 6C). FIGS. 6D-6F show SDS::SDS-BARNASE anthers showing degenerating microsporocytes and precociously vacuolated tapetal cells at stage 5 (FIG. 6D), dead microsporocytes and tapetal cells at stage 7 (FIG. 6E), and a nearly empty anther lobe at stage 9 (only one dead pollen, FIG. 6F). M, microsporocytes (microspore mother cells); DP, developing pollen; T, tapetal cell; and Tds, tetrads.
[0017] FIGS. 7A-7F show that the formation of female gametes was arrested in SDS::SDS-BARNASE Arabidopsis plants. FIGS. 7A-7C show WT ovules showing two separated nuclei (arrows) at the FG3 stage (FIG. 7A), four nuclei (arrows) at the FG4 stage (FIG. 7B), and the central cell, the egg cell, and synergid cells in a mature embryo sac (white dots outlined) at the FG6 stage (FIG. 7C). FIGS. 7D-7F show SDS::SDS-BARNASE ovules showing one small nucleus (arrow) at both FG3 (FIG. 7D) and FG4 (FIG. 7E) stages and a small empty embryo sac (white dots outlined) at the FG6 stage (FIG. 7F). Bars = 10 μm. cc, central cell; ec, egg cell; and syn, synergid cells.
[0018] FIG. 8 shows the expression of tapetal cell as well as microspore and megaspore mother cell marker genes. Real-time qRT-PCR shows decreased expression of the tapetal cell marker genes A9 and ATA7 as well as the microspore and megaspore mother cell marker genes DMC1 and SWI1. Stars indicate significant difference (P < 0.01).
[0019] FIGS. 9A-9F show that the SDS::SDS-BARNASE tobacco plants showed normal growth and development. FIG. 9A shows forty-day old tobacco WT and SDS::SDS-BARNASE plants. Bar = 5 cm. FIGS. 9B and 9C show sixty-day old WT (FIG. 9B) and SDS::SDS-BARNASE (FIG. 9C) plants. Bars = 10 cm. FIG. 9D shows no difference in average height between WT and SDS::SDS-BARNASE adult plants. FIGS. 9E and 9F show that flower size, color, and structure remained the same in WT and SDS::SDS-BARNASE plants. Bars = 1 cm.
[0020] FIGS. 10A-10H show that the SDS::SDS-BARNASE tobacco plants were completely both male and female sterile. FIGS. 10A-10C show large fruits from the WT plant (FIG. 10A) and small fruits from SDS::SDS-BARNASE plants without (FIG. 10B) and with (FIG. 10C) manual pollination with WT pollen grains. Bars = 1 cm. FIG. 10D shows the weight of seeds per self-pollinated and manually pollinated fruit (n = 5), respectively. Numbers indicate examined independent transgenic lines. FIG. 10E shows WT viable pollen grains in red color. FIGS. 10F-10H show no (FIG. 10F), all dead (FIG. 10G) and a few viable (FIG. 10H) pollen grains in SDS::SDS-BARNASE plants. Numbers indicate examined independent transgenic lines. Bars = 100 μm.
[0021] FIGS. 11A-11C show schematic diagrams of constructs. FIG. 11A shows a schematic diagram of the SDS::BARNASE construct. BARSTAR, the BARNASE inhibitor gene; KanR, the kanamycin resistance gene; LB, the T-DNA left border; BAR, the BASTA resistance gene; SDS::, the SDS 1.5-kb promoter region; BARNASE, the bacterial ribonuclease; and RB, the T-DNA right border. FIG. 11B shows a schematic diagram of the SDS::SDS-BARNASE construct. SDS::SDS, the SDS genomic fragment containing a 1.5-kb promoter region followed by a DNA fragment containing 7 exons and 6 introns; other components are the same as those of SDS::BARNASE. FIG. 11C shows a schematic diagram of the ER::amiR-BARNASE construct. ER, estrogen receptor; amiR-BARNASE, sequence for generating an artificial microRNA targeting BARNASE.
[0022] FIGS. 12A-12M show the creation of complete male and female sterility in Arabidopsis by SDS::SDS-BARNASE and restoration of fertility by ER::amiR-BARNASE. FIGS. 12A-12F show side views of mature flowers (FIGS. 12A-12C) and pollen staining of mature anthers (FIGS. 12D-12F) showing plenty of pollen grains from wild type (FIGS. 12A and 12D), no pollen grains from SDS::SDS-BARNASE plants (FIGS. 12B and 12E), and some pollen grains from ER::amiR-BARNASE/SDS::SDS-BARNASE plants after estradiol induction (FIGS. 12C and 12F). One sepal was removed from each flower. FIGS. 12G-12J show main branches with normal siliques in wild type (FIG. 12G), short siliques indicating no developing seeds in SDS::SDS-BARNASE plants without (FIG. 12H) and with (FIG. 12I) pollination, and elongated siliques (arrows) in the ER::amiR-BARNASE/SDS::SDS-BARNASE plant treated with estradiol for 7 days (FIG. 12J). FIG. 12K shows real-time qRT-PCR results for expression changes of BARNASE before and after estradiol induction in three examined ER::amiR-BARNASE/SDS::SDS-BARNASE lines. Stars indicate significant difference (P < 0.01). FIG. 12L shows six-week old wild-type plants. FIG. 12M shows sterile six-week old ER::amiR-BARNASE/SDS::SDS-BARNASE offspring plants from induced seeds. Bars = 0.5 mm (FIG. 12A), 20 μm (FIG. 12D), 1 cm (FIG. 12G), and 5 cm (FIG. 12L). FIGS. 12A-12C, FIGS. 12D-12F, FIGS. 12G-12J, and FIGS. 12L and 12M have the same magnifications.
[0023] FIGS. 13A-13D show that SDS::SDS-BARNASE Arabidopsis plants are female sterile and that estradiol induction partially rescues the fertility of ER::amiR-BARNASE/SDS::SDS-BARNASE plants. FIGS. 13A-13C (same as FIGS. 5H-5J) show that dissected individual siliques from primary inflorescences (positions 1-9) were long in wild type (FIG. 13A), but short in SDS::SDS-BARNASE plants (FIG. 13B, without pollination; FIG. 13C, pollinated with WT pollen). FIG. 13D shows that estradiol induction partially rescues the fertility of ER::amiR-BARNASE/SDS::SDS-BARNASE plants.
[0024] FIG. 14 shows a comparison of SDS gene structure. Twenty-one SDS orthologs in dicots, monocots, and chlorophyta were analyzed by searching PIECE (Plant Intron Exon Comparison and Evolution database; http://wheat.pw.usda.gov/piece/). The Exalign viewer of PIECE shows SDS gene structures (exons, introns, and protein domains) and the relationship of exons in examined SDS orthologous genes. The exon-intron gene structure links to the species phylogeny. Color lines indicate different exon comparison results. The names of species and gene IDs are: Aquilegia coerulea (AcoGoldSmith_v1.023056m; SEQ ID NO:1); Arabidopsis lyrata (Aly_471662; SEQ ID NO:2); Arabidopsis thaliana (AT1G14750.1; SEQ ID NO:3); Brachypodium distachyon (Bradi1g69380.1; SEQ ID NO:4); Carica papaya (evm.model.supercontig_2.165; SEQ ID NO:5); Citrus clementina (clementine0.9_028383m; SEQ ID NO:6); Citrus sinensis (orange1.1g045573m; SEQ ID NO:7); Cucumis sativus (Cucsa.174110.1; SEQ ID NO:8); Eucalyptus grandis (Egrandis_v1_0.039610m; SEQ ID NO:9); Glycine max (Glyma02g09500.1; SEQ ID NO:10); Manihot esculenta (cassava4.1_033727m; SEQ ID NO:11); Mimulus guttatus (mgv1a024744m; SEQ ID NO:12); Oryza sativa (LOC_Os03g12414.1; SEQ ID NO:13); Populus trichocarpa (POPTR_0010s11430.1; SEQ ID NO:14); Prunus persica (ppa026778m; SEQ ID NO:15); Ricinus communis (29968.m000642; SEQ ID NO:16); Setaria italica (Si039334m; SEQ ID NO:17); Sorghum bicolor (Sb01g042340.1; SEQ ID NO:18); Vitis vinifera (GSVIVT01011625001; SEQ ID NO:19); Volvox carteri (Vca_96988; SEQ ID NO:20); Zea mays (GRMZM2G344416_T01; SEQ ID NO:21).
[0025] FIGS. 15A-15B show conserved regulatory motifs in introns of SDS genes. FIG. 15A shows MEME (Multiple Em for Motif Elicitation) suite motif sequence logos showing 5 regulatory motifs in introns of SDS genes: Motif 1 (SEQ ID NO:22); Motif 2 (SEQ ID NO:23); Motif 3 (SEQ ID NO:24); Motif 4 (SEQ ID NO:25); and Motif 5 (SEQ ID NO:26). Introns from 18 SDS orthologous genes were extracted and joined to a single sequence. Conserved regulatory motifs were analyzed by the MEME suite (http://meme-suite.org/). FIG. 15B shows locations of motifs in intron sequences. Black lines indicate joint intron sequences. Colored bars show sizes and positions of motifs. Motif 5 (the orange bar) is present in all dicots and monocots. Motifs 1-4 are mainly found in monocots. Numbers before the slash indicate the ordinal position of the intron containing Motif 5, and numbers after the slash indicate the total number of introns. Me, Manihot esculenta; Rc, Ricinus communis; Pt, Populus trichocarpa; Gm, Glycine max; Pp, Prunus persica; At, Arabidopsis thaliana; Al, Arabidopsis lyrata; Cp, Carica papaya; Cs, Citrus sinensis; Cc, Citrus clementina; Eg, Eucalyptus grandis; Vv, Vitis vinifera; Mg, Mimulus guttatus; Ac, Aquilegia coerulea; Sb, Sorghum bicolor; Zm, Zea mays; Si, Setaria italica; Os, Oryza sativa; Bd, Brachypodium distachyon.
[0026] FIGS. 16A-16O show that SDS::SDS-BARNASE results in complete bisexual sterility in Arabidopsis and tobacco plants. FIGS. 16A-16C show that wild-type Arabidopsis plants have red pollen in the anther (FIG. 16A) and normal seed production (FIGS. 16B and 16C). FIGS. 16D-16F show that sterile Arabidopsis plants have no pollen (FIG. 16D) or seed production (FIGS. 16E and 16F). FIGS. 16G-16I show that fertility-restored Arabidopsis plants have partially rescued red pollen (FIG. 16G) and seed production (FIGS. 16H and 16I). FIGS. 16J-16L show that wild-type tobacco plants have normal pollen (FIG. 16J) and seed production (FIGS. 16K and 16L). FIGS. 16M-16O show that sterile tobacco plants have no pollen (FIG. 16M) or seed production (FIGS. 16N and 16O).
[0027] FIG. 17 shows conserved SDS gene structure in grasses.
[0028] FIGS. 18A-18D show schematic diagrams of constructs. FIG. 18A shows the ablation construct previously used in dicot plants. FIG. 18B shows the ablation construct for generating bisexually sterile B. distachyon. FIG. 18C shows constructs for generating male sterile B. distachyon. Arrowheads indicate positions of regulatory motif 1 (M1), M2, M3 and M4. FIG. 18D shows the ethanol-inducible amiR-BARNASE fertility restoration construct that contains the inducible and fertility ablation unit.
DETAILED DESCRIPTION
[0029] The present invention provides a method for creating complete male and female sterility in plants, such as Arabidopsis (Arabidopsis thaliana), tobacco (Nicotiana tabacum), Brachypodium, and alfalfa. The disclosed methods provide an efficient strategy to specifically ablate microspore and megaspore mother cells using the SOLO DANCERS (SDS) and BARNASE fusion gene, which results in complete sterility in both male and female reproductive organs, but does not affect plant growth or development, including the production of all flower organs.
[0030] The present invention also relates to a fertility restoring system via inducible expression of an artificial microRNA targeting BARNASE. The fertility restoring system can restore fertility to male and female plants and can be used for plant hybrid breeding. The disclosed methods of restoring fertility suppress BARNASE enzyme activity by directly down-regulating the expression of BARNASE, thus providing a new tool to restore the fertility of BARNASE-induced sterile plants.
1. Definitions
[0031] The terms "comprise(s)," "include(s)," "having," "has," "can," "contain(s)," and variants thereof, as used herein, are intended to be open-ended transitional phrases, terms, or words that do not preclude the possibility of additional acts or structures. The singular forms "a," "an" and "the" include plural references unless the context clearly dictates otherwise. The present disclosure also contemplates other embodiments "comprising," "consisting of" and "consisting essentially of," the embodiments or elements presented herein, whether explicitly set forth or not.
[0032] For the recitation of numeric ranges herein, each intervening number therebetween with the same degree of precision is explicitly contemplated. For example, for the range of 6-9, the numbers 7 and 8 are contemplated in addition to 6 and 9, and for the range 6.0-7.0, the numbers 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, and 7.0 are explicitly contemplated.
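The range convention in paragraph [0032] can be illustrated with a minimal Python sketch. The function name `contemplated_values` and the choice to pass endpoints as strings (so that the written precision is preserved) are assumptions made for illustration only and are not part of the disclosure.

```python
from decimal import Decimal


def contemplated_values(low: str, high: str) -> list[str]:
    """List every value from low to high at the precision the endpoints are written in.

    Hypothetical helper: endpoints are strings so that "6.0" keeps its one
    decimal place, which a float would lose.
    """
    lo, hi = Decimal(low), Decimal(high)
    # Number of decimal places written in the endpoints (0 for "6", 1 for "6.0").
    places = max(-lo.as_tuple().exponent, -hi.as_tuple().exponent, 0)
    # The step is one unit in the last written decimal place.
    step = Decimal(1).scaleb(-places)
    values = []
    current = lo
    while current <= hi:
        values.append(str(current.quantize(step)))
        current += step
    return values


if __name__ == "__main__":
    print(contemplated_values("6", "9"))
    print(contemplated_values("6.0", "7.0"))
```

Decimal arithmetic is used here instead of floats so that repeated addition of 0.1 does not accumulate rounding error.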
[0033] "Chemically-inducible promoters" or "chemically-regulated promoters" as used interchangeably herein refer to a class of promoters that are modulated by chemical compounds that either turn off or turn on gene transcription. The chemicals that influence promoter activity are typically not naturally present in the organism where expression of the transgene is sought; are not toxic; affect only the expression of the gene of interest; are easy to apply or remove; and induce a clearly detectable expression pattern of either high or very low gene expression for their optimal use as modulators of gene expression.
[0034] "Coding sequence" or "encoding nucleic acid" as used herein means the nucleic acids (RNA or DNA molecule) that comprise a nucleotide sequence which encodes a protein. The coding sequence can further include initiation and termination signals operably linked to regulatory elements including a promoter and polyadenylation signal capable of directing expression in the cells of an individual plant or animal cell to which the nucleic acid is administered. The coding sequence may be codon optimized.
[0035] "Complement" or "complementary" as used herein refers to Watson-Crick (e.g., A-T/U and C-G) or Hoogsteen base pairing between nucleotides or nucleotide analogs of nucleic acid molecules. "Complementarity" refers to a property shared between two nucleic acid sequences, such that when they are aligned antiparallel to each other, the nucleotide bases at each position will be complementary.
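As an illustration of the antiparallel alignment described in paragraph [0035], the following Python sketch checks full Watson-Crick complementarity between two strands written 5' to 3'. The function name `are_complementary` is hypothetical, and the sketch deliberately covers only canonical A-T/U and C-G pairs; Hoogsteen pairing and nucleotide analogs are not modeled.

```python
# Canonical Watson-Crick partners; T and U both pair with A.
WATSON_CRICK = {
    "A": {"T", "U"},
    "T": {"A"},
    "U": {"A"},
    "C": {"G"},
    "G": {"C"},
}


def are_complementary(strand_a: str, strand_b: str) -> bool:
    """Return True if two 5'->3' strands pair at every position when aligned antiparallel.

    Hypothetical helper for illustration; any unrecognized base symbol
    makes the strands non-complementary.
    """
    a = strand_a.upper()
    b = strand_b.upper()
    if len(a) != len(b):
        return False
    # Antiparallel alignment: the first base of a faces the last base of b.
    for base_a, base_b in zip(a, reversed(b)):
        if base_b not in WATSON_CRICK.get(base_a, set()):
            return False
    return True


if __name__ == "__main__":
    print(are_complementary("ATGC", "GCAT"))  # DNA/DNA duplex
    print(are_complementary("AUGC", "GCAT"))  # RNA/DNA duplex
    print(are_complementary("ATGC", "ATGC"))  # same strand, not complementary
```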
[0036] As used herein, a "control plant" is a plant that is substantially equivalent to a test plant or modified plant in all parameters with the exception of the test parameters. For example, when referring to a plant into which a polynucleotide according to the present invention has been introduced, in certain embodiments, a control plant is an equivalent plant into which no such polynucleotide has been introduced. In certain embodiments, a control plant is an equivalent plant into which a control polynucleotide has been introduced. In such instances, the control polynucleotide is one that is expected to result in little or no phenotypic effect on the plant.
[0037] "Endogenous gene" as used herein refers to a gene that originates from within the plant or plant cell. An endogenous gene is native to the plant or plant cell, is in its normal genomic and chromatin context, and is not heterologous to the plant or plant cell.
[0038] A "functional homolog," "functional equivalent," or "functional fragment" of a polypeptide of the present invention is a polypeptide that is homologous to the specified polypeptide but has one or more amino acid differences from the specified polypeptide. A functional fragment or equivalent of a polypeptide retains at least some, if not all, of the activity of the specified polypeptide.
[0039] A "fusion protein" as used herein refers to an artificially made or recombinant molecule that comprises two or more protein sequences that are not naturally found within the same protein. The fusion protein may include non-proteinaceous elements as well as proteinaceous elements.
[0040] "Genetic construct" as used herein refers to the DNA or RNA molecules that comprise a nucleotide sequence that encodes a protein. The coding sequence includes initiation and termination signals operably linked to regulatory elements including a promoter and polyadenylation signal capable of directing expression in the cells of the individual to whom the nucleic acid molecule is administered. As used herein, the term "expressible form" refers to gene constructs that contain the necessary regulatory elements operably linked to a coding sequence that encodes a protein such that when present in the cell of the individual, the coding sequence will be expressed.
[0041] "Genetically modified" or "GM" as used interchangeably herein refers to an organism or crop containing genetic material that has been artificially altered so as to produce a desired characteristic.
[0042] "Identical" or "identity" as used herein in the context of two or more nucleic acids or polypeptide sequences means that the sequences have a specified percentage of residues that are the same over a specified region. The percentage may be calculated by optimally aligning the two sequences, comparing the two sequences over the specified region, determining the number of positions at which the identical residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the specified region, and multiplying the result by 100 to yield the percentage of sequence identity. In cases where the two sequences are of different lengths or the alignment produces one or more staggered ends and the specified region of comparison includes only a single sequence, the residues of the single sequence are included in the denominator but not the numerator of the calculation. When comparing DNA and RNA, thymine (T) and uracil (U) may be considered equivalent. The identity calculation may be performed manually or by using a computer sequence algorithm such as BLAST or BLAST 2.0.
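The percentage calculation described above can be sketched in Python as follows. This is a minimal illustration that assumes the two sequences have already been optimally aligned and padded to equal length with a gap character; the function name `percent_identity` and its `nucleic` and `gap` parameters are hypothetical choices for this example. It does not perform the alignment itself, for which a tool such as BLAST would be used.

```python
def percent_identity(aligned_a: str, aligned_b: str,
                     nucleic: bool = True, gap: str = "-") -> float:
    """Percent identity over an already-aligned region of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("the comparison region is empty")

    def normalize(residue: str) -> str:
        residue = residue.upper()
        # T and U are treated as equivalent only for nucleic acids.
        return "T" if nucleic and residue == "U" else residue

    matched = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            # Position covered by a single sequence (e.g., a staggered end):
            # it counts toward the denominator but not the numerator.
            continue
        if normalize(res_a) == normalize(res_b):
            matched += 1

    # matched positions / total positions in the region, times 100
    return 100.0 * matched / len(aligned_a)
```

For instance, `percent_identity("ACGT--", "ACGTAA")` divides 4 matched positions by 6 total positions, so the two overhanging residues lower the result to roughly 66.7%.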
[0043] Optimal alignment of sequences for comparison may be conducted by methods commonly known in the art, for example by the search for similarity method described by Pearson and Lipman 1988, Proc. Natl. Acad. Sci. USA 85: 2444-2448, by computerized implementations of algorithms such as GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), Madison, Wis., or by inspection. In a preferred embodiment, protein and nucleic acid sequence identities are evaluated using the Basic Local Alignment Search Tool ("BLAST"), which is well known in the art (Karlin and Altschul, Proc. Natl. Acad. Sci. USA 87: 2267-2268 (1990); Altschul et al., Nucl. Acids Res. 25: 3389-3402 (1997)), the disclosures of which are incorporated by reference in their entireties. The BLAST programs identify homologous sequences by identifying similar segments, which are referred to herein as "high-scoring segment pairs," between a query amino acid or nucleic acid sequence and a test sequence which is preferably obtained from a protein or nucleic acid sequence database. Preferably, the statistical significance of a high-scoring segment pair is evaluated using the statistical significance formula (Karlin and Altschul, 1990). The BLAST programs can be used with the default parameters or with modified parameters provided by the user.
[0044] The terms "isolated," "purified" or "biologically pure" refer to material that is substantially or essentially free from components that normally accompany it as found in its native state. Purity and homogeneity are typically determined using analytical chemistry techniques such as polyacrylamide gel electrophoresis or high performance liquid chromatography. A protein that is the predominant species present in a preparation is substantially purified. In particular, an isolated nucleic acid of the present invention is separated from open reading frames that flank the desired gene and encode proteins other than the desired protein. The term "purified" denotes that a nucleic acid or protein gives rise to essentially one band in an electrophoretic gel. Particularly, it means that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and most preferably at least 99% pure.
[0045] "Nucleic acid" or "oligonucleotide" or "polynucleotide" as used herein means at least two nucleotides covalently linked together. The depiction of a single strand also defines the sequence of the complementary strand. Thus, a nucleic acid also encompasses the complementary strand of a depicted single strand. Many variants of a nucleic acid may be used for the same purpose as a given nucleic acid. Thus, a nucleic acid also encompasses substantially identical nucleic acids and complements thereof. A single strand provides a probe that may hybridize to a target sequence under stringent hybridization conditions. Thus, a nucleic acid also encompasses a probe that hybridizes under stringent hybridization conditions.
[0046] Nucleic acids may be single stranded or double stranded, or may contain portions of both double stranded and single stranded sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and combinations of bases including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine, hypoxanthine, isocytosine and isoguanine. Nucleic acids may be obtained by chemical synthesis methods or by recombinant methods.
[0047] The specificity of single-stranded DNA to hybridize complementary fragments is determined by the "stringency" of the reaction conditions (Sambrook et al., Molecular Cloning and Laboratory Manual, Second Ed., Cold Spring Harbor (1989)). Hybridization stringency increases as the propensity to form DNA duplexes decreases. In nucleic acid hybridization reactions, the stringency can be chosen to favor specific hybridizations (high stringency), which can be used to identify, for example, full-length clones from a library. Less-specific hybridizations (low stringency) can be used to identify related, but not exact (homologous, but not identical), DNA molecules or segments.
[0048] The stability of DNA duplexes depends on: (1) the number of complementary base pairs; (2) the type of base pairs; (3) salt concentration (ionic strength) of the reaction mixture; (4) the temperature of the reaction; and (5) the presence of certain organic solvents, such as formamide, which decrease DNA duplex stability. In general, the longer the probe, the higher the temperature required for proper annealing. A common approach is to vary the temperature; higher relative temperatures result in more stringent reaction conditions.
[0049] To hybridize under "stringent conditions" describes hybridization protocols in which nucleotide sequences at least 60% homologous to each other remain hybridized. Generally, stringent conditions are selected to be about 5°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength, pH, and nucleic acid concentration) at which 50% of the probes complementary to the target sequence hybridize to the target sequence at equilibrium. Since the target sequences are generally present in excess, at Tm, 50% of the probes are occupied at equilibrium.
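The relationship between Tm and a stringent hybridization temperature can be illustrated with simple arithmetic. The sketch below is a rough illustration only: the text above does not specify how Tm is estimated, so the Wallace rule (2°C per A/T and 4°C per G/C, a common rule of thumb for short oligonucleotides) is used here as an assumed stand-in. The function names `wallace_tm` and `stringent_temperature` are hypothetical, and only the 5°C offset comes from the definition above.

```python
def wallace_tm(oligo: str) -> float:
    """Rough Tm in degrees C by the Wallace rule: 2*(A+T) + 4*(G+C).

    Assumption: this rule of thumb is not taken from the text above and
    is only reasonable for short oligonucleotide probes.
    """
    seq = oligo.upper()
    at = sum(seq.count(base) for base in "ATU")
    gc = sum(seq.count(base) for base in "GC")
    if not seq or at + gc != len(seq):
        raise ValueError("expected a non-empty sequence of A, C, G, T/U")
    return 2.0 * at + 4.0 * gc


def stringent_temperature(tm_celsius: float, offset: float = 5.0) -> float:
    """Temperature about 5 degrees C below Tm, as stringent conditions are selected."""
    return tm_celsius - offset
```

For a 16-nucleotide probe with 8 A/T and 8 G/C residues, the estimate gives a Tm of 48°C and a stringent temperature of 43°C. In practice the Tm also depends on ionic strength, pH, and formamide concentration, which this sketch ignores.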
[0050] "Stringent hybridization conditions" are conditions that enable a probe, primer, or oligonucleotide to hybridize only to its target sequence. Stringent conditions are sequence-dependent and will differ. Stringent conditions comprise: (1) low ionic strength and high temperature washes, for example 15 mM sodium chloride, 1.5 mM sodium citrate, 0.1% sodium dodecyl sulfate, at 50°C; (2) a denaturing agent during hybridization, e.g. 50% (v/v) formamide, 0.1% bovine serum albumin, 0.1% Ficoll, 0.1% polyvinylpyrrolidone, 50 mM sodium phosphate buffer (750 mM sodium chloride, 75 mM sodium citrate; pH 6.5), at 42°C; or (3) 50% formamide. Washes typically also comprise 5xSSC (0.75 M NaCl, 75 mM sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5xDenhardt's solution, sonicated salmon sperm DNA (50 μg/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with a wash at 42°C in 0.2xSSC (sodium chloride/sodium citrate) and 50% formamide at 55°C, followed by a high-stringency wash consisting of 0.1xSSC containing EDTA at 55°C. Preferably, the conditions are such that sequences at least about 65%, 70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous to each other typically remain hybridized to each other. These conditions are presented as examples and are not meant to be limiting.
[0051] "Moderately stringent conditions" use washing solutions and hybridization conditions that are less stringent, such that a polynucleotide will hybridize to the entire target sequence or to fragments, derivatives, or analogs thereof. One example comprises hybridization in 6xSSC, 5xDenhardt's solution, 0.5% SDS and 100 μg/ml denatured salmon sperm DNA at 55°C, followed by one or more washes in 1xSSC, 0.1% SDS at 37°C. The temperature, ionic strength, etc., can be adjusted to accommodate experimental factors such as probe length. Other moderate stringency conditions have been described (Ausubel et al., Current Protocols in Molecular Biology, Volumes 1-3, John Wiley & Sons, Inc., Hoboken, N.J. (1993); Kriegler, Gene Transfer and Expression: A Laboratory Manual, Stockton Press, New York, N.Y. (1990); Perbal, A Practical Guide to Molecular Cloning, 2nd edition, John Wiley & Sons, New York, N.Y. (1988)).
[0052] "Low stringent conditions" use washing solutions and hybridization conditions that are less stringent than those for moderate stringency, such that a polynucleotide will hybridize to the entire target sequence or to fragments, derivatives, or analogs thereof. A nonlimiting example of low stringency hybridization conditions includes hybridization in 35% formamide, 5xSSC, 50 mM Tris HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 μg/ml denatured salmon sperm DNA, 10% (wt/vol) dextran sulfate at 40°C, followed by one or more washes in 2xSSC, 25 mM Tris HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS at 50°C. Other conditions of low stringency, such as those for cross-species hybridizations, are well-described (Ausubel et al., 1993; Kriegler, 1990).
[0053] "Operably linked" as used herein means that expression of a gene is under the control of a promoter with which it is spatially connected. A promoter may be positioned 5' (upstream) or 3' (downstream) of a gene under its control. The distance between the promoter and a gene may be approximately the same as the distance between that promoter and the gene it controls in the gene from which the promoter is derived. As is known in the art, variation in this distance may be accommodated without loss of promoter function.
[0054] As used herein, the term "plant" includes reference to whole plants, plant organs (e.g., leaves, stems, roots, etc.), seeds, plant cells, and progeny of same. Parts of transgenic plants comprise, for example, plant cells, protoplasts, tissues, callus, embryos as well as flowers, ovules, stems, fruits, leaves, roots originating in transgenic plants or their progeny previously transformed with a DNA. As used herein, the term "plant cell" includes, without limitation, protoplasts and cells of seeds, suspension cultures, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores.
[0055] "Promoter" as used herein means a synthetic or naturally-derived molecule which is capable of conferring, activating or enhancing expression of a nucleic acid in a cell. A promoter may comprise one or more specific transcriptional regulatory sequences to further enhance expression and/or to alter the spatial expression and/or temporal expression of same. A promoter may also comprise distal enhancer or repressor elements, which may be located as much as several thousand base pairs from the start site of transcription. A promoter may be derived from sources including viruses, bacteria, fungi, plants, insects, and animals. A promoter may regulate the expression of a gene component constitutively, or differentially with respect to the cell, tissue, or organ in which expression occurs, with respect to the developmental stage at which expression occurs, or in response to external stimuli such as physiological stresses, pathogens, metal ions, or inducing agents.
[0056] The term "substantial identity" of polynucleotide sequences means that a polynucleotide comprises a sequence that has at least 25% sequence identity compared to a reference sequence as determined using the programs described herein; preferably BLAST using standard parameters, as described. Alternatively, percent identity can be any integer from 25% to 100%. More preferred embodiments include polynucleotide sequences that have at least about: 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity compared to a reference sequence. These values can be appropriately adjusted to determine corresponding identity of proteins encoded by two nucleotide sequences by taking into account codon degeneracy, amino acid similarity, reading frame positioning, and the like. Accordingly, polynucleotides of the present invention encoding a protein of the present invention include nucleic acid sequences that have substantial identity to the nucleic acid sequences that encode the polypeptides of the present invention. Polynucleotides encoding a polypeptide comprising an amino acid sequence that has at least about: 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity compared to a reference polypeptide sequence are also preferred.
[0057] The term "substantial identity" of amino acid sequences (and of polypeptides having these amino acid sequences) normally means sequence identity of at least 40% compared to a reference sequence as determined using the programs described herein; preferably BLAST using standard parameters, as described. Preferred percent identity of amino acids can be any integer from 40% to 100%. More preferred embodiments include amino acid sequences that have at least about: 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity compared to a reference sequence. Polypeptides that are "substantially identical" share amino acid sequences as noted above except that residue positions which are not identical may differ by conservative amino acid changes. Conservative amino acid substitutions refer to the interchangeability of residues having similar side chains. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids having sulfur-containing side chains is cysteine and methionine. Preferred conservative amino acid substitution groups are: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine, aspartic acid-glutamic acid, and asparagine-glutamine. Accordingly, polypeptides or proteins, encoded by the polynucleotides of the present invention, include amino acid sequences that have substantial identity to the amino acid sequences of the polypeptides, encoded by the polynucleotides of the present invention, which are compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants.
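The side-chain groups and preferred substitution groups listed above can be written down as simple lookup sets. The following Python sketch is only an illustration of how such a check might be organized; the names `SIDE_CHAIN_GROUPS`, `PREFERRED_GROUPS`, and `is_conservative` are hypothetical, and the group memberships are transcribed from the paragraph above using one-letter amino acid codes.

```python
# Side-chain groups named in the definition above (one-letter codes).
SIDE_CHAIN_GROUPS = {
    "aliphatic": set("GAVLI"),
    "aliphatic-hydroxyl": set("ST"),
    "amide-containing": set("NQ"),
    "aromatic": set("FYW"),
    "basic": set("KRH"),
    "sulfur-containing": set("CM"),
}

# Preferred conservative substitution groups named in the definition above.
PREFERRED_GROUPS = [
    set("VLI"), set("FY"), set("KR"), set("AV"), set("DE"), set("NQ"),
]


def is_conservative(original: str, replacement: str, groups) -> bool:
    """True if two different residues fall within at least one shared group."""
    a, b = original.upper(), replacement.upper()
    return a != b and any(a in group and b in group for group in groups)
```

For example, `is_conservative("L", "I", PREFERRED_GROUPS)` is true, whereas `is_conservative("G", "A", PREFERRED_GROUPS)` is false even though glycine and alanine share the broader aliphatic group in `SIDE_CHAIN_GROUPS.values()`.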
[0058] "Target plant" as used herein refers to a plant or tree that will be transformed with recombinant genetic material not normally found in plants or trees of this type and which will be introduced into the plant in question (or into progenitors of the plant) by human manipulation.
[0059] "Transgene" as used herein refers to a gene or genetic material containing a gene sequence that has been isolated from one organism, such as one plant or plant cell, and is introduced into a different organism, such as a different plant or plant cell. This non-native segment of DNA may retain the ability to produce RNA or protein in the transgenic organism, such as the transgenic plant, or it may alter the normal function of the transgenic organism's genetic code. The introduction of a transgene has the potential to change the phenotype of an organism, such as a plant.
[0060] "Transgenic plant" as used herein refers to a plant or tree that contains recombinant genetic material not normally found in plants or trees of this type and which has been introduced into the plant in question (or into progenitors of the plant) by human manipulation. Thus, a plant that is grown from a plant cell into which recombinant DNA is introduced by transformation is a transgenic plant, as are all offspring of that plant that contain the introduced transgene (whether produced sexually or asexually). It is understood that the term transgenic plant encompasses the entire plant or tree and parts of the plant or tree, for instance grains, seeds, flowers, leaves, roots, fruit, pollen, stems etc.
[0061] "Variant" used herein with respect to a nucleic acid means (i) a portion or fragment of a referenced nucleotide sequence; (ii) the complement of a referenced nucleotide sequence or portion thereof; (iii) a nucleic acid that is substantially identical to a referenced nucleic acid or the complement thereof; or (iv) a nucleic acid that hybridizes under stringent conditions to the referenced nucleic acid, complement thereof, or a sequence substantially identical thereto.
[0062] "Variant" with respect to a peptide or polypeptide means a peptide or polypeptide that differs in amino acid sequence by the insertion, deletion, or conservative substitution of amino acids, but retains at least one biological activity. Variant may also mean a protein with an amino acid sequence that is substantially identical to a referenced protein with an amino acid sequence that retains at least one biological activity. A conservative substitution of an amino acid, i.e., replacing an amino acid with a different amino acid of similar properties (e.g., hydrophilicity, degree and distribution of charged regions) is recognized in the art as typically involving a minor change. These minor changes may be identified, in part, by considering the hydropathic index of amino acids, as understood in the art (Kyte et al., J. Mol. Biol. 157: 105-132 (1982)). The hydropathic index of an amino acid is based on a consideration of its hydrophobicity and charge. It is known in the art that amino acids of similar hydropathic indexes may be substituted and still retain protein function. In one aspect, amino acids having hydropathic indexes within ±2 of each other are substituted. The hydrophilicity of amino acids may also be used to reveal substitutions that would result in proteins retaining biological function. A consideration of the hydrophilicity of amino acids in the context of a peptide permits calculation of the greatest local average hydrophilicity of that peptide. Substitutions may be performed with amino acids having hydrophilicity values within ±2 of each other. Both the hydrophobicity index and the hydrophilicity value of amino acids are influenced by the particular side chain of that amino acid. Consistent with that observation, amino acid substitutions that are compatible with biological function are understood to depend on the relative similarity of the amino acids, and particularly the side chains of those amino acids, as revealed by the hydrophobicity, hydrophilicity, charge, size, and other properties.
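The ±2 hydropathic index criterion can be illustrated with the Kyte-Doolittle scale from the reference cited above. The Python sketch below is a minimal illustration under the assumption that the criterion means the two index values differ by no more than 2; the names `KYTE_DOOLITTLE` and `within_hydropathy_window` are hypothetical, and the sketch does not address the separate hydrophilicity values also mentioned in the definition.

```python
# Kyte-Doolittle hydropathic index values (one-letter amino acid codes).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def within_hydropathy_window(original: str, replacement: str,
                             window: float = 2.0) -> bool:
    """True if the hydropathic indexes of two residues differ by <= window."""
    a = KYTE_DOOLITTLE[original.upper()]
    b = KYTE_DOOLITTLE[replacement.upper()]
    return abs(a - b) <= window
```

For example, isoleucine (4.5) and leucine (3.8) fall within the window, while isoleucine and aspartic acid (-3.5) do not. A check of this kind only screens candidate substitutions; whether a given variant retains biological activity remains an experimental question.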
[0063] "Vector" as used herein means a nucleic acid sequence containing an origin of replication. A vector may be a viral vector, bacteriophage, bacterial artificial chromosome or yeast artificial chromosome. A vector may be a DNA or RNA vector. A vector may be a self-replicating extrachromosomal vector, and preferably, is a DNA plasmid. For example, the vector may encode a composition for generating male sterility and female sterility and/or composition for restoring fertility in the male sterile and female sterile plants, as disclosed herein. Alternatively, the vector may comprise a polynucleotide sequence encoding a composition for generating male sterility and female sterility and/or composition for restoring fertility in the male sterile and female sterile plants, as disclosed herein.
[0064] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art. In case of conflict, the present document, including definitions, will control. Preferred methods and materials are described below, although methods and materials similar or equivalent to those described herein can be used in practice or testing of the present invention. All publications, patent applications, patents and other references mentioned herein are incorporated by reference in their entirety. The materials, methods, and examples disclosed herein are illustrative only and not intended to be limiting.
2. Compositions for Generating Male Sterility and Female Sterility
[0065] Provided herein are compositions for generating male sterility and female sterility in plants. The SOLO-DANCERS (SDS)::SDS-BARNASE system can be used to generate both male and female sterile plants without affecting growth or flower structure. The SDS::SDS-BARNASE system includes an isolated polynucleotide construct that encodes an SDS-BARNASE fusion protein. The isolated polynucleotide construct includes a first polynucleotide and a second polynucleotide that are operably linked to an SDS promoter. The first polynucleotide includes a SOLO-DANCERS (SDS) gene or fragment thereof. The second polynucleotide includes a Barnase gene or fragment thereof. The SDS gene includes the SDS promoter.
a. SOLO-DANCERS (SDS) Gene
[0066] The SOLO-DANCERS (SDS) gene encodes a meiosis-specific cyclin that is involved in homolog interaction during meiotic prophase I in Arabidopsis. The sds mutant grows and develops normally but is male and female sterile due to a meiosis defect. The SDS protein is exclusively present in pollen mother cells in anthers and megaspore mother cells in ovules. The SDS-BARNASE fusion protein does not create any toxicity in other cells or tissues. RNA in situ hybridization analysis shows that SDS is specifically expressed in micro- and megaspore mother cells (or male and female meiocytes); however, as disclosed herein, the SDS promoter alone does not achieve the exclusive expression of GUS or BARNASE in either micro- or megaspore mother cells. Conversely, the SDS genomic fragment containing the promoter, introns and exons does achieve the exclusive expression of GUS or BARNASE in either micro- or megaspore mother cells. Regulatory motifs in SDS introns may contribute to its specific spatial and temporal expression. Intron-dependent spatial expression has been revealed in different genes in various species.
[0067] SDS, existing in both dicots and monocots, is distantly related to other cyclins, and thus represents a unique type of (SDS-type) cyclin. Analysis of 21 SDS orthologs using PIECE (Plant Intron and Exon Comparative and Evolution; http://wheat.pw.usda.gov/piece/) shows that the length and numbers of exons in SDS genes are similar in higher plants, especially in the Cyclin N domain that spans the 3 most conserved exons (see FIG. 14). The length of SDS introns among dicots is different, whereas the gene structure of SDS in monocots is conserved. Five novel regulatory motifs were identified in SDS introns via the MEME (Multiple Em for Motif Elicitation) suite (http://meme-suite.org/tools/meme) (FIG. 15A). Among them, motif 5 is present in all examined dicots and monocots, while motif 1 is unique to monocots (FIG. 15B). Motif 5, which is found in all examined plants, may play an important role in the specific expression of the SDS gene.
[0068] In some embodiments, the SDS gene can be the SDS gene from Arabidopsis (Arabidopsis thaliana), Purple false brome (Brachypodium distachyon), Brachypodium sylvaticum, Rice (Oryza sativa), False brome (Brachypodium stacei), Switchgrass (Panicum virgatum), Aquilegia coerulea, Arabidopsis lyrata, Carica papaya, Citrus clementina, Citrus sinensis, Turnip mustard (Brassica rapa), Barrel medic (Medicago truncatula), Soybean (Glycine max), Cucumber (Cucumis sativus), Potato (Solanum lycopersicum), Maize (Zea mays), Manihot esculenta, Mimulus guttatus, Hall's panicgrass (Panicum hallii), Foxtail millet (Setaria italica), Sorghum (Sorghum bicolor), Green foxtail (Setaria viridis), Poplar (Populus trichocarpa), Rose gum (Eucalyptus grandis), Ricinus communis, Vitis vinifera, Volvox carteri, or Cherry (Prunus persica).
[0069] In some embodiments, the SDS::SDS-BARNASE system includes a synthetic promoter that confers strong and specific SDS expression in micro- and megaspore mother cells. The synthetic promoter can be used to produce absolute male and female sterility in various plants. In some embodiments, the synthetic promoter is the SDS promoter from the SDS gene from Arabidopsis (Arabidopsis thaliana), Purple false brome (Brachypodium distachyon), Brachypodium sylvaticum, Rice (Oryza sativa), False brome (Brachypodium stacei), Switchgrass (Panicum virgatum), Aquilegia coerulea, Arabidopsis lyrata, Carica papaya, Citrus clementina, Citrus sinensis, Turnip mustard (Brassica rapa), Barrel medic (Medicago truncatula), Soybean (Glycine max), Cucumber (Cucumis sativus), Potato (Solanum lycopersicum), Maize (Zea mays), Manihot esculenta, Mimulus guttatus, Hall's panicgrass (Panicum hallii), Foxtail millet (Setaria italica), Sorghum (Sorghum bicolor), Green foxtail (Setaria viridis), Poplar (Populus trichocarpa), Rose gum (Eucalyptus grandis), Ricinus communis, Vitis vinifera, Volvox carteri, or Cherry (Prunus persica). The synthetic promoter can be used with one or more regulatory introns. The one or more regulatory introns can include one or more of motifs 1-5.
[0070] In some embodiments, the SDS gene includes at least one regulatory intron. For example, the isolated SDS gene can include between 1 and 5 regulatory introns, between 2 and 5 regulatory introns, between 3 and 5 regulatory introns, between 4 and 5 regulatory introns, between 1 and 4 regulatory introns, between 2 and 4 regulatory introns, between 3 and 4 regulatory introns, between 1 and 3 regulatory introns, between 2 and 3 regulatory introns, or between 1 and 2 regulatory introns. In some embodiments, the SDS gene includes at least 1 regulatory intron, at least 2 regulatory introns, at least 3 regulatory introns, at least 4 regulatory introns, or at least 5 regulatory introns. In some embodiments, the SDS gene can include between 1 and 5 motifs, between 2 and 5 motifs, between 3 and 5 motifs, between 4 and 5 motifs, between 1 and 4 motifs, between 2 and 4 motifs, between 3 and 4 motifs, between 1 and 3 motifs, between 2 and 3 motifs, or between 1 and 2 motifs. In some embodiments, the SDS gene includes at least 1 motif, at least 2 motifs, at least 3 motifs, at least 4 motifs, or at least 5 motifs. In some embodiments, the regulatory intron includes a polynucleotide sequence of any one of SEQ ID NO: 22-26 or 47-51. In some embodiments, the motif includes a polynucleotide sequence of any one of SEQ ID NO: 22-26 or 47-51. In some embodiments, the SDS gene includes a polynucleotide sequence of any one of SEQ ID NO: 1-21 or 29-46.
b. BARNASE Gene
[0071] The barnase protein (also referred to as "Barnase") is an RNase that has 110 amino acid residues and hydrolyzes RNA. Barnase originates from Bacillus amyloliquefaciens. When expressed in cells, this enzyme inhibits the functions of the cells as a result of its potent RNase activity and thus causes cell death in many cases. By using this characteristic, the function of a specific site of a plant is expected to be selectively controllable by expressing the barnase gene at that site. In some embodiments, the barnase gene includes the polynucleotide sequence of SEQ ID NO: 27.
3. Compositions for Restoring Fertility
[0072] Provided herein are compositions for restoring fertility in the male sterile and female sterile plants that already include a first isolated polynucleotide construct as described above. The compositions for restoring fertility involve an artificial microRNA system that inhibits BARNASE expression to restore plant fertility. To restore fertility to both male and female sterile plants, the artificial microRNA system, such as the ER::amiR-BARNASE system, induces the expression of an artificial microRNA (amiRNA) to post-transcriptionally suppress the expression of BARNASE. Instead of inhibiting the BARNASE activity by BARSTAR at the protein level, the amiR-BARNASE system, under the control of an inducible promoter, such as the estradiol inducible promoter, suppresses the expression of BARNASE at the post-transcriptional level, which consequently decreases the accumulation of BARNASE protein. Not only does the inducible treatment, such as estradiol treatment, restore fertility of male sterile and female sterile plants, such as SDS::SDS-BARNASE/ER::amiR-BARNASE double transgenic plants, but also the offspring of these plants are completely sterile. The amiR-BARNASE system, such as the ER::amiR-BARNASE system, can be used as an alternative approach to conveniently and efficiently restore fertility of BARNASE-induced sterile plants.
[0073] The compositions for restoring fertility include a second isolated polynucleotide construct. The second isolated polynucleotide construct includes an inducible promoter operably linked to an artificial microRNA (amiRNA) targeted to the Barnase gene or fragment thereof. The fertility of the plant is restored by inducing the expression of the amiRNA. In some embodiments, the plant becomes male fertile and female fertile after the induction of amiRNA. In some embodiments, the second isolated polynucleotide construct includes estradiol (ER)::amiR-BARNASE. In some embodiments, the amiRNA includes a polynucleotide sequence of SEQ ID NO: 28.
[0074] In some embodiments, the isolated polynucleotide construct that encodes the SDS-BARNASE fusion protein and the second isolated polynucleotide construct are encoded on the same vector. In some embodiments, the isolated polynucleotide construct that encodes the SDS-BARNASE fusion protein and the second isolated polynucleotide construct are encoded on separate vectors.
a. Inducible Promoter
[0075] An "inducible" promoter is one which is capable of directing a level of transcription of an operably linked nucleic acid sequence in the presence of a stimulus or environmental stress (e.g., heat shock, irradiation, chemicals, etc.), wherein the level of the transcription is different from that in the absence of the stimulus. In some embodiments, the inducible promoter is a promoter that is induced by a chemical, such as estradiol, dexamethasone, methoxyfenozide, and ethanol, or by heat shock. In some embodiments, the inducible promoter is an estradiol-inducible, glucocorticoid-inducible, tetracycline-inducible, pristinamycin-inducible, pathogen-inducible, steroid-inducible, such as glucocorticoid-inducible, estrogen-inducible, metal-inducible, such as copper-inducible, herbicide safener-inducible, alcohol-inducible, such as an ethanol-inducible, isopropyl β-D-1-thiogalactopyranoside-inducible, or ecdysone-inducible promoter. In some embodiments, the inducible promoter is an estradiol inducible promoter, an ethanol inducible promoter, a dexamethasone inducible promoter, a methoxyfenozide inducible promoter or a temperature inducible promoter. In some embodiments, the inducible promoter is induced by environmental factors such as water or salt stress, anaerobiosis, temperature, such as cold- and heat-inducible, illumination, and wounding. In some embodiments, the inducible promoter is a heat shock inducible promoter or a heat inducible promoter. Examples of inducible promoters are described in U.S. Patent Publication No. 20130042371, which is incorporated by reference herein in its entirety.
[0076] In some embodiments, the inducible promoter is induced or activated by a chemical. In some embodiments, the chemical is applied to the transgenic plant by a foliar spray or root drenching. In some embodiments, the chemical is applied to the transgenic plant by dipping the reproductive organs of the plant in the chemical or solution containing said chemical. In some embodiments, the reproductive organ is an inflorescence.
4. Methods of Generating Transgenic Plants with Male Sterility and Female Sterility
[0077] The present invention is directed to a method for generating a complete male sterile and female sterile plant using the SDS::SDS-BARNASE system. The method includes introducing into a target plant an isolated polynucleotide construct containing the SOLO-DANCERS (SDS) gene or fragment thereof, and the Barnase gene or fragment thereof, as described above, to generate a transgenic plant that is male sterile and female sterile. In some embodiments, the SDS gene is an endogenous gene of the target plant. In some embodiments, the SDS gene is a transgene to the target plant.
5. Methods of Restoring Fertility in Male Sterile and Female Sterile Plants
[0078] The present invention is directed to methods of restoring fertility in a male sterile and female sterile transgenic plant, as described above. The methods of restoring fertility can be used for plant hybrid breeding. The method includes introducing into a target plant a second isolated polynucleotide construct that includes an inducible promoter operably linked to an artificial microRNA (amiRNA) targeted to the Barnase gene or fragment thereof, thereby generating a transgenic plant; introducing into the generated transgenic plant an isolated polynucleotide construct that includes a first polynucleotide and a second polynucleotide, the first polynucleotide comprising a SOLO-DANCERS (SDS) gene or fragment thereof, the second polynucleotide comprising a Barnase gene or fragment thereof, wherein the SDS gene comprises the SDS promoter, as described above, thereby generating a double transgenic plant; and inducing the expression of the amiRNA, thereby restoring fertility in a completely male sterile and female sterile transgenic plant. In some embodiments, the transgenic plant becomes male fertile and female fertile after the induction of amiRNA.
[0079] In some embodiments, the expression of the amiRNA is induced when the transgenic plant is flowering. In some embodiments, the method restores at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, or at least about 100% fertility.
6. Methods of Ablating Microspore and Megaspore Mother Cells
[0080] The present invention is directed to a method of genetically ablating pollen and megaspore mother cells. Megaspore and pollen mother cells are two small groups of reproductive cells, which are differentiated after all floral organs are established. Ablating pollen and megaspore mother cells eliminates only the male and female gametes; it does not affect differentiation of any other somatic cells or flower development. The method includes introducing into a target plant an isolated polynucleotide construct containing the SOLO-DANCERS (SDS) gene or fragment thereof, and the Barnase gene or fragment thereof, as described above, to generate a transgenic plant wherein the microspore and megaspore mother cells are ablated. In some embodiments, the SDS gene is an endogenous gene of the target plant. In some embodiments, the SDS gene is a transgene to the target plant.
7. Target Plant
[0081] The methods described herein can be used to provide a valuable resource for wood production, biofuels, bioremediation, and many other applications. The methods can be used to produce transgenic trees for wood production (such as poplar, eucalypts, and pines), grasses for biofuels (such as miscanthus and switchgrass), plants for bioremediation (such as turf grasses and forage crops), ornamental plants that avoid fruit production (e.g., ornamental cherry or crabapple trees), or invasive and ornamental plants. Invasive plants rendered male and female sterile by our method can be planted for multiple purposes, such as forestry and horticulture.
[0082] The target plant to be transformed to produce the transgenic plant may be any plant species, including non-vascular plants and vascular plants. The non-vascular plant may include a bryophyte, such as Physcomitrella patens. The vascular plants may include pteridophytes, such as Selaginella martensii, angiosperms, and gymnosperms. The angiosperms may include a monocot plant or a dicot plant. The plant may be a crop plant, such as a cereal, a fruit, a legume, or a root crop, an ornamental plant, or a non-food crop, such as cotton, hemp (Cannabis sativa), flax or linseed (Linum usitatissimum), oilseed rape or high erucic acid rape (Brassica napus), balsam poplar (Populus balsamifera), tobacco (Nicotiana tabacum), and switchgrass (e.g., Panicum virgatum).
[0083] In some embodiments, the target plant is a gymnosperm or angiosperm. In some embodiments, the plant is a grass, tree, or ornamental plant. Suitable plant species include, without limitation, corn (Zea mays), soybean (Glycine max), Brassica sp. (e.g., Arabidopsis thaliana, Brassica napus, B. rapa, and B. juncea), alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), millet (e.g., pearl millet (Pennisetum glaucum), proso millet (Panicum miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana)), sunflower (Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaea), pea (Pisum sativum), cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (Ipomoea batatas), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), grape (Vitis vinifera), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus carica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats (Avena sativa), barley (Hordeum vulgare), vegetables, ornamentals, and conifers.
[0084] Vegetables include, without limitation, tomatoes (Lycopersicon esculentum), lettuce (e.g., Lactuca sativa), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus spp.), and members of the genus Cucumis such as cucumber (C. sativus), cantaloupe (C. cantalupensis), and musk melon (C. melo). In some embodiments, the target plant is Arabidopsis, tobacco, alfalfa, soybean, maize, rice, Brachypodium, switchgrass, Miscanthus, poplars, cherry, or Eucalyptus.
a. Grasses
[0085] The grass family of monocotyledonous flowering plants (monocots) is the most important plant family for humans and the environment in which we live. Besides traditional uses of grasses, many grass species can provide a large and sustainable cellulosic biomass feedstock. Recently, switchgrass was selected as a biomass feedstock for renewable bioenergy by the U.S. Department of Energy (DOE) Bioenergy Feedstock Development Program because of its broad adaptation, high yield, and minimal agricultural inputs. Genetically modified (GM) switchgrass has been made to improve biomass and biofuel production, but the approval for commercial uses of GM plants is subject to complicated and stringent government regulations due to economic, political, or social concerns over potential ecological effects of transgene flow. Completely abolishing both male and female (bisexual) fertility is the only fail-safe way to prevent transgene flow; however, approaches to generating bisexual sterility are limited. The gene structure of SDS in monocots is more conserved than that in dicots. In grass plants, two conserved regulatory motifs in the promoter region and two others in introns may be important for the specific expression of SDS (see FIGS. 17 and 18A-18D).
b. Ornamental Plants
[0086] Ornamental plants are plants that are grown for decorative purposes in gardens and landscapes, as houseplants, and for cut flowers. For ornamental trees, such as cherries and plums, fruit setting affects flower numbers and quality. Moreover, fruits often make the garden messy. The methods disclosed herein can be used to generate ornamental trees that produce attractive flowers but no fruits.
8. Constructs and Plasmids
[0087] The genetic constructs may comprise a nucleic acid sequence that encodes the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants, disclosed herein. The genetic construct, such as a plasmid, may comprise a nucleic acid that encodes the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants. The genetic construct may be present in the cell as a functioning extrachromosomal molecule. The genetic construct may be a linear minichromosome including centromere, telomeres or plasmids or cosmids.
[0088] The genetic construct may also be part of a genome of a recombinant viral vector, including recombinant cauliflower mosaic virus, recombinant tobacco mosaic virus, and recombinant potato virus X-based vectors. The genetic construct may be part of the genetic material in attenuated live microorganisms or recombinant microbial vectors which live in cells. The genetic constructs may comprise regulatory elements for gene expression of the coding sequences of the nucleic acid. The regulatory elements may be a promoter, an enhancer, an initiation codon, a stop codon, or a polyadenylation signal.
[0089] In certain embodiments, the polynucleotides to be introduced into the plant are operably linked to a promoter sequence and may be provided as a construct. As used herein, a polynucleotide is "operably linked" when it is placed into a functional relationship with a second polynucleotide sequence. For instance, a promoter is operably linked to a coding sequence if the promoter is connected to the coding sequence such that it may effect transcription of the coding sequence. In various embodiments, the polynucleotides may be operably linked to at least one, at least two, at least three, at least four, at least five, or at least ten promoters.
[0090] The nucleic acid sequences may make up a genetic construct that may be a vector. The vector may be capable of expressing the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants in the cell of a plant. The vector may be recombinant. The vector may comprise heterologous nucleic acid encoding the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants. The vector may be a plasmid. The vector may be useful for transfecting cells with nucleic acid encoding the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants, after which the transformed host cell is cultured and maintained under conditions wherein expression of the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants takes or can take place.
[0091] Coding sequences may be optimized for stability and high levels of expression. In some instances, codons are selected to reduce secondary structure formation of the RNA such as that formed due to intramolecular bonding.
[0092] The vector may comprise heterologous nucleic acid encoding the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants and may further comprise an initiation codon, which may be upstream of the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants coding sequence and a stop codon, which may be downstream of the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants coding sequence. The initiation and termination codons may be in frame with the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants coding sequence. The vector may also comprise a promoter that is operably linked to the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants coding sequence. The promoter that is operably linked to the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants coding sequence may not be natively associated with the polynucleotide encoding the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants. Promoters useful in the practice of the present invention include, but are not limited to, constitutive, inducible, temporally regulated, developmentally regulated, chemically regulated, tissue-preferred, and tissue-specific promoters. Suitably, the promoter causes sufficient expression in the plant to produce the phenotypes described herein. 
Suitable promoters include, without limitation, the 35S promoter of the cauliflower mosaic virus, ubiquitin, tCUP cryptic constitutive promoter, the Rsyn7 promoter, pathogen-inducible promoters, the maize In2-2 promoter, the tobacco PR-1a promoter, glucocorticoid-inducible promoters, and tetracycline-inducible and tetracycline-repressible promoters.
[0093] The vector may also comprise a polyadenylation signal, which may be downstream of the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants coding sequence. The vector may also comprise an enhancer upstream of the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants coding sequence. The enhancer may be necessary for DNA expression. The vector may also comprise a plant origin of replication in order to maintain the vector extrachromosomally and produce multiple copies of the vector in a cell. The vector may also comprise a regulatory sequence, which may be well suited for gene expression in a plant cell into which the vector is administered. The vector may also comprise a reporter gene, such as green fluorescent protein ("GFP") and/or a selectable marker, such as hygromycin ("Hygro").
[0094] The vector may be an expression vector or system for producing protein by routine techniques and readily available starting materials, including those described in Sambrook et al., 1989, which is incorporated fully by reference. In some embodiments, the vector may comprise the nucleic acid sequence encoding the compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants.
9. Plant Transformation
[0095] The compositions for generating male sterility and female sterility and/or compositions for restoring fertility in the male sterile and female sterile plants of the present invention may be introduced into a plant cell to produce a transgenic plant. As used herein, "introduced into a plant" with respect to polynucleotides encompasses the delivery of a polynucleotide into a plant, plant tissue, or plant cell using any suitable polynucleotide delivery method. Methods suitable for introducing polynucleotides into a plant useful in the practice of the present invention include, but are not limited to, the freeze-thaw method, microparticle bombardment, direct DNA uptake, whisker-mediated transformation, electroporation, sonication, microinjection, plant virus-mediated transfer, and Agrobacterium-mediated transfer to the plant. Any suitable Agrobacterium strain, vector, or vector system for transforming the plant may be employed according to the present invention. In certain embodiments, the polynucleotide is introduced using at least one of stable transformation methods, transient transformation methods, or virus-mediated methods.
[0096] By "stable transformation" is intended that the nucleotide construct introduced into a plant integrates into the genome of the plant and is capable of being inherited by progeny thereof. By "transient transformation" is intended that a nucleotide construct introduced into a plant does not integrate into the genome of the plant.
[0097] Transformation protocols as well as protocols for introducing nucleotide sequences into plants may vary depending on the type of plant or plant cell, i.e., monocot or dicot, targeted for transformation. Suitable methods of introducing nucleotide sequences into plant cells and subsequent insertion into the plant genome include microinjection (Crossway et al., Biotechniques 4:320-334 (1986)), electroporation (Riggs et al., Proc. Natl. Acad. Sci. USA 83:5602-5606 (1986)), Agrobacterium-mediated transformation (U.S. Pat. Nos. 5,981,840 and 5,563,055), direct gene transfer (Paszkowski et al., EMBO J. 3:2717-2722 (1984)), and ballistic particle acceleration (see, for example, U.S. Pat. Nos. 4,945,050; 5,879,918; 5,886,244; 5,932,782; Tomes et al., in Plant Cell, Tissue, and Organ Culture: Fundamental Methods, ed. Gamborg and Phillips (Springer-Verlag, Berlin) (1995); and McCabe et al., Biotechnology 6:923-926 (1988)). Also see Weissinger et al., Ann. Rev. Genet. 22:421-477 (1988); Sanford et al., Particulate Science and Technology 5:27-37 (1987) (onion); Christou et al., Plant Physiol. 87:671-674 (1988) (soybean); McCabe et al., Bio/Technology 6:923-926 (1988) (soybean); Finer and McMullen, In Vitro Cell Dev. Biol. 27P:175-182 (1991) (soybean); Singh et al., Theor. Appl. Genet. 96:319-324 (1998) (soybean); Datta et al., Biotechnology 8:736-740 (1990) (rice); Klein et al., Proc. Natl. Acad. Sci. USA 85:4305-4309 (1988) (maize); Klein et al., Biotechnology 6:559-563 (1988) (maize); U.S. Pat. Nos. 5,240,855; 5,322,783 and 5,324,646; Klein et al., Plant Physiol. 91:440-444 (1988) (maize); Fromm et al., Biotechnology 8:833-839 (1990) (maize); Hooykaas-Van Slogteren et al., Nature (London) 311:763-764 (1984); U.S. Pat. No. 5,736,369 (cereals); Bytebier et al., Proc. Natl. Acad. Sci. USA 84:5345-5349 (1987) (Liliaceae); De Wet et al., in The Experimental Manipulation of Ovule Tissues, ed. Chapman et al., (Longman, N.Y.), pp. 197-209 (1985) (pollen); Kaeppler et al., Plant Cell Reports 9:415-418 (1990) and Kaeppler et al., Theor. Appl. Genet. 84:560-566 (1992) (whisker-mediated transformation); D'Halluin et al., Plant Cell 4:1495-1505 (1992) (electroporation); Li et al., Plant Cell Reports 12:250-255 (1993) and Christou and Ford, Annals of Botany 75:407-413 (1995) (rice); Osjoda et al., Nature Biotechnology 14:745-750 (1996) (maize via Agrobacterium tumefaciens); all of which are herein incorporated by reference in their entireties.
[0098] In some embodiments, a plant may be regenerated or grown from the plant, plant tissue or plant cell. Any suitable methods for regenerating or growing a plant from a plant cell or plant tissue may be used, such as, without limitation, tissue culture or regeneration from protoplasts. Suitably, plants may be regenerated by growing transformed plant cells on callus induction media, shoot induction media and/or root induction media. See, for example, McCormick et al., Plant Cell Reports 5:81-84 (1986). These plants may then be grown, and either pollinated with the same transformed strain or different strains, and the resulting hybrid having expression of the desired phenotypic characteristic identified. Two or more generations may be grown to ensure that expression of the desired phenotypic characteristic is stably maintained and inherited and then seeds harvested to ensure expression of the desired phenotypic characteristic has been achieved. Thus as used herein, "transformed seeds" refers to seeds that contain the nucleotide construct stably integrated into the plant genome.
[0099] The present invention has multiple aspects, illustrated by the following non-limiting examples.
10. Examples
[00100] The foregoing may be better understood by reference to the following examples, which are presented for purposes of illustration and are not intended to limit the scope of the invention.
EXAMPLE 1
Methods and Materials
[00101] Plants and Growth Condition. Arabidopsis thaliana Landsberg erecta (Ler) and tobacco (Nicotiana tabacum Petit Havana SR1) were used. Plants were grown in Metro-Mix 360 soil (Sun-Gro Horticulture) in a growth chamber under a 16-hour light/8-hour dark photoperiod regime at 22°C and 50% humidity.
[00102] Generation of Constructs and Transgenic Plants. PCR reactions (see all primers in Table 1) were performed using Phusion High-Fidelity DNA Polymerase (New England Biolabs).
Table 1 - Primers
Primer ID | Primer name | Purpose | Enzyme digestion site | Sequence (5' to 3') | SEQ ID NO:
zp1283 | SDS promoter 5' | pENTR-SDS | KpnI | CACGGTACCCCATCATTCTCGTCTCTCTCGCAC | 52
zp1284 | SDS promoter 3' | pENTR-SDS | BsrGI | CAGTGTACATTTTTCTCCGTACGAAAGCTTGAAAC | 53
zp1823 | mGFP5er 5' | pEarleyGate303-mGFP5er | XhoI | CCGCTCGAGGCAGGCTTTATGAAGAC | 54
zp1824 | mGFP5er 3' | pEarleyGate303-mGFP5er | XbaI | GCTCTAGAGCGGCCGCCGATCTAGTAAC | 55
zp1768 | BARSTAR 5' | pCR2.1-BARSTAR | NsiI | CCAATGCATTGGCGTATAACATAG | 56
zp1769 | BARSTAR 3' | pCR2.1-BARSTAR | NsiI | CCAATGCATATGGCAGCGCTGGCA | 57
zp1770 | XhoI 5' | pEarleyGate303-BARSTAR(XhoI) | BglII | GAAGATCTGGATCCGGCTTAC | 58
zp1771 | XhoI 3' | pEarleyGate303-BARSTAR(XhoI) | XbaI, XhoI | GCTCTAGACTCGAGCTGTTCCACC | 59
zp1772 | BARNASE 5' | pEarleyGate303-BARSTAR-BARNASE | XhoI | CCGCTCGAGTACGCTGTGAGGATCTGTG | 60
zp1773 | BARNASE 3' | BARSTAR-BARNASE | XbaI | GCTCTAGAAGGATATCCTGATCCGTTGAC | 61
zp2163 | SWI1 5' | Real-time PCR | | GGAGGAAGACATGGGATGGC | 62
zp2164 | SWI1 3' | Real-time PCR | | CCCTTGTTCACCACCTTCACTTC | 63
zp2165 | DMC1 5' | Real-time PCR | | GGAGAACTCGCAGACCGCC | 64
zp2166 | DMC1 3' | Real-time PCR | | CCACCTGGGTCAGCTATGAC | 65
zp1196 | A9 5' | Real-time PCR | | ATGGTATCTCTAAAGTCCCTTG | 66
zp1197 | A9 3' | Real-time PCR | | CCAAATCCTCGGAACTGAATG | 67
zp851 | ATA7 5' | Real-time PCR | | CGTCTCCAGGATCGAGGAAT | 68
zp852 | ATA7 3' | Real-time PCR | | GGAGATGGGAAAGCTGAGAG | 69
zp853 | ACTIN2 5' | Real-time PCR | | GTTGGGATGAACCAGAAGGA | 70
zp854 | ACTIN2 3' | Real-time PCR | | GAGGAGCCTCGGTAAGAAGA | 71
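The cloning primers in Table 1 are each listed with an enzyme digestion site, so a quick consistency check is to confirm that every such primer actually contains the recognition sequence of its listed enzyme. The sketch below is illustrative only: the recognition sequences are the standard ones for these enzymes, the primer sequences are transcribed from Table 1 by joining wrapped fragments, and the function name `missing_sites` is hypothetical.

```python
# Standard recognition sequences for the enzymes named in Table 1.
SITES = {
    "BsrGI": "TGTACA",
    "XhoI": "CTCGAG",
    "XbaI": "TCTAGA",
    "NsiI": "ATGCAT",
    "BglII": "AGATCT",
}

# (primer ID, listed enzymes, sequence 5'->3'), transcribed from Table 1.
PRIMERS = [
    ("zp1284", ["BsrGI"], "CAGTGTACATTTTTCTCCGTACGAAAGCTTGAAAC"),
    ("zp1823", ["XhoI"], "CCGCTCGAGGCAGGCTTTATGAAGAC"),
    ("zp1824", ["XbaI"], "GCTCTAGAGCGGCCGCCGATCTAGTAAC"),
    ("zp1768", ["NsiI"], "CCAATGCATTGGCGTATAACATAG"),
    ("zp1769", ["NsiI"], "CCAATGCATATGGCAGCGCTGGCA"),
    ("zp1770", ["BglII"], "GAAGATCTGGATCCGGCTTAC"),
    ("zp1771", ["XbaI", "XhoI"], "GCTCTAGACTCGAGCTGTTCCACC"),
    ("zp1772", ["XhoI"], "CCGCTCGAGTACGCTGTGAGGATCTGTG"),
    ("zp1773", ["XbaI"], "GCTCTAGAAGGATATCCTGATCCGTTGAC"),
]


def missing_sites(primers=PRIMERS, sites=SITES):
    """Return {primer_id: [enzymes whose site is absent from the primer]}."""
    problems = {}
    for primer_id, enzymes, seq in primers:
        absent = [e for e in enzymes if sites[e] not in seq.upper()]
        if absent:
            problems[primer_id] = absent
    return problems


if __name__ == "__main__":
    # An empty dict means every listed site was found in its primer.
    print(missing_sites() or "every listed site is present")
```

A check of this kind only confirms that the site is present in the primer; it says nothing about reading frame or about additional sites elsewhere in the amplicon.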
[00103] The 1.5 kb promoter of the SDS gene (upstream of the SDS coding region and the 3' non-coding region of the SDS adjacent gene) was amplified and cloned into the pENTR/D-TOPO vector (Invitrogen) to generate pENTR-SDS. The SDS genomic fragment from the beginning of the 1.5 kb promoter region to the last exon was introduced into the pENTR/D-TOPO vector to generate pENTR-SDS::SDS. The mGFP5er was amplified from the pBIN Gal4-mGFP5er vector and cloned into the pEarleyGate303 binary vector (Earley et al., 2006, Plant J 45: 616-629) using the BamHI and SacI sites to generate pEarleyGate303-mGFP5er. The BARSTAR gene was amplified from the pABGCZ vector that contains BARSTAR and BARNASE(H102E) genes (Zhang et al., 2012, Plant Physiol 159: 1319-1334), then it was cloned into the pCR2.1 vector (Invitrogen) to generate pCR2.1-BARSTAR. BARSTAR was introduced from pCR2.1-BARSTAR into the pEarleyGate303 vector at the NsiI site to generate pEarleyGate303-BARSTAR. An XhoI site was introduced between the BglII and XbaI sites right after attR2 to generate pEarleyGate303-BARSTAR(XhoI). The BARNASE fragment that was amplified from pABGCZ was cloned into pEarleyGate303-BARSTAR(XhoI) using the XhoI and XbaI sites to generate pEarleyGate303-BARSTAR-BARNASE. The gene for generating artificial microRNAs targeting BARNASE was designed as described previously (Schwab et al., 2006, Plant Cell 18: 1121-1133; Ossowski et al., 2008, Plant J 53: 674-690). The amiR-BARNASE fragment was amplified and cloned into the pRS300 vector, which contains the miR319a precursor sequence in pBSK (Schwab et al., 2006, Plant Cell 18: 1121-1133). Then, the amiR-BARNASE fragment was introduced into the estradiol (ER) inducible vector (Zuo et al., 2000, Plant J 24: 265-273) at the XhoI and SpeI sites to generate ER::amiR-BARNASE. Using the Gateway LR recombinase II enzyme mix (Invitrogen), the SDS::GUS, SDS::GFP, SDS::BARNASE, SDS::SDS-GUS, SDS::SDS-GFP, and SDS::SDS-BARNASE binary vectors were generated by recombining pENTR-SDS or pENTR-SDS::SDS with pGWB3, pEarleyGate303-mGFP5er, or pEarleyGate303-BARSTAR-BARNASE. Then these vectors and ER::amiR-BARNASE were transformed into the Agrobacterium strain GV3101.
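The six binary vectors named above follow from pairing each of the two entry clones with each of the three destination vectors. The sketch below enumerates those pairings to make the naming explicit. It is illustrative only: the mapping of each destination vector to the GUS, GFP, or BARNASE label is inferred from the construct names in the text, the GUS destination vector is written here as pGWB3, and the helper `construct_name` is hypothetical.

```python
from itertools import product

# Entry clones and the prefix each contributes to the construct name.
ENTRY_CLONES = {
    "pENTR-SDS": "SDS",            # 1.5 kb promoter only
    "pENTR-SDS::SDS": "SDS::SDS",  # promoter plus genomic fragment to the last exon
}

# Destination vectors and the reporter/effector label each contributes
# (labels inferred from the construct names in the text).
DESTINATIONS = {
    "pGWB3": "GUS",
    "pEarleyGate303-mGFP5er": "GFP",
    "pEarleyGate303-BARSTAR-BARNASE": "BARNASE",
}


def construct_name(entry, destination):
    """Build the construct name used in the text for one entry/destination pair."""
    prefix = ENTRY_CLONES[entry]
    label = DESTINATIONS[destination]
    # Promoter-only clones are written "SDS::X"; genomic clones "SDS::SDS-X".
    separator = "::" if prefix == "SDS" else "-"
    return f"{prefix}{separator}{label}"


def all_constructs():
    """List every entry x destination combination in a stable order."""
    return [construct_name(e, d) for e, d in product(ENTRY_CLONES, DESTINATIONS)]


if __name__ == "__main__":
    for name in all_constructs():
        print(name)
```

Enumerating the pairs this way reproduces the six names listed in the paragraph and makes it easy to see which entry clone and destination vector each construct came from.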
[00104] The floral dip method was used to generate transgenic Arabidopsis (Clough and Bent, 1998, Plant J 16:735-743). Transformants of SDS::GUS and SDS::SDS-GUS were screened on 50 µg/mL of kanamycin and 25 µg/mL of hygromycin. Transformants of SDS::GFP, SDS::SDS-GFP, SDS::BARNASE, and SDS::SDS-BARNASE were screened on 1% Basta (PlantMedia). Transformants of ER::amiR-BARNASE were screened on 25 µg/mL of hygromycin. Tobacco transformation was performed as follows. Briefly, leaf discs were inoculated with the Agrobacterium strain GV3101 containing the SDS::SDS-BARNASE binary vector and cultured for 1 day in the dark, followed by 2 days under light. Then, leaf discs were screened on shoot and root selection medium containing 4% Basta. The regenerated plants were transferred into soil and sprayed with 4% Basta solution one week later. The surviving plants were used for further analyses.
[00105] Pollen Staining and Anther Semi-thin Sections. To assess pollen viability, Alexander pollen staining was carried out as described previously (Zhao et al., 2002, Genes Dev 16: 2021-2031). Mature anthers of tobacco were collected and analyzed using the same method. Pollen grains were released from anthers before imaging. Semi-thin sectioning was performed as described in our previous studies (Zhao et al., 2002, Genes Dev 16: 2021-2031; Jia et al., 2008, PNAS 105:2220-2225).
[00106] Estradiol Induction of ER::amiR-BARNASE. Induction solution [2 µmol/L estradiol (Sigma) and 0.02% Silwet L-77] or mock solution (without estradiol) was dropped or sprayed onto main inflorescences in the morning. Seven days of induction resulted in fertility restoration under our growth chamber conditions.
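As a worked example of preparing the induction solution, the sketch below applies the usual dilution relation C1·V1 = C2·V2 to find how much estradiol stock and Silwet L-77 go into a given final volume. It is illustrative only: the 10 mmol/L stock concentration is a hypothetical value not stated in the text, the 0.02% Silwet L-77 is assumed to be volume per volume, and the function name `induction_mix` is made up for this example.

```python
def induction_mix(final_ml, stock_mmol_per_l=10.0, target_umol_per_l=2.0,
                  silwet_percent=0.02, mock=False):
    """Volumes (in microliters) to add to reach the working concentrations.

    stock_mmol_per_l is a hypothetical stock concentration (assumption);
    silwet_percent is treated as v/v (assumption). mock=True omits estradiol.
    """
    final_ul = final_ml * 1000.0
    target_mmol_per_l = target_umol_per_l / 1000.0
    # C1 * V1 = C2 * V2  ->  V1 = C2 * V2 / C1
    stock_ul = 0.0 if mock else final_ul * target_mmol_per_l / stock_mmol_per_l
    silwet_ul = final_ul * silwet_percent / 100.0
    return {"estradiol_stock_ul": stock_ul, "silwet_ul": silwet_ul}


if __name__ == "__main__":
    print("induction:", induction_mix(50))
    print("mock:     ", induction_mix(50, mock=True))
```

Under these assumptions, 50 mL of induction solution needs about 10 µL of the 10 mmol/L stock and about 10 µL of Silwet L-77; the mock solution differs only in leaving out the estradiol.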
[00107] GUS Staining Assay. Histochemical GUS staining assay was performed. Tissues were collected and fixed for 1 h in 90% acetone at -20°C. After washing tissues in washing buffer [0.1 M phosphate (pH 7.0), 10 mM EDTA, and 2 mM K3Fe(CN)6] twice for 5 min under vacuum, the drained tissues were transferred into the GUS staining buffer [0.1 M phosphate (pH 7.0), 10 mM EDTA, 1 mM K3Fe(CN)6, 1 mM K4Fe(CN)6·3H2O, and 1 mg/ml X-Gluc] and incubated overnight at 37°C. GUS-stained tissues were then fixed in a 3:1 mixture of ethanol and acetic acid. Tissues were mounted onto glass slides for observation.
[00108] Real-time qRT-PCR. Inflorescences of wild-type, SDS::SDS-BARNASE, and ER::amiR-BARNASE/SDS::SDS-BARNASE plants were collected for RNA isolation using the RNeasy Plant Mini Kit (Qiagen). RNA was quantified with a NanoDrop 2000c (Thermo Scientific). RNA reverse transcription was performed using the QuantiTect Reverse Transcription Kit (Qiagen). Real-time PCR (DNA Engine Opticon 2 system) and data analysis were performed as previously described (Liu et al., 2010, Plant J. 62, 416-428) to evaluate expression of BARNASE, DMC1, SWI1, A9, and ATA7 (Table 1). The ACTIN2 gene was used as an internal control. Three independent biological repeats were carried out.
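The text refers to an earlier publication for the data analysis and does not state the formula used. One widely used way to express a target gene relative to an internal control such as ACTIN2 is the 2^-ΔΔCt calculation, sketched below as an assumption rather than as the authors' confirmed method. The function names are hypothetical, the Ct values in the usage example are invented for illustration, and the calculation assumes roughly 100% amplification efficiency.

```python
from statistics import mean


def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_control, ct_ref_control):
    """Fold change of a target gene in a sample versus a control plant line.

    Each Ct of the target gene is first normalized to the reference gene
    (delta Ct); the sample is then compared with the control (delta-delta Ct).
    """
    delta_sample = ct_target_sample - ct_ref_sample
    delta_control = ct_target_control - ct_ref_control
    return 2.0 ** -(delta_sample - delta_control)


def mean_fold_change(replicates):
    """Average the fold change over biological replicates.

    replicates: iterable of (ct_target_sample, ct_ref_sample,
                             ct_target_control, ct_ref_control) tuples.
    """
    return mean(relative_expression(*r) for r in replicates)


if __name__ == "__main__":
    # Invented Ct values for three biological repeats (illustration only).
    example = [(26.0, 20.0, 25.0, 20.0),
               (26.1, 20.1, 25.1, 20.1),
               (25.9, 19.9, 24.9, 19.9)]
    print(mean_fold_change(example))
```

A target Ct that is one cycle later in the sample than in the control, with equal reference Ct values, corresponds to half the control expression level under this calculation.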
[00109] Microscopy. Pollen staining and GUS staining samples were observed with an Olympus SZX7 microscope. Semi-thin sections were observed with an Olympus BX51 microscope. Images were obtained with an Olympus DP 70 digital camera. For confocal microscopy analysis, anthers and ovules were dissected and mounted in water. GFP signal was observed with a Leica TCS SP2 laser scanning confocal microscope using a 63x/1.4 water immersion objective lens. The 488-nm laser line was used to excite GFP, and it also induced chlorophyll autofluorescence. The PMT gain setting was held at 650. GFP and chlorophyll autofluorescence were detected at 505-530 nm and 644-719 nm, respectively.
EXAMPLE 2
BARNASE Driven by the SDS Promoter Caused Defects in Growth and Reproduction
[00110] In Arabidopsis, the SDS gene, which encodes a meiosis-specific cyclin, is exclusively expressed in microspore mother cells (male meiocytes) in anthers and megaspore mother cells (female meiocytes) in ovules. To create completely male and female sterile plants without altering flower structure, the SDS::BARNASE construct was generated using the 1.5-kb promoter of the SDS gene and a modified BARNASE (Zhang et al., 2012) to genetically ablate microspore and megaspore mother cells in Arabidopsis (FIG. 1A). Among 66 examined SDS::BARNASE transgenic plants, none showed the specific sterility phenotype. Instead, compared with the wild-type (FIG. 2A), SDS::BARNASE young plants were defective in vegetative growth, as indicated by the abnormal shape and number of rosette leaves (FIGS. 2B and 2C). Unlike the WT adult plant (FIG. 2D), SDS::BARNASE adult plants also exhibited various abnormal phenotypes, such as dwarf and fertile (FIG. 2E), dwarf and sterile (FIG. 2F), and even no inflorescence (FIG. 2G). The height of mature SDS::BARNASE plants was significantly reduced (FIG. 2H). Moreover, SDS::BARNASE plants produced significantly fewer rosette leaves than wild-type plants (FIG. 2I). The various defects of SDS::BARNASE plants in growth and development suggest that the 1.5-kb promoter of the SDS gene failed to drive the specific expression of BARNASE in microspore and megaspore mother cells.
EXAMPLE 3
1.5 kb Upstream Region of the SDS Gene did not Confer its Meiocyte-Specific Expression
[00111] Genetic ablation relies on the specificity of the promoters employed. To examine why BARNASE under the control of the 1.5-kb SDS promoter did not achieve specific ablation of microspore and megaspore mother cells, SDS::GUS plants were generated to test the transcriptional activity of the 1.5-kb promoter (FIG. 1B). Among 25 examined SDS::GUS transgenic plants, GUS signals were detected in cotyledons, true leaves, and the shoot apical meristem of young seedlings (FIG. 3A), as well as in carpels and stigmas of young buds (FIGS. 3B-3D). Thus, the results suggest that the 1.5-kb promoter of the SDS gene was not sufficient for conferring its meiocyte-specific expression, which resulted in abnormal plant growth and development when it drove the expression of BARNASE.
EXAMPLE 4
SDS::SDS-BARNASE Causes Complete Male and Female Sterility But Does Not Affect Plant Growth and Development
[00112] Regulatory elements that may exist in the SDS introns could contribute to the meiocyte-specific expression of SDS. To achieve the specific expression of SDS in microspore and megaspore mother cells, SDS::SDS-GFP constructs were generated by fusing the SDS genomic fragment, containing the 1.5-kb promoter, seven exons, and six introns, with the GFP gene (FIG. 1C). In the 18 examined SDS::SDS-GFP transgenic plants, the GFP signal was not detected during the seedling stage or later in the vegetative growth stage. During the reproductive stage, however, we observed GFP signals only in microspore mother cells in anthers (FIG. 3E) and megaspore mother cells in ovules (FIG. 3F). Therefore, our results indicate that the entire SDS gene led to the meiocyte-specific expression of the SDS protein.
[00113] To generate complete male and female sterility by specifically ablating microspore and megaspore mother cells, the SDS::SDS-BARNASE construct was made by fusing the entire SDS gene with the BARNASE gene (FIG. 1D). We performed three transformations, resulting in 97, 80, and 126 SDS::SDS-BARNASE transgenic plants, respectively. All independent transgenic plants were sterile. We first evaluated the effects of SDS::SDS-BARNASE on growth and development. SDS::SDS-BARNASE transgenic plants produced rosette leaves with the same number, size, and shape as those of WT plants (FIGS. 4A, 4B). No morphological changes were observed in SDS::SDS-BARNASE inflorescences and flowers (FIGS. 4C, 4D). Moreover, mature SDS::SDS-BARNASE plants had a similar height to the wild type (FIGS. 4E-4G). The flowering time of SDS::SDS-BARNASE plants was not affected either, because they had produced the same number of rosette leaves as the wild type at flowering (FIG. 4H). To further investigate the sterility of SDS::SDS-BARNASE transgenic plants, we analyzed both male and female fertility. Compared with the wild type (FIGS. 5A, 5H), SDS::SDS-BARNASE plants produced short siliques (FIGS. 5B, 5I). Except for short filaments, SDS::SDS-BARNASE plants formed flowers that were the same as those of the wild type, as indicated by four sepals, four petals, six stamens, and two carpels (FIGS. 5D, 5E). In the WT flower, pollen grains were released from anthers that reached the stigma (FIG. 5D), whereas in the SDS::SDS-BARNASE flower, no pollen grains were observed on the anther surface and anthers did not reach the stigma (FIG. 5E). Furthermore, unlike the WT anther (FIG. 5F), the SDS::SDS-BARNASE anther did not produce pollen grains (FIG. 5G), indicating that SDS::SDS-BARNASE plants were male sterile. Because pollination with WT pollen did not rescue fertility (FIGS. 5C, 5J), SDS::SDS-BARNASE plants were also female sterile.
Thus, using SDS::SDS-BARNASE, we efficiently created completely male and female sterile Arabidopsis plants that had normal vegetative and reproductive growth and development, including the formation of all flower organs.
EXAMPLE 5
SDS::SDS-BARNASE Inhibited Both Male and Female Gamete Formation
[00114] To further understand the ablation effects on microspore and megaspore mother cells, we performed semi-thin sectioning of anthers and whole-mount squashes of ovules. At stage 5, compared with the WT anther (FIG. 6A), the SDS::SDS-BARNASE anther showed vacuolated microsporocytes (microspore mother cells) and tapetal cells (FIG. 6D), indicating the degeneration of both cell types. At stage 7 in the WT anther, successful male meiosis resulted in the formation of tetrads (FIG. 6B), whereas in the SDS::SDS-BARNASE anther, tetrads and tapetal cells had collapsed (FIG. 6E). At stage 9, the WT anther contained developing pollen grains (FIG. 6C), but the SDS::SDS-BARNASE anther lacked developing microspores (FIG. 6F). In embryo sacs of WT ovules, two nuclei at stage FG3 (FIG. 7A) and four nuclei at stage FG4 (FIG. 7B) were observed; however, in SDS::SDS-BARNASE embryo sacs, only a single nucleus was produced (FIGS. 7D, 7E). At stage FG6, the WT embryo sac showed the central cell, the egg cell, and synergid cells (FIG. 7C), but the SDS::SDS-BARNASE embryo sac was empty (FIG. 7F). Furthermore, our results showed that expression of the tapetal cell marker genes A9 and ATA7, as well as the microspore and megaspore mother cell marker genes DMC1 and SWI1, was significantly decreased in SDS::SDS-BARNASE buds in comparison to the wild type (FIG. 8). In summary, the specific expression of the toxic SDS-BARNASE fusion protein in microspore and megaspore mother cells efficiently impaired the production of both male and female gametes, which led to complete male and female sterility but did not affect flower organ formation or plant growth and development.
EXAMPLE 6
Combination of an Inducible System and Artificial MicroRNA Technology Restores
Fertility to SDS::SDS-BARNASE Plants
[00115] To restore fertility to SDS::SDS-BARNASE plants, we generated the ER::amiR-BARNASE construct to produce an artificial microRNA (Schwab et al., 2006, Plant Cell 18: 1121-1133) targeting the BARNASE gene under control of the estradiol-inducible system (Zuo et al., 2000, Plant J 24: 265-273) (FIG. 11C). ER::amiR-BARNASE plants exhibited no differences from the wild type, with or without estradiol treatment. Without estradiol treatment, SDS::SDS-BARNASE/ER::amiR-BARNASE double transgenic plants showed the same sterile phenotype as SDS::SDS-BARNASE plants. After treatment with estradiol, the fertility of 40% (12/30) of examined SDS::SDS-BARNASE/ER::amiR-BARNASE plants was partially rescued, as indicated by the formation of pollen grains in anthers (FIGS. 12C and 13F) and the elongation of siliques (FIG. 12J; FIG. 13D). Real-time qRT-PCR showed that the accumulation of BARNASE transcripts decreased after estradiol treatment (FIG. 12K). Offspring from recovered seeds were completely sterile without estradiol treatment (FIGS. 12L and 12M). Our results showed that the male and female sterility of SDS::SDS-BARNASE plants can be reversed by the inducible artificial microRNA approach. See also FIGS. 16A-16O.
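The targeting principle behind amiR-BARNASE can be illustrated with a minimal Python sketch. It assumes a guide that is perfectly complementary to a 21-nt site, which is a simplification, since designed amiRNAs may carry deliberate mismatches. The guide below is derived from the first 21 nt of the BARNASE coding sequence purely for illustration; it is not the amiRNA of SEQ ID NO: 28, and all function names are hypothetical.

```python
# Hypothetical helper names; a perfectly complementary guide is assumed for simplicity.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return dna.upper().translate(COMPLEMENT)[::-1]

def guide_for_site(target_dna: str, start: int, length: int = 21) -> str:
    """Return an RNA guide fully complementary to target_dna[start:start+length]."""
    site = target_dna[start:start + length]
    if len(site) != length:
        raise ValueError("site runs past the end of the target")
    return reverse_complement(site).replace("T", "U")

def find_target_sites(target_dna: str, guide_rna: str) -> list[int]:
    """Return 0-based start positions of all sites fully complementary to the guide."""
    site = reverse_complement(guide_rna.upper().replace("U", "T"))
    target = target_dna.upper()
    return [i for i in range(len(target) - len(site) + 1)
            if target[i:i + len(site)] == site]

# First 57 nt of the BARNASE coding sequence (SEQ ID NO: 27).
barnase_5prime = "ATGGCACAGGTTATCAACACGTTTGACGGGGTTGCGGATTATCTTCAGACATATCAT"
guide = guide_for_site(barnase_5prime, start=0)  # illustrative guide, not SEQ ID NO: 28
print(guide, find_target_sites(barnase_5prime, guide))
```

Reducing the match to exact complementarity keeps the sketch short; a realistic design tool would also score mismatches and off-target sites.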
EXAMPLE 7
SDS::SDS-BARNASE Causes Male and Female Sterility in Tobacco
[00116] To test whether SDS::SDS-BARNASE can provide a general tool to create both male and female sterile plants, we transformed it into tobacco and generated SDS::SDS-BARNASE tobacco transgenic plants by tissue culture. Among 14 examined SDS::SDS-BARNASE tobacco transgenic lines, leaf shape and size (FIGS. 9A-9C), as well as plant height (FIGS. 9B-9D), were the same as those of WT plants. In addition, the SDS::SDS-BARNASE tobacco flower had the same size, color, and structure as that of the wild type (FIGS. 9E, 9F). Therefore, SDS::SDS-BARNASE did not affect growth or development in tobacco plants.
[00117] Ten examined SDS::SDS-BARNASE tobacco transgenic lines were completely sterile. WT tobacco plants produced large fruits, each containing on average 0.11 g of seeds (FIGS. 10A, 10D). Conversely, SDS::SDS-BARNASE plants produced small fruits, and no seeds were found when they were self-pollinated (FIGS. 10B, 10D, e.g., plants #1, 3, 5, and 7). Further pollen viability analysis showed that WT tobacco anthers produced viable pollen, indicated by red color (FIG. 10E), whereas anthers from sterile tobacco plants either lacked pollen grains (FIG. 10F) or formed dead pollen grains (FIG. 10G). The four lines that were not completely sterile produced a few seeds (FIG. 10D, e.g., plants #2 and 14), and only some functional pollen grains were found in anthers of those lines (FIG. 10H, e.g., plant #2). SDS::SDS-BARNASE may impair male fertility in tobacco.
[00118] Female fertility in sterile tobacco transgenic plants was then examined. The fertility of manually male-sterilized WT flowers could be rescued by cross-pollination with WT pollen (FIG. 10D), but following cross-pollination with WT pollen, the fruit size of SDS::SDS-BARNASE sterile tobacco plants did not change (FIG. 10C) and no seeds were produced (FIG. 10D, e.g., plants #1, 3, and 5). Thus, SDS::SDS-BARNASE tobacco transgenic plants were also female sterile. Manual pollination partially rescued the fertility of line #7, indicating that line #7 is a completely male sterile but partially female sterile plant, while lines #2 and 14 were nearly male and female sterile plants (FIG. 10D). Collectively, a majority of SDS::SDS-BARNASE tobacco transgenic plants were completely male and female sterile, suggesting that the function of SDS::SDS-BARNASE is conserved and that the construct can be used to create both male and female sterility in general.
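The transformant counts reported in Examples 4 and 7 can be tallied with a short Python sketch. The counts are taken from the text above; the dictionary layout and function name are illustrative assumptions, and the Arabidopsis figure pools the three transformations.

```python
# Counts from Examples 4 and 7; the data layout is an illustrative assumption.
counts = {
    "Arabidopsis": {"examined": 97 + 80 + 126, "completely_sterile": 97 + 80 + 126},
    "tobacco": {"examined": 14, "completely_sterile": 10},
}

def sterile_fraction(entry: dict) -> float:
    """Fraction of examined transgenic plants or lines that were completely sterile."""
    return entry["completely_sterile"] / entry["examined"]

for species, entry in counts.items():
    print(f"{species}: {entry['completely_sterile']}/{entry['examined']} "
          f"({sterile_fraction(entry):.1%})")
```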
EXAMPLE 8
Completely Sterile Brachypodium
[00119] A Brachypodium regenerating system is established and a BdSDS::BdSDS-BARNASE construct is generated. The SDS::SDS-BARNASE construct is modified to generate the BdSDS::BdSDS-BARNASE construct. A 2-kb upstream sequence and the following genomic sequence of BdSDS, containing 7 exons and 6 introns, are used to replace the Arabidopsis SDS::SDS fragment. To achieve a high B. distachyon transformation efficiency, the ablation construct described above was modified using the HPT selectable gene (conferring resistance to hygromycin) under control of the maize ubiquitin promoter (FIG. 18B). Moreover, the 35S::BAR fragment used for selection of transgenic plants in Arabidopsis is replaced by UBI::HPT, which is suitable for selection of transgenic Brachypodium. The Arabidopsis SDS::SDS genomic fragment is replaced with the BdSDS::BdSDS genomic fragment, which contains a 2-kb promoter sequence followed by a genomic fragment with 7 exons and 6 introns (FIGS. 18A and 18B). The resulting construct (named BdSDS::BdSDS-BARNASE) will be used to transform B. distachyon Bd21-3 via tissue culture. Agrobacteria harboring the BdSDS::BdSDS-BARNASE construct are transfected into Brachypodium callus, and BdSDS::BdSDS-BARNASE plants are regenerated.
[00120] The following results are expected: (1) production of bisexually sterile BdSDS::BdSDS-BARNASE Brachypodium plants with normal growth and normal flower organs; (2) male sterile Brachypodium among transgenic plants derived from one of the mutated constructs; (3) restoration of fertility in the sterile BdSDS::BdSDS-BARNASE Brachypodium plants by either spraying or watering with ethanol.
EXAMPLE 9
Male Sterile only Brachypodium Plants
[00121] The regulatory motif responsible for SDS expression in male meiocytes is identified. A system that ablates only male reproductive cells, to obtain Brachypodium plants that are male sterile only, is developed. Four novel putative regulatory motifs (M1, M2, M3, and M4) in the BdSDS promoter and introns were identified. BdSDS::BdSDS-BARNASEΔM1, BdSDS::BdSDS-BARNASEΔM2, BdSDS::BdSDS-BARNASEΔM3, and BdSDS::BdSDS-BARNASEΔM4 constructs are generated by deleting M1, M2, M3, and M4, respectively. Transgenic plants are then generated to test male fertility.
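The deletion strategy can be sketched in Python as follows. The fragment and motif sequences below are placeholders invented for illustration, since the M1 to M4 sequences are not given in this section, and the function name is hypothetical.

```python
def delete_motif(sequence: str, motif: str) -> str:
    """Return sequence with the single occurrence of motif removed.

    Raises ValueError if the motif is absent or occurs more than once,
    so that an ambiguous deletion is never made silently.
    """
    seq, mot = sequence.upper(), motif.upper()
    hits = [i for i in range(len(seq) - len(mot) + 1) if seq.startswith(mot, i)]
    if len(hits) != 1:
        raise ValueError(f"expected exactly one occurrence, found {len(hits)}")
    start = hits[0]
    return seq[:start] + seq[start + len(mot):]

# Placeholder fragment and motifs (NOT the real BdSDS sequence or M1-M4).
fragment = "TTGACCGATCGAACCATGGCAGGTACGTTCAGAATTC"
motifs = {"M1": "CCGATCG", "M2": "GGTACG"}

# One deletion variant per motif, mirroring the one-construct-per-motif design.
variants = {f"delta_{name}": delete_motif(fragment, m) for name, m in motifs.items()}
for name, seq in variants.items():
    print(name, len(fragment) - len(seq), seq)
```

Rejecting absent or repeated motifs is a design choice made here so that each variant differs from the parent fragment by exactly one defined deletion.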
EXAMPLE 10
Restoring Fertility of Sterile Brachypodium
[00122] A maize ubiquitin promoter-controlled ethanol-inducible system and amiR-BARNASE are used to rescue the fertility of target plants by inserting the inducible unit into the construct containing the fertility ablation unit. The ethanol-inducible system has been used successfully in both dicots and monocots. Given its price, availability, and non-toxicity in moderate amounts, ethanol is suitable for field application. The optimal ethanol concentration will be tested by spraying on flowers or by watering.
[00123] It is understood that the foregoing detailed description and accompanying examples are merely illustrative and are not to be taken as limitations upon the scope of the invention, which is defined solely by the appended claims and their equivalents.
[00124] Various changes and modifications to the disclosed embodiments will be apparent to those skilled in the art. Such changes and modifications, including without limitation those relating to the chemical structures, substituents, derivatives, intermediates, syntheses, compositions, formulations, or methods of use of the invention, may be made without departing from the spirit and scope thereof.
[00125] For reasons of completeness, various aspects of the invention are set out in the following numbered clauses:
[00126] Clause 1. An isolated polynucleotide construct comprising a first polynucleotide and a second polynucleotide, the first polynucleotide comprising a SOLO-DANCERS (SDS) gene or fragment thereof, the second polynucleotide comprising a Barnase gene or fragment thereof, wherein the SDS gene comprises the SDS promoter.
[00127] Clause 2. The isolated polynucleotide construct of clause 1, wherein the isolated polynucleotide construct is operably linked to the SDS promoter.
[00128] Clause 3. The isolated polynucleotide construct of clause 1 or 2, wherein the SDS gene comprises at least one regulatory intron.
[00129] Clause 4. The isolated polynucleotide construct of clause 3, wherein the at least one regulatory intron comprises a sequence of any one of SEQ ID NO: 22-26 or 47-51.
[00129] Clause 5. The isolated polynucleotide construct of any one of clauses 1-4, wherein the SDS gene comprises a polynucleotide sequence of any one of SEQ ID NO: 1-21 or 29-46.
[00131] Clause 6. The isolated polynucleotide construct of any one of clauses 1-5, wherein the Barnase gene comprises a polynucleotide sequence of any one of SEQ ID NO:27.
[00132] Clause 7. A vector comprising the isolated polynucleotide construct of any one of clauses 1-6.
[00133] Clause 8. A plant cell comprising the vector of clause 7.
[00134] Clause 9. A plant comprising the plant cell of clause 8.
[00135] Clause 10. The plant of clause 9, wherein the plant is completely male sterile and female sterile.
[00136] Clause 11. The plant of clause 10, wherein the plant is a gymnosperm or angiosperm.
[00137] Clause 12. The plant of clause 11, wherein the plant is a grass, tree, or ornamental plant.
[00138] Clause 13. The plant of clause 11, wherein the plant is Arabidopsis, tobacco, alfalfa, soybean, maize, rice, Brachypodium, switchgrass, Miscanthus, poplars, cherry, or Eucalyptus.
[00139] Clause 14. A composition for generating a complete male sterile and female sterile transgenic plant, the composition comprising the isolated polynucleotide construct of clause 1.
[00140] Clause 15. The composition of clause 14, further comprising a second isolated polynucleotide construct, wherein the second isolated polynucleotide construct comprises an inducible promoter operably linked to an artificial microRNA (amiRNA) targeted to the Barnase gene or fragment thereof, wherein the fertility of the plant is restored by inducing the expression of the amiRNA.
[00141] Clause 16. The composition of clause 15, wherein the amiRNA comprises a polynucleotide sequence of SEQ ID NO: 28.
[00142] Clause 17. The composition of clause 15 or 16, wherein the inducible promoter is an estradiol inducible promoter, an ethanol inducible promoter, a dexamethasone inducible promoter, a methoxyfenozide inducible promoter, or a temperature inducible promoter.
[00143] Clause 18. The composition of clause 17, wherein the temperature inducible promoter is a heat shock inducible promoter or a heat inducible promoter.
[00144] Clause 19. The composition of any one of clauses 14-17, wherein the isolated polynucleotide construction of clause 1 and the second isolated polynucleotide are encoded on the same vector.
[00145] Clause 20. The composition of any one of clauses 14-17, wherein the isolated polynucleotide construction of clause 1 and the second isolated polynucleotide are encoded on separate vectors.
[00146] Clause 21. A vector comprising the composition of any one of clauses 14-18.
[00147] Clause 22. A plant cell comprising the vector of clause 21 or the composition of clause 19 or 20.
[00148] Clause 23. A plant comprising the plant cell of clause 22.
[00149] Clause 24. The plant of clause 23, wherein the plant becomes male fertile and female fertile after the induction of amiRNA.
[00150] Clause 25. The plant of clause 24, wherein the plant is a gymnosperm or angiosperm.
[00151] Clause 26. The plant of clause 25, wherein the plant is a grass, tree, or ornamental plant.
[00152] Clause 27. The plant of clause 25, wherein the plant is Arabidopsis, tobacco, alfalfa, soybean, maize, rice, Brachypodium, switchgrass, Miscanthus, poplars, cherry, or Eucalyptus.
[00153] Clause 28. A method for generating a complete male sterile and female sterile plant, the method comprising introducing into a target plant an isolated polynucleotide construct of any one of clauses 1-6 to generate a transgenic plant.
[00154] Clause 29. A method for ablating microspore and megaspore mother cells in a plant, the method comprising introducing into a target plant an isolated polynucleotide construct of any one of clauses 1-6 to generate a transgenic plant, wherein the microspore and megaspore mother cells are ablated.
[00155] Clause 30. A method for restoring fertility in a male sterile and female sterile transgenic plant, the method comprising: (a) introducing into a target plant a composition of any one of clauses 14-20 to generate a transgenic plant; (b) introducing into the transgenic plant generated in (a) an isolated polynucleotide construct of any one of clauses 1-6 to generate a double transgenic plant; and (c) inducing the expression of the amiRNA, thereby restoring fertility in a complete male sterile and female sterile transgenic sterile plant.
[00156] Clause 31. A method for restoring fertility in a male sterile and female sterile transgenic plant, the method comprising: (a) introducing into a target plant a second isolated polynucleotide construct, wherein the second isolated polynucleotide construct comprises an inducible promoter operably linked to an artificial microRNA (amiRNA) targeted to the Barnase gene or fragment thereof to generate a transgenic plant; (b) introducing into the transgenic plant generated in (a) the isolated polynucleotide construct of claim 1 to generate a double transgenic plant; and (c) inducing the expression of the amiRNA, thereby restoring fertility in a complete male sterile and female sterile transgenic sterile plant.
[00157] Clause 32. The method of clause 30 or 31, wherein the isolated polynucleotide construct and the second polynucleotide construct are encoded on the same vector.
[00158] Clause 33. The method of clause 30 or 31, wherein the isolated polynucleotide construct and the second polynucleotide construct are encoded on different vectors.
[00159] Clause 34. The method of any one of clauses 30-33, wherein inducing the expression of the amiRNA comprises contacting the transgenic plant with estradiol, ethanol, dexamethasone, methoxyfenozide, or temperature.
[00160] Clause 35. The method of any one of clauses 30-34, wherein the target plant is a gymnosperm or angiosperm.
[00161] Clause 36. The method of clause 35, wherein the target plant is a grass, tree, or ornamental plant.
[00162] Clause 37. The method of clause 35, wherein the target plant is Arabidopsis, tobacco, alfalfa, soybean, maize, rice, Brachypodium, switchgrass, Miscanthus, poplars, cherry, or Eucalyptus.
[00163] Clause 38. The method of any one of clauses 28-37, wherein the SDS gene is an endogenous gene of target plant.
[00164] Clause 39. The method of any one of clauses 28-37, wherein the SDS gene is a transgene to the target plant.
[00165] Clause 40. The plant of any one of clauses 8-13 or 23-27, wherein the SDS gene is an endogenous gene of target plant.
[00166] Clause 41. The plant of any one of clauses 8-13 or 23-27, wherein the SDS gene is a transgene to the target plant.
[00167] Clause 42. A transgenic plant produced by the method of clause 28.
APPENDIX
The barnase sequence. The translation initiation codon (ATG) and the translation stop codon (TAA) are shown in bold letters (SEQ ID NO: 27).
ATGGCACAGGTTATCAACACGTTTGACGGGGTTGCGGATTATCTTCAGACATATCAT
AAGCTACCTGATAATTACATTACAAAATCAGAAGCACAAGCCCTCGGCTGGGTGGC
ATCAAAAGGGAACCTTGCAGACGTCGCTCCGGGGAAAAGCATCGGCGGAGACATCT
TCTCAAACAGGGAAGGCAAACTCCCGGGCAAAAGCGGACGAACATGGCGTGAAGC
GGATATTAACTATACATCAGGCTTCAGAAATTCAGACCGGATTCTTTACTCAAGCGA
CTGGCTGATTTACAAAACAACGGACCATTATCAGACCTTTACAAAAATCAGATAA
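As a sanity check on a transcribed coding sequence such as the one above, the following Python sketch tests that a sequence begins with ATG, ends with a stop codon, has a length divisible by three, and contains no in-frame internal stop codon. The function name and the short demonstration sequences are illustrative assumptions rather than part of the disclosure.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def check_orf(dna: str) -> list[str]:
    """Return a list of problems found; an empty list means the ORF looks intact."""
    seq = "".join(dna.split()).upper()  # tolerate line breaks and spaces
    problems = []
    if set(seq) - set("ACGT"):
        problems.append("non-ACGT characters present")
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of three")
    if not seq.startswith("ATG"):
        problems.append("does not start with ATG")
    if seq[-3:] not in STOP_CODONS:
        problems.append("does not end with a stop codon")
    # In-frame codons read from the first base, excluding the final triplet.
    internal = [seq[i:i + 3] for i in range(0, len(seq) - 3, 3)]
    if any(codon in STOP_CODONS for codon in internal):
        problems.append("in-frame internal stop codon")
    return problems

print(check_orf("ATGGCACAGGTTTAA"))  # toy intact ORF
print(check_orf("ATGGCATAAGTTTAA"))  # toy ORF with a premature stop
```

The same function can be applied to the BARNASE sequence above after joining its lines, since whitespace is stripped before checking.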
The amiR-BARNASE sequence. This sequence was amplified from the pRS300 vector, with the miRNA and miRNA* replaced to target the BARNASE gene (SEQ ID NO: 28).
GTGCAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTGTAAAAC GACGGCCAGTGAATTG TAA TACG AC IC AC! A TAGGGCGAATTGGGTACCGGGCCCC CCCTCGAGGTCGACGGTATCGATAAGCTTGATATGAATTCCTGCAGCCCcaaacacaegc tc ggacgcata ttaeacatgttcatacaettaa tacicgctgtittgaa gatgttctaggaa tata catgiagaG -
:¾ , ¾ tcacaggtcgtgatatgattcaattagcttccgactcattcatccaaataccgagtcgcc aaaattcaaactaga ctcgttaaatgaatgaatgatgcggtagacaaattggatcattgatttf^
irtctctttcgiattccaa^
gtaaaattaacattttgggtiiatcittatttaaggcatcgccatgGGGGGATCCAC TAGTTCTAGAGCGGCCGCC ACCGCGGTGGAGCTCCAGCTTTTGTTCCC I ΎΊ ACi fGAGGGl Ί AA Γ I CCGAGCTTGGC GTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGC
Genomic sequences of SDS-like genes in different species. All sequences include a 2000-bp upstream sequence. All sequences were obtained from Phytozome (https://phytozome.jgi.doe.gov/pz/portal.html#).
Common name          Latin name                 Name of sequence
Arabidopsis          Arabidopsis thaliana       AT1G14750
Rice                 Oryza sativa               LOC_Os03g12414
Turnip mustard       Brassica rapa              Brara.H02558
Barrel medic         Medicago truncatula        Medtr1g032850
Soybean              Glycine max                Glyma.02G086500
Cucumber             Cucumis sativus            Cucsa.174110
Potato               Solanum lycopersicum       Solyc04g008070.1
Maize                Zea mays                   GRMZM2G093157
Hall's panicgrass    Panicum hallii             Pahal.B00065
Foxtail millet       Setaria italica            Seita.9G484600
Sorghum              Sorghum bicolor            Sobic.001G450400
Purple false brome   Brachypodium distachyon    Bradi1g69380
Green foxtail        Setaria viridis            Sevir.6G118600
False brome          Brachypodium stacei        Brast02G101200
Switchgrass          Panicum virgatum           Pavir.Ia04006
Poplar               Populus trichocarpa        Potri.010G103700
Rose gum             Eucalyptus grandis         Eucgr.B02694
Cherry               Prunus persica             Prupe.1G335600
COS
Arabidopsis Arabidopsis thaliana (SEQ ID NO: 29)
>AT1G14750 | ACCESSION NC_003070 Chr1: 5079407..5082520 reverse
ACATGAACAACTGTTCGGTGCTACTATGTCAATGCATTTTGCCAAATTACTACTCAGTCT ACTCAC
GATTTATTGTACTGCGTTTACGTAACGCGTTTGTATGATCGTTTATTGGTAACCGTA ATTTATGGC
ATGCCCTCCTGCTTTTTTATTTAAGAAAAATAAAACTAATTATATTGTAAATATTGC ATTGATCAT
TTAGTCACACTCTTTAGAAAACAACAGTAAAATTTAAATATAAAAACAACACTAGCT TCCATGAT
TATTTTTCATAACCATTTATAATTGCGTCATCTTGTAAGTTGTAACGCATTGCCTTT CTTACTATGT
AACGGTTGTTGCATATTTTTGTGTACATAAATTTATACACAAAGATAAAAAGTGACT AAGCTTAA
AATATCCTTGAAAAAGCCTTTGGGTCATTAACATGGTGTAAGACTACAGGCGCATTC AGCAATTG
GAGTTCCGATTCTATTACAGTAAGAGGGAACAGAACCGTAATAATCGCGACACATTT GTTCGCAT
TTGTTAGCATCGCATGGAACCATTGGCCAGAAAACGGGGCAAGTTTGTTCCATCATT CTCGTCTCT
CTCGCACCTTTAAACAAACATCAGAAAATTTGTGACATTAATTAACAGGATTTGGCT TCTTATAA
AGATAAGATTAAAACTACTATTTAAAAGATAATCTGTACCTGAGGCTGAAACGATGA AGATGGT CATGATAAGAACAGCGAAATTTATGAGGTTTCTCATGGTTTTATGTTTTTTTTTTTCTTA ACAAAG
ACGTAAACTTGAATCGTTTTATATGCGAAATTGACAGAGAAAACCGGAAAAGATAGG ATCTCCTT
TTCTTTCTTTCTTTTAGTGAAATAGATGATAAACTTGTTTCTGCTAAAAGAGGTGTT TATTTTGGA
AATTATGAATTTTCTGGTCAATGTGATCTTAGAATTTTAAATAGGCTGGATTTTGTG ACCTGATTC
CGTGTCTTATATCTGTATTTACTATATTTAGATGATTCTCTGATAACTGATGTTTTA AAAAGAAGA
TAATTTTGATAAAGAAGTGATTACGAACTTTCCAACATTAAAAGTTTAGAGTTTATT TGATTTTAT
ATCTAATCTTGGTTTATATGTTTTTGATGGGGTTTACTAATTATATTATACCATTCA AGTTGAAAT
ATATACAAGTTTTTTTTGTTTTATCCCTAAATTCTCTAATGTGATATATATAATATA TAATTTGGAT
CGGATTCAACCAAACCATGAACGAGATTTACATTTTGCCGTTTTCCGAAATGTTTTG GGCTTCGTA
AAGAACTAAAGGTGATATTTAGATATTGGGTATACTATTTGTTGTATTGGGCTTAAA AGTTTACTT
TTTTGGCCCAAAATTAATCAACTAAAATAAGATCACCAATGGAAAAAGAAACAAAAA AACCAGT
AAAACATATGCAGAAAATGTAAATTTACAGGGCCTAATATAATCTGCTTGACCATGC CATTGCGA
CATAACAAATGTTACACAAGTAGTGTACCTATAAAGTAGTGTACCTATAATATATTA ACAGTGAT
CAATTTCAGTGTATAAAAAAAGTCTTCTTAAATCATCTTTTAATTCCAACAATATGA CATTCACAA
ACTTATCTATGATTTTTTTAAAAAAAAATTCACACGTGTGCTCAATTTATGTTTCTT TTAGTTCTTC
CACGTGATTTGATGCAAGAAAAATGATTAGACTGTATGTTAAAAAGCATACTAGAGA AATTAATT
ATAAAACATCAATCAGTTGAAGTAATTATCAAAACCGCATGCTTTTTTAGCTAAATC TGTGATTGT
ACTGACGCAGATGCATAAATTCAAACGCAAACGCTGATCTCTACATTAGCCAAACAA GAATAGC
GTCCAAATTTACGACTGGTTTCACGTGCACCAAACCGTAGGGTATAATATCTCTCTC TCACTCTCC
AACATCCCCACTCTTCCCAAGAAACTTCTATAACTGCATCAGCC ACTCTCTAGTC C QA nAAC
A AG G A<3 ATCGCG ATG A GG A ATTCA A AOCG A A G CCTG AG CG ACCCCGTTCGCXO G A AGC TCCGGTCGAC CGATTACGCCG AAGAGAGC^^^
GGAGCAAACAAATCGGAGTCTCTGCTGCTTCTGTCGATTC K TCCGATTTG A JCTGATGAC
AA GTTTCCTGTGGTT GAGCAGAGTCGAGAAGAGCTCGAATC GAAGAAGACTCTAATTGAAG
AGGTAGAAGTTTCTAAACCTGGTTATAATGTGA^
ArFACGAGGT rTAC C ^
CGTCTTGTGTTGATTCGAATTCTGGTGCTGGATTAAGGAGATTGAATGTGAAGGGAAATA AAATT
AACGACAACGATGAGATCTCmCTCACGATCCGATGTGACCTTCGCCGGACATGTCTC CAACAG
CCGGAGTTTGAATTTCGAATCGGAGAATAAQGAGAGCGACGTCGTTTCTGT ATATCTGGAGTTG
AGTACTGTTCCAAGrrC KjGA GTTACCGGAGGA K TGATAACGAAGAAATTOA^TCTCCAA
G CGAGCAGCTT GTGGAAGCTGATTCC C TTGGATCGG CAAGGAATTGAAGC GGAGCTTG
AGATAGTCGGATGCGTCTCTGATCTCGCrrGCTCTGAGAAATTCTCGGAAGAGGTTT CGGATTCTC
TCGATGATGAGTCATCTGAG ^CGrrCAGAGATATATTCACAGTATTCCGACTTCGArrACrCG
GATTACACTCCGTCCATCTTCTTCGACTCTGGCAGCGAATTCTCTGAGAAATCTTCC TCTGATTCT
AACGATTTTGGATCTT TTGCGAGGAAGAAATT ACTCTGAAGTAAGTGGTATAATGATTTCATA TCTCTTGGAATAAT^
TCGATTACTAGTCTATTTTTGATATGAGACTTGTTCTGCTCTGTGTTTGATTCTGAAATT TTGTTCT
GGAATGAATCTTAAGTATACATTTTCGTTTTAGTTGCTAAGGTTTGATGATGAGGAG GTGGAAGA
GAGCTATCTAAGGCTGAGGGAAAGAGAAAGAAGTCATGCATATATGCGGGACTGTGC TAAGGCA
TACTGCTCCAGGATGGACAATACTGGTCTCATCCCTCGTCTACGCTCCATCATGGTT CAATGGATT
GTAAAGGTGAATTTTAACTTTCTGTTCAAA
GAAGCTCAGAAATATGTATCAGTAGCAGAAGATTATGAAGTAAATGAATATTTGGAG ATCCTGTT
CCTGGTTTTAAGAATGTTTTAGCCTAAGGAAATCTATAGCTTACTTTGGAATCTTTT AAGGTTTAT
GTATCAGTCAGCTATGATATTCTTTGTTGCTGATT
GTCTGCTCCCTGATTACAAGC ^ AGCAATGTTCTGACATGGGGCTTCAGCAAGAGACATTGTTTCTA GGAGTTGGTCTGTTGGATCGArrCCTGAGCAAAGGATCATOAAAAGCGAAAGGACTCTAA TACT AGTCGGGATTGCGAGTCTTACTCTGGCCACCAGAATTGAAGAAAATCAACCTTACAACAG GTACC AACCATATTCCAT
AGATTAGGACCATTACAAGAAACTGAGTATTACGCTTAACCAAATCAAGGACTAATA ATGGTCTA
ATACAAACCCTTATGGTTCAATGAATTGGCATTTCATGTGGGTATCGAATATTGGAT TATGTTTCT
CAAAAACACTCTTTACTGGAAAGAACCTTCCACAATACACAGGAATAGTTCAATTTT CTTCAACT GCTCACCTGATACTTGCTCTTTTTAACTAGCATCCGGAAAAGGAACTTCACCATTCAGAA CCTAA GATATAGCCGGCATGAAGTGGTGGCAATGGAGTGGCTGGTTCAAGAAGTCCTCAACTTCA AATG CTTCACACCCACAATCTTCAACTTCTTGTGGTAAAACCTCT
GACACATTATCCACACAGAAAGATACATATGACTATCATTTATACATGTCAGGTTCT ACTTAAAA
GCTGCTCGAGCCAATCCAC^GTTGAAAGGAAAGCCAAATCCTT KjCTGTTACCrCACTATCCGA
C AAACTClAACTCTCnTnTGGCCCT
ACACAACAAAATCTCTGCATACCAACGAGTCATAAAGCJ I ATCAn
AATACCTm
TCCATGTTAGAACAACAGATAACGAGTTGCCTGAATGCGTTAAGGTGTTTTCAGTAACAC TCTCA
TTATATACAAATCTCATTTTTACCACTAAACGTAAGGTAAGTGACTGTTTTCACATT TTTGTTCCCT
ATACAACAGAGTCTGGACTGGTTG TTCiCiGCAGTAAGCAATCAAAAAGAA AAAAAC CTAAAA
CCAGGACACAGTATACTCCGATACCAACACACAGGTTATCATTACTATTTACAAAAA CAAACACA
AGGTAAGTAATAAGAA T CTCTACAGATTTATATACTTAAT GAGCTGGACTTAATTACiCTCTT
AGTATACCAATTATTAGTijCCACCATTTGTGTCGCTCATACACATTTATTTCTTAT TTTCCCTAATT
CATrAGACTCTCATATTCTTAAAAAGAATATTTCCTTGTTTG
Rice Oryza sativa (SEQ ID NO: 30)
>LOC_Os03g12414 | ACCESSION NC_029258 Chr3:6556387..6562025 reverse
GGATGCTTGCTACTGGATAGGAGTCATGGAAGAGAACGGGGTGCTCTGTGACACTGATGT CTACA
ATGGTTTGTTGCTTAGGCTGTGTGTGGAAGGGCATGTTGGTGAGGCCTTGGCGTTGG CTAAGAAG
GTTGCTGAGAGGGGGATTCTCATAGAGGCTTCTTGTGCTGATCGTTTGATGGATTTG CTAAAGCA
ATATGGTGATGAGGAGCTAGCACCAAAAATATCAGAACTGAGGAGGTGCTCTGAAGT GCTGTCA
CATTAACCAATGTGTGATCCGAACCCTCCTACAAGTATCATGCTTGGTTGATTTCAA ATCAAGAA
AAATGCTTCCGTGCTGCATGATTACAGCAAGAAAAGGCTTTGAGGGTTTGTTACGCT GAAATAGA
TTGGTGGGGATAGGGTGCAGCACAGAGTGATTTGTGTGAGCAAAATGTGGATGAGTT ACTTCATT
TACTTGCCCATTTCCTGTAGTTTTTCTGAACTCTGTTCAGATCCTCCAGTCCAAGGG ATGCTTCAG
GACATGTGAACTATGATTGCGATGGAATTCTCAGGTTCCTCATTAGTATGCTCCCAA ACAGATAT
GTTTGTTTAAGTGGTGATCAATCAAATGTTTTACATTTTTAAAGAACACATATGCTG ACACTGTAA
CTTGTAGTAGTTCTTCGACCTCCGTTGTATAGCGGCCAACTCTAATCAAGATCAGGT TACCGATTT
ACAGCTAGAATGTTCCAACTTGCATCCTTTGATGCAAGTGTTTTAGTTCACTGACTT TAGTAGTGA
ATGTTGTTTTACGGGAACTCTTGTGTTTCCCCAGGGTGATGCACAAGGGAACCAAGG TTTTCGGT
ACTCTGTTCAGAATTCAGATTCAGAGGAGACGTTTCTGAAGTCTGCGGCAAATGACG GTCTTCAG
AAGTGTGTATCAGACTATCAATCAGTCCATCAGGGTCCCATCTACATGCATACACTT TCCTTTTCT
TTCATTTCCTCTTTACCGAGCTATTTGCTCCAAACCTTATCCAAGCCGTTTCAAGGG CCCTTTGAA
TCGTAGGAATGAAAAAACAGAGGAATAGGAAAAACACAGGATTCTGACAGGAATACA ATTGTA
AAATAGAGGATTGCAAAACACAGGAATGGCCATTTGATTGGATCACAGGAAAAACAC AGGAATC
AGATGAGAGAGATAGACTCAGAGGAAATGTTCCAAGAGGATAGACCTATTGCTAACT TTCCTCC
AAAATGTGCATAGGATTATCCATTCCATAGGAATTTTAAAGGATTGGATAAGATTCA ATCCTTTG
TTTCAAATGCCTTCATAGGATTTTTTTTTCATAGGATTGAAATCCTCCAAAATTCCT TCATTTTTCC
TACAAATCAAAGGAGCTATGTGTAACTTGAAATACCACCGGAATACCAGCAGATTCA AACAATC
GAGCTTCAACTGTACTTCCTTAAAAACGTAGTGATCCGAGTATGCAGTGACCCAATT GGAGACAA
CCTGGTTGGGATTGGGTAATTCTTCCCCGATCCCAACTGGGTTAACCGGATCGTTCG ATCGAGATT
GTTCGGGTAAGGAATTAATCAGGGTCAGGATAGCATTTTGACAACCTGGCCGCCGTC CGCCAATC
CCGCATATGCGCTGGCCCCAGCCCGACATTCCCCACTGGAAGCAAACGCCTCGATTT GCCTGGGA
GGCTGGGACACGGGAGGCAAGGCCGTTATTCCGGCGGATCGCTCGTGCACCAGATGC ATCTGGG
ACCCACAGGGATACGCCGGCGAATCTACACGGACTGGAGTACACAGGCCATCCAATG GAGCGCG
AGCGGCGCATGTGCATCGTCGCAGCACAGTCGTGGCTTGAGGATTTTTGGAAGTTCA AAAAACAC
TTCACCGATCAGGCGATGCGGCGAACAGCCGAAGGTTGCTGAACACTCCTCTCCCCC TTCCTCTC
CACGCCTCAAGTCACGGCTATATAACTAGCAAGCCAAACGAACATAGC( e^^¾^^e
CACAAAITCACAATTCACACACAA^ ATCGGTGCCGACGAGGCCGC^
CCTCCTCTGCTCCCTGATCAG^^
CCTC€CTCGCCG€AGCR€AG€GCCCGGAGAAGCCCCCTCGGTACCAGGATGT CCACGAGGAGCA GCCCGCCGCCTCCGAGTGCTCCGAGATCATCGGTGGCGCGAGGCCGCGCGCCGCCGAGGT CGAG
A CGACGCCGAGACGACCGCCTAC^ CTCGCCTTGGCGGAGCAGTTCGTCC CTT ACGCATCCCAAAACGCCCAC GCCACGGATGTCG
CTCTACAAG GGGAGAGGTAAGCAGGTTCCACACAGCTTCTCTCAAATTTGTTGGAGTTTGGCTG
CTCTCTCCCGAATCAGTGCGACATTTGATGTAATCAGATGGGAATTATTGTA
TTTGAGGACTTGGACAATGAGGTGAGCTACGAGCGGTTCCGGCGGCGCGAGCGGCGA GGGGTGG
TAGCCCGCGACTACATTGAGGTGTACTCCTCCATGCTCGGCAGCTACGGCCGCGCCG TCGTGGAG
CAGCGCGTTGTCATGGTGAACTGGATCATGGAGGTCAGTATGCTCTGCATTGTACAC GTATGCTG
CGACCATGACTTTGCCTTGGTCCAATAATTCATCTAGCTCGAGCGCAATTTGTTGTG TGGTAGCTG
CAGATGTTTGTTGCACGGGTATTGTGCTGATAATGCACATGTATACTTTCGGAGTGC TGTACTGAT
CTAGATTCTATCAGCTTATCTGTCTGACTGATGTGTTCTGCAAGAACAGTAGTTGCT TGCATTTTG
CACTAACATGCTACTCTGCAGTGTAAATGCTGTGCTGGGTGGGAGTGCTGAGGTAGC AGATCCGG
TTGTGATCTTGCATATTTTGTGTAGGGGAAGCTATCTGATACCAATATACGCATTGC ATTTGCTGA
TTCGTTTATCACACAGTTAGCTGCCTGTACATTTTGCAGCATTCCCCGTGTTAGTCT GCCAGCAGT
GTGGTGGTAATCCTAGTGATTTCAATGTAACTCTAGTCATTAACTTTTATGTCATCG CATTCAAGT
TCTATAGAGCAGTGATAAAGATTTTCTAGGTGCTTATTTTGGCATGCTTGCATATTT TTTTTGGAT
GTAATGGCATGCTTGTATGAAAAGAAAAGAGATACATAGGATTTTTCTAGGTGTTTA TTTTCGCA
CCTGAAAGTTGCTCTGTAGTCATTTGCCATCTGAAATCCCATGGCATTGGGCATTGG
CATATGGCCTCCAGAAATGTCTGCATCACTTTTTACTTCTAAACCTATATATGCAAT AAGTGTTAG
AAAAACATATTGCCAGTGTATTCCCTTTTTTTGTGTATCCACCATCATCTGAATTTA ATCTTTTCTA
ACTCTAGTTCCATCAGTTTATATTTGCATTGAAACTTCAGGTAGCCTTTGAAAATGA AATCGCTGT
GTGTTTTTTGTTTGAAGTTTTTTTGGGCAACTCAGCGATAAGTACTTAAGTGATATG ATCATTTAT
ACTTGTAATTTGATTTTTCTGAAAATAATGTACTGTCATTTGTGGAGAATATGTTTG TTCTTACAT
AGAGTTCATACATAACTTGCAAAGAATGACTTGTTCTCTATGCATCGCTAGCATTCA CAAGCGAT
GAAGCTGCAGCCAGAGA CGTGTT ATGGGGATAGGG TGATGGA CGCTTCTTGACACGTGGA
TGAACAGAATCAACCCTATAATTGGTAATGGCTCCCTTAAATGACCAGTTTCAGTAT GAATACTT CTGCAAGTTGTTGCTGTCT
TGTATTATTAAATGGGATAGTGCTCAATTGCAATGCTGCAAATTCCTTAGTATTCTT TGATAGTTT
GATTTTTCTGAAAGTTGAACTTAATTCTAACAGACATTAGCATGAGTTTATGAGTTT AATTATGTT
CTATAAATAACTTAATAACAACAGGAACATCCGATGGTAACTCATATCTCATCTAGT AGCTATTT
ATCACTTTCAATACTGAGCACTGCATGTGAGAACTCAGACCATTGGACCATTGTTCT ACAAAAAG
GTTTCACACGATGGGTATTTTAGAAATGAAGGGCTTGTTTGGTTCTAAGCCATTGTG GGCCATAC
CAATTTTTTGGCAATGGCAAGATTTAGCCACTCCAAAATCTTGGCAAAGGACTTGTT TGGTTTGTT
GACAAACTTTTTGGCAAGATTGATTAGTTTAGTATTTAGTTTGCTACCAAGGAAAAA TATTGGTGT
TGCCAAAATTTGGTGACAAAAAGAAAGCTAACAAAATTTTAGGCAGACCAAAATATT GGTATGG
TTTTGATTGGCTGAGAACCAAACACACCAGAAGTTTCTTGATTGTGTTGACTATGAT CAAGCATCT
CTCAAAATAATTTACCGCGCCAGTTGGCATCCTTGAGCTATTGTTTTTTAAGAAAAA AAGAACTA
AGCTATCAATTATTGGGAGAGATGCGATTTTGAGAAATTATCACTTGCATAGATCTC CTTGATAT
AGTTTTATTTCTCTGTTTTTGATGTTACAATAAAGCTATTACATCAGCGCTCTGGTG GGCGAGATA
TTCTCTGTGTTTTCCCTTAAGTTTTAAAGACTAAACTCATATGAATCTTCAACATTT TTGAAAGTA
GACCTTATTTGTATCCCTGAGAACTTGCTAGTAAATGAATGTTTTATGTGCCACTTA TTACTGCAC
AGAAGCATGTGAAGTGAACCCAGGAACTATTCTTCTTTGCCTGTCTACTTTGCACGA AGTATCAC
TTTTGAACTTATGAGTTTGCCTCTGGATGCGCATTTGTTTCAAGCGCCTAGTTTCTG TCAAGGGCA ATGTGCAATATAGACCTTTTCTTTAGAACACTCTACAGGAACCTAATAATACGATCTCTT TCAGii
TCCTTCAAAAAGCTTTCAAAGTAGGGATCAATACTTACAGCCGAAGTGAGGTCGTCG CCATGGAG
TGGCTGGTTCAGGAGGTCCTTGACTTCCAGTGCTTTGTCACAACAACCCATCATTTC CTCTGGTAC
TATGCCTGTGTGTTTGCTTCATTTTCTGTGTCAGCTGGACAGAATGAATAAGAAACT TACAATTGT
TTGGTTCAACTTTGCAGGTTCTATCTGAAGGCTGCGAATGCAGATGACAGAGTCGAG CACCTGG€
GAAQ AC T GOTTQ^^
AGGTAAATACTTTAATCTTCCATCACTGGCTATGCTATTTCTCTTATATCTGCAGTCTGC TATTCGT
TCAGAACTTCCTTAAGGAAAAAGATTCTGAACTTTTGCTCAGTTTTGTATGTTGCTG CTTTCATTTT
ATCTCGTAGCCAGATGACAAATGCCATGAATCACCTAGGTCTTCACATGCTTATTCC AATTCACA
ACATACTGATCCTTGCAAAGAATACAAATGTCTAACCACTTGCTCTTCACATAAATG CAGACTCA
CATGAGAACGAAGAATGATGATCTGCCTGAATGCCTAATGGTTTGTTTTCTCCTCAA ATTATGATT
CTGGTGAAAAGTGTTGTACAC^
GGCATATAACGTGCAGTAATCCTTTTATGCAGAGTCTCGAATGGTTGACCAATTATGCTT CCTGAT TTGAACAAA CCAGQTGATATGCCGA CCAAT TTCTGC CATT CCAGAAACACAGTGTACiTAC
GCATTAGTrrGACACCAGGGTAGAAAAGAGGGCAAAGAAGCCGGCTAAACGTGGTTC TGATGGC ACCACACTTATAGGGAGCATCGCAACCACGAAATTTTGCTACTACTGCCGGCTTCAGTGA CTACC ACTAACOTC OT CTGCA A ACA GATGTTCA GTTA G CTGAAGATCC AG AGTACCACCTCGTTTGCTCA GGTAC ATGTC ATCAAAACATCTACTA TAATCTCTAGTTTTATTCTCTAGA TTCTCTATT CAATCTATCTCCAGG
GGATCGCGAGAGTTCCTGCACGCCATAAAAATGAGGAAAACGAGGTGGGGAGGGCGT GGCTTCT TTTCTCTATGA
CCCCGAATGTTGGGGGCGTCTGCAGAAAGAAGGGAGGGCGTTCGCGTACACCCCCCT TTCAGCCC
CACTTGCGCCCGCGCTCTCGTTGAGCCGTCTGCCTTTTGGGGTTCTAGAGTGAATAG GGTCGGCA
CGTTAGCGGCGAGCCGGCGACGGGTAATGTGGTG
TGCCAACAACTTTCAGCTGCTGCTGAAGTGAAGAGCAGGTGGCGATCAACCAGTCCC GGCACTG AACTACTGAACCTGAAGTGGTTACTCCATTCATCGACAGAGATATTTAATTATTTGTCAC TTTTAG ATGTAATATTTTACCACATAAACCTACAAATCATAAACATATAAGTGACAAATTATTTAA CGCCA CATCGAAAAATGGTAAATGCTTATTTGCCCCTTCCCACTCTCTC
Turnip mustard Brassica rapa (SEQ ID NO: 31)
>Brara.H02558 | A08:20912243..20915016 forward
CCCGCTGGTGATTCCCGAAGTGAATCCCGAGGCGATGAAAGGGATTAAAGTCGGAACGGG GAAA
GGGGCGTTGATTGCGAACCCTAATTGCTCTACAATTATCTGCTTGATGGCTGTTACG CCTCTTCAT
CATCACGCTAAGGTTCGATTTTTTTTTTTTGCAATGCCAACGTCTTCGCGTTTTGTG CTATGAGTAA
CGTTTTGATTTTGGTTATAACAGGTGAAGAGGATGGTGGTTAGTACTTATCAAGCAG CTAGTGGT
GCGGGTGCTGCAGCGATGGAGGAGCTTGTGCAGCAGACTCGCGAGGTTTTGCTTCTT TTTTTAAC
CATTCCATTGACTTTGATTAACGATAATGCTGAGAGTTTGGATTGGTGTTTGCTTAG GTTTTAGCC
GGTAAGCCGCCGACTTGTAACATCTTCAGCCAGCAGGTGAATAGTCAATTTTGCTTA TAGTTTAA
TTTTCAAATGGTGGTTTAGTGTTCTGATTCTGAATTACTTTTTTGATTGATTTGTTG TCTTCGATAG
TATGCATTTAACTTGTTTTCGCACAATGCTCCCATCACTGAGAATGGTTACAACGAA GAGGAAAT
GAAACTTGTGAAAGAGACAAGGAAGATTTGGGTGAGTGGTTACTTGAAGAACTGTTT TGTAGTA
ATATCACTCTAATATTTTGTTCATAACGCTGGTTATGTTAAGAGGCTATTTACCTTT TCCTGCTTTT CGCAGAATGACACAGAGGTCAAAGTAACAGCGACGTGCATACGTGTTCCGGTTATGCGTG CTCAT
GCAGAGAGTGTGAATCTCCAGTTTGAGAACCCCCTCGATGAGGTAATAATAATACAC TTCAAACT
CGTATTCTACTAAGTTTGTTATTACTTATTAGTAGTTTCTGAAGCATGGTTCATAGT GAATTTCAA
TTTGAATCATGGGTAAAACGGCATTCTATAGGCATTTTTAACTTCTTTTCCAAGGAC CTGTGATCA
GCGTTAGTAAGCTTGGATAGTTCTTGAGGAACCTTGCAGAGTTAAATCACCTTAGAA TTGTTATTT
GGACTTGTTCTGCTAGCATCTTTAGAGAGCTTGTGATTCTTCTTACGCGACAAAAAA TAATCTTAT
GATCAAGTTCTTTGTTATCTTAACAGAACACAGCAAGGGAGCTATTGAGGAAAGCAC CTGGAGTT
TACATTATAGACGACCGTGCCTCTAACACCTTCCCTACTCCACTTGATGTCTCTAAC AAAGACGAT
GTAGCGGTTGGTAGGATCAGGCGAGACGTGTCCCAAGATGGCAATTTCGGGTTAAGT CTCACTCT
CTTTTCTACTAAATTTAAGATCATATGAGTTCTTTCCATTAAGTTAAAAGGCTATAA TAACTTTGT
GAACTTTCAGACTGGACATATTCGTTTGTGGAGATCAAATACGCAAAGGAGCTGCTC TAAACGCT
GTTCAGATCGCTGAGATGCTTCTCTGATTTGGAGTCCCCTCACTCACTTGGCTTCTC CTGATTCTTG
ACATGATCAGATTTGAGCCAAGAACTTGTCTCAATTTTTTTGTTTCCCTATTTGACC AGTTTTGTTA
CTTTTCATTATTCATGAAGTTCTCTCTGGGATCTAAATCATCCACAACTCTGGAACC TTGCCAATT
TCCGGTTCGAACCGATACCGGCTTGGTTAATGAGTCTTTGCATGTGATATTATCCAA GAAAAATT
ATTAGACCGCTAATAAACGCGCGAAGTTAATTTTTATATATACCAAGAAGTTGAAGT AATTAACA
AACCGCATGTTTAAGCTATTGTAATTTCGATTTGTGATACAAAGCACTTAAAGCCAA ACGCTAAC
GCTGATCTTAGATTGACTAGCGTCCAAGGTTGCGATTTGGGACCACAGGGACGCTCA CATGGACC
TTTCCGCAGGATATTAAAACCTTTCTCACTCTCCACCATCCTCTTCAACTTCCATAA TAACTGCAT
CACACTCTCTAGTTCTAACCAACAGAAACGAAlie
GAAGTTTTCGTCT AAAGGAAGGATGAAGGAGATCG GACGAGGATTT AAAGCG AAGGCCGA
T CAGTTTCCGTCGAGCCACC C C ATCA AAGGAAACAGGAGTATC GCTGCTTCCGTCGATT CCTGCTCXRAATCTGCTCTCTGCAGTCGA GACAA GTTTCGTGCGGTTCTAGCAGAGTCGAGAAG
AGA AGATCATAG ATG AGGCXG A AGT AAGCGA A CGTCATTCAC AC GATCOG ACGTG ACATTCGC GAGAGTAAGGAGAGCGA GTCGTTTCATTCGTTTCGG TGTGGAGTCTTGCTCGAAG T GGAG
AGCCGGAGGTTCAGACAGT€GGATGCGTAT€CGATCT€GCTTGCA€GGA GACGTrrTC€GGCGAA
GATGTTTCGGATGATTACGAGGATGAGTTATCGGAGCAGCGTTCCGAGATGTTTTCA CTATCCTC
CGACXTFCGATT AT GGATT^
ATCTAGCTTTGATTCTC AATTTCACATACTCGCTCTCTGTA CTT AGTACAAGGAA AGTTCTG
AAGTAAGTGCTATTTAGATTACAGATTGAAGGTGTGGTTAATTACTTGATGTTTCAC TCGATTTGC
TAGTCTAATTTGATCTGAGATTTGTTCTAAAATATACTTAGCATTTAAATCCGATAT TCTGTTATG
GAATGAATCTTGAATATACGTTTTCGTTTAGCTGGTAAGGTTTGAAGATAAAGAGGT GGAAGAGA
GCTATCAAATGCTGAGGGAAAGAGAGAGAAGTCATGCGTATTTGCGTGACTGTGCTA AGGCTTA
CTGCTCCAGGATGGACCACGCTGATTTCATCCCTCGTCTACGCTTGATCATGGTTCA ATGGATTGT
GGAGGTAAGCACTATCATTCTGTTCTTATATGCATCTGAATGTTCAATCTCAGAAAT ATATACATC
TTATTACTCATTAGATTAATAGGTTAAGAGTCATGTGTGTCA
AATATTGAATTATGTTTCTAGAAAGCTCTTTAACTGGAGAACCCTTTCAACACACAC GTAGCAAT
AGTTCAGTTTGCTTCAGCTGTTCACCTGATACTTCCTCCATTTATGTAGCATCCGGA AAAGGAACT
TCTACATTGAGAACCTAAAGTATAGCCGTCATGAAGTGGTGGCAATGGAGTGGCTGA TTCTAGAA
GTCCTTAACTTCAAATGCTGCTCACCCACAATCTTTAACTTCTTATGGTAAAAACCT CTATTACTA
TATATTTTCTC GrrCTTGCCTGCAT ^CACAACAAAACCTCAGCCTACCAACCmGTCGTAAAGGTACCAGTCTC
TTCAACACTACTTTAAATACTTTTTGATTTGAAGAATATACAGAATAATTACAATCC CAAACCTCT TTTTTCTCGCCTTCTGCAGGTTCATGTTAGAACAAAAGATAACGACCTGCATGAATGCGT CAAGG TATATTTTAAACATCACTCTCATACTAATCAGACCACTTATTCTCCACTAAGAGGGTTAG CGAAG GAGTTTTATATTAGTGTTTCTATATACAGAGC TGGAATGGTTCCTTGGGCAGTAAGCAATCAAC
TTAATCTCTGGACTTTTTAG TGTTGTATTGGCA A TAATACCCAATTATTTGTGTCGC ACCAA CATTTATOCTTATTTTC CCAATACACTACACTC CATTTTATTAAAAATCATTTTATTGTTCAGT
Barrel medic or alfalfa Medicago truncatula (SEQ ID NO: 32)
>Medtr1g032850 | chr1:11757673..11761366 forward
AACCTACCAATATCATAGGTTCACTTCTATCACCCAACTTCTTTCTCTTTGCATCATGAA CATGCC
TGGAGCACAAGGAACCAAACACTCTCAAATATTTTGCAGACTTACTTTCTTATTAGC AATGAAAA
TTGTGAGGTTACAAGTTTATATACAACATCATATGTGAGTTAGTTGCTATACAATTA ATAAACCA
AGACTTACTAATTTCTAACAAAGTAGACAACAAACTAACACATTGTTTTAACTACTT TTATTATTG
CAACTAACTTGAACTAAAAACTCACGATTAGTAGCAGAAGAATATTTCTTCATCACA TTTTACAA
ATACATAACAAACATTGTTTTGTTGATTTTGTTTTTAGTTACAGTCGTAACATTTGG GAAAAAAAT
ATTTATATTAGAGTTAACTCACGCGTAAGGTCGTAGATTAAACTTTCATCGTCGATG TGAACACA
CCTTTATTGATTGATCTATAAATGGTGAGGCCTAGATTACCCTCTCTTGTTTATAGC TGAAAAGAT
GGTTTATTAAAATTGAAGTGTTTGGTAAAATTAGTTGATGAAGTGGCTGATAAGTAA AAAATGAC
ATAAAAGGACATGTTTATATATATATAGACATTTTTCTAATGTATTTGTTTTTTAAT ATTTTAATTT
ATGGTTAACTATGTTTTGGATCCCTATAAATATTCAAACTTTTGGTTTTAGTCTCCA ATAAAATTT
CACCGACAATTTTGATCTCTGCTTATTTTATTTTTTTGTACAAAATTGAGCAAAAGT TCATTGATCT
CGACTTTTATGAATCCCAAAAATAAGAGGAAAGTGGAAAAAAAATATAAGCAAGAAT ATAAAAN
GTGGAAAAAAATGCAAGCAAGAATATATAAAATTTTACAACGTACCGTAGACTAGTT ATAGTTA
AATATAAAGCATTTCTTTTAAGAAATATATATAAAGCATTCATTAAAAAATAAAATA AAGCATGA
CAGTTTTTTTTTTAAAGGAGAAAACGTGACAGTTGTTTTATTAAAAAATAAGCTATG AACTTGGC
CGTTATTTTTAAGCCATGAACATGTTGTTTTATTAAAAAATAATTAAATTAAATTAA TATGGTTAA
AATTGGAAGAAATTATAAAAAAAAAAAAACTACCAGCTATAAGCTCAAAAGCTACTT GAAATAG
TTTCTAAAAAACATTTATGCTAGTGAAAAAAACTTTTTACCAAACACATCTTATTAT ATCAAAAC
GAGCTTATAAGCTAGTCCAACAAGTCATAAGCTAGCTTATTCGTGTTACCAAACACA GTCATGAT
GGGTTGAATGTGATGTGAAATTTTAATTGTTACAAACCCTATAGTGTAAATTAATTA TGATTTATA
CCTAAATATAATTAAATAAAAATTAAAAGTTACCTTATTAAAATTGATTTTTTTTAT AGAAAAATT
GACTCATTTATTTGAGATTTAATGTTGCATTTGTATATTACATTTTGATTGGTTGAT ATCTTGAAGG
TGTAAATACCTTCTAAATAAGAAAGTGTAATGTAGAAAAAACCTCTATTAAATACAT TTATAAAC
ATTTGTTAAACATCGAGATATGTTCCGACAATGATGAGTCTAGAGTCCAACTAACAA AAACTTTT
TTTTTTATATAAAAAAGATATTATGTTAAAAAAAATTGATAAATATTATTATTACCG CTACTGCAT
TATATAATTTGTATATATATATATATATATATATATATATATATATCAAATGCACTT TTCAAAAAA
TTAAAAATATCAAAACACTTTATTACAGTTAACTTTCGTCATGATGTATGTTGTGAT GGACGTGGG
GCACGGAAAATGCACTACGTGGGTCCACCTCATATAAAAACCCTCCTCCCTCGTTTT TCCTTCAAT
TTCATAACCATCCCTTCGAACACTCTTTCCTTCACTCATCTCAACTAAACTTAACTC CAACAAACC
AAATTCAATTTTCACTGCATTTTCTCACTTCACAATQATAATAATCAAATCTAGAAA TTCCAAA G
CAAGCTTCAACACGAACCTTCACCGTTACACGTCATCAGCAAGAAGCTCCGGTCGAA GATTCCTC
GCCGGAAACGACGTCAGATCTCACCGGTGCTACTTGTTTCTCCGAGATTCAAAGCrr crCGTGAG
A ATOG e l GTTTTTCTGTTOOTTC A GTTG ATTCG AOTTCTGGTTOG G ATTTOG C GG AGGTG A AGTT
TCGTOTC^TTCGAGTAGAATCTCTGCTGTTAAAGGAAOA^CGAACTCOAOAAGTGAA ATrrCGAG TGGTGrrGAATGTGrrCGTAGAmGAGAAGACKjAATGAGAATGAAGTTGAAGTTTCGGAG ACTT
CGTGTGTGGATTCTAGTTCTGGAGTTCGTAGAAACTTGATTTTGAAGTTTGAAAATG GAAAAGAG
AACGATGAAGmCTGAAGTTTGTACGAAATO^
TAACGGAAATTCGAATTTGAATTTGAATTTGAATAT T GT GGAGATAACACGAAACGATGTTG
nTCCGTTAACAGAGCATCGOAATCTGAATTTTCTCAAATrrCGAGAAATCGTAAT TGATOAG
AATTG GTTATCGCGCAATCGATTATGAAGAATTATTCGGATAATTCAGGTTACGATT CGATCT
AGCnOTTCTGAGAAACTGCAATTCTCTTACTACGACGATGATGAATCGOAGGAGTAT TGTTCAA
GTCAGGGAACTACATTCTCTGATCTTCACTCC mTATTrrCAGTGAAGGTTCAGATTArrcrCCGT
CGCAGTTCATTGATTCTGGTAGCGAGTTTTCACAAGGATCCGTTGGTGAAACTCCTT CTCATACTT
TTGTTGTAGAAGATTTTT T TTCATAACAATGTACGTTTTGTTTCATCAATTTCTTAGTTCAAAAT G ITAA ΓΛΛ ΐ ' .Α
TGAAGATATTGATGATGAAGAGAGTTACCAGATGCTGAGAAAGAGAGAACGAAGGCAAGC TTTC
ATGTGGAATTATGGAGAAAGATATTTCTCTGCAACGGATTTTGCGGAAGTATTTCAG CAACGTTC
ACGAATGGTTCATTGGATTGTTGAGGTAGTTTACTCTATAACTACATTCAATTAAGC ^^
TTATTATGATTAATTAATTAATTAGTTAATTGTTATAAAAGTATGTATTATTATGCTTAA TCATTGT
TTAATTGCTTTGATATGGTTTTGTTAATTAAGAAATTGAAACAGTGACATGGTTGGA AATGGAGA
ATTGAAACAGAGTTAGGCTTTGCGTTTGAATGTAGTAGTTATTTTTATTTATTTGTT ATTACTTAAT
TAAAAGCTAGTAGTTCGTTATTAAGTAGTGTTGCATGTGAGAGAGCACGCTTGAAGA GTGTGATG
AGATAAAGATTGTTGGAGAATCAGTTAGTACTTAGTAGCTTCCACTTGTCTTGTGAG ACACGCTTT
TGAAGGTTAGAAAATAAGAACAAGAACGAGAGTGAGAAATGTGATCAATCAAATATT TATCAAA
TGTGAAATAAAATGAAGTGTGTGGGTTAAA
TTCACATTTTAGTACATGGAATATGTGCACTTTTCTTTTTACTCGTGCATGCGGTTA GTTACTTCAG
GTTTTCCAACGAGAGATTTTACCTTTTACTTGCAAGTTGTAAATGTAGTGTAAATTT CTTTTTAAA
AATTGTACCAGCTTTAAATAAAATATCTATAAGATTATAGAAATTATTATAATATTA ACATTTTGG
CATGCCTGTAACCATAGGCTAGCAAAACAACTTATACAACAATGATCTAAAAATTTA AGAATTTA
ATTTGTATATAACGATACTGTAAAACTTTTTTACACGCGCATAAAAATAAATTTTAA AATTTAATT
TGGTTGTTTTATTGTTACATTACATGATTGTCTTCTTCTCATCCTTATTTTAATTAA GAGACAAAAC
TATCATAAAGGAATGAGCAAATAATTTTTATTGCTACCATTTTTATTTTATGCTATC TATAATCCT
CATCTATGTTCACATTGAATTTTTGGAGTTTCAATTTCTAGTTTACTTATTTAAAAT TGCTTAGTTT
TGAAATTTGCTTTGAATATGCAGCACTCTTATCGAAAACAGCTTCGACCAGAGACAA TGTTTCTT
G AAT AATCTACTTGA CGTTTCXTGAGCAAGGGATACTTCAAAG AGAAAGAAAC TTCAAAT
ATATTTTACA^
TGCACTGTCAGATAACTAAAAAATTTAGTTCATACGATAATTATATGTGTAAATTTAAAG AACTC
TTAAATATTCAATTTCTTTTTATAGGTATAAATTTATTTTACAATATATTAACTAGC TCAGTTGAA
GTTTGAGTTAAGAAAACAATAAAGTCAATTAATGATGGATAATCAAGGTTCTCATGA TTTCAATA
ATGATATATGTACTACCTTTCTTCGTTTTAAGTTAATAGTAACGGAGTCTTTTGTTG GCATTATAG
AGTGAACCAGAAAAATTTCTACATAGAAAAAAGTGTGTACAGTAGATGCGAAGTGGT GGCTATG
GAATGGATGGTGCAGGAGGTGCTAGAGTTCGAGTGTTTTCATCCAACCATCTACAAT TTCTTGTG
GTATAAGTTTTTCTGTACATTCTTATAGCATCT
AATTGCATCTTACATATATAGCACCAATGAATAAAAACTTGTAACGCCTTAACAAAG TGTATTTA GATCTTTCATGAATGAAACAAAGTTTTACTCGCCCATTAACCCCTTGTAAAATTTTGCAG TTTCTA CCTTAAAGCTGCTAATGCTGATGCTGTTGTGGAGAAAAGGGTCA GTGCCTTGCATTA TQGCTC
GTCTAGAG T AAT AGAAAGCATC CACAAAGT ATAGGGGTAATTACTC^
CTAAGCCTTTm
GTTACCATTATATGACACGGTATGTTTAGTTGTAAATTCTTTAGGCAAGGGCCACATATG CTTCAG
GGGTTTAAGTTTTATTGGTCCTTTGATATTTTGATTTACCAATGTAGTTTAAGTAAT CAAAATTAA
TCAAAATGAAGTAAAAGGATCATGTTTGTATGGTCTTGATTTGCCCTAAAGTTTGCT AAACACTT
GATGATATATACAGTTGTGTATGCTTTCTTTGAATATTATAAGTTTCACTTTTCCTC ATTAGATCAA
AGGATTTTAATGTTCATAATTAATCGTGTGATTTTGCAGATCCACATTAGATCAAAG GAAGAGAA
TTTGCATGAATGCATGGAGGTATGA
Soybean Glycine max (SEQ ID NO: 33)
>Glyma.02G086500 | Chr02:7532871..7538307 forward
TGAACCTATATTATTATTTTAATATTTAATAAAATAATATTGTACTGCAGTAAATACATT TTTTATT
GAATATTATTAGTAATTAATTAAACTATTAATTGGACTGACCAAGCAGTTCTATTGC TTTTGGTTG
ACCATAATATGTCAAACATCAAAGATAAAAAAACTAAACATATATTAGCATAGTTAG TCAATTAA
AATTAAAGTTTGGATAAAAAATTGAGAGGGGTAAGTTTTTTCTAATTTATAAATTTA ATGATGAC
TTTTATGAATATGACTCCATGTGACATTTCTCATTTTATTGTAGGATTCTTCCTATT TGTAGGTAAT
TTTTGGCACGCATAAAACCTCCGGAAGTTGTCGCAGGATTTGAAAAATGAATTGATT CAGATTTT
GAACTTGTTCTTTCCCTGATTTCTCTCATTTGAGACATGAAATCCTATGAGGAAGTA ATAATCAAT
TTTATTTTATTATAAATATACTTTAAATATTAGGTTGAGGTGAACTAGTTGTGTTGA AATAATACT
TCTTTTTTCCTTTCTTTTCTTAGTTTCTATATAAAAAAGATTCCCTGGAATAAAATT TGATACTTGT
TATGCATTCTTGCCAATTCTAAACGAGTAAATTGTTGATACAACAAAGTTCAAATAC ATCAAATG
TACAATTAATAAAGACACAATATTTATATTGTATTTAAAAGAAACATTTTTAACCAA CAAGTCAT
TTCTTCGTTTTATAAAAGAAAAAAGTAATTAAAAAGAAAATTTTCCCTAAAAATGAT AAACTAAA
TTTCTTAACAAAGATTAATTATTAATAAAATTAATAAATATTTCAGATTAATTTCAT GACTTAAAA
AAAAAAGAAGAAACAACTTCAAACTACACATTTTATCTCTCCATGAATACAAGTATA AATAGAG
AAAATAAAACATCTAATGTTGGTTACTACTTTGAAACCGCATTTTCTACTCTAGCCA TCATTATTT
TTTTTGGGTCATATTTTAAGTGCATTTATTGCAGCAGTACAAAAAATATTGCTCAGG CATTACTGT
TATTTAGTGCAACCATCTTTATTTTATTAGTTTTGTAAAAAGAAAATCTTTATTTTA TTAATATAAA
TATATGAAGAAGATAGTTGTATTTTTTTTTCTTTATTTGAAAATGTGATGTATTTTT CTTTTTTTTAT
AGAAAAGTGCATGAGGAGGCTGAAAAAATCTAAATTAAAAAATACTTGAAAAAAACA CCTGGG
AAGTGAACATGGTGGTTATGCTTATCTTGCATGCGTTATTCTAATCAGTGAAAAACT TGGCAAAA
TGGTCATGAACTAGACATAAACATGTGTTCTGAAATAAATCAGGATACATCGGTTTG GGCTTTCA
CTTTTTCATCTCTTTGCTTATTTTTCCTAATCAAAACAAGTATAAATTGTAAAGTTT TCCTTTACAC
AATATATTTCGTTTAAACATTTTTTTTACTGTGGTACGTATCACTTCAACGATACCT ACCCCTTTTC
ATAATCGTCCTACACCACAAATCTCTTTAAAAACCAAATGTTTACCCAAAATATATT TTTGTTTCA
ACAACTCAATTAATGAAACTAATCACAATTCACAGAACCTCTTGAAACCCTCGAAGC CATACACA
TCATTTCAATTTTCTGTGTCGCATTAATTGATCTCGAGCACAACAACCTTCTTCCAC AATTCGGTT
CTAGCTAGGTCTTGCAAATGCGCAAAGATCATTTACATTTCATAATTTGAGATTAAT ATCACAAA
TCCCAAATTATTTACTCTAAAATATTGTTAATTCCCAGTCAAAAAAATACTTGTAAA TCCAAAATA
CTAGTATTTGTCAACTTAAATAATTAATTAATGAGTTTCCAAAAACAATTTTTACAA TCAACAAGC
GTGGACGTGGTGCACGGAAAACGCATTACGTGGGACCGCACCGTATAAATACCCCTC CAACCTC
GTTTTTTCTTCCTTCCCTCAACTCCCGTAACCGTTCAAGCACACTTCCCACACACTC TCTCTTCATT
CAAT ATAACAACAACAACGATTTTCAATTTTCTC CTGCGTTTC TCGTTAAC GAACTCCAATG GCATCCAGATCGAGAAAATCGAAGCGCAAGCTCGAGCCGGAGCCACATCCGCTCGTCATC ACCA AGAAGCTCCGGCAGAAGCTCCCTC K CGGCGCCOTCAAAACATCTCGCCAGTGCTCCTCOTCGGC
ATCTC GCCX AGAATCCTCGTTTC CGTCGATTCCAQC CGTCTCCGACTT GCCGTAGG GAA GCCTCGTGCAACTCCAGCAGA ^^
TCTCGACCGArrCAACGAGAAATCGGAGATTCGAGAAGCGGAACGAGAACGAAGTTGAGG TGTC
GGAGTCTTCGTGCGTTGACTCTGCTTCGTTCGCGAGCGAACGTAACAGAAGCTTGAT TCTGAAGT
nAAAAGAGAAGATA^GAATCTAA^CGAAAACGACGACGmCGGAAGCGTCK:ACGAAA TCTGA
GATTA TA TGTTCTGAAGTT AAAAGCGGAAQ GAGACTAAGAATGCAAAAGAAGACGA GAC
GrrtGGTGCGCGAAOTCAGAGArrACTTGTA GAGGAACAGrrCAATTCAAACTCAAAGTCCTC GGTAACGGTAACGGAAA ATAAAAGT T TTCGGATTCAAA GCAAACGACTT GTGTCGTTTA
GrrCCGGTGTTCGCGCGTCGTCGTTTCATGAGGAAGCGAACAGAAACAAGGAAAACACrA AAAA
CAGAGOTCGGA^TCTGAATACTCTGAAGrrTCTAGAAGCCTCCACGTGGAAGAGAAT TGCGCTG
ATTTAATAGCGCAATCGATGACGAAGGAGGATTCGGATGTATACGACGTCGTTGCGG ATCTCGCT
TGCTCTGAGGATCTGCGTTTCTCGTACTGCAACGACGACGACGACGACAACGAATCG GAGTACTG
miGAGT AGGGAAC GTGTTAT CGAATTT ATTC GAGCTTTTCGG GAATGCTCGCAGAATG GCmC ^ArrA€TGTCCGTCGTCOT^
TCGGAGAAACTCCTTCGCX'GACG ATTTGTTGTT CTTCAGTACAGCAAGGAGTTCGCAGAGCTA
TTCGATTTCAGAAT^
TGTTTAGCCTAGCTCTTATTATATGATAGTTTGTAATTATTAAATGTTAGTGTAATTTAT TGTATGT
GGATTTAAGTTTGTGAGATTTGAAGATTTGGATGACGAAGACAGCTACCAGATGCTG AGGAAGA
GGGAGAGGAGGCAAGGCTATGTGTTGAATTATGGTGATGGATATTTCTCTACCACTG AATTCGGA
GACACCGTGATTGAGCAACGTGCGCAAATGGTTCACTGGATCATTGAGGTAGGTTTG TCTAAAAC
AAAATCCAACTCATTATATTATTATATATGCTGTTACTCTTGTCGTTTGATTAATTT CACTTTTATA
TAGTTTTTGAGTAAATAAGTACGCCTTAAAAAAAAGAGTAAATAAGTACAGTATATG TTATGAAT
TGTACTAAAATTTAGTTACAATTTATCATATATAATTTATTTCTTCATGATTTTTGA TTGATTAATG
ACCGAGTATAAAAATTAAACAAGTTGATTAAGGAGAGCTCGTGCTTTGATTATTAGT TAGTTGTT
ATTTTTATTTATTTATTGGTGTATAGATAGTCGCTCTTTATTGAGTCAGGTTTTGGA TGGTGAGTG
AGTGAGCGAGAGAGAACAACACGTTTGAAGAGTGAGTGAGAATCAAACTTGATTCGG TTAAAAA
GGTAAACACTTTGTACAACAAGTTGTTGGGAGTAATATTGAATGGCGTCCAATTGTC TCGTGACA
CGTCAGAGTCATTTGGAAGACGAAAGCCTCTACGCTAGTATCGCGGCGGACTTGAGT AAATCCCT
GTATCGTCATGCTTTTTGATGATGCAATCATGCAACGCACTTTTCTCGTATTCGTGC TCGTGCATG
TGGGTAGTTACTTCACAAGGGATGATACATTTTGCTTTTCACTTAGAACCCAAAACA TTGAGGAG
CATTGGAGTAGGGAAAAATTATTTCCTTCTAGTTAGTCGTCCCCCAAAAATTACTTA CATTCAATT
TTTGCGAGTCTAGCTAATGTTACACAAGTATAGATGCTACTGCGGTACATACACATA CACTCACA
CACACACACACACACACATATATATATATATATATATATATATATATATATATATAT ATATATATA
TATATATATATATATATATATATTACACCAGTTTCACCATTATTTTAACATTTCATA TATAGGTAG
GATGCATTCGACATTTTGTATTGGAATGGCAAGGACTAAAGGAGCTTGACAGAAGAT TTAATTGA
ATGGCAGGCCACGGCCTAAAAAAATAAAATAAACGCTACTTCTAATGAATATGCAAT TGTTATGT
TTCTACAAAATAGGAGTATTTCTTTTTTAAAATTTTAATAAAATTATTTCATAAATT AAACAAATG
AATATAATATTATTTATATTTTTTATTATTTGAGCATTGTACATAGTGCGAAAATAT TTCGGAGCA
TCAATCAACAACTCATGCCGTTTCTCTTATTAATTATTAATTTATTATTATTATTAA TGTGTTTTTT
ATTTGAAGAAATTATTAATGTGGTTATTTTATTGTTGTTATCATCTTTCATGATCGA ATAACCATA
AGAACATTCTTTCCTTTTGATGAAGGGAATCCATTTTCTTTTTCCACCTGTTTCAGA CAGCGACAC
TAATCATGAATATGTCATTTTTTATTTTTTGTCCATATAAAGCTTATGTATTTTGCT AACATACCGT
CACCTACCAATTTACATTAAAAAAACTATTGTATGCATGCGAACCTATTTAGATTTG GATAACAA
TGTTTTTAGATTCAGGTATTGAGTCTGTCATTTCTAATTGCTGCATTTGTGGACTTT GAATTGCCAG
TTGAATATTTTTTAAAATGCTTTATGTTTGAAACTTA GGCAAGAGAC C GTTT TTGGAGTCAA CTACTTGAT GTTTCCTAAGCAAAGGATACTT AA
GGCAAAAATGTC
TGTTAGTGACGGTCTTGAATTAAGGAAAATGGAAATAGATTATGATTTAATTTTTCT AGAGTTTG
AGTTTATATATCCTCATTAATCATGCATTGTGCCTTAATTAAAAGAATAACAAATTT TCCCTAATG
GTTCAAACATTATCCAATTTTCACATAAATTTTATTTTCTTACACATGAACTTTCTT GAAGTTTAGG
TAAAAAAAAAAAACCATAAAGTTTGTAAGTTTTAATATCCCATGTGTAAAGTTAGCC AAAGCTGC
TTGTAAAAATTTTTACTTTCTTCGTTTACATATTCTTGATTTTCGTACAACTAGGGA AAGTGAAAA
TTAACAGAACTTATAATTTTTTCTTCCAATTAAACTCATGAAAGCCATATTCAATGA ATAATTGAA
CTCTTTTGTGGGTACTACGTACAGAGTGGGGCAAAAAAATTTCTACATAGGAAGCAA TGTGTACA
GTAGAAGCGAGGTGGTAGCTATGGAATGGGTGGTGCAGGAGGTGCTCAAGTTTCAGT GCTTTCTG
CCTACCATCTACAATTTCTTGTGGTATAACTTTTTATTCTTTCAGCACGAATATGAC CTGAAATTCT GCAAAAATTAAAGGTTAS
TGTTTCTTACTGGGAAACAACTTGAATTTTAAAATGAACGGTTGAAGCAAAGACTTG CTATCATT TCTCTAATAAGGGATTTCTTTTTGCCTTAAAAACTCTGCATAATTTATAAAATTAAAACA AACTTT TGTAATTGTGACTTCACATAGCCTCAATAAGAAAACTCATAAGCCTTTTTGTTTATGAAT TCAGCT AAATAACACTCCACTCTATAATTTTTCAGGTATTACCTAAAAGCAGCTAATCCTCATGCA GTCGTT
GAGAAOA KJTCAAGTATCTGGCAGT K T KJCACTGTCAGGTCATGAGCAACTGTGCTA ^ CTT AACAGTTGCTGCAG ACTTGTAATCCTGG TTGTCTTGAATTCAATCAAATTT AT CCACA
GAGTCAGGGAG TAATTTGGTCATTCCTTGATATTTTAATAAATCATTGTTATTTTAGTACACAAAATCAAT CAAAAT
GGAATACAATAGTATTTTTATGGTCTAGATTGGCACTGGAGTTTCAAAAAGATTGGC ACTGAATG
CAGAGTTGTGCGTGCATTCCTTTTTCCTGAATGCTCAATGTTTCTATTTTATATCTT TATTTTCTTTA
GATTGGATTATCCTGTAATGTCCATATTAGATTGTGAATAGAATGCCGTATAGATAA TTAATCTTG
TGATTTTGCAGATTCACGTTAGATCAAAAGATGAGAATTTGTACGAATGCATAGAGG TATGCTAG
Τ ΤΑΤΑΤΑΤΤΤΑ ΠΧϊΙ ΑΑΤ Γ
AGAAAAAAACTGAAATACCATAATGAAAAAGCGCCTCACACTACAAGTATTATGAAATTA ATCA AATCATTTCATTTATTTTGCTGAGAAAAACCCCACACCACGATTAGGATCGATGAAATAT ATCAT TTTCGTTAATTATCATTCAATTTCTCCTATTATCAGTATTGTTCATGTATTTAACATGAA ACTTATA TTCACTTCAACAGAGCO^AGTGCCT^
G ATTAGTTATA AATTCAGTTTGGTGA GATGGTTCCTAA CATCAGAAGA AGAATGGCTTTC
rrATGCCTGGTTGArrCTTCATTAATACAG
Cucumber Cucumis sativus (SEQ ID NO: 34)
>Cucsa.174110 | scaffold01219:61526..66098 reverse
ATCTTGTACCTAGCCTAAACGATCTTGTACTCTTGTACTTAATCTAAATGACATTTTACC TAGTCT
AAATGATCTATTAGTGATAGTGGTTTATCTATGTCTATGTAGAATAGACAACTGATC GTTTAGATA
TTGGTATACTATCATTTAGATCTTGTACTAAGAAGAAAAAAAAAGAGATGAAGAAGA AGGAAGA
AAATCTAGAAAAGGAAATGAAAAAATCGCAAACAAAAAGAAGTGTGAATATGAGGAA AGAGAA
GTAATAATATGAAGTAGAGAATAATAAATAGCAAAGAAAAAAAAAGTGAAGTGAAGA CACAAT
ATTAAAAAAAGAAATGGCAAATCTGAAATTTATGAAAAAACAAGGAGACTTTATAGA TTTTAAT
TTTTTTTTTGTTAAACGATCCATAAATATTTTTGGGATTTGTTAAACTATCCAAAAA TTGATTGAT
AAATGAATATTAAAGTATCATTTTTAATAATACTAGAAAATTTTAGCATGCATTGCA TGTGAGGA
CCTTGTTAACAATATAATATATATTGTTGCTAAAATGAACAATAGCATGTAGTGGTT AAATGAGT
CATTCTTTATATACAATATTTTTATTCATACATTTTCAACATTCATACATTTTCAAC ATTTGAATTT
TTATAATAATTTCATTTCTCTTGAACTTTCTCTATAACAATCTTAGAAATTATAATG CTAAAAGTG
AATAATAAAAATAAATTTAAGAAAGCCTAAGTAGAAGTGAACGTTTGAATGTTGAAT GTTTGCAC
AAAAATAGTTGAAGGGAGTTGTTTATATTGTTAATAAAAATAGAAAGTTACAAAATA TTTTAAAC
TTATATGAATTATTCTAAAAATTTAATTATTAAAATAAAGAGAGATTTTCATGTTTT AACCTAAGA
AGGGTTGTGAGAATGTAATTAAAAGATAATAAATAATAATTATTTTTAATAATAATA GTAATTAA
ATCATTTTAACTTCGGTGTGTGATAAGTGATGTTTTCTTATTTTATTTTTAATAATT AAATAATTTT
AGCCTTAATATGTCACAAGTAATGTTTTTTATTTGTTTTTAAAAAATACTCATTGCA TTGTGTACC
CGAGTGTTTTTTAAGTTACCAACCTACCCTAAAATATTCAATAACAATATTTTTCAT GTTTAAAAG
ACTTTTTAATCAATAAGAATATAAAATATTAATTACATAAGCTAGAATGAAACAAAA AATATAAA
CAAAAATTAGAGAATTATCATTTGGACAAAATGGTTCAAAATGGTTCAAAAATGATT GTAAATGT
TAGATTGTAAGATAATATAAGATAGATATCTAGATAAGTTGGTACAAAAGTAATACT AAAAATG
CATGTGTTTTGTAATAATTTTTTTTACAAATAAAAACTTTTATATACAACAAATTCA TATCATAAA
ACTAACCAAATAACGTTGATGAAGGTAGATTTGGATAAGGATAAAAGATATTTTTTA TTAAATAT
TTTTATTTTTTAAAATCTTTTTACAATACGAGTAAAATATGAAGATGTAAAGAGAAA AAAAATAT
ATTTTCATCTTGTAAACTAAAAGAAATGAATAGTTTTAAGTGATTTAGTAAAAGCAA GGAGTTGA
TGAAGGTAGAAGAAGAAGATGAATTAGATGAGTTAACAATTTAAGTTGTGAAAGCGA TAGATAA
TTGATATGAGGAGGGTATGTTGGTATGTTATAGCAAAAGATGAGATTAAGTAATTTC ACTTCATC
CCCCAAACTCAAAACTAAAATAAATTATATATTTGAGAAAATAAATATTGATTAATT TTATTTAC
AAAAAGGAAATTAGGGGGTGGGGGTGGGGGTGGGACCCATACCCACTACCCACAACG AAAAGA
AAGCTCACTTCCAATTCTCATTTCTCTTTTCGCTTCTTAACTTCCATAACCGCTCTT TCTTCATCTTC
ATCTTCATCTTCATCTTCCTCTTTCTCTTCCTCCAAACAAACAC'CATGAAATCCAA GAAACGAAGA
CCA CCCCAAACCX^^
CeOCAAACGCCCTCTGATTTTAem
ACCACCTTTTCTTTTGC1 CTTCTT TCTTTCACTGCCO ACAATCCACCTCCACTTCCTTCTTCCC AACCOOAC€TGAGGT€TCTAG€
ACAAGGAGOTTGGAGTAGGGAGTAATGA j€AAGTGTCTGAATCCTCTTGTGTTGAATCTAATTCT GGACTCGATmCGTGTTTCCGGACCAAGCACTACTTCCAAGTTGAAGAATAGGAGAACTAr rCA CC MAATGAAGATCCAATOATO^
AAGGCAGCTGTGGTACTCACTTCTTGTGTAGACTCTTGTGCTGAATCTATCTTTCAGAGT GTTTGT TCGTTCGAAGAGAA^GGATTAGAC j-rrGAAGATAACAGACTATGGGAAATTCAGTTACCTGAGC TACAQAAAAACGAAATTAATAAAACTTTCAC GTTT OAAGTCGGATT OACGATAGAACAQTG
CGGAATACTTAAGC AOCCQTTGT GCm AGTCAACTAm^ATTGGAGATGTCTQATGA TQ T CAC^nACACTCCATCAATTTtCTTGGAATCCGGAAGCGAATrrTCAGAGAAATCGAACGA AGAC
TCTCACATCAGAACTAGCTCGTCTATTGAAGAAGAAGAAGTAGATCAATCTACGGTA ATTCGCTG TTTTCGTGCT
TATATATATGGTTTATTTAATCGATTTTAAATTAGATTTTGAGATTTGAAGAATTGG ACGATGAAG
AAGCCTATCGAATGTTCAGAAATAGAGAAAGACGCCAACTGATTATTTGCGACTACA TAGAGGA
ATATCGGTCCACAACGGATTATGGCGATTTCATTCTTCAGCAACGGTCAAATATGGT CCAATGGA
TAGTTGAAGTAAGTCCTGGATTTCAAACCTCCATGTTTCTCTTAAAAATTCCTGAAT TAGCATAAG
CAATTCCCCCTGTCTTCCATTTTCATCGTTAATAGCTTTGGTATTCTGAGACATTAG AACTGTAGA
GTGTATAGGCACTGTCTATCATATTACAATTTGTACTGAATTGCCAATTTGTTCTTA GCATGTCGT
AAAATGAGTCCCCTGCCTTATTTGATTTGGAACTTTATCCAACAATGTGATTTACTG ATGAAAATT
ACAAAGTCATTACTATGATCATACTTTTTACTATTTAAGGCAAGCAGTTCATGATTC TGCACACAT
ATACACCTAGATGTTACAAGCTTCAGTGCATCTTGAATTAGCCAAGTTCAGCTGATT TTTCTTTTC
ATTTTGTACTTCTACTTAGATACATAATCTGTTTATTTTTAACTTAATAATAGAATA CTGATTCATA
ACAGCGAGATTTGTGCTCATTACTGTGAATGTTAGGATTTTCTTCGAAATACTCCAA CGTAGTTGC
ATTTTCATCATCGGTTCATGGAATACATTCTTTATAATCTCTTTCAATTCCTTTTCA TGCGTTAAGG
CTTGTCATCAATCAGTTGGATCAAACTTTTTTACATTATATAGCTTTAATTTGTTGA ATGATGGCA
GCGATCTAGAGAAAAGAAArrTCATCACGAGACGACATTTTTAGGAGTTACCCTTCr GGACCAGA
TTCTOAGCAAAGGATTCTT AAAGCTGAAACTCACCTTCAAATTC AQQCATAGCATGTCTAACT
TTTGTCTTCATCTCAGTTT
TAGTCTATATTATGTATGATGAATATTGATAGGAAACCAAACTGTATGCCAATTGGT CTTCTTGTT
TCAATCCAAGGGTGTAGAATTGAGTAAAGTTAGGATCAAATGGTAAGTAGTACACTA GAAATAA
TAATCAGAAAAAACTGTCTAAAAGTACTTGAATTCAATAGTCTTGAATGTTTTCCTT GAGCTCAA
AGTGCCGGGACTGAAACTTTTTCCGTTCATGAACAAAATAACGTTGTGTGATTATAT CGTAGATC
CTCTTATAGGAAACTATGTAACAGAAAATAGCCATACATGTTACATTAGTGTCGATG CACACACC
TCCCGTACGGCACTGCAGTCGAATCCTATTGCCTTAACAATATCTTAAGTTCGTAAG TTAACAACT
CGTGCACAGATGATATCCAAGATCCACCAAGAAAACATTATATGGCAAACCACTTCA ATCACTTG
ATCGGGCCATCAGACAATAAAATTCTGATCATATAGAGCTCCAAGTCAAGTCAGATG TAAAACA
ATTGTTTAAAACTGTTCTTCTCTCTCTCTCTCTCTCTTCCTCAAACTTCCTCTTATC TAGTCTTAATT
TATCTTTGACTGGTAATTTTCATGAAAAGTGATAAATAATCATCGTCTGTTTCATTA AATAGAGCT
TTGAGAACTGAAAGTATGATAGTACTTATTTGTTTTTGGGCAATTCAGGTTACAGCA AAGGAATA
TCCATGTAGGGAGCAACACGTACAGAAGATCAAAAGTTGTTGGCATGGAATGGCTCG TTGAAGA
AGTTCTAAAGTTCCATTGTTTCTTGCCAACTGTTTACAATTTCTTGTGGTAAATCTT CCTTTCACTA
ACTTCAC^
CAAAAATAAACAAGTAAGTCTAGAAGAAAATTTGAAGTTTTACAAAAAAAAAAAAAAACA GCAT
AATCTAAGTCCAATTAGATTCCAACACGTAAAGTGCACATATAAATTCCGTACTCAT ACATATAC
TAAAAGGAAGTGCTAGGTTATAGTGTTAGTTTAGATTACATATCAAATTCATAATAG TGAACTTT
CTACTGTTAATCAATATAAATATGAAGGTTTGTTTATTATAAATTTATAACAGTAAT TTATGTATT
TATTTAGATTACTCTTGCATATTTCTTACTTTATCTTGAGGAAGGTTTCCTGTCTTA TAAAAACCCT
TCCATGACCAAAATTTCAACCTTAGACTAGTCCCATTGAATCAATGGAAGGATATAT GTCCATCC
TTCCAAAAGAACAAGAATCATCGATCTTGTTCTTCAAAACGTAGATTTTACCTTTTT TTTCTTTTCT TTTCGAAACAAAATTGAAAGGACATTGAATCCATGATCACATAAACATTAAAATATGCCA TTAAA
GTTGAATTTGTGAGGCAAACATGCAATGAGTTAACCCTTTTTTTTTCTAATTACTAT CTATTTTTAA
TAAGTTATTTCTCTTATACACTTTTTGTAGTTTGAATAAAGGAACTACTACTAATAC CTTGCAATT
TCTTTCGAAATCTACAATAATAGAAATAACATGTATTTAGACATCTTGTTGAACTAA CTCATAAC
ATCGATTGTATTGTGGTTTTGCAGATACACGTCAGAACAGAAAACGATGATCTCCCT GAATGTAT
CGAGGTATTTATAGTCAACTATAAAAAAATCAAT^
ATTTTCAAGGCTTAAAAACACATTTTTAATAGATACAACTTTTTCAAGCATTAAAAAAGG ATCAA TCCAAACAAATCCTTATTTTTTCAGCAAAAAAAAAAGTGAGTAATACAGTTGGAATTTTA ACAGA
GCTTGG AGTG GCT ATTA AAGTTTCTATGATGG A AGCATG A ATTCCT A AGACAGC AA A A AGAA A
TCATCTTGAACA AGCTAGTGAAGCTCAC C
Tomato Solanum lycopersicum (SEQ ID NO: 35)
>Solyc04g008070.1 | SL2.40ch04:1731222..1734904 forward
CAATTATTTTTTATTTTTTTCAAAAAGCTAAAAGTAGGTCATAGTGGCCCATATTATTAA ACAATA
AAGTAGATTTTGCACAAATCTTTTAGTGTTATAAGTTAAGTTAGAGAGAAAATCACC TATTTGATT
GAACATGCTGCCAATTAAAAATTTACCTCTCTTATTATAGTGTGTGTTTTTGGGGGA TGCGCATGC
AGGAAGGAAGTTTGAGTCTGATGTAAATGACTAAAATATCATCATAAGATCTATTTT ATTAACAA
GATTTATCAATATTTTAGCGATTTTTGTGACTTCGCAAGTTACTCGGAATTCTAGGT TATTGAATT
TTCTCATGTTAGCATTATACAATGGTCAAACTAGATCACTTTCATATTTGGGACCTT TTACTTTTTT
TCATCAGGCGGAAGTTAGAATTAATTCTTGTTTATCTACATTTGGCCATGTGGGGAG GATAGAAA
CGTTTGTATTCAGCTACAAAATTTACTTTGTAGTGAGGCGTTTTGTTTTTATCTTAA TGTGTTTTGA
GTTCTGTTTTATTCAAAAGCCTGCATCTAGCACCAGTGGTCTAGTGGTAGAATAGTA CCCTGCCAC
GGTACAGACCCGTGTTCGATTCCCGGTTGGTGCAATTAATATGTTTGCGGGGATAGC TCAGTTGG
GAGAGCGTCTAAATATCTACTTTCAAGCTATCTAAGTGTGAACACCTTCAACACCAC TGAAAAGT
GTAGCATAGTGGTCGTTGGAGTTCATTAATTAGCAGTCGTGTGTTTGATTCTCCCTA ACATCATAT
TTTTTTGGAGAGATTGAAATATTTTTTATTTTAGCTATATTTTAAAAATTACATAAC ATTTTAGATT
CAATATTCATGCTTACGTACAAAATATGATGTTAGAGAGGATCAAACATACGACTGC TAATTAAT
GAACCCCAAAGTTCACTGGGCTACACTTTTCAGTGGTGTTGAGGGTATTCATTTAAA GTTATAAA
CTTATAATATTCAAATCTAATATCGTATATACTGTGTAATTTTTCGATCAAAAGAGT TCGGGTGAA
CCCCTTACCTCACACTTAGATCCGCTCGTCGAAGTAGGAAACATTAGCATTCTTATA CATGAAGT
AATTTGAAGAAAGTGAAATGATTTATGAAGTACTTATTTTTGCATCTAACTTATGGC TTTCAATTA
ATTGAATCGTACTAAATTTTGGATAAGGGTCCATCGATATCTTATATGATTTCTTAT TGAATTTTG
CATAGAGATCCACAGACATCAAAACACATCTTTTGAAATTATTTTTATTTGTTAAGT TTTGAATAT
TACTTTTACTCTTTATATATTTTCAATTAAGAAATAAATATTAATAGAAAGTAATTC GTCAACAAT
AAAAATATTATTCCTTGTATATAAGATTTGTTTGAGCAACATTGTATAATGAACGTG TTATTCATT
GAAGATATTAATTTATACAAGTCAATTAGTTTGGAGTATTTTATTGAAATCAGAAGC AAATTATG
CAAAAACTTGTAATGCTGTGAGCTACAATTCTCACTCTCAAAACGAAAATATCCACA TTTAAATT
AATACTAGTAGATTTATCTTATTCAGAATTAAATAATCGGCTGACTTCTTTTATAAG AAATAAAAT
AATTTAAACTATTTGTATTTTTTAAAATTTTAAAAATATATATATATACACACTCTA TTTTATTTTA
TGTGATATTTTTTTATAAATTTTTCATCAAATTTAAATTATTTTTGAAAAAGAAAAT ATTACCTAG
AGAAAAAAATAAAAGAAAAAGAAAATGTGAAAGAAAAATACACAACAACGTGACATC AACGTG
GTCCCACTCGACCACAGCGTATATAAGCTCTCACACTCCCCATTTTCCTCATTTTCT CTCCGAGCA
AACAAACGCCATTAACGGCTTTCTCTCACTGACGCACACAACTTGAACACACTCAGT TTGAGAAA
ATTCACACGTTCTAAGCAAAGTACAAGCAATGAAQ GAAACiTTA ATGCAGAACiCAGTTCAA C
GGCGGTTCACCAACCGAAGGAAATCCTACCGGCAGTGAAGAGGCAGCTCCGGTCGAA ATTACCT
CGCCGGAAGCGATCACATATAT^^
ACAAGTGAAGTCTCGCGTCAATCGAGCAAAGGTTCTGTGAATAAGGAAGTGAAGAAGCGT GAAA
TOAAGGAGAGGAAmCGGAGAArTACTAGAGCTTAmCAGGAAGAAATTACTTGT KjATCAG
AAGAAGGATTCTGAAGT GAATTATCGGAGTGCTCTTGTGTTGATT GTQTTCTGAAGTTATCGG
AAAAATCATAAAAATTGAAOATCCAGTTGATATCTCACGCGATATTGmCA^AGCGGA ATAGAA
ATGCAAAAGTAATTGAAGGAACTGAGGATOTGAAGTAATrrCGAGATrrCTGAAAGC rrCTGGT AAATCATCCATGAAGATGTCGTTTCATTCAAmrCGTCTTACAGTCGCCTTCGGAGTCAAA ATGTG
GAA^TTTATCAGTT ^TCAATCAAATGTAGTGAAAACAGAGCAGCGGAAGAGGTCGAGTCTGA
AGTTT ACGAGTCTGTCCAGAQOTAGAATTATCTQCTQTAQAACAAGCTCATGAGAAAC GTTG
AAGCAGAATTGGATCTGGAATGTTCTGAAAATrrCTCAATTGTTGATGTCTCTGATG ACTATTCAT
CAGC TATTC GAAC CCAATCGGAAATAmC OOAGAGTTCXmTATAQATATCTC OACTAT
AGTCCGTCGTATTGOTACGACTCCGGAAGCCAGTTCTCTGAGAAATCGAATGCAGAC GCTAGTCC
ATCCACTCCCATAAACTCGTCGGAAGATCAAATTTCTACTGAATTCACTGTAAGCTA TTTATTTAA CTTCCTTTAATCTCT
TTATAGTTCATGAATACTGAGTAATGGAGTGAAACAAATATAGTTGTACCAGAGTAA GAGAGGA
TTTTAATTAGCGGATTGAACTGGTTTTAGATTGAGATAGTAATTATTGAAGTTTTTT AAAAAAATT
ATGTTGTATACTATTAGTTAGCCTTAACATCATCATGATTCTGTTTATTTCCTTAAT TTATGGAGGG
ATTGGAAGATGAAGAGGATGAAGAGAGCTATAGGATGATAAGAAACAGAGAGAGGAG GCAATT
GTATCTACACGACTACGCCGAGGAATACTGTTCCACTACGGACTACGGCGATCTAAT CGTGCAGC
AACGGTTACAGATGGTTCATTGGATTCTTGAGGTTAGTTAGTGATGAACGTGTTTAC TCCCCGCGT
TCCTTTCTAGTTGATCTGAAATGC^
ATCAGCTTTTCGTGCCATATAGATTTACGGCTTATAGTGCATGTGGAAGTTATATTTTTT AACAAC
TGGTAGAAAGACAATAGAATCCACCTTGCACGTGATCCTATAATACTGTCCATATTG CTTGTGTG
TGCTTAAATATTAGTAGTACAATTTTACTGAAATAAAGTATTGGTCCATAGAGTATA ATAATTGA
AGATTAGTTTGATATATATTACTGTAAAATATTAAAATTTTAAATTCATGAGATCTT AGAAAGTTG
TGTAGAATAATGTCGTCAAACATTTTGGCGAGTTGATTACGGACAGGATGCCATGTA AGTTGGAA
AGAAGTAAGAGTATTATAACTCGATACTATTTCCCTTTTATTCATATAGTAACTCAT TCACAGGAT
GCTGTTAGGACCTTGTAATGAAAATTAAAGAATTTGGATCCACTGGCATTTCAAGCG TTTACCAC
AAGATGGCCCTAAAACACCAGGTCCTGGGAGTAGTTCAAGGGATCTAAGATTCCCAT TTTTTTCT
GCAGAGAACTGCAATACAAAAGCAATACAAAGGAACTAAAAATTGTTTCAACGGCCT ATGCTTT
GGACTCTTTACATACACATACTAATGTCACTAGTACTTTGGCACTTTTCCTTGTCTA ATGATGGAC
ATGTTTCTTTTATAGCAAGCCACGAGGAAGGACCTTCAGAAGGAGACGATGTTCCTA AGTGTTAA
TTGCXTGCCTTACTCTGGCAGTCAGGATCQAAGAAAACCAGCCTTT AACAGGTAACATTCCTTG CT Cf GATGTTAAATC
GTCGAAAGGAAGTGCACATAAGCAGAGCCTCAAAATCAATTGTAAAATCTGAGGGAACTG CTCA
GCATTGCAAATTACCTTGCGTTTATTTTTTCGGATGTTACTTGCTGGGAAATACAGT AAGAAATTC
TCAAAGGAAGTTCAAATTGACCAAGCTAAGGACATTACAGTTTAAAAGTCTTGATAT ATACATTC
CTTATCATTAGTAAAGCATACTAGTCTTCTTAAGTCTCATTTGCTGAAAAGTTATTA AGTTGACAA
GTTCACTTCTTTCATTTCAGCATTCGCCAGAAGACATTCTCTGTTGCAGGCACTACA TATAGCTGT
TCTGAAGTGGTGGCCATGGAGTGGCTGGTGCAGGAGGTCCTCAACTTCCAATGCTTT CTTCCCAC
AATCTACAACTTCTTATGGTACAA^
GACTTAGCCTTAACATACACTAACTGAAGAAGTAAAGAGTCATCCATATATTTTATTTGG CGTTT
GTCACAACTGCACAGGTTCTATCTTAAAGCTGCTACAGCTACCGAATATATGGAGAA GACAGCTA
AATACCTGGCAGTCCXA^
CTGCACTGGTGATTCTCGCTTTATCAGCTGCCAATCTTTATGCCTCATGCCATTTGGTCA CCAAGG ΊΑΛΓΙΊΓΊ ΛΛΛ^Λ
ATGTATCGTCAAGGATGAATCCAAGTATATTCTTAAGCACTATTTTATCTAAAATTGTGC TTTGCA TTTGCACATTTTTTATAAGTGTAGAGAGTCATGATCAGTAATGCATTTGAAAACTCTAAT TCACAT GCTTTTTCATCTCTTCATTCACAGACTCATGCCAAAATAGAAGACGAAGATTTACCTGAA TGCATC AAGGTACTATATCCCTACCTAGCAATATTTAGTTCATCTTGTTTCTCTGCTGAAAAACAG GCAGAG TCAGAGTTTTGTATCATCAAAGCTATGATATTAGTAAATGAAGTTACTGCTTTTAGCTTA TCAAAG AAGTTATTTTCTAATGGCATATTCATTCATGACAGAGCTTGGAATGGCTGGTGA
Maize Zea mays (SEQ ID NO: 36)
>GRMZM2G093157 | 9:145760171..145764897 forward
CTCGCCGCTTGACTGTCTCGCTTTCTACAACAACATTATTGCCAAGAACA TTTTTGATGAT
GTACTGCATTCCATAGCAGCATTATTGTCCGTAGCAGATGATGGCAATTCGCTGCTA GCACTAGC
TGAAGCTTCCTTAACTTGAGCGGGAATATCAATCTTAACTGAAGGTCTGATTGCTCG AATTGGAG
CTGCCAGTGATTCCTTTGGGCCTTGGGTTTGCGCATCCTTCATCTAAACTGGTATGG CAACTGTTA
GTGATCCACAAGAGGCTAGCGTTATGTATCATCATTTACTAGGACAGAAATCACAGG AGAAGTTG
AGCTCTTCCTTGGTTGGTATTGTACATCCTTTACCTGAAAAGGGATCACAGATGGAA CTGCTTCCT
TCCTTGATTGATCTTGCATGCCCGTGTGCGCCGTGGAGGGCAGGCCGGAGTCTTGGA CGATGCTG
GCCGTGGCAGCCGTCGACATGGAAGGTGCTGTGTCTGGCCGATCTCTGCTGTTCTTC TTTTAATCG
ATTTTTTGTTTTATTAAATAGGCAAAAGACATTTTAACGTATGCGTCTTTTTTGAAA ATAGCATAT
AAGCGAAGTCTTTTAAAATAATGGTAAATTGGCGAAACTCACTTGCCATAATGGCAA AATTCTAA
ATTTCTCCTCACGCAAGTCTTCGGCGCCAGGTCGCAACCGTCACGTCACCTCGCCGC CTCCCCCCG
CCCGCCCTCCAGAAATAGGGTCAGAGCAGAAAATATCTCTCTTCCCTGGTCGGGTGA TGAGTTAT
GCTGATTCGGTAGCAGAAAGAGCGCGTAGGGTAGAGGTGGAGTGGAGGAGCGAGAGG AGTGGG
CTGCCCGCCGGCCCTGGTGGCGTGGTGCGACGCTGGAGGGAAGGTGTGTGTGAGAGG GAGGAGG
CCGTGGGGAATCGAGCGCAGACAAGGCAGCGCAAAGCGGGCGGAGAAGATGACCTCC CGCTCCC
GCCTGCGTGGGTAGCGTCATGGCGCAGCAACAGAGGCAGATGGGTGGTGTGCGCCGC GTGCGTG
GACGAACGGATGGCATGTGCGCGTAATTAGCGATCGGGCGCGGGGGCGGGGCGGGAA GAGCAC
GGACGGTGGGTGCTGGTGCCGCTGCCTTCGGCCTCACCCAATGCCGGCAGAAGGGGG GTGCGGG
GCTGGCCAGACCATACGGTGGGACTTGTGAGCGACCGGGTTGCCCGCCCCGGAGCCG GTCGGAT
CGAGACGAACGACTGCGACAGCGAGCGATCGCGCATCGCGGGCACCTGGCACGCGTA CGAGCCC
ACCACCACTCGCGCTCGTACCCGCGGTAGGCAGTCCAGTCCACTCCAGTGCGGCTCC CCTGGTCA
GGCTAGGGCCTGATGGACCTGACAGGCTGGCACACACGCGCTGGCGATGTGTGTGCG CCTACTTT
GCTCCTCTGTTTTACCGTACGGCGAGCGGGGCAGCTGGCCCAGCATGGCTTCATTCC CACCTGTTC
AACTGTTTGATGTACCAATTTTTTTATATATCTCTACTTCTCTACTATCTATTAAGG CAATTGTGTA
GACCATCTCTGTGCCCCCCGCCTCCGCGAAACCTCAGCGACATCCACGCCGACTCCG CGCACACG
GAGCTCTGACTCCACGAAACCCCCGCCCACCTGCATCGGATAGCCAACCTCCGCACG CGACCTCC
GCAACCTCAGTAAACTGCTATTGTAGCTCCATGACATTTGCGCCGACGACTCCGCGA CCTCCTCC
ACCACGATGGCTCTACCACACCGTGCGCCTCATCGCAAAAACCTACGCAATGTGTCG TGCGGCTC
CCACCCGCCGTGCCCCTGCCTCCCCTCCGCATACACCATGCGTGTCGTTTCGTATAT ATATTTTAT
ACTATTATTTTACTTCCGTGACAACGCACGGGCACATATCTAGTAGTATTGATTAAA ATGCTGTAA
AAAATACCATAGTTTAAAATACTTTGGTGCCCACGCCGCAGCC
TGCCAGTGrGACACGGAT€ACGCAATG€CTCC€ACCATGCTCGCGCCGGTG CC€ACCACCCCG€G CTCCAACCCCTTCCGCCGGCGCAGAGGAG TG TCCGCTGCI^CTCGATCAGACTTCGGCGAAGC
GGCCCGCTGAGT GTCCACCTCAGC T ATCCTGCTTCTACAGTGAGGTGAT T CAACTCCTCCA CATcecTeGtrc OTATO^
GGCG GGCCGGCTGGCTCCGAGTG T GGAGGTGAT GGCGGCGCGAGGGTGCG CC GCCGAG GTCGAGGTCTCCGAATCGTCCT CCTGTCTCCGT K TCGAGTCCOACCTCGCCTGCCCGAAGCA
G T GCCGACGACG TGAGGCGATCGAGAAATC T CGCGTG GATQAGCTGACC CGTCGTCG
GAGCCCGATGAGGAGGAGGTGCTCAGTGATCCCAGCCACTCGGGGTACTCCCCCAGT CCCCTGAT
CAGCTCCCCARRGACCGAAGATGACAGCGACGACGCGCCCTCTGCGACCTTCTCCCT CTTTCTCG
ACTTCGCCAAGCAGTTCGTCCCCW^
ATCT€CTGA€GGTGAGCAGTTCCT
TTG TCAGTGCGGTATTTGGTCTGGTCAATTGGTTGTCTAATGTTTTGGTGGGAATGCTTGTGT CA
GGGGAGGCGGTTTGAGGACTTGGACGACGAGGAGAGCTACGAGCGCTTCCGGCGGCG CGAGCGA CGCGAGGCAGTTGCGCGCGACTTCACTGAGGTGTGCAGCTCCACCTCCATACCCGACAGC TACCG CCCTCTCGTCGTGGAGCAACGTGTCATCATGGTGAACTGGATCATCCAGGTCAGTGAGTC TGTGT CAGACTGTCAGTGCAC
CCAACAATTTACCTCAGATTATGCATGGAATGTGACCGAATTTATACGGTGTAGGGG CTGTACAG
GCTTATGCGAGTGAGTTCAGATTTGCATTCTGCCGGCGTTGCACCAGTAGCATACGA CTCTAGCA
CCATGGCTGCAAATTAGTAGATTTCGCAATAGCTATGTTATGCCAATGAATGTTGTC TCGTGTGA
ATGCCTTCGTGCTGAGGCAGCAGATTAGTTCTCTTTCTTATTTTGCACTGGGGTAGA TATCTACTA
CTGAACATTTTGTTGTTAACACCAGTTGATTAGTAGATTTCACAATAGCTATGTTAT GCTGATGGA TGTCGTAGTGTATCTTTTTGTTTGTCAGTAGTCATGCTAGTTATTGTATACTTTGATCAC TGGTTTT
GGCAGCCAACAGAGTTAGGAGTATGTTTCAATAGCAAGTACTCATGCTTTTTTTGGA AATGGAAA
CATTGTTTCGCCCTTTTGCATTTGCATGCATACAACCTTATAACTCAATTATTACAT CAACCTGCA
ACAATTTGTAGTTCAAACAACCTTCAACCAAATAATGATATGGAGTAAAATAAAACA CGAGCAC
TAAGCCTCTTTAATTCTTGCATGGAACCGCCACCTGTGACTCGCAAAGAACTCCATG GCCACATC
CTCCAAAGACTGGCACACGACGAGGATTTGCTGCTGTGTTTCTTCTTTCTGCAATAG TCTCCAAAA
TCTGAACCAGTGTGTGTCCCTGAAGATAGTCTGTATAATTGACGGTATAGATTTATG ATTAAACA
CCACATCATTACGGCAAAGCCAAATCGACCAAAACATCGCCGCAACGCCAGTAAGAA GCAAGTT
TTTATGCGTGCAACCTTTGTTGGATTTCCAATCCCCTATAATATGATTAATATTAAT GGTACATGG
CCTGTTTAATAAGCAAGGGAGAAGCAGCTGGTTCAGTTACTCAACAATGTTTCTCCC AATTTCTTG
TTTCGCATCTGAACCCCTATCTCATCGGCACAGTGCTGGTATGTCTGGCTGGAATCA TATCTTGTA
GCAATCAGGTGCTTGAATATTCAGTATCTAAATATGCAAGTTTCTATCTGTAACTCT GTATATACC
CTTCATTTCATATTTATTCCCAATTTGAGCTTTCTGATGTGCTTGGTATTTTTATGA GATTTAAGAG
AACTCCTGAAAACACCATCATCACCATTTTTCCATCTGAAGGTTTGAATTGTGATTA AGCACAAC
AGTTATATTTCCCCTCGTACTCTGCTACAATGATCTCACCACTCAAAATCACGTGAT GCAAATTTG
AAATTTATGTGTATTCATTTTTTTATAAATTTGTTAAAAAAATTAGAGTTCAGTTGC AAGGAATGA
GAGTGATGGCTAGACCATCTAGTTCCATTATATGTTCAATTTCAGTAAAACTACTGATAT AAGTTG
GTGATTCCATGGTGTCATATTGCCTAATTAGATATCGACGGGATTAATATTCAGCAG CAACTGGT
GCCTAATCAGTAGCATCTGAGTCTGTGTGAGCTCCTCCTTTAATTTATGTTGGTTCC ATAAGCTAT
ATTTTTATCCATTTGCATCACTAAAGCTGCAATATGCCTTGGGTCTTTGACAACCTT TAGCGGGGC
AAATGAGATGGTTTTGATTTAGTAAAACTATTTTACCATTTAATCATATTATGAATA TGAAACATA
TCTGCATGTGGCAATGCTTTCCATGGTATTTCCATTTGTAATCTTTTTTTGAGCAAA ACCATCTGTC
ATTTGTTCCTTTCAATACTTAGTATCTGTGCAATTTGCGTTTAGAAGTGTTCACAAG GTTAACATT
TCAGAAGTTTATTTTTTCCAGGACATAAATTTGGGTTTCCTGATTGTGCTGTTATCT ATGATAAAG
GCATTTGACCCTACTAGTTAGCATTGTTTTAGTTGACTTGATGCTTTTATCTATTTG ATTTGATATA
TATTACTACAATTCACATTTGGAAGACATGTAAGAGAAGTATATTTAGCTGAAACTG CACTGGAG
CATGACCTTTGTTCTTCAGATAATTTTTTCTTTTCATATTCCTTTTCCTTGTTTCTG TTAAGCTCAAT
GTACAACATTAATTTCACTGCTTGATCCCTTTCAGCGTTCTTCGAAAGACTTTTCAA GTTGGGATC
AATATCTACAGCCAGAGTGAGGTTGTTGCCATGGAGTGGCTGGTTCAGGAGGTCCTC AACTTCAA
GTGTTTTGTCACAACAACCCATCATTTCCTATGGTACCACGAACTTC
CTGAACAAAAGTAATGAGAGACTAACACCATTTTGTTTCAATGTTGCAGGTTCTATC TGAACCCT GCAAATCHTXGATGAC&GG^
TGAGCAG€T€T€ m:TGGCX CTCGAC GTGG AGCTGCAGTGGTAGTT€TTGCTTG CTTGCCA iiiiiilBiie
AATTTTTGTT ACATTTGTTATTGTCA
CTTCCATCAAAACAGATACCATACTCCAAATTTACATGCTCAGATTCTAGCTAGACT GGAAACGC
CTATTGAGTAGCTCTTTACATATTTGTAGACTCACATCAGGACGCAGGATGATGATC TACCAGAA
TGCCTAATGGTACACATTCTCTTATTTTTCTCTTCTTTTTTGGGAATACACTGGTGG GCATGAATCA
TGATTCATGCATGCTACAGTTTGCAAGGCTGTTAACTTTACATTTAGGAGGTGTTTG AATGCACTA
GAGCTAATATTTAGTGGCTAAGATTAGTACTAGCAAATTTTTAGCCAACCAACTATT AGCTCTAG
TGCATTCAAACACTCCTTTATTCTCCTACACAATCT^
AGTACQTCTCGTGATAC CAGAGC CCAQGTGATAGCAQTGTTTTCA TTTTTTCTGTATGGGGA
CGTGAAATCTTAGCATTGACAAATAGTCTGCCTGTAGTGTAGATAAGATAGCCATCC GGCATGAA
ACGTAGCTTGTGGATTrTGAmTGCAGCrrTCTGATTAGGAG ACGACAAGGACGAGGAATTO
GTATTGAGCTTGGCCTTTAGGAATAACTGAACTTCTGTATCGGGGGATGTCTATCTT TACATCGGT
TAGTCGCTCTmtAGAAGGAC KK TAAGGCTGGGCGTTGTTGTACTCGnOATCTATTTGTTTAA
CCAATGTATTG TGATGGATGATATACCA TGAAATCTGTTGTTCTGGTGTGACAAGCGGC
Hall's panicgrass Panicum hallii (SEQ ID NO: 37)
>Pahal.B00065 | Chr09:65019319..65021431 forward
CCTATACATGTTTGGTGAAATGCCTCTTTGGGAAGGGGAGGGTGGCAGAGGCGCTTGGCG TGCTG
GATAGGATGGCAGGTAGAGGGGTGACGCCAAACCGGGTTTTTGTGCAGACACTCCTC GAAGGTG
TCTGCACGGAGCAGAGGGTGGCCGATACATATAATGTGGTCGAGCGTGTGGTTGGTG ATCGGGG
CATGTCGAGTGAGCAGTGCTACAATGTTCTACTTATTTGCTTGTGGAGGGTTGGCAT GACAGCTG
AAGCTGAAGGATTGGCGCAGAGGATGATGAAGAAAGGGGTGCAGTTGTCCCCGCTTG CTGGCAG
TTCGATGGTGAGGGAGCTCTGTGTAAGGAAGAGGTCGTTGGATGCTTACCACTGGTT GGGAATGA
TGGAGGAGAACGGTGTGCTGTGTGACTCCAATGTGTATGGAACTCTGTTGCTTGGTC TGTGTGAG
GAAGGGCATCTCCATGAGGCATCAGCATTGGGGAGGAAGGTTGTCGAGAGAGAGATC CACATAG
AAGCATCTTGTGCTGAACGTTTAGTGGAGTTACTGAAGCAATATGGTGATGAGGAGC TAGCATCT
CATTTATTAGGATTGAAACAGTGCCCTGGAGGGTTGTCATTTTAAGCAATGCGCGAT TCTGCACA
ACCCTCGTGCATGAAGCACGTCGTGGTTAGTCATGGGGTGTGCCAAGAATAGTGCTT CACCGCTT
TGTTGGGAATTTGCCTGAGAACTGATTTAGCCAAATGGCTTAGTGCAGTCAAAAGTT TACTGTTG
TTGAATAAAGCATGGAACAGAATTCAACCGAAGTGCCACTGAACTACTTGCTTCTTT TGTATAAA
TTTGCTGAAGAACATGATGCAGATCCAGAAGACACTTGGCGTCATGTAAACTACCAT TTTGATCA
CTTCTCAGGTACATCACCTTGTCTCCCAGGCTGATGACATGCTTGGACAAGTGCCGT GCCTGTCAG
TCGAACATTTTAGATATGTTTCATGTGCTGTAATCCTAGGAAGTTATGTACAACGGT GCTGAAGTC
ATTTTACATGATACGTGCCCATAAGCACCTACTCTGACATGCTGTAACGTTTTCGAG TTACTCTCA
GTTTTTGTTGTCCCCTCATCTGAAGGAACTGAAAAGAGAATTTACTTTCTCATTTTC TTCCAATTTG
TTTGTATTCAACCTGCACCTGCAAACAAGGTTTGCCCACATTGCTTTTTAGGAACAT TTAGTTGAA
AATTTTGGTGTCCGTCAAATCTGACATTCTGCTCTTGTCGGTGTGAAAGAAATCCAA CTAAGAAG
GACAAGCAAACAAAACCGCGGTCAAATCTGACATTGCATTTGCAGGTGGGTGGGCGC TGGAGGC
AGCGGTCGAGTGAGATTGTTTTCACATAACCCTAATGCAGACTGCAGACACTAGCAT TCTTCAAG
TTCAGGAATCAGGGACCATTCTGATTTGCAACCGAAATCTGACTAGTTGCTGGGATT TGCTGCTG
GGACCGCAGTGAGCCATTGAACTCTGAAAATGGAGTTCAGGAGAACTTCGACAGCAG CTGAGAG
AAAAGTCGCGTACCTCTTGCCACCCCGAATCAAGCAGCAGATCACACATCGCAGCAA AGTAAAT
CACGGCATGACAGTGACAGTCCGAGACAACTGGCGTTTGCTCAGTCTGCAACAGCCC CGGACATT
CCCAACGGAGGCTGACACGGCCGTTGTTCTGGCAATCGCAAGTCGCCGGCACGCTGT CAATCTAC
TCTGGCTGCAGGTGGGACCAGTGAAGCACACCCGTCCATCACCGTTCAGGATTTAAA TTCGAATT
GCTTTTCGGGCTGGGCGTTCATCGTTGATCTCCCCTTCCCCTTCCCCAAGTCTCAGT GGTCTCCAC
ACAGGCAGCGGCAGGTCGGAGCTATATAATCAAGGCAAACACGGCAACATCTAGCCG TAGCAAG
TTCCACGC
GATCTGTTAAGTTGGTAATTGTTGTGAATTGTGATGGGAATGCTTGTGTCAGGGGAG GCGGTTTG
AGGACTTGGACGATGAGGAGAGCTACGAGCGGTTCCGGCGGCGCGAGCGGCGCGAGG CGGTTGC
GCGCGACTACACTGAGGTGTACGGCTCCATGCCCGGCAGCGACGGCCTTCTGGTCGT GGAGCAAC
GTGTCGTCATGGTGAACTGGATCATCGAGGTCAGTGTATACTACACTCTGCTGTGCG CGTACGGT
GCGATCAACAGTACACCT^
ATACAGGCTTATGACTGCATTGTATTGGTGGCATACGACTCTAGATCCTCTGTTGGTTAA TTGGTT
TGTTGTCAGACTCAATGGTCTAGAATTTGTTTCCAGTGTTCAGCAAGCACCGTAACT GCATAATTG
CGTAAGGAGCTGTTCTCTGGTGTGAACGTTTTTTTAATTAATGATTATTGTGCTGAG GCAGCACAT
CTGGTTCCCTTTCGTATTTTGTGCTGGCGAAATTATCTACTATCTAAAAGTTTGTTA GTTTAGCACT
AGTTGATGAGTGGATTTGAAAATGGCGACACTATTGAGATATCAGGGTTCAGTGGTG TCCTTGTA GCTTTTTATCAGTGAGTAGTCATGATTGTACTGACGCAGTTGATCACTCATTTTTGCAAC CAAATC
GTCCTGGTCCCAGAGATCAGCTATGTCTAACATGGGCTGTTTGAAGAGGAAGAGAGA AACAACT
GGTTGAGTTGCACAAAAAAATTCTCTGCAGTTCCCATTTGGCATCTGGAACGCCATT TCATTGGC
ATATTGCTTCCATGTCTGGGATTACATATTGTAGCAATTAGGATAGCTGAACCTACG CTCTCTAAA
TGCAATTGTCTATCTGTAACTCTGAATATGCCCTTTATTGCATATGCGTCCCCACAA ATTTGAACA
TTTTTTATGCATTTGGTATTTGTTTGAGATTCGGAGAACTCCTGAAAACATTGCCAT CACCATTTC
CCATCTGAAGGTTTCTTGAAATTAATCATTACAGATGTTTTCCATGCACTCTACTAC TGTGTCACT
ACTCAAAAACATGACATGAACATTTCATGCTCCTTCATTTCTTAGTTTGTTCCAAAA TTGAAGTTC
AGTTGTGAAGAATGACTCTTTCCCTTGTAATGGCAGCATTCGCATCTCATCAAGTTG CAGCCAGT
GACCGTGTTCATGGG^^
GAAATCTGCAGTO€TCiGGTATTG CTGCATCAC:CCTGGC AC€ GCATAGAAGAGAA CAGC:CG
TACAATTGGTAA
Foxtail millet Setaria italica (SEQ ID NO: 38)
>Seita.9G484600 | scaffold_9:52452228..52456950 forward
CGCGTCGCCCCTCCTCCTCCGCCGCCGCCTCTCCACCTGCCCGCCCCACCGCGACCACCC CAAACT
CGCCGCGCTGCTGGACGTCCTCACGTCGACGTCGACGTCCCCCACGCCGCTCCCACA CGCGCTCT
CCCGCGCCTTCCCGTCCCCCTCCGACGCCTTCCCTCTCCGCACGCTGCCCCGCCTCC TCCCGCTGC
TCCCCTCCCCGCTTCTCTCCCTTCGTTTCCTCCTATGGCGCCTGACCCCCTCCTCGC CGCTCCCCTC
CCCGCATGCTCTCTCCTCACTCGCCACCTCTCTCCCCGACCTCTCCTCCTCCGTACC GCTCCTCCTC
TCCTCCTCCGCACAGCCCCTCCCACTCCCGCACTACGCCCTCCTACTCAACATCTCC GCGCACGCC
GGCCTCTTCCCCGCCTCCCTCGCCGCCCTGCGCCACATGCGGTCCTTCGGCCTCGTC CCCGACGCC
GCCTTCTTCCACTACGCCCTCCGCGCGGCGGGCTCTGCCTCCGATGTCTCCGCCGTG CTTGAGATC
ATGGCCGGGTCCGGCGCCTCTCCGACCGTGCCGGTGATCGTGACCGCGGTGCATAAG CAGGCGTC
CGCTGGGAACTTTGAGAGCGCCCGCCGGCTGATCGATAAAATGCCGGAGTTCGGGTG CGTGCCC
AATGCTGTGGTTTACACCGCATTGCTCGATGGGATGTGCAGTTTAGGGAACGTGGAT GGCGCGCT
GAGGTTGATCGAGGAGATGGAGAGCAGCGGTTTGGATGCAAATTGTGCACCCAACGT GGTGACC
TATACATGTTTGGTGAAATGCCTCTGTGGGAATGGGAGGGTGGCGGAGGCGCTTGGC GTGCTGGA
TAGGATGGCAGAGAGAGGGGTGATGCCAAACCAGGTTTTTGTGCGGACACTGGTCGA AGGGGTT
TGCACAGAGCGGAGGGTGGCTGACGCATATGATGTGGTCGAGCGTGTGATCGGTGAT GGGGGCG
TGTCGAGCGGGCAGTGCTACAATGTTTTACTCATTTGCTTGTGGAGGGTTGACATGA CACCTGAA
GCTGAAGGACTGGCGCAGAGGATGATGAAGAAAGGGGTGCAGTTGACCCCGCTTGCT GGCAGTT
CAATGGTGAGGGAGCTCTGTGTGAGGAAGAGGTCGCTGGATGCTTGCCACTGGTTGA GAATGAT
GGAGGAGAGTGGCGTGCTGTGTGACTCTGACGTGTACGGAACTCTGTTGCTTGGTCT GTGTGAGG
AAGGGCATGTCCATGAGGCATCAGCATTGGGGAGGAAGGTTGTGGAGAGGGACATCC ACGTAGA
AGCATCTTGTACTGAACGTTTAGTGGAGTTGCTGAAGCAATATGGTGATGAGGAGCT AGCATCTC
ATTTATTAGGATTGAAACAGTGCGCTGGAGGGTTGTCATTGTCATTGTAAGCAATGT GCATTCTTC
CCAACCCTCATGCGTGAGAACGCCAAGAACAGTGCTTCACAGTCTTGTTGGGAATTT GCCTGAGA
ACAGCTTCAACAAATTGGATTGGTGCAGTCATGATGCTACGTTTAGACTGTTGCTTG ATACAGCA
TGGAACAGAATTCAAACAAAGTGCTGCTGAATTACTTGCTTCTTTTGAATGAAATTG CTGAAGAA
CATCATGTAGATCCAGAGGACGCTTGGCGTCTTGTAAACTACCATTTTGATCACTTC TCAGGTACA
TCACCTTGTCTCCCAGGCTGATGACATGCTTGGGCAAGTGCTGTGCCTGTCAGTCAA ACATGTTA
GATCTGTTTTATGAACTTGCAACCCTAGGAAGTTTTGTACAATGGTGCTAAAGTTAT TTTACATGA
TACGTGCTCAAGCACCTACTCTGGTATGTTGTAACTTTTTCGAATTACTCTCAGTTT TTGCTGTCCC
CTTTTCTGAAGGAACAGAAAAGAGAGCTTGCTTTATCAATTTCTTGCAACCTGTTTG TATTATTAA
GCACAGCCTGCACATAAATTTTGCCCACATTGCTTTTCAGGAACATTTAGTTGAAGT GAGACTGA
GTACTGCAGAGGCTGCATTCACTAGCAGTATTCAAGTT AGGAC ATTCTGACTTGCAAGTTGCA
ACCGGAATATGAC€TGTTGrTGGGATTTTGCTGArGGGAAGACAGTGAGG€AT TAAACTCTGAAG
AAAATTGG GrTTCAGGAGAAC TTGACATCTGCTGAGATAAAAAGTCACCTA CTCTTGCCACC
AACCAAGCTGATCACAGCAAAGTGAATCACGGCGTGTCAGTGACAGTCTGAGACAAC CTGGCGT
TTCCTCAGAC GCAACAGCTGCGGA ATTCX!CAACCCAAGCTGACA GGCCGTTGTTCTGG AAT CCC ACG CGGCC ACG CGGG AC OGGCCTC G C AC A CCC CGCTGTG A ATCT A CTCTG G CTGC A G GTG GGG ACC AGTG A AGTT A CCTGTCC AC A CG G CTC AGG ATTT A A ATTCG A A G AGCTTTTCGGGC CG
GGCAM:ATCGTTGGTCTCTCC TO
GACCTCCTCACAGACAGCAGCAGCAGCAGCAGCTGGTCGGAGCTACACCCCGCAGGCACG CGCA CGCACGGATCACGCAATGCRR€CCAC€ATGCTCG€G€CCGTG€CCACGAG GC€CCCCTC€AA€CC CTACCGCCGGCGGAGAGGGGCGGCTCCGCTGCTCCTCGATCAGGCCGCGACTGCGGCAGC GGCG
GGGAAGCC J€CCGCTGAGTCGTCCACCTCGGCCTCCTCCTGCTTCTACAGCGAGGTGATCTCCGC CTCCTCCACCTCTCTCGCCGCGTATCAACG CCG JAGA GAGGTCTCGCCGCCAGGACGAGGACG AGGCGCGCCCGGCCGGCTCCGAGTGCTCGGTGGTGATCGGCGGCGCGAGGGCGCTCCCCG CCGA ΚΛΚΛ I i. UAGG L- ) i. Ι υΛ ί i . G t . G t G i. t i. UU . t . UU ) Gi. ) i. GAG t L . G ALA.. i t, G<, i. t L&A L-GGAGL
AGCTCGCCGACGACGCCGAGGCGACCGAGTACTCCTCGGCGTACGAGGAG€TGAC :CCCGTCGGA
GCCCGATC^GGAGGACXJAGGTOTCAGCGG^
TGATCAGCTC€CC€TTGACCGACAACGACGA€GACACTACCGCG€CCTCCGC AA€CTT€T€CCTC TTCCTCGACTTCGCCAAGCAGTTCATCCC TG GTGCACCCCGAAGCGCG GCCGTCAACAATGC
TTTGGCTGCATTGATC
GTGTCAGGGGAGGCGGTTTGAGGACTTGGACGACGAGGAGAGCTACGAGCGGTTCCG GCGGCGC
GAGCGGCGCGAGGCGGTTGCACGCGACTACACTGAGGTGTACGGCTCCATGCCCGGC AGCGACG GCCCTCTCGTCGTGGAGCAACGTGTCGTCATGGTGAACTGGA
ACTCACTCTGTT
CAGATTTGTTGCCCTGGTAGGAGCTACACAGGCTTATGCAGTAATGTCTGCATTGTA CTGGTGGC
ATACAACTCTAGGTCCTTTGGTGGTTTGTTGTCAGATTCAATGGTCTAGAATTTGTT TACCAGTGT
TCAGCAAGCACCGCAACTGCATAATTGCATAAGCTGTTCTCTGGTGTGAATACTTTT TTTAGTAAT
GATTATTGTGCTCAAGCAGCATGTCTGATTCCCTTTCGTATTTTGTACTGGGGAAAC TATCTGCTA
TCTGAAAGTTTGTTAGTTTAACACTAGTTGATGGGTGGATTTGAAAATGGCAATGCT ATTGACAT
ATCAAGGTTCAGGGGTGTCCTTGTGTAGCTTTCTGTCAGTGAGTAGTCATGCTTGTA CTGGCTCAG
TTTGGTCACTCATTTTGAGGACCAAACCATCCTGGTCCCAGAGATCAGCTATCTCTA ACATGGGCT
GTTTGAAAAGGAAGAGAGAAACAGCGGATTCAGTTGAATAAAATGTTTCTCCGCAGT CCCCATTT
GACATCTGAAACATGATTTCATTGGAATGTTGCTTCCATGTCTGGGATTACATAGTG TAGCAATTA
GGATTCCTGAATCTTCGCTCTCTAAATGCTATTGTGTCTATCTGTAACTCTGAATAT GCCCTTTATT
GCATATGCATCCCCGCAAATTTAAGCTTTTCGATGCACTTTCTATTCGTATGAGATT CAGATAACT
CCTGAAAATATTGTTATCACCATTTTCTATCAGAAGGTTTCTTGGAATTAAGCATTT CATTGATGT
TTTCCTTGCATTGTACTATAATTGTGTCACTACTCAAAAGCATGGCATGCACAATTT ATGCTCCTT
CATTGTTCCAAAATAAAGAGTTCAGTTGCAAAGAATGACTTTCCCTTGCAATGCCAG CATTCGTA
TCTGACGAAGCTGCAGCCAGTGACCGTTTTCATGGGGATTGGACTGATGGACCGCTT CTTGACAC
CGCATAGAAGAGAACCAACCGTACAATTGGTAATGTTCTCCCTTGTTATGTCTGCTGTAA GAGAT
rCTG I ; rC
ACTACTGACTTAAGTCACACCAAATTAGCTCCTTCTTTTAATCATGCATTGATCCTGCAT AGTCCC
TCAGATGATAGAATATATGCTGCAAGGTCATAACTATGTTTCTTTTCCCAGTTGCAT CCCTACCTC
GCTGAAATACGCCTTGGGTGTTTGAAAAGCTTTAGGAGGCAATGAGATGGTTCGGTT CAGAAGA
GCTATTTCACTGTTTAATCATGTTATGAATCTGAATCATATTAGCATTTGACGGTGG TTTTCACAT
TCTCATCTGTCATTTGTCCCTTTTGATACTAGCACTCTGTGCAGCTTGCATTTAGAA GTGTTCACA
GGGTGATCATTTCAGAAGTTCCTTAGTTTCCTGATTGGACTGTGCTGCTGTTACGTG TTAGTGATA
GTAAAAGAATTGACGATGCTGGTTGCCATTTTTGGTTGATTGAATCATATATTTTGA TATGTGACA
CGCATCGCTGCACTTTGCATTGCAAGACAACTAGACGTATCTTTAGCTGAAATTGCA CTGAAGTG
TATCTATGAGTTTTGTCTCCTGATCATAACTTGTTTTCGATTATTTATTTGTGCTAA GCTTGATGTG
CAACATTATCTCATTGCTTGATTCCTTTCAGCGTCCTTCAAAAGACCTTCAAAGTTG GGATCAATA
CTTACAGCCGGAGTGAGGTTGTTGCCATGGAGTGGCTGGTTCAGGAGGTCCTGAACT TCAAGTGT
TTCGTCACAACAACTCACCATTTCCTCTGGTACCACAAACTTCCTGTCTTATCTGTA TCAGCTGAG
CATAAGGACAGGCm
GATGACAGGGTAGCGGACCTGGCAAAATACCrGTCCrTGCTCTCACTTCTCAACCATAAG CAGCT CTCCTTCTGGCCCTCA^^ AGTCCT ATGCCATTTAGTCATGGAGGTGAACACCTCGGTCCTCCATTTCTAGCTATAAATTGTCA TTACACTTGCTATTGT
TCCTGTCAGTCAGCTTCTAAACATGCACCAACTCCAATTTTACATGTTCTGATTCTA GCTAGACTG
GAAATGCCTAACGAGTAGCTCTTTGCATGTATGTAGACTCACATGAGGACGCAGGAT GACGATCT
GCCTGAATGCCTAATGGTACATTTTTCTCCTCTGATTTTTTATAAGTTACTGGG^
TGAACCATGCTCTTACA
ACCCCGTGCAATCTTCTTTCATGCAGl§«
GACTCCCACH3TGACGAAATTGATC
GGTCACATAGACATCACCATGTGTAGGCCATACGTGAATCTTAGCATTAACAGATTATTC TGTAC
AroCAlTAGmTCCCTGTAAGGTAGATATAAGATAAGCCAAGGCACjCATAAAACGTA GCCTGT GATTATACGACTTT TGGC AGQAGCAAGGCAAGGAT GAGAGTTTCiGTATTGAG TGTCGGCCT liiiiiiiiiiiiiie
GACAA TAAQGGCTGGGCTCTGTTATACTC TATTCAAC:CGATATATTTGTTTAAACGGT
Sorghum Sorghum bicolor (SEQ ID NO: 39)
>Sobic.001G450400 | Chr01:72724690..72728841 forward
TGCAGTTTTGGGAACGTGGATGCGGCGTTGAGGCTGATGGAGGCGATGGAGGGCAGCGAG TTTG
GTGCAAACTGTGCACCCACCGTGGTGACCTATACGTGTTTGGTGAAATGCCTCTGTG GGAAGGGG
AGGGTGGCCGAGGCTCTTGCTGTGCTGGATAGGATGGCAGAGAGAGGGGTGATGCCA AACCGTG
TTTTTATGCGGACGCTGGTCGAAGGATTTTGCACTGAGCAGAGGGTTGTCGAGGCAT ATGATGTG
GTGGAGCGTGTAATTGGTGATGGGAGTGTTTCAAGTACACAGTGCTACAATGTTCTA CTCGTTTC
CTTGTGGAAAGTTGGCATGGAAGAAGAAGCTGAAGGACTGGCACAGAGGATGATGAA GAAAGG
GGTGCAGCTGACCCCACTCGCTGGCAGTTCTATGGTGAGGGAGCTGTGTGGAAGGAA GAGGTCG
TTGGATGCTTGCTACTGGCTGGGATTGATGGAGGAGAACGGGGTGTTGTGTGACTCT GATGTGTA
TGGTAGCTTGTTGCTTGGGCTGTGTGAGGACGGCCACATTCATGAGGCATCAACATT GGGAAGGA
AGGTTGTCGATAGGGGGATCCTCATAGAAGTATCTTGTGCTGACCGTTTAGTGAAGT TGCTGAAG
CAATATGGTGATGAGGAGCTTGCATCACATATATTGAGATTGAGAAGGCGCTCTGAA GGGTTGTC
ATTTTAAGCAATTTGCGATTCTGCTCCATCCTTGTGGATGAAGAACATCTTGATTAG TCATGGGAT
GTGCCAAGAATAGTGTTTCACCACCTTGTTCGGAATTTGCTCGTGAACTGATTTAGC AAAATGGC
TTAGGCCTTGTTTAGTTTCCAAAAAGTTTCAAGATTCCCCGTCACATCGAATCTTGT AACACATGC
ATGAAATATTAAATGTAGACAAAAACAAATACTAATTACACAGTTTATCTGTAATTC GCGAAATG
AATCTTTTGAGTCTAGTTAGTCTATGATTAGACAATATTTGTCACAAACAAACGAAA GTGCTACA
GTAGCAAAAACCAAATTTTTTCCCAAACTGAACAAGGCCTTAGTGCAGTCAAAATGC TTGGAGAA
GTGATGTGACTGTTTGTCGAACATCTTAGACCTGTTTCATGTACTTGTAATCCTAGG CAGCTTTGT
ACACTGTCTATAAAAAGTCATTTACTACATTCCCATAAGCACCTAGCCTGGTATAGT GGTATGCA
TGACGTTTTCTAGTTATCCTCAGGTTTTGTCGTCCCCTTTTGCAAAGGAATAGAACA GAGATTAAT
TTCTCGATTCCATAAAATCTGACGTTCTGCAATTTTTGGTGTGAAAGAAGTATCGAG GCGGGCCA
GCTGATGCCGGTGGAAGCAAGGATGGCGCTGATTGAGTAGGCGCAGCCGCTTGTTGC ATTTGCAG
GTGGCTGCGGCGCGGGGCGCTTGAGGCAGGCCCTGAACATGGGCTGATGGGCGGTGT ATCAATC
TTGTGTGACCAGCACCGGCAGTGTGATTGCTTTCACATAACCGTAGTGCAGGCTGCA GATGCTAG
CAATATTCAGTTTCAGGACCATTCTGACTTGCAACTGGAGTATGACTTGTTGCTGTG ATTTTGCTG
ACGGGAACACAATGGGCCATGACAATGGCTTTCCTTATTTCCGCAGCTGCTGCTGAC ATTCTCTA
CGGAGGCTGACACCTGACAGTGAATCTACTCTTGCTGCAGGTGGGACCAGACTACCA GAGAAGC
GCACCCGTAGCGTCTCCATCACGGTTCCGGTTGGTTCAGGATTTAAATTCGAAGAGC TTCTCGGG
CTAGGCCTCCATCGTTCTGATGATCACCCCTCCCTTCCCCTTCCCCAAGTCGTAGCG GCCAGCTGC
CAGCACCGCAGCAGGCAGGAGCTATATAATCAAAGGCAAACAGCCAAACACGCACAC ACACCTA
GCCGTAGCATTGTAGCAACACGCGCACTCGCCGCTGCCGCACGGATCACGCAATGCC TCCCACCA
TGCTCG GCC^^
CTGCTCCACOATCAGACTGCGGC TGCGGCOAAGCGGCCCGCTGAGTCOTCCACCTCGGCCTC CTCCTGCTTCTACAGCGAGGTGATCTCCAACTCCn^CCACATCCCTCGCCGCGTATCAGC ACCCGGA G A A G AGGC AGC GGOGOC AGG ACG GG ACGCGG ACGCGGGC G AGGC GOG G CCGGCTGGCTCCG A
Kj { t . UIJAIJVJ i G A t GGLGU . Gi. UAGUG t Ut G i. ) i. UL WvUG ) i. UAUGt L- i. L-t wiAj i LG i . G TGC TTCiCiC iCCGTGCTCG AGTCX * GACCTCGCCTGCX * CGGAG AGCTCGC GACGACGCTGAGAG
GACCGACTACTCCTCCGCGTG€GATG^^
AGCGGTCCCAGC GCTCCCiCTCTGT^
TOACAACOACCKH^ X lCCTCOK!aACC^
C GCOTCACCCCAAA€£ H:G
CTAAGCGAT TG
TCTGGTAAATTGATAATGTTTTGGTGGGAATGCTTGTGTCAGGGGAGGCGATTTGAG GACTTGGA CGACGAGGAGAGCTACGAGCGGTTCCGGCGGCGCGAGCGGCGCGAGGCTGTTGCGCGCGA CTAC ACTGAGGTGTACAGCTCCATACCCGGCAGCTACGGCCGTCTCGTCGTGGAGCAACGTGTC GTCAT GGTGAACTGGATCATTGAGGTCAGTTCATACT
AGATCAACAATTTACCTCAGGTTATGCATCTGATATGACCGAATTTATACGGTGTTA AGGGCTGT
ACAGGCTTATGCGCGTGAGTTCAGACTTGCATTGTGCCGGCGTTGCACCGGAGCGTA CGTCTCTA
GCACCATAGCTGTATGATTGCAGCAAGATCTGTTGTCTCATGTGAAGGCCTTCGTGC TGAGTCAG
CAGATTAGTTAGTTCTCTTTCTTATTTTGCACCGGGGTAGTTATCTACTACTGAACA CGTTGTTAA
CACTAGTTGATGAGTAGATTTCACAATAGCTATGTTATGCAGATGGATGAGTGTACC TTTTTGTCT
GTCAGTACACAGTAGTCATGCTAGTTCTCATTCTATACTTTGATCACTGGTTCTGGC AACCAACAG
AGGTATGTTTGAATAGTAGGTTTACATGGTCTGTTTGATAAGCAAGGGAGAAGCAGC TGGTTCAG
TTACTCAACAATGTTTCTCCGAATTTCTCGTTTCGCATCTGAACCCCTATCTCATCA GCACACTGC
TTGCATGTCTGGAATCATATCCTGTACCAATCAGGATGCTTGAATCTTCAGTATCTA AATATGCAA
CTTTCTATCTGTAACTCTTTATATACCCTTTATTTCATATCGATCCCAAATTTAAGC TTTCTGACGT
GCTTGGTATTTTATGAGATTCCAGAGAACCCCTGAAAATACTGTCACACTATTTTTA CATCTGAAG
GTTTGAATTGCGATGAAGCATTACAGTTATATTTCCCTTGTACTCTGCTAGAATAAT CTCACTGCT
CAAAATTATGCGATGCAAATTTATGTTGATTCATTTTTTGCAAGGAATGACTCTTTA TTATGAAAT
GCAGCArrCACGTCTOATGAAACTCCAGCCAGTTACAATOmATGGGOATTGGATTGA TGGACC
G TTCTTGACACAAGGGTATATGAAGGGTTTGAG AAACTT AGTTG TGGG ATTGCXTGCATC
GTTTCAACTTTTATGT
TTCTAGTTCTCTTCTATGCTCAATTTCAGTGAAACTACTGATGTTAGTTTGTGATTC CATGGTGTCA
GATTGCCTAATTAGATATCCACAGAATTAATGTTTAGCACCAACTGATGTGTAATCA GTAGCACT
CTGAGTGAGTGAACTCCTCCTTTAATTTGAGTTAGTTCTAACATTGCCTCATAGGAT ATATGCTGA
TCATAAGCTATGTTTTTATCCATTTGCATGACTACCGCTGAAATGTGCCTTGGGTCT TTGACAGGC
TTTAGCAGGGCAGATGAGATGGTTTGATTTAGTAGAACTATTTTACCATTGAATCAT ATTATGAAT
TTGAACCGTATATGCATGTGGAAATGGTTTCCATGCATTTCCATCTGTCATTTGTTT TTGTTTTTGT
TTTAAGTGAAACCATCTGCCATTTGTTCCTTCTGATACTTGGTATCTGTGCAGCTTG CGTTTACAA
GTGTTTGTAAGGTGAATATTTCAGAAGTTTCTTTTTTCCAGGACATAAATTTGGGTT TCCTGATTG
TGCTGTTATCTATGATAAAGGCATTGAGCATACTAGTTAGCATTTTTTTTAGTTGAC TTGATATTT
CTATATATTTGATTTGATATGTATTACTACAATTCGGATTTGGAAGACATGTGAAAG AAGTATATT
TAGCTGAAATTGCACTTGAGCATGTCCTTTGTTCTCCAGATCATTTTTCTTTTCCTA TTCCTTTTCC
TTGTTTCTGTTAAGCTCAATGCACAACATTAATTTCACTGCTTGGTCCCTTTCAGCG TCCTTCAAA
AGACTTTTAAAGTTGGGATCAATACCTACAGCCAGAGTGAGGTTGTTGCCATGGAGT GGCTGGTT
CAGGAGGTCXTCAACTTCAAGTGCTTTGTCACAACAACTCAAC^^
TCCTGCTTTCTTGTCTGTTCAGTTGAAAAAAAGTAATGGGAGACTAACACCATTC
TGCAGGTTCTAT OA^GGCTGCAAATGCTGATGACAGGOTAGCAGACCTGGCAAACTACCTGGC
CTT GTCTCACTTCGGGACXrATAAGAAGCTCTC TTCTGGC CTCGA TGTGGCAGCCG AGTGQT iiiiiiiiii^
AAGTCCTCC^^
ATATAAATCAAAGTATGAACTGTAACTACCAGAAGCTCAAGCCAGTTAGCTTGAAAACAG ATAC CAAACTCCAAATTTACGTGCTCAGATTCTAGTTAGACTGGAAATGCCTAATGAGTAGCTC TTTAC ATATATGCAGACTCACATGAGGACGCAGGACGATGACCTGCCAGAATGCCTAATGGTACG CTTCT CCTCTTCTTCTTTACCTCTTTTGTTTTTGGAAAGACACTGG
CTACAGTTTGCAGGGCTGCTAACCTTACATTCGTTATCCTATGCAATCTGTTTCATG CAGTGC TC GAGTGGCTGCTCAACTACGTCCCGTGATACCCAGAGCTCCCAGGTGATAGCAGTGTTTCA CATTT TTTCTGTAAATGGGGACATGAACTGACAAATTGCTCTGTACATGGCATTAGTCTGCCCTG TAGTTT
Purple false brome Brachypodium distachyon (SEQ ID NO: 40)
>Bradi1g69380 | Bd1:68032312..68043073 forward
ATGAAACCACAGAAAAAATTTCAACTCAAAACTGGGTCAAATTAATGCAAAACTTAGTTC GGAA
TTCAAATTTGGAGGTTCACTCTTGAGATCGTTCATTTTGTGAAGTATGTACCATTTT AATTTTGTGT
ATTTACTTGTAAATTTTATCCTTGTGTTGTATACCAATAACATGGTACCATGCCAAA AATTCTGAA
TTTTTTATAAATGTTTAATATTGTTTCATTTTTCCCTATTAAACGTATATAGAAAAT GATAAATAAT
TATTTTTACATAAAAAGTTAGTATTTTAATTCACATTACTCGTCATCAATGTCAAAT GAAAGTACA
GTAAAAGTTTCAACTCAAAAAGTGGTGGTTTACATCATAATTTGAAACAAGAGAGGA AAGGGAT
ACAAAAGAAAAAATATTGAGAAACTCTTTGCCGGCGGCCGGCCTTCGGCAAAGAATC CCGTCCG
TTTTCTCCCCGTTAGGCGCCGGTCAAGTCCACGTGGGACCCCTTCCTTTGTCGTAGG CATTTCTTT
GCCGAAGGCCTTTTTGTTCTTTGCCGGAGGCTTTTCTTTACCGGAGGCCTTTTTTAT TCTTTGCCGG
AGGCCTCTTCTTTTTTGCCGTCAGCAAAGTCTTAGCCTCCGGCAAAGGCCCAGGCCG CCGGCAAA
GAATGTTTTTCCCGTAGTGTACTCCCTCCGTTCGTTTTGAAATTTTGCTTTGACCAT CAATTAGACC
AATAATAAGTGAATTATGTATTATAAAAGTATACCATTGGAAACCTCTTCCAAATAT GAATCTAG
TGGTATAATTTTTATAGCATATTATTTTAATTTTATTAGTGTAATTGATGGTCAAAG TTAGACATC
AAAATACGTGGGTACGTTATATTATAGAACGGAGGGAGTACCTCAATTTGCCTCTCG GACACAAG
GTCCAAATGTCATCGCCGATGAGGCTCGAAGCCGAGAAATTTCTGTGAAGATTGTCG TTGCTGGC
CCGAACACTGCCTCAGCGGAAGTCATGGAATTCATCACTTCAAGTCTAACTCAACAT AATTCTGC
TGCATCCGATATTATCAACCAATATTAAGCCCATTCCGAAGACCAAGAGCTTCAGCC ACCAGCGG
AGGTAGAAATTCAAATATTTTGACGTCGAGGGGAGCCGACTAAAAGAGTCAAACGAA ATTTTTG
TTTTTTTCATGTAGTTAACAAGTAAATGCCACGTGTTAGGCGGGCCCTTGGCCCTGT TGGTCACCC
TCCACCATAGGCTCAGCTATTGCCGCACTGCCAATGTGGAGGATGAACGAGGTTGTC GTCGCCAT
GAAAAACCTCTGATCATCACTTTATATGTCTGATTTTTTTTGTTACAATAATAGGGT GTGACTATT
TTAAAAACAAGGATTAAATCTCAGTTGTATCACAACCGTTGGAATTATTTGGATGTC ATGTCATC
CCGGTCTTCTCTTTCCATCGTTCGTTCTGAATCTTACAGAAAAATCAAATCTAATGG TTGAGAAAT
CAAATCACTGATAACTAAAAATCAGGCAACTTAGATATGGTAAAACCATAATGAATT TTTGTAAA
ACTTTAAAAACTCTTGCCAAAAGATTTGTTCTCCTCGCAAAAGAAAAAGAGACATAG AAGACGT
GAGAAGAAACTCTGATCAGCGAAATTACCAGAACCTGACTCCTCAAAACCACGCTCG CGGTGTG
ATACCGACTTTTATTATCGTGTGCAGTGATCGCATGTGCGCTTCCTAATCCTGCAGC AGCCGTCTT
CCGTGTTCCGTCTCGTTTGGAAACGGGGAACACCGAGGCTGTTGACCTGTCGTTACC GTCACCGT
CGGTCCATCGTTCCGGTAGATCGCTCGTACACCGGTGTTTCCTGGGCCGTGGGATCC GCACCCAC
TGCAAGCGGGGTCCTATGGAGCGGTGTACGAGCACCCGGCGCAGCTGGGAGCTCGTG ATTTCCTG
GCCAAGCTCATGAATTTAAAATTCAAAAAGTGCTGGTCGGGCTGTGAGTTCATCATG GCAGGAC
GCAGGAGTCGTCCTCCCATTCCCCCmrCTCCCCAGTCTCGACGGCCTCGCAGGCGCG ATTATATA
AG AAGGCAATTCAC TAGCCGTAGC CGTAGCGA ACACAACCACACACGCACACGAACACAC
ACTATGCCTC CAC ATGCTCGCA CGGTGCCX ACGAQGCCG GCTCCAACCCX'TT CGC GGCG
GAGAGGGGC ^TOTCCG €CCAG€€CAG
GAQTCQTCX^ACATCGGCATCCTOT
CTCt^CGC GCCCAGC( Ce<^^
CAGCCTCCGAGTGin AGAGGTCAT
G AGTCATCCTG CCTCGGCTCC GTCXTOG AGTCOG ACCTTGCCTGCCCCG AGCAGCTCG CCOACG ATG€AGAGCCGACTGAGTA€T€TTCGGCCCGCGATGACCTGACGCAGTCAGACG CCGAAGAGGA Gi yn :TC OT TO
GT OACGACGACGATGACGCCGCCCCCTCTCCCACCTTCTCCCTCTTCCTCGCCrrCGCCGAG CA ATHTCGTCCCCTGCGCG ^ GGTTAATTTCTACACAGTTGTTCTAAATTTGTTTGAAATTGGGTCTGTTTGCAAGTGTCG GTGCGG
TGTTTCATCCGATTAGGTGGCTTGGTGGGAATGTTTGTGACAGGGGAAGCGGTTTGA GGACTTGG
ACGACGAAGAGACCTACGAGCGGTTCCGGCGCCGTGAGCGGCGGGGAGTGGTGGCGT GTGACTA
CACCGAAGTGTACATCTGCATGCCAGGCAGCTATGGCCGTGCCGTCGTGGAGCAGCG TGCTGTCA
TGGTGAACTGGATCATCGAGGTCGCT
GGTAGAAGC
AAATTCTGTGTTGTTTTGTTTGTCTGAGTCCGAGTGTCCAATATGCTCTGAAAGCAC GGTAGTTTT
GTGACTGCGCTAATAAGCTGATCTCTGGTGTAGATGTTTGTGCTGGCCTAGTGAGGC AGCAGATT
TAGCTATGCGATTTCGTGATTAGTGCAGCGGCAAGTTGTGTACTATCTAAGAATTTG TTGTACAAC
ATTCTGATAAGAAGATTGCGCAACTGACATTGTTCGCTGAACAGAAGGATCCCCATT TTTTTTTTG
GAACTGTTGTTGACCAGGCCATACTTATTGCAGTACTCAAAAGGACTCTGATCACCA ATTTTGAC
TGTTAGACCATCCAAGTCAAAGAGATCAGTGCTAGGATGTTTTAGCAGGTGTTTGTT TTGACCTTT
GACATTTACTATTTGAAAAGGAATGGACAAATAGATAGTTCAGTTATGCTGAGAAGT TATTCAGT
GAGCCATTTGACATGTCATCCGCATGTGGCCTCGACGCCTCGTGTGTCTGGAAAGCA TATTATAG
GAGTAGCAATTAGGATATCTGCATAATTTCTATGTACATATGCAATTCATGAGTACT TCGGTATA
ATCACTTATTTAGCCTCCTATGAAAAATCTTAGTTTGTCTATGCACTTGATATTGCA TTGAGACTG
GAAAGAACTTCTGATAATACTGACACCACTGTGTCTTCCACCTGAAGATTTGGGTGT CTTCCACCT
TGTACTGTAATATTCCTGAAAAGCATTGTACTATGATTCCTGGAGCAAAGATTTATT TTCAGATAA
AAATTGGTGCATGC
TATACTACTGTAGGTTGTTGGTATACAGTAGTCAGATTGTGTCATTTGAAGTGTGTA CCCTCTTAA
CTGATGCATTGCTAAATGAAATAATGCTTCAAAGAAGCTCCTCATCTAAATTCAGAT CTTAGTTC
AACGTAGTTTCCTACTTCCTCCGTCCAAAAAAGATGTCTCAAGTTTGTCAAAATTTG AATGTATCT
AGACATGATTTAGTGTATAGATGCATTCAAATTTAGTCAAAGTTGAGACATCATTTG TTGGACGG
AGGGAGTATTACATATTTACATTGTGACATGGTTGTAGTACATAATACTGTTAGTTC CTACCTAAG
CTATTCTCTGTGGTATTTGCTTTTCTGTTGCTAAAGCTCATTGCAGTATGATTTAAT TGGGAACTTG
ATAGACCTTAGCAAGTATCCTTGGGAAGCCTTGGTTTGTTGGAACTGTCATCGTCTA ATCACATG
ATGGATCTCCATAGAAACATGTGACAATAGTTCATACACGGTGTTTACTTATCTCAT TGCAGGCTT
ATCGGCTATCACTGCATGCTAGTATTTGCAAATTGATCATTAATCAACTTCCATTTT TATGGGTTG
AGCATTTCAGAAATTGACTTTCTTAATTGATTTACCTGTGGTCAGCTAGCATCTTCA GTTTAGAAC
ACAAAATCCATTCATATGTTATCCCCACTGAAGGGAGTTGAACCATTGTACGAGTGA TCCTAGGT
AGCATAAGGTCCAAACTTTTTGATTGTGCATACTTACATGATTGTTCAAGTGAAATC AGAGCCTTT
TTGTGGTTGTTTTAAAGTTTTTGAGCCTGAATTCAAGTGGATCTTTCCTTATTATTA ACAGCAGGT
CTGAAGATAATAAATCATTATGTGTCACACAGTAGTACCTCCGTTCCTAAATACTTG TCGCTGTTT
TAGTGCAAACTTGCACTAAAACAGTGACAAGTATTTAGGAATGGAGGGAGTACTATA TATGCAG
AAACAATAGAGTACTTAAGATTAACGTCAACAGGAGCACTGCAGCATTATTGTTGAA CTTCTGGG
TTTATTGTCTATGGGATCAACATTTGTTTCCTCATTAATGTTTCTGTTCAAAAAATG TGTGATGAG
GAACCTCACTATATTATCTCTTTCAGCATCCTGCAAAAATCTTTCAAGGTAGGGATC AACACTTAT
GGCCAAAGCGAGGTCGTTGCCATGGAGTGGCTGGTTCAGGAGGTCCTCGACTTCCAA TGCTTTCT
CACGACAGTCCACCATTTCCTCTGGTACTACGTGTTTCCT
AAAACGAACGAGAAGCTAACAGCTGACTTTGTTCTAATTTGGCAGGTTCTAT TGAAAGCTGCCiA
AAGCGGATGACAAAGTTGAGGATATGGCAAAGCACCTGGCCrTGATCTCACTTCTGG ACCATAA
G ACCTCTCX^ACTGGCCC GACCGT GCAGCAGCAGTGGTAGCC TTGCTTGCCTTGCCACAG iiiiiiiiiiiiiiiiiiiiiiiiiiiB
TTTCTCTATCTm
ACCGAACTGTTTGCTCTTTGCATAACGCAGACTCACATGAGGACGAAGAACGATGATCTG CCTGA
ATGTTTAACGGTTTGGCCCCTCACTCGCATTCTGATACCTGG^
TATTTGCAGTAACTGATGTACGGAGGAAGTACAATTTTGTGGTGC
ATGTTATGCAACTTCTCGTGCAGAGTCTCGAGTGGCTGATAAACTATGCTTCGTAGT ACCCTGGG CCCCCAGAATTGAGCATTCGATCTAACCTTCGCTGATCAGCACAGCATAGCAGTCGTTTA GCAAC AACAAAAGAGCGTACATGCCATCTGGTTGCACAGCAGGATAACTAAAAAGGACAAGGCAG CAG
GTTTATGACTGTAGGGCCAACCGTTGTGGTCGTCTGTCTTTGCATCAGCAGCTAGCT CTTTAGGAA
CAATTAAGGATTTAAGGTTGGATGCTGTAGTATTCCTCAATGTCTTTTTTAGATCAA CGGTCTTGT
TTAATGAGCCTGCTAATGTTAGTGTATGATTGCTATTTTTCGCCGGGTTACTATAGC TCTTTAGGA
ACAGTCAAGGTTGGATGCTGTTGTGTTCCTCAACTTCCATTTTTCAATGATCAACGG TCTTGTTTA
TAAGGACTTGTTTAGTGTTAGTGTACGATTGTGATTTGTCGCCCGGTTACTTCTGAT CATGACCCA
ATCTTGTCTTCTTTTTTCTTTCTTTTTTTAGGGAGTTACACGGTCTTGTCTGCCACT ACTCTTTTCGT
TCGTCGGCCCAACCCTCCCAGGTTCAGCTCGCAGCTGTGCCAAGCAGATACGTTAAC TTAGACAA
CTCCTCAGTTTCAAAAAAAAAAAAACTTCGACAACTCCTTCCGAAGCAACAATAGCT GAAGATTT
TTGGAGCGAAACAATAGCGGAAGATGGTTGAGTCTACACCTGCAGGGGAATGCGTTT TTTCTCCT
TCGGCACCAGACCAGAGTAGTACCAGACCACCAGACCAGAGAGGCAGAGACCATCAC CTCCGTA
GTCCGTAGTGGACGCCACCACCAGATGCCTGCGTGCGCGTCCCTCGTCCGCCGCCTC TCCACCCG
CCGCGATCCCAACCTCGCCACTCTCCTCGCCGTCCTCCGCTCGCCGCAGCCCCCATC CACGCCGCT
CCCGCACGCCCTCTCCCGCGCCTTCCCGTCCCCATCAGACGCGTTCCCCCTCCGCAC CCTCCCCGG
CCTCCTCCCGCTCCTCCCGTCCCCGCTCCTCTCGCTCCAGTTCCTCCTCTGGCGCAT GCCCCCTTCC
CCGCCGCTCCCCTCCCCGCACATCCTCTCCTCGCTCGCCGCCTCGCTCCCCGACCTC CCCACCGCC
GCGCCCCTCCTCCTCTCCTCCTCCCCTCACCCGCTACCCCTCCCGCACTACGCCCTC CTCCTCGGC
ATCTCCGCCCATGCCGGCCTCTTTCCCGCCTCCGTCGCGGTCCTCCGCCACATGCGA TCCTCCCGC
CTGACGCCCGACGCCGCCAGCTTCCACTCCGCCCTCCGCGCAGCGCGCTCGCCTGGT GATGTCTC
CGTCGTTCTGGACATCATGTCCGGTGCCGGCGTCGACCCCACCGTCCCCCTGGTCGT GACAGCGG
TGCATAAGCTGGCATCCGCGGGCGAGTTCGAGGACGCCCGCCGTCTGATCGACAAAA TGCCTGA
GTTCGGGTGCGTGGCCAATGTGGTGGTTTACACCGCCGTGCTCGACGGGATGCGCGC TTTCGGGG
ACGTCGATGCCGTGGTGGGGCTTTTGAAAGAGATGGAGGACGGCGGGCTGGGTGCTT GGTGTGT
GCCCAATGTCGTGTCGTACACGTGTTTGGTGAAATGCCTGTGCGAGAAGGGGAGAGT GGCGGAG
GCTCTGAGCGTGCTGGATAGGATGATAGCTAGAGGGGTGATGCCGAACCGAGTTTTC CTGCGGAC
ACTGATCGATGGGTTTTGCGCGGACAGGAGGGTTGGCTTGGTTGCCAAGGCATATGA TGTGGTGG
AGCGTGTTGTCGGTGACGGGACTTTGTCGAGCGAGCAATGCTATAATGTTCTTCTGG TTGGCTTGT
GTGGGGCGGGGATGTCAGGGGAAGCTGAAGGACTTGCACACAGGATGATGAAGAAAG AGGTGC
AGCTCAGCCCGCTCGCGGCAAGTGCAATGGTGAGGGAGCTTTGCAGGAGGAAGAGGT GGTTGGA
TGCTTGCCACTTGTTGGGAATGATGGAGAAGAACGGTGTGCTGTGTGACTCTGATGT CTTTGCTG
GTTTGTTGCTGGGGCTGTGCGAGGACGGGCATGTCCTTGAGGCCTCAGCATTGGGGA GGAAGGTC
ATCGAGAGGGGGATACACATGGAGGCTTCTTGGGCTGATTGTTTGGTGCAGTTATTG AAGCAACA
TGGCAATGAGGAGCTAGCATCATATGTATTAGGATTAAGGACTCGTGAGTGATGTCA CTTTGAGC
AATGTGTGGTCCTTTTCCCCAATCCTTGCTTTGCTGCAACATGGTAATGAAGAAGAA AAAAGGTT
TGTTTTAGTTGAAGCAAGGACCATGTTTGGCTCCGAATGATACAGCTAGGAAGGATA TCTCTTGT
CAAGTTGCTTTTGCTGCAACAGATAATCGGTGGATGCAGCACAGAAAGACTAGTGTG ATCAAATT
TTGGGTGCCGCACAGAAAGACTTGCTTCGCTGCAACAGAAGTACTTACGTACTTCTA CTTGATGC
TTTTGCCAAAGAACTTGCTTTAGATCCAGTGGAAACTTAGGCCATGTAATTACCATT ACGAAGGC
CTCTCAGGACTCAGGTGATCATCACCATGCCTCCCAGATAGATGTGCTTGCAACACT GCTAATCA
ATTGTAGGAGTGGTGCCATAAGATGCAGACTTCAGTTTAATTGCTTCAGGCAGTTCA CCACGATT
TAGTGGCTATTCTTTTTGCTAAGTAAACCTACCGTGTCAACCTTTTTGGTTTCATAT GGTTACTTCT
GCAAGAGAAATCAGGGACTTTTTTAGTGGTTGTGTTAAGGTTTTTGAGCCTGAATTC AAGTGGTT
CTTATCATGCTTACTTTTTAACACTTAAAAGTTTAAACAAAAGTATATTCATTACGT GGCACTGTG
TATTGTTCAGAATCAGTCCACACTTAAGATATGAGATGTTCTTTTTTACCAGTCAAC TCCTTTGCA
TTGCAGAAGCACACCGCGGTATCACGGCAGAACTCATTTTTTTTTTTAGTTTGTTCT ATTTGCTTCT
GTTCAGGACAATGGGCTCATTTTTTTTTAGATGCTCCCAATGGGCTCATTATGTTAT TTCTTTCAGC
ATATTCATGTTCCTTGTTTCTGTTCAGGACAATGTGCTCATTATGTTATCTCTTTCA GTATCCTGCA
CAGGAGGTTCTCAGCTTTCAGTGCTACTTCTTTTCTACATCTTTGTCCGTGCCAAAC TTAACAAAT
GAACGGTTTGCTTTTTTCTGATTTTGCAGGTCCTGTCTGAACATATTCAGGCTCCAA AATCAGATG
ACAAAGTCAAGGACCTGGCAAAATACCTGGCCTTGCTCTCAATTCTATGCCATAAAG CACCTCTC
TTTCTGACCCTCACCCGTGGTAGCCCTTGCTTGCTATGCCACAGAAAAAGAGCCCTC CTGCCATTT
GGTAATAGATCAAGGTAAACACTGACCTCCATCGCTTGCCTACATATATTCTTATTT TCACTGGTT CTTTTCAGAACTTAGATAAGGCTATAAGCCGCAATGATCCTAAAAAGAACTGGAAACGCA CGCCT
AACAGCTTGCTCTTGTATAAACGCAACTCACTTGAGGACGTATAATGACAATCTGCC TGAATACT
Green foxtail Setaria viridis (SEQ ID NO: 41)
>Sevir.6G118600 | Chr_06:17567545..17569287 reverse
CAGTCCAATTATTATGAAGAGACTGGGGGTCGGATGGAAAAAGGAGGAAAGAGAGGGGGA TAA AGGGGAAAGTTTTTCTCTTTCTTTGAAACATACAGCCAAAGCAGATTGCGCATGGGCGGC GGCCG GAGTGTGGCGTCGCGACCTCCGCTCCCGTCTCCGCCTCCCCCCTCCCTCCATCGCGCCTG CTCCAT
CTCGGCCTCGGCGCCCCCGCGCGGGAGGCGGCGGCCGCCGCGCCGTGACTCCGGCGG TTCGAGC
CGACCCGGCCTCGATCTGCGCCAGTGCCTGGCGGCCGCGCCCCTATCTCCGGGGGGC ACGTGTTC
TGGCGGACGGAGAAGGACGAGGACGAGAGGGGCCTCGAGGCAACGGAGGCCGCCGTG CGCGTG
GTCGCCACGTCCGACTGCATCGAGGAGGACAACACGGCGACCGCATCCACGGCGGTG TCCCTGG
CACGCGTGATGCGCCGAGCCACAGCTCGCGGAGCTGGTTCGCCTCATCGGCTCCGCC GACTTGAA
GAGCGGGCTTGACTGGGGCACCAGGGCGACGGGGCTGTCGTGCCTCGTGGCCGTCGC GGCGACG
CGGCGGGGCCCCGACTCCGGCCTTCCCCATCGATCCATGGAGAGGGTGCACATCGGC GACCTCGT
CTTCTACTTCCTGCAGGGACACCTCGAGCAGGTACCCTCCCCCGTTTCCACTCCCTC CTCTCTGCT
TCCATTGGAGTTAGCTCCGCATTTGATATTTCCATGGACGATGCCCTCAATATGTTC GATGAAATG
GGTACACAGTACAATTTTCTTCTTTTGCCGTTCAATTTTGCTTGCTGTTTTGTACAG TCCCCATCTC
TATCAGTTGTGCTTGTATATTCAGGTTCAAAATTTAGAGTTCAATAGAGAGATTGAC ATGAGCTCT
ATGAGTTCAGCGGAACTGCAATTGTTCAGTGCAATTGATTCAATTTGTGGTGTAGAG ATGTAGTT
GTTTGAACTTGTGCAGAGAGCTTCCACATCATCTCACTCTAATGTACGCTATTCTGT CTTTTCTGG
GTTCAATCTTCTGTAGGTGTATGTGTATCCCATCAATACTTGCGACATCTTTTTTTC TTTTCTGCAC
GTAACAGAAATTATCATTTGATGTGCTCATGAAAGCATATGACATAGCCTTGATGGA TCAAGATT
TTCTTTCACAAATTCCCTGGCACTATGCAATTATTGATGAGGCCCAACGTCTAAACA ATCCATCCA
GTGTAAGTGGCCATTTTATTTATGTAGTTCTTTTGAATTCAAGGTTTGACTGCTTCC TTACCGTTAC
TGCTGCTTTTGCCACCAAGCACCTCTTGGTACTACTGCTAAGCTACACAGGATGTAC CTCCCCTCG
TATTTGTGCATTATGTGATGTTTTCAGTTTTGATTCAGTAGCACGGAAGCTCAGACT TTCAGTACT
GTTTGGGAAAAGAAACATCTTCACTTTCGAGCGSTGTATTGGTAGAGCATAATGACA GAGGTAAA
TTGGGCTCTGTGACTAAGAGAAAACTAAAGAATAGCTATGACTTGGTGGCACCAAGT GGAAATA
TTCATATTTCACTAGCAGTTTGTGGACTTCAGGTTCAGCTGATTAGTTACAACCTGT GTTGAGCTT
GAGAGGGCTGCAACCATAAAATTGCTCAAAAAACAACCCCATCAATTTCCATTACTT GTAACGAG
AATATTAGAGTTTAGTTACATTGCCAGATAAAGGAAACAGAGGAGAGAGCAGACACC CCCTAGT
AGTTCTAACCTGATTCAAGCTAAAATCCTAGGTGCAATTCTGCATTTTTATTGGTTC GTTGAGCTT
GTAATCCAAGCATTAGTGTTAAAAGGAATTAGATTAATAAGTGTGCATGTGTCAAAC ACTCAAAT
GCTATATGTTACACCCTTTTCCTTTTCTCCTTTTGGTTACGATCAATTAAAATACAC ATATATTTTG
TAACAAACCTTCCAAGTTTAACTCAGAAAGTTCTTTGCCAGCTA TGTATAATCT CTTGAGCAA
CGCTTCATCATGC AAGACGTCTACTACTAACAGGCACTC TATCCAGAACAACCTTT TGAATT AAGGAAGCAGGGGACTCATTAACGGGTATTACTTT AAATTCTAGAQGGGCGCATTTTAGCATA
TGTC A ATTGrCAGGTACCTTTTGTGC
TATTCTC
CACAGTTTATTCATGATTCGTGTTGTATCACATTTGTATAGGTGAATTGCTGTTGGA TGCTTAATG TTTTTACTTCTTGATTCCTTTCGGCGTCCTTCAAAAGACATTCAAAGTTGAGATCAATAC TTACAG CCGGAGTGAGGTTGTTGCCATGGAGTGGCTGGTTCAGGAGGCCCTGAACTTCAAGTGTTT CGTCA CAACAACTCACCATTTCCTCTGGTACCACAAACTTCCTGCT^
ACGACAGGCTAACACCATTGTGTTCAAATTTCGCAGGTTCTAT TQAAGGCTGCAAAGG AGATG ACAGGGTAGTGGAC TGGCAAAATACCTGTCC TGCTCTCA TTCTCAAACATAAGCAGC CTCC
CTAATGCCATTTAGTCATGGAGGTGAACACCTCAGTCCTCCATTTCTAGCTATAAATTGT CATTAC ACTTGCTATTCT
GTCAGTCAGCTTCTAAACATACACCAACTCCAATTTTACATGTTCTGATTCTAGCTA GACTGGAAA
TGCCTAACGAGTAGCTCTTTGCATGTATGTAGACTCACATGAGGACGCAGGATGACG ATCTGCCT
AAATGTATAATGGTACATTTTTCTCCTCTGATTTTTTATAAGTTACT
CCATGCTCTTACAGTTCCAGCAATAGGCAGAAATCTACTTTTATCACAATACATTCA TTACCCCGT GCAATCTTCTTTCATGCAGliiiiiiiiiie^
AGGTGACGAAATTGATCCACAGTTTCCTGATTCCAAGTTACGCAGCACAGTTCAAGCGGT CAGAC
GGGCATOAGGATGTGTAGGCCATACGTOAATCTTAGCATTAACAGATTATTCTGTAC ATGCCATA
AGC AAGGCAG ATAAAACGAAGCX GTGATTAAACGACTTTCTGGCTAGGAGCAAGGCAAGGA
TCGAGAGTrrGGTATTGAGATGTCGGCCrTTAGGAACTACTGAATGACCTATTGCGC TGTCTATTA
TCntGGGTAGGCTTGGTTGTCGCCACGTCC jGGAAAAAATGGGGAAAGCAGACGGTGGTCGGT
False brome Brachypodium stacei (SEQ ID NO: 42)
>Brast02G101200 | Chr02:5946464..5950641 reverse
TATATTTTACCAGTATTTAACCATAACTAGTCAAACATGTTTTGTTTGTTTTTATGGTTT TCTGAGC
GAGTATGCTCTTATTTTGCGTGATCTGCTGCCTCCTCATGTGCATCTAATCATATCA TGATTGTTCA
CTAGTAACGCGAAACCTACTACCTGTATTTACCAGGTACAATAGATGCGATAATAAA GATGATTC
GCTACGAGGGATTGCATGGATTCTACAAAGGAATGGGTACAAAGATTGTACAGAGTG TTTTTGCC
GCCTCGGTCCTTTTTATGGTGAAGGAGGAGCTTGTTAAGTTTGTAGTTCTTCTAGTT GCCAGGAGT
AGGACTGTGCTTCTTACAAGATATAAAAAACAATAGGTCTTGTTTCATGATAAAATT ATTTAATT
GTCTCTACGCGTAATATCCTGTTCGAAATTGCTCTTTCAATTCTTTATTAGTTATGA AATATCTCAT
AATGCTGCTGGTGCTCTTTTTGTTGGTGCCCATCTCTTCTACTGCCTCCATAAATCC ATGTTCGAG
AAAAATATTCATGTGTTTCATAAATCCATAATCCAAGTCCGTCTTAAAAGAGAACAG GCTTAGCG
TGGCGTTTGCATGGTGCCACCAGACATACAGCCTGGCGGTTGACTTGGGTTGTCAGA CATGCATC
AAGAAGTGGCTGGCCGGTCACTTGATTGGAAGCAGTAAATTGTACCGATTTTGGTAC TCCCTACT
AAATCAGGGATATTTATTACTTATTACGTATCGGAGTGAATAGCTGATAATCGCTAT ATATTGATT
GGTTTGTTTTTTTCTTTTTCAAGGGTAGGGCGACTTTATTCCTGTTACAATCAAGTT TTGAATAAA
GCTAGGGGGATTATCGCTCCAAGCTGACAAGGATTTACATATAAATGCCTATCTAGC CAAGCTAT
GAGCTACCTCGTTTGCCTGTGGGACGCAATGTCAAAATGACATGCTCCCGATGCGGC TCGCAAGC
TGGGAAGTGGAGATTGTGGTTGCTACCTGAGCTCACTTCAAATAAATCTGACTCAAC ACAATTCA
GCTGCATCCGAGATTAGCAACCAATGTTATGCCCATTCAAAAGAGCAAGAGCAGGCA GCAAGTC
AAATATTTTGGTGTCGAGGGGAGCCAACTAAAAGAGTCATACACAATTTCTGGGTTT TTTCATGT
ATTTAACAAGTAAATGCCTCGTGTTAGGCGGGACCTTGGCCCCCATTGGCCCCTCCT TCCTCCGCC
ATTGGCTTCAGCTATTACTGCGCTGCCAATATGGAGGATGAACGAGGTTGTCACGGC CATGAAAA
CCTCTAGTCATCGCGTTATATAAAAGAGACAGCGCACAAGCGGGGACACGCGTCGAC CTGCATG
CATCGTCTGCGTTCTCTAGTTTGTTCCATCCACCGGCCAACAACCCATCCGGATAAG GGAACCAC
AGGCGTGCTGTGGAACAATGAGCATGCGAAGAAACCACCCGGCTATGACGTTTGTAT CTTACTTG
TATGTTGATATTTTTCTTATAAATTTTGTCAAACTCTATAAACTCTTGCCAAAAGAT CTGTTCTCCT
CGCAAAAGAAAAAGAGGCGCAAAAGGGGAGAGAAGAAACTCAAGGTCAGCGAAACTG CCACTG
AAATTCTTCGCAGGAAAGAGCTTGACTCCTGGAAAGTGGAAACCACTCGCGGTGTGA CACCGAC
TTACCGTGTGCAGTCATAGCAGAAGACAACATGGCATCGTGCCTGTCCTTCCGTCTC GCTTGGAA
ACGGGGAACACCGAGGCTGTTGTTACCGTCACCGTCGGTCCGTTGTTCCGGCAGATC GCTCACAT
ACACCGGTGTTTCCTCCTGGGACGTGGGATCCGCATCCACTGCAAGTGGGGTCCTAC GGAGCGGT
GTACGTGCACCTGGCGCAGCTGGGAGCTCGCGATTTCCTGACCAAGCTAAGGAATTT AAAATTCA
AAAAGAGCTGGTCGGGCTGTGAGTTCATCACGGCAGGACCCGCAGGAGT€GT€ CTCCCATTCC€C
TTCCTC CCGGTCTCGACGGC T GCAGG GCGGCTATATAAGCAAGA AATTCACCTAGC GTA
GCCCGCAGCGACAM
GCCCAGATCGCAGCGGCGGCGGC CCGAATCG CM CTTCCGCA CGAGOT^
AGAGGC T GGCGTCAGGACGCGGA GAGGCG GGCCTGCAGCCTCTGAGTGCTCAGAGGTCAT CGGCCXJ€GCAA ¾XJ€G€G€GT€G€GGA
TCGAGTCX^ACC TGCC A CC GAGCTGCTCG CGA GATQ AGAGGCGACTGAGTAC TTCG
GTCCC ^ATOACC^^
CGAGTACTCCCTGACCCCC^
CCTCCCCCACCTTCTCCCTOT^
CCCACGCCGTCGCCGACGTTGCGATTCCAGAGGTGAGCG^
GTGTGAAA GGGT
GAATGTTTGTGTCAGGGGATGCGTTTTGAGGACTTGAACGACGAAGAGAGCTACGAG CGGTTCC GGCGCCGTGAGCGGCGGGGAGTGGTGGCGTGTGACTACACCGAGCTGTACAACTGCATGC CAGA CAGCTATGGCCGTGCCGTCGTGGAGCAGCGTACTGTCATGGTGAACTGGATCATCGAGGT CCGTT
TAATACTGCGGTTATCACTCTGGCCCATTTGATTTTTGTGGTAGAAGCGTGCCTTAC AGGATTACA
GTAAAATGCATGCGTACAATGGAAGTCACGTAGTACTCTAAATTCTGTGTTGTTTTA TTTGTCAGA
GTCTGAGTGTCCAATAAACCTCTGGTGTAGTGGTGTAAATGTTTTGTGCTGGGCCTG CTGAGGCA
GCAGATTTAGCTACCCAATTTCGTGGTTAGTGCAGCGGAAAGTTGTGTTATCGAAGA ATTTGTTG
TACAACATTCTGATGAGAAGGTTGCGCAATTGACATTGTTCGCTGAACAGAAGGATC CTTTTTTTC
GGAACTGTTGTTGAAATACCGTGCTTATTGCATTACTCAGTAGGATCCCTTATAGTA GGATTCTGA
TCACCAATTTTGACTGTCAGCCCTTCCAAGTCAAAGAGATCTGAGAGGAGTGTTAGG ATGTTTAA
GCAGGTGTTGTTTTGACCTTCGACATTCACTGTTTGAAAAGGAATGGACAAATAGAT CGTTCAGT
TATGCTGATATCAGTGCCATTTTACATGTCATCTGCATGTGGCCTGTGTCAGGGAAA CATATTAGG
ATATCTGCATCATTTCTATCTAGATATGCAATTCATGAGTACTTTATCGGTATAATC CCTTATATA
GACTCCAATGAAAATGTAAGTTTGTCTATGCACTTGATATTGCATTGATTCTGGAGA GAATCTCTG
AGAGAATACTGACACCAGCGTTTCTTCCAGTTCCACCTTAAGATTTCGGTATGCAGT TTAGGTATT
TAAAAGATGAGAAAACTTGTACAGTAATATTCCTGAAAAGCATTGTACTATGACTCC TGGAGCAA
AGATTTATTCTCAGATAAAATTTCATCTACAACTGACGAAATACTGACTGTTTCCCA TGAATTCTA
GCATGGCCATGTTACCGATCTCCAGCCAGAGACAGTGTTCTTGGGGATTGGACTGAT GGATCGCT
TCTTGACCCGTGGATACGTAAAGGGCACTAAGAAAATGCAATTGCTGGGCATTGCrr CCATCACC
C TGC AC OCATTQAAG^
ATTCTGTTTCAGTAA
CTGATGCATTGCTGAATTAAATAATGCTTCAAAGAATGTCCTCATCTAAATTCAAAT CTTAGTTCA
GTGTAGTTTCTTATTATATTGTCACATGGTTGTAGTATAGTACTGTTAGATCCTACC TAAGCTATT
CTATGCGGTATTTGCTTTTCTGTTTGATAAAGCTCATCGCAGCATGACTTAATTTAG CATTTGATA
GACCTTAGCAAGTATGGTTGGGATGCCTTGGTCCTGATTGATTTACCTGCGGTCAGG TAGCTCTCC
TCTCACCAATCACTAGAGCATAGTACAGAGCTAGCATCTTCAGTTTAGAACATAAAA TCTATTAC
TATGTTATCCCCACTGAAGGGAACTGAACCATTCTACGAGTGATCCAAGGTAGCATA AGATCCAA
CTTTAGTTTGATTACATCAGGCAATTCATCATGATTTAGTGCCCATTTTGACTTGGG TAGACCGTT
CATTCCAGAGTTCAATCTATTTTTTGCCAAGTAAACCTGTTGCATCAACTTTTTGGT CGTGCATAC
TTACATGATTGCACAAGTGAAATCAGAGCGGTTTGTGGTTATGTTATTGACCGCATG TCTGGAGA
TAATAAATCATTGTGTGTCACCCAGTACTATATATACAGAACCAATAGAGTACTTAA TATTTAAC
ATCAACAGGAGCAGTGGAGCAGTGCAGAATTATTGTCGAACTCTGGGTTCATTGTCT ATGGGGTC
AACATTTGTTTCCTGATTAATGTTTATGTTCAGAAATGTGCAATGGCGAACCTCACT ATATTATCT
CTTTCAGCATCCTGCAAAAATCTTTCAAGGTAGGGATCAACACTTACAGCCAAAGCG AGGTCGTT
GCCATGGAGTGGCTGGTTCAGGAGGTCCTCGACTTCCAATGCTTTGTCACGACAGTC CACCATTT
CCTCTGGTACTAT GTGTTT
CAAGCCATACAAAATGAACGAGACGCTAACAGGTTACTCTGTTCTAATTTGGCAGGT TCTATCTG AAGGCXCX^GAAA CAGATGAA^^
TGGACCATAAGCACCTCTC TA TGG CCTCAACCGTCG AGC GCAGTGGTAGCC TTGCTTGC
ACCTAACTGTTTGCTCGTCGCATAATACAGACTCACATGAGGACGAAGAACGACGATCTG CCTGA ATGTTTAACGGTTTGGTCCCTCACTCGCATTCTC
CGTTTGCAGTAGCTGATGTAAGTAAATTTTTGTGGTGCCGCAGCAACTAAGATTCGT TATGTTATG
CAACTTTTTGTGCAGAGTCTCGAGTGGCTGATAA^CTATGCrrCGTAGTATCCAGGC CCCCCAOA
ATCGAGCAG ATAGCAGTCATTTAA AT AACAAAAAGAGCGTACATGCCATTTQGTTGCA AAC
AGGATAAATAAAAAGGACAAGGCAGCAGGTTTATGACTGTAGGGACAACCGrrGTGG TCGTCTG
TCTTTG AT AT AGTTAG T TTTAGGAACAATTAAGGAGTTAAQGTTGGATT TGTTGTATTCC
TCAA CTTCTGTTTTTC TTGGATCA ACGGT T GTTTAATGG G CTTGTTT AATGTTAGTGTATGCTT
TCAACTTCCATTTTTCTATGGATCAACGGTCTTGTTTA
Switchgrass Panicum virgatum (SEQ ID NO: 43)
>Pavir.Ia04006 | Chr09a:74385039..74391468 forward
TCGACAAAATGCCGGAGTTCGGCTGCGTGCCCAATGCTGTGGTTTACACC GCGATGCTCG
ATGGGATGTACAATTTCGGGAACTTGGATGGTGCGGTGAGGTTGATCGAGGAGATGG AGGGCAG
TGGGTTGGGTGCAAATTGTGCACCGAACGTGGTGACCTATACATGTTTGGTGAAATG CCTCTTTG
GGAAGGGGAGGTTGGCAGAGGCGCTTGGTGTGCTGGATAGGATGGTAGGTAGAGGGG TGATGCC
AAACCGGGTTTTTGTGATGACACTCCTCGAAGGTGTCTGCACGGAGCGGAGGGTGGC CGATACAT
ATAATGTGGTCGAGCGTGTGGTTGGTGATCGGGGCATGTCGAGTCAGCAGTGCTACA ATGTTCTA
CTTATTTGCTTGTGGAGGGTTGGCATGACAGCTGAAGCTGAAGGATTGGCACAAAGG ATGATGA
AGAAAGGGGTGCAGTTGTCCCCGCTTGCTGGCAGTTTGATGGTGAGGGAGCTCTGTA CAAGGAA
GAGGTCGCTGGATGCTTACCACTGGTTGGGAATGATGGAGGAGAACGGTGTGCTGTG TGACTCTG
ACGTGTATGGAACTCTGTTGCTTGGTCTGTGTGAGGAAGGGCATGTCCATGAGGCAT CAGCATTG
GGGAGGAAGGTTGTCGAGAGAGAGATCCACATAGAAGCATCTTGTGCTGAACGTTTA GCGGAGT
TTCTGAAGCAATATGGTGATGAGGAGCTAGCATCTCATTTATTAGGATTGAAACAGT GCCCTGGA
GGGCTGTCATTTTAAGCAATGCGCGATTCTGCCCAACCCTCTGCATGAAGCATGTCA TGGTTAGT
CATGGGGTGTGCCAAGAATAGTGGGGAATTTGCCTGAGAACAGATTTAGCCAAATGG CTTAGTG
CAGTCAAAAGTTTACTTTTGTTGAATAAAACATGAAACATAATTCAACCGAAGTGCT GCTGAACT
ACTTGCTTCTTTTGTACAAATTTGCTGAAGAACATGATGCAGATCCAGAGGACACTT GGCGTCAA
GTAAACTACCATTTTGATCACTTCTCAGGTAGATGACATGCTTGGACAAGTGCTGTG CCTGTCAGT
CGAACGTTTTAGATATGTTTCATGTACTGTAATCCGAGGAAGTTATGTACAACGTTG CTCGAGTC
ATTTAACATGATACGTGCCCATAAACACCTACCCTGACATGCTGTAACGTTTTCCTG TTACTCAGT
TTTTTGCTGCCCCCTTATCCAAAGAACTGAAAATAGAATTTACTTTCTCATTTTCTT CCAATTTGTT
TGTATGATTGAGCACAACATTTTCTTCCAATTTGTTTATATGATTGAGCACAACCTG CACCTGCAC
CACATTGCTTTTTAGGAACGTTTACTTGCAAATTTTGGTGCCCGTCAAATCTGAGTC TGACATTCT
GCTCTTGTCGGTGTGAAAGAAATCTAAGGGCAAGCAAACAAAACCAGGCGGTCCAGC TGATGCT
GATGGAAGCAAGGCGGCCGCTGCGCTTGCATAGTTGTATTTGCATTGCATTTGCAGG TGGCTGGG
CGCTGGAGGCAGCGGTCGAGTGAGAATGTTTTCACATAACTGTAGTGCAGACGGCAT TCTTCAAG
TTCAGGAATCAGGGATCATTTTGATTTGCAACCGGAATATGACTAGTTGCTGGGATT TTGCTGCTG
GGAACGCAGTGAGCGATTGAACTCTGAAAGAAAAGTCACGAACCTATTGCCACTCCG AATCAAG
CTATTCAGCAGATCACACACATCGCAGCAAAAGTGAATCACGGCAATACGGCATGAC AGTGACA
GTCTGCAACAGCCCCGGACATTCCCAACGGAGGCTGACGCGGCCGTTGTTCTGGCAT CCCACGCC
GCGAGCGGGGCTCGCAAGTCGCACAGCACGCCGTCAATCTACTCTGGCTGCGGGTGG GACCAGT
GAAGCGCACCCGTCCATCACCGTTCAGGATTTAAATTCGAATTGCTTTTCGGGCCTG GGCGTTCAT
TGTTGTTGATCTCTrCTTrCCrAAGTCTCAGTGGT€TCCACACAGGCAGCGGCAG GTCGGAGCTAT
GGGGGGCACGCGCACGCACGGATCACACAATGCCTCCCGCCCTGCTCGTGCCGGTCC CCACGAG CTGCGG XJOCGGC^
GAGGTGATCTCCGCCRCCTCCACCTCCCTCGCCGAGTACCAGCGCCCGGAGAAGAGGCCT CGGCA CAGGACGC GGAC G AGGCGCG GCCGG CGGCTC'CGA GTGCTCAGAGGTGATCGG GGCGC G GO CGTGCCCCGCCGAGGTCGAGGCCTCCGAGTC T^^^^
ACC^G CCGG^
CTGACCCCGTCGGAGCCCGAGGAGGATGAGGAGGTGCTCAGCGGGACTTGCCGCTGCGCC GAGT ACTCCCr€AG€CC€CTGATCAGCTC€CCrrTGACCGAAGACCGCGGCGCCGA CG€CGCCC€CTCC GCGAC^C TOTOT
TTGTTGG^
GTGATGCGAATGCTTGTGTCAGGGGAGGCGGTTTGAGGACTTGGACGACGAGGAGAGCTA CGAG CGGTTCCGGCGGCGCGAGCGGCGCGAGGCGGTTGCACGCGACTACACTGAGGTGTACGGC TCCA TGTCCGGCAGCTACGGCCCTCTCGTCGTTGAGCAACGTGTCGTCATGGTGAACTGGATCA TCGAG GTCAGTGTATACTACATTC
TCCGATGTGACCGATTTGTTGCATTTGTAGGAGCTGTATAGGCTTATGACTGCATTG TACTGGTGG
CATACGACTCTAGATCCTCCGTTGGTTAATTGTTTTATTGTCAGACTCAATGGTCTA GAATTTGTT
TGCAGTGTTCAGCAAGCACGGTATCTGCATAATTGCATAAGAAGCTGTTCTCTGGTG TAAATGTT TTTTTAATTGATGACTATCTGGTTCCCTTTAGTATTTTGTGCTGGCGAAATTATCTACCA TCTAAAA
GTTTGTTAGTTTGCCATTAGTTGATGAGTGGATTTGGAAATGGCGACACTATTGTGA GATATCAG
GGTTCAGGGGTGTCCTTGTAGCCTGTCATTGAGTAGTCATGCTTGTACTGACTCAGT TTGAGCACT
CAGTTTTTTCAACCAAATTGTCCTTGTCCCAGAGATCAGCTATGTCTAACATGGGCT GTTTGAAGA
GGAAGAGAGAAATAACTGGTTGGGTCGCACAAAAAGAAAATCTCTGTAGTTCCCATT TGGCATTT
GAAACTTCATTTCATTGGCATGTTGCTTCCATGTCTGGGATTACATATTGTAGCAAT TAGGATACC
TGAACCTTCGCTCTCTAAATGCAATTGTCTACCTGTAACTCTGAATGTGCCCTTCAT TGCATATGC
ATCCCCACAAATTTGAGCTGTTTTGATGCACTTGGTATTTGTTTGAGATTCAGAGAA ATCCTGAAA
GCATTGCCATCACCATTTTCCATCAGAAGGTTTCTTGAAATTATGCATTACGAATGT TTTTGAATA
TTTACGTTGCACTTTACTGCTGTGTCATTACAAAAGCTTGACCTGAACAATTTATGC TCCTTCATTT
CTGCTGTTCCAAAACCAAAGTTCACTTGCAAAGAATGACTCTTTCCCTTGCAATGGC AGCATTCG
CATCTCACGAAG1T< ¾^
ACAAGGATACATGAAGGGTCTGAGAAATCTG AGTO€TCiGGTATTG CTGCATCAC:CCTGCi€ A
CCCGCATAGAAGAGAACCAGCCGTACAATTGGTAATGTTCTCCCffGT
CATGTCTGGTTT
TTCATGCTCTATTTCAGTATAACTACCGACTATAGTTGGTGATCCTGTGTTGTCAGA TTGCCTAAT
TGATATATACACCATTGACATTCAGCAGCAATTGATGCATAATTAACTACATTAACA TTCAGAAG
CAACTGATGCATAATTAATTGGCTTAGCACTCCAAATTAACTCCTCCTTTAACCATG CATTGGTGC
TGGACATTCTCTCAGATTGTCAAATATATGCTGCAAGGTATAACTGTGTTTCTTTTA CCAACTGCA
TCACTACCTCACAGAAATATGGGTTGGGTTTGAGTCAGTAGAACTATTTCACTTTTA AATTATGTT
ATGAATCTGAACTATGTTAGCGTGTGACAGTGGTTGTCATGTTATTTTCATTTTTTA TTTGTCCCTT
TTTATACTAGCACTCTGTAGCTTGCATTTAGAAGTGTGCACAAGGTGAGCATGTCAG AAGTGTTA
GAGTATATTAGGCAACTCCGGATATGGTTAGTTTAGGATTGATTGTAATCCCGGGAT AATCTTTCT
TATCTCTAGGAAATGCTACTTGCCCTCCAAGCCATGTACTCATATATATACCGCCCA AGGGGCTC
AATGCAATACATCGATCACATTATACACATCCTACTTTCTTACATGGCATCAGACGC CTAGGTTTT
AGATCCTGACCTAGCCGCCGCCGCTTCCGCTGCCGTCGCGCCGCCCCCGGGGAGATC GATCTCCG
CCGGGGGTAGCGCCCTCCTAGGATGCGCCGCGGATCCCTATGATCCGCACCGCCGTT GTTGTCAA
CACACAGCAGAAGGACCTACCTCCCATGGTGATGCGATCTCGTTGGCTCCCTCATTG CCCCTTCC
ACGCCGACTCCCCCCGTGCGCGCGGCCCCGTGCTGGCCCTCTCTCCTCCTGCGCCGT CGCTGCCCT
GTGCGCAGGCTGCGGAGGAGGCACGAGGTGCGCCAGGCTGCTGCGCCATCGGAGCCG CGAGATT
GGGGCGGGTTACCGTGCCGCTGGTCGATTCCGGCACCGCCCGACGAGATCCGCCGCT GGGACGA
CCTGCAGCGGGGCCACCCCGTTTCGCCCCGGCCTCCATGCTCCACCACCTCGCTCCT CCGCCATCG
CCGGCGGGCCGCACCATCGTGGGGGCTCCACCCCATCTCGGCTTCCCCAGCTCCTCG CCGCCGTC
GCCTCCCGACCCGTCGAGACCGGCCTCGTCCGCGCCTTCCGCGGCGGCGCCGCTCCC ATAGTCCC
GATCCGCGCCTTCCTGGAGCGGATCCACTTGCTAGAGGCGGAGGCCCGCCGCTACGC CACCGCCG
CGCCGGAATCGAGAAGGCAGCGCCGGCGGCCCACTCCCTCCTTTTCTTCTCTGCCCG TGTTGGCC
ACGGGGGGAACAGATAAGGGAGGGGGAGGGGCACCGCTAGGGGTACCCTGAGAATGT ATCCAT
GGTTCGTTTGCTGCAGCTGATTTTCTTTTTTTTTCCGATCTAAGATCGGTTTGCGTC GCCTTGCCAT
TCGTCGCCGTTCATCGACACTCCCGAAGGACGAGGTTGTTGCTGCGCCTTCCAGGTT GCAGCGAC
AGCGACGACGTTCGTGATCAAGCCACCCTACCGGTGTCGTCGCCTTTTCAGGCGGTG GCGCCACT
CGCCGCCGGTCTTCGTCAAGCAGGCGCTCGATCCGTCTCCGCGCCTCCAGCCGTTCT CGTGCAGA
CATCGCCGCCGATGTTCCTGAAGGAAGACGTCGTCGCCACCCCTGATCTGAGCGCGA CACTTGCT
GCAACCTGCGCCGTCGCCACCCCTGTCCTGAGCGTGACCTAAGTCGCAAACCGCGTC GTCTACCA
TCGTCATTCGCCGCACCGTCCTGCTGCTGTCTTCGGCAAGAAGCTGCTGAGTTTGTG TACTCGAGC
ACATCAACGATGCTTCGACCCGCGCCCCCTCTACGGCTTCGACCACGTCCACCTCAA CTTCGGCT
ACTACGGCACTAAAGGGCTATCATTTGCATGAGTCTCTAGTCAAAGCTTTCGCACCG GCATTCCG
ACTGCAGGGGGATATGTCTCCATTGTTCTCCAGTCTAACCGTTCGTGTTGCTACCGC TACGACTGC
GGGGGGATGTTAGAGTATATTAGTCAACTCCGGATATGTTTAGTTTAAGATTGATTG TAATCCCG
GGATAACCTTTCTTATCTTTAGGAAAGGCTACTTGCCCTCCAAGCCATGTACTCATA TATATACCG
CCCAAGGGGCTCAATGCAATACATCGACCACATTATACGCATCATACCTTTCCTACA AGAAGTTC
CTTAGTTTCCTGATTGTGCTGTCCTGCTGTTATGTGTTAACAGTGGTAGAAGAATTG AGCATACTA
ATTGGCATTTTTTGTTGATTGATAATATCTTTTGCTATATGGTTTTCATTCCTGCAT TTTGCATTTGT
AAGACAATTCACAGACATATCTTTAGTTGAAATTGCGCTGAAACGTATCCTATGAGT TTGTCTCCT GATCATAACCTGTTTCCAATTATTTATTTCTGCTAAGCTTGATGTGCAACAGTTATGTGG TTTCTTG
ATTCTTTTCAGCGTCCTTCAAAAGACATTTAAAGTTGGGATCAATATTTACAGCCGG AGTGATGTT
GTTGCCATGGAGTGGCTGGTTCTGGAGGTCCTCAACTTTAAGTGTTTTGTCACAACA ACTAACCAT
TTCCTCTGGTACCACGAACTTCCTGCTTTCTTGTCTATATCAGCTGAACAAAAAGGA GAGGCTAA
CACCA C G
ACCTGGCOAACTACCTGTCC TO^
GT ATGGAGGTGAACACCCGAAT
TAGAACTCAGTAACCATAAAAGCCAAGTATGAACTGTAACTTTCTGAACCTCTCGTC TGATAACT TCTAAATATATATCGACTCCAAAATTACATGTTCTGATTCTAGCTAGACTGGAAATGCAA ATGCC TAACGAGTTGCTCACATGTATGCAGACTCACGTGAGGACGCACGACGACGATCTGCATGA ATGCC T A ATGGTACATTTTTCCTCTTTTTCTATATAAAAAAATTACC
CAGTTTGTAGCACTAGGCAAATATCGACTTTTATCACTCTATGACACCTAATTGTCA TTAGCCTGT
TCAATGTTCTTTCATGCAGAGCCTAGACTGGCTGATCAACTACGCTTCGTGATACCC ATGACTCC
AAGGTGATGACATTGATCC AC ATrXTrXGCTGAI C CC AGTX ACCT ATC AC AGTAC A ACC GOTCOG
GTATGGGGATGTATAGGCCATACGTGAATCTTAGCATTGACACATTATTCTGCACAT GCCATTAG
TTTCCCTOTAGGTAGATAAGATAAAAGAAAGGCAGCATAAGGTAGCGTCTGArTATG AGAGAAT
GGC AGGAGCAAGACAATGACGAGAATTOGTATTTAGCTGTCGCiCCTTTAGGGACTATACCGA A
CTGCCGTArTGGGCTATATATCTTTGCATCArrTCTTCGCTGTTrTATGGACAATTA AAGCTGCTCT
GGTCTTGTTCATACCTGATGTAAGACTGAAATTTGTTGCCTGCGTGTGGATCGGGGC GTGATCTTG TGATCCTGGAAGATGCTTCCGTATCC^^
CAGATCTGTCAGCTTGTCTGCTAGACTGGAGATCGCAAACGAATAAAGTTTCAGTTAACT GCAGA
TCACTTTCACGTGT
Poplar Populus trichocarpa (SEQ ID NO: 44)
>Potri.010G103700 | Chr10:12459034..12460255 reverse
TGCAGTTTCTGTTAATGAGTGTGTTGTAGAGAAGCAGAAGAAGCCAAACAGCTTGGGAGG AGGA
GGAGGAGAAAGTGATGACCTGGCTTGTACAGAGGAGTTGTATGTGGACGACGGAGTT TCGGATT
ACTCGTCTTGTCAGGAGACGTTGTTCTCGGAGCTGCAATCGGAGATATTCCGGGAAA AGTATTCA
TCGGACGACCTCGATTTCTCTGATGATTACACGCCGTCTATTTTCTTCGAATCTGGA AGCGATTTT
TCTGAGAAGTCCGTAAGTGATTCGAATCCTTCGCAGACTTATTCCCTGTTGCTCCAG TACAGACA
GCAATTCTCGCGATCTAGTTTACCTCTAGAAACTACAAAATCATCGTCACTCCTTGA AGCAGAGT
ATCAAGAGAATTTCGCCGTGAGTTTTTGAATCAACAATTACTTTTGTTTTCGTGTTA TTTTGCTATG
TTTTAATGCTCTCATTGTTTTTTAATTTGTTTAATTTCTGTGCTTTTCATTTGTTTT TATTGCTTAAG
TTTGCGAGATTGGACGATGAGGAAGATGAGGAGAGCTACAAGAGATTGAGAGAAAGA GAGAGA
AGGCAATTGTTTTTGCACGACTACCCTGAATTGTACCGTAACAACACGGAGTTCGGC GATCTCAT
CCTCCAGCAACGGTTGCAGATGGTACACTGGATTATCGAGGTAAGTTTTTTATAATT AATCGCGG
TAGCTAACAAATCAATTGCTCGGATGCGGCGATTTTGCGACAGTTGGCTATTTTTTT CTTGTCAAT
ACACACACGTTTGAAGAAGTTGATTCCTTCAATTATGGAAGATGTCCTCTACACTGG AGATTCTA
CACTGGAGATTCTACTACTGGAAGTGTTAAGCAAATGGGGTTGTGACACGTCATAGT TTGAGATT
TTGGGTTTTGGACTTTAAAACTGTATTTGTACCATTTGTGGTATGGTCAGTACTCAG TACTGTCCT
TGTACTAGATTAAACTAGATTTTCTCTGTGGCTCATTTTTTCTTTCTATTTACTAAA GGGTATTTTT
ATCATTTAAACTTGGGCTGTTTGTAGTGGTTTCAAGGCACTAAACACTAATCAACAA CAGTTGTT
AATCACTTGTTTATCATTTTTGGACTTTCAATTGACAATCTGTATGAGCAATGTGTG CCTTATGTTC
TTCTAATTAAATGGTAATCACTTATGTTAAAAAAATGGTAATTGCATGTTGAATTCG TGTCTGTCG
CTTTGATTATGCAGCAAGCAACTGCGAAGGAGTTTGAACCGTGTTTCTTGGAATTAG CCTTCTGG
ACCGGTTCCTAGCAATAGGGTTCTTCAAGAACAAAAGTCACCTTCAAATTGTTGGTA TAGCTTGT
CTTTCATTGGCCACCAGAATTGAAGAAAACCAGCCCTATAACTGGTAAATATCTCTG CCCCGTTC
TTTTTTGTGGTCGTGGTGTCCTGGATTGCTTAAGGAATAAAATAAAACAAGGTGCCA GTCTTGGA
TGCATAATTCTTTCCTTTTAGTCCTTTGCATACTAATGGCTTGCATTTACAACATAG AATTGTCAA AAAACGATTAGGATCGTGAATAAACTTGTTGATCATCTTAAGAAACAGATCACAAGTGAG TGTTG
TGTGGTTATTTTTTATGGACTTACTAGAAAAGAAAAATTGCTGTCAACTTGTTGCCA GATCAAGG
CTCAAATGAACTTGAACACACTGGATGAACTATACTTTAATTTCTTAACGTTTGCAA GTACTAAG
ACATTCCTATACCGCTCTCCAGTTCATTCTTTTTCATGGTTATATAATCTGCCTCAT TTTCATCAAA
GCTAGCTGTCCATCACTTCTCAAAACTGAAGGTAATATGATATCTAGCTGCAAGTTC TGCTCCGGT
CAATGAGAGCTCCGCTGACAGTTTTATAATGGGAATTTCAGTGTTAGGCAGAAGAAT TTCAACAT
TGGGAACAATGTGTACAGCAGAAGTGAAGTAGTGGCCATGGAATGGCTGGTGCAGGA GGTCCTT
CTTTATAGTTT
TTCATATGACCTGTAGTTTAGGAAGTCTAATACAACCTTGTCCTTTTTCCATGCTGA ACATTAGGT TCTACCTGAAAGCTATGAAAGCTGGTGCAGAGGTGGAGAAGAGGGCCAGATACTTAGCAG TGCT GGCACTGTCAGACCTTGAGCAACTTAGGCATTGGCCCTCAACAGTTGCAGCCACGCTTGT CATCC TGGCTTCTCTAGAAAGCAATGAAATTGCATCCTATGGACGAGTTATCGAGGTATAAATAT TAATT
GGCAACACAAGCTTGCCGTGCTCGAATACCAAAATAAAACGATTTGGGCTGTTAATG GGTGGAG
GAGGTGCTGTTGCTTGTTTCATTTAATATTTTTGGACAAGATTTGCCGTATGATATG ATTCAGTGA
AATATCTGAAAACTGTCCTAGGACATACTTTTTCCCCGGGCCTGGCCCCGGCCCATT CCTAAAGT
GTACTGCAATTAACATGCTCTTAGGTTCTGATTAGTCAATTTGATTTTACAGGTTCA TGTAAGAAC
AAA GAAAATGACCTCXrAC AGTG ATAAAGGTATGCAAATAAAGATCTTf cf
AAT f GTACCCTCTT TTT TATGCA^
ATTCACATTCTCGAAAGCTAGTGCCGCTGGTGTTGATTTTGCCTTTATTGTTAATTTGAT TCGAATT
ATTTGACTTCGGTATTAACAGAGCCTAGAGTGGTTGCTGCAATATATGAGCTAGCAG TCTAGCAG
GAGAAGGAATGAAAGATCATAAGATCGCCTCTTGTACACTCGCATTCTTTTCTCACA CGCATCAC
TGGTTACTGTAGATGAAGATAATTGTAAAGCCTGATAATAACAGGTAACACAAATCT ATTTCTTT
TACGGTACATGCTTTTACCGAACTGTTCATAATATAGAATGACAGGATCTGTATTCA AGGAGCTG
GCTCATGTAAATTCAAAGACATAATATTCCAAGTATTCTTTTGTATGTTCTATGCAA AACGATGAT
GGGATTATGCAGACAG
Rose gum Eucalyptus grandis (SEQ ID NO: 45)
>Eucgr.B02694 | Chr02:46870743..46873621 reverse
ATTTTTCAAAATATGACTATTTATATTGTTTATAAAAATAAATGGACGAAAAATATTTTT ATATTA
AATTTATTTCGATAAATTATTTCAAGCGATATAATTTAAAAGCTCATCAAACTAATG GATTAGGTC
CGGTCATTAATGTTTCATCTAAATTGATCAATTTAACACGTTTTTACTTATTTTTAC GTAATACAA
ATCTAATAACTTATACTTGACCCAACCCGACTCGACACATATCTCAATCGTTCCCAA CCTAATCCA
ACCCATTTCCAACTCTATCCATATTGCAAAGTAAGTATGAGATCAAGTGAAAGCGAG AGAGATTA
AAGATAGAATTATAATTATAAAAGTGAATTAGATCTAAGTAAATTGTAAATCCATTT ATGACCCA
TTTACTCAATATAATTAATCTATTTATAACTCATGTAATATCCATATGGATTAAGAA ATTGGTTTA
TGACCATTTTAACGGGTCTAGATACAAGTGGTTGTTTTCAATTTTCTTTCTAAATCA TTAATTTTTT
ACAAAACAAACAAAGTCTAATTATAATTTTTTTTCTCTCATTAGAATGTTTTGATGT TACAAAAAC
CTAAAAATTCGCCTTATGTTTATCCTATATTCTTTATTCACGATTGCATTGATATTT TAAATTTTTT
TTTTTTGATAACTGAGGATCCGCTGGATCCGCCTTTCACTTATGCTAATTGGCAACC ATGGCCCGT
ACAATGCACGGTGCACCTTAGACCAAGCATCAATACCTCAGGAAAGTTAGCACAGAA CTCCACC
ACCATAATCCCTCTGCTTAAGATGTGTTAGCAACTACGAAGTTTCAATTTTGGGACC TCTGTAGTA
AAGTGCTCAAAGCCCAACTAACTCAACTACCCATCGGTGGGTGATGTATTTTAATTT GGTACTGT
AAAAAAAAAAAGATATAGAAAAGCTTTAGCATGATTGCCAATTTCTCGATTGATCAT TTGAAATG
TCAAAGCATCAAGTAAAAATTGATTTTACCCTCTTTGATAGATTACGTGTTTATTCT TTTAGCTAA
AGCTAGATTATATGTTTTGAATGAATTTATTGGATTCTTGAGGCTAAAGGGTCGAAT CCCGAATT
ATTCCCTCCCTTCTTTCTTTTCTTTTTTATTTAGCCCTTTAAAGGCTCAAGTCTAAG AGGCAACAAG
AAATTAACATGTTTTGATTCGGTTCATTGATTTGATGAATTGAATCAAGTCAATATC AAATATGAT
CAAAACTTTCGACCATTCCAATTAAAGTCAACTAATTAAACCGATTAGTCCAACCCT AATTTTAA
ATAACATTAGAGTTGTCGCTCACCCCTCTTCCATCACCACTGTGGCTCTGTGCCGCC TAGTGTTCC
GACGAAAAGGCGAGGCCGACCATCTCACATGCAAATCTTTTGCCCTTCACGTTGGCG GAAATTGC
AGCACTGCCACCGTGAGCATTCTGACCGGACAAAAGAGACTTTTTCGCCACTTCAAA AAACTCGA TAACTGCCCGGTAAACGTAAAATATATACATATATATAGAGACCAAAAAACTAACCTATT TTCCA
AAAAAAGGAAGGGAAACCAGCCCATAAATACTCTCTCGGTTCCTGAACAATTCTCAT TATCTGTG
TGGGGCCCATGGCCGAGCCAGTGCGGGCCGCGCGATGATCAAGTGGATCACATCAAC GGCTATG
ACCGATCCGCAGTTCAGTAGCAGGAGATGCCTCATGCGGAGCGATACACTACATCGT GTCCCCTT
CACGACGTGGTTAAACCCACGTAAAACCACACTCCCCAAACTCCCCATAACCGCCGC CTCCTCTT
CTCCTTCTTCTCCTTCTTCTGCTCCTCCTCGCCACACTGCGATCACTTCACGAAATT CCTCCGTCAT
CAAACTCACAACGGCACCGTCTCCAAGCTTCGAACTAGTTCGACGAAGTTCGATTCG GACGACGG
CGATGGCCAAGACATTGATCCGTTGAAATCCATTTATAATTTGCATAAATTGCTTGTAGA AACCA
CTTTTGCATGGCATATAGATAACTGCATACATTCCTCACTGTTAGATCCATGCCTGT ACATAAGAG
ATTGCTTCAAGCTGGACAAAACGTCATGCCTCTATATATTTACATGGATCCTGTTTA TATGGATCT
TTCTTTCCTTCTGTGGTCCTTCCTTTGACCTCTTACAAGTTTGATATTCCATGTTAA CAGCAT CTA
GTCH SAGAGAGCTCCACAAC^
AAAGGATACTTCAAACAGCGAAGGAACTTCCAAATTGCTGGAATAGCCTGTCTCACCCTA GCGAC
AGAGC J( J AI G( · ΓΛΛ Γ( AA I GC ( - TTT( GGTTAGC ' A A A
TGTGAAGATGCTTAGACCTCGAGCTTTACAAATCTTCTTTTTACTTATGCTCCCAGC AATCATTTC
ACCATAACTTCAGACGACACCACATTTGGAATGTTCATTGGATCGAGTAATTTCAGT TACATGTG
GTAGACCGTTAGGAAATACATCATGAACAACTTCAAAGGTTTGCCATCGCAAAGGAA TTACCGAT
CACTCAACCTTCACTATGTCCGGGTCCGGGTTGCCAGTAGTGACTTACAATCACTCC ACTGCATA
ATTCATTTATGGATGGTAAACACCCAATTGGTTCATGCTTAGTCCCTAACAGATGTA AATGGTACT
GGTAATAACCAATTACTCAAACTTCCCTATGTCCGGGTCCAGGTTGCCAGTAGTGAC TTAAAATC
ACTCCACTGCATGACTTCTTATGGATGGTAGACACCCAGTTGGTTTATGCCAAGTCC CTAACAGA
TGTAAATGGTACTGGTATTTCAGCGTAAGACAAAAGAACTTCCGTGTGGGGAGAGAC ACCTACA
GCAGATGCGAAGTGGTGGCAATGGAGTGGTTGGTACAAGAGGTTCTCAACTTTCAGT GTACATTG
CCTACCATACACAACTTCTTATGGTACAGTCCATATCTTGTCTGCGCAACCTCGATC GGCAGACTT
ATTTTCTTTTCTCTCATATTGTGTGTTTTCATGGGTTAAGACTTACTTTTCAGACTT AGCAA^
GACCACTTGACTAATCTGAGTAAAGAAAGTGATTCTGCTGAATATTACTGAAATATG CAGAAATG
GGCAG TTTAGCTCTGCTAGACCATGAGCAGCTGTCCTACTGG CTT CACAGTTGCAG TG GC TTGTCATCCTAGCATCAGTGGAAGACGCATCCTGCAAGCGAGTCATGCAG
Cherry Prunus persica (SEQ ID NO: 46)
>Prupe.1G335600 | Pp01:31684789..31689557 forward
ATCCAATTATTAAAATAAAAGATTCAAGAACCTCAAGAATCTCTCCCTCC CTACCAGAAG
AAGAAGAAGAAGAAGAAGAATCAATTAATTTCAGAGATAAAGAATTCGTTGAAGAAG AAATAA
TGGCTGAATGCAGTTACCAGAACATTAAGAACCCACAACCAGAACCAGAAGATGAAG AAGAAG
AATACACAGCGGTGGTTGGCAAACACTTGTCCATGCTCCGCCTTGACAACAGCAGCA GCAGCAG
CAGCAGCTTCAAATCTCCCAATTCCAGTCCCAAGCCCAGGAGAACATTGAAAAGGCG ATCCCCGT
CCCAATCCCCACCAACATCCCAACCCAACCCCAAGAAAGAGAAGCTTGATCTCCCTC CTGATCCT
CTTCTTCGCCGCTGCAGTTCCGAACGCTTCAACCCAACTTCTCCTCCTCCTCCCCCA TTTTATTCTT
TTAATTCTCATCACAATCAGCTGCAGTGCCCCAACGCAGCCTCTCCTGCCTCTGCCT CAACAGATA
AAGCCTCTGGCGCGGCTGCTCTCTCTTCCTATGCCTCCACACTCCGCCGCTCCGTTT CCAATCCCA
AGCCTTCTTCGTGTTCGCCTGCTCTCAAAACCTTCTCCCGTCAATCCTCCTCCTCCT CTGGTGACGA
AGACGACAACGACGACGCCACTCCCAATTCTAAGGTTTTCTTCCTTCACCTCCATCT TTCATTCTC
TTGACTTTGAATTTCATCCGTATCTTTCATATGAACGTATGGTTCTTGAAGATGAAT TTCTATTATT
TTTGCAGAGGCTTAGAAGGATAAAATATCGCGTCAGAGAGATGAGCCTGTGGTTCCA ACAAGTC
ATGCTTGAAAATGAAGATGATGACGAGGAAGAAGAAGAAGAACTGGAACTGGAACCT CCTCAA
GAACAACATCATCAACAAAATGGAGACACTACTGAGGTTGGTAACTCTAACTCATCA TATTTTAA
TCATATCTTTACAACAATTGGTTTCGTTGGTTCCCAAGAATTAGAAATTACAAATCC TTTTTCACA
AGTATTGGGTTCGGACTTAGAAATTCCGTTGATATTATCATCATCATCATCTCTTTC TTTCCGTTGA
TGAATTTGATGTTGCAGTTGCAGGTCGATAGTGACATAAATTTTGCAGAATCTGTGA GCGTGGAG
AGGATGGGGGATGGCTTAGTCATTCATTTCAGGTGCCACTGTGGCGTCCCCTATCAG TTCCTTCTT
GCTGGGGGCAACTGCTACTACAAGCTCATGTAGATTTGTATTTCACAACCCCCTTTT ACCACTAGA
CTACCCACAAAAACCACTTTTTATTCTGCCTTTCTTTTTACCTTTTGCTCAAGTACA AGTTCATATG
TGCAAACGATGGCATTAATTTGTTGATTCTTCTGGCTATATGGTTATTTTCTTTCTG TATTGCTGTA
TTTCACTTACTCTTGGCAAAAAGAAATGTTCTGCTTTTTTATTCTACTTACTAATCA CAATGTTATC
AGCCTTACCACCTGCAGTAACTAGAAGGGCTTAGTTAGTACCATCTCCAAATAGTTG TGCGAGTG
TCATACATAATCTATTTTAAGGATATTTTGTCATTCGAGGTATGAATATTACTTTTT TTTTAAAATC
ACATAGTCGCACATTGCAAAAAAAAATTTAAGTGACAGTACCATATTTTTTATTTTT GAAGAGAG
ACCTTGCGGGGAGGGTTAAAGTTGAAATGGCATTGGTATAATATTTATCAATAATAT TCACTTTG
CAAAACAGAAGAGTTGCGCCTTGCGTATCAAGTAATGCTAAAGAGGGCGGCCCATAT GCCACAT
CAAAACCCATCATTAACGAAAAAAGAACGCACACAGCCTATACTAAAAGACCAAAGA GGACCTC
ATTTTCCCTTGCTTTCCT TTTCATTACTCCTTCTGTATTGATTTTTGTTATTTTAATAATATTATTTTAGTTTCTTAA GTTCGAAG
ACGAGGAGGACGAAGCGAGCTATCAGCTGCTTAGGAACAGAGAGAGGATACAAGTAT TTTTGCG
AGACTACACGGAGGAGTACTCTTCCACGACGGAATGCGGCGATCTTATCCTCCAGCA ACGGTGGC
AAATGGTCCGTTGGATCGTCGAGGTGATTGGCTTTACCGAAATTCACGTTTCTCTGA TTAAGTTCA
ATTAATCGTCGTTTTCTAAATTTAAATAAGGTCGAAGTTCAACTAATCGTCGTTTTT ATCTATTTA
AATTTGGTCGTAGTTAAATTAAGCGTCGTAATTAGTTCTGTTTGGAAATTGGACCTG CACACGTTT
GTGGAAGTACATGCCGTCAAGTAGCACATACTATCTACATTGACGATATTCTACAAC CATTTAAC
ATCCAATCAAATCTGTGCCACGTCATTGAATGGATGTGTTTGGCACTGAAACTGAAT GCATTTCT
AGTTTTTATGTTAGGATCATTCATGTACTTTTTATCACAGGCACTGGCACATCACTA GTTTCGCTTT
TCTCCTATTGGCCAAAGTAATACACATTTGTAATGTATAGGGAGATTAATTTGATAC ATTTTTGCG
AAAAATGATTTTATGTAAATTTGATACATGTACGTCTCCAAATTCAATTTTACGTAA TCCTTCTTA
GCTTAACAACTTCCACGTTGCCCCACACTTAGAACGCAGTAGTAGCACGTGCATTCG CACCAGCG
ATGGTGCGTATATAACATTTGTTGAGGGGTACTTGTTACTTGTTGGACAAGTCTATA CACTCCACG
ATTTTTTGCGTAGGTGCGATTAATTTCAAGAATTTCAATACAAAGAACTTGCATACA GCATACTG
ACTAGGGTGGATATCCAAACGGTCAATTAAGTAATTGGATCGATTTGGTTCCATTTC TATAAAAA
AAAGAAAAAGAAATTAAATTAATTCATAATTAGTTTGGTTTAGTTCAATTCATTCTC TATAAGAA
CAAGCTAAATCGAACTAAACTGCATATTATTATTTATTTATTTTGGCACAAGGAGTT TTATATTGG
TAATGATCATTTACTATTATGTTTCTTCCCACTATTTATATCAACATTTAATATAAT TGTATTGTTA
GTTTGTTACTTTAATGGAATGTTGAATATTGTTAGTGTACTAGAGATTTAAAAAGTA AAATGGTG
AATGCAATGTTTGCTTTACAAGTGCTTGAAATTGTTAAGATTGTTAAGTTATTTTGA CTCTCATGT
ATAGAATTCTTGTGTTGCACATGAGTATATTGGGTGGTGGCTATGTTTATACTTTGT TGGTGGATG
TTCGAATGCCATTGTCAATTTGCTTTGCTTGACATGCATTGAGATGTGAATTTAAGA TATTGATGC
TCTTCATGCTTTTTCTGATAAAGTGGTAATAAGAGATTGCATTATAGTTAAAAATAG TGTTCACCC
TCATCATTATAGTGGTTAATTTTCACGTAAAACTCTAATATTCCGTTTCCTGTGGGA GAGGGTGCA
TAGGCTAGCTGTTATCCGTATTTCTTATAACTAACGTTTTATTATCTCTTGTTATTA CGTTAATGCG
GTGCTTTTGATTGGCTTTCAACAAGCAGCCATCGAAT€AAATGAAGCTA€AG AGGAAACGAAC mCTAGGAGTTAGCCTCCTTGACCGArrCTTAAGCAAAGGATmTCAAGAG UAAGGATCCT
TCAGATTGTTGGAATAGCCTGTCTAACTCTAGCCACCAGAATAGAAGAAAATCAGCC CTACAACT
tlGTATAT rTI ΑΊ A
TGTTGAGGTGTTTTTTAGCCTTTTTTGTGTGGCAATTAAAGCATTACTTATAGATGAATA CAAAAA
TCAGAAGTTGAAGCAGTCCAATTCCTTCTGCAAGTGATTTGTCTGGAAATGAATGTA TTAGAGAA
ACTGTGAAGTTGCTTAGAGCTCAAACTTAAAGTTAACCCACATCCCCTTTTGGTACT TAAATAAC
TCTCTTGCATTGACAACCTGGCAAACTCAGGCTCCTGATCTTCTTTCTCTGCATATCTAG ATTATC CTTCAACTTTTATTTTCTTTTTTTCTGGGCAAAGAAAATGTTTCTGATTTGAGAGGTTCA TGCCATC TTCATTCCATGAACTAATTGGATAACATAGGTTCTACCTGAGAG TGCTAGAGCTGATGCC AAG TGGAGAAGAGAGCCAAGTACTTGGCAGTGCTGCAGATGTCGGACCATGTGCAACTTCGTT ACTGG
CAACGAGTCATAGAGGTAACTGCHTAATC GA I ' GCCA
TTTGCAGACTCATGTGAGAACAGAAGGTGATGATTTACATGAATGCATAGAGGTAAGGAT AAAA TATGAGGTATCATAAAGTTCAA^
AGCATCATTTCTTTTCTGTTTTTCTTAATGTTTCGAATATATTGTCATTTTAAGATAGTT GATAGGT GCTGATGGTGTGCTAACTTTAACAGAGCCTAGAGTGGTTGTTACATTATGTGTGATTTCT GTTTGC
TGACTCCCTCAT ^GAGATGGATC mAGGTAGATCAAGGTAAAGCCTGATCAATAGGTAAC AAAACAAATCTGATTTTTTCGTCAATTAAGACGACCGTGCAGCTACTTGTAAACATTTCA TAGAA GTACAGAATCTGTAATAATATCTGATGGTCTCCAAGGACCAA^GTAAAmTATOAACTTAT GT TTGAAAAGTACTTCA TA T ACCATGAATGTTTTACCTGCTTTQTTT TAGCATGCGTCATTATC
TTGAGGGCGTTGACATGCCCCCTAAAGTTTGAAGGT
Regulatory introns - Motif 1 (SEQ ID NO: 47)
TGTTTTGGTGGGAATGCTTGTGTCAGGTCAGGTCAGT
Regulatory introns - Motif 2 (SEQ ID NO: 48)
CTAGCTAGACTGGAAATGCCTAACGAGTAGCTCTTTACATATATGTAGGT
Regulatory introns - Motif 3 (SEQ ID NO: 49)
GGTGGGAATGCTTGTGTCAGGTCAGTG
Regulatory introns - Motif 4 (SEQ ID NO: 50)
CAGTAATCTCACTGCTTGATCCCTTTCAGGTACCACGAATTTCCTGC
Regulatory introns - Motif 5 (SEQ ID NO: 51)
TGATTTTGCAGGTA