Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
PLANT EUKARYOTIC TRANSLATION INITIATION FACTOR 4E
Document Type and Number:
WIPO Patent Application WO/2011/161466
Kind Code:
A1
Abstract:
The invention relates to plants, and in particular to virus-resistant plants, and to methods of generating such plants. The invention extends to eukaryotic translation initiation factor variants and isoforms thereof, and to nucleic acids involved in the splicing of such variant factors, and uses thereof in methods for producing plants that are resistant to viral infections.

Inventors:
WALSH JOHN ANTHONY (GB)
NELLIST CHARLOTTE FLORENCE (GB)
BARKER GUY CAMERON (GB)
JENNER CAROL ELIZABETH (GB)
Application Number:
PCT/GB2011/051192
Publication Date:
December 29, 2011
Filing Date:
June 24, 2011
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV WARWICK (GB)
SYNGENTA PARTICIPATIONS AG (CH)
WALSH JOHN ANTHONY (GB)
NELLIST CHARLOTTE FLORENCE (GB)
BARKER GUY CAMERON (GB)
JENNER CAROL ELIZABETH (GB)
International Classes:
C07K14/415; C12N15/82
Foreign References:
US20050255455A12005-11-17
EP0242246A11987-10-21
EP0249637A11987-12-23
GB2197653A1988-05-25
Other References:
SATO M ET AL: "Selective involvement of members of the eukaryotic initiation factor 4E family in the infection of Arabidopsis thaliana by potyviruses", FEBS LETTERS, ELSEVIER, AMSTERDAM, NL, vol. 579, no. 5, 14 February 2005 (2005-02-14), pages 1167 - 1171, XP004745623, ISSN: 0014-5793, DOI: 10.1016/J.FEBSLET.2004.12.086
RUFFEL S ET AL: "A natural recessive resistance gene against potato virus Y in pepper corresponds to the eukaryotic initiation factor 4E (eIF4E)", PLANT JOURNAL, BLACKWELL SCIENTIFIC PUBLICATIONS, OXFORD, GB, vol. 32, no. 6, 1 December 2002 (2002-12-01), pages 1067 - 1075, XP002242998, ISSN: 0960-7412, DOI: 10.1046/J.1365-313X.2002.01499.X
ROBAGLIA ET AL: "Translation initiation factors: a weak link in plant RNA virus infection", TRENDS IN PLANT SCIENCE, ELSEVIER SCIENCE, OXFORD, GB, vol. 11, no. 1, 1 January 2006 (2006-01-01), pages 40 - 45, XP005242590, ISSN: 1360-1385, DOI: 10.1016/J.TPLANTS.2005.11.004
ANDREW J MAULE ET AL: "Sources of natural resistance to plant viruses: status and prospects", MOLECULAR PLANT PATHOLOGY, WILEY-BLACKWELL PUBLISHING LTD, GB, vol. 8, no. 2, 1 January 2007 (2007-01-01), pages 223 - 231, XP007916091, ISSN: 1464-6722, [retrieved on 20070209], DOI: 10.1111/J.1364-3703.2007.00386.X
FLORENCE PIRON ET AL: "An Induced Mutation in Tomato eIF4E Leads to Immunity to Two Potyviruses", PLOS ONE, vol. 5, no. 6, 25 June 2010 (2010-06-25), pages E11313, XP055007538, ISSN: 1932-6203, DOI: 10.1371/journal.pone.0011313
THOMPSON ET AL., NUCLEIC ACIDS RESEARCH, vol. 22, 1994, pages 4673 - 4680
THOMPSON ET AL., NUCLEIC ACIDS RESEARCH, vol. 24, 1997, pages 4876 - 4882
WALSH ET AL., EUROPEAN JOURNAL OF PLANT PATHOLOGY, vol. 108, 2002, pages 15 - 20
RUSHOLME ET AL., JOURNAL OF GENERAL VIROLOGY, vol. 88, 2007, pages 3177 - 3186
Attorney, Agent or Firm:
Hutter, Anton et al. (20 Little Britain, London EC1A 7DH, GB)
Download PDF:
Claims:
Claims

1. An isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E), which is non- functional for a virus, wherein nucleic acid encoding the eIF4E or eIF(iso)4E is mis- spliced.

2. An isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to claim 1, wherein the mis-spliced gene encoding eIF4E or eIF(iso)4E is caused by a modification in said nucleic acid sequence.

3. An isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to claim 2, wherein the modification occurs in a non-coding region of the gene.

4. An isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to either claim 2 or claim 3, wherein the modification occurs in a splice element in the nucleic acid sequence encoding eIF4E or eIF(iso)4E.

5. An isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to any one of claims 2 to 4, wherein the modification is located in an intron of the nucleic acid sequence encoding eIF4E or eIF(iso)4E.

6. An isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to any one of claims 2 to 5, wherein the modification is located in intron 1.

7. An isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to any one of claims 2 to 6, wherein the modification comprises an insertion or a deletion of at least one base.

8. An isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to any one of claims 2 to 7, wherein the modification comprises the insertion of at least one guanine.

9. An isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to any one of claims 2 to 8, wherein the modification comprises an insertion at position 201 of SEQ ID No. 1, preferably a guanine.

10. An isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to any one of claims 1 to 9, comprising an amino acid sequence substantially as set out in SEQ ID No: 6, 8 or 10, or a variant or fragment thereof.

11. An isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to any one of claims 1 to 10, wherein the nucleic acid, which encodes the eIF4E or eIF(iso)4E protein, comprises a sequence substantially as set out in SEQ ID No: 4, 5, 7 or 9, or a variant or fragment thereof.

12. An isolated nucleic acid sequence encoding an alternatively spliced variant of a plant eukaryotic translation initiation factor 4E (eIF4E), or an isoform thereof (eIF(iso)4E), wherein the nucleic acid sequence is mis-spliced such that the eIF4E or eIF(iso)4E is non-functional for a virus.

13. An isolated nucleic acid sequence according to claim 12, wherein the eIF4E or eIF(iso)4E is defined as in any one of claims 1 to 11.

14. A recombinant vector comprising the nucleic acid sequence according to claim 13.

15. A host cell comprising the vector according to claim 14.

16. A method for detecting, in a test plant, the presence of a plant eukaryotic translation initiation factor 4E (eIF4E) variant, or an isoform thereof (eIF(iso)4E), which is non- functional for a virus, wherein nucleic acid encoding the eIF4E or eIF(iso)4E is mis-spliced, the method comprising the steps of:

(i) isolating RNA from a test plant;

(ii) producing cDNA from the RNA isolated in step (i) using primers specific for eIF4E or eIF(iso)4E;

(iii) determining the sequence of the cDNA produced in step (ii); and

(iv) comparing the cDNA sequence determined in step (iii) with the cDNA

sequence of wild-type eIF4E or eIF(iso)4E,

wherein a variation in the sequence of step (iii) compared to the wild-type sequence indicates that the nucleic acid encoding the eIF4E or eIF(iso)4E in the test plant is mis-spliced and is a variant of eIF4E or eIF(iso)4E.

17. The method according to claim 16, further comprising a step of identifying the modification in the plant genome that causes mis-splicing of eIF4E or eIF(iso)4E, wherein said identification step comprises breeding the test plant such that it is homozygous for a eIF4E or eIF(iso)4E gene.

18. The method according to claim 17, wherein the eIF4E or eIF(iso)4E variant is defined as in any one of claims 1 to 11.

19. A method for selecting a virus-resistant plant, the method comprising detecting, in a test plant, the presence of:-

(i) a plant eukaryotic initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to any one of claims 1 to 11; and

(ii) at least one copy of wild-type eIF4E or eIF(iso)4E, wherein the wild-type copy of eIF4E or eIF(iso)4E can be used by the plant, but cannot be used by a virus.

20. Use of a plant eukaryotic initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to any one of claims 1 to 11 , or the nucleic acid according to either claim 12 or 13, for inducing viral-resistance in a plant.

21. A method for producing a virus-resistant plant, the method comprising crossing a parent plant that expresses a plant eukaryotic initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to any one of claims 1 to 11 with a recipient plant, the recipient comprising at least one trait, selected from a group consisting of an agronomic advantage, a commercial advantage, and/ or suitability for a particular climate or soil.

22. A virus-resistant plant produced by or obtainable by the method according to claim 21.

23. A virus-resistant plant comprising the plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) according to any one of claims 1 to 11, or the nucleic acid according to either claim 12 or claim 13.

24. A virus-resistant plant according to either claim 22 or claim 23, wherein the plant expresses at least one copy of wild-type eIF4E or eIF(iso)4E, wherein the wild- type copy of eIF4E or eIF(iso)4E can be used by the plant, but not by a virus.

25. A virus-resistant plant according to any one of claims 22 to 24, wherein the plant is Brassica spp, preferably B. rapa and/ or Brassica oleracea.

26. The method according to any one of claims 16-19 or 21 , the use according to claim 20 or the plant according to any one of claims 22-25 comprising the use of a transgenic plant.

27. The method according to any one of claims 16-19 or 21 , the use according to claim 20, or the plant according to any one of claims 22-26, wherein the plant is homozygous for eIF4E and/or eIF(iso)4E, which encodes the alternatively spliced variant of a plant eukaryotic translation initiation factor 4E (eIF4E), or an isoform thereof (eIF(iso)4E), which is non- functional for a virus.

28. The method according to any one of claims 16-19 or 21, the use according to claim 20, or the plant according to any one of claims 22-27, wherein, in order to confer virus resistance in plants having multiple copies/loci of eIF4E and/or eIF(iso)4E, the alleles of eIF4E and/or eIF(iso)4E at each of these other loci are also non- functional for the virus.

29. An isolated nucleic acid sequence encoding a plant eukaryotic translation initiation factor 4E (eIF4E), or an isoform thereof (eIF(iso)4E), wherein the eIF4E or eIF(iso)4E is non-functional for a virus, and wherein the nucleic acid sequence comprises a nucleotide sequence substantially as set out in SEQ ID No: 4, 5, 7, 9, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 50, 52, 53, 58 or 59, or a variant or fragment thereof.

30. An isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant, or an isoform thereof (eIF(iso)4E), which is non- functional for a virus, wherein the eIF4E or eIF(iso)4E protein variant comprises an amino acid sequence substantially as set out in SEQ ID No: 6, 8, 10, 36, 39, 42, 45, 48, 51, 54 or 60, or a variant or fragment thereof. 31. The vector according to claim 14, the host cell according to claim 15, the method according to any one of claims 16-19 or 21, the use according to claim 20, or the plant according to any one of claims 22-28, comprising a nucleotide sequence substantially as set out in SEQ ID No: 4, 5, 7, 9, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 50, 52, 53, 58 or 59, or a variant or fragment thereof.

32. The vector according to claim 14, the host cell according to claim 15, the method according to any one of claims 16-19 or 21, the use according to claim 20, or the plant according to any one of claims 22-28 comprising an eIF4E or eIF(iso)4E protein variant which comprises an amino acid sequence substantially as set out in SEQ ID No: 6, 8, 10, 36, 39, 42, 45, 48, 51, 54 or 60, or a variant or fragment thereof.

33. The vector according to claim 14, the host cell according to claim 15, the method according to any one of claims 16-19 or 21 , the use according to claim 20, or the plant according to any one of claims 22-28 comprising the use of a nucleotide sequence substantially as set out in SEQ ID No: 58 or 59, or a variant or fragment thereof, or the use of an eIF4E or eIF(iso)4E protein variant which comprises an amino acid sequence substantially as set out in SEQ ID No: 60, or a variant or fragment thereof.

34. The method according to any one of claims 16-19 or 21 , the use according to claim 20, or the plant according to any one of claims 22-28, wherein the plant is of the family Solanaceae, such as potato, tomato, pepper or egg plant; Cucurbits, such as melons, squash and cucumbers; Cruciferous crops; Brassica spp; Brassica napus such as oilseed rape; Brassica rapa such as Chinese cabbage; Brassica oleracea such as broccoli, cauliflower, cabbage, savoy cabbage, Brussels sprouts, red cabbage; Brassica oleracea; Fabaceae peas, beans, pulses etc. or a monocotyledonous crop, including rice, maize, wheat, or barley etc.

Description:
A PLANT

The invention relates to plants, and in particular to virus-resistant plants, and to methods of generating such plants. The invention extends to eukaryotic translation initiation factor variants and isoforms thereof, and to nucleic acids involved in the splicing of such variant factors, and uses thereof in methods for producing plants that are resistant to viral infections.

Viruses present a significant problem in agriculture. For example, plant viruses in the family Potyviridae (potyviruses) represent approximately 30% of plant viruses and are capable of infecting more than 30 different families of plants, leading to extensive crop damage and even death. In particular, the Solanaceae, Cucurbitaceae and Fabaceae plant families are especially sensitive to infection by potyviruses. Currently, in contrast to many bacterial or fungal infections, there are few ways to combat viral infections in plants. Due to the increasing size of the international market for plants and seeds, it is becoming more essential for plant breeders to develop plants that are resistant to infection from viruses, for example those from the Potyviridae family.

Upon infection of a plant host, plant viruses use some of the host's endogenous proteins to complete their own life cycle. For example, potyviruses, such as Turnip mosaic virus (TuMV), use plant eukaryotic translation initiation factors to bind plant ribosomes to the viral genomic RNA as a pre-requisite to translating their genomes into various viral proteins, including the viral RNA-dependent RNA polymerase that is essential to produce more copies of the virus. Therefore, defects in the plant eukaryotic translation initiation factors may confer viral resistance in plants. However, since the eukaryotic translation initiation factors are vital to the survival of plants, defects in the eukaryotic translation initiation factors are detrimental to plants, often resulting in plants that are non- viable.

It has been shown in a number of plant-potyvirus interactions that the potyvirus VPg (protein encoded by potyvirus RNA genome) is able to bind to the eIF4E protein and that mutations in members of the eIF4E gene family can confer resistance to potyviruses. Recessive resistance to infections by potyviruses is believed to arise from base changes in the coding region (i.e. the exons) of the genes encoding eIF4E and/or eIF(iso)4E, thereby resulting in eIF4E and/ or eIF(iso)4E protein variants.

For example, Turnip mosaic virus (TuMV), which can normally infect Arabidopsis thaliana, can no longer infect Λ. thaliana plants that lack a functional eukaryotic translation initiation factor isoform (eIF(iso)4E) protein. Insertional mutagenesis of At.eIF(iso)4E using a defective maize transposon (dSpm) produced a plant line that was able to grow normally, and was resistant to TuMV infection. Additionally, chemically-induced point mutation of the Λ. thaliana eIF(iso)4E gene using ethylmethane sulphonate (EMS) named lsp1 also conferred resistance to TuMV infection.

Although various mechanisms of viral resistance have been found in certain plants, these are mostly specific to virus strains. Dominant plant R genes predominantly provide strain-specific resistance to plant viruses, for example TuRBOI provides resistance to pathotype 1 isolates of TuMV, but is overcome by TuMV isolates belonging to other pathotypes including, 3, 4 and 12. Most examples of recessive resistance associated with mutations in eIF4E and eIF(iso)4E are also strain-specific and mutations in viral VPg and occasionally other viral proteins result in strains able to overcome such resistance.

Therefore, there is a need to induce virus resistance in plants that is not specific to any strains, and thereby confer broad spectrum virus resistance. There are some examples of broad spectrum resistance for some viruses but for most viruses in most crop types, there are no sources of broad spectrum resistance. Additionally, viruses are constantly mutating and genotypes able to overcome strain- specific and broad spectrum resistance are generated in susceptible plants and selected for by the cultivation of resistant plants. Most plant viruses have RNA genomes and RNA is known to have a particularly high mutation rate due to infidelity in the proof-reading mechanism. Consequently, new sources of broad spectrum resistance are required, particularly based on new

mechanisms, in order to improve the durability of such resistances. All previously reported recessive resistance to infection by viruses (such as potyviruses) based on eIF4E or eIF(iso)4E has arisen through base changes in the exons. Such bases changes cause alterations in the sequence of the eIF4E or eIF(iso)4E protein, for example altered amino acid residues in the translated protein, or premature chain termination resulting in a truncated protein. Hence, previous efforts in developing virus-resistant plant species, have focused on generating plant varieties, which harbour mutations in only the exons of either eIF4E or eIF(iso)4E. Surprisingly, however, the inventor has now identified for the first time that plant resistance to viruses, such as potyviruses, may be conferred by plant eIF4E or eIF(iso)4E protein variants that are produced from mis-splicing processes compared to that of the wild-type or native protein.

Accordingly, in a first aspect of the invention, there is provided an isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant, or an isoform thereof (eIF(iso)4E), which is non-functional for a virus, wherein nucleic acid encoding the eIF4E or eIF(iso)4E is mis-spliced.

Previous examples of eIF4E and eIF(iso)4E have all involved mutations in the coding region of the genes. The inventors have now provided the first example of variation in the DNA sequence of an intron inducing virus resistance. Most changes in intron DNA sequence would not be predicted to provide resistance as the introns are spliced from the genes prior to translation and hence do not affect the protein. Additionally, mis- splicing of genes usually results in non-functional proteins, and the lack of the functional protein can be lethal, can lead to reduced fitness, or can have other adverse affects.

The term "eIF4E" is also known as eukaryotic translation initiation factor 4E, which is a key component in the initiation of protein synthesis. As will be known to the skilled technician, in plants, eIF4E forms a complex eIF4F (consisting of eIF4E and eIF4G). The precise biochemical role of a plant eIF4E protein in virus infection has yet to be identified. However, not wishing to be bound to any theory, the plant eIF4E may be capable of binding to the 5' cap structure (or a mimic) of the viral RNA, or the plant eIF4E may interact with the viral RNA directly. Alternatively, eIF4E may be involved in the cell-to-cell movement of the infecting virus in the host plant.

The term "eIF(iso)4E" refers to an isoform of eIF4E, and eIF(iso)4E has a similar function as eIF4E. eIF4E and eIF(iso)4E proteins from Arabidopsis thaliana are 44% - 49% identical at the amino acid level. This is similar to values found for Brassica rapa line R-o-18 (47 - 50% identity) and Brassica rapa line Chiifu (43 - 50 % identity).

The virus life cycle is totally dependent on using the host plant's translation machinery to turn viral nucleic acid into proteins. Without the viral proteins produced by the host translation machinery, no viral replicase protein is produced and so no more copies of the virus are made either. A number of viruses have been shown to be dependent on the host plant's translation complex and virus interactions with the plant eIF4E and/ or eIF(iso)4E proteins have been demonstrated (the version used depends on the particular combination of virus species and plant species).

The skilled person will appreciate that mis-splicing can produce the eIF4E or eIF(iso)4E variant of the first aspect, which can therefore be described as being an alternatively spliced variant, compared to the wild-type or native eIF4E or eIF(iso)4E proteins. The genomic sequence of a gene, such as that encoding eIF4E or eIF(iso)4E, comprises coding regions (i.e. exons) and non-coding regions (i.e. introns). The introns and exons are transcribed into RNA termed "primary transcript, precursor to mRNA" (or "pre-mRNA"). The introns must be removed from the pre-mRNA so that the native protein encoded by the exons can be produced. The term "native protein" can mean the naturally occurring, wild-type or functional protein.

The removal of the introns from the pre-mRNA and subsequent ligation of the exons to each other is carried out in the splicing process. The splicing process usually consists of a series of reactions, mediated by splicing factors, which is carried out on the RNA after transcription, but before translation. Thus, a "pre-mRNA" is an RNA molecule which contains both exons and intron(s), and an "mRNA" is an RNA in which the intron(s) have been removed and the exons have been joined together sequentially so that the protein can then be translated therefrom by the ribosomes.

Introns are defined by a set of "splice elements" which are relatively short, conserved RNA segments which bind the various splicing factors that carry out the splicing reactions. Thus, each intron is defined by a 5' splice site, a 3' splice site, and a branch point situated there between.

The inventor has found that the protein variant of the first aspect may be caused by modifications in the splice element of the native DNA and/ or pre-mRNA of eIF4E or eIF(iso)4E, which create a new, aberrant, splice element (compared to the wild-type). Therefore, in an embodiment of the invention, the mis- spliced gene encoding eIF4E or eIF(iso)4E may arise from a modification in a splice element, thereby producing an aberrant splice element. The aberrant splice element may cause altered splice patterns of the eIF4E or eIF(iso)4E pre-mRNA, giving rise to altered mRNA. Preferably, the resulting eIF4E protein variant or eIF(iso)4E protein variant of the first aspect is not functional for the virus in a plant, such that the plant is substantially resistant to viral infection.

The inventor has demonstrated that alterations in an intron of the gene encoding eIF4E or eIF(iso)4E (rather than in an exon) result in mis-splicing occurring, which results in a eIF4E variant or eIF(iso)4E variant protein being produced, which is non-functional for a virus, and thus confers virus resistance when present in a plant.

In one embodiment, the aberrant splice element may alter the native splice site at the 5'- or 3'-end of the intron (i.e. at the 5' native splice site or at the 3' native splice site of the intron), which creates a new, aberrant, splice site. It would be appreciated that the difference between the genomic DNA sequence and the sequence of mature message RNA defines the introns. The 5' and 3' ends of the introns define the native splice sites of a gene. The aberrant splice site may be upstream or downstream of the native splice site. It would be appreciated by the skilled technician that the term "upstream" means towards the 5' end of the DNA or pre-mRNA molecule and may be denoted by the symbol It would also be appreciated by the skilled technician that the term "downstream" means towards the 3' end of the DNA or pre-mRNA molecule, and may be denoted by the symbol "+". Hence, in an embodiment where the aberrant splice site may be 10 nucleotides upstream from the 5' native splice site of intron 1, the aberrant splice site may also be referred to as "-10 bp relative to the 5' splice site of intron 1".

In embodiments where the aberrant splice site is upstream of the 5' native splice site of an intron, some of the exon that is upstream of the intron may be excised during the splicing process. Therefore, the resulting mRNA may lack nucleotides of the excised exon. In embodiments where the aberrant splice site is downstream of the 5' native splice site of an intron, some of the intron may be retained during the splice event. Therefore, the resulting mRNA may comprise nucleotides of the retained intron.

In embodiments where the aberrant splice site is downstream of the 3' native splice site of an intron, some of the exon downstream of the intron may be excised during the splicing process. Therefore, the resulting mRNA may lack nucleotides of the excised exon. In embodiments where the aberrant splice site is upstream of the 3' native splice site of an intron, some, or all of the intron may be retained during the splice event. Therefore, the resulting mRNA may contain the retained intron nucleotides.

In an alternative embodiment, the altered splice pattern may comprise retaining at least one intron. The resultant mRNA molecule (i.e. mRNA variant) may therefore contain RNA sequence that corresponds to the retained intron or introns, and may be larger than the native mRNA molecule.

The amino acid sequence translated from the retained at least one intron, or part thereof may comprise a stop codon. The resulting protein (i.e. protein variant) may not be translated beyond the first stop codon and, hence, the protein variant may be smaller in size compared to the native protein. The stop codon that is introduced by the retained intron may be referred to as a premature stop codon because it causes translation of the mRNA to cease prematurely (the first wild-type stop codon of the gene being located further downstream). Therefore, the protein variant may have lack of function or an altered function compared to the wild-type or the native protein. The protein variant may be non-functional for a virus.

The codon reading frame of the retained at least one intron, or part thereof, may be in- frame or out-of-frame with the coding reading frame of the neighbouring exon(s). The resulting mRNA may encode an altered amino acid sequence, and hence, produce a protein variant. The protein variant may have lack of function or an altered function compared to the wild type or the native protein. The protein variant may be nonfunctional for a virus.

In an embodiment where the codon reading frame of the retained at least one intron, or part thereof, is out-of-frame with that of the neighbouring exon(s), the resulting protein (i.e. protein variant) may contain altered amino acid sequences may be truncated due to a premature stop codon. The resultant protein variant may have lack of function or an altered function compared to the wild type or the native protein. The protein variant may be non- functional for a virus.

In an embodiment where at least one exon, or a part thereof, is excised, the resulting mRNA may encode an altered amino acid sequence, and hence, produce a protein variant. The altered amino acid sequence may contain a premature stop codon. The protein variant may have lack of function, or an altered function compared to the wild type or the native protein. The protein variant may be non- functional for a virus.

The altered function of the eIF4E variant or eIF(iso)4E variant may comprise a decrease in its activity for translation initiation. When the mis-splicing occurs in intron 1, this may result in extremely truncated versions of the protein, or versions where most of the protein is completely different to functional versions. Not bound by theory, this may have a number of consequences, as it is highly unlikely to bind to other components of the eukaryotic initiation complex, particularly eIF4G or eIF(iso)4G and also unlikely to bind to the viral protein VPg, or messenger RNA cap. This will result in lack of translation of messenger RNA and/ or viral RNA.

The inventor has carried out his experiments using rassica rapa as a plant model, and has surprisingly found that B. rapa contains three eIF(iso)4E loci, and thus, has three protein isoforms of eIF(iso)4E, the protein isoforms being denoted eIF(iso)4E.a, eIF(iso)4E.b and eIF(iso)4E.c. He therefore investigated eIF(iso)4E.a in two plant lines. The first plant line, referred to herein as "R-o-18", is sensitive to infection with a virus (i.e. Turnip Mosaic virus, TuMV), and the second plant line, referred to herein as

"RLR22", is resistant to viral infection. The inventor has determined the genomic, mRNA and polypeptide sequences for the eIF(iso)4E.a isoform of B. rapa, as discussed below.

The genomic DNA sequence encoding the native or wild type Brassica rapa eIF(iso)4E.a isoform, BraA.eIF(iso)4E.a, in line R-o-18 (sensitive to viral infection), is provided herein as SEQ ID No. 1, as follows:

ATGGCGACAGAGGATGTGAACGAAGCCCTTGCGGCGGCGGAAGTACCGGCAACAGAGACG ACGGAGAAGC AGCCTGCTCACAAGCTCGAAAGAAAGTGGAGTTTCTGGTTCGATAACCAATCCAAACCAA AGCAAGGCGC CGCCTGGGGAGCCTCCCTTCGCAAAGCCTATACCTTCGACACCGTCCAAGACTTCTGGGG GTTTGTTTGT CTTCTCCTTTTATTTATTGTTAGCGATCTGTAAAGCTAGATCTTCTTTTGCAGTTTGCAC GAGACTATAT TCATCCCTAGCAAACTGACGCCGAATGCTGAAATTCACATGTTCAAAGCTGGTGTTGAGC CTAAGTGGGA AGATCCTGAGTGTGCTAATGGGGGAAAGTGGACTTATGTTGTCACCTCCAACCGCAAGCC TGCTTTAGAC AAGGCTTGGCTTGAAACTGTACTCCTCTTCTACCTCTCCTCCTTTTTTCTTTTTTTTTGC ATCTGGTAAT GACATGTTTTCTCTGCCAGTTGATGGCTCTTGTCGGAGAGCAATTTGATGAGGCTGATGA GATTTGTGGC GTGGTTGCTAGTGTGCGCCCAAAGCAGGACAAGCTCTCCTTGTGGACAAGGACCAAATCT AATGAAGCTG TTCTGGTATGATGCTTGTCTTCTCTCACTATGTACCTTTGGTGTTGTTTGATAACTGTTT TCTTCTACTT GTTATCCGTTGCGATGTCCCATTATTGTTTGATTATCCTGTTCCAATTTTTTTGTATTGC GTACTGGTGG TTTACGAAGAAGTGTTCTTGTACAATATGTTAGCGTTGTTGAATGTGTTAATTGCTTACT ATAGTAAAAC AGTTTAAGCTGTTGACTATGTTAATATTCTCTTCGATACACACACTTAGAATGGATAACT ACCTTGTTTC TTTATCCTTTGGAGTTTCACCAGCTTATTATCGATCGAGATACTCCTTCTGATTTGAATT ACCATTCAAG ATTAATATTTATATATATTGAAAGTATATGTTTGTTTAACGATATATCTATTAGGCTTGC TTTTTTTAGT TCATTCGCAGTATAAACGTAGCTCTATTTATTAGAGGCTTCTCTTTAGAACTTGGCAGTA ATGTAATATG TCGAAGTGTGGTTTATGAATCTGGTTGATGATATTACTAATTTTTTTGTTTGTTATTGTA AATCCAGATG GGTATTGGGAAGAAGTGGAAGGAGATACTTGATGTCACCGACAAGATAACTTTCACTAAC CATGTAACTT AACTTTCTCCACATAGAGGCTAATTATCTTTTGTTCTTCTTACGTGGCTTACTAAAATGT GGTCTACTTA TATATATAGG AT G AT T C T AG AAG AAC TAGGTTCACTGTCTGA

SEQ ID No: 1 SEQ ID No: 1 shows exons 1 to 5, with the introns being represented in bold. The mRNA sequence of the native or wild type BraA.eIF(iso)4E.a in line R-o-18 (sensitive to viral infection), is provided herein as SEQ ID No. 2, as follows:

AUGGCGACAGAGGAUGUGAACGAAGCCCUUGCGGCGGCGGAAGUACCGGCAACAGAG ACGACGGAGAAGC AGCCUGCUCACAAGCUCGAAAGAAAGUGGAGUUUCUGGUUCGAUAACCAAUCCAAACCAA AGCAAGGCGC CGCCUGGGGAGCCUCCCUUCGCAAAGCCUAUACCUUCGACACCGUCCAAGACUUCUGGGG UUUGCACGAG ACUAUAUUCAUCCCUAGCAAACUGACGCCGAAUGCUGAAAUUCACAUGUUCAAAGCUGGU GUUGAGCCUA AGUGGGAAGAUCCUGAGUGUGCUAAUGGGGGAAAGUGGACUUAUGUUGUCACCUCCAACC GCAAGCCUGC UUUAGACAAGGCUUGGCUUGAAACUUUGAUGGCUCUUGUCGGAGAGCAAUUUGAUGAGGC UGAUGAGAUU UGUGGCGUGGUUGCUAGUGUGCGCCCAAAGCAGGACAAGCUCUCCUUGUGGACAAGGACC AAAUCUAAUG AAGCUGUUCUGAUGGGUAUUGGGAAGAAGUGGAAGGAGAUACUUGAUGUCACCGACAAGA UAACUUUCAC UAACCAUGAUGAUUCUAGAAGAACUAGGUUCACUGUCUGA

SEQ ID No: 2 Figure 3 illustrates schematically that the exons of BmA.eIF(iso)4E.a are ligated together to form the mRNA of SEQ ID No: 2.

The polypeptide sequence of the native or wild type BraA.eIF(iso)4E.a in line R-o-18 (sensitive to viral infection), is provided herein as SEQ ID No. 3, as follows:

MATEDVNEALAAAEVPATETTEKQPAHKLERKWS W DNQSKPKQGAAWGASLRKAYT DTVQDFWGLHE TIFIPSKLTPNAEIHMFKAGVEPKWEDPECANGGKWTYVVTSNRKPALDKAWLETLMALV GEQFDEADEI CGWASVRPKQDKLSLWTRTKSNEAVLMGIGKKWKEILDVTDKITFTNHDDSRRTRFTV.

SEQ ID No: 3 The genomic DNA sequence encoding the Brassica rapa eIF(iso)4E.a isoform,

BraA.eIF(iso)4E.a variant, in line RLR22 (resistant to viral infection), is provided herein as SEQ ID No. 4, as follows:

ATGGCGACAGAGGATGTGAACGAAGCCCTTGCGGCGGCGGAAGTACCGGCAACAGAGACG ACGGAGAAGC AGCCTGCTGACAAGCTCGAAAGAAAGTGGAGTTTCTGGTTCGATAACCAATCCAAACCAA AGCAAGGCGC CGCCTGGGGAGCCTCCCTTCGCAAAGCCTATACCTTCGACACCGTCCAAGACTTCTGGGG GGTTTGTTTG TCTTCTCCTTTTACTTATTGTTAGCGATCTGTAAAGCTAGATCTTCTTTTGCAGTTTGCA CGAGACTATA TTCATCCCTAGCAAACTGACGCCGAATGCTGAAATTCACATGTTCAAAGCTGGTGTTGAG CCTAAGTGGG AAGATCCTGAGTGTGCTAATGGCGGAAAGTGGACTTTTGTTGTTACCTCCAACCGCAAGC CTGCTTTAGA CAAGGCTTGGCTTGAAACTGTACTCATCTTCTACCTCTCCTCTTTTTTTTTTTAATAGTT TAGACAATTT TGCATCTGGTAATGACATGTTTTATCTGCCAGTTGATGGCTCTTGTCGGAGAGCAATTTG ATGAGGCTGA TGAGATCTGTGGGGTGGTTGCTAGTGTGCGCCCAAAGCAGGACAAGCTCTCCTTGTGGAC AAGGACCAAA TCTAATGAAGCTGTTCTGGTATGATGCTTCTCTTCTCTCACTATGTACCTTTGGTGTTGT TTTCTTCTAC TTGTTATCCGTTGCGATGTCCCATTATTGTTTGATTATCCTGTTCCAATTTTTCTGTTTT GCGTACTGGT GGTTTACGAAGAAGTATGCTTGTACAATATGTTAGCGTTGTTGAATGTGTTAATTGCTTA CTATAGTAAA ACAGTTTAAGCTGTTGACTATGTTAATATTCTCTTCGATACACACACTTAGAATGGATAA CTACCTTGTT TCTTTATCCTTTGGAGTTTCACCAGCTTAATATATATTGAAAGTATATGTTTGTTCAACG ATATATCTAT TAGGCTTGCTTTTTTTAGTTCATTCGCAGTATAAACATAGCTCTATTTATTAGAGGCCAT CTCTTTAGAA CTTGGCAGTACTGTAATATGTCGAAGTGTGGTTTATGAATCTGGCTGATGATATTACTAC TTTGTTGTTT GTTATTGTAAATCCAGATGGGTATTGGGAAGAAGTGGAAGGAGATACTTGATGTCACCGA CAAGATAACT TTCACTAACCATGTAACTTAACTTTCTCCACATAGAGGCTAATTATCTTTTGTTCTTCTT ACGTGGCTTA

CTAAAATGTGGTCTACTTATATATATAGGATGATTCTAGAAGAACTCGGTTCACTGT CTGA

SEQ ID No: 4

SEQ ID No: 4 shows the int ons being represented in bold, and a guanine insertion at position +1 of the 5' splice site of intron 1 (underlined in the sequence above). This mutation is referred to herein as "insertion/ deletion" mutation or "indel". The inventor has surprisingly observed that the DNA sequence represented as SEQ ID No.4, in some embodiments, can give rise to several different forms of mRNA sequence, each of which may produce the variant of the eIF4E or eIF(iso)4E protein of the first aspect. The inventor has named the allele (BraA.eIF(iso)4E.d) from the virus- resistant line (RLR22) "retrOI", which is believed to be recessive. A first embodiment of an mRNA sequence of Br A.eIF(iso)4E.a variant in line RLR22 (resistant to viral infection), is provided herein as SEQ ID No. 5, as follows:

AUGGCGACAGAGGAUGUGAACGAAGCCCUUGCGGCGGCGGAAGUACCGGCAACAGAG ACGACGGAGAAGC AGCCUGCUGACAAGCUCGAAAGAAAGUGGAGUUUCUGGUUCGAUAACCAAUCCAAACCAA AGCAAGGCGC CGCCUGGGGAGCCUCCCUUCGCAAAGCCUAUACCUUCGACACCGUCCAAGACUUCUGGGG GGUUUGUUUG UCUUCUCCUUUUACUUAUUGUUAGCGAUCUGUAAAGCUAGAUCUUCUUUUGCAGUUUGCA CGAGACUAUA

UUCAUCCCUAGCAAACUGACGCCGAAUGCUGAAAUUCACAUGUUCAAAGCUGGUGUU GAGCCUAAGUGGG AAGAUCCUGAGUGUGCUAAUGGCGGAAAGUGGACUUUUGUUGUUACCUCCAACCGCAAGC CUGCUUUAGA CAAGGCUUGGCUUGAAACUUUGAUGGCUCUUGUCGGAGAGCAAUUUGAUGAGGCUGAUGA GAUCUGUGGG GUGGUUGCUAGUGUGCGCCCAAAGCAGGACAAGCUCUCCUUGUGGACAAGGACCAAAUCU AAUGAAGCUG UUCUGAUGGGUAUUGGGAAGAAGUGGAAGGAGAUACUUGAUGUCACCGACAAGAUAACUU UCACUAACCA UGAUGAUUCUAGAAGAACUCGGUUCACUGUCUGA

SEQ ID No: 5 As can be seen, the effect of the nucleic acid modification in SEQ ID No.4 (i.e.

containing the guanine insertion at position +1 of the 5' splice site of intron 1) is that the resultant mRNA shown in SEQ ID No: 5 contains RNA sequence that

corresponds to intron 1 which is fully retained, and which is represented in bold.

Figure 4 illustrates schematically that an altered splice pattern of eIF4E DNA or pre- mRNA resulting in the mRNA of SEQ ID No: 5. The polypeptide sequence corresponding to the first embodiment of mRNA (i.e. SEQ ID No. 5) of BraA.eIF(iso)4E.a variant in line RLR22, is provided herein as SEQ ID No. 6, as follows:

MATEDVNEALAAAEVPATETTEKQPADKLERKWS W DNQSKPKQGAAWGASLRKAYT DTVQDFWGVCL SSPFTYC . RSVKLDLLLQFARDYIHP . QTDAEC . SHVQSWC . A . VGRS . VC . WRKVDFCCYLQPQACFR QGLA . FDGSCRRAI .. G.. DLWGGC . CAPKAGQALLVDKDQI .. SCSDGYWEEVEGDT . CHRQDNFH . P .. F . KNSVHCL

SEQ ID No: 6

In SEQ ID No.6, chain termination is indicated by a As can be seen, translation of the mRNA shown in SEQ ID No: 5, having the whole of intron 1 retained, results in a first premature stop codon at position 88, such that a truncated eIF(iso)4E protein is produced.

A second embodiment of an mRNA sequence of BmA.eIF(iso)4E.a variant in line RLR22 (resistant to viral infection), is provided herein as SEQ ID No. 7, as follows:

AUGGCGACAGAGGAUGUGAACGAAGCCCUUGCGGCGGCGGAAGUACCGGCAACAGAG ACGACGGAGAAGC AGCCUGCUGACAAGCUCGAAAGAAAGUGGAGUUUCUGGUUCGAUAACCAAUCCAAACCAA AGCAAGGCGC CGCCUGGGGAGCCUCCCUUCGCAAAGCCUAUACCUUCGACACCGUCCAAGACUUCUGGGG GAUCUUCUUU UGCAGUUUGCACGAGACUAUAUUCAUCCCUAGCAAACUGACGCCGAAUGCUGAAAUUCAC AUGUUCAAAG CUGGUGUUGAGCCUAAGUGGGAAGAUCCUGAGUGUGCUAAUGGCGGAAAGUGGACUUUUG UUGUUACCUC CAACCGCAAGCCUGCUUUAGACAAGGCUUGGCUUGAAACUUUGAUGGCUCUUGUCGGAGA GCAAUUUGAU GAGGCUGAUGAGAUCUGUGGGGUGGUUGCUAGUGUGCGCCCAAAGCAGGACAAGCUCUCC UUGUGGACAA GGACCAAAUCUAAUGAAGCUGUUCUGAUGGGUAUUGGGAAGAAGUGGAAGGAGAUACUUG AUGUCACCGA CAAGAUAACUUUCACUAACCAUGAUGAUUCUAGAAGAACUAGGUUCACUGUCUGA

SEQ ID No: 7 The mRNA variant shown in SEQ ID No: 7 results from an altered 3' splice site of intron 1. The altered splice site is at position +48 relative to the 5' end of intron 1 (i.e. 15 bp upstream of the native 3' splice site of intron 1). The retained intron sequence (15 nucleotides) is represented in bold. Figure 5 illustrates this splice event

schematically. The polypeptide sequence corresponding to the second embodiment of mRNA variant (i.e. SEQ ID No. 7) of BraA.eIF(iso)4E. a variant in line RLR22, is provided herein as SEQ ID No. 8, as follows: MATEDVNEALAAAEVPATETTEKQPADKLERKWS W DNQSKPKQGAAWGASLRKAYT DTVQDFWGIFF CSLHETIFIPSKLTPNAEIHMFKAGVEPKWEDPECANGGKWTFVVTSNRKPALDKAWLET LMALVGEQFD EADEICGVVASVRPKQDKLSLWTRTKSNEAVLMGIGKKWKEILDVTDKITFTNHDDSRRT RFTV.

SEQ ID No: 8

In SEQ ID No. 8, five new amino acid residues (i.e. IFFCS) are translated as a result of the nucleic modification and are shown in bold. As can be seen, translation of the mRNA shown in SEQ ID No: 7 having part of the intron retained does not result in a ftameshift, but the elongated protein may be non- functional for the virus.

A third embodiment of an mRNA sequence of BraA.eIF(iso)4E.a variant in line RLR22 (resistant to viral infection), is provided herein as SEQ ID No. 9, as follows:

AUGGCGACAGAGGAUGUGAACGAAGCCCUUGCGGCGGCGGAAGUACCGGCAACAGAG ACGACGGAGAAGC AGCCUGCUGACAAGCUCGAAAGAAAGUGGAGUUUCUGGUUCGAUAACCAAUCCAAACCAA AGCAAGGCGC CGCCUGGGGAGCCUCCCUUCGCAAAGCCUAUACCUUCGACACCGUCCAAGACUUCUGAUU UGCACGAGAC UAUAUUCAUCCCUAGCAAACUGACGCCGAAUGCUGAAAUUCACAUGUUCAAAGCUGGUGU UGAGCCUAAG UGGGAAGAUCCUGAGUGUGCUAAUGGCGGAAAGUGGACUUUUGUUGUUACCUCCAACCGC AAGCCUGCUU UAGACAAGGCUUGGCUUGAAACUUUGAUGGCUCUUGUCGGAGAGCAAUUUGAUGAGGCUG AUGAGAUCUG UGGGGUGGUUGCUAGUGUGCGCCCAAAGCAGGACAAGCUCUCCUUGUGGACAAGGACCAA AUCUAAUGAA GCUGUUCUGAUGGGUAUUGGGAAGAAGUGGAAGGAGAUACUUGAUGUCACCGACAAGAUA ACUUUCACUA ACCAUGAUGAUUCUAGAAGAACUCGGUUCACUGUCUGA

SEQ ID No: 9 The mRNA variant shown in SEQ ID No: 9 results from an altered 5' splice site of intron 1. The altered splice site is at position -3 (i.e. 3 bp upstream) relative to the 5' native splice site of intron 1. The excised exon sequence (3 nucleotides) is represented by the symbol "Δ". Figure 6 illustrates this splice event schematically. The polypeptide sequence corresponding to the third embodiment of mRNA (i.e. SEQ ID No. 9) of BraA.eIF(iso)4E.a variant in line RLR22, is provided herein as SEQ ID No. 10, as follows: MATE DVNEALAAAEVP ATE TTEKQPADKLERKWS W DNQSKPKQGAAWGASLRKAYT DTVQDFCLHET IFIPSKLTPNAE IHMFKAGVEPKWEDPECANGGKWTFVVTSNRKPALDKAWLETLMALVGEQFDEADE IC GWASVRPKQDKLSLWTRTKSNEAVLMGIGKKWKE I LDVTDKI TFTNHDDSRRTRFTV .

SEQ ID No: 10

In SEQ ID No.10, a new amino acid residue (i.e. C) is translated as a result of the nucleic modification and is shown in bold. As can be seen, translation of the mRNA shown in SEQ ID No: 9 having part of the intron retained did not result in a ftameshift. Comparing SEQ ID No. 9 with SEQ ID No.3, amino acids:

phenylalanine (F)-tryptophan(W) -glycine (G) at positions 75-77 have been replaced by cysteine(C). Having lost one amino acid and had another substituted may result in a protein that is non- functional for the virus.

Accordingly, the eIF4E or eIF(iso)4E protein variant of the first aspect may comprise an amino acid sequence substantially as set out in SEQ ID No: 6, 8 or 10, or a variant or fragment thereof. The nucleic acid, which encodes the eIF4E or eIF(iso)4E protein, may comprise a sequence substantially as set out in SEQ ID No: 4, 5, 7 or 9, or a variant or fragment thereof.

In view of the above, it will be appreciated that the mis- splicing of the nucleic acid encoding the eIF4E or eIF(iso)4E protein variant of the first aspect may be caused by a modification in said nucleic acid sequence. It is preferred that the modification occurs in a non-coding region of the gene. The modification may be located in a splice element in the nucleic acid sequence encoding eIF4E or eIF(iso)4E protein to produce an aberrant splice element. The modification may be present upstream or downstream of the exon-intron junction or native splice site. Preferably, the modification is present between the position -10 and +10 of the native splice site. More preferably, the modification is present between the position -5 and +5 of the native splice site. Most preferably, the modification is present at position -1 to +1 of the native splice site. The native splice site may be at or towards the 5'- or 3'-end of an intron. The splice element may comprise a 3' splice site, a 5' splice site and a branch site, all of which are required for accurate removal of the intron during production of mature message RNA. Modifications in one, or more of these components may result in failure to remove some, or all of the intron, or removal of exon sequence. This in turn results in truncated or elongated proteins which are highly unlikely to be functional or would be less functional than the native protein.

The modification may be located in an intron in the nucleic acid sequence encoding eIF4E or eIF(iso)4E protein. The modification may be in intron 1, 2, 3 or 4 of the nucleic acid sequence encoding eIF4E or eIF(iso)4E proteins. Intron 1 corresponds to bases 201 to 263 of SEQ ID No. 1, intron 2 corresponds to bases 439 to 509 of SEQ ID No. 1, intron 3 corresponds to bases 636 to 1187 of SEQ ID No. 1 and intron 4 corresponds to bases 1254 to 1339 of SEQ ID No. 1 Preferably, the modification is in intron 1 of the nucleic acid sequence encoding eIF4E or eIF(iso)4E protein.

The modification may comprise an insertion, deletion, substitution or any combination of these, of at least one nucleic acid base anywhere in the nucleic acid sequence encoding eIF4E or eIF(iso)4E protein, preferably in an intron, more preferably in intron 1. The insertion may be a purine (adenine or guanine) or a pyrimidine (cytosine or thymine). Preferably, the modification comprises a guanine insertion. The modification may result in a frameshift of the codon reading frame relative to that of the native protein. The modification may result in the formation of a premature stop codon. The modification may result in no frameshift, or no premature stop codon, but with the addition or deletion of amino acids.

Preferably, the modification comprises an insertion at position 201 of SEQ ID No. 1, which insertion is preferably a guanine.

The inventor believes that it may be possible to modify the plant genome to produce mutations that would result in mis-splicing of eIF4E and/ or eIF(iso)4E, or other proteins essential for the completion of virus life-cycles by artificial means known in the art. The artificial means may include inducing / promoting recombination, site-directed mutagenesis through a number of means, Targeted Induced Local Lesions in Genomes (TILLING), or other means known in the art.

The inventor has found that the eIF4E or eIF(iso)4E variant protein of the first aspect is non-functional for Turnip mosaic virus (TuMV). However, the inventor believes that the eIF4E or eIF(iso)4E protein of the first aspect is non- functional for a wide variety of viruses, which can otherwise infect plants. Thus, the eIF4E or eIF(iso)4E may be non-functional for any plant viruses that are dependent on them for completion of their life cycle.

However, it is preferred that the variant eIF4E or eIF(iso)4E is non- functional for any plant virus, in the family Potyviridae. Examples of suitable potyviruses for which the eIF4E or eIF(iso)4E may be non- functional include Pepper veinal mottle virus (PVMP), Bean common mosaic virus (BCMV), Potato virus Y (PVY), or A^ukinin mosaic virus (AzMV), or any other virus that is dependent on eIF4E and / or eIF(iso)4E.

As described herein, the eIF4E or eIF(iso)4E protein of the first aspect is nonfunctional for a range of viruses in Brassica rapa. However, the inventor believes that the eIF4E or eIF(iso)4E protein of the first aspect is non- functional for viruses in a wide range of different plant species. Thus, the eIF4E or eIF(iso)4E may be non- functional for a virus in a plant of the family Solanaceae, such as potato (all species), tomato, pepper or egg plant, Cucurbits, such as melons, squash and cucumbers, Cruciferous crops, particularly Brassica napus such as oilseed rape, Brassica rapa such as Chinese cabbage, and even more particularly Brassica oleracea such as broccoli, cauliflower, cabbage, savoy cabbage, Brussels sprouts, red cabbage, and the like etc., Fabaceae peas, beans, pulses etc. and also any monocotyledonous crops, including rice, maize, wheat, barley etc. In one preferred embodiment, the plant may be Brassica spp, preferably B. rapa and/ or Brassica oleracea. In a second aspect, there is provided an isolated nucleic acid sequence encoding an alternatively spliced variant of a plant eukaryotic translation initiation factor 4E

(eIF4E), or an isoform thereof (eIF(iso)4E), wherein the nucleic acid sequence is mis- spliced such that the eIF4E or eIF(iso)4E is non- functional for a virus.

The isolated nucleic acid sequence of the second aspect may comprise DNA, cDNA, RNA or mRNA.

The nucleic acid sequence may comprise a nucleotide sequence substantially as set out in any one of SEQ ID No: 4, 5, 7 or 9, or a variant or fragment thereof. The eIF4E or eIF(iso)4E encoded by the nucleic acid sequence may comprise an amino acid sequence substantially as set out in any one of SEQ ID No: 6, 8 or 10, or a variant or fragment thereof.

In a third aspect, there is provided a recombinant vector comprising the nucleic acid sequence of the second aspect.

The recombinant vector may be a plasmid, cosmid or phage. Such recombinant vectors are highly useful for transforming host cells with the nucleic acid molecules of the second aspect. The skilled technician will appreciate that genetic constructs of the invention may be combined with many types of backbone vector for expression purposes. The backbone vector may be a binary vector, for example one which can replicate in both E. coli and Agrobacterium tumefaciens. For example, a suitable vector may be a pBIN plasmid, such as pBIN19.

Recombinant vectors may include a variety of other functional elements in addition to the nucleic acid sequence of the invention, including a promoter. For instance, the recombinant vector may be designed such that it autonomously replicates in the cytosol of a host cell, which may be a plant cell. In this case, elements which induce or regulate DNA replication may be required in the recombinant vector. Alternatively, the recombinant vector may be designed such that it integrates into the genome of a host cell. In this case, DNA sequences which favour targeted integration (e.g. by

homologous recombination) are envisaged.

The recombinant vector may also comprise DNA coding for a gene that may be used as a selectable marker in the cloning process, i.e. to enable selection of cells that have been transfected or transformed, and to enable the selection of cells harbouring vectors incorporating heterologous DNA. Alternatively, the selectable marker gene may be in a different vector to be used simultaneously with vector containing the gene of interest. The vector may also comprise DNA involved with regulating expression of the coding sequence, or for targeting the expressed polypeptide to a certain part of the host cell, e.g. the chloroplast. Hence, the vector of the third aspect may comprise at least one additional element selected from a group consisting of: a selectable marker gene (e.g. an antibiotic resistance gene); a polypeptide termination signal; and a protein targeting sequence (e.g. a chloroplast transit peptide).

Examples of suitable marker genes include antibiotic resistance genes such as those conferring resistance to Kanamycin, Geneticin (G418) and Hygromycin (npt-II, hyg-B); herbicide resistance genes, such as those conferring resistance to phosphinothricin and sulphonamide based herbicides (bar and s l respectively; EP-A-242246, EP-A- 0249637); and screenable markers such as beta-glucuronidase (GB2197653), luciferase and green fluorescent protein (GFP).

The marker gene may be controlled by a second promoter, which allows expression in cells, which may or may not be in the seed, thereby allowing the selection of cells or tissue containing the marker at any stage of development of the plant. Suitable second promoters are the promoter of nopaline synthase gene of Agrobacteritim and the promoter derived from the gene which encodes the 35S cauliflower mosaic virus (CaMV) transcript. However, any other suitable second promoter may be used. Various embodiments of the vector of the invention may be prepared using a suitable cloning procedure, and which may be summarised as follows. In a fourth aspect, there is provided a host cell comprising the vector of the third aspect.

The host cell may be a plant cell. Alternatively, the host cell may be a bacterium or a virus. The vector may be transformed into the host cell using techniques known to the skilled technician.

It would be appreciated that molecular techniques required to introduce the vector of the third aspect into a plant are known in the art, and may be found in textbooks such as Sambrook et al.

The inventor has found that it is possible to detect plants, which are resistant to infection from viruses.

In a fifth aspect of the invention, there is provided a method for detecting, in a test plant, the presence of a plant eukaryotic translation initiation factor 4E (eIF4E) variant, or an isoform thereof (eIF(iso)4E), which is non- functional for a virus, wherein nucleic acid encoding the eIF4E or eIF(iso)4E is mis-spliced, the method comprising the steps of:

(i) isolating RNA from a test plant;

(ii) producing cDNA from the RNA isolated in step (i) using primers specific for eIF4E or eIF(iso)4E;

(iii) determining the sequence of the cDNA produced in step (ii); and

(iv) comparing the cDNA sequence determined in step (iii) with the cDNA

sequence of wild-type eIF4E or eIF(iso)4E,

wherein a variation in the sequence of step (iii) compared to the wild-type sequence indicates that the nucleic acid encoding the eIF4E or eIF(iso)4E in the test plant is mis-spliced and is a variant of eIF4E or eIF(iso)4E.

It will be appreciated that the detection steps of (i) to (iv) involve molecular techniques that are known in the art, such as in Sambrook et al. The method may comprise a step of obtaining a sample from the test plant, from the RNA may be isolated, preferably mRNA. The cDNA may be obtained by reverse transcription polymerase chain reaction (RT-PCR) of eIF4E and/or eIF(iso)4E RNA utilising primers complementary to these genes. It would be appreciated that the skilled technician would employ techniques known in the art to design the location of the RT-PCR primers. Preferably, the RT-PCR primers are designed such that the resulting amplified product

encompasses the complete coding region of the eIF4E or eIF(iso)4E gene.

For example, the reverse transcription primer may be selected from a group consisting of:

AAAAAGCAGGCT CGAGGCGACAGAGGATG - (SEQ ID No. 13);

AGAAAGCTGGGT TCAGACAGTGAACCTAGTTCTTC (SEQ ID No. 14); and AGAAAGCTGGGT TCAGACAGTGAACCGAGTTCTTC (SEQ ID No. 15).

The method of the fifth aspect may further comprise a step of identifying the modification in the plant genome that causes mis-splicing of eIF4E or eIF(iso)4E, wherein said identification step may comprise breeding the test plant such that it is homozygous for eIF4E or eIF(iso)4E gene. The method may comprise determining the genomic sequence of said gene. The method may comprise comparing the determined genomic sequence with the genomic sequence of wild type eIF4E or eIF(iso)4YL to identify the modification that causes mis-splicing of eIF4E or eIF(iso)4E.

In an embodiment where the test plant is heterozygous for eIF4E or eIF(iso)4E, homozygosity may obtained by self-crossing the test plant.

The inventor has found that it is possible to generate and select plants, which resistant to infection from viruses.

Thus, in a sixth aspect, there is provided a method for selecting a virus-resistant plant, the method comprising detecting, in a test plant, the presence of:- (i) a plant eukaryotic initiation factor 4E (eIF4E) variant or an isoform thereof

(eIF(iso)4E) of the first aspect; and (ii) at least one copy of wild-type eIF4E or eIF(iso)4E, wherein the wild-type copy of eIF4E or eIF(iso)4E can be used by the plant, but cannot be used by a virus. In a seventh aspect, there is provided use of a plant eukaryotic initiation factor 4E

(eIF4E) variant or an isoform thereof (eIF(iso)4E) of the first aspect or the nucleic acid of the second aspect, for inducing viral-resistance in a plant.

In an eighth aspect, there is provided a method for producing a virus-resistant plant, the method comprising crossing a parent plant that expresses a plant eukaryotic initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) of the first aspect with a recipient plant, the recipient comprising at least one trait, selected from a group consisting of an agronomic advantage, a commercial advantage, and/ or suitability for a particular climate or soil.

The recipient plant is a commercially useful plant, such as a crop (e.g. rice, wheat etc) or a vegetable (e.g. a tomato etc). As described in the examples and as shown in Figure 8, the recipient plant (which may be B. rapa) may be crossed with a parent plant which is resistant to viral infections, because it expresses the eIF4E or eIF4(iso)4E variant of the first aspect. The method may further comprise back-crossing the progeny of the first cross with the recipient parent (i.e. recurrent plant line). Between 1 and 10 rounds of back-crossing may be required. The method according to the fifth aspect may be used to ensure that the variant allele of eIF4E or eIF(iso)4E (which confers virus resistance) is present in the non-recurrent plants in each cross. Finally, the method may comprise a step of self-pollinating or selfing virus resistant plants derived from the back-crossing programme. Plants in the subsequent generation that are homozygous for the modified splice site (which confers virus resistance) may then be identified.

For breeding Fi hybrid plants with viral resistance, two inbred parent plant lines derived from back-crossing programmes and homozygous for the eIF4E or eIF(iso)4E of the first aspect may be used to generate a Fi hybrid. These two inbred parental lines may then be crossed together to generate virus-resistant Fi hybrids homozygous for the eIF4E or eIF(iso)4E of the first aspect.

It is preferred that the parent plant used in the method may express at least one copy of wild-type eIF4E or eIF(iso)4E, wherein the wild-type copy of eIF4E or eIF(iso)4E may be used by the plant, but not by a virus.

In a ninth aspect, the invention provides a virus-resistant plant produced by or obtainable by the method of the eighth aspect.

In a tenth aspect, the invention provides a virus-resistant plant comprising the plant eukaryotic translation initiation factor 4E (eIF4E) variant or an isoform thereof (eIF(iso)4E) of the first aspect, or the nucleic acid encoding an alternatively spliced variant of eIF4E or eIF(iso)4E of the second aspect.

Preferably, the plant of the ninth and tenth aspects expresses at least one copy of wild- type eIF4E or eIF(iso)4E, wherein the wild-type copy of eIF4E or eIF(iso)4E can be used by the plant but not by a virus. The plant may comprise a modification or mutation in a non-coding region of the gene encoding eIF4E or eIF(iso)4E. The modification may be within a splice- element of eIF4E or eIF(iso)4E nucleic acid.

The use, method or plant may comprise use of eIF(iso)4E.a, eIF(iso)4E.b or eIF(iso)4E.c, preferably eIF(iso)4E.

The plant may be of the family Solanaceae, such as potato, all species, tomato, pepper or egg plant, Cucurbits, such as melons, squash and cucumbers, Cruciferous crops, particularly Brassica napus such as oilseed rape, Brassica rapa such as Chinese cabbage, and even more particularly Brassica oleracea such as broccoli, cauliflower, cabbage, savoy cabbage, Brussels sprouts, red cabbage, and the like etc., Fabaceae peas, beans, pulses etc. and also any monocotyledonous crops, including rice, maize, wheat, barley etc. In one preferred embodiment, the plant may be Brassica spp, preferably B. rapa and/ or Brassica oleracea. It will be appreciated that the method of any one of the fifth, sixth or eighth aspects, the use of the seventh aspect, or the plant of the ninth aspect may comprise the use of a transgenic plant. Indeed, any of the plants described herein may be transgenic, i.e. produced using recombinant DNA technology. The scenario described above explains virus resistance in plants that possess a single copy of eIF4E and/ or eIF(iso)4E, which a virus can use to complete its life-cycle and cause an infection. Thus, in the preferred embodiments, the plant is homozygous for eIF4E and/ or eIF(iso)4E, which encodes the alternatively spliced variant of a plant eukaryotic translation initiation factor 4E (eIF4E), or an isoform thereof (eIF(iso)4E), which is non- functional for a virus.

However, it will be appreciated that some plants have multiple copies/loci of one, or both of these two genes. For example, Brassica rapa has three copies of both eIF4E and eIF(iso)4E (i.e. BraA.eIF4E.a, b and c; and BraA.elF (iso)4E. a, b and c), and a virus may be able to use any of these genes to complete its life cycle, and cause a viral infection in the host plant. Thus, in order to confer virus resistance in plants having multiple copies/loci of eIF4E and/ or eIF(iso)4E, it is preferred that the alleles of eIF4E and/ or eIF(iso)4E at each of these other loci are also non-functional for the virus.

In order to investigate this further, the inventors crossed the B. rapa virus infection- susceptible line R-o-18 and the virus-resistant line RLR22 and an Fi plant was backcrossed to a self from the resistant plant to produce a Bi plant that was

homozygous for the RLR22 allele of BraA.elF (iso)4E.a and the RLR22 allele of

BraA.eIF4E.c, but heterozygous at the BraA.eIF(iso)4E.c locus. BiSi plants derived from this particular individual were then phenotyped and genotyped. Surprisingly, plants homozygous for the RLR22 allele of BraA.elF (iso)4E.c were completely resistant to Turnip mosaic virus (TuMV), heterozygotes were only slightly susceptible to virus infection, whereas plants that were homozygous for the R-o-18 allele of

BraA.eIF(iso)4E.c were completely susceptible to viral infections. Accordingly, the inventors have identified a second locus involved in the broad- spectrum resistance to the virus in B. rapa as BraA.eIF(iso)4E.c on chromosome A8. This second locus is referred to herein as ConTROI . The inventors believe that, for broad- spectrum virus resistance, a plant requires the second gene (i.e. ConTROI , the RLR22 allele of BraA.eIT(iso)4E.c) in addition to the first gene (i.e. retrOI , the RLR22 allele of BraA.eIT(iso)4E.a). This demonstrates the importance of more than one gene for conferring virus resistance in plants where viruses are able to utilise multiple copies/loci of eIT4E and/or eIT(iso)4E. The genomic DNA sequences determined for the eIT4E and eIT(iso)4E alleles from Brassica rapa lines R-o-18 and RLR22 and predicted mRNA and polypeptide sequences are provided below along with information on whether Turnip mosaic virus is able to use each allele.

R-o-18 BraA.eIT4E.a

The R-o-18 allele of BmA.eIT4E.a located on chromosome Al, which TuMV cannot use in B. rapa, has the following genomic DNA sequence (exons 1 to 5, with the introns shown in bold): ATGGCGGTAGAAGACACACTCAAGCCTAATGTCGCTACGGAAGAATCGAATCCCAATTCT GCAGATCACC CGATCGATCGATACCATGAGGAAGGCGACGATGCCGAGGAAGGAGCGACCGTAGACGAAT CGAGCAAATC CGCCGTCCCTGAATCGCATCCGTTGGAGCATTCGTGGACTCTCTGGTTCGATAACCCTTC CGTCAAATCA AAGCAGACGACTTGGGGAAGCTCCTTACGATCCGTCTTCACCTTCTCCACCGTCGAGGAG TTCTGGAGGT TGGTAGCTTTACAACAATCTTTTTCCTTCTTACAGTAATTCCACAATCTGGGTTTTGTTT AGATTTTGAT TTCTCACAGGAAAGTTATCTTCTTTGTTGTTGCTGTTAGAATCTTGTTTGATGTTTGAAC AAACAGTTAC TTGTTGGATGCTAGTGTATTGGCTTTGACATTTTACTTTTGATTTGTAGTTTGTACAATA ACATTCGGCA CCCGAGCAAGTTAGCTAACGGAGCTGACTTGTACTGTTTCAAACACAATATTGAACCTAA GTGGGAGGAT CCTATCTGTGCCAACGGAGGCAAGTGGACTATGAACTTCTCTAGGGAGAAGTCTGATAAG CCCTTTCTTT ACACCGTATGTAACTTGACATTCATATAGTTCTTGTTTCACACCATCCAGTCTCCAGTCT AATCGGGTTG TTGTTGTTGTtgttgtCACTTGTAGTTGCTTGCTTTGATTGGAGAACAGTTTGACCATGG AGATGAAATC TGTGGAGTTGTTGTTAACGTTAGAGCTAAGCAAGAGAGGATATCTATTTGGACTAAAAAC TCTTCCAACG AAGCGGCTCAGGTACAAGACAAAAAAAACCCACATCAAACTGTGTCTCTCTCTCGGTCTG AAGAAAAGAC GTGGAAATTTTATTTTATTTAATGTTACAGGTGAGCATTGGGAGACAGTGGAAGGAGTTT CTTGATTACA ACAGCAGCATTGGTTTCATCATCCATGTAAAGAGCGTTTCTGTTGTTGCTAATTTCTGTT TTTTTTTTCT TTCTATGGATCGCTCACTACTTGTTGTATGTGTGTATTGGTTTGGTTTCTCTTCAGGAGG ATGCGaAGaA GCTGGACAGAGGC G C AAAG AG CGCTTACACTGCCTGA

(SEQ ID No. 34)

The R-o-18 allele of BraA.eIT4E.a, which TuMV cannot use in B. rapa, has the following mRNA sequence:

AUGGCGGUAGAAGACACACUCAAGCCUAAUGUCGCUACGGAAGAAUCGAAUCCCAAUUCU GCAGAUCACC CGAUCGAUCGAUACCAUGAG G AAG GCGACGAUGCCGAG G AAG GAGCGACCG U AG AC G AAU C G AG C AAAU C CGCCGUCCCUGAAUCGCAUCCGUUGGAGCAUUCGUGGACUCUCUGGUUCGAUAACCCUUC CGUCAAAUCA AAGCAGACGACUUGGGGAAGCUCCUUACGAUCCGUCUUCACCUUCUCCACCGUCGAGGAG UUCUGGAGUU UGUACAAUAACAUUCGGCACCCGAGCAAGUUAGCUAACGGAGCUGACUUGUACUGUUUCA AACACAAUAU UGAACCUAAGUGGGAGGAUCCUAUCUGUGCCAACGGAGGCAAGUGGACUAUGAACUUCUC UAGGGAGAAG UCUGAUAAGCCCUUUCUUUACACCUUGCUUGCUUUGAUUGGAGAACAGUUUGACCAUGGA GAUGAAAUCU GUGGAGUUGUUGUUAACGUUAGAGCUAAGCAAGAGAGGAUAUCUAUUUGGACUAAAAACU CUUCCAACGA AGCGGCUCAGGUGAGCAUUGGGAGACAGUGGAAGGAGUUUCUUGAUUACAACAGCAGCAU UGGUUUCAUC AUCCAUGAGGAUGCGaAGaAGCUGGACAGAGGCGCAAAGAGCGCUUACACUGCCUGA

(SEQ ID No. 35)

The R-o-18 allele of BraA.eIF4E.a, which TuMV cannot use in B. rapa, codes for the following polypeptide sequence:

MAVEDTLKPNVATEESNPNSADHPIDRYHEEGDDAEEGATVDESSKSAVPESHPLEHSWT LWFDNPSVKS KQTTWGSSLRSVFTFSTVEEFWSLYNNIRHPSKLANGADLYCFKHNIEPKWEDPICANGG KWTMNFSREK SDKPFLYTLLALIGEQFDHGDEICGVVVNVRAKQERISIWTKNSSNEAAQVSIGRQWKEF LDYNSSIGFI IHEDAKKLDRGAKSAYTA .

(SEQ ID No. 36)

RLR22 BraA.eIF4E.a

The RLR22 allele of BmA.eIF4E.a located on chromosome Al which TuMV cannot use in B. rapa has the following genomic DNA sequence (exons 1 to 5, with the introns shown in bold):

ATGGCGGTAGAAGACACTCTCAAGCCTAACGTCCCTACGGAAGAATCGAATCCCAATTCT GTAGATCACC CGATCGATCGATACCATGAGGAAGGCGACGATGCCGAGGAAGGAGCGATCGTAGACGAAT CGAGCAAATC CGCCGTCCCTGAATCGCATCCGTTGGAGCATTCGTGGACTCTCTGGTTCGATAACCCTTC CGTCAAATCA AAGCAGACGACTTGGGGAAGCTCCTTACGATCCGTCTTCACCTTCTCCACCGTCGAGGAG TTCTGGAGGT TGGTAGCTTTACAACAATCTTTTTCCTTCTTAATGTAATTCCACAATCTGGGTTTTGTTT AGATTTCGAT TTCTCACAGGAAAGTTATCTTCTTTATGGGTGGGTTTAAATCTCATGAAGTCACTGTTCT TCTCTGTCTA TGAAGAATTGCCTGTTTGGTGTTTGAACAAACAGTTACTTGTTGGATGCTATTTTATTGG CTTCTTATTA TATGTGACATTGCAGTTTGTACAATAACATTCGGCATCCGAGCAAGTTAGCTAACGGAGC TGACTTGTAC TATTTCAAACACAATATTGAACCTAAGTGGGAGGATCCTATCTGTGCCAACGGAGGCAAG TGGACTATGA ACTTCTCTAGGGAGAAGTCTGATAAGCCCTTTCTTTACACCGTATGTAACTTGACATTCA TATAGTTCTT GTTTCACACCATCCAGTCTCCAGTCTAATCGGGTTGTTGTTGTTGTTGTCACTTATAGTT GCTTGCTTTG ATTGGAGAACAGTTTGACCATGGAGATGAAATCTGTGGAGTTGTTGTTAACGTTAGAGCT AAGCAAGAAA GGATATCTATTTGGACTAAAAACTCTTCCAACGAAGCTGCTCAGGTACAAGACAAAAAAA GAACCCCCAT CGAACTGTATCTCTCTCTCGGTCTGAAGAAAAGACGTGGAAATTTTATATTGTTTAATGT TACAGGTGAG CATTGGGAGACAGTGGAAGGAGTTTCTTGATTACAACAGCAGCATTGGTTTCATCATCCA TGTAATAGTA TTTCTGTTGTTGCTAATTTCTTTCTTTTTTTCTTCTATGGATCGCTCACTACTTGTTGTA TGTGTGTATT GGTTTGGTTTCTCTTCAGGAGGATGCGAAGAAGCTGGACAGAGGCGCAAAGAGCGCTTAC ACTGCCTGA

(SEQ ID No. 37)

The RLR22 allele of BraA.eIF4E.a which TuMV cannot use in B. rapa has the following mRNA sequence:

AUGGCGGUAGAAGACACUCUCAAGCCUAACGUCCCUACGGAAGAAUCGAAUCCCAAUUCU GUAGAUCACC CGAUCGAUCGAUACCAUGAGGAAGGCGACGAUGCCGAGGAAGGAGCGAUCGUAGACGAAU CGAGCAAAUC CGCCGUCCCUGAAUCGCAUCCGUUGGAGCAUUCGUGGACUCUCUGGUUCGAUAACCCUUC CGUCAAAUCA AAGCAGACGACUUGGGGAAGCUCCUUACGAUCCGUCUUCACCUUCUCCACCGUCGAGGAG UUCUGGAGUU UGUACAAUAACAUUCGGCAUCCGAGCAAGUUAGCUAACGGAGCUGACUUGUACUAUUUCA AACACAAUAU UGAACCUAAGUGGGAGGAUCCUAUCUGUGCCAACGGAGGCAAGUGGACUAUGAACUUCUC UAGGGAGAAG UCUGAUAAGCCCUUUCUUUACACGUUGCUUGCUUUGAUUGGAGAACAGUUUGACCAUGGA GAUGAAAUCU GUGGAGUUGUUGUUAACGUUAGAGCUAAGCAAGAAAGGAUAUCUAUUUGGACUAAAAACU CUUCCAACGA AGCUGCUCAGGUGAGCAUUGGGAGACAGUGGAAGGAGUUUCUUGAUUACAACAGCAGCAU UGGUUUCAUC AUCCAUGAGGAUGCGAAGAAGCUGGACAGAGGCGCAAAGAGCGCUUACACUGCCUGA (SEQ ID No. 38)

The RLR22 allele of BmA.eIF4E.a which TuMV cannot use in B. rapa codes for the following polypeptide sequence:

MAVEDTLKPNVPTEESNPNSVDHPIDRYHEEGDDAEEGAIVDESSKSAVPESHPLEHSWT LWFDNPSVKS KQTTWGSSLRSVFTFSTVEEFWSLYNNIRHPSKLANGADLYYFKHNIEPKWEDPICANGG KWTMNFSREK SDKPFLYTLLALIGEQFDHGDEICGVVVNVRAKQERISIWTKNSSNEAAQVSIGRQWKEF LDYNSSIGFI IHEDAKKLDRGAKSAYTA .

(SEQ ID No. 39)

R-o-18 BraA.eIF4E.b

R-o-18 BraA.eIF4E.b located on chromosome A3 is non-functional. The R-o-18 allele of BraA.eIF4E.b which is non- functional has the following genomic DNA sequence (exons 1 to 3, with the introns shown in bold and premature stop codons being underlined) :

ATGGCGGTAGAAGACACTTTCAAGCCTGTTGTTGCTATCAaGGAAGCGAAACCTAAT TATGTAGAGCATC TGATTGGACCAGGCGACGATGCGGAGGAAGGAGAGATCGTAGACGGAGATGTTGACAAAT CTGGAAATCC ACAGTTCCTGAATCGCATTCGTTGGAGCATTTGTGGACTTTCCACAGTTCCTCTTTTTAT TCGTTGACTT TCTAACGAAAAGACGACTTGGGGAAGCTCCTTAGATCCGCGTTCACGTTCTCCACGGTCG AGGAGTTCTG GAGGTTGGTGCTTTAAAACAATCTTTTCGTTCTTCCAATAATTCTACAATCTGGGTTTTG GTTTGGATTT AGATTTCTCGAGGAAAGTTATGTTCTTTGTTGATGGGTTAGATCACATGAAGTCATCGTT CTTATCTGTT TCTGAAGAATTGTTTGTTTGATGTTTGAATTTGTAGCTACAAGCTTATATGTTAAGTTTT TAAAAAGATA GCGAAGATATTATATTCGATGTAAATCAATGTTTTACACCTTAGTATTTTTGTTGGTAAC AGAAGATGAA CAAAGAGTTATTTGGTTAGTGTTGGATGCTATTGTATTGCTGTGCACTCGTGTGTGTATA TGCTTTCTTG TATTCTCCTTTCTTGAGAACCTTTCTCTCAATGGGAATAATGAACTTGTAGTTTGTTCTA TTGGGAGACA ATAGAAGGAGTTCCTTGATTACAACAGCTGCATTGGTTTCATCATCCATGTGGGAAGAGT GCTTGTCTTG ATGCTAATTCAAAAGGCTTTTCTTTTGCATTTCTCAGTGTTTATTTTTTTGTCTGTATTG GCTTGTTTTC CCTTCAGGAGGATGCGACGAAGATGAACAAGTACAACCATACTGTTATCGATCTACAATT TGAGTTTTAA

(SEQ ID No. 40)

The R-o-18 allele of BmA.eIF4E.b which is non- functional has the following mRNA sequence:

AUGGCGGUAGAAGACACUUUCAAGCCUGUUGUUGCUAUCAaGGAAGCGAAACCUAAU UAUGUAGAGCAUC UGAUUGGACCAGGCGACGAUGCGGAGGAAGGAGAGAUCGUAGACGGAGAUGUUGACAAAU CUGGAAAUCC ACAGUUCCUGAAUCGCAUUCGUUGGAGCAUUUGUGGACUUUCCACAGUUCCUCUUUUUAU UCGUUGACUU UCUAACGAAAAGACGACUUGGGGAAGCUCCUUAGAUCCGCGUUCACGUUCUCCACGGUCG AGGAGUUCUG GAGUGGGAGACAAUAGAAGGAGUUCCUUGAUUACAACAGCUGCAUUGGUUUCAUCAUCCA UGAGGAUGCG ACGAAGAUGAACAAGUACAACCAUACUGUUAUCGAUCUACAAUUUGAGUUUUAA

(SEQ ID No. 41) The R-o-18 allele of BraA.eIF4E.b which is non-functional codes for the following polypeptide sequence:

MAVEDTFKPVVAIKEAKPNYVEHLIGPGDDAEEGEIVDGDVDKSGNPQ LNRIRWS ICGLSTVPL IR . L SNEKTTWGSSLDPRSRSPRSRSSGVGDNRRSSLITTAALVSSSMRMRRR . TSTTILLS IYNLSF

(SEQ ID No. 42)

RLR22 BraA.eIF4E.b

No sequence obtained for the RLR22 allele of BraA.eIF4E.b located on chromosome A3.

R-o-18 BraA.eIF4E.c

The R-o-18 allele of BraA.eIF4E.c located on chromosome A8 which TuMV cannot use in B. rapa has the following genomic DNA sequence (exons 1 to 5, with the introns shown in bold):

ATGGCGGTAGAAGACACTTCCAAGCCTGTTGTCGTTGCGGAaGaAGCGAACCCTAACCCC ACAGACCATC CGATTGATCGATACCATGAAGAAGGCGACGATGCTGAGGAAGGAGAGATCGCCGGCGGCG AAGGAGACGG AGACGAATCGAGCAAATCCGCCGTTCCGCAGTCGCATCCGTTGGAGCATTCGTGGACTTT CTGGTTCGAT AACCCTTCTGTTAAATTAAAGCAGGCGACTTGGGGAAGCTCCTTGCGATCCGTGTTCACT TTCTCCACCG TCGAGGAGTTCTGGAGGTTAGGGCTTTTTACAAAATCAATAATTTATTCTTACAATTATT ATTTCGACAT GGGTTTTAGTTTGGTTTTGTCTAGATTATGTTTTCTCGAAGAAAGTTATGTTCTTTCTTC GTGGGTTAAG TGAATCACTTGTTCTTGTTTGTTTCTGAAATATTGCTTTGCTTGTTTGTTTGGTGTTTGA ATTATGGAAA AGGAATCTTTTGTTCTTTCAATGACTATGCGGCATGGGTTTTGGTTTAGTTTTGTTTAGA TTGTTATATC TCAAGGAAAGTTATGTTCTTTGTTGGTGGGTTAGATTCCGTGAAGTCACTTGCTCTTGTA TGTTTCTGAA GAATCACTTAATTGGTGTTTGAGTTTGTAGCTACTGCTTATATGTTAAGGTCATATTTGT TCGCTTGTTA TCTTCACAAGAGCTAAAACATTGAACCAGGGAATCATCGGTCTTATTCGGTTAATGTTGG ATGCTATTGT GTTGTTGCGTGTGTGTATATATACTTTCTTGTGTTCTTCTTTTTGATTTGTGAGTCTCTC TCAAGTCTCA ATGGGATTTAAGGACTTGTCTTTGGCTCTATTGACTTCATCTTACTTTGGTTGCAGTCTG TTCAATAACA TGAGGGGTCCGAGCAAGTTAGCTGGCGGAGCTGACTTCTACTGTTTCAAGCACAATATCG AACCTAAGTG GGAGGATCCTATCTGTGCTAATGGAGGCAAATGGACTATGAACTTCCCGAAGGAGAAGTC TGATAAGCCC TGGCTTTACACCGTATGGTTTTGATTCTTCTTACTTGAACACATGATTCTTGTTTCACCA TCCATTCGAG TCTGATTGGGTTTTTGTTTTCTCGATGTAGTTGCTTGCGTTGATTGGAGAACAGTTTGAC CATGGAGATG AGATATGCGGAGCTGTTGTCAACGTTAGAGGAAAGCAAGAGAGGATTTCCATTTGGACCA AAAATGCTTC CAACGAAGCTGCTCAGGTAAAAGATCATTTATTGACAAATAAATGTTAAATTGTCTCTCT TCCGGCTAAA AGACCTGAAATTTCTTGTTTCCTTTGATGTTGCAGGTGAGCATTGGGAAACAATGGAAGG AGTTTATTGA TTACAACAACAGCATTGGTTTCATCATCCATGTAAGAAGAGAGCTTTTCTCTTGAATGCT TATTCATAAG TTTTTTTTTAATATCTCACTGTCTGTATTGTTTTTTTTTTCTTCAGGAGGATGCCAAGAA GCTGGACAGG GGCGCGAAGAGCGCTTACACCGCTTGA

(SEQ ID No. 43)

The R-o-18 allele of BraA.eIF4E.cwhich TuMV cannot use in B. rapa has the following mRNA sequence:

AUGGCGGUAGAAGACACUUCCAAGCCUGUUGUCGUUGCGGAaGaAGCGAACCCUAACCCC ACAGACCAUC CGAUUGAUCGAUACCAUGAAGAAGGCGACGAUGCUGAGGAAGGAGAGAUCGCCGGCGGCG AAGGAGACGG AGACGAAUCGAGCAAAUCCGCCGUUCCGCAGUCGCAUCCGUUGGAGCAUUCGUGGACUUU CUGGUUCGAU AACCCUUCUGUUAAAUUAAAGCAGGCGACUUGGGGAAGCUCCUUGCGAUCCGUGUUCACU UUCUCCACCG UCGAGGAGUUCUGGAGUCUGUUCAAUAACAUGAGGGGUCCGAGCAAGUUAGCUGGCGGAG CUGACUUCUA CUGUUUCAAGCACAAUAUCGAACCUAAGUGGGAGGAUCCUAUCUGUGCUAAUGGAGGCAA AUGGACUAUG AACUUCCCGAAGGAGAAGUCUGAUAAGCCCUGGCUUUACACCUUGCUUGCGUUGAUUGGA GAACAGUUUG ACCAUGGAGAUGAGAUAUGCGGAGCUGUUGUCAACGUUAGAGGAAAGCAAGAGAGGAUUU CCAUUUGGAC CAAAAAUGCUUCCAACGAAGCUGCUCAGGUGAGCAUUGGGAAACAAUGGAAGGAGUUUAU UGAUUACAAC AACAGCAUUGGUUUCAUCAUCCAUGAGGAUGCCAAGAAGCUGGACAGGGGCGCGAAGAGC GCUUACACCG CUUGA

(SEQ ID No. 44)

The R-o-18 allele of BraA.eIF4E.cwhich TuMV cannot use in B. rapa codes for the following polypeptide sequence:

MAVEDTSKPVVVAEEANPNPTDHPIDRYHEEGDDAEEGEIAGGEGDGDESSKSAVPQ SHPLEHSWTFWFD NPSVKLKQATWGSSLRSVFTFSTVEEFWSLFNNMRGPSKLAGGADFYCFKHNIEPKWEDP ICANGGKWTM NFPKEKSDKPWLYTLLALIGEQFDHGDEICGAVVNVRGKQERI S IWTKNASNEAAQVS IGKQWKEFIDYN NS IGFI IHEDAKKLDRGAKSAYTA .

(SEQ ID No. 45)

RLR22 BraA.eIF4E.c

The RLR22 allele of BraA.eIF4E.c located on chromosome A8 which TuMV cannot use in B. rapa has the following genomic DNA sequence (exons 1 to 5, with the introns shown in bold):

ATGGCGGTAGAAGACACTTCCAAGCCTGTTGTCGTTGCGGAAGAAGCGAACCCTAAC CCCACAGACCATC CGATTGATCGATACCATGAAGAAGGCGACGATGTTGAGGAAGGAGAGATCGCCGGCGGCG AAACAGACGG AGACGAATCGAGCAAATCCGCCGTTCCGCAGTCGCATCCGTTGGAGCATTCGTGGACCTT CTGGTTCGAT AACCCTTCTGTTAAATTAAAGCAGGCGACTTGGGGAAGCTCCTTGCGATCCGTGTTCACT TTCTCCACCG TCGAGGAGTTCTGGAGGTTAGGGCTTTTTACAAAATCAATAATTTATTCTTGCAATGATT ATTTCGACAT GGGTTTTAGTTTGGTTTTGTCTAGATTATGTTTTCTCGAGGAAAGTTATGTTCTTTCTTC GTGGGTTAAG TCAAGTCACTTGTTCTTGTCTGTTTCTGAAGTATGACTTGTTTGGTGTTTGAATTATGGA GGTCAGGCCT TACAAAACAATCTTTTGTTCTTCTTCTGATTATTTCGACATGGGTTTTAGTTTGGTTTTG TCTAGATTAT GTTTTCTCGAGGAAAGTTATGTTCTTTCTTGTTGGGTTAAGTGAAGTCACTTGCTCTTGT CTGTTTCTGA AATATTACTTGTTTGGTGTTTGAAATTGTAGCTACTGCTTATATGTTAAGGTCATATTTG TTCACTTCTA ATCTTCACAAGAGCTAAAACATTGAACTAGGGAATCATTGGTCTTATTTGGTTAATGTTG GATGCTATTG TGTTGTTGCGTGTGTGTATATATACTTTCTTGTGTTCTTCTTTTTGATTTGTGAGTCTCT CTCAAGTCTC AATGGGATTTAAGGACTTGTCTTTGGCTCTATTGACTTCATCTTACTTTGGTTGCAGTCT GTTCAATAAC ATGAAGGGTCCGAGCAAGTTAGCTGGCGGAGCTGACTTCTACTGTTTCAAGCACAATATC GAACCTAAGT GGGAGGATCCTATCTGTGCTAATGGAGGCAAATGGACTATGAACTTCCCGAAGGAGAAGT CTGATAAGCC CTGGCTTTACACTGTATGGTTTTGATTCTTCTTACTTGAACACATGATTCTTGTTTCACC ATCCATTCGA GTCTGATTGGGTTTTTGTTTTTTCCATGTAGTTGCTTGCGTTGATTGGAGAACAGTTTGA CCACGGAGAT GAGATATGCGGAGCTGTTGTCAACGTTAGAGGAAAGCAAGAGAGGATTTCCATTTGGACC AAAAATGCTT CCAACGAAGCTGCTCAGGTAAAAGATCATTTATTGACAAATAAATGTTAAATTGTCTCTC TTCCGGCTAA AAGACCTGAAATTTCTTGTTTCCTTTGATGTTGCAGGTGAGCATTGGGAAACAATGGAAG GAGTTtATTG ATTACAACAACAGCATTGGTTTCATCATCCATGTAAGAAGAGAGCTTTTCTCTTGAATGC TTATTCATAA GTTTTTTTTTAATATCTCACTGTCTGTTTTGTTTTTTTTTTCTTCAGGAGGATGCCAAGA AGCTGGACAG GGGCGCGAAGAGCGCTTACACCGCTTGA

(SEQ ID No. 46) The RLR22 allele of BraA.eIF4E.cwhich TuMV cannot use in B. rapa has the following mRNA sequence:

AUGGCGGUAGAAGACACUUCCAAGCCUGUUGUCGUUGCGGAAGAAGCGAACCCUAACCCC ACAGACCAUC CGAUUGAUCGAUACCAUGAAGAAGGCGACGAUGUUGAGGAAGGAGAGAUCGCCGGCGGCG AAACAGACGG AGACGAAUCGAGCAAAUCCGCCGUUCCGCAGUCGCAUCCGUUGGAGCAUUCGUGGACCUU CUGGUUCGAU AACCCUUCUGUUAAAUUAAAGCAGGCGACUUGGGGAAGCUCCUUGCGAUCCGUGUUCACU UUCUCCACCG UCGAGGAGUUCUGGAGUCUGUUCAAUAACAUGAAGGGUCCGAGCAAGUUAGCUGGCGGAG CUGACUUCUA CUGUUUCAAGCACAAUAUCGAACCUAAGUGGGAGGAUCCUAUCUGUGCUAAUGGAGGCAA AUGGACUAUG AACUUCCCGAAGGAGAAGUCUGAUAAGCCCUGGCUUUACACUUUGCUUGCGUUGAUUGGA GAACAGUUUG ACCACGGAGAUGAGAUAUGCGGAGCUGUUGUCAACGUUAGAGGAAAGCAAGAGAGGAUUU CCAUUUGGAC CAAAAAUGCUUCCAACGAAGCUGCUCAGGUGAGCAUUGGGAAACAAUGGAAGGAGUUuAU UGAUUACAAC AACAGCAUUGGUUUCAUCAUCCAUGAGGAUGCCAAGAAGCUGGACAGGGGCGCGAAGAGC GCUUACACCG CUUGA

(SEQ ID No. 47)

The RLR22 allele of BraA.eIF4E.cwhich TuMV cannot use in B. rapa codes for the following polypeptide sequence:

MAVEDTSKPVVVAEEANPNPTDHPIDRYHEEGDDVEEGEIAGGETDGDESSKSAVPQSHP LEHSWTFWFD NPSVKLKQATWGSSLRSVFTFSTVEEFWSLFNNMKGPSKLAGGADFYCFKHNIEPKWEDP ICANGGKWTM NFPKEKSDKPWLYTLLALIGEQFDHGDEICGAVVNVRGKQERI S IWTKNASNEAAQVS IGKQWKEFIDYN NS IGFI IHEDAKKLDRGAKSAYTA .

(SEQ ID No. 48)

R-o-18 BraA.eIF(iso)4E.b

The R-o-18 allele of BraA.eIF(iso)4E.b located on chromosome A5 which TuMV cannot use in B. rapa has the following genomic DNA sequence (exons 1 to 5, with the introns shown in bold):

ATGGCGACGGAGGATGTGAACGAAGCCCTTGCGGCGGCGGAAGTACCGATAGAGTCGACA ACGGAGAAGC AGCCTCATAAGCTGGAAAGAAAATGGTGTTTCTGGTTCGATAACCAATCTAAGCCAAAGC AAGGCGCCGC CTGGGGAGCTTCCCTTCGTAAAGCCTCTACCTTCGACACTGTCGAAGATTTCTGGGGGTG TGTCGTGTCT TCTTCTCCTCCTCATTTTTAGATTTCTTCGATTAACTTCTTCTGGCATGCGTTTTTGCAG TTTGCACGAG ACTATATTCATTCCCAGCAAATTGACACCCAATGCTGATATCCACTTGTTCAAAGCTGGC GTTGAGCCCA AGTGGGAAGATCCTGAGTGTGCTCACGGCGGAAAGTGGACTTTTGTTGTCACCAACAACA GGAAGCAAGC TTTAGACAAGGCTTGGCTTGAAACTGTAATACCGTCTTCCCTTTTACTGTTTTTGTCTTT AGACAATTGT GGCTTATGTCCTAATGTCTGTTTCTTCTCTCTCTCTCTCGTAATTGGGCGGCAGTTGATG GCTTTGATTG GAGAGCAATTCGATGAGGCAGATGAGATTTGTGGTGTTGTTGCTAGTGTGCGCCTAAAGC AAGACAAGCT CTCCTTGTGGACACGGACTAAATCAAATGAAGCTGTCCTGGTTAGATTACGAATCATGTT TTCTTCTAGT TGTCtTTTTTTTTTTTTTTTTTCATTTTCTTGCTTTTTGGTGGTGTGCGATGAGATGCCC AAGTACTATT CACTAGCTTCCTTTGTTGAACGTGTTGATTGCTTTCTACAGTAAAATAGCATAAGCTGTT TAATATATCA ATAACGCTACTCTAAATTATCAACGAAAGATGTAGAGAGGTTTTTTATAATGAGTTAAAT TAGTTTTTAT ACTGAAGGTTTATAGGTTCGTTTAACTATTCATATTTCTGTGATACCTGCTTTTATAGTT TACGCTCTAT AAACATAGCATTTGACAGCTCTTTAGAACATGGCAGTATCTAGATGCTAAAAGACTAGTT TCTGAATCTG TCTGCTTAAATTACTGCTTTGTTGTTTGTTATGGTAAATTCAGATGGGTATTGGAAAGAA GTGGAAGGCG CTACTTGACGTCACCGACAAGATAACTTTCACTAACCATGTAATTAACGTTCTCCTATAG AAGCTAATAT TACTTTTGTTCATGTTTATCCTTTCACGTGCTTACTAAAATCTGGTCTACTTACTTGCAG GATGATTCTA GAAGAAGTCGGTTCACTGTCTGA

(SEQ ID No. 49)

The R-o-18 allele of BraA.eIF(iso)4E.b which TuMV cannot use in B. rapa has the following mRNA sequence:

AUGGCGACGGAGGAUGUGAACGAAGCCCUUGCGGCGGCGGAAGUACCGAUAGAGUCGACA ACGGAGAAGC AGCCUCAUAAGCUGGAAAGAAAAUGGUGUUUCUGGUUCGAUAACCAAUCUAAGCCAAAGC AAGGCGCCGC CUGGGGAGCUUCCCUUCGUAAAGCCUCUACCUUCGACACUGUCGAAGAUUUCUGGGGUUU GCACGAGACU AUAUUCAUUCCCAGCAAAUUGACACCCAAUGCUGAUAUCCACUUGUUCAAAGCUGGCGUU GAGCCCAAGU GGGAAGAUCCUGAGUGUGCUCACGGCGGAAAGUGGACUUUUGUUGUCACCAACAACAGGA AGCAAGCUUU AGACAAGGCUUGGCUUGAAACUUUGAUGGCUUUGAUUGGAGAGCAAUUCGAUGAGGCAGA UGAGAUUUGU GGUGUUGUUGCUAGUGUGCGCCUAAAGCAAGACAAGCUCUCCUUGUGGACACGGACUAAA UCAAAUGAAG CUGUCCUGAUGGGUAUUGGAAAGAAGUGGAAGGCGCUACUUGACGUCACCGACAAGAUAA CUUUCACUAA CCAUGAUGAUUCUAGAAGAAGUCGGUUCACUGUCUGA

(SEQ ID No. 50)

The R-o-18 allele of BraA.eIF(iso)4E.b which TuMV cannot use in B. rapa codes for the following polypeptide sequence (same as RLR22 BraA.eIF(iso)4E.b):

MATEDVNEALAAAEVPIESTTEKQPHKLERKWCFWFDNQSKPKQGAAWGASLRKAST FDTVEDFWGLHET IFIPSKLTPNADIHLFKAGVEPKWEDPECAHGGKWTFVVTNNRKQALDKAWLETLMALIG EQFDEADEIC GVVASVRLKQDKLSLWTRTKSNEAVLMGIGKKWKALLDVTDKITFTNHDDSRRSRFTV.

(SEQ ID No. 51)

RLR22 BraA.elF(iso)4E.b

The RLR22 allele of BraA.eIF(iso)4E.b located on chromosome A5 which TuMV cannot use in B. rapa has the following genomic DNA sequence (exons 1 to 5, with the introns shown in bold):

ATGGCGACGGAGGATGTGAACGAAGCCCTTGCGGCGGCGGAAGTACCGATAGAGTCG ACAACGGAGAAGC AGCCTCATAAGCTGGAAAGAAAATGGTGTTTCTGGTTCGATAACCAATCTAAGCCAAAGC AAGGCGCCGC CTGGGGAGCTTCCCTTCGTAAAGCCTCTACCTTCGACACTGTCGAAGATTTCTGGGGGTG TGTCGTGTCT TCTTCTCCTCCTCATTTTTAGATTTCTTCGATTAACTTCTTCTGGCATGCGTTTTTGCAG TTTGCACGAG ACTATATTCATTCCCAGCAAATTGACACCCAATGCTGATATCCACTTGTTCAAAGCTGGC GTTGAGCCCA AGTGGGAAGATCCTGAGTGTGCTCACGGCGGAAAGTGGACTTTTGTTGTCACCAACAACA GGAAGCAAGC TTTAGACAAGGCTTGGCTTGAAACTGTAATACCGTCTTCCCTTTTACTGTTTTTGTCTTT AGACAATTGT GGCTTATGTCCTAATGTCTGTTTCTTCTCTCTCTCTCTCGTAATTGGGCGGCAGTTGATG GCTTTGATTG GAGAGCAATTCGATGAGGCAGATGAGATTTGTGGTGTTGTTGCTAGTGTGCGCCTAAAGC AAGACAAGCT CTCCTTGTGGACACGGACTAAATCAAATGAAGCTGTCCTGGTTAGATTACGAATCATGTT TTCTTCTAGT TGTCTTTTTTTTTTTTTTTTTTCATTTTCTTGCTTTTTGGTGGTGTGCGATGAGATGCCC AAGTACTATT CACTAGCTTCCTTTGTTGAACGTGTTGATTGCTTTCTACAGTAAAATAGCATAAGCTGTT TAATATATCA ATAACGCTACTCTAAATTATCAACGAAAGATGTAGAGAGGTTTTTTATAATGAGTTAAAT TAGTTTTTAT ACTGAAGGTTTATAGGTTCGTTTAACTATTCATATTTCTGTGATACCTGCTTTTATAGTT TACGCTCTAT AAACATAGCATTTGACAGCTCTTTAGAACATGGCAGTATCTAGATGCTAAAAGACTAGTT TCTGAATCTG TCTGCTTAAATTACTGCTTTGTTGTTTGTTATGGTAAATTCAGATGGGTATTGGAAAGAA GTGGAAGGCG CTACTTGACGTCACCGACAAGATAACTTTCACTAACCATGTAATTAACGTTCTCCTATAG AAGCTAATAT TACTTTTGTTCATGTTTATCCTTTCACGTGCTTACTAAAATCTGGTCTACTTACTTGCAG GATGATTCTA GAAGAAGTCGGTTCACTGTCTGA

(SEQ ID No. 52)

The RLR22 allele of BraA.eIF(iso)4E.b which TuMV cannot use in B. rapa has the following mRNA sequence:

AUGGCGACGGAGGAUGUGAACGAAGCCCUUGCGGCGGCGGAAGUACCGAUAGAGUCGACA ACGGAGAAGC AGCCUCAUAAGCUGGAAAGAAAAUGGUGUUUCUGGUUCGAUAACCAAUCUAAGCCAAAGC AAGGCGCCGC CUGGGGAGCUUCCCUUCGUAAAGCCUCUACCUUCGACACUGUCGAAGAUUUCUGGGGUUU GCACGAGACU AUAUUCAUUCCCAGCAAAUUGACACCCAAUGCUGAUAUCCACUUGUUCAAAGCUGGCGUU GAGCCCAAGU GGGAAGAUCCUGAGUGUGCUCACGGCGGAAAGUGGACUUUUGUUGUCACCAACAACAGGA AGCAAGCUUU AGACAAGGCUUGGCUUGAAACUUUGAUGGCUUUGAUUGGAGAGCAAUUCGAUGAGGCAGA UGAGAUUUGU GGUGUUGUUGCUAGUGUGCGCCUAAAGCAAGACAAGCUCUCCUUGUGGACACGGACUAAA UCAAAUGAAG CUGUCCUGAUGGGUAUUGGAAAGAAGUGGAAGGCGCUACUUGACGUCACCGACAAGAUAA CUUUCACUAA CCAUGAUGAUUCUAGAAGAAGUCGGUUCACUGUCUGA

(SEQ ID No. 53)

The RLR22 allele of BraA.eIF(iso)4E.b which TuMV cannot use in B. rapa codes for the following polypeptide sequence (same as R-o-18 BraA.eIF(iso)4E.b):

MATEDVNEALAAAEVPIESTTEKQPHKLERKWCFWFDNQSKPKQGAAWGASLRKASTFDT VEDFWGLHET IFIPSKLTPNADIHLFKAGVEPKWEDPECAHGGKWTFVVTNNRKQALDKAWLETLMALIG EQFDEADEIC GVVASVRLKQDKLSLWTRTKSNEAVLMGIGKKWKALLDVTDKITFTNHDDSRRSRFTV.

(SEQ ID No. 54)

R-o-18 BraA.eIE(iso)4E.c

The R-o-18 allele of BraA.eIF(iso)4E.c located on chromosome A8 which TuMV can use in B. rapa and confers susceptibility on plants has the following genomic DNA sequence (exons 1 to 5, with the introns shown in bold):

ATGGCGACAGAGGATGTGAACGAAGCCCTTGCGGCGGCGGAGGTAACGGCGATAGAATCG ACGGAGAAGC AGCAGCCTCCTCACAAGCTCGAAAGAAAGTGGAGTCTCTGGTTCGATAACCAATCGAAAC CCAAGCAAGG CGCCGCCTGGGGAGTTTCCCTCCGTAAAGCATGTACCTTCGATACCGTCGAAGACTTCTG GGGGTTTGTC TTTTTCTTCTTCGATCTAAGATTTTCTGTGAAGTTATACTAATAAGGGTGTGTGTATTGT TGCAGTTTGC ACGAGACTATCTTCGTTCCCAGCAGATTGACACCCAACGCTGACATTCACATGTTCAAAG CTGGTGTTGA GCCCAAGTGGGAAGATCCTGAGTGTGCTAACGGCGGAAAGTGGACTTATGTTGTTACCAA CAACAGGAAG CAAGCTTTAGACAAGGCTTGGCTTGAAACTGTACTCTTCTTCTTCTTCTAACCCTTTTTA CTCTTCTGTT TTCTGACTTAATAATTTTATCTCTTGTGTTTGGCAGTTGATGGCTTTAGTTGGAGAGCAG TTTGATGAGG CAGATGAGATCTGTGGTGTGGTTGCTAGTGTCCGCCAAAAGCAAGACAAGCTCTCCTTGT GGACTAGGAC TAAATCTAATGAAGCTGTTCTGGTATCATGCTTCTCTTCTCCCTTATATATGTTTGTTTG ACAGTTTTTT AAACCACCTTTTGATACTTTGCTGACAGTATAATCATAAGCTATATTTGCCAAAGGATAT ATATATATCA GTTTAGAACATGTTAGTATGTCAAAGATGGTTTATGAATCTATCTATCGGATGAAATTGC TGCTTGTTGT TTGTTTATTGTTATTATGTTTTATATTGGTTTATGATCCTATCTGATGAGATTTCTACTC TGCTGTATAT TTAGATTGATTTATGAATTTATCTGATGAAACTACTACACTTTGTTGTaAACCTAGATGG GTATTGGGAA GAAGTGGAAGGAGATACTTGATGTCACTGACAAGATATCTTTCACTAACCATGTAATTAC TACTTCCCCA CGTAAAAAGCTAATAAATCATCCTTTTGTTAGTTCCTTTTTAAACTGTGGTCTAAATATA TGCAGGATGA TGCAAGAAGAAGTCGATTTAGTGTCTAA

(SEQ ID No. 55)

The R-o-18 allele of BraA.eIF(iso)4E.c which TuMV can use in B. rapa and confers susceptibility on plants has the following mRNA sequence:

AUGGCGACAGAGGAUGUGAACGAAGCCCUUGCGGCGGCGGAGGUAACGGCGAUAGAA UCGACGGAGAAGC AGCAGCCUCCUCACAAGCUCGAAAGAAAGUGGAGUCUCUGGUUCGAUAACCAAUCGAAAC CCAAGCAAGG CGCCGCCUGGGGAGUUUCCCUCCGUAAAGCAUGUACCUUCGAUACCGUCGAAGACUUCUG GGGUUUGCAC GAGACUAUCUUCGUUCCCAGCAGAUUGACACCCAACGCUGACAUUCACAUGUUCAAAGCU GGUGUUGAGC CCAAGUGGGAAGAUCCUGAGUGUGCUAACGGCGGAAAGUGGACUUAUGUUGUUACCAACA ACAGGAAGCA AGCUUUAGACAAGGCUUGGCUUGAAACUUUGAUGGCUUUAGUUGGAGAGCAGUUUGAUGA GGCAGAUGAG AUCUGUGGUGUGGUUGCUAGUGUCCGCCAAAAGCAAGACAAGCUCUCCUUGUGGACUAGG ACUAAAUCUA AUGAAGCUGUUCUGAUGGGUAUUGGGAAGAAGUGGAAGGAGAUACUUGAUGUCACUGACA AGAUAUCUUU CACUAACCAUGAUGAUGCAAGAAGAAGUCGAUUUAGUGUCUAA (SEQ ID No. 56)

The R-o-18 allele of BraA.eIF(iso)4E.c which TuMV can use in B. rapa and confers susceptibility on plants codes for the following polypeptide sequence:

MATEDVNEALAAAEVTAIESTEKQQPPHKLERKWSLWFDNQSKPKQGAAWGVSLRKACTF DTVEDFWGLH ETIFVPSRLTPNADIHMFKAGVEPKWEDPECANGGKWTYVVTNNRKQALDKAWLETLMAL VGEQFDEADE ICGVVASVRQKQDKLSLWTRTKSNEAVLMGIGKKWKEILDVTDKI SFTNHDDARRSRFSV .

(SEQ ID No. 57)

RLR22 BraA.eIE(iso)4E.c

The RLR22 allele of BraA.eIF(iso)4E.c located on chromosome A8 which TuMV cannot use in B. rapa and is necessary for resistance has the following genomic DNA sequence (exons 1 to 5, with the introns shown in bold):

ATGGCGACAGAGGATGTGAACGAAGCCCTTGCGGCGGCGGAGGTAACGGCGATAGAATCG ACGGAGAAGC AGCAGCCTCCTCACAAGCTCGAAAGAAAGTGGAGTTTCTGGTTCGATAACCAATCGAAAC CCAAGCAAGG CGCCGCCTGGGGAGCTTCCCTCCGTAAAGCATGTACCTTCGATACCGTCGAAGACTTCTG GGGGTTTGTC TTTTTCTTCTTCGATCTAAGATTTTCTGTGAAGTTATACTAATAGGGGTGTGTGTATTGT TGCAGTTTGC ACGAGACTATCTTCGTTCCCAGCAGATTGATACCCAACGCTGACATTCACATGTTCAAAG CTGGTGTTGA GCCCAAGTGGGAAGATCCTGAGTGTGCTAACGGCGGAAAGTGGACTTATGTTGTTACCAA CAACAGGAAG CAAGCTTTAGACAAGGCTTGGCTTGAAACTGTACTCTTCTTCTTCTTCTAACCCTTTTTA CTCTTCTGTT TTCTGACTTAATAATTTTATCTCTTGTGTTTGGCAGTTGATGGCTTTAGTTGGAGAGCAG TTTGATGAGG CAGATGAGATCTGTGGTGTGGTTGCTAGTGTCCGCCCAAAGCAAGACAAGCTCTCCTTGT GGACTAGGAC TAAATCTAATGAAGCTGTTCTGGTATCATGCTTCTCTTCTCCCTTATATATGTTTGTTTG ACAGTTTTTT AAACCACCTTTTGATACTTTGCTGACAGTATAATCATAAGCTATATTTGCCAAAGGATAT GTTAGTATGT CAAAGATGGTTTATGAATCTATATATCTGATGAAATTGTTGTTTGTTGTTTGTTTATTGT TATTATGTTT TATATTGGTTTATGATCCTATCTGATGAGATTTCTACTCTGCTATATATTTAGATTGGTT TATGAATTTA TCTGACGAAACTAATACACTTTGTTTGTAAACCTAGATGGGTATTGGGAAGAAGTGGAAG GAGATACTTG ATGTCACCGACAAGATATCTTTCACTAACCATGTAATTACTACTTCCCCACGTAAAAAGC TAATCAATCA TCCTTTTGTTAGTGCCTTTTTAAACTGTGGTCTATGTATATGCAGGATGATGCAAGAAGA AGTCGATTTA GTGTCTGA

(SEQ ID No. 58)

The RLR22 allele of BraA.eIF(iso)4E.c which TuMV cannot use in B. rapa and is necessary for resistance has the following mRNA sequence:

AUGGCGACAGAGGAUGUGAACGAAGCCCUUGCGGCGGCGGAGGUAACGGCGAUAGAAUCG ACGGAGAAGC AGCAGCCUCCUCACAAGCUCGAAAGAAAGUGGAGUUUCUGGUUCGAUAACCAAUCGAAAC CCAAGCAAGG CGCCGCCUGGGGAGCUUCCCUCCGUAAAGCAUGUACCUUCGAUACCGUCGAAGACUUCUG GGGUUUGCAC GAGACUAUCUUCGUUCCCAGCAGAUUGAUACCCAACGCUGACAUUCACAUGUUCAAAGCU GGUGUUGAGC CCAAGUGGGAAGAUCCUGAGUGUGCUAACGGCGGAAAGUGGACUUAUGUUGUUACCAACA ACAGGAAGCA AGCUUUAGACAAGGCUUGGCUUGAAACUUUGAUGGCUUUAGUUGGAGAGCAGUUUGAUGA GGCAGAUGAG AUCUGUGGUGUGGUUGCUAGUGUCCGCCCAAAGCAAGACAAGCUCUCCUUGUGGACUAGG ACUAAAUCUA AUGAAGCUGUUCUGAUGGGUAUUGGGAAGAAGUGGAAGGAGAUACUUGAUGUCACCGACA AGAUAUCUUU CACUAACCAUGAUGAUGCAAGAAGAAGUCGAUUUAGUGUCUGA (SEQ ID No. 59)

The RLR22 allele of BraA.eIF(iso)4E.c which TuMV cannot use in B. rapa and necessary for resistance codes for the following polypeptide sequence:

MATEDVNEALAAAEVTAIESTEKQQPPHKLERKWSFWFDNQSKPKQGAAWGASLRKA CTFDTVEDFWGLH ETIFVPSRLIPNADIHMFKAGVEPKWEDPECANGGKWTYVVTNNRKQALDKAWLETLMAL VGEQFDEADE ICGVVASVRPKQDKLSLWTRTKSNEAVLMGIGKKWKEILDVTDKI SFTNHDDARRSRFSV .

(SEQ ID No. 60)

Accordingly, in a tenth aspect, there is provided an isolated nucleic acid sequence encoding a plant eukaryotic translation initiation factor 4E (eIF4E), or an isoform thereof (eIF(iso)4E), wherein the eIF4E or eIF(iso)4E is non- functional for a virus, and wherein the nucleic acid sequence comprises a nucleotide sequence substantially as set out in SEQ ID No: 4, 5, 7, 9, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 50, 52, 53, 58 or

59, or a variant or fragment thereof.

In an eleventh aspect, there is provided an isolated plant eukaryotic translation initiation factor 4E (eIF4E) variant, or an isoform thereof (eIF(iso)4E), which is non- functional for a virus, wherein the eIF4E or eIF(iso)4E protein variant comprises an amino acid sequence substantially as set out in SEQ ID No: 6, 8, 10, 36, 39, 42, 45, 48, 51, 54 or

60, or a variant or fragment thereof.

It will be appreciated that the vector of the third aspect, the host cell of the fourth aspect, the method of any one of the fifth, sixth or eighth aspects, the use of the seventh aspect or the plant of the ninth aspect may comprise any of the sequences described herein which are non- functional for a virus, i.e. as defined in the tenth aspect. For example, in one embodiment, the vectors, cells, methods, uses or plants of the invention may comprise a nucleotide sequence substantially as set out in SEQ ID No: 4, 5, 7, 9, 34, 35, 37, 38, 40, 41 , 43, 44, 46, 47, 49, 50, 52, 53, 58 or 59, or a variant or fragment thereof. In another embodiment, the vectors, cells, methods, uses or plants of the invention may comprise an eIF4E or eIF(iso)4E protein variant, as defined in the eleventh aspect. Preferably, therefore, the vectors, cells, methods, uses or plants comprise an eIF4E or eIF(iso)4E protein variant which comprises an amino acid sequence substantially as set out in SEQ ID No: 6, 8, 10, 36, 39, 42, 45, 48, 51, 54 or 60, or a variant or fragment thereof.

It is preferred that the vectors, cells, methods, uses or plants of the invention comprise the use of a nucleotide sequence substantially as set out in SEQ ID No: 58 or 59, or a variant or fragment thereof, or the use of an eIF4E or eIF(iso)4E protein variant which comprises an amino acid sequence substantially as set out in SEQ ID No: 60, or a variant or fragment thereof.

It will be appreciated that the invention extends to any nucleic acid or peptide or variant, derivative or analogue thereof, which comprises substantially the amino acid or nucleic acid sequences of any of the sequences referred to herein, including (functional) variants or (functional) fragments thereof. The terms "substantially the amino acid/ polynucleotide/ polypeptide sequence", "(functional) variant" and "(functional) fragment", can be a sequence that has at least 40% sequence identity with the amino acid/ polynucleotide/ polypeptide sequences of any one of the sequences referred to herein.

Amino acid/ polynucleotide/ polypeptide sequences with a sequence identity which is greater than 65%, more preferably greater than 70%, even more preferably greater than 75%, and still more preferably greater than 80% sequence identity to any of the sequences referred to are also envisaged. Preferably, the amino

acid/ polynucleotide/ polypeptide sequence has at least 85% identity with any of the sequences referred to, more preferably at least 90% identity, even more preferably at least 92% identity, even more preferably at least 95% identity, even more preferably at least 97% identity, even more preferably at least 98% identity and, most preferably at least 99% identity with any of the sequences referred to herein. The skilled technician will appreciate how to calculate the percentage identity between two amino acid/ polynucleotide/ polypeptide sequences. In order to calculate the percentage identity between two amino acid/ polynucleotide/ polypeptide sequences, an alignment of the two sequences must first be prepared, followed by calculation of the sequence identity value. The percentage identity for two sequences may take different values depending on:- (i) the method used to align the sequences, for example, ClustalW, BLAST, FAST A, Smith- Waterman (implemented in different programs), or structural alignment from 3D comparison; and (ii) the parameters used by the alignment method, for example, local vs global alignment, the pair-score matrix used (e.g. BLOSUM62, PAM250, Gonnet etc.), and gap-penalty, e.g. functional form and constants.

Having made the alignment, there are many different ways of calculating percentage identity between the two sequences. For example, one may divide the number of identities by: (i) the length of shortest sequence; (ii) the length of alignment; (iii) the mean length of sequence; (iv) the number of non-gap positions; or (iv) the number of equivalenced positions excluding overhangs. Furthermore, it will be appreciated that percentage identity is also strongly length dependent. Therefore, the shorter a pair of sequences is, the higher the sequence identity one may expect to occur by chance.

Hence, it will be appreciated that the accurate alignment of protein or DNA sequences is a complex process. The popular multiple alignment program ClustalW (Thompson et al., 1994, Nucleic Acids Research, 22, 4673-4680; Thompson et al., 1997, Nucleic Acids Research, 24, 4876-4882) is a preferred way for generating multiple alignments of proteins or DNA in accordance with the invention. Suitable parameters for ClustalW may be as follows: For DNA alignments: Gap Open Penalty = 15.0, Gap Extension Penalty = 6.66, and Matrix = Identity. For protein alignments: Gap Open Penalty = 10.0, Gap Extension Penalty = 0.2, and Matrix = Gonnet. For DNA and Protein alignments: ENDGAP = -1, and GAPDIST = 4. Those skilled in the art will be aware that it may be necessary to vary these and other parameters for optimal sequence alignment. Preferably, calculation of percentage identities between two amino acid/ polynucleotide/ polypeptide sequences may then be calculated from such an alignment as (N/T)*100, where N is the number of positions at which the sequences share an identical residue, and T is the total number of positions compared including gaps but excluding overhangs. Hence, a most preferred method for calculating percentage identity between two sequences comprises (i) preparing a sequence alignment using the ClustalW program using a suitable set of parameters, for example, as set out above; and (ii) inserting the values of N and T into the following formula:- Sequence Identity = (N/T)*100.

Alternative methods for identifying similar sequences will be known to those skilled in the art. For example, a substantially similar nucleotide sequence will be encoded by a sequence which hybridizes to the sequences shown in SEQ ID No's: 1, 2, 4, 5, 7 or 9, or their complements under stringent conditions. By stringent conditions, we mean the nucleotide hybridises to filter-bound DNA or RNA in 3x sodium chloride / sodium citrate (SSC) at approximately 45°C followed by at least one wash in 0.2x SSC/0.1% SDS at approximately 20-65°C. Alternatively, a substantially similar polypeptide may differ by at least 1, but less than 5, 10, 20, 50 or 100 amino acids from the sequences shown in SEQ ID No: 3, 6, 8 or 10. A skilled technician will be familiar with highly sensitive techniques for identifying single nucleotide variations or mutations in a nucleic acid sequence.

Due to the degeneracy of the genetic code, it is clear that any nucleic acid sequence described herein could be varied or changed without substantially affecting the sequence of the protein encoded thereby, to provide a (functional) variant thereof.

Suitable nucleotide variants are those having a sequence altered by the substitution of different codons that encode the same amino acid within the sequence, thus producing a silent change. Other suitable variants are those having homologous nucleotide sequences but comprising all, or portions of, sequence, which are altered by the substitution of different codons that encode an amino acid with a side chain of similar biophysical properties to the amino acid it substitutes, to produce a conservative change. For example small non-polar, hydrophobic amino acids include glycine, alanine, leucine, isoleucine, valine, proline, and methionine. Large non-polar, hydrophobic amino acids include phenylalanine, tryptophan and tyrosine. The polar neutral amino acids include serine, threonine, cysteine, asparagine and glutamine. The positively charged (basic) amino acids include lysine, arginine and histidine. The negatively charged (acidic) amino acids include aspartic acid and glutamic acid. It will therefore be appreciated which amino acids may be replaced with an amino acid having similar biophysical properties, and the skilled technician will known the nucleotide sequences encoding these amino acids. All of the features described herein (including any accompanying claims, abstract and drawings), and/ or all of the steps of any method or process so disclosed, may be combined with any of the above aspects in any combination, except combinations where at least some of such features and/ or steps are mutually exclusive.

For a better understanding of the invention, and to show how embodiments of the same may be carried into effect, reference will now be made, by way of example, to the accompanying diagrammatic drawings, in which:-

Figure 1 shows an electrophoretic gel on which RT-PCT products of the Brassica rapa eIF(iso)4E mRNA and Arabidopsis thaliana eIF(iso)4E and eIF(iso)4E mRNA have been separated. R-o-18 corresponds to a B. rapa line which is susceptible to infection by TuMV, whereas B. rapa line RLR22 is resistant to infection by TuMV. The mRNA from virus-resistant line RLR22 (SEQ ID No.5) is larger than that of the susceptible line R- o-18 (SEQ ID No. 2). Col-0 eIF4E and Col-0 eIF(iso)4E are controls and correspond to wild-type Arabidopsis thaliana eIF4E and eIF(iso)4E, respectively;

Figure 2 is a schematic representation of the gene encoding Brassica rapa eIF(iso)4E.a. The exons are shown as black boxes, and are numbered, and the introns are shown as white boxes and are not numbered;

Figure 3 is a schematic representation of the mRNA of Brassica rapa eIF(iso)4E.a in line R-o-18, which can be infected by TuMV. The arrow above exon 1 corresponds to the start codon, and the arrow above exon 5 corresponds to the stop codon; Figure 4 is a schematic representation of one embodiment (the most common variant) of the mRNA of Brassica rapa eIF(iso)4E.a in line RLR22 (SEQ ID No. 5), which is resistant to viral infection. The start and stop codons are represented by arrows. The mis-splicing results in the retention of the whole of intron 1 in the mature message RNA and the introduction of a premature stop codon indicated by the red arrow on the left;

Figure 5 is a schematic representation of another embodiment of the mRNA of Brassica rapa eIF(iso)4E.a in line RLR22 (SED ID No. 7), which is resistant to viral infection. The start and stop codons are represented by arrows. The mis-splicing results in partial removal of the intron, with only the first 48 nucleotides removed;

Figure 6 is a schematic representation of another embodiment of the mRNA of Brassica rapa eIF(iso)4E.a in line RLR22 (SEQ ID No. 9), which is resistant to viral infection. The start and stop codons are represented by arrows. The mis-splicing results in the excision of the last three nucleotides of exon 1 along with the whole of intron 1 ;

Figure 7 shows a photo of a plant having a modification in intron 1 of Brassica rapa eIF(iso)4E.a in line RLR22, which is resistant to TuMV infection (left) and a Brassica rapa plant lacking this modification which is extremely susceptible to TuMV (right);

Figure 8 is a schematic representation of a breeding program to generate a plant line that is homozygous for the eIF4E or eIF(iso)4E variant of the invention; and

Figure 9 shows an electrophoretic gel showing the PCR products obtained from DNA of plants R-o-18 and RLR22 using primers BR2 and BR14.

Examples

Example 1— Analysis of eIF(iso)4E sequence conferring virus-resistance and isolation of a first locus conferring virus-resistance. retrOI

The inventor investigated in B. rapa with the aim of identifying genes and mechanisms involved in conferring plant resistance against TuMV. Materials and Methods

The B. rapa line R-o-18 is an inbred line susceptible to infection by TuMV. Line RLR22, however, has broad-spectrum resistance to all TuMV isolates tested so far. Total genomic DNA was extracted from young leaves of lines R-o-18 and RLR22 using a DNeasy plant mini kit (Qiagen) followed by amplification using the GenomiPhi system (GE Healthcare). The majority of the genomic coding region of the BraA.eIF(iso)4E.a gene was amplified with standard PCR using Taq DNA polymerase (Invitrogen) and the following primers:- PCR1 (ATGGCGACAGAGGATGTGAACG) - (SEQ ID No. 11) and

PCR2 (TCTCCTTCCACTTCTTCCCAATAC) - (SEQ ID No. 12).

The largest amplification product, corresponding to the eIF(iso)4E locus a, was purified using agarose gels and the Qiaquick gel extraction kit (Qiagen). The PCR products were then labelled with 32 P dCTP using the RediPrime™II DNA labelling system (GE

Healthcare) to form probes. These radio-labelled probes were then hybridised to filters printed with 36864 colonies of the JBr BAC library (line R-o-18) or a fosmid library of line RLR22 (Warwick Plant Genome Libraries, UK) to identify colonies containing plasmids with BraA.eIF(iso)4E.a (JBr043O19 and RLR021G13, respectively). Sections of these plasmids were sequenced, using the BigDye Terminator system and an ABI Prism 3130x1 Genetics Analyzer (Applied Biosystems), in a step-wise manner to determine the complete genomic sequence of the two alleles.

Total RNA was extracted from young leaves of both plant lines using an RNeasy kit (Qiagen). Some samples of total RNA were treated with DNasel (Roche) to remove any remnants of genomic DNA. The RT-FOR and RT-REV primers encompass the complete coding region of the BraA.eIF(iso)4E.a gene (start and stop codons are shown in bold; an adapter for cloning is in italics). RT-FOR: AAAAAGCAGGCT CGATGGCGACAGAGGATG (SEQ ID

No. 13) RT-REV (R-o-18): ^G,4^GCTGGGT TCAGACAGTGAACCTAGTTCTTC (SEQ ID No. 14)

RT-REV (RLR22) : AGAAAGCTGGGT TCAGACAGTGAACCGAGTTCTTC (SEQ ID No. 15)

Total RNA (about 10 μg RNA, with and without DNasel treatment) was converted to cDNA with standard reverse transcription reactions using Superscript II (Invitrogen) and the appropriate RT-REV primer. cDNA was amplified with standard PCR using Taq DNA polymerase (Invitrogen) or KOD polymerase (Novagen) and the appropriate RT-FOR and RT-REV primer combination. RT-PCR products were separated using agarose gel electrophoresis, and extracted from the gel using a Qiaquick gel extraction kit (Qiagen). Products were cloned into plasmid pDONR221 using BP clonase

(Invitrogen) . Plasmids were amplified using the TempliPhi reaction (GE Healthcare) and then sequenced.

Results

The inventor identified a B. rapa line, RLR22, which has resistance against a range of TuMV isolates from different parts of the world and representing different genetic groups. Figure 8 illustrates that a plant homozygous for the RLR22 allele of eIF(iso)4E.a is resistant to TuMV infection (left) whereas plants homozygous, or heterozygous for the B. rapa R-o-18 allele of eIF(iso)4E.a, are susceptible to TuMV infection (right).

Firstly, the inventor found that the viral resistance phenotype is linked to the eIF(iso)E locus. Using a cross between a susceptible line (R-o-18) and the resistant line (RLR22), the inventor confirmed that B. rapa contains three eIF(iso)4E loci.

Secondly, the sequences of genomic and mRNA versions of BraA.eIF(iso)4E.a in line R- o-18 and RLR22 were compared. Figure 1 shows an electrophoretic gel of RT-PCR products of the Brassica rapa eIF(iso)4E.a mRNA, and as controls, Arabidopsis thaliana eIF4E and eIF(iso)4E mRNA. As can be seen in the gel, the mRNA of BraA.eIF(iso)4E.a from vims-resistant line, RLR22, is larger than the mRNA of BraA.eIF(iso)4E.a obtained from the infected line, R-o- 18.

The genomic sequence of Br A.eIF(iso)4E. a in line R-o- 18 is provided as SEQ ID No.l, and the genomic sequence of RLR22 is provided as SEQ ID No.4. Following sequencing of the Br A.eIF(iso)4E.a, the inventor has found that the two alleles (in R-o- 18 and RLR22) differ by a base insertion/deletion (i.e. an "indel") at the exon/intron junction. In the virus-susceptible line R-o-18, a single mRNA sequence of the eIF(iso)4E.a allele was found and the mRNA sequence is provided as SEQ ID No.2. The mRNA was capable of producing a full-length protein, as represented by the schematic shown in Figure 3, and the protein sequence is provided as SEQ ID No.3.

However, in the virus-resistant line RLR22, several splice variants of mRNA of eIF(iso)4E.a were found. The mRNA of eIF(iso)4E.a mainly consisted of a product (denoted as SEQ ID No.5) with the first intron still present in the expressed DNA, forming a longer eIF(iso)4E.a protein, and this explains why the fragment is larger in the RLR22 shown in Figure 1. Other rarer splice variants of eIF(iso)4E.a were also identified: SEQ ID No. 7 had most of the first intron removed except for the 3' terminal 15 bases which were retained, as represented in Figure 5. SEQ ID No. 9 had all of the first intron removed, and also the 3' terminal 3 bases of exon 1 removed, as represented in Figure 6. None of these splicing variants would be capable of producing a full-length 'correct' eIF(iso)4E.a protein, i.e. producing the full length protein as if splicing had occurred correctly, as in line R-o-18, which is susceptible to viral infection. The amino acid sequences translated from SEQ ID Nos. 5, 7 and 9 are shown as SEQ ID Nos. 6, 8 and 10, respectively. The inventor believes that none of the modified proteins would be functional for TuMV.

Until now, reported recessive resistance to pot viruses based on eIF(iso)4E have arisen through base changes only in the coding region of the gene (exons), causing changed protein sequences (altered amino acid residues or premature chain termination). In the case of virus-resistant line RLR22, however, the resistance is surprisingly related to alterations in a non-coding region of the gene (introns). The inventor has shown that the "indel" in the eIF(iso)4E.a allele would disrupt correct splicing in line RLR22 and result in no (or greatly reduced) functional eIF(iso)4Ea protein for viral genome translation, and ultimately viral infection. The inventors have named the allele, BraA.eIF(iso)4E.a, from the virus-resistant line, RLR22, retrOI .

Example 2— Breeding plants resistant to TuMV infection

B.rapa plants that are resistant to TuMV may be obtained from the following breeding programme and the breeding programme is illustrated in Figure 8.

Referring to Figure 8, the recipient B.rapa plants with desired agronomic traits are crossed with a virus-resistant RLR22 plant and the progeny (Fi) are backcrossed with the recipient parent (recurrent) plant line. Molecular techniques (e.g. PCR and sequencing or other specific PCR-based methods to distinguish BraA.eIF(iso)4E.a alleles) are used to ensure that the allele of BraA.eIF(iso)4E.a derived from RLR22 (which confers virus resistance) is present in the non-recurrent plants in each cross. Finally, plants derived from the backcrossing programme that possessed the RLR22 allele of BraA.eIF(iso)4E.a are selfed (i.e. self-pollinated). Plants in the subsequent generation that are homozygous for the RLR22 allele of BraA.eIF(iso)4E.a are identified by molecular techniques as described above.

For breeding Fi hybrid plants with the viral resistance, two plant lines derived from separate backcrossing programmes must be homozygous for the eIF4E or eIF(iso)4E of the first aspect. These lines are then crossed to generate the Fi hybrid which would in turn be homozygous for the eIF4E or eIF(iso)4E of the first aspect.

Example 3— Breeding plants resistant to potyvirus infection

Plants that are resistant to a particular potyvirus may be obtained from the following breeding programme.

The recipient plant is crossed with plants defective for splicing of eIF4E and/ or eIF(iso)4E and the progeny is backcrossed with the recipient parent (recurrent) plant line. Molecular techniques (PCR and sequencing or other specific PCR-based tests) are used to ensure that the allele (s) of eIF4E and/ or eIF(iso)4E that is/ are defective in splicing and conferring resistance is/are present in the non- recurrent plants for each cross. Plants derived from the backcrossing programme possessing the eIF4E and/or eIF(iso)4E allele that is/ are defective in splicing and conferring resistance are selfed. Plants in the subsequent generation that are homozygous for the allele of eIF4E and/ or eIF(iso)4E that is / are defective in splicing are identified using the molecular techniques described above. Finally these plants are selfed to give lines / families that are homozygous for the desired eIF4E / eIF(iso)4E alleles.

For breeding Fi hybrid plants with the viral resistance, two plant lines derived from separate backcrossing programmes must be homozygous for the eIF4E or eIF(iso)4E of the first aspect. These lines are then crossed to generate the Fi hybrid, which would in turn be homozygous for the eIF4E or eIF(iso)4E of the first aspect.

Example 4— Isolation of a second locus conferring virus-resistance. ConTROI

The scenario described in Examples 1-3 is sufficient to explain virus resistance in plants that possess just one copy of eIF4E and/ or eIF(iso)4E, which a particular virus is able to utilise to complete its life-cycle. However, someplants have multiple copies/loci of one, or both of these two genes. For example, Brassica rapa has three copies of both eIF4E and eIF(iso)4E, and a virus may be able to use any of these genes to complete its life cycle. Thus, in order to confer virus resistance in plants having multiple copies/loci of eIF4E and/ or eIF(iso)4E, it is necessary that the alleles of eIF4E and/ or eIF(iso)4E at each of these other loci are non-functional for the virus.

Using the cross discussed above between the B. rapa TuMV- susceptible line R-o-18 and the TuMV-resistant line RLR22, a Bi plant that was homozygous for the RLR22 allele of BraA.eIF (iso)4E.a and the RLR22 allele of BraA.eIF4E.c, but heterozygous at the BraA.eIF (iso)4E.c locus, was identified. BiSi plants derived from this particular individual were then phenotyped and genotyped. Plants homozygous for the RLR22 allele of BraA.eIF(iso)4E.c were completely resistant to systemic spread of TuMV, heterozygotes were only slightly susceptible to virus infection, whereas plants that were homozygous for the R-o-18 allele of BraA.eIF (iso)4E.c were completely susceptible. The second locus involved in the broad- spectrum resistance to TuMV in B. rapa has now been identified as BraA.eIF(iso)4E.c on chromosome A8, and is referred to herein as ConTROI. This confirms the necessity for the second gene (CoriTR.01 , the RLR22 allele of BraA.eIF(iso)4E.c) in addition to retrOI (the RLR22 allele of BraA.eIF(iso)4E.a) for broad-spectrum resistance to TuMV in B. rapa, and demonstrates the importance of more than one gene to confer resistance in plants where viruses are able to utilise multiple copies/loci of eIF4E and/or eIF(iso)4E. Materials and methods

Total genomic DNA was extracted from young leaves of lines R-o-18 and RLR22 using a DNeasy plant mini kit (Qiagen) followed by amplification using the GenomiPhi system (GE Healthcare). The genomic coding region of the BraA.eIF4E.a,

BraA.eIF4E.b (R-o-18 only), BraA.eIF4E.c, BraA.eIF(iso)4E.b and BraA.eIF(iso)4E.c genes were amplified with standard PCR using Taq DNA polymerase (Invitrogen) and the primers listed below. Sections of these products were sequenced, using the BigDye Terminator system and an ABI Prism 3130x1 Genetics Analyzer (Applied Biosystems), in a step-wise manner to determine the complete genomic sequence of the alleles. BraA.eIF4E.b was not sequenced from RLR22, as it is non- functional. A summary of sequences obtained are listed in Table 1.

Table 1 - Sequencing of eIF4E and eIF(iso)4E alleles in Brassica rapa lines RLR22 and R- o-18

Allele Sequence obtained for plant line

RLR22 R-o-18

BraA.eIF4E.a Yes Yes

BraA.eIF4E.b No Yes

BraA.eIF4E.c Yes Yes

BraA.eIF(iso)4E.b Yes Yes

BraA.eIF(iso)4E.c Yes Yes Primers used to amplify or sequence eIF4E and eIF(iso)4E alleles from rassica rata lines R-o-18 and RLR22:

R-o-18 BraA.eIF4E.a:

FORWARD (BR57): AAAAAGCAGGCTTTTGGTCTGCAGTTATGTTATTAG (SEQ ID No. 16)

REVERSE (BR58): AGAAAGCTGGGTAAAAAGGCTTGCGAGTCA (SEQ ID No. 17)

R-o-18 BraA.eIF4E.

FORWARD (BR71): C AATGGC GGT AG AAG AC ACT (SEQ ID No. 18) REVERSE (BR50): AGTTGAGTTTTTGTTCTTAC (SEQ ID No. 19)

R-o-18 BraA.eIF4E.r.

FORWARD (BR59): AAAAAGCAGGCTTAGGACAAATGATATGGGGAGAGT (SEQ ID No. 20) REVERSE (BR60): AGAAAGCTGGGTAGCTTGGCGACCTTTTGA (SEQ ID No. 21)

R-o-18 BraA.elF(iso)4E.b:

FORWARD (BR73): TGAAAGGGGCGAAAAACACAT (SEQ ID No. 22)

REVERSE (BR74): GCAAACCGACAAAATAAGGAAGAA (SEQ ID No. 23) R-o-18 BraA.elF(iso)4E.c.

FORWARD (BR63): AAAAAGCAGGCTTTTTTAAGAATGGAGGGAGTAT (SEQ ID No. 24) REVERSE (BR64): AGAAAGCTGGGTGAAGCGCGGGTCAAAAT (SEQ ID No. 25)

RLR22 BraA.eIF4E.a:

FORWARD (BR68): AAAAAGCAGGCTTTTGGTCTGCAATTATCTTATTAG (SEQ ID No. 26)

REVERSE (BR58): AGAAAGCTGGGTAAAAAGGCTTGCGAGTCA (SEQ ID No. 27)

RLR22 BraA.eIF4E.r.

FORWARD (59): AAAAAGCAGGCTTAGGACAAATGATATGGGGAGAGT (SEQ ID No. 28) REVERSE (60): AGAAAGCTGGGTAGCTTGGCGACCTTTTGA (SEQ ID No. 29)

RLR22 BraA.elF(iso)4E.b:

FORWARD (BR73): TGAAAGGGGCGAAAAACACAT (SEQ ID No. 30)

REVERSE (BR74): GCAAACCGACAAAATAAGGAAGAA (SEQ ID No. 31) RLR22 BraA.elF(iso)4E.c.

FORWARD (BR80): AAAAAGCAGGCTCGAAGAAGTCCGCATAAAGC (SEQ ID No. 32)

REVERSE (BR81): AGAAAGCTGGGTACCCGTCCGTGGATTAAATA (SEQ ID No. 33) Example 5 - Identification of the second gene (CotiTR01) involved in the broad- spectrum resistance B. rapa line R-o-18 is an inbred line susceptible to TuMV. Line RLR22, resistant to TuMV, has broad-spectrum resistance to all TuMV isolates tested so far (Walsh et al. 2002, European Journal of Plant Pathology 108, 15-20). From the cross between the B. rapa TuMV- susceptible line R-o-18 and the TuMV-resistant line RLR22 (Rusholme et al. 2007, Journal of General Virology 88, 3177-3186). Bi plant 16 was identified as homozygous for the RLR22 alleles of BraA.elF (iso)4E.a and BraA.eIF4E.c, but heterozygous at the BraA.eIF(iso)4E.c locus. RLR22 and R-o-18 have the same allele at the BraA.eIF(iso)4E.b locus and the R-o-18 allele at BraA.eIF4E.b is a pseudogene giving a much truncated polypeptide, and so these alleles were not determined for this plant.

BiSi seed derived from Bi plant 16 was sown. Total genomic DNA was extracted from young leaves of the plants that germinated using a DNeasy plant mini kit (Qiagen), followed by amplification using the GenomiPhi system (GE Healthcare) . Plants were then genotyped at the BraA.eIF(iso)4E.c locus using the primers BR2

(TCTCCTTCCACTTCTTCCCAATAC - SEQ ID No: 61) and BR14

(TAGACAAGGCTTGGCTTGAAACTG - SEQ ID No: 62). These primers produce products of three sizes (see Figure 9) for RLR22 and three sizes for R-o-18. The largest product corresponds to BraA.eIF(iso)4E.a for both plants, the middle-sized product corresponds to BraA.elF ' (iso)4E.b and the smallest product BraA.eIF(iso)4E.c for both plants. The product for the R-o-18 allele of BraA.eIF(iso)4E.c (566 bp) is larger than the product for the RLR22 allele (546bp) and easily distinguished (see Figure 9). As can be seen in the gel, the PCR product for BrA.eIF(iso)4Ec ito the susceptible plant R-o-18, is bigger than that of the resistant plant RLR22.

The plants were mechanically inoculated with TuMV isolate CDN 1 at the 3-5 leaf stage and infection/ resistance phenotypes determined by visual assessment and ELISA as described by Rusholme et al. (2007). The genotypes, phenotypes and ELISA values for the plants are given in Table 2. Table 2 - The phenotypes of Brassica rapa plants segregating for BraA.elFiisoHE.c from Bi Si family 16. derived from a cross between the Turnip mosaic virus (TuMV)-resistant plant RLR22 and the TuMV- susceptible plant R-o-18

Genotype of plants at Phenotypes of plants Mean A405 from BraA.eIF(iso)4E.c locus (all following challenge with ELISA on homozygous for RLR22 allele of TuMV as determined by uninoculated leaves BraA.eIF(iso)4E.a and RLR22 visual assessment

allele of BraA.eIF4E.c)

Heterozygous Limited / some systemic 0.05

spread 1 of TuMV

Heterozygous Resistant, no systemic 0.02

spread of TuMV

Homozygous for R-o-18 allele Susceptible, systemic spread 0.30

of TuMV

Homozygous for RLR22 allele Resistant, no systemic 0.02

spread of TuMV

Control uninfected plants No virus inoculated 0.04

1 Systemic spread— spread of TuMV infection from inoculated to uninoculated leaves.

Plants homozygous for the RLR22 allele of BraA.eIF(iso)4E.c were completely resistant to TuMV, heterozygotes showed either no detectable infection, or were only slightly susceptible with the virus spreading from the inoculated leaves to a limited number of uninoculated leaves but then no further. Plants that were homozygous for the R-o-18 allele of BraA.eIF(iso)4E.c were completely susceptible. The results show that TuMV is able to use the R-o-18 allele of BraA.eIF(iso)4E.c and that plants need to be homozygous for this allele in order to establish a full systemic infection. Results from inoculation of plants that were heterozygous at this locus show that one copy of the R- 0-I8 allele is not sufficient for TuMV to establish a full-blown systemic infection. Differences between the phenotypes of heterozygotes (not susceptible or limited susceptibility) probably relate to the necessity for the virus to establish a high enough titre in inoculated leaves in order for systemic spread to take place.

Table 3 provides a summary of the ability of TuMV to use the six alleles described herein. BraA.eIF(iso)4E. a is retrOI , and BraA.eIF(iso)4E.c is ConTROI .

Table 3 - The ability of Turnip mosaic virus (TuMV) to use the different eIF4E and eIF(iso)4E alleles in Brassica rapa lines RLR22 and R-o-18 to complete its life cycle

Allele Ability of TuMV to use allele from B. rapa line

RLR22 R-o-18

BraA.eIF4E.a No No

BraA.eIF4E.b No No

BraA.eIF4E.c No No

BraA. eIF(iso )4E. a No Yes

BraA.eIF(iso)4E.b No No

BraA.eIF(iso)4E.c No Yes