Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
RNAI THERAPEUTIC FOR RESPIRATORY VIRUS INFECTION
Document Type and Number:
WIPO Patent Application WO/2006/110688
Kind Code:
A2
Abstract:
Disclosed herein is a double stranded siRNA molecule that inhibits production of a respiratory virus, wherein each strand of said siRNA molecule is about 15 to about 50 nucleotides, and wherein one strand of said siRNA molecule comprises a nucleic acid sequence identical to a conserved site, or a variant thereof, within the nucleic acid sequence of the respiratory virus, and uses thereof.

Inventors:
GE QING (US)
KUMAR MUKESH (US)
MCSWIGGEN JAMES ANTHONY (US)
Application Number:
PCT/US2006/013374
Publication Date:
October 19, 2006
Filing Date:
April 07, 2006
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NASTECH PHARM CO (US)
GE QING (US)
KUMAR MUKESH (US)
MCSWIGGEN JAMES ANTHONY (US)
International Classes:
C12N15/113
Foreign References:
US20040242518A12004-12-02
US10209705A2005-04-08
US5185243A1993-02-09
US5679524A1997-10-21
US5573907A1996-11-12
US5723591A1998-03-03
US5011769A1991-04-30
US5403711A1995-04-04
US5660988A1997-08-26
US4876187A1989-10-24
US6277607B12001-08-21
US6150097A2000-11-21
US6037130A2000-03-14
US5744305A1998-04-28
US5800992A1998-09-01
US6646243B22003-11-11
US4522811A1985-06-11
US44644403A2003-05-28
Other References:
SHARP, P. A., GENES DEV., vol. 15, 2001, pages 485 - 490
HUTVAGNER, G.; ZAMORE, P. D., CURR. OPIN. GENET. DEV., vol. 12, 2002, pages 225 - 232
FIRE, A. ET AL., NATURE, vol. 391, 1998, pages 806 - 811
ZAMORE, P. ET AL., CELL, vol. 101, 2000, pages 25 - 33
ELBASHIR, S. M. ET AL., GENES DEV., vol. 15, 2001, pages 188 - 200
HAMMOND, S. M. ET AL., NATURE, vol. 404, 2000, pages 293 - 296
BERNSTEIN, E. ET AL., NATURE, vol. 409, 2001, pages 363 - 366
NYKANEN, A., CELL, vol. 107, 2001, pages 309 - 321
CAPLEN, N. J. ET AL., GENE, vol. 252, 2000, pages 95 - 105
UI-TEI, K. ET AL., FEBS LETT., vol. 479, 2000, pages 79 - 82
ELBASHIR ET AL., NATURE, vol. 411, 2001, pages 494 - 498
BARIK S.: "Control ofnonsegmented negative-strand RNA virus replication by siRNA", VIRUS RES., vol. 102, no. 1, 1 June 2004 (2004-06-01), pages 27 - 35
BITKO ET AL.: "Phenotypic silencing of cytoplasmic genes using sequence-specific double-stranded short interfering RNA and its application in the reverse genetics of wild type negative-strand RNA viruses", BMC MICROBIOL. 2001, vol. 1, no. 1, 20 December 2001 (2001-12-20), pages 34
HATTA ET AL.: "Inhibition of influenza virus RNA polymerase and nucleoprotein genes expression by unmodified, phosphorothioated, and liposomally encapsulated oligonucleotides", BIOCHEM BIOPHYS RES COMMUN., vol. 223, no. 2, 14 June 1996 (1996-06-14), pages 341 - 6
MIZUTA ET AL.: "Antisense oligonucleotides directed against the viral RNA polymerase gene enhance survival of mice infected with influenza A", NAT BIOTECHNOL., vol. 17, no. 6, June 1999 (1999-06-01), pages 583 - 7
GITLIN ET AL.: "Short interfering RNA confers intracellular antiviral immunity in human cells", NATURE, vol. 418, no. 6896, 25 July 2002 (2002-07-25), pages 430 - 4
FOWLER ET AL.: "Inhibition of Marburg virus protein expression and viral release by RNA interference", J GEN VIROL., vol. 86, no. 4, April 2005 (2005-04-01), pages 1181 - 8
YUAN ET AL.: "Inhibition of coxsackievirus B3 replication by small interfering RNAs requires perfect sequence match in the central region of the viral positive strand", J VIROL., vol. 79, no. 4, February 2005 (2005-02-01), pages 2151 - 9
HAYDEN FG.: "Rhinovirus and the lower respiratory tract.", REV MED VIROL., vol. 14, no. 1, 2004, pages 17 - 31
TAN WC.: "Viruses in asthma exacerbations", CURR OPIN PULM MED., vol. 11, 2005, pages 21 - 6
WEINBERGER M.: "Respiratory infections and asthma: current treatment strategies", DRUG DISCOV TODAY., vol. 9, 2004, pages 831 - 7
PAPI A ET AL.: "Effect of desloratadine and loratadine on rhinovirus-induced intercellular adhesion molecule 1 upregulation and promoter activation in respiratory epithelial cells", J ALLERGY CLIN IMMUNOL., vol. 108, 2001, pages 221 - 8
BIANCO A ET AL.: "Tb2 cytokines exert a dominant influence on epithelial cell expression of the major group human rhinovirus receptor, ICAM-1", EUR RESPIRJ., vol. 12, 1998, pages 619 - 26
GUBLER, D. J.: "Dengue and dengue hemorrhagic fever", CLIN. MICROBIAL. REV., vol. 11, 1998, pages 480 - 496
GIBBONS, R. V.; D. W. VAUGHN.: "Dengue: an escalating problem", BMJ, vol. 324, 2002, pages 1563 - 1566
"Wodd Health Organization", 1997, article "Dengue haemorrhagic fever: diagnosis, treatment, prevention and control"
GUBLER, D. J.: "Dengue and dengue hemorrhagic fever: its history and resurgence as a global public health problem", DENGUE AND DENGUE HEMORRHAGIC FEVER, 1997, pages 1 - 22
DEUBEL, V.; R. M. KINNEY; D. W. TRENT.: "Nucleotide sequence and deduced amino acid sequence of the nonstructural proteins of dengue type 2 virus, Jamaica genotype: comparative analysis of the full-length genome", VIROLOGY, vol. 165, 1988, pages 234 - 244
HALSTEAD, S. B.: "Pathogenesis of dengue: challenge to molecular biology", SCIENCE, vol. 239, 1988, pages 476 - 481
GUBLER, D. J.: "Dengue and dengue hemorrhagic fever", CLIN. MICROBIOL. REV., vol. 11, 1998, pages 480 - 496
KAPIKIAN, A.Z.: "Fields virology", 2001, LIPPINCOTT/THE WILLIAMS & WILKINS CO., pages: 1787 - 1833
PRASAD, B.V., J. MOL BIOL., vol. 199, 1988, pages 269 - 275
LAWTON, J.A., J. VIROL., vol. 71, 1997, pages 7353
LAWTON, J.A., ADV. VIRUS RES., vol. 55, 2000, pages 185 - 229
CHARPILIENNE, A., J. VIROL., vol. 76, 2002, pages 7822 - 7831
PESAVENTO, J.B., PROC. NATL. ACAD. SCI. U.S.A., vol. 98, 2001, pages 1381 - 1386
HOLEN, T. ET AL., NUCLEIC ACIDS RES., vol. 30, no. 8, pages 1757 - 1766
SEMIZAROV, D. ET AL., PROC. NATL. ACAD. SCI., vol. 100, no. 11, pages 6347 - 6352
KARLIN; ALTSCHUL, PROC. NATL. ACAD. SCI. USA, vol. 87, 1990, pages 2264 - 68
KARLIN; ALTSCHUL, PROC. NATL. ACAD. SCI. USA, vol. 90, 1993, pages 5873 - 77
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 - 10
ALTSCHUL ET AL., NUCLEIC ACIDS RES., vol. 25, no. 17, 1997, pages 3389 - 3402
MYERS; MILLER, CABIOS, 1989
SAMBROOK, J. ET AL.: "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS
"Current Protocols in Molecular Biology", 1995, JOHN WILEY & SONS, INC.
ALBERTS ET AL.: "The Cell", 2002, pages: 452
VERMA; ECKSTEIN, ANNUL REV. BIOCHEM., vol. 67, 1998, pages 99 - 134
MILLIGAN; UHLENBECK, METHODS ENZYMOL., vol. 180, 1989, pages 51 - 62
EIS, P. S. ET AL., NAT. BIOTECHNOL., vol. 19, 2001, pages 673
MACKEN, C. ET AL.: "Options for the Control of Influenza IV. A.D.M.E. Osterhaus", 2001, ELSEVIER SCIENCE, article "The value of a database in surveillance and vaccine selection.", pages: 103 - 106
KNIPE DM; HOWLEY, PM, FUNDAMENTAL VIROLOGY, pages 34 - 35
"Fundamental virology", pages: 32
RUBINSON, D. ET AL., NATURE GENETICS, vol. 33, 2003, pages 401 - 406
MCCAFFREY, AP ET AL., NATURE, vol. 418, pages 38 - 39
DEFFRASNES C ET AL., J CLIN. MICROBIOL., vol. 43, 2005, pages 488 - 90
Attorney, Agent or Firm:
MCSWIGGEN, James, Anthony (#1022Bothell, WA, US)
Download PDF:
Claims:
What is claimed is:

1. An RNAi-inducing entity targeted to a transcript of a respiratory virus, wherein said RNAi-inducing entity is between about 15 and about 60 nucleotides in length and comprises: a. a first nucleic acid sequence that is at least about 84% identical to a portion of a nucleic acid encoding a viral protein; and b. a second nucleic acid sequence that is at least 84% complementary to the first nucleic acid portion.

2. The RNAi-inducing entity of claim 1 , wherein said RNAi-inducing entity is between about 15 and about 40 nucleotides in length and comprises: a. a first nucleic acid sequence that is at least about 89% identical to a portion of a nucleic acid encoding a viral protein; and b. a second nucleic acid sequence that is at least 89% complementary to the first nucleic acid portion.

3. The RNAi-inducing entity of claim 1, wherein said RNAi-inducing entity is between about 15 and about 40 nucleotides in length and comprises: a. a first nucleic acid sequence that is at least about 94% identical to a portion of a nucleic acid encoding a viral protein; and b. a second nucleic acid sequence that is at least 94% complementary to the first nucleic acid portion.

4. The RNAi-inducing entity of claim 1, wherein said RNAi-inducing entity is an siRNA or an shRNA.

5. The RNAi-inducing entity of claim 1 , wherein said nucleic acid comprises a 3 ' overhang.

6. The RNAi-inducing entity of claim 5, wherein said 3 ' overhang comprises deoxythymidine .

7. The RNAi-inducing entity of claim 1 , wherein said viral protein is obtained from a virus selected from the group consisting of human respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1 , human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhino virus and influenza virus.

8. The RNAi-inducing entity of claim 1, wherein said viral protein is a respiratory virus protein.

9. The RNAi-inducing entity of claim 1 , wherein said viral protein is an influenza virus protein.

10. The RNAi-inducing entity of claim 1, wherein said RNAi-inducing entity is an shRNA and further comprises a third nucleic acid sequence that forms a hairpin loop structure.

11. The RNAi-inducing entity of claim 10, wherein said hairpin loop structure comprises between 4 and 11 nucleotides.

12. An isolated nucleic acid sequence between about 16 and about 35 nucleotides in length that is at least about 85% identical along its length to a portion of a viral nucleic acid encoding a viral protein, or its complement.

13. The nucleic acid of claim 12, wherein said viral protein is obtained from a virus selected from the group consisting of human respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhino virus and influenza virus.

14. The nucleic acid of claim 12, wherein said viral protein is a respiratory virus protein.

15. The nucleic acid of claim 12, wherein said viral protein is an influenza virus protein.

16. An RNAi-inducing entity targeted to an influenza virus protein transcript, wherein said RNAi-inducing entity is between about 15 and about 60 nucleotides in length and comprises: a. a first nucleic acid sequence comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOS. 1-10709, its complement, a fragment of either having a length of at least 16 nucleotides or a nucleotide sequence at least 80% homologous to said nucleic acid sequence; and b. a second nucleic acid sequence that is at least 80% complementary to the first nucleic acid sequence.

17. The RNAi-inducing entity of claim 16, wherein said second nucleic acid sequence is at least about 90% complementary to said first nucleic acid portion.

18. The RNAi-inducing entity of claim 16, wherein said RNAi-inducing entity is an siRNA or an shRNA.

19. The RNAi-inducing entity of claim 16, wherein said RNAi-inducing entity is an shRNA and further comprises a third nucleic acid sequence that forms a hairpin loop structure.

20. The RNAi-inducing entity of claim 19, wherein said hairpin loop structure comprises between 4 and 11 nucleotides.

21. An siRNA targeted to a conserved site of a viral protein transcript, wherein said RNAi-inducing entity is between about 15 and about 60 nucleotides in length.

22. The siRNA of claim 21, wherein said conserved site is about 300 nucleotides in length and comprises the 3' end of said viral protein gene.

23. The siRNA of claim 21 , wherein said viral protein is obtained from a virus selected from the group consisting of human respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhino virus and influenza virus.

24. The siRNA of claim 21 , wherein said viral protein is a respiratory virus protein.

25. The siRNA of claim 21 , wherein said viral protein is an influenza virus protein.

26. An siRNA that reduces the expression of a first gene encoding a first viral protein and a second gene encoding a second viral protein.

27. The siRNA of claim 26, wherein said first and second viral genes are from different strains of the same virus.

28. The siRNA of claim 26, wherein said first and second viral genes are from two different viruses.

29. The siRNA of claim 28, wherein one virus is an influenza virus.

30. An siRNA that reduces the expression of two or more genes encoding viral proteins by at least about 25%.

31. An RNAi-inducing agent targeted to a transcript whose sequence comprises a target portion of a nucleic acid encoding a viral protein.

32. The RNAi-inducing agent of claim 31 , wherein the RNAi-inducing agent is an siRNA or shRNA.

33. The RNAi-inducing agent of claim 31 , wherein said viral protein is obtained from a virus selected from the group consisting of human respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhino virus and influenza virus.

34. The RNAi-inducing agent of claim 31 , wherein said viral protein is a respiratory virus protein.

35. The RNAi-inducing agent of claim 31, wherein said viral protein is an influenza protein.

36. A composition comprising a first siRNA and a second siRNA, each of which reduces the expression of a gene encoding a viral protein.

37. The composition of claim 36, wherein said first siRNA reduces the expression of a first gene encoding a first viral protein and said second siRNA reduces the expression of a second gene encoding a second viral protein.

38. The composition of claim 37, wherein said first and second siRNAs reduce the expression of at said first and second genes by at least about 25%.

39. The composition of claim 37, wherein said first and second genes are from two strains of the same viral species.

40. The composition of claim 37, wherein said first and second genes are from two species of the same viral genus.

41. The composition of claim 37, wherein said first and second genes are from two viruses of the same viral family.

42. The composition of claim 36, further comprising a third siRNA that reduces the expression of a third gene encoding a third viral protein by at least about 25%.

43. The composition of claim 42, further comprising a cationic polymer.

44. A diagnostic kit comprising a primer or probe that detects a viral protein gene over at least part of said gene, wherein said part is a conserved site.

45. A method of reducing expression of a target viral gene in a virally-infected mammalian cell, comprising the step of contacting said cell with an siRNA that reduces expression of a viral protein gene, such that said siRNA enters the cytoplasm of said mammalian cell and reduces the expression of said viral protein gene by at least about 25%.

46. The method of claim 45, wherein said viral protein is obtained from a virus selected from the group consisting of human respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhinovirus and influenza virus.

47. The method of claim 45, wherein said viral protein gene is a respiratory virus protein gene.

48. The method of claim 45, wherein said viral protein gene is an influenza virus protein gene.

49. A method of delivering an siRNA to a subject in need thereof, comprising administering to said subject a composition comprising an effective amount of said siRNA, wherein said siRNA reduces the expression of a viral protein gene by at least about 25%.

50. The method of claim 49, wherein said siRNA is delivered to said subject at a concentration of between about 0.1 mg/kg of subject's body weight and about 20 mg/kg of subject's body weight.

51. The method of claim 49, wherein said siRNA is delivered to said subject at a concentration of between about 0.1 mg/kg of subject's body weight and about 10 mg/kg of subject's body weight.

52. The method of claim 49, wherein said siRNA is delivered to said subject at a concentration of between about 0.1 mg/kg of subject's body weight and about 50 mg/kg of subject's body weight.

53. The method of claim 49, wherein said siRNA reduces the expression of a viral protein gene by at least about 25%.

54. The method of claim 49, wherein said siRNA comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOS 1-10709.

55. The method of claim 49, wherein said composition further comprises a cationic polymer.

56. The method of claim 49, wherein said composition is administered by inhalation, intranasally, orally, or intravenously.

57. A method of preventing or treating a viral infection in a subject comprising administering a therapeutic compound comprising an siRNA or shRNA targeted to a protein gene of the virus to the subject.

58. The method of claim 57, wherein said siRNA or said shRNA is between about 15 and about 60 nucleotides in length and comprises: a. a first nucleic acid sequence that is at least about 85% identical to a portion of a nucleic acid encoding a viral protein; and b. a second nucleic acid sequence that is at least 85% complementary to the first nucleic acid portion.

59. The method of claim 57, wherein the siRNA is double stranded siRNA molecule, wherein each strand of said siRNA molecule is about 15 to about 50 nucleotides, and

wherein one strand of said siRNA molecule comprises a nucleic acid sequence identical to a conserved site, or a variant thereof, within the nucleic acid sequence of the respiratory virus.

60. The method of claim 57, wherein the respiratory virus is selected from the group consisting of respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhinovirus and influenza virus.

61. The method of claim 57, wherein the nucleic acid sequence encodes a nucleoprotein gene.

62. The method of claim 57, wherein the respiratory virus is influenza virus, and wherein the siRNA is not selected from the group consisting of SEQ ID NO. 10710- 10751.

63. The method of claim 57, wherein said therapeutic compound comprises a cationic polymer.

64. A use of an siRNA molecule in the manufacture of a medicament for inhibiting production of a virus in the respiratory system of a mammalian subject, comprising administering a therapeutic compound comprising an siRNA or shRNA targeted to a protein gene of the virus to the subject.

65. The use of an siRNA molecule of claim 64, wherein said siRNA or said shRNA is between about 15 and about 60 nucleotides in length and comprises: a. a first nucleic acid sequence that is at least about 85% identical to a portion of a nucleic acid encoding a viral protein; and b. a second nucleic acid sequence that is at least 85% complementary to the first nucleic acid portion.

66. The use of an siRNA molecule of claim 64, wherein the siRNA is double stranded siRNA molecule, wherein each strand of said siRNA molecule is about 15 to about 50 nucleotides, and wherein one strand of said siRNA molecule comprises a nucleic acid sequence identical to a conserved site, or a variant thereof, within the nucleic acid sequence of the respiratory virus.

67. The use of an siRNA molecule of claim 64, wherein the respiratory virus is selected from the group consisting of respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1, human parainfluenza virus 2, human

parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhino virus and influenza virus.

68. The use of an siRNA molecule of claim 64, wherein the nucleic acid sequence encodes a protein (NP) gene.

69. The use of an siRNA molecule of claim 64, wherein the respiratory virus is influenza virus, and wherein the siRNA is not selected from the group consisting of SEQ ID NO. 10710-10751.

70. The use of an siRNA molecule of claim 64, wherein the respiratory virus is influenza virus, and wherein the siRNA is selected from the group consisting of SEQ ID NO. 1- 10709.

71. A method for identifying an siRNA or shRNA target sequence that is present in a viral protein transcript of two or more viruses, comprising the steps of: a. providing a nucleic acid sequence encoding a viral protein from a virus; b. identifying a target portion of said nucleic acid sequence, wherein said portion comprises about 19 nucleotides and does not comprise more than three contiguous guanine nucleotides or more than three contiguous cytosine nucleotides; and c. repeating steps (a) and (b) one or more times, using different viruses with each repetition, thereby identifying an siRNA or shRNA target sequence on said viral protein transcript.

72. The method of claim 71, wherein said nucleic acid sequence comprises a conserved site of a protein sequence.

73. The method of claim 71 , further comprising generating an siRNA or shRNA that binds to said target sequence.

74. A method of diagnosing a viral infection, comprising the step of determining whether the subject is infected with a virus that is susceptible to inhibition by the RNAi- inducing entity of claim 1.

75. The method of claim 71 , further comprising the step of administering the RNAi- inducing entity to the subject.

76. A method of treating or preventing an influenza virus infection comprising administering the RNAi-inducing entity of claim 1 to a subject in need thereof.

77. A double stranded siRNA molecule that inhibits production of a respiratory virus, wherein each strand of said siRNA molecule is about 15 to about 50 nucleotides, and wherein one strand of said siRNA molecule comprises a nucleic acid sequence identical to a conserved site, or a variant thereof, within the nucleic acid sequence of the respiratory virus.

78. The double stranded siRNA molecule of claim 77, wherein the respiratory virus is selected from the group consisting of respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhinovirus and influenza virus.

79. The double stranded siRNA molecule of claim 78, wherein the nucleic acid sequence encodes a protein (NP) gene.

80. The double stranded siRNA molecule of claim 78, wherein the respiratory virus is influenza virus, and wherein the siRNA is not selected from the group consisting of SEQ ID NO. 10710-10751.

81. The double stranded siRNA molecule of claim 78, wherein the respiratory vims is influenza virus, and wherein the siRNA is selected from the group consisting of SEQ ID NO. 1-10709.

Description:

RNAI THERAPEUTIC FOR RESPIRATORY VIRUS INFECTION

FIELD OF THE INVENTION

This invention relates generally to the field of treatment and prophylaxis of viral infections. The invention further relates to the field of using RNA interference (RNAi), particularly anti-viral siRNAs and shRNAs.

BACKGROUND OF THE INVENTION

RNA interference (RNAi) is a ubiquitous mechanism of gene regulation in plants and animals in which target niRNAs are degraded in a sequence-specific manner (Sharp, P. A., Genes Dev. 15, 485-490 (2001); Hutvagner, G. & Zamore, P. D., Curr. Opin. Genet. Dev. 12, 225-232 (2002); Fire, A., et al., Nature 391, 806-811 (1998); Zamore, P., et al., Cell 101, 25-33 (2000)). The target mRNA can be a host mRNA or an mRNA of a pathogen of the host, such as a viral pathogen.

Pathogenic viral infections are some of the most widely spread infections worldwide. A family of such viruses is the influenza family. An estimated 20 to 40 million people died during the 1918 influenza A virus pandemic. In the United States between 20 to 40 thousand people die from influenza A virus infection or its complications each year. During epidemics the number of influenza related hospitalizations may reach over 300,000 in a single winter season. There is no superior therapy for influenza virus infection, and existing vaccines are limited in value in part because of the properties of antigenic shift and drift. Treatment or prevention of a number of other viruses that are pathogenic to humans and other animals face similar difficulties.

The natural RNA degradation process is initiated by the dsRNA-specific endonuclease Dicer, which promotes processive cleavage of long dsRNA precursors into double-stranded fragments between 21 and 25 nucleotides long, termed small interfering RNA (siRNA) (Zamore, P., et al., Cell 101, 25-33 (2000); Elbashir, S. M., et al.. Genes Dev. 15, 188-200 (2001); Hammond, S. M., et al., Nature 404, 293-296 (2000); Bernstein, E., et al., Nature 409, 363-366 (2001)). siRNAs are incorporated into a large

protein complex that recognizes and cleaves target mRNAs (Nykanen, A., et al., Cell 107, 309-321 (2001). It has been reported by one group that introduction of dsRNA into mammalian cells does not result in efficient Dicer-mediated generation of siRNA and therefore does not induce RNAi (Caplen, N. J., et al., Gene 252, 95-105 (2000); Ui-Tei, K., et al., FEBS Lett. 479, 79-82 (2000)). The requirement for Dicer in maturation of siRNAs in cells can be bypassed by introducing synthetic polynucleotide siRNA duplexes (such as 21 nucleotide sequences), which inhibit expression of transfected and endogenous genes in a variety of mammalian cells (Elbashir, et al., Nature 411 : 494-498 (2001)).

SUMMARY OF THE INVENTION

These and other objects, advantages, and features of the invention will become apparent to those persons skilled in the art upon reading the details of the invention as more fully described below. Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limits of that range is also specifically disclosed. Each smaller range between any stated value or intervening value in a stated range and any other stated or intervening value in that stated range is encompassed within the invention. The upper and lower limits of these smaller ranges may independently be included or excluded in the range, and each range where either, neither or both limits are included in the smaller ranges is also encompassed within the invention, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the invention. One aspect of the invention is an RNAi-inducing entity targeted to a transcript of a respiratory virus, wherein said RNAi-inducing entity is between about 15 and about 60 nucleotides in length and comprises: a first nucleic acid sequence that is at least about 84% identical to a portion of a nucleic acid encoding a viral protein; and a second nucleic acid sequence that is at least 84% complementary to the first nucleic acid portion.

In an embodiment of the invention, the RNAi-inducing entity is between about 15 and about 40 nucleotides in length and comprises: a first nucleic acid sequence that is at least about 89% identical to a portion of a nucleic acid encoding a viral protein; and a second nucleic acid sequence that is at least 89% complementary to the first nucleic acid portion.

In another embodiment of the invention, the RNAi-inducing entity is between about 15 and about 40 nucleotides in length and comprises: a first nucleic acid sequence that is at least about 94% identical to a portion of a nucleic acid encoding a viral protein; and a second nucleic acid sequence that is at least 94% complementary to the first nucleic acid portion. In another embodiment, the RNAi-inducing entity is an siRNA or an shRNA. In another embodiment of the invention, the nucleic acid comprises a 3' overhang. In a related embodiment, the 3' overhang comprises deoxythymidine. In another embodiment of the invention, the viral protein is obtained from a virus selected from the group consisting of human respiratory syncytial virus, human metapneumo virus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhinovirus and influenza virus. In another embodiment of the invention, the viral protein is a respiratory virus protein. In another embodiment of the invention, the viral protein is an influenza virus protein. In another embodiment of the invention, the RNAi-inducing entity is an shRNA and further comprises a third nucleic acid sequence that forms a hairpin loop structure. In a related embodiment, the hairpin loop structure comprises between 4 and 11 nucleotides.

Another aspect of the invention is an isolated nucleic acid sequence between about 16 and about 35 nucleotides in length that is at least about 85% identical along its length to a portion of a viral nucleic acid encoding a viral protein, or its complement. In an embodiment of the invention, the viral protein is obtained from a virus selected from the group consisting of human respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhinovirus and influenza virus. In another embodiment of the invention, the viral protein is a respiratory virus protein. In another embodiment of the invention, the viral protein is an influenza virus protein.

Another aspect of the invention is an RNAi-inducing entity targeted to an influenza virus protein transcript, wherein said RNAi-inducing entity is between about 15 and about 60 nucleotides in length and comprises: a first nucleic acid sequence comprising a nucleic acid sequence selected from the group consisting of SEQ ID NOS. 1 - 10709, its complement, a fragment of either having a length of at least 16 nucleotides or a nucleotide sequence at least 80% homologous to said nucleic acid sequence; and a second nucleic acid sequence that is at least 80% complementary to the first nucleic acid sequence. In an embodiment of the invention, the second nucleic acid sequence is at least about 90% complementary to said first nucleic acid portion. In another embodiment of

the invention, the RNAi-inducing entity is an siRNA or an shRNA. In another embodiment of the invention, the RNAi-inducing entity is an shRNA and further comprises a third nucleic acid sequence that forms a hairpin loop structure. In a related embodiment, the hairpin loop structure comprises between 4 and 11 nucleotides. Another aspect of the invention is an siRNA targeted to a conserved site of a viral protein transcript, wherein said RNAi-inducing entity is between about 15 and about 60 nucleotides in length. In an embodiment of the invention, the conserved site is about 300 nucleotides in length and comprises the 3' end of said viral protein gene. In another embodiment of the invention, the viral protein is obtained from a virus selected from the group consisting of human respiratory syncytial virus, human metapneumo virus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhinovirus and influenza virus. In another embodiment of the invention, the viral protein is a respiratory virus protein. In another embodiment of the invention, the viral protein is an influenza virus protein. Another aspect of the invention is an siRNA that reduces the expression of a first gene encoding a first viral protein and a second gene encoding a second viral protein. In an embodiment of the invention, the first and second viral genes are from different strains of the same virus. In another embodiment of the invention, the first and second viral genes are from two different viruses. In a related embodiment, one virus is an influenza virus.

Another aspect of the invention is an siRNA that reduces the expression of two or more genes encoding viral proteins by at least about 25%.

Another aspect of the invention is an RNAi-inducing agent targeted to a transcript whose sequence comprises a target portion of a nucleic acid encoding a viral protein. In an embodiment of the invention, the RNAi-inducing agent is an siRNA or shRNA. In another embodiment of the invention, the viral protein is obtained from a virus selected from the group consisting of human respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhinovirus and influenza virus. In another embodiment of the invention, the viral protein is a respiratory virus protein. In another embodiment of the invention, the viral protein is an influenza protein.

Another aspect of the invention is a composition comprising a first siRNA and a second siRNA, each of which reduces the expression of a gene encoding a viral protein. In an embodiment of the invention, the first siRNA reduces the expression of a first gene

encoding a first viral protein and said second siRNA reduces the expression of a second gene encoding a second viral protein. In a related embodiment of the invention, the first and second siRNAs reduce the expression of at said first and second genes by at least about 25%. In another related embodiment of the invention, first and second genes are from two strains of the same viral species. In another related embodiment of the invention, the first and second genes are from two species of the same viral genus. In another related embodiment of the invention, the first and second genes are from two viruses of the same viral family. In another embodiment of the invention, the composition further comprising a third siRNA that reduces the expression of a third gene encoding a third viral protein by at least about 25%. In a related embodiment, the composition further comprising a cationic polymer.

Another aspect of the invention is a diagnostic kit comprising a primer or probe that detects a viral protein gene over at least part of said gene, wherein said part is a conserved site. Another aspect of the invention is a method of reducing expression of a target viral gene in a virally-infected mammalian cell, comprising the step of contacting said cell with an siRNA that reduces expression of a viral protein gene, such that said siRNA enters the cytoplasm of said mammalian cell and reduces the expression of said viral protein gene by at least about 25%. In an embodiment of the invention, the viral protein is obtained from a virus selected from the group consisting of human respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhinovirus and influenza virus. In another embodiment of the invention, the viral protein gene is a respiratory virus protein gene. In another embodiment of the invention, the viral protein gene is an influenza virus protein gene.

Another aspect of the invention is a method of delivering an siRNA to a subject in need thereof, comprising administering to said subject a composition comprising an effective amount of said siRNA, wherein said siRNA reduces the expression of a viral protein gene by at least about 25%. In an embodiment of the invention, siRNA is delivered to said subject at a concentration of between about 0.1 mg/kg of subject's body weight and about 20 mg/kg of subject's body weight. In another embodiment of the invention, the siRNA is delivered to said subject at a concentration of between about 0.1 mg/kg of subject's body weight and about 10 mg/kg of subject's body weight. In another embodiment of the invention, the siRNA is delivered to said subject at a concentration of

between about 0.1 mg/kg of subject's body weight and about 50 mg/kg of subject's body weight. In another embodiment of the invention, the siRNA reduces the expression of a viral protein gene by at least about 25%. In another embodiment of the invention, the siRNA comprises a nucleic acid sequence listed in Tables 1-9. In another embodiment of the invention, the composition further comprises a cationic polymer. In another embodiment of the invention, the composition is administered by inhalation, intranasally, orally, or intravenously.

Another aspect of the invention is a method of preventing or treating a viral infection in a subject comprising administering a therapeutic compound comprising an siRNA or shRNA targeted to a protein gene of the virus to the subject. In an embodiment of the invention, the siRNA or said shRNA is between about 15 and about 60 nucleotides in length and comprises: a first nucleic acid sequence that is at least about 85% identical to a portion of a nucleic acid encoding a viral protein; and a second nucleic acid sequence that is at least 85% complementary to the first nucleic acid portion. In another embodiment of the invention, the siRNA is double stranded siRNA molecule, wherein each strand of said siRNA molecule is about 15 to about 50 nucleotides, and wherein one strand of said siRNA molecule comprises a nucleic acid sequence identical to a conserved site, or a variant thereof, within the nucleic acid sequence of the respiratory virus. In another embodiment of the invention, the respiratory virus is selected from the group consisting of respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhinovirus and influenza virus. In another embodiment of the invention, the nucleic acid sequence encodes a nucleoprotein gene. In another embodiment of the invention, the respiratory virus is influenza virus, and wherein the siRNA is not selected from the group consisting of SEQ ID NO. 10710- 10751. In another embodiment of the invention, the therapeutic compound comprises a cationic polymer.

Another aspect of the invention is a use of an siRNA molecule in the manufacture of a medicament for inhibiting production of a virus in the respiratory system of a mammalian subject, comprising administering a therapeutic compound comprising an siRNA or shRNA targeted to a protein gene of the virus to the subject. In an embodiment of the invention, the siRNA or said shRNA is between about 15 and about 60 nucleotides in length and comprises: a first nucleic acid sequence that is at least about 85% identical to a portion of a nucleic acid encoding a viral protein; and a second nucleic acid sequence

that is at least 85% complementary to the first nucleic acid portion. In another embodiment of the invention, the siRNA is double stranded siRNA molecule, wherein each strand of said siRNA molecule is about 15 to about 50 nucleotides, and wherein one strand of said siRNA molecule comprises a nucleic acid sequence identical to a conserved site, or a variant thereof, within the nucleic acid sequence of the respiratory virus. In another embodiment of the invention, the respiratory virus is selected from the group consisting of respiratory syncytial virus, human metapneumo virus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhinovirus and influenza virus. In another embodiment of the invention, the nucleic acid sequence encodes a protein (NP) gene. In another embodiment of the invention, the respiratory virus is influenza virus, and wherein the siRNA is not selected from the group consisting of SEQ ID NO. 10710-10751. In another embodiment of the invention, the respiratory virus is influenza virus, and wherein the siRNA is selected from the group consisting of SEQ ID NO. 1- 10709. Another aspect of the invention is a method for identifying an siRNA or shRNA target sequence that is present in a viral protein transcript of two or more viruses, comprising the steps of: a) providing a nucleic acid sequence encoding a viral protein from a virus; b) identifying a target portion of said nucleic acid sequence, wherein said portion comprises about 19 nucleotides and does not comprise more than three contiguous guanine nucleotides or more than three contiguous cytosine nucleotides; and c) repeating steps (a) and (b) one or more times, using different viruses with each repetition, thereby identifying an siRNA or shRNA target sequence on said viral protein transcript. In an embodiment of the invention, the nucleic acid sequence comprises a conserved site of a protein sequence. In another embodiment of the invention, the method further comprising generating an siRNA or shRNA that binds to said target sequence.

Another aspect of the invention is a method of diagnosing a viral infection, comprising the step of determining whether the subject is infected with a virus that is susceptible to inhibition by the RNAi-inducing entity. In an embodiment of the invention, the method further comprising the step of administering the RNAi-inducing entity to the subject.

Another aspect of the invention is a method of treating or preventing an influenza virus infection comprising administering the RNAi-inducing entity to a subject in need thereof.

Another aspect of the invention is a double stranded siRNA molecule that inhibits production of a respiratory virus, wherein each strand of said siRNA molecule is about 15 to about 50 nucleotides, and wherein one strand of said siRNA molecule comprises a nucleic acid sequence identical to a conserved site, or a variant thereof, within the nucleic acid sequence of the respiratory virus. In an embodiment of the invention, the respiratory virus is selected from the group consisting of respiratory syncytial virus, human metapneumovirus, human parainfluenza virus 1, human parainfluenza virus 2, human parainfluenza virus 3, human parainfluenza virus 4a, human parainfluenza virus 4b, rhino virus and influenza virus. In a related embodiment, the nucleic acid sequence encodes a protein (NP) gene. In another embodiment of the invention, the respiratory virus is influenza virus, and wherein the siRNA is not selected from the group consisting of SEQ ID NO. 10710- 10751. In another embodiment of the invention, the respiratory virus is influenza virus, and wherein the siRNA is selected from the group consisting of SEQ ID NO. 1-10709.

BRIEF DESCRIPTION OF THE FIGURES

Figure 1. A-K series of nucleic acid sequence alignments of target viruses of the invention are presented.

Figure 2. Influenza virus production is inhibited in mice by administration of influenza- specific siRNA. Mice were intravenously injected with increasing amounts of NP-1496 siRNA or GFP siRNA complexed with jetPEI. Three hours later, mice were infected with the PR8 variant of influenza A. Viral titers were measured 24 hours post-infection using lung homogenates in the MDCK-HA assay. Each data point represents one mouse. P values between groups indicate statistical significance.

Figure 3. Therapeutic administration of siRNA inhibits influenza virus production in mice. Mice were injected with NP-siRNA or PA-siRNA complexed with jetPEI 5 hours following influenza infection. Virus titers were measured in lung homogenates 28 hours post-infection by MDCK-HA assay. Each data point represents one mouse. P values between groups indicate statistical significance.

Figure 4. Influenza-specific siRNA treatment provides broad cross-protection against lethal challenge with highly pathogenic H5 and H7 avian influenza A viruses. BALB/c mice (8 per group) were given 50 μg siKNA intravenously one day before virus challenge and another 20 μg of siRNA intranasally on the day of virus challenge. Body weights and survival of mice were monitored for 16 days after 10 LD50 dose of intranasal virus challenge. Filled circles, GFP-specific siRNA; open circles, NP plus PA-specific siRNAs. P values are indicated.

Figure 5. BALB/c mice were treated intranasally with indicated amounts of NP specific siRNA in PBS or PBS control. Two hours later, all mice were infected intranasally (lOOOpfu/mouse) with the PR8 serotype. The lungs were harvested 24 hours postinfection, and viral titer was measured from lung homogenates by MDCK-HA assay. P values between PBS and siRNA groups indicate statistical significance with 0.5, 1 and 2 mg/kg siRNA treated groups.

Figure 6. BALB/c mice were administered control and NP-targeting siRNA intranasally (10mg/kg, in PBS). Three hours later all the mice were infected i.n with PR8 virus (50 pfu/mouse). The lungs were harvested at 24 and 48 hours post-infection and total RNA was isolated from the left lung. Total mRNA was reverse transcribed to cDNA using dTl 8 primers. Real time PCR was carried out using PB 1 specific primers to quantify viral mRNA levels. GAPDH was used as an internal control. The right and middle lungs were homogenized and the viral titer was measured by MDCK-HA assay. The virus titer in the samples at 48 hour post-infection is shown in the figure (statistic significance was found between PBS and NP siRNA treated group using student t test (p=0.01); the titer in the samples 24 hours post-infection was too low to detect, possibly due to siRNA directed suppression.

Figure 7. Balb/c mice were treated intranasally with 10 mg/kg cyclophilin B specific siRNA or GFP siRNA in PBS or PBS control. There were five mice per group. The mouse lungs were harvested 24 later. Total RNA was purified from the lung samples and reverse transcription was conducted using dT18 primer. Cyclophilin B-specific primers were used in real-time PCR to quantify the target mRNA level. GAPDH-specific primers were also used in the PCR reaction as control.

Figure 8. Influenza virus suppression in vivo by intranasal administration of cochleate siRNA formulations is shown.

Figure 9. Influenza virus suppression in vivo by intravenous administration of cochleate siRNA formulations is shown.

Figure 1OA. Dose-response profile of intranveouslly adminstered siRNA delivered in cochleat formulations for influenza virus suppression.

Figure 1OB. Influenza virus suppression in vivo by oral gavage administration of cochleate siRNA formulations is shown.

Figures 1 IA-C. The results of experiments indicating that siRNA inhibits influenza virus production in MDCK cells are shown. Six different siRNAs that target various viral transcripts were introduced into MDCK cells by electroporation, and cells were infected with virus 8 hours later. FIG. 1 IA is a time course showing viral titer in culture supernatants as measured by hemagglutinin assay at various times following infection with viral strain A/PR/8/34 (HlNl) (PR8), at a multiplicity of infection (MOI) of 0.01 in the presence or absence of the various siRNAs or a control siRNA. FIG. 1 IB is a time course showing viral titer in culture supernatants as measured by hemagglutinin assay at various times following infection with influenza virus strain A/WSN/33 (HlNl) (WSN) at an MOI of 0.01 in the presence or absence of the various siRNAs or a control siRNA. FIG. 11C shows a plaque assay showing viral titer in culture supernatants from virus infected cells that were either mock transfected or transfected with siRNA NP-1496. FIG. 1 ID shows inhibition of influenza virus production at different doses of siRNA. MDCK cells were transfected with the indicated amount of NP-1496 siRNA followed by infection with PR8 virus at an MOI of 0.01. Virus titer was measured 48 hours after infection. Representative data from one of two experiments are shown.

Figure 12. Positions of various siRNAs relative to influenza virus gene segments, correlated with effectiveness in inhibiting influenza virus are shown.

Figure 13 A. A schematic of a developing chicken embryo indicating the area for injection of siRNA and siRN A/delivery agent compositions is shown.

Figure 13B. The ability of various siRNAs to inhibit influenza virus production in developing chicken embryos is shown.

Figure 14. A schematic showing the interaction of nucleoprotein with viral RNA molecules is shown.

Figure 15. Schematic diagrams illustrating the differences between influenza virus vRNA, mRNA, and cRNA (template RNA) and the relationships between them are shown.

Figure 16. Amounts of viral NP and NS RNA species at various times following infection with virus, in cells that were mock transfected or transfected with siRNA NP- 1496 6-8 hours prior to infection are shown.

Figures 17A and 17B. FIG. 17A shows that inhibition of influenza virus production requires a wild type (wt) antisense strand in the duplex siRNA. MDCK cells were first transfected with siRNAs formed from wt and modified (m) strands and infected 8 hrs later with PR8 virus at MOI of 0.1. Virus titers in the culture supernatants were assayed 24 hrs after infection. Representative data from one of the two experiments are shown. FIG. 17B shows that M-specific siRNA inhibits the accumulation of specific mRNA. MDCK cells were transfected with M-37, infected with PR8 virus at MOI of 0.01, and harvested for RNA isolation 1, 2, and 3 hrs after infection. The levels of M-specific mRNA, cRNA, and vRNA were measured by reverse transcription using RNA-specific primers, followed by real time PCR. The level of each viral RNA species is normalized to the level of .gamma.-actin mRNA (bottom panel) in the same sample. The relative levels of RNAs are shown as mean value .+-.S.D. Representative data from one of the two experiments are shown.

Figure 18A-D. Show that NP-specific siRNA inhibits the accumulation of not only NP- but also M- and NS-specific mRNA, vRNA, and cRNA. MDCK (A-C) and Vero (D) cells were transfected with NP- 1496, infected with PR8 virus at MOI of 0.1, and harvested for RNA isolation 1, 2, and 3 hrs after infection. The levels of mRNA, cRNA, and vRNA specific for NP, M, and NS were measured by reverse transcription using RNA-specific

primers followed by real time PCR. The level of each viral RNA species is normalized to the level of .gamma. -actin mRNA (not shown) in the same sample. The relative levels of RNAs are shown. Representative data from one of three experiments are shown.

Figures 18E-G. The right side in each figure, show that PA-specific siRNA inhibits the accumulation of not only PA- but also M- and NS-specific mRNA, vRNA, and cRNA. MDCK cells were transfected with PA-1496, infected with PR8 virus at MOI of 0.1, and harvested for RNA isolation 1, 2, and 3 hrs after infection. The levels of mRNA, cRNA, and vRNA specific for PA, M, and NS were measured by reverse transcription using RNA-specific primers followed by real time PCR. The level of each viral RNA species is normalized to the level of .gamma.-actin mRNA (not shown) in the same sample. The relative levels of RNAs are shown.

Figure 18H. FIG. 18H shows that NP-specifϊc siRNA inhibits the accumulation of PBl- (top panel), PB2-(middle panel) and PA-(lower panel) specific mRNA. MDCK cells were transfected with NP-1496, infected with PR8 virus at MOI of 0.1 , and harvested for RNA isolation 1, 2, and 3 hrs after infection. The levels of mRNA specific for PBl, PB2, and PA mRNA were measured by reverse transcription using RNA-specific primers followed by real time PCR. The level of each viral RNA species is normalized to the level of .gamma.-actin mRNA (not shown) in the same sample. The relative levels of RNAs are shown.

Figures 19A-C. FIG. 19A is a plot showing that siRNA inhibits influenza virus production in mice when administered together with the cationic polymer PEI prior to infection with influenza virus. Filled squares (no treatment); Open squares (GFP siRNA); Open circles (30 .mu.g NP siRNA); Filled circles (60 .mu.g NP siRNA). Each symbol represents an individual animal, p values between different groups are shown. FIG. 19B is a plot showing that siRNA inhibits influenza virus production in mice when administered together with the cationic polymer PLL prior to infection with influenza virus. Filled squares (no treatment); Open squares (GFP siRNA); Filled circles (60 .mu.g NP siRNA). Each symbol represents an individual animal, p values between different groups are shown. FIG. 19C is a plot showing that siRNA inhibits influenza virus production in mice when administered together with the cationic polymer jetPEI prior to infection with influenza virus significantly more effectively than when administered in PBS. Open

squares (no treatment); Open triangles (GFP siRNA in PBS); Filled triangles (NP siRNA in PBS); Open circles (GFP siRNA with jetPEI); Filled circles (NP siRNA with jetPEI). Each symbol represents an individual animal, p values between different groups are shown.

Figure 20. A plot showing that siRNAs targeted to influenza virus NP and PA transcripts exhibit an additive effect when administered together prior to infection with influenza virus. Filled squares (no treatment); Open circles (60 .mu.g NP siRNA); Open triangles (60 .mu.g PA siRNA); Filled circles (60 .mu.g NP siRNA+60 .mu.g PA siRNA). Each symbol represents an individual animal, p values between different groups are shown.

Figure 21. A plot showing that siRNA inhibits influenza virus production in mice when administered following infection with influenza virus. Filled squares (no treatment); Open squares (60 .mu.g GFP siRNA); Open triangles (60 .mu.g PA siRNA); Open circles (60 .mu.g NP siRNA); Filled circles (60 .mu.g NP+60 .mu.g PA siRNA). Each symbol represents an individual animal, p values between different groups are shown.

Figures 22A-C. FIG. 22A is a schematic diagram of a lentiviral vector expressing a shRNA. Transcription of shRNA is driven by the U6 promoter. EGFP expression is driven by the CMV promoter. SIN-LTR, .PSL, cPPT, and WRE are lentivirus components. The sequence of NP- 1496 shRNA is shown. FIG. 22B presents plots of flow cytometry results demonstrating that Vero cells infected with the lentivirus depicted in FIG. 22B express EGFP in a dose-dependent manner. Lentivirus was produced by co- transfecting DNA vector encoding NP- 1496a shRNA and packaging vectors into 293T cells. Culture supernatants (0.25 ml or 1.0 ml) were used to infect Vero cells. The resulting Vero cell lines (Vero-NP-0.25 and Vero-NP-1.0) and control (uninfected) Vero cells were analyzed for GFP expression by flow cytometry. Mean fluorescence intensity of Vero-NP-0.25 (upper portion of figure) and Vero-NP-1.0 (lower portion of figure) cells are shown. The shaded curve represents mean fluorescence intensity of control (uninfected) Vero cells. FIG. 22C is a plot showing inhibition of influenza virus production in Vero cells that express NP- 1496 shRNA. Parental and NP- 1496 shRNA expressing Vero cells were infected with PR8 virus at MOI of 0.04, 0.2 and 1. Virus titers in the supernatants were determined by hemagglutination (HA) assay 48 hrs after infection.

Figure 23. A plot showing that influenza virus production in mice is inhibited by administration of DNA vectors that express siRNA targeted to influenza virus transcripts. Sixty .mu.g of DNA encoding RSV, NP-1496 (NP) or PB1-2257 (PBl) shRNA were mixed with 40 .mu.l Infasurf and were administered into mice by instillation. For no treatment (NT) group, mice were instilled with 60 .mu.l of 5% glucose. Thirteen hrs later, the mice were infected intranasally with PR8 virus, 12000 pfu per mouse. The virus titers in the lungs were measured 24 hrs after infection by MDCK/hemagglutinin assay. Each data point represents one mouse, p values between groups are indicated.

DETAILED DESCRIPTION OF THE INVENTION The present invention is based on the intracellular phenomenon of RNA interference (RNAi). Therein, the presence in a cell of double-stranded RNA containing a portion that is complementary to a target RNA inhibits expression of the target RNA in a sequence-specific manner. Generally, inhibition is caused by cleavage of the target or inhibition of its translation. While RNAi is a normal cellular response to insults such as pathogen infection, it is also an effective mechanism to return to stasis the system perturbed by a such an infection. Further, RNAi can be used to specifically disrupt cellular signaling pathways. The double-stranded RNA structures that drive RNAi activity are siRNAs, shRNAs, and other double-stranded structures (dsRNAs) that can be processed to yield an siRNA or shRNA (or any other small RNA species that inhibits expression of a target transcript by RNA interference). RNAi-inducing entities such as siRNAs and shRNAs can be introduced into a subject, or an isolated cell thereof, and modulate specific signaling pathways. Further, these dsRNAs are useful therapeutics to prevent and treat diseases or disorders characterized by aberrant cell signaling. For instance, virus that infect mammals replicate by taking control of cellular machinery of the host cell. It is therefore useful to use RNAi technology to disrupt the viral signaling pathway that controls virus production. It is known that certain viral genes control critical in two stages of the viral life cycle. Prior work in the art has focused on the viral polymerases are effective targets for siRNA as they are required for viral replication. For example, RNA-dependent RNA polymerase (RdRP) is an essential polypeptide in both transcription and replication. This has led to the speculation that silencing of RdRP subunits would lead to a nearly total loss

of all RNA synthesis and thus result in a drastic inhibition of virus production. Indeed, this result has been observed in RSV, vesicular stomatitis and parainfluenza virus. (See, Barik S. Control of nonsegmented negative-strand RNA virus replication by siRNA. Virus Res. 2004 Jun l;102(l):27-35; and Bitko et al, .Phenotypic silencing of cytoplasmic genes using sequence-specific double-stranded short interfering RNA and its application in the reverse genetics of wild type negative-strand RNA viruses. BMC Microbiol. 2001;l(l):34. Epub 2001 Dec 20.) Another viral protein, variously termed nucleoprotein, capsid, or nucleocapsid, is recognized to be involved in, e.g., viral transcription and replication. However, antisense studies performed on nucleoprotein did not indicate that the transcript encoding this polypeptide is as suitable a target as polymerase. (See, Hatta et al. Inhibition of influenza virus RNA polymerase and nucleoprotein genes expression by unmodified, phosphorothioated, and liposomally encapsulated oligonucleotides.Biochem Biophys Res Commun. 1996 Jun 14;223(2):341~ 6; and Mizuta et al. Antisense oligonucleotides directed against the viral RNA polymerase gene enhance survival of mice infected with influenza A.Nat Biotechnol.

1999 Jun; 17(6): 583-7.) Recent studies with siRNA have suggested that nucleoprotein be investigated as a possible target of siRNA. Gitlin et al. Short interfering RNA confers intracellular antiviral immunity in human cells. Nature. 2002 JuI 25;418(6896):430-4; Fowler et al. Inhibition of Marburg virus protein expression and viral release by RNA interference. J Gen Virol. 2005 Aρr;86(Pt 4):1181-8; and Yuan et al. Inhibition of coxsackievirus B3 replication by small interfering RNAs requires perfect sequence match in the central region of the viral positive strand. J Virol. 2005 Feb;79(4):2151-9). The present invention demonstrates the use of siRNAs directed to viral nucleoprotein sequences to disrupt viral signaling pathways and inhibit viral replication. Further, the present inventors determined that inhibition or silencing of another viral protein, nucleoprotein or nucleocapsid protein, has similar effects to inhibiting polymerase activity. Thus, a nucleoprotein transcript is preferred target for siRNA.

While not wishing to be bound by any theory, the broad effect of NP siRNA is likely a result of the importance of NP in binding and stabilizing vRNA and cRNA, instead of NP-specific siRNA non-specifically targeting RNA degradation. For example, the NP gene segment in influenza virus encodes a single-stranded RNA-binding nucleoprotein, which can bind to both vRNA and cRNA. During the viral life cycle, NP mRNA is first transcribed and translated. The primary function of the NP protein is to encapsidate the virus genome for the purpose of RNA transcription, replication and

is

packaging. In the absence of NP protein, the full-length synthesis of both vRNA and cRNA is strongly impaired. When NP siRNA induces the degradation of NP RNA, NP protein synthesis is impaired and the resulting lack of sufficient NP protein subsequently affects the replication of other viral gene segments. In this way, NP siRNA is able to potently inhibit virus production at a very early stage. Thus, the multifunctional properties of viral nucleoproteins make them useful targets for RNAi-based therapy, offering the opportunity to intervene at multiple different stages of the viral life cycle by inhibiting a single gene.

It has been hypothesized that the number of NP protein molecules in infected host cells regulates mRNA synthesis, as opposed to replication of genome RNA (vRNA and cRNA). Using a temperature-sensitive mutation in the NP protein, previous studies have shown that cRNA, but not mRNA, synthesis was temperature sensitive both in vitro and in vivo. NP protein has been shown to be required for elongation and anti-termination of the nascent cRNA and vRNA transcripts. The results presented herein demonstrate that viral NP-specific siRNA inhibits the accumulation of all viral RNAs in infected cells. While not wishing to be bound by any theory, it appears probable that in the presence of NP-specific siRNA, the newly transcribed NP mRNA is degraded, resulting in the inhibition of NP protein synthesis following virus infection. Without newly synthesized NP, further viral transcription and replication, and therefore new virion production is inhibited.

The features and other details of the invention will now be more particularly described with reference to the accompanying drawings and pointed out in the claims. It will be understood that particular embodiments described herein are shown by way of illustration and not as limitations of the invention. The principal features of this invention can be employed in various embodiments without departing from the scope of the invention. All parts and percentages are by weight unless otherwise specified.

Definitions

For convenience, certain terms used in the specification, examples, and appended claims are collected here. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention pertains. However, to the extent that these definitions vary from meanings circulating within the art, the definitions below are to control.

A "nucleotide" comprises a nitrogenous base, a sugar molecule, and a phosphate group. A "nucleoside" comprises a nitrogenous base (nucleobase) linked to a sugar molecule. In a naturally occurring nucleic acid, phosphate groups covalently link adjacent nucleosides to form a polymer. A nucleic acid may include naturally occurring nucleosides (e.g., adenosine, thymidine, guanosine, cytidine, uridine, deoxyadenosine, deoxythymidine, deoxyguanosine, and deoxycytidine), nucleoside analogs (e.g., 2- aminoadenosine, 2-thiothymidine, inosine, pyrrolo-pyrimidine, 3 -methyl adenosine, C5- propynylcytidine, C5-propynyluridine, C5-bromouridine, C5-fluorouridine, C5- iodouridine, C5-methylcytidine, 7-deazaadenosine, 7-deazaguanosine, 8-oxoadenosine, 8- oxoguanosine, O(6)-methylguanine, and 2-thiocytidine), chemically modified bases, biologically modified bases (e.g., methylated bases), intercalated bases, modified sugars (e.g., 2'-fluororibose, ribose, 2'-deoxyribose, arabinose, and hexose).

The term "RNA" or "RNA molecule" or "ribonucleic acid molecule" refers to a polymer of ribonucleotides. The term "DNA" or "DNA molecule" or deoxyribonucleic acid molecule" refers to a polymer of deoxyribonucleotides. DNA and RNA can be synthesized naturally (e.g, by DNA replication or transcription of DNA or RNA, respectively). RNA can be post-transcriptionally modified. DNA and RNA can also be chemically synthesized. The terms "target mRNA" and "target transcript" are synonymous as used herein. The term "RNA interference" ("RNAi") refers to selective intracellular degradation of RNA (also referred to as gene silencing). RNAi also includes translational repression by microRNAs or siRNAs acting like microRNAs. RNAi can be initiated by introduction of small interfering RNAs (siRNAs) or production of siRNAs intracellularly (e.g., from a plasmid or transgene), to silence the expression of one or more target genes. Alternatively, RNAi occurs in cells naturally to remove foreign RNAs (e.g., viral RNAs). Natural RNAi proceeds via dicer-directed fragmentation of precursor dsRNA which direct the degradation mechanism to other cognate RNA sequences.

The term "small interfering RNA" ("siRNA"), also referred to in the art as "short interfering RNAs," refers to an RNA (or RNA analog) comprising between about 10-60 nucleotides (or nucleotide arialogs) that is capable of directing or mediating RNA interference. The term "siRNA" includes both double stranded siRNA and single stranded siRNA. Generally, as used herein the term "siRNA" refers to double stranded siRNA (as compared to single stranded or antisense RNA). The term "short hairpin RNA"

("shRNA") refers to an siRNA (or siRNA analog) precursor that is folded into a hairpin structure and contains a single stranded portion of at least one nucleotide (a "loop"), e.g., an RNA molecule that contains at least two complementary portions hybridized or capable of hybridizing to form a double-stranded (duplex) structure sufficiently long to mediate RNAi (as described for siRNA duplexes), and at least one single-stranded portion, typically between approximately 1 and 10 nucleotides in length that forms a loop connecting the regions of the shRNA that form the duplex portion. The duplex portion may, but typically does not, contain one or more mismatches and/or one or more bulges consisting of one or more unpaired nucleotides in either or both strands. Without wishing to be bound by theory, shRNAs are thought to be processed into siRNAs by the conserved cellular RNAi machinery. shRNAs are capable of inhibiting expression of a target transcript that is complementary to a portion of the shRNA (referred to as the antisense or guide strand of the shRNA). In general, the features of the duplex formed between the guide strand of the shRNA and a target transcript are similar to those of the duplex formed between the guide strand of an siRNA and a target transcript. In certain embodiments of the invention the 5' end of an shRNA has a phosphate group while in other embodiments it does not. In certain embodiments of the invention the 3' end of an shRNA has a hydroxyl group.

The term "RNAi-inducing entity" or "RNAi agent" refers to an RNA species (other than a naturally occurring molecule not modified by the hand of man or transported into its location by the hand of man) whose presence within a cell results in RNAi and leads to reduced expression of an RNA to which the RNAi agent is targeted. The RNAi agent may be, for example, an siRNA or shRNA. In certain embodiments of the invention an siRNA may contain a strand that inhibits expression of a target RNA via a translational repression pathway utilized by endogenous small RNAs referred to as microRNAs. In certain embodiments of the invention an shRNA may be processed intracellularly to generate an siRNA that inhibits expression of a target RNA via this microRNA translational repression pathway. Any "target RNA" may be referred to as a "target transcript" regardless of whether the target RNA is a messenger RNA. The terms "target RNA" and "target transcript" are used interchangeably herein. The term RNAi- inducing agent encompasses RNAi agents and vectors (other than naturally occurring molecules not modified by the hand of man as described above) whose presence within a

cell results in RNAi and leads to reduced expression of a transcript to which the RNAi agent is targeted.

An "RNAi-inducing vector" includes a vector whose presence within a cell results in transcription of one or more RNAs that self-hybridize or hybridize to each other to form an RNAi agent. In various embodiments of the invention this term encompasses plasmids, e.g., DNA vectors (whose sequence may comprise sequence elements derived from a virus), or viruses, (other than naturally occurring viruses or plasmids that have not been modified by the hand of man), whose presence within a cell results in production of one or more RNAs that self-hybridize or hybridize to each other to form an RNAi agent. In general, the vector comprises a nucleic acid operably linked to expression signal(s) so that one or more RNA molecules that hybridize or self-hybridize to form an RNAi agent is transcribed when the vector is present within a cell. Thus the vector provides a template for intracellular synthesis of the RNAi agent. For purposes of inducing RNAi, presence of a viral genome into a cell (e.g., following fusion of the viral envelope with the cell membrane) is considered sufficient to constitute presence of the virus within the cell. In addition, for purposes of inducing RNAi, a vector is considered to be present within a cell if it is introduced into the cell, enters the cell, or is inherited from a parental cell, regardless of whether it is subsequently modified or processed within the cell. An RNAi- inducing vector is considered to be targeted to a transcript if presence of the vector within a cell results in production of one or more RNAs that hybridize to each other or self- hybridize to form an RNAi agent that is targeted to the transcript, i.e., if presence of the vector within a cell results in production of one or more RNAi agent targeted to the transcript. Use of the term "induce" is not intended to indicate that the RNAi agent necessarily activates or upregulates RNAi in general but simply indicates that presence of the vector within a cell results in production of an RNAi agent within the cell, leading to an RNAi-mediated reduction in expression of an RNA to which the agent is targeted.

An RNAi-inducing entity is considered to be targeted to a target transcript for the purposes described herein if (1) the agent comprises a strand that is substantially complementary to the target transcript over a window of evaluation between 15-29 nucleotides in length, e.g., 15, more preferably at least about 17, yet more preferably at least about 18 or 19 to about 21-23 or 24-29 nucleotides in length. For example, in various embodiments of the invention the agent comprises a strand that has at least about 70%, preferably at least about 80%, 84%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%,

97%, 98%, 99%, or 100% precise sequence complementarity with the target transcript over a window of evaluation between 15-29 nucleotides in length, e.g., over a window of evaluation of at least 15, more preferably at least about 17, yet more preferably at least about 18 or 19 to about 21-23 or 24-29 nucleotides in length; or (2) one strand of the RNAi agent hybridizes to the target transcript under stringent conditions for hybridization of small (<5.0 nucleotide) RNA molecules in vitro and/or under conditions typically found within the cytoplasm or nucleus of mammalian cells. In addition, in the case of agents that act via the microRNA translational repression pathway, the duplex formed by the agent and the target contains at least one bulge and/or mismatch. In certain embodiments of the invention a GU or UG base pair in a duplex formed by a guide strand and a target transcript is not considered a mismatch for purposes of determining whether an RNAi agent is targeted to a transcript.

An RNA-inducing vector whose presence within a cell results in production of an RNAi agent that is targeted to a transcript is also considered to be targeted to the transcript. Since the effect of targeting a transcript is to reduce or inhibit expression of the gene that directs synthesis of the transcript, an RNAi agent targeted to a transcript is also considered to target the gene that directs synthesis of the transcript even though the gene itself (e.g., genomic DNA in the case of a cell) is not thought to interact with the agent or components of the cellular silencing machinery. Thus an RNAi agent or vector that targets a transcript is understood to target the gene that provides a template for synthesis of the transcript.

A viral "nucleoprotein" (also termed a "capsid protein" or a "nucleocapsid protein") is a viral polypeptide that sequesters viral RNA and affects viral transcription. The viral nucleoprotein is capable of forming a nucleic acid/protein complex (i.e., a ribonucleoprotein (RNP) complex). Nucleoproteins are also termed 'TSfS" in double stranded viruses (e.g., NS-6). A nucleoprotein is distinguished from an outer capsid protein, which generally does not contact and sequester the viral genome. The terms "nucleoprotein mRNA," "NP mRNA", "nucleoprotein transcript," and "NP transcript" are understood to include any mRNA that encodes a viral nucleoprotein or its functional equivalent as described herein.

As will be appreciated by one of ordinary skill in the art, proteins fulfilling one or more functions of a viral nucleoprotein are referred to by a number of different names, depending on the particular virus of interest. For example, in the case of certain viruses

such as influenza the protein is known as nucleoprotein (NP) while in the case of a number of other single-stranded RNA viruses, proteins that fulfill a similar role are referred to as nucleocapsid (NC or N) proteins. In yet other viruses, analogous proteins that both interact with genomic nucleic acid and play a structural role in the viral particle are considered to be capsid (C) proteins.

As used herein, the terms "nucleoprotein mRNA," "NP mRNA", "nucleoprotein transcript," and "NP transcript" are understood to include any mRNA that encodes a viral nucleoprotein or its functional equivalent as described herein. Any virus containing a nucleoprotein gene or the functional equivalent thereof is suitable as an siRNA target. By way of non-limiting example, several groups of target viruses are described herein in greater detail.

"Subject" includes living organisms such as humans, monkeys, cows, sheep, ■ horses, pigs, cattle, goats, dogs, cats, mice, rats, cultured cells therefrom, and transgenic species thereof. In a preferred embodiment, the subject is a human. A subject is synonymous with a "patient." Administration of the compositions of the present invention to a subject to be treated can be carried out using known procedures, at dosages and for periods of time effective to treat the condition in the subject. An effective amount of the therapeutic compound necessary to achieve a therapeutic effect may vary according to factors such as the age, sex, and weight of the subject, and the ability of the therapeutic compound to treat the foreign agents in the subject. Dosage regimens can be adjusted to provide the optimum therapeutic response. For example, several divided doses may be administered daily or the dose may be proportionally reduced as indicated by the exigencies of the therapeutic situation.

As used herein, the terms "approximately" or "about" in reference to a number are generally taken to include numbers that fall within a range of 5% in either direction

(greater than or less than) the number unless otherwise stated or otherwise evident from the context (except where such number would exceed 100% of a possible value). Where ranges are stated, the endpoints are included within the range unless otherwise stated or otherwise evident from the context. The term "complementary" is used herein in accordance with its art-accepted meaning to refer to the capacity for precise pairing between particular bases, nucleosides, nucleotides or nucleic acids. For example, adenine (A) and uridine (U) are complementary; adenine (A ) and thymidine (T) are complementary; and guanine (G) and

cytosine (C), are complementary and are referred to in the art as Watson-Crick base pairings. If a nucleotide at a certain position of a first nucleic acid sequence is complementary to a nucleotide located opposite in a second nucleic acid sequence, the nucleotides form a complementary base pair, and the nucleic acids are complementary at that position. One of ordinary skill in the art will appreciate that the nucleic acids are aligned in antiparallel orientation (i.e., one nucleic acid is in 5' to 3' orientation while the other is in 3' to 5' orientation). A degree of complementarity of two nucleic acids or portions thereof may be evaluated by determining the total number of nucleotides in both strands that form complementary base pairs as a percentage of the total number of nucleotides over a window of evaluation when the two nucleic acids or portions thereof are aligned in antiparallel orientation for maximum complementarity. For example, AAAAAAAA (SEQ ID NO: 11424) and TTTGTTAT (SEQ ID NO: 11425) are 75% complementary since there are 12 nucleotides in complementary base pairs out of a total of 16. Nucleic acids that are at least 70% complementary over a window of evaluation are considered substantially complementary over that window. Specifically, if the window of evaluation is 15-16 nucleotides long, substantially complementary nucleic acids may have 0-3 mismatches within the window; if the window is 17 nucleotides long, substantially complementary nucleic acids may have 0-4 mismatches within the window; if the window is 18 nucleotides long, substantially complementary nucleic acids may have may contain 0-5 mismatches within the window; if the window is 19 nucleotides long, substantially complementary nucleic acids may contain 0-6 mismatches within the window. In certain embodiments the mismatches are not at continuous positions. In certain embodiments the window contains no stretch of mismatches longer than two nucleotides in length. In preferred embodiments a window of evaluation of 15-19 nucleotides contains 0-1 mismatch (preferably 0), and a window of evaluation of 20-29 nucleotides contains 0-2 mismatches (preferably 0-1, more preferably 0).

"Substantially pure" includes compounds, e.g., drugs, proteins or polypeptides that have been separated from components which naturally accompany it. Typically, a compound is substantially pure when at least 10%, more preferably at least 20%, more preferably at least 50%, more preferably at least 60%, more preferably at least 75%, more preferably at least 90%, and most preferably at least 99% of the total material (by volume, by wet or dry weight, or by mole percent or mole fraction) in a sample is the compound of interest. Purity can be measured by any appropriate method, e.g., in the case of

polypeptides by column chromatography, gel electrophoresis or HPLC analysis. A compound, e.g., a protein, is also substantially purified when it is essentially free of naturally associated components or when it is separated from the native contaminants which accompany it in its natural state. Included within the meaning of the term "substantially pure" are compounds, such as proteins or polypeptides, which are homogeneously pure, for example, where at least 95% of the total protein (by volume, by wet or dry weight, or by mole percent or mole fraction) in a sample is the protein or polypeptide of interest.

"Administering" includes routes of administration which allow the compositions of the invention to perform their intended function, e.g., treating or preventing viral disease. A variety of routes of administration are possible including, but not necessarily limited to parenteral {e.g., intravenous, intraarterial, intramuscular, subcutaneous injection), oral (e.g., dietary), inhalation (e.g., aerosol to lung), topical, nasal, rectal, or via slow releasing microcarriers depending on the disease or condition to be treated. Inhalation and parenteral administration are preferred modes of administration.

Formulation of the compound to be administered will vary according to the route of administration selected (e.g., solution, emulsion, gels, aerosols, capsule). An appropriate composition comprising the compound to be administered can be prepared in a physiologically acceptable vehicle or carrier and optional adjuvants and preservatives. For solutions or emulsions, suitable carriers include, for example, aqueous or alcoholic/aqueous solutions, emulsions or suspensions, including saline and buffered media, sterile water, creams, ointments, lotions, oils, pastes and solid carriers. Parenteral vehicles can include sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's or fixed oils. Intravenous vehicles can include various additives, preservatives, or fluid, nutrient or electrolyte replenishers (See generally, Remington's Pharmaceutical Science, 16th Edition, Mack, Ed. (1980)).

"Effective amount" includes those amounts of the composition of the invention which allow it to perform its intended function, e.g., treating or preventing, partially or totally, viral infection as described herein. The effective amount will depend upon a number of factors, including biological activity, age, body weight, sex, general health, severity of the condition to be treated, as well as appropriate pharmacokinetic properties. For example, dosages of the active substance may be from about 0.01mg/kg/day to about 100mg/kg/day, advantageously from about O.lmg/kg/day to about 10mg/kg/day. For

example, an siRNA is delivered to a subject in need thereof at a dosage of from about O.lmg/kg/day to about 5 mg/kg/day. A therapeutically effective amount of the active substance can be administered by an appropriate route in a single dose or multiple doses. Further, the dosages of the active substance can be proportionally increased or decreased as indicated by the exigencies of the therapeutic or prophylactic situation.

"Pharmaceutically acceptable carrier" includes any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like which are compatible with the activity of the compound and are physiologically acceptable to the subject. An example of a pharmaceutically acceptable carrier is buffered normal saline (0.15M NaCl). The use of such media and agents for pharmaceutically active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the therapeutic compound, use thereof in the compositions suitable for pharmaceutical administration is contemplated. Supplementary active compounds can also be incorporated into the compositions. "Additional ingredients" include, but are not limited to, one or more of the following: excipients; surface active agents; dispersing agents; inert diluents; granulating and disintegrating agents; binding agents; lubricating agents; sweetening agents; flavoring agents; coloring agents; preservatives; physiologically degradable compositions such as gelatin; aqueous vehicles and solvents; oily vehicles and solvents; suspending agents; dispersing or wetting agents; emulsifying agents, demulcents; buffers; salts; thickening agents; fillers; emulsifying agents; antioxidants; antibiotics; antifungal agents; stabilizing agents; and pharmaceutically acceptable polymeric or hydrophobic materials. Other "additional ingredients" which may be included in the pharmaceutical compositions of the invention are known in the art and described, e.g., in Remington 's Pharmaceutical Sciences.

"Conserved sites"

Conserved sites of a virus are those sites or sequences that are found to be present in more than about 70% of all known sequences for a given region. The set of siRNA having sequence identity to conserved sites are determined by deriving all 19-mer sequence fragments from each of the known viral sequences, and evaluating the frequency in which each sequence fragment is present as an exact match within each of the set of viral sequences. A first viral sequence contains a 19-mer sequence fragment

that extends from position 1 through 19, another from position 2 through 20, another from position 3 through 21, and so on until the 19 nucleotide site at the end of the strand.

Likewise the second, third, and fourth viral sequences are extracted in the same way, all the way down to the last viral sequence in the list. The sequence fragments are then added to a growing table of sequence fragments and a count is maintained of the number of viral sequences that contain each 19-mer fragment. The fragment frequency is expressed as the percent of the viral sequences that contain each specific 19-mer fragment. The set of siRNA of the invention are those having sequence identity with greater than a majority of the known sequences, preferably greater than about 70% of the known sequences .

"Conserved sites for influenza virus" do not include sequences disclosed in U.S. Patent Application No. 10/674,159 filed September 29, 2003, Publication No. US-2004- 0242518-Al (J. Chen, Q. Ge and M. Eisen, "Influenza Therapeutic") and expressly listed in Table , below (Seq. ID Nos. 69-108). Conserved sites for influenza virus may exclude some embodiments disclosed in copending U.S. Patent Application No. 11/102097 filed April 8, 2005 (a CIP of the above identified application), hereby incorporated by reference in its entirety.

"Variants of a conserved site" include a small number of mismatches that are tolerated between the target RNA and the antisense guide sequence of the siRNA duplex. Thus, a single siRNA duplex targeting a highly conserved site in a virus will often still be active against minor variant species having only one or a few mismatches relative to the conserved site. We used the viral mismatch data in an algorithm to expand the list of potential influenza A viral sequence variants that are targetable by a given siRNA duplex, described below in Example 15. Niicleoproteins as RNAi targets

The present invention provides compositions and methods using RNAi for treating or preventing virus replication or infection in a subject, such as a human or non-human mammal. Preferably, the virus is an RNA virus. For example, the RNA virus is a negative strand virus. Alternatively, the virus is a positive strand virus or a double stranded (ds) virus. A preferred target RNA is the nucleoprotein (also termed nucleocapsid) transcript, or a transcript of a viral gene that accomplishes the function of the viral nucleoprotein. Any virus containing a nucleoprotein gene or the functional

equivalent thereof is suitable as an siRNA target. By way of non-limiting example, several groups of target viruses are described herein in greater detail.

Negative strand RNA viruses Negative strand RNA viruses have a viral genome that is in the complementary sense of mRNA. Therefore, one of the first activities of negative strand RNA viruses following entry into a host cell is transcription and production of viral mRNAs. For this purpose, the virions carry an N-RNA structure that consists of the viral RNA (vRNA) that is tightly associated with the viral nucleoprotein (N or NP, sometimes called nucleocapsid protein). The RNA-dependent RNA polymerase binds either directly to the N-RNA, as is the case for influenza virus, or it binds with the help of a co-factor, like the phosphoprotein of the paramyxoviruses and the rhabdo viruses. The intact N-RNA is the actual template for transcription rather than the naked vRNA and nucleoprotein contributes to exposure of the nucleotide bases of the N-RNA for efficient reading by the polymerase.

Commonalities in expression and replication of ssRNA(-) viruses appear to include distinct transcription and replication functions for the RdRp, probably triggered by binding of the virion nucleoprotein (N or NP) subunits. Thus, both RNA(-) and RNA(+) may be found complexed with N proteins in replication complexes. Negative strand RNA viruses useful in the present invention include human respiratory syncytial virus (RSV), human metapneumovirus (hMP V), Mumps virus, Measles virus, Hendra virus, Newcastle disease virus, Influenza virus, Vesicular stomatitis virus (VSV), Hepatitis delta virus, Marburg virus, Ebola virus, Hantaan virus, Sin nombre virus, Lassa fever virus, Lacrosse virus, Rift valley fever virus, Bunyamwera virus, Sandfly fever Sicilian virus, Sabia virus, Guanarito virus, Machupo virus, Junin virus, lymphocytic choriomeningitis virus (LCMV), and parainfluenza virus. Other suitable negative strand RNA viruses are known to those skilled in the art. Genbank Accession numbers for exemplary viral nucleoprotein nucleic acid sequences include U41071, NC_005077, K03362, NC_002045, NC_003443, NCJ)Ol 781, AY297748, AF389119, AY705373, AY354458, NC_001608, AB027523, and L37904.

Influenza

Influenza viruses are enveloped, negative-stranded RNA viruses of the

Orthomyxoviridae family. They are classified as influenza types A, B, and C, of which

influenza A is the most pathogenic and is believed to be the only type able to undergo reassortment with animal strains. Current vaccines based upon inactivated virus are able to prevent illness in approximately 70-80% of healthy individuals under age 65; however, this percentage is far lower in the elderly or immunocompromised. In addition, the expense and potential side effects associated with vaccine administration make this approach less than optimal. There are four antiviral drugs currently approved in the United States for treatment and/or prophylaxis of influenza, amantadine, rimanadine, zanamivir, and oseltamivir, but their use is limited due to concerns about side effects, compliance, and possible emergence of resistant strains. Therefore, there remains a need for the development of effective therapies for the treatment and prevention of influenza infection.

Influenza nucleocapsid protein or nucleoprotein (NP) is the major structural protein that interacts with the RNA segments to form RNP. It is encoded by RNA segment 5 of influenza A virus and is 1,565 nucleotides in length. NP contains 498 amino acids. NP protein is critical in virus replication. The number of NP protein molecules in infected cells has been hypothesized to regulate the levels of mRNA synthesis versus genome RNA (vRNA and cRNA) replication (1) Using a temperature-sensitive mutation in the NP protein, previous studies have shown that cRNA, but not mRNA, synthesis was temperature-sensitive both in vitro and in vivo (28, 29). NP protein was also shown to be required for elongation and antitermination of nascent cRNA and vRNA transcripts (29, 30). The present inventors have found that NP-specific siRNA inhibited the accumulation of all viral RNAs in infected cells. Probably, in the presence of NP-specific siRNA, the newly transcribed NP mRNA is degraded, resulting in inhibition of NP protein synthesis. Without newly synthesized NP, further viral transcription and replication are blocked, as is new virion production.

Parainfluenza virus

Parainfluenza virus (PIV) is enveloped, has a nonsegmented negative-strand RNA genome and belong to the family Paramyxoviridae of the order Mononegavirales. The parainfluenza viruses comprise two of the three genera of the subfamily Paramyxovirinae, namely Respirovirus (hPIVl and hPIV3) and Rubulavirus (hPIV2 and hPIV4). PIV is second to RSV as a common cause of lower respiratory tract disease in infants and children. PIV can cause repeated infections throughout life, usually manifested by an upper respiratory tract illness (e.g., a cold and/or sore throat). PIV can also cause serious

lower respiratory tract disease (e.g., pneumonia, bronchitis, and bronchiolitis), especially among the elderly, and among patients with compromised immune system. Only symptomatic treatment will be used for croup. Specific antiviral treatment is not available. Nucleocapsid (NP) protein is 509 to 557 amino acids in length and the amino acid sequence is relatively well conserved. The NP encapsidates genomic and antigenomic RNA 5 with each NP monomer associating with six nucleotides. PvNA replication is dependent on cosynthetic encapsidation of the nascent PvNA by NP. The N- terminal 75% of the moclecule is the more highly conserved part. It is involved in forming the soluble complex with P as well as in subsequently associating with other NP monomers and with RNA to form the nucleocapsid.

Respiratory Syncytial Virus

Respiratory syncytial virus (RSV) is a negative-sense, enveloped RNA virus belonging to the genus pneumovirus in paramyxoviridae. RSV infects upper and lower respiratory tract of essentially all children within the first two years of life and is also a significant cause of morbidity and mortality in the elderly. Infants experiencing RSV bronchiolitis are more likely to develop wheezing and asthma later in life. The illness may begin with URT symptoms and progress rapidly over 1-2 days to the diffuse small airway disease. RSV also causes repeated infections throughout life, usually associated with moderate-to-severe cold-like symptoms; however, severe lower respiratory tract disease may occur at any age, especially among the elderly or among those with compromised cardiac, pulmonary, or immune systems.

Research towards effective treatment and a vaccine against RSV has been ongoing for nearly four decades with few successes. Currently, no vaccine is clinically approved for RSV. The nucleocapsid (N) protein is a major structural protein involved in encapsidation of the RNA genome and is essential for replication and transcription of the genome. It is 1176 nucleotides in length.

Human Metapneumovirus

The human metapneumovirus (hMPV) is a member of the family

Paramyxoviridae. The virus is negative-sense RNA virus, which can be classified into two genotypes (A and B), has been assigned to the genus Metapneumovirus within the subfamily Pneumovirinae. The virus is responsible for acute respiratory tract infections in young children, elderly patients and immunocompromised hosts. The clinical

syndromes associated with this viral infection encompass mild to severe respiratory problems and acute wheezing as well as bronchiolitis and pneumonia. The nucleocapsid (N) gene is 1,206 nucleotides in length, and has substantially similar activity as N in RSV.

Positive strand RNA viruses

The positive-stranded RNA viruses have wholly or partially translatable genomes, and as a result are usually infectious as naked RNA. These viruses utilize the mechanisms of cap-independent translation initiation, polyprotein processing and RNA replication to regulate expression of their viral genome.

Several viruses cause disease in humans and animals. Poliovirus has caused severe poliomyelitis worldwide in the past and brought financial burden to developing countries attempting to eradicate the disease. Human rhinovirus causes one of the most widespread viral diseases, the common cold, for which there is no effective treatment or prevention. Foot-and-mouth disease virus (FMDV), an aphtho virus, caused a recent outbreak in sheep and cattle, creating significant financial risks in European agriculture industries. Coxsackie viruses are responsible for diseases such as hand-foot— mouth syndrome (primarily in young children), myocarditis, and ocular conjunctivitis. Hepatitis A virus, a hepatovirus, is known to be a leading cause of liver disease. Positive strand RNA viruses useful in the present invention include human astrovirus, Norwalk like virus, Coronavirus, Hepatitis A, C and E viruses, Yellow fever virus, Polio virus, Rhinovirus, Encephalomyocarditis virus, Human parechovirus, HIV-I, Dengue virus, West nile virus, Foot and mouth disease virus, Rubella virus, and Yellow fever virus. Other suitable positive strand RNA viruses are known to those skilled in the art. Genbank Accession numbers for exemplary positive strand viral nucleoprotein nucleic acid sequences include AY391777, AJ313030, NC_001474, AY660002, D83645 (Capsid), X03700 (Capsid), and L24917 (Capsid).

Human Coronavirus Human coronaviruses (HCoVs), members of the Coronaviridae family, are ubiquitous in the environment and are responsible for up to one-third of common colds. The viruses are enveloped viruses that possess a positive-strand RNA genome of up to 31 kb, which represents the largest known genome among all RNA viruses. Human coronaviruses are responsible for 10-30% of all common colds. All age groups are affected, and infection rates have been shown to be uniform for all age groups. Infection

may be subclinical or very mild. More severe lower respiratory tract infection has been reported in young children and old people. Reinfections with the similar and different strains is common. Antibodies to one coronavirus group do not protect against infection with viruses from another group or the same group but infect 4 months later.

West Nile Virus

West Nile virus (WNV), a member of the family Flaviviridae, has recently spread throughout the United States and the infection resulted in more than 9000 cases and 200 deaths in 2003. It has become the most common cause of viral encephalitis in several states in the US. West Nile virus encephalitis is a zoonosis. The life cycle of the virus includes mainly birds as hosts and mosquitoes as vectors. Humans are accidental hosts, insufficient to support the life cycle of the virus because of low-grade, transient viremia. However, human-to human transmission through blood, organ transplantation, and lactation has been documented. The frequency of severe neurologic disease in the current epidemic suggests a more neuro virulent strain of virus than the one classically associated with West Nile fever. Several neurologic manifestations have been described, but the most characteristic presentation is encephalitis with weakness. Thus far, no therapeutic intervention has shown consistent clinical efficacy in treatment of West Nile virus.

Rhinovirus

Human rhino viruses are the major causative agents of the common cold and associated upper respiratory tract complications. Since the virus has more than hundred serotypes and previous exposure to rhinovirus gives little immunological protection, which leads to higher rate of infection (Hayden FG. Rhinovirus and the lower respiratory tract. Rev Med Virol. 2004;14(l):17-31). The infection causes short self-limiting illness however, for asthmatics, the elderly and immunocompromised patients, rhinovirus infection can lead to life-threatening complications. The virus is a non enveloped positive strand RNA virus belonging to the family Picornaviridae and has a genome of approx. 7200 nucleotides. The viral genome functions directly as mRNA as soon as it is released into the host cytoplasm (McKnight KL, Lemon SM. The rhinovirus type 14 genome contains an internally located RNA structure that is required for viral replication. RNA;4: 1569-84).

Rhinovirus can be transmitted by aerosol or direct contact. Primary site of inoculation is the nasal mucosa, although the conjunctiva may be involved to a lesser

extent (Tan WC. Viruses in asthma exacerbations. Curr Opin PuIm Med. 2005;l 1:21-6). The virus attaches to respiratory epithelium and spreads locally. The major human receptor for this virus is intercellular adhesion molecule- 1 (ICAM-I) (Weinberger M. Respiratory infections and asthma: current treatment strategies. Drug Discov Today. 2004; 9:831-7). Some RV serotypes also up-regulate the ICAM-I expression on human epithelial cells to increase infection susceptibility (Papi A, Papadopoulos NG, Stanciu LA, Degitz K, Holgate ST, Johnston SL. Effect of desloratadine and loratadine on rhinovirus-induced intercellular adhesion molecule 1 upregulation and promoter activation in respiratory epithelial cells. J Allergy Clin Immunol. 2001; 108:221 -8). The virus replicates well in the nasal passages and upper tracheobronchial tree but less well in the lower respiratory tract. Incubation period is approximately 2-3 days. Viremia is uncommon but the virus is shed in large amounts. Viral shedding can occur a few days before cold symptoms are recognized by the patient, peaks on days 2-7 of the illness, and may last for as many as 3-4 weeks. Rhinovirus infection of upper airway has been linked to asthma exacerbations and studies suggest these are caused by additive or synergistic interactions with allergen exposure or with air pollution (Tan, supra). An impaired antiviral immunity to rhinovirus may lead to impaired viral clearance and hence prolonged symptoms. Th-2 cytokines has been shown to play an important role in upregulation of human rhinovirus receptor and may explain exacerbation of disease in asthmatics following rhinovirus infection (Bianco A, Sethi SK, Allen JT, Knight RA, Spiteri MA. Th2 cytokines exert a dominant influence on epithelial cell expression of the major group human rhinovirus receptor, ICAM-I . Eur Respir J. 1998;12:619-26).

Dengue virus

Dengue is an endemic viral disease affecting tropical and subtropical regions around the world. Dengue fever (DF) and its more serious forms, dengue hemorrhagic fever (DHF) and dengue shock syndrome (DSS), has become important public health problems and has grown dramatically in recent times. The disease is now endemic in more than 100 countries in Africa, the Americas, the eastern Mediterranean, Southeast Asia, and the Western Pacific, threatening more than 2.5 billion people (Gubler, D. J. 1998. Dengue and dengue hemorrhagic fever. Clin. Microbiol. Rev. 11:480-496.). The World Health Organization estimates that there may be 50 million to 100 million cases of dengue virus infections worldwide every year, which result in 250,000 to 500,000 cases

of DHF and 24,000 deaths each year (Gibbons, R. V., and D. W. Vaughn. 2002. Dengue: an escalating problem. BMJ 324:1563-1566; World Health Organization. 1997. Dengue haemorrhagic fever: diagnosis, treatment, prevention and control, 2nd ed. World Health Organization, Geneva, Switzerland) Dengue virus is a mosquito-borne flavivirus and the most prevalent arbovirus in tropical and subtropical regions of the world (Gubler, D. J. 1997. Dengue and dengue hemorrhagic fever: its history and resurgence as a global public health problem, p. 1-22. In D. J. Gubler and G. Kuno (ed.), Dengue and dengue hemorrhagic fever. CAB International, New York, N. Y.). Dengue virus is a positive-stranded encapsulated RNA virus. The genomic RNA is approximately 11 kb in length and is composed of three structural protein genes that encode the nucleocapsid or core protein (C), a membrane- associated protein (M), an envelope protein (E), and seven nonstructural (NS) protein genes. The proteins are synthesized as a polyprotein of about 3,000 amino acids that is processed cotranslationally and posttranslationally by viral and host proteases (Deubel, V., R. M. Kinney, and D. W. Trent. 1988. Nucleotide sequence and deduced amino acid sequence of the nonstructural proteins of dengue type 2 virus, Jamaica genotype: comparative analysis of the full-length genome. Virology 165:234-244). There are four distinct serotypes, serotypes 1 to 4. Infection with one serotype does not provide protection from the other serotype. Instead, it is generally thought that secondary infection or infection with secondary or multiple infections with various dengue virus serotypes is a major risk factor for DHF-DSS due to antibody-dependent enhancement (Halstead, S. B. 1988. Pathogenesis of dengue: challenge to molecular biology. Science 239:476-481.). At present there is effective vaccine against this virus.

The virus causes a broad spectrum of illnesses, ranging from inapparent infection, flu-like mild undifferentiated fever, and classical DF to the more severe form, DHF-DSS, from which rates of morbidity and mortality are high (Gubler, D. J. 1998. Dengue and dengue hemorrhagic fever. Clin. Microbiol. Rev. 11 :480-496.).

Double strand RNA viruses The dsRNA viruses are polyphyletic in origin. Reoviruses are the one of the best- studied dsRNA viruses. Representatives of the family infect plants, animals and insects, and many infect an insect vector as well as an animal or plant alternate host. The viruses all have a double or triple capsid structure, the outer layer of which is stripped off during endocytotic entry. Naked core particles in the cytoplasm are able to transcribe capped and

non-polyadenylated genome-segment-length monocistronic mRNAs, via an RNA- dependent RNA polymerase (RdRp) activity associated into the cytoplasm as they are synthesized and are translated. Viral products accumulate as viroplasms: associations of viral structural and polymerase proteins and mRNAs result in assembly of immature particles, inside which mRNAs are transcribed to give negative-stranded RNA molecules with which they become base-paired. The importance of the intermediate and inner capsid proteins are illustrated in the example of rotavirus.

Double strand RNA viruses useful in the present invention include rotavirus, reovirus, mammalian orthoriovirus, and Colorado tick fever virus. Other suitable double strand RNA viruses are known to those skilled in the art. Genbank Accession numbers for exemplary double strand viral nucleoprotein nucleic acid sequences include K02086 (VP6) and X14942 (VP2).

Rotavirus Rotavirus, a member of the family Reoviridae, is an important cause of acute gastroenteritis in infants and young children (Kapikian, A.Z. 2001, Rotaviruss, p.1787- 1833. Fields virology, 4 th ed. Lippincott/The Williams & Wilkins Co., Philadelphia, Pa.). The virion is an icosahedron composed of three concentric layers of protein with a genome of 11 segments of double-stranded RNA (dsRNA) (Prasad, B.V. 1988, J. MoI. Biol. 199:269-275). The outer layer of the infectious triple-layered particle (TLP) is made up of the glycoprotein, VP7, and the spike protein, VP4. The intermediate layer is formed by VP6 trimers, and the inner layer is formed by the core lattice protein, VP2. Positioned at the vertices of the VP2 lattice are individual copies of the RNA-dependent RNA polymerase (RdRp) VPl, and the mRNA-capping enzyme VP3 (Lawton, J. A., 1997, J. Virol. 71:7353).

VP6, forms the intermediate layer of the virus, integrate the two principal functions of the virus, cell entry and endogenous transcription, through its interactions with the outer layer proteins VP7 and VP4, and the inner layer proteins VP4 and VP7, and the inner layer protein VP2. VP6 itself, despite lack of any enymatic functions, is essential for endogenous transcription of the genome. Cryo-EM studies have shown that the nascent niRNA transcripts exit specifically through the type I channels in the VP6 layer (Lawton, J. A., 2000, Adv. Virus Res. 55, 185-229). Mutational analysis based on the pseudo-atomic model of the VP6 layer further demonstrated that the proper assembly of VP 6 trimers on VP2 is an absolute requirement for endogenous transcription. The site-

specific amino acid substitution partially or completely abolished the transcriptase activity (Charpilienne, A., 2002, J. Virol. 76, 7822-7831). The exit of transcripts through the channels at the capsid layer (equivalent of VP6 layer in rotavirus) appears to be a common theme in dsRNA viruses. Regions in this layer have been shown also function as substrate sinks for the transcription reaction.

VP2 forms the innermost layer interacting with the VP6 layer on the outside and the genomic RNA on the inside. VP2 exhibits RNA-binding ability through its N-terminal residues. Through this RNA-binding property, VP2 plays an important role in maintaining the appropriate spacing between the RNA strands to allow the genomic RNA to move around the transcription complex during transcription (Pesavento, J.B., 2001, Proc. Natl. Acad. Sci. U.S.A., 98, 1381-1386). Thus, one of the principal functions of the VP2 is to direct the structural organization of the genome that is conductive for its endogenous transcription.

RNA-inducing entities- siRNA and shRNA molecules

The present invention features siRNA molecules, methods of making siRNA molecules and methods (e.g., prophylactic and/or therapeutic methods and methods for research) for using siRNA molecules. The siRNA molecule can have a length from about 10-60 or more nucleotides (or nucleotide analogs), about 15-25 nucleotides (or nucleotide analogs), or about 19-23 nucleotides (or nucleotide analogs). The siRNA molecule can have nucleotide (or nucleotide analog) lengths of about 10-20, 20-30, 30-40, 40-50, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, or 29. In apreferred embodiment, the siRNA molecule has a length of 19 nucleotides. It is to be understood that all ranges and values encompassed in the above ranges are within the scope of the present invention. Generally, long dsRNAs (over 60 nucleotides) are less preferable, as they have been found to induce cell death (termed the "interferon response") in mammalian cells, such as human cells. siRNAs can preferably include 5' terminal phosphate and a 3' short overhang of about 1 or 2 nucleotides. In a preferred embodiment, the RNAi-inducing entity can be a short hairpin siRNA (shRNA) or an expressed shRNA. Examples of such shRNAs and methods of manufacturing the same are discussed in the examples. In another embodiment, the siRNA can be associated with one or more proteins in an siRNA complex.

The siRNA molecules of the invention are provided to reduce viral gene expression in a host cell by, at least in part, binding to target viral transcripts in a manner

that results in destruction of the target viral transcript by the host cell machinery. Thus, the siRNA molecules of the invention include a sequence that is sequence sufficiently complementary to a portion of the viral nucleoprotein gene to mediate RNA interference (RNAi), as defined herein, i e., the siRNA has a sequence sufficiently specific to trigger the degradation of the target RNA by the RNAi machinery or process. The siRNA molecule can be designed such that every residue of the antisense strand is complementary to a residue in the target molecule. Alternatively, substitutions can be made within the molecule to increase stability and/or enhance processing activity of said molecule. Substitutions can be made within the strand or can be made to residues at the ends of the strand.

The target RNA cleavage reaction guided by siRNAs is highly sequence specific. In general, siRNAs containing a nucleotide sequence identical to a portion of the target gene are preferred for inhibition. As the siRNAs of the invention are generally provided as double stranded molecules, identity and complementarily of the antisense strand of the siRNA can be determined relative to the target transcript. Thus, as used herein, disclosure of a nucleic acid sequence that is identical to a portion of a nucleic acid encoding a viral nucleoprotein includes both strands of a double stranded siRNA. However, it is recognized that 100% sequence identity between the siRNA and the target gene is not required to practice the present invention. Thus the invention has the advantage of being able to tolerate sequence variations that might be expected due to genetic mutation, strain polymorphism, or evolutionary divergence. For example, siRNA sequences with insertions, deletions, and single point mutations relative to the target sequence are effective for inhibition. Alternatively, siRNA sequences with nucleotide analog substitutions or insertions are effective for inhibition. Moreover, not all positions of a siRNA contribute equally to target recognition. Mismatches in the center of the siRNA are most critical and can essentially abolish target RNA cleavage. In contrast, the 3' nucleotides of the siRNA (e.g., the 3' nucleotides of the siRNA antisense strand) typically do not contribute significantly to specificity of the target recognition. In particular, 3' residues of the siRNA sequence which are complementary to the target RNA (e.g., the guide sequence) generally are not as critical for target RNA cleavage.

It is known in the art that not all siRNAs are equally effective in reducing or inhibiting expression of any particular target gene. (See, e.g., Holen, T., et al., Nucleic Acids Res., 30(8): 1757- 1766, reporting variability in the efficacy of different siRNAs), and a variety of considerations may be employed to increase the likelihood that a selected

siRNA may be effective. For example, it may be preferable to select target portions within exons rather than introns. siRNAs may generally be designed in accordance with principles described in Technical Bulletin # 003- Revision B, "siRNA Oligonucleotides for RNAi Applications" and Technical Bulletin #4, Dharmacon Research, Inc., Lafayette, CO 80026, a commercial supplier of RNA reagents. The RNAi Technical Reference & Application Guide, from Dharmacon, contains a variety of information regarding siRNA design parameters, synthesis, etc., and is incorporated herein by reference. Additional design considerations that may also be employed are described in Semizarov, D., et ah, Proc. Natl. Acad. ScL, Vol. 100, No. 11, pp. 6347-6352. Sequence identity may be determined by sequence comparison and alignment algorithms known in the art. To determine the percent identity of two nucleic acid sequences (or of two amino acid sequences), the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in the first sequence or second sequence for optimal alignment). The nucleotides (or amino acid residues) at corresponding nucleotide (or amino acid) positions are then compared. When a position in the first sequence is occupied by the same residue as the corresponding position in the second sequence, then the molecules are identical at that position. The percent identity between the two sequences is a function of the number of identical positions shared by the sequences (i.e., % homology equals the number of identical positions divided by the total number of positions multiplied by 100), optionally penalizing the score for the number of gaps introduced and/or length of gaps introduced.

The comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm. In one embodiment, the alignment generated over a certain portion of the sequence aligned having sufficient identity but not over portions having low degree of identity (i.e., a local alignment). A • preferred, non-limiting example of a local alignment algorithm utilized for the comparison of sequences is the algorithm of Karlin & Altschul, Proc. Natl. Acad. Sci. USA 87:2264-68 (1990), modified as in Karlin & Altschul, Proc. Natl. Acad. Sci. USA 90:5873-77 (1993). Such an algorithm is incorporated into the BLAST programs (version 2.0) of Altschul, et al, J. MoI. Biol. 215:403-10 (1990). In another embodiment, the alignment is optimized by introducing appropriate gaps and percent identity is determined over the length of the aligned sequences (i. e., a gapped alignment). To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul, et al., Nucleic Acids Res. 25(17):3389-3402 (1997). In another embodiment, the

alignment is optimized by introducing appropriate gaps and percent identity is determined over the entire length of the sequences aligned (i.e., a global alignment). A so preferred, non-limiting example of a mathematical algorithm utilized for the global comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989). Such an algorithm is incorporated into the ALIGN program (version 2.0) which is part of the GCG sequence alignment software package. When utilizing the ALIGN program for comparing amino acid sequences, a PAMl 20 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be used.

Greater than 80% sequence identity, e.g., 84%, 89%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or even 100% sequence identity, between the siRNA {e.g., the antisense strand of the siRNA) and the portion of the target gene is preferred. In the context of an siRNA of about 19-25 nucleotides, e.g., at least 16-21 identical nucleotides are preferred, more preferably at least 17-22 identical nucleotides, and even more preferably at least 18-23 or 19-24 identical nucleotides. Alternatively worded, in an siRNA of about 19-25 nucleotides in length, siRNAs having no greater than about 4 mismatches are preferred, preferably no greater than 3 mismatches, more preferably no greater than 2 mismatches, and even more preferably no greater than 1 mismatch. For example, the siRNA contains an antisense strand having 1, 2, 3 or 4 mismatches with the target sequence. Alternatively, the siRNA may be defined functionally as including a nucleotide sequence (or oligonucleotide sequence) that is capable of hybridizing with a portion of the target gene transcript {e.g., 400 mM NaCl, 40 mM PIPES pH 6.4, 1 mM EDTA, 5O 0 C or 7O 0 C hybridization for 12-16 hours; followed by washing). Additional preferred hybridization conditions include hybridization at 7O 0 C in IX SSC or 5O 0 C in IX SSC, 50% formamide followed by washing at 7O 0 C in 0.3X SSC or hybridization at 7O 0 C in 4X SSC or 5O 0 C in 4X SSC, 50% formamide followed by washing at 67°C in IX SSC. The hybridization temperature for hybrids anticipated to be less than 50 base pairs in length should be 5-1O 0 C less than the melting temperature (Tm) of the hybrid, where Tm is determined according to the following equations. For hybrids less than 18 base pairs in length, Tm(°C.)=2(# of A+T bases)+4(# of G+C bases). For hybrids between 18 and 49 base pairs in length, Tm(°C)=81.5+16.6(loglO[Na+])+0.41(% G+C) (600/N), where N is the number of bases in the hybrid, and [Na+] is the concentration of sodium ions in the hybridization buffer ([Na+] for IX SSC=O.165 M). Additional examples of stringency conditions for polynucleotide hybridization are provided in Sambrook, J., et al., 1989,

Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y., chapters 9 and 11, and Current Protocols in Molecular Biology, 1995, F. M. Ausubel, et al., eds., John Wiley & Sons, Inc., sections 2.10 and 6.3-6.4, incorporated herein by reference. The length of the identical nucleotide sequences may be at least about 10, 12, 15, 17, 20, 22, 25, 27, 30, 32, 35, 37, 40, 42, 45, 47 or 50 bases. In one embodiment, the RNA molecules of the present invention are modified, such as to improve stability in serum or in growth medium for cell cultures. In order to enhance the stability, the 3'-residues may be stabilized against degradation, e.g., they may be selected such that they consist of purine nucleotides, e.g., adenosine or guanosine nucleotides. Alternatively, substitution of pyrrolidine nucleotides by modified analogues, e.g., substitution of uridine by 2'-deoxythymidine is tolerated and does not affect the efficiency of RNA interference. For example, the absence of a 2' hydroxyl may significantly enhance the nuclease resistance of the siRNAs in tissue culture medium. In a preferred embodiment of the present invention the RNA molecule may contain at least one modified nucleotide analogue (or analog). The nucleotide analogues may be located at positions where the target-specific activity, e.g., the RNAi mediating activity is not substantially affected, e.g., in a region at the 5'-end and/or the 3'-end of the RNA molecule. Particularly, the ends may be stabilized by incorporating modified nucleotide analogues. Preferred nucleotide analogues include sugar- and/or backbone- modified ribonucleotides (i.e., include modifications to the phosphate-sugar backbone). For example, the phosphodiester linkages of natural RNA may be modified to include at least one of a nitrogen or sulfur heteroatom. In preferred backbone-modified ribonucleotides the phosphoester group connecting to adjacent ribonucleotides is replaced by a modified group, e.g., of phosphorothioate group. In preferred sugar-modified ribonucleotides, the 2' OH-group is replaced by a group selected from H, OR, R, halo, SH, SR, NH 2 , NHR, NR 2 or ON, wherein R is C 1 -C 6 alkyl, alkenyl or alkynyl and halo is F, Cl, Br or I.

Also preferred are nucleobase-modified ribonucleotides, i.e., ribonucleotides containing at least one non-naturally occurring nucleobase instead of a naturally occurring nucleobase. Bases may be modified to block the activity of adenosine deaminase. Exemplary modified nucleobases include, but are not limited to, uridine and/or cytidine modified at the 5-position, e.g., 5-(2-amino)propyl uridine, 5-bromo uridine; adenosine and/or guanosines modified at the 8 position, e.g., 8-bromo guanosine; deaza nucleotides, e.g., 7-deaza-adenosine; O- and N-alkylated nucleotides, e.g., N6-

methyl adenosine are suitable. It should be noted that the above modifications may be combined.

In some embodiments, the siRNA can be modified by the substitution of at least one nucleotide with a modified nucleotide. The siRNA can have one or more mismatches when compared to the target sequence of the nucleoprotein transcript and still mediate RNAi as demonstrated in the examples below.

The ability of the nucleoprotein-directed siRNAs of the present invention to mediate RNAi is particularly advantageous considering the rapid mutation rate of some of the genes of the viruses provided herein, such as genes of an influenza virus. The inventors provide for the use of the nucleoprotein gene as an RNAi target as the inventors have recognized that the nucleoprotein gene generally has a lower rate of mutations as compared to other viral genes. Moreover, in embodiments of the invention, siRNAs are targeted towards conserved regions of the viral nucleoprotein gene. The invention contemplates several embodiments which further leverage this ability by, e.g., synthesizing patient-specific siRNAs or plasmids, and/or introducing several siRNAs staggered along the nucleoprotein gene. In one embodiment, highly and/or moderately conserved regions of the nucleoprotein gene are targeted as discussed in greater detail below. In other embodiments, a biological sample is obtained from a subject. As used herein, a biological sample is any material obtained from the subject containing a viral nucleic acid. For example, one or more of a host subject's infected cells are procured and the genome of the viral nucleoprotein gene within it sequenced or otherwise analyzed to select or synthesize one or more corresponding siRNAs, plasmids or transgenes.

Manufacture of siRNA In one embodiment, siRNAs are synthesized either in vivo or in vitro. Endogenous

RNA polymerase of the cell may mediate transcription in vivo, or cloned RNA polymerase can be used for transcription in vivo or in vitro. For transcription from a transgene in vivo or an expression construct, a regulatory region (e.g., promoter, enhancer, silencer, or splice donor and acceptor) may be used to transcribe the siRNA. Inhibition may be targeted by specific transcription in an organ, tissue, or cell type; stimulation of an environmental condition (e.g., infection, stress, temperature, chemical inducers); and/or engineering transcription at a developmental stage or age. A transgenic organism that expresses siRNA from a recombinant construct may be produced by introducing the construct into a zygote, an embryonic stem cell, or another multipotent

cell derived from the appropriate organism.

In addition, not only can an siRNA be used to cleave multiple RNAs within the cell, but the siRNAs can be replicated and amplified within a cell by the host cell enzymes. Alberts, et al., The Cell 452 (4th Ed. 2002). RNA may be produced enzymatically or by partial/total organic synthesis, any modified ribonucleotide can be introduced by in vitro enzymatic or organic synthesis. In one embodiment, a siRNA is prepared chemically. Methods of synthesizing RNA molecules are known in the art, in particular, the chemical synthesis methods as de scribed in Verma and Eckstein, Annul Rev. Biochem. 67:99-134 (1998). In another embodiment, a siRNA is prepared enzymatically. For example, a siRNA can be prepared by enzymatic processing of a long dsRNA having sufficient complementarity to the desired target RNA. Processing of long dsRNA can be accomplished in vitro, for example, using appropriate cellular lysates and ds-siRNAs can be subsequently purified by gel electrophoresis or gel filtration. In an exemplary embodiment, RNA can be purified from a mixture by extraction with a solvent or resin, precipitation, electrophoresis, chromatography, or a combination thereof. Alternatively, the RNA may be used with no or a minimum of purification to avoid losses due to sample processing.

The siRNAs can also be prepared by enzymatic transcription from synthetic DNA templates or from DNA plasmids isolated from recombinant bacteria. Typically, phage RNA polymerases are used such as T7, T3 or SP6 RNA polymerase (Milligan &

Uhlenbeck, Methods Enzymol. 180:51-62 (1989)). The RNA may be dried for storage or dissolved in an aqueous solution. The solution may contain buffers or salts to inhibit annealing, and/or promote stabilization of the single strands.

siRNA Vectors

Another aspect of the present invention includes a vector that expresses one or more siRNAs that include sequences sufficiently complementary to a portion of the nucleoprotein gene genome to mediate RNAi. The vector can be administered in vivo to thereby initiate RNAi therapeutically or prophylactically by expression of one or more copies of the siRNAs. In one embodiment, synthetic shRNA is expressed in a plasmid vector. In another, the plasmid is replicated in vivo. In another embodiment, the vector can be a viral vector, e.g., a retroviral vector. Examples of such plasmids and methods of making the same are illustrated in the examples. Use of vectors and plasmids are advantageous because the vectors can be more stable than synthetic siRNAs and thus

effect long-term expression of the siRNAs.

Some target viruses mutate rapidly and may result in a mismatch of even one nucleotide that can, in some instances, impede RNAi. Accordingly, in one embodiment, a vector is contemplated that expresses a plurality of siRNAs to increase the probability of sufficient homology to mediate RNAi. Preferably, these siRNAs are staggered along the nucleoprotein gene, or are clustered in one region of the nucleoprotein gene. For example, a plurality of siRNAs is directed towards a region of the nucleoprotein gene that is about 200 nucleotides in length and contains the 3' end of the nucleoprotein gene. In one embodiment, one or more of the siRNAs expressed by the vector is a shRNA. The siRNAs can be staggered along one portion of the nucleoprotein gene or target different portions of the nucleoprotein gene. In one embodiment, the vector encodes about 3 siRNAs, more preferably about 5 siRNAs. The siRNAs can be targeted to conserved regions of the nucleoprotein gene.

Methods of Introducing RNAs, Vectors, and Host Cells

Physical methods of introducing the agents of the present invention (e.g., siRNAs, vectors, or transgenes) include injection of a solution containing the agent, bombardment by particles covered by the agent, soaking the cell or organism in a solution of the agent, or electroporation of cell membranes in the presence of the agent. A viral construct packaged into a viral particle would accomplish both efficient introduction of an expression construct into the cell and transcription of RNA, including siRNAs, encoded by the expression construct. Other methods known in the art for introducing nucleic acids to cells may be used, such as lipid-mediated carrier transport, chemical-mediated transport, such as calcium phosphate, and the like. Thus the siRNA may be introduced along with components that perform one or more of activities, e.g., enhance siRNA uptake by the cell, inhibit annealing of the two siRNA strands to each other, stabilize the single strands, or otherwise increase inhibition of the target gene.

The agents may be directly introduced into the cell (i e., intracellularly); or introduced extracellularly into a cavity, interstitial space, into the circulation of an organism, introduced orally, by inhalation, or may be introduced by bathing a cell or organism in a solution containing the RNA. Vascular or extravascular circulation, the blood or lymph system, and the cerebrospinal fluid are sites where the agent may be introduced.

Cells may be infected with a target virus upon delivery of the agent or exposed to

the target virus after delivery of agent. The cells may be derived from or contained in any organism. The cell may be from the germ line, somatic, totipotent or pluripotent, dividing or non-dividing, parenchyma or epithelium, immortalized or transformed, or the like. The cell may be a stem cell, or a differentiated cell. Depending on the particular target gene and the dose of double stranded RNA material delivered, this process may provide partial or complete loss of function for the target gene. A reduction or loss of gene expression in at least 50%, 60%, 70%, 80%, 90%, 95% or 99% or more of targeted cells is exemplary. Inhibition of gene expression refers to the absence (or observable decrease) in the level of viral protein, RNA 5 and/or DNA. Specificity refers to the ability to inhibit the target gene without manifesting effects on other genes, particularly those of the host cell. The consequences of inhibition can be confirmed by examination of the outward properties of the cell or organism or by biochemical techniques such as RNA solution hybridization, nuclease protection, Northern hybridization, reverse transcription gene expression monitoring with a microarray, antibody binding, enzyme linked immunosorbent assay (ELISA), integration assay, Western blotting, radioimmunoassay (RIA), other immunoassays, and fluorescence activated cell analysis (FACS).

For RNA-mediated inhibition in a cell line or whole organism, gene expression is conveniently assayed by use of a reporter or drug resistance gene whose protein product is easily assayed. Such reporter genes include acetohydroxyacid synthase (AHAS), alkaline phosphatase (AP), beta galactosidase (LacZ), beta glucuronidase (GUS), chloramphenicol acetyltransferase (CAT), green fluorescent protein (GFP), horseradish peroxidase (HRP), luciferase (Luc), nopaline synthase (NOS), octopine synthase (OCS), and derivatives thereof. Multiple selectable markers are available that confer resistance to ampicillin, bleomycin, chloramphenicol, gentarnycin, hygromycin, kanamycin, lincomycin, methotrexate, phosphinothricin, puromycin, and tetracyclin. Depending on the assay, quantitation of the amount of gene expression allows one to determine a degree of inhibition which is greater than 10%, 33%, 50%, 90%, 95% or 99% as compared to a cell not treated according to the present invention. Lower doses of injected material and longer times after administration of siRNA may result in inhibition in a smaller fraction of cells (e.g., at least 10%, 20%, 50%, 75%, 90%, or 95% of targeted cells).

Quantitation of gene expression in a cell may show similar amounts of inhibition at the level of accumulation of target RNA or translation of target protein. As an example, the efficiency of inhibition may be determined by assessing the amount of gene product in

the cell; RKA may be detected with a hybridization probe having a nucleotide sequence outside the region used for the inhibitory double-stranded RNA, or translated polypeptide may be detected with an antibody raised against the polypeptide sequence of that region.

The siRNA may be introduced in an amount that allows delivery of at least one copy per cell. Higher doses (e.g., at least 5, 10, 100, 500 or 1000 copies per cell) of material may yield more effective inhibition; lower doses may also be useful for specific applications.

Diagnostic Methods and Kits The invention encompasses the recognition that RNAi-based therapy of infectious diseases, e.g., infections caused by a virus, can desirably incorporate a diagnostic step that determines whether a subject in need of treatment is infected with an infectious agent that is susceptible to inhibition by one or more RNAi-inducing entities. By "susceptible to inhibition" is meant that one or more biological activities of the infectious agent can be effectively inhibited by administration of the RNAi-inducing entity to a subject.

Preferably replication, pathogenicity, spread, and/or production of the infectious agent are inhibited. For example, preferably replication, pathogenicity, spread, or production of the agent is inhibited by at least 25% when the RNAi-inducing entity is administered to a subject at a tolerated dose. Preferably the inhibition is sufficient to produce a therapeutically useful effect.

Influenza virus is used as a non-limiting example to illustrate the diagnostic methods of the invention, which are tailored to allow the selection of an RNAi-inducing entity that is suitable for a subject suffering from an infection. However, it is understood that the methods disclosed herein are appropriate to any virus described herein or any virus that would be recognized by one skilled in the art. The selected RNAi-inducing entity may, of course, also be administered for prophylaxis, e.g., to individuals who have come in contact with the infected individual, regardless of whether those individuals have developed symptoms of infection.

The invention therefore provides methods for diagnosing virus infection and for determining whether a subject is infected with a virus. In certain embodiments the method comprises determining whether a subject is infected with a virus that is inhibited by one or more of the RNAi-inducing entities of the invention that target a viral nucleoprotein transcript. For example, a sample (e.g., sputum, saliva, nasal washings, nasal swab, throat swab, bronchial washings, broncheal alveolar lavage (BAL) fluid,

biopsy specimens, etc.) is obtained from a subject who may be suspected of having a viral infection, e.g., influenza. The sample can be subjected to one or more processing steps. Any such processed sample is considered to be obtained from the subject. The sample is analyzed to determine whether it contains a virus-specific nucleic acid, particularly a nucleoprotein transcript. A "virus-specific nucleic acid" is any nucleic acid, or its complement, that originates from or is derived from a virus and can serve as an indication of the presence of a virus in a sample and, optionally, be used to identify the strain and/or the sequence of a viral gene. The nucleic acid may have been subjected to processing steps following its isolation. For example, it may be reverse transcribed, amplified, cleaved, etc. In certain embodiments the sequence of a virus-specific nucleic acid present in the sample, or its complement, is compared with the sequence of the antisense or sense strand of an RNAi-inducing agent such as an siRNA or shRNA. The word "comparison" is used in a broad sense to refer to any method by which a sequence can be evaluated, e.g., which it can be determined whether the sequence is the same as or different to a reference sequence at one or more positions, or by which the extent of difference can be assessed.

Any of a wide variety of nucleic acid-based assays can be used. In certain embodiments the diagnostic assay utilizes a nucleic acid comprising a favorably and/or highly conserved target portion or its complement, or a fragment of the favorably and/or highly conserved portion or its complement. In certain embodiments the nucleic acid serves as an amplification primer or a hybridization probe, e.g., in an assay such as those described below.

In certain embodiments an influenza-specific nucleic acid in the sample is amplified. Isothermal target amplification methods include transcription mediated amplification (TMA), self-sustained sequence replication (3SR), Nucleic Acid Sequence Based Amplification (NASBA), and variations thereof. Detection or comparison can be performed using any of a variety of methods known in the art, e.g., amplification-based assays, hybridization assays, primer extension assays (e.g., allele-specific primer extension in which the corresponding target portions of different influenza virus strains are analogous to different alleles of a gene), oligonucleotide ligation assays (U.S. Pat. Nos. 5,185,243, 5,679,524 and 5,573,907), cleavage assays , heteroduplex tracking analysis (HTA) assays, etc. Examples include the Taqman ® assay, Applied Biosystems (U.S. Pat. No. 5,723,591). Cycling probe technology (CPT), which is a nucleic acid detection system based on signal or probe amplification rather than target amplification

(U.S. Pat. Nos. 5,011,769, 5,403,711, 5,660,988, and 4,876,187), could also be employed. Invasive cleavage assays, e.g., Invader ® assays (Third Wave Technologies), described in Eis, P. S. et al., Nat. Biotechnol. 19:673, 2001, can also be used to detect influenza- specific nucleic acids. Assays based on molecular beacons (U.S. Pat. Nos. 6,277,607; 6,150,097; 6,037, 130) or fluorescence energy transfer (FRET) may be used. Molecular beacons are oligonucleotide hairpins which undergo a conformational change upon binding to a perfectly matched template.

In certain embodiments the assay determines whether an influenza-specific nucleic acid in the sample comprises a portion that is identical to or different from a sense or antisense strand of an RNAi-inducing entity. Optionally the exact differences, if any, are identified. This information is used to determine whether the influenza virus is susceptible to inhibition by the RNAi-inducing entity. In addition to those discussed above, suitable assays for detection and/or genotyping of infectious agents are described in Molecular Microbiology: Diagnostic Principles and Practice, Persing, D.H., et al., (eds.) Washington, D.C.: ASM Press, 2004.

The diagnostic assays may employ any of the nucleic acids described herein. In certain embodiments of the invention the nucleic acid comprises a nucleic acid portion that is not substantially complementary or substantially identical to a nucleoprotein transcript. For example, the nucleic acid may comprise a primer binding site (e.g., a binding site for a universal sequencing primer or amplification primer), a hybridization tag (which may, for example, be used to isolate the nucleic acid from a sample comprising other nucleic acids), etc. In certain embodiments of the invention the nucleic acid comprises a non-nucleotide moiety. The non-nucleotide moiety may be attached to a terminal nucleotide of the nucleic acid, e.g., at the 3' end. The moiety may protect the nucleic acid from degradation. In certain embodiments the non-nucleotide moiety is a detectable moiety such as a fluorescent dye, radioactive atom, member of a fluorescence energy transfer (FRET) pair, quencher, etc. In certain embodiments the non-nucleotide moiety is a binding moiety, e.g. biotin or avidin. In certain embodiments the non- nucleotide moiety is a hapten such as digoxygenin, 2,4-Dinitrophenyl (TEG), etc. In certain embodiments the non-nucleotide moiety is a tag usable for isolation of the nucleic acid.

In certain embodiments of the invention a nucleic acid is attached to a support, e.g., a microparticle such as a bead, which is optionally magnetic. The invention further provides an array comprising a multiplicity of nucleic acids of the invention, e.g., at least

10, 20, 50, etc. The nucleic acids are covalently or noncovalently attached to a support, e.g., a substantially planar support such as a glass slide. See, e.g., U.S. Pat. Nos. 5,744,305; 5,800,992; 6,646,243.

Information obtained from experiments or from previous experience in treating a virus having a particular sequence within the nucleoprotein gene can also be used to decide whether the virus is susceptible to inhibition by a given RNAi-inducing entity or combination thereof. Susceptibility information can also include theoretical predictions based, for example, on the expected effect of any mismatches that exist between the nucleoprotein virus sequence and the antisense strand of an inhibitory agent. The invention provides diagnostic kits for detecting virus infection. Certain of the kits comprise one or more nucleic acids of the invention. Certain of the kits comprise one or more nucleic acids that can be used to detect a portion of an nucleoprotein virus transcript that comprises a preferred target portion for RNAi. The kits may comprise one or more items selected from the group consisting of: a probe, a primer, a sequence- specific oligonucleotide, an enzyme, a substrate, an antibody, a population of nucleotides, a buffer, a positive control, and a negative control. The nucleotides may be labeled. For example, one or more populations of fluorescently labeled nucleotides such as dNTPs, ddNTPs, etc. may be provided.

The probe can be a nucleic acid that includes all or part of a target portion, e.g., a highly or favorably conserved nucleoprotein target portion, or its complement, or is at least 80% identical or complementary to a target portion, e.g., 100% identical or complementary. In certain embodiments a plurality of probes are provided. The probes differ at one or more positions and can be used for determining the exact sequence of a nucleoprotein virus transcript at such positions. For example, the probes may differentially hybridize to the transcript (e.g., hybridization occurs only if the probe is 100% complementary to a target portion of the transcript). Kits of the invention can comprise specimen collection materials, e.g., a swab, a tube, etc. The components of the kit may be packaged in individual vessels or tubes which will generally be provided in a container, e.g., a plastic or styrofoam container suitable for commercial sale, together with instructions for use of the kit.

Methods of Treatment

The present invention provides for both prophylactic and therapeutic methods for

treating a subject at risk of (or susceptible to) or a subject having a virus. "Treatment", or "treating" as used herein, is defined as the application or administration of a therapeutic agent (e.g., a siRNA or vector or transgene encoding same) to a patient, or application or administration of a therapeutic agent to an isolated tissue or cell line from a patient, who has a virus with the purpose to cure, heal, alleviate, relieve, alter, remedy, ameliorate, improve or affect the virus, or symptoms of the virus. The term "treatment" or "treating" is also used herein in the context of administering agents prophylactically, e.g., to inoculate against a virus.

With regards to both prophylactic and therapeutic methods of treatment, such treatments may be specifically tailored or modified, based on knowledge obtained from the field of pharmacogenomics. "Pharmacogenomics", as used herein, refers to the application of genomics technologies such as gene sequencing, statistical genetics, and gene expression analysis to drugs in clinical development and on the market. More specifically, the term refers the study of how a patient's genes determine his or her response to a drug (e.g., a patient's "drug response phenotype", or "drug response genotype"). Thus, another aspect of the invention provides methods for tailoring an individual's prophylactic or therapeutic treatment with either the target gene molecules of the present invention or target gene modulators according to that individual's drug response genotype.

In related embodiments, a population of two or more different RNAi-inducing agents are administered to a subject, who may be a host to a virus. In one embodiment, the population of two or more RNAi-inducing agents include agents that contain guide strands whose sequences are substantially complementary (preferably 100% complementary) to the same highly conserved region from a variety of strains of a particular virus. In another embodiment, the population of two or more RNAi-inducing agents includes agents that contain guide strands whose sequences are substantially complementary (preferably 100% complementary) to different highly conserved regions from the same virus strain. In yet another embodiment, the population of two or more RNAi-inducing agents include agents that contain guide strands whose sequences are substantially complementary (preferably 100% complementary) to the same highly conserved region from a variety of strains of a particular virus, e.g., an influenza virus and RNAi-inducing agents includes agents that contain guide strands whose sequences

are substantially complementary (preferably 100% complementary) to different highly conserved regions from the same virus strain.

Prophylactic Methods In one aspect, the invention provides a method for preventing in a subject, infection with a virus or a condition associated with a viral infection, by administering to the subject a prophylactically effective agent that includes any of the siKNAs or vectors or transgenes discussed herein. Administration of a prophylactic agent can occur prior to the manifestation of symptoms characteristic of a viral infection, such that the viral infection is prevented.

In a preferred embodiment, the prophylactically effective agent is administered to the subject prior to exposure to the target virus. In another embodiment, the agent is administered to the subject after exposure to the target virus to delay or inhibit its progression, or prevent its integration into the DNA of healthy cells or cells that do not contain a pro virus. Preferably, target virus formation is inhibited or prevented. Additionally or alternatively, it is preferable that target virus replication is inhibited or prevented. In one embodiment, the siRNA degrades the target virus RNA in the early stages of its replication, for example, immediately upon entry into the cell. In this manner, the agent can prevent healthy cells in a subject from becoming infected. In another embodiment, the siRNA degrades the viral MRNA in the late stages of replication. Any of the strategies discussed herein can be employed in these methods, such as administration of a vector that expresses a plurality of siRNAs sufficiently complementary to the viral nucleoprotein gene to mediate RNAi.

Therapeutic Methods

Another aspect of the invention pertains to methods of modulating target gene expression, protein expression or activity for therapeutic purposes. Accordingly, in an exemplary embodiment, the modulatory method of the invention involves contacting a cell infected with the virus with a therapeutic agent (e.g., a siRNA or vector or transgene encoding same) that is specific for a portion of the viral genome such that RNAi is mediated. These modulatory methods can be performed ex vivo (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject). The methods can be performed ex vivo and then the products introduced to a subject (e.g.,

gene therapy).

The therapeutic methods of the invention generally include initiating RNAi by administering the agent to a subject infected with the virus (e.g., influenza). The agent can include one or more siRNAs, one or more siRNA complexes, vectors that express one or more siRNAs (including shRNAs), or transgenes that encode one or more siRNAs. The therapeutic methods of the invention are capable of reducing viral production (e.g., viral titer or provirus titer), by about 30-50-fold, preferably by about 60-80-fold, and more preferably about (or at least) 90-fold, 100-fold, 200-fold, 300-fold, 400-fold, 500-fold or 1000-fold. Additionally, the therapeutic agents and methods of the present invention can be used in co-therapy with post-transcriptional approaches (e.g., with ribozymes and/or antisense siRNAs).

Dual Prophylactic and Therapeutic Methods In a preferred method, a two-pronged attack on the target virus is effected in a subject that has been exposed to the target virus. An infected subject can thus be treated both prophylactically and therapeutically by degrading the virus during early stages of replication and prior to integration into the host cell genome, and also retards replication of the virus in cells in which the target virus has already begun to replicate. One skilled in the art can readily determine the appropriate dose, schedule, and method of administration for the exact formulation of the composition being used, in order to achieve the desired "effective level" in the individual patient. One skilled in the art also can readily determine and use an appropriate indicator of the "effective level" of the compounds of the present invention by a direct (e.g., analytical chemical analysis) or indirect (e.g., with surrogate indicators of viral infection) analysis of appropriate patient samples (e.g., blood and/or tissues).

The prophylactic or therapeutic pharmaceutical compositions of the present invention can contain other pharmaceuticals, in conjunction with a vector according to the invention, when used to therapeutically treat viral infections. Further representative examples of these additional pharmaceuticals that can be used in addition to those previously described, include antiviral compounds, immunomodulators, immunostimulants, antibiotics, and other agents and treatment regimes (including those recognized as alternative medicine) that can be employed to treat viral infections. Immunomodulators and immunostimulants include, but are not limited to, various

interleukins, CD4, cytokines, antibody preparations, blood transfusions, and cell transfusions.

Pharmaceutical Compositions The invention pertains to uses of the above-described RNAi-inducing entities for the prophylactic and therapeutic treatments of viral infection, as described infra. Accordingly, the agents of the present invention can be incorporated into pharmaceutical compositions suitable for administration. Such compositions typically comprise the agent and a pharmaceutically acceptable carrier. A pharmaceutical composition of the invention is formulated to be compatible with its intended route of administration. Examples of routes of administration include oral, by inhalation, intranasal, parenteral {e.g., intravenous, intradermal, subcutaneous, intraperitoneal, and intramuscular), transdermal (topical), and transmucosal administration. Solutions or suspensions used for parenteral, intradermal, or subcutaneous application can include the following components: a sterile diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates and agents for the adjustment of tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or plastic.

Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions (where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. For intravenous administration, suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ (BASF, Parsippany, NJ.) or phosphate buffered saline (PBS). In all cases, the composition must be sterile and should be fluid to the extent that easy syringability exists. It must be stable under the conditions of manufacture and storage and must be preserved against the contaminating action of microorganisms such as bacteria and frngi. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (e.g., glycerol, propylene glycol, and liquid polyetheylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can be maintained, e.g.,

by the use of a coating such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. Prevention of the action of microorganisms can be achieved by various antibacterial and antifungal agents (e.g., parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like). In many cases, it will be preferable to include isotonic agents (e.g., sugars, polyalcohols such as manitol, sorbitol, and sodium chloride) in the composition. Prolonged absorption of the injectable compositions can be brought about by including in the composition an agent that delays absorption (e.g., aluminum monostearate and gelatin).

Sterile injectable solutions can be prepared by incorporating the active compound in the required amount in an appropriate solvent with one or a combination of ingredients enumerated above, as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the active compound into a sterile vehicle which contains a basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, the preferred methods of preparation are vacuum drying and freeze-drying which yields a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof.

Inhalational administration means the RNAi-inducing entity is introduced directly to the respiratory system by inhalation through the nose or mouth and into the lungs. The entity is in naked form or with a delivery agent In certain embodiments the RNAi- inducing agent is administered in an amount effective to treat or prevent a condition that affects the respiratory system, such as a respiratory virus infection, while resulting in minimal absorption into the blood and thus minimal systemic delivery of the RNAi- inducing agent. In particular, the invention provides dry powder compositions containing RNAi-inducing entities that are preferably delivered in the form of an aerosol spray from a pressured container or dispenser which contains a suitable propellant, e.g., a gas such as carbon dioxide, or a nebulizer. In certain embodiments the delivery system is suitable for delivering the composition into major airways (trachea and bronchi) of a subject (e.g., an animal or human) and/or deeper into the lung (bronchioles and/or alveoli). The present invention also includes delivery of compositions comprising an RNAi-inducing entity using a nasal spray. According to certain embodiments of the invention delivery agents to facilitate nucleic acid uptake by cells in the respiratory system are included in the pharmaceutical composition. However, the inventors have also discovered that RNAi- inducing agents can effectively inhibit influenza virus when delivered to the respiratory

system via the respiratory passages in the absence of specific delivery agents. For example, RNAi-inducing agents can be delivered to the lungs as a composition that consists essentially of the RNAi-inducing agent in dry form (e.g., dry powder) or in an aqueous medium that consists essentially of water, optionally also including a salt (e.g., NaCl, a phosphate salt), buffer, and/or an alcohol, e.g., as naked siRNA or shRNA.

The invention also provides means of systemic circulatory delivery of an RNAi- inducing entity by the pulmonary circulation. For a respiratory disease it is preferable to have minimal transfer to the circulation.

Oral compositions generally include an inert diluent or an edible carrier. They can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic administration, the active compound can be incorporated with excipients and used in the form of so tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is applied orally and swished and expectorated or swallowed. Pharmaceutically compatible binding agents, and/or adjuvant materials can be included as part of the composition. The tablets, pills, capsules, troches and the like can contain any of the following ingredients, or compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, methyl salicylate, or orange flavoring.

Systemic administration can also be by transmucosal or transdermal means. For transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art, and include, for example, for transmucosal administration, detergents, bile salts, and fusidic acid derivatives. Transmucosal administration can be accomplished through the use of nasal sprays or suppositories. For transdermal administration, the active compounds are formulated into ointments, salves, gels, or creams as generally known in the art.

In one embodiment, the active compounds are prepared with carriers that will protect the compound against rapid elimination from the body, such as a controlled release formulation, including implants and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate,

polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of such formulations will be apparent to those skilled in the art. The materials can also be obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically acceptable carriers. These can be prepared according to methods known to those skilled in the art, for example, as described in U.S. Pat. No. 4,522,811.

It is especially advantageous to formulate inhalational, oral or parenteral compositions in dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete units suited as unitary dosages for the subject to be treated; each unit containing a predetermined quantity of active compound calculated to produce the desired therapeutic effect in association with the required pharmaceutical carrier. The specification for the dosage unit forms of the invention are dictated by and directly dependent on the unique characteristics of the active compound and the particular therapeutic effect to be achieved, and the limitations inherent in the art of compounding such an active compound for the treatment of individuals.

Toxicity and therapeutic efficacy of such compounds can be determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the ratio LD50/ED50. Compounds that exhibit large therapeutic indices are preferred. Although compounds that exhibit toxic side effects may be used, care should be taken to design a delivery system that targets such compounds to the site of affected tissue in order to minimize potential damage to uninfected cells and, thereby, reduce side effects.

The data obtained from the cell culture assays and animal studies can be used in formulating a range of dosage for use in humans. The dosage of such compounds lies preferably within a range of circulating concentrations that include the ED50 with little or no toxicity. The dosage may vary within this range depending upon the dosage form employed and the route of administration utilized. For any compound used in the method of the invention, the therapeutically effective dose can be estimated initially from cell culture or non-human animal assays. A dose may be foπnulated in animal models to achieve a circulating plasma concentration range that includes the EC50 (i.e., the

concentration of the test compound which achieves a half-maximal response) as determined in cell culture. Such information can be used to more accurately determine useful doses in humans. Levels in plasma may be measured, for example, by high performance liquid chromatography. The pharmaceutical compositions can be included in a container, pack, or dispenser together with instructions for administration.

EXAMPLES

Example 1: identification of viral nucleoproteins Highly conserved sites are considered to be those sites or sequences that are found to be present in a majority of all the available human influenza sequences. Variants are identified that are 19-mer sequences in human influenza isolates that are similar to the conserved 19-mer sequences, but that differ by only one or a few nucleotide changes.

These are important since RISC (RNA Induced Silencing Complex) can still initiate RNAi activity using an siRNA duplex whose guide (antisense) strand is largely complementary to the target mRNA sequence, but that has one or a few nucleotide changes relative to exact complementarity.

There are eight separate RNA segments that compose the influenza viral genome.

All analyses were done separately for each of the viral segments. Thus, for example, a search for conserved sites was performed for viral segment #1 using only sequences obtained from segment #1.

Influenza A viral sequences from each of the eight viral segments was obtained from the Influenza Sequence Database (Macken, C, Lu, H., Goodman, J., & Boykin, L., "The value of a database in surveillance and vaccine selection." in Options for the Control of Influenza IV. A.D.M.E. Osterhaus, N. Cox & A. W. Hampson (Eds.) Amsterdam: Elsevier Science, 2001, 103-106). The list was screened to remove all but full-length sequences (those with the designation "Complete Cds", or those having a length that is >95% of the Complete Cds gene lengths), so that a failure to find a 19-mer fragment match within a given target sequence would not be due to sequence truncation. Sequences were further screened to eliminate laboratory strains (with the exception of PR8 and

WNV since these strains were used in testing), because the lab strains are likely to have a higher number of artificially-induced mutations; this resulted in the removal of 2-11 sequences from each viral segment. Finally the sequences were crosschecked against the

GenBank nucleotide sequence database

(http://www.ncbi.nlni.nih.gov/entrez/query.fcgi?db=Nucleo tide) to ensure that all sequences were still current. Table 1 lists the GenBank accession numbers of the human influenza sequences that met the preceding criteria and were used in the subsequent analyses.

Table 1-1: GenBank accession numbers for PB2 sequences (segment 1) used in this analysis.

CY007474, CY002815, CY003391, CY003711, CY002687, CY006202, CY002695, CY002711, CY003383, CY006674, CY008531, CY002991, CY006434, CY002543, CY003303, CY009003, CY003695, CY006922, CY002631, CY002535, CY003311, CY006682, CY002575, CY003407, CY003007, CY003399, CY003479, CY003487, CY006178, CY006370, CY002807, CY006362, CY002703, CY003015, CY003471, CY003031, CY003295, CY003319, CY003335, CY003840, CY009243, CY001959, CY002679, CY002399, CY006426, CY006882, CY003023, CY008155, CY002407, CY002623, CY003327, CY006786, CY009179, CY009187, CY009195, CY009203, CY009211, CY009227, CY009235, CY002655, CY002647, AF258525, AF258524,

DQ249263, AF398866, CY009323, AJ564805, X15283, M38277, CY009291, CY009299, CY008995, CY009371, CY009347, CY009339, X99035, AF389115, NC_002023, J02179, DQ208309, CΫ002359, CY002639, CY003679, CY002159, CY003703, CY003768, CY006194, CY006418, CY002671, CY006402, CY006410, CY006874, CY002999, CY006394, CY002367, CY002663, CY006114, CY006754, CY003776, CY003375, CY001687, AB126635, AB126626, M73524, AY209951, AY209947, AY209948, AY209949, AY209950, AY209945, AY209946, M91713, AY209942, AY209943, AY209944, AY209941, AY209940, AY209939, M23970, AY209938, AY209937, AY209936, M81587, M73521, M81575, M81581, AY209934, AY209935, CY002015, CY002079, CY002191, CY002207, CY002455, CY003647, CY002007, CY002063, CY002463, CY002471, CY006083, CY006091, CY002495, CY003655, CY006298, CY002271, CY002023, CY002487, CY002727, CY003055, CY002247, CY002783, CY007802, CY007810, CY007818, CY008347, CY008355, CY008363, CY008371, CY008379, CY008387, CY008395, CY008403, CY008411, CY008419, CY008427, CY008435, CY008443, CY008451, CY008563, CY008571, CY008579, CY008587, CY008595, CY008603, CY008611, CY008619, CY008635, CY008643, CY008651, CY008659, CY009035, CY009043, CY009051, CY006130, CY008627, CY006162, CY002039, CY002743, CY003063, CY003351, CY006138, CY006154, CY006146, CY002071, CY002199, CY002921, CY002929, CY002976, CY002983, CY002055, CY002215, CY002231, CY002239, CY002423, CY002599, CY003039, CY003343, CY006186, CY006378, CY006386, CY000040, CY002255, CY002937, CY003663, CY002223, CY002961, CY002031, CY000768, CY002945, CY003671, CY006122, CY002953, CY002969, CY000264, CY002183, CY002263, CY002295, CY008171, CY001036, CY002719, CY002913, CY003079, CY006099, CY002431, CY002439, CY002447, CY002047, CY008211, CY002799, CY003415, CY006442, CY007282, CY007290, CY007298, CY007306, CY007314, CY007322, CY007330, CY007346, CY007354, CY007378, CY007386, CY007394, CY007410, CY007434, CY007442, CY007450, CY007458, CY007482, CY007490, CY007498, CY007514, CY007538, CY007546, CY007554, CY007562, CY008227, CY008235, CY008915, CY009259, CY009267, CY009275, CY007650, CY008195, CY000896, CY007338, CY007506, CY008243, CY007370, CY007586, CY000376, CY002287, CY007362, CY007402, CY007426, CY007466, CY007522, CY007530, CY007570, CY007578, CY008259, CY007418, CY001236, CY002791, CY008251, CY002415, CY002607, CY002615, CY002735, CY002767, CY003047, CY003423, CY008219, CY008523, CY000568, CY002087, CY002479, CY002511, CY002591, CY002775, CY002351, CY000016, CY000032, CY000128, CY000136, CY000152, CY000160, CY000176, CY000184, CY000352, CY000384, CY000519, CY001044, CY001052, CY000956, CY000760, CY001519, CY001719, CY000368, CY000972, CY000048, CY000064, CY000080, CY000088, CY000104, CY000144, CY000168, CY000255, CY000272, CY000360, CY000480, CY000512, CY000528, CY000776, CY000784, CY000792, CY000888, CY000916, CY001095, CY001260, CY001412, CY001543, CY001567, CY001631, CY001639, CY001647, CY001655, CY002111, CY000096, CY003687, CY000056, CY001028, CYOO 1071, CY001103, CY001060, CY006866, CY001292, CY006946, CY006930, CY006938, CY006954, CY006962, CY006970, CY006978, CY006986, CY006994, CY007002, CY007010, CY007018, CY007026, CY007034, CY007050, CY007066, CY007082, CY007090, CY007098, CY007106, CY007114, CY007122, CY007130, CY007138, CY007146, CY007154, CY007162, CY007178, CY007186, CY007194, CY007202, CY007218, CY007226, CY007234, CY007250, CY007266, CY007274, CY008875, CY008883, CY008891, CY008899, CY008907, CY008923, CY009027, CY009251, CY007258, CY001196, CY008203, CY001380, CY000872, CY000072, CYOOl 119, CY007042, CY007058, CY007074, CY007210, CY007242, CY007170, CY008547, CY008555, CY000024, CY000200, CY000908, CY000980, CY001020, CY001212, CY001476, CYOOl 551, CY000880, CY001428, CY000008, CYOOOl 12, CY000964, CYOOl 167, CY001220, CY001228, CY001300, CY001348, CY001468, CY001559, CY002527, CY000924, CY003135, CY003199, CY003119, CY003151, CY003159, CY003431, CY003784, CY000232, CY000288, CY000496, CY003103, CY003111, CY003132, CY003143, CY003175, CY003183, CY003191, CY003207, CY000948, CY000296, CY001735, CY001087, CY001324, CY000120, CY000216,

CY000224, CY000240, CY000248, CY000304, CY000320, CY000336, CY000344, CY000432, CY000440, CY000448, CY000536, CY000552, CY000560, CY000592, CY000800, CY000940, CY001079, CYOOl 159, CY001191, CY001244, CY001743, CY001951, CY002135, CY003167, CY003215, CY000424, CY000504, CY001332, CY001308, CY008323, CY007594, CY007602, CY007610, CY007658, CY007666, CY007674, CY007682, CY007690, CY007698, CY007706, CY007714, CY007722, CY007730, CY007738, CY007746, CY007754, CY007762, CY007770, CY007778, CY007786, CY007794, CY008291, CY008331, CY008275, CY008299, CY001444, CY008267, CY001316, CY008283, CY008307, CY008339, CY008315, CY001268, CY000400, CY000408, CY000416, CY000544, CY000632, CYOOl 111, CY001340, CY001436, CYOOl 135, CY001204, CYOOl 151, CY000932, CY002823, CY000192, CY000583, CY001175, CY003095, CY003087, CY000208, CY000280, CY000392, CY000488, CY000576, CY002335, CY001727, CY000312, CY000328, CY000455, CY003455, CY003463, CY003247, CY003287, CY000616, CY000704, CY000720, CY001004, CY003832, CY000816, CY003271, CY003279, CY003255, CY003263, CY003816, CY000472, CY000664, CY000672, CY000712, CY000744, CY000824, CY000840, CY000848, CY000856, CY000988, CY001252, CY001284, CY001404, CY001527, CY001372, CY003824, CY006666, CY008483, CY008491, CY008507, CY008763, CY008771, CY008779, CY008787, CY008795, CY008803, CY008811, CY008827, CY008851, CY008859, CY008867, CY009091, CY009107, CY009123, CY009131, CY009155, CY009163, CY009171, CY008515, CY008499, CY008755, CY008819, CY008835, CY008843, CY009083, CY009139, CY009147, CY000696, CYOOl 143, AJ293920, CY001695, CY001879, CY001903, CY001983, CY001999, CY002311, CY003615, CY003631, CY006067, CY001775, CYOOl 831, CY001847, CY001895, CY001919, CY001991, CY002303, CY002375, CY003623, CY003639, CY000624, CY000648, CY000736, CY000808, CY001943, CY003583, CY003591, CY003599, CY001935, CY003231, CY003808, CY003792, CY006906, CY000608, CY000464, CY000600, CY000680, CY000688, CY000728, CY000832, CY000996, CY001012, CY001127, CY001583, CY001663, CY001671, CY001751, CY001759, CY001767, CY001783, CY001815, CY001839, CY001863, CYOO 1871, CY001887, CY001975, CY002119, CY002143, CY002175, CY002319, CY002343, CY002583, CY001927, CY003239, CY003439, CY003575, CY003607, CY001799, CY002167, CY003223, f CY007642, CY009115, CY006075, CY003800, CY002567, CY001276, CY001703, CY000640, CY000656, CY000752, CY000864, CYOOl 183, CY001356, CY001420, CY001535, CY001855, CY001911, CY001967, CY002151, CY001615, CY001711, CY001388, CY001452, CY001460, CY001599, CY001607, CY001623, CY002519, CY002559, CY001364, CY003447, CY006170, CY003567, CY002391, CY002383, CY001484, CY001492, CY001500, CY001679, CY001807, CY001823, CY006258, CY006546, CY006290, CY006466, CY006482, CY006490, CY006498, CY006506, CY006538, CY006554, CY006562, CY006570, CY006578, CY006586, CY006594, CY006602, CY006610, CY006626, CY006642, CY006778, CY006794, CY006802, CY006810, CY008187, CY008539, CY008931, CY008947, CY008955, CY008963, CY008971, CY008979, CY008987, CY006522, CYOO 1791, CY002127, CY001575, CY001511, CY001591, CY002551, AF258841, AF258842, CY006250, CY006274, CY006266, CY006234, CY006242, CY006282, CY006450, CY006458, CY006514, CY006530, CY006618, CY006634, CY008939, CY006474, AF037417, AF483602, AF037414, AF037415, AF037416, CY002279, AF037413, U71133, U71134, U71135, CY006346, CY003759, CY006338, U71132, CY006354, CY009019, AF037412, CY003719, CY003071, CY008747, M73517, CY003519, CY008731, CY008739, CY003359, U62543, CY003551, CY002095, CY002759, CY008723, CY003543, CY003527, CY003511, CY008459, CY008675, CY008715, CY009075, CY006330, CY008179, CY003751, CY003727, CY003743, CY006322, CY006858, CY009059, CY006059, CY006762, CY007634, CY003495, CY006106, CY007626, CY008475, CY008667, CY006210, CY006898, CY006698, CY006706, CY006714, CY007618, CY006738, CY006746, CY006770, CY006850, CY006730, CY006842, CY006890, CY008699, CY008707, CY009067, CY006051, CY003735, CY006914, CY003503, CY006722, CY006826, CY006834, CY003535, CY006818, CY009011, CY002103, CY003559, M91712, CY002751, CY006314, CY008467, CY008691, CY008683, AY210149, AY210150, CY002503, CY006226, CY006690, AY210146, AY210147, AY210148, AY210143, AY210144, AY210145, AJ564804, CY006306, AY210140, AY210141, AY210142, J02140, CY008163, AF348170, AF348171, CY006218, AY210137, AY210138, AY210139, DQ360837, DQ372598, DQ138181, AY651721, AY651718, AY651719, AY818126, AB212050, AB212051, AY576380, AY576381, AF258846, AF036363, AF258837, AF258838, AF258839, AF258840, AF258843, AF258844, AF258845, AJ404632, AF084261, AF084262, AF084263, AF115290, AF115291, AF046093, DQ226172, AF258835, AF258836, AJ404630, AJ404631, AY043030

Table 1-2: GenBank accession numbers for PBl sequences (segment 2) used in this analysis. CY007473, CY002542, CY003382, CY002710, CY003390, CY002694, CY002814, CY002686, CY002990, CY003710, CY008530, CY003694, CY006201, CY006433, CY006921, CY009002, CY003302, CY002630, CY006673, CY002534, CY003310, CY006681, CY002574, CY006369, CY003478, CY003406, CY003486, CY002622, CY002702, CY001958, CY003470, CY003030, CY003398, CY002806, CY008154, CY003014, CY006177, CY006881, CY009242, CY003006, CY003334, CY006361, CY002678, CY003022, CY002398, CY003839, CY003326, CY006785, CY003318, CY006425, CY002406, CY003294, CY002646, CY002654, CY009178, CY009186, CY009194, CY009202, CY009210, CY009234, CY009226, AF258526, AF342823, AF258527, AF398865, CY009322, M25934, AJ564807, M38376, CY009290, CY009298, CY008994, M25932, CY009370, CY009346, CY009338, X99037, AF389116, NC_002021, J02178, DQ208310, CY002158, CY002638, CY006193, CY002662, CY002358, CY006393, CY006409, CY006417, CY003702, CY003678, CY006401, CY006873, CY006753, CY003767, CY002366, CY006113, CY002998, CY002670, CY001686, CY003775,

CY003374, AB126625, AB126634, M25935, AY210025, AY210021, AY210022, AY210023, AY210024, AY210019,

AY210020, AY210016, AY210017, AY210018, AY210015, AY210014, AY210013, M23972, AY210012, AY210011, AY210010, M25924, M81574, M81580, M81586, AY210008, AY210009, CY002006, CY002014, CY003646, CY006082, CY006297, CY006090, CY002038, CY002270, CY002462, CY002470, CY003654, CY007801, CY007809, CY007817, CY008346, CY008354, CY008370, CY008378, CY008386, CY008394, CY008402, CY008410, CY008418, CY008426, CY008434, CY008442, CY008450, CY008562, CY008570, CY008578, CY008586, CY008602, CY008610, CY008626, CY008650, CY008658, CY009034, CY009042, CY009050, CY006137, CY008362, CY008594, CY008618, CY008634, CY008642, CY002190, CY002454, CY006129, CY006153, CY002726, CY006145, CY002022, CY002062, CY002078, CY002486, CY002494, CY002742, CY002782, CY003062, CY003350, CY006161, CY002206, CY002246, CY003054, CY000039, CY000263, CY002070, CY002198, CY002238, CY002928, CY002982, CY002214, CY002598, CY002054, CY001035, CY006377, CY003662, CY006098, CY006385, CY002912, CY002920, CY002952, CY002968, CY002975, CY002936, CY000767, CY000375, CY002960, CY002718, CY002944, CY003670, CY007465, CY006441, CY007281, CY007289, CY007297, CY007305, CY007313, CY007321, CY007329, CY007337, CY007345, CY007353, CY007361, CY007369, CY007377, CY007385, CY007393, CY007401, CY007409, CY007417, CY007433, CY007441, CY007457, CY007481, CY007489, CY007497, CY007529, CY007537, CY007545, CY007553, CY007561, CY007649, CY008210, CY008218, CY008226, CY008234, CY008914, CY009258, CY009266, CY009274, CY001235, CY007425, CY007449, CY007513, CY007569, CY007577, CY007585, CY008194, CY008242, CY008258, CY002222, CY002230, CY007505, CY007521, CY008250, CY006185, CY003078, CY006121, CY008170, CY000567, CY000895, CY002182, CY002262, CY002286, CY002294, CY002422, CY002446, CY002774, CY003038, CY003046, CY003414, CY002086, CY002734, CY002798, CY008522, CY002254, CY002430, CY002030, CY002046, CY002414, CY002438, CY002478, CY002510, CY002590, CY002606, CY002614, CY002766, CY003342, CY003422, CY002790, CY000256, CY000007, CY000015, CY000063, CY000087, CY000103, CY000135, CY000159, CY000167, CY000175, CYOOOl 83, CY000271, CY000367, CY000383, CY000479, CY000511, CY000907, CY001043, CY001051, CYOOl 102, CY000915, CY000023, CY000031, CYOOOl 11, CY000143, CY000151, CY000199, CY000527, CY000759, CY000783, CY000971, CY001059, CY001070, CY001227, CY001299, CY001411, CY001638, CY001654, CY002350, CY000359, CY003686, CY000071, CY000047, CY000351, CY000518, CY000791, CY000871, CY000879, CY001019, CYOOl 118, CY001518, CY001566, CY000095, CY000079, CY007057, CY001027, CY006865, CY006929, CY006937, CY006945, CY006953, CY006961, CY006969, CY006977, CY006985, CY006993, CY007001, CY007009, CY007017, CY007025, CY007033, CY007041, CY007049, CY007065, CY007073, CY007081, CY007089, CY007097, CY007105, CY007113, CY007121, CY007129, CY007137, CY007145, CY007153, CY007177, CY007185, CY007193, CY007201, CY007209, CY007217, CY007225, CY007233, CY007241, CY007249, CY007257, CY007265, CY007273, CY008554, CY008874, CY008882, CY008890, CY008898, CY008906, CY008922, CY009026, CY009250, CY007161, CY007169, CY008202, CY000887, CY000979, CY001094, CY001291, CY001211, CY001475, CY008546, CY000775, CY000055, CY000127, CY000923, CY000955, CY000963, CYOOl 166, CY001219, CY001259, CY001347, CY001379, CY001427, CY001467, CY001550, CY001630, CY001646, CY001718, CY002526, CYOOl 195, CY001558, CY001542, CY002110, CY000215, CY000223, CY000239, CY000247, CY000439, CY000495, CY000503, CY000535, CY000947, CYOOOl 19, CY000231, CY000343, CY000447, CY000799, CY001086, CY001323, CY001742, CY003128, CY003134, CY003142, CY003150, CY003158, CY003198, CY003430, CY003110, CY003102, CY003182, CY003206, CY003783, CY000287, CY003174, CY000295, CY000319, CY000423, CY000431, CY000543, CY000551, CY000559, CY000591, CY000631, CYOOlI lO, CY001150, CY001158, CY001243, CY001307, CY001339, CY002134, CY000303, CY003166, CY003190, CY003214, CY007721, CY007593, CY007601, CY007609, CY007673, CY007681, CY007689, CY007697, CY007705, CY007713, CY007729, CY007737, CY007753, CY007761, CY007769, CY007777, CY007785, CY007793, CY008274, CY008290, CY008322, CY008330, CY001315, CY008282, CY007657, CY007665, CY007745, CY008266, CY008306, CY008314, CY008338, CY008298, CY000939, CY003118, CY001435, CY000335, CY000399, CY000407, CY000415, CY000931, CY001078, CYOOl 134, CYOOl 190, CY001203, CY001267, CY001331, CY001734, CY001950, CY001443, CY000207, CY000487, CY000582, CYOOl 174, CY002822, CY000327, CY003094, CY000191, CY000279, CY000391, CY000575, CY001726, CY003086, CY000311, CY002334, CY000456, CY000471, CY000615, CY000671, CY000719, CY000987, CYOO 1251, CY003246, CY003254, CY003270, CY003454, CY003462, CY003262, CY003286, CY000663, CY000695, CY000703, CY000711, CY000743, CY000847, CY001003, CY001283, CY001403, CY000839, CY003815, CY003831, CY006665, CY008482, CY008490, CY008498, CY008506, CY008514, CY008762, CY008770, CY008778, CY008786, CY008794, CY008802, CY008810, CY008826, CY008834, CY008850, CY008858, CY008866, CY009106, CY009122, CY009130, CY009138, CY009146, CY009154, CY009162, CY000855, CY008754, CY008842, CY009082, CY009090, CY009170, CY008818, CY003278, CY003823, CY000815, CY000823, CYOOl 142, CY001371, CY001526, CY000463, CY000599, CY000727, CY002142, CY000647, CY000679, CY000687, CY001582, CY001662, CY001670, CY001694, CY001750, CY001758, CY001766, CY001774, CY001782, CY001798, CY001830, CY001846, CY001862, CY001870, CY001878, CY001886, CY001894, CY001902, CY001918, CY001926, CY001934, CY001974, CY001990, CY001998, CY002166, CY002302, CY003598, CY003622, CY003230, CY003614, CY006905, CY002582, CY000607, CY000639, CY000655, CY000735, CY000807, CY000831, CYOOl 126, CY001982, CY002118, CY002318, CY002342, CY003630, CY001942, CYOOlOI l, CYOOl 182, CY001838, CY002310, CY003446, CY003574, CY003606, CY003799, CY003807, CY006066, CY006074, CY003582, CY003590, CY003791, CY007641, CY009114, CY003222, CY003638, CY002374, CY001451, CY001459, CY003238, CY003438, CY000623, CY000751, CY000863, CY000995, CY001275, CY001355, CY001363, CY001387, CY001419, CY001534, CY001598, CY001614, CY001710,

CY001814, CY001854, CY002150, CY002174, CY002518, CY002558, CY002566, CY006169, CY001606, CY001622, CY001966, CY001702, CY001910, AJ293921, CY001491, CY001499, CY001678, CY001806, CY001822, CY002382, CY002390, CY003566, CY001590, CY006465, CY006257, CY006289, CY006481, CY006489, CY006497, CY006505, CY006521, CY006545, CY006553, CY006561, CY006569, CY006577, CY006585, CY006593, CY006601, CY006609, CY006625, CY006641, CY006777, CY006793, CY006801, CY006809, CY008186, CY008930, CY008946, CY008954, CY008962, CY008970, CY008978, CY008538, CY006537, CY008986, CY002550, CY001483, CY001574, CY001790, CY002126, CY001510, AF258822, AF258823, CY006233, CY006241, CY006249, CY006265, CY006273, CY006281, CY006449, CY006457, CY006473, CY006513, CY006529, CY006617, CY006633, CY008938, AF037423, AF483601, AF037420, AF037421, AF037422, CY002278, AF037419, U71129, U71130, U71131, CY006337, CY006345, CY003758,

U71128, CY006353, CY009018, AF037418, CY003718, CY003070, CY008746, M25936, CY003518, CY008730, CY008738, CY003358, CY003550, CY002094, CY008722, CY002758, CY003510, CY003526, CY003542, CY008458, CY008674, CY008714, CY009074, CY006329, CY008178, CY003750, CY006857, CY003726, CY006321, CY003742, CY009058, CY006058, CY006761, CY007633, CY003494, CY006105, CY006209, CY007625, CY008474, CY008666, CY006897, CY006697, CY006705, CY006713, CY007617, CY006769, CY006737, CY006745, CY006849, CY006729, CY006841, CY006889, CY008698, CY008706, CY009066, CY003734, CY006050, CY006913, CY003502, CY006721, CY006825, CY006833, CY009010, CY003534, CY006817, CY003558, CY002102, CY002750, CY006313, CY008466, CY008690, CY008682, AY210283, AY210284, CY006225, CY006689, CY002502, AY210279, AY210280, AY210281, AY210277, AY210278, AY210282, AJ564806, CY006305, AY210274, AY210275, AY210276, AF348172, AF348173, J02138, CY008162, CY006217, AY210271, AY210272, AY210273, DQ360838, DQ372597, DQ138163, DQ138164, DQ138165, AY818129, DQ138159, DQ138160, AY651664, AY651667, AY651665, AY651666, AB212052, AY576392, AY576393, AF258827, AF036362, AF258818, AF258819, AF258820, AF258821, AF258824, AF258825, AF258826, AJ404633, AF046094, AF084264, AF084265, AF084266, AFl 15293, DQ226161, AF258816, AF258817, AJ404634, AY043029, M74899

Table 1-3: GenBank accession numbers for PA sequences (segment 3) used in this analysis.

DQ381564, CY007472, CY002541, CY002813, CY003381, CY002629, CY002685, CY002693, CY002709, CY002989, CY003301, CY003693, CY003709, CY006200, CY006672, CY008529, CY009001, CY006920, CY006432, CY003389, CY003309, CY002533, CY006680, CY001957, CY002677, CY003477, CY006880, CY003405, CY003317, CY002573, CY002621, CY002701, CY002805, CY003021, CY003293, CY003325, CY003333, CY003397, CY003469, CY003485, CY003838, CY006176, CY006368, CY006424, CY006360, CY003029, CY002397, CY003005, CY003013, CY009241, CY008153, CY006784, CY002405, CY002645, CY002653, CY009177, CY009185, CY009193, CY009201, CY009209, CY009225, CY009233, AF258519,

AF258518, AF398864, AF398862, CY009321, AJ605762, CY009289, CY009297, CY008993, CY009369, CY009345, CY009337, X99039, AF389117, NC_002022, X17336, DQ208311, CY002157, CY002357, CY002637, CY002365, CY003677, CY006408, CY006752, CY006416, CY006192, CY006392, CY006872, CY003701, CY006400, CY003766, CY006112, CY002661, CY002997, CY002669, CY001685, CY003774, CY003373, AB126627, AB126633, M26079, AY210007, AY210003, AY210004, AY210005, AY210006, AY210001, AY210002, AY209998, AY209999, AY210000, AY209997, AY209996, AY209995, M23974, AY209994, AY209993, AY209992, M26078, M81573, M81579, M81585, AY209990, AY209991, CY002061, CY002269, CY002469, CY003645, CY002005, CY002037, CY002461, CY002205, CY003653, CY006081, CY006089, CY006296, CY002725, CY002013, CY002189, CY002077, CY002741, CY008657, CY002485, CY008649, CY007800, CY008345, CY008353, CY008361, CY008369, CY008377, CY008385, CY008393, CY008401, CY008417, CY008425, CY008433, CY008441, CY008449, CY008577, CY008585, CY008633, CY008641, CY009033, CY009041, CY009049, CY008569, CY008609, CY007808, CY007816, CY008561, CY008593, CY008601, CY008617, CY008625, CY002453, CY008409, CY006160, CY002493, CY003061, CY002021, CY002245, CY006128, CY003349, CY006136, CY002781, CY003053, CY006144, CY006152, CY002181, CY002069, CY002197, CY002213, CY002229, CY002237, CY002261, CY002597, CY002911, CY002919, CY002981, CY002053, CY002429, CY002974, CY000038, CY000374, CY000766, CY001234, CY002045, CY002253, CY002413, CY002445, CY002927, CY002951, CY002967, CY003661, CY006184, CY006376, CY006384, CY000262, CY002935, CY000894, CY002943, CY001034, CY002421, CY002613, CY002959, CY003341, CY003669, CY008913, CY003037, CY006440, CY007280, CY007296, CY007304, CY007312, CY007320, CY007328, CY007336, CY007344, CY007352, CY007368, CY007376, CY007384, CY007392, CY007408, CY007432, CY007456, CY007480, CY007512, CY007544, CY007648, CY008193, CY008225, CY009257, CY009265, CY009273, CY007416, CY007584, CY008169, CY007424, CY007496, CY008249, CY007360, CY007520, CY002437, CY002765, CY002029, CY002717, CY007464, CY007488, CY007536, CY007560, CY008241, CY002733, CY007288, CY007440, CY007448, CY007528, CY007552, CY007568, CY007576, CY008209, CY008217, CY008233, CY008257, CY006097, CY007400, CY003077, CY007504, CY000566, CY002789, CY002085, CY002221, CY002285, CY002477, CY002589, CY002605, CY003413, CY003421, CY002293, CY008521, CY002509, CY002773, CY002797, CY003045, CY006120, CY000030, CY000006, CY000022, CY000350, CY001093, CY001549, CY001629, CY001645, CY001653, CY002349, CY000078, CY000758, CYOOl 165, CY000014, CY000046, CY000062, CY000070, CY000086, CY000094, CY000102, CY000126, CY000134, CY000150, CY000158, CY000166, CY000174, CY000182, CY000198, CY000254, CY000270, CY000358, CY000366, CY000382, CY000478, CY000510, CY000517, CY000526, CY000782, CY000790, CY000870,

CY000878, CY000906, CY000914, CY000954, CY000970, CY001018, CY001042, CY001050, CY001058, CY001069, CYOOl 117, CY001218, CY001226, CY001258, CY001290, CY001298, CY001346, CY001378, CY001410, CY001517, CY001541, CYOOOIlO, CY001026, CY000142, CY001565, CY001637, CY000886, CY001557, CY001717, CY003685, CY001466, CY006952, CY006864, CY006944, CY007200, CY006928, CY006936, CY006960, CY006968, CY006976, CY006984, CY006992, CY007000, CY007008, CY007016, CY007024, CY007032, CY007040, CY007048, CY007056, CY007064, CY007072, CY007080, CY007088, CY007096, CY007104, CY007112, CY007120, CY007128, CY007136, CY007144, CY007152, CY007168, CY007184, CY007192, CY007216, CY007224, CY007240, CY007264, CY007272, CY008201, CY008553, CY008873, CYOO8881, CY008889, CY008897, CY008905, CY008921, CY009025, CY009249, CY007232, CY007256, CY002525, CY007208, CY007248, CY007160, CY007176, CY000978, CY001210, CY001474, CY008545, CY002109, CY000054, CY000774, CY000922, CYOOI lOl, CY000962, CYOOl 194, CY001426, CY000238, CY000246, CY000550, CY000798, CY000946, CY001338, CY001733, CY003133, CY003149, CY003429, CY003782, CY003157, CY003173, CY003197, CY003205, CY000118, CY000214, CY000222, CY000230, CY000294, CY000302, CY000334, CY000342, CY000422, CY000430, CY000438, CY000446, CY000494, CY000502, CY000534, CY000558, CY000590, CY000630, CY000938, CY001077, CY001157, CY001242, CY001306, CY001322, CY001434, CYOO 1741, CY001949, CY002133, CY003101, CY003109, CY003126, CY003141, CY003189, CY001314, CY003181, CY001133, CY001109, CY001189, CY000318, CY000542, CY000930, CYOOl 149, CY007592, CY007600, CY007608, CY007656, CY007664, CY007672, CY007680, CY007688, CY007696, CY007704, CY007744, CY007760, CY007768, CY007776, CY007784, CY007792, CY008273, CY008289, CY008321, CY007752, CY008329, CY007728, CY008305, CY000414, CY007720, CY007712, CY007736, CY008337, CY008265, CY008281, CY001266, CY001330, CY008297, CY008313, CY000286, CY000398, CY001085, CY000406, CY003213, CY003165, CY001202, CY001442, CY003117, CY001725, CY000190, CY000278, CY000390, CY000486, CY000574, CY000581, CY002821, CY003093, CY000326, CYOOl 173, CY003085, CY002333, CY000310, CY000206, CY003261, CY003453, CY003245, CY000454, CY000470, QY000614, CY000670, CY000694, CY000702, CY000710, CY000718, CY000742, CY000814, CY000822, CY000846, CY000854, CY000986, CY001002, CY001250, CY001282, CY001370, CY003285, CY000662, CY003253, CY003269, CY003461, CY000838, CY001525, CY001402, CY003830, CY008481, CYOO88O1, CY009105, CY009137, CY006664, CY008489, CY008505, CY008513, CY008785, CY008793, CY008809, CY008849, CY008865, CY009129, CY009145, CY009153, CY009169, CY008497, CY008769, CY003814, CY008761, CY008753, CY008857, CY009121, CY009161, CY008777, CY008817, CY008825, CY008833, CY008841, CY009081, CY009089, CY003277, CY003822, CY001141, CY001773, CY001829, CY001845, CYOOl 861, CY001869, CY001877, CY001885, CY001917, CY001933, CY001973, CY002117, CY002173, CY003237, CY003613, CY003621, CY001581, CY001925, CY001981, CY001989, CY001997, CY002141, CY003573, CY003605, CY000462, CY000598, CY000606, CY000622, CY000646, CY000654, CY000678, CY000686, CY000726, CY000734, CY000806, CY000830, CY000994, CYOOl 125, CYOOl 181, CY001354, CY001362, CY001458, CY001693, CY001701, CY001709, CY001749, CY001757, CY001765, CY001781, CY001797, CY001813, CY001837, CY001901, CY001909, CYOO 1941, CY002165, CY002301, CY002565, CY002581, CY003229, CY003581, CY003589, CY003597, CY003637, CY006065, CY006073, CY006904, CY000638, CY001893, CY003790, CY002341, CY003629, CY001661, AJ293922, CY002309, CYOOlOlO, CY001274, CY001669, CY007640, CY009113, CY003437, CY002517, CY003445, CY001853, CY000862, CY000750, CY001533, CY002557, CY002373, CY001597, CY002149, CY002317, CY001386, CY001418, CY001605, CY001613, CYOO 1621, CY001965, CY003806, CY001450, CY003798, CY003221, CY006168, CY001805, CYOO 1821, CY002389, CY001498, CY003565, CY001490, CY001789, CY002381, CY001509, CY001677, CY006288, CY006464, CY008185, CY006256, CY006552, CY006576, CY006584, CY008945, CY008961, CY006480, CY006488, CY006496, CY006504, CY006520, CY006536, CY006544, CY006568, CY006592, CY006624, CY006640, CY006776, CY006792, CY006808, CY008929, CY008953, CY008969, CY008977, CY008985, CY006600, CY006608, CY008537, CY006800, CY001482, CY006560, CY002125, CY001573, CY001589, CY002549, AF257197, AF257198, CY006264, CY006472, CY006280, CY006232, CY006248, CY006272, CY006456, CY006528, CY006632, CY008937, CY006240, CY006448, CY006616, CY006512, AF037429, AF483603, AF037426, AF037427, AF037428, CY002277, AF037425, U71137, U71138, U71139, CY006344, CY003757, CY006336, U71136, CY006352, CY009017, AF037424, CY003717, CY008745, CY003069, CY003357, CY003517, CY008729, CY008737, CY003549, CY002093, CY008721, CY002757, CY003509, CY003525, CY003541, CY009073, CY008673, CY008713, CY008457, CY006328, CY008177, CY003749, CY003741, CY006856, CY009057, CY003725, CY006320, CY006057, CY006760, CY007632, CY006104, CY003493, CY006208, CY007624, CY008473, CY008665, CY006896, CY006696, CY006704, CY006712, CY007616, CY006736, CY006744, CY006768, CY006848, CY006840, CY008697, CY008705, CY006728, CY006888, CY009065, CY006049, CY003733, CY006912, CY003501, CY006720, CY006824, CY006832, CY009009, CY003533, CY006816, CY002101, CY002749, CY003557, CY008681, CY006312, CY008465, CY008689, AY210205, AY210206, CY002501, CY006688, CY006224, AY210202, AY210203, AY210204, AY210199, AY210200, AY210201, AJ605763, CY006304,

AY210196, AY210197, AY210198, AF348175, J02139, CY008161, AF348174, CY006216, AY210193, AY210194, AY210195, DQ360839, DQ138184, DQ138185, DQ138186, DQ138187, DQ138188, DQ372596, DQ099791, DQ099792, AY818132, AY651610, AY651611, AY651613, AY651612, AB212053, AY576404, AY576405, AF257202, AF257193, AF257194, AF257195, AF257196, AF257199, AF257200, AF257201, AJ2S9874, AJ291402, AF046095, AF084267, AF084268, AF084269, AF084270, AFl 15294, AFl 15295, AF257191, AF257192, AJ404637, AY043028

Table 1-4: GenBank accession numbers for HA sequences (segment 4) used in this analysis.

L20113, U38242, DQ265706, CY007467, CY002624, CY002808, CY002688, CY003384, CY002984, CY002680, CY003376, CY003704, CY006195, CY006667, CY006915, CY002536, CY003688, CY002704, CY003296, CY008996, CY006427, CY008524, DQ249260, CY003304, CY002528, CY006675, AY299507, AY297155, AY297156, AY299497, AY299509, AY299505, AY299495, AY299496, AY299500, AY299502, AY299504, AY299508, AY299499, AY299498, AY299506, AY297157, AY299501, AY299503, AY297154, AY299494, AJ457877, AJ517815, AJ457930, AY971011, CY003472, CY006363, CY003833, CY002568, CY002672, CY002392, CY003024, CY001952, CY002616, CY002800, CY003016, CY003288, CY003312, CY003400, CY003464, CY003480, CY006171, CY006355, CY003320, CY009236, CY008148, CY006419, CY002696, CY006779,

CY006875, CY002400, CY003008, CY003328, CY003392, CY003000, AY682833, AJ517814, AJ457871, AJ457873, AJ457872, AJ457886, AJ457865, AJ457870, AJ457888, AY971009, AJ457867, AJ457866, AJ457863, AJ457869, AJ457881, AY971006, AJ457868, AY971008, AY063229, AY971010, AY063228, CY002640, CY009172, CY009180, CY009188, CY009196, CY009204, CY009220, CY009228, CY002648, AJ457902, AJ457887, AJ457904, AJ457862, AY971007, AB043499, AB043498, AY029287, AY029288, AY029289, AY029290, AY029291,

AY029292, AY971004, AY971003, AY289929, AJ344014, AF534030, AF534031, AF534038, AJ457905, AB043496, AB043497, AF268312, AF268313, AF386775, AF386776, AF386777, AF386778, AF386780, AF386781, AF386782, AF386779, AF387491, AF342821, AJ457885, AJ457892, AJ457894, AJ457898, AB043495, AB043500, AF534025, AF534026, AF534027, AJ457891, AJ457895, AJ457897, AB043494, AY282756, AY282758, AF026153, AF026154, AF026155, AF026156, AF026157, AF026158, AF026159, AF026160, AJ457906, AJ457896, AB043492, AB043493, AF131993, AF131994, AF131995, AF386773, AF386774, AY289928, AY289930, AF398878, AJ344022, AF398875, AJ457907, AJ457899, AJ457900, AB043491, ISDNX127, U53162, U53163, AJ457901, AB043490, AF055426, AY289927, CY009316, L24362, Z54289, Z54288, AJ457908, AB043489, L19017, L19549, L33480, L33481, L33482, L33743, L33744, L33745, L33746, L33747, L33748, L33749, L33750, L33751, L33758, L33780, L19013, L19018, L19020, L19022, L19026, H9027, L20106, L20107, L20108, L20109, L20110, L20111, L20112, L20116, L20117, D13573, D13574, AB043488, D31949, L19016, L19028, L33756, Z54287, X59778, L19011, L19014, L19015, L19019, L19021, L19023, L19024, L19025, L33487, L33752, L33753, L33754, L33755, M33748, AB043487, L33485, L33486, D00841, X17224, D00406, D00407, L19012, L33483, Z54286, S62154, L33490, L33491, L33492, L33493, AJ289702, X17221, M59324, M59325, M59326, M59327, M59328, L33489, X00031, X00030, M38353, X86654, X86655, X86656, X86657, X00028, L33484, L33488, L33757, CY009284, CY009292, X00027, K01331, CY008988, AB043486, CY009364, AB043485, AB043484, CY009340, AB043483, AB043482, AB043480, AB043481, CY009332, AB043479, U02085, AF494249, AF494250, AF494247, AF494248, AF494251, AF494246, NC_002017, AF389118, J02144, J02176, U08903, U08904, AFl 17241, AFl 16575, AFl 16576, U37727, CY002360, CY003672, CY003696, CY003761, CY002992, CY002352, CY002632, CY002656, CY002152, CY006107, CY006387, CY006867, CY002664, CY006187, CY006395, CY006411, CY006403, CY006747, CY003368,

CY003769, CY001680, AB126622, AB126630, AJ457909, AJ457911, AJ457878, AJ457861, AJ457875, AJ457910, AY684125, AF503473, AF503474, AF503476, AF503477, AF503478, AF503479, AF503480, AF503481, AF503482, AF503483, AF503485, AF503486, AF503484, AF503475, D10163, L19005, L19006, L19008, L11125, L11133, AY209988, AY209989, AY209978, AY209979, AY209980, AY209981, AY209982, AY209983, AY209984, AY209985, AY209986, AY209987, AY209974, AY209975, AY209976, AY209977, D13579, D13580, AY209970,

AY209971, AY209972, AY209973, Ll 1126, AY209967, AY209968, AY209969, AY209963, AY209964, AY209965, AY209966, AY209959, AY209960, AY209961, AY209962, AY209955, AY209956, AY209957, AY209958, AY209954, AF270721, Ll 1134, AF270725, AF270726, AF270727, AF270723, AF270724, AB056699, J02127, Ll 1142, L20406, L20407, L20408, L20409, L20410, AY643085, AY643087, AY643086, AY209952, AY209953, AF270716, AF270717, AF270718, AF270719, AF270720, AF270722, AF270728, DQ265709, DQ265716, CY002000, CY002056, CY002184, CY003640, CY006076, CY006084, CY002072, CY002448, CY002456, CY006291, CY002008, CY002032, CY002200, CY002464, CY003648, CY003344, CY002240, CY002488, CY002736, CY002264, CY006123, CY003048, CY006139, CY006155, CY008628, CY008652, CY007795, CY007803, CY007811, CY008340, CY008348, CY008356, CY008372, CY008380, CY008388, CY008396, CY008404, CY008412, CY008420, CY008428, CY008436, CY008444, CY008564, CY008572, CY008580, CY008636, CY008644, CY009028, CY009036, CY009044, CY003056, CY008620, CY002776, CY008556, CY008588, CY008596, CY008604, CY008612, CY006131, CY002480, CY008364, CY006147, CY002720, CY002016, DQ265707, DQ265708, DQ265710, DQ265711, DQ265712, DQ265713, DQ265714, DQ265715, DQ265717, CY000257, CY002024, CY002064, CY002192, CY002592, DQ249261, DQ249262, CY000889, CY002040, CY002176, CY002208, CY002216, CY002224, CY002232, CY002904, CY000033, CY000761, CY001229, CY002048, CY002416, CY002600, CY006371, CY000369, CY002906, CY002914, CY002922, CY002946, CY002930, CY006115, CY006179, CY003040, CY003664, CY006379, CY008164, CY002954, CY001029, CY002408, CY002424, CY003032, CY002712, CY002760, CY002905, CY002938, CY003072, CY003656, CY008516, CY002432, CY002256, CY002962, CY002248, CY002440, CY002584, CY007283, CY007459, CY002728, CY002792, CY002288, CY006092, CY007387, CY003416, CY006435, CY007275, CY007291, CY007299, CY007307, CY007315, CY007323, CY007331, CY007347, CY007371, CY007379, CY007427, CY007435, CY007443, CY007451, CY007475, CY007483, CY007491, CY007507, CY007531, CY007539, CY007547, CY007555, CY008188, CY008204, CY008212, CY008220, CY008228, CY008908, CY009252, CY009260, CY009268, CY007419, CY007563, CY007643, CY008244, CY002280, CY007339, CY007355, CY007363, CY007403, CY007499, CY007515, CY007571, CY007579, CY008236, CY008252, CY007395, CY007523, CY007411, CY000561, CY002768, CY002784, CY002080, CY003336, CY002472, CY002504,

CY003408, CY002608, AY947474, AY963789, AY963790, AY963791, AY963792, AY963793, AY945263, AY945264, AY945265, AY945266, AY945267, AY945268, AY945269, AY945270, AY945271, AY945272, AY945273, AY945274, AY945275, AY945276, AY945277, AY945278, AY945279, AY945280, AY945281, AY945282, AY945283, AY945284, AY945285, AY945286, AY945287, AY945288, CYOOOOOl, CY000065, CY000097, CY000137, CY000161, CY000177, CY000265, CY000345, CY000361, CY000901, CY001064, CY001632, DQ249259, CY000073, CY000089, CY000377, CY001405, CY000009, CY000025, CY000057, CY000081, CY000121, CY000129, CY000145, CY000153, CY000169, CY000193, CY000353, CY000505, CY000513, CY000753, CY000777, CY000909, CY000965, CY001013, CY001021, CY001045, CYOOl 112, CY001213, CY001512, CY000017, DQ089636, CY001285, DQ086160, CY000041, CY000105, CY000249, CY000473, CY000785, CY000957, CY000973, CY001037, CY001053, CY001061, CY001221, CY002344, DQ089635, CY000881, CY000949, CY001648, DQ089637, DQ089638, DQ089639, CY003680, CY001088, CY000873, CY001293, CY007163, DQ086161, CY009020, CY001560, CY007003, CY007083, CY006859, CY006923, CY006931, CY006939, CY006947, CY006955, CY006963, CY006971, CY006979, CY006987, CY006995, CY007011, CY007019, CY007035, CY007043, CY007051, CY007059, CY007067, CY007091, CY007099, CY007107, CY007115, CY007123, CY007131, CY007139, CY007147, CY007155, CY007187, CY007195, CY007267, CY008876, CY008884, CY008892, CY008900, CY008916, CY009244, CY007179, CY007235, CY008196, CY008548, CY008868, CY007027, CYOOl 160, CY001421, CY001469, CY001544, CY001640, CY002104, CY002520, CY007243, CY007075, CY007171, CY007203, CY007211, CY007219, CY007227, CY007251, CY007259, CY001341, CY001712, CY008540, CY000049, CY000521, CY000865, CY000917, CY001096, CY001205, CY001253, CY001461, CY001536, CY001552, CY001624, DQ089634, CY000769, CY001373, DQ086159, AY531033, AY531039, AY531040, AY531041, AY531042, AY531043, AY531044, AY531045, AY531046, AY531047, AY531048, AY531049, AY531050, AY531051, AY531052, AY531053, AY531054, AY531055, AY531056, AY531057, AY531058, AY531059, AY531060, AY531061, DQ059385, DQ086157, DQ086158, AY963783, AY963784, AY963785, AY963786, AY963787, AY963788, AY963796, CYOOOl 13, CY000233, CY000241, CY000585, CY000793, CY000941, CY001080, CY003120, CY003123, CY003136, CY003200, CY003777, CY000537, CY003104, CY003144, CY000209, CY000217, CY000225, CY000281, CY000337, CY000425, CY000433, CY000441, CY000489, CY000497, CY000529, CY001728, CY001736, CY003096, CY003192, CY001317, CY001333, CY003176, CY003112, CY003168, CY003424, CY003152, CY000933, CY000289, CY000329, CY000545, CY000553, CYOOl 152, CYOOl 197, CY001325, CY002128, CY003160, CY003184, CY003208, CY001301, CY001072, CY000313, CY001144, DQ227423, CY007587, CY007595, CY007603, CY007651, CY007659, CY007667, CY007675, CY007683, CY007691, CY007699, CY007707, CY007715, CY007723, CY007731, CY007739, CY007747, CY007755, CY007763, CY007771, CY007779, CY007787, CY008284, CY008324, CY008332, CY001309, CY000625, CY001437, CY008268, CY008276, CY008292, CY001944, CY008260, CY008316, CY000297, CY000393, CY000401, CY000409, CY000417, CY000925, CY001104, CY001128, CY001184, CY001237, CY001261, CY001429, CY008308, CY008300, DQ227431, DQ227424, DQ227425, DQ227426, DQ227427, AY589647, AY589648, AY589649, AY589650, AY589651, AY589652, AY589653, AY589654, AY589655, AY589656, AY589657, AY589658, AY589659, AY589660, AY589661, DQ227429, AY884276, AY884277, AY884278, AY884279, AY884280, AY884281, AY884282, AY884283, AY884284, DQ227430, DQ227428, AY661030, AY661031, AY947476, AY377129, AY968039, AY968037, AY968040, AY968038, AY968041, CY000584, CY000481, CY000273, CYOOl 168, CY002816, CY003088, CY001720, CY000185, CY000569, CY003080, CY000385, CY000201, CY000305, CY000321, CY002328, AY661022, AY661023, AY661024, AY968029, AY968030, AY968031, CY000609, CY003256, CY000465, CY000657, CY000665, CY000705, CY000713, CY000737, CY000817, CY000849, CY000981, CY001245, CY001520, CY000997, CY001365, CY003240, CY003448, CY003456, CY003825, CY003264, CY000449, CY000841, CY001277, CY001397, CY003248, CY003272, CY003280, CY003809, CY003817, CY008844, CY000697, CY008492, CY000833, CY009140, CY006659, CY008476, CY008484, CY008500, CY008508, CY008764, CY008780, CY008788, CY008796, CY008804, CY008852, CY008860, CY009100, CY009116, CY009124, CY009132, CY009148, CY009156, CY009164, CY008748, CY008756, CY008772, CY008820, CY008828, CY008836, CY009084, CY008812, CY009076, CY000689, CY000809, CYOOl 136, AY661021, AY968024, AY968023, AF382319, AF382320,

AF382321, AF382322, AF382323, AF382324, AF382325, AF382326, AF382327, AF382328, AY035588, AY035589, AY035590, AY035591, AY035592, CY000593, CY001656, CY001688, CY001752, CY001768, CY001776, CY001824, CY001832, CY001840, CY001856, CY001864, CY001872, CY001880, CY001888, CY001896, CY001920, CY001928, CY001976, CY001984, CY002136, CY003608, CY003624, CY000601, CY001760, CY001912, CY001992, CY002160, CY000457, CY000641, CY000649, CY000681, CY000721, CY000729, CY000801, CY000825, CYOOl 176, CY001576, CY001904, CY002296, CY002560, CY002576, CY003224, CY003632, CY003592, CY001968, CY003232, CY000633, CY002168, CYOOl 120, CY001413, CY001792, CY001936, CY002112, CY002312, CY002336, CY002368, CY001744, CY003568, CY003584, CY003785, CY003801, CY006060, CY006068, CY003576, CY003440, CY003600, CY000617, CY001453, CY003616, CY000673, CY006899, CY001848, CY003793, CY001808, CY001005, CY001269, CY007635, CY009108, CY003432, CY002552, CY000745, CY001664, CY000857, CY000989, CY001349, CY001357, CY001381, CY001445, CY001528, CY001592, CY001600, CY001608, CY001616, CY002304, CY002512, CY003216, CY001696, CY006163, CY001704, CY001960, CY002144, AY531035, AJ293926, AY963782, AY963794, AY963795, AF315566, AF315564, AF315565, AY661019, AY661026, AY968020, AY968019, AY968021, AB019355, AB019356, AF382318, AY032978, CY001816, CY003560, CY002376, CY001485, CY001493, CY001784, CY001800, CY002384, CY001584, CY006283, CY008980, CY006515, CY008180, CY006251, CY006459, CY006475, CY006483, CY006491, CY006499, CY006531, CY006539, CY006547, CY006555,

CY006563, CY006571, CY006579, CY006587, CY006595, CY006603, CY006619, CY006635, CY006771, CY006787, CY006795, CY008940, CY008948, CY008956, CY008964, CY008972, CYOOl 504, CY002544, CY006803, CY008532, CY008924, CY002120, CY001672, CY001477, CY001568, AY271794, AF316818, AF316820, AF316821, AF316819, AF315560, AF315563, AF315571, AF316817, AF315561, AY661206, AY661207, AY661208, AY661209, AF315559, AF315562, CY006227, CY006243, CY006467, CY006507, CY006451, CY006235, CY006259, CY006275, CY006443, CY006523, CY006611, CY006627, CY008932, CY006267, AF363502, AF363503, AF363504, AJ311466, AF315567, AF315569, AF315570, AF315568, AY661201, AY661202, AY661203, AY661205, AY661210, AF017272, AY661193, AY661194, AY661195, AY661197, AY661198, AY661199, AY661200, AF131996, AF131997, AF131998, AF017270, AFO 17271, AB019354, AB019357, CY002272, AF534013, AF534014, AF534015, AF534016, AF534017, AF534018, AY661181, AY661182, AY661183, AY661185, AY661186, AY661187, AY661188, AY661189, AY661190, AY661191, AY661192, AY661196, U48444, U48445, U48447, U65553, U65554, U65555, U65556, U65557, U65558, U65559, U65560, CY006331, CY006339, CY003752, AY661020, AY661176, AY661175, AY661177, AY661178, AY661179, AY661204, AY661180, Z46403, Z46404, Z46405, Z46407, Z46408, AB043708, U48439, U48440, U48441, U48442, U48443, U48446, U65552, CY006347, CY009012, AY661131, AY661132, AY661133, AY661139, AY661140, AY661141, AY661142, AY661143, AY661144, AY661145, AY661146, AY661147, AY661148, AY661149, AY661150, AY661151, AY661152, AY661153, AY661154, AY661155, AY661156, AY661157, AY661158, AY661159, AY661161, AY661162, AY661163, AY661165, AY661166, AY661167, AY661168, AY661169, AY661170, AY661171, AY661172, AY661173, AY661174, AY661211, D30669, D30668, Z46393, Z46394, Z46395, Z46396, Z46397, Z46398, Z46399, Z46400, Z46401, Z46402, Z46406, Z46411, Z46412, Z46413, Z46414, Z46415, Z46416, Z46417,

AB043707, AY661160, AY661164, CY003712, U26830, AY661080, AY661082, AY661083, AY661084, AY661085, AY661086, AY661087, AY661088, AY661089, AY661090, AY661091, AY661092, AY661093, AY661094, AY661095, AY661096, AY661097, AY661108, AY661109, AY661110, AY661111, AY661112, AY661113, AY661114, AY661115, AY661116, AY661117, AY661118, AY661120, AY661121, AY661122, AY661123, AY661124, AY661126, AY661127, AY661128, AY661129, AY661130, AY661134, AY661135, AY661136,

AY661137, AY661138, D30665, D30664, Z46392, Z46410, AB043706, AY661125, AY661081, D30663, AY661075, AY661076, AY661077, AY661078, AY661079, AY661098, AY661099, AY661100, AY661101, AY661102, AY661103, AY661104, AY661105, AY661106, AY661107, AY661119, D30662, AB043705, D49967, CY003064, CY008740, AY661073, D49966, AY661069, AY661070, AY661072, AY661074, Z46409, D49965, U97740, AY661059, AY661066, D49961, AY661029, AY661057, AY661058, AY661060, AY661061, AY661062, AY661063, AY661064, AY661065, AY661067, AY661068, AY661071, X75800, D49962, Z46391, D49963, D49960, D49964, AJ252129, AJ252131, CY003512, CY008724, CY008732, CY003352, AY661054, AY661055, AY661056, AF204238, CY003544, M57644, D49959, CY002088, CY002752, CY008716, AY661053, M21648, M57632, CY003520, CY003536, CY003504, CY008668, CY008708, CY009068, CY008452, AY661049, AY661050, AY661051, AY661052, AF405211, M57631, CY006323, CY008172, CY003744, CY006851, CY003736, CY006315, CY003720, CY009052, AY661016, AF201845, AF201846, AF405209, AF405210, AF405207, CY006052, CY006755, U08858, U08859, U08905, AY661015, AY661025, AY661048, AF233691, CY007627, AY661014, AF201844, AF405206, M57630, CY006043, CY003488, CY008468, CY006203, CY008660, CY006891, CY007619, AY661046, AY661047, AF405212, J02092, AF201843, CY006691, CY006699, CY006707, CY007611, X05907, CY006763, CY006731, CY006739, CY006843, AY661011, AY661012, AY661013, AY661045, AF450246, CY006723, CY006835, CY006883, CY008692, CY008700, CY009060, AY661006, AY661007, AY661008, AY661010, AY661044, AY661009, V01086, CY006044, CY003728, AY661043, AY661028, CY006907, CY003496, CY006715, CY006819, CY006827, AY661042, AY661005, AY661017, AY661018, AY661027, CY003528, CY009004, CY006811, AY661002, AY661003, AY661004, AF201842, M54895, CY002744, CY002096, CY003552, CY006307, CY008460, CY008676, CY008684, AY661041, V01089, AY660999, AY661000, AY661001, AF201875, J02132, CY002496, CY006683, CY006219, AY660996, AY660997, AY660998, AY660995, AY660994, CY006299, AY660992, AY660993, AY661040, J02135, V01085, VOl 103, M55059, CY008156, AF348176, AF348177, AF348178, AF348179, CY006211, AY660991, AY661038, AY661039, AF201874, AB239125, DQ372591, AY555153, AY555150, AJ867074, AY818135, AY679514, AY651334, AY651335, AY651333, AY651336, AJ715872, AY720950, AB212054, AY575869, AY575870, AF102676, AF028709, AF046088, AF046096, AF046097, AF046098, AF036356, AF084279, AF084280, AF084281, AF084532, AFl 02671, AF102672, AF102673, AF102674, AF102675, AF102677, AF102678, AF102679, AF102680, AF102681, AF102682, AF028020, DQ226106, AJ404626, AJ404627, AY043019, AY043015, AY043017, AY043018 Table 1-5: GenBank accession numbers for NP sequences (segment 5) used in this analysis.

CY007470, CY002811, CY002683, CY002691, CY003379, CY003691, CY003707, CY002707, CY003299, CY002539, CY002627, CY006430, CY006918, CY008999, CY006198, CY003387, CY006670, CY008527, CY002987, CY002531, CY006678, CY003307, CY001955, CY002571, CY003403, CY003475, CY006174, CY003011, CY003395, CY003291, CY002803, CY003836, CY003315, CY003467, CY002675, CY003003, CY003027, CY006878, CY003483, CY002699, CY003323, CY008151, CY006366, CY009239, CY002395, CY002403, CY003019, CY006782, CY003331, CY006358, CY002619, CY006422, CY002651, CY002643, CY009175, CY009183, CY009199, CY009223, CY009231, CY009191, CY009207, AF258516, AF342819, AF258517, DQ249264, AF398867, L24394, CY009319, Z54292, M63755, M76602, M76610, Z54291, Z54290, AJ628066, M59329, M59330, M59331, M59332, M59333, M59334, X51972, D00599, D00600, D00603, CY009287, CY009295, M63754, M76605, M76606, M76604, CY008991, CY009367, M63751, CY009343, D00601, CY009335,

M63750, U02086, M63749, AF389119, M30746, AY744935, CY002355, CY002659, CY006750, CY003675, CY006390, CY002363, CY002155, CY006414, CY006406, CY006190, CY002667, CY003699, CY006110, CY006870, CY003764, CY002995, CY006398, CY002635, CY001683, CY003772, CY003371, AB126624, AB126632, M63753, AY210103, AY210104, AY210093, AY210094, AY210095, AY210096, AY210097, AY210098, AY210099, AY210100, AY210101, AY210102, AY210089, AY210090, AY210091, AY210092, AY210085, AY210086, AY210087, AY210088, AY210082, AY210084, AY210081, AY210083, AY210077, AY210078, AY210079, AY210080, AY210075, AY210076, M23976, AY210074, AY210072, AY210073, AY210070, AY210071, M81571, M81577, M81583, M63752, AY210066, AY210067, AY210068, AY210069, CY002003, CY002011, CY002467, CY006079, CY002059, CY006087, CY003643, CY003651, CY002451, CY002723, CY002459, CY002019, CY002075, CY002491, CY006294, CY003051, CY002203, CY002267, CY003059, CY006142, CY002779, CY008359, CY007798, CY007806, CY007814, CY008343, CY008351, CY008367, CY008375, CY008383, CY008391, CY008399, CY008407, CY008415, CY008423, CY008431, CY008439, CY008447, CY008567, CY008575, CY008583, CY008591, CY008615, CY008631, CY008639, CY008647, CY008655, CY009031, CY009039, CY009047, CY002187, CY006150, CY008559, CY008599, CY008607, CY008623, CY002483, CY006134, CY006126, CY002739, CY006158, CY002035, CY002243, CY003347, CY000036, CY000372, CY000564, CY000764, CY001032, CY001232, CY002051, CY002067, CY002043, CY002195, CY002227, CY002595, CY002925, CY002933, CY002949, CY002979, CY000260, CY006382, CY002771, CY002941, CY002179, CY002235, CY002251, CY002259, CY002611, CY002731, CY002909, CY002715, CY002763, CY002957, CY002965, CY002972, CY003075, CY006118, CY008167, CY002917, CY000892, CY002027, CY007446, CY002219, CY003659, CY003667, CY006095, CY006374, CY006438, CY002787, CY003043, CY003339, CY003411, CY008519, CY002427, CY006182, CY007326, CY008911, CY009255, CY007454, CY008207, CY008223, CY007374, CY007478, CY007558, CY009263, CY002587, CY007278, CY007286, CY007294, CY007302, CY007310, CY007334, CY007350, CY007382, CY007390, CY007430, CY007438, CY007462, CY007486, CY007494, CY007518, CY007534, CY007542, CY007550, CY007582, CY008215, CY008231, CY002211, CY007318, CY007342, CY007358, CY007366, CY007398, CY007406, CY007414, CY007502, CY007526, CY007566, CY007574, CY007646, CY008191, CY008239, CY008247, CY008255, CY009271, CY002435, CY007422, CY002507, CY002411, CY002419, CY003419, CY002795, CY007510, CY002475, CY002291, CY002443, CY002603, CY003035, CY002283, CY002083, CY000780, CY000756, CY000004, CY000020, CY000044, CY000060, CY000068, CY000076, CY000084, CY000092, CYOOOlOO, CY000132, CY000148, CY000156, CY000164, CY000172, CY000180, CY000196, CY000268, CY000348, CY000356, CY000364, CY000380, CY000476, CY000508, CY000520, CY000788, CY000904, CY000912, CY000920, CY000960, CY000968, CY001024, CY001040, CY001048, CY001056, CY001067, CY001091, CY001099, CYOOl 115, CY001208, CY001216, CY001224, CY001296, CY001408, CY001635, CYOO 1651, CY002347, CY000028, CY000524, CYOOl 016, CYOOOO 12, CY000052, CY000252, CYOO 1192, CY000108, CY001288, CY001715, CY000140, CY001515, CY001627, CY002523, CY001424, CY003683, CY007142, CY000124, CY000772, CY001344, CY006862, CY000952, CY008551, CY000976, CY007118, CY007006, CY007070, CY007102, CY007198, CY006926, CY006934, CY006942, CY006966, CY007022, CY007054, CY007110, CY007270, CY006958, CY007014, CY007046, CY007062, CY007078, CY007086, CY007094, CY007126, CY007166, CY007174, CY008543, CY009023, CY006950, CY006974, CY006982, CY006990, CY006998, CY007134, CY007150, CY007190, CY007262, CY008895, CY008903, CY009247, CY001563, CY007030, CY007182, CY007206, CY007214, CY007222, CY007230, CY007238, CY007246, CY007254, CY008199, CY008879, CY008887, CY008919, CYOOl 163, CY001555, CY007158, CY001472, CY008871, CY002107, CY000868, CY001376, CY000876, CY000884, CY001256, CY001464, CY001539, CY001547, CY001643, CY007038, CY000116, CY000212, CY000220, CY000228, CY000236, CY000244, CY000292, CY000300, CY000316, CY000332, CY000340, CY000420, CY000428, CY000436, CY000444, CY000492, CY000500, CY000532, CY000548, CY000556, CY000588, CY000628, CY000796, CY000944, CY001083, CY001147, CY001155, CY001320, CY001336, CY001731, CY001739, CY002131, CY003099, CY003115, CY003124, CY003130, CY003155, CY003203, CY003427, CY003179, CY003195, CY000284, CY001131, CY001312, CY003107, CY003780, CY001240, CY003187, CY003147, CY003139, CY000540, CY000396, CY000404, CY000928, CY001075, CYOOl 107, CYOOl 187, CY001200, CY001432, CY003171, CY001440, CY003163, CY003211, CY007774, CY001947, CY007694, CY001264, CY007742, CY007750, CY007758, CY007766, CY007590, CY007598, CY007606, CY007654, CY007662, CY007670, CY007678, CY007686, CY007702, CY007710, CY007718, CY007726, CY007734, CY007782, CY007790, CY008287, CY008327, CY008263, CY008271, CY008279, CY008303, CY008311, CY008319, CY008335, CY001328, CY000936, CY001304, CY000412, CY008295, CY001171, CY000188, CY000204, CY000276, CY000324, CY000388, CY000484, CY000572, CY000579, CY002331, CY002819, CY003091, CY001723, CY003083, CY000308, CY000452, CY000468, CY000612, CY000660, CY000668, CY000692, CY000700, CY000708, CY000716, CY000740, CY000836, CY000852, CY001248, CY001400, CY001523, CY000844, CY003459, CY003259, CY003243, CY000984, CYOOlOOO, CY000820, CY000812, CY001280, CY003451, CY003251, CY003267, CY003275, CY003283, CY003820, CY003828, CY001368, CY003812, CY009167, CY006662, CY008783, CY008791, CY008799, CY008807, CY008815, CY008855, CY009103, CY009119, CY009127, CY009135, CY009159, CY008479, CY008487, CY008503, CY008511, CY008823, CYOO8831, CY008847, CY008863, CY009087, CY009143, CY009151, CY008495, CY008751, CY008759, CY008767, CY008775, CY009079, CY008839, CYOOl 139, CY000604, CY001579, CY001763, CY000460, CY000636, CY000644, CY000652, CY000676, CY000732, CYOOl 123, CY001416, CY001667, CY001691, CY001747, CY001755, CY001771, CY001811, CY001875, CY001883, CY001891, CY001899, CY001915, CY001923, CY001931, CY001979, CY001987, CY002163, CY002371, CY003579, CY003595, CY003611, CY003619, AJ458276,

CY000596, CY000724, CY000804, CY002115, CY000684, CY001995, CY002579, AJ293924, CY003571, CY003627, CY003635, CY003603, CY006071, CY001595, CY001659, CY001699, CY001779, CY001827, CY001843, CY001859, CY001867, CY002139, CY002339, CYOOl 851, CY000620, CY000828, CY000860, CY000992, CY002299, CY002315, CY002563, CY006166, CY001352, CY001907, CY001939, CY001971, CY001272, CY001619, CY003219, CY003435, CY003788, CY003796, CY003804, CY003227, CY003587, CY003443, CY006063, CY006902, CY002171, CY001707, CY001795, CY001835, CY009111, CY007638, CY001179, CY002555, CY003235, CY000748, CY001008, CY001603, CY001963, CY002515, CY001456, CY002147, CY001611, CY001384, CY001360, CY001448, CY002307, CY001531, CY001496, CY001587, CY001803, CY002379, CY002387, CY003563, AB019359, AB019360, CY001488, CY001507, CY001819, CY001571, CY001675, CY002123, CY006254, CY006574, CY006774, CY008943, CY006462, CY006494, CY006550, CY006558, CY006638, CY006286, CY006478, CY006486, CY006502, CY006518, CY006534, CY006542, CY006566, CY006582, CY006590, CY006598, CY006606, CY006622, CY006798, CY006806, CY008183, CY008535, CY008927, CY008951, CY008959, CY008967, CY008975, CY008983, CY006790, CY001787, CY001480, CY002547, AF255748, AF255749, AJ458277, CY006454, CY006614, CY006630, CY008935, CY006230, CY006238, CY006246, CY006262, CY006270, CY006278, CY006446, CY006470, CY006510,

CY006526, AF038259, AF483604, AF038256, AF038257, AF038258, AB019358, AB019361, CY002275, AF038255, U71145, U71146, U71147, CY006342, CY003755, CY006334, U71144, AY936880, CY006350, CY009015, AF038254, CY003715, L07357, CY008743, CY003067, L07356, L07355, L07353, L07354, L07372, L07373, L07374, L07369, L07370, CY003515, CY003355, CY008727, CY008735, L07368, L07371, L07366, L07367, CY003547, L07365, CY002091, CY008719, CY002755, CY003523, CY003539, L07364, CY003507, CY008455, CY008671, CY008711, CY009071, L07362, L07363, CY006326, CY008175, CY003747, M22577, CY003739, CY006318, CY003723, CY006854, CY009055, CY006055, CY006758, CY007630, L07352, L07360, L07361, CY003491, CY006102, CY008663, CY008471, CY006206, CY006894, CY007622, L07350, L07351, CY006694, CY006702, CY006710, CY007614, D00602, L07359, CY006742, CY006734, CY006766, CY006846, L07349, CY006726, CY006838, CY006886, CY008695, CY008703, CY009063, AF072545, CY006047, L07358, CY003731, CY003499, CY006910, CY006822, CY006718, CY006830, L07346, L07347, CY003531, CY009007, CY006814, CY002099, D00051, L07345, CY003555, CY002747, CY006310, CY008463, CY008687, CY008679, L07342, L07343, L07344, AY210234, AY210235, AY210236, AY210237, AY210238, CY006222, CY002499, L07341, CY006686, AY210230, AY210231, AY210232, AY210233, AY210226, AY210227, AY210228, AY210229, CY006302, AY210221, AY210222, AY210223, AY210224, AY210225, J02137, CY008159, AF348180, AF348183, CY006214, L07340, AF348181, AY210207, AY210208, AY210209, AY210210, AY210211, AY210212, AY210213, AY210214, AY210215, AY210216, AY210217, AY210218, AY210219, AY210220, AF348182, AY936881, DQ372594, DQ360840, DQ099781, DQ099783, DQ099782, DQ099780, DQ099779, DQ099775, AJ867076, AY818138, DQ099776, AY651498, AY651499, AY651501, AY651500, AB212055, AY575905, AY575906, AF255753, AF036359, AP255744, AF255745, AF255746, AF255747, AF255750, AF255751, AF255752, AJ289873, AJ291400, AJ291401, AF028710, AF046092, AF084276, AF084277, AF084278, AFl 15284, AFl 15285, DQ226139, AF255742, AF255743, AJ289872, AJ289871, AY043026

Table 1-6: GenBank accession numbers for NA sequences (segment 6) used in this analysis. CY007469, CY002538, CY002626, CY002682, CY002690, CY002706, CY002810, CY002986, CY003298, CY003378, CY003386, CY003690, CY003706, CY006197, CY006429, CY006669, CY006917, CY008526, CY008998, AJ518100, AJ518101, AY297140, AY297141, AY297142, AY297143, AY297144, AY297145, AY297146, AY297147, AY297148, AY297149, AY297150, AY297151, AY297152, AY297153, AY310407, AY310408, AY310409, AY310410, AY310411, AY310412, CY002530, CY003306, CY006677, DQ249254, DQ249256, DQ249258, AJ518091, CY001954, CY002394, CY002402, CY002570, CY002618, CY002674, CY002698, CY002802, CY003002, CY003010, CY003018, CY003026, CY003290, CY003314, CY003322, CY003330, CY003394, CY003402, CY003466, CY003474, CY003482, CY003835, CY006173, CY006357, CY006365, CY006421, CY006781, CY006877, CY008150, CY009238, AJ518093, AJ518095, CY002642, CY002650, CY009174, CY009182, CY009190, CY009198, CY009206, CY009222, CY009230, AJ518092, DQ249252, AF342820, AJ518098, AJ518096, AJ518099, AY904335, AF398868, AF398869, AF398872, AJ518097, AJ518104, U53166, D31945, CY009318, D31950, D31944, D31948, AJ006954, X15281, AF494254, M38335, CY009286, CY009294, K02018, M38309, AF250363, M27970, CY008990, CY009366, CY009342, CY009334, AF250357, AF494253, AY122327, DQ345546, Y14193, AF250365, AF494252, AY122326, AF389120, NC_002018, J02177, L25815, L25816, L25817, AF250356, CY002154, CY002354, CY002362, CY002634, CY002658, CY002666, CY002994, CY003674, CY003698, CY003763, CY006109, CY006189, CY006389, CY006397, CY006405,

CY006413, CY006749, CY006869, AB126623, AB126631, AJ457943, AJ489848, AJ489849, AJ489850, AJ489851, CY001682, CY003370, CY003771, AF503463, AF503464, AF503465, AF503466, AF503467, AF503468, AF503469, AF503470, AF503471, AF503472, AJ457944, AJ489846, AJ489847, D10164, AB101673, AB124657, AY209932, AY209933, AY209922, AY209923, AY209924, AY209925, AY209926, AY209927, AY209928, AY209929, AY209930, AY209931, K01393, AY209918, AY209919, AY209920, AY209921, AY209914, AY209915, AY209916, AY209917, AB101672, AB124656, AY209912, AY209913, AY209910, AY209911, AY209906, AY209907, AY209908, AY209909, AY209904, AY209905, AB124655, AY209903, AY209901, AY209902, AY209899, AY209900, AB101671, AB124653, AB124654, AY209895, AY209896, AY209897, AY209898, AY643088, AY643089, J02156, L37329, L37330, L37331, CY002002, CY002010, CY002018, CY002034, CY002058, CY002074, CY002186, CY002202, CY002242, CY002266, CY002450, CY002458, CY002466, CY002482, CY002490, CY002722, CY002738, CY002778, CY003050, CY003058, CY003346, CY003642, CY003650,

CY006078, CY006086, CY006125, CY006133, CY006141, CY006149, CY006157, CY006293, CY007797, CY007805, CY007813, CY008342, CY008350, CY008358, CY008366, CY008374, CY008382, CY008390, CY008398, CY008406, CY008414, CY008422, CY008430, CY008438, CY008446, CY008558, CY008566, CY008574, CY008582, CY008590, CY008598, CY008606, CY008614, CY008622, CY008630, CY008638, CY008646, CY008654, CY009030, CY009038, CY009046, CY000035, CY000259, CY000371, CY000563, CY000763, CY000891, CY001031, CY001231, CY002026, CY002042, CY002050, CY002066, CY002082, CY002178, CY002194, CY002210, CY002218, CY002226, CY002234, CY002250, CY002258, CY002282, CY002290, CY002410, CY002418, CY002426, CY002434, CY002442, CY002474, CY002506, CY002586, CY002594, CY002602, CY002610, CY002714, CY002730, CY002762, CY002770, CY002786, CY002794, CY002908, CY002916, CY002924, CY002932, CY002940, CY002948, CY002956, CY002964, CY002971, CY002978, CY003034, CY003042, CY003074, CY003338, CY003410, CY003418, CY003658, CY003666, CY006094, CY006117, CY006181, CY006373, CY006381, CY006437, CY007277, CY007285, CY007293, CY007301, CY007309, CY007317, CY007325, CY007333, CY007341, CY007349, CY007357, CY007365, CY007373, CY007381, CY007389, CY007397, CY007405, CY007413, CY007421, CY007429, CY007437, CY007445, CY007453, CY007461, CY007477, CY007485, CY007493, CY007501, CY007509, CY007517, CY007525, CY007533, CY007541, CY007549, CY007557, CY007565, CY007573, CY007581, CY007645, CY008166, CY008190, CY008206, CY008214, CY008222, CY008230, CY008238, CY008246, CY008254, CY008518, CY008910, CY009254, CY009262, CY009270, DQ249255, DQ249257, AY531006, AY531007, AY531008, AY531009, AY531010, AY531011, AY531012, AY531013, AY531014, AY531015, AY531016, AY531017, AY531018, AY531019, AY531020, AY531021, AY531022, AY531023, AY531024, AY531025, AY531026, AY531027, AY531028, AY531034, CY000003, CYOOOOI l, CY000019, CY000027, CY000043, CY000051, CY000059, CY000067, CY000075, CY000083, CY000091, CY000099, CY000107, CY000123, CY000131, CY000139, CY000147, CY000155, CY000163, CY000171, CY000179, CY000195, CY000251, CY000267, CY000347, CY000355, CY000363, CY000379, CY000475, CY000507, CY000515, CY000523, CY000755, CY000771, CY000779, CY000787, CY000867, CY000875, CY000883, CY000903, CY000911, CY000919, CY000951, CY000959, CY000967, CY000975, CY001015, CY001023, CY001039, CY001047, CY001055, CY001063, CY001066, CY001090, CY001098, CYOOl 114, CYOOl 162, CY001207, CY001215, CY001223, CY001255, CY001287, CY001295, CY001343, CY001375, CY001407, CY001423, CY001463, CY001471, CY001514, CY001538, CY001546, CY001554, CY001562, CY001626, CY001634, CY001642, CY001650, CY001714, CY002106, CY002346, CY002522, CY003682, CY006861, CY006925, CY006933, CY006941, CY006949, CY006957, CY006965, CY006973, CY006981, CY006989, CY006997, CY007005, CY007013, CY007021, CY007029, CY007037, CY007045, CY007053, CY007061, CY007069, CY007077, CY007085, CY007093, CY007101, CY007109, CY007117, CY007125, CY007133, CY007141, CY007149, CY007157, CY007165, CY007173, CY007181, CY007189, CY007197, CY007205, CY007213, CY007221, CY007229, CY007237, CY007245, CY007253, CY007261, CY007269, CY008198, CY008542, CY008550, CY008870, CY008878, CY008886, CY008894, CY008902, CY008918, CY009022, CY009246, DQ059384, DQ090706, DQ090707, DQ090708, DQ090709, DQ090710, DQ091199, AY589662, AY589663, AY589664, AY589665, AY589666, AY589667, AY589668, AY589669, AY589670, AY589671, AY589672, AY589673, AY589674, AY589675, AY589676, CY000115, CY000211, CY000219, CY000227, CY000235, CY000243, CY000283, CY000291, CY000299, CY000315, CY000331, CY000339, CY000395, CY000403, CY000411, CY000419, CY000427, CY000435, CY000443, CY000491, CY000499, CY000531, CY000539, CY000547, CY000555, CY000587, CY000627, CY000795, CY000927, CY000935, CY000943, CY001074, CY001082, CY001106, CY001130, CY001146, CY001154, CY001186, CY001199, CY001239, CY001263, CY001303, CY001311, CY001319, CY001327, CY001335, CYOO 1431, CY001439, CY001730, CY001738, CY001946, CY002130, CY003098, CY003106, CY003114, CY003122, CY003129, CY003138, CY003146, CY003154, CY003162, CY003170, CY003178, CY003186, CY003194, CY003202, CY003210, CY003426, CY003779, CY007589, CY007597, CY007605, CY007653, CY007661, CY007669, CY007677, CY007685, CY007693, CY007701, CY007709, CY007717, CY007725, CY007733, CY007741, CY007749, CY007757, CY007765, CY007773, CY007781, CY007789, CY008262, CY008270, CY008278, CY008286, CY008294, CY008302, CY008310, CY008318, CY008326, CY008334, DQ227446, U42776, AJ457958, AJ457959, AJ457960, AJ457961, CY000187, CY000203, CY000275, CY000307, CY000323, CY000387, CY000483, CY000571, CY000578, CYOOl 170, CY001722, CY002330, CY002818, CY003082, CY003090, DQ249253, AJ307599, AJ307600, AJ307601, AJ307602, AJ307603, AJ307604, AJ307605, AJ307606, AJ307607, AJ307608, AJ307609, AJ307610, AJ307611, AJ307612, AJ307613, AJ307614, AJ307615, AJ307616, AJ307617, AJ307618, AJ307619, AJ307620, AJ307621, AJ307622, AJ307623, AJ307624, AJ307625, AJ307626, AJ307627, AJ307628, AJ307629, AJ457933, AJ457934,

AJ457956, AJ457957, AJ457962, AJ457963, CY000451, CY000467, CY000611, CY000659, CY000667, CY000691, CY000699, CY000707, CY000715, CY000739, CY000811, CY000819, CY000835, CY000843, CY000851, CY000983, CY000999, CYOOl 138, CY001247, CY001279, CY001367, CY001399, CY001522, CY003242, CY003250, CY003258, CY003266, CY003274, CY003282, CY003450, CY003458, CY003811, CY003819, CY003827, CY006661, CY008478, CY008486, CY008494, CY008502, CY008510, CY008750, CY008758, CY008766, CY008774, CY008782, CY008790, CY008798, CY008806, CY008814, GY008822, CY008830, CY008838, CY008846, CY008854, CY008862, CY009078, CY009086, CY009102, CY009118, CY009126, CY009134, CY009142, CY009150, CY009158, CY009166, AF316808, AF316810, AF382329, AF382330, AF382331, AF382332, AF386761, AF386762, AF386763, AF386764, AF534001, AJ293923, AJ457931, AJ457935, AJ457936, AJ457937, AJ457938, AJ457964, AJ457965, AJ457966, AY531036, CY000459, CY000595, CY000603, CY000619, CY000635, CY000643, CY000651, CY000675, CY000683, CY000723, CY000731, CY000747, CY000803, CY000827, CY000859, CY000991, CY001007, CY001122, CYOOl 178, CY001271, CY001351, CY001359,

CY001383, CY001415, CY001447, CY001455, CY001530, CY001578, CY001594, CY001602, CY001610, CY001618, CY001658, CY001666, CY001690, CY001698, CY001706, CY001746, CY001754, CY001762, CY001770, CY001778, CY001794, CY001810, CY001826, CY001834, CY001842, CY001850, CY001858, CY001866, CY001874, CY001882, CY001890, CY001898, CY001906, CY001914, CY001922, CY001930, CY001938, CY001962, CY001970, CY001978, CY001986, CY001994, CY002114, CY002138, CY002146, CY002162, CY002170, CY002298, CY002306, CY002314, CY002338, CY002370, CY002514, CY002554, CY002562, CY002578, CY003218, CY003226, CY003234, CY003434, CY003442, CY003570, CY003578, CY003586, CY003594, CY003602, CY003610, CY003618, CY003626, CY003634, CY003787, CY003795, CY003803, CY006062, CY006070, CY006165, CY006901, CY007637, CY009110, AF316805, AF316815, AF316816, AF533730, AF533731, AF533732, AF533733, AF533734, AF533735, AF533736, AF533737, AF533738, AF533739, AF533740, AF533741, AF533742, AF533743, AF533744, AF533745, AF533746, AF533747, AF533748, AF533749, AF533750, AJ316063, AJ457932, AJ457939, AJ457940, AJ457941, AY271795, CY001479, CY001487, CY001495, CY001506, CY001570, CY001586, CY001674, CY001786, CY001802, CY001818, CY002122, CY002378, CY002386, CY002546, CY003562, CY006253, CY006285, CY006461, CY006477, CY006485, CY006493, CY006501, CY006517, CY006533, CY006541, CY006549, CY006557, CY006565, CY006573, CY006581, CY006589, CY006597, CY006605, CY006621, CY006637, CY006773, CY006789, CY006797, CY006805, CY008182, CY008534, CY008926, CY008942, CY008950, CY008958, CY008966, CY008974, CY008982, AF038264, AF330819, AF533999, AF534000, AJ291403, CY006229, CY006237, CY006245, CY006261, CY006269, CY006277, CY006445, CY006453, CY006469, CY006509, CY006525, CY006613, CY006629, CY008934, AF038262, AF038263, AF038265, AF533995, AF533996, AF533997, AF533998, AJ457942, AF038261, AF533989, AF533990, AF533991, AF533992, AF533993, AF533994, AJ457945, CY002274, U51245, U51246, U51247, U71141, U71142, U71143, AJ457946, CY003754, CY006333, CY006341, U43422, U43423, U43424, U43425, U43426, U71140, AF038260, CY006349, CY009014, U42777, U42778, U42779, U42780, U43417, U43418, U43419, U43420, U43421, CY003714, U42770, U42771, U42772, U42773, U42774, U42775, U43427, U42637, CY003066, CY008742, U42636, U42635, CY003354, CY003514, CY008726, CY008734, U42634, CY00354jS,

U42633, CY002090, CY002754, CY008718, CY003506, CY003522, CY003538, CY008454, CY008670, CY008710, CY009070, U42632, CY003746, CY006325, CY008174, CY003722, CY003738, CY006317, CY006853, CY009054, AB124664, CY006054, CY006757, CY007629, CY003490, CY006101, CY006205, CY006893, CY007621, CY008470, CY008662, KOl 150, CY006693, CY006701, CY006709, CY007613, AB124662, AB124663, CY006733, CY006741, CY006765, CY006845, AB124661, CY006725, CY006837, CY006885, CY008694, CY008702, CY009062, CY003730, CY006046, CY003498, CY006717, CY006821, CY006829, CY006909, AB124660, CY003530, CY006813, CY009006, AB124659, AY210132, AY210133, AY210134, AY210135, AY210136, CY002098, CY002746, CY003554, CY006309, CY008462, CY008678, CY008686, J02168, AB101675, AY210128, AY210129, AY210130, AY210131, CY002498, CY006221, CY006685, U42631, AY210124, AY210125, AY210126, AY210127, AY210119, AY210120, AY210121, AY210122, AY210123, CY006301, AB101674, AB124658, AF348184, AF348185, AF348186, AF348187, AY210105, AY210106, AY210107, AY210108, AY210109, AY210110, AY210111, AY210112, AY210113, AY210114, AY210115, AY210116, AY210117, AY210118, CY006213, CY008158, J02136, U42630, AB239126, AY555151, AB212056, AY575881, AF028708, AF036357, DQ226128, AJ404628, AJ404629, AY043024

Table 1-7: GenBank accession numbers for MP sequences (segment 7) used in this analysis. AJ298948, X59240, DQ299489, CY007468, CY002985, CY002537, CY002625, CY002681, CY002705, CY002809, CY003297, CY003385, CY003689, CY008525, CY003377, CY008997, CY002689, CY003705, CY006196, CY006668, CY006916, CY006428, DQ249267, CY002529, CY006676, CY003305, CY001953, CY002569, CY003025, CY003473, CY006780, CY006876, CY002393, CY002401, CY002617, CY002673, CY002697, CY002801, CY003009, CY003017, CY003289, CY003313, CY003321, CY003465, CY003481, CY003834, CY006356, CY006364, CY006420, CY008149, CY003329, CY003401, CY003393, CY006172, CY003001, CY009237, CY002649, CY002641, CY009173, CY009181, CY009189, CY009197, CY009229, CY009205, CY009221, AF258523, AF342818, AF258522, DQ249265, AF398876, U53168, U53169, CY009317, M63521, AJ298947, M54941, X53029, CY009285, CY009293, CY008989, CY009365, CY009341, CY009333, X08091, U02084, AF389121, NC_002016, L25814, L25818, M19374, M23920, X08088, X08089, AY130766, CY002353, CY002361, CY006388, CY002993, CY002665, CY002657, CY006868, CY002153, CY002633, CY003673, CY003697, CY003762, CY006188, CY006404, CY006748, CY006396, CY006412, CY006108, CY001681, CY003770, AB126629, AB126637, CY003369, AF231358, AF231359, M63531, AY210065, AY210055, AY210056, AY210057, AY210058, AY210059, AY210060, AY210061, AY210062, AY210063, AY210064, AY210051, AY210052, AY210053, AY210054, AY210047, AY210048, AY210049, AY210050, AY210044, AY210045, AY210046, AY210041, AY210042, AY210043, AY210037, AY210038, AY210039, AY210040, AY210035, AY210036, M23978, AY210032, AY210033, AY210034, AY210030, AY210031, M81570, M81576, M81582, AY210026, AY210027, AY210028, AY210029, X08093, CY002009, CY002033, CY002057, CY002073, CY002185, CY002201, CY002457, CY002777, CY003049, CY006077, CY006085, CY006132, CY006140, CY006148, CY006156, CY006292, CY002001, CY002241, CY002449, CY002721, CY003345, CY002017, CY002265, CY002465, CY002489, CY002737, CY003649, CY003641, CY002481, CY003057, CY006124, CY007796, CY008405, CY009029, CY007804, CY007812, CY008365, CY008373, CY009045, CY008557, CY008349, CY008357, CY008605, CY008341, CY008381, CY008389, CY008397, CY008413, CY008421, CY008429, CY008437, CY008445, CY008573, CY008581, CY008589, CY008597, CY008613, CY008621, CY008629, CY008637, CY008645, CY009037, CY008565, CY008653, CY002065, CY002081, CY002177, CY002217,

CY002225, CY002233, CY002249, CY002281, CY002409, CY002417, CY002585, CY002601, CY002713, CY002907, CY002915, CY002931, CY002939, CY002947, CY002955, CY002963, CY002970, CY002977, CY003041, CY003665, CY006372, DQ249268, CY002425, CY002433, CY002505, CY002729, CY003033, CY000034, CY000258, CY000370, CY000562, CY000762, CY000890, CY001030, CY001230, CY002025, CY002041, CY002049, CY002209, CY002257, CY002289, CY002441, CY002473, CY002593, CY002609, CY002769, CY002923, CY003409, CY003657, CY006180, CY002193, CY002785, CY006116, CY008517, CY006380, CY008165, CY002761, CY002793, CY003073, CY003337, CY003417, CY007292, CY007388, CY007508, CY006436, CY007300, CY007308, CY007380, CY007436, CY007444, CY007476, CY007492, CY007556, CY006093, CY007276, CY007332, CY007348, CY007372, CY007404, CY007452, CY007460, CY007484, CY007532, CY007548, CY007564, CY007580, CY008205, CY008213, CY009261, CY007364, CY008221, CY007356, CY007644, CY009253, CY007284, CY007324, CY007412, CY007420, CY007428, CY007524, CY007540, CY007572, CY008229, CY008245, CY008253, CY008909, CY007316, CY007340, CY008189, CY009269, CY007500, CY008237, CY007396, CY007516, CY001633, CY003681, DQ098266, DQ098267, CY000194, CY000002, CYOOOOlO, CY000018, CY000026, CY000042, CY000050, CY000058, CY000066, CY000074, CY000082, CY000090, CY000098, CY000106, CY000122, CY000130, CY000138, CY000146, CY000154, CY000162, CY000170, CY000178, CY000250, CY000266, CY000346, CY000354, CY000362, CY000378, CY000474, CY000506, CY000514, CY000522, CY000754, CY000770, CY000778, CY000786, CY000866, CY000874, CY000882, CY000902, CY000910, CY000950, CY000958, CY000966, CY000974, CY001014, CY001022, CY001038, CY001046, CY001054, CY001062, CY001065, CY001089, CY001097, CYOOl 113, CYOOl 161, CY001206, CY001214, CY001222, CY001254, CY001286, CY001294, CY001374, CY001406, CY001462, CY001470, CY001513, CY001545, CY001553, CY001561, CY001625, CY001641, CY001649, CY001713, CY002345, CY002521, CY002105, CY001537, DQ098269, DQ100422, DQ100423, DQ100424, CY001342, CY001422, CY006972, CY006988, CY007036, CY007140, CY007268, CY000918, CY006924, CY006932, CY006940, CY006956, CY006964, CY006980, CY006996, CY007004, CY007012, CY007044, CY007052, CY007060, CY007076, CY007084, CY007092, CY007100, CY007116, CY007124, CY007132, CY007188, CY006948, CY007020, CY007068, CY007108, CY007148, CY007156, CY007164, CY007172, CY007180, CY007204, CY007220, CY007228, CY007236, CY007244, CY007252, CY007260, CY008197, CY009245, CY006860, CY007196, CY008541, CY008549, CY008885, CY008893, CY008901, CY008917, CY009021, CY008869, CY008877, CY007212, CY007028, CY000234, CY000242, CY000546, CY000942, CY001334, CY003127, CY003145, CY003153, CY003201, CY003778, CY000538, CY000554, CY000794, CY001198, CY003121, CY003177, CY003193, CY000114, CY000210, CY000218, CY000226, CY000282, CY000290, CY000298, CY000314, CY000330, CY000338, CY000394, CY000402, CY000410, CY000418, CY000426, CY000434, CY000442, CY000490, CY000498, CY000530, CY000626, CY000926, CY000934, CY001073, CY001081, CY001105, CY001129, CY001145, CY001153, CY001185, CY001238, CY001302, CY001310, CY001318, CY001326, CY001729, CY001737, CY001945, CY002129, CY003097, CY003105, CY003161, CY003169, CY003185, CY003425, CY003209, CY001262, CY000586, CY001438, CY003137, CY007652, CY007764, CY007780, CY001430, CY007588, CY007596, CY007604, CY007668, CY007676, CY007684, CY007740, CY007748, CY007772, CY007788, CY003113, CY007660, CY007692, CY007700, CY007708, CY007716, CY007724, CY007732, CY007756, CY008277, CY008301, CY008269, CY008293, CY008333, CY008261, CY008285, CY008309, CY008317, CY008325, CY002817, DQ249266, CYOOOl 86, CY000202, CY000274, CY000322, CY000386, CY000482, CY000570, CY000577, CYOOl 169, CY001721, CY002329, CY003081, CY003089, CY000306, CY001398, CY003249, CY003257, CY003449, CY003457, CY003265, CY003273, CY000450, CY000466, CY000610, CY000658, CY000666, CY000690, CY000698, CY000706, CY000714, CY000738, CY000810, CY000818,,CY000834, CY000842, CY000850, CY000982, CY000998, CY001246, CY001278, CY001366, CY003810, CY003826, CY003281, CY006660, CYOOl 137, CY003241, CY001521, CY008501, CY008509, CY009101, CY009149, CY003818, CY008837, CY008477, CY009085, CY009117, CY009133, CY009141, CY009165, CY008485, CY008749, CY008757, CY008773, CY008781, CY008789, CY008797, CY008805, CY008813, CY008821, CY008829, CY008845, CY008853, CY008861, CY009077, CY009125, CY009157, CY008493, CY008765, AJ293925, AF386765, AF386766, AF386767, AF386768, AF386770, AF386771, AF386772, CY001529, CY001617, CY001665, CY001809, CY001873, CYOO1881, CY002297, CY002305, CY002337, CY003433, CY003569, CY003577, CY003585, CY003609, CY003617, CY006061, CY006900, CY001897, CY001977, CY001985, CY002169, CY003625, CY003786, CY000458, CY000594, CY000602, CY000618, CY000634, CY000642, CY000650, CY000674, CY000682, CY000722, CY000730, CY000802, CY000826, CY000858, CY000990, CY001006, CY001121, CY001177, CY001270, CY001358, CY001382, CY001446, CY001454, CY001577, CY001593, CY001601, CY001657, CY001689, CY001697, CY001705, CY001745, CY001753, CY001761, CY001769, CY001777, CY001793, CY001825, CY001841, CY001849, CY001857, CY001865, CY001889, CY001905, CY001913, CY001921, CY001929, CY001937, CY001961, CY001969, CY001993, CY002113, CY002137, CY002145, CY002369, CY002553, CY002561, CY002577, CY003593, CY003794, CY003217, CY006069, CY003601, CY003633, CY000746, CY001414, CY001833, CY002161, CY002313, CY003225, CY003233, CY003441, CY003802, CY001350, CY006164, CY007636, CY009109, CY001609, AF386769, AJ458305, CY001801, CY001817, CY002121, CY001478, CY001486, CY001494, CY001569, CY001585, CY001673, CY001785, CY002377, CY002545, CY002385, CY006252, CY003561, CY006540, CY006788, CY006532, CY006284, CY006460, CY006572, CY006580, CY008181, CY006604, CY006772, CY006484, CY006492, CY006500, CY006516, CY006548, CY006588, CY006596, CY006636, CY006796, CY006804, CY006556, CY006564, CY006620, CY008533, CY008925, CY008941, CY008949, CY008957, CY008965, CY008973, CY008981, CY006476, CY001505, AJ458307, AJ458308, AF255369, AF255370, CY006228, CY006244, CY006276,

CY006452, CY006236, CY006260, CY006468, CY006444, CY006508, CY006524, CY006612, CY006628, CY008933, CY006268, AF038274, AJ458306, AF038271, AF038272, AF038273, AJ458339, CY002273, U65565, U65568, U65569, U65570, U65572, U65573, U65574, U65575, U65576, U65577, U65578, CY006340, CY003753, CY006332, U65563, U65564, U65566, U65567, U65571, CY006348, CY009013, U65561, U65562, CY003713, CY003065, CY008741, L18995, L18999, AF401293, M63516, CY003513, CY003353, CY008725, CY008733, CY003545, CY002089, CY002753, CY008717, CY003521, CY003537, CY003505, CY008453, CY009069, CY008669, CY008709, CY006324, CY003745, CY008173, CY006316, CY006852, CY003721, CY003737, CY009053, CY006053, U08863, CY006756, AF348912, AF348913, CY007628, CY003489, CY006100, CY006892, CY007620, CY008469, CY006204, CY008661, KOl 140, CY006692, CY006700, CY006708, CY007612, CY006732, CY006740, CY006844, CY006764, CY006724, CY006836, CY006884, CY009061, CY008693, CY008701, CY006045, CY003729, CY006908, CY003497, CY006828, CY006716, CY006820, CY009005, CY006812, CY003529, X08090, X08092, CY002097, CY003553, J02167, CY002745, CY006308, CY008461, CY008685, CY008677, AY210266, AY210267, AY210268, AY210269, AY210270, CY002497, CY006220, AY210262, AY210263, AY210264, AY210265, CY006684, AY210258, AY210259, AY210260, AY210261, CY006300, AY210253, AY210254, AY210255, AY210256, AY210257, CY008157, M63515, AF348188, AF348191, AF348192, AF348193, AF348195, AF348196, AF348197, CY006212, AF348189, AF348190, AF348194, AY210239, AY210240, AY210241, AY210242, AY210243, AY210244, AY210245, AY210246, AY210247, AY210248, AY210249, AY210250, AY210251, AY210252, DQ360841, DQ094271, DQ094272, DQ094273, DQ094274, DQ094275, DQ372592, DQ094266, DQ094267, AY818144, AY651387, AY651388, AY651389, AY651390, AB212057, AY575894, AY575893, AF255374, AJ278648, AF036358, AF255365, AF255366, AF255367, AF255368, AF255371, AF255372, AF255373, AF046090, AF084282, AF084284, AFl 15286, DQ226095, AF255363, AF255364, AJ278647, AJ278646

Table 1-8: GenBank accession numbers for NS sequences (segment 8) used in this analysis. CY007471 , CY002540, CY002628, CY002684, CY002692, CY002708, CY002812, CY002988, CY003300, CY003380, CY003388, CY003692, CY003708, CY006199, CY006431, CY006671, CY006919, CY008528, CY009000, CY002532, CY003308, CY006679, DQ249269, CY001956, CY002396, CY002404, CY002572, CY002620, CY002676, CY002700, CY002804, CY003004, CY003012, CY003020, CY003028, CY003292, CY003316, CY003324, CY003332, CY003396, CY003404, CY003468, CY003476, CY003484, CY003837, CY006175, CY006359, CY006367, CY006423, CY006783, CY006879, CY008152, CY009240, CY002644, CY002652, CY009176, CY009184, CY009192, CY009200, CY009208, CY009224, CY009232, AF258521, AF342817, AF258520, AF055423, AF055424, AF055425, AF398877, U53170, U53171, AF055422, CY009320, M57643, M80974, M12593, M12594, AJ298950, X15282, M12595, CY009288, CY009296, K00578, CY008992, M12592, CY009368, CY009344, X52146, CY009336, K00576, K00577, U02087, M12596, U13682, AF389122, L25720, M12597, U13683, Z21498, AF333238, CY002156, CY002356, CY002364, CY002636, CY002660, CY002668, CY002996, CY003676, CY003700, CY003765, CY006111, CY006191, CY006391, CY006399, CY006407, CY006415, CY006751, CY006871, AB126628, AB126636, AJ519455, CY001684, CY003372, CY003773, AY210191, AY210192, M12590, AY210182, AY210183, AY210184, AY210185, AY210186, AY210187, AY210188, AY210189, AY210190, AY210178, AY210179, AY210180, AY210181, AY210174, AY210175, AY210176, AY210177, AY210171, AY210172, AY210173, AY210168, AY210169, AY210170, AY210164,

AY210165, AY210166, AY210167, AY210162, AY210163, AY210160, AY210161, M23968, AY210158, AY210159, AY210156, AY210157, AY210151, AY210152, AY210153, AY210154, AY210155, M81572, M81578, M81584, CY002004, CY002012, CY002020, CY002036, CY002060, CY002076, CY002188, CY002204, CY002244, CY002268, CY002452, CY002460, CY002468, CY002484, CY002492, CY002724, CY002740, CY002780, CY003052, CY003060, CY003348, CY003644, CY003652, CY006080, CY006088, CY006127, CY006135, CY006143, CY006151, CY006159, CY006295, CY007799, CY007807, CY007815, CY008344, CY008352, CY008360, CY008368, CY008376, CY008384, CY008392, CY008400, CY008408, CY008416, CY008424, CY008432, CY008440, CY008448, CY008560, CY008568, CY008576, CY008584, CY008592, CY008600, CY008608, CY008616, CY008624, CY008632, CY008640, CY008648, CY008656, CY009032, CY009040, CY009048, CY000037, CY000261, CY000373, CY000565, CY000765, CY000893, CY001033, CY001233, CY002028, CY002044, CY002052, CY002068, CY002084, CY002180, CY002196, CY002212, CY002220, CY002228, CY002236, CY002252, CY002260, CY002284, CY002292, CY002412, CY002420, CY002428, CY002436, CY002444, CY002476, CY002508, CY002588, CY002596, CY002604, CY002612, CY002716, CY002732, CY002764, CY002772, CY002788, CY002796, CY002910, CY002918, CY002926, CY002934, CY002942, CY002950, CY002958, CY002966, CY002973, CY002980, CY003036, CY003044, CY003076, CY003340, CY003412, CY003420, CY003660, CY003668, CY006096, CY006119, CY006183, CY006375, CY006383, CY006439, CY007279, CY007287, CY007295, CY007303, CY007311, CY007319, CY007327, CY007335, CY007343, CY007351, CY007359, CY007367, CY007375, CY007383, CY007391, CY007399, CY007407, CY007415, CY007423, CY007431, CY007439, CY007447, CY007455, CY007463, CY007479, CY007487, CY007495, CY007503, CY007511, CY007519, CY007527, CY007535, CY007543, CY007551, CY007559, CY007567, CY007575, CY007583, CY007647, CY008168, CY008192, CY008208, CY008216, CY008224, CY008232, CY008240, CY008248, CY008256, CY008520, CY008912, CY009256, CY009264, CY009272, CY000005, CY000013, CY000021, CY000029, CY000045, CY000053, CY000061, CY000069, CY000077, CY000085, CY000093, CYOOOlOl, CY000109, CY000125, CY000133, CY000141, CY000149, CY000157, CY000165, CY000173, CYOOOl 81, CY000197, CY000253, CY000269, CY000349, CY000357, CY000365, CY000381, CY000477, CY000509, CY000516, CY000525, CY000757, CY000773, CY000781,

CY000789, CY000869, CY000877, CY000885, CY000905, CY000913, CY000921, CY000953, CY000961, CY000969, CY000977, CY001017, CY001025, CY001041, CY001049, CY001057, CY001068, CY001092, CYOOIlOO, CYOOl 116, CYOOl 164, CYOOl 193, CY001209, CY001217, CY001225, CY001257, CY001289, CY001297, CY001345, CY001377, CY001409, CY001425, CY001465, CY001473, CY001516, CY001540, CY001548, CY001556, CY001564, CY001628, CY001636, CY001644, CY001652, CY001716, CY002108, CY002348, CY002524, CY003684, CY006863, CY006927, CY006935, CY006943, CY006951, CY006959, CY006967, CY006975, CY006983, CY006991, CY006999, CY007007, CY007015, CY007023, CY007031, CY007039, CY007047, CY007055, CY007063, CY007071, CY007079, CY007087, CY007095, CY007103, CY007111, CY007119, CY007127, CY007135, CY007143, CY007151, CY007159, CY007167, CY007175, CY007183, CY007191, CY007199, CY007207, CY007215, CY007223, CY007231, CY007239, CY007247, CY007255, CY007263, CY007271, CY008200, CY008544, CY008552, CY008872, CY008880, CY008888, CY008896, CY008904, CY008920, CY009024, CY009248, CYOOOl 17, CY000213, CY000221, CY000229, CY000237, CY000245, CY000285, CY000293, CY000301, CY000317, CY000333, CY000341, CY000397, CY000405, CY000413, CY000421, CY000429, CY000437, CY000445, CY000493, CY000501, CY000533, CY000541, CY000549, CY000557, CY000589, CY000629, CY000797, CY000929, CY000937, CY000945, CY001076, CY001084, CY001108, CY001132, CY001148, CY001156, CYOO1188, CY001201, CYOO 1241, CY001265, CY001305, CY001313, CY001321, CY001329, CY001337, CY001433, CY001441, CY001732, CY001740, CY001948, CY002132, CY003100, CY003108, CY003116, CY003125, CY003131, CY003140, CY003148, CY003156, CY003164, CY003172, CY003180, CY003188, CY003196, CY003204, CY003212, CY003428, CY003781, CY007591, CY007599, CY007607, CY007655, CY007663, CY007671, CY007679, CY007687, CY007695, CY007703, CY007711, CY007719, CY007727, CY007735, CY007743, CY007751, CY007759, CY007767, CY007775, CY007783, CY007791, CY008264, CY008272, CY008280, CY008288, CY008296, CY008304, CY008312, CY008320, CY008328, CY008336, DQ249270, CY000189, CY000205, CY000277, CY000309, CY000325, CY000389, CY000485, CY000573, CY000580, CYOOl 172, CY001724, CY002332, CY002820, CY003084, CY003092, CY000453, CY000469, CY000613, CY000661, CY000669, CY000693, CY000701, CY000709, CY000717, CY000741, CY000813, CY000821, CY000837, CY000845, CY000853, CY000985, CYOOlOOl, CY001140, CY001249, CY001281, CY001369, CYOO 1401, CY001524, CY003244, CY003252, CY003260, CY003268, CY003276, CY003284, CY003452, CY003460, CY003813, CY003821, CY003829, CY006663, CY008480, CY008488, CY008496, CY008504, CY008512, CY008752, CY008760, CY008768, CY008776, CY008784, CY008792, CY008800, CY008808, CY008816, CY008824, CY008832, CY008840, CY008848, CY008856, CY008864, CY009080, CY009088, CY009104, CY009120, CY009128, CY009136, CY009144, CY009152, CY009160, CY009168, AJ293941, CY000461, CY000597, CY000605, CY000621, CY000637, CY000645, CY000653, CY000677, CY000685, CY000725, CY000733, CY000749, CY000805, CY000829, CY000861, CY000993, CY001009, CYOOl 124, CYOOl 180, CY001273, CY001353, CY001361, CY001385, CY001417, CY001449, CY001457, CY001532, CY001580, CY001596, CY001604, CY001612, CY001620, CY001660, CY001668, CY001692, CY001700, CY001708, CY001748, CY001756, CY001764, CY001772, CY001780, CY001796, CY001812, CY001828, CY001836, CY001844, CY001852, CY001860, CY001868, CY001876, CY001884, CY001892, CY001900, CY001908, CY001916, CY001924, CY001932, CY001940, CY001964, CY001972, CY001980, CY001988, CY001996, CY002116, CY002140, CY002148, CY002164, CY002172, CY002300, CY002308, CY002316, CY002340, CY002372, CY002516, CY002556, CY002564, CY002580, CY003220, CY003228, CY003236, CY003436, CY003444, CY003572, CY003580, CY003588, CY003596, CY003604, CY003612, CY003620, CY003628, CY003636, CY003789, CY003797, CY003805, CY006064, CY006072, CY006167, CY006903, CY007639, CY009112, CY001481, CY001489, CY001497, CY001508, CY001572, CY001588, CY001676, CY001788, CY001804, CY001820, CY002124, CY002380, CY002388, CY002548, CY003564, CY006255, CY006287, CY006463, CY006479, CY006487, CY006495, CY006503, CY006519, CY006535, CY006543, CY006551, CY006559, CY006567, CY006575, CY006583, CY006591, CY006599, CY006607, CY006623, CY006639, CY006775, CY006791, CY006799, CY006807, CY008184, CY008536, CY008928, CY008944, CY008952, CY008960, CY008968, CY008976, CY008984, AF038279, AF256182, AF256183, CY006231, CY006239, CY006247, CY006263, CY006271, CY006279, CY006447, CY006455, CY006471, CY006511, CY006527, CY006615, CY006631, CY008936, AF038276, AF038277, AF038278, AF038275, CY002276, U65671, U65672, U65673, U65674, CY003756, CY006335, CY006343, U65670, CY006351, CY009016, D30675, D30676, CY003716, D30667, D30674, D30673, CY003068, CY008744, D30670, D30672, CY003356, CY003516, CY008728, CY008736, M57642, CY003548, D30671, M57641, M80975, CY002092, CY002756, CY008720, M57640, CY003508, CY003524, CY003540, CY008456, CY008672, CY008712, CY009072, M17699, CY003748, CY006327, CY008176, CY003724, CY003740, CY006319, CY006855, CY009056, CY006056, CY006759, U08862, CY007631, CY003492, CY006103, CY006207, CY006895, CY007623, CY008472, CY008664, CY006695, CY006703, CY006711, CY007615, CY006735, CY006743, CY006767, CY006847, K01332, CY006727, CY006839, CY006887, CY008696, CY008704, CY009064, CY003732, CY006048, CY003500, CY006719, CY006823, CY006831, CY006911, CY003532, CY006815, CY009008, AY210312, AY210313, AY210314, AY210315, AY210316, CY002100, CY002748,

CY003556, CY006311, CY008464, CY008680, CY008688, VOl 102, AY210307, AY210309, AY210310, AY210311, CY002500, CY006223, CY006687, AY210304, AY210305, AY210306, AY210308, AY210299, AY210300, AY210301, AY210302, AY210303, CY006303, AF348198, AF348199, AF348200, AF348201, AF348202, AF348203, AF348204, AF348205, AF348206, AY210285, AY210286, AY210287, AY210288, AY210289, AY210290, AY210291, AY210292, AY210293, AY210294, AY210295, AY210296, AY210297, AY210298, CY006215, CY008160, DQ360842, DQ372595, AY526747, AY651552, AY651553, AY651554, AY651555, AY818147, AB212058, AB212059, AY576368, AY576369, AF256188, AF036360, AF046091, AF084285, AF084286, AF084287,

AF115288, AF256178, AF256179, AF256180, AF256181, AF256184, AF256185, AF256186, AF256187, AJ404736, DQ226117, AF256177, AJ278649, AJ404735, AY043027

Highly conserved 19-mer sequence fragments were identified by extracting all 19- mer sequence fragments from each of the influenza A viral sequences under study, and then tabulating whether or not each sequence fragment is present, as an exact match, within each of the influenza A viral sequences. Thus, a first viral sequence contains a 19- mer sequence fragment that extends from position 1 through 19, another from position 2 through 20, another from position 3 through 21, etc. Likewise the second, third, and fourth viral sequences are extracted in the same way, all the way down to the last viral sequence in the list (Table 1).

The sequence fragments are then added to a growing table of sequence fragments and a count is maintained of the number of influenza A viral sequences that contain each 19-mer fragment. Finally, the fragment frequency is expressed as the percent of the influenza A viral sequences that contain each specific 19-mer fragment. Table 2 lists the most conserved 19-mer sequence fragments (down to 70%) and their frequency of occurrence.

Table 2-1: Conserved 19-mer sequences that are present in at least 70% of the Influenza A segment 1 (PB2) sequences listed in Table 1-1.

Seq 51 AUGAAACGAAAACGGGACU 95.9%

ID Sequence Percent

52 GAAAUGGAUGAUGGCAAUG 95.8%

1 GCCAGACAGCGACCAAAAG 99.5%

53 UGGAUGAUGGCAAUGAAAU 95.8%

2 AGCCAGACAGCGACCAAAA 99.5%

54 UGAAAUGGAUGAUGGCAAU 95.8%

3 ACAGCCAGACAGCGACCAA 99.3%

55 GGAUGAUGGCAAUGAAAUA 95.8%

4 GACAGCCAGACAGCGACCA 99.3%

56 UGAAACGAAAACGGGACUC 95.8%

5 CAGACAGCGACCAAAAGAA 99.3%

57 AAAUGGAUGAUGGCAAUGA 95.8%

6 CAGCCAGACAGCGACCAAA 99.3%

58 AUGGAUGAUGGCAAUGAAA 95.7%

7 AGACAGCGACCAAAAGAAU 99.1% 59 AAUGGAUGAUGGCAAUGAA 95.7%

8 CCAGACAGCGACCAAAAGA 99.1% 60 AUGAAAUGGAUGAUGGCAA 95.7%

9 ACAGCGACCAAAAGAAUUC 99.1% 61 GAAACGAAAACGGGACUCU 95.5%

10 UACUUACUGACAGCCAGAC 99.1% 62 AAACGAAAACGGGACUCUA 95.5%

11 CAGCGACCAAAAGAAUUCG 99.1% 63 AACGAAAACGGGACUCUAG 95.4%

12 ACUUACUGACAGCCAGACA 99.1% 64 GAAAACGGGACUCUAGCAU 95.4%

13 CAUACUUACUGACAGCCAG 99.1% 65 ACGAAAACGGGACUCUAGC 95.4%

14 CUUACUGACAGCCAGACAG 99.1%

66 CGAAAACGGGACUCUAGCA 95.4%

15 GACAGCGACCAAAAGAAUU 99.1% 67 AAAACGGGACUCUAGCAUA 95.3%

16 AUACUUACUGACAGCCAGA 99.1%

68 GAAAGGUUAAAACAUGGAA 94.9%

17 ACUGACAGCCAGACAGCGA 99.0%

69 AAAGGUUAAAACAUGGAAC 94.9%

18 GCAUACUUACUGACAGCCA 99.0%

70 AAGGUUAAAACAUGGAACC 94.8%

19 AGCAUACUUACUGACAGCC 99.0%

71 GGUUAAAACAUGGAACCUU 94.8%

20 UGACAGCCAGACAGCGACC 99.0%

72 AGGUUAAAACAUGGAACCU 94.8%

21 UACUGACAGCCAGACAGCG 99.0%

73 CUCGCACUCGCGAGAUACU 94.5%

22 UUACUGACAGCCAGACAGC 99.0%

74 UCUCGCACUCGCGAGAUAC 94.5%

23 CUGACAGCCAGACAGCGAC 99.0%

75 UAAAAGCAGUUAGAGGUGA 94.3%

24 ACUCUAGCAUACUUACUGA 98.9%

76 UCGGAUGGCCAUCAAUUAA 94.2%

25 UAGCAUACUUACUGACAGC 98.7%

77 AGGAUGAAAUGGAUGAUGG 94.0%

26 CUCUAGCAUACUUACUGAC 98.6%

78 GGAUGAAAUGGAUGAUGGC 94.0%

27 CUAGCAUACUUACUGACAG 98.6%

79 UUAGGAUGAAAUGGAUGAU 93.9%

28 UCUAGCAUACUUACUGACA 98.6%

80 AUGGUUGCAUACAUGUUAG 93.9%

29 CAAAAGAAUUCGGAUGGCC 98.6%

81 UAGGAUGAAAUGGAUGAUG 93.9% 30 AAAAGAAUUCGGAUGGCCA 98.6%

82 UGGUUGCAUACAUGUUAGA 93.9%

31 GACCAAAAGAAUUCGGAUG 98.5%

83 CUUAGGAUGAAAUGGAUGA 93.9%

32 CGACCAAAAGAAUUCGGAU 98.5%

84 UUGCAUACAUGUUAGAGAG 93.9%

33 AAAGAAUUCGGAUGGCCAU 98.5%

85 ACUUAGGAUGAAAUGGAUG 93.9%

34 AGCGACCAAAAGAAUUCGG 98.5%

86 GUUGCAUACAUGUUAGAGA 93.8%

35 GCGACCAAAAGAAUUCGGA 98.5%

87 UAAAACAUGGAACCUUUGG 93.8%

36 CCAAAAGAAUUCGGAUGGC 98.5%

88 UUAAAACAUGGAACCUUUG 93.8%

37 ACCAAAAGAAUUCGGAUGG 98.5%

89 GGUUGCAUACAUGUUAGAG 93.8%

38 GAAUUCGGAUGGCCAUCAA 98.4%

90 UGAUGGUUGCAUACAUGUU 93.7%

39 UUCGGAUGGCCAUCAAUUA 98.3%

91 AUGAGAAUACUUGUAAGGG 93.7%

40 AUUCGGAUGGCCAUCAAUU 98.3%

92 UGAGAAUACUUGUAAGGGG 93.7%

41 GACUCUAGCAUACUUACUG 98.3%

93 CACUUAGGAUGAAAUGGAU 93.7%

42 AGAAUUCGGAUGGCCAUCA 98.3%

94 GUUAAAACAUGGAACCUUU 93.7%

43 AAGAAUUCGGAUGGCCAUC 98.3% 95 UUGAUGGUUGCAUACAUGU 93.5%

44 AAUUCGGAUGGCCAUCAAU 98.3% 96 GAUGGUUGCAUACAUGUUA 93.4%

45 GGGACUCUAGCAUACUUAC 98.1% 97 UCACUUAGGAUGAAAUGGA 93.3%

46 ACGGGACUCUAGCAUACUU 98.0% 98 AACAUGGAACCUUUGGCCC 93.1%

47 AAACGGGACUCUAGCAUAC 98.0% 99 AAACAUGGAACCUUUGGCC 93.1%

48 CGGGACUCUAGCAUACUUA 98.0% 100 CAAGCUGUGGAUAUAUGCA 92.9%

49 AACGGGACUCUAGCAUACU 98.0% 101 AAGCUGUGGAUAUAUGCAA 92.9%

50 GGACUCUAGCAUACUUACU 97.9% 102 AAAACAUGGAACCUUUGGC 92.8%

103 GUAAUGAAACGAAAACGGG 92.6% 157 ACUCCAGGUGGAGAAGUGA 87.1%

104 UAAUGAAACGAAAACGGGA 92.6% 158 GAUUGCAUGAUAAAAGCAG 87.0%

105 AAUGAAACGAAAACGGGAC 92.6% 159 CCAGGUGGAGAAGUGAGGA 87.0%

106 GAUGAAAUGGAUGAUGGCA 92.3% 160 CAUACAUGUUAGAGAGAGA 87.0%

107 AACAAGCUGUGGAUAUAUG 91.3% 161 GCAUGAUAAAAGCAGUUAG 87.0%

108 GAACAAGCUGUGGAUAUAU 91.3% 162 AUUGCAUGAUAAAAGCAGU 87.0%

109 ACAAGCUGUGGAUAUAUGC 91.2% 163 GCAUACAUGUUAGAGAGAG 87.0%

110 UGUACACUCCAGGUGGAGA 90.0% 164 CUCCAGGUGGAGAAGUGAG 87.0%

111 AUGUACACUCCAGGUGGAG 90.0% 165 CAUGAUAAAAGCAGUUAGA 87.0%

112 AGAACAAGCUGUGGAUAUA 89.8% 166 GUGGAGAAGUGAGGAAUGA 86.9%

113 GAAGAACAAGCUGUGGAUA 89.8% 167 UGCAUGAUAAAAGCAGUUA 86.9%

114 AAGAACAAGCUGUGGAUAU 89.8% 168 CAGGUGGAGAAGUGAGGAA 86.9%

115 UGAUAAAAGCAGUUAGAGG 89.7% 169 GGUGGAGAAGUGAGGAAUG 86.9%

116 AUGAUAAAAGCAGUUAGAG 89.6% 170 UUGCAUGAUAAAAGCAGUU 86.8%

117 ACAUGGUGGAAUAGAAAUG 89.3% 171 AGGUGGAGAAGUGAGGAAU 86.8%

118 CAUGGUGGAAUAGAAAUGG 89.3% 172 UGCAUACAUGUUAGAGAGA 86.7%

119 CGGAUGGCCAUCAAUUAAU 89.1% 173 AAAGAAUAAAAGAACUACG 86.6%

120 GAUAAAAGCAGUUAGAGGU 88.9% 174 GAGAUGUGCCACAGCACAC 86.6%

121 AUAAAAGCAGUUAGAGGUG 88.9% 175 UAAGAGUCAGCAAAAUGGG 86.6%

122 AUAUGGCCAUAAUUAAGAA 88.8% 176 CCAUAUGGCCAUAAUUAAG 86.5%

123 CAUAUGGCCAUAAUUAAGA 88.8% 177 GAUUUCUCCCAGUUGCUGG 86.5%

124 UGGAAAGAAUAAAAGAACU 88.5% 178 AGAUUUCUCCCAGUUGCUG 86.5%

125 UGGUGGAAUAGAAAUGGAC 88.5% 179 AGGAGGUCAGUGAAACACA 86.4%

126 AUGGAAAGAAUAAAAGAAC 88.5% 180 GAGGAGGUCAGUGAAACAC 86.4%

127 GGUGGAAUAGAAAUGGACC 88.4% 181 GAAAGAAUAAAAGAACUAC 86.4%

128 GGAAAGAAUAAAAGAACUA 88.4% 182 AUACAUGUUAGAGAGAGAA 86.2%

129 AUGGUGGAAUAGAAAUGGA 88.4% 183 AGGGAUGAGAAUACUUGUA 86.1%

130 AUGGCCAUAAUUAAGAAGU 88.1% 184 UAUGCAAGGCUGCAAUGGG 86.1%

131 UGGCCAUAAUUAAGAAGUA 88.1% 185 GGAUGAGAAUACUUGUAAG 86.1%

132 AGUCUCGCACUCGCGAGAU 88.1% 186 GCACAGAGAUGUCAAUGAG 86.1%

133 GUCUCGCACUCGCGAGAUA 88.1% 187 GAAAAGAACCCGUCACUUA 86.1%

134 UAUUCAACUACAACAAGAC 88.1% 188 AGCACAGAGAUGUCAAUGA 86.1%

135 CAGUCUCGCACUCGCGAGA 88.1% 189 AUAUGCAAGGCUGCAAUGG 86.1% 136 UAUGGCCAUAAUUAAGAAG 88.0% 190 GGGAUGAGAAUACUUGUAA 86.1%

137 GACCAUAUGGCCAUAAUUA 87.8% 191 AAAAGAACCCGUCACUUAG 86.1%

138 UGGACCAUAUGGCCAUAAU 87.7% 192 UCAGGGAUGAGAAUACUUG 86.1%

139 GUAUUCAACUACAACAAGA 87.7% 193 CAGGGAUGAGAAUACUUGU 86.1%

140 GUGGACCAUAUGGCCAUAA 87.7% 194 CCAAGCACAGAGAUGUCAA 86.0%

141 CAUGUUAGAGAGAGAACUU 87.6% 195 CAGGAAAAGAACCCGUCAC 86.0%

142 UACAUGUUAGAGAGAGAAC 87.6% 196 CCACAGUGGACCAUAUGGC 86.0%

143 ACAUGUUAGAGAGAGAACU 87.6% 197 AGGAAAAGAACCCGUCACU 86.0%

144 ACCAUAUGGCCAUAAUUAA 87.6% 198 ACCACAGUGGACCAUAUGG 86.0%

145 UGUUAGAGAGAGAACUUGU 87.6% 199 CAGUGGACCAUAUGGCCAU 86.0%

146 AUGUUAGAGAGAGAACUUG 87.6% 200 GAAUACUUGUAAGGGGCAA 86.0%

147 ACACUCCAGGUGGAGAAGU 87.4% 201 AGAUGUGCCACAGCACACA 86.0%

148 AUGUCGCAGUCUCGCACUC 87.4% 202 GGAAAAGAACCCGUCACUU 86.0%

149 GUACACUCCAGGUGGAGAA 87.4% 203 AAGCACAGAGAUGUCAAUG 86.0%

150 UACACUCCAGGUGGAGAAG 87.4% 204 AGAAUACUUGUAAGGGGCA 86.0%

151 UGUCGCAGUCUCGCACUCG 87.4% 205 GGACCAUAUGGCCAUAAUU 86.0%

152 UCGCAGUCUCGCACUCGCG 87.3% 206 ACAGUGGACCAUAUGGCCA 86.0%

153 GUCGCAGUCUCGCACUCGC 87.3% 207 AGUGGACCAUAUGGCCAUA 86.0%

154 GCAGUCUCGCACUCGCGAG 87.2% 208 CAAGCACAGAGAUGUCAAU 86.0%

155 CGCAGUCUCGCACUCGCGA 87.2% 209 GAGAAUACUUGUAAGGGGC 86.0%

156 CACUCCAGGUGGAGAAGUG 87.2% 210 GACAGGAAAAGAACCCGUC 85.9%

211 AUCGAGUGAUGGUAUCACC 85.9% 265 AGGUCAGUGAAACACAGGG 85.3%

212 GCUGUGGAUAUAUGCAAGG 85.9% 266 GGGGAUCAGGGAUGAGAAU 85.3%

213 GGAUAUAUGCAAGGCUGCA 85.9% 267 GAGGUCAGUGAAACACAGG 85.3%

214 UGGAUAUAUGCAAGGCUGC 85.9% 268 UUGUAAGGGGCAAUUCUCC 85.3%

215 AGACAGGAAAAGAACCCGU 85.9% 269 GGGAGACAGGAAAAGAACC 85.3%

216 ACAAAAACCACAGUGGACC 85.9% 270 GGAGACAGGAAAAGAACCC 85.3%

217 UUUGCAGCCGCUCCACCAA 85.9% 271 GGGAUCAGGGAUGAGAAUA 85.3%

218 CUGUGGAUAUAUGCAAGGC 85.9% 272 CUUGUAAGGGGCAAUUCUC 85.3%

219 GUGGAUAUAUGCAAGGCUG 85.9% 273 ACUUGUAAGGGGCAAUUCU 85.3%

220 ACAGGAAAAGAACCCGUCA 85.9% 274 AGAAUCAGCUCAUCCUUCA 85.1%

221 CAGAGAUGUCAAUGAGAGG 85.9% 275 GGUAAUGAAACGAAAACGG 85.1%

222 GAUCGAGUGAUGGUAUCAC 85.9% 276 UGAGGGGAUCAGGGAUGAG 85.1%

223 UAUAUGCAAGGCUGCAAUG 85.9% 277 GUGAGGGGAUCAGGGAUGA 85.1%

224 AUAUAUGCAAGGCUGCAAU 85.9% 278 AUGUGAGGGGAUCAGGGAU 85.1%

225 UCCAGGUGGAGAAGUGAGG 85.9% 279 GAGGGGAUCAGGGAUGAGA 85.1%

226 CAAAAACCACAGUGGACCA 85.9% 280 ACCACCCAGAUAAUAAAGC 85.1%

227 UUGCAGCCGCUCCACCAAA 85.9% 281 AAUGUGAGGGGAUCAGGGA 85.1%

228 UACUCAGAAAAGCAACCAG 85.8% 282 CCACCCAGAUAAUAAAGCU 85.1%

229 AUAAUUAAGAAGUACACAU 85.8% 283 UGUGAGGGGAUCAGGGAUG 85.1%

230 AAAAACCACAGUGGACCAU 85.8% 284 GCUAUACUCAGAAAAGCAA 85.0%

231 AUACUCAGAAAAGCAACCA 85.8% 285 CUAUACUCAGAAAAGCAAC 85.0%

232 GAUAUAUGCAAGGCUGCAA 85.8% 286 GAAUCAGCUCAUCCUUCAG 85.0%

233 AAAACCACAGUGGACCAUA 85.8% 287 CCUGAGGAGGUCAGUGAAA 84.9%

234 AAGAAUAAAAGAACUACGG 85.8% 288 CUGAGGAGGUCAGUGAAAC 84.9%

235 UACAACAAAAUGGAAUUUG 85.8% 289 UAUACUCAGAAAAGCAACC 84.9%

236 ACAACAAAAUGGAAUUUGA 85.8% 290 CAACAGCUAUACUCAGAAA 84.8%

237 AAACCACAGUGGACCAUAU 85.8% 291 AUGGAACCUUUGGCCCUGU 84.8%

238 CACAGUGGACCAUAUGGCC 85.8% 292 GCAACAGCUAUACUCAGAA 84.8%

239 CACAGAGAUGUCAAUGAGA 85.8% 293 AUAAGAGUCAGCAAAAUGG 84.8%

240 UAAUUAAGAAGUACACAUC 85.8% 294 UGAGGAGGUCAGUGAAACA 84.6%

241 ACAGAGAUGUCAAUGAGAG 85.8% 295 UUUUAAGGCAUUUUCAGAA 84.6%

242 AACCACAGUGGACCAUAUG 85.8% 296 CUCUAUUCCAACAAAUGAG 84.6%

243 AUCAGGGAUGAGAAUACUU 85.7% 297 ACUCUAUUCCAACAAAUGA 84.6%

244 AGAAUAAAAGAACUACGGA 85.7% 298 GAACCCGUCACUUAGGAUG 84.6%

245 AUACUUGUAAGGGGCAAUU 85.7% 299 AAGAACCCGUCACUUAGGA 84.6%

246 GAUCAGGGAUGAGAAUACU 85.7% 300 AGAACCCGUCACUUAGGAU 84.6%

247 AGCUGUGGAUAUAUGCAAG 85.7% 301 AACAGCUAUACUCAGAAAA 84.6%

248 CCAUAAUUAAGAAGUACAC 85.7% 302 AGAUAAUAAAGCUUCUCCC 84.5%

249 UGUGGAUAUAUGCAAGGCU 85.7% 303 CCCAGAUAAUAAAGCUUCU 84.5%

250 UACUUGUAAGGGGCAAUUC 85.7% 304 CAGAUAAUAAAGCUUCUCC 84.5%

251 AAUACUUGUAAGGGGCAAU 85.7% 305 CUUUUAAGGCAUϋUUCAGA 84.5%

252 GGAUCAGGGAUGAGAAUAC 85.7% 306 GUAAUUAUGGAAGUUGUUU 84.4%

253 CAUAAUUAAGAAGUACACA 85.7% 307 ACCCAGAUAAUAAAGCUUC 84.4%

254 ACCCGUCACUUAGGAUGAA 85.6% 308 AUGUAAUUAUGGAAGUUGU 84.4%

255 CCGUCACUUAGGAUGAAAU 85.6% 309 CCUGGUCAUGCAGACCUCA 84.4%

256 GUCACUUAGGAUGAAAUGG 85.6% 310 GAUGUAAUUAUGGAAGUUG 84.4%

257 GCCAUAAUUAAGAAGUACA 85.6% 311 UCCUGAGGAGGUCAGUGAA 84.4%

258 CGUCACUUAGGAUGAAAUG 85.6% 312 CUCCUGAGGAGGUCAGUGA 84.4%

259 CCCGUCACUUAGGAUGAAA 85.6% 313 UAAUUAUGGAAGUUGUUϋU 84.4%

260 GGCCAUAAUUAAGAAGUAC 85.6% 314 CACCCAGAUAAUAAAGCUU 84.4%

261 AACCCGUCACUUAGGAUGA 85.6% 315 UGGAGAUGUGCCACAGCAC 84.4%

262 UGGUAAUGAAACGAAAACG 85.4% 316 UCUCCUGAGGAGGUCAGUG 84.4%

263 UUGGUAAUGAAACGAAAAC 85.4% 317 ACAGCUAUACUCAGAAAAG 84.4%

264 AGGGGAUCAGGGAUGAGAA 85.3% 318 CCACUAGCAUCUUUAUUGG 84.4%

319 CACUAGCAUCUUUAUUGGA 84.4% 373 CAGCCGCUCCACCAAAGCA 83.5%

320 CUGGUCAUGCAGACCUCAG 84.3% 374 GCAGCCGCUCCACCAAAGC 83.5%

321 CAUGGAACCUUUGGCCCUG 84.3% 375 GGCAUUUUCAGAAAGAUGC 83.4%

322 UGUAAUUAUGGAAGUUGUU 84.3% 376 AGGCAUUUUCAGAAAGAUG 83.4%

323 CAGCUAUACUCAGAAAAGC 84.2% 377 UAAGGCAUUUUCAGAAAGA 83.4%

324 AUUCUCCUGUAUUCAACUA 84.2% 378 ACUUUAAUUGAAGACCCAG 83.4%

325 AAAGAACCCGUCACUUAGG 84.2% 379 AAGGCAUUUUCAGAAAGAU 83.4%

326 UUGGAGAUGUGCCACAGCA 84.2% 380 UUAAGGCAUUUUCAGAAAG 83.4%

327 AAUUCUCCUGUAUUCAACU 84.2% 381 GCAAUCUCCAAACAUUGAA 83.4%

328 ACAUGGAACCUUUGGCCCU 84.1% 382 GCACUUUAAUUGAAGACCC 83.3%

329 GGAGAUGUGCCACAGCACA 84.1% 383 CACUUUAAUUGAAGACCCA 83.3%

330 UCAGAUCGAGUGAUGGUAU 84.1% 384 CUUUAAUUGAAGACCCAGA 83.3%

331 UAUUGGAGAUGUGCCACAG 84.1% 385 GGCACUUUAAUUGAAGACC 83.3%

332 AGAUCGAGUGAUGGUAUCA 84.1% 386 UUUAAGGCAUUUUCAGAAA 83.2%

333 CAGAUCGAGUGAUGGUAUC 84.1% 387 UUUAAUUGAAGACCCAGAU 83.2%

334 CCAGAUAAUAAAGCUUCUC 84.1% 388 UUUGAGAGUUCGAGACCAA 83.2%

335 AUUGGAGAUGUGCCACAGC 84.1% 389 UUUUUGAGAGUUCGAGACC 83.2%

336 AGCUAUACUCAGAAAAGCA 84.0% 390 GCCGCUCCACCAAAGCAAA 83.2%

337 GGCAAUUCUCCUGUAUUCA 84.0% 391 UUUUGAGAGUUCGAGACCA 83.2%

338 CUUUAUUGGAGAUGUGCCA 84.0% 392 AGCCGCUCCACCAAAGCAA 83.2%

339 UUAUUGGAGAUGUGCCACA 84.0% 393 UUAAUUGAAGACCCAGAUG 83.2%

340 UCCAAGCACAGAGAUGUCA 84.0% 394 UGCAGCCGCUCCACCAAAG 83.2%

341 UCUUUAUUGGAGAUGUGCC 84.0% 395 CCGCUCCACCAAAGCAAAG 83.2%

342 GCAAUUCUCCUGUAUUCAA 83.9% 396 UAAUUGAAGACCCAGAUGA 83.0%

343 CUCCAAGCACAGAGAUGUC 83.9% 397 UUGAGAGUUCGAGACCAAC 83.0%

344 GCCAUGGUGUUUUCACAAG 83.9% 398 UUCACAAUGGUGGGGAAAA 83.0%

345 GAGACAGGAAAAGAACCCG 83.9% 399 UAAACCCUGGUCAUGCAGA 82.9%

346 UUUAUUGGAGAUGUGCCAC 83.9% 400 UUCAAUGGUCUCAGAAUCC 82.9%

347 CAUCUUUAUUGGAGAUGUG 83.9% 401 UCACAAUGGUGGGGAAAAG 82.9%

348 CCAUGGUGUUUUCACAAGA 83.9% 402 UCAAUGGUCUCAGAAUCCU 82.9%

349 GGAGGUCAGUGAAACACAG 83.9% 403 UCGAGUGAUGGUAUCACCU 82.9%

350 AAAAUUCAAUGGUCUCAGA 83.9% 404 AUUCAAUGGUCUCAGAAUC 82.9%

351 AAAUUCAAUGGUCUCAGAA 83.8% 405 AAUUCAAUGGUCUCAGAAU 82.9%

352 AGCAUCUUUAUUGGAGAUG 83.8% 406 UCAGUGAAACACAGGGAAC 82.8%

353 AUGACUCCAAGCACAGAGA 83.8% 407 AUAAACCCUGGUCAUGCAG 82.8%

354 CUAGCAUCUUUAUUGGAGA 83.8% 408 ACCCUGGUCAUGCAGACCU 82.8%

355 GCAUCUUUAUUGGAGAUGU 83.8% 409 AAACCCUGGUCAUGCAGAC 82.8%

356 GACAUAAACCCUGGUCAUG 83.8% 410 AACCCUGGUCAUGCAGACC 82.8% 357 UAUCUCCUGAGGAGGUCAG 83.8% 411 GUCAGUGAAACACAGGGAA 82.7%

358 CAAUUCUCCUGUAUUCAAC 83.8% 412 GAGGAUUGCAUGAUAAAAG 82.7%

359 GAUGAGAAUACUUGUAAGG 83.8% 413 GUAAGGGGCAAUUCUCCUG 82.7%

360 UAGCAUCUUUAUUGGAGAU 83.8% 414 AGGAUUGCAUGAUAAAAGC 82.7%

361 AUCUUUAUUGGAGAUGUGC 83.7% 415 UGUAAGGGGCAAUUCUCCU 82.7%

362 UGACUCCAAGCACAGAGAU 83.7% 416 CAUAAACCCUGGUCAUGCA 82.7%

363 AUCUCCUGAGGAGGUCAGU 83.7% 417 UGAGAGUUCGAGACCAACG 82.6% 364 ACAUAAACCCUGGUCAUGC 83.7% 418 CAAGAGGAUUGCAUGAUAA 82.6%

365 GAGAAUCAGCUCAUCCUUC 83.7% 419 AAGAGGAUUGCAUGAUAAA 82.6%

366 UGAGAAUCAGCUCAUCCUU 83.7% 420 UCACAAGAGGAUUGCAUGA 82.5%

367 GACUCCAAGCACAGAGAUG 83.7% 421 UAAGGGGCAAUUCUCCUGU 82.5%

368 ACUAGCAUCUUUAUUGGAG 83.7% 422 GGAUUGCAUGAUAAAAGCA 82.5%

369 ACUCCAAGCACAGAGAUGU 83.7% 423 CACAAGAGGAUUGCAUGAU 82.5%

370 AAGGAACGUGUUGGGAACA 83.6% 424 AGAGGAUUGCAUGAUAAAA 82.5%

371 CAAGGAACGUGUUGGGAAC 83.6% 425 ACAAGAGGAUUGCAUGAUA 82.5%

372 GGCAAUCUCCAAACAUUGA 83.6% 426 GUGUUUUCACAAGAGGAUU 82.4%

427 UGUUUUCACAAGAGGAUUG 82.4% 481 CAUGGUGUUUUCACAAGAG 80.8%

428 AAGAAGUGCUUACAGGCAA 82.3% 482 AAUGUUGUACAACAAAAUG 80.8%

429 UUCACAAGAGGAUUGCAUG 82.3% 483 GGUGUUUUCACAAGAGGAU 80.8% 430 UUUCACAAGAGGAUUGCAU 82.3% 484 ACAAGGAUGGUGGACAUUC 80.6%

431 GUUUUCACAAGAGGAUUGC 82.3% 485 CAAGGAUGGUGGACAUUCU 80.6%

432 GAAGAAGUGCUUACAGGCA 82.3% 486 AACUACAACAAGACCACUA 80.6%

433 UUUUCACAAGAGGAUUGCA 82.3% 487 ACUACAACAAGACCACUAA 80.6%

434 GCCAAUACAGUGGGUUUGU 82.2% 488 GCAGAUCCACUAGCAUCUU 80.5%

435 AGGGGCAAUUCUCCUGUAU 82.1% 489 CAGAUCCACUAGCAUCUUU 80.5%

436 UUAGAGGUGACCUGAAUUU 82.1% 490 GUGAUGGUAUCACCUUUGG 80.4%

437 UGGAAUUUGAACCAUUUCA 82.1% 491 UGAUGGUAUCACCUUUGGC 80.4%

438 GGGGCAAUUCUCCUGUAUU 82.1% 492 AGAUACGGACCAGCAUUAA 80.4%

439 AAGGGGCAAUUCUCCUGUA 82.1% 493 GAGUGAUGGUAUCACCUUU 80.4%

440 AUGGAAUUUGAACCAUUUC 82.1% 494 AUCCACUAGCAUCUUUAUU 80.3%

441 GUUAGAGGUGACCUGAAUU 82.1% 495 GAUACGGACCAGCAUUAAG 80.3%

442 AAAAUGGAAUUUGAACCAU 82.0% 496 AGAUCCACUAGCAUCUUUA 80.3%

443 UGUACAACAAAAUGGAAUU 82.0% 497 GAUCCACUAGCAUCUUUAU 80.3%

444 AAAUGGAAUUUGAACCAUU 82.0% 498 CGAGUGAUGGUAUCACCUU 80.3%

445 GUACAACAAAAUGGAAUUU 82.0% 499 CAACUACAACAAGACCACU 80.3%

446 CAGUUAGAGGUGACCUGAA 81.9% 500 UUACAGGCAAUCUCCAAAC 80.2%

447 GAGGUGACCUGAAUUUCGU 81.8% 501 ACAGGCAAUCUCCAAACAU 80.2%

448 AGAGGUGACCUGAAUUUCG 81.8% 502 CUUACAGGCAAUCUCCAAA 80.2%

449 UAGAGGUGACCUGAAUUUC 81.8% 503 UACAGGCAAUCUCCAAACA 80.2%

450 AAAGCAGUUAGAGGUGACC 81.8% 504 CAGGCAAUCUCCAAACAUU 80.2%

451 AAGCAGUUAGAGGUGACCU 81.8% 505 AGUGAUGGUAUCACCUUUG 80.2%

452 AAAAGCAGUUAGAGGUGAC 81.8% 506 AGGCAAUCUCCAAACAUUG 79.9%

453 AGUUAGAGGUGACCUGAAU 81.8% 507 UAUGGAAAGAAUAAAAGAA 79.9%

454 GCAGUUAGAGGUGACCUGA 81.8% 508 AAGAUACGGACCAGCAUUA 79.9%

455 AGCAGUUAGAGGUGACCUG 81.8% 509 AUGCGAAAGUGCUUUUUCA 79.7%

456 AAUGGAAUUUGAACCAUUU 81.7% 510 UAUCAAUGGAUCAUCAGAA 79.7%

457 GGAGAAGUGAGGAAUGACG 81.7% 511 GGAUGGUGGACAUUCUUAG 79.7%

458 GGGCAAUUCUCCUGUAUUC 81.7% 512 ACCUAUCAAUGGAUCAUCA 79.6%

459 GAGAAGUGAGGAAUGACGA 81.7% 513 GCUUACAGGCAAUCUCCAA 79.6%

460 UGGAGAAGUGAGGAAUGAC 81.6% 514 AAGGAUGGUGGACAUUCUU 79.6%

461 UUGUACAACAAAAUGGAAU 81.4% 515 AGGAUGGUGGACAUUCUUA 79.6%

462 UGUUGUACAACAAAAUGGA 81.4% 516 GAAGUGCUUACAGGCAAUC 79.6%

463 AGCAUUAAGCAUCAAUGAA 81.4% 517 AUCAAUGGAUCAUCAGAAA 79.6%

464 CAGCAUUAAGCAUCAAUGA 81.4% 518 UGCUUACAGGCAAUCUCCA 79.6%

465 CCAGCAUUAAGCAUCAAUG 81.4% 519 AAGUGCUUACAGGCAAUCU 79.6%

466 AUGUUGUACAACAAAAUGG 81.3% 520 GAAGUGAGGAAUGACGAUG 79.5%

467 GGAGUAAAAUGAGUGAUGC 81.3% 521 GUGCUUACAGGCAAUCUCC 79.5%

468 AAUUGAAGACCCAGAUGAA 81.3% 522 AAGUGAGGAAUGACGAUGU 79.5%

469 UGGAGUAAAAUGAGUGAUG 81.3% 523 AGAGACUGACAAUAACUUA 79.5%

470 UUCAACUACAACAAGACCA 81.2% 524 GAGAGACUGACAAUAACUU 79.5%

471 UCAACUACAACAAGACCAC 81.2% 525 AGAAGUGCUUACAGGCAAU 79.5%

472 GUUGUACAACAAAAUGGAA 81.2% 526 GAAGAUACGGACCAGCAUU 79.5%

473 AUGGUGUUUUCACAAGAGG 81.0% 527 GAUGCGAAAGUGCUUUUUC 79.5%

474 AUGGAGUAAAAUGAGUGAU 81.0% 528 UCAGCUCAUCCUUCAGCUU 79.5%

475 CUAUGGAGUAAAAUGAGUG 81.0% 529 AUGUGCCACAGCACACAAA 79.4%

476 UGGUGUUUUCACAAGAGGA 81.0% 530 AAUACCUAUCAAUGGAUCA 79.4%

477 UAUGGAGUAAAAUGAGUGA 81.0% 531 CCUAUCAAUGGAUCAUCAG 79.4%

478 AGAAGUGAGGAAUGACGAU 80.9% 532 AUACCUAUCAAUGGAUCAU 79.4%

479 CAAUGUUGUACAACAAAAU 80.8% 533 UGUGCCACAGCACACAAAU 79.4%

480 AUUCAACUACAACAAGACC 80.8% 534 AGAAGAUACGGACCAGCAU 79.4%

535 AGUGCUUACAGGCAAUCUC 79.4% 589 GAUGGCCAUCAAUUAAUGU 78.7%

536 AUCAGCUCAUCCUUCAGCU 79.4% 590 GUGAGGAAUGACGAUGUUG 78.7%

537 AGGAGUUCACAAUGGUGGG 79.4% 591 GUCAAUACCUAUCAAUGGA 78.7%

538 GAGGAGUUCACAAUGGUGG 79.4% 592 AGUGAGGAAUGACGAUGUU 78.7%

539 AAUCAGCUCAUCCUUCAGC 79.3% 593 GGAGUUCACAAUGGUGGGG 78.6%

540 GAUGUGCCACAGCACACAA 79.3% 594 CCACAGCACACAAAUUGGC 78.5%

541 AGCUCAUCCUUCAGCUUUG 79.2% 595 AGAACUCUAUUCCAACAAA 78.5%

542 CUGUAUUCAACUACAACAA 79.2% 596 GAUGUCAAUGAGAGGAAUA 78.5%

543 GCUCAUCCUUCAGCUUUGG 79.2% 597 GAACUCUAUUCCAACAAAU 78.5%

544 UCAAUGAGAGGAAUAAGAG 79.2% 598 GGUCAGUGAAACACAGGGA 78.5%

545 UGAGAGGAAUAAGAGUCAG 79.2% 599 AGAUGUCAAUGAGAGGAAU 78.5%

546 CAAUGAGAGGAAUAAGAGU 79.2% 600 GAGAUGUCAAUGAGAGGAA 78.5%

547 GUGCCACAGCACACAAAUU 79.2% 601 AACUCUAUUCCAACAAAUG 78.5%

548 GAGGAAUAAGAGUCAGCAA 79.2% 602 GGGACAAGGAUGGUGGACA 78.4%

549 CCUGUAUUCAACUACAACA 79.2% 603 CAAACUCUAUGGAGUAAAA 78.4%

550 CAGCUCAUCCUUCAGCUUU 79.2% 604 GGACAAGGAUGGUGGACAU 78.4%

551 AAUGAGAGGAAUAAGAGUC 79.1% 605 ACAGCACACAAAUUGGCGG 78.4%

552 UGUAUUCAACUACAACAA.G 79.1% 606 CACAGCACACAAAUUGGCG 78.4%

553 AGAGGAAUAAGAGUCAGCA 79.1% 607 CAAUACCUAUCAAUGGAUC 78.3%

554 CUAUCAAUGGAUCAUCAGA 79.1% 608 GACAAGGAUGGUGGACAUU k , 78.3%

555 GGUCAAUACCUAUCAAUGG 79.1% 609 AAACUCUAUGGAGUAAAAU 78.3%

556 GAGUCAGCAAAAUGGGUGU 79.1% 610 CAAAAUGGAAUUUGAACCA 78.1%

557 AUGAGAGGAAUAAGAGUCA 79.1% 611 UGUUGGGAACAAAUGUACA 78.1%

558 UGGUCAAUACCUAUCAAUG 79.1% 612 AACAAAAUGGAAUUUGAAC 78.1%

559 UCUCCUGUAUUCAACUACA 79.1% 613 AACUCUAUGGAGUAAAAUG 78.1%

560 AGAGUCAGCAAAAUGGGUG 79.1% 614 ACAAAAUGGAAUUUGAACC 78.1%

561 GCCACAGCACACAAAUUGG 79.1% 615 CACACAAAUUGGCGGGACA 78.1%

562 CUCCUGUAUUCAACUACAA 79.1% 616 GUUGGGAACAAAUGUACAC 78.1%

563 GUCAAUGAGAGGAAUAAGA 79.1% 617 GCACACAAAUUGGCGGGAC 78.0%

564 AGAGAUGUCAAUGAGAGGA 79.1% 618 CAGAAAGAUGCGAAAGUGC 78.0%

565 UGUCAAUGAGAGGAAUAAG 79.1% 619 AGCACACAAAUUGGCGGGA 78.0%

566 GGAUGGCCAUCAAUUAAUG 79.1% 620 AAAGAUGCGAAAGUGCUUU 78.0%

567 GAGAGGAAUAAGAGUCAGC 79.1% 621 AGAAAGAUGCGAAAGUGCU 78.0%

568 UGCCACAGCACACAAAUUG 79.1% 622 AAGAUGCGAAAGUGCUUUU 78.0%

569 GGAAUAAGAGUCAGCAAAA 79.0% 623 GCGGGACAAGGAUGGUGGA 78.0%

570 UACCUAUCAAUGGAUCAUC 79.0% 624 ACACAAAUUGGCGGGACAA 78.0%

571 UUGGCGGGACAAGGAUGGU 79.0% 625 UGAUGUCGCAGUCUCGCAC 77.9%

572 AUUGGCGGGACAAGGAUGG 79.0% 626 CAACAAAAUGGAAUUUGAA 77.9%

573 UCAAUACCUAUCAAUGGAU 78.9% 627 GGAACAAAUGUACACUCCA 77.8%

574 AUGCAGACCUCAGUGCCAA 78.9% 628 CAAAUGUACACUCCAGGUG 77.8%

575 CAAGGACAAACUCUAUGGA 78.9% 629 CUGAUGUCGCAGUCUCGCA 77.8%

576 AAGGACAAACUCUAUGGAG 78.9% 630 CGGGACAAGGAUGGUGGAC 77.8%

577 AGGAAUAAGAGUCAGCAAA 78.9% 631 GGGAACAAAUGUACACUCC 77.8%

578 GAGUUCACAAUGGUGGGGA 78.9% 632 UUGGGAACAAAUGUACACU 77.8%

579 AAUAAGAGUCAGCAAAAUG 78.9% 633 GAACAAAUGUACACUCCAG 77.8%

580 AGUUCACAAUGGUGGGGAA 78.9% 634 AAAUGUACACUCCAGGUGG 77.8%

581 GAAUAAGAGUCAGCAAAAU 78.9% 635 GAAAGAUGCGAAAGUGCUU 77.8% 582 AUGUCAAUGAGAGGAAUAA 78.9% 636 UGGGAACAAAUGUACACUC 77.8%

583 UUGGUCAAUACCUAUCAAU 78.9% 637 ACAAAUGUACACUCCAGGU 77.8%

584 GUUCACAAUGGUGGGGAAA 78.9% 638 GAGUAGACAUAAACCCUGG 77.8%

585 UCCUGUAUUCAACUACAAC 78.9% 639 AACAAAUGUACACUCCAGG 77.8%

586 UUCUCCUGUAUUCAACUAC 78.8% 640 AGAGUAGACAUAAACCCUG 77.8%

587 CAUGCAGACCUCAGUGCCA 78.8% 641 AUUUUAGAAAUCAAGUCAA 77.7%

588 UGAGGAAUGACGAUGUUGA 78.7% 642 AAGAGUAGACAUAAACCCU 77.7%

643 CAUϋUUAGAAAUCAAGUCA 77.7% 697 GAUAAUAAAGCUUCUCCCU 76.9%

644 GAAGAGUAGACAUAAACCC 77.7% 698 AAAGCUUCUCCCUUUUGCA 76.9%

645 UGAAGACCCAGAUGAAAGC 77.7% 699 AUAAAGCUUCUCCCUϋϋUG 76.9%

646 GGCGGGACAAGGAUGGUGG 77.7% 700 GAACCAUUUCAAUCUUUAG 76.8%

647 AAGACCCAGAUGAAAGCAC 77.7% 701 AGUAGACAUAAACCCUGGU 76.8%

648 AAUGUACACUCCAGGUGGA 77.6% 702 CUCAUCCUUCAGCUUUGGU 76.8%

649 GAUGUCGCAGUCUCGCACU 77.6% 703 UCAUCCUUCAGCUUUGGUG 76.8%

650 UGGCGGGACAAGGAUGGUG 77.6% 704 UGAACCAUUUCAAUCUUUA 76.8%

651 GAAGACCCAGAUGAAAGCA 77.6% 705 AACCAUUUCAAUCUUUAGU 76.8%

652 CAUUUUCAGAAAGAUGCGA 77.5% 706 UUUGAACCAUUUCAAUCUU 76.8%

653 UUUαCAGAAAGAUGCGAAA 77.5% 707 UUGAACCAUUUCAAUCUUU 76.8%

654 AUUUUCAGAAAGAUGCGAA 77.5% 708 UUCUCCCUUUUGCAGCCGC 76.8%

655 AGACCACUAAAAGACUAAC 77.4% 709 CAUCCUUCAGCUUUGGUGG 76.8%

656 UUUCAGAAAGAUGCGAAAG 77.4% 710 CUUCUCCCUUUUGCAGCCG 76.8%

657 AAGACCACUAAAAGACUAA 77.4% 711 GUAUAUACAUUGAAGUUUU 76.7%

658 AUUGAAGACCCAGAUGAAA 77.4% 712 GGAACAAGCAGUAUAUACA 76.7%

659 AACAAGACCACUAAAAGAC 77.4% 713 GAAUUUGAACCAUUUCAAU 76.7%

660 ACAAGACCACUAAAAGACU 77.4% 714 UCUCCCUUUUGCAGCCGCU 76.7%

661 CAGCACACAAAUUGGCGGG 77.4% 715 CUCCCUUUUGCAGCCGCUC 76.7%

662 AGACCCAGAUGAAAGCACA 77.4% 716 UCCCUUUUGCAGCCGCUCC 76.7%

663 GACCACUAAAAGACUAACA 77.4% 717 GGAAUUUGAACCAUUUCAA 76.7%

664 UUCAGAAAGAUGCGAAAGU 77.4% 718 AGUAUAUACAUUGAAGUUU 76.7%

665 GUAGACAUAAACCCUGGUC 77.3% 719 AUUUGAACCAUUUCAAUCU 76.7%

666 ACCCAGAUGAAAGCACAUC 77.3% 720 AAUUUGAACCAUUUCAAUC 76.7%

667 UAGACAUAAACCCUGGUCA 77.3% 721 GAACAAGCAGUAUAUACAU 76.7%

668 CAAGACCACUAAAAGACUA 77.2% 722 GCUUCUCCCUUUUGCAGCC 76.7%

669 UGGUCAUGCAGACCUCAGU 77.2% 723 AACAAGCAGUAUAUACAUU 76.5%

670 GGUCAUGCAGACCUCAGUG 77.2% 724 CCUUUUGCAGCCGCUCCAC 76.4%

671 UCAGAAAGAUGCGAAAGUG 77.2% 725 ACAAACUCUAUGGAGUAAA 76.4%

672 GCACAUCCGGAGUGGAGUC 77.2% 726 GGACAAACUCUAUGGAGUA 76.4%

673 GUCAUGCAGACCUCAGUGC 77.2% 111 GACAAACUCUAUGGAGUAA 76.4%

674 GACCCAGAUGAAAGCACAU 77.2% 728 CAGGAUACUAACAUCAGAA 76.4%

675 GUGUUGGGAACAAAUGUAC 77.1% 729 AGGACAAACUCUAUGGAGU 76.4%

676 AGCAGUAUAUACAUUGAAG 77.1% 730 AACGUGUUGGGAACAAAUG 76.4%

677 GUGGUGGUUAGCAUUGAUC 77.1% 731 GCCAGGAUACUAACAUCAG 76.4%

678 AGCUUCUCCCUUUUGCAGC 77.1% 732 AGGAACGUGUUGGGAACAA 76.4%

679 UGGUGGUUAGCAUUGAUCG 77.1% 733 GAACGUGUUGGGAACAAAU 76.4%

680 GGAUACUAACAUCAGAAUC 77.1% 734 CCAGGAUACUAACAUCAGA 76.4%

681 CGUGUUGGGAACAAAUGUA 77.1% 735 CUUUUGCAGCCGCUCCACC 76.4%

682 AAGCUUCUCCCUUUUGCAG 77.1% 736 CAGUAUAUACAUUGAAGUU 76.4%

683 GCAGUAUAUACAUUGAAGU 77.1% 737 GGAACGUGUUGGGAACAAA 76.4%

684 ACGUGUUGGGAACAAAUGU 77.1% 738 UUUUGCAGCCGCUCCACCA 76.4%

685 AAGCAGUAUAUACAUUGAA 77.0% 739 AGAUGCGAAAGUGCUUUUU 76.3%

686 AGGAUACUAACAUCAGAAU 77.0% 740 GGCCAUUAGAGGCCAAUAC 76.2%

687 GCAUUUUCAGAAAGAUGCG 77.0% 741 AGGCCAUUAGAGGCCAAUA 76.2%

688 ACAAGCAGUAUAUACAUUG 77.0% 742 CCCUUUUGCAGCCGCUCCA 76.2%

689 UCAUGCAGACCUCAGUGCC 77.0% 743 UUAGAGGCCAAUACAGUGG 76.2%

690 CAAGCAGUAUAUACAUUGA 77.0% 744 AAGGCCAUUAGAGGCCAAU 76.1%

691 AGACAUAAACCCUGGUCAU 77.0% 745 CCAUUAGAGGCCAAUACAG 76.1%

692 AAUAAAGCUUCUCCCUUUU 76.9% 746 AUUAGAGGCCAAUACAGUG 76.1%

693 UAAAGCUUCUCCCUUUUGC 76.9% 747 GCCAUUAGAGGCCAAUACA 76.1%

694 AUAAUAAAGCUUCUCCCUU 76.9% 748 UCAGCAGAUCCACUAGCAU 75.9%

695 UAAUAAAGCUUCUCCCUUU 76.9% 749 GUAUCAGCAGAUCCACUAG 75.9%

696 CACAAAUUGGCGGGACAAG 76.9% 750 CAAAAUUCAAUGGUCUCAG 75.9%

751 UAUCAGCAGAUCCACUAGC 75.9% 805 GGGUGGUGGUUAGCAUUGA 74.7%

752 CAGCAGAUCCACUAGCAUC 75.9% 806 AGUAUCAGCAGAUCCACUA 74.7%

753 CAUUAGAGGCCAAUACAGU 75.9% 807 CAGUAUCAGCAGAUCCACU 74.7%

754 GUCAAAAUUCAAUGGUCUC 75.8% 808 AUUAAGCAUCAAUGAACUG 74.7%

755 UCAAAAUUCAAUGGUCUCA 75.8% 809 AGGGUGGUGGUUAGCAUUG 74.7%

756 GGAUCAUCAGAAAUUGGGA 75.8% 810 AGAACUCCGAGAUUGCAAA 74.7%

757 UGGAUCAUCAGAAAUUGGG 75.8% 811 GAACUCCGAGAUUGCAAAA 74.7%

758 GAUCAUCAGAAAUUGGGAA 75.7% 812 GACAGAAGAUACGGACCAG 74.6%

759 AGCAGAUCCACUAGCAUCU 75.7% 813 ACAGAAGAUACGGACCAGC 74.6%

760 UUGAAGACCCAGAUGAAAG 75.6% 814 CAGAACUCUAUUCCAACAA 74.6%

761 ACUCUAUGGAGUAAAAUGA 75.6% 815 CAGAAUGCAGUUCUCUUCA 74.6%

762 AGGAAGACAGAAGAUACGG 75.6% 816 GCAGUAUCAGCAGAUCCAC 74.6%

763 AUCAGCAGAUCCACUAGCA 75.6% 817 GGUGGUGGUUAGCAUUGAU 74.6%

764 UGUCAAAAUUCAAUGGUCU 75.6% 818 CUCCGAGAUUGCAAAAUσU 74.6%

765 AAGGAAGACAGAAGAUACG 75.6% 819 UCCGAGAUUGCAAAAUUUC 74.6%

766 CAAUGGAUCAUCAGAAAUU 75.5% 820 CAGAAGAUACGGACCAGCA 74.6%

767 CCCUGGUCAUGCAGACCUC 75.5% 821 GAAGACAGAAGAUACGGAC 74.5%

768 AAUGGAUCAUCAGAAAUUG 75.5% 822 AGCCAGGAUACUAACAUCA 74.5%

769 AUGGAUCAUCAGAAAUUGG 75.5% 823 ACUCCGAGAUUGCAAAAUU 74.5%

770 AAAGCAGAAUGCAGUUCUC 75.4% 824 GAGGGUGGUGGUUAGCAUU 74.5%

771 UUUGUCAGAACUCUAUUCC 75.4% 825 AAGACAGAAGAUACGGACC 74.5%

772 UUGUCAGAACUCUAUUCCA 75.4% 826 GGAAGACAGAAGAUACGGA 74.5%

773 CAAAGCAGAAUGCAGUUCU 75.4% 827 ACCAACGUGGGAAUGUAUU 74.4%

774 UCAAUGGAUCAUCAGAAAU 75.3% 828 AGACAGAAGAUACGGACCA 74.4%

775 CUCCACCAAAGCAAAGCAG 75.3% 829 CGAGAUUGCAAAAUUUCUC 74.4%

776 GCUCCACCAAAGCAAAGCA 75.3% 830 CCGAGAUUGCAAAAUOUCU 74.4%

777 UAAGCAUCAAUGAACUGAG 75.3% 831 UUAAGCAUCAAUGAACUGA 74.4%

778 UCUAUGGAGUAAAAUGAGU 75.3% 832 GACCAACGUGGGAAUGUAU 74.4%

779 UCAGAACUCUAUUCCAACA 75.2% 833 UAAAUGUGAGGGGAUCAGG 74.2%

780 GAGCCAGGAUACUAACAUC 75.2% 834 UGACUGUAAAUGUGAGGGG 74.1%

781 GCAGAAUGCAGUUCUCUUC 75.2% 835 UGGUGGGGAAAAGAGCAAC 74.1%

782 GAGAGGGUGGUGGUUAGCA 75.2% 836 GUUGCUGGCGGAACAAGCA 74.1%

783 GGAGCCAGGAUACUAACAU 75.2% 837 UUGCUGGCGGAACAAGCAG 74.1%

784 CUGUCAAAAUUCAAUGGUC 75.2% 838 CCAGUUGCUGGCGGAACAA 74.0%

785 CAUUAAGCAUCAAUGAACU 75.2% 839 CACAAGAUGUAAUUAUGGA 74.0%

786 AGCAGAAUGCAGUUCUCUU 75.2% 840 ACAAGAUGUAAUUAUGGAA 74.0%

787 AGAGGGUGGUGGUUAGCAU 75.2% 841 AGGCACAAGAUGUAAUUAU 74.0%

788 AAGCAGAAUGCAGUUCUCU 75.2% 842 GCACAAGAUGUAAUUAUGG 74.0%

789 CUCUAUGGAGUAAAAUGAG 75.2% 843 AAGAUGUAAUUAUGGAAGU 74.0%

790 CGCUCCACCAAAGCAAAGC 75.2% 844 AGAUGUAAUUAUGGAAGUU 74.0%

791 GUCAGAACUCUAUUCCAAC 75.2% 845 GUAAAUGUGAGGGGAUCAG 74.0%

792 GCAUUAAGCAUCAAUGAAC 75.2% 846 GGCACAAGAUGUAAUUAUG 74.0%

793 AGCACAUCCGGAGUGGAGU 75.2% 847 AACCUUGCAAAAGGGGAAA 74.0%

794 CAGAGAGGGUGGUGGUUAG 75.1% 848 ACCUUGCAAAAGGGGAAAA 74.0%

795 UGGGAGCCAGGAUACUAAC 75.1% 849 CAGUUGCUGGCGGAACAAG 74.0%

796 ACAGAGAGGGUGGUGGUUA 75.1% 850 GAGGCACAAGAUGUAAUUA 74.0%

797 AGAGAGGGUGGUGGUUAGC 75.1% 851 CAGAUGAAAGCACAUCCGG 73.9%

798 GUGGGAGCCAGGAUACUAA 75.1% 852 UUUCUCCCAGUUGCUGGCG 73.9%

799 UGUCAGAACUCUAUUCCAA 75.1% 853 CUCCCAGUUGCUGGCGGAA 73.9%

800 GAAGAACUCCGAGAUUGCA 74.9% 854 GGCGGAACAAGCAGUAUAU 73.9%

801 GGGAGCCAGGAUACUAACA 74.8% 855 AUGGUGGGGAAAAGAGCAA 73.9%

802 AACUCCGAGAUUGCAAAAU 74.8% 856 UUCUCCCAGUUGCUGGCGG 73.9%

803 AAGAACUCCGAGAUUGCAA 74.8% 857 UACAAGACUUAUUUUGACA 73.9%

804 GAGAUUGCAAAAUUUCUCC 74.7% 858 AAAUGUGAGGGGAUCAGGG 73.9%

859 ACAAGACUUAUUUUGACAA 73.9% 913 UGAAUACUCCAGUACAGAG 73.2%

860 CAAGACUUAUUUUGACAAA 73.9% 914 CAAAUUGGCGGGACAAGGA 73.2%

861 UCCCAGUUGCUGGCGGAAC 73.9% 915 AAUACAGUGGGUUUGUCAG 73.2%

862 GCGGAACAAGCAGUAUAUA 73.9% 916 CAAUACAGUGGGUUUGUCA 73.2%

863 AUUUCUCCCAGUUGCUGGC 73.9% 917 AAUUGGCGGGACAAGGAUG 73.2%

864 CAAGAUGUAAUUAUGGAAG 73.9% 918 UAGGUAAGGAAGACAGAAG 73.2%

865 AUAUGACUCCAAGCACAGA 73.8% 919 CAAUGAAGUGGGAGCCAGG 73.2%

866 CGGAACAAGCAGUAUAUAC 73.8% 920 AGAUGAAAGCACAUCCGGA 73.2%

867 GAUAUGACUCCAAGCACAG 73.8% 921 CCAAUACAGUGGGUUUGUC 73.2%

868 AGUUGCUGGCGGAACAAGC 73.8% 922 GGUAAGGAAGACAGAAGAU 73.2%

869 UUGAGAAUCAGCUCAUCCU 73.8% 923 GUAAGGAAGACAGAAGAUA 73.2%

870 AAAAGAGCAACAGCUAUAC 73.8% 924 GGUGGGGAAAAGAGCAACA 73.1%

871 AAAGAGCAACAGCUAUACU 73.8% 925 GUAUGGAAAGAAUAAAAGA 73.1%

872 CCAGAUGAAAGCACAUCCG 73.8% 926 AAUACUCCAGUACAGAGAG 73.1%

873 CCCAGUUGCUGGCGGAACA 73.8% 927 ACAAAUUGGCGGGACAAGG 73.1%

874 GACUGUAAAUGUGAGGGGA 73.8% 928 GGAAAAGAGCAACAGCUAU 73.0%

875 CUGGCGGAACAAGCAGUAU 73.8% 929 GUGGAUGAAUACUCCAGUA 73.0%

876 ACUGUAAAUGUGAGGGGAU 73.8% 930 UGGAUGAAUACUCCAGUAC 73.0%

877 UGGCGGAACAAGCAGUAUA 73.8% 931 GAAAAGAGCAACAGCUAUA 73.0%

878 UGUAAAUGUGAGGGGAUCA 73.8% 932 GGAUGAAUACUCCAGUACA 73.0% 879 CUGUAAAUGUGAGGGGAUC 73.8% 933 AGGUAAGGAAGACAGAAGA 73.0% 880 GAAAGCACAUCCGGAGUGG 73.7% 934 GGCAAGGAGACGUGGUGUU 73.0%

881 AAAGCACAUCCGGAGUGGA 73.7% 935 GGGAAAAGAGCAACAGCUA 73.0%

882 UGCUGGCGGAACAAGCAGU 73.7% 936 UAAGGAAGACAGAAGAUAC 72.9%

883 GAAGUGGGAGCCAGGAUAC 73.7% 937 GACCAGCAUUAAGCAUCAA 72.9% 884 AAGUGGGAGCCAGGAUACU 73.7% 938 UAUUCCAACAAAUGAGAGA 72.9%

885 CCAACGUGGGAAUGUAUUA 73.7% 939 GGACCAGCAUUAAGCAUCA 72.9%

886 GCUGGCGGAACAAGCAGUA 73.7% 940 UAUGACUCCAAGCACAGAG 72.9%

887 AGUGGGAGCCAGGAUACUA 73.7% 941 GGUGUGGAUGAAUACUCCA 72.8%

888 AAGACUUAUUUUGACAAAG 73.6% 942 UGGAAGUUGUUUUUCCCAA 72.8%

889 UGAAAGCACAUCCGGAGUG 73.6% 943 GUGACAUGGUGGAAUAGAA 72.8%

890 UGGCCAUGGUGUUUUCACA 73.6% 944 AUGGAAGUUGUUUUUCCCA 72.8%

891 GUGGCCAUGGUGUUUUCAC 73.6% 945 ACCAGCAUUAAGCAUCAAU 72.8%

892 AUGAAUACUCCAGUACAGA 73.5% 946 UGACAUGGUGGAAUAGAAA 72.8% 893 AUGAAGUGGGAGCCAGGAU 73.5% 947 AACAAGAUUUCUCCCAGUU 72.8%

894 AAUGAAGUGGGAGCCAGGA 73.5% 948 GGGGAAAAGAGCAACAGCU 72.8%

895 CCCAAUGAAGUGGGAGCCA 73.5% 949 GAAGUUGUUUUUCCCAAUG 72.8%

896 UGAAGUGGGAGCCAGGAUA 73.5% 950 GGAAGUUGUUUUUCCCAAU 72.8%

897 GGAACAUAGUAAGAAGAGC 73.5% 951 AAGUUGUUUUUCCCAAUGA 72.8% 898 AGGAACAUAGUAAGAAGAG 73.5% 952 GGGUGUGGAUGAAUACUCC 72.8%

899 AAGCACAUCCGGAGUGGAG 73.5% 953 CAAUAGCCGAAGCAAUAAU 72.7%

900 CCCAGAUGAAAGCACAUCC 73.5% 954 UCAAUAGCCGAAGCAAUAA 72.7%

901 AUGAAAGCACAUCCGGAGU 73.5% 955 AAAACAAGAUUUCUCCCAG 72.7%

902 UCUCCCAGUUGCUGGCGGA 73.5% 956 GAAAAACAAGAUUUCUCCC 72.7%

903 GAUGAAUACUCCAGUACAG 73.5% 957 AAACAAGAUUUCUCCCAGU 72.7%

904 AGACUUAUUUUGACAAAGU 73.3% 958 GUGUGGAUGAAUACUCCAG 72.7%

905 CCAAUGAAGUGGGAGCCAG 73.3% 959 ACAAGAUUUCUCCCAGUUG 72.7%

906 AUGGCCAUCAAUUAAUGUU 73.3% 960 CAAGAUUUCUCCCAGUUGC 72.7%

907 UGGGGAAAAGAGCAACAGC 73.2% 961 UGGGUGUGGAUGAAUACUC 72.7%

908 GAAUACUCCAGUACAGAGA 73.2% 962 UUCAUUUUAGAAAUCAAGU 72.6%

909 GAUGAAAGCACAUCCGGAG 73.2% 963 CAGUCAAUAGCCGAAGCAA 72.6%

910 GUGGGGAAAAGAGCAACAG 73.2% 964 AUACAGUGGGUUUGUCAGA 72.6%

911 AAAUUGGCGGGACAAGGAU 73.2% 965 ACAGUGGGUUUGUCAGAAC 72.6%

912 GGCCAUGGUGUUUUCACAA 73.2% 966 GUCAAUAGCCGAAGCAAUA 72.6%

967 UACAGUGGGUϋϋGUCAGAA 72.6% 1021 CUUGAUGGUUGCAUACAUG 72.2%

968 AAGAUUUCUCCCAGUUGCϋ 72.6% 1022 AGCAAAAUGGGUGUGGAUG 72.2%

969 GUUCAUUUUAGAAAUCAAG 72.6% 1023 CCACCAAAGCAAAGCAGAA 72.2%

970 GGGCAAGGAGACGUGGUGU 72.6% 1024 AAAGAAGAACUCCGAGAUU 72.2%

971 AUGGGUGUGGAUGAAUACU 72.6% 1025 UAUUACCAGAUAUGACUCC 72.1%

972 UCAUUUUAGAAAUCAAGUC 72.6% 1026 CAGCAAAAUGGGUGUGGAU 72.1%

973 UUCAGCUCAUAGUGAGUGG 72.5% 1027 UGGUUCAGCUCAUAGUGAG 72.1%

974 CCAGAUAUGACUCCAAGCA 72.5% 1028 GUUCAGCUCAUAGUGAGUG 72.1%

975 CAAAGCAAAGCAGAAUGCA 72.5% 1029 AAGAGUCAGCAAAAUGGGU 72.1%

976 AGUCAAUAGCCGAAGCAAU 72.5% 1030 GUCAGCAAAAUGGGUGUGG 72.1%

977 CCAAAGCAAAGCAGAAUGC 72.5% 1031 UCAGCAAAAUGGGUGUGGA 72.1%

978 UACAACAAGACCACUAAAA 72.4% 1032 ACCUGAAUUUCGUCAACAG 72.0%

979 AGAAAUCAAGUCAAGAUAC 72.4% 1033 CUCCCUUGAUGGUUGCAUA 72.0%

980 AGCAAAGCAGAAUGCAGUU 72.4% 1034 CUAUUCCAACAAAUGAGAG 72.0%

981 CAGAUAUGACUCCAAGCAC 72.4% 1035 UCCCUUGAUGGUUGCAUAC 72.0%

982 GUAAAAUGAGUGAUGCUGG 72.4% 1036 UCUCCCUUGAUGGUUGCAU 72.0%

983 GAAAUCAAGUCAAGAUACG 72.4% 1037 GGUUCAGCUCAUAGUGAGU 72.0%

984 UUAUGGAAGUUGUUUUUCC 72.4% 1038 AUUACCAGAUAUGACUCCA 72.0%

985 CAACAAGACCACUAAAAGA 72.4% 1039 CAUCCGGAGUGGAGUCCGC 72.0%

986 AAAGCAAAGCAGAAUGCAG 72.4% 1040 GACCUGAAUUUCGUCAACA 72.0%

987 UACCAGAUAUGACUCCAAG 72.4% 1041 UUGGUUCAGCUCAUAGUGA 72.0%

988 AGAUAUGACUCCAAGCACA 72.4% 1042 CCAAAAGUAUACAAGACUU 71.9%

989 AUUAUGGAAGUUGUUUUUC 72.4% 1043 ACUGAAGAACAAGCUGUGG 71.9%

990 ACAACAAGACCACUAAAAG 72.4% 1044 CUGAAGAACAAGCUGUGGA 71.9%

991 AAGCAAAGCAGAAUGCAGU 72.4% 1045 CAAAAGUAUACAAGACUUA 71.9%

992 GAGUAAAAUGAGUGAUGCU 72.4% 1046 AUUGGGAAGCUGUCAAAAU 71.9%

993 AGUAAAAUGAGUGAUGCUG 72.4% 1047 AAAGAUGCCGGCACUUUAA 71.9%

994 ACCAGAUAUGACUCCAAGC 72.4% 1048 AUUGGUUCAGCUCAUAGUG 71.9%

995 UGUGGAUGAAUACUCCAGU 72.4% 1049 UGAAGAACAAGCUGUGGAU 71.9%

996 AAAAACAAGAUUUCUCCCA 72.4% 1050 AAUUGGGAAGCUGUCAAAA 71.9%

997 GCAAAGCAGAAUGCAGUUC 72.3% 1051 AGUCAGCAAAAUGGGUGUG 71.7%

998 GAUUGGUUCAGCUCAUAGU 72.3% 1052 ACAUCCGGAGUGGAGUCCG 71.7%

999 UUUUAGAAAUCAAGUCAAG 72.3% 1053 UGCAAAAUUUCUCCCUUGA 71.7%

1000 UAUGGAAGUUGUUUUUCCC 72.3% 1054 AAAAUUUCUCCCUUGAUGG 71.7%

1001 AGAUUGGUUCAGCUCAUAG 72.3% 1055 UUUUGGUCAAUACCUAUCA 71.7%

1002 AAUGGGUGUGGAUGAAUAC 72.3% 1056 GCAAAAUUUCUCCCUUGAU 71.7%

1003 UUUAGAAAUCAAGUCAAGA 72.3% 1057 CAAAAUUUCUCCCUUGAUG 71.7%

1004 CUACAACAAGACCACUAAA 72.3% 1058 CACAAUGGUGGGGAAAAGA 71.7%

1005 UAGAAAUCAAGUCAAGAUA 72.3% 1059 AAAUUUCUCCCUUGAUGGU 71.7% 1006 UUAGAAAUCAAGUCAAGAU 72.3% 1060 GAGUUCGAGACCAACGUGG 71.6%

1007 UUACCAGAUAUGACUCCAA 72.3% 1061 AUUUCUCCCUUGAUGGUUG 71.6%

1008 ACCAAAGCAAAGCAGAAUG 72.2% 1062 UGUUGGUAAUGAAACGAAA 71.6%

1009 ACAAUGGUGGGGAAAAGAG 72.2% 1063 UAUACAAGACUUAUUUUGA 71.6%

1010 CAAUGGUGGGGAAAAGAGC 72.2% 1064 UGGGAAGCUGUCAAAAUUC 71.6%

1011 CCCUUGAUGGUUGCAUACA 72.2% 1065 GAGAGUUCGAGACCAACGU 71.6%

1012 CACCAAAGCAAAGCAGAAU 72.2% 1066 AAAUUGGGAAGCUGUCAAA 71.6%

1013 AAUGGUGGGGAAAAGAGCA 72.2% 1067 AGUAUACAAGACUUAUUUU 71.6%

1014 AAAUGGGUGUGGAUGAAUA 72.2% 1068 GUGUUGGUAAUGAAACGAA 71.6%

1015 CAAAAUGGGUGUGGAUGAA 72.2% 1069 AAGUAUACAAGACUUAUUU 71.6%

1016 AAAAUGGGUGUGGAUGAAU 72.2% 1070 AAAGUAUACAAGACUUAUU 71.6%

1017 AAGAAGAACUCCGAGAUUG 72.2% 1071 AGAGUUCGAGACCAACGUG 71.6%

1018 GCAAAAUGGGUGUGGAUGA 72.2% 1072 GAAAAGAUGCCGGCACUUU 71.6%

1019 CCUUGAUGGUUGCAUACAU 72.2% 1073 UUCUCCCUUGAUGGUUGCA 71.6%

1020 AGAAGAACUCCGAGAUUGC 72.2% 1074 GGAAAAGAUGCCGGCACUU 71.6%

1075 GUAUACAAGACUUAUUUUG 71.6% 1129 CUGAGUCGGUUUUGGUCAA 71.2%

1076 UUGGGAAGCUGUCAAAAUU 71.6% 1130 AGUCGGUUUUGGUCAAUAC 71.2%

1077 GGGAAGCUGUCAAAAUUCA 71.6% 1131 GAGUCGGUUUUGGUCAAUA 71.2%

1078 AGAUUGCAAAAUUUCUCCC 71.6% 1132 CAGAAAUUGGGAAGCUGUC 71.1%

1079 GUUGGUAAUGAAACGAAAA 71.6% 1133 UUUUCCCAAUGAAGUGGGA 71.1%

1080 UUUCUCCCUUGAUGGUUGC 71.6% 1134 UUCCCAAUGAAGUGGGAGC 71.1%

1081 AAUUAUGGAAGUUGUUUUU 71.5% 1135 UUUCCCAAUGAAGUGGGAG 71.1%

1082 UGUUUUUCCCAAUGAAGUG 71.5% 1136 AGACCAACGUGGGAAUGUA 71.0%

1083 AUUGCAAAAUUUCUCCCUU 71.5% 1137 AAGAUGCCGGCACUUUAAU 71.0%

1084 UUUUUCCCAAUGAAGUGGG 71.5% 1138 UGAGUCGGUUUUGGUCAAU 71.0%

1085 GUUUUUCCCAAUGAAGUGG 71.5% 1139 AAAUCAAGUCAAGAUACGC 71.0%

1086 AAUUUCUCCCUUGAUGGUU 71.5% 1140 CGGUUUUGGUCAAUACCUA 71.0%

1087 UUCUCGGAAAAGAUGCCGG 71.5% 1141 UCGGUUUUGGUCAAUACCU 71.0%

1088 UUUGGUCAAUACCUAUCAA 71.5% 1142 CGAGACCAACGUGGGAAUG 71.0%

1089 GAUUGCAAAAUUUCUCCCU 71.5% 1143 GAGACCAACGUGGGAAUGU 71.0%

1090 AAAAGUAUACAAGACUUAU 71.5% 1144 AGAUGCCGGCACUUUAAUϋ 71.0%

1091 ACCCAAAAGUAUACAAGAC 71.5% 1145 GAUGCCGGCACUUUAAUUG 70.9%

1092 GUUUUGGUCAAUACCϋAUC 71.5% 1146 AUGCCGGCACUUUAAUUGA 70.9%

1093 CACAUCCGGAGUGGAGUCC 71.5% 1147 UGCCGGCACUUUAAUUGAA 70.9%

1094 UUGUUUUUCCCAAUGAAGU 71.5% 1148 GCCGGCACUUUAAUUGAAG 70.8%

1095 GAAAUUGGGAAGCUGUCAA 71.4% 1149 UGAGGAGUUCACAAUGGUG 70.8%

1096 CGGAAAAGAUGCCGGCACU 71.4% 1150 CCGGCACUUUAAUUGAAGA 70.8%

1097 UCCACCAAAGCAAAGCAGA 71.4% 1151 AGUUCGAGACCAACGUGGG 70.8%

1098 AGAAAUUGGGAAGCUGUCA 71.4% 1152 AAUGUAUUAUUAUCUCCUG 70.8%

1099 AAGCUGUCAAAAUUCAAUG 71.4% 1153 UCAUCAGAAAUUGGGAAGC 70.8%

1100 UAUGAGGAGUUCACAAUGG 71.4% 1154 UUCGAGACCAACGUGGGAA 70.8%

1101 UCGGAAAAGAUGCCGGCAC 71.4% 1155 AUCAUCAGAAAUUGGGAAG 70.8%

1102 GGAAGCUGUCAAAAUUCAA 71.4% 1156 AUGUAUUAUUAUCUCCUGA 70.8%

1103 GCUGUCAAAAUUCAAUGGU 71.4% 1157 UCGAGACCAACGUGGGAAU 70.8%

1104 AUGAGGAGUUCACAAUGGU 71.4% 1158 GUUCGAGACCAACGUGGGA 70.8%

1105 GAAGCUGUCAAAAUUCAAU 71.4% 1159 GGUUUUGGUCAAUACCUAU 70.8%

1106 AGUUGUUUUUCCCAAUGAA 71.4% 1160 GGAAUGUAUUAUUAUCUCC 70.7%

1107 GUUGUUUUUCCCAAUGAAG 71.4% 1161 GGGAAUGUAUUAUUAUCUC 70.7%

1108 UUGCAAAAUUUCUCCCUUG 71.4% 1162 CAUCAGAAAUUGGGAAGCU 70.7%

1109 AGCUGUCAAAAUUCAAUGG 71.4% 1163 UCCCAAUGAAGUGGGAGCC 70.6%

1110 CUCGGAAAAGAUGCCGGCA 71.4% 1164 UAAGGCCAUUAGAGGCCAA 70.6%

1111 GAACAGUCAAUAGCCGAAG 71.3% 1165 GUCGGUUUUGGUCAAUACC 70.6%

1112 UACCCAAAAGUAUACAAGA 71.3% 1166 GGUCCUGAGUCGGUUUUGG 70.6%

1113 UCAAUGAUGUGGGAGAUUA 71.3% 1167 GUCCUGAGUCGGUUUUGGU 70.6%

1114 AAAAGAUGCCGGCACUUUA 71.3% 1168 CUAAGGCCAUUAGAGGCCA 70.6%

1115 GACAUGGUGGAAUAGAAAU 71.3% 1169 CCUAAGGCCAUUAGAGGCC 70.6%

1116 CAAUGAUGUGGGAGAUUAA 71.3% 1170 CGGCACUUUAAUUGAAGAC 70.6%

1117 GAAUAAAAGAACUACGGAA 71.3% 1171 ACAGUCAAUAGCCGAAGCA 70.5%

1118 AUCAGAAAUUGGGAAGCUG 71.3% 1172 UUCUCAUUAUAGGUAAGGA 70.3%

1119 GUGACCUGAAUUUCGUCAA 71.3% 1173 UCCUGAGUCGGUUUUGGUC 70.3%

1120 AGGUGACCUGAAUUUCGUC 71.3% 1174 CUCAUUAUAGGUAAGGAAG 70.3%

1121 GGUGACCUGAAUUUCGUCA 71.3% 1175 UCAUUAUAGGUAAGGAAGA 70.3%

1122 UCAGAAAUUGGGAAGCUGU 71.2% 1176 UUUCUCAUUAUAGGUAAGG 70.3%

1123 UGACCUGAAUUUCGUCAAC 71.2% 1177 GAAUGUAUUAUUAUCUCCU 70.1%

1124 CCCAAAAGUAUACAAGACU 71.2% 1178 UCUCAUUAUAGGUAAGGAA 70.1%

1125 AUACAAGACUUAUUUUGAC 71.2% 1179 GAUCAGAUCGAGUGAUGGU 70.1%

1126 UCUCGGAAAAGAUGCCGGC 71.2% 1180 GGAUCAGAUCGAGUGAUGG 70.1%

1127 CCUGAGUCGGUUUUGGUCA 71.2% 1181 AUGGGAAUGGUUGGAGUAU 70.0%

1128 AACAGUCAAUAGCCGAAGC 71.2% 1182 UGGGAAUGGUUGGAGUAUU 70.0%

Table 2-2: Conserved 19-mer sequences that are present in at least 70% of the Influenza A segment 2 (PBl) sequences listed in Table 1-2.

SeqID Sequence Percent 1237 GAUCAUGAAGAUCUGUUCC 88.0%

1183 AUGAUGAUGGGCAUGUUCA 96.7% 1238 AGUGAAUGCACCAAAUCAU 88.0%

1184 UGAUGAUGGGCAUGUUCAA 96.7% 1239 GGAAUUCUUGAGGAUGAAC 88.0%

1185 AUCUGUUCCACCAUUGAAG 91.1% 1240 GGCCAUGGUGUCUAGGGCC 87.9%

1186 UCUGUUCCACCAUUGAAGA 91.1% 1241 AUCAUGAGGGAAUACAAGC 87.9%

1187 GAUCUGUUCCACCAUUGAA 90.9% 1242 CAUAGUGAAUGCACCAAAU 87.9% 1188 GAAGAUCUGUUCCACCAUU 90.8% 1243 CCUCCAUACAGCCAUGGAA 87.8%

1189 AAGAUCUGUUCCACCAUUG 90.7% 1244 AUACAAGCAGGAGUGGAUA 87.8%

1190 UGAAGAUCUGUUCCACCAU 90.7% 1245 CUCCAUACAGCCAUGGAAC 87.8%

1191 AGAUCUGUUCCACCAUUGA 90.7% 1246 AAUCAUGAGGGAAUACAAG 87.6%

1192 UGAUAAACAAUGACCUUGG 90.5% 1247 UCCAUACAGCCAUGGAACA 87.6%

1193 AUGAUAAACAAUGACCUUG 90.5% 1248 UACAAGCAGGAGUGGAUAG 87.6%

1194 CCAUACAGCCAUGGAACAG 89.2% 1249 UGUUCUCAAACAAAAUGGC 87.5%

1195 CAUACAGCCAUGGAACAGG 89.2% 1250 AUGUUCUCAAACAAAAUGG 87.5%

1196 GAGGCCAUGGUGUCUAGGG 89.1% 1251 UCACACAAAGAACAAUAGG 87.5%

1197 AGGCCAUGGUGUCUAGGGC 89.1% 1252 AUGGUCACACAAAGAACAA 87.4%

1198 GGAGGCCAUGGUGUCUAGG 89.0% 1253 UACACCAUGGACACAGUCA 87.4%

1199 UGGAGGCCAUGGUGUCUAG 88.9% 1254 UGGUCACACAAAGAACAAU 87.4%

1200 GAGAUCAUGAAGAUCUGUU 88.8% 1255 GUCACACAAAGAACAAUAG 87.4%

1201 ACCAAGAAAAUGGUCACAC 88.7% 1256 GGUCACACAAAGAACAAUA 87.4%

1202 CCAAGAAAAUGGUCACACA 88.7% 1257 ACACCAUGGACACAGUCAA 87.4% 1203 UGACCAAGAAAAUGGUCAC 88.7% 1258 CCAGCGCAAAAUGCCAUAA 87.3%

1204 AUGACCAAGAAAAUGGUCA 88.7% 1259 GCAUGGUGGAGGCCAUGGU 87.3%

1205 CAUGAAGAUCUGUUCCACC 88.6% 1260 UUGAUGGACCACUACCUGA 87.3%

1206 AUGAAGAUCUGUUCCACCA 88.6% 1261 AGCAUGGUGGAGGCCAUGG 87.3%

1207 CAAGAAAAUGGUCACACAA 88.6% 1262 AAUGCACCAAAUCAUGAGG 87.2%

1208 UCAUGAAGAUCUGUUCCAC 88.5% 1263 AAUGUUCUCAAACAAAAUG 87.2%

1209 AUCAUGAAGAUCUGUUCCA 88.5% 1264 AUGCACCAAAUCAUGAGGG 87.2%

1210 GAUGGGCAUGUUCAACAUG 88.4% 1265 AUGGUGGAGGCCAUGGUGU 87.2%

1211 UGAUGGGCAUGUUCAACAU 88.4% 1266 GGUGGAGGCCAUGGUGUCU 87.2%

1212 AUGAGGGAAUACAAGCAGG 88.4% 1267 AUUGAUGGACCACUACCUG 87.2%

1213 ACAUGACCAAGAAAAUGGU 88.4% 1268 UGGUGGAGGCCAUGGUGUC 87.2%

1214 CAUGAGGGAAUACAAGCAG 88.4% 1269 AUAAUGUUCUCAAACAAAA 87.2%

1215 GAUGAUGGGCAUGUUCAAC 88.4% 1270 UAAUGUUCUCAAACAAAAU 87.2%

1216 AACAUGACCAAGAAAAUGG 88.4% 1271 GUGGAGGCCAUGGUGUCUA 87.1%

1217 AUGAUGGGCAUGUUCAACA 88.4% 1272 UGGCAAAUGUUGUGAGAAA 87.1%

1218 CAUGACCAAGAAAAUGGUC 88.3% 1273 GAAUGCACCAAAUCAUGAG 87.1%

1219 CUCAUAGUGAAUGCACCAA 88.3% 1274 AAUGGUCACACAAAGAACA 87.0%

1220 GACCAAGAAAAUGGUCACA 88.3% 1275 GAUCCUCCAUACAGCCAUG 86.9%

1221 UCAUAGUGAAUGCACCAAA 88.3% 1276 AUCCUCCAUACAGCCAUGG 86.9%

1222 AUGGGCAUGUUCAACAUGC 88.2% 1277 CAUGGUGGAGGCCAUGGUG 86.7%

1223 ACAACAUGACCAAGAAAAU 88.2% 1278 AAAUGGUCACACAAAGAAC 86.6%

1224 AUAGUGAAUGCACCAAAUC 88.2% 1279 GCAUGUUCAACAUGCUAAG 86.6%

1225 UAGUGAAUGCACCAAAUCA 88.2% 1280 AGGGAAUACAAGCAGGAGU 86.6%

1226 GCCAUGGUGUCUAGGGCCC 88.2% 1281 GGGCAUGUUCAACAUGCUA 86.6%

1227 GACAACAUGACCAAGAAAA 88.2% 1282 GGCAUGUUCAACAUGCUAA 86.6%

1228 UGGGCAUGUUCAACAUGCU 1283 ACCAUGGACACAGUCAACA 86.6%

1229 CCAUGGUGUCUAGGGCCCG 88.2% 1284 GAUAUGCACAAACAGACUG 86.6%

1230 CAACAUGACCAAGAAAAUG 88.2% 1285 AAAAUGGUCACACAAAGAA 86.6%

1231 UGAAUGCACCAAAUCAUGA 88.1% 1286 CCAUGGACACAGUCAACAG 86.6%

1232 GUGAAUGCACCAAAUCAUG 88.0% 1287 GGAUAUGCACAAACAGACU 86.6%

1233 AGAUCAUGAAGAUCUGUUC 88.0% 1288 CACCAUGGACACAGUCAAC 86.5%

1234 UCAUGAGGGAAUACAAGCA 88.0% 1289 ACACAGUCAACAGAACACA 86.5%

1235 CAGCGCAAAAUGCCAUAAG 88.0% 1290 GACACAGUCAACAGAACAC 86.5%

1236 GAAUUCUUGAGGAUGAACA 88.0% 1291 GAGGGAAUACAAGCAGGAG 86.5%

1292 AUGUUCAACAUGCUAAGUA 86.4% 1351 AGAACAACAUGAUAAACAA 85.2%

1293 ACAUUCCCUUAUACUGGAG 86.4% 1352 GGGAAUUCUUGAGGAUGAA 85.2%

1294 UGUUCAACAUGCUAAGUAC 86.4% 1353 CAGGAGUGGAUAGAUUCUA 85.2%

1295 CAUUCCCUUAUACUGGAGA 86.4% 1354 AGUGGAUAUGCACAAACAG 85.2%

1296 AAGAAAAUGGUCACACAAA 86.3% 1355 AAGAACAACAUGAUAAACA 85.2%

1297 AAUACAAGCAGGAGUGGAU 86.3% 1356 AUAAAGAACAACAUGAUAA 85.2%

1298 AGAAAAUGGUCACACAAAG 86.3% 1357 UGGUGUCUAGGGCCCGGAU 85.0%

1299 GAAUACAAGCAGGAGUGGA 86.3% 1358 UGAUAAAGAACAACAUGAU 85.0% 1300 GGAAUACAAGCAGGAGUGG 86.3% 1359 AGCAGGAGUGGAUAGAUUC 85.0% 1301 UGAGGGAAUACAAGCAGGA 86.3% 1360 GUGAUAAAGAACAACAUGA 85.0% 1302 GGGAAUACAAGCAGGAGUG 86.3% 1361 GCAGGAGUGGAUAGAUUCU 85.0% 1303 GGACACAGUCAACAGAACA 86.2% 1362 GAUAAAGAACAACAUGAUA 85.0%

1304 AAGCAGGAGUGGAUAGAUU 86.2% 1363 AUGGUGUCUAGGGCCCGGA 85.0%

1305 UUGAGGAUGAACAGAUGUA 86.2% 1364 UGUCUAGGGCCCGGAUUGA 85.0%

1306 UGGACACAGUCAACAGAAC 86.2% 136S CAUGGUGUCUAGGGCCCGG 84.9% 1307 AUGGACACAGUCAACAGAA 86.2% 1366 GAACAACAUGAUAAACAAU 84.9%

1308 AAAUCAUGAGGGAAUACAA 86.2% 1367 UGACUAAUUCACAAGACAC 84.8%

1309 CAAGCAGGAGUGGAUAGAU 86.2% 1368 GGAAGGCUAAUAGAUUUCC 84.8%

1310 CAAAUCAUGAGGGAAUACA 86.0% 1369 UUCCCUUAUACUGGAGAUC 84.8%

1311 CAUGGACACAGUCAACAGA 86.0% 1370 UCCCUUAUACUGGAGAUCC 84.8%

1312 CUUGAGGAUGAACAGAUGU 86.0% 1371 GAAGGCUAAUAGAUUUCCU 84.8%

1313 ACAAGCAGGAGUGGAUAGA 86.0% 1372 AUUGACUUCGAGUCUGGAC 84.8%

1314 CAUGUUCAACAUGCUAAGU 85.9% 1373 GGUGUCUAGGGCCCGGAUU 84.8%

1315 UAACAGUGAUAAAGAACAA 85.9% 1374 UAAUAGAUUUCCUCAAGGA 84.8%

1316 GAAAAUGGUCACACAAAGA 85.8% 1375 AACAGUGAUAAAGAACAAC 84.8%

1317 UGCACCAAAUCAUGAGGGA 85.7% 1376 GUGUCUAGGGCCCGGAUUG 84.8%

1318 GGCUAAUAGAUUUCCUCAA 85.7% 1377 ACUUCGAGUCUGGACGGAU 84.7%

1319 UCCUCCAUACAGCCAUGGA 85.6% 1378 ACAUGAUAAACAAUGACCU 84.7%

1320 CACCAAAUCAUGAGGGAAU 85.6% 1379 GUUCCAGCGCAAAAUGCCA 84.7%

1321 CCAAAUCAUGAGGGAAUAC 85.6% 1380 AUGACUAAUUCACAAGACA 84.7%

1322 ACCAAAUCAUGAGGGAAUA 85.6% 1381 AACAUGAUAAACAAUGACC 84.7%

1323 GCACCAAAUCAUGAGGGAA 85.6% 1382 CUAGGGCCCGGAUUGAUGC 84.7%

1324 GUAACAGUGAUAAAGAACA 85.6% 1383 GAUGACUAAUUCACAAGAC 84.7%

1325 UCUUGAGGAUGAACAGAUG 85.5% 1384 UCUAGGGCCCGGAUUGAUG 84.7%

1326 AAUUCUUGAGGAUGAACAG 85.5% 1385 AUUCCCUUAUACUGGAGAU 84.7%

1327 AGAUCCUCCAUACAGCCAU 85.5% 1386 UUGCAACUACACACUCCUG 84.6%

1328 UUCCAGCGCAAAAUGCCAU 85.5% 1387 UAAGAGACAACAUGACCAA 84.6%

1329 CUGGAGAUCCUCCAUACAG 85.5% 1388 UGCAACUACACACUCCUGG 84.6% 1330 ACAACAUGAUAAACAAUGA 85.5% 1389 CAACAUGAUAAACAAUGAC 84.6%

1331 GAGAUCCUCCAUACAGCCA 85.5% 1390 GUUGCAACUACACACUCCU 84.6%

1332 AACAACAUGAUAAACAAUG 85.5% 1391 CCAGCAACAGCCCAGAUGG 84.6%

1333 ACUGGAGAUCCUCCAUACA 85.5% 1392 CAGCAACAGCCCAGAUGGC 84.6%

1334 GGAGAUCCUCCAUACAGCC 85.5% 1393 GACUUCGAGUCUGGACGGA 84.5%

1335 UCCAGCGCAAAAUGCCAUA 85.5% 1394 AACUGGCAAAUGUUGUGAG 84.5% 1336 UUCUUGAGGAUGAACAGAU 85.5% 1395 GCAACUACACACUCCUGGA 84.5%

1337 AUUCUUGAGGAUGAACAGA 85.5% 1396 CUGGCAAAUGUUGUGAGAA 84.5%

1338 AGGGGAAUUCUUGAGGAUG 85.4% 1397 AAACUGGCAAAUGUUGUGA 84.5% 1339 UGGAGAUCCUCCAUACAGC 85.4% 1398 AAGUGGGAGCUAAUGGAUG 84.5%

1340 AUGAUGACUAAUUCACAAG 85.4% 1399 CUAAUAGAUUUCCUCAAGG 84.5%

1341 UGAUGACUAAUUCACAAGA 85.4% 1400 AGUGGGAGCUAAUGGAUGA 84.5%

1342 GUGGAUAUGCACAAACAGA 85.4% 1401 UAUGCACAAACAGACUGUG 84.3%

1343 UGGAUAUGCACAAACAGAC 85.3% 1402 CAACUACACACUCCUGGAU 84.3%

1344 AGUGAUAAAGAACAACAUG 85.3% 1403 ACUGGCAAAUGUUGUGAGA 84.3%

1345 ACAGUGAUAAAGAACAACA 85.3% 1404 AUGCACAAACAGACUGUGU 84.3%

1346 GGGGAAUUCUUGAGGAUGA 85.3% 1405 UGAAUGGAUGUCAAUCCGA 84.2%

1347 UAAAGAACAACAUGAUAAA 85.3% 1406 GAAUGGAUGUCAAUCCGAC 84.2%

1348 CAGUGAUAAAGAACAACAU 85.3% 1407 GAGUσUGGACGGAUUAAGA 84.2%

1349 UUGACUUCGAGUCUGGACG 85.3% 1408 UGACUUCGAGUCUGGACGG 84.2%

1350 AAAGAACAACAUGAUAAAC 85.2% 1409 GUCUAGGGCCCGGAUUGAU 84.1%

1410 AACUACACACUCCUGGAUU 84.1% 1469 GGCCCGGAUUGAUGCCAGA 83.4%

1411 UUGAAUGGAUGUCAAUCCG 84.1% 1470 AUACUGGAGAUCCUCCAUA 83.4%

1412 GAGACAACAUGACCAAGAA 84.1% 1471 UUACAUAUAUCACAAAAAA 83.3%

1413 AGGCUAAUAGAUUUCCUCA 84.1% 1472 CAACAGCCCAGAUGGCUCU 83.3%

1414 CGCAAAAUGCCAUAAGCAC 84.1% 1473 AGCCAUGGAACAGGAACAG 83.3%

1415 AGUCUGGACGGAUUAAGAA 84.1% 1474 GCCAUGGAACAGGAACAGG 83.3%

1416 AUAUGCACAAACAGACUGU 84.1% 1475 UUAUACUGGAGAUCCUCCA 83.3%

1417 UUUGAAUGGAUGUCAAUCC 84.0% 1476 CAAACUGGCAAAUGUUGUG 83.3%

1418 GCGCAAAAUGCCAUAAGCA 84.0% 1477 UACUGGAGAUCCUCCAUAC 83.3%

1419 CUACACACUCCUGGAUUCC 84.0% 1478 GAGUAACAGUGAUAAAGAA 83.3%

1420 UUCGAGUCUGGACGGAUUA 84.0% 1479 CUUAUACUGGAGAUCCUCC 83.3%

1421 UCGAGUCUGGACGGAUUAA 84.0% 1480 CCUUAUACUGGAGAUCCUC 83.3%

1422 AAGGCUAAUAGAUUUCCUC 84.0% 1481 GAAUUGACUUCGAGUCUGG 83.3%

1423 AGAGACAACAUGACCAAGA 84.0% 1482 AAGAAGUCCUAUAUAAAUA 83.3%

1424 CGAGUCUGGACGGAUUAAG 84.0% 1483 GCAACAGCCCAGAUGGCUC 83.3%

1425 ACUACACACUCCUGGAUUC 84.0% 1484 AUUUGAAUGGAUGUCAAUC 83.3%

1426 AGAGUAAGAGACAACAUGA 83.9% 1485 GCCAGAAUUGACUUCGAGU 83.2%

1427 AGCGCAAAAUGCCAUAAGC 83.9% 1486 CCAGAAUUGACUUCGAGUC 83.2%

1428 AGUAAGAGACAACAUGACC 83.9% 1487 CAAAGGGGAAUUCUUGAGG 83.2%

1429 GUAAGAGACAACAUGACCA 83.9% 1488 CAGAAUUGACUUCGAGUCU 83.2%

1430 UCAAUGGAUAAAGAGGAAA 83.9% 1489 AAAGGGGAAUUCUUGAGGA 83.2%

1431 AAGAGACAACAUGACCAAG 83.9% 1490 UCAGCUGAUAUGAGCAUUG 83.2%

1432 CAAUGGAUAAAGAGGAAAU 83.9% 1491 CCCGGAUUGAUGCCAGAAU 83.2%

1433 GAGUAAGAGACAACAUGAC 83.9% 1492 GAGUGGAUAGAUUCUACAG 83.2%

1434 GCCAAAGGGGAAUUCUUGA 83.8% 1493 AGUAACAGUGAUAAAGAAC 83.2%

1435 AGACAACAUGACCAAGAAA 83.8% 1494 UACAUAUAUCACAAAAAAU 83.2%

1436 CUUCGAGUCUGGACGGAUU 83.8% 1495 CAGCUGAUAUGAGCAUUGG 83.2%

1437 AUGAUUACAUAUAUCACAA 83.8% 1496 GGAGUGGAUAGAUUCUACA 83.2%

1438 AGCCAAAGGGGAAUUCUUG 83.8% 1497 AGAAUUGACUUCGAGUCUG 83.2% 1439 GGACCAGCAACAGCCCAGA 83.7% 1498 CGAUGAUUACAUAUAUCAC 83.1%

1440 GACCAGCAACAGCCCAGAU 83.7% 1499 AAGGGGAAUUCUUGAGGAU 83.1%

1441 UGAUUACAUAUAUCACAAA 83.7% 1500 GAUUUCCUCAAGGAUGUGA 83.1%

1442 AUAGAUUUCCUCAAGGAUG 83.7% 1501 CUGAUAUGAGCAUUGGAGU 83.1%

1443 ACCAGCAACAGCCCAGAUG 83.7% 1502 GCUGAUAUGAGCAUUGGAG 83.1%

1444 ACAUAUAUCACAAAAAAUC 83.7% 1503 GCCCGGAUUGAUGCCAGAA 83.1%

1445 UAGAUUUCCUCAAGGAUGU 83.7% 1504 AAUAGAUUUCCUCAAGGAU 83.0%

1446 CAUAUAUCACAAAAAAUCA 83.7% 1505 AGGCCAAACUGGCAAAUGU 83.0%

1447 AUUACAUAUAUCACAAAAA 83.6% 1506 UGGAGUAACAGUGAUAAAG 83.0%

1448 GAUUACAUAUAUCACAAAA 83.6% 1507 UGAGCAUUGGAGUAACAGU 83.0%

1449 CCAAACUGGCAAAUGUUGU 83.6% 1508 CCGGAUUGAUGCCAGAAUU 83.0%

1450 AUUGAGAAAAUAAGGCCUC 83.6% 1509 AUGAGCAUUGGAGUAACAG 83.0%

1451 UUGAGAAAAUAAGGCCUCU 83.6% 1510 AUUUCCUCAAGGAUGUGAU 83.0%

1452 AUUGGAGUAACAGUGAUAA 83.6% 1511 GCGAUGAUUACAUAUAUCA 83.0%

1453 AGCUUGAACAGUCUGGACU 83.6% 1512 AAGGCCAAACUGGCAAAUG 83.0%

1454 GCCAAACUGGCAAAUGUUG 83.6% 1513 AGCUGAUAUGAGCAUUGGA 83.0%

1455 UUGGAGUAACAGUGAUAAA 83.6% 1514 CAUUGGAGUAACAGUGAUA 82.9%

1456 AAGCUUGAACAGUCUGGAC 83.6% 1515 AGAUUUCCUCAAGGAUGUG 82.9%

1457 GCUAAUAGAUUUCCUCAAG 83.5% 1516 AGCAACAGCCCAGAUGGCU 82.9%

1458 AGGAGUGGAUAGAUUCUAC 83.5% 1517 GGCCAAACUGGCAAAUGUU 82.9%

1459 CAUGAUAAACAAUGACCUU 83.5% 1518 GCAUUGGAGUAACAGUGAU 82.9%

1460 ACCUUGGACCAGCAACAGC 83.4% 1519 AGCAUUGGAGUAACAGUGA 82.9%

1461 GACCUUGGACCAGCAACAG 83.4% 1520 CCCUUAUACUGGAGAUCCU 82.9%

1462 AAGAAAAUUGAGAAAAUAA 83.4% 1521 GGAUUGAUGCCAGAAUUGA 82.7%

1463 AGGGCCCGGAUUGAUGCCA 83.4% 1522 CGGAUUGAUGCCAGAAUUG 82.7%

1464 GGAGUAACAGUGAUAAAGA 83.4% 1523 AGAAGGCCAAACUGGCAAA 82.7%

1465 GGGCCCGGAUUGAUGCCAG 83.4% 1524 GAUGAUUACAUAUAUCACA 82.7%

1466 UAGGGCCCGGAUUGAUGCC 83.4% 1525 ACAGCCAUGGAACAGGAAC 82.7%

1467 UAUACUGGAGAUCCUCCAU 83.4% 1526 AAGAAGGCCAAACUGGCAA 82.7%

1468 AGAAAAUUGAGAAAAUAAG 83.4% 1527 GAAGGCCAAACUGGCAAAU 82.7%

1528 AUAUGAGCAUUGGAGUAAC 82.7% 1587 GAAAAUUGAGAAAAUAAGG 81.4%

1529 UAUGAGCAUUGGAGUAACA 82.7% 1588 CAAGCCAAAGGGGAAUUCU 81.4%

1530 UACAGCCAUGGAACAGGAA 82.7% 1589 ACAAACAGACUGUGUCCUG 81.4%

1531 CUUGGACCAGCAACAGCCC 82.7% 1590 ACAAGCCAAAGGGGAAUUC 81.4%

1532 AUUGAUGCCAGAAUUGACU 82.7% 1591 CACAAACAGACUGUGUCCU 81.4%

1533 GAGCAUUGGAGUAACAGUG 82.7% 1592 AAGCCAAAGGGGAAUUCUU 81.3%

1534 UUGGACCAGCAACAGCCCA 82.7% 1593 CCAAUUGAUGGACCACUAC 81.3% 1535 AUGCCAGAAUUGACUUCGA 82.7% 1594 AACAAUGACCUUGGACCAG 81.3%

1536 UUGAUGCCAGAAUUGACUU 82.7% 1595 UGGAUUCCCAAGAGGAACC 81.3%

1537 GAUAUGAGCAUUGGAGUAA 82.7% 1596 GCACAAACAGACUGUGUCC 81.3%

1538 GAUGCCAGAAUUGACUUCG 82.7% 1597 ACAAUGACCUUGGACCAGC 81.3%

1539 AAUUGACUUCGAGUCUGGA 82.6% 1598 CAAUUGAUGGACCACUACC 81.3%

1540 UGAUAUGAGCAUUGGAGUA 82.6% 1599 AAACAAUGACCUUGGACCA 81.3%

1541 AUACAGCCAUGGAACAGGA 82.6% 1600 UCCUUGAAGAAUCCCACCC 81.3%

1542 CAGCCAUGGAACAGGAACA 82.6% 1601 UGCACAAACAGACUGUGUC 81.3%

1543 UGAUGCCAGAAUUGACUUC 82.5% 1602 UUCCUUGAAGAAUCCCACC 81.3%

1544 GAUUGAUGCCAGAAUUGAC 82.5% 1603 AAUAAGGCCUCUUCUAAUA 81.3%

1545 UGGACCAGCAACAGCCCAG 82.5% 1604 GGAUUCCCAAGAGGAACCG 81.3%

1546 CUGAAUCUUGGGCAAAAGA 82.4% 1605 UAUCACAAAAAAUCAACCU 81.2%

1547 UGAUGGACCACUACCUGAG 82.4% 1606 UUGAAGAAUCCCACCCAGG 81.2%

1548 UGCCAGAAUUGACUUCGAG 82.4% 1607 CUUGAAGAAUCCCACCCAG 81.2%

1549 UGGCGAUGAUUACAUAUAU 82.4% 1608 CCUGGAUUCCCAAGAGGAA 81.2%

1550 GUGGGAGCUAAUGGAUGAG 82.3% 1609 ACCACAUUCCCUUAUACUG 81.2%

1551 GGCGAUGAUUACAUAUAUC 82.3% 1610 CCCAAUUGAUGGACCACUA 81.2%

1552 UAAAGUGGGAGCUAAUGGA 82.3% 1611 AAUUGAGAAAAUAAGGCCU 81.2%

1553 AAAGUGGGAGCUAAUGGAU 82.2% 1612 UCCUGGAUUCCCAAGAGGA 81.2%

1554 UUGAAUUCACAAGCUUUUU 82.0% 1613 CCACAUUCCCUUAUACUGG 81.2%

1555 UUUGAAtKJCACAAGCUUUU 82.0% 1614 CCUUGAAGAAUCCCACCCA 81.0%

1556 AAAAUUGAGAAAAUAAGGC 81.8% 1615 AACACAAGCCAAAGGGGAA 81.0%

1557 CCUUGGACCAGCAACAGCC 81.8% 1616 AUGACCUUGGACCAGCAAC 81.0%

1558 UAAACAAUGACCUUGGACC 81.8% 1617 UAAGGCCUCUUCUAAUAGA 81.0%

1559 GAUGGACCACUACCUGAGG 81.8% 1618 AAUGACCUUGGACCAGCAA 81.0%

1560 AAAUUGAGAAAAUAAGGCC 81.8% 1619 ACACAAGCCAAAGGGGAAU 81.0%

1561 AUUUGAAUUCACAAGCUUU 81.7% 1620 GAUCAUUCGAGCUAAAGAA 81.0%

1562 CCAAAGGGGAAUUCUUGAG 81.7% 1621 UGACCUUGGACCAGCAACA 81.0%

1563 CAUUUGAAUUCACAAGCUU 81.7% 1622 CUCCUGGAUUCCCAAGAGG 80.9%

1564 AUGGACCACUACCUGAGGA 81.7% 1623 CUGGAUUCCCAAGAGGAAC 80.9%

1565 ACAUUUGAAUUCACAAGCU 81.7% 1624 CAAUGACCUUGGACCAGCA 80.9%

1566 ACACACUCCUGGAUUCCCA 81.6% 1625 CACAAGCCAAAGGGGAAUU 80.9%

1567 CACACUCCUGGAUUCCCAA 81.6% 1626 UCAUUCGAGCUAAAGAAGC 80.8%

1568 ACACUCCUGGAUUCCCAAG 81.5% 1627 AUAAGGCCUCUUCUAAUAG 80.8%

1569 GAUAAACAAUGACCUUGGA 81.5% 1628 CAUUCGAGCUAAAGAAGCU 80.8%

1570 AUAUCACAAAAAAUCAACC 81.5% 1629 AUCAUUCGAGCUAAAGAAG 80.8%

1571 CACUCCUGGAUUCCCAAGA 81.5% 1630 CAAAAUGCCAUAAGCACCA 80.7%

1572 GAGGAGACACACAAAΌUCA 81.5% 1631 AAAUGCCAUAAGCACCACA 80.6%

1573 ACUCCUGGAUUCCCAAGAG 81.5% 1632 AAAAUGCCAUAAGCACCAC 80.6%

1574 AGAUCAUUCGAGCUAAAGA 81.5% 1633 GCAAAAUGCCAUAAGCACC 80.6%

1575 UACACACUCCUGGAUUCCC 81.5% 1634 AUGCCAUAAGCACCACAUU 80.6%

1576 AGAGGAGACACACAAAUUC 81.5% 1635 AAUGCCAUAAGCACCACAU 80.6%

1577 UAUAUCACAAAAAAUCAAC 81.5% 1636 UAAGCACCACAUUCCCUUA 80.4%

1578 AUCACAAAAAAUCAACCUG 81.5% 1637 AUAAGCACCACAUUCCCUU 80.4%

1579 AUAUAUCACAAAAAAUCAA 81.5% 1638 UUCAACAUGCUAAGUACGG 80.3%

1580 AUAAACAAUGACCUUGGAC 81.5% 1639 UUCCUCAAGGAUGUGAUGG 80.3%

1581 UGUUCGAGAAAUUUUUCCC 81.5% 1640 GUUCAACAUGCUAAGUACG 80.3%

1582 AACCCAAUUGAUGGACCAC 81.5% 1641 UUUCCUCAAGGAUGUGAUG 80.3%

1583 UCACAAAAAAUCAACCUGA 81.5% 1642 GAAUCAAUGGAUAAAGAGG 80.3%

1584 ACCCAAUUGAUGGACCACU 81.4% 1643 UCAACAUGCUAAGUACGGU 80.3%

1585 CAAACAGACUGUGUCCUGG 81.4% 1644 CCAAGAGGAACCGCUCUAU 80.3%

1586 AAACAGACUGUGUCCUGGA 81.4% 1645 CUGAGAUCAUGAAGAUCUG 80.2%

1646 CCCAAGAGGAACCGCUCUA 80.2% 1705 UGAUGGAAUCAAUGGAUAA 79.1%

1647 UGCCAUAAGCACCACAUUC 80.2% 1706 CUGCUUAAAGUGGGAGCUA 79.0%

1648 UCCCAAGAGGAACCGCUCU 80.2% 1707 UGGAAUCAAUGGAUAAAGA 79.0%

1649 GCCAUAAGCACCACAUUCC 80.2% 1708 AAACAAGGGUGGACAAACU 78.9%

1650 UUCCCAAGAGGAACCGCUC 80.2% 1709 GGUUCAGAAACAUCCUGAG 78.9%

1651 AAUCAAUGGAUAAAGAGGA 80.2% 1710 CAACUCAACCCAAUUGAUG 78.9%

1652 UUAAAGUGGGAGCUAAUGG 80.2% 1711 UGGUUCAGAAACAUCCUGA 78.9%

1653 CCAUAAGCACCACAUUCCC 80.2% 1712 AACUCAACCCAAUUGAUGG 78.9%

1654 CAUAAGCACCACAUUCCCU 80.2% 1713 AUUCGAGCUAAAGAAGCUG 78.8%

1655 AUUCCCAAGAGGAACCGCU 80.1% 1714 CAAACAAGGGUGGACAAAC 78.8%

1656 GAUUCCCAAGAGGAACCGC 80.1% 1715 AUCAAUGGAUAAAGAGGAA 78.8%

1657 UCCUCAAGGAUGUGAUGGA 80.0% 1716 AUGGAAUCAAUGGAUAAAG 78.8%

1658 UGAGAAAAUAAGGCCUCUU 80.0% 1717 GUUCAGAAACAUCCUGAGC 78.7%

1659 AAGAGGAACCGCUCUAUUC 80.0% 1718 CAACAUGCUAAGUACGGUU 78.7%

1660 GAGACACACAAAUUCAGAC 80.0% 1719 UUCAGAAACAUCCUGAGCA 78.6%

1661 CAAGAGGAACCGCUCUAUU 80.0% 1720 GGAAUCAAUGGAUAAAGAG 78.6%

1662 GGAGACACACAAAUUCAGA 80.0% 1721 UCAGAAACAUCCUGAGCAU 78.6%

1663 UGAGAUCAUGAAGAUCUGU 80.0% 1722 CAACUGCAUUAGCCAACAC 78.5%

1664 GAGCUAAAGAAGCUGUGGG 80.0% 1723 GCAACUGCAUUAGCCAACA 78.5%

1665 AGAGGAACCGCUCUAUUCU 80.0% 1724 AAGUGGAUAUGCACAAACA 78.5%

1666 AGCUAAAGAAGCUGUGGGA 79.9% 1725 CAAGUGGAUAUGCACAAAC 78.5%

1667 AGAAAAUAAGGCCUCUUCU 79.9% 1726 CCAAGUGGAUAUGCACAAA 78.5%

1668 AAAAUAAGGCCUCUUCUAA 79.9% 1727 AGAAACAUCCUGAGCAUCG 78.3%

1669 CACAUUCCCUUAUACUGGA 79.9% 1728 ACAUGCUAAGUACGGUUUU 78.3%

1670 GAAGUCUGCUUAAAGUGGG 79.8% 1729 GAAACAUCCUGAGCAUCGC 78.3%

1671 AAAUAAGGCCUCUUCUAAU 79.8% 1730 AACAUGCUAAGUACGGUUU 78.3%

1672 GAGAAAAUAAGGCCUCUUC 79.8% 1731 ACACACAAAUUCAGACGAG 78.2%

1673 AGUCUGCUUAAAGUGGGAG 79.7% 1732 GCACCACAUUCCCUUAUAC 78.2%

1674 CCACUACCUGAGGAUAAUG 79.7% 1733 GCUAAUUUUAGCAUGGAGC 78.2%

1675 AAGUCUGCUUAAAGUGGGA 79.7% 1734 ACAUCCUGAGCAUCGCACC 78.2%

1676 AUGUGAUGGAAUCAAUGGA 79.6% 1735 AAUUGAUGGACCACUACCU 78.2%

1677 GAUGUGAUGGAAUCAAUGG 79.6% 1736 CAGAAACAUCCUGAGCAUC 78.2%

1678 GAAAAUAAGGCCUCUUCUA 79.6% 1737 AGCACCACAUUCCCUUAUA 78.2%

1679 CACUACCUGAGGAUAAUGA 79.6% 1738 GACACACAAAUUCAGACGA 78.2%

1680 UUCGAGCUAAAGAAGCUGU 79.4% 1739 AACAUCCUGAGCAUCGCAC 78.2%

1681 GGACCACUACCUGAGGAUA 79.4% 1740 AAGCACCACAUUCCCUUAU 78.2%

1682 UCGAGCUAAAGAAGCUGUG 79.4% 1741 CUAAUUUUAGCAUGGAGCU 78.2%

1683 ACCACUACCUGAGGAUAAU 79.4% 1742 AAACAUCCUGAGCAUCGCA 78.1%

1684 GACCACUACCUGAGGAUAA 79.4% 1743 CACCACAUUCCCUUAUACU 78.0%

1685 CGAGCUAAAGAAGCUGUGG 79.4% 1744 GAUGGAAUCAAUGGAUAAA 78.0%

1686 UCUGCUUAAAGUGGGAGCU 79.4% 1745 AGACACACAAAUUCAGACG 78.0%

1687 UGGACCACUACCUGAGGAU 79.4% 1746 AAGGAUGUGAUGGAAUCAA 77.7%

1688 GUCUGCUUAAAGUGGGAGC 79.4% 1747 AGGAUGUGAUGGAAUCAAU 77.7%

1689 GAAGCUCCGAACACAAAUA 79.3% 1748 UGGGAUGGGCUCCAAUCCU 77.6%

1690 AUGAAGCUCCGAACACAAA 79.3% 1749 AUGCCGUUGCAACUACACA 77.6%

1691 CCAGUUUUGGAGUGUCUGG 79.3% 1750 UGAUGCCGUUGCAACUACA 77.6%

1692 CCCAGUUUUGGAGUGUCUG 79.3% 1751 GUGGGAUGGGCUCCAAUCC 77.6%

1693 UGAAGCUCCGAACACAAAU 79.3% 1752 GAUGCCGUUGCAACUACAC 77.6%

1694 CAGAAAUGCUAGCAAGCAU 79.2% 1753 AUGAUGCCGUUGCAACUAC 77.6%

1695 GUGAUGGAAUCAAUGGAUA 79.2% 1754 UAUGAUGCCGUUGCAACUA 77.6%

1696 GCAGAAAUGCUAGCAAGCA 79.2% 1755 GGGAUGGGCUCCAAUCCUC 77.6%

1697 UGUGAUGGAAUCAAUGGAU 79.2% 1756 AAUGGAUGUCAAUCCGACU 77.5%

1698 UGCUUAAAGUGGGAGCUAA 79.2% 1757 GGAUGUGAUGGAAUCAAUG 77.5%

1699 GGUGGGAUGGGCUCCAAUC 79.2% 1758 GAUUUGUGGCUAAUUUUAG 77.5%

1700 AAGCUCCGAACACAAAUAC 79.2% 1759 GGAUUUGUGGCUAAUUUUA 77.5%

1701 UGGUGGGAUGGGCUCCAAU 79.2% 1760 GGCACAGCAUCAUUGAGCC 77.3%

1702 GCUUAAAGUGGGAGCUAAU 79.2% 1761 GUGGCUAAUUUUAGCAUGG 77.3%

1703 AGCUCCGAACACAAAUACC 79.2% 1762 UUUGUGGCUAAUUUUAGCA 77.3%

1704 CUUAAAGUGGGAGCUAAUG 79.2% 1763 GCACAGCAUCAUUGAGCCC 77.3%

1764 UGGCUAAUUUUAGCAUGGA 77.3% 1823 CUUGAACAGUCUGGACUUC 76.4%

1765 GGCUAAUUUUAGCAUGGAG 77.3% 1824 CCACCAUUGAAGAACUCAG 76.4%

1766 UGGAUUUGUGGCUAAUUUU 77.3% 1825 AGUGGAUAGAUUCUACAGG 76.4%

1767 UAUGGAUUUGUGGCUAAUU 77.3% 1826 AGCUUUCUUUCACAAUCAC 76.4%

1768 UGUGGCUAAUUUUAGCAUG 77.3% 1827 UUGAACAGUCUGGACUUCC 76.4%

1769 AUGGAUUUGUGGCUAAUUU 77.3% 1828 UUGCAACACCCGGGAUGCA 76.3%

1770 UUGUGGCUAAUUUUAGCAU 77.3% 1829 UUCAACAAACAAGGGUGGA 76.3%

1771 CAAGGAUGUGAUGGAAUCA 77.2% 1830 AUUGCAACACCCGGGAUGC 76.3%

1772 ACUACCUGAGGAUAAUGAG 77.2% 1831 AUGUCAAUCCGACUCUACU 763%

1773 CUCAAGGAUGUGAUGGAAU 77.2% 1832 UGCCCUCAUAGUGAAUGCA 76.1%

1774 GCCCUCAUAGUGAAUGCAC 77.2% 1833 UGGAUGUCAAUCCGACUCU 76.1%

1775 CUACCUGAGGAUAAUGAGC 77.2% 1834 UUGCCCUCAUAGUGAAUGC 76.1%

1776 UCAAGGAUGUGAUGGAAUC 77.2% 1835 GCAACACCCGGGAUGCAAA 76.1%

1777 AUAUGAUGCCGUUGCAACU 77.1% 1836 CAACACCCGGGAUGCAAAU 76.1%

1778 GAAUAUGAUGCCGUUGCAA 77.1% 1837 UGCAACACCCGGGAUGCAA 76.1%

1779 UACCUGAGGAUAAUGAGCC 77.1% 1838 UUUGCCCUCAUAGUGAAUG 76.0% 1780 AAUAUGAUGCCGUUGCAAC 77.1% 1839 AUGGAUGUCAAUCCGACUC 76.0%

1781 CCCUCAUAGUGAAUGCACC 77.1% 1840 ACAGAACACACCAAUAUUC 76.0%

1782 AUUUGUGGCUAAUUUUAGC 77.1% 1841 CAGAACACACCAAUAUUCA 76.0%

1783 UGGAAUAUGAUGCCGUUGC 77.0% 1842 AACAGAACACACCAAUAUU 76.0%

1784 AGAACACACCAAUAUUCAG 77.0% 1843 AGGAUGAACAGAUGUACCA 75.9%

1785 GGAAUAUGAUGCCGUUGCA 77.0% 1844 AUUUUGCCCUCAUAGUGAA 75.9%

1786 CAACAAACAAGGGUGGACA 77.0% 1845 UGAAGAACUCAGACGGCAA 75.9%

1787 CCUCAUAGUGAAUGCACCA 77.0% 1846 GCUUGAACAGUCUGGACUU 75.9%

1788 CAGCAUCAUUGAGCCCUGG 77.0% 1847 GAAGAACUCAGACGGCAAA 75.9%

1789 ACAGCAUCAUUGAGCCCUG 76.9% 1848 UGAGGAUGAACAGAUGUAC 75.9%

1790 CACAGCAUCAUUGAGCCCU 76.9% 1849 UUUUGCCCUCAUAGUGAAU 75.9%

1791 AUGGAAUAUGAUGCCGUUG 76.9% 1850 GAUGUCAAUCCGACUCUAC 75.9%

1792 AGAAGUCCUAUAUAAAUAA 76.8% 1851 GAUUUUGCCCUCAUAGUGA 75.9%

1793 CUGUUCCACCAUUGAAGAA 76.8% 1852 GGAUGUCAAUCCGACUCUA 75.9%

1794 AAGUACGGUUUUAGGAGUC 76.8% 1853 GAGGAUGAACAGAUGUACC 75.9%

1795 GAACACACCAAUAUUCAGA 76.8% 1854 UGAGCAAAAAGAAGUCCUA 75.8%

1796 CUCAGACGGCAAAAAUAAU 76.8% 1855 AUGAGCAAAAAGAAGUCCU 75.8%

1797 AGUACGGUUUUAGGAGUCU 76.8% 1856 AAGAACUCAGACGGCAAAA 75.8%

1798 UGCUAAGUACGGUUUUAGG 76.8% 1857 GUUCAACAAACAAGGGUGG 75.7%

1799 GUACGGUUUUAGGAGUCUC 76.8% 1858 UCAACAGAACACACCAAUA 75.7%

1800 CAUGCUAAGUACGGUUUUA 76.8% 1859 GUCAACAGAACACACCAAU 75.7O/ O

1801 UAAGUACGGUUUUAGGAGU 76.8% 1860 CAACAGAACACACCAAUAU 75.7%

1802 UCCACCAUUGAAGAACUCA 76.8% 1861 GCUAUUGCAACACCCGGGA 75.6%

1803 AGGAGACACACAAAUUCAG 76.8% 1862 UUCACAAGCUUUUUUUAUC 75.6%

1804 AUGCUAAGUACGGUUUUAG 76.8% 1863 CUAUUGCAACACCCGGGAU 75.6%

1805 AGGUUCCAGCGCAAAAUGC 76.7% 1864 AAUUCACAAGCUUUUUUUA 75.6%

1806 AAGGUUCCAGCGCAAAAUG 76.7% 1865 UGAAUUCACAAGCUUϋϋUU 75.6%

1807 AAAGGUUCCAGCGCAAAAU 76.7% 1866 GAAUUCACAAGCUUUUUUU 75.6%

1808 UAAAGGUUCCAGCGCAAAA 76.7% 1867 UAUUGCAACACCCGGGAUG 75.6% 1809 UGUUCCACCAUUGAAGAAC 76.6% 1868 UCACAAGCUUUUUUUAUCG 75.6%

1810 GUUCCACCAUUGAAGAACU 76.6% 1869 GUUCUCAAACAAAAUGGCA 75.5%

1811 CAUUGAAGAACUCAGACGG 76.6% 1870 UUCUCAAACAAAAUGGCAA 75.5%

1812 CUAAGUACGGUUUUAGGAG 76.6% 1871 AGAAGGGCUAUUGCAACAC 75.5%

1813 CCAUUGAAGAACUCAGACG 76.6% 1872 GAAGUUUUUAGAUCGAAUG 75.5%

1814 ACCAUUGAAGAACUCAGAC 76.6% 1873 UGCCUUGAAACAAUGGAAG 75.5%

1815 GCUAAGUACGGUUUUAGGA 76.6% 1874 AGAACUCAGACGGCAAAAA 75.5%

1816 UUCCACCAUUGAAGAACUC 76.6% 1875 AUUCACAAGCUUUUUUUAU 75.5%

1817 CACCAUUGAAGAACUCAGA 76.6% 1876 GAAGGGCUAUUGCAACACC 75.5O/ O

1818 AUUGAAGAACUCAGACGGC 76.5% 1877 UCUCAAACAAAAUGGCAAG 75.4%

1819 GUGGAUAGAUUCUACAGGA 76.5% 1878 UACAUGUUCGAGAGUAAGA 75.3%

1820 GGUUCCAGCGCAAAAUGCC 76.5% 1879 UCUGAGAUCAUGAAGAUCU 75.3O/ O

1821 UUGAAGAACUCAGACGGCA 76.5% 1880 AAGAAGGGCUAUUGCAACA 75.3%

1822 GAGCUUUCUUUCACAAUCA 76.4% 1881 ACAUGUUCGAGAGUAAGAG 75.3%

1882 GAAAAGGAUACAUGUUCGA 75.3% 1941 CAACACACUUUCAAAGAAA 74.4%

1883 GGAAAAGGAUACAUGUUCG 75.3% 1942 CCAUAGAAGUUUUUAGAUC 74.4%

1884 AAAGAAGGGCUAUUGCAAC 75.2% 1943 GCCAGCCCAUGGUCCAGCC 74.3%

1885 AAAAGAAGGGCUAUUGCAA 75.2% 1944 AAACUGGGGCACCCCAACU 74.3%

1886 GAACUCAGACGGCAAAAAU 75.2% 1945 CAGAAACUGGGGCACCCCA 74.3%

1887 UCAACAAACAAGGGUGGAC 75.1% 1946 AGAAACUGGGGCACCCCAA 74.3%

1888 CCUGAGUGGUUCAGAAACA 75.1% 1947 ACAGAAACUGGGGCACCCC 74.3%

1889 CAAACAAAAUGGCAAGACU 75.1% 1948 AAAGAAGUCCUAUAUAAAU 74.3%

1890 CUGAGUGGUUCAGAAACAU 75.1% 1949 CGAUGACCAAAGAUGCAGA 74.3%

1891 UCAAACAAAAUGGCAAGAC 75.0% 1950 ACGAUGACCAAAGAUGCAG 74.3%

1892 AGAAGUUUUUAGAUCGAAU 75.0% 1951 GAAACUGGGGCACCCCAAC 74.3%

1893 AACUCAGACGGCAAAAAUA 75.0% 1952 CCAGCCCAUGGUCCAGCCA 74.3%

1894 AUAGAAGUUUUUAGAUCGA 75.0% 1953 GAGAAGAUCAUUCGAGCUA 74.3%

1895 GGAUACAUGUσCGAGAGUA 75.0% 1954 AGGAUACAUGUUCGAGAGU 74.2%

1896 UAGAAGUUUUUAGAUCGAA 75.0% 1955 GCCUUGAAACAAUGGAAGU 74.2%

1897 CUCAAACAAAAUGGCAAGA 75.0% 1956 AAGGAUACAUGUUCGAGAG 74.2%

1898 AGAAGAUCAUUCGAGCUAA 75.0% 1957 AUAAUGAGCCAAGUGGAUA 74.2% 1899 AUACAUGUUCGAGAGUAAG 74.9% 1958 AAAGGAUACAUGUUCGAGA 74.2%

1900 AUCAACAUGAGCAAAAAGA 74.9% 1959 GUAAGAGAAUGAAGCUCCG 74.1%

1901 UCAACAUGAGCAAAAAGAA 74.9% 1960 ACAUGAGCAAAAAGAAGUC 74.1%

1902 AGUGGUUCAGAAACAUCCU 74.9% 1961 CCGUUGCAACUACACACUC 74.1%

1903 GAUACAUGUUCGAGAGUAA 74.9% 1962 GAGCCAAGUGGAUAUGCAC 74.1%

1904 GGAAUCAACAUGAGCAAAA 74.9% 1963 AAGUUUUUAGAUCGAAUGG 74.1%

1905 GAGUGGUUCAGAAACAUCC 74.9% 1964 CUGGGGCACCCCAACUCAA 74.1%

1906 GUGGUUCAGAAACAUCCUG 74.9% 1965 AACUGGGGCACCCCAACUC 74.1%

1907 AAAAGAAGUCCUAUAUAAA 74.8% 1966 AUGAGCCAAGUGGAUAUGC 74.1%

1908 UGAGUGGUUCAGAAACAUC 74.8% 1967 AACAUGAGCAAAAAGAAGU 74.1%

1909 GAAUCAACAUGAGCAAAAA 74.8% 1968 CAGUCAACAGAACACACCA 74.1%

1910 CCUCAAGGAUGUGAUGGAA 74.8% 1969 UUUCAAAGAAAAAGGAGAG 74.1%

1911 AAAAAGAAGUCCUAUAUAA 74.8% 1970 UGAGCCAAGUGGAUAUGCA 74.1%

1912 UAACAACACACUUUCAAAG 74.7% 1971 UGGGAUCAAACCCAAUCAA 74.1%

1913 AAUCAACAUGAGCAAAAAG 74.7% 1972 ACUGGGGCACCCCAACUCA 74.1%

1914 UUUCUUUCACAAUCACUGG 74.7% 1973 GCCGUUGCAACUACACACU 14.1%

1915 AUAACAACACACUUUCAAA 74.5% 1974 AGCCAAGUGGAUAUGCACA 74.1%

1916 UGACAUUGAACACGAUGAC 74.5% 1975 GCCAAGUGGAUAUGCACAA 74.1%

1917 GAAGAUCAUUCGAGCUAAA 74.5% 1976 AGUAAGAGAAUGAAGCUCC 74.1%

1918 ACUCAACCCAAUUGAUGGA 74.5% 1977 UUCAAAGAAAAAGGAGAGU 74.1%

1919 CACCAUAGAAGUUUUUAGA 74.5% 1978 ACAGUCAACAGAACACACC 74.1%

1920 CUCAACCCAAUUGAUGGAC 74.5% 1979 GCUUUCUUUCACAAUCACU 74.0%

1921 UCAACCCAAUUGAUGGACC 74.5% 1980 UGGGAAUCAACAUGAGCAA 74.0%

1922 CAACCCAAUUGAUGGACCA 74.5% 1981 UCAAAGAAAAAGGAGAGUA 74.0%

1923 UUGACAUUGAACACGAUGA 74.5% 1982 AAUGAGCCAAGUGGAUAUG 74.0%

1924 ACACACUUUCAAAGAAAAA 74.5% 1983 GGGGCACCCCAACUCAACC 74.0%

1925 UUCUCUGAGAUCAUGAAGA 74.5% 1984 ACAGACUGUGUCCUGGAGG 74.0%

1926 ACACCAUAGAAGUUUUUAG 74.5% 1985 CAACAUGAGCAAAAAGAAG 74.0%

1927 CACACUUUCAAAGAAAAAG 74.5% 1986 UCAAAGACUACAGAUAUAC 74.0%

1928 AACACCAUAGAAGUUUUUA 74.5% 1987 GGGCACCCCAACUCAACCC 74.0%

1929 ACCAUAGAAGUUUUUAGAU 74.5% 1988 UAAUGAGCCAAGUGGAUAU 74.0%

1930 ACAACACACUUUCAAAGAA 74.4% 1989 CUUUCUUUCACAAUCACUG 74.0%

1931 CUCUGAGAUCAUGAAGAUC 74.4% 1990 UGGGGCACCCCAACUCAAC 74.0%

1932 AACAACACACUUUCAAAGA 74.4% 1991 GUGGGAAUCAACAUGAGCA 74.0%

1933 GAUAACAACACACUUUCAA 74.4% 1992 CAGACUGUGUCCUGGAGGC 74.0%

1934 GAGAUAACAACACACUUUC 74.4% 1993 CGUUGCAACUACACACUCC 73.9%

1935 AACACACUUUCAAAGAAAA 74.4% 1994 CACAGUCAACAGAACACAC 73.9<>/ O

1936 UCUCUGAGAUCAUGAAGAU 74.4% 1995 ACACGAUGACCAAAGAUGC 73.9%

1937 UGCCAGCCCAUGGUCCAGC 74.4% 1996 AGUCAACAGAACACACCAA 73.9%

1938 ACGAGAAGAUCAUUCGAGC 74.4% 1997 AACACGAUGACCAAAGAUG 73.9%

1939 CGAGAAGAUCAUUCGAGCU 74.4% 1998 AUGCCAGCCCAUGGUCCAG 73.9%

1940 AGAUAACAACACACUUUCA 74.4% 1999 AAAGAAAAAGGAGAGUAAG 73.9%

2000 CAAAGAAAAAGGAGAGUAA 73.9% 2059 ACCUGAGUGGUUCAGAAAC 72.9%

2001 AACAGACϋGUGUCCUGGAG 73.9% 2060 CAAAAGAAAUACACCAAGA 72.9%

2002 UAAAAAGAAGGGCUAUUGC 73.8% 2061 UUUCUAGCAUGGUGGAGGC 72.9%

2003 CACGAUGACCAAAGAUGCA 73.8% 2062 AGGAAAAGGAUACAUGUUC 72.8%

2004 UGCCGUUGCAACUACACAC 73.8% 2063 AAAAGCUUGAACAGUCUGG 72.8%

2005 AAAUUAAAAAGAAGGGCUA 73.8% 2064 GGAAUUUCUAGCAUGGUGG 72.8%

2006 AAUUAAAAAGAAGGGCUAU 73.8% 2065 ACAAAAAAUCAACCUGAGU 72.8%

2007 AAAAAGAAGGGCUAUUGCA 73.8% 2066 AAAAAAUCAACCUGAGUGG 72.8%

2008 UUAAAAAGAAGGGCUAUUG 73.8% 2067 AAGAAAUACACCAAGACAA 72.8%

2009 AUUAAAAAGAAGGGCUAUU 73.7% 2068 AACAAAAUGGCAAGACUAG 72.8%

2010 GAACACGAUGACCAAAGAU 73.7% 2069 CAAAAAAUCAACCUGAGUG 72.8%

2011 UGAACACGAUGACCAAAGA 73.7% 2070 AGAAAUACACCAAGACAAC 72.8%

2012 GACAUUGAACACGAUGACC 73.6% 2071 AAGAAAAAACAAAGAGUGA 72.8%

2013 GAUAAUGAGCCAAGUGGAU 73.6% 2072 AAAGCUUGAACAGUCUGGA 72.8%

2014 CAUUGAACACGAUGACCAA 73.6% 2073 UGGAAUUUCUAGCAUGGUG 72.8%

2015 ACAUUGAACACGAUGACCA 73.6% 2074 AAUUUCUAGCAUGGUGGAG 72.8%

2016 UUUGAGAACUCAUGCCUUG 73.6% 2075 GAAAAGCUUGAACAGUCUG 72.8%

2017 UUGAGAACUCAUGCCUUGA 73.6% 2076 GAAUUUCUAGCAUGGUGGA 72.8%

2018 CAUAGAAGUUUUUAGAUCG 73.6% 2077 UUCUAGCAUGGUGGAGGCC 72.8%

2019 CAGCCCAUGGUCCAGCCAA 73.6% 2078 AAAGACUACAGAUAUACAU 72.7%

2020 AGAAUGAAGCUCCGAACAC 73.5% 2079 GGGCAAAAGAAAUACACCA 72.7%

2021 AAUGAAGCUCCGAACACAA 73.5% 2080 AAUACACCAAGACAACAUA 72.7%

2022 GAAUGAAGCUCCGAACACA 73.5% 2081 AAAGAAAUACACCAAGACA 72.7%

2023 UAGCCAACACCAUAGAAGU 73.4% 2082 UCCUAAAGGUUCCAGCGCA 72.7%

2024 AACCUGAGUGGUUCAGAAA 73.4% 2083 UAAGAGAAUGAAGCUCCGA 72.7%

2025 GCCAACACCAUAGAAGUUU 73.4% 2084 AAGAGAAUGAAGCUCCGAA 72.7%

2026 CCAACACCAUAGAAGUUUU 73.4% 2085 UUUGCGAAAAGCUUGAACA 72.7% 2027 UGAAUCUUGGGCAAAAGAA 73.4% 2086 CCUAAAGGUUCCAGCGCAA 72.7%

2028 CAACCUGAGUGGUUCAGAA 73.4% 2087 AUUUGCGAAAAGCUUGAAC 72.7%

2029 CAACACCAUAGAAGUUUUU 73.4% 2088 AGAGAAUGAAGCUCCGAAC 72.7%

2030 UAGGAAAAGGAUACAUGUϋ 73.3% 2089 UUCCUAAAGGUUCCAGCGC 72.7%

2031 GAGAACUCAUGCCUUGAAA 73.3% 2090 AAGACUACAGAUAUACAUA 72.7%

2032 UCAACCUGAGUGGUUCAGA 73.3% 2091 CACAAAAAAUCAACCUGAG 72.7%

2033 AGCAAAAAGAAGUCCUAUA 73.3% 2092 AAAUACACCAAGACAACAU 72.7%

2034 AGCCAACACCAUAGAAGUU 73.3% 2093 CAUUUGCGAAAAGCUUGAA 72.7%

2035 GAGCAAAAAGAAGUCCUAU 73.3% 2094 CAAAGACUACAGAUAUACA 72.6%

2036 GCAAAAAGAAGUCCUAUAU 73.3% 2095 CCAAUAAUGUUCUCAAACA 72.6%

2037 AGAACUCAUGCCUUGAAAC 73.3% 2096 GAGAAUGAAGCUCCGAACA 72.6% 2038 UCAUCAAAGACUACAGAUA 73.2% 2097 GAAAUACACCAAGACAACA 72.6%

2039 UGAGAACUCAUGCCUUGAA 73.2% 2098 AACAAACAAGGGUGGACAA 72.6%

2040 UUGGAAUUUCUAGCAUGGU 73.2% 2099 AAACAAAAUGGCAAGACUA 72.6%

2041 UUCAUCAAAGACUACAGAU 73.2% 2100 UCUUUGAGAACUCAUGCCU 72.6%

2042 UϋAGCCAACACCAUAGAAG 73.2% 2101 UCCUAUAUAAAUAAAACAG 72.6%

2043 ACUCAGACGGCAAAAAUAA 73.1% 2102 CCUAUAUAAAUAAAACAGG 72.6%

2044 UUGAACACGAUGACCAAAG 73.1% 2103 CAAUAAUGUUCUCAAACAA 72.6%

2045 AAAAGAAAUACACCAAGAC 73.1% 2104 UUCGAGAGUAAGAGAAUGA 72.6%

2046 CUAAAGGUUCCAGCGCAAA 73.1% 2105 GUUCGAGAGUAAGAGAAUG 72.6%

2047 AAUCAACCUGAGUGGUUCA 73.1% 2106 UGGGCAAAAGAAAUACACC 72.6%

2048 AUCAACCUGAGUGGUUCAG 73.1% 2107 CUUGGGCAAAAGAAAUACA 72.6%

2049 ACAAAAUGGCAAGACUAGG 73.1% 2108 UCGAGAGUAAGAGAAUGAA 72.6%

2050 AAAAAUCAACCUGAGUGGU 73.1% 2109 AUGUUCGAGAGUAAGAGAA 72.6%

2051 CAAAAAGAAGUCCUAUAUA 73.1% 2110 AAUAAUGUUCUCAAACAAA 72.6%

2052 UCUAGCAUGGUGGAGGCCA 72.9% 2111 UUGGGCAAAAGAAAUACAC 72.6%

2053 AAAAUCAACCUGAGUGGUU 72.9% 2112 UGUUCGAGAGUAAGAGAAU 72.6%

2054 AUUUCUAGCAUGGUGGAGG 72.9% 2113 ACAAACAAGGGUGGACAAA 72.5%

2055 UAGCAUGGUGGAGGCCAUG 72.9% 2114 ACUCAUGCCUUGAAACAAU 72.5%

2056 AAAUCAACCUGAGUGGUUC 72.9% 2115 GAAAAAGGAGAGUAAGAGA 72.5%

2057 AUUGAACACGAUGACCAAA 72.9% 2116 AAAAGGAGAGUAAGAGACA 72.5%

2058 CUAGCAUGGUGGAGGCCAU 72.9% 2117 GAAUCUUGGGCAAAAGAAA 72.5%

2118 CUAGGAAAAGGAUACAUGU 72.5% 2177 CUGUGUCCUGGAGGCUAUG 72.0%

2119 AGCAUUUGCGAAAAGCUUG 72.5% 2178 AACACUAAGUGGAAUGAAA 72.0%

2120 AUCUUUGAGAACUCAUGCC 72.5% 2179 AGGAGAGUAAGAGACAACA 72.0%

2121 AAAGGAGAGUAAGAGACAA 72.5% 2180 UGCAACUUGUUCGAGAAAU 72.0%

2122 GCAUUUGCGAAAAGCUUGA 72.5% 2181 ACACUAAGUGGAAUGAAAA 72.0%

2123 AUCUUGGGCAAAAGAAAUA 72.4% 2182 UGCUGCAACUUGUUCGAGA 72.0%

2124 GCGAAAAGCUUGAACAGUC 72.4% 2183 GCUGUGGGAUCAAACCCAA 72.0%

2125 UGUUCAUCAAAGACUACAG 72.4% 2184 CCAACUCAACCCAAUUGAU 72.0%

2126 AAUCUUGGGCAAAAGAAAU 72.4% 2185 GACUGUGUCCUGGAGGCUA 72.0%

2127 AGUUUUGGAGUGUCUGGAA 72.4% 2186 ACUUGUUCGAGAAAUϋUUU 72.0%

2128 GAGCUUUGACAUUGAACAC 72.4% 2187 AAGGAGAGUAAGAGACAAC 72.0%

2129 AGAAAAAGGAGAGUAAGAG 72.4% 2188 AGACUAGGAAAAGGAUACA 72.0%

2130 CAAUUGUUCAUCAAAGACU 72.4% 2189 GCAACUUGUUCGAGAAAUU 72.0%

2131 AAUUGUUCAUCAAAGACUA 72.4% 2190 GGAGAGUAAGAGACAACAU 72.0%

2132 UCUUGGGCAAAAGAAAUAC 72.4% 2191 GUCCUAUAUAAAUAAAACA 71.9%

2133 AACUCAUGCCUUGAAACAA 72.4% 2192 CUAAUAAGAGCUUUGACAU 71.9%

2134 AGAAAAAACAAAGAGUGAA 72.4% 2193 CAACUUGUUCGAGAAAUUU 71.9%

2135 UGCGAAAAGCUUGAACAGU 72.4% 2194 AAAAUGGCAAGACUAGGAA 71.9%

2136 AGACUACAGAUAUACAUAU 72.4% 2195 CUGGACGGAUUAAGAAGGA 71.9%

2137 UUGCGAAAAGCUUGAACAG 72.4% 2196 GAAGUCCUAUAUAAAUAAA 71.9% 2138 CGAAAAGCUUGAACAGUCU 72.4% 2197 UAAUAAGAGCUUUGACAUU 71.9%

2139 AAGAAAAAGGAGAGUAAGA 72.4% 2198 CAAAGAGUGAAUAAGAGAG 71.9%

2140 GAGUGAAUAAGAGAGGCUA 72.3% 2199 CUUGUUCGAGAAAUUUUUC 71.9%

2141 GGCAAAAGAAAUACACCAA 72.3% 2200 GCUGCAACUUGUUCGAGAA 71.9<>/ 0

2142 CAUGUUCGAGAGUAAGAGA 72.3% 2201 UCAUGCCUUGAAACAAUGG 71.9%

2143 AAAAAACAAAGAGUGAAUA 72.3% 2202 CAUGCCUUGAAACAAUGGA 71.9%

2144 AGAGUGAAUAAGAGAGGCU 72.3% 2203 AAAUGGCAAGACUAGGAAA 71.9%

2145 GAAAAAACAAAGAGUGAAU 72.3% 2204 CAAAAUGGCAAGACUAGGA 71.9%

2146 CϋGUGGGAUCAAACCCAAU 72.2% 2205 UCUGGACGGAUUAAGAAGG 71.9o/ o

2147 GGGAAUCAACAUGAGCAAA 72.2% 2206 AAAGAGUGAAUAAGAGAGG 71.9%

2148 GUGUCCUGGAGGCUAUGGC 72.2% 2207 ACAGGAACAGGGUACACCA 71.8%

2149 AAAAAGGAGAGUAAGAGAC 72.2% 2208 CACUUUCAAAGAAAAAGGA 71.8%

2150 UCUGUAAACAAUGCUGUAG 72.2% 2209 CAACACUAAGUGGAAUGAA 71.8%

2151 GUGGGAUCAAACCCAAUCA 72.2% 2210 GACAACACUAAGUGGAAUG 71.8%

2152 AAGAUCAUUCGAGCUAAAG 72.2% 2211 ACAACACUAAGUGGAAUGA 71.8%

2153 UGUGUCCUGGAGGCUAUGG 72.2% 2212 UGGACGGAUUAAGAAGGAA 71.8%

2154 UGUGGGAUCAAACCCAAUC 72.2% 2213 ACACUUUCAAAGAAAAAGG 71.8%

2155 CUGUAAACAAUGCUGUAGU 72.2% 2214 AAAACAAAGAGUGAAUAAG 71.8%

2156 GAACUCAUGCCUUGAAACA 72.2% 2215 CAGGAACAGGGUACACCAU 71.8%

2157 GUUUUGGAGUGUCUGGAAU 72.2% 2216 UAUGGAAUAUGAUGCCGUU 71.8%

2158 CAUGAGCAAAAAGAAGUCC 72.1% 2217 ACUUσCAAAGAAAAAGGAG 71.8%

2159 GGAUGAUGAUGGGCAUGUU 72.1% 2218 GAGUCAGCUGAUAUGAGCA 71,70/ 0

2160 AACAAAGAGUGAAUAAGAG 72.1% 2219 AGAGCUUUGACAUUGAACA 71.7%

2161 ACAAAGAGUGAAUAAGAGA 72.1% 2220 AUAAGAGCUUUGACAUUGA 71 , 7%

2162 AAACAAAGAGUGAAUAAGA 72.1% 2221 GGACGGAUUAAGAAGGAAG 71.70/ 0

2163 GGGAUGAUGAUGGGCAUGU 72.1% 2222 GAGAGUAAGAGACAACAUG 71.7%

2164 AAAAGGAUACAUGUUCGAG 72.1% 2223 AAGAGCUUUGACAUUGAAC 71.7%

2165 ACUAGGAAAAGGAUACAUG 72.1% 2224 GUCUGGACGGAUUAAGAAG 71.7%

2166 CUUUGAGAACUCAUGCCUU 72.0% 2225 GAUGGGCUCCAAUCCUCCG 71.7%

2167 AGACUGUGUCCUGGAGGCU 72.0% 2226 AAGCUGUGGGAUCAAACCC 71.70/ 0

2168 GACUAGGAAAAGGAUACAU 72.0% 2227 UAAGAGCUUUGACAUUGAA 71.7%

2169 CCCAACUCAACCCAAUUGA 72.0% 2228 CUUUCAAAGAAAAAGGAGA 71.7%

2170 AAAAACAAAGAGUGAAUAA 72.0% 2229 AUGGGCUCCAAUCCUCCGA 71.7%

2171 UUGUUCGAGAAAUUUUUCC 72.0% 2230 GGAUGGGCUCCAAUCCUCC 71.70/ 0

2172 AGUCCUAUAUAAAUAAAAC 72.0% 2231 GACGGAUUAAGAAGGAAGA 71.7%

2173 AACUUGUUCGAGAAAUUUU 72.0% 2232 AGUCAGCUGAUAUGAGCAU 71.7%

2174 CCCCAACUCAACCCAAUUG 72.0% 2233 ACAAAUACAGAAACUGGGG 71.6O/ O

2175 AAGUCCUAUAUAAAUAAAA 72.0% 2234 GGAACAGGGUACACCAUGG 71.6%

2176 ACUGUGUCCUGGAGGCUAU 72.0% 2235 AGCUGUGGGAUCAAACCCA 7i.6o/ o

2236 CAAAUACAGAAACUGGGGC 71.6% 2295 UGAGCAUCGCACCAAUAAU 70.9%

2237 GUCAGCUGAUAUGAGCAUU 71.6% 2296 CGCACCAAUAAUGUUCUCA 70.9% 2238 CUGCAACUUGUUCGAGAAA 71.6% 2297 UUAGCUAGAAGCAUUUGCG 70.9%

2239 GAACAGGGUACACCAUGGA 71.6% 2298 CAUCGCACCAAUAAUGUUC 70.9%

2240 AAGAGUGAAUAAGAGAGGC 71.6% 2299 GAGCAUCGCACCAAUAAUG 70.9%

2241 AGUGCUGCAACUUGUUCGA 71.5% 2300 UCUCGAUACUGAAUCUUGG 70.8%

2242 CUUUCACAAUCACUGGGGA 71.5% 2301 AGACGAGAAGAUCAUUCGA 70.8%

2243 CUCAUGCCUUGAAACAAUG 71.5% 2302 GGUAAUGAAAAGAAGGCCA 70.8%

2244 UCUUUCACAAUCACUGGGG 71.5% 2303 CAGACGAGAAGAUCAUUCG 70.8%

2245 AAGUGCUGCAACUUGUUCG 71.5% 2304 GUCUCGAUACUGAAUCUUG 70.8%

2246 AGGAACAGGGUACACCAUG 71.5% 2305 UUUAGCUAGAAGCAUUUGC 70.8%

2247 AUGCCUUGAAACAAUGGAA 71.4% 2306 GUAAUGAAAAGAAGGCCAA 70.8%

2248 AAUAAGAGCUUUGACAUUG 71.4% 2307 ACUUUAGCUAGAAGCAUUU 70.8%

2249 CAGUUUUGGAGUGUCUGGA 71.4% 2308 AGCCCAUGGUCCAGCCAAA 70.8%

2250 AAAGAAGGCCAAACUGGCA 71.4% 2309 UAAUGAAAAGAAGGCCAAA 70.7%

2251 AUCAAAGACUACAGAUAUA 71.4% 2310 GAGUCUCGAUACUGAAUCU 70.7%

2252 CCUUUGUCAGCCAUAAAGA 71.4% 2311 UCAGACGAGAAGAUCAUUC 70.7%

2253 GUAUGGAAUAUGAUGCCGU 71.4% 2312 UCGUUCAACAAACAAGGGU 70.7%

2254 UUGGCGAUGAUUACAUAUA 71.4% 2313 AUUCAGACGAGAAGAUCAU 70.7%

2255 GAAAAGAAGGCCAAACUGG 71.4% 2314 UGCCAUAGAGGAGACACAC 70.7%

2256 CAUCAAAGACUACAGAUAU 71.4% 2315 CCCAUGGUCCAGCCAAAAG 70.7%

2257 CCCUUUGUCAGCCAUAAAG 71.4% 2316 GGAGUCUCGAUACUGAAUC 70.7%

2258 UUGUUCAUCAAAGACUACA 71.4% 2317 UGACCAAAGAUGCAGAGAG 70.6%

2259 AAAAGAAGGCCAAACUGGC 71.4% 2318 CACAAAUUCAGACGAGAAG 70.6%

2260 GCCCAUGGUCCAGCCAAAA 71.2% 2319 CUGAGCAUCGCACCAAUAA 70.6%

2261 ACAACAUACUGGUGGGAUG 71.2% 2320 CCAAAGAUGCAGAGAGAGG 70.6%

2262 GUGCUGCAACUUGUUCGAG 71.2% 2321 AGUCUCGAUACUGAAUCUU 70.6%

2263 AGUAUGGAAUAUGAUGCCG 71.2% 2322 CAACAAGGAAGAAAAUUGA 70.6%

2264 CAACAUACUGGUGGGAUGG 71.2% 2323 GCACCCCAACUCAACCCAA 70.6%

2265 AAUCCCUUUGUCAGCCAUA 71.2% 2324 GCCAUAGAGGAGACACACA 70.6%

2266 UCCCUUUGUCAGCCAUAAA 71.2% 2325 GGCACCCCAACUCAACCCA 70.6%

2267 AUUGUUCAUCAAAGACUAC 71.2% 2326 UAUACAUAUAGGUGCCAUA 70.6%

2268 AUCCCUUUGUCAGCCAUAA 71.2% 2327 AUGACCAAAGAUGCAGAGA 70.6%

2269 UGGCAAGACUAGGAAAAGG 71.1% 2328 GUUGAAACUUUAGCUAGAA 70.6%

2270 UUCUUUCACAAUCACUGGG 71.1% 2329 AUACAUAUAGGUGCCAUAG 70.6%

2271 AAUGGCAAGACUAGGAAAA 71.1% 2330 GUCGUUCAACAAACAAGGG 70.5%

2272 UUUCACAAUCACUGGGGAC 71.1% 2331 ACAAAUUCAGACGAGAAGA 70.5%

2273 UCGCACCAAUAAUGUUCUC 71.1% 2332 ACAAGGAAGAAAAUUGAGA 70.5%

2274 GCACCAAUAAUGUUCUCAA 71.1% 2333 UCCUGAGCAUCGCACCAAU 70.5%

2275 ACCAAUAAUGUUCUCAAAC 71.1% 2334 AUCCUGAGCAUCGCACCAA 70.5%

2276 UUCACAAUCACUGGGGACA 71.1% 2335 GGGGACAACACUAAGUGGA 70.5%

2277 UCACAAUCACUGGGGACAA 71.1% 2336 AUAGAGGAGACACACAAAU 70.5%

2278 CACCAAUAAUGUUCUCAAA 71.1% 2337 GGGACAACACUAAGUGGAA 70.5%

2279 AUGGCAAGACUAGGAAAAG 71.1% 2338 CCAUAGAGGAGACACACAA 70.5%

2280 ACUUAUGAUUGGACAUUAA 71.0% 2339 ACCAAAGAUGCAGAGAGAG 70.5%

2281 GCUUUGACAUUGAACACGA 71.0% 2340 ACACAAAUUCAGACGAGAA 70.5%

2282 GAAGUCGUUCAACAAACAA 71.0% 2341 UCAACAAGGAAGAAAAUUG 70.5%

2283 AGCUUUGACAUUGAACACG 71.0% 2342 GACGAGAAGAUCAUUCGAG 70.5%

2284 AAGUCGUUCAACAAACAAG 71.0% 2343 GUUCAUCAAAGACUACAGA 70.5%

2285 UAGCUAGAAGCAUUUGCGA 71.0% 2344 AAAUUCAGACGAGAAGAUC 70.5%

2286 UUUGACAUUGAACACGAUG 71.0% 2345 CAUAGAGGAGACACACAAA 70.5%

2287 CUUUGACAUUGAACACGAU 71.0% 2346 CAAAUUCAGACGAGAAGAU 70.5%

2288 AUCGCACCAAUAAUGUUCU 71.0% 2347 GGACAACACUAAGUGGAAU 70.5%

2289 CUUUAGCUAGAAGCAUUUG 71.0% 2348 AAUUCAGACGAGAAGAUCA 70.5%

2290 CUUAUGAUUGGACAUUAAA 71.0% 2349 GACCAAAGAUGCAGAGAGA 70.5%

2291 GAACACAAAUACCCGCAGA 71.0% 2350 AACUUUAGCUAGAAGCAUU 70.4%

2292 AGCAUCGCACCAAUAAUGU 70.9% 2351 AUGAAAAGAAGGCCAAACU 70.4%

2293 GCAUCGCACCAAUAAUGUU 70.9% 2352 CACCCCAACUCAACCCAAU 70.4%

2294 UUCAGACGAGAAGAUCAUU 70.9% 2353 UACUGAAUCUUGGGCAAAA 70.4%

2354 ACCCCAACUCAACCCAAUU 70.4% 2372 AAGAAGCUGUGGGAUCAAA 70.2%

2355 AAUGAAAAGAAGGCCAAAC 70.4% 2373 CACAAAUACCCGCAGAAAU 70.2%

2356 UGAAACUUUAGCUAGAAGC 70.4% 2374 UGAAAAGAAGGCCAAACUG 70.2%

2357 UUGAAACUUUAGCUAGAAG 70.4% 2375 UAAAGAAGCUGUGGGAUCA 70.1%

2358 AACAAGGAAGAAAAUUGAG 70.4% 2376 AAAGAAGCUGUGGGAUCAA 70.1%

2359 GAAACUUUAGCUAGAAGCA 70.4% 2377 CACACAAAUUCAGACGAGA 70.1%

2360 UAGAGGAGACACACAAAUU 70.4% 2378 CAAAUACCCGCAGAAAUGC 70.1%

2361 AAACUUUAGCUAGAAGCAU 70.4% 2379 UUGGAGUGUCUGGAAUAAA 70.1% 2362 CCUGAGCAUCGCACCAAUA 70.3% 2380 UAUϋUCAAUGAAUCAACAA 70.1%

2363 AUCAACAAGGAAGAAAAUU 70.3% 2381 UUUGGAGUGUCUGGAAUAA 70.1%

2364 AUACUGAAUCUUGGGCAAA 70.3% 2382 AAAUACCCGCAGAAAUGCU 70.1%

2365 CGAACACAAAUACCCGCAG 70.3% 2383 AACACAAAUACCCGCAGAA 70.1%

2366 AAUCAACAAGGAAGAAAAU 70.3% 2384 CAUCCUGAGCAUCGCACCA 70.1%

2367 ACAAAUACCCGCAGAAAUG 70.2% 2385 AAAUACAGAAACUGGGGCA 70.0%

2368 AGAAGCUGUGGGAUCAAAC 70.2% 2386 CUGCAUUAGCCAACACCAU 70.0%

2369 ACUGAAUCUUGGGCAAAAG 70.2% 2387 ACUGCAUUAGCCAACACCA 70.0%

2370 ACACAAAUACCCGCAGAAA 70.2% 2388 AAUACAGAAACUGGGGCAC 70.0%

2371 GAAUCAACAAGGAAGAAAA 70.2% 2389 AGUCGUUCAACAAACAAGG 70.0%

Table 2-3: Conserved 19-mer sequences that are present in at least 70% of the Influenza A segment 2 (PA) sequences listed in Table 1-3.

Seq ID Sequence Percent 2421 GAGGAGUGCCUGAUUAAUG 96.9%

2390 UUAGAGCCUAUGUGGAUGG 98.9% 2422 AGUGCCUGAUUAAUGAUCC 96.9%

2391 UUUAGAGCCUAUGUGGAUG 98.9% 2423 UAUGAAGCAAUUGAGGAGU 96.4%

2392 AGCAAUUGAGGAGUGCCUG 98.7% 2424 AUAUGAAGCAAUUGAGGAG 96.4% 2393 UUGAGGAGUGCCUGAUUAA 98.7% 2425 AUGAAGCAAUUGAGGAGUG 96.4% 2394 GCAAUUGAGGAGUGCCUGA 98.6% 2426 UGAAGCAAUUGAGGAGUGC 96.4% 2395 AUUGAGGAGUGCCUGAUUA 98.6% 2427 AAGCAAUUGAGGAGUGCCU 96.4% 2396 CAAUUGAGGAGUGCCUGAU 98.6% 2428 GAAGCAAUUGAGGAGUGCC 96.4% 2397 AAUUGAGGAGUGCCUGAUU 98.6% 2429 ACAAAUUUGCAGCAAUAUG 95.4% 2398 UAGAGCCUAUGUGGAUGGA 98.3% 2430 AACAAAUUUGCAGCAAUAU 95.4% 2399 AGAGCCUAUGUGGAUGGAU 98.3% 2431 GGGCUAUAUGAAGCAAUUG 95.3%

2400 GAGCCUAUGUGGAUGGAUU 98.2% 2432 GCUAUAUGAAGCAAUUGAG 95.3%

2401 UAAUGAUCCCUGGGUUUUG 98.0% 2433 UCAUGUAUUCAGAUUUUCA 95.3%

2402 UUAAUGAUCCCUGGGUUUU 98.0% 2434 GGCUAUAUGAAGCAAUUGA 95.3%

2403 AUGAUCCCUGGGUUUUGCU 98.0% 2435 CAAAUUUGCAGCAAUAUGC 95.2%

2404 AAUGAUCCCUGGGUUUUGC 98.0% 2436 AAAUUUGCAGCAAUAUGCA 95.2%

2405 CCUGAUUAAUGAUCCCUGG 97.9% 2437 UUCAUGUAUUCAGAUUUUC 95.2%

2406 UGCCUGAUUAAUGAUCCCU 97.9% 2438 AAUUUGCAGCAAUAUGCAC 95.1%

2407 AUUAAUGAUCCCUGGGUUU 97.9% 2439 ACAAACAAAUUUGCAGCAA 95.0%

2408 GCCUGAUUAAUGAUCCCUG 97.9% 2440 CAAACAAAUUUGCAGCAAU 95.0% 2409 CUGAUUAAUGAUCCCUGGG 97.9% 2441 AAACAAAUUUGCAGCAAUA 95.0%

2410 GAUUAAUGAUCCCUGGGUU 97.8% 2442 UUUUAGAGCCUAUGUGGAU 94.9%

2411 UGAUUAAUGAUCCCUGGGU 97.8% 2443 AAUUUUAGAGCCUAUGUGG 94.7%

2412 UAUAUGAAGCAAUUGAGGA 97.3% 2444 AUUUUAGAGCCUAUGUGGA 94.7%

2413 CUAUAUGAAGCAAUUGAGG 97.2% 2445 GUAUUCAGAUUUUCAUUUC 94.6%

2414 CUUGGUUCAACUCCUUCCU 97.1% 2446 GCAUCCUGUGCAGCAAUGG 94.6%

2415 UCUUGGUUCAACUCCUUCC 97.1% 2447 AUGCAUCCUGUGCAGCAAU 94.6%

2416 AGGAGUGCCUGAUUAAUGA 96.9% 2448 UAUUCAGAUUUUCAUUUCA 94.6%

2417 GAGUGCCUGAUUAAUGAUC 96.9% 2449 GAAACAAACAAAUUUGCAG 94.6%

2418 UGAGGAGUGCCUGAUUAAU 96.9% 2450 UGUAUUCAGAUUUUCAUUU 94.6%

2419 GUGCCUGAUUAAUGAUCCC 96.9% 2451 AAUGCAUCCUGUGCAGCAA 94.6%

2420 GGAGUGCCUGAUUAAUGAU 96.9% 2452 AACAAACAAAUUUGCAGCA 94.6%

2453 AAACAAACAAAUUUGCAGC 94.6% 2504 CGUCAGUCCGAAAGAGGCG 92.4%

2454 CAUCCUGUGCAGCAAUGGA 94.6% 2505 GUCAGUCCGAAAGAGGCGA 92.4%

2455 AUUCAGAUUUUCAUUUCAU 94.6% 2506 UCAGUCCGAAAGAGGCGAA 92.4%

2456 AUGUAUUCAGAUUUUCAUU 94.6% 2507 GGUAAACUUUGUGAGCAUG 92.4%

2457 UGCAUCCUGUGCAGCAAUG 94.6% 2508 UCCGAAAGAGGCGAAGAAA 92.4% 2458 CAUGUAUUCAGAUUUUCAU 94.5% 2509 CCGAAAGAGGCGAAGAAAC 92.4%

2459 AGAAUUUUAGAGCCUAUGU 94.3% 2510 UUUCGUCAGUCCGAAAGAG 92.4%

2460 GAGAAUUUUAGAGCCUAUG 94.2% 2511 UCCUUUCGUCAGUCCGAAA 92.3%

2461 GAUUUUCAUUUCAUCAAUG 94.2% 2512 CCUUUCGUCAGUCCGAAAG 92.3%

2462 UGAGAAUUUUAGAGCCUAU 94.2% 2513 GCGAUUUGAAGCAAUAUGA 92.3%

2463 UUGAGAAUUUUAGAGCCUA 94.2% 2514 CUUUCGUCAGUCCGAAAGA 92.3%

2464 AUUUUCAUUUCAUCAAUGA 94.2% 2515 AAGCGAUUUGAAGCAAUAU 92.1%

2465 UCAGAUUUUCAUUUCAUCA 94.1% 2516 AAGAUUUUGUGCGACAAUG 92.0%

2466 GAAUUUUAGAGCCUAUGUG 94.1% 2517 AGCGAUUUGAAGCAAUAUG 92.0%

2467 CAGAUUUUCAUUUCAUCAA 94.1% 2518 GUCCGAAAGAGGCGAAGAA 91.8%

2468 GUUCAGGCUCUUAGGGACA 94.1% 2519 GGCAAGCUUUCUCAAAUGU 91.5%

2469 UUCAGGCUCUUAGGGACAA 94.1% 2520 ACAUUCUUUGGAUGGAAAG 91.3%

2470 AGAUUUUCAUUUCAUCAAU 94.1% 2521 CAUϋCUUUGGAUGGAAAGA 91.3%

2471 UUCAGAUUUUCAUUUCAUC 94.0% 2522 GAUUUGAAGCAAUAUGAUA 91.2%

2472 AGAGGCGAAGAAACAAUUG 93.7% 2523 AGGGCAAGCUUUCUCAAAU 91.2%

2473 GAGGCGAAGAAACAAUUGA 93.7% 2524 UUGAGGGCAAGCUUUCUCA 91.2%

2474 GGCGAAGAAACAAUUGAAG 93.5% 2525 AUUUGAAGCAAUAUGAUAG 91.2%

2475 CGAAGAAACAAUUGAAGAA 93.5% 2526 AUUGAGGGCAAGCUUUCUC 91.2%

2476 GCGAAGAAACAAUUGAAGA 93.5% 2527 GAGGGCAAGCUUUCUCAAA 91.2%

2477 AGGCGAAGAAACAAUUGAA 93.5% 2528 GGGCAAGCUUUCUCAAAUG 91.2%

2478 GAAGAAACAAUUGAAGAAA 93.5% 2529 CAUUGAGGGCAAGCUUUCU 91.2%

2479 AAAUGAAAUGGGGAAUGGA 93.4% 2530 GCAAGCUUUCUCAAAUGUC 91.0%

2480 AAAAUGAAAUGGGGAAUGG 93.4% 2531 UGAGGGCAAGCUUUCUCAA 91.0%

2481 UGAAAUGGGGAAUGGAGAU 93.3% 2532 CAGAAUGAGUUCAACAAGG 91.0%

2482 AAUGGGGAAUGGAGAUGAG 93.3% 2533 AGAAUGAGUUCAACAAGGC 91.0%

2483 CACAUUUUCUCAUUCACUG 93.3% 2534 AUAAGCGAUUUGAAGCAAU 90.7%

2484 AAAUGGGGAAUGGAGAUGA 93.3% 2535 UUCUUUGGAUGGAAAGAAC 90.5%

2485 GAAAUGGGGAAUGGAGAUG 93.3% 2536 CGAUUUGAAGCAAUAUGAU 90.5%

2486 AUGAAAUGGGGAAUGGAGA 93.3% 2537 UCUUUGGAUGGAAAGAACC 90.5%

2487 ACAUUUUCUCAUUCACUGG 93.2% 2538 AGCCUAUGUGGAUGGAUUC 90.4% 2488 AAAGAGGCGAAGAAACAAU 92.8% 2539 CAAUGAAAGAGUAUGGAGA 90.4% 2489 GAAAGAGGCGAAGAAACAA 92.8% 2540 UGAAGCAAUAUGAUAGUGA 90.2%

' 2490 AAUGAAAUGGGGAAUGGAG 92.8% 2541 UGGAAGAUUUUGUGCGACA 90.1%

2491 UAAACUUUGUGAGCAUGGA 92.5% 2542 AUGGAAGAUUUUGUGCGAC 90.1%

2492 AAGAGGCGAAGAAACAAUU 92.5% 2543 AAAGAGUUUUUUGAGAAUA 90.1%

2493 GUAAACUUUGUGAGCAUGG 92.5% 2544 UAAUGAAGGGGGUAUACAU 90.0%

2494 CAGUCCGAAAGAGGCGAAG 92.5% 2545 GAAGAUUUUGUGCGACAAU 90.0%

2495 AGUCCGAAAGAGGCGAAGA 92.5% 2546 UCUGGAUAGAGCUCGAUGA 89.9%

2496 CGAAAGAGGCGAAGAAACA 92.4% 2547 AUAAUGAAGGGGGUAUACA 89.9%

2497 UGGUAAACUUUGUGAGCAU 92.4% 2548 AUUCUUUGGAUGGAAAGAA 89.9%

2498 GUGGUAAACUUUGUGAGCA 92.4% 2549 UUGAAGCAAUAUGAUAGUG 89.9%

2499 UUUGUGCGACAAUGCUUCA' 92.4% 2550 GAGAAAGACAUGACCAAAG 89.6%

2500 UCGUCAGUCCGAAAGAGGC 92.4% 2551 UUUGAAGCAAUAUGAUAGU 89.6%

2501 UUGUGCGACAAUGCUUCAA 92.4% 2552 AGAAAGACAUGACCAAAGA 89.4%

2502 UAAGCGAUUUGAAGCAAUA 92.4% 2553 GAUUUUGUGCGACAAUGCU 89.4%

2503 UUCGUCAGUCCGAAAGAGG 92.4% 2554 AUUUUGUGCGACAAUGCUU 89.4%

2555 UGAAGGGGGUAUACAUUAA 89.3% 2606 GGGGGUAUACAUUAAUACU 87.0%

2556 AGAUUUUGUGCGACAAUGC 89.2% 2607 GGAUGGAUUCGAACCGAAC 87.0%

2557 GGAAGAUUUUGUGCGACAA 89.2% 2608 UGUGGAUGGAUUCGAACCG 87.0%

2558 AUGAAGGGGGUAUACAUUA 89.2% 2609 CUUGAGAAUUUUAGAGCCU 86.9%

2559 AAUGAAGGGGGUAUACAUU 89.2% 2610 GCCUUGAGAAUUUUAGAGC 86.9%

2560 CCUAUGUGGAUGGAUUCGA 89.2% 2611 CUGCCUUGAGAAUUUUAGA 86.9%

2561 CUAUGUGGAUGGAUUCGAA 89.2% 2612 UCCUGCCUUGAGAAUUUUA 86.9%

2562 AUGUGGAUGGAUUCGAACC 89.1% 2613 UGUGCAGCAAUGGACGAUU 86.9%

2563 UAUGUGGAUGGAUUCGAAC 89.1% 2614 GUGCAGCAAUGGACGAUUU 86.9%

2564 AAAGACAUGACCAAAGAGU 89.0% 2615 UUCUCCUGCCUUGAGAAUU 86.9%

2565 AACAUUCUUUGGAUGGAAA 89.0% 2616 CUCCUGCCUUGAGAAUUUU 86.9%

2566 AAGACAUGACCAAAGAGUU 89.0% 2617 UGCCUUGAGAAUUUUAGAG 86.9%

2567 UUUUGUGCGACAAUGCUUC 88.9% 2618 UCUCCUGCCUUGAGAAUUU 86.9%

2568 GCCUAUGUGGAUGGAUUCG 88.7% 2619 CCUGCCUUGAGAAUUUUAG 86.9%

2569 GGGAUUCCUUUCGUCAGUC 88.4% 2620 CCUUGAGAAUUUUAGAGCC 86.9%

2570 UGGGAUUCCUUUCGUCAGU 88.4% 2621 UGCAUUGAGGGCAAGCUUU 86.8%

2571 GAAAGACAUGACCAAAGAG 88.4% 2622 GGCCUCUGGGAUUCCUUUC 86.8%

2572 GCAAUGAAAGAGUAUGGAG 88.3% 2623 GCAUUGAGGGCAAGCUUUC 86.8%

2573 AUUCCUUUCGUCAGUCCGA 88.1% 2624 AUCCUGUGCAGCAAUGGAC 86.8%

2574 GAUUCCUUUCGUCAGUCCG 88.1% 2625 UCCUGUGCAGCAAUGGACG 86.7%

2575 GGAUUCCUUUCGUCAGUCC 88.1% 2626 AGAGGCCUCUGGGAUUCCU 86.6%

2576 AAGGGGGUAUACAUUAAUA 87.8% 2627 AGGCCUCUGGGAUUCCUUU 86.6%

2577 GAAGGGGGUAUACAUUAAU 87.8% 2628 CUUCUCCUGCCUUGAGAAU 86.6%

2578 AAUGAAAGAGUAUGGAGAG 87.8% 2629 GCCUCUGGGAUUCCUUUCG 86.6%

2579 AGGGGGUAUACAUUAAUAC 87.8% 2630 GAGGCCUCUGGGAUUCCUU 86.6%

2580 UGUAUGAUUACAAGGAGAA 87.7% 2631 CCUGUGCAGCAAUGGACGA 86.6%

2581 CCGAACUUCUCCUGCCUUG 87.4% 2632 CCUCUGGGAUUCCUUUCGU 86.5% 2582 CGAACUUCUCCUGCCUUGA 87.4% 2633 CUGGGAUUCCUUUCGUCAG 86.5%

2583 AACUUCUCCUGCCUUGAGA 87.3% 2634 CUGUGCAGCAAUGGACGAU 86.4%

2584 GAACUUCUCCUGCCUUGAG 87.3% 2635 GGAUUCGAACCGAACGGCU 86.2%

2585 AAAGAGAAAGACAUGACCA 87.3% 2636 UGGAUUCGAACCGAACGGC 86.2%

2586 AUGGAUUCGAACCGAACGG 87.2% 2637 CAGAGGCCUCUGGGAUUCC 86.1%

2587 AAGAGAAAGACAUGACCAA 87.2% 2638 ACGGCUGCAUUGAGGGCAA 86.0%

2588 CUCUGGGAUUCCUUUCGUC 87.2% 2639 AACGGCUGCAUUGAGGGCA 86.0%

2589 GAUGGAUUCGAACCGAACG 87.2% 2640 GCUGCAUUGAGGGCAAGCU 85.8% 2590 GGGUAUACAUUAAUACUGC 87.2% 2641 UUCCUUUCGUCAGUCCGAA 85.8%

2591 GGGGUAUACAUUAAUACUG 87.2% 2642 CUGCAUUGAGGGCAAGCUU 85.8%

2592 AAGCUUUCUCAAAUGUCCA 87.1% 2643 GGCUGCAUUGAGGGCAAGC 85.8% 2593 AGCUUUCUCAAAUGUCCAA 87.1% 2644 CGGCUGCAUUGAGGGCAAG 85.8% 2594 ACAUGACCAAAGAGUUUUU 87.1% 2645 UUGUAUGAUUACAAGGAGA 85.8% 2595 AGAGAAAGACAUGACCAAA 87.1% 2646 AUAGUGACGAACCUGAAUU 85.7%

2596 UCUGGGAUUCCUUUCGUCA 87.1% 2647 GGGGCUAUAUGAAGCAAUU 85.6%

2597 ACUUCUCCUGCCUUGAGAA 87.1% 2648 UGACCAAAGAGUUUUUUGA 85.6%

2598 GACAUGACCAAAGAGUUUU 87.1% 2649 UUGUUCAGGCUCUUAGGGA 85.5%

2599 AGACAUGACCAAAGAGUUU 87.1% 2650 CAUGACCAAAGAGUUUUUU 85.5%

2600 GUGGAUGGAUUCGAACCGA 87.0% 2651 AUGACCAAAGAGUUUUUUG 85.5%

2601 AAGCACAGAUUUGAAAUAA 87.0% 2652 UUUUCUCAUUCACUGGGGA 85.4%

2602 AGCACAGAUUUGAAAUAAU 87.0% 2653 UGUUCAGGCUCUUAGGGAC 85.4%

2603 CAAGCUUUCUCAAAUGUCC 87.0% 2654 GACCAAAGAGUUUUUUGAG 85.4%

2604 GCUUUCUCAAAUGUCCAAA 87.0% 2655 CAUUUUCUCAUUCACUGGG 85.4%

2605 UGGAUGGAUUCGAACCGAA 87.0% 2656 AUUUUCUCAUUCACUGGGG 85.4%

2657 AAGCAAUAUGAUAGUGACG 85.4% 2708 ACGAACCUGAAUUAAGGUC 84.1%

2658 GAUAGUGACGAACCUGAAU 85.4% 2709 AUGUUCUUGUAUGUGAGGA 84.1%

2659 AGCAAUAUGAUAGUGACGA 85.3% 2710 UUUUCUCUCACUGACCCGA 84.1%

2660 ACCAAAGAGUUUUUUGAGA 85.2% 2711 AACAGAGGCCUCUGGGAUU 84.1%

2661 CUUGCCGACCAAAGUCUCC 85.2% 2712 UUUCUCUCACUGACCCGAG 84.1%

2662 UUGCCGACCAAAGUCUCCC 85.2% 2713 GAACGGCUGCAUUGAGGGC 84.1%

2663 UAUGAUAGUGACGAACCUG 85.1% 2714 UGUUCUUGUAUGUGAGGAC 84.1%

2664 AAUAUGAUAGUGACGAACC 85.1% 2715 AAUAAAAUUAAAUCUGAGA 84.0%

2665 UGAUAGUGACGAACCUGAA 85.1% 2716 GACGAACCUGAAUUAAGGU 84.0%

2666 AUGAUAGUGACGAACCUGA 85.1% 2717 AUAAAAUUAAAUCUGAGAA 84.0%

2667 CAAUAUGAUAGUGACGAAC 85.1% 2718 GCCAACAGAGGCCUCUGGG 83.9%

2668 UUGAUGAUCCAAAUGCACU 85.1% 2719 UGGCCAACAGAGGCCUCUG 83.9%

2669 CUUUCUCAAAUGUCCAAAG 85.0% 2720 CUAAGAAGUGCCAUAGGCC 83.9%

2670 UCAAUAGCCUGUAUGCAUC 85.0% 2721 CCAACAGAGGCCUCUGGGA 83.9%

2671 AAGGCCUAUGUUCUUGUAU 85.0% 2722 CAACAGAGGCCUCUGGGAU 83.9%

2672 AAAUGGGAGAAAUACUGUG 85.0% 2723 GGCCAACAGAGGCCUCUGG 83.9%

2673 AUAUGAUAGUGACGAACCU 85.0% 2724 AUGGCCAACAGAGGCCUCU 83.8%

2674 AAUGGGAGAAAUACUGUGU 85.0% 2725 UUCUUGUAUGUGAGGACAA 83.8%

2675 CUUGAUGAUCCAAAUGCAC 85.0% 2726 UCUUGUAUGUGAGGACAAA 83.8%

2676 UUCAAUAGCCUGUAUGCAU 85.0% 2727 AAAAGGCCAAUAAAAUUAA 83.7%

2677 GAAGCAAUAUGAUAGUGAC 85.0% 2728 AGGCCAAUAAAAUUAAAUC 83.7%

2678 GCUUGCCGACCAAAGUCUC 84.9% 2729 AAAGGCCAAUAAAAUUAAA 83.7%

2679 CAAGGCCUAUGUUCUUGUA 84.9% 2730 GAAAAGGCCAAUAAAAUUA 83.7%

2680 GCAAUAUGAUAGUGACGAA 84.9% 2731 CGAACCUGAAUUAAGGUCA 83.7%

2681 UCAAGGCCUAUGUUCUUGU 84.9% 2732 UUUUUUGAGAAUAAAUCAG 83.7%

2682 CCAAAGAGUUUUUUGAGAA 84.8% 2733 AAGGCCAAUAAAAUUAAAU 83.7%

2683 AGGCCUAUGUUCUUGUAUG 84.8% 2734 GUUCUUGUAUGUGAGGACA 83.6%

2684 UAAGAAGUGCCAUAGGCCA 84.8% 2735 UUUUUGAGAAUAAAUCAGA 83.6%

2685 GGCCUAUGUUCUUGUAUGU 84.6% 2736 UUUUGAGAAUAAAUCAGAA 83.5%

2686 CUCAAAUGUCCAAAGAAGU 84.6% 2737 GUUUUUUGAGAAUAAAUCA 83.3%

2687 UCUCAAAUGUCCAAAGAAG 84.6% 2738 AGUUUUUUGAGAAUAAAUC 83.3%

2688 UUUCUCAAAUGUCCAAAGA 84.5% 2739 AAGAGUUUUUUGAGAAUAA 83.3%

2689 AACCGAACGGCUGCAUUGA 84.4% 2740 GAGUUUUUUGAGAAUAAAU 83.3%

2690 CCAAUAAAAUUAAAUCUGA 84.4% 2741 AGUGACGAACCUGAAUUAA 83.2%

2691 GCCAAUAAAAUUAAAUCUG 84.4% 2742 GUGACGAACCUGAAUUAAG 83.2%

2692 UUCUCAAAUGUCCAAAGAA 84.3% 2743 AAAUGGCCAACAGAGGCCU 83.2%

2693 AGGCUUGCCGACCAAAGUC 84.3% 2744 GGGGGCUAUAUGAAGCAAU 83.2%

2694 AGGUCACUUUCAAGCUGGA 84.3% 2745 GAAAUGGCCAACAGAGGCC 83.2%

2695 CUAUGUUCUUGUAUGUGAG 84.3% 2746 CAAAGAGUUUUUUGAGAAU 83.2%

2696 GGUCACUUUCAAGCUGGAU 84.3% 2747 AGAGUUUUUUGAGAAUAAA 83.2%

2697 UGUGGUAAACUUUGUGAGC 84.3% 2748 UGGACAGUAGUAAACAGUA 83.0%

2698 CGAACGGCUGCAUUGAGGG 84.3% 2749 GGACAGUAGUAAACAGUAU 83.0%

2699 AUGUGGUAAACUUUGUGAG 84.3% 2750 GGCCAAUAAAAUUAAAUCU 82.9%

2700 CCUAUGUUCUUGUAUGUGA 84.3% 2751 ACAAUGGCCUGGACAGUAG 82.9%

2701 GAUGUGGUAAACUUUGUGA 84.3% 2752 CAAUGGCCUGGACAGUAGU 82.9%

2702 GGCUUGCCGACCAAAGUCU 84.3% 2753 GCCUGGACAGUAGUAAACA 82.8%

2703 CCGAACGGCUGCAUUGAGG 84.2% 2754 AUGGCCUGGACAGUAGUAA 82.8%

2704 CAAUAAAAUUAAAUCUGAG 84.2% 2755 UGGCCUGGACAGUAGUAAA 82.8%

2705 GCCUAUGUUCUUGUAUGUG 84.2% 2756 ACAUAAGCGAUUUGAAGCA 82.8%

2706 ACAGAGGCCUCUGGGAUUC 84.1% 2757 GGGGGGCUAUAUGAAGCAA 82.8%

2707 ACCGAACGGCUGCAUUGAG 84.1% 2758 GGCCUGGACAGUAGUAAAC 82.8%

2759 GGAGAAAUACUGUGUCCUU 82.7% 2810 UGCAGAACUAAAGAGGGAA 82.0%

2760 GACAUAAGCGAUUUGAAGC 82.7% 2811 CGAACCGAACGGCUGCAUU 81.9% 27Sl AGAAAUACUGUGUCCUUGA 82.7% 2812 CACACGAAAAGGGAAUAAA 81.9%

2762 UGGGAGAAAϋACUGUGUCC 82.7% 2813 UUUCAUUUCAUCAAUGAAC 81.9%

2763 GGGAGAAAUACUGUGUCCU 82.7% 2814 UCGAACCGAACGGCUGCAU 81.8%

2764 AGACAUAAGCGAUUUGAAG 82.7% 2815 CCACACGAAAAGGGAAUAA 81.8%

2765 GAGAAAUACUGUGUCCUUG 82.7% 2816 UAGUGACGAACCUGAAUUA 81.8%

2766 CCUGGACAGUAGUAAACAG 82.7% 2817 UUCGAACCGAACGGCUGCA 81.8%

2767 GAGACAUAAGCGAUUUGAA 82.6% 2818 GCAGAACUAAAGAGGGAAG 81.8%

2768 AGAGACAUAAGCGAUUUGA 82.6% 2819 ACACGAAAAGGGAAUAAAU 81.7%

2769 GAGGAAAGCAGGGCUAGGA 82.6% 2820 UCUCUCACUGACCCGAGAC 81.7%

2770 AGGAAAGCAGGGCUAGGAU 82.6% 2821 CUCUCACUGACCCGAGACU 81.7%

2771 AUGGGAGAAAUACUGUGUC 82.6% 2822 UUUCAUGUAUUCAGAUUUU 81.6%

2772 UGAUUGAAGCCGAGUCCUC 82.5% 2823 GUUUCAUGUAUUCAGAUUU 81.6%

2773 UACCUUGAAAAGGCCAAUA 82.5% 2824 UGUUUCAUGUAUUCAGAUU 81.6%

2774 AUGAUUGAAGCCGAGUCCU 82.5% 2825 AAAGCACAGAUUUGAAAUA 81.4%

2775 CAUAAGCGAUUUGAAGCAA 82.4% 2826 ACCACACGAAAAGGGAAUA 81.4%

2776 ACCUUGAAAAGGCCAAUAA 82.4% 2827 AAGAAAUGGCCAACAGAGG 81.4%

2777 AACAUGGCACCAGAGAAAG 82.4% 2828 AUUCGAACCGAAQGGCUGC 81.4%

2778 UCACAGCAGAGGUGUCCCA 82.4% 2829 CAAGAAAUGGCCAACAGAG 81.4%

2779 CUAAAAACAUGAAGAAAAC 82.4% 2830 UUCAUUUCAUCAAUGAACA 81.4%

2780 UUCACAGCAGAGGUGUCCC 82.4% 2831 UAUGUUCUUGUAUGUGAGG 81.4%

2781 ACAACACCAAGACCAAUCA 82.4% 2832 UAAAGCACAGAUUUGAAAU 81.4%

2782 ACAUGGCACCAGAGAAAGU 82.4% 2833 GAUUCGAACCGAACGGCUG 81.4%

2783 AAUGGCCAACAGAGGCCUC 82.3% 2834 AUUUCAUCAAUGAACAAGG 81.3%

2784 AUGUCCAAAGAAGUGAAUG 82.3% 2835 UCAUUUCAUCAAUGAACAA 81.3%

2785 UUCUCUCACUGACCCGAGA 82.3% 2836 CAUUUCAUCAAUGAACAAG 81.3%

2786 UGUCCAAAGAAGUGAAUGC 82.3% 2837 AACCUGAAUUAAGGUCACU 81.3%

2787 UUGAAAAGGCCAAUAAAAU 82.3% 2838 AAUUAAGGUCACUUUCAAG 81.3%

2788 CAUGGCCCAUUGGGGAGUC 82.3% 2839 GAACCUGAAUUAAGGUCAC 81.3%

2789 AGCCACAUAAAUGGGAGAA 82.2% 2840 UCACUGGGGAGGAAAUGGC 81.2%

2790 AAUGGCCUGGACAGUAGUA 82.2% 2841 CAUUCACUGGGGAGGAAAU 81.2%

2791 CCACAUAAAUGGGAGAAAU 82.2% 2842 UCAUUCACUGGGGAGGAAA 81.2%

2792 ACAUAAAUGGGAGAAAUAC 82.2% 2843 UUCACUGGGGAGGAAAUGG 81.2%

2793 GAGCCACAUAAAUGGGAGA 82.2% 2844 AUUCACUGGGGAGGAAAUG 81.1%

2794 AGCAUGAUUGAAGCCGAGU 82.2% 2845 AAACCACACGAAAAGGGAA 81.1%

2795 UGACGAACCUGAAUUAAGG 82.2% 2846 CACUGGGGAGGAAAUGGCC 81.1%

2796 CAAAUGUCCAAAGAAGUGA 82.2% 2847 AACCACACGAAAAGGGAAU 81.1%

2797 GCAUGAUUGAAGCCGAGUC 82.2% 2848 UGGGGAAUGGAGAUGAGAC 81.0%

2798 UGAAAAGGCCAAUAAAAUU 82.2% 2849 UGAAUUAAGGUCACUUUCA 81.0%

2799 AUAAAUGGGAGAAAUACUG 82.2% 2850 AGCAUUGAAGACCCAAGUC 81.0%

2800 CAUAAAUGGGAGAAAUACU 82.2% 2851 GAAUUAAGGUCACUUUCAA 81.0%

2801 CACAUAAAUGGGAGAAAUA 82.2% 2852 GAAAAACCAAUUUAUAUGG 81.0%

2802 AAAUGUCCAAAGAAGUGAA 82.2% 2853 AUGGGGAAUGGAGAUGAGA 81.0%

2803 UAAAUGGGAGAAAUACUGU 82.2% 2854 CUGAAUUAAGGUCACUUUC 81.0%

2804 AAUGUCCAAAGAAGUGAAU 82.2% 2855 GCAUUGAAGACCCAAGUCA 81.0%

2805 CUUGAAAAGGCCAAUAAAA 82.1% 2856 GGGGAAUGGAGAUGAGACG 81.0%

2806 CUGGACAGUAGUAAACAGU 82.1% 2857 CCUGAAUUAAGGUCACUUU 80.9%

2807 GAACCGAACGGCUGCAUUG 82.1% 2858 CGAAAAACCAAUUUAUAUG 80.8%

2808 UϋUUCAUUUCAUCAAUGAA 82.1% 2859 AAGGUCACUUUCAAGCUGG 80.7%

2809 GCCACAUAAAUGGGAGAAA 82.0% 2860 UGGCACCAGAGAAAGUAGA 80.7%

2861 AUGGCACCAGAGAAAGUAG 80.7% 2912 CCCAAGGGAGUGGAAGAAG 79.6%

2862 AGAAAUGGCCAACAGAGGC 80.7% 2913 GACCCAAGUCACGAAGGAG 79.5%

2863 AGAGCAUGAUUGAAGCCGA 80.6% 2914 GGUAUACAUUAAUACUGCC 79.5%

2864 CAUGGCACCAGAGAAAGUA 80.6% 2915 ACCCAAGUCACGAAGGAGA 79.5%

2865 AUUGAAGACCCAAGUCACG 80.6% 2916 UCAAUGCAUCCUGUGCAGC 79.4%

2866 CAUUGAAGACCCAAGUCAC 80.6% 2917 GAGGAAAUGGCCACAAAGG 79.4%

2867 UCUCAUUCACUGGGGAGGA 80.6% 2918 UGGAGAUGAGACGUUGCCU 79.4%

2868 UUCUCAUUCACUGGGGAGG 80.6% 2919 CAAUGCAUCCUGUGCAGCA 79.4%

2869 UUGAAGACCCAAGUCACGA 80.6% 2920 AGGAAAUGGCCACAAAGGC 79.4%

2870 CGGAGUCAAGAAAACUGCU 80.6% 2921 UGCAAGCAUGAGGAGGAAU 79.4%

2871 ACCUGAAUUAAGGUCACUU 80.6% 2922 AUGGAGAUGAGACGUUGCC 79.4%

2872 GAGAGCAUGAUUGAAGCCG 80.6% 2923 CUCAAUGCAUCCUGUGCAG 79.4%

2873 UAAGGUCACUUUCAAGCUG 80.5% 2924 ACAAUUGAAGAAAAAUUUG 79.3%

2874 GCGGAGUCAAGAAAACUGC 80.5% 2925 UGAUGAUCCAAAUGCACUG 79.3%

2875 GCACCAGAGAAAGUAGACU 80.4% 2926 CAAUUGAAGAAAAAUUUGA 79.3%

2876 CACCAGAGAAAGUAGACUU 80.4% 2927 AAUUGAAGAAAAAUUUGAA 79.3%

2877 UUAAAGCACAGAUUUGAAA 80.4% 2928 GCUCAAUGCAUCCUGUGCA 79.3% 2878 GGCACCAGAGAAAGUAGAC 80.4% 2929 UCAAACCACACGAAAAGGG 79.3%

2879 CCAGAGAAAGUAGACUUUG 80.4% 2930 UUGAAGAAAAAUUUGAAAU 79.3%

2880 GAGCAUGAUUGAAGCCGAG 80.4% 2931 GGAGAUGAGACGUUGCCUC 79.3%

2881 ACCAGAGAAAGUAGACUUU 80.3% 2932 AUUGAAGAAAAAUUUGAAA 79.3%

2882 CUCAUUCACUGGGGAGGAA 80.3% 2933 UGCUCAAUGCAUCCUGUGC 79.3%

2883 AACACCAAGACCAAUCAAA 80.3% 2934 GAGAUGAGACGUUGCCUCC 79.2%

2884 UAUAUGAUGCGAUCAAGUG 80.3% 2935 GAAUGGAGAUGAGACGUUG 79.2%

2885 CCUUGAAAAGGCCAAUAAA 80.3% 2936 GAAGACCCAAGUCACGAAG 79.2%

2886 CAGAGAAAGUAGACUUUGA 80.2% 2937 UGUUAAAGCACAGAUUUGA 79.2%

2887 UUUCUCAUUCACUGGGGAG 80.2% 2938 GGAAUGGAGAUGAGACGUU 79.2%

2888 ACACCAAGACCAAUCAAAC 80.2% 2939 UGAAGAAAAAUUUGAAAUC 79.2%

2889 UUAAGGUCACUUUCAAGCU 80.2% 2940 AAGACCCAAGUCACGAAGG 79.2%

2890 CACCAAGACCAAUCAAACU 80.2% 2941 GCAGCAAUGGACGAUUUUC 79.2%

2891 AUUAAGGUCACUUUCAAGC 80.2% 2942 AGAUGAGACGUUGCCUCCU 79.2%

2892 CAACACCAAGACCAAUCAA 80.2% 2943 GUCAAACCACACGAAAAGG 79.2%

2893 UUUUCAGCGGAGUCAAGAA 80.1% 2944 CAGCAAUGGACGAUUUUCA 79.2%

2894 ACUAAAAACAUGAAGAAAA 80.1% 2945 UGCAGCAAUGGACGAUUUU 79.2%

2895 CAGCGGAGUCAAGAAAACϋ 80.1% 2946 GGGAAUGGAGAUGAGACGU 79.2% 2896 UUUCAGCGGAGUCAAGAAA 80.1% 2947 AUGCACUGUUAAAGCACAG 79.1%

2897 AGCGGAGUCAAGAAAACUG 80.1% 2948 CUGUUAAAGCACAGAUUUG 79.1%

2898 UUCAGCGGAGUCAAGAAAA 80.1% 2949 AAUGCACUGUUAAAGCACA 79.1%

2899 UCAGCGGAGUCAAGAAAAC 80.1% 2950 AAUGGAGAUGAGACGUUGC 79.1%

2900 CCGAUGAUUGUCGAACUUG 79.7% 2951 AAAUGCACUGUUAAAGCAC 79.0%

2901 GCAAGCAUGAGGAGGAAUU 79.7% 2952 ACUGUUAAAGCACAGAUUU 79.0%

2902 CAUGAUUGAAGCCGAGUCC 79.7% 2953 CCAAAUGCACUGUUAAAGC 79.0%

2903 CGAUGAUUGUCGAACUUGC 79.7% 2954 UGAAGACCCAAGUCACGAA 79.0%

2904 AUUGCAAGCAUGAGGAGGA 79.7% 2955 GCACUGUUAAAGCACAGAU 79.0%

2905 UUGCAAGCAUGAGGAGGAA 79.7% 2956 CAAAUGCACUGUUAAAGCA 79.0%

2906 CAAGCAUGAGGAGGAAUUA 79.7% 2957 CACUGUUAAAGCACAGAUU 79.0%

2907 GAUCCAAAUGCACUGUUAA 79.7% 2958 UUCAUCAAUGAACAAGGCG 78.9%

2908 CCAAGGGAGUGGAAGAAGG 79.6% 2959 UGCACUGUUAAAGCACAGA 78.9%

2909 ACAUUGCAAGCAUGAGGAG 79.6% 2960 UCAUCAAUGAACAAGGCGA 78.9%

2910 AUCCAAAUGCACUGUUAAA 79.6% 2961 AUGUUUCAUGUAUUCAGAU 78.9%

2911 CAUUGCAAGCAUGAGGAGG 79.6% 2962 UCCAAAUGCACUGUUAAAG 78.8%

2963 GGGAGGAAAUGGCCACAAA 78.8% 3014 CAAGUGCAGAACUAAAGAG 77.9%

2964 CAAAGGCAGACUACACUCU 78.8% 3015 UUGUUGUUCAGGCUCUUAG 77.8%

2965 GGGGAGGAAAUGGCCACAA 78.8% 3016 GGGAGAGACAGAACAAUGG 77.8%

2966 UAUGUUUCAUGUAUUCAGA 78.8% 3017 CAAACCACACGAAAAGGGA 77.8%

2967 GGAGGAAAUGGCCACAAAG 78.8% 3018 GGAUACAGAAUGAGUUCAA 77.8%

2968 GUAUGUUUCAUGUAUUCAG 78.8% 3019 UCACAUUUUCUCAUUCACU 77.8%

2969 ACUGGGGAGGAAAUGGCCA 78.7% 3020 UGGAUACAGAAUGAGUUCA 77.8%

2970 UUUCAUCAAUGAACAAGGC 78.7% 3021 GGAGAGACAGAACAAUGGC 77.8%

2971 GACAACACCAAGACCAAUC 78.7% 3022 GAACAAUGGCCUGGACAGU 77.7%

2972 CUGGGGAGGAAAUGGCCAC 78.7% 3023 CAAGCUGGAUACAGAAUGA 77.7%

2973 AGCAAUGGACGAUUUUCAA 78.7% 3024 UACAGAAUGAGUUCAACAA 77.7%

2974 UGGGGAGGAAAUGGCCACA 78.7% 3025 AGCAUGAGGAGGAAUUAUU 77.6%

2975 AUGAUAAGCAAGUGCAGAA 78.7% 3026 AUACAGAAUGAGUUCAACA 77.6%

2976 AAUUAUUUCACAGCAGAGG 78.6% 3027 AGGGGAGAGACAGAACAAU 77.6%

2977 GUUAAAGCACAGAUUUGAA 78.6% 3028 GAGGGGAGAGACAGAACAA 77.6%

2978 AAAACAUGAAGAAAACGAG 78.6% 3029 GGGGAGAGACAGAACAAUG 77.6%

2979 AAAAACAUGAAGAAAACGA 78.6% 3030 GCAUGAGGAGGAAUUAUUU 77.6%

2980 ACAUGAAGAAAACGAGUCA 78.6% 3031 UCAAGCUGGAUACAGAAUG 77.6%

2981 AUUAUUUCACAGCAGAGGU 78.6% 3032 UGUUGUUCAGGCUCUUAGG 77.6%

2982 UCAAAAUGAAAUGGGGAAU 78.6% 3033 UUAUUσCACAGCAGAGGUG 77.6% 2983 AAACAUGAAGAAAACGAGU 78.6% 3034 GAUACAGAAUGAGUUCAAC 77.6%

2984 UGAUAAGCAAGUGCAGAAC 78.6% 3035 AUUCACAUUUUCUCAUUCA 77.6%

2985 AACAUGAAGAAAACGAGUC 78.6% 3036 ACAGCAGAGGUGUCCCAUU 77.6%

2986 CAAAAUGAAAUGGGGAAUG 78.5% 3037 CAGCAGAGGUGUCCCAUUG 77.6%

2987 AUGAUCCAAAUGCACUGUU 78.4% 3038 CACACAUUCACAUUUUCUC 77.6%

2988 AGGAAUUAUUUCACAGCAG 78.4% 3039 AAGCAUGAGGAGGAAUUAU 77.6%

2989 GAAUUAUUUCACAGCAGAG 78.4% 3040 CUGGAUACAGAAUGAGUUC 77.5%

2990 GGAAUUAUUUCACAGCAGA 78.4% 3041 CAUUCACAUUUUCUCAUUC 77.5%

2991 GAUGAUCCAAAUGCACUGU 78.3% 3042 ACACACAUUCACAUUUUCU 77.5%

2992 AAAGCAGGGCUAGGAUUAA 78.3% 3043 CAUGAGGAGGAAUUAUUUC 77.5%

2993 GUUGUUCAGGCUCUUAGGG 78.3% 3044 AGACCCAAGUCACGAAGGA 77.5%

2994 AUGAGGAGGAAUUAUUUCA 78.3% 3045 ACAUUCACAUUUUCUCAUU 77.5%

2995 UGAGGAGGAAUUAUUUCAC 78.3% 3046 CACAGCAGAGGUGUCCCAU 77.5%

2996 GAUAAGCAAGUGCAGAACU 78.1% 3047 ACACAUUCACAUUUUCUCA 77.5%

2997 UUAAAGAGAAAGACAUGAC 78.1% 3048 GCUGGAUACAGAAUGAGUU 77.5%

2998 GAAAGCAGGGCUAGGAUUA 78.1% 3049 ACCAAGACCAAUCAAACUU 77.5%

2999 UAAGCAAGUGCAGAACUAA 78.1% 3050 AAGCUGGAUACAGAAUGAG 77.5%

3000 UGAUCCAAAUGCACUGUUA 78.1% 3051 AGCUGGAUACAGAAUGAGU 77.5%

3001 UACAUUGCAAGCAUGAGGA 78.1% 3052 UUUCACAGCAGAGGUGUCC 77.5%

3002 AGCAAGUGCAGAACUAAAG 78.0% 3053 CACAUUCACAUUUUCUCAU 77.5%

3003 UAAAGAGAAAGACAUGACC 78.0% 3054 CCAAGACCAAUCAAACUUC 77.5%

3004 GCAAGUGCAGAACUAAAGA 78.0% 3055 GAUCCCUGGGUUUUGCUCA 77.4%

3005 AUAAGCAAGUGCAGAACUA 78.0% 3056 CCUGGGUUUUGCUCAAUGC 77.4%

3006 AAGCAAGUGCAGAACUAAA 78.0% 3057 UGAUCCCUGGGUUUUGCUC 77.4% 3007 UUCACAUUUUCUCAUUCAC 77.9% 3058 CAUAAUGAAGGGGGUAUAC 77.4%

3008 AUUUCACAGCAGAGGUGUC 77.9% 3059 ACAUAAUGAAGGGGGUAUA 77.4%

3009 AAGUGCAGAACUAAAGAGG 77.9% 3060 CCCUGGGUUUUGCUCAAUG 77.4%

3010 AGUGCAGAACUAAAGAGGG 77.9% 3061 AUCCCUGGGUUUUGCUCAA 77.4% 3011 GUGCAGAACUAAAGAGGGA 77.9% 3062 ACAGAAUGAGUUCAACAAG 77.4%

3012 CUUGUUGUUCAGGCUCUUA 77.9% 3063 CAAGACCAAUCAAACUUCC 77.4%

3013 UAUUUCACAGCAGAGGUGU 77.9% 3064 UACAUAAUGAAGGGGGUAU 77.4%

3065 UCCCUGGGUUUUGCUCAAU 77.4% 3116 UGGAGGUAUGUUUCAUGUA 76.2%

3066 AGAGACAGAACAAUGGCCU 77.3% 3117 CACAAAGGCAGACUACACU 76.2%

3067 AGACAGAACAAUGGCCUGG 77.3% 3118 AAUGGAAGAUUUUGUGCGA 76.1%

3068 ACAGAACAAUGGCCUGGAC 77.3% 3119 AGAGAAAGUAGACUUUGAC 76.1%

3069 UGAUUGUCGAACUUGCAGA 77.3% 3120 GAAAGUAGACUUUGACAAC 76.1%

3070 GAGACAGAACAAUGGCCUG 77.3% 3121 AAUGGCCACAAAGGCAGAC 76.0%

3071 AUGAUUGUCGAACUUGCAG 77.3% 3122 GUCACUUUCAAGCUGGAUA 76.0%

3072 GACAGAACAAUGGCCUGGA 77.3% 3123 UUCAACCCGAUGAUUGUCG 75.9%

3073 UAAAAACAUGAAGAAAACG 77.1% 3124 UCAGGCUCUUAGGGACAAC 75.9%

3074 GAGAGACAGAACAAUGGCC 77.1% 3125 CAGGCUCUUAGGGACAACC 75.9%

3075 AAACAAUUGAAGAAAAAUU 77.0% 3126 CCAUUGGGAAAGUCUGUAG 75.9%

3076 AACAAUUGAAGAAAAAUUU 77.0% 3127 AGGCUCUUAGGGACAACCU 75.9%

3077 GAGGAGGAAUUAUUUCACA 76.9% 3128 UCAACCCGAUGAUUGUCGA 75.9% 3078 GAAACAAUUGAAGAAAAAU 76.9% 3129 UCCAUUGGGAAAGUCUGUA 75.8% 3079 GAGAAAGUAGACUUUGACA 76.9% 3130 ACAUAUAUUACCUUGAAAA 75.7% 3080 AGGAGGAAUUAUUUCACAG 76.9% 3131 CAUUGGGAAAGUCUGUAGG 75.7% 3081 GGAGGAAUUAUUUCACAGC 76.9% 3132 UUGGGAAAGUCUGUAGGAC 75.7% 3082 AGAACAAUGGCCUGGACAG 76.9% 3133 AUUGGGAAAGUCUGUAGGA 75.7% 3083 CAGAACAAUGGCCUGGACA 76.9% 3134 CACAUAUAUUACCUUGAAA 75.7% 3084 GAACCUGGGACCUUUGAUC 76.8% 3135 UCCACAUAUAUUACCUUGA 75.7% 3085 AUAUGAUGCGAUCAAGUGC 76.8% 3136 CCACAUAUAUUACCUUGAA 75.7% 3086 CUUUCAAGCUGGAUACAGA 76.8% 3137 UAGAACUUGAUGAUCCAAA 75.6% 3087 UAUGAUGCGAUCAAGUGCA 76.8% 3138 UUUGCAGCAAUAUGCACUC 75.6% 3088 AUGAUGCGAUCAAGUGCAU 76.8% 3139 UUGCAGCAAUAUGCACUCA 75.6% 3089 GAUUCAUCGAAAUUGGAGU 76.8% 3140 AUUUGCAGCAAUAUGCACU 75.6% 3090 AACCUGGGACCUUUGAUCU 76.8% 3141 AAGUCCACAUAUAUUACCU 75.6% 3091 AGAAACAAUUGAAGAAAAA 76.7% 3142 UUGUCGAACUUGCAGAAAA 75.6%

3092 AAGAAACAAUUGAAGAAAA 76.7% 3143 GAAGUCCACAUAUAUUACC 75.6%

3093 GAGGAAUUAUUUCACAGCA 76.7% 3144 GAACUUGAUGAUCCAAAUG 75.6% 3094 CACUUUCAAGCUGGAUACA 76.5% 3145 GUAGAACUUGAUGAUCCAA 75.5% 3095 UCACUUUCAAGCUGGAUAC 76.5% 3146 AACUUGAUGAUCCAAAUGC 75.5%

3096 GAUGAUUGUCGAACUUGCA 76.5% 3147 GGAAAUGGCCACAAAGGCA 75.5%

3097 AGAAAGUAGACUUUGACAA 76.4% 3148 GAUUGUCGAACUUGCAGAA 75.5%

3098 CCACAAAGGCAGACUACAC 76.3% 3149 AUUGUCGAACUUGCAGAAA 75.5%

3099 ACUUUCAAGCUGGAUACAG 76.3% 3150 AGAACUUGAUGAUCCAAAU 75.5%

3100 GAAAUGGCCACAAAGGCAG 76.3% 3151 ACUUGAUGAUCCAAAUGCA 75.4%

3101 AUGGCCACAAAGGCAGACU 76.3% 3152 AACAAUGGCCUGGACAGUA 75.4%

3102 GCCACAAAGGCAGACUACA 76.3% 3153 UGCAGCAAUAUGCACUCAC 75.4%

3103 AAAGUAGACUUUGACAACU 76.3% 3154 GUCCACAUAUAUUACCUUG 75.3%

3104 GAGGUAUGUUUCAUGUAUU 76.3% 3155 CAUCAAUGAACAAGGCGAA 75.3%

3105 ACAAAGGCAGACUACACUC 76.3% 3156 AGUCCACAUAUAUUACCUU 75.3%

3106 UGGCCACAAAGGCAGACUA 76.3% 3157 AGAUUCAUCGAAAUUGGAG 75.3%

3107 UUCAAGCUGGAUACAGAAU 76.3% 3158 AUCAAUGAACAAGGCGAAU 75.2%

3108 AGGUAUGUUUCAUGUAUUC 76.3% 3159 ACCAAUUUAUAUGGAUUCA 75.2%

3109 AAGUAGACUUUGACAACUG 76.2% 3160 CCAAUUUAUAUGGAUUCAU 75.2%

3110 AAAUGGCCACAAAGGCAGA 76.2% 3161 CAAUUUAUAUGGAUUCAUC 75.2%

3111 AAAUGGAAGAUUUUGUGCG 76.2% 3162 UCAAUGAACAAGGCGAAUC 75.2%

3112 UUUCAAGCUGGAUACAGAA 76.2% 3163 ACCCGAUGAUUGUCGAACU 75.1%

3113 GGAGGUAUGUUUCAUGUAU 76.2% 3164 AACCCGAUGAUUGUCGAAC 75.1%

3114 GGUAUGUUUCAUGUAUUCA 76.2% 3165 AAUGAACAAGGCGAAUCAA 75.1%

3115 GGCCACAAAGGCAGACUAC 76.2% 3166 CAACCCGAUGAUUGUCGAA 75.1%

3167 AUGAACAAGGCGAAUCAAU 75.1% 3218 UGCAACACUACUGGAGCUG 74.1%

3168 AAAAUUGAACCUUUUCUGA 74.9% 3219 AUAUAUUACCUUGAAAAGG 74.1%

3169 CCCGAUGAUUGUCGAACUU 74.9% 3220 CCAGAUUUGUAUGAUUACA 74.1%

3170 UGAACAAGGCGAAUCAAUA 74.9% 3221 UAUUACCUUGAAAAGGCCA 74.1%

3171 CAAUGAACAAGGCGAAUCA 74.9% 3222 UUGGUGAAAACAUGGCACC 74.1%

3172 UUUGAGAAUAAAUCAGAAG 74.8% 3223 GUGAAAACAUGGCACCAGA 74.1%

3173 UUGAAACAAACAAAUUUGC 74.7% 3224 AUUUGUAUGAUUACAAGGA 74.1%

3174 AUUGAAACAAACAAAUUUG 74.7% 3225 GAAAAAUUUGAAAUCUCAG 74.1%

3175 AAAACCAAUUUAUAUGGAU 74.7% 3226 AAAAAUUUGAAAUCUCAGG 74.1%

3176 UUGAGAAUAAAUCAGAAGC 74.7% 3227 GGUGAAAACAUGGCACCAG 74.1%

3177 AAAUUGAAACAAACAAAUU 74.7% 3228 CAUAUAUUACCUUGAAAAG 74.1%

3178 UGGGAAAGUCUGUAGGACU 74.7% 3229 UAUAUUACCUUGAAAAGGC 74.1%

3179 AAACCAAUUUAUAUGGAUU 74.7% 3230 CAGAUUUGUAUGAUUACAA 74.1%

3180 AAAAUUGAAACAAACAAAU 74.7% 3231 GCUUCAACCCGAUGAUUGU 74.0%

3181 AACCAAUUUAUAUGGAUUC 74.6% 3232 ACAGUAUCUGCAACACUAC 74.0%

3182 UGAAACAAACAAAUϋUGCA 74.6% 3233 AACUCCUUCCUGACACAUG 74.0%

3183 AAUUGAAACAAACAAAUUU 74.6% 3234 AACAGUAUCUGCAACACUA 74.0%

3184 UGGUUCAACUCCUUCCUGA 74.6% 3235 AGAUUUGUAUGAUUACAAG 74.0%

3185 AGCUCGAUGAAAUUGGAGA 74.5% 3236 UAUAUAGUCAAACCACACG 74.0%

3186 GAGCUCGAUGAAAUUGGAG 74.5% 3237 AUAUAGUCAAACCACACGA 74.0%

3187 AUCGAGAGCAUGAUUGAAG 74.5% 3238 AGCAUGGCCCAUUGGGGAG 74.0%

3188 UGGAUAGAGCUCGAUGAAA 74.5% 3239 GAAGAAAAAUUUGAAAUCU 74.0%

3189 GGAUAGAGCUCGAUGAAAU 74.5% 3240 UGGUGAAAACAUGGCACCA 74.0%

3190 AAUCAGAAGCAUGGCCCAU 74.4% 3241 CUGCAGAGACAUAAGCGAU 74.0%

3191 AAAUCAGAAGCAUGGCCCA 74.4% 3242 UGCUUCAACCCGAUGAUUG 74.0%

3192 UCAAAUGUCCAAAGAAGUG 74.4% 3243 AAGAAAAAUUUGAAAUCUC 74.0%

3193 UGCAGAGACAUAAGCGAUU 74.3% 3244 AGAAAAAUUUGAAAUCUCA 74.0%

3194 UUGUGAGCAUGGAGUUUUC 74.3% 3245 ACUCCUUCCUGACACAUGC 73.9%

3195 AUAGAGCUCGAUGAAAUUG 74.3% 3246 UUCAACUCCUUCCUGACAC 73.9%

3196 UCGAGAGCAUGAUUGAAGC 74.3% 3247 GUUCAACUCCUUCCUGACA 73.9%

3197 AAAGGCAGACUACACUCUC 74.3% 3248 GCAUGGCCCAUUGGGGAGU 73.9%

3198 AAAAACCAAUUUAUAUGGA 74.3% 3249 UGAAAACAUGGCACCAGAG 73.9%

3199 GCAGAGACAUAAGCGAUUU 74.3% 3250 GUAAACAGUAUCUGCAACA 73.9%

3200 UAGAGCUCGAUGAAAUUGG 74.3% 3251 AAAACAUGGCACCAGAGAA 73.9%

3201 UUUGUGAGCAUGGAGUUUU 74.3% 3252 CAACUCCUUCCUGACACAU 73.9%

3202 AGAGCUCGAUGAAAUUGGA 74.3% 3253 UCAACUCCUUCCUGACACA 73.9%

3203 GAUAGAGCUCGAUGAAAUU 74.3% 3254 GGUUCAACUCCUUCCUGAC 73.9%

3204 CUUGGUGAAAACAUGGCAC 74.2% 3255 GAAAACAUGGCACCAGAGA 73.9%

3205 CAGAGACAUAAGCGAUUUG 74.2% 3256 UUGGUUCAACUCCUUCCUG 73.8%

3206 CAGAAGCAUGGCCCAUUGG 74.2% 3257 UAAACAGUAUCUGCAACAC 73.8%

3207 AGGCAGACUACACUCUCGA 74.2% 3258 AAACAGUAUCUGCAACACU 73.8%

3208 UCAGAAGCAUGGCCCAUUG 74.2% 3259 GUGCGACAAUGCUUCAACC 73.8%

3209 GAAGCAUGGCCCAUUGGGG 74.2% 3260 CUUCAACCCGAUGAUUGUC 73.8%

3210 AAGGCAGACUACACUCUCG 74.2% 3261 CUCGACGAGGAAAGCAGGG 73.8%

3211 AAGCAUGGCCCAUUGGGGA 74.2% 3262 AAUUUGAAAUCUCAGGAAC 73.8%

3212 AAAAUUUGAAAUCUCAGGA 74.1% 3263 UCGACGAGGAAAGCAGGGC 73.8%

3213 AUAUUACCUUGAAAAGGCC 74.1% 3264 UGCGACAAUGCUUCAACCC 73.8%

3214 AUCAGAAGCAUGGCCCAUU 74.1% 3265 AAAUUUGAAAUCUCAGGAA 73.8%

3215 CUGGAUAGAGCUCGAUGAA 74.1% 3266 AAACAUGGCACCAGAGAAA 73.8%

3216 GAUUUGUAUGAUUACAAGG 74.1% 3267 CUCCUUCCUGACACAUGCA 73.8%

3217 AGAAGCAUGGCCCAUUGGG 74.1% 3268 CUCUCGACGAGGAAAGCAG 73.7%

3269 GCAGACUACACUCUCGACG 73.7% 3320 AUUACAAGGAGAACAGAUU 73.1%

3270 ACUCUCGACGAGGAAAGCA 73.7% 3321 GUUCAAUAGCCUGUAUGCA 73.1%

3271 UGUGCGACAAUGCUUCAAC 73.7% 3322 AAUAAAUCAGAAGCAUGGC 73.1%

3272 CAGACUACACUCUCGACGA 73.7% 3323 GAUUACAAGGAGAACAGAU 73.1%

3273 AACUGCAGAGACAUAAGCG 73.7% 3324 UAAAUCAGAAGCAUGGCCC 73.0%

3274 AUAGUCAAACCACACGAAA 73.7% 3325 AUUUAUAUGGAUUCAUCAU 73.0%

3275 ACUGCAGAGACAUAAGCGA 73.7% 3326 UAUAUGGAUUCAUCAUAAA 73.0%

3276 UAUAGUCAAACCACACGAA 73.6% 3327 GCGACAAUGCUUCAACCCG 73.0%

3277 CACUCUCGACGAGGAAAGC 73.6% 3328 AAUUUAUAUGGAUUCAUCA 73.0%

3278 GGCAGACUACACUCUCGAC 73.6% 3329 UUUAUAUGGAUUCAUCAUA 73.0%

3279 UACACUCUCGACGAGGAAA 73.6% 3330 UUAUAUGGAUUCAUCAUAA 73.0%

3280 ACACUCUCGACGAGGAAAG 73.6% 3331 CGACAAUGCUUCAACCCGA 73.0%

3281 ACAGAUUCAUCGAAAUUGG 73.6% 3332 UUACAAGGAGAACAGAUUC 73.0%

3282 UGGGCUCUUGGUGAAAACA 73.5% 3333 AUGAUUACAAGGAGAACAG 72.9%

3283 UGAAAUCUCAGGAACUAUG 73.5% 3334 UAUGAUUACAAGGAGAACA 72.9%

3284 UAGUCAAACCACACGAAAA 73.5% 3335 AGAAUAAAUCAGAAGCAUG 72.9%

3285 UCUUGGUGAAAACAUGGCA 73.5% 3336 CGAGAGCAUGAUUGAAGCC 72.9%

3286 AUUACCUUGAAAAGGCCAA 73.5% 3337 GAGAAUAAAUCAGAAGCAU 72.9%

3287 GAAAUCUCAGGAACUAUGC 73.5% 3338 GAAUAAAUCAGAAGCAUGG 72.9%

3288 UCUCGACGAGGAAAGCAGG 73.5% 3339 CUAAAGUGGGCUCUUGGUG 72.9% 3289 ACUGAGUACAUAAUGAAGG 73.5% 3340 UGAUUACAAGGAGAACAGA 72.9%

3290 AACAGAUUCAUCGAAAUUG 73.5% 3341 GCAACACUACUGGAGCUGA 72.8%

3291 UUUGAAAUCUCAGGAACUA 73.5% 3342 UGAGAAUAAAUCAGAAGCA 72.8%

3292 CUGAGUACAUAAUGAAGGG 73.5% 3343 AAAUUGAGCAUUGAAGACC 72.8%

3293 UUGAAAUCUCAGGAACUAU 73.5% 3344 AAAACAUUCUUUGGAUGGA 72.8%

3294 AUUUGAAAUCUCAGGAACU 73.5% 3345 CAACUAAAGUGGGCUCUUG 72.8%

3295 AUAAAUCAGAAGCAUGGCC 73.3% 3346 ACUAAAGUGGGCUCUUGGU 72.8%

3296 GGCUCUUGGUGAAAACAUG 73.3% 3347 GUAUGAUUACAAGGAGAAC 72.8%

3297 CUACACUCUCGACGAGGAA 73.3% 3348 AACUAAAGUGGGCUCUUGG 72.8%

3298 UUACCUUGAAAAGGCCAAU 73.3% 3349 AAUUGAGCAUUGAAGACCC 72.8%

3299 GGGCUCUUGGUGAAAACAU 73.3% 3350 AAACAUUCUUUGGAUGGAA 72.8% 3300 GACUACACUCUCGACGAGG 73.3% 3351 AUUGAGUACAUUGCAAGCA 72.7% 3301 ACUACACUCUCGACGAGGA 73.3% 3352 GUGGGCUCUUGGUGAAAAC 72.7% 3302 CUCUUGGUGAAAACAUGGC 73.3% 3353 UCAACUAAAGUGGGCUCUU 72.7% 3303 GCUCUUGGUGAAAACAUGG 73.3% 3354 AAUUGAGUACAUUGCAAGC 72.7% 3304 UGGUAGAACUUGAUGAUCC 73.3% 3355 GUCAACUAAAGUGGGCUCU 72.7% 3305 AUGAAGAAAACGAGUCAAC 73.2% 3356 AGUCAACUAAAGUGGGCUC 72.7% 3306 AGACUACACUCUCGACGAG 73.2% 3357 CCAAUUGAGUACAUUGCAA 72.7%

3307 ACAAGGAGAACAGAUUCAU 73.2% 3358 CAAUUGAGUACAUUGCAAG 72.7%

3308 CAAUGCUUCAACCCGAUGA 73.2% 3359 UGUCAUGGAAGCAAGUAUU 72.6%

3309 GACAAUGCUUCAACCCGAU 73.2% 3360 AGUGGGCUCUUGGUGAAAA 72.6%

3310 ACAAUGCUUCAACCCGAUG 73.2% 3361 AAGUGGGCUCUUGGUGAAA 72.6%

3311 UGUUCAAUAGCCUGUAUGC 73.2% 3362 AGAACUAAAGAGGGAAGGC 72.6%

3312 AGUCAAACCACACGAAAAG 73.2% 3363 GAACUAAAGAGGGAAGGCG 72.6%

3313 UACAAGGAGAACAGAUUCA 73.2% 3364 CUGUCAUGGAAGCAAGUAU 72.5%

3314 GUGUUCAAUAGCCUGUAUG 73.2% 3365 UCGAUGAAAUUGGAGAGGA 72.5%

3315 AAUGCUUCAACCCGAUGAU 73.2% 3366 GCCUGCGAGCUAACUGAUU 72.5%

3316 GGUAGAACUUGAUGAUCCA 73.2% 3367 CUCGAUGAAAUUGGAGAGG 72.5%

3317 AUGCUUCAACCCGAUGAUU 73.2% 3368 UUGAGUACAUUGCAAGCAU 72.5%

3318 UGAAGAAAACGAGUCAACU 73.2% 3369 CCUGCGAGCUAACUGAUUC 72.5%

3319 UAAAGUGGGCUCUUGGUGA 73.1% 3370 CUGCGAGCUAACUGAUUCA 72.5%

3371 UUGAGCAUUGAAGACCCAA 72.4% 3422 GACAACUGCAGAGACAUAA 71.9%

3372 AGGAGAACAGAUUCAUCGA 72.4% 3423 UUCCUGACACAUGCAUUAA 71.9%

3373 CAAGGAGAACAGAUUCAUC 72.4% 3424 AGAACAGAUUCAUCGAAAU 71.9%

3374 CAGAUUCAUCGAAAUUGGA 72.4% 3425 ACUAAAGAGGGAAGGCGAA 71.9%

3375 AUUUCAAGGCCUAUGUUCU 72.4% 3426 GAAUGAGUUCAACAAGGCC 71.9%

3376 UUUCAAGGCCUAUGUUCUU 72.4% 3427 GACUUUGACAACUGCAGAG 71.7%

3377 AAGGAGAACAGAUUCAUCG 72.4% 3428 CUUUGACAACUGCAGAGAC 71.7%

3378 CCCCAAUUGAGUACAUUGC 72.3% 3429 AGGAAUGACACAGAUGUGG 71.7%

3379 UGAGUACAUUGCAAGCAUG 72.3% 3430 GAAUGACACAGAUGUGGUA 71.7% 3380 UGAGCAUUGAAGACCCAAG 72.3% 3431 CAACUGCAGAGACAUAAGC 71.7%

3381 ACUUUGUGAGCAUGGAGUU 72.3% 3432 GGAAUGACACAGAUGUGGU 71.7%

3382 GAGUACAUUGCAAGCAUGA 72.3% 3433 GAAGAAAACGAGUCAACUA 71.7%

3383 AUGACACAGAUGUGGUAAA 72.3% 3434 AAGAAAACGAGUCAACUAA 71.7%

3384 AAUGACACAGAUGUGGUAA 72.3% 3435 ACUUUGACAACUGCAGAGA 71.7%

3385 UUCAACAAGGCCUGCGAGC 72.3% 3436 AACGAGUCAACUAAAGUGG 71.6%

3386 GUCCAAAGAAGUGAAUGCC 72.3% 3437 UCCUGACACAUGCAUUAAA 71.6%

3387 UUCAAGGCCUAUGUUCUUG 72.3% 3438 CGAGGAAAGCAGGGCUAGG 71.6% 3388 UCAACAAGGCCUGCGAGCU 72.3% 3439 GGACAAACGGAACAUCAAA 71.6%

3389 AAACUUUGUGAGCAUGGAG 72.3% 3440 AGGACAAACGGAACAUCAA 71.6%

3390 AACUUUGUGAGCAUGGAGU 72.3% 3441 ACGAGUCAACUAAAGUGGG 71.6%

3391 GCCCCAAUUGAGUACAUUG 72.3% 3442 ACGAGGAAAGCAGGGCUAG 71.6% 3392 AGUACAUUGCAAGCAUGAG 72.3% 3443 ACACAUGCAUUAAAAUAGU 71.6% 3393 CCCAAUUGAGUACAUUGCA 72.3% 3444 CCUGACACAUGCAUUAAAA 71.6% 3394 UUUGUAUGAUUACAAGGAG 72.3% 3445 UGACACAUGCAUUAAAAUA 71.5% 3395 GGCCUGCGAGCUAACUGAU 72.2% 3446 GACAAACGGAACAUCAAAG 71.5%

3396 AAGGCCUGCGAGCUAACUG 72.2% 3447 UAGUAAACAGUAUCUGCAA 71.5%

3397 AUUGAGCAUUGAAGACCCA 72.2% 3448 GUAGUAAACAGUAUCUGCA 71.5%

3398 GAGUUCAACAAGGCCUGCG 72.2% 3449 AACUAAAGAGGGAAGGCGA 71.5%

3399 GAGCAUUGAAGACCCAAGU 72.2% 3450 AGAAAACGAGUCAACUAAA 71.5%

3400 UUGACAACUGCAGAGACAU 72.2% 3451 GACGAGGAAAGCAGGGCUA 71.5%

3401 GUUCAACAAGGCCUGCGAG 72.2% 3452 CACAUGCAUUAAAAUAGUU 71.5%

3402 AGUUCAACAAGGCCUGCGA 72.2% 3453 AAAACGAGUCAACUAAAGU 71.4%

3403 UUUGACAACUGCAGAGACA 72.2% 3454 UAAGGAAUGACACAGAUGU 71.4%

3404 GCUCGAUGAAAUUGGAGAG 72.2% 3455 CCACUGAGUACAUAAUGAA 71.4%

3405 AGGCCUGCGAGCUAACUGA 72.2% 3456 CGAGUCAACUAAAGUGGGC 71.4%

3406 UGAGUUCAACAAGGCCUGC 72.2% 3457 AAACGAGUCAACUAAAGUG 71.4%

3407 UCCUUCCUGACACAUGCAU 72.1% 3458 CUGACACAUGCAUUAAAAU 71.4%

3408 AUGAGUUCAACAAGGCCUG 72.1% 3459 GCCACUGAGUACAUAAUGA 71.4%

3409 AAUGAGUUCAACAAGGCCU 72.1% 3460 GUACAUUGCAAGCAUGAGG 71.4%

3410 CCUUCCUGACACAUGCAUϋ 72.1% 3461 CUUUGUGAGCAUGGAGUUU 71.3%

3411 AACAAGGCCUGCGAGCUAA 72.0% 3462 UGAGGACAAACGGAACAUC 71.3%

3412 CUUCCUGACACAUGCAUUA 72.0% 3463 GAGGACAAACGGAACAUCA 71.3%

3413 CAAGGCCUGCGAGCUAACU 72.0% 3464 AAGGAAUGACACAGAUGUG 71.3%

3414 GGAGAACAGAUUCAUCGAA 72.0% 3465 UUAAGGAAUGACACAGAUG 71.3%

3415 CAACAAGGCCUGCGAGCUA 72.0% 3466 AGUAAACAGUAUCUGCAAC 71.3%

3416 ACAAGGCCUGCGAGCUAAC 72.0% 3467 CAUGAAGAAAACGAGUCAA 71.2%

3417 ACAACUGCAGAGACAUAAG 71.9% 3468 CACUGAGUACAUAAUGAAG 71.2%

3418 GAGAACAGAUUCAUCGAAA 71.9% 3469 CAGUAGUAAACAGUAUCUG 71.2%

3419 GAACAGAUUCAUCGAAAUU 71.9% 3470 GACAGUAGUAAACAGUAUC 71.2%

3420 UGACAACUGCAGAGACAUA 71.9% 3471 AGUAGUAAACAGUAUCUGC 71.2%

3421 CUAAAGAGGGAAGGCGAAA 71.9% 3472 CAAUGGACGAUUUUCAACU 71.2%

3473 ACAGUAGUAAACAGUAUCU 71.2% 3505 AUUCAUCGAAAUUGGAGUG 70.7%

3474 GCAAUGGACGAUUUUCAAC 71.2% 3506 UCAUCGAAAUUGGAGUGAC 70.7%

3475 GACACAUGCAUUAAAAUAG 71.1% 3507 CAAGUCACGAAGGAGAAGG 70.6%

3476 GUCAUGGAAGCAAGUAUUG 71.1% 3508 UCAUCAUAAAAGGAAGAUC 70.6%

3477 AAAGUGGGCUCUUGGUGAA 71.1% 3509 UGGACGAUUUUCAACUAAU 70.6%

3478 UAUGUGAGGACAAACGGAA 71.0% 3510 UCCAAAUUCCUCCUGAUGG 70.6%

3479 GUGAGGACAAACGGAACAU 71.0% 3511 AGUCACGAAGGAGAAGGGA 70.6%

3480 UCCUCCUGAUGGAUGCUUU 71.0% 3512 CAAAUUCCUCCUGAUGGAU 70.6%

3481 CGACGAGGAAAGCAGGGCU 71.0% 3513 CCAAAUUCCUCCUGAUGGA 70.6%

3482 UUCCUCCUGAUGGAUGCUU 71.0% 3514 AAUGGACGAUUUUCAACUA 70.6%

3483 UGUGAGGACAAACGGAACA 71.0% 3515 CCAAGUCACGAAGGAGAAG 70.6%

3484 AUGUGAGGACAAACGGAAC 71.0% 3516 AUGGACGAUUUUCAACUAA 70.6%

3485 UUGUAUGUGAGGACAAACG 70.9% 3517 AAUUCCUCCUGAUGGAUGC 70.6%

3486 UGUAUGUGAGGACAAACGG 70.9% 3518 UUCAUCAUAAAAGGAAGAU 70.6%

3487 GUAUGUGAGGACAAACGGA 70.9% 3519 GUCACGAAGGAGAAGGGAU 70.6%

3488 AGUAUCUGCAACACUACUG 70.8% 3520 AAAUUCCUCCUGAUGGAUG 70.6%

3489 GAGCCACUGAGUACAUAAU 70.8% 3521 CAUGGAAGCAAGUAUUGUC 70.6%

3490 AGCCACUGAGUACAUAAUG 70.8% 3522 AUGGAAGCAAGUAUUGUCA 70.6%

3491 AGAGCCACUGAGUACAUAA 70.8% 3523 GGAAAGCAGGGCUAGGAUU 70.6%

3492 CAGUAUCUGCAACACUACU 70.8% 3524 AAAUCUCAGGAACUAUGCG 70.5%

3493 GUAUCUGCAACACUACUGG 70.8% 3525 CCCAAGUCACGAAGGAGAA 70.5%

3494 GAAAACGAGUCAACUAAAG 70.7% 3526 GAUUGAAGCCGAGUCCUCG 70.4%

3495 UUCAUCGAAAUUGGAGUGA 70.7% 3527 CGAUGAAAUUGGAGAGGAC 70.1%

3496 AGACUUUGACAACUGCAGA 70.7% 3528 AUUCCUCCUGAUGGAUGCU 70.1%

3497 AGUAGACUUUGACAACUGC 70.7% 3529 GUCCAAAUUCCUCCUGAUG 70.1%

3498 GGAAGCAAGUAUUGUCAGA 70.7% 3530 GGUCCAAAUUCCUCCUGAU 70.1%

3499 UAGACUUUGACAACUGCAG 70.7% 3531 UGAGCAUGGAGUUUUCUCU 70.0%

3500 GUAGACUUUGACAACUGCA 70.7% 3532 UCAUAAAAGGAAGAUCUCA 70.0% 3501 UGGAAGCAAGUAUUGUCAG 70.7% 3533 AAGAAGUGCCAUAGGCCAA 70.0%

3502 CUUGUAUGUGAGGACAAAC 70.7% 3534 CGGUCCAAAUUCCUCCUGA 70.0%

3503 CAUCGAAAUUGGAGUGACA 70.7% 3535 AUCAUAAAAGGAAGAUCUC 70.0% 3504 UCAUGGAAGCAAGUAUUGU 70.7% 3536 UGUGAGCAUGGAGUUUUCU 70.0%

Table 2-4: Conserved 19-mer sequences that are present in at least 70% of the Influenza A segment 2 (HA) sequences listed in Table 1-4.

Seq

BD Sequence Percent

3537 GCAUCACUCCAAAUGGAAG 70.4%

3538 UGCAUCACUCCAAAUGGAA 70.4%

3539 AUGCAUCACUCCAAAUGGA 70.4%

3540 AAUGCAUCACUCCAAAUGG 70.3%

3541 GAAUGCAUCACUCCAAAUG 70.3%

3542 UCACUCCAAAUGGAAGCAU 70.3%

3543 UGUUACCCUUAUGAUGUGC 70.2%

3544 CAUCACUCCAAAUGGAAGC 70.2%

3545 AUCACUCCAAAUGGAAGCA 70.2%

Table 2-5: Conserved 19-mer sequences that are present in at least 70% of the Influenza A segment 2 (NP) sequences listed in Table 1-5.

Seq 3596 GAAAUGCUGAGAUCGAAGA 91.2%

ED Sequence Percent 3597 GAACCCAGGAAAUGCUGAG 91.2%

3546 UCUUAUUUCUUCGGAGACA 98.0% 3598 CCAGGAAAUGCUGAGAUCG 91.1%

3547 AUUUCUUCGGAGACAAUGC 97.9% 3599 AGGAAAUGCUGAGAUCGAA 91.1%

3548 UAUUUCUUCGGAGACAAUG 97.9% 3600 CAGGAAAUGCUGAGAUCGA 91.1%

3549 UUAUUUCUUCGGAGACAAU 97.9% 3601 GGAAAUGCUGAGAUCGAAG 91.1%

3550 CUUAUUUCUUCGGAGACAA 97.9% 3602 UGGAUCAAGUGAGAGAAAG 90.7%

3551 UUUCUUCGGAGACAAUGCA 97.8% 3603 UAUGAGAGAAUGUGCAACA 90.4%

3552 AUGAAGGAUCUUAUUUCUU 97.6% 3604 AUGAGAGAAUGUGCAACAU 90.4%

3553 AAUGAAGGAUCUUAUUUCU 97.6% 3605 GAAAAUUUCAAACAGCUGC 90.1%

3554 GAAGGAUCUUAUUUCUUCG 97.5% 3606 GGAAAAUUUCAAACAGCUG 90.1%

3555 AAGGAUCUUAUUUCUUCGG 97.5% 3607 AAAGGAAAAUUUCAAACAG 90.0%

3556 UGAAGGAUCUUAUUUCUUC 97.4% 3608 AGGAAAAUUUCAAACAGCU 90.0%

3557 UAAUGAAGGAUCUUAUUUC 97.3% 3609 AAGGAAAAUUUCAAACAGC 90.0%

3558 UUCUUCGGAGACAAUGCAG 96.5% 3610 CCCAGGAAAUGCUGAGAUC 90.0%

3559 UCUUCGGAGACAAUGCAGA 96.4% 3611 AAAAUUUCAAACAGCUGCA 89.7%

3560 GAUCUUAUUUCUUCGGAGA 96.0% 3612 CUUAUGAACAGAUGGAAAC 89.5%

3561 GGAUCUUAUUUCUUCGGAG 96.0% 3613 UCUUAUGAACAGAUGGAAA 89.5%

3562 AUCUUAUUUCUUCGGAGAC 96.0% 3614 UAUGAACAGAUGGAAACUG 89.5%

3563 AGGAUCUUAUUUCUUCGGA 95.8% 3615 AGGGAACUCGUCCUUUAUG 89.5%

3564 AGUAAUGAAGGAUCUUAUU 95.4% 3616 GGGAACUCGUCCUUUAUGA 89.5%

3565 GUAAUGAAGGAUCUUAUUU 95.4% 3617 UUAUGAACAGAUGGAAACU 89.1%

3566 AUGAGUAAUGAAGGAUCUU 95.3% 3618 AUUCUGCUGCAUUUGAAGA 89.0%

3567 UGAGUAAUGAAGGAUCUUA 95.3% 3619 UGAGGGAACUCGUCCUUUA 88.8%

3568 GAGUAAUGAAGGAUCUUAU 95.3% 3620 GAGGGAACUCGUCCUUUAU 88.7%

3569 UACAAAUUGCUUCAAAUGA 93.7% 3621 UUCUGCUGCAUUUGAAGAU 88.6%

3570 AAUUUCAAACAGCUGCACA 93.5% 3622 GAGGAAACACUAAUCAACA 88.2%

3571 AAAUUUCAAACAGCUGCAC 93.5% 3623 GGAGGAAACACUAAUCAAC 88.2%

3572 GUACAAAUUGCUUCAAAUG 93.4% 3624 AGUGGAGGAAACACUAAUC 88.2%

3573 AUGGUGCUCUCUGCUUUUG 93.3% 3625 GUGGAGGAAACACUAAUCA 88.1%

3574 UGGUGCUCUCUGCUUUUGA 93.3% 3626 UGGAGGAAACACUAAUCAA 88.1%

3575 CAAGGCACCAAACGGUCUU 93.2% 3627 GAGGGUCAGUUGCUCACAA 87.8%

3576 AAGGCACCAAACGGUCUUA 93.2% 3628 AGAGGGUCAGUUGCUCACA 87.8%

3577 AGAGAAUGUGCAACAUUCU 93.0% 3629 CAGCUGGUGUGGAUGGCAU 87.6%

3578 GAGAGAAUGUGCAACAUUC 93.0% 3630 AGCUGGUGUGGAUGGCAUG 87.6%

3579 AUUUCAAACAGCUGCACAA 92.7% 3631 GCUGGUGUGGAUGGCAUGC 87.5%

3580 UUUCAAACAGCUGCACAAA 92.7% 3632 ACUUCGAAAAAGAGGGAUA 87.4%

3581 GGAACCCAGGAAAUGCUGA 92.2% 3633 CAACGAACCCGAUCGUGCC 87.2%

3582 CGAACCCGAUCGUGCCCUC 92.1% 3634 GCAACGAACCCGAUCGUGC 87.2%

3583 CGGAACCCAGGAAAUGCUG 92.1% 3635 AUGAGGGCAGAAAUCAUAA 87.2% 3584 AAUGCUGAGAUCGAAGAUC 91.7% 3636 UCAGCUGGUGUGGAUGGCA 87.1%

3585 AUGCUGAGAUCGAAGAUCU 91.7% 3637 GUCAGCUGGUGUGGAUGGC 87.1%

3586 GAUUCUACAUCCAAAUGUG 91.6% 3638 GGCAACGAACCCGAUCGUG 87.1%

3587 GCUGAGAUCGAAGAUCUCA 91.5% 3639 AGUCAGCUGGUGUGGAUGG 87.1%

3588 UCAUGGCAGCAUUCACUGG 91.5% 3640 UUCAAACAGCUGCACAAAG 87.0%

3589 AAUGGUGCUCUCUGCUUUU 91.5% 3641 UUCUACAUCCAAAUGUGCA 86.9%

3590 ACCCAGGAAAUGCUGAGAU 91.4% 3642 UCUACAUCCAAAUGUGCAC 86.9%

3591 UCAGACAUGAGGGCAGAAA 91.4% 3643 CAUGAGGGCAGAAAUCAUA 86.8%

3592 UGCUGAGAUCGAAGAUCUC 91.4% 3644 ACAUGAGGGCAGAAAUCAU 86.8% 3593 CAGACAUGAGGGCAGAAAU 91.4% 3645 GACAUGAGGGCAGAAAUCA 86.8%

3594 AACCCAGGAAAUGCUGAGA 91.4% 3646 GAGUACAAAUUGCUUCAAA 86.7%

3595 AAAUGCUGAGAUCGAAGAU 91.2% 3647 GCACACAAGAGUCAGCUGG 86.7%

3648 GAGAAUCCAGCACACAAGA 86.7% 3702 CGAGAAUCCAGCACACAAG 85.1%

3649 CACACAAGAGUCAGCUGGU 86.7% 3703 AACACUAAUCAACAGAGGG 85.1%

3650 GGAGUACAAAUUGCUUCAA 86.7% 3704 AGGAAACACUAAUCAACAG 85.1%

3651 AGAAUCCAGCACACAAGAG 86.7% 3705 GAAACACUAAUCAACAGAG 85.0%

3652 CAGCACACAAGAGUCAGCU 86.6% 3706 AGGAGUACAAAUUGCUUCA 85.0%

3653 AGUACAAAUUGCUUCAAAU 86.6% 3707 GAGGAGUACAAAUUGCUUC 85.0%

3654 CCAGCACACAAGAGUCAGC 86.6% 3708 AGAGGAGUACAAAUUGCUU 85.0%

3655 AUUCUACAUCCAAAUGUGC 86.6% 3709 AAACACUAAUCAACAGAGG 85.0%

3656 ACAAGAGUCAGCUGGUGUG 86.5% 3710 UAUUGAGAGGGUCAGUUGC 84.9%

3657 AGCACACAAGAGUCAGCUG 86.5% 3711 AACCCGAUCGUGCCCUCUU 84.6%

3658 UGCUUAUGAGAGAAUGUGC 86.5% 3712 CCCGAUCGUGCCCUCUUUU 84.6%

3659 CAAGAGUCAGCUGGUGUGG 86.5% 3713 GAACCCGAUCGUGCCCUCU 84.6%

3660 AGAGUCAGCUGGUGUGGAU 86.5% 3714 ACCCGAUCGUGCCCUCUUU 84.6%

3661 GAGUCAGCUGGUGUGGAUG 86.5% 3715 CGAUCGUGCCCUCUUUUGA 84.5%

3662 CACAAGAGUCAGCUGGUGU 86.5% 3716 CCGAUCGUGCCCUCUUUUG 84.5%

3663 GCUUAUGAGAGAAUGUGCA 86.5% 3717 CUAGAGGAGUACAAAUUGC 84.2%

3664 CUUAUGAGAGAAUGUGCAA 86.5% 3718 ACUAGAGGAGUACAAAUUG 84.1% 3665 AAGAGUCAGCUGGUGUGGA 86.5% 3719 UAGAGGAGUACAAAUUGCU 84.1% 3666 CCAAACGGUCUUAUGAACA 86.4% 3720 AGAUGGAAACUGAUGGGGA 84.0% 3667 GGCACCAAACGGUCUUAUG 86.4% 3721 CAGAUGGAAACUGAUGGGG 84.0% 3668 GCACCAAACGGUCUUAUGA 86.4% 3722 AGUGGGUACGACUUCGAAA 84.0%

3669 ACCAAACGGUCUUAUGAAC 86.4% 3723 ACAGAUGGAAACUGAUGGG 84.0%

3670 AGGCACCAAACGGUCUUAU 86.4% 3724 CUAAUCAACAGAGGGCCUC 84.0% 3671 UGAGAGAAUGUGCAACAUU 86.4% 3725 ACUAAUCAACAGAGGGCCU 84.0% 3672 AAUCCAGCACACAAGAGUC 86.3% 3726 CAGUGGGUACGACUUCGAA 83.9% 3673 ACACAAGAGUCAGCUGGUG 86.3% 3727 CCAGUGGGUACGACUUCGA 83.9% 3674 AUCCAGCACACAAGAGUCA 86.3% 3728 UGAUGGAAGGUGCAAAACC 83.8% 3675 CACCAAACGGUCUUAUGAA 86.3% 3729 AAGAAGAAAUAAGGCGAAU 83.8%

3676 AGACAUGAGGGCAGAAAUC 86.2% 3730 GUGGGUACGACUUCGAAAA 83.8%

3677 GACUUCGAAAAAGAGGGAU 86.2% 3731 AAAGAAGAAAUAAGGCGAA 83.8% 3678 GAAUCCAGCACACAAGAGU 86.2% 3732 GAUGGAAGGUGCAAAACCA 83.7%

3679 CAGAGGACAAGAGCUCUUG 86.1% 3733 UGUGCUCUCUGAUGCAGGG 83.7%

3680 CGGUCUUAUGAACAGAUGG 86.1% 3734 AUAUUGAGAGGGUCAGUUG 83.7%

3681 AGAGGACAAGAGCUCUUGU 86.1% 3735 AUGUGCUCUCUGAUGCAGG 83.7%

3682 ACGGUCUUAUGAACAGAUG 86.1% 3736 AUGGAAGGUGCAAAACCAG 83.7%

3683 UACGACUUCGAAAAAGAGG 86.1% 3737 UGGAAGGUGCAAAACCAGA 83.7%

3684 GGUCUUAUGAACAGAUGGA 86.1% 3738 CACUAAUCAACAGAGGGCC 83.6%

3685 ACGACUUCGAAAAAGAGGG 86.1% 3739 AUGAUGGAAGGUGCAAAAC 83.6%

3686 AACGGUCUUAUGAACAGAU 86.0% 3740 ACCAGAGGACAAGAGCUCU 83.5%

3687 AAACGGUCUUAUGAACAGA 86.0% 3741 UACCAGAGGACAAGAGCUC 83.5% 3688 CAAACGGUCUUAUGAACAG 86.0% 3742 AGAUGAUUGAUGGAAUUGG 83.2%

3689 UUAUGAGAGAAUGUGCAAC 86.0% 3743 CCAGAGGACAAGAGCUCUU 83.2%

3690 CGACUUCGAAAAAGAGGGA 85.8% 3744 AAGAUGAUUGAUGGAAUUG 83.2%

3691 UCCAGCACACAAGAGUCAG 85.7% 3745 GAAAUAAGGCGAAUCUGGC 83.0%

3692 AUGAACAGAUGGAAACUGA 85.7% 3746 AAAUAAGGCGAAUCUGGCG 83.0%

3693 UGAACAGAUGGAAACUGAU 85.6% 3747 GAGAGGGUCAGUUGCUCAC 83.0%

3694 AACAGAUGGAAACUGAUGG 85.6% 3748 UGAGAGGGUCAGUUGCUCA 83.0%

3695 GUCUUAUGAACAGAUGGAA 85.6% 3749 CUGAACUUAAACUCAGUGA 82.9%

3696 CUGAGAUCGAAGAUCUCAU 85.6% 3750 UGAACUUAAACUCAGUGAU 82.9%

3697 GAACAGAUGGAAACUGAUG 85.6% 3751 UUGAGAGGGUCAGUUGCUC 82.9%

3698 GGAAACACUAAUCAACAGA 85.5% 3752 ACUGAACUUAAACUCAGUG 82.9%

3699 ACGAGAAUCCAGCACACAA 85.4% 3753 CAAACAGCUGCACAAAGAG 82.8%

3700 AACGAGAAUCCAGCACACA 85.4% 3754 AUUGAGAGGGUCAGUUGCU 82.8%

3701 ACACUAAUCAACAGAGGGC 85.1% 3755 AAACAGCUGCACAAAGAGC 82.8%

IQ 5

3756 UCAGUUGCUCACAAAUCUU 82.8% 3810 AGGACAAGAGCUCUUGUUC 79.1%

3757 UGGGUACGACUUCGAAAAA 82.7% 3811 UCUGGAGAGGUGAGAAUGG 78.9% 3758 CAGUUGCUCACAAAUCUUG 82.7% 3812 UUCUGGAGAGGUGAGAAUG 78.9%

3759 GGGUACGACUUCGAAAAAG 82.7% 3813 CCCAGCGCGGGGAAAGAUC 78.9%

3760 GGUACGACUUCGAAAAAGA 82.7% 3814 CCAGCGCGGGGAAAGAUCC 78.9%

3761 UCAAACAGCUGCACAAAGA 82.7% 3815 UACUGGGCCAUAAGGACCA 78.8%

3762 GUACGACUUCGAAAAAGAG 82.6% 3816 ACUGGGCCAUAAGGACCAG 78.7%

3763 GAUGAGGGAACUCGUCCUU 82.5% 3817 GAGGACAAGAGCUCUUGUU 78.3%

3764 UGGAUGAGGGAACUCGUCC 82.5% 3818 AUGGCGUCCCAAGGCACCA 77.8%

3765 GGAUGAGGGAACUCGUCCU 82.5% 3819 UGGCGUCCCAAGGCACCAA 77.7%

3766 CAGCUGCACAAAGAGCAAU 82.5% 3820 CGUCCCAAGGCACCAAACG 77.7%

3767 AUGAGGGAACUCGUCCUUU 82.5% 3821 GGCGUCCCAAGGCACCAAA 77.7%

3768 AGCUGCACAAAGAGCAAUG 82.5% 3822 GCGUCCCAAGGCACCAAAC 77.7%

3769 ACAGCUGCACAAAGAGCAA 82.4% 3823 UGUGCACUGAACUUAAACU 77.4%

3770 AACAGCUGCACAAAGAGCA 82.4% 3824 GUGCACUGAACUUAAACUC 77.4%

3771 UAAGCUUCAUCAGAGGGAC 82.2% 3825 AUGUGCACUGAACUUAAAC 77.4% 3772 AGAAAUAAGGCGAAUCUGG 82.2% 3826 CCAAGGCACCAAACGGUCU 77.4%

3773 GAAGAAAUAAGGCGAAUCU 82.1% 3827 CACUGAACUUAAACUCAGU 77.3%

3774 AAGAAAUAAGGCGAAUCUG 82.1% 3828 UGCACUGAACUUAAACUCA 77.3%

3775 UUAAGCUUCAUCAGAGGGA 82.1% 3829 GCACUGAACUUAAACUCAG 77.3%

3776 GGGAAGAUGAUUGAUGGAA 82.1% 3830 CCCAAGGCACCAAACGGUC 77.2%

3777 ACGAACCCGAUCGUGCCCU 82.1% 3831 UCCCAAGGCACCAAACGGU 77.1% 3778 GGAAGAUGAUUGAUGGAAU 82.1% 3832 AGAUCGAAGAUCUCAUAUU 76.9%

3779 AACGAACCCGAUCGUGCCC 82.1% 3833 GAGAUCGAAGAUCUCAUAϋ 76.9%

3780 GCCAGAAUGCAACUGAGAU 82.1% 3834 UGAGAUCGAAGAUCUCAUA 76.9%

3781 AGAAGAAAUAAGGCGAAUC 82.1% 3835 GUCCCAAGGCACCAAACGG 76.9% 3782 CGCCAGAAUGCAACUGAGA 82.1% 3836 GUACUGGGCCAUAAGGACC 76.8%

3783 GUGCUCUCUGCUUUUGAUG 81.7% 3837 UGGUGUGGAUGGCAUGCCA 76.6%

3784 GGUGCUCUCUGCUUUUGAU 81.7% 3838 GUGCUUAUGAGAGAAUGUG 76.4%

3785 UGCUCUCUGCUUUUGAUGA 81.7% 3839 GGUACUGGGCCAUAAGGAC 76.3%

3786 GAAGAUGAUUGAUGGAAUU 81.6% 3840 GUGUGGAUGGCAUGCCAUU 76.1%

3787 AUCCGUCGGGAAGAUGAUU 81.5% 3841 UGUGGAUGGCAUGCCAUUC 76.1%

3788 CAUCCGUCGGGAAGAUGAU 81.5% 3842 GGUGUGGAUGGCAUGCCAU 76.1%

3789 UCGGGAAGAUGAUUGAUGG 81.4% 3843 CAUACCAGAGGACAAGAGC 76.1%

3790 CGGGAAGAUGAUUGAUGGA 81.4% 3844 CAACAUACCAGAGGACAAG 76.1%

3791 AAGGCAACGAACCCGAUCG 81.3% 3845 ACAUACCAGAGGACAAGAG 76.1%

3792 AGGCAACGAACCCGAUCGU 81.3% 3846 CUGGUGUGGAUGGCAUGCC 76.0%

3793 GGUCAGUUGCUCACAAAUC 81.2% 3847 AACAUACCAGAGGACAAGA 75.9%

3794 GGGUCAGUUGCUCACAAAU 81.2% 3848 AUACCAGAGGACAAGAGCU 75.9%

3795 GUCGGGAAGAUGAUUGAUG 81.1% 3849 ACAUCCAAAUGUGCACUGA 75.8%

3796 AGGGUCAGUUGCUCACAAA 81.1% 3850 UACAUCCAAAUGUGCACUG 75.8%

3797 UCCGUCGGGAAGAUGAUUG 81.1% 3851 CUACAUCCAAAUGUGCACU 75.8%

3798 AAAGAAACCUCCCAUUUGA 81.0% 3852 GAUCGAAGAUCUCAUAUUU 75.8%

3799 CGUCGGGAAGAUGAUUGAU 81.0% 3853 AUCCAAAUGUGCACUGAAC 75.4%

3800 CAAAGAAACCUCCCAUUUG 81.0% 3854 CCUUUCAAACUACUUCAAA 75.4%

3801 CCGUCGGGAAGAUGAUUGA 81.0% 3855 UCCAAAUGUGCACUGAACU 75.4%

3802 UGAGGGCAGAAAUCAUAAG 80.7% 3856 CAUCCAAAUGUGCACUGAA 75.4%

3803 GGAAUAGACCCUUUCAAAC 80.6% 3857 CAAAUGUGCACUGAACUUA 75.3%

3804 GAAUAGACCCUUUCAAACU 80.6% 3858 AAAUGUGCACUGAACUUAA 75.3%

3805 GUGGGAAUAGACCCUUUCA 80.5% 3859 CUUUCAAACUACUUCAAAA 75.2%

3806 UGGGAAUAGACCCUUUCAA 80.5% 3860 AAUGUGCACUGAACUUAAA 75.1%

3807 GUCAGUUGCUCACAAAUCU 80.4% 3861 UACUCUUGAACUGAGAAGC 75.1%

3808 GGGAAUAGACCCUUUCAAA 80.3% 3862 GUGACAUCAAAAUCAUGGC 75.1%

3809 GGACAAGAGCUCUUGUUCG 79.1% 3863 UGACAUCAAAAUCAUGGCG 75.0%

3864 ACAUCAAAAUCAUGGCGUC 74.9% 3918 AGGAGUGGAGGAAACACUA 72.3%

3865 GACAUCAAAAUCAUGGCGU 74.9% 3919 GAGUGGAGGAAACACUAAU 72.3%

3866 CCAGGAGUGGAGGAAACAC 74.9% 3920 GGAGUGGAGGAAACACUAA 72.3%

3867 ACCAGGAGUGGAGGAAACA 74.9% 3921 CAGGAGUGGAGGAAACACU 72.2% 3868 AGUGACAUCAAAAUCAUGG 74.9% 3922 UCAAACGGGGGAUCAACGA 72.0%

3869 GACCCUUUCAAACUACUUC 74.8% 3923 CAAACGGGGGAUCAACGAU 71.9%

3870 GUACUCUUGAACUGAGAAG 74.8% 3924 CUUAAACUCAGUGAUUAUG 71.9%

3871 ACCCUUUCAAACUACUUCA 74.8% 3925 AGAAUGCAACUGAGAUUAG 71.9%

3872 AGUACUCUUGAACUGAGAA 74.7% 3926 CAGAAUGCAACUGAGAUUA 71.9% 3873 GAGUGACAUCAAAAUCAUG 74.6% 3927 UUAAACUCAGUGAUUAUGA 71.9% 3874 CCCUUUCAAACUACUUCAA 74.6% 3928 AAACGGGGGAUCAACGAUC 71.7% 3875 CCAAAUGUGCACUGAACUU 74.5% 3929 AACGGGGGAUCAACGAUCG 71.7% 3876 AGUGCUUAUGAGAGAAUGU 74.4% 3930 UCAAAUGAGAACAUGGAUA 71.6% 3877 GGAUGGCAUGCCAUUCUGC 74.3% 3931 GUUAAGCUUCAUCAGAGGG 71.6% 3878 UGGAUGGCAUGCCAUUCUG 74.2% 3932 UUCAAAUGAGAACAUGGAU 71.6%

3879 CUUCAAAUGAGAACAUGGA 74.1% 3933 CUCACAUGAUGAUCUGGCA 71.5%

3880 CAAAUUGCUUCAAAUGAGA 74.1% 3934 GAAGGUGCAAAACCAGAAG 71.5%

3881 GCUUCAAAUGAGAACAUGG 74.1% 3935 CUUCGGAGACAAUGCAGAA 71.5%

3882 AAAUUGCUUCAAAUGAGAA 74.0% 3936 GGAAGGUGCAAAACCAGAA 71.5% 3883 AUUGCUUCAAAUGAGAACA 74.0% 3937 AAGGUGCAAAACCAGAAGA 71.5% 3884 UGAGUGACAUCAAAAUCAU 74.0% 3938 ACUCACAUGAUGAUCUGGC 71.5% 3885 UUGCUUCAAAUGAGAACAU 74.0% 3939 AGGUGCAAAACCAGAAGAA 71.5%

3886 AAUUGCUUCAAAUGAGAAC 73.9% 3940 GUGGAUGGCAUGCCAUUCU 71.3%

3887 AACAGAGGGCCUCCGCAGG 73.9% 3941 AUGGCAUGCCAUUCUGCUG 71.3%

3888 UGCUUCAAAUGAGAACAUG 73.9% 3942 UGGCAUGCCAUUCUGCUGC 71.3% 3889 AGGGCCUCCGCAGGCCAAA 73.9% 3943 GAUGGCAUGCCAUUCUGCU 71.3% 3890 CAACAGAGGGCCUCCGCAG 73.9% 3944 GAACUUAAACUCAGUGAUU 71.2% 3891 UAGACCCUUUCAAACUACU 73.9% 3945 ACUUAAACUCAGUGAUUAU 71.2% 3892 AAUCAACAGAGGGCCUCCG 73.8% 3946 UUCGGAGACAAUGCAGAAG 71.2%

3893 AUCAACAGAGGGCCUCCGC 73.8% 3947 AUCCUAAGAAAACUGGAGG 71.2%

3894 UCAACAGAGGGCCUCCGCA 73.8% 3948 CCAGAAUGCAACUGAGAUU 71.2% 3895 AGACCCUUUCAAACUACUU 73.8% 3949 CUCACUGAGUGACAUCAAA 71.2% 3896 GGGGAGUUUUCGAGCUCUC 73.6% 3950 UCGGAGACAAUGCAGAAGA 71.2%

3897 AUAGACCCUUUCAAACUAC 73.6% 3951 AACUUAAACUCAGUGAUUA 71.2%

3898 GAGUUUUCGAGCUCUCAGA 73.6% 3952 GAUCCUAAGAAAACUGGAG 71.1% 3899 ACAGAGGGCCUCCGCAGGC 73.5% 3953 CAGCUGGUCUAACUCACAU 71.0% 3900 GGAGUUUUCGAGCUCUCAG 73.5% 3954 GAAUGCAACUGAGAUUAGG 71.0% 3901 CAGAGGGCCUCCGCAGGCC 73.5% 3955 ACAGCUGGUCUAACUCACA 71.0% 3902 AGAGGGCCUCCGCAGGCCA 73.5% 3956 CAUUCUGCUGCAUUUGAAG 70.9%

3903 AAUAGACCCUUUCAAACUA 73.4% 3957 AUGCCAUUCUGCUGCAUUU 70.8%

3904 GGGAGUUUUCGAGCUCUCA 73.4% 3958 GCCAUUCUGCUGCAUUUGA 70.8%

3905 UCACUGAGUGACAUCAAAA 73.4% 3959 CAUGCCAUUCUGCUGCAUU 70.8%

3906 GAGGGCCUCCGCAGGCCAA 73.4% 3960 CCAUUCUGCUGCAUUUGAA 70.8%

3907 UAAUCAACAGAGGGCCUCC 73.3% 3961 UGCCAUUCUGCUGCAUUUG 70.8%

3908 CUGAGUGACAUCAAAAUCA 73.1% 3962 GCAUGCCAUUCUGCUGCAU 70.8%

3909 ACUGAGUGACAUCAAAAUC 73.1% 3963 GGCAUGCCAUUCUGCUGCA 70.8%

3910 CACUGAGUGACAUCAAAAU 73.0% 3964 UGGAGAGGUGAGAAUGGGC 70.8%

3911 AUCGAAGAUCUCAUAUUUU 72.9% 3965 GGAGAGGUGAGAAUGGGCG 70.8%

3912 UUGUUAAGCUUCAUCAGAG 72.6% 3966 CUGGAGAGGUGAGAAUGGG 70.8%

3913 GAUGAUUGAUGGAAUUGGG 72.6% 3967 CAAAUGAGAACAUGGAUAA 70.6%

3914 UGUUAAGCUUCAUCAGAGG 72.6% 3968 UAAGGACCAGGAGUGGAGG 70.6%

3915 UUCGAAAAAGAGGGAUAUU 72.3% 3969 AUAAGGACCAGGAGUGGAG 70.6%

3916 UCGAAAAAGAGGGAUAUUC 72.3% 3970 AAGGACCAGGAGUGGAGGA 70.5%

3917 ACAAAUUGCUUCAAAUGAG 72.3% 3971 AGGACCAGGAGUGGAGGAA 70.5%

3972 GAAAAAGAGGGAUAUUCCU 70.5% 3987 GUUUUCGAGCUCUCAGACG 70.3%

3973 CGGAGACAAUGCAGAAGAG 70.5% 3988 GCUGCAGUCAAAGGAAUCG 70.3%

3974 CGAAAAAGAGGGAUAUUCC 70.5% 3989 AGAAAGUCGGAACCCAGGA 70.3% 3975 GCCUGUGUGUAUGGACCUG 70.5% 3990 AAAGUCGGAACCCAGGAAA 70.3%

3976 AAAAAGAGGGAUAUUCCUU 70.5% 3991 AACAGCUUGACAAUAGAGA 70.2%

3977 CCUGUGUGUAUGGACCUGC 70.5% 3992 GGAGACAAUGCAGAAGAGU 70.2%

3978 GAGAAAGUCGGAACCCAGG 70.4% 3993 AGAUUGUUAAGCUUCAUCA 70.1%

3979 AGAGAAAGUCGGAACCCAG 70.4% 3994 GAUUGUUAAGCUUCAUCAG 70.1%

3980 AAAUGAGAACAUGGAUAAU 70.4% 3995 UGGAAGAACACCCCAGCGC 70.1% 3981 CUUCGAAAAAGAGGGAUAU 70.4% 3996 CUGCCUGUGUGUAUGGACC 70.0%

3982 GGACCAGGAGUGGAGGAAA 70.4% 3997 CUGGAAGAACACCCCAGCG 70.0%

3983 UUUUCGAGCUCUCAGACGA 70.3% 3998 GAGACAAUGCAGAAGAGUA 70.0%

3984 CUGCAGUCAAAGGAAUCGG 70.3% 3999 AAUGAGAACAUGGAUAAUA 70.0%

3985 AGUUUUCGAGCUCUCAGAC 70.3% 4000 AUGAGAACAUGGAUAAUAU 70.0% 3986 GAAAGUCGGAACCCAGGAA 70.3% 4001 CCUGCCUGUGUGUAUGGAC 70.0%

Table 2-6: Conserved 19-mer sequences that are present in at least 70% of the Influenza A segment 2 (NA) sequences listed in Table 1-6.

Seq 4035 AGCUCAAGUUGUCACGAUG 85.8%

ID Sequence Percent

4036 CAGAGACAACUGGAAAGGC 85.8%

4002 CAGGCUCAUGGCCUGAUGG 87.4%

4037 GAAUGCGUUUGUAUCAAUG 85.7%

4003 GUGGACCUCAAACAGUAUU 87.4% 4038 AAUGCGUUUGUAUCAAUGG 85.7%

4004 UAGCAUGGUCCAGCUCAAG 87.3%

4039 AGUAUUGUUGUGUUUUGUG 85.6%

4005 UGGACCUCAAACAGUAUUG 87.3%

4040 GUAUUGUUGUGUUUUGUGG 85.6%

4006 ACAGGCUCAUGGCCUGAUG 87.2%

4041 UCAAGUUGUCACGAUGGAA 85.5%

4007 GGACCUCAAACAGUAUUGU 87.1%

4042 GUUGUCACGAUGGAAAAGC 85.5%

4008 UGGUCCAGCUCAAGUUGUC 87.0%

4043 CAGUAUUGUUGUGUUUUGU 85.5%

4009 GACCUCAAACAGUAUUGUU 87.0%

4044 AUUGGUCAAAGCCGCAAUG 85.5%

4010 GGUCCAGCUCAAGUUGUCA 87.0%

4045 CAAGUUGUCACGAUGGAAA 85.5%

4011 CAUGGUCCAGCUCAAGUUG 87.0%

4046 ACAGUAUUGUUGUGUUUUG 85.5%

4012 GCAUGGUCCAGCUCAAGUU 87.0%

4047 AACAGUAUUGUUGUGUUUU 85.5%

4013 ACCUCAAACAGUAUUGUUG 87.0%

4048 AAUUGGUCAAAGCCGCAAU 85.5%

4014 AGCAUGGUCCAGCUCAAGU 87.0%

4049 AAACAGUAUUGUUGUGUUU 85.4%

4015 AUGGUCCAGCUCAAGUUGU 86.7%

4050 AGUUGUCACGAUGGAAAAG 85.4%

4016 CCAGUUAUGUGUGCUCAGG 86.4%

4051 CUCAAGUUGUCACGAUGGA 85.4%

4017 CAAACAGUAUUGUUGUGUU 86.4%

4052 AAGUUGUCACGAUGGAAAA 85.3%

4018 UCAAACAGUAUUGUUGUGU 86.4%

4053 CAACUGCUAGCUUCAUUUA 85.1%

4019 CCUCAAACAGUAUUGUUGU 86.4%

4054 GCAACUGCUAGCUUCAUUU 85.1%

4020 UCCAGUUAUGUGUGCUCAG 86.4%

4055 AAUGCAACUGCUAGCUUCA 84.9%

4021 UUCCAGUUAUGUGUGCUCA 86.3%

4056 UGCAACUGCUAGCUUCAUU 84.9%

4022 UCUGCAGAGACAACUGGAA 86.1%

4057 AUGCAACUGCUAGCUUCAU 84.9%

4023 UGCAGAGACAACUGGAAAG 86.1%

4058 GAACCUUAUGUGUCAUGCG 84.8%

4024 GCAGAGACAACUGGAAAGG 86.1%

4059 AACCUUAUGUGUCAUGCGA 84.7%

4025 CUCAAACAGUAUUGUUGUG 86.1%

4060 AAUCCAAAUCAAAAGAUAA 84.7%

4026 CUGCAGAGACAACUGGAAA 86.1%

4061 AUCCAAAUCAAAAGAUAAU 84.7%

4027 GUCCAGCUCAAGUUGUCAC 86.0%

4062 AAAUGCAACUGCUAGCUUC 84.7%

4028 GAGACAACUGGAAAGGCUC 86.0%

4063 UCCAAAUCAAAAGAUAAUA 84.6%

4029 GCUCAAGUUGUCACGAUGG 85.9%

4064 CAAAUCAAAAGAUAAUAAC 84.5%

4030 CCAGCUCAAGUUGUCACGA 85.9%

4065 AUAGCAUGGUCCAGCUCAA 84.5%

4031 UCCAGCUCAAGUUGUCACG 85.9%

4066 CCAAAUCAAAAGAUAAUAA 84.5%

4032 CAGUAGUAAUGACUGAUGG 85.9%

4067 ACAGUAGUAAUGACUGAUG 84.4%

4033 CAGCUCAAGUUGUCACGAU 85.8%

4068 GUCACGAUGGAAAAGCAUG 84.2%

4034 AGAGACAACUGGAAAGGCU 85.8%

4069 UUGGUCAAAGCCGCAAUGU 84.2% 4123 CUCAUGGCCUGAUGGGGCG 82.7%

4070 GUGGAGUUGAUAAGGGGAA 84.2% 4124 AAAUUGGUCAAAGCCGCAA 82.7%

4071 UCACGAUGGAAAAGCAUGG 84.2% 4125 CAGGAUUUGCACCUUUUUC 82.7%

4072 UGGAGUUGAUAAGGGGAAG 84.1% 4126 ACAGGAUUUGCACCUUUUU 82.7%

4073 UGUCACGAUGGAAAAGCAU 84.1% 4127 AACAGGCUCAUGGCCUGAU 82.7%

4074 GAGAACCUUAUGUGUCAUG 84.0% 4128 GGAAAAGCAUGGCUGCAUG 82.6%

4075 AGAGAACCUUAUGUGUCAU 84.0% 4129 GAAAAGCAUGGCUGCAUGU 82.6%

4076 UUGUCACGAUGGAAAAGCA 83.9% 4130 AAAAGCAUGGCUGCAUGUU 82.6%

4077 ACCUUAUGUGUCAUGCGAU 83.8% 4131 UGGAAAAGCAUGGCUGCAU 82.5%

4078 CCUUAUGUGUCAUGCGAUC 83.8% 4132 AUGGAAAAGCAUGGCUGCA 82.5%

4079 AGGCUCAUGGCCUGAUGGG 83.8% 4133 GAUGGAAAAGCAUGGCUGC 82.4%

4080 UUUUAUGUGGAGUUGAUAA 83.8% 4134 GUGGAUGGGAAGAACGAUC 82.4%

4081 GGCUCAUGGCCUGAUGGGG 83.8% 4135 CAGAAAUUGGUCAAAGCCG 82.3%

4082 UUUAUGUGGAGUUGAUAAG 83.8% 4136 UACAGGAUUUGCACCUUUU 82.2%

4083 GGUGCUUUUAUGUGGAGUU 83.7% 4137 CGUGUGGAUGGGAAGAACG 82.0%

4084 ACGUGUGGAUGGGAAGAAC 83.6% 4138 CAUAACAGAGAUAGUGUAU 81.8%

4085 AACAUAACAGAGAUAGUGU 83.6% 4139 UUAUGUGUCAUGCGAUCCU 81.8%

4086 ACAUAACAGAGAUAGUGUA 83.6% 4140 CUUAUGUGUCAUGCGAUCC 81.8% 4087 AUGACGUGUGGAUGGGAAG 83.6% 4141 GGAUUUGCACCUUUUUCUA 81.5%

4088 UAUUGUUGUGUUUUGUGGC 83.5% 4142 CACGAUGGAAAAGCAUGGC 81.5%

4089 UGACGUGUGGAUGGGAAGA 83.5% 4143 ACGAUGGAAAAGCAUGGCU 81.5%

4090 GACGUGUGGAUGGGAAGAA 83.5% 4144 UGGAUGGGAAGAACGAUCA 81.4%

4091 AUUGUUGUGUUUUGUGGCA 83.5% 4145 GAUUUGCACCUUUUUCUAA 81.4%

4092 UUGUUGUGUUUUGUGGCAC 83.5% 4146 AUUUGCACCUUUUUCUAAG 81.4%

4093 UACAGAAAUUGGUCAAAGC 83.4% 4147 UUGCACCUUUUUCUAAGGA 81.4%

4094 AAAAUGCAACUGCUAGCUU 83.4% 4148 CGAUGGAAAAGCAUGGCUG 81.4%

4095 ACAGAAAUUGGUCAAAGCC 83.4% 4149 UUUGCACCUUUUUCUAAGG 81.4%

4096 AUGUGGAGUUGAUAAGGGG 83.3% 4150 UAUGUGUCAUGCGAUCCUG 81.3%

4097 GCUUUUAUGUGGAGUUGAU 83.3% 4151 UGAAUCCAAAUCAAAAGAU 81.3%

4098 UAUGUGGAGUUGAUAAGGG 83.3% 4152 AUGAAUCCAAAUCAAAAGA 81.3%

4099 UGUGGAGUUGAUAAGGGGA 83.3% 4153 GAAUCCAAAUCAAAAGAUA 81.3%

4100 CUUUUAUGUGGAGUUGAUA 83.3% 4154 GGAUGGGAAGAACGAUCAG 80.5%

4101 GUGCUUUUAUGUGGAGUUG 83.3% 4155 UGAAAGGCUGGGCCUUUGA 80.1%

4102 UGCUUUUAUGUGGAGUUGA 83.3% 4156 GAGUGAAAGGCUGGGCCUU 80.0%

4103 UGUGGAUGGGAAGAACGAU 83.3% 4157 GGAGUGAAAGGCUGGGCCU 80.0%

4104 AGAACCUUAUGUGUCAUGC 83.2% 4158 GUGAAAGGCUGGGCCUUUG 80.0%

4105 UUAUGUGGAGUUGAUAAGG 83.1% 4159 UGCACCUUUUUCUAAGGAC 79.8%

4106 AUUACAGGAUUUGCACCUU 83.1% 4160 GCACCUUUUUCUAAGGACA 79.8%

4107 AUACAGAAAUUGGUCAAAG 83.1% 4161 CACCUUUUUCUAAGGACAA 79.8%

4108 UUACAGGAUUUGCACCUUU 83.1% 4162 CAUGGCUGCAUGUUUGUGU 79.8%

4109 AAAGCAUGGCUGCAUGUUU 83.1% 4163 AGUGAAAGGCUGGGCCUUU 79.8%

4110 AAGCAUGGCUGCAUGUUUG 83.1% 4164 GCAUGGCUGCAUGUUUGUG 79.8%

4111 GUGUGGAUGGGAAGAACGA 83.1% 4165 ACCUUUUUCUAAGGACAAU 79.8%

4112 AGCAUGGCUGCAUGUUUGU 83.1% 4166 CCUUUUUCUAAGGACAAUU 79.7%

4113 AUGGAACAGGCUCAUGGCC 83.0% 4167 GAUGGGAAGAACGAUCAGC 79.7%

4114 UAUGGAACAGGCUCAUGGC 83.0% 4168 CUUUUUCUAAGGACAAUUC 79.7%

4115 GCUCAUGGCCUGAUGGGGC 82.9% 4169 UAACUACUGUAACAUUGCA 79.3%

4116 GAAUACAGAAAUUGGUCAA 82.9% 4170 CUACUGUAACAUUGCAUUU 79.3%

4117 AAUACAGAAAUUGGUCAAA 82.9% 4171 ACUACUGUAACAUUGCAUU 79.3%

4118 UGGAACAGGCUCAUGGCCU 82.9% 4172 AACUACUGUAACAUUGCAU 79.0%

4119 GAAAUUGGUCAAAGCCGCA 82.8% 4173 GAAAUGACGUGUGGAUGGG 78.6%

4120 GGAACAGGCUCAUGGCCUG 82.8% 4174 GGAAAUGACGUGUGGAUGG 78.5%

4121 AGAAAUUGGUCAAAGCCGC 82.8% 4175 UGGAAAUGACGUGUGGAUG 78.4%

4122 GAACAGGCUCAUGGCCUGA 82.8% 4176 AUGGAAAUGACGUGUGGAU 78.4%

4177 GAUGGAAAUGACGUGUGGA 78.3% 4231 UUUCCAGUUAUGUGUGCUC 75.8%

4178 AAAUGACGUGUGGAUGGGA 78.2% 4232 GUCAGGAAGUGCUCAGCAU 75.8%

4179 AAUGACGUGUGGAUGGGAA 78.2% 4233 GGUGUCAGAUGUGUCUGCA 75.6%

4180 GUACAGUAGUAAUGACUGA 78.0% 4234 CUGGUGUCAGAUGUGUCUG 75.6%

4181 UGUACAGUAGUAAUGACUG 78.0% 4235 GUGUCAGAUGUGUCUGCAG 75.6%

4182 AGGAUUUGCACCUUUUUCU 78.0% 4236 UGGUGUCAGAUGUGUCUGC 75.5%

4183 CUUGUACAGUAGUAAUGAC 78.0% 4237 GUUUCCAGUUAUGUGUGCU 75.5%

4184 ACUUGUACAGUAGUAAUGA 77.9% 4238 GCAUUGUUUCCAGUUAUGU 75.5%

4185 UACAGUAGUAAUGACUGAU 77.9% 4239 AGCAUUGUUUCCAGUUAUG 75.5%

4186 UUGUACAGUAGUAAUGACU 77.9% 4240 GUACAUAUGGAACAGGCUC 75.5%

4187 AACUGCUAGCUUCAUUUAC 77.9% 4241 UGUUUCCAGUUAUGUGUGC 75.4%

4188 GUUAUUCUGGUAUUUUCUC 77.6% 4242 GUCAGAUGUGUCUGCAGAG 75.4%

4189 GGUUAUUCUGGUAUUUUCU 77.6% 4243 GGUACAUAUGGAACAGGCU 75.4%

4190 AUACUAAAAUACUAUUCAU 77.2% 4244 UGUCAGAUGUGUCUGCAGA 75.4%

4191 GAUACUAAAAUACUAUUCA 77.1% 4245 CCUGGUGUCAGAUGUGUCU 75.4%

4192 UACUAAAAUACUAUUCAUU 77.1% 4246 CAUAUGGAACAGGCUCAUG 75.3%

4193 AUGAAACCUUCAAAGUCAU 77.0% 4247 UACAUAUGGAACAGGCUCA 75.3%

4194 UCAGAUGUGUCUGCAGAGA 76.9% 4248 CAGGUACAUAUGGAACAGG 75.3%

4195 CUGAUACUAAAAUACUAUU 76.9% 4249 ACUGGAAAGGCUCCAAUAG 75.3%

4196 UCAAUCUCAUGCCUAUAUA 76.9% 4250 AUAUGGAACAGGCUCAUGG 75.3%

4197 CAGGAAGUGCUCAGCAUGU 76.9% 4251 AGGUACAUAUGGAACAGGC 75.3%

4198 AUCAAUCUCAUGCCUAUAU 76.9% 4252 AUAGCAUUGUUUCCAGUUA 75.3%

4199 CAAUCUCAUGCCUAUAUAA 76.9% 4253 UUGUUUCCAGUUAUGUGUG 75.3%

4200 UAUGAAACCUUCAAAGUCA 76.9% 4254 UAGCAUUGUUUCCAGUUAU 75.3%

4201 CAGAUGUGUCUGCAGAGAC 76.8% 4255 UCAGGUACAUAUGGAACAG 75.2%

4202 GCUGAUACUAAAAUACUAU 76.8% 4256 AACUGGAAAGGCUCCAAUA 75.2%

4203 ACUAAAAUACUAUUCAUUG 76.7% 4257 AUUGUUUCCAGUUAUGUGU 75.2%

4204 UGAUACUAAAAUACUAUUC 76.7% 4258 ACAUAUGGAACAGGCUCAU 75.2%

4205 AGAUGUGUCUGCAGAGACA 76.7% 4259 CAUUGUUUCCAGUUAUGUG 75.1%

4206 CUAAAAUACUAUUCAUUGA 76.7% 4260 CUGGAAAGGCUCCAAUAGG 75.1%

4207 GAUGUGUCUGCAGAGACAA 76.7% 4261 GACAACUGGAAAGGCUCCA 75.1%

4208 UCAGGAAGUGCUCAGCAUG 76.6% 4262 UAGAAAGAAACAUAACAGA 75.1%

4209 ACUGUAACAUUGCAUUUCA 76.3% 4263 ACAACUGGAAAGGCUCCAA 75.1%

4210 UAACAUUGCAUUUCAAGCA 76.3% 4264 CAACUGGAAAGGCUCCAAU 75.1%

4211 AGCUGAUACUAAAAUACUA 76.3% 4265 AGAAAGAAACAUAACAGAG 74.9%

4212 GUAACAUUGCAUUUCAAGC 76.2% 4266 AAAGAAACAUAACAGAGAU 74.9%

4213 GGAAAGGCUCCAAUAGGCC 76.2% 4267 GAAAGAAACAUAACAGAGA 74.9%

4214 UGGAAAGGCUCCAAUAGGC 76.2% 4268 AGACAACUGGAAAGGCUCC 74.9%

4215 ACACACCCAGAAAAAACGA 76.1% 4269 AAAGGCUCCAAUAGGCCCA 74.6%

4216 UGUAACAUUGCAUUUCAAG 76.1% 4270 GAAACAUAACAGAGAUAGU 74.6%

4217 CUGUAACAUUGCAUUUCAA 76.1% 4271 AGAAACAUAACAGAGAUAG 74.6%

4218 GACACACCCAGAAAAAACG 76.1% 4272 AAGGCUCCAAUAGGCCCAU 74.6%

4219 UGUCAGGAAGUGCUCAGCA 76.0% 4273 AAGAAACAUAACAGAGAUA 74.6%

4220 ACACCCAGAAAAAACGACA 76.0% 4274 ACCUCAGGUACAUAUGGAA 74.4%

4221 UGUCUGCAGAGACAACUGG 75.9% 4275 CCUCAGGUACAUAUGGAAC 74.3%

4222 UGUGUCUGCAGAGACAACU 75.9% 4276 CUCAGGUACAUAUGGAACA 74.3%

4223 CACCCAGAAAAAACGACAG 75.9% 4277 UACUGUAACAUUGCAUUUC 74.1%

4224 CAUGGCCUGAUGGGGCGGA 75.9% 4278 UACAUGAUAGGACCCCUUA 73.8%

4225 GUCUGCAGAGACAACUGGA 75.9% 4279 CAUAGCAUGGUCCAGCUCA 73.5%

4226 GUGUCUGCAGAGACAACUG 75.9% 4280 AUGGGAAGAACGAUCAGCG 73.5%

4227 UCAUGGCCUGAUGGGGCGG 75.9% 4281 UGCAUAGCAUGGUCCAGCU 73.5%

4228 GAAAGGCUCCAAUAGGCCC 75.9% 4282 GCAUAGCAUGGUCCAGCUC 73.5%

4229 CACACCCAGAAAAAACGAC 75.9% 4283 UUGUCAGGAAGUGCUCAGC 73.3% 4230 AUGUGUCUGCAGAGACAAC 75.8% 4284 UGGGAAGAACGAUCAGCGA 73.3%

4285 AUUGUCAGGAAGUGCUCAG 73.3% 4339 UAUGUGUGCUCAGGACUUG 71.9%

4286 CUUGUUGGAGACACACCCA 72.9% 4340 UGUGCUCAGGACUUGUUGG 71.9%

4287 GUGCAUAGCAUGGUCCAGC 72.9% 4341 GAUAUCCUGGUGUCAGAUG 71.9%

4288 GUGUGCAUAGCAUGGUCCA 72.9% 4342 AAUCGGUGCUUUUAUGUGG 71.8%

4289 UUGUUGGAGACACACCCAG 72.9% 4343 UGUGUGCUCAGGACUUGUU 71.8%

4290 UGUGCAUAGCAUGGUCCAG 72.9% 4344 CUCGAUAUCCUGGUGUCAG 71.8%

4291 UGUUGGAGACACACCCAGA 72.9% 4345 AGUUAUGUGUGCUCAGGAC 71.8%

4292 GUUGGAGACACACCCAGAA 72.9% 4346 GUGUGCUCAGGACUUGUUG 71.8%

4293 AAGUGUGCAUAGCAUGGUC 72.8% 4347 AUCGGUGCUUUUAUGUGGA 71.8%

4294 AGUGUGCAUAGCAUGGUCC 72.8% 4348 GCAAGUGUGCAUAGCAUGG 71.8%

4295 UUGGAGACACACCCAGAAA 72.8% 4349 UUUUGUGGCACCUCAGGUA 71.7%

4296 CAAGUGUGCAUAGCAUGGU 72.7% 4350 CCUCGAUAUCCUGGUGUCA 71.7%

4297 GGAGACACACCCAGAAAAA 72.6% 4351 AUCAAUCGGUGCUUUUAUG 71.7%

4298 UGGAGACACACCCAGAAAA 72.6% 4352 UCGAUAUCCUGGUGUCAGA 71.7%

4299 AAACAUAACAGAGAUAGUG 72.4% 4353 GUUUUGUGGCACCUCAGGU 71.7% 4300 AGAAUACAGAAAUUGGUCA 72.4% 4354 UUUGUGGCACCUCAGGUAC 71.7% 4301 UCCGGUUAUUCUGGUAUUU 72.4% 4355 CAAUCGGUGCUUUUAUGUG 71.7%

4302 AUUAUAGCAUUGUUUCCAG 72.4% 4356 UUGUGGCACCUCAGGUACA 71.6%

4303 UAUAGCAUUGUUUCCAGUU 72.4% 4357 CAGUUAUGUGUGCUCAGGA 71.5%

4304 GGGAAGAACGAUCAGCGAG 72.4% 4358 ACCCAGAAAAAACGACAGC 71.5%

4305 CCGGUUAUUCUGGUAUUUU 72.4% 4359 CCCAGAAAAAACGACAGCU 71.5%

4306 UUAUAGCAUUGUUUCCAGU 72.4% 4360 GCAUCAAUCGGUGCUUUUA 71.4% 4307 CAGUACAUGAUAGGACCCC 72.3% 4361 UUGUGUUUUGUGGCACCUC 71.4% 4308 UGCGUUUGUAUCAAUGGAA 72.3% 4362 CAUCAAUCGGUGCUUUUAU 71.4%

4309 GAUUAUAGCAUUGUUUCCA 72.3% 4363 UCCUCGAUAUCCUGGUGUC 71.4%

4310 GGCUGCAUGUUUGUGUAAC 72.2% 4364 UGCAUCAAUCGGUGCUUUU 71.4%

4311 CACCUCAGGUACAUAUGGA 72.2% 4365 AUCCUCGAUAUCCUGGUGU 71.4%

4312 GUACAUGAUAGGACCCCUU 72.2% 4366 UAUCCUCGAUAUCCUGGUG 71.4%

4313 UGGCACCUCAGGUACAUAU 72.2% 4367 GCUCAGGACUUGUUGGAGA 71.3%

4314 CGGUUAUUCUGGUAUUUUC 72.2% 4368 UGCUCAGGACUUGUUGGAG 71.3%

4315 GAGACACACCCAGAAAAAA 72.2% 4369 GUUGUGUUUUGUGGCACCU 71.3%

4316 GGCACCUCAGGUACAUAUG 72.2% 4370 UGUUGUGUUUUGUGGCACC 71.3%

4317 GUGGCACCUCAGGUACAUA 72.2% 4371 AUUGGCUCUGUUUCUCUCA 71.2%

4318 GCGUUUGUAUCAAUGGAAC 72.2% 4372 UGUGUUUUGUGGCACCUCA 71.2%

4319 UUUGUAUCAAUGGAACUUG 72.2% 4373 GGACUUGUUGGAGACACAC 71.2%

4320 ACAGUACAUGAUAGGACCC 72.2% 4374 GACUUGUUGGAGACACACC 71.2%

4321 GCACCUCAGGUACAUAUGG 72.2% 4375 CAGGACUUGUUGGAGACAC 71.2%

4322 UGUGGCACCUCAGGUACAU 72.2% 4376 CUCAGGACUUGUUGGAGAC 71.2%

4323 UGGCUGCAUGUUUGUGUAA 72.2% 4377 AGGACUUGUUGGAGACACA 71.2%

4324 AGUACAUGAUAGGACCCCU 72.2% 4378 GUGUUUUGUGGCACCUCAG 71.2%

4325 GUUUGUAUCAAUGGAACUU 72.1% 4379 UCAGGACUUGUUGGAGACA 71.2%

4326 AUGCGUUUGUAUCAAUGGA 72.1% 4380 UGUUUUGUGGCACCUCAGG 71.2%

4327 ACACAGUACAUGAUAGGAC 72.0% 4381 ACUUGUUGGAGACACACCC 71.1%

4328 UUGUAUCAAUGGAACUUGU 72.0% 4382 UCAAUGGAACUUGUACAGU 71.0%

4329 CGUUUGUAUCAAUGGAACU 72.0% 4383 AUCAAUGGAACUUGUACAG 71.0%

4330 GUCCGGUUAUUCUGGUAUU 72.0% 4384 CUGCAUCAAUCGGUGCUUU 71.0%

4331 UCAAUCGGUGCUUUUAUGU 72.0% 4385 GUGCUCAGGACUUGUUGGA 70.9%

4332 CACAGUACAUGAUAGGACC 72.0% 4386 GUUAUCAAUUUGCCCUUGG 70.8%

4333 AGACACACCCAGAAAAAAC 71.9% 4387 AAGCUGCAUCAAUCGGUGC 70.8%

4334 AUGUGUGCUCAGGACUUGU 71.9% 4388 GCUGCAUCAAUCGGUGCUU 70.8%

4335 GUUAUGUGUGCUCAGGACU 71.9% 4389 UGUUAUCAAUUUGCCCUUG 70.8%

4336 CGAUAUCCUGGUGUCAGAU 71.9% 4390 AAAAGCUGCAUCAAUCGGU 70.8%

4337 UUAUGUGUGCUCAGGACUU 71.9% 4391 AAAGCUGCAUCAAUCGGUG 70.8%

4338 AUAUCCUGGUGUCAGAUGU 71.9% 4392 AGCUGCAUCAAUCGGUGCU 70.8%

4393 UUGGCUCUGUUUCUCUCAC 70.7% 4410 GCAAAAGCUGCAUCAAUCG 70.4%

4394 GGUCCGGUUAUUCUGGUAU 70.7% 4411 AAGAGAACCUUAUGUGUCA 70.4%

4395 AGGUCCGGUUAUUCUGGUA 70.7% 4412 AUGACACAGUACAUGAUAG 70.3%

4396 CAAAAGCUGCAUCAAUCGG 70.6% 4413 AAUGACACAGUACAUGAUA 70.3%

4397 AUGGCUGCAUGUUUGUGUA 70.6% 4414 UUUGAUGAUGGAAAUGACG 70.3%

4398 UAUCAAUGGAACUUGUACA 70.6% 4415 UAAUAGAAAGAAACAUAAC 70.3%

4399 GUAUCAAUGGAACUUGUAC 70.6% 4416 AGGGAACAACACUAAACAA 70.3%

4400 UGUAUCAAUGGAACUUGUA 70.6% 4417 UAUCCUGGUGUCAGAUGUG 70.2%

4401 AAUAGAAAGAAACAUAACA 70.6% 4418 AUCCUGGUGUCAGAUGUGU 70.2%

4402 UCAUGGAGUGAAAGGCUGG 70.5% 4419 UCGGUGCUUUUAUGUGGAG 70.2%

4403 CAUGGAGUGAAAGGCUGGG 70.5% 4420 CAGGGAACAACACUAAACA 70.2%

4404 CAAGAGAACCUUAUGUGUC 70.5% 4421 GACACAGUACAUGAUAGGA 70.2%

4405 AUGGAGUGAAAGGCUGGGC 70.5% 4422 UUGAUGAUGGAAAUGACGU 70.2%

4406 AUAGAAAGAAACAUAACAG 70.5% 4423 GGGAACAACACUAAACAAC 70.1%

4407 ACAAGAGAACCUUAUGUGU 70.5% 4424 GGCAAAAGCUGCAUCAAUC 70.1%

4408 UGGAGUGAAAGGCUGGGCC 70.4% 4425 CAAAUGACACAGUACAUGA 70.0%

4409 UCAAAUGACACAGUACAUG 70.4% 4426 UCCUGGUGUCAGAUGUGUC 70.0%

Table 2-7: Conserved 19-mer sequences that are present in at least 70% of the Influenza A segment 2 (Ml & M2) sequences listed in Table 1-7.

Seq 4459 GCUUUGUCCAAAAUGCCCU 94.3%

ID Sequence Percent

4460 CGCUUUGUCCAAAAUGCCC 94.3%

4427 UUCACGCUCACCGUGCCCA 99.2%

4461 UCAGUUAUUCUGCUGGUGC 94.2%

4428 CAGGCCCCCUCAAAGCCGA 99.0%

4462 AUUCUGCUGGUGCACUUGC 94.1%

4429 UCAGGCCCCCUCAAAGCCG 99.0%

4463 CUCAUGGAAUGGCUAAAGA 94.1% 4430 UCACGCUCACCGUGCCCAG 98.9%

4464 UCAUGGAAUGGCUAAAGAC 94.1%

4431 CACGCϋCACCGUGCCCAGU 98.8%

4465 UAUUCUGCUGGUGCACUUG 94.1%

4432 CGCUCACCGUGCCCAGUGA 98.8%

4466 AUGGAAUGGCUAAAGACAA 94.0%

4433 ACGCUCACCGUGCCCAGUG 98.8%

4467 CUCAGUUAUUCUGCUGGUG 94.0%

4434 UCACCGUGCCCAGUGAGCG 98.7%

4468 GUUAUUCUGCUGGUGCACU 94.0%

4435 CUCACCGUGCCCAGUGAGC 98.7%

4469 UGGCCAGCACUACAGCUAA 94.0%

4436 GCUCACCGUGCCCAGUGAG 98.5%

4470 UUAUUCUGCUGGUGCACUU 94.0%

4437 AUCUUGAGGCUCUCAUGGA 96.4%

4471 CAUGGAAUGGCUAAAGACA 94.0%

4438 GAUCUUGAGGCUCUCAUGG 96.4%

4472 UGGAAUGGCUAAAGACAAG 93.9% 4439 UGUCACCUCUGACUAAGGG 96.3%

4473 UCUCAUGGAAUGGCUAAAG 93.9%

4440 CUGUCACCUCUGACUAAGG 96.3%

4474 AGUUAUUCUGCUGGUGCAC 93.9%

4441 GACUGCAGCGUAGACGCUU 95.6%

4475 GGAAUGGCUAAAGACAAGA 93.7%

4442 GGACUGCAGCGUAGACGCU 95.6%

4476 AAUGGCUAAAGACAAGACC 93.7%

4443 UGCAGCGUAGACGCUUUGU 95.5%

4477 GAAUGGCUAAAGACAAGAC 93.7%

4444 ACUGCAGCGUAGACGCUUU 95.5%

4478 UCCAUGGGGCCAAAGAAAU 93.4%

4445 CUGCAGCGUAGACGCUUUG 95.5%

4479 GCGUAGACGCUUUGUCCAA 93.4%

4446 GCUAUGGAGCAAAUGGCUG 95.4%

4480 CAGUUAUUCUGCUGGUGCA 93.4%

4447 UGGCUAAAGACAAGACCAA 95.4%

4481 UUCCAUGGGGCCAAAGAAA 93.4%

4448 CAGCGUAGACGCUUUGUCC 95.4%

4482 CACCUCUGACUAAGGGGAU 93.2%

4449 CUAUGGAGCAAAUGGCUGG 95.4%

4483 GUCACCUCUGACUAAGGGG 93.2%

4450 AGCGUAGACGCUUUGUCCA 95.4%

4484 UCACCUCUGACUAAGGGGA 93.2%

4451 GCAGCGUAGACGCUUUGUC 95.4%

4485 CUGGUGCACUUGCCAGUUG 93.0%

4452 GGCUAAAGACAAGACCAAU 95.4%

4486 GAGGCUCUCAUGGAAUGGC 93.0%

4453 CGUAGACGCUUUGUCCAAA 94.9%

4487 GCUCUCAUGGAAUGGCUAA 93.0%

4454 GUAGACGCUUUGUCCAAAA 94.9%

4488 CUCUCAUGGAAUGGCUAAA 93.0%

4455 AGACGCUUUGUCCAAAAUG 94.8%

4489 AUGGGGCCAAAGAAAUAGC 93.0%

4456 UAGACGCUUUGUCCAAAAU 94.8%

4490 AGGCUCUCAUGGAAUGGCU 93.0%

4457 GACGCUϋUGUCCAAAAUGC 94.7%

4491 GGCUCUCAUGGAAUGGCUA 92.9%

4458 ACGCUUUGUCCAAAAUGCC 94.3%

4492 UUGAGGCUCUCAUGGAAUG 92.9% 4546 GAUCUUCUUGAAAAUUUGC 88.6%

4493 UGAGGCUCUCAUGGAAUGG 92.9% 4547 UGAUCUUCUUGAAAAUUUG 88.6%

4494 CUUGAGGCUCUCAUGGAAU 92.9% 4548 CAGAAACGAAUGGGGGUGC 88.5%

4495 CAUGGGGCCAAAGAAAUAG 92.9% 4549 ACGAAUGGGGGUGCAGAUG 88.5%

4496 GCUGGUGCACUUGCCAGUU 92.9% 4550 AGAAACGAAUGGGGGUGCA 88.5%

4497 CUUCUACGGAAGGAGUACC 92.8% 4551 AAACGAAUGGGGGUGCAGA 88.4%

4498 UCUUGAGGCUCUCAUGGAA 92.7% 4552 AACGAAUGGGGGUGCAGAU 88.4%

4499 CUGCUGGUGCACUUGCCAG 92.7% 4553 UGGAGCAAAUGGCUGGAUC 88.4%

4500 UGCUGGUGCACUUGCCAGU 92.7% 4554 AUGGAGCAAAUGGCUGGAU 88.4%

4501 UCUGCUGGUGCACUUGCCA 92.7% 4555 GAAACGAAUGGGGGUGCAG 88.2%

4502 CCAUGGGGCCAAAGAAAUA 92.6% 4556 AUCGUCUUUUUUUCAAAUG 88.1%

4503 AGUCUAUGAGGGAAGAAUA 92.6% 4557 CCUAUCAGAAACGAAUGGG 88.1%

4504 CCUUCUACGGAAGGAGUAC 92.5% 4558 UAUGGAGCAAAUGGCUGGA 88.1%

4505 GUCUAUGAGGGAAGAAUAU 92.4% 4559 GAUCGUCUUUUUUUCAAAU 88.0%

4506 CUAUGAGGGAAGAAUAUCG 92.4% 4560 UGAUCGUCUUUUUUUCAAA 88.0%

4507 UCUAUGAGGGAAGAAUAUC 92.4% 4561 UUGAUCGUCUUUUUUUCAA 88.0%

4508 GUCAUUUUGUCAGCAUAGA 92.2% 4562 UCGUCUUϋUUUUCAAAUGC 87.8%

4509 GAGUCUAUGAGGGAAGAAU 92.2% 4563 GAUUCUUGAUCGUCUUUUU 87.5%

4510 UCAUUUUGUCAGCAUAGAG 92.1% 4564 AUUCUUGAUCGUCUUUUUU 87.4%

4511 AGACUUGAAGAUGUCUUUG 91.9% 4565 CUUGAUCGUCUUUUUUUCA 87.2%

4512 GACUUGAAGAUGUCUUUGC 91.9% 4566 UUGGGACUCAUCCUAGCUC 87.2%

4513 AGAGACUUGAAGAUGUCUU 91.9% 4567 AUUGGGAUCUUGCACUUGA 87.1%

4514 GAGACUUGAAGAUGUCUUU 91.9% 4568 UCUUGAUCGUCUUUUUUUC 87.1%

4515 CAGAGACUUGAAGAUGUCU 91.9% 4569 UUCUUGAUCGUCUUUUUUU 87.1%

4516 GUGCAGAUGCAACGAUUCA 91.7% 4570 GAUGAUCUUCUUGAAAAUU 87.0%

4517 UUCUGCUGGUGCACUUGCC 91.7% 4571 UAUCGAAAGGAACAGCAGA 87.0%

4518 UGCAGAUGCAACGAUUCAA 91.6% 4572 AGAUGAUCUUCUUGAAAAU 86.9%

4519 UAUGAGGGAAGAAUAUCGA 91.5% 4573 UUGGGAUCUUGCACUUGAU 86.9%

4520 AUGGGGGUGCAGAUGCAAC 91.3% 4574 UGGGAUCUUGCACUUGAUA 86.9%

4521 UGGGGGUGCAGAUGCAACG 91.3% 4575 UCAUUGGGAUCUUGCACUU 86.6%

4522 ACCUCUGACUAAGGGGAUU 91.3% 4576 CAUUGGGAUCUUGCACUUG 86.5%

4523 UGGAUUCUUGAUCGUCUUU 91.1% 4577 CAGAUCUUGAGGCUCUCAU 86.4%

4524 GGAUUCUUGAUCGUCUUUU 91.1% 4578 CACAGAUCUUGAGGCUCUC 86.4%

4525 GGGGUGCAGAUGCAACGAU 90.8% 4579 CCUGAGUCUAUGAGGGAAG 86.4%

4526 GGGUGCAGAUGCAACGAUU 90.8% 4580 ACAGAUCUUGAGGCUCUCA 86.4%

4527 GUGGAUUCUUGAUCGUCUU 90.8% 4581 AUCAUUGGGAUCUUGCACU 86.4%

4528 GGUGCAGAUGCAACGAUUC 90.8% 4582 CUGAGUCUAUGAGGGAAGA 86.4%

4529 GGGGGUGCAGAUGCAACGA 90.8% 4583 AGAUCUUGAGGCUCUCAUG 86.4%

4530 CUCUGACUAAGGGGAUUUU 90.7% 4584 ACACAGAUCUUGAGGCUCU 86.4%

4531 CCUCUGACUAAGGGGAUUU 90.7% 4585 ACAGCUAAGGCUAUGGAGC 86.3%

4532 UGUGGAUUCUUGAUCGUCU 90.7% 4586 CAGCUAAGGCUAUGGAGCA 86.3%

4533 UCUUCUUGAAAAUUUGCAG 90.5% 4587 AACACAGAUCUUGAGGCUC 86.3%

4534 AUGGCUAAAGACAAGACCA 90.5% 4588 GGCUAUGGAGCAAAUGGCU 86.2%

4535 AUCUUCUUGAAAAUUUGCA 90.5% 4589 ACUACAGCUAAGGCUAUGG 86.2%

4536 GAAUGGGGGUGCAGAUGCA 90.4% 4590 AAGGCUAUGGAGCAAAUGG 86.2%

4537 CGAAUGGGGGUGCAGAUGC 90.2% 4591 AGGCUAUGGAGCAAAUGGC 86.2%

4538 AAUGGGGGUGCAGAUGCAA 90.2% 4592 CUACAGCUAAGGCUAUGGA 86.2%

4539 UUGAAGAUGUCUUUGCUGG 89.9% 4593 CCAGCACUACAGCUAAGGC 86.1%

4540 CUUGAAGAUGUCUUUGCUG 89.9% 4594 AGCACUACAGCUAAGGCUA 86.1%

4541 ACUUGAAGAUGUCUUUGCU 89.7% 4595 CACUACAGCUAAGGCUAUG 86.1%

4542 GAUCCAAAUAACAUGGACA 89.2% 4596 GCCAGCACUACAGCUAAGG 86.1%

4543 AUGAUCUUCUUGAAAAUUU 89.0% 4597 GCACUACAGCUAAGGCUAU 86.1%

4544 AGAUUGCUGACUCCCAGCA 88.8% 4598 CUAAGGCUAUGGAGCAAAU 86.0%

4545 CAGAUUGCUGACUCCCAGC 88.8% 4599 AAUAUCGAAAGGAACAGCA 86.0%

4G00 ' UAAGGCUAUGGAGCAAAUG 86.0% 4654 UCAGAAACGAAUGGGGGUG 84.1%

4601 GCUAAGGCUAUGGAGCAAA 86.0% 4655 CUAUCAGAAACGAAUGGGG 84.1%

4602 GUGAACAGAUUGCUGACUC 86.0% 4656 CCAAAAUGCCCUCAAUGGG 84.1%

4603 UACAGCUAAGGCUAUGGAG 86.0% 4657 UAUCAGAAACGAAUGGGGG 84.1%

4604 UGUGAACAGAUUGCUGACU 86.0% 4658 ACGGAAGGAGUACCUGAGU 84.0%

4605 CAGCACUACAGCUAAGGCU 86.0% 4659 UUUGGCCUGGUAUGUGCAA 84.0%

4606 GAAUAUCGAAAGGAACAGC 86.0% 4660 UACGGAAGGAGUACCUGAG 83.9%

4607 ACAGAUUGCUGACUCCCAG 85.9% 4661 CGGAAGGAGUACCUGAGUC 83.9%

4608 UGAACAGAUUGCUGACUCC 85.9% 4662 UCUACGGAAGGAGUACCUG 83.7%

4609 GAACAGAUUGCUGACUCCC 85.9% 4663 CACUUGCCAGUUGCAUGGG 83.7%

4610 AACAGAUUGCUGACUCCCA 85.9% 4664 GCACUUGCCAGUUGCAUGG 83.7%

4611 UAUCAUUGGGAUCUUGCAC 85.8% 4665 CUACGGAAGGAGUACCUGA 83.7%

4612 ACAUGAGAACAGAAUGGUU 85.8% 4666 UUCUACGGAAGGAGUACCU 83.7%

4613 UACCUGAGUCUAUGAGGGA 85.6% 4667 GGAAGGAGUACCUGAGUCU 83.7%

4614 AUAUCGAAAGGAACAGCAG 85.5% 4668 GGUGCACUUGCCAGUUGCA 83.6%

4615 AUGAGGGAAGAAUAUCGAA 85.4% 4669 GUGCACUUGCCAGUUGCAU 83.6%

4616 UUGCUAGUCAGGCCAGGCA 85.4% 4670 UGCACUUGCCAGUUGCAUG 83.6%

4617 UGAGGGAAGAAUAUCGAAA 85.3% 4671 AGUACCUGAGUCUAUGAGG 83.5%

4618 GGGAAGAAUAUCGAAAGGA 85.3% 4672 GGAGUACCUGAGUCUAUGA 83.5%

4619 GGAAGAAUAUCGAAAGGAA 85.2% 4673 GAGUACCUGAGUCUAUGAG 83.5%

4620 AGGGAAGAAUAUCGAAAGG 85.2% 4674 CAGCAGAAUGCUGUGGAUG 83.4%

4621 AAGAAUAUCGAAAGGAACA 85.2% 4675 AGGAGUACCUGAGUCUAUG 83.3%

4622 GUACCUGAGUCUAUGAGGG 85.2% 4676 GAAGGAGUACCUGAGUCUA 83.2%

4623 GAAGAAUAUCGAAAGGAAC 85.2% 4677 ACAGCAGAAUGCUGUGGAU 83.1%

4624 UGUUCACGCUCACCGUGCC 85.2% 4678 GGAACAGCAGAAUGCUGUG 83.1% 4525 AGAAUAUCGAAAGGAACAG 85.2% 4679 GAACAGCAGAAUGCUGUGG 83.1%

4626 GUGUUCACGCUCACCGUGC 85.2% 4680 AACAGCAGAAUGCUGUGGA 83.1%

4627 GAGGGAAGAAUAUCGAAAG 85.1% 4681 AAGGAGUACCUGAGUCUAU 83.1%

4628 GUUCACGCUCACCGUGCCC 85.1% 4682 CUUGCCAGUUGCAUGGGCC 82.7%

4629 GGCCAGCACUACAGCUAAG 85.0% 4683 UUGCCAGUUGCAUGGGCCU 82.7% 4630 UGUGUUCACGCUCACCGUG 84.9% 4684 AGGAACAGCAGAAUGCUGU 82.7%

4631 UUGUGUUCACGCUCACCGU 84.9% 4685 UGCCAGUUGCAUGGGCCUC 82.3%

4632 UUUGUGUUCACGCUCACCG 84.9% 4686 GCCAGUUGCAUGGGCCUCA 82.3%

4633 UUGUCCAAAAUGCCCUCAA 84.8% 4687 CCAGUUGCAUGGGCCUCAU 82.3%

4634 ACCUGAGUCUAUGAGGGAA 84.8% 4688 CAGUUGCAUGGGCCUCAUA 82.1%

4635 UUUGUCCAAAAUGCCCUCA 84.8% 4689 UGAGUCUAUGAGGGAAGAA 82.1% 4636 AAUGCCCUCAAUGGGAAUG 84.6% 4690 ACUUGCCAGUUGCAUGGGC 82.1%

4637 AGCUAAGGCUAUGGAGCAA 84.6% 4691 UGGUGCACUUGCCAGUUGC 82.0%

4638 AUGCCCUCAAUGGGAAUGG 84.6% 4692 AAGAUGUCUUUGCUGGGAA 81.9%

4639 AGUUGCAUGGGCCUCAUAU 84.5% 4693 UGAAGAUGUCUUUGCUGGG 81.9%

4640 GCAUGGGCCUCAUAUACAA 84.5% 4694 GAAGAUGUCUUUGCUGGGA 81.9%

4641 UGCAUGGGCCUCAUAUACA 84.5% 4695 GUAUCAUUGGGAUCUUGCA 81.5%

4642 GUUGCAUGGGCCUCAUAUA 84.5% 4696 AGUAUCAUUGGGAUCUUGC 81.5%

4643 CUUUGUCCAAAAUGCCCUC 84.5% 4697 AAAUGGUGCAGGCAAUGAG 81.1%

4644 UUGCAUGGGCCUCAUAUAC 84.4% 4698 GCAGAAUGCUGUGGAUGCU 81.1%

4645 UCCAAAAUGCCCUCAAUGG 84.4% 4699 GGACUCAUCCUAGCUCCAG 81.1%

4646 GUCCAAAAUGCCCUCAAUG 84.4% 4700 AGAAUGCUGUGGAUGCUGA 81.1%

4647 UGUCCAAAAUGCCCUCAAU 84.3% 4701 AGCAGAAUGCUGUGGAUGC 81.1%

4648 AGUCAUUUUGUCAGCAUAG 84.2% 4702 GAAUGCUGUGGAUGCUGAC 81.1%

4649 AAAUGCCCUCAAUGGGAAU 84.2% 4703 CGAGUAUCAUUGGGAUCUU 81.1%

4650 AAAAUGCCCUCAAUGGGAA 84.2% 4704 GGGACUCAUCCUAGCUCCA 81.1%

4651 UUGGCCUGGUAUGUGCAAC 84.2% 4705 CAGAAUGCUGUGGAUGCUG 81.1%

4652 CAAAAUGCCCUCAAUGGGA 84.2% 4706 GCGAGUAUCAUUGGGAUCU 81.1%

4653 AUCAGAAACGAAUGGGGGU 84.1% 4707 UGGGACUCAUCCUAGCUCC 81.0%

4708 CAAAUGGUGCAGGCAAUGA 81.0% 4762 GUAUGUUCUCUCUAUCGUU 77.6%

4709 GAGUAUCAUUGGGAUCUUG 81.0% 4763 GUCAGGCCAGGCAAAUGGU 77.6%

4710 GACUCAUCCUAGCUCCAGU 80.9% 4764 AUGUUCUCUCUAUCGUUCC 77.6%

4711 AAUGCUGUGGAUGCUGACG 80.5% 4765 UAGUCAGGCCAGGCAAAUG 77.4%

4712 AGGCAAAUGGUGCAGGCAA 80.2% 4766 CUAGUCAGGCCAGGCAAAU 77.4%

4713 GCAAAUGGUGCAGGCAAUG 80.2% 4767 GCUAGUCAGGCCAGGCAAA 77.4%

4714 GGCAAAUGGUGCAGGCAAU 80.2% 4768 UGCUAGUCAGGCCAGGCAA 77.2%

4715 AUGCUGUGGAUGCUGACGA 80.1% 4769 UUGCUGACUCCCAGCACAG 77.1%

4716 CUGACUAAGGGGAUUUUGG 79.8% 4770 AUUGCUGACUCCCAGCACA 77.1%

4717 UCUGACUAAGGGGAUUUUG 79.8% 4771 GAUUGCUGACUCCCAGCAC 77.1%

4718 UGCACUUGAUAUUGUGGAU 79.8% 4772 GCUGACUCCCAGCACAGGU 77.1%

4719 UGACUAAGGGGAUUUUGGG 79.8% 4773 CAGAUGCAACGAUUCAAGU 77.1%

4720 GCACUUGAUAUUGUGGAUU 79.8% 4774 CUGACUCCCAGCACAGGUC 77.1%

4721 UUGCACUUGAUAUUGUGGA 79.6% 4775 GCAGAUGCAACGAUUCAAG 77.0%

4722 CGUAUGUUCUCUCUAUCGU 79.2% 4776 UGCUGACUCCCAGCACAGG 76.9%

4723 GAAACGUAUGUUCUCUCUA 79.2% 4777 GGAAUGGGGAUCCAAAUAA 76.8%

4724 AUAUUGUGGAUUCUUGAUC 79.2% 4778 GGGAAUGGGGAUCCAAAUA 76.8%

4725 ACGUAUGUUCUCUCUAUCG 79.2% 4779 GAGGGCCUUCUACGGAAGG 76.8%

4726 AACGUAUGUUCUCUCUAUC 79.2% 4780 AUCUUGCACUUGAUAUUGU 76.7%

4727 AAACGUAUGUUCUCUCUAU 79.2% 4781 UCUUGCACUUGAUAUUGUG 76.7%

4728 CCAGGCAAAUGGUGCAGGC 79.1% 4782 CUUGCACUUGAUAUUGUGG 76.7%

4729 GAUAUUGUGGAUUCUUGAU 79.1% 4783 AUGGGAAUGGGGAUCCAAA 76.6%

4730 UGAUAUUGUGGAUUCUUGA 79.1% 4784 AGGGCCUUCUACGGAAGGA 76.6%

4731 AGGUCGAAACGUAUGUUCU 79.1% 4785 AAUGGGAAUGGGGAUCCAA 76.6%

4732 GAGGUCGAAACGUAUGUUC 79.1% 4786 GAUGCAACGAUUCAAGUGA 76.5%

4733 GGUCGAAACGUAUGUUCUC 79.1% 4787 ACAUUCCAUGGGGCCAAAG 76.5%

4734 GUCGAAACGUAUGUUCUCU 79.1% 4788 AGAUGCAACGAUUCAAGUG 76.5%

4735 CGAAACGUAUGUUCUCUCU 79.1% 4789 AAAAGAGGGCCUUCUACGG 76.4%

4736 UCGAAACGUAUGUUCUCUC 79.1% 4790 CAUUCCAUGGGGCCAAAGA 76.4%

4737 UAUUGUGGAUUCUUGAUCG 79.0% 4791 AGGCAAAUGGUGGCAACAA 76.4%

4738 CACUUGAUAUUGUGGAUUC 78.9% 4792 GGCAAAUGGUGGCAACAAC 76.4%

4739 ACUUGAUAUUGUGGAUUCU 78.9% 4793 CAAAUGGUGGCAACAACCA 76.3%

4740 CUUGAUAUUGUGGAUUCUU 78.9% 4794 AUUCCAUGGGGCCAAAGAA 76.3%

4741 GCCAGGCAAAUGGUGCAGG 78.9% 4795 AAAUGGUGGCAACAACCAA 76.3%

4742 UUGAUAUUGUGGAUUCUUG 78.7% 4796 AACUUAAGAGGGAGAUAAC 76.2%

4743 CGCAGAGACUUGAAGAUGU 78.7% 4797 AGAGGGCCUUCUACGGAAG 76.2%

4744 GCGCAGAGACUUGAAGAUG 78.7% 4798 UGGGAAUGGGGAUCCAAAU 76.2%

4745 GGCCAGGCAAAUGGUGCAG 78.6% 4799 AAGAGGGCCUUCUACGGAA 76.2%

4746 AGGCCAGGCAAAUGGUGCA 78.6% 4800 AAACUUAAGAGGGAGAUAA 76.2%

4747 AUUGUGGAUUCUUGAUCGU 78.5% 4801 AAAGAGGGCCUUCUACGGA 76.2%

4748 UUGUGGAUUCUUGAUCGUC 78.5% 4802 CAGUCAUUUUGUCAGCAUA 76.1%

4749 CAGGCAAAUGGUGCAGGCA 78.4% 4803 CCUGUCACCUCUGACUAAG 76.0%

4750 GGGCCUUCUACGGAAGGAG 78.4% 4804 GCAAAUGGUGGCAACAACC 76.0%

4751 GGCCUUCUACGGAAGGAGU 78.4% 4805 UGCUGUGGAUGCUGACGAC 75.9%

4752 CGCGCAGAGACUUGAAGAU 78.3% 4806 ACAGUCAUUUUGUCAGCAU 75.8%

4753 UCGCGCAGAGACUUGAAGA 78.3% 4807 AUCAGGCCCCCUCAAAGCC 75.6%

4754 AUCGCGCAGAGACUUGAAG 78.2% 4808 CCAUCAGGCCCCCUCAAAG 75.6%

4755 CAGGCCAGGCAAAUGGUGC 78.1% 4809 GGAUCUUGCACUUGAUAUU 75.6%

4756 GCAGAGACUUGAAGAUGUC 77.9% 4810 CAUCAGGCCCCCUCAAAGC 75.6%

4757 AUCCUGUCACCUCUGACUA 77.9% 4811 GGGAUCUUGCACUUGAUAU 75.6%

4758 UCCUGUCACCUCUGACUAA 77.8% 4812 GAUCUUGCACUUGAUAUUG 75.5%

4759 UCAGGCCAGGCAAAUGGUG 77.8% 4813 GGAUGCUGACGACAGUCAU 75.4%

4760 UAUGUUCUCUCUAUCGUUC 77.6% 4814 GCUGUGGAUGCUGACGACA 75.4%

4761 AGUCAGGCCAGGCAAAUGG 77.6% 4815 AUGCUGACGACAGUCAUUU 75.4%

4816 UGGAUGCUGACGACAGUCA 75.4% 4870 UUGAAAAUUUGCAGACCUA 74.0%

4817 GAUGCUGACGACAGUCAUU 75.4% 4871 CUUCUUGAAAAUUUGCAGA 74.0%

4818 CUGUGGAUGCUGACGACAG 75.3% 4872 UCUUGAAAAUUUGCAGACC 74.0%

4819 UGCUGACGACAGUCAUUUU 75.3% 4873 CUUGAAAAUUUGCAGACCU 74.0%

4820 UGUGGAUGCUGACGACAGU 75.3% 4874 UUCUUGAAAAUUUGCAGAC 74.0%

4821 GUGGAUGCUGACGACAGUC 75.3% 4875 UCGAAAGGAACAGCAGAAU 73.9%

4822 GCCUUCUACGGAAGGAGUA 75.2% 4876 CCCAGUGAGCGAGGACUGC 73.7%

4823 GCUGACGACAGUCAUUUUG 75.1% 4877 CAUUUUGUCAGCAUAGAGC 73.7%

4824 ACGACAGUCAUUUUGUCAG 75.0% 4878 CCAGUGAGCGAGGACUGCA 73.7%

4825 GACGACAGUCAUUUUGUCA 75.0% 4879 GUGCCCAGUGAGCGAGGAC 73.6%

4826 UGACGACAGUCAUUUUGUC 75.0% 4880 GCCCAGUGAGCGAGGACUG 73.6%

4827 CUGACGACAGUCAUUUUGU 75.0% 4881 UGCCCAGUGAGCGAGGACU 73.6%

4828 GACAGUCAUUUUGUCAGCA 75.0% 4882 CUUAAGAGGGAGAUAACAU 73.5%

4829 CGACAGUCAUUUUGUCAGC 75.0% 4883 UUAAGAGGGAGAUAACAUU 73.5% 4830 UGGUGGCAACAACCAAUCC 75.0% 4884 CACCGUGCCCAGUGAGCGA 73.4%

4831 AAUGGUGGCAACAACCAAU 74.9% 4885 AAUGGUGCAGGCAAUGAGA 73.4%

4832 GGUGGCAACAACCAAUCCA 74.9% 4886 AAGAGGGAGAUAACAUUCC 73.4%

4833 AUGGUGGCAACAACCAAUC 74.9% 4887 CGUGCCCAGUGAGCGAGGA 73.4%

4834 AAGACAAGACCAAUCCUGU 74.8% 4888 CCGUGCCCAGUGAGCGAGG 73.4%

4835 CAAGACCAAUCCUGUCACC 74.8% 4889 GAGGGAGAUAACAUUCCAU 73.4%

4836 UAGGCAAAUGGUGGCAACA 74.8% 4890 AGAGGGAGAUAACAUUCCA 73.4%

4837 GACAAGACCAAUCCUGUCA 74.8% 4891 ACCGUGCCCAGUGAGCGAG 73.4%

4838 AGACAAGACCAAUCCUGUC 74.8% 4892 UGUUCUCUCUAUCGUUCCA 73.1%

4839 GACCAAUCCUGUCACCUCU 74.8% 4893 AAAAACACAGAUCUUGAGG 73.1%

4840 AGACCAAUCCUGUCACCUC 74.8% 4894 AUOUUGUCAGCAUAGAGCU 73.1%

4841 GCUAAAGACAAGACCAAUC 74.8% 4895 UUCUCUCUAUCGUUCCAUC 73.1%

4842 ACAAGACCAAUCCUGUCAC 74.8% 4896 GUUCUCUCUAUCGUUCCAU 73.1%

4843 AAGACCAAUCCUGUCACCU 74.8% 4897 GGAAAAACACAGAUCUUGA 73.0%

4844 AUAGGCAAAUGGUGGCAAC 74.7% 4898 AAACACAGAUCUUGAGGCU 73.0%

4845 UAAAGACAAGACCAAUCCU 74.7% 4899 GGGAAAAACACAGAUCUUG 73.0%

4846 AAAGACAAGACCAAUCCUG 74.7% 4900 AAAACACAGAUCUUGAGGC 73.0%

4847 CUAAAGACAAGACCAAUCC 74.7% 4901 UAAGAGGGAGAUAACAUUC 73.0%

4848 CAAUCCUGUCACCUCUGAC 74.6% 4902 GAUGUCUUUGCUGGGAAAA 72.9%

4849 CAUAGGCAAAUGGUGGCAA 74.6% 4903 GAAAAACACAGAUCUUGAG 72.9% 4850 AAGGAACAGCAGAAUGCUG 74.6% 4904 UCUCUCUAUCGUUCCAUCA 72.8%

4851 CCAAUCCUGUCACCUCUGA 74.6% 4905 UCCAUCAGGCCCCCUCAAA 72.8%

4852 ACAACCAAUCCAUUAAUAA 74.5% 4906 UUCCAUCAGGCCCCCUCAA 72.8%

4853 AUAACAUUCCAUGGGGCCA 74.4% 4907 CUCUCUAUCGUUCCAUCAG 72.8%

4854 UAACAUUCCAUGGGGCCAA 74.4% 4908 GUUCCAUCAGGCCCCCUCA 72.8% 4855 AGAUAACAUUCCAUGGGGC 74.4% 4909 UCUAUCGUUCCAUCAGGCC 72.8%

4856 GAGAUAACAUUCCAUGGGG 74.4% 4910 UGUCUUUGCUGGGAAAAAC 72.8%

4857 AACAUUCCAUGGGGCCAAA 74.3% 4911 UAUCGUUCCAUCAGGCCCC 72.8%

4858 CAACAACCAAUCCAUUAAU 74.3% 4912 GUCUUUGCUGGGAAAAACA 72.8%

4859 AACAACCAAUCCAUUAAUA 74.3% 4913 AUGUCUUUGCUGGGAAAAA 72.8%

4860 ACCAAUCCUGUCACCUCUG 74.2% 4914 UCUUUGCUGGGAAAAACAC 72.8%

4861 GAAAGGAACAGCAGAAUGC 74.2% 4915 CGUUCCAUCAGGCCCCCUC 72.8%

4862 AAAGGAACAGCAGAAUGCU 74.2% 4916 CUAUCGUUCCAUCAGGCCC 72.8%

4863 GGGAGAUAACAUUCCAUGG 74.1% 4917 CUCUAUCGUUCCAUCAGGC 72.8%

4864 CGAAAGGAACAGCAGAAUG 74.1% 4918 UCGUUCCAUCAGGCCCCCU 72.8%

4865 GGAGAUAACAUUCCAUGGG 74.1% 4919 AUCGUUCCAUCAGGCCCCC 72.8%

4866 AAUCCUGUCACCUCUGACU 74.1% 4920 UCUCUAUCGUUCCAUCAGG 72.8%

4867 AGGGAGAUAACAUUCCAUG 74.1% 4921 GGAUCAAGUGAGCAGGCAG 72.7%

4868 GAUAACAUUCCAUGGGGCC 74.1% 4922 CUAACCGAGGUCGAAACGU 72.7%

4869 AUCGAAAGGAACAGCAGAA 74.1% 4923 AUGGCUGGAUCAAGUGAGC 72.7%

4924 GAUCAAGUGAGCAGGCAGC 72.7% 4973 GGCUGGAUCAAGUGAGCAG 71.5%

4925 UGGCUGGAUCAAGUGAGCA 72.7% 4974 UAGCACUCAGUUAUUCUGC 71.5%

4926 UUCUAACCGAGGUCGAAAC 72.6% 4975 AUAGCACUCAGUUAUUCUG 71.4%

4927 UAACCGAGGUCGAAACGUA 72.6% 4976 AAGAAAUAGCACUCAGUUA 71.4%

4928 UCUAACCGAGGUCGAAACG 72.6% 4977 GCUGGAUCAAGUGAGCAGG 71.4%

4929 GCUGGGAAAAACACAGAUC 72.6% 4978 AAAGAAAUAGCACUCAGUU 71.4%

4930 UGCUGGGAAAAACACAGAU 72.6% 4979 CUGGAUCAAGUGAGCAGGC 71.4%

4931 UCAAGUGAGCAGGCAGCGG 72.6% 4980 AAUAGCACUCAGUUAUUCU 71.3%

4932 CAAGUGAGCAGGCAGCGGA 72.6% 4981 UGGAUCAAGUGAGCAGGCA 71.3%

4933 CUGGGAAAAACACAGAUCU 72.6% 4982 AGCACUCAGUUAUUCUGCU 71.3%

4934 UGGUAUGUGCAACAUGUGA 72.6% 4983 GAAAUAGCACUCAGUUAUU 71.3%

4935 UUGCUGGGAAAAACACAGA 72.6% 4984 AAAUAGCACUCAGUUAUUC 71.3%

4936 AGAUGUCUUUGCUGGGAAA 72.6% 4985 AAUGGCUGGAUCAAGUGAG 71.3%

4937 UUUGCUGGGAAAAACACAG 72.6% 4986 AAUGGGGAUCCAAAUAACA 71.2% 4938 UGGGAAAAACACAGAUCUU 72.5% 4987 AGAAAUAGCACUCAGUUAU 71.2%

4939 CUUUGCUGGGAAAAACACA 72.5% 4988 UGGGGAUCCAAAUAACAUG 71.2%

4940 CUGGUAUGUGCAACAUGUG 72.5% 4989 AUGGGGAUCCAAAUAACAU 71.2%

4941 CUUCUAACCGAGGUCGAAA 72.5% 4990 GGCCAAAGAAAUAGCACUC 71.1%

4942 CGAGGACUGCAGCGUAGAC 72.4% 4991 GCCAAAGAAAUAGCACUCA 71.1%

4943 GAGCGAGGACUGCAGCGUA 72.4% 4992 CCAAAGAAAUAGCACUCAG 71.1% 4944 GAGGACUGCAGCGUAGACG 72.4% 4993 CAAAGAAAUAGCACUCAGU 70.9%

4945 UGAGCGAGGACUGCAGCGU 72.3% 4994 GAAUGGGGAUCCAAAUAAC 70.9%

4946 GGUUUGUGUUCACGCUCAC 72.2% 4995 CAGGCAGCGGAGGCCAUGG 70.7% 4947 GGGUOUGUGUUCACGCUCA 72.2% 4996 AGGCAGCGGAGGCCAUGGA 70.7%

4948 CACUCAGUUAUUCUGCUGG 72.2% 4997 GUGAGCAGGCAGCGGAGGC 70.6%

4949 AAAUGGCUGGAUCAAGUGA 72.1% 4998 AUUGCUAGUCAGGCCAGGC 70.6%

4950 GCGAGGACUGCAGCGUAGA 72.1% 4999 AGUGAGCAGGCAGCGGAGG 70.6%

4951 CAAAUGGCUGGAUCAAGUG 72.1% 5000 AAGUGAGCAGGCAGCGGAG 70.6% 4952 AGUGAGCGAGGACUGCAGC 72.1% 5001 GAGCAGGCAGCGGAGGCCA 70.5%

4953 GUGAGCGAGGACUGCAGCG 72.1% 5002 GCAGGCAGCGGAGGCCAUG 70.5%

4954 ACUCAGUUAUUCUGCUGGU 72.1% 5003 AGCAGGCAGCGGAGGCCAU 70.5%

4955 AGCGAGGACUGCAGCGUAG 72.1% 5004 UGAGCAGGCAGCGGAGGCC 70.5%

4956 GUUUGUGUUCACGCUCACC 72.0% 5005 AGGACUGCAGCGUAGACGC 70.4%

4957 CAGUGAGCGAGGACUGCAG 71.9% 5006 ACAUGUGAACAGAUUGCUG 70.4%

4958 GGAUCCAAAUAACAUGGAC 71.8% 5007 CAUGUGAACAGAUUGCUGA 70.4%

4959 GGGAUCCAAAUAACAUGGA 71.8% 5008 AUGUGAACAGAUUGCUGAC 70.2%

4960 GCACUCAGUUAUUCUGCUG 71.8% 5009 GAUAUUGAAAGAUGAGCCU 70.2%

4961 UGGCCUGGUAUGUGCAACA 71.8% 5010 UAUUGAAAGAUGAGCCUUC 70.2%

4962 GGGGAUCCAAAUAACAUGG 71.8% 5011 AGAUAUUGAAAGAUGAGCC 70.2%

4963 GCCUGGUAUGUGCAACAUG 71.8% 5012 AUUGAAAGAUGAGCCUUCU 70.2%

4964 GGCCUGGUAUGUGCAACAU 71.8% 5013 AUAUUGAAAGAUGAGCCUU 70.2%

4965 CCUGGUAUGUGCAACAUGU 71.8% 5014 AUGGUUUUGGCCAGCACUA 70.2%

4966 GAGCAAAUGGCUGGAUCAA 71.6% 5015 UGGUUUUGGCCAGCACUAC 70.2%

4967 GGAGCAAAUGGCUGGAUCA 71.6% 5016 UAGAUAUUGAAAGAUGAGC 70.1%

4968 AGCAAAUGGCUGGAUCAAG 71.6% 5017 UGUGCAACAUGUGAACAGA 70.0%

4969 GCAAAUGGCUGGAUCAAGU 71.6% 5018 CGUCUUUUUUUCAAAUGCA 70.0%

4970 UGGGGCCAAAGAAAUAGCA 71.6% 5019 UGAAAAGAGGGCCUUCUAC 70.0%

4971 GGGGCCAAAGAAAUAGCAC 71.5% 5020 GUGCAACAUGUGAACAGAU 70.0%

4972 GGGCCAAAGAAAUAGCACU 71.5%

Table 2-8: Conserved 19-nier sequences that are present in at least 70% of the Influenza A segment 2 (NSl & NS2) sequences listed in Table 1-8.

Seq 5021 AUAACACAGUUCGAGUCUC 98.2% ID Sequence Percent

5022 UAACACAGUUCGAGUCUCU 98.2% 5076 AAGAUGGCUGAUUGAAGAA 91.9%

5023 UGAAUGGAAUGAUAACACA 98.0% 5077 CUAAGGGCUUUCACCGAAG 91.9%

5024 GAAUGGAAUGAUAACACAG 98.0% 5078 AAGGGCUUUCACCGAAGAG 91.8%

5025 UGGAAUGAUAACACAGUUC 98.0% 5079 UAAGGGCUUUCACCGAAGA 91.8%

5026 AUGGAAUGAUAACACAGUU 97.9% 5080 GAUGGCUGAUUGAAGAAGU 91.7%

5027 AAUGGAAUGAUAACACAGU 97.9% 5081 AGAUGGCUGAUUGAAGAAG 91.7%

5028 UUGAAUGGAAUGAUAACAC 97.8% 5082 ACUAAGGGCUUUCACCGAA 91.4%

5029 UGAUAACACAGUUCGAGUC 97.7% 5083 UACUAAGGGCUUUCACCGA 91.3%

5030 AUGAUAACACAGUUCGAGU 97.6% 5084 AAAGCGAAUUUCAGUGUGA 91.2%

5031 GGAAUGAUAACACAGUUCG 97.6% 5085 AAGCGAAUUUCAGUGUGAU 91.0%

5032 GAAUGAUAACACAGUUCGA 97.6% 5086 GGCUUUCACCGAAGAGGGA 90.9%

5033 AAUGAUAACACAGUUCGAG 97.6% 5087 GCUUUCACCGAAGAGGGAG 90.8%

5034 CUUGAAUGGAAUGAUAACA 97.5% 5088 AUUACUAAGGGCUUUCACC 90.8%

5035 GAUAACACAGUUCGAGUCU 97.1% 5089 UUACUAAGGGCUUUCACCG 90.8%

5036 AUGUCAAAAAUGCAAUUGG 96.9% 5090 AAGAGGGAGCAAUUGUUGG 90.4%

5037 GGAUGUCAAAAAUGCAAUU 96.9% 5091 GAAGAGGGAGCAAUUGUUG 90.4% 5038 AGGAUGUCAAAAAUGCAAU 96.9% 5092 AGAGGGAGCAAUUGUUGGC 90.3%

5039 GAGGAUGUCAAAAAUGCAA 96.8% 5093 UCACCGAAGAGGGAGCAAU 90.3%

5040 GAUGUCAAAAAUGCAAUUG 96.8% 5094 UUUCACCGAAGAGGGAGCA 90.2%

5041 UGAGGAUGUCAAAAAUGCA 96.4% 5095 UUCACCGAAGAGGGAGCAA 90.2%

5042 CGGCUUCGCCGAGAUCAGA 96.2% 5096 CUUUCACCGAAGAGGGAGC 90.1%

5043 UGUCAAAAAUGCAAUUGGG 95.6% 5097 CCGAAGAGGGAGCAAUUGU 90.0%

5044 UCUACAGAGAUUCGCUUGG 95.5% 5098 ACCGAAGAGGGAGCAAUUG 90.0%

5045 GCAAUUGGGGUCCUCAUCG 94.8% 5099 CGAAGAGGGAGCAAUUGUU 90.0% 5046 CAAUUGGGGUCCUCAUCGG 94.8% 5100 CACCGAAGAGGGAGCAAUU 90.0%

5047 UGCAAUUGGGGUCCUCAUC 94.8% 5101 UUGGGGUCCUCAUCGGAGG 89.7%

5048 UCAAAAAUGCAAUUGGGGU 94.4% 5102 AUUGGGGUCCUCAUCGGAG 88.8%

5049 GUCAAAAAUGCAAUUGGGG 94.4% 5103 AAUUGGGGUCCUCAUCGGA 87.8% 5050 AAAAAUGCAAUUGGGGUCC 94.3% 5104 AUGAGGCACUUAAAAUGAC 87.7% 5051 AAAAUGCAAUUGGGGUCCU 94.3% 5105 AACACAGUUCGAGUCUCUA 87.5%

5052 AUGCAAUUGGGGUCCUCAU 94.3% 5106 ACACAGUUCGAGUCUCUAA 87.5%

5053 CAAAAAUGCAAUUGGGGUC 94.3% 5107 GAUGAGGCACUUAAAAUGA 87.3% 5054 AAAUGCAAUUGGGGUCCUC 94.3% 5108 ACAGAAACGGAAAAUGGCG 87.1% 5055 AAUGCAAUUGGGGUCCUCA 94.2% 5109 ACAGUUCGAGUCUCUAAAA 86.9% 5056 UGAAAGCGAAUUUCAGUGU 93.8% 5110 CACAGUUCGAGUCUCUAAA 86.9%

5057 UUGAUCGGCUUCGCCGAGA 93.7% 5111 AAGCAGUAAUGAGAAUGGG 86.1%

5058 CUUGAUCGGCUUCGCCGAG 93.6% 5112 UGAGGCACUUAAAAUGACC 86.0%

5059 GAAAGCGAAUUUCAGUGUG 93.6% 5113 GAAGCAGUAAUGAGAAUGG 85.8%

5060 AUCGGCUUCGCCGAGAUCA 93.4% 5114 GGUUCAUGCUAAUGCCCAA 85.8% 5061 GAUCGGCUUCGCCGAGAUC 93.4% 5115 GAGGCACUUAAAAUGACCA 85.8% 5062 UGAUCGGCUUCGCCGAGAU 93.4% 5116 AGGCACUUAAAAUGACCAU 85.8% 5063 UCGGCUUCGCCGAGAUCAG 93.2% 5117 ACUGGUUCAUGCUAAUGCC 85.7% 5064 GAGCAAUUGUUGGCGAAAU 92.8% 5118 UGGUUCAUGCUAAUGCCCA 85.7% 5065 GGAGCAAUUGUUGGCGAAA 92.8% 5119 CUGGUUCAUGCUAAUGCCC 85.5% 5066 UUCCUUGAUCGGCUUCGCC 92.7% 5120 GGCACUUAAAAUGACCAUG 85.3%

5067 UCCUUGAUCGGCUUCGCCG 92.7% 5121 UUGAAAGCGAAUUUCAGUG 85.2%

5068 GGGAGCAAUUGUUGGCGAA 92.7% 5122 GCACUUAAAAUGACCAUGG 85.2%

5069 AGGGAGCAAUUGUUGGCGA 92.5% 5123 UCAUGCUAAUGCCCAAGCA 85.1%

5070 CCUUGAUCGGCUUCGCCGA 92.4% 5124 UACAGAGAUUCGCUUGGAG 84.7%

5071 GAGGGAGCAAUUGUUGGCG 92.3% 5125 UACUAUUGAGGAUGUCAAA 84.7%

5072 AGGGCUUUCACCGAAGAGG 92.3% 5126 ACAGAGAUUCGCUUGGAGA 84.7% 5073 GGGCUUUCACCGAAGAGGG 92.2% 5127 CAUACUAUUGAGGAUGUCA 84.7%

5074 AUAAGAUGGCUGAUUGAAG 92.1% 5128 GUUCAUGCUAAUGCCCAAG 84.7%

5075 UAAGAUGGCUGAUUGAAGA 92.1% 5129 AUACUAUUGAGGAUGUCAA 84.7%

5130 UUCAUGCUAAUGCCCAAGC 84.7% 5184 CAUCAUGUUGAAAGCGAAU 81.3%

5131 UGGCUGAUUGAAGAAGUGA 84.7% 5185 AUGCCCCAUUCCUUGAUCG 81.3%

5132 GGCUGAUUGAAGAAGUGAG 84.7% 5186 GAUGCCCCAUUCCUUGAUC 81.3%

5133 ACUAUUGAGGAUGUCAAAA 84.6% 5187 GGCGAAAUCUCACCAUUGC 81.2%

5134 ACAUACUAUUGAGGAUGUC 84.5% 5188 UUGUUGGCGAAAUCUCACC 81.1%

5135 CAGAGAUUCGCUUGGAGAA 84.5% 5189 UUGGCGAAAUCUCACCAUU 81.0%

5136 UUCCAGGACAUACUAUUGA 84.4% 5190 AGAAACGGAAAAUGGCGAG 81.0%

5137 UUGAGGAUGUCAAAAAUGC 84.4% 5191 UGUUGGCGAAAUCUCACCA 81.0% 5138 CAGGACAUACUAUUGAGGA 84.4% 5192 UGAUUGAAGAAGUGAGACA 81.0% 5139 GGACAUACUAUUGAGGAUG 84.4% 5193 GAAACGGAAAAUGGCGAGA 80.9%

5140 AGGACAUACUAUUGAGGAU 84.4% 5194 UGGCGAAAUCUCACCAUUG 80.9%

5141 CCAGGACAUACUAUUGAGG 84.4% 5195 CCAAAACAGAAACGGAAAA 80.9%

5142 GACAUACUAUUGAGGAUGU 84.4% 5196 AUUGUUGGCGAAAUCUCAC 80.9%

5143 UCCAGGACAUACUAUUGAG 84.4% 5197 AAUUGUUGGCGAAAUCUCA 80.9%

5144 AUGGCUGAUUGAAGAAGUG 84.3% 5198 UGACUAUUGAGGAAUUGUC 80.9%

5145 CUAUUGAGGAUGUCAAAAA 84.2% 5199 GUUGGCGAAAUCUCACCAU 80.9%

5146 AUUGAGGAUGUCAAAAAUG 84.2% 5200 CUAUUGAGGAAUUGUCAAG 80.8%

5147 UAUUGAGGAUGUCAAAAAU 84.2% 5201 ACUAUUGAGGAAUUGUCAA 80.8%

5148 UAUUACUAAGGGCUUUCAC 84.1% 5202 CAGAAACGGAAAAUGGCGA 80.8% 5149, CCAUUCCUUGAUCGGCUUC 84.0% 5203 GACUAUUGAGGAAUUGUCA 80.8% 515o ' AUUCCUUGAUCGGCUUCGC 83.8% 5204 ACAUCAUGUUGAAAGCGAA 80.7%

5151 GAGAUUCGCUUGGAGAAGC 83.8% 5205 CCCCAUUCCUUGAUCGGCU 80.7%

5152 AGAGAUUCGCUUGGAGAAG 83.8% 5206 ACGGAAAAUGGCGAGAACA 80.5%

5153 CAUUCCUUGAUCGGCUUCG 83.8% 5207 UGCCCCAUUCCUUGAUCGG 80.5%

5154 CUACAGAGAUUCGCUUGGA 83.7% 5208 AACGGAAAAUGGCGAGAAC 80.5%

5155 AGAUUCGCUUGGAGAAGCA 83.4% 5209 GCCCCAUUCCUUGAUCGGC 80.4%

5156 AUUCGCUUGGAGAAGCAGU 83.0% 5210 AUGACUAUUGAGGAAUUGU 80.4%

5157 GUGAUGCCCCAUUCCUUGA 83.0% 5211 AAAUCUCACCAUUGCCUUC 80.4%

5158 GAUUCGCUUGGAGAAGCAG 83.0% 5212 AAACGGAAAAUGGCGAGAA 80.4% 5159 UUGGAGAAGCAGUAAUGAG 82.7% 5213 GAAAUCUCACCAUUGCCUU 80.4%

5160 CUUGGAGAAGCAGUAAUGA 82.7% 5214 CGAAAUCUCACCAUUGCCU 80.3%

5161 GAAGAGAUAAGAUGGCUGA 82.1% 5215 AACAUCAUGUUGAAAGCGA 80.3%

5162 AAGAGAUAAGAUGGCUGAU 82.1% 5216 GCAAUUGUUGGCGAAAUCU 80.3%

5163 AGAUAAGAUGGCUGAUUGA 82.0% 5217 CAAUUGUUGGCGAAAUCUC 80.2%

5164 GAUAAGAUGGCUGAUUGAA 81.9% 5218 GCGAAAUCUCACCAUUGCC 80.2%

5165 GUUGAAAGCGAAUUUCAGU 81.9% 5219 CCCAUUCCUUGAUCGGCUU 80.2%

5166 GCUUGGAGAAGCAGUAAUG 81.9% 5220 AUUGAAGAAGUGAGACACA 80.1%

5167 UGGAGAAGCAGUAAUGAGA 81.9% 5221 AAUCUCACCAUUGCCUUCU 80.0%

5168 GAGAUAAGAUGGCUGAUUG 81.9% 5222 AACAGAAACGGAAAAUGGC 80.0%

5169 UUCGCUUGGAGAAGCAGUA 81.9% 5223 UGAUGCCCCAUUCCUUGAU 80.0%

5170 AGAGAUAAGAUGGCUGAUU 81.8% 5224 GCUGAUUGAAGAAGUGAGA 79.6%

5171 UCGCUUGGAGAAGCAGUAA 81.7% 5225 CUGAUUGAAGAAGUGAGAC 79.6%

5172 CGCUUGGAGAAGCAGUAAU 81.7% 5226 UUCUUUUCCAGGACAUACU 79.3%

5173 CAUGUUGAAAGCGAAUUUC 81.6% 5227 ACUCCAAAACAGAAACGGA 79.3%

5174 UCAUGUUGAAAGCGAAUUU 81.6% 5228 UCCCUAAGGGGAAGAGGCA 79.3%

5175 UGUUGAAAGCGAAUUUCAG 81.6% 5229 AGCAAUUGUUGGCGAAAUC 79.1%

5176 UAACUGACAUGACUAUUGA 81.6% 5230 UCUUUUCCAGGACAUACUA 79.1%

5177 AUGUUGAAAGCGAAUUUCA 81.6% 5231 GAUUGAAGAAGUGAGACAC 78.8%

5178 CAAAACAGAAACGGAAAAU 81.5% 5232 AUUCCAACACUGUGUCAAG 78.6%

5179 AAAACAGAAACGGAAAAUG 81.5% 5233 CUUCUUUUCCAGGACAUAC 78.5%

5180 AAACAGAAACGGAAAAUGG 81.5% 5234 ACCAUUGCCUUCUUUUCCA 78.5%

5181 UCCAAAACAGAAACGGAAA 81.5% 5235 CCAUUGCCUUCUUUUCCAG 78.5%

5182 CUCCAAAACAGAAACGGAA 81.4% 5236 CAUUGCCUUCUUUUCCAGG 78.4%

5183 AUCAUGUUGAAAGCGAAUU 81.4% 5237 CCUUCUUUUCCAGGACAUA 78.4%

5238 GCCUUCUUUUCCAGGACAU 78.4% 5292 CCAACACUGUGUCAAGUUU 75.3%

5239 UGCCUUCUUUUCCAGGACA 78.4% 5293 AGAAGCAGUAAUGAGAAUG 75.3%

5240 CACCAUUGCCUUCUUUUCC 78.3% 5294 AGUGGAACAGGAGAUAAGA 75.3%

5241 UUGCCUUCUUUUCCAGGAC 78.3% 5295 CUUACUCCAAAACAGAAAC 75.2%

5242 UGGAUUCCAACACUGUGUC 78.3% 5296 CUCACCAUUGCCUUCUUUU 75.1%

5243 AUGGAUUCCAACACUGUGU 78.3% 5297 UCUCACCAUUGCCUUCUUU 75.1%

5244 UCACCAUUGCCUUCUUUUC 78.3% 5298 AUCUCACCAUUGCCUUCUU 75.1%

5245 GGAUUCCAACACUGUGUCA 78.3% 5299 UCCAACACUGUGUCAAGUU 75.0%

5246 AUUGCCUUCUUUUCCAGGA 78.2% 5300 UACUCCAAAACAGAAACGG 74.8%

5247 GAUUCCAACACUGUGUCAA 78.2% 5301 UUACUCCAAAACAGAAACG 74.8%

5248 UUGAAGAAGUGAGACACAG 77.8% 5302 UUCCAACACUGUGUCAAGU 74.7%

5249 UUGAAGUGGAACAGGAGAU 77.7% 5303 AUCAGAAUGGACCAGGCAA 74.5%

5250 UUUGAAGUGGAACAGGAGA 77.7% 5304 UAAUGAGAAUGGGGGACCU 74.0%

5251 UGAAGAAGUGAGACACAGA 77.2% 5305 UAAUGGAUUCCAACACUGU 74.0%

5252 CUGACAUGACUAUUGAGGA 77.2% 5306 CAAGUUUCCAGGUAGAUUG 74.0%

5253 UGACAUGACUAUUGAGGAA 77.2% 5307 CAUAAUGGAUUCCAACACU 73.9%

5254 CAAAUAACAUUCAUGCAAG 77.0% 5308 CAGUAAUGAGAAUGGGGGA 73.9%

5255 AAAUAACAUUCAUGCAAGC 77.0% 5309 GCAGUAAUGAGAAUGGGGG 73.9%

5256 ACUGACAUGACUAUUGAGG 77.0% 5310 UCAAGUUUCCAGGUAGAUU 73.9%

5257 CUUUUCCAGGACAUACUAU 76.9% 5311 AAUGGAUUCCAACACUGUG 73.9% 5258 UUUUCCAGGACAUACUAUU 76.9% 5312 ACAUAAUGGAUUCCAACAC 73.9%

5259 UUUCCAGGACAUACUAUUG 76.8% 5313 AGUAAUGAGAAUGGGGGAC 73.8%

5260 AAACAAGUUGUAGACCAAG 76.8% 5314 AAUGAGAAUGGGGGACCUC 73.8%

5261 ACAAGUUGUAGACCAAGAA 76.7% 5315 GUAAUGAGAAUGGGGGACC 73.7%

5262 AAGUUGUAGACCAAGAACU 76.7% 5316 AGCAGUAAUGAGAAUGGGG 73.7%

5263 AACAAGUUGUAGACCAAGA 76.7% 5317 UGUCAAGUUUCCAGGUAGA 73.7%

5264 GAAUCUGAUGAGGCACUUA 76.7% 5318 CCACACCUGCUUCGCGAUA 73.6%

5265 AGAAUCUGAUGAGGCACUU 76.6% 5319 UCCACACCUGCUUCGCGAU 73.6%

5266 AACUGACAUGACUAUUGAG 76.5% 5320 ACAUGACUAUUGAGGAAUU 73.6%

5267 AAUCUGAUGAGGCACUUAA 76.5% 5321 CUCCACACCUGCUUCGCGA 73.6%

5268 UCUGAUGAGGCACUUAAAA 76.5% 5322 CACACCUGCUUCGCGAUAC 73.6%

5269 UGAUGAGGCACUUAAAAUG 76.5% 5323 UGUGUCAAGUUUCCAGGUA 73.6%

5270 CUGAUGAGGCACUUAAAAU 76.5% 5324 GACAUGACUAUUGAGGAAU 73.6%

5271 CAAGUUGUAGACCAAGAAC 76.5% 5325 GUGUCAAGUUUCCAGGUAG 73.6%

5272 CCACCCAUGUUGGAAAGCA 76.2% 5326 ACUGUGUCAAGUUUCCAGG 73.5%

5273 AUCUGAUGAGGCACUUAAA 76.2% 5327 CUGUGUCAAGUUUCCAGGU 73.5%

5274 AGCCACCCAUGUOGGAAAG 76.2% 5328 AUAAUGGAUUCCAACACUG 73.5%

5275 GCCACCCAUGUUGGAAAGC 76.2% 5329 GUCAAGUUUCCAGGUAGAU 73.5%

5276 CAGCCACCCAUGUUGGAAA 76.2% 5330 CACCUGCUUCGCGAUACAU 73.5%

5277 GAAGUGGAACAGGAGAUAA 76.0% 5331 ACACCUGCUUCGCGAUACA 73.4%

5278 AAGUGGAACAGGAGAUAAG 75.9% 5332 CAUCAGAAUGGACCAGGCA 73.4%

5279 UGAAGUGGAACAGGAGAUA 75.8% 5333 GGCUUCGCCGAGAUCAGAG 73.4%

5280 AGGAGAUAAGAACUUUCUC 75.8% 5334 GCAUCAGAAUGGACCAGGC 73.3%

5281 GAACAGGAGAUAAGAACUU 75.8% 5335 GCUUCGCGAUACAUAACUG 73.3%

5282 CAGGAGAUAAGAACUUUCU 75.8% 5336 CACUGUGUCAAGUUUCCAG 73.3%

5283 ACAGGAGAUAAGAACUUUC 75.7% 5337 ACACUGUGUCAAGUUUCCA 73.3%

5284 AACAGGAGAUAAGAACUUϋ 75.7% 5338 AACACUGUGUCAAGUUUCC 73.3%

5285 GUGGAACAGGAGAUAAGAA 75.6% 5339 CUUCGCGAUACAUAACUGA 73.3%

5286 UGGAACAGGAGAUAAGAAC 75.6% 5340 UGGACCAGGCAAUCAUGGA 73.2%

5287 GGAACAGGAGAUAAGAACU 75.6% 5341 CGGAAAAUGGCGAGAACAG 73.2%

5288 GGAGAAGCAGUAAUGAGAA 75.6% 5342 AAUGGCGAGAACAGCUAGG 73.2%

5289 GUCCCUAAGGGGAAGAGGC 75.4% 5343 CCUGCUUCGCGAUACAUAA 73.2%

5290 ACUUACUCCAAAACAGAAA 75.4% 5344 UGCAUCAGAAUGGACCAGG 73.2%

5291 GAGAAGCAGUAAUGAGAAU 75.3% 5345 UGCUUCGCGAUACAUAACU 73.2%

5346 CUGCUUCGCGAUACAUAAC 73.2% 5400 GGGACCUCCACUUACUCCA 71.3%

5347 UGGCGAGAACAGCUAGGUC 73.1% 5401 GGGGGACCUCCACUUACUC 71.3%

5348 ACCUGCUUCGCGAUACAUA 73.1% 5402 AAAACAUCAUGUUGAAAGC 71.3%

5349 AUGGCGAGAACAGCUAGGU 73.1% 5403 AGUCUCUAAAAAUCUACAG 71.2%

5350 AAAUGGCGAGAACAGCUAG 73.1% 5404 AAAUCUACAGAGAUUCGCU 71.2%

5351 UAUUGAGGAAUUGUCAAGA 73.0% 5405 UGGAGAAAAACAUCAUGUU 71.2%

5352 AUGGACCAGGCAAUCAUGG 73.0% 5406 AAAAUCUACAGAGAUUCGC 71.2%

5353 UCAGAAUGGACCAGGCAAU 72.9% 5407 GUCUCUAAAAAUCUACAGA 71.2%

5354 GCGAGAACAGCUAGGUCAA 72.8% 5408 AAAAACAUCAUGUUGAAAG 71.2%

5355 GAGAACAGCUAGGUCAAAA 72.8% 5409 GGACCAGGCAAUCAUGGAG 71.2%

5356 CGAGAACAGCUAGGUCAAA 72.8% 5410 CAUGGAGAAAAACAUCAUG 71.2%

5357 GGCGAGAACAGCUAGGUCA 72.8% 5411 UCUAAAAAUCUACAGAGAU 71.1%

5358 AAUGGACCAGGCAAUCAUG 72.7% 5412 ACCAGGCAAUCAUGGAGAA 71.1%

5359 AAGCAGCCACCCAUGUUGG 72.7% 5413 CUCUAAAAAUCUACAGAGA 71.1%

5360 GAAUGGACCAGGCAAUCAU 72.6% 5414 CGAGUCUCUAAAAAUCUAC 71.1%

5361 UCAAGAAACUGGUUCAUGC 72.6% 5415 CUGUUUGAAGUGGAACAGG 71.1% 5362 GCAGCCACCCAUGUUGGAA 72.6% 5416 AGAAAAACAUCAUGUUGAA 71.1%

5363 AGCAGCCACCCAUGUUGGA 72.6% 5417 CUAAAAAUCUACAGAGAUU 71.1%

5364 CAAGAAACUGGUUCAUGCU 72.6% 5418 GAGAAAAACAUCAUGUUGA 71.1% 5365 GACCUCCACUUACUCCAAA 72.5% 5419 UGAGGAAUUGUCAAGAAAC 71.1%

5366 AACUGGUUCAUGCUAAUGC 72.5% 5420 GAAAAACAUCAUGUUGAAA 71.1%

5367 AGAAUGGACCAGGCAAUCA 72.5% 5421 AUGGAGAAAAACAUCAUGU 71.1% 5368 CACUUACUCCAAAACAGAA 72.5% 5422 UGUUUGAAGUGGAACAGGA 71.1% 5369 UCCACUUACUCCAAAACAG 72.4% 5423 UUCGAGUCUCUAAAAAUCU 71.1%

5370 CAGAAUGGACCAGGCAAUC 72.4% 5424 UCGAGUCUCUAAAAAUCUA 71.1%

5371 CCACUUACUCCAAAACAGA 72.3% 5425 UUGAGGAAUUGUCAAGAAA 71.1% 5372 GAAACUGGUUCAUGCUAAU 72.3% 5426 GGAAUUGUCAAGAAACUGG 71.1%

5373 AGAAACUGGUUCAUGCUAA 72.3% 5427 GUUCGAGUCUCUAAAAAUC 71.1%

5374 AAACUGGUUCAUGCUAAUG 72.3% 5428 GACCAGGCAAUCAUGGAGA 71.1% 5375 CUCCACUUACUCCAAAACA 72.3% 5429 AGGAAUUGUCAAGAAACUG 71.1%

5376 CAUGACUAUUGAGGAAUUG 72.2% 5430 GAGUCUCUAAAAAUCUACA 71.1%

5377 AAGAAACUGGUUCAUGCUA 72.2% 5431 UCUCUAAAAAUCUACAGAG 71.1%

5378 CAGGUAGAUUGCUUUCUUU 72.1% 5432 GGAGAAAAACAUCAUGUUG 71.1% 5379 CCUCCACUUACUCCAAAAC 72.1% 5433 GAGGAAUUGUCAAGAAACU 71.1%

5380 ACCUCCACUUACUCCAAAA 72.1% 5434 AAUUGUCAAGAAACUGGUU 71.0%

5381 AGGUAGAUUGCUUUCUUUG 72.1% 5435 CAAAGACAUAAUGGAUUCC 71.0%

5382 GGUAGAUUGCUUUCUUUGG 72.1% 5436 GAAUUGUCAAGAAACUGGU 71.0%

5383 UAGAUUGCUUUCUUUGGCA 72.0% 5437 UGUCAAGAAACUGGUUCAU 71.0% 5384 GUAGAUUGCUϋUCUUUGGC 72.0% 5438 UGAGAAUGGGGGACCUCCA 71.0%

5385 CAACACUGUGUCAAGUUUC 71.9% 5439 AUGAGAAUGGGGGACCUCC 71.0%

5386 AGAUUGCUUσCUUUGGCAU 71.9% 5440 UGCUGUUUGAAGUGGAACA 71.0%

5387 AACAGCUAGGUCAAAAGUU 71.8% 5441 GUCAAGAAACUGGUUCAUG 71.0%

5388 AAUCUACAGAGAUUCGCUU 71.8% 5442 CUGCUGUUUGAAGUGGAAC 71.0% 5389 GAACAGCUAGGUCAAAAGU 71.8% 5443 UUCCAGGUAGAUUGCUUUC 70.9% 5390 AUCUACAGAGAUUCGCUUG 71.7% 5444 GGACCUCCACUUACUCCAA 70.9% 5391 AGUUUCCAGGUAGAUUGCU 71.6% 5445 CCAGGUAGAUUGCUUUCUU 70.9% 5392 GUUUCCAGGUAGAUUGCUU 71.6% 5446 AUUGAGGAAUUGUCAAGAA 70.9%

5393 AAGUUUCCAGGUAGAUUGC 71.5% 5447 AUUGUCAAGAAACUGGUUC 70.9%

5394 AGAACAGCUAGGUCAAAAG 71.5% 5448 UUGUCAAGAAACUGGUUCA 70.9%

5395 UGGGGGACCUCCACUUACU 71.5% 5449 UAAAAAUCUACAGAGAUUC 70.9%

5396 AUGGGGGACCUCCACUUAC 71.4% 5450 UCCAGGUAGAUUGCUUUCU 70.9%

5397 AGUUCGAGUCUCUAAAAAU 71.3% 5451 AGUUUUGAGCAAAUAACAU 70.8%

5398 GGGGACCUCCACUUACUCC 71.3% 5452 GCUGUUUGAAGUGGAACAG 70.8%

5399 CAGUUCGAGUCUCUAAAAA 71.3% 5453 CAGGCAAUCAUGGAGAAAA 70.7%

5454 AAUAGUUUUGAGCAAAUAA 70.7% 5469 GAAUGGGGGACCUCCACUU 70.2%

5455 UUUCCAGGUAGAUUGCUUU 70.7% 5470 CAAUCAUGGAGAAAAACAU 70.1%

5456 AGAAUGGGGGACCUCCACU 70.7% 5471 AAUCAUGGAGAAAAACAUC 70.1%

5457 AAACAUCAUGUUGAAAGCG 70.7% 5472 GGCAAUCAUGGAGAAAAAC 70.1%

5458 GAGAAUGGGGGACCUCCAC 70.7% 5473 GCAΆUCAUGGAGAAAAACA 70.1%

5459 GUUUUGAGCAAAUAACAUU 70.7% 5474 UUUGCAUCAGAAUGGACCA 70.1%

5460 AAAAAUCUACAGAGAUUCG 70.6% 5475 UUGCAUCAGAAUGGACCAG 70.1%

5461 UAGUAUUACUAAGGGCUUU 70.6% 5476 CUCUCGGUCUAGACAUCAA 70.1%

5462 AUAGUUUUGAGCAAAUAAC 70.6% 5477 UCUCGGUCUAGACAUCAAA 70.1%

5463 GUAUUACUAAGGGCUUUCA 70.5% 5478 AAUGGGGGACCUCCACUUA 70.1%

5464 UAGUUUUGAGCAAAUAACA 70.5% 5479 AUCAUGGAGAAAAACAUCA 70.1%

5465 AGUAUUACUAAGGGCUUUC 70.5% 5480 CUUUGCAUCAGAAUGGACC 70.0%

5466 CCAGGCAAUCAUGGAGAAA 70.4% 5481 CCGAGAUCAGAGGUCCCUA 70.0%

5467 AGGCAAUCAUGGAGAAAAA 70.2% 5482 GUUUGAAGUGGAACAGGAG 70.0%

5468 ACUCUCGGUCUAGACAUCA 70.2% 5483 CGAGAUCAGAGGUCCCUAA 70.0%

Our research indicates that the RNAi mechanism can tolerate a small number of mismatches between the target RNA and the antisense guide sequence of the siRNA duplex. Thus, a single siRNA duplex targeting a highly conserved site in influenza will often still be active against minor variant species having only one or a few mismatches relative to the highly conserved site. We have made use of this observation to expand the list of potential influenza A viral sequence variants that are targetable by a given siRNA duplex.

In Table 3, the top nine siRNA sites identified from laboratory screening studies have been extracted from Table 20.

Table 3: Top nine siRNA sites as identified from laboratory screening studies, showing conserved and minor variant 19-mer sequences from the Influenza A as defined in Tables 20-1 through 20-6.

Seq Ref %

ID ID Segment Match Seq Total

7 7 PB2 AGACAGCGACCAAAAGAAU 99.1%

5503 7 AGACAGCGACCAAAAGgAU 0.3%

5504 7 AGACAGCGACCAAAgGAAU 0.1%

5505 7 AGACAGCGACCAAAAGAUU 0.1%

5506 7 AGACgaCGAuCAAAAGAAU 0.1%

17 17 PB2 ACUGACAGCCAGACAGCGA 99.0%

5543 17 ACUGACAGuCAGACAGCGA 0.2%

5544 17 ACUGAuAGCCAGACAGCGA 0.3%

5545 17 ACcGACAGCCAGACAGCGA 0.2%

5546 17 ACUGACAGCCAGACgaCGA 0.1%

48 48 PB2 CGGGACUCUAGCAUACUUA 98.0%

5726 48 CGGGACUCUAGCAUgCUUA 0.1%

5727 48 CGGGACUuUAGCAUACUUA 0.2%

5728 48 aGGGACUCUAGCAUACUUA 0.1%

5729 48 CGaGACUCUAGCAUACUUA 0.4%

5730 48 CGGaACUCUAGCAUACUUA 0.6%

5731 48 CGGGACUaUAGCAUACUUA 0.1%

5732 48 CGGGACUCCAGCAUACUUA 0.3%

5733 48 CGGGACUCUAaCAUACUUA 0.1%

1187 1187 PBl GAUCUGUUCCACCAUUGAA 90.9%

6265 1187 GAUCUGUUuCACCAUUGAA 0.7%

6266 1187 aAUCUGUUCCACCAUUGAA 0.1%

6267 1187 GACCUGUUCCACCAUUGAA 6.9%

6268 1187 GAUCUGcUCCACCAUUGAA 0.2%

6269 1187 GAUCUGUUaCACCAtJUGAA 0.1%

6270 1187 GAUCUGUUCCACCAUUaAA 0.1%

6271 1187 GAcCUGUUCuACCAUUGAA 0.1%

6272 1187 GAcCUGcUCCACCAUUGAA 0.3%

1206 1206 PBl AUGAAGAUCUGUUCCACCA 88.6%

6467 1206 AUGAAGAUCUGUUUCACCA 0.7%

6468 1206 AUGAgGAUCUGUUCCACCA 0.2%

6469 1206 AcGAAGAUCUGUUCCACCA 0.1%

6470 1206 AUGAAaAUCUGUUCCACCA 0.1%

6471 1206 AUGAAGAcCUGUUCCACCA 6.8%

6472 1206 AUGAAGAUCUGCUCCACCA 0.2%

6473 1206 AUGAAGAUCUGUUaCACCA 0.1%

6474 1206 cUGAAGAUCUGUUCCACCA 0.1%

6475 1206 uUGAAGAUCUGUUCCACCA 2.0%

6476 1206 AUGAAGAcCUGUUCuACCA 0.1%

6477 1206 AUaAAGAcCUGUUCCACCA 0.1%

6478 1206 AUGAAGAcCUGcUCCACCA 0.2%

2393 2393 PA UUGAGGAGUGCCUGAUUAA 98.7%

6888 2393 UUGAGGAGUGCCUGgUUAA 0.1%

6889 2393 UUGAGGAaUGCCUGAUUAA 1.0%

6890 2393 UUGAGGAGUGCCUaAUUAA 0.2%

2394 2394 PA GCAAUUGAGGAGUGCCUGA 98.6%

6891 2394 GCAAUUGAGGAGUGCCUGg 0.1%

6892 2394 GCAgUUGAGGAGUGCCUGA 0.1%

6893 2394 GCAAUUGAGGAaUGCCUGA 1.0%

6894 2394 GCAAUUGAGGAGUGCCUaA 0.2%

3560 3560 NP GAUCUUAUUUCUUCGGAGA 96.0%

8041 3560 GAUCUUAUUUCUUCGGgGA 1.7%

8042 3560 GAUCUUAUUUCUUUGGAGA 0.2%

8043 3560 GgUCUUAUUUCUUCGGAGA 0.9%

8044 3560 GuUCUUAUUUCUUCGGAGA 1.1%

3561 3561 NP GGAUCUUAUUUCUUCGGAG 96.0%

8045 3561 GGAUCUUAUUUCUUCGGgG 1.7%

8046 3561 GGAUCUUAUUUCUUUGGAG 0.2%

8047 3561 GGgUCUUAUUUCUUCGGAG 0.9%

8048 3561 GGuUCUUAUUUCUUCGGAG 1.1%

Madin-Darby canine kidney cells (MDCK) were used. For electroporation, the cells were kept in serum-free RPMI 1640 medium. Virus infections were done in

infection medium. Influenza viruses A/PR/8/34 (PR8) and A/WSN/33 (WSN), subtypes HlNl were used. Sense and antisense sequences that were tested are listed in Table 4.

Table 4. siRNA Sequences

All siRNAs were synthesized by Dharmacon Research (Lafayette, CO) using

2'ACE protection chemistry and transfected into the cells by electroporation. Six to eight h following electroporation, the serum- containing medium was washed away and PR8 or WSN virus at the appropriate multiplicity of infection was inoculated into the wells. Cells were infected with either 1,000 PFU (one virus per 1,000 cells; MOI = 0.001) or 10,000 PFU (one virus per 100 cells; MOI = 0.01) of virus. After 1 h incubation at room temperature, 2 ml of infection medium with 4 μg/ml of trypsin was added to each well and the cells were incubated and, at indicated times, supernatants were harvested from infected cultures and the titer of virus was determined by hemagglutination of chicken erythrocytes. Supernatants were harvested at 24, 36, 48, and 60 hours after infection. Viral titer was measured using a standard hemagglutinin assay as described in Knipe DM, Howley, PM, Fundamental Virology, 4th edition, p. 34-35. The hemagglutination assay was done in V-bottomed 96-well plates. Serial 2-fold dilutions of each sample were incubated for Ih on ice with an equal volume of a 0.5% suspension of chicken erythrocytes (Charles River Laboratories). Wells containing an adherent, homogeneous layer of erythrocytes were scored as positive. For plaque assays, serial 10-fold dilutions of each sample were titered for virus as described in Fundamental Virology, 4 th edition, p.32, as well known in the art.

To investigate the feasibility of using siRNA to suppress influenza virus replication, various influenza virus A RNAs were targeted. Specifically, the MDCK cell line, which is readily infected and widely used to study influenza virus, was utilized. Each siRNA was individually introduced into populations of MDCK cells by electroporation. siRNA targeted to GFP (sense: 5'- GGCUACGUCCAGGAGCGCAUU - 3' (SEQ ID NO: 10752); antisense: 5'- UGCGCUCCUGGACGUAGCCUU -3' (SEQ ID

NO: 10753)) was used as control. This siRNA is referred to as GFP-949. In subsequent experiments (described in examples below) the UU overhang at the 3' end of both strands was replaced by dTdT with no effect on results. A mock electroporation was also performed as a control. Eight hours after electroporation cells were infected with either influenza A virus PR8 or WSN at an MOI of either 0.1 or 0.01 and were analyzed for virus production at various time points (24, 36, 48, 60 hours) thereafter using a standard hemagglutination assay. GFP expression was assayed by flow cytometry using standard methods.

Figures 1 IA and 1 IB compare results of experiments in which the ability of individual siRNAs to inhibit replication of influenza virus A strain A/Puerto Rico/8/34 (HlNl) (Figure 1 IA) or influenza virus A strain A/WSN/33 (HlNl) (Figure 1 IB) was determined by measuring HA titer. Thus a high HA titer indicates a lack of inhibition while a low HA titer indicates effective inhibition. MDCKrcells were infected at an MOI of 0.01. For these experiments one siRNA that targets the PBl segment (PBl- 2257/2277), one siRKA that targets the PB2 segment (PB2-2240/2260), one siRNA that targets the PA segment (PA-2087/2107 (G)), and three different siRNAs that target the NP genome and transcript (NP-231/251, NP-390/410, and NP-1496/1516) were tested. Note that the legends on Figures 1 IA and 1 IB list only the 5' nucleotide of the siRNAs. Symbols in Figures HA and HB are as follows: Filled squares represents control cells that did not receive siRNA. Open squares represents cells that received the GFP control siRNA. Filled circles represent cells that received siRNA PB 1-2257/2277. Open circles represent cells that received siRNA PB2-2240/2260. Open triangles represent cells that received siRNA PA-2087/2107 (G). The X symbol represents cells that received siRNA NP-231/251. The + symbol represents cells that received siRNA NP- 390/410. Closed triangles represent cells that received siRNA NP-1496/1516. Note that in the graphs certain symbols are sometimes superimposed. For example, in Figure HB the open and closed triangles are superimposed.

In the absence of siRNA (mock TF) or the presence of control (GFP) siRNA, the titer of virus increased over time, reaching a peak at approximately 48-60 hours after infection. In contrast, at 60 hours the viral titer was significantly lower in the presence of any of the siRNAs. For example, in strain WSN the HA titer (which reflects the level of virus) was approximately half as great in the presence of siRNAs PB2-2240 or NP-231 than in the controls. In particular, the level of virus was below the detection limit (10,000 PFU/ml) in the presence of siRNA NP- 1496 in both strains. This represents a decrease by

a factor of more than 60-fold in the PR8 strain and more than 120-fold in the WSN strain. The level of virus was also below the detection limit (10,000 PFU/ml) in the presence of siRNA PA-2087(G) in strain WSN and was extremely low in strain PR8. Suppression of virus production by siRNA was evident even from the earliest time point measured. Effective suppression, including suppression of virus production to undetectable levels (as determined by HA titer) has been observed at time points as great as 72 hours postinfection.

Table 5 summarizes results of siRNA inhibition assays at 60 hours in MDCK cells expressed in terms of fold inhibition. Thus a low value indicates lack of inhibition while a high value indicates effective inhibition. The location of siRNAs within a viral gene is indicated by the number that follows the name of the gene. As elsewhere herein, the number represents the starting nucleotide of the siRNA in the gene. For example, NP- 1496 indicates an siRNA specific for NP, the first nucleotide starting at nucleotide 1496 of the NP sequence. Values shown (fold-inhibition) are calculated by dividing hemagglutinin units from mock transfection by hemagglutinin units from transfection with the indicated siRNA; a value of 1 means no inhibition.

A total of twenty siRNAs, targeted to 6 segments of the influenza virus genome (PB2, PBl, PA, NP, M and NS), were tested in the MDCK cell line system (Table 5). About 15% of the siRNA (PB 1-2257, PA-2087G and NP- 1496) tested displayed a strong effect, inhibiting viral production by more than 100 fold in most cases at MOI=0.001 and by 16 to 64 fold at MOI=0.01, regardless of whether PR8 or WSN virus was used. In particular, when siRNA NP- 1496 or PA-2087 was used, inhibition was so pronounced that culture supernatants lacked detectable hemagglutinin activity. These potent siRNAs target 3 different viral gene segments: PBl and PA, which are involved in the RNA transcriptase complex, and NP which is a single-stranded RNA binding nucleoprotein. Consistent with findings in other systems, the sequences targeted by these siRNAs are all positioned relatively close to the 3-prime end of the coding region (Figure 12).

Approximately 40% of the siRNAs significantly inhibited virus production, but the extent of inhibition varied depending on certain parameters. Approximately 15% of siRNAs potently inhibited virus production regardless of whether PR8 or WSN virus was used. However, in the case of certain siRNAs, the extent of inhibition varied somewhat depending on whether PR8 or WSN was used. Some siRNAs significantly inhibited virus production only at early time points (24 to 36 hours after infection) or only at lower dosage of infection (MOI=0.001), such as PB2-2240, PB1-129, NP-231 and M-37. These

siRNAs target different viral gene segments, and the corresponding sequences are positioned either close to 3-prime end or 5-prime end of the coding region (Figure 12). Tables 5 A and 5B present results of the assays. Approximately 45% of the siRNAs had no discernible effect on the virus titer, indicating that they were not effective in interfering with influenza virus production in MDCK cells. In particular, none of the four siRNAs which target the NS gene segment showed any inhibitory effect.

To estimate virus titers more precisely, plaque assays with culture supernatants were performed (at 60 hrs) from culture supernatants obtained from virus-infected cells that had undergone mock transfection or transfection with NP- 1496. Approximately 6 x 10 pfu/ml was detected in mock supernatant, whereas no plaques were detected in undiluted NP-1496 supernatant (Figure HC). As the detection limit of the plaque assay is about 20 pfu (plaque forming unit)/ml, the inhibition of virus production by NP-1496 is at least about 30,000 fold. Even at an MOI of 0.1, NP-1496 inhibited virus production about 200-fold. To ' determine the potency of siRNA, a graded amount of NP-1496 was transfected into MDCK cells followed by infection with PR8 virus. Virus titers in the culture supernatants were measured by hemagglutinin assay. As the amount of siRNA decreased, virus titer increased in the culture supernatants as shown in Fig. HD. However, even when as little as 25 pmol of siRNA was used for transfection, approximately 4-fold inhibition of virus production was detected as compared to mock transfection, indicating the potency of NP-1496 siRNA in inhibiting influenza virus production.

For therapy, it is desirable for siRNA to be able to effectively inhibit an existing virus infection. In a typical influenza virus infection, new virions are released beginning at about 4 hours after infection. To determine whether siRNA could reduce or eliminate infection by newly released virus in the face of an existing infection, MDCK cells were infected with PR8 virus and then transfected with NP-1496 siRNA. Virus titer increased steadily over time following mock transfection, whereas virus titer increased only slightly in NP-1496 transfected cells. Thus administration of siRNA after virus infection is effective. Together, these results show that (i) certain siRNAs can potently inhibit influenza virus production; (ii) influenza virus production can be inhibited by siRNAs specific for different viral genes, including those encoding NP, PA, and PBl proteins; and (iii) siRNA inhibition occurs in cells that were infected previously in addition to cells infected simultaneously with or following administration of siRNAs.

Example 2: siRNAs that Target Viral RNA Polymerase or Nucleoprotein Inhibit Influenza A Virus Production in Chicken Embiγos.

Materials and Methods SiRNA-oligofectamine complex formation and chicken embryo inoculation. SiRNAs were prepared as described above. Chicken eggs were maintained under standard conditions. 30 μl of Oligofectamine (product number: 12252011 from Life Technologies, now Invitrogen) was mixed with 30 μl of Opti-MEM I (Gibco) and incubated at RT for 5 min. 2.5 nmol (10 μl) of siRNA was mixed with 30 μl of Opti-MEM I and added into diluted oligofectamine. The siRNA and oligofectamine was incubated at RT for 30 min. 10-day old chicken eggs were inoculated with siRNA-oligofectamine complex together with 100 μl of PR8 virus (5000 pfu/ml). The eggs were incubated at 37°C for indicated time and allantoic fluid was harvested. Viral titer in allantoic fluid was tested by HA assay as described above. Results

To confirm the results in MDCK cells, the ability of siRNA to inhibit influenza virus production in fertilized chicken eggs was also assayed. Because electroporation cannot be used on eggs, Oligofectamine, a lipid-based agent that has been shown to facilitate intracellular uptake of DNA oligonucleotides as well as siRNAs in vitro was used (25). Briefly, PR8 virus alone (500 pfu) or virus plus siRNA-oligofectamine complex was injected into the allantoic cavity of 10-day old chicken eggs as shown schematically in Figure 13 A. Allantoic fluids were collected 17 hours later for measuring virus titers by hemagglutinin assay. As shown in Figure 13B, when virus was injected alone (in the presence of Oligofectamine), high virus titers were readily detected. Co-injection of GFP- 949 did not significantly affect the virus titer. (No significant reduction in virus titer was observed when Oligofectamine was omitted.)

The injection of siRNAs specific for influenza virus showed results consistent with those observed in MDCK cells: The same siRNAs (NP-1496, PA2087 and PB1-2257) that inhibited influenza virus production in MDCK cells also inhibited virus production in chicken eggs, whereas the siRNAs (NP-231 , M-37 and PB 1 - 129) that were less effective in MDCK cells were ineffective in fertilized chicken eggs. Thus, siRNAs are also effective in interfering with influenza virus production in fertilized chicken eggs.

Example 3: SiRNA inhibits influenza Virus Production at the mRNA Level

Materials and Methods

SiRNA preparation was performed as described above. RNA extraction, reverse transcription and real time PCR. 1x10 MDCK cells were electroporated with 2.5 nmol of NP-1496 or mock electroporated (no siRNA). Eight hours later, influenza A PR8 virus was inoculated into the cells at MOI=O.1. At times 1, 2, and 3 -hour post-infection, the supernatant was removed, and the cells were lysed with Trizol reagent (Gibco). RNA was purified according to the manufacturer's instructions. Reverse transcription (RT) was carried out at 37°C for 1 hr, using 200 ng of total RNA, specific primers (see below), and Omniscript Reverse transcriptase kit (Qiagen) in a 20-μl reaction mixture according to the manufacturer's instructions. Primers specific for either mRNA, NP vRNA, NP cRNA, NS vRNA, or NS cRNA were as follows: mRNA, dT 18 = 5'-TTTTTTTTTTTTTTTTTT-S ' (SEQ ID NO: 10754) NP vRNA, NP-367: 5'-CTCGTCGCTTATGACAAAGAAG-S' (SEQ ID NO: 10755).

NP cRNA, NP- 1565R:

5'-ATATCGTCTCGTATTAGTAGAAACAAGGGTATTTTT-S' (SEQ ID NO: 10756).

NS vRNA, NS-527: 5'-CAGGACATACTGATGAGGATG-S' (SEQ ID NO: 10757).

NS cRNA, NS-890R: 5'-ATATCGTCTCGTATTAGTAGAAACAAGGGTGTTTT-S' (SEQ ID NO: 10758).

1 μl of RT reaction mixture (i.e., the sample obtained by performing reverse transcription) and sequence-specific primers were used for real-time PCR using SYBR Green PCR master mix (AB Applied Biosystems) including SYBR Green I double-stranded DNA binding dye. PCRs were cycled in an ABI PRISM 7000 sequence detection system (AB applied Biosystem) and analyzed with ABI PRISM 7000 SDS software (AB Applied

Biosystems). The PCR reaction was carried out at 50°C, 2 min, 95°C, 10 min, then 95 0 C, 15 sec and 60°C, 1 min for 50 cycles. Cycle times were analyzed at a reading of 0.2 fluorescence units. All reactions were done in duplicate. Cycle times that varied by more than 1.0 between the duplicates were discarded. The duplicate cycle times were then averaged and the cycle time of β-actin was subtracted from them for a normalized value.

PCR primers were as follows. For NP RNAs:

NP-367: 5'-CTCGTCGCTTATGACAAAGAAG-S' (SEQ ID NO: 10755). NP-460R: 5'-AGATCATCATGTGAGTCAGAC-S' (SEQ ID NO: 10759). For NS RNAs:

NS-527: 5'-CAGGACATACTGATGAGGATG-S ' (SEQ ID NO: 10757). NS-617R: 5'-GTTTCAGAGACTCGAACTGTG-S' (SEQ ID NO: 10760). Results

As described above, during replication of influenza virus, vRNA is transcribed to produce cRNA, which serves as a template for more vRNA synthesis, and mRNA, which serves as a template for " protein synthesis (1). Although RNAi is known to target the degradation of mRNA in a sequence-specific manner (16-18), there is a possibility that vRNA and cRNA are also targets for siRNA since vRNA of influenza A virus is sensitive to nuclease (1). To investigate the effect of siRNA on the degradation of various RNA species, reverse transcription using sequence-specific primers followed by real time PCR was used to quantify the levels of vRNA, cRNA and mRNA. Figure 15 shows the relationship between influenza virus vRNA, mRNA, and cRNA. As shown in Figure 15, cRNA is the exact complement of vRNA, but mRNA contains a polyA sequence at the 3' end, beginning at a site complementary to a site 15 - 22 nucleotides downstream from the 5' end of the vRNA segment. Thus compared to vRNA and cRNA, mRNA lacks 15 to 22 nucleotides at the 3' end. To distinguish among the three viral RNA species, primers specific for vRNA, cRNA and mRNA were used in the first reverse transcription reaction. For mRNA, poly dTl 8 was used as primer. For cRNA, a primer complementary to the 3 ' end of the RNA that is missing from mRNA was used. For vRNA, the primer can be almost anywhere along the RNA as long as it is complementary to vRNA and not too close to the 5' end. The resulting cDNA transcribed from only one of the RNAs was amplified by real time PCR.

Following influenza virus infection, new virions are starting to be packaged and released by about 4 hrs. To determine the effect of siRNA on the first wave of mRNA and cRNA transcription, RNA was isolated early after infection. Briefly, NP- 1496 was electroporated into MDCK cells. A mock electroporation (no siRNA) was also

performed). Six to eight hours later, cells were infected with PR8 virus at MOI=O.1. The cells were then lysed at 1, 2 and 3 hours post-infection and RNA was isolated. The levels of mRNA, vRNA and cRNA were assayed by reverse transcription using primers for each RNA species, followed by real time PCR. Figure 16 shows amounts of viral NP and NS RNA species at various times following infection with virus, in cells that were mock transfected or transfected with siRNA NP- 1496 approximately 6-8 hours prior to infection. As shown in Figure 16, 1 hour after infection, there was no significant difference in the amount of NP mRNA between samples with or without NP siRNA transfection. As early as 2 hours post-infection, NP mRNA increased by 38 fold in the mock transfection group, whereas the levels of NP mRNA did not increase (or even slightly decreased) in cells transfected with siRNA. Three hours post-infection, mRNA transcript levels continued to increase in the mock transfection whereas a continuous decrease in the amount of NP mRNA was observed in the cells that received siRNA treatment. NP vRNA and cRNA displayed a similar pattern except that the increase in the amount of vRNA and cRNA in the mock transfection was significant only at 3 hrs post-infection. While not wishing to be bound by any theory, this is probably due to the life cycle of the influenza virus, in which an initial round of mRNA transcription occurs before cRNA and further vRNA synthesis.

These results indicate that, consistent with the results of measuring intact, live virus by hemagglutinin assay or plaque assay, the amounts of all NP RNA species were also significantly reduced by the treatment with NP siRNA. Although it is known that siRNA mainly mediates degradation of mRNA, the data from this experiment does not exclude the possibility of siRNA-mediated degradation of NP cRNA and vRNA although the results described below suggest that reduction in NP protein levels as a result of reduction in NP mRNA results in decreased stability of NP cRNA and/or vRNA.

Example 4: Identification of the Target of RNA Interference Materials and Methods

SiRNA preparation of unmodified siRNAs was performed as described above. Modified RNA oligonucleotides, in which the 2'-hydroxyl group was substituted with a 2'-O- methyl group at every nucleotide residue of either the sense or antisense strand, or both, were also synthesized by Dharmacon. Modified oligonucleotides were deprotected and

annealed to the complementary strand.as described for unmodified oligonucleotides. siRNA duplexes were analyzed for completion of duplex formation by gel electrophoresis.

Cell culture, transfection with siRNAs, and infection with virus. These were performed essentially as described above. Briefly, for the experiment involving modified NP-1496 siRNA, MDCK cells were first transfected with NP-1496 siRNAs (2.5 nmol) formed from wild type (wt) and modified (m) strands and infected 8 hours later with PR8 virus at a MOI of 0.1. Virus titers in the culture supernatants were assayed 24 hours after infection. For the experiment involving M-37 siRNA, MDCK cells were transfected with M-37 siRNAs (2.5 nmol), infected with PR8 virus at an MOI of 0.01 , and harvested for RNA isolation 1, 2, and 3 hours after infection. See Table 2 for M-37 sense and antisense sequences.

RNA extraction, reverse transcription and real time PCR were performed essentially as described above. Primers specific for either mRNA, M-specifϊc vRNA, and M-specific cRNA, used for reverse transcription, were as follows: mRNA, dT 18 = 5'-TTTTTTTTTTTTTTTTTT-3' (SEQ ID NO: 10754) MvRNA: 5'- CGCTC AGACATGAGAACAGAATGG - 3' (SEQ ID NO: 10761)

M cRNA: 5'- ATATCGTCTCGTATTAGTAGAAACAAGGTAGTTTTT-S' (SEQ ID NO: 10762). PCR primers for M RNAs were as follows:

M forward: 5'- CGCTCAGACATGAGAACAGAATGG - 3' (SEQ ID NO: 10761) M reverse: 5' - TAACTAGCCTGACTAGCAACCTC - 3' (SEQ ID NO: 10763) Results

To investigate the possibility that siRNA might interfere with vRNA and/or cRNA in addition to mRNA, NP-1496 siRNAs in which either the sense (S or +) or antisense (AS or -) strand was modified were synthesized. The modification, which substitutes a 2'-O- rnethyl group for the 2'-hydroxyl group in every nucleotide residue, does not affect base- pairing for duplex formation, but the modified RNA strand no longer supports RNA interference. In other words, an siRNA in which the sense strand is modified but the antisense strand is wild type (mS:wtAS) will support degradation of RNAs having a sequence complementary to the antisense strand but not a sequence complementary to the

sense strand. Conversely, an siRNA in which the sense strand is wild type but the antisense strand is modified (wtS:mAS) will support degradation of RNAs having a sequence complementary to the sense strand but will not support degradation of RNAs having a sequence complementary to the sense strand. MDCK cells were either mock transfected or transfected with NP- 1496 siRNAs in which either the sense strand (mS:wtAS) or the antisense strand (wtS:mAS), , was modified while the other strand was wild type. Cells were also transfected with NP- 1496 siRNA in which both strands were modified (mS:mAS). Cells were then infected with PR8 virus, and virus titer in supernatants was measured. As shown in Figure 17 A, high virus titers were detected in cultures subjected to mock transfection. As expected, very low virus titers were detected in cultures transfected with wild type siRNA (wtS:wtAS), but high virus titers were detected in cultures transfected with siRNA in which both strands were modified (mS:mAS). Virus titers were high in cultures transfected with siRNA in which the antisense strand was modified (wtAS:mAS), whereas the virus titers were low in cultures transfected with siRNA in which the sense strand only was modified (mS:wtAS). While not wishing to be bound by any theory, the inventors suggest that the requirement for a wild type antisense (-) strand of siRNA duplex to inhibit influenza virus production suggests that the target of RNA interference is either mRNA (+) or cRNA (+) or both.

To further distinguish these possibilities, the effect of siRNA on the accumulation of corresponding mRNA, vRNA, and cRNA was examined. To follow transcription in a cohort of simultaneously infected cells, siRNA-transfected MDCK cells were harvested for RNA isolation 1, 2, and 3 hours after infection (before the release and re-infection of new virions). The viral mRNA, vRNA, and cRNA were first independently converted to cDNA by reverse transcription using specific primers. Then, the level of each cDNA was quantified by real time PCR. As shown in Figure 17B, when M-specific siRNA M-37 was used, little M-specific mRNA was detected one or two hours after infection. Three hours after infection, M-specific mRNA was readily detected in the absence of M-37. In cells transfected with M-37, the level of M-specific mRNA was reduced by approximately 50%. In contrast, the levels of M-specific vRNA and cRNA were not inhibited by the presence of M-37. While not wishing to be bound by any theory, these results indicate that viral mRNA is probably the target of siRNA-mediated interference.

Example 5: Effects of Certain siRNAs on Viral RNA Accumulation Materials and Methods

SiRNA preparation was performed as described above. Primers were specific for either mRNA, NP vRNA, NP cRNA, NS vRNA, NS cRNA, M vRNA, or M cRNA. Primers specific for PB 1 vRNA, PB 1 cRNA, PB2 vRNA, PB2 cRNA, PA vRNA, or PA cRNA, used for reverse transcription, were as follows:

PBl vRNA: 5'-GTGCAGAAATCAGCCCGAATGGTTC-S' (SEQ ID NO: 10764)

PBl cRNA: 5'-ATATCGTCTCGTATTAGTAGAAACAAGGCATTT-S' (SEQ ID NO: 10765) PB2 vRNA: 5'-GCGAAAGGAGAGAAGGCTAATGTG-3 l (SEQ ID NO: 10766)

PB2 cRNA: 5'-ATATGGTCTCGTATTAGTAGAAACAAGGTCGTTT-S' (SEQ ID NO: 10767)

PA vRNA: 5'-GCTTCTTATCGTTCAGGCTCTTAGG-S' (SEQ ID NO: 10768)

PA cRNA: 5'-ATATCGTCTCGTATTAGTAGAAACAAGGTACTT-S' (SEQ ID NO: 10769)

PCR primers for PBl, PB2, and PA RNAs were as follows: PBl forward: 5'-CGGATTGATGCACGGATTGATTTC-S' (SEQ ID NO: 10770) PBl reverse: 5'-GACGTCTGAGCTCTTCAATGGTGGAAC-S' (SEQ ID NO: 10771) PB2 forward: 5'-GCGAAAGGAGAGAAGGCTAATGTG-S' (SEQ ID NO: 10766) PB2 reverse: S'-AATCGCTGTCTGGCTGTCAGTAAG-S' (SEQ ID NO: 10772) PA forward: 5'-GCTTCTTATCGTTCAGGCTCTTAGG-S' (SEQ ID NO: 10768) PA reverse: 5'-CCGAGAAGCATTAAGCAAAACCCAG-S' (SEQ ID NO: 10773) Results

To determine whether NP- 1496 targets the degradation of the NP gene segment specifically or whether the levels of viral RNAs other than NP are also affected, primers specific for NS were used for RT and real time PCR to measure the amount of different NS RNA species (mRNA, vRNA, cRNA) as described above. As shown in Figure 18, the changes in NS mRNA, vRNA and cRNA showed the same pattern as that observed for NP RNAs. At 3 hours post-infection, a significant increase in all NS RNA species

could be seen in mock transfected cells, whereas no significant changes in NS RNA levels were seen in the cells that received NP- 1496 siRNA. This result indicates that the transcription and replication of different viral RNAs are coordinately regulated, at least with respect to NP RNAs. By coordinately regulated is meant that levels of one transcript affect levels of another transcript, either directly or indirectly. No particular mechanism is implied. When NP transcripts are degraded by siRNA treatment the levels of other viral RNAs are also reduced.

To further explore the effect of NP siRNAs on other viral RNAs, accumulation of mRNA, vRNA, and cRNA of all viral genes was measured in cells that had been treated with NP- 1496. As shown in Figure 18A (top panel), NP-specific mRNA was low one or two hours after infection. Three hours after infection, NP mRNA was readily detected in the absence of NP-1496, whereas in the presence of NP-1496, the level of NP mRNA remained at the background level, indicating that siRNA inhibited the accumulation of specific mRNA. As shown in Figure 18A (middle and bottom panels) levels of NP- specific and NS-specific vRNA and cRNA were greatly inhibited by the presence of NP- 1496. These results confirm the results described above. In addition, in the NP-1496- treated cells, the accumulation of mRNA, vRNA, and cRNA of the M, NS, PBl, PB2, and PA genes was also inhibited (Figure 18B, 18C, and 18H). Furthermore, the broad inhibitory effect was also observed for PA-2087. The top, middle, and bottom panels on the left side in Figures 18E, 18F, and 18G display the same results as presented in Figures 18 A, 18B, and 18C, showing the inhibition of viral mRNA transcription and of viral vRNA and cRNA replication by NP-1496 siRNA. The top, middle, and bottom panels on the right side in Figures 18E, 18F, and 18G present results of the same experiment performed with PA-2087 siRNA at the same concentration. As shown in Figure 18E, right upper, middle, and lower panels respectively, at three hours after infection PA, M, and NS mRNA were readily detected in the absence of PA-2087, whereas the presence of PA-2087 inhibited transcription of PA, M, and NS mRNA. As shown in Figure 18F, right upper, middle, and lower panels respectively, at three hours after infection PA, M, and NS vRNA were readily detected in the absence of PA-2087, whereas the presence of PA-2087 inhibited accumulation of PA, M, and NS vRNA. As shown in Figure 18G, right upper, middle, and lower panels respectively, at three hours after infection PA, M, and NS cRNA were readily detected in the absence of PA-2087, whereas the presence of PA-2087 inhibited accumulation of PA, M, and NS cRNA. In addition, Figure 18H

shows that NP-specific siRNA inhibits the accumulation of PBl- (top panel), PB2- (middle panel) and PA- (lower panel) specific mRNA.

While not wishing to be bound by any theory, the inventors suggest that the broad effect of NP siRNA is probably a result of the importance of NP in binding and stabilizing vRNA and cRNA, and not because NP-specific siRNA targets RNA degradation non- specifically. The NP gene segment in influenza virus encodes a single-stranded RNA- binding nucleoprotein, which can bind to both vRNA and cRNA (see Figure 14). During the viral life cycle, NP mRNA is first transcribed and translated. The primary function of the NP protein is to encapsidate the virus genome for the purpose of RNA transcription, replication and packaging. In the absence of NP protein, the full-length synthesis of both vRNA and cRNA is strongly impaired. When NP siRNA induces the degradation of NP RNA, NP protein synthesis is impaired and the resulting lack of sufficient NP protein subsequently affects the replication of other viral gene segments. In this way, NP siRNA could potently inhibit virus production at a very early stage. The number of NP protein molecules in infected cells has been hypothesized to regulate the levels of mRNA synthesis versus genome RNA (vRNA and cRNA) replication (1). Using a temperature-sensitive mutation in the NP protein, previous studies have shown that cRNA, but not mRNA, synthesis was temperature sensitive both in vitro and in vivo (70, 71). NP protein was shown to be required for elongation and antitermination of the nascent cRNA and vRNA transcripts (71, 72). The results presented above show that NP- specific siRNA inhibited the accumulation of all viral RNAs in infected cells. While not wishing to be bound by any theory, it appears probable that in the presence of NP-specific siRNA, the newly transcribed NP mRNA is degraded, resulting in the inhibition of NP protein synthesis following virus infection. Without newly synthesized NP, further viral transcription and replication, and therefore new virion production is inhibited.

Similarly, in the presence of PA-specific, the newly transcribed PA mRNA is degraded, resulting in the inhibition of PA protein synthesis. Despite the presence of 30-60 copies of RNA transcriptase per influenza virion (1), without newly synthesized RNA transcriptase, further viral transcription and replication are likely inhibited. Similar results were obtained using siRNA specific for PBl . In contrast, the matrix (M) protein is not required until the late phase of virus infection (1). Thus, M-specific siRNA inhibits the accumulation of M-specific mRNA but not vRNA, cRNA, or other viral RNAs. Taken together, these findings demonstrate a critical requirement for newly synthesized

nucleoprotein and polymerase proteins in influenza viral RNA transcription and replication. Both mRNA- and virus-specific mechanisms by which NP-, PA-, and PBl- specific siRNAs interfere with mRNA accumulation and other viral RNA transcription suggest that these siRNAs may be especially potent inhibitors of influenza virus infection. Example 6: Broad Inhibition of Influenza Virus RNA Accumulation by Certain siRNAs is Not Due to the Interferon Response or to Virus-induced RNA Degradation.

RNA levels were measured using PCR under standard conditions. The following PCR primers were used for measurement of γ-actin RNA. γ-actin forward: 5'-TCTGTCAGGGTTGGAAAGTC-S' (SEQ ID NO: 10774) γ-actin reverse: 5'-AAATGCAAACCGCTTCCAAC - 3' (SEQ ID NO: 10775)

One possible cause for the broad inhibition of viral RNA accumulation described above is an interferon response of the infected cells in the presence of siRNA (23, 65, 66). Thus, the above experiments were repeated in Vero cells in which the entire IFN locus, including all α, β, and ω genes, are deleted (67, 68) (Q. G. and J.C. unpublished data). Just as in MDCK cells, the accumulation of NP-, M-, and NS-specific niRNAs were all inhibited by NP- 1496 (Fig. 18D). In addition, the effect of siRNA on the levels of transcripts from cellular genes, including β-actin, γ-actin, and GAPDH, was assayed using PCR. No significant difference in the transcript levels was detected in the absence or presence of siRNA indicating that the inhibitory effect of siRNA is specific for viral RNAs. These results suggest that the broad inhibition of viral RNA accumulation by certain siRNAs is not a result of a cellular interferon response.

Following influenza virus infection, the presence of dsRNA also activates a cellular pathway that targets RNA for degradation (23). To examine the effect of siRNA on the activation of this pathway, we assayed the levels of phosphorylated protein kinase R (PKR), the most critical component of the pathway (23). Transfection of MDCK cells with NP- 1496 in the absence of virus infection did not affect the levels of activated PKR (data not shown). Infection by influenza virus resulted in an increased level of phosphorylated PKR, consistent with previous studies (65, 66, 69). However, the increase was the same in the presence or absence of NP- 1496 (data not shown). Thus, the broad inhibition of viral RNA accumulation is not a result of enhanced virus-induced degradation in the presence of siRNA.

Example 7: Inhibition of Influenza Virus Production in Mice by siRNAs

This example describes experiments showing that administration of siRNAs targeted to influenza virus NP or PA transcripts inhibit production of influenza virus in mice when administered either prior to or following infection with influenza virus. The inhibition is dose-dependent and shows additive effects when two siRNAs each targeted to a transcript expressed from a different influenza virus gene were administered together.

Materials and Methods

SiRNA preparation. This was performed as described above.

SiRNA delivery. siRNAs (30 or 60 μg of GFP-949, NP-1496, or PA-2087) were incubated with j etPEI™ for oligonucleotides cationic polymer transfection reagent, NTP ratio = 5 (Qbiogene, Inc., Carlsbad, CA; Cat. No. GDSP20130; N/P refers to the number of nitrogens per nucleotide phosphate in the jetPEI/siRNA mixture) or with poly-L-lysine (MW (vis) 52,000; MW (LALLS) 41,800, Sigma Cat. No. P2636) for 20 min at room temperature in 5% glucose. The mixture was injected into mice intravenously, into the retro-orbital vein, 200 μl per mouse, 4 mice per group. 200 μl 5% glucose was injected into control (no treatment) mice. The mice were anesthetized with 2.5% Avertin before siRNA injection or intranasal infection.

Viral infection. B6 mice (maintained under standard laboratory conditions) were intranasally infected with PR8 virus by dropping virus-containing buffer into the mouse's nose with a pipette, 30 ul (12,000 pfu) per mouse.

Determination of viral titer. Mice were sacrificed at various times following infection, and lungs were harvested. Lungs were homogenized, and the homogenate was frozen and thawed twice to release virus. PR8 virus present in infected lungs was titered by infection of MDCK cells. Flat-bottom 96-well plates were seeded with 3xlO 4 MDCK cells per well, and 24 hrs later the serum-containing medium was removed. 25 μl of lung homogenate, either undiluted or diluted from IxIO "1 to IxIO '7 , was inoculated into triplicate wells. After Ih incubation, 175 μl of infection medium with 4 μg/ml of trypsin was added to each well. Following a 48 h incubation at 37°C, the presence or absence of virus was determined by hemagglutination of chicken RBC by supernatant from infected cells. The hemagglutination assay was carried out in V-bottom 96-well plates. Serial 2- fold dilutions of supernatant were mixed with an equal volume of a 0.5% suspension

(vol/vol) of chicken erythrocytes (Charles River Laboratories) and incubated on ice for 1 h. Wells containing an adherent, homogeneous layer of erythrocytes were scored as positive. The virus titers were determined by interpolation of the dilution end point that infected 50% of wells by the method of Reed and Muench (TCID 50 ), thus a lower TCID 50 reflects a lower virus titer. The data from any two groups were compared by Student t test, which was used throughout the experiments described herein to evaluate significance.

Figure 19A shows results of an experiment demonstrating that siRNA targeted to viral NP transcripts inhibits influenza virus production in mice when administered prior to infection. 30 or 60 μg of GFP-949 or NP-1496 siRNAs were incubated with jetPEI and injected intravenously into mice as described above in Materials and Methods. Three hours later mice were intranasally infected with PR8 virus, 12000 pfu per mouse. Lungs were harvested 24 hours after infection. As shown in Figure 19A 5 the average log 10 TCID 5 o of the lung homogenate for mice that received no siRNA treatment (NT; filled squares) or received an siRNA targeted to GFP (GFP 60 μg; open squares) was 4.2. In mice that were pretreated with 30 μg siRNA targeted to NP (NP 30 μg; open circles) and jetPEI, the average log 10 TCID 5 o of the lung homogenate was 3.9. In mice that were pretreated with 60 μg siRNA targeted to NP (NP 60 μg; filled circles) and jetPEI, the average log 10 TCID 5 o of the lung homogenate was 3.2. The difference in virus titer in the lung homogenate between the group that received no treatment and the group that received 60 μg NP siRNA was significant with P = 0.0002. Data for individual mice are presented in Table 6 A (NT = no treatment).

Figure 19B shows results of another experiment demonstrating that siRNA targeted to viral NP transcripts inhibits influenza virus production in mice when administered intravenously prior to infection in a composition containing the cationic polymer PLL. 30 or 60 μg of GFP-949 or NP-1496 siRNAs were incubated with PLL and injected intravenously into mice as described above in Materials and Methods. Three hours later mice were intranasally infected with PR8 virus, 12000 pfu per mouse. Lungs were harvested 24 hours after infection. As shown in Figure 19B, the average log 10 TCIDso of the lung homogenate for mice that received no siRNA treatment (NT; filled squares) or received an siRNA targeted to GFP (GFP 60 μg; open squares) was 4.1. In mice that were pretreated with 60 μg siRNA targeted to NP (NP 60 μg; filled circles)

and PLL, the average logϊoTCIDso of the lung homogenate was 3.0. The difference in virus titer in the lung homogenate between the group that received 60 μg GFP and the group that received 60 μg NP siRNA was significant with P = .001. Data for individual mice are presented in Table 6 A (NT = no treatment). These data indicate that siRNA targeted to the influenza NP transcript reduced the virus titer in the lung when administered prior to virus infection. They also indicate that a mixtures of an siRNA with a cationic polymer effectively inhibits influenza virus in the lung when administered by intravenous injection, not requiring techniques such as hydrodynamic transfection.

Table 6 A. Inhibition of influenza virus production in mice by siRNA with cationic polymers

Treatment logioTCIDSO

NT (jetPEI

4.3 4.3 4.0 4.0 experiment)

GFP (60 μg) + jetPEI 4.3 4.3 4.3 4.0

NP (30 μg) + jetPEI 4.0 4.0 3.7 3.7

NP (60 μg) + jetPEI 3.3 3.3 3.0 3.0

NT (PLL experiment) 4.0 4.3 4.0 4.0

(not

GFP (60 μg) + PLL 4.3 4.0 4.0 don e)

NP (60 μg) + PLL 3.3 3.0 3.0 2.7

Figure 19C shows results of a third experiment demonstrating that siRNA targeted to viral NP transcripts inhibits influenza virus production in mice when administered prior to infection and demonstrates that the presence of a cationic polymer significantly increases the inhibitory efficacy of siRNA. 60 μg of GFP-949 or NP-1496 siRNAs were incubated with phosphate buffered saline (PBS) or jetPEI and injected intravenously into mice as described above in Materials and Methods. Three hours later mice were intranasally infected with PR8 virus, 12000 pfu per mouse. Lungs were harvested 24 hours after infection. As shown in Figure 19C, the average log 10 TCID 5 o of the lung homogenate for

mice that received no siRNA treatment (NT; open squares) was 4.1, while the average log 10 TCID 50 of the lung homogenate for mice that received an siRNA targeted to GFP in PBS (GFP PBS; open triangles) was 4.4. In mice that were pretreated with 60 μg siRNA targeted to NP in PBS (NP PBS; closed triangles) the average log 10 TCID 50 of the lung homogenate was 4.2, showing only a modest increase in efficacy relative to no treatment or treatment with an siRNA targeted to GFP. In mice that were pretreated with 60 μg siRNA targeted to GFP in jetPEI (GFP PEI; open circles), the average log 10 TCID 50 of the lung homogenate was 4.2. However, in mice that received 60 μg siRNA targeted to NP in jetPEI (NP PEI; closed circles), the average 1Og 10 TCID 50 of the lung homogenate was 3.2. The difference in virus titer in the lung homogenate between the group that received GFP siRNA in PBS and the group that received NP siRNA in PBS was significant with P = 0.04, while the difference in virus titer in the lung homogenate between the group that received GFP siRNA with jetPEI and the group that received NP siRNA with jetPEI was highly significant with P = 0.003. Data for individual mice are presented in Table 6B (NT = no treatment).

Table 6B. Inhibition of influenza virus production in mice by siRNA showing increased efficacy with cationic polymer

Treatment log 10 TCID50

NT 4.3 4.3 4.0 3.7

GFP (60 μg) + PBS 4.3 4.3 4.7 4.3

NP (60 μg) + PBS 3.7 4.3 4.0 4.0

GFP (60 μg) + jetPEI 4.3 4.3 4.0 3.0

NT (60 μg) + jetPEI 3.3 3.0 3.7 3.0

Additional experiments were performed to assess the ability of siRNA to inhibit influenza virus production at various times after infection, when administered at various time points prior to or following infection. siRNA was administered as described above except that 120 ug siRNA was administered 12 hours before virus infection. Table 6C shows the results expresesed as log 10 TCIDso. The P value comparing NP-treated with control group was 0.049

Table 6C

In another experiment, siRNA (60 ug) was administered 3 hours before infection. 1500 pfu of PR8 virus was administered intranasally. The infected lung was harvested 48h after infection. Table 6D shows the results expressed as log 10 TCIDso. The P value comparing NP-treated with control group was 0.03.

Table 6D

In another experiment, siRNA (120 ug) was administered 24 hours after PR8 (1500 pfu) infection. 52 hours post-infection, the lung was harvested and virus titer was measured. Table 6E shows the results expressed as log 10 TCID 5 o. The P value comparing NP-treated with control group was 0.03.

Table 6E

Other polymers were also shown to be effective siRNA delivery agents. Figure 19D is a plot showing that siRNA targeted to NP (NP-1496) inhibits influenza virus production in mice when administered intravenously together with a poly(beta amino ester) (J28). Figure 19E is a plot showing that siRNA targeted to NP (NP-1496) inhibits influenza virus production in mice when administered intraperitoneally together with a polyφeta amino ester) (J28 or C32) while a control RNA (GFP) has no significant effect. The experiments were performed essentially as described above except that the ratio of polymer to siRNA was a weight/weight ratio (for instance, 60:1 w/w). Polymers and siRNA were mixed and administered to mice either intravenously or intraperitoneally 3 hours prior to intranasal infection with 12,000 pfu of PR8 virus. Lungs were harvested 24

hours later and HA assays were performed. The amine and bis(acrylate ester) monomers present in J28 and C32 are described and depicted in U.S.S.N. 10/446,444. The polymers were a kind gift of Dr. Robert Langer.

Figure 20 shows results of an experiment demonstrating that siRNAs targeted to different influenza virus transcripts exhibit an additive effect. Sixty μg of NP- 1496 siRNA, 60 μg PA-2087 siRNA, or 60 μg NP-1496 siRNA + 60 μg PA-2087 siRNA were incubated with jetPEI and injected intravenously into mice as described above in Materials and Methods. Three hours later mice were intranasally infected with PR8 virus, 12000 pfu per mouse. Lungs were harvested 24 hours after infection. As shown in Figure 20, the average 1Og 10 TCIDs 0 of the lung homogenate for mice that received no siRNA treatment (NT; filled squares) was 4.2. In mice that received 60 μg siRNA targeted to NP (NP 60 μg; open circles), the average log 10 TCID 5 o of the lung homogenate was 3.2. In mice that received 60 μg siRNA targeted to PA (PA 60 μg; open triangles), the average log 10 TCID 50 of the lung homogenate was 3.4. In mice that received 60 μg siRNA targeted to NP + 60 μg siRNA targeted to PA (NP + PA; filled circles), the average logi 0 TCID5o of the lung homogenate was 2.4. The differences in virus titer in the lung homogenate between the group that received no treatment and the groups that received 60 μg NP siRNA, 60 μg PA siRNA, or 60 μg NP siRNA + 60 μg PA siRNA were significant with P = 0.003, 0.01, and 0.0001, respectively. The differences in lung homogenate between the groups that received 60 μg NP siRNA or 60 μg NP siRNA and the group that received 60 μg NP siRNA + 60 μg PA siRNA were significant with P = 0.01. Data for individual mice are presented in Table 7 (NT = no treatment). These data indicate that pretreatment with siRNA targeted to the influenza NP or PA transcript reduced the virus titer in the lungs of mice subsequently infected with influenza virus. The data further indicate that a combination of siRNA targeted to different viral transcripts exhibit an additive effect, suggesting that therapy with a combination of siRNAs targeted to different transcripts may allow a reduction in dose of each siRNA, relative to the amount of a single siRNA that would be needed to achieve equal efficacy.

Table 7. Additive effect of siRNA against influenza virus in mice

NP (60 μg) 3.7 3.3 3.0 3.0

PA (60 μg) 3.7 3.7 3.0 3.0

NP + PA (60 μg each) 2.7 2.7 2.3 2.0

Figure 21 shows results of an experiment demonstrating that siRNA targeted to viral NP transcripts inhibits influenza virus production in mice when administered following infection. Mice were intranasally infected with PR8 virus, 500 pfu. Sixty μg of GFP-949 siRNA, 60 μg PA-2087 siRNA, 60 μg NP-1496 siRNA, or 60 μg NP siRNA + 60 μg PA siRNA were incubated with jetPEI and injected intravenously into mice 5 hours later as described above in Materials and Methods. Lungs were harvested 28 hours after administration of siRNA. As shown in Figure 21, the average 1Og 1O TCID 50 of the lung homogenate for mice that received no siRNA treatment (NT; filled squares) or received the GFP-specific siRNA GFP-949 (GFP; open squares) was 3.0. In mice that received 60 μg siRNA targeted to PA (PA 60 μg; open triangles), the average log 10 TCID 5 o of the lung homogenate was 2.2. In mice that received 60 μg siRNA targeted to NP (NP 60 μg; open circles), the average log 10 TCID 5 o of the lung homogenate was 2.2. In mice that received 60 μg NP siRNA + 60 μg PA siRNA (PA + NP; filled circles), the average log 10 TCID 50 of the lung homogenate was 1.8. The differences in virus titer in the lung homogenate between the group that received no treatment and the groups that received 60 μg PA, NP siRNA, or 60 μg NP siRNA + 60 μg PA siRNA were significant with P = 0.09, 0.02, and 0.003, respectively. The difference in virus titer in the lung homogenate between the group that received NP siRNA and PA + NP siRNAs had a P value of 0.2. Data for individual mice are presented in Table 8 (NT = no treatment). These data indicate that siRNA targeted to the influenza NP and/or PA transcripts reduced the virus titer in the lung when administered following virus infection.

Table 8. Inhibition of influenza virus production in infected mice by siRNA

Treatment log 10 TCID50

NT 3.0 3.0 3.0 3.0

GFP (60 μg) 3.0 3.0 3.0 2.7

PA (60 μg) 2.7 2.7 2.3 1.3

NP (60 μg) 2.7 2.3 2.3 1.7

14S

NP + PA (60 μg each) I 2.3 2.0 1.7 1.3

Example 8: Inhibition of Influenza Virus Production in Cells by Administration of a Lentivirus that Provides a Template for Production ofshRNA

An oligonucleotide that serves as a template for synthesis of an NP- 1496a shRNA (see Figure 22A) was cloned between the U6 promoter and termination sequence of lentiviral vector pLL3.7 (Rubinson, D., et al, Nature Genetics, Vol. 33, pp. 401-406, 2003), as depicted schematically in Figure 22A. The oligonucleotide was inserted between the Hpal and Xhol restriction sites within the multiple cloning site of pLL3.7. This lentiviral vector also expresses EGFP for easy monitoring of transfected/infected cells. Lentivirus was produced by co-transfecting the DNA vector comprising a template for production of NP-1496a shRNA and packaging vectors into 293T cells. Forty-eight h later, culture supernatant containing lentivirus was collected, spun at 2000 rpm for 7 min at 4°C and then filtered through a 0.45 um filter. Vero cells were seeded at 1 x 10 s per well in 24- well plates. After overnight culture, culture supernatants containing that contained the insert (either 0.25 ml or 1.0 ml) were added to wells in the presence of 8 ug/ml polybrene. The plates were then centrifuged at 2500 rpm, room temperature for Ih and returned to culture. Twenty-four h after infection, the resulting Vero cell lines (Vero- NP-0.25, and Vero-NP-1.0) were analyzed for GFP expression by flow cytometry along with parental (non-infected) Vero cells. It is noted that NP- 1496a differs from NP- 1496 due to the inadvertent inclusion of an additional nucleotide (A) at the 3 ' end of the sense portion and a complementary nucleotide (U) at the 5' end of the antisense portion, resulting in a duplex portion that is 20 nt in length rather than 19 as in NP- 1496. (See Table 2). According to other embodiments of the invention NP-1496 sequences rather than NP- 1496a sequences are used. In addition, the loop portion of NP- 1496a shRNA differs from that of NP- 1496 shRNA.

Control Vero cells and Vero cells infected with lentivirus containing the insert (Vero-NP-0.25 and Vero-NP-1.0) were infected with PR8 virus at MOI of 0.04, 0.2 and 1. Influenza virus titers in the supernatants were determined by HA assay 48 hrs after infection as described above. Lentivirus containing templates for production of NP- 1496a shRNA were tested for ability to inhibit influenza virus production in Vero cells. The NP-1496a shRNA

includes two complementary regions capable of forming a stem-loop structure containing a double-stranded portion that has the same sequence as the NP- 1496a siRNA described above. As shown in Figure 22B, incubation of lentivirus-containing supernatants with Vero cells overnight resulted in expression of EGFP, indicating infection of Vero cells by lenti virus. The shaded curve represents mean fluorescence intensity in control cells

(uninfected). When 1 ml of supernatant was used, almost all cells became EGFP positive and the mean fluorescence intensity was high (1818) (Vero-NP-1.0). When 0.25 ml of supernatant was used, most cells (-95%) were EGFP positive and the mean fluorescence intensity was lower (503) (Vero-NP-0.25). Parental Vero cells and lentivirus-infected Vero cells were then infected with influenza virus at MOI of 0.04, 0.2, and 0.1, and virus titers were assayed 48 hrs after influenza virus infection. With increasing MOI, the virus titers increased in the supernatants of parental Vero cell cultures (Figure 22C). In contrast, the virus titers remained very low in supernatants of Vero-NP-1.0 cell cultures, indicating influenza virus production was inhibited in these cells. Similarly, influenza virus production in Vero-NP-0.25 cell cultures was also partially inhibited. The viral titers are presented in Table 9. These results suggest that NP- 1496 shRNA expressed from lentivirus vectors can be processed into siRNA to inhibit influenza virus production in Vero cells. The extent of inhibition appears to be proportional to the extent of virus infection per cell (indicated by EGFP level).

Table 9. Inhibition of influenza virus production by siRNA expressed in cells in tissue culture

Cell Line Viral Titer

Vero 16 64 128

Vero-NP-0.25 8 32 64

Vero-NP-1.0 1 4 8

Example 9: Inhibition of Influenza Production in Mice by Intranasal Administration of a DNA Vector From Which siRNA Precursors Are Transcribed

Construction of a plasmid from which NP- 1496a shRNA is expressed is described above. Oligonucleotides that serve as templates for synthesis of PB 1-2257 shRNA or RS V-specific shRNA were cloned between the U6 promoter and termination sequence of

lentiviral vector pLL3.7 as described above and depicted schematically in Figure 22 A for NP- 1496a shRNA. The sequences of the oligonucleotides were as follows:

NP- 1496a sense:

5'- TGGATCTTATTTCTTCGGAGATTCAAGAGATCTCCGAAGAAATAAGATCCTTT

TTTC-3' (SEQ ID NO: 10776)

NP- 1496a antisense:

5'-

TCGAGAAAAAAGGATCTTATTTCTTCGGAGATCTCTTGAATCTCCGAAGAAAT AA

GATCCA-3' (SEQ ID NO: 10777) PB1-2257 sense:

5 '-TGATCTGTTCCACCATTGAATTCAAGAGATTCAATGGTGGAACAGATCTTTTTTC-S ' (SEQ ID NO: 10778) PB 1-2257 antisense:

5'-

TCGAGAAAAAAGATCTGTTCCACCATTGAATCTCTTGAATTCAATGGTGGAACAGAT CA-3' (SEQ ID NO: 10779) RSV sense: 5 '-TGCGATAATATAACTGCAAGATTCAAGAGATCTTGCAGTTATATTATCGTTTTTTC- 3' (SEQ ID NO: 10780) RSV antisense:

5'-

TCGAGAAAAAACGATAATATAACTGCAAGATCTCTTGAATCTTGCAGTTATAT TA

TCGCA-3' (SEQ ID NO: 10781)

The RSV shRNA expressed from the vector comprising the above oligonucleotide is processed in vivo to generate an siRNA having sense and antisense strands with the following sequences: Sense: 5'-CGATAATATAACTGCAAGA-3 l (SEQ ID NO: 10782) Antisense: 5'-TCTTGCAGTTATATTATCG-3 ' (SEQ ID NO: 10783)

A PA-specific hairpin may be similarly constructed using the following oligonucleotides: PA-2087 sense:

5 '-TGCAATTGAGGAGTGCCTGATTCAAGAGATCAGGCACTCCTCAATTGCTTTTTTC-S ' (SEQ ID NO: 10784) PA-2087 antisense:

5'-

TCGAGAAAAAAGCAATTGAGGAGTGCCTGATCTCTTGAATCAGGCACTCCTCAATTG CA-3' (SEQ ID NO: 10785)

Plasmid DNAs capable of serving as templates for expression of NP- 1496a shRNA, PB 1 -2257 shRNA, or RSV-specific shRNA (60 μg each) were individually mixed with 40 μl Infasurf ® (ONY, Inc., Amherst NY) and 20 μl of 5% glucose and were administered intranasally to groups of mice, 4 mice each group, as described above. A mixture of 40 μl Infasurf and 20 μl of 5% glucose was administered to mice in the no treatment (NT) group. The mice were intranasally infected with PR8 virus, 12000 pfu per mouse, 13 hours later, as described above. Lungs were harvested and viral titer determined 24 hours after infection.

The ability of shRNAs expressed from DNA vectors to inhibit influenza virus infection in mice was tested. For these experiments, plasmid DNA was mixed with Infasurf, a natural surfactant extract from calf lung similar to vehicles previously shown to promote gene transfer in the lung (74). The DN A/Infasurf mixtures were instilled into mice by dropping the mixture into the nose using a pipette. Mice were infected with PR8 virus, 12000 pfu per mouse, 13 hours later. Twenty-four hrs after influenza virus infection, lungs were harvested and virus titers were measured by MDCK/hemagglutinin assay. As shown in Figure 23, virus titers were high in mice that were not given any plasmid

DNA or were given a DNA vector expressing a respiratory syncytial virus (RSV)-specific shRNA. Lower virus titers were observed when mice were given plasmid DNA that expresses either NP- 1496a shRNA or PB 1-2257 shRNA. The virus titers were more significantly decreased when mice were given both influenza-specific plasmid DNAs together, one expressing NP- 1496a shRNA and the other expressing PB 1 -2257 shRNA.

The average logioTCIDso of the lung homogenate for mice that received no treatment (NT; open squares) or received a plasmid encoding an RSV-specific shRNA (RSV; filled squares) was 4.0 or 4.1, respectively. In mice that received plasmid capable of serving as

a template for NP-1496a shRNA (NP; open circles), the average log 10 TCID 50 of the lung homogenate was 3.4. In mice that received plasmid capable of serving as a template for PB 1-2257 shRNA (PB; open triangles), the average log 10 TCID 50 of the lung homogenate was 3.8. In mice that received plasmids capable of serving as templates for NP and PB shRNAs (NP + PBl ; filled circles), the average 1Og 1 OTCID 50 of the lung homogenate was 3.2. The differences in virus titer in the lung homogenate between the group that received no treatment or RS V-specific shRNA plasmid and the groups that received NP shRNA plasmid, PBl shRNA plasmid, or NP and PBl shRNA plasmids had P values of 0.049, 0.124, and 0.004 respectively. Data for individual mice are presented in Table 10 (NT = no treatment). These results show that shRNA expressed from DNA vectors can be processed into siRNA to inhibit influenza virus production in mice and demonstrate that Infasurf is a suitable vehicle for the delivery of plasmids from which shRNA can be expressed. In particular, these data indicate that shRNA targeted to the influenza NP and/or PBl transcripts reduced the virus titer in the lung when administered following virus infection.

Table 10. Inhibition of influenza virus production by shRNA expressed in mice

Treatment Iog 10 TCID50

NT 4.3 4.0 4.0 4.3

RSV (60 μg) 4.3 4.0 4.0 4.0

NP (60 μg) 4.0 3.7 3.0 3.0

PBl (60 μg) 4.0 4.0 3.7 3.3

NP + PBl (60

3.7 3.3 3.0 3.0 μg each)

Example 1OA: Inhibition of Lucifer ase Activity in the Lung by Delivery ofsiRNA to the Vascular System or the Respiratory Tract siRNAs were obtained from Dharmacon and were deprotected and annealed as described above. siRNA sequences for NP (NP-1496), PA (PA-2087), PBl (PB1-2257), and GFP were as given above. Luc-specific siRNA was as described in (McCaffrey, AP, et al, Nature, 418:38-39) pCMV-luc DNA (Promega) was mixed with PEI (Qbiogene, Carlsbad, CA) at a nitrogen/phosphorus molar ratio (N/P ratio) of 10 at room temperature for 20 min. For i.v. administration, 200 μl of the mixture containing 60 μg of DNA was injected retroorbitally into 8 week old male C57BL/6 mice (Taconic Farms). For intratracheal

I 50

(i.t.) adminstration, 50 μl of the mixture containing 30 μg or 60 μg of DNA was administered into the lungs of anesthetized mice using a Penn Century Model IA-IC insufflator. siRNA-PEI compositions were formed by mixing 60 μg of luc-specific or GFP- specific siRNA with jetPEI at an N/P ratio of 5 at room temperature for 20 min. For i.v. administration, 200 μl of the mixture containing the indicated amounts of siRNA was injected retroorbitally. For pulmonary administration, 50 μl was delivered intratracheally.

At various times after pCMV-luc DNA administration, lungs, spleen, liver, heart, and kidney were harvested and homogenized in Cell Lysis Buffer (Marker Gene Technologies, Eugene, OR). Luminescence was analyzed with the Luciferase Assay

System (Promega) and measured with an Optocomp® I luminometer (MGM Instruments, Hamden, CT). The protein concentrations in homogenates were measured by the BCA assay (Pierce).

To determine the tissue distribution of PEI-mediated nucleic acid delivery in mice, pCMV-luc DNA-PEI complexes were injected i.v., and 24 hr later, Luc activity was measured in various organs. Activity was highest in the lungs, where Luc activity was detected for at least 4 days, whereas in heart, liver, spleen, and kidney, levels were 100- 1,000 times lower and were detected for a shorter time after injection. When DNA-PEI complexes were instilled i.t., significant Luc activity was also detected in the lungs, although at a lower level than after i.v. adminstration.

To test the ability of PEI to promote uptake of siRNAs by the lungs following i.v. administration, mice were first given pCMV-luc DNA-PEI complexes i.t., followed by i.v. injection of Luc-specific siRNA complexed with PEI, control GFP-specific siRNA complexed with PEI, or the same volume of 5% glucose. Twenty-four hours later, Luc activity in the lungs was 17-fold lower in mice that received Luc siRNA than in those given GFP siRNA or no treatment. Because Luc siRNA can inhibit Luc expression only in the same lung cells that were transfected with the DNA vector, these results indicate that i.v. injection of a siRNA-PEI mixture achieves effective inhibition of a target transcript in the lung. To test the ability of PEI to promote uptake of siRNAs by the lungs following pulmonary administration, mice were first given pCMVDNA-PEI complexes i.v., followed immediately by i.t. administration of Luc-specific siRNA mixed with PEI, control GFP-

specific siRNA mixed with PEI, or the same volume of 5% glucose. Twenty-four hours later, luciferase activities were assayed in lung homogenates. Luciferase activity was 6.8- fold lower in mice that were treated with luciferase siRNA than those treated with GFP siRNA. These results indicate that pulmonary administration of an siRNA-PEI mixture achieves effective inhibition of a target transcript in lung cells.

Example 1OB: Inhibition of Cyclophilin B in the Lung by Delivery of siRNA to the Respiratory System

Cyclophilin B is an endogenous gene that is widely expressed in mammals. To assess the ability of siRNA delivered directly to the respiratory system to inhibit expression of an endogenous gene, outbred Blackswiss mice (around 30 g or more body weight) were anesthetized by isofluorane/oxygen, and siRNA targeted to cyclophilin B (Dharmacon, D- 001136-01-20 siCONTROL Cyclophilin B siRNA (Human/Mouse/Rat) or control GFP- 949 siRNA (2 mg/kg) was administered intranasally to groups of 2 mice for each siRNA. Lungs were harvested 24 hours after administration. RNA was extracted from the lung and reverse transcription was done using a random primer. Real time PCR was then performed using cyclophilin B and GAPDH Taqman gene expression assay (Applied Biosystems). Results (Table 11-1) showed 70% silencing of cyclophilin B by siRNA targeted to cyclophilin B. Table 11 : Inhibition of Cvclophilin B in the Lung

Ave

Averaqe Normalized normal silencinq%

PBS-I 5.395406 4.288984

PBS-2 3.182562

GFP-I 2.547352 3.752446 12.50968

GFP-2 4.957539

Cyclo-1 1.173444 1.256672 70.7

Cyclo-2 1.339901

Table 11-2 : Target Portions in NP Gene

Example 11: Inhibition of Influenza Virus by Direct Delivery of Naked siRNA to the Respiratory System

Materials and Methods siRNA preparation, viral infection, lung harvests, and influenza virus titer assays were performed as described above. Mice were anesthetized using isofluorane (administered by inhalation). siRNA was delivered in a volume of 50 μl by intranasal drip, p values were computed using Student's T test.

Results siRNA (NP-1496) in phosphate buffered saline (PBS) was administered to groups of mice (5 mice per group). Mice were infected with influenza virus (2000 PFU) 3 hours after siRNA administration. Lungs were harvested 24 hours post-infection and virus titer measured. In a preliminary experiment mice were anesthetized with avertin and 2 mg/kg siRNA was administered by intranasal drip. A reduction in virus titer relative to controls was observed, although it did not reach statistical significance (data not shown).

In a second experiment, Black Swiss mice were anesthetized using isofluorane/O 2 . Various amounts of siRNA in PBS was intranasally administered into the mice., 50 ul each mouse. Three different groups (5 mice per group) received doses of 2 mg/kg, 4 mg/kg, or 10 mg/kg siRNA in PBS by intranasal drip. A fourth group that received PBS alone served as a control. Three hours later, the mice were anesthetized again using isofluorane/0 2 , 30 ul of PR8 virus (2000 pfu = 4 x lethal dose) was intranasally administered into the mice. 24 h after infection, the mouse lungs were harvested,

homogenized and virus titer was measured by evaluation of the TCID 50 as described above. Serial 5-fold dilutions of the lung homogenate were performed rather than 10-fold dilutions.

A significant and dose-dependent differerence in virus titer was seen between mice in each of the three treated groups and the controls (Table 12). The reduction in virus titer relative to controls was 3.45-fold (p = 0.0125), 4.16-fold (p = 0.0063), and 4.62-fold (p = 0.0057) in the groups that received doses of 2 mg/kg, 4 mg/kg, and 10 mg/kg respectively.

In summary, these results demonstrate the efficacy of siRNA delivered to the respiratory system in an aqueous medium in the absence of specific agents to enhance delivery.

Table 12: Intranasal Delivery of Naked siRNA Inhibits Influenza Virus Production

Example 12: Inhibition of Influenza Virus Production in Mice by Direct Delivery of Naked siRNA to the Respiratory System

This example confirms results above and demonstrates inhibition of influenza virus production in the lung by administration of siRNA targeted to NP to the respiratory system in an aqueous medium in the absense of delivery-enhancing agents. Six μg, 15 ug, 30 μg, and 60 μg of NP-1496 siRNAs or 60 μg of GFP-949 siRNAs in PBS were intranasally instilled into mice essentially as described above, except that mice were intranasally infected with PR8 virus, 1000 pfu per mouse, two hours after siRNA delivery. Lungs were harvested 24 hours after infection. NP-specific siRNA was effective for the inhibition of influenza virus when administered by intranasal instillation in an aqueous medium in the absence of delivery agents. A significant and dose-

dependent differerence in virus titer was seen between mice in each of the three treated groups and the controls (Table 13).

Table 13: Inhibition of Influenza Virus Production in the Lun Usin Naked siRNA

Example 13: Modified SiRNAs Mediate Effective Silencing

To explore the silencing potential of siRNAs containing modified nucleotides, NP- 1496 siRNA containing sense and antisense strands with 2'-O-methyl modifications at alternate ribonucleotides in each strand were synthesized and tested in comparison with unmodified NP-1496 siRNA. The 2'-O-methyl modified NP1496 siRNA sequences were as follows: (2'-O-methyl shown as "m" in front of the modified nucleotide):

Sense: 5'-GmGA mUCmU UmAU mUUmC UmUC mGGmA G dTdT-3' (SEQ ID NO:

10792) Antisense: 5 '-mCUmC CmGA mAGmA AmAU mAAmG AmUC mC dTdT-3 ' (SEQ ID NO: 10793)

The 2'-O-methyl modified NP1496 siRNA and unmodified NP1496 siRNA were transfected into Vero cells in 24-well plate using lipofectamine 2000 (Invitrogen) following the manufacturer's instructions. 6 hours after transfection, the culture media was aspirated. The cells were inoculated with 200 μl of PR8 virus at MOI of 0.1. The culture supernatant was collected at 24, 36 and 48 hours after infection. Virus titer was determined as described above. The 2'-O-methyl modified NP 1496 showed slightly more inhibition of virus growth than unmodified NP1496. Results are shown in Table 14.

I 55

Table 14: Effective Inhibition Virus Production Usin Modified iR A

Example 14: Inhibition of Influenza Virus by orotracheal Delivery of Naked siRNA to the Respiratory System siRNA preparation, viral infection, lung harvests, and influenza virus titer assays were performed as described above. Mice were anesthetized using avertin (administered by intraperitoneal injection). 1 mg/kg siRNA was delivered in a volume of 175 μl by oraltracheal injection. siRNA (NP- 1496), 1 mg/kg, and 30 ul Infasurf in 5% glucose was administered to groups of mice (5 mice per group). Mice were infected with influenza virus (2000 PFU) 3 hours after siRNA administration. Lungs were harvested 24 hours post-infection and virus titer measured.

In a second experiment, Black Swiss mice were anesthetized using intraperitoneal^ administered avertin. NP-1496 siRNA and GFP-949 siRNAin PBS was intratracheally administered into the mice, 50 μl each mouse. A third group that received PBS alone served as a control. Three hours later, the mice were anesthetized again using isofluorane/O 2 , 30 ul of PR8 virus (2000 pfu = 4 x lethal dose) was intranasally administered into the mice. Twenty-four hours after infection, the mouse lungs were harvested, homogenized and virus titer was measured by evaluation of the TCID 50 as described above. Serial 5-fold dilutions of the lung homogenate were performed rather than 10-fold dilutions.

In summary, these results demonstrate the efficacy of siRNA delivered to the respiratory system in an aqueous medium in the absence of specific agents to enhance delivery.

Example 15: Nucleotide Mutation Tolerance Studies to Select siRNAs that Target a Broad Spectrum of Influenza Viruses.

This example demonstrates that siRNAs whose antisense strands are less than 100% complementary to the targeted transcript within the inhibitory region (e.g., within the 19 base pair region that is complementary to the target transcript) mediate effective silencing. The results demonstrate that the RNAi agents described herein will effectively inhibit a wide range of influenza strains whose sequences vary from that of PR8 within the target portion.

A dual luciferase assay was used to evaluate the ability of siRNAs to inhibit expression of influenza genes that are not 100% complementary to the antisense strand of the siRNA within the 19 nucleotide inhibitory region. Mismatches derived from the alignment of human and avian influenza virus strains (using PR8 as standards) were introduced into the DNA vector (psiCHECK) using a site-directed mutagenesis kit (Stratagene), i.e., the influenza target site was modified to include either 1 or 2 differences relative to the PR8 sequence, with the specific differences corresponding to differences found in one or more of the human or avian influenza strains.

Table 15 shows results of an experiment demonstrating that variations in the viral NP target (target for NP- 1496) do not substantially reduce RNAi activity. (The data shown is the average of triplicates). Mismatches at positions near the 5' or 3' end of the antisense strand, or near the middle, were tested.

Table 15: Effect of mismatches between antisense strand and target region on silencing bv NP- 1496

Variations in the viral PA target (target for PA-2087 or PA-8282) do not substantially reduce RNAi activity. However, Gl 8 to Al 8 mutations found in 7 among 157 human influenza strains did substantially affect the RNA interference activity. Mismatches at positions near the 5' or 3' end of the antisense strand, or near the middle,

were tested. The presence of two mismatches between the antisense strand inhibitory region and the target reduced the silencing by about 70-75%, but a useful degree of silencing was still observed (Table 16).

Table 16: Effect of mismatches between antisense strand and target region on silencing bv PA-2087 or PA-8242

Original A4 to T6 T6 Cl 5 to Gl 8 to Al 9 to T6 to G4 to to T15 A18 G19 C6 and A6 C6 C15 to T15

Renilla luciferase 91.7 80.8 75.9 88.8 87.5 7.0 89.3 26.8 silencing (%)

Remaining 100 88.1 82.8 96.9 95.5 7.6 97.4 29.3 silencing comparing with original (%)

Table 17 shows results of an experiment demonstrating that variations in the viral PB2 target (target for PB2-3817) do not substantially reduce RNAi activity. (The data shown is the average of triplicates).

Table 17: Effect of mismatches between antisense strand and target region on silencing bv PB2-3817

Table 18 shows results of an experiment demonstrating that variations in the viral PBl target (target for PB 1-6124) do not substantially reduce RNAi activity. (The data shown is the average of triplicates). Mismatches at positions near the 5 ' or 3' end of the antisense strand, or near the middle, were tested. The presence of two mismatches between the antisense strand inhibitory region and the target reduced the silencing by about 70-75%, but a useful degree of silencing was still observed.

1S8

Table 18-1 : Effect of mismatches between antisense strand and target region on silencing bv PBl-6124

C15 to A1 to T2 to G3 to A1 to T1 and Gc

PB1 (Lab ID# 6124 siRNA) Original T15 G1 C2 A3 A3

Ranilla luciferase silencing

(%) 84.1 71.8 82.8 84.5 74.9 73.5

Remaining silencing comparing with original (%) 100 85.4 98.5 100 89.1 87.J

Table 18-2: effect of mismatches between antisense strand and target region on silencing by PBl-6129.

T3 to T7 to T2 to C10 to T3 to C3 and C1

PB1 (Lab ID# 6129 siRNA) Original C3 C7 C2 T10 to T10

Ranilla luciferase silencing

(%) 86.4 87.3 84.4 84.5 81.3 59.(

Remaining silencing comparing with original (%) 100 100 97.8 100 94.2 68.:

In addition to the results shown in the tables above, the present example also demonstrates that ten exemplary siRNAs duplexes of the present invention tolerate mismatches between the nucleotide sequence of the anti-sense strand of the siRNA and the nucleotide sequence of the targeted regions of viral transcripts. In the instant example, the capacity of the 21 previously identified siRNA to tolerate target sequence mutations was determined. To accomplish this, the target sites of all human and avian influenza gene sequences (available at www.lanl.flu.gov) were aligned using the PR8 strain as a reference. Single nucleotide polymorphism (SNPs) were identified and introduced into the dual-luciferase reporter construct using site-directed mutagenesis. Expression vectors containing the control target sequence (PR8) or the variants were subsequently transfected into Vero cells along with the 50 nM of the appropriate targeting siRNAs to determine the sensitivity of each siRNA to tolerate nucleotide mismatches. Of the 21 siRNAs, ten siRNAs exhibited high degrees of silencing (% silencing) and were highly tolerant of target site polymorphisms. Table 19 summarizes the percent silencing data for the ten siRNAs (INFsi-1 through INFsi-8 and G1499 and G4276). The nucleotide sequence for each siRNA is shown and the nucleotides that were the target for site-directed mutagenesis are bolded and underlined. The "Mismatch" column illustrates the original nucleotide and its position, shown in parenthesis, within the siRNA along with the nucleotide that was substituted for the original nucleotide (mutated nucleotide). The percent silencing is presented as percentage of silencing observed with the native (PR8) silencing. Therefore, 100% relative silencing indicates that the mismatch had no

effect on the functionality of the siRNA compared to its ability to silence the exact match target sequence (PR8). Any decrease in the percent relative silencing represents the degree of sensitivity of the siRNA for that mismatch in the target sequence (i.e., a lower percentage equates to a decrease in the functionality of the siRNA; "functionality" defined in this context as the ability of the siRNA to degrade it target RNA).

Table 19. Sensitivity of siRNAs to Naturally Occurring Target Site Point Mutations.

As shown in Table 19, only 19 of the 70 avian/human targets containing SNPs disrupted siRNλ functionality by more than 30%. Two of these mutants, INFsi-2 have changes at position 9 (Mutation-9 for siRNA INFsi-2 and Mutation-4 for siRNA INFsi- 1 ), which is adjacent to the RISC mediated cleavage site and predicted to be particularly sensitive to base pair mismatches. Surprisingly, three of the four variants containing two target site polymorphisms showed significant reductions in the levels of target cleavage (e.g. siRNA INFsi-4). None of the variants carrying the individual polymorphisms significantly impeded siRNA functionality.

These data show that ten siRNAs exhibited broad targeting properties against the majority of human and avian influenza virus strains demonstrating that these ten siRNAs have great potential as a multi-gene targeting strategy for effective RNAi therapeutics. In Table 20, the most highly conserved sequences from Table 2 have been matched to additional 19-mer sequences taken from other members of the influenza A viral sequence variants that differ from the primary 19-mer sequence by having only one

or a few nucleotide mismatches. The 19-mer sequences in Table 20 are obtained by searching the list of sequences described in Table 1 using the highly conserved 19-mer sequence fragments shown in Table 2 as the reference sequence. For each 19-mer reference sequence, a target fragment is found in each of the target influenza A viral sequences that is the most closely matching 19-mer fragment. The most-closely matching target fragments are those that have the fewest number of nucleotide differences between the reference fragment and the target fragment. If two target fragments have the same number of nucleotide differences with the reference fragment, then preference is given to the target fragment that can form more GU wobble base pairs between the sense strand of the target fragment and the antisense strand of the reference fragment.

Table 20-1: Conserved and minor variant 19-mer sequences from the Influenza A segment 1 (PB2) sequences listed in Table 1-1. The conserved sequences match at least 89% of the listed viral sequences, and the variants contain 3 or fewer nucleotide changes from the reference sequence.

Seq Kef % 5519 11 CAGCGACCAAAAGgAOUCG 0.3% ID ID Match Seq Total

5520 11 CAGCGACCAAAgGAAOUCG 0.1%

1 1 GCCAGACAGCGACCAAAAG 99.5%

5521 11 CAGCGACCAAAAGAUUUCG 0.1%

5 484 1 GCCAGACAGCGACCAAfigG 0.1%

5522 11 CgaCGAuCAAAAGAAUOCG 0.1%

5485 1 GuCAGACAGCGACCAAAAG 0.2%

12 12 ACUUACUGACAGCCAGACA 99.1%

5486 1 GCCAGACgaCGAuCAAAAG 0.1%

5523 12 ACUUACUGACAGuCAGACA 0.2%

2 2 AGCCAGACAGCGACCAAAA 99.5%

5524 12 ACUUACUGAuAGCCAGACA 0.3%

5487 2 AGCCAGACAGCGACCAAAg 0.1%

5525 12 gCUUACUGACAGCCAGACg 0.1%

5488 2 AGuCAGACAGCGACCAAAA 0.2%

5526 12 ACUUACcGACAGCCAGACA 0.2%

5489 2 AGCCAGACgaCGAuCAAAA 0.1%

13 13 CAUACUOACOGACAGCCAG 99.1%

3 3 ACAGCCAGACAGCGACCAA 99.3%

5527 13 CAUACUUACUGACAGuCAG 0.2%

5490 3 ACAGuCAGACAGCGACCAA 0.2%

5528 13 CAOACUOACOGAuAGCCAG 0.3%

5491 3 AuAGCCAGACAGCGACCAA 0.3%

5529 13 CAϋgCUOACOGACAGCCAG 0.1%

5492 3 ACAGCCAGACgaCGAuCAA 0.1%

5530 13 CAOACOUACcGACAGCCAG 0.2%

4 4 GACAGCCAGACAGCGACCA 99.3%

14 14 cuuACOGACAGCCAGACAG 99.1%

5493 4 GACAGuCAGACAGCGACCA 0.2%

5531 14 CUUACOGACAGUCAGACAG 0.2%

5494 4 GAuAGCCAGACAGCGACCA 0.3%

5532 14 CUUACUGAuAGCCAGACAG 0.3%

5495 4 GACAGCCAGACgaCGAuCA 0.1%

5533 14 CUUACcGACAGCCAGACAG 0.2%

5 5 CAGACAGCGACCAAAAGAA 99.3%

5534 14 CUUACUGACAGCCAGACga 0.1%

5496 S CAGACAGCGACCAAAAGgA 0.3%

15 15 GACAGCGACCAAAAGAAUO 99.1%

5497 5 CAGACAGCGACCAAAgGAA 0.1%

5535 15 GACAGCGACCAAAAGgAOO 0.3%

5498 5 CAGACAGCGACCAAAAGAu 0.1%

5536 15 GACAGCGACCAAAgGAAϋϋ 0.1%

5499 5 CAGACgaCGAuCAAAAGAA 0.1%

5537 15 GACAGCGACCAAAAGAUOO 0.1%

6 6 CAGCCAGACAGCGACCAAA 99.3%

5538 15 GACgaCGAuCAAAAGAAOO 0.1%

550Q 6 CAGuCAGACAGCGACCAAA 0.2%

16 16 AUACUOACUGACAGCCAGA 99.1%

5S01 6 uAGCCAGACAGCGACCAAA 0.3%

5539 16 AUACUOACUGACAGUCAGA 0.2%

5 502 6 CAGCCAGACgaCGAuCAAA 0.1%

5540 16 AUACUOACOGAuAGCCAGA 0.3%

7 7 AGACAGCGACCAAAAGAAU 99.1%

5541 16 AUgCUOACOGACAGCCAGA 0.1%

5503 7 AGACAGCGACCAAAAGgAU 0.3%

5542 16 AUACUOACcGACAGCCAGA 0.2%

5504 7 AGACAGCGACCAAAgGAAU 0.1%

17 17 ACOGACAGCCAGACAGCGA 99.0%

5505 7 AGACAGCGACCAAAAGAuU 0.1%

5543 17 ACOGACAGuCAGACAGCGA 0.2%

5 5 06 7 AGACgaCGAuCAAAAGAAU 0.1%

Q Q 5544 17 ACUGAuAGCCAGACAGCGA 0.3%

O O CCAGACAGCGACCAAAAGA 99.1%

5545 17 ACcGACAGCCAGACAGCGA 0.2%

5507 8 CCAGACAGCGACCAAAAGg 0.3%

5546 17 ACUGACAGCCAGACgaCGA 0.1%

5508 8 CCAGACAGCGACCAAAgGA 0.1%

18 18 GCAUACUOACUGACAGCCA 99.0%

5509 8 UCAGACAGCGACCAAAAGA 0.2%

5547 18 GCAUACOUACUGACAGUCA 0.2%

5510 8 CCAGACgaCGAuCAAAAGA 0.1%

5548 18 GCAUACUUACUGAUAGCCA 0.3%

9 9 ACAGCGACCAAAAGAADUC 99.1%

5549 18 GCAUgCUUACUGACAGCCA 0.1%

5511 9 ACAGCGACCAAAAGgAUUC 0.3%

5550 18 aCAUACUUACUGACAGCCA 0.1%

5512 9 ACAGCGACCAAAgGAAUUC 0.1%

5551 18 GCAUACUUACcGACAGCCA 0.2%

5513 9 ACAGCGACCAAAAGAuUUC 0.1%

19 19 AGCAUACUUACUGACAGCC 99.0%

5514 9 ACgaCGAuCAAAAGAAUUC 0.1%

5552 19 AGCAUACUUACUGACAGuC 0.2%

10 10 UACUUACUGACAGCCAGAC 99.1%

5553 19 AGCAOACUUACUGAuAGCC 0.3%

5515 10 UACUUACUGACAGUCAGAC 0.2%

5554 19 AGCAUgCUOACUGACAGCC 0.1%

5516 10 UACUUACUGAUAGCCAGAC 0.3%

5555 19 AaCAUACUUACUGACAGCC 0.1%

5517 10 OgCUUACOGACAGCCAGAC 0.1%

5556 19 AGCAUACUUACCGACAGCC 0.2%

5518 10 OACUUACcGACAGCCAGAC 0.2%

20 20 UGACAGCCAGACAGCGACC 99.0%

11 11 CAGCGACCAAAAGAAUOCG 99.1%

5557 20 UGACAGuCAGACAGCGACC 0.2%

555 8 20 UGAuAGCCAGACAGCGACC 03% 5604 28 UCUAaCAUACUUACUGACA 01%

555 9 20 CGACAGCCAGACAGCGACC 02% 5605 28 UCUAGCAUACUUACcGACA 02%

5 S60 20 UGACAGCCAGACgaCGAuC 01% 29 29 CAAAAGAAUUCGGAUGGCC 986%

21 21 UACUGACAGCCAGACAGCG 990% 5606 29 CAAAAGgAUUCGGAUGGCC 02%

5561 21 UACUGACAGuCAGACAGCG 02% 5607 29 CAAAgGAAUUCGGAUGGCC 01%

5562 21 UACUGAuAGCCAGACAGCG 03% 5608 29 CAAAAGAAUUCGaAUGGCC 06%

5563 21 UACcGACAGCCAGACAGCG 02% 5609 29 CAAAAGAuϋUCGGAUGGCC 01%

5564 21 UACUGACAGCCAGACgaCG 01% 5610 29 CAAAAGgAUUCGaAUGGCC 01%

22 22 UUACUGACAGCCAGACAGC 990% 30 30 AAAAGAAUUCGGAUGGCCA 986%

5565 22 UUACUGACAGUCAGACAGC 02% 5611 30 AAAAGgAUUCGGAUGGCCA 02%

5566 22 UUACUGAuAGCCAGACAGC 03% 5612 30 AAAgGAAUUCGGAUGGCCA 01%

5567 22 UUACcGACAGCCAGACAGC 02% 5613 30 AAAAGAAUUCGaAUGGCCA 06%

5568 22 UUACUGACAGCCAGACgaC 01% 5614 30 AAAAGAuUUCGGAUGGCCA 01%

23 23 CUGACAGCCAGACAGCGAC 990% 5615 30 AAAAGgAUUCGaAUGGCCA 01%

5569 23 CUGACAGuCAGACAGCGAC 02% 31 31 GACCAAAAGAAUUCGGAUG 985%

5570 23 CUGAuAGCCAGACAGCGAC 03% 5616 31 GACCAAAAGgAUUCGGAUG 02%

5571 23 CcGACAGCCAGACAGCGAC 02% 5617 31 GACCAAAgGAAUUCGGAUG 01%

5572 23 CUGACAGCCAGACgaCGAu 01% 5618 31 GAuCAAAAGAAUUCGGAUG 01%

24 24 ACUCUAGCAUACUUACUGA 989% 5619 31 GACCAAAAGAAUUCGaAUG 06%

5573 24 ACUCUAGCAUgCUUACUGA 01% 5620 31 GACCAAAAGAuUUCGGAUG 01%

5574 24 ACUuUAGCAUACUUACUGA 02% 5621 31 GACCAAAAGgAUUCGaAUG 01%

5575 24 ACUaUAGCAUACUUACUGA 01% 32 32 CGACCAAAAGAAUUCGGAU 985%

5576 24 ACUCcAGCAUACUUACUGA 03% 5622 32 CGACCAAAAGgAUUCGGAU 02%

5577 24 ACUCUAaCAUACUUACUGA 01% 5623 32 CGACCAAAgGAAUUCGGAU 01%

5578 24 ACUCUAGCAUACUUACcGA 02% 5624 32 CGAuCAAAAGAAUUCGGAU 01%

25 25 UAGCAUACUUACUGACAGC 987% 5625 32 CGACCAAAAGAAUUCGaAU 06%

5579 25 UAGCAUACUUACUGACAGU 02% 5626 32 CGACCAAAAGAUUUCGGAU 01%

5580 25 UAGCAUACUUACUGAUAGC 03% 5627 32 CGACCAAAAGgAUUCGaAU 01%

5581 25 UAGCAUgCUUACUGACAGC 01% 33 33 AAAGAAUUCGGAUGGCCAU 985%

5582 25 CAGCAUACUUACUGACAGC 03% 5628 33 AAAGgAUUCGGAUGGCCAU 02%

5583 25 UAaCAUACUUACUGACAGC 01% 5629 33 AAgGAAUUCGGAUGGCCAU 01%

5584 2 5 UAGCAUACUUACcGACAGC 02% 5630 33 AAAGAAUUCGaAUGGCCAU 06%

26 26 CUCUAGCAUACUUACUGAC 986% 5631 33 AAAGAAUUCGGAUGGCCAc 01%

5585 26 CUCUAGCAUACUUACUGAU 03% 5632 33 AAAGAuUUCGGAUGGCCAϋ 01%

5586 26 CUCUAGCAUgCUUACUGAC 01% 5633 33 AAAGgAUUCGaAUGGCCAU 01%

5587 26 CUuUAGCAUACUUACUGAC 02% 34 34 AGCGACCAAAAGAAUUCGG 985%

5588 26 CUaUAGCAUACUUACUGAC 01% 5634 34 AGCGACCAAAAGgAUUCGG 02%

5589 26 CUCcAGCAUACUUACUGAC 03% 5 635 34 AGCGACCAAAgGAAUUCGG 01%

5590 26 CUCUAaCAUACUUACUGAC 01% 5636 34 AGCGACCAAAAGAAUUCGa 06%

5591 26 CUCUAGCAUACUUACCGAC 02% 5637 34 AGCGACCAAAAGAUUUCGG 01%

27 27 CUAGCAUACUUACUGACAG 986% 5638 34 AGCGACCAAAAGgAUUCGa 01%

5592 27 CUAGCAUACUUACUGAUAG 03% 5639 34 gaCGAuCAAAAGAAUUCGG 01%

5593 27 CUAGCAUgCUUACUGACAG 01% 35 35 GCGACCAAAAGAAUUCGGA 985%

5594 27 uUAGCAUACUUACUGACAG 02% 5640 35 GCGACCAAAAGgAUUCGGA 02%

5595 27 aUAGCAUACUUACUGACAG 01% 5641 35 GCGACCAAAgGAAUUCGGA 01%

5596 27 CcAGCAUACUUACUGACAG 03% 5642 35 GCGACCAAAAGAAUUCGaA 06%

5597 27 CUAaCAUACUUACUGACAG 01% 5643 35 GCGACCAAAAGAUUUCGGA 01%

5598 27 CUAGCAUACUUACcGACAG 02% 5644 35 aCGAuCAAAAGAAUUCGGA 01%

28 28 UCUAGCAUACUUACUGACA 986% 5645 35 GCGACCAAAAGgAUUCGaA 01%

5599 28 UCUAGCAUACUUACUGAUA 03% 36 36 CCAAAAGAAUUCGGAUGGC 985%

5600 28 UCUAGCAUgCUUACUGACA 01% 5646 36 CCAAAAGgAUUCGGAUGGC 02%

5601 28 UuUAGCAUACUUACUGACA 02% 5647 36 CCAAAgGAAUUCGGAUGGC 01%

5602 28 UaUAGCAUACUUACUGACA 01% 5648 36 UCAAAAGAAUUCGGAUGGC 01%

5603 28 UCcAGCAUACUUACUGACA 03% 5649 36 CCAAAAGAAUUCGaAUGGC 06%

5 6 5 0 36 CCAAAAGAuUUCGGAUGGC 0.1% 5696 44 AAUUCGGAUGGCCAUUAAU 0.2%

5 6 5 1 36 CCAAAAGgAUOCGaAUGGC 0.1% 5697 44 gAUUCGGAUGGCCAUCAAU 0.2%

37 37 ACCAAAAGAAUUCGGAUGG 98.5% 5698 44 AAUUCGaAUGGCCAUCAAU 0.6%

5 652 37 ACCAAAAGgAUUCGGAUGG 0.2% 5699 44 AAUUCGGAUGGCCACCAAU 0.1%

5653 37 ACCAAAgGAAUUCGGAUGG 0.1% 5700 44 AAUUCGGAUGGCCAUCAAg 0.1%

5654 37 AuCAAAAGAAUUCGGAUGG 0.1% 5701 44 AuUUCGGAUGGCCAUCAAU 0.1%

5 655 37 ACCAAAAGAAUUCGaAUGG 0.6% 5702 44 gAUUCGaAUGGCCAUCAAU 0.1%

5656 37 ACCAAAAGAuϋUCGGAUGG 0.1% 45 45 GGGACUCUAGCAUACUUAC 98.1%

5657 37 ACCAAAAGgAUUCGaAUGG 0.1% 5703 45 GGGACUCUAGCAUgCUUAC 0.1%

38 38 GAAUUCGGAUGGCCAUCAA 98.4% 5704 45 GGGACϋuUAGCAUACUUAC 0.2%

5658 38 GAAUUCGGAUGGCCAUUAA 0.2% 5705 45 GaGACUCUAGCAUACUUAC 0.4%

5659 38 GgAUUCGGAUGGCCAUCAA 0.2% 5706 45 GGaACUCUAGCAUACUUAC 0.6%

5660 38 GAAUUCGaAUGGCCAUCAA 0.6% 5707 45 GGGACUaUAGCAUACUUAC 0.1%

5661 38 GAAUUCGGAUGGCCACCAA 0.1% 5708 45 GGGACUCcAGCAUACUUAC 0.3%

5662 38 GAuUUCGGAUGGCCAUCAA 0.1% 5709 45 GGGACUCUAaCAUACUUAC 0.1%

5663 38 GgAUUCGaAUGGCCAUCAA 0.1% 46 46 ACGGGACUCUAGCAUACUU 98.0%

39 39 UUCGGAUGGCCAUCAAUUA 98.3% 5710 46 ACGGGACUCUAGCAUgCUU 0.1%

5664 39 UϋCGGAUGGCCAUuAAUUA 0.2% 5711 46 ACGGGACUuUAGCAUACUU 0.2%

5665 39 UUCGaAUGGCCAUCAAUUA 0.7% 5712 46 AaGGGACUCUAGCAUACUU 0.1%

5666 39 UUCGGAUGGCCAUCAAgUA 0.1% 5713 46 ACGaGACUCUAGCAUACUU 0.4%

5667 39 UUCGGAUGGCCAUCAAUag 0.1% 5714 46 ACGGaACUCUAGCAUACUU 0.6%

5668 39 UUCGGAUGGCCAcCAAUgA 0.1% 5715 46 ACGGGACUaUAGCAUACUU 0.1%

40 40 AUUCGGAUGGCCAUCAAUU 98.3% 5716 46 ACGGGACUCcAGCAUACUU 0.3%

5669 40 AUUCGGAUGGCCAUUAAUU 0.2% 5717 46 ACGGGACUCUAaCAUACUU 0.1%

5670 40 AUUCGaAUGGCCAUCAAUU 0.7% 47 47 AAACGGGACUCUAGCAUAC 98.0%

5671 40 AUUCGGAUGGCCAUCAAgU 0.1% 5718 47 AAACGGGACUCUAGCAUgC 0.1%

5672 40 AUUCGGAUGGCCAUCAAUa 0.1% 5719 47 AAACGGGACUUUAGCAUAC 0.2%

5673 40 UUUCGGAUGGCCAUCAAUU 0.1% 5720 47 AAAaGGGACUCUAGCAUAC 0.1%

5674 40 AUUCGGAUGGCCAcCAAUg 0.1% 5721 47 AAACGaGACUCUAGCAUAC 0.4%

41 41 GACUCUAGCAUACUUACUG 98.3% 5722 47 AAACGGaACUCUAGCAUAC 0.6%

5675 41 GACUCUAGCAϋgCOUACUG 0.1% 5723 47 AAACGGGACUaUAGCAUAC 0.1%

5676 41 GACUuUAGCAUACUUACUG 0.2% 5724 47 AAACGGGACUCcAGCAUAC 0.3%

5677 41 aACUCUAGCAUACUUACUG 0.6% 5725 47 AAACGGGACUCUAaCAUAC 0.1%

5678 41 GACUaUAGCAUACUUACUG 0.1% 48 48 CGGGACUCUAGCAUACUUA 98.0%

5679 41 GACϋCcAGCAUACUUACUG 0.3% 5726 48 CGGGACUCUAGCAUgCUUA 0.1%

5680 41 GACUCUAaCAUACUUACUG 0.1% 5727 48 CGGGACUuϋAGCAUACUUA 0.2%

5681 41 GACUCUAGCAUACUUACcG 0.2% 5728 48 aGGGACUCUAGCAUACUUA 0.1%

42 42 AGAAUUCGGAUGGCCAUCA 98.3% 5729 48 CGaGACUCUAGCAUACUUA 0.4%

5682 42 AGAAUUCGGAUGGCCAϋuA 0.2% 5730 48 CGGaACUCUAGCAUACUUA 0.6%

5683 42 AGgAUUCGGAUGGCCAUCA 0.2% 5731 48 CGGGACUaUAGCAUACUUA 0.1%

5684 42 gGAAUUCGGAUGGCCAUCA 0.1% 5732 48 CGGGACUCcAGCAUACUUA 0.3%

5685 42 AGAAUUCGaAUGGCCAUCA 0.6% 5733 48 CGGGACUCUAaCAUACUUA 0.1%

5686 42 AGAAUUCGGAUGGCCACCA 0.1% 49 49 AACGGGACUCUAGCAUACU 98.0%

5687 42 AGAuUUCGGAUGGCCAUCA 0.1% 5734 49 AACGGGACUCUAGCAUgCU 0.1%

5688 42 AGgAUUCGaAUGGCCAUCA 0.1% 5735 49 AACGGGACUuUAGCAUACU 0.2%

43 43 AAGAAUUCGGAUGGCCAUC 98.3% 5736 49 AAaGGGACUCUAGCAUACU 0.1%

5689 43 AAGAAUUCGGAUGGCCAUU 0.2% 5737 49 AACGaGACUCUAGCAUACU 0.4%

5690 43 AAGgAUUCGGAUGGCCAUC 0.2% 5738 49 AACGGaACUCUAGCAUACU 0.6%

5691 43 AgGAAUUCGGAUGGCCAUC 0.1% 5739 49 AACGGGACUaUAGCAUACU 0.1%

5692 43 AAGAAUUCGaAUGGCCAUC 0.6% 5740 49 AACGGGACUCCAGCAUACU 0.3%

5693 43 AAGAAUUCGGAUGGCCACC 0.1% 5741 49 AACGGGACUCUAaCAUACU 0.1%

5694 43 AAGAuUUCGGAUGGCCAUC 0.1% 50 50 GGACUCUAGCAUACUUACU 97.9%

5695 43 AAGgAUUCGaAUGGCCAUC 0.1% 5742 50 GGACUCUAGCAUgCUUACU 0.1%

44 44 AAUUCGGAUGGCCAUCAAU 98.3% 5743 50 GGACUuUAGCAUACUUACU 0.2%

5 744 50 aGACUCUAGCAUACUUACU 0.4% 5787 61 GAAACGAAAACGGaACUCU 0.5%

5 745 50 GaACOCϋAGCAOACϋUACU 0.6% 5788 61 GAAACGAAAACGGGACUaU 0.1%

5 746 50 GGACϋaOAGCAϋACϋϋACU 0.1% 5 789 61 GAAACGAAAACGGGACUCC 0.3%

5 747 50 GGACUCcAGCAUACOOACD 0.3% 5790 61 GAAACGgAAACGaGACUCU 0.1%

5 748 50 GGACUCUAaCAUACUUACU 0.1% 5791 61 GAAACGgAAACGGaACUCU 0.1%

S749 50 GGACOCϋAGCAUACUUACc 0.2% 62 62 AAACGAAAACGGGACUCUA 95.5%

51 51 AϋGAAACGAAAACGGGACO 95.9% 5792 62 AAACGgAAACGGGACUCUA 2.7%

5 750 51 AOGAAACGgAAACGGGACU 2.9% 5793 62 AAACGgAAACGGGACUuUA 0.2%

5751 51 AUGAAACGAAAAaGGGACU 0.1% 5794 62 AAACGAAAAaGGGACUCOA 0.1%

5752 51 AUGAAACGAAAACGaGACU 0.3% 5795 62 AAACGAAAACGaGACUCUA 0.3%

5753 51 AUGAAACGAAAACGGaACO 0.5% 5796 62 AAACGAAAACGGaACUCUA 0.5%

5754 51 AUGAAACGgAAACGaGACU 0.1% 5797 62 AAACGAAAACGGGACUaUA 0.1%

5755 51 AUGAAACGgAAACGGaACU 0.1% 5798 62 AAACGAAAACGGGACUCcA 0.3%

52 52 GAAAOGGAOGAOGGCAAUG 95.8% 5799 62 AAACGgAAACGaGACUCUA 0.1%

5756 52 GAAAOGGAOGAuGGCgAOG 4.1% 5800 62 AAACGgAAACGGaACUCUA 0.1%

5757 52 GAAgOGGAOGAUGGCAAUG 0.1% 63 63 AACGAAAACGGGACUCUAG 95.4%

53 53 OGGAOGAOGGCAAUGAAAO 95.8% 5801 63 AACGgAAACGGGACUCUAG 2.7%

5 7 5 8 53 UGGAOGAOGGCAAUGAgAU 0.1% 5802 63 AACGgAAACGGGACUuOAG 0.2%

5759 53 UGGAUGAUGGCgAUGAAAU 4.1% 5803 63 AACGAAAAaGGGACUCUAG 0.1%

54 54 UGAAAUGGAOGAOGGCAAU 95.8% 5804 63 AACGAAAACGaGACUCUAG 0.3%

5760 54 UGAAAOGGAOGAOGGCgAU 4.1% 5805 63 AACGAAAACGGaACUCUAG 0.5%

5761 54 UGAAgOGGAUGAOGGCAAU 0.1% 5806 63 AACGAAAACGGGACUaUAG 0.1%

55 55 GGAUGAOGGCAAOGAAAOA 95.8% 5807 63 AACGAAAACGGGACUCcAG 0.3%

5762 55 GGAUGAUGGCAAUGAgAUA 0.1% 5808 63 AACGAAAACGGGACUCUAa 0.1%

5763 55 GGAUGAUGGCgAUGAAAOA 4.1% 5809 63 AACGgAAACGaGACUCUAG 0.1%

56 56 UGAAACGAAAACGGGACUC 95.8% 5810 63 AACGgAAACGGaACUCUAG 0.1%

5764 56 UGAAACGgAAACGGGACUC 2.7% 64 64 GAAAACGGGACUCUAGCAU 95.4%

5765 56 UGAAACGgAAACGGGACUu 0.2% 5811 64 GgAAACGGGACUCUAGCAO 2.7%

5766 56 OGAAACGAAAAaGGGACOC 0.1% 5812 64 GgAAACGGGACOuUAGCAU 0.2%

5767 56 UGAAACGAAAACGaGACOC 0.3% 5813 64 GAAAAaGGGACUCUAGCAU 0.1%

5768 56 UGAAACGAAAACGGaACOC 0.5% 5814 64 GAAAACGaGACUCUAGCAU 0.3%

5769 56 UGAAACGAAAACGGGACUa 0.1% 5815 64 GAAAACGGaACUCUAGCAU 0.5%

5770 56 UGAAACGgAAACGaGACUC 0.1% 5816 64 GAAAACGGGACUaUAGCAU 0.1%

5771 56 UGAAACGgAAACGGaACUC 0.1% 5817 64 GAAAACGGGACUCCAGCAU 0.3%

57 57 AAAUGGAUGAOGGCAAOGA 95.8% 5818 64 GAAAACGGGACUCUAaCAU 0.1%

5772 57 AAAOGGAOGAOGGCgAOGA 4.1% 5819 64 GgAAACGaGACUCOAGCAU 0.1%

5773 57 AAgUGGAUGAUGGCAAUGA 0.1% 5820 64 GgAAACGGaACOCOAGCAU 0.1%

58 58 AUGGAUGAUGGCAAUGAAA 95.7% 6 5 65 ACGAAAACGGGACOCOAGC 95.4%

5774 58 AUGGAUGAUGGCAAUGAgA 0.1% 5821 65 ACGgAAACGGGACUCUAGC 2.7%

5775 58 AOGGAOGAUGGCgAUGAAA 4.1% 5 822 65 ACGgAAACGGGACUuUAGC 0.2%

5776 58 gOGGAOGAUGGCAAUGAAA 0.1% 5823 65 ACGAAAAaGGGACUCUAGC 0.1%

59 59 AAUGGAOGAUGGCAAUGAA 95.7% 5824 65 ACGAAAACGaGACUCUAGC 0.3%

5777 59 AAUGGAOGAUGGCAAUGAg 0.1% 5825 65 ACGAAAACGGaACUCUAGC 0.5%

5778 59 AAUGGAUGAUGGCgAUGAA 4.1% 5826 65 ACGAAAACGGGACUaUAGC 0.1%

5779 59 AgUGGAUGAUGGCAAOGAA 0.1% 5827 65 ACGAAAACGGGACUCcAGC 0.3%

60 60 AUGAAAUGGAUGAUGGCAA 95.7% 5828 65 ACGAAAACGGGACUCUAaC 0.1%

5780 60 AUGAAAUGGAOGAOGGCgA 4.1% 5829 65 ACGgAAACGaGACUCUAGC 0.1%

5781 60 AUGAAgUGGAUGAUGGCAA 0.1% 5830 65 ACGgAAACGGaACUCUAGC 0.1%

5782 60 gUGAAAUGGAUGAUGGCAA 0.1% 66 66 CGAAAACGGGACUCUAGCA 95.4%

61 61 GAAACGAAAACGGGACUCU 95.5% 5831 66 CGgAAACGGGACUCUAGCA 2.7%

5783 61 GAAACGgAAACGGGACUCU 2.7% 5832 66 CGgAAACGGGACUuUAGCA 0.2%

5 784 61 GAAACGgAAACGGGACUuU 0.2% 5833 66 CGAAAAaGGGACUCUAGCA 0.1%

5785 61 GAAACGAAAAaGGGACUCU 0.1% 5834 66 CGAAAACGaGACUCUAGCA 0.3%

5786 61 GAAACGAAAACGaGACUCU 0.3% 5835 66 CGAAAACGGaACUCUAGCA 0.5%

S836 66 CGAAAACGGGACUaUAGCA 0.1% 5885 71 GGOUAAAACAcGGAACCUU 0.1%

5 837 66 CGAAAACGGGACDCCAGCA 0.3% 5886 71 GGcUAAAgCAUGGAACCUU 0.2%

5838 66 CGAAAACGGGACDCUAaCA 0.1% 5887 71 GGUUAAAgCAcGGAACCDU 0.1%

5839 66 CGgAAACGaGACDCUAGCA 0.1% 5888 71 GaUUAAAACAcGGAACCUU 2.2%

584C 66 CGgAAACGGaACDCDAGCA 0.1% 5889 71 GaUUAAAACAcGGgACCUO 0.1%

67 67 AAAACGGGACDCDAGCAUA 95.3% 72 72 AGGUUAAAACAUGGAACCU 94.8%

5841 67 AAAACGGGACUCUAGCADg 0.1% 5890 72 AGGUUAAAACAUGGAACuU 0.1%

5842 67 gAAACGGGACUCUAGCAUA 2.7% 5891 72 AGGUUAAAACAUGGgACCU 1.2%

5 843 67 gAAACGGGACUuDAGCADA 0.2% 5892 72 AGGOϋgAAACAOGGAACCU 0.3%

5844 67 AAAAaGGGACDCUAGCAUA 0.1% 5893 72 AGaUOAAAACAOGGAACCU 0.4%

5845 67 AAAACGaGACUCUAGCAUA 0.3% 5894 72 AGGUUAAAAaAUGGAACCU 0.4%

5846 67 AAAACGGaACDCDAGCAUA 0.5% 5895 72 AGGUUAAAACAcGGAACCU 0.1%

5847 67 AAAACGGGACUaUAGCADA 0.1% 5896 72 AGGcUAAAgCAUGGAACCU 0.2%

5848 67 AAAACGGGACUCcAGCAUA 0.3% 5897 72 AGGUUAAAgCAcGGAACCU 0.1%

5 849 67 AAAACGGGACUCDAaCAUA 0.1% 5898 72 AGaUUAAAACAcGGAACCU 2.2%

5850 67 gAAACGaGACUCUAGCAUA 0.1% 5899 72 AGaUUAAAACAcGGgACCU 0.1%

5851 67 gAAACGGaACUCUAGCAUA 0.1% 73 73 CUCGCACDCGCGAGAOACO 94.5%

68 68 GAAAGGUUAAAACAUGGAA 94.9% 5900 73 COCGCACOCGCGAGADAuD 1.4%

5852 68 GAAAGGUUAAAACAOGGgA 1.2% 5901 73 CDCGCACUCGuGAGAUACU 0.1%

5853 68 GAAAGGUDgAAACADGGAA 0.3% 5902 73 CUCGCAuUCGCGAGAUACU 0.1%

5854 68 GAAAGaUUAAAACAUGGAA 0.4% 5903 73 UOCGCACOCGCGAGAOACO 0.1%

5855 68 GAAAGGUUAAAAaAOGGAA 0.4% 5904 73 CcCGCACUCGCGAGAUACϋ 2.0%

5856 68 GAAAGGUUAAAACACGGAA 0.1% 5905 73 CUCGCACaCGCGAGAUACϋ 0.6%

5857 68 GAAAGGcUAAAgCAUGGAA 0.2% 5906 73 CDCGCACcCGCGAGADACO 0.5%

5858 68 GAAAGGUUAAAgCAcGGAA 0.1% 5907 73 COCGCACUCGCGAaAUACU 0.1%

5859 68 GAAAGaUDAAAACAcGGAA 2.2% 5908 73 CUCGCACUCGCGgaAOACU 0.1%

5860 68 GAAAGaUUAAAACAcGGgA 0.1% 5909 73 uϋCGCACaCGCGAGAUACU 0.1%

69 69 AAAGGUUAAAACADGGAAC 94.9% 5910 73 CcCGCACgCGCGAGAUACU 0.2%

5861 69 AAAGGDUAAAACAUGGgAC 1.2% 5911 73 CUCGCACUguCGAGAUACU 0.1%

5862 69 AAAGGUDgAAACADGGAAC 0.3% 74 74 UCOCGCACOCGCGAGAUAC 94.5%

5863 69 AAAGaUUAAAACADGGAAC 0.4% 5912 74 OCOCGCACUCGCGAGAUAu 1.4%

5864 69 AAAGGDDAAAAaAUGGAAC 0.4% 5913 74 UCUCGCACUCGuGAGAOAC 0.1%

5865 69 AAAGGUDAAAACAcGGAAC 0.1% 5914 74 OCDCGCAuUCGCGAGAUAC 0.1%

5866 69 AAAGGcUAAAgCAUGGAAC 0.2% 5915 74 UuUCGCACDCGCGAGAUAC 0.1%

5867 69 AAAGGUUAAAgCAcGGAAC 0.1% 5916 74 UCcCGCACUCGCGAGAUAC 2.0%

5868 69 AAAGaDUAAAACAcGGAAC 2.2% 5917 74 UCUCGCACaCGCGAGAUAC 0.6%

5869 69 AAAGaUUAAAACAcGGgAC 0.1% 5918 74 UCDCGCACcCGCGAGAOAC 0.5%

70 70 AAGGUDAAAACADGGAACC 94.8% 5919 74 DCUCGCACUCGCGAaAUAC 0.1%

5870 70 AAGGDUAAAACAUGGAACu 0.1% 5920 74 UCOCGCACUCGCGgaAOAC 0.1%

5871 70 AAGGUUAAAACADGGgACC 1.2% 5921 74 OuUCGCACaCGCGAGAUAC 0.1%

5872 70 AAGGUDgAAACADGGAACC 0.3% 5922 74 UCcCGCACgCGCGAGAUAC 0.2%

5873 70 AAGaDUAAAACAUGGAACC 0.4% 5923 74 UCUCGCACOguCGAGAUAC 0.1%

5874 70 AAGGOUAAAAaAUGGAACC 0.4% 75 75 UAAAAGCAGUUAGAGGUGA 94.3%

5875 70 AAGGUUAAAACAcGGAACC 0.1% 5924 75 UAAAAGCgGUUAGAGGUGA 0.3%

5876 70 AAGGcDAAAgCAOGGAACC 0.2% 5925 75 UgAAAGCAGUUAGAGGUGA 0.2%

5877 70 AAGGUUAAAgCAcGGAACC 0.1% 5926 75 UAAAgGCAGUUAGgGGUGA 0.1%

5878 70 AAGaOOAAAACAcGGAACC 2.2% 5927 75 UAAAAGCAGUCAGAGGUGA 0.4%

5879 70 AAGaUUAAAACAcGGgACC 0.1% 5928 75 UAAAAGCAGUOAGAGGaGA 0.7%

71 71 GGUDAAAACAUGGAACCOU 94.8% 5929 75 OAAAAuCAGUUAGAGGUGA 0.1%

5880 71 GGUOAAAACAOGGAACUUU 0.1% 5930 75 UAAAAGCAGUcAGgGGOGA 0.1%

5881 71 GGOUAAAACAUGGgACCUU 1.2% 5931 75 UAAAgGCAGUccGAGGUGA 2.2%

5882 71 GGUUgAAACAuGGAACCUU 0.3% 76 76 UCGGAUGGCCAUCAAUUAA 94.2%

5883 71 GaUUAAAACAUGGAACCUU 0.4% 5932 76 UCGGAOGGCCADCAAUUAg 3.9%

5884 71 GGUOAAAAaAOGGAACCDO 0.4% 5933 76 UCGGAOGGCCAUuAAUUAA 0.2%

5 934 76 UCGaAOGGCCAUCAAOuAA 0.7% 5979 85 ACUUAGGgUGAAAUGGAUG 0.1%

S935 76 OCGGAUGGCCAOCAAgOAA 0.1% 5980 85 ACUcAGGAUGAAAUGGAUG 0.1%

S936 76 UCGGAUGGCCAUCAAUagg 0.1% 5981 85 ACUUAGaAUGAAAUGGAUG 2.1%

5937 76 DCGGAUGGCCAcCAAUgAg 0.1% 5982 85 UCUUAGaAUGAAAUGGAOG 2.3%

77 77 AGGAUGAAAUGGAUGAUGG 94.0% 86 86 GUUGCAOACAUGUUAGAGA 93.8%

5938 77 AGGAUGAAgUGGAOGAUGG 0.1% 5983 86 GUUGCAUACAUGUUgGAGA 0.3%

5 939 77 AGGgUGAAAUGGAOGAUGG 0.1% 5984 86 GUUGCgUACAUGOUAGAGA 1.5%

5940 77 AGaAOGAAAOGGAUGAOGG 5.8% 5985 86 aUUGCAUACAUGUUAGAGA 0.1%

78 78 GGAOGAAAOGGAOGAOGGC 94.0% 5986 86 GUcGCAUACAUGUUAGAGA 0.3%

5941 78 GGAOGAAgOGGAUGAUGGC 0.1% 5987 86 GUOGCAUACAUGUUAGAaA 0.1%

5942 78 GGgUGAAAOGGAUGAUGGC 0.1% 5988 86 GUgGCAOACAOGOOgGAGA 0.2%

5943 78 GaAUGAAAUGGAUGAUGGC 5.8% 5989 86 GUOGCAOACAOGϋϋgGAaA 1.2%

79 79 UUAGGAOGAAAOGGAOGAU 93.9% 5990 86 GOgGCAOACAOGOOgGAaA 0.3%

5944 79 OUAGGAOGAAgOGGAOGAU 0.1% 87 87 UAAAACAUGGAACCUUUGG 93.8%

5945 79 OUAGGgUGAAAUGGAUGAU 0.1% 5991 87 UAAAACAOGGAACuOOUGG 0.1%

5946 79 OcAGGAUGAAAUGGAUGAO 0.1% 5992 87 UAAAACAUGGgACCOUUGG 1.2%

5947 79 UOAGaAUGAAAUGGAUGAO 4.5% 5993 87 UAAAgCAOGGAACCUUUGG 0.2%

5948 79 UcAGaAOGAAAOGGAOGAU 1.3% 5994 87 UgAAACAUGGAACCUUOGG 0.3%

80 80 AUGGOOGCAOACAUGOUAG 93.9% 5995 87 OAAAAaAUGGAACCUUOGG 0.4%

5949 80 AUGGUUGCAUACAUGUOgG 1.5% 5996 87 OAAAACAcGGAACCOUUGG 2.2%

5950 80 AUGGUUGCgUACAUGUUAG 1.5% 5997 87 UAAAACAOGGAACCOOcGG 1.2%

5951 80 AUGaUUGCAUACAOGUOAG 0.1% 5998 87 UAAAACAOGGAACCUOnGG 0.1%

5952 80 AOGGUcGCAUACAOGUUAG 0.3% 5999 87 UAAAACAUGGAACCUUUaG 0.1%

5953 80 AUGGUgGCAUACAOGUUgG 0.5% 6000 87 UAAAACAcGGgACCOUOGG 0.1%

5954 80 AOGGOgGCAOAuAOGOUgG 2.0% 6001 87 OAAAgCAcGGAACCOUUGG 0.1%

81 81 UAGGAUGAAAOGGAOGAOG 93.9% 6002 87 UAAAACAcGGAACCOUcGG 0.1%

5955 81 UAGGAOGAAgUGGAOGAUG 0.1% 88 88 OUAAAACAUGGAACCUUUG 93.8%

5956 81 OAGGgUGAAAUGGAUGAUG 0.1% 6003 88 UUAAAACAUGGAACuOOOG 0.1%

5957 81 CAGGAUGAAAUGGAUGAUG 0.1% 6004 88 UUAAAACAUGGgACCUUUG 1.2%

5958 81 UAGaAOGAAAOGGAUGAUG 4.5% 6005 88 UUgAAACAUGGAACCϋϋOG 0.3%

5959 81 CAGaAUGAAAOGGAOGAUG 1.3% 6006 88 UOAAAAaAOGGAACCUUOG 0.4%

82 82 UGGUOGCAOACAOGOOAGA 93.9% 6007 88 OOAAAACAcGGAACCUUUG 2.2%

5960 82 OGGOOGCAOACAOGOOgGA 1.5% 6008 88 OUAAAACAOGGAACCUUcG 1.2%

5961 82 OGGOUGCgUACAUGOOAGA 1.5% 6009 88 OUAAAACAUGGAACCUUnG 0.1%

5962 82 OGaOUGCAOACAUGUUAGA 0.1% 6010 88 UUAAAACAUGGAACCUUUa 0.1%

5963 82 OGGOcGCAUACAUGUUAGA 0.3% 6011 88 cOAAAgCAOGGAACCOOOG 0.2%

5964 82 UGGUgGCAUACAUGUUgGA 0.5% 6012 88 OUAAAACAcGGgACCUUOG 0.1%

5965 82 UGGUgGCAUAuAUGUUgGA 2.1% 6013 88 OOAAAgCAcGGAACCUUUG 0.1%

83 83 CUOAGGAOGAAAOGGAUGA 93.9% 6014 88 OOAAAACAcGGAACCUUcG 0.1%

5966 83 CUOAGGAOGAAgOGGAUGA 0.1% 89 89 GGOUGCAOACAOGOUAGAG 93.8%

5967 83 CUUAGGgOGAAAOGGAUGA 0.1% 6015 89 GGUUGCAOACAUGUUgGAG 0.3%

5968 83 CUcAGGAOGAAAOGGAUGA 0.1% 6016 89 GGOUGCgUACAUGOUAGAG 1.5%

5969 83 COUAGaAUGAAAOGGAUGA 4.5% 6017 89 GaUUGCAUACAUGOUAGAG 0.1%

5970 83 COcAGaAUGAAAOGGAUGA 1.3% 6018 89 GGUcGCAUACAUGOUAGAG 0.3%

84 84 UUGCAUACAUGOOAGAGAG 93.9% 6019 89 GGUOGCAUACAUGOUAGAa 0.1%

5971 84 UUGCAUACAUGUUgGAGAG 0.3% 6020 89 GGUgGCAUACAUGUUgGAG 0.2%

5972 84 OUGCgUACAUGUUAGAGAG 1.5% 6021 89 GGUUGCAUACAOGUUgGAa 1.2%

5973 84 UcGCAUACAUGUUAGAGAG 0.3% 6022 89 GGUgGCAUACAUGUUgGAa 0.3%

5974 84 UUGCAUACAUGUUAGAaAG 0.1% 90 90 UGAOGGUUGCAUACAUGϋϋ 93.7%

5975 84 UgGCAUACAUGUUgGAGAG 0.2% 6023 90 UGAUGGUUGCgUACAUGUU 1.5%

5976 84 UOGCAUACAOGUUgGAaAG 1.2% 6024 90 UaAUGGUUGCAUACAUGUU 1.7%

5977 84 UgGCAUACAUGUUgGAaAG 0.3% 6025 90 UGAUGaUUGCAUACAUGUU 0.1%

85 85 ACUUAGGAUGAAAOGGAUG 93.9% 6026 90 UGAUGGOcGCAOACAUGOU 0.3%

5978 85 ACUUAGGAUGAAgUGGAUG 0.1% 6027 90 UGAUGGUgGCAUACAUGUU 0.1%

6028 90 UaAUGGUgGCAUACAUGUU 0.4% 6075 97 UCACOOAGGAOGAAgϋGGA 0.1%

6029 90 UaAOGGOgGCAUAuAUGUU 2.0% 6076 97 UCACUOAGGgOGAAAUGGA 0.1%

91 91 AUGAGAAUACUUGOAAGGG 93.7% 6077 97 UuACUUAGGAUGAAAUGGA 0.2%

6030 91 AUGAGAAUACUUGOgAGGG 0.2% 6078 97 gCACOOAGGAOGAAAOGGA 0.4%

6031 91 AUGAGAAUACUcGOAAGGG 1.3% 6079 97 UCACUcAGGAUGAAAOGGA 0.1%

6032 91 AOGAGAAUACUUGOAAGaG 2.6% 6080 97 UCACUOAGaAOGAAAUGGA 2.1%

6033 91 AUGAGAAUACUUGOgAGaG 2.0% 98 98 AACAOGGAACCUUUGGCCC 93.1%

6034 91 AUGAcAAUACUUGUgAGaG 0.1% 6081 98 AACAOGGAACCUUUGGuCC 1.1%

92 92 UGAGAAUACUUGOAAGGGG 93.7% 6082 98 AACAUGGAACUUUUGGCCC 0.1%

6035 92 UGAGAAUACUUGOgAGGGG 0.2% 6083 98 AACAUGGgACCUUUGGCCC 1.1%

6036 92 UGAGAAUACUCGOAAGGGG 1.3% 6084 98 AgCAUGGAACCUUUGGCCC 0.2%

6037 92 UGAGAAUACUUGOAAGaGG 2.6% 6085 98 AACAUGGgACCUUUGGuCC 0.1%

6038 92 OGAGAAUACUOGOgAGaGG 2.0% 6086 98 AAaAOGGAACCUUUGGCCC 0.4%

6039 92 OGAcAAUACUOGUgAGaGG 0.1% 6087 98 AACAcGGAACCUUUGGCCC 2.2%

93 93 CACUUAGGAUGAAAOGGAO 93.7% 6088 98 AACAOGGAACCUUnGGCCC 0.1%

6040 93 CACUUAGGAOGAAgOGGAO 0.1% 6089 98 AACAUGGAACCUUUaGCCC 0.1%

6041 93 CACUUAGGgOGAAAOGGAO 0.1% 6090 98 AACAcGGgACCUUOGGCCC 0.1%

6042 93 UACUUAGGAOGAAAOGGAO 0.2% 6091 98 AACAOGGAACCOUcGGuCC 1.2%

6043 93 CACϋcAGGAUGAAAOGGAO 0.1% 6092 98 AgCAcGGAACCOOOGGCCC 0.1%

6044 93 CACUUAGaAOGAAAOGGAU 2.1% 6093 98 AACAcGGAACCOOcGGCCC 0.1%

6045 93 CuCUUAGaAUGAAAUGGAU 2.3% 99 99 AAACAUGGAACCUUUGGCC 93.1%

94 94 GOUAAAACAOGGAACCOOU 93.7% 6094 99 AAACAUGGAACCUOUGGuC 1.1%

6046 94 GOOAAAACAOGGAACUOOU 0.1% 6095 99 AAACAUGGAACuϋUUGGCC 0.1%

6047 94 GOOAAAACAOGGgACCOOU 1.2% 6096 99 AAACAUGGgACCUOOGGCC 1.1%

6048 94 GOOgAAACAUGGAACCOOU 0.3% 6097 99 AAgCAUGGAACCUOOGGCC 0.2%

6049 94 aOOAAAACAUGGAACCUUO 0.2% 6098 99 AAACAUGGgACCUOOGGuC 0.1%

6050 94 GUUAAAAaAUGGAACCUUO 0.4% 6099 99 AAAaAOGGAACCUUUGGCC 0.4%

6051 94 GUUAAAACAcGGAACCUUO 0.1% 6100 99 AAACAcGGAACCUUUGGCC 2.2%

6052 94 GOOAAAACAOGGAACCUUc 1.0% 6101 99 AAACAOGGAACCUUnGGCC 0.1%

6053 94 GUOAAAACAUGGAACCUOn 0.1% 6102 99 AAACAUGGAACCUUUaGCC 0.1%

6054 94 GcUAAAgCAUGGAACCOUO 0.2% 6103 99 AAACAcGGgACCOOUGGCC 0.1%

6055 94 GUUAAAgCAcGGAACCUUO 0.1% 6104 99 AAACAUGGAACCUOCGGUC 1.2%

6056 94 aϋUAAAACAcGGAACCUUϋ 2.1% 6105 99 AAgCAcGGAACCUOOGGCC 0.1%

6057 94 aOUAAAACAOGGAACCUUc 0.2% 6106 99 AAACAcGGAACCUUcGGCC 0.1%

6058 94 aOOAAAACAcGGgACCUUU 0.1% 100 100 CAAGCOGUGGAUAUAUGCA 92.9%

95 95 UOGAOGGUUGCAUACAUGU 93.5% 6107 100 CAAGCUGUGGAUAUAUGuA 0.1%

6059 95 UUGAOGGUUGCgOACAUGU 1.5% 6108 100 CAgGCUGUGGAUAUAUGCA 0.2%

6060 95 CUGAOGGUUGCAOACAUGU 0.2% 6109 100 CAAGCaGUGGAUAUAUGCA 0.1%

6061 95 UUaAUGGUUGCAUACAOGU 1.7% 6110 100 CAAGCcGUGGAUAUAUGCA 0.5%

6062 95 UUGAUGaOUGCAUACAUGO 0.1% 6111 100 CAAGCUGUaGAOAOAUGCA 2.8%

6063 95 OUGAUGGUcGCAUACAUGO 0.3% 6112 100 CAAGCUGUGaAUAUAUGCA 0.1%

6064 95 CUGAUGGUgGCAUACAUGU 0.1% 6113 100 CAAGCUGUGGAaAUAUGCA 0.2%

6065 95 UOaAUGGUgGCAUACAUGU 0.4% 6114 100 CAAGCOGUGGAcAUAUGCA 2.6%

6066 95 UUaAOGGUgGCAOAuAUGU 2.0% 61 15 100 CAAGCcGUGGAUAUAUGuA 0.1%

96 96 GAUGGUUGCAUACAUGUOA 93.4% 6116 100 CAAGCcGUGGgUAUAUGCA 0.1%

6067 96 GAUGGUOGCAOACAOGUOg 0.3% 6117 100 CAAGCaGOaGAUAUAUGCA 0.2%

6068 96 GAUGGUUGCgUACAUGUUA 1.5% 61 18 100 CAAGCcGUGGAOAUuUGCA 0.1%

6069 96 aAUGGUUGCAUACAUGUUA 0.5% 101 101 AAGCUGUGGAUAUAUGCAA 92.9%

6070 96 GAUGaOUGCAUACAUGUOA 0.1% 6119 101 AAGCUGUGGAUAUAUGuAA 0.1%

6071 96 GAUGGUcGCAUACAUGOOA • 0.3% 6120 101 AgGCUGOGGAUAUAUGCAA 0.2%

6072 96 aAOGGUUGCAUACAOGUOg 1.2% 6121 101 AAGCaGUGGAUAUAUGCAA 0.1%

6073 96 GAOGGUgGCAUACAOGUUg 0.1% 6122 101 AAGCcGUGGAUAUAUGCAA 0.5%

6074 96 aAUGGUgGCAUACAOGUϋg 0.4% 6123 101 AAGCUGUaGAUAUAOGCAA 2.8%

97 97 UCACUUAGGAUGAAAUGGA 93.3% 6124 101 AAGCUGUGaAUAUAUGCAA 0.1%

6125 101 AAGCuGUGGAaAUAOGCAA 0.2% 6174 106 GgUGAAAUGGAOGAUGGCA 0.1%

6126 101 AAGCOGUGGAcAUAOGCAA 2.6% 6175 106 aAUGAAAUGGAUGAUGGCA 3.4%

6127 101 AAGCcGUGGAUAϋAUGuAA 0.1% 6176 106 aAUGAAAUGGAUGAUGGCg 2.3%

6128 101 AAGCcGUGGgUAUAUGCAA 0.1% 107 107 AACAAGCUGUGGAUAUAUG 91.3%

6129 101 AAGCaGUaGAOAUAUGCAA 0.2% 6177 107 AACAgGCUGUGGAUAUAUG 0.2%

6130 101 AAGCcGUGGAUAϋuOGCAA 0.1% 6178 107 AgCAAGCUGUGGAUAUAUG 1.7%

102 102 AAAACAOGGAACCOOOGGC 92.8% 6179 107 AACAAGCaGUGGAUAUAUG 0.1%

6131 102 AAAACAUGGAACCUUOGGu 1.1% 6180 107 AACAAGCcGUGGAUAUAUG 0.4%

6132 102 AAAACAUGGAACUUUUGGC 0.1% 6181 107 AACAAGCUGOaGAUAUAUG 0.3%

6133 102 AAAACAUGGgACCUUUGGC 1.1% 6182 107 AACAAGCUGUGGACAUAUG 2.6%

6134 102 AAAgCAUGGAACCUOUGGC 0.2% 6183 107 AgCAAGCcGOGGAUAUAUG 0.2%

6135 102 gAAACAUGGAACCUUOGGC 0.3% 6184 107 AgCAAGCUGUaGAOAOAOG 2.5%

6136 102 AAAACAOGGgACCOUOGGu 0.1% 6185 107 AgCAAGCOGOGaAUAUAUG 0.1%

6137 102 AAAAaAOGGAACCOUOGGC 0.4% 6186 107 AgCAAGCUGUGGAaAUAUG 0.2%

6138 102 AAAACAcGGAACCOUUGGC 2.2% 6187 107 AgCAAGCcGOGGgOAOAUG 0.1%

6139 102 AAAACAOGGAACCUUnGGC 0.1% 6188 107 AACAAGCaGUaGAUAUAUG 0.2%

6140 102 AAAACAUGGAACCUOUaGC 0.1% 6189 107 AgCAAGCcGUGGAOAUuUG 0.1%

6141 102 AAAACAcGGgACCOOOGGC 0.1% 108 108 GAACAAGCUGUGGAOAOAO 91.3%

6142 102 AAAACAOGGAACCOUcGGu 1.2% 6190 108 GAACAgGCUGOGGAOAOAO 0.2%

6143 102 AAAgCAcGGAACCUUUGGC 0.1% 6191 108 GAgCAAGCUGOGGAUAUAU 1.7%

6144 102 AAAACAcGGAACCUUcGGC 0.1% 6192 108 GAACAAGCaGUGGAUAUAU 0.1%

103 103 GUAAUGAAACGAAAACGGG 92.6% 6193 108 GAACAAGCcGUGGAOAOAU 0.4%

6145 103 GOAAUGAAACGgAAACGGG 2.7% 6194 108 GAACAAGCOGOaGAUAUAU 0.3%

6146 103 GUgAOGAAACGAAAACGGG 3.3% 6195 108 GAACAAGCUGUGGACAUAU 2.6%

6147 103 GOgAOGAAACGgAAACGGG 0.2% 6196 108 GAgCAAGCcGUGGAOAUAU 0.2%

6148 103 GOAAOGAAACGAAAAaGGG 0.1% 6197 108 GAgCAAGCUGUaGAUAUAU 2.5%

6149 103 GOAAUGAAACGAAAACGaG 0.3% 6198 108 GAgCAAGCUGUGaAUAUAU 0.1%

6150 103 GOAAUGAAACGAAAACGGa 0.4% 6199 108 GAgCAAGCUGOGGAaAUAU 0.2%

6151 103 GUAAUGAAACGgAAACGaG 0.1% 6200 108 GAgCAAGCcGUGGgUAOAO 0.1%

6152 103 GOAAOGAAACGgAAACGGa 0.1% 6201 108 GAACAAGCaGOaGAOAUAU 0.2%

6153 103 GOgAUGAAACGAAAACGGa 0.1% 6202 108 GAgCAAGCcGUGGAUAUuU 0.1%

104 104 UAAUGAAACGAAAACGGGA 92.6% 109 109 ACAAGCUGUGGAUAUAUGC 91.2%

6154 104 OAAUGAAACGgAAACGGGA 2.7% 6203 109 ACAAGCOGUGGAUAUAUGU 0.1%

6155 104 OgAOGAAACGAAAACGGGA 3.3% 6204 109 ACAgGCOGUGGAUAUAUGC 0.2%

6156 104 OgAUGAAACGgAAACGGGA 0.2% 6205 109 gCAAGCUGUGGAUAUAUGC 1.7%

6157 104 UAAUGAAACGAAAAaGGGA 0.1% 6206 109 ACAAGCaGUGGAUAUAOGC 0.1%

6158 104 UAAUGAAAC GAAAAC GaGA 0.3% 6207 109 ACAAGCcGOGGAUAUAUGC 0.3%

6159 104 OAAUGAAACGAAAACGGaA 0.4% 6208 109 ACAAGCUGUaGAUAUAUGC 0.3%

6160 104 UAAOGAAACGgAAACGaGA 0.1% 6209 109 ACAAGCUGUGGACAUAUGC 2.6%

6161 104 OAAUGAAACGgAAACGGaA 0.1% 6210 109 ACAAGCcGUGGAOAOAOGu 0.1%

6162 104 OgAUGAAACGAAAACGGaA 0.1% 6211 109 gCAAGCcGUGGAUAUAUGC 0.2%

105 105 AAUGAAACGAAAACGGGAC 92.6% 6212 109 gCAAGCUGUaGAOAOAOGC 2.5%

6163 105 AAUGAAACGgAAACGGGAC 2.7% 6213 109 gCAAGCUGUGaAUAUAUGC 0.1%

6164 105 gAOGAAACGAAAACGGGAC 3.3% 6214 109 gCAAGCUGUGGAaAUAUGC 0.2%

6165 105 gAOGAAACGgAAACGGGAC 0.2% 6215 109 gCAAGCcGUGGgOAOAOGC 0.1%

6166 105 AAUGAAACGAAAAaGGGAC 0.1% 6216 109 ACAAGCaGOaGAUAUAUGC 0.2%

6167 105 AAUGAAACGAAAACGaGAC 0.3% 6217 109 gCAAGCcGUGGAUAUuUGC 0.1%

6168 105 AAOGAAACGAAAACGGaAC 0.4% 110 110 OGUACACUCCAGGUGGAGA 90.0%

6169 105 AAOGAAACGgAAACGaGAC 0.1% 6218 110 UGUACACUCCAGGUGGgGA 4.4%

6170 105 AAOGAAACGgAAACGGaAC 0.1% 6219 110 UGUACACaCCAGGUGGAGA 0.1%

6171 105 gAUGAAACGAAAACGGaAC 0.1% 6220 110 OGUACACcCCAGGUGGAGA 0.9%

106 106 GAUGAAAUGGAUGAUGGCA 92.3% 6221 110 UGUACACgCCAGGUGGAGA 0.1%

6172 106 GAUGAAAOGGAOGAUGGCg 1.7% 6222 110 UGUACACUCCAGGCGGAGA 1.3%

6173 106 GAOGAAgUGGAUGAUGGCA 0.1% 6223 110 UGUACACUCCAGGUGGcGA 0.2%

6224 1 10 UGOACACUCCcGGUGGAGA 0.1% 6233 111 AUGUACACgCCAGGUGGAG Q.1%

6225 1 10 aGUACACUCCAGGUGGgGA 0.1% 6234 111 AUGUACACUCCAGGcGGAG 1.3%

6226 1 10 UGUACACUCCAGGaGGgGA 0.2% 6235 111 AUGUACACUCCAGGUGGcG 0.2%

6227 1 10 UGUACACUCCAGGcGGgGA 0.1% 6236 111 AUGUACACUCCCGGUGGAG 0.1%

6228 1 10 UGUAuACUCCAGGaGGgGA 0.2% 6237 111 AaGUACACUCCAGGUGGgG 0.1%

6229 1 10 UGUACACaCCAGGaGGgGA 2.2% 6238 111 AUGUACACUCCAGGaGGgG 0.2%

111 1 1 1 AUGUACACUCCAGGUGGAG 90.0% 6239 111 AUGUACACUCCAGGcGGgG 0.1%

6230 1 1 1 ADGUACACUCCAGGUGGgG 4.4% 6240 111 AUGUAuACUCCAGGaGGgG 0.2%

6231 1 1 1 AUGUACACaCCAGGUGGAG 0.1% 6241 111 AUGUACACaCCAGGaGGgG 2.2%

6232 11 1 AUGUACACcCCAGGUGGAG 0.9% 6242 111 AUGUACACgCCAGGaGGgG 0.1%

Table 20-2: Conserved and minor variant 19-mer sequences from the Influenza A segment 2 (PBl) sequences listed in Table 1-2. The conserved sequences match at least 89% of the listed viral sequences, and the variants contain 3 or fewer nucleotide changes from the reference sequence.

Seq Ref % 1188 1188 GAAGAUCUGUUCCACCAUU 90.8%

ID ID Match Seq Total 6273 1188 GAAGAUCUGUUuCACCAUU 0.7%

1183 1183 AUGAUGAUGGGCAUGUUCA 96.7%

6274 1188 GAgGAUCUGUUCCACCAUU 0.2%

6243 1183 AUGAUGAUGGGCAUGUUUA 2.7%

6275 1188 GAAaAUCUGUUCCACCAUU 0.1%

6244 1183 AUGAUGAUGGGUAUGUUCA 0.4%

6276 1188 GAAGA cCUGUUCCACCAUU 6.8%

6245 1183 AUGAUGAUGGGUAUGUUUA 0.1%

6277 1 188 GAAGAUCUGcUCCACCAUU 0.2%

6246 1183 AUGAUGAUGGGaAUGUUCA 0.1%

6278 1 188 GAAGAUCUGUUaCACCAUU 0.1%

1184 1 184 UGAUGAUGGGCAUGUUCAA 96.7%

6279 1188 GAAGAcCUGUUCuACCAUU 0.1%

6247 1184 UGAUGAUGGGCAUGUUUAA 2.7%

6280 1188 aAAGAcCUGUUCCACCAUU 0.1%

6248 1184 UGAUGAUGGGUAUGUUCAA 0.4%

6281 1188 GAAGAcCUGcUCCACCAUU 0.2%

6249 1184 UGAUGAUGGGUAUGUUUAA 0.1%

1189 1189 AAGAUCUGUUCCACCAUUG 90.7%

6250 1184 UGAUGAUGGGaAUGUUCAA 0.1%

6282 1189 AAGAUCUGUUuCACCAUUG 0.7%

1185 1185 AUCUGUUCCACCAUUGAAG 91.1%

6283 1189 AgGAUCUGUUCCACCAUUG 0.2%

6251 1185 AUCUGUUuCACCAUUGAAG 0.7%

6284 1189 AAaAUCUGUUCCACCAUUG 0.1%

6252 1185 AcCUGUUCCACCAUUGAAG 6.9%

6285 1 189 AAGAcCUGUUCCACCAUUG 6.9%

6253 1185 AUCUGcUCCACCAUUGAAG 0.2%

6286 1189 AAGAUCUG cUCCACCAUUG 0.2%

6254 1 185 AUCUGUUaCACCAUUGAAG 0.1%

6287 1189 AAGAUCUGUUaCACCAUUG 0.1%

6255 1185 AUCUGUUCCACCAUUaAAG 0.1%

6288 1 189 AAGAUCUGUUCCACCAUUa 0.1%

6256 1185 AcCUGUUCuACCAUUGAAG 0.1%

6289 1189 AAGAc CUGUUCu ACCAUUG 0.1%

6257 1185 AcCUGcUCCACCAUUGAAG 0.3%

6290 1189 AAGAcCUGcUCCACCAUUG 0.3%

1186 1186 UCUGUUCCACCAUUGAAGA 91.1%

1190 1190 UGAAGAUCUGUUCCACCAU 90.7%

6258 1186 UCUGUUuCACCAUUGAAGA 0.7%

6291 1190 UGAAGAUCUGUUUCACCAU 0.7%

6259 1 186 cCUGUUCCACCAUUGAAGA 6.9%

6292 1190 UGAgGAUCUGUUCCACCAU 0.2%

6260 1 186 UCUGcUCCACCAUUGAAGA 0.2%

6293 1 190 cGAAGAUCUGUUCCACCAU 0.1%

6261 1 186 UCUGUUaCACCAUUGAAGA 0.1%

6294 1 190 UGAAaAUCUGUUCCACCAU 0.1%

6262 1 186 UCUGUUCCACCAUUaAAGA 0.1%

6295 1190 UGAAGAcCUGUUCCACCAU 6.8%

6263 1 186 cCUGUUCuACCAUUGAAGA 0.1%

6296 1 190 UGAAGAUCUGCUCCACCAU 0.2%

6264 1 186 cCUGcUCCACCAUUGAAGA 0.3%

6297 1 190 UGAAGAUCUGUUaCACCAU 0.1%

1187 1 187 GAUCUGUUCCACCAUUGAA 90.9%

6298 1190 UGAAGAcCUGUUCuACCAU 0.1%

6265 1187 GAUCUGUUuCACCAUUGAA 0.7%

6299 1190 UaAAGAcCUGUUCCACCAU 0.1%

6266 1187 aAUCUGUUCCACCAUUGAA 0.1%

6300 1 190 UGAAGAcCUGcUCCACCAU 0.2%

6267 1 187 GAcCUGUUCCACCAUUGAA 6.9%

1 191 1 191 AGAUCUGUUCCACCAUUGA 90.7%

6268 1 187 GAUCUGcUCCACCAUUGAA 0.2%

6301 1 191 AGAUCUGUUuCACCAUUGA 0.7%

6269 1187 GAUCUGUUaCACCAUUGAA 0.1%

6302 1 191 gGAUCUGUUCCACCAUUGA 0.2%

6270 1 187 GAUCUGUUCCACCAUIJaAA 0.1%

6303 1 191 AaAUCUGUUCCACCAUUGA 0.1%

6271 1 187 GAcCUGUUCuACCAUUGAA 0.1%

6304 1191 AGAcCUGUUCCACCAUUGA 6.9%

6272 1 187 GAcCUGcUCCACCAUUGAA 0.3%

6305 1191 AGAUCUGcUCCACCAUUGA 0.2%

6306 1191 AGAUCUGUUaCACCAUUGA 0.1% 6354 1197 AGGCCAUGGUGUCUCGGGC 0.4%

6307 1191 AGAUCUGUUCCACCAUUaA 0.1% 6355 1197 AGGCCAUGGUUUCUAGGGC 0.2%

6308 1191 AGAcCUGUUCuACCAUUGA 0.1% 6356 1197 AGGCCAUGGUaUCUAGGaC 0.1%

6309 1191 AGAcCUGcUCCACCAUUGA 0.3% 6357 1197 AGGCCAUGGUuUCcAGGGC 0.1%

1192 1192 UGAUAAACAAUGACCUUGG 90.5% 1198 1198 GGAGGCCAUGGUGUCUAGG 89.0%

63H) 1192 UGAUAAACAAUGAUCUUGG 3.4% 6358 1198 GGAGGCCAUGGUGUCUgGG 0.1%

6311 1192 UGAUAAAuAAUGACCUUGG 0.6% 6359 1198 GGAGGCaAUGGUGUCUAGG 0.1%

6312 1192 UGgUAAACAAUGACCUUGG 0.3% 6360 1198 GGAGGCCAUGaUGUCUAGG 0.2%

6313 1192 UGAUAAACAACGACCUUGG 3.9% 6361 1198 GGAGGCCAUGGUaUCUAGG 0.6%

6314 1192 UGAUAAACAAUGACCUaGG 1.2% 6362 1198 GGAGGCCAUGGUGUCUAaG 0.1%

1193 1193 AUGAUAAACAAUGACCUUG 90.5% 6363 1198 GGAGGCCAUGGUGUCUAGa 0.2%

6315 1193 AUGAUAAACAAUGAUCUUG 3.4% 6364 1198 GGAGGCCAUGGUGUCUCGG 0.4%

6316 1193 AUGAUAAAuAAUGACCUUG 0.6% 6365 1198 GGAGGCCAUGGUUUCUAGG 0.2%

6317 1193 AUGgUAAACAAUGACCUUG 0.3% 6366 1198 uGAGGCCAUGGUGUCUAGG 0.1%

6318 1193 AUGAUAAACAACGACCUUG 3.9% 6367 1198 GGAGGCCAUGGUUUCCAGG 0.1%

6319 1193 AUGAUAAACAAUGACCUaG 1.2% 6368 1198 uGAaGCCAUGGUGUCUAGG 0.3%

1194 1194 CCAUACAGCCAUGGAACAG 89.2% 1199 1199 UGGAGGCCAUGGUGUCUAG 88.9%

6320 1194 CCAUACAGCCAUGGAACgG 0.1% 6369 1199 UGGAGGCCAUGGUGUCUgG 0.1%

6321 1194 CCAUACAGCCAUGGgACAG 1.3% 6370 1199 gGGAGGCCAUGGUGUCUAG 0.3%

6322 1194 CCAUACAGuCAUGGAACAG 0.2% 6371 1199 UGGAGGCaAUGGUGUCUAG 0.1%

6323 1194 CCAUAuAGCCAUGGAACAG 0.6% 6372 1199 UGGAGGCCAUGaUGUCUAG 0.2%

6324 1194 CCuUACAGCCAUGGAACAG 0.2% 6373 1199 UGGAGGCCAUGGUaUCUAG 0.6%

6325 1194 CCcUACAGCCAUGGgACAG 0.1% 6374 1199 UGGAGGCCAUGGUGUCUAa 0.1%

6326 1194 CCuUACAGCCAUGGgACAG 8.1% 6375 1199 UGGAGGCCAUGGUGUCUCG 0.4%

6327 1194 CCuUACAGuCAUGGAACAG 0.1% 6376 1199 UGGAGGCCAUGGUUUCUAG 0.2%

1195 1195 CAUACAGCCAUGGAACAGG 89.2% 6377 1199 UuGAGGCCAUGGUGUCUAG 0.1%

6328 1195 CAUACAGCCAUGGAACgGG 0.1% 6378 1199 UGGAGGCCAUGGUUUCCAG 0.1%

6329 1195 CAUACAGCCAUGGgACAGG 1.3% 6379 1199 UuGAaGCCAUGGUGUCUAG 0.3%

6330 1195 CAUACAGuCAUGGAACAGG 0.2% 6380 1199 UGGAGGCuAUGGUaUCaAG 0.1%

6331 1195 CAUAuAGCCAUGGAACAGG 0.6% 6381 1199 UGGAGGCuAUGGUuUCaAG 0.1%

6332 1195 CuUACAGCCAUGGAACAGG 0.2% 6382 1199 UGGAGGCuAUGGUuUCcAG 0.7%

6333 1195 CcUACAGCCAUGGgACAGG 0.1% 1200 1200 GAGAUCAUGAAGAUCUGUU 88.8%

6334 1195 CuUACAGCCAUGGgACAGG 8.1% 6383 1200 GAGAUCAUGAgGAUCUGUU 0.2%

6335 1195 CuUACAGuCAUGGAACAGG 0.1% 6384 1200 GAaAUCAUGAAGAUCUGUU 0.5%

1196 1196 GAGGCCAUGGUGUCUAGGG 89.1% 6385 1200 GAGAcCAUGAAGAUCUGUU 0.1%

6336 1196 GAGGCCAUGGUGUCUgGGG 0.1% 6386 1200 GAGAUCAcGAAGAUCUGUU 0.1%

6337 1196 GAaGCCAUGGUGUCUAGGG 0.3% 6387 1200 GAGAUCAUGAAaAUCUGUU 0.1%

6338 1196 GAGGCaAUGGUGUCUAGGG 0.1% 6388 1200 GAGAUCAUGAAGACCUGUU 6.9%

6339 1196 GAGGCCAUGaUGUCUAGGG 0.2% 6389 1200 GAGAUCAUGAAGAUCUGCU 0.2%

6340 1196 GAGGCCAUGGUaUCUAGGG 0.5% 6390 1200 GAGAUCcUGAAGAUCUGUU 0.1%

6341 1196 GAGGCCAUGGUGUCUAaGG 0.1% 6391 1200 GAGAUCuUGAAGAUCUGUU 2.0%

6342 1196 GAGGCCAUGGUGUCUAGaG 0.2% 6392 1200 GAGAUCAUaAAGAcCUGUU 0.1%

6343 1196 GAGGCCAUGGUGUCUCGGG 0.4% 6393 1200 GAGAUCAUGAAGACCUGCU 0.2%

6344 1196 GAGGCCAUGGUUUCUAGGG 0.2% 1201 1201 ACCAAGAAAAUGGUCACAC 88.7%

6345 1196 GAGGCCAUGGUaUCUAGGa 0.1% 6394 1201 ACCAAGAAAAUGGUCACgC 0.3%

6346 1196 GAGGCCAUGGUUUCCAGGG 0.1% 6395 1201 ACCAAGAAAAUGGUuACAC 0.1%

1197 1197 AGGCCAUGGUGUCUAGGGC 89.1% 6396 1201 ACCAAGAAgAUGGUCACAC 0.3%

6347 1197 AGGCCAUGGUGUCUgGGGC 0.1% 6397 1201 ACCAAGAgAAUGGUCACAC 0.4%

6348 1197 AaGCCAUGGUGUCUAGGGC 0.3% 6398 1201 ACCAAaAAAAUGGUCACAC 0.7%

634') 1197 AGGCaAUGGUGUCUAGGGC 0.1% 6399 1201 ACCAAGAAAAUGaUCACAC 0.1%

6350 1197 AGGCCAUGaUGUCUAGGGC 0.2% 6400 1201 ACCAAGAAAAUGGUCACCC 0.1%

6351 1197 AGGCCAUGGUaUCUAGGGC 0.5% 6401 1201 ACCAAGAAuAUGGUCACAC 0.1%

6352 1197 AGGCCAUGGUGUCUAaGGC 0.1% 6402 1201 ACCcAGAAAAUGGUCACAC 0.1%

6353 1197 AGGCCAUGGUGUCUAGaGC 0.2% 6403 1201 ACCAAaAAgAUGGUCACAC 0.2%

6404 1201 ACCAAGAgAAUGGUCACuC 0.1% 1205 1205 CAUGAAGAUCUGUUCCACC 88.6%

6405 1201 ACuAAGAAAAUGGUgACAC 1.1% 6455 1205 CAUGAAGAUCUGUUUCACC 0.7%

6406 1201 ACuAAGAAAAUGGUgACgC 0.1% 6456 1205 CAUGAgGAUCUGUUCCACC 0.2%

6407 1201 ACCAAaAAAAUGGUaACAC 0.1% 6457 1205 CAcGAAGAUCUGUUCCACC 0.1%

6408 1201 ACCAAaAAAAUGGUgACAC 0.1% 6458 1205 CAUGAAaAUCUGUUCCACC 0.1%

6409 1201 ACuAAaAAAAUGGUgACAC 0.5% 6459 1205 CAUGAAGAcCUGUUCCACC 6.8%

6410 1201 ACuAAGAAAAUGaUaACAC 0.2% 6460 1205 CAUGAAGAUCUGCUCCACC 0.2%

1202 1202 CCAAGAAAAUGGUCACACA 88.7% 6461 1205 CAUGAAGAUCUGUUaCACC 0.1%

641 1 1202 CCAAGAAAAUGGUCACgCA 0.3% 6462 1205 CcUGAAGAUCUGUUCCACC 0.1%

6412 1202 CCAAGAAAAUGGUUACACA 0.1% 6463 1205 CuUGAAGAUCUGUUCCACC 2.0%

6413 1202 CCAAGAAgAUGGUCACACA 0.3% 6464 1205 CAUGAAGAcCUGUUCuACC 0.1%

6414 1202 CCAAGAgAAUGGUCACACA 0.4% 6465 1205 CAUaAAGAcCUGUUCCACC 0.1%

6415 1202 CCAAaAAAAUGGUCACACA 0.7% 6466 1205 CAUGAAGAcCUGcUCCACC 0.2% 641 (3 1202 CCAAGAAAAUGaUCACACA 0.1% 1206 1206 AUGAAGAUCUGUUCCACCA 88.6%

6417 1202 CCAAGAAAAUGGUCACCCA 0.1% 6467 1206 AUGAAGAUCUGUUUCACCA 0.7%

6418 1202 CCAAGAAu AUGGUCACACA 0.1% 6468 1206 AUGAgGAUCUGUUCCACCA 0.2%

6419 1202 CCcAGAAAAUGGUCACACA 0.1% 6469 1206 AcGAAGAUCUGUUCCACCA 0.1%

6420 1202 CCAAaAAgAUGGUCACACA 0.2% 6470 1206 AUGAAaAUCUGUUCCACCA 0.1%

6421 1202 CCAAGAgAAUGGUCACuCA 0.1% 6471 1206 AUGAAGAcCUGUUCCACCA 6.8%

6422 1202 CuAAGAAAAUGGUgACACA 1.1% 6472 1206 AUGAAGAUCUGCUCCACCA 0.2%

6423 1202 CuAAGAAAAUGGUgACgCA 0.1% 6473 1206 AUGAAGAUCUGUUaCACCA 0.1%

6424 1202 CCAAaAAAAUGGUaACACA 0.1% 6474 1206 cUGAAGAUCUGUUCCACCA 0.1%

6425 1202 CCAAaAAAAUGGUgACACA 0.1% 6475 1206 uUGAAGAUCUGUUCCACCA 2.0%

6426 1202 CuAAaAAAAUGGUgACACA 0.5% 6476 1206 AUGAAGAcCUGUUCuACCA 0.1%

6427 1202 CuAAGAAAAUGaUaACACA 0.2% 6477 1206 AUaAAGAcCUGUUCCACCA 0.1%

1203 1203 UGACCAAGAAAAUGGUCAC 88.7% 6478 1206 AUGAAGAcCUGcUCCACCA 0.2%

6428 1203 UGACCAAGAAAAUGGUuAC 0.1% 1207 1207 CAAGAAAAUGGUCACACAA 88.6%

6429 1203 UGACCAAGAAgAUGGUCAC 0.3% 6479 1207 CAAGAAAAUGGUCACACAg 0.1%

6430 1203 UGACCAAGAgAAUGGUCAC 0.5% 6480 1207 CAAGAAAAUGGUCACgCAA 0.3%

643 1 1203 UaACCAAGAAAAUGGUCAC 0.3% 6481 1207 CAAGAAAAUGGUUACACAA 0.1%

6432 1203 UGACCAAaAAAAUGGUCAC 0.7% 6482 1207 CAAGAAgAUGGUCACACAA 0.3%

6433 1203 UGACCAAGAAAAUGaUCAC 0.1% 6483 1207 CAAGAgAAUGGUCACACAA 0.4%

6434 1203 UGACCAAGAAuAUGGUCAC 0.1% 6484 1207 CAAaAAAAUGGUCACACAA 0.7%

6435 1203 UGACCcAGAAAAUGGUCAC 0.1% 6485 1207 CAAGAAAAUGaUCACACAA 0.1%

6436 1203 UuACCAAGAAAAUGGUCAC 0.1% 6486 1207 CAAGAAAAUGGUCACCCAA 0.1%

6437 1203 UGACCAAaAAgAUGGUCAC 0.2% 6487 1207 CAAGAAuAUGGUCACACAA 0.1%

6438 1203 UGACuAAGAAAAUGGUgAC 1.2% 6488 1207 CcAGAAAAUGGUCACACAA 0.1%

6439 1203 UGACCAAaAAAAUGGUaAC 0.1% 6489 1207 CAAaAAgAUGGUCACACAA 0.2%

6440 1203 UGACuAAGAAAAUGaUaAC 0.2% 6490 1207 CAAGAgAAUGGUCACuCAA 0.1%

1204 1204 AUGACCAAGAAAAUGGUCA 88.7% 6491 1207 uAAGAAAAUGGUgACACAA 0.5%

6441 1204 AUGACCAAGAAAAUGGUUA 0.1% 6492 1207 uAAGAAAAUGGUgACACAg 0.5%

6442 1204 AUGACCAAGAAgAUGGUCA 0.3% 6493 1207 uAAGAAAAUGGUgACgCAA 0.1%

6443 1204 AUGACCAAGAgAAUGGUCA 0.5% 6494 1207 CAAaAAAAUGGUgACACAA 0.1%

6444 1204 AUaACCAAGAAAAUGGUCA 0.3% 6495 1207 CAAaAAAAUGGUaACACAg 0.1%

6445 1204 AUGACCAAaAAAAUGGUCA 0.7% 6496 1207 uAAaAAAAUGGUgACACAA 0.5% 6440 1204 AUGACCAAGAAAAUGaUCA 0.1% 1208 1208 UCAUGAAGAUCUGUUCCAC 88.5%

6447 1204 AUGACCAAGAAuAUGGUCA 0.1% 6497 1208 UCAUGAAGAUCUGUUUCAC 0.7%

6448 1204 AUGACCcAGAAAAUGGUCA 0.1% 6498 1208 UCAUGAgGAUCUGUUCCAC 0.2%

6449 1204 AUuACCAAGAAAAUGGUCA 0.1% 6499 1208 cCAUGAAGAUCUGUUCCAC 0.1%

6450 1204 AUGACCAAaAAgAUGGUCA 0.2% 6500 1208 UCAcGAAGAUCUGUUCCAC 0.1%

6451 1204 AUGACuAAGAAAAUGGUgA 0.3% 6501 1208 UCAUGAAaAUCUGUUCCAC 0.1%

6452 1204 gUGACuAAGAAAAUGGUgA 0.9% 6502 1208 UCAUGAAGAcCUGUUCCAC 6.8%

6453 1204 AUGACCAAaAAAAUGGUaA 0.1% 6503 1208 UCAUGAAGAUCUGCUCCAC 0.2%

6454 1204 AUGACuAAGAAAAUGaUaA 0.2% 6504 1208 UCAUGAAGAUCUGUUaCAC 0.1%

6505 1208 UCcUGAAGAUCUGUUCCAC 0.1% 6554 1213 ACAUGACCAAGAAUAUGGU 0.1%

6506 1208 UCuUGAAGAUCUGUUCCAC 2.0% 6555 1213 ACAUGACCcAGAAAAUGGU 0.1%

6507 1208 UCAUGAAGAcCUGUUCuAC 0.1% 6556 1213 ACAUuACCAAGAAAAUGGU 0.1%

6508 1208 UCAUaAAGAcCUGUUCCAC 0.1% 6557 1213 ACAUGACCAAaAAgAUGGU 0.2%

6509 1208 UCAUGAAGAcCUGcUCCAC 0.2% 6558 1213 AuAUGACCAAaAAAAUGGU 0.1%

1209 1209 AUCAUGAAGAUCUGUUCCA 88.5% 6559 1213 AuAUGACuAAGAAAAUGaU 0.2%

6510 1209 AUCAUGAAGAUCUGUUUCA 0.7% 1214 1214 CAUGAGGGAAUACAAGCAG 88.4%

651 1 1209 AUCAUGAgGAUCUGUUCCA 0.2% 6560 1214 CAUGAGGGgAUACAAGCAG 1.5%

6512 1209 AcCAUGAAGAUCUGUUCCA 0.1% 6561 1214 CAcGAGGGAAUACAAGCAG 0.2%

6513 1209 AUCAcGAAGAUCUGUUCCA 0.1% 6562 1214 CAUaAGGGAAUACAAGCAG 0.1%

6514 1209 AUCAUGAAaAUCUGUUCCA 0.1% 6563 1214 CAUGAaGGAAUACAAGCAG 0.3%

6515 1209 AUCAUGAAGAcCUGUUCCA 6.8% 6564 1214 CAUGAGGGAAUACAAUCAG 0.2%

6516 1209 AUCAUGAAGAUCUGCUCCA 0.2% 6565 1214 CAUGAuGGAAUACAAGCAG 0.1%

6517 1209 AUCAUGAAGAUCUGUUaCA 0.1% 6566 1214 CAUGAGGGAAUUCAAGCCG 0.1%

6518 1209 AUCcUGAAGAUCUGUUCCA 0.1% 6567 1214 CAUGAaGGgAUuCAAGCAG 0.2%

6519 1209 AUCuUGAAGAUCUGUUCCA 2.0% 1215 1215 GAUGAUGGGCAUGUUCAAC 88.4%

6520 1209 AUCAUGAAGACCUGUUCUA 0.1% 6568 1215 GAUGAUGGGCAUGUUCAAu 8.3%

6521 1209 AUCAUaAAGAcCUGUUCCA 0.1% 6569 1215 GAUGAUGGGCAUGUUUAAC 0.7%

6522 1209 AUCAUGAAGAcCUGcUCCA 0.2% 6570 1215 GAUGAUGGGuAUGUUCAAC 0.3%

1210 1210 GAUGGGCAUGUUCAACAUG 88.4% 6571 1215 GAUGAUGGGCAUGUUUAAU 1.9%

6523 1210 GAUGGGCAUGUUCAAUAUG 8.3% 6572 1215 GAUGAUGGGuAUGUUCAAu 0.1%

6524 1210 GAUGGGCAUGUUUAACAUG 0.7% 6573 1215 GAUGAUGGGuAUGUUuAAu 0.1%

6525 1210 GAUGGGuAUGUUCAACAUG 0.3% 6574 1215 GAUGAUGGGaAUGUUCAAC 0.1%

6526 1210 GAUGGGCAUGUUUAAUAUG 1.9% 1216 1216 AACAUGACCAAGAAAAUGG 88.4%

6527 1210 GAUGGGuAUGUUCAAuAUG 0.1% 6575 1216 AACAUGACCAAGAAgAUGG 0.3%

6528 1210 GAUGGGuAUGUUuAAuAUG 0.1% 6576 1216 AACAUGACCAAGAgAAUGG 0.5%

6529 1210 GAUGGGaAUGUUCAACAUG 0.1% 6577 1216 AACAUGACuAAGAAAAUGG 0.1%

121 1 121 1 UGAUGGGCAUGUUCAACAU 88.4% 6578 1216 AAuAUGACCAAGAAAAUGG 0.4%

6530 121 1 UGAUGGGCAUGUUCAAUAU 8.3% 6579 1216 AAuAUGACuAAGAAAAUGG 0.2%

653 1 121 1 UGAUGGGCAUGUUUAACAU 0.7% 6580 1216 AAugUGACuAAGAAAAUGG 0.9%

6532 1211 UGAUGGGuAUGUUCAACAU 0.3% 6581 1216 AACAUaACCAAGAAAAUGG 0.3%

6533 121 1 UGAUGGGCAUGUUUAAUAU 1.9% 6582 1216 AACAUGACCAAaAAAAUGG 0.7%

6534 121 1 UGAUGGGuAUGUUCAAuAU 0.1% 6583 1216 AACAUGACCAAGAAAAUGa 0.1%

6535 121 1 UGAUGGGuAUGUUuAAuAU 0.1% 6584 1216 AACAUGACCAAGAAUAUGG 0.1%

6536 121 1 UGAUGGGaAUGUUCAACAU 0.1% 6585 1216 AACAUGACCcAGAAAAUGG 0.1%

1212 1212 AUGAGGGAAUACAAGCAGG 88.4% 6586 1216 AACAUuACCAAGAAAAUGG 0.1%

6537 1212 AUGAGGGgAUACAAGCAGG 1.5% 6587 1216 AACAUGACCAAaAAgAUGG 0.2%

6538 1212 AcGAGGGAAUACAAGCAGG 0.2% 6588 1216 AAuAUGACCAAaAAAAUGG 0.1%

6539 1212 AUaAGGGAAUACAAGCAGG 0.1% 6589 1216 AAuAUGACuAAGAAAAUGa 0.2%

6540 1212 AUGAaGGAAUACAAGCAGG 0.3% 1217 1217 AUGAUGGGCAUGUUCAACA 88.4%

6541 1212 AUGAGGGAAUACAAUCAGG 0.2% 6590 1217 AUGAUGGGCAUGUUCAAUA 8.3%

6542 1212 AUGAuGGAAUACAAGCAGG 0.1% 6591 1217 AUGAUGGGCAUGUUuAACA 0.7%

6543 1212 AUGAGGGAAUUCAAGCCGG 0.1% 6592 1217 AUGAUGGGuAUGUUCAACA 0.3%

6544 1212 AUGAaGGgAUuCAAGCAGG 0.2% 6593 1217 AUGAUGGGCAUGUUuAAuA 1.9%

1213 1213 ACAUGACCAAGAAAAUGGU 88.4% 6594 1217 AUGAUGGGuAUGUUCAAuA 0.1%

6545 1213 ACAUGACCAAGAAgAUGGU 0.3% 6595 1217 AUGAUGGGuAUGUUuAAuA 0.1%

6546 1213 ACAUGACCAAGAgAAUGGU 0.5% 6596 1217 AUGAUGGGaAUGUUCAACA 0.1%

6547 1213 ACAUGACuAAGAAAAUGGU 0.1% 1218 1218 CAUGACCAAGAAAAUGGUC 88.3%

6548 1213 AuAUGACCAAGAAAAUGGU 0.4% 6597 1218 CAUGACCAAGAAAAUGGUu 0.1%

6549 1213 AuAUGACuAAGAAAAUGGU 0.2% 6598 1218 CAUGACCAAGAAgAUGGUC 0.3%

6550 1213 AugUGACuAAGAAAAUGGU 0.9% 6599 1218 CAUGACCAAGAgAAUGGUC 0.5%

655 1 1213 ACAUaACCAAGAAAAUGGU 0.3% 6600 1218 UAUGACCAAGAAAAUGGUC 0.4%

6552 1213 ACAUGACCAAaAAAAUGGU 0.7% 6601 1218 CAUaACCAAGAAAAUGGUC 0.3%

6553 1213 ACAUGACCAAGAAAAUGaU 0.1% 6602 1218 CAUGACCAAaAAAAUGGUC 0.6%

6603 1218 CAUGACCAAGAAAAUGaUC 0.1% 6653 1222 AUGGGCAUGUUUAACAUGC 0.7%

6604 1218 CAUGACCAAGAAUAUGGUC 0.1% 6654 1222 AUGGGuAUGUUCAACAUGC 0.3%

6605 1218 CAUGACCcAGAAAAUGGUC 0.1% 6655 1222 AUGGGCAUGUUCAAUAUGU 8.3%

6606 1218 CAUuACCAAGAAAAUGGUC 0.1% 6656 1222 AUGGGCAUGUUUAAUAUGC 1.9%

6607 1218 CAUGACCAAaAAgAUGGUC 0.2% 6657 1222 AUGGGuAUGUUCAAuAUGu 0.1%

6608 1218 CAUGACuAAGAAAAUGGUg 0.1% 6658 1222 AUGGGaAUGUUCAACAUGC 0.1%

6609 1218 uAUGACCAAaAAAAUGGUC 0.1% 1223 1223 ACAACAUGACCAAGAAAAU 88.2%

6610 1218 uAUGACuAAGAAAAUGGUg 0.2% 6659 1223 ACAACAUGACCAAGAAgAU 0.3%

661 1 1218 CAUGACCAAaAAAAUGGUa 0.1% 6660 1223 ACAACAUGACCAAGAgAAU 0.5%

1219 1219 CUCAUAGUGAAUGCACCAA 88.3% 6661 1223 ACAACAUGACuAAGAAAAU 0.1%

6612 1219 CUCAUAGUGAAUGCACCAg 0.2% 6662 1223 ACAAuAUGACCAAGAAAAU 0.4%

6613 1219 CUCAUAGUGAAUGCAUCAA 0.4% 6663 1223 AuAACAUGACCAAGAAAAU 0.3%

6614 1219 CUuAUAGUGAAUGCACCAA 0.1% 6664 1223 ACAAuAUGACuAAGAAAAU 0.4%

6615 1219 CUaAUAGUGAAUGCACCAA 0.2% 6665 1223 ACAAugUGACuAAGAAAAU 0.9%

6616 1219 CUCAUAaUGAAUGCACCAA 0.1% 6666 1223 ACAACAUaACCAAGAAAAU 0.3%

6617 1219 CUCAUAGUGAAUGCACaAA 0.1% 6667 1223 ACAACAUGACCAAaAAAAU 0.7%

6618 1219 CUCAUAGUGAAUGCACCUA 0.2% 6668 1223 ACAACAUGACCAAGAAUAU 0.1%

6619 1219 CUCAUAGUuAAUGCACCAA 0.1% 6669 1223 ACAACAUGACCCAGAAAAU 0.1%

6620 1219 CUCAUuGUGAAUGCACCAA 0.1% 6670 1223 ACAACAUuACCAAGAAAAU 0.1%

6621 1219 CUCAUcGUaAAUGCACCgA 1.3% 6671 1223 ACAACAUGACCAAaAAgAU 0.2%

6622 1219 CUgAUAGUGAAUGCgCCcA 0.1% 6672 1223 ACAAuAUGACCAAaAAAAU 0.1%

1220 1220 GACCAAGAAAAUGGUCACA 88.3% 1224 1224 AUAGUGAAUGCACCAAAUC 88.2%

6623 1220 GACCAAGAAAAUGGUCACg 0.3% 6673 1224 AUAGUGAAUGCACCAgAUC 0.2%

6624 1220 GACCAAGAAAAUGGUUACA 0.1% 6674 1224 AUAGUGAAUGCAUCAAAUC 0.4%

6625 1220 GACCAAGAAgAUGGUCACA 0.3% 6675 1224 AUAaUGAAUGCACCAAAUC 0.1%

6626 1220 GACCAAGAgAAUGGUCACA 0.4% 6676 1224 AUAGUGAAUGCACaAAAUC 0.1%

6627 1220 aACCAAGAAAAUGGUCACA 0.3% 6677 1224 AUAGUGAAUGCACCAAACC 0.4%

6628 1220 GACCAAaAAAAUGGUCACA 0.7% 6678 1224 AUAGUGAAUGCACCUAAUC 0.1%

6629 1220 GACCAAGAAAAUGaUCACA 0.1% 6679 1224 AUAGUuAAUGCACCAAAUC 0.1%

6630 1220 GACCAAGAAAAUGGUCACc 0.1% 6680 1224 AUuGUGAAUGCACCAAAUC 0.1%

663 1 1220 GACCAAGAAuAUGGUCACA 0.1% 6681 1224 AUAGUGAAUGCACCuAgUC 0.1%

6632 1220 GACCcAGAAAAUGGUCACA 0.1% 6682 1224 AUuGUGAAUGCACCcAAUC 1.1%

6633 1220 UACCAAGAAAAUGGUCACA 0.1% 6683 1224 AUcGUaAAUGCACCgAAUC 1.3%

6634 1220 GACCAAaAAgAUGGUCACA 0.2% 1225 1225 UAGUGAAUGCACCAAAUCA 88.2%

6635 1220 GACCAAGAgAAUGGUCACu 0.1% 6684 1225 UAGUGAAUGCACCAgAUCA 0.2%

6636 1220 GACuAAGAAAAUGGUgACA 1.1% 6685 1225 UAGUGAAUGCAUCAAAUCA 0.4%

6637 1220 GACuAAGAAAAUGGUgACg 0.1% 6686 1225 UAaUGAAUGCACCAAAUCA 0.1%

6638 1220 GACCAAaAAAAUGGUaACA 0.1% 6687 1225 UAGUGAAUGCACaAAAUCA 0.1%

6639 1220 GACuAAGAAAAUGaUaACA 0.2% 6688 1225 UAGUGAAUGCACCAAACCA 0.4%

1221 1221 UCAUAGUGAAUGCACCAAA 88.3% 6689 1225 UAGUGAAUGCACCUAAUCA 0.1%

6640 1221 UCAUAGUGAAUGCACCAgA 0.2% 6690 1225 UAGUuAAUGCACCAAAUCA 0.1%

664 1 1221 UCAUAGUGAAUGCAUCAAA 0.4% 6691 1225 UuGUGAAUGCACCAAAUCA 0.1%

6642 1221 UuAUAGUGAAUGCACCAAA 0.1% 6692 1225 UAGUGAAUGCACCuAgUCA 0.1%

6643 1221 UaAUAGUGAAUGCACCAAA 0.2% 6693 1225 UuGUGAAUGCACCcAAUCA 1.1%

6644 1221 UCAUAaUGAAUGCACCAAA 0.1% 6694 1225 UcGUaAAUGCACCgAAUCA 1.3%

6645 1221 UCAUAGUGAAUGCACaAAA 0.1% 1226 1226 GCCAUGGUGUCUAGGGCCC 88.2%

6646 1221 UCAUAGUGAAUGCACCUAA 0.1% 6695 1226 GCCAUGGUGUCUAGGGCUC 1.3%

6647 1221 UCAUAGUuAAUGCACCAAA 0.1% 6696 1226 GCCAUGGUGUCUgGGGCCC 0.1%

6648 1221 UCAUuGUGAAUGCACCAAA 0.1% 6697 1226 GCaAUGGUGUCUAGGGCCC 0.1%

6649 1221 UCAUAGUGAAUGCACCuAg 0.1% 6698 1226 GCCAUGaUGUCUAGGGCCC 0.2%

6650 1221 UCAUcGUaAAUGCACCgAA 1.3% 6699 1226 GCCAUGGUaUCUAGGGCCC 0.4%

6651 1221 UgAUAGUGAAUGCgCCcAA 0.1% 6700 1226 GCCAUGGUGUCUAaGGCCC 0.1%

1222 1222 AUGGGCAUGUUCAACAUGC 88.2% 6701 1226 GCCAUGGUGUCUAGaGCCC 0.2%

6652 1222 AUGGGCAUGUUCAACAUGu 0.2% 6702 1226 GCCAUGGUGUCUCGGGCCC 0.3%

6703 1226 GCCAUGGUuUCUAGGGCCC 0.2% 6753 1230 CAACAUuACCAAGAAAAUG 0.1%

6704 1226 GCCAUGGUGUCUCGGGCUC 0.1% 6754 1230 CAACAUGACCAAaAAgAUG 0.2%

6705 1226 GCCAUGGUaUCUAGGaCCC 0.1% 6755 1230 CAAuAUGACCAAaAAAAUG 0.1%

6706 1226 GCCAUGGUaUCUAGGGCCa 0.1% 6756 1230 CAACAUGAgCAAGAAAAaG 0.1%

6707 1226 GCCAUGGUuUCcAGGGCCC 0.1% 1231 1231 UGAAUGCACCAAAUCAUGA 88.1%

1227 1227 GACAACAUGACCAAGAAAA 88.2% 6757 1231 UGAAUGCACCAgAUCAUGA 0.2%

6708 1227 GACAACAUGACCAAGAAgA 0.3% 6758 1231 UGAAUGCAuCAAAUCAUGA 0.4%

6709 1227 GACAACAUGACCAAGAgAA 0.5% 6759 1231 UGAAUGCACaAAAUCAUGA 0.1%

6710 1227 GACAACAUGACuAAGAAAA 0.1% 6760 1231 UGAAUGCACCAAACCAUGA 0.4%

671 1 1227 GACAAuAUGACCAAGAAAA. 0.4% 6761 1231 UGAAUGCACCAAAUCACGA 0.2%

6712 1227 GAuAACAUGACCAAGAAAA 0.3% 6762 1231 UGAAUGCACCAAAUCAUaA 0.1%

6713 1227 GACAAuAUGACuAAGAAAA 0.4% 6763 1231 UGAAUGCACCCAAUCAUGA 0.5%

6714 1227 GACAAugUGACuAAGAAAA 0.9% 6764 1231 UGAAUGCACCUAAUCAUGA 0.1%

6715 1227 GACAACAUaACCAAGAAAA 0.3% 6765 1231 UuAAUGCACCAAAUCAUGA 0.1%

6716 1227 GACAACAUGACCAAaAMA 0.7% 6766 1231 UaAAUGCACCgAAUCAUGA 1.3%

6717 1227 GACAACAUGACCAAGAAUA 0.1% 6767 1231 UGAAUGCACCuAgUCAUGA 0.1%

6718 1227 GACAACAUGACCCAGAAAA 0.1% 6768 1231 UaAAUGCgCCgAAUCAUGA 0.1%

6719 1227 GACAACAUuACCAAGAAAA 0.1% 6769 1231 UaAAUGCACCcAAUCAUGA 0.1%

6720 1227 GACAACAUGACCAAaAAgA 0.2% 6770 1231 UGAAUGCACCCAAUCAUGC 0.5%

6721 1227 GACAAuAUGACCAAaAAAA 0.1% 6771 1231 UaAAcGCACCgAAUCAUGA 0.2%

1228 1228 UGGGCAUGUUCAACAUGCU 88.2% 1232 1232 GUGAAUGCACCAAAUCAUG 88.0%

6722 1228 UGGGCAUGUUCAACAUGUU 0.2% 6772 1232 GUGAAUGCACCAgAUCAUG 0.2%

6723 1228 UGGGCAUGUUUAACAUGCU 0.7% 6773 1232 GUGAAUGCAuCAAAUCAUG 0.4%

6724 1228 UGGGuAUGUUCAACAUGCU 0.3% 6774 1232 aUGAAUGCACCAAAUCAUG 0.1%

6725 1228 UGGGCAUGUUCAAUAUGUU 8.3% 6775 1232 GUGAAUGCACaAAAUCAUG 0.1%

6726 1228 UGGGCAUGUUuAAuAUGCU 1.9% 6776 1232 GUGAAUGCACCAAACCAUG 0.4%

6727 1228 UGGGuAUGUUCAAuAUGuU 0.1% 6777 1232 GUGAAUGCACCAAAUCACG 0.2%

6728 1228 UGGGaAUGUUCAACAUGCU 0.1% 6778 1232 GUGAAUGCACCAAAUCAUa 0.1%

1229 1229 CCAUGGUGUCUAGGGCCCG 88.2% 6779 1232 GUGAAUGCACCCAAUCAUG 1.1%

6729 1229 CCAUGGUGUCUAGGGCUCG 1.3% 6780 1232 GUGAAUGCACCUAAUCAUG 0.1%

6730 1229 CCAUGGUGUCUgGGGCCCG 0.1% 6781 1232 GUuAAUGCACCAAAUCAUG 0.1%

673 1 1229 CaAUGGUGUCUAGGGCCCG 0.1% 6782 1232 GUaAAUGCACCgAAUCAUG 1.3%

6732 1229 CCAUGaUGUCUAGGGCCCG 0.2% 6783 1232 GUGAAUGCACCuAgUCAUG 0.1%

6733 1229 CCAUGGUaUCUAGGGCCCG 0.4% 6784 1232 GUaAAUGCgCCgAAUCAUG 0.1%

6734 1229 CCAUGGUGUCUAaGGCCCG 0.1% 6785 1232 GUaAAUGCACCcAAUCAUG 0.1%

6735 1229 CCAUGGUGUCUAGaGCCCG 0.2% 6786 1232 GUGAAUGCACCCAACCAUG 0.5%

6736 1229 CCAUGGUGUCUcGGGCCCG 0.3% 6787 1232 GUaAAcGCACCgAAUCAUG 0.2%

6737 1229 CCAUGGUuUCUAGGGCCOG 0.2% 6788 1232 GUGAAUGCACCcAAcuAUG 6.2%

6738 1229 CCAUGGUGUCU cGGGCuCG 0.1% 1233 1233 AGAUCAUGAAGAUCUGUUC 88.0%

6739 1229 CCAUGGUaUCUAGGaCCCG 0.1% 6789 1233 AGAUCAUGAAGAUCUGUUu 0.7%

6740 1229 CCAUGGUaUCUAGGGCCaG 0.1% 6790 1233 AGAUCAUGAgGAUCUGUUC 0.2%

6741 1229 CCAUGGUuUCcAGGGCCCG 0.1% 6791 1233 AaAUCAUGAAGAUCUGUUC 0.5%

1230 1230 CAACAUGACCAAGAAAAUG 88.2% 6792 1233 AGAcCAUGAAGAUCUGUUC 0.1%

6742 1230 CAACAUGACCAAGAAgAUG 0.3% 6793 1233 AGAUCAcGAAGAUCUGUUC 0.1%

6743 1230 CAACAUGACCAAGAgAAUG 0.5% 6794 1233 AGAUCAUGAAaAUCUGUUC 0.1%

6744 1230 CAACAUGACuAAGAAAAUG 0.1% 6795 1233 AGAUCAUGAAGACCUGUUC 6.9%

6745 1230 CAAuAUGACCAAGAAAAUG 0.4% 6796 1233 AGAUCAUGAAGAUCUGCUC 0.2%

6746 1230 uAACAUGACCAAGAAAAUG 0.3% 6797 1233 AGAUCAUGAAGAUCUGUUa 0.1%

6747 1230 CAAuAUGACuAAGAAAAUG 0.4% 6798 1233 AGAUCcUGAAGAUCUGUUC 0.1%

6748 1230 CAAugUGACuAAGAAAAUG 0.9% 6799 1233 AGAUCuUGAAGAUCUGUUC 2.0%

6749 1230 CAACAUaACCAAGAAAAUG 0.3% 6800 1233 AGAUCAUaAAGAcCUGUUC 0.1%

6750 1230 CAACAUGACCAAaAAAAUG 0.7% 6801 1233 AGAUCAUGAAGACCUGCUC 0.2%

6751 1230 CAACAUGACCAAGAAUAUG 0.1% 1234 1234 UCAUGAGGGAAUACAAGCA 88.0%

6752 1230 CAACAUGACCCAGAAAAUG 0.1% 6802 1234 UCAUGAGGGgAUACAAGCA 1.5%

6803 1234 cCAUGAGGGAAUACAAGCA 0.4% 6840 1237 aAUCAUGMGAUCUGUUCC 0.5%

6804 1234 UCAcGAGGGAAUACAAGCA 0.2% 6841 1237 GAcCAUGMGAUCUGUUCC 0.1%

6805 1234 UCAUaAGGGAAUACAAGCA 0.1% 6842 1237 GAUCAcGAAGAUCUGUUCC 0.1%

6806 1234 UCAUGAaGGAAUACAAGCA 0.3% 6843 1237 GAUCAUGMaAUCUGUUCC 0.1%

6807 1234 UCAUGAGGGAAUACAAu CA 0.2% 6844 1237 GAUCAUGAAGACCUGUUCC 6.8%

6808 1234 UCAUGAuGGAAUACAAGCA 0.1% 6845 1237 GAUCAUGAAGAUCUGCUCC 0.2%

6809 1234 UCAUGAGGGAAUuCAAGCc 0.1% 6846 1237 GAUCAUGAAGAUCUGUUaC 0.1%

6810 1234 UCAUGAaGGgAUuCAAGCA 0.2% 6847 1237 GAUCcUGMGAUCUGUUCC 0.1%

1235 1235 CAGCGCAAAAUGCCAUAAG 88.0% 6848 1237 GAUCuUGAAGAUCUGUUCC 2.0%

681 1 1235 CAGCGCAAAAUGCUAUAAG 0.4% 6849 1237 GAUCAUGMGAcCUGUUCu 0.1%

6812 1235 CAGCGCAgAAUGCCAUAAG 0.2% 6850 1237 GAUCAUaMGAcCUGUUCC 0.1%

6813 1235 CAGuGCAAAAUGCCAUAAG 0.6% 6851 1237 GAUCAUGMGACCUGCUCC 0.2%

6814 1235 CgGCGCAAAAUGCCAUAAG 0.1% 1238 1238 AGUGMUGCACCAMUCAU 88.0%

6815 1235 CAGuGCAAAAUGCuAUAAG 1.6% 6852 1238 AGUGAAUGCACCAgAUCAU 0.2%

6816 1235 CAaCGCAAAAUGCCAUAAG 0.1% 6853 1238 AGUGAAUGCAUCAMUCAU 0.4%

6817 1235 CAGCa CAAAAUGCCAUAAG 0.1% 6854 1238 AaUGAAUGCACCAAAUCAU 0.1%

6818 1235 CAGCGCAAAAUGCaAUAAG 0.2% 6855 1238 AGUGAAUGCACaAAAUCAU 0.1%

6819 1235 CAGCu CAAAAUGCCAUAAG 0.1% 6856 1238 AGUGAAUGCACCAMCCAU 0.4%

6820 1235 CAuCGCAAMUGCCAUMG 0.1% 6857 1238 AGUGAAUGCACCAMUCAc 0.2%

6821 1235 CAGCaCAAAAUGCuAUAAG 8.1% 6858 1238 AGUGAAUGCACCUMUCAU 0.1%

6822 1235 CAGCa CAAAAUGuuAUAAG 0.1% 6859 1238 AGUuAAUGCACCAMUCAU 0.1%

6823 1235 CAGCaCAAAgUGCuAUAAG 0.1% 6860 1238 uGUGAAUGCACCAAAUCAU 0.1%

6824 1235 CAGCaCAAAAUGCaAUAAG 0.1% 6861 1238 AGUGAAUGCACCuAgUCAU 0.1%

1236 1236 GAAUUCUUGAGGAUGMCA 88.0% 6862 1238 uGUGAAUGCACCcAAUCAU 1.1%

6825 1236 GAAUUCUUGAGGAUGAgCA 0.1% 6863 1238 cGUaAAUGCACCgAAUCAU 1.3%

6826 1236 GAAUUCUUGAGGAUGgACA 0.1% 1239 1239 GGAAUUCUUGAGGAUGAAC 88.0%

6827 1236 GAAUaCUUGAGGAUGAACA 0.2% 6864 1239 GGMUUCUUGAGGAUGAgC 0.1%

6828 1236 GAAUcCUUGAGGAUGAACA 0.4% 6865 1239 GGAAUUCUUGAGGAUGgAC 0.1%

6829 1236 GAAUUaUUGAGGAUGAACA 0.1% 6866 1239 GGAAUaCUUGAGGAUGMC 0.2%

6830 1236 GAAUUCUcGAGGAUGAACA 0.1% 6867 1239 GGAAUcCUUGAGGAUGMC 0.4%

683 1 1236 GAAUUCUUGAaGAUGAACA 2.1% 6868 1239 GGMUUaUUGAGGAUGMC 0.1%

6832 1236 GAAUUCUUGAGGACGMCA 0.1% 6869 1239 GGMUUCUcGAGGAUGMC 0.1%

6833 1236 GAAUUCUUGAGGAUCAACA 0.3% 6870 1239 GGAAUUCUUGAaGAUGAAC 2.1%

6834 1236 GAAUaCUUGAGGAUGAgCA 0.5% 6871 1239 GGMUUCUUGAGGACGMC 0.1%

6835 1236 GgAUaCUUGAGGAUGAgCA 4.3% 6872 1239 GGMUUCUUGAGGAUCMC 0.3%

6836 1236 GMUaCUUGAaGAUGAACA 1.0% 6873 1239 GGAAUaCUUGAGGAUGAgC 0.5%

6837 1236 GAgUaCUUGAaGAUGAACA 0.2% 6874 1239 GGgAUaCUUGAGGAUGAgC 4.3%

1237 1237 GAUCAUGMGAUCUGUUCC 88.0% 6875 1239 GGMUaCUUGAaGAUGMC 1.0%

6838 1237 GAUCAUGAAGAUCUGUUU C 0.7% 6876 1239 GGAgUaCUUGAaGAUGMC 0.2%

6839 1237 GAUCAUGAgGAUCUGUUCC 0.2%

Table 20-3: Conserved and minor variant 19-mer sequences from the Influenza A segment 3 (PA) sequences listed in Table 1-3. The conserved sequences match at least 89% of the listed viral sequences, and the variants contain 3 or fewer nucleotide changes from the reference sequence.

Seq Ref 6881 2391 UUUAaAGCCUAUGUGGAUG 0 , 1% ID ID Match Seq Total

6882 2391 UUUAGAaCCUAUGUGGAUG 0 . 6%

23?0 2390 UUAGAGCCUAUGUGGAUGG 98.9%

6883 2391

6877 2390 UUAaAGCCUAUGUGGAUGG 0 , 1% UUUAGAGCCUAUGUaGAUG 0 , 2%

6884 2391 UUUAGAGCCUAUGUGGAUa 0 . 1%

6878 2390 UUAGAaCCUAUGUGGAUGG 0 , 6%

2392 2392

6879 2390 UUAGAGCCUAUGUaGAUGG 0 , 2% AGCMUUGAGGAGUGCCUG 98 , 7% 92

6880 2390 UUAGAGCCUAUGUGGAUaG 0.1% 6885 23 AGCAgUUGAGGAGUGCCUG 0 . 1%

6886 2392

23Pl 2391 UUUAGAGCCUAUGUGGAUG 98.9% AGCMUUGAGGAaUGCCUG 1 , 0%

6887 2392 AGCMUUGAGGAGUGCCUa 0 , 2%

2393 2393 UUGAGGAGUGCCUGAUJM 98.7% 6931 2403 AUGAUCCCUGGGUUCUGCU 0.1%

6888 2393 UUGAGGAGUGCCUGgUUZ-A 0.1% 2404 2404 AAUGAUCCCUGGGUUUUGC 98.0%

6889 2393 UUGAGGAaUGCCUGAUUAA 1.0% 6932 2404 AAcGAUCCCUGGGUUUUGC 1 , 8%

6890 2393 UUGAGGAGUGCCUaAULiM 0.2% 6933 2404 AAUGAUCCCUGGGcUUUGC 0 , 1%

2394 2394 GCAAUUGAGGAGUGCCUGA 98.6% 6934 2404 AAUGAUCCCUGGGUUCUGC 0 , 1%

6891 2394 GCAAUUGAGGAGUGCCUGg 0.1% 2405 2405 CCUGAUUAAUGAUCCCUGG 97,9%

6892 2394 GCAgUUGAGGAGUGCCUGA 0.1% 6935 2405 CCUGgUUAAUGAUCCCUGG 0 , 1%

6893 2394 GCAAUUGAGGAaUGCCUGA 1.0% 6936 2405 CCUaAUUAAUGAUCCCUGG 0.2%

6894 2394 GCAAUUGAGGAGUGCDJaA 0.2% 6937 2405 CCUGAUUAAcGAUCCCUGG 1.8%

2395 2395 AUUGAGGAGUGCCUGAUUA 98.6% 2406 2406 UGCCUGAUUAAUGAUCCCU 97,9%

6895 2395 AUUGAGGAGUGCCUGgUUA 0.1% 6938 2406 UGCCUGgUUAAUGAUCCCU 0.1%

6896 2395 gUUGAGGAGUGCCUGAUUA 0.1% 6939 2406 UGCCUaAUUAAUGAUCCCU 0 , 2%

6897 2395 AUUGAGGAaUGCOJGAUUA 1.0% 6940 2406 UGCCUGAUUAACGAUCCCU 1 , 8%

6898 2395 AUUGAGGAGUGCCUaAUUA 0.2% 2407 2407 AUUAAUGAUCCCUGGGUUU 97.9%

2396 2396 CAAUUGAGGAGUGCCUGAU 98.6% 6941 2407 gUUAAUGAUCCCUGGGUUU 0 , 1%

6899 2396 CAAUUGAGGAGUGCCUGgU 0.1% 6942 2407 AUUAAcGAUCCCUGGGUUU 1 , 8%

6900 2396 CAgUUGAGGAGUGCCUGAU 0.1% 6943 2407 AUUAAUGAUCCCUGGGCUU 0.1%

6901 2396 CAAUUGAGGAaUGCCUGAU 1.0% 6944 2407 AUUAAUGAUCCCUGGGUUc 0.1%

6902 2396 CAAUUGAGGAGUGCCUaAU 0.2% 2408 2408 GCCUGAUUAAUGAUCCCUG 97.9%

2397 2397 AAUUGAGGAGUGCCUGAUU 98.6% 6945 2408 GCCUGgUUAAUGAUCCCUG 0.1%

6903 2397 AAUUGAGGAGUGCCUGgUU 0,1% 6946 2408 GCCUaAUUAAUGAUCCCUG 0.2%

6904 2397 AgUUGAGGAGUGCCUGAUU 0,1% 6947 2408 GCCUGAUUAACGAUCCCUG 1.8%

6905 2397 AAUUGAGGAaUGCCUGAUU 1,0% 2409 2409 CUGAUUAAUGAUCCCUGGG 97,9%

6906 2397 AAUUGAGGAGUGCCUaAUU 0.2% 6948 2409 CUGgUUAAUGAUCCCUGGG 0.1%

2398 2398 UAGAGCCUAUGUGGAUGGA 98.3% 6949 2409 CUaAUUAAUGAUCCCUGGG 0 , 2%

6907 2398 UAGAGCCUAUGUGGAUGGg 0.6% 6950 2409 CUGAUUAAcGAUCCCUGGG 1 , 8%

6903 2398 UAaAGCCUAUGUGGAUGGA 0.1% 2410 2410 GAUUAAUGAUCCCUGGGUU 97.8%

6909 2398 UAGAaCCUAUGUGGAUGGA 0.6% 6951 2410 GgUUAAUGAUCCCUGGGUU 0.1%

6910 2398 UAGAGCCUAUGUaGAUGGA 0.2% 6952 2410 aAUUAAUGAUCCCUGGGUU 0 , 2%

6911 2398 UAGAGCCUAUGUGGAUaGA 0.1% 6953 2410 GAUUAAcGAUCCCUGGGUU 1 , 8%

2399 2399 AGAGCCUAUGUGGAUGGAU 98.3% 6954 2410 GAUUAAUGAUCCCUGGGCU 0 , 1%

6912 2399 AGAGCCUAUGUGGAUGGgU 0.6% 2411 2411 UGAUUAAUGAUCCCUGGGU 97,8%

6913 2399 AaAGCCUAUGUGGAUGGAU 0,1% 6955 2411 UGgUUAAUGAUCCCUGGGU 0 , 1%

6914 2399 AGAaCCUAUGUGGAUGGAU 0.6% 6956 2411 UaAUUAAUGAUCCCUGGGU 0 , 2%

6915 2399 AGAGCCUAUGUaGAUGGAU 0,2% 6957 2411 UGAUUAAcGAUCCCUGGGU 1.8%

6916 2399 AGAGCCUAUGUGGAUaGAU 0.1% 6958 2411 UGAUUAAUGAUCCCUGGGc 0 , 1%

2400 2400 GAGCCUAUGUGGAUGGAUU 98.2% 2412 2412 UAUAUGAAGCAAUUGAGGA 97.3%

6917 2400 GAGCCUAUGUGGAUGGgUU 0.6% 6959 2412 UAUAUGAAGCAgUUGAGGA 0 , 1%

6913 2400 aAGCCUAUGUGGAUGGAUU 0.1% 6960 2412 UAUAUGgAGCAAUUGAGGA 2 , 6%

6919 2400 GAaCCUAUGUGGAUGGAUU 0.6% 2413 2413 CUAUAUGAAGCAAUUGAGG 97 , 2%

6920 2400 GAGCCUAUGUaGAUGGAUU 0.2% 6961 2413 CUAUAUGAAGCAgUUGAGG 0 , 1%

6921 2400 GAGCCUAUGUGGAUaGAUU 0.1% 6962 2413 CUAUAUGgAGCAAUUGAGG 2 , 6%

6922 2400 GAGCCUAUGUGGAUGGAUa 0.1% 6963 2413 uUAUAUGAAGCAAUUGAGG 0.1%

2401 2401 UAAUGAUCCCUGGGUUUUG 98,0% 2414 2414 CUUGGUUCAACUCCUUCCU 97,1%

6923 2401 UAAcGAUCCCUGGGUUUUG 1.8% 6964 2414 CUUGGUUCAACUCCUUCUU 0 , 1%

6924 2401 UAAUGAUCCCUGGGCUUUG 0.1% 6965 2414 CUUGGUUuAACUCCUUCCU 0.2%

691:5 2401 UAAUGAUCCCUGGGUUCUG 0,1% 6966 2414 CaUGGUUCAACUCCUUCCU 0 , 1%

2402 2402 UUAAUGAUCCCUGGGUUUU 98,0% 6967 2414 CcUGGUUCAACUCCUUCCU 0 , 9%

6926 2402 UUAAcGAUCCCUGGGUUUU 1.8% 6968 2414 CgUGGUUCAACUCCUUCCU 1 , 6%

6927 2402 UUAAUGAUCCCUGGGCUUU 0,1% 2415 2415 UCUUGGUUCAACUCCUUCC 97,1%

6928 2402 UUAAUGAUCCCUGGGUUCU 0,1% 6969 2415 UCUUGGUUCAACUCCUUCu 0.1%

2403 2403 AUGAUCCCUGGGUUUUGCU 98.0% 6970 2415 UCUUGGUUuAACUCCUUCC 0.2%

6929 2403 AcGAUCCCUGGGUUUUGCU 1.8% 6971 2415 UCaUGGUUCAACUCCUUCC 0 , 1%

6930 2403 AUGAUCCCUGGGCUUUGCU 0.1% 6972 2415 UCcUGGUUCAACUCCUUCC 0 , 9%

6973 2415 UCgUGGUUCAACUCCUUCC 1.6% 7015 2427 gAGCAAUUGAGGAGUGCCU 2.6%

2416 2416 AGGAGUGCCUGAUUAAUGA 96.9% 7016 2427 AAGCAAUUGAGGAaUGCCU 1.0%

6974 2416 AGGAGUGCCUGgUUAAUGA 0.1% 2428 2428 GAAGCAAUUGAGGAGUGCC 96,4%

6975 2416 AGGAaUGCCUGAUUAAUGA 1.0% 7017 2428 GAAGCAgUUGAGGAGUGCC 0.1%

6976 2416 AGGAGUGCCUaAUUAAUGA 0.2% 7018 2428 GgAGCAAUUGAGGAGUGCC 2 , 6%

6977 2416 AGGAGUGCCUGAUUAACGA 1.8% 7019 2428 GAAGCAAUUGAGGAaUGCC 1 , 0%

2417 2417 GAGUGCCUGAUUAAUGAUC 96.9% 2429 2429 ACAAAUUUGCAGCAAUAUG 95,4%

6978 2417 GAGUGCCUGgUUAAUGAUC 0,1% 7020 2429 ACAAAUUUGCgGCAAUAUG 0 , 3%

6979 2417 GAaUGCCUGAUUAAUGAUC 1.0% 7021 2429 ACAAAUUcGCAGCAAUAUG 0.1%

698.0 2417 GAGUGCCUaAUUAAUGAUC 0.2% 7022 2429 ACAAAUUUGCUGCAAUAUG 0 , 4%

6981 2417 GAGUGCCUGAUUAACGAUC 1.8% 7023 2429 ACAAgUUUGCuGCAAUAUG 1 , 5%

2418 2418 UGAGGAGUGCCUGAUUAAU 96.9% 7024 2429 ACAAgUUcGCuGCAAUAUG 1 , 8%

6982 2418 UGAGGAGUGCCUGgUUAAU 0,1% 2430 2430 AACAAAUUUGCAGCAAUAU 95,4%

6983 2418 UGAGGAaUGCCUGAUUAAU 1,0% 7025 2430 AACAAAUUUGCgGCAAUAU 0.3%

6984 2418 UGAGGAGUGCCUaAUUAAU 0.2% 7026 2430 AACAAAUUcGCAGCAAUAU 0 , 1%

69S5 2418 UGAGGAGUGCCUGAUUAAc 1.8% 7027 2430 AACAAAUUUGCUGCAAUAU 0.4%

2419 2419 GUGCCUGAUUAAUGAUCCC 96,9% 7028 2430 AACAAgUUUGCuGCAAUAU 1.5%

6986 2419 GUGCCUGgUUAAUGAUCCC 0,1% 7029 2430 AACAAgUUcGCuGCAAUAU 1.8%

6987 2419 aUGCCUGAUUAAUGAUCCC 1,0% 2431 2431 GGGCUAUAUGAAGCAAUUG 95,3%

6983 2419 GUGCCUaAUUAAUGAUCCC 0,2% 7030 2431 GGGCUAUAUGAAGCAgUUG 0.1%

6989 2419 GUGCCUGAUUAACGAUCCC 1,8% 7031 2431 GGGCUAUAUGgAGCAAUUG 2 , 6%

2420 2420 GGAGUGCCUGAUUAAUGAU 96,9% 7032 2431 GGGuUAUAUGAAGCAAUUG 0.1%

6990 2420 GGAGUGCCUGgUUAAUGAU 0,1% 7033 2431 GGaCUAUAUGAAGCAAUUG 1 , 9%

6991 2420 GGAaUGCCUGAUUAAUGAU 1.0% 2432 2432 GCUAUAUGAAGCAAUUGAG 95.3%

6992 2420 GGAGUGCCUaAUUAAUGAU 0,2% 7034 2432 GCUAUAUGAAGCAgUUGAG 0.1%

6993 2420 GGAGUGCCUGAUUAACGAU 1,8% 7035 2432 GCUAUAUGgAGCAAUUGAG 2 , 6%

2411 2421 GAGGAGUGCCUGAUUAAUG 96,9% 7036 2432 GuUAUAUGAAGCAAUUGAG 0.1%

6994 2421 GAGGAGUGCCUGgUUAAUG 0,1% 7037 2432 aCUAUAUGAAGCAAUUGAG 1.9%

6995 2421 GAGGAaUGCCUGAUUAAUG 1,0% 2433 2433 UCAUGUAUUCAGAUUUUCA 95.3%

6996 2421 GAGGAGUGCOJaAUUAAUG 0.2% 7038 2433 UCAUGUAUUCgGAUUUUCA 1 , 9%

6997 2421 GAGGAGUGCCUGAUUAACG 1,8% 7039 2433 UuAUGUAUUCAGAUUUUCA 0.1%

2422 2422 AGUGCCUGAUUAAUGAUCC 96,9% 7040 2433 UCAUGUAcUCAGAUUUUCA 0 , 1%

6998 2422 AGUGCOJGgUUAAUGAUCC 0,1% 7041 2433 UCAUGUAUUCAGACUUUCA 0 , 1%

6999 2422 AaUGCCUGAUUAAUGAUCC 1 , 0% 7042 2433 UCAUGUAUUCAGAUUUCCA 0 , 1%

7000 2422 AGUGCCUaAUUAAUGAUCC 0,2% 7043 2433 UCAUGUAUUCgGAUUUcCA 0.1%

7001 2422 AGUGCCUGAUUAACGAUCC 1 , 8% 7044 2433 UCAUGUAUUCAGACUUCCA 2 , 2%

2423 2423 UAUGAAGCAAUUGAGGAGU 96.4% 2434 2434 GGCUAUAUGAAGCAAUUGA 95,3%

7002 2423 UAUGAAGCAgUUGAGGAGU 0,1% 7045 2434 GGCUAUAUGAAGCAgUUGA 0.1%

7003 2423 UAUGgAGCAAUUGAGGAGU 2,6% 7046 2434 GGCUAUAUGgAGCAAUUGA 2 , 6%

7004 2423 UAUGAAGCAAUUGAGGAaU 1,0% 7047 2434 GGuUAUAUGAAGCAAUUGA 0.1%

2424 2424 AUAUGAAGCAAUUGAGGAG 96,4% 7048 2434 GaCUAUAUGAAGCAAUUGA 1.9%

7005 2424 AUAUGAAGCAgUUGAGGAG 0,1% 2435 2435 CAAAUUUGCAGCAAUAUGC 95,2%

7006 2424 AUAUGgAGCAAUUGAGGAG 2,6% 7049 2435 CAAAUUUGCAGCAAUAUGu 0.2%

7007 2424 AUAUGAAGCAAUUGAGGAa 1,0% 7050 2435 CAAAUUUGCgGCAAUAUGC 0 , 3%

2425 2425 AUGAAGCAAUUGAGGAGUG 96.4% 7051 2435 CAAAUUcGCAGCAAUAUGC 0 , 1%

7008 2425 AUGAAGCAgUUGAGGAGUG 0.1% 7052 2435 CAAAUUUGCuGCAAUAUGC 0.3%

7009 2425 AUGgAGCAAUUGAGGAGUG 2,6% 7053 2435 CAAAUUUGCuGCAAUAUGu 0 , 1%

7010 2425 AUGAAGCAAUUGAGGAaUG 1,0% 7054 2435 CAAgUUUGCuGCAAUAUGC 1 , 5%

2426 2426 UGAAGCAAUUGAGGAGUGC 96.4% 7055 2435 CAAgUUcGCuGCAAUAUGC 1.8%

7031 2426 UGAAGCAgUUGAGGAGUGC 0.1% 2436 2436 AAAUUUGCAGCAAUAUGCA 95,2%

7012 2426 UGgAGCAAUUGAGGAGUGC 2,6% 7056 2436 AAAUUUGCAGCAAUAUGUA 0 , 2%

7013 2426 UGAAGCAAUUGAGGAaUGC 1.0% 7057 2436 AAAUUUGCgGCAAUAUGCA 0 , 3%

2427 2427 AAGCAAUUGAGGAGUGCCU 96.4% 7058 2436 AAAUUcGCAGCAAUAUGCA 0 , 1%

7014 2427 AAGCAgUUGAGGAGUGCCU 0.1% 7059 2436 AAAUUUGCuGCAAUAUGCA 0 , 3%

7060 , 2436 AAAUUUGCuGCAAUAUGuA 0.1% 7107 2443 AgUUUUAGAGCCUAUGUGG 0.1%

7061 2436 AAgUUUGCuGCAAUAUGCA 1.5% 7108 2443 AAcUUUAGAGCCUAUGUGG 4.1%

7062 2436 AAgUUcGCuGCAAUAUGCA 2.1% 7109 2443 AAUUUUAaAGCCUAUGUGG 0.1%

2437 2437 UUCAUGUAUUCAGAUUUUC 95.2% 7110 2443 AAUUUUAGAGCCUAUGUaG 0.2%

7063 2437 UUCAUGUAUUCgGAUUUUC 1.9% 7111 2443 AuUUUUAGAGCCUAUGUGG 0.1%

7064 2437 UUuAUGUAUUCAGAUUUUC 0.1% 7112 2443 AAcUUUAGAaCCUAUGUGG 0.6%

7065 2437 cUCAUGUAUUCAGAUUUUC 0.1% 7113 2443 AccUUUAGAGCOJAUGUGG 0.1%

7066 2437 UUCAUGUAcUCAGAUUUUC 0.1% 2444 2444 AUUUUAGAGCCUAUGUGGA 94.7%

7067 2437 UUCAUGUAUUCAGACUUUC 0.1% 7114 2444 gUUUUAGAGCCUAUGUGGA 0 , 1%

7063 2437 UUCAUGUAUUCAGAUUUCC 0,1% 7115 2444 AcUUUAGAGCCUAUGUGGA 4.1%

7063 2437 UUCAUGUAUUCgGAUUUcC 0.1% 7116 2444 AUUUUAaAGCCUAUGUGGA 0 , 1%

7070 2437 UUCAUGUAUUCAGACUUCC 2.2% 7117 2444 AUUUUAGAGCCUAUGUaGA 0 , 2%

2433 2438 AAUUUGCAGCAAUAUGCAC 95,1% 7118 2444 uUUUUAGAGCCUAUGUGGA 0.1%

7071 2438 AAUUUGCAGCAAUAUGCAU 0.1% 7119 2444 AcUUUAGAaCCUAUGUGGA 0.6%

7072 2438 AAUUUGCAGCAAUAUGUAC 0,2% 7120 2444 ccUUUAGAGCCUAUGUGGA 0 , 1%

7073 2438 AAUUUGCgGCAAUAUGCAC 0.3% 2445 2445 GUAUUCAGAUUUUCAUUUC 94,6%

7074 2438 AAUUcGCAGCAAUAUGCAC 0.1% 7121 2445 GUAUUCgGAUUUUCAUUUC 0.1%

7075 2438 AAUUUGCuGCAAUAUGCAC 0,3% 7122 2445 GUAcUCAGAUUUUCAUUUC 0.1%

7076 2438 AAUUUGCuGCAAUAUGuAC 0.1% 7123 2445 GUAUUCAGAcUUUCAUUUC 0.1%

7077 2438 AgUUUGCuGCAAUAUGCAC 1.5% 7124 2445 GUAUUCAGAUUUUCAcUUC 0.9%

7073 2438 AgUUcGCuGCAAUAUGCAC 2.1% 7125 2445 GUAUUCgGAUUUUCAcUUu 1 , 8%

2439 2439 ACAAACAAAUUUGCAGCAA 95.0% 7126 2445 GUAUUCAGAcUUcCAUUUC 2 , 1%

7079 2439 ACAAACAAAUUUGCgGCAA 0.3% 7127 2445 GUAUUCAGAUUUCCACUUC 0 , 1%

70S0 2439. ACgAACAAAUUUGCAGCAA 0.3% 7128 2445 GUAUUCAGAcUUcCAUUUu 0 , 1%

7081 2439 ACAAACAAAUUCGCAGCAA 0.1% 7129 2445 GUAUUCgGAUUUcCAcUUC 0.1%

7082 2439 ACAAACAAAUUUGCUGCAA 0.1% 2446 2446 GCAUCCUGUGCAGCAAUGG 94.6%

7083 2439 ACuAACAAAUUUGCAGCAA 0.1% 7130 2446 GCAUCCUGUGCAGCgAUGG 0.2%

7084 2439 ACgAACAAAUUUGCuGCAA 0.3% 7131 2446 GCAUCCUGUGUAGCAAUGG 0.1%

7085 2439 ACgAACAAgUUUGCuGCAA 1.5% 7132 2446 GCAUCuUGUGCAGCAAUGG 1.1%

7086 2439 ACAAACAAgUUcGCuGCAA 1.8% 7133 2446 GCgUCCUGUGCAGCAAUGG 0.2%

2440 2440 CAAACAAAUUUGCAGCAAU 95.0% 7134 2446 GCAUCCUGUGCAGCCAUGG 1.7%

7087 2440 CAAACAAAUUUGCgGCAAU 0.3% 7135 2446 GCAUCuUGUGCAGCcAUGG 2.0%

7083 2440 CgAACAAAUUUGCAGCAAU 0.3% 7136 2446 GCAUCuUGUGCAGuuAUGG 0.1%

7089 2440 CAAACAAAUUCGCAGCAAU 0,1% 2447 2447 AUGCAUCCUGUGCAGCAAU 94.6%

7090 2440 CAAACAAAUUUGCUGCAAU 0.1% 7137 2447 AUGCAUCCUGUGCAGCgAU 0.2%

7091 2440 CuAACAAAUUUGCAGCAAU 0.1% 7138 2447 AUGCAUCCUGUGUAGCAAU 0.1%

7092 2440 CgAACAAAUUUGCuGCAAU 0.3% 7139 2447 AUGCAUCuUGUGCAGCAAU 1.1%

7093 2440 CgAACAAgUUUGCuGCAAU 1.5% 7140 2447 AUGCgUCCUGUGCAGCAAU 0 , 2%

7094 2440 CAAACAAgUUcGCuGCAAU 1.8% 7141 2447 AUGCAUCCUGUGCAGCCAU 1.7%

2441 2441 AAACAAAUUUGCAGCAAUA 95,0% 7142 2447 AUGCAUCuUGUGCAGCcAU 2.0%

7095 2441 AAACAAAUUUGCgGCAAUA 0.3% 7143 2447 AUGCAUCuUGUGCAGuuAU 0.1%

7096 2441 gAACAAAUUUGCAGCAAUA 0.3% 2448 2448 UAUUCAGAUUUUCAUUUCA 94.6%

7097 2441 AAACAAAUUcGCAGCAAUA 0.1% 7144 2448 UAUUCgGAUUUUCAUUUCA 0.1%

7098 2441 AAACAAAUUUGCUGCAAUA 0.1% 7145 2448 UAcUCAGAUUUUCAUUUCA 0.1%

7099 2441 uAACAAAUUUGCAGCAAUA 0.1% 7146 2448 UAUUCAGAcUUUCAUUUCA 0 , 1%

7100 2441 gAACAAAUUUGCuGCAAUA 0.3% 7147 2448 UAUUCAGAUUUUCACUUCA 0.9%

7101 2441 gAACAAgUUUGCuGCAAUA 1,5% 7148 2448 UAUUCgGAUUUUCAcUUuA 1.5%

7102 2441 AAACAAgUUcGCuGCAAUA 1.8% 7149 2448 UAUUCAGAcUUcCAUUUCA 2.1%

2442 2442 UUUUAGAGCCUAUGUGGAU 94.9% 7150 2448 UAUUCAGAUUUCCACUUCA 0 , 1%

7103 2442 cUUUAGAGCCUAUGUGGAU 4.2% 7151 2448 UAUUCAGAcUUcCAUUUuA 0.1%

7104 2442 UUUUAaAGCCUAUGUGGAU 0.1% 7152 2448 UAUUCgGAUUUcCAcUUCA 0.1%

7105 2442 UUUUAGAGCCUAUGUaGAU 0.2% 2449 2449 GAAACAAACAAAUUUGCAG 94.6%

7106 2442 cUUUAGAaCCUAUGUGGAU 0.6% 7153 2449 GAAACAAACAAAUUUGCgG 0.3%

2443 2443 AAUUUUAGAGCCUAUGUGG 94.7% 7154 2449 GAAACgAACAAAUUUGCAG 0.3%

7155 2449 GAgACAAACAAAUUUGCAG 0.4% 7203 2455 AcUCAGAUUUUCAUUUCAU 0 , 1%

7156 2449 GAAACAAACAAAUUCGCAG 0.1% 7204 2455 AUUCAGAcUUUCAUUUCAU 0.1%

7157 2449 GAAACAAACAAAUUUGCUG 0.1% 7205 2455 AUUCAGAUUUUCACUUCAU 0.9%

7153 2449 GAAACuAACAAAUUUGCAG 0,1% 7206 2455 AUUCgGAUUUUCAcUUuAU 1.5%

7159 2449 GAAACgAACAAAUUUGCuG 0.3% 7207 2455 AUUCAGAcUUcCAUUUCAU 2.1%

7160 2449 GAAACgAACAAgUUUGCuG 1.5% 7208 2455 AUUCAGAUUUcCAcUUCAU 0.1%

7161 2449 GAAACAAACAAgUUcGCuG 1.8% 7209 2455 AUUCAGAcUUcCAUUUuAU 0.1%

2450 2450 UGUAUUCAGAUUUUCAUUU 94.6% 7210 2455 AUUCgGAUUUcCAcUUCAU 0 , 1%

7162 2450 UGUAUUCgGAUUUUCAUUU 0,1% 2456 2456 AUGUAUUCAGAUUUUCAUU 94.6%

7163 2450 UGUAcUCAGAUUUUCAUUU 0.1% 7211 2456 AUGUAUUCgGAUUUUCAUU 0.1%

7164 2450 UGUAUUCAGACUUUCAUUU 0.1% 7212 2456 AUGUAcUCAGAUUUUCAUU 0.1%

7165 2450 UGUAUUCAGAUUUUCACUU 0.9% 7213 2456 AUGUAUUCAGACUUUCAUU 0.1%

7166 2450 UGUAUUCgGAUUUUCAcUU 1,8% 7214 2456 AUGUAUUCAGAUUUUCAcU 0 , 9%

7167 2450 UGUAUUCAGAcUUcCAUUU 2,2% 7215 2456 AUGUAUUCgGAUUUUCAcU 1.8%

7168 2450 UGUAUUCAGAUUUCCACUU 0,1% 7216 2456 AUGUAUUCAGACUUCCAUU 2.2%

7169 2450 UGUAUUCgGAUUUcCAcUU 0.1% 7217 2456 AUGUAUUCAGAUUUCCACU 0 , 1%

2451 2451 AAUGCAUCCUGUGCAGCAA 94.6% 7218 2456 AUGUAUUCgGAUUUcCAcU 0.1%

7170 2451 AAUGCAUCCUGUGCAGCgA 0.2% 2457 2457 UGCAUCCUGUGCAGCAAUG 9A.6%

7171 2451 MUGCAUCCUGUGUAGCAA 0.1% 7219 2457 UGCAUCCUGUGCAGCgAUG 0 , 2%

7172 2451 AAUGCAUCuUGUGCAGCAA 1.1% 7220 2457 UGCAUCCUGUGUAGCAAUG 0 , 1%

7173 2451 AAUGCgUCCUGUGCAGCAA 0.2% 7221 2457 UGCAUCuUGUGCAGCAAUG 1 , 1%

7174 2451 AAUGCAUCCUGUGCAGCCA 1,7% 7222 2457 UGCgUCCUGUGCAGCAAUG 0.2%

7175 2451 AAUGCAUCuUGUGCAGCcA 2,0% 7223 2457 UGCAUCCUGUGCAGCCAUG 1 , 7%

7176 2451 AAUGCAUCuUGUGCAGuuA 0.1% 7224 2457 UGCAUCuUGUGCAGCcAUG 2 , 0%

2452 2452 AACAAACAAAUUUGCAGCA 94.6% 7225 2457 UGCAUCuUGUGCAGuuAUG 0.1%

7177 2452 AACAAACAAAUUUGCgGCA 0,3% 2458 2458 CAUGUAUUCAGAUUUUCAU 94,5%

7178 2452 AACgAACAAAUUUGCAGCA 0,3% 7226 2458 CAUGUAUUCgGAUUUUCAU 0.1%

7179 2452 gACAAACAAAUUUGCAGCA 0,4% 7227 2458 uAUGUAUUCAGAUUUUCAU 0 , 1%

7180 2452 AACAAACAAAUUCGCAGCA 0,1% 7228 2458 CAUGUAcUCAGAUUUUCAU 0.1%

7181 2452 AACAAACAAAUUUGCUGCA 0.1% 7229 2458 CAUGUAUUCAGAcUUUCAU 0 , 1%

7182 2452 AACuAACAAAUUUGCAGCA 0.1% 7230 2458 CAUGUAUUCAGAUUUUCAc 0.9%

7183 2452 AACgAACAAAUUUGCuGCA 0.3% 7231 2458 CAUGUAUUCgGAUUUUCAc 1.8%

7184 2452 AACgAACAAgUUUGCuGCA 1.5% 7232 2458 CAUGUAUUCAGACUUCCAU 2 , 2%

7185 2452 AACAAACAAgUUcGCuGCA 1.8% 7233 2458 CAUGUAUUCAGAUUUCCAC 0 , 1%

2453 2453 AAACAAACAAAUUUGCAGC 94.6% 7234 2458 CAUGUAUUCgGAUUUcCAc 0 , 1%

7185 2453 AAACAAACAAAUUUGCgGC 0,3% 2459 2459 AGAAUUUUAGAGCCUAUGU 94.3%

7187 2453 AAACgAACAAAUUUGCAGC 0,3% 7235 2459 AGAgUUUUAGAGCCUAUGU 0 , 1%

7183 2453 AgACAAACAAAUUUGCAGC 0,4% 7236 2459 AaAAUUUUAGAGCCUAUGU 0 , 5%

7189 2453 AAACAAACAAAUUCGCAGC 0.1% 7237 2459 AGAAcUUUAGAGCCUAUGU 0.5%

7190 2453 AAACAAACAAAUUUGCUGC 0,1% 7238 2459 AGAAUUUUAaAGCCUAUGU 0 , 1%

7191 2453 AAACuAACAAAUUUGCAGC 0,1% 7239 2459 AGAuUUUUAGAGCCUAUGU 0.1%

7192 2453 AAACgAACAAAUUUGCuGC 0.3% 7240 2459 AaAAcUUUAGAGCCUAUGU 3.5%

7193 2453 AAACgAACAAgUUUGCuGC 1.5% 2460 2460 GAGAAUUUUAGAGCCUAUG 94.2%

7194 2453 AAACAAACAAgUUcGCuGC 1.8% 7241 2460 GAGAgUUUUAGAGCCUAUG 0 , 1%

2454 2454 CAUCCUGUGCAGCAAUGGA 94,6% 7242 2460 aAGAAUUUUAGAGCCUAUG 0.1%

7195 2454 CAUCCUGUGCAGCgAUGGA 0,2% 7243 2460 GAaAAUUUUAGAGCCUAUG 0.5%

7196 2454 CAUCCUGUGuAGCAAUGGA 0.1% 7244 2460 GAGAAcUUUAGAGCCUAUG 0 , 5%

7197 2454 CAUCuUGUGCAGCAAUGGA 1,1% 7245 2460 GAGAAUUUUAaAGCCUAUG 0.1%

7193 2454 CgUCCUGUGCAGCAAUGGA 0.2% 7246 2460 GAGAuUtIUUAGAGCCUAUG 0 , 1%

7199 2454 CAUCCUGUGCAGCCAUGGA 1.7% 7247 2460 GAaAAcUUUAGAGCCUAUG 3.5%

7200 2454 CAUCuUGUGCAGCcAUGGA 2.0% 2461 2461 GAUUUUCAUUUCAUCAAUG 94,2%

7201 2454 CAUCuUGUGCAGuuAUGGA 0.1% 7248 2461 GAUUUUCAUUUCAUUAAUG 0.5%

2455 2455 AUUCAGAUUUUCAUUUCAU 94.6% 7249 2461 GAcUUUCAUUUCAUCAAUG 0.1%

7202 2455 AUUCgGAUUUUCAUUUCAU 0.1% 7250 2461 GAUUUUCAcUUCAUCAAUG 0.7%

7251 2461 GAUUUUCAcUUCAUCgAUG 0,1% 7298 2468 GUUCAGGCUCUcAGGGACA 0.3%

7252 2461 GAUUUcCAcUUCAUCAAUG 0.2% 7299 2468 GUUCAGGCUCUUAaGGACA 0.2%

2462 2462 UGAGAAUUUUAGAGCCUAU 94.2% 7300 2468 GUUCAGGCUCUUAGaGACA 0.2%

7253 2462 UGAGAgUUUUAGAGCCUAU 0.1% 7301 2468 GUUCAGGCUCUUCGGGACA 0.3%

7254 2462 UaAGAAUUUUAGAGCCUAU 0.1% 7302 2468 GcUCAGGCaCUUAGGGACA 1.8%

7255 2462 UGAaAAUUUUAGAGCCUAU 0.5% 7303 2468 GUUCAaGCaCUUAGGGACA 2.3%

7256 2462 UGAGAAcUUUAGAGCCUAU 0.5% 7304 2468 GUUCAGGCUCUUAaGGAaA 0 , 1%

7257 2462 UGAGAAUUUUAaAGCCUAU 0.1% 2469 2469 UUCAGGCUCUUAGGGACAA 94.1%

7258 2462 UGAGAuUUUUAGAGCOJAU 0.1% 7305 2469 UUCAGGCUCUUAGGGACAg 0.1%

7259 2462 UGAaAAcUUUAGAGCCUAU 3.5% 7306 2469 UUCAGGCUCUUAGGGAUAA 0 , 1%

2463 2463 UUGAGAAUUUUAGAGCCUA 94.2% 7307 2469 UUCAGGCUuUUAGGGACAA 0.1%

7260 2463 UUGAGAgUUUUAGAGCCUA 0,1% 7308 2469 UcCAGGCUCUUAGGGACAA 0.1%

7261 2463 UUaAGAAUUUUAGAGCCUA 0,1% 7309 2469 UUCAGGCcCUUAGGGACAA 0.1%

7262 2463 UUGAaAAUUUUAGAGCCUA 0,5% 7310 2469 UUCAGGCUCUcAGGGACAA 0 , 3%

7263 2463 UUGAGAAcUUUAGAGCCUA 0,5% 7311 2469 UUCAGGCUCUUAaGGACAA 0.2%

7264 2463 UUGAGAAUUUUAaAGCCUA 0.1% 7312 2469 UUCAGGCUCUUAGaGACAA 0 , 2%

7265 2463 UUGAGAuUUUUAGAGCCUA 0.1% 7313 2469 UUCAGGCUCUUcGGGACAA 0 , 3%

7266 2463 UUGAaAAcUUUAGAGCCUA 3,5% 7314 2469 cUCAGGCaCUUAGGGACAA 1.8%

2464 2464 AUUUUCAUUUCAUCAAUGA 94.2% 7315 2469 UUCAaGCaCUUAGGGACAA 2 , 3%

7267 2464 AUUUUCAUUUCAUUAAUGA 0.5% 7316 2469 UUCAGGCUCUUAaGGAaAA 0.1%

7268 2464 AcUUUCAUUUCAUCAAUGA 0.1% 2470 2470 AGAUUUUCAUUUCAUCAAU 94.1%

7269 2464 AUUUUCAcUUCAUCAAUGA 0,7% 7317 2470 AGAUUUUCAUUUCAUUAAU 0.5%

7270 2464 AUUUUCAcUUCAUCgAUGA 0,1% 7318 2470 gGAUUUUCAUUUCAUCAAU 0 , 1%

7271 2464 AUUUcCAcUUCAUCAAUGA 0,2% 7319 2470 AGAcUUUCAUUUCAUCAAU 0 , 1%

2465 2465 UCAGAUUUUCAUUUCAUCA 94,1% 7320 2470 AGAUUUUCAcUUCAUCAAU 0 , 7%

7272 2465 UCAGAUUUUCAUUUCAUUA 0,5% 7321 2470 AGAUUUUCAcUUCAUCgAU 0.1%

7273 2465 UCgGAUUUUCAUUUCAUCA 0,1% 7322 2470 AGAUUUcCAcUUCAUCAAU 0.1%

7274 2465 UCAGAcUUUCAUUUCAUCA 0,1% 7323 2470 gGAUUUcCAcUUCAUCAAU 0.1%

7275 2465 UCAGAUUUUCACUUCAUCA 0,7% 2471 2471 UUCAGAUUUUCAUUUCAUC 94.0%

7276 2465 UCAGAUUUUCAcUUCAUCg 0,1% 7324 2471 UUCAGAUUUUCAUUUCAUu 0.5%

7277 2465 UCAGAUUUcCAcUUCAUCA 0,1% 7325 2471 UUCgGAUUUUCAUUUCAUC 0.1%

7278 2465 UCgGAUUUcCAcUUCAUCA 0,1% 7326 2471 cUCAGAUUUUCAUUUCAUC 0.1%

2466 2466 GAAUUUUAGAGCCUAUGUG 94,1% 7327 2471 UUCAGAcUUUCAUUUCAUC 0.1%

7279 2466 GAgUUUUAGAGCCUAUGUG 0.1% 7328 2471 UUCAGAUUUUCACUUCAUC 0.9%

7280 2466 aAAUUUUAGAGCCUAUGUG 0.5% 7329 2471 UUCAGAUUUcCAcUUCAUC 0 , 1%

7261 2466 GAAcUUUAGAGCCUAUGUG 0.5% 7330 2471 UUCAGAcUUcCAUUUCAUu 2.1%

7282 2466 GAAUUUUAaAGCCUAUGUG 0,1% 7331 2471 UUCgGAUUUcCAcUUCAUC 0 , 1%

7283 2466 GAAUUUUAGAGCCUAUGUa 0,2% 2472 2472 AGAGGCGAAGAAACAAUUG 93,7%

7284 2466 GAuUUUUAGAGCCUAUGUG 0.1% 7332 2472 AGAGGCGAAGAgACAAUUG 5.3%

7285 2466 aAAcUUUAGAGCCUAUGUG 3,5% 7333 2472 AGAGGCGAgGAAA-CAAUUG 0 , 2%

2467 2467 CAGAUUUUCAUUUCAUCAA 94.1% 7334 2472 AGAGGCGAAaAAACAAUUG 0 , 1%

7286 2467 CAGAUUUUCAUUUCAUUAA 0,5% 7335 2472 AGAGGCGAAGAAACAAUCG 0.2%

7287 2467 CgGAUUUUCAUUUCAUCAA 0.1% 7336 2472 AGAGGCGAAGAAACnAUUG 0.1%

7283 2467 CAGAcUUUCAUUUCAUCAA 0,1% 7337 2472 AGAGGaGAAGAgACAAUUG 0.3%

7289 2467 CAGAUUUUCACUUCAUCAA 0.7% 2473 2473 GAGGCGAAGAAACAAUUGA 93,7%

7290 2467 CAGAUUUUCAcUUCAUCgA 0.1% 7338 2473 GAGGCGAAGAgACAAUUGA 5.3%

7291 2467 CAGAUUUcCAcUUCAUCAA 0.1% 7339 2473 GAGGCGAgGAAACAAUUGA 0 , 2%

7292 2467 CgGAUUUcCAcUUCAUCAA 0,1% 7340 2473 GAGGCGAAaAAACAAUUGA 0 , 1%

246S 2468 GUUCAGGCUCUUAGGGACA 94.1% 7341 2473 GAGGCGAAGAAACλAUcGA 0 , 2%

7293 2468 GUUCAGGCUCUUAGGGAUA 0,1% 7342 2473 GAGGCGAAGAAACnAUUGA 0 , 1%

7294 2468 GUUCAGGCUuUUAGGGACA 0,1% 7343 2473 GAGGaGAAGAgACAAUUGA 0 , 3%

7295 2468 aUUCAGGCUCUUAGGGACA 0,1% 2474 2474 GGOGAAGAAACAAWGAAG 93.5%

7296 2468 GUcCAGGCUCUUAGGGACA 0,1% 7344 2474 GGCGAAGAAACAAUUGAgG 0.2%

7297 2468 GUUCAGGCcCUUAGGGACA 0,1% 7345 2474 GGCGAAGAgACAAUUGAAG 5.3%

7346 2474 GGCGAgGAAACAAUUGAAG 0.2% 7393 2481 UGAAAUGGGGgAUGGAaAU 2,2%

7347 2474 GGCGAAaAAACAAUUGAAG 0,1% 7394 2481 UGAAAUGGGGgAUGGAuAU 0,2%

7343 2474 GGCGAAGAAACAAUCGAAG 0.2% 7395 2481 UGAAAUGGGGAAUaGAGcU 0.1%

7349 2474 GGCGAAGAAACnAUUGAAG 0.1% 7396 2481 UGAAAUGGGGcAUGGAaAU 1.8%

7350 2474 GGaGAAGAgACAAUUGAAG 0.3% 7397 2481 UGAAgUGGGGuAUGGAaAU 0.1%

2475 2475 CGAAGAAACAAUUGAAGAA 93.5% 2482 2482 AAUGGGGAAUGGAGAUGAG 93.3%

7351 2475 CGAAGAAACAAUUGAgGAA 0.2% 7398 2482 AAUGGGGgAUGGAGAUGAG 1,2%

735.] 2475 CGAAGAgACAAUUGAAGAA 5.3% 7399 2482 AAUGGGGAAUGGAaAUGAG 0.6%

7353 2475 CGAgGAAACAAUUGAAGAA 0.2% 7400 2482 AAUGGGGAAUGGAGCUGAG 0,4%

7354 2475 CGAAaAAACAAUUGAAGAA 0.1% 7401 2482 AAUGGGGgAUGGAaAUGAG 2.2%

7355 2475 CGAAGAAACAAUCGAAGAA 0.2% 7402 2482 AAUGGGGgAUGGAuAUGAG 0.2%

7356 2475 CGAAGAAACnAUUGAAGAA 0.1% 7403 2482 AAUGGGGAAUaGAGcUGAG 0,1%

7357 2475 aGAAGAgACAAUUGAAGAA 0.3% 7404 2482 AAUGGGGcAUGGAaAUGAG 1,8%

2476 2476 GCGAAGAAACAAUUGAAGA 93,5% 7405 2482 AgUGGGGuAUGGAaAUGAG 0,1%

7353 2476 GCGAAGAAACAAUUGAgGA 0.2% 2483 2483 CACAUUUUCUCAUUCACUG 93,3%

735^ 2476 GCGAAGAgACAAUUGAAGA 5.3% 7406 2483 CACAUUUUCUCgUUCACUG 0,7%

736U 2476 GCGAgGAAACAAUUGAAGA 0.2% 7407 2483 CACAUUUUuUCAUUCACUG 1,1%

7361 2476 GCGAAaAAACAAUUGAAGA 0.1% 7408 2483 CACAUUUUCUCgUUuACUG 0.1%

7362 2476 GCGAAGAAACAAUCGAAGA 0.2% 7409 2483 CACAUcUUCUCAUUCACUG 2,3%

7363 2476 GCGAAGAAACnAUUGAAGA 0.1% 7410 2483 CACAUUUUCUCAUUCACCG 0,1%

7364 2476 GaGAAGAgACAAUUGAAGA 0.3% 7411 2483 CACAUUUUCUCAUUCACUa 0 , 1%

2477 2477 AGGCGAAGAAACAAUUGAA 93.5% 7412 2483 CACAUUUUCUCCUUCACUG 0,1%

7365 2477 AGGCGAAGAAACAAUUGAg 0.2% 7413 2483 CACAUcUUuUCAUUCACUG 0,3%

7366 2477 AGGCGAAGAgACAAUUGAA 5.3% 7414 2483 CACAUaUUCUCAUUCACaG 1.7%

7367 2477 AGGCGAgGAAACAAUUGAA 0.2% 7415 2483 CACAUaUUCUCAUUCACgG 0,1%

7363 2477 AGGOGAAaAAACAAUUGAA 0.1% 2484 2484 AAAUGGGGAAUGGAGAUGA 93,3%

7369 2477 AGGCGAAGAAACAAUCGAA 0.2% 7416 2484 AAAUGGGGgAUGGAGAUGA 1.2%

737U 2477 AGGCGAAGAAACnAUUGAA 0.1% 7417 2484 AAAUGGGGAAUGGAaAUGA 0,6%

7371 2477 AGGaGAAGAgACAAUUGAA 0.3% 7418 2484 AAAUGGGGAAUGGAGCUGA 0.4%

2473 2478 GAAGAAACAAUUGAAGAAA 93.5% 7419 2484 AAAUGGGGgAUGGAaAUGA 2.2%

7372 2478 GAAGAAACAAUUGAgGAAA 0.2% 7420 2484 AAAUGGGGgAUGGAuAUGA 0.2%

7373 2478 GAAGAgACAAUUGAAGAAA 5.7% 7421 2484 AAAUGGGGAAUaGAGcUGA 0,1%

7374 2478 GAgGAAACAAUUGAAGAAA 0.2% 7422 2484 AAAUGGGGcAUGGAaAUGA 1.8%

7375 2478 GAAaAAACAAUUGAAGAAA 0.1% 7423 2484 AAgUGGGGuAUGGAaAUGA 0,1%

7376 2478 GAAGAAACAAUCGAAGAAA 0.2% 2485 2485 GAAAUGGGGAAUGGAGAUG 93,3%

7377 2478 GAAGAAACnAUUGAAGAAA 0.1% 7424 2485 GAAAUGGGGgAUGGAGAUG 1,2%

2479 2479 AAAUGAAAUGGGGAAUGGA 93,4% 7425 2485 GAAAUGGGGAAUGGAaAUG 0.6%

7373 2479 AAAUGAAAUGGGGgAUGGA 1.4% 7426 2485 GAAAUGGGGAAUGGAGCUG 0.4%

737'-1 2479 AgAUGAAAUGGGGAAUGGA 1,0% 7427 2485 GAAAUGGGGgAUGGAaAUG 2,2%

738U 2479 AgAUGAAAUGGGGgAUGGA 2.2% 7428 2485 GAAAUGGGGgAUGGAuAUG 0,2%

738L 2479 AAAUGAAAUGGGGAAUaGA 0,1% 7429 2485 GAAAUGGGGAAUaGAGcUG 0,1%

7382 2479 AAAUGAAAUGGGGCAUGGA 1.8% 7430 2485 GAAAUGGGGcAUGGAaAUG 1.8%

7383 2479 AAAUGAAgUGGGGuAUGGA 0.1% 7431 2485 GAAgUGGGGuAUGGAaAUG 0,1%

2480 2480 AAAAUGAAAUGGGGAAUGG 93,4% 2486 2486 AUGAAAUGGGGAAUGGAGA 93,3%

7384 2480 AAAAUGAAAUGGGGgAUGG 1,4% 7432 2486 AUGAAAUGGGGgAUGGAGA 1,2%

7385 2480 AAgAUGAAAUGGGGAAUGG 1,0% 7433 2486 AUGAAAUGGGGAAUGGAaA 0,6%

7386 2480 AAgAUGAAAUGGGGgAUGG 2,2% 7434 2486 AUGAAAUGGGGAAUGGAGc 0,4%

7387 2480 AAAAUGAAAUGGGGAAUaG 0.1% 7435 2486 AUGAAAUGGGGgAUGGAaA 2,2%

7383 2480 AAAAUGAAAUGGGGCAUGG 1.8% 7436 2486 AUGAAAUGGGGgAUGGAuA 0,2%

7380 2430 AAAAUGAAgUGGGGuAUGG 0.1% 7437 2486 AUGAAAUGGGGAAUaGAGc 0.1%

2481 2481 UGAAAUGGGGAAUGGAGAU 93,3% 7438 2486 AUGAAAUGGGGcAUGGAaA 1.8%

7390 2431 UGAAAUGGGGgAUGGAGAU 1,2% 7439 2486 AUGAAgUGGGGuAUGGAaA 0,1%

7391 2481 UGAAAUGGGGAAUGGAaAU 0.6% 2487 2487 ACAUUUUCUCAUUCACUGG 93,2%

7392 2481 UGAAAUGGGGAAUGGAGCU 0.4% 7440 2487 ACAUUUUCUCgUUCACUGG 0.7%

7441 2487 ACAUUUUuUCλUUCACUGG 1.1% 7489 2493 GUgAACUUUGUGAGuAUGG ?η 2%

7442 2437 ACAUUUUCUCgUUuACUGG 0,1% 7490 2493 GUAAACUUUaUGAGCAUGG d , 1%

7443 2487 ACAUcUUCUCAUUCACUGG 2,3% 7491 2493 GUAAAuUUUGUaAGuAUGG 0.2%

7444 2487 ACAUUUUCUCAUUCACCGG 0,1% 2494 2494 CAGUCCGAAAGAGGCGAAG 92.5%

7445 2487 ACAUUUUCUCAUUCACUaG 0.1% 7492 2494 CAGUCCGAAAGAGGCGAgG 0,2%

7446 2487 ACAUUUUCUCAUUCACUGa 0.1% 7493 2494 CAGUCCGAgAGAGGCGAAG 3.7%

7447 2487 ACAUUUUOJCcUUCACUGG 0,1% 7494 2494 CAGUCuGAAAGAGGCGAAG 0,2%

7443 2487 ACAUcUUuUCAUUCACUGG 0.3% 7495 2494 CAaUCCGAAAGAGGCGAAG 1,0%

7449 2487 ACAUaUUCUCAUUCACaGG 1.7% 7496 2494 CAGUCaGAAAGAGGCGAAG 0.1%

7450 2487 ACAUaUUCUCAUUCACgGG 0.1% 7497 2494 CAGUCCGAAAGAGGCGAAa 0.1%

2483 2488 AAAGAGGCGAAGAAACAAU 92.8% 7498 2494 CuGUCCGAAAGAGGCGAAG 0.1%

7451 2488 AAAGAGGCGAAGAgACAAU 1.2% 7499 2494 CAaUCCGAgAGAGGCGAAG 1,7%

7452 2488 AAAGAGGCGAgGAAACAAU 0.2% 7500 2494 CAGUCCGAgAGAGGaGAAG 0,3%

7453 2488 AgAGAGGCGAAGAAACAAU 1.2% 2495 2495 AGUCCGAAAGAGGCGAAGA 92,5%

7454 2488 AgAGAGGCGAAGAgACAAU 4.2% 7501 2495 AGUCCGAAAGAGGCGAgGA 0.2%

7455 2488 AAAGAGGCGAAaAAACAAU 0.1% 7502 2495 AGUCCGAgAGAGGCGAAGA 3,7%

7456 2488 AgAGAGGCGAAGAAACnAU 0,1% 7503 2495 AGUCuGAAAGAGGCGAAGA 0.2%

7457 2488 AgAGAGGaGAAGAgACAAU 0,3% 7504 2495 AaUCCGAAAGAGGCGAAGA 1.0%

248y 2489 GAAAGAGGCGAAGAAACAA 92,8% 7505 2495 AGUCaGAAAGAGGCGAAGA 0,1%

7453 2489 GAAAGAGGCGAAGAgACAA 1,2% 7506 2495 AGUCCGAAAGAGGCGAAaA 0.1%

7453 2489 GAAAGAGGCGAgGAAACAA 0.2% 7507 2495 uGUCCGAAAGAGGCGAAGA 0.1%

7460 2489 GAgAGAGGCGAAGAAACAA 1.2% 7508 2495 AaUCCGAgAGAGGCGAAGA 1,7%

7461 2489 GAgAGAGGCGAAGAgACAA 4.2% 7509 2495 AGUCCGAgAGAGGaGAAGA 0,3%

7462 2489 GAAAGAGGCGAAaAAACAA 0.1% 2496 2496 CGAAAGAGGCGAAGAAACA 92,4%

7463 2489 GAgAGAGGCGAAGAAACnA 0.1% 7510 2496 CGAAAGAGGCGAAGAgACA 1,2%

7464 2489 GAgAGAGGaGAAGAgACAA 0.3% 7511 2496 CGAAAGAGGCGAgGAAACA 0.2%

2490 2490 AAUGAAAUGGGGAAUGGAG 92.8% 7512 2496 CGAgAGAGGCGAAGAAACA 1.2%

7465 2490 AAUGAAAUGGGGgAUGGAG 1.2% 7513 2496 uGAAAGAGGCGAAGAAACA 0.2%

7466 2490 gAUGAAAUGGGGAAUGGAG 1.0% 7514 2496 CGAgAGAGGCGAAGAgACA 4,2%

7467 2490 AAUGAAAUGGGGAAUaGAG 0.1% 7515 2496 aGAAAGAGGCGAAGAAACA 0.1%

746; 2490 AAUGAAAUGGGGAAUGGAa 0.6% 7516 2496 CGAAAGAGGCGAAaAAACA 0,1%

7469 2490 AAUGAAAUGGGGgAUGGAa 0.2% 7517 2496 CGAgAGAGGCGAAGAAACn 0,1%

7470 2490 gAUGAAAUGGGGgAUGGAa 2.0% 7518 2496 CGAgAGAGGaGAAGAgACA 0,3%

7471 2490 gAUGAAAUGGGGgAUGGAu 0.2% 2497 2497 UGGUAAACUUUGUGAGCAU 92,4%

7472 2490 AAUGAAAUGGGGcAUGGAa 1.8% 7519 2497 UGGUAAAuUUUGUGAGCAU 3,0%

7473 2490 AAUGAAgUGGGGuAUGGAa 0.1% 7520 2497 UGGUAAAuUUUGUGAGuAU 1.7%

2491 2491 UAAACUUUGUGAGCAUGGA 92,5% 7521 2497 UGGUgAACUUUGUGAGuAU 2.2%

7471 2491 UAAAuUUUGUGAGCAUGGA 3,2% 7522 2497 UGGUAAACUUUaUGAGCAU 0,1%

7475 2491 UAAAuUUUGUGAGuAUGGA 1,7% 7523 2497 UuGUAAACUUUGUGAGCAU 0.1%

747h 2491 UgAACUUUGUGAGuAUGGA 2.2% 7524 2497 UaGUAAAuUUUGUGAGCAU 0.2%

7477 24Jl UAAACUUUaUGAGCAUGGA 0.1% 7525 2497 UGGUAAAuUUUGUaAGuAU 0.2%

747d 2491 UAAAuUUUGUaAGuAUGGA 0.2% 2498 2498 GUGGUAAACUUUGUGAGCA 92.4%

2492 2492 AAGAGGCGAAGAAACAAUU 92.5% 7526 2498 GUGGUAAAuUUUGUGAGCA 3.0%

747 1 J 2492 AAGAGGCGAAGAgACAAUU 1.2% 7527 2498 GUGGUAAAuUUUGUGAGuA 1.7%

7480 2492 AAGAGGCGAgGAAACAAUU 0.2% 7528 2498 GUGGUgAACUUUGUGAGuA 2,2%

7481 2492 gAGAGGCGAAGAAACAAUU 1.2% 7529 2498 GUGGUAAACUUUaUGAGCA 0,1%

748.1 2492 gAGAGGCGAAGAgACAAUU 4.2% 7530 2498 GUuGUAAACUUUGUGAGCA 0.1%

7483 2492 AAGAGGCGAAaAAACAAUU 0.1% 7531 2498 GUaGUAAAuUUUGUGAGCA 0,2%

748-J 2AJ2 AAGAGGCGAAGAAACAAUc 0.2% 7532 2498 GUGGUAAAuUUUGUaAGuA 0,2%

241.12 gAGAGGCGAAGAAACnAUU 0 , 1% 2499 2499 UUUGUGCGACAAUGCUUCA 92,4%

748υ 2492 gAGAGGaGAAGAgACAAUU 0.3% 7533 2499 UUUGUGCGACAAUGCUUUA 0.5%

2493 2493 GUAAACUUUGUGAGCAUGG 92.5% 7534 2499 tJUUGUGCGACAAUGuUUCA 2.8%

7487 2493 GUAAAuUUUGUGAGCAUGG 3.2% 7535 2499 UUUGUGCGACAgUGCUUCA 1.2%

7488 2493 GUAAAuUUUGUGAGuAUGG 1.7% 7536 2499 UUUGUGCGgCAAUGCUUCA 0,2%

7537 2499 UgUGUGCEACAAUGCUUCA 0.1% 7586 2504 CGcCAGUCCGAAAGAGGCG 0 , 2%

753;; 2499 UUcGUGCGACAAUGCUUCA 0.1% 7587 2504 CGUCAaUCCGAAAGAGGCG 1.0%

753:-) 2499 UUUGUGaGACAAUGCUUCA 0.2% 7588 2504 CGUCAGUCaGAAAGAGGCG 0.1%

7540 2499 UUUGUGCGACAAUGCUUCu 0.1% 7589 2504 CGUCuGUCCGAAAGAGGCG 0.1%

7541 2499 UUUGUGgGACAAUGCUUCA 0.1% 7590 2504 CGUCAaUCCGAgAGAGGCG 1.7%

7542 2499 UUUGUaCGACAAUGCUUuA 1.6% 7591 2504 CGUCAGUCCGAgAGAGGaG 0 , 3%

2500 2500 UCGUCAGUCCGAAAGAGGC 92.4% 2505 2505 GUCAGUCCGAAAGAGGCGA 92.4%

7543 2500 UCGUCAGUCCGAgAGAGGC 3.7% 7592 2505 GUCAGUCCGAgAGAGGCGA 3.7%

7544 2500 UCGUCAGUCuGAAAGAGGC 0.2% 7593 2505 GUCAGUCuGAAAGAGGCGA 0.2%

754 r > 2500 UCaUCAGUCCGAAAGAGGC 0.2% 7594 2505 aUCAGUCCGAAAGAGGCGA 0 , 2%

754b 2500 UCGcCAGUCCGAAAGAGGC 0.2% 7595 2505 GcCAGUCCGAAAGAGGCGA 0.2%

7547 2500 UCGUCAaUCCGAAAGAGGC 1.0% 7596 2505 GUCAaUCCGAAAGAGGCGA 1.0%

754J 2500 UCGUCAGUCaGAAAGAGGC 0.1% 7597 2505 GUCAGUCaGAAAGAGGCGA 0 , 1%

754» 2500 UCGUCuGUCCGAAAGAGGC 0.1% 7598 2505 GUCuGUCCGAAAGAGGCGA 0.1%

7550 2500 UCGUCAaUCCGAgAGAGGC 1.7% 7599 2505 GUCAaUCCGAgAGAGGCGA 1.7%

7551 2500 UCGUCAGUCCGAgAGAGGa 0.3% 7600 2505 GUCAGUCCGAgAGAGGaGA 0.3%

2501 2501 UUGUGCGACAAUGCUUCAA 92.4% 2506 2506 UCAGUCCGAAAGAGGCGAA 92.4%

7552 2501 UUGUGCGACAAUGCUUUAA 0.5% 7601 2506 UCAGUCCGAAAGAGGCGAg 0.2%

7553 2501 UUGUGCGACAAUGUUUCAA 2.8% 7602 2506 UCAGUCCGAgAGAGGCGAA 3.7%

7554 2501 UUGUGCGACAgUGCUUCAA 1.2% 7603 2506 UCAGUCuGAAAGAGGCGAA 0.2%

7555 2501 UUGUGCGgCAAUGCUUCAA 0.2% 7604 2506 cCAGUCCGAAAGAGGCGAA 0.2%

7556 2501 gUGUGCGACAAUGCUUCAA 0.1% 7605 2506 UCAaUCCGAAAGAGGCGAA 1 , 0%

7557 2501 UcGUGCGACAAUGCUUCAA 0.1% 7606 2506 UCAGUCaGAAAGAGGCGAA 0.1%

7553 2501 UUGUGaGACAAUGCUUCAA 0.2% 7607 2506 UCuGUCCGAAAGAGGCGAA 0.1%

7559 2501 UUGUGCGACAAUGCUUCUA 0.1% 7608 2506 UCAaUCCGAgAGAGGCGAA 1.7%

7560 2501 UUGUGgGACAAUGCUUCAA 0.1% 7609 2506 UCAGUCCGAgAGAGGaGAA 0.3%

7561 2501 UUGUaCGACAAUGCUUuAA 1.6% 2507 2507 GGUAAACUUUGUGAGCAUG 92.4%

2502 2502 UAAGCGAUUUGAAGCAAUA 92.4% 7610 2507 GGUAAAuUUUGUGAGCAUG 3 , 0%

7562 2502 UAAGCGAUUUGAAGCAgUA 0.4% 7611 2507 GGUAAAuUUUGUGAGuAUG 1.7%

756.( 2502 UAAGuGAUUUGAAGCAAUA 0.3% 7612 2507 GGUgAACUUUGUGAGuAUG 2.2%

7564 2502 UAgGCGAUUUGAAGCAAUA 0.3% 7613 2507 GGUAAACUUUaUGAGCAUG 0.1%

7565 2502 UgAGCGAUUUGAAGCAAUA 0.1% 7614 2507 UGUAAACUUUGUGAGCAUG 0.1%

7566 2502 UAgGuGAUUUGAAGCAAUA 0.3% 7615 2507 aGUAAAuUUUGUGAGCAUG 0.2%

7567 2502 cAAGCGAUUUGAAGCAAUA 0.2% 7616 2507 GGUAAAuUUUGUaAGuAUG 0.2%

7563 2502 UAAGCGAcUUGAAGCAAUA 0.2% 2508 2508 UCCGAAAGAGGCGAAGAAA 92,4%

756'1 2502 UAAGCGAUcUGAAGCAAUA 0.3% 7617 2508 UCCGAAAGAGGCGAAGAgA 1.2%

757H 2502 UAAGCGAUUUaAAGCAAUA 0.3% 7618 2508 UCCGAAAGAGGCGAgGAAA 0 , 2%

7571 2502 UAAGCGAUUUGAAaCAAUA 0.2% 7619 2508 UCCGAgAGAGGCGAAGAAA 1 , 3%

757.i 2 c .O2 UuAGCGAUUUGAAGCAAUA 0.2% 7620 2508 UCuGAAAGAGGCGAAGAAA 0.2%

7571 2S02 UAgGCGAcUUGAAGCAAUA 0,3% 7621 2508 UCCGAgAGAGGCGAAGAgA 4 , 2%

757-1 2^02 UcAGCGAUUUaAAGCAAUA 0.1% 7622 2508 UCaGAAAGAGGCGAAGAAA 0 , 1%

2503 2503 UUCGUCAGUCCGAAAGAGG 92.4% 7623 2508 UCCGAAAGAGGCGAAaAAA 0.1%

7575 2503 UUOGUCAGUCCGAgAGAGG 4.1% 7624 2508 UCCGAgAGAGGaGAAGAgA 0.3%

757 ( -; 2'M]3 UUCGUCAGUCUGAAAGAGG 0.2% 2509 2509 CCGAAAGAGGCGAAGAAAC 92,4%

7577 2 r .O3 UUCaUCAGUCCGAAAGAGG 0.2% 7625 2509 CCGAAAGAGGCGAAGAgAC 1.2%

757« 2 p -113 UUOGcCAGUCCGAAAGAGG 0.2% 7626 2509 CCGAAAGAGGCGAgGAAAC 0 , 2%

757- ) 2 1 O3 UUCGUCAaUCCGAAAGAGG 1.0% 7627 2509 CCGAgAGAGGCGAAGAAAC 1 , 3%

758H 2503 UUCGUCAGUCaGAAAGAGG 0.1% 7628 2509 CuGAAAGAGGCGAAGAAAC 0 , 2%

7581 25U3 UUCGUCuGUCCGAAAGAGG 0.1% 7629 2509 CCGAgAGAGGCGAAGAgAC 4 , 2%

7582 2503 UUCGUCAaUCCGAgAGAGG 1.7% 7630 2509 CaGAAAGAGGCGAAGAAAC 0 , 1%

2504 2504 CGUCAGUCCGAAAGAGGCG 92.4% 7631 2509 CCGAAAGAGGCGAAaAAAC 0 , 1%

7583 2504 CGUCAGUCCGAgAGAGGCG 3.7% 7632 2509 CCGAgAGAGGaGAAGAgAC 0.3%

7584 2504 CGUCAGUCuGAAAGAGGCG 0.2% 2510 2510 UUUCGUCAGUCCGAAAGAG 92.4%

7585 2504 CaUCAGUCCGAAAGAGGCG 0.2% 7633 2510 UUUCGUCAGUCCGAgAGAG 4.1%

7634 2 r .10 UUUCGUCAGUCUGAAAGAG 0.2% 7683 2515 AAGCGAUcUGAAGCAAUAU 0 , 3%

7635 2510 UUUCaUCAGUCCGAAAGAG 0.2% 7684 2515 AAGCGAUUUaAAGCAAUAU 0 , 3%

763S 2510 UUUCGcCAGUCCGAAAGAG 0.2% 7685 2515 AAGCGAUUUGAAaCAAUAU 0 , 2%

7637 2510 UUUCGUCAaUCCGAAAGAG 1.0% 7686 2515 AAGCGAUUUGAAGCAAUAc 0.5%

763θ 2510 UUUCGUCAGUCaGAAAGAG 0.1% 7687 2515 uAGCGAUUUGAAGCAAUAU 0 , 2%

7639 2510 UUUCGUCuGUCCGAAAGAG 0.1% 7688 2515 AgGCGAcUUGAAGCAAUAU 0.3%

7640 2510 UUUCGUCAaUCCGAgAGAG 1.7% 7689 2515 cAGCGAUUUaAAGCAAUAU 0.1%

2511 2511 UCCUUUCGUCAGUCCGAAA 92.3% 2516 2516 AAGAUUUUGUGCGACAAUG 92,0%

7641 2511 UCCUUUCGUCAGUCCGAgA 4.1% 7690 2516 AAGAUUUUGUGCGACAgUG 0 , 9%

764.: 2511 UCCUUUCGUCAGUCUGAAA 0.2% 7691 2516 AAGAUUUUGUGCGgCAAUG 0.2%

764t 2511 UCuUUUCGUCAGUCCGAAA 0.1% 7692 2516 AgGAUUUUGUGCGACAAUG 0 , 2%

7644 2 c ' ll UCCUUUCaUCAGUCCGAAA 0.2% 7693 2516 AAGAaUUUGUGCGACAAUG 0.3%

7645 2511 UCCUUUCGcCAGUCCGAAA 0,2% 7694 2516 AAGAcUUUGUGCGACAAUG 3.1%

7646 2511 UCCUUUCGUCAaUCCGAAA 1.0% 7695 2516 AAGAUUUcGUGCGACAAUG 0 , 1%

7647 2511 UCCUUUCGUCAGUCaGAAA 0.1% 7696 2516 AAGAUUUUGUaCGACAAUG 1 , 6%

764« 2511 UCCUUUCGUCuGUCCGAAA 0.1% 7697 2516 AAGAUUUUGUGaGACAAUG 0.1%

7649 2511 UCCUUUCGUCAaUCCGAgA 1.7% 7698 2516 AAGAUUUUGUGgGACAAUG 0.1%

2512 2512 CCUUUCGUCAGUCCGAAAG 92.3% 7699 2516 AAGAcUUUGUGCGACAgUG 0 , 3%

7650 2512 CCUUUCGUCAGUCCGAgAG 4.1% 7700 2516 AAGAcUgUGUGCGACAAUG 0 , 1%

7651 2512 CCUUUCGUCAGUCuGAAAG 0.2% 7701 2516 AAGAUUUUGaGgGACAAUG 0.1%

7652 2512 CuUUUCGUCAGUCCGAAAG 0.1% 2517 2517 AGCGAUUUGAAGCAAUAUG 92.0%

7653 2512 CCUUUCaUCAGUCCGAAAG 0.2% 7702 2517 AGCGAUUUGAAGCAgUAUG 0.4%

7654 2512 CCUUUCGcCAGUCCGAAAG 0.2% 7703 2517 AGuGAUUUGAAGCAAUAUG 0 , 3%

7655 2512 CCUUUCGUCAaUCCGAAAG 1,0% 7704 2517 gGCGAUUUGAAGCAAUAUG 0.3%

7656 2512 CCUUUCGUCAGUCaGAAAG 0,1% 7705 2517 gGuGAUUUGAAGCAAUAUG 0 , 3%

7657 2512 CCUUUCGUCuGUCCGAAAG 0.1% 7706 2517 AGCGAcUUGAAGCAAUAUG 0 , 1%

765« 2512 CCUUUCGUCAaUCCGAgAG 1.7% 7707 2517 AGCGAUcUGAAGCAAUAUG 0.3%

2512 2513 GCGAUUUGAAGCAAUAUGA 92.3% 7708 2517 AGCGAUUUaAAGCAAUAUG 0 , 4%

7659 2513 GCGAUUUGAAGCAgUAUGA 0.4% 7709 2517 AGCGAUUUGAAaCAAUAUG 0.2%

7660 2513 GuGAUUUGAAGCAAUAUGA 0.6% 7710 2517 AGCGAUUUGAAGCAAUACG 0.5%

7661 2513 GCGAcUUGAAGCAAUAUGA 0.4% 7711 2517 AGCGAUUUGAAGCAAUAUa 0 , 4%

7662 2513 GCGAUcUGAAGCAAUAUGA 0.3% 7712 2517 gGCGAcUUGAAGCAAUAUG 0.3%

7663 2513 GCGAUUUaAAGCAAUAUGA 0.4% 7713 2517 AGCGAcUUGAAGCAAUAUu 0.1%

7664 2513 GCGAUUUGAAaCAAUAUGA 0.2% 2518 2518 GUCCGAAAGAGGCGAAGAA 91.8%

7665 2513 GCGAUUUGAAGCAAUACGA 0.5% 7714 2518 GUCCGAAAGAGGCGAAGAg 0 , 9%

7666 2513 GCGAUUUGAAGCAAUAUaA 0.4% 7715 2518 GUCCGAAAGAGGCGAgGAA 0.2%

7667 2513 GCGAcUUGAAGCAAUAUuA 0.1% 7716 2518 GUCCGAgAGAGGCGAAGAA 1.3%

2514 2514 CUUUCGUCAGUCCGAAAGA 92,3% 7717 2518 GUCuGAAAGAGGCGAAGAA 0.2%

7668 2514 CUUUCGUCAGUCCGAgAGA 4,1% 7718 2518 GUCCGAgAGAGGCGAAGAg 2.5%

7669 2 r .14 CUUUCGUCAGUCUGAAAGA 0.2% 7719 2518 aUCCGAAAGAGGCGAAGAA 0.6%

7670 2514 uUUUCGUCAGUCCGAAAGA 0.1% 7720 2518 GUCaGAAAGAGGCGAAGAA 0 , 1%

7671 2514 CUUUCaUCAGUCCGAAAGA 0.2% 7721 2518 GUCCGAAAGAGGCGAAaAA 0.1%

767J 2VL4 CUUUCGcCAGUCCGAAAGA 0.2% 7722 2518 aUCCGAAAGAGGCGAAGAg 0.3%

7673 2514 CUUUCGUCAaUCCGAAAGA 1.0% 7723 2518 aUCCGAgAGAGGCGAAGAg 1.7%

7674 2'.14 CUUUCGUCAGUCaGAAAGA 0.1% 7724 2518 GUCCGAgAGAGGaGAAGAg 0.3%

7675 2514 CUUUCGUCuGUCCGAAAGA 0,1% 2519 2519 GGCAAGCUUUCUCAAAUGU 91.5%

767h 2 r .14 CUUUCGUCAaUCCGAgAGA 1.7% 7725 2519 GGCAAGCUUUCUCAgAUGU 0.1%

2515 2 C .15 AAGCGAUUUGAAGCAAUAU 92,1% 7726 2519 GGuAAGCUUUCUCAAAUGU 0.1%

7677 2M.5 AAGCGAUUUGAAGCAgUAU 0,4% 7727 2519 GGCAAaCUUUCUCAAAUGU 0 , 2%

767s 2515 AAGuGAUUUGAAGCAAUAU 0,3% 7728 2519 GGCAAGCUgUCUCAAAUGU 0.3%

7679 2515 AgGCGAUUUGAAGCAAUAU 0.3% 7729 2519 GGCAAGCUUUCaCAAAUGU 7.7%

768π 2515 gAGCGAUUUGAAGCAAUAU 0.1% 7730 2519 GGuAAGCUUUCaCAAAUGU 0.1%

7681 2515 AgGuGAUUUGAAGCAAUAU 0.3% 2520 2520 ACAUUCUUUGGAUGGAAAG 91.3%

7682 2515 AAGCGAcUUGAAGCAAUAU 0.2% 7731 2520 ACAUUCUUUGGAUGGAAgG 1.1%

7732 2520 ACAUUCUUUGGgUGGAAAG 0.1% 7781 2525 AUUUaAAGCAAUAUGAUAG 0.4%

7733 25J0 ACAUUuUUUGGAUGGAAAG 2.3% 7782 2525 AUUUGAAaCAAUAUGAUAG 0.2%

7734 2520 ACgUUCUUUGGAUGGAAAG 0.6% 7783 2525 AUUUGAAGCAAUACGAUAG 0.5%

7735 2520 ACAUUCUUUGGgUGGAAgG 0.1% 7784 2525 AUUUGAAGCAAUAUaAUAG 0.4%

7736 2520 ACAUUCUUcGGAUGGAAAG 0.1% 7785 2525 AUUUGAAGCAAUAUGAcAG 1 , 8%

7737 2520 ACAUUCUUUGGAUGGAAga 0.2% 7786 2525 AcUUGAAGCAAUAUuAUAG 0.1%

7738 2520 ACAUUCUUUGGcUGGAgAG 2.0% 2526 2526 AUUGAGGGCAAGCUUUCUC 91,2%

773" 2 c O0 ACAUUuUUcGGAUGGAAAG 0.1% 7787 2526 AUUGAGGGuAAGCUUUCUC 0 , 1%

7740 2520 ACAUUCUUcGGcUGGAgAG 0.2% 7788 2526 AUaGAGGGCAAGOJUUCUC 0.1%

7741 2520 ACAUUuUUcGGcUGGAAAG 1.6% 7789 2526 AUUGAaGGCAAGCUUUCUC 0 , 3%

7742 2520 ACAUUuUUcGGuUGGAAAG 0.1% 7790 2526 AUUGAGGGCAAaCUUUCUC 0 , 2%

2521 2521 CAUUCUUUGGAUGGAAAGA 91,3% 7791 2526 AUUGAGGGCAAGCUgUCUC 0.3%

7743 2521 CAUUCUUUGGAUGGAAgGA 1,1% 7792 2526 AUUGAGGGCAAGCUUUCaC 7 , 7%

7744 2521 CAUUCUUUGGgUGGAAAGA 0,1% 7793 2526 AUUGAGGGuAAGCUUUCaC 0 , 1%

7745 2521 CAUUuUUUGGAUGGAAAGA 2.3% 2527 2527 GAGGGCAAGCUIJUCUCAAA 91,2%

7746 2521 CgUUCUUUGGAUGGAAAGA 0.6% 7794 2527 GAGGGCAAGCUL'UCUCAgA 0 , 1%

7747 2521 CAUUCUUUGGgUGGAAgGA 0.1% 7795 2527 GAGGGuAAGCUUUCUCAAA 0 , 1%

774« 2521 CAUUCUUcGGAUGGAAAGA 0.1% 7796 2527 GAaGGCAAGCLLUCUCAAA 0.3%

774y 2521 CAUUCUUUGGAUGGAAgaA 0.2% 7797 2527 GAGGGCAAaCLπjUCUCAAA 0.2%

7750 2521 CAUUCUUUGGcUGGAgAGA 2.0% 7798 2527 GAGGGCAAGCUgUCUCAAA 0 , 3%

7751 2521 CAUUuUUCGGAUGGAAAGA 0.1% 7799 2527 GAGGGCAAGCUUUCaCAAA 7 , 7%

7751: 2521 CAUUCUUcGGcUGGAgAGA 0.2% 7800 2527 GAGGGuAAGCUUUCaCAAA 0 , 1%

7753 2521 CAUUuUUcGGcUGGAAAGA 1.6% 2528 2528 GGGCAAGCUUUCUCAAAUG 91,2%

7754 2521 CAUUuUUcGGuUGGAAAGA 0.1% 7801 2528 GGGCAAGCUULlCUCAgAUG 0 , 1%

2522 2 C ' 22 GAUUUGAAGCAAUAUGAUA 91.2% 7802 2528 GGGuAAGCUUUCUCAAAUG 0 , 1%

7755 2522 GAUUUGAAGCAgUAUGAUA 0.4% 7803 2528 aGGCAAGCUULiCUCAAAUG 0.3%

775' ' . 2522 GAcUUGAAGCAAUAUGAUA 0.4% 7804 2528 GGGCAAaCUULlCUCAAAUG 0.2%

7757 2522 GAUcUGAAGCAAUAUGAUA 0.3% 7805 2528 GGGCAAGCUgUCUCAAAUG 0.3%

775£ 2522 GAUUUaAAGCAAUAUGAUA 0.4% 7806 2528 GGGCAAGCUUUCaCAAAUG 7.7%

7753 2522 GAUUUGAAaCAAUAUGAUA 0,2% 7807 2528 GGGuAAGCUUUCaCAAAUG 0.1%

7760 2522 GAUUUGAAGCAAUACGAUA 0,5% 2529 2529 CAUUGAGGGCAAGCUUUCU 91,2%

7761 2522 GAUUUGAAGCAAUAUaAUA 0,4% 7808 2529 CAUUGAGGGuAλGCUUUCU 0.1%

7762 2 C .22 GAUUUGAAGCAAUAUGACA 1.8% 7809 2529 CAUaGAGGGCAAGCUUUCU 0.1%

7763 2522 GAcUUGAAGCAAUAUuAUA 0.1% 7810 2529 CAUUGAaGGCAAGCUUUCU 0 , 3%

2523 2'- ' 23 AGGGCAAGCUUUCUCAAAU 91.2% 7811 2529 CAUUGAGGGCAAaCUUUCU 0.2%

7764 2523 AGGGCAAGCUUUCUCAgAU 0.1% 7812 2529 CAUUGAGGGCAAGCUgUCU 0.3%

7765 2523 AGGGuAAGCUUUCUCAAAU 0.1% 7813 2529 CAUUGAGGGCAAGCUUUCa 7.7%

7766 2523 AaGGCAAGCUUUCUCAAAU 0.3% 7814 2529 CAUUGAGGGuMGCUUUCa 0.1%

7767 2523 AGGGCAAaCUUUCUCAAAU 0.2% 2530 2530 GCAAGCUUUCUCAAAUGUC 91.0%

7768 2523 AGGGCAAGCUgUCUCAAAU 0.3% 7815 2530 GCAAGCUUUCUCAAAUGUu 0.1%

7769 2523 AGGGCAAGCUUUCaCAAAU 7.7% 7816 2530 GCAAGCUUUCUCAgAUGUC 0.1%

7770 AGGGuAAGCUUUCaCAAAU 0.1% 7817 2530 GuAAGCUUUCUCAAAUGUC 0.1%

2524 2 r .24 UUGAGGGCAAGCUUUCUCA 91.2% 7818 2530 GCAAaCUUUσJCAAAUGUC 0.2%

7771 2524 UUGAGGGuAAGCUUUCUCA 0.1% 7819 2530 GCAAGCUgUCUCAAAUGUC 0 , 3%

7772 2524 UaGAGGGCAAGCUUUCUCA 0.1% 7820 2530 GCAAGCUUUCaCAAAUGUC 7 , 7%

7773 2524 UUGAaGGCAAGCUUUCUCA 0.3% 7821 2530 GCAAGCUUUOJCAAAUGUa 0 , 3%

7774 2524 UUGAGGGCAAaCUUUCUCA 0.2% 7822 2530 GuAAGCUUUCnCAAAUGUC 0 , 1%

7775 2524 UUGAGGGCAAGCUQUCUCA 0.3% 2531 2531 UGAGGGCAAGCUUUCUCAA 91.0%

777b 2524 UUGAGGGCAAGCUUUCaCA 7.7% 7823 2531 UGAGGGCAAGCUUUCUCAg 0.1%

7777 2524 UUGAGGGuAAGCUUUCaCA 0.1% 7824 2531 UGAGGGuAAGCUUUCUCAA 0 , 1%

2525 2525 AUUUGAAGCAAUAI ''"AUAG 91.2% 7825 2531 aGAGGGCAAGOJUUCUCAA 0.1%

7778 2525 AUUUGAAGCAgUλUGλUAG 0.4% 7826 2531 UGAaGGCAAGCUUUCUCAA 0.3%

777:-) 2525 AcUUGAAGCAAUλUGAUAG 0,4% 7827 2531 UGAGGGCAAaCUUUCUCAA 0 , 2%

7780 2^.25 AUcUGAAGCAAUAUGAUAG 0,3% 7828 2531 UGAGGGCAAGCUgUCUCAA 0 , 3%

7829 2531 UGAGGGCAAGCUUUCaCAA 7,7% 7878 2536 CGAUcUGAAGCAAUAUGAU 0.3%

783U 2531 UGAGGGuAAGCUUUCaCAA 0,1% 7879 2536 CGAUUUaAAGCAAUAUGAU 0.4%

2532 2532 CAGAAUGAGUUCAACAAGG 91,0% 7880 2536 CGAUUUGAAaCAAUAUGAU 0.2%

7831 2532 CAGAAUGAGUUCAAUAAGG 0,2% 7881 2536 CGAUUUGAAGCAAUACGAU 0 , 5%

7832 2532 CAGAAUGAGUUUAACAAGG 0.3% 7882 2536 CGAUUUGAAGCAAUAUaAU 0.4%

7833 2532 CAGAgUGAGUUCAACAAGG 1.5% 7883 2536 CGAUUUGAAGCAAUAUGAC 1 , 8%

7834 2532 CAaAAUGAGUUCAACAAGG 0.2% 7884 2536 CGAcUUGAAGCAAUAUuAU 0 , 1%

7835 2532 CAGAAUGAaUUCAAO-AGG 3.3% 2537 2537 UCUUUGGAUGGAAAGAACC 90,5%

783 ( .■ 2532 CAGAAUGAGUUCAACAAaG 1.2% 7885 2537 UCUUUGGAUGGAAAGAgCC 1 , 4%

7837 2532 CAGAuUGAGUUCAACAAGG 0.1% 7886 2537 UCUUUGGAUGGAAgGAACC 1 , 1%

7838 2532 CAGAAUGgaUUCAACAAGG 0.1% 7887 2537 UCUUUGGgUGGAAAGAACC 0 , 1%

7839 2532 CAGAgUGAaUUCAACAAGG 2.0% 7888 2537 UuUUUGGAUGGAAAGAACC 2.3%

2533 2533 AGAAUGAGUUCAACAAGGC 91.0% 7889 2537 UCUUUGGgUGGAAgGAACC 0 , 1%

7840 2533 AGAAUGAGUUCAAUAAGGC 0.2% 7890 2537 UCUUcGGAUGGAAAGAACC 0 , 1%

7841 2533 AGAAUGAGUUUAACAAGGC 0.3% 7891 2537 UCUUUGGAUGGAAgaAACC 0 , 2%

784J 2533 AGAgUGAGUUCAACAAGGC 1,5% 7892 2537 UCUUUGGcUGGAgAGAgCC 2 , 0%

784; 2533 AaAAUGAGUUCAACAAGGC 0.2% 7893 2537 UuUUcGGAUGGAAAGAgCC 0.1%

7844 2533 AGAAUGAaUUCAACAAGGC 3.3% 7894 2537 UuUUcGGcUGGAAAGAACC 0.7%

7845 2533 AGAAUGAGUUCAACAAaGC 1.2% 2538 2538 AGCCUAUGUGGAUGGAUUC 90.4%

7846 2533 AGAuUGAGUUCAACAAGGC 0.1% 7895 2538 AGCCUAUGUGGAUGGAUUu 7 , 9%

7847 2533 AGAAUGgaUUCAACAAGGC 0,1% 7896 2538 AGCCUAUGUGGAUGGgUUC 0.6%

7843 2533 AGAgUGAaUUCAACAAGGC 2.0% 7897 2538 AaCCUAUGUGGAUGGAUUC 0 , 5%

2534 2534 AUAAGCGAUUUGAAGCAAU 90.7% 7898 2538 AGCCUAUGUaGAUGGAUUC 0.2%

7849 2534 AUAAGCGAUUUGAAGCAgU 0.4% 7899 2538 AGCCUAUGUGGAUaGAUUC 0 , 1%

7850 2534 AUAAGuGAUUUGAAGCAAU 0.3% 7900 2538 AGCCUAUGUGGAUGGAUaC 0 , 1%

7851 2534 AUAgGCGAUUUGAAGCAAU 0.2% 7901 2538 AaCCUAUGUGGAUGGAUUu 0.1%

785J 2534 AUgAGCGAUUUGAAGCAAU 0.1% 2539 2539 CAAUGAAAGAGUAUGGAGA 90,4%

785. "; 2534 gUAAGCGAUUUGAAGCAAU 1.7% 7902 2539 CAAUGAAAGAGUAUGGgGA 4 , 1%

7854 2534 gUAgGCGAUUUGAAGCAAU 0.1% 7903 2539 CgAUGAAAGAGUAUGGAGA 0 , 1%

7855 2534 gUAgGuGAUUUGAAGCAAU 0.3% 7904 2539 CAAUGAAgGAGUAUGGgGA 0 , 3%

785.:! 2534 AcAAGOGAUUUGAAGCAAU 0.2% 7905 2539 CAAUGAAAGAaUAUGGAGA 2.1%

785? 2534 AUAAGCGAcUUGAAGCAAU 0.1% 7906 2539 CAAUGAAAGAGUACGGAGA 0.1%

7858 2534 AUAAGCGAUUUaAAGCAAU 0.3% 7907 2539 CAAUGAAAGAGUuUGGAGA 0.1%

7859 2534 AUAAGCGAUUUGAAaCAAU 0.2% 7908 2539 CAAUGAAAGAaUAUGGgGA 1.6%

7860 2534 AUuAGCGAUUUGAAGCAAU 0.2% 7909 2539 CAAUGAAgGAaUAUGGgGA 0 , 3%

7861 2534 AUAgGCGAcUUGAAGCAAU 0,3% 7910 2539 CAAUGAAgGAGUAcGGgGA 0 , 6%

7862 2534 gUAAGCGAcUUGAAGCAAU 0.1% 7911 2539 CAAUGAAAGAacAUGGAGA 0 , 1%

7863 2534 gUAAGCGAUcUGAAGCAAU 0.3% 2540 2540 UGAAGCAAUAUGAUAGUGA 90,2%

7864 2534 AUcAGOGAUUUaAAGCAAU 0,1% 7912 2540 UGAAGCAgUAUGAUAGUGA 0 , 4%

2535 2535 UUCUUUGGAUGGAAAGAAC 90,5% 7913 2540 UaAAGCAAUAUGAUAGUGA 0 , 4%

7865 2535 UUCUUUGGAUGGAAAGAgC 1.4% 7914 2540 UGAAaCAAUAUGAUAGUGA 0 , 2%

7866 2535 UUCUUUGGAUGGAAgGAAC 1,1% 7915 2540 UGAAGCAAUACGAUAGUGA 0 , 5%

7867 2535 UUCUUUGGgUGGAAAGAAC 0.1% 7916 2540 UGAAGCAAUAUaAUAGUGA 0 , 4%

786;s 2535 UUuUUUGGAUGGAAAGAAC 2.3% 7917 2540 UGAAGCAAUAUGAcAGUGA 1.6%

786: ) 2535 UUCUUUGGgUGGAAgGMC 0,1% 7918 2540 UGAAGCAAUAUGAUAGCGA 1.6%

787H 2535 UUCUUcGGAUGGAAAGAAC 0,1% 7919 2540 UGAAGCAAUAUGAUAGUaA 0 , 1%

7871. 2535 UUCUUUGGAUGGAAgaAAC 0,2% 7920 2540 UGAAGCAAUAUUAUAGUGA 0 , 1%

787,: 2535 UUCUUUGGcUGGAgAGAgC 2,0% 7921 2540 UGAAGCAAUAUGACAGCGA 0 , 2%

787-: 2535 UUuUUcGGAUGGAAAGAgC 0,1% 2541 2541 UGGAAGAUUUUGUGCGACA 90.1%

7874 2535 UUuUUcGGcUGGAAAGAAC 0,7% 7922 2541 UGGAAGAUUUUGUGCGgCA 0 , 2%

253h 2536 CGAUUUGAAGCAAUAUGAU 90,5% 7923 2541 UGGAgGAUUUUGUGCGACA 0 , 2%

787 C . 2536 CGAUUUGAAGCAgUAUGAU 0.4% 7924 2541 UGGAAGAaUUUGUGCGACA 0 , 3%

787b 2536 uGAUUUGAAGCAAUAUGAU 0.6% 7925 2541 UGGAAGAcUUUGUGCGACA 3 , 4%

7877 2536 CGAcUUGAAGCAAUAUGAU 0,4% 7926 2541 UGGAAGAUUUCGUGCGACA 0 , 1%

7927 2541 UGGAAGAUUUUGUaCGACA 1,6% 7951 2543 AAAGAGUUcUUUGAGAAcA 0,3%

7928 2541 UGGAAGAUUUUGUGaGACA 0.1% 7952 2543 AAAGAGUUUUUcaAGAAUA 0,2%

7929 2541 UGGAAGAUUUUGUGgGACA 0.1% 2544 2544 UAAUGAAGGGGGUAUACAU 90,0%

7930 2541 UGGAAGAcUgUGUGCGACA 0.1% 7953 2544 UAAUGAAGGGGGUAUACgU 0.7%

7931 2541 UGGAAGAUUUUGaGgGACA 0.1% 7954 2544 UAAUGAAGGGGGUgUACAU 0.6%

2542 2542 AUGGAAGAUUUUGUGCGAC 90.1% 7955 2544 UAgUGAAGGGGGUAUACAU 0.1%

793J 2542 AUGGAAGAUUUUGUGCGgC 0.2% 7956 2544 UAAUGAAaGGGGUAUACAU 0.2%

793J' 2542 AUGGAgGAUUUUGUGCGAC 0.2% 7957 2544 UAAUGAAGGGaGUAUACAU 3,8%

7934 2542 AUGGAAGAaUUUGUGCGAC 0.3% 7958 2544 UAAUGAAGGGGaUAUACAU 0.1%

7935 2542 AUGGAAGAcUUUGUGCGAC 3.4% 7959 2544 UAAUGAAGGGGGUUUACAU 2.2%

7936 2542 AUGGAAGAUUUCGUGCGAC 0.1% 7960 2544 UAAUGAAGGGaGUgUACAU 2.0%

7937 2542 AUGGAAGAUUUUGUaCGAC 1.6% 7961 2544 UAAUGAAGGGaaUAUACAU 0.1%

7938 2542 AUGGAAGAUUUUGUGaGAC 0.1% 2545 2545 GAAGAUUUUGUGCGACAAU 90.0%

7933 2542 AUGGAAGAUUUUGUGgGAC 0.1% 7962 2545 GAAGAUUUUGUGCGACAgU 0.9%

7940 2542 AUGGAAGAcUgUGUGCGAC 0.1% 7963 2545 GAAGAUUUUGUGCGgCAAU 0.2%

7941 2542 AUGGAAGAUUUUGaGgGAC 0.1% 7964 2545 GAgGAUUUUGUGCGACAAU 0.2%

2543 2543 AAAGAGUUUUUUGAGAAUA 90.1% 7965 2545 GAAGAaUUUGUGCGACAAU 0,3%

7942 2543 AAAGAGUUUUUUGAGAgUA 0.4% 7966 2545 GAAGAcUUUGUGCGACAAU 3,1%

7943 2543 AAAGAGUUUUUUGAGgAUA 0.2% 7967 2545 GAAGAUUUcGUGCGACAAU 0.1%

7944 2543 AAgGAGUUUUUUGAGAAUA 0.1% 7968 2545 GAAGAUUUUGUaCGACAAU 1.6%

794^ 2543 AAAGAaUUUUUUGAGAAUA 0,3% 7969 2545 GAAGAUUUUGUGaGACAAU 0.1%

7946 2543 AAAGAGUUUUUCGAGAA.UA 1,5% 7970 2545 GAAGAUUUUGUGgGACAAU 0.1%

7947 2543 AAAGAGUUUUUUGAaAAUA 0.5% 7971 2545 GAAGAcUUUGUGCGACAgU 0,3%

794;i 2543 AAAGAGUUUUUUGAGAACA 1.6% 7972 2545 GAAGAcUgUGUGCGACAAU 0.1%

7941.' 2543 AAAGAuUUUUUUGAGAAUA 0.2% 7973 2545 GAAGAUUUUGaGgGACAAU 0.1%

7950 2543 AAAGnGUUUUUUGAGAAUA 0.1%

Table 20-4: Conserved and minor variant 19-mer sequences from the Influenza A segment 5 (NP) sequences listed in Table 1-5. The conserved sequences match at least 89% of the listed viral sequences, and the variants contain 3 or fewer nucleotide changes from the reference sequence.

Seq ID RefID Match Seq % Total 7988 3549 UUAUUUCUUCGGgGAuAAU 0.1%

3546 3546 UCUUAUUUCUUCGGAGACA 98.0% 3550 3550 CUUAUUUCUUCGGAGACAA 97.9%

7974 3546 UCUUAUUUCUUCGGgGACA 1.6% 7989 3550 CUUAUUUCUUCGGAGACAg 0.1%

7975 3546 UCUUAUUUCUUuGGAGACA 0.2% 7990 3550 CUUAUUUCUUCGGgGACAA 1.6%

7976 3546 UCUUAUUUCUUCGGgGAuA 0.1% 7991 3550 CUUAUUUCUUUGGAGACAA 0.2%

3547 3547 AUUUCUUCGGAGACAAUGC 97.9% 7992 3550 CUUAUUUCUUCGGgGAuAA 0.1%

7977 3547 AUUUCUUCGGAGACAgUGC 0.1% 3551 3551 UUUCUUCGGAGACAAUGCA 97.8%

7978 3547 AUUUCUUCGGgGACAAUGC 1.6% 7993 3551 UUUCUUCGGAGACAgUGCA 0.1%

7979 3547 AUUUCUUuGGAGACAAUGC 0.2% 7994 3551 UUUCUUCGGgGACAAUGCA 1.6%

7980 3547 AUUUCUUCGGgGAuAAUGC 0.1% 7995 3551 UUUCUUuGGAGACAAUGCA 0.2%

3548 3548 UAUUUCUUCGGAGACAAUG 97.9% 7996 3551 UUUCUUCGGgGAuAAUGCA 0.1%

7981 3548 UAUUUCUUCGGAGACAgUG 0.1% 3552 3552 AUGAAGGAUCUUAUUUCUU 97.6%

7982 3548 UAUUUCUUCGGgGACAAUG 1.6% 7997 3552 AUGAAGGgUCUUAUUUCUU 0.7%

7983 3548 UAUUUCUUuGGAGACAAUG 0.2% 7998 3552 AUGAgGGAUCUUAUUUCUU 0.2%

7984 3548 UAUUUCUUCGGgGAuAAUG 0.1% 7999 3552 AUGAgGGgUCUUAUUUCUU 0.2%

3549 3549 UUAUUUCUUCGGAGACAAU 97.9% 8000 3552 AcGAAGGAUCUUAUUUCUU 0.1%

7985 3549 UUAUUUCUUCGGAGACAgU 0.1% 8001 3552 AUGAAGGuUCUUAUUUCUU 1.1%

7986 3549 UUAUUUCUUCGGgGACAAU 1.6% 3553 3553 AAUGAAGGAUCUUAUUUCU 97.6%

7987 3549 UUAUUUCUUuGGAGACAAU 0.2% 8002 3553 AAUGAAGGgUCUUAUUUCU 0.7%

8003 3553 AAUGAgGGAUCUUAUUUCU 0.2% 8046 3561 GGAUCUUAUUUCUUUGGAG 0.2%

8004 3553 AAUGAgGGgUCUUAUUUCU 0.1% 8047 3561 GGgUCUUAUUUCUUCGGAG 0.9%

8005 3553 AAcGAAGGAUCUUAUUUCU 0.1% 8048 3561 GGuUCUUAUUUCUUCGGAG 1.1%

8006 3553 AAUGAAGGuUCUUAUUUCU 1.1% 3562 3562 AUCUUAUUUCUUCGGAGAC 96.0%

8007 3553 uAUGAgGGgUCUUAUUUCU 0.1% 8049 3562 AUCUUAUUUCUUCGGgGAC 1.6%

3554 3554 GAAGGAUCUUAUUUCUUCG 97.5% 8050 3562 AUCUUAUUUCUUuGGAGAC 0.2%

8008 3554 GAAGGAUCUUAUUUCUUuG 0.2% 8051 3562 gUCUUAUUUCUUCGGAGAC 0.9%

8009 3554 GAAGGgUCUUAUUUCUUCG 0.7% 8052 3562 AUCUUAUUUCUUCGGgGAu 0.1%

8010 3554 GAgGGAUCUUAUUUCUUCG 0.2% 8053 3562 UUCUUAUUUCUUCGGAGAC 1.1%

801 1 3554 GAgGGgUCUUAUUUCUUCG 0.2% 3563 3563 AGGAUCUUAUUUCUUCGGA 95.8%

8012 3554 GAAGGuUCUUAUUUCUUCG 1.1% 8054 3563 AGGAUCUUAUUUCUUCGGg 1.7%

3555 3555 AAGGAUCUUAUUUCUUCGG 97.5% 8055 3563 AGGAUCUUAUUUCUUUGGA 0.2%

8013 3555 AAGGAUCUUAUUUCUUuGG 0.2% 8056 3563 AGGgUCUUAUUUCUUCGGA 0.7%

8014 3555 AAGGgUCUUAUUUCUUCGG 0.7% 8057 3563 gGGAUCUUAUUUCUUCGGA 0.2%

8015 3555 AgGGAUCUUAUUUCUUCGG 0.2% 8058 3563 gGGgUCUUAUUUCUUCGGA 0.2%

8016 3555 AgGGgUCUUAUUUCUUCGG 0.2% 8059 3563 AGGuUCUUAUUUCUUCGGA 1.1%

8017 3555 AAGGuUCUUAUUUCUUCGG 1.1% 3564 3564 AGUAAUGAAGGAUCUUAUU 95.4%

3556 3556 UGAAGGAUCUUAUUUCUUC 97.4% 8060 3564 AGUAAUGAAGGgUCUUAUU 0.7%

8018 3556 UGAAGGAUCUUAUUUCUUu 0.2% 8061 3564 AGUAAUGAgGGAUCUUAUU 0.2%

8019 3556 UGAAGGgUCUUAUUUCUUC 0.7% 8062 3564 AaUAAUGAAGGAUCUUAUU 1.9%

8020 3556 UGAgGGAUCUUAUUUCUUC 0.2% 8063 3564 AGcAAUGAAGGAUCUUAUU 0.3%

8021 3556 UGAgGGgUCUUAUUUCUUC 0.2% 8064 3564 AGUAAcGAAGGAUCUUAUU 0.1%

8022 3556 cGAAGGAUCUUAUUUCUUC 0.1% 8065 3564 AGUAAUGAAGGUUCUUAUU 1.1%

8023 3556 UGAAGGuUCUUAUUUCUUC 1.1% 8066 3564 AaUAAUGAgGGgUCUUAUU 0.1%

3557 3557 UAAUGAAGGAUCUUAUUUC 97.3% 3565 3565 GUAAUGAAGGAUCUUAUUU 95.4%

8024 3557 UAAUGAAGGgUCUUAUUUC 0.7% 8067 3565 GUAAUGAAGGgUCUUAUUU 0.7%

8025 3557 UAAUGAgGGAUCUUAUUUC 0.2% 8068 3565 GUAAUGAgGGAUCUUAUUU 0.2%

8026 3557 UAAUGAgGGgUCUUAUUUC 0.1% 8069 3565 aUAAUGAAGGAUCUUAUUU 1.9%

8027 3557 cAAUGAAGGAUCUUAUUUC 0.3% 8070 3565 GcAAUGAAGGAUCUUAUUU 0.3%

8028 3557 UAAcGAAGGAUCUUAUUUC 0.1% 8071 3565 GUAAcGAAGGAUCUUAUUU 0.1%

8029 3557 UAAUGAAGGuUCUUAUUUC 1.1% 8072 3565 GUAAUGAAGGUUCUUAUUU 1.1%

8030 3557 UuAUGAgGGgUCUUAUUUC 0.1% 8073 3565 aUAAUGAgGGgUCUUAUUU 0.1%

3558 3558 UUCUUCGGAGACAAUGCAG 96.5% 3566 3566 AUGAGUAAUGAAGGAUCUU 95.3%

8031 3558 UUCUUCGGAGACAgUGCAG 0.1% 8074 3566 AUGAGUAAUGAAGGgUCUU 0.7%

8032 3558 UUCUUCGGgGACAAUGCAG 1.6% 8075 3566 AUGAGUAAUGAgGGAUCUU 0.2%

8033 3558 UUCUUuGGAGACAAUGCAG 0.2% 8076 3566 AUaAGUAAUGAAGGAUCUU 0.2%

8034 3558 UUCUUCGGgGAuAAUGCAG 0.1% 8077 3566 AUGAaUAAUGAAGGAUCUU 1.9%

8035 3558 UUCUUCGGAGACAAUGCAa 0.3% 8078 3566 AUGAGcAAUGAAGGAUCUU 0.3%

3559 3559 UCUUCGGAGACAAUGCAGA 96.4% 8079 3566 AUGAGUAAcGAAGGAUCUU 0.1%

8036 3559 UCUUCGGAGACAgUGCAGA 0.1% 8080 3566 AUGAGUAAUGAAGGUUCUU 1.1%

8037 3559 UCUUCGGgGACAAUGCAGA 1.6% 8081 3566 AUGAaUAAUGAgGGgUCUU 0.1%

8038 3559 UCUUuGGAGACAAUGCAGA 0.2% 3567 3567 UGAGUAAUGAAGGAUCUUA 95.3%

8039 3559 UCUUCGGgGAuAAUGCAGA 0.1% 8082 3567 UGAGUAAUGAAGGgUCUUA 0.7%

8040 3559 UCUUCGGAGACAAUGCAaA 0.3% 8083 3567 UGAGUAAUGAgGGAUCUUA 0.2% 356 ( 1 3560 GAUCUUAUUUCUUCGGAGA 96.0% 8084 3567 UaAGUAAUGAAGGAUCUUA 0.2%

804 1 3560 GAUCUUAUUUCUUCGGgGA 1.7% 8085 3567 UGAaUAAUGAAGGAUCUUA 1.9%

8042 3560 GAUCUUAUUUCUUuGGAGA 0.2% 8086 3567 UGAGcAAUGAAGGAUCUUA 0.3%

8043 3560 GgUCUUAUUUCUUCGGAGA 0.9% 8087 3567 UGAGUAAcGAAGGAUCUUA 0.1%

8044 3560 GuUCUUAUUUCUUCGGAGA 1.1% 8088 3567 UGAGUAAUGAAGGUUCUUA 1.1% 3561 3561 GGAUCUUAUUUCUUOGGAG 96.0% 8089 3567 UGAaUAAUGAgGGgUCUUA 0.1%

8045 3561 GGAUCUUAUUUCUUCGGgG 1.7% 3568 3568 GAGUAAUGAAGGAUCUUAU 95.3%

8090 3568 GAGUAAUGAAGGgUCXJUAU 0.7% 8137 3572 GUgCAAAUUGCUUCAAAUG 0.1%

8091 3568 GAGUAAUGAgGGAUCUUAU 0.2% 8138 3572 aUACAAAUUGCUUCAAAUG 0.4%

8092 3568 aAGUAAUGAAGGAUCUUAU 0.2% 8139 3572 GUACAAAUUGCCUCAAAUG 0.1%

8093 3568 GAaUAAUGAAGGAUCUUAU 1.9% 8140 3572 GUACAAAUUGCUUCAAACG 0.1%

8094 3568 GAGcAAUGAAGGAUCUUAU 0.3% 8141 3572 GUuCAAAUUGCUUCAAAUG 2.1%

8095 3568 GAGUAAcGAAGGAUCUUAU 0.1% 8142 3572 GUuCAgAUUGCUUCAAAUG 2.5%

8096 3568 GAGUAAUGAAGGuUCUUAU 1.1% 8143 3572 aUuCAAAUUGCUUCAAAUG 0.2%

8097 3568 GAaUAAUGAgGGgUCUUAU 0.1% 8144 3572 GUuCAAAUaGCUUCAAAUG 0.2%

3569 3569 UACAAAUUGCUUCAAAUGA 93.7% 8145 3572 GUuCAAAUUGCUUCcAAUG 0.5%

8098 3569 UACAgAUUGCUUCAAAUGA 0.1% 8146 3572 aUuCAgAUUGCUUCAAAUG 0.1%

8099 3569 UgCAAAUUGCUUCAAAUGA 0.1% 8147 3572 GUuCAgAUUGCgUCAAAUG 0.3%

810U 3569 UACAAAUUGCCUCAAAUGA 0.1% 8148 3572 GUuCAgAUUGCUUCcAAUG 0.1%

8101 3569 UACAAAUUGCUUCAAACGA 0.1% 3573 3573 AUGGUGCUCUCUGCUUUUG 93.3%

8102 3569 UuCAAAUUGCUUCAAAUGA 2.3% 8149 3573 AUGGUGCUuUCUGCUUUUG 0.2%

8103 3569 UuCAgAUUGCUUCAAAUGA 2.6% 8150 3573 AUGGUaCUCUCUGCUUUUG 0.8%

8104 3569 UuCAAAUaGCUUCAAAUGA 0.2% 8151 3573 AUGGUGCUaUCUGCUUUUG 0.1%

8105 3569 UuCAAAUUGCUUCcAAUGA 0.5% 8152 3573 AUGGUGCUCUCaGCUUUUG 0.1%

8106 3569 UuCAgAUUGCgUCAAAUGA 0.3% 8153 3573 AUGGUGCUCUCUGCCUUUG 1.1%

8107 3569 UuCAgAUUGCUUCcAAUGA 0.1% 8154 3573 AUGGUGCUCUCUGCgUUUG 0.1%

3570 3570 AAUUUCAAACAGCUGCACA 93.5% 8155 3573 AUGGUGCUCUCUGCUUUCG 0.2%

8108 3570 AAUUUCAAACAGCUGCgCA 0.3% 8156 3573 AUGGUaCUCUCUGCaUUUG 1.7%

8109 3570 AAUUUCAAACgGCUGCACA 0.4% 8157 3573 AUGGUcCUCUOJGCaUUUG 0.2%

8110 3570 AAUUUCAAgCAGCUGCACA 0.1% 8158 3573 AUGGUuCUCUCUGCaUUUG 1.9%

81 1 1 3570 AAUUUCAgACAGCUGCACA 0.3% 3574 3574 UGGUGCUCUCUGCUUUUGA 93.3%

8112 3570 AAcUUCAAACAGCUGCACA 0.2% 8159 3574 UGGUGCUuUCUGCUUUUGA 0.2%

8113 3570 AAUUcCAAACAGCUGCACA 0.1% 8160 3574 UGGUaCUCUCUGCUUUUGA 0.8%

8114 3570 AAUUUCAAACAGCaGCACA 2.1% 8161 3574 UGGUGCUaUCUGCUUUUGA 0.1%

8115 3570 AAUUUCAAACAGCUGCuCA 0.1% 8162 3574 UGGUGCUCUCaGCUUUUGA 0.1%

8116 3570 AAUUUCAAACUGCUGCACA 0.5% 8163 3574 UGGUGCUCUCUGCCUUUGA 1.1%

8117 3570 AAUUUCAAACgGCaGCACA 0.1% 8164 3574 UGGUGCUCUCUGCgUUUGA 0.1%

8118 3570 AAUUUCAgACAGCUGCcCA 0.3% 8165 3574 UGGUGCUCUCUGCUUUCGA 0.2%

81 19 3570 AgUUUCAgACAGCUGCcCA 0.1% 8166 3574 UGGUaCUOJCUGCaUUUGA 1.7%

8120 3570 AAUUcCAAACAGCaGCACA 1.9% 8167 3574 UGGUcCUCUCUGCaUUUGA 0.2%

8121 3570 AAUUUCAgACcGCaGCACA 0.1% 8168 3574 UGGUuCUCUCUGCaUUUGA 1.9%

3571 3571 AAAUUUCAAACAGCUGCAC 93.5% 3575 3575 CAAGGCACCAAACGGUCUU 93.2%

8122 3571 AAAUUUCAAACAGCUGCgC 0.3% 8169 3575 CAAGGuACCAAACGGUCUU 0.1%

8123 3571 AAAUUUCAAACgGCUGCAC 0.4% 8170 3575 CAgGGCACCAAACGGUCUU 0.8%

8124 3571 AAAUUUCAAgCAGCUGCAC 0.1% 8171 3575 CAAGGCAaCAAACGGUCUU 0.1%

8125 3571 AAAUUUCAgACAGCUGCAC 0.3% 8172 3575 CAAGGCACCAAACGaUCUU 4.7%

8126 3571 AAAcUUCAAACAGCUGCAC 0.2% 8173 3575 CAAGGCACCACACGGUCUU 0.1%

8127 3571 AAAUUcCAAACAGCUGCAC 0.1% 8174 3575 aAAGGCACCAAACGaUCUU 0.1%

8128 3571 AAAUUUCAAACAGCaGCAC 2.1% 8175 3575 CAAGGCACCAAACGaUCaU 0.7%

8129 3571 AAAUUUCAAACAGCUGCUC 0.1% 8176 3575 CAAGGCACCAAACGaUCcU 0.4%

8130 3571 AAAUUUCAAACUGCUGCAC 0.5% 3576 3576 AAGGCACCAAACGGUCUUA 93.2%

8131 3571 AAAUUUCAAACgGCaGCAC 0.1% 8177 3576 AAGGuACCAAACGGUCUUA 0.1%

8132 3571 AAAUUUCAgACAGCUGCcC 0.3% 8178 3576 AgGGCACCAAACGGUCUUA 0.8%

8133 3571 AAgUUUCAgACAGCUGCcC 0.1% 8179 3576 AAGGCAaCAAACGGUCUUA 0.1%

8134 3571 AAAUUcCAAACAGCaGCAC 1.9% 8180 3576 AAGGCACCAAACGaUCUUA 4.7%

8135 3571 AAAUUUCAgACcGCaGCAC 0.1% 8181 3576 AAGGCACCAcACGGUCUUA 0.1%

3572 3572 GUACAAAUUGCUUCAAAUG 93.4% 8182 3576 AAGGCACCAAACGaUCaUA 0.7%

8136 3572 GUACAgAUUGCUUCAAAUG 0.1% 8183 3576 AAGGCACCAAACGaUCcUA 0.4%

3577 3577 AGAGAAUGUGCAACAUUCU 93.0% 8230 3581 GGAACCCAGGgAAUGCUGA 0.6%

8184 3577 AGAGAAUGUGCAAuAUUCU 0.3% 8231 3581 GGAAuCCAGGAAAUGCUGA 1.2%

8185 3577 AGAGgAUGUGCAACAUUCU 0.5% 8232 3581 GaAACCCAGGAAAUGCUGA 0.4%

8186 3577 AaAGAAUGUGCAACAUUCU 1.2% 8233 3581 GGAACCCAaGAAAUGCUGA 0.1%

8187 3577 AGAGAAUGUGCAACAUcCU 4.0% 8234 3581 GGAACCCAGGAAACGCUGA 0.9%

8188 3577 AuAGAAUGUGCAACAUUCU 0.3% 8235 3581 GGAACCCAGGAAAUaCUGA 0.1%

8189 3577 AaAGAAUGUGCAAuAUUCU 0.4% 8236 3581 GaAACCCAGGgAAUGCUGA 0.4%

8190 3577 AGAGAAUGUGCAACgUcCU 0.1% 8237 3581 GaAAuCCAGGgAAUGCUGA 0.1%

8191 3577 AGAGAAUGUGCAAuAUcCU 0.2% 8238 3581 GGAAuCCuGGgAAUGCUGA 0.2%

8192 3577 AGAGgAUGUGCAACAUcCU 0.1% 3582 3582 CGAACCCGAUOGUGCCCUC 92.1%

3578 3578 GAGAGAAUGUGCAACAUUC 93.0% 8239 3582 CGAACCCGAUOGUGCCUUC 5.1%

8193 3578 GAGAGAAUGUGCAAuAUUC 0.3% 8240 3582 CGAACCCGgUCGUGCCCUC 1.1%

8194 3578 GAGAGgAUGUGCAACAUUC 0.5% 8241 3582 CGAgCCCGAUOGUGCCCUC 0.1%

8195 3578 GAaAGAAUGUGCAACAUUC 1.2% 8242 3582 CGAgCCCGAUCGUGCCuUC 0.6%

8196 3578 GAGAGAAUGUGCAACAUCC 4.0% 8243 3582 CGAACCCaAUCGUGCCCUC 0.5%

8197 3578 GAuAGAAUGUGCAACAUUC 0.3% 8244 3582 CGAACCCcAUOGUGCCCUC 0.3% 819(S 3578 GAaAGAAUGUGCAAuAUUC 0.4% 8245 3582 CGAACCCGAUOGUUCCCUC 0.1%

8199 3578 GAGAGAAUGUGCAACgUcC 0.1% 8246 3582 CGAACCCuAUCGUGCCCUC 0.1%

8200 3578 GAGAGAAUGUGCAAuAUcC 0.2% 3583 3583 CGGAACCCAGGAAAUGCUG 92.1%

8201 3578 GAGAGgAUGUGCAACAUcC 0.1% 8247 3583 CGGAACCCAGGgAAUGCUG 0.6%

3579 3579 AUUUCAAACAGCUGCACAA 92.7% 8248 3583 CGGAAuCCAGGAAAUGCUG 1.2%

8202 3579 AUUUCAAACAGCUGCACAg 0.9% 8249 3583 uGGAACCCAGGAAAUGCUG 0.1%

8203 3579 AUUUCAAACAGCUGCgCAA 0.3% 8250 3583 CGaAACCCAGGAAAUGCUG 0.4%

8204 3579 AUUUCAAACgGCUGCACAA 0.4% 8251 3583 CGGAACCCAaGAAAUGCUG 0.1%

8205 3579 AUUUCAAgCAGCUGCACAA 0.1% 8252 3583 CGGAACCCAGGAAACGCUG 0.9%

8206 3579 AUUUCAgACAGCUGCACAg 0.3% 8253 3583 CGGAACCCAGGAAAUaCUG 0.1%

8207 3579 ACUUCAAACAGCUGO-CAA 0.2% 8254 3583 CGaAACCCAGGgAAUGCUG 0.4%

8208 3579 AUUcCAAACAGCUGCACAA 0.1% 8255 3583 CGGAAuCCuGGgAAUGCUG 0.1%

8209 3579 AUUUCAAACAGCaGCACAA 2.1% 3584 3584 AAUGCUGAGAUCGAAGAUC 91.7%

8210 3579 AUUUCAAACAGCUGCuCAA 0.1% 8256 3584 AAUGCUGAGAUUGAAGAUC 1.3%

8211 3579 AUUUCAAACuGCUGCACAA 0.5% 8257 3584 AAcGCUGAGAUCGAAGAUC 0.2%

8212 3579 AUUUCAAACgGCaGCACAA 0.1% 8258 3584 AAUaCUGAGAUCGAAGAUC 0.1%

8213 3579 AUUUCAgACAGCUGCcCAg 0.3% 8259 3584 AAUGCUGAaAUCGAAGAUC 0.9%

8214 3579 AUUcCAAACAGCaGCACAA 1.9% 8260 3584 AAUGCUGAGAUaGAAGAUC 0.1%

8215 3579 AUUUCAgACcGCaGCACAA 0.1% 8261 3584 AAUGCUGAGuUCGAAGAUC 0.5%

3580 3580 UUUCAAACAGCUGCACAAA 92.7% 8262 3584 AAcGCUGAGAUuGAAGAUC 0.2%

8216 3580 UUUCAAACAGCUGCACAgA 0.9% 8263 3584 AAUGCUGAaAUuGAAGAUC 4.1%

8217 3580 UUUCAAACAGCUGCgCAAA 0.3% 8264 3584 AAUGCUGAGAUUGAAGACC 0.2%

8218 3580 UUUCAAACgGCUGCACAAA 0.4% 8265 3584 AAUGCUGAGuaCGAAGAUC 0.1%

8219 3580 UUUCAAgCAGCUGCACAAA 0.1% 8266 3584 AAcGCUGAaAUuGAAGAUC 0.5%

8220 3580 UUUCAgACAGCUGCACAgA 0.3% 3585 3585 AUGCUGAGAUCGAAGAUCU 91.7%

8221 3580 cUUCAAACAGCUGCACAAA 0.2% 8267 3585 AUGCUGAGAUUGAAGAUCU 1.3%

8222 3580 UUcCAAACAGCUGCACAAA 0.1% 8268 3585 AcGCUGλGAUCGAAGAUCU 0.2%

8223 3580 UUUCAAACAGCaGCACAAA 2.0% 8269 3585 AUaCUGAGAUOGAAGAUCU 0.1%

8224 3580 UUUCAAACAGCUGCuCAAA 0.1% 8270 3585 AUGCUGAaAUOGAAGAUCU 0.9%

8225 3580 UUUCAAACuGCUGCACAAA 0.5% 8271 3585 AUGCUGλGAUaGAAGAUCU 0.1%

8226 3580 UUUCAAACgGCaGCACAAA 0.1% 8272 3585 AUGCUGAGuUCGAAGAUCU 0.5%

8227 3580 UUUCAgACAGCUGCcCAgA 0.4% 8273 3585 AcGCUGAGAUuGAAGAUCU 0.2%

8228 3580 UUcCAAACAGCaGCACAAA 1.7% 8274 3585 AUGCUGAaAUuGAAGAUCU 4.1%

8229 3580 UUUCAAACAGCaGCACAAc 0.1% 8275 3585 AUGCUGAGAUUGAAGACCU 0.2%

3581 3581 GGAACCCAGGAAAUGCUGA 92.2% 8276 3585 AUGCUGλGuaCGAAGAUCU 0.1%

8277 3585 AcGOJGAaAUuGAAGAUOJ 0.5% 8323 3590 ACCCAGGgAAUGCUGAGAU 0.6%

3586 3586 GAUUOJACAUCCAAAUGUG 91.6% 8324 3590 AuCCAGGAAAUGCUGAGAU 1.0%

8278 3586 GAUUCUACAUCCAgAUGUG 2.3% 8325 3590 ACCCAaGAAAUGCUGAGAU 0.1%

8279 3586 GAUUCUACAUUCAAAUGUG 0.9% 8326 3590 ACCCAGGAAACGCUGAGAU 0.4%

8280 3586 GAUUCUAuAUCCAAAUGUG 0.5% 8327 3590 ACCCAGGAAAUaCUGAGAU 0.1%

8281 3586 GAUUuUACAUCCAAAUGUG 0.2% 8328 3590 ACCCAGGAAAUGCUGAaAU 1.0%

8282 3586 GgUUCUACAUCCAAAUGUG 0.3% 8329 3590 ACCCAGGgAAUGCUGAGuU 0.4%

8283 3586 aAUUCUACAUCCAAAUGUG 0.1% 8330 3590 AuCCAGGAAAUGCUGAaAU 0.1%

8284 3586 GAUUCUACAUaCAgAUGUG 0.1% 8331 3590 AuCCAGGAAAUGCUGAGuU 0.1%

8285 3586 GgUUCUACAUaCAgAUGUG 1.7% 8332 3590 AuCCAGGgAAUGCUGAaAU 0.1%

8286 3586 aAUUCUACAUaCAAAUGUG 0.2% 8333 3590 AuCCuGGgAAUGCUGAGAU 0.3%

3587 3587 GCUGAGAUCGAAGAUCUCA 91.5% 8334 3590 ACCCAGGAAAcGCUGAaAU 0.5%

8287 3587 GCUGAGAUuGAAGAUCUCA 1.4% 8335 3590 ACCCAGGAAAUGCUGAGua 0.1%

8288 3587 GCUGAGAUuGAAGAUCUuA 0.1% 8336 3590 ACCCAGGAAAUGCUGAaga 0.1%

8289 3587 aCUGAGAUCGAAGAUσJCA 0.1% 3591 3591 UCAGACAUGAGGGCAGAAA 91.4%

8290 3587 GCUGAaAUCGAAGAUCUCA 0.9% 8337 3591 UCAGACAUGAGGGCAGAgA 0.2%

8291 3587 GCUGAGAUaGAAGAUCUCA 0.1% 8338 3591 UCAGACAUGAGGGCgGAAA 0.1%

8292 3587 GCUGAGAUCGAAGAUCUaA 0.4% 8339 3591 UCAGAuAUGAGGGCAGAAA 0.4%

8293 3587 GCUGAGuUCGAAGAUCUCA 0.5% 8340 3591 UCgGACAUGAGGGCAGAAA 0.4%

8294 3587 GCUGAaAUuGAAGAUCUCA 4.4% 8341 3591 UCAGACAUGAGaGCAGAAA 1.4%

8295 3587 GCUGAaAUuGAAGAUCUuA 0.2% 8342 3591 UCAGACAUGAGGGCUGAAA 0.4%

8296 3587 GCUGAGAUuGAAGAcCUuA 0.2% 8343 3591 UCuGACAUGAGGGCAGAAA 0.3%

8297 3587 GCUGAGuaCGAAGAUCUCA 0.1% 3592 3592 UGCUGAGAUCGAAGAUCUC 91.4%

3588 3588 UCAUGGCAGCAUUCACUGG 91.5% 8344 3592 UGCUGAGAUuGAAGAUCUC 1.2%

8298 3588 UCAUGGCAGCAUUCAUUGG 0.2% 8345 3592 UGCUGAGAUuGAAGAUCUu 0.1%

8299 3588 UCAUGGCAGCAUUCgCUGG 0.1% 8346 3592 cGCUGAGAUCGAAGAUCUC 0.2%

8300 3588 UCAUGGCAGCAUUUACUGG 0.4% 8347 3592 UaCUGAGAUCGAAGAUCUC 0.1%

8301 3588 UCAUGGCAGCgUUCAOJGG 0.1% 8348 3592 UGCUGAaAUCGAAGAUCUC 0.9%

8302 3588 UuAUGGCAGCAUUCACUGG 1.6% 8349 3592 UGCUGAGAUaGAAGAUCUC 0.1%

8303 3588 UCAUGGCAGCAUUCACCGG 0.3% 8350 3592 UGCUGAGAUCGAAGAUCUa 0.4%

8304 3588 UCAUGGCAGCUUUCACUGG 0.3% 8351 3592 UGCUGAGuUCGAAGAUCUC 0.5%

8305 3588 UCAUGGCuGCAUUCACUGG 0.3% 8352 3592 cGCUGAGAUuGAAGAUCUC 0.2%

8306 3588 UuAUGGCAGCAUUCAaUGG 0.4% 8353 3592 UGCUGAaAUuGAAGAUCUC 4.1%

8307 3588 UuAUGGCAGCAUUuACaGG 1.7% 8354 3592 UGCUGAGAUuGAAGAcCUu 0.2%

8308 3588 UuAUGGCAGCuUUCAuUGG 0.2% 8355 3592 UGCUGAGuaCGAAGAUCUC 0.1%

8309 3588 UuAUGGCAGCuUUCgCUGG 0.1% 8356 3592 cGCUGAaAUuGAAGAUCUC 0.3%

83 10 3588 UCAUGGCgGCcUUCACaGG 0.2% 3593 3593 CAGAC/.UGAGGGCAGAAAU 91.4%

83 1 1 3588 UCAUuGCAGCAUUuACaGG 0.1% 8357 3593 CAGACλUGAGGGCAGAgAU 0.2%

3589 3589 AAUGGUGCUCUCUGCUUUU 91.5% 8358 3593 CAGACAUGAGGGCgGAAAU 0.1%

8312 3589 AAUGGUGCUuUCUGCUUUU 0.2% 8359 3593 CAGAuAUGAGGGCAGAAAU 0.4%

8313 3589 gAUGGUGCUCUCUGCUUUU 1.8% 8360 3593 CgGACAUGAGGGCAGAAAU 0.4%

83 14 3589 AAUGGUaCUCUCUGCUUUU 0.8% 8361 3593 CAGACAUGAGaGCAGAAAU 1.4%

83 1 5 3589 AAUGGUGCUaUCUGCUUUU 0.1% 8362 3593 CAGACAUGAGGGCUGAAAU 0.4%

83 16 3589 AAUGGUGCUCUCaGCUUUU 0.1% 8363 3593 CuGACAUGAGGGCAGAAAU 0.3%

83 17 3589 AAUGGUGCUCUCUGCCUUU 1.1% 3594 3594 AACCCλGGAAAUGCUGAGA 91.4%

83 1 8 3589 AAUGGUGCUCUCUGCgUUU 0.1% 8364 3594 AACCCAGGgAAUGCUGAGA 0.6%

83 19 3589 AAUGGUGCUCUCUGCUUUc 0.2% 8365 3594 AAuCCAGGAAAUGCUGAGA 1.0%

8320 3589 AAUGGUaCUCUCUGCaLlUU 1.7% 8366 3594 AACCCAaGAAAUGCUGAGA 0.1%

8321 3589 AAUGGUcCUCUCUGCaUUU 0.2% 8367 3594 AACCCAGGAAACGCUGAGA 0.4%

8322 3589 AAUGGUuCUCUCUGCaUUU 1.9% 8368 3594 AACCCAGGAAAUaCUGAGA 0.1%

3590 3590 ACCCAGGAAAUGCUGAGAU 91.4% 8369 3594 AACCCACGAAAUGCUGAaA 1.0%

8370 3594 AACCCAGGAAAUGCUGAGu 0.1% 3598 3598 CCAGGAAAUGCUGAGAUCG 91.1%

8371 3594 AACCCAGGAAAUGCUGAag 0.1% 8418 3598 CCAGGAAAUGCUGAGAUUG 1.2%

8372 3594 AACCCAGGgAAUGCUGAGu 0.4% 8419 3598 CCAGGgAAUGCUGAGAUCG 0.6%

8373 3594 AAuCCAGGAAAUGCUGAaA 0.1% 8420 3598 CCAaGAAAUGCUGAGAUCG 0.1%

8374 3594 AAuCCAGGAAAUGCUGAGu 0.1% 8421 3598 CCAGGAAAcGCUGAGAUCG 0.2%

8375 3594 AAuCCAGGgAAUGCUGAaA 0.1% 8422 3598 CCAGGAAAUaCUGAGAUCG 0.1%

8376 3594 AAuCCuGGgAAUGCUGAGA 0.3% 8423 3598 CCAGGAAAUGCUGAaAUCG 0.9%

8377 3594 AACCCAGGAAAcGCUGAaA 0.5% 8424 3598 CCAGGAAAUGCUGAGAUaG 0.1%

3595 3595 AAAUGCUGAGAUCGAAGAU 91.2% 8425 3598 CCAGGAAAUGCUGAGUUCG 0.1%

8378 3595 AAAUGCUGAGAUUGAAGAU 1.2% 8426 3598 CCAGGAAAcGCUGAGAUuG 0.2%

8379 3595 gAAUGCUGAGAUCGAAGAU 0.6% 8427 3598 CCAGGAAAUGCUGAaAUuG 0.2%

8380 3595 gAAUGCUGAGAUuGAAGAU 0.1% 8428 3598 CCAGGgAAUGCUGAGuUCG 0.4%

8381 3595 AAAcGCUGAGAUCGAAGAU 0.2% 8429 3598 CCAGGgAAUGCUGAaAUuG 0.1%

8382 3595 AAAUaCUGAGAUCGAAGAU 0.1% 8430 3598 CCuGGgAAUGCUGAGAUuG 0.3%

8383 3595 AAAUGCUGAaAUCGAAGAU 0.9% 8431 3598 CCAGGAAAUGCUGAGuaCG 0.1%

8384 3595 AAAUGCUGAGAUaGAAGAU 0.1% 8432 3598 CCAGGAAAcGCUGAaAUuG 0.5%

8385 3595 AAAUGCUGAGUUCGAAGAU 0.1% 3599 3599 AGGAAAUGCUGAGAUCGAA 91.1%

8386 3595 AAAcGCUGAGAUuGAAGAU 0.2% 8433 3599 AGGAAAUGCUGAGAUUGAA 1.2%

8387 3595 AAAUGCUGAaAUuGAAGAU 0.2% 8434 3599 AGGgAAUGCUGAGAUCGAA 0.6%

8388 3595 gAAUGCUGAGuUCGAAGAU 0.4% 8435 3599 AaGAAAUGCUGAGAUCGAA 0.1%

8389 3595 gAAUGCUGAaAUuGAAGAϋ 3.9% 8436 3599 AGGAAAcGCUGAGAUCGAA 0.2%

8390 3595 gAAUGCUGAGAUuGAAGAc 0.2% 8437 3599 AGGAAAUaCUGAGAUCGAA 0.1%

8391 3595 AAAUGCUGAGuaCGAAGAU 0.1% 8438 3599 AGGAAAUGCUGAaAUCGAA 0.9%

8392 3595 AAAcGCUGAaAUuGAAGAU 0.5% 8439 3599 AGGAAAUGCUGAGAUaGAA 0.1%

3596 3596 GAAAUGCUGAGAUCGAAGA 91.2% 8440 3599 AGGAAAUGCUGAGuϋCGAA 0.1%

8393 3596 GAAAUGCUGAGAUUGAAGA 1.2% 8441 3599 AGGAAAcGCUGAGAUuGAA 0.2%

8394 3596 GgAAUGCUGAGAUCGAAGA 0.6% 8442 3599 AGGAAAUGCUGAaAUuGAA 0.2%

8395 3596 GgAAUGCUGAGAUuGAAGA 0.3% 8443 3599 AGGgAAUGCUGAGuUCGAA 0.4%

8396 3596 GAAAcGCUGAGAUCGAAGA 0.2% 8444 3599 AGGgAAUGCUGAaAUuGAA 0.1%

8397 3596 GAAAUaCUGAGAUCGAAGA 0.1% 8445 3599 UGGgAAUGCUGAGAUuGAA 0.3%

8398 3596 GAAAUGCUGAaAUCGAAGA 0.9% 8446 3599 AGGAAAUGCUGAGuaCGAA 0.1%

8399 3596 GAAAUGCUGAGAUaGAAGA 0.1% 8447 3599 AGGAAAcGCUGAaAUuGAA 0.5%

8400 3596 GAAAUGCUGAGUUCGAAGA 0.1% 3600 3600 CAGGAAAUGCUGAGAUCGA 91.1%

8401 3596 GAAAcGCUGAGAUuGAAGA 0.2% 8448 3600 CAGGAAAUGCUGAGAUUGA 1.2%

8402 3596 GAAAUGCUGAaAUuGAAGA 0.2% 8449 3600 CAGGgAAUGCUGAGAUCGA 0.6%

8403 3596 GgAAUGCUGAGuUCGAAGA 0.4% 8450 3600 CAaGAAAUGCUGAGAUCGA 0.1%

8404 3596 GgAAUGCUGAaAUuGAAGA 3.9% 8451 3600 CAGGAAAcGCUGAGAUCGA 0.2%

8405 3596 GAAAUGCUGAGuaCGAAGA 0.1% 8452 3600 CAGGAAAUaCUGAGAUCGA 0.1%

8406 3596 GAAAcGCUGAaAUuGAAGA 0.5% 8453 3600 CAGGAAAUGCUGAaAUCGA 0.9%

3597 3597 GAACCCAGGAAAUGCUGAG 91.2% 8454 3600 CAGGAAAUGCUGAGAUaGA 0.1%

8407 3597 GAACCCAGGgAAUGCUGAG 0.6% 8455 3600 CAGGAAAUGCUGAGUUCGA 0.1%

8408 3597 GAAuCCAGGAAAUGCUGAG 1.1% 8456 3600 CAGGAAAcGCUGAGAUuGA 0.2%

8409 3597 aAACCCAGGAAAUGCUGAG 0.3% 8457 3600 CAGGAAAUGCUGAaAUuGA 0.2%

8410 3597 GAACCCAaGAAAUGCUGAG 0.1% 8458 3600 CAGGgAAUGCUGAGuUCGA 0.4%

8411 3597 GAACCCAGGAAACGCUGAG 0.4% 8459 3600 CAGGgAAUGCUGAaAUuGA 0.1%

8412 3597 GAACCCAGGAAAUaCUGAG 0.1% 8460 3600 CuGGgAAUGCUGAGAUuGA 0.3%

8413 3597 GAACCCAGGAAAUGCUGAa 1.0% 8461 3600 CAGGAAAUGCUGAGuaCGA 0.1%

8414 3597 aAACCCAGGgAAUGCUGAG 0.4% 8462 3600 CAGGAAAcGCUGAaAUuGA 0.5%

8415 3597 GAAuCCAGGAAAUGCUGAa 0.1% 3601 3601 GGAAAUGCUGAGAUCGAAG 91.1%

8416 3597 aAACCCAGGAAAUGCUGAa 0.1% 8463 3601 GGAAAUGCUGAGAUUGAAG 1.2%

8417 3597 GAACCCAGGAAAcGCUGAa 0.5% 8464 3601 GGgAAUGCUGAGAUCGAAG 0.6%

8465 3601 GGgAAUGCUGAGAUuGAAG 0.3% 8512 3605 GgAAAUUUCAAgCAGCUGC 0.1%

8466 3601 aGAAAUGCUGAGAUCGAAG 0.1% 8513 3605 GgAAAUUUCAgACAGCUGC 0.3%

8467 3601 GGAAAcGCUGAGAUCGAAG 0.2% 8514 3605 GAAAAcUUCAAACAGCUGC 0.2%

8468 3601 GGAAAUaCUGAGAUCGAAG 0.1% 8515 3605 GAAAAUUcCAAACAGCUGC 0.1%

8469 3601 GGAAAUGCUGAaAUCGAAG 0.9% 8516 3605 GAAAAUUUCAAACUGCUGC 0.1%

8470 3601 GGAAAUGCUGAGAUaGAAG 0.1% 8517 3605 GuAAAUUUCAAACAGCUGC 0.1%

8471 3601 GGAAAUGCUGAGUUCGAAG 0.1% 8518 3605 GgAAAUUUCAAACAGCaGC 2.1%

8472 3601 GGAAAcGCUGAGAUuGAAG 0.2% 8519 3605 GgAAAUUUCAAACuGCUGC 0.4%

8473 3601 GGAAAUGCUGAaAUuGAAG 0.2% 8520 3605 GgAAAUUUCAAACgGCaGC 0.1%

8474 3601 GGgAAUGCUGAGuUCGAAG 0.4% 8521 3605 GgAAAUUcCAAACAGCaGC 1.9%

8475 3601 GGgAAUGCUGAaAUuGAAG 3.9% 3606 3606 GGAAAAUUUCAAACAGCUG 90.1%

8476 3601 GGAAAUGCUGAGuaCGAAG 0.1% 8522 3606 GGAAAAUUUCAAACgGCUG 0.4%

8477 3601 GGAAAcGCUGAaAUuGAAG 0.5% 8523 3606 GGAAAAUUUCAgACAGCUG 0.3%

3602 3602 UGGAUCAAGUGAGAGAAAG 90.7% 8524 3606 GGgAAAUUUCAAACAGCUG 3.7%

8478 3602 UGGAUCAAGUGAGAGAgAG 0.6% 8525 3606 GGAAAgUUUCAgACAGCUG 0.1%

8479 3602 UGGAUCAAGUGAGgGAAAG 0.1% 8526 3606 GGgAAAUUUCAAgCAGCUG 0.1%

8480 3602 UGGAUCAgGUGAGAGAAAG 0.2% 8527 3606 GGgAAAUUUCAgACAGCUG 0.3%

8481 3602 UaGAUCAAGUGAGAGAAAG 0.1% 8528 3606 GGAAAAcUUCAAACAGCUG 0.2%

8482 3602 UGGAcCAAGUGAGAGAAAG 2.6% 8529 3606 GGAAAAUUcCAAACAGCUG 0.1%

8483 3602 UGGAUCAAGUaAGAGAAAG 1.5% 8530 3606 GGAAAAUUUCAAACUGCUG 0.1%

8484 3602 UGGAUCcAGUGAGAGAAAG 0.1% 8531 3606 GGuAAAUUUCAAACAGCUG 0.1%

8485 3602 UGGAUCAAGUGcGAGAgAG 1.6% 8532 3606 GGgAAAUUUCAAACAGCaG 2.1%

8486 3602 UGGAUCAgGUGcGAGAAAG 2.1% 8533 3606 GGgAAAUUUCAAACuGCUG 0.4%

8487 3602 UGGAcCAgGUGAGgGAAAG 0.1% 8534 3606 GGgAAAUUUCAAACgGCaG 0.1%

8488 3602 UGGAcCAAGUGcGAGAgAG 0.1% 8535 3606 GGgAAAUUcCAAACAGCaG 1.9%

8489 3602 UGGAcCAgGUGcGAGAAAG 0.1% 3607 3607 AAAGGAAAAUUUCAAACAG 90.0%

3603 3603 UAUGAGAGAAUGUGCAACA 90.4% 8536 3607 AAAGGAAAAUUUCAAACgG 0.4%

8490 3603 UAUGAGAGAAUGUGCAACg 0.1% 8537 3607 AAAGGAAAAUUUCAgACAG 0.3%

8491 3603 UAUGAGAGAAUGUGCAAUA 0.5% 8538 3607 AAAGGgAAAUUUCAAACAG 5.8%

8492 3603 UAUGAGAGgAUGUGCAACA 0.4% 8539 3607 AAgGGAAAAUUUCAAACAG 0.1%

8493 3603 UAcGAGAGAAUGUGCAACA 6.6% 8540 3607 AAAGGAAAgUUUCAgACAG 0.1%

8494 3603 UAUGAaAGAAUGUGCAACA 1.1% 8541 3607 AAAGGgAAAUUUCAAACgG 0.1%

8495 3603 UAUGAuAGAAUGUGCAACA 0.3% 8542 3607 AAAGGgAAAUUUCAAgCAG 0.1%

8496 3603 UAcGAGAGgAUGUGCAACA 0.2% 8543 3607 AAAGGgAAAUUUCAgACAG 0.3%

8497 3603 UAUGAaAGAAUGUGCAAuA 0.4% 8544 3607 AAAGGAAAAcUUCAAACAG 0.2%

8498 3603 UAcGAaAGAAUGUGCAACA 0.1% 8545 3607 AAAGGAAAAUUCCAAACAG 0.1%

3604 3604 AUGAGAGAAUGUGCAACAU 90.4% 8546 3607 AAAGGAAAAUUUCAAACUG 0.1%

8499 3604 AUGAGAGAAUGUGCAACgU 0.1% 8547 3607 AΆΆGGUAAAUUUCAAΆCAG 0.1%

8500 3604 AUGAGAGAAUGUGCAAUAU 0.5% 8548 3607 AAAGGgAAAUUcCAAACAG 1.9%

8501 3604 AUGAGAGgAUGUGCAACAU 0.4% 8549 3607 AAAGGgAAAUUUCAAACuG 0.4%

8502 3604 AcGAGAGAAUGUGCAACAU 6.6% 8550 3607 AAAGGgAAAUUUCAgACcG 0.1%

8503 3604 AUGAaAGAAUGUGCAACAU 1.1% 3608 3608 AGGAAAAUUUCAAACAGCU 90.0%

8504 3604 AUGAuAGAAUGUGCAACAU 0.3% 8551 3608 AGGAAAAUUUCAAACgGCU 0.4%

8505 3604 AcGAGAGgAUGUGCAACAU 0.2% 8552 3608 AGGAAAAUUUCAgACAGCU 0.3%

8506 3604 AUGAaAGAAUGUGCAAuAU 0.4% 8553 3608 AGGgAAAUUUCAAACAGCU 3.7%

8507 3604 AcGAaAGAAUGUGCAACAU 0.1% 8554 3608 gGGAAAAUUUCAAACAGCU 0.1%

3605 3605 GAAAAUUUCAAACAGCUGC 90.1% 8555 3608 AGGAAAgUUUCAgACAGCU 0.1%

8508 3605 GAAAAUUUCAAACgGCUGC 0.4% 8556 3608 AGGgAAAUUUCAAgCAGCU 0.1%

8509 3605 GAAAAUUUCAgACAGCUGC 0.3% 8557 3608 AGGgAAAUUUCAgACAGCU 0.3%

8510 3605 GgAAAUUUCAAACAGCUGC 3.7% 8558 3608 AGGAAAAcUUCAAACAGCU 0.2%

8511 3605 GAAAgUUUCAgACAGCUGC 0.1% 8559 3608 AGGAAAAUUcCAAACAGCU 0.1%

8560 3608 AGGAAAAUUUCAAACuGCU 0.1% 8578 3609 AAGGgAAAUUcCAAACAGC 1.9%

8561 3608 AGGuAAAUUUCAAACAGCU 0.1% 8579 3609 AAGGgAAAUUUCAAACuGC 0.4%

8562 3608 AGGgAAAUUUCAAACAGCa 2.1% 8580 3609 AAGGgAAAUUUCAgACcGC 0.1%

8563 3608 AGGgAAAUUUCAAACuGCU 0.4% 3610 3610 CCCAGGAAAUGCUGAGAUC 90.0%

8564 3608 AGGgAAAUUUCAAACgGCa 0.1% 8581 3610 CCCAGGAAAUGCUGAGAUu 1.2%

8565 3608 AGGgAAAUUcCAAACAGCa 1.9% 8582 3610 CCCAGGgAAUGCUGAGAUC 0.6% 3609 3609 AAGGAAAAUUUCAAACAGC 90.0% 8583 3610 uCCAGGAAAUGCUGAGAUC 1.0%

8566 3609 AAGGAAAAUUUCAAACgGC 0.4% 8584 3610 CCCAaGAAAUGCUGAGAUC 0.1%

8567 3609 AAGGAAAAUUUCAgACAGC 0.3% 8585 3610 CCCAGGAAAcGCUGAGAUC 0.2%

8568 3609 AAGGgAAAUUUCAAACAGC 5.8% 8586 3610 CCCAGGAAAUaCUGAGAUC 0.1%

8569 3609 AgGGAAAAUUUCAAACAGC 0.1% 8587 3610 CCCAGGAAAUGCUGAaAUC 0.9%

8570 3609 AAGGAAAgUUUCAgACAGC 0.1% 8588 3610 CCCAGGAAAUGCUGAGAUa 0.1%

8571 3609 AAGGgAAAUUUCAAACgGC 0.1% 8589 3610 CCCAGGAAAcGCUGAGAUu 0.2%

8572 3609 AAGGgAAAUUUCAAgCAGC 0.1% 8590 3610 CCCAGGAAAUGCUGAaAUu 0.1%

8573 3609 AAGGgAAAUUUCAgACAGC 0.3% 8591 3610 CCCAGGgAAUGCUGAGuUC 0.4%

8574 3609 AAGGAAAAcUUCAAACAGC 0.2% 8592 3610 uCCAGGAAAUGCUGAGuUC 0.1%

8575 3609 AAGGAAAAUUcCAAACAGC 0.1% 8593 3610 UCCAGGAAAUGCUGAaAUu 0.1%

8576 3609 AAGGAAAAUUUCAAACuGC 0.1% 8594 3610 CCCAGGAAAUGCUGAGuaC 0.1%

8577 3609 AAGGuAAAUUUCAAACAGC 0.1% 8595 3610 CCCAGGAAAcGCUGAaAUu 0.5%

Table 20-5: Conserved and minor variant 19-mer sequences from the Influenza A segment 7 (Ml & M2) sequences listed in Table 1-7. The conserved sequences match at least 89% of the listed viral sequences, and the variants contain 3 or fewer nucleotide changes from the reference sequence.

SeqID RefID Match Seq % Total 8619 4432 CGCUCACCGUGCCUAGUGA 0.1%

4427 4427 UUCACGCUCACCGUGCCCA 99.2% 8620 4432 CGCUCACuGUGCCCAGUGA 0.2%

8596 4427 UUCACGCUCACCGUGCCUA 0.1% 8621 4432 CaCUCACCGUGCCCAGUGA 0.2%

8597 4427 UUCACGCUCACUGUGCCCA 0.2% 8622 4432 CGCUCACCGUGCCaAGUGA 0.2%

8598 4427 UUCACaCUCACCGUGCCCA 0.2% 8623 4432 CGCUCACCGUGCCCACUGA 0.3%

8599 4427 UUCACGCUCACCGUGCCaA 0.2% 8624 4432 CGCUCACCGUGCCCAGCGA 0.1%

4428 4428 CAGGCCCCCUCAAAGCCGA 99.0% 8625 4432 CGCUCACCGUGCCCAGUaA 0.1%

8600 4428 CgGGCCCCCUCAAAGCCGA 0.2% 4433 4433 ACGCUCACCGUGCCCAGUG 98.8%

8601 4428 uAGGCCCCCUCAAAGCCGA 0.1% 8626 4433 ACGCUCACCGUGCCUAGUG 0.1%

8602 4428 CAGGCCCCCUCAAAGCCCA 0.5% 8627 4433 ACGCUCACuGUGCCCAGUG 0.2%

8603 4428 CcGGCCCCCUCAAAGCCGA 0.2% 8628 4433 ACaCUCACCGUGCCCAGUG 0.2%

4429 4429 UCAGGCCCCCϋCAAAGCCG 99.0% 8629 4433 ACGCUCACCGUGCCaAGUG 0.2%

8604 4429 UCgGGCCCCCUCAAAGCCG 0.2% 8630 4433 ACGCUCACCGUGCCCACUG 0.3%

8605 4429 UuAGGCCCCCUCAAAGCCG 0.1% 8631 4433 ACGCUCACCGUGCCCAGCG 0.1%

8606 4429 UCAGGCCCCCUCAAAGCCc 0.5% 8632 4433 ACGCUCACCGUGCCCAGUa 0.1%

8607 4429 UCcGGCCCCCUCAAAGCCG 0.2% 4434 4434 UCACCGUGCCCAGUGAGCG 98.7%

4430 4430 UCACGCUCACCGUGCCCAG 98.9% 8633 4434 UCACCGUGCCUAGUGAGCG 0.1%

8608 4430 UCACGCUCACCGUGCCUAG 0.1% 8634 4434 UCACuGUGCCCAGUGAGCG 0.2%

8609 4430 UCACGCUCACUGUGCCCAG 0.2% 8635 4434 UCACCGUGCCaAGUGAGCG 0.2%

8610 4430 UCACaCUCACCGUGCCCAG 0.2% 8636 4434 UCACCGUGCCCACUGAGCG 0.3%

8611 4430 UCACGCUCACCGUGCCaAG 0.2% 8637 4434 UCACCGUGCCCAGCGAGCG 0.1%

8612 4430 UCACGCUCACCGUGCCCAc 0.3% 8638 4434 UCACCGUGCCCAGUaAGCG 0.1%

4431 4431 CACGCUCACCGUGCCCAGU 98.8% 8639 4434 UCACCGUGCCCAGUGAaCG 0.3%

8613 4431 CACGCUCACCGUGCCUAGU 0.1% 4435 4435 CUCACCGUGCCCAGUGAGC 98.7%

8614 4431 CACGCUCACuGUGCCCAGU 0.2% 8640 4435 CUCACCGUGCCUAGUGAGC 0.1%

8615 4431 CACaCUCACCGUGCCCAGU 0.2% 8641 4435 CUCACuGUGCCCAGUGAGC 0.2%

8616 4431 CACGCUCACCGUGCCaAGU 0.2% 8642 4435 CUCACCGUGCCaAGUGAGC 0.2%

8617 4431 CACGCUCACCGUGCCCAcU 0.3% 8643 4435 CUCACCGUGCCCACUGAGC 0.3%

8618 4431 CACGCUCACCGUGCCCAGc 0.1% 8644 4435 CUCACCGUGCCCAGCGAGC 0.1%

4432 4432 CGCUCACCGUGCCCAGUGA 98.8% 8645 4435 CUCACCGUGCCCAGUaAGC 0.1%

8646 4435 CUCACCGUGCCCAGUGAaC 0.3% 8695 4442 GGACUGCAGCGUAGACGuϋ 0.2%

4436 4436 GCUCACCGUGCCCAGUGAG 98.5% 8696 4442 GGACUGCAaCGUAGACGCU 1.7%

8647 4436 GCUCACCGUGCCUAGUGAG 0.1% 8697 4442 GGACUGCAGCGUAaACGCU 0.1%

8648 4436 GCUCACuGUGCCCAGUGAG 0.2% 8698 4442 GGACUGCAGCGUACACGCU 0.1%

8649 4436 aCUCACCGUGCCCAGUGAG 0.2% 8699 4442 GGACUGCAGCGUAGACGaU 1.5%

8650 4436 GCUCACCGUGCCaAGUGAG 0.2% 8700 4442 GGACUGCAGCGUAGACGgU 0.3%

8651 4436 GCUCACCGUGCCCAcUGAG 0.3% 8701 4442 GGACUGCAGCGUAUACGCU 0.2%

8652 4436 GCUCACCGUGCCCAGCGAG 0.1% 8702 4442 GGACUGCAaCGUAuACGCU 0.1%

8653 4436 GCUCACCGUGCCCAGUaAG 0.1% 8703 4442 GGACUGCAGCGUAuACGaU 0.1%

8654 4436 GCUCACCGUGCCCAGUGAa 0.3% 4443 4443 UGCAGCGUAGACGCUUUGU 95.5%

4437 4437 AUCUUGAGGCUCUCAUGGA 96.4% 8704 4443 UGCAGCGUAGACGUUUUGU 0.2%

8655 4437 AUCUUGAGGuUCUCAUGGA 0.9% 8705 4443 UGCAaCGUAGACGCUUUGU 1.7%

8656 4437 AcCUUGAGGCUCUCAUGGA 0.3% 8706 4443 UGCAGCGUAaACGCUUUGU 0.1%

8657 4437 AUCUaGAGGCUCUCAUGGA 0.1% 8707 4443 UGCAGCGUAcACGCUUUGU 0.1%

8658 4437 AUCUcGAGGCUCUCAUGGA 1.6% 8708 4443 UGCAGCGUAGACGaUUUGU 1.5%

8659 4437 AUCUUGAaGCUCUCAUGGA 0.1% 8709 4443 UGCAGCGUAGACGCUUCGU 0.1%

8660 4437 AUCUUGAGGCaCUCAUGGA 0.3% 8710 4443 UGCAGCGUAGACGgUUUGU 0.3%

8661 4437 AUCUUGAGGCUCUaAUGGA 0.1% 8711 4443 UGCAGCGUAuACGCUUUGU 0.2%

8662 4437 AUCUUGAGGCUCUCACGGA 0.1% 8712 4443 UGCAaCGUAuACGCUUUGU 0.1%

8663 4437 AUCUUGAGGuUCUCAcGGA 0.1% 8713 4443 UGCAGCGUAuACGaUUUGU 0.1%

4438 4438 GAUCUUGAGGCUCUCAUGG 96.4% 4444 4444 ACUGCAGCGUAGACGCUUU 95.5%

8664 4438 GAUCUUGAGGUUCUCAUGG 0.9% 8714 4444 ACUGCAGCGUAGACGUUUU 0.2%

8665 4438 GAcCUUGAGGCUCUCAUGG 0.3% 8715 4444 ACUGCAaCGUAGACGCUUU 1.7%

8666 4438 GAUCUaGAGGCUCUCAUGG 0.1% 8716 4444 ACUGCAGCGUAaACGCUUU 0.1%

8667 4438 GAUCUcGAGGCUCUCAUGG 1.6% 8717 4444 ACUGCAGCGUACACGCUUU 0.1%

8668 4438 GAUCUUGAaGCUCUCAUGG 0.1% 8718 4444 ACUGCAGCGUAGACGaUUU 1.5%

8669 4438 GAUCUUGAGGCaCUCAUGG 0.3% 8719 4444 ACUGCAGCGUAGACGCUUc 0.1%

8670 4438 GAUCUUGAGGCUCUaAUGG 0.1% 8720 4444 ACUGCAGCGUAGACGgUUU 0.3%

8671 4438 GAUCUUGAGGCUCUCACGG 0.1% 8721 4444 ACUGCAGCGUAuACGCUUU 0.2%

8672 4438 GAUCUUGAGGUUCUCACGG 0.1% 8722 4444 ACUGCAaCGUAuACGCUUU 0.1%

4439 4439 UGUCACCUCUGACUAAGGG 96.3% 8723 4444 ACUGCAGCGUAuACGaUUU 0.1%

8673 4439 UGUCACCUCUGACUAgGGG 0.1% 4445 4445 CUGCAGCGUAGACGCUUUG 95.5%

8674 4439 UGUCACCUCUaACUAAGGG 0.9% 8724 4445 CUGCAGCGUAGACGUUUUG 0.2%

8675 4439 UGUCACCUCUGACaAAGGG 0.4% 8725 4445 CUGCAaCGUAGACGCUUUG 1.7%

8676 4439 UGUCACCUCUGACCAAGGG 0.3% 8726 4445 CUGCAGCGUAaACGCUUUG 0.1%

8677 4439 UGUCACCUCUGACUAAaGG 2.0% 8727 4445 CUGCAGCGUACACGCUUUG 0.1%

8678 4439 UGUCACCUCUUACUAAGGG 0.1% 8728 4445 CUGCAGCGUAGACGaUUUG 1.5%

4440 4440 CUGUCACCUCUGACUAAGG 96.3% 8729 4445 CUGCAGCGUAGACGCUUCG 0.1%

8679 4440 CUGUCACCUCUGACUAgGG 0.1% 8730 4445 CUGCAGCGUAGACGgUUUG 0.3%

8680 4440 CUGUCACCUCUaACUAAGG 0.9% 8731 4445 CUGCAGCGUAUACGCUUUG 0.2%

8681 4440 CUGUCACCUCUGACaAAGG 0.4% 8732 4445 CUGCAaCGUAuACGCUUUG 0.1%

8682 4440 CUGUCACCUCUGACCAAGG 0.3% 8733 4445 CUGCAGCGUAuACGaUUUG 0.1%

8683 4440 CUGUCACCUCUGACUAAaG 1.9% 4446 4446 GCUAUGGAGCAAAUGGCUG 95.4%

8684 4440 CUGUCACCUCUUACUAAGG 0.1% 8734 4446 GCcAUGGAGCAAAUGGCUG 0.4%

8685 4440 uUGUCACCUCUGACUAAaG 0.1% 8735 4446 GCUAUGGAaCAAAUGGCUG 0.4%

4441 4441 GACUGCAGCGUAGACGCUU 95.6% 8736 4446 GCUAUGGAGCACAUGGCUG 0.1%

8686 4441 GACUGCAGCGUAGACGUUU 0.2% 8737 4446 GCcAUGGAGCAgAUGGCUG 1.5%

8687 4441 GACUGCAaCGUAGACGCUU 1.7% 8738 4446 GCUAUGGAGCAgAUGGCaG 0.3%

8688 4441 GACUGCAGCGUAaACGCUU 0.1% 8739 4446 GCUAUGGAGCAgAUGGCgG 1.3%

8689 4441 GACUGCAGCGUACACGCUU 0.1% 8740 4446 GCUAUGGAaCAgAUGGuUG 0.1%

8690 4441 GACUGCAGCGUAGACGaUU 1.5% 8741 4446 GCcAUGGAaCAAAUGGCUG 0.4%

8691 4441 GACUGCAGCGUAGACGgUU 0.3% 8742 4446 GCcAUGGAGCAgAUGGCaG 0.1%

8692 4441 GACUGCAGCGUAUACGCUU 0.2% 4447 4447 UGGCUAAAGACAAGACCAA 95.4%

8693 4441 GACUGCAaCGUAuACGCUU 0.1% 8743 4447 UGGCUAAAGACAAGACCgA 3.5%

8694 4441 GACUGCAGCGUAuACGaUU 0.1% 8744 4447 UGGCUAAAGACAAGgCCAA 0.4%

4442 4442 GGACUGCAGCGUAGACGCU 95.6% 8745 4447 UGGCUgAAGACAAGACCAA 0.1%

8746 4447 UGGuUAAAGACAAGACCgA 0.1% 8797 4452 GGCUAAAGACAAGgCCAAU 0.4%

8747 4447 UGGCOAAAGACAAaACCAA 0.2% 8798 4452 GGCUgAAGACAAGACCAAU 0.1%

8748 4447 UGGCUAAAGACAAGACCCA 0.1% 8799 4452 GGuUAAAGACAAGACCgAU 0.1%

8749 4447 UGGCUAAAGACAAGACCUA 0.1% 8800 4452 GGCUAAAGACAAaACCAAU 0.2%

8750 4447 UGGCUAAAGACUAGACCUA 0.2% 8801 4452 GGCUAAAGACAAGACCCAU 0.1%

4448 4448 CAGCGUAGACGCUUUGUCC 95.4% 8802 4452 GGCUAAAGACAAGACCUAU 0.1%

8751 4448 CAGCGUAGACGCUUUGUUC 0.1% 8803 4452 GGCUAAAGACUAGACCUAU 0.2%

8752 4448 CAGCGUAGACGUUUUGUCC 0.2% 4453 4453 CGUAGACGCUUUGUCCAAA 94.9%

8753 4448 CAaCGUAGACGCUUUGUCC 1.5% 8804 4453 CGUAGACGCUUUGUCCAgA 2.0%

8754 4448 CAGCGUAaACGCUUUGUCC 0.1% 8805 4453 CGUAGACGCUUUGUUCAAA 0.3%

8755 4448 CAGCGUAcACGCUUUGUCC 0.1% 8806 4453 CGUAGACGuUUUGUCCAAA 0.2%

8756 4448 CAGCGUAGACGaUUUGUCC 1.5% 8807 4453 CGUAcACGCUUUGUCCAAA 0.1%

8757 4448 CAGCGUAGACGCDUCGUCC 0.1% 8808 4453 CGUAGACGaUUUGUCCAAA 1.5%

8758 4448 CAGCGUAGACGgUUUGUCC 0.3% 8809 4453 CGUAGACGCUUCGUCCAAA 0.1%

8759 4448 CAGCGUAuACGCUUUGUCC 0.2% 8810 4453 CGUAGACGgUUUGUCCAAA 0.3%

8760 4448 CAaCGUAGACGCUUUGUuC 0.2% 8811 4453 CGUAuACGCUUUGUCCAAA 0.3%

8761 4448 CAaCGUAuACGCUUUGUCC 0.1% 8812 4453 CGUAaACGCUUUGUCCAgA 0.1%

8762 4448 CAGCGUAuACGaUUUGUCC 0.1% 8813 4453 CGUAcACuCUUUGUCCAAA 0.1%

4449 4449 CUAUGGAGCAAAUGGCUGG 95.4% 8814 4453 CGUAuACGaUUUGUCCAAA 0.1%

8763 4449 CcAUGGAGCAAAUGGCUGG 0.4% 4454 4454 GUAGACGCUUUGUCCAAAA 94.9%

8764 4449 CUAUGGAaCAAAUGGCUGG 0.4% 8815 4454 GUAGACGCUUUGUCCAgAA 2.0%

8765 4449 CUAUGGAGCACAUGGCUGG 0.1% 8816 4454 GUAGACGCUUUGUUCAAAA 0.3%

8766 4449 CcAUGGAGCAgAUGGCUGG 1.5% 8817 4454 GUAGACGuUUUGUCCAAAA 0.2%

8767 4449 CUAUGGAGCAgAUGGCaGG 0.3% 8818 4454 GUAcACGCUUUGUCCAAAA 0.1%

8768 4449 CUAUGGAGCAgAUGGCgGG 1.3% 8819 4454 GUAGACGaUUUGUCCAAAA 1.5%

8769 4449 CUAUGGAaCAgAUGGuUGG 0.1% 8820 4454 GUAGACGCUUCGUCCAAAA 0.1%

8770 4449 CcAUGGAaCAAAUGGCUGG 0.4% 8821 4454 GUAGACGgUUUGUCCAAAA 0.3%

8771 4449 CcAUGGAGCAgAUGGCaGG 0.1% 8822 4454 GUAuACGCUUUGUCCAAAA 0.3%

4450 4450 AGCGUAGACGCUUUGUCCA 95.4% 8823 4454 GUAaACGCUUUGUCCAgAA 0.1%

8772 4450 AGCGUAGACGCUUUGUUCA 0.1% 8824 4454 GUAcACuCUUUGUCCAAAA 0.1%

8773 4450 AGCGUAGACGUUUUGUCCA 0.2% 8825 4454 GUAuACGaUUUGUCCAAAA 0.1%

8774 4450 AaCGUAGACGCUUUGUCCA 1.5% 4455 4455 AGACGCUUUGUCCAAAAUG 94.8%

8775 4450 AGCGUAaACGCUUUGUCCA 0.1% 8826 4455 AGACGCUUUGUCCAgAAUG 1.3%

8776 4450 AGCGUAcACGCUUUGUCCA 0.1% 8827 4455 AGACGCUUUGUuCAAAAUG 0.3%

8777 4450 AGCGUAGACGaUUUGUCCA 1.5% 8828 4455 AGACGuUUUGUCCAAAAUG 0.2%

8778 4450 AGCGUAGACGCUUCGUCCA 0.1% 8829 4455 AcACGCUUUGUCCAAAAUG 0.1%

8779 4450 AGCGUAGACGgUUUGUCCA 0.3% 8830 4455 AGACGaUUUGUCCAAAAUG 1.5%

8780 4450 AGCGUAuACGCUUUGUCCA 0.2% 8831 4455 AGACGCUUcGUCCAAAAUG 0.1%

8781 4450 AaCGUAGACGCUUUGUuCA 0.2% 8832 4455 AGACGCUUUGUCCAAAACG 0.1%

8782 4450 AaCGUAuACGCUUUGUCCA 0.1% 8833 4455 AGACGgUUUGUCCAAAAUG 0.3%

8783 4450 AGCGUAuACGaUUUGUCCA 0.1% 8834 4455 AuACGCUUUGUCCAAAAUG 0.3%

4451 4451 GCAGCGUAGACGCUUUGUC 95.4% 8835 4455 AaACGCUUUGUCCAgAAUG 0.1%

8784 4451 GCAGCGUAGACGCUUUGUU 0.1% 8836 4455 AGACGCUUUGUCCAgAAcG 0.7%

8785 4451 GCAGCGUAGACGUUUUGUC 0.2% 8837 4455 AcACuCUUUGUCCAAAAUG 0.1%

8786 4451 GCAaCGUAGACGCUUUGUC 1.5% 8838 4455 AuACGaUUUGUCCAAAAUG 0.1%

8787 4451 GCAGCGUAaACGCUUUGUC 0.1% 4456 4456 UAGACGCUUUGUCCAAAAU 94.8%

8788 4451 GCAGCGUAcACGCUUUGUC 0.1% 8839 4456 UAGACGCUUUGUCCAgAAU 1.3%

8789 4451 GCAGCGUAGACGaUUUGUC 1.5% 8840 4456 UAGACGCUUUGUUCAAAAU 0.3%

8790 4451 GCAGCGUAGACGCUUCGUC 0.1% 8841 4456 UAGACGuUUUGUCCAAAAU 0.2%

8791 4451 GCAGCGUAGACGgUUUGUC 0.3% 8842 4456 UAcACGCUUUGUCCAAAAU 0.1%

8792 4451 GCAGCGUAuACGCUUUGUC 0.2% 8843 4456 UAGACGaUUUGUCCAAAAU 1.5%

8793 4451 GCAaCGUAGACGCUUUGUu 0.2% 8844 4456 UAGACGCUUcGUCCAAAAU 0.1%

8794 4451 GCAaCGUAuACGCUUUGUC 0.1% 8845 4456 UAGACGCUUUGUCCAAAAc 0.1%

8795 4451 GCAGCGUAuACGaUUUGUC 0.1% 8846 4456 UAGACGgUUUGUCCAAAAU 0.3%

4452 4452 GGCUAAAGACAAGACCAAU 95.4% 8847 4456 UAuACGCUUUGUCCAAAAU 0.3%

8796 4452 GGCUAAAGACAAGACCgAU 3.5% 8848 4456 UAaACGCUUUGUCCAgAAU 0.1%

8849 4456 UAGACGCUUUGUCCAgAAc 0.7% 8901 4460 CGCDUUGUCCAgAAcGCCC 0.7%

8850 4456 UAcACuCUUUGUCCAAAAU 0.1% 4461 4461 UCAGUUAUUCUGCUGGUGC 94.2%

8851 4456 UAuACGaUUUGUCCAAAAU 0.1% 8902 4461 UCgGUUAUUCUGCUGGUGC 0.1%

4457 4457 GACGCUUUGUCCAAAAUGC 94.7% 8903 4461 UuAGUUAUUCUGCUGGUGC 0.5%

8852 4457 GACGCUUUGUCCAAAAUGU 0.1% 8904 4461 UCAGcUAUUCUGCUGGUGC 0.2%

8853 4457 GACGCUUUGUCCAgAAUGC 1.3% 8905 4461 UCAGUUAcUCUGCUGGUGC 1.0%

8854 4457 GACGCUUUGUUCAAAAUGC 0.3% 8906 4461 UCAGUUAUUCCGCUGGUGC 0.1%

8855 4457 GACGuUUUGUCCAAAAUGC 0.2% 8907 4461 UCAGUUAUUCgGCUGGUGC 0.1%

8856 4457 cACGCUUUGUCCAAAAUGC 0.1% 8908 4461 UCAGUUAUUCUGCCGGUGC 0.2%

8857 4457 GACGaUUUGUCCAAAAUGC 1.5% 8909 4461 UCAGUUAUUCUGCUGGCGC 0.1%

8858 4457 GACGCUUcGUCCAAAAUGC 0.1% 4462 4462 AUUCUGCUGGUGCACUUGC 94.1%

8859 4457 GACGCUUUGUCCAAAACGC 0.1% 8910 4462 AUUCUGCUGGUGCgCUUGC 0.9%

8860 4457 GACGgUUUGUCCAAAAUGC 0.3% 8911 4462 AcUCUGCUGGUGCACUUGC 1.0%

8861 4457 uACGCUUUGUCCAAAAUGC 0.3% 8912 4462 AUUCcGCUGGUGCACUUGC 0.1%

8862 4457 aACGCUUUGUCCAgAAUGC 0.1% 8913 4462 AUUCgGCUGGUGCACUUGC 0.1%

8863 4457 GACGCUUUGUCCAgAAcGC 0.7% 8914 4462 AUUCUGCcGGUGCACUUGC 0.2%

8864 4457 cACuCUUUGUCCAAAAUGC 0.1% 8915 4462 AUUCUGCUGGCGCACUUGC 0.1%

8865 4457 uACGaϋUUGUCCAAAAUGC 0.1% 4463 4463 CUCAUGGAAUGGCUAAAGA 94.1%

4458 4458 ACGCUUUGUCCAAAAUGCC 94.3% 8916 4463 CUCAUGGAAUGGCUgAAGA 0.1%

8866 4458 ACGCUUUGUCCAAAAUGCU 0.6% 8917 4463 CUCAUGGAAUGGUUAAAGA 0.1%

8867 4458 ACGCUUUGUCCAAAAUGUC 0.1% 8918 4463 CUCAUGGAgUGGCUAAAGA 5.4%

8868 4458 ACGCUUUGUCCAgAAUGCC 1.4% 8919 4463 CUaAUGGAAUGGCUAAAGA 0.1%

8869 4458 ACGCUUUGUuCAAAAUGCC 0.3% 8920 4463 CUCAcGGAAUGGCUAAAGA 0.2%

8870 4458 ACGuUUUGUCCAAAAUGCC 0.2% 4464 4464 UCAUGGAAUGGCUAAAGAC 94.1%

8871 4458 ACGaUUUGUCCAAAAUGCC 1.6% 8921 4464 UCAUGGAAUGGCUgAAGAC 0.1%

8872 4458 ACGCUUcGUCCAAAAUGCC 0.1% 8922 4464 UCAUGGAAUGGUUAAAGAC 0.1%

8873 4458 ACGCUUUGUCCAAAACGCC 0.1% 8923 4464 UCAUGGAgUGGCUAAAGAC 5.4%

8874 4458 ACGCUUUGUCCAAAAUGCa 0.2% 8924 4464 UaAUGGAAUGGCUAAAGAC 0.1%

8875 4458 ACGgUUUGUCCAAAAUGCC 0.3% 8925 4464 UCAcGGAAUGGCUAAAGAC 0.2%

8876 4458 ACuCUUUGUCCAAAAUGCC 0.1% 4465 4465 UAUUCUGCUGGUGCACUUG 94,l°/(

8877 4458 ACGCUUUGUCCAgAAcGCC 0.7% 8926 4465 UAUUCUGCUGGUGCgCUUG 0.9%

4459 4459 GCUUUGUCCAAAAUGCCCU 94.3% 8927 4465 UAcUCUGCUGGUGCACUUG 1.0%

8878 4459 GCUUUGUCCAAAAUGCUCU 0.6% 8928 4465 UAUUCcGCUGGUGCACUUG 0.1%

8879 4459 GCUUUGUCCAAAAUGUCCU 0.1% 8929 4465 UAUUCgGCUGGUGCACUUG 0.1%

8880 4459 GCUUUGUCCAgAAUGCCCU 1.4% 8930 4465 UAUUCUGCcGGUGCACUUG 0.2%

8881 4459 GCUUUGUuCAAAAUGCCCU 0.3% 8931 4465 UAUUCUGCUGGCGCACUUG 0.1%

8882 4459 GuUUUGUCCAAAAUGCCCU 0.2% 4466 4466 AUGGAAUGGCUAAAGACAA 94.O 0 /

8883 4459 GaUUUGUCCAAAAUGCCCU 1.6% 8932 4466 AUGGAAUGGCUgAAGACAA 0.1%

8884 4459 GCUUcGUCCAAAAUGCCCU 0.1% 8933 4466 AUGGAAUGGuUAAAGACAA 0.1%

8885 4459 GCUUUGUCCAAAACGCCCU 0.1% 8934 4466 AUGGAgUGGCUAAAGACAA 5.4%

8886 4459 GCUUUGUCCAAAAUGCaCU 0.2% 8935 4466 AcGGAAUGGCUAAAGACAA 0.2%

8887 4459 GgUUUGUCCAAAAUGCCCU 0.3% 8936 4466 AUGGAAUGGCUAAAGACUA 0.2%

8888 4459 uCUUUGUCCAAAAUGCCCU 0.1% 4467 4467 CUCAGUUAUUCUGCUGGUG 94.0°/(

8889 4459 GCUUUGUCCAgAAcGCCCU 0.7% 8937 4467 CUCgGUUAUUCUGCUGGUG 0.1%

4460 4460 CGCUUUGUCCAAAAUGCCC 94.3% 8938 4467 CUuAGUUAUUCUGCUGGUG 0.5%

8890 4460 CGCUUUGUCCAAAAUGCUC 0.6% 8939 4467 uUCAGUUAUUCUGCUGGUG 0.2%

8891 4460 CGCUUUGUCCAAAAUGUCC 0.1% 8940 4467 CUCAGcUAUUCUGCUGGUG 0.2%

8892 4460 CGCUUUGUCCAgAAUGCCC 1.4% 8941 4467 CUCAGUUAcUCUGCUGGUG 1.0%

8893 4460 CGCUUUGUuCAAAAUGCCC 0.3% 8942 4467 CUCAGUUAUUCCGCUGGUG 0.1%

8894 4460 CGuUUUGUCCAAAAUGCCC 0.2% 8943 4467 CUCAGUUAUUCgGCUGGUG 0.1%

8895 4460 CGaUUUGUCCAAAAUGCCC 1.6% 8944 4467 CUCAGUUAUUCUGCCGGUG 0.2%

8896 4460 CGCUUcGUCCAAAAUGCCC 0.1% 8945 4467 CUCAGUUAUUCUGCUGGcG 0.1%

8897 4460 CGCUUUGUCCAAAACGCCC 0.1% 4468 4468 GUUAUUCUGCUGGUGCACU 94.0°/

8898 4460 CGCUUUGUCCAAAAUGCaC 0.2% 8946 4468 GUUAUUCUGCUGGUGCgCU 0.9%

8899 4460 CGgUUUGUCCAAAAUGCCC 0.3% 8947 4468 GcUAUUCUGCUGGUGCACU 0.2%

8900 4460 CuCUUUGUCCAAAAUGCCC 0.1% 8948 4468 GUUAcUCUGCUGGUGCACU 1.0%

8949 4468 GUUAUDCcGCUGGUGCACU 0.1% 8999 4474 AGUUAUUCUGCCGGUGCAC 0.2%

8950 4468 GUOAUUCgGCUGGUGCACU 0.1% 9000 4474 AGUUAUUCUGCUGGCGCAC 0.1%

8951 4468 GUUAUUCUGCCGGUGCACU 0.2% 4475 4475 GGAAUGGCUAAAGACAAGA 93.7%

8952 4468 GUUAUUCUGCUGGCGCACU 0.1% 9001 4475 GGAAUGGCUAAAGACAAGg 0.4%

4469 4469 UGGCCAGCACUACAGCUAA 94.0% 9002 4475 GGAAUGGCUgAAGACAAGA 0.1%

8953 4469 UGGCCAGCACUACgGCUAA 1.5% 9003 4475 GGAAUGGuUAAAGACAAGA 0.1%

8954 4469 UGGCCAGCAuUACAGCUAA 0.1% 9004 4475 GGAgUGGCUAAAGACAAGA 5.4%

8955 4469 UGGCCAGuACUACAGCUAA 0.1% 9005 4475 GGAAUGGCUAAAGACAAaA 0.2%

8956 4469 UGGCCAGuACUACgGCUAA 0.3% 9006 4475 GGAAUGGCUAAAGACUAGA 0.2%

8957 4469 UaGCCAGCACUACAGCUAA 1.2% 4476 4476 AAUGGCUAAAGACAAGACC 93.7%

8958 4469 UGGCaAGCACUACAGCUAA 0.1% 9007 4476 AAUGGCUAAAGACAAGgCC 0.4%

8959 4469 UGGCCAGCACCACAGCUAA 0.3% 9008 4476 AAUGGCUgAAGACAAGACC 0.1%

8960 4469 UGGCCAGCACUACAGCCAA 0.3% 9009 4476 AAUGGuUAAAGACAAGACC 0.1%

8961 4469 UGGCCAGCACUACAGCUUA 0.1% 9010 4476 AgUGGCUAAAGACAAGACC 5.4%

8962 4469 UGGCCAGCACUACCGCUAA 0.1% 9011 4476 AAUGGCUAAAGACAAaACC 0.2%

8963 4469 UaGCCAGCACUACgGCUAA 1.2% 9012 4476 AAUGGCUAAAGACUAGACC 0.2%

8964 4469 UGGCuAGCACcACAGCUAA 0.1% 4477 4477 GAAUGGCUAAAGACAAGAC 93.7%

8965 4469 UaGCCAGCACaACAGCUAA 0.4% 9013 4477 GAAUGGCUAAAGACAAGgC 0.4%

8966 4469 UaGCCAGCACUACAGCcAA 0.1% 9014 4477 GAAUGGCUgAAGACAAGAC 0.1%

8967 4469 UaGCCAGCACcACgGCUAA 0.1% 9015 4477 GAAUGGuUAAAGACAAGAC 0.1%

4470 4470 UUAUUCUGCUGGUGCACUU 94.0% 9016 4477 GAgUGGCUAAAGACAAGAC 5.4%

8968 4470 UUAUUCUGCUGGUGCgCUU 0.9% 9017 4477 GAAUGGCUAAAGACAAaAC 0.2%

8969 4470 cUAUUCUGCUGGUGCACUU 0.2% 9018 4477 GAAUGGCUAAAGACUAGAC 0.2%

8970 4470 UUAcUCUGCUGGUGCACUU 1.0% 4478 4478 UCCAUGGGGCCAAAGAAAU 93.4%

8971 4470 UUAUUCcGCUGGUGCACUU 0.1% 9019 4478 UCCAUGGGGCCAAAGAAgU 0.4%

8972 4470 UUAUUCgGCUGGUGCACUU 0.1% 9020 4478 UCCAUGGGGCCAAAGAgAU 0.1%

8973 4470 UUAUUCUGCcGGUGCACUU 0.2% 9021 4478 UCCAUGGGGCCAAAGgAAU 0.2%

8974 4470 UUAUUCUGCUGGCGCACUU 0.1% 9022 4478 UCCAUGGGGCCAAgGAAAU 0.1%

4471 4471 CAUGGAAUGGCUAAAGACA 94.0% 9023 4478 UCCAUGGGGCUAAAGAAAU 0.9%

8975 4471 CAUGGAAUGGCUgAAGACA 0.1% 9024 4478 UCuAUGGGGCCAAAGAAAU 0.2%

8976 4471 CAUGGAAUGGUUAAAGACA 0.1% 9025 4478 UuCAUGGGGCCAAAGAAAU 0.5%

8977 4471 CAUGGAgUGGCUAAAGACA 5.4% 9026 4478 UCCAUGGGGCCAAgGAggU 0.1%

8978 4471 aAUGGAAUGGCUAAAGACA 0.1% 9027 4478 UCCAUGGaGCCAAAGAAAU 0.4%

8979 4471 CAcGGAAUGGCUAAAGACA 0.2% 9028 4478 UCCAcGGGGCCAAAGAAgU 0.3%

8980 4471 CAUGGAAUGGCUAAAGACu 0.2% 9029 4478 UuCAUGGGGCCAAAGuAAU 0.1%

4472 4472 UGGAAUGGCUAAAGACAAG 93.9% 4479 4479 GCGUAGACGCUUUGUCCAA 93.4%

8981 4472 UGGAAUGGCUgAAGACAAG 0.1% 9030 4479 GCGUAGACGCUUUGUCCAg 2.0%

8982 4472 UGGAAUGGuUAAAGACAAG 0.1% 9031 4479 GCGUAGACGCUUUGUUCAA 0.1%

8983 4472 UGGAgUGGCUAAAGACAAG 5.4% 9032 4479 GCGUAGACGuUUUGUCCAA 0.2%

8984 4472 cGGAAUGGCUAAAGACAAG 0.2% 9033 4479 aCGUAGACGCUUUGUCCAA 1.5%

8985 4472 UGGAAUGGCUAAAGACAAa 0.2% 9034 4479 GCGUAcACGCUUUGUCCAA 0.1%

8986 4472 UGGAAUGGCUAAAGACUAG 0.2% 9035 4479 GCGUAGACGaUUUGUCCAA 1.5%

4473 4473 UCUCAUGGAAUGGCUAAAG 93.9% 9036 4479 GCGUAGACGCUϋcGUCCAA 0.1%

8987 4473 UCUCAUGGAAUGGCUgAAG 0.1% 9037 4479 GCGUAGACGgUUUGUCCAA 0.3%

8988 4473 UCUCAUGGAAUGGUUAAAG 0.1% 9038 4479 GCGUAuACGCUUUGUCCAA 0.2%

8989 4473 UCUCAUGGAgUGGCUAAAG 5.4% 9039 4479 aCGUAGACGCUUUGUuCAA 0.2%

8990 4473 aCUCAUGGAAUGGCUAAAG 0.3% 9040 4479 GCGUAaACGCUUUGUCCAg 0.1%

8991 4473 UCUaAUGGAAUGGCUAAAG 0.1% 9041 4479 aCGUAuACGCUUUGUCCAA 0.1%

8992 4473 UCUCAcGGAAUGGCUAAAG 0.2% 9042 4479 GCGUAuACGaUUUGUCCAA 0.1%

4474 4474 AGUUAUUCUGCUGGUGCAC 93.9% 4480 4480 CAGUUAUUCUGCUGGUGCA 93.4°/C

8993 4474 AGUUAUUCUGCUGGUGCgC 0.9% 9043 4480 CAGUUAUUCUGCUGGUGCg 0.9%

8994 4474 gGUUAUUCUGCUGGUGCAC 0.1% 9044 4480 CgGUUAUUCUGCUGGUGCA 0.1%

8995 4474 AGcUAUUCUGCUGGUGCAC 0.2% 9045 4480 uAGUUAUUCUGCUGGUGCA 0.5%

8996 4474 AGUUAcUCUGCUGGUGCAC 1.0% 9046 4480 CAGcUAUUCUGCUGGUGCA 0.2%

8997 4474 AGUUAUUCcGCUGGUGCAC 0.1% 9047 4480 CAGUUAcϋCUGCUGGUGCA 1.0%

8998 4474 AGUUAUUCgGCUGGUGCAC 0.1% 9048 4480 CAGUUAUUCcGCUGGUGCA 0.1%

20Q

9049 4480 CAGUuAUUCgGCUGGUGCA 0.1% 9099 4486 GAaGCUCUCAUGGAAUGGC 0.1%

9050 4480 CAGUUAUUCUGCCGGUGCA 0.2% 9100 4486 GAGGCaCUCAUGGAAUGGC 0.3%

9051 4480 CAGUUAUUCUGCUGGCGCA 0.1% 9101 4486 GAGGCUCUaAUGGAAUGGC 0.1%

4481 4481 UUCCAUGGGGCCAAAGAAA 93.4% 9102 4486 GAGGCUCUCACGGAAUGGC 0.1%

9052 4481 UUCCAUGGGGCCAAAGAAg 0.4% 9103 4486 GAGGuUCUCAcGGAAUGGC 0.1%

9053 4481 UUCCAUGGGGCCAAAGAgA 0.1% 4487 4487 GCUCUCAUGGAAUGGCUAA 93.0%

9054 4481 UUCCAUGGGGCCAAAGgAA 0.2% 9104 4487 GCUCUCAUGGAAUGGCUgA 0.1%

9055 4481 UUCCAUGGGGCCAAgGAAA 0.1% 9105 4487 GCUCUCAUGGAAUGGUUAA 0.1%

9056 4481 UUCCAUGGGGCUAAAGAAA 0.9% 9106 4487 GCUCUCAUGGAgUGGCUAA 5.4%

9057 4481 UUCuAUGGGGCCAAAGAAA 0.2% 9107 4487 GuUCUCAUGGAAUGGCUAA 0.9%

9058 4481 UUuCAUGGGGCCAAAGAAA 0.5% 9108 4487 GCaCUCAUGGAAUGGCUAA 0.3%

9059 4481 UUCCAUGGGGCCAAgGAgg 0.1% 9109 4487 GCUCUaAUGGAAUGGCUAA 0.1%

9060 4481 UUCCAUGGaGCCAAAGAAA 0.4% 9110 4487 GCUCUCAcGGAAUGGCUAA 0.1%

9061 4481 UUCCAcGGGGCCAAAGAAg 0.3% 9111 4487 GuUCUCAcGGAAUGGCUAA 0.1%

9062 4481 UUuCAUGGGGCCAAAGuAA 0.1% 4488 4488 CUCUCAUGGAAUGGCUAAA 93.0%

4482 4482 CACCUCUGACUAAGGGGAU 93.2%' 9112 4488 CUCUCAUGGAAUGGCUgAA 0.1%

9063 4482 CACCUCUGACUAgGGGGAU 0.1% 9113 4488 CUCUCAUGGAAUGGUUAAA 0.1%

9064 4482 CACCUCUaACUAAGGGGAU 0.9% 9114 4488 CUCUCAUGGAgUGGCUAAA 5.4%

9065 4482 CACCUCUGACaAAGGGGAU 0.4% 9115 4488 uUCUCAUGGAAUGGCUAAA 0.9%

9066 4482 CACCUCUGACCAAGGGGAU 0.2% 9116 4488 CaCUCAUGGAAUGGCUAAA 0.3%

9067 4482 CACCUCUGACUAAaGGGAU 2.0% 9117 4488 CUCUaAUGGAAUGGCUAAA 0.1%

9068 4482 CACCUCUGACUAAGGGaAU 3.0% 9118 4488 CUCϋCAcGGAAUGGCUAAA 0.1%

9069 4482 CACCUCUGACUAAGGGUAU 0.1% 9119 4488 uUCUCAcGGAAUGGCUAAA 0.1%

9070 4482 CACCUCUuACUAAGGGGAU 0.1% 4489 4489 AUGGGGCCAAAGAAAUAGC 93.0%

9071 4482 CACCUCUGACcAAGGGaAU 0.1% 9120 4489 AUGGGGCCAAAGAAgUAGC 0.4%

4483 4483 GUCACCUCUGACOAAGGGG 93.2% 9121 4489 AUGGGGCCAAAGAgAUAGC 0.1%

9072 4483 GUCACCUCUGACUAgGGGG 0.1% 9122 4489 AUGGGGCCAAAGgAAUAGC 0.2%

9073 4483 GUCACCUCUaACUAAGGGG 0.9% 9123 4489 AUGGGGCCAAgGAAAUAGC 0.1%

9074 4483 GUCACCUCUGACaAAGGGG 0.4% 9124 4489 AUGGGGCuAAAGAAAUAGC 0.9%

9075 4483 GUCACCUCUGACCAAGGGG 0.2% 9125 4489 AUGGaGCCAAAGAAAUAGC 0.4%

9076 4483 GUCACCUCUGACUAAaGGG 2.0% 9126 4489 AUGGGGCCAAAGAAAUAUC 0.2%

9077 4483 GUCACCUCUGACUAAGGGa 3.0% 9127 4489 AUGGGGCCAAAGAAAUCGC 0.5%

9078 4483 GUCACCUCUGACUAAGGGu 0.1% 9128 4489 AUGGGGCCAAAGUAAUAGC 0.1%

9079 4483 GUCACCUCUuACUAAGGGG 0.1% 9129 4489 AcGGGGCCAAAGAAgUAGC 0.3%

9080 4483 GUCACCUCUGACcAAGGGa 0.1% 9130 4489 AUGGGGCCAAAGAAAUcuC 0.4%

4484 4484 UCACCUCUGACUAAGGGGA 93.2% 4490 4490 AGGCUCUCAUGGAAUGGCU 93.0%

9081 4484 UCACCUCUGACUAgGGGGA 0.1% 9131 4490 AGGCUCUCAUGGAAUGGUU 0.1%

9082 4484 UCACCUCUaACUAAGGGGA 0.9% 9132 4490 AGGCUCUCAUGGAgUGGCU 5.4%

9083 4484 UCACCUCUGACaAAGGGGA 0.4% 9133 4490 AGGuUCUCAUGGAAUGGCU 0.9%

9084 4484 UCACCUCUGACCAAGGGGA 0.2% 9134 4490 AaGCUCUCAUGGAAUGGCU 0.1%

9085 4484 UCACCUCUGACUAAaGGGA 2.0% 9135 4490 AGGCaCUCAUGGAAUGGCU 0.3%

9086 4484 UCACCUCUGACUAAGGGaA 3.0% 9136 4490 AGGCUCUaAUGGAAUGGCU 0.1%

9087 4484 UCACCUCUGACUAAGGGUA 0.1% 9137 4490 AGGCUCUCAcGGAAUGGCU 0.1%

9088 4484 UCACCUCUuACUAAGGGGA 0.1% 9138 4490 AGGuUCUCAcGGAAUGGCU 0.1%

9089 4484 UCACCUCUGACcAAGGGaA 0.1% 4491 4491 GGCUCUCAUGGAAUGGCUA 92.9%

4485 4485 CUGGUGCACUUGCCAGUUG 93.0% 9139 4491 GGCUCUCAUGGAAUGGCUg 0.1%

9090 4485 CUGGUGCACUUGCUAGUUG 2.4% 9140 4491 GGCUCUCAUGGAAUGGUUA 0.1%

9091 4485 CUGGUGCgCUUGCCAGUUG 2.5% 9141 4491 GGCUCUCAUGGAgUGGCUA 5.4%

9092 4485 CcGGUGCACUUGCCAGUUG 1.7% 9142 4491 GGuUCUCAUGGAAUGGCUA 0.9%

9093 4485 CDGGcGCACUUGCCAGUUG 0.1% 9143 4491 aGCUCUCAUGGAAUGGCUA 0.1%

9094 4485 CUGGUGCgCUUGCCAGcUG 0.2% 9144 4491 GGCaCUCAUGGAAUGGCUA 0.3%

9095 4485 CcGGgGCACUUGCCAGUUG 0.1% 9145 4491 GGCUCUaAUGGAAUGGCUA 0.1%

4486 4486 GAGGCUCUCAUGGAAUGGC 93.0% 9146 4491 GGCUCUCAcGGAAUGGCUA 0.1%

9096 4486 GAGGCUCUCAUGGAAUGGU 0.1% 9147 4491 GGuUCUCAcGGAAUGGCUA 0.1%

9097 4486 GAGGCUCUCAUGGAgUGGC 5.4% 4492 4492 UUGAGGCUCUCAUGGAAUG 92.9%

9098 4486 GAGGuUCUCAUGGAAUGGC 0.9% 9148 4492 UUGAGGCUCUCAUGGAgUG 3.8%

9149 4492 UUGAGGuUCUCAUGGAAUG 0.9% 9200 4497 CUUCUACGGAAGGgGUACC 0.2%

9150 4492 UaGAGGCUCUCAUGGAAUG 0.1% 9201 4497 CUUCUACGGAgGGAGUACC 0.1%

9151 4492 UcGAGGCUCUCAUGGAAUG 0.1% 9202 4497 CUUCUACGGgAGGAGUACC 0.2%

9152 4492 UUGAaGCUCUCAUGGAAUG 0.1% 9203 4497 CUUuUACGGAAGGAGUACC 0.1%

9153 4492 UUGAGGCaCUCAUGGAAUG 0.3% 9204 4497 aUUCUACGGAAGGAGUACC 0.1%

9154 4492 UUGAGGCUCUaAUGGAAUG 0.1% 9205 4497 CUgCUACGGAAGGAGUACC 0.1%

9155 4492 UUGAGGCUCUCACGGAAUG 0.1% 9206 4497 CUUCcACGGAAGGAGUACC 0.1%

9156 4492 UcGAGGCUCUCAUGGAgUG 1.5% 9207 4497 CUUCUACaGAAGGAGUACC 0.1%

9157 4492 UUGAGGuUCUCAcGGAAUG 0.1% 9208 4497 CUUCUACGGAAGGAaUACC 0.4%

4493 4493 UGAGGCUCUCAUGGAAUGG 92.9% 9209 4497 CUUCUACGGAAGGAGUUCC 0.1%

9158 4493 UGAGGCUCUCAUGGAgUGG 3.8% 9210 4497 CUUgUACGGAAGGAGUACC 0.1%

9159 4493 UGAGGuUCUCAUGGAAUGG 0.9% 9211 4497 CUcCUACGGAAGGAGUgCu 0.5%

9160 4493 aGAGGCUCUCAUGGAAUGG 0.1% 9212 4497 CUUCUACaGAuGGAGUACC 0.2%

9161 4493 cGAGGCUCUCAUGGAAUGG 0.1% 9213 4497 CUgCUACGGcAGGgGUACC 1.2%

9162 4493 UGAaGCUCUCAUGGAAUGG 0.1% 4498 4498 UCUUGAGGCUCUCAUGGAA 92.7%

9163 4493 UGAGGCaCUCAUGGAAUGG 0.3% 9214 4498 UCUUGAGGCUCUCAUGGAg 3.7%

9164 4493 UGAGGCUCUaAUGGAAUGG 0.1% 9215 4498 UCUUGAGGuUCUCAUGGAA 0.9%

9165 4493 UGAGGCUCUCACGGAAUGG 0.1% 9216 4498 cCUUGAGGCUCUCAUGGAA 0.2%

9166 4493 cGAGGCUCUCAUGGAgUGG 1.5% 9217 4498 UCUaGAGGCUCUCAUGGAA 0.1%

9167 4493 UGAGGuUCUCACGGAAUGG 0.1% 9218 4498 UCUcGAGGCUCUCAUGGAA 0.1%

4494 4494 CUUGAGGCUCUCAUGGAAD 92.9% 9219 4498 UCUUGAaGCUCUCAUGGAA 0.1%

9168 4494 CUUGAGGCUCUCAUGGAgO 3.8% 9220 4498 UCUUGAGGCaCUCAUGGAA 0.3%

9169 4494 CUUGAGGuUCUCAUGGAAU 0.9% 9221 4498 UCUUGAGGCUCUaAUGGAA 0.1%

9170 4494 CUaGAGGCUCUCAUGGAAU 0.1% 9222 4498 UCUUGAGGCUCUCACGGAA 0.1%

9171 4494 CUcGAGGCUCUCAUGGAAU 0.1% 9223 4498 cCUUGAGGCUCUCAUGGAg 0.1%

9172 4494 CUUGAaGCUCUCAUGGAAU 0.1% 9224 4498 UCUcGAGGCUCUCAUGGAg 1.5%

9173 4494 CUUGAGGCaCUCAUGGAAU 0.3% 9225 4498 UCUUGAGGuUCUCAcGGAA 0.1%

9174 4494 CUUGAGGCUCUaAUGGAAU 0.1% 4499 4499 CUGCUGGUGCACUUGCCAG 92.7%

9175 4494 CUUGAGGCUCUCACGGAAU 0.1% 9226 4499 CUGCUGGUGCACUUGCUAG 2.4%

9176 4494 CUcGAGGCUCUCAUGGAgU 1.5% 9227 4499 CUGCUGGUGCgCUUGCCAG 0.9%

9177 4494 CUUGAGGuUCUCAcGGAAU 0.1% 9228 4499 CcGCUGGUGCACUUGCCAG 0.1%

4495 4495 CAUGGGGCCAAAGAAAUAG 92.9% 9229 4499 CgGCUGGUGCACUUGCCAG 0.1%

9178 4495 CAUGGGGCCAAAGAAgUAG 0.4% 9230 4499 CUGCcGGUGCACUUGCCAG 0.2%

9179 4495 CAUGGGGCCAAAGAgAUAG 0.1% 9231 4499 CUGCUGGcGCACUUGCCAG 0.1%

9180 4495 CAUGGGGCCAAAGgAAUAG 0.2% 9232 4499 CaaCUGGUGCACUUGCCAG 0.1%

9181 4495 CAUGGGGCCAAgGAAAUAG 0.1% 9233 4499 CaaCUGGUGCgCUUGCCAG 1.8%

9182 4495 CAUGGGGCuAAAGAAAUAG 0.9% 4500 4500 UGCUGGUGCACUUGCCAGU 92.7%

9183 4495 uAUGGGGCCAAAGAAAUAG 0.1% 9234 4500 UGCUGGUGCACUUGCUAGU 2.4%

9184 4495 CAUGGaGCCAAAGAAAUAG 0.4% 9235 4500 UGCUGGUGCgCUUGCCAGU 0.7%

9185 4495 CAUGGGGCCAAAGAAAUAu 0.2% 9236 4500 cGCUGGUGCACUUGCCAGU 0.1%

9186 4495 CAUGGGGCCAAAGAAAUCG 0.4% 9237 4500 gGCUGGUGCACUUGCCAGϋ 0.1%

9187 4495 CAUGGGGCCAAAGUAAUAG 0.1% 9238 4500 UGCcGGUGCACUUGCCAGU 0.2%

9188 4495 CAcGGGGCCAAAGAAgUAG 0.3% 9239 4500 UGCUGGcGCACUUGCCAGU 0.1%

9189 4495 uAUGGGGCCAAAGAAAUcG 0.1% 9240 4500 UGCUGGUGCgCUUGCCAGc 0.2%

9190 4495 CAUGGGGCCAAAGAAAUcu 0.4% 9241 4500 aaCUGGUGCACUUGCCAGU 0.1%

4496 4496 GCUGGUGCACUUGCCAGUU 92.9% 9242 4500 aaCUGGUGCgCUUGCCAGU 1.8%

9191 4496 GCUGGUGCACUUGCUAGUU 2.4% 4501 4501 UCUGCUGGUGCACUUGCCA 92.7%

9192 4496 GCUGGUGCgCUUGCCAGUU 0.7% 9243 4501 UCUGCUGGUGCACUUGCUA 2.4%

9193 4496 aCUGGUGCACUUGCCAGUU 0.1% 9244 4501 UCUGCUGGUGCgCUUGCCA 0.9%

9194 4496 GCcGGUGCACUUGCCAGUU 0.2% 9245 4501 UCcGCUGGUGCACUUGCCA 0.1%

9195 4496 GCUGGcGCACUUGCCAGUU 0.1% 9246 4501 UCgGCUGGUGCACUUGCCA 0.1%

9196 4496 aCUGGUGCgCUUGCCAGUU 1.8% 9247 4501 UCUGCcGGUGCACUUGCCA 0.2%

9197 4496 GCUGGUGCgCUUGCCAGcU 0.2% 9248 4501 UCUGCUGGcGCACUUGCCA 0.1%

9198 4496 aCcGGUGCACUUGCCAGUU 1.5% 9249 4501 UCaaCUGGUGCACUUGCCA 0.1%

4497 4497 CUUCUACGGAAGGAGUACC 92.8% 9250 4501 UCaaCUGGUGCgCUUGCCA 1.8%

9199 4497 CUUCUACGGAAGGAGUgCC 3.4% 4502 4502 CCAUGGGGCCAAAGAAAUA 92.6%

9251 4502 CCAUGGGGCCAAAGAAgOA 0.4% 9303 4506 CcAUGAGGGAAGAAUAUCG 0.8%

9252 4502 CCAUGGGGCCAAAGAgAUA 0.1% 9304 4506 CUAUGAGGGAAGAAUACCG 0.2%

9253 4502 CCAUGGGGCCAAAGgAAUA 0.2% 9305 4506 CUuUGAGGGAAGAAUAUCG 0.6%

9254 4502 CCAUGGGGCCAAgGAAAUA 0.1% 9306 4506 CUAUGAGGGAACAAUAUUG 0.1%

9255 4502 CCAUGGGGCuAAAGAAAUA 0.9% 9307 4506 CUAUGAGGGAAGAgUAcCG 1.3%

9256 4502 CuAUGGGGCCAAAGAAAUA 0.1% 9308 4506 uUAUGAGGGAAGAAUAUgG 0.1%

9257 4502 uCAUGGGGCCAAAGAAAUA 0.5% 9309 4506 CcAUGAGGGAAGAAUAUCa 0.1%

9258 4502 CCAUGGaGCCAAAGAAAUA 0.4% 9310 4506 CUAcuAGGGAAGAAUAUCG 0.1%

9259 4502 CCAUGGGGCCAAAGAAAUc 0.8% 9311 4506 CcAUGAGGGAAGAgUAUCa 0.1%

9260 4502 CCAcGGGGCCAAAGAAgUA 0.3% 4507 4507 UCUAUGAGGGAAGAAUAUC 92.4%

9261 4502 CuAUGGGGCCAAAGAAAUc 0.1% 9312 4507 UCUAUGAGGGAAGAgUAUC 3.4%

9262 4502 uCAUGGGGCCAAAGuAAUA 0.1% 9313 4507 UCUAUGAGGGAgGAAUAUC 0.8%

4503 4503 AGUCUAUGAGGGAAGAAUA 92.6% 9314 4507 UCUAuGgGGGAAGAAUAUC 0.1%

9263 4503 AGUCUAUGAGGGAAGAgUA 4.7% 9315 4507 UCcAUGAGGGAAGAAUAUC 0.9%

9264 4503 AGUCUAUGAGGGAgGAAUA 0.8% 9316 4507 UCUAUGAGGGAAGAAUACC 0.2%

9265 4503 AGUCUAUGgGGGAAGAAUA 0.1% 9317 4507 UCUuUGAGGGAAGAAUAUC 0.6%

9266 4503 AGϋuUAUGAGGGAAGAAUA 0.1% 9318 4507 UCcAUGAGGGAAGAgUAUC 0.1%

9267 4503 AGUCcAUGAGGGAAGAAUA 0.9% 9319 4507 UCUAUGAGGGAACAAUAUU 0.1%

9268 4503 AGUCUAUGAGGGAAcAAUA 0.1% 9320 4507 UCUAUGAGGGAAGAgUAcC 1.3%

9269 4503 AGUCUuUGAGGGAAGAAUA 0.6% 9321 4507 UuUAUGAGGGAAGAAUAUg 0.1%

9270 4503 AGUCcAUGAGGGAAGAgUA 0.1% 9322 4507 UCUAcuAGGGAAGAAUAUC 0.1%

9271 4503 AGUCUAcuAGGGAAGAAUA 0.1% 4508 4508 GUCAUUUUGUCAGCAUAGA 92.2%

4504 4504 CCUUCUACGGAAGGAGUAC 92.5% 9323 4508 GUCAUUUUGUCAGCAUAGg 0.2%

9272 4504 CCUUCUACGGAAGGAGUgC 3.4% 9324 4508 GUCAUUUUGUCAGCAUgGA 0.1%

9273 4504 CCUUCUACGGAAGGgGUAC 0.2% 9325 4508 GUCAUUUUGUCAGUAUAGA 0.2%

9274 4504 CCUUCUACGGAgGGAGUAC 0.1% 9326 4508 GUCgUUUUGUCAGCAUAGA 0.1%

9275 4504 CCUUCUACGGgAGGAGUAC 0.2% 9327 4508 GUCAaUUUGUCAGCAUAGA 0.1%

9276 4504 CCUOuUACGGAAGGAGUAC 0.1% 9328 4508 GUCAUUcUGUCAGCAUAGA 0.1%

9277 4504 uCUUCUACGGAAGGAGUAC 0.1% 9329 4508 GUCAUUUUGcCAGCAUAGA 0.1%

9278 4504 CaUUCUACGGAAGGAGUAC 0.1% 9330 4508 GUCAUUUUGUCAaCAUAGA 5.1%

9279 4504 CCUgCUACGGAAGGAGUAC 0.1% 9331 4508 GUCAUUUUGUCAGCAUAaA 0.3%

9280 4504 CCUUCcACGGAAGGAGUAC 0.1% 9332 4508 GUCAUUUUGUCAGCAUACA 0.1%

9281 4504 CCUUCUACaGAAGGAGUAC 0.1% 9333 4508 GUCgUUUUGUCAaCAUAGA 0.1%

9282 4504 CCUUCUACGGAAGGAaUAC 0.4% 9334 4508 GUCAUUUcGUCAaCAUAGA 0.1%

9283 4504 CCUUCUACGGAAGGAGUuC 0.1% 9335 4508 GUCAUUUUGUCAaCAUAGu 0.4%

9284 4504 CCUUgUACGGAAGGAGUAC 0.1% 9336 4508 GUCAUUUUGUCAUCAUACA 0.1%

9285 4504 gCUUCUACGGAAGGAGUAC 0.2% 4509 4509 GAGUCUAUGAGGGAAGAAU 92.2%

9286 4504 CCUcCUACGGAAGGAGUgC 0.5% 9337 4509 GAGUCUAUGAGGGAAGAgU 4.7%

9287 4504 CCUUCUACaGAuGGAGUAC 0.2% 9338 4509 GAGUCUAUGAGGGAgGAAU 0.8%

9288 4504 CCUgCUACGGcAGGgGUAC 1.2% 9339 4509 GAGUCUAUGgGGGAAGAAU 0.1%

4505 4505 GUCUAUGAGGGAAGAAUAU 92.4% 9340 4509 GAGUuUAUGAGGGAAGAAU 0.1%

9289 4505 GUCUAUGAGGGAAGAgUAU 3.4% 9341 4509 aAGUCUAUGAGGGAAGAAU 0.4%

9290 4505 GUCUAUGAGGGAgGAAUAU 0.8% 9342 4509 GAGUCcAUGAGGGAAGAAU 0.9%

9291 4505 GUCUAUGgGGGAAGAAUAU 0.1% 9343 4509 GAGUCUAUGAGGGAACAAU 0.1%

9292 4505 GUuUAUGAGGGAAGAAUAU 0.1% 9344 4509 GAGUCUuUGAGGGAAGAAU 0.6%

9293 4505 GUCcAUGAGGGAAGAAUAU 0.9% 9345 4509 GAGUCcAUGAGGGAAGAgU 0.1%

9294 4505 GUCUAUGAGGGAAcAAUAϋ 0.1% 9346 4509 GAGUCUAcuAGGGAAGAAU 0.1%

9295 4505 GUCUAUGAGGGAAGAAUAc 0.2% 4510 4510 UCAUUUUGUCAGCAUAGAG 92.1%

9296 4505 GUCUuUGAGGGAAGAAUAU 0.6% 9347 4510 UCAUUUUGUCAGCAUAGgG 0.2%

9297 4505 GUCcAUGAGGGAAGAgUAU 0.1% 9348 4510 UCAUUUUGUCAGCAUgGAG 0.1%

9298 4505 GUCUAUGAGGGAAGAgUAc 1.3% 9349 4510 UCAUUUUGUCAGUAUAGAG 0.2%

9299 4505 GUCUAcuAGGGAAGAAUAU 0.1% 9350 4510 UCgUUUUGUCAGCAUAGAG 0.1%

4506 4506 CUAUGAGGGAAGAAUAUCG 92.4% 9351 4510 UCAaUUUGUCAGCAUAGAG 0.1%

9300 4506 CUAUGAGGGAAGAgUAUCG 3.4% 9352 4510 UCAUUcUGUCAGCAUAGAG 0.1%

9301 4506 CUAUGAGGGAgGAAUAUCG 0.8% 9353 4510 UCAUUUUGcCAGCAUAGAG 0.1%

9302 4506 CUAUGgGGGAAGAAUAUCG 0.1% 9354 4510 UCAUUUUGUCAaCAUAGAG 3.7%

9355 4510 UCAUUUUGUCAGCAUAaAG 0.3% 9407 4514 GAGACUUcAAGAUGUCUUU 0.1%

9356 4510 UCAUUUUGUCAGCAUACAG 0.1% 9408 4514 GAGACUUGAAaAUGUCUUU 0.3%

9357 4510 UCAUUUUGUCAGCAUAGAU 0.1% 9409 4514 GAGACUUGAAGAUGUaUUU 2.5%

9358 4510 gCAUUUUGUCAaCAUAGAG 0.1% 9410 4514 GAGACUUGAAuAUGUCUUU 0.1%

9359 4510 UCAUUUcGUCAaCAUAGAG 0.1% 9411 4514 GAaACUUGAgGAUGUCUUU 0.1%

9360 4510 UCAUUUUGUCAaCAUAGAa 1.3% 9412 4514 GAGACUgGAAGgUGUCUUU 0.1%

9361 4510 UCAUUUUGUCAaCAUAGuG 0.4% 9413 4514 GAGACaaGAAGAUGUCUUU 0.2%

9362 4510 UCgUUUUGUCAaCAUAGAa 0.1% 9414 4514 GAGACUcGAAGAcGUuUUϋ 0.1%

4511 4511 AGACUUGAAGAUGUCUUUG 91.9% 4515 4515 CAGAGACUUGAAGAUGUCU 91.9%

9363 4511 AGACUUGAAGAUGUUUUUG 0.8% 9415 4515 CAGAGACUUGAAGAUGUUU 0.8%

9364 4511 AGACUUGAgGAUGUCUUUG 0.4% 9416 4515 CAGAGACUUGAgGAUGUCU 0.4%

9365 4511 AGgCUUGAAGAUGUCUUUG 0.2% 9417 4515 CAGAGgCUUGAAGAUGUCU 0.2%

9366 4511 AGACUUGAgGAUGUuUUUG 1.7% 9418 4515 CAGAGACUUGAgGAUGUuU 1.7%

9367 4511 AaACUUGAAGAUGUCUUUG 1.5% 9419 4515 CAGAaACUUGAAGAUGUCU 1.5%

9368 4511 AGACUUcAAGAUGUCUUUG 0.1% 9420 4515 CAGAGACUUcAAGAUGUCU 0.1%

9369 4511 AGACUUGAAaAUGUCUUUG 0.3% 9421 4515 CAGAGACUUGAAaAUGUCU 0.3%

9370 4511 AGACUUGAAGAUGUaUUUG 2.5% 9422 4515 CAGAGACUUGAAGAUGUaU 2.5%

9371 4511 AGACUUGAAuAUGUCUUUG 0.1% 9423 4515 CAGAGACUUGAAUAUGUCU 0.1%

9372 4511 AaACUUGAgGAUGUCUUUG 0.1% 9424 4515 CAGAaACUUGAgGAUGUCU 0.1%

9373 4511 AGACUgGAAGgUGUCUUUG 0.1% 9425 4515 CAGAGACUgGAAGgUGUCU 0.1%

9374 4511 AGACaaGAAGAUGUCUUUG 0.2% 9426 4515 CAGAGACaaGAAGAUGUCU 0.2%

9375 4511 AGACUcGAAGAcGUuUUUG 0.1% 9427 4515 CAGAGACUcGAAGAcGUuU 0.1%

4512 4512 GACUUGAAGAUGUCUUUGC 91.9% 4516 4516 GUGCAGAUGCAACGAUUCA 91.7%

9376 4512 GACUUGAAGAUGUUUUUGC 0.8% 9428 4516 GUGCAGAUGCAACGgUUCA 1.8%

9377 4512 GACUUGAgGAUGUCUUUGC 0.4% 9429 4516 GUGCAGAUGCAgCGAUUCA 3.2%

9378 4512 GgCUUGAAGAUGUCUUUGC 0.2% 9430 4516 GUGCAGgUGCAACGAUUCA 0.1%

9379 4512 GACUUGAgGAUGUuUUUGC 1.7% 9431 4516 GUGCAGAUGCAgCGgUUCA 0.2%

9380 4512 aACUUGAAGAUGUCUUUGC 1.5% 9432 4516 GUaCAGAUGCAACGAUUCA 0.1%

9381 4512 GACUUcAAGAUGUCUUUGC 0.1% 9433 4516 GUGCAaAUGCAACGAUUCA 0.4%

9382 4512 GACUUGAAaAUGUCUUUGC 0.3% 9434 4516 GUaCAGAUGCAgCGAUUCA 1.9%

9383 4512 GACUUGAAGAUGUaUUUGC 2.5% 9435 4516 GUGCAaAUGCAgCGAUUCA 0.2%

9384 4512 GACUUGAAuAUGUCUUUGC 0.1% 9436 4516 GUaCAaAUGCAgCGAUUCA 0.2%

9385 4512 aACUUGAgGAUGUCUUUGC 0.1% 9437 4516 GUGCAaAUGCAgaGAUUCA 0.2%

9386 4512 GACUgGAAGgUGUCUUUGC 0.1% 4517 4517 UUCUGCUGGUGCACUUGCC 91.7%

9387 4512 GACaaGAAGAUGUCUUUGC 0.2% 9438 4517 UUCUGCUGGUGCACUUGCu 2.4%

9388 4512 GACUcGAAGAcGUuUUUGC 0.1% 9439 4517 UUCUGCUGGUGCgCUUGCC 0.9%

4513 4513 AGAGACUUGAAGAUGUCUU 91.9% 9440 4517 cϋCUGCUGGUGCACUUGCC 1.0%

9389 4513 AGAGACUUGAAGAUGUUUU 0.8% 9441 4517 UUCcGCUGGUGCACUUGCC 0.1%

9390 4513 AGAGACUUGAgGAUGUCUU 0.4% 9442 4517 UUCgGCUGGUGCACUUGCC 0.1%

9391 4513 AGAGgCUUGAAGAUGUCUU 0.2% 9443 4517 UUCUGCcGGUGCACUUGCC 0.2%

9392 4513 AGAGACUUGAgGAUGUuUU 1.7% 9444 4517 UUCUGCUGGcGCACUUGCC 0.1%

9393 4513 AGAaACUUGAAGAUGUCUU 1.5% 4518 4518 UGCAGAUGCAACGAUUCAA 91.6%

9394 4513 AGAGACUUcAAGAUGUCUU 0.1% 9445 4518 UGCAGAUGCAACGAUUCAg 0.2%

9395 4513 AGAGACUUGAAaAUGUCUU 0.3% 9446 4518 UGCAGAUGCAACGgUUCAA 1.8%

9396 4513 AGAGACUUGAAGAUGUaUU 2.5% 9447 4518 UGCAGAUGCAgCGAUUCAA 3.2%

9397 4513 AGAGACUUGAAUAUGUCUU 0.1% 9448 4518 UGCAGgUGCAACGAUUCAA 0.1%

9398 4513 AGAaACUUGAgGAUGUCUU 0.1% 9449 4518 UGCAGAUGCAgCGgUUCAA 0.2%

9399 4513 AGAGACUgGAAGgUGUCUU 0.1% 9450 4518 UaCAGAUGCAACGAUUCAA 0.1%

9400 4513 AGAGACaaGAAGAUGUCUU 0.2% 9451 4518 UGCAaAUGCAACGAUUCAA 0.4%

9401 4513 AGAGACUcGAAGAcGUuUU 0.1% 9452 4518 UaCAGAUGCAgCGAUUCAA 1.9%

4514 4514 GAGACUUGAAGAUGUCUUU 91.9% 9453 4518 UGCAaAUGCAgCGAUUCAA 0.2%

9402 4514 GAGACUUGAAGAUGUUUUU 0.8% 9454 4518 UaCAaAUGCAgCGAUUCAA 0.2%

9403 4514 GAGACUUGAgGAUGUCUUU 0.4% 9455 4518 UGCAaAUGCAgaGAUUCAA 0.2%

9404 4514 GAGgCUUGAAGAUGUCUUU 0.2% 4519 4519 UAUGAGGGAAGAAUAUCGA 91.5%

9405 4514 GAGACUUGAgGAUGUuUUU 1.7% 9456 4519 UAUGAGGGAAGAAUAUCGg 1.0%

9406 4514 GAaACUUGAAGAUGUCUUU 1.5% 9457 4519 UAUGAGGGAAGAgUAUCGA 1.5%

9458 4519 UAUGAGGGAgGAAUAUCGA 0.8% 9509 4524 GGAUcCUUGAUCGUCUUUU 0.1%

9459 4519 UAUGgGGGAAGAAUAUCGA 0.1% 9510 4524 GGAUUCUUaAUCGUCUUUU 0.3%

9460 4519 UAUGAGGGAAGAgUAUCGg 1.8% 9511 4524 GGAUUCUUGAUCaUCUUUU 0.2%

9461 4519 CAUGAGGGAAGAAUAUCGA 0.8% 9512 4524 GGAUUCUUGAUCGCCUUUU 1.0%

9462 4519 UAUGAGGGAAGAAUACCGA 0.2% 9513 4524 GGAUUgUUGAUCGUCUUUU 0.1%

9463 4519 UAUGAGGGAAGAAUAUgGA 0.1% 9514 4524 GGAUUaUUGAUCGcCUUUU 5.6%

9464 4519 UuUGAGGGAAGAAUAUCGA 0.3% 9515 4524 GGAUUaUUGAUCGgCUUUU 1.0%

9465 4519 UuUGAGGGAAGAAUAUCGg 0.3% 4525 4525 GGGGUGCAGAUGCAACGAU 90.8%

9466 4519 UAUGAGGGAAcAAUAUuGg 0.1% 9516 4525 GGGGUGCAGAUGCAACGgU 0.5%

9467 4519 UAUGAGGGAAGAgUAcCGg 1.3% 9517 4525 GGGGUGCAGAUGCAgCGAU 1.5%

9468 4519 cAUGAGGGAAGAAUAUCaA 0.1% 9518 4525 GGGGUGCAGgUGCAACGAU 0.1%

9469 4519 UAcuAGGGAAGAAUAUCGA 0.1% 9519 4525 GGaGUGCAGAUGCAACGAU 1.0%

9470 4519 cAUGAGGGAAGAgUAUCaA 0.1% 9520 4525 GGGGUGCAaAUGCAACGAU 0.3%

4520 4520 AUGGGGGUGCAGAUGCAAC 91.3% 9521 4525 GGaGUGCAGAUGCAACGgU 1.3%

9471 4520 AUGGGGGUGCAGAUGCAgC 1.5% 9522 4525 GGaGUGCAGAUGCAgCGAU 1.6%

9472 4520 AUGGGGGUGCAGgUGCAAC 0.1% 9523 4525 GGaGUGCAGAUGCAgCGgU 0.2%

9473 4520 AUGGGaGUGCAGAUGCAAC 2.3% 9524 4525 GGaGUaCAGAUGCAACGAU 0.1%

9474 4520 AUGGGGGUGCAaAUGCAAC 0.3% 9525 4525 GGaGUGCAaAUGCAACGAU 0.1%

9475 4520 AUGGGaGUGCAGAUGCAgC 1.8% 9526 4525 GGaGUaCAGAUGCAgCGAU 1.9%

9476 4520 AUGGGaGUaCAGAUGCAAC 0.1% 9527 4525 GGaGUGCAaAUGCAgCGAU 0.2%

9477 4520 AUGGGaGUGCAaAUGCAAC 0.1% 4526 4526 GGGUGCAGAUGCAACGAUU 90.8%

9478 4520 AUGGGaGUaCAGAUGCAgC 1.9% 9528 4526 GGGUGCAGAUGCAACGgUU 0.5%

9479 4520 AUGGGaGUGCAaAUGCAgC 0.2% 9529 4526 GGGUGCAGAUGCAgCGAUU 1.5%

4521 4521 UGGGGGUGCAGAUGCAACG 91.3% 9530 4526 GGGUGCAGgUGCAACGAUU 0.1%

9480 4521 UGGGGGUGCAGAUGCAgCG 1.5% 9531 4526 GaGUGCAGAUGCAACGAUU 1.0%

9481 4521 UGGGGGUGCAGgUGCAACG 0.1% 9532 4526 GGGUGCAaAUGCAACGAUU 0.3%

9482 4521 UGGGaGUGCAGAUGCAACG 2.3% 9533 4526 GaGUGCAGAUGCAACGgUU 1.3%

9483 4521 UGGGGGUGCAaAUGCAACG 0.3% 9534 4526 GaGUGCAGAUGCAgCGAUU 1.6%

9484 4521 UGGGaGUGCAGAUGCAgCG 1.8% 9535 4526 GaGUGCAGAUGCAgCGgUU 0.2%

9485 4521 UGGGaGUaCAGAUGCAACG 0.1% 9536 4526 GaGUaCAGAUGCAACGAUU 0.1%

9486 4521 UGGGaGUGCAaAUGCAACG 0.1% 9537 4526 GaGUGCAaAUGCAACGAUU 0.1%

9487 4521 UGGGaGUaCAGAUGCAgCG 1.9% 9538 4526 GaGUaCAGAUGCAgCGAUU 1.9%

9488 4521 UGGGaGUGCAaAUGCAgCG 0.2% 9539 4526 GaGUGCAaAUGCAgCGAUU 0.2%

4522 4522 ACCUCUGACUAAGGGGAUU 91.3% 4527 4527 GUGGAUUCUUGAUCGUCUU 90.8%

9489 4522 ACCUCUGACUAgGGGGAUU 0.1% 9540 4527 GUGGAUUuUUGAUCGUCUU 0.7%

9490 4522 ACCUCUaACUAAGGGGAUU 0.7% 9541 4527 aUGGAUUCUUGAUCGUCUU 0.3%

9491 4522 ACCUCUGACaAAGGGGAUU 0.4% 9542 4527 GUGGAUcCUUGAUCGUCUU 0.1%

9492 4522 ACCUCUGACcAAGGGGAUU 0.2% 9543 4527 GUGGAUUCUUaAUCGUCUU 0.3%

9493 4522 ACCUCUGACUAAaGGGAUU 2.0% 9544 4527 GUGGAUUCUUGAUCaUCUU 0.2%

9494 4522 ACCUCUGACUAAGGGaAUU 3.0% 9545 4527 GUGGAUUCUUGAUCGCCUU 1.0%

9495 4522 ACCUCUGACUAAGGGGAUc 1.9% 9546 4527 GUGGAUUgUUGAUCGUCUU 0.1%

9496 4522 ACCUCUGACUAAGGGUAUU 0.1% 9547 4527 GUGGAUUaUUGAUCGcCUU 5.6%

9497 4522 ACCUCUuACUAAGGGGAUU 0.1% 9548 4527 GUGGAUUaUUGAUCGgCUU 1.0%

9498 4522 ACCUCUaACUAAGGGGAUc 0.2% 4528 4528 GGUGCAGAUGCAACGAUUC 90.8%

9499 4522 ACCUCUGACcAAGGGaAUU 0.1% 9549 4528 GGUGCAGAUGCAACGgUUC 0.5%

4523 4523 UGGAUUCUUGAUCGUCUUU 91.1% 9550 4528 GGUGCAGAUGCAgCGAUUC 1.5%

9500 4523 UGGAUUuUUGAUCGUCUUU 0.7% 9551 4528 GGUGCAGgUGCAACGAUUC 0.1%

9501 4523 UGGAUcCUUGAUCGUCUUU 0.1% 9552 4528 aGUGCAGAUGCAACGAUUC 1.0%

9502 4523 UGGAUUCUUaAUCGUCUUU 0.3% 9553 4528 GGUGCAaAUGCAACGAUUC 0.3%

9503 4523 UGGAUUCUUGAUCaUCUUU 0.2% 9554 4528 aGUGCAGAUGCAACGgUUC 1.3%

9504 4523 UGGAUUCUUGAUCGCCUUU 1.0% 9555 4528 aGUGCAGAUGCAgCGAUUC 1.6%

9505 4523 UGGAUUgUUGAUCGUCUUU 0.1% 9556 4528 aGUGCAGAUGCAgCGgUUC 0.2%

9506 4523 UGGAUUaUUGAUCGcCUUU 5.6% 9557 4528 aGUaCAGAUGCAACGAUUC 0.1%

9507 4523 UGGAUUaUUGAUCGgCUUU 1.0% 9558 4528 aGUGCAaAUGCAACGAUUC 0.1%

4524 4524 GGAUUCUUGAUCGUCUUUU 91.1% 9559 4528 aGUaCAGAUGCAgCGAUUC 1.9%

9508 4524 GGAUϋuUUGAUCGUCUUUU 0.7% 9560 4528 aGUGCAaAUGCAgCGAUUC 0.2%

4529 4529 GGGGGUGCAGAUGCAACGA 90.8% 9612 4533 UCUUCUUGAAAAaUUGCAG 0.1%

9561 4529 GGGGGUGCAGAUGCAACGg 0.5% 9613 4533 UCUUCUUGAAAACUUGCAG 0.5%

9562 4529 GGGGGUGCAGAUGCAgCGA 1.5% 9614 4533 UCUUCUUGAAAAUUUaCAG 0.8%

9563 4529 GGGGGUGCAGgUGCAACGA 0.1% 9615 4533 UCUcaUUGAAAAUUUGCAG 0.2%

9564 4529 GGGaGUGCAGAUGCAACGA 1.0% 9616 4533 UCUcCUUGAuAAUUUGCAG 0.1%

9565 4529 GGGGGUGCAaAUGCAACGA 0.3% 4534 4534 AUGGCUAAAGACAAGACCA 90.5%

9566 4529 GGGaGUGCAGAUGCAACGg 1.3% 9617 4534 AUGGCUAAAGACAAGACCg 3.0%

9567 4529 GGGaGUGCAGAUGCAgCGA 1.6% 9618 4534 AUGGCUAAAGACAAGgCCA 0.4%

9568 4529 GGGaGUGCAGAUGCAgCGg 0.2% 9619 4534 AUGGCUgAAGACAAGACCA 0.1%

9569 4529 GGGaGUaCAGAUGCAACGA 0.1% 9620 4534 gUGGCUAAAGACAAGACCA 4.9%

9570 4529 GGGaGUGCAaAUGCAACGA 0.1% 9621 4534 AUGGuUAAAGACAAGACCg 0.1%

9571 4529 GGGaGUaCAGAUGCAgCGA 1.9% 9622 4534 gUGGCUAAAGACAAGACCg 0.5%

9572 4529 GGGaGUGCAaAUGCAgCGA 0.2% 9623 4534 AUGGCUAAAGACAAaACCA 0.2%

4530 4530 CUCUGACUAAGGGGAUUUU 90.7% 9624 4534 AUGGCUAAAGACAAGACCc 0.1%

9573 4530 CUCUGACUAgGGGGAUUUU 0.1% 9625 4534 AUGGCUAAAGACAAGACCu 0.1%

9574 4530 CUCUaACUAAGGGGAOUUU 0.7% 9626 4534 AUGGCUAAAGACUAGACCU 0.2%

9575 4530 CUCUGACaAAGGGGAUUUU 0.4% 4535 4535 AUCUUCUUGAAAAUUUGCA 90.5%

9576 4530 CUCUGACcAAGGGGAUUUU 0.2% 9627 4535 AUCUUuUUGAAAAUUUGCA 0.1%

9577 4530 CUCUGACUAAaGGGAUUUU 2.0% 9628 4535 AcCUUCUUGAAAAUUUGCA 0.1%

9578 4530 CUCUGACUAAGGGaAUUUU 3.0% 9629 4535 AUCUcCUUGAAAAUUUGCA 5.9%

9579 4530 CUCUGACUAAGGGGAUCUU 0.5% 9630 4535 AUCUUaUUGAAAAUUUGCA 1.7%

9580 4530 CUCUGACUAAGGGGAUUCU 0.6% 9631 4535 AUCUUCUcGAAAAUUUGCA 0.1%

9581 4530 CUCUGACUAAGGGUAUUUU 0.1% 9632 4535 AUCUUCUUGAAAAaUUGCA 0.1%

9582 4530 CUCUuACUAAGGGGAUUUU 0.1% 9633 4535 AUCUUCUUGAAAACUUGCA 0.5%

9583 4530 CUCUGACcAAGGGaAUUUU 0.1% 9634 4535 AUCUUCUUGAAAAUUUaCA 0.8%

9584 4530 CUCUGACUAAGGGGAUccU 1.4% 9635 4535 AUCUcaUUGAAAAUUUGCA 0.2%

4531 4531 CCUCUGACUAAGGGGAUUU 90.7% 9636 4535 AUCUcCUUGAuAAUUUGCA 0.1%

9585 4531 CCUCUGACUAgGGGGAUUU 0.1% 4536 4536 GAAUGGGGGUGCAGAUGCA 90.4%

9586 4531 CCUCUaACUAAGGGGAUUU 0.7% 9637 4536 GAAUGGGGGUGCAGgUGCA 0.1%

9587 4531 CCUCUGACaAAGGGGAUUU 0.4% 9638 4536 GgAUGGGGGUGCAGAUGCA 2.4%

9588 4531 CCUCUGACcAAGGGGAUUU 0.2% 9639 4536 GAAUGGGaGUGCAGAUGCA 3.9%

9589 4531 CCUCUGACUAAaGGGAUUU 2.0% 9640 4536 GAAUGGGGGUGCAaAUGCA 0.3%

9590 4531 CCUCUGACUAAGGGaAUUU 3.0% 9641 4536 GgAUGGGaGUGCAGAUGCA 0.2%

9591 4531 CCUCUGACUAAGGGGAUCU 0.5% 9642 4536 GAAUGGGaGUaCAGAUGCA 2.0%

9592 4531 CCUCUGACUAAGGGGAUUC 0.6% 9643 4536 GgAUGGGaGUGCAaAUGCA 0.5%

9593 4531 CCUCUGACUAAGGGUAUUU 0.1% 4537 4537 CGAAUGGGGGUGCAGAUGC 90.2%

9594 4531 CCUCUuACUAAGGGGAUUU 0.1% 9644 4537 CGAAUGGGGGUGCAGgUGC 0.1%

9595 4531 CCUCUGACcAAGGGaAUUU 0.1% 9645 4537 CGgAUGGGGGUGCAGAUGC 2.4%

9596 4531 CCUCUGACUAAGGGGAUcc 1.4% 9646 4537 aGAAUGGGGGUGCAGAUGC 0.2%

4532 4532 UGUGGAUUCUUGAUCGUCU 90.7% 9647 4537 CGAAUGGGaGUGCAGAUGC 3.9%

9597 4532 UGUGGAUUuUUGAUCGUCU 0.7% 9648 4537 CGAAUGGGGGUGCAaAUGC 0.3%

9598 4532 CGUGGAUUCUUGAUCGUCU 0.1% 9649 4537 CGgAUGGGaGUGCAGAUGC 0.2%

9599 4532 UaUGGAUUCUUGAUCGUCU 0.3% 9650 4537 CGAAUGGGaGUaCAGAUGC 2.0%

9600 4532 UGUGGAUcCUUGAUCGUCU 0.1% 9651 4537 CGgAUGGGaGUGCAaAUGC 0.5%

9601 4532 UGUGGAUUCUUaAUCGUCU 0.3% 4538 4538 AAUGGGGGUGCAGAUGCAA 90.2%

9602 4532 UGUGGAUUCUUGAUCaUCU 0.2% 9652 4538 AAUGGGGGUGCAGAUGCAg 0.2%

9603 4532 UGUGGAUUCUUGAUCGCCU 1.0% 9653 4538 AAUGGGGGUGCAGgUGCAA 0.1%

9604 4532 UGUGGAUUgUUGAUCGUCU 0.1% 9654 4538 gAUGGGGGUGCAGAUGCAA 1.1%

9605 4532 UGUGGAUUaUUGAUCGcCU 5.6% 9655 4538 gAUGGGGGUGCAGAUGCAg 1.3%

9606 4532 UGUGGAUUaUUGAUCGgCU 1.0% 9656 4538 AAUGGGaGUGCAGAUGCAA 2.3%

4533 4533 UCUUCUUGAAAAUUUGCAG 90.5% 9657 4538 AAUGGGGGUGCAaAUGCAA 0.3%

9607 4533 UCUUuUUGAAAAUUUGCAG 0.1% 9658 4538 AAUGGGaGUGCAGAUGCAg 1.6%

9608 4533 cCUUCUUGAAAAUUUGCAG 0.1% 9659 4538 gAUGGGaGUGCAGAUGCAg 0.2%

9609 4533 UCUcCUUGAAAAUUUGCAG 5.9% 9660 4538 AAUGGGaGUaCAGAUGCAA 0.1%

9610 4533 UCUUaUUGAAAAUUUGCAG 1.7% 9661 4538 AAUGGGaGUaCAGAUGCAg 1.9%

9611 4533 UCUUCUcGAAAAUUUGCAG 0.1% 9662 4538 gAUGGGaGUGCAaAUGCAA 0.1%

Table 20-6: Conserved and minor variant 19-mer sequences from the Influenza A segment 8 (NSl & NS2) sequences listed in Table 1-8. The conserved sequences match at least 89% of the listed viral sequences, and the variants contain 3 or fewer nucleotide changes from the reference sequence.

Seq Ref % 9703 5026 AUGGAAUGAUAAnACAGUU 0.1%

ID ID Match Seq Total 9704 5026 AUGGAgUGAUAACACuGUU 0.1%

5021 5021 AUAACACAGUUCGAGUCUC 98.2% 5027 5027 AAUGGAAUGAUAACACAGU 97.9%

9663 5021 gUAACACgGUUCGAGUCUC 0.4% 9705 5027 AAUGGAgUGAUAACACAGU 0.1%

9664 5021 AUAACACAGcUCGAGUCUC 0.1% 9706 5027 AgUGGAAUGAUAACACAGU 0.2%

9665 5021 AUAACACAGUUaGAGUCUC 0.1% 9707 5027 AAUGGAAUGgUAACACgGU 0.4%

9666 5021 AUAACACAGUUCaAGUCUC 0.2% 9708 5027 AAUGGAAUaAUAACACAGU 1.1%

9667 5021 AUAACACAGUUCGAGUCaC 0.6% 9709 5027 AAUGGAAUGAUAACACAGc 0.1%

9668 5021 AUAACACuGUUCGAGUCUC 0.1% 9710 5027 AAUGGAAUGAUAAnACAGU 0.1%

9669 5021 AUAAnACAGUUCGAGUCUC 0.1% 9711 5027 AAUGGAgUGAUAACACuGU 0.1%

9670 5021 AUAACACAGUUCaAGUCaC 0.2% 5028 5028 UUGAAUGGAAUGAUAACAC 97.8%

5022 5022 UAACACAGUUCGAGUCUCU 98.2% 9712 5028 UUGAAUGGAAUGgUAACAC 0.4%

9671 5022 UAACACgGUUCGAGUCUCU 0.4% 9713 5028 UUGAAUGGAgUGAUAACAC 0.2%

9672 5022 UAACACAGcUCGAGUCUCU 0.1% 9714 5028 UUGAgUGGAAUGAUAACAC 0.2%

9673 5022 UAACACAGUUaGAGUCUCU 0.1% 9715 5028 nUGAAUGGAAUGAUAACAC 0.2%

9674 5022 UAACACAGUUCaAGUCUCU 0.2% 9716 5028 UUGAAUGGAAUaAUAACAC 1.1%

9675 5022 UAACACAGUUCGAGUCaCU 0.6% 9717 5028 UUGAAUGGAAUGAUAAnAC 0.1%

9676 5022 UAACACuGUUCGAGUCUCU 0.1% 5029 5029 UGAUAACACAGUUCGAGUC 97.7%

9677 5022 UAAnACAGUUCGAGUCUCU 0.1% 9718 5029 UGgUAACACgGUUCGAGUC 0.4%

9678 5022 UAACACAGUUCaAGUCaCU 0.2% 9719 5029 UaAUAACACAGUUCGAGUC 1.1%

5023 5023 UGAAUGGAAUGAUAACACA 98.0% 9720 5029 UGAUAACACAGCUCGAGUC 0.1%

9679 5023 UGAAUGGAgUGAUAACACA 0.1% 9721 5029 UGAUAACACAGUUaGAGUC 0.1%

9680 5023 UGAgUGGAAUGAUAACACA 0.2% 9722 5029 UGAUAACACAGUUCaAGUC 0.4%

9681 5023 UGAAUGGAAUGgUAACACg 0.4% 9723 5029 UGAUAACACuGUUCGAGUC 0.1%

9682 5023 UGAAUGGAAUaAUAACACA 1.1% 9724 5029 UGAUAAnACAGUUCGAGUC 0.1%

9683 5023 UGAAUGGAAUGAUAAnACA 0.1% 5030 5030 AUGAUAACACAGUUCGAGU 97.6%

9684 5023 UGAAUGGAgUGAUAACACu 0.1% 9725 5030 gUGAUAACACAGUUCGAGU 0.1%

5024 5024 GAAUGGAAUGAUAACACAG 98.0% 9726 5030 AUGgUAACACgGUUCGAGU 0.4%

9685 5024 GAAUGGAgUGAUAACACAG 0.1% 9727 5030 AUaAUAACACAGUUCGAGU 1.1%

9686 5024 GAgUGGAAUGAUAACACAG 0.2% 9728 5030 AUGAUAACACAGCUCGAGU 0.1%

9687 5024 GAAUGGAAUGgUAACACgG 0.4% 9729 5030 AUGAUAACACAGUUaGAGU 0.1%

9688 5024 GAAUGGAAUaAUAACACAG 1.1% 9730 5030 AUGAUAACACAGUUCaAGU 0.4%

9689 5024 GAAUGGAAUGAUAAnACAG 0.1% 9731 5030 AUGAUAAnACAGUUCGAGU 0.1%

9690 5024 GAAUGGAgUGAUAACACuG 0.1% 9732 5030 gUGAUAACACuGUUCGAGU 0.1%

5025 5025 UGGAAUGAUAACACAGUUC 98.0% 5031 5031 GGAAUGAUAACACAGUUCG 97.6%

9691 5025 UGGAgUGAUAACACAGUUC 0.1% 9733 5031 GGAgUGAUAACACAGUUCG 0.1%

9692 5025 UGGAAUGgUAACACgGUUC 0.4% 9734 5031 GGAAUGgUAACACgGUUCG 0.4%

9693 5025 UGGAAUaAUAACACAGUUC 1.1% 9735 5031 GGAAUaAUAACACAGUUCG 1.1%

9694 5025 UGGAAUGAUAACACAGcUC 0.1% 9736 5031 GGAAUGAUAACACAGCUCG 0.1%

9695 5025 UGGAAUGAUAACACAGUUa 0.1% 9737 5031 GGAAUGAUAACACAGUUaG 0.1%

9696 5025 UGGAAUGAUAAnACAGUUC 0.1% 9738 5031 GGAAUGAUAACACAGUUCa 0.4%

9697 5025 UGGAgUGAUAACACuGUUC 0.1% 9739 5031 GGAAUGAUAAnACAGUUCG 0.1%

5026 5026 AUGGAAUGAUAACACAGUU 97.9% 9740 5031 GGAgUGAUAACACuGUUCG 0.1%

9698 5026 AUGGAgUGAUAACACAGUU 0.1% 5032 5032 GAAUGAUAACACAGUUCGA 97.6%

9699 5026 gUGGAAUGAUAACACAGUU 0.2% 9741 5032 GAgUGAUAACACAGUUCGA 0.1%

9700 5026 AUGGAAUGgUAACACgGUU 0.4% 9742 5032 GAAUGgUAACACgGUUCGA 0.4%

9701 5026 AUGGAAUaAUAACACAGUU 1.1% 9743 5032 GAAUaAUAACACAGUUCGA 1.1%

9702 5026 AUGGAAUGAUAACACAGcϋ 0.1% 9744 5032 GAAUGAUAACACAGCUCGA 0.1%

9745 5032 GAAUGAUAACACAGUUaGA 0.1% 9792 5039 GAGaAUGUCAAAAAUGCAA 0.1%

9746 5032 GAAUGAUAACACAGUUCaA 0.4% 9793 5039 GAGGAUGUCAAAAUUGCAA 0.5%

9747 5032 GAAUGAUAAnACAGUUCGA 0.1% 5040 5040 GAUGUCAAAAAUGCAAUUG 96.8%

9748 5032 GAgUGAUAACACuGUUCGA 0.1% 9794 5040 GAUGUCAAAAAUGCAgUUG 1.5%

5033 5033 AAUGAUAACACAGUUCGAG 97.6% 9795 5040 GAUGUCAAAAAUGCgAUUG 1.0%

9749 5033 AgUGAUAACACAGUUCGAG 0.1% 9796 5040 GAUGUCAAAgAUGCAAUUG 0.1%

9750 5033 AAUGgUAACACgGUUCGAG 0.4% 9797 5040 aAUGUCAAAAAUGCAAUUG 0.1%

9751 5033 AAUaAUAACACAGUUCGAG 1.1% 9798 5040 GAUGUCAAAAAUGCAAUUa 0.1%

9752 5033 AAUGAUAACACAGcUCGAG 0.1% 9799 5040 GAUGUCAAAAUUGCAAUUG 0.5%

9753 5033 AAUGAUAACACAGUUaGAG 0.1% 5041 5041 UGAGGAUGUCAAAAAUGCA 96.4%

9754 5033 AAUGAUAACACAGUUCaAG 0.4% 9800 5041 UGAGGAUGUCAAAAAUGCg 1.0%

9755 5033 AAUGAUAAnACAGUUCGAG 0.1% 9801 5041 UGAGGAUGUCAAAgAUGCA 0.1%

9756 5033 AgUGAUAACACuGUUCGAG 0.1% 9802 5041 aGAGGAUGUCAAAAAUGCA 1.5%

5034 5034 CUUGAAUGGAAUGAUAACA 97.5% 9803 5041 cGAGGAUGUCAAAAAUGCA 0.4%

9757 5034 CUUGAAUGGAAUGgUAACA 0.4% 9804 5041 UaAGGAUGUCAAAAAUGCA 0.1%

9758 5034 CUUGAAUGGAgUGAUAACA 0.2% 9805 5041 UGAGaAUGUCAAAAAUGCA 0.1%

9759 5034 CUUGAgUGGAAUGAUAACA 0.2% 9806 5041 UGAGGAUGUCAAAAUUGCA 0.5%

9760 5034 uUUGAAUGGAAUGAUAACA 0.3% 5042 5042 CGGCUUCGCCGAGAUCAGA 96.2%

9761 5034 CUUGAAUGGAAUaAUAACA 1.1% 9807 5042 CGGCUUCGCCGgGAUCAGA 0.5%

9762 5034 CUUGAAUGGAAUGAUAAnA 0.1% 9808 5042 aGGCUUCGCCGAGAUCAGA 0.1%

9763 5034 nnUGAAUGGAAUGAUAACA 0.2% 9809 5042 CGaCUUCGCCGAGAUCAGA 0.8%

5035 5035 GAUAACACAGUUCGAGUCU 97.1% 9810 5042 CGGaUUCGCCGAGAUCAGA 0.1%

9764 5035 GgUAACACgGUUCGAGUCU 0.4% 9811 5042 CGGCUcCGCCGAGAUCAGA 0.3%

9765 5035 aAUAACACAGUUCGAGUCU 1.1% 9812 5042 CGGCUUCaCCGAGAUCAGA 0.1%

9766 5035 GAUAACACAGcUCGAGUCU 0.1% 9813 5042 CGGCUUCGCaGAGAUCAGA 0.1%

9767 5035 GAUAACACAGUUaGAGUCU 0.1% 9814 5042 CGGCUUCGCCGAGACCAGA 0.3%

9768 5035 GAUAACACAGUUCaAGUCU 0.2% 9815 5042 CGGCUUCGCCGAGAUaAGA 0.1%

9769 5035 GAUAACACAGUUCGAGUCa 0.6% 9816 5042 CGGCUUCGCCGAGAUCAaA 0.4%

9770 5035 GAUAACACuGUUCGAGUCU 0.1% 9817 5042 CGGCUUCGCCGAUAUCAGA 0.2%

9771 5035 GAUAAnACAGUUCGAGUCU 0.1% 9818 5042 CGGgUUuGCCGAGAUCAGA 0.1%

9772 5035 GAUAACACAGUUCaAGUCa 0.2% 9819 5042 aGaCUUCGCCGAGAUCAGA 0.1%

5036 5036 AUGUCAAAAAUGCAAUUGG 96.9% 9820 5042 CGaCUUCGCaGAGAUCAGA 0.1%

9773 5036 AUGUCAAAAAUGCAgUUGG 1.5% 9821 5042 CGGCUcCGCCGAGAUCAaA 0.3%

9774 5036 AUGUCAAAAAUGCgAUUGG 1.0% 9822 5042 CGGCUUCGCCGAnnUCAGA 0.1%

9775 5036 AUGUCAAAgAUGCAAUUGG 0.1% 5043 5043 UGUCAAAAAUGCAAUUGGG 95.6%

9776 5036 AUGUCAAAAAUGCAAUUaG 0.1% 9823 5043 UGUCAAAAAUGCAgUUGGG 1.0%

9777 5036 AUGUCAAAAuUGCAAUUGG 0.5% 9824 5043 UGUCAAAAAUGCgAUUGGG 1.0%

5037 5037 GGAUGUCAAAAAUGCAAUU 96.9% 9825 5043 UGUCAAAgAUGCAAUUGGG 0.1%

9778 5037 GGAUGUCAAAAAUGCAgUU 1.5% 9826 5043 UGUCAAAAAUGCAAUUaGG 0.1%

9779 5037 GGAUGUCAAAAAUGCgAUU 1.0% 9827 5043 UGUCAAAAAUGCAAUUGGa 0.1%

9780 5037 GGAUGUCAAAgAUGCAAUU 0.1% 9828 5043 UGUCAAAAAUGCAAUUGGc 1.2%

9781 5037 GaAUGUCAAAAAUGCAAUU 0.1% 9829 5043 UGUCAAAAuUGCAAUUGGG 0.5%

9782 5037 GGAUGUCAAAAuUGCAAUU 0.5% 9830 5043 UGUCAAAAAUGCAgUUGGa 0.5%

5038 5038 AGGAUGUCAAAAAUGCAAU 96.9% 5044 5044 UCUACAGAGAUUCGCUUGG 95.5%

9783 5038 AGGAUGUCAAAAAUGCAgU 1.5% 9831 5044 UCUACAGAGAUUUGCUUGG 0,2%

9784 5038 AGGAUGUCAAAAAUGCgAU 1.0% 9832 5044 UaUACAGAGAUUCGCUUGG 1.3%

9785 5038 AGGAUGUCAAAgAUGCAAU 0.1% 9833 5044 UCUACAGAaAUUCGCUUGG 0.1%

9786 5038 AGaAUGUCAAAAAUGCAAU 0.1% 9834 5044 UCUACAGAGAUUCaCUUGG 2.6%

9787 5038 AGGAUGUCAAAAuUGCAAU 0.5% 9835 5044 UCUACAGAGAUUCGCUUGa 0.2%

5039 5039 GAGGAUGUCAAAAAUGCAA 96.8% 9836 5044 UCUACAGAGAUUCGCUUnG 0.1%

9788 5039 GAGGAUGUCAAAAAUGCAg 1.5% 9837 5044 UCUACAGAGAUUCUCUUGG 0.1%

9789 5039 GAGGAUGUCAAAAAUGCgA 1.0% 5045 5045 GCAAUUGGGGUCCUCAUCG 94.8%

9790 5039 GAGGAUGUCAAAgAUGCAA 0.1% 9838 5045 GCAAUUGGGGUCCUCgUCG 0.1%

9791 5039 aAGGAUGUCAAAAAUGCAA 0.1% 9839 5045 GCAAUUGGGGUuCUCAUCG 0.1%

9840 5045 GCAgUOGGGGUCCUCAUCG 1.0% 9889 5050 AAAAAUGCAAUUGGGGUUC 0.1%

9841 5045 GCgAUUGGGGUCCUCAUCG 1.0% 9890 5050 AAAAAUGCAgUUGGGGUCC 1.0%

9842 5045 GCAAUUaGGGUCCUCAUCG 0.1% 9891 5050 AAAAAUGCgAUUGGGGUCC 1,0%

9843 5045 GCAAUUGGaGUCCUCAUCG 0.1% 9892 5050 AAAgAUGCAAUUGGGGUCC 0.1%

9844 5045 GCAAUUGGcGUCCUCAUCG 1.2% 9893 5050 AAAAAUGCAAUUaGGGUCC 0.1%

9845 5045 GCAAUUGGGaUCCUCAUCG 1.0% 9894 5050 AAAAAUGCAAUUGGaGUCC 0.1%

9846 5045 GCAAUUGGGcUCCUCAUCG 0.1% 9895 5050 AAAAAUGCAAUUGGCGUCC 1.2%

9847 5045 GCAgUUGGaGUCCUCAUCG 0.5% 9896 5050 AAAAAUGCAAUUGGGaUCC 1.1%

9848 5045 GCAAUUGGGaUCCUCAUCa 0.1% 9897 5050 AAAAAUGCAAUUGGGCUCC 0.1%

5046 5046 CAAUUGGGGUCCUCAUCGG 94.8% 9898 5050 AAAAuUGCAAUUGGGGUCC 0.5%

9849 5046 CAAUUGGGGUCCUCgUCGG 0.1% 9899 5050 AAAAAUGCAgUUGGaGUCC 0.5%

9850 5046 CAAUUGGGGUUCUCAUCGG 0.1% 5051 5051 AAAAUGCAAUUGGGGUCCU 94.3%

9851 5046 CAgUUGGGGUCCUCAUCGG 1.0% 9900 5051 AAAAUGCAAUUGGGGϋuCU 0.1%

9852 5046 CgAUUGGGGUCCUCAUCGG 1.0% 9901 5051 AAAAUGCAgUUGGGGUCCU 1.0%

9853 5046 CAAUUaGGGUCCUCAUCGG 0.1% 9902 5051 AAAAUGCgAUUGGGGUCCU 1.0%

9854 5046 CAAUUGGaGUCCUCAUCGG 0.1% 9903 5051 AAgAUGCAAUUGGGGUCCU 0.1%

9855 5046 CAAUUGGcGUCCUCAUCGG 1.2% 9904 5051 AAAAUGCAAUUaGGGUCCU 0.1%

9856 5046 CAAUUGGGaUCCUCAUCGG 1.0% 9905 5051 AAAAUGCAAUUGGaGUCCU 0.1%

9857 5046 CAAUUGGGcUCCUCAUCGG 0.1% 9906 5051 AAAAUGCAAUUGGCGUCCU 1.2%

9858 5046 CAgUUGGaGUCCUCAUCGG 0.5% 9907 5051 AAAAUGCAAUUGGGaUCCU 1.1%

5047 5047 UGCAAUUGGGGUCCUCAUC 94.8% 9908 5051 AAAAUGCAAUUGGGCUCCU 0.1%

9859 5047 UGCAAUUGGGGUCCUCgUC 0.1% 9909 5051 AAAuUGCAAUUGGGGUCCU 0.5%

9860 5047 UGCAAUUGGGGUUCUCAUC 0.1% 9910 5051 AAAAUGCAgUUGGaGUCCU 0.5%

9861 5047 UGCAgUUGGGGUCCUCAUC 1.0% 5052 5052 AUGCAAUUGGGGUCCUCAU 94.3%

9862 5047 UGCgAUUGGGGUCCUCAUC 1.0% 9911 5052 AUGCAAUUGGGGUCCUCgU 0.1%

9863 5047 UGCAAUUaGGGUCCUCAUC 0.1% 9912 5052 AUGCAAUUGGGGUuCUCAU 0.1%

9864 5047 UGCAAUUGGaGUCCUCAUC 0.1% 9913 5052 AUGCAgUUGGGGUCCUCAU 1.0%

9865 5047 UGCAAUUGGcGUCCUCAUC 1.2% 9914 5052 AUGCgAUUGGGGUCCUCAU 1.0%

9866 5047 UGCAAUUGGGaUCCUCAUC 1.1% 9915 5052 AUGCAAUUaGGGUCCUCAU 0.1%

9867 5047 UGCAAUUGGGcUCCUCAUC 0.1% 9916 5052 AUGCAAUUGGaGUCCUCAU 0.1%

9868 5047 UGCAgUUGGaGUCCUCAUC 0.5% 9917 5052 AUGCAAUUGGCGUCCUCAU 1.2%

5048 5048 UCAAAAAUGCAAUUGGGGU 94.4% 9918 5052 AUGCAAUUGGGaUCCUCAU 1.1%

9869 5048 UCAAAAAUGCAgUUGGGGU 1.0% 9919 5052 AUGCAAUUGGGCUCCUCAU 0.1%

9870 5048 UCAAAAAUGCgAUUGGGGU 1.0% 9920 5052 uUGCAAUUGGGGUCCUCAU 0.5%

9871 5048 UCAAAgAUGCAAUUGGGGU 0.1% 9921 5052 AUGCAgUUGGaGUCCUCAU 0.5%

9872 5048 UCAAAAAUGCAAUUaGGGU 0.1% 5053 5053 CAAAAAUGCAAUUGGGGUC 94.3%

9873 5048 UCAAAAAUGCAAUUGGaGU 0.1% 9922 5053 CAAAAAUGCAAUUGGGGUu 0.1%

9874 5048 UCAAAAAUGCAAUUGGCGU 1.2% 9923 5053 CAAAAAUGCAgUUGGGGUC 1.0%

9875 5048 UCAAAAAUGCAAUUGGGaU 1.1% 9924 5053 CAAAAAUGCgAUUGGGGUC 1.0%

9876 5048 UCAAAAAUGCAAUUGGGCU 0.1% 9925 5053 CAAAgAUGCAAUUGGGGUC 0.1%

9877 5048 UCAAAAuUGCAAUUGGGGU 0.5% 9926 5053 CAAAAAUGCAAUUaGGGUC 0.1%

9878 5048 UCAAAAAUGCAgUUGGaGU 0.5% 9927 5053 CAAAAAUGCAAUUGGaGUC 0.1%

5049 5049 GUCAAAAAUGCAAUUGGGG 94.4% 9928 5053 CAAAAAUGCAAUUGGCGUC 1.2%

9879 5049 GUCAAAAAUGCAgUUGGGG 1.0% 9929 5053 CAAAAAUGCAAUUGGGaUC 1.1%

9880 5049 GUCAAAAAUGCgAUUGGGG 1.0% 9930 5053 CAAAAAUGCAAUUGGGcUC 0.1%

9881 5049 GUCAAAgAUGCAAUUGGGG 0.1% 9931 5053 CAAAAuUGCAAUUGGGGUC 0.5%

9882 5049 GUCAAAAAUGCAAUUaGGG 0.1% 9932 5053 CAAAAAUGCAgUUGGaGUC 0.5%

9883 5049 GUCAAAAAUGCAAUUGGaG 0.1% 5054 5054 AAAUGCAAUUGGGGUCCUC 94.3%

9884 5049 GUCAAAAAUGCAAUUGGCG 1.2% 9933 5054 AAAUGCAAUUGGGGUUCUC 0.1%

9885 5049 GUCAAAAAUGCAAUUGGGa 1.1% 9934 5054 AAAUGCAgUUGGGGUCCUC 1.0%

9886 5049 GUCAAAAAUGCAAUUGGGc 0.1% 9935 5054 AAAUGCgAUUGGGGUCCUC 1.0%

9887 5049 GUCAAAAuUGCAAUUGGGG 0.5% 9936 5054 AgAUGCAAUUGGGGUCCUC 0.1%

9888 5049 GUCAAAAAUGCAgUUGGaG 0.5% 9937 5054 AAAUGCAAUUaGGGUCCUC 0.1%

5050 5050 AAAAAUGCAAUUGGGGUCC 94.3% 9938 5054 AAAUGCAAUUGGaGUCCUC 0.1%

2Q9

9939 5054 AAAUGCAAUUGGcGUCCUC 1.2% 9989 5058 CUUGAUCGGaOOCGCCGAG 0.1%

9940 5054 AaAUGCAAUUGGGaUCCUC 1.1% 9990 5058 COUGAUCGGCUcCGCCGAG 0.7%

9941 5054 AAAUGCAAUUGGGCUCCUC 0.1% 9991 5058 CUUGAUCGGCUUCaCCGAG 0.1%

9942 5054 AAuUGCAAUUGGGGUCCUC 0.5% 9992 5058 CUUGAUCGGCUUCGCaGAG 0.1%

9943 5054 AAAUGCAgUUGGaGUCCUC 0.5% 9993 5058 CUUGAUCGGCUUCGCCGAn 0.1%

5055 5055 AAUGCAAUUGGGGUCCUCA 94.2% 9994 5058 CUUGAUCGGCUUCGCCGAu 0.2%

9944 5055 AAUGCAAOUGGGGUCCUCg 0.1% 9995 5058 CUUGAUaGaCUUCGCCGAG 0.1%

9945 5055 AAUGCAAUUGGGGUUCUCA 0.1% 9996 5058 CUUGAUCGaCOUCGCaGAG 0.1%

9946 5055 AAUGCAgUUGGGGUCCUCA 1.0% 9997 5058 CUaGAcCGGCUUCGCCGgG 0.2%

9947 5055 AAUGCgAUUGGGGUCCUCA 1.0% 9998 5058 CUUGAcCGGgUUuGCCGAG 0.1%

9948 5055 gAUGCAAUUGGGGUCCUCA 0.1% 5059 5059 GAAAGCGAAUOϋCAGUGUG 93.6%

9949 5055 AAUGCAAUUaGGGUCCUCA 0.1% 9999 5059 GAAgGCGAAUUUCAGUGUG 0.1%

9950 5055 AAUGCAAUUGGaGUCCUCA 0.1% 10000 5059 GAgAGCGAAUUUCAGUGUG 0.1%

9951 5055 AAUGCAAUUGGCGUCCUCA 1.2% 10001 5059 aAAAGCGAAUUUCAGUGOG 0.2%

9952 5055 AAUGCAAUUGGGaUCCUCA 1.1% 10002 5059 GAAAGCaAAUUUCAGUGUG 0.3%

9953 5055 AAUGCAAUUGGGCUCCUCA 0.1% 10003 5059 GAAAGCGAAcUUCAGUGOG 1.3%

9954 5055 AuUGCAAUUGGGGUCCUCA 0.5% 10004 5059 GAAAGCGAAUaUCAGUGUG 0.1%

9955 5055 AAUGCAgUUGGaGUCCUCA 0.5% 10005 5059 GAAAGCGAAUUUCAGCGUG 0.2%

5056 5056 UGAAAGCGAAUUUCAGUGU 93.8% 10006 5059 GAAAGCGAAUUUCAGUGUa 0.2%

9956 5056 UGAAgGCGAAUUUCAGUGU 0.1% 10007 5059 GAAAGCuAAUUUCAGOGUG 0.4%

9957 5056 UGAgAGCGAAUUUCAGUGU 0.1% 10008 5059 uAAAGCGAAOUUCAGUGUG 0.1%

9958 5056 UaAAAGCGAAUUUCAGUGU 0.2% 10009 5059 aAAAGCGAAcUUCAGUGUG 0.3%

9959 5056 UGAAAGCaAAUUUCAGUGU 0.3% 10010 5059 GAAAGCaAAcUUCAGUGUG 0.9%

9960 5056 UGAAAGCGAACUUCAGUGU 1.3% 10011 5059 GAAAGCaAAcUUuAGUGUG 2.2%

9961 5056 UGAAAGCGAAUaUCAGUGU 0.1% 5060 5060 AUCGGCUUCGCCGAGAUCA 93.4%

9962 5056 UGAAAGCGAAUUUCAGCGU 0.2% 10012 5060 AUCGGCUUCGCCGgGAUCA 0.3%

9963 5056 UGAAAGCuAAUUUCAGUGU 0.4% 10013 5060 AcCGGCUUCGCCGAGAUCA 3.1%

9964 5056 UuAAAGCGAAUUUCAGUGU 0.1% 10014 5060 AUaGGCOUCGCCGAGAUCA 0.1%

9965 5056 UaAAAGCGAAcUUCAGUGU 0.3% 10015 5060 AUCGaCUUCGCCGAGAUCA 0.8%

9966 5056 UGAAAGCaAAcUUCAGUGU 0.9% 10016 5060 AUCGGaUUCGCCGAGAUCA 0.1%

9967 5056 UGAAAGCaAAcUUuAGUGU 2.2% 10017 5060 AUCGGCUcCGCCGAGAUCA 0.6%

5057 5057 UUGAUCGGCUUCGCCGAGA 93.7% 10018 5060 AUCGGCUUCaCCGAGAUCA 0.1%

9968 5057 UUGAUCGGCUUCGCCGgGA 0.3% 10019 5060 AUCGGCUUCGCaGAGAOCA 0.1%

9969 5057 UcGAUCGGCUUCGCCGAGA 0.1% 10020 5060 AOCGGCOUCGCCGAGAcCA 0.3%

9970 5057 UUGAcCGGCUUCGCCGAGA 3.1% 10021 5060 AUCGGCUUCGCCGAGAUaA 0.1%

9971 5057 UUGAUaGGCUUCGCCGAGA 0.1% 10022 5060 AUCGGCUUCGCCGAuAOCA 0.2%

9972 5057 UUGAUCGaCUUCGCCGAGA 0.8% 10023 5060 AcCGGCOOCGCCGgGAUCA 0.2%

9973 5057 UOGAUCGGaUUCGCCGAGA 0.1% 10024 5060 AUaGaCUUCGCCGAGAUCA 0.1%

9974 5057 UUGAUCGGCUCCGCCGAGA 0.7% 10025 5060 AUCGaCUUCGCaGAGAOCA 0.1%

9975 5057 UUGAUCGGCUUCaCCGAGA 0.1% 10026 5060 AOCGGCϋcCGCCGAGAUCu 0.1%

9976 5057 UUGAUCGGCUOCGCaGAGA 0.1% 10027 5060 AUCGGCUUCGCCGAnnUCA 0.1%

9977 5057 UUGAUCGGCUUCGCCGAUA 0.2% 10028 5060 AcCGGgUUuGCCGAGAUCA 0.1%

9978 5057 UUGAUaGaCUUCGCCGAGA 0.1% 5061 5061 GAUCGGCUUCGCCGAGAUC 93.4%

9979 5057 UUGAUCGaCOUCGCaGAGA 0.1% 10029 5061 GAUCGGCUUCGCCGgGAUC 0.3%

9980 5057 UUGAUCGGCUUCGCCGAnn 0.1% 10030 5061 GAcCGGCUOCGCCGAGAOC 3.1%

9981 5057 UaGAcCGGCUOCGCCGgGA 0.2% 10031 5061 GAUaGGCUUCGCCGAGAUC 0.1%

9982 5057 UUGAcCGGgUUuGCCGAGA 0.1% 10032 5061 GAUCGaCUUCGCCGAGAUC 0.8%

5058 5058 CUUGAUCGGCOUCGCCGAG 93.6% 10033 5061 GAUCGGaUUCGCCGAGAUC 0.1%

9983 5058 CUUGAUCGGCUUCGCCGgG 0.3% 10034 5061 GAUCGGCUcCGCCGAGAUC 0.7%

9984 5058 aUUGAOCGGCUUCGCCGAG 0.1% 10035 5061 GAUCGGCUUCaCCGAGAUC 0.1%

9985 5058 CUcGAUCGGCUUCGCCGAG 0.1% 10036 5061 GAUCGGCUUCGCaGAGAUC 0.1%

9986 5058 CUUGAcCGGCUUCGCCGAG 3.1% 10037 5061 GAUCGGCUUCGCCGAGAcC 0.3%

9987 5058 CUUGAUaGGCUUCGCCGAG 0.1% 10038 5061 GAUCGGCUUCGCCGAGAUa 0.1%

9988 5058 CUUGAUCGaCOUCGCCGAG 0.8% 10039 5061 GAUCGGCUUCGCCGAUAUC 0.2%

10040 5061 GAcCGGCUOCGCCGgGAUC 0.2% 10091 5064 GcGCAAUUGUUGGCGAAAU 0.6%

10041 5061 GAUaGaCUUCGCCGAGAUC 0.1% 10092 5064 GAaCAAUUGUUGGuGAAAU 0.1%

10042 5061 GAUCGaCUUCGCaGAGAUC 0.1% 10093 5064 GgaCAAUUGUUGGCGAAAU 0.2%

10043 5061 GAUCGGCUUCGCCGAnnUC 0.1% 10094 5064 GgGCAAUUGUaGGCGAAAU 0.1%

10044 5061 GAcCGGgUUuGCCGAGAUC 0.1% 10095 5064 GAcgAAUUGUUGGCGAAAU 0.1%

5062 5062 UGAUCGGCUUCGCCGAGAU 93.4% 10096 5064 GAGCAAUaGUgGGCGAAAU 1.6%

10045 5062 UGAUCGGCUUCGCCGgGAU 0.3% 10097 5064 GAGCAAUcGUUGGaGAAAU 0.3%

10046 5062 cGAUCGGCUUCGCCGAGAU 0.1% 10098 5064 GAGCAAUcGUUGGaGAgAU 0.1%

10047 5062 UGAcCGGCUUCGCCGAGAU 3.1% 5065 5065 GGAGCAAUUGUUGGCGAAA 92.8%

10048 5062 UGAUaGGCUUCGCCGAGAU 0.1% 10099 5065 GGAGCAAUUGUUGGUGAAA 0.1%

10049 5062 UGAUCGaCUUCGCCGAGAU 0.8% 10100 5065 GGAGCAgUUGUUGGCGAAA 0.2%

10050 5062 UGAUCGGaUUCGCCGAGAU 0.1% 10101 5065 GGAGuAAUUGUUGGCGAAA 0.7%

10051 5062 UGAUCGGCUcCGCCGAGAU 0.7% 10102 5065 GGgGCAAUUGUUGGCGAAA 0.7%

10052 5062 UGAUCGGCUUCaCCGAGAU 0.1% 10103 5065 GGgGCAAUUGUUGGCGAgA 0.1%

10053 5062 UGAUCGGCUUCGCaGAGAU 0.1% 10104 5065 GaAGCAAUUGUUGGCGAAA 0.1%

10054 5062 UGAUCGGCUUCGCCGAGAc 0.3% 10105 5065 GGAaCAAUUGUUGGCGAAA 0.3%

10055 5062 UGAUCGGCUUCGCCGAUAU 0.2% 10106 5065 GGAGCAAUUaUUGGCGAAA 0.1%

10056 5062 UGAUaGaCUUCGCCGAGAU 0.1% 10107 5065 GGAGCAAUUGUaGGCGAAA 0.1%

10057 5062 UGAUCGaCUUCGCaGAGAU 0.1% 10108 5065 GGAGCAAUUGUgGGCGAAA 0.1%

10058 5062 UGAUCGGCUUCGCCGAnnU 0.1% 10109 5065 GGAGCAAUUGUUGCCGAAA 0.1%

10059 5062 aGAcCGGCUUCGCCGgGAϋ 0.2% 10110 5065 GGcGCAAUUGUUGGCGAAA 0.6%

10060 5062 UGAcCGGgUUuGCCGAGAU 0.1% 10111 5065 GGAaCAAUUGUUGGuGAAA 0.1%

5063 5063 UCGGCUUCGCCGAGAUCAG 93.2% 10112 5065 GGgaCAAUUGUUGGCGAAA 0.2%

10061 5063 UCGGCUUCGCCGgGAUCAG 0.3% 10113 5065 GGgGCAAUUGUaGGCGAAA 0.1%

10062 5063 cCGGCUUCGCCGAGAUCAG 2.9% 10114 5065 GGAcgAAUUGUUGGCGAAA 0.1%

10063 5063 UaGGCUUCGCCGAGAUCAG 0.1% 10115 5065 GGAGCAAUaGUgGGCGAAA 1.6%

10064 5063 UCGaCUUCGCCGAGAUCAG 0.8% 10116 5065 GGAGCAAUcGUUGGaGAAA 0.3%

10065 5063 UCGGaUUCGCCGAGAUCAG 0.1% 10117 5065 GGAGCAAUcGUUGGaGAgA 0.1%

10066 5063 UCGGCUcCGCCGAGAUCAG 0.3% 5066 5066 UUCCUUGAUCGGCUUCGCC 92.7%

10067 5063 UCGGCUUCaCCGAGAUCAG 0.1% 10118 5066 UUuCUUGAUCGGCUUCGCC 1.5%

10068 5063 UCGGCUUCGCaGAGAUCAG 0.1% 10119 5066 UUCaUUGAUCGGCUUCGCC 0.1%

10069 5063 UCGGCUUCGCCGAGACCAG 0.3% 10120 5066 UUCCUUGAcCGGCUUCGCC 3.1%

10070 5063 UCGGCUUCGCCGAGAUaAG 0.1% 10121 5066 UUCCUUGAUaGGCUUCGCC 0.1%

10071 5063 UCGGCUUCGCCGAGAUCAa 0.2% 10122 5066 UUCCUUGAUCGaCUUCGCC 0.8%

10072 5063 UCGGCUUCGCCGAUAUCAG 0.2% 10123 5066 UUCCUUGAUCGGaUUCGCC 0.1%

10073 5063 cCGGCUUCGCCGgGAUCAG 0.2% 10124 5066 UUCCUUGAUCGGCUCCGCC 0.7%

10074 5063 cCGGCUUCGCCGAGAUCAa 0.2% 10125 5066 UUCCUUGAUCGGCUUCaCC 0.1%

10075 5063 UaGaCUUCGCCGAGAUCAG 0.1% 10126 5066 UUCCUUGAUCGGCUUCGCa 0.1%

10076 5063 UCGaCUUCGCaGAGAUCAG 0.1% 10127 5066 UUuCUcGAUCGGCUUCGCC 0.1%

10077 5063 UCGGCUcCGCCGAGAUCAa 0.3% 10128 5066 UUCCUUGAUaGaCUUCGCC 0.1%

10078 5063 UCGGCUUCGCCGAnnUCAG 0.1% 10129 5066 UUCCUUGAcCGGgUUuGCC 0.1%

10079 5063 cCGGgUUuGCCGAGAUCAG 0.1% 10130 5066 UUuCϋaGAcCGGCUUCGCC 0.2%

5064 5064 GAGCAAUUGUUGGCGAAAU 92.8% 5067 5067 UCCUUGAUCGGCUUCGCCG 92.7%

10080 5064 GAGCAAUUGUUGGUGAAAU 0.1% 10131 5067 UuCUUGAUCGGCUUCGCCG 1.5%

10081 5064 GAGCAgUUGUUGGCGAAAU 0.2% 10132 5067 UCaUUGAUCGGCUUCGCCG 0.1%

10082 5064 GAGuAAUUGUUGGCGAAAU 0.7% 10133 5067 UCCUUGAcCGGCUUCGCCG 3.1%

10083 5064 GgGCAAUUGUUGGCGAAAU 0.7% 10134 5067 UCCUUGAUaGGCUUCGCCG 0.1%

10084 5064 GgGCAAUUGUUGGCGAgAU 0.1% 10135 5067 UCCUUGAUCGaCUUCGCCG 0.8%

10085 5064 aAGCAAUUGUUGGCGAAAϋ 0.1% 10136 5067 UCCUUGAUCGGaUUCGCCG 0.1%

10086 5064 GAaCAAUUGUUGGCGAAAU 0.3% 10137 5067 UCCUUGAUCGGCUCCGCCG 0.7%

10087 5064 GAGCAAUUaUUGGCGAAAU 0.1% 10138 5067 UCCUUGAUCGGCUUCaCCG 0.1%

10088 5064 GAGCAAUUGUaGGCGAAAU 0.1% 10139 5067 UCCUUGAUCGGCUUCGCaG 0.1%

10089 5064 GAGCAAUUGUgGGCGAAAU 0.1% 10140 5067 UuCUcGAUCGGCUUCGCCG 0.1%

10090 5064 GAGCAAUUGUUGCCGAAAU 0.1% 10141 5067 UCCUUGAUaGaCUUCGCCG 0.1%

10142 5067 UCCUUGAcCGGgUUuGCCG 0.1% 10193 5070 CCUUGAUCGaCUUCGCaGA 0.1%

10143 5067 UuCUaGAcCGGCUUCGCCG 0.2% 10194 5070 CCUUGAcCGGgUUuGCCGA 0.1%

5068 5068 GGGAGCAAUUGUUGGCGAA 92.7% 5071 5071 GAGGGAGCAAUUGUUGGCG 92.3%

10144 5068 GGGAGCAAUUGUUGGUGAA 0.1% 10195 5071 GAGGGAGCAAUUGUUGGUG 0.1%

10145 5068 GGGAGCAgUUGUUGGCGAA 0.2% 10196 5071 GAGGGAGCAgUUGUUGGCG 0.2%

10146 5068 GGGAGuAAUUGUUGGCGAA 0.7% 10197 5071 GAGGGAGuAAUUGUUGGCG 0.7%

10147 5068 GGGgGCAAUUGUUGGCGAA 0.7% 10198 5071 GAGGGgGCAAUUGUUGGCG 0.8%

10148 5068 GGGgGCAAUUGUUGGCGAg 0.1% 10199 5071 aAGGGAGCAAUUGUUGGCG 0.2%

10149 5068 aGGAGCAAUUGUUGGCGAA 0.1% 10200 5071 GAaGGAGCAAUUGUUGGCG 0.1%

10150 5068 GGaAGCAAUUGUUGGCGAA 0.1% 10201 5071 GAGGaAGCAAUUGUUGGCG 0.1%

10151 5068 GGGAaCAAUUGUUGGCGAA 0.3% 10202 5071 GAGGGAaCAAUUGUUGGCG 0.3%

10152 5068 GGGAGCAAUUaUUGGCGAA 0.1% 10203 5071 GAGGGAGCAAUUaUUGGCG 0.1%

10153 5068 GGGAGCAAUUGUaGGCGAA 0.1% 10204 5071 GAGGGAGCAAUUGUaGGCG 0.1%

10154 5068 GGGAGCAAUUGUUGCCGAA 0.1% 10205 5071 GAGGGAGCAAUUGUUGCCG 0.1%

10155 5068 GGGcGCAAUUGUUGGCGAA 0.6% 10206 5071 GAGGGcGCAAUUGUUGGCG 0.6%

10156 5068 GGGAaCAAUUGUUGGuGAA 0.1% 10207 5071 GcGGGAGCAAUUGUUGGCG 0.2%

10157 5068 GGGgaCAAUUGUUGGCGAA 0.2% 10208 5071 GAGGGAaCAAUUGUUGGuG 0.1%

10158 5068 aGGAGCAAUUGUgGGCGAA 0.1% 10209 5071 GAGGGgaCAAUUGUUGGCG 0.2%

10159 5068 GGGAcgAAUUGUUGGCGAA 0.1% 10210 5071 GAaGGAGCAAUUGUgGGCG 0.1%

10160 5068 GGGAGCAAUcGUUGGaGAA 0.3% 10211 5071 GAGGGAcgAAUUGUUGGCG 0.1%

10161 5068 aGGgGCAAUUGUaGGCGAA 0.1% 10212 5071 GAGGGAGCAAUcGUUGGaG 0.4%

10162 5068 GGGAGCAAUcGUUGGaGAg 0.1% 10213 5071 GAaGGgGCAAUUGUaGGCG 0.1%

5069 5069 AGGGAGCAAUUGUUGGCGA 92.5% 5072 5072 AGGGCUUUCACCGAAGAGG 92.3%

10163 5069 AGGGAGCAAUUGUUGGUGA 0.1% 10214 5072 AGGGCUUUCACCGAgGAGG 0.1%

10164 5069 AGGGAGCAgUUGUUGGCGA 0.2% 10215 5072 AGGGCUUUCACUGAAGAGG 0.3%

10165 5069 AGGGAGuAAUUGUUGGCGA 0.7% 10216 5072 AGGGCUUUCAUCGAAGAGG 0.1%

10166 5069 AGGGgGCAAUUGUUGGCGA 0.8% 10217 5072 AGGGCUUUuACCGAAGAGG 0.1%

10167 5069 AaGGAGCAAUUGUUGGCGA 0.1% 10218 5072 AGGGCUUUCACuGAgGAGG 0.1%

10168 5069 AGGaAGCAAUUGUUGGCGA 0.1% 10219 5072 AGaGCUUUCACCGAAGAGG 0.1%

10169 5069 AGGGAaCAAUUGUUGGCGA 0.3% 10220 5072 AGGGCcUUCACCGAAGAGG 0.7%

10170 5069 AGGGAGCAAUUaUUGGCGA 0.1% 10221 5072 AGGGCUaUCACCGAAGAGG 0.1%

10171 5069 AGGGAGCAAUUGUaGGCGA 0.1% 10222 5072 AGGGCUUUCACaGAAGAGG 0.1%

10172 5069 AGGGAGCAAUUGUUGCCGA 0.1% 10223 5072 AGGGCUUUCACCaAAGAGG 0.1%

10173 5069 AGGGcGCAAUUGUUGGCGA 0.6% 10224 5072 AGGGCUUUCACCGAAaAGG 0.2%

10174 5069 cGGGAGCAAUUGUUGGCGA 0.2% 10225 5072 AGGGCUUUCACCGAAGAaG 0.1%

10175 5069 AGGGAaCAAUUGUUGGuGA 0.1% 10226 5072 AGGGCUUUCACCGAAGcGG 0.2%

10176 5069 AGGGgaCAAUUGUUGGCGA 0.2% 10227 5072 AGGGgUUUCACCGAAGAGG 0.1%

10177 5069 AaGGAGCAAUUGUgGGCGA 0.1% 10228 5072 AGGGCcUUCACCGAgGAGG 2.0%

10178 5069 AGGGAcgAAUUGUUGGCGA 0.1% 5073 5073 GGGCUUUCACCGAAGAGGG 92.2%

10179 5069 AGGGAGCAAUcGUUGGaGA 0.4% 10229 5073 GGGCUUUCACCGAgGAGGG 0.1%

10180 5069 AaGGgGCAAUUGUaGGCGA 0.1% 10230 5073 GGGCUUUCACUGAAGAGGG 0.3%

5070 5070 CCUUGAUCGGCUUCGCCGA 92.4% 10231 5073 GGGCUUUCAuCGAAGAGGG 0.1%

10181 5070 CCUUGAUCGGCUUCGCCGg 0.3% 10232 5073 GGGCUUUuACCGAAGAGGG 0.1%

10182 5070 uCUUGAUCGGCUUCGCCGA 1.5% 10233 5073 GGGCUUUCACuGAgGAGGG 0.1%

10183 5070 CaUUGAUCGGCUUCGCCGA 0.1% 10234 5073 GaGCUUUCACCGAAGAGGG 0.1%

10184 5070 CCUUGAcCGGCUUCGCCGA 3.1% 10235 5073 GGGCcUUCACCGAAGAGGG 0.7%

10185 5070 CCUUGAUaGGCUUCGCCGA 0.1% 10236 5073 GGGCUaUCACCGAAGAGGG 0.1%

10186 5070 CCUUGAUCGaCUUCGCCGA 0.8% 10237 5073 GGGCUUUCACaGAAGAGGG 0.1%

10187 5070 CCUUGAUCGGaUUCGCCGA 0.1% 10238 5073 GGGCUUUCACCaAAGAGGG 0.1%

10188 5070 CCUUGAUCGGCUCCGCCGA 0.7% 10239 5073 GGGCUUUCACCGAAaAGGG 0.2%

10189 5070 CCUUGAUCGGCUUCaCCGA 0.1% 10240 5073 GGGCUUUCACCGAAGAaGG 0.1%

10190 5070 CCUUGAUCGGCUUCGCaGA 0.1% 10241 5073 GGGCUUUCACCGAAGAGGa 0.1%

10191 5070 uCUcGAUCGGCUUCGCCGA 0.1% 10242 5073 GGGCUUUCACCGAAGCGGG 0.2%

10192 5070 CCUUGAUaGaCUUCGCCGA 0.1% 10243 5073 GGGgUUUCACCGAAGAGGG 0.1%

10244 5073 GGGCcUUCACCGAgGAGGG 2.0% 10293 5078 AAGGGCUUUCACCGAgGAG 0.1%

5074 5074 AUAAGAUGGCUGAUUGAAG 92.1% 10294 5078 AAGGGCUUUCACUGAAGAG 0.3%

10245 5074 AUAAGAUGGCUGAUUGAgG 0.1% 10295 5078 AAGGGCUUUCAUCGAAGAG 0.1%

10246 5074 AUAAGAUGGuUGAUUGAAG 1.6% 10296 5078 AAGGGCUUUuACCGAAGAG 0.1%

10247 5074 AUAAGgUGGCUGAUUGAAG 1.8% 10297 5078 gAGGGCUUUCACCGAAGAG 0.4%

10248 5074 AUgAGAUGGCUGAUUGAAG 0.1% 10298 5078 AAGGGCUUUCACuGAgGAG 0.1%

10249 5074 AUAAGgUGGCUGAUUGAgG 0.1% 10299 5078 AAGaGCUUUCACCGAAGAG 0.1%

10250 5074 AUAAGAUGGCUaAUUGAAG 1.8% 10300 5078 AAGGGCcUUCACCGAAGAG 0.7%

10251 5074 AUAAGAUGGCUGAUCGAAG 0.3% 10301 5078 AAGGGCUaUCACCGAAGAG 0.1%

10252 5074 AUAAnAUGGCUGAUUGAAG 0.1% 10302 5078 AAGGGCUUUCACaGAAGAG 0.1%

10253 5074 AUAAGAUGGuUaAUUGAAG 2.0% 10303 5078 AAGGGCUUUCACCaAAGAG 0.1%

10254 5074 AUAAGgUGGuUaAUUGAAG 0.1% 10304 5078 AAGGGCUUUCACCGAAaAG 0.2%

5075 5075 UAAGAUGGCUGAUUGAAGA 92.1% 10305 5078 AAGGGCUUUCACCGAAGAa 0.1%

10255 5075 UAAGAUGGCUGAUUGAgGA 0.1% 10306 5078 AAGGGCUUUCACCGAAGCG 0.2%

10256 5075 UAAGAUGGuUGAUUGAAGA 1.6% 10307 5078 AAGGGgUUUCACCGAAGAG 0.1%

10257 5075 UAAGgUGGCUGAUUGAAGA 1.8% 10308 5078 uAGGGCUUUCACCGAAGAG 0.1%

10258 5075 UgAGAUGGCUGAUUGAAGA 0.1% 10309 5078 AAGGGCcUUCACCGAgGAG 2.0%

10259 5075 UAAGgUGGCUGAUUGAgGA 0.1% 5079 5079 UAAGGGCUUUCACCGAAGA 91.8%

10260 5075 UAAGAUGGCUaAUUGAAGA 1.8% 10310 5079 UAAGGGCUUUCACCGAgGA 0.1%

10261 5075 UAAGAUGGCUGAUCGAAGA 0.3% 10311 5079 UAAGGGCUUUCACUGAAGA 0.3%

10262 5075 UAAnAUGGCUGAUUGAAGA 0.1% 10312 5079 UAAGGGCUUUCAUCGAAGA 0.1%

10263 5075 UAAGAUGGuUaAUUGAAGA 2.0% 10313 5079 UAAGGGCUUUUACCGAAGA 0.1%

10264 5075 UAAGgUGGuUaAUUGAAGA 0.1% 10314 5079 UgAGGGCUUUCACCGAAGA 0.4%

5076 5076 AAGAUGGCUGAUUGAAGAA 91.9% 10315 5079 UAAGGGCUUUCACuGAgGA 0.1%

10265 5076 AAGAUGGCUGAUUGAAGAg 0.2% 10316 5079 aAAGGGCUUUCACCGAAGA 0.1%

10266 5076 AAGAUGGCUGAUUGAgGAA 0.1% 10317 5079 UAAGaGCUUUCACCGAAGA 0.1%

10267 5076 AAGAUGGuUGAUUGAAGAA 1.6% 10318 5079 UAAGGGCcUUCACCGAAGA 0.7%

10268 5076 AAGgUGGCUGAUUGAAGAA 1.8% 10319 5079 UAAGGGCUaUCACCGAAGA 0.1%

10269 5076 gAGAUGGCUGAUUGAAGAA 0.1% 10320 5079 UAAGGGCUUUCACaGAAGA 0.1%

10270 5076 AAGgUGGCUGAUUGAgGAA 0.1% 10321 5079 UAAGGGCUUUCACCaAAGA 0.1%

10271 5076 AAGAUGGCUaAUUGAAGAA 1.8% 10322 5079 UAAGGGCUUUCACCGAAaA 0.2%

10272 5076 AAGAUGGCUGAUCGAAGAA 0.3% 10323 5079 UAAGGGCUUUCACCGAAGc 0.2%

10273 5076 AAnAUGGCUGAUUGAAGAA 0.1% 10324 5079 UAAGGGgUUUCACCGAAGA 0.1%

10274 5076 AAGAUGGuUaAUUGAAGAA 2.0% 10325 5079 UuAGGGCUUUCACCGAAGA 0.1%

10275 5076 AAGgUGGuUaAUUGAAGAA 0.1% 10326 5079 UAAGGGCcUUCACCGAgGA 2.0%

5077 5077 CUAAGGGCUUϋCACCGAAG 91.9% 5080 5080 GAUGGCUGAUUGAAGAAGU 91.7%

10276 5077 CUAAGGGCUUUCACCGAgG 0.1% 10327 5080 GAUGGCUGAUUGAAGAgGU 0.2%

10277 5077 CUAAGGGCUUUCACUGAAG 0.3% 10328 5080 GAUGGCUGAUUGAgGAAGU 0.1%

10278 5077 CUAAGGGCUUUCAUCGAAG 0.1% 10329 5080 GAUGGuUGAUUGAAGAAGU 1.6%

10279 5077 CUAAGGGCUUUUACCGAAG 0.1% 10330 5080 GgUGGCUGAUUGAAGAAGU 1.8%

10280 5077 CUgAGGGCUUUCACCGAAG 0.4% 10331 5080 GgUGGCUGAUUGAgGAAGU 0.1%

10281 5077 uUAAGGGCUUUCACCGAAG 0.1% 10332 5080 GAUGGCUaAUUGAAGAAGU 1.8%

10282 5077 CUAAGGGCUUUCACuGAgG 0.1% 10333 5080 GAUGGCUGAUCGAAGAAGU 0.3%

10283 5077 CaAAGGGCUUUCACCGAAG 0.1% 10334 5080 GAUGGCUGAUUGAAGAAaU 0.3%

10284 5077 CUAAGaGCUUUCACCGAAG 0.1% 10335 5080 nAUGGCUGAUUGAAGAAGU 0.1%

10285 5077 CUAAGGGCcUUCACCGAAG 0.7% 10336 5080 GAUGGuUaAUUGAAGAAGU 1.9%

10286 5077 CUAAGGGCUaUCACCGAAG 0.1% 10337 5080 GgUGGuUaAUUGAAGAAGU 0.1%

10287 5077 CUAAGGGCUUUCACaGAAG 0.1% 10338 5080 GAUGGuUaAUUGAAGAAaU 0.1%

10288 5077 CUAAGGGCUUUCACCaAAG 0.1% 5081 5081 AGAUGGCUGAUUGAAGAAG 91.7%

10289 5077 CUAAGGGCUUUCACCGAAa 0.2% 10339 5081 AGAUGGCUGAUUGAAGAgG 0.2%

10290 5077 CUAAGGGgUUUCACCGAAG 0.1% 10340 5081 AGAUGGCUGAUUGAgGAAG 0.1%

10291 5077 CUuAGGGCUUUCACCGAAG 0.1% 10341 5081 AGAUGGuUGAUUGAAGAAG 1.6%

10292 5077 CUAAGGGCcUUCACCGAgG 2.0% 10342 5081 AGgUGGCUGAUUGAAGAAG 1.8%

5078 5078 AAGGGCUUUCACCGAAGAG 91.8% 10343 5081 AGgUGGCUGAUUGAgGAAG 0.1%

10344 5081 AGAUGGCUaAUUGAAGAAG 1.8% 10394 5085 AAGCGAAUUUCAGUGUGgU 2.8%

10345 5081 AGAUGGCUGAUCGAAGAAG 0.3% 10395 5085 AgGCGAAUUUCAGUGUGAU 0.1%

10346 5081 AGAUGGCUGAUUGAAGAAa 0.3% 10396 5085 gAGCGAAUUUCAGUGUGAU 0.1%

10347 5081 AnAUGGCUGAUUGAAGAAG 0.1% 10397 5085 AAGCaAAUUUCAGUGUGAU 0.3%

10348 5081 AGAUGGuUaAUUGAAGAAG 1.9% 10398 5085 AAGCGAAcUUCAGUGUGAU 1.6%

10349 5081 AGgUGGuUaAUUGAAGAAG 0.1% 10399 5085 AAGCGAAUaUCAGUGUGAU 0.1%

10350 5081 AGAUGGuUaAUUGAAGAAa 0.1% 10400 5085 AAGCGAAUUUCAGCGUGAU 0.2%

5082 5082 ACUAAGGGCUUUCACCGAA 91.4% 10401 5085 AAGCGAAUUUCAGUGUaAU 0.2%

10351 5082 ACUAAGGGCUUUCACUGAA 0.3% 10402 5085 AAGCGAAUUUCAGUGUGAc 0.2%

10352 5082 ACUAAGGGCUUUCAUCGAA 0.1% 10403 5085 AAGCuAAUUUCAGUGUGAU 0.4%

10353 5082 ACUAAGGGCUUUUACCGAA 0.1% 10404 5085 AAGCaAAcUUCAGUGUGAU 0.9%

10354 5082 ACUgAGGGCUUUCACCGAA 0.4% 10405 5085 AAGCaAAcUUuAGUGUGAU 2.2%

10355 5082 AuUAAGGGCUUUCACCGAA 0.1% 5086 5086 GGCUUUCACCGAAGAGGGA 90.9%

10356 5082 gCUAAGGGCUUUCACCGAA 0.7% 10406 5086 GGCUUUCACCGAAGAGGGg 0.8%

10357 5082 ACUAAGGGCUUUCACuGAg 0.1% 10407 5086 GGCUUUCACCGAgGAGGGA 0.1%

10358 5082 gCUAAGGGCUUUCACCGAg 0.1% 10408 5086 GGCUUUCACuGAAGAGGGA 0.3%

10359 5082 ACaAAGGGCUUUCACCGAA 0.1% 10409 5086 GGCUUUCAuCGAAGAGGGA 0.1%

10360 5082 ACUAAGaGCUUUCACCGAA 0.1% 10410 5086 GGCUUUuACCGAAGAGGGA 0.1%

10361 5082 ACUAAGGGCcUUCACCGAA 0.7% 10411 5086 GGCUUUCACuGAgGAGGGA 0.1%

10362 5082 ACUAAGGGCUaUCACCGAA 0.1% 10412 5086 aGCUUUCACCGAAGAGGGA 0.1%

10363 5082 ACUAAGGGCUUUCACaGAA 0.1% 10413 5086 GGCcUUCACCGAAGAGGGA 0.5%

10364 5082 ACUAAGGGCUUUCACCaAA 0.1% 10414 5086 GGCUaUCACCGAAGAGGGA 0.1%

10365 5082 ACUAAGGGgUUUCACCGAA 0.1% 10415 5086 GGCUUUCACaGAAGAGGGA 0.1%

10366 5082 ACUuAGGGCUUUCACCGAA 0.1% 10416 5086 GGCUUUCACCaAAGAGGGA 0.1%

10367 5082 ACUAAGGGCcUUCACCGAg 2.0% 10417 5086 GGCUUUCACCGAAaAGGGA 0.2%

5083 5083 UACUAAGGGCUUUCACCGA 91.3% 10418 5086 GGCUUUCACCGAAGAaGGA 0.1%

10368 5083 UACUAAGGGCUUUCACUGA 0.4% 10419 5086 GGCUUUCACCGAAGAGGaA 0.1%

10369 5083 UACUAAGGGCUUUCAUCGA 0.1% 10420 5086 GGCUUUCACCGAAGAGGGc 0.6%

10370 5083 UACUAAGGGCUUUUACCGA 0.1% 10421 5086 GGCUUUCACCGAAGCGGGA 0.2%

10371 5083 UACUgAGGGCUUUCACCGA 0.4% 10422 5086 GGgUUUCACCGAAGAGGGA 0.1%

10372 5083 UAuUAAGGGCUUUCACCGA 0.1% 10423 5086 GGCcUUCACCGAAGAGGGg 0.2%

10373 5083 UgCUAAGGGCUUUCACCGA 0.8% 10424 5086 GGCcUUCACCGAgGAGGGA 2.0%

10374 5083 cACUAAGGGCUUUCACCGA 0.1% 5087 5087 GCUUUCACCGAAGAGGGAG 90.8%

10375 5083 UACaAAGGGCUUUCACCGA 0.1% 10425 5087 GCUUUCACCGAAGAGGGgG 0.8%

10376 5083 UACUAAGaGCUUUCACCGA 0.1% 10426 5087 GCUUUCACCGAgGAGGGAG 0.1%

10377 5083 UACUAAGGGCCUUCACCGA 2.7% 10427 5087 GCUUUCACuGAAGAGGGAG 0.3%

10378 5083 UACUAAGGGCUaUCACCGA 0.1% 10428 5087 GCUUUCAuCGAAGAGGGAG 0.1%

10379 5083 UACUAAGGGCUUUCACaGA 0.1% 10429 5087 GCUUUuACCGAAGAGGGAG 0.1%

10380 5083 UACUAAGGGCUUUCACCaA 0.1% 10430 5087 GCUUUCACuGAgGAGGGAG 0.1%

10381 5083 UACUAAGGGgUUUCACCGA 0.1% 10431 5087 GCcUUCACCGAAGAGGGAG 0.2%

10382 5083 UACUuAGGGCUUUCACCGA 0.1% 10432 5087 GCUaUCACCGAAGAGGGAG 0.1%

5084 5084 AAAGCGAAUUUCAGUGUGA 91.2% 10433 5087 GCUUUCACaGAAGAGGGAG 0.1%

10383 5084 AAAGCGAAUUUCAGUGUGg 2.8% 10434 5087 GCUUUCACCaAAGAGGGAG 0.1%

10384 5084 AAgGCGAAUUUCAGUGUGA 0.1% 10435 5087 GCUUUCACCGAAaAGGGAG 0.2%

10385 5084 AgAGCGAAUUUCAGUGUGA 0.1% 10436 5087 GCUUUCACCGAAGAaGGAG 0.1%

10386 5084 AAAGCaAAUUUCAGUGUGA 0.3% 10437 5087 GCUUUCACCGAAGAGGaAG 0.1%

10387 5084 AAAGCGAAcUUCAGUGUGA 1.6% 10438 5087 GCUUUCACCGAAGAGGGAa 0.1%

10388 5084 AAAGCGAAUaUCAGUGUGA 0.1% 10439 5087 GCUUUCACCGAAGAGGGAc 0.1%

10389 5084 AAAGCGAAUUUCAGCGUGA 0.2% 10440 5087 GCUUUCACCGAAGAGGGCG 0.6%

10390 5084 AAAGCGAAUUUCAGUGUaA 0.2% 10441 5087 GCUUUCACCGAAGCGGGAG 0.2%

10391 5084 AAAGCuAAUUUCAGUGUGA 0.4% 10442 5087 GgUUUCACCGAAGAGGGAG 0.1%

10392 5084 AAAGCaAAcUUCAGUGUGA 0.9% 10443 5087 GCcUUCACCGAgGAGGGAG 2.0%

10393 5084 AAAGCaAAcUUuAGUGUGA 2.2% 10444 5087 GCcUUCACCGAAGAGGGAa 0.3%

5085 5085 AAGCGAAUUUCAGUGUGAU 91.0% 10445 5087 GCUUUCACaGAAGAaGGAG 1.0%

10446 5087 GCcUUCACCGAAGAGGGga 0.2% 10497 5090 AAGAGGGgaCAAUUGUUGG 0.2%

10447 5087 GCUUUCACaGAAGAaGGgG 0.2% 10498 5090 AgGAGGGAGCAAUcGUUGG 0.1%

10448 5087 GCUUUCACgGAAGAaGGgG 0.1% 10499 5090 AAGAGGGAcgAAUUGUUGG 0.1%

5088 5088 AUUACUAAGGGCUUUCACC 90.8% 10500 5090 AAGAaGGgGCAAUUGUaGG 0.1%

10449 5088 AUUACUAAGGGCUUUCACU 0.3% 5091 5091 GAAGAGGGAGCAAUUGUUG 90.4%

10450 5088 AUUACUAAGGGCUUUCAUC 0.1% 10501 5091 GAAGAGGGAGCAgUUGUUG 0.2%

10451 5088 AUUACUAAGGGCUUUUACC 0.1% 10502 5091 GAAGAGGGAGUAAUUGUUG 0.7%

10452 5088 AUUACUgAGGGCUUUCACC 0.4% 10503 5091 GAAGAGGGgGCAAUUGUUG 0.8%

10453 5088 AUUAuUAAGGGCUUUCACC 0.1% 10504 5091 GAgGAGGGAGCAAUUGUUG 2.1%

10454 5088 AUUgCUAAGGGCUUUCACC 0.8% 10505 5091 aAAGAGGGAGCAAUUGUUG 0.1%

10455 5088 AcUACUAAGGGCUUUCACC 0.6% 10506 5091 GAAaAGGGAGCAAUUGUUG 0.2%

10456 5088 AUcACUAAGGGCUUUCACC 0.1% 10507 5091 GAAGAaGGAGCAAUUGUUG 0.1%

10457 5088 AUUACaAAGGGCUUUCACC 0.1% 10508 5091 GAAGAGGaAGCAAUUGUUG 0.1%

10458 5088 AUUACUAAGaGCUUUCACC 0.1% 10509 5091 GAAGAGGGAaCAAUUGUUG 0.4%

10459 5088 AUUACUAAGGGCCUUCACC 2.7% 10510 5091 GAAGAGGGAGCAAUCGUUG 0.3%

10460 5088 AUUACUAAGGGCUaUCACC 0.1% 10511 5091 GAAGAGGGAGCAAUUaUUG 0.1%

10461 5088 AUUACUAAGGGCUUUCACa 0.1% 10512 5091 GAAGAGGGAGCAAUUGUaG 0.1%

10462 5088 AUUACUAAGGGgUUUCACC 0.1% 10513 5091 GAAGAGGGcGCAAUUGUUG 0.6%

10463 5088 AUUACUuAGGGCUUUCACC 0.1% 10514 5091 GAAGcGGGAGCAAUUGUUG 0.2%

10464 5088 uUUACUAAGGGCUUUCACC 0.1% 10515 5091 GAAGAGGGgaCAAUUGUUG 0.2%

10465 5088 AcUACUAAGGGCUUUCACu 0.1% 10516 5091 GAgGAGGGAGCAAUcGUUG 0.1%

5089 5089 UUACUAAGGGCUUUCACCG 90.8% 10517 5091 GAAGAGGGAcgAAUUGUUG 0.1%

10466 5089 UUACUAAGGGCUUUCACUG 0.3% 10518 5091 GAAGAaGGgGCAAUUGUaG 0.1%

10467 5089 UUACUAAGGGCUUUCAUCG 0.1% 5092 5092 AGAGGGAGCAAUUGUUGGC 90.3%

10468 5089 UUACUAAGGGCUUUUACCG 0.1% 10519 5092 AGAGGGAGCAAUUGUUGGu 0.1%

10469 5089 UUACUgAGGGCUUUCACCG 0.4% 10520 5092 AGAGGGAGCAgUUGUUGGC 0.2%

10470 5089 UUAuUAAGGGCUUUCACCG 0.1% 10521 5092 AGAGGGAGuAAUUGUUGGC 0.7%

10471 5089 UUgCUAAGGGCUUUCACCG 0.8% 10522 5092 AGAGGGgGCAAUUGUUGGC 0.8%

10472 5089 cUACUAAGGGCUUUCACCG 0.6% 10523 5092 gGAGGGAGCAAUUGUUGGC 2.1%

10473 5089 UcACUAAGGGCUUUCACCG 0.1% 10524 5092 AaAGGGAGCAAUUGUUGGC 0.2%

10474 5089 UUACaAAGGGCUUUCACCG 0.1% 10525 5092 AGAaGGAGCAAUUGUUGGC 0.1%

10475 5089 UUACUAAGaGCUUUCACCG 0.1% 10526 5092 AGAGGaAGCAAUUGUUGGC 0.1%

10476 5089 UUACUAAGGGCCUUCACCG 2.7% 10527 5092 AGAGGGAaCAAUUGUUGGC 0.3%

10477 5089 UUACUAAGGGCUaUCACCG 0.1% 10528 5092 AGAGGGAGCAAUUaUUGGC 0.1%

10478 5089 UUACUAAGGGCUUUCACaG 0.1% 10529 5092 AGAGGGAGCAAUUGUaGGC 0.1%

10479 5089 UUACUAAGGGCUUUCACCa 0.1% 10530 5092 AGAGGGAGCAAUUGUUGCC 0.1%

10480 5089 UUACUAAGGGgUUUCACCG 0.1% 10531 5092 AGAGGGcGCAAUUGUUGGC 0.6%

10481 5089 UUACUuAGGGCUUUCACCG 0.1% 10532 5092 AGcGGGAGCAAUUGUUGGC 0.2%

10482 5089 cUACUAAGGGCUUUCACuG 0.1% 10533 5092 AGAGGGAaCAAUUGUUGGu 0.1%

5090 5090 AAGAGGGAGCAAUUGUUGG 90.4% 10534 5092 AGAGGGgaCAAUUGUUGGC 0.2%

10483 5090 AAGAGGGAGCAgUUGUUGG 0.2% 10535 5092 AGAGGGAcgAAUUGUUGGC 0.1%

10484 5090 AAGAGGGAGuAAUUGUUGG 0.7% 10536 5092 AGAGGGAGCAAUcGUUGGa 0.3%

10485 5090 AAGAGGGgGCAAUUGUUGG 0.8% 10537 5092 AGAaGGgGCAAUUGUaGGC 0.1%

10486 5090 AgGAGGGAGCAAUUGUUGG 2.1% 10538 5092 gGAGGGAGCAAUcGUUGGa 0.1%

10487 5090 AAaAGGGAGCAAUUGUUGG 0.2% 5093 5093 UCACCGAAGAGGGAGCAAU 90.3%

10488 5090 AAGAaGGAGCAAUUGUUGG 0.1% 10539 5093 UCACCGAAGAGGGAGCAgU 0.2%

10489 5090 AAGAGGaAGCAAUUGUUGG 0.1% 10540 5093 UCACCGAAGAGGGAGUAAU 0.7%

10490 5090 AAGAGGGAaCAAUUGUUGG 0.4% 10541 5093 UCACCGAAGAGGGgGCAAU 0.8%

10491 5090 AAGAGGGAGCAAUCGUUGG 0.3% 10542 5093 UCACCGAgGAGGGAGCAAU 2.1%

10492 5090 AAGAGGGAGCAAUUaUUGG 0.1% 10543 5093 UCACuGAAGAGGGAGCAAU 0.3%

10493 5090 AAGAGGGAGCAAUUGUaGG 0.1% 10544 5093 UCAuCGAAGAGGGAGCAAU 0.1%

10494 5090 AAGAGGGAGCAAUUGUUGc 0.1% 10545 5093 UuACCGAAGAGGGAGCAAU 0.1%

10495 5090 AAGAGGGcGCAAUUGUUGG 0.6% 10546 5093 UCACuGAgGAGGGAGCAAU 0.1%

10496 5090 AAGcGGGAGCAAUUGUUGG 0.2% 10547 5093 UCACaGAAGAGGGAGCAAU 0.1%

2I 5

10548 5093 UCACCaAAGAGGGAGCAAU 0.1% 10600 5095 UUCACCGAAGCGGGAGCAA 0.2%

10549 5093 UCACCGAAaAGGGAGCAAU 0.2% 10601 5095 UUCACCGAAGAGGGgaCAA 0.2%

10550 5093 UCACCGAAGAaGGAGCAAU 0.1% 10602 5095 UUCACaGAAGAaGGAGCAA 1.0%

10551 5093 UCACCGAAGAGGaAGCAAU 0.1% 10603 5095 UUCACCGAAGAGGGAcgAA 0.1%

10552 5093 UCACCGAAGAGGGAaCAAU 0.4% 10604 5095 UUCACaGAAGAaGGgGCAA 0.2%

10553 5093 UCACCGAAGAGGGCGCAAU 0.6% 10605 5095 UUCACgGAAGAaGGgGCAA 0.1%

10554 5093 UCACCGAAGcGGGAGCAAU 0.2% 5096 5096 CUUUCACCGAAGAGGGAGC 90.1%

10555 5093 UCACCGAAGAGGGgaCAAU 0.2% 10606 5096 CUUUCACCGAAGAGGGAGu 0.7%

10556 5093 UCACaGAAGAaGGAGCAAU 1.0% 10607 5096 CUUUCACCGAAGAGGGgGC 0.8%

10557 5093 UCACCGAAGAGGGAcgAAU 0.1% 10608 5096 CUUUCACCGAgGAGGGAGC 0.1%

10558 5093 UCACaGAAGAaGGgGCAAU 0.2% 10609 5096 CUUUCACuGAAGAGGGAGC 0.3%

10559 5093 UCACgGAAGAaGGgGCAAU 0.1% 10610 5096 CUUUCAuCGAAGAGGGAGC 0.1%

5094 5094 UUUCACCGAAGAGGGAGCA 90.2% 10611 5096 CUUUuACCGAAGAGGGAGC 0.1%

10560 5094 UUUCACCGAAGAGGGAGUA 0.7% 10612 5096 CUUUCACuGAgGAGGGAGC 0.1%

10561 5094 UUUCACCGAAGAGGGgGCA 0.8% 10613 5096 CcUUCACCGAAGAGGGAGC 0.2%

10562 5094 UUUCACCGAgGAGGGAGCA 0.1% 10614 5096 CUaUCACCGAAGAGGGAGC 0.1%

10563 5094 UUUCACuGAAGAGGGAGCA 0.3% 10615 5096 CUUUCACaGAAGAGGGAGC 0.1%

10564 5094 UUUCAuCGAAGAGGGAGCA 0.1% 10616 5096 CUUUCACCaAAGAGGGAGC 0.1%

10565 5094 UUUuACCGAAGAGGGAGCA 0.1% 10617 5096 CUUUCACCGAAaAGGGAGC 0.2%

10566 5094 UUUCACuGAgGAGGGAGCA 0.1% 10618 5096 CUUUCACCGAAGAaGGAGC 0.1%

10567 5094 cUUCACCGAAGAGGGAGCA 0.2% 10619 5096 CUUUCACCGAAGAGGaAGC 0.1%

10568 5094 UaUCACCGAAGAGGGAGCA 0.1% 10620 5096 CUUUCACCGAAGAGGGAaC 0.1%

10569 5094 UUUCACaGAAGAGGGAGCA 0.1% 10621 5096 CUUUCACCGAAGAGGGCGC 0.6%

10570 5094 UUUCACCaAAGAGGGAGCA 0.1% 10622 5096 CUUUCACCGAAGCGGGAGC 0.2%

10571 5094 UUUCACCGAAaAGGGAGCA 0.2% 10623 5096 gUUUCACCGAAGAGGGAGC 0.1%

10572 5094 UUUCACCGAAGAaGGAGCA 0.1% 10624 5096 CcUUCACCGAgGAGGGAGC 2.0%

10573 5094 UUUCACCGAAGAGGaAGCA 0.1% 10625 5096 CcUUCACCGAAGAGGGAaC 0.3%

10574 5094 UUUCACCGAAGAGGGAaCA 0.1% 10626 5096 CUUUCACaGAAGAaGGAGC 1.0%

10575 5094 UUUCACCGAAGAGGGCGCA 0.6% 10627 5096 CUUUCACCGAAGAGGGAcg 0.1%

10576 5094 UUUCACCGAAGCGGGAGCA 0.2% 10628 5096 CcUUCACCGAAGAGGGgaC 0.2%

10577 5094 cUUCACCGAgGAGGGAGCA 2.0% 10629 5096 CUUUCACaGAAGAaGGgGC 0.2%

10578 5094 cUUCACCGAAGAGGGAaCA 0.3% 10630 5096 CUUUCACgGAAGAaGGgGC 0.1%

10579 5094 UUUCACaGAAGAaGGAGCA 1.0% 5097 5097 CCGAAGAGGGAGCAAUUGU 90.0%

10580 5094 UUUCACCGAAGAGGGAcgA 0.1% 10631 5097 CCGAAGAGGGAGCAgUUGU 0.2%

10581 5094 cUUCACCGAAGAGGGgaCA 0.2% 10632 5097 CCGAAGAGGGAGUAAUUGU 0.7%

10582 5094 UUUCACaGAAGAaGGgGCA 0.2% 10633 5097 CCGAAGAGGGgGCAAUUGU 0.8%

10583 5094 UUUCACgGAAGAaGGgGCA 0.1% 10634 5097 CCGAgGAGGGAGCAAUUGU 2.1%

5095 5095 UUCACCGAAGAGGGAGCAA 90.2% 10635 5097 CuGAAGAGGGAGCAAUUGU 0.3%

10584 5095 UUCACCGAAGAGGGAGCAg 0.2% 10636 5097 uCGAAGAGGGAGCAAUUGU 0.1%

10585 5095 UUCACCGAAGAGGGAGUAA 0.7% 10637 5097 CaGAAGAGGGAGCAAUUGU 0.1%

10586 5095 UUCACCGAAGAGGGgGCAA 0.8% 10638 5097 CCaAAGAGGGAGCAAUUGU 0.1%

10587 5095 UUCACCGAgGAGGGAGCAA 2.1% 10639 5097 CCGAAaAGGGAGCAAUUGU 0.2%

10588 5095 UUCACuGAAGAGGGAGCAA 0.3% 10640 5097 CCGAAGAaGGAGCAAUUGU 0.1%

10589 5095 UUCAuCGAAGAGGGAGCAA 0.1% 10641 5097 CCGAAGAGGaAGCAAUUGU 0.1%

10590 5095 UUuACCGAAGAGGGAGCAA 0.1% 10642 5097 CCGAAGAGGGAaCAAUUGU 0.4%

10591 5095 UUCACuGAgGAGGGAGCAA 0.1% 10643 5097 CCGAAGAGGGAGCAAUCGU 0.3%

10592 5095 aUCACCGAAGAGGGAGCAA 0.1% 10644 5097 CCGAAGAGGGAGCAAUUaU 0.1%

10593 5095 UUCACaGAAGAGGGAGCAA 0.1% 10645 5097 CCGAAGAGGGCGCAAUUGU 0.6%

10594 5095 UUCACCaAAGAGGGAGCAA 0.1% 10646 5097 CCGAAGcGGGAGCAAUUGU 0.2%

10595 5095 UUCACCGAAaAGGGAGCAA 0.2% 10647 5097 CCGAAGAGGGgaCAAUUGU 0.2%

10596 5095 UUCACCGAAGAaGGAGCAA 0.1% 10648 5097 CuGAgGAGGGAGCAAUcGU 0.1%

10597 5095 UUCACCGAAGAGGaAGCAA 0.1% 10649 5097 CCGAAGAGGGAcgAAUUGU 0.1%

10598 5095 UUCACCGAAGAGGGAaCAA 0.4% 10650 5097 CgGAAGAaGGgGCAAUUGU 0.1%

10599 5095 UUCACCGAAGAGGGCGCAA 0.6% 5098 5098 ACCGAAGAGGGAGCAAUUG 90.0%

10651 5098 ACCGAAGAGGGAGCAgUUG 0.2% 10681 5099 CGAAGAGGGAaCAAUUGUU 0.4%

10652 5098 ACCGAAGAGGGAGUAAUUG 0.7% 10682 5099 CGAAGAGGGAGCAAUCGUU 0.3%

10653 5098 ACCGAAGAGGGgGCAAUUG 0.8% 10683 5099 CGAAGAGGGAGCAAUUaUU 0.1%

10654 5098 ACCGAgGAGGGAGCAAUUG 2.1% 10684 5099 CGAAGAGGGAGCAAUUGUa 0.1%

10655 5098 ACuGAAGAGGGAGCAAUUG 0.3% 10685 5099 CGAAGAGGGcGCAAUUGUU 0.6%

10656 5098 AuCGAAGAGGGAGCAAUUG 0.1% 10686 5099 CGAAGcGGGAGCAAUUGUU 0.2%

10657 5098 ACaGAAGAGGGAGCAAUUG 0.1% 10687 5099 CGAAGAGGGgaCAAUUGUU 0.2%

10658 5098 ACCaAAGAGGGAGCAAUUG 0.1% 10688 5099 uGAgGAGGGAGCAAUcGUU 0.1%

10659 5098 ACCGAAaAGGGAGCAAUUG 0.2% 10689 5099 CGAAGAGGGAcgAAUUGUU 0.1%

10660 5098 ACCGAAGAaGGAGCAAUUG 0.1% 5100 5100 CACCGAAGAGGGAGCAAUU 90.0%

10661 5098 ACCGAAGAGGaAGCAAUUG 0.1% 10690 5100 CACCGAAGAGGGAGCAgUU 0.2%

10662 5098 ACCGAAGAGGGAaCAAUUG 0.4% 10691 5100 CACCGAAGAGGGAGUAAUU 0.7%

10663 5098 ACCGAAGAGGGAGCAAUCG 0.3% 10692 5100 CACCGAAGAGGGgGCAAUU 0.8%

10664 5098 ACCGAAGAGGGAGCAAUUa 0.1% 10693 5100 CACCGAgGAGGGAGCAAUU 2.1%

10665 5098 ACCGAAGAGGGcGCAAUUG 0.6% 10694 5100 CACuGAAGAGGGAGCAAUU 0.3%

10666 5098 ACCGAAGcGGGAGCAAUUG 0.2% 10695 5100 CAuCGAAGAGGGAGCAAUU 0.1%

10667 5098 ACCGAAGAGGGgaCAAUUG 0.2% 10696 5100 UACCGAAGAGGGAGCAAUU 0.1%

10668 5098 ACuGAgGAGGGAGCAAUcG 0.1% 10697 5100 CACaGAAGAGGGAGCAAUU 0.1%

10669 5098 ACCGAAGAGGGAcgAAUUG 0.1% 10698 5100 CACCaAAGAGGGAGCAAUU 0.1%

10670 5098 ACgGAAGAaGGgGCAAUUG 0.1% 10699 5100 CACCGAAaAGGGAGCAAUU 0.2%

5099 5099 CGAAGAGGGAGCAAUUGUϋ 90.0% 10700 5100 CACCGAAGAaGGAGCAAUU 0.1%

10671 5099 CGAAGAGGGAGCAgUUGUU 0.2% 10701 5100 CACCGAAGAGGaAGCAAUU 0.1%

10672 5099 CGAAGAGGGAGUAAUUGUU 0.7% 10702 5100 CACCGAAGAGGGAa CAAUU 0.4%

10673 5099 CGAAGAGGGgGCAAUUGUU 0.8% 10703 5100 CACCGAAGAGGGAGCAAUc 0.3%

10674 5099 CGAgGAGGGAGCAAUUGUU 2.1% 10704 5100 CACCGAAGAGGGCGCAAUU 0.6%

10675 5099 uGAAGAGGGAGCAAUUGUU 0.3% 10705 5100 CACCGAAGcGGGAGCAAUU 0.2%

10676 5099 aGAAGAGGGAGCAAUUGUU 0.1% 10706 5100 CACCGAAGAGGGgaCAAUU 0.2%

10677 5099 CaAAGAGGGAGCAAUUGUU 0.1% 10707 5100 CACuGAgGAGGGAGCAAUc 0.1%

10678 5099 CGAAaAGGGAGCAAUUGUU 0.2% 10708 5100 CACCGAAGAGGGAcgAAUU 0.1%

10679 5099 CGAAGAaGGAGCAAUUGUU 0.1% 10709 5100 CACgGAAGAaGGgGCAAUU 0.1%

10680 5099 CGAAGAGGaAGCAAUUGUU 0.1%

Example 16: Intravenous Delivery ofsiRNA Inhibits Influenza Virus Production In Vivo

The present example demonstrates that both the prophylactic and post-infection intravenous administration of siRNA targeted to viral NP transcripts significantly inhibited influenza virus replication in the mouse (Figure 2). The following is a list of exemplary human influenza virus conserved target sequence (derived from Accession No. AF389119): gccacugaaaucagagcau, (SEQ ID NO: 10794) ucagagcauccgucggaaa, (SEQ ID NO: 10795) ggacgauucuacauccaaa, (SEQ ID NO: 10796) cagcuuaacaauagagaga, (SEQ ID NO: 10797) gcuuaacaauagagagaau, (SEQ ID NO: 10798) aauagagagaauggugcuc, (SEQ ID NO: 10799) gggaaagauccuaagaaaa, (SEQ ID NO: 10800) ggaaagauccuaagaaaac, (SEQ ID NO: 10801) ugagagaacucauccuuua, (SEQ ID NO: 10802) uuaugacaaagaagaaaua, (SEQ ID NO: 10803) acaagaauugcuuaugaaa, (SEQ ID NO: 10804) gaauugcuuaugaaagaau, (SEQ ID NO: 10805) aagcaaugauggaucaagu, (SEQ ID NO: 10806)

gcaaugauggaucaaguga, (SEQ ID NO: 10807) ugauggaucaagugagaga, (SEQ ID NO: 10808) ccacuagaggaguucaaau, (SEQ ID NO: 10809) cacuagaggaguucaaauu, (SEQ ID NO:10810) gaggaaacaccaaucaaca, (SEQ ID NO:10811) ggaaacaccaaucaacaga, (SEQ ID NO: 10812) gggccaaaucagcauacaa, (SEQ ID NO:10813) caaccauuauggcagcauu, (SEQ ID NO: 10814) ccauuauggcagcauucaa, (SEQ ID NO: 10815) aggaugauggaaagugcaa, (SEQ ID NO: 10816) gaugauggaaagugcaaga, (SEQ ID NO: 10817) gaguaaugaaggaucuuau, (SEQ ID NO: 10818) ggaucuuauuucuucggag, (SEQ ID NO:10819) gaucuuauuucuucggaga, (SEQ ID NO: 10820) ucuuauuucuucggagaca, (SEQ ID NO:10821) ggaguacgacaauuaaaga, (SEQ ID NO: 10822)

The following is a list of sequences that represent a fragment of the influenza NP gene for multiple species. The bolded region is a conserved region shared among these sequences.

Influenza Strain, Nucleotide Sequence of Conserved Region of Influenza NP Gene, SEQ ID NO

PR8 (HlNl), AATGAAGGATCTTATTTCTTCGGAGACAATGCAGAGGAGTACGACAATTA, 11551 WSN (HlNl), AATGAAGGATCTTATTTCTTCGGAGACAATGCAGAGGAGTACGACAATTA, 11552 Lenn.(H2N2), AATGAAGGATCTTATTTCTTCGGAGACAATGCAGAGGAGTACGACAATTA, 11553 HK (H3N2), AATGAAGGATCTTATTTCTTCGGAGACAATGCAGAGGAGTACGACAATTA, 11554

Memphis (H3N2), AATGAAGGATCTTATTTCTTCGGAGACAATGCAGAGGAGTACGACAATTA, 11555 HK (H5N1), AATGAAGGATCTTATTTCTTCGGAGACAATGCAGAGGAATATGACAATTG, 11556 Duck (H10N7), AACGAGGGATCTTATTTCTTCGGAGACAATGCAGAGGAATATGACAATTA, 11557 Equine (H7N7), AATGAAGGGTCTTATTTCTTCGGAGACAATGCTGAGGAGTTTGACAGTTA, 11558 Whale (H13N2), AATGAGGGATCTTATTTCTTCGGAGACAATGCTGAGGAGTATGACAATTG, 11559 Chicken(H9N2), AATGAAGGATCTTATTTCTTCGGAGACAATGCATAGGAGTATGACAATTA, 11560 Swine (H4N6), AACGAAGGGTCTTATTTCTTCGGAGACAATGCAGAGGAATATGACAATTA, 11561

To test the prophylactic use of viral targeting siRNA, NP- 1496 (INFsi-9) was mixed with the cationic delivery polymer jetPEI (Qbiogene) and administered (2 mg/Kg) to C57BL/6 mice intravenously (IV). Three hours later, mice were inoculated (intranasally) with 1 x 10 4 PR8 viral particles to initiate infection and later sacrificed 24 hrs post-infection to assay lung homogenates for viral titers using the MDCK hemagglutinin assay. As shown in Figure 2, the average logioTCIDsQ of the lung homogenate for mice that received no siRNA treatment (NT; filled squares) or received a siRNA targeted to GFP (GFP 60 μg; open squares) was 4.2. In mice that were pretreated with 30 μg siRNA targeted to NP (NP 30 μg; open circles) and jetPEI, the average log 10 TCID 5 o of the lung homogenate was 3.9. In mice that were pretreated with 60 μg siRNA targeted to NP (NP 60 μg; filled circles) and jetPEI, the average log 10 TCID 5 o of the lung homogenate was 3.2.

The difference in virus titer in the lung homogenate between the group that received no treatment and the group that received 60 μg NP siRNA was significant with P=O.0002. In order to evaluate siRNA as a therapeutic treatment for an existing influenza infection, mice were infected with PR8 virus intranasally and five hours later were given NP-1496/jetPEI or PA-2087/jetPEI mixture intravenously. Viral titers in the lungs were assayed by MDCK-HA assay 28 hours post-infection. All treatments significantly reduced viral titer in comparison to untreated, infected mice; dose-responsive decreases in viral titers were observed in mice treated with NP-1496 (Figure 3). A suppression effect of siRNA treatment at 24 hours post-infection was also seen in mice. Further, the designed siRNAs can also protect mice from a lethal challenge of avian influenza virus. Mice injected (2x) with control (GFP-targeting) siRNAs and subsequently challenged with a lethal dose of HlNl (PR8), H5N1 or H7N7 virus, continuously lost weight and succumbed to the infection between days 7-10, as shown in Figure 4. However, infected mice that received the combined siRNAs NP-1496 and PA- 2087 recovered from the initial weight loss. At least 50% of the mice survived the lethal H7N7 challenge, 87% survived the lethal H5N1 challenge and 100% survived the HlNl challenge. Thus, siRNAs specific for the conserved regions of the influenza viral genome confers broad protection, including protection against the highly pathogenic avian influenza viruses (Figure 4).

Example 17a: Intranasal Delivery of siRNA Inhibits Influenza Production in Mice

The present example demonstrates that prophylactic intranasal administration of siRNA targeted to viral NP transcripts inhibited influenza virus replication and reduced viral RNA levels in a dose-dependent manner in the mouse.

Influenza normally infects and replicates in the upper respiratory tract and lungs. Therefore, due to accessibility, topical administration, i.e. intranasal and/or pulmonary delivery of drug should be ideal for influenza prophylaxis and therapy. Specifically, intranasal and/or pulmonary delivery of siRNAs is advantageous in treating influenza virus infection, because, 1) high local siRNA concentration are easily achieved when local delivery route is used and thus less siRNA is required compared to systemic delivery and 2) intranasal and/or pulmonary delivery methods are non-invasive. Thus, an intranasal delivery of siRNA in the influenza mouse model was pursued.

Unlike conventional intravenous delivery of naked siRNA which resulted in negligible silencing effect, intranasally administered siRNA (unmodified, in PBS or saline) can be detected in the lungs and is able to silence endogenous gene expression or inhibit virus production in lung tissue. To test the efficacy of non-invasive delivery of influenza targeting siRNA, the NP- 1496 siRNA (in PBS) was delivered intranasally. BALB/c mice were treated intranasally with indicated amounts of NP specific siRNA in PBS or PBS control. Two hours later, all mice were infected intranasally (1000 pfu/mouse) with the PR8 serotype. The lungs were harvested 24 hours post-infection and viral titer was measured from lung homogcnatcs by MDCK-HA assay. P values between PBS and siRNA groups indicated statistical significance with 0.5, 1 and 2 mg/kg siRNA treated groups. The data is shown in Figure 5.

In the absence of a carrier, naked NP targeting siRNA was effective in suppressing viral production in the mouse lung (Figure 5; 24 hours post-infection). Suppression was dose dependent, with a 7-fold reduction being observed when 2 mg/kg of siRNA was delivered two hours prior to infection.

The effects of intranasal delivery of NP-targeting siRNA were also investigated at higher concentrations (10 mg/kg, delivered 3 hours prior to infection) using target mRNA expression (quantitative RT-PCR) and viral titer (MDCK-TIA) to measure efficacy. BALB/c mice were administered control and NP-targeting siRNA intranasally (10 mg/kg, in PBS). Three hours later, all the mice were infected intranassally with PR8 virus (50 pfu/mouse). The lungs were harvested at 24 and 48 hours post-infection and total RNA was isolated from the left lung. Total mRNA was reverse transcribed to cDNA using dTl 8 primers. Real time PCR was carried out using PBl specific primers to quantify viral mRNA levels. GAPDH was used as an internal control. The right and middle lungs were homogenized and the viral titer was measured by MDCK-T-IA assay. The virus titer in the samples at 48 hours post-infection is shown in the Figure 6 statistic significance was found between PBS and NP siRNA treated group using student t test (p=0.01); the titer in the samples 24 hours post-infection was too low to detect, possibly due to siRNA directed suppression. The results are shown in Figure 6. Figure 6 compares the normalized quantitative

PCR results and the viral titer assay results. Viral mRNA level measured at 24 hours post-infection show a 55.2% inhibition but by 48 hours post-infection only minimal inhibition was observed. In contrast, the MDCK-HA assay of mouse lung samples indicated 84.6% viral titer suppression on day 2. Compared to the MDCK-HA assay that

22Q

measures live virus particles, viral mRNA quantification is probably more sensitive in reflecting the early changes in viral replication. Thus, the decrease in viral mRNA suppression on day 2 is probably due to the decreased RNAi effect in the mouse lung by that time. Also, the effect on influenza viral titer in mouse between naked siRNA targeting the NP transcript delivered intranasally and the influenza treatment, Tamiflu, was compared. Relative to the level of viral titer observed with the GPF control siRNA, both the intranasally delivered naked siRNA and Tamiflu treatments reduced influenza viral titers. The effect on viral titer by NP-viral transcript targeting in mouse after intranasal delivery the siRNA Gl 498 (INFsi-8) was also addressed. The Gl 498 siRNA exhibited significant ability to reduce viral titers in vitro and thus was chosen for further characterization in vivo. The control for this study was an unmodified siRNA targeted against luciferase (Dharmacon; Luc). Ten week old female BALB/c (Taconic) mice with a weight range of 18-22 grams were used in the study. There were ten mice per study group. The mice were dosed with G1498 siRNA in PBS at 2 mg/kg, 5 mg/kg, 10 mg/kg, 20 mg/kg and 30 mg/kg. The control groups were dosed the same, except no controls received a 2 mg/kg dose. Both the Gl 498 and Luc siRNA control groups were infected with PR8 influcenze virus at 30 pfu in 30 μl in PBS four hours post-siRNA administration. Fourty-eights hours post-infection, the mouse lungs were harvested and viral titers measured therefrom in MDCK cells with a TCID 50 assay.

The results are shown in figure 7. The results of the TCID 50 assay show that the G 1498 siRNA at 2 mg/kg suppressed influenza production in the mouse lung by 86%, at 5 mg/kg and 10 mg/kg by 90.6%, at 20 mg/kg by 96.6% and at 30 mg/kg by 95.2%. In relation to PBS alone or the Luc control siRNA study groups, the mice administered the G1498 siRNA intranasally, as a whole, showed significant differences (P < 0.001). The mice that received PBS did not exhibit significant difference compared to the mice group that received the Luc siRNA, as a whole, (P > 0.05). Each study group that received a dose of the G1498 siRNA significantly differed from the PBS group or Luc siRNA control study group at 30 mg/kg (P < 0.05). Finally, no significant dose response was observed with mice that received the range of G1498 siRNA doses.

Example 17b: Intranasal Delivery ofsiRNA Inhibits Cyclophilin B Expression in a Non- influenza Mouse Model

The present example demonstrates that the airway cell uptake of naked siRNA via intranasal administration is not a specific phenomenon of influenza-infected cells. In addition, naked siRNA delivered intranasally also reduced endogenous gene, cyclophilin B, expression in the lungs of healthy mice. Balb/c mice were treated intranasally with 10 mg/kg cyclophilin B specific siRNA (Dharmacon) or GFP siRNA in PBS or PBS control. There were five mice per group. The mouse lungs were harvested 24 hours later. Total RNA was purified from the lung samples and reverse transcription was conducted using dT18 primer. Cyclophilin B-specific primers (Applied Biosystem) were used in real-time PCR to quantify the target mRNA level. GAPDH-specific primers (Applied Biosystem) were also used in the PCR as a control.

As shown in figure 8, the cyclophilin B mRNA in the lungs was inhibited by 70% 24 hours after the mice received 10 mg/kg cyclophilin B siRNA intranasally. These data indicate that the airway cell uptake of naked siRNA via intranasal administration is not a specific phenomenon in influenza-infected cells. Endogenous gene silencing in healthy cells in healthy animal can be also achieved by naked siRNA delivered intranasally. This finding is highly relevant for the utility of siRNA for prophylaxis, which would occur in the absence of infection.

Example 18: Influenza Viral Titer Reduction Following Intranasal Administration of Cochleate siRNA Formulations in Mouse

The present demonstrates that intranasal administration of the G1498 siRNA in a cochleate delivery formulation (BDSI, North Carolina) enhances influenza viral suppression relative to naked siRNA in mouse. For the instant example, the formulations tested are shown below in Table 21.

Table 21. Formulations Administered Intranasally to Mice

The formulations listed above were administered intranasally five hours post-infection with influenza. The viral titer for the whole lung was measured 48 hours post-infection. There were ten mice in each study group, except the naked unlipidated U-flu groups, which had five.

The lung viral titer results are shown in figure 9. Each dot on the graph represents one animal. Statistics was performed by One-Way-Anova. The numbers with an asteric indicates that the mean value has a statistical difference relative to the controls (placebo or buffer). The data in figure 8 show that the G1498 siRNA administered intranasally with the cochleate delivery formulation exhibit greater viral titer reduction relative to the naked G1498 siRNA and the controls (Buffer alone or the Cochleate Placebo). Some degree of toxicity was observed in all groups that receive the cochleate formulations or naked lapidated siRNA.

Example 19: Influenza Viral Titer Reduction Following Intravenousl Administration of Cochleate siRNA Formulations in Mouse

The present demonstrates that intravenous administration of the G1498 siRNA in a cochleate delivery formulation enhances influenza viral suppression relative to naked siRNA in mouse. The formulations tested are shown below in Table 22.

Table 22. Formulations Administered Intravenouslly to Mice

The formulations listed above were administered intranasally five hours post-infection with influenza. The viral titer for the whole lung was measured 48 hours post-infection. There were ten mice in each study group, except the naked unlipidated U-flu groups, which had five.

The lung viral titer results are shown in figure 1OA. Each dot on the graph represents one animal. Statistics was performed by One-Way-Anova. The numbers with an asteric indicates that the mean value has a statistical difference relative to the controls (placebo or buffer). The data in figure 9 show that the G 1498 siRNA administered intravenously with the cochleate delivery formulation exhibited greater viral titer reduction relative to the controls (Buffer alone or the Cochleate Placebo). Moreover, the U-flu formulation administered intransally also reduced lung viral titers.

A dose reponse profile was also generated for intranveouslly adminstered siRNA delivered in cochleat formulations. The formulations tested are shown below in Table 23.

Table 23. Formulations Administered Intravenouslly to Mice for Dose-Response Study

The formulations listed above were administered intranasally five hours post-infection with influenza. The viral titer for the whole lung was measured 48 hours post-infection. There were ten mice in each study group.

The lung viral titer results are shown in figure 1OB. Each dot on the graph represents one animal. Statistics was performed by One-Way-Anova. The numbers with an asteric indicates that the mean value has a statistical difference relative to the controls (placebo or buffer). The data in figure 10 show that the G1498 siRNA administered intravenously with the cochleate delivery formulation exhibited greater viral titer reduction relative to the Buffer control. Moreover, a dose-response was observed. As the dose of the G1498 siRNA in the cochleat formulation increased a greater reduction in mouse lung viral titers was observed.

The same formulations listed in Table 23 above were administered via oral gavage to mice following the same protocols and procedures for measuring mouse lung viral titers. The lung viral titer results are shown in figure 1OB. As stated previously, each dot represents an animal. No statistical differences among the test formuatlion were observed. Additinally, no toxicity was observed in any of the study groups.

Example 20: Distribution of Rhodamine-Cochleate in the Mouse Lung Before and After Influenza Infection

The present example demonstrates that intransal administration of the rhodamine- cochleate siRNA-free formulation showed a wide distribution of the rhodamine compared to intravenous or oral gavage administration. For the purpose of the instant example, rhodamine was encapsulated in the siRNA-free cochleate. The rhodamine-cochleate formulation was administered via intranasal (40 mg/ml, 50 μl/mouse), intravenous (10 mg/ml, 200 μl/mouse) or oral gavage (10 mg/ml, 200 μl/mouse) four hours either pre- or post- influenza infection. The mouse lung tissue was collected five hours after the final injection or infection and then frozen with dry ice and sectioned for analysis. The frozen sections was stained with DAPI for nuclei and imaged. The results from pre- and post- influenza infection were similar for all groups.

However, the group subject to the intranasal administration of the rhodamine-cochleate fomulation showed a wide distribution of rhodamine in the lung section. The group subject to the intravenous administration showed some rhodamine aggregation but had limited distributed positive signal. The ones with oral gavage did not show positive signals. These data indicate that thourough distribution of a cochleate formulation in the lung for treatment of a respiratory virus infection is best achieved with intransal administration. Thourough distribution within the lung is essential for effective reduction of viral titers.

Example 21. generation and testing ofsiRNAs derived from parainfluenza virus

Parainfluenza virus nuclear proteins were identified using the methods above or from publicly available sequences. siRNAs were identified in nucleic acids encoding parainfluenza virus nuclear proteins using the methods described above.

Tables 24 A-B list 19-nucleotide regions that are siRNA parainfluenza virus nucleoprotein target sequences. The provided 19 nucleotide region is useful as the sense strand to design a variety of siRNA molecules, optionally having different 3 ' overhangs in either or both the sense and antisense strands. Thus, one skilled in the art will appreciate that a variety of sense and antisense siRNA sequences may be obtained from each sequence listed in Tables 24 A-B. Tables 24A-B, and all other tables provided herein that disclose target sequences derived from viruses (except for Influenza virus described above) also disclose a subset of target sequences termed "Conserved target sequences." These conserved target sequences are highly conserved among a plurality of virus nucleoprotein sequences.

Tables 24A-B, and all other tables provided herein that disclose target sequences derived from negative strand viruses having non-segmented genomes also disclose the position of the 5' nucleotide in the viral nucleoprotein gene.

In vitro analysis

Lung epithelial cell line A549 cells will be transfected with different concentrations of siRNAs. Sicontrol from Dharmacon will be used as control siRNA. Lipofectamine 2000 will be used as transfection reagent and manufacturer's instruction will be followed. 4 hours later, transfected A549 cells will be infected with human parainfluenza virus (hPIV). At a suitable time point cells will be harvested and RNA will be isolated. The RT and real time PCR will be performed as described by Hino et al (5). The culture supernatant from siRNA transfected and PIV infected cells will be collected. To measure the virus titer hemagglutination assay will be performed by mixing 2-fold serial dilution of supernatant with 0.05% guinea pig erythrocytes (6). A549 cells will also be first infected with hPIV and transfected with different concentrations of siRNAs at a suitable time point. At a suitable time point cells will be harvested and supernatant collected. RNA will be isolated from cells and RT and real time PCR will be performed with gene specific primer as described above to determine the silencing effect of siRNA. Hemagglutination assay described above as well as virus non-target gene specific primer (eg. Polymerase) in real time PCR will be used to measure the reduction of virus replication.

In vivo analysis

Balb/c mice under anesthesia will be administered intranasaly with 2-10mg/kg dose of modified or unmodified siRNA with or without formulation. Sicontrol from Dharmacon will be administered at the same dose. After a suitable time point animals will be infected with 10 7 human parainfluenza type 3. Animals will be killed at different time point after virus infection and lung tissue will be harvested. Real time PCR will be carried out for viral nucleocapsid gene to determine virus replication (5). We will also utilize lung homogenate to determine virus titers using hemagglutination assay described earlier (6). To study the therapeutic value of siRNAs, animals will be first infected with hPIV-3 and then be administered with hPIV specific siRNA and control siRNA at an optimum time point. Virus infection will be monitored from lung tissue by methods described earlier.

Example 22. generation and testing of siRNAs derived from hMPV virus

Human MPV virus nuclear proteins were identified using the methods above or from publicly available sequences. siRNAs were identified in nucleic acids encoding MPV virus nuclear proteins using the methods described above.

Table 25 lists 19-nucleotide regions that are siRNA Human Metapneumovirus nucleoprotein target sequences. Figure IA demonstrates an alignment of human metapneumovirus nucleoprotein sequences and a consensus sequence derived therefrom. The provided 19 nucleotide region is useful as the sense strand to design a variety of siRNA molecules, optionally having different 3' overhangs in either or both the sense and antisense strands. Thus, one skilled in the art will appreciate that a variety of sense and antisense siRNA sequences may be obtained from each sequence listed in Table 25.

In vitro analysis

Monkey kidney epithelial cell line LLC-MK2 cells will be infected with hMPV at predetermined multiplicity of infection (MOI) at a suitable time point either after or before electroporation with different concentration of siRNAs. Sicontrol from Dharmacon will be used as control siRNA. At a suitable time point cells will be harvested and total RNA will be isolated. Oligo dT will be used as primer to synthesize the first strand cDNA by reverse transcription. Real time PCR will be then carried out following method described by Boivin et al. (1) using virus target gene specific primer to determine the silencing effect of siRNA. Virus non-target gene specific primer (eg. Polymerase) will be also used in real time PCR to measure the reduction of virus replication. Virus titers will be determined in a viral plaque assay following method described by Tripp et al. (2) In vivo analysis

Balb/c mice under anesthesia will be administered intranasaly with 2-10mg/kg dose of modified or unmodified siRNA with or without formulation. Sicontrol from Dharmacon will be administered at the same dose. After a suitable time point animals will be infected with 10 6 human metapneumo virus. Animals will be killed at different time point after virus infection and lung tissue will be harvested. Real time PCR will be carried out for N gene to determine virus replication as described in in vitro assay (1). We will also utilize lung homogenate to determine virus titers in a plaque assay using LLC-MK2 cells (16). To study the therapeutic value of siRNAs, animals will be first infected with hMPV and then be administered with hMPV specific siRNA and control siRNA at a optimum time point. Virus infection will be monitored from lung tissue by methods described earlier.

Example 23. generation and testing of siRNAs derived from RSV virus

RSV virus nuclear proteins were identified using the methods above or from publicly available sequences. siRNAs were identified in nucleic acids encoding RSV virus nuclear proteins using the methods described in Example 2.

Table 26 lists 19-nucleotide regions that are siRNA Human Respiratory syncytial virus nucleoprotein target sequences. The provided 19 nucleotide region is useful as the sense strand to design a variety of siRNA molecules, optionally having different 3' overhangs in either or both the sense and antisense strands. Thus, one skilled in the art will appreciate that a variety of sense and antisense siRNA sequences may be obtained from each sequence listed in Table 26.

In vitro analysis

Lung epithelial cell line A549 cells will be transfected with different concentration of siRNAs (3). Sicontrol from Dharmacon will be used as control siRNA. Lipofectamine 2000 will be used as transfection reagent and manufacturer's instruction will be followed. 4 hours later, transfected A549 cells will be infected with hRSV. At a suitable time point cells will be harvested and RNA will be isolated. The RT and real time PCR will be performed as described above. The culture supernatant from siRNA transfected and RSV infected cells will be serially diluted and the dilution will be used to infect A549 cells to determine virus titers in a viral plaque assay (4). A549 cells will also be first infected with RSV and transfected with different concentrations of siRNAs at a suitable time point. At a suitable time point cells will be harvested and RNA will be isolated. The RT and real time PCR will be performed with gene specific primer as described above to determine the silencing effect of siRNA. Virus non-target gene specific primer (eg. Polymerase) will also be also used in real time PCR to measure the reduction of virus replication. In vivo analysis

Balb/c mice will be under isofluorane anesthesia and be intranasally administered NP-specific, RSV-specific siRNA in PBS at 2mg/kg, 50 ul per mouse, sicontrol from Dharmacon will be administered at the same dose in PBS. 4 days post-infection, the mouse lung will be harvested and homogenized. 10-fold serial dilution of lung homogenate will be used to infect A549 cells. Cytopathic effect will be monitored 3 to 5 days after infection (4). The dilution at which 50% of wells showed cytopathic effect will be determined as TCID50. To study the therapeutic value of siRNAs, animals will be first infected with RSV and then be administered with RSV specific siRNA and control siRNA at a optimum time point. Virus infection will be monitored from lung tissue by methods described earlier.

Example 24. generation and testing of siRNAs derived from coronavirus

Coronavirus nuclear proteins were identified using the methods above or from publicly available sequences. siRNAs were identified in nucleic acids encoding coronavirus virus nuclear proteins using the methods described above.

Table 27 lists 19-nucleotide regions that are siRNA Human Coronavirus nucleoprotein target sequences. The provided 19 nucleotide region is useful as the sense strand to design a variety of siRNA molecules, optionally having different 3' overhangs in either or both the sense and antisense strands. Thus, one skilled in the art will appreciate that a variety of sense and antisense siRNA sequences may be obtained from each sequence listed in Table 27.

Lung epithelial cell line A549 cells will be transfected with different concentration of siRNAs. Sicontrol from Dharmacon will be used as control siRNA. Lipofectamine 2000 will be used as transfection reagent and manufacturer's instruction will be followed. 4 hours later, transfected A549 cells will be infected with coronavirus. At a suitable time point cells will be harvested and RNA will be isolated. The RT and real time PCR will be performed as described before (8). The culture supernatant from siRNA transfected and coronavirus infected cells will be collected. Hemagglutination assay will be performed by mixing 2-fold serial dilution of supernatant will with 0.05% chicken red blood cells (9). A549 cells will also be first infected with human coronavirus and then transfected with different concentrations of siRNAs at a suitable time point. At an optimized time point cells will be harvested and supernatant collected. RNA will be isolated from cells and RT and real time PCR will be performed with gene specific primer as described above to determine the silencing effect of siRNA. Hemagglutination assay described above as well as virus non-target gene specific primer (eg. Polymerase) will be used in real time PCR to measure the reduction of virus replication.

Example 25. generation and testing ofsiRNAs derived from West Nile virus

West Nile virus nuclear proteins were identified using the methods above or from publicly available sequences. siRNAs were identified in nucleic acids encoding West Nile virus nuclear proteins using the methods described above.

Table 28 lists 19-nucleotide regions that are siRNA Human West Nile virus nucleoprotein target sequences. The provided 19 nucleotide region is useful as the sense strand to design a variety of siRNA molecules, optionally having different 3' overhangs in either or both the sense and antisense strands. Thus, one skilled in the art will appreciate that a variety of sense and antisense siRNA sequences may be obtained from each sequence listed in Table 28.

In vitro analysis

For virus stock production, Vero cells will be infected with West Nile virus (WNV) strains obtained from ATCC or CDC at a multiplicity of infection of 0.1-1 and cultured in medium supplemented with 2% FBS. Culture supernatant will be harvested and clarified at 72-96 hour post-infection when 50-70% of cells show cytopathic effects (CPE). The concentration of infectious virus in stocks will be determined by titration on Vero cells in 96-well plates and calculated as ID 5O per ml. One IDs 0 is equivalent to 1 infectious unit (Lu.).

To find out most potent siRNA sequence we will first transfect/electroporate Vero cells or any other suitable cell line such as baby hamster kidney (BHK) cell line (BHK- 21) with different concentrations of a number of siRNAs. Sicontrol from Dharmacon will be used as control siRNA. After a suitable time point following transfection/electroporation cells will be infected with an optimized moi of west nile virus. Cells will be harvested and supernatant collected at different time point post infection. For assessment of the effect of siRNA on virus infection/replication RNA will be isolated from cells and reverse transcribed to make the first strand of cDNA. Real time PCR will be carried out (10) for WNV N gene using N gene specific primers and amplified products will be compared to controls to assess effect of siRNA. Culture supernatants will also be tested in a virus titer assay using published protocols (11). Vero or BHK-21 cells will also be first infected with WNV and transfected with different concentrations of siRNAs at a suitable time point. At a suitable time point cells will be harvested and RNA will be isolated. The RT and real time PCR will be performed with gene specific primer as described above to determine the silencing effect of siRNA. Virus non-target gene specific primer (eg. Polymerase) will also be also used in real time PCR to measure the reduction of virus replication. Virus titer will be assayed using method described above. In vivo analysis

BALB/c mice has been shown to support the WNV replication efficiently. We plan to utilize this strain to evaluate siRNAs found potent in cell culture assay. Specifically we will administer mice 2- 1 Omg/kg dose of modified or unmodified siRNA with or without formulation intravenously. Sicontrol from Dharmacon will be administered at the same dose. After a suitable time point animals will be infected with WNV. Brain tissue and blood will be collected from these animals at different time point and virus levels in blood and brains will be determined by plaque titration on Vero or

BHK-21 cells (11). Apart from this prophylaxis method we also plan to do a therapeutic model where animals will be infected first with the virus and then given siRNA therapy. Virus infection will be monitored in blood and brain tissue by methods described earlier.

Example 26. generation and testing ofsiRNAs derived from Dengue virus

Dengue virus nuclear proteins were identified using the methods above or from publicly available sequences. siRNAs were identified in nucleic acids encoding Dengue virus nuclear proteins using the methods described above.

Table 29 lists 19-nucleotide regions that are siRNA dengue virus nucleoprotein target sequences. The provided 19 nucleotide region is useful as the sense strand to design a variety of siRNA molecules, optionally having different 3' overhangs in either or both the sense and antisense strands. Thus, one skilled in the art will appreciate that a variety of sense and antisense siRNA sequences may be obtained from each sequence listed in Table 29.

In vitro analysis

(DEN 1-4) will be propagated in C6/36 mosquito cell line. To test siRNA efficiency in silencing the gene, we will transfect/electroporate Vero cells with different concentrations of a number ofsiRNAs. Sicontrol from Dharmacon will be used as control siRNA. After a suitable time point following transfection/electroporation cells will be

infected with an optimized moi of dengue virus (DEN 1-4). Cells will be harvested and supernatant collected at different time point post infection. For assessment of the effect of siRNA on virus infection/replication RNA will be isolated from cells and reverse transcribed to make the first strand of cDNA. Real time PCR will be carried out for the dengue nucleocapsid/capsid gene using gene specific primers and amplified products will be compared to non-siRNA transfected infected controls to assess effect of siRNA (1). Virus non-target gene specific primer (eg. preM) will also be also used in real time PCR to measure the reduction of virus replication. Culture supernatants will also be tested in a virus titer assay using Vero cells or BHK-21 cells (12) to determine virus titers. We will also test the effect of these siRNAs in virus infection/replication following DEN infection of Vero cells by methods described above. In vivo analysis

For in vivo evaluation of siRNAs, A/J mice which has been shown to be more susceptible to dengue type 2 (DEN2) infection will be used. Mice will be administered with 2- 1 Omg/kg dose of modified or unmodified siRNA with or without formulation intravenously. Sicontrol from Dharmacon will be administered at the same dose. After a suitable time point animals will be infected with DEN-2 virus intravenously (IxIO 8 p.f.u. per mouse). The effect of siRNA on dengue-2 virus will be detected by real time PCR analysis with dengue-virus-specific primers from RNA extracted from blood (16). We will also perform viral plaque assay from these samples using Vero cells (12). Apart from this prophylaxis method we also plan to do a therapeutic model where animals will be infected first with the virus and then given siRNA therapy. Virus infection will be monitored from brain tissue by methods described earlier.

Example 27. generation and testing of siRNAs derived from Rhinovirus

Rhinovirus nuclear proteins were identified using the methods above or from publicly available sequences. siRNAs were identified in nucleic acids encoding Rhinovirus nuclear proteins using the methods described above

Table 30 lists 19-nucleotide regions that are siRNA rhinovirus- 16 nucleoprotein target sequences. The provided 19 nucleotide region is useful as the sense strand to design a variety of siRNA molecules, optionally having different 3' overhangs in either or both the sense and antisense strands. Thus, one skilled in the art will appreciate that a variety of sense and antisense siRNA sequences may be obtained from each sequence listed in Table 30.

In vitro analysis

The efficacy of the siRNA against rhinovirus infection will be tested in human HeLa cells. HeLa cells will be grown in MEM with Earle's Salts supplemented with 10% fetal bovine serum (FBS) and 1% Pluronic F-68. Cells will be infected with a stock of human rhinovirus 16 at 200 plaque forming units (PFU)/ml. The infected cells will be incubated for 1 h to allow the virus to adsorb to cells. After the incubation cells will be transfected with siRNA molecules. Sicontrol from Dharmacon will be used as control siRNA. After the transfection cells will be harvested and supernatant collected at a suitable time point and virus replication will be assayed by real time PCR (13). Virus non-target gene specific primer (eg. polymerase) will also be also used in real time PCR to measure the reduction of virus replication. The supernatant will be utilized for virus plaque assay on HeLa cell monolayer (14). We also plan to transfect cells first with siRNA and then infect with human rhinovirus 16 to study its effect on inhibiting virus replication following methods described above.

Example 28. generation and testing ofsiRNAs derived from Rotavirus

Rotavirus nuclear proteins were identified using the methods above or from publicly available sequences. siRNAs were identified in nucleic acids encoding rotavirus nuclear proteins using the methods described above.

Table 31 lists 19-nucleotide regions that are siRNA rotavirus (VP6) nucleoprotein target sequences. The provided 19 nucleotide region is useful as the sense strand to design a variety of siRNA molecules, optionally having different 3' overhangs in either or both the sense and antisense strands. Thus, one skilled in the art will appreciate that a variety of sense and antisense siRNA sequences may be obtained from each sequence listed in Table 31.

In vitro analysis

The rhesus monkey kidney cell line MA 104 will be used to propagate the virus in tissue culture and to make large virus stock. MA 104 cells will be infected with an optimized dose of virus before or after transfection of various concentrations of different siRNAs. Sicontrol from Dharmacon at similar concentrations will be used as control siRNA. At a suitable time point cells will be harvested and lysed by two freeze-thaw cycles, and the lysates will be treated with 10 ug/ml of trypsin for 30 min at 37°C. The infectious titer of the viral preparations will be determined by an immunoperoxidase focus assay as described by Pando et al. (15)

The efficacy of siRNA on the in vivo replication of rotaviruses will be evaluated in the mouse model of rotaviral infection. To do this, mice will be administered with 2-

10mg/kg dose of modified or unmodified siRNA with or without formulation orally or intravenously. Sicontrol from Dharmacon will be administered at the same dose. An optimized dose of rotavirus will be given to the mouse by oral gavage before or after siRNA dosing. At the suitable time points, 2 days after viral inoculation for instance, the distal colon of each mouse will be examined for the presence of bright yellow liquid contents that are characteristic of rotavirus -induced diarrhea and the entire intestinal tract will be collected for quantitation of rotaviral antigen by enzyme immunoassay or RT- PCR.

Example 29. In Vitro siRNA Mediated Inhibition of Human Rhinovirus Replication

The present example demonstrates that exemplary siRNA molecules of the present invention effectively inhibit replication of the human rhinovirus (HRV) in vitro. Thirty- three siRNAs (see Table 32 below for sequences) were screened for their ability to inhibit viral replication of either a major or minor ATCC (American Type Culture Collection) HRV. The 33 siRNAs selected to target the HRV represent "conversed" region(s) of the HRV genome. These siRNAs were chosen by analyzing the available Genbank HRV nucleotide sequences and deriving the "conserved" regions among those sequences by alignment. The screen was performed by initially transfecting Ohio HeLa-I cells (OH-I cells) with one of the 33 siRNAs followed by infection with one of the two different human rhinovirus serotypes. Reduction in viral yield was measured with a TICD 50 (Tissue Culture Infective Dose) assay, which estimates viable virus. A decrease in the TICD 50 value indicates a decrease in the size of the viable virus population and thus a reduction in viral replication. In the context of the instant example, a larger decrease in TICD 50 indicates a more potent viral replication inhibiting siRNA.

Table 32. Nucleotide Sequence of 33 siRNAs Targeted against Human Rhinovirus.

For this study, two HRV serotypes were tested: the HRV type 16 that which belongs to the major receptor group, which includes 90% of the 101 numbered HRV serotypes and utilizes the ICAM-I cell surface receptor for entry into cells, and the HRV type IA that belongs to the minor receptor group and uses several members of the low- density lipoprotein receptor superfamily for cell entry. The inhibitory effect of each of the 33 siRNAs on viral yield of the two rhinovirus serotypes, HRV- 16 and HRV-IA, was assessed. All siRNA constructs were stored at a concentration of 5 pmol/μl at -7O 0 C prior to use. In the instant example, pleconaril and ruprintrivir served as positive controls and are known to effectively inhibit viral replication. Pleconaril was applied at either a concentration of 1 μg or 10 μg while ruprintrivir was applied at 0.1 μg. Sicontrol from

Dharmacon™ will be used as negative control siRNA. A viral infection with no siRNA transfection was also performed as a control (labeled "Virus Control 1", "Virus Control 2" and so forth).

Initially, control experiments were performed to optimize the transfection procedure (i.e., cell number and Lipofectamine™ 2000 (Invitrogen) concentration with siRNA)) and the infection procedure (i.e., the impact Lipofectamine™ 2000 would have on rhinovirus replication). To optimize the transfection procedure, the wells on two 24- well plates were seeded with either 75,000 or 100,000 OH-I cells per well. The negative impact (degree of silencing) of various concentrations of Lipofectamine™ 2000 (1 μl/well, 1.4μl/well, or 1.8μl/well) on the efficacy of a positive siRNA control to degrade a target transcript was determined. A negative siRNA control was also used. After transfection, RNA was extracted from the cells and a RT-PCR was performed. Real-time PCR was also used to determine the degree of silencing that occurred during transfection of both the 24-well plates seeded with 75,000 and 100,000 OH-I cells per well. As anticipated, the results indicated that more silencing occurred with smaller cell numbers. Thus, transfections described in the instant example were performed with the larger cells numbers (100,000 OH-I cells).

To optimize the infection procedure, the wells of two 24-well plates were seeded with 50,000; 75,000 or 100,000 OH-I cells per well. The cells on one 24-well plate were infected with 10 TCID 50 of HRV- 16 while the cells on the second plate were infected with 180 TCID 50 HRV- 16. All three different seeded cell number populations received 0 μl, 1 μl, 1.4 μl or 1.8 μl Lipofectamine™ 2000. Samples were then harvested at 24, 48 and 72 hours post-infection to measure viral yield. At 24 hours, in all instances (with or without Lipofectamine™ 2000), the lower inoculum (10 TCID 5 o) of virus yielded lower titers compared to the higher one (180 TCID 50 ). Again, in all instances, at 48 hours, both viral inocula gave higher virus yields compared to yields observed at 24 hours post-infection. Without lipofectamine, the results indicated that, within the range of cell numbers tested, cell numbers do not affect virus yield. Viral yields measured at 24, 48 and 72 hours postinfection with the lipofectamine concentrations tested did not interfere with HRV infection or growth. From the results described above, subsequent infections described in the instant example were performed with the lower viral inoculum (10 TCID 50 ) and transfections were performed with 1.4μl/well Lipofectamine™ 2000.

The transfections were performed as follows: First, 24-well plates were seeded with 100,000 OH-I cells/ml per well in Eagle's minimum essential media (EMEM) with

5% fetal bovine serum, 5% fetal clone serum, 1% L-glutamine and antibiotics. The cells were incubated overnight at 37°C in a 5% CO 2 incubator. The cell sheet confluency at the time of siRNA transfection was approximately 50-60%. Second, following the overnight incubation, a working Lipofectamine™ 2000 (LF2K) stock solution for each siRNA was made. These stock solutions were made in a 96-well plate prior to applying the solution to the OH-I cells on the 24-well plates. Each LF2K stock solution had a total volume of 50 μl/well and included 1.4 μl of LF2K and 48.6 μl Optimem; the solution was vortexed gently and placed on ice until use. Third, 200 μl of LF2K-Optimem dilution was added to each well of rows A, C and E of a 96-deep-well plate. A volume of 180 μl of Optimem was added to each well of rows B, D, and F. The siRNAs solutions were vortexed, briefly spun in a micro-centrifuge, and then placed on ice. A 20 μl sample of each siRNA was added to the appropriate wells in rows B, D, and F of the 96-deep-well plate and gently mixed by pipetting up and down three times. Thus, at this point of the transfection protocol, the wells in rows B, D and F have a volume of 200 μl (180 μl Optimem plus 20 μl siRNA solution). The liquid in the wells in row A was then transferred to the corresponding wells in row B, the liquid in the wells in row C was then transferred to the corresponding wells in row D, and the liquid in the wells in row E was transferred to the corresponding wells in row F. The result of transferring the liquid created a siRNA- LF2K mixture in the wells in rows B, D and F. After transferring the liquid from each row, the wells were mixed by pipetting up and down five times (pipette tips were changed routinely to avoid cross-contamination). The 96-deep-well plate was then covered tightly and incubated at room temperature for 30 minutes. Finally, after the 30 minute incubation, the 24-well plates with the OH-I cells were removed from the incubator and the media aspirated from all wells. A 400 μl sample of 10% EMEM, without antibiotics, was added to each well followed by the addition of 100 μl of a siRNA-LF2K mixture. Each siRNA was tested in triplicate and therefore three separate wells on the 24-well plate received a 100 μl siRNA-LF2K mixture containing the same siRNA. The transfection plates (24-well plates) were then incubated at 37°C, 5% CO 2 for five hours. Following the five hour transfection incubation, all media was aspirated from all wells, and tips were changed between each different siRNA-LF2K mixture to avoid cross-contamination. The cells were then infected with either of the two HRV serotypes by adding lOOμl of 2X virus (10 6 dilution used for HRV-16 and 10 5 5 dilution used for HRV-IA) and lOOμl of 2% McCoy's without antibiotics or MgCl 2 to each well. The virus was incubated with the OH-I cells for one hour at 34°C. The viral media was then

aspirated from the wells and all wells were rinsed two times with Hanks balanced salt solution (HBSS) and re-fed with 1.5 ml 2% McCoy's media with antibiotics and MgCl 2 . Wells on the 24- well plate that received only pleconaril or raprintrivir were re-fed with 750μl 2X drug and 750μl 2% McCoy's and incubated for 24 hours at 34°C. The total volume of supernatant was 1.5 ml per well. Cells that were cultured with either pleconaril or raprintrivir were not transfected with a siRNA.

A 400μl supernatant sample was harvested from each well, representing each siRNA and the controls, at 24, 48, and 72 hours post-transfection and pooled for later titration. The samples were stored at -7O 0 C. Serial 10-fold dilutions were made for the 24 and 72 hour harvests and tittered in quadruplicate (4 wells per dilution, lOOμl per well). The tittered samples were incubated with OH-I cells cultured in 96-well plates. Over a seven day time period, OH-I cells were observed for cytopathic effect (CPE). The 24 hour harvests were tested at dilutions of 10° to 10 5 and the 72 hour harvests were tested at dilutions of 10° to 10 7 . The collected virus used for viral yield reduction assay was frozen and a back titer done to determine the inoculum in TCID 50 ; the target inoculum was between 10 to 32 TCID 50 per monolayer.

The results are summarized in Tables 33, 34, 35. Table 33 summarizes the data of siRNAs 1 through 25 transfected into OH-I cells and later infected with the HRV- 16 serotype. Table 34 summarizes the data of siRNAs 1 through 25 transfected into OH-I cells later and infected with the HRV-IA serotype. Table 35 summarizes the data of siRNAs 21 and 24 through 33 transfected into OH-I cells and later infected with either the HRV-16 serotype or the HRV-IA serotype. The data presented in Tables 33 and 34 are categorized generally into four different groups. Each group represents a subset of siRNAs that were subjected to the transfection and infection protocols together, in connection with a control (i.e., virus control 1 with group 1, virus control 2 with group 2 and so on). Each group was subjected to the same transfection and infection protocols and had the same virus control. The groups only differ in that the subsets of tested siRNAs of each group were not tested together. The virus control for each group was used as the baseline for comparison to determine if a siRNA within the group reduced viral replication. Similarly, the data represented in Table 35 is categorized generally into four different groups where each group is separated by a thin double line. The four groups in Table 35 are not associated at all by label with the four groups in Tables 33 and 34. The four groups in Table 35 are represented by the virus control and the HRV

serotype used. For example, one group contains the "Virus Control 1" with the HRV- 16 serotype.

A TCID 50 of between 10 and 32 TCID 50 was achieved. In all instances, the positive controls pleconaril and ruprintrivir were inhibitory with reductions of 2 log 10 or more at 24 hours and over 5 log 10 at 72 hours compared to virus controls. For the instant example, a siRNA is considered to have significantly reduced viral yield when it reduces viral titer by more than 1.0 log 10 compared to the virus control. Higher reductions are more likely to indicate specific inhibition. Of the 33 siRNAs tested against HRV- 16, six siRNAs (10, 13, 14, 15, 17 and 18) exhibited significant reductions in viral yield (1.5 to 1.75 Iog 10 /ml) at 24 hours, compared to the virus control (Tables 33 and 35). At 72 hours, 12 siRNAs (7 through 17 and 19) reduced viral titer by 1.5 to 2.0 Iog 10 /ml and one siRNA (18) reduced viral titer by 2.5 log 10 .

Of the 33 constructs tested against HRV-IA, there were two siRNAs (7 and 8) that exhibited a 1.25 log 10 reduction in titer compared to the virus control at 24 hours, and two siRNAs (4 and 5) that reduced viral titer by 1.5 and 1.75 log 10 , respectively at 72 hours (Tables 34 and 35). Further, siRNA number 6 reduced viral titer by 1.25 logio at 72 hours. Of the remaining siRNAs tested, none demonstrated a substantial inhibition of either HRV-16 or HRV-IA.

Table 33. In Vitro Reduction of Human Rhinovirus-16 Titer Mediated by siRNA

Virus Control is a viral infection of cells with no siRNA transfection.

Table 34. In Vitro Reduction of Human Rhinovirus-IA Titer Mediated by siRNA

Table 35. In Vitro Reduction of Human Rhinovirus-1A or Human Rhinovirus-16 Titer Mediated by siRNA

In summary, both positive controls, pleconaril at lμg and ruprintrivir at lOμg, as expected, significantly inhibited both serotypes of HRV at both 24 hours and 72 hours. The siRNAs 10, 13-15, 17 and 18 at 24 hours and the siRNAs 7 through 19 at 72 hours reduced HRV- 16 viral titers by at least 1.0 log 10 compared to the virus control. The siRNAs 7 and 8 at 24 hours and siRNAs 4 through 6 at 72 hours reduced HRV-IA viral titers by at least 1.0 loglO compared to the virus control.

The data in Table 33, 34 and 35 indicate that select siRNAs can effectively reduce viral replication of the human rhino virus in vitro.

Example 30: In Vitro siRNA Mediated Degradation of Human Metapneumovirus RNA

The present example demonstrates that exemplary siRNAs of the present invention significantly reduce the RNA copy number of a human metapneumovirus (hMPV) target transcript (RNA) in vitro. A reduction in viral RNA copy number of a particular viral transcript correlates with a reduction in virus replication. Consequently, the reduction in viral replication minimizes and/or prevents the production of new viral particles, thus, reducing, if not inhibiting, re-infection and minimizing the viral induced pathology in a patient.

The instant example describes an initial screen of 200 siRNAs performed with a dual-luciferase assay system to identify siRNAs that effectively reduce target RNA levels as measured indirectly by a reduction in Renilla luciferase activity. Moreover, the instant example describes a secondary screen of 57 identified siRNAs selected from the initial screen of the 200 siRNAs in order to further characterize those siRNAs that effectively reduce target hMPV RNA levels directly. Table 36 below describes the 200 siRNAs subject to the initial dual-luciferase assay screen. The 200 siRNAs selected to target hMPV RNA represent "conversed"

region(s) of one of the following hMPV genes: N gene, P gene, M gene, F gene, M2-1 gene, M2-2 gene or the L gene. These siRNAs were chosen by analyzing the available Genbank nucleotide sequences of the hMPV genome (Accession # AY297748.1) and deriving the "conserved" regions among those sequences by alignment. For the initial screen, each siRNA was tested in triplicate at 10 nM concentration. Table 36. Nucleotide Sequence of siRNAs Screened by the Dual-Luciferase Assay.

2S 5

Table 37 below summarizes the results of the dual-luciferase assay. The firefly luciferase represents the internal transfection control and the luciferase assay control (i.e., the firefly luciferase transcript is not a target for the siRNA). The renilla luciferase transcript is fused with a hMPV target sequence. A reduction in renilla luciferase activity indicates a decrease in the number of renilla luciferase/hMPV target sequence transcripts within the cell. Thus, according to the assay, a greater reduction in measured renilla luciferase activity is indicative of a more potent siRNA. siRNA potency is expressed as percent silencing (% silencing) in Table 37 and was computed as 100 x [1 - (mean Renilla siRNA/mean Firefly siRNA)/(mean Renilla sicontrol/mean Firefly sicontrol)]. A higher percentage correlates with a more potent siRNA.

Table 37. Dual-Luciferase Assay Results for 200 siRNA Screened for Target RNA Degradation Efficacy.

26Q

ND is no data

For the purposes of the instant example and the data presented in Table 37, a siRNA exhibiting 40% or greater silencing of the target Renilla lucif erase was considered an effective siRNA. Based on the data in Table 37, the following 57 siRNAs were considered effective siRNAs: 1, 6, 17, 18, 28, 32, 33, 34, 45, 47, 48, 59, 60, 64, 66, 70, 73, 78, 80, 81, 85, 87, 88, 89, 90, 93, 95, 98, 99, 102, 105, 118, 121, 122, 126, 130, 138, 139, 141, 143, 145, 146, 147, 148, 149, 150, 151, 156, 157, 158, 163, 164, 168, 169, 184, 197 and 198. The 57 identified effective siRNAs were further characterized in a secondary screen to determine which of the 57 effectively induced degradation of a targeted hMPV transcript directly.

Fifty-seven siRNAs (see Table 38 below for sequences) were screened for their ability to induce degradation of a targeted hMPV RNA transcribed from one of the following target viral genes: N gene, P gene, M gene, F gene, M2-1 gene, M2-2 gene or the L gene. In general, the methods used to transfect siRNA and infect cells in culture for the instant example were disclosed above.

Sicontrol from Dharmacon™ was used as a control siRNA. Viral RNA copy number was determined with quantitative real-time PCR by comparing the replication kinetics of RNA isolated from hMPV infected cells transfected with siRNA to that of a known standard, i.e., the replication kinetics generated from a real-time PCR performed with a template with known copy number (Dcffrasnes C, ct al., J Clin. Microbiol. 43, 488-90 (2005)). Viral RNA was extracted from 140 μl of infected cell culture supernalanls and used as the starting material for the reverse transcriptase and real-time PCR. A lower viral RNA copy number indicates a more potent siRNA. Further, a greater percent inhibition indicates a more potent siRNA.

Table 38. Nucleotide Sequence of 57 siRNAs Targeted against a Human Metapneumovirus Gene Transcript.

The data in Table 98 summarizes the individual effect of 57 different siRNAs on the viral RNA copy number of a siRNA targeted hMPV transcript in cells after transfection with 10 nM siRNA. Each "PCR Results Group" represents the results from a

subset of the 57 siRNAs that were analyzed by the real-time PCR together. For the data in Table 39, there are ten "PCR Results Group." The mean viral RNA copy number was derived from three separate PCRs. The potency of each siRNA to reduce viral RNA copy number was expressed as percent inhibition ("% Inhibition")- To calculate the percent inhibition ("% Inhibition") for each of the 57 different siRNAs, the mean viral RNA copy number for each siRNA within a "PCR Results Group" was divided by the mean viral RNA copy number for the sicontrol of that same "PCR Results Group." This number is represented in Table 39 under the column titled "siRNA: sicontrol Ratio." This number was converted into a percentage by multiplying by 100 and then subtracting that product

10 from 100% to derive the percent inhibition for each siRNA (refer to the column titled "% Inhibition"). A higher percent inhibition indicates that the siRNA has a greater ability to reduce viral RNA copy number of the targeted hMPV transcript and therefore, is likely a more potent inhibitor of viral replication. A negative percent inhibition indicates an increase in viral RNA copy number compared to the control siRNA.

15

Table 39. Effect of 57 Different siRNAs on RNA Copy Number of a hMPV Targeted RNA in Cells After Transfection with 10 nM siRNA.

ND is no data; SD is standard deviation

The data in Table 39 shows that 19 siRNAs showed 50% or more suppression of viral gene transcripts at 10 nM concentration, including 6 siRNAs targeting the N 5 transcript, 2 siRNAs targeting the P transcript, 1 siRNA targeting the M transcript, 1 siRNA targeting the M2 transcript and 9 siRNAs targeting the L transcript. The No siRNA control among the ten PCR Results Groups exhibited a percent inhibition ranging from -384% to 63%. The viral RNA copy number value for the sicontrol served as the baseline for each PCR Results Group and was normalized to 0% inhibition. The 57

10 siRNAs transfected at 10 nM concentration showed a percent inhibition ranging from - 186% to 97% indicating that identifying siRNAs capable of reducing the RNA copy number of a target transcript requires more than a selection based on the nucleotide sequence of the "conserved" region of a viral target transcript. siRNAs demonstrating a percent inhibition of 50% or higher were considered to be effective at reducing viral RNA

15 copy number at 10 nM concentration (the two No siRNA controls that showed 60% and 63% were outliers and not considered). siRNAs exhibiting 50% or greater inhibition include siRNAs 1, 6, 32, 33, 45, 48, 59, 60, 70, 98, 118, 126, 143, 149, 150, 151, 163, 164, 168 and 197.

2β7

The data in Table 40 shows the individual effect of 57 different siRNAs on the viral RNA copy number of the siRNA targeted hMPV transcript in cells after transfection with 100 nM siRNA (10-fold greater siRNA compared to prior data set, Table 39). Each PCR Results Group represents the results from a subset of the 57 siRNAs that were analyzed by the real-time PCR together. For the data in Table 40, there are six PCR Results Group. The efficacy of each siRNA to reduce viral RNA copy number was expressed as percent inhibition ("% Inhibition"). The mean viral RNA copy number was derived from two separate PCRs. To calculate the percent inhibition (% Inhibition) for each of the 57 different siRNAs, the mean viral RNA copy number for each siRNA 0 within a PCR Results Group was divided by the mean viral RNA copy number for the No siRNA of that same PCR Results Group. This number is represented in Table 40 under the column titled "siRNA:No siRNA Ratio." This number was converted into a percentage by multiplying by 100 and then subtracting that product from 100% to derive the percent inhibition for each siRNA (refer to the column titled "% Inhibition"). A 5 higher percent inhibition indicates that the siRNA has a greater ability to reduce viral RNA copy number of the targeted hMPV transcript and therefore, is likely a more potent inhibitor of viral replication. A negative percent inhibition indicates an increase in viral RNA copy number compared to the sicontrol.

0 Table 40. Effect of 57 Different siRNAs on RNA Copy Number of a hMPV Targeted Transcript in Cells After Transfection with 100 nM siRNA

ND is no data; SD is standard deviation

The data in Table 40 shows that 27 siRNAs showed 50% or more suppression of hMPV viral gene expression in vitro, including 5 siRNAs targeting the N transcript, 2 siRNAs targeting the P transcript, 2 siRNAs targeting the F transcript, 3 siRNAs targeting the M2 transcript and siRNAs targeting the 15 L transcript. In general, the quantitative PCR results from the cell transfection/virus infection shown in Table 43 were consistent with the luciferase reporter assay results. The sicontrol among the five PCR Results Groups exhibited a percent inhibition ranging from -87% to 32%. This is in contrast to the sicontrol in Table 42, which served as the baseline value for calculating the percent inhibition. In this data set, the No siRNA served as the baseline for each PCR Results Group and was normalized to 0% inhibition. The 57 siRNAs transfected at 100 nM concentration showed a percent inhibition ranging from -208% to 97%, again, indicating that identifying siRNAs capable of reducing the RNA copy number of a target viral transcript requires more than a selection based on the nucleotide sequence of the "conserved" region of a viral target gene. Based on the percent inhibition observed with the sicontrol (i.e., 32% inhibition), only siRNAs demonstrating a percent inhibition of 50% or higher were considered to be effective at reducing viral RNA copy number at a 100 nM concentration. siRNAs exhibiting 50% or greater inhibition include siRNAs 1, 32, 33, 45, 47, 59, 60, 87, 89, 98, 102, 105, 118, 122, 126, 130, 149, 150, 151, 157, 158, 163, 164, 168, 184, 197 and 198.

The data in Tables 39 and 40 demonstrate that select siRNAs at either 10 nM or 100 nM concentrations, or both concentrations exhibit the ability to effectively reduce viral RNA copy number of a target hMPV transcript.

Example 31: Multi-viral Targeting

The present example demonstrates that siRNA capable of mediating signficant degradation of hMPV target RNA can also degrade RSV viral RNA. The general cell

culture and trasnfection protocols described earliear were used. Viral RNA copy number was determined with quantitative real-time PCR by comparing the replication kinetics of RNA isolated from RSV infected cells transfected with siRNA to that of a known standard, i.e., the replication kinetics generated from a real-time PCR performed with a template with known copy number (Deffrasnes C, et al., J Clin. Microbiol. 43, 488-90 (2005)). A lower viral RNA copy number indicates a more potent siRNA. Further, a greater percent inhibition indicates a more potent siRNA.

The siRNAs 118, 126, 150, 151 and 158 that exhibited the ability to effectively reduce viral copy number of the L transcript from hMPV were assessed for their ability to 0 do the same against a RSV target RNA. Each siRNA was transfected at 10 nM concentration. The data is shown below in Table 41.

Table 41. Effect of Six Different siRNAs on RNA Copy Number of a RS V Targeted Transcript in Cells After Transfection with 10 nM siRNA

5

The data in Table 41 shows that siRNA 158 exhibits the ability to degrade a RSV transcript as measured by an approximate 20% inhibition of RSV viral copy number. These data indicate that a multi-viral targeting siRNA capable of degrading viral transcripts from a wide variety of virus families can be made.

While the present invention has been described with reference to the- specific embodiments thereof, it should be understood by those skilled in the art that various changes may be made and equivalents may be substituted without departing from the true spirit and scope of the invention. In addition, many modifications may be made to adapt a particular situation, material, composition of matter, process, process step or steps, to the objective, spirit and scope of the present invention. All such modifications are intended to be within the scope of the claims appended hereto.