METHODS AND COMPOSITIONS FOR DETECTING A TARGET RNA

Document Type and Number:

WIPO Patent Application WO/2017/218573

Kind Code:

A1

Abstract:

The present disclosure provides methods for detecting a single-stranded target RNA. The present disclosure provides methods of cleaving a precursor C2c2 guide RNA array into two or more C2c2 guide RNAs. The present disclosure provides a kit for detecting a target RNA in a sample.

Inventors:

DOUDNA JENNIFER A (US)
O'CONNELL MITCHELL RAY (US)
EAST-SELETSKY ALEXANDRA (US)
KNIGHT SPENCER CHARLES (US)
DOUDNA CATE JAMES HARRISON (US)

Application Number:

PCT/US2017/037308

Publication Date:

December 21, 2017

Filing Date:

June 13, 2017

Assignee:

UNIV CALIFORNIA (US)

International Classes:

A61K38/00; C07K14/47; C12N9/90; C12N15/11; C12N15/113; G01N33/50

Domestic Patent References:

WO2016205764A1	2016-12-22
WO2016094867A1	2016-06-16
WO2001042505A2	2001-06-14
WO2001086001A1	2001-11-15
WO1998039352A1	1998-09-11
WO1999014226A2	1999-03-25

Foreign References:

US8815782B2	2014-08-26
US8822673B2	2014-09-02
US8586718B2	2013-11-19
US20140378330A1	2014-12-25
US20140349295A1	2014-11-27
US20140194611A1	2014-07-10
US20130323851A1	2013-12-05
US20130224871A1	2013-08-29
US20110223677A1	2011-09-15
US20110190486A1	2011-08-04
US20110172420A1	2011-07-14
US20060179585A1	2006-08-17
US20030003486A1	2003-01-02
US5489677A	1996-02-06
US5602240A	1997-02-11
US5034506A	1991-07-23
US5539082A	1996-07-23
US5714331A	1998-02-03
US5719262A	1998-02-17
US3687808A	1972-08-29

Other References:

ABUDAYYEH ET AL.: "C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector", SCIENCE, vol. 353, no. 6299, 2 June 2016 (2016-06-02), pages 1 - 16, XP055407082
EAST-SELETSKY ET AL.: "Two distinct RNase activities of CRISPR-C2c2 enable guide-RNA processing and RNA detection", NATURE, vol. 538, no. 7624, 13 October 2016 (2016-10-13), pages 270 - 273, XP055407060
ABUDAYYEH ET AL., SCIENCE, vol. 353, no. 6299, 5 August 2016 (2016-08-05), pages aaf5573
SAMBROOK, J.; FRITSCH, E. F.; MANIATIS, T.: "Molecular Cloning: A Laboratory Manual", 2001, COLD SPRING HARBOR LABORATORY PRESS
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 - 410
ZHANG; MADDEN, GENOME RES., vol. 7, 1997, pages 649 - 656
SMITH; WATERMAN, ADV. APPL. MATH., vol. 2, 1981, pages 482 - 489
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 - 10
"Short Protocols in Molecular Biology", 1999, JOHN WILEY & SONS
BOLLAG ET AL.: "Protein Methods", 1996, JOHN WILEY & SONS
MARTIN ET AL., HELV. CHIM. ACTA, vol. 78, 1995, pages 486 - 504
"Immunology Methods Manual", 1997, ACADEMIC PRESS
SINGH ET AL., CHEM. COMMUN., vol. 4, 1998, pages 455 - 456
SHAH ET AL., DEVELOPMENT, vol. 143, 2016, pages 2862 - 2867
XU ET AL., ANGEW CHEM INT ED ENGL., vol. 46, no. 19, 2007, pages 3468 - 70
XIA, PROC NATL ACAD SCI USA., vol. 107, no. 24, 15 June 2010 (2010-06-15), pages 10837 - 41
BAKSH, NATURE, vol. 427, no. 6970, 8 January 2004 (2004-01-08), pages 139 - 41
ROTHBERG, NATURE, vol. 475, no. 7356, 20 July 2011 (2011-07-20), pages 348 - 52
BAJAR ET AL., SENSORS (BASEL)., vol. 16, 14 September 2016 (2016-09-14), pages 9
ABRAHAM ET AL., PLOS ONE., vol. 10, no. 8, 3 August 2015 (2015-08-03), pages e0134436
BAO ET AL., ANNU REV BIOMED ENG., vol. 11, 2009, pages 25 - 47
DWAINE A. BRAASCH; DAVID R. COREY, BIOCHEMISTRY, vol. 41, no. 14, 2002, pages 4503 - 4510
WANG ET AL., J. AM. CHEM. SOC., vol. 122, 2000, pages 8595 - 8602
WAHLESTEDT ET AL., PROC. NATL. ACAD. SCI. U.S.A., vol. 97, 2000, pages 5633 - 5638
KOSHKIN ET AL., TETRAHEDRON, vol. 54, 1998, pages 3607 - 3630
"The Concise Encyclopedia Of Polymer Science And Engineering", 1990, JOHN WILEY & SONS, pages: 858 - 859
ENGLISCH ET AL., ANGEWANDTE CHEMIE, INTERNATIONAL EDITION, vol. 30, 1991, pages 613
SANGHVI, Y. S.: "Antisense Research and Applications", 1993, CRC PRESS, pages: 276 - 278
BAO ET AL., ANNU REV BIOMED ENG, vol. 11, 2009, pages 25 - 47
NIEWOEHNER, O.; JINEK, M.: "Structural basis for the endoribonuclease activity of the type III-A CRISPR-associated protein Csm6", RNA, vol. 22, 2016, pages 318 - 329, XP055619505, DOI: 10.1261/rna.054098.115
STERNBERG, S. H.; HAURWITZ, R. E.; DOUDNA, J. A.: "Mechanism of substrate selection by a highly specific CRISPR endoribonuclease", RNA, vol. 18, 2012, pages 661 - 672
STAMATAKIS, BIOINFORMATICS, vol. 30, no. 9, 1 May 2014 (2014-05-01), pages 1312 - 3
KATOH; STANDLEY, MOL BIOL EVOL., vol. 30, no. 4, April 2013 (2013-04-01), pages 772 - 80
MCWILLIAM ET AL., NUCLEIC ACIDS RES., vol. 41, July 2013 (2013-07-01), pages W597 - 600
STERNBERG ET AL., RNA, vol. 18, no. 4, April 2012 (2012-04-01), pages 661 - 72
LIU ET AL., CELL, vol. 168, no. 1-2, 12 January 2017 (2017-01-12), pages 121 - 134
SHMAKOV ET AL., MOL CELL, vol. 60, no. 3, 5 November 2015 (2015-11-05), pages 385 - 97
BURSTEIN ET AL., NAT COMMUN., vol. 7, 3 February 2016 (2016-02-03), pages 10613
See also references of EP 3471749A4

Attorney, Agent or Firm:

BORDEN, Paula, A. (US)

Claims:

CLAIMS

What is claimed is:

1. A method of detecting a single stranded target RNA in a sample comprising a plurality of RNAs, the method comprising:

a) contacting the sample with:

(i) a C2c2 guide RNA that hybridizes with the single stranded target RNA; and

(ii) a C2c2 protein that cleaves RNAs present in the sample; and

b) measuring a detectable signal produced by C2c2 protein-mediated RNA cleavage.

2. A method of detecting a single stranded target RNA in a sample comprising a plurality of RNAs, the method comprising:

(a) contacting the sample with:

(i) a precursor C2c2 guide RNA array comprising two or more C2c2 guide RNAs each of which has a different guide sequence; and

(ii) a C2c2 protein that cleaves the precursor C2c2 guide RNA array into individual C2c2 guide RNAs, and also cleaves RNAs of the sample; and

(b) measuring a detectable signal produced by C2c2 protein-mediated RNA cleavage.

3. The method according to claim 1 or claim 2, wherein the C2c2 protein cleaves at least 50% of the RNAs present in the sample within 1 hour of said contacting.

4. The method according to claim 3, wherein the C2c2 protein cleaves at least 50% of the RNAs present in the sample within 40 minutes of said contacting.

5. The method according to claim 4, wherein the C2c2 protein cleaves at least 50% of the RNAs present in the sample within 5 minutes of said contacting.

6. The method according to claim 5, wherein the C2c2 protein cleaves at least 50% of the RNAs present in the sample within 1 minute of said contacting.

7. The method according to claim 1, wherein the C2c2 protein cleaves from 50% to more than 90% of the RNAs present in the sample within 1 minute of said contacting.

8. The method according to any one of claims 1-7, wherein the minimum concentration at which the single stranded target RNA can be detected is in a range of from 500 fM to 1 nM.

9. The method according to any one of claims 1-7, wherein the single stranded target RNA can be detected at a concentration as low as 800 fM.

10. The method according to any of claims 1-9, wherein the C2c2 protein is not a Leptotrichia shahii (Lsh) C2c2 protein comprising an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO: 3.

11. The method according to claim 10, wherein the C2c2 protein cleaves non-target RNA at least 1.2-fold more efficiently than a Leptotrichia shahii (Lsh) C2c2 protein comprising at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:3.

12. The method according to claim 11, wherein the C2c2 protein cleaves non-target RNA at least 1.5-fold more efficiently than a Leptotrichia shahii (Lsh) C2c2 protein comprising the amino acid sequence set forth in SEQ ID NO:3.

13. The method according to any of claims 1-9, wherein the C2c2 protein comprises an amino acid sequence having 80% or more amino acid sequence identity with the amino acid sequence set forth in any one of SEQ ID NOs: 1, 2, or 4-6.

14. The method according to any of claims 1-9, wherein the C2c2 protein comprises an amino acid sequence having 80% or more amino acid sequence identity with the Leptotrichia buccalis (Lbu) C2c2 amino acid sequence set forth in SEQ ID NO: 2.

15. The method according to any of claims 1-9, wherein the C2c2 protein comprises an amino acid sequence having 80% or more amino acid sequence identity with the Listeria seeligeri C2c2 amino acid sequence set forth in SEQ ID NO: 1.

16. The method according to any of claims 1-9, wherein the C2c2 protein comprises the amino acid sequence set forth in any one of SEQ ID NOs: 1-2 and 4-6.
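The sequence identity thresholds recited in claims 10-16 (for example, 80% or more) can be illustrated with a minimal sketch. It assumes the two amino acid sequences have already been aligned to equal length, with "-" marking gaps, and it counts identity over every aligned column that is not a gap in both sequences. This is only one common convention; alignment tools may define percent identity differently. The function names and the toy sequences are hypothetical and are not taken from any SEQ ID NO.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity between two pre-aligned sequences.

    Assumption: both strings come from the same alignment, so they have
    equal length and use '-' for gaps. Columns that are gaps in both
    sequences are ignored; a gap opposite a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # skip columns that carry no residue at all
        columns += 1
        if a == b:
            matches += 1
    if columns == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / columns


def meets_identity_threshold(aligned_a, aligned_b, threshold=80.0):
    """True if percent identity is at least `threshold` (e.g. 80%)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Toy example (hypothetical sequences): 5 of 6 columns match.
print(meets_identity_threshold("MKVTKV", "MKVTRV"))
```

In practice the alignment itself would come from a tool such as BLAST or a Smith-Waterman implementation; the sketch only covers the final counting step.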

17. The method according to claim 1, wherein the C2c2 protein comprises an amino acid sequence having at least 80% amino acid sequence identity to the C2c2 amino acid sequence set forth in SEQ ID NO:2, and comprises a substitution of one or more of R472, H477, R1048, and H1053.

18. The method according to claim 1, wherein the C2c2 protein comprises an amino acid sequence having at least 80% amino acid sequence identity to the C2c2 amino acid sequence set forth in SEQ ID NO:2, and comprises a substitution of amino acids R472 and H477.

19. The method according to claim 1, wherein the C2c2 protein comprises an amino acid sequence having at least 80% amino acid sequence identity to the C2c2 amino acid sequence set forth in SEQ ID NO:2, and comprises a substitution of amino acids R1048 and H1053.

20. The method according to claim 1, wherein the C2c2 protein comprises an amino acid sequence having at least 80% amino acid sequence identity to the C2c2 amino acid sequence set forth in SEQ ID NO:2, and comprises a substitution of amino acids R472, H477, R1048, and H1053.

21. The method according to any one of claims 1-20, wherein the sample is contacted for 2 hours or less prior to said measuring.

22. The method according to claim 21, wherein the sample is contacted for 60 minutes or less prior to said measuring.

23. The method according to claim 22, wherein the sample is contacted for 30 minutes or less prior to said measuring.

24. The method according to claim 23, wherein the sample is contacted for 10 minutes or less prior to said measuring.

25. The method according to claim 24, wherein the sample is contacted for 1 minute or less prior to said measuring.

26. The method according to any one of claims 1-25, comprising determining an amount of target RNA present in the sample.

27. The method according to claim 26, wherein said determining comprises:

measuring the detectable signal to generate a test measurement;

measuring a detectable signal produced by a reference sample to generate a reference measurement; and

comparing the test measurement to the reference measurement to determine an amount of target RNA present in the sample.

28. The method according to claim 26, comprising:

measuring the detectable signal to generate a test measurement,

measuring a detectable signal produced by each of two or more reference samples, wherein the two or more reference samples each include a different amount of a positive control RNA, to generate two or more reference measurements, and

comparing the test measurement to the two or more reference measurements to determine an amount of target RNA present in the sample.
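The comparison recited in claims 27 and 28 can be sketched as a standard-curve lookup. The following is a minimal illustration, assuming the detectable signal rises monotonically with the amount of positive control RNA and that linear interpolation between neighboring reference samples is acceptable. The function name, units, and numbers are hypothetical; the claims do not prescribe any particular calculation.

```python
def estimate_target_amount(test_signal, references):
    """Estimate target RNA amount from a test measurement.

    `references` is a list of (amount, signal) pairs, one per reference
    sample containing a known amount of positive control RNA.
    Assumption: signal increases with amount, so sorting by signal also
    orders the amounts; values outside the calibrated range are rejected
    rather than extrapolated.
    """
    if len(references) < 2:
        raise ValueError("need two or more reference measurements")
    points = sorted(references, key=lambda p: p[1])  # order by signal
    signals = [s for _, s in points]
    if len(set(signals)) != len(signals):
        raise ValueError("reference signals must be distinct")
    if not signals[0] <= test_signal <= signals[-1]:
        raise ValueError("test signal is outside the calibrated range")
    # Find the bracketing pair of references and interpolate linearly.
    for (a0, s0), (a1, s1) in zip(points, points[1:]):
        if s0 <= test_signal <= s1:
            fraction = (test_signal - s0) / (s1 - s0)
            return a0 + fraction * (a1 - a0)


# Hypothetical reference samples: (amount in pM, fluorescence units).
refs = [(0.0, 100.0), (10.0, 300.0), (100.0, 1200.0)]
print(estimate_target_amount(200.0, refs))
```

With a single reference sample, as in claim 27, the comparison reduces to checking whether the test measurement is above or below that one reference measurement.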

29. The method according to any one of claims 1-28, wherein the sample comprises from 5 to 10⁷ RNAs that differ from one another in sequence.

30. The method according to any one of claims 1-28, wherein the sample comprises from 10 to 10⁶ RNAs that differ from one another in sequence.

31. The method according to any one of claims 1-30, wherein the sample comprises RNAs from a cell lysate.

32. The method according to any one of claims 1-31, wherein measuring a detectable signal comprises one or more of: gold nanoparticle based detection, fluorescence polarization, colloid phase transition/dispersion, electrochemical detection, and semiconductor-based sensing.

33. The method according to any one of claims 1-32, wherein (i) the method comprises contacting the sample with a labeled detector RNA comprising a fluorescence-emitting dye pair, (ii) the C2c2 protein cleaves the labeled detector RNA, and (iii) the detectable signal is produced by the fluorescence-emitting dye pair.

34. The method according to claim 33, wherein the labeled detector RNA produces an amount of detectable signal prior to being cleaved, and the amount of detectable signal is reduced when the labeled detector RNA is cleaved.

35. The method according to claim 33, wherein the labeled detector RNA produces a first detectable signal prior to being cleaved and a second detectable signal when the labeled detector RNA is cleaved.

36. The method according to claim 35, wherein the labeled detector RNA comprises a fluorescence-emitting dye pair.

37. The method according to any one of claims 33-36, wherein the labeled detector RNA comprises a fluorescence resonance energy transfer (FRET) pair.

38. The method according to claim 33, wherein a detectable signal is produced when the labeled detector RNA is cleaved.

39. The method according to claim 33, wherein an amount of detectable signal increases when the labeled detector RNA is cleaved.

40. The method according to claim 38 or claim 39, wherein the labeled detector RNA comprises a quencher/fluor pair.

41. The method according to any of claims 33-40, wherein the labeled detector RNA comprises a modified nucleobase, a modified sugar moiety, and/or a modified nucleic acid linkage.

42. The method according to any one of claims 1-41, wherein said contacting is carried out in an acellular sample.

43. The method according to any one of claims 1-41, wherein said contacting is carried out in a cell in vitro, ex vivo, or in vivo.

44. A method of cleaving a precursor C2c2 guide RNA array into two or more C2c2 guide RNAs, the method comprising:

contacting a precursor C2c2 guide RNA array comprising two or more C2c2 guide RNAs each of which has a different guide sequence, with a C2c2 protein, wherein the C2c2 protein cleaves the precursor C2c2 guide RNA array into individual C2c2 guide RNAs.
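The processing step of claim 44 can be pictured with a purely illustrative string model. The sketch below assumes that each unit of the precursor array consists of a shared protein-binding repeat followed by a distinct guide sequence, and that the array is separated at the start of each repeat. The repeat sequence, the guide sequences, and the cut position are hypothetical placeholders; the actual cleavage site used by a C2c2 protein is not modeled here.

```python
def split_guide_array(array, repeat):
    """Split a precursor array into individual repeat+guide units.

    Assumptions (illustrative only): every unit starts with the same
    `repeat` string, the guide follows the repeat, and no guide contains
    the repeat sequence itself.
    """
    if not repeat:
        raise ValueError("repeat must be a non-empty sequence")
    starts = []
    index = array.find(repeat)
    while index != -1:
        starts.append(index)
        index = array.find(repeat, index + len(repeat))
    if len(starts) < 2:
        raise ValueError("expected two or more repeat-guide units")
    ends = starts[1:] + [len(array)]
    return [array[s:e] for s, e in zip(starts, ends)]


# Hypothetical repeat and guide sequences, not taken from the disclosure.
REPEAT = "GACUACC"
precursor = REPEAT + "AAAACCCC" + REPEAT + "GGGGUUUU"
for unit in split_guide_array(precursor, REPEAT):
    print(unit)
```

Each returned unit stands in for one individual C2c2 guide RNA with its own guide sequence.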

45. The method according to claim 44, wherein the C2c2 protein lacks a catalytically active HEPN1 domain and/or lacks a catalytically active HEPN2 domain.

46. The method according to claim 44 or claim 45, wherein the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs that target different target sequences within the same target RNA molecule.

47. The method according to any one of claims 44-46, wherein the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs that target different target RNA molecules.

48. The method according to any one of claims 44-47, wherein said contacting does not take place inside of a cell.

49. The method according to any one of claims 44-48, wherein at least one of the guide RNAs and/or the precursor C2c2 guide RNA array is detectably labeled.

50. A kit for detecting a target RNA in a sample comprising a plurality of RNAs, the kit comprising:

(a) a precursor C2c2 guide RNA array, and/or a nucleic acid encoding said precursor C2c2 guide RNA array, wherein the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs each of which has a different guide sequence and/or an insertion site for a guide sequence of choice; and

(b) a C2c2 protein.

51. The kit of claim 50, wherein the C2c2 protein lacks a catalytically active HEPN1 domain and/or lacks a catalytically active HEPN2 domain.

52. The kit of claim 50 or claim 51, wherein the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs that target different target sequences within the same target RNA molecule.

53. The kit of any one of claims 50-52, wherein the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs that target different target RNA molecules.

54. The kit of any one of claims 50-53, wherein at least one of the guide RNAs and/or the precursor C2c2 guide RNA array is detectably labeled.

55. The kit of any one of claims 50-54, further comprising a labeled detector RNA comprising a fluorescence-emitting dye pair.

56. A kit for detecting a target RNA in a sample comprising a plurality of RNAs, the kit comprising:

(a) a labeled detector RNA comprising a fluorescence-emitting dye pair; and

(b) a C2c2 protein.

57. The kit of claim 56, comprising a positive control target RNA.

58. The kit of claim 57, wherein the positive control target RNA is present in different amounts in each of two or more containers.

59. The kit of any one of claims 56-58, comprising at least one of:

(c) a C2c2 guide RNA and/or a nucleic acid encoding said C2c2 guide RNA;

(d) a precursor C2c2 guide RNA and/or a nucleic acid encoding said precursor C2c2 guide RNA; and

(e) a precursor C2c2 guide RNA array, and/or a nucleic acid encoding said precursor C2c2 guide RNA array, wherein the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs each of which has a different guide sequence and/or an insertion site for a guide sequence of choice.

60. The kit of any one of claims 56-59, comprising a DNA comprising a nucleotide sequence that encodes a C2c2 guide RNA with or without a guide sequence.

61. The kit of claim 60, wherein the DNA comprises an insertion sequence for the insertion of a guide sequence.

62. The kit of claim 60 or claim 61, wherein the DNA is an expression vector and the C2c2 guide RNA is operably linked to a promoter.

63. The kit of claim 62, wherein the promoter is a T7 promoter.

64. The kit of any one of claims 56-63, comprising a C2c2 endoribonuclease variant that lacks nuclease activity.

65. The kit of any one of claims 56-64, wherein the labeled detector RNA comprises a FRET pair.

66. The kit of any one of claims 56-65, wherein the labeled detector RNA comprises a quencher/fluor pair.

67. The kit of any one of claims 56-66, wherein the labeled detector RNA comprises a FRET pair that produces a first detectable signal and a quencher/fluor pair that produces a second detectable signal.

68. A variant C2c2 polypeptide comprising:

a) an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of: i) amino acids R472 and H477; ii) amino acids R1048 and H1053; or iii) amino acids R472, H477, R1048, and H1053;

b) an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises substitution of: i) amino acids R445 and H450; ii) amino acids R1016 and H1021; or iii) amino acids R445, H450, R1016, and H1021;

c) an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises substitution of: i) amino acids R464 and H469; ii) amino acids R1052 and H1057; or iii) amino acids R464, H469, R1052, and H1057;

d) an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises substitution of: i) amino acids R467 and H472; ii) amino acids R1069 and H1074; or iii) amino acids R467, H472, R1069, and H1074; or

e) an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises substitution of: i) amino acids R472 and H477; ii) amino acids R1044 and H1049; or iii) amino acids R472, H477, R1044, and H1049.

69. A variant C2c2 polypeptide of claim 68, wherein the variant C2c2 polypeptide has reduced or undetectable cleavage of ssRNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

70. A nucleic acid comprising a nucleotide sequence encoding a variant C2c2 polypeptide of claim 68 or 69.

71. The nucleic acid of claim 70, wherein the nucleotide sequence is operably linked to a constitutive promoter or a regulatable promoter.

72. A recombinant expression vector comprising the nucleic acid of claim 70 or 71.

73. A host cell genetically modified with the nucleic acid of claim 70 or 71, or with the recombinant expression vector of claim 72.

74. The host cell of claim 73, wherein the host cell is a eukaryotic cell.

75. The host cell of claim 73, wherein the host cell is a prokaryotic cell.

76. The host cell of any one of claims 73-75, wherein the host cell is in vitro, ex vivo, or in vivo.

77. A method of detecting at least two different single stranded target RNAs in a sample comprising a plurality of RNAs, the method comprising:

a) contacting the sample with:

(i) a first C2c2 protein that cleaves single stranded RNAs (ssRNAs) that include at least one A;

(ii) a second C2c2 protein that cleaves ssRNAs that include at least one U;

(iii) a first C2c2 guide RNA that comprises a first nucleotide sequence that hybridizes with the first single stranded target RNA and a second nucleotide sequence that binds to the first C2c2 protein; and

(iv) a second C2c2 guide RNA that comprises a first nucleotide sequence that hybridizes with the second single stranded target RNA and a second nucleotide sequence that binds to the second C2c2 protein;

wherein the first C2c2 protein is not activated by the second C2c2 guide RNA, and wherein the first C2c2 protein cleaves ssRNA that includes at least one A, and

wherein the second C2c2 protein is not activated by the first C2c2 guide RNA, and wherein the second C2c2 protein cleaves ssRNA that includes at least one U; and

b) measuring a detectable signal produced by RNA cleavage mediated by the first and the second C2c2 proteins, wherein a first detectable signal is produced upon activation of the first C2c2 protein and a second detectable signal is produced upon activation of the second C2c2 protein, wherein detection of the first signal indicates the presence in the sample of the first target ssRNA, and wherein detection of the second signal indicates the presence in the sample of the second target ssRNA.
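The two-signal readout of claim 77 can be sketched as a simple calling rule. The sketch assumes that each C2c2 protein acts on its own reporter (one containing A, one containing U), that each reporter is read in a separate channel, and that a target is called present when its channel exceeds a no-target background by a chosen fold. The function name, the fold cutoff, and the numbers are hypothetical and are not part of the claimed method.

```python
def call_targets(first_signal, second_signal,
                 first_background, second_background, min_fold=3.0):
    """Call presence of two target ssRNAs from two detection channels.

    Assumptions: `first_signal` reports cleavage by the first C2c2
    protein and `second_signal` reports cleavage by the second, with no
    cross-activation between them. `min_fold` is an arbitrary
    illustrative cutoff over the no-target background.
    """
    if first_background <= 0 or second_background <= 0:
        raise ValueError("background signals must be positive")
    return {
        "first_target_detected": first_signal / first_background >= min_fold,
        "second_target_detected": second_signal / second_background >= min_fold,
    }


# Hypothetical fluorescence readings (arbitrary units).
print(call_targets(900.0, 110.0, 100.0, 100.0))
```

Because the claim requires that neither protein is activated by the other's guide RNA, each channel can be interpreted independently in this simplified picture.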

78. The method of claim 77, wherein:

a) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Casl3a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Hhe Casl3a amino acid sequence depicted in FIG. 56K;

b) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Casl3a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rca Casl3a amino acid sequence depicted in FIG. 56G;

c) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Casl3a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ppr Casl3a amino acid sequence depicted in FIG. 56B;

d) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Casl3a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lne Casl3a amino acid sequence depicted in FIG. 561;

e) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Casl3a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lbu Casl3a amino acid sequence depicted in FIG. 56C; f) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Casl3a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lwa Casl3a amino acid sequence depicted in FIG. 56E;

g) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Casl3a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh Casl3a amino acid sequence depicted in FIG. 56D;

h) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Casl3a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Hhe Casl3a amino acid sequence depicted in FIG. 56K;

i) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Casl3a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rca Casl3a amino acid sequence depicted in FIG. 56G;

j) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Casl3a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ppr Casl3a amino acid sequence depicted in FIG. 56B;

k) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Casl3a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lne Casl3a amino acid sequence depicted in FIG. 561;

1) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Casl3a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lbu Casl3a amino acid sequence depicted in FIG. 56C;

m) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Casl3a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lwa Casl3a amino acid sequence depicted in FIG. 56E;

n) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Casl3a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh Casl3a amino acid sequence depicted in FIG. 56D;

o) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lse Cas13a amino acid sequence depicted in FIG. 56A;

p) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Hhe Cas13a amino acid sequence depicted in FIG. 56K;

q) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rca Cas13a amino acid sequence depicted in FIG. 56G;

r) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ppr Cas13a amino acid sequence depicted in FIG. 56B;

s) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lne Cas13a amino acid sequence depicted in FIG. 56I;

t) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lbu Cas13a amino acid sequence depicted in FIG. 56C;

u) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lwa Cas13a amino acid sequence depicted in FIG. 56E;

v) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh Cas13a amino acid sequence depicted in FIG. 56F; or

w) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lse Cas13a amino acid sequence depicted in FIG. 56A.

79. The method according to claim 77 or 78, wherein the method comprises contacting the sample with:

i) a first labeled detector RNA comprising a first fluorescence-emitting dye pair, where the first labeled detector RNA comprises at least one A and does not comprise U; and

ii) a second labeled detector RNA comprising a second fluorescence-emitting dye pair, where the second labeled detector RNA comprises at least one U and does not comprise A,

wherein the first C2c2 protein cleaves the first labeled detector RNA, and the first detectable signal is produced by the first fluorescence-emitting dye pair, and

wherein the second C2c2 protein cleaves the second labeled detector RNA, and the second detectable signal is produced by the second fluorescence-emitting dye pair.

80. The method according to claim 79, wherein the first labeled detector RNA comprises a stretch of from 2 to 15 consecutive As and/or the second labeled detector RNA comprises a stretch of from 2 to 15 consecutive Us.

81. The method according to claim 79, wherein the first labeled detector RNA comprises a stretch of from 4 to 15 consecutive As and/or the second labeled detector RNA comprises a stretch of from 4 to 15 consecutive Us.

82. The method according to claim 79, wherein the first labeled detector RNA comprises a stretch of at least 3 consecutive As and/or the second labeled detector RNA comprises a stretch of at least 3 consecutive Us.

83. The method according to claim 79, wherein the first labeled detector RNA comprises a stretch of at least 4 consecutive As and/or the second labeled detector RNA comprises a stretch of at least 4 consecutive Us.

84. A kit comprising:

(a) a first labeled detector RNA that lacks U and comprises at least one A and comprises a first fluorescence-emitting dye pair;

(b) a second labeled detector RNA that lacks A and comprises at least one U and comprises a second fluorescence-emitting dye pair;

(c) a first C2c2 protein, and/or a nucleic acid encoding said first C2c2 protein, wherein the first C2c2 protein can cleave the first labeled detector RNA but not the second labeled detector RNA; and

(d) a second C2c2 protein, and/or a nucleic acid encoding said second C2c2 protein, wherein the second C2c2 protein can cleave the second labeled detector RNA but not the first labeled detector RNA.

85. The kit of claim 84, comprising at least one of:

(e) a first C2c2 guide RNA and/or a nucleic acid encoding said first C2c2 guide RNA, wherein the first C2c2 guide RNA comprises a constant region sequence that binds to the first C2c2 protein;

(f) a second C2c2 guide RNA and/or a nucleic acid encoding said second C2c2 guide RNA, wherein the second C2c2 guide RNA comprises a constant region sequence that binds to the second C2c2 protein;

(g) a nucleic acid comprising a nucleotide sequence encoding a constant region sequence that binds to the first C2c2 protein and an insertion site for a guide sequence of choice;

(h) a nucleic acid comprising a nucleotide sequence encoding a constant region sequence that binds to the second C2c2 protein and an insertion site for a guide sequence of choice.

86. The kit of claim 84, comprising a nucleic acid comprising a nucleotide sequence encoding a first C2c2 guide RNA, wherein the first C2c2 guide RNA comprises a constant region sequence that binds to the first C2c2 protein.

87. The kit of claim 84, comprising a nucleic acid comprising a nucleotide sequence encoding a second C2c2 guide RNA, wherein the second C2c2 guide RNA comprises a constant region sequence that binds to the second C2c2 protein.

88. The kit of claim 84, comprising a nucleic acid comprising a nucleotide sequence encoding a constant region sequence that binds to the first C2c2 protein and an insertion site for a guide sequence of choice.

89. The kit of claim 84, comprising a nucleic acid comprising a nucleotide sequence encoding a constant region sequence that binds to the second C2c2 protein and an insertion site for a guide sequence of choice.

90. The kit of any one of claims 86-89, wherein the nucleic acid is an expression vector and the nucleotide sequence is operably linked to a promoter.
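The composition constraints recited for the labeled detector RNAs in claims 79 to 83 (a first detector RNA containing A but no U, a second containing U but no A, each optionally with a minimum stretch of consecutive As or Us) can be illustrated with a short check. The following is a minimal sketch in Python and forms no part of the claimed subject matter; the function names and the example sequences in the usage note are hypothetical, and a recited range such as "from 2 to 15" is assumed here to require only the minimum run length.

```python
def longest_run(seq: str, base: str) -> int:
    """Return the length of the longest stretch of consecutive `base` in `seq`."""
    best = current = 0
    for nt in seq.upper():
        current = current + 1 if nt == base else 0
        best = max(best, current)
    return best


def is_a_detector(seq: str, min_run: int = 1) -> bool:
    """First detector RNA: no U, and an A-stretch of at least `min_run` (>= 1)."""
    seq = seq.upper()
    return "U" not in seq and longest_run(seq, "A") >= max(min_run, 1)


def is_u_detector(seq: str, min_run: int = 1) -> bool:
    """Second detector RNA: no A, and a U-stretch of at least `min_run` (>= 1)."""
    seq = seq.upper()
    return "A" not in seq and longest_run(seq, "U") >= max(min_run, 1)
```

For example, under these assumptions `is_a_detector("CAAAAC", min_run=4)` and `is_u_detector("GUUUUG", min_run=4)` both hold, whereas a sequence containing both A and U satisfies neither check.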

Description:

METHODS AND COMPOSITIONS FOR DETECTING A TARGET RNA

CROSS-REFERENCE

[0001] This application claims the benefit of U.S. Provisional Patent Application No. 62/351,172, filed June 16, 2016, U.S. Provisional Patent Application No. 62/378,156, filed August 22, 2016, and U.S. Patent Application No. 15/467,922, filed March 23, 2017, which applications are incorporated herein by reference in their entirety.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH

[0002] This invention was made with government support under 1244557 awarded by the National Science Foundation. The government has certain rights in the invention.

INCORPORATION BY REFERENCE OF SEQUENCE LISTING PROVIDED AS A TEXT FILE

[0003] A Sequence Listing is provided herewith as a text file, "BERK-337WO_SeqList_ST25.txt" created on March 21, 2017 and having a size of 67 KB. The contents of the text file are incorporated by reference herein in their entirety.

INTRODUCTION

[0004] Bacterial adaptive immune systems employ CRISPRs (clustered regularly interspaced short palindromic repeats) and CRISPR-associated (Cas) proteins for RNA-guided nucleic acid cleavage. Although generally targeted to DNA substrates, the Type III and Type VI CRISPR systems direct interference complexes against single-stranded RNA (ssRNA) substrates. In Type VI CRISPR systems, the single-subunit C2c2 protein functions as an RNA-guided RNA endonuclease.

[0005] CRISPR-Cas systems confer adaptive immunity in bacteria and archaea via RNA-guided nucleic acid interference. Among the diverse CRISPR types, only the relatively rare Type VI CRISPR systems are believed to target single-stranded RNA substrates exclusively, an activity conferred by the large effector protein C2c2. The Type VI operons share common features of other CRISPR-Cas genomic loci, including CRISPR sequence arrays that serve as repositories of short viral DNA segments. To provide anti-viral immunity, processed CRISPR array transcripts (crRNAs) assemble with Cas protein-containing surveillance complexes that recognize nucleic acids bearing sequence complementarity to the virus-derived segment of the crRNAs, known as the spacer.

[0006] The first step of immune surveillance requires processing of precursor crRNAs (pre-crRNAs), consisting of repeat sequences flanking viral spacer sequences, into individual functional crRNAs that each contain a single virally-derived sequence segment. CRISPR systems employ a variety of mechanisms to produce mature crRNAs, including the use of dedicated endonucleases (e.g., Cas6 or Cas5d in Type I and III systems), coupling of a host endonuclease (e.g., RNase III) with a trans-activating crRNA (tracrRNA, Type II systems), or a ribonuclease activity endogenous to the effector enzyme itself (e.g., Cpf1, from Type V systems).

SUMMARY

[0007] The present disclosure provides methods for detecting a single-stranded target RNA. The present disclosure provides methods of cleaving a precursor C2c2 guide RNA array into two or more C2c2 guide RNAs. The present disclosure provides a kit for detecting a target RNA in a sample.

[0008] Provided are compositions and methods for detecting a single stranded target RNA, where the methods include (i) contacting a sample having a plurality of RNAs with (a) a C2c2 guide RNA that hybridizes with the single stranded target RNA, and (b) a C2c2 protein that cleaves RNAs of the sample; and (ii) measuring a detectable signal produced by the cleavage. Once a subject C2c2 protein is activated by a C2c2 guide RNA, which occurs when the sample includes a single stranded target RNA to which the guide RNA hybridizes (i.e., the sample includes the targeted single stranded target RNA), the C2c2 protein becomes an endoribonuclease that cleaves RNAs of the sample. Thus, when the targeted single stranded target RNA is present in the sample (e.g., in some cases above a threshold amount), the result is cleavage of RNA in the sample, which can be detected using any convenient detection method (e.g., using a labeled detector RNA).

[0009] In some cases, two or more C2c2 guide RNAs can be provided by using a precursor C2c2 guide RNA array, which can be cleaved by the C2c2 protein into individual guide RNAs, and this is independent of whether the C2c2 protein has intact HEPN1 and/or HEPN2 domains. Thus, also provided are methods of cleaving a precursor C2c2 guide RNA array into two or more C2c2 guide RNAs. In some cases, the C2c2 protein lacks a catalytically active HEPN1 domain and/or lacks a catalytically active HEPN2 domain.

BRIEF DESCRIPTION OF THE DRAWINGS

[0010] FIG. 1A-1E present schematics related to an endogenous C2c2 locus and experiments demonstrating heterologous expression and purification of recombinant C2c2 protein.

[0011] FIG. 2 depicts results from cleavage assays that were performed using C2c2 protein.

[0012] FIG. 3 depicts results showing that C2c2 robustly cleaved single stranded RNA.

[0013] FIG. 4A-4B present a diagram and results from various cleavage assays using C2c2 protein.

[0014] FIG. 5 depicts results from experiments performed using a labeled detector RNA (using a quencher/fluor pair) to detect RNA cleavage by C2c2 protein.

[0015] FIG. 6 depicts results from experiments that demonstrate pre-crRNA processing by C2c2 protein.

[0016] FIG. 7A-7B depict a schematic and sequences of example C2c2 guide RNAs.

[0017] FIG. 8A-8F provide amino acid sequences of various C2c2 polypeptides.

[0018] FIG. 9A-9C depict C2c2 family processing of precursor crRNA transcripts to generate mature crRNAs.

[0019] FIG. 10A-10C depict the effect of structure and sequence of CRISPR repeats on LbuC2c2-mediated crRNA biogenesis.

[0020] FIG. 11A-11C depict guide-dependent ssRNA degradation of cis and trans targets by LbuC2c2.

[0021] FIG. 12A-12D depict two distinct ribonuclease activities of LbuC2c2.

[0022] FIG. 13A-13D depict C2c2-mediated sensitive visual detection of transcripts in complex mixtures.

[0023] FIG. 14 provides Table 2: various DNA substrates used in the studies described in Examples 8-12 and shown in FIG. 12A-12D.

[0024] FIG. 15 provides Table 3: various RNA substrates used in the studies described in Examples 8-12.

[0025] FIG. 16A-16F depict data showing that pre-crRNA processing by C2c2 is spacer sequence independent, can occur on tandem crRNA arrays, is affected by mutations in the 5' and/or 3' flanking region of the pre-crRNA, and is metal independent.

[0026] FIG. 17 provides a summary of the effect of pre-crRNA double mutations on pre-crRNA processing activity.

[0027] FIG. 18A-18B depict LbuC2c2 ssRNA target cleavage site mapping.

[0028] FIG. 19A-19C depict dependence of crRNA spacer length, reaction temperature, and 5'-end sequence of crRNA on target RNA cleavage efficiency.

[0029] FIG. 20A-20C depict binding data for LbuC2c2 to mature crRNA and target ssRNA.

[0030] FIG. 21 depicts an RNase detection assay 2-ssRNA time course.

[0031] FIG. 22A-22B depict a phylogenetic tree of C2c2 family and C2c2 alignment.

[0032] FIG. 23A-23D depict purification and production of C2c2.

[0033] FIG. 24A-24C depict data showing that C2c2 proteins process precursor crRNA transcripts to generate mature crRNAs. a, Maximum-likelihood phylogenetic tree of C2c2 proteins. Homologs used in this study are highlighted in yellow. b, Diagram of the three Type VI CRISPR loci used in this study. Black rectangles denote repeat elements; yellow diamonds denote spacer sequences. Cas1 and Cas2 are only found in the genomic vicinity of LshC2c2. c, C2c2-mediated cleavage of pre-crRNA derived from the LbuC2c2, LseC2c2 and LshC2c2 CRISPR repeat loci. OH: alkaline hydrolysis ladder; T1: RNase T1 hydrolysis ladder; processing cleavage reactions were performed with 100 nM C2c2 and <1 nM pre-crRNA. Schematic of cleavage is depicted on right, and predicted pre-crRNA secondary structures are diagrammed below, with arrows indicating the mapped C2c2 cleavage sites.

[0034] FIG. 25A-25C depict data showing that LbuC2c2-mediated crRNA biogenesis depends on both structure and sequence of CRISPR repeats. a, Representative cleavage assay by LbuC2c2 on pre-crRNAs containing structural mutations within the stem and loop regions of the hairpin. Processed percentages listed below are quantified at 60 min (mean ± s.d., n = 3). b, Bar graph showing the dependence of pre-crRNA processing on the CRISPR repeat sequence. The wild-type repeat sequence is shown below with individual bars representing tandem nucleotide mutations as noted in red. The cleavage site is indicated by cartoon scissors. Percentage processed was measured after 60 min (mean ± s.d., n = 3). Diagrammed hairpins of tested mutants can be found in Extended Data Figs. 3-4. c, Divalent metal ion dependence of the crRNA processing reaction was tested by addition of 10-50 mM EDTA and EGTA to standard reaction conditions.

[0035] FIG. 26A-26D depict data showing that LbuC2c2 contains two distinct ribonuclease activities. a, Quantified time-course data of cis ssRNA target (black) and pre-crRNA (teal) cleavage by LbuC2c2 performed at 37 °C. Exponential fits are shown as solid lines (n = 3), and the calculated pseudo-first-order rate constants (k_obs) (mean ± s.d.) are 9.74 ± 1.15 min⁻¹ and 0.12 ± 0.02 min⁻¹ for cis ssRNA target and pre-crRNA cleavage, respectively. b, LbuC2c2 architecture depicting the location of HEPN motifs and processing-deficient point mutant. c, d, Ribonuclease activity of LbuC2c2 mutants for pre-crRNA processing in c and ssRNA targeting in d and Extended Data Fig. 6d.
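To give a sense of scale for the two rate constants reported for FIG. 26A, the following minimal Python sketch converts a pseudo-first-order rate constant into a half-life and a predicted fraction cleaved. It assumes a single-exponential model, f(t) = A(1 - exp(-k_obs t)), with an amplitude A that defaults to 1; the legend states only that exponential fits were used, so the exact fit form, the amplitude, and the function names are assumptions made for illustration.

```python
import math


def fraction_cleaved(k_obs_per_min: float, t_min: float, amplitude: float = 1.0) -> float:
    """Predicted fraction of substrate cleaved at time t under a single-exponential model."""
    return amplitude * (1.0 - math.exp(-k_obs_per_min * t_min))


def half_life_min(k_obs_per_min: float) -> float:
    """Time (in minutes) at which half of the reactive substrate has been cleaved."""
    return math.log(2) / k_obs_per_min


# Mean rate constants taken from the legend of FIG. 26A (min^-1).
K_OBS_CIS_TARGET = 9.74
K_OBS_PRE_CRRNA = 0.12

if __name__ == "__main__":
    for label, k in (("cis ssRNA target", K_OBS_CIS_TARGET), ("pre-crRNA", K_OBS_PRE_CRRNA)):
        print(f"{label}: half-life = {half_life_min(k):.3f} min")
```

Under this model the reported constants correspond to half-lives of roughly 0.07 min for cis ssRNA target cleavage and roughly 5.8 min for pre-crRNA cleavage, which is consistent with the approximately 80-fold ratio of the two rate constants.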

[0036] FIG. 27A-27E show that C2c2 provides sensitive detection of transcripts in complex mixtures. a, Illustration of LbuC2c2 RNA detection approach using a quenched fluorescent RNA reporter. b, Quantification of fluorescence signal generated by LbuC2c2 after 30 min for varying concentrations of target RNA in the presence of human total RNA. RNase A shown as positive RNA degradation control (mean ± s.d., n = 3). c, Quantification of fluorescence signal generated by LbuC2c2 loaded with a β-actin targeting crRNA after 3 h for varying amounts of human total RNA or bacterial total RNA (as a β-actin null negative control) (mean ± s.d., n = 3). d, Tandem pre-crRNA processing also enables RNA detection (mean ± s.d., n = 3). e, Model of the Type VI CRISPR pathway highlighting both of C2c2's ribonuclease activities.

[0037] FIG. 28A-28B depict a complete phylogenetic tree of C2c2 family and C2c2 alignment. a, Maximum-likelihood phylogenetic reconstruction of C2c2 proteins. Leaves include GI protein numbers and organism of origin; bootstrap support values, out of 100 resamplings, are presented for inner split. Scale is in substitutions per site. b, Multiple sequence alignment of the three analyzed homologs of C2c2; coordinates are based on LbuC2c2.

[0038] FIG. 29A-29D depict data related to purification and production of C2c2. All C2c2 homologs were expressed in E. coli as His-MBP fusions and purified by a combination of affinity, ion exchange and size exclusion chromatography. The Ni⁺ affinity tag was removed by incubation with TEV protease. Representative SDS-PAGE gels of chromatography fractions are shown in (a, b). c, The chromatogram from Superdex 200 (16/60) column demonstrating that C2c2 elutes as a single peak, devoid of nucleic acid. d, SDS-PAGE analysis of purified proteins used in this manuscript.

[0039] FIG. 30A-30I depict mapping of pre-crRNA processing by C2c2 in vitro and in vivo. a, Cleavage site mapping of LseC2c2 and LshC2c2 cleavage of a single cognate pre-crRNA array. OH: alkaline hydrolysis ladder; T1: T1 RNase hydrolysis ladder. Cleavage reactions were performed with 100 nM C2c2 and <1 nM pre-crRNA. b-i, Re-analysis of LshC2c2 (b-f) and LseC2c2 (g-i) CRISPR array RNA sequencing experiments from Shmakov et al. (Fig. S7 and Fig. 5, respectively). All reads (b, g) and filtered reads (55 nt or less; as per original Shmakov et al. analysis; c, h) were stringently aligned to each CRISPR array using Bowtie2 (see Methods). Detailed views of individual CRISPR repeat-spacers are shown for Lsh (d-f) and Lse (i). Differences in 5' end pre-crRNA processing are indicated by arrows below each sequence. BAM alignment files of the analysis are available. This mapping clearly indicates that the 5' ends of small RNA sequencing reads generated from Lsh pre-crRNAs map to a position 2 nts from the base of the predicted hairpin, in agreement with the in vitro processing data (a). This pattern holds for all mature crRNAs detected from both native expression in L. shahii and heterologous expression in E. coli. Unfortunately, the LseC2c2 crRNA sequencing data (used in g-i) is less informative due to low read depth, and each aligned crRNA exhibits a slightly different 5' end with little obvious uniformity. The mapping for one of the processed repeats (repeat-spacer 2; i) is in agreement with the data but only with low confidence due to the insufficient read depth.

[0040] FIG. 31A-31D depict that pre-crRNA processing by C2c2 is spacer-sequence independent, can occur on tandem crRNA arrays, is affected by mutations in the 5' flanking region of the pre-crRNA, and produces a 3' phosphate product. a, Cleavage site mapping of LbuC2c2 cleavage of a tandem pre-crRNA array. OH: alkaline hydrolysis ladder; T1: T1 RNase hydrolysis ladder. Cleavage reactions were performed with 100 nM LbuC2c2 and <1 nM pre-crRNA. A schematic of cleavage products is depicted on right, with arrows indicating the mapped C2c2 cleavage products. b, LbuC2c2 4-mer mutant pre-crRNA processing data demonstrating the importance of the 5' single-stranded flanking region for efficient pre-crRNA processing. Percentage of pre-crRNA processing was measured after 60 min (mean ± s.d., n = 3). c, Representative LbuC2c2 pre-crRNA cleavage time-course demonstrating that similar rates of pre-crRNA processing occur independent of crRNA spacer sequence; pseudo-first-order rate constants (k_obs) (mean ± s.d.) are 0.07 ± 0.04 min⁻¹ and 0.08 ± 0.04 min⁻¹ for spacer A and spacer XI, respectively. d, End group analysis of cleaved RNA by T4 polynucleotide kinase (PNK) treatment. Standard processing assay conditions were used to generate cleavage product, which was then incubated with PNK for 1 hr to remove any 2', 3'-cyclic phosphates/3' monophosphates. Retarded migration of band indicates removal of the charged monophosphate from the 3' end of radiolabeled 5' product.

[0041] FIG. 32A-32C show that LbuC2c2 catalyzes guide-dependent ssRNA degradation on cis and trans targets. a, Schematic of the two modes of C2c2 guide-dependent ssRNA degradation. b, Cleavage of two distinct radiolabeled ssRNA substrates, A and B, by LbuC2c2. Complexes of 100 nM C2c2 and 50 nM crRNA were pre-formed at 37 °C, and reaction was initiated upon addition of <1 nM 5'-labeled target RNA at 25 °C. Trans cleavage reactions contained equimolar (<1 nM) concentrations of radiolabeled non-guide-complementary substrate and unlabeled on-target ssRNA. For multiple ssRNA substrates, it was observed that LbuC2c2 catalyzed efficient cleavage only when bound to the complementary crRNA, indicating that LbuC2c2:crRNA cleaves ssRNA in an RNA-guided fashion. This activity is hereafter referred to as on-target or cis-target cleavage. LbuC2c2-mediated cis cleavage resulted in a laddering of multiple products, with cleavage preferentially occurring before uracil residues, analogous to LshC2c2⁹. Non-target cleavage reactions were repeated in the presence of unlabeled, on-target (crRNA-complementary) ssRNA. In contrast to non-target cleavage experiments performed in cis, rapid degradation of non-target RNA in trans was observed. The similar RNA cleavage rates and near identical cleavage products observed for both cis on-target cleavage and trans non-target cleavage implicate the same nuclease center in both activities. c, LbuC2c2 loaded with crRNA targeting spacer A was tested for cleavage activity under both cis (target A labeled) and trans (target B labeled in the presence of unlabeled target A) cleavage conditions in the presence of 25 mM EDTA.

[0042] FIG. 33A-33B show LbuC2c2 ssRNA target cleavage site mapping. a, ssRNA target cleavage assay conducted per Methods demonstrating LbuC2c2-mediated 'cis'-cleavage of several radiolabeled ssRNA substrates with identical spacer-complementary sequences but distinct 5' flanking sequences of variable length and nucleotide composition. Sequences of ssRNA substrates are shown to the right with spacer-complementary sequences for crRNA-A highlighted in yellow. Arrows indicate detected cleavage sites. Gel was cropped for clarity. It should be noted that the pattern of cleavage products produced on different substrates (e.g., A.1 vs. A.2 vs. A.3) indicates that the cleavage site choice is primarily driven by a uracil preference and exhibits an apparent lack of exclusive cleavage mechanism within the crRNA-complementary target sequence, which is in contrast to what is observed for other Class II CRISPR single effector complexes such as Cas9 and Cpf1¹¹,²¹. Interestingly, the cleavage pattern observed for substrate A.0 hints at a secondary preference for polyG sequences. b, LbuC2c2 ssRNA target cleavage assay as per Methods, using a range of crRNAs that tile the length of the ssRNA target. The sequence of the ssRNA substrates used in this experiment is shown below the gel with spacer-complementary sequences for each crRNA highlighted in yellow. Arrows indicate predicted cleavage sites. Above each set of lanes, a small diagram indicates the location of the spacer sequence along the target (yellow box) and the cleavage products observed (red arrows) or absent (black arrows). Likewise, it should be noted that for every crRNA the cleavage product length distribution is very similar, again indicating an apparent lack of exclusive cleavage within the crRNA-bound sequence. The absence of several cleavage products in a subset of the reactions might be explained by the presence of bound C2c2:crRNA on the ssRNA target, which could sterically occlude access to uracils by any cis (intramolecular) or trans (intermolecular) LbuC2c2 active sites. While proper analysis for protospacer flanking site (PFS) preference for LbuC2c2 is beyond the scope of this study, minimal impact of the 3' flanking nucleotide was observed. The expected PFS base is noted in red in the diagram next to each guide tested.

[0043] FIG. 34A-34D depict dependence of RNA targeting on crRNA variants, temperature and point mutations. a, LbuC2c2 ssRNA target cleavage assay carried out as per Methods with crRNAs possessing 16-nt, 20-nt or 24-nt spacers. b, LbuC2c2 ssRNA target cleavage time-course carried out at either 25 °C or 37 °C as per Methods. c, LbuC2c2 ssRNA target cleavage time-course carried out as per Methods with crRNAs possessing different 5'-flanking nucleotide mutations. Mutations are highlighted in red. 1-2 nucleotide 5' extensions negligibly impacted cleavage efficiencies. In contrast, shortening the flanking region to 3 nts slowed cleavage rates. d, Impact of point mutations on ribonuclease activity of C2c2 in conserved residue mutants within HEPN motifs for ssRNA targeting.

[0044] FIG. 35A-35D depict binding data for LbuC2c2 to mature crRNA and target ssRNA. a, Filter binding assays were conducted as described in the Methods to determine the binding affinity of mature crRNA-A_GG to LbuC2c2-WT, LbuC2c2-dHEPN1, LbuC2c2-dHEPN2, or LbuC2c2-dHEPN1/dHEPN2. The quantified data were fit to standard binding isotherms. Error bars represent the standard deviation from three independent experiments. Measured dissociation constants from three independent experiments (mean ± sd) were 27.1 ± 7.5 nM (LbuC2c2-WT), 15.2 ± 3.2 nM (LbuC2c2-dHEPN1), 11.5 ± 2.5 nM (LbuC2c2-dHEPN2), and 43.3 ± 11.5 nM (LbuC2c2-dHEPN1/dHEPN2). b, Representative electrophoretic mobility shift assay for binding reactions between LbuC2c2-dHEPN1/dHEPN2:crRNA-A_GG and either 'on-target' A ssRNA or 'off-target' B ssRNA, as indicated. Three independent experiments were conducted as described in the Methods. The gel was cropped for clarity. c, Quantified binding data from (b) were fitted to standard binding isotherms. Error bars represent the standard deviation from three independent experiments. Measured dissociation constants from three independent experiments (mean ± sd) were 1.62 ± 0.43 nM for ssRNA A and N.D. (≫10 nM) for ssRNA B. d, Filter binding assays were conducted as described in the Methods to determine the binding affinity of mature crRNA-A_GA to LbuC2c2-WT and LbuC2c2-R1079A. The quantified data were fit to standard binding isotherms. Error bars represent the standard deviation from three independent experiments. Measured dissociation constants from three independent experiments (mean ± sd) were 4.65 ± 0.6 nM (LbuC2c2-WT) and 2.52 ± 0.5 nM (LbuC2c2-R1079A). It is of note that these binding affinities differ from those in panel a. This difference is accounted for by a slight difference in the 5' sequence of the guide, with panel a guides beginning with 5'-GGCCA... and panel d guides beginning with 5'-GACCA. While the native sequence guide (5'-GACCA) binds more tightly to LbuC2c2, no difference is seen in the RNA targeting efficiencies of these guide variants (Extended Data Fig. 6c).
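The dissociation constants above can be related to expected occupancy with a simple calculation. The sketch below is a minimal Python illustration that assumes a hyperbolic single-site binding isotherm, fraction bound = [P] / (Kd + [P]), valid when the labeled RNA is present at a concentration well below Kd; the legend says only that "standard binding isotherms" were used, so this functional form and the function name are assumptions rather than the fitting procedure actually applied.

```python
def fraction_bound(protein_nM: float, kd_nM: float) -> float:
    """Fraction of trace RNA bound at a given protein concentration (single-site model)."""
    if protein_nM < 0 or kd_nM <= 0:
        raise ValueError("protein concentration must be >= 0 and Kd must be > 0")
    return protein_nM / (kd_nM + protein_nM)


# Mean dissociation constants (nM) reported in the legend of FIG. 35A and 35D.
KD_WT_GG_GUIDE = 27.1  # crRNA-A_GG with LbuC2c2-WT (panel a)
KD_WT_GA_GUIDE = 4.65  # crRNA-A_GA with LbuC2c2-WT (panel d)

if __name__ == "__main__":
    for label, kd in (("GG guide", KD_WT_GG_GUIDE), ("GA guide", KD_WT_GA_GUIDE)):
        print(f"{label}: fraction bound at 10 nM protein = {fraction_bound(10.0, kd):.2f}")
```

By construction, the fraction bound equals one half when the protein concentration equals Kd, so the lower Kd of the native-sequence (GA) guide corresponds to higher occupancy at any given protein concentration.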

[0045] FIG. 36A-36B depict an RNase detection assay 2-ssRNA time-course. a, LbuC2c2:crRNA- 2 was incubated with RNaseAlert substrate (Thermo-Fisher) and 100 ng HeLa total RNA in the presence of increasing amounts of XI ssRNA (0-1 nM) for 120 min at 37 °C. Fluorescence measurements were taken every 5 min. The 1 nM XI ssRNA reaction reached saturation before the first time point could be measured. Error bars represent the standard deviation from three independent experiments. b, LbuC2c2:crRNA-λ4 or apo LbuC2c2 was incubated in HeLa total RNA for 2 hours in the presence or absence of on-target activating λ4 ssRNA. Degradation of background small RNA was resolved on a small RNA chip in a Bioanalyzer 2100 as per Methods. Small differences are seen in the fragment profile between apo LbuC2c2 and LbuC2c2:crRNA-λ4. In contrast, upon addition of the on-target ssRNA to the reaction, a drastic broadening and shifting of the tRNA peak reveals extensive degradation of other structured and nonstructured RNAs present in the reaction upon activation of LbuC2c2 trans activity.

[0046] FIG. 37 depicts cleavage experiments demonstrating severely reduced cleavage of precursor guide RNA (guide RNA processing) by LbuC2c2 when the protein includes a mutation at any amino acid position selected from R1079 (e.g., R1079A), R1072 (e.g., R1072A), and K1082 (e.g., K1082A).

[0047] FIG. 38A-38C depict conservation of pre-crRNA processing within the Cas13a family.

[0048] FIG. 39A-39C depict CRISPR loci and crRNA repeat architecture for Cas13a homologs.

[0049] FIG. 40A-40F depict residues important for pre-crRNA cleavage by LbuCas13a.

[0050] FIG. 41A-41B depict alignments of Helical 1 and HEPN domains of Cas13a family members.

[0051] FIG. 42A-42D depict efficiencies of ssRNA by members of the Cas13a family.

[0052] FIG. 43A-43D depict trans-ssRNA cleavage by Cas13a homologs.

[0053] FIG. 44A-44F depict crRNA exchangeability within the Cas13a family.

[0054] FIG. 45A-45C depict functional validation of orthogonal Cas13a subfamilies for RNA detection.

[0055] FIG. 46A-46D depict crRNA array processing by wild-type (WT) LbuCas13a and LbuCas13a R1079A/K1080A double mutant.

[0056] FIG. 47A-47C depict trans-cleavage by LbuCas13a point mutants in regions implicated in pre-crRNA processing.

[0057] FIG. 48A-48B depict features of the LbuCas13a R1079A/K1080A double mutant relative to wild-type LbuCas13a.

[0058] FIG. 49 provides Table 4.

[0059] FIG. 50 provides Table 5.

[0060] FIG. 51 provides Table 6.

[0061] FIG. 52 provides Table 7.

[0062] FIG. 53 provides Table 8.

[0063] FIG. 54 provides Table 9.

[0064] FIG. 55A-55B present a model for Type VI-A CRISPR system function.

[0065] FIG. 56A-56K provide amino acid sequences of various Cas13a polypeptides.

[0066] FIG. 57 provides an alignment of amino acid sequences of various Cas13a polypeptides.

DEFINITIONS

[0067] The terms "polynucleotide" and "nucleic acid," used interchangeably herein, refer to a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides. Thus, the terms "polynucleotide" and "nucleic acid" encompass single-stranded DNA; double-stranded DNA; multi-stranded DNA; single-stranded RNA; double-stranded RNA; multi-stranded RNA; genomic DNA; cDNA; DNA-RNA hybrids; and a polymer comprising purine and pyrimidine bases or other natural, chemically or biochemically modified, non-natural, or derivatized nucleotide bases.

[0068] The term "oligonucleotide" refers to a polynucleotide of between 3 and 100 nucleotides of single- or double-stranded nucleic acid (e.g., DNA, RNA, or a modified nucleic acid). However, for the purposes of this disclosure, there is no upper limit to the length of an oligonucleotide. Oligonucleotides are also known as "oligomers" or "oligos" and can be isolated from genes, transcribed (in vitro and/or in vivo), or chemically synthesized. The terms "polynucleotide" and "nucleic acid" should be understood to include, as applicable to the embodiments being described, single-stranded (such as sense or antisense) and double-stranded polynucleotides.

[0069] By "hybridizable" or "complementary" or "substantially complementary" it is meant that a nucleic acid (e.g. RNA, DNA) comprises a sequence of nucleotides that enables it to non-covalently bind, i.e. form Watson-Crick base pairs and/or G/U base pairs, "anneal", or "hybridize," to another nucleic acid in a sequence-specific, antiparallel manner (i.e., a nucleic acid specifically binds to a complementary nucleic acid) under the appropriate in vitro and/or in vivo conditions of temperature and solution ionic strength. Standard Watson-Crick base-pairing includes: adenine/adenosine (A) pairing with thymine/thymidine (T), A pairing with uracil/uridine (U), and guanine/guanosine (G) pairing with cytosine/cytidine (C). In addition, for hybridization between two RNA molecules (e.g., dsRNA), and for hybridization of a DNA molecule with an RNA molecule (e.g., when a DNA target nucleic acid base pairs with a C2c2 guide RNA, etc.), G can also base pair with U. For example, G/U base-pairing is partially responsible for the degeneracy (i.e., redundancy) of the genetic code in the context of tRNA anti-codon base-pairing with codons in mRNA. Thus, in the context of this disclosure, a G (e.g., of a protein-binding segment (dsRNA duplex) of a C2c2 guide RNA molecule; of a target nucleic acid base pairing with a C2c2 guide RNA) is considered complementary to both a U and a C. For example, when a G/U base-pair can be made at a given nucleotide position of a protein-binding segment (e.g., dsRNA duplex) of a C2c2 guide RNA molecule, the position is not considered to be non-complementary, but is instead considered to be complementary.

[0070] Hybridization and washing conditions are well known and exemplified in Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor (1989), particularly Chapter 11 and Table 11.1 therein; and Sambrook, J. and Russell, W., Molecular Cloning: A Laboratory Manual, Third Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor (2001). The conditions of temperature and ionic strength determine the "stringency" of the hybridization.

[0071] Hybridization requires that the two nucleic acids contain complementary sequences, although mismatches between bases are possible. The conditions appropriate for hybridization between two nucleic acids depend on the length of the nucleic acids and the degree of complementarity, variables well known in the art. The greater the degree of complementarity between two nucleotide sequences, the greater the value of the melting temperature (Tm) for hybrids of nucleic acids having those sequences. For hybridizations between nucleic acids with short stretches of complementarity (e.g. complementarity over 35 or fewer, 30 or fewer, 25 or fewer, 22 or fewer, 20 or fewer, or 18 or fewer nucleotides) the position of mismatches can become important (see Sambrook et al., supra, 11.7-11.8). Typically, the length for a hybridizable nucleic acid is 8 nucleotides or more (e.g., 10 nucleotides or more, 12 nucleotides or more, 15 nucleotides or more, 20 nucleotides or more, 22 nucleotides or more, 25 nucleotides or more, or 30 nucleotides or more). The temperature and wash solution salt concentration may be adjusted as necessary according to factors such as length of the region of complementation and the degree of complementation.

[0072] It is understood that the sequence of a polynucleotide need not be 100% complementary to that of its target nucleic acid to be specifically hybridizable or hybridizable. Moreover, a polynucleotide may hybridize over one or more segments such that intervening or adjacent segments are not involved in the hybridization event (e.g., a loop structure or hairpin structure). A polynucleotide can comprise 60% or more, 65% or more, 70% or more, 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence complementarity to a target region within the target nucleic acid sequence to which it will hybridize. For example, an antisense nucleic acid in which 18 of 20 nucleotides of the antisense compound are complementary to a target region, and would therefore specifically hybridize, would represent 90 percent complementarity. In this example, the remaining noncomplementary nucleotides may be clustered or interspersed with complementary nucleotides and need not be contiguous to each other or to complementary nucleotides. Percent complementarity between particular stretches of nucleic acid sequences within nucleic acids can be determined using any convenient method. Exemplary methods include BLAST programs (basic local alignment search tools) and PowerBLAST programs (Altschul et al., J. Mol. Biol., 1990, 215, 403-410; Zhang and Madden, Genome Res., 1997, 7, 649-656) or by using the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, Madison Wis.), using default settings, which uses the algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2, 482-489).
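As a purely illustrative aid, the 18-of-20 example above can be reproduced with a short calculation. The sketch below is not one of the programs named in this paragraph (BLAST, PowerBLAST, Gap); it assumes two ungapped RNA sequences of equal length, both written 5' to 3' and paired in antiparallel orientation. The function name, the example sequences, and the option to count G/U pairs as complementary are hypothetical choices made for illustration.

```python
# Illustrative sketch only: ungapped, equal-length, antiparallel comparison.
# All names and sequences here are hypothetical, not taken from the disclosure.

WATSON_CRICK = {("A", "U"), ("U", "A"), ("A", "T"), ("T", "A"),
                ("G", "C"), ("C", "G")}
WOBBLE = {("G", "U"), ("U", "G")}


def percent_complementarity(probe: str, target: str, allow_gu: bool = True) -> float:
    """Return the percentage of probe positions that pair with the target.

    Both sequences are written 5'->3'. Position i of the probe is compared
    with position (n - 1 - i) of the target (antiparallel pairing).
    """
    probe = probe.upper()
    target = target.upper()
    if not probe or len(probe) != len(target):
        raise ValueError("sequences must be non-empty and of equal length")
    allowed = WATSON_CRICK | WOBBLE if allow_gu else WATSON_CRICK
    paired = sum((p, t) in allowed for p, t in zip(probe, reversed(target)))
    return 100.0 * paired / len(probe)


# A 20-nt target and a probe carrying two mismatches (18 of 20 positions pair).
target_region = "AUGGCUAGCUAGGCUAACGU"
probe_two_mismatches = "CAGUUAGCCUAGCUAGCCAU"
print(percent_complementarity(probe_two_mismatches, target_region))  # prints 90.0
```

With `allow_gu=False`, only Watson-Crick pairs are counted, so a position that forms a G/U pair would be scored as a mismatch.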

[0073] The terms "peptide," "polypeptide," and "protein" are used interchangeably herein, and refer to a polymeric form of amino acids of any length, which can include coded and non-coded amino acids, chemically or biochemically modified or derivatized amino acids, and polypeptides having modified peptide backbones.

[0074] "Binding" as used herein (e.g. with reference to an RNA-binding domain of a polypeptide, binding to a target nucleic acid, and the like) refers to a non-covalent interaction between macromolecules (e.g., between a protein and a nucleic acid; between a C2c2 guide RNA complex and a target nucleic acid; and the like). While in a state of non-covalent interaction, the macromolecules are said to be "associated" or "interacting" or "binding" (e.g., when a molecule X is said to interact with a molecule Y, it is meant the molecule X binds to molecule Y in a non-covalent manner). Not all components of a binding interaction need be sequence-specific (e.g., contacts with phosphate residues in a DNA backbone), but some portions of a binding interaction may be sequence-specific. Binding interactions are generally characterized by a dissociation constant (K_d) of less than 10⁻⁶ M, less than 10⁻⁷ M, less than 10⁻⁸ M, less than 10⁻⁹ M, less than 10⁻¹⁰ M, less than 10⁻¹¹ M, less than 10⁻¹² M, less than 10⁻¹³ M, less than 10⁻¹⁴ M, or less than 10⁻¹⁵ M. "Affinity" refers to the strength of binding, increased binding affinity being correlated with a lower K_d.
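The inverse relationship between the dissociation constant and affinity can be illustrated numerically. The sketch below assumes a simple one-site (1:1) equilibrium binding model in which the free ligand concentration is in large excess over the binding partner, so that fraction bound = [L] / ([L] + K_d). This model and the function name are assumptions introduced for illustration; they are not taken from the present disclosure.

```python
# Illustrative sketch only: simple 1:1 equilibrium binding, ligand in excess.
# The model and the function name are assumptions, not part of the disclosure.


def fraction_bound(ligand_molar: float, kd_molar: float) -> float:
    """Fraction of binding sites occupied at a given free ligand concentration."""
    if ligand_molar < 0 or kd_molar <= 0:
        raise ValueError("concentration must be >= 0 and K_d must be > 0")
    return ligand_molar / (ligand_molar + kd_molar)


# At a fixed ligand concentration of 1 nM, a lower K_d gives higher occupancy.
for kd in (1e-6, 1e-9, 1e-12):
    print(f"K_d = {kd:.0e} M -> fraction bound = {fraction_bound(1e-9, kd):.4f}")
```

Under this model, half of the sites are occupied when the ligand concentration equals the K_d.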

[0075] By "binding domain" it is meant a protein domain that is able to bind non-covalently to another molecule. A binding domain can bind to, for example, an RNA molecule (an RNA-binding domain) and/or a protein molecule (a protein-binding domain). In the case of a protein having a protein-binding domain, it can in some cases bind to itself (to form homodimers, homotrimers, etc.) and/or it can bind to one or more regions of a different protein or proteins.

[0076] The term "conservative amino acid substitution" refers to the interchangeability in proteins of amino acid residues having similar side chains. For example, a group of amino acids having aliphatic side chains consists of glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains consists of serine and threonine; a group of amino acids having amide-containing side chains consists of asparagine and glutamine; a group of amino acids having aromatic side chains consists of phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains consists of lysine, arginine, and histidine; a group of amino acids having acidic side chains consists of glutamate and aspartate; and a group of amino acids having sulfur-containing side chains consists of cysteine and methionine. Exemplary conservative amino acid substitution groups are: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine-glycine, and asparagine-glutamine.

[0077] A polynucleotide or polypeptide has a certain percent "sequence identity" to another polynucleotide or polypeptide, meaning that, when aligned, that percentage of bases or amino acids are the same, and in the same relative position, when comparing the two sequences. Sequence identity can be determined in a number of different ways. To determine sequence identity, sequences can be aligned using various methods and computer programs (e.g., BLAST, T-COFFEE, MUSCLE, MAFFT, Phyre2, etc.), available over the world wide web at sites including ncbi.nlm.nih.gov/BLAST, ebi.ac.uk/Tools/msa/tcoffee/, ebi.ac.uk/Tools/msa/muscle/, mafft.cbrc.jp/alignment/software/, http://www.sbg.bio.ic.ac.uk/~phyre2/. See, e.g., Altschul et al. (1990), J. Mol. Biol. 215:403-10.

[0078] A DNA sequence that "encodes" a particular RNA is a DNA nucleic acid sequence that is transcribed into RNA. A DNA polynucleotide may encode an RNA (mRNA) that is translated into protein, or a DNA polynucleotide may encode an RNA that is not translated into protein (e.g. tRNA, rRNA, microRNA (miRNA), a "non-coding" RNA (ncRNA), a C2c2 guide RNA, etc.).

[0079] The terms "DNA regulatory sequences," "control elements," and "regulatory elements," used interchangeably herein, refer to transcriptional and translational control sequences, such as promoters, enhancers, polyadenylation signals, terminators, protein degradation signals, and the like, that provide for and/or regulate transcription of a non-coding sequence (e.g., Cas9 guide RNA) or a coding sequence (e.g., Cas9 protein) and/or regulate translation of an encoded polypeptide.

[0080] As used herein, a "promoter sequence" is a DNA regulatory region capable of binding RNA polymerase and initiating transcription of a downstream (3' direction) coding or non-coding sequence. For purposes of the present disclosure, the promoter sequence is bounded at its 3' terminus by the transcription initiation site and extends upstream (5' direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background. Within the promoter sequence will be found a transcription initiation site, as well as protein binding domains responsible for the binding of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA" boxes and "CAT" boxes. Various promoters, including inducible promoters, may be used to drive the various vectors of the present disclosure.

[0081] The term "naturally-occurring" or "unmodified" or "wild type" as used herein as applied to a nucleic acid, a polypeptide, a cell, or an organism, refers to a nucleic acid, polypeptide, cell, or organism that is found in nature. For example, a polypeptide or polynucleotide sequence that is present in an organism (including viruses) that can be isolated from a source in nature and which has not been intentionally modified by a human in the laboratory is wild type (and naturally occurring).

[0082] "Recombinant," as used herein, means that a particular nucleic acid (DNA or RNA) is the product of various combinations of cloning, restriction, polymerase chain reaction (PCR) and/or ligation steps resulting in a construct having a structural coding or non-coding sequence distinguishable from endogenous nucleic acids found in natural systems. DNA sequences encoding polypeptides can be assembled from cDNA fragments or from a series of synthetic oligonucleotides, to provide a synthetic nucleic acid which is capable of being expressed from a recombinant transcriptional unit contained in a cell or in a cell-free transcription and translation system. Genomic DNA comprising the relevant sequences can also be used in the formation of a recombinant gene or transcriptional unit. Sequences of non-translated DNA may be present 5' or 3' from the open reading frame, where such sequences do not interfere with manipulation or expression of the coding regions, and may indeed act to modulate production of a desired product by various mechanisms (see "DNA regulatory sequences", above). Alternatively, DNA sequences encoding RNA (e.g., C2c2 guide RNA) that is not translated may also be considered recombinant. Thus, e.g., the term "recombinant" nucleic acid refers to one which is not naturally occurring, e.g., is made by the artificial combination of two otherwise separated segments of sequence through human intervention. This artificial combination is often accomplished by either chemical synthesis means, or by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques. Such is usually done to replace a codon with a codon encoding the same amino acid, a conservative amino acid, or a non-conservative amino acid. Alternatively, it is performed to join together nucleic acid segments of desired functions to generate a desired combination of functions. When a recombinant polynucleotide encodes a polypeptide, the sequence of the encoded polypeptide can be naturally occurring ("wild type") or can be a variant (e.g., a mutant) of the naturally occurring sequence. Thus, the term "recombinant" polypeptide does not necessarily refer to a polypeptide whose sequence does not naturally occur. Instead, a "recombinant" polypeptide is encoded by a recombinant DNA sequence, but the sequence of the polypeptide can be naturally occurring ("wild type") or non-naturally occurring (e.g., a variant, a mutant, etc.). Thus, a "recombinant" polypeptide is the result of human intervention, but may be a naturally occurring amino acid sequence.

[0083] A "vector" or "expression vector" is a replicon, such as plasmid, phage, virus, or cosmid, to which another DNA segment, i.e. an "insert", may be attached so as to bring about the replication of the attached segment in a cell.

[0084] An "expression cassette" comprises a DNA coding sequence operably linked to a promoter. "Operably linked" refers to a juxtaposition wherein the components so described are in a relationship permitting them to function in their intended manner. For instance, a promoter is operably linked to a coding sequence if the promoter affects its transcription or expression.

[0085] The terms "recombinant expression vector" and "DNA construct" are used interchangeably herein to refer to a DNA molecule comprising a vector and at least one insert. Recombinant expression vectors are usually generated for the purpose of expressing and/or propagating the insert(s), or for the construction of other recombinant nucleotide sequences. The insert(s) may or may not be operably linked to a promoter sequence and may or may not be operably linked to DNA regulatory sequences.

[0086] Any given component, or combination of components can be unlabeled, or can be detectably labeled with a label moiety. In some cases, when two or more components are labeled, they can be labeled with label moieties that are distinguishable from one another.

[0087] General methods in molecular and cellular biochemistry can be found in such standard textbooks as Molecular Cloning: A Laboratory Manual, 3rd Ed. (Sambrook et al., Harbor Laboratory Press 2001); Short Protocols in Molecular Biology, 4th Ed. (Ausubel et al. eds., John Wiley & Sons 1999); Protein Methods (Bollag et al., John Wiley & Sons 1996); Nonviral Vectors for Gene Therapy (Wagner et al. eds., Academic Press 1999); Viral Vectors (Kaplitt & Loewy eds., Academic Press 1995); Immunology Methods Manual (I. Lefkovits ed., Academic Press 1997); and Cell and Tissue Culture: Laboratory Procedures in Biotechnology (Doyle & Griffiths, John Wiley & Sons 1998), the disclosures of which are incorporated herein by reference.

[0088] Before the present invention is further described, it is to be understood that this invention is not limited to particular embodiments described, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present invention will be limited only by the appended claims.

[0089] Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limit of that range and any other stated or intervening value in that stated range, is encompassed within the invention. The upper and lower limits of these smaller ranges may independently be included in the smaller ranges, and are also encompassed within the invention, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the invention.

[0090] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can also be used in the practice or testing of the present invention, the preferred methods and materials are now described. All publications mentioned herein are incorporated herein by reference to disclose and describe the methods and/or materials in connection with which the publications are cited.

[0091] It must be noted that as used herein and in the appended claims, the singular forms "a," "an," and "the" include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to "a protein" includes a plurality of such proteins and reference to "the guide RNA" includes reference to one or more such guide RNAs and equivalents thereof known to those skilled in the art, and so forth. It is further noted that the claims may be drafted to exclude any optional element. As such, this statement is intended to serve as antecedent basis for use of such exclusive terminology as "solely," "only" and the like in connection with the recitation of claim elements, or use of a "negative" limitation.

[0092] It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable sub-combination. All combinations of the embodiments pertaining to the invention are specifically embraced by the present invention and are disclosed herein just as if each and every combination was individually and explicitly disclosed. In addition, all sub-combinations of the various embodiments and elements thereof are also specifically embraced by the present invention and are disclosed herein just as if each and every such sub-combination was individually and explicitly disclosed herein.

[0093] The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention. Further, the dates of publication provided may be different from the actual publication dates which may need to be independently confirmed.

DETAILED DESCRIPTION

[0094] The present disclosure provides methods for detecting a single-stranded target RNA. The present disclosure provides methods of cleaving a precursor C2c2 guide RNA array into two or more C2c2 guide RNAs. The present disclosure provides a kit for detecting a target RNA in a sample. The term "C2c2 guide RNA" is used herein interchangeably with "Cas13a guide RNA" and in some cases a guide RNA is referred to as a crRNA (e.g., "Cas13a crRNA"); the term "C2c2 protein" (or "C2c2 polypeptide") is used herein interchangeably with "Cas13a protein" (or "Cas13a polypeptide").

METHODS OF DETECTING A SINGLE-STRANDED RNA

[0095] Provided are compositions and methods for detecting a single stranded target RNA, where the methods include (i) contacting a sample having a plurality of RNAs with (a) a C2c2 guide RNA that hybridizes with the single stranded target RNA, and (b) a C2c2 protein that cleaves RNAs present in the sample; and (ii) measuring a detectable signal produced by the cleavage. Once a subject C2c2 protein is activated by a C2c2 guide RNA, which occurs when the sample includes a single stranded target RNA to which the guide RNA hybridizes (i.e., the sample includes the targeted single stranded target RNA), the C2c2 protein is activated and functions as an endoribonuclease that non-specifically cleaves RNAs (including non-target RNAs) present in the sample. Thus, when the targeted single stranded target RNA is present in the sample (e.g., in some cases above a threshold amount), the result is cleavage of RNA (including non-target RNA) in the sample, which can be detected using any convenient detection method (e.g., using a labeled detector RNA). The contacting step is generally carried out in a composition comprising divalent metal ions. The contacting step can be carried out in an acellular environment, e.g., outside of a cell. The contacting step can be carried out inside a cell. The contacting step can be carried out in a cell in vitro. The contacting step can be carried out in a cell ex vivo. The contacting step can be carried out in a cell in vivo. In some cases, the C2c2 guide RNA is provided as RNA; and the C2c2 protein is provided as protein per se. In some cases, the C2c2 guide RNA is provided as DNA encoding the guide RNA; and the C2c2 protein is provided as protein per se. In some cases, the C2c2 guide RNA is provided as RNA; and the C2c2 protein is provided as RNA encoding the C2c2 protein. In some cases, the C2c2 guide RNA is provided as DNA encoding the guide RNA; and C2c2 protein is provided as RNA encoding the C2c2 protein. 
In some cases, the C2c2 guide RNA is provided as RNA; and the C2c2 protein is provided as DNA comprising a nucleotide sequence encoding the C2c2 protein. In some cases, the C2c2 guide RNA is provided as DNA encoding the guide RNA; and the C2c2 protein is provided as DNA comprising a nucleotide sequence encoding the C2c2 protein. In some cases, a method of the present disclosure provides for substantially simultaneous detection of two different target RNAs (a first single-stranded target RNA and a second single-stranded target RNA) in a sample.

[0096] In some cases, two or more (e.g., 3 or more, 4 or more, 5 or more, or 6 or more) C2c2 guide RNAs can be provided by using a precursor C2c2 guide RNA array, which can be cleaved by the C2c2 protein into individual ("mature") guide RNAs; cleavage of a precursor C2c2 guide RNA is independent of whether the C2c2 protein has intact HEPN1 and/or HEPN2 domains. Thus, also provided are methods of cleaving a precursor C2c2 guide RNA array into two or more C2c2 guide RNAs. Thus a C2c2 guide RNA array can include more than one guide sequence. In some cases, a subject C2c2 guide RNA can include a handle from a precursor crRNA but does not necessarily have to include multiple guide sequences. In some cases, the C2c2 protein lacks a catalytically active HEPN1 domain and/or lacks a catalytically active HEPN2 domain. The contacting step can be carried out in an acellular environment, e.g., outside of a cell. The contacting step can be carried out inside a cell. The contacting step can be carried out in a cell in vitro. The contacting step can be carried out in a cell ex vivo. The contacting step can be carried out in a cell in vivo.

[0097] In some cases (e.g., when contacting with a C2c2 guide RNA and a C2c2 protein, when contacting with a precursor C2c2 guide RNA array and a C2c2 protein, and the like), the sample is contacted for 2 hours or less (e.g., 1.5 hours or less, 1 hour or less, 40 minutes or less, 30 minutes or less, 20 minutes or less, 10 minutes or less, 5 minutes or less, or 1 minute or less) prior to the measuring step. For example, in some cases the sample is contacted for 40 minutes or less prior to the measuring step. In some cases the sample is contacted for 20 minutes or less prior to the measuring step. In some cases the sample is contacted for 10 minutes or less prior to the measuring step. In some cases the sample is contacted for 5 minutes or less prior to the measuring step. In some cases the sample is contacted for 1 minute or less prior to the measuring step. In some cases the sample is contacted for from 50 seconds to 60 seconds prior to the measuring step. In some cases the sample is contacted for from 40 seconds to 50 seconds prior to the measuring step. In some cases the sample is contacted for from 30 seconds to 40 seconds prior to the measuring step. In some cases the sample is contacted for from 20 seconds to 30 seconds prior to the measuring step. In some cases the sample is contacted for from 10 seconds to 20 seconds prior to the measuring step.

[0098] The present disclosure provides methods of detecting a single-stranded RNA in a sample comprising a plurality of RNAs (e.g., comprising a target RNA and a plurality of non-target RNAs). In some cases, the methods comprise: a) contacting the sample with: (i) a C2c2 guide RNA that hybridizes with the single stranded target RNA, and (ii) a C2c2 protein that cleaves RNAs present in the sample; and b) measuring a detectable signal produced by C2c2 protein-mediated RNA cleavage. In some cases, the methods comprise: a) contacting the sample with: (i) a precursor C2c2 guide RNA array comprising two or more C2c2 guide RNAs each of which has a different guide sequence; and (ii) a C2c2 protein that cleaves the precursor C2c2 guide RNA array into individual C2c2 guide RNAs, and also cleaves RNAs of the sample; and b) measuring a detectable signal produced by C2c2 protein-mediated RNA cleavage. In some cases, a method of the present disclosure provides for substantially simultaneous detection of two different target RNAs (a first single-stranded target RNA and a second single-stranded target RNA) in a sample.

[0099] A method of the present disclosure for detecting a single-stranded RNA (a single-stranded target RNA) in a sample comprising a plurality of RNAs (including the single stranded target RNA and a plurality of non-target RNAs) can detect a single-stranded target RNA with a high degree of sensitivity. In some cases, a method of the present disclosure can be used to detect a target single-stranded RNA present in a sample comprising a plurality of RNAs (including the single stranded target RNA and a plurality of non-target RNAs), where the target single-stranded RNA is present at one or more copies per 10⁷ non-target RNAs (e.g., one or more copies per 10⁶ non-target RNAs, one or more copies per 10⁵ non-target RNAs, one or more copies per 10⁴ non-target RNAs, one or more copies per 10³ non-target RNAs, one or more copies per 10² non-target RNAs, one or more copies per 50 non-target RNAs, one or more copies per 20 non-target RNAs, one or more copies per 10 non-target RNAs, or one or more copies per 5 non-target RNAs).

[00100] In some cases, a method of the present disclosure can detect a target single-stranded RNA present in a sample comprising a plurality of RNAs (including the single stranded target RNA and a plurality of non-target RNAs), where the target single-stranded RNA is present at from one copy per 10⁷ non-target RNAs to one copy per 10 non-target RNAs (e.g., from 1 copy per 10⁷ non-target RNAs to 1 copy per 10² non-target RNAs, from 1 copy per 10⁷ non-target RNAs to 1 copy per 10³ non-target RNAs, from 1 copy per 10⁷ non-target RNAs to 1 copy per 10⁴ non-target RNAs, from 1 copy per 10⁷ non-target RNAs to 1 copy per 10⁵ non-target RNAs, from 1 copy per 10⁷ non-target RNAs to 1 copy per 10⁶ non-target RNAs, from 1 copy per 10⁶ non-target RNAs to 1 copy per 10 non-target RNAs, from 1 copy per 10⁶ non-target RNAs to 1 copy per 10² non-target RNAs, from 1 copy per 10⁶ non-target RNAs to 1 copy per 10³ non-target RNAs, from 1 copy per 10⁶ non-target RNAs to 1 copy per 10⁴ non-target RNAs, from 1 copy per 10⁶ non-target RNAs to 1 copy per 10⁵ non-target RNAs, from 1 copy per 10⁵ non-target RNAs to 1 copy per 10 non-target RNAs, from 1 copy per 10⁵ non-target RNAs to 1 copy per 10² non-target RNAs, from 1 copy per 10⁵ non-target RNAs to 1 copy per 10³ non-target RNAs, or from 1 copy per 10⁵ non-target RNAs to 1 copy per 10⁴ non-target RNAs).

[00101] In some cases, a method of the present disclosure can detect a target single-stranded RNA present in a sample comprising a plurality of RNAs (including the single stranded target RNA and a plurality of non-target RNAs), where the target single-stranded RNA is present at from one copy per 10⁷ non-target RNAs to one copy per 100 non-target RNAs (e.g., from 1 copy per 10⁷ non-target RNAs to 1 copy per 10² non-target RNAs, from 1 copy per 10⁷ non-target RNAs to 1 copy per 10³ non-target RNAs, from 1 copy per 10⁷ non-target RNAs to 1 copy per 10⁴ non-target RNAs, from 1 copy per 10⁷ non-target RNAs to 1 copy per 10⁵ non-target RNAs, from 1 copy per 10⁷ non-target RNAs to 1 copy per 10⁶ non-target RNAs, from 1 copy per 10⁶ non-target RNAs to 1 copy per 100 non-target RNAs, from 1 copy per 10⁶ non-target RNAs to 1 copy per 10² non-target RNAs, from 1 copy per 10⁶ non-target RNAs to 1 copy per 10³ non-target RNAs, from 1 copy per 10⁶ non-target RNAs to 1 copy per 10⁴ non-target RNAs, from 1 copy per 10⁶ non-target RNAs to 1 copy per 10⁵ non-target RNAs, from 1 copy per 10⁵ non-target RNAs to 1 copy per 100 non-target RNAs, from 1 copy per 10⁵ non-target RNAs to 1 copy per 10² non-target RNAs, from 1 copy per 10⁵ non-target RNAs to 1 copy per 10³ non-target RNAs, or from 1 copy per 10⁵ non-target RNAs to 1 copy per 10⁴ non-target RNAs).

[00102] In some cases, the threshold of detection, for a subject method of detecting a single stranded target RNA in a sample, is 10 nM or less. The term "threshold of detection" is used herein to describe the minimal amount of target RNA that must be present in a sample in order for detection to occur. Thus, as an illustrative example, when a threshold of detection is 10 nM, then a signal can be detected when a target RNA is present in the sample at a concentration of 10 nM or more. In some cases, a method of the present disclosure has a threshold of detection of 5 nM or less. In some cases, a method of the present disclosure has a threshold of detection of 1 nM or less. In some cases, a method of the present disclosure has a threshold of detection of 0.5 nM or less. In some cases, a method of the present disclosure has a threshold of detection of 0.1 nM or less. In some cases, a method of the present disclosure has a threshold of detection of 0.05 nM or less. In some cases, a method of the present disclosure has a threshold of detection of 0.01 nM or less. In some cases, a method of the present disclosure has a threshold of detection of 0.005 nM or less. In some cases, a method of the present disclosure has a threshold of detection of 0.001 nM or less. In some cases, a method of the present disclosure has a threshold of detection of 0.0005 nM or less. In some cases, a method of the present disclosure has a threshold of detection of 0.0001 nM or less. In some cases, a method of the present disclosure has a threshold of detection of 0.00005 nM or less. In some cases, a method of the present disclosure has a threshold of detection of 0.00001 nM or less. In some cases, a method of the present disclosure has a threshold of detection of 10 pM or less. In some cases, a method of the present disclosure has a threshold of detection of 1 pM or less. In some cases, a method of the present disclosure has a threshold of detection of 500 fM or less. 
In some cases, a method of the present disclosure has a threshold of detection of 250 fM or less. In some cases, a method of the present disclosure has a threshold of detection of 100 fM or less. In some cases, a method of the present disclosure has a threshold of detection of 50 fM or less.
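The thresholds listed above are stated in a mix of nM, pM, and fM. As a minimal sketch, the following Python snippet converts such values to a common unit (fM) and applies the definition of "threshold of detection" given in this paragraph: a signal can be detected when the target RNA concentration is at or above the threshold. The helper names `to_fM` and `is_detectable` are hypothetical and are not part of the disclosure.

```python
from decimal import Decimal

# Conversion factors to femtomolar; Decimal avoids float rounding at equality.
UNIT_TO_FM = {"nM": Decimal("1e6"), "pM": Decimal("1e3"), "fM": Decimal("1")}


def to_fM(value, unit):
    """Convert a concentration given in nM, pM, or fM to fM (hypothetical helper)."""
    return Decimal(str(value)) * UNIT_TO_FM[unit]


def is_detectable(target, target_unit, threshold, threshold_unit):
    """Return True when the target concentration is at or above the threshold."""
    return to_fM(target, target_unit) >= to_fM(threshold, threshold_unit)


# A threshold of 0.00001 nM is the same quantity as 10 fM.
same_threshold = to_fM(0.00001, "nM") == to_fM(10, "fM")
```

Expressing every threshold in one unit makes it easier to compare the nM-based and fM-based statements in this paragraph against each other.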

[00103] In some cases, the threshold of detection (for detecting the single stranded target RNA in a subject method), is in a range of from 500 fM to 1 nM (e.g., from 500 fM to 500 pM, from 500 fM to 200 pM, from 500 fM to 100 pM, from 500 fM to 10 pM, from 500 fM to 1 pM, from 800 fM to 1 nM, from 800 fM to 500 pM, from 800 fM to 200 pM, from 800 fM to 100 pM, from 800 fM to 10 pM, from 800 fM to 1 pM, from 1 pM to 1 nM, from 1 pM to 500 pM, from 1 pM to 200 pM, from 1 pM to 100 pM, or from 1 pM to 10 pM) (where the concentration refers to the threshold concentration of target RNA at which the target RNA can be detected). In some cases, a method of the present disclosure has a threshold of detection in a range of from 800 fM to 100 pM. In some cases, a method of the present disclosure has a threshold of detection in a range of from 1 pM to 10 pM. In some cases, a method of the present disclosure has a threshold of detection in a range of from 10 fM to 500 fM, e.g., from 10 fM to 50 fM, from 50 fM to 100 fM, from 100 fM to 250 fM, or from 250 fM to 500 fM.

[00104] In some cases, the minimum concentration at which a single stranded target RNA can be detected in a sample is in a range of from 500 fM to 1 nM (e.g., from 500 fM to 500 pM, from 500 fM to 200 pM, from 500 fM to 100 pM, from 500 fM to 10 pM, from 500 fM to 1 pM, from 800 fM to 1 nM, from 800 fM to 500 pM, from 800 fM to 200 pM, from 800 fM to 100 pM, from 800 fM to 10 pM, from 800 fM to 1 pM, from 1 pM to 1 nM, from 1 pM to 500 pM, from 1 pM to 200 pM, from 1 pM to 100 pM, or from 1 pM to 10 pM). In some cases, the minimum concentration at which a single stranded target RNA can be detected in a sample is in a range of from 800 fM to 100 pM. In some cases, the minimum concentration at which a single stranded target RNA can be detected in a sample is in a range of from 1 pM to 10 pM.

[00105] In some cases, a method of the present disclosure can detect a target single-stranded RNA present in a sample comprising a plurality of RNAs (including the single stranded target RNA and a plurality of non-target RNAs), where the target single-stranded RNA is present at a concentration as low as 500 fM (e.g., as low as 800 fM, as low as 1 pM, as low as 10 pM or as low as 100 pM). In some cases, a method of the present disclosure can detect a target single-stranded RNA present in a sample comprising a plurality of RNAs (including the single stranded target RNA and a plurality of non-target RNAs), where the target single-stranded RNA is present at a concentration as low as 1 pM.

[00106] In some cases, a method of the present disclosure can detect a target single-stranded RNA present in a sample comprising a plurality of RNAs (including the single stranded target RNA and a plurality of non-target RNAs), where the target single-stranded RNA is present at a concentration as low as 500 fM (e.g., as low as 800 fM, as low as 1 pM, as low as 10 pM or as low as 100 pM), and where the sample is contacted for 60 minutes or less prior to the measuring step (e.g., in some cases 40 minutes or less). In some cases, a method of the present disclosure can detect a target single-stranded RNA present in a sample comprising a plurality of RNAs (including the single stranded target RNA and a plurality of non-target RNAs), where the target single-stranded RNA is present at a concentration as low as 1 pM, and where the sample is contacted for 60 minutes or less prior to the measuring step (e.g., in some cases 40 minutes or less).

[00107] For example, in some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 500 fM or more (e.g., 800 fM or more, 1 pM or more, 5 pM or more, 10 pM or more). In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 1 pM or more (e.g., 2 pM or more, 5 pM or more, or 8 pM or more). In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 500 fM or more (e.g., 1 pM or more, 5 pM or more, 10 pM or more), where the sample is contacted for 60 minutes or less prior to the measuring step (e.g., in some cases 40 minutes or less). In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 1 pM or more (e.g., 2 pM or more, 5 pM or more, or 8 pM or more), where the sample is contacted for 60 minutes or less prior to the measuring step (e.g., in some cases 40 minutes or less).

[00108] In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 10 nM or less. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 5 nM or less. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 1 nM or less. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 0.5 nM or less. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 0.1 nM or less. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 0.05 nM or less. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 0.01 nM or less. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 0.005 nM or less. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 0.001 nM or less. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 0.0005 nM or less. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 0.0001 nM or less. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 0.00005 nM or less. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of 0.00001 nM or less.

[00109] In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of from 10⁻⁶ nM to 1 nM, e.g., from 10⁻⁶ nM to 5 x 10⁻⁶ nM, from 5 x 10⁻⁶ nM to 10⁻⁵ nM, from 10⁻⁵ nM to 5 x 10⁻⁵ nM, from 5 x 10⁻⁵ nM to 10⁻⁴ nM, from 10⁻⁴ nM to 5 x 10⁻⁴ nM, from 5 x 10⁻⁴ nM to 10⁻³ nM, from 10⁻³ nM to 5 x 10⁻³ nM, from 5 x 10⁻³ nM to 10⁻² nM, from 10⁻² nM to 5 x 10⁻² nM, from 5 x 10⁻² nM to 0.1 nM, from 0.1 nM to 0.5 nM, from 0.5 nM to 1 nM, from 1 nM to 5 nM, or from 5 nM to 10 nM.

[00110] In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 10 nM. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 5 nM. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 1 nM. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 0.5 nM. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 0.1 nM. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 0.05 nM. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 0.01 nM. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 0.005 nM. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 0.001 nM. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 0.0005 nM. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 0.0001 nM. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 0.00005 nM. In some cases, a method of the present disclosure provides for detection of a target RNA present in a sample at a concentration of less than 0.00001 nM.

[00111] In some cases, a method of the present disclosure can be used to determine the amount of a target RNA in a sample (e.g., a sample comprising the target RNA and a plurality of non-target RNAs). Determining the amount of a target RNA in a sample can comprise comparing the amount of detectable signal generated from a test sample to the amount of detectable signal generated from a reference sample. Determining the amount of a target RNA in a sample can comprise: measuring the detectable signal to generate a test measurement; measuring a detectable signal produced by a reference sample to generate a reference measurement; and comparing the test measurement to the reference measurement to determine an amount of target RNA present in the sample.

[00112] For example, in some cases, a method of the present disclosure for determining the amount of a target RNA in a sample comprises: a) contacting the sample (e.g., a sample comprising the target RNA and a plurality of non-target RNAs) with: (i) a C2c2 guide RNA that hybridizes with the single stranded target RNA, and (ii) a C2c2 protein that cleaves RNAs present in the sample; b) measuring a detectable signal produced by C2c2 protein-mediated RNA cleavage, generating a test measurement; c) measuring a detectable signal produced by a reference sample to generate a reference measurement; and d) comparing the test measurement to the reference measurement to determine an amount of target RNA present in the sample.

[00113] As another example, in some cases, a method of the present disclosure for determining the amount of a target RNA in a sample comprises: a) contacting the sample (e.g., a sample comprising the target RNA and a plurality of non-target RNAs) with: (i) a precursor C2c2 guide RNA array comprising two or more C2c2 guide RNAs each of which has a different guide sequence; and (ii) a C2c2 protein that cleaves the precursor C2c2 guide RNA array into individual C2c2 guide RNAs, and also cleaves RNAs of the sample; b) measuring a detectable signal produced by C2c2 protein-mediated RNA cleavage, generating a test measurement; c) measuring a detectable signal produced by each of two or more reference samples to generate two or more reference measurements; and d) comparing the test measurement to the reference measurements to determine an amount of target RNA present in the sample.
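The comparison of a test measurement to two or more reference measurements can be sketched as interpolation on a standard curve. The snippet below is an illustration only, not the method of the disclosure. It assumes that signal rises monotonically with target RNA concentration and that interpolating linearly in log10(concentration) between the two bracketing reference samples is acceptable. The function name `estimate_concentration` and the reference values are hypothetical, made-up numbers.

```python
import math


def estimate_concentration(test_signal, references):
    """Estimate target RNA concentration from reference measurements.

    references: list of (known_concentration, measured_signal) pairs.
    Assumes signal increases monotonically with concentration and
    interpolates linearly in log10(concentration) between the two
    references whose signals bracket the test signal.
    Returns None when the test signal lies outside the reference range.
    """
    refs = sorted(references, key=lambda pair: pair[1])
    for (c_lo, s_lo), (c_hi, s_hi) in zip(refs, refs[1:]):
        if s_lo <= test_signal <= s_hi:
            if s_hi == s_lo:
                return c_lo
            frac = (test_signal - s_lo) / (s_hi - s_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None


# Made-up reference measurements: (concentration in pM, signal in arbitrary units).
reference_measurements = [(1, 100.0), (10, 200.0), (100, 300.0)]
estimate_pM = estimate_concentration(250.0, reference_measurements)
```

A test signal outside the range spanned by the reference samples returns `None` rather than an extrapolated value, since the sketch makes no assumption about the curve beyond the measured references.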

Samples

[00114] A subject sample includes a plurality of target RNAs. The term "plurality" is used herein to mean two or more. Thus, in some cases a sample includes two or more (e.g., 3 or more, 5 or more, 10 or more, 20 or more, 50 or more, 100 or more, 500 or more, 1,000 or more, or 5,000 or more) RNAs. A subject method can be used as a very sensitive way to detect a single stranded target RNA present in a complex mixture of RNAs. Thus, in some cases the sample includes 5 or more RNAs (e.g., 10 or more, 20 or more, 50 or more, 100 or more, 500 or more, 1,000 or more, or 5,000 or more RNAs) that differ from one another in sequence. In some cases, the sample includes 10 or more, 20 or more, 50 or more, 100 or more, 500 or more, 10³ or more, 5 x 10³ or more, 10⁴ or more, 5 x 10⁴ or more, 10⁵ or more, 5 x 10⁵ or more, 10⁶ or more, 5 x 10⁶ or more, or 10⁷ or more, RNAs that differ from one another in sequence. In some cases, the sample comprises from 10 to 20, from 20 to 50, from 50 to 100, from 100 to 500, from 500 to 10³, from 10³ to 5 x 10³, from 5 x 10³ to 10⁴, from 10⁴ to 5 x 10⁴, from 5 x 10⁴ to 10⁵, from 10⁵ to 5 x 10⁵, from 5 x 10⁵ to 10⁶, from 10⁶ to 5 x 10⁶, or from 5 x 10⁶ to 10⁷, or more than 10⁷, RNAs that differ from one another in sequence. In some cases, the sample comprises from 5 to 10⁷ RNAs that differ from one another in sequence (e.g., from 5 to 10⁶, from 5 to 10⁵, from 5 to 50,000, from 5 to 30,000, from 10 to 10⁶, from 10 to 10⁵, from 10 to 50,000, from 10 to 30,000, from 20 to 10⁶, from 20 to 10⁵, from 20 to 50,000, or from 20 to 30,000 RNAs that differ from one another in sequence). In some cases, the sample comprises from 5 to 50,000 RNAs that differ from one another in sequence (e.g., from 5 to 30,000, from 10 to 50,000, or from 10 to 30,000 RNAs that differ from one another in sequence). In some cases the sample includes 20 or more RNAs that differ from one another in sequence. In some cases, the sample includes RNAs from a cell lysate (e.g., a eukaryotic cell lysate, a mammalian cell lysate, a human cell lysate, a prokaryotic cell lysate, a plant cell lysate, and the like). For example, in some cases the sample includes expressed RNAs from a cell such as a eukaryotic cell, e.g., a mammalian cell such as a human cell.

[00115] The term "sample" is used herein to mean any sample that includes single stranded RNA. The sample can be derived from any source, e.g., the sample can be a synthetic combination of purified RNAs; the sample can be a cell lysate, an RNA-enriched cell lysate, or RNAs isolated and/or purified from a cell lysate. The sample can be from a patient (e.g., for the purpose of diagnosis). The sample can be from permeabilized cells. The sample can be from crosslinked cells. The sample can be in tissue sections. The sample can be from tissues prepared by crosslinking followed by delipidation and adjustment to make a uniform refractive index. Examples of tissue preparation by crosslinking followed by delipidation and adjustment to make a uniform refractive index have been described in, for example, Shah et al., Development (2016) 143, 2862-2867 doi: 10.1242/dev.138560.

[00116] A "sample" can include a single stranded target RNA and a plurality of non-target RNAs. In some cases, the target single-stranded RNA is present in the sample at one copy per 10 non-target RNAs, one copy per 20 non-target RNAs, one copy per 25 non-target RNAs, one copy per 50 non-target RNAs, one copy per 100 non-target RNAs, one copy per 500 non-target RNAs, one copy per 10³ non-target RNAs, one copy per 5 x 10³ non-target RNAs, one copy per 10⁴ non-target RNAs, one copy per 5 x 10⁴ non-target RNAs, one copy per 10⁵ non-target RNAs, one copy per 5 x 10⁵ non-target RNAs, one copy per 10⁶ non-target RNAs, or less than one copy per 10⁶ non-target RNAs. In some cases, the target single-stranded RNA is present in the sample at from one copy per 10 non-target RNAs to 1 copy per 20 non-target RNAs, from 1 copy per 20 non-target RNAs to 1 copy per 50 non-target RNAs, from 1 copy per 50 non-target RNAs to 1 copy per 100 non-target RNAs, from 1 copy per 100 non-target RNAs to 1 copy per 500 non-target RNAs, from 1 copy per 500 non-target RNAs to 1 copy per 10³ non-target RNAs, from 1 copy per 10³ non-target RNAs to 1 copy per 5 x 10³ non-target RNAs, from 1 copy per 5 x 10³ non-target RNAs to 1 copy per 10⁴ non-target RNAs, from 1 copy per 10⁴ non-target RNAs to 1 copy per 10⁵ non-target RNAs, from 1 copy per 10⁵ non-target RNAs to 1 copy per 10⁶ non-target RNAs, or from 1 copy per 10⁶ non-target RNAs to 1 copy per 10⁷ non-target RNAs.

[00117] Suitable samples include but are not limited to blood, serum, plasma, urine, aspirate, and biopsy samples. Thus, the term "sample" with respect to a patient encompasses blood and other liquid samples of biological origin, solid tissue samples such as a biopsy specimen or tissue cultures or cells derived therefrom and the progeny thereof. The definition also includes samples that have been manipulated in any way after their procurement, such as by treatment with reagents, washing, or enrichment for certain cell populations, such as cancer cells. The definition also includes samples that have been enriched for particular types of molecules, e.g., RNAs. The term "sample" encompasses biological samples such as a clinical sample such as blood, plasma, serum, aspirate, cerebral spinal fluid (CSF), and also includes tissue obtained by surgical resection, tissue obtained by biopsy, cells in culture, cell supernatants, cell lysates, tissue samples, organs, bone marrow, and the like. A "biological sample" includes biological fluids derived therefrom (e.g., cancerous cell, infected cell, etc.), e.g., a sample comprising RNAs that is obtained from such cells (e.g., a cell lysate or other cell extract comprising RNAs).

[00118] A sample can comprise, or can be obtained from, any of a variety of cells, tissues, organs, or acellular fluids. Suitable sample sources include eukaryotic cells, bacterial cells, and archaeal cells. Suitable sample sources include single-celled organisms and multi-cellular organisms. Suitable sample sources include single-cell eukaryotic organisms; a plant or a plant cell; an algal cell, e.g., Botryococcus braunii, Chlamydomonas reinhardtii, Nannochloropsis gaditana, Chlorella pyrenoidosa, Sargassum patens, C. agardh, and the like; a fungal cell (e.g., a yeast cell); an animal cell, tissue, or organ; a cell, tissue, or organ from an invertebrate animal (e.g., fruit fly, cnidarian, echinoderm, nematode, an insect, an arachnid, etc.); a cell, tissue, fluid, or organ from a vertebrate animal (e.g., fish, amphibian, reptile, bird, mammal); a cell, tissue, fluid, or organ from a mammal (e.g., a human; a non-human primate; an ungulate; a feline; a bovine; an ovine; a caprine; etc.). Suitable sample sources include nematodes, protozoans, and the like. Suitable sample sources include parasites such as helminths, malarial parasites, etc.

[00119] Suitable sample sources include a cell, tissue, or organism of any of the six kingdoms, e.g., Bacteria (e.g., Eubacteria); Archaebacteria; Protista; Fungi; Plantae; and Animalia. Suitable sample sources include plant-like members of the kingdom Protista, including, but not limited to, algae (e.g., green algae, red algae, glaucophytes, cyanobacteria); fungus-like members of Protista, e.g., slime molds, water molds, etc.; animal-like members of Protista, e.g., flagellates (e.g., Euglena), amoeboids (e.g., amoeba), sporozoans (e.g., Apicomplexa, Myxozoa, Microsporidia), and ciliates (e.g., Paramecium). Suitable sample sources include members of the kingdom Fungi, including, but not limited to, members of any of the phyla: Basidiomycota (club fungi; e.g., members of Agaricus, Amanita, Boletus, Cantharellus, etc.); Ascomycota (sac fungi, including, e.g., Saccharomyces); Mycophycophyta (lichens); Zygomycota (conjugation fungi); and Deuteromycota. Suitable sample sources include members of the kingdom Plantae, including, but not limited to, members of any of the following divisions: Bryophyta (e.g., mosses), Anthocerotophyta (e.g., hornworts), Hepaticophyta (e.g., liverworts), Lycophyta (e.g., club mosses), Sphenophyta (e.g., horsetails), Psilophyta (e.g., whisk ferns), Ophioglossophyta, Pterophyta (e.g., ferns), Cycadophyta, Ginkgophyta, Pinophyta, Gnetophyta, and Magnoliophyta (e.g., flowering plants). Suitable sample sources include members of the kingdom Animalia, including, but not limited to, members of any of the following phyla: Porifera (sponges); Placozoa; Orthonectida (parasites of marine invertebrates); Rhombozoa; Cnidaria (corals, anemones, jellyfish, sea pens, sea pansies, sea wasps); Ctenophora (comb jellies); Platyhelminthes (flatworms); Nemertina (ribbon worms); Gnathostomulida (jawed worms); Gastrotricha; Rotifera; Priapulida; Kinorhyncha; Loricifera; Acanthocephala; Entoprocta; Nematoda; Nematomorpha; Cycliophora; Mollusca (mollusks); Sipuncula (peanut worms); Annelida (segmented worms); Tardigrada (water bears); Onychophora (velvet worms); Arthropoda (including the subphyla: Chelicerata, Myriapoda, Hexapoda, and Crustacea, where the Chelicerata include, e.g., arachnids, Merostomata, and Pycnogonida, where the Myriapoda include, e.g., Chilopoda (centipedes), Diplopoda (millipedes), Pauropoda, and Symphyla, where the Hexapoda include insects, and where the Crustacea include shrimp, krill, barnacles, etc.); Phoronida; Ectoprocta (moss animals); Brachiopoda; Echinodermata (e.g., starfish, sea daisies, feather stars, sea urchins, sea cucumbers, brittle stars, brittle baskets, etc.); Chaetognatha (arrow worms); Hemichordata (acorn worms); and Chordata. Suitable members of Chordata include any member of the following subphyla: Urochordata (sea squirts; including Ascidiacea, Thaliacea, and Larvacea); Cephalochordata (lancelets); Myxini (hagfish); and Vertebrata, where members of Vertebrata include, e.g., members of Petromyzontida (lampreys), Chondrichthyes (cartilaginous fish), Actinopterygii (ray-finned fish), Actinistia (coelacanths), Dipnoi (lungfish), Reptilia (reptiles, e.g., snakes, alligators, crocodiles, lizards, etc.), Aves (birds); and Mammalia (mammals). Suitable plants include any monocotyledon and any dicotyledon.

Suitable sources of a sample include cells, fluid, tissue, or organ taken from an organism; from a particular cell or group of cells isolated from an organism; etc. For example, where the organism is a plant, suitable sources include xylem, the phloem, the cambium layer, leaves, roots, etc. Where the organism is an animal, suitable sources include particular tissues (e.g., lung, liver, heart, kidney, brain, spleen, skin, fetal tissue, etc.), or a particular cell type (e.g., neuronal cells, epithelial cells, endothelial cells, astrocytes, macrophages, glial cells, islet cells, T lymphocytes, B lymphocytes, etc.).

In some cases, the source of the sample is a diseased cell, fluid, tissue, or organ. In some cases, the source of the sample is a normal (non-diseased) cell, fluid, tissue, or organ. In some cases, the source of the sample is a pathogen-infected cell, tissue, or organ. Pathogens include viruses, fungi, helminths, protozoa, malarial parasites, Plasmodium parasites, Toxoplasma parasites, Schistosoma parasites, and the like. "Helminths" include roundworms, heartworms, and phytophagous nematodes (Nematoda), flukes (Trematoda), Acanthocephala, and tapeworms (Cestoda). Protozoan infections include infections from Giardia spp., Trichomonas spp., African trypanosomiasis, amoebic dysentery, babesiosis, balantidial dysentery, Chagas disease, coccidiosis, malaria and toxoplasmosis. Examples of pathogens such as parasitic/protozoan pathogens include, but are not limited to: Plasmodium falciparum, Plasmodium vivax, Trypanosoma cruzi and Toxoplasma gondii. Fungal pathogens include, but are not limited to: Cryptococcus neoformans, Histoplasma capsulatum, Coccidioides immitis, Blastomyces dermatitidis, Chlamydia trachomatis, and Candida albicans. Pathogenic viruses include, e.g., immunodeficiency virus (e.g., HIV); influenza virus; dengue; West Nile virus; herpes virus; yellow fever virus; Hepatitis Virus C; Hepatitis Virus A; Hepatitis Virus B; papillomavirus; and the like. Pathogens include, e.g., HIV virus, Mycobacterium tuberculosis, Streptococcus agalactiae, methicillin-resistant Staphylococcus aureus, Legionella pneumophila, Streptococcus pyogenes, Escherichia coli, Neisseria gonorrhoeae, Neisseria meningitidis, Pneumococcus, Cryptococcus neoformans, Histoplasma capsulatum, Hemophilus influenzae B, Treponema pallidum, Lyme disease spirochetes, Pseudomonas aeruginosa, Mycobacterium leprae, Brucella abortus, rabies virus, influenza virus, cytomegalovirus, herpes simplex virus I, herpes simplex virus II, human serum parvo-like virus, respiratory syncytial virus, varicella-zoster virus, hepatitis B virus, hepatitis C virus, measles virus, adenovirus, human T-cell leukemia viruses, Epstein-Barr virus, murine leukemia virus, mumps virus, vesicular stomatitis virus, Sindbis virus, lymphocytic choriomeningitis virus, wart virus, blue tongue virus, Sendai virus, feline leukemia virus, Reovirus, polio virus, simian virus 40, mouse mammary tumor virus, dengue virus, rubella virus, West Nile virus, Plasmodium falciparum, Plasmodium vivax, Toxoplasma gondii, Trypanosoma rangeli, Trypanosoma cruzi, Trypanosoma rhodesiense, Trypanosoma brucei, Schistosoma mansoni, Schistosoma japonicum, Babesia bovis, Eimeria tenella, Onchocerca volvulus, Leishmania tropica, Mycobacterium tuberculosis, Trichinella spiralis, Theileria parva, Taenia hydatigena, Taenia ovis, Taenia saginata, Echinococcus granulosus, Mesocestoides corti, Mycoplasma arthritidis, M. hyorhinis, M. orale, M. arginini, Acholeplasma laidlawii, M. salivarium and M. pneumoniae.

Target RNA

A target RNA can be any single stranded RNA (ssRNA). Examples include but are not limited to mRNA, rRNA, tRNA, non-coding RNA (ncRNA), long non-coding RNA (lncRNA), and microRNA (miRNA). In some cases, the target ssRNA is mRNA. In some cases, the single stranded target nucleic acid is ssRNA from a virus (e.g., Zika virus, human immunodeficiency virus, influenza virus, and the like). In some cases, the single-stranded target nucleic acid is ssRNA of a parasite. In some cases, the single-stranded target nucleic acid is ssRNA of a bacterium, e.g., a pathogenic bacterium. The source of the target RNA can be the same as the source of the RNA sample, as described above.

Measuring a detectable signal

[00122] In some cases, a subject method includes a step of measuring (e.g., measuring a detectable signal produced by C2c2 protein-mediated RNA cleavage). Because a C2c2 protein cleaves non-targeted RNA once activated, which occurs when a C2c2 guide RNA hybridizes with a target RNA in the presence of a C2c2 protein, a detectable signal can be any signal that is produced when RNA is cleaved. For example, in some cases the step of measuring can include one or more of: gold nanoparticle based detection (e.g., see Xu et al., Angew Chem Int Ed Engl. 2007;46(19):3468-70; and Xia et al., Proc Natl Acad Sci U S A. 2010 Jun 15;107(24):10837-41), fluorescence polarization, colloid phase transition/dispersion (e.g., Baksh et al., Nature. 2004 Jan 8;427(6970):139-41), electrochemical detection, semiconductor-based sensing (e.g., Rothberg et al., Nature. 2011 Jul 20;475(7356):348-52; e.g., one could use a phosphatase to generate a pH change after RNA cleavage reactions, by opening 2'-3' cyclic phosphates, and by releasing inorganic phosphate into solution), and detection of a labeled detector RNA (see below for more details). The readout of such detection methods can be any convenient readout.

Examples of possible readouts include but are not limited to: a measured amount of detectable fluorescent signal; a visual analysis of bands on a gel (e.g., bands that represent cleaved product versus uncleaved substrate), a visual or sensor based detection of the presence or absence of a color (i.e., color detection method), and the presence or absence of (or a particular amount of) an electrical signal.

[00123] The measuring can in some cases be quantitative, e.g., in the sense that the amount of signal detected can be used to determine the amount of target RNA present in the sample. The measuring can in some cases be qualitative, e.g., in the sense that the presence or absence of detectable signal can indicate the presence or absence of targeted RNA. In some cases, a detectable signal will not be present (e.g., above a given threshold level) unless the targeted RNA(s) is present above a particular threshold concentration (e.g., see Fig. 5). In some cases, the threshold of detection can be titrated by modifying the amount of C2c2 protein, guide RNA, sample volume, and/or detector RNA (if one is used). As such, for example, as would be understood by one of ordinary skill in the art, a number of controls can be used if desired in order to set up one or more reactions, each set up to detect a different threshold level of target RNA, and thus such a series of reactions could be used to determine the amount of target RNA present in a sample (e.g., one could use such a series of reactions to determine that a target RNA is present in the sample 'at a concentration of at least X').

Labeled detector RNA

[00124] In some cases, a subject method includes contacting a sample (e.g., a sample comprising a target RNA and a plurality of non-target RNAs) with: i) a labeled detector RNA; ii) a C2c2 protein; and iii) a C2c2 guide RNA (or precursor C2c2 guide RNA array). For example, in some cases, a subject method includes contacting a sample with a labeled detector RNA comprising a fluorescence-emitting dye pair; the C2c2 protein cleaves the labeled detector RNA after it is activated (by binding to the C2c2 guide RNA in the context of the guide RNA hybridizing to a target RNA); and the detectable signal that is measured is produced by the fluorescence-emitting dye pair. For example, in some cases, a subject method includes contacting a sample with a labeled detector RNA comprising a fluorescence resonance energy transfer (FRET) pair or a quencher/fluor pair, or both. In some cases, a subject method includes contacting a sample with a labeled detector RNA comprising a FRET pair. In some cases, a subject method includes contacting a sample with a labeled detector RNA comprising a fluor/quencher pair.

Fluorescence-emitting dye pairs comprise a FRET pair or a quencher/fluor pair. In both cases of a FRET pair and a quencher/fluor pair, the emission spectrum of one of the dyes overlaps a region of the absorption spectrum of the other dye in the pair. As used herein, the term "fluorescence-emitting dye pair" is a generic term used to encompass both a "fluorescence resonance energy transfer (FRET) pair" and a "quencher/fluor pair," both of which terms are discussed in more detail below. The term "fluorescence-emitting dye pair" is used interchangeably with the phrase "a FRET pair and/or a quencher/fluor pair."

[00125] In some cases (e.g., when the detector RNA includes a FRET pair) the labeled detector RNA produces an amount of detectable signal prior to being cleaved, and the amount of detectable signal that is measured is reduced when the labeled detector RNA is cleaved. In some cases, the labeled detector RNA produces a first detectable signal prior to being cleaved (e.g., from a FRET pair) and a second detectable signal when the labeled detector RNA is cleaved (e.g., from a quencher/fluor pair). As such, in some cases, the labeled detector RNA comprises a FRET pair and a quencher/fluor pair.

[00126] In some cases, the labeled detector RNA comprises a FRET pair. FRET is a process by which radiationless transfer of energy occurs from an excited state fluorophore to a second chromophore in close proximity. The range over which the energy transfer can take place is limited to approximately 10 nanometers (100 angstroms), and the efficiency of transfer is extremely sensitive to the separation distance between fluorophores. Thus, as used herein, the term "FRET" ("fluorescence resonance energy transfer"; also known as "Förster resonance energy transfer") refers to a physical phenomenon involving a donor fluorophore and a matching acceptor fluorophore selected so that the emission spectrum of the donor overlaps the excitation spectrum of the acceptor, and further selected so that when donor and acceptor are in close proximity (usually 10 nm or less) to one another, excitation of the donor will cause excitation of and emission from the acceptor, as some of the energy passes from donor to acceptor via a quantum coupling effect. Thus, a FRET signal serves as a proximity gauge of the donor and acceptor; only when they are in close proximity to one another is a signal generated. The FRET donor moiety (e.g., donor fluorophore) and FRET acceptor moiety (e.g., acceptor fluorophore) are collectively referred to herein as a "FRET pair".
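The distance sensitivity described above is commonly expressed by the standard Förster relation, E = 1 / (1 + (r/R0)⁶), where r is the donor-acceptor separation and R0 is the Förster distance at which transfer efficiency is 50%. This relation is general background and is not stated in the present disclosure. The following Python sketch evaluates it. The function name `fret_efficiency` is hypothetical, and the default R0 of 5 nm is an illustrative assumption, not a value for any particular dye pair.

```python
def fret_efficiency(r_nm, r0_nm=5.0):
    """Transfer efficiency from the Forster relation E = 1 / (1 + (r / R0)**6).

    r_nm:  donor-acceptor separation in nanometers.
    r0_nm: Forster distance in nanometers (5.0 is an assumed, illustrative value).
    """
    if r_nm < 0 or r0_nm <= 0:
        raise ValueError("r_nm must be non-negative and r0_nm must be positive")
    return 1.0 / (1.0 + (r_nm / r0_nm) ** 6)


# Efficiency at half, equal to, and twice the assumed Forster distance.
efficiencies = [fret_efficiency(r) for r in (2.5, 5.0, 10.0)]
```

Because of the sixth-power dependence, doubling the separation from R0 to 2·R0 drops the efficiency from 50% to under 2%, which is consistent with the FRET signal being reduced or absent once cleavage separates the donor from the acceptor.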

[00127] The donor-acceptor pair (a FRET donor moiety and a FRET acceptor moiety) is referred to herein as a "FRET pair" or a "signal FRET pair." Thus, in some cases, a subject labeled detector RNA includes two signal partners (a signal pair), when one signal partner is a FRET donor moiety and the other signal partner is a FRET acceptor moiety. A subject labeled detector RNA that includes such a FRET pair (a FRET donor moiety and a FRET acceptor moiety) will thus exhibit a detectable signal (a FRET signal) when the signal partners are in close proximity (e.g., while on the same RNA molecule), but the signal will be reduced (or absent) when the partners are separated (e.g., after cleavage of the RNA molecule by a C2c2 protein).

[00128] FRET donor and acceptor moieties (FRET pairs) will be known to one of ordinary skill in the art and any convenient FRET pair (e.g., any convenient donor and acceptor moiety pair) can be used. Examples of suitable FRET pairs include but are not limited to those presented in Table 1. See also: Bajar et al. Sensors (Basel). 2016 Sep 14;16(9); and Abraham et al. PLoS One. 2015 Aug 3;10(8):e0134436.

[00129] Table 1. Examples of FRET pairs (donor and acceptor FRET moieties)

Donor Acceptor

Cy5 Cy5.5

(1) 5-(2-iodoacetylaminoethyl)aminonaphthalene-1-sulfonic acid

(2) N-(4-dimethylamino-3,5-dinitrophenyl)maleimide

(3) carboxyfluorescein succinimidyl ester

(4) 4,4-difluoro-4-bora-3a,4a-diaza-s-indacene

[00130] In some cases, a detectable signal is produced when the labeled detector RNA is cleaved (e.g., in some cases, the labeled detector RNA comprises a quencher/fluor pair). One signal partner of a signal quenching pair produces a detectable signal and the other signal partner is a quencher moiety that quenches the detectable signal of the first signal partner (i.e., the quencher moiety quenches the signal of the signal moiety such that the signal from the signal moiety is reduced (quenched) when the signal partners are in proximity to one another, e.g., when the signal partners of the signal pair are in close proximity).

[00131] For example, in some cases, an amount of detectable signal increases when the labeled detector RNA is cleaved. For example, in some cases, the signal exhibited by one signal partner (a signal moiety) is quenched by the other signal partner (a quencher signal moiety), e.g., when both are present on the same RNA molecule prior to cleavage by a C2c2 protein. Such a signal pair is referred to herein as a "quencher/fluor pair", "quenching pair", or "signal quenching pair." For example, in some cases, one signal partner (e.g., the first signal partner) is a signal moiety that produces a detectable signal that is quenched by the second signal partner (e.g., a quencher moiety). The signal partners of such a quencher/fluor pair will thus produce a detectable signal when the partners are separated (e.g., after cleavage of the detector RNA by a C2c2 protein), but the signal will be quenched when the partners are in close proximity (e.g., prior to cleavage of the detector RNA by a C2c2 protein).

[00132] A quencher moiety can quench a signal from the signal moiety (e.g., prior to cleavage of the detector RNA by a C2c2 protein) to various degrees. In some cases, a quencher moiety quenches the signal from the signal moiety where the signal detected in the presence of the quencher moiety (when the signal partners are in proximity to one another) is 95% or less of the signal detected in the absence of the quencher moiety (when the signal partners are separated). For example, in some cases, the signal detected in the presence of the quencher moiety can be 90% or less, 80% or less, 70% or less, 60% or less, 50% or less, 40% or less, 30% or less, 20% or less, 15% or less, 10% or less, or 5% or less of the signal detected in the absence of the quencher moiety. In some cases, no signal (e.g., above background) is detected in the presence of the quencher moiety.

[00133] In some cases, the signal detected in the absence of the quencher moiety (when the signal partners are separated) is at least 1.2 fold greater (e.g., at least 1.3 fold, at least 1.5 fold, at least 1.7 fold, at least 2 fold, at least 2.5 fold, at least 3 fold, at least 3.5 fold, at least 4 fold, at least 5 fold, at least 7 fold, at least 10 fold, at least 20 fold, or at least 50 fold greater) than the signal detected in the presence of the quencher moiety (when the signal partners are in proximity to one another).
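
The two ways of expressing the degree of quenching described above (residual signal as a percentage, and fold increase upon separation) are reciprocal views of the same pair of measurements. The following minimal sketch shows the arithmetic; the function name, argument names, and example readings are hypothetical and are given for illustration only.

```python
def quench_metrics(signal_quenched: float, signal_unquenched: float) -> dict:
    """Summarize quenching from two signal readings (arbitrary units).

    signal_quenched:   signal with the partners in proximity (before cleavage).
    signal_unquenched: signal with the partners separated (after cleavage).
    """
    if signal_quenched < 0 or signal_unquenched <= 0:
        raise ValueError("signals must be non-negative; unquenched signal must be positive")
    # Residual signal in the presence of the quencher, as a percentage.
    residual_percent = 100.0 * signal_quenched / signal_unquenched
    # Fold increase in signal once the partners are separated.
    if signal_quenched == 0:
        fold_increase = float("inf")
    else:
        fold_increase = signal_unquenched / signal_quenched
    return {"residual_percent": residual_percent, "fold_increase": fold_increase}


if __name__ == "__main__":
    # Hypothetical readings: 20 units before cleavage, 100 units after.
    print(quench_metrics(20.0, 100.0))
```

For the hypothetical readings shown, the residual signal is 20% of the unquenched signal, which corresponds to a 5 fold increase upon separation.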

[00134] In some cases, the signal moiety is a fluorescent label. In some such cases, the quencher moiety quenches the signal (the light signal) from the fluorescent label (e.g., by absorbing energy in the emission spectra of the label). Thus, when the quencher moiety is not in proximity with the signal moiety, the emission (the signal) from the fluorescent label is detectable because the signal is not absorbed by the quencher moiety. Any convenient donor acceptor pair (signal moiety /quencher moiety pair) can be used and many suitable pairs are known in the art.

[00135] In some cases the quencher moiety absorbs energy from the signal moiety (also referred to herein as a "detectable label") and then emits a signal (e.g., light at a different wavelength). Thus, in some cases, the quencher moiety is itself a signal moiety (e.g., a signal moiety can be 6-carboxyfluorescein while the quencher moiety can be 6-carboxy-tetramethylrhodamine), and in some such cases, the pair could also be a FRET pair. In some cases, a quencher moiety is a dark quencher. A dark quencher can absorb excitation energy and dissipate the energy in a different way (e.g., as heat). Thus, a dark quencher has minimal to no fluorescence of its own (does not emit fluorescence). Examples of dark quenchers are further described in U.S. patent numbers 8,822,673 and 8,586,718; U.S. patent publications 20140378330, 20140349295, and 20140194611; and international patent applications: WO200142505 and WO200186001, all of which are hereby incorporated by reference in their entirety.

[00136] Examples of fluorescent labels include, but are not limited to: an Alexa Fluor® dye, an ATTO dye (e.g., ATTO 390, ATTO 425, ATTO 465, ATTO 488, ATTO 495, ATTO 514, ATTO 520, ATTO 532, ATTO Rho6G, ATTO 542, ATTO 550, ATTO 565, ATTO Rho3B, ATTO Rho11, ATTO Rho12, ATTO Thio12, ATTO Rho101, ATTO 590, ATTO 594, ATTO Rho13, ATTO 610, ATTO 620, ATTO Rho14, ATTO 633, ATTO 647, ATTO 647N, ATTO 655, ATTO Oxa12, ATTO 665, ATTO 680, ATTO 700, ATTO 725, ATTO 740), a DyLight dye, a cyanine dye (e.g., Cy2, Cy3, Cy3.5, Cy3b, Cy5, Cy5.5, Cy7, Cy7.5), a FluoProbes dye, a Sulfo Cy dye, a Seta dye, an IRIS Dye, a SeTau dye, an SRfluor dye, a Square dye, fluorescein isothiocyanate (FITC), tetramethylrhodamine (TRITC), Texas Red, Oregon Green, Pacific Blue, Pacific Green, Pacific Orange, quantum dots, and a tethered fluorescent protein.

[00137] In some cases, a detectable label is a fluorescent label selected from: an Alexa Fluor® dye, an ATTO dye (e.g., ATTO 390, ATTO 425, ATTO 465, ATTO 488, ATTO 495, ATTO 514, ATTO 520, ATTO 532, ATTO Rho6G, ATTO 542, ATTO 550, ATTO 565, ATTO Rho3B, ATTO Rho11, ATTO Rho12, ATTO Thio12, ATTO Rho101, ATTO 590, ATTO 594, ATTO Rho13, ATTO 610, ATTO 620, ATTO Rho14, ATTO 633, ATTO 647, ATTO 647N, ATTO 655, ATTO Oxa12, ATTO 665, ATTO 680, ATTO 700, ATTO 725, ATTO 740), a DyLight dye, a cyanine dye (e.g., Cy2, Cy3, Cy3.5, Cy3b, Cy5, Cy5.5, Cy7, Cy7.5), a FluoProbes dye, a Sulfo Cy dye, a Seta dye, an IRIS Dye, a SeTau dye, an SRfluor dye, a Square dye, fluorescein (FITC), tetramethylrhodamine (TRITC), Texas Red, Oregon Green, Pacific Blue, Pacific Green, and Pacific Orange.

[00138] In some cases, a detectable label is a fluorescent label selected from: an Alexa Fluor® dye, an ATTO dye (e.g., ATTO 390, ATTO 425, ATTO 465, ATTO 488, ATTO 495, ATTO 514, ATTO 520, ATTO 532, ATTO Rho6G, ATTO 542, ATTO 550, ATTO 565, ATTO Rho3B, ATTO Rho11, ATTO Rho12, ATTO Thio12, ATTO Rho101, ATTO 590, ATTO 594, ATTO Rho13, ATTO 610, ATTO 620, ATTO Rho14, ATTO 633, ATTO 647, ATTO 647N, ATTO 655, ATTO Oxa12, ATTO 665, ATTO 680, ATTO 700, ATTO 725, ATTO 740), a DyLight dye, a cyanine dye (e.g., Cy2, Cy3, Cy3.5, Cy3b, Cy5, Cy5.5, Cy7, Cy7.5), a FluoProbes dye, a Sulfo Cy dye, a Seta dye, an IRIS Dye, a SeTau dye, an SRfluor dye, a Square dye, fluorescein (FITC), tetramethylrhodamine (TRITC), Texas Red, Oregon Green, Pacific Blue, Pacific Green, Pacific Orange, a quantum dot, and a tethered fluorescent protein.

[00139] Examples of ATTO dyes include, but are not limited to: ATTO 390, ATTO 425, ATTO 465, ATTO 488, ATTO 495, ATTO 514, ATTO 520, ATTO 532, ATTO Rho6G, ATTO 542, ATTO 550, ATTO 565, ATTO Rho3B, ATTO Rho11, ATTO Rho12, ATTO Thio12, ATTO Rho101, ATTO 590, ATTO 594, ATTO Rho13, ATTO 610, ATTO 620, ATTO Rho14, ATTO 633, ATTO 647, ATTO 647N, ATTO 655, ATTO Oxa12, ATTO 665, ATTO 680, ATTO 700, ATTO 725, and ATTO 740.

[00140] Examples of Alexa Fluor® dyes include, but are not limited to: Alexa Fluor® 350, Alexa Fluor® 405, Alexa Fluor® 430, Alexa Fluor® 488, Alexa Fluor® 500, Alexa Fluor® 514, Alexa Fluor® 532, Alexa Fluor® 546, Alexa Fluor® 555, Alexa Fluor® 568, Alexa Fluor® 594, Alexa Fluor® 610, Alexa Fluor® 633, Alexa Fluor® 635, Alexa Fluor® 647, Alexa Fluor® 660, Alexa Fluor® 680, Alexa Fluor® 700, Alexa Fluor® 750, Alexa Fluor® 790, and the like.

[00141] Examples of quencher moieties include, but are not limited to: a dark quencher, a Black Hole Quencher® (BHQ®) (e.g., BHQ-0, BHQ-1, BHQ-2, BHQ-3), a QXL quencher, an ATTO quencher (e.g., ATTO 540Q, ATTO 580Q, and ATTO 612Q), dimethylaminoazobenzenesulfonic acid (Dabsyl), Iowa Black RQ, Iowa Black FQ, IRDye QC-1, a QSY dye (e.g., QSY 7, QSY 9, QSY 21), AbsoluteQuencher, Eclipse, and metal clusters such as gold nanoparticles, and the like.

[00142] In some cases, a quencher moiety is selected from: a dark quencher, a Black Hole Quencher® (BHQ®) (e.g., BHQ-0, BHQ-1, BHQ-2, BHQ-3), a QXL quencher, an ATTO quencher (e.g., ATTO 540Q, ATTO 580Q, and ATTO 612Q), dimethylaminoazobenzenesulfonic acid (Dabsyl), Iowa Black RQ, Iowa Black FQ, IRDye QC-1, a QSY dye (e.g., QSY 7, QSY 9, QSY 21), AbsoluteQuencher, Eclipse, and a metal cluster.

[00143] Examples of an ATTO quencher include, but are not limited to: ATTO 540Q, ATTO 580Q, and ATTO 612Q. Examples of a Black Hole Quencher® (BHQ®) include, but are not limited to: BHQ-0 (493 nm), BHQ-1 (534 nm), BHQ-2 (579 nm) and BHQ-3 (672 nm).

[00144] For examples of some detectable labels (e.g., fluorescent dyes) and/or quencher moieties, see, e.g., Bao et al., Annu Rev Biomed Eng. 2009;11:25-47; as well as U.S. patent numbers 8,822,673 and 8,586,718; U.S. patent publications 20140378330, 20140349295, 20140194611, 20130323851, 20130224871, 20110223677, 20110190486, 20110172420, 20060179585 and 20030003486; and international patent applications: WO200142505 and WO200186001, all of which are hereby incorporated by reference in their entirety.

[00145] In some cases, cleavage of a labeled detector RNA can be detected by measuring a colorimetric read-out. For example, the liberation of a fluorophore (e.g., liberation from a FRET pair, liberation from a quencher/fluor pair, and the like) can result in a wavelength shift (and thus color shift) of a detectable signal. Thus, in some cases, cleavage of a subject labeled detector RNA can be detected by a color-shift. Such a shift can be expressed as a loss of an amount of signal of one color (wavelength), a gain in the amount of another color, a change in the ratio of one color to another, and the like.

Nucleic acid modifications

[00146] In some cases, a labeled detector RNA comprises one or more modifications, e.g., a base modification, a backbone modification, a sugar modification, etc., to provide the nucleic acid with a new or enhanced feature (e.g., improved stability). As is known in the art, a nucleoside is a base-sugar combination. The base portion of the nucleoside is normally a heterocyclic base. The two most common classes of such heterocyclic bases are the purines and the pyrimidines. Nucleotides are nucleosides that further include a phosphate group covalently linked to the sugar portion of the nucleoside. For those nucleosides that include a pentofuranosyl sugar, the phosphate group can be linked to the 2', the 3', or the 5' hydroxyl moiety of the sugar. In forming oligonucleotides, the phosphate groups covalently link adjacent nucleosides to one another to form a linear polymeric compound. In turn, the respective ends of this linear polymeric compound can be further joined to form a circular compound; however, linear compounds are generally suitable. In addition, linear compounds may have internal nucleotide base complementarity and may therefore fold in a manner as to produce a fully or partially double-stranded compound. Within oligonucleotides, the phosphate groups are commonly referred to as forming the internucleoside backbone of the oligonucleotide. The normal linkage or backbone of RNA and DNA is a 3' to 5' phosphodiester linkage.

Modified backbones and modified internucleoside linkages

[00147] Examples of suitable modifications include modified nucleic acid backbones and non-natural internucleoside linkages. Nucleic acids having modified backbones include those that retain a phosphorus atom in the backbone and those that do not have a phosphorus atom in the backbone.

[00148] Suitable modified oligonucleotide backbones containing a phosphorus atom therein include, for example, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3'-alkylene phosphonates, 5'-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3'-amino phosphoramidate and aminoalkylphosphoramidates, phosphorodiamidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, selenophosphates and boranophosphates having normal 3'-5' linkages, 2'-5' linked analogs of these, and those having inverted polarity wherein one or more internucleotide linkages is a 3' to 3', 5' to 5' or 2' to 2' linkage. Suitable oligonucleotides having inverted polarity comprise a single 3' to 3' linkage at the 3'-most internucleotide linkage, i.e., a single inverted nucleoside residue which may be abasic (the nucleobase is missing or has a hydroxyl group in place thereof). Various salts (such as, for example, potassium or sodium), mixed salts and free acid forms are also included.

[00149] In some cases, a labeled detector RNA comprises one or more phosphorothioate and/or heteroatom internucleoside linkages, in particular -CH₂-NH-O-CH₂-, -CH₂-N(CH₃)-O-CH₂- (known as a methylene (methylimino) or MMI backbone), -CH₂-O-N(CH₃)-CH₂-, -CH₂-N(CH₃)-N(CH₃)-CH₂- and -O-N(CH₃)-CH₂-CH₂- (wherein the native phosphodiester internucleotide linkage is represented as -O-P(=O)(OH)-O-CH₂-). MMI type internucleoside linkages are disclosed in the above referenced U.S. Pat. No. 5,489,677. Suitable amide internucleoside linkages are disclosed in U.S. Pat. No. 5,602,240.

[00150] Also suitable are nucleic acids having morpholino backbone structures as described in, e.g., U.S. Pat. No. 5,034,506. For example, in some cases, a labeled detector RNA comprises a 6-membered morpholino ring in place of a ribose ring. In some cases, a phosphorodiamidate or other non-phosphodiester internucleoside linkage replaces a phosphodiester linkage.

[00151] Suitable modified polynucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; riboacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH₂ component parts.

Mimetics

[00152] A labeled detector RNA can be a nucleic acid mimetic. The term "mimetic" as it is applied to polynucleotides is intended to include polynucleotides wherein only the furanose ring or both the furanose ring and the internucleotide linkage are replaced with non-furanose groups; replacement of only the furanose ring is also referred to in the art as being a sugar surrogate. The heterocyclic base moiety or a modified heterocyclic base moiety is maintained for hybridization with an appropriate target nucleic acid. One such nucleic acid, a polynucleotide mimetic that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA, the sugar-backbone of a polynucleotide is replaced with an amide containing backbone, in particular an aminoethylglycine backbone. The nucleotides are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone.

[00153] One polynucleotide mimetic that has been reported to have excellent hybridization properties is a peptide nucleic acid (PNA). The backbone in PNA compounds is two or more linked aminoethylglycine units which gives PNA an amide containing backbone. The heterocyclic base moieties are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone. Representative U.S. patents that describe the preparation of PNA compounds include, but are not limited to: U.S. Pat. Nos. 5,539,082; 5,714,331; and 5,719,262.

[00154] Another class of polynucleotide mimetic that has been studied is based on linked morpholino units (morpholino nucleic acid) having heterocyclic bases attached to the morpholino ring. A number of linking groups have been reported that link the morpholino monomeric units in a morpholino nucleic acid. One class of linking groups has been selected to give a non-ionic oligomeric compound. The non-ionic morpholino-based oligomeric compounds are less likely to have undesired interactions with cellular proteins. Morpholino-based polynucleotides are non-ionic mimics of oligonucleotides which are less likely to form undesired interactions with cellular proteins (Dwaine A. Braasch and David R. Corey, Biochemistry, 2002, 41(14), 4503-4510). Morpholino-based polynucleotides are disclosed in U.S. Pat. No. 5,034,506. A variety of compounds within the morpholino class of polynucleotides have been prepared, having a variety of different linking groups joining the monomeric subunits.

[00155] A further class of polynucleotide mimetic is referred to as cyclohexenyl nucleic acids (CeNA). The furanose ring normally present in a DNA/RNA molecule is replaced with a cyclohexenyl ring. CeNA DMT protected phosphoramidite monomers have been prepared and used for oligomeric compound synthesis following classical phosphoramidite chemistry. Fully modified CeNA oligomeric compounds and oligonucleotides having specific positions modified with CeNA have been prepared and studied (see Wang et al., J. Am. Chem. Soc., 2000, 122, 8595-8602). In general, the incorporation of CeNA monomers into a DNA chain increases the stability of a DNA/RNA hybrid. CeNA oligoadenylates formed complexes with RNA and DNA complements with similar stability to the native complexes. The study of incorporating CeNA structures into natural nucleic acid structures was shown by NMR and circular dichroism to proceed with easy conformational adaptation.

[00156] A further modification includes Locked Nucleic Acids (LNAs) in which the 2'-hydroxyl group is linked to the 4' carbon atom of the sugar ring, thereby forming a 2'-C,4'-C-oxymethylene linkage and thus a bicyclic sugar moiety. The linkage can be a methylene (-CH₂-)ₙ group bridging the 2' oxygen atom and the 4' carbon atom wherein n is 1 or 2 (Singh et al., Chem. Commun., 1998, 4, 455-456). LNA and LNA analogs display very high duplex thermal stabilities with complementary DNA and RNA (Tm=+3 to +10° C), stability towards 3'-exonucleolytic degradation and good solubility properties. Potent and nontoxic antisense oligonucleotides containing LNAs have been described (Wahlestedt et al., Proc. Natl. Acad. Sci. U.S.A., 2000, 97, 5633-5638).

[00157] The synthesis and preparation of the LNA monomers adenine, cytosine, guanine, 5-methyl-cytosine, thymine and uracil, along with their oligomerization, and nucleic acid recognition properties have been described (Koshkin et al., Tetrahedron, 1998, 54, 3607-3630). LNAs and preparation thereof are also described in WO 98/39352 and WO 99/14226.

Modified sugar moieties

[00158] A labeled detector RNA can also include one or more substituted sugar moieties. Suitable polynucleotides comprise a sugar substituent group selected from: OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or unsubstituted C₁ to C₁₀ alkyl or C₂ to C₁₀ alkenyl and alkynyl. Particularly suitable are O((CH₂)ₙO)ₘCH₃, O(CH₂)ₙOCH₃, O(CH₂)ₙNH₂, O(CH₂)ₙCH₃, O(CH₂)ₙONH₂, and O(CH₂)ₙON((CH₂)ₙCH₃)₂, where n and m are from 1 to about 10. Other suitable polynucleotides comprise a sugar substituent group selected from: C₁ to C₁₀ lower alkyl, substituted lower alkyl, alkenyl, alkynyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH₃, OCN, Cl, Br, CN, CF₃, OCF₃, SOCH₃, SO₂CH₃, ONO₂, NO₂, N₃, NH₂, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and other substituents having similar properties. A suitable modification includes 2'-methoxyethoxy (2'-O-CH₂CH₂OCH₃, also known as 2'-O-(2-methoxyethyl) or 2'-MOE) (Martin et al., Helv. Chim. Acta, 1995, 78, 486-504), i.e., an alkoxyalkoxy group. A further suitable modification includes 2'-dimethylaminooxyethoxy, i.e., an O(CH₂)₂ON(CH₃)₂ group, also known as 2'-DMAOE, as described in examples hereinbelow, and 2'-dimethylaminoethoxyethoxy (also known in the art as 2'-O-dimethyl-amino-ethoxy-ethyl or 2'-DMAEOE), i.e., 2'-O-CH₂-O-CH₂-N(CH₃)₂.

[00159] Other suitable sugar substituent groups include methoxy (-O-CH₃), aminopropoxy (-O-CH₂CH₂CH₂NH₂), allyl (-CH₂-CH=CH₂), -O-allyl (-O-CH₂-CH=CH₂) and fluoro (F). 2'-sugar substituent groups may be in the arabino (up) position or ribo (down) position. A suitable 2'-arabino modification is 2'-F. Similar modifications may also be made at other positions on the oligomeric compound, particularly the 3' position of the sugar on the 3' terminal nucleoside or in 2'-5' linked oligonucleotides and the 5' position of 5' terminal nucleotide. Oligomeric compounds may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar.

Base modifications and substitutions

[00160] A labeled detector RNA may also include nucleobase (often referred to in the art simply as "base") modifications or substitutions. As used herein, "unmodified" or "natural" nucleobases include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases include other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl (-C≡C-CH₃) uracil and cytosine and other alkynyl derivatives of pyrimidine bases, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 2-F-adenine, 2-aminoadenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further modified nucleobases include tricyclic pyrimidines such as phenoxazine cytidine (1H-pyrimido(5,4-b)(1,4)benzoxazin-2(3H)-one), phenothiazine cytidine (1H-pyrimido(5,4-b)(1,4)benzothiazin-2(3H)-one), G-clamps such as a substituted phenoxazine cytidine (e.g. 9-(2-aminoethoxy)-H-pyrimido(5,4-(b)(1,4)benzoxazin-2(3H)-one), carbazole cytidine (2H-pyrimido(4,5-b)indol-2-one), pyridoindole cytidine (H-pyrido(3',2':4,5)pyrrolo(2,3-d)pyrimidin-2-one).

[00161] Heterocyclic base moieties may also include those in which the purine or pyrimidine base is replaced with other heterocycles, for example 7-deaza-adenine, 7-deazaguanosine, 2-aminopyridine and 2-pyridone. Further nucleobases include those disclosed in U.S. Pat. No. 3,687,808, those disclosed in The Concise Encyclopedia Of Polymer Science And Engineering, pages 858-859, Kroschwitz, J. I., ed. John Wiley & Sons, 1990, those disclosed by Englisch et al., Angewandte Chemie, International Edition, 1991, 30, 613, and those disclosed by Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages 289-302, Crooke, S. T. and Lebleu, B., ed., CRC Press, 1993. Certain of these nucleobases are useful for increasing the binding affinity of an oligomeric compound. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2° C. (Sanghvi et al., eds., Antisense Research and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are suitable base substitutions, e.g., when combined with 2'-O-methoxyethyl sugar modifications.

Detection of two different target RNAs

[00162] As noted above, in some cases, a method of the present disclosure provides for substantially simultaneous detection of two different target RNAs (a first single-stranded target RNA and a second single-stranded target RNA) in a sample. In some cases, the method comprises: a) contacting a sample (e.g., a sample comprising the two different target RNAs and a plurality of non-target RNAs) with: (i) a first C2c2 protein that cleaves adenine⁺ RNAs (i.e., RNAs that include A, but not RNAs that lack A such as a polyU RNA) present in the sample; (ii) a second C2c2 protein that cleaves uracil⁺ RNAs (i.e., RNAs that include U, but not RNAs that lack U such as a polyA RNA); (iii) a first C2c2 guide RNA that comprises a first nucleotide sequence that hybridizes with the first single stranded target RNA and a second nucleotide sequence that binds to the first C2c2 protein; and (iv) a second C2c2 guide RNA that comprises a first nucleotide sequence that hybridizes with the second single stranded target RNA and a second nucleotide sequence that binds to the second C2c2 protein; and b) measuring a detectable signal produced by RNA cleavage mediated by the first and the second C2c2 proteins, wherein a first detectable signal is produced by the first C2c2 protein and a second detectable signal is produced by the second C2c2 protein, where the first detectable signal and the second detectable signal are distinguishable from one another. In some cases, the first C2c2 protein is not activated by the second C2c2 guide RNA, and the first C2c2 protein cleaves ssRNA that includes A (e.g., does not cleave ssRNA that lacks A); and the second C2c2 protein is not activated by the first C2c2 guide RNA, and the second C2c2 protein cleaves ssRNA that includes U (e.g., does not cleave ssRNA that lacks U). In some cases, the first C2c2 protein is not activated by the second C2c2 guide RNA, and the first C2c2 protein cleaves ssRNA that includes U (e.g., does not cleave ssRNA that lacks U); and the second C2c2 protein is not activated by the first C2c2 guide RNA, and the second C2c2 protein cleaves ssRNA that includes A (e.g., does not cleave ssRNA that lacks A).

[00163] In some cases, the method also comprises contacting the sample with: i) a first labeled detector RNA comprising a first FRET pair and/or a first quencher/fluor pair (example FRET pairs and quencher/fluor pairs are described above); and ii) a second labeled detector RNA comprising a second FRET pair and/or a second quencher/fluor pair (example FRET pairs and quencher/fluor pairs are described above). In some cases, the first labelled detector RNA comprises at least one A and does not comprise U; while the second labelled detector RNA comprises at least one U and does not comprise A. In some cases, the first labelled detector RNA comprises at least one U and does not comprise A; while the second labelled detector RNA comprises at least one A and does not comprise U. The first C2c2 protein cleaves the first labelled detector RNA, and the first detectable signal is produced by the first FRET pair and/or the first quencher/fluor pair, and the second C2c2 protein cleaves the second labelled detector RNA, and the second detectable signal is produced by the second FRET pair and/or the second quencher/fluor pair. Detection of the first detectable signal indicates the presence in the sample of the first target RNA; and detection of the second detectable signal indicates the presence in the sample of the second target RNA. In some cases, the relative amounts of detected first and second signal indicate the ratio of the first target RNA to the second target RNA in the sample.
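
As a non-limiting illustration of how two distinguishable signals might be interpreted, the following minimal sketch assumes a read-out in which signal increases upon cleavage (e.g., a quencher/fluor pair in each channel). The function name, the background readings, and the 2 fold detection threshold are hypothetical assumptions chosen for illustration; the returned ratio is a ratio of background-subtracted signals and would indicate the ratio of target RNAs only after suitable calibration.

```python
from typing import Optional, Tuple


def interpret_two_channels(
    signal_1: float,
    signal_2: float,
    background_1: float,
    background_2: float,
    min_fold: float = 2.0,  # assumed threshold, not from the disclosure
) -> Tuple[bool, bool, Optional[float]]:
    """Call each target present or absent and report a relative signal ratio.

    A channel is called positive when its signal is at least min_fold
    times its own background reading.
    """
    detected_1 = signal_1 >= min_fold * background_1
    detected_2 = signal_2 >= min_fold * background_2
    ratio = None
    if detected_1 and detected_2:
        # Ratio of background-subtracted signals (first : second).
        ratio = (signal_1 - background_1) / (signal_2 - background_2)
    return detected_1, detected_2, ratio


if __name__ == "__main__":
    # Hypothetical readings in arbitrary fluorescence units.
    print(interpret_two_channels(900.0, 300.0, 100.0, 100.0))
    print(interpret_two_channels(900.0, 120.0, 100.0, 100.0))
```

In the first hypothetical call both targets are called present and the signal ratio is 4; in the second, only the first target is called present and no ratio is reported.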

[00164] In some cases, the first labelled detector RNA comprises a label that is distinguishable from the label of the second labelled detector RNA. For example, the first labelled detector RNA can comprise a first FRET pair and/or a first quencher/fluor pair; and the second labelled detector RNA can comprise a second FRET pair and/or a second quencher/fluor pair. As one non-limiting example, the first labelled detector RNA can comprise a donor comprising tryptophan and an acceptor comprising dansyl; and the second labelled detector RNA can comprise a donor comprising IAEDANS and an acceptor comprising DDPM. As another non-limiting example, the first labelled detector RNA comprises a donor comprising dansyl and an acceptor comprising FITC; and the second labelled detector RNA comprises a donor comprising Cy3 and an acceptor comprising Cy5. In some cases, the first labelled detector RNA comprises a 5' FAM (Fluorescein) - 3' IBFQ (Iowa Black® FQ) quencher/fluor pair, and in some cases the second labelled detector RNA comprises a 5' FAM (Fluorescein) - 3' IBFQ (Iowa Black® FQ) quencher/fluor pair.

[00165] In some cases, the first and second labelled detector RNAs are added to the sample at the same time (substantially simultaneous contact). In some such cases, the signals produced by the first and second labelled detector RNAs are detected at the same time (substantially simultaneous contact), e.g., because in such cases the first and second labelled detector RNAs can be distinguishably labeled.

[00166] "Substantially simultaneous" refers to within about 5 minutes, within about 3 minutes, within about 2 minutes, within about 1 minute, within about 30 seconds, within about 15 seconds, within about 10 seconds, within about 5 seconds, or within about 1 second.

[00167] However, in some cases, the signals produced by the first and second labelled detector RNAs are not detected at the same time and are instead detected sequentially (one before the other). For example, in some cases, the first and second labelled detector RNAs are not added to the sample at the same time and are instead added sequentially (e.g., the second labelled detector RNA can be added after the first labelled detector RNA is added), and in some such cases the second labelled detector RNA is not added until after the signal produced by the first labelled detector RNA is detected. Thus, in some cases, the first and second labelled detector RNAs do not need to be distinguishably labeled (e.g., they can in some cases produce the same detectable signal, e.g., can fluoresce at the same wavelength) because the signals are to be detected sequentially.

[00168] As an illustrative example, in some cases: (i) the first and second labelled detector RNAs are not distinguishably labeled; (ii) the sample is contacted with one labelled detector RNA and the signal produced by that labelled detector RNA is detected (e.g., measured); and (iii) the sample is then contacted with the other labelled detector RNA and the signal produced by the second added labelled detector RNA is detected. Thus, when both target ssRNAs are present in the sample, addition of the second labelled detector RNA can result in a boost of signal (e.g., if the signal increases with increased cleavage, e.g., a fluor/quencher pair) or can result in a detectable decrease in signal following addition of the second labelled detector RNA (e.g., if the signal decreases with increased cleavage, e.g., a FRET pair).

[00169] The first and the second C2c2 proteins can be orthogonal to one another with respect to C2c2 guide RNA binding. In such cases, the first C2c2 protein does not bind to the second C2c2 guide RNA; and the second C2c2 protein does not bind to the first guide RNA. The first C2c2 protein and the second C2c2 protein can also differ from one another in their ssRNA cleavage preference, such that one of the C2c2 proteins cleaves ssRNA at As and the other C2c2 protein cleaves ssRNA at Us.

[00170] Guidance for orthogonal pairs of C2c2 proteins can be found in FIG. 44E.

[00171] Non-limiting examples of orthogonal pairs of C2c2 proteins suitable for use in a method of the present disclosure include those depicted below in Table 10. The cleavage preference is presented in parentheses following the name of the Cas13a protein. For example, "Lba (A)" refers to an Lba Cas13a protein, which cleaves ssRNA at A; and "Lbu (U)" refers to an Lbu Cas13a protein, which cleaves ssRNA at U.

[00172] Table 10

C2c2 protein #1     C2c2 protein #2
Lba (A)             Hhe (U)
Lba (A)             Rca (U)
Lba (A)             Ppr (U)
Lba (A)             Lne (U)
Lba (A)             Lbu (U)
Lba (A)             Lwa (U)
Lba (A)             Lsh (U)
Ere (A)             Hhe (U)
Ere (A)             Rca (U)
Ere (A)             Ppr (U)
Ere (A)             Lne (U)
Ere (A)             Lbu (U)
Ere (A)             Lwa (U)
Ere (A)             Lsh (U)
Ere (A)             Lse (U)
Cam (A)             Hhe (U)
Cam (A)             Rca (U)
Cam (A)             Ppr (U)
Cam (A)             Lne (U)
Cam (A)             Lbu (U)
Cam (A)             Lwa (U)
Cam (A)             Lsh (U)
Cam (A)             Lse (U)

[00173] The first and the second labelled detector RNAs can each independently have a length of from 2 to 100 ribonucleotides (e.g., from 2 to 80, 2 to 60, 2 to 50, 2 to 40, 2 to 30, 2 to 20, 2 to 15, or 2 to 10 ribonucleotides). The first and the second labelled detector RNAs can each independently have a length of from 2 ribonucleotides to 100 ribonucleotides, e.g., from 2 ribonucleotides to 5 ribonucleotides, from 5 ribonucleotides to 7 ribonucleotides, from 7 ribonucleotides to 10 ribonucleotides, from 10 ribonucleotides to 15 ribonucleotides, from 15 ribonucleotides to 20 ribonucleotides, from 20 ribonucleotides to 25 ribonucleotides, from 25 ribonucleotides to 30 ribonucleotides, from 30 ribonucleotides to 35 ribonucleotides, from 35 ribonucleotides to 40 ribonucleotides, from 40 ribonucleotides to 45 ribonucleotides, or from 45 ribonucleotides to 50 ribonucleotides.
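The pairings of Table 10 can be captured as a small lookup, which may be convenient when choosing a detector RNA composition for each protein. The following is a minimal sketch only: the names `CLEAVAGE_PREFERENCE`, `TABLE_10_PAIRS`, and `is_listed_pair` are hypothetical, and treating a pair as order-insensitive is an assumption not stated in the table.

```python
# Cleavage preference of each Cas13a (C2c2) protein, as given in
# parentheses in Table 10.
CLEAVAGE_PREFERENCE = {
    "Lba": "A", "Ere": "A", "Cam": "A",
    "Hhe": "U", "Rca": "U", "Ppr": "U", "Lne": "U",
    "Lbu": "U", "Lwa": "U", "Lsh": "U", "Lse": "U",
}

# Pairs as listed in Table 10 (protein #1, protein #2).
# Note that the table lists no Lba/Lse pair.
_U_PARTNERS = ("Hhe", "Rca", "Ppr", "Lne", "Lbu", "Lwa", "Lsh")
TABLE_10_PAIRS = {("Lba", partner) for partner in _U_PARTNERS} | {
    (first, partner)
    for first in ("Ere", "Cam")
    for partner in _U_PARTNERS + ("Lse",)
}


def is_listed_pair(protein_a: str, protein_b: str) -> bool:
    """Return True if the two proteins appear together in Table 10.

    Assumption: the order of the two proteins does not matter.
    """
    return (
        (protein_a, protein_b) in TABLE_10_PAIRS
        or (protein_b, protein_a) in TABLE_10_PAIRS
    )
```

For instance, `is_listed_pair("Lba", "Lbu")` is true, and `CLEAVAGE_PREFERENCE["Lbu"]` gives the base ("U") that a matching detector RNA would need to contain.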

[00174] In some cases, the first labelled detector RNA comprises at least one A (e.g., at least 2, at least 3, or at least 4 As) and lacks U; and the second labelled detector RNA comprises at least one U (e.g., at least 2, at least 3, or at least 4 Us) and lacks A.

[00175] In some cases, the first labelled detector RNA lacks U and includes a stretch of from 2 to 15 consecutive As (e.g., from 2 to 12, 2 to 10, 2 to 8, 2 to 6, 2 to 4, 3 to 15, 3 to 12, 3 to 10, 3 to 8, 3 to 6, 3 to 5, 4 to 15, 4 to 12, 4 to 10, 4 to 8, or 4 to 6 consecutive As). In some cases, the first labelled detector RNA lacks U and includes a stretch of at least 2 consecutive As (e.g., at least 3, at least 4, or at least 5 consecutive As). In some cases, the second labelled detector RNA lacks A and includes a stretch of from 2 to 15 consecutive Us (e.g., from 2 to 12, 2 to 10, 2 to 8, 2 to 6, 2 to 4, 3 to 15, 3 to 12, 3 to 10, 3 to 8, 3 to 6, 3 to 5, 4 to 15, 4 to 12, 4 to 10, 4 to 8, or 4 to 6 consecutive Us). In some cases, the second labelled detector RNA lacks A and includes a stretch of at least 2 consecutive Us (e.g., at least 3, at least 4, or at least 5 consecutive Us).

[00176] In some cases, the first labelled detector RNA lacks A and includes a stretch of from 2 to 15 consecutive Us (e.g., from 2 to 12, 2 to 10, 2 to 8, 2 to 6, 2 to 4, 3 to 15, 3 to 12, 3 to 10, 3 to 8, 3 to 6, 3 to 5, 4 to 15, 4 to 12, 4 to 10, 4 to 8, or 4 to 6 consecutive Us). In some cases, the first labelled detector RNA lacks A and includes a stretch of at least 2 consecutive Us (e.g., at least 3, at least 4, or at least 5 consecutive Us). In some cases, the second labelled detector RNA lacks U and includes a stretch of from 2 to 15 consecutive As (e.g., from 2 to 12, 2 to 10, 2 to 8, 2 to 6, 2 to 4, 3 to 15, 3 to 12, 3 to 10, 3 to 8, 3 to 6, 3 to 5, 4 to 15, 4 to 12, 4 to 10, 4 to 8, or 4 to 6 consecutive As). In some cases, the second labelled detector RNA lacks U and includes a stretch of at least 2 consecutive As (e.g., at least 3, at least 4, or at least 5 consecutive As).

[00177] In some cases, the first labelled detector RNA comprises at least one U and lacks A; and the second labelled detector RNA comprises at least one A and lacks U.

[00178] In some cases, the first labelled detector RNA comprises at least one A and lacks U. For example, in some cases, the first labelled detector RNA is a homoadenosine polymer (a polyA RNA). As another example, the first labelled detector RNA: i) comprises at least one A; ii) lacks U; and iii) comprises one or more Cs and/or Gs. In some cases, the second labelled detector RNA comprises at least one U and lacks A. For example, in some cases, the second labelled detector RNA is a homouridine polymer (a polyU RNA). As another example, the second labelled detector RNA: i) comprises at least one U; ii) lacks A; and iii) comprises one or more Cs and/or Gs.

[00179] In some cases, the first labelled detector RNA comprises at least one U and lacks A. For example, in some cases, the first labelled detector RNA is a homouridine polymer (polyU RNA). As another example, the first labelled detector RNA: i) comprises at least one U; ii) lacks A; and iii) comprises one or more Cs and/or Gs. In some cases, the second labelled detector RNA comprises at least one A and lacks U. For example, in some cases, the second labelled detector RNA is a homoadenosine polymer (polyA RNA). As another example, the second labelled detector RNA: i) comprises at least one A; ii) lacks U; and iii) comprises one or more Cs and/or Gs.
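The composition rules for the two labelled detector RNAs (a length of 2 to 100 ribonucleotides, at least one of the cleaved base, none of the other base, and optionally a minimum run of consecutive cleaved bases) can be checked mechanically. The following is a minimal sketch under those stated rules; the function names `longest_run` and `fits_detector` and the `min_run` parameter are hypothetical and are not part of the disclosure.

```python
import re

ALLOWED_BASES = set("ACGU")


def longest_run(seq: str, base: str) -> int:
    """Length of the longest stretch of consecutive `base` in `seq`."""
    runs = re.findall(f"{base}+", seq)
    return max((len(run) for run in runs), default=0)


def fits_detector(seq: str, cleaved_base: str, excluded_base: str,
                  min_run: int = 1) -> bool:
    """Check a candidate detector RNA against the composition rules.

    cleaved_base  -- base the paired protein cleaves at ("A" or "U")
    excluded_base -- base the detector must lack ("U" or "A")
    min_run       -- required number of consecutive cleaved bases
                     (1 means "at least one" of the cleaved base)
    """
    seq = seq.upper()
    return (
        2 <= len(seq) <= 100               # length window of 2 to 100 nt
        and set(seq) <= ALLOWED_BASES      # ribonucleotides only
        and excluded_base not in seq       # e.g. an A detector lacks U
        and longest_run(seq, cleaved_base) >= min_run
    )
```

A polyA detector such as `"AAAAA"` passes with `cleaved_base="A"` and `excluded_base="U"`, while any sequence containing a U fails that check; swapping the two arguments tests a U-type detector instead.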

[00180] As noted above, a method of the present disclosure can comprise contacting a sample with: a first C2c2 protein; a second C2c2 protein; a first C2c2 guide RNA that comprises a first nucleotide sequence that hybridizes with the first single stranded target RNA and a second nucleotide sequence (also referred to herein as a 'constant region' or 'handle') that binds to the first C2c2 protein; and a second C2c2 guide RNA that comprises a first nucleotide sequence that hybridizes with the second single stranded target RNA and a second nucleotide sequence (a handle) that binds to the second C2c2 protein.

[00181] For example, in some cases, the first C2c2 protein is a Cas13a polypeptide comprising an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the first C2c2 guide RNA comprises a constant region (a 'handle' - a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence AGAUAGCCCAAGAAAGAGGGCAAUAAC (SEQ ID NO: 16), where the crRNA has a length of about 25 nt, 26 nt, 27 nt, 28 nt, 29 nt, or 30 nt. In some cases, the crRNA has the nucleotide sequence AGAUAGCCCAAGAAAGAGGGCAAUAAC (SEQ ID NO: 16); and has a length of 27 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Hhe Cas13a amino acid sequence depicted in FIG. 56K; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GUAACAAUCCCCGUAGACAGGGGAACUGCAAC (SEQ ID NO: 17). In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rca Cas13a amino acid sequence depicted in FIG. 56G; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence CAUCACCGCCAAGACGACGGCGGACUGAACC (SEQ ID NO: 18). In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ppr Cas13a amino acid sequence depicted in FIG. 56B; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence AAUUAUCCCAAAAUUGAAGGGAACUACAAC (SEQ ID NO: 19); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second C2c2 guide RNA comprises a handle comprising the nucleotide sequence AAUUAUCCCAAAAUUGAAGGGAACUACAAC (SEQ ID NO: 19); and the handle has a length of 30 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lne Cas13a amino acid sequence depicted in FIG. 56I; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GAGUACCUCAAAACAAAAGAGGACUAAAAC (SEQ ID NO: 20) (e.g., comprising a nucleotide sequence having only 1 nt, 2 nt, 3 nt, 4 nt, or 5 nt, differences from the nucleotide sequence GAGUACCUCAAAACAAAAGAGGACUAAAAC (SEQ ID NO: 20)); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second guide RNA comprises a handle comprising the nucleotide sequence GAGUACCUCAAAACAAAAGAGGACUAAAAC (SEQ ID NO: 20); where the handle has a length of 30 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lbu Cas13a amino acid sequence depicted in FIG. 56C; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GACCACCCCAAAAAUGAAGGGGACUAAAACA (SEQ ID NO: 9) (e.g., comprising a nucleotide sequence having only 1 nt, 2 nt, 3 nt, 4 nt, or 5 nt, differences from the nucleotide sequence GACCACCCCAAAAAUGAAGGGGACUAAAACA (SEQ ID NO: 9)); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second guide RNA comprises a handle comprising the nucleotide sequence GACCACCCCAAAAAUGAAGGGGACUAAAACA (SEQ ID NO: 9); where the handle has a length of 31 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lwa Cas13a amino acid sequence depicted in FIG. 56E; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GACCACCCCAAUAUCGAAGGGGACUAAAACUU (SEQ ID NO: 21) (e.g., comprising a nucleotide sequence having only 1 nt, 2 nt, 3 nt, 4 nt, or 5 nt, differences from the nucleotide sequence GACCACCCCAAUAUCGAAGGGGACUAAAACUU (SEQ ID NO: 21)); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second guide RNA comprises a handle comprising the nucleotide sequence GACCACCCCAAUAUCGAAGGGGACUAAAACUU (SEQ ID NO: 21); where the handle has a length of 32 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh Cas13a amino acid sequence depicted in FIG. 56D; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence CACCCCAAUAUCGAAGGGGACUAAAAC (SEQ ID NO: 22) (e.g., comprising a nucleotide sequence having only 1 nt, 2 nt, 3 nt, 4 nt, or 5 nt, differences from the nucleotide sequence CACCCCAAUAUCGAAGGGGACUAAAAC (SEQ ID NO: 22)); where the handle has a length of 25 nt, 26 nt, 27 nt, 28 nt, or 29 nt. In some cases, the second guide RNA comprises a handle comprising the nucleotide sequence CACCCCAAUAUCGAAGGGGACUAAAAC (SEQ ID NO: 22); where the handle has a length of 27 nt.
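The "no more than 5 nt differences" criterion for a guide RNA handle can be illustrated with a simple comparison against a reference handle. The sketch below is an illustration only: it assumes that a "difference" is a single-nucleotide substitution, insertion, or deletion (i.e., it counts edit distance), which the text does not specify, and the names `edit_distance`, `within_handle_tolerance`, and `LBA_HANDLE` are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum substitutions, insertions, deletions."""
    prev = list(range(len(b) + 1))
    for i, base_a in enumerate(a, start=1):
        curr = [i]
        for j, base_b in enumerate(b, start=1):
            cost = 0 if base_a == base_b else 1
            curr.append(min(
                prev[j] + 1,         # deletion from a
                curr[j - 1] + 1,     # insertion into a
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


# Reference handle for the Lba protein (SEQ ID NO: 16), 27 nt.
LBA_HANDLE = "AGAUAGCCCAAGAAAGAGGGCAAUAAC"


def within_handle_tolerance(candidate: str, reference: str,
                            max_differences: int = 5) -> bool:
    """True if candidate differs from reference by at most max_differences.

    Assumption: differences are counted as edit distance.
    """
    return edit_distance(candidate.upper(), reference.upper()) <= max_differences
```

Under this assumption, a candidate handle that is one nucleotide shorter than the reference counts as one difference, which is consistent with the text allowing handle lengths slightly above or below the reference length.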

[00182] As another example, in some cases, the first C2c2 protein is a Cas13a polypeptide comprising an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the first C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence AAGUAGCCCGAUAUAGAGGGCAAUAAC (SEQ ID NO: 23), where the handle has a length of about 25 nt, 26 nt, 27 nt, 28 nt, 29 nt, or 30 nt. In some cases, the handle has the nucleotide sequence AAGUAGCCCGAUAUAGAGGGCAAUAAC (SEQ ID NO: 23); and has a length of 27 nt. In some cases, the first C2c2 protein is a Cas13a polypeptide comprising an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the first C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence AUACAGCUCGAUAUAGUGAGCAAUAAG (SEQ ID NO: 24), where the handle has a length of about 25 nt, 26 nt, 27 nt, 28 nt, 29 nt, or 30 nt. In some cases, the handle has the nucleotide sequence AUACAGCUCGAUAUAGUGAGCAAUAAG (SEQ ID NO: 24); and has a length of 27 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Hhe Cas13a amino acid sequence depicted in FIG. 56K; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GUAACAAUCCCCGUAGACAGGGGAACUGCAAC (SEQ ID NO: 17); where the handle has a length of about 30 nt, 31 nt, 32 nt, 33 nt, 34 nt, or 35 nt. In some cases, the second C2c2 guide RNA comprises a handle comprising the nucleotide sequence GUAACAAUCCCCGUAGACAGGGGAACUGCAAC (SEQ ID NO: 17); and the handle has a length of 32 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rca Cas13a amino acid sequence depicted in FIG. 56G; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence UCACAUCACCGCCAAGACGACGGCGGACUGAACC (SEQ ID NO: 25); where the handle has a length of about 32 nt, 33 nt, 34 nt, 35 nt, 36 nt, or 37 nt. In some cases, the second C2c2 guide RNA comprises a handle comprising the nucleotide sequence UCACAUCACCGCCAAGACGACGGCGGACUGAACC (SEQ ID NO: 25); and the handle has a length of 34 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ppr Cas13a amino acid sequence depicted in FIG. 56B; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence AAUUAUCCCAAAAUUGAAGGGAACUACAAC (SEQ ID NO: 19); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second C2c2 guide RNA comprises a handle comprising the nucleotide sequence AAUUAUCCCAAAAUUGAAGGGAACUACAAC (SEQ ID NO: 19); and the handle has a length of 30 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lne Cas13a amino acid sequence depicted in FIG. 56I; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GAGUACCUCAAAACAAAAGAGGACUAAAAC (SEQ ID NO: 20) (e.g., comprising a nucleotide sequence having only 1 nt, 2 nt, 3 nt, 4 nt, or 5 nt, differences from the nucleotide sequence GAGUACCUCAAAACAAAAGAGGACUAAAAC (SEQ ID NO: 20)); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second guide RNA comprises a handle comprising the nucleotide sequence GAGUACCUCAAAACAAAAGAGGACUAAAAC (SEQ ID NO: 20); where the handle has a length of 30 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lbu Cas13a amino acid sequence depicted in FIG. 56C; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GACCACCCCAAAAAUGAAGGGGACUAAAACA (SEQ ID NO: 9) (e.g., comprising a nucleotide sequence having only 1 nt, 2 nt, 3 nt, 4 nt, or 5 nt, differences from the nucleotide sequence GACCACCCCAAAAAUGAAGGGGACUAAAACA (SEQ ID NO: 9)); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second guide RNA comprises a handle comprising the nucleotide sequence GACCACCCCAAAAAUGAAGGGGACUAAAACA (SEQ ID NO: 9); where the handle has a length of 31 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lwa Cas13a amino acid sequence depicted in FIG. 56E; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GACCACCCCAAUAUCGAAGGGGACUAAAACUU (SEQ ID NO: 21) (e.g., comprising a nucleotide sequence having only 1 nt, 2 nt, 3 nt, 4 nt, or 5 nt, differences from the nucleotide sequence GACCACCCCAAUAUCGAAGGGGACUAAAACUU (SEQ ID NO: 21)); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second guide RNA comprises a handle comprising the nucleotide sequence GACCACCCCAAUAUCGAAGGGGACUAAAACUU (SEQ ID NO: 21); where the handle has a length of 32 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh Cas13a amino acid sequence depicted in FIG. 56D; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence CACCCCAAUAUCGAAGGGGACUAAAAC (SEQ ID NO: 22) (e.g., comprising a nucleotide sequence having only 1 nt, 2 nt, 3 nt, 4 nt, or 5 nt, differences from the nucleotide sequence CACCCCAAUAUCGAAGGGGACUAAAAC (SEQ ID NO: 22)); where the handle has a length of 25 nt, 26 nt, 27 nt, 28 nt, or 29 nt. In some cases, the second guide RNA comprises a handle comprising the nucleotide sequence CACCCCAAUAUCGAAGGGGACUAAAAC (SEQ ID NO: 22); where the handle has a length of 27 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lse Cas13a amino acid sequence depicted in FIG. 56A; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GACUACCUCUAUAUGAAAGAGGACUAAAAC (SEQ ID NO: 7); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second C2c2 guide RNA comprises a handle comprising the nucleotide sequence GACUACCUCUAUAUGAAAGAGGACUAAAAC (SEQ ID NO: 7); and the handle has a length of 30 nt.

As another example, in some cases, the first C2c2 protein is a Casl3a polypeptide comprising an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Casl3a amino acid sequence depicted in FIG. 56H; and the first C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Casl3a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence

GAACAGCCCGAUAUAGAGGGCAAUAGAC (SEQ ID NO: 26), where the handle has a length of about 26 nt, 27 nt, 28 nt, 29 nt, or 30 nt. In some cases, the handle has the nucleotide sequence GAACAGCCCGAUAUAGAGGGCAAUAGAC (SEQ ID NO: 26); and has a length of 28 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Hhe Casl3a amino acid sequence depicted in FIG. 56K; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Casl3a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GUAACAAUCCCCGUAGACAGGGGAACUGCAAC (SEQ ID NO: 17); where the handle has a length of about 30 nt, 31 nt, 32 nt, 33 nt, 34 nt, or 35 nt. In some cases, the second C2c2 guide RNA comprises a handle comprising the nucleotide sequence GUAACAAUCCCCGUAGACAGGGGAACUGCAAC (SEQ ID NO: 17); and the handle has a length of 32 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rca Casl3a amino acid sequence depicted in FIG. 56G; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Casl3a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence

UCACAUCACCGCCAAGACGACGGCGGACUGAACC (SEQ ID NO: 25); where the handle has a length of about 32 nt, 33 nt, 34 nt, 35 nt, 36 nt, or 37 nt. In some cases, the second C2c2 guide RNA comprises a handle comprising the nucleotide sequence

UCACAUCACCGCCAAGACGACGGCGGACUGAACC (SEQ ID NO: 25); and the handle has a length of 34 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ppr Casl3a amino acid sequence depicted in FIG. 56B; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Casl3a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence AAUUAUCCCAAAAUUGAAGGGAACUACAAC (SEQ ID NO: 19); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second C2c2 guide RNA comprises a handle comprising the nucleotide sequence AAUUAUCCCAAAAUUGAAGGGAACUACAAC (SEQ ID NO: 19); and the handle has a length of 30 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lne Casl3a amino acid sequence depicted in FIG. 561; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Casl3a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GAGUACCUCAAAACAAAAGAGGACUAAAAC (SEQ ID NO: 20) (e.g., comprising a nucleotide sequence having only 1 nt, 2 nt, 3 nt, 4 nt, or 5 nt, differences from the nucleotide sequence

GAGUACCUCAAAACAAAAGAGGACUAAAAC (SEQ ID NO: 20)); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second guide RNA comprises a handle comprising the nucleotide sequence

GAGUACCUCAAAACAAAAGAGGACUAAAAC (SEQ ID NO: 20); where the handle has a length of 30 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lbu Casl3a amino acid sequence depicted in FIG. 56C; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Casl3a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GACCACCCCAAAAAUGAAGGGGACUAAAACA (SEQ ID NO: 9) (e.g., comprising a nucleotide sequence having only 1 nt, 2 nt, 3 nt, 4 nt, or 5 nt, differences from the nucleotide sequence GACCACCCCAAAAAUGAAGGGGACUAAAACA (SEQ ID NO: 9)); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second guide RNA comprises a handle comprising the nucleotide sequence GACCACCCCAAAAAUGAAGGGGACUAAAACA (SEQ ID NO: 9); where the handle has a length of 31 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lwa Casl3a amino acid sequence depicted in FIG. 56E; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Casl3a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence

GACCACCCCAAUAUCGAAGGGGACUAAAACUU (SEQ ID NO: 21) (e.g., comprising a nucleotide sequence having only 1 nt, 2 nt, 3 nt, 4 nt, or 5 nt, differences from the nucleotide sequence GACCACCCCAAUAUCGAAGGGGACUAAAACUU (SEQ ID NO: 21)); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second guide RNA comprises a handle comprising the nucleotide sequence GACCACCCCAAUAUCGAAGGGGACUAAAACUU (SEQ ID NO: 21); where the handle has a length of 32 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh Cas13a amino acid sequence depicted in FIG. 56D; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence CACCCCAAUAUCGAAGGGGACUAAAAC (SEQ ID NO: 22) (e.g., comprising a nucleotide sequence having only 1 nt, 2 nt, 3 nt, 4 nt, or 5 nt, differences from the nucleotide sequence CACCCCAAUAUCGAAGGGGACUAAAAC (SEQ ID NO: 22)); where the handle has a length of 25 nt, 26 nt, 27 nt, 28 nt, or 29 nt. In some cases, the second guide RNA comprises a handle comprising the nucleotide sequence CACCCCAAUAUCGAAGGGGACUAAAAC (SEQ ID NO: 22); where the handle has a length of 27 nt. In some cases, the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lse Cas13a amino acid sequence depicted in FIG. 56A; and the second C2c2 guide RNA comprises a handle (a stretch of nucleotides that binds to the Cas13a polypeptide) comprising a nucleotide sequence having no more than 1 nucleotide (nt), no more than 2 nt, no more than 3 nt, no more than 4 nt, or no more than 5 nt differences from the nucleotide sequence GACUACCUCUAUAUGAAAGAGGACUAAAAC (SEQ ID NO: 7); where the handle has a length of about 28 nt, 29 nt, 30 nt, 31 nt, or 32 nt. In some cases, the second C2c2 guide RNA comprises a handle comprising the nucleotide sequence GACUACCUCUAUAUGAAAGAGGACUAAAAC (SEQ ID NO: 7); and the handle has a length of 30 nt.
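By way of non-limiting illustration, the handle criteria recited above (a bounded number of nucleotide differences from a reference handle, together with a permitted length range) can be sketched in Python. The sketch assumes that "differences" are counted as edit distance (substitutions, insertions, and deletions); the disclosure does not prescribe a counting method. The function names and default parameters are hypothetical and chosen only for this example.

```python
# Reference handle taken from the text (SEQ ID NO: 7, 30 nt).
LSE_HANDLE = "GACUACCUCUAUAUGAAAGAGGACUAAAAC"


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of substitutions,
    insertions, and deletions needed to turn a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion from a
                            curr[j - 1] + 1,      # insertion into a
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def handle_matches(candidate: str, reference: str,
                   max_differences: int = 5,
                   min_len: int = 28, max_len: int = 32) -> bool:
    """Assumed reading of the criteria: length within [min_len, max_len]
    and at most max_differences edits away from the reference handle."""
    if not min_len <= len(candidate) <= max_len:
        return False
    return edit_distance(candidate, reference) <= max_differences
```

A caller would pass a different length range for other reference handles, e.g., 25 nt to 29 nt for the handle of SEQ ID NO: 22.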

Multiplexing

[00184] As noted above, in some cases, a method of the present disclosure comprises: a) contacting a sample (e.g., a sample comprising a target RNA and a plurality of non-target RNAs) with: (i) a precursor C2c2 guide RNA array comprising two or more C2c2 guide RNAs each of which has a different guide sequence; and (ii) a C2c2 protein that cleaves the precursor C2c2 guide RNA array into individual C2c2 guide RNAs, and also cleaves RNAs of the sample; and b) measuring a detectable signal produced by C2c2 protein-mediated RNA cleavage.

[00185] In some cases, two or more C2c2 guide RNAs can be present on an array (a precursor C2c2 guide RNA array). A C2c2 protein can cleave the precursor C2c2 guide RNA array into individual C2c2 guide RNAs (e.g., see Fig. 4 and Fig. 6).

[00186] In some cases, a subject C2c2 guide RNA array includes 2 or more C2c2 guide RNAs (e.g., 3 or more, 4 or more, 5 or more, 6 or more, or 7 or more, C2c2 guide RNAs). The C2c2 guide RNAs of a given array can target (i.e., can include guide sequences that hybridize to) different target sites of the same target RNA (e.g., which can increase sensitivity of detection) and/or can target different target RNA molecules (e.g., a family of transcripts, e.g., based on variation such as single-nucleotide polymorphisms (SNPs), etc., and such could be used, for example, to detect multiple strains of a virus such as influenza virus variants, Zika virus variants, HIV variants, and the like).

C2c2 protein

[00187] A C2c2 protein binds to a C2c2 guide RNA, is guided to a single stranded target RNA by the guide RNA (which hybridizes to the target RNA), and is thereby 'activated.' If the HEPN1 and HEPN2 domains of the C2c2 protein are intact, once activated, the C2c2 protein cleaves the target RNA, but also cleaves non-target RNAs.

[00188] Example naturally existing C2c2 proteins are depicted in Fig. 8 and are set forth as SEQ ID NOs: 1-6. In some cases, a subject C2c2 protein includes an amino acid sequence having 80% or more (e.g., 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100%) amino acid sequence identity with the amino acid sequence set forth in any one of SEQ ID NOs: 1-6. In some cases, a suitable C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Listeria seeligeri C2c2 amino acid sequence set forth in SEQ ID NO: 1. In some cases, a suitable C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Leptotrichia buccalis C2c2 amino acid sequence set forth in SEQ ID NO:2. In some cases, a suitable C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rhodobacter capsulatus C2c2 amino acid sequence set forth in SEQ ID NO:4. In some cases, a suitable C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Carnobacterium gallinarum C2c2 amino acid sequence set forth in SEQ ID NO: 5. In some cases, a suitable C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Herbinix hemicellulosilytica C2c2 amino acid sequence set forth in SEQ ID NO:6. In some cases, the C2c2 protein includes an amino acid sequence having 80% or more amino acid sequence identity with the Leptotrichia buccalis (Lbu) C2c2 amino acid sequence set forth in SEQ ID NO: 2. In some cases, the C2c2 protein is a Leptotrichia buccalis (Lbu) C2c2 protein (e.g., see SEQ ID NO: 2). In some cases, the C2c2 protein includes the amino acid sequence set forth in any one of SEQ ID NOs: 1-2 and 4-6.
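As a non-limiting illustration of how a percent amino acid sequence identity threshold of the kind recited above might be checked, the following Python sketch computes identity over two sequences that have already been aligned (gaps written as "-"). Conventions for the denominator differ among alignment tools; this sketch assumes identity is the number of identical residues divided by the number of aligned columns, excluding columns that are gaps in both sequences. That convention, and the function names, are assumptions made for this example and are not specified by the disclosure.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumed convention: identical residues / aligned columns,
    skipping columns where both sequences carry a gap ('-').
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    compared = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == "-" and y == "-":
            continue  # column carries no information
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 80.0) -> bool:
    """True when identity is at least the threshold (e.g., 80%)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

The alignment itself would be produced by any convenient alignment program; the sketch covers only the final arithmetic.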

[00189] In some cases, a C2c2 protein used in a method of the present disclosure is not a Leptotrichia shahii (Lsh) C2c2 protein. In some cases, a C2c2 protein used in a method of the present disclosure is not a C2c2 polypeptide having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh C2c2 polypeptide set forth in SEQ ID NO:3.

[00190] In some cases, the C2c2 protein is more efficient, by a factor of 1.2-fold or more, than a Leptotrichia shahii (Lsh) C2c2 protein at cleaving RNA that is not targeted by a C2c2 guide RNA of the method. In some cases, the C2c2 protein is more efficient, by a factor of 1.5-fold or more, than a Leptotrichia shahii (Lsh) C2c2 protein at cleaving RNA that is not targeted by a C2c2 guide RNA of the method. In some cases, the C2c2 polypeptide used in a method of the present disclosure, when activated, cleaves non-target RNA at least 1.2-fold, at least 1.5-fold, at least 2-fold, at least 2.5-fold, at least 3-fold, at least 4-fold, at least 5-fold, at least 6-fold, at least 7-fold, at least 8-fold, at least 9-fold, at least 10-fold, at least 15-fold, at least 20-fold, at least 30-fold, or more than 30-fold, more efficiently than Lsh C2c2.

[00191] In some cases, the C2c2 protein exhibits at least a 50% RNA cleavage efficiency within 1 hour of said contacting (e.g., 55% or more, 60% or more, 65% or more, 70% or more, or 75% or more cleavage efficiency). In some cases, the C2c2 protein exhibits at least a 50% RNA cleavage efficiency within 40 minutes of said contacting (e.g., 55% or more, 60% or more, 65% or more, 70% or more, or 75% or more cleavage efficiency). In some cases, the C2c2 protein exhibits at least a 50% RNA cleavage efficiency within 30 minutes of said contacting (e.g., 55% or more, 60% or more, 65% or more, 70% or more, or 75% or more cleavage efficiency).

[00192] In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 30 seconds to 60 minutes, e.g., from 1 minute to 60 minutes, from 30 seconds to 5 minutes, from 1 minute to 5 minutes, from 1 minute to 10 minutes, from 5 minutes to 10 minutes, from 10 minutes to 15 minutes, from 15 minutes to 20 minutes, from 20 minutes to 25 minutes, from 25 minutes to 30 minutes, from 30 minutes to 35 minutes, from 35 minutes to 40 minutes, from 40 minutes to 45 minutes, from 45 minutes to 50 minutes, from 50 minutes to 55 minutes, or from 55 minutes to 60 minutes. In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 30 seconds to 5 minutes (e.g., from 1 minute to 5 minutes, e.g., in a time period of 1 minute, 2 minutes, 3 minutes, 4 minutes, or 5 minutes). In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 5 minutes to 10 minutes (e.g., in a time period of 5 minutes, 6 minutes, 7 minutes, 8 minutes, 9 minutes, or 10 minutes). In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 10 minutes to 15 minutes (e.g., 10 minutes, 11 minutes, 12 minutes, 13 minutes, 14 minutes, or 15 minutes). 
In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 15 minutes to 20 minutes. In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 20 minutes to 25 minutes. In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 25 minutes to 30 minutes. In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 30 minutes to 35 minutes. In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 35 minutes to 40 minutes. In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 40 minutes to 45 minutes. 
In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 45 minutes to 50 minutes. In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 50 minutes to 55 minutes. In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 55 minutes to 60 minutes. In some cases, a C2c2 protein suitable for use in a method of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of less than 1 minute, e.g., in a time period of from 50 seconds to 59 seconds, from 40 seconds to 49 seconds, from 30 seconds to 39 seconds, or from 20 seconds to 29 seconds. In some cases, the cleavage takes place under physiological conditions. In some cases, the cleavage takes place at a temperature of from 15°C to 20°C, from 20°C to 25°C, from 25°C to 30°C, from 30°C to 35°C, or from 35°C to 40°C. In some cases, the cleavage takes place at about 37°C. In some cases, the cleavage takes place at about 37°C and the reaction conditions include divalent metal ions. In some cases, the divalent metal ion is Mg²⁺. In some cases, the divalent metal ion is Mn²⁺. In some cases the pH of the reaction conditions is between pH 5 and pH 6.
In some cases the pH of the reaction conditions is between pH 6 and pH 7. In some cases the pH of the reaction conditions is between pH 6.5 and pH 7.5. In some cases the pH of reaction conditions is above pH 7.5.

[00193] The term "cleavage efficiency" is used herein to refer to the ability of the C2c2 protein to rapidly cleave RNA in a sample once the C2c2 protein has been activated by an appropriate C2c2 guide RNA/target RNA hybridization. "Cleavage efficiency" refers to the amount of RNA the protein can cleave within a given period of time. For example, 50% cleavage efficiency would indicate that 50% of a given RNA is cleaved within a specified period of time. For example, if an RNA is present in a sample at a starting concentration of 100 μM, 50% cleavage has been achieved when 50 μM of the RNA has been cleaved. As another example, if a plurality of RNA molecules is present in the sample, 50% cleavage has been achieved when 50% of the RNA molecules have been cleaved; efficiency is an expression of the amount of time that is required for a certain percent of the total RNA to be cleaved. This can be measured by any convenient method and many such methods will be known to one of ordinary skill in the art. For example, a labeled detector RNA can be used. In some cases, the RNA species (cleaved versus uncleaved) can be separated on a gel and the amount of cleaved RNA can be compared to the amount of uncleaved RNA, e.g., see Fig. 3.
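The arithmetic behind "cleavage efficiency" as defined above can be sketched, by way of non-limiting example, in Python. The first function mirrors the concentration example in the text (50 μM cleaved out of a 100 μM starting concentration is 50%); the second assumes gel band intensities for cleaved and uncleaved species are available as numbers; the third reports the first sampled time point at which a chosen percent cleavage is reached. All function names are hypothetical, and the time-course helper performs no interpolation between samples, which is a simplifying assumption.

```python
from typing import Optional, Sequence


def percent_cleaved(initial_conc: float, cleaved_conc: float) -> float:
    """Percent of the starting RNA that has been cleaved (same units for both)."""
    if initial_conc <= 0:
        raise ValueError("initial concentration must be positive")
    if not 0 <= cleaved_conc <= initial_conc:
        raise ValueError("cleaved amount must lie between 0 and the initial amount")
    return 100.0 * cleaved_conc / initial_conc


def percent_cleaved_from_bands(cleaved_signal: float, uncleaved_signal: float) -> float:
    """Percent cleaved from band intensities of cleaved vs. uncleaved RNA on a gel."""
    total = cleaved_signal + uncleaved_signal
    if cleaved_signal < 0 or uncleaved_signal < 0 or total <= 0:
        raise ValueError("band intensities must be non-negative with a positive total")
    return 100.0 * cleaved_signal / total


def time_to_percent(timepoints: Sequence[float],
                    percent_values: Sequence[float],
                    threshold: float = 50.0) -> Optional[float]:
    """First sampled time at which percent cleaved reaches the threshold, else None."""
    for t, p in zip(timepoints, percent_values):
        if p >= threshold:
            return t
    return None
```

Under these assumptions, a protein would meet an "at least 50% within 30 minutes" criterion when `time_to_percent` returns a value of 30 or less for time points expressed in minutes.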

[00194] When the phrase "wherein the C2c2 protein cleaves at least X% of the RNAs present in the sample" (e.g., within a specified time period) is used, it is meant that X% of the 'signal-producing' RNAs present in the sample is cleaved within the specified time period. Which RNAs are 'signal-producing' RNAs can depend on the detection method used. For example, when a labeled detector RNA is used, the labeled detector RNA might be the only 'signal-producing RNA.' However, the labeled detector RNA is used to represent the RNAs of the sample and thus, what one observes for the labeled detector RNA is assumed to be representative of what is happening to the non-target RNAs of the sample. As such, when 50% of the labeled detector RNA is cleaved, this will generally be assumed to represent when 50% of the 'RNAs present in the sample' are cleaved. In some cases, RNA cleavage in general is being measured and as such, all cleavable RNAs of the sample are 'signal-producing RNAs'. Thus, when referring to the % of RNAs present in the sample being cleaved, this value can be measured using any convenient method, and whatever the method being used, the value is generally meant herein to mean when the enzyme has cleaved half of the cleavable targets in the sample.

[00195] In some cases, the C2c2 protein is not a Leptotrichia shahii (Lsh) C2c2 protein. In some cases, the C2c2 protein is more efficient than a Leptotrichia shahii (Lsh) C2c2 protein (e.g., at cleaving non-target RNA) by a factor of 1.2-fold or more (e.g., 1.5-fold or more, 1.7-fold or more, or 2-fold or more). As such, in some cases, a subject C2c2 protein is more efficient, by a factor of 1.2-fold or more (e.g., 1.5-fold or more, 1.7-fold or more, or 2-fold or more), than a Leptotrichia shahii (Lsh) C2c2 protein at cleaving RNA that is not targeted by the C2c2 guide RNA of the method. In some cases, the C2c2 polypeptide used in a method of the present disclosure, when activated, cleaves non-target RNA at least 1.2-fold, at least 1.5-fold, at least 2-fold, at least 2.5-fold, at least 3-fold, at least 4-fold, at least 5-fold, at least 6-fold, at least 7-fold, at least 8-fold, at least 9-fold, at least 10-fold, at least 15-fold, at least 20-fold, at least 30-fold, or more than 30-fold, more efficiently than Lsh C2c2.

Variant C2c2 polypeptides

[00196] Variant C2c2 polypeptides include variants of any one of SEQ ID NOs: 1, 2, and 4-6, where the variant C2c2 polypeptide exhibits reduced (or undetectable) nuclease activity. For example, in some cases, a variant C2c2 protein lacks a catalytically active HEPN1 domain. As another example, a variant C2c2 protein lacks a catalytically active HEPN2 domain. In some cases, a variant C2c2 protein lacks a catalytically active HEPN1 domain and lacks a catalytically active HEPN2 domain.

[00197] In some cases, a variant C2c2 polypeptide comprises amino acid substitutions of 1, 2, 3, or 4 of amino acids R472, H477, R1048, and H1053 of the amino acid sequence set forth in SEQ ID NO:2 (Leptotrichia buccalis C2c2), or a corresponding amino acid of a C2c2 amino acid sequence depicted in SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. Corresponding amino acids in SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6 are readily identified; see, e.g., FIG. 22B. For example, amino acid positions in SEQ ID NO: 1 (Listeria seeligeri C2c2) that correspond to R472, H477, R1048, and H1053 of SEQ ID NO:2 are R445, H450, R1016, and H1021, respectively. As another example, amino acid positions in SEQ ID NO:4 (Rhodobacter capsulatus C2c2) that correspond to R472, H477, R1048, and H1053 of SEQ ID NO:2 are R464, H469, R1052, and H1057, respectively. As another example, amino acid positions in SEQ ID NO:5 (Carnobacterium gallinarum C2c2) that correspond to R472, H477, R1048, and H1053 of SEQ ID NO:2 are R467, H472, R1069, and H1074, respectively. As another example, amino acid positions in SEQ ID NO:6 (Herbinix hemicellulosilytica C2c2) that correspond to R472, H477, R1048, and H1053 of SEQ ID NO:2 are R472, H477, R1044, and H1049, respectively.
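The residue correspondences recited above can be collected, purely for illustration, into a small Python lookup table keyed by SEQ ID NO. The residue labels are taken from the text; the data structure and the function names are hypothetical conveniences and form no part of the disclosure.

```python
from typing import Dict, List, Tuple

# Residues as recited in the text, listed in the same order for every
# sequence so that entries at the same index correspond to one another.
CATALYTIC_RESIDUES: Dict[int, Tuple[str, str, str, str]] = {
    2: ("R472", "H477", "R1048", "H1053"),  # Leptotrichia buccalis (reference)
    1: ("R445", "H450", "R1016", "H1021"),  # Listeria seeligeri
    4: ("R464", "H469", "R1052", "H1057"),  # Rhodobacter capsulatus
    5: ("R467", "H472", "R1069", "H1074"),  # Carnobacterium gallinarum
    6: ("R472", "H477", "R1044", "H1049"),  # Herbinix hemicellulosilytica
}
REFERENCE_SEQ_ID = 2


def corresponding_residue(reference_residue: str, seq_id: int) -> str:
    """Map a SEQ ID NO:2 residue (e.g., 'R1048') to its counterpart in seq_id."""
    index = CATALYTIC_RESIDUES[REFERENCE_SEQ_ID].index(reference_residue)
    return CATALYTIC_RESIDUES[seq_id][index]


def alanine_substitutions(seq_id: int) -> List[str]:
    """Alanine substitution labels (e.g., 'R445A') for all four residues of seq_id."""
    return [residue + "A" for residue in CATALYTIC_RESIDUES[seq_id]]
```

For instance, `corresponding_residue("R1048", 5)` looks up the Carnobacterium gallinarum counterpart of R1048 of SEQ ID NO:2.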

[00198] In some cases, a variant C2c2 polypeptide comprises amino acid substitutions of amino acids R472 and H477 of the amino acid sequence set forth in SEQ ID NO:2, or corresponding amino acids of a C2c2 amino acid sequence depicted in SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. In some cases, a variant C2c2 polypeptide comprises amino acid substitutions of amino acids R1048 and H1053 of the amino acid sequence set forth in SEQ ID NO:2, or corresponding amino acids of a C2c2 amino acid sequence depicted in SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. In some cases, a variant C2c2 polypeptide comprises amino acid substitutions of amino acids R472, H477, R1048, and H1053 of the amino acid sequence set forth in SEQ ID NO:2, or corresponding amino acids of a C2c2 amino acid sequence depicted in SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6.

[00199] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of amino acids R472 and H477. In some cases, the amino acid at position 472 is any amino acid other than Arg; and the amino acid at position 477 is any amino acid other than His. In some cases, the substitutions are R472A and H477A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of amino acids R1048 and H1053. In some cases, the amino acid at position 1048 is any amino acid other than Arg; and the amino acid at position 1053 is any amino acid other than His. In some cases, the substitutions are R1048A and H1053A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of amino acids R472, H477, R1048, and H1053. In some cases, the amino acid at positions 472 and 1048 is any amino acid other than Arg; and the amino acid at positions 477 and 1053 is any amino acid other than His. In some cases, the substitutions are R472A, H477A, R1048A, and H1053A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. 
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.
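Substitution labels such as R472A follow the usual convention of wild-type residue, 1-based position, and replacement residue. As a non-limiting sketch, the following Python code applies such labels to an amino acid sequence string and verifies that the stated wild-type residue is in fact present at the stated position. The function names are hypothetical, and the short sequences used to exercise the code are toy strings, not any sequence of the disclosure.

```python
import re
from typing import Iterable


def apply_substitution(sequence: str, substitution: str) -> str:
    """Apply one substitution written as <wild-type><position><replacement>,
    with a 1-based position (e.g., 'R472A')."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", substitution)
    if match is None:
        raise ValueError(f"unrecognized substitution label: {substitution!r}")
    wild_type, position, replacement = match.group(1), int(match.group(2)), match.group(3)
    if not 1 <= position <= len(sequence):
        raise ValueError(f"position {position} is outside the sequence")
    if sequence[position - 1] != wild_type:
        raise ValueError(
            f"expected {wild_type} at position {position}, found {sequence[position - 1]}"
        )
    # Positions are 1-based in the label and 0-based in the Python string.
    return sequence[:position - 1] + replacement + sequence[position:]


def apply_substitutions(sequence: str, substitutions: Iterable[str]) -> str:
    """Apply several substitutions; each is checked against the original numbering,
    which is unchanged because substitutions do not alter sequence length."""
    for substitution in substitutions:
        sequence = apply_substitution(sequence, substitution)
    return sequence
```

The wild-type check guards against applying a label to the wrong reference sequence, e.g., using SEQ ID NO:2 numbering on a sequence from a different species.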

[00200] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO: 1, and comprises substitution of amino acids R445 and H450. In some cases, the amino acid at position 445 is any amino acid other than Arg; and the amino acid at position 450 is any amino acid other than His. In some cases, the substitutions are R445A and H450A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO: 1, and comprises substitution of amino acids R1016 and H1021. In some cases, the amino acid at position 1016 is any amino acid other than Arg; and the amino acid at position 1021 is any amino acid other than His. In some cases, the substitutions are R1016A and H1021A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO: 1, and comprises substitution of amino acids R445, H450, R1016, and H1021. In some cases, the amino acid at positions 445 and 1016 is any amino acid other than Arg; and the amino acid at positions 450 and 1021 is any amino acid other than His. In some cases, the substitutions are R445A, H450A, R1016A, and H1021A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA.
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00201] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4 (Rhodobacter capsulatus C2c2), and comprises substitution of amino acids R464 and H469. In some cases, the amino acid at position 464 is any amino acid other than Arg; and the amino acid at position 469 is any amino acid other than His. In some cases, the substitutions are R464A and H469A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises substitution of amino acids R1052 and H1057. In some cases, the amino acid at position 1052 is any amino acid other than Arg; and the amino acid at position 1057 is any amino acid other than His. In some cases, the substitutions are R1052A and H1057A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises substitution of amino acids R464, H469, R1052, and H1057. In some cases, the amino acid at positions 464 and 1052 is any amino acid other than Arg; and the amino acid at positions 469 and 1057 is any amino acid other than His. In some cases, the substitutions are R464A, H469A, R1052A, and H1057A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA.
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00202] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5 (Carnobacterium gallinarum C2c2), and comprises substitution of amino acids R467 and H472. In some cases, the amino acid at position 467 is any amino acid other than Arg; and the amino acid at position 472 is any amino acid other than His. In some cases, the substitutions are R467A and H472A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises substitution of amino acids R1069 and H1074. In some cases, the amino acid at position 1069 is any amino acid other than Arg; and the amino acid at position 1074 is any amino acid other than His. In some cases, the substitutions are R1069A and H1074A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises substitution of amino acids R467, H472, R1069, and H1074. In some cases, the amino acid at positions 467 and 1069 is any amino acid other than Arg; and the amino acid at positions 472 and 1074 is any amino acid other than His. In some cases, the substitutions are R467A, H472A, R1069A, and H1074A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA.
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00203] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO: 6 (Herbinix hemicellulosilytica C2c2), and comprises substitution of amino acids R472 and H477. In some cases, the amino acid at position 472 is any amino acid other than Arg; and the amino acid at position 477 is any amino acid other than His. In some cases, the substitutions are R472A and H477A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises substitution of amino acids R1044 and H1049. In some cases, the amino acid at position 1044 is any amino acid other than Arg; and the amino acid at position 1049 is any amino acid other than His. In some cases, the substitutions are R1044A and H1049A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises substitution of amino acids R472, H477, R1044, and H1049. In some cases, the amino acid at positions 472 and 1044 is any amino acid other than Arg; and the amino acid at positions 477 and 1049 is any amino acid other than His. In some cases, the substitutions are R472A, H477A, R1044A, and H1049A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00204] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of 1, 2, 3, or 4 of amino acids R472, H477, R1048, and H1053, such that the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. For example, in some cases, the variant C2c2 polypeptide exhibits less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, less than 1%, or less than 0.1%, of the RNA-guided cleavage of a non-target RNA exhibited by a C2c2 polypeptide having the amino acid sequence set forth in SEQ ID NO:2. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00205] Any of the above variant C2c2 polypeptides can also include a mutation (e.g., at any one of positions R1079, R1072, and K1082, as described in further detail below) that results in reduced ability (e.g., loss of ability) to cleave precursor C2c2 guide RNA.

[00206] In some cases, a variant C2c2 polypeptide has reduced ability to cleave precursor C2c2 guide RNA (e.g., see examples below and related Figs. 26C-26D, 35D, and 37). For example, in some cases, a variant C2c2 polypeptide comprises amino acid substitutions of 1, 2, or 3 of amino acids R1079, R1072, and K1082 of the amino acid sequence set forth in SEQ ID NO:2 (Leptotrichia buccalis C2c2), or a corresponding amino acid of any C2c2 amino acid sequence (e.g., the C2c2 amino acid sequence depicted in SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6). Corresponding amino acids in SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6 are readily identified. For example, amino acid positions in SEQ ID NO: 1 (Listeria seeligeri C2c2) that correspond to R1079, R1072, and K1082 of SEQ ID NO:2 are R1048, R1041, and K1051, respectively. As another example, amino acid positions in SEQ ID NO:4 (Rhodobacter capsulatus C2c2) that correspond to R1079, R1072, and K1082 of SEQ ID NO:2 are R1085, R1078, and K1088, respectively. As another example, amino acid positions in SEQ ID NO:5 (Carnobacterium gallinarum C2c2) that correspond to R1079, R1072, and K1082 of SEQ ID NO:2 are R1099, R1092, and K1102, respectively. As another example, amino acid positions in SEQ ID NO:6 (Herbinix hemicellulosilytica C2c2) that correspond to R1079 and R1072 of SEQ ID NO:2 are R1172 and R1165, respectively.

[00207] In some cases, a variant C2c2 polypeptide comprises an amino acid substitution of amino acid R1079 (e.g., R1079A) of the amino acid sequence set forth in SEQ ID NO:2, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid substitution of amino acid R1072 (e.g., R1072A) of the amino acid sequence set forth in SEQ ID NO:2, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid substitution of amino acid K1082 (e.g., K1082A) of the amino acid sequence set forth in SEQ ID NO:2, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises one or more (e.g., two or more, or all three) amino acid substitutions at positions selected from R1079 (e.g., R1079A), R1072 (e.g., R1072A), and K1082 (e.g., K1082A) of the amino acid sequence set forth in SEQ ID NO:2, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO: 1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6).

[00208] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises an amino acid substitution of amino acid R1079 (e.g., R1079A) of the amino acid sequence set forth in SEQ ID NO:2, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises an amino acid substitution of amino acid R1072 (e.g., R1072A) of the amino acid sequence set forth in SEQ ID NO:2, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises an amino acid substitution of amino acid K1082 (e.g., K1082A) of the amino acid sequence set forth in SEQ ID NO:2, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises one or more (e.g., two or more, or all three) amino acid substitutions at positions selected from R1079 (e.g., R1079A), R1072 (e.g., R1072A), and K1082 (e.g., K1082A) of the amino acid sequence set forth in SEQ ID NO:2, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6).

[00209] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises an amino acid substitution of amino acid R1041 (e.g., R1041A) of the amino acid sequence set forth in SEQ ID NO:1, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises an amino acid substitution of amino acid R1048 (e.g., R1048A) of the amino acid sequence set forth in SEQ ID NO:1, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises an amino acid substitution of amino acid K1051 (e.g., K1051A) of the amino acid sequence set forth in SEQ ID NO:1, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises one or more (e.g., two or more, or all three) amino acid substitutions at positions selected from R1048 (e.g., R1048A), R1041 (e.g., R1041A), and K1051 (e.g., K1051A) of the amino acid sequence set forth in SEQ ID NO:1, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6).

[00210] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises an amino acid substitution of amino acid R1085 (e.g., R1085A) of the amino acid sequence set forth in SEQ ID NO:4, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:5, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises an amino acid substitution of amino acid R1078 (e.g., R1078A) of the amino acid sequence set forth in SEQ ID NO:4, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:5, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises an amino acid substitution of amino acid K1088 (e.g., K1088A) of the amino acid sequence set forth in SEQ ID NO:4, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:5, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises one or more (e.g., two or more, or all three) amino acid substitutions at positions selected from R1085 (e.g., R1085A), R1078 (e.g., R1078A), and K1088 (e.g., K1088A) of the amino acid sequence set forth in SEQ ID NO:4, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:5, or SEQ ID NO:6).

[00211] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises an amino acid substitution of amino acid R1099 (e.g., R1099A) of the amino acid sequence set forth in SEQ ID NO:5, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:2, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises an amino acid substitution of amino acid R1092 (e.g., R1092A) of the amino acid sequence set forth in SEQ ID NO:5, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:2, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises an amino acid substitution of amino acid K1102 (e.g., K1102A) of the amino acid sequence set forth in SEQ ID NO:5, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:2, or SEQ ID NO:6). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises one or more (e.g., two or more, or all three) amino acid substitutions at positions selected from R1099 (e.g., R1099A), R1092 (e.g., R1092A), and K1102 (e.g., K1102A) of the amino acid sequence set forth in SEQ ID NO:5, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:2, or SEQ ID NO:6).

[00212] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises an amino acid substitution of amino acid R1172 (e.g., R1172A) of the amino acid sequence set forth in SEQ ID NO:6, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:2). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises an amino acid substitution of amino acid R1165 (e.g., R1165A) of the amino acid sequence set forth in SEQ ID NO:6, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:2). In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises one or more (e.g., both) amino acid substitutions at positions selected from R1172 (e.g., R1172A) and R1165 (e.g., R1165A) of the amino acid sequence set forth in SEQ ID NO:6, or the corresponding amino acid of any C2c2 amino acid sequence (e.g., a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:2).

C2c2 Guide RNA

[00213] A subject C2c2 guide RNA (e.g., a C2c2 crRNA) includes a guide sequence and a constant region (e.g., a region that is 5' of the guide sequence). The region that is 5' of the guide sequence binds to the C2c2 protein (and can be considered a protein-binding region) while the guide sequence hybridizes to a target sequence of the target RNA.

Guide sequence

[00214] The guide sequence has complementarity with (hybridizes to) a target sequence of the single stranded target RNA. In some cases, the base of the target RNA that is immediately 3' of the target sequence (protospacer) is not a G. In some cases, the guide sequence is 16-28 nucleotides (nt) in length (e.g., 16-26, 16-24, 16-22, 16-20, 16-18, 17-26, 17-24, 17-22, 17-20, 17-18, 18-26, 18-24, or 18-22 nt in length). In some cases, the guide sequence is 18-24 nucleotides (nt) in length. In some cases, the guide sequence is at least 16 nt long (e.g., at least 18, 20, or 22 nt long). In some cases, the guide sequence is at least 17 nt long. In some cases, the guide sequence is at least 18 nt long. In some cases, the guide sequence is at least 20 nt long.
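By way of non-limiting illustration, the length and flanking-base considerations above, together with percent complementarity between a guide sequence and its target sequence, can be checked in silico. The following is a minimal sketch under stated assumptions: only Watson-Crick pairs (A-U, G-C) are counted, sequences are written 5' to 3', and the names `check_guide` and `reverse_complement` are hypothetical.

```python
PAIR = {"A": "U", "U": "A", "G": "C", "C": "G"}  # Watson-Crick pairs only (assumption)


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5'->3'."""
    return "".join(PAIR[base] for base in reversed(rna))


def check_guide(guide: str, target: str, site_start: int,
                min_len: int = 16, max_len: int = 28) -> dict:
    """Check a guide against the target sequence starting at site_start (0-based)."""
    if not guide:
        raise ValueError("guide sequence is empty")
    site_end = site_start + len(guide)
    if site_start < 0 or site_end > len(target):
        raise ValueError("target sequence runs past the end of the target RNA")
    site = target[site_start:site_end]
    # Hybridization is antiparallel: guide[i] pairs with site[-1 - i].
    paired = sum(1 for g, t in zip(guide, reversed(site)) if PAIR.get(g) == t)
    # Base of the target RNA immediately 3' of the target sequence, if any.
    flank = target[site_end] if site_end < len(target) else None
    return {
        "length_ok": min_len <= len(guide) <= max_len,
        "percent_complementarity": 100.0 * paired / len(guide),
        "flank_3prime": flank,
        "flank_not_g": flank != "G",
    }


if __name__ == "__main__":
    site = "GGAUCCAUGCAUGCAUGCAU"  # hypothetical 20-nt target sequence
    guide = reverse_complement(site)
    print(check_guide(guide, "AA" + site + "CUU", site_start=2))
```

The default bounds of 16 and 28 nt mirror the broadest range recited above; narrower ranges can be passed as arguments.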

[00215] In some cases, the guide sequence has 80% or more (e.g., 85% or more, 90% or more, 95% or more, or 100%) complementarity with the target sequence of the single stranded target RNA. In some cases, the guide sequence is 100% complementary to the target sequence of the single stranded target RNA.

Constant region

[00216] The following 3 sequences are each an example of a constant region of a naturally existing C2c2 guide RNA (e.g., a region that is 5' of the guide sequence):

GACUACCUCUAUAUGAAAGAGGACUAAAAC (SEQ ID NO: 7)

(Listeria seeligeri) ("Lse")

CCACCCCAAUAUCGAAGGGGACUAAAACA (SEQ ID NO: 8)

(Leptotrichia shahii) ("Lsh")

GACCACCCCAAAAAUGAAGGGGACUAAAACA (SEQ ID NO: 9)

(Leptotrichia buccalis) ("Lbu")

[00217] In some embodiments, a subject C2c2 guide RNA includes a nucleotide sequence having 70% or more identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, or 100% identity) with the sequence set forth in any one of SEQ ID NOs: 7-9. In some embodiments, a subject C2c2 guide RNA includes a nucleotide sequence having 90% or more identity (e.g., 95% or more, 98% or more, 99% or more, or 100% identity) with the sequence set forth in any one of SEQ ID NOs: 7-9. In some embodiments, a subject C2c2 guide RNA includes the nucleotide sequence set forth in any one of SEQ ID NOs: 7-9.

[00218] In some embodiments, a subject C2c2 guide RNA includes a nucleotide sequence having 70% or more identity (e.g., 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, or 100% identity) with the sequence set forth in SEQ ID NO: 9. In some embodiments, a subject C2c2 guide RNA includes a nucleotide sequence having 90% or more identity (e.g., 95% or more, 98% or more, 99% or more, or 100% identity) with the sequence set forth in SEQ ID NO: 9. In some embodiments, a subject C2c2 guide RNA includes the nucleotide sequence set forth in SEQ ID NO: 9.
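By way of non-limiting illustration, percent identity between a candidate constant region and one of SEQ ID NOs: 7-9 can be estimated with a simple global alignment. The following is a minimal sketch under stated assumptions: it uses a basic Needleman-Wunsch alignment with arbitrary illustrative scores (match +1, mismatch -1, gap -1) and reports identical positions divided by the number of alignment columns. Other alignment tools and identity definitions can give different values, and the function name `percent_identity` is hypothetical.

```python
LBU_CONSTANT = "GACCACCCCAAAAAUGAAGGGGACUAAAACA"  # SEQ ID NO: 9 as recited above


def percent_identity(a: str, b: str, match: int = 1,
                     mismatch: int = -1, gap: int = -1) -> float:
    """Percent identity over a global alignment (illustrative scoring only)."""
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")
    # score[i][j] is the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back one optimal alignment, counting identical columns.
    i, j, identical, columns = n, m, 0, 0
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            identical += a[i - 1] == b[j - 1]
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            i -= 1  # gap in b
        else:
            j -= 1  # gap in a
        columns += 1
    return 100.0 * identical / columns


if __name__ == "__main__":
    candidate = LBU_CONSTANT[:-1] + "G"  # hypothetical single-nucleotide variant
    print(round(percent_identity(candidate, LBU_CONSTANT), 1))
```

A candidate can then be compared against a recited threshold, for example `percent_identity(candidate, LBU_CONSTANT) >= 90`.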

[00219] In some embodiments, a subject C2c2 guide RNA does not include a nucleotide sequence of a Leptotrichia shahii (Lsh) C2c2 guide RNA. For example, in some cases, the C2c2 protein that is used is not a C2c2 from Leptotrichia shahii (e.g., is not an Lsh C2c2 protein), and in some such cases the C2c2 guide RNA that is used is also not from Leptotrichia shahii (e.g., the guide RNA used does not include the constant region of an Lsh C2c2 guide RNA). Therefore, in some cases a subject C2c2 guide RNA does not include the sequence set forth in SEQ ID NO: 8.

[00220] In some cases, the C2c2 guide RNA includes a double stranded RNA duplex (dsRNA duplex). For example, see Fig. 7A which illustrates a C2c2 guide RNA from Lbu hybridized to a single stranded target RNA, where the C2c2 guide RNA includes a dsRNA duplex that is 4 base pairs (bp) in length. In some cases, a C2c2 guide RNA includes a dsRNA duplex with a length of from 2 to 12 bp (e.g., from 2 to 10 bp, 2 to 8 bp, 2 to 6 bp, 2 to 5 bp, 2 to 4 bp, 3 to 12 bp, 3 to 10 bp, 3 to 8 bp, 3 to 6 bp, 3 to 5 bp, 3 to 4 bp, 4 to 12 bp, 4 to 10 bp, 4 to 8 bp, 4 to 6 bp, or 4 to 5 bp). In some cases, a C2c2 guide RNA includes a dsRNA duplex that is 2 or more bp in length (e.g., 3 or more, 4 or more, 5 or more, 6 or more, or 7 or more bp in length). In some cases, a C2c2 guide RNA includes a dsRNA duplex that is longer than the dsRNA duplex of a corresponding wild type C2c2 guide RNA. For example, see Fig. 7A, which illustrates a C2c2 guide RNA from Lbu hybridized to a single stranded target RNA, where the C2c2 guide RNA includes a dsRNA duplex that is 4 base pairs (bp) in length. As such, a C2c2 guide RNA can in some cases include a dsRNA duplex that is 5 or more bp in length (e.g., 6 or more, 7 or more, or 8 or more bp in length). In some cases, a C2c2 guide RNA includes a dsRNA duplex that is shorter than the dsRNA duplex of a corresponding wild type C2c2 guide RNA. As such, in some cases, a C2c2 guide RNA includes a dsRNA duplex that is less than 4 bp in length. In some cases, a C2c2 guide RNA includes a dsRNA duplex that is 2 or 3 bp in length.

[00221] In some cases, the region of a C2c2 guide RNA that is 5' of the guide sequence is 15 or more nucleotides (nt) in length (e.g., 18 or more, 20 or more, 21 or more, 22 or more, 23 or more, 24 or more, 25 or more, 26 or more, 27 or more, 28 or more, 29 or more, 30 or more, 31 or more, 32 or more, 33 or more, 34 or more, or 35 or more nt in length). In some cases, the region of a C2c2 guide RNA that is 5' of the guide sequence is 29 or more nt in length.

[00222] In some cases, the region of a C2c2 guide RNA that is 5' of the guide sequence has a length in a range of from 12 to 100 nt (e.g., from 12 to 90, 12 to 80, 12 to 70, 12 to 60, 12 to 50, 12 to 40, 15 to 100, 15 to 90, 15 to 80, 15 to 70, 15 to 60, 15 to 50, 15 to 40, 20 to 100, 20 to 90, 20 to 80, 20 to 70, 20 to 60, 20 to 50, 20 to 40, 25 to 100, 25 to 90, 25 to 80, 25 to 70, 25 to 60, 25 to 50, 25 to 40, 28 to 100, 28 to 90, 28 to 80, 28 to 70, 28 to 60, 28 to 50, 28 to 40, 29 to 100, 29 to 90, 29 to 80, 29 to 70, 29 to 60, 29 to 50, or 29 to 40 nt). In some cases, the region of a C2c2 guide RNA that is 5' of the guide sequence has a length in a range of from 28 to 100 nt. In some cases, the region of a C2c2 guide RNA that is 5' of the guide sequence has a length in a range of from 28 to 40 nt.

[00223] In some cases, the region of the C2c2 guide RNA that is 5' of the guide sequence is truncated relative to (shorter than) the corresponding region of a corresponding wild type C2c2 guide RNA. For example, the mature Lse C2c2 guide RNA includes a region 5' of the guide sequence that is 30 nucleotides (nt) in length, and a subject truncated C2c2 guide RNA (relative to the Lse C2c2 guide RNA) can therefore have a region 5' of the guide sequence that is less than 30 nt in length (e.g., less than 29, 28, 27, 26, 25, 22, or 20 nt in length). In some cases, a truncated C2c2 guide RNA includes a region 5' of the guide sequence that has a length in a range of from 12 to 29 nt (e.g., from 12 to 28, 12 to 27, 12 to 26, 12 to 25, 12 to 22, 12 to 20, or 12 to 18 nt). In some cases, the truncated C2c2 guide RNA is truncated by one or more nt (e.g., 2 or more, 3 or more, 4 or more, 5 or more, or 10 or more nt), e.g., relative to a corresponding wild type C2c2 guide RNA.

[00224] In some cases, the region of the C2c2 guide RNA that is 5' of the guide sequence is extended relative to (longer than) the corresponding region of a corresponding wild type C2c2 guide RNA. For example, the mature Lse C2c2 guide RNA includes a region 5' of the guide sequence that is 30 nucleotides (nt) in length, and an extended C2c2 guide RNA (relative to the Lse C2c2 guide RNA) can therefore have a region 5' of the guide sequence that is longer than 30 nt (e.g., longer than 31, longer than 32, longer than 33, longer than 34, or longer than 35 nt). In some cases, an extended C2c2 guide RNA includes a region 5' of the guide sequence that has a length in a range of from 30 to 100 nt (e.g., from 30 to 90, 30 to 80, 30 to 70, 30 to 60, 30 to 50, or 30 to 40 nt). In some cases, the extended C2c2 guide RNA includes a region 5' of the guide sequence that is extended (e.g., relative to the corresponding region of a corresponding wild type C2c2 guide RNA) by one or more nt (e.g., 2 or more, 3 or more, 4 or more, 5 or more, or 10 or more nt).
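By way of non-limiting illustration, a region 5' of the guide sequence can be classified as truncated or extended by comparing its length with that of a reference constant region. The following is a minimal sketch that uses the three constant regions recited above (SEQ ID NOs: 7-9) as references; the comparison is by length only, and the names `CONSTANT_REGIONS` and `classify_handle` are hypothetical.

```python
# Constant regions as recited above (SEQ ID NOs: 7-9).
CONSTANT_REGIONS = {
    "Lse": "GACUACCUCUAUAUGAAAGAGGACUAAAAC",   # SEQ ID NO: 7
    "Lsh": "CCACCCCAAUAUCGAAGGGGACUAAAACA",    # SEQ ID NO: 8
    "Lbu": "GACCACCCCAAAAAUGAAGGGGACUAAAACA",  # SEQ ID NO: 9
}


def classify_handle(handle: str, reference: str = "Lse") -> str:
    """Compare the length of a 5' region with that of a reference constant region."""
    ref_len = len(CONSTANT_REGIONS[reference])
    if len(handle) < ref_len:
        return "truncated"
    if len(handle) > ref_len:
        return "extended"
    return "wild type length"


if __name__ == "__main__":
    lse = CONSTANT_REGIONS["Lse"]
    print(len(lse))                     # length of the Lse constant region
    print(classify_handle(lse[5:]))     # hypothetical handle shortened by 5 nt
    print(classify_handle("GG" + lse))  # hypothetical handle lengthened by 2 nt
```

The sketch does not assume at which end nucleotides are removed or added; it considers only the resulting length.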

[00225] In some cases, a subject C2c2 guide RNA is 30 or more nucleotides (nt) in length (e.g., 34 or more, 40 or more, 45 or more, 50 or more, 55 or more, 60 or more, 65 or more, 70 or more, or 80 or more nt in length). In some cases, the C2c2 guide RNA is 35 or more nt in length.

[00226] In some cases, a subject C2c2 guide RNA has a length in a range of from 30 to 120 nt (e.g., from 30 to 110, 30 to 100, 30 to 90, 30 to 80, 30 to 70, 30 to 60, 35 to 120, 35 to 110, 35 to 100, 35 to 90, 35 to 80, 35 to 70, 35 to 60, 40 to 120, 40 to 110, 40 to 100, 40 to 90, 40 to 80, 40 to 70, 40 to 60, 50 to 120, 50 to 110, 50 to 100, 50 to 90, 50 to 80, or 50 to 70 nt). In some cases, the C2c2 guide RNA has a length in a range of from 33 to 80 nt. In some cases, the C2c2 guide RNA has a length in a range of from 35 to 60 nt.

[00227] In some cases, a subject C2c2 guide RNA is truncated relative to (shorter than) a corresponding wild type C2c2 guide RNA. For example, a mature Lse C2c2 guide RNA can be 50 nucleotides (nt) in length, and a truncated C2c2 guide RNA (relative to the Lse C2c2 guide RNA) can therefore in some cases be less than 50 nt in length (e.g., less than 49, 48, 47, 46, 45, 42, or 40 nt in length). In some cases, a truncated C2c2 guide RNA has a length in a range of from 30 to 49 nt (e.g., from 30 to 48, 30 to 47, 30 to 46, 30 to 45, 30 to 42, 30 to 40, 35 to 49, 35 to 48, 35 to 47, 35 to 46, 35 to 45, 35 to 42, or 35 to 40 nt). In some cases, the truncated C2c2 guide RNA is truncated by one or more nt (e.g., 2 or more, 3 or more, 4 or more, 5 or more, or 10 or more nt), e.g., relative to a corresponding wild type C2c2 guide RNA.

[00228] In some cases, a subject C2c2 guide RNA is extended relative to (longer than) a corresponding wild type C2c2 guide RNA. For example, a mature Lse C2c2 guide RNA can be 50 nucleotides (nt) in length, and an extended C2c2 guide RNA (relative to the Lse C2c2 guide RNA) can therefore in some cases be longer than 50 nt (e.g., longer than 51, longer than 52, longer than 53, longer than 54, or longer than 55 nt). In some cases, an extended C2c2 guide RNA has a length in a range of from 51 to 100 nt (e.g., from 51 to 90, 51 to 80, 51 to 70, 51 to 60, 53 to 100, 53 to 90, 53 to 80, 53 to 70, 53 to 60, 55 to 100, 55 to 90, 55 to 80, 55 to 70, or 55 to 60 nt). In some cases, the extended C2c2 guide RNA is extended (e.g., relative to a corresponding wild type C2c2 guide RNA) by one or more nt (e.g., 2 or more, 3 or more, 4 or more, 5 or more, or 10 or more nt).

METHODS OF CLEAVING A PRECURSOR C2C2 GUIDE RNA ARRAY

[00229] The present disclosure provides a method of cleaving a precursor C2c2 guide RNA array into two or more C2c2 guide RNAs. The method comprises contacting a precursor C2c2 guide RNA array with a C2c2 protein. The precursor C2c2 guide RNA array comprises two or more (e.g., 2, 3, 4, 5, or more) C2c2 guide RNAs, each of which can have a different guide sequence. The C2c2 protein cleaves the precursor C2c2 guide RNA array into individual C2c2 guide RNAs. In some cases, the constant region (also referred to as a 'handle') of a C2c2 guide RNA includes nucleotide sequence from the precursor guide RNA (e.g., sequence that is normally present prior to cleavage of the guide RNA array). In other words, in some cases the constant region of a subject C2c2 guide RNA includes a precursor crRNA handle.
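By way of non-limiting illustration, the organization of a precursor array as two or more guide RNAs, each having a constant region followed by its own guide sequence, can be represented in silico. The following is a bookkeeping sketch only: it partitions a sequence string at each occurrence of a constant region and does not model the position or mechanism of cleavage by a C2c2 protein. The guide sequences shown are hypothetical placeholders, and the names `build_array` and `split_array` are hypothetical.

```python
import re
from typing import List

LBU_CONSTANT = "GACCACCCCAAAAAUGAAGGGGACUAAAACA"  # SEQ ID NO: 9 as recited above


def build_array(handle: str, guides: List[str]) -> str:
    """Concatenate handle + guide units into a single precursor-like string."""
    return "".join(handle + guide for guide in guides)


def split_array(array: str, handle: str) -> List[str]:
    """Partition an array string into units that each begin with the handle."""
    starts = [m.start() for m in re.finditer(re.escape(handle), array)]
    ends = starts[1:] + [len(array)]
    return [array[s:e] for s, e in zip(starts, ends)]


if __name__ == "__main__":
    # Hypothetical 20-nt guide sequences, used only as placeholders.
    guides = ["AUGCAUGCAUGCAUGCAUGC", "UUGGCCAAUUGGCCAAUUGG"]
    array = build_array(LBU_CONSTANT, guides)
    for unit in split_array(array, LBU_CONSTANT):
        print(unit[:len(LBU_CONSTANT)], unit[len(LBU_CONSTANT):])
```

Each returned unit keeps the constant region at its 5' end followed by one guide sequence, consistent with the arrangement described for a subject C2c2 guide RNA.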

[00230] In some cases, the contacting step does not take place inside a cell, e.g., inside a living cell. In some cases, the contacting step takes place inside of a cell (e.g., a cell in vitro (in culture), a cell ex vivo, a cell in vivo). Any cell is suitable. Examples of cells in which contacting can take place include but are not limited to: a eukaryotic cell; a prokaryotic cell (e.g., a bacterial cell, an archaeal cell); a single-cell eukaryotic organism; a plant cell; an algal cell, e.g., Botryococcus braunii, Chlamydomonas reinhardtii, Nannochloropsis gaditana, Chlorella pyrenoidosa, Sargassum patens C. Agardh, and the like; a fungal cell (e.g., a yeast cell); an animal cell; an invertebrate cell (e.g., fruit fly, cnidarian, echinoderm, nematode, an insect, an arachnid, etc.); a vertebrate cell (e.g., fish, amphibian, reptile, bird, mammal); a mammal cell (e.g., a human; a non-human primate; an ungulate; a feline; a bovine; an ovine; a caprine; a rat; a mouse; a rodent; a pig; a sheep; a cow; etc.); a parasite cell (e.g., helminths, malarial parasites, etc.).

C2c2 protein

[00231] When a C2c2 protein has intact HEPN domains, it can cleave RNA (target RNA as well as non-target RNA) after it is 'activated'. However, C2c2 protein can also cleave precursor C2c2 guide RNAs into mature C2c2 guide RNAs in a HEPN-independent fashion. For example, when a C2c2 protein lacks a catalytically active HEPN1 domain and also lacks a catalytically active HEPN2 domain, it can still cleave precursor guide RNA into mature guide RNA. As such, when used in a method that includes a precursor C2c2 guide RNA and/or a precursor C2c2 guide RNA array, the C2c2 protein can (and will in some cases) lack a catalytically active HEPN1 domain and/or catalytically active HEPN2 domain. In some cases, the C2c2 protein lacks a catalytically active HEPN1 domain and lacks a catalytically active HEPN2 domain.

[00232] A C2c2 protein that lacks a catalytically active HEPN1 domain and lacks a catalytically active HEPN2 domain can in some cases be used in methods of binding (e.g., imaging methods). For example, in some cases, a method of binding (and/or imaging) includes contacting a sample with a precursor C2c2 guide RNA array and a C2c2 protein that lacks a catalytically active HEPN1 domain and lacks a catalytically active HEPN2 domain. In such cases, the C2c2 protein can be detectably labeled (e.g., fused to an epitope tag, fused to a fluorophore, fused to a fluorescent protein such as a green fluorescent protein, etc.).

[00233] A C2c2 protein suitable for use in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array can have intact HEPN1 and HEPN2 domains. However, in some cases, the C2c2 protein lacks a catalytically active HEPN1 domain and/or lacks a catalytically active HEPN2 domain.

[00234] In some cases, a C2c2 protein suitable for use in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array includes an amino acid sequence having 80% or more (e.g., 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100%) amino acid sequence identity with the amino acid sequence set forth in any one of SEQ ID NOs: 1-6. In some cases, a C2c2 protein suitable for use in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Listeria seeligeri C2c2 amino acid sequence set forth in SEQ ID NO:1. In some cases, a C2c2 protein suitable for use in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Leptotrichia buccalis C2c2 amino acid sequence set forth in SEQ ID NO:2. In some cases, a C2c2 protein suitable for use in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rhodobacter capsulatus C2c2 amino acid sequence set forth in SEQ ID NO:4. In some cases, a C2c2 protein suitable for use in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Carnobacterium gallinarum C2c2 amino acid sequence set forth in SEQ ID NO:5.
In some cases, a C2c2 protein suitable for use in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Herbinix hemicellulosilytica C2c2 amino acid sequence set forth in SEQ ID NO:6. In some cases, a C2c2 protein suitable for use in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array includes an amino acid sequence having 80% or more amino acid sequence identity with the Leptotrichia buccalis (Lbu) C2c2 amino acid sequence set forth in SEQ ID NO: 2. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure is a Leptotrichia buccalis (Lbu) C2c2 protein (e.g., see SEQ ID NO: 2). In some cases, a C2c2 protein suitable for use in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array includes the amino acid sequence set forth in any one of SEQ ID NOs: 1-2 and 4-6.

[00235] In some cases, a C2c2 protein used in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array is not a Leptotrichia shahii (Lsh) C2c2 protein. In some cases, a C2c2 protein used in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array is not a C2c2 polypeptide having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh C2c2 polypeptide set forth in SEQ ID NO:3.

[00236] In some cases, a C2c2 polypeptide suitable for use in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array is a variant C2c2 polypeptide. Variant C2c2 polypeptides suitable for use in a method of the present disclosure for cleaving a precursor C2c2 guide RNA array include variants of any one of SEQ ID NOs: 1, 2, and 4-6, where the variant C2c2 polypeptide exhibits reduced (or undetectable) nuclease activity. For example, in some cases, a variant C2c2 protein lacks a catalytically active HEPN1 domain. As another example, a variant C2c2 protein lacks a catalytically active HEPN2 domain. In some cases, a variant C2c2 protein lacks a catalytically active HEPN1 domain and lacks a catalytically active HEPN2 domain.

[00237] In some cases, a variant C2c2 polypeptide comprises amino acid substitutions of 1, 2, 3, or 4 of amino acids R472, H477, R1048, and H1053 of the amino acid sequence set forth in SEQ ID NO:2 (Leptotrichia buccalis C2c2), or a corresponding amino acid of a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. Corresponding amino acids in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6 are readily identified; see, e.g., FIG. 22B. For example, amino acid positions in SEQ ID NO:1 (Listeria seeligeri C2c2) that correspond to R472, H477, R1048, and H1053 of SEQ ID NO:2 are R445, H450, R1016, and H1021, respectively.

[00238] In some cases, a variant C2c2 polypeptide comprises amino acid substitutions of amino acids R472 and H477 of the amino acid sequence set forth in SEQ ID NO:2, or a corresponding amino acid of a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. In some cases, a variant C2c2 polypeptide comprises amino acid substitutions of amino acids R1048 and H1053 of the amino acid sequence set forth in SEQ ID NO:2, or a corresponding amino acid of a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. In some cases, a variant C2c2 polypeptide comprises amino acid substitutions of amino acids R472, H477, R1048, and H1053 of the amino acid sequence set forth in SEQ ID NO:2, or a corresponding amino acid of a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6.

[00239] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of amino acids R472 and H477. In some cases, the amino acid at position 472 is any amino acid other than Arg; and the amino acid at position 477 is any amino acid other than His. In some cases, the substitutions are R472A and H477A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of amino acids R1048 and H1053. In some cases, the amino acid at position 1048 is any amino acid other than Arg; and the amino acid at position 1053 is any amino acid other than His. In some cases, the substitutions are R1048A and H1053A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of amino acids R472, H477, R1048, and H1053. In some cases, the amino acid at positions 472 and 1048 is any amino acid other than Arg; and the amino acid at positions 477 and 1053 is any amino acid other than His. In some cases, the substitutions are R472A, H477A, R1048A, and H1053A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. 
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.
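The substitution notation used above (e.g., R472A: Arg at position 472 replaced by Ala) can be illustrated with a minimal Python sketch. The function name `apply_substitutions` and the toy sequence are hypothetical and are not taken from the disclosure; the toy sequence is not a C2c2 sequence. Positions are treated as 1-based, as in the text.

```python
import re


def apply_substitutions(sequence, substitutions):
    """Return a copy of `sequence` with substitutions such as 'R472A' applied.

    Positions are 1-based. A ValueError is raised if the residue found at a
    position does not match the wild-type residue named in the substitution.
    """
    residues = list(sequence)
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"unrecognized substitution: {sub}")
        wild_type = match.group(1)
        position = int(match.group(2))
        replacement = match.group(3)
        if position < 1 or position > len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        if residues[position - 1] != wild_type:
            raise ValueError(
                f"expected {wild_type} at position {position}, "
                f"found {residues[position - 1]}"
            )
        residues[position - 1] = replacement
    return "".join(residues)


# Toy 10-residue sequence for illustration only (NOT a real C2c2 sequence).
toy_sequence = "MKRTAHLLGG"
print(apply_substitutions(toy_sequence, ["R3A", "H6A"]))
```

Checking the wild-type residue before substituting guards against applying position numbers from one SEQ ID NO to a different sequence, where the corresponding residues sit at different positions.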

[00240] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises substitution of amino acids R445 and H450. In some cases, the amino acid at position 445 is any amino acid other than Arg; and the amino acid at position 450 is any amino acid other than His. In some cases, the substitutions are R445A and H450A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises substitution of amino acids R1016 and H1021. In some cases, the amino acid at position 1016 is any amino acid other than Arg; and the amino acid at position 1021 is any amino acid other than His. In some cases, the substitutions are R1016A and H1021A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises substitution of amino acids R445, H450, R1016, and H1021. In some cases, the amino acid at positions 445 and 1016 is any amino acid other than Arg; and the amino acid at positions 450 and 1021 is any amino acid other than His. In some cases, the substitutions are R445A, H450A, R1016A, and H1021A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA.
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00241] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4 (Rhodobacter capsulatus C2c2), and comprises substitution of amino acids R464 and H469. In some cases, the amino acid at position 464 is any amino acid other than Arg; and the amino acid at position 469 is any amino acid other than His. In some cases, the substitutions are R464A and H469A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises substitution of amino acids R1052 and H1057. In some cases, the amino acid at position 1052 is any amino acid other than Arg; and the amino acid at position 1057 is any amino acid other than His. In some cases, the substitutions are R1052A and H1057A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises substitution of amino acids R464, H469, R1052, and H1057. In some cases, the amino acid at positions 464 and 1052 is any amino acid other than Arg; and the amino acid at positions 469 and 1057 is any amino acid other than His. In some cases, the substitutions are R464A, H469A, R1052A, and H1057A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. 
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00242] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5 (Carnobacterium gallinarum C2c2), and comprises substitution of amino acids R467 and H472. In some cases, the amino acid at position 467 is any amino acid other than Arg; and the amino acid at position 472 is any amino acid other than His. In some cases, the substitutions are R467A and H472A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises substitution of amino acids R1069 and H1074. In some cases, the amino acid at position 1069 is any amino acid other than Arg; and the amino acid at position 1074 is any amino acid other than His. In some cases, the substitutions are R1069A and H1074A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises substitution of amino acids R467, H472, R1069, and H1074. In some cases, the amino acid at positions 467 and 1069 is any amino acid other than Arg; and the amino acid at positions 472 and 1074 is any amino acid other than His. In some cases, the substitutions are R467A, H472A, R1069A, and H1074A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA.
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00243] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6 (Herbinix hemicellulosilytica C2c2), and comprises substitution of amino acids R472 and H477. In some cases, the amino acid at position 472 is any amino acid other than Arg; and the amino acid at position 477 is any amino acid other than His. In some cases, the substitutions are R472A and H477A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises substitution of amino acids R1044 and H1049. In some cases, the amino acid at position 1044 is any amino acid other than Arg; and the amino acid at position 1049 is any amino acid other than His. In some cases, the substitutions are R1044A and H1049A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises substitution of amino acids R472, H477, R1044, and H1049. In some cases, the amino acid at positions 472 and 1044 is any amino acid other than Arg; and the amino acid at positions 477 and 1049 is any amino acid other than His. In some cases, the substitutions are R472A, H477A, R1044A, and H1049A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

Precursor C2c2 guide RNA array

[00244] As demonstrated in the working examples below, a C2c2 protein can cleave a precursor C2c2 guide RNA into a mature guide RNA, e.g., by endoribonucleolytic cleavage of the precursor. Also as demonstrated in the working examples below, a C2c2 protein can cleave a precursor C2c2 guide RNA array (that includes more than one C2c2 guide RNA arrayed in tandem) into two or more individual C2c2 guide RNAs. Thus, in some cases a precursor C2c2 guide RNA array comprises two or more (e.g., 3 or more, 4 or more, 5 or more, 2, 3, 4, or 5) C2c2 guide RNAs (e.g., arrayed in tandem as precursor molecules). In some cases, each guide RNA of a precursor C2c2 guide RNA array has a different guide sequence. In some cases, two or more guide RNAs of a precursor C2c2 guide RNA array have the same guide sequence.
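As a rough illustration of a tandem array, the Python sketch below concatenates guide RNA units and reports whether every guide sequence in the array is distinct. The function names, the placeholder sequences, and the repeat-then-guide ordering of each unit are assumptions made for illustration only; they are not specified in this passage.

```python
def assemble_precursor_array(repeat, guide_sequences):
    """Concatenate (repeat + guide) units in tandem.

    The unit layout (repeat followed by guide sequence) is an assumed,
    illustrative layout. At least two guide sequences are required.
    """
    if len(guide_sequences) < 2:
        raise ValueError("a precursor array comprises two or more guide RNAs")
    return "".join(repeat + guide for guide in guide_sequences)


def all_guides_distinct(guide_sequences):
    """True if each guide RNA in the array has a different guide sequence."""
    return len(set(guide_sequences)) == len(guide_sequences)


# Placeholder sequences for illustration only (not real repeat or guide sequences).
repeat = "RR"
guides = ["AAA", "CCC", "AAA"]
print(assemble_precursor_array(repeat, guides))
print(all_guides_distinct(guides))
```

Both cases described above are representable: an array in which every guide sequence differs, and an array in which two or more guide RNAs share the same guide sequence.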

[00245] In some cases, the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs that target different target sites within the same target RNA molecule. For example, such a scenario can in some cases increase sensitivity of detection by activating the C2c2 protein when any one of the guide RNAs hybridizes to the target RNA molecule.

[00246] In some cases, the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs that target different target RNA molecules. For example, such a scenario can result in a positive signal when any one of a family of potential target RNAs is present. Such an array could be used for targeting a family of transcripts, e.g., based on variation such as single nucleotide polymorphisms (SNPs) (e.g., for diagnostic purposes). Such an array could also be useful for detecting whether any one of a number of different strains of virus is present (e.g., influenza virus variants, Zika virus variants, HIV variants, and the like). Such an array could also be useful for detecting whether any one of a number of different species, strains, isolates, or variants of a bacterium is present (e.g., different species, strains, isolates, or variants of Mycobacterium; different species, strains, isolates, or variants of Neisseria; different species, strains, isolates, or variants of Staphylococcus aureus; different species, strains, isolates, or variants of E. coli; etc.).

VARIANT C2C2 POLYPEPTIDES

[00247] The present disclosure provides a variant C2c2 polypeptide, as well as a nucleic acid (e.g., a recombinant expression vector) comprising a nucleotide sequence encoding the variant C2c2 polypeptide.

[00248] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of amino acids R472 and H477. In some cases, the amino acid at position 472 is any amino acid other than Arg; and the amino acid at position 477 is any amino acid other than His. In some cases, the substitutions are R472A and H477A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of amino acids R1048 and H1053. In some cases, the amino acid at position 1048 is any amino acid other than Arg; and the amino acid at position 1053 is any amino acid other than His. In some cases, the substitutions are R1048A and H1053A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of amino acids R472, H477, R1048, and H1053. In some cases, the amino acid at positions 472 and 1048 is any amino acid other than Arg; and the amino acid at positions 477 and 1053 is any amino acid other than His. In some cases, the substitutions are R472A, H477A, R1048A, and H1053A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. 
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00249] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises substitution of amino acids R445 and H450. In some cases, the amino acid at position 445 is any amino acid other than Arg; and the amino acid at position 450 is any amino acid other than His. In some cases, the substitutions are R445A and H450A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises substitution of amino acids R1016 and H1021. In some cases, the amino acid at position 1016 is any amino acid other than Arg; and the amino acid at position 1021 is any amino acid other than His. In some cases, the substitutions are R1016A and H1021A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises substitution of amino acids R445, H450, R1016, and H1021. In some cases, the amino acid at positions 445 and 1016 is any amino acid other than Arg; and the amino acid at positions 450 and 1021 is any amino acid other than His. In some cases, the substitutions are R445A, H450A, R1016A, and H1021A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA.
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00250] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4 (Rhodobacter capsulatus C2c2), and comprises substitution of amino acids R464 and H469. In some cases, the amino acid at position 464 is any amino acid other than Arg; and the amino acid at position 469 is any amino acid other than His. In some cases, the substitutions are R464A and H469A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises substitution of amino acids R1052 and H1057. In some cases, the amino acid at position 1052 is any amino acid other than Arg; and the amino acid at position 1057 is any amino acid other than His. In some cases, the substitutions are R1052A and H1057A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises substitution of amino acids R464, H469, R1052, and H1057. In some cases, the amino acid at positions 464 and 1052 is any amino acid other than Arg; and the amino acid at positions 469 and 1057 is any amino acid other than His. In some cases, the substitutions are R464A, H469A, R1052A, and H1057A.
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00251] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5 (Carnobacterium gallinarum C2c2), and comprises substitution of amino acids R467 and H472. In some cases, the amino acid at position 467 is any amino acid other than Arg; and the amino acid at position 472 is any amino acid other than His. In some cases, the substitutions are R467A and H472A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises substitution of amino acids R1069 and H1074. In some cases, the amino acid at position 1069 is any amino acid other than Arg; and the amino acid at position 1074 is any amino acid other than His. In some cases, the substitutions are R1069A and H1074A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises substitution of amino acids R467, H472, R1069, and H1074. In some cases, the amino acid at positions 467 and 1069 is any amino acid other than Arg; and the amino acid at positions 472 and 1074 is any amino acid other than His. In some cases, the substitutions are R467A, H472A, R1069A, and H1074A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA.
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00252] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6 (Herbinix hemicellulosilytica C2c2), and comprises substitution of amino acids R472 and H477. In some cases, the amino acid at position 472 is any amino acid other than Arg; and the amino acid at position 477 is any amino acid other than His. In some cases, the substitutions are R472A and H477A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises substitution of amino acids R1044 and H1049. In some cases, the amino acid at position 1044 is any amino acid other than Arg; and the amino acid at position 1049 is any amino acid other than His. In some cases, the substitutions are R1044A and H1049A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises substitution of amino acids R472, H477, R1044, and H1049. In some cases, the amino acid at positions 472 and 1044 is any amino acid other than Arg; and the amino acid at positions 477 and 1049 is any amino acid other than His. In some cases, the substitutions are R472A, H477A, R1044A, and H1049A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00253] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of 1, 2, 3, or 4 of amino acids R472, H477, R1048, and H1053, such that the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. For example, in some cases, the variant C2c2 polypeptide exhibits less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, less than 1%, or less than 0.1%, of the RNA-guided cleavage of a non-target RNA exhibited by a C2c2 polypeptide having the amino acid sequence set forth in SEQ ID NO:2. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.
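The percentage thresholds recited above can be read as a comparison of a variant's cleavage activity against that of the reference polypeptide of SEQ ID NO:2. The Python sketch below shows that arithmetic under stated assumptions: the function names are hypothetical, and the activity values are assumed to be measured in the same assay and units (the disclosure does not prescribe a particular measurement here).

```python
# Threshold percentages as listed in the text.
THRESHOLDS_PERCENT = (50, 40, 30, 20, 10, 5, 1, 0.1)


def residual_activity_percent(variant_activity, reference_activity):
    """Variant activity expressed as a percentage of the reference activity."""
    if reference_activity <= 0:
        raise ValueError("reference activity must be positive")
    return 100.0 * variant_activity / reference_activity


def thresholds_met(percent):
    """Return every listed threshold that `percent` falls below."""
    return [t for t in THRESHOLDS_PERCENT if percent < t]


# Hypothetical measurements in arbitrary but matching units.
percent = residual_activity_percent(variant_activity=2.0, reference_activity=100.0)
print(percent, thresholds_met(percent))
```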

[00254] The present disclosure provides a nucleic acid (e.g., an isolated nucleic acid) comprising a nucleotide sequence encoding a variant C2c2 polypeptide of the present disclosure. In some cases, the nucleotide sequence is operably linked to a transcriptional control element, e.g., a promoter. In some cases, the promoter is a constitutive promoter. In some cases, the promoter is a regulatable promoter. In some cases, the promoter is an inducible promoter. In some cases, the promoter is functional in a eukaryotic cell. In some cases, the promoter is functional in a prokaryotic cell.

[00255] The present disclosure provides a recombinant expression vector comprising a nucleic acid of the present disclosure, e.g., a nucleic acid comprising a nucleotide sequence encoding a variant C2c2 polypeptide of the present disclosure.

[00256] The present disclosure provides a host cell that is genetically modified with a nucleic acid of the present disclosure, e.g., a nucleic acid comprising a nucleotide sequence encoding a variant C2c2 polypeptide of the present disclosure. The present disclosure provides a host cell that is genetically modified with a recombinant expression vector comprising a nucleic acid of the present disclosure, e.g., a nucleic acid comprising a nucleotide sequence encoding a variant C2c2 polypeptide of the present disclosure. In some cases, the host cell is a prokaryotic cell. In some cases, the host cell is a eukaryotic cell. In some cases, the host cell is in vitro. In some cases, the host cell is ex vivo. In some cases, the host cell is in vivo. In some cases, the host cell is a bacterial cell. In some cases, the host cell is a yeast cell. In some cases, the host cell is a plant cell. In some cases, the host cell is a mammalian cell. In some cases, the host cell is a human cell. In some cases, the host cell is a non-human mammalian cell. In some cases, the host cell is an insect cell. In some cases, the host cell is an arthropod cell. In some cases, the host cell is a fungal cell. In some cases, the host cell is an algal cell.

KITS

[00257] The present disclosure provides a kit for detecting a target RNA in a sample comprising a plurality of RNAs. In some cases, the kit comprises: (a) a precursor C2c2 guide RNA array comprising two or more C2c2 guide RNAs each of which has a different guide sequence; and (b) a C2c2 protein, and/or a nucleic acid encoding said C2c2 protein. In some cases, such a kit further includes a labeled detector RNA (e.g., a labeled detector RNA comprising a fluorescence-emitting dye pair, i.e., a FRET pair and/or a quencher/fluor pair). In some cases, two or more C2c2 guide RNAs (e.g., in some cases each of the C2c2 guide RNAs) of a given precursor C2c2 guide RNA array include the same guide sequence.

[00258] In some cases, a subject kit comprises: (a) a labeled detector RNA comprising a fluorescence-emitting dye pair, i.e., a FRET pair and/or a quencher/fluor pair; and (b) a C2c2 protein, and/or a nucleic acid encoding said C2c2 protein. In some cases, such a kit further includes (c) a C2c2 guide RNA (and/or a nucleic acid encoding a C2c2 guide RNA), and/or (d) a precursor C2c2 guide RNA (and/or a nucleic acid encoding a precursor C2c2 guide RNA) and/or (e) a precursor C2c2 guide RNA array (and/or a nucleic acid encoding a precursor C2c2 guide RNA array, e.g., a nucleic acid encoding a precursor C2c2 guide RNA array that includes sequence insertion sites for the insertion of guide sequences by a user).

1) Kit comprising a precursor C2c2 guide RNA array and a C2c2 protein

[00259] In some cases, the kit comprises: (a) a precursor C2c2 guide RNA array comprising two or more C2c2 guide RNAs each of which has a different guide sequence; and (b) a C2c2 protein, and/or a nucleic acid encoding said C2c2 protein. As noted above, in some cases such a kit further includes a labeled detector RNA (e.g., a labeled detector RNA comprising a fluorescence-emitting dye pair, i.e., a FRET pair and/or a quencher/fluor pair).

C2c2 protein

[00260] A C2c2 protein suitable for inclusion in a kit of the present disclosure binds to a C2c2 guide RNA, is guided to a single stranded target RNA by the guide RNA (which hybridizes to the target RNA), and is thereby 'activated.' If the HEPN1 and HEPN2 domains of the C2c2 protein are intact, once activated, the C2c2 protein cleaves the target RNA, but also cleaves non-target RNAs.

[00261] In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure includes an amino acid sequence having 80% or more (e.g., 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100%) amino acid sequence identity with the amino acid sequence set forth in any one of SEQ ID NOs: 1-6. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Listeria seeligeri C2c2 amino acid sequence set forth in SEQ ID NO:1. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Leptotrichia buccalis C2c2 amino acid sequence set forth in SEQ ID NO:2. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rhodobacter capsulatus C2c2 amino acid sequence set forth in SEQ ID NO:4. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Carnobacterium gallinarum C2c2 amino acid sequence set forth in SEQ ID NO:5.
In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Herbinix hemicellulosilytica C2c2 amino acid sequence set forth in SEQ ID NO:6. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure includes an amino acid sequence having 80% or more amino acid sequence identity with the Leptotrichia buccalis (Lbu) C2c2 amino acid sequence set forth in SEQ ID NO:2. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure is a Leptotrichia buccalis (Lbu) C2c2 protein (e.g., see SEQ ID NO:2). In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure includes the amino acid sequence set forth in any one of SEQ ID NOs: 1-2 and 4-6.

[00262] In some cases, a C2c2 protein included in a kit of the present disclosure is not a Leptotrichia shahii (Lsh) C2c2 protein. In some cases, a C2c2 protein included in a kit of the present disclosure is not a C2c2 polypeptide having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh C2c2 polypeptide set forth in SEQ ID NO:3.

[00263] In some cases, a C2c2 polypeptide included in a kit of the present disclosure is a variant C2c2 polypeptide. Variant C2c2 polypeptides suitable for inclusion in a kit of the present disclosure include variants of any one of SEQ ID NOs: 1, 2, and 4-6, where the variant C2c2 polypeptide exhibits reduced (or undetectable) nuclease activity. For example, in some cases, a variant C2c2 protein lacks a catalytically active HEPN1 domain. As another example, a variant C2c2 protein lacks a catalytically active HEPN2 domain. In some cases, a variant C2c2 protein lacks a catalytically active HEPN1 domain and lacks a catalytically active HEPN2 domain.

[00264] In some cases, a variant C2c2 polypeptide comprises amino acid substitutions of 1, 2, 3, or 4 of amino acids R472, H477, R1048, and H1053 of the amino acid sequence set forth in SEQ ID NO:2 (Leptotrichia buccalis C2c2), or a corresponding amino acid of a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. Corresponding amino acids in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6 are readily identified; see, e.g., FIG. 22B. For example, amino acid positions in SEQ ID NO:1 (Listeria seeligeri C2c2) that correspond to R472, H477, R1048, and H1053 of SEQ ID NO:2 are R445, H450, R1016, and H1021, respectively.
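The correspondence between HEPN residues across the listed sequences can be collected into a simple lookup. The Python sketch below uses only the residue positions recited in this document for SEQ ID NOs: 1, 2, 4, 5, and 6; the table name and function names are hypothetical, and the authoritative alignment remains the one referenced in FIG. 22B.

```python
# Residues listed in the order corresponding to R472, H477, R1048, H1053 of SEQ ID NO:2.
HEPN_RESIDUES = {
    1: ("R445", "H450", "R1016", "H1021"),  # Listeria seeligeri
    2: ("R472", "H477", "R1048", "H1053"),  # Leptotrichia buccalis
    4: ("R464", "H469", "R1052", "H1057"),  # Rhodobacter capsulatus
    5: ("R467", "H472", "R1069", "H1074"),  # Carnobacterium gallinarum
    6: ("R472", "H477", "R1044", "H1049"),  # Herbinix hemicellulosilytica
}


def corresponding_residue(residue_in_seq_id_2, target_seq_id):
    """Map a SEQ ID NO:2 residue (e.g., 'R1048') to the target sequence."""
    index = HEPN_RESIDUES[2].index(residue_in_seq_id_2)
    return HEPN_RESIDUES[target_seq_id][index]


def alanine_substitutions(seq_id):
    """Name the alanine substitution at each listed residue, e.g., 'R445A'."""
    return [residue + "A" for residue in HEPN_RESIDUES[seq_id]]


print(corresponding_residue("R1048", 1))
print(alanine_substitutions(4))
```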

[00265] In some cases, a variant C2c2 polypeptide comprises amino acid substitutions of amino acids R472 and H477 of the amino acid sequence set forth in SEQ ID NO:2, or corresponding amino acids of a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. In some cases, a variant C2c2 polypeptide comprises amino acid substitutions of amino acids R1048 and H1053 of the amino acid sequence set forth in SEQ ID NO:2, or corresponding amino acids of a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6. In some cases, a variant C2c2 polypeptide comprises amino acid substitutions of amino acids R472, H477, R1048, and H1053 of the amino acid sequence set forth in SEQ ID NO:2, or corresponding amino acids of a C2c2 amino acid sequence depicted in SEQ ID NO:1, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6.

[00266] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of amino acids R472 and H477. In some cases, the amino acid at position 472 is any amino acid other than Arg; and the amino acid at position 477 is any amino acid other than His. In some cases, the substitutions are R472A and H477A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of amino acids R1048 and H1053. In some cases, the amino acid at position 1048 is any amino acid other than Arg; and the amino acid at position 1053 is any amino acid other than His. In some cases, the substitutions are R1048A and H1053A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of amino acids R472, H477, R1048, and H1053. In some cases, the amino acid at positions 472 and 1048 is any amino acid other than Arg; and the amino acid at positions 477 and 1053 is any amino acid other than His. In some cases, the substitutions are R472A, H477A, R1048A, and H1053A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. 
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises substitution of amino acids R445 and H450. In some cases, the amino acid at position 445 is any amino acid other than Arg; and the amino acid at position 450 is any amino acid other than His. In some cases, the substitutions are R445A and H450A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises substitution of amino acids R1016 and H1021. In some cases, the amino acid at position 1016 is any amino acid other than Arg; and the amino acid at position 1021 is any amino acid other than His. In some cases, the substitutions are R1016A and H1021A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises substitution of amino acids R445, H450, R1016, and H1021. In some cases, the amino acid at positions 445 and 1016 is any amino acid other than Arg; and the amino acid at positions 450 and 1021 is any amino acid other than His. In some cases, the substitutions are R445A, H450A, R1016A, and H1021A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA.
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00268] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4 (Rhodobacter capsulatus C2c2), and comprises substitution of amino acids R464 and H469. In some cases, the amino acid at position 464 is any amino acid other than Arg; and the amino acid at position 469 is any amino acid other than His. In some cases, the substitutions are R464A and H469A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises substitution of amino acids R1052 and H1057. In some cases, the amino acid at position 1052 is any amino acid other than Arg; and the amino acid at position 1057 is any amino acid other than His. In some cases, the substitutions are R1052A and H1057A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises substitution of amino acids R464, H469, R1052, and H1057. In some cases, the amino acid at positions 464 and 1052 is any amino acid other than Arg; and the amino acid at positions 469 and 1057 is any amino acid other than His. In some cases, the substitutions are R464A, H469A, R1052A, and H1057A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

[00269] In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5 (Carnobacterium gallinarum C2c2), and comprises substitution of amino acids R467 and H472. In some cases, the amino acid at position 467 is any amino acid other than Arg; and the amino acid at position 472 is any amino acid other than His. In some cases, the substitutions are R467A and H472A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises substitution of amino acids R1069 and H1074. In some cases, the amino acid at position 1069 is any amino acid other than Arg; and the amino acid at position 1074 is any amino acid other than His. In some cases, the substitutions are R1069A and H1074A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises substitution of amino acids R467, H472, R1069, and H1074. In some cases, the amino acid at positions 467 and 1069 is any amino acid other than Arg; and the amino acid at positions 472 and 1074 is any amino acid other than His. In some cases, the substitutions are R467A, H472A, R1069A, and H1074A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA.
In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO: 6 (Herbinix hemicellulosilytica C2c2), and comprises substitution of amino acids R472 and H477. In some cases, the amino acid at position 472 is any amino acid other than Arg; and the amino acid at position 477 is any amino acid other than His. In some cases, the substitutions are R472A and H477A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises substitution of amino acids R1044 and H1049. In some cases, the amino acid at position 1044 is any amino acid other than Arg; and the amino acid at position 1049 is any amino acid other than His. In some cases, the substitutions are R1044A and H1049A. In some cases, a variant C2c2 polypeptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises substitution of amino acids R472, H477, R1044, and H1049. In some cases, the amino acid at positions 472 and 1044 is any amino acid other than Arg; and the amino acid at positions 477 and 1049 is any amino acid other than His. In some cases, the substitutions are R472A, H477A, R1044A, and H1049A. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), and retains the ability to bind C2c2 guide RNA and ss RNA. In some cases, the variant C2c2 polypeptide retains the ability to cleave precursor C2c2 guide RNA. In some cases, the variant C2c2 polypeptide has reduced or undetectable cleavage of ss RNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

2) Kit comprising a labeled detector RNA and a C2c2 protein

[00271] In some cases, the kit comprises: (a) a labeled detector RNA comprising a fluorescence-emitting dye pair, i.e., a FRET pair and/or a quencher/fluor pair; and (b) a C2c2 protein, and/or a nucleic acid encoding said C2c2 protein. In some cases, a kit further includes a C2c2 guide RNA, precursor C2c2 guide RNA array, and/or a nucleic acid encoding a constant region of a C2c2 guide RNA. As noted above, in some cases such a kit further includes (c) a C2c2 guide RNA (and/or a nucleic acid encoding a C2c2 guide RNA), and/or (d) a precursor C2c2 guide RNA (and/or a nucleic acid encoding a precursor C2c2 guide RNA), and/or (e) a precursor C2c2 guide RNA array (and/or a nucleic acid encoding a precursor C2c2 guide RNA array, e.g., a nucleic acid encoding a precursor C2c2 guide RNA array that includes sequence insertion sites for the insertion of guide sequences by a user).

Labeled detector RNA

[00272] In some cases, a kit of the present disclosure comprises a labeled detector RNA comprising a fluorescence-emitting dye pair, i.e., a FRET pair and/or a quencher/fluor pair. The labeled detector RNA produces an amount of detectable signal prior to being cleaved, and the amount of detectable signal that is measured is reduced when the labeled detector RNA is cleaved. In some cases, the labeled detector RNA produces a first detectable signal prior to being cleaved (e.g., from a FRET pair) and a second detectable signal when the labeled detector RNA is cleaved (e.g., from a quencher/fluor pair). As such, in some cases, the labeled detector RNA comprises a FRET pair and a quencher/fluor pair.

[00273] In some cases, the labeled detector RNA comprises a FRET pair. FRET is a process by which radiationless transfer of energy occurs from an excited state fluorophore to a second chromophore in close proximity. The range over which the energy transfer can take place is limited to approximately 10 nanometers (100 angstroms), and the efficiency of transfer is extremely sensitive to the separation distance between fluorophores. The donor-acceptor pair (a FRET donor moiety and a FRET acceptor moiety) is referred to herein as a "FRET pair" or a "signal FRET pair." Thus, in some cases, a subject labeled detector RNA includes two signal partners (a signal pair), where one signal partner is a FRET donor moiety and the other signal partner is a FRET acceptor moiety. A subject labeled detector RNA that includes such a FRET pair (a FRET donor moiety and a FRET acceptor moiety) will thus exhibit a detectable signal (a FRET signal) when the signal partners are in close proximity (e.g., while on the same RNA molecule), but the signal will be reduced (or absent) when the partners are separated (e.g., after cleavage of the RNA molecule by a C2c2 protein).

[00274] FRET donor and acceptor moieties (FRET pairs) will be known to one of ordinary skill in the art and any convenient FRET pair (e.g., any convenient donor and acceptor moiety pair) can be used. Examples of suitable FRET pairs include but are not limited to those presented in Table 1, above.
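The distance sensitivity of FRET noted above is commonly described by the Förster relation, E = 1 / (1 + (r / R0)^6), where r is the donor-acceptor separation and R0 is the Förster radius of the pair (the separation at which transfer efficiency is 50%). This relation is standard FRET theory and is not recited in the present disclosure. The Python sketch below is illustrative only; the function name and the 5 nm Förster radius are assumptions chosen for the example, and real donor-acceptor pairs have their own R0 values.

```python
def fret_efficiency(distance_nm, forster_radius_nm):
    """Förster relation: E = 1 / (1 + (r / R0)**6)."""
    if distance_nm < 0 or forster_radius_nm <= 0:
        raise ValueError("distance must be >= 0 and Förster radius must be > 0")
    return 1.0 / (1.0 + (distance_nm / forster_radius_nm) ** 6)


# Hypothetical Förster radius used only for illustration.
ASSUMED_R0_NM = 5.0

for r in (2.5, 5.0, 10.0):
    print(f"r = {r:4.1f} nm -> E = {fret_efficiency(r, ASSUMED_R0_NM):.3f}")
```

Because efficiency falls with the sixth power of distance, separating the partners by cleavage of the detector RNA is expected to reduce the FRET signal sharply.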

[00275] In some cases, a detectable signal is produced when the labeled detector RNA is cleaved (e.g., in some cases, the labeled detector RNA comprises a quencher/fluor pair). One signal partner of a signal quenching pair produces a detectable signal and the other signal partner is a quencher moiety that quenches the detectable signal of the first signal partner (i.e., the quencher moiety quenches the signal of the signal moiety such that the signal from the signal moiety is reduced (quenched) when the signal partners of the signal pair are in close proximity to one another).

[00276] For example, in some cases, an amount of detectable signal increases when the labeled detector RNA is cleaved. For example, in some cases, the signal exhibited by one signal partner (a signal moiety) is quenched by the other signal partner (a quencher signal moiety), e.g., when both are present on the same RNA molecule prior to cleavage by a C2c2 protein. Such a signal pair is referred to herein as a "quencher/fluor pair", "quenching pair", or "signal quenching pair." For example, in some cases, one signal partner (e.g., the first signal partner) is a signal moiety that produces a detectable signal that is quenched by the second signal partner (e.g., a quencher moiety). The signal partners of such a quencher/fluor pair will thus produce a detectable signal when the partners are separated (e.g., after cleavage of the detector RNA by a C2c2 protein), but the signal will be quenched when the partners are in close proximity (e.g., prior to cleavage of the detector RNA by a C2c2 protein).

[00277] A quencher moiety can quench a signal from the signal moiety (e.g., prior to cleavage of the detector RNA by a C2c2 protein) to various degrees. In some cases, a quencher moiety quenches the signal from the signal moiety where the signal detected in the presence of the quencher moiety (when the signal partners are in proximity to one another) is 95% or less of the signal detected in the absence of the quencher moiety (when the signal partners are separated). For example, in some cases, the signal detected in the presence of the quencher moiety can be 90% or less, 80% or less, 70% or less, 60% or less, 50% or less, 40% or less, 30% or less, 20% or less, 15% or less, 10% or less, or 5% or less of the signal detected in the absence of the quencher moiety. In some cases, no signal (e.g., above background) is detected in the presence of the quencher moiety.

[00278] In some cases, the signal detected in the absence of the quencher moiety (when the signal partners are separated) is at least 1.2 fold greater (e.g., at least 1.3 fold, at least 1.5 fold, at least 1.7 fold, at least 2 fold, at least 2.5 fold, at least 3 fold, at least 3.5 fold, at least 4 fold, at least 5 fold, at least 7 fold, at least 10 fold, at least 20 fold, or at least 50 fold greater) than the signal detected in the presence of the quencher moiety (when the signal partners are in proximity to one another).
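The percentage and fold-change criteria above are two views of the same pair of measurements. The following Python sketch, offered only as an illustration, computes both from a signal measured with the quencher in proximity and a signal measured after separation. The function names are hypothetical, the inputs are assumed to be positive, background-corrected values, and the default thresholds are simply the least stringent values recited above (95% or less; at least 1.2 fold).

```python
def quenching_metrics(signal_quenched, signal_unquenched):
    """Return (percent_of_unquenched, fold_increase) for two measured signals."""
    if signal_quenched <= 0 or signal_unquenched <= 0:
        raise ValueError("signals must be positive, background-corrected values")
    percent = 100.0 * signal_quenched / signal_unquenched
    fold = signal_unquenched / signal_quenched
    return percent, fold


def meets_thresholds(signal_quenched, signal_unquenched,
                     max_percent=95.0, min_fold=1.2):
    """Check both recited criteria against caller-supplied thresholds."""
    percent, fold = quenching_metrics(signal_quenched, signal_unquenched)
    return percent <= max_percent and fold >= min_fold


# Hypothetical readings in arbitrary fluorescence units.
print(quenching_metrics(200.0, 1000.0))
print(meets_thresholds(200.0, 1000.0))
```

A stricter embodiment can be checked by passing other recited values, for example `max_percent=10.0` or `min_fold=10.0`.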

[00279] In some cases, the signal moiety is a fluorescent label. In some such cases, the quencher moiety quenches the signal (the light signal) from the fluorescent label (e.g., by absorbing energy in the emission spectra of the label). Thus, when the quencher moiety is not in proximity with the signal moiety, the emission (the signal) from the fluorescent label is detectable because the signal is not absorbed by the quencher moiety. Any convenient donor-acceptor pair (signal moiety/quencher moiety pair) can be used and many suitable pairs are known in the art.

[00280] In some cases the quencher moiety absorbs energy from the signal moiety (also referred to herein as a "detectable label") and then emits a signal (e.g., light at a different wavelength). Thus, in some cases, the quencher moiety is itself a signal moiety (e.g., a signal moiety can be 6-carboxyfluorescein while the quencher moiety can be 6-carboxy-tetramethylrhodamine), and in some such cases, the pair could also be a FRET pair. In some cases, a quencher moiety is a dark quencher. A dark quencher can absorb excitation energy and dissipate the energy in a different way (e.g., as heat). Thus, a dark quencher has minimal to no fluorescence of its own (does not emit fluorescence). Examples of dark quenchers are further described in U.S. patent numbers 8,822,673 and 8,586,718; U.S. patent publications 20140378330, 20140349295, and 20140194611; and international patent applications: WO200142505 and WO200186001, all of which are hereby incorporated by reference in their entirety.

[00281] Examples of fluorescent labels include, but are not limited to: an Alexa Fluor® dye, an ATTO dye (e.g., ATTO 390, ATTO 425, ATTO 465, ATTO 488, ATTO 495, ATTO 514, ATTO 520, ATTO 532, ATTO Rho6G, ATTO 542, ATTO 550, ATTO 565, ATTO Rho3B, ATTO Rho11, ATTO Rho12, ATTO Thio12, ATTO Rho101, ATTO 590, ATTO 594, ATTO Rho13, ATTO 610, ATTO 620, ATTO Rho14, ATTO 633, ATTO 647, ATTO 647N, ATTO 655, ATTO Oxa12, ATTO 665, ATTO 680, ATTO 700, ATTO 725, ATTO 740), a DyLight dye, a cyanine dye (e.g., Cy2, Cy3, Cy3.5, Cy3b, Cy5, Cy5.5, Cy7, Cy7.5), a FluoProbes dye, a Sulfo Cy dye, a Seta dye, an IRIS Dye, a SeTau dye, an SRfluor dye, a Square dye, fluorescein (FITC), tetramethylrhodamine (TRITC), Texas Red, Oregon Green, Pacific Blue, Pacific Green, Pacific Orange, quantum dots, and a tethered fluorescent protein.

[00282] In some cases, a detectable label is a fluorescent label selected from: an Alexa Fluor® dye, an ATTO dye (e.g., ATTO 390, ATTO 425, ATTO 465, ATTO 488, ATTO 495, ATTO 514, ATTO 520, ATTO 532, ATTO Rho6G, ATTO 542, ATTO 550, ATTO 565, ATTO Rho3B, ATTO Rho11, ATTO Rho12, ATTO Thio12, ATTO Rho101, ATTO 590, ATTO 594, ATTO Rho13, ATTO 610, ATTO 620, ATTO Rho14, ATTO 633, ATTO 647, ATTO 647N, ATTO 655, ATTO Oxa12, ATTO 665, ATTO 680, ATTO 700, ATTO 725, ATTO 740), a DyLight dye, a cyanine dye (e.g., Cy2, Cy3, Cy3.5, Cy3b, Cy5, Cy5.5, Cy7, Cy7.5), a FluoProbes dye, a Sulfo Cy dye, a Seta dye, an IRIS Dye, a SeTau dye, an SRfluor dye, a Square dye, fluorescein (FITC), tetramethylrhodamine (TRITC), Texas Red, Oregon Green, Pacific Blue, Pacific Green, and Pacific Orange.

[00283] In some cases, a detectable label is a fluorescent label selected from: an Alexa Fluor® dye, an ATTO dye (e.g., ATTO 390, ATTO 425, ATTO 465, ATTO 488, ATTO 495, ATTO 514, ATTO 520, ATTO 532, ATTO Rho6G, ATTO 542, ATTO 550, ATTO 565, ATTO Rho3B, ATTO Rho11, ATTO Rho12, ATTO Thio12, ATTO Rho101, ATTO 590, ATTO 594, ATTO Rho13, ATTO 610, ATTO 620, ATTO Rho14, ATTO 633, ATTO 647, ATTO 647N, ATTO 655, ATTO Oxa12, ATTO 665, ATTO 680, ATTO 700, ATTO 725, ATTO 740), a DyLight dye, a cyanine dye (e.g., Cy2, Cy3, Cy3.5, Cy3b, Cy5, Cy5.5, Cy7, Cy7.5), a FluoProbes dye, a Sulfo Cy dye, a Seta dye, an IRIS Dye, a SeTau dye, an SRfluor dye, a Square dye, fluorescein (FITC), tetramethylrhodamine (TRITC), Texas Red, Oregon Green, Pacific Blue, Pacific Green, Pacific Orange, a quantum dot, and a tethered fluorescent protein.

[00284] Examples of ATTO dyes include, but are not limited to: ATTO 390, ATTO 425, ATTO 465, ATTO 488, ATTO 495, ATTO 514, ATTO 520, ATTO 532, ATTO Rho6G, ATTO 542, ATTO 550, ATTO 565, ATTO Rho3B, ATTO Rho11, ATTO Rho12, ATTO Thio12, ATTO Rho101, ATTO 590, ATTO 594, ATTO Rho13, ATTO 610, ATTO 620, ATTO Rho14, ATTO 633, ATTO 647, ATTO 647N, ATTO 655, ATTO Oxa12, ATTO 665, ATTO 680, ATTO 700, ATTO 725, and ATTO 740.

[00285] Examples of Alexa Fluor® dyes include, but are not limited to: Alexa Fluor® 350, Alexa Fluor® 405, Alexa Fluor® 430, Alexa Fluor® 488, Alexa Fluor® 500, Alexa Fluor® 514, Alexa Fluor® 532, Alexa Fluor® 546, Alexa Fluor® 555, Alexa Fluor® 568, Alexa Fluor® 594, Alexa Fluor® 610, Alexa Fluor® 633, Alexa Fluor® 635, Alexa Fluor® 647, Alexa Fluor® 660, Alexa Fluor® 680, Alexa Fluor® 700, Alexa Fluor® 750, Alexa Fluor® 790, and the like.

[00286] Examples of quencher moieties include, but are not limited to: a dark quencher, a Black Hole Quencher® (BHQ®) (e.g., BHQ-0, BHQ-1, BHQ-2, BHQ-3), a QXL quencher, an ATTO quencher (e.g., ATTO 540Q, ATTO 580Q, and ATTO 612Q), dimethylaminoazobenzenesulfonic acid (Dabsyl), Iowa Black RQ, Iowa Black FQ, IRDye QC-1, a QSY dye (e.g., QSY 7, QSY 9, QSY 21), AbsoluteQuencher, Eclipse, and metal clusters such as gold nanoparticles, and the like.

[00287] In some cases, a quencher moiety is selected from: a dark quencher, a Black Hole Quencher® (BHQ®) (e.g., BHQ-0, BHQ-1, BHQ-2, BHQ-3), a QXL quencher, an ATTO quencher (e.g., ATTO 540Q, ATTO 580Q, and ATTO 612Q), dimethylaminoazobenzenesulfonic acid (Dabsyl), Iowa Black RQ, Iowa Black FQ, IRDye QC-1, a QSY dye (e.g., QSY 7, QSY 9, QSY 21), AbsoluteQuencher, Eclipse, and a metal cluster.

[00288] Examples of an ATTO quencher include, but are not limited to: ATTO 540Q, ATTO 580Q, and ATTO 612Q. Examples of a Black Hole Quencher® (BHQ®) include, but are not limited to: BHQ-0 (493 nm), BHQ-1 (534 nm), BHQ-2 (579 nm) and BHQ-3 (672 nm).
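One way a user might use the wavelengths recited for the Black Hole Quencher® dyes is to pick the quencher whose value lies closest to the emission maximum of the chosen fluorescent label. The Python sketch below is a simplified illustration, not a method recited in the disclosure: it assumes the recited wavelengths are absorption maxima, the names `BHQ_ABSORPTION_MAX_NM` and `closest_bhq` are hypothetical, and the 520 nm emission value is a made-up input. In practice, quencher choice depends on overall spectral overlap and not on a single wavelength.

```python
# Wavelengths (nm) recited in the text for each Black Hole Quencher dye,
# assumed here to be absorption maxima.
BHQ_ABSORPTION_MAX_NM = {
    "BHQ-0": 493,
    "BHQ-1": 534,
    "BHQ-2": 579,
    "BHQ-3": 672,
}


def closest_bhq(emission_max_nm):
    """Return the BHQ whose recited wavelength is nearest the emission maximum."""
    return min(
        BHQ_ABSORPTION_MAX_NM,
        key=lambda name: abs(BHQ_ABSORPTION_MAX_NM[name] - emission_max_nm),
    )


# Hypothetical emission maximum supplied by the user, in nm.
print(closest_bhq(520))
```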

[00289] For examples of some detectable labels (e.g., fluorescent dyes) and/or quencher moieties, see, e.g., Bao et al., Annu Rev Biomed Eng. 2009;11:25-47; as well as U.S. patent numbers 8,822,673 and 8,586,718; U.S. patent publications 20140378330, 20140349295, 20140194611, 20130323851, 20130224871, 20110223677, 20110190486, 20110172420, 20060179585 and 20030003486; and international patent applications: WO200142505 and WO200186001, all of which are hereby incorporated by reference in their entirety.

Nucleic acid modifications

[00290] In some cases, a labeled detector RNA comprises one or more modifications, e.g., a base modification, a backbone modification, a sugar modification, etc., to provide the nucleic acid with a new or enhanced feature (e.g., improved stability). As is known in the art, a nucleoside is a base-sugar combination. The base portion of the nucleoside is normally a heterocyclic base. The two most common classes of such heterocyclic bases are the purines and the pyrimidines. Nucleotides are nucleosides that further include a phosphate group covalently linked to the sugar portion of the nucleoside. For those nucleosides that include a pentofuranosyl sugar, the phosphate group can be linked to the 2', the 3', or the 5' hydroxyl moiety of the sugar. In forming oligonucleotides, the phosphate groups covalently link adjacent nucleosides to one another to form a linear polymeric compound. In turn, the respective ends of this linear polymeric compound can be further joined to form a circular compound; however, linear compounds are generally suitable. In addition, linear compounds may have internal nucleotide base complementarity and may therefore fold in a manner as to produce a fully or partially double-stranded compound. Within oligonucleotides, the phosphate groups are commonly referred to as forming the internucleoside backbone of the oligonucleotide. The normal linkage or backbone of RNA and DNA is a 3' to 5' phosphodiester linkage.

Modified backbones and modified internucleoside linkages

[00291] Examples of suitable modifications include modified nucleic acid backbones and non-natural internucleoside linkages. Nucleic acids having modified backbones include those that retain a phosphorus atom in the backbone and those that do not have a phosphorus atom in the backbone.

[00292] Suitable modified oligonucleotide backbones containing a phosphorus atom therein include, for example, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3'-alkylene phosphonates, 5'-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3'-amino phosphoramidate and aminoalkylphosphoramidates, phosphorodiamidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, selenophosphates and boranophosphates having normal 3'-5' linkages, 2'-5' linked analogs of these, and those having inverted polarity wherein one or more internucleotide linkages is a 3' to 3', 5' to 5' or 2' to 2' linkage. Suitable oligonucleotides having inverted polarity comprise a single 3' to 3' linkage at the 3'-most internucleotide linkage, i.e., a single inverted nucleoside residue which may be abasic (the nucleobase is missing or has a hydroxyl group in place thereof). Various salts (such as, for example, potassium or sodium), mixed salts and free acid forms are also included.

[00293] In some cases, a labeled detector RNA comprises one or more phosphorothioate and/or heteroatom internucleoside linkages, in particular -CH₂-NH-O-CH₂-, -CH₂-N(CH₃)-O-CH₂- (known as a methylene (methylimino) or MMI backbone), -CH₂-O-N(CH₃)-CH₂-, -CH₂-N(CH₃)-N(CH₃)-CH₂- and -O-N(CH₃)-CH₂-CH₂- (wherein the native phosphodiester internucleotide linkage is represented as -O-P(=O)(OH)-O-CH₂-). MMI type internucleoside linkages are disclosed in the above referenced U.S. Pat. No. 5,489,677. Suitable amide internucleoside linkages are disclosed in U.S. Pat. No. 5,602,240.

[00294] Also suitable are nucleic acids having morpholino backbone structures as described in, e.g., U.S. Pat. No. 5,034,506. For example, in some cases, a labeled detector RNA comprises a 6-membered morpholino ring in place of a ribose ring. In some cases, a phosphorodiamidate or other non-phosphodiester internucleoside linkage replaces a phosphodiester linkage.

[00295] Suitable modified polynucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; riboacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH₂ component parts.

Mimetics

[00296] A labeled detector RNA can be a nucleic acid mimetic. The term "mimetic" as it is applied to polynucleotides is intended to include polynucleotides wherein only the furanose ring or both the furanose ring and the internucleotide linkage are replaced with non-furanose groups; replacement of only the furanose ring is also referred to in the art as a sugar surrogate. The heterocyclic base moiety or a modified heterocyclic base moiety is maintained for hybridization with an appropriate target nucleic acid. One such nucleic acid, a polynucleotide mimetic that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA, the sugar-backbone of a polynucleotide is replaced with an amide containing backbone, in particular an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone.

[00297] One polynucleotide mimetic that has been reported to have excellent hybridization properties is a peptide nucleic acid (PNA). The backbone in PNA compounds is two or more linked aminoethylglycine units, which gives PNA an amide containing backbone. The heterocyclic base moieties are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone. Representative U.S. patents that describe the preparation of PNA compounds include, but are not limited to: U.S. Pat. Nos. 5,539,082; 5,714,331; and 5,719,262.

[00298] Another class of polynucleotide mimetic that has been studied is based on linked morpholino units (morpholino nucleic acid) having heterocyclic bases attached to the morpholino ring. A number of linking groups have been reported that link the morpholino monomeric units in a morpholino nucleic acid. One class of linking groups has been selected to give a non-ionic oligomeric compound. The non-ionic morpholino-based oligomeric compounds are less likely to have undesired interactions with cellular proteins. Morpholino-based polynucleotides are non-ionic mimics of oligonucleotides which are less likely to form undesired interactions with cellular proteins (Dwaine A. Braasch and David R. Corey, Biochemistry, 2002, 41(14), 4503-4510). Morpholino-based polynucleotides are disclosed in U.S. Pat. No. 5,034,506. A variety of compounds within the morpholino class of polynucleotides have been prepared, having a variety of different linking groups joining the monomeric subunits.

[00299] A further class of polynucleotide mimetic is referred to as cyclohexenyl nucleic acids (CeNA). The furanose ring normally present in a DNA/RNA molecule is replaced with a cyclohexenyl ring. CeNA DMT protected phosphoramidite monomers have been prepared and used for oligomeric compound synthesis following classical phosphoramidite chemistry. Fully modified CeNA oligomeric compounds and oligonucleotides having specific positions modified with CeNA have been prepared and studied (see Wang et al., J. Am. Chem. Soc., 2000, 122, 8595-8602). In general, the incorporation of CeNA monomers into a DNA chain increases the stability of a DNA/RNA hybrid. CeNA oligoadenylates formed complexes with RNA and DNA complements with similar stability to the native complexes. The study of incorporating CeNA structures into natural nucleic acid structures was shown by NMR and circular dichroism to proceed with easy conformational adaptation.

[00300] A further modification includes Locked Nucleic Acids (LNAs) in which the 2'-hydroxyl group is linked to the 4' carbon atom of the sugar ring, thereby forming a 2'-C,4'-C-oxymethylene linkage and thus a bicyclic sugar moiety. The linkage can be a methylene (-CH₂-)ₙ group bridging the 2' oxygen atom and the 4' carbon atom, wherein n is 1 or 2 (Singh et al., Chem. Commun., 1998, 4, 455-456). LNA and LNA analogs display very high duplex thermal stabilities with complementary DNA and RNA (Tm = +3 to +10° C), stability towards 3'-exonucleolytic degradation and good solubility properties. Potent and nontoxic antisense oligonucleotides containing LNAs have been described (Wahlestedt et al., Proc. Natl. Acad. Sci. U.S.A., 2000, 97, 5633-5638).

[00301] The synthesis and preparation of the LNA monomers adenine, cytosine, guanine, 5-methyl-cytosine, thymine and uracil, along with their oligomerization and nucleic acid recognition properties, have been described (Koshkin et al., Tetrahedron, 1998, 54, 3607-3630). LNAs and preparation thereof are also described in WO 98/39352 and WO 99/14226.

Modified sugar moieties

[00302] A labeled detector RNA can also include one or more substituted sugar moieties. Suitable polynucleotides comprise a sugar substituent group selected from: OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or unsubstituted C₁ to C₁₀ alkyl or C₂ to C₁₀ alkenyl and alkynyl. Particularly suitable are O((CH₂)ₙO)ₘCH₃, O(CH₂)ₙOCH₃, O(CH₂)ₙNH₂, O(CH₂)ₙCH₃, O(CH₂)ₙONH₂, and O(CH₂)ₙON((CH₂)ₙCH₃)₂, where n and m are from 1 to about 10. Other suitable polynucleotides comprise a sugar substituent group selected from: C₁ to C₁₀ lower alkyl, substituted lower alkyl, alkenyl, alkynyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH₃, OCN, Cl, Br, CN, CF₃, OCF₃, SOCH₃, SO₂CH₃, ONO₂, NO₂, N₃, NH₂, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and other substituents having similar properties. A suitable modification includes 2'-methoxyethoxy (2'-O-CH₂CH₂OCH₃, also known as 2'-O-(2-methoxyethyl) or 2'-MOE) (Martin et al., Helv. Chim. Acta, 1995, 78, 486-504), i.e., an alkoxyalkoxy group. A further suitable modification includes 2'-dimethylaminooxyethoxy, i.e., a O(CH₂)₂ON(CH₃)₂ group, also known as 2'-DMAOE, as described in examples hereinbelow, and 2'-dimethylaminoethoxyethoxy (also known in the art as 2'-O-dimethyl-amino-ethoxy-ethyl or 2'-DMAEOE), i.e., 2'-O-CH₂-O-CH₂-N(CH₃)₂.

[00303] Other suitable sugar substituent groups include methoxy (-O-CH₃), aminopropoxy (-O-CH₂CH₂CH₂NH₂), allyl (-CH₂-CH=CH₂), -O-allyl (-O-CH₂-CH=CH₂) and fluoro (F). 2'-sugar substituent groups may be in the arabino (up) position or ribo (down) position. A suitable 2'-arabino modification is 2'-F. Similar modifications may also be made at other positions on the oligomeric compound, particularly the 3' position of the sugar on the 3' terminal nucleoside or in 2'-5' linked oligonucleotides and the 5' position of the 5' terminal nucleotide. Oligomeric compounds may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar.

Base modifications and substitutions

[00304] A labeled detector RNA may also include nucleobase (often referred to in the art simply as "base") modifications or substitutions. As used herein, "unmodified" or "natural" nucleobases include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases include other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl (-C≡C-CH₃) uracil and cytosine and other alkynyl derivatives of pyrimidine bases, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 2-F-adenine, 2-aminoadenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further modified nucleobases include tricyclic pyrimidines such as phenoxazine cytidine (1H-pyrimido(5,4-b)(1,4)benzoxazin-2(3H)-one), phenothiazine cytidine (1H-pyrimido(5,4-b)(1,4)benzothiazin-2(3H)-one), G-clamps such as a substituted phenoxazine cytidine (e.g. 9-(2-aminoethoxy)-H-pyrimido(5,4-b)(1,4)benzoxazin-2(3H)-one), carbazole cytidine (2H-pyrimido(4,5-b)indol-2-one), pyridoindole cytidine (H-pyrido(3',2':4,5)pyrrolo(2,3-d)pyrimidin-2-one).

[00305] Heterocyclic base moieties may also include those in which the purine or pyrimidine base is replaced with other heterocycles, for example 7-deaza-adenine, 7-deazaguanosine, 2-aminopyridine and 2-pyridone. Further nucleobases include those disclosed in U.S. Pat. No. 3,687,808, those disclosed in The Concise Encyclopedia Of Polymer Science And Engineering, pages 858-859, Kroschwitz, J. I., ed. John Wiley & Sons, 1990, those disclosed by Englisch et al., Angewandte Chemie, International Edition, 1991, 30, 613, and those disclosed by Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages 289-302, Crooke, S. T. and Lebleu, B., ed., CRC Press, 1993. Certain of these nucleobases are useful for increasing the binding affinity of an oligomeric compound. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2° C. (Sanghvi et al., eds., Antisense Research and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are suitable base substitutions, e.g., when combined with 2'-O-methoxyethyl sugar modifications.

3) Kit comprising two different C2c2 proteins, and two different labeled detector RNAs

[00306] In some cases, a subject kit comprises: (a) a first labeled detector RNA that lacks U but comprises at least one A (e.g., at least 2, at least 3, or at least 4 As) and comprises a first FRET pair and/or a first quencher/fluor pair; (b) a second labeled detector RNA that lacks A but comprises at least one U (e.g., at least 2, at least 3, or at least 4 Us) and comprises a second FRET pair and/or a second quencher/fluor pair; (c) a first C2c2 protein, and/or a nucleic acid encoding said first C2c2 protein, wherein the first C2c2 protein cleaves adenine⁺ RNAs (RNAs that include A) when activated but does not cleave RNAs that lack A (e.g., polyU RNAs) [e.g., the first C2c2 protein can cleave the first labeled detector RNA but not the second labeled detector RNA]; and (d) a second C2c2 protein, and/or a nucleic acid encoding said second C2c2 protein, wherein the second C2c2 protein cleaves uracil⁺ RNAs (RNAs that include U) when activated but does not cleave RNAs that lack U (e.g., polyA RNAs) [e.g., the second C2c2 protein can cleave the second labeled detector RNA but not the first labeled detector RNA].

[00307] In some cases, the first labeled detector RNA lacks U and includes a stretch of from 2 to 15 consecutive As (e.g., from 2 to 12, 2 to 10, 2 to 8, 2 to 6, 2 to 4, 3 to 15, 3 to 12, 3 to 10, 3 to 8, 3 to 6, 3 to 5, 4 to 15, 4 to 12, 4 to 10, 4 to 8, or 4 to 6 consecutive As). In some cases, the first labeled detector RNA lacks U and includes a stretch of at least 2 consecutive As (e.g., at least 3, at least 4, or at least 5 consecutive As). In some cases, the second labeled detector RNA lacks A and includes a stretch of from 2 to 15 consecutive Us (e.g., from 2 to 12, 2 to 10, 2 to 8, 2 to 6, 2 to 4, 3 to 15, 3 to 12, 3 to 10, 3 to 8, 3 to 6, 3 to 5, 4 to 15, 4 to 12, 4 to 10, 4 to 8, or 4 to 6 consecutive Us). In some cases, the second labeled detector RNA lacks A and includes a stretch of at least 2 consecutive Us (e.g., at least 3, at least 4, or at least 5 consecutive Us).
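By way of illustration only, the composition rules for the two detector RNAs can be checked with a short Python sketch. The function names, the example sequences, and the default minimum stretch length of 2 are assumptions chosen for this example and are not part of the disclosure.

```python
import re


def longest_run(seq: str, base: str) -> int:
    """Length of the longest stretch of consecutive `base` residues in `seq`."""
    runs = re.findall(f"{base}+", seq.upper())
    return max((len(run) for run in runs), default=0)


def detector_class(seq: str, min_run: int = 2) -> str:
    """Classify a candidate detector RNA sequence.

    "first"  : lacks U and has a stretch of at least `min_run` consecutive As
    "second" : lacks A and has a stretch of at least `min_run` consecutive Us
    "neither": anything else
    """
    seq = seq.upper()
    has_a, has_u = "A" in seq, "U" in seq
    if has_a and not has_u and longest_run(seq, "A") >= min_run:
        return "first"
    if has_u and not has_a and longest_run(seq, "U") >= min_run:
        return "second"
    return "neither"


if __name__ == "__main__":
    # Hypothetical sequences, used only to exercise the checks.
    for candidate in ("CCAAAAACC", "GGUUUUUGG", "GAUC"):
        print(candidate, detector_class(candidate))
```

Setting `min_run` to 3, 4, or 5 corresponds to the narrower "at least 3, at least 4, or at least 5" alternatives; an upper bound on the stretch (e.g., 15) could be added in the same way.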

[00308] In some cases, such a kit further includes: (e) a first C2c2 guide RNA (and/or a nucleic acid encoding the first C2c2 guide RNA, e.g., a nucleic acid comprising a nucleotide sequence encoding the first C2c2 guide RNA, where the nucleic acid includes a sequence insertion site for the insertion of a guide sequence (e.g., a nucleotide sequence that hybridizes to a target RNA) by a user); and (f) a second C2c2 guide RNA (and/or a nucleic acid encoding the second C2c2 guide RNA, e.g., a nucleic acid comprising a nucleotide sequence encoding the second C2c2 guide RNA, where the nucleic acid includes a sequence insertion site for the insertion of a guide sequence (e.g., a nucleotide sequence that hybridizes to a target RNA) by a user). The first C2c2 guide RNA comprises a first nucleotide sequence that hybridizes with a first single stranded target RNA and a second nucleotide sequence that binds to the first C2c2 protein. The second C2c2 guide RNA comprises a first nucleotide sequence that hybridizes with a second single stranded target RNA and a second nucleotide sequence that binds to the second C2c2 protein. The first C2c2 protein is not activated by the second C2c2 guide RNA, and the first C2c2 protein cleaves ssRNA that includes at least one A (e.g., does not cleave ssRNA that lacks A). The second C2c2 protein is not activated by the first C2c2 guide RNA, and the second C2c2 protein cleaves ssRNA that includes at least one U (e.g., does not cleave ssRNA that lacks U).
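The two-protein arrangement described above amounts to two independent readout channels. The following Python sketch is a simplified logical model of that arrangement, not a description of any assay protocol: each protein is treated as activated only when the target RNA of its own guide RNA is present, and an activated protein is treated as cleaving any detector RNA containing its preferred base. The class name, field names, target labels, and detector sequences are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Channel:
    """One C2c2 protein together with its own guide RNA (illustrative model)."""
    name: str
    target: str        # label of the target RNA the guide hybridizes to
    cleaves_base: str  # "A" for the first protein, "U" for the second


def cleaved_detectors(channels, detectors, targets_present):
    """Return the names of detector RNAs expected to be cleaved.

    `detectors` maps a detector name to its RNA sequence;
    `targets_present` is the set of target labels present in the sample.
    """
    cleaved = set()
    for channel in channels:
        if channel.target not in targets_present:
            continue  # protein is not activated without its target RNA
        for name, seq in detectors.items():
            if channel.cleaves_base in seq.upper():
                cleaved.add(name)
    return cleaved


if __name__ == "__main__":
    channels = [Channel("first", "target1", "A"), Channel("second", "target2", "U")]
    detectors = {"detector1": "CCAAAAACC", "detector2": "GGUUUUUGG"}
    print(cleaved_detectors(channels, detectors, {"target1"}))
```

Because the first detector lacks U and the second lacks A, each target RNA maps to exactly one detector in this model, which is what allows two targets to be read out in a single sample.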

[00309] The following are non-limiting examples (listed as a) through w), below) of first and second C2c2 proteins suitable for inclusion in a kit of the present disclosure:

[00310] a) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Hhe Cas13a amino acid sequence depicted in FIG. 56K;

[00311] b) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rca Cas13a amino acid sequence depicted in FIG. 56G;

[00312] c) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ppr Cas13a amino acid sequence depicted in FIG. 56B;

[00313] d) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lne Cas13a amino acid sequence depicted in FIG. 56I;

[00314] e) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lbu Cas13a amino acid sequence depicted in FIG. 56C;

[00315] f) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lwa Cas13a amino acid sequence depicted in FIG. 56E;

[00316] g) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh Cas13a amino acid sequence depicted in FIG. 56D;

[00317] h) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Hhe Cas13a amino acid sequence depicted in FIG. 56K;

[00318] i) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rca Cas13a amino acid sequence depicted in FIG. 56G;

[00319] j) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ppr Cas13a amino acid sequence depicted in FIG. 56B;

[00320] k) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lne Cas13a amino acid sequence depicted in FIG. 56I;

[00321] l) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lbu Cas13a amino acid sequence depicted in FIG. 56C;

[00322] m) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lwa Cas13a amino acid sequence depicted in FIG. 56E;

[00323] n) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh Cas13a amino acid sequence depicted in FIG. 56D;

[00324] o) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lse Cas13a amino acid sequence depicted in FIG. 56A;

[00325] p) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Hhe Cas13a amino acid sequence depicted in FIG. 56K;

[00326] q) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rca Cas13a amino acid sequence depicted in FIG. 56G;

[00327] r) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ppr Cas13a amino acid sequence depicted in FIG. 56B;

[00328] s) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lne Cas13a amino acid sequence depicted in FIG. 56I;

[00329] t) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lbu Cas13a amino acid sequence depicted in FIG. 56C;

[00330] u) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lwa Cas13a amino acid sequence depicted in FIG. 56E;

[00331] v) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh Cas13a amino acid sequence depicted in FIG. 56D; or

[00332] w) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Cam Cas13a amino acid sequence depicted in FIG. 56H; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lse Cas13a amino acid sequence depicted in FIG. 56A.
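The a) through w) combinations above can be tabulated compactly. The Python sketch below regenerates the pairings from the three first-protein sources (Lba, Ere, Cam) and the second-protein sources named in the list, together with the FIG. 56 panel cited for each protein. The variable and function names are illustrative only, and the skipped Lba/Lse pairing simply reflects that no such entry appears in the list.

```python
import string

# FIG. 56 panel cited for each C2c2 source in entries a) through w).
FIG_PANEL = {
    "Lse": "56A", "Ppr": "56B", "Lbu": "56C", "Lsh": "56D",
    "Lwa": "56E", "Lba": "56F", "Rca": "56G", "Cam": "56H",
    "Lne": "56I", "Ere": "56J", "Hhe": "56K",
}

FIRST_SOURCES = ["Lba", "Ere", "Cam"]
SECOND_SOURCES = ["Hhe", "Rca", "Ppr", "Lne", "Lbu", "Lwa", "Lsh", "Lse"]


def kit_pairs():
    """Return (first, second) source pairs in the order of entries a) to w)."""
    return [
        (first, second)
        for first in FIRST_SOURCES
        for second in SECOND_SOURCES
        if (first, second) != ("Lba", "Lse")  # no such entry in the list
    ]


def labeled_pairs():
    """Map entry letters 'a' through 'w' to their (first, second) pair."""
    return dict(zip(string.ascii_lowercase, kit_pairs()))


if __name__ == "__main__":
    for letter, (first, second) in labeled_pairs().items():
        print(f"{letter}) first: {first} (FIG. {FIG_PANEL[first]}); "
              f"second: {second} (FIG. {FIG_PANEL[second]})")
```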

C2c2 protein

[00333] A C2c2 protein suitable for inclusion in a kit of the present disclosure binds to a C2c2 guide RNA, is guided to a single stranded target RNA by the guide RNA (which hybridizes to the target RNA), and is thereby 'activated.' If the HEPN1 and HEPN2 domains of the C2c2 protein are intact, once activated, the C2c2 protein cleaves the target RNA, but also cleaves non-target RNAs.

[00334] In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure includes an amino acid sequence having 80% or more (e.g., 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100%) amino acid sequence identity with the amino acid sequence set forth in any one of SEQ ID NOs: 1-6. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Listeria seeligeri C2c2 amino acid sequence set forth in SEQ ID NO: 1. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Leptotrichia buccalis C2c2 amino acid sequence set forth in SEQ ID NO: 2. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rhodobacter capsulatus C2c2 amino acid sequence set forth in SEQ ID NO: 4. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Carnobacterium gallinarum C2c2 amino acid sequence set forth in SEQ ID NO: 5. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Herbinix hemicellulosilytica C2c2 amino acid sequence set forth in SEQ ID NO: 6. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure includes an amino acid sequence having 80% or more amino acid sequence identity with the Leptotrichia buccalis (Lbu) C2c2 amino acid sequence set forth in SEQ ID NO: 2. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure is a Leptotrichia buccalis (Lbu) C2c2 protein (e.g., see SEQ ID NO: 2). In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure includes the amino acid sequence set forth in any one of SEQ ID NOs: 1-2 and 4-6.
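As a rough illustration of how an identity threshold such as "at least 80%" might be checked, the following Python sketch computes percent identity over two sequences that have already been aligned (gaps written as "-"). The function names, the gap character, the toy sequences, and the choice of the number of aligned columns as the denominator are all assumptions made for this example; this passage does not prescribe a particular alignment program or counting convention.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Columns where both sequences have a gap are ignored; a column counts as
    a match only when both residues are identical and neither is a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (x, y) for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == "-" and y == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def meets_identity(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True when percent identity is at or above `threshold`."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Toy peptide fragments, not taken from any SEQ ID NO.
    print(percent_identity("MKVTA", "MKVSA"))
    print(meets_identity("MKVTA", "MKVSA", threshold=80.0))
```

Other conventions (for example, dividing by the length of the shorter sequence) give different values for gapped alignments, so the convention should be stated alongside any reported percentage.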

[00335] In some cases, a C2c2 protein included in a kit of the present disclosure is not a Leptotrichia shahii (Lsh) C2c2 protein. In some cases, a C2c2 protein included in a kit of the present disclosure is not a C2c2 polypeptide having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh C2c2 polypeptide set forth in SEQ ID NO: 3.

[00336] In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure is more efficient, by a factor of 1.2-fold or more, than a Leptotrichia shahii (Lsh) C2c2 protein at cleaving RNA that is not targeted by a C2c2 guide RNA of the method. In some cases, the C2c2 protein is more efficient, by a factor of 1.5-fold or more, than a Leptotrichia shahii (Lsh) C2c2 protein at cleaving RNA that is not targeted by a C2c2 guide RNA of the method. In some cases, the C2c2 polypeptide used in a method of the present disclosure, when activated, cleaves non-target RNA at least 1.2-fold, at least 1.5-fold, at least 2-fold, at least 2.5-fold, at least 3-fold, at least 4-fold, at least 5-fold, at least 6-fold, at least 7-fold, at least 8-fold, at least 9-fold, at least 10-fold, at least 15-fold, at least 20-fold, at least 30-fold, or more than 30-fold, more efficiently than Lsh C2c2.
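The fold comparisons above can be illustrated with a small arithmetic sketch in Python. It assumes, purely for illustration, that "efficiency" is expressed as a non-target RNA cleavage rate measured for the candidate protein and for Lsh C2c2 under the same conditions; the function names and the numerical inputs are hypothetical, and the threshold list is copied from the paragraph.

```python
# Fold thresholds recited in the paragraph above.
FOLD_THRESHOLDS = [1.2, 1.5, 2, 2.5, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30]


def fold_over_reference(candidate_rate: float, reference_rate: float) -> float:
    """Ratio of the candidate's cleavage rate to the reference (e.g., Lsh) rate."""
    if reference_rate <= 0:
        raise ValueError("reference rate must be positive")
    return candidate_rate / reference_rate


def highest_threshold_met(fold: float):
    """Largest recited 'at least N-fold' threshold satisfied, or None."""
    met = [t for t in FOLD_THRESHOLDS if fold >= t]
    return max(met) if met else None


if __name__ == "__main__":
    fold = fold_over_reference(candidate_rate=3.0, reference_rate=1.5)
    print(fold, highest_threshold_met(fold))
```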

[00337] In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure exhibits at least a 50% RNA cleavage efficiency within 1 hour of said contacting (e.g., 55% or more, 60% or more, 65% or more, 70% or more, or 75% or more cleavage efficiency). In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure exhibits at least a 50% RNA cleavage efficiency within 40 minutes of said contacting (e.g., 55% or more, 60% or more, 65% or more, 70% or more, or 75% or more cleavage efficiency). In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure exhibits at least a 50% RNA cleavage efficiency within 30 minutes of said contacting (e.g., 55% or more, 60% or more, 65% or more, 70% or more, or 75% or more cleavage efficiency).
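A criterion of the form "at least 50% cleavage within 30 minutes" can be checked against a measured time course. The Python sketch below assumes the data are available as sampled time points (in minutes) with the fraction of RNA cleaved at each point; it reports the first sampled time at which the target fraction is reached and does not interpolate between samples. The function names and the example values are hypothetical.

```python
def time_to_fraction(times_min, fraction_cleaved, target=0.5):
    """First sampled time (minutes) at which `fraction_cleaved` >= `target`.

    Returns None when the target fraction is never reached.
    """
    if len(times_min) != len(fraction_cleaved):
        raise ValueError("time and fraction series must have the same length")
    for t, f in zip(times_min, fraction_cleaved):
        if f >= target:
            return t
    return None


def meets_efficiency(times_min, fraction_cleaved, target=0.5, within_min=60):
    """True when the target fraction is reached at or before `within_min`."""
    reached = time_to_fraction(times_min, fraction_cleaved, target)
    return reached is not None and reached <= within_min


if __name__ == "__main__":
    # Hypothetical time course, for illustration only.
    times = [0, 10, 20, 30, 60]
    fractions = [0.0, 0.20, 0.45, 0.62, 0.90]
    print(time_to_fraction(times, fractions))
    print(meets_efficiency(times, fractions, within_min=40))
```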

In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 30 seconds to 60 minutes, e.g., from 1 minute to 60 minutes, from 30 seconds to 5 minutes, from 1 minute to 5 minutes, from 1 minute to 10 minutes, from 5 minutes to 10 minutes, from 10 minutes to 15 minutes, from 15 minutes to 20 minutes, from 20 minutes to 25 minutes, from 25 minutes to 30 minutes, from 30 minutes to 35 minutes, from 35 minutes to 40 minutes, from 40 minutes to 45 minutes, from 45 minutes to 50 minutes, from 50 minutes to 55 minutes, or from 55 minutes to 60 minutes. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 30 seconds to 5 minutes (e.g., from 1 minute to 5 minutes, e.g., in a time period of 1 minute, 2 minutes, 3 minutes, 4 minutes, or 5 minutes). In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 5 minutes to 10 minutes (e.g., in a time period of 5 minutes, 6 minutes, 7 minutes, 8 minutes, 9 minutes, or 10 minutes). In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 10 minutes to 15 minutes (e.g., 10 minutes, 11 minutes, 12 minutes, 13 minutes, 14 minutes, or 15 minutes). 
In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 15 minutes to 20 minutes. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 20 minutes to 25 minutes. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 25 minutes to 30 minutes. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 30 minutes to 35 minutes. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 35 minutes to 40 minutes. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 40 minutes to 45 minutes. 
In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 45 minutes to 50 minutes. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 50 minutes to 55 minutes. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure cleaves at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or more than 99%, of the RNA present in a sample in a time period of from 55 minutes to 60 minutes.

In some cases, the C2c2 protein included in a kit of the present disclosure is not a Leptotrichia shahii (Lsh) C2c2 protein. In some cases, the C2c2 protein is more efficient than a Leptotrichia shahii (Lsh) C2c2 protein (e.g., at cleaving non-target RNA) by a factor of 1.2-fold or more (e.g., 1.5-fold or more, 1.7-fold or more, or 2-fold or more). As such, in some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure is more efficient, by a factor of 1.2-fold or more (e.g., 1.5-fold or more, 1.7-fold or more, or 2-fold or more), than a Leptotrichia shahii (Lsh) C2c2 protein at cleaving RNA that is not targeted by the C2c2 guide RNA of the method. In some cases, a C2c2 protein suitable for inclusion in a kit of the present disclosure, when activated, cleaves non-target RNA at least 1.2-fold, at least 1.5-fold, at least 2-fold, at least 2.5-fold, at least 3-fold, at least 4-fold, at least 5-fold, at least 6-fold, at least 7-fold, at least 8-fold, at least 9-fold, at least 10-fold, at least 15-fold, at least 20-fold, at least 30-fold, or more than 30-fold, more efficiently than Lsh C2c2.

Positive controls

[00340] A kit of the present disclosure that comprises a labeled detector RNA and a C2c2 polypeptide can also include a positive control target RNA. In some cases, the kit also includes a positive control guide RNA that comprises a nucleotide sequence that hybridizes to the control target RNA. In some cases, the positive control target RNA is provided in various amounts, in separate containers. In some cases, the positive control target RNA is provided in various known concentrations, in separate containers, along with control non-target RNAs.

Nucleic acid encoding a C2c2 guide RNA and/or a precursor C2c2 guide RNA array and/or a C2c2 protein

[00341] While the RNAs of the disclosure (e.g., C2c2 guide RNAs and precursor C2c2 guide RNA arrays) can be synthesized using any convenient method (e.g., chemical synthesis, in vitro using an RNA polymerase enzyme, e.g., T7 polymerase, T3 polymerase, SP6 polymerase, etc.), nucleic acids encoding C2c2 guide RNAs and/or precursor C2c2 guide RNA arrays are also envisioned. Additionally, while C2c2 proteins of the disclosure can be provided (e.g., as part of a kit) in protein form, nucleic acids (such as mRNA and/or DNA) encoding the C2c2 protein(s) can also be provided.

[00342] For example, in some embodiments, a kit of the present disclosure comprises a nucleic acid (e.g., a DNA, e.g., a recombinant expression vector) that comprises a nucleotide sequence encoding a C2c2 guide RNA. In some cases, the nucleotide sequence encodes a C2c2 guide RNA without a guide sequence. For example, in some cases, the nucleic acid comprises a nucleotide sequence encoding a constant region of a C2c2 guide RNA (a C2c2 guide RNA without a guide sequence), and comprises an insertion site for a nucleic acid encoding a guide sequence. In some embodiments, a kit of the present disclosure comprises a nucleic acid (e.g., an mRNA, a DNA, e.g., a recombinant expression vector) that comprises a nucleotide sequence encoding a C2c2 protein.

[00343] In some embodiments, a kit of the present disclosure comprises a nucleic acid (e.g., a DNA, e.g., a recombinant expression vector) that comprises a nucleotide sequence encoding a precursor C2c2 guide RNA array (e.g., in some cases where each guide RNA of the array has a different guide sequence). In some cases, one or more of the encoded guide RNAs of the array does not have a guide sequence, e.g., the nucleic acid can include insertion site(s) for the guide sequence(s) of one or more of the guide RNAs of the array. In some cases, a subject C2c2 guide RNA can include a handle from a precursor crRNA but does not necessarily have to include multiple guide sequences.

[00344] In some cases, the C2c2 guide RNA-encoding nucleotide sequence (and/or the precursor C2c2 guide RNA array-encoding nucleotide sequence) is operably linked to a promoter, e.g., a promoter that is functional in a prokaryotic cell, a promoter that is functional in a eukaryotic cell, a promoter that is functional in a mammalian cell, a promoter that is functional in a human cell, and the like. In some cases, a nucleotide sequence encoding a C2c2 protein is operably linked to a promoter, e.g., a promoter that is functional in a prokaryotic cell, a promoter that is functional in a eukaryotic cell, a promoter that is functional in a mammalian cell, a promoter that is functional in a human cell, a cell type-specific promoter, a regulatable promoter, a tissue-specific promoter, and the like.

Examples of Non-Limiting Aspects of the Disclosure

[00345] Aspects, including embodiments, of the present subject matter described above may be beneficial alone or in combination, with one or more other aspects or embodiments. Without limiting the foregoing description, certain non-limiting aspects of the disclosure numbered 1-90 are provided below. As will be apparent to those of skill in the art upon reading this disclosure, each of the individually numbered aspects may be used or combined with any of the preceding or following individually numbered aspects. This is intended to provide support for all such combinations of aspects and is not limited to combinations of aspects explicitly provided below:

Aspect 1. A method of detecting a single stranded target RNA in a sample comprising a plurality of RNAs, the method comprising:

a) contacting the sample with: (i) a C2c2 guide RNA that hybridizes with the single stranded target RNA; and (ii) a C2c2 protein that cleaves RNAs present in the sample; and

b) measuring a detectable signal produced by C2c2 protein-mediated RNA cleavage.

Aspect 2. A method of detecting a single stranded target RNA in a sample comprising a plurality of RNAs, the method comprising:

(a) contacting the sample with: (i) a precursor C2c2 guide RNA array comprising two or more C2c2 guide RNAs each of which has a different guide sequence; and (ii) a C2c2 protein that cleaves the precursor C2c2 guide RNA array into individual C2c2 guide RNAs, and also cleaves RNAs of the sample; and

(b) measuring a detectable signal produced by C2c2 protein-mediated RNA cleavage.

Aspect 3. The method according to aspect 1 or aspect 2, wherein the C2c2 protein cleaves at least 50% of the RNAs present in the sample within 1 hour of said contacting.

Aspect 4. The method according to aspect 3, wherein the C2c2 protein cleaves at least 50% of the RNAs present in the sample within 40 minutes of said contacting.

Aspect 5. The method according to aspect 4, wherein the C2c2 protein cleaves at least 50% of the RNAs present in the sample within 5 minutes of said contacting.

Aspect 6. The method according to aspect 5, wherein the C2c2 protein cleaves at least 50% of the RNAs present in the sample within 1 minute of said contacting.

Aspect 7. The method according to aspect 1, wherein the C2c2 protein cleaves from 50% to more than 90% of the RNAs present in the sample within 1 minute of said contacting.

Aspect 8. The method according to any one of aspects 1-7, wherein the minimum concentration at which the single stranded target RNA can be detected is in a range of from 500 fM to 1 nM.

Aspect 9. The method according to any one of aspects 1-7, wherein the single stranded target RNA can be detected at a concentration as low as 800 fM.

Aspect 10. The method according to any of aspects 1-9, wherein the C2c2 protein is not a Leptotrichia shahii (Lsh) C2c2 protein comprising an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO: 3.

Aspect 11. The method according to aspect 10, wherein the C2c2 protein cleaves non-target RNA at least 1.2-fold more efficiently than a Leptotrichia shahii (Lsh) C2c2 protein comprising at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO: 3.

Aspect 12. The method according to aspect 11, wherein the C2c2 protein cleaves non-target RNA at least 1.5-fold more efficiently than a Leptotrichia shahii (Lsh) C2c2 protein comprising the amino acid sequence set forth in SEQ ID NO:3.

Aspect 13. The method according to any of aspects 1-9, wherein the C2c2 protein comprises an amino acid sequence having 80% or more amino acid sequence identity with the amino acid sequence set forth in any one of SEQ ID NOs: 1, 2, or 4-6.

Aspect 14. The method according to any of aspects 1-9, wherein the C2c2 protein comprises an amino acid sequence having 80% or more amino acid sequence identity with the Leptotrichia buccalis (Lbu) C2c2 amino acid sequence set forth in SEQ ID NO: 2.

Aspect 15. The method according to any of aspects 1-9, wherein the C2c2 protein comprises an amino acid sequence having 80% or more amino acid sequence identity with the Listeria seeligeri C2c2 amino acid sequence set forth in SEQ ID NO: 1.

Aspect 16. The method according to any of aspects 1-9, wherein the C2c2 protein comprises the amino acid sequence set forth in any one of SEQ ID NOs: 1-2 and 4-6.
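By way of non-limiting illustration, the percent-identity thresholds recited in aspects 10-16 could be checked computationally along the following lines. This is a minimal sketch only: it assumes the two sequences have already been aligned by some separate tool, it adopts one simple counting convention (identical residues divided by aligned columns, skipping columns where both sequences have a gap), and the function names and toy sequences are hypothetical and are not taken from any SEQ ID NO of the disclosure.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity of two pre-aligned sequences of equal length.

    Assumed convention (one of several in use): identical, non-gap
    columns divided by all columns that are not gap-versus-gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Drop columns in which both sequences carry a gap character.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == gap and y == gap)]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


def meets_identity(aligned_a, aligned_b, threshold_percent):
    """True if the pair reaches the given threshold, e.g. 80 for 'at least 80%'."""
    return percent_identity(aligned_a, aligned_b) >= threshold_percent


if __name__ == "__main__":
    # Toy 10-residue sequences differing at one position (hypothetical data).
    print(meets_identity("ACDEFGHIKL", "ACDEFGHIKV", 80))
```

Other conventions (for example, dividing by the length of the shorter sequence) give different values, so any such check would need to follow the definition of sequence identity used elsewhere in the disclosure.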

Aspect 17. The method according to aspect 1, wherein the C2c2 protein comprises an amino acid sequence having at least 80% amino acid sequence identity to the C2c2 amino acid sequence set forth in SEQ ID NO:2, and comprises a substitution of one or more of R472, H477, R1048, and H1053.

Aspect 18. The method according to aspect 1, wherein the C2c2 protein comprises an amino acid sequence having at least 80% amino acid sequence identity to the C2c2 amino acid sequence set forth in SEQ ID NO:2, and comprises a substitution of amino acids R472 and H477.

Aspect 19. The method according to aspect 1, wherein the C2c2 protein comprises an amino acid sequence having at least 80% amino acid sequence identity to the C2c2 amino acid sequence set forth in SEQ ID NO:2, and comprises a substitution of amino acids R1048 and H1053.

Aspect 20. The method according to aspect 1, wherein the C2c2 protein comprises an amino acid sequence having at least 80% amino acid sequence identity to the C2c2 amino acid sequence set forth in SEQ ID NO:2, and comprises a substitution of amino acids R472, H477, R1048, and H1053.

Aspect 21. The method according to any one of aspects 1-20, wherein the sample is contacted for 2 hours or less prior to said measuring.

Aspect 22. The method according to aspect 21, wherein the sample is contacted for 60 minutes or less prior to said measuring.

Aspect 23. The method according to aspect 22, wherein the sample is contacted for 30 minutes or less prior to said measuring.

Aspect 24. The method according to aspect 23, wherein the sample is contacted for 10 minutes or less prior to said measuring.

Aspect 25. The method according to aspect 24, wherein the sample is contacted for 1 minute or less prior to said measuring.

Aspect 26. The method according to any one of aspects 1-25, comprising determining an amount of target RNA present in the sample.

Aspect 27. The method according to aspect 26, wherein said determining comprises: measuring the detectable signal to generate a test measurement;

measuring a detectable signal produced by a reference sample to generate a reference measurement; and

comparing the test measurement to the reference measurement to determine an amount of target RNA present in the sample.

Aspect 28. The method according to aspect 26, comprising:

measuring the detectable signal to generate a test measurement,

measuring a detectable signal produced by each of two or more reference samples, wherein the two or more reference samples each include a different amount of a positive control RNA, to generate two or more reference measurements, and

comparing the test measurement to the two or more reference measurements to determine an amount of target RNA present in the sample.
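By way of non-limiting illustration, the comparison of a test measurement to two or more reference measurements (aspects 27 and 28) could be carried out by interpolating along a standard curve. The sketch below is one possible approach and is not prescribed by the disclosure: it assumes the detectable signal rises monotonically with the amount of positive control RNA, it uses piecewise-linear interpolation, and the function name, units, and numerical values are hypothetical.

```python
from bisect import bisect_left


def estimate_amount(test_signal, references):
    """Estimate target RNA amount from a test signal.

    references: list of (known_amount, measured_signal) pairs obtained
    from reference samples holding different amounts of positive control
    RNA. Assumes signal increases monotonically with amount.
    """
    if len(references) < 2:
        raise ValueError("need two or more reference measurements")
    # Order the reference points by signal so we can bracket the test value.
    points = sorted(references, key=lambda p: p[1])
    signals = [s for _, s in points]
    if len(set(signals)) != len(signals):
        raise ValueError("reference signals must be distinct")
    if not signals[0] <= test_signal <= signals[-1]:
        raise ValueError("test signal is outside the reference range")
    i = bisect_left(signals, test_signal)
    if signals[i] == test_signal:
        return points[i][0]
    # Linear interpolation between the two bracketing reference points.
    (a0, s0), (a1, s1) = points[i - 1], points[i]
    return a0 + (a1 - a0) * (test_signal - s0) / (s1 - s0)


if __name__ == "__main__":
    # Hypothetical references: (amount in pM, signal in arbitrary units).
    refs = [(0.0, 100.0), (10.0, 300.0), (100.0, 900.0)]
    print(estimate_amount(200.0, refs))
```

A test signal outside the range spanned by the references is rejected rather than extrapolated, which is a design choice of this sketch; a fitted curve over more reference points could be substituted where the response is not linear between points.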

Aspect 29. The method according to any one of aspects 1-28, wherein the sample comprises from 5 to 10⁷ RNAs that differ from one another in sequence.

Aspect 30. The method according to any one of aspects 1-28, wherein the sample comprises from 10 to 10⁶ RNAs that differ from one another in sequence.

Aspect 31. The method according to any one of aspects 1-30, wherein the sample comprises RNAs from a cell lysate.

Aspect 32. The method according to any one of aspects 1-31, wherein measuring a detectable signal comprises one or more of: gold nanoparticle based detection, fluorescence polarization, colloid phase transition/dispersion, electrochemical detection, and semiconductor-based sensing.

Aspect 33. The method according to any one of aspects 1-32, wherein (i) the method comprises contacting the sample with a labeled detector RNA comprising a fluorescence-emitting dye pair (i.e., a fluorescence resonance energy transfer (FRET) pair and/or a quencher/fluor pair), (ii) the C2c2 protein cleaves the labeled detector RNA, and (iii) the detectable signal is produced by the FRET pair and/or the quencher/fluor pair.

Aspect 34. The method according to aspect 33, wherein the labeled detector RNA produces an amount of detectable signal prior to being cleaved, and the amount of detectable signal is reduced when the labeled detector RNA is cleaved.

Aspect 35. The method according to aspect 33, wherein the labeled detector RNA produces a first detectable signal prior to being cleaved and a second detectable signal when the labeled detector RNA is cleaved.

Aspect 36. The method according to aspect 35, wherein the labeled detector RNA comprises a FRET pair and a quencher/fluor pair.

Aspect 37. The method according to any one of aspects 33-36, wherein the labeled detector RNA comprises a FRET pair.

Aspect 38. The method according to aspect 33, wherein a detectable signal is produced when the labeled detector RNA is cleaved.

Aspect 39. The method according to aspect 33, wherein an amount of detectable signal increases when the labeled detector RNA is cleaved.

Aspect 40. The method according to aspect 38 or aspect 39, wherein the labeled detector RNA comprises a quencher/fluor pair.

Aspect 41. The method according to any of aspects 33-40, wherein the labeled detector RNA comprises a modified nucleobase, a modified sugar moiety, and/or a modified nucleic acid linkage.

Aspect 42. The method according to any one of aspects 1-41, wherein said contacting is carried out in an acellular sample.

Aspect 43. The method according to any one of aspects 1-41, wherein said contacting is carried out in a cell in vitro, ex vivo, or in vivo.

Aspect 44. A method of cleaving a precursor C2c2 guide RNA array into two or more C2c2 guide RNAs, the method comprising:

contacting a precursor C2c2 guide RNA array comprising two or more C2c2 guide RNAs each of which has a different guide sequence, with a C2c2 protein, wherein the C2c2 protein cleaves the precursor C2c2 guide RNA array into individual C2c2 guide RNAs.

Aspect 45. The method according to aspect 44, wherein the C2c2 protein lacks a catalytically active HEPN1 domain and/or lacks a catalytically active HEPN2 domain.

Aspect 46. The method according to aspect 44 or aspect 45, wherein the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs that target different target sequences within the same target RNA molecule.

Aspect 47. The method according to any one of aspects 44-46, wherein the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs that target different target RNA molecules.

Aspect 48. The method according to any one of aspects 44-47, wherein said contacting does not take place inside of a cell.

Aspect 49. The method according to any one of aspects 44-48, wherein at least one of the guide RNAs and/or the precursor C2c2 guide RNA array is detectably labeled.

Aspect 50. A kit for detecting a target RNA in a sample comprising a plurality of RNAs, the kit comprising:

(a) a precursor C2c2 guide RNA array, and/or a nucleic acid encoding said precursor C2c2 guide RNA array, wherein the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs each of which has a different guide sequence and/or an insertion site for a guide sequence of choice; and

(b) a C2c2 protein.

Aspect 51. The kit of aspect 50, wherein the C2c2 protein lacks a catalytically active HEPN1 domain and/or lacks a catalytically active HEPN2 domain.

Aspect 52. The kit of aspect 50 or aspect 51, wherein the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs that target different target sequences within the same target RNA molecule.

Aspect 53. The kit of any one of aspects 50-52, wherein the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs that target different target RNA molecules.

Aspect 54. The kit of any one of aspects 50-53, wherein at least one of the guide RNAs and/or the precursor C2c2 guide RNA array is detectably labeled.

Aspect 55. The kit of any one of aspects 50-54, further comprising a labeled detector RNA comprising a fluorescence-emitting dye pair (i.e., a FRET pair and/or a quencher/fluor pair).

Aspect 56. A kit for detecting a target RNA in a sample comprising a plurality of RNAs, the kit comprising:

(a) a labeled detector RNA comprising a fluorescence-emitting dye pair (i.e., a FRET pair and/or a quencher/fluor pair); and

(b) a C2c2 protein.

Aspect 57. The kit of aspect 56, comprising a positive control target RNA.

Aspect 58. The kit of aspect 57, wherein the positive control target RNA is present in different amounts in each of two or more containers.

Aspect 59. The kit of any one of aspects 56-58, comprising at least one of:

(c) a C2c2 guide RNA and/or a nucleic acid encoding said C2c2 guide RNA;

(d) a precursor C2c2 guide RNA and/or a nucleic acid encoding said precursor C2c2 guide RNA; and

(e) a precursor C2c2 guide RNA array, and/or a nucleic acid encoding said precursor C2c2 guide RNA array, wherein the precursor C2c2 guide RNA array comprises two or more C2c2 guide RNAs each of which has a different guide sequence and/or an insertion site for a guide sequence of choice.

Aspect 60. The kit of any one of aspects 56-59, comprising a DNA comprising a nucleotide sequence that encodes a C2c2 guide RNA with or without a guide sequence.

Aspect 61. The kit of aspect 60, wherein the DNA comprises an insertion sequence for the insertion of a guide sequence.

Aspect 62. The kit of aspect 60 or aspect 61, wherein the DNA is an expression vector and the C2c2 guide RNA is operably linked to a promoter.

Aspect 63. The kit of aspect 62, wherein the promoter is a T7 promoter.

Aspect 64. The kit of any one of aspects 56-63, comprising a C2c2 endoribonuclease variant that lacks nuclease activity.

Aspect 65. The kit of any one of aspects 56-64, wherein the labeled detector RNA comprises a FRET pair.

Aspect 66. The kit of any one of aspects 56-65, wherein the labeled detector RNA comprises a quencher/fluor pair.

Aspect 67. The kit of any one of aspects 56-66, wherein the labeled detector RNA comprises a FRET pair that produces a first detectable signal and a quencher/fluor pair that produces a second detectable signal.

Aspect 68. A variant C2c2 polypeptide comprising:

a) an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:2, and comprises substitution of: i) amino acids R472 and H477; ii) amino acids R1048 and H1053; or iii) amino acids R472, H477, R1048, and H1053;

b) an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:1, and comprises substitution of: i) amino acids R445 and H450; ii) amino acids R1016 and H1021; or iii) amino acids R445, H450, R1016, and H1021;

c) an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:4, and comprises substitution of: i) amino acids R464 and H469; ii) amino acids R1052 and H1057; or iii) amino acids R464, H469, R1052, and H1057;

d) an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:5, and comprises substitution of: i) amino acids R467 and H472; ii) amino acids R1069 and H1074; or iii) amino acids R467, H472, R1069, and H1074; or

e) an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99%, amino acid sequence identity to the amino acid sequence set forth in SEQ ID NO:6, and comprises substitution of: i) amino acids R472 and H477; ii) amino acids R1044 and H1049; or iii) amino acids R472, H477, R1044, and H1049.

Aspect 69. A variant C2c2 polypeptide of aspect 68, wherein the variant C2c2 polypeptide has reduced or undetectable cleavage of ssRNA (e.g., RNA-guided cleavage activity), but retains the ability to bind C2c2 guide RNA and ssRNA, and retains the ability to cleave precursor C2c2 guide RNA.

Aspect 70. A nucleic acid comprising a nucleotide sequence encoding a variant C2c2 polypeptide of aspect 68 or aspect 69.

Aspect 71. The nucleic acid of aspect 70, wherein the nucleotide sequence is operably linked to a constitutive promoter or a regulatable promoter.

Aspect 72. A recombinant expression vector comprising the nucleic acid of aspect 70 or aspect 71.

Aspect 73. A host cell genetically modified with the nucleic acid of aspect 70 or aspect 71, or with the recombinant expression vector of aspect 72.

Aspect 74. The host cell of aspect 73, wherein the host cell is a eukaryotic cell.

Aspect 75. The host cell of aspect 73, wherein the host cell is a prokaryotic cell.

Aspect 76. The host cell of any one of aspects 73-75, wherein the host cell is in vitro, ex vivo, or in vivo.

Aspect 77. A method of detecting at least two different single stranded target RNAs in a sample comprising a plurality of RNAs, the method comprising:

a) contacting the sample with:

(i) a first C2c2 protein that cleaves single stranded RNAs (ssRNAs) that include at least one A;

(ii) a second C2c2 protein that cleaves ssRNAs that include at least one U;

(iii) a first C2c2 guide RNA that comprises a first nucleotide sequence that hybridizes with the first single stranded target RNA and a second nucleotide sequence that binds to the first C2c2 protein; and

(iv) a second C2c2 guide RNA that comprises a first nucleotide sequence that hybridizes with the second single stranded target RNA and a second nucleotide sequence that binds to the second C2c2 protein;

wherein the first C2c2 protein is not activated by the second C2c2 guide RNA, and wherein the first C2c2 protein cleaves ssRNA that includes at least one A, and

wherein the second C2c2 protein is not activated by the first C2c2 guide RNA, and wherein the second C2c2 protein cleaves ssRNA that includes at least one U; and

b) measuring a detectable signal produced by RNA cleavage mediated by the first and the second C2c2 proteins, wherein a first detectable signal is produced upon activation of the first C2c2 protein and a second detectable signal is produced upon activation of the second C2c2 protein, wherein detection of the first signal indicates the presence in the sample of the first target ssRNA, and wherein detection of the second signal indicates the presence in the sample of the second target ssRNA.

Aspect 78. The method of aspect 77, wherein:

a) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Hhe Cas13a amino acid sequence depicted in FIG. 56K;

b) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rca Cas13a amino acid sequence depicted in FIG. 56G;

c) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ppr Cas13a amino acid sequence depicted in FIG. 56B;

d) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lne Cas13a amino acid sequence depicted in FIG. 56I;

e) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lbu Cas13a amino acid sequence depicted in FIG. 56C;

f) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lwa Cas13a amino acid sequence depicted in FIG. 56E;

g) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lba Cas13a amino acid sequence depicted in FIG. 56F; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh Cas13a amino acid sequence depicted in FIG. 56D;

h) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Hhe Cas13a amino acid sequence depicted in FIG. 56K;

i) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Rca Cas13a amino acid sequence depicted in FIG. 56G;

j) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ppr Cas13a amino acid sequence depicted in FIG. 56B;

k) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lne Cas13a amino acid sequence depicted in FIG. 56I;

l) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lbu Cas13a amino acid sequence depicted in FIG. 56C;

m) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lwa Cas13a amino acid sequence depicted in FIG. 56E;

n) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lsh Cas13a amino acid sequence depicted in FIG. 56D;

o) the first C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Ere Cas13a amino acid sequence depicted in FIG. 56J; and the second C2c2 protein comprises an amino acid sequence having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%, amino acid sequence identity to the Lse Cas13a amino acid sequence depicted in FIG. 56A;