KIT AND METHOD FOR ANALYZING SINGLE T CELLS - ST JUDE CHILDRENS RES HOSPITAL

Title:

KIT AND METHOD FOR ANALYZING SINGLE T CELLS

Document Type and Number:

WIPO Patent Application WO/2021/003114

Kind Code:

Abstract:

A kit and method for analyzing nucleic acid molecules encoding T cell receptor (TCR) a and β from individual T cells are disclosed. In particular, a method for analyzing individual T cells using high-throughput multiplex amplification and deep sequencing of nucleic acids encoding TCRαβ is provided.

More Like This:

WO/2023/236563	MONKEYPOX VIRUS-SPECIFIC DETECTION TARGET, AND OLIGONUCLEOTIDE AND KIT THEREOF
JP7011145	Modified polynucleotide synthesis method with immobilized polymerase
JP2022528009	New pathology markers and their use

Inventors:

DASH PRADYOT (US)
THOMAS PAUL (US)

Application Number:

PCT/US2020/040218

Publication Date:

January 07, 2021

Filing Date:

June 30, 2020

Export Citation:

Click for automatic bibliography generation Help

Assignee:

ST JUDE CHILDRENS RES HOSPITAL (US)

International Classes:

C12Q1/686

Attorney, Agent or Firm:

LICATA, Jane, Massey et al. (US)

Download PDF:

View/Download PDF PDF Help

Claims:

What is claimed is :

1. A kit for analyzing a single T cell comprising:

(a) a first set of primers, wherein the first set of primers comprises:

(i) a first set of forward primers comprising the nucleotide sequences of SEQ ID NOs:l-40 and SEQ ID NOs: 42- 70 having a length ranging from 15 to 40 nucleotides,

(ii) a first set of reverse primers comprising the nucleotide sequences of SEQ ID NOs:41 and 71 having a length ranging from 15 to 40 nucleotides; and

(b) a second set of primers, wherein the second set of primers comprises:

(i) a second set of forward primers comprising the nucleotide sequences of SEQ ID NOs: 72-111 and SEQ ID NO: 130-158 having a length ranging from 40 to 70 nucleotides, and

(ii} a second set of reverse primers comprising the nucleotide sequences of SEQ ID NOs: 112-129 and SEQ ID NOs: 159-176 having a length ranging from 50 to 90 nucleotides ;

(i) a first set of forward primers comprising the nucleotide sequences of SEQ ID NOs: 177-199 and SEQ ID NOs: 201-217 having a length ranging from 15 to 40 nucleotides ,

(ii) a first set of reverse primers comprising the nucleotide sequences of SEQ ID NOs: 200 and 218 having a length ranging from 15 to 40 nucleotides; and

(d) a second set of primers, wherein the second set of primers comprises: (i) a second set of forward primers comprising the nucleotide sequences of SEQ ID NOs: 219-244 and SEQ ID NO:263-289 having a length ranging from 40 to 70 nucleotides, and

(ii) a second set of reverse primers comprising the nucleotide sequences of SEQ ID NOs: 245-262 and SEQ ID NOs: 290-307 having a length ranging from 50 to 90 nucleotides .

2. A method for analyzing nucleic acid molecules encoding T cell receptor (TCR) a and b from single T cells, comprising :

(a) sorting single T cells from a sample comprising a plurality of T cells into separate locations;

(b) amplifying nucleic acid molecules encoding TCRoi and TCR from one or more single T cells using the first set of primers from the kit of claim 1 to produce a first set of amplicon products in one or more locations of the separate locations ;

(c) performing nested polymerase chain reaction (PCR) on the amplified nucleic acid molecules encoding TCRa and TCRp in the first set of amplicon products with the second set of primers from the kit of claim 1 to produce a second set of amplicon products; and

(d) sequencing the amplicon products.

3. The method of claim 2, wherein the sequencing step (d) comprises:

(i) carrying out a third round of PCR to produce a third set of amplicon products, and

(ii) subjecting the third amplicon products to next generation sequencing.

4. The method of claim 2, wherein the nucleic acid molecules encoding TCRa and TCR are mRNAs .

5. The method of claim 2, wherein the sample is collected from a subject.

6. The method of claim 2, wherein the performing step (c) comprises:

dividing the first set of amplicon products into two pools; and

performing nested PCR on the first pool and second pool separately,

wherein

(i) the first pool is amplified with the second set of forward primers having the nucleotide sequences of SEQ ID NOs: 72-111, and the second set of reverse primers having the nucleotide sequences of SEQ ID NOs: 112-129, and the second pool is amplified with the second set of forward primers having the nucleotide sequences of SEQ ID NOs: 130- 158, and the second set of reverse primers having the nucleotide sequences of SEQ ID NOs: 159-176; or

(ii) the first pool is amplified with the second set of forward primers having the nucleotide sequences of SEQ

ID NOs: 219-244, and the second set of reverse primers having the nucleotide sequences of SEQ ID NOs: 245-262, and the second pool is amplified with the second set of forward primers having the nucleotide sequences of SEQ ID NOs: 263- 289, and the second set of reverse primers having the nucleotide sequences of SEQ ID NOs: 290-307.

Description:

KIT AND METHOD FOR ANALYZING SINGLE T CELLS

Introduction

[0001] This patent application claims the benefit of priority from U.S. Provisional Serial Number 62/869,204 filed July 1, 2019, the contents of which are incorporated herein by reference in their entirety.

[0002] This invention was made with government support under Grant Number AI107625 awarded by the National Institutes of Health. The government has certain rights in the invention.

Background

[0003] T cells are defined by a surface T cell receptor (TCR) that mediates recognition of pathogen-associated epitopes, generally via interactions with peptide-major histocompatibility complexes (pMHC) . T cell receptors are generated by germline recombinase activating gene (RAG)- mediated rearrangements of the genomic TCR locus, a process termed V(D)J recombination. This process has the potential to generate a significant number of diverse TCRs, with estimates ranging from 10 ¹⁵ to as high as 10 ⁶¹ possible receptors that could be generated by recombination, although only a relatively small portion of these is thought to appear in any individual (~10 ⁶-10 ⁸) . In mammals, two types of TCRs are possible, ab and gd, and different species produce different ratios of cells bearing these receptors. In humans and mice, ab T cells dominate, representing up to 90% of the T cell compartment.

[0004] The pool of T cells that recognizes a specific epitope expresses diverse TCRs. The size of these naive precursor repertoires has been estimated for various epitopes by limiting dilution techniques and, more recently, by a tetramer-based magnetic enrichment approach, the latter of which finds pool sizes ranging between 50 and 500 naive cells per epitope, on average. Due to the rounds of expansion that T cells undergo in the thymus during development, it has been assumed that there are multiple naive cells with identical TCRs . However, sequencing the naive repertoire of epitope-specific responses in mice has instead shown that most naive cells contain a unique receptor, with a very low rate of duplicates among cells.

[0005 ] Sequencing the nucleic acids encoding the T cell receptor requires identifying the specific V-region used by the a or b chain and obtaining the complete sequence of the hypervariable CDR3 region, the site of RAG-mediated V{D)J junctional diversity. Due to the availability of TCR nb staining reagents in the human and mouse, analyses of the repertoire initially focused solely on the TCR chain. Subsequently, two broad approaches to sequencing the TCR repertoire have emerged: single-cell based methods that permit direct pairing of the a and b chains (Dash, et al. (2011} J. Clin. Invest. 121:288-295; Wang, et al . (2012)

Sci. Transl. Med. 4:128ra42; Kim, et al . (2012) PLoS One 7:e37338; and Han, et al . (2014) Nat. Biotechnol.

32 (7 ): 684-692 ), and deep sequencing-based methods that amplify single chains from pools of cells (Robins, et al. (2009) Blood 114:4099-4107; Weinstein, et al . (2009)

Science 324:807-810; Freeman, et al. (2009) Genome Res. 19:1817-1824) where pairing can be achieved through specific sort conditions and algorithmic imputation (Howie, et al . (2015) Sci. Transl. Med. 7 : 301ral31 } . Single cell multiplex techniques for TCRa or TCRyd profiling have been described (Dash, et al. (2017) Nature 547(7661) :89—92; Dash, et al . (2015) Meth. Mol. Biol. 1343:181-197; Guo, et al . (2016) Mol. Ther. Methods Clin. Dev. 3:15054; Guo, et al. {2018} Immunity 49 { 3) : 531-44 ; US 2019/0040381 Al). However, a large-scale multiplexing approach adapted to single cell deep sequencing is needed to increase the throughput of single cell TOR profiling·

Summary of the Invention

[0006] This invention provides a kit, which includes a first set of primers, wherein the first set of primers include (a) a first set of primers, wherein the first set of primers comprises: (i) a first set of forward primers comprising the nucleotide sequences of SEQ ID NOs:l-40 and SEQ ID NOs: 42-70 having a length ranging from 15 to 40 nucleotides, (ii) a first set of reverse primers comprising the nucleotide sequences of SEQ ID NOs: 1 and 71 having a length ranging from 15 to 40 nucleotides; and (b) a second set of primers, wherein the second set of primers comprises: {i) a second set of forward primers comprising the nucleotide sequences of SEQ ID NOs: 72-111 and SEQ ID NO: 130-158 having a length ranging from 40 to 70 nucleotides, and (ii) a second set of reverse primers comprising the nucleotide sequences of SEQ ID NOs: 112-129 and SEQ ID NOs: 159-176 having a length ranging from 50 to 90 nucleotides; or (c) a first set of primers, wherein the first set of primers comprises: (i) a first set of forward primers comprising the nucleotide sequences of SEQ ID

NOs: 177-199 and SEQ ID NOs: 201-217 having a length ranging from 15 to 40 nucleotides, (ii) a first set of reverse primers comprising the nucleotide sequences of SEQ ID

NOs: 200 and 218 having a length ranging from 15 to 40 nucleotides; and (d) a second set of primers, wherein the second set of primers comprises: (i) a second set of forward primers comprising the nucleotide sequences of SEQ ID NOs: 219-244 and SEQ ID NO: 263-289 having a length ranging from 40 to 70 nucleotides, and (ii) a second set of reverse primers comprising the nucleotide sequences of SEQ ID NOs : 245-262 and SEQ ID N0s:290-307 having a length ranging from 50 to 90 nucleotides,

[ 0007 ] A method for analyzing nucleic acid molecules encoding T cell receptor (TCR) a and b from single T cells is also provided, which includes the steps of (a) sorting single T cells from a sample comprising a plurality of T cells into separate locations; (b) amplifying nucleic acid molecules encoding TCR a and b from one or more single T cells using the first set of primers from the kit to produce a first set of amplicon products in one or more locations of the separate locations; (c) performing nested polymerase chain reaction (PCR) on the amplified nucleic acid molecules in the first set of amplicon products with the second set of primers from the kit to produce a second set of amplicon products; and (d) sequencing the second set of amplicon products. In one embodiment, the sequencing step (d) includes { i) carrying out a third round of PCR to produce a third set of amplicon products, and (ii) subjecting the third amplicon products to next generation sequencing. In certain embodiments, the target nucleic acids are mRNAs . In other embodiments, the sample is collected from a subject. In further embodiments, the performing step (c) includes dividing the first set of amplicon products into two pools; and performing nested PCR on the first pool and second pool separately, wherein (i) the first pool is amplified with the second set of forward primers having the nucleotide sequences of SEQ ID NOs: 72- 111, and the second set of reverse primers having the nucleotide sequences of SEQ ID NOs: 112-129, and the second pool is amplified with the second set of forward primers having the nucleotide sequences of SEQ ID NOs: 130-158, and the second set of reverse primers having the nucleotide sequences of SEQ ID NOs: 159-176; or (ii) the first pool is amplified with the second set of forward primers having the nucleotide sequences of SEQ ID NOs: 219-244, and the second set of reverse primers having the nucleotide sequences of SEQ ID NOs: 245-262, and the second pool is amplified with the second set of forward primers having the nucleotide sequences of SEQ ID NOs: 263-289, and the second set of reverse primers having the nucleotide sequences of SEQ ID NOs :290-307.

Brief Description of the Drawings

[0008] FIG. 1 shows a schematic of the method of this invention .

[0009] FIG. 2 demonstrates that the method of the invention provided increased paired data.

[0010] FIG. 3 shows the number of cells with one, two or three TCRa and TO¾b sequences per cell.

Detailed Description of the Invention

[0011] A multiplex panel of chimeric primer sequences designed to amplify TOEab sequences from single cells in an unbiased manner has now been developed. These primer sequences allow for the TCR repertoire to be analyzed using conventional next generation sequencing-based platforms in a highly efficient manner. Indeed, the instant single-cell TOEoίb amplification and sequencing strategy offers a highly transparent, cell number agnostic approach for processing up to 6300 cells at a time. Accordingly, this invention provides a kit and method for amplifying and analyzing TO ab sequences from single T cells using a multiplex panel of chimeric oligonucleotide sequences designed to amplify TOEab sequences in an unbiased manner. Using the kit and method of this invention, paired TCRc^ sequences can be isolated at the single cell level in response to a variety of immune responses such as viral infections, tumors, and autoimmune patients thereby allowing for the design of effective immune cell-based therapies.

[0012] The present disclosure provides kits containing oligonucleotide primers and methods for analyzing nucleic acids encoding TCRap from individual T cells by high- throughput multiplex amplification and sequencing of the nucleic acids encoding the TCRap . As used herein, the term "primer" or "oligonucleotide primer" refers to an oligonucleotide that hybridizes to the template strand of a nucleic acid and initiates synthesis of a nucleic acid strand complementary to the template strand when placed under conditions in which synthesis of a primer extension product is induced, i.e., in the presence of nucleotides and a polymerization-inducing agent such as a DNA or RNA polymerase and at suitable temperature, pH, metal concentration, and salt concentration. The primer is generally single-stranded for maximum efficiency in amplification, but may alternatively be double-stranded. If double-stranded, the primer can first be treated to separate its strands before being used to prepare extension products. This denaturation step is typically achieved by heat, but may alternatively be carried out using alkali, followed by neutralization. Thus, a "primer" is complementary to a template, and complexes by hydrogen bonding or hybridization with the template to give a primer/template complex for initiation of synthesis by a polymerase, which is extended by the addition of covalently bonded bases linked at its 3' end complementary to the template in the process of DNA or RNA synthesis.

[0013] The method of this invention generally involves sorting of single T cells into separate locations (e.g., separate wells of a multi-well titer plate) followed by nested polymerase chain reaction (PCR) amplification of nucleic acids encoding TCRs using the primers disclosed herein. The amplicons are barcoded to identify their cell of origin, combined, and analyzed by deep sequencing. Using the kit and method of the present disclosure, TCRs from individual T cells can be reconstituted for functional studies, ligand discovery, and/or screening therapeutics.

[0014] More specifically, this invention provides a kit and method for analyzing nucleic acid molecules encoding TCRoi from individual T cells by sorting single T cells from a sample including a plurality of T cells into separate locations; amplifying nucleic acid molecules encoding TCRa and TCR from one or more single T cells using a first set of external primers to produce a first set of amplicon products in one or more locations of the separate locations; performing nested polymerase chain reaction (PCR) on the amplified nucleic acid molecules encoding TCRa and TCRP in the first set of amplicon products with the unique forward and reverse nested primers of this disclosure to produce a second set of amplicon products, wherein the reverse nested primer includes a barcode sequence and the forward and reverse nested primers have adapters attached thereto; and sequencing the second set of amplicon products, e.g., via a third round of PCR and next generation sequencing. Using the method of this invention, a wide variety of diseases, including inflammatory disorders, autoimmune diseases, infectious diseases, and cancer can be diagnosed and treated.

[0015] In carrying out the method of this invention, a biological sample including T cells is collected from a subject. The biological sample can be any sample of bodily fluid or tissue containing T cells, including but not limited to, samples of blood, thymus, spleen, lymph nodes, bone marrow, a tumor biopsy, or an inflammatory lesion biopsy. In particular, samples of T cells may be taken from sites of inflamed, infected, or injured tissue, including but not limited to sites of tumors, transplant rejection, tissue damage, such as caused by traumatic injury or autoimmune disease, and organs or tissues targeted by pathogenic organisms . The biological sample may also include samples from in vitro cell culture resulting from the growth of T cells from the subject in culture. The biological sample can be obtained from a subject by conventional techniques. For example, blood can be obtained by venipuncture. Surgical techniques for obtaining solid tissue samples are well known in the art. Samples may be obtained from a subject prior to diagnosis and throughout a course of treatment.

[ 0016] Subsequently, single T cells are isolated from the biological sample and sorted into separate locations. The separate locations can be separate reaction containers, such as wells of a multi-well plate (e.g., a 96-well plate, 384-well plate, 1536-well plate) or microwell array, capillaries or tubes (e.g., 0.2 mL tubes, 0.5 mL tubes, 1.5 mL tubes), or chambers in a microfluidic device. Alternatively, the separate locations can be emulsion droplets that spatially separate cells.

[ 0017 ] Various methods are known in the art for isolating single cells. In some embodiments, the sample is sorted to obtain single T cells using a flow cytometer. Methods of preparing a sample of cells for flow cytometry analysis is described in, e.g., US 5,378,633; US 5,631,165; US 6,524,858; US 5,266,269; US 5,017,497; US 6,549,876; US 2012/0178098; US 2008/0153170; US 2001/0006787; US 2008/0158561; US 2010/0151472; US 2010/0099074; US 2010/0009364; US 2009/0269800; US 2008/0241820; US 2008/0182262; US 2007/0196870; US2 008/0268494; WO 99/54494; Brown, et al. (2000) Clin. Chem. 46:1221-9; McCoy, et al. (2002) He atol . Oncol. Clin. North Am. 16:229-43; and Scheffold (2000) J. Clin. Immunol. 20:400-7.

[0018] In some instances, single T cells can be isolated from a biological sample by appropriate dilution of a sample to allow distribution of a single cell in a small isolation volume to a separate location. In certain embodiments, a microfluidic device is used for isolating single cells and distributing single cells to separate locations in the device, such as separate wells or chambers. Alternatively, a microfluidic device can be used to generate emulsion droplets containing single cells. For a description of techniques for isolating single cells and microfluidic devices for sorting single cells, see, e.g., Huang, et al . (2014) Lab Chip. 14 (7 ): 1230-1245; Zare, et al . (2010) Annu. Rev. Biomed. Eng. 12:187-201; Novak, et al . (2011) Angew. Che . Int. Ed. 50:390-395; US

2010/0255471; US 2010/0285975; US 2010/0021984; US 2010/0173394; WO 2009/145925; and US 2009/0181859.

[0019] "Microfluidics device" means an integrated system of one or more chambers, ports, and channels that are interconnected and in fluid communication and designed for carrying out an analytical reaction or process, either alone or in cooperation with an appliance or instrument that provides support functions, such as sample introduction, fluid and/or reagent driving means, temperature control, detection systems, data collection and/or integration systems, and the like. Microfluidics devices may further include valves, pumps, and specialized functional coatings on interior walls, e.g., to prevent adsorption of sample components or reactants, facilitate reagent movement by electroosmosis, or the like. Such devices are usually fabricated in or as a solid substrate, which may be glass, plastic, or other solid polymeric materials, and typically have a planar format for ease of detecting and monitoring sample and reagent movement, especially via optical or electrochemical methods. Features of a microfluidic device usually have cross-sectional dimensions of less than a few hundred square micrometers and passages typically have capillary dimensions, e.g., having maximal cross-sectional dimensions of from about 500 pm to about 0.1 pm. Microfluidics devices typically have, volume capacities in the range of from 1 pL to a few nL, e.g., 10-100 nL. The fabrication and operation of microfluidics devices are well-known in the art as described in US 6,001,229; US 5,858,195; US 6,010,607; US 6,033,546; US 5,126,022; US 6,054,034; US 6,613,525; US

6,399,952; WO 02/24322; WO 99/19717; and US 5,587,128;

5,498,392.

[0020] In certain embodiments, the sample is labeled with one or more detectable labels that bind to cells within the sample before sorting the cells. The terms "label" and "detectable label" refer to a molecule capable of detection, including, but not limited to, radioactive isotopes, fluorescers, chemiluminescers , enzymes, enzyme substrates, enzyme cofactors, enzyme inhibitors, chromophores, dyes, metal ions, metal sols, ligands (e.g., biotin or haptens) and the like. In some cases, the detectable label is linked to a binding agent that binds to a binding partner on a T cell in the sample. In case of labeling T cells, the binding agent may be an antibody {e.g., anti-CD3, anti-CD4, anti-CD8, anti-a TCR, anti-CDl4, anti-CD25, anti-CD45RA, anti-CD45RO, anti-FOXP3, etc.) or major histocompatibility complex (MHC) tetramer that specifically binds to a binding partner on or in a T cell. Thus, in some cases, the T cell is permeabilized before labeling. In some embodiments, one or more detectable labels is used to classify a cell, e.gr. , T cell, within a sample, based on the amount of label bound to the cell.

[0021 ] In some embodiments, a subset of cells within a sample is sorted as single cells into separate locations. Thus, cells may be sorted to include a first subset and exclude a second subset of cells within the sample. The first subset and second subset may be defined by a number of factors, including, but not limited to, amount of detectable label that is bound, size, light scattering properties, amount of staining by dyes that indicate viability or lack thereof, etc., of a cell. Thus, in some embodiments, a T cell that is labeled with an anti~CD8, anti-CD14, MHC tetramer or a combination thereof, and is not labeled as being dead, is included to be sorted to generate single T cells in separate locations.

[ 0022 ] In some cases, sorting the T cells into separate locations as single cells may result in a subset of the separate locations having two or more T cells. These locations with potentially more than one T cells may be identified and flagged during data analysis of the sequencing data, and data from such locations in some cases may be removed from further analysis.

[0023 ] As explained above, the primers described herein may be used in polymerase chain reaction (PCR) -based techniques, such as RT-PCR, for amplification of T cell mRNA. As is conventional in the art, "polymerase chain reaction," or "PCR" means a reaction for the in vitro amplification of specific DNA sequences by the simultaneous primer extension of complementary strands of DNA. In PCR, a pair of primers is employed in excess to hybridize to the complementary strands of the target nucleic acid. The primers are each extended by a polymerase using the target nucleic acid as a template. The extension products become target sequences themselves after dissociation from the original target strand. New primers are then hybridized and extended by a polymerase, and the cycle is repeated to geometrically increase the number of target sequence molecules. The PCR method for amplifying target nucleic acid sequences in a sample is well-known in the art and has been described in, e.g., Innis, et al . (eds.) (1990) PCR Protocols, Academic Press, NY; Taylor (1991) Polymerase chain reaction: basic principles and automation, in PCR: A Practical Approach, McPherson et al. (eds.) IRL Press, Oxford; Saiki, et al . (1986) Nature 324:163; as well as in US 4,683,195; US 4,683,202; and 4,889,818.

[0024] In particular, PCR uses relatively short oligonucleotide primers that flank the target nucleotide sequence to be amplified, oriented such that their 3 ' ends face each other, each primer extending toward the other. The polynucleotide sample is extracted and denatured, e.g., by heat, and hybridized with first and second primers that are present in molar excess. Polymerization is catalyzed in the presence of the four deoxyribonucleotide triphosphates (dNTPs : dATP, dGTP, dCTP and dTTP) using a primer- and template-dependent polynucleotide polymerizing agent, such as any enzyme capable of producing primer extension products, for example, E. coli DNA polymerase I, Klenow fragment of DNA polymerase I, T4 DNA polymerase, thermostable DNA polymerases isolated from Thermus aquaticus (Taq) , available from a variety of sources (for example, Perkin Elmer) , Thermus thermophilus (United States Biochemicals } , Bacillus stereothermophilus (Bio-Rad), or Thermococcus litoralis ("Vent" polymerase, New England Biolabs) . This results in two "long products" which contain the respective primers at their 5' ends covalently linked to the newly synthesized complements of the original strands. The reaction mixture is then returned to polymerizing conditions, e.g., by lowering the temperature, inactivating a denaturing agent, or adding more polymerase, and a second cycle is initiated. The second cycle provides the two original strands, the two long products from the first cycle, two new long products replicated from the original strands, and two "short products" replicated from the long products. The short products have the sequence of the target sequence with a primer at each end. On each additional cycle, an additional two long products are produced, and a number of short products equal to the number of long and short products remaining at the end of the previous cycle. Thus, the number of short products containing the target sequence grows exponentially with each cycle. In some cases, PCR is carried out with a commercially available thermal cycler, e.g., Perkin Elmer.

[0025 ] RNA may be amplified by reverse transcribing the RNA into cDNA, and then performing PCR. This type of amplification, referred to as "reverse transcription PCR, " or "RT-PCR, " is well-known in the art and described, e.g., in US 5,168,038, which is incorporated herein by reference. Alternatively, a single enzyme may be used for both steps as described in US 5,322,770, incorporated herein by reference in its entirety. RNA may also be reverse transcribed into cDNA, followed by asymmetric gap ligase chain reaction (RT-AGLCR) as described by Marshall, et al. (1994) PCR Meth. App. 4:80-84. Suitable DNA polymerases include reverse transcriptases, such as avian myeloblastosis virus (AMV) reverse transcriptase (available from, e.g., Seikagaku America, Inc.} and Moloney murine leukemia virus (M LV) reverse transcriptase (available from, e.g., Bethesda Research Laboratories).

[0026] Promoters or promoter sequences suitable for incorporation in the primers are nucleic acid sequences (either naturally occurring, produced synthetically or a product of a restriction digest) that are specifically recognized by an RNA polymerase that recognizes and binds to that sequence and initiates the process of transcription whereby RNA transcripts are produced. The sequence may optionally include nucleotide bases extending beyond the actual recognition site for the RNA polymerase which may impart added stability or susceptibility to degradation processes or increased transcription efficiency. Examples of useful promoters include those which are recognized by certain bacteriophage polymerases such as those from bacteriophage T3, T7 or SPG, or a promoter from E. coli. These RNA polymerases are readily available from commercial sources, such as New England Biolabs and Epicentre.

[0027] Some of the reverse transcriptases suitable for use in the methods herein have an RNAse H activity, such as AMV reverse transcriptase. In some cases, exogenous RNAse H, such as E. coli RNAse H, is added, even when AMV reverse transcriptase is used. RNAse H is readily available from, e.g., Bethesda Research Laboratories.

[0028] The RNA transcripts produced by these methods may serve as templates to produce additional copies of the target sequence through the above-described mechanisms. The system is autocatalytic and amplification occurs autocatalytically without the need for repeatedly modifying or changing reaction conditions such as temperature, pH, ionic strength or the like.

[0029] The method of the present disclosure uses a multiplexed nested PCR approach. "Nested PCR" refers to PCR that is carried out in at least two steps, wherein the amplicon product from a first round of PCR becomes the template for a second round of PCR using a second set of primers, at least one of which binds to an interior location of the amplicon from the first round of PCR, to generate a second amplicon product. In certain embodiments, a third round of PCR is carried out on the second amplicon product using a third set of primers to generate a third amplicon product, which is sequenced.

[ 0030 ] In certain embodiments, the nested PCR is multiplexed, wherein the nested PCR is carried out with T cell target sequences encoding TCRs simultaneously in the same reaction mixture. See, e.g. _r Bernard, et al . (1999)

Anal. Biochem. 273:221-228. Distinct sets of primers are employed for each sequence being amplified as described herein. Exemplary primers (SEQ ID NOs: 1-307) are provided in Tables 1-8 for amplifying human and mouse TCRs {e.g., both a and b chains of the heterodimer) , and also for adding barcodes and sequencing adapters for paired-end sequencing. Changes to the nucleotide sequences of these primers may be introduced corresponding to genetic variations in particular T cells. For example, up to three nucleotide changes, including 1 nucleotide change, 2 nucleotide changes, or three nucleotide changes, may be made in a sequence selected from the group of SEQ ID NOs: 1-307, wherein the oligonucleotide primer is capable of hybridizing to and amplifying or sequencing a T cell target nucleic acid (i.e., nucleic acids encoding TCRo^) .

[ 0031 ] In certain cases, a first set of primers used to amplify a target nucleic acid, i.e., a nucleic acid encoding TCRccp, may contain a primer that specifically hybridizes to and amplifies, when paired with another appropriate primer in the first set, the target nucleic acid during a first round of PCR. A second set of primers may then be used to further amplify the target nucleic acid when the second set contains a primer that specifically hybridizes to and amplifies, when paired with another appropriate primer in the second set, a specific amplification product of the first round of PCR during a second round of PCR. Similarly, a third set of primers may then be used to further amplify the target nucleic acid when the third set contains a primer that specifically hybridizes to and amplifies, when paired with another appropriate primer in the third set, a specific amplification product of the second round of PCR during a third round of PCR.

[ 0032 ] In some embodiments, primers within a set of primers may include, in addition to a sequence that hybridizes to a target nucleic acid, or an amplification product thereof, a common sequence and/or a barcode sequence. The common sequence may be the same sequence among a plurality of primers that otherwise hybridize to and amplify, when appropriately paired with another primer, different target nucleic acids, or amplification products thereof. In some embodiments, the common sequence in a primer used during a round of PCR enables a primer used during a following round of PCR to anneal to and amplify, when paired with an appropriate primer, the target nucleic acid by serving as an annealing site for the primer used during a following round of PCR. As such, in some cases, the common sequence in a primer used during a round of PCR is a sequence that does not hybridize to target-specific sequences of a target nucleic acid, or to a specific amplification product from a previous round of PCR. In some cases, the common sequence is a sequence that hybridizes to a target nucleic acid, if, for example, the target nucleic acid includes a sequence that is shared among different target nucleic acids, e.g., a sequence encoding a constant region of a TCR.

[0033] The multiplexed PCR reactions may be carried out in one or more of the separate locations into which single T cells from a sample have been sorted. Ideally, the amplification products of the multiplexed PCR reaction, which are in multiple separate locations, are combined into one pool before sequencing. In certain embodiments, the barcode sequence used in one of the rounds of the multiplexed PCR reactions may be used to enable identification of the location, e.g., well, from which a particular sequenced amplification product originated, as described further herein.

[0034] The present disclosure provides sets of primers that amplify nucleic acid molecules encoding T cell receptors, or a portion thereof. In aspects pertaining to amplification of nucleic acid molecules encoding human T cell receptors, a first set of primers includes a first set of forward primers set forth in SEQ ID NOs: 1-40 and SEQ ID NOs: 42-70, or a variant thereof that differs by up to three nucleotides, and a first set of reverse primers set forth in SEQ ID NOs: 41 and 71, or a variant thereof that differs by up to three nucleotides, wherein the first set of primers amplify nucleic acid molecules encoding T cell receptors, or a portion thereof, from single T cells to produce a first set of amplicons. In aspects pertaining to amplification of nucleic acid molecules encoding mouse T cell receptors, a first set of primers includes a first set of forward primers set forth in SEQ ID NOs: 177-199 and SEQ ID NOs:201-217, or a variant thereof that differs by up to three nucleotides, and a first set of reverse primers set forth in SEQ ID NOs: 200 and 218, or a variant thereof that differs by up to three nucleotides, wherein the first set of primers amplify nucleic acid molecules encoding T cell receptors, or a portion thereof, from single T cells to produce a first set of amplicons. The T cell receptors amplified with the first set of primers include the T cell receptor alpha chain and T cell receptor beta chain.

[0035] The present disclosure further provides a second set of nested primers that amplify the first set of amplicons. In aspects pertaining to amplification of nucleic acid molecules encoding human T cell receptors, a second set of nested primers includes a second set of forward nested primers set forth in SEQ ID NOs: 72-111 and SEQ ID NOs: 130- 158, or a variant thereof that differs by up to three nucleotides, and a second set of reverse nested primers set forth in SEQ ID NOs: 112-129 and SEQ ID NOs: 159-176, or a variant thereof that differs by up to three nucleotides, wherein the second set of nested primers amplify the first set of amplicons to produce a second set of amplicons. In aspects pertaining to amplification of nucleic acid molecules encoding mouse T cell receptors, a second set of nested primers includes a second set of forward nested primers set forth in SEQ ID NOs: 219-244 and SEQ ID NOs : 263- 289, or a variant thereof that differs by up to three nucleotides, and a second set of reverse nested primers set forth in SEQ ID NOs:245-262 and SEQ ID NOs: 290-307, or a variant thereof that differs by up to three nucleotides, wherein the second set of nested primers amplify the first set of amplicons to produce a second set of amplicons

[0036] Advantageously, barcode sequences are included in the second set of reverse nested primers to identify the single T cell from which each amplified nucleic acid originated. The use of barcodes allows nucleic acid analytes from different cells to be pooled in a single reaction mixture for sequencing while still being able to trace back a particular target nucleic acid to the particular cell from which it originated. Each cell is identified by a unique barcode sequence comprising at least six nucleotides. In accordance with this invention, barcode sequences are added during amplification by carrying out

PCR with a primer that contains a region comprising the barcode sequence and a region that is complementary to the target nucleic acid of interest such that the barcode is incorporated into the final amplified nucleic acid product. Exemplary barcode sequences include, but are not limited to, 5 ' -ATCACG-3 ' , 5 ' -CGATGT-3' , 5 ' -TTAGGC-3 ' , 5 ' -TGACCA-3 ' ,

5 ' GCCAAT-3 ' , 5 ' -CAGATC-3 ' , 5 ' -ACTTGA-3 ' , 5 ' -GATCAG-3 ' , 5'-

TAGCTT-3 ' , 5 ' -GGCTAC-3 ' , 5 ' -CTTGTA-3 ' , 5 ' -ACAGTG-3 ' , 5

ACAGTG-3', 5 ' -AGTCAA-3 ' , 5 ' -AGTTCC-3 ' , 5' -ATGTCA-3' , 5'-

CCGTCC-3', 5' -GTAGAG-3 ' , 5 ' -GTCCGC-3 ' , 5' -CGTGAT-3' , 5'-

ACATCG-3 ' , 5' -GCCTAA-3' , 5 ' -TGGTCA-3 ' , 5' -ATTGGC-3 ' , 5'-

GATCTG-3 ' , 5' -TCAAGT-3 ' , 5' -CTGATC-3 ' , 5' -AAGCTA-3 ' , 5'-

GTAGCC-3 ' , 5' -TAC AG-3 ' , 5' -CACTGT-3 ' , ₅, _ _TTGACT_3 , ^ 5

GGAACT-3 ' , 5 ' -TGACAT-3’ , 5' ~GGACGG~3 ' , 5' -CTCTAC-3 ' , OR 5'- GCGGAC-3'. See ILLUMINA® (2015) Illumine Adapter Sequences, Document # 1000000002694 vOO. In certain embodiments, single cells are initially sorted to separate locations in an ordered array or multi-well plate where the cell can be identified by its position using barcodes . In certain embodiments, a barcode sequence is added to one end of an amplicon to identify the position of a cell in a multi-well plate. Exemplary primers for introducing barcodes into an amplicon are provided in Tables 3-4 and 7-8. In one embodiment, a primer for introducing a barcode sequence into an amplicon of a nucleic acid encoding a TCR has a sequence set forth in SEQ ID NOs: 112-129 and SEQ ID NOs : 159-176; or SEQ ID NOs:245-262 and SEQ ID N0s:290-307. [0037] In addition to barcodes, adapter sequences are added to amplicons to facilitate high-throughput amplification or sequencing. For example, a pair of adapter sequences are added at the 5' and 3' ends of a DNA template to allow amplification and/or sequencing of multiple DNA templates simultaneously by the same set of primers. Exemplary adapter sequences include the sequences: TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG (SEQ ID NO: 308) and GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG (SEQ ID NO: 309). See ILLUMINA®, 16S Metagenomic Sequencing Library Preparation, Preparing 16S Ribosomal RNA Gene Amplicons for the Illumina MiSeq System, Part # 15044223 Rev. B. Primers incorporating these adapter sequences are set forth in SEQ ID NOs: 112-129 and SEQ ID NOs : 159-176 (Tables 3-4) and SEQ ID NOs:245-262 and SEQ ID NOs : 290-307 (Tables 7-8).

[0038] In some embodiments, the first set of forward primers and/or the first set of reverse primers, as described above, do not include a barcode sequence and/or an adapter sequence. In other embodiments, the second set of forward nested primers and/or the second set of reverse nested primers, as described above, include a barcode sequence and/or an adapter sequence.

[0039] The primers of this invention can be readily synthesized by standard techniques, e.g., solid phase synthesis via phosphoramidite chemistry, as disclosed in US 4,458,066; US 4,415,732; Beaucage, et al. (1992) Tetrahedron 48:2223-2311. Other chemical synthesis methods include, for example, the phosphotriester method described by Narang, et al . (1979) Meth . Enzymol. 68:90 and the phosphodiester method disclosed by Brown, et al. (1979) Meth. Enzymol. 68:109. Poly(A) or poly(C), or other non complementary nucleotide extensions may be incorporated into oligonucleotides using these same methods. Hexaethylene oxide extensions may be coupled to the oligonucleotides by methods known in the art. See, e.g., Cload, et al. (1991) J. Am. Chem. Soc. 113:6324-6326; US 4,914,210; Durand, et al . (1990) Nucleic Acids Res.

18:6353-6359; and Horn, et al . (1986) Tet. Lett. 27:4705-

4708.

[0040] The primers of this invention are in the range of between 10-100 nucleotides in length, such as 15-40, 20-40, 15-70, 50-90 and so on, more typically in the range of between 15-90 nucleotides long, and any length between the stated ranges. In certain embodiments, a primer oligonucleotide has a sequence selected from the group of SEQ ID NOs: 1-307 or a fragment thereof including at least about 6 contiguous nucleotides, at least about 8 contiguous nucleotides, at least about 10-12 contiguous nucleotides, or at least about 15-20 contiguous nucleotides; or a variant thereof with a sequence having at least about 80- 100% sequence identity thereto, including any percent identity within this range, such as 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99% sequence identity thereto. Changes to the nucleotide sequences of SEQ ID NOs: 1-307 may be introduced corresponding to genetic variations in particular T cells. In certain embodiments, up to three nucleotide changes, including 1 nucleotide change, 2 nucleotide changes, or three nucleotide changes, may be made in a sequence selected from the group of SEQ ID NOs: 1-307, wherein the oligonucleotide primer is capable of hybridizing to and amplifying a particular T cell receptor target nucleic acid.

[0041] Moreover, the oligonucleotides, particularly the primer oligonucleotides for amplification or sequencing, may be coupled to labels for detection. There are several means known for derivatizing oligonucleotides with reactive functionalities which permit the addition of a label. For example, several approaches are available for biotinylating probes so that radioactive, fluorescent, chemiluminescent, enzymatic, or electron dense labels can be attached via avidin. See, e.g., Broken, et al . (1978} Nucl, Acids Res. 5:363-384, which discloses the use of ferritin-avidin- biotin labels; and Chollet, et al . (1985) Nucl. Acids Res. 13:1529-1541, which discloses biotinylation of the 5' termini of oligonucleotides via an aminoalkylphosphoramide linker arm. Several methods are also available for synthesizing amino-derivatized oligonucleotides which are readily labeled by fluorescent or other types of compounds derivatized by amino-reactive groups, such as isothiocyanate, N-hydroxysuccinimide, or the like, see, e.g., Connolly (1987} Nucl. Acids Res. 15:3131-3139; Gibson, et al . (1987} Nucl. Acids Res. 15:6455-6467 and US 4,605,735. Methods are also available for synthesizing sulfhydryl-derivatized oligonucleotides, which can be reacted with thiol-specific labels, see, e.g., US 4,757,141; Connolly, et al. (1985) Nucl. Acids Res. 13:4485-4502 and Spoat, et al . (1987) Nucl. Acids Res. 15:4837-4848. A comprehensive review of methodologies for labeling DNA fragments is provided in Matthews, et al. (1988) Anal. Biochem. 169:1-25.

[ 0042 ] For example, oligonucleotides may be fluorescently labeled by linking a fluorescent molecule to the non ligating terminus of the molecule. Guidance for selecting appropriate fluorescent labels can be found in Smith, et al . (1987) Meth. Enzymol. 155:260-301; Karger, et al . (1991) Nucl. Acids Res. 19:4955-4962; and Guo, et al. (2012) Anal. Bioanal. Chem. 402 (10) :3115-3125. Fluorescent labels include fluorescein and derivatives thereof, such as disclosed in US 4,318,846 and Lee, et al . (1989) Cytometry 10:151-164. Dyes for use in the present invention include 3-phenyl-7-isocyanatocoumarin, acridines, such as 9- isothiocyanatoacridine and acridine orange, pyrenes, benzoxadiazoles , and stilbenes, such as disclosed in US 4, 174,384. Additional dyes include Yakima Yellow, Texas Red, 3- ( e-carboxypentyl) -3 '-ethyl-5, 5 ' -dimethyloxa- carbocyanine (CYA) , 6-carboxy fluorescein (FAM) , 5,6- carboxyrhodamine-110 (R110), 6-carboxyrhodamine- 6G (R6G) , N ' , N ' , N ' , N ' -tetramethyl- 6-carboxyrhodamine (TAMRA) , 6- carboxy-X-rhodamine (ROX) , 2 ' , 4 ' , 5 ' , 7 ' , -tetrachloro-4-7- dichlorofluorescein (TET) , 2 ' , 7 ' -dimethoxy-4 ' , 5 ' -6 carboxyrhodamine (JOE), 6-carboxy-2 ' , 4 , 4 ' , 5 ' , 7 , 7 ' - hexachlorofluorescein (HEX) , Dragonfly orange, ATTO-Tec, Bodipy, and VIC, and dyes available under the trademarks SYBR® green, SYBR® gold, CAL FLUOR® Orange 560, CAL FLUOR® Red, QUASAR® Blue 670, ALEXA®, Cy3®, and Cy5®. These dyes are commercially available from various suppliers such as Life Technologies (Carlsbad, CA) , Biosearch Technologies (Novato, CA) , and Integrated DNA Technologies (Coralville, IA) . Fluorescent labels include fluorescein and derivatives thereof, such as disclosed in US 4,318,846 and Lee, et al . (1989) Cytometry 10:151-164, and 6-FAM, JOE, TAMRA, ROX, HEX-1, HEX-2, ZOE, TET-1 or NAN-2, and the like.

[0043] Oligonucleotides can also be labeled with a minor groove binding (MGB) molecule, such as disclosed in US 6,884,584; US 5,801,155; Afonina, et al. (2002) Biotechniques 32:940-944, 946-949; Lopez-Andreo, et al. (2005) Anal. Biochem. 339:73-82; and Belousov, et al. (2004) Hum. Genomics 1:209-217. Oligonucleotides having a covalently attached MGB are more sequence specific for their complementary targets than unmodified oligonucleotides. In addition, an MGB group increases hybrid stability with complementary DNA target strands compared to unmodified oligonucleotides, allowing hybridization with shorter oligonucleotides.

[ 0044 ] Additionally, oligonucleotides can be labeled with an acridinium ester (AE) using the techniques described below. Current technologies allow the AE label to be placed at any location within the _, probe. See, e.g., Nelson, et al. (1995) "Detection of Acridinium Esters by

Chemiluminescence" in Nonisotopic Probings Blotting and Sequencing, Kricka L. J(ed) Academic Press, San Diego, CA; Nelson, et al. (1994) "Application of the Hybridization Protection Assay (HPA) to PCR" in The Polymerase Chain Reaction, Mullis et al . (eds.) Birkhauser, Boston, MA; Weeks, et al. (1983) Clin. Chem. 29:1474-1479; Berry, et al . (1988) Clin. Che . 34:2087-2090. An AE molecule can be directly attached to the probe using non-nucleotide-based linker arm chemistry that allows placement of the label at any location within the probe. See, e.g., US 5,585,481 and US 5,185,439.

[ 0045 ] T cells may be pre-treated in any number of ways prior to amplification and sequencing of nucleic acids. For instance, in certain embodiments, the T cell may be treated to disrupt (or lyse) the cell membrane, for example by treating the samples with one or more detergents and/or denaturing agents (e.g., guanidinium agents). Nucleic acids may also be extracted from samples, for example, after detergent treatment and/or denaturing as described above. Total nucleic acid extraction may be performed using known techniques, for example by non-specific binding to a solid phase (e.g., silica) . See, e.g., US 5,234,809; US

6,849,431; US 6,838,243; US 6,815,541; and US 6, 720 , 166.

[0046 ] In certain embodiments, the target nucleic acids are separated from non-homologous nucleic acids using capture oligonucleotides immobilized on a solid support. Such capture oligonucleotides contain nucleic acid sequences that are complementary to a nucleic acid sequence present in the target T cell nucleic acid analyte such that the capture oligonucleotide can "capture" the target nucleic acid. Capture oligonucleotides can be used alone or in combination to capture T cell nucleic acids. For example, multiple capture oligonucleotides can be used in combination, e.g. , 2, 3, 4, 5, 6, etc. different capture oligonucleotides can be attached to a solid support to capture target T cell nucleic acids . In certain embodiments, one or more capture oligonucleotides can be used to bind T cell target nucleic acids either prior to or after amplification by primer oligonucleotides and/or sequencing .

[ 0047 ] As T cells may be sorted into single T cells in separate locations, e.g., separate wells, in the present method, as described above, some embodiments of the present disclosure include a composition including one or more sets of forward and reverse primers and/or sets of primer pairs, as described above, and nucleic acids from a single T cell. After single T cells are sorted to separate locations, they may be lysed in order to release cellular contents, such as nucleic acids (e.g., mRNA, miRNA, chromosomal DNA, mitochondrial DNA, etc.) . The released nucleic acids may then provide templates, including any target nucleic acids, from which PCR may be carried out using the primer compositions of the present disclosure. A composition that contains nucleic acids from a single T cell may be distinguished from a composition that contains nucleic acids from two or more T cells by, e.g., determining the number of one or more autosomal loci of chromosomal DNA using sequencing or other suitable methods, as described in, e.g., Kalisky, et al. (2011) Nat. Methods 8:311; Fu, et al. (2011) Proc. Natl. Acad. Sci. USA 108:9026; and Shuga, et al. (2013) Nucleic Acids Res. 41:el59. Thus, in some embodiments, the composition contains one or more sets of forward and reverse primers and/or sets of primer pairs, as described above, and T cell nucleic acids from less than two T cells. In some embodiments, the composition contains no nucleases and/or contains nuclease inhibitors and/or provides buffering conditions that inhibits or reduces nucleic acid degradation at least until the first round of amplification .

[0048] Any high-throughput technique for sequencing can be used in the practice of the invention. DNA sequencing techniques include sequencing by synthesis using reversibly terminated labeled nucleotides, pyrosequencing, 454 sequencing, sequencing by synthesis using allele specific hybridization to a library of labeled clones followed by ligation, real-time monitoring of the incorporation of labeled nucleotides during a polymerization step, polony sequencing, SOLID sequencing, and the like. These sequencing approaches can thus be used to sequence target TCR nucleic acids of amplified from single T cells.

[0049] Certain high-throughput methods of sequencing include a step in which individual molecules are spatially isolated on a solid surface where they are sequenced in parallel. Such solid surfaces may include nonporous surfaces (such as in Solexa sequencing, e.g., Bentley, et al. (2008) Nature 456:53-59; or Complete Genomics sequencing, e.g., Drmanac, et al . (2010) Science 327:78- 81), arrays of wells, which may include bead- or particle- bound templates (such as with 454, e.g. Margulies, et al. (2005) Nature 437:376-380; or Ion Torrent sequencing, e.g., US 2010/0137143 or US 2010/0304982), micromachined membranes (such as with SMRT sequencing, e.g., Eid, et al.

(2009) Science 323:133-138), or bead arrays (as with SOLID sequencing or polony sequencing, e.g., Kim, et al. (2007) Science 316:1481-1414). Such methods may include amplifying the isolated molecules either before or after they are spatially isolated on a solid surface. Prior amplification may include emulsion-based amplification, such as emulsion PCR, or rolling circle amplification. In certain embodiments, amplification is carried out using a third set of primers in a third round of PCR with the second amplicon product as a template, i.e., for paired-end sequencing. Of particular interest is sequencing on the ILLUMINA® MiSeq platform, which uses reversible-terminator sequencing by synthesis technology (see, e.g., Shen, et al. (2012) BMC Bioinformatics 13:160; Junemann, et al . (2013) Nat. Biotechnol. 31 (4) : 294-296; Glenn (2011) Mol. Ecol. Resour. 11(5) : 759-769; Thudi, et al. (2012) Brief Fund. Genomics 11(1) : 3-11) .

[ 0050 ] The present disclosure also provides for analyzing multiplexed single cell sequencing data, such as those acquired using the method of analyzing single T cells described herein. In one implementation, a user may access a file on a computer system, wherein the file is generated by sequencing multiplexed PCR amplification products from multiple single T cells by, e.g., a method of analyzing single T cells, as described herein. Thus, the file may include a plurality of sequencing reads for a plurality of nucleic acids derived from multiple T cells. Each of the sequencing reads may be a sequencing read of a nucleic acid that contains a target nucleic acid nucleotide sequence (i.e., a nucleotide sequence encoding T cell receptor) and one or more barcode sequences that identifies the single cell source (e.g., a single cell in a well in a multi-well plate, a capillary, a microfluidic chamber, etc.} from which the nucleic acid originated (e.g., after multiple nested PCR of the target nucleic acid expressed by a single T cell in the well} . In some embodiments, the sequencing read is a paired-end sequencing read.

[0051] The sequencing reads in the file may be assembled to generate a consensus sequence of a target nucleic acid nucleotide sequence by matching the nucleotide sequence corresponding to the target nucleic acid sequence and the barcode sequences contained in each sequencing read. Those sequencing reads that originate from the same single cell source (e.g., same well) and have a target sequence that has a higher identity to a reference sequence than a threshold identity level may be assigned to the same target nucleic acid that was initially amplified from the single cell source, and may be grouped into a subset representing the target nucleic acid. The number of sequencing reads within the subset indicates how likely it is that the consensus sequence assembled from the sequencing reads in a subset is part of an actual nucleic acid molecule that was present in the single cell source. Thus, if the number of sequencing reads in a subset is above a background level, the consensus sequence derived from the subset may be considered to represent an actual sequence of a target nucleic acid in the single cell source. The consensus sequence may then be outputted, e.g., to a display, printout, database, etc.

[0052] In some embodiments, the reference sequence is a sequence for the target nucleic acid in a reference database, such as GENBANK®. Thus, in some embodiments, a target sequence in a first sequencing read in a subset of sequencing reads, as described above, is 80% or more, e.g., 85% or more, 90% or more, 95% or more, or up to 100% identical to a reference sequence for the target nucleic acid from a reference database. In some embodiments, the reference sequence is one or more other sequences in sequencing reads of the same subset. Thus, in such cases, a target nucleotide sequence in a first sequencing read in a subset of sequencing reads, as described above, is 80% or more, e.g., 85% or more, 90% or more, 95% or more, or up to 100% identical to a target nucleotide sequence in a second sequencing read in the same subset. In some instances, a target nucleotide sequence in a first sequencing read in a subset is 80% or more, e.g., 85% or more, 90% or more, 95% or more, or up to 100% identical to a target nucleotide sequence in all other sequencing reads in the same subset.

[ 0053 ] In certain embodiments, the sequencing reads are generated by a method of analyzing a T cell as disclosed herein. As such, in some embodiments, the target nucleic acid sequence contained in the sequenced nucleic acid is flanked on the 5' and 3' ends by a common sequence and optionally a barcode sequence. The barcode sequence may contain a sequence that specifies the single cell source of the target nucleic acid (e.g., the plate among a plurality of plates, the row among a plurality of rows in a multiwall plate, and/or the column among a plurality of columns in a multiwall plate, etc.) . The common sequence and/or barcode sequence is incorporated into the amplified target nucleic acid during a round of the multiplex amplification process, e.g., during the second round of amplification, as described above, to provide for a primer annealing site that may be used in the next round, e.g., third round, of amplification. Thus, the common sequences at the ends of the amplified target nucleotide sequence may be sequences exogenous to the target nucleic acid, are ideally different from one another, and may not be a sequence that can hybridize to the target nucleotide sequence before the second round of amplification. The length of the common sequences may be in the range of 17 to 30 nucleotides long, e.g., 18 to 28 nucleotides long, 19 to 26 nucleotides long, including 20 to 25 nucleotides long.

[ 0054 ] The output of the analysis may be provided in any convenient form. In some embodiments, the output is provided on a user interface, a print out, in a database, etc. and the output may be in the form of a table, graph, raster plot, heat map etc. In some embodiments, the output is further analyzed to determine properties of the single cell from which a target nucleotide sequence was derived. Further analysis may include correlating expression of a plurality of target nucleotide sequences within single cells, principle component analysis, clustering, statistical analyses, etc.

[ 0055 ] A computer system for implementing the present computer-implemented method may include any arrangement of components as is commonly used in the art. The computer system may include a memory, a processor, input and output devices, a network interface, storage devices, power sources, and the like. The memory or storage device may be configured to store instructions that enable the processor to implement the present computer-implemented method by processing and executing the instructions stored in the memory or storage device.

[0056] In certain alternative embodiments, the present method of analyzing T cells includes stimulating T cells in a sample obtained from a subject before sorting single T cells into separate locations . The stimulating may be achieved by any convenient method. Stimulating T cells may include, but are not limited to, contacting the T cells with 12-myristate 13-acetate (PMA) and ionomycin, with PMA and anti-CD3/anti-CD28, with one or more antigens specifically recognized by one or more T cells of interest in the sample, or with extracts of cells or tissues. In some cases, a sample is divided into to a first sample whose T cells are stimulated and a second sample whose T cells are unstimulated, then the two samples are analyzed separately according to the method described herein.

[0057] In some cases, the second round of PCR in the present method of analyzing single T cells may involve splitting the amplification products encoding a TCR from the first round of PCR into two pools, and performing the second round of PCR in the first pool using a reverse primer that is specific to a first subtype of TCR, and in the second pool using a reverse primer that is specific to a second subtype of TCR. In such instances, the amplification product from the first pool and the second pool may include different T cell receptor chains (e.g., alpha and beta chains) . For example, the first pool may amplify a TCR with an alpha chain and the second pool a TCR with a beta chain. As before, amplification products from the second round of PCR performed on amplification products originating from all or a subset of the separate locations containing a single T cell may be combined for sequencing.

[0058] In certain embodiments, the present method of analyzing single T cells is an efficient method of analyzing nucleic acids expressed in single T cells. The presence of a T cell receptor may be detected by the present method in 70% or more, e.g., 80% or more, 85% or more, 90% or more, 92% or more, or 94% or more, and in some cases 100% or less, e.g., 95% or less, or 94% or less of the single T cells sorted into the separate locations. In some instances, the presence of a T cell receptor may be detected by the present method in a range of 70 to 100%, e.g., a range of 80 to 98%, a range of 85 to 95%, including a range of 90 to 94% of the single T cells sorted into the separate locations. In some embodiments, presence of a T cell receptor alpha chain may be detected by the present method in 70% or more, e.g., 80% or more, 85% or more, or 90% or more, and in some cases 100% or less, e.g., 95% or less, or 90% or less of the single T cells sorted into the separate locations. In some instances, the presence of a T cell receptor alpha chain may be detected by the present method in a range of 70 to 100%, e.g., a range of 75 to 95%, a range of 80 to 92%, including a range of 85 to 90% of the single T cells sorted into the separate locations. In some embodiments, presence of a T cell receptor beta chain may be detected by the present method in 85% or more, e.g., 90% or more, or 94% or more, and in some cases 100% or less, e.g., 97% or less, or 94% or less of the single T cells sorted into the separate locations. In some instances, the presence of a T cell receptor beta chain may be detected by the present method in a range of 85 to 100%, e.g., a range of 98 to 98%, a range of 90 to 96%, including a range of 91 to 95% of the single T cells sorted into the separate locations .

[0059 ] In certain embodiments, the present method of analyzing single T cells is a sensitive method of analyzing nucleic acids expressed in single T cells. The present method may provide for detecting the presence of 50 molecules or less, e.g., 25 molecules or less, 20 molecules or less, 10 molecules or less, and down to 2 molecules of a target nucleic acid (e.g., mRNA for a T cell receptor) in a single T cell.

[ 0060 ] The technology described herein provides highly efficient TCR sequencing of single T cells and finds numerous applications in basic research and development. This methodology can be performed at reasonable cost by any standardly equipped laboratory with access to flow cytometry and deep sequencing. Sequencing TCRs of single T cells provides information about the ancestry of particular T cells. Furthermore, the sequences of nucleic acids amplified from T cells can be analyzed for splice variations, somatic mutations, or genetic polymorphisms. Of particular interest are genetic variations and mutations associated with immune disorders or cancer.

[0061] Additionally, knowledge of the sequences of TCRs from individual cells allows TCRs to be reconstituted for functional studies. For example, after analyzing a T cell as described herein and identifying a sequence encoding a TCRa polypeptide and a sequence encoding a TCRP polypeptide from a single T cell, recombinant constructs expressing the TCRa heterodimer can be constructed. A host cell can be transformed with one or more recombinant polynucleotides encoding the TCR (e.g., separate onocistronic constructs expressing each polypeptide chain of the TCR heterodimer or a bicistronic construct expressing both the TCRa polypeptide and the TCR beta polypeptide) . The TCR of the single cell can be produced by culturing the host cell under conditions suitable for the expression of the TCRa polypeptide and the TCRp polypeptide and recovering the TCRap heterodimer from the host cell culture.

[0062] The reconstituted TCR can be used in screening to determine the target antigen bound by the TCR by contacting the TCR with potential target antigens displayed in complexes with major histocompatibility complex (MHC) and determining whether or not the target antigen binds to the TCR. The TCR can be screened for antigen binding in a high- throughput manner by providing a peptide library including a plurality of peptides displayed by major histocompatibility complex (MHC) molecules; and contacting the plurality of peptides with the TCR; and identifying at least one peptide-MHC complex that binds to the TCR. Any suitable antigen may find use in the present method. Exemplary antigens include, but are not limited to, antigenic molecules from infectious agents, auto-/self- antigens, tumor-/cancer-associated antigens, etc.

[0063] The present disclosure also provides kits for carrying out the method of the present disclosure. The above-described reagents, including the primers for amplification of target nucleic acid molecules encoding TCRs, and optionally other reagents for performing nucleic acid amplification (e.g., by RT-PCR} and/or sequencing can be provided in kits with suitable instructions and other necessary reagents for analyzing single T cells. The kit will normally contain in separate containers the primers and other reagents (e.g., polymerases, nucleoside triphosphates, and buffers) . All primers within a set of primers may in some cases be provided in one container. In some cases, different subsets of primers within a set of primers may be provided in separate containers. Instructions (e.g., written, CD-ROM, DVD, flash drive, etc.) for carrying out the analysis of T cells usually will be included in the kit. The kit can also contain other packaged reagents and materials (i.e., wash buffers, cell lysis agents, reagents for extraction and purification of nucleic acids, and the like) . Analysis of single T cells, as described herein, can be conducted using these kits.

[0064] Thus, the present disclosure provides kits that find use in performing the present method, as described above. In certain embodiments, the kit includes a first set of primers, which includes (a) a first set of primers, wherein the first set of primers comprises: (i) a first set of forward primers comprising the nucleotide sequences of SEQ ID NOs:l-40 and SEQ ID NOs: 42-70 having a length ranging from 15 to 40 nucleotides, (ii) a first set of reverse primers comprising the nucleotide sequences of SEQ ID NOs: 41 and 71 having a length ranging from 15 to 40 nucleotides; and <b) a second set of primers, wherein the second set of primers comprises: (i) a second set of forward primers comprising the nucleotide sequences of SEQ ID NOs: 72-111 and SEQ ID NO: 130-158 having a length ranging from 40 to 70 nucleotides, and (ii) a second set of reverse primers comprising the nucleotide sequences of SEQ ID NOs: 112-129 and SEQ ID NOs: 159-176 having a length ranging from 50 to 90 nucleotides; or (c) a first set of primers, wherein the first set of primers comprises: (i) a first set of forward primers comprising the nucleotide sequences of SEQ ID NOs: 177-199 and SEQ ID NOs: 201-217 having a length ranging from 15 to 40 nucleotides, (ii) a first set of reverse primers comprising the nucleotide sequences of SEQ ID NOs: 200 and 218 having a length ranging from 15 to 40 nucleotides; and (d) a second set of primers, wherein the second set of primers comprises: (i) a second set of forward primers comprising the nucleotide sequences of SEQ ID NOs: 219-244 and SEQ ID NO: 263-289 having a length ranging from 40 to 70 nucleotides, and (ii) a second set of reverse primers comprising the nucleotide sequences of SEQ ID NOs: 245-262 and SEQ ID NOs: 290-307 having a length ranging from 50 to 90 nucleotides. In certain embodiments, the first set and second set of primers are included in separate containers. In other embodiments, the first set of forward primers, first set of reverse primers, second set of forward primers, and second set of reverse primers are each in separate containers. [0065] The following non-limiting examples are provided to further illustrate the present invention.

Example 1 : Materials and Methods

[0066] The unbiased paired analysis of T-cell receptor (TCR) a- and b-chain usage at the single-cell level provides a valuable window for understanding the TCR repertoire and the nature of the immune response that would otherwise be difficult to obtain. Earlier technologies for TCR repertoire analysis were often limited to examining TCR complementarity-determining region 3 (CDR3) b expression or required in vitro cloning procedures that can artificially skew the TCR repertoire from the in vivo state. The protocol described here is a direct ex vivo, single-cell- based strategy for the clonotypic analysis of TCRorp repertoires that uses multiplexed panels of CDR3a- and CDR3p~specific primers in a nested PCR to amplify transcripts from an individual, epitope-specific or naive T cell by an next generation sequencing method.

[0067] Primers. External primers targeting human T cell receptor a (hTRA) and b (hTRB) genes for the first round of PCR amplification are provided in Tables 1 and 2, respectively.

TABLE 1

Primers targeting hTRAV are sense. Primers targeting hTRAC genes are antisense. hTRAV, human T cell receptor Va; hTRAC, human T cell receptor Cot.

TABLE 2

Primers targeting hTRBV genes are sense. Primers targeting hTRBC genes are antisense. hTRBV, human T cell receptor Vp; hTRBC, human T cell receptor Cp.

[0068 ] Internal primers for nested PCR amplification of hTRA and hTRB amplicons in the second round of PCR amplification are provided in Tables 3 and 4, respectively.

TABLE 3

Primers targeting hTRAV are sense. Primers targeting hTRAC genes are antisense. hTRAV, human T cell receptor Voc; hTRAC, human T cell receptor Coi. Lower case letters, adapter sequences ( ILLUMINA®) . Bold and underlined sequences, barcode sequences (ILLUMINA®) .

TABLE 4

Primers targeting hTRBV genes are sense. Primers targeting hTRBC genes are antisense, hTRBV, human T cell receptor nb; hTRBC, human T cell receptor Ob. Lower case letters, adapter sequences (ILLUMINA®) . Bold and underlined sequences, barcode sequences (ILLUMINA®) .

[ 0069] External primers targeting mouse T cell receptor a (mTRA) and b (m!RB) genes for the first round of PCR amplification are provided in Tables 5 and 6, respectively.

TABLE 5

Primers targeting mTRAV are sense. Primers targeting mTRAC genes are antisense. mTRAV, mouse T cell receptor Voi; mTRAC, mouse T cell receptor Ca.

TABLE 6

Primers targeting mTRBV genes are sense. Primers targeting mTRBC genes are antisense. mTRBV, mouse T cell receptor nb;

TRBC, mouse T cell receptor Cp . [0070 ] Internal primers for nested PCR amplification of mTRA and mTRB amplicons in the second round of PCR amplification are provided in Tables 7 and 8, respectively.

TABLE 7

Primers targeting mTRAV are sense. Primers targeting mTRAC genes are antisense. TRAV, mouse T cell receptor Va; mTRAC, mouse T cell receptor Ca. Lower case letters, adapter sequences (ILLUMINA®) . Bold and underlined sequences, barcode sequences {ILLUMINA®) ·

TABLE 8

Primers targeting mTRBV genes are sense. Primers targeting mTRBC genes are antisense. mTRBV, mouse T cell receptor nb; mTRBC, mouse T cell receptor cp. Lower case letters, adapter sequences (ILLUMINA®) . Bold and underlined sequences, barcode sequences (ILLUMINA®) . [0071] Stocks of primers were prepared by resuspending the primers to 200 m using IX TE buffer (low EDTA; 10 mM Tris- HC1, 0.1 mM EDTA, pH 8.0) and stored at -20 °C.

[0072] Cocktails of external forward human or mouse TRAV

(TRAV EXT FOR} and internal forward human or mouse TRAV

(TRAV INT FOR) primers were prepared by combining 25 mΐ of each human or mouse TRAV primer (40 of them, 200 mM stock) , thereby yielding 1000 mΐ of diluted primer cocktail with each primer at 5 mM working concentration.

[0073] Cocktails of external forward human or mouse TRBV

(TRBV EXT FOR) and internal forward human or mouse TRBV

(TRBV INT FOR) primers was prepared by combining 25 mΐ of each human or mouse TRBV EXT forward primer (27 of them, 200 mM stock) with 325 mΐ IX TE buffer (low EDTA; 10 M Tris-HCl, 0.1 mM EDTA, pH 8.0), thereby yielding 1000 mΐ of diluted primer cocktail with each primer at 5 mM working concentration .

[0074] All reverse primers including human or mouse TRAC external reverse (TRAC EXT REV), human or mouse TRBC external reverse (TRBC EXT REV), human or mouse TRAC internal reverse (TRAC INT REV) , human or mouse TRBC internal reverse (TRBC INT REV) were resuspended to prepare 5 mM working stocks.

[0075] Single Cell Sorting. Mouse or human CD8+ T cells isolated from peripheral blood mononuclear cells (PBMC), Bronchoalveolar lavage (BAL) , spleen (SPL) , and lymph nodes (LN) of infected, naive or memory mice or humans. Cells were resuspended in 1 ml of Dulbecco ' s phosphate-buffered saline (D-PBS) without Ca ⁺⁺/Mg ⁺⁺ or extraneous proteins. The cells were stained with LIVE/DEAD® Fixable Aqua Dead Cell Stain (405 nm excitation) discrimination dye for 30 minutes at room temperature in the dark. At the end of the incubation, cells were washed twice with sort buffer {PBS containing 0.1% BSA, fraction V (Life Technologies)) by centrifugation at 500g and +4°C for 5 minutes.

[ 0076 ] Cells were subsequently resuspended in 50 mΐ of sort buffer containing blocking antibody (anti-mouse CD16/CD32) or APC-conjugated peptide-loaded pMHCI tetra er (e.gr., HLA- A*0201-CMV pp65; Beckman Coulter) in appropriate dilution and incubated at room temperature in the dark for 1 hour. The cells were again washed twice by centrifuging at 500g and +4°C for 5 minutes using sort buffer. For human cells, the cell pellet was resuspended in sort buffer containing an appropriate dilution of FITC-conjugated anti-human CD3, PE-Cy7-conjugated anti-human CD8, PE-conjugated anti-human CD14. For mouse cells, the cell pellet was resuspended in sort buffer containing an appropriate dilution of mouse- CD4, anti-mouse CDllb, anti-mouse CDllc, F4/80 (all Pacific Blue-conjugated for negative gating) (Biolegend) , and anti ^¬ mouse CD8-APC-eFluro780 (eBiosciences ) . Subsequently, the cells were incubated on ice for 20 minutes in the dark.

[ 0077 ] Cells were again washed twice by centrifuging at 500g and +4°C for 5 minutes using sort buffer. The resulting pellets were resuspending in 0.5 ml of sort buffer containing RNase inhibitor at 200 U/ml. The cell suspension was filtered through a 40 mM cell strainer. Lymphocytes were first gated on their scatter properties based on a FSC-A/SSC-A plot. From the probable lymphocyte population, the live cells were select based on Fixable LIVE/DEAD® aqua staining and CD3, CD8, and CD14 negative cells for human cells, or CD4, CDllb, CDllc and F4/80 negative cells for mouse cells. Cells were than gated on CD8+ tetramert for sorting.

[ 0078 ] Epitope-specific CD8 cells were sorted into each well of columns 1-23 of a 384-well polypropylene PCR plate (Eppendorf ) . Column 24 of the 384-well plate was left empty for the negative {no template) control. Following the sort, the plate was sealed with adhesive plate seal (MicroAmp, Applied Biosystem) and placed on ice. The plates were briefly spun to bring the contents down and frozen at - 80 °C .

[ 0079 ] .Reverse transcription (RT) . The reverse transcription of the TCRa and b mRNA was carried out directly on the lysed cells without any RNA extraction step. Lysis was achieved by a combination of the freeze- thaw cycle and inclusion of a detergent (Triton™ X-100, 0.1% final) in the RT mixture. Either iScript™ (Biorad) or Superscript® Vilo™ (Thermo) were used for RT as follows. More specifically, the plate was thawed, centrifuged at 500g for 2 minutes and keep on ice. RT master mixes were prepared as appropriate for the iScript™ (Biorad) and Superscript® Vilo™ enzymes (Table 9) .

TABLE 9

[ 0080 ] To each well was added one mΐ of RT master mix using a multichannel pipette. Once all columns were filled, the plate was resealed and centrifuged at 500 g for 2 minutes. Using a thermocycler, cDNA was synthesized as follows: 5 minutes at 25°C; 45 minutes at 42°C; 5 minutes at 85°C; hold at 4°C. After cDNA synthesis, the plate was stored at -20 ⁰ C .

[ 0081 ] Frist Round of Polymerase chain reaction (PCR) . Nested PCR was carried out to amplify the TCR a and b chain from the single cells. The reaction mixture, which included the primers listed in Tables 1 and 2 (human) or Tables 5 and 6 (mouse), for the first round PCR (384 well plate) is provided in Table 10.

TABLE 10

[ 0082 ] To each sample and control well of the cDNA plate was add 9 mΐ of the master mix. The plate was resealed and centrifuged to bring the contents to the bottom of the wells. PCR of human TCRcf was carried out in a thermocycler as follows: initial denaturation at 95 °C for 5 minutes; 34 cycles of denaturation at 95°C for 20 seconds, primer annealing at 52 °C for 20 seconds, polymerase extension at 72° for 45 seconds; a final extension at 72 °C for 7 minutes and a final hold at 4°C. PCR of mouse TCRoip was carried out as described for human TCR p except that primer annealing was carried out at 55 °C instead of 52 °C. The plates were stored at -20°C until the next step. [0083] Nested PCR. TCR alpha and beta nested PCR reactions were set up separately. Accordingly, identical wells of alpha and beta plates are from a single cell. For example, well position Al of the alpha plate will pair with Al of the beta plate (from a single cell) . The primer set for internal PCR for next generation sequencing platform is distinct from Sanger's sequencing method. Each plate is barcoded with a unique reverse primer (6 nucleotide barcode) . Thus, each plate is labeled with the barcode name (A, B, C, etc.) so that multiple plates can be pooled for the sequencing run.

[0084] Two master mixes (alpha and beta) were prepared as provided in Table 11 for the second round PCR (384-well plate) .

TABLE 11

[0085] To each well of the alpha and beta plates was add 9 mΐ of the master mix and 1 mΐ of the amplicons from the first round. The plate was resealed and centrifuged to bring the contents to the bottom of the wells. PCR of human TCRap was carried out in a thermocycler as follows: initial denaturation at 95 °C for 5 minutes; 34 cycles of denaturation at 95°C for 20 seconds, primer annealing at 52 °C for 20 seconds, polymerase extension at 12° for 45 seconds; a final extension at 72 °C for 7 minutes and a final hold at 4°C. PCR of mouse TOKab was carried out as described for human TCRap except that primer annealing was carried out at 55°C instead of 52 °C. Samples in the plates were immediately analyzed by agarose gel (2% in IX TAE buffer) electrophoresis (2.5 mΐ) and stored at -80°C.

[0086] Sequencing. Following PCR, the plates were stored at -80°C until all the intended plates were amplified. All the 384-well plates (with different barcoded reverse primers) were pooled into a single 384-well plate using an Integra Viaflo pipetting system. Amplicons were sequenced on an ILLUMINA® MiSeq platform, which uses reversible-terminator sequencing by synthesis technology. A schematic of the single cell paired TCR deep sequencing method of the invention is presented in FIG. 1.

Example 2 : Materials and Methods

[0087] Using the method of the invention, a single cell paired TCR deep sequencing experiment was conducted. The results of this analysis indicated an increase in paired data (FIG. 2) . In addition, it was observed that most of the cells were monoallelic and approximately 10% of the cells were dual in-frame TCR a cells (FIG. 3) . Notably, the method allowed for 384-well multiplexing, plate multiplexing, and single cell level paired data (~1500 to 3000), which is efficient and economically viable as reagent use was decreased.

Previous Patent: BASE STATION ANTENNA

Next Patent: KRAFT PULP MILL SCALE CONTROL WITH END GROUP MODIFIED POLYCARBOXYLATES