Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
SUPER HLA TYPING METHOD AND KIT THEREOF
Document Type and Number:
WIPO Patent Application WO/2019/229649
Kind Code:
A1
Abstract:
The present invention relates to sequencing of super Human Leukocyte Antigen (HLA) Typing for class I and class II genes. It is an object of the present invention to provide accurate full length sequence of HLA genes, preferably, class I (HLA-A, HLA-B, HLA-C) and class II genes. It is an object of the present invention to provide HLA kit to detect allele sequences of class I and class II genes. The invention also relates to a method and kit for full length HLA gene sequences. The present invention method for super HLA Typing, gene sequencing and analysis comprising: (a) designing and synthesizing the set of forward and reverse primers for full length amplification of HLA gene class I (HLA-A, B, C) and class II (HLA- DQB1, DRB1, DPB1) selected from group consisting of SEQ ID 1 to 24; (b) amplification of the test sample HLA gene by using the set of primer synthesized at step (a) to obtain PCR amplicon; (c) separation of the PCR amplicon obtained at step (b); (d) sequencing of the separated amplicon of step (c); and (e) the sequences thus generated analyzed by matching with HLA gene from 5' UTR to 3' UTR. Primers set for Super HLA Typing, gene sequencing and analysis comprising the sequences selected from group consisting of SEQ ID No. 1 to 24. This invention developed end to end Super HLA Typing kit by integrating DNA isolation from different cell types, library preparation, next generation sequencing and allele calling analysis.

Inventors:
GOWDA MALALI (IN)
Application Number:
PCT/IB2019/054409
Publication Date:
December 05, 2019
Filing Date:
May 28, 2019
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
GOWDA MALALI (IN)
International Classes:
C12Q1/68
Foreign References:
US20160060695A12016-03-03
US20170029886A12017-02-02
Other References:
CLARK ET AL.: "P050 Rapid, high resolution HLA genotyping using nanopore sequencing", HUMAN IMMUNOLOGY, vol. 78, September 2017 (2017-09-01), pages 90, XP085167654
Attorney, Agent or Firm:
BASAVA, Sharana (IN)
Download PDF:
Claims:
I Claim,

1. Method for Super HLA Typing, gene sequencing and analysis comprising:

(a) designing and synthesizing the set of forward and reverse primers for full length

amplification of HLA gene class I (HLA- A, B, C) and class II (HLA DQB1, DRB1,

DPB1) selected from group consisting of SEQ ID 1 to 24;

(b) amplification of the test sample HLA gene by using the set of primer synthesized at step

(a) to obtain PCR amplicon;

(c) separation of the PCR amplicon obtained at step (b);

(d) sequencing of the separated amplicon of step (c); and

(e) the sequences thus generated analyzed by matching with HLA gene from 5’ UTR to 3’

UTR.

2. Method as claimed in claim 1 wherein the primers for super HLA genes designed by using one of the longest sequences as“reference sequence”, other HLA gene sequences from IMGT were aligned to reference, and then most conserved region was selected for designing primer sequence.

3. Method as claimed in claim 1 wherein the amplificationis carried with long range Taq polymerase enzyme andthe primers were designed in a way that all the primer pairs for class I and class II HLA genes, class I (HLA-A, B, C) and class II (HLA-DRB1, DQB1, DPB1) have common annealing temperature for PCR amplification.

4. Method as claimed in claim 1 wherein the set of primers for amplification of HLA-A gene selected are SEQ ID No. 1 and SEQ ID No 2.

5. Method as claimed in claim 1 wherein the set of primers for amplification of HLA-B gene selected are SEQ ID No. 3 and SEQ ID No 4.

6. Method as claimed in claim 1 wherein the set of primers for amplification of HLA-C gene selected are SEQ ID No. 5 and SEQ ID No 6.

7. Method as claimed in claim 1 wherein the set of primers for amplification of HLA-DRB 1 gene selected are SEQ ID No. 7 to SEQ ID No 12.

8. Method as claimed in claim 1 wherein the set of primers for amplification of HLA-DQB1 gene selected are SEQ ID No. SEQ ID No. 13 to SEQ ID No 18

9. Method as claimed in claim 1 wherein the set of primers for amplification of HLA-DPB1 gene selected are SEQ ID No. SEQ ID No. 19 to SEQ ID No 24.

10. Method as claimed in claim 1 wherein the obtained sequences optionally were subjected for homology search in the database and BLAST (www.ncbi.nlm.nih.gov/BLAST) against NCBI database to find the best match.

11. Primers for Super HLA Typing gene sequencing and analysis comprising the sequences selected from group consisting of SEQ ID No. 1 to 24.

12. Kit for super HLA gene sequencing and analysis comprising the set of primers for amplification of super HLA genes as claimed in claim 1 and integrated bioinformatics tools along with the HLA- allele calling.

Description:
SUPER HLA TYPING METHOD AND KIT THEREOF

FIELD OF INVENTION

[1] The present invention relates to sequencing of full-length Human Leukocyte Antigen (HLA) typing for class I and class II genes. It is an object of the present invention to provide accurate full length sequence of HLA genes, class I (HLA-A, HLA-B, HLA-C) and class II (HLA-DRB1, HLA-DQB1 and HLA-DPB1) genes using next generation sequencing technologies. It is an object of the present invention to provide HLA kit to detect allele sequences of class I and class II genes. The invention also relates to a method and kit for full length HLA gene sequences.

BACKGROUND OF THE INVENTION

[2] The major histocompatibility complex (MHC) consists of immunologically important genes that encode for HLA proteins, which help the immune system to recognize foreign antigens (pathogens and allergens). HLA genes are present on short arm of chromosome 6, which have been extensively studied in human population, HLA genes being the most polymorphic loci in the human genome.

[3] The HLA class I and class II genes are a component of the human MHC. The class I genes consist of the three classical genes encoding the major transplantation antigens HLA-A, HLA-B and HLA-C. The class II genes consist of three classical genes encoding the major transplantation antigens, HLA-DR, HLA-DQ and HLA-DP.

[4] HLA system plays an important role in organ transplantation, infectious diseases and genetic disorders. HLA genes encode cell membrane proteins, which recognize the self or non-self of cell types.

[5] HLA genes have been associated with cell type identification in organ transplantation, many autoimmune diseases, cancer, drug hypersensitivity and allergy responses, infectious diseases (viral and bacterial), anthropological studies (population genetics), vaccines and fingerprinting (paternity and forensic study) etc. In addition, numerous HLA alleles have been identified as signatures of risk indices of susceptibility to diseases, biomarkers of drug hypersensitivities, predictors of graft outcome post transplantation and tags of ethnic diversity.

[6] There are reports in the patent and non-patent literature which reported for the sequencing of HLA genes, the important ones include US 2016/0208326 which discloses a method and kit for. This method for the DNA typing of HLA, which is characterized by comprising: (1) a step of preparing a set of primers which can respectively anneal specifically to an upstream region and a downstream region of each of HLA-A, HLA-B, HLA-C, HLA-DQA1, HLA-DQB1, HLA-DPA1 and HLA-DPB I gene in the nucleotide sequence for the human genome, and a set of primers which can respectively anneal specifically to exon-2 and a 3 '-side non-translafed region in HLA-DRB1; (2) a step of earning out the PCR amplification of a sample to be tested (DNA) using the sets of primers; (3) a step of determining the nucleotide sequence for a PCR-ainplified product; and (4) an optional step of carrying out the homology search in a database. This invention may not provide complete gene sequence of class II HLA gene, DRB1. This invention uses short read method of NGS using Roche 454 sequencing technology. Short reads stitched together by bioinformatics tools, which may create inaccurate alleles uring allele phasing. Our invention provides complete gene sequence for class I (HLA- A. HLA-B, I-ILA-C) and class II (HLA-DRB 1 , HLA-DQB 1 and HLA-DPB1).

[7] US 2017/0029885 disclose a method and kit for the DNA profiling of HLA genes using a high-throughput massively parallel sequencer. A method for the DNA profiling of HLA genes, said method being characterized by including; (1) a step for preparing a primer set that anneals specifically to exon 4 and intron 1 and includes exon 2 and exon 3 of at least one target gene selected from the group consisting of HLA-DRB I, HLA-DRB3, HLA- DRB4, HLA-DRB5, HLA-DQB 1 and HLA-DPB 1 in the base sequence of the human genome; (21 a step for amplifying a sample (DNA) by PCR using the primer set; (3) a step for determining the base sequence of the amplified PCR product; and (4) a step for carrying out a homology search against a database. This invention may not provide complete gene sequence of class I and class II HLA genes. This claim used short read next generation sequencing method, which fragment sequences and fuse reads of sequences to generate phasing of alleles. This may create inaccurate phased HLA alleles. Our invention provides complete gene sequence for class I (HLA-A, HLA-B, HLA-C) and class II (HLA- DRB 1, HLA-DQB 1 and HLA-DPB1).

[8] CN101962676A invention discloses a human leukocyte antigen HLA-A and HLA-B gene full-length sequencing method and an HLA gene sequencing and typing method. The HLA-A and HLA-B gene full-length sequencing method comprises the following steps of: a, performing PCR amplification on about 4kb full-length sequences of HLA-A and HLA-B genes by using a pair of primers respectively; and b, cloning the amplification products to a pGEM-T easy vector, sequencing the full-length sequences by using ten walking sequencing primers in positive and negative directions respectively, and totally obtaining 38 allele 4.3 Kb full-length sequences of the HLA-A and 30 allele 3.7 Kb full- length sequences of the HLA-B. The HLA-A and HLA-B sequencing and typing method comprises the following steps of: performing PCR amplification on typing target areas of the HLA-A and HLA-B by using two pairs of primers respectively; and performing two directional sequencing on products by using fourteen sequencing primers respectively, wherein the HLA-DRB 1 and HLA-DQB 1 sequencing and typing method comprises the following steps of: amplifying sequences of second and third exons of DRB1 and DQB1 by adopting group specificity primers respectively; performing two directional sequencing on the second and third exons of the DRB 1 by adopting eight group specificity primers and three sequencing primers; and performing two directional sequencing on the second and third exons of the DQB1 by adopting four sequencing primers, respectively. This invention seems to be cumbersome because they adopted cloning strategy and Sanger sequencing by primer walking. The invention is only restricted to class I genes, HLA-A, B, and only second and third exons of HLA-DRB 1 and DQB1 genes.

[9] Danzer et. al. 2013 [Martin Danzer, Norbert Niklas, Stephanie Stabentheiner, Katja Hofer, Johannes Proll, Christina Stuckler, Rapid, Edeltraud Raml, Helene Polin and Christian Gabriel, BMC Genomics 2013, 14:221] discloses scalable and highly automated HLA genotyping using next-generation sequencing: a transition from research to diagnostics. This invention involves selective amplification of exonic and intronic regions of class I (HLA-A, B, C) and class II (HLA-A DRB1, DQB1, DPB1) genes.

[10] Clark et. al. 2016 [Peter M. Clark, Jamie L. Duke, Deborah Lerriola, Valia Bravo- Egana, Tunde Vago, Aniqa Hassan, Anna Papazoglou, and Dimitri Monos, Clinical Chemistry 62: 12, 1630-1638 (2016)] discloses generation of full-length class I human leukocyte antigen gene consensus sequences for novel allele characterization. This invention provides full length amplification of only class I (HLA-A, HLA-B, HLA-C) genes. Wittig et. al., 2015 [Michael Wittig, Jarl A. Anmarkrud, Jan C. K ' assens, Simon Koch, Michael Lorster,Eva Ellinghaus, Johannes R. Hov, Sascha Sauer, Manfred Schimmler,Malte Ziemann, Siegfried G " org, Lrank Jacob, Tom H. Karlsenand Andre Lranke, Nucleic Acids Research, 2015, Vol. 43, No. 11 e70]“Development of a high- resolution NGS-based HLA-typing and analysis pipeline”. This invention uses target enrichment approach for high resol udon HLA sequencing which may not provide complete gene sequence of class I HLA and class II HLA genes.

[11] Yin et al., 2016 [Yuxin Yin, James H. Lan, David Nguyen, Nicole Valenzuela, Ping Takemura, Yung-Tsi Bolon, Brianna Springer, Katsuyuki Saito, Ying Zheng, Tim Hague, Agnes Pasztor, Gyorgy Horvath, Krisztina Rigo, Elaine L. Reed, Qiuheng Zhang, PLOS ONE | D01: l0.l37l/journal.pone.0l658l0 October 31, 2016] which discloses application of high-throughput next-generation sequencing for HLA typing on buccal extracted DNA: results from over 10,000 donor recruitment samples. This invention may not provide complete gene sequence of class I HLA and class II HLA genes.

[12] Manish J et al. 2017 [Manish J. Gandhi, MD; Deborah Lerriola, BS; Yanping Huang, MD, PhD; Jamie L. Duke, PhD; Dimitri Monos, PhD, Arch Pathol Lab Med— Vol 141, June 2017] which discloses targeted next-generation sequencing for human leukocyte antigen typing in a clinical laboratory metrics of relevance and considerations for its successful implementation. This invention may not provide complete gene sequence of class I HLA and class II HLA genes.

[13] Hosomichi et al., 2015 [Kazuyoshi Hosomichi, Takashi Shiina, Atsushi Tajima and Ituro Inoue, Journal of Human Genetics (2015) 60, 665-673] which discloses the impact of next-generation sequencing technologies on HLA research. Our invention provides complete gene sequence for class I (HLA- A, HLA-B, HLA-C) and class II (HLA-DRB1, HLA-DQB 1 and HLA-DPB1).

[14] Larjo et. al., 2017 [Antti Larjo, Robert Eveleigh, Elina Kilpelainen, Tony Kwan, Tomi Pastinen, Satu Koskela and Jukka Partanen, Frontiers in Immunology www.frontiersin.org, December 2017 | Volume 8 | Article 1815] which discloses accuracy of programs for the determination of human leukocyte antigen alleles from next- generation sequencing data. This invention utilizes low and medium resolution techniques for HLA typing and analysis, which may not provide complete gene sequence of class I and class II HLA genes.

[15] Conventionally, serological, sequence-specific oligonucleotide (SSO) and sequence-specific primer (SSP) and Sanger sequencing methods have been used to analyzing HLA loci for organ transplantation. These methods are based on low-resolution of HLA typing, which give ambiguity for transplantation donor finding.

[16] Though above reports do HLA typing by sequencing either by Sanger or short read technologies. There are limitations or drawbacks in HLA typing and sequencing of full length HLA I and II and data analysis to find out haplotype of HLA alleles. The recent next generation sequencing (NGS) technologies have decreased the cost per base sequencing. However, the limitation of above inventions and publications produced data from short reads from NGS methods. This may not provide complete phasing and haplotype information of parental HLA alleles, thus creating confusion among clinicians and doctors for accurate HLA alleles matching between organ donor and recipient for transplantation. NGS technologies have potential advantages over the Sanger method by increasing HLA typing sequencing depth, speed, multiplexing, resolution and low cost per sample (Lind C, Ferriola D, Mackiewicz K, Heron S, Rogers M, et al. (2010). Next generation sequencing: the solution for high-resolution, unambiguous human leukocyte antigen typing. Hum Immunol 71 : l033-l042.Wittig M, Anmarkrud JA, Kassens JC, Koch S, Forster M, et al. (2015). Development of a high-resolution NGS-based HLA- typing and analysis pipeline. Nucleic Acids Res 43: e70). NGS allows clonal amplification for the region sequenced and can also get a high-resolution. NGS technologies are more sensitive to detect rare alleles as compared to Sanger sequencing. (Cost-efficient high- throughput HLA typing by MiSeq amplicon sequencing: Vinzenz Lange, Irina Bohme, Jan Hofmann, Kathrin Lang, Jtirgen Sauter, Bianca Schone, Patrick Paul, Viviane Albrecht, Johanna M Andreas, Daniel M Baier, Jochen Nething, Ulf Ehninger, Carmen Schwarzelt, Julia Pingel, Gerhard Ehninger, and Alexander H Schmidt, BMC Genomics. 2014; 15: 63). The major problem in short read NGS technologies is that these may not resolve phasing of parental alleles. To overcome above limitations, there exists a need for development of a method and kit for full length DNA sequencing for class I and class II HLA genes and accurate allele segregation during analysis. The present invention inventor has developed an efficient novel method and a kit for Super HLA Typing, sequencing by long read third generation sequencing (Oxford Nanopore) technology. This method can sequence complete HLA class I and II genes including introns, exons and part of UTR regions of HLA genes. Our approach provides one-time solution with reduced cost per sample and subsequently assists in building the HLA database for population.

OBJECTIVES OF THE INVENTION

[17] The primary object of the present invention is the development of Super HLA Typing method for class I and class II genes by providing accurate full length sequence of HLA genes and alleles.

[18] The other object of the present invention is the development of method for Super HLA Typing for class I and class II genes by providing accurate full length sequence of HLA class I (HLA- A, HLA-B, HLA-C) and HLA class II (HLA-DRB1, HLA-DQB 1 and HLA-DPB 1 ) genes.

[19] Another object of the present invention to develop the Super HLA Typing kit to detect allele sequences of HLA class I and class II genes.

[20] The other object of the present invention is the development of method and kit for Super HLA Typing for class I and class II genes by providing accurate full length sequence of HLA genes by using long read sequencing technology, which is cost-effective method.

[21] By the other object of the present invention is the development of method and kit for Super HLA Typing for class I and class II genes providing accurate full length sequence of HLA genes, which are easy to use with little technical expertise.

BRIEF DESCRIPTION OF THE DRAWINGS

[22] FIG 1 : Diagrammatic representation of HLA-A primer location

[23] FIG 2: Diagrammatic representation of HLA-B primer location

[24] FIG 3 : Diagrammatic representation of HLA-C primer location

[25] FIG 4: Gel image of PCR amplified products of HLA-A, B, C [26] FIG 5: Diagrammatic representation of HLA-DQB1 primer location

[27] FIG 6: Gel image of PCR amplified products of HLA-DQB1

[28] FIG 7 : Diagrammatic representation of HLA-DRB 1 primer location

[29] FIG 8: Gel image of PCR amplified products of HLA-DRB 1

[30] FIG 9: Diagrammatic representation of HLA-DPB1 primer location

[31] FIG 10: Gel image of PCR amplified products of HLA-DPB1.

[32] FIG 11 : Diagrammatic representation of method to capture and sequence full length

HLA genes

STATEMENT OF THE INVENTION

Method for Super HLA Typing, gene sequencing and analysis comprising:

(a) designing and synthesizing the set of forward and reverse primers for full length

amplification of HLA genes class I (HLA- A, B, C) and class II (HLA-DQB1, DRB1,

DPB1) selected from group consisting of SEQ ID 1 to 24;

(b) amplification of the test sample HLA gene by using the set of primer synthesized at step

(a) to obtain PCR amplicon;

(c) separation of the PCR amplicon obtained at step (b);

(d) sequencing of the separated amplicon of step (c); and

(e) the sequences thus generated analyzed by matching with HLA gene from 5’ UTR

(UnTranslated Region) to 3’ UTR (UnTranslated Region).

The primers in Super HLA Typing for HLA genes have been designed by using one of the longest sequences as“reference sequence”, other HLA gene sequences from IMGT were aligned to reference, and then most conserved region was selected for designing primer sequence. The amplification is carried out with long range Taq polymerase enzyme.

The set of primers for amplification of HLA- A gene selected are SEQ ID No. 1 and SEQ ID No 2. The set of primers for amplification of HLA-B gene selected are SEQ ID No. 3 and SEQ ID No 4. Method as claimed in claim 1 wherein the set of primers for amplification of HLA-C gene selected are SEQ ID No. 5 and SEQ ID No 6.

The set of primers for amplification of HLA-DRB 1 gene selected are SEQ ID No. 7 to SEQ ID No 12. The set of primers for amplification of HLA-DQB1 gene selected are SEQ ID No. SEQ ID No. 13 to SEQ ID No 18

The set of primers for amplification of HLA-DPB1 gene selected are SEQ ID No. SEQ ID No. 19 to SEQ ID No 24.

The obtained sequences in the method optionally were subjected for homology search in the database and BLAST (www.ncbi.nlm.nih.gov/BLAST) against NCBI database to find the best match.

Primers for HLA gene amplification, sequencing and analysis comprising the selected from group consisting of SEQ ID No. 1 to 24.

Super HLA Typing kit for HLA gene sequencing and analysis comprising the set of primers for amplification of HLA genes selected from group consisting of SEQ ID No. 1 to 24 and integrated bioinformatics tools along with the allele calling pipeline.

DETAILED DESCRIPTION OF THE INVENTION

[33] The present disclosure relates to a method and the kit for Super HLA Typing for

class I and class II genes.

[34] Those skilled in the art will be aware that the present disclosure is subject to

variations and modifications other than those specifically described. It is to be understood that the present disclosure includes all such variations and modifications. The below illustrations are for the understanding of the invention and should in no way be constructed to be limiting.

[35] This invention used blood, buccal swab and saliva for HLA genes amplification

using long range PCR.

[36] Buccal sample was collected using ORACollect DNA swab kit (Ref. No OCR-lOO)

from DNA Genotek Ontario, Canada. The DNA isolation was performed using the standardized ORACollect DNA buccal swab prepIT-L2P DNA extraction protocol (Cat No:Q-33 l30) as per the manufacturer’s protocol. The extracted DNA was checked for quality using 0.8% agarose gel and quantified using Nanodrop and Qubit.

[37] Also I tested saliva collection and DNA isolation from human samples using BGC

(Bengaluru Genomics Center) DNA isolation kit (Cat. No: B 10241). In this method, the intake of any food or drink was restricted for two hours for sample collection. Before collection of samples, subjects were asked to rinse mouth twice with 50 mL distilled water and asked them to drink 50 mL of water to reduce the microbial DNA. About 1 to 0.5 mL of saliva was collected by direct spitting of saliva in 15 mL tube that containing 1 mL of storage buffer. Saliva sample was mixed properly with storage buffer by inversion of tubes several times. DNA was isolated according to manufacturer’s protocol and the extracted DNA was checked for the quality using 0.8% agarose gel and quantified using Nanodrop and Qubit.

[38] Genomic DNA was extracted form 500 pL of blood using QIAamp DNA Blood

Mini Kit (Cat No./ID: 51104) according to the manufacturer’s protocol. The extracted DNA was checked for quality using 0.8% agarose gel and quantified using Nanodrop and Qubit.

[39] PCR amplification of class I (HLA-A, B, C) and class II (HLA-DRB1, DQB1,

DPB1) genes was performed using Roche’s Kapa HiF Hotstart Readymix PCR Kit (Code:

KK2601 07958927001). The primers used for amplification are as described in Table: 1. PCR was performed in 10 m L reaction volume containing 60-l00ng of genomic DNA, 5 m L Kapa HiFi Hotstart Readymix, 0.5 m L of Forward and Reverse Primers (lOpm/ m L). The cycling parameters for the PCR are as follows: 95°C for 3 min (first denaturing step), 35 cycles of 98 °C for 20 sec, 65 °C for 15 sec, 72 °C for 7 min and 72 °C for 10 min (last extension step). The primers were designed in such a way that all the PCR primer pairs for class I and class II have common annealing temperature. The PCR reaction was carried out in Applied Biosystem’s Veriti 96-well Thermal Cycler (Cat No: 4375786).

[40] Details of gene structure of class I (HLA- A, B , C) and class II (HLA-DRB 1 , DQB 1 , DPB 1 ) is depicted in Fig. 11.

[41] In accordance with the embodiments of the present invention, HLA-A gene was studied by designing specific primer sequences. The primer sequence location is depicted in Fig. 1. The amplified product of primer sequence is shown in Fig. 4. Then the forward primers to the 5’ UTR region (12-30 bp) were designed, and reverse primers were designed from 3486 to 3507 bp. The said primer was synthesized using oligo synthesizer. These oligo bases were used to amplify HLA-A from the human blood and saliva samples. I obtained 3474-3488 bp of HLA-A amplicon as depicted in Fig. 4. This band was purified from 1% agarose gel and sequenced using third generation sequencing (Oxford Nanopore) method. Then the high quality sequences were BLAST (www.ncbi.nlm.nih.gov/BLAST) against NCBI and IMGT databases to find the best match. The PCR amplicons generated thus were matched to HLA-A gene from 5’ UTR to 3’ UTR (12-3507 bp) as expected.

[42] Similarly, HLA-B gene was studied using specific primers (Fig. 2), PCR amplification (Fig. 4) and analysis. All the full length HLA-B alleles were (HLA-B gene IDs as shown in Table 2) downloaded from NCBI, then aligned all these sequences to reference sequence (NCBI ID: KX774745) to locate SNPs across the HLA-B gene length. Then designed the forward primers to the 5’ UTR region (20-41 bp), and reverse primers designed from 3369-3389 bp. Then primers were synthesized using oligo synthesizer. These oligo bases are used to amplify HLA-B from the human blood and saliva samples. I obtained 3366-3384 bp of HLA-B amplicon. This band was purified from 1% agarose gel and sequenced using third generation sequencing (Oxford Nanopore) method. Then high quality sequences were BLAST (www.ncbi.nlm.nih.gov/BLAST) against NCBI database to find the best match. The PCR amplicons thus generated matched HLA gene from 5’ UTR to 3’ UTR (20-3389) as expected.

[43] HLA-C gene was studied using specific primers (Fig. 3), PCR amplification (Fig.

4), sequencing and analysis. All the full length HLA-C alleles were (HLA-C NCBI IDs as shown in Table 3) downloaded from NCBI database, then aligned all these sequences to reference sequence (NCBI ID: KX649940) to locate SNPs across the HLA-C gene length. Then designed the forward primers to the 5’ UTR region (19-42 bp), and reverse primers designed from 3415-3435 bp. Then primers synthesized using oligo synthesizer. These oligo bases are used to amplify HLA-C from the human blood and saliva samples. I obtained 3415-3423 bp of HLA-C amplicon. This band was purified from 1% agarose gel and sequenced using third generation sequencing (Oxford Nanopore) method. Then high quality sequences were BLAST (www.ncbi.nlm.nih.gov/BLAST) against NCBI sequence database to find the best match. The PCR amplicons thus generated matched to HLA gene from 5’ UTR to 3’ UTR (19-3435).

[44] In accordance with the embodiments of the present invention, HLA-DQB1 gene was studied by designing three specific primer sequences because of the large gene sequence (7230 bp). The primer sequences location is depicted in Fig. 5. Three sets of primers were designed in-order to cover the full length. Three amplicon sizes were 2936 (Forward: 56-84, Reverse: 2968-2991), 2638 (Forward: 2822-2845, Reverse: 5436-5459) and 1971 (Forward: 5141-5164, Reverse: 7090-7111) respectively. The said primers were synthesized using oligo synthesizer. These oligo bases were used to amplify HLA-DQB 1 from the human blood and saliva samples. I obtained 3474-3488 bp of HLA-DQB 1 amplicon as depicted in Fig. 4. This band was purified from 1 % agarose gel and sequenced using third generation sequencing (Oxford Nanopore) method. Then the high quality sequences were BLAST (www.ncbi.nlm.nih.gov/BLAST) against NCBI and IMGT databases to find the best match. The PCR amplicons generated thus were matched to HLA-DQB 1 gene from 5’ UTR to 3’ UTR as expected.

[45] HLA-DRB1 gene was studied using specific primers (Fig. 7), PCR amplification (Fig. 8), sequencing and analysis. All the full length HLA-DRB1 alleles were downloaded from IMGT database, then aligned all these sequences to reference sequence (IMGT ID: HLA: HLA03486) to locate SNPs across the HLA-DRB1 gene length. The total gene length is 16120 bp. Three sets of primers were designed in-order to cover the full length. The three amplicon sizes were 6896 (Forward: 41-62, Reverse: 6915-6937), 6956 (Forward: 6003-6030, Reverse: 12936-12959) and 4938 (Forward: 10977-10997, Reverse: 15895-15915) respectively. Then primers synthesized using oligo synthesizer. These oligo bases are used to amplify HLA-DRB1 from the human blood and saliva samples. This band was purified from 1% agarose gel and sequenced using third generation sequencing (Oxford Nanopore) method. Then high quality sequences were BLAST (www.ncbi.nlm.nih.gov/BLAST) against NCBI and IMGT databases to find the best match. Our results showed that PCR amplicons matched to HLA gene from 5’ UTR to 3’ UTR (41-15915).

[46] HLA-DPB1 gene was studied using specific primers (Fig. 9), PCR amplification (Fig. 10), sequencing and analysis. All the full length HLA-DPB1 alleles were (IMGT HLA-DPB1 gene IDs is shown in Table 6) downloaded from IMGT database, then aligned all these sequences to reference sequence (IMGT ID: HLA: HLA00517) to locate SNPs across the HLA-DPB1 gene length. The total gene length is 11532 bp. Three sets of primers were designed in-order to cover the full length. The three amplicon sizes were 3856 (Forward: 7-27, Reverse: 3841-3862), 3990 (Forward: 3518-3541, Reverse: 7486- 7507) and 3922 (Forward: 7054-7078, Reverse: 10936-10975) respectively. Then primers synthesized using oligo synthesizer. These oligo bases are used to amplify HLA-DPB 1 from the human blood and saliva samples. This band was purified from 1% agarose gel and sequenced using third generation sequencing (Oxford Nanopore) method. Then high quality sequences were BLAST (www.ncbi.nlm.nih.gov/BLAST) against NCBI and IMGT databases to find the best match. Our results showed that PCR amplicons matched to HLA gene from 5’ UTR to 3’ UTR (7-10975).

Long range PCR amplification

[47] The amplification of full length HLA gene of class I and II was standardized using long range Taq DNA polymerase (Kapa Kit, Code: KK2601 07958927001). Since the amplicons size ranges from 2 kb to 7 kb, long range PCR amplification kit was used for amplification. The amplified PCR product of class I and class II were purified using gel based purification method (QIAquick Gel Extraction Kit; cat no: 28704) and the size of amplicons was checked on agarose gel and were quantified using Qubit Fluorometer. The amplicons were pooled equimolar and purified using AMPure XP beads and quantified using Qubit Fluorometer. After the PCR amplification, the amplified genes were confirmed using Sanger sequencing.

Nanopore library preparation and Sequencing

[48] For Nanopore, the library was prepared using 1D sequencing kit (Cat. No SQK- LS108) as per manufacturer’s protocol. The pooled amplicons were subjected to end repair and dA-tailing. The end prepped DNA was quantified using Qubit (Qubit™ dsDNA HS Assay Kit; Cat No: Q32854) followed by ONT adapter ligation. After purification the library was mixed with library loading beads and loaded onto R9 Flowcell to be sequenced for 48 hours on Minion device. Nanopore sequences the entire length of the molecule presented to it regardless of the length of the molecule.

Super HLA Typing Technology - Complete Sequence Information [49] In this study, I developed high throughput HLA typing using second and third generation sequencing to obtain full length HLA Class I and Class II genes. The details of HLA genes from both class I (HLA-A, HLA-B and HLA-C) and class II (HLA-DPB 1 , DQB 1 and DRB1) was given in Table 1. HLA gene sequences were retrieved from IMGT database (Comparative analyses of Low, Medium and High-Resolution HLA Typing Technologies for Human Populations; Malali Gowda, Sheetal Ambardar, Nutan Dighe, Ashwini Manjunath, Chandana Shankaralingu, Pradeep Hirannaiah, John Harting, Swati Ranade, Latha Jagannathan and Sudhir Krishna, March 09, 2016 J Clin Cell Immunol 7:399) for the designing forward and reverse primer sequences. Using one of the longest sequences as“reference sequence”, other HLA gene sequences from IMGT were aligned to reference, and then most conserved region was selected for designing primer sequences. The amplified PCR products from these primer pairs were amplified using long range Taq polymerase enzyme. The band length was confirmed on agarose gel (Figure 1B) and sequence identified was confirmed by Sanger sequencing. The sequences obtained from primer pairs for class I and class II HLA genes were matched with the respective HLA genes. Subsequently, PCR amplicons were sequenced using Oxford Nanopore sequencing.

[50] Super HLA Typing method is end-to-end solution, user friendly and cost effective.

This can be adopted for HLA registry typing compared to expensive kits used in the current HLA market. I standardized the amplification of full length HLA genes of class I and II.

[51] Third, I have developed Super HLA Typing kit which includes bioinformatics. I integrated bioinformatics tools and developed the allele calling pipeline.

[52] The current HLA-markets in the field of organ transplantation are providing HLA typing services using low resolution typing, incomplete haplotype, which is not sufficient for finding accurate HLA match between donor(s) and recipient.

EXAMPLES

Construction of Primers. PCR amplification. Sequencing and Database analysis of HLA

EXAMPLE 1

Gene information

[53] The HLA genes information of HLA-A, HLA-B, HLA-C, HLA-DRB 1 , HLA- DQB 1 , HLA-DPB 1 were obtained from publicly available international databases such as

Primer designing procedure

[54] Primers for HLA genes were designed to capture full length of HLA sequence. The primers were validated by in-silico PCR amplification for amplicon length and primer position with the reference sequences.

EXAMPLE 3

Artificial synthesis of Primer sequences for HLA genes

Primers of HLA-A, HLA-B, HLA-C, HLA-DQB 1 , HLA-DRB 1 and HLA-DPB 1 genes were synthesized using chemical method (Optimized Automated Solid Phase Synthesis of Oligonucleotides and Derivatives Gabriel A lvarado Urbina, G erald G riibler3, Angelika Weilerb, Hartmut Echnerb, Stanka Stoevab, Johann Schernthaner, Waleri Gross, Wolfgang Voelterb 53b, 1051-1068 (1998) Z. Naturforsch)

[55] The novel primer sequences of class I (HLA-A, HLA-B, HLA-C) were diagrammatically depicted in Lig 1 -3 for class I HLA genes and their PCR amplification products are shown in Pig 4. Lor Class II genes (HLA-DQB 1, HLA-DRB 1 and HLA- DPB 1), the novel primer sequences that are diagrammatically depicted in Pig 5, 7, 9 for class II HLA genes and their PCR amplification products are shown in PIGs. 6, 8 and 10, respectively.

Primer sequences at the beginning and end of each HLA genes were amplified. The size of amplification corresponds to gene length. This confirms primers of the present invention amplify full length of HLA genes.

EXAMPLE 5

Sequencing of PCR products

[56] PCR amplified HLA products that were gel purified, were then sequenced using long read third generation sequencing, Oxford Nanopore.

EXAMPLE 6

Sequence data analysis

[57] The obtained sequence results were BLAST against HLA reference sequences (NCBI/ IMGT). The analysis showed HLA amplicon exactly matched primer position in the reference sequences. This confirmed that synthesized PCR primers were able to amplify all the HLA genes. ADVANTAGES OF THE PRESENT INVENTIION

[58] There are six classical HLA genes, which are classified into class I (HLA-A, HLA- B, HLA-C) and class II (HLA-DRB1, HLA-DQB 1 and HLA-DPB1). The present invention provides sequence information for all 6 HLA genes. Therefore the current invention information will be useful for finding match of organ for transplantation.

[59] The previously established HLA typing methods including sequence specific primer (SSP), sequence specific oligonucleotide (SSO) are based on known alleles of HLA gene sequences (e.g., exon 2 and 3 of class I; and exon 2 of class II). These methods only provide low (serological) resolution data of HLA alleles. These methods only provide low resolution and 2-digits information of HLA alleles. The advantage of present invention is that the present method identifies both known and unknown rare alleles of HLA genes.

[60] The previously established HLA typing method by Sanger Sequencing provides limited regions of HLA gene sequences (e.g., exon 2 and 3 of class I, and exon 2 of class II). But this method has certain disadvantages such as low resolution, restriction on length of data to 300-500 bases and is associated with low accuracy, high cost per sample and provides low resolution (2-digit) for HLA genes. Moreover, 41 % of HLA-A and 24% of HLA-B alleles is reported to be ambiguous using Sanger Sequencing (Adams et al. 2004). Current invention can provide a high resolution, full-length sequence of HLA genes.

[61] In the international IMGT database (ImMunoGeneTics) only less than 10 % and 5 % of the HLA sequences contain full length (UTR+EXON+INTRON) sequences for HLA class I and II, respectively. The present invention provides full length gene sequence information.

[62] The advent of second generation sequencing primers developed in prior art (EP2599877A1) claim that they have primers for short read sequencing for HLA-A, B, HLA-C and HLA-DQB 1 gene. But this invention many not resolve the haplotypes of HLA genes. The current application can provide complete haplotype information for class I (HLA-A, B, C) and class II (DQB1, DRB1 and DPB1) genes.

[63] The full length information of HLA genes and 8-digits of HLA alleles are considered to be gold standard approach to identify donors for bone marrow transplantation for the cancer, Thalassemia patients or any organ or any blood disorders.

[64] Current method provides one time solution as health record of a person including autoimmune diseases, drug hypersensitivity and allergy responses and infectious diseases (viral and bacterial).

[65] The Lull-length HLA information from current invention will be useful to trace the pedigree of a family. This will be useful for the court to judge parental information during paternity testing, sexually assaulted culprits, etc. [66] The present invention develops a kit of HLA genes amplification and sequencing using single molecule sequencing.

[67] As per the market survey, there exists a high cost for typing per sample even for low resolution HLA typing. The present super high-resolution invention has substantially reduced cost of HLA typing. This invention is the most cost effective method to date for HLA class I and II genes.

[68] This invention may applied to wide medical and forensic applications.

Table: 1 HLA Primer sequences for amplification of full length HLA genes.

Wherein the letters represented as bases are as following:

R is A or G; Y is C or T; M is A or C; K is G or T; S is C or G; W is A or T; B is C or G or T; D is A or G or T; H is A or C or T; V is A or C or G and N is A or C or G or T.

Details of class IHLA gene sequence position

Table: 2 HLA-A gene sequence position

Table: 3 HLA-B gene sequence position

Table: 4 HLA-C gene sequence position

Details of class II HLA genes sequence position Table: 5 HLA-DRB1 gene sequence position

Table: 6 HLA-DQB1 gene sequence position

Table: 7 HLA-DPB1 gene sequence position