Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHODS OF TREATING BREAST CANCER WITH GEMCITABINE THERAPY
Document Type and Number:
WIPO Patent Application WO/2014/005010
Kind Code:
A2
Abstract:
The application describes methods for predicting overall survival in subjects with breast cancer. The application also describes for screening subjects with breast cancer to determine if the breast cancer will be responsive to a breast cancer therapy including gemcitabine. The application further describes methods for treating subjects with breast cancer by screening them for the likelihood of the effectiveness of treating the cancer with a therapy including gemcitabine and administering the therapy in subjects when it is found that gemcitabine is likely to be effective.

Inventors:
FERREE SEAN M (US)
COWENS J WAYNE (US)
JORGENSEN CHARLOTTE LEVIN TYKJAER (DK)
NIELSEN TORSTEN O (CA)
EJLERTSEN BENT (DK)
Application Number:
PCT/US2013/048551
Publication Date:
January 03, 2014
Filing Date:
June 28, 2013
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NANOSTRING TECHNOLOGIES INC (US)
International Classes:
C12Q1/68; A61K31/337; A61K31/7068
Foreign References:
US20110145176A12011-06-16
Other References:
PEROU ET AL., NATURE, vol. 406, no. 6797, 2000, pages 747 - 52
SORLIE ET AL., PNAS, vol. 98, no. 19, 2001, pages 10869 - 74
PARKER ET AL., J CLIN ONCOL., vol. 27, no. 8, 2009, pages 1160 - 7
See also references of EP 2867370A4
Attorney, Agent or Firm:
ELRIFI, Ivor, R. et al. (And Popeo P.C.,One Financial Cente, Boston MA, US)
Download PDF:
Claims:
What is claimed is:

1. A method of predicting disease progression free survival in a subject having breast cancer comprising:

(a) providing a biological sample from the subject; and

(b) assaying the biological sample to determine an intrinsic breast cancer subtype, the subtype selected from the group consisting of luminal A, luminal B, basal-like, and HER-2 enriched subtypes;

wherein the intrinsic subtype is determined using a measurement of at least 40 of the genes listed in Table 1, wherein a determination of luminal A and luminal B subtypes indicates a longer disease progression free survival time period and a determination of HER2-enriched or basal-like subtype indicates a shorter disease progression free survival time period.

2. The method of claim 1 wherein the intrinsic subtype is determined using at least 45 of the genes listed in Table 1.

3. A method of predicting overall survival in a subject having breast cancer comprising:

(a) providing a biological sample from the subject; and

(b) assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype;

wherein if the biological sample is classified as a basal-like subtype, a breast cancer treatment comprising gemcitabine is more likely to prolong overall survival of the subject.

4. The method of claim 3, wherein the breast cancer is primary breast cancer.

5. The method of claim 3, wherein the breast cancer is locally advanced or metastatic breast cancer.

6. The method of claim 3, wherein assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype is performed using immunohistochemistry (IHC) or fluorescence in situ hybridization.

7. The method of claim 3, wherein assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype is performed by detecting at least 40 of the intrinsic genes listed in Table 1.

8. The method of claim 3, wherein the breast cancer treatment comprising gemcitabine further comprises one or more anti-cancer agents selected from the group consisting of anthracycline, cyclophosphamide, fluorouracil (or 5-fluorouracil or 5-FU), methotrexate, thiotepa, carboplatin, cisplatin, taxanes, paclitaxel, protein-bound paclitaxel, docetaxel, vinorelbine, tamoxifen, raloxifene, toremifene, fulvestrant, irinotecan, ixabepilone, temozolmide, topotecan, vincristine, vinblastine, eribulin, mutamycin, capecitabine, capecitabine, anastrozole, exemestane, letrozole, leuprolide, abarelix, buserlin, goserelin, megestrol acetate, risedronate, pamidronate, ibandronate, alendronate, denosumab, zoledronate, trastuzumab, tykerb or bevacizumab, or combinations thereof.

9. The method of claim 3, wherein the breast cancer treatment comprising gemcitabine comprises one or more comprises one or more taxanes.

10. The method of claim 9, wherein the taxanes are selected from the group consisting of docetaxel and paclitaxel.

11. The method of claim 3, wherein the biological sample is selected from the group consisting of a cell, tissue and bodily fluid.

12. The method of claim 11, wherein the tissue is obtained from a biopsy.

13. The method of claim 11, wherein the bodily fluid is selected from the group consisting of blood, lymph, urine, saliva and nipple aspirate.

14. The method of claim 3, wherein the biological sample is a formalin-fixed, paraffin- embedded sample.

15. A method of treating breast cancer in a subject in need thereof comprising:

(a) providing a biological sample from the subject;

(b) assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype;

(c) administering a breast cancer treatment to the subject,

wherein if the biological sample is classified as a basal-like subtype, the subject is administered a breast cancer treatment comprising gemcitabine and wherein if the biological sample is not a basal-like subtype, the subject is administered a breast cancer treatment not comprising gemcitabine, thereby treating breast cancer in the subject.

16. The method of claim 15, wherein the breast cancer is primary breast cancer.

17. The method of claim 15, wherein the breast cancer is locally advanced or metastatic breast cancer.

18. The method of claim 15, wherein assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype is performed using

immunohistochemistry (IHC) or fluorescence in situ hybridization.

19. The method of claim 15, wherein assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype is performed by detecting at least 40 of the intrinsic genes listed in Table 1.

20. The method of claim 15, wherein the breast cancer treatment comprising gemcitabine further comprises one or more anti-cancer agents selected from the group consisting of anthracycline, cyclophosphamide, fluorouracil (or 5-fluorouracil or 5-FU), methotrexate, thiotepa, carboplatin, cisplatin, taxanes, paclitaxel, protein-bound paclitaxel, docetaxel, vinorelbine, tamoxifen, raloxifene, toremifene, fulvestrant, irinotecan, ixabepilone, temozolmide, topotecan, vincristine, vinblastine, eribulin, mutamycin, capecitabine, capecitabine, anastrozole, exemestane, letrozole, leuprolide, abarelix, buserlin, goserelin, megestrol acetate, risedronate, pamidronate, ibandronate, alendronate, denosumab, zoledronate, trastuzumab, tykerb or bevacizumab, or combinations thereof.

21. The method of claim 15, wherein the breast cancer treatment comprising gemcitabine further comprises one or more taxanes.

22. The method of claim 15, wherein the taxanes are selected from the group consisting of docetaxel and paclitaxel.

23. The method of claim 15, wherein the breast cancer treatment not comprising gemcitabine further comprises one or more anti-cancer agents selected from the group consisting of anthracycline, cyclophosphamide, fluorouracil (or 5-fluorouracil or 5-FU), methotrexate, thiotepa, carboplatin, cisplatin, taxanes, paclitaxel, protein-bound paclitaxel, docetaxel, vinorelbine, tamoxifen, raloxifene, toremifene, fulvestrant, irinotecan, ixabepilone, temozolmide, topotecan, vincristine, vinblastine, eribulin, mutamycin, capecitabine, capecitabine, anastrozole, exemestane, letrozole, leuprolide, abarelix, buserlin, goserelin, megestrol acetate, risedronate, pamidronate, ibandronate, alendronate, denosumab, zoledronate, trastuzumab, tykerb or bevacizumab, or combinations thereof.

24. The method of claim 15, wherein the breast cancer treatment not comprising gemcitabine comprises one or more comprises one or more taxanes.

25. The method of claim 24, wherein the taxanes are selected from the group consisting of docetaxel and paclitaxel.

26. The method of claim 15, wherein the biological sample is selected from the group consisting of a cell, tissue and bodily fluid.

27. The method of claim 26, wherein the tissue is obtained from a biopsy.

28. The method of claim 26, wherein the bodily fluid is selected from the group consisting of blood, lymph, urine, saliva and nipple aspirate.

29. The method of claim 15, wherein the biological sample is a formalin-fixed, paraffin- embedded sample.

30. A method of screening for the likelihood of the effectiveness of a breast cancer treatment comprising an gemcitabine in a subject in need thereof comprising:

(a) providing a biological sample from the subject; and

(b) assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype;

wherein if the biological sample is classified as a basal-like subtype, the breast cancer treatment comprising the gemcitabine is more likely to be effective in the subject.

31. The method of claim 30, wherein the breast cancer is primary breast cancer.

32. The method of claim 30, wherein the breast cancer is locally advanced or metastatic breast cancer.

33. The method of claim 30, wherein assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype is performed using

immunohistochemistry (IHC) or fluorescence in situ hybridization.

34. The method of claim 30, wherein assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype is performed by detecting at least 40 of the intrinsic genes listed in Table 1.

35. The method of claim 30, wherein the breast cancer treatment comprising gemcitabine further comprises one or more anti-cancer agents selected from the group consisting of anthracycline, cyclophosphamide, fluorouracil (or 5-fluorouracil or 5-FU), methotrexate, thiotepa, carboplatin, cisplatin, taxanes, paclitaxel, protein-bound paclitaxel, docetaxel, vinorelbine, tamoxifen, raloxifene, toremifene, fulvestrant, irinotecan, ixabepilone, temozolmide, topotecan, vincristine, vinblastine, eribulin, mutamycin, capecitabine, capecitabine, anastrozole, exemestane, letrozole, leuprolide, abarelix, buserlin, goserelin, megestrol acetate, risedronate, pamidronate, ibandronate, alendronate, denosumab, zoledronate, trastuzumab, tykerb or bevacizumab, or combinations thereof.

36. The method of claim 30, wherein the breast cancer treatment comprising gemcitabine comprises one or more comprises one or more taxanes.

37. The method of claim 36, wherein the taxanes are selected from the group consisting of docetaxel and paclitaxel.

38. The method of claim 30, wherein the biological sample is selected from the group consisting of a cell, tissue and bodily fluid.

39. The method of claim 38, wherein the tissue is obtained from a biopsy.

40. The method of claim 38, wherein the bodily fluid is selected from the group consisting of blood, lymph, urine, saliva and nipple aspirate.

41. The method of claim 30, wherein the biological sample is a formalin-fixed, paraffin- embedded sample.

42. A kit for determining an intrinsic subtype of breast cancer comprising reagents sufficient for the detection of at least 40 of the intrinsic genes listed in Table 1.

43. The kit of claim 42, wherein the reagents sufficient for the detection of the intrinsic genes listed in Table 1 comprise a microarray.

44. A method of screening for the likelihood of the effectiveness of a breast cancer treatment comprising gemcitabine in a subject in need thereof comprising: (a) providing a biological sample from the subject; and

(b) assaying the biological sample to determine whether the biological sample is classified as a HER-2 enriched subtype;

wherein if the biological sample is classified as a HER-2 enriched subtype, the breast cancer treatment comprising the gemcitabine is more likely to be detrimental in the subject.

45. A method of predicting overall survival in a subject having breast cancer comprising:

(a) providing a biological sample from the subject; and

(b) assaying the biological sample to determine an intrinsic breast cancer subtype, the subtype selected from the group consisting of luminal A, luminal B, basal- like, and HER-2 enriched subtypes;

wherein the intrinsic subtype is determined using a measurement of at least 40 of the genes listed in Table 1, wherein a determination of luminal A and luminal B subtypes indicates a longer overall survival and a determination of HER2-enriched or basal-like subtype indicates a shorter overall survival.

46. The method of claim 1 wherein the intrinsic subtype is determined using at least 45 of the genes listed in Table 1.

Description:
METHODS OF TREATING BREAST CANCER WITH GEMCITABINE THERAPY

CROSS -REFERENCE TO RELATED APPLICATIONS

[01] This application claims priority to, and the benefit of, U.S. Provisional Application No. 61/666,355, filed June 29, 2012 and U.S. Provisional Application No. 61/733,545, filed

December 5, 2012. The contents of each of these applications are incorporated herein by reference in their entireties.

FIELD OF THE INVENTION

[02] This disclosure relates generally to the field of cancer biology, and specifically, to the fields of detection and identification of specific cancer cell phenotypes and correlation with appropriate therapies.

BACKGROUND OF THE INVENTION

[03] Therapy including the nucleoside analog, gemcitabine, has proven to be effective against many types of tumors. However, the sid effects associated with gemcitabine therapy, including neutropenia, anemia, liver and kidney changes, flu-like symptoms, loss of appetite, hair loss, shortness of breath, fatigue, loss of appetite, nausea and vomiting are severe.

Alternative therapies with less severe side effects are known. Thus, there is a need in the art to determine types of cancer that respond best to gemcitabine based therapy and which types of cancer would be better to treat with non-gemcitabine based therapy. The present invention addresses these needs.

SUMMARY OF THE INVENTION

[04] In one embodiment, this invention provides a method of predicting progression free survival in a subject having metastatic breast cancer comprising (a) providing a biological sample from the subject; and (b) assaying the biological sample to determine an intrinsic breast cancer subtype, the subtype selected from the group consisting of luminal A, luminal B, basal-like, and HER-2 enriched subtypes; wherein the intrinsic subtype is determined using a measurement of at least 40 of the genes listed in Table 1 and wherein the intrinsic subtype is used to predict progression free survival in said subject independent of the treatment that the subject has received or will receive. A determination of luminal A and B subtypes indicates a longer disease progression free survival time period and a determination of HER2-enriched or basal-like subtype indicates a shorter disease progression free survival time period. The assaying of the biological sample to determine whether intrinsic subtype is performed by detecting at least 10, at least 15, at least 20, at least 25, at least 40, 41, 42, 43, 44, 45, 46 47, 48, 49 or all 50 of the intrinsic genes listed in Table 1. In a preferred embodiment the intrinsic subtype is determined using at least 45 of the genes listed in Table 1.

[05] The present invention also provides a method of predicting overall survival in a subject having breast cancer comprising, (a) providing a biological sample from the subject; and (b) assaying the biological sample to determine an intrinsic breast cancer subtype, the subtype selected from the group consisting of luminal A, luminal B, basal-like, and HER-2 enriched subtypes; wherein the intrinsic subtype is determined using a measurement of at least 40 of the genes listed in Table 1, wherein a determination of luminal A and luminal B subtypes indicates a longer overall survival and a determination of HER2-enriched or basal- like subtype indicates a shorter overall survival. The assaying of the biological sample to determine whether intrinsic subtype is performed by detecting at least 10, at least 15, at least 20, at least 25, at least 40, 41, 42, 43, 44, 45, 46 47, 48, 49 or all 50 of the intrinsic genes listed in Table 1. In a preferred embodiment the intrinsic subtype is determined using at least 45 of the genes listed in Table 1.

[06] The present invention also provides a method of predicting overall survival in a subject having breast cancer. This method includes the steps of providing a biological sample from the subject; assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype; wherein if the biological sample is classified as a basal-like subtype, a breast cancer treatment comprising gemcitabine is more likely to prolong overall survival of the subject. The breast cancer can be primary breast cancer, locally advanced breast cancer or metastatic breast cancer.

[07] The assaying of the biological sample to determine whether the biological sample is classified as a basal-like subtype is performed using RNA expression profiling. The assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype is performed by detecting at least 10, at least 15, at least 20, at least 25, at least 40, 41, 42, 43, 44, 45, 46 47, 48, 49 or all 50 of the intrinsic genes listed in Table 1. Preferably, detection is of all 50 of the intrinsic genes listed in Table 1. The expression of the members of the intrinsic gene list of Table lean be determined using a nanoreporter and the nanoreporter code system (nCounter® Analysis system).

[08] The breast cancer treatment that includes gemcitabine can also include anthracycline, cyclophosphamide, fluorouracil (or 5-fluorouracil or 5-FU), methotrexate, thiotepa, carboplatin, cisplatin, taxanes, paclitaxel, protein-bound paclitaxel, docetaxel, vinorelbine, tamoxifen, raloxifene, toremifene, fulvestrant, irinotecan, ixabepilone, temozolmide, topotecan, vincristine, vinblastine, eribulin, mutamycin, capecitabine, capecitabine, anastrozole, exemestane, letrozole, leuprolide, abarelix, buserlin, goserelin, megestrol acetate, risedronate, pamidronate, ibandronate, alendronate, denosumab, zoledronate, trastuzumab, tykerb or bevacizumab, or combinations thereof. Preferably, the treatment that includes gemcitabine also includes one or more taxanes. Preferably, the taxanes are paclitaxel or docetaxel. The breast cancer treatment not comprising an gemcitabine includes

anthracycline, cyclophosphamide, fluorouracil (or 5-fluorouracil or 5-FU), methotrexate, thiotepa, carboplatin, cisplatin, taxanes, paclitaxel, protein-bound paclitaxel, docetaxel, vinorelbine, tamoxifen, raloxifene, toremifene, fulvestrant, irinotecan, ixabepilone, temozolmide, topotecan, vincristine, vinblastine, eribulin, mutamycin, capecitabine, capecitabine, anastrozole, exemestane, letrozole, leuprolide, abarelix, buserlin, goserelin, megestrol acetate, risedronate, pamidronate, ibandronate, alendronate, denosumab, zoledronate, trastuzumab, tykerb or bevacizumab, or combinations thereof. Preferably, the treatment that does not include gemcitabine includes one or more taxanes. Preferably, the taxanes are paclitaxel or docetaxel.

[09] The biological sample can be a cell, a tissue or a bodily fluid. The tissue can be sampled from a biopsy or smear. The sample can also be a sampling of bodily fluids. These bodily fluids can include blood, lymph, urine, saliva, nipple aspirates and gynecological fluids. The biological sample can be a formalin-fixed, paraffin-embedded sample.

[10] The present invention provides a method of treating breast cancer in a subject in need thereof. This method includes the steps of providing a biological sample from the subject; assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype; and administering a breast cancer treatment to the subject. If the biological sample is classified as a basal-like subtype, the subject is administered a breast cancer treatment including gemcitabine. If the biological sample is not a basal-like subtype, the subject is administered a breast cancer treatment without gemcitabine. The breast cancer can be primary breast cancer, locally advanced breast cancer or metastatic breast cancer.

[11] The present invention also provides a method of treating breast cancer in a subject in need thereof comprising requesting a test providing the results of analysis determining whether a biological sample from the subject is classified as a basal-like subtype, and administering a breast cancer treatment including gemcitabine if the sample from the patient is classified as a basal-like subtype, or administering a breast cancer treatment without gemcitabine if the sample from the patient is classified as not a basal-like subtype. The breast cancer can be primary breast cancer, locally advanced breast cancer or metastatic breast cancer.

[12] The assaying of the biological sample to determine whether the biological sample is classified as a basal-like subtype is performed using RNA expression profiling. The assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype is performed by detecting at least 10, at least 15, at least 20, at least 25, at least 40, 41, 42, 43, 44, 45, 46 47, 48, 49 or all 50 of the intrinsic genes listed in Table 1. Preferably, detection is of all 50 of the intrinsic genes listed in Table 1. The expression of the members of the intrinsic gene list of Table 1 can be determined using a nanoreporter and the nanoreporter code system (nCounter® Analysis system).

[13] The breast cancer treatment that includes gemcitabine can also include anthracycline, cyclophosphamide, fluorouracil (or 5-fluorouracil or 5-FU), methotrexate, thiotepa, carboplatin, cisplatin, taxanes, paclitaxel, protein-bound paclitaxel, docetaxel, vinorelbine, tamoxifen, raloxifene, toremifene, fulvestrant, irinotecan, ixabepilone, temozolmide, topotecan, vincristine, vinblastine, eribulin, mutamycin, capecitabine, capecitabine, anastrozole, exemestane, letrozole, leuprolide, abarelix, buserlin, goserelin, megestrol acetate, risedronate, pamidronate, ibandronate, alendronate, denosumab, zoledronate, trastuzumab, tykerb or bevacizumab, or combinations thereof. Preferably, the treatment that includes gemcitabine also includes one or more taxanes. Preferably, the taxanes are paclitaxel or docetaxel. The breast cancer treatment not comprising an gemcitabine includes

anthracycline, cyclophosphamide, fluorouracil (or 5-fluorouracil or 5-FU), methotrexate, thiotepa, carboplatin, cisplatin, taxanes, paclitaxel, protein-bound paclitaxel, docetaxel, vinorelbine, tamoxifen, raloxifene, toremifene, fulvestrant, irinotecan, ixabepilone, temozolmide, topotecan, vincristine, vinblastine, eribulin, mutamycin, capecitabine, capecitabine, anastrozole, exemestane, letrozole, leuprolide, abarelix, buserlin, goserelin, megestrol acetate, risedronate, pamidronate, ibandronate, alendronate, denosumab, zoledronate, trastuzumab, tykerb or bevacizumab, or combinations thereof. Preferably, the treatment that does not include gemcitabine includes one or more taxanes. Preferably, the taxanes are paclitaxel or docetaxel.

[14] The biological sample can be a cell, a tissue or a bodily fluid. The tissue can be sampled from a biopsy or smear. The sample can also be a sampling of bodily fluids. These bodily fluids can include blood, lymph, urine, saliva, nipple aspirates and gynecological fluids. The biological sample can be a formalin-fixed, paraffin-embedded sample. [15] The present invention also provides a method of screening for the likelihood of the effectiveness of a breast cancer treatment including gemcitabine in a subject in need thereof. This method includes the steps of providing a biological sample from the subject and assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype. If the biological sample is classified as a basal-like subtype, the breast cancer treatment including the gemcitabine is more likely to be effective in the subject. The breast cancer can be primary breast cancer, locally advanced breast cancer or metastatic breast cancer.

[16] The assaying of the biological sample to determine whether the biological sample is classified as a basal-like subtype is performed using RNA expression profiling. The assaying the biological sample to determine whether the biological sample is classified as a basal-like subtype is performed by detecting at least 10, at least 15, at least 20, at least 25, at least 40, 41, 42, 43, 44, 45, 46 47, 48, 49 or all 50 of the intrinsic genes listed in Table 1. Preferably, detection is of all 50 of the intrinsic genes listed in Table 1. The expression of the members of the intrinsic gene list of Table 1 can be determined using and nanoreporter and the nanoreporter code system (nCounter® Analysis system).

[17] The breast cancer treatment that includes gemcitabine can also include anthracycline, cyclophosphamide, fluorouracil (or 5-fluorouracil or 5-FU), methotrexate, thiotepa, carboplatin, cisplatin, taxanes, paclitaxel, protein-bound paclitaxel, docetaxel, vinorelbine, tamoxifen, raloxifene, toremifene, fulvestrant, irinotecan, ixabepilone, temozolmide, topotecan, vincristine, vinblastine, eribulin, mutamycin, capecitabine, capecitabine, anastrozole, exemestane, letrozole, leuprolide, abarelix, buserlin, goserelin, megestrol acetate, risedronate, pamidronate, ibandronate, alendronate, denosumab, zoledronate, trastuzumab, tykerb or bevacizumab, or combinations thereof. Preferably, the treatment that includes gemcitabine also includes one or more anti-cancer taxanes. More preferably, the taxanes are paclitaxel or docetaxel.

[18] The biological sample can be a cell, a tissue or a bodily fluid. The tissues can be sampled from a tumor biopsy or surgical specimen. The sample can also be a sampling of bodily fluids. These bodily fluids can include blood, lymph, urine, saliva and nipple aspirates. The biological sample can be a formalin-fixed, paraffin-embedded sample.

[19] The present invention also provides a kit for screening for the likelihood of the effectiveness of a breast cancer treatment including reagents sufficient for the detection of at least 10, at least 15, at least 20, at least 25, at least 40, 41, 42, 43, 44, 45, 46 47, 48, 49 or all 50 of the intrinsic genes and sufficient to determine a basal-like subtype. Preferably, the kit includes reagents sufficient for the detection of all 50 of the intrinsic genes listed in Table 1. The reagent sufficient for the detection of the at least 10, at least 15, at least 20, at least 25, at least 40, 41, 42, 43, 44, 45, 46 47, 48, 49 or all 50 of the intrinsic genes listed in Table 1 can include a microarray. Preferably, the reagents include a reporter probe and capture probe for the detection of at least 10, at least 15, at least 20, at least 25, at least 40, 41, 42, 43, 44, 45, 46 47, 48, 49 or all 50 of the intrinsic genes listed in Table 1. Preferably, there is only one reporter probe/capture probe pair for any one gene of Table 1 to be detected. Preferably, the kit includes instructions for utilizing the reagents and for performing any of the methods provided in the instant invention. Preferably, the instructions are for screening for the likelihood of the effectiveness of a breast cancer treatment.

[20] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. In the specification, the singular forms also include the plural unless the context clearly dictates otherwise. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. The references cited herein are not admitted to be prior art to the claimed invention. In the case of conflict, the present specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and are not intended to be limiting. Other features and advantages of the invention will be apparent from the following detailed description and claim

BRIEF DESCRIPTION OF THE DRAWINGS

[21] Figure 1 is a schematic showing a CONSORT Diagram of study design.

[22] Figure 2A is a line graph showing a Kaplan-Meier (K-M) curve of time to progression (TTP) according to intrinsic biological subtype identified using the PAM50 intrinsic genes.

[23] Figure 2B is a line graph showing a K-M curve of overall survival (OS) according to intrinsic biological subtype identified using the PAM50 intrinsic genes.

[24] Figure 3 A shows a Forest plot of hazard ratios (HR) with 95% CIs for time to progression for preselected prognostic factors. D, docetaxel; GD, gemcitabine plus docetaxel; * Estimates in model with PAM50 subtypes: Luminal A, Luminal B, Basal Like, and HER2 enriched.

[25] Figure 3B shows a Forest plot of hazard ratios (HR) with 95% CIs for overall survival for preselected prognostic factors. D, docetaxel; GD, gemcitabine plus docetaxel; * Estimates in model with PAM50 subtypes: Luminal A, Luminal B, Basal Like, and HER2 enriched.

[26] Figure 4 is a line graph showing a K-M curve of median overall survival (OS) of 10 months for the basal-like patients in the doublet (GD) arm compared to the monotherapy (D) arm.

DETAILED DESCRIPTION OF THE INVENTION

[27] The present invention provides a method of determining whether a breast cancer treatment comprising gemcitabine is optimal for administration to a patient suffering from breast cancer. Determining whether a breast cancer patient should receive a treatment including gemcitabine includes determining the subtype of the breast cancer using an intrinsic gene expression set and determining the basal-like subtype of the breast cancer by using immunohistochemistry (IHC). The disclosure also provides a method of treating breast cancer by determining whether a breast cancer patient should receive a treatment including gemcitabine and then administering the optimal breast cancer treatment to the patient based on that determination.

[28] Intrinsic genes are statistically selected to have low variation in expression between biological sample replicates from the same individual and high variation in expression across samples from different individuals. Thus, intrinsic genes are used as classifier genes for breast cancer classification. Although clinical information was not used to derive the breast cancer intrinsic subtypes, this classification has proved to have prognostic significance. Intrinsic gene screening can be used to classify breast cancers into various subtypes. The major intrinsic subtypes of breast cancer are referred to as Luminal A (LumA), Luminal B (LumB), HER2-enriched (Her-2-E), Basal-like, and Normal-like (Perou et al. Nature, 406(6797):747-52 (2000); Sorlie et al. PNAS, 98(19): 10869-74 (2001)).

[29] The PAM50 gene expression assay, as described herein, is able to identify intrinsic subtype from standard formalin fixed paraffin embedded tumor tissue (also see, Parker et al. J Clin Oncol., 27(8): 1160-7 (2009) and U.S. Patent Application Publication No.

2011/0145176). The methods utilize a supervised algorithm to classify subject samples according to breast cancer intrinsic subtype. This algorithm, referred to herein as the PAM50 classification model, is based on the gene expression profile of a defined subset of intrinsic genes that has been identified herein as superior for classifying breast cancer intrinsic subtypes. The subset of genes, along with primers specific for their detection, is provided in Table 1. The target specific probe sequences are merely representative and not meant to limit the invention. The skilled artisan can utilize any target sequence-specific probe for detecting any of (or each of) the genes in Table 1.

Table 1. PAM50 Intrinsic Gene List

CDC6 NM 001254 GTAAATCACCTTCT 10 A CTTG GGATATGTG A 60 GAGCCT ATAAGACC

CDCAi NM 031423 GGAGGCGGAAGAA 1 1 GGGGAAAGACAAAG 61

ACCAG TTTCCA

CDH3 BC041846 GACAAGGAGAATC 12 ACTGTCTGGGTCCAT 62

AAAAGATCAGC GGCTA

CENPF NM 016343 GTGGCAGCAGATC 13 GGATTTCGTGGTGGG 63

ACAA TTC

CEP55 AB091 43 CCTCAC GAATTGCT 14 CCACAGTCTGTGATA 64

GAACTT AACGG

CXXC5 BC006428 CATGAAATAGTGC 15 CCATCAACATTCTCT 65

ATAGTTTGCC TTATGAACG

EGFR NM 005228 ACACAGAATCTAT 16 ATCAACTCCCAAACG 66

ACCCACCAGAGT GTCAC

ERBB2 NM OO 1005862 GCTGGCTCTCACA 17 GCCCTTACACATCGG 67

CTGATAG AGAAC

ESRl NM 001122742 GCAGGGAGAGGAG 18 GACTTCAGGGTGCTG 68

TTTGT GAC

EXOl NM 130398 CCCATCCATGTGA 19 TGTGAAGCCAGCAA 69

G G A AGT AT A A TATGTATC

CTTCTTGGACCTTG TATTGGG AGGC AG G

FGFR4 AB209631 20 70

GCG AGGTTTA

FOXA1 NM 004496 GCTACTACGCAGA 21 CTGAGTTCATGTTGC 71

CACG TGACC

FOXC1 N 001453 GATGTTCGAGTCA 22 GACAGCTACTATTCC 72

CAGAGG CGTT

GP 160 AJ24924S TTCGGCTGGAAGG 23 TATGTGA GTAAGCTC 73

AACC GGAGAC

GRB7 NM 005 10 CGTGGCAGATGTG 24 AGTGGGCATCCCGTA 74

AACGA GA

HSPC150

NMJM4176 GGAGATCCGTCAA 25 AGTGGACATGCGAG 75 (UBE2T)

CTCCAAA TGGAG

KIF2C NM 006845 TGGGTCGTGTCAG 26 CACCGCTGGAAACT 76

GAAAC GAAC

KNTC2 NM 006101 CGCAGTCATCCAG 27 CGTGCACATCCATGA 77

AGATGTG CCTT

KRT14 BC042437 ACTCAGTACAAGA 28 GAGGAGATGACCTT 78

AAGAACCG GCC KRT17 AK095281 GTTGGACCAGTCA 29 GCCATAGCCACTGCC 79 ACATCTCTG ACT

KRT5 M213S9 TGTGGCTCATTAG 30 CTTCGACTGGACTCT 80

GCAAC GT

MAPT NM OO 1123066 GACTCCAAGCGCG 31 CAGACATGTTGGTAT 81

AAAAC TGCACATT

MDM2 M92424 CCAACAAAATATT 32 AGGCGATCCTGGGA 82

CATGGTTCTTG AATTAT

MEL NM O 14791 CCAGTAGCATTGT 33 CCCATTTGTCTGTCT 83

CCGAG TCAC

MIA BG765502 GTCTCTGGTAATGC 34 CTGATGGTTGAGGCT 84

ACACT GTT

MKI67 NM 002417 GTGGAATGCCTGC 35 CGCACTCCAGCACCT 85

TGACC AGAC

MLPH NM 024101 AGGGGTGCCCTCT 36 TCACAGGGTCAAACT 86

GAGAT TCCAGT

MMP11 NM 005940 CGAGATCGCCAAG 37 GATGGTAGAGTTCCA 87

ATGTT GTGATT

MYBL2 BX647151 AGGCGAACACACA 38 TCTGGTCACGCAGGG 88

ACGTC CAA

MYC NM 002467 AGCCTCGAACAAT 39 ACACAGATGATGGA 89

TGAAGA GATGTC

ATCGACTGTGTAA

NAT1 BCO 13732 40 AGTAGCTACATCTCC 90

ACAACTAGAGAAG AGGTTCTCTG A

ORC6L NM O 14321 TTTAAGAGGGCAA 41 C GGATTTTATC A AC G 91

TGGAAGG ATGCAG

PGR NM 00926 TGCCGCAGAACTC 42 CATTTCJCCGTCCTTC 92

ACTTG ATCG

PHGDH AK093306 CCTCAGATGATGC 43 GCAGGTCAAAACTCT 93

CTATCCA CAAAG

PTTGl BE904476 CAGCAAGCGATGG 44 AGCGGGCTTCTGTAA 94

CATAGT TCTGA

RRM2 A 123010 AATGCCACCGAAG 45 GCCTCAGATTTCAAC 95

CCTC TCGT

SFRP1 BC036503 TCGAACTGAAGGC 46 CTGCTGAGAATCAA 96

TATTTACGAG AGTGGGA

SLC39A6 NM 012 19 GTCGAAGCCGCAA 47 GGAACAAACTGCTCT 97

TTAGG GCCA TMEM45

A 098106 CAAACGTGTGTTCT 48 ACAGCTCTTTAGCAT 98 B GGAAGG TTGTGGA

TYMS BQ056428 TGCCCTGTATGATG 49 GGGA CTATC AATGTT 99

TCAGGA GGGTTCTC

UBE2C BC032677 GTGAGGGGTGTCA 50 CACACAGTTCACTGC 100

GCTCAGT TCCACA

[30] Table 2 provides select sequences for the PAM50 genes of Table 1.

[31] Table 2

CAACCCAGACTTTATGGAGTCCATCTCAGATGTTGTTGATGAAGTAATACAGAACTGCCC CATCGATGTG CGGCGCCCGCTGTATAAGCCCGAGTTCTTTCAGGTCTGCCACACCAAGAAGGACTATGAA GAGTACGGGC CCAGCATCTGCCGCCACAACCCCGTCTTTGGAGTCATGTCCTAGTGTCTGCCTGAACGCG TCGTTCGATG GTGTCACGTTGGGGAACAAGTGTCCTTCAGAACCCAGAGAAGGCCGCCGTTCTGTAAATA GCGACGTCGG TGTTGCTGCCCAGCAGCGTGCTTGCATTGCCGGTGCATGAGGCGCGGCGCGGGCCCTTCA GTAAAAGCCA TTTATCCGTGTGCCGACCGCTGTCTGCCAGCCTCCTCCTTCTCCCGCCCTCCTCACCCTC GCTCTCCCTC CTCCTCCTCCTCCGAGCTGCTAGCTGACAAATACAATTCTGAAGGAATCCAAATGTGACT TTGAAAATTG TTAGAGAAAACAACATTAGAAAATGGCGCAAAATCGTTAGGTCCCAGGAGAGAATGTGGG GGCGCAAACC CTTTTCCTCCCAGCCTATTTTTGTAAATAAAATGTTTAAACTTGAAATACAAATCGATGT TTATATTTCC TATCATTTTGTATTTTATGGTATTTGGTACAACTGGCTGATACTAAGCACGAATAGATAT TGATGTTATG GAGTGCTGTAATCCAAAGTTTTTAATTGTGAGGCATGTTCTGATATGTTTATAGGCAAAC AAATAAAACA GCAAACTTTTTTGCCACATGTTTGCTAGAAAATGATTATACTTTATTGGAGTGACATGAA GTTTGAACAC TAAACAGTAATGTATGAGAATTACTACAGATACATGTATCTTTTAGTTTTTTTTGTTTGA ACTTTCTGGA GCTGTTTTATAGAAGATGATGGTTTGTTGTCGGTGAGTGTTGGATGAAATACTTCCTTGC ACCATTGTAA TAAAAGCTGTTAGAATATTTGTAAATATC

NM_018685 CTCGGCGCTGAAATTCAAATTTGAACGGCTGCAGAGGCCGAGTCCGTCACTGGAAGCCGA GAGGAGAGGA 103

CAGCTGGTTGTGGGAGAGTTCCCCCGCCTCAGACTCCTGGTTTTTTCCAGGAGACACACT GAGCTGAGAC TCACTTTTCTCTTCCTGAATTTGAACCACCGTTTCCATCGTCTCGTAGTCCGACGCCTGG GGCGATGGAT CCGTTTACGGAGAAACTGCTGGAGCGAACCCGTGCCAGGCGAGAGAATCTTCAGAGAAAA ATGGCTGAGA GGCCCACAGCAGCTCCAAGGTCTATGACTCATGCTAAGCGAGCTAGACAGCCACTTTCAG AAGCAAGTAA CCAGCAGCCCCTCTCTGGTGGTGAAGAGAAATCTTGTACAAAACCATCGCCAT CAAAAAAACGCTGTTCT GACAAC AC T GAAG TAGAAG TTTCTAACTT GGAAAAT AAACAACCAG T T GAG TC GACAT C T GCAAAAT C T T GTTCTCCAAGTCCTGTGTCTCCTCAGGTGCAGCCACAAGCAGCAGATACCATCAGTGATT CTGTTGCTGT CCCGGCATCACTGCTGGGCATGAGGAGAGGGCTGAACTCAAGATTGGAAGCAACTGCAGC CTCCTCAGTT AAAACACGTATGCAAAAACTTGCAGAGCAACGGCGCCGTTGGGATAATGATGATATGACA GATGACATTC CTGAAAGCTCACTCTTCTCACCAATGCCATCAGAGGAAAAGGCTGCTTCCCCTCCCAGAC CTCTGCTTTC AAATGCCTCGGCAACTCCAGTTGGCAGAAGGGGCCGTCTGGCCAATCTTGCTGCAACTAT TTGCTCCTGG GAAGATGATGTAAATCACTCATTTGCAAAACAAAACAGTGTACAAGAACAGCCTGGTACC GCTTGTTTAT CCAAATTTTCCTCTGCAAGTGGAGCATCTGCTAGGATCAATAGCAGCAGTGTTAAGCAGG AAGCTACATT CTGTTCCCAAAGGGATGGCGATGCCTCTTTGAATAAAGCCCTATCCTCAAGTGCTGATGA TGCGTCTTTG GTTAATGCCTCAATTTCCAGCTCTGTGAAAGCTACTTCTCCAGTGAAATCTACTACATCT ATCACTGATG CTAAAAGTTGTGAGGGACAAAATCCTGAGCTACTTCCAAAAACTCCTATTAGTCCTCTGA AAACGGGGGT ATCGAAACCAATTGTGAAGTCAACTTTATCCCAGACAGTTCCATCCAAGGGAGAATTAAG TAGAGAAATT TGTCTGCAATCTCAATCTAAAGACAAATCTACGACACCAGGAGGAACAGGAATTAAGCCT TTCCTGGAAC GCTTTGGAGAGCGTTGTCAAGAACATAGCAAAGAAAGTCCAGCTCGTAGCACACCCCACA GAACCCCCAT TATTACTCCAAATACAAAGGCCATCCAAGAAAGATTATTCAAGCAAGACACATCTTCATC TACTACCCAT TTAGCACAACAGCTCAAGCAGGAACGTCAAAAAGAACTAGCATGTCTTCGTGGCCGATTT GACAAGGGCA AT AT AT GGAGT GC AGAAAAAGGCGGAAAC T CAAAAAGCAAACAAC T AGAAACC AAACAGGAAAC T C AC T G TCAGAGCACTCCCCTCAAAAAACACCAAGGTGTTTCAAAAACTCAGTCACTTCCAGTAAC AGAAAAGGTG ACCGAAAACCAGATACCAGCCAAAAATTCTAGTACAGAACCTAAAGGTTTCACTGAATGC GAAATGACGA AATCTAGCCCTTTGAAAATAACATTGTTTTTAGAAGAGGACAAATCCTTAAAAGTAACAT CAGACCCAAA GGTTGAGCAGAAAATTGAAGTGATACGTGAAATTGAGATGAGTGTGGATGATGATGATAT CAATAGTTCG AAAGTAATTAATGACCTCTTCAGTGATGTCCTAGAGGAAGGTGAACTAGATATGGAGAAG AGCCAAGAGG AGAT GGAT CAAGC AT T AGC AGAAAGC AGCGAAGAACAGGAAGAT GC AC T GAAT AT C T CC T CAATGTC T T T ACTTGCACCATTGGCACAAACAGTTGGTGTGGTAAGTCCAGAGAGTTTAGTGTCCACACC TAGACTGGAA TTGAAAGACACCAGCAGAAGTGATGAAAGTCCAAAACCAGGAAAATTCCAAAGAACTCGT GTCCCTCGAG CTGAATCTGGTGATAGCCTTGGTTCTGAAGATCGTGATCTTCTTTACAGCATTGATGCAT ATAGATCTCA AAGATTCAAAGAAACAGAACGTCCATCAATAAAGCAGGT GAT TGTTCGGAAGGAAGATGT TACT TCAAAA CTGGATGAAAAAAATAATGCCTTTCCTTGTCAAGTTAATATCAAACAGAAAATGCAGGAA CTCAATAACG AAATAAATATGCAACAGACAGTGATCTATCAAGCTAGCCAGGCTCTTAACTGCTGTGTTG ATGAAGAACA T GGAAAAGGGT CC C T AGAAGAAGC T GAAGCAGAAAGAC TTCTTCTAATTGCAACT GGGAAGAGAAC AC T T TTGATTGATGAATTGAATAAATTGAAGAACGAAGGACCTCAGAGGAAGAATAAGGCTAGT CCCCAAAGTG AATTTATGCCATCCAAAGGATCAGTTACTTTGTCAGAAATCCGCTTGCCTCTAAAAGCAG ATTTTGTCTG CAGTACGGTTCAGAAACCAGATGCAGCAAATTACTATTACTTAATTATACTAAAAGCAGG AGCTGAAAAT ATGGTAGCCACAC CAT TAGCAAGTACTTCAAACTCTCTTAACGGTGATGCTCT GACAT TCACTACTACAT TTACTCTGCAAGATGTATCCAATGACTTTGAAATAAATATTGAAGTTTACAGCTTGGTGC AAAAGAAAGA TCCCTCAGGCCTTGATAAGAAGAAAAAAACATCCAAGTCCAAGGCTATTACTCCAAAGCG ACTCCTCACA TCTATAACCACAAAAAGCAACATTCATTCTTCAGTCATGGCCAGTCCAGGAGGTCTTAGT GCTGTGCGAA CCAGCAACTTCGCCCTTGTTGGATCTTACACATTATCATTGTCTTCAGTAGGAAATACTA AGTTTGTTCT GGACAAGGTCCCCTTTTTATCTTCTTTGGAAGGTCATATTTATTTAAAAATAAAATGTCA AGTGAATTCC AGTGTTGAAGAAAGAGGTTTTCTAACCATATTTGAAGATGTTAGTGGTTTTGGTGCCTGG CATCGAAGAT GGTGTGTTCTTTCTGGAAACTGTATATCTTATTGGACTTATCCAGATGATGAGAAACGCA AGAATCCCAT AGGAAGGATAAATCTGGCTAATTGTACCAGTCGTCAGATAGAACCAGCCAACAGAGAATT TTGTGCAAGA CGCAACACTTTTGAATTAATTACTGTCCGACCACAAAGAGAAGATGACCGAGAGACTCTT GTCAGCCAAT GCAGGGACACACTCTGTGTTACCAAGAACTGGCTGTCTGCAGATACTAAAGAAGAGCGGG ATCTCTGGAT GCAAAAACTCAATCAAGTTCTTGTTGATATTCGCCTCTGGCAACCTGATGCTTGCTACAA ACCTATTGGA AAGCCTTAAACCGGGAAATTTCCATGCTATCTAGAGGTTTTTGATGTCATCTTAAGAAAC ACACTTAAGA GCATCAGATTTACTGATTGCATTTTATGCTTTAAGTACGAAAGGGTTTGTGCCAATATTC ACTACGTATT ATGCAGTATTTATATCTTTTGTATGTAAAACTTTAACTGATTTCTGTCATTCATCAATGA GTAGAAGTAA ATACATTATAGTTGATTTTGCTAAATCTTAATTTAAAAGCCTCATTTTCCTAGAAATCTA ATTATTCAGT TATTCATGACAATATTTTTTTAAAAGTAAGAAATTCTGAGTTGTCTTCTTGGAGCTGTAG GTCTTGAAGC AGCAACGTCTTTCAGGGGTTGGAGACAGAAACCCATTCTCCAATCTCAGTAGTTTTTTCG AAAGGCTGTG ATCATTTATTGATCGTGATATGACTTGTTACTAGGGTACTGAAAAAAATGTCTAAGGCCT TTACAGAAAC ATTTTTAGTAATGAGGATGAGAACTTTTTCAAATAGCAAATATATATTGGCTTAAAGCAT GAGGCTGTCT TCAGAAAAGTGATGTGGACATAGGAGGCAATGTGTGAGACTTGGGGGTTCAATATTTTAT ATAGAAGAGT TAATAAGCACATGGTTTACATTTACTCAGCTACTATATATGCAGTGTGGTGCACATTTTC ACAGAATTCT GGCTTCATTAAGATCATTATTTTTGCTGCGTAGCTTACAGACTTAGCATATTAGTTTTTT CTACTCCTAC AAGTGTAAATTGAAAAATCTTTATATTAAAAAAGTAAACTGTTATGAAGCTGCTATGTAC TAATAATACT TTGCTTGCCAAAGTGTTTGGGTTTTGTTGTTGTTTGTTTGTTTGTTTGTTTTTGGTTCAT GAACAACAGT GTCTAGAAACCCATTTTGAAAGTGGAAAATTATTAAGTCACCTATCACCTTTAAACGCCT TTTTTTAAAA TTATAAAATATTGTAAAGCAGGGTCTCAACTTTTAAATACACTTTGAACTTCTTCTCTGA ATTATTAAAG TTCTTTATGACCTCATTTATAAACACTAAATTCTGTCACCTCCTGTCATTTTATTTTTTA TTCATTCAAA TGTATTTTTTCTTGTGCATATTATAAAAATATATTTTATGAGCTCTTACTCAAATAAATA CCTGTAAATG T C T AAAGGAAAAAAAAAAAAAAAAAA

NM_004323 AGGCCGGGGCGGGGCTGGGAAGTAGTCGGGCGGGGTTGTGAGACGCCGCGCTCAGCTTCC ATCGCTGGGC 104

GGTCAACAAGTGCGGGCCTGGCTCAGCGCGGGGGGGCGCGGAGACCGCGAGGCGACCGGG AGCGGCTGGG TTCCCGGCTGCGCGCCCTTCGGCCAGGCCGGGAGCCGCGCCAGTCGGAGCCCCCGGCCCA GCGTGGTCCG CCTCCCTCTCGGCGTCCACCTGCCCGGAGTACTGCCAGCGGGCATGACCGACCCACCAGG GGCGCCGCCG CCGGCGCTCGCAGGCCGCGGATGAAGAAGAAAACCCGGCGCCGCTCGACCCGGAGCGAGG AGTTGACCCG GAGCGAGGAGTTGACCCTGAGTGAGGAAGCGACCTGGAGTGAAGAGGCGACCCAGAGTGA GGAGGCGACC CAGGGCGAAGAGATGAATCGGAGCCAGGAGGTGACCCGGGACGAGGAGTCGACCCGGAGC GAGGAGGTGA CCAGGGAGGAAATGGCGGCAGCTGGGCTCACCGTGACTGTCACCCACAGCAATGAGAAGC ACGACCTTCA TGTTACCTCCCAGCAGGGCAGCAGTGAACCAGTTGTCCAAGACCTGGCCCAGGTTGTTGA AGAGGTCATA GGGGTTCCACAGTCTTTTCAGAAACTCATATTTAAGGGAAAATCTCTGAAGGAAATGGAA ACACCGTTGT CAGCACTTGGAATACAAGATGGTTGCCGGGTCATGTTAATTGGGAAAAAGAACAGTCCAC AGGAAGAGGT T GAAC T AAAGAAG T T GAAACAT T T GGAGAAGT C T GT GGAGAAGAT AGC TGACCAGCT GGAAGAGT T GAAT AAAGAGCTTACTGGAATCCAGCAGGGTTTTCTGCCCAAGGATTTGCAAGCTGAAGCTCTC TGCAAACTTG ATAGGAGAGTAAAAGCCACAATAGAGCAGTTTATGAAGATCTTGGAGGAGATTGACACAC TGATCCTGCC AGAAAATTTCAAAGACAGTAGATTGAAAAGGAAAGGCTTGGTAAAAAAGGTTCAGGCATT CCTAGCCGAG TGTGACACAGTGGAGCAGAACATCTGCCAGGAGACTGAGCGGCTGCAGTCTACAAACTTT GCCCTGGCCG AGTGAGGTGTAGCAGAAAAAGGCTGTGCTGCCCTGAAGAATGGCGCCACCAGCTCTGCCG TCTCTGGAGC GGAATTTACCTGATTTCTTCAGGGCTGCTGGGGGCAACTGGCCATTTGCCAATTTTCCTA CTCTCACACT GGTTCTCAATGAAAAATAGTGTCTTTGTGATTTTGAGTAAAGCTCCTATCTGTTTTCTCC TTCTGTCTCT GTGGTTGTACTGTCCAGCAATCCACCTTTTCTGGAGAGGGCCACCTCTGCCCAAATTTTC CCAGCTGTTT GGACCTCTGGGTGCTTTCTTTGGGCTGGTGAGAGCTCTAATTTGCCTTGGGCCAGTTTCA GGTTTATAGG CCCCCTCAGTCTTCAGATACATGAGGGCTTCTTTGCTCTTGTGATCGTGTAGTCCCATAG CTGTAAAACC AGAATCACCAGGAGGTTGCACCTAGTCAGGAATATTGGGAATGGCCTAGAACAAGGTGTT TGGCACATAA GTAGACCACTTATCCCTCATTGTGACCTAATTCCAGAGCATCTGGCTGGGTTGTTGGGTT CTAGACTTTG TCCTCACCTCCCAGTGACCCTGACTAGCCACAGGCCATGAGATACCAGGGGGCCGTTCCT TGGATGGAGC CTGTGGTTGATGCAAGGCTTCCTTGTCCCCAAGCAAGTCTTCAGAAGGTTAGAACCCAGT GTTGACTGAG TCTGTGCTTGAAACCAGGCCAGAGCCATGGATTAGGAAGGGCAAAGAGAAGGCACCAGAA TGAGTAAAGC AGGCAGGTGGTGAAGCCAACCATAAACTTCTCAGGAGTGACATGTGCTTCCTTCAAAGGC ATTTTTGTTA ACCATATCCTTCTGAGTTCTATGTTTCCTTCACAGCTGTTCTATCCATTTTGTGGACTGT CCCCCACCCC CACCCCATCATTGTTTTTAAAAAATTAAGGCCTGGCGCAGCAGCTCATGCCTATAATCCC AGCACTTTGG GAGGCTGAGGCGGGCGGATCACTTGAGGCCAGGAGTTTGAGACCAGCCCAGGCAACATAG CAAAACCCCA TTCTGCTTTAAAAAAAAAAAAAAAAAAAATTAGCTTGGCGTAGTGGCATGTGCCTATAAT CCCAGCTACT GGGGAGGCTGAGGCACAAGAATCATTTGAACCTGGGAGGTAGAGGTTGCTGTGAGCCGAG ATTACGCCCC TGCACTCCAGCCTGGGTCACAGAGTGAGACTCCATCTCAGAAAAAAAAAAAATTGAGTCA GGTGCAGTAG CTCCTTCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCTAGAGGATCACTTGAGCCCAGG AGTTTGAGTC TAGTCTGGGCAACATAGCAAGACCCCATCTCTAAAATTTAAGTAAGTAAAAGTAGATAAA TAAAAAGAAA AAAAAACTGTTTATGTGCTCATCATAAAGTAGAAGAGTGGTTTGCTTTTTTTTTTTTTTT TGGATTAATG AGGAAATCATTCTGTGGCTCTAGTCATAATTTATGCTTAATAACATTGATAGTAGCCCTT TGCGCTATAA CTCTACCTAAAGACTCACATCATTTGGCAGAGAGAGAGTCGTTGAAGTCCCAGGAATTCA GGACTGGGCA GGTTAAGACCTCAGACAAGGTAGTAGAGGTAGACTTGTGGACAAGGCTCGGGTCCCAGCC CACCGCACCC CAACTTTAATCAGAGTGGTTCACTATTGATCTATTTTTGTGTGATAGCTGTGTGGCGTGG GCCACAACAT TTAATGAGAAGTTACTGTGCACCAAACTGCCGAACACCATTCTAAACTATTCATATATAT TAGTCATTTA ATTCTTACATAACTTGAGAGGTAGACAGATATCCTTATTTTAGAGATGAGGAAACCAAGA GAACTTAGGT CATTAGCGCAAGGTTGTAGAGTAAGCGGCAAAGCCAAGACACAAAGCTGGGTGGTTTGGT TTCAGAGCCA GTGCTTTTCCCCTCTACTGTACTGCCTCTCAACCAACACAGGGTTGCACAGGCCCATTCT CTGATTTTTT TCCTCTTGTCCTCTGCCTCTCCCTCTAGCTCCCACTTCCTCTCTGCTCTAGTTCATTTTC TTTAGAGCAG CCCGAGTGATCATGAAGTGCAAATCTTGCCATGTCAGTCCCCTGCTTAGAACCCTCCAAT GGCTCACTTT CTCTTTAGGCAAAAGTCTTTACCCCATGCCTTCTCCCATCTCATCTCAACCCCCTCATTT GTTGGCTGTC TGCTGTCAGCCACTCTTCTTTCAGGTCCTCAGATGCACTGCACCCTCTCCTGCCTGGGGG TCTTTGCTCC TGCTACTACCTCTGCTTGAACAGCTCCTCACCTTCCTTCCTCCAACCCTACCCTTGTATA GGTGACTTTT GTTCATCCTTCAGAATTCAACTCACATGTCTCTTGCATGGAGAACCCTCACCTACTGTGT TGAGACCCTG TCCAGCCCCCAGGTGGGATCCTCTCTCGACTTCCCATACATTTCTTTCACAGCATTTACA TAGTCCATGA TAGTT TACT TGTGGGAT TAT TTGGTTAATCTTTGCCTTTAACACCAGGGTTCCTTGGGTGAAGGAGCTTC TTTATCTTGGTAACAGCATTATTTCAAGCATAACTTGTAATATAGTTATATTACATATAT AACATATATA TATATAACATAACATATATAACATATATAACAAGCATAACTTGTTATATAGTCTTGTATA TAGTAAGACC T CAAT AAAT AT T T GGAGAACAAAAAAAAAAAAAAA

NM_000633 TTTCTGTGAAGCAGAAGTCTGGGAATCGATCTGGAAATCCTCCTAATTTTTACTCCCTCT CCCCGCGACT 105

CCTGATTCATTGGGAAGTTTCAAATCAGCTATAACTGGAGAGTGCTGAAGATTGATGGGA TCGTTGCCTT ATGCATTTGTTTTGGTT T T ACAAAAAGGAAAC T T GACAGAGGAT CATGCTGTACT T AAAAAAT ACAACAT CACAGAGGAAGTAGACTGATATTAACAATACTTACTAATAATAACGTGCCTCATGAAATA AAGATCCGAA AGGAATTGGAATAAAAATTTCCTGCATCTCATGCCAAGGGGGAAACACCAGAATCAAGTG TTCCGCGTGA TTGAAGACACCCCCTCGTCCAAGAATGCAAAGCACATCCAATAAAATAGCTGGATTATAA CTCCTCTTCT TTCTCTGGGGGCCGTGGGGTGGGAGCTGGGGCGAGAGGTGCCGTTGGCCCCCGTTGCTTT TCCTCTGGGA AGGATGGCGCACGCTGGGAGAACAGGGTACGATAACCGGGAGATAGTGATGAAGTACATC CATTATAAGC TGTCGCAGAGGGGCTACGAGTGGGATGCGGGAGATGTGGGCGCCGCGCCCCCGGGGGCCG CCCCCGCACC GGGCATCTTCTCCTCCCAGCCCGGGCACACGCCCCATCCAGCCGCATCCCGGGACCCGGT CGCCAGGACC TCGCCGCTGCAGACCCCGGCTGCCCCCGGCGCCGCCGCGGGGCCTGCGCTCAGCCCGGTG CCACCTGTGG TCCACCTGACCCTCCGCCAGGCCGGCGACGACTTCTCCCGCCGCTACCGCCGCGACTTCG CCGAGATGTC CAGCCAGCTGCACCTGACGCCCTTCACCGCGCGGGGACGCTTTGCCACGGTGGTGGAGGA GCTCTTCAGG GACGGGGTGAACTGGGGGAGGATTGTGGCCTTCTTTGAGTTCGGTGGGGTCATGTGTGTG GAGAGCGTCA ACCGGGAGATGTCGCCCCTGGTGGACAACATCGCCCTGTGGATGACTGAGTACCTGAACC GGCACCTGCA CACCTGGATCCAGGATAACGGAGGCTGGGATGCCTTTGTGGAACTGTACGGCCCCAGCAT GCGGCCTCTG TTTGATTTCTCCTGGCTGTCTCTGAAGACTCTGCTCAGTTTGGCCCTGGTGGGAGCTTGC ATCACCCTGG GTGCCTATCTGGGCCACAAGTGAAGTCAACATGCCTGCCCCAAACAAATATGCAAAAGGT TCACTAAAGC AGTAGAAATAATATGCATTGTCAGTGATGTACCATGAAACAAAGCTGCAGGCTGTTTAAG AAAAAATAAC ACACATATAAACATCACACACACAGACAGACACACACACACACAACAATTAACAGTCTTC AGGCAAAACG TCGAATCAGCTATTTACTGCCAAAGGGAAATATCATTTATTTTTTACATTATTAAGAAAA AAAGATTTAT TTATTTAAGACAGTCCCATCAAAACTCCTGTCTTTGGAAATCCGACCACTAATTGCCAAG CACCGCTTCG TGTGGCTCCACCTGGATGTTCTGTGCCTGTAAACATAGATTCGCTTTCCATGTTGTTGGC CGGATCACCA TCTGAAGAGCAGACGGATGGAAAAAGGACCTGATCATTGGGGAAGCTGGCTTTCTGGCTG CTGGAGGCTG GGGAGAAGGTGTTCATTCACTTGCATTTCTTTGCCCTGGGGGCTGTGATATTAACAGAGG GAGGGTTCCT GTGGGGGGAAGTCCATGCCTCCCTGGCCTGAAGAAGAGACTCTTTGCATATGACTCACAT GATGCATACC TGGTGGGAGGAAAAGAGTTGGGAACTTCAGATGGACCTAGTACCCACTGAGATTTCCACG CCGAAGGACA GCGATGGGAAAAATGCCCTTAAATCATAGGAAAGTATTTTTTTAAGCTACCAATTGTGCC GAGAAAAGCA TTTTAGCAATTTATACAATATCATCCAGTACCTTAAGCCCTGATTGTGTATATTCATATA TTTTGGATAC GCACCCCCCAACTCCCAATACTGGCTCTGTCTGAGTAAGAAACAGAATCCTCTGGAACTT GAGGAAGTGA ACATTTCGGTGACTTCCGCATCAGGAAGGCTAGAGTTACCCAGAGCATCAGGCCGCCACA AGTGCCTGCT TTTAGGAGACCGAAGTCCGCAGAACCTGCCTGTGTCCCAGCTTGGAGGCCTGGTCCTGGA ACTGAGCCGG GGCCCTCACTGGCCTCCTCCAGGGATGATCAACAGGGCAGTGTGGTCTCCGAATGTCTGG AAGCTGATGG AGCTCAGAATTCCACTGTCAAGAAAGAGCAGTAGAGGGGTGTGGCTGGGCCTGTCACCCT GGGGCCCTCC AGGTAGGCCCGTTTTCACGTGGAGCATGGGAGCCACGACCCTTCTTAAGACATGTATCAC TGTAGAGGGA AGGAACAGAGGCCCTGGGCCCTTCCTATCAGAAGGACATGGTGAAGGCTGGGAACGTGAG GAGAGGCAAT GGCCACGGCCCATTTTGGCTGTAGCACATGGCACGTTGGCTGTGTGGCCTTGGCCCACCT GTGAGTTTAA AGCAAGGCTTTAAATGACTTTGGAGAGGGTCACAAATCCTAAAAGAAGCATTGAAGTGAG GTGTCATGGA TTAATTGACCCCTGTCTATGGAATTACATGTAAAACATTATCTTGTCACTGTAGTTTGGT TTTATTTGAA AACCTGACAAAAAAAAAGTTCCAGGTGTGGAATATGGGGGTTATCTGTACATCCTGGGGC ATTAAAAAAA AAATCAATGGTGGGGAACTATAAAGAAGTAACAAAAGAAGTGACATCTTCAGCAAATAAA CTAGGAAATT TTTTTTTCTTCCAGTTTAGAATCAGCCTTGAAACATTGATGGAATAACTCTGTGGCATTA TTGCATTATA TACCATTTATCTGTATTAACTTTGGAATGTACTCTGTTCAATGTTTAATGCTGTGGTTGA TATTTCGAAA GCTGCTTTAAAAAAATACATGCATCTCAGCGTTTTTTTGTTTTTAATTGTATTTAGTTAT GGCCTATACA CTATTTGTGAGCAAAGGTGATCGTTTTCTGTTTGAGATTTTTATCTCTTGATTCTTCAAA AGCATTCTGA GAAGGTGAGATAAGCCCTGAGTCTCAGCTACCTAAGAAAAACCTGGATGTCACTGGCCAC TGAGGAGCTT TGTTTCAACCAAGTCATGTGCATTTCCACGTCAACAGAATTGTTTATTGTGACAGTTATA TCTGTTGTCC CTTTGACCTTGTTTCTTGAAGGTTTCCTCGTCCCTGGGCAATTCCGCATTTAATTCATGG TATTCAGGAT TACATGCATGTTTGGTTAAACCCATGAGATTCATTCAGTTAAAAATCCAGATGGCAAATG ACCAGCAGAT TCAAATCTATGGTGGTTTGACCTTTAGAGAGTTGCTTTACGTGGCCTGTTTCAACACAGA CCCACCCAGA GCCCTCCTGCCCTCCTTCCGCGGGGGCTTTCTCATGGCTGTCCTTCAGGGTCTTCCTGAA ATGCAGTGGT GCTTACGCTCCACCAAGAAAGCAGGAAACCTGTGGTATGAAGCCAGACCTCCCCGGCGGG CCTCAGGGAA CAGAATGATCAGACCTTTGAATGATTCTAATTTTTAAGCAAAATATTATTTTATGAAAGG TTTACATTGT CAAAGTGATGAATATGGAATATCCAATCCTGTGCTGCTATCCTGCCAAAATCATTTTAAT GGAGTCAGTT TGCAGTATGCTCCACGTGGTAAGATCCTCCAAGCTGCTTTAGAAGTAACAATGAAGAACG TGGACGTTTT TAATATAAAGCCTGTTTTGTCTTTTGTTGTTGTTCAAACGGGATTCACAGAGTATTTGAA AAATGTATAT ATATTAAGAGGTCACGGGGGCTAATTGCTGGCTGGCTGCCTTTTGCTGTGGGGTTTTGTT ACCTGGTTTT AATAACAGTAAATGTGCCCAGCCTCTTGGCCCCAGAACTGTACAGTATTGTGGCTGCACT TGCTCTAAGA GTAGTTGATGTTGCATTTTCCTTATTGTTAAAAACATGTTAGAAGCAATGAATGTATATA AAAGCCTCAA CTAGTCATTTTTTTCTCCTCTTCTTTTTTTTCATTATATCTAATTATTTTGCAGTTGGGC AACAGAGAAC CATCCCTATTTTGTATTGAAGAGGGATTCACATCTGCATCTTAACTGCTCTTTATGAATG AAAAAACAGT CCTCTGTATGTACTCCTCTTTACACTGGCCAGGGTCAGAGTTAAATAGAGTATATGCACT TTCCAAATTG GGGACAAGGGCTCTAAAAAAAGCCCCAAAAGGAGAAGAACATCTGAGAACCTCCTCGGCC CTCCCAGTCC CTCGCTGCACAAATACTCCGCAAGAGAGGCCAGAATGACAGCTGACAGGGTCTATGGCCA TCGGGTCGTC TCCGAAGATTTGGCAGGGGCAGAAAACTCTGGCAGGCTTAAGATTTGGAATAAAGTCACA GAATTAAGGA AGCACCTCAATTTAGTTCAAACAAGACGCCAACATTCTCTCCACAGCTCACTTACCTCTC TGTGTTCAGA TGTGGCCTTCCATTTATATGTGATCTTTGTTTTATTAGTAAATGCTTATCATCTAAAGAT GTAGCTCTGG CCCAGT GGGAAAAAT TAGGAAGTGAT TATAAAT CGAGAGGAGT TAT AATAATCAAGAT T AAATGTAAATA ATCAGGGCAATCCCAACACATGTCTAGCTTTCACCTCCAGGATCTATTGAGTGAACAGAA TTGCAAATAG TCTCTATTTGTAATTGAACTTATCCTAAAACAAATAGTT TATAAAT GTGAACTTAAACTCTAATTAATTC CAACTGTACTTTTAAGGCAGTGGCTGTTTTTAGACTTTCTTATCACTTATAGTTAGTAAT GTACACCTAC TCTATCAGAGAAAAACAGGAAAGGCTCGAAATACAAGCCATTCTAAGGAAATTAGGGAGT CAGTTGAAAT TCTATTCTGATCTTATTCTGTGGTGTCTTTTGCAGCCCAGACAAATGTGGTTACACACTT TTTAAGAAAT ACAATTCTACATTGTCAAGCTTATGAAGGTTCCAATCAGATCTTTATTGTTATTCAATTT GGATCTTTCA GGGAT iiiiiiii TAAAT T AT TATGGGACAAAGGACAT T TGT TGGAGGGGT GGGAGGGAGGAAGAAT T T T TAAATGTAAAACATTCCCAAGTTTGGATCAGGGAGTTGGAAGTTTTCAGAATAACCAGAA CTAAGGGTAT GAAGGACCTGTATTGGGGTCGATGTGATGCCTCTGCGAAGAACCTTGTGTGACAAATGAG AAACATTTTG AAGTTTGTGGTACGACCTTTAGATTCCAGAGACATCAGCATGGCTCAAAGTGCAGCTCCG TTTGGCAGTG CAATGGTATAAATTTCAAGCTGGATATGTCTAATGGGTATTTAAACAATAAATGTGCAGT TTTAACTAAC AGGATATTTAATGACAACCTTCTGGTTGGTAGGGACATCTGTTTCTAAATGTTTATTATG TACAATACAG AAAAAAATTTTATAAAATTAAGCAATGTGAAACTGAATTGGAGAGTGATAATACAAGTCC TTTAGTCTTA CCCAGT GAATCATTCTGTTCCATGTCTTTGGACAACCATGACCTTGGACAATCATGAAATATGCAT CTCA CTGGATGCAAAGAAAATCAGATGGAGCATGAATGGTACTGTACCGGTTCATCTGGACTGC CCCAGAAAAA T AAC T T CAAGCAAACAT CC T AT CAAC AACAAGG TTGTTCTGCATAC CAAGC TGAGCACAGAAGAT GGGAA CACTGGTGGAGGATGGAAAGGCTCGCTCAATCAAGAAAATTCTGAGACTATTAATAAATA AGACTGTAGT GTAGATACTGAGTAAATCCATGCACCTAAACCTTTTGGAAAATCTGCCGTGGGCCCTCCA GATAGCTCAT TTCATTAAGTTTTTCCCTCCAAGGTAGAATTTGCAAGAGTGACAGTGGATTGCATTTCTT TTGGGGAAGC TTTCTTTTGGTGGTTTTGTTTATTATACCTTCTTAAGTTTTCAACCAAGGTTTGCTTTTG TTTTGAGTTA CTGGGGTTATTTTTGTTTTAAATAAAAATAAGTGTACAATAAGTGTTTTTGTATTGAAAG CTTTTGTTAT CAAGATTTTCATACTTTTACCTTCCATGGCTCTTTTTAAGATTGATACTTTTAAGAGGTG GCTGATATTC TGCAACACTGTACACATAAAAAATACGGTAAGGATACTTTACATGGTTAAGGTAAAGTAA GTCTCCAGTT GGCCACCATTAGCTATAATGGCACTTTGTTTGTGTTGTTGGAAAAAGTCACATTGCCATT AAACTTTCCT TGTCTGTCTAGTTAATATTGTGAAGAAAAATAAAGTACAGTGTGAGATACTG

NM_00101227 CCCAGAAGGCCGCGGGGGGTGGACCGCCTAAGAGGGCGTGCGCTCCCGACATGCCCCGCG GCGCGCCATT 106 1 AACCGCCAGATTTGAATCGCGGGACCCGTTGGCAGAGGTGGCGGCGGCGGCATGGGTGCC CCGACGTTGC

CCCCTGCCTGGCAGCCCTTTCTCAAGGACCACCGCATCTCTACATTCAAGAACTGGCCCT TCTTGGAGGG CTGCGCCTGCACCCCGGAGCGGATGGCCGAGGCTGGCTTCATCCACTGCCCCACTGAGAA CGAGCCAGAC TTGGCCCAGTGTTTCTTCTGCTTCAAGGAGCTGGAAGGCTGGGAGCCAGATGACGACCCC ATTGGGCCGG GCACGGTGGCTTACGCCTGTAATACCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGA GAGAGGAACA TAAAAAGCATTCGTCCGGTTGCGCTTTCCTTTCTGTCAAGAAGCAGTTTGAAGAATTAAC CCTTGGTGAA T T T T T GAAAC T GGACAGAGAAAGAGC CAAGAAC AAAAT T GCAAAGGAAACC AACAAT AAGAAGAAAGAAT TTGAGGAAACTGCGGAGAAAGTGCGCCGTGCCATCGAGCAGCTGGCTGCCATGGATTGAG GCCTCTGGCC GGAGCTGCCTGGTCCCAGAGTGGCTGCACCACTTCCAGGGTTTATTCCCTGGTGCCACCA GCCTTCCTGT GGGCCCCTTAGCAATGTCTTAGGAAAGGAGATCAACATTTTCAAATTAGATGTTTCAACT GTGCTCTTGT TTTGTCTTGAAAGTGGCACCAGAGGTGCTTCTGCCTGTGCAGCGGGTGCTGCTGGTAACA GTGGCTGCTT CTCTCTCTCTCTCTCTTTTTTGGGGGCTCATTTTTGCTGTTTTGATTCCCGGGCTTACCA GGTGAGAAGT GAGGGAGGAAGAAGGCAGTGTCCCTTTTGCTAGAGCTGACAGCTTTGTTCGCGTGGGCAG AGCCTTCCAC AGTGAATGTGTCTGGACCTCATGTTGTTGAGGCTGTCACAGTCCTGAGTGTGGACTTGGC AGGTGCCTGT TGAATCTGAGCTGCAGGTTCCTTATCTGTCACACCTGTGCCTCCTCAGAGGACAGTTTTT TTGTTGTTGT GTTTTTTTGTTTTTTTTTTTTTGGTAGATGCATGACTTGTGTGTGATGAGAGAATGGAGA CAGAGTCCCT GGC T CC T C T AC T G T T T AAC AACAT GGC T T T C T T AT T T T G T T T GAAT T GT T AAT T CACAGAAT AGCACAAA CTACAATTAAAACTAAGCACAAAGCCATTCTAAGTCATTGGGGAAACGGGGTGAACTTCA GGTGGATGAG GAGACAGAATAGAGTGATAGGAAGCGTCTGGCAGATACTCCTTTTGCCACTGCTGTGTGA TTAGACAGGC CCAGTGAGCCGCGGGGCACATGCTGGCCGCTCCTCCCTCAGAAAAAGGCAGTGGCCTAAA TCCTTTTTAA ATGACTTGGCTCGATGCTGTGGGGGACTGGCTGGGCTGCTGCAGGCCGTGTGTCTGTCAG CCCAACCTTC ACATCTGTCACGTTCTCCACACGGGGGAGAGACGCAGTCCGCCCAGGTCCCCGCTTTCTT TGGAGGCAGC AGCTCCCGCAGGGCTGAAGTCTGGCGTAAGATGATGGATTTGATTCGCCCTCCTCCCTGT CATAGAGCTG CAGGGTGGATTGTTACAGCTTCGCTGGAAACCTCTGGAGGTCATCTCGGCTGTTCCTGAG AAATAAAAAG CCTGTCATTTCAAACACTGCTGTGGACCCTACTGGGTTTTTAAAATATTGTCAGTTTTTC ATCGTCGTCC CTAGCCTGCCAACAGCCATCTGCCCAGACAGCCGCAGTGAGGATGAGCGTCCTGGCAGAG ACGCAGTTGT CTCTGGGCGCTTGCCAGAGCCACGAACCCCAGACCTGTTTGTATCATCCGGGCTCCTTCC GGGCAGAAAC AACTGAAAATGCACTTCAGACCCACTTATTTCTGCCACATCTGAGTCGGCCTGAGATAGA CTTTTCCCTC TAAACTGGGAGAATATCACAGTGGTTTTTGTTAGCAGAAAATGCACTCCAGCCTCTGTAC TCATCTAAGC TGCTTATTTTTGATATTTGTGTCAGTCTGTAAATGGATACTTCACTTTAATAACTGTTGC TTAGTAATTG GCTTTGTAGAGAAGCTGGAAAAAAATGGTTTTGTCTTCAACTCCTTTGCATGCCAGGCGG TGATGTGGAT CTCGGCTTCTGTGAGCCTGTGCTGTGGGCAGGGCTGAGCTGGAGCCGCCCCTCTCAGCCC GCCTGCCACG GCCTTTCCTTAAAGGCCATCCTTAAAACCAGACCCTCATGGCTACCAGCACCTGAAAGCT TCCTCGACAT CTGTTAATAAAGCCGTAGGCCCTTGTCTAAGTGCAACCGCCTAGACTTTCTTTCAGATAC ATGTCCACAT GTCCATTTTTCAGGTTCTCTAAGTTGGAGTGGAGTCTGGGAAGGGTTGTGAATGAGGCTT CTGGGCTATG GGTGAGGTTCCAATGGCAGGTTAGAGCCCCTCGGGCCAACTGCCATCCTGGAAAGTAGAG ACAGCAGTGC CCGCTGCCCAGAAGAGACCAGCAAGCCAAACTGGAGCCCCCATTGCAGGCTGTCGCCATG TGGAAAGAGT AACTCACAATTGCCAATAAAGTCTCATGTGGTTTTATCTAAAAAAAAAAAAAAAAAAAAA AAAA AATGAGGGTATTTATAAAC TACT TAAATTATAAAAAGAATGAGACATCAGACTTACAGTTTTGGAT ACTA

ATTTTTTTCACTTAACGTTCATTATGTGATAGGAGTTTTCCATCCTATTATACCGCT GTGCGATCTGATC TTGGGCACGTTAACCAACCTCTTGTTGCCTCGATTTTCTCACCTGTAAAAGTGGGGGTAA TCATAATGCT TACT TAGTAGGAT AGCCCT GAAGAAT AAGTGAC T TAGCGAACATAAATAGC TT ACAATAGGGT T T T CAGC ATGGGAAGGATTCAGTAAATGTTAGCTGTCATCATCACCACCTACAAAGGAAGCAATACT GTGCTGAAAG TTTTTCCATCATTAATGTAATTTCTATAGTACGATTCCCAAGAAGATATTAAAATTATGG AAATAAAGGT ATTGGTATATTCCTAATTATTTCCTAAAAGATTGTATTGATAAATATGCTCATCCTTCCC TTAACGGGAT GCAT TCCAGAAAAACAAGT CAAATGT TAGACAAAGTATCAGAAGGGAAAT T CT GTAGCCAGAGAGC TAAA AATTACAATAGGGTCTCTAATTATACTTCAACTTTTTTAGGAATAATTCTCAGTGTGTTT TCCCACATTT CATATGTAATTTTTTTTTTTTTTTTTTTTTGAGACAGAGCCTCGCCCTGTCACCAGGCTG GAGTACAGTG GCGCGATCTCGGCTCACTGCAACTTCCACCTGCTGGGTTCAAGCAATTCTTCTGACCTCA GGTGATCCAC CCGCCTCGGCCTCCCAAAGTGCTGGGATTATAACAGGCGTGGCATGAGTCACCGCGCCCG GCCGATCTTT ACTTTTTTATTCTTTGTACCCCCTGCCTATCCAGTTAGCATGTGATTAAAGTCAAAGATT TGCCACTTTG GGCCACATCTATTAATTTTCATCTTTGTTATAATTGTATTTAGTTTTTGATCTACACTGC TTATTACTCC CAGTCATTTTTTATAGAACTGAAAATCTGGTAAAATACTCAAAATTGCACTGACTTCTAT GTAGAGGCGA CACTCCATCAGAACCGTGGGCTGACAGGGAATCCCACTGTGCAGGAGCTGCGCGCATTTT CATTTCTGAT TCTCTTTGGCGTATCCAGGACTCTGATGACATGATCATATATTTATCAGTAGTAACAGGT TGGGCCATTT GTTTTTTGTGGTAAATCATATATTTAAGATTTTAGAAATAAGTTGATAGCCATGTATTTT GGAATTTGAA AAAGACATTGCATTACTCAGCTTCAAATTAAGCTTTAATCAAATAGTGAAACTTTCCATT AATGGACAGT GTATACCTTTTTGTGTATTTAAAAAAAAAAACACTGAATATAGTGCCTTTGTGACAGGGG AGCTTGGTTC CTGACAATGTCCTCTTGAGCCTTTTTTTTTTTTTTGAGATGGAGTCTCACTGTGTCACCC AGGCTGGAGT GCAGTGGCGCCATCTTGGCTCACTGCAACCTCCGCCCCCTGGGTTCAAGTGATTCTCATT CCTCAGCTTC CTAAGTAGCTGGGATTACAGGCACGCACCACCATGACCAGCTAATTTTTATACTTTTAGT AGAGACAGGG TTTTGCCATGTTGGCTAGGTTGGTCTCGAACTCCTGACCTCAAGTAATCCACCCACCATG GCCTCCCCAA AGTGCTGGGATTACAGGCGTGAGCCATTTCACCCGGCCTCTCTTCCGTCTTTGAGCTGTG AGGAAATAGC T ACAT T ACAT GAGCT GCTAGATCTGCCTTATGGT CAGAAAT GAAGG T T GAACTC T CAGGAACAGT GACAT ATATACACACTGATATTTCCAAAGTACAATGCCCCAAATTGATCCACAAAGGAATTAAGG TCATTTGCAA CAAAATCACAGAATAGTAACAAATAAATAGAAGATAAATATGGCCAGGGATGCTGCAAAC TGATATACTG CCAAGTTTATCAGTTGGGAATCCCAACAGTGAAAAGCATAAAAATGAAAGGAATTTTAAG GAGACTTTTT ATAGAAGAGTGGGAAGGATTGGAGGAGCCAACAAGTGATGGTGAGGCACACAGGGAAGAG CTTCAGTGGG CACCATCCCCTCTCTGGTTTGAAGGGGTAGGGAGGGGACCAGAGCTGGGAGGAGGGGGCT GGAATACTGC TGGAGGAGCCACTCCCTTCCAGACCTGCTGTGGCCATCACAGAATGCAGCCACTGCCAGA GCAGCAGCCC GAGGAACCAGGCAGGGGGAGCACAAGTACCCTAGCCTCTCTCTTTCTGTTTCTTGCCTGC CGATCTCCTC CACTGGCTAAACCCAGCTGGATGCTAAGAGTACAGTCAGCCTGCCTGCTGAGGAGGGACC ACCAGGGACC ACCATCAGCAAGGGATCCAATGTCTTTCTGCCTCTGCAGAATGAAGGTTGGGGCGCGGGG GGCGCTCTAC TTCTTAGGGATATTGTGGGAATAAAAGGAAATAGGCAAAAAATGTTTTTGAAAAACAAAG CACATACTGC GCACCCGTGGGCCACTACTGCTTTTGACCCCTGGCTCTGTTTCATGAAGTAATGTCGTGT CATTCTCTTT TTAGGTGCTACAGGATTTCTTTAGGTTTGTTTTCTGTCCACCATATTTCAACTCATGTGT GCTGTTTGTT GTGCTAAAACAAATATTTGCTGATGCCTGAGTGAATAGTTGAATATTTTATATAAGTCAA ATTTATACGT AATGATTTTTCTTGTAACTTAGCCGTTTCTCTTTTACAAACTCAGAAAACCTCAGACTTT GAAAAGGCCT TGAAGTTCCTCACCTGAAATCTGAGAACTTGGAGCGCCTTAAAAAATCTAAAGGAAAACA AAACAGTGAA AGAACATGATATAGTCAGTGTAGAGAATAAAATTATTTATGTAATTAATATTGAGGATGC AGATAACACA TTGTGAAATCTTGCTTGTAAAAAATCTCGATCTGCTGAAGAAAGATGTTCTCTCTAGAGA TCTTTGAAAG CATAATTATTGAGCTTTTAAAATGTTAGAAACAAAAGTTAGACCCACACATATTCTGGCG TGTGGAAGAT TTGCATTCCTTCCCCTGCCCGCCCCGCCCCCACACTTGTGAGTTGTGCCTGTGTACGCAG TTCCTGTAGC ACTCGGCTGGGCAGAAATCATCTTTCAGCACTAAGGGAACATAGTTATGATCTGGACCTT CTGGGAGTGG TCAGTGCCCAAGAACAGGTATGGGACTCCAGAAAGTTCTGCTCTCAACCCTATTTTGAAA TAGAGTTACA CAT TGTTCTACAAT TAT TTGAGTTAATAAGCAGCTCTTTTCAAACGTGAT TAT GCCCTTCCAAGTT TAAA TACACTAGACTTTAGTGAAAGTAATTGACCTCATCTCATTTCTCTCCTGTTATATTAAGA TCACTTTCAG TAAAAGGTAGAAGCTTTTGAAGTGGTGAGGAGGAGGTAGAGGAGGGACATAGAGCAGATA GGGGCTGGAA AGTGGGGTGAGGAAGAGAGTGGCTTCTCTTTGGCAGAGTACCAAGGAAAAGCCCTATCTG TACAGAACCT TTGTGCCTGGGAACTTGATGGCTGCAACCTGAGCCTCAACCTAGTTTGCTTGCGGAGCCA GAAGAGAAGC TAAAAACCTTCAGTTAACCAAGCCAGACACCAAGAAAGTTAAACCGAAAGAGAACCCCCC ACCCCCCGCA AAAAAAAGAAGTAAAGTGGGTTAAAGTGATATCATGTTAGCACAGAAAGAGAACATAAGG GTCATCTAAG TTCATCTGCCCCCTCTTCTATTTCAAGGTGCAGAAACTAAGGCACAAGGGACCCCGTGTC CTGCTCTTGA TCACATAGCTAGTGGGTGCCAAGCCAGGTCTAGAACTCTGTTCTCTGGGGTCACAGGCTG GCTCTTCATC CCTCTAGAGAGATAGCTCATCTGTGTGCACCTGAGCCCGTTGTGTTTCGGAGTCAAAGCA AATAAAGGCT CAAACTCCAAGACTGTTTTGCAGACCGGCTGCAGTAGATATGGGGGGAGGAGAAACCTGC TTTAAATTGC TTCAAGCAAGTTGTTTCTGCAAAGGTGTTGACTTTTTTCTTTCAACTTTCTAGTGAGTCA CTGCAGCCTG AGCTGTTATTTGTCATTATGCAATAATTCAGGAACTAACTCAAGATTCTTCTTTTTAAAT TATTTGTTTA TTTAGAGACAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGTGTGATCTCGGCTC ACTGCAGCCT CTGCCTCCTGGGTTCAAGCAATTCTCATGTCTCAGCCTCCCGAATAGCTGGTATTGCAGG CTCGTGCCAC CACCCCCTGCTAATTTTTGTAATTTTAGTGGAGACACGGTTTCGCCATGTTGGCCGGGCT CGTCTTGAGC TCCTGGCCTCAGGTGATCCGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTGCAGCCGTGA GCCTCCACAC CCGGCCTATTTATTTATTTTTAAATTGGCTGCTCTTAGAAAGGCATACCATGTTTCTGGA TGGGAAGGCT TATTAATTCACCCTAATTTAATGTATAAATTTGATGCAATCATAGTCACAGTCCCAGTGG AATTTTTTAA CTTGGTAAGATGTTCTAAAATTAATGAGAGAACTTGAATTACCAGGTATTGAAACACTGT AAAGCCACAA TCATGTAAACAGTATGTTATAACCATGGGAATAGAGGTCTGTGATACAGCAGAAAAAAGT GAAAAAAAGA ATAACTGTATTCATAAAAATTTAAATGTGGAGTCACTGGGGGAAAGGATTAAATATTCGA TAATGTAGAA ACAACTCAACTATTTGGAGAAATGTAAATTTAGAGCCTTATCTCATGCCATATACCAAAA TACTATTTAG AT T T GAT T AAAAAAT AAAAAAAAAAAAAAAAAAA

NM_031966 CGAACGCCTTCGCGCGATCGCCCTGGAAACGCATTCTCTGCGACCGGCAGCCGCCAATGG GAAGGGAGTG 108

AGTGCCACGAACAGGCCAATAAGGAGGGAGCAGTGCGGGGTTTAAATCTGAGGCTAGGCT GGCTCTTCTC GGCGTGCTGCGGCGGAACGGCTGTTGGTTTCTGCTGGGTGTAGGTCCTTGGCTGGTCGGG CCTCCGGTGT TCTGCTTCTCCCCGCTGAGCTGCTGCCTGGTGAAGAGGAAGCCATGGCGCTCCGAGTCAC CAGGAACTCG AAAATTAATGCTGAAAATAAGGCGAAGATCAACATGGCAGGCGCAAAGCGCGTTCCTACG GCCCCTGCTG CAACCTCCAAGCCCGGACTGAGGCCAAGAACAGCTCTTGGGGACATTGGTAACAAAGTCA GTGAACAACT GCAGGC CAAAAT GCC T AT GAAGAAGGAAGCAAAACC T T C AGC T AC T GGAAAAG T CAT T GAT AAAAAAC T A CCAAAACCTCTTGAAAAGGTACCTATGCTGGTGCCAGTGCCAGTGTCTGAGCCAGTGCCA GAGCCAGAAC CTGAGCCAGAACCTGAGCCTGTTAAAGAAGAAAAACTTTCGCCTGAGCCTATTTTGGTTG ATACTGCCTC TCCAAGCCCAATGGAAACATCTGGATGTGCCCCTGCAGAAGAAGACCTGTGTCAGGCTTT CTCTGATGTA ATTCTTGCAGTAAATGATGTGGATGCAGAAGATGGAGCTGATCCAAACCTTTGTAGTGAA TATGTGAAAG ATATTTATGCTTATCTGAGACAACTTGAGGAAGAGCAAGCAGTCAGACCAAAATACCTAC TGGGTCGGGA AGTCACTGGAAACATGAGAGCCATCCTAATTGACTGGCTAGTACAGGTTCAAATGAAATT CAGGTTGTTG CAGGAGACCATGTACATGACTGTCTCCATTATTGATCGGTTCATGCAGAATAATTGTGTG CCCAAGAAGA TGCTGCAGCTGGTTGGTGTCACTGCCATGTTTATTGCAAGCAAATATGAAGAAATGTACC CTCCAGAAAT TGGTGACTTTGCTTT T GT GAC T GACAACAC T TAT AC T AAGCACCAAAT CAGAC AGAT GGAAAT GAAGAT T CTAAGAGCTTTAAACTTTGGTCTGGGTCGGCCTCTACCTTTGCACTTCCTTCGGAGAGCA TCTAAGATTG GAGAGGTTGATGTCGAGCAACATACTTTGGCCAAATACCTGATGGAACTAACTATGTTGG ACTATGACAT GGTGCACTTTCCTCCTTCTCAAATTGCAGCAGGAGCTTTTTGCTTAGCACTGAAAATTCT GGATAATGGT GAATGGACACCAACTCTACAACATTACCTGTCATATACTGAAGAATCTCTTCTTCCAGTT ATGCAGCACC TGGCTAAGAATGTAGTCATGGTAAATCAAGGACTTACAAAGCACATGACTGTCAAGAACA AGTATGCCAC ATCGAAGCATGCTAAGATCAGCACTCTACCACAGCTGAATTCTGCACTAGTTCAAGATTT AGCCAAGGCT GTGGCAAAGGTGTAACTTGTAAACTTGAGTTGGAGTACTATATTTACAAATAAAATTGGC ACCATGTGCC ATCTGTACATATTACTGTTGCATTTACTTTTAATAAAGCTTGTGGCCCCTTTTACTTTTT TATAGCTTAA CTAATTTGAATGTGGTTACTTCCTACTGTAGGGTAGCGGAAAAGTTGTCTTAAAAGGTAT GGTGGGGATA TTTTTAAAAACTCCTTTTGGTTTACCTGGGGATCCAATTGATGTATATGTTTATATACTG GGTTCTTGTT TTATATACCTGGCTTTTACTTTATTAATATGAGTTACTGAAGGTGATGGAGGTATTTGAA AATTTTACTT CCATAGGACATACTGCATGTAAGCCAAGTCATGGAGAATCTGCTGCATAGCTCTATTTTA AAGTAAAAGT CTACCACCGAATCCCTAGTCCCCCTGTTTTCTGTTTCTTCTTGTGATTGCTGCCATAATT CTAAGTTATT TACTTTTACCACTATTTAAGTTATCAACTTTAGCTAGTATCTTCAAACTTTCACTTTGAA AAATGAGAAT TTTATATTCTAAGCCAGTTTTCATTTTGGTTTTGTGTTTTGGTTAATAAAACAATACTCA AATACAAAAA AAAAAAA

BC035498 GCGGCCGCCAGCGCGGTGTAGGGGGCAGGCGCGGATCCCGCCACCGCCGCGCGCTCGGCC CGCCGACTCC 110

CGGCGCCGCCGCCGCCACTGCCGTCGCCGCCGCCGCCTGCCGGGACTGGAGCGCGCCGTC CGCCGCGGAC AAGACCCTGGCCTCAGGCCGGAGCAGCCCCATCATGCCGAGGGAGCGCAGGGAGCGGGAT GCGAAGGAGC GGGACACCATGAAGGAGGACGGCGGCGCGGAGTTCTCGGCTCGCTCCAGGAAGAGGAAGG CAAACGTGAC CGTTTTTTTGCAGGATCCAGATGAAGAAATGGCCAAAATCGACAGGACGGCGAGGGACCA GTGTGGGAGC CAGCCTTGGGACAATAATGCAGTCTGTGCAGACCCCTGCTCCCTGATCCCCACACCTGAC AAAGAAGATG ATGACCGGGTTTACCCAAACTCAACGTGCAAGCCTCGGATTATTGCACCATCCAGAGGCT CCCCGCTGCC TGTACTGAGCTGGGCAAATAGAGAGGAAGTCTGGAAAATCATGTTAAACAAGGAAAAGAC ATACTTAAGG GATCAGCACTTTCTTGAGCAACACCCTCTTCTGCAGCCAAAAATGCGAGCAATTCTTCTG GATTGGTTAA TGGAGGTGTGTGAAGTCTATAAACTTCACAGGGAGACCTTTTACTTGGCACAAGATTTCT TTGACCGGTA T AT GGC GACACAAGAAAAT GT T GT AAAAAC T C T T T T ACAGC T T AT T GGGAT T T CAT C T T T AT T T AT T GCA GCCAAACTTGAGGAAATCTATCCTCCAAAGTTGCACCAGTTTGCGTATGTGACAGATGGA GCTTGTTCAG GAGATGAAATTCTCACCATGGAATTAATGATTATGAAGGCCCTTAAGTGGCGTTTAAGTC CCCTGACTAT TGTGTCCTGGCTGAATGTATACATGCAGGTTGCATATCTAAATGACTTACATGAAGTGCT ACTGCCGCAG TATCCCCAGCAAATCTTTATACAGATTGCAGAGCTGTTGGATCTCTGTGTCCTGGATGTT GACTGCCTTG AATTTCCTTATGGTATACTTGCTGCTTCGGCCTTGTATCATTTCTCGTCATCTGAATTGA TGCAAAAGGT TTCAGGGTATCAGTGGTGCGACATAGAGAACTGTGTCAAGTGGATGGTTCCATTTGCCAT GGTTATAAGG GAGACGGGGAGCTCAAAACTGAAGCACTTCAGGGGCGTCGCTGATGAAGATGCACACAAC ATACAGACCC ACAGAGACAGC TTGGATTTGCT GGAC AAAGCCC GAGCAAAGAAAGC CATGTTGTCT GAACAAAAT AGGGC TTCTCCTCTCCCCAGTGGGCTCCTCACCCCGCCACAGAGCGGTAAGAAGCAGAGCAGCGG GCCGGAAATG GCGTGACCACCCCATCCTTCTCCACCAAAGACAGTTGCGCGCCTGCTCCACGTTCTCTTC TGTCTGTTGC AGCGGAGGCGTGCGTTTGCTTTTACAGATATCTGAATGGAAGAGTGTTTCTTCCACAACA GAAGTATTTC TGTGGATGGCATCAAACAGGGCAAAGTGTTTTTTATTGAATGCTTATAGGTTTTTTTTAA ATAAGTGGGT CAAGTACACCAGCCACCTCCAGACACCAGTGCGTGCTCCCGATGCTGCTATGGAAGGTGC TACTTGACCT AAGGGACTCCCACAACAACAAAAGCTTGAAGCTGTGGAGGGCCACGGTGGCGTGGCTCTC CTCGCAGGTG TTCTGGGCTCCGTTGTACCAAGTGGAGCAGGTGGTTGCGGGCAAGCGTTGTGCAGAGCCC ATAGCCAGCT GGGCAGGGGGCTGCCCTCTCCACATTATCAGTTGACAGTGTACAATGCCTTTGATGAACT GTTTTGTAAG TGCTGCTATATCTATCCATTTTTTAATAAAGATAATACTGTTTTTGAAAAAAAAAAAAAA AAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

BG256659 GAGGGCACGGGCTCCGTAGGCACCAACTGCAAGGACCCCTCCCCCTGCGGGCGCTCCCAT GGCACAGTTC 111

GCGTTCGAGAGTGACCTGCACTCGCTGCTTCAGCTGGATGCACCCATCCCCAATGCACCC CCTGCGCGCT GGCAGCGCAAAGCCAAGGAAGCCGCAGGCCCGGCCCCCTCACCCATGCGGGCCGCCAACC GATCCCACAG CGCCGGCAGGACTCCGGGCCGAACTCCTGGCAAATCCAGTTCCAAGGTTCAGACCACTCC TAGCAAACCT GGCGGTGACCGCTATATCCCCCATCGCAGTGCTGCCCAGATGGAGGTGGCCAGCTTCCTC CTGAGCAAGG AGAACCAGCCTGAAAACAGCCAGACGCCCACCAAGAAGGAACATCAGAAAGCCTGGGCTT TGAACCTGAA CGGTTTTGATGTAGAGGAAGCCAAGATCCTTCGGCTCAGTGGAAAAACCACAAAAATGCG CCAGAGGGTT ATCACGAACAGACTGAAAGTACTCTACAGCCAAAAGGCCACTCCTGGCTCCAGCCGGAAG ACCTGCCGTT TACATTCCTTCCCTGCCAAGACCGTATCCTGGATGCGCCTGAAATCGAATGACTATTAAC TGAACCTGTG GGACTGGCAGTCCGGGGAATGTCCGGGCCGGGCCACGGCCACGAGGTGTTCCGTGTGGAG TGCAAGCTGG GACACACCGTGCCGCTTGTGCACAGGGCCACGCGGGGAAATAATCCCGGGGCGCGCAAAG CGGCACTGGC GAGAGCCGCACGGGCCGGTGCTGGGGGTGGTACAACAGGCCAAAACAACACACAAGGCCA ACAAGACATA CGCGCGCTGACACCACGGTGCAAAGCGCTCAGACGAGTAGTAACCGGCACTGTGGTTGCT GCCTCCCCAC CTCTCCCGCTCTCAGCGTAAGATAAAAGAAAGAAGAGCAAAAAGCAAAGAAAGAAGACGA GACGAGACAC ACAGGAACGAACAGTAAAGCAAGCTAAAGCAAACGCAAGACCAGACAACAGAAATAGAAA GAACCAACAG AGAGGAGACAGAACAGGACGCCAGCAACATAGCAACAAACGAACAGAAGAGAGCACTAAA CAAAAGCAGC AGCAAGACGAGACAGGAGAGAAGGAGGAAGGAGGGCCGAGCGAGCAGGGAGCGCGAGCAG CGAGGCGAAG CAGCAGACAAGGGCAGGCGAAGGGCAACGAGAGGAGGCACCACACAAAAAGGAGAGGGGA CAGGAGAAGC AGCGAGAGAAGCGGAGGAGCAACAAGAGGAAGAAAAGGAGAGGGAGAGGAGGGAGAGAGC GGAAGGAGGA AGAAACAGCACGAGGCGACGAAGGGGGGAGACGCGGGGGCAGGAAAAGACACAGGAAGGC AGCGCGGAGG AGGAGAAGGGGAAGCAGGAAGGAGACGGAAGGAGAAGAGGGAGAGGACAGCGCAAGAGAG CGCGCGCGGC GACAGCGAGGGACGGAGCGAGAGAGAGGAAACGGAAAGCGAGAGGGAAGAGGAGAGGCAA CGCAGCGAAC CAACCGAAAACAGCAGAAAGAGAGGAGAAGGACGCGCAAAGAGGCAAGCGCAAGACGACA GGAAACGAAG CGAGAGACGAGAAGCCGGTGACGAGCAGGAGAAAGGGAAGGCAGGAGACAGGACAGGCGG AAGAGAGACA CGCGAGACGCAAAGAGTGAGCAGAACGAAGCGAAGAGCAACGCACGAGAGAAACGAC

NM_001254 GAGCGCGGCTGGAGTTTGCTGCTGCCGCTGTGCAGTTTGTTCAGGGGCTTGTGGTGGTGA GTCCGAGAGG 112

CTGCGTGTGAGAGACGTGAGAAGGATCCTGCACTGAGGAGGTGGAAAGAAGAGGATTGCT CGAGGAGGCC TGGGGTCTGTGAGGCAGCGGAGCTGGGTGAAGGCTGCGGGTTCCGGCGAGGCCTGAGCTG TGCTGTCGTC ATGCCTCAAACCCGATCCCAGGCACAGGCTACAATCAGTTTTCCAAAAAGGAAGCTGTCT CGGGCATTGA ACAAAGCTAAAAACTCCAGTGATGCCAAACTAGAACCAACAAATGTCCAAACCGTAACCT GTTCTCCTCG TGTAAAAGCCCTGCCTCTCAGCCCCAGGAAACGTCTGGGCGATGACAACCTATGCAACAC TCCCCATTTA CCTCCTTGTTCTCCACCAAAGCAAGGCAAGAAAGAGAATGGTCCCCCTCACTCACATACA CTTAAGGGAC GAAGATTGGTATTTGACAATCAGCTGACAATTAAGTCTCCTAGCAAAAGAGAACTAGCCA AAGTTCACCA AAACAAAATACTTTCTTCAGTTAGAAAAAGTCAAGAGATCACAACAAATTCTGAGCAGAG ATGTCCACTG AAGAAAGAATCTGCATGTGTGAGACTATTCAAGCAAGAAGGCACTTGCTACCAGCAAGCA AAGCTGGTCC TGAACACAGCTGTCCCAGATCGGCTGCCTGCCAGGGAAAGGGAGATGGATGTCATCAGGA ATTTCTTGAG GGAACACATCTGTGGGAAAAAAGCTGGAAGCCTTTACCTTTCTGGTGCTCCTGGAACTGG AAAAACTGCC TGCTTAAGCCGGATTCTGCAAGACCTCAAGAAGGAACTGAAAGGCTTTAAAACTATCATG CTGAATTGCA TGTCCTTGAGGACTGCCCAGGCTGTATTCCCAGCTATTGCTCAGGAGATTTGTCAGGAAG AGGTATCCAG GCCAGCTGGGAAGGACATGATGAGGAAATTGGAAAAACATATGACTGCAGAGAAGGGCCC CATGATTGTG TTGGTATTGGACGAGATGGATCAACTGGACAGCAAAGGCCAGGATGTATTGTACACGCTA TTTGAATGGC CATGGCTAAGCAATTCTCACTTGGTGCTGATTGGTATTGCTAATACCCTGGATCTCACAG ATAGAATTCT ACCTAGGCTTCAAGCTAGAGAAAAATGTAAGCCACAGCTGTTGAACTTCCCACCTTATAC CAGAAATCAG ATAGTCACTATTTTGCAAGATCGACTTAATCAGGTATCTAGAGATCAGGTTCTGGACAAT GCTGCAGTTC AATTCTGTGCCCGCAAAGTCTCTGCTGTTTCAGGAGATGTTCGCAAAGCACTGGATGTTT GCAGGAGAGC TATTGAAATTGTAGAGTCAGATGTCAAAAGCCAGACTATTCTCAAACCACTGTCTGAATG TAAATCACCT TCTGAGCCTCTGATTCCCAAGAGGGTTGGTCTTATTCACATATCCCAAGTCATCTCAGAA GTTGATGGTA ACAGGATGACCTTGAGCCAAGAAGGAGCACAAGATTCCTTCCCTCTTCAGCAGAAGATCT TGGTTTGCTC TTTGATGCTCTTGATCAGGCAGTTGAAAATCAAAGAGGTCACTCTGGGGAAGTTATATGA AGCCTACAGT AAAGTCTGTCGCAAACAGCAGGTGGCGGCTGTGGACCAGTCAGAGTGTTTGTCACTTTCA GGGCTCTTGG AAGCCAGGGGCAT T T TAGGAT TAAAGAGAAACAAGGAAACCCGT T T GACAAAGGTGT T T T TCAAGAT TGA AGAGAAAGAAAT AGAACAT GC T C T GAAAGAT AAAGC T T T AAT T GGAAAT AT CT TAGCTACTGGATTGCCT TAAATTCTTCTCTTACACCCCACCCGAAAGTATTCAGCTGGCATTTAGAGAGCTACAGTC TTCATTTTAG TGCTTTACACATTCGGGCCTGAAAACAAATATGACCTTTTTTACTTGAAGCCAATGAATT TTAATCTATA GATTCTTTAATATTAGCACAGAATAATATCTTTGGGTCTTACTATTTTTACCCATAAAAG TGACCAGGTA GACCCTTTTTAATTACATTCACTACTTCTACCACTTGTGTATCTCTAGCCAATGTGCTTG CAAGTGTACA GATCTGTGTAGAGGAATGTGTGTATATTTACCTCTTCGTTTGCTCAAACATGAGTGGGTA TTTTTTTGTT TGTTTTTTTTGTTGTTGTTGTTTTTGAGGCGCGTCTCACCCTGTTGCCCAGGCTGGAGTG CAATGGCGCG TTCTCTGCTCACTACAGCACCCGCTTCCCAGGTTGAAGTGATTCTCTTGCCTCAGCCTCC CGAGTAGCTG GGATTACAGGTGCCCACCACCGCGCCCAGCTAATTTTTTAATTTTTAGTAGAGACAGGGT TTTACCATGT TGGCCAGGCTGGTCTTGAACTCCTGACCCTCAAGTGATCTGCCCACCTTGGCCTCCCTAA GTGCTGGGAT TATAGGCGTGAGCCACCATGCTCAGCCATTAAGGTATTTTGTTAAGAACTTTAAGTTTAG GGTAAGAAGA ATGAAAATGATCCAGAAAAATGCAAGCAAGTCCACATGGAGATTTGGAGGACACTGGTTA AAGAATTTAT TTCTTTGTATAGTATACTATGTTCATGGTGCAGATACTACAACATTGTGGCATTTTAGAC TCGTTGAGTT TCTTGGGCACTCCCAAGGGCGTTGGGGTCATAAGGAGACTATAACTCTACAGATTGTGAA TATATTTATT TTCAAGTTGCATTCTTTGTCTTTTTAAGCAATCAGATTTCAAGAGAGCTCAAGCTTTCAG AAGTCAATGT GAAAATTCCTTCCTAGGCTGTCCCACAGTCTTTGCTGCCCTTAGATGAAGCCACTTGTT TCAAGAT GACT ACTTTGGGGTTGGGTTTTCATCTAAACACATTTTTCCAGTCTTATTAGATAAATTAGTCC ATATGGTTGG TTAATCAAGAGCCTTCTGGGTTTGGTTTGGTGGCATTAAATGG

NM_031423 GCGGAATGGGGCGGGACTTCCAGTAGGAGGCGGCAAGTTTGAAAAGTGATGACGGTTGAC GTTTGCTGAT 113

TTTTGACTTTGCTTGTAGCTGCTCCCCGAACTCGCCGTCTTCCTGTCGGCGGCCGGCACT GTAGATTAAC AGGAAACTTCCAAGATGGAAACTTTGTCTTTCCCCAGATATAATGTAGCTGAGATTGTGA TTCATATTCG CAATAAGATCTTAACAGGAGCTGATGGTAAAAACCTCACCAAGAATGATCTTTATCCAAA TCCAAAGCCT GAAGTCTTGCACATGATCTACATGAGAGCCTTACAAATAGTATATGGAATTCGACTGGAA CATTTTTACA TGATGCCAGTGAACTCTGAAGTCATGTATCCACATTTAATGGAAGGCTTCTTACCATTCA GCAATTTAGT TACTCATCTGGACTCATTTTTGCCTATCTGCCGGGTGAATGACTTTGAGACTGCTGATAT TCTATGTCCA AAAGCAAAACGGACAAGTCGGTTTTTAAGTGGCATTATCAACTTTATTCACTTCAGAGAA GCATGCCGTG AAACGTATATGGAATTTCTTTGGCAATATAAATCCTCTGCGGACAAAATGCAACAGTTAA ACGCCGCACA CCAGGAGGCATTAATGAAACTGGAGAGACTTGATTCTGTTCCAGTTGAAGAGCAAGAAGA GTTCAAGCAG C T T T CAGAT GGAAT T CAGGAGC T ACAACAAT CAC TAAAT CAGGAT T T T CAT CAAAAAAC GATAGTGCTGC AAGAGGGAAATTCCCAAAAGAAGTCAAATATTTCAGAGAAAACCAAGCGTTTGAATGAAC TAAAATTGTC GGTGGTTTCTTTGAAAGAAATACAAGAGAGTTTGAAAACAAAAATTGTGGATTCTCCAGA GAAGTTAAAG AATTATAAAGAAAAAATGAAAGATACGGTCCAGAAGCTTAAAAATGCCAGACAAGAAGTG GTGGAGAAAT ATGAAATCTATGGAGACTCAGTTGACTGCCTGCCTTCATGTCAGTTGGAAGTGCAGTTAT ATCAAAAGAA AATACAGGACCTTTCAGATAATAGGGAAAAATTAGCCAGTATCTTAAAGGAGAGCCTGAA CTTGGAGGAC CAAAT T GAGAG T G AT GAG T C AGAAC T GAAGAAA T T GAAG AC T GAAG AAAAT T C G T T C AAAAGAC T G AT GA T T GT GAAGAAGGAAAAAC T T GCCACAGCACAAT T C AAAAT AAAT AAGAAGC AT GAAGAT GT T AAGC AAT A CAAACGCACAGTAATTGAGGATTGCAATAAAGTTCAAGAAAAAAGAGGTGCTGTCTATGA ACGAGTAACC ACAAT T AAT CAAGAAAT CC AAAAAAT T AAAC T T GGAAT T CAACAAC T AAAAGAT GC T GC T GAAAGGGAGA AACTGAAGTCCCAGGAAATATTTCTAAACTTGAAAACTGCTTTGGAGAAATACCACGACG GTATTGAAAA GGCAGC AGAGGAC TCCTATGC T AAGAT AGAT GAGAAGAC AGC T GAAC T GAAGAGGAAGAT GT T C AAAAT G TCAACCTGATTAACAAAATTACATGTCTTTTTGTAAATGGCTTGCCATCTTTTAATTTTC TATTTAGAAA GAAAAGTTGAAGCGAATGGAAGTATCAGAAGTACCAAATAATGTTGGCTTCAT CAGTTTTTATACACTCT CATAAGTAGTTAATAAGATGAATTTAATGTAGGCTTTTATTAATTTATAATTAAAATAAC TTGTGCAGCT ATTCATGTCTCTACTCTGCCCCTTGTTGTAAATAGTTTGAGTAAAACAAAACTAGTTACC TTTGAAATAT ATATATTTTTTTCTGTTACTATC

GGCTAGCGCGGGAGGTGGAGAAAGAGGCTTGGGCGGCCCCGCTGTAGCCGCGTGTGG GAGGACGCACGGG 114 CCTGCTTCAAAGCTTTGGGATAACAGCGCCTCCGGGGGATAATGAATGCGGAGCCTCCGT TTTCAGTCGA CTTCAGATGTGTCTCCACTTTTTTCCGCTGTAGCCGCAAGGCAAGGAAACATTTCTCTTC CCGTACTGAG GAGGCTGAGGAGTGCACTGGGTGTTCTTTTCTCCTCTAACCCAGAACTGCGAGACAGAGG CTGAGTCCCT GTAAAGAACAGCTCCAGAAAAGCCAGGAGAGCGCAGGAGGGCATCCGGGAGGCCAGGAGG GGTTCGCTGG GGCCTCAACCGCACCCACATCGGTCCCACCTGCGAGGGGGCGGGACCTCGTGGCGCTGGA CCAATCAGCA CCCACCTGCGCTCACCTGGCCTCCTCCCGCTGGCTCCCGGGGGCTGCGGTGCTCAAAGGG GCAAGAGCTG AGCGGAACACCGGCCCGCCGTCGCGGCAGCTGCTTCACCCCTCTCTCTGCAGCCATGGGG CTCCCTCGTG GACCTCTCGCGTCTCTCCTCCTTCTCCAGGTTTGCTGGCTGCAGTGCGCGGCCTCCGAGC CGTGCCGGGC GGTCTTCAGGGAGGCTGAAGTGACCTTGGAGGCGGGAGGCGCGGAGCAGGAGCCCGGCCA GGCGCTGGGG AAAGTATTCATGGGCTGCCCTGGGCAAGAGCCAGCTCTGTTTAGCACTGATAATGATGAC TTCACTGTGC GGAATGGCGAGACAGTCCAGGAAAGAAGGTCACTGAAGGAAAGGAATCCATTGAAGATCT TCCCATCCAA ACGTATCTTACGAAGACACAAGAGAGATTGGGTGGTTGCTCCAATATCTGTCCCTGAAAA TGGCAAGGGT CCCTTCCCCCAGAGACTGAATCAGCTCAAGTCTAATAAAGATAGAGACACCAAGATTTTC TACAGCATCA CGGGGCCGGGGGCAGACAGCCCCCCTGAGGGTGTCTTCGCTGTAGAGAAGGAGACAGGCT GGTTGTTGTT GAATAAGCCACTGGACCGGGAGGAGATTGCCAAGTATGAGCTCTTTGGCCACGCTGTGTC AGAGAATGGT GCCTCAGTGGAGGACCCCATGAACATCTCCATCATAGTGACCGACCAGAATGACCACAAG CCCAAGTTTA CCCAGGACACCTTCCGAGGGAGTGTCTTAGAGGGAGTCCTACCAGGTACTTCTGTGATGC AGATGACAGC CACAGATGAGGATGATGCCATCTACACCTACAATGGGGTGGTTGCTTACTCCATCCATAG CCAAGAACCA AAGGACCCACACGACCTCATGTTCACAATTCACCGGAGCACAGGCACCATCAGCGTCATC TCCAGTGGCC TGGACCGGGAAAAAGTCCCTGAGTACACACTGACCATCCAGGCCACAGACATGGATGGGG ACGGCTCCAC CACCACGGCAGTGGCAGTAGTGGAGATCCTTGATGCCAATGACAATGCTCCCATGTTTGA CCCCCAGAAG TACGAGGCCCATGTGCCTGAGAATGCAGTGGGCCATGAGGTGCAGAGGCTGACGGTCACT GATCTGGACG CCCCCAACTCACCAGCGTGGCGTGCCACCTACCTTATCATGGGCGGTGACGACGGGGACC ATTTTACCAT CACCACCCACCCTGAGAGCAACCAGGGCATCCTGACAACCAGGAAGGGTTTGGATTTTGA GGCCAAAAAC CAGCACACCCTGTACGTTGAAGTGACCAACGAGGCCCCTTTTGTGCTGAAGCTCCCAACC TCCACAGCCA CCATAGTGGTCCACGTGGAGGATGTGAATGAGGCACCTGTGTTTGTCCCACCCTCCAAAG TCGTTGAGGT CCAGGAGGGCATCCCCACTGGGGAGCCTGTGTGTGTCTACACTGCAGAAGACCCTGACAA GGAGAATCAA AAGATCAGCTACCGCATCCTGAGAGACCCAGCAGGGTGGCTAGCCATGGACCCAGACAGT GGGCAGGTCA CAGCTGTGGGCACCCTCGACCGTGAGGATGAGCAGTTTGTGAGGAACAACATCTATGAAG TCATGGTCTT GGCCATGGACAATGGAAGCCCTCCCACCACTGGCACGGGAACCCTTCTGCTAACACTGAT TGATGTCAAC GACCATGGCCCAGTCCCTGAGCCCCGTCAGATCACCATCTGCAACCAAAGCCCTGTGCGC CAGGTGCTGA ACATCACGGACAAGGACCTGTCTCCCCACACCTCCCCTTTCCAGGCCCAGCTCACAGATG ACTCAGACAT CTACTGGACGGCAGAGGTCAACGAGGAAGGTGACACAGTGGTCTTGTCCCTGAAGAAGTT CCTGAAGCAG GATACATATGACGTGCACCTTTCTCTGTCTGACCATGGCAACAAAGAGCAGCTGACGGTG ATCAGGGCCA CTGTGTGCGACTGCCATGGCCATGTCGAAACCTGCCCTGGACCCTGGAAAGGAGGTTTCA TCCTCCCTGT GCTGGGGGCTGTCCTGGCTCTGCTGTTCCTCCTGCTGGTGCTGCTTTTGTTGGTGAGAAA GAAGCGGAAG ATCAAGGAGCCCCTCCTACTCCCAGAAGATGACACCCGTGACAACGTCTTCTACTATGGC GAAGAGGGGG GTGGCGAAGAGGACCAGGACTATGACATCACCCAGCTCCACCGAGGTCTGGAGGCCAGGC CGGAGGTGGT TCTCCGCAATGACGTGGCACCAACCATCATCCCGACACCCATGTACCGTCCTAGGCCAGC CAACCCAGAT GAAATCGGCAACTTTATAATTGAGAACCTGAAGGCGGCTAACACAGACCCCACAGCCCCG CCCTACGACA CCCTCTTGGTGTTCGACTATGAGGGCAGCGGCTCCGACGCCGCGTCCCTGAGCTCCCTCA CCTCCTCCGC CTCCGACCAAGACCAAGATTACGATTATCTGAACGAGTGGGGCAGCCGCTTCAAGAAGCT GGCAGACATG TACGGTGGCGGGGAGGACGACTAGGCGGCCTGCCTGCAGGGCTGGGGACCAAACGTCAGG CCACAGAGCA TCTCCAAGGGGTCTCAGTTCCCCCTTCAGCTGAGGACTTCGGAGCTTGTCAGGAAGTGGC CGTAGCAACT TGGCGGAGACAGGCTATGAGTCTGACGTTAGAGTGGTTGCTTCCTTAGCCTTTCAGGATG GAGGAATGTG GGCAGTTTGACTTCAGCACTGAAAACCTCTCCACCTGGGCCAGGGTTGCCTCAGAGGCCA AGTTTCCAGA AGCCTCTTACCTGCCGTAAAATGCTCAACCCTGTGTCCTGGGCCTGGGCCTGCTGTGACT GACCTACAGT GGACTTTCTCTCTGGAATGGAACCTTCTTAGGCCTCCTGGTGCAACTTAATTTTTTTTTT TAATGCTATC TTCAAAACGTTAGAGAAAGTTCTTCAAAAGTGCAGCCCAGAGCTGCTGGGCCCACTGGCC GTCCTGCATT TCTGGTTTCCAGACCCCAATGCCTCCCATTCGGATGGATCTCTGCGTTTTTATACTGAGT GTGCCTAGGT TGCCCCTTATTTTTTATTTTCCCTGTTGCGTTGCTATAGATGAAGGGTGAGGACAATCGT GTATATGTAC TAGAAC TTTTT TAT TAAAGAAACT T T TCCCAAAAAAAAAAAAAAAA

ΝΜ_016343 GAGACCAGAAGCGGGCGAATTGGGCACCGGTGGCGGCTGCGGGCAGTTTGAATTAGACTC TGGGCTCCAG 115

CCCGCCGAAGCCGCGCCAGAACTGTACTCTCCGAGAGGTCGTTTTCCCGTCCCCGAGAGC AAGTTTATTT ACAAATGTT GGAG T AAT AAAGAAGGC AGAACAAAAT GAGCT GGGCTTT GGAAGAAT GGAAAGAAGGGC T G CC T ACAAGAGC T C T T CAGAAAAT T CAAGAGC T T GAAGGACAGC T T GACAAACT GAAGAAGGAAAAGCAGC AAAGGCAGTTTCAGCTTGACAGTCTCGAGGCTGCGCTGCAGAAGCAAAAACAGAAGGTTG AAAATGAAAA AACCGAGGGTACAAACCTGAAAAGGGAGAATCAAAGATTGATGGAAATATGTGAAAGTCT GGAGAAAACT AAGCAGAAGAT T T C T CAT GAAC T T CAAGT CAAGGAGT CACAAGT GAAT T T C CAGGAAGGACAAC T GAAT T CAGGCAAAAAACAAAT AGAAAAAC T GGAACAGGAAC T T AAAAGGT G T AAAT CT GAGC T T GAAAGAAGCCA ACAAGCTGCGCAGTCTGCAGATGTCTCTCTGAATCCATGCAATACACCACAAAAAATTTT TACAACTCCA CTAACACCAAGTCAATATTATAGTGGTTCCAAGTATGAAGATCTAAAAGAAAAATATAAT AAAGAGGTTG AAGAACGAAAAAGATTAGAGGCAGAGGTTAAAGCCTTGCAGGCTAAAAAAGCAAGCCAGA CTCTTCCACA AGCCACCATGAATCACCGCGACATTGCCCGGCATCAGGCTTCATCATCTGTGTTCTCATG GCAGCAAGAG AAGACCCCAAGTCATCTTTCATCTAATTCTCAAAGAACTCCAATTAGGAGAGATTTCTCT GCATCTTACT TTTCTGGGGAACAAGAGGTGACTCCAAGTCGATCAACTTTGCAAATAGGGAAAAGAGATG CTAATAGCAG TTTCTTTGACAATTCTAGCAGTCCTCATCTTTTGGATCAATTAAAAGCGCAGAATCAAGA GCTAAGAAAC AAGATTAATGAGTTGGAACTACGCCTGCAAGGACATGAAAAAGAAATGAAAGGCCAAGTG AATAAGTTTC AAGAAC TCCAAC TCCAAC T GGAGAAAGCAAAAG T GGAAT T AAT T GAAAAAG AG AAAG T T T T GAACAAAT G TAGGGATGAACTAGTGAGAACAACAGCACAATACGACCAGGCGTCAACCAAGTATACTGC ATTGGAACAA AAACTGAAAAAATTGACGGAAGATTTGAGTTGTCAGCGACAAAATGCAGAAAGTGCCAGA TGTTCTCTGG AACAGAAAATTAAGGAAAAAGAAAAGGAGTTTCAAGAGGAGCTCTCCCGTCAACAGCGTT CTTTCCAAAC AC T GGACCAGGAG TGCATC CAGAT GAAGGCCAGAC T CAC CCAGGAG T T ACAGC AAGCCAAGAAT AT GCAC AACGTCCTGCAGGCTGAACTGGATAAACTCACATCAGTAAAGCAACAGCTAGAAAACAAT TTGGAAGAGT TTAAGCAAAAGTTGTGCAGAGCTGAACAGGCGTTCCAGGCGAGTCAGATCAAGGAGAATG AGCTGAGGAG AAGCATGGAGGAAATGAAGAAGGAAAACAACCTCCTTAAGAGTCACTCTGAGCAAAAGGC CAGAGAAGTC T GCCAC C T GGAGGCAGAAC T CAAGAACAT CAAACAGT GT T T AAAT CAGAGC CAGAAT T T T GCAGAAGAAA TGAAAGCGAAGAATACCTCTCAGGAAACCATGTTAAGAGATCTTCAAGAAAAAATAAATC AGCAAGAAAA CTCCTTGACTTTAGAAAAACTGAAGCTTGCTGTGGCTGATCTGGAAAAGCAGCGAGATTG TTCTCAAGAC C T T T T GAAGAAAAGAGAAC AT CACAT T GAACAAC T T AAT GAT AAGT T AAGC AAGACAGAGAAAGAG T CCA AAGCCTTGCTGAGTGCTTTAGAGTTAAAAAAGAAAGAATATGAAGAATTGAAAGAAGAGA AAACTCTGTT TTCTTGTTGGAAAAGTGAAAACGAAAAACTTTTAACTCAGATGGAATCAGAAAAGGAAAA CTTGCAGAGT AAAAT TAATCACTTGGAAACTTGTCTGAAGACACAGCAAATAAAAAGTCATGAATACAACGAGAG AGTAA GAACGCTGGAGATGGACAGAGAAAACCTAAGTGTCGAGATCAGAAACCTTCACAACGTGT TAGACAGTAA GTCAGTGGAGGTAGAGACCCAGAAACTAGCTTATATGGAGCTACAGCAGAAAGCTGAGTT CTCAGATCAG AAACAT CAGAAGGAAAT AGAAAAT AT GT GT T T GAAGAC T T C T CAGC T T AC T GGGCAAGT T GAAGAT C T AG AACACAAGCTTCAGTTACTGTCAAATGAAATAATGGACAAAGACCGGTGTTACCAAGACT TGCATGCCGA AT AT GAGAGCC T CAGGGATCTGC T AAAAT CCAAAGAT GC T T C T C T GGT GAC AAAT GAAGAT CAT CAGAGA AGTCTTTTGGCTTTTGATCAGCAGCCTGCCATGCATCATTCCTTTGCAAATATAATTGGA GAACAAGGAA GCATGCCTTCAGAGAGGAGTGAATGTCGTTTAGAAGCAGACCAAAGTCCGAAAAATTCTG CCATCCTACA AAAT AGAGT T GAT T CAC T T GAAT T T T CAT TAGAGTC T CAAAAACAGAT GAACT CAGACC T GCAAAAGCAG T GT GAAGAGT T GGT GCAAAT CAAAGGAGAAAT AGAAGAAAAT C T CAT GAAAGC AGAACAGAT GCAT CAAA GTTTTGTGGCTGAAACAAGTCAGCGCATTAGTAAGTTACAGGAAGACACTTCTGCTCACC AGAATGTTGT TGCTGAAACCTTAAGTGCCCTTGAGAACAAGGAAAAAGAGCTGCAACTTTTAAATGATAA GGTAGAAACT GAGCAGGCAGAGAT T CAAGAAT T AAAAAAGAGC AACCAT C T AC T T GAAGAC TC T C T AAAGGAGC T ACAAC TTTTATCCGAAACCCTAAGCTTGGAGAAGAAAGAAATGAGTTCCATCATTTCTCTAAATA AAAGGGAAAT T GAAGAGC T GACC CAAGAGAAT GGGACTCT T AAGGAAAT T AAT GCAT CC T T AAAT CAAGAGAAGAT GAAC TTAATCCAGAAAAGTGAGAGTTTTGCAAACTATATAGATGAAAGGGAGAAAAGCATTTCA GAGTTATCTG ATCAGTACAAGCAAGAAAAACTTATTTTACTACAAAGATGTGAAGAAACCGGAAATGCAT ATGAGGATCT T AGT CAAAAAT AC AAAGCAGCACAGGAAAAGAAT T C T AAAT T AGAAT GC T T GC T AAAT GAAT GCAC TAGT CTTTGTGAAAATAGGAAAAATGAGTTGGAACAGCTAAAGGAAGCATTTGCAAAGGAACAC CAAGAATTCT TAACAAAATTAGCATTTGCTGAAGAAAGAAATCAGAATCTGATGCT AGAGT TGGAGACAGTGCAGCAAGC TCTGAGATCTGAGATGACAGATAACCAAAACAATTCTAAGAGCGAGGCTGGTGGTTTAAA GCAAGAAATC AT GAC T T T AAAGGAAGAAC AAAACAAAAT GCAAAAGGAAGT T AAT GAC T T AT T ACAAGAGAAT GAACAGC T GAT GAAGGT AAT GAAGAC T AAACAT GAAT GT C AAAAT C T AGAAT C AGAAC CAAT T AGGAAC T C T G T GAA AGAAAGAGAGAGTGAGAGAAATCAATGTAATTTTAAACCTCAGATGGATCTTGAAGTTAA AGAAATTTCT CTAGATAGTTATAATGCGCAGTTGGTGCAATTAGAAGCTATGCTAAGAAATAAGGAATTA AAACTTCAGG AAAGTGAGAAGGAGAAGGAGTGCCTGCAGCATGAATTACAGACAATTAGAGGAGATCTTG AAACCAGCAA TTTGCAAGACATGCAGTCACAAGAAATTAGTGGCCTTAAAGACTGTGAAATAGATGCGGA AGAAAAGTAT ATTTCAGGGCCTCATGAGTTGTCAACAAGTCAAAACGACAATGCACACCTTCAGTGCTCT CTGCAAACAA CAAT GAACAAGC T GAAT GAGC T AGAGAAAAT AT GT GAAAT AC TGCAGGCT GAAAAGT AT GAAC T CG T AAC T GAGC T GAAT GAT T CAAGG T CAGAAT GT AT CAC AGCAAC T AGGAAAAT GGC AGAAGAGG T AGGGAAAC T A CTAAATGAAGTTAAAATATTAAATGATGACAGTGGTCTTCTCCATGGTGAGTTAGTGGAA GACATACCAG GAGGTGAATTTGGTGAACAACCAAATGAACAGCACCCTGTGTCTTTGGCTCCATTGGACG AGAGTAATTC C T ACGAGCAC T T GACAT T G T C AGAC AAAGAAG T T CAAAT GCAC T T T GCCGAAT T GCAAGAGAAAT T C T T A TCTTTACAAAGTGAACACAAAATTTTACATGATCAGCACTGTCAGATGAGCTCTAAAATG TCAGAGCTGC AGACCTATGTTGACTCATTAAAGGCCGAAAATTTGGTCTTGTCAACGAATCTGAGAAACT TTCAAGGTGA CTTGGTGAAGGAGATGCAGCTGGGCTTGGAGGAGGGGCTCGTTCCATCCCTGTCATCCTC TTGTGTGCCT GACAGCTCTAGTCTTAGCAGTTTGGGAGACTCCTCCTTTTACAGAGCTCTTTTAGAACAG ACAGGAGATA TGTCTCTTTTGAGTAATTTAGAAGGGGCTGTTTCAGCAAACCAGTGCAGTGTAGATGAAG TATTTTGCAG CAGTCTGCAGGAGGAGAATCTGACCAGGAAAGAAACCCCTTCGGCCCCAGCGAAGGGTGT TGAAGAGCTT GAGTCCCTCTGTGAGGTGTACCGGCAGTCCCTCGAGAAGCTAGAAGAGAAAATGGAAAGT CAAGGGATTA T GAAAAAT AAGGAAAT T CAAGAGC T C GAGCAGT TATTAAGTTCT GAAAGGC AAGAGC T T GAC T GCC T T AG GAAGCAGTATTTGTCAGAAAATGAACAGTGGCAACAGAAGCTGACAAGCGTGACTCTGGA GATGGAGTCC AAGTTGGCGGCAGAAAAGAAACAGACGGAACAACTGTCACTTGAGCTGGAAGTAGCACGA CTCCAGCTAC AAGGTCTGGACTTAAGTTCTCGGTCTTTGCTTGGCATCGACACAGAAGATGCTATTCAAG GCCGAAATGA GAGC T G T GACAT AT CAAAAGAACAT AC T T CAGAAAC T AC AGAAAGAACACC AAAGCAT GAT GT T CAT CAG ATT T GT GAT AAAGAT GC T C AGCAGGACC T CAAT C T AGAC AT T GAGAAAAT AAC T GAGAC TGGTGCAGTGA AACCCACAGGAGAGTGCTCTGGGGAACAGTCCCCAGATACCAATTATGAGCCTCCAGGGG AAGATAAAAC CCAGGGCTCTTCAGAATGCATTTCTGAATTGTCATTTTCTGGTCCTAATGCTTTGGTACC TATGGATTTC CTGGGGAATCAGGAAGATATCCATAATCTTCAACTGCGGGTAAAAGAGACATCAAATGAG AATTTGAGAT TACTTCATGTGATAGAGGACCGTGACAGAAAAGTTGAAAGTTTGCTAAATGAAATGAAAG AATTAGACTC AAAACTCCATTTACAGGAGGTACAACTAATGACCAAAATTGAAGCATGCATAGAATTGGA AAAAATAGTT GGGGAACTTAAGAAAGAAAACTCAGATTTAAGTGAAAAATTGGAATATTTTTCTTGTGAT CACCAGGAGT TACTCCAGAGAGTAGAAACTTCTGAAGGCCTCAATTCTGATTTAGAAATGCATGCAGATA AATCATCACG TGAAGATATTGGAGATAATGTGGCCAAGGTGAATGACAGCTGGAAGGAGAGATTTCTTGA TGTGGAAAAT GAGCTGAGTAGGATCAGATCGGAGAAAGCTAGCATTGAGCATGAAGCCCTCTACCTGGAG GCTGACTTAG AGGTAGTTCAAACAGAGAAGCTATGTTTAGAAAAAGACAATGAAAATAAGCAGAAGGTTA TTGTCTGCCT TGAAGAAGAACTCTCAGTGGTCACAAGTGAGAGAAACCAGCTTCGTGGAGAATTAGATAC TATGTCAAAA AAAACCACGGCACTGGATCAGTTGTCTGAAAAAATGAAGGAGAAAACACAAGAGCTTGAG TCTCATCAAA GTGAGTGTCTCCATTGCATTCAGGTGGCAGAGGCAGAGGTGAAGGAAAAGACGGAACTCC TTCAGACTTT GTCCTCTGATGTGAGTGAGCTGTTAAAAGACAAAACTCATCTCCAGGAAAAGCTGCAGAG TTTGGAAAAG GACTCACAGGCACTGTCTTTGACAAAATGTGAGCTGGAAAACCAAATTGCACAACTGAAT AAAGAGAAAG AATTGCTTGTCAAGGAATCTGAAAGCCTGCAGGCCAGACTGAGTGAATCAGATTATGAAA AGCTGAATGT CTCCAAGGCCTTGGAGGCCGCACTGGTGGAGAAAGGTGAGTTCGCATTGAGGCTGAGCTC AACACAGGAG GAAGTGCATCAGCTGAGAAGAGGCATCGAGAAACTGAGAGTTCGCATTGAGGCCGATGAA AAGAAGCAGC TGCACATCGCAGAGAAACTGAAAGAACGCGAGCGGGAGAATGATTCACTTAAGGATAAAG TTGAGAACCT TGAAAGGGAATTGCAGATGTCAGAAGAAAACCAGGAGCTAGTGATTCTTGATGCCGAGAA TTCCAAAGCA GAAGT AGAGAC T C T AAAAACACAAAT AGAAGAGAT GGCC AGAAGCC T GAAAGT T T T T GAAT T AGAC C T T G TCACGTTAAGGTCTGAAAAAGAAAATCTGACAAAACAAATACAAGAAAAACAAGGTCAGT TGTCAGAACT AGACAAGTTACTCTCTTCATTTAAAAGTCTGTTAGAAGAAAAGGAGCAAGCAGAGATACA GATCAAAGAA GAATCTAAAACTGCAGTGGAGATGCTTCAGAATCAGTTAAAGGAGCTAAATGAGGCAGTA GCAGCCTTGT GTGGTGACCAAGAAATTATGAAGGCCACAGAACAGAGTCTAGACCCACCAATAGAGGAAG AGCATCAGCT GAGAAATAGCATTGAAAAGCTGAGAGCCCGCCTAGAAGCTGATGAAAAGAAGCAGCTCTG TGTCTTACAA CAACTGAAGGAAAGTGAGCATCATGCAGATTTACTTAAGGGTAGAGTGGAGAACCTTGAA AGAGAGCTAG AGATAGCCAGGACAAACCAAGAGCATGCAGCTCTTGAGGCAGAGAATTCCAAAGGAGAGG TAGAGACCCT AAAAGCAAAAATAGAAGGGATGACCCAAAGTCTGAGAGGTCTGGAATTAGATGTTGTTAC TATAAGGTCA GAAAAAGAAAAT C T GACAAAT GAAT T ACAAAAAGAGCAAGAGCGAAT AT C T GAAT T AGAAAT AAT AAAT T CAT CAT T T GAAAAT AT T T T GCAAGAAAAAGAGC AAGAGAAAGT ACAGAT GAAA GAAAAAT CAAGCAC T GC CAT GGAGAT GC T T CAAACACAAT T AAAAGAGC T CAAT GAGAGAGT GGCAGC CC T GCAT AAT GACCAAGAA GCCTGTAAGGCCAAAGAGCAGAATCTTAGTAGTCAAGTAGAGTGTCTTGAACTTGAGAAG GCTCAGTTGC TACAAGGCCTTGATGAGGCCAAAAATAATTATATTGTTTTGCAATCTTCAGTGAATGGCC TCATTCAAGA AGT AGAAGAT GGC AAGCAGAAAC T GGAGAAGAAGGAT GAAGAAAT C AGT AGAC T GAAAAAT CAAAT T CAA GACCAAGAGCAGCTTGTCTCTAAACTGTCCCAGGTGGAAGGAGAGCACCAACTTTGGAAG GAGCAAAACT TAGAACTGAGAAATCTGACAGTGGAATTGGAGCAGAAGATCCAAGTGCTACAATCCAAAA ATGCCTCTTT GCAGGACACAT T AGAAGT GC T GCAGAGT T C T T ACAAGAAT C T AGAGAAT GAGC T T GAAT T GACAAAAAT G GACAAAATGTCCTTTGTTGAAAAAGTAAACAAAATGACTGCAAAGGAAACTGAGCTGCAG AGGGAAATGC ATGAGATGGCACAGAAAACAGCAGAGCTGCAAGAAGAACTCAGTGGAGAGAAAAATAGGC TAGCTGGAGA GTTGCAGTTACTGTTGGAAGAAATAAAGAGCAGCAAAGATCAATTGAAGGAGCTCACACT AGAAAATAGT GAATTGAAGAAGAGCCTAGATTGCATGCACAAAGACCAGGTGGAAAAGGAAGGGAAAGTG AGAGAGGAAA TAGCTGAATATCAGCTACGGCTTCATGAAGCTGAAAAGAAACACCAGGCTTTGCTTTTGG ACACAAACAA ACAGTATGAAGTAGAAATCCAGACATACCGAGAGAAATTGACTTCTAAAGAAGAATGTCT CAGTTCACAG AAGCTGGAGATAGACCTTTTAAAGTCTAGTAAAGAAGAGCTCAATAATTCATTGAAAGCT ACTACTCAGA T T T T GGAAGAAT T GAAGAAAACCAAGAT GGACAAT C T AAAAT AT GT AAAT C AG T T GAAGAAGGAAAAT GA ACGTGCCCAGGGGAAAATGAAGTTGTTGATCAAATCCTGTAAACAGCTGGAAGAGGAAAA GGAGATACTG CAGAAAGAACTCTCTCAACTTCAAGCTGCACAGGAGAAGCAGAAAACAGGTACTGTTATG GATACCAAGG TCGAT GAAT T AAC AAC T GAGAT CAAAGAAC T GAAAGAAAC T C T T GAAGAAAAAACCAAGGAGGCAGAT GA ATACTTGGATAAGTACTGTTCCTTGCTTATAAGCCATGAAAAGTTAGAGAAAGCTAAAGA GATGTTAGAG ACACAAGTGGCCCATCTGTGTTCACAGCAATCTAAACAAGATTCCCGAGGGTCTCCTTTG CTAGGTCCAG TTGTTCCAGGACCATCTCCAATCCCTTCTGTTACTGAAAAGAGGTTATCATCTGGCCAAA ATAAAGCTTC AGGCAAGAGGCAAAGATCCAGTGGAATATGGGAGAATGGTAGAGGACCAACACCTGCTAC CCCAGAGAGC TTTTCTAAAAAAAGCAAGAAAGCAGTCATGAGTGGTATTCACCCTGCAGAAGACACGGAA GGTACTGAGT TTGAGCCAGAGGGACTTCCAGAAGTTGTAAAGAAAGGGTTTGCTGACATCCCGACAGGAA AGACTAGCCC ATATATCCTGCGAAGAACAACCATGGCAACTCGGACCAGCCCCCGCCTGGCTGCACAGAA GTTAGCGCTA TCCCCACTGAGTCTCGGCAAAGAAAATCTTGCAGAGTCCTCCAAACCAACAGCTGGTGGC AGCAGATCAC AAAAGGTCAAAGTTGCTCAGCGGAGCCCAGTAGATTCAGGCACCATCCTCCGAGAACCCA CCACGAAATC CGTCCCAGTCAATAATCTTCCTGAGAGAAGTCCGACTGACAGCCCCAGAGAGGGCCTGAG GGTCAAGCGA GGCCGACTTGTCCCCAGCCCCAAAGCTGGACTGGAGTCCAACGGCAGTGAGAACTGTAAG GTCCAGTGAA GGCACTTTGTGTGTCAGTACCCCTGGGAGGTGCCAGTCATTGAATAGATAAGGCTGTGCC TACAGGACTT CTCTTTAGTCAGGGCATGCTTTATTAGTGAGGAGAAAACAATTCCTTAGAAGTCTTAAAT ATATTGTACT CTTTAGATCTCCCATGTGTAGGTATTGAAAAAGTTTGGAAGCACTGATCACCTGTTAGCA TTGCCATTCC TCTACTGCAATGTAAATAGTATAAAGCTATGTATATAAAGCTTTTTGGTAATATGTTACA ATTAAAATGA CAAGCACTATATCACAATCTCTGTTTGTATGTGGGTTTTACACTAAAAAAATGCAAAACA CATTTTATTC TTCTAATTAACAGCTCCTAGGAAAATGTAGACTTTTGCTTTATGATATTCTATCTGTAGT ATGAGGCATG GAATAGTTTTGTATCGGGAATTTCTCAGAGCTGAGTAAAATGAAGGAAAAGCATGTTATG TGTTTTTAAG GAAAATGTGCACACATATACATGTAGGAGTGTTTATCTTTCTCTTACAATCTGTTTTAGA CATCTTTGCT TATGAAACCTGTACATATGTGTGTGTGGGTATGTGTTTATTTCCAGTGAGGGCTGCAGGC TTCCTAGAGG TGTGCTATACCATGCGTCTGTCGTTGTGCTTTTTTCTGTTTTTAGACCAATTTTTTACAG TTCTTTGGTA AGCATTGTCGTATCTGGTGATGGATTAACATATAGCCTTTGTTTTCTAATAAAATAGTCG CCTTCGTTTT C T G T AAAAAAAAAAAAAAAAAAAAAA

AB091343 GGCACGAGGGGCCGACGCGAGCGCCGCGCTTCGCTTCAGCTGCTAGCTGGCCCAAGGGAG GCGACCGCGG 116

AGGGTGGCGAGGGGCGGCCAGGACCCGCAGCCCCGGGGCCGGGCCGGTCCGGACCGCCAG GGAGGGCAGG TCAGTGGGCAGATCGCGTCCGCGGGATTCAATCTCTGCCCGCTCTGATAACAGTCCTTTT CCCTGGCGCT CACTTCGTGCCTGGCACCCGGCTGGGCGCCTCAAGACCGTTGTCTCTTCGATCGCTTCTT TGGACTTGGC GACCATTTCAGAGATGTCTTCCAGAAGTACCAAAGATTTAATTAAAAGTAAGTGGGGATC GAAGCCTAGT AACTCCAAATCCGAAACTACATTAGAAAAATTAAAGGGAGAAATTGCACACTTAAAGACA TCAGTGGATG AAATCACAAGTGGGAAAGGAAAGCTGACTGATAAAGAGAGACACAGACTTTTGGAGAAAA TTCGAGTCCT T GAGGC T GAGAAGGAGAAGAAT GC T T AT CAAC T CACAGAGAAGGAC AAAGAAAT ACAGC GAC T GAGAGAC CAACTGAAGGCCAGATATAGTACTACCGCATTGCTTGAACAGCTGGAAGAGACAACGAGA GAAGGAGAAA GGAGGGAGCAGGTGTTGAAAGCCTTATCTGAAGAGAAAGACGTATTGAAACAACAGTTGT CTGCTGCAAC CTCACGAATTGCTGAACTTGAAAGCAAAACCAATACACTCCGTTTATCACAGACTGTGGC TCCAAACTGC T T CAAC T CAT CAAT AAAT AAT AT T CAT GAAAT GGAAAT ACAGC T GAAAGAT GC T C T GGAGAAAAAT CAGC AGTGGCTCGTGTATGATCAGCAGCGGGAAGTCTATGTAAAAGGACTTTTAGCAAAGATCT TTGAGTTGGA AAAGAAAACGGAAACAGCTGCTCATTCACTCCCACAGCAGACAAAAAAGCCTGAATCAGA AGGTTATCTT CAAGAAGAGAAGCAGAAATGTTACAACGATCTCTTGGCAAGTGCAAAAAAAGATCTTGAG GTTGAACGAC AAACCATAACTCAGCTGAGTTTTGAACTGAGTGAATTTCGAAGAAAATATGAAGAAACCC AAAAAGAAGT TCACAATTTAAATCAGCTGTTGTATTCACAAAGAAGGGCAGATGTGCAACATCTGGAAGA TGATAGGCAT AAAACAGAGAAGAT ACAAAAAC T CAGGGAAGAGAAT GAT AT T GC T AGGGGAAAAC T T GAAGAAGAGAAGA AGAGATCCGAAGAGCTCTTATCTCAGGTCCAGTTTCTTTACACATCTCTGCTAAAGCAGC AAGAAGAACA AACAAGGGTAGCTCTGTTGGAACAACAGATGCAGGCATGTACTTTAGACTTTGAAAATGA AAAACTCGAC CGTCAACATGTGCAGCATCAATTGCATGTAATTCTTAAGGAGCTCCGAAAAGCAAGAAAT CAAATAACAC AGTTGGAATCCTTGAAACAGCTTCATGAGTTTGCCATCACAGAGCCATTAGTCACTTTCC AAGGAGAGAC TGAAAACAGAGAAAAAGTTGCCGCCTCACCAAAAAGTCCCACTGCTGCACTCAATGAAAG CCTGGTGGAA TGTCCCAAGTGCAATATACAGTATCCAGCCACTGAGCATCGCGATCTGCTTGTCCATGTG GAATACTGTT CAAAGTAGCAAAATAAGTATTTGTTTTGATATTAAAAGATTCAATACTGTATTTTCTGTT AGCTTGTGGG CATTTTGAATTATATATTTCACATTTTGCATAAAACTGCCTATCTACCTTTGACACTCCA GCATGCTAGT GAATCATGTATCTTTTAGGCTGCTGTGCATTTCTCTTGGCAGTGATACCTCCCTGACATG GTTCATCATC AGGCTGCAATGACAGAATGTGGTGAGCAGCGTCTACTGAGACTACTAACATTTTGCACTG TCAAAATACT TGGTGAGGAAAAGATAGCTCAGGTTATTGCTAATGGGTTAATGCACCAGCAAGCAAAATA TTTTATGTTT TGGGGGTTTGAAAAATCAAAGATAATTAACCAAGGATCTTAACTGTGTTCGCATTTTTTA TCCAAGCACT TAGAAAACCTACAATCCTAATTTTGATGTCCATTGTTAAGAGGTGGTGATAGATACTATT TTTTTTTTCA TATTGTATAGCGGTTATTAGAAAAGTTGGGGATTTTCTTGATCTTTATTGCTGCTTACCA TTGAAACTTA ACCCAGCTGTGTTCCCCAACTCTGTTCTGCGCACGAAACAGTATCTGTTTGAGGCATAAT CTTAAGTGGC CACACACAATGTTTTCTCTTATGTTATCTGGCAGTAACTGTAACTTGAATTACATTAGCA CATTCTGCTT AGCTAAAATTGTTAAAATAAACTTTAATAAACCCATGTAGCCCTCTCATTTGATTGACAG TATTTTAGTT ATTTTTGGCATTCTTAAAGCTGGGCAATGTAATGATCAGATCTTTGTTTGTCTGAACAGG TATTTTTATA CATGCTTTTTGTAAACCAAAAACTTTTAAATTTCTTCAGGTTTTCTAACATGCTTACCAC TGGGCTACTG T AAAT GAGAAAAGAAT AAAAT TATTTAATGTTT T AAAAAAAAAAAAAAA

BC006428 GGCGGCTGAGCCTGAGCGGGGATGTAGAGGCGGCGGCAGCAGAGGCGGCACTGGCGGCAA GAGCAGACGC 117

CCGAGCCGAGCGAGAAGAGCGGCAGAGCCTTATCCCCTGAAGCCGGGCCCCGCGTCCCAG CCCTGCCCAG CCCGCGCCCAGCCATGCGCGCCGCCTGCTGAGTCCGGGCGCCGCACGCTGAGCCCTCCGC CCGCGAGCCG CGCTCAGCTCGGGGGTGATTAGTTGCTTTTTGTTGTTTTTTAATTTGGGCCGCGGGGAGG GGGAGGAGGG GCAGGTGCTGCAGGCTCCCCCCCCTCCCCGCCTCGGGCCAGCCGCGGCGGCGCGACTCGG GCTCCGGACC CGGGCACTGCTGGCGGCTGGAGCGGAGCGCACCGCGGCGGTGGTGCCCAGAGCGGAGCGC AGCTCCCTGC CCCGCCCCTCCCCCTCGGCCTCGCGGCGACGGCGGCGGTGGCGGCTTGGACGACTCGGAG AGCCGAGTGA AGACATTTCCACCTGGACACCTGACCATGTGCCTGCCCTGAGCAGCGAGGCCCACCAGGC ATCTCTGTTG TGGGCAGCAGGGCCAGGTCCTGGTCTGTGGACCCTCGGCAGTTGGCAGGCTCCCTCTGCA GTGGGGTCTG GGCCTCGGCCCCACCATGTCGAGCCTCGGCGGTGGCTCCCAGGATGCCGGCGGCAGTAGC AGCAGCAGCA CCAATGGCAGCGGTGGCAGTGGCAGCAGTGGCCCAAAGGCAGGAGCAGCAGACAAGAGTG CAGTGGTGGC TGCCGCCGCACCAGCCTCAGTGGCAGATGACACACCACCCCCCGAGCGTCGGAACAAGAG CGGTATCATC AGTGAGCCCCTCAACAAGAGCCTGCGCCGCTCCCGCCCGCTCTCCCACTACTCTTCTTTT GGCAGCAGTG GTGGTAGTGGCGGTGGCAGCATGATGGGCGGAGAGTCTGCTGACAAGGCCACTGCGGCTG CAGCCGCTGC CTCCCTGTTGGCCAATGGGCATGACCTGGCGGCGGCCATGGCGGTGGACAAAAGCAACCC TACCTCAAAG CACAAAAGTGGTGCTGTGGCCAGCCTGCTGAGCAAGGCAGAGCGGGCCACGGAGCTGGCA GCCGAGGGAC AGCTGACGCTGCAGCAGTTTGCGCAGTCCACAGAGATGCTGAAGCGCGTGGTGCAGGAGC ATCTCCCGCT GATGAGCGAGGCGGGTGCTGGCCTGCCTGACATGGAGGCTGTGGCAGGTGCCGAAGCCCT CAATGGCCAG TCCGACTTCCCCTACCTGGGCGCTTTCCCCATCAACCCAGGCCTCTTCATTATGACCCCG GCAGGTGTGT TCCTGGCCGAGAGCGCGCTGCACATGGCGGGCCTGGCTGAGTACCCCATGCAGGGAGAGC TGGCCTCTGC CATCAGCTCCGGCAAGAAGAAGCGGAAACGCTGCGGCATGTGCGCGCCCTGCCGGCGGCG CATCAACTGC GAGCAGTGCAGCAGTTGTAGGAATCGAAAGACTGGCCATCAGATTTGCAAATTCAGAAAA TGTGAGGAAC TCAAAAAGAAGCCTTCCGCTGCTCTGGAGAAGGTGATGCTTCCGACGGGAGCCGCCTTCC GGTGGTTTCA GTGACGGCGGCGGAACCCAAAGCTGCCCTCTCCGTGCAATGTCACTGCTCGTGTGGTCTC CAGCAAGGGA TTCGGGCGAAGACAAACGGATGCACCCGTCTTTAGAACCAAAAATATTCTCTCACAGATT TCATTCCTGT TTTTATATATATATTTTTTGTTGTCGTTTTAACATCTCCACGTCCCTAGCATAAAAAGAA AAAGAAAAAA ATTTAAACTGCTTTTTCGGAAGAACAACAACAAAAAAGAGGTAAAGACGAATCTATAAAG TACCGAGACT TCCTGGGCAAAGAATGGACAATCAGTTTCCTTCCTGTGTCGATGTCGATGTTGTCTGTGC AGGAGATGCA GTTTTTGTGTAGAGAATGTAAATTTTCTGTAACCTTTTGAAATCTAGTTACTAATAAGCA CTACTGTAAT TTAGCACAGTTTAACTCCACCCTCATTTAAACTTCCTTTGATTCTTTCCGACCATGAAAT AGTGCATAGT TTGCCTGGAGAATCCACTCACGTTCATAAAGAGAATGTTGATGGCGCCGTGTAGAAGCCG CTCTGTATCC ATCCACGCGTGCAGAGCTGCCAGCAGGGAGCTCACAGAAGGGGAGGGAGCACCAGGCCAG CTGAGCTGCA CCCACAGTCCCGAGACTGGGATCCCCCACCCCAACAGTGATTTTGGAAAAAAAAATGAAA GTTCTGTTCG TTTATCCATTGCGATCTGGGGAGCCCCATCTCGATATTTCCAATCCTGGCTACTTTTCTT AGAGAAAATA AGTCCTTTTTTTCTGGCCTTGCTAATGGCAACAGAAGAAAGGGCTTCTTTGCGTGGTCCC CTGCTGGTGG GGGTGGGTCCCCAGGGGGCCCCCTGCGGCCTGGGCCCCCCTGCCCACGGCCAGCTTCCTG CTGATGAACA TGCTGTTTGTATTGTTTTAGGAAACCAGGCTGTTTTGTGAATAAAACGAATGCATGTTTG TGTCACGAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

NM_005228 CCCCGGCGCAGCGCGGCCGCAGCAGCCTCCGCCCCCCGCACGGTGTGAGCGCCCGACGCG GCCGAGGCGG 118

CCGGAGTCCCGAGCTAGCCCCGGCGGCCGCCGCCGCCCAGACCGGACGACAGGCCACCTC GTCGGCGTCC GCCCGAGTCCCCGCCTCGCCGCCAACGCCACAACCACCGCGCACGGCCCCCTGACTCCGT CCAGTATTGA TCGGGAGAGCCGGAGCGAGCTCTTCGGGGAGCAGCGATGCGACCCTCCGGGACGGCCGGG GCAGCGCTCC TGGCGCTGCTGGCTGCGCTCTGCCCGGCGAGTCGGGCTCTGGAGGAAAAGAAAGTTTGCC AAGGCACGAG TAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCTCCAGAGGATGTT CAATAACTGT GAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTC TTAAAGACCA TCCAGGAGGTGGCTGGTTATGTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGG AAAACCTGCA GAT CAT CAGAGGAAAT AT G T AC T ACGAAAAT T C C T AT GC C T T AGCAGT C T T AT C T AAC T AT GAT GC AAAT AAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACAGGAAATCCTGCATGGCGCCGTG CGGTTCAGCA ACAACCCTGCCCTGTGCAACGTGGAGAGCATCCAGTGGCGGGACATAGTCAGCAGTGACT TTCTCAGCAA CATGTCGATGGACTTCCAGAACCACCTGGGCAGCTGCCAAAAGTGTGATCCAAGCTGTCC CAATGGGAGC TGCTGGGGTGCAGGAGAGGAGAACTGCCAGAAACTGACCAAAATCATCTGTGCCCAGCAG TGCTCCGGGC GCTGCCGTGGCAAGTCCCCCAGTGACTGCTGCCACAACCAGTGTGCTGCAGGCTGCACAG GCCCCCGGGA GAGCGACTGCCTGGTCTGCCGCAAATTCCGAGACGAAGCCACGTGCAAGGACACCTGCCC CCCACTCATG CTCTACAACCCCACCACGTACCAGATGGATGTGAACCCCGAGGGCAAATACAGCTTTGGT GCCACCTGCG TGAAGAAGTGTCCCCGTAATTATGTGGTGACAGATCACGGCTCGTGCGTCCGAGCCTGTG GGGCCGACAG CTATGAGATGGAGGAAGACGGCGTCCGCAAGTGTAAGAAGTGCGAAGGGCCTTGCCGCAA AGTGTGTAAC GGAATAGGTATTGGTGAATTTAAAGACTCACTCTCCATAAATGCTACGAATATTAAACAC TTCAAAAACT GCACCTCCATCAGTGGCGATCTCCACATCCTGCCGGTGGCATTTAGGGGTGACTCCTTCA CACATACTCC TCCTCTGGATCCACAGGAACTGGATATTCTGAAAACCGTAAAGGAAATCACAGGGTTTTT GCTGATTCAG GCTTGGCCTGAAAACAGGACGGACCTCCATGCCTTTGAGAACCTAGAAATCATACGCGGC AGGACCAAGC AACATGGTCAGTTTTCTCTTGCAGTCGTCAGCCTGAACATAACATCCTTGGGATTACGCT CCCTCAAGGA GATAAGTGATGGAGATGTGATAATTTCAGGAAACAAAAATTTGTGCTATGCAAATACAAT AAACTGGAAA AAACTGTTTGGGACCTCCGGTCAGAAAACCAAAATTATAAGCAACAGAGGTGAAAACAGC TGCAAGGCCA CAGGCCAGGTCTGCCATGCCTTGTGCTCCCCCGAGGGCTGCTGGGGCCCGGAGCCCAGGG ACTGCGTCTC TTGCCGGAATGTCAGCCGAGGCAGGGAATGCGTGGACAAGTGCAACCTTCTGGAGGGTGA GCCAAGGGAG TTTGTGGAGAACTCTGAGTGCATACAGTGCCACCCAGAGTGCCTGCCTCAGGCCATGAAC ATCACCTGCA CAGGACGGGGACCAGACAACTGTATCCAGTGTGCCCACTACATTGACGGCCCCCACTGCG TCAAGACCTG CCCGGCAGGAGTCATGGGAGAAAACAACACCCTGGTCTGGAAGTACGCAGACGCCGGCCA TGTGTGCCAC CTGTGCCATCCAAACTGCACCTACGGATGCACTGGGCCAGGTCTTGAAGGCTGTCCAACG AATGGGCCTA AGATCCCGTCCATCGCCACTGGGATGGTGGGGGCCCTCCTCTTGCTGCTGGTGGTGGCCC TGGGGATCGG CCTCTTCATGCGAAGGCGCCACATCGTTCGGAAGCGCACGCTGCGGAGGCTGCTGCAGGA GAGGGAGCTT GTGGAGCCTCTTACACCCAGTGGAGAAGCTCCCAACCAAGCTCTCTTGAGGATCTTGAAG GAAACTGAAT TCAAAAAGATCAAAGTGCTGGGCTCCGGTGCGTTCGGCACGGTGTATAAGGGACTCTGGA TCCCAGAAGG TGAGAAAGTTAAAATTCCCGTCGCTATCAAGGAATTAAGAGAAGCAACATCTCCGAAAGC CAACAAGGAA ATCCTCGATGAAGCCTACGTGATGGCCAGCGTGGACAACCCCCACGTGTGCCGCCTGCTG GGCATCTGCC TCACCTCCACCGTGCAGCTCATCACGCAGCTCATGCCCTTCGGCTGCCTCCTGGACTATG TCCGGGAACA CAAAGACAATATTGGCTCCCAGTACCTGCTCAACTGGTGTGTGCAGATCGCAAAGGGCAT GAACTACTTG GAGGACCGTCGCTTGGTGCACCGCGACCTGGCAGCCAGGAACGTACTGGTGAAAACACCG CAGCATGTCA AGATCACAGATTTTGGGCTGGCCAAACTGCTGGGTGCGGAAGAGAAAGAATACCATGCAG AAGGAGGCAA AGTGCCTATCAAGTGGATGGCATTGGAATCAATTTTACACAGAATCTATACCCACCAGAG TGATGTCTGG AGCTACGGGGTGACCGTTTGGGAGTTGATGACCTTTGGATCCAAGCCATATGACGGAATC CCTGCCAGCG AGATCTCCTCCATCCTGGAGAAAGGAGAACGCCTCCCTCAGCCACCCATATGTACCATCG ATGTCTACAT GATCATGGTCAAGTGCTGGATGATAGACGCAGATAGTCGCCCAAAGTTCCGTGAGTTGAT CATCGAATTC TCCAAAATGGCCCGAGACCCCCAGCGCTACCTTGTCATTCAGGGGGATGAAAGAATGCAT TTGCCAAGTC CTACAGACTCCAACTTCTACCGTGCCCTGATGGATGAAGAAGACATGGACGACGTGGTGG ATGCCGACGA GTACCTCATCCCACAGCAGGGCTTCTTCAGCAGCCCCTCCACGTCACGGACTCCCCTCCT GAGCTCTCTG AGTGCAACCAGCAACAATTCCACCGTGGCTTGCATTGATAGAAATGGGCTGCAAAGCTGT CCCATCAAGG AAGACAGCTTCTTGCAGCGATACAGCTCAGACCCCACAGGCGCCTTGACTGAGGACAGCA TAGACGACAC CTTCCTCCCAGTGCCTGAATACATAAACCAGTCCGTTCCCAAAAGGCCCGCTGGCTCTGT GCAGAATCCT GTCTATCACAATCAGCCTCTGAACCCCGCGCCCAGCAGAGACCCACACTACCAGGACCCC CACAGCACTG CAGTGGGCAACCCCGAGTATCTCAACACTGTCCAGCCCACCTGTGTCAACAGCACATTCG ACAGCCCTGC CCACTGGGCCCAGAAAGGCAGCCACCAAATTAGCCTGGACAACCCTGACTACCAGCAGGA CTTCTTTCCC AAGGAAGCCAAGCCAAATGGCATCTTTAAGGGCTCCACAGCTGAAAATGCAGAATACCTA AGGGTCGCGC CACAAAGCAGTGAATTTATTGGAGCATGACCACGGAGGATAGTATGAGCCCTAAAAATCC AGACTCTTTC GATACCCAGGACCAAGCCACAGCAGGTCCTCCATCCCAACAGCCATGCCCGCATTAGCTC TTAGACCCAC AGACTGGTTTTGCAACGTTTACACCGACTAGCCAGGAAGTACTTCCACCTCGGGCACATT TTGGGAAGTT GCATTCCTTTGTCTTCAAACTGTGAAGCATTTACAGAAACGCATCCAGCAAGAATATTGT CCCTTTGAGC AGAAATTTATCTTTCAAAGAGGTATATTTGAAAAAAAAAAAAAGTATATGTGAGGATTTT TATTGATTGG GGATCTTGGAGTTTTTCATTGTCGCTATTGATTTTTACTTCAATGGGCTCTTCCAACAAG GAAGAAGCTT GCTGGTAGCACTTGCTACCCTGAGTTCATCCAGGCCCAACTGTGAGCAAGGAGCACAAGC CACAAGTCTT CCAGAGGATGCTTGATTCCAGTGGTTCTGCTTCAAGGCTTCCACTGCAAAACACTAAAGA TCCAAGAAGG CCTTCATGGCCCCAGCAGGCCGGATCGGTACTGTATCAAGTCATGGCAGGTACAGTAGGA TAAGCCACTC TGTCCCTTCCTGGGCAAAGAAGAAACGGAGGGGATGGAATTCTTCCTTAGACTTACTTTT GTAAAAATGT CCCCACGGTACTTACTCCCCACTGATGGACCAGTGGTTTCCAGTCATGAGCGTTAGACTG ACTTGTTTGT CTTCCATTCCATTGTTTTGAAACTCAGTATGCTGCCCCTGTCTTGCTGTCATGAAATCAG CAAGAGAGGA TGACACATCAAATAATAACTCGGATTCCAGCCCACATTGGATTCATCAGCATTTGGACCA ATAGCCCACA GCTGAGAATGTGGAATACCTAAGGATAGCACCGCTTTTGTTCTCGCAAAAACGTATCTCC TAATTTGAGG CTCAGATGAAATGCATCAGGTCCTTTGGGGCATAGATCAGAAGACTACAAAAATGAAGCT GCTCTGAAAT CTCCTTTAGCCATCACCCCAACCCCCCAAAATTAGTTTGTGTTACTTATGGAAGATAGTT TTCTCCTTTT ACTTCACTTCAAAAGCTTTTTACTCAAAGAGTATATGTTCCCTCCAGGTCAGCTGCCCCC AAACCCCCTC CTTACGCTTTGTCACACAAAAAGTGTCTCTGCCTTGAGTCATCTATTCAAGCACTTACAG CTCTGGCCAC AACAGGGCAT T T T ACAGGT GCGAATGACAGTAGCAT TAT GAGTAGT GTGGAAT TCAGGT AGTAAAT ATGA AACTAGGGTTTGAAATTGATAATGCTTTCACAACATTTGCAGATGTTTTAGAAGGAAAAA AGTTCCTTCC TAAAATAATTTCTCTACAATTGGAAGATTGGAAGATTCAGCTAGTTAGGAGCCCACCTTT TTTCCTAATC TGTGTGTGCCCTGTAACCTGACTGGTTAACAGCAGTCCTTTGTAAACAGTGTTTTAAACT CTCCTAGTCA ATATCCACCCCATCCAATTTATCAAGGAAGAAATGGTTCAGAAAATATTTTCAGCCTACA GTTATGTTCA GTCACACACACATACAAAATGTTCCTTTTGCTTTTAAAGTAATTTTTGACTCCCAGATCA GTCAGAGCCC CTACAGCATTGTTAAGAAAGTATTTGATTTTTGTCTCAATGAAAATAAAAC TATATTCATTTCCACTCTA AAAAAAAAAAAAAAAA

NM_00100586 GTTCCCGGATTTTTGTGGGCGCCTGCCCCGCCCCTCGTCCCCCTGCTGTGTCCATATATC GAGGCGATAG 119 2 GGTTAAGGGAAGGCGGACGCCTGATGGGTTAATGAGCAAACTGAAGTGTTTTCCATGATC TTTTTTGAGT

CGCAATTGAAGTACCACCTCCCGAGGGTGATTGCTTCCCCATGCGGGGTAGAACCTTTGC TGTCCTGTTC ACCACTCTACCTCCAGCACAGAATTTGGCTTATGCCTACTCAATGTGAAGATGATGAGGA TGAAAACCTT TGTGATGATCCACTTCCACTTAATGAATGGTGGCAAAGCAAAGCTATATTCAAGACCACA TGCAAAGCTA CTCCCTGAGCAAAGAGTCACAGATAAAACGGGGGCACCAGTAGAATGGCCAGGACAAACG CAGTGCAGCA CAGAGACTCAGACCCTGGCAGCCATGCCTGCGCAGGCAGTGATGAGAGTGACATGTACTG TTGTGGACAT GCACAAAAGTGAGTGTGCACCGGCACAGACATGAAGCTGCGGCTCCCTGCCAGTCCCGAG ACCCACCTGG ACATGCTCCGCCACCTCTACCAGGGCTGCCAGGTGGTGCAGGGAAACCTGGAACTCACCT ACCTGCCCAC CAATGCCAGCCTGTCCTTCCTGCAGGATATCCAGGAGGTGCAGGGCTACGTGCTCATCGC TCACAACCAA GTGAGGCAGGTCCCACTGCAGAGGCTGCGGATTGTGCGAGGCACCCAGCTCTTTGAGGAC AACTATGCCC TGGCCGTGCTAGACAATGGAGACCCGCTGAACAATACCACCCCTGTCACAGGGGCCTCCC CAGGAGGCCT GCGGGAGCTGCAGCTTCGAAGCCTCACAGAGATCTTGAAAGGAGGGGTCTTGATCCAGCG GAACCCCCAG CTCTGCTACCAGGACACGATTTTGTGGAAGGACATCTTCCACAAGAACAACCAGCTGGCT CTCACACTGA TAGACACCAACCGCTCTCGGGCCTGCCACCCCTGTTCTCCGATGTGTAAGGGCTCCCGCT GCTGGGGAGA GAGTTCTGAGGATTGTCAGAGCCTGACGCGCACTGTCTGTGCCGGTGGCTGTGCCCGCTG CAAGGGGCCA CTGCCCACTGACTGCTGCCATGAGCAGTGTGCTGCCGGCTGCACGGGCCCCAAGCACTCT GACTGCCTGG CCTGCCTCCACTTCAACCACAGTGGCATCTGTGAGCTGCACTGCCCAGCCCTGGTCACCT ACAACACAGA CACGTTTGAGTCCATGCCCAATCCCGAGGGCCGGTATACATTCGGCGCCAGCTGTGTGAC TGCCTGTCCC TACAACTACCTTTCTACGGACGTGGGATCCTGCACCCTCGTCTGCCCCCTGCACAACCAA GAGGTGACAG CAGAGGATGGAACACAGCGGTGTGAGAAGTGCAGCAAGCCCTGTGCCCGAGTGTGCTATG GTCTGGGCAT GGAGCACTTGCGAGAGGTGAGGGCAGTTACCAGTGCCAATATCCAGGAGTTTGCTGGCTG CAAGAAGATC TTTGGGAGCCTGGCATTTCTGCCGGAGAGCTTTGATGGGGACCCAGCCTCCAACACTGCC CCGCTCCAGC CAGAGCAGCTCCAAGTGTTTGAGACTCTGGAAGAGATCACAGGTTACCTATACATCTCAG CATGGCCGGA CAGCCTGCCTGACCTCAGCGTCTTCCAGAACCTGCAAGTAATCCGGGGACGAATTCTGCA CAATGGCGCC TACTCGCTGACCCTGCAAGGGCTGGGCATCAGCTGGCTGGGGCTGCGCTCACTGAGGGAA CTGGGCAGTG GACTGGCCCTCATCCACCATAACACCCACCTCTGCTTCGTGCACACGGTGCCCTGGGACC AGCTCTTTCG GAACCCGCACCAAGCTCTGCTCCACACTGCCAACCGGCCAGAGGACGAGTGTGTGGGCGA GGGCCTGGCC TGCCACCAGCTGTGCGCCCGAGGGCACTGCTGGGGTCCAGGGCCCACCCAGTGTGTCAAC TGCAGCCAGT TCCTTCGGGGCCAGGAGTGCGTGGAGGAATGCCGAGTACTGCAGGGGCTCCCCAGGGAGT ATGTGAATGC CAGGCACTGTTTGCCGTGCCACCCTGAGTGTCAGCCCCAGAATGGCTCAGTGACCTGTTT TGGACCGGAG GCTGACCAGTGTGTGGCCTGTGCCCACTATAAGGACCCTCCCTTCTGCGTGGCCCGCTGC CCCAGCGGTG TGAAACCTGACCTCTCCTACATGCCCATCTGGAAGTTTCCAGATGAGGAGGGCGCATGCC AGCCTTGCCC CATCAACTGCACCCACTCCTGTGTGGACCTGGATGACAAGGGCTGCCCCGCCGAGCAGAG AGCCAGCCCT CTGACGTCCATCATCTCTGCGGTGGTTGGCATTCTGCTGGTCGTGGTCTTGGGGGTGGTC TTTGGGATCC TCATCAAGCGACGGCAGCAGAAGATCCGGAAGTACACGATGCGGAGACTGCTGCAGGAAA CGGAGCTGGT GGAGCCGCTGACACCTAGCGGAGCGATGCCCAACCAGGCGCAGATGCGGATCCTGAAAGA GACGGAGCTG AGGAAGGTGAAGGTGCTTGGATCTGGCGCTTTTGGCACAGTCTACAAGGGCATCTGGATC CCTGATGGGG AGAATGTGAAAATTCCAGTGGCCATCAAAGTGTTGAGGGAAAACACATCCCCCAAAGCCA ACAAAGAAAT CTTAGACGAAGCATACGTGATGGCTGGTGTGGGCTCCCCATATGTCTCCCGCCTTCTGGG CATCTGCCTG ACATCCACGGTGCAGCTGGTGACACAGCTTATGCCCTATGGCTGCCTCTTAGACCATGTC CGGGAAAACC GCGGACGCCTGGGCTCCCAGGACCTGCTGAACTGGTGTATGCAGATTGCCAAGGGGATGA GCTACCTGGA GGATGTGCGGCTCGTACACAGGGACTTGGCCGCTCGGAACGTGCTGGTCAAGAGTCCCAA CCATGTCAAA ATTACAGACTTCGGGCTGGCTCGGCTGCTGGACATTGACGAGACAGAGTACCATGCAGAT GGGGGCAAGG TGCCCATCAAGTGGATGGCGCTGGAGTCCATTCTCCGCCGGCGGTTCACCCACCAGAGTG ATGTGTGGAG TTATGGTGTGACTGTGTGGGAGCTGATGACTTTTGGGGCCAAACCTTACGATGGGATCCC AGCCCGGGAG ATCCCTGACCTGCTGGAAAAGGGGGAGCGGCTGCCCCAGCCCCCCATCTGCACCATTGAT GTCTACATGA TCATGGTCAAATGTTGGATGATTGACTCTGAATGTCGGCCAAGATTCCGGGAGTTGGTGT CTGAATTCTC CCGCATGGCCAGGGACCCCCAGCGCTTTGTGGTCATCCAGAATGAGGACTTGGGCCCAGC CAGTCCCTTG GACAGCACCTTCTACCGCTCACTGCTGGAGGACGATGACATGGGGGACCTGGTGGATGCT GAGGAGTATC TGGTACCCCAGCAGGGCTTCTTCTGTCCAGACCCTGCCCCGGGCGCTGGGGGCATGGTCC ACCACAGGCA CCGCAGCTCATCTACCAGGAGTGGCGGTGGGGACCTGACACTAGGGCTGGAGCCCTCTGA AGAGGAGGCC CCCAGGTCTCCACTGGCACCCTCCGAAGGGGCTGGCTCCGATGTATTTGATGGTGACCTG GGAATGGGGG CAGCCAAGGGGCTGCAAAGCCTCCCCACACATGACCCCAGCCCTCTACAGCGGTACAGTG AGGACCCCAC AGTACCCCTGCCCTCTGAGACTGATGGCTACGTTGCCCCCCTGACCTGCAGCCCCCAGCC TGAATATGTG AACCAGCCAGATGTTCGGCCCCAGCCCCCTTCGCCCCGAGAGGGCCCTCTGCCTGCTGCC CGACCTGCTG GTGCCACTCTGGAAAGGCCCAAGACTCTCTCCCCAGGGAAGAATGGGGTCGTCAAAGACG TTTTTGCCTT TGGGGGTGCCGTGGAGAACCCCGAGTACTTGACACCCCAGGGAGGAGCTGCCCCTCAGCC CCACCCTCCT CCTGCCTTCAGCCCAGCCTTCGACAACCTCTATTACTGGGACCAGGACCCACCAGAGCGG GGGGCTCCAC CCAGCACCTTCAAAGGGACACCTACGGCAGAGAACCCAGAGTACCTGGGTCTGGACGTGC CAGTGTGAAC CAGAAGGCCAAGTCCGCAGAAGCCCTGATGTGTCCTCAGGGAGCAGGGAAGGCCTGACTT CTGCTGGCAT CAAGAGGTGGGAGGGCCCTCCGACCACTTCCAGGGGAACCTGCCATGCCAGGAACCTGTC CTAAGGAACC TTCCTTCCTGCTTGAGTTCCCAGATGGCTGGAAGGGGTCCAGCCTCGTTGGAAGAGGAAC AGCACTGGGG AGTCTTTGTGGATTCTGAGGCCCTGCCCAATGAGACTCTAGGGTCCAGTGGATGCCACAG CCCAGCTTGG CCCTTTCCTTCCAGATCCTGGGTACTGAAAGCCTTAGGGAAGCTGGCCTGAGAGGGGAAG CGGCCCTAAG GGAGTGTCTAAGAACAAAAGCGACCCATTCAGAGACTGTCCCTGAAACCTAGTACTGCCC CCCATGAGGA AGGAACAGCAATGGTGTCAGTATCCAGGCTTTGTACAGAGTGCTTTTCTGTTTAGTTTTT ACTTTTTTTG TTTTGTTTTTTTAAAGATGAAATAAAGACCCAGGGGGAGAATGGGTGTTGTATGGGGAGG CAAGTGTGGG GGGTCCTTCTCCACACCCACTTTGTCCATTTGCAAATATATTTTGGAAAACAGCTA

NM_00112274 ATGGTCATAACAGCCTCCTGTCTACCGACTCAGAACGGATTTTACCAAAACTGAAAATGC AGGCTCCATG 120 2 CTCAGAAGCTCTTTAACAGGCTCGAAAGGTCCATGCTCCTTTCTCCTGCCCATTCTATAG CATAAGAAGA

CAGTC T C T GAGT GAT AAT CTTCTCTT CAAGAAGAAGAAAAC T AGGAAGGAG TAAGCACAAAGAT C T C T T C ACATTCTCCGGGACTGCGGTACCAAATATCAGCACAGCACTTCTTGAAAAAGGATGTAGA TTTTAATCTG AACTTTGAACCATCACTGAGGTGGCCCGCCGGTTTCTGAGCCTTCTGCCCTGCGGGGACA CGGTCTGCAC CCTGCCCGCGGCCACGGACCATGACCATGACCCTCCACACCAAAGCATCTGGGATGGCCC TACTGCATCA GATCCAAGGGAACGAGCTGGAGCCCCTGAACCGTCCGCAGCTCAAGATCCCCCTGGAGCG GCCCCTGGGC GAGGTGTACCTGGACAGCAGCAAGCCCGCCGTGTACAACTACCCCGAGGGCGCCGCCTAC GAGTTCAACG CCGCGGCCGCCGCCAACGCGCAGGTCTACGGTCAGACCGGCCTCCCCTACGGCCCCGGGT CTGAGGCTGC GGCGTTCGGCTCCAACGGCCTGGGGGGTTTCCCCCCACTCAACAGCGTGTCTCCGAGCCC GCTGATGCTA CTGCACCCGCCGCCGCAGCTGTCGCCTTTCCTGCAGCCCCACGGCCAGCAGGTGCCCTAC TACCTGGAGA ACGAGCCCAGCGGCTACACGGTGCGCGAGGCCGGCCCGCCGGCATTCTACAGGCCAAATT CAGATAATCG ACGCCAGGGTGGCAGAGAAAGATTGGCCAGTACCAATGACAAGGGAAGTATGGCTATGGA ATCTGCCAAG GAGACTCGCTACTGTGCAGTGTGCAATGACTATGCTTCAGGCTACCATTATGGAGTCTGG TCCTGTGAGG GCTGCAAGGCCTTCTTCAAGAGAAGTATTCAAGGACATAACGACTATATGTGTCCAGCCA CCAACCAGTG CACCATTGATAAAAACAGGAGGAAGAGCTGCCAGGCCTGCCGGCTCCGCAAATGCTACGA AGTGGGAATG ATGAAAGGTGGGATACGAAAAGACCGAAGAGGAGGGAGAATGTTGAAACACAAGCGCCAG AGAGATGATG GGGAGGGCAGGGGTGAAGTGGGGTCTGCTGGAGACATGAGAGCTGCCAACCTTTGGCCAA GCCCGCTCAT GATCAAACGCTCTAAGAAGAACAGCCTGGCCTTGTCCCTGACGGCCGACCAGATGGTCAG TGCCTTGTTG GATGCTGAGCCCCCCATACTCTATTCCGAGTATGATCCTACCAGACCCTTCAGTGAAGCT TCGATGATGG GCTTACTGACCAACCTGGCAGACAGGGAGCTGGTTCACATGATCAACTGGGCGAAGAGGG TGCCAGGCTT TGTGGATTTGACCCTCCATGATCAGGTCCACCTTCTAGAATGTGCCTGGCTAGAGATCCT GATGATTGGT CTCGTCTGGCGCTCCATGGAGCACCCAGGGAAGCTACTGTTTGCTCCTAACTTGCTCTTG GACAGGAACC AGGGAAAATGTGTAGAGGGCATGGTGGAGATCTTCGACATGCTGCTGGCTACATCATCTC GGTTCCGCAT GATGAATCTGCAGGGAGAGGAGTTTGTGTGCCTCAAATCTATTATTTTGCTTAATTCTGG AGTGTACACA TTTCTGTCCAGCACCCTGAAGTCTCTGGAAGAGAAGGACCATATCCACCGAGTCCTGGAC AAGATCACAG ACACTTTGATCCACCTGATGGCCAAGGCAGGCCTGACCCTGCAGCAGCAGCACCAGCGGC TGGCCCAGCT CCTCCTCATCCTCTCCCACATCAGGCACATGAGTAACAAAGGCATGGAGCATCTGTACAG CATGAAGTGC AAGAACGTGGTGCCCCTCTATGACCTGCTGCTGGAGATGCTGGACGCCCACCGCCTACAT GCGCCCACTA GCCGTGGAGGGGCATCCGTGGAGGAGACGGACCAAAGCCACTTGGCCACTGCGGGCTCTA CTTCATCGCA TTCCTTGCAAAAGTATTACATCACGGGGGAGGCAGAGGGTTTCCCTGCCACGGTCTGAGA GCTCCCTGGC TCCCACACGGTTCAGATAATCCCTGCTGCATTTTACCCTCATCATGCACCACTTTAGCCA AATTCTGTCT CCTGCATACACTCCGGCATGCATCCAACACCAATGGCTTTCTAGATGAGTGGCCATTCAT TTGCTTGCTC AGTTCTTAGTGGCACATCTTCTGTCTTCTGTTGGGAACAGCCAAAGGGATTCCAAGGCTA AATCTTTGTA ACAGCTCTCTTTCCCCCTTGCTATGTTACTAAGCGTGAGGATTCCCGTAGCTCTTCACAG CTGAACTCAG TCTATGGGTTGGGGCTCAGATAACTCTGTGCATTTAAGCTACTTGTAGAGACCCAGGCCT GGAGAGTAGA CATTTTGCCTCTGATAAGCACTTTTTAAATGGCTCTAAGAATAAGCCACAGCAAAGAATT TAAAGTGGCT CCTTTAATTGGTGACTTGGAGAAAGCTAGGTCAAGGGTTTATTATAGCACCCTCTTGTAT TCCTATGGCA ATGCATCCTTTTATGAAAGTGGTACACCTTAAAGCTTTTATATGACTGTAGCAGAGTATC TGGTGATTGT CAATTCATTCCCCCTATAGGAATACAAGGGGCACACAGGGAAGGCAGATCCCCTAGTTGG CAAGACTATT TTAACTTGATACACTGCAGATTCAGATGTGCTGAAAGCTCTGCCTCTGGCTTTCCGGTCA TGGGTTCCAG TTAATTCATGCCTCCCATGGACCTATGGAGAGCAGCAAGTTGATCTTAGTTAAGTCTCCC TATATGAGGG ATAAGTTCCTGATTTTTGTTTTTATTTTTGTGTTACAAAAGAAAGCCCTCCCTCCCTGAA CTTGCAGTAA GGTCAGCTTCAGGACCTGTTCCAGTGGGCACTGTACTTGGATCTTCCCGGCGTGTGTGTG CCTTACACAG GGGTGAACTGTTCACTGTGGTGATGCATGATGAGGGTAAATGGTAGTTGAAAGGAGCAGG GGCCCTGGTG TTGCATTTAGCCCTGGGGCATGGAGCTGAACAGTACTTGTGCAGGATTGTTGTGGCTACT AGAGAACAAG AGGGAAAGTAGGGCAGAAACTGGATACAGTTCTGAGGCACAGCCAGACTTGCTCAGGGTG GCCCTGCCAC AGGCTGCAGCTACCTAGGAACATTCCTTGCAGACCCCGCATTGCCCTTTGGGGGTGCCCT GGGATCCCTG GGGTAGTCCAGCTCTTCTTCATTTCCCAGCGTGGCCCTGGTTGGAAGAAGCAGCTGTCAC AGCTGCTGTA GACAGCTGTGTTCCTACAATTGGCCCAGCACCCTGGGGCACGGGAGAAGGGTGGGGACCG TTGCTGTCAC TACTCAGGCTGACTGGGGCCTGGTCAGATTACGTATGCCCTTGGTGGTTTAGAGATAATC CAAAATCAGG GTTTGGTTTGGGGAAGAAAATCCTCCCCCTTCCTCCCCCGCCCCGTTCCCTACCGCCTCC ACTCCTGCCA GCTCATTTCCTTCAATTTCCTTTGACCTATAGGCTAAAAAAGAAAGGCTCATTCCAGCCA CAGGGCAGCC TTCCCTGGGCCTTTGCTTCTCTAGCACAATTATGGGTTACTTCCTTTTTCTTAACAAAAA AGAATGTTTG ATTTCCTCTGGGTGACCTTATTGTCTGTAATTGAAACCCTATTGAGAGGTGATGTCTGTG TTAGCCAATG ACCCAGGTGAGCTGCTCGGGCTTCTCTTGGTATGTCTTGTTTGGAAAAGTGGATTTCATT CATTTCTGAT TGTCCAGTTAAGTGATCACCAAAGGACTGAGAATCTGGGAGGGCAAAAAAAAAAAAAAAG TTTTTATGTG CACTTAAATTTGGGGACAATTTTATGTATCTGTGTTAAGGATATGTTTAAGAACATAATT CTTTTGTTGC TGTTTGTTTAAGAAGCACCTTAGTTTGTTTAAGAAGCACCTTATATAGTATAATATATAT TTTTTTGAAA TTACATTGCTTGTTTATCAGACAATTGAATGTAGTAATTCTGTTCTGGATTTAATTTGAC TGGGTTAACA TGCAAAAACCAAGGAAAAATATTTAGTTTTTTTTTTTTTTTTTGTATACTTTTCAAGCTA CCTTGTCATG TATACAGTCATTTATGCCTAAAGCCTGGTGATTATTCATTTAAATGAAGATCACATTTCA TATCAACTTT TGTATCCACAGTAGACAAAATAGCACTAATCCAGATGCCTATTGTTGGATACTGAATGAC AGACAATCTT ATGTAGCAAAGATTATGCCTGAAAAGGAAAATTATTCAGGGCAGCTAATTTTGCTTTTAC CAAAATATCA GTAGTAATATTTTTGGACAGTAGCTAATGGGTCAGTGGGTTCTTTTTAATGTTTATACTT AGATTTTCTT T TAAAAAAAT TAAAATAAAACAAAAAAAAAT T T CTAGGACTAGACGATGTAAT ACCAGC TAAAGCCAAAC AATTATACAGTGGAAGGTTTTACATTATTCATCCAATGTGTTTCTATTCATGTTAAGATA CTACTACATT TGAAGTGGGCAGAGAACATCAGATGATTGAAATGTTCGCCCAGGGGTCTCCAGCAACTTT GGAAATCTCT TTGTATTTTTACTTGAAGTGCCACTAATGGACAGCAGATATTTTCTGGCTGATGTTGGTA TTGGGTGTAG GAACATGATTTAAAAAAAAACTCTTGCCTCTGCTTTCCCCCACTCTGAGGCAAGTTAAAA TGTAAAAGAT GTGATTTATCTGGGGGGCTCAGGTATGGTGGGGAAGTGGATTCAGGAATCTGGGGAATGG CAAATATATT AAGAAGAGTATTGAAAGTATTTGGAGGAAAATGGTTAATTCTGGGTGTGCACCAGGGTTC AGTAGAGTCC ACTTCTGCCCTGGAGACCACAAATCAACTAGCTCCATTTACAGCCATTTCTAAAATGGCA GCTTCAGTTC TAGAGAAGAAAGAACAACATCAGCAGTAAAGTCCATGGAATAGCTAGTGGTCTGTGTTTC TTTTCGCCAT TGCCTAGCTTGCCGTAATGATTCTATAATGCCATCATGCAGCAATTATGAGAGGCTAGGT CATCCAAAGA GAAGACCCTATCAATGTAGGTTGCAAAATCTAACCCCTAAGGAAGTGCAGTCTTTGATTT GATTTCCCTA GTAACCTTGCAGATATGTTTAACCAAGCCATAGCCCATGCCTTTTGAGGGCTGAACAAAT AAGGGACTTA CTGATAATT TACT TTTGATCACATTAAGGTGTTCTCACCTTGAAATCT TAT ACACTGAAATGGCCATTGA TTTAGGCCACTGGCTTAGAGTACTCCTTCCCCTGCATGACACTGATTACAAATACTTTCC TATTCATACT TTCCAATTATGAGATGGACTGTGGGTACTGGGAGTGATCACTAACACCATAGTAATGTCT AATATTCACA GGCAGATCTGCTTGGGGAAGCTAGTTATGTGAAAGGCAAATAGAGTCATACAGTAGCTCA AAAGGCAACC ATAATTCTCTTTGGTGCAGGTCTTGGGAGCGTGATCTAGATTACACTGCACCATTCCCAA GTTAATCCCC TGAAAACTTACTCTCAACTGGAGCAAATGAACTTTGGTCCCAAATATCCATCTTTTCAGT AGCGTTAATT ATGCTCTGTTTCCAACTGCATTTCCTTTCCAATTGAATTAAAGTGTGGCCTCGTTTTTAG TCATTTAAAA TTGTTTTCTAAGTAATTGCTGCCTCTATTATGGCACTTCAATTTTGCACTGTCTTTTGAG ATTCAAGAAA AATTTCTATTCTTTTTTTTGCATCCAATTGTGCCTGAACTTTTAAAATATGTAAATGCTG CCATGTTCCA AACCCATCGTCAGTGTGTGTGTTTAGAGCTGTGCACCCTAGAAACAACATATTGTCCCAT GAGCAGGTGC CTGAGACACAGACCCCTTTGCATTCACAGAGAGGTCATTGGTTATAGAGACTTGAATTAA TAAGTGACAT TATGCCAGTTTCTGTTCTCTCACAGGTGATAAACAATGCTTTTTGTGCACTACATACTCT TCAGTGTAGA GCTCTTGTTTTATGGGAAAAGGCTCAAATGCCAAATTGTGTTTGATGGATTAATATGCCC TTTTGCCGAT GCATAC TAT TACT GATGTGACTCGGTTTTGTCGCAGCTTTGCTTTGTTTAATGAAACACACTTGTAAACC TCTTTTGCACTTTGAAAAAGAATCCAGCGGGATGCTCGAGCACCTGTAAACAATTTTCTC AACCTATTTG AT GT T C AAAT AAAGAAT T AAAC T AAA NM_130398 AAATTGAAAGGTCAGCCTTTCGCGCGCTGTGTAGGCAAGTTACCCGTGTTCTGCGTTGCC GGCCGTGGGT 121 GCTCTGGCCACAGTGAGTTAGGGGCGTCGGAGCGGGTTTCTCCAACCGCAATCGGCTCCG CTCAAGGGGA GGAGGAGAGTCCCTTCTCGGAAGGCCTAAGGAAACGTGTCGTCTGGAATGGGCTTGGGGG CCACGCCTGC ACATCTCCGCGAGACAGAGGGATAAAGTGAAGATGGTGCTGTTATTGTTACCTCGAGTGC CACATGCGAC C T C T GAGAT AT GT ACACAG TCATTCTTACTATC GCAC TCAGCCATTCTTAC TACGC T AAAGAAGAAAT AA TTATTCGAGGATATTTGCCTGGCCCAGAAGAAACTTATGTAAATTTCATGAACTATTATA TCCGTTTTCC TCGGAGTGAGAGAAAACTCTTTTTAGATATCATCTGAGAGAACTAGTGAATCCCAGTCAC TGAGTGGAGT T GAGAG T C T AAGAACC T C T GAAAT T T GAGAAC T GC T GGACCAGAGC CTTTAGAGCTCT GAT AAGGT GT CA ACAGGGTAGTTAATTTGGCACCATGGGGATACAGGGATTGCTACAATTTATCAAAGAAGC TTCAGAACCC ATCCATGTGAGGAAGTATAAAGGGCAGGTAGTAGCTGTGGATACATATTGCTGGCTTCAC AAAGGAGCTA TTGCTTGTGCTGAAAAACTAGCCAAAGGTGAACCTACTGATAGGTATGTAGGATTTTGTA TGAAATTTGT AAATATGTTACTATCTCATGGGATCAAGCCTATTCTCGTATTTGATGGATGTACTTTACC TTCTAAAAAG GAAGTAGAGAGATCTAGAAGAGAAAGACGACAAGCCAATCTTCTTAAGGGAAAGCAACTT CTTCGTGAGG GGAAAGTCTCGGAAGCTCGAGAGTGTTTCACCCGGTCTATCAATATCACACATGCCATGG CCCACAAAGT AATTAAAGCTGCCCGGTCTCAGGGGGTAGATTGCCTCGTGGCTCCCTATGAAGCTGATGC GCAGTTGGCC TATCTTAACAAAGCGGGAATTGTGCAAGCCATAATTACAGAGGACTCGGATCTCCTAGCT TTTGGCTGTA AAAAGGTAATTTTAAAGATGGACCAGTTTGGAAATGGACTTGAAATTGATCAAGCTCGGC TAGGAATGTG CAGACAGCTTGGGGATGTATTCACGGAAGAGAAGTTTCGTTACATGTGTATTCTTTCAGG TTGTGACTAC CTGTCATCACTGCGTGGGATTGGATTAGCAAAGGCATGCAAAGTCCTAAGACTAGCCAAT AATCCAGATA TAGTAAAGGTTATCAAGAAAATTGGACATTATCTCAAGATGAATATCACGGTACCAGAGG ATTACATCAA CGGGTTTATTCGGGCCAACAATACCTTCCTCTATCAGCTAGTTTTTGATCCCATCAAAAG GAAACTTATT CCTCTGAACGCCTATGAAGATGATGTTGATCCTGAAACACTAAGCTACGCTGGGCAATAT GTTGATGATT CCATAGCTCTT CAAAT AGC AC T T GGAAAT AAAGAT AT AAAT AC T T T T GAAC AGAT CGAT GAC TACAATCC AGACACTGCTATGCCTGCCCATTCAAGAAGTCATAGTTGGGATGACAAAACATGTCAAAA GTCAGCTAAT GTTAGCAGCATTTGGCATAGGAATTACTCTCCCAGACCAGAGTCGGGTACTGTTTCAGAT GCCCCACAAT TGAAGGAAAATCCAAGTACTGTGGGAGTGGAACGAGTGATTAGTACTAAAGGGTTAAATC TCCCAAGGAA ATCATCCATTGTGAAAAGACCAAGAAGTGCAGAGCTGTCAGAAGATGACCTGTTGAGTCA GTATTCTCTT TCATTTACGAAGAAGACCAAGAAAAATAGCTCTGAAGGCAATAAATCATTGAGCTTTTCT GAAGTGTTTG TGCCTGACCTGGTAAATGGACCTACTAACAAAAAGAGTGTAAGCACTCCACCTAGGACGA GAAATAAATT TGCAACATTTTTACAAAGGAAAAATGAAGAAAGTGGTGCAGTTGTGGTTCCAGGGACCAG AAGCAGGTTT TTTTGCAGTTCAGATTCTACTGACTGTGTATCAAACAAAGTGAGCATCCAGCCTCTGGAT GAAACTGCTG TCACAGATAAAGAGAACAATCTGCATGAATCAGAGTATGGAGACCAAGAAGGCAAGAGAC TGGTTGACAC AGATGTAGCACGTAATTCAAGTGATGACATTCCGAATAATCATATTCCAGGTGATCATAT TCCAGACAAG GCAACAGTGTTTACAGATGAAGAGTCCTACTCTTTTGAGAGCAGCAAATTTACAAGGACC ATTTCACCAC CCACTTTGGGAACACTAAGAAGTTGTTTTAGTTGGTCTGGAGGTCTTGGAGATTTTTCAA GAACGCCGAG CCCCTCTCCAAGCACAGCATTGCAGCAGTTCCGAAGAAAGAGCGATTCCCCCACCTCTTT GCCTGAGAAT AATATGTCTGATGTGTCGCAGTTAAAGAGCGAGGAGTCCAGTGACGATGAGTCTCATCCC TTACGAGAAG AGGCATGTTCTTCACAGTCCCAGGAAAGTGGAGAATTCTCACTGCAGAGTTCAAATGCAT CAAAGCTTTC TCAGTGCTCTAGTAAGGACTCTGATTCAGAGGAATCTGATTGCAATATTAAGTTACTTGA CAGTCAAAGT GACCAGACCTCCAAGCTACGTTTATCTCATTTCTCAAAAAAAGACACACCTCTAAGGAAC AAGGTTCCTG GGCTATATAAGTCCAGTTCTGCAGACTCTCTTTCTACAACCAAGATCAAACCTCTAGGAC CTGCCAGAGC CAGTGGGCTGAGCAAGAAGCCGGCAAGCATCCAGAAGAGAAAGCATCATAATGCCGAGAA CAAGCCGGGG TTACAGATCAAACTCAATGAGCTCTGGAAAAACTTTGGATTTAAAAAAGATTCTGAAAAG CTTCCTCCTT GTAAGAAACCCCTGTCCCCAGTCAGAGATAACATCCAACTAACTCCAGAAGCGGAAGAGG ATATATTTAA CAAACCTGAATGTGGCCGTGTTCAAAGAGCAATATTCCAGTAAATGCAGACTGCTGCAAA GCTTTTGCCT GCAAGAGAATCTGATCAATTTGAAGTCCCTGTTTGGGAATGAGGCACTTATCAGCATGAA GAATTTTTTC TCATTCTGTGCCATTTTAAAAATAGAATACATTTTGTATATTAACTTTATAATTGGGTTG TGGTTTTTTT GCTCAGCTTTTTATATTTTTATAAGAAGCTAAATAGAAGAATAATTGTATCTCTGACAGG TTTTTGGAGG TTTTAGTGTTAATTGGGAAAATCCTCTGGAGTTTATAAAAGTCTACTCTAAATATTTCTG TAATGTTGTC AAGTAGAAAGATAGTAAATGGAGAAACTACAAAAAAAAAAAAAAAAAA

AB 209631 CCATGACCTGCCTTGAGAAGGGGCAGGGGAAGCCAGATGGACTGGAAGTGGAGTGGCAGT GACCAAGGAG 122

GAGGAGGTGTGATAGGCTTCCCACGCAGGGTAGATCCAGAGACACCAGTGCCACCCATAG GCCCCTAGGA CTGCAGTGGTCACCCGATTCCTTTGTCCCAGCTGAGACTCAGTTCTGAGTGTTCTATTTT GGGGAACAGA GGCGTCCTTGGTAGCATTTGGAAGAGGATAGCCAGCTGGGGTGTGTGTACATCACAGCCT GACAGTAACA GCATCCGAACCAGAGGTGACTGGCTAAGGGCAGACCCAGGGCAACAGGTTAACCGTTCTA GGGCCGGGCA CAGGGAGGAGAACATTCCAACACTCTGTGTGCCCAGTGCCGACGCACGTTCTCTCTTTTA TCCTCAAAAC AGTCCTATGAGGATATAAGCCAGAGAGAGACAGAGACAAGGAATTACAAGTTGGTGAGAG TCAGGATTTG AACTTGGCTCTGGCAGATGGAAAATTAGGGTCTGTATTCTTTACAAAACCGTGTGTGCCT CAGATGGAGT TGGTGCATAACAAGCAGAGGTATCCAGGGTCGCGGTCCTGCTTGCCACGGAAGGGGCCGC CTTGTCAGTT GTGACCACCCAGCCCTGGAAATGTCAGTAATGCTGTAAGGAGTGGGGATCGGATCAGATG CCATCCAGAT GCTGAAGTTTGACCTTGTGTCATTTTTCACTTTCTTTTTTGGCTCTTCTGCAATCAATTC ATTTATTTAG CAAAAAAGAAATTATGTGTGCCGAGAGCATGCAGAAGATATGTCTCCGTTCTCTGCTTCC CTCCAAAAAA GAATCCCAAAACTGCTTTCTGTGAACGTGTGCCAGGGTCCCAGCAGGACTCAGGGAGAGC AGGAAGCCCA GCCCAGACCCCTTGCACAACCTACCGTGGGGAGGCCTTAGGCTCTGGCTAC TACAGAGCTGGTTCCAGTC TGCACTGCCACAGCCTGGCCAGGGACTTGGACACATCTGCTGGCCACTTCCTGTCTCAGT TTCCTTATCT GCAAAATAAGGGAAAAGCCCCCACAAAGGTGCACGTGTAGCAGGAGCTCTTTTCCCTCCC TATTTTAGGA AGGCAGTTGGTGGGAAGTCCAGCTTGGGTCCCTGAGAGCTGTGAGAAGGAGATGCGGCTG CTGCTGGCCC TGTTGGGGGTCCTGCTGAGTGTGCCTGGGCCTCCAGTCTTGTCCCTGGAGGCCTCTGAGG AAGTGGAGCT TGGTATGGCTTCTGAGGTGGGAGAGGGTGGCAGGGGTGGGAAGAGTGGGCACCAGGAGGG GGCTGCTGGG CTGAGCAAAGCTGGAAAGGATCCTTGCCCAGGCCCTGAGAAGGTGGCGGCAGGGCAGGGC TCAACCACTG AGACTCAGTCAGTGCCTGGCTTCCAGCAAGCATTCATCTATCACTGTGTCTGCGAGAGAG GACTGGCCTT GCAGGGCGCAGGGCCCTAAGCTGGGCTGCAGAGCTGGTGGTGAGCTCCTTGCCTGGGTGT GTGTGCGTGT GTGTGTGTGTTCTGTGCACTGGGTGTGTGACCTAGGAGGTCCAGGCAGCATGTGTGGTAT AAGCATTATG AGGGTGATATGCCCCGGTGCAGCATGACCCTGTATGTGGCACCAACAGCATGTGCCTTGT GTGTGTGTGT GTCCGTATGTGTGTGTGTGTATGCGTGTGTGTGTGTGTGTGTGTGTGTCTTGGCCACTGT CATGTGCACT AAATGCTGTGTGTGTGACATGCCCCAAGAGTGTGGCATTTGCCCTGGGTGTGGCATCCGC AGCATGTGGC TGTGTGGGTGTCAAGGAGTGGTGGCTCCTTCAGCATGCGTTGCGAAGTGCTTGTGCCCTG CATGTGCGGT GTGTTCTCTGTACACAGGAGGCTGCCTCAGATGGGGCTGCGGGGTCTGCTGACCTCTGCC CTCTGCCCAC AGAGCCCTGCCTGGCTCCCAGCCTGGAGCAGCAAGAGCAGGAGCTGACAGTAGCCCTTGG GCAGCCTGTG CGGCTGTGCTGTGGGCGGGCTGAGCGTGGTGGCCACTGGTACAAGGAGGGCAGTCGCCTG GCACCTGCTG GCCGTGTACGGGGCTGGAGGGGCCGCCTAGAGATTGCCAGCTTCCTACCTGAGGATGCTG GCCGCTACCT CTGCCTGGCACGAGGCTCCATGATCGTCCTGCAGAATCTCACCTTGATTACAGGTGACTC CTTGACCTCC AGCAACGATGATGAGGACCCCAAGTCCCATAGGGACCTCTCGAATAGGCACAGTTACCCC CAGCAAGGTC AGTAGGTCTCCAAGGACTTGTGTCCCCGCTGCTGCTCATCTGATCACTGAGAAGAGGAGG CCTGTGTGGG AACACACGGTCATTCTAGGGGCCTTCCCCTGCCCTCCAGCACCCTACTGGACACACCCCC AGCGCATGGA GAAGAAACTGCATGCAGTACCTGCGGGGAACACCGTCAAGTTCCGCTGTCCAGCTGCAGG CAACCCCACG CCCACCATCCGCTGGCTTAAGGATGGACAGGCCTTTCATGGGGAGAACCGCATTGGAGGC ATTCGGCTGC GCCATCAGCACTGGAGTCTCGTGATGGAGAGCGTGGTGCCCTCGGACCGCGGCACATACA CCTGCCTGGT AGAGAACGCTGTGGGCAGCATCCGTTATAACTACCTGCTAGATGTGCTGGAGCGGTCCCC GCACCGGCCC ATCCTGCAGGCCGGGCTCCCGGCCAACACCACAGCCGTGGTGGGCAGCGACGTGGAGCTG CTGTGCAAGG TGTACAGCGATGCCCAGCCCCACATCCAGTGGCTGAAGCACATCGTCATCAACGGCAGCA GCTTCGGAGC CGACGGTTTCCCCTATGTGCAAGTCCTAAAGACTGCAGACATCAATAGCTCAGAGGTGGA GGTCCTGTAC CTGCGGAACGTGTCAGCCGAGGACGCAGGCGAGTACACCTGCCTCGCAGGCAATTCCATC GGCCTCTCCT ACCAGTCTGCCTGGCTCACGGTGCTGCCAGGTGAGCACCTGAAGGGCCAGGAGATGCTGC GAGATGCCCC TCTGGGCCAGCAGTGGGGGCTGTGGCCTGTTGGGTGGTCAGTCTCTGTTGGCCTGTGGGG TCTGGCCTGG GGGGCAGTGTGTGGATTTGTGGGTTTGAGCTGTATGACAGCCCCTCTGTGCCTCTCCACA CGTGGCCGTC CATGTGACCGTCTGCTGAGGTGTGGGTGCCTGGGACTGGGCATAACTACAGCTTCCTCCG TGTGTGTCCC CACATATGTTGGGAGCTGGGAGGGACTGAGTTAGGGTGCACGGGGCGGCCAGTCTCACCA CTGACCAGTT TGTCTGTCTGTGTGTGTCCATGTGCGAGGGCAGAGGAGGACCCCACATGGACCGCAGCAG CGCCCGAGGC CAGGTATACGGACATCATCCTGTACGCGTCGGGCTCCCTGGCCTTGGCTGTGCTCCTGCT GCTGGCCAGG CTGTATCGAGGGCAGGCGCTCCACGGCCGGCACCCCCGCCCGCCCGCCACTGTGCAGAAG CTCTCCCGCT TCCCTCTGGCCCGACAGTTCTCCCTGGAGTCAGGCTCTTCCGGCAAGTCAAGCTCATCCC TGGTACGAGG CGTGCGTCTCTCCTCCAGCGGCCCCGCCTTGCTCGCCGGCCTCGTGAGTCTAGATCTACC TCTCGACCCA CTATGGGAGTTCCCCCGGGACAGGCTGGTGCTTGGGAAGCCCCTAGGCGAGGGCTGCTTT GGCCAGGTAG TACGTGCAGAGGCCTTTGGCATGGACCCTGCCCGGCCTGACCAAGCCAGCACTGTGGCCG TCAAGATGCT CAAAGACAACGCCTCTGACAAGGACCTGGCCGACCTGGTCTCGGAGATGGAGGTGATGAA GCTGATCGGC CGACACAAGAACATCATCAACCTGCTTGGTGTCTGCACCCAGGAAGGGCCCCTGTACGTG ATCGTGGAGT GCGCCGCCAAGGGAAACCTGCGGGAGTTCCTGCGGGCCCGGCGCCCCCCAGGCCCCGACC TCAGCCCCGA CGGTCCTCGGAGCAGTGAGGGGCCGCTCTCCTTCCCAGTCCTGGTCTCCTGCGCCTACCA GGTGGCCCGA GGCATGCAGTATCTGGAGTCCCGGAAGTGTATCCACCGGGACCTGGCTGCCCGCAATGTG CTGGTGACTG AGGACAATGTGATGAAGATTGCTGACTTTGGGCTGGCCCGCGGCGTCCACCACATTGACT ACTATAAGAA AACCAGCAACGGCCGCCTGCCTGTGAAGTGGATGGCGCCCGAGGCCTTGTTTGACCGGGT GTACACACAC CAGAGTGACGTGTGGTCTTTTGGGATCCTGCTATGGGAGATCTTCACCCTCGGGGGCTCC CCGTATCCTG GCATCCCGGTGGAGGAGCTGTTCTCGCTGCTGCGGGAGGGACATCGGATGGACCGACCCC CACACTGCCC CCCAGAGCTGTACGGGCTGATGCGTGAGTGCTGGCACGCAGCGCCCTCCCAGAGGCCTAC CTTCAAGCAG CTGGTGGAGGCGCTGGACAAGGTCCTGCTGGCCGTCTCTGAGGAGTACCTCGACCTCCGC CTGACCTTCG GACCCTATTCCCCCTCTGGTGGGGACGCCAGCAGCACCTGCTCCTCCAGCGATTCTGTCT TCAGCCACGA CCCCCTGCCATTGGGATCCAGCTCCTTCCCCTTCGGGTCTGGGGTGCAGACATGAGCAAG GCTCAAGGCT GTGCAGGCACATAGGCTGGTGGCCTTGGGCCTTGGGGCTCAGCCACAGCCTGACACAGTG CTCGACCTTG ATAGCATGGGGCCCCTGGCCCAGAGTTGCTGTGCCGTGTCCAAGGGCCGTGCCCTTGCCC TTGGAGCTGC CGTGCCTGTGTCCTGATGGCCCAAATGTCAGGGTTCTGCTCGGCTTCTTGGACCTTGGCG CTTAGTCCCC ATCCCGGGTTTGGCTGAGCCTGGCTGGAGAGCTGCTATGCTAAACCTCCTGCCTCCCAAT ACCAGCAGGA GGTTCTGGGCCTCTGAACCCCCTTTCCCCACACCTCCCCCTGCTGCTGCTGCCCCAGCGT CTTGACGGGA GCATTGGCCCCTGAGCCCAGAGAAGCTGGAAGCCTGCCGAAAACAGGAGCAAATGGCGTT TTATAAATTA TTTTTTTGAAAT

NM_004496 TAAGATCCACATCAGCTCAACTGCACTTGCCTCGCAGAGGCAGCCCGCTCACTTCCCGCG GAGGCGCTCC 123

CCGGCGCCGCGCTCCGCGGCAGCCGCCTGCCCCCGGCGCTGCCCCCGCCCGCCGCGCCGC CGCCGCCGCC GCGCACGCCGCGCCCCGCAGCTCTGGGCTTCCTCTTCGCCCGGGTGGCGTTGGGCCCGCG CGGGCGCTCG GGTGACTGCAGCTGCTCAGCTCCCCTCCCCCGCCCCGCGCCGCGCGGCCGCCCGTCGCTT CGCACAGGGC TGGATGGTTGTATTGGGCAGGGTGGCTCCAGGATGTTAGGAACTGTGAAGATGGAAGGGC ATGAAACCAG CGACTGGAACAGCTACTACGCAGACACGCAGGAGGCCTACTCCTCCGTCCCGGTCAGCAA CATGAACTCA GGCCTGGGCTCCATGAACTCCATGAACACCTACATGACCATGAACACCATGACTACGAGC GGCAACATGA CCCCGGCGTCCTTCAACATGTCCTATGCCAACCCGGGCCTAGGGGCCGGCCTGAGTCCCG GCGCAGTAGC CGGCATGCCGGGGGGCTCGGCGGGCGCCATGAACAGCATGACTGCGGCCGGCGTGACGGC CATGGGTACG GCGCTGAGCCCGAGCGGCATGGGCGCCATGGGTGCGCAGCAGGCGGCCTCCATGAATGGC CTGGGCCCCT ACGCGGCCGCCATGAACCCGTGCATGAGCCCCATGGCGTACGCGCCGTCCAACCTGGGCC GCAGCCGCGC GGGCGGCGGCGGCGACGCCAAGACGTTCAAGCGCAGCTACCCGCACGCCAAGCCGCCCTA CTCGTACATC TCGCTCATCACCATGGCCATCCAGCAGGCGCCCAGCAAGATGCTCACGCTGAGCGAGATC TACCAGTGGA TCATGGACCTCTTCCCCTATTACCGGCAGAACCAGCAGCGCTGGCAGAACTCCATCCGCC ACTCGCTGTC CTTCAATGACTGCTTCGTCAAGGTGGCACGCTCCCCGGACAAGCCGGGCAAGGGCTCCTA CTGGACGCTG CACCCGGACTCCGGCAACATGTTCGAGAACGGCTGCTACTTGCGCCGCCAGAAGCGCTTC AAGTGCGAGA AGCAGCCGGGGGCCGGCGGCGGGGGCGGGAGCGGAAGCGGGGGCAGCGGCGCCAAGGGCG GCCCTGAGAG CCGCAAGGACCCCTCTGGCGCCTCTAACCCCAGCGCCGACTCGCCCCTCCATCGGGGTGT GCACGGGAAG ACCGGCCAGCTAGAGGGCGCGCCGGCCCCCGGGCCCGCCGCCAGCCCCCAGACTCTGGAC CACAGTGGGG CGACGGCGACAGGGGGCGCCTCGGAGTTGAAGACTCCAGCCTCCTCAACTGCGCCCCCCA TAAGCTCCGG GCCCGGGGCGCTGGCCTCTGTGCCCGCCTCTCACCCGGCACACGGCTTGGCACCCCACGA GTCCCAGCTG CACCTGAAAGGGGACCCCCACTACTCCTTCAACCACCCGTTCTCCATCAACAACCTCATG TCCTCCTCGG AGCAGCAGCATAAGCTGGACTTCAAGGCATACGAACAGGCACTGCAATACTCGCCTTACG GCTCTACGTT GCCCGCCAGCCTGCCTCTAGGCAGCGCCTCGGTGACCACCAGGAGCCCCATCGAGCCCTC AGCCCTGGAG CCGGCGTACTACCAAGGTGTGTATTCCAGACCCGTCCTAAACACTTCCTAGCTCCCGGGA CTGGGGGGTT TGTCTGGCATAGCCATGCTGGTAGCAAGAGAGAAAAAATCAACAGCAAACAAAACCACAC AAACCAAACC GTCAACAGCATAATAAAATCCCAACAACTATTTTTATTTCATTTTTCATGCACAACCTTT CCCCCAGTGC AAAAGACTGTTACITTATTATTGTATTCAAAATTCATTGTGTATATTACTACAAAGACAA CCCCAAACCA ATTTTTTTCCTGCGAAGTTTAATGATCCACAAGTGTATATATGAAATTCTCCTCCTTCCT TGCCCCCCTC TCTTTCTTCCCTCTTTCCCCTCCAGACATTCTAGTTTGTGGAGGGTTATTTAAAAAAACA AAAAAGGAAG ATGGTCAAGTTTGTAAAATATTTGTTTGTGCTTTTTCCCCCTCCTTACCTGACCCCCTAC GAGTTTACAG GTCTGTGGCAATACTCTTAACCATAAGAATTGAAATGGTGAAGAAACAAGTATACACTAG AGGCTCTTAA AAGTATTGAAAGACAATACTGCTGTTATATAGCAAGACATAAACAGATTATAAACATCAG AGCCATTTGC TTCTCAGTTTACATTTCTGATACATGCAGATAGCAGATGTCTTTAAATGAAATACATGTA TATTGTGTAT GGACTTAATTATGCACATGCTCAGATGTGTAGACATCCTCCGTATATTTACATAACATAT AGAGGTAATA GATAGGTGATATACATGATACATTCTCAAGAGTTGCTTGACCGAAAGTTACAAGGACCCC AACCCCTTTG TCCTCTCTACCCACAGATGGCCCTGGGAATCAATTCCTCAGGAATTGCCCTCAAGAACTC TGCTTCTTGC TTTGCAGAGTGCCATGGTCATGTCATTCTGAGGTCACATAACACATAAAATTAGTTTCTA TGAGTGTATA CCATTTAAAGAATTTTTTTTTCAGTAAAAGGGAATATTACAATGTTGGAGGAGAGATAAG TTATAGGGAG CTGGATTTCAAAACGTGGTCCAAGATTCAAAAATCCTATTGATAGTGGCCATTTTAATCA TTGCCATCGT GTGCTTGTTTCATCCAGTGTTATGCACTTTCCACAGTTGGACATGGTGTTAGTATAGCCA GACGGGTTTC ATTATTATTTCTCTTTGCTTTCTCAATGTTAATTTATTGCATGGTTTATTCTTTTTCTTT ACAGCTGAAA TTGCTTTAAATGATGGTTAAAATTACAAATTAAATTGTTAATTTTTATCAATGTGATTGT AATTAAAAAT ATTTTGATTTAAATAACAAAAATAATACCAGATTTTAAGCCGTGGAAAATGTTCTTGATC ATTTGCAGTT AAGGACTTTAAATAAATCAAATGTTAACAAAAAAAAAAAAAAAA

NM_001453 ATGCAGGCGCGCTACTCCGTGTCCAGCCCCAACTCCCTGGGAGTGGTGCCCTACCTCGGC GGCGAGCAGA 124

GCTACTACCGCGCGGCGGCCGCGGCGGCCGGGGGCGGCTACACCGCCATGCCGGCCCCCA TGAGCGTGTA CTCGCACCCTGCGCACGCCGAGCAGTACCCGGGCGGCATGGCCCGCGCCTACGGGCCCTA CACGCCGCAG CCGCAGCCCAAGGACATGGTGAAGCCGCCCTATAGCTACATCGCGCTCATCACCATGGCC ATCCAGAACG CCCCGGACAAGAAGATCACCCTGAACGGCATCTACCAGTTCATCATGGACCGCTTCCCCT TCTACCGGGA CAACAAGCAGGGCTGGCAGAACAGCATCCGCCACAACCTCTCGCTCAACGAGTGCTTCGT CAAGGTGCCG CGCGACGACAAGAAGCCGGGCAAGGGCAGCTACTGGACGCTGGACCCGGACTCCTACAAC ATGTTCGAGA ACGGCAGCTTCCTGCGGCGGCGGCGGCGCTTCAAGAAGAAGGACGCGGTGAAGGACAAGG AGGAGAAGGA CAGGCTGCACCTCAAGGAGCCGCCCCCGCCCGGCCGCCAGCCCCCGCCCGCGCCGCCGGA GCAGGCCGAC GGCAACGCGCCCGGTCCGCAGCCGCCGCCCGTGCGCATCCAGGACATCAAGACCGAGAAC GGTACGTGCC CCTCGCCGCCCCAGCCCCTGTCCCCGGCCGCCGCCCTGGGCAGCGGCAGCGCCGCCGCGG TGCCCAAGAT CGAGAGCCCCGACAGCAGCAGCAGCAGCCTGTCCAGCGGGAGCAGCCCCCCGGGCAGCCT GCCGTCGGCG CGGCCGCTCAGCCTGGACGGTGCGGATTCCGCGCCGCCGCCGCCCGCGCCCTCCGCCCCG CCGCCGCACC ATAGCCAGGGCTTCAGCGTGGACAACATCATGACGTCGCTGCGGGGGTCGCCGCAGAGCG CGGCCGCGGA GCTCAGCTCCGGCCTTCTGGCCTCGGCGGCCGCGTCCTCGCGCGCGGGGATCGCACCCCC GCTGGCGCTC GGCGCCTACTCGCCCGGCCAGAGCTCCCTCTACAGCTCCCCCTGCAGCCAGACCTCCAGC GCGGGCAGCT CGGGCGGCGGCGGCGGCGGCGCGGGGGCCGCGGGGGGCGCGGGCGGCGCCGGGACCTACC ACTGCAACCT GCAAGCCATGAGCCTGTACGCGGCCGGCGAGCGCGGGGGCCACTTGCAGGGCGCGCCCGG GGGCGCGGGC GGCTCGGCCGTGGACGACCCCCTGCCCGACTACTCTCTGCCTCCGGTCACCAGCAGCAGC TCGTCGTCCC TGAGTCACGGCGGCGGCGGCGGCGGCGGCGGGGGAGGCCAGGAGGCCGGCCACCACCCTG CGGCCCACCA AGGCCGCCTCACCTCGTGGTACCTGAACCAGGCGGGCGGAGACCTGGGCCACTTGGCGAG CGCGGCGGCG GCGGCGGCGGCCGCAGGCTACCCGGGCCAGCAGCAGAACTTCCACTCGGTGCGGGAGATG TTCGAGTCAC AGAGGATCGGCTTGAACAACTCTCCAGTGAACGGGAATAGTAGCTGTCAAATGGCCTTCC CTTCCAGCCA GTCTCTGTACCGCACGTCCGGAGCTTTCGTCTACGACTGTAGCAAGTTTTGACACACCCT CAAAGCCGAA CTAAATCGAACCCCAAAGCAGGAAAAGCTAAAGGAACCCATCAAGGCAAAATCGAAACTA AAAAAAAAAA ATCCAATTAAAAAAAACCCCTGAGAATATTCACCACACCAGCGAACAGAATATCCCTCCA AAAATTCAGC TCACCAGCACCAGCACGAAGAAAACTCTATTTTCTTAACCGATTAATTCAGAGCCACCTC CACTTTGCCT TGTCTAAATAAACAAACCCGTAAACTGTTTTATACAGAGACAGCAAAATCTTGGTTTATT AAAGGACAGT GTTACTCCAGATAACACGTAAGTTTCTTCTTGCTTTTCAGAGACCTGCTTTCCCCTCCTC CCGTCTCCCC TCTCTTGCCTTCTTCCTTGCCTCTCACCTGTAAGATATTATTTTATCCTATGTTGAAGGG AGGGGGAAAG TCCCCGTTTATGAAAGTCGCTTTCTTTTTATTCATGGACTTGTTTTAAAATGTAAATTGC AACATAGTAA TTTATTTTTAATTTGTAGTTGGATGTCGTGGACCAAACGCCAGAAAGTGTTCCCAAAACC TGACGTTAAA TTGCCTGAAACTTTAAATTGTGCTTTTTTTCTCATTATAAAAAGGGAAACTGTATTAATC TTATTCTATC CTCTTTTCTTTCTTTTTGTTGAACATATTCATTGTTTGTTTATTAATAAATTACCATTCA GTTTGAATGA GACCTATATGTCTGGATACTTTAATAGAGCTTTAATTATTACGAAAAAAGATTTCAGAGA TAAAACACTA GAAGTTACCTATTCTCCACCTAAATCTCTGAAAAATGGAGAAACCCTCTGACTAGTCCAT GTCAAATTTT ACTAAAAGTCTTTTTGTTTAGATTTATTTTCCTGCAGCATCTTCTGCAAAATGTACTATA TAGTCAGCTT GCTTTGAGGCTAGTAAAAAGATATTTTTCTAAACAGATTGGAGTTGGCATATAAACAAAT ACGTTTTCTC ACTAATGACAGTCCATGATTCGGAAATTTTAAGCCCATGAATCAGCCGCGGTCTTACCAC GGTGATGCCT GTGTGCCGAGAGATGGGACTGTGCGGCCAGATATGCACAGATAAATATTTGGCTTGTGTA TTCCATATAA AATTGCAGTGCATATTATACATCCCTGTGAGCCAGATGCTGAATAGATATTTTCCTATTA TTTCAGTCCT TTATAAAAGGAAAAATAAACCAGTTTTTAAATGTATGTATATAATTCTCCCCCATTTACA ATCCTTCATG TATTACATAGAAGGATTGCTTTTTTAAAAATATACTGCGGGTTGGAAAGGGATATTTAAT CTTTGAGAAA C T AT T T T AGAAAAT AT GT T T GT AGAACAAT T AT T T T T GAAAAAGAT T T AAAGC AAT AAC AAGAAGGAAGG CGAGAGGAGCAGAACATTTTGGTCTAGGGTGGTTTCTTTTTAAACCATTTTTTCTTGTTA ATTTACAGTT AAACCTAGGGGACAATCCGGATTGGCCCTCCCCCTTTTGTAAATAACCCAGGAAATGTAA TAAATTCATT ATCTTAGGGTGATCTGCCCTGCCAATCAGACTTTGGGGAGATGGCGATTTGATTACAGAC GTTCGGGGGG GTGGGGGGCTTGCAGTTTGTTTTGGAGATAATACAGTTTCCTGCTATCTGCCGCTCCTAT CTAGAGGCAA CACTTAAGCAGTAATTGCTGTTGCTTGTTGTCAAAATTTGATCATTGTTAAAGGATTGCT GCAAATAAAT ACAC T T T AAT T T C AGT CAAAAA

AJ249248 GTGGCCTCGAGGTGGTGGCAGGGCCGCCCCCTGCAGTCCGGAGACGAACGCACGGACCGG GCCTCCGGAG 125

GCAGGTTCGGCTGGAAGGAACCGCTCTCGCTTCGTCCTACACTTGCGCAAATGTCTCCGA GCTTACTCAC ATAGCATATTGGTATATCAAAATGAAATGCAAGGAACCAAAAATAACATAATTGAAGGCA GTAAAAGTGA AATTAAATAGGAAGATCATCAGTCAAGGAAGACCCACTGGAGAGGACAGAAAATGAAGCA GTGTTTTATC ATGTGTATTTCAGCAGGTCTTCTTGAAATTTAACTAAAAATATGACTGCTCTCTCTTCAG AGAACTGCTC TTTTCAGTACCAGTTACGTCAAACAAACCAGCCCCTAGACGTTAACTATCTGCTATTCTT GATCATACTT GGGAAAATATTATTAAATATCCTTACACTAGGAATGAGAAGAAAAAACACCTGTCAAAAT TTTATGGAAT ATTTTTGCATTTCACTAGCATTCGTTGATCTTTTACTTTTGGTAAACATTTCCATTATAT TGTATTTCAG GGATTTTGTACTTTTAAGCATTAGGTTCACTAAATACCACATCTGCCTATTTACTCAAAT TATTTCCTTT ACTTATGGCTTTTTGCATTATCCAGTTTTCCTGACAGCTTGTATAGATTATTGCCTGAAT TTCTCTAAAA CAACCAAGCTTTCATTTAAGTGTCAAAAATTATTTTATTTCTTTACAGTAATTTTAATTT GGATTTCAGT CCTTGCTTATGTTTTGGGAGACCCAGCCATCTACCAAAGCCTGAAGGCACAGAATGCTTA TTCTCGTCAC TGTCCTTTCTATGTCAGCATTCAGAGTTACTGGCTGTCATTTTTCATGGTGATGATTTTA TTTGTAGCTT TCATAACCTGTTGGGAAGAAGT TACT ACT TTGGTACAGGCTATCAGGATAACTTCCTATATGAATGAAAC TATCTTATATTTTCCTTTTTCATCCCACTCCAGTTATACTGTGAGATCTAAAAAAATATT CTTATCCAAG CTCATTGTCTGTTTTCTCAGTACCTGGTTACCATTTGTACTACTTCAGGTAAT CAT TGTTT TACT TAAAG TTCAGATTCCAGCATATATTGAGATGAATATTCCCTGGTTATACTTTGTCAATAGTTTTC TCATTGCTAC AGTGTATTGGTTTAATTGTCACAAGCTTAATTTAAAAGACATTGGATTACCTTTGGATCC ATTTGTCAAC TGGAAGTGCTGCTTCATTCCACTTACAATTCCTAATCTTGAGCAAATTGAAAAGCCTATA TCAATAATGA TTTGTTAATATTATTAATTAAAAGTTACAGCTGTCATAAGATCATAATTTTATGAACAGA AAGAACTCAG GACATATTAAAAAATAAACTGAACTAAAACAACTTTTGCCCCCTGACTGATAGCATTTCA GAATGTGTCT TTTGAAGGGCTATACCAGTTATTAAATAGTGTTTTATTTTAAAAACAAAATAATTCCAAG AAGTTTTTAT AGTTATTCAGGGACACTATATTACAAATATTACTTTGTTATTAACACAAAAAGTGATAAG AGTTAACATT TGGCTATACTGATGTTTGTGTTACTCAAAAAAACTACTGGATGCAAACTGTTATGTAAAT CTGAGATTTC AC T GAC AAC T T T AAGAT AT CAACC T AAACAT T T T T AT T AAAT GT T CAAAT G TAAGCAAGAAAAAAAAAA

NM_005310 ACCCGCCCCCATCTGCCCAAGATAATTTTAGTTTCCTTGGGCCTGGAATCTGGACACACA GGGCTCCCCC 126

CCGCCTCTGACTTCTCTGTCCGAAGTCGGGACACCCTCCTACCACCTGTAGAGAAGCGGG AGTGGATCTG AAATAAAATCCAGGAATCTGGGGGTTCCTAGACGGAGCCAGACTTCGGAACGGGTGTCCT GCTACTCCTG CTGGGGCTCCTCCAGGACAAGGGCACACAACTGGTTCCGTTAAGCCCCTCTCTCGCTCAG ACGCCATGGA GCTGGATCTGTCTCCACCTCATCTTAGCAGCTCTCCGGAAGACCTTTGCCCAGCCCCTGG GACCCCTCCT GGGACTCCCCGGCCCCCTGATACCCCTCTGCCTGAGGAGGTAAAGAGGTCCCAGCCTCTC CTCATCCCAA CCACCGGCAGGAAACTTCGAGAGGAGGAGAGGCGTGCCACCTCCCTCCCCTCTATCCCCA ACCCCTTCCC TGAGCTCTGCAGTCCTCCCTCACAGAGCCCAATTCTCGGGGGCCCCTCCAGTGCAAGGGG GCTGCTCCCC CGCGATGCCAGCCGCCCCCATGTAGTAAAGGTGTACAGTGAGGATGGGGCCTGCAGGTCT GTGGAGGTGG CAGCAGGTGCCACAGCTCGCCACGTGTGTGAAATGCTGGTGCAGCGAGCTCACGCCTTGA GCGACGAGAC CTGGGGGCTGGTGGAGTGCCACCCCCACCTAGCACTGGAGCGGGGTTTGGAGGACCACGA GTCCGTGGTG GAAGTGCAGGCTGCCTGGCCCGTGGGCGGAGATAGCCGCTTCGTCTTCCGGAAAAACTTC GCCAAGTACG AACTGTTCAAGAGCTCCCCACACTCCCTGTTCCCAGAAAAAATGGTCTCCAGCTGTCTCG ATGCACACAC TGGTATATCCCATGAAGACCTCATCCAGAACTTCCTGAATGCTGGCAGCTTTCCTGAGAT CCAGGGCTTT CTGCAGCTGCGGGGTTCAGGACGGAAGCTTTGGAAACGCTTTTTCTGCTTCTTGCGCCGA TCTGGCCTCT ATTACTCCACCAAGGGCACCTCTAAGGATCCGAGGCACCTGCAGTACGTGGCAGATGTGA ACGAGTCCAA CGTGTACGTGGTGACGCAGGGCCGCAAGCTCTACGGGATGCCCACTGACTTCGGTTTCTG TGTCAAGCCC AACAAGCTTCGAAATGGCCACAAGGGGCTTCGGATCTTCTGCAGTGAAGATGAGCAGAGC CGCACCTGCT GGCTGGCTGCCTTCCGCCTCTTCAAGTACGGGGTGCAGCTGTACAAGAATTACCAGCAGG CACAGTCTCG CCATCTGCATCCATCTTGTTTGGGCTCCCCACCCTTGAGAAGTGCCTCAGATAATACCCT GGTGGCCATG GACTTCTCTGGCCATGCTGGGCGTGTCATTGAGAACCCCCGGGAGGCTCTGAGTGTGGCC CTGGAGGAGG CCCAGGCCTGGAGGAAGAAGACAAACCACCGCCTCAGCCTGCCCATGCCAGCCTCCGGCA CGAGCCTCAG TGCAGCCATCCACCGCACCCAACTCTGGTTCCACGGGCGCATTTCCCGTGAGGAGAGCCA GCGGCTTATT GGACAGCAGGGCTTGGTAGACGGCCTGTTCCTGGTCCGGGAGAGTCAGCGGAACCCCCAG GGCTTTGTCC TCTCTTTGTGCCACCTGCAGAAAGTGAAGCATTATCTCATCCTGCCGAGCGAGGAGGAGG GCCGCCTGTA CTTCAGCATGGATGATGGCCAGACCCGCTTCACTGACCTGCTGCAGCTCGTGGAGTTCCA CCAGCTGAAC CGCGGCATCCTGCCGTGCTTGCTGCGCCATTGCTGCACGCGGGTGGCCCTCTGACCAGGC CGTGGACTGG CTCATGCCTCAGCCCGCCTTCAGGCTGCCCGCCGCCCCTCCACCCATCCAGTGGACTCTG GGGCGCGGCC ACAGGGGACGGGATGAGGAGCGGGAGGGTTCCGCCACTCCAGTTTTCTCCTCTGCTTCTT TGCCTCCCTC AGATAGAAAACAGCCCCCACTCCAGTCCACTCCTGACCCCTCTCCTCAAGGGAAGGCCTT GGGTGGCCCC CTCTCCTTCTCCTAGCTCTGGAGGTGCTGCTCTAGGGCAGGGAATTATGGGAGAAGTGGG GGCAGCCCAG GCGGTTTCACGCCCCACACTTTGTACAGACCGAGAGGCCAGTTGATCTGCTCTGTTTTAT ACTAGTGACA ATAAAGATTATTTTTTGATACAAAAAAAAAAAAAAAAAAAAAAAA

NM_ _014176 AGTCAGAGGTCGCGCAGGCGCTGGTACCCCGTTGGTCCGCGCGTTGCTGCGTTGTGAGGG GTGTCAGCTC 127

AGTGCATCCCAGGCAGCTCTTAGTGTGGAGCAGTGAACTGTGTGTGGTTCCTTCTACTTG GGGATCATGC AGAGAGCTTCACGTCTGAAGAGAGAGCTGCACATGTTAGCCACAGAGCCACCCCCAGGCA TCACATGTTG GCAAGATAAAGACCAAATGGATGACCTGCGAGCTCAAATATTAGGTGGAGCCAACACACC TTATGAGAAA GGTGTTTTTAAGCTAGAAGTTATCATTCCTGAGAGGTACCCATTTGAACCTCCTCAGATC CGATTTCTCA CTCCAATTTATCATCCAAACATTGATTCTGCTGGAAGGATTTGTCTGGATGTTCTCAAAT TGCCACCAAA AGGTGCTTGGAGACCATCCCTCAACATCGCAACTGTGTTGACCTCTATTCAGCTGCTCAT GTCAGAACCC AACCCTGATGACCCGCTCATGGCTGACATATCCTCAGAATTTAAATATAATAAGCCAGCC TTCCTCAAGA ATGCCAGACAGTGGACAGAGAAGCATGCAAGACAGAAACAAAAGGCTGATGAGGAAGAGA TGCTTGATAA TCTACCAGAGGCTGGTGACTCCAGAGTACACAACTCAACACAGAAAAGGAAGGCCAGTCA GCTAGTAGGC ATAGAAAAGAAATTTCATCCTGATGTTTAGGGGACTTGTCCTGGTTCATCTTAGTTAATG TGTTCTTTGC CAAGGTGATCTAAGTTGCCTACCTTGAATTTTTTTTTAAATATATTTGATGACATAATTT TTGTGTAGTT TATTTATCTTGTACATATGTATTTTGAAATCTTTTAAACCTGAAAAATAAATAGTCATTT AATGTTGAAA AAAAAAAAAAAAAAAAAAAAAAAAA

NM_ _006845 ACGCTTGCGCGCGGGATTTAAACTGCGGCGGTTTACGCGGCGTTAAGACTTCGTAGGGTT AGCGAAATTG 128

AGGTTTCTTGGTATTGCGCGTTTCTCTTCCTTGCTGACTCTCCGAATGGCCATGGACTCG TCGCTTCAGG CCCGCCTGTTTCCCGGTCTCGCTATCAAGATCCAACGCAGTAATGGTTTAATTCACAGTG CCAATGTAAG GACTGTGAACTTGGAGAAATCCTGTGTTTCAGTGGAATGGGCAGAAGGAGGTGCCACAAA GGGCAAAGAG ATTGATTTTGATGATGTGGCTGCAATAAACCCAGAACTCTTACAGCTTCTTCCCTTACAT CCGAAGGACA ATCTGCCCTTGCAGGAAAATGTAACAATCCAGAAACAAAAACGGAGATCCGTCAACTCCA AAATTCCTGC TCCAAAAGAAAGTCTTCGAAGCCGCTCCACTCGCATGTCCACTGTCTCAGAGCTTCGCAT CACGGCTCAG GAGAATGACATGGAGGTGGAGCTGCCTGCAGCTGCAAACTCCCGCAAGCAGTTTTCAGTT CCTCCTGCCC CCACTAGGCCTTCCTGCCCTGCAGTGGCTGAAATACCATTGAGGATGGTCAGCGAGGAGA TGGAAGAGCA AGTCCATTCCATCCGAGGCAGCTCTTCTGCAAACCCTGTGAACTCAGTTCGGAGGAAATC ATGTCTTGTG AAGGAAGTGGAAAAAATGAAGAACAAGCGAGAAGAGAAGAAGGCCCAGAACTCTGAAATG AGAATGAAGA GAGCTCAGGAGTATGACAGTAGTTTTCCAAACTGGGAATTTGCCCGAATGATTAAAGAAT TTCGGGCTAC TTTGGAATGTCATCCACTTACTATGACTGATCCTATCGAAGAGCACAGAATATGTGTCTG TGTTAGGAAA CGCCCACTGAATAAGCAAGAATTGGCCAAGAAAGAAATTGATGTGATTTCCATTCCTAGC AAGTGTCTCC TCTTGGTACATGAACCCAAGTTGAAAGTGGACTTAACAAAGTATCTGGAGAACCAAGCAT TCTGCTTTGA CTTTGCATTTGATGAAACAGCTTCGAATGAAGTTGTCTACAGGTTCACAGCAAGGCCACT GGTACAGACA ATCTTTGAAGGTGGAAAAGCAACTTGTTTTGCATATGGCCAGACAGGAAGTGGCAAGACA CATACTATGG GCGGAGACCTCTCTGGGAAAGCCCAGAATGCATCCAAAGGGATCTATGCCATGGCCTCCC GGGACGTCTT CCTCCTGAAGAATCAACCCTGCTACCGGAAGTTGGGCCTGGAAGTCTATGTGACATTCTT CGAGATCTAC AATGGGAAGCTGTTTGACCTGCTCAACAAGAAGGCCAAGCTGCGCGTGCTGGAGGACGGC AAGCAACAGG TGCAAGTGGTGGGGCTGCAGGAGCATCTGGTTAACTCTGCTGATGATGTCATCAAGATGA TCGACATGGG CAGCGCCTGCAGAACCTCTGGGCAGACATTTGCCAACTCCAATTCCTCCCGCTCCCACGC GTGCTTCCAA ATTATTCTTCGAGCTAAAGGGAGAATGCATGGCAAGTTCTCTTTGGTAGATCTGGCAGGG AATGAGCGAG GCGCGGACACTTCCAGTGCTGACCGGCAGACCCGCATGGAGGGCGCAGAAATCAACAAGA GTCTCTTAGC CCTGAAGGAGTGCATCAGGGCCCTGGGACAGAACAAGGCTCACACCCCGTTCCGTGAGAG CAAGCTGACA CAGGTGCTGAGGGACTCCTTCATTGGGGAGAACTCTAGGACTTGCATGATTGCCACGATC TCACCAGGCA TAAGCTCCTGTGAATATACTTTAAACACCCTGAGATATGCAGACAGGGTCAAGGAGCTGA GCCCCCACAG TGGGCCCAGTGGAGAGCAGTTGATTCAAATGGAAACAGAAGAGATGGAAGCCTGCTCTAA CGGGGCGCTG ATTCCAGGCAATTTATCCAAGGAAGAGGAGGAACTGTCTTCCCAGATGTCCAGCTTTAAC GAAGCCATGA CTCAGATCAGGGAGCTGGAGGAGAAGGCTATGGAAGAGCTCAAGGAGATCATACAGCAAG GACCAGACTG GCTTGAGCTCTCTGAGATGACCGAGCAGCCAGACTATGACCTGGAGACCTTTGTGAACAA AGCGGAATCT GCTCTGGCCCAGCAAGCCAAGCATTTCTCAGCCCTGCGAGATGTCATCAAGGCCTTGCGC CTGGCCATGC AGCTGGAAGAGCAGGCTAGCAGACAAATAAGCAGCAAGAAACGGCCCCAGTGACGACTGC AAATAAAAAT CTGTTTGGTTTGACACCCAGCCTCTTCCCTGGCCCTCCCCAGAGAACTTTGGGTACCTGG TGGGTCTAGG CAGGGTCTGAGCTGGGACAGGTTCTGGTAAATGCCAAGTATGGGGGCATCTGGGCCCAGG GCAGCTGGGG AGGGGGTCAGAGTGACATGGGACACTCCTTTTCTGTTCCTCAGTTGTCGCCCTCACGAGA GGAAGGAGCT CTTAGTTACCCTTTTGTGTTGCCCTTCTTTCCATCAAGGGGAATGTTCTCAGCATAGAGC TTTCTCCGCA GCATCCTGCCTGCGTGGACTGGCTGCTAATGGAGAGCTCCCTGGGGTTGTCCTGGCTCTG GGGAGAGAGA CGGAGCCTTTAGTACAGCTATCTGCTGGCTCTAAACCTTCTACGCCTTTGGGCCGAGCAC TGAATGTCTT GTACTTTAAAAAAATGTTTCTGAGACCTCTTTCTACTTTACTGTCTCCCTAGAGATCCTA GAGGATCCCT ACTGTTTTCTGTTTTATGTGTTTATACATTGTATGTAACAATAAAGAGAAAAAATAAATC AGCTGTTTAA GTGTGTGGAAAAAAAAAAAAAAAAAA

NM_ _006101 ACTGCGCGCGTCGTGCGTAATGACGTCAGCGCCGGCGGAGAATTTCAAATTCGAACGGCT TTGGCGGGCC 129

GAGGAAGGACCTGGTGTTTTGATGACCGCTGTCCTGTCTAGCAGATACTTGCACGGTTTA CAGAAATTCG GTCCCTGGGTCGTGTCAGGAAACTGGAAAAAAGGTCATAAGCATGAAGCGCAGTTCAGTT TCCAGCGGTG GTGCTGGCCGCCTCTCCATGCAGGAGTTAAGATCCCAGGATGTAAATAAACAAGGCCTCT ATACCCCTCA AACCAAAGAGAAACCAACCTTTGGAAAGTTGAGTATAAACAAACCGACATCTGAAAGAAA AGTCTCGCTA TTTGGCAAAAGAACTAGTGGACATGGATCCCGGAATAGTCAACTTGGTATATTTTCCAGT TCTGAGAAAA TCAAGGACCCGAGACCACTTAATGACAAAGCATTCATTCAGCAGTGTATTCGACAACTCT GTGAGTTTCT TACAGAAAATGGTTATGCACATAATGTGTCCATGAAATCTCTACAAGCTCCCTCTGTTAA AGACTTCCTG AAGATCTTCACATTTCTTTATGGCTTCCTGTGCCCCTCATACGAACTTCCTGACACAAAG TTTGAAGAAG AGGTTCCAAGAATCTTTAAAGACCTTGGGTATCCTTTTGCACTATCCAAAAGCTCCATGT ACACAGTGGG GGCTCCTCATACATGGCCTCACATTGTGGCAGCCTTAGTTTGGCTAATAGACTGCATCAA GATACATACT GCCAT GAAAGAAAGC T CAC CTTTATTTGATGAT GGGCAGCC T T GGGGAGAAGAAAC T GAAGAT GGAAT T A TGCATAATAAGTTGTTTTTGGACTACACCATAAAATGCTATGAGAGTTTTATGAGTGGTG CCGACAGCTT TGATGAGATGAATGCAGAGCTGCAGTCAAAACTGAAGGATTTATTTAATGTGGATGCTTT TAAGCTGGAA T CAT T AGAAGCAAAAAACAGAGCAT T GAAT GAACAGAT T GCAAGAT T GGAACAAGAAAGAGAAAAAGAAC CGAATCGTCTAGAGTCGTTGAGAAAACTGAAGGCTTCCTTACAAGGAGATGTTCAAAAGT ATCAGGCATA CATGAGCAATTTGGAGTCTCATTCAGCCATTCTTGACCAGAAATTAAATGGTCTCAATGA GGAAATTGCT AGAGTAGAACTAGAATGTGAAACAATAAAACAGGAGAACACTCGACTACAGAATATCATT GACAACCAGA AGTACTCAGTTGCAGACATTGAGCGAATAAATCATGAAAGAAATGAATTGCAGCAGACTA TTAATAAATT AACCAAGGACCTGGAAGCTGAACAACAGAAGTTGTGGAATGAGGAGTTAAAATATGCCAG AGGCAAAGAA GCGATTGAAACACAATTAGCAGAGTATCACAAATTGGCTAGAAAATTAAAACTTATTCCT AAAGGTGCTG AGAATTCCAAAGGTTATGACTTTGAAATTAAGTTTAATCCCGAGGCTGGTGCCAACTGCC TTGTCAAATA CAGGGCTCAAGTTTATGTACCTCTTAAGGAACTCCTGAATGAAACTGAAGAAGAAATTAA TAAAGCCCTA AATAAAAAAATGGGTTTGGAGGATACTTTAGAACAATTGAATGCAATGATAACAGAAAGC AAGAGAAGTG T GAGAAC T C T GAAAGAAGAAGT T CAAAAGC TGGATGATCTT T ACCAACAAAAAAT T AAGGAAGCAGAGGA AGAGGATGAAAAATGTGCCAGTGAGCTTGAGTCCTTGGAGAAACACAAGCACCTGCTAGA AAGTACTGTT AACCAGGGGCTCAGTGAAGCTATGAATGAATTAGATGCTGTTCAGCGGGAATACCAACTA GTTGTGCAAA CCACGACTGAAGAAAGACGAAAAGTGGGAAATAACTTGCAACGTCTGTTAGAGATGGTTG CTACACATGT TGGGTCTGTAGAGAAACATCTTGAGGAGCAGATTGCTAAAGTTGATAGAGAATATGAAGA ATGCATGTCA GAAGAT C T C T CGGAAAAT AT T AAAGAGAT T AGAGAT AAG TAT GAGAAGAAAGC T AC T C T AAT TAAG T C T T CTGAAGAATGAAGATAAAATGTTGATCATGTATATATATCCATAGTGAATAAAATTGTCT CAGTAAAGTG TAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

BC042437 CTCCCTCCTCTGCACCATGACTACCTGCAGCCGCCAGTTCACCTCCTCCAGCTCCATGAA GGGCTCCTGC 130

GGCATCGGGGGCGGCATCGGGGGCGGCTCCAGCCGCATCTCCTCCGTCCTGGCCGGAGGG TCCTGCCGCG CCCCCAGCACCTACGGGGGCGGCCTGTCTGTCTCATCCTCCCGCTTCTCCTCTGGGGGAG CCTATGGGTT GGGGGGCGGCTATGGCGGTGGCTTCAGCAGCAGCAGCAGCAGCTTTGGTAGTGGCTTTGG GGGAGGATAT GGTGGTGGCCTTGGTGCTGGCTTGGGTGGTGGCTTTGGTGGTGGCTTTGCTGGTGGTGAT GGGCTTCTGG TGGGCAGTGAGAAGGTGACCATGCAGAACCTCAACGACCGCCTGGCCTCCTACCTGGACA AGGTGCGTGC TCTGGAGGAGGCCAACGCCGACCTGGAAGTGAAGATCCGTGACTGGTACCAGAGGCAGCG GCCTGCTGAG ATCAAAGACTACAGTCCCTACTTCAAGACCATTGAGGACCTGAGGAACAAGATTCTCACA GCCACAGTGG ACAATGCCAATGTCCTTCTGCAGATTGACAATGCCCGTCTGGCCGCGGATGACTTCCGCA CCAAGTATGA GACAGAGTTGAACCTGCGCATGAGTGTGGAAGCCGACATCAATGGCCTGCGCAGGGTGCT GGACGAACTG ACCCTGGCCAGAGCTGACCTGGAGATGCAGATTGAGAGCCTGAAGGAGGAGCTGGCCTAC CTGAAGAAGA ACCACGAGGAGGAGATGAATGCCCTGAGAGGCCAGGTGGGTGGAGATGTCAATGTGGAGA TGGACGCTGC ACCTGGCGTGGACCTGAGCCGCATTCTGAACGAGATGCGTGACCAGTATGAGAAGATGGC AGAGAAGAAC CGCAAGGATGCCGAGGAATGGTTCTTCACCAAGACAGAGGAGCTGAACCGCGAGGTGGCC ACCAACAGCG AGCTGGTGCAGAGCGGCAAGAGCGAGATCTCGGAGCTCCGGCGCACCATGCAGAACCTGG AGATTGAGCT GCAGTCCCAGCTCAGCATGAAAGCATCCCTGGAGAACAGCCTGGAGGAGACCAAAGGTCG CTACTGCATG CAGCTGGCCCAGATCCAGGAGATGATTGGCAGCGTGGAGGAGCAGCTGGCCCAGCTCCGC TGCGAGATGG AGCAGCAGAACCAGGAGTACAAGATCCTGCTGGACGTGAAGACGCGGCTGGAGCAGGAGA TCGCCACCTA CCGCCGCCTGCTGGAGGGCGAGGACGCCCACCTCTCCTCCTCCCAGTTCTCCTCTGGATC GCAGTCATCC AGAGAT GTGACCTCCTCCAGCCGCCAAATCCGCACCAAGGTCATGGATGTGCACGATGGCAAGGTG GTGT CCACCCACGAGCAGGTCCTTCGCACCAAGAACTGAGGCTGCCCAGCCCCGCTCAGGCCTA GGAGGCCCCC CGTGTGGACACAGATCCCACTGGAAGATCCCCTCTCCTGCCCAAGCACTTCACAGCTGGA CCCTGCTTCA CCCTCACCCCCTCCTGGCAATCAATACAGCTTCATTATCTGAGTTGCATAAAAAAAAAAA AAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

AK095281 CTCTTTTGCAGGGGCCGTTCCTCGGGGCATGACGCTGGCTCCTGCACAGATCCTGCTCCT CTGTGGCCTT 131

CCTGGGCTGCCCTCCCCTCCTCCGGGACTGCTCTGGACTGACACTGCTCAGGTTCGGATT CCCTCAAAGA CTTTGGGAGACAAGACTTGGTCCCCCTTTTACAAACAAGGGAACGGAGGCTCTAGAACTG ACTTCCTGAA AGGCTTGGATCCAAAGCTCCCTCAGTTCAGCGGCCACGTCTATTTCCCTCAGACACAGGG ATCCTTGAAC CTGTGGGCTGTATCTCCCCGCGGACTTGGAAGAATCCCAAGAGAGTGGGGCTCCCACAGG CTGGAGTGCA ATGGTGTGATCTCGGCTCACTGCAACCTCCACCTCCCAGGTTCAAGCTATTCTCCTGCCT CAGCCTCCTG AGTAGCTGGGATTACAGATCCTGGTGGCTGTGGTCGGTAATTCCAGCTTCGTGCTGGCTA CAGGTGGATG ATGCCCACCTGGCTGCCGATGACCTCTGCACCAAGTGAGGCTGGGTCTCTGGAGCTGCCC CAGGGGCTGG ACAAGCTGACCCTGGCCGGGGCCAACCTGGAGATGCAGATTGAGAACCTCAAGGAGGACC TGGTCTACCT GAAGAAGAACCACAAGCAGGAAATGAACGTCCTTTGAGGTCAGGTGGATGAGGATGTCAG TGTGAAGATG GACACTGTGCCTGGAGTGAACCTGAGCTGCATCCTGAATGAGATGCGTGACCAGGACAAG ACATTGGTGG AGAAGAGCTGCAAGGATGCCGAGGGCTGGTTCTTCAGCATGGTGGGTGGCCGTGCGTAAG CAGGTGTGTA CACGTGTGGGCACATGTGCTGCATGCTGGTGCAGCTGGAGCACTGGCAGATCCACAGGCT GTCCCAGTTG GAAGGACTTTTGGAAACCAGTTGGACCAGCCCCTCATGTTTTAGATGTAAAACGTGAGGC TCAGAGAGGA CTCAAGCTCACACAGCCCTTCACTGTGGCCTGCAAAATAGATCCAGGTCTCTACAAGTCT GGTCTTGGGT TTCCACCACAGCTGTTTACAGGATGTGCGTATTTGAATACATATGTATACCCTTGGCAAG CACAGGCTGA GTATCTCCGGTATCCTAGGGACAGCAACAGGCGCAAAAGAATAACACCCAGTGCCTGTCT TTGAGGTGCT GCAGTTCAGTAGGAAAAAGAAATGCAAATGACCGCAGAGCAGGCTGAATTCCTCCAAGTT CCAATGTGGG TGCAGAGGCTCTCTGTGTGCAGAAAGAGGGGCTGAACTGCGAGGTGGCCACCAACACAGA GGCCCTGCAG AGTGGCTGGATAGAGATATGGAGCTCTACGTCTCTGTGCAGAACCTGAGCCGTCCCAGCT CAGCAAGAAA GCATCGCTGGAGGGCAGCCTGGTGGAGATGGAGGTGTGTTACAGGACCCTGCCGGCCCAG CTGCAGGGGC T T AACAGAAGCAT GGAGCAGCAGC T G T GCGAGC T C T GC T GCGACAC GGAGC AC CAGGAC CACAAGC ACAG GTCCTTCTGGACGTGAAGACGTGGCTGGAGCAGGAGATCGCCACCTACCGCCGCTTGCTG GAGGTTGAGG ACGCCCAGAGGTGATACTGACGATGCAGGCTGGAGTCTGGCTGAGGAGCCTTGAATGCCA AGTTAAAGCG TCTGGACTAGATCACGTAGGCAATGGGGAGCCATGGAGGGATTTGGAGCAGGAGAGTGAA ATGAACATCA AGAGATTTTAGAACATTCACTCTGGCTGCAGAGGGAGAAATGGATCAGAGGGGTCAGGGC GGGGCCAGAG AGATGTGTCAGGGGGCTGGAGCAGGGAGTCTGGCCAGAGAAGTCCCGTGCGGTGGTGGGT AGTGGGGCAG GGGAAGGAAGGTGGTGCACGCAGAAGAGAGGTTATAGCTCAAAACAGCGGGACTGGATGC CTGGATCTCG GGGTAAGCATGGCTCACAGTCAGGACTCAGTAAGTGTCGGGAGAACACATGAAGGAGCAG GCATTGATGG CCCTGGGTTTCTGGTTCTGATGACTGTGTGAGTGGTGAAGAGCAAGGTGGGTGGTGGTTG GGTTTGCAGT TGGGAAGGGTGATCAGGCCTTCAGCTGAGAGTGTCCCGGAGTCTCCATGCTTAGTCACAC GTTGCAGCTT TTTGCTCCCCGGAAATGGTGAAGTCCATCTATAGTCTAACAACAGTCTCTCCTGCTTTAA TTGGGTCTAT TTGTTGGGCCCTCTGGGTTATGGAAAAACCACTTGCTCAGCTTCTCCTTGTAAATTCCTG GTGAGTAGCC ACAGAGTGCCGCCAGACCTACTGCTGTGCTGTTTCTTTTTCTTCTTCCTGCTGTGCTGAA CCCCTGCCCT TTCATTCTTGGGCCTGCGCTAATTTCTGTGCATTCCCAACTGTGATTTTTCACCAATTTA GGGGAACCTC CTCTGCCAGGGCCTACTTCTCCCCAGCAGTGCTTGCAGGTGCCTGGGCTGGCTGGCATCC CTGGGCTGAT GGGTGCTTCTCTCCCTGCAGGCTGGCCACTCAGTACTCCTTGTCCCTGGCCTCGCAGCCC ACCCGGGAAG CCACAGTGACCAGCCACCAGGTGTGCCATCGTGGAGGAAGTCCAGGTTGGAGAGGTGGTC TTCTTCTGTG AGCAGGTCCACTTCTCCACCCACTGAGACCCCTTTCTGTCTGCGACAGCCCCACCTCGAG GGCCACGGCA CAGCCATCAGCTCCAGCTCCCAGCATGCTACTGCCACGCCCCGAGTGTCCGTCTGGGCCC CGGTGCATGG CCTGTTGTCTTTCTGTATCTACTTTCTGCAGCCCCTCACTGAGGAGGCCTCCTGGGTTTG TCCAGTGCCT ACTATTAAAGCTTTGCTCCAAGTTC

M21389 GCATCCTTTTTGGGCTGCTCACAGCCCCCAGCCTCTATGGTGAAGACATACTTGCTAGCA GCGTCACCAA 132

C T T GC T GCCAAGAGAT CAG T GC T GCAAGGCAAGGT T AT T T C T AAC T GAGCAGAGCC T GC CAGGAAGAAAG CGTTTGCACCCCACACCACTGTGCAGGTGTGACCGGTGAGCTCACAGCTGCCCCCCAGGC ATGCCCAGCC CACTTAATCATTCACAGCTCGACAGCTCTCTCGCCCAGCCCAGTTCTGGAAGGGATAAAA AGGGGGCATC ACCGTTCCTGGGTAACAGAGCCACCTTCTGCGTCCTGCTGAGCTCTGTTCTCTCCAGCAC CTCCCAACCC ACTAGTGCCTGGTTCTCTTGCTCCACCAGGAACAAGCCACCATGTCTCGCCAGTCAAGTG TGTCCTTCCG GAGCGGGGGCAGTCGTAGCTTCAGCACCGCCTCTGCCATCACCCCGTCTGTCTCCCGCAC CAGCTTCACC TCCGTGTCCCGGTCCGGGGGTGGCGGTGGTGGTGGCTTCGGCAGGGTCAGCCTTGCGGGT GCTTGTGGAG TGGGTGGCTATGGCAGCCGGAGCCTCTACAACCTGGGGGGCTCCAAGAGGATATCCATCA GCACTAGAGG AGGCAGCTTCAGGAACCGGTTTGGTGCTGGTGCTGGAGGCGGCTATGGCTTTGGAGGTGG TGCCGGTAGT GGATTTGGTTTCGGCGGTGGAGCTGGTGGTGGCTTTGGGCTCGGTGGCGGAGCTGGCTTT GGAGGTGGCT TCGGTGGCCCTGGCTTTCCTGTCTGCCCTCCTGGAGGTATCCAAGAGGTCACTGTCAACC AGAGTCTCCT GACTCCCCTCAACCTGCAAATCGACCCCAGCATCCAGAGGGTGAGGACCGAGGAGCGCGA GCAGATCAAG ACCCTCAACAATAAGTTTGCCTCCTTCATCGACAAGGTGCGGTTCCTGGAGCAGCAGAAC AAGGTTCTGG ACACCAAGTGGACCCTGCTGCAGGAGCAGGGCACCAAGACTGTGAGGCAGAACCTGGAGC CGTTGTTCGA GCAGTACATCAACAACCTCAGGAGGCAGCTGGACAGCATCGTGGGGGAACGGGGCCGCCT GGACTCAGAG CTGAGAAACATGCAGGACCTGGTGGAAGACTTCAAGAACAAGTATGAGGATGAAATCAAC AAGCGTACCA CTGCTGAGAATGAGTTTGTGATGCTGAAGAAGGATGTAGATGCTGCCTACATGAACAAGG TGGAGCTGGA GGCCAAGGTTGATGCACTGATGGATGAGATTAACTTCATGAAGATGTTCTTTGATGCGGA GCTGTCCCAG ATGCAGACGCATGTCTCTGACACCTCAGTGGTCCTCTCCATGGACAACAACCGCAACCTG GACCTGGATA GCATCATCGCTGAGGTCAAGGCCCAGTATGAGGAGATTGCCAACCGCAGCCGGACAGAAG CCGAGTCCTG GTATCAGACCAAGTATGAGGAGCTGCAGCAGACAGCTGGCCGGCATGGCGATGACCTCCG CAACACCAAG CATGAGATCACAGAGATGAACCGGATGATCCAGAGGCTGAGAGCCGAGATTGACAATGTC AAGAAACAGT GCGCCAATCTGCAGAACGCCATTGCGGATGCCGAGCAGCGTGGGGAGCTGGCCCTCAAGG ATGCCAGGAA CAAGCTGGCCGAGCTGGAGGAGGCCCTGCAGAAGGCCAAGCAGGACATGGCCCGGCTGCT GCGTGAGTAC CAGGAGCTCATGAACACCAAGCTGGCCCTGGACGTGGAGATCGCCACTTACCGCAAGCTG CTGGAGGGCG AGGAATGCAGACTCAGTGGAGAAGGAGTTGGACCAGTCAACATCTCTGTTGTCACAAGCA GTGTTTCCTC TGGATATGGCAGTGGCAGTGGCTATGGCGGTGGCCTCGGTGGAGGTCTTGGCGGCGGCCT CGGTGGAGGT CTTGCCGGAGGTAGCAGTGGAAGCTACTACTCCAGCAGCAGTGGGGGTGTCGGCCTAGGT GGTGGGCTCA GTGTGGGGGGCTCTGGCTTCAGTGCAAGCAGTGGCCGAGGGCTGGGGGTGGGCTTTGGCA GTGGCGGGGG TAGCAGCTCCAGCGTCAAATTTGTCTCCACCACCTCCTCCTCCCGGAAGAGCTTCAAGAG CTAAGAACCT GCTGCAAGTCACTGCCTTCCAAGTGCAGCAACCCAGCCCATGGAGATTGCCTCTTCTAGG CAGTTGCTCA AGCCATGTTTTATCCTTTTCTGGAGAGTAGTCTAGACCAAGCCAATTGCAGAACCACATT CTTTGGTTCC CAGGAGAGCCCCATTCCCAGCCCCTGGTCTCCCGTGCCGCAGTTCTATATTCTGCTTCAA ATCAGCCTTC AGGTTTCCCACAGCATGGCCCCTGCTGACACGAGAACCCAAAGTTTTCCCAAATCTAAAT CATCAAAACA GAATCCCCACCCCAATCCCAAATTTTGTTTTGGTTCTAACTACCTCCAGAATGTGTTCAA TAAAATGCTT TTATAATAT

NM_00112306 GGACGGCCGAGCGGCAGGGCGCTCGCGCGCGCCCACTAGTGGCCGGAGGAGAAGGCTCCC GCGGAGGCCG 133 6 CGCTGCCCGCCCCCTCCCCTGGGGAGGCTCGCGTTCCCGCTGCTCGCGCCTGCGCCGCCC GCCGGCCTCA

GGAACGCGCCCTCTTCGCCGGCGCGCGCCCTCGCAGTCACCGCCACCCACCAGCTCCGGC ACCAACAGCA GCGCCGCTGCCACCGCCCACCTTCTGCCGCCGCCACCACAGCCACCTTCTCCTCCTCCGC TGTCCTCTCC CGTCCTCGCCTCTGTCGACTATCAGGTGAACTTTGAACCAGGATGGCTGAGCCCCGCCAG GAGTTCGAAG TGATGGAAGATCACGCTGGGACGTACGGGTTGGGGGACAGGAAAGATCAGGGGGGCTACA CCATGCACCA AGACCAAGAGGGTGACACGGACGCTGGCCTGAAAGAATCTCCCCTGCAGACCCCCACTGA GGACGGATCT GAGGAACCGGGCTCTGAAACCTCTGATGCTAAGAGCACTCCAACAGCGGAAGATGTGACA GCACCCTTAG TGGATGAGGGAGCTCCCGGCAAGCAGGCTGCCGCGCAGCCCCACACGGAGATCCCAGAAG GAACCACAGC TGAAGAAGCAGGCATTGGAGACACCCCCAGCCTGGAAGACGAAGCTGCTGGTCACGTGAC CCAAGAGCCT GAAAGTGGTAAGGTGGTCCAGGAAGGCTTCCTCCGAGAGCCAGGCCCCCCAGGTCTGAGC CACCAGCTCA TGTCCGGCATGCCTGGGGCTCCCCTCCTGCCTGAGGGCCCCAGAGAGGCCACACGCCAAC CTTCGGGGAC AGGACCTGAGGACACAGAGGGCGGCCGCCACGCCCCTGAGCTGCTCAAGCACCAGCTTCT AGGAGACCTG CACCAGGAGGGGCCGCCGCTGAAGGGGGCAGGGGGCAAAGAGAGGCCGGGGAGCAAGGAG GAGGTGGATG AAGACCGCGACGTCGATGAGTCCTCCCCCCAAGACTCCCCTCCCTCCAAGGCCTCCCCAG CCCAAGATGG GCGGCCTCCCCAGACAGCCGCCAGAGAAGCCACCAGCATCCCAGGCTTCCCAGCGGAGGG TGCCATCCCC CTCCCTGTGGATTTCCTCTCCAAAGTTTCCACAGAGATCCCAGCCTCAGAGCCCGACGGG CCCAGTGTAG GGCGGGCCAAAGGGCAGGATGCCCCCCTGGAGTTCACGTTTCACGTGGAAATCACACCCA ACGTGCAGAA GGAGCAGGCGCACTCGGAGGAGCATTTGGGAAGGGCTGCATTTCCAGGGGCCCCTGGAGA GGGGCCAGAG GCCCGGGGCCCCTCTTTGGGAGAGGACACAAAAGAGGCTGACCTTCCAGAGCCCTCTGAA AAGCAGCCTG CTGCTGCTCCGCGGGGGAAGCCCGTCAGCCGGGTCCCTCAACTCAAAGCTCGCATGGTCA GTAAAAGCAA AGACGGGACTGGAAGCGATGACAAAAAAGCCAAGACATCCACACGTTCCTCTGCTAAAAC CTTGAAAAAT AGGCCTTGCCTTAGCCCCAAACACCCCACTCCTGGTAGCTCAGACCCTCTGATCCAACCC TCCAGCCCTG CTGTGTGCCCAGAGCCACCTTCCTCTCCTAAATACGTCTCTTCTGTCACTTCCCGAACTG GCAGTTCTGG AGCAAAGGAGATGAAACTCAAGGGGGCTGATGGTAAAACGAAGATCGCCACACCGCGGGG AGCAGCCCCT CCAGGCCAGAAGGGCCAGGCCAACGCCACCAGGATTCCAGCAAAAACCCCGCCCGCTCCA AAGACACCAC CCAGCTCTGCGACTAAGCAAGTCCAGAGAAGACCACCCCCTGCAGGGCCCAGATCTGAGA GAGGTGAACC TCCAAAATCAGGGGATCGCAGCGGCTACAGCAGCCCCGGCTCCCCAGGCACTCCCGGCAG CCGCTCCCGC ACCCCGTCCCTTCCAACCCCACCCACCCGGGAGCCCAAGAAGGTGGCAGTGGTCCGTACT CCACCCAAGT CGCCGTCTTCCGCCAAGAGCCGCCTGCAGACAGCCCCCGTGCCCATGCCAGACCTGAAGA ATGTCAAGTC CAAGATCGGCTCCACTGAGAACCTGAAGCACCAGCCGGGAGGCGGGAAGGTGCAGATAAT TAATAAGAAG CTGGATCTTAGCAACGTCCAGTCCAAGTGTGGCTCAAAGGATAATATCAAACACGTCCCG GGAGGCGGCA GTGTGCAAATAGTCTACAAACCAGTTGACCTGAGCAAGGTGACCTCCAAGTGTGGCTCAT TAGGCAACAT CCATCATAAACCAGGAGGTGGCCAGGTGGAAGTAAAATCTGAGAAGCTTGACTTCAAGGA CAGAGTCCAG TCGAAGATTGGGTCCCTGGACAATATCACCCACGTCCCTGGCGGAGGAAATAAAAAGATT GAAACCCACA AGCTGACCTTCCGCGAGAACGCCAAAGCCAAGACAGACCACGGGGCGGAGATCGTGTACA AGTCGCCAGT GGTGTCTGGGGACACGTCTCCACGGCATCTCAGCAATGTCTCCTCCACCGGCAGCATCGA CATGGTAGAC TCGCCCCAGCTCGCCACGCTAGCTGACGAGGTGTCTGCCTCCCTGGCCAAGCAGGGTTTG TGATCAGGCC CCTGGGGCGGTCAATAATTGTGGAGAGGAGAGAATGAGAGAGTGTGGAAAAAAAAAGAAT AATGACCCGG CCCCCGCCCTCTGCCCCCAGCTGCTCCTCGCAGTTCGGTTAATTGGTTAATCACTTAACC TGCTTTTGTC ACTCGGCTTTGGCTCGGGACTTCAAAATCAGTGATGGGAGTAAGAGCAAATTTCATCTTT CCAAATTGAT GGGTGGGCTAGTAATAAAATATTTAAAAAAAAACATTCAAAAACATGGCCACATCCAACA TTTCCTCAGG CAATTCCTTTTGATTCTTTTTTCTTCCCCCTCCATGTAGAAGAGGGAGAAGGAGAGGCTC TGAAAGCTGC TTCTGGGGGATTTCAAGGGACTGGGGGTGCCAACCACCTCTGGCCCTGTTGTGGGGGTGT CACAGAGGCA GTGGCAGCAACAAAGGATTTGAAACTTGGTGTGTTCGTGGAGCCACAGGCAGACGATGTC AACCTTGTGT GAGTGTGACGGGGGTTGGGGTGGGGCGGGAGGCCACGGGGGAGGCCGAGGCAGGGGCTGG GCAGAGGGGA GAGGAAGCACAAGAAGTGGGAGTGGGAGAGGAAGCCACGTGCTGGAGAGTAGACATCCCC CTCCTTGCCG CTGGGAGAGCCAAGGCCTATGCCACCTGCAGCGTCTGAGCGGCCGCCTGTCCTTGGTGGC CGGGGGTGGG GGCCTGCTGTGGGTCAGTGTGCCACCCTCTGCAGGGCAGCCTGTGGGAGAAGGGACAGCG GGTAAAAAGA GAAGGCAAGCTGGCAGGAGGGTGGCACTTCGTGGATGACCTCCTTAGAAAAGACTGACCT TGATGTCTTG AGAGCGCTGGCCTCTTCCTCCCTCCCTGCAGGGTAGGGGGCCTGAGTTGAGGGGCTTCCC TCTGCTCCAC AGAAACCCTGTTTTATTGAGTTCTGAAGGTTGGAACTGCTGCCATGATTTTGGCCACTTT GCAGACCTGG GACTTTAGGGCTAACCAGTTCTCTTTGTAAGGACTTGTGCCTCTTGGGAGACGTCCACCC GTTTCCAAGC CTGGGCCACTGGCATCTCTGGAGTGTGTGGGGGTCTGGGAGGCAGGTCCCGAGCCCCCTG TCCTTCCCAC GGCCACTGCAGTCACCCCGTCTGCGCCGCTGTGCTGTTGTCTGCCGTGAGAGCCCAATCA CTGCCTATAC CCCTCATCACACGTCACAATGTCCCGAATTCCCAGCCTCACCACCCCTTCTCAGTAATGA CCCTGGTTGG TTGCAGGAGGTACCTACTCCATACTGAGGGTGAAATTAAGGGAAGGCAAAGTCCAGGCAC AAGAGTGGGA CCCCAGCCTCTCACTCTCAGTTCCACTCATCCAACTGGGACCCTCACCACGAATCTCATG ATCTGATTCG GTTCCCTGTCTCCTCCTCCCGTCACAGATGTGAGCCAGGGCACTGCTCAGCTGTGACCCT AGGTGTTTCT GCCTTGTTGACATGGAGAGAGCCCTTTCCCCTGAGAAGGCCTGGCCCCTTCCTGTGCTGA GCCCACAGCA GCAGGCTGGGTGTCTTGGTTGTCAGTGGTGGCACCAGGATGGAAGGGCAAGGCACCCAGG GCAGGCCCAC AGTCCCGCTGTCCCCCACTTGCACCCTAGCTTGTAGCTGCCAACCTCCCAGACAGCCCAG CCCGCTGCTC AGCTCCACATGCATAGTATCAGCCCTCCACACCCGACAAAGGGGAACACACCCCCTTGGA AATGGTTCTT TTCCCCCAGTCCCAGCTGGAAGCCATGCTGTCTGTTCTGCTGGAGCAGCTGAACATATAC ATAGATGTTG CCCTGCCCTCCCCATCTGCACCCTGTTGAGTTGTAGTTGGATTTGTCTGTTTATGCTTGG ATTCACCAGA GTGACTATGATAGTGAAAAGAAAAAAAAAAAAAAAAAAGGACGCATGTATCTTGAAATGC TTGTAAAGAG GTTTCTAACCCACCCTCACGAGGTGTCTCTCACCCCCACACTGGGACTCGTGTGGCCTGT GTGGTGCCAC CCTGCTGGGGCCTCCCAAGTTTTGAAAGGCTTTCCTCAGCACCTGGGACCCAACAGAGAC CAGCTTCTAG CAGCTAAGGAGGCCGTTCAGCTGTGACGAAGGCCTGAAGCACAGGATTAGGACTGAAGCG ATGATGTCCC CTTCCCTACTTCCCCTTGGGGCTCCCTGTGTCAGGGCACAGACTAGGTCTTGTGGCTGGT CTGGCTTGCG GCGCGAGGATGGTTCTCTCTGGTCATAGCCCGAAGTCTCATGGCAGTCCCAAAGGAGGCT TACAACTCCT GCATCACAAGAAAAAGGAAGCCACTGCCAGCTGGGGGGATCTGCAGCTCCCAGAAGCTCC GTGAGCCTCA GCCACCCCTCAGACTGGGTTCCTCTCCAAGCTCGCCCTCTGGAGGGGCAGCGCAGCCTCC CACCAAGGGC CCTGCGACCACAGCAGGGATTGGGATGAATTGCCTGTCCTGGATCTGCTCTAGAGGCCCA AGCTGCCTGC CTGAGGAAGGATGACTTGACAAGTCAGGAGACACTGTTCCCAAAGCCTTGACCAGAGCAC CTCAGCCCGC TGACCTTGCACAAACTCCATCTGCTGCCATGAGAAAAGGGAAGCCGCCTTTGCAAAACAT TGCTGCCTAA AGAAACTCAGCAGCCTCAGGCCCAATTCTGCCACTTCTGGTTTGGGTACAGTTAAAGGCA ACCCTGAGGG ACTTGGCAGTAGAAATCCAGGGCCTCCCCTGGGGCTGGCAGCTTCGTGTGCAGCTAGAGC TTTACCTGAA AGGAAGTCTCTGGGCCCAGAACTCTCCACCAAGAGCCTCCCTGCCGTTCGCTGAGTCCCA GCAATTCTCC TAAGTTGAAGGGATCTGAGAAGGAGAAGGAAATGTGGGGTAGATTTGGTGGTGGTTAGAG ATATGCCCCC CTCATTACTGCCAACAGTTTCGGCTGCATTTCTTCACGCACCTCGGTTCCTCTTCCTGAA GTTCTTGTGC CCTGCTCTTCAGCACCATGGGCCTTCTTATACGGAAGGCTCTGGGATCTCCCCCTTGTGG GGCAGGCTCT TGGGGCCAGCCTAAGATCATGGTTTAGGGTGATCAGTGCTGGCAGATAAATTGAAAAGGC ACGCTGGCTT GTGATCTTAAATGAGGACAATCCCCCCAGGGCTGGGCACTCCTCCCCTCCCCTCACTTCT CCCACCTGCA GAGCCAGTGTCCTTGGGTGGGCTAGATAGGATATACTGTATGCCGGCTCCTTCAAGCTGC TGACTCACTT TATCAATAGTTCCATTTAAATTGACTTCAGTGGTGAGACTGTATCCTGTTTGCTATTGCT TGTTGTGCTA TGGGGGGAGGGGGGAGGAATGTGTAAGATAGTTAACATGGGCAAAGGGAGATCTTGGGGT GCAGCACTTA AACTGCCTCGTAACCCTTTTCATGATTTCAACCACATTTGCTAGAGGGAGGGAGCAGCCA CGGAGTTAGA GGCCCTTGGGGTTTCTCTTTTCCACTGACAGGCTTTCCCAGGCAGCTGGCTAGTTCATTC CCTCCCCAGC CAGGTGCAGGCGTAGGAATATGGACATCTGGTTGCTTTGGCCTGCTGCCCTCTTTCAGGG GTCCTAAGCC CACAATCATGCCTCCCTAAGACCTTGGCATCCTTCCCTCTAAGCCGTTGGCACCTCTGTG CCACCTCTCA CACTGGCTCCAGACACACAGCCTGTGCTTTTGGAGCTGAGATCACTCGCTTCACCCTCCT CATCTTTGTT CTCCAAGTAAAGCCACGAGGTCGGGGCGAGGGCAGAGGTGATCACCTGCGTGTCCCATCT ACAGACCTGC AGCTTCATAAAACTTCTGATTTCTCTTCAGCTTTGAAAAGGGTTACCCTGGGCACTGGCC TAGAGCCTCA CCTCCTAATAGACTTAGCCCCATGAGTTTGCCATGTTGAGCAGGACTATTTCTGGCACTT GCAAGTCCCA TGATTTCTTCGGTAATTCTGAGGGTGGGGGGAGGGACATGAAATCATCTTAGCTTAGCTT TCTGTCTGTG AATGTCTATATAGTGTATTGTGTGTTTTAACAAATGATTTACACTGACTGTTGCTGTAAA AGTGAATTTG GAAAT AAAGT T AT T AC T C T GAT T AAA

M92424 GCACCGCGCGAGCTTGGCTGCTTCTGGGGCCTGTGTGGCCCTGTGTGTCGGAAAGATGGA GCAAGAAGCC 134

GAGCCCGAGGGGCGGCCGCGACCCCTCTGACCGAGATCCTGCTGCTTTCGCAGCCAGGAG CACCGTCCCT CCCCGGATTAGTGCGTACGAGCGCCCAGTGCCCTGGCCCGGAGAGTGGAATGATCCCCGA GGCCCAGGGC GTCGTGCTTCCGCAGTAGTCAGTCCCCGTGAAGGAAACTGGGGAGTCTTGAGGGACCCCC GACTCCAAGC GCGAAAACCCCGGATGGTGAGGAGCAGGCAAATGTGCAATACCAACATGTCTGTACCTAC TGATGGTGCT GTAACCACCTCACAGATTCCAGCTTCGGAACAAGAGACCCTGGTTAGACCAAAGCCATTG CTTTTGAAGT TATTAAAGTCTGTTGGTGCACAAAAAGACACTTATACTATGAAAGAGGTTCTTTTTTATC TTGGCCAGTA TATTATGACTAAACGATTATATGATGAGAAGCAACAACATATTGTATATTGTTCAAATGA TCTTCTAGGA GATTTGTTTGGCGTGCCAAGCTTCTCTGTGAAAGAGCACAGGAAAATATATACCATGATC TACAGGAACT TGGTAGTAGTCAATCAGCAGGAATCATCGGACTCAGGTACATCTGTGAGTGAGAACAGGT GTCACCTTGA AGGTGGGAGTGATCAAAAGGACCTTGTACAAGAGCTTCAGGAAGAGAAACCTTCATCTTC ACATTTGGTT TCTAGACCATCTACCTCATCTAGAAGGAGAGCAATTAGTGAGACAGAAGAAAATTCAGAT GAATTATCTG GTGAACGACAAAGAAAACGCCACAAATCTGATAGTATTTCCCTTTCCTTTGATGAAAGCC TGGCTCTGTG TGTAATAAGGGAGATATGTTGTGAAAGAAGCAGTAGCAGTGAATCTACAGGGACGCCATC GAATCCGGAT CTTGATGCTGGTGTAAGTGAACATTCAGGTGATTGGTTGGATCAGGATTCAGTTTCAGAT CAGTTTAGTG TAGAAT T T GAAGT T GAAT C T C TCGAC T CAGAAGAT TATAGCCTTAGT GAAGAAGGACAAGAAC T C T CAGA TGAAGATGATGAGGTATATCAAGTTACTGTGTATCAGGCAGGGGAGAGTGATACAGATTC ATTTGAAGAA GATCCT GAAAT TTCCTTAGCTGACTATTGGAAATGCACTTCATGCAATGAAATGAATCCCCCCCTTCCAT CACATTGCAACAGATGTTGGGCCCTTCGTGAGAATTGGCTTCCTGAAGATAAAGGGAAAG ATAAAGGGGA AAT C T C T GAGAAAGCCAAAC T GGAAAAC T CAAC ACAAGC T GAAGAGGGC TTTGATGTTCCTGATTG T AAA AAAACTATAGTGAATGATTCCAGAGAGTCATGTGTTGAGGAAAATGATGATAAAATTACA CAAGCTTCAC AATCACAAGAAAGTGAAGACTATTCTCAGCCATCAACTTCTAGTAGCATTATTTATAGCA GCCAAGAAGA TGTGAAAGAGTTTGAAAGGGAAGAAACCCAAGACAAAGAAGAGAGTGTGGAATCTAGTTT GCCCCTTAAT GCCATTGAACCTTGTGTGATTTGTCAAGGTCGACCTAAAAATGGTTGCATTGTCCATGGC AAAACAGGAC ATCTTATGGCCTGCTTTACATGTGCAAAGAAGCTAAAGAAAAGGAATAAGCCCTGCCCAG TATGTAGACA ACCAATTCAAATGATTGTGCTAACTTATTTCCCCTAGTTGACCTGTCTATAAGAGAATTA TATATTTCTA ACTATATAACCCTAGGAATTTAGACAACCTGAAATTTATTCACATATATCAAAGTGAGAA AATGCCTCAA T TCACATAGAT T T CT TCTC T T TAGTATAAT TGACCTACT T TGGTAGTGGAA TAGTGAAT ACT TACT ATAA TTTGACTTGAATATGTAGCTCATCCTTTACACCAACTCCTAATTTTAAATAATTTCTACT CTGTCTTAAA TGAGAAGTACTTGGTTTTTTTTTTCTTAAATATGTATATGACATTTAAATGTAACTTATT ATTTTTTTTG AGACCGAGTCTTGCTCTGTTACCCAGGCTGGAGTGCAGTGGGTGATCTTGGCTCACTGCA AGCTCTGCCC TCCCCGGGTTCGCACCATTCTCCTGCCTCAGCCTCCCAATTAGCTTGGCCTACAGTCATC TGCCACCACA CCTGGCTAATTTTTTGTACTTTTAGTAGAGACAGGGTTTCACCGTGTTAGCCAGGATGGT CTCGATCTCC TGACCTCGTGATCCGCCCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCAC CG

NM_014791 GAGATTTGATTCCCTTGGCGGGCGGAAGCGGCCACAACCCGGCGATCGAAAAGATTCTTA GGAACGCCGT 135

ACCAGCCGCGTCTCTCAGGACAGCAGGCCCCTGTCCTTCTGTCGGGCGCCGCTCAGCCGT GCCCTCCGCC CCTCAGGTTCTTTTTCTAATTCCAAATAAACTTGCAAGAGGACTATGAAAGATTATGATG AACTTCTCAA ATATTATGAATTACATGAAACTATTGGGACAGGTGGCTTTGCAAAGGTCAAACTTGCCTG CCATATCCTT ACTGGAGAGATGGTAGCTATAAAAATCATGGATAAAAACACACTAGGGAGTGATTTGCCC CGGATCAAAA CGGAGATTGAGGCCTTGAAGAACCTGAGACATCAGCATATATGTCAACTCTACCATGTGC TAGAGACAGC CAACAAAATATTCATGGTTCTTGAGTACTGCCCTGGAGGAGAGCTGTTTGACTATATAAT TTCCCAGGAT CGCCTGTCAGAAGAGGAGACCCGGGTTGTCTTCCGTCAGATAGTATCTGCTGTTGCTTAT GTGCACAGCC AGGGCTATGCTCACAGGGACCTCAAGCCAGAAAATTTGCTGTTTGATGAATAT CATAAATTAAAGCTGAT TGACTTTGGTCTCTGTGCAAAACCCAAGGGTAACAAGGATTACCATCTACAGACATGCTG TGGGAGTCTG GC T T AT GCAGCAC C T GAGT T AAT ACAAGGCAAAT CAT AT CTTGGAT CAGAGGC AGAT GT T T GGAGC AT GG GCATACTGTTATATGTTCTTATGTGTGGATTTCTACCATTTGATGATGATAATGTAATGG CTTTATACAA GAAGATTATGAGAGGAAAATATGATGTTCCCAAGTGGCTCTCTCCCAGTAGCATTCTGCT TCTTCAACAA ATGCTGCAGGTGGACCCAAAGAAACGGATTTCTATGAAAAATCTATTGAACCATCCCTGG ATCATGCAAG ATTACAACTATCCTGTTGAGTGGCAAAGCAAGAATCCTTTTATTCACCTCGATGATGATT GCGTAACAGA AC T T T C T GT ACAT CACAGAAACAACAGGCAAAC AAT GGAGGAT T T AAT T T C AC T GT GGC AGT AT GAT CAC CTCACGGCTACCTATCTTCTGCTTCTAGCCAAGAAGGCTCGGGGAAAACCAGTTCGTTTA AGGCTTTCTT CTTTCTCCTGTGGACAAGCCAGTGCTACCCCATTCACAGACATCAAGTCAAATAATTGGA GTCTGGAAGA TGTGACCGCAAGTGATAAAAATTATGTGGCGGGATTAATAGACTATGATTGGTGTGAAGA TGATTTATCA ACAGGTGCTGCTACTCCCCGAACATCACAGTTTACCAAGTACTGGACAGAATCAAATGGG GTGGAATCTA AATCATTAACTCCAGCCTTATGCAGAACACCTGCAAATAAATTAAAGAACAAAGAAAATG TATATACTCC TAAGTCTGCTGTAAAGAATGAAGAGTACTTTATGTTTCCTGAGCCAAAGACTCCAGTTAA TAAGAACCAG CATAAGAGAGAAATACTCACTACGCCAAATCGTTACACTACACCCTCAAAAGCTAGAAAC CAGTGCCTGA AAGAAACTCCAATTAAAATACCAGTAAATTCAACAGGAACAGACAAGTTAATGACAGGTG TCATTAGCCC TGAGAGGCGGTGCCGCTCAGTGGAATTGGATCTCAACCAAGCACATATGGAGGAGACTCC AAAAAGAAAG GGAGCCAAAGTGTTTGGGAGCCTTGAAAGGGGGTTGGATAAGGTTATCACTGTGCTCACC AGGAGCAAAA GGAAGGGTTCTGCCAGAGACGGGCCCAGAAGACTAAAGCTTCACTATAACGTGACTACAA CTAGATTAGT GAATCCAGATCAACTGTTGAATGAAATAATGTCTATTCTTCCAAAGAAGCATGTTGACTT TGTACAAAAG GGTTATACACTGAAGTGTCAAACACAGTCAGATTTTGGGAAAGTGACAATGCAATTTGAA TTAGAAGTGT GCCAGCTTCAAAAACCCGATGTGGTGGGTATCAGGAGGCAGCGGCTTAAGGGCGATGCCT GGGTTTACAA AAGATTAGTGGAAGACATCCTATCTAGCTGCAAGGTATAATTGATGGATTCTTCCATCCT GCCGGATGAG TGTGGGTGTGATACAGCCTACATAAAGACTGTTATGATCGCTTTGATTTTAAAGTTCATT GGAACTACCA ACTTGTTTCTAAAGAGCTATCTTAAGACCAATATCTCTTTGTTTTTAAACAAAAGATATT ATTTTGTGTA TGAATCTAAATCAAGCCCATCTGTCATTATGTTACTGTCTTTTTTAATCATGTGGTTTTG TATATTAATA ATTGTTGACTTTCTTAGATTCACTTCCATATGTGAATGTAAGCTCTTAACTATGTCTCTT TGTAATGTGT AATTTCTTTCTGAAATAAAACCATTTGTGAATATAG

BG765502 GCAGCGGAGGAGCCCAGTCCACGATGGCCCGGTCCCTGGTGTGCCTTGGTGTCATCATCT TGCTGTCTGC

CTTCTCCGGACCTGGTGTCAGGGGTGGTCCTATGCCCAAGCTGGCTGACCGGAAGCTGTG TGCGGACCAG GAGTGCAGCCACCCTATCTCCATGGCTGTGGCCCTTCAGGACTACATGGCCCCCGACTGC CGATTCCTGA CCATTCACCGGGGCCAAGTGGTGTATGTCTTCTCCAAGCTGAAGGGCCGTGGGCGGCTCT TCTGGGGAGG CAGCGTTCAGGGAGATTACTATGGAGATCTGGCTGCTCGCCTGGGCTATTTCCCCAGTAG CATTGTCCGA GAGGACCAGACCCTGAAACCTGGCAAAGTCGATGTGAAGACAGACAAATGGGATTTCTAC TGCCAGTGAG CTCAGCCTACCGCTGGCCCTGCCGTTTCCCCTCCTTGGGTTTATGCAAATACAATCAGCC CAGTGCAAAA AAAAAAAAAAAAAAAAAAAACTTCGGAGAAGAGATAGCAACAAAAGGCCGCTTGTGTGAA GGCGCCAAAA GTTTTCGCCCAAGAGACCTTCGGCCTCCCCCAGGGCGCGCGCAAAGGCGCCTTGTTTTGA CAACCTCTTG GACAACCGGAGGGGCTACCGCCCGGAGACCCCTGTGGTGGACCCCCCGGGCAACCCGGTG TGACAGGGTA CTCACCCCCACGGCTTTGTCGGGGGTCCCACCAAAGGCCCCAAAGAGGCTCTTTCAAGGC ACTATTCCTT GTTGTAGACCTTGTGTGTGCCACAGGCGCCAAAGAAACCTCGGGGGGCTAACAAACGCAC GTGCTTGGCA GCTCCGAGAAGGCTCTCTCCCACCCGAGGGGTGGACGCAACAGGGGGAATGGGCCATCAT ATTGTTGCCC CCGGTGGGCACCAACTCTTTTTCCCCCATAGAGAGGCCTTAGCACACTATGTGGGGCACG TTATTGCCGC CTAGAGAAACCGAGCGCCAGAAAATTTCGAAGGGGGGGGCGCTTCTCATCATTTTGCGCA AAACCCCCTT GTGGGAGTATGCCCCGAACTCCTCTGGAACACACAAGCGACACTTGCGCGGGGTCTGCAA AAAACCTCCT GTTGGGAAGCCGGCTTCACN

NM_002417 TACCGGGCGGAGGTGAGCGCGGCGCCGGCTCCTCCTGCGGCGGACTTTGGGTGCGACTTG ACGAGCGGTG 137

GTTCGACAAGTGGCCTTGCGGGCCGGATCGTCCCAGTGGAAGAGTTGTAAATTTGCTTCT GGCCTTCCCC TACGGATTATACCTGGCCTTCCCCTACGGATTATACTCAACT TACT GTTTAGAAAATGTGGCCCAC GAGA CGCCTGGTTACTATCAAAAGGAGCGGGGTCGACGGTCCCCACTTTCCCCTGAGCCTCAGC ACCTGCTTGT TTGGAAGGGGTATTGAATGTGACATCCGTATCCAGCTTCCTGTTGTGTCAAAACAACATT GCAAAATTGA AATCCATGAGCAGGAGGCAATATTACATAATTTCAGTTCCACAAATCCAACACAAGTAAA TGGGTCTGTT ATTGATGAGCCTGTACGGCTAAAACATGGAGATGTAATAACTATTATTGATCGTTCCTTC AGGTATGAAA ATGAAAGTCTTCAGAATGGAAGGAAGTCAACTGAATTTCCAAGAAAAATACGTGAACAGG AGCCAGCACG TCGTGTCTCAAGATCTAGCTTCTCTTCTGACCCTGATGAGAAAGCTCAAGATTCCAAGGC CTATTCAAAA ATCACTGAAGGAAAAGTTTCAGGAAATCCTCAGGTACATATCAAGAATGTCAAAGAAGAC AGTACCGCAG ATGACTCAAAAGACAGTGTTGCTCAGGGAACAACTAATGTTCATTCCTCAGAACATGCTG GACGTAATGG CAGAAATGCAGCTGATCCCATTTCTGGGGATTTTAAAGAAATTTCCAGCGTTAAATTAGT GAGCCGTTAT GGAGAATTGAAGTCTGTTCCCACTACACAATGTCTTGACAATAGCAAAAAAAATGAATCT CCCTTTTGGA AGCTTTATGAGTCAGTGAAGAAAGAGTTGGATGTAAAATCACAAAAAGAAAATGTCCTAC AGTATTGTAG AAAATCTGGATTACAAACTGATTACGCAACAGAGAAAGAAAGTGCTGATGGTTTACAGGG GGAGACCCAA CTGTTGGTCTCGCGTAAGTCAAGACCAAAATCTGGTGGGAGCGGCCACGCTGTGGCAGAG CCTGCTTCAC CTGAACAAGAGCTTGACCAGAACAAGGGGAAGGGAAGAGACGTGGAGTCTGTTCAGACTC CCAGCAAGGC TGTGGGCGCCAGCTTTCCTCTCTATGAGCCGGCTAAAATGAAGACCCCTGTACAATATTC ACAGCAACAA AATTCTCCACAAAAACATAAGAACAAAGACCTGTATACTACTGGTAGAAGAGAATCTGTG AATCTGGGTA AAAGTGAAGGCTTCAAGGCTGGTGATAAAACTCTTACTCCCAGGAAGCTTTCAACTAGAA ATCGAACACC AGCTAAAGTTGAAGATGCAGCTGACTCTGCCACTAAGCCAGAAAATCTCTCTTCCAAAAC CAGAGGAAGT ATTCCTACAGATGTGGAAGTTCTGCCTACGGAAACTGAAATTCACAATGAGCCATTTTTA ACTCTGTGGC TCACTCAAGTTGAGAGGAAGATCCAAAAGGATTCCCTCAGCAAGCCTGAGAAATTGGGCA CTACAGCTGG ACAGATGTGCTCTGGGTTACCTGGTCTTAGTTCAGTTGATATCAACAACTTTGGTGATTC CATTAATGAG AGTGAGGGAATACCTTTGAAAAGAAGGCGTGTGTCCTTTGGTGGGCACCTAAGACCTGAA CTATTTGATG AAAACTTGCCTCCTAATACGCCTCTCAAAAGGGGAGAAGCCCCAACCAAAAGAAAGTCTC TGGTAATGCA CACTCCACCTGTCCTGAAGAAAATCATCAAGGAACAGCCTCAACCATCAGGAAAACAAGA GTCAGGTTCA GAAATCCATGTGGAAGTGAAGGCACAAAGCTTGGTTATAAGCCCTCCAGCTCCTAGTCCT AGGAAAACTC CAGTTGCCAGTGATCAACGCCGTAGGTCCTGCAAAACAGCCCCTGCTTCCAGCAGCAAAT CTCAGACAGA GGTTCCTAAGAGAGGAGGGAGAAAGAGTGGCAACCTGCCTTCAAAGAGAGTGTCTATCAG CCGAAGTCAA CATGATATTTTACAGATGATATGTTCCAAAAGAAGAAGTGGTGCTTCGGAAGCAAATCTG ATTGTTGCAA AATCATGGGCAGATGTAGTAAAACTTGGTGCAAAACAAACACAAACTAAAGTCATAAAAC ATGGTCCTCA AAGGTCAATGAACAAAAGGCAAAGAAGACCTGCTACTCCAAAGAAGCCTGTGGGCGAAGT TCACAGTCAA TTTAGTACAGGCCACGCAAACTCTCCTTGTACCATAATAATAGGGAAAGCTCATACTGAA AAAGTACATG TGCCTGCTCGACCCTACAGAGTGCTCAACAACTTCATTTCCAACCAAAAAATGGACTTTA AGGAAGATCT TTCAGGAATAGCTGAAATGTTCAAGACCCCAGTGAAGGAGCAACCGCAGTTGACAAGCAC ATGTCACATC GC T AT T T CAAAT T CAGAGAAT T T GC T T GGAAAACAGT T T CAAGGAAC T GAT TC AGGAGAAGAACC T C T GC TCCCCACCTCAGAGAGTTTTGGAGGAAATGTGTTCTTCAGTGCACAGAATGCAGCAAAAC AGCCATCTGA TAAATGCTCTGCAAGCCCTCCCTTAAGACGGCAGTGTATTAGAGAAAATGGAAACGTAGC AAAAACGCCC AGGAAC ACC T ACAAAAT GACTTCTCT GGAGACAAAAAC T T CAGAT AC T GAGAC AGAGCC T T CAAAAACAG TATCCACTGCAAACAGGTCAGGAAGGTCTACAGAGTTCAGGAATATACAGAAGCTACCTG TGGAAAGTAA GAGTGAAGAAACAAATACAGAAATTGTTGAGTGCATCCTAAAAAGAGGTCAGAAGGCAAC ACTACTACAA CAAAGGAGAGAAGGAGAGATGAAGGAAATAGAAAGACCTTTTGAGACATATAAGGAAAAT ATTGAATTAA AAGAAAACGATGAAAAGATGAAAGCAATGAAGAGATCAAGAACTTGGGGGCAGAAATGTG CACCAATGTC TGACCTGACAGACCTCAAGAGCTTGCCTGATACAGAACTCATGAAAGACACGGCACGTGG CCAGAATCTC CTCCAAACCCAAGATCATGCCAAGGCACCAAAGAGTGAGAAAGGCAAAATCACTAAAATG CCCTGCCAGT CATTACAACCAGAACCAATAAACACCCCAACACACACAAAACAACAGTTGAAGGCATCCC TGGGGAAAGT AGGTGTGAAAGAAGAGCTCCTAGCAGTCGGCAAGTTCACACGGACGTCAGGGGAGACCAC GCACACGCAC AGAGAGCCAGCAGGAGATGGCAAGAGCATCAGAACGTTTAAGGAGTCTCCAAAGCAGATC CTGGACCCAG CAGCCCGTGTAACTGGAATGAAGAAGTGGCCAAGAACGCCTAAGGAAGAGGCCCAGTCAC TAGAAGACCT GGCTGGCTTCAAAGAGCTCTTCCAGACACCAGGTCCCTCTGAGGAATCAATGACTGATGA GAAAACTACC AAAATAGCCTGCAAATCTCCACCACCAGAATCAGTGGACACTCCAACAAGCACAAAGCAA TGGCCTAAGA GAAGTCTCAGGAAAGCAGATGTAGAGGAAGAATTCTTAGCACTCAGGAAAC TAACACCATCAGCAGGGAA AGCCATGCTTACGCCCAAACCAGCAGGAGGTGATGAGAAAGACATTAAAGCATTTATGGG AACTCCAGTG CAGAAACTGGACCTGGCAGGAACTTTACCTGGCAGCAAAAGACAGCTACAGACTCCTAAG GAAAAGGCCC AGGCTCTAGAAGACCTGGCTGGCTTTAAAGAGCTCTTCCAGACTCCTGGTCACACCGAGG AATTAGTGGC TGCTGGTAAAACCACTAAAATACCCTGCGACTCTCCACAGTCAGACCCAGTGGACACCCC AACAAGCACA AAGCAACGACCCAAGAGAAGTATCAGGAAAGCAGATGTAGAGGGAGAACTCTTAGCGTGC AGGAATCTAA TGCCATCAGCAGGCAAAGCCATGCACACGCCTAAACCATCAGTAGGTGAAGAGAAAGACA TCATCATATT TGTGGGAACTCCAGTGCAGAAACTGGACCTGACAGAGAACTTAACCGGCAGCAAGAGACG GCCACAAACT CCTAAGGAAGAGGCCCAGGCTCTGGAAGACCTGACTGGCTTTAAAGAGCTCTTCCAGACC CCTGGTCATA CTGAAGAAGCAGTGGCTGCTGGCAAAACTACTAAAATGCCCTGCGAATCTTCTCCACCAG AATCAGCAGA CACCCCAACAAGCACAAGAAGGCAGCCCAAGACACCTTTGGAGAAAAGGGACGTACAGAA GGAGCTCTCA GCCCTGAAGAAGCTCACACAGACATCAGGGGAAACCACACACACAGATAAAGTACCAGGA GGTGAGGATA AAAGCATCAACGCGTTTAGGGAAACTGCAAAACAGAAACTGGACCCAGCAGCAAGTGTAA CTGGTAGCAA GAGGCACCCAAAAACTAAGGAAAAGGCCCAACCCCTAGAAGACCTGGCTGGCTTGAAAGA GCTCTTCCAG ACACCAGTATGCACTGACAAGCCCACGACTCACGAGAAAACTACCAAAATAGCCTGCAGA TCACAACCAG ACCCAGTGGACACACCAACAAGCTCCAAGCCACAGTCCAAGAGAAGTCTCAGGAAAGTGG ACGTAGAAGA AGAATTCTTCGCACTCAGGAAACGAACACCATCAGCAGGCAAAGCCATGCACACACCCAA ACCAGCAGTA AGTGGTGAGAAAAACATCTACGCATTTATGGGAACTCCAGTGCAGAAACTGGACCTGACA GAGAACTTAA CTGGCAGCAAGAGACGGCTACAAACTCCTAAGGAAAAGGCCCAGGCTCTAGAAGACCTGG CTGGCTTTAA AGAGCTCTTCCAGACACGAGGTCACACTGAGGAATCAATGACTAACGATAAAACTGCCAA AGTAGCCTGC AAATCTTCACAACCAGACCCAGACAAAAACCCAGCAAGCTCCAAGCGACGGCTCAAGACA TCCCTGGGGA AAGTGGGCGTGAAAGAAGAGCTCCTAGCAGTTGGCAAGCTCACACAGACATCAGGAGAGA CTACACACAC ACACACAGAGCCAACAGGAGATGGTAAGAGCATGAAAGCATTTATGGAGTCTCCAAAGCA GATCTTAGAC TCAGCAGCAAGTCTAACTGGCAGCAAGAGGCAGCTGAGAACTCCTAAGGGAAAGTCTGAA GTCCCTGAAG ACCTGGCCGGCTTCATCGAGCTCTTCCAGACACCAAGTCACACTAAGGAATCAATGACTA ACGAAAAAAC TACCAAAGTATCCTACAGAGCTTCACAGCCAGACCTAGTGGACACCCCAACAAGCTCCAA GCCACAGCCC AAGAGAAGT C T CAGGAAAGCAGACAC T GAAGAAGAAT T T T T AGCAT T T AGGAAACAAAC GCCAT CAGCAG GCAAAGCCATGCACACACCCAAACCAGCAGTAGGTGAAGAGAAAGACATCAACACGTTTT TGGGAACTCC AGTGCAGAAACTGGACCAGCCAGGAAATTTACCTGGCAGCAATAGACGGCTACAAACTCG TAAGGAAAAG GCCCAGGCTCTAGAAGAACTGACTGGCTTCAGAGAGCTTTTCCAGACACCATGCACTGAT AACCCCACGA CTGATGAGAAAACTACCAAAAAAATACTCTGCAAATCTCCGCAATCAGACCCAGCGGACA CCCCAACAAA CACAAAGCAACGGCCCAAGAGAAGCC TCAAGAAAGCAGACGTAGAGGAAGAAT T T T TAGCAT TCAGGAAA CTAACACCATCAGCAGGCAAAGCCATGCACACGCCTAAAGCAGCAGTAGGTGAAGAGAAA GACATCAACA CATTTGTGGGGACTCCAGTGGAGAAACTGGACCTGCTAGGAAATTTACCTGGCAGCAAGA GACGGCCACA AACTCCTAAAGAAAAGGCCAAGGCTCTAGAAGATCTGGCTGGCTTCAAAGAGCTCTTCCA GACACCAGGT CACACTGAGGAATCAATGACCGATGACAAAATCACAGAAGTATCCTGCAAATCTCCACAA CCAGACCCAG TCAAAACCCCAACAAGCTCCAAGCAACGACTCAAGATATCCTTGGGGAAAGTAGGTGTGA AAGAAGAGGT CCTACCAGTCGGCAAGCTCACACAGACGTCAGGGAAGACCACACAGACACACAGAGAGAC AGCAGGAGAT GGAAAGAGCATCAAAGCGTTTAAGGAATCTGCAAAGCAGATGCTGGACCCAGCAAACTAT GGAACTGGGA TGGAGAGGTGGCCAAGAACACCTAAGGAAGAGGCCCAATCACTAGAAGACCTGGCCGGCT TCAAAGAGCT CTTCCAGACACCAGACCACACTGAGGAATCAACAACTGATGACAAAACTACCAAAATAGC CTGCAAATCT CCACCACCAGAATCAATGGACACTCCAACAAGCACAAGGAGGCGGCCCAAAACACCTTTG GGGAAAAGGG ATATAGTGGAAGAGCTCTCAGCCCTGAAGCAGCTCACACAGACCACACACACAGACAAAG TACCAGGAGA TGAGGATAAAGGCATCAACGTGTTCAGGGAAACTGCAAAACAGAAACTGGACCCAGCAGC AAGTGTAACT GGTAGCAAGAGGCAGCCAAGAACTCCTAAGGGAAAAGCCCAACCCCTAGAAGACTTGGCT GGCTTGAAAG AGCTCTTCCAGACACCAATATGCACTGACAAGCCCACGACTCATGAGAAAACTACCAAAA TAGCCTGCAG ATCTCCACAACCAGACCCAGTGGGTACCCCAACAATCTTCAAGCCACAGTCCAAGAGAAG TCTCAGGAAA GCAGACGTAGAGGAAGAATCCTTAGCACTCAGGAAACGAACACCATCAGTAGGGAAAGCT ATGGACACAC CCAAACCAGCAGGAGGTGATGAGAAAGACATGAAAGCATTTATGGGAACTCCAGTGCAGA AATTGGACCT GCCAGGAAATTTACCTGGCAGCAAAAGATGGCCACAAACTCCTAAGGAAAAGGCCCAGGC TCTAGAAGAC CTGGCTGGCTTCAAAGAGCTCTTCCAGACACCAGGCACTGACAAGCCCACGACTGATGAG AAAACTACCA AAATAGCCTGCAAATCTCCACAACCAGACCCAGTGGACACCCCAGCAAGCACAAAGCAAC GGCCCAAGAG AAACCTCAGGAAAGCAGACGTAGAGGAAGAATTTTTAGCACTCAGGAAACGAACACCATC AGCAGGCAAA GCCATGGACACACCAAAACCAGCAGTAAGTGATGAGAAAAATATCAACACATTTGTGGAA ACTCCAGTGC AGAAACTGGACCTGCTAGGAAATTTACCTGGCAGCAAGAGACAGCCACAGACTCCTAAGG AAAAGGCTGA GGCTCTAGAGGACCTGGTTGGCTTCAAAGAACTCTTCCAGACACCAGGTCACACTGAGGA ATCAATGACT GATGACAAAATCACAGAAGTATCCTGTAAATCTCCACAGCCAGAGTCATTCAAAACCTCA AGAAGCTCCA AGCAAAGGCTCAAGATACCCCTGGTGAAAGTGGACATGAAAGAAGAGCCCCTAGCAGTCA GCAAGCTCAC ACGGACATCAGGGGAGACTACGCAAACACACACAGAGCCAACAGGAGATAGTAAGAGCAT CAAAGCGTTT AAGGAGTCTCCAAAGCAGATCCTGGACCCAGCAGCAAGTGTAACTGGTAGCAGGAGGCAG CTGAGAACTC GTAAGGAAAAGGCCCGTGCTCTAGAAGACCTGGTTGACTTCAAAGAGCTCTTCTCAGCAC CAGGTCACAC TGAAGAGTCAATGACTATTGACAAAAACACAAAAATTCCCTGCAAATCTCCCCCACCAGA ACTAACAGAC ACTGCCACGAGCACAAAGAGATGCCCCAAGACACGTCCCAGGAAAGAAGTAAAAGAGGAG CTCTCAGCAG T T GAGAGGC T CAC GCAAAC AT CAGGGCAAAGCACACACACACACAAAGAAC CAGCAAGC GGT GAT GAGGG CATCAAAGTATTGAAGCAACGTGCAAAGAAGAAACCAAACCCAGTAGAAGAGGAACCCAG CAGGAGAAGG CCAAGAGCACCTAAGGAAAAGGCCCAACCCCTGGAAGACCTGGCCGGCTTCACAGAGCTC TCTGAAACAT CAGGTCACACTCAGGAATCACTGACTGCTGGCAAAGCCACTAAAATACCCTGCGAATCTC CCCCACTAGA AGTGGTAGACACCACAGCAAGCACAAAGAGGCATCTCAGGACACGTGTGCAGAAGGTACA AGTAAAAGAA GAGCCTTCAGCAGTCAAGTTCACACAAACATCAGGGGAAACCACGGATGCAGACAAAGAA CCAGCAGGTG AAGATAAAGGCATCAAAGCATTGAAGGAATCTGCAAAACAGACACCGGCTCCAGCAGCAA GTGTAACTGG CAGCAGGAGACGGCCAAGAGCACCCAGGGAAAGTGCCCAAGCCATAGAAGACCTAGCTGG CTTCAAAGAC CCAGCAGCAGGTCACACTGAAGAATCAATGACTGATGACAAAACCACTAAAATACCCTGC AAATCATCAC CAGAACTAGAAGACACCGCAACAAGCTCAAAGAGACGGCCCAGGACACGTGCCCAGAAAG TAGAAGTGAA GGAGGAGCTGTTAGCAGTTGGCAAGCTCACACAAACCTCAGGGGAGACCACGCACACCGA CAAAGAGCCG GTAGGT GAGGGCAAAGGCACGAAAGCAT T TAAGCAACCT GCAAAGCGGAAGCT GGACGCAGAAGAT GTAA TTGGCAGCAGGAGACAGCCAAGAGCACCTAAGGAAAAGGCCCAACCCCTGGAAGATCTGG CCAGCTTCCA AGAGCTCTCTCAAACACCAGGCCACACTGAGGAACTGGCAAATGGTGCTGCTGATAGCTT TACAAGCGCT CCAAAGCAAACACCTGACAGTGGAAAACCTCTAAAAATATCCAGAAGAGTTCTTCGGGCC CCTAAAGTAG AACCCGTGGGAGACGTGGTAAGCACCAGAGACCCTGTAAAATCACAAAGCAAAAGCAACA CTTCCCTGCC CCCACTGCCCTTCAAGAGGGGAGGTGGCAAAGATGGAAGCGTCACGGGAACCAAGAGGCT GCGCTGCATG CCAGCACCAGAGGAAATTGTGGAGGAGCTGCCAGCCAGCAAGAAGCAGAGGGTTGCTCCC AGGGCAAGAG GCAAATCATCCGAACCCGTGGTCATCATGAAGAGAAGTTTGAGGACTTCTGCAAAAAGAA TTGAACCTGC GGAAGAGCTGAACAGCAACGACATGAAAACCAACAAAGAGGAACACAAATTACAAGACTC GGTCCCTGAA AATAAGGGAATATCCCTGCGCTCCAGACGCCAAAATAAGACTGAGGCAGAACAGCAAATA ACTGAGGTCT TTGTATTAGCAGAAAGAATAGAAATAAACAGAAATGAAAAGAAGCCCATGAAGACCTCCC CAGAGATGGA CATTCAGAATCCAGATGATGGAGCCCGGAAACCCATACCTAGAGACAAAGTCACTGAGAA CAAAAGGTGC TTGAGGTCTGCTAGACAGAATGAGAGCTCCCAGCCTAAGGTGGCAGAGGAGAGCGGAGGG CAGAAGAGTG CGAAGGTTCTCATGCAGAATCAGAAAGGGAAAGGAGAAGCAGGAAATTCAGACTCCATGT GCCTGAGATC AAGAAAGACAAAAAGCCAGCCTGCAGCAAGCACTTTGGAGAGCAAATCTGTGCAGAGAGT AACGCGGAGT GTCAAGAGGTGTGCAGAAAATCCAAAGAAGGCTGAGGACAATGTGTGTGTCAAGAAAATA AGAACCAGAA GTCATAGGGACAGTGAAGATATTTGACAGAAAAATCGAACTGGGAAAAATATAATAAAGT TAGTTTTGTG ATAAGTTCTAGTGCAGTTTTTGTCATAAATTACAAGTGAATTCTGTAAGTAAGGCTGTCA GTCTGCTTAA GGGAAGAAAACTTTGGATTTGCTGGGTCTGAATCGGCTTCATAAACTCCACTGGGAGCAC TGCTGGGCTC CTGGACTGAGAATAGTTGAACACCGGGGGCTTTGTGAAGGAGTCTGGGCCAAGGTTTGCC CTCAGCTTTG CAGAATGAAGCCTTGAGGTCTGTCACCACCCACAGCCACCCTACAGCAGCCTTAACTGTG ACACTTGCCA CACTGTGTCGTCGTTTGTTTGCCTATGTCCTCCAGGGCACGGTGGCAGGAACAACTATCC TCGTCTGTCC CAACACTGAGCAGGCACTCGGTAAACACGAATGAATGGATGAGCGCACGGATGAATGGAG CTTACAAGAT CTGTCTTTCCAATGGCCGGGGGCATTTGGTCCCCAAATTAAGGCTATTGGACATCTGCAC AGGACAGTCC TATTTTTGATGTCCTTTCCTTTCTGAAAATAAAGTTTTGTGCTTTGGAGAATGACTCGTG AGCACATCTT TAGGGACCAAGAGTGACTTTCTGTAAGGAGTGACTCGTGGCTTGCCTTGGTCTCTTGGGA ATACTTTTCT AACTAGGGTTGCTCTCACCTGAGACATTCTCCACCCGCGGAATCTCAGGGTCCCAGGCTG TGGGCCATCA CGACCTCAAACTGGCTCCTAATCTCCAGCTTTCCTGTCATTGAAAGCTTCGGAAGTTTAC TGGCTCTGCT CCCGCCTGTTTTCTTTCTGACTCTATCTGGCAGCCCGATGCCACCCAGTACAGGAAGTGA CACCAGTACT CTGTAAAGCATCATCATCCTTGGAGAGACTGAGCACTCAGCACCTTCAGCCACGATTTCA GGATCGCTTC CTTGTGAGCCGCTGCCTCCGAAATCTCCTTTGAAGCCCAGACATCTTTCTCCAGCTTCAG ACTTGTAGAT ATAACTCGTTCATCTTCATTTACTTTCCACTTTGCCCCCTGTCCTCTCTGTGTTCCCCAA ATCAGAGAAT AGCCCGCCATCCCCCAGGTCACCTGTCTGGATTCCTCCCCATTCACCCACCTTGCCAGGT GCAGGTGAGG ATGGTGCACCAGACAGGGTAGCTGTCCCCCAAAATGTGCCCTGTGCGGGCAGTGCCCTGT CTCCACGTTT GTTTCCCCAGTGTCTGGCGGGGAGCCAGGTGACATCATAAATACTTGCTGAATGAATGCA GAAATCAGCG GTACTGACTTGTACTATATTGGCTGCCATGATAGGGTTCTCACAGCGTCATCCATGATCG TAAGGGAGAA T GACAT T C T GC T T GAGGGAGGGAAT AGAAAGGGGCAGGGAGGGGAC AT C T GAGGGC T T C ACAGGGC TGCA AAGGGTACAGGGATTGCACCAGGGCAGAACAGGGGAGGGTGTTCAAGGAAGAGTGGCTCT TAGCAGAGGC ACTTTGGAAGGTGTGAGGCATAAATGCTTCCTTCTACGTAGGCCAACCTCAAAACTTTCA GTAGGAATGT TGCTATGATCAAGTTGTTCTAACACTTTAGACTTAGTAGTAATTATGAACCTCACATAGA AAAATTTCAT CCAGCCATATGCCTGTGGAGTGGAATATTCTGTTTAGTAGAAAAATCCTTTAGAGTTCAG CTCTAACCAG AAATCTTGCTGAAGTATGTCAGCACCTTTTCTCACCCTGGTAAGTACAGTATTTCAAGAG CACGCTAAGG GTGGTTTTCATTTTACAGGGCTGTTGATGATGGGTTAAAAATGTTCATTTAAGGGCTACC CCCGTGTTTA ATAGATGAACACCACTTCTACACAACCCTCCTTGGTACTGGGGGAGGGAGAGATCTGACA AATACTGCCC ATTCCCCTAGGCTGACTGGATTTGAGAACAAATACCCACCCATTTCCACCATGGTATGGT AACTTCTCTG AGCTTCAGTTTCCAAGTGAATTTCCATGTAATAGGACATTCCCATTAAATACAAGCTGTT TTTACTTTTT CGCCTCCCAGGGCCTGTGGGATCTGGTCCCCCAGCCTCTCTTGGGCTTTCTTACACTAAC TCTGTACCTA CCATCTCCTGCCTCCCTTAGGCAGGCACCTCCAACCACCACACACTCCCTGCTGTTTTCC CTGCCTGGAA CTTTCCCTCCTGCCCCACCAAGATCATTTCATCCAGTCCTGAGCTCAGCTTAAGGGAGGC TTCTTGCCTG TGGGTTCCCTCACCCCCATGCCTGTCCTCCAGGCTGGGGCAGGTTCTTAGTTTGCCTGGA ATTGTTCTGT ACCTCTTTGTAGCACGTAGTGTTGTGGAAACTAAGCCACTAATTGAGTTTCTGGCTCCCC TCCTGGGGTT GTAAGTTTTGTTCATTCATGAGGGCCGACTGCATTTCCTGGTTACTCTATCCCAGTGACC AGCCACAGGA GATGTCCAATAAAGTATGTGATGAAATGGTCTTAAAAAAAAAAAAAA

NM_024101 GCGCCGGGACGTGGCCAGTTGCCCGCCTGCCCCGGAGAGCCAGGCGCTAACCAGCCGCTC TGCGCCCCGC 138

GCCCTGCTTGCCCCCATTATCCAGCCTTGCCCCGGCGCCCTGACCTGACGCCCTGGCCTG ACGCCCTGCT TCGTCGCCTCCTTTCTCTCCCAGGTGCTGGACCAGGGACTGAGCGTCCCCCGGAGAGGGT CCGGTGTGAC CCCGACAAGAAGCAGAAATGGGGAAGAAACTGGATCTTTCCAAGCTCACTGATGAAGAGG CCCAGCATGT CTTGGAAGTTGTTCAACGAGATTTTGACCTCCGAAGGAAAGAAGAGGAACGGCTAGAGGC GTTGAAGGGC AAGATTAAGAAGGAAAGCTCCAAGAGGGAGCTGCTTTCCGACACTGCCCATCTGAACGAG ACCCACTGCG CCCGCTGCCTGCAGCCCTACCAGCTGCTTGTGAATAGCAAAAGGCAGTGCCTGGAATGTG GCCTCTTCAC CTGCAAAAGCTGTGGCCGCGTCCACCCGGAGGAGCAGGGCTGGATCTGTGACCCCTGCCA TCTGGCCAGA GTCGTGAAGATCGGCTCACTGGAGTGGTACTATGAGCATGTGAAAGCCCGCTTCAAGAGG TTCGGAAGTG CCAAGGTCATCCGGTCCCTCCACGGGCGGCTGCAGGGTGGAGCTGGGCCTGAACTGATAT CTGAAGAGAG AAGTGGAGACAGCGACCAGACAGATGAGGATGGAGAACCTGGCTCAGAGGCCCAGGCCCA GGCCCAGCCC TTTGGCAGCAAAAAAAAGCGCCTCCTCTCCGTCCACGACTTCGACTTCGAGGGAGACTCA GATGACTCCA CTCAGCCTCAAGGTCACTCCCTGCACCTGTCCTCAGTCCCTGAGGCCAGGGACAGCCCAC AGTCCCTCAC AGATGAGTCCTGCTCAGAGAAGGCAGCCCCTCACAAGGCTGAGGGCCTGGAGGAGGCTGA TACTGGGGCC TCTGGGTGCCACTCCCATCCGGAAGAGCAGCCGACCAGCATCTCACCTTCCAGACACGGC GCCCTGGCTG AGCTCTGCCCGCCTGGAGGCTCCCACAGGATGGCCCTGGGGACTGCTGCTGCACTCGGGT CGAATGTCAT CAGGAATGAGCAGCTGCCCCTGCAGTACTTGGCCGATGTGGACACCTCTGATGAGGAAAG CATCCGGGCT CACGTGATGGCCTCCCACCATTCCAAGCGGAGAGGCCGGGCGTCTTCTGAGAGTCAGATC TTTGAGCTGA ATAAGCATATTTCAGCTGTGGAATGCCTGCTGACCTACCTGGAGAACACAGTTGTGCCTC CCTTGGCCAA GGGTCTAGGTGCTGGAGTGCGCACGGAGGCCGATGTAGAGGAGGAGGCCCTGAGGAGGAA GCTGGAGGAG CTGACCAGCAACGTCAGTGACCAGGAGACCTCGTCCGAGGAGGAGGAAGCCAAGGACGAA AAGGCAGAGC CCAACAGGGACAAATCAGTTGGGCCTCTCCCCCAGGCGGACCCGGAGGTGGGCACGGCTG CCCATCAAAC CAACAGACAGGAAAAAAGCCCCCAGGACCCTGGGGACCCCGTCCAGTACAACAGGACCAC AGATGAGGAG CTGTCAGAGCTGGAGGACAGAGTGGCAGTGACGGCCTCAGAAGTCCAGCAGGCAGAGAGC GAGGTTTCAG ACATTGAATCCAGGATTGCAGCCCTGAGGGCCGCAGGGCTCACGGTGAAGCCCTCGGGAA AGCCCCGGAG GAAGTCAAACCTCCCGATATTTCTCCCTCGAGTGGCTGGGAAACTTGGCAAGAGACCAGA GGACCCAAAT GCAGACCCTTCAAGTGAGGCCAAGGCAATGGCTGTGCCCTATCTTCTGAGAAGAAAGTTC AGTAATTCCC TGAAAAGTCAAGGTAAAGATGATGATTCTTTTGATCGGAAATCAGTGTACCGAGGCTCGC TGACACAGAG AAACCCCAACGCGAGGAAAGGAATGGCCAGCCACACCTTCGCGAAACCTGTGGTGGCCCA CCAGTCCTAA CGGGACAGGACAGAGAGACAGAGCAGCCCTGCACTGTTTTCCCTCCACCACAGCCATCCT GTCCCTCATT GGCTCTGTGCTTTCCACTATACACAGTCACCGTCCCAATGAGAAACAAGAAGGAGCACCC TCCACATGGA CTCCCACCTGCAAGTGGACAGCGACATTCAGTCCTGCACTGCTCACCTGGGTTTACTGAT GACTCCTGGC TGCCCCACCATCCTCTCTGATCTGTGAGAAACAGCTAAGCTGCTGTGACTTCCCTTTAGG ACAATGTTGT GTAAATCTTTGAAGGACACACCGAAGACCTTTATACTGTGATCTTTTACCCCTTTCACTC TTGGCTTTCT TATGTTGCTTTCATGAATGGAATGGAAAAAAGATGACTCAGTTAAGGCACCAGCCATATG TGTATTCTTG ATGGTCTATATCGGGGTGTGAGCAGATGTTTGCGTATTTCTTGTGGGTGTGACTGGATAT TAGACATCCG GACAAGTGACTGAACTAATGATCTGCTGAATAATGAAGGAGGAATAGACACCCCAGTCCC CACCCTACGT GCACCCGCTCTGCAAGTTCCCATGTGATCTGTAGACCAGGGGAAATTACACTGCGGTCAA GGGCAGAGCC TGCACATGACAGCAAGTGAGCATTTGATAGATGCTCAGATGCTAGTGCAGAGAGCCTGCT GGGAGACGAA GAGACAGCAGGCAGAGCTCCAGATGGGCAAGGAAGAGGCTTGGTTCTAGCCTGGCTCTGC CCCTCACTGC AGTGGATCCAGTGGGGCAGAGGACAGAGGGTCACAACCAATGAGGGATGTCTGCCAAGGA TGGGGGTGCA GAGGCCACAGGAGTCAGCTTGCCACTCGCCCATTGGTTACATAGATGATCTCTCAGACAG GCTGGGACTC AGAGTTATTTCCTAGTATCGGTGTGCCCCATCCAGTTTTAAGTGGAGCCCTCCAAGACTC TCCAGAGCTG CCTTTGAACATCCTAACAGTAATCACATCTCACCCTCCCTGAGGTTCACTTTAGACAGGA CCCAATGGCT GCACTGCCTTTGTCAGAGGGGGTGCTGAGAGGAGTGGCTTCTTTTAGAATCAAACAGTAG AGACAAGAGT CAAGCCTTGTGTCTTCAAGCATTGACCAAGTTAAGTGTTTCCTTCCCTCTCTCAATAAGA CACTTCCAGG AGCTTTCCAATCTCTCACTTAAAACTAAGGTTTGAATCTCAAAGTGTTGCTGGGAGGCTG ATACTCCTGC AACTTCAGGAGACCTGTGAGCACACATTAGCAGCTGTTTCTCTGACTCCTTGTGGCATCA GATAAAAACG TGGGAGTTTTTCCATATAATTCCCAGCCTTACTTATAAATTCTATTCTTTGAAAAAATTA TTCAGGCTAG GTAAGGTGGCTCATACCTATAATCCCAGCCCTTTGAGAGGCCAAGGTGGGAGAATTGCTT GAGGCCAGGA GTTTGAGACCTCCTGGGCAACATAGTGAGATCCCATCTCTACAAAAAACAAAACAAAAAA ATTACCCAAG CATGATGGTATATGCCTGTAGTCGTACCTACTTACTTAGGAGGCTGAGGCAGGAGGATCA CTTGAGCCCT GGAGGTTGGGGCTGCAGTGAGCCATGATCGCATCACTATACTCGAGCCTGGGCAACAGAG TGAGACCTTG T C T C T T AAAAAAAT T AAT AAT AAAT AAAT GAAAAT AAT T C T T CAGAAAAAAAAAAAAAAAA

NM_005940 AAGCCCAGCAGCCCCGGGGCGGATGGCTCCGGCCGCCTGGCTCCGCAGCGCGGCCGCGCG CGCCCTCCTG 139

CCCCCGATGCTGCTGCTGCTGCTCCAGCCGCCGCCGCTGCTGGCCCGGGCTCTGCCGCCG GACGCCCACC ACCTCCATGCCGAGAGGAGGGGGCCACAGCCCTGGCATGCAGCCCTGCCCAGTAGCCCGG CACCTGCCCC TGCCACGCAGGAAGCCCCCCGGCCTGCCAGCAGCCTCAGGCCTCCCCGCTGTGGCGTGCC CGACCCATCT GATGGGCTGAGTGCCCGCAACCGACAGAAGAGGTTCGTGCTTTCTGGCGGGCGCTGGGAG AAGACGGACC TCACCTACAGGATCCTTCGGTTCCCATGGCAGTTGGTGCAGGAGCAGGTGCGGCAGACGA TGGCAGAGGC CCTAAAGGTATGGAGCGATGTGACGCCACTCACCTTTACTGAGGTGCACGAGGGCCGTGC TGACATCATG ATCGACTTCGCCAGGTACTGGCATGGGGACGACCTGCCGTTTGATGGGCCTGGGGGCATC CTGGCCCATG CCTTCTTCCCCAAGACTCACCGAGAAGGGGATGTCCACTTCGACTATGATGAGACCTGGA CTATCGGGGA TGACCAGGGCACAGACCTGCTGCAGGTGGCAGCCCATGAATTTGGCCACGTGCTGGGGCT GCAGCACACA ACAGCAGCCAAGGCCCTGATGTCCGCCTTCTACACCTTTCGCTACCCACTGAGTCTCAGC CCAGATGACT GCAGGGGCGTTCAACACCTATATGGCCAGCCCTGGCCCACTGTCACCTCCAGGACCCCAG CCCTGGGCCC CCAGGCTGGGATAGACACCAATGAGATTGCACCGCTGGAGCCAGACGCCCCGCCAGATGC CTGTGAGGCC TCCTTTGACGCGGTCTCCACCATCCGAGGCGAGCTCTTTTTCTTCAAAGCGGGCTTTGTG TGGCGCCTCC GTGGGGGCCAGCTGCAGCCCGGCTACCCAGCATTGGCCTCTCGCCACTGGCAGGGACTGC CCAGCCCTGT GGACGCTGCCTTCGAGGATGCCCAGGGCCACATTTGGTTCTTCCAAGGTGCTCAGTACTG GGTGTACGAC GGTGAAAAGCCAGTCCTGGGCCCCGCACCCCTCACCGAGCTGGGCCTGGTGAGGTTCCCG GTCCATGCTG CCTTGGTCTGGGGTCCCGAGAAGAACAAGATCTACTTCTTCCGAGGCAGGGACTACTGGC GTTTCCACCC CAGCACCCGGCGTGTAGACAGTCCCGTGCCCCGCAGGGCCACTGACTGGAGAGGGGTGCC CTCTGAGATC GACGCTGCCTTCCAGGATGCTGATGGCTATGCCTACTTCCTGCGCGGCCGCCTCTACTGG AAGTTTGACC CTGTGAAGGTGAAGGCTCTGGAAGGCTTCCCCCGTCTCGTGGGTCCTGACTTCTTTGGCT GTGCCGAGCC TGCCAACACTTTCCTCTGACCATGGCTTGGATGCCCTCAGGGGTGCTGACCCCTGCCAGG CCACGAATAT CAGGCTAGAGACCCATGGCCATCTTTGTGGCTGTGGGCACCAGGCATGGGACTGAGCCCA TGTCTCCTCA GGGGGATGGGGTGGGGTACAACCACCATGACAACTGCCGGGAGGGCCACGCAGGTCGTGG TCACCTGCCA GCGACTGTCTCAGACTGGGCAGGGAGGCTTTGGCATGACTTAAGAGGAAGGGCAGTCTTG GGCCCGCTAT GCAGGTCCTGGCAAACCTGGCTGCCCTGTCTCCATCCCTGTCCCTCAGGGTAGCACCATG GCAGGACTGG GGGAACTGGAGTGTCCTTGCTGTATCCCTGTTGTGAGGTTCCTTCCAGGGGCTGGCACTG AAGCAAGGGT GCTGGGGCCCCATGGCCTTCAGCCCTGGCTGAGCAACTGGGCTGTAGGGCAGGGCCACTT CCTGAGGTCA GGTCTTGGTAGGTGCCTGCATCTGTCTGCCTTCTGGCTGACAATCCTGGAAATCTGTTCT CCAGAATCCA GGCCAAAAAGTTCACAGTCAAATGGGGAGGGGTATTCTTCATGCAGGAGACCCCAGGCCC TGGAGGCTGC AACATACCTCAATCCTGTCCCAGGCCGGATCCTCCTGAAGCCCTTTTCGCAGCACTGCTA TCCTCCAAAG CCATTGTAAATGTGTGTACAGTGTGTATAAACCTTCTTCTTCTTTTTTTTTTTTTAAACT GAGGATTGTC

BX647151 TAGCAGCACACAAGGGTTCGTGTTTGTGGAACCAGGTAGCTTCCTTCAGAGCTGACATTT GCCCACAGCC 140

AGCCTGGCCCAGCCCCATACCACCAGCCCTGGCGCTCTGGGGCGTGAGGTGCCTTTTCTG CCCCCCTGCT CTAGGGCAGGTGGAAATCACCCATGGTGGGTCTACATCTGATAGAAGCATCTTATAGTTC TGCTTCTGGA CCAGACCATCCTGGGTTTTTCTCTGTTCTGCTGAAGGGTTCCCTCCACGTGTCCATCACC TCGGTGAACT CTTGGGAGACCTGGGAAGATGCTGGCCTCACCTCTCGCCTCTCCTTTCCCTCATTGTGCT GCCACCATCC TTCTCACACAGGCTCTCCAGGGAGAGCTGGGCAGGATGGGATCTTCCTGGGTTCCCACCT TGCTCCGTGC CCCCTCTCACTGTTCCTGAAGTGTGGCCACGGACTGCCTTGTTTTCTGGAAAGTCCCAAG TCTGGACCAT GACTGAGCAGCATTCTCGGCTATCTGCCACCTGTCTGGGGCTCCTGGCCCCTCTTAGACT CCCCTCTCCC TTCTGTTTCCCCCGAGCCCCTGACTTGGACCTGCAGGGTGGGGAGAGGGATGGGACGAGA ACCTGTGCTG GGGCCAAAGGTCGCACTGGGGGAAGGTGGAGCCAGGGCAGCAGAGTGCCTGGCGTCGGCC CCTATCCTGT CACTAGTTCCCCCGTTCTGGCCCCTGGCAGGTTTGTAACCCCAGATCAGAAGTACTCCAT GGACAACACT CCCCACACGCCAACCCCGTTCAAGAACGCCCTGGAGAAGTACGGACCCCTGAAGCCCCTG GTACGTGGTG TGGTCACTGCCGTGGATCTCTGCACAGTGGGATCCCTTCGGTTCATCCAACCATGTTCAG TCCACAGGAC CCTTCCCTCTGAGGTCTCATTTGATTCTTTCTCCTGAGAAGATGCAGAGATCCTGATAAT ATAAATGGGG AAGCTGAGGCTGCTCTTTGTCACTTCCTCCGACTGCTCCTGAGCACCTGAGTTTGCAAGC ACGCGCCGGC TGGTGCTAGAGACATGGTGGTATCCCGTGACACTCAGCCTCAGGATGGGGGAGACTGATG TGAAATACAA ATAACTTAAACACTTTCAGGCAAAGATAAGCACTGGGCCTAGTTCAGAGAAGTGGCAAAT TGCTACTCTG GCCTGTCTCTGACCAACTCCCAGTTCTCTACAGAGCACGGGAAAGCCCCTCGGGGACGTC TTTCCTGCAG TGTGCAGGCTGCCCTTCTCCCCTGCTCTTCCCAGTTGATGGGATGGTTGTGTTTTCTCTA TGAAAAAAGG AGTTGGCACCTTGGGCTTTCTGAAACACACAGGTGTTTTAGAAATCAGTGGAGGGTGAGA GAAAGGCATG GTTGTGGAGGCACTGGACTGTGAACAAGGTCTGCAGCGGGTCCCCCTGCTGTCTCTCTCT ACTGCATGGA GCCTCCTATGAAGCCCAAGGTGGCTGGGGGCTGAGGCTCCCTTGGGCCTGCCATGGAACT GATTCTGAGT CAAGCAGACTTTCCACGGACCATGCTACATGAGCCGAGGTGAGGCACTAGTTAGTGCTCC TTTCCTGTTG CAGTGGAGATTTGGCTCCTCTGTACTAAAATATCTGCATGCTCTCCAAACAGGTGTGAGG GCAAATCACA TGACCTTGGCAGCTGTAATTAAAGTTTGTGGGGGCTTTTCGGATGACTTATGAGGAGTGG CTGTGATTCG CACCTTTCACTCTTAGTAGCACTCGCCCTCCCCTGTTCTCTGTTGCCTGAAGCTGGAGAG GTCCTTGGAA CCCCGAGGCCTGAGAAAGGGAAATGGGTTTGAGAGCCCCCATTAGTGTGGAACAAAGGGT TGAGTGAGCC TGGGCTTTGAGCTGTCGGGGTCCTAATTCAGCAGCTGTGTGACTGTGTGCCAGGCTGTTG ATCTCTGAGC TTCTGTTTCTACCTGCTTAAAATGACGGTTACTGCACAGGGCTGTGTGAGGGTTACAGTG CGTCTCTGGG CTGCTCCCAGCCATGGCAGGCCCCTGGGAATCAAGGTCATCAGCTGCTTGTCCAAGGCAG CAGTTAGTGG TTGTGAATGGTGCGTGTGAGATCTGCATCCTGGCGTCAGGCCTCCTTCCTGCCTTACCCA GGACAGCCCA GTTGCAGCTGGGTTGGTCCCACAGTCCCACACACACACAGCCCGAGTGTGGTGCCTCACG TGGGCTGCCC CGTGCCTACCCACAGCCACAGACCCCGCACCTGGAGGAGGACTTGAAGGAGGTGCTGCGT TCTGAGGCTG GCATCGAACTCATCATCGAGGACGACATCAGGCCCGAGAAGCAGAAGAGGAAGCCTGGGC TGCGGCGGAG CCCCATCAAGAAAGTCCGGAAGTCTCTGGCTCTTGACATTGTGGATGAGGATGTGAAGCT GATGATGTCC ACACTGCCCAAGTCTCTATCCTTGCCGACAACTGCCCCTTCAAACTCTTCCAGCCTCACC CTGTCAGGTA TCAAAGAAGACAACAGCTTGCTCAACCAGGGCTTCTTGCAGGCCAAGCCCGAGAAGGCAG CAGTGGCCCA GAAGCCCCGAAGCCACTTCACGACACCTGCCCCTATGTCCAGTGCCTGGAAGACGGTGGC CTGCGGGGGG ACCAGGGACCAGCTTTTCATGCAGGAGAAAGCCCGGCAGCTCCTGGGCCGCCTGAAGCCC AGCCACACAT CTCGGACCCTCATCTTGTCCTGAGGTGTTGAGGGTGTCACGAGCCCATTCACATGTTTAC AGGGGTTGTG GGGGCAGAGGGGGTCTGTGAATCTGAGAGTCATTCAGGTGACCTCCTGCAGGGAGCCTTC TGCCACCAGC CCCTCCCCAGACTCTCAGGTGGAGGCAACAGGGCCATGTGCTGCCCTGTTGCCGAGCCCA GCTGTGGGCG GCTCCTGGTGCTAACAACAAAGTTCCACTTCCAGGTCTGCCTGGTTCCCCCCCCAAGGCC ACAGGGAGCT CCGTCAGCTTCTCCCAAGCCCACGTCAGGCCTGGCCTCATCTCAGACCCTGCTTAGGATG GGGGATGTGG CCAGGGGTGCTCCTGTGCTCACCCTCTCTTGGTGCATTTTTTTGGAAGAATAAAATTGCC TCTCTCTTTG AAAAAAAAAAAAAAAAA

NM_002467 GACCCCCGAGCTGTGCTGCTCGCGGCCGCCACCGCCGGGCCCCGGCCGTCCCTGGCTCCC CTCCTGCCTC 141

GAGAAGGGCAGGGCTTCTCAGAGGCTTGGCGGGAAAAAGAACGGAGGGAGGGATCGCGCT GAGTATAAAA GCCGGTTTTCGGGGCTTTATCTAACTCGCTGTAGTAATTCCAGCGAGAGGCAGAGGGAGC GAGCGGGCGG CCGGCTAGGGTGGAAGAGCCGGGCGAGCAGAGCTGCGCTGCGGGCGTCCTGGGAAGGGAG ATCCGGAGCG AATAGGGGGCTTCGCCTCTGGCCCAGCCCTCCCGCTGATCCCCCAGCCAGCGGTCCGCAA CCCTTGCCGC ATCCACGAAACTTTGCCCATAGCAGCGGGCGGGCACTTTGCACTGGAACTTACAACACCC GAGCAAGGAC GCGACTCTCCCGACGCGGGGAGGCTATTCTGCCCATTTGGGGACACTTCCCCGCCGCTGC CAGGACCCGC TTCTCTGAAAGGCTCTCCTTGCAGCTGCTTAGACGCTGGATTTTTTTCGGGTAGTGGAAA ACCAGCAGCC TCCCGCGACGATGCCCCTCAACGTTAGCTTCACCAACAGGAACTATGACCTCGACTACGA CTCGGTGCAG CCGTATTTCTACTGCGACGAGGAGGAGAACTTCTACCAGCAGCAGCAGCAGAGCGAGCTG CAGCCCCCGG CGCCCAGCGAGGATATCTGGAAGAAATTCGAGCTGCTGCCCACCCCGCCCCTGTCCCCTA GCCGCCGCTC CGGGCTCTGCTCGCCCTCCTACGTTGCGGTCACACCCTTCTCCCTTCGGGGAGACAACGA CGGCGGTGGC GGGAGCTTCTCCACGGCCGACCAGCTGGAGATGGTGACCGAGCTGCTGGGAGGAGACATG GTGAACCAGA GTTTCATCTGCGACCCGGACGACGAGACCTTCATCAAAAACATCATCATCCAGGACTGTA TGTGGAGCGG CTTCTCGGCCGCCGCCAAGCTCGTCTCAGAGAAGCTGGCCTCCTACCAGGCTGCGCGCAA AGACAGCGGC AGCCCGAACCCCGCCCGCGGCCACAGCGTCTGCTCCACCTCCAGCTTGTACCTGCAGGAT CTGAGCGCCG CCGCCTCAGAGTGCATCGACCCCTCGGTGGTCTTCCCCTACCCTCTCAACGACAGCAGCT CGCCCAAGTC CTGCGCCTCGCAAGACTCCAGCGCCTTCTCTCCGTCCTCGGATTCTCTGCTCTCCTCGAC GGAGTCCTCC CCGCAGGGCAGCCCCGAGCCCCTGGTGCTCCATGAGGAGACACCGCCCACCACCAGCAGC GACTCTGAGG AGGAACAAGAAGATGAGGAAGAAATCGATGTTGTTTCTGTGGAAAAGAGGCAGGCTCCTG GCAAAAGGTC AGAGTCTGGATCACCTTCTGCTGGAGGCCACAGCAAACCTCCTCACAGCCCACTGGTCCT CAAGAGGTGC CACGTCTCCACACATCAGCACAACTACGCAGCGCCTCCCTCCACTCGGAAGGACTATCCT GCTGCCAAGA GGGTCAAGTTGGACAGTGTCAGAGTCCTGAGACAGATCAGCAACAACCGAAAATGCACCA GCCCCAGGTC CTCGGACACCGAGGAGAATGTCAAGAGGCGAACACACAACGTCTTGGAGCGCCAGAGGAG GAACGAGCTA AAACGGAGCTTTTTTGCCCTGCGTGACCAGATCCCGGAGTTGGAAAACAATGAAAAGGCC CCCAAGGTAG TTATCCTTAAAAAAGCCACAGCATACATCCTGTCCGTCCAAGCAGAGGAGCAAAAGCTCA TTTCTGAAGA GGACTTGTTGCGGAAACGACGAGAACAGTTGAAACACAAACTTGAACAGCTACGGAACTC TTGTGCGTAA GGAAAAGTAAGGAAAACGATTCCTTCTAACAGAAATGTCCTGAGCAATCACCTATGAACT TGTTTCAAAT GCATGATCAAATGCAACCTCACAACCTTGGCTGAGTCTTGAGACTGAAAGATTTAGCCAT AATGTAAACT GCCTCAAATTGGACTTTGGGCATAAAAGAACTTTTTTATGCTTACCATCTTTTTTTTTTC TTTAACAGAT TTGTATTTAAGAATTGTTTTTAAAAAATTTTAAGATTTACACAATGTTTCTCTGTAAATA TTGCCATTAA ATGTAAATAACTTTAATAAAACGTTTATAGCAGTTACACAGAATTTCAATCCTAGTATAT AGTACCTAGT ATTATAGGTACTATAAACCCTAATTTTTTTTATTTAAGTACATTTTGCTTTTTAAAGTTG ATTTTTTTCT AT T GT T T T T AGAAAAAAT AAAAT AAC T GGCAAAT AT AT C AT T GAGC CAAAT CT T AAAAAAAAAAAAAAA

BC013732 GTGGGAGGATTGCATTCAGTCTAGTTCCTGGTTGCCGGCTGAAATAACCTGCTCTCCAAA ATGTCCACAA 142

AAGTGACTTAAGTCAGGTTCCCCCAAACCAGACACCAAGACAAGAATCCATGTGTGTGTG ACTGAAGGAA GTGCTGGGAGAGCCCCAGCTGCAGCCTGGATGTGAACTGCAACTCCAAAGTGTGTCCAGA CTCAAGGCAA GGGCACTAGGCTTTCCAGACCTCCTACTAAGTCATTGATCCAGCACTGCCCTGCCAGGAC ATAAATCCCT GGCACCTCTTGCTCTCTGCAAAGGAGGGCAAAGCAGCTTCAGGAGCCCTTGGGAGTCCTC CAAAGAGAGT CTAGGGTACAGGTCCGAAAGTAGAAGAACACAGAAGGCAGGCCAGGGGCACTGTGAGATG GTAAAAGAGA TCTGAAGGGATCCAGAATTCAAGCCAGGAAGAAGCAGCAATCTGTCTTCTGGATTAAAAC TGAAGATCAA CCTACTTTCAACTTACTAAGAAAGGGGATCATGGACATTGAAGCATATCTTGAAAGAATT GGCTATAAGA AGTCTAGGAACAAATTGGACTTGGAAACATTAACTGATATTCTTCAACACCAGATCCGAG CTGTTCCCTT TGAGAACCTTAACATCCATTGTGGGGATGCCATGGACTTAGGCTTAGAGGCCATTTTTGA TCAAGTTGTG AGAAGAAATCGGGGTGGATGGTGTCTCCAGGTCAATCATCTTCTGTACTGGGCTCTGACC ACTATTGGTT TTGAGACCACGATGTTGGGAGGGTATGTTTACAGCACTCCAGCCAAAAAATACAGCACTG GCATGATTCA CCTTCTCCTGCAGGTGACCATTGATGGCAGGAACTACATTGTCGATGCTGGGTTTGGACG CTCATACCAG ATGTGGCAGCCTCTGGAGTTAATTTCTGGGAAGGATCAGCCTCAGGTGCCTTGTGTCTTC CGTTTGACGG AAGAGAAT GGATTCTGGTATC T AGAC CAAAT CAGAAGGGAACAGT ACAT T C CAAAT GAAGAAT T T C T T CA TTCTGATCTCCTAGAAGACAGCAAATACCGAAAAATCTACTCCTTTACTCTTAAGCCTCG AACAATTGAA GATTTTGAGTCTATGAATACATACCTGCAGACATCTCCATCATCTGTGTTTACTAGTAAA TCATTTTGTT CCTTGCAGACCCCAGATGGGGTTCACTGTTTGGTGGGCTTCACCCTCACCCATAGGAGAT TCAATTATAA GGACAATACAGATCTAATAGAGTTCAAGACTCTGAGTGAGGAAGAAATAGAAAAAGTGCT GAAAAATATA TTTAATATTTCCTTGCAGAGAAAGCTTGTGCCCAAACATGGTGATAGATTTTTTACTATT TAGAATAAGG AGTAAAACAATCTTGTCTATTTGTCATCCAGCTCACCAGTTATCAACTGACGACCTATCA TGTATCTTCT GTACCCTTACCTTATTTTGAAGAAAATCCTAGACATCAAATCATTTCACCTATAAAAATG TCATCATATA TAAT TAAACAGCT T T T TAAAGAAACATAACCACAAACCT T T TCAAATAATAAT AATAAT AATAATAATAA ATGTCTTTTAAAGATGGCCTGTGGTTATCTTGGAAATTGGTGATTTATGCTAGAAAGCTT TTAATGTTGG TTTATTGTTGAATTCCTAGAAAAGTTTTATGGGTAGATGAGTAAATAAAATATTGTAAAA AAACTTATTG TCTATAAAGTATATTAAAACATTGTTGGCTAATATAAAAAAAAAAAAAA

NM_014321 GCGCGCGGGTTTCGTTGACCCGCGGCGTTCACGGGAATTGTTCGCTTTAGTGCCGGCGCC ATGGGGTCGG 143

AGCTGATCGGGCGCCTAGCCCCGCGCCTGGGCCTCGCCGAGCCCGACATGCTGAGGAAAG CAGAGGAGTA CTTGCGCCTGTCCCGGGTGAAGTGTGTCGGCCTCTCCGCACGCACCACGGAGACCAGCAG TGCAGTCATG TGCCTGGACCTTGCAGCTTCCTGGATGAAGTGCCCCTTGGACAGGGCTTATTTAATTAAA CTTTCTGGTT TGAACAAGGAGACATATCAGAGCTGTCTTAAATCTTTTGAGTGTTTACTGGGCCTGAATT CAAATATTGG AATAAGAGACCTAGCTGTACAGTTTAGCTGTATAGAAGCAGTGAACATGGCTTCAAAGAT ACTAAAAAGC TATGAGTCCAGTCTTCCCCAGACACAGCAAGTGGATCTTGACTTATCCAGGCCACTTTTC ACTTCTGCTG CACTGCTTTCAGCATGCAAGATTCTAAAGCTGAAAGTGGATAAAAACAAAATGGTAGCCA CATCCGGTGT AAAAAAAGCTATATTTGATCGACTGTGTAAACAACTAGAGAAGATTGGACAGCAGGTCGA CAGAGAACCT GGAGATGTAGCTACTCCACCACGGAAGAGAAAGAAGATAGTGGTTGAAGCCCCAGCAAAG GAAATGGAGA AGGT AGAGGAGAT GCCACAT AAACCACAGAAAGAT GAAGAT C T GAC ACAGGAT TAT GAAGAAT GGAAAAG AAAAATTTTGGAAAATGCTGCCAGTGCTCAAAAGGCTACAGCAGAGTGATTTCAGCTTCC AAACTGGTAT ACATTCCAAACTGATAGTACATTGCCATCTCCAGGAAGACTTGACGGCTTTGGGATTTTG TTTAAACTTT TATAATAAGGATCCTAAGACTGTTGCCTTTAAATAGCAAAGCAGCCTACCTGGAGGCTAA GTCTGGGCAG TGGGCTGGCCCCTGGTGTGAGCATTAGACCAGCCACAGTGCCTGATTGGTATAGCCTTAT GTGCTTTCCT ACAAAATGGAATTGGAGGCCGGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGA GGCCAAGGTG GGTGGATCACCTGAGGTCAGGAGCTCGAGACCAGCCTGGCCAACATGGTGAAACCCCATC TCTACTAAAA ATACAAAAATTAGCCAGGTGTGATGGTGCATGCCTGTAATCCCAGCTCCTCAGTAGGCTG AGACAGGAGC ATCACTTGAACGTGGGAGGCAGAGGTTGCAGTGAGCCGAGATTGCACCACCGCACTCCAG CCTGGGTGAC AGAGCGAGACTTATCTCATAAATAAATAGATAGATACTCCAGCCTGGGTGACAGAGCGAG ACTTATAGAT AGATAGATAGATAGATGGATAGATAGATAGATAGATAGATAGATAGATAAACGGAATTGG AGCCATTTTG CTTTAAGTGAATGGCAGTCCCTTGTCTTATTCAGAATATAAAATTCAGTCTGAATGGCAT CTTACAGATT TTACTTCAATTTTTGTGTACGGTATTTTTTATTTGACTAAATCAATATATTGTACAGCCT AAGTTAATAA ATGT TAT T TATAT ATGCAAAAAAAAAAAAAAAAA

NM_000926 AGTCCACAGCTGTCACTAATCGGGGTAAGCCTTGTTGTATTTGTGCGTGTGGGTGGCATT CTCAATGAGA 144

ACTAGCTTCACTTGTCATTTGAGTGAAATCTACAACCCGAGGCGGCTAGTGCTCCCGCAC TACTGGGATC TGAGATCTTCGGAGATGACTGTCGCCCGCAGTACGGAGCCAGCAGAAGTCCGACCCTTCC TGGGAATGGG CTGTACCGAGAGGTCCGACTAGCCCCAGGGTTTTAGTGAGGGGGCAGTGGAACTCAGCGA GGGACTGAGA GCTTCACAGCATGCACGAGTTTGATGCCAGAGAAAAAGTCGGGAGATAAAGGAGCCGCGT GTCACTAAAT TGCCGTCGCAGCCGCAGCCACTCAAGTGCCGGACTTGTGAGTACTCTGCGTCTCCAGTCC TCGGACAGAA GTTGGAGAACTCTCTTGGAGAACTCCCCGAGTTAGGAGACGAGATCTCCTAACAATTACT ACTTTTTCTT GCGCTCCCCACTTGCCGCTCGCTGGGACAAACGACAGCCACAGTTCCCCTGACGACAGGA TGGAGGCCAA GGGCAGGAGCTGACCAGCGCCGCCCTCCCCCGCCCCCGACCCAGGAGGTGGAGATCCCTC CGGTCCAGCC ACATTCAACACCCACTTTCTCCTCCCTCTGCCCCTATATTCCCGAAACCCCCTCCTCCTT CCCTTTTCCC TCCTCCTGGAGACGGGGGAGGAGAAAAGGGGAGTCCAGTCGTCATGACTGAGCTGAAGGC AAAGGGTCCC CGGGCTCCCCACGTGGCGGGCGGCCCGCCCTCCCCCGAGGTCGGATCCCCACTGCTGTGT CGCCCAGCCG CAGGTCCGTTCCCGGGGAGCCAGACCTCGGACACCTTGCCTGAAGTTTCGGCCATACCTA TCTCCCTGGA CGGGCTACTCTTCCCTCGGCCCTGCCAGGGACAGGACCCCTCCGACGAAAAGACGCAGGA CCAGCAGTCG CTGTCGGACGTGGAGGGCGCATATTCCAGAGCTGAAGCTACAAGGGGTGCTGGAGGCAGC AGTTCTAGTC CCCCAGAAAAGGACAGCGGACTGCTGGACAGTGTCTTGGACACTCTGTTGGCGCCCTCAG GTCCCGGGCA GAGCCAACCCAGCCCTCCCGCCTGCGAGGTCACCAGCTCTTGGTGCCTGTTTGGCCCCGA ACTTCCCGAA GATCCACCGGCTGCCCCCGCCACCCAGCGGGTGTTGTCCCCGCTCATGAGCCGGTCCGGG TGCAAGGTTG GAGACAGCTCCGGGACGGCAGCTGCCCATAAAGTGCTGCCCCGGGGCCTGTCACCAGCCC GGCAGCTGCT GCTCCCGGCCTCTGAGAGCCCTCACTGGTCCGGGGCCCCAGTGAAGCCGTCTCCGCAGGC CGCTGCGGTG GAGGTTGAGGAGGAGGATGGCTCTGAGTCCGAGGAGTCTGCGGGTCCGCTTCTGAAGGGC AAACCTCGGG CTCTGGGTGGCGCGGCGGCTGGAGGAGGAGCCGCGGCTGTCCCGCCGGGGGCGGCAGCAG GAGGCGTCGC CCTGGTCCCCAAGGAAGATTCCCGCTTCTCAGCGCCCAGGGTCGCCCTGGTGGAGCAGGA CGCGCCGATG GCGCCCGGGCGCTCCCCGCTGGCCACCACGGTGATGGATTTCATCCACGTGCCTATCCTG CCTCTCAATC ACGCCTTATTGGCAGCCCGCACTCGGCAGCTGCTGGAAGACGAAAGTTACGACGGCGGGG CCGGGGCTGC CAGCGCCTTTGCCCCGCCGCGGAGTTCACCCTGTGCCTCGTCCACCCCGGTCGCTGTAGG CGACTTCCCC GACTGCGCGTACCCGCCCGACGCCGAGCCCAAGGACGACGCGTACCCTCTCTATAGCGAC TTCCAGCCGC CCGCTCTAAAGATAAAGGAGGAGGAGGAAGGCGCGGAGGCCTCCGCGCGCTCCCCGCGTT CCTACCTTGT GGCCGGTGCCAACCCCGCAGCCTTCCCGGATTTCCCGTTGGGGCCACCGCCCCCGCTGCC GCCGCGAGCG ACCCCATCCAGACCCGGGGAAGCGGCGGTGACGGCCGCACCCGCCAGTGCCTCAGTCTCG TCTGCGTCCT CCTCGGGGTCGACCCTGGAGTGCATCCTGTACAAAGCGGAGGGCGCGCCGCCCCAGCAGG GCCCGTTCGC GCCGCCGCCCTGCAAGGCGCCGGGCGCGAGCGGCTGCCTGCTCCCGCGGGACGGCCTGCC CTCCACCTCC GCCTCTGCCGCCGCCGCCGGGGCGGCCCCCGCGCTCTACCCTGCACTCGGCCTCAACGGG CTCCCGCAGC TCGGCTACCAGGCCGCCGTGCTCAAGGAGGGCCTGCCGCAGGTCTACCCGCCCTATCTCA ACTACCTGAG GCCGGATTCAGAAGCCAGCCAGAGCCCACAATACAGCTTCGAGTCATTACCTCAGAAGAT TTGTTTAATC TGTGGGGATGAAGCATCAGGCTGTCATTATGGTGTCCTTACCTGTGGGAGCTGTAAGGTC TTCTTTAAGA GGGCAATGGAAGGGCAGCACAACTACTTATGTGCTGGAAGAAATGACTGCATCGTTGATA AAATCCGCAG AAAAAACTGCCCAGCATGTCGCCTTAGAAAGTGCTGTCAGGCTGGCATGGTCCTTGGAGG TCGAAAATTT AAAAAGTTCAATAAAGTCAGAGTTGTGAGAGCACTGGATGCTGTTGCTCTCCCACAGCCA GTGGGCGTTC CAAATGAAAGCCAAGCCCTAAGCCAGAGATTCACTTTTTCACCAGGTCAAGACATACAGT TGATTCCACC ACTGATCAACCTGTTAATGAGCATTGAACCAGATGTGATCTATGCAGGACATGACAACAC AAAACCTGAC ACCTCCAGTTCTTTGCTGACAAGTCTTAATCAACTAGGCGAGAGGCAACTTCTTTCAGTA GTCAAGTGGT CTAAATCATTGCCAGGTTTTCGAAACTTACATATTGATGACCAGATAACTCTCATTCAGT ATTCTTGGAT GAGCTTAATGGTGTTTGGTCTAGGATGGAGATCCTACAAACACGTCAGTGGGCAGATGCT GTATTTTGCA CCTGATCTAATACTAAATGAACAGCGGATGAAAGAATCATCATTCTATTCATTATGCCTT ACCATGTGGC AGATCCCACAGGAGTTTGTCAAGCTTCAAGTTAGCCAAGAAGAGTTCCTCTGTATGAAAG TATTGTTACT TCTTAATACAATTCCTTTGGAAGGGCTACGAAGTCAAACCCAGTTTGAGGAGATGAGGTC AAGCTACATT AGAGAGCTCATCAAGGCAATTGGTTTGAGGCAAAAAGGAGTTGTGTCGAGCTCACAGCGT TTCTATCAAC TTACAAAACTTCTTGATAACTTGCATGATCTTGTCAAACAACTTCATCTGTACTGCTTGA ATACATTTAT CCAGTCCCGGGCACTGAGTGTTGAATTTCCAGAAATGATGTCTGAAGTTATTGCTGCACA ATTACCCAAG ATATTGGCAGGGATGGTGAAACCCCTTCTCTTTCATAAAAAGTGAATGTCATCTTTTTCT TTTAAAGAAT TAAATTTTGTGGTATGTCTTTTTGTTTTGGTCAGGATTATGAGGTCTTGAGTTTTTATAA TGTTCTTCTG AAAGCCTTACATTTATAACATCATAGTGTGTAAATTTAAAAGAAAAATTGTGAGGTTCTA ATTATTTTCT TTTATAAAGTATAATTAGAATGTTTAACTGTTTTGTTTACCCATATTTTCTTGAAGAATT TACAAGATTG AAAAAGTACTAAAATTGTTAAAGTAAACTATCTTATCCATATTATTTCATACCATGTAGG TGAGGATTTT TAACTTTTGCATCTAACAAATCATCGACTTAAGAGAAAAAATCTTACATGTAATAACACA AAGCTATTAT ATGTTATTTCTAGGTAACTCCCTTTGTGTCAATTATATTTCCAAAAATGAACCTTTAAAA TGGTATGCAA AATTTTGTCTATATATATTTGTGTGAGGAGGAAATTCATAACTTTCCTCAGATTTTCAAA AGTATTTTTA ATGCAAAAAATGTAGAAAGAGTTTAAAACCACTAAAATAGATTGATGTTCTTCAAACTAG GCAAAACAAC TCATATGTTAAGACCATTTTCCAGATTGGAAACACAAATCTCTTAGGAAGTTAATAAGTA GATTCATATC ATTATGCAAATAGTATTGTGGGTTTTGTAGGTTTTTAAAATAACCTTTTTTGGGGAGAGA ATTGTCCTCT AATGAGGTATTGCGAGTGGACATAAGAAATCAGAAGATTATGGCCTAACTGTACTCCTTA CCAACTGTGG CATGCTGAAAGTTAGTCACTCT TACT GAT TCTCAATTCTCTCACCTTTGAAAGTAGTAAAATATCTTTCC T GCCAATT GCTCCTTTGGGT CAGAGC TTATTAACATCTTTT CAAAT CAAAGGAAAGAAGAAAGGGAGAGG AGGAGGAGGGAGGTATCAATTCACATACCTTTCTCCTCTTTATCCTCCACTAT CATGAATTCATATTATG TTTCAGCCATGCAAATCTTTTTACCATGAAATTTCTTCCAGAATTTTCCCCCTTTGACAC AAATTCCATG CATGTTTCAACCTTCGAGACTCAGCCAAATGTCATTTCTGTAAAATCTTCCCTGAGTCTT CCAAGCAGTA ATTTGCCTTCTCCTAGAGTTTACCTGCCATTTTGTGCACATTTGAGTTACAGTAGCATGT TATTTTACAA TTGTGACTCTCCTGGGAGTCTGGGAGCCATATAAAGTGGTCAATAGTGTTTGCTGACTGA GAGTTGAATG ACATTTTCTCTCTGTCTTGGTATTACTGTAGATTTCGATCATTCTTTGGTTACATTTCTG CATATTTCTG TACCCATGACTTTATCACTTTCTTCTCCCATGCTTTATCTCCATCAAT TAT CTTCAT TACT TTTAAATTT TCCACCTTTGCTTCCTACTTTGTGAGATCTCTCCCTTTACTGACTATAACATAGAAGAAT AGAAGTGTAT TTTATGTGTCTTAAGGACAATACTTTAGATTCCTTGTTCTAAGTTTTTAAACTGAATGAA TGGAATATTA TTTCTCTCCCTAAGCAAAATTCCACAAAACAATTATTTCTTATGTTTATGTAGCCTTAAA TTGTTTTGTA CTGTAAACCTCAGCATAAAAACTTTCTTCATTTCTAATTTCATTCAACAAATATTGATTG AATACCTGGT ATTAGCACAAGAAAAATGTGCTAATAAGCCTTATGAGAATTTGGAGCTGAAGAAAGACAT ATAACTCAGG AAAGTTACAGTCCAGTAGTAGGTATAAATTACAGTGCCTGATAAATAGGCATTTTAATAT TTGTACACTC AACGTATACTAGGTAGGTGCAAAACATTTACATATAATTTTACTGATACCCATGCAGCAC AAAGGTACTA ACTTTAAATATTAAATAACACCTTTATGTGTCAGTAATTCATTTGCATTAAATCTTATTG AAAAGGCTTT CAATATATTTTCCCCACAAATGTCATCCCAAGAAAAAAGTATTTTTAACATCTCCCAAAT ATAATAGTTA CAGGAAATCTACCTCTGTGAGAGTGACACCTCTCAGAATGAACTGTGTGACACAAGAAAA TGAATGTAGG TCTATCCAAAAAAAACCCCAAGAAACAAAAACAATATTATTAGCCCTTTATGCTTAAGTG ATGGACTCAG GGAACAGTTGATGTTGTGATCATTTTATTATCTGATTCTTGTTACTTTGAATTAAACCAA TATTTTGATG ATATAAATCATTTCCACCAGCATATATTTAATTTCCATAATAACTTTAAAATTTTCTAAT TTCACTCAAC TATGAGGGAATAGAATGTGGTGGCCACAGGTTTGGCTTTTGTTAAAATGTTTGATATCTT CGATGTTGAT CTCTGTCTGCAATGTAGATGTCTAAACACTAGGATTTAATATTTAAGGCTAAGCTTTAAA AATAAAGTAC CTTTTTAAAAAGAATATGGCTTCACCAAATGGAAAATACCTAATTTCTAAATCTTTTTCT CTACAAAGTC CTATCTACTAATGTCTCCATTACTATTTAGTCATCATAACCATTATCTTCATTTTACATG TCGTGTTCTT TCTGGTAGCTCTAAAATGACACTAAATCATAAGAAGACAGGTTACATATCAGGAAATACT TGAAGGTTAC TGAAATAGATTCTTGAGTTAATGAAAATATTTTCTGTAAAAAGGTTTGAAAAGCCATTTG AGTCTAAAGC ATTATACCTCCATTATCAGTAGTTATGTGACAATTGTGTGTGTGTTTAATGTTTAAAGAT GTGGCACTTT TTAATAAGGCAATGCTATGCTATTTTTTCCCATTTAACATTAAGATAATTTATTGCTATA CAGATGATAT GGAAATATGATGAACAATATTTTTTTTGCCAAAACTATGCCTTGTAAGTAGCCATGGAAT GTCAACCTGT AACTTAAATTATCCACAGATAGTCATGTGTTTGATGATGGGCACTGTGGAGATAACTGAC ATAGGACTGT GCCCCCCTTCTCTGCCACTTACTAGCTGGATGAGATTAAGCAAGTCATTTAACTGCTCTG ATTAAACCTG CCTTTCCCAAGTGCTTTGTAATGAATAGAAATGGAAACCAAAAAAAACGTATACAGGCCT TCAGAAATAG TAATTGCTACTATTTTGTTTTCATTAAGCCATAGTTCTGGCTATAATTTTATCAAACTCA CCAGCTATAT TCTACAGTGAAAGCAGGATTCTAGAAAGTCTCACTGTTTTATTTATGTCACCATGTGCTA TGATATATTT GGTTGAATTCATTTGAAATTAGGGCTGGAAGTATTCAAGTAATTTCTTCTGCTGAAAAAA TACAGTGTTT TGAGTTTAGGGCCTGTTTTATCAAAGTTCTAAAGAGCCTATCACTCTTCCATTGTAGACA TTTTAAAATA ATGACACTGATTTTAACATTTTTAAGTGTCTTTTTAGAACAGAGAGCCTGACTAGAACAC AGCCCCTCCA AAAACCCATGCTCAAATTATTTTTACTATGGCAGCAATTCCACAAAAGGGAACAATGGGT TTAGAAATTA CAATGAAGTCATCAACCCAAAAAACATCCCTATCCCTAAGAAGGTTATGATATAAAATGC CCACAAGAAA TCTATGTCTGCTTTAATCTGTCTTTTATTGCTTTGGAAGGATGGCTATTACATTTTTAGT TTTTGCTGTG AATACCTGAGCAGTTTCTCTCATCCATACTTATCCTTCACACATCAGAAGTCAGGATAGA ATATGAATCA TTTTAAAAACTTTTACAACTCCAGAGCCATGTGCATAAGAAGCATTCAAAACTTGCCAAA ACATACATTT TTTTTCAAATTTAAAGATACTCTATTTTTGTATTCAATAGCTCAACAACTGTGGTCCCCA CTGATAAAGT GAAGTGGACAAGGAGACAAGTAATGGCATAAGTTTGTTTTTCCCAAAGTATGCCTGTTCA ATAGCCATTG GATGTGGGAAATTTCTACATCTCTTAAAATTTTACAGAAAATACATAGCCAGATAGTCTA GCAAAAGTTC ACCAAGTCCTAAATTGCTTATCCTTACTTCACTAAGTCATGAAATCATTTTAATGAAAAG AACATCACCT AGGTTTTGTGGTTTCTTTTTTTCTTATTCATGGCTGAGTGAAAACAACAATCTCTGTTTC TCCCTAGCAT CTGTGGACTATTTAATGTACCATTATTCCACACTCTATGGTCCTTACTAAATACAAAATT GAACAAAAAG CAGTAAAACAACTGACTCTTCACCCATATTATAAAATATAATCCAAGCCAGATTAGTCAA CATCCATAAG ATGAATCCAAGCTGAACTGGGCCTAGATTATTGAGTTCAGGTTGGATCACATCCCTATTT ATTAATAAAC TTAGGAAAGAAGGCCTTACAGACCATCAGTTAGCTGGAGCTAATAGAACCTACACTTCTA AAGTTCGGCC TAGAATCAATGTGGCCTTAAAAGCTGAAAAGAAGCAGGAAAGAACAGTTTTCTTCAATAA TTTGTCCACC CTGTCACTGGAGAAAATTTAAGAATTTGGGGGTGTTGGTAGTAAGTTAAACACAGCAGCT GTTCATGGCA GAAAT TAT TCAATACATACCTTCTCTGAATATCCTATAACCAAAGCAAAGAAAAACACCAAGGGGTTT GT TCTCCTCCTTGGAGTTGACCTCATTCCAAGGCAGAGCTCAGGTCACAGGCACAGGGGCTG CGCCCAAGCT TGTCCGCAGCCTTATGCAGCTGTGGAGTCTGGAAGACTGTTGCAGGACTGCTGGCCTAGT CCCAGAATGT CAGCCTCATTTTCGATTTACTGGCTCTTGTTGCTGTATGTCATGCTGACCTTATTGTTAA ACACAGGTTT GTTTGCTTTTTTTCCACTCATGGAGACATGGGAGAGGCATTATTTTTAAGCTGGTTGAAA GCTTTAACCG ATAAAGCATTTTTAGAGAAATGTGAATCAGGCAGCTAAGAAAGCATACTCTGTCCATTAC GGTAAAGAAA ATGCACAGATTATTAACTCTGCAGTGTGGCATTAGTGTCCTGGTCAATATTCGGATAGAT ATGAATAAAA TATTTAAATGGTATTGTAAATAGTTTTCAGGACATATGCTATAGCTTATTTTTATTATCT TTTGAAATTG CTCTTAATACATCAAATCCTGATGTATTCAATTTATCAGATATAAATTATTCTAAATGAA GCCCAGTTAA ATGTTTTTGTCTTGTCAGTTATATGTTAAGTTTCTGATCTCTTTGTCTATGACGTTTACT AATCTGCATT TTTACTGTTATGAATTATTTTAGACAGCAGTGGTTTCAAGCTTTTTGCCAC TAAAAATACCTTTTATTTT CTCCTCCCCCAGAAAAGTCTATACCTTGAAGTATCTATCCACCAAACTGTACTTCTATTA AGAAATAGTT ATTGTGTTTTCTTAATGTTTTGTTATTCAAAGACATATCAATGAAAGCTGCTGAGCAGCA TGAATAACAA TTATATCCACACAGATTTGATATATTTTGTGCAGCCTTAACTTGATAGTATAAAATGTCA TTGCTTTTTA AATAATAGTTAGTCAATGGACTTCTATCATAGCTTTCCTAAACTAGGTTAAGATCCAGAG CTTTGGGGTC ATAATATATTACATACAATTAAGTTATCTTTTTCTAAGGGCTTTAAAATTCATGAGAATA ACCAAAAAAG GTATGTGGAGAGTTAATACAAACATACCATATTCTTGTTGAAACAGAGATGTGGCTCTGC TTGTTCTCCA TAAGGTAGAAATACTTTCCAGAATTTGCCTAAACTAGTAAGCCCTGAATTTGCTATGATT AGGGATAGGA AGAGATTTTCACATGGCAGACTTTAGAATTCTTCACTTTAGCCAGTAAAGTATCTCCTTT TGATCTTAGT ATTCTGTGTATTTTAACTTTTCTGAGTTGTGCATGTTTATAAGAAAAATCAGCACAAAGG GTTTAAGTTA AAGCC T T T T T AC T GAAAT T T GAAAGAAACAGAAGAAAAT AT CAAAG TTCTTTGTATTTT GAGAGGAT T AA ATATGATTTACAAAAGTTACATGGAGGGCTCTCTAAAACATTAAATTAATTATTTTTTGT TGAAAAGTCT TACT TTAGGCATCATTT TAT TCCTCAGCAACTAGCTGTGAAGCCTTTACTGTGCTGTATGCCAGTCACTC TGCTAGATTGTGGAGATTACCAGTGTTCCCGTCTTCTCCGAGCTTAGAGTTGGATGGGGA ATAAAGACAG GTAAACAGATAGCTACAATATTGTACTGTGAATGCTTATGCTGGAGGAAGTACAGGGAAC TATTGGAGCA CCTAAGAGGAGCACCTACCTTGAATTTAGGGGTTAGCAGAGGCATCCTGAAAAAAGTCAA AGCTAAGCCA CAATCTATAAGCAGTTTAGGAATTAGCAGAACGTGCGTGGTGAGGAGATGCCAAAGGCAA GAAGAGAAGA GTATTCCAAACAGGAGGGATTCCAAAGAGAGAAGAGTATCCCAAACAACATTTGCACAAA CCTGATGGGG AGAGAGAATGTGGGGTGGGGATGGATGATGAGACTGAAGAAGAAAGCCAGGTCTAGATAA TCAGTGGCCT TGTACACCATGTTAAAGAGTGTAGACTTGATTCTGTTGTAAACAGGAAAGCAGCACAATT CATATGAATA TTTTAGAAGACTCCCACTGGAATATGGAGAATAAAGTTGGAGATGACTAATCCTGGAAGC AGGGAGAACA T T T T T GAGGAAGT T GCAC T AT T T T GG T GAAAAT GAT GAT CAT AAAC AT GAAGAAT TGTAGGTGATCATGA CCTCCTCTCTAATTTTCCAGAAGGGTTTTGGAAGATATAACATAGGAACATTGACAGGAC TGACGAAAGG AGATGAAATACACCATATAAATTGTCAAACACAAGGCCAGATGTCTAATTATTTTGCTTA TGTGTTGAAA TTACAAATTTTTCATCAGGAAACCAAAAACTACAAAACTTAGTTTTCCCAAGTCCCAGAA TTCTATCTGT CCAAACAATCTGTACCACTCCACCTATATCCCTACCTTTGCATGTCTGTCCAACCTCAAA GTCCAGGTCT ATACACACGGGTAAGACTAGAGCAGTTCAAGTTTCAGAAAATGAGAAAGAGGAACTGAGT TGTGCTGAAC CCATACAAAATAAACACATTCTTTGTATAGATTCTTGGAACCTCGAGAGGAATTCACCTA ACTCATAGGT ATTTGATGGTATGAATCCATGGCTGGGCTCGGCTTTTAAAAAGCCTTATCTGGGATTCCT TCTATGGAAC CAAGTTCCATCAAAGCCCATTTAAAAGCCTACATTAAAAACAAAATTCTTGCTGCATTGT ATACAAATAA TGATGTCATGATCAAATAATCAGATGCCATTATCAAGTGGAATTACAAAATGGTATACCC ACTCCAAAAA AAAAAAAAAAGCTAAATTCTCAGTAGAACATTGTGACTTCATGAGCCCTCCACAGCCTTG GAGCTGAGGA GGGAGCACTGGTGAGCAGTAGGTTGAAGAGAAAACTTGGCGCTTAATAATCTATCCATGT TTTTTCATCT AAAAGAGCCTTCTTTTTGGATTACCTTATTCAATTTCCATCAAGGAAATTGTTAGTTCCA CTAACCAGAC AGCAGCTGGGAAGGCAGAAGCTTACTGTATGTACATGGTAGCTGTGGGAAGGAGGTTTCT TTCTCCAGGT CCTCACTGGCCATACACCAGTCCCTTGTTAGTTATGCCTGGTCATAGACCCCCGTTGCTA TCATCTCATA TTTAAGTCTTTGGCTTGTGAATTTATCTATTCTTTCAGCTTCAGCACTGCAGAGTGCTGG GACTTTGCTA ACTTCCATTTCTTGCTGGCTTAGCACATTCCTCATAGGCCCAGCTCTTTTCTCATCTGGC CCTGCTGTGG AGTCACCTTGCCCCTTCAGGAGAGCCATGGCTTACCACTGCCTGCT AAGCC TCCACTCAGCTGCCACCAC ACTAAATCCAAGCTTCTCTAAGATGTTGCAGACTTTACAGGCAAGCATAAAAGGCTTGAT CTTCCTGGAC TTCCCTTTACTTGTCTGAATCTCACCTCCTTCAACTTTCAGTCTCAGAATGTAGGCATTT GTCCTCTTTG CCCTACATCTTCCTTCTTCTGAATCATGAAAGCCTCTCACTTCCTCTTGCTATGTGCTGG AGGCTTCTGT CAGGTTTTAGAATGAGTTCTCATCTAGTCCTAGTAGCTTTTGATGCTTAAGTCCACCTTT TAAGGATACC TTTGAGATTTAGACCATGTTTTTCGCTTGAGAAAGCCCTAATCTCCAGACTTGCCTTTCT GTGGATTTCA AAGACC AAC T GAGGAAGT CAAAAGC T GAATGTT GAC T T T C T T T GAACAT T T CC GC T AT AACAAT T C CAAT TCTCCTCAGAGCAATATGCCTGCCTCCAACTGACCAGGAGAAAGGTCCAGTGCCAAAGAG AAAAACACAA AGATTAATTATTTCAGTTGAGCACATACTTTCAAAGTGGTTTGGGTATTCATATGAGGTT TTCTGTCAAG AGGGTGAGACTCTTCATCTATCCATGTGTGCCTGACAGTTCTCCTGGCACTGGCTGGTAA CAGATGCAAA ACTGTAAAAATTAAGTGATCATGTATTTTAACGATATCATCACATACTTATTTTCTATGT AATGTTTTAA ATTTCCCCTAACATACTTTGACTGTTTTGCACATGGTAGATATTCACATTTTTTTGTGTT GAAGTTGATG CAATCTTCAAAGTTATCTACCCCGTTGCTTATTAGTAAAACTAGTGTTAATACTTGGCAA GAGATGCAGG GAATCTTTCTCATGACTCACGCCCTATTTAGTTATTAATGCTACTACCCTATTTTGAGTA AGTAGTAGGT CCCTAAGTACATTGTCCAGAGTTATACTTTTAAAGATATTTAGCCCCATATACTTCTTGA ATCTAAAGTC ATACACCTTGCTCCTCATTTCTGAGTGGGAAAGACATTTGAGAGTATGTTGACAATTGTT CTGAAGGTTT TTGCCAAGAAGGTGAAACTGTCCTTTCATCTGTGTATGCCTGGGGCTGGGTCCCTGGCAG TGATGGGGTG ACAATGCAAAGCTGTAAAAACTAGGTGCTAGTGGGCACCTAATATCATCATCATATACTT ATTTTCAAGC TAATATGCAAAATCCCATCTCTGTTTTTAAACTAAGTGTAGATTTCAGAGAAAATATTTT GTGGTTCACA TAAGAAAACAGTCTACTCAGCTTGACAAGTGTTTTATGTTAAATTGGCTGGTGGTTTGAA ATGAATCATC TTCACATAATGTTTTCTTTAAAAATATTGTGAATTTAACTCTAATTCTTGTTATTCTGTG TGATAATAAA GAATAAAC TAATTTCTA

AK093306 ATTCTATGCTGCAGCCTAAGCATCATTCCTCTTCTCTTCTTAGTGGAGATAAAATTACCC ACTGCTCTCC 145

TTACATTTACTTTGTCCATATTTGCTCCTATGCTCTAGGCTCGTGCACAACAAACACAGT GTGGGCCCTT ACCCTAGAAGCCAACTTCTCATGACCTTTCTCTATCTCCAGAATCCATGCAGTGGGAATG AAGGTAAAAG AAGGTTTTCATGGGATCCAGCTGAGAGCTCTACGGGGAAAATGGATCTGAGGAGCCATGT GCTCCATCTC TTTTATTTTACAGGTAGAGACTAGGGGTATAGAGTGAGGTGAATTACCGCAGTGACCCAC ACATTGTTGG CAGACCTAGGATTAGAACTCTGTCTTCCTGGTTCCCAGCTTGGTGCTTTTGAAAGCATAC TTGCTGCTTT CTTACCGGCCTGGTGTCTGCCACTTTGGGACAGAGTGTGGACTTGCTCACCTGCCCCATT TCTTAGGGAT TCTCATTCTGTGTTTGAGCAAGAATATTCTTATTCTGGAAAGAACCACATACCACAGGAT TCTGGGTGAG CATAAGGAAGATTGTCTTGGGGATCTGACTTAGCTCACGTATAGTGGCTATGATGAATTC AGTGTCTTAT TTTTTGCATATGTATATTTTTAGTCTAATATTGCCTGGGTGTCTGAGCAAGTCTAGATGA ATTTAATTGC TCTCATTTTTCCCCTGCCCCTCTTCCTTTGGTCTCTCTTTTAGGAAATGTTTTTCTTTCA ACATTCGTTT CATTCATTATTTACTCATTCGGCCAACCAACATTTATTGAGTGCCTTCCCTGTATCAGGG ACAGGGGCTT ACAAAGTAGAATTTGATCCCACCTCTGCCCTCAGTAGCTCAGTGTCTAATGGAGGTAGTG ATGTTCATTA AGCGTCGCCAGATACTGTGCTAGGTGCTGTGCCTGTTCTCTCTCGCTTGTTCCTCACACA CTTGAGAAGG CCGAAGCTGATTCATAGCTTGGAAGGCAGGGGCCTTGGATTTGAACCCAGGCCTGACCAA TGGCAGAACC TATCAGATGTGTGGACAGATGACATTGCCTTTCTTTCTTTGGATATATCAAAATCAGCCA GCAGGCAGGA ACTCCCATTTTGAGCAAGCAATGTGCAGGAATGATAGGGTATACAGAGAGGAACAGGAGA TGGCCCCTGA CTTCCAGCATGTGTCTGATGGACATCCAGGCTGCAGGCATCATGGTGCTGTCTAGAGAGA TGAGCCAGGT GCCCAGAGCCCATGGGCCAATGCTGCCCTTTCTTGAGCATGCCAAACAAAGCGGTTGGTG TGTTAGAGGC ACAGTCTCCTCCACTCTAAGTAAAAATCAGCATGAGTCCTAGCCCACATTTCCCTAGTGA GTACACCAAA GATATCTAT GAAC T GGCAG T CAT CAG T GAC T T C C TAAGG T T CCGGAAAT GCATCTCTTACT CAGGAGT AA GCAATGATGTGCCTGCGGCTTTACGAGTTCTCACAGAATGACTTTCTGGACCCAAATGTT TTTTCTGCTT CAGGACTGTGAAGGCCT TAT TGTTCGCTCTGCCACCAAGGTGACCGCTGATGTCATCAACGCAGCT GAGA AACTCCAGGTGGTGGGCAGGGCTGGCACAGGTGTGGACAATGTGGATCTGGAGGCCGCAA CAAGGAAGGG CATCTTGGTTATGAACACCCCCAATGGGAACAGCCTCAGTGCCGCAGAACTCACTTGTGG AATGATCATG TGCCTGGCCAGGCAGATTCCCCAGGCGACGGCTTCGATGAAGGACGGCAAATGGGAGCGG AAGAAGTTCA TGGGAACAGAGCTGAATGGAAAGACCCTGGGAATTCTTGGCCTGGGCAGGATTGGGAGAG AGGTAGCTAC CCGGATGCAGTCCTTTGGGATGAAGACTATAGGGTATGACCCCATCATTTCCCCAGAGGT CTCGGCCTCC TTTGGTGTTCAGCAGCTGCCCCTGGAGGAGATCTGGCCTCTCTGTGATTTCAT CACTGTGCACACTCCTC TCCTGCCCTCCACGACAGGCTTGCTGAATGACAACACCTTTGCCCAGTGCAAGAAGGGGG TGCGTGTGGT GAACTGTGCCCGTGGAGGGATCGTGGACGAAGGCGCCCTGCTCCGGGCCCTGCAGTCTGG CCAGTGTGCC GGGGCTGCACTGGACGTGTTTACGGAAGAGCCGCCACGGGACCGGGCCTTGGTGGACCAT GAGAATGTCA TCAGCTGTCCCCACCTGGGTGCCAGCACCAAGGAGGCTCAGAGCCGCTGTGGGGAGGAAA TTGCTGTTCA GTTCGTGGACATGGTGAAGGGGAAATCTCTCACGGGGGTTGTGAATGCCCAGGCCCTTAC CAGTGCCTTC TCTCCACACACCAAGCCTTGGATTGGTCTGGCAGAAGCTCTGGGGACACTGATGCGAGCC TGGGCTGGGT CCCCCAAAGGGACCATCCAGGTGATAACACAGGGAACATCCCTGAAGAATGCTGGGAACT GCCTAAGCCC CGCAGTCATTGTCGGCCTCCTGAAAGAGGCTTCCAAGCAGGCGGATGTGAACTTGGTGAA CGCTAAGCTG CTGGTGAAAGAGGCTGGCCTCAATGTCACCACCTCCCACAGCCCTGCTGCACCAGGGGGG CAAGGCTTCG GGGAATGCCTCCTGGCCGTGGCCCTGGCAGGCGCCCCTTACCAGGCTGTGGGCTTGGTCC AAGGCACTAC ACCTGTACTGCAGGGGCTCAATGGAGCTGTCTTCAGGCCAGAAGTGCCTCTCCGCAGGGA CCTGCCCCTG CTCCTATTCCGGACTCAGACCTCTGACCCTGCAATGCTGCCTACCATGATTGGCCTCCTG GCAGAGGCAG GCGTGCGGCTGCTGTCCTACCAGACTTCACTGGTGTCAGATGGGGAGACCTGGCACGTCA TGGGCATCTC CTCCTTGCTGCCCAGCCTGGAAGCGTGGAAGCAGCATGTGACTGAAGCCTTCCAGTTCCA CTTCTAACCT TGGAGCTCACTGGTCCCTGCCTCTGGGGCTTTTCTGAAGAAACCCACCCACTGTGATCAA TAGGGAGAGA AAATCCACATTCTTGGGCTGAACGCGAGCCTCTGACACTGCTTACACTGCACTCTGACCC TGTAGTACAG CAATAACCGTCTAATAAAGAGCCTACCCCC

BE904476 CAAACAAAAACAGCCAAGCTTTTCTGCCAAAAAGATGACTGAGAAGACTGTTAAAGCAAA AAGCTCTGTT 146

CCTGCCTCAGATGATGCCTATCCAGAAATAGAAAAATTCTTTCCCTTCAATCCTCTAGAC TTTGAGAGTT TTGACCTGCCTGAAGAGCACCAGATTGCGCACCTCCCCTTGAGTGGAGTGCCTCTCATGA TCCTTGACGA GGAGAGAGAGCTTGAAAAGCTGTTTCAGCTGGGCCCCCCTTCACCTGTGAAGATGCCCTC TCCACCATGG GAATCCAATCTGTTGCAGTCTCCTTCAAGCATTCTGTCGACCCTGGATGTTGAATTGCCA CCTGTTTGCT GTGACATAGATATTTAAATTTCTTAGTGCTTCAGAGTCTGTGTGTATTTGTATTAATAAA GCATTCTTTA ACAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGGGGGGA GACACAAAAA GAATTCCCCAAGAGGGGGCCACAAGATAATCAGAGGATATCACACAAGATCTCTCGGCGC ACCAACGACG GGGGCCCCAAATAAGGGAGAGACCCAGAATCACAACAGCCAAGACACGGTGGACACGACG GAAACAAACA CACAGCCCAGACACGGGGGCAAACACGCGCGCACACCGCGGACACCATGGGACAAAGCAG ACACCACCCA CAAAACAACACCGCGGAGGGGGAAGAACAACAAAACAAGTGCGCAAACAGAACACAACCA CAGAAAGAGA AAAATTAAAACGGCCCCCAAGACGGCGACAACACAACAAAACAACCACTACAGAGCGCTC AACAGCCGAG TAAAAACACAACAACGGACAACTAACACACAAAGGAATGAAACAAAGCGGGGCCACACAC CGACACCGGA AATCCGGCGAACAACTCACACCGAGCGAGGGTCCCAGACAACAAATACACAGACAACGAA ACCGAGAAAC AAGACCAGCAAGACGAGCAGGCAAAAGACAAACAAGACAGAGGAGACGACGACGAACGCA AAGGACAAGA GGACACAACGACGCGAGGAGCGAGAGCGAGAGGAAGAGACAACAAAAAGACACAAAAGAA CAACAAGCAA GCAGCGAAGAACGACACACAACCACACGAGACAGCAGGAGCAGAGGCGGAGAAAACACAA CGAGCAAGCC AAGACCAAGAGAGGAGAACAAAATAAAAAAATACGAGAGCAGGCGGACGAGAGCACGAGA CGAACAGACA AACGGGAATCAGAAGCATAACGATCCGCGACGCGAACAACN

AK123010 GTGCACCCTGTCCCAGCCGTCCTGTCCTGGCTGCTCGCTCTGCTTCGCTGCGCCTCCACT ATGCTCTCCC 147

TCCGTGTCCCGCTCGCGCCCATCACGGACCCGCAGCAGCTGCAGCTCTCGCCGCTGAAGG GGCTCAGCTT GGTCGACAAGGAGAACACGCCGCCGGCCCTGAGCGGGACCCGCGTCCTGGCCAGCAAGAC CGCGAGGAGG ATCTTCCAGGAGAAAACCCCCGCCGCTTTGTCATCTTCCCCATCGAGTACCATGATATCT GGCAGATGTA TAAGAAGGCAGAGGCTTCCTTTTGGACCGCCGAGGAGGTGGACCTCTCCAAGGACATTCA GCACTGGGAA TCCCTGAAACCCGAGGAGAGATATTTTATATCCCATGTTCTGGCTTTCTTTGCAGCAAGC GATGGCATAG TAAATGAAAACTTGGTGGAGCGATTTAGCCAAGAAGTTCAGATTACAGAAGCCCGCTGTT TCTATGGCTT CCAAATTGCCATGGAAAACATACATTCTGAAATGTATAGTCTTCTTATTGACACTTACAT AAAAGATCCC AAAGAAAGGGAATTTCTCTTCAATGCCATTGAAACGATGCCTTGTGTCAAGAAGAAGGCA GACTGGGCCT TGCGCTGGATTGGGGACAAAGAGGCTACCTATGGTGAACGTGTTGTAGCCTTTGCTGCAG TGGAAGGCAT TTTCTTTTCCGGTTCTTTTGCGTCGATATTCTGGCTCAAGAAACGAGGACTGATGCCTGG CCTCACATTT TCTAATGAACTTATTAGCAGAGATGAGGGTTTACACTGTGATTTTGCTTGCCTGATGTTC AAACACCTGG TACACAAACCATCGGAGGAGAGAGTAAGAGAAATAATTATCAATGCTGTTCGGATAGAAC AGGAGTTCCT CACTGAGGCCTTGCCTGTGAAGCTCATTGGGATGAATTGCACTCTAATGAAGCAATACAT TGAGTTTGTG GCAGACAGACTTATGCTGGAACTGGGTTTTAGCAAGGTTTTCAGAGTAGAGAACCCATTT GACTTTATGG AGAATATTTCACTGGAAGGAAAGACTAACTTCTTTGAGAAGAGAGTAGGCGAGTATCAGA GGATGGGAGT GATGTCAAGTCCAACAGAGAATTCTTTTACCTTGGATGCTGACTTCTAAATGAACTGAAG ATGTGCCCTT ACTTGGCTGATTTTTTTTTTTCCATCTCATAAGAAAAATCAGCTGAAGTGTTACCAACTA GCCACACCAT GAATTGTCCGTAATGTTCATTAACAGCATCTTTAAAACTGTGTAGCTACCTCACAACCAG TCCTGTCTGT TTATAGTGCTGGTAGTATCACCTTTTGCCAGAAGGCCTGGCTGGCTGTGACTTACCATAG CAGTGACAAT GGCAGTCTTGGCTTTAAAGTGAGGGGTGACCCTTTAGTGAGCTTAGCACAGCGGGATTAA ACAGTCCTTT AACCAGCACAGCCAGTTAAAAGATGCAGCCTCACTGCTTCAACGCAGATTTTAATGTTTA CTTAAATATA AACCTGGCACTTTACAAACAAATAAACATTGTTTGTACTCACAAGGCGATAATAGCTTGA TTTATTTGGT TTCTACACCAAATACATTCTCCTGACCACTAATGGGAGCCAATTCACAATTCACTAAGTG ACTAAAGTAA GTTAAACTTGTGTAGACTAAGCATGTAATTTTTAAGTTTTATTTTAATGAATTAAAATAT TTGTTAACCA ACTTTAAAGTCAGTCCTGTGTATACCTAGATATTAGTCAGTTGGTGCCAGATAGAAGACA GGTTGTGTTT TTATCCTGTGGCTTGTGTAGTGTCCTGGGATTCTCTGCCCCCTCTGAGTAGAGTGTTGTG GGATAAAGGA ATCTCTCAGGGCAAGGAGCTTCTTAAGTTAAATCACTAGAAATTTAGGGGTGATCTGGGC CTTCATATGT GTGAGAAGCCGTTTCATTTTATTTCTCACTGTATTTTCCTCAACGTCTGGTTGATGAGAA AAAATTCTTG AAGAGTTTTCATATGTGGGAGCTAAGGTAGTATTGTAAAATTTCAAGTCATCCTTAAACA AAATGATCCA CCTAAGATCTTGCCCCTGTTAAGTGGTGAAATCAACTAGAGGTGGTTCCTACAAGTTGTT CATTCTAGTT TTGTTTGGTGTAAGTAGGTTGTGTGAGTTAATTCATTTATATTTACTATGTCTGTTAAAT CAGAAATTTT TTATTATCTATGTTCTTCTAGATTTTACCTGTAGTTCATACTTCAGTCACCCAGTGTCTT ATTCTGGCAT TGTCTAAATCTGAGCATTGTCTAGGGGGATCTTAAACTTTAGTAGGAAACCATGAGCTGT TAATACAGTT TCCATTCAAATATTAATTTCAGAATGAAACATAATTTTTTTTTTTTTTTTTTGAGATGGA GTCTCGCTCT GTTGCCCAGGCTGGAGTGCAGTGGCGCGATTTTGGCTCACTGTAACCTCCATCTCCTGGG TTCAAGCAAT TCTCCTGTCTCAGCCTCCCTAGTAGCTGGGACTGCAGGTATGTGCTACCACACCTGGCTA ATTTTTGTAT TTTTAGTAGAGATGGAGTTTCACCATATTGGTCAGGCTGGTCTTGAACTCCTGACCTCAG GTGATCCACC CACCTCGGCCTCCCAAAGTGCTGGGATTGCAGGCGTGATAAACAAATATTCTTAATAGGG CTACTTTGAA TTAATCTGCCTTTATGTTTGGGAGAAGAAAGCTGAGACATTGCATGAAAGATGATGAGAG ATAAATGTTG ATCTTTTGGCCCCATTTGTTAATTGTATTCAGTATTTGAACGTCGTCCTGTTTATTGTTA GTTTTCTTCA TCATTTATTGTATAGACAATTTTTAAATCTCTGTAATATGATACATTTTCCTATCTTTTA AGTTATTGTT ACCTAAAGTTAATCCAGATTATATGGTCCTTATATGTGTACAACATTAAAATGAAAGGCT TTGTCTTGCA TTGTGAGGTACAGGCGGAAGTTGGAATCAGGTTTTAGGATTCTGTCTCTCATTAGCTGAA TAATGTGAGG ATTAACTTCTGCCAGCTCAGACCATTTCCTAATCAGTTGAAAGGGAAACAAGTATTTCAG TCTCAAAATT GAATAATGCACAAGTCTTAAGTGATTAAAATAAAACTGTTCTTATGTCAGTTT

BC036503 AGCGGGGGCACTCCAGCCCTGCAGCCTCCGGAGTCAGTGCCGCGCGCCCGCCGCCCCGCG CCTTCCTGCT 148

CGCCGCACCTCCGGGAGCCGGGGCGCACCCAGCCCGCAGCGCCGCCTCCCCGCCCGCGCC GCCTCCGACC GCAGGCCGAGGGCCGCCACTGGCCGGGGGGACCGGGCAGCAGCTTGCGGCCGCGGAGCCG GGCAACGCTG GGGACTGCGCCTTTTGTCCCCGGAGGTCCCTGGAAGTTTGCGGCAGGACGCGCGCGGGGA GGCGGCGGAG GCAGCCCCGACGTCGCGGAGAACAGGGCGCAGAGCCGGCATGGGCATCGGGCGCAGCGAG GGGGGCCGCC GCGGGGCAGCCCTGGGCGTGCTGCTGGCGCTGGGCGCGGCGCTTCTGGCCGTGGGCTCGG CCAGCGAGTA CGACTACGTGAGCTTCCAGTCGGACATCGGCCCGTACCAGAGCGGGCGCTTCTACACCAA GCCACCTCAG TGCGTGGACATCCCCGCGGACCTGCGGCTGTGCCACAACGTGGGCTACAAGAAGATGGTG CTGCCCAACC TGCTGGAGCACGAGACCATGGCGGAGGTGAAGCAGCAGGCCAGCAGCTGGGTGCCCCTGC TCAACAAGAA CTGCCACGCCGGCACCCAGGTCTTCCTCTGCTCGCTCTTCGCGCCCGTCTGCCTGGACCG GCCCATCTAC CCGTGTCGCTGGCTCTGCGAGGCCGTGCGCGACTCGTGCGAGCCGGTCATGCAGTTCTTC GGCTTCTACT GGCCCGAGATGCTTAAGTGTGACAAGTTCCCCGAGGGGGACGTCTGCATCGCCATGACGC CGCCCAATGC CACCGAAGCCTCCAAGCCCCAAGGCACAACGGTGTGTCCTCCCTGTGACAACGAGTTGAA ATCTGAGGCC ATCATTGAACATCTCTGTGCCAGCGAGTTTGCACTGAGGATGAAAATAAAAGAAGTGAAA AAAGAAAATG GCGACAAGAAGATTGTCCCCAAGAAGAAGAAGCCCCTGAAGTTGGGGCCCATCAAGAAGA AGGACCTGAA GAAGCTTGTGCTGTACCTGAAGAATGGGGCTGACTGTCCCTGCCACCAGCTGGACAACCT CAGCCACCAC TTCCTCATCATGGGCCGCAAGGTGAAGAGCCAGTACTTGCTGACGGCCATCCACAAGTGG GACAAGAAAA ACAAGGAGTTCAAAAACTTCATGAAGAAAATGAAAAACCATGAGTGCCCCACCTTTCAGT CCGTGTTTAA GTGATTCTCCCGGGGGCAGGGTGGGGAGGGAGCCTCGGGTGGGGTGGGAGCGGGGGGGAC AGTGCCCCGG GAACCCGGTGGGTCACACACACGCACTGCGCCTGTCAGTAGTGGACATTTAATCCAGTCG GCTTGTTCTT GCAGCATTCCCGCTCCCTTCCCTCCATAGCCACGCTCCAAACCCCAGGGTAGCCATGGCC GGGTAAAGCA AGGGCCATTTAGATTAGGAAGGTTTTTAAGATCCGCAATGTGGAGCAGCAGCCACTGCAC AGGAGGAGGT GACAAACCATTTCCAACAGCAACACAGCCACTAAAACACAAAAAGGGGGATTGGGCGGAA AGTGAGAGCC AGCAGCAAAAACTACATTTTGCAACTTGTTGGTGTGGATCTATTGGCTGATCTATGCCTT TCAACTAGAA AATTCTAATGATTGGCAAGTCACGTTGTTTTCAGGTCCAGAGTAGTTTCTTTCTGTCTGC TTTAAATGGA AACAGACTCATACCACACTTACAATTAAGGTCAAGCCCAGAAAGTGATAAGTGCAGGGAG GAAAAGTGCA AGTCCATTATGTAATAGTGACAGCAAAGGGACCAGGGGAGAGGCATTGCCTTCTCTGCCC ACAGTCTTTC CGTGTGATTGTCTTTGAATCTGAATCAGCCAGTCTCAGATGCCCCAAAGTTTCGGTTCCT ATGAGCCCGG GGCATGATCTGATCCCCAAGACATGTGGAGGGGCAGCCTGTGCCTGCCTTTGTGTCAGAA AAAGGAAACC ACAGTGAGCCTGAGAGAGACGGCGATTTTCGGGCTGAGAAGGCAGTAGTTTTCAAAACAC ATAGTTAAAA AAGAAACAAATGAAAAAAATTTTAGAACAGTCCAGCAAATTGCTAGTCAGGGTGAATTGT GAAATTGGGT GAAGAGCTTACGATTCTAATCTCATGTTTTTTCCTTTTCACATTTTTAAAAGAACAATGA CAAACACCCA CTTATTTTTCAAGGTTTTAAAACAGTCTACATTGAGCATTTGAAAGGTGTGCTAGAACAA GGTCTCCTGA TCCGTCCGAGGCTGCTTCCCAGAGGAGCAGCTCTCCCCAGGCATTTGCCAAGGGAGGCGG ATTTCCCTGG TAGTGTAGCTGTGTGGCTTTCCTTCCTGAAGAGTCCGTGGTTGCCCTAGAACCTAACACC CCCTAGCAAA ACTCACAGAGCTTTCCGTTTTTTTCTTTCCTGTAAAGAAACATTTCCTTTGAACTTGATT GCCTATGGAT CAAAGAAATTCAGAACAGCCTGCCTGTCCCCCCGCACTTTTTACATATATTTGTTTCATT TCTGCAGATG GAAAGTTGACATGGGTGGGGTGTCCCCATCCAGCGAGAGAGTTTAAAAAGCAAAACATCT CTGCAGTTTT TCCCAAGTGCCCTGAGATACTTCCCAAAGCCCTTATGTTTAATCAGCGATGTATATAAGC CAGTTCACTT AGACAACTTTACCCTTCTTGTCCAATGTACAGGAAGTAGTTCTAAAAAAAATGCATATTA ATTTCTTCCC CCAAAGCCGGATTCTTAATTCTCTGCAACACTTTGAGGACATTTATGATTGTCCCTCTGG GCCAATGCTT ATACCCAGTGAGGATGCTGCAGTGAGGCTGTAAAGTGGCCCCCTGCGGCCCTAGCCTGAC CCGGAGGAAA GGATGGTAGATTCTGTTAACTCTTGAAGACTCCAGTATGAAAATCAGCATGCCCGCCTAG TTACCTACCG GAGAGTTATCCTGATAAATTAACCTCTCACAGTTAGTGATCCTGTCCTTTTAACACCTTT TTTGTGGGGT TCTCTCTGACCTTTCATCGTAAAGTGCTGGGGACCTTAAGTGATTTGCCTGTAATTTTGG ATGATTAAAA AATGTGTATATATATTAGCTAATTAGAAATATTCTACTTCTCTGTTGTCAAACTGAAATT CAGAGCAAGT TCCTGAGTGCGTGGATCTGGGTCTTAGTTCTGGTTGATTCACTCAAGAGTTCAGTGCTCA TACGTATCTG CTCATTTTGACAAAGTGCCTCATGCAACCGGGCCCTCTCTCTGCGGCAGAGTCCTTAGTG GAGGGGTTTA CCTGGAACATTAGTAGTTACCACAGAATACGGAAGAGCAGGTGACTGTGCTGTGCAGCTC TCTAAATGGG AATTCTCAGGTAGGAAGCAACAGCTTCAGAAAGAGCTCAAAATAAATTGGAAATGTGAAT CGCAGCTGTG GGTTTTACCACCGTCTGTCTCAGAGTCCCAGGACCTTGAGTGTCATTAGTTACTTTATTG AAGGTTTTAG ACCCATAGCAGCTTTGTCTCTGTCACATCAGCAATTTCAGAACCAAAAGGGAGGCTCTCT GTAGGCACAG AGCTGCACTATCACGAGCCTTTGTTTTTCTCCACAAAGTATCTAACAAAACCAATGTGCA GACTGATTGG CCTGGTCATTGGTCTCCGAGAGAGGAGGTTTGCCTGTGATTTCCTAATTATCGCTAGGGC CAAGGTGGGA TTTGTAAAGCTTTACAATAATCATTCTGGATAGAGTCCTGGGAGGTCCTTGGCAGAACTC AGTTAAATCT TTGAAGAATATTTGTAGTTATCTTAGAAGATAGCATGGGAGGTGAGGATTCCAAAAACAT TTTATTTTTA AAATATCCTGTGTAACACTTGGCTCTTGGTACCTGTGGGTTAGCATCAAGTTCTCCCCAG GGTAGAATTC AATCAGAGCTCCAGTTTGCATTTGGATGTGTAAATTACAGTAATCCCATTTCCCAAACCT AAAATCTGTT TTTCTCATCAGACTCTGAGTAACTGGTTGCTGTGTCATAACTTCATAGATGCAGGAGGCT CAGGTGATCT GTTTGAGCAGAGCACCCTAGGCAGCCTGCAGGGAATAACATACTGGCCGTTCTGACCTGT TGCCAGCAGA TACACAGGACATGGATGAAATTCCCGTTTCCTCTAGTTTCTTCCTGTAGTACTCCTCTTT TAGATCCTAA GTCTCTTACAAAAGCTTTGAATACTGTGAAAATGTTTTACATTCCATTTCATTTGTGTTG TTTTTTTAAC TGCATTTTACCAGATGTTTTGATGTTATCGCTTATGTTAATAGTAATTCCCGTACGTGTT CATTTTATTT TCATGCTTTTTCAGCCATGTAT CAAT AT T CAC T T GAC TAAAAT CAC T CAAT TAAT CAAAAAAAAAAAAAA AA

NM_012319 AGTCCTGGGCGAAGGGGGCGGTGGTTCCCCGCGGCGCTGCGCGCGGCGGTAATTAGTGAT TGTCTTCCAG 149

CTTCGCGAAGGCTAGGGGCGCGGCTGCCGGGTGGCTGCGCGGCGCTGCCCCCGGACCGAG GGGCAGCCAA CCCAATGAAACCACCGCGTGTTCGCGCCTGGTAGAGATTTCTCGAAGACACCAGTGGGCC CGTTCCGAGC CCTCTGGACCGCCCGTGTGGAACCAAACCTGCGCGCGTGGCCGGGCCGTGGGACAACGAG GCCGCGGAGA CGAAGGCGCAATGGCGAGGAAGTTATCTGTAATCTTGATCCTGACCTTTGCCCTCTCTGT CACAAATCCC CTTCATGAACTAAAAGCAGCTGCTTTCCCCCAGACCACTGAGAAAATTAGTCCGAATTGG GAATCTGGCA TTAATGTTGACTTGGCAATTTCCACACGGCAATATCATCTACAACAGCTTTTCTACCGCT ATGGAGAAAA TAAT TCTTTGTCAGTTGAAGGGTTCAGAAAAT TACT TCAAAATATAGGCATAGATAAGATTAAAAGAATC CATATACACCATGACCACGACCATCACTCAGACCACGAGCATCACTCAGACCATGAGCGT CACTCAGACC AT GAGC AT CAC T C AGACCACGAGCAT CAC T C T GAC CAT GATCATCACTCTCACCATAATCATGCTGCTTC TGGTAAAAATAAGCGAAAAGCTCTTTGCCCAGACCATGACTCAGATAGTTCAGGTAAAGA TCCTAGAAAC AGCCAGGGGAAAGGAGCTCACCGACCAGAACATGCCAGTGGTAGAAGGAATGTCAAGGAC AGTGTTAGTG C T AGT GAAGT GAC C T CAAC T GT GT AC AACAC T G T C T C T GAAGGAAC T CAC T T T C T AGAGACAAT AGAGAC TCCAAGACCTGGAAAACTCTTCCCCAAAGATGTAAGCAGCTCCACTCCACCCAGTGTCAC ATCAAAGAGC CGGGTGAGCCGGCTGGCTGGTAGGAAAACAAATGAATCTGTGAGTGAGCCCCGAAAAGGC TTTATGTATT CCAGAAACACAAATGAAAATCCTCAGGAGTGTTTCAATGCATCAAAGCTACTGACATCTC ATGGCATGGG CATCCAGGTTCCGCTGAATGCAACAGAGTTCAACTATCTCTGTCCAGCCATCATCAACCA AATTGATGCT AGATCTTGTCTGATTCATACAAGTGAAAAGAAGGCTGAAATCCCTCCAAAGACCTATTCA TTACAAATAG CCTGGGTTGGTGGTTTTATAGCCATTTCCATCATCAGTTTCCTGTCTCTGCTGGGGGTTA TCTTAGTGCC TCTCATGAATCGGGTGTTTTTCAAATTTCTCCTGAGTTTCCTTGTGGCACTGGCCGTTGG GACTTTGAGT GGTGATGCTTTTTTACACCTTCTTCCACATTCTCATGCAAGTCACCACCATAGTCATAGC CATGAAGAAC CAGCAATGGAAATGAAAAGAGGACCACTTTTCAGTCATCTGTCTTCTCAAAACATAGAAG AAAGTGCCTA TTTTGATTCCACGTGGAAGGGTCTAACAGCTCTAGGAGGCCTGTATTTCATGTTTCTTGT TGAACATGTC C T CACAT T GAT CAAACAAT T T AAAGAT AAGAAGAAAAAGAAT CAGAAGAAACC T GAAAAT GAT GAT GAT G TGGAGATTAAGAAGCAGTTGTCCAAGTATGAATCTCAACTTTCAACAAATGAGGAGAAAG TAGATACAGA TGATCGAACTGAAGGCTATTTACGAGCAGACTCACAAGAGCCCTCCCACTTTGATTCTCA GCAGCCTGCA GTCTTGGAAGAAGAAGAGGTCATGATAGCTCATGCTCATCCACAGGAAGTCTACAATGAA TATGTACCCA GAGGGTGCAAGAATAAATGCCATTCACATTTCCACGATACACTCGGCCAGTCAGACGATC TCATTCACCA CCATCATGACTACCATCATATTCTCCATCATCACCACCACCAAAACCACCATCCTCACAG TCACAGCCAG CGCTACTCTCGGGAGGAGCTGAAAGATGCCGGCGTCGCCACTCTGGCCTGGATGGTGATA ATGGGTGATG GCCTGCACAATTTCAGCGATGGCCTAGCAATTGGTGCTGCTTTTACTGAAGGCTTATCAA GTGGTTTAAG TACTTCTGTTGCTGTGTTCTGTCATGAGTTGCCTCATGAATTAGGTGACTTTGCTGTTCT ACTAAAGGCT GGCATGACCGTTAAGCAGGCTGTCCTTTATAATGCATTGTCAGCCATGCTGGCGTATCTT GGAATGGCAA CAGGAATTTTCATTGGTCATTATGCTGAAAATGTTTCTATGTGGATATTTGCACT TACT GCTGGCT TAT T CATGTATGTTGCTCTGGTTGATATGGTACCTGAAATGCTGCACAATGATGCTAGTGACCA TGGATGTAGC CGCTGGGGGTATTTCTTTTTACAGAATGCTGGGATGCTTTTGGGTTTTGGAATTATGTTA CTTATTTCCA TATTTGAACATAAAATCGTGTTTCGTATAAATTTCTAGTTAAGGTTTAAATGCTAGAGTA GCTTAAAAAG TTGTCATAGTTTCAGTAGGTCATAGGGAGATGAGTTTGTATGCTGTACTATGCAGCGTTT AAAGTTAGTG GGTTTTGTGATTTTTGTATTGAATATTGCTGTCTGTTACAAAGTCAGTTAAAGGTACGTT TTAATATTTA AGTTATTCTATCTTGGAGATAAAATCTGTATGTGCAATTCACCGGTATTACCAGTTTATT ATGTAAACAA GAGATTTGGCATGACATGTTCTGTATGTTTCAGGGAAAAATGTCTTTAATGCTTTTTCAA GAACTAACAC AGTTATTCCTATACTGGATTTTAGGTCTCTGAAGAACTGCTGGTGTTTAGGAATAAGAAT GTGCATGAAG CC T AAAAT ACCAAGAAAGC T T AT AC T GAAT T T AAGCAAAGAAAT AAAGGAGAAAAGAGAAGAAT C T GAGA AT TGGGGAGGCAT AGAT TC T T AT AAAAAT CACAAAAT T T GT TGTAAAT TAGAGGGGAGAAAT T TAGAAT T AAGTATAAAAAGGCAGAATTAGTATAGAGTACATTCATTAAACATTTTTGTCAGGATTAT TTCCCGTAAA AACGTAGTGAGCACTTTTCATATACTAATTTAGTTGTACATTTAACTTTGTATAATACAG AAATCTAAAT ATATTTAATGAATTCAAGCAATATATCACTTGACCAAGAAATTGGAATTTCAAAATGTTC GTGCGGGTAT ATACCAGATGAGTACAGTGAGTAGTTTTATGTATCACCAGACTGGGTTATTGCCAAGTTA TATATCACCA AAAGCTGTATGACTGGATGTTCTGGTTACCTGGTTTACAAAATTATCAGAGTAGTAAAAC TTTGATATAT ATGAGGATATTAAAACTACACTAAGTATCATTTGATTCGATTCAGAAAGTACTTTGATAT CTCTCAGTGC TTCAGTGCTATCATTGTGAGCAATTGTCTTTTATATACGGTACTGTAGCCATACTAGGCC TGTCTGTGGC AT T C T C T AGAT GT TTCTTTTT TACACAATAAAT TCCTTATAT CAGC T T GAAAAAAAAAAAAAAAAAA

AACGCACTTGGCGCGCGGCGCGGGCTGCAGACGGCTGCGAGGCGCTGGGCACAGGTG TCCTGATGGCAAA 150 TTTCAAGGGCCACGCGCTTCCAGGGAGTTTCTTCCTGATCATTGGGCTGTGTTGGTCAGT GAAGTACCCG CTGAAGTACTTTAGCCACACGCGGAAGAACAGCCCACTACATTACTATCAGCGTCTCGAG ATCGTCGAAG CCGCAATTAGGACTTTGTTTTCCGTCACTGGGATCCTGGCAGAGCAGTTTGTTCCGGATG GGCCCCACCT GCACCTCTACCATGAGAACCACTGGATAAAGTTAATGAATTGGCAGCACAGCACCATGTA CCTATTCTTT GCAGTCTCAGGAATTGTTGACATGCTCACCTATCTGGTCAGCCACGTTCCCTTGGGGGTG GACAGACTGG TTATGGCTGTGGCAGTATTCATGGAAGGTTTCCTCTTCTACTACCACGTCCACAACCGGC CTCCGCTGGA CCAGCACATCCACTCACTCCTGCTGTATGCTCTGTTCGGAGGGTGTGTTAGTATCTCCCT AGAGGTGATC TTCCGGGACCACATTGTGCTGGAACTTTTCCGAACCAGTCTCATCATTCTTCAGGGAACC TGGTTCTGGC AGATTGGGTTTGTGCTGTTCCCACCTTTTGGAACACCCGAATGGGACCAGAAGGATGATG CCAACCTCAT GTTCATCACCATGTGCTTCTGCTGGCACTACCTGGCTGCCCTCAGCATTGTGGCCGTCAA CTATTCTCTT GTTTACTGCCTTTT GAC TCGGAT GAAGAGACAC GGAAGGGGAGAAAT CAT T GGAAT T CAGAAGC T GAAT T CAGATGACACTTACCAGACCGCCCTCTTGAGTGGCTCAGATGAGGAATGAGCCGAGATGC GGAGGGCGCA GATGTCCCACTGCACAGCTGGAATGAATGGAGTTCATCCCCTCCACCTGAATGCCTGCTG TGGTCTGATC TTAAGGGTCTATATATTTGCACCTCCTCATTCAACACAGGGCTGGAGGTTCTACAACAGG AAATCAGGCC TACAGCATCCTGTGTATCTTGCAGTTGGGATTTTTAAACATACTATAAAGTCTGTGTTGG TATAGTACCC TTCATAAGGAAAAATGAAGTAATGCCTATAAGTAGCAGGCCTTTGTGCCTCAGTGTCAAG AGAAATCAAG AGATGCTAAAAGCTTTACAATGGAAGTGGCCTCATGGATGAATCCGGGGTATGAGCCCAG GAGAACGTGC TGCTTTTGGTAACTTATCCCTTTTTCTCTTAAGAAAGCAGGTACTTTCTTATTAGAAATA TGTTAGAATG TGTAAGCAAACGACAGTGCCTTTAGAATTACAATTCTAACTTACATATTTTTTGAAAGTA AAATAATTCA CAAGCTTTGGTATTTTAAAATTATTGTTAAACATATCATAACTAATCATACCAGGGTACT GCAATACCAC TGTTTATAAGTGACAAAATTAGGCCAAAGGTGATTTTTTTTTAAATCAGGAAGCTGGTTA CTGGCTCTAC TGAGAGTTGGAGCCCTGATGTTCTGATTCTTCAAAGTCACCCTAAAAGAAGATCTGACAG GAAAGCTGTA TAATGAGATAGAAAAACGT CAGGTAT GGAAGGC T TTCAGTT T TAAT ATGGC TGAAAGCAAAGGATAACGA ATTCAGAATTAGTAATGTAAAATCTTGATACCCTAATCTTGCTTCTGGATCTGTTCTTTT TTTAAAAAAA CTTCCTTCACCGCGCCTATAATCCTAGCACTTTGGGAGGCCGAGGCAGGCAGATCACGGG GTCAGGAGAT CAAGACCATCCTGGCTAACATGGTGAAACCCCGTCTCTACTGAAAATACAAAAAATTAGC CGGGTGTGGT GGCGGGCGCCTGTAGTTCCAGCTACTCGGGAGGCTGAGGCAAGAGAATGGCATGAACCCG GTAGGGGAGC TTGCAGTGAGCCCAGATCATGCCACTGTACTCCAGCCTAGGTGACAGAGCAAGACTCTGT CTCAAAAACA AGCAAACAGAC T T CC T T CAACAAAT AT T TAT T AAAT AT C CAC T T T GCAACAGC AC T GAAAT GGCTG TAAG GACTCCTGAGATATGTGTCCAGCAAGGAGTTTACAGTCAAACAGGAGAGACATGCCTGTA GTTACATCCA GTGTGATGGGTGCTGAGAGGCAAGTACAAACCACGATG

BQ056428 TCCCGCCGCGCCACTTCGCCTGCCTCCGTCCCCCGCCCGCCGCGCCATGCCTGTGGCCGG CTCGGAGCTG 151

CCGCGCCGGCCCTTGCCCCCCGCCGCACAGGAGCGGGACGCCGAGCCGCGTCCGCCGCAC GGGGAGCTGC AGTACCTGGGGCAGATCCAACACATCCTCCGCTGCGGCGTCAGGAAGGACGCCCGCCCGG GCACCGGTAC CCTGCCGGTATTCGGCATGCAGGCGCGCTACAGCCTGAGAGATGAATTCCCTCTGCTGAC AACCAAACGT GTGTTCTGGAACGGTGCTTCGGAGGAGCTGCTGTGGCTTATCAAGGGATCCACAAACGCT ATAGACCTGT CTTCCCCGGCAGCGAAAATCTCGGGATGCCACTGGATCCCGACACTCTCTGGACACCCTG GGATTCTCCA CCAGAGAAGAACGCGACTTGGGCCCAGTTTGTGGCTCTCAGCGGAGGCCTCCTGTGGCAG AATACATACA TTTCCAATCAGATCACTTCCCGGACACGGACCNTGACCAGCCTGCCAAAAAGTGGATTTC CCCCCACCCC AGAACCCANCCCCTGACGCACAGAAACCAACCCATTCGTTGTTGCCGCCTTGCGAACCCC AACCAGAATC TCTCCCCCCTGGCCGGCGCGCCTGCCGCTGCCAATGCCCCTATGGCGGCCTCTTGGCCCG CACCTTCCAA TTGGTCGCCCTGCGCAACCAGCGAGAAAACACTGGCCCGCCCGTCTCCCCCCCGCTCCGC CTACCCCACT TAATGCGCCTCCGTGGCATGACGCACGCGTTTGGTGTCCGCCGCCGTCTCATGTCCGCGC GGTGTGGACC CCCTTTTCTCTCGCGGCACATCCCCCCTATTCCCTTGCCCTTTGGGGGGCACCCCCTCTA GACCCGCGCT TCTCTTCTCGTCCGGTGGGGGACATTGGTTTGCCTGCCGCGGCGGGGGCGNTAAAAATAA AAACAGCCTG TTAGCCCGGCCCAGTACCCCCCCCCGGCCGGGGCCGCCTTNCGTTTGCATTTATACCCCA ACCCATAAAG CCGCGCCCCTTTAGCNCCNTAACTTTTGTGGTGTGGCCTCCCCCCTTTTTCCCGGGGAGC AGCAACGGAC ATCTGTACACTAATGCTGGCCCCGACCTTTCCCAAAAACCCCCCGCCCGTGTCCCGTATA AATTTGGTGC CAANCCTGACGNGTTCTCCCCCGCCCTCGCCCCGTTGGCCGCCCGTTTAAAGCCCCCCCG GTGGTTGCGC CGCCCAACGAGTCCACCTATAGTTAANTCCACCAACACCCCCACCTTTTCCTCCCCGCCG CATCTTCCCC ACGTACCCCCTTTTGTCGCGAGATGGCCACTCCCCCCCCCCTGTTTGTTTAAAACAACGA GAATGGTGCT GCCAACGCTGGTCTTTTCCCCCCCCGGACCGCGACCGCCAGGGGGAATACGTACCATAAG CCCCCGCGCC CNCCTTTTTTCCCCCCTCCCCGCCAATCAAGATCCGCCGTCCATTAGACGTATTATTTTT CCCGCGATAC ACGAAAAAACAGGGCCGCCCATTTATAACTAAATTCCCGTCGCCGCCGCGCGGATATGTT TCCCAAAATA CCACCCCCCCCCCCCCATTTTCTTTGCCCCCAACTCCTGCGCACCGGTGTTCACCAGCCT CGCGCCGC

BC032677 GGACGCGTGGGTCGACCCACGCGTCCGGACCCACGCGTCCGGTCGTGTTCTCCGAGTTCC TGTCTCTCTG 152

CCAACGCCGCCCGGATGGCTTCCCAAAACCGCGACCCAGCCGCCACTAGCGTCGCCGCCG CCCGTAAAGG AGCTGAGCCGAGCGGGGGCGCCGCCCGGGGTCCGGTGGGCAAAAGGCTACAGCAGGAGCT GATGACCCTC ATGGTGAGTGATTAAGTGCCCAGAACCCCAGCCTTCCATCCAATTTTCAGTAGCCTCCTT TTTTCCGTCA GCTTTTTTGCTAGACATAGGGGTAATGTAATTTGCTCCCTCCTGGGAAAGAAGTTCATAC ACCCCACCTA CACCATTTCTTCCAGCAGTCCCTCCTCCCAATTCCATCCCCCCACACGAAGTTATCTCGA ACACTTCCCT GAAGTCATACAAGACCCTCCCTATCCAGTGTGTCCCTACTTCCTAGCCCCAACCAAGCTT TACCCACACC CAACTCCCCGCCCTTCTTGGTATTTCTAGCCTATGAATTTGGTTGCTTTATTTTGGATCA GAGTGATGAG ATTAAGGGGAGGCTGGGCGCGGTAGCTCACACCTTATAATCCCAAAGTGCTGGGATTACA GGCGTGAGCC ACCGCGCCCGGCCAGCAACTAATATTCTAATTGAACTAAAGCACAGGATGCCAATTTACA ATCCTTAGAC CAAAGAGTCACTGATGTCTCCACCAGATAAGAGGAAAGCATCAGGCTAGGCATAGTGGCT CACACCTGTA ATCTCAGCACTTTGGGAGGCTGAGGCAGGCAGATCACATGAGCCCAGGAGTTTGAGACTG GCCTGGGCAA CATGGTGAAACCCTGTCTCTAAAATAAAAACTAAACTAAAAAAACTTTTTAAAAAGGCAG TGGGGAGCAT CAGAACCAGCTCAACAGTTTGTCTACTGTCCGGTCCCAGAGAAACTCAAGATTCTAGCAA GCCCCTTGTG TGGGGCTTGGGTTGGGACATGAGGCTGCTGCTGGAGCTTACTCTGCAACTGTTTCTCCAA ATGCCAGGTA TATGAAGACCTGAGGTATAAGCTCTCGCTAGAGTTCCCCAGTGGCTACCCTTACAATGCG CCCACAGTGA AGTTCCTCACGCCCTGCTATCACCCCAACGTGGACACCCAGGGTAACATATGCCTGGACA TCCTGAAGGA AAAGTGGTCTGCCCTGTATGATGTCAGGACCATTCTGCTCTCCATCCAGAGCCTTCTAGG AGAACCCAAC ATTGATAGTCCCTTGAACACACATGCTGCCGAGCTCTGGAAAAACCCCACAGCTTTTAAG AAGTACCTGC AAGAAACCTACTCAAAGCAGGTCACCAGCCAGGAGCCCTGACCCAGGCTGCCCAGCCTGT CCTTGTGTCG TCTTTTTAATTTTTCCTTAGATGGTCTGTCCTTTTTGTGATTTCTGTATAGGACTCTTTA TCTTGAGCTG TGGTATTTTTGTTTTGTTTTTGTCTTTTAAATTAAGCCTCGGTTGAGCCCTTGTATATTA AATAAATGCA TTTTTGTCCTTTTTTAAAAAAAAAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAAAAAAA

A

[32] At least 10, at least 15, at least 20, at least 25, at least 40, at least 41, at least 42, at least 43, at least 44, at least 46, at least 47, at least 48, at least 49 or all 50 of the genes in

Table 1 can be utilized in the methods of the present invention. Preferably, the expression of each of the 50 genes is determined in a biological sample. The prototypical gene expression profiles (i.e. centroid) of the four intrinsic subtypes were pre-defined from a training set of FFPE breast tumor samples using hierarchical clustering analysis of gene expression data.

Table 3 shows the actual values of the prototypical gene expression profiles (i. e. centroids) of these four subtypes. [33] Table 3.

Tumor Subtype Centroids for Comparison to a Sample

Target Gene Basal-like Her2-enriched Luminal A Luminal B

ACT 3B -0.2052 -0.7965 -0.2790 -0.4380

ANLN 1.0227 0.5006 -0.7289 0.1149

BAG1 -0.4676 -0.3132 0.4716 0.5879

BCL2 -0.7365 -0.7237 0.7234 0.6363

BLVRA -0.8761 0.2270 0.1628 0.7138

CCNE1 1.3100 0.2201 -0.6231 -0.2729

CDC20 1.0995 0.1445 -1.0518 -0.1173

CDC6 0.5817 0.6601 -0.7032 0.3134

CDCA1 0.9367 0.1623 -0.4509 0.2692

CDH3 0.7639 0.0144 -0.0502 -1.0229

CENPF 1.0222 0.2944 -0.5657 0.2437

CEP55 1.0442 0.4881 -0.6365 0.2921

CXXC5 -0.9732 0.1866 0.5687 0.9463

EGFR 0.3352 -0.1326 -0.0011 -0.9755

ERBB2 -0.7045 1.4182 0.2420 0.1978

ESR1 -1.1847 -0.4926 0.7177 1.0101

EXOl 1.0546 0.4317 -0.7259 0.2559

FGFR4 -0.2073 1.4562 0.1707 -0.2223

FOXA1 -1.3590 0.5726 0.7131 0.7963

FOXC1 1.0666 -0.7362 -0.4078 -0.9877

GPR160 -1.0540 0.5524 0.6032 0.7305

KIF2C 0.9242 0.1104 -1.1001 -0.2771

KNTC2 1.1373 0.2266 -0.7593 0.1656

KRT14 0.4759 -0.5269 0.8187 -0.8879

KRT17 0.6863 -0.3777 0.6149 -1.1415

KRT5 0.7136 -0.4146 0.5832 -0.9462

MAPT -1.1343 -0.2711 1.0957 0.8372

MDM2 -0.7498 -0.4855 -0.1788 0.2397

MELK 1.0209 0.2678 -0.8016 0.1012

MIA 1.2408 -0.5475 0.3289 -0.6320

MKI67 1.0446 0.4630 -0.6717 0.3161

MLPH -1.4150 0.4842 0.8829 0.8194

MMP11 -0.1295 0.5220 0.3402 0.5653

MYC 0.5639 -0.9904 -0.3015 -0.2791

NAT1 -0.9711 -0.2708 1.2256 0.9576

ORC6L 1.0086 0.5152 -1.0385 -0.0336

PGR -0.9216 -0.5755 1.2061 0.9278

PHGDH 0.9192 0.0322 -0.5194 -0.5371

PTTG1 0.9541 0.2079 -1.1207 0.1052

RRM2 0.7895 0.6336 -0.8099 0.3228

SFRP1 0.7694 -0.8271 0.2617 -1.0846

SLC39A6 -0.9992 -0.4573 0.6607 0.9222

TMEM45B -1.0721 0.7926 0.3190 0.2016

TYMS 0.9823 -0.0960 -0.8593 0.1827

UBE2C 0.8294 0.3358 -1.0141 0.0608

UBE2T 0.6258 0.0617 -0.8652 -0.0487

[34] After performing the Breast Cancer Intrinsic Subtyping test with a test breast cancer tumor sample and the reference sample provided as part of the test kit, a computational algorithm based on a Pearson's correlation compares the normalized and scaled gene expression profile of the PAM50 intrinsic gene set of the test sample to the prototypical expression signatures of the four breast cancer intrinsic subtypes. The intrinsic subtype analysis is determined by determining the expression of a PAM50 set of genes and the risk of recurrence ("ROR") is determined using the NAN046 set of genes (which is determining the expression of all 50 genes in Table 1 with the exception of determining the expression of MYBL2, BIRC5, GRB7 and CCNB1). Specifically, the intrinsic subtype is identified by comparing the expression of the PAM50 set of genes in the biological sample with the expected expression profiles for the four intrinsic subtypes. The subtype with the most similar expression profile is assigned to the biological sample. The ROR score is an integer value on a 0-100 scale that is related to an individual patient's probability of distant recurrence within 10 years for the defined intended use population. The ROR score is calculated by comparing the expression profiles of the NAN046 genes in the biological sample with the expected profiles for the four intrinsic subtypes, as described above, to calculate four different correlation values. These correlation values are then combined with a proliferation score (and optionally one or more clinicopathological variables, such as tumor size) to calculate the ROR score. Preferably, the ROR score is calculated by comparing only the expression profiles of the NAN046 genes.

[35] The training set of FFPE breast tumor samples, which had well defined clinical characteristics and clinical outcome data, were used to establish a continuous Risk of Recurrence (ROR) score. The score is calculated using coefficients from a Cox model that includes correlation to each intrinsic subtype, a proliferation score (mean gene expression of a subset of 18 of the 46 genes), and tumor size, Table 4.

Table 4. Coefficients to calculate ROR-PT (equation 1)

Test Variables Coefficient

Basal-like Pearson's correlation (A) - 0.0067

Her2-enriched Pearson' s correlation (B) 0.4317

Luminal A Pearson' s correlation (C) - 0.3172

Luminal B Pearson' s correlation (D) 0.4894

Proliferation Score (E) 0.1981

Tumor Size (F) 0.1133

[36] The test variables in Table 4 are multiplied by the corresponding coefficients and summed to produce a risk score ("ROR-PT").

[37] ROR-PT equation = -0.0067*A + 0.4317*B + -0.3172*C + 0.4894*D + 0.1981*E + 0.1133*F

[38] In previous studies, the ROR score provided a continuous estimate of the risk of recurrence for ER-positive, node-negative patients who were treated with tamoxifen for 5 years (Nielsen et al. Clin. Cancer Res., 16(21):5222-5232 (2009)). The ROR score also exhibited a statistically significant improvement over a clinical model based in determining RFS within this test population providing further evidence of the improved accuracy of this decision making tool when compared to traditional clinicopathological measures (Nielsen et al. Clin. Cancer Res., 16(21):5222-5232 (2009)).

[39] The gene set contains many genes that are known markers for proliferation. The methods of the present invention provide for the determination of subsets of genes that provide a proliferation signature. The methods of the present invention can include determining the expression of at least one of, a combination of, or each of, a 18-gene subset of the intrinsic genes of Table 1 selected from ANLN, CCNE1, CDC20, CDC6, CDCA1, CENPF, CEP55, EXOl, KIF2C, KNTC2, MELK, MKI67, ORC6L, PTTGl, RRM2, TYMS, UBE2C and/or UBE2T. Preferably, the expression of each of the 18-gene subset of the gene set of Table 1 is determined to provide a proliferation score. The expression of one or more of these genes may be determined and a proliferation signature index can be generated by averaging the normalized expression estimates of one or more of these genes in a sample. The sample can be assigned a high proliferation signature, a moderate/intermediate proliferation signature, a low proliferation signature or an ultra-low proliferation signature. Methods of determining a proliferation signature from a biological sample are as described in Nielsen et al. Clin. Cancer Res., 16(21):5222-5232 (2009) and supplemental online material (these documents are incorporated herein, by reference, in their entireties).

[40] Breast Cancer

[41] Subjects with breast cancer tumors that fit in the basal-like subtype, classified by intrinsic gene analysis, were surprisingly found to have a better prognosis on average when treated with a breast cancer treatment that included gemcitabine. Also surprisingly, breast cancer tumors that fit in the HER2-enriched subtype were found to have a poorer prognosis on average when treated with a breast cancer treatment that included gemcitabine.

[42] Differentiating the clinical outcome in breast cancer patients demonstrating the basal- like subtype from those demonstrating non-basal-like subtypes administered a breast cancer treatment including gemcitabine when this treatment would not provide increased therapeutic efficacy and be accompanied by worse side effects, improves the clinical outcome and quality of life of thousands of patients.

[43] For the purposes of the present disclosure, "breast cancer" includes, for example, those conditions classified by biopsy or histology as malignant pathology. The clinical delineation of breast cancer diagnoses is well known in the medical arts. One of skill in the art will appreciate that breast cancer refers to any malignancy of the breast tissue, including, for example, carcinomas and sarcomas. Particular embodiments of breast cancer include ductal carcinoma in situ (DCIS), lobular carcinoma in situ (LCIS), or mucinous carcinoma. Breast cancer also refers to infiltrating ductal (IDC), lobular neoplasia or infiltrating lobular carcinoma (ILC). In most embodiments of the disclosure, the subject of interest is a human patient suspected of or actually diagnosed with breast cancer.

[44] Breast cancer includes all forms of cancer of the breast. Breast cancer can include primary epithelial breast cancers. Breast cancer can include cancers in which the breast is involved by other tumors such as lymphoma, sarcoma or melanoma. Breast cancer can include carcinoma of the breast, ductal carcinoma of the breast, lobular carcinoma of the breast, undifferentiated carcinoma of the breast, cystosarcoma phyllodes of the breast, angiosarcoma of the breast, and primary lymphoma of the breast. Breast cancer can include Stage I, II, IIIA, IIIB, IIIC and IV breast cancer. Ductal carcinoma of the breast can include invasive carcinoma, invasive carcinoma in situ with predominant intraductal component, inflammatory breast cancer, and a ductal carcinoma of the breast with a histologic type selected from the group consisting of comedo, mucinous (colloid), medullary, medullary with lymphcytic infiltrate, papillary, scirrhous, and tubular. Lobular carcinoma of the breast can include invasive lobular carcinoma with predominant in situ component, invasive lobular carcinoma, and infiltrating lobular carcinoma. Breast cancer can include Paget' s disease, Paget' s disease with intraductal carcinoma, and Paget' s disease with invasive ductal carcinoma. Breast cancer can include breast neoplasms having histologic and ultrastructual heterogeneity (e.g., mixed cell types).

[45] A breast cancer that is to be treated can include familial breast cancer. A breast cancer that is to be treated can include sporadic breast cancer. A breast cancer that is to be treated can arise in a male subject. A breast cancer that is to be treated can arise in a female subject. A breast cancer that is to be treated can arise in a premenopausal female subject or a postmenopausal female subject.

[46] A breast cancer that is to be treated can include a localized tumor of the breast. A breast cancer that is to be treated can include a tumor of the breast that is associated with a negative sentinel lymph node (SLN) biopsy. A breast cancer that is to be treated can include a tumor of the breast that is associated with a positive sentinel lymph node (SLN) biopsy. A breast cancer that is to be treated can include a tumor of the breast that is associated with one or more positive axillary lymph nodes, where the axillary lymph nodes have been staged by any applicable method. A breast cancer that is to be treated can include a tumor of the breast that has been typed as having nodal negative status (e.g., node-negative) or nodal positive status (e.g., node-positive). A breast cancer that is to be treated can include a tumor of the breast that has metastasized to other locations in the body. A breast cancer that is to be treated can be classified as having metastasized to a location selected from the group consisting of bone, lung, liver, or brain. A breast cancer that is to be treated can be classified according to a characteristic selected from the group consisting of metastatic, localized, regional, local-regional, locally advanced, distant, multicentric, bilateral, ipsilateral, contralateral, newly diagnosed, recurrent, and inoperable.

[47] For the purposes of the present disclosure, "a breast cancer treatment comprising gemcitabine" is a breast cancer treatment that includes gemcitabine. A "breast cancer treatment comprising gemcitabine" can also be a breast cancer treatment that includes an analog or derivative of gemcitabine or another nucleoside anti-tumor agent. These treatments can also include other anti-cancer or chemotherapeutic agents.

[48] For the purposes of the present disclosure, "a breast cancer treatment not comprising gemcitabine" is a breast cancer treatment that does not include any gemcitabine. These treatments contain other anti-cancer or chemotherapeutic agents.

[49] Classes of anti-cancer or chemotherapeutic agents can include anthracycline agents, alkylating agents, platinum agents, taxanes, vinca agents, anti-estrogen drugs, aromatase inhibitors, ovarian suppression agents, endocrine/hormonal agents, bisphophonate therapy agents and targeted biological therapy agents.

[50] Specific anti-cancer or chemotherapeutic agents can include anthracyclines, cyclophosphamide, fluorouracil (or 5-fluorouracil or 5-FU), methotrexate, thiotepa, carboplatin, cisplatin, taxanes, paclitaxel, protein-bound paclitaxel, docetaxel, vinorelbine, tamoxifen, raloxifene, toremifene, fulvestrant, irinotecan, ixabepilone, temozolmide, topotecan, vincristine, vinblastine, eribulin, mutamycin, capecitabine, capecitabine, anastrozole, exemestane, letrozole, leuprolide, abarelix, buserlin, goserelin, megestrol acetate, risedronate, pamidronate, ibandronate, alendronate, denosumab, zoledronate, trastuzumab, tykerb or bevacizumab, or combinations thereof.

[51] Combinational anti-cancer or chemotherapeutic therapies can include AT:

Adriamycin ® (Doxorubicin) and Taxotere ® (Docetaxel); AC: Adriamycin ® , Cytoxan ® (Cyclophosphamide); AC + Taxol ® ; AC + Taxotere ® ; CMF: Cytoxan ® , Methotrexate, 5- fluorouracil; CEF: Cytoxan ® , Ellence ® (Epirubicin), and fluorouracil; EC: Ellence ® ,

Cytoxan ® ; FAC: 5-fluorouracil, Adriamycin ® , and Cytoxan ® ; GET: Gemzar ® (Gemcitabine), Ellence ® , and Taxol ® ; TC: Taxotere ® , Cytoxan ® ; TC: Taxotere ® , Paraplatin ® (Carboplatin); TAC: Taxotere ® , Adriamycin ® , Cytoxan ® or TCH: Taxotere ® , Herceptin ® (Trastuzumab), and Paraplatin ® . Additional combination chemotherapeutic therapies for metastatic breast cancer can include: Taxol ® and Xeloda ® (Capecitabine); Taxotere ® and Xeloda ® ; Taxotere ® and Paraplatin ® ; Taxol ® and Paraplatin ® ; Taxol ® and Gemzar ® ; Abraxane ® (Protein-bound Paclitaxel) and Xeloda ® ; Abraxane ® and Paraplatin ® ; Camptosor ® (Irinotecan) and Temodar ® (Temozolomide); Gemzar ® and Paraplatin ® or Ixempra ® (Ixabepilone) and Xeloda ®

[52] Preferably, the anti-cancer or chemotherapeutic agents include one or more taxanes. More preferably, the taxanes are paclitaxel or docetaxel.

[53] Preferably gemcitabine is administered intravenously, but can be administered by any method known in the art. In certain embodiments, a subject or patient receives gemcitabine, administered at about 2500 mg/m 2 to about 50 mg/m 2 , once daily. In certain embodiments, gemcitabine is administered at a decreased dose to reduce toxicity. For example, gemcitabine is administered at 1500 mg/m 2 , 1250 mg/m 2 , 1000 mg/m 2 , 750 mg/m 2 , 500 mg/m 2 , 250 mg/m 2 , 100 mg/m 2 , or 50 mg/m 2 once daily.

[54] The taxane agents may be administered in any manner found appropriate by a clinician in generally accepted efficacious dose ranges such as those described in the

Physician Desk Reference, 53th Ed. (1999), Publisher Edward R. Barnhart, New Jersey ("PDR"). Preferably taxanes are administered intravenously, but can be administered by any method known in the art. In general, paclitaxel is administered at dosages from about 135 to about 300 mg/m 2 , preferably from about 135 to about 175 mg/m 2 , and most preferably about 175 mg/m 2 daily. In general, docetaxel is administered at dosages from about 60 to about 100 mg/m 2 , and most preferably about 75 mg/m 2 daily.

[55] The article "a" and "an" are used herein to refer to one or more than one (i.e. , to at least one) of the grammatical object of the article. By way of example, "an element" means one or more element.

[56] Throughout the specification the word "comprising," or variations such as

"comprises" or "comprising," will be understood to imply the inclusion of a stated element, integer or step, or group of elements, integers or steps, but not the exclusion of any other element, integer or step, or group of elements, integers or steps.

[57] Description of Intrinsic Subtype Biology

[58] Luminal subtypes: The most common subtypes of breast cancer are the luminal subtypes, Luminal A and Luminal B. Prior studies suggest that luminal A comprises approximately 30% to 40% and luminal B approximately 20% of all breast cancers, but they represent over 90 % of hormone receptor positive breast cancers (Nielsen et al. Clin. Cancer Res., 16(21):5222-5232 (2009)). The gene expression pattern of these subtypes resembles the luminal epithelial component of the breast. These tumors are characterized by high expression of estrogen receptor (ER), progesterone receptor (PR), and genes associated with ER activation, such as LIV1, GATA3, and cyclin Dl, as well as expression of luminal cytokeratins 8 and 18 (Lisa Carey & Charles Perou (2009). Gene Arrays, Prognosis, and Therapeutic Interventions. Jay R. Harris et al. (4th ed.), Diseases of the breast (pp. 458-472). Philadelphia, PA: Lippincott Williams & Wilkins).

[59] Luminal A: Luminal A (LumA) breast cancers exhibit low expression of genes associated with cell cycle activation and the ERBB2 cluster resulting in a better prognosis than Luminal B. The Luminal A subgroup has the most favorable prognosis of all subtypes and is enriched for endocrine therapy-responsive tumors.

[60] Luminal B: Luminal B (LumB) breast cancers also express ER and ER-associated genes. Genes associated with cell cycle activation are highly expressed and this tumor type can be HER2(+) (-20%) or HER2(-). The prognosis is unfavorable (despite ER expression) and endocrine therapy responsiveness is generally diminished relative to LumA.

[61] HER2-enriched: The HER2-enriched subtype is generally ER-negative and is HER2- positive in the majority of cases with high expression of the ERBB2 cluster, including ERBB2 and GRB7. Genes associated with cell cycle activation are highly expressed and these tumors have a poor outcome.

[62] Basal-like: The Basal-like subtype is generally ER- negative, is almost always clinically HER2-negative and expresses a suite of "basal" biomarkers including the basal epithelial cytokeratins (CK) and epidermal growth factor receptor (EGFR). Genes associated with cell cycle activation are highly expressed.

[63] Clinical variables

[64] The PAM50 classification model described herein may be further combined with information on clinical variables to generate a continuous risk of relapse (ROR) predictor. As described herein, a number of clinical and prognostic breast cancer factors are known in the art and are used to predict treatment outcome and the likelihood of disease recurrence. Such factors include, for example, lymph node involvement, tumor size, histologic grade, estrogen and progesterone hormone receptor status, HER-2 levels, and tumor ploidy. In one embodiment, risk of relapse (ROR) score is provided for a subject diagnosed with or suspected of having breast cancer. This score uses the PAM50 classification model in combination with clinical factors of lymph node status (N) and tumor size (T). Assessment of clinical variables is based on the American Joint Committee on Cancer (AJCC) standardized system for breast cancer staging. In this system, primary tumor size is categorized on a scale of 0-4 (TO: no evidence of primary tumor; Tl : < 2 cm; T2: > 2 cm - < 5 cm; T3 : > 5 cm; T4: tumor of any size with direct spread to chest wall or skin). Lymph node status is classified as N0-N3 (NO: regional lymph nodes are free of metastasis; Nl : metastasis to movable, same-side axillary lymph node(s); N2: metastasis to same-side lymph node(s) fixed to one another or to other structures; N3: metastasis to same-side lymph nodes beneath the breastbone). Methods of identifying breast cancer patients and staging the disease are well known and may include manual examination, biopsy, review of patient's and/or family history, and imaging techniques, such as mammography, magnetic resonance imaging (MRI), and positron emission tomography (PET).

[65] Sample Source

[66] In one embodiment of the present disclosure, breast cancer subtype is assessed through the evaluation of expression patterns, or profiles, of the intrinsic genes listed in Table 1 in one or more subject samples and/or FISH analysis or IHC performed to ascertain the Her-2 status of the cancer. For the purpose of discussion, the term subject, or subject sample, refers to an individual regardless of health and/or disease status. A subject can be a subject, a study participant, a control subject, a screening subject, or any other class of individual from whom a sample is obtained and assessed in the context of the disclosure. Accordingly, a subject can be diagnosed with breast cancer, can present with one or more symptoms of breast cancer, or a predisposing factor, such as a family (genetic) or medical history (medical) factor, for breast cancer, can be undergoing treatment or therapy for breast cancer, or the like. As such, the subject is a subject in need of treatment for breast cancer or detection of breast cancer. Alternatively, a subject can be healthy with respect to any of the aforementioned factors or criteria. It will be appreciated that the term "healthy" as used herein, is relative to breast cancer status, as the term "healthy" cannot be defined to correspond to any absolute evaluation or status. Thus, an individual defined as healthy with reference to any specified disease or disease criterion, can in fact be diagnosed with any other one or more diseases, or exhibit any other one or more disease criterion, including one or more cancers other than breast cancer. However, the healthy controls are preferably free of any cancer.

[67] As used herein, a "subject in need thereof is a subject having breast cancer or presenting with one or more symptoms of breast cancer, or a subject having an increased risk of developing breast cancer relative to the population at large. Preferably, a subject in need thereof has breast cancer. The breast cancer can be primary breast cancer, locally advanced breast cancer or metastatic breast cancer. A "subject" includes a mammal. The mammal can be e.g. , any mammal, e.g. , a human, primate, bird, mouse, rat, fowl, dog, cat, cow, horse, goat, camel, sheep or a pig. Preferably, the mammal is a human.

[68] In particular embodiments, the methods for predicting breast cancer intrinsic subtypes or Her-2 status include collecting a biological sample comprising a cancer cell or tissue, such as a breast tissue sample or a primary breast tumor tissue sample. By "biological sample" is intended any sampling of cells, tissues, or bodily fluids in which expression of an intrinsic gene can be detected. Examples of such biological samples include, but are not limited to, biopsies and smears. Bodily fluids useful in the present disclosure include blood, lymph, urine, saliva, nipple aspirates, gynecological fluids, or any other bodily secretion or derivative thereof. Blood can include whole blood, plasma, serum, or any derivative of blood. In some embodiments, the biological sample includes breast cells, particularly breast tissue from a biopsy, such as a breast tumor tissue sample. Biological samples may be obtained from a subject by a variety of techniques including, for example, by scraping or swabbing an area, by using a needle to aspirate cells or bodily fluids, or by removing a tissue sample (i.e. , biopsy). Methods for collecting various biological samples are well known in the art. In some embodiments, a breast tissue sample is obtained by, for example, fine needle aspiration biopsy, core needle biopsy, or excisional biopsy. Fixative and staining solutions may be applied to the cells or tissues for preserving the specimen and for facilitating examination. Biological samples, particularly breast tissue samples, may be transferred to a glass slide for viewing under magnification. In one embodiment, the biological sample is a formalin-fixed, paraffin-embedded breast tissue sample, particularly a primary breast tumor sample. In various embodiments, the tissue sample is obtained from a pathologist-guided tissue core sample.

[69] Expression Profiling

[70] In various embodiments, the present disclosure provides methods for classifying, prognosticating, or monitoring breast cancer in subjects. In this embodiment, data obtained from analysis of intrinsic gene expression is evaluated using one or more pattern recognition algorithms. Such analysis methods may be used to form a predictive model, which can be used to classify test data. For example, one convenient and particularly effective method of classification employs multivariate statistical analysis modeling, first to form a model (a "predictive mathematical model") using data ("modeling data") from samples of known subtype {e.g. , from subjects known to have a particular breast cancer intrinsic subtype: LumA, LumB, Basal-like, HER2-enriched, or normal-like), and second to classify an unknown sample {e.g. , "test sample") according to subtype. Pattern recognition methods have been used widely to characterize many different types of problems ranging, for example, over linguistics, fingerprinting, chemistry and psychology. In the context of the methods described herein, pattern recognition is the use of multivariate statistics, both parametric and non-parametric, to analyze data, and hence to classify samples and to predict the value of some dependent variable based on a range of observed measurements. There are two main approaches. One set of methods is termed "unsupervised" and these simply reduce data complexity in a rational way and also produce display plots which can be interpreted by the human eye. However, this type of approach may not be suitable for developing a clinical assay that can be used to classify samples derived from subjects independent of the initial sample population used to train the prediction algorithm.

[71] The other approach is termed "supervised" whereby a training set of samples with known class or outcome is used to produce a mathematical model which is then evaluated with independent validation data sets. Here, a "training set" of intrinsic gene expression data is used to construct a statistical model that predicts correctly the "subtype" of each sample. This training set is then tested with independent data (referred to as a test or validation set) to determine the robustness of the computer-based model. These models are sometimes termed "expert systems," but may be based on a range of different mathematical procedures. Supervised methods can use a data set with reduced dimensionality (for example, the first few principal components), but typically use unreduced data, with all dimensionality. In all cases the methods allow the quantitative description of the multivariate boundaries that characterize and separate each subtype in terms of its intrinsic gene expression profile. It is also possible to obtain confidence limits on any predictions, for example, a level of probability to be placed on the goodness of fit. The robustness of the predictive models can also be checked using cross-validation, by leaving out selected samples from the analysis.

[72] The PAM50 classification model described herein is based on the gene expression profile for a plurality of subject samples using the intrinsic genes listed in Table 1. The plurality of samples includes a sufficient number of samples derived from subjects belonging to each subtype class. By "sufficient samples" or "representative number" in this context is intended a quantity of samples derived from each subtype that is sufficient for building a classification model that can reliably distinguish each subtype from all others in the group. A supervised prediction algorithm is developed based on the profiles of objectively-selected prototype samples for "training" the algorithm. The samples are selected and subtyped using an expanded intrinsic gene set according to the methods disclosed in International Patent Publication WO 2007/061876 and U.S. Patent Publication No. 2009/0299640, which is herein incorporated by reference in its entirety. Alternatively, the samples can be subtyped according to any known assay for classifying breast cancer subtypes. After stratifying the training samples according to subtype, a centroid-based prediction algorithm is used to construct centroids based on the expression profile of the intrinsic gene set described in Table 1.

[73] In one embodiment, the prediction algorithm is the nearest centroid methodology related to that described in Narashiman and Chu (2002) PNAS 99:6567-6572, which is herein incorporated by reference in its entirety. In the present disclosure, the method computes a standardized centroid for each subtype. This centroid is the average gene expression for each gene in each subtype (or "class") divided by the within-class standard deviation for that gene. Nearest centroid classification takes the gene expression profile of a new sample, and compares it to each of these class centroids. Subtype prediction is done by calculating the Spearman's rank correlation of each test case to the five centroids, and assigning a sample to a subtype based on the nearest centroid.

[74] Detection of intrinsic gene expression

[75] Any methods available in the art for detecting expression of the intrinsic genes listed in Table 1 are encompassed herein. By "detecting expression" is intended determining the quantity or presence of an RNA transcript or its expression product of an intrinsic gene. Methods for detecting expression of the intrinsic genes of the disclosure, that is, gene expression profiling, include methods based on hybridization analysis of polynucleotides, methods based on sequencing of polynucleotides, immunohistochemistry methods, and proteomics-based methods. The methods generally detect expression products (e.g. , mRNA) of the intrinsic genes listed in Table 1. In preferred embodiments, PCR-based methods, such as reverse transcription PCR (RT-PCR) (Weis et al , TIG 8:263- 64, 1992), and array-based methods such as microarray (Schena et al , Science 270:467- 70, 1995) are used. By "microarray" is intended an ordered arrangement of hybridizable array elements, such as, for example, polynucleotide probes, on a substrate. The term "probe" refers to any molecule that is capable of selectively binding to a specifically intended target biomolecule, for example, a nucleotide transcript or a protein encoded by or corresponding to an intrinsic gene. Probes can be synthesized by one of skill in the art, or derived from appropriate biological preparations. Probes may be specifically designed to be labeled. Examples of molecules that can be utilized as probes include, but are not limited to, RNA, DNA, proteins, antibodies, and organic molecules.

[76] Many expression detection methods use isolated RNA. The starting material is typically total RNA isolated from a biological sample, such as a tumor or tumor cell line, and corresponding normal tissue or cell line, respectively. If the source of RNA is a primary tumor, RNA (e.g. , mRNA) can be extracted, for example, from frozen or archived paraffin- embedded and fixed (e.g. , formalin- fixed) tissue samples (e.g. , pathologist-guided tissue core samples).

[77] General methods for RNA extraction are well known in the art and are disclosed in standard textbooks of molecular biology, including Ausubel et al , ed., Current Protocols in Molecular Biology, John Wiley & Sons, New York 1987-1999. Methods for RNA extraction from paraffin embedded tissues are disclosed, for example, in Rupp and Locker, Lab Invest. 56:A67, (1987); and De Andres et al. Biotechniques 18:42-44, (1995). In particular, RNA isolation can be performed using a purification kit, a buffer set and protease from commercial manufacturers, such as Qiagen (Valencia, CA), according to the manufacturer's instructions. For example, total RNA from cells in culture can be isolated using Qiagen RNeasy mini- columns. Other commercially available RNA isolation kits include MASTERPURE™ Complete DNA and RNA Purification Kit (Epicentre, Madison, Wis.) and Paraffin Block RNA Isolation Kit (Ambion, Austin, TX). Total RNA from tissue samples can be isolated, for example, using RNA Stat-60 (Tel-Test, Friendswood, TX). RNA prepared from a tumor can be isolated, for example, by cesium chloride density gradient centrifugation.

Additionally, large numbers of tissue samples can readily be processed using techniques well known to those of skill in the art, such as, for example, the single-step RNA isolation process of Chomczynski (U.S. Pat. No. 4,843,155).

[78] Isolated RNA can be used in hybridization or amplification assays that include, but are not limited to, PCR analyses and probe arrays. One method for the detection of RNA levels involves contacting the isolated RNA with a nucleic acid molecule (probe) that can hybridize to the mRNA encoded by the gene being detected. The nucleic acid probe can be, for example, a full-length cDNA, or a portion thereof, such as an oligonucleotide of at least 7, 15, 30, 60, 100, 250, or 500 nucleotides in length and sufficient to specifically hybridize under stringent conditions to an intrinsic gene of the present disclosure, or any derivative DNA or RNA. Hybridization of an mRNA with the probe indicates that the intrinsic gene in question is being expressed.

[79] In one embodiment, the mRNA is immobilized on a solid surface and contacted with a probe, for example by running the isolated mRNA on an agarose gel and transferring the mRNA from the gel to a membrane, such as nitrocellulose. In an alternative embodiment, the probes are immobilized on a solid surface and the mRNA is contacted with the probes, for example, in an Agilent gene chip array. A skilled artisan can readily adapt known mRNA detection methods for use in detecting the level of expression of the intrinsic genes of the present disclosure.

[80] An alternative method for determining the level of intrinsic gene expression product in a sample involves the process of nucleic acid amplification, for example, by RT-PCR (U.S. Pat. No. 4,683,202), ligase chain reaction (Barany, PNAS USA 88: 189-93, (1991)), self sustained sequence replication (Guatelli et al. , Proc. Natl. Acad. Sci USA 87: 1874-78, (1990)), transcriptional amplification system (Kwoh et al. , Proc. Natl. Acad. ScL USA 86: 1173-77, (1989)), Q-Beta Replicase (Lizardi et al , Bio/Technology 6: 1197, (1988)), rolling circle replication (U.S. Pat. No. 5,854,033), or any other nucleic acid amplification method, followed by the detection of the amplified molecules using techniques well known to those of skill in the art. These detection schemes are especially useful for the detection of nucleic acid molecules if such molecules are present in very low numbers.

[81] In particular aspects of the disclosure, intrinsic gene expression can assessed by quantitative RT-PCR. Numerous different PCR or QPCR protocols are known in the art and exemplified herein below and can be directly applied or adapted for use using the presently- described compositions for the detection and/or quantification of the intrinsic genes listed in Table 1. Generally, in PCR, a target polynucleotide sequence is amplified by reaction with at least one oligonucleotide primer or pair of oligonucleotide primers. The primer(s) hybridize to a complementary region of the target nucleic acid and a DNA polymerase extends the primer(s) to amplify the target sequence. Under conditions sufficient to provide polymerase- based nucleic acid amplification products, a nucleic acid fragment of one size dominates the reaction products (the target polynucleotide sequence which is the amplification product). The amplification cycle is repeated to increase the concentration of the single target polynucleotide sequence. The reaction can be performed in any thermocycler commonly used for PCR. However, preferred are cyclers with real time fluorescence measurement capabilities, for example, SMARTCYCLER® (Cepheid, Sunnyvale, CA), ABI PRISM 7700® (Applied Biosystems, Foster City, Calif.), ROTOR- GENE™ (Corbett Research, Sydney, Australia), LIGHTCYCLER® (Roche Diagnostics Corp, Indianapolis, Ind.), ICYCLER® (Biorad Laboratories, Hercules, Calif.) and MX4000® (Stratagene, La Jolla, Calif.).

[82] In another embodiment of the disclosure, microarrays are used for expression profiling. Microarrays are particularly well suited for this purpose because of the reproducibility between different experiments. DNA microarrays provide one method for the simultaneous measurement of the expression levels of large numbers of genes. Each array consists of a reproducible pattern of capture probes attached to a solid support. Labeled RNA or DNA is hybridized to complementary probes on the array and then detected by laser scanning. Hybridization intensities for each probe on the array are determined and converted to a quantitative value representing relative gene expression levels. See, for example, U.S. Pat. Nos. 6,040, 138, 5,800,992 and 6,020,135, 6,033,860, and 6,344,316. High-density oligonucleotide arrays are particularly useful for determining the gene expression profile for a large number of RNAs in a sample.

[83] In a preferred embodiment, the nCounter® Analysis system is used to detect intrinsic gene expression. The basis of the nCounter® Analysis system is the unique code assigned to each nucleic acid target to be assayed (International Patent Application Publication No. WO 08/124847, U.S. Patent No. 8,415,102 and Geiss et al. Nature Biotechnology. 2008. 26(3): 317-325; the contents of which are each incorporated herein by reference in their entireties). The code is composed of an ordered series of colored fluorescent spots which create a unique barcode for each target to be assayed. A pair of probes is designed for each DNA or RNA target, a biotinylated capture probe and a reporter probe carrying the fluorescent barcode. This system is also referred to, herein, as the nanoreporter code system. [84] Specific reporter and capture probes are synthesized for each target. The reporter probe can comprise at a least a first label attachment region to which are attached one or more label monomers that emit light constituting a first signal; at least a second label attachment region, which is non-over-lapping with the first label attachment region, to which are attached one or more label monomers that emit light constituting a second signal; and a first target- specific sequence. Preferably, each sequence specific reporter probe comprises a target specific sequence capable of hybridizing to no more than one PAM50 gene of Table 1 and optionally comprises at least three, or at least four label attachment regions, said attachment regions comprising one or more label monomers that emit light, constituting at least a third signal, or at least a fourth signal, respectively. The capture probe can comprise a second target-specific sequence; and a first affinity tag. In some embodiments, the capture probe can also comprise one or more label attachment regions. Preferably, the first target- specific sequence of the reporter probe and the second target- specific sequence of the capture probe hybridize to different regions of the same gene of Table 1 to be detected. Reporter and capture probes are all pooled into a single hybridization mixture, the "probe library".

Preferably, the probe library comprises a probe pair (a capture probe and reporter) for each of the PAM50 genes in Table 1.

[85] The relative abundance of each target is measured in a single multiplexed

hybridization reaction. The method comprises contacting a biological sample with a probe library, the library comprising a probe pair for the PAM50 genes in Table 1, such that the presence of the target in the sample creates a probe pair - target complex. The complex is then purified. More specifically, the sample is combined with the probe library, and hybridization occurs in solution. After hybridization, the tripartite hybridized complexes (probe pairs and target) are purified in a two-step procedure using magnetic beads linked to oligonucleotides complementary to universal sequences present on the capture and reporter probes. This dual purification process allows the hybridization reaction to be driven to completion with a large excess of target-specific probes, as they are ultimately removed, and, thus, do not interfere with binding and imaging of the sample. All post hybridization steps are handled robotically on a custom liquid-handling robot (Prep Station, NanoString

Technologies).

[86] Purified reactions are deposited by the Prep Station into individual flow cells of a sample cartridge, bound to a streptavidin-coated surface via the capture probe,

electrophoresed to elongate the reporter probes, and immobilized. After processing, the sample cartridge is transferred to a fully automated imaging and data collection device (Digital Analyzer, NanoString Technologies). The expression level of a target is measured by imaging each sample and counting the number of times the code for that target is detected. For each sample, typically 600 fields-of-view (FOV) are imaged (1376 X 1024 pixels) representing approximately 10 mm 2 of the binding surface. Typical imaging density is 100- 1200 counted reporters per field of view depending on the degree of multiplexing, the amount of sample input, and overall target abundance. Data is output in simple spreadsheet format listing the number of counts per target, per sample.

[87] This system can be used along with nanoreporters. Additional disclosure regarding nanoreporters can be found in International Publication No. WO 07/076129 and WO

07/076132, and US Patent Publication No. 2010/0015607 and 2010/0261026, the contents of which are incorporated herein in their entireties. Further, the term nucleic acid probes and nanoreporters can include the rationally designed (e.g. synthetic sequences) described in International Publication No. WO 2010/019826 and US Patent Publication No.

2010/0047924, incorporated herein by reference in its entirety.

[88] Data processing

[89] It is often useful to pre-process gene expression data, for example, by addressing missing data, translation, scaling, normalization, weighting, etc. Multivariate projection methods, such as principal component analysis (PCA) and partial least squares analysis (PLS), are so-called scaling sensitive methods. By using prior knowledge and experience about the type of data studied, the quality of the data prior to multivariate modeling can be enhanced by scaling and/or weighting. Adequate scaling and/or weighting can reveal important and interesting variation hidden within the data, and therefore make subsequent multivariate modeling more efficient. Scaling and weighting may be used to place the data in the correct metric, based on knowledge and experience of the studied system, and therefore reveal patterns already inherently present in the data.

[90] If possible, missing data, for example gaps in column values, should be avoided. However, if necessary, such missing data may be replaced or "filled" with, for example, the mean value of a column ("mean fill"); a random value ("random fill"); or a value based on a principal component analysis ("principal component fill").

[91] "Translation" of the descriptor coordinate axes can be useful. Examples of such translation include normalization and mean centering. "Normalization" may be used to remove sample-to-sample variation. For microarray data, the process of normalization aims to remove systematic errors by balancing the fluorescence intensities of the two labeling dyes. The dye bias can come from various sources including differences in dye labeling efficiencies, heat and light sensitivities, as well as scanner settings for scanning two channels. Some commonly used methods for calculating normalization factor include: (i) global normalization that uses all genes on the array; (ii) housekeeping genes normalization that uses constantly expressed housekeeping/invariant genes; and (iii) internal controls normalization that uses known amount of exogenous control genes added during

hybridization (Quackenbush Nat. Genet. 32 (Suppl.), 496-501 (2002)). In one embodiment, the intrinsic genes disclosed herein can be normalized to control housekeeping genes. For example, the housekeeping genes described in U.S. Patent Publication 2008/0032293, which is herein incorporated by reference in its entirety, can be used for normalization. Exemplary housekeeping genes include MRPL19, PSMC4, SF3A1, PUM1, ACTB, GAPD, GUSB, RPLPO, and TFRC. It will be understood by one of skill in the art that the methods disclosed herein are not bound by normalization to any particular housekeeping genes, and that any suitable housekeeping gene(s) known in the art can be used.

[92] Many normalization approaches are possible, and they can often be applied at any of several points in the analysis. In one embodiment, microarray data is normalized using the LOWESS method, which is a global locally weighted scatterplot smoothing normalization function. In another embodiment, qPCR data is normalized to the geometric mean of set of multiple housekeeping genes.

[93] "Mean centering" may also be used to simplify interpretation. Usually, for each descriptor, the average value of that descriptor for all samples is subtracted. In this way, the mean of a descriptor coincides with the origin, and all descriptors are "centered" at zero. In "unit variance scaling," data can be scaled to equal variance. Usually, the value of each descriptor is scaled by 1/StDev, where StDev is the standard deviation for that descriptor for all samples. "Pareto scaling" is, in some sense, intermediate between mean centering and unit variance scaling. In pareto scaling, the value of each descriptor is scaled by l/sqrt(StDev), where StDev is the standard deviation for that descriptor for all samples. In this way, each descriptor has a variance numerically equal to its initial standard deviation. The pareto scaling may be performed, for example, on raw data or mean centered data.

[94] "Logarithmic scaling" may be used to assist interpretation when data have a positive skew and/or when data spans a large range, e.g. , several orders of magnitude. Usually, for each descriptor, the value is replaced by the logarithm of that value. In "equal range scaling," each descriptor is divided by the range of that descriptor for all samples. In this way, all descriptors have the same range, that is, 1. However, this method is sensitive to presence of outlier points. In "autoscaling," each data vector is mean centered and unit variance scaled. This technique is a very useful because each descriptor is then weighted equally, and large and small values are treated with equal emphasis. This can be important for genes expressed at very low, but still detectable, levels.

[95] In one embodiment, data is collected for one or more test samples and classified using the PAM50 classification model described herein. When comparing data from multiple analyses (e.g. , comparing expression profiles for one or more test samples to the centroids constructed from samples collected and analyzed in an independent study), it will be necessary to normalize data across these data sets. In one embodiment, Distance Weighted Discrimination (DWD) is used to combine these data sets together (Benito et al. (2004) Bioinformatics 20(1): 105-114, incorporated by reference herein in its entirety). DWD is a multivariate analysis tool that is able to identify systematic biases present in separate data sets and then make a global adjustment to compensate for these biases; in essence, each separate data set is a multi-dimensional cloud of data points, and DWD takes two points clouds and shifts one such that it more optimally overlaps the other.

[96] The methods described herein may be implemented and/or the results recorded using any device capable of implementing the methods and/or recording the results. Examples of devices that may be used include but are not limited to electronic computational devices, including computers of all types. When the methods described herein are implemented and/or recorded in a computer, the computer program that may be used to configure the computer to carry out the steps of the methods may be contained in any computer readable medium capable of containing the computer program. Examples of computer readable medium that may be used include but are not limited to diskettes, CD- ROMs, DVDs, ROM, RAM, and other memory and computer storage devices. The computer program that may be used to configure the computer to carry out the steps of the methods and/or record the results may also be provided over an electronic network, for example, over the internet, an intranet, or other network.

[97] Calculation of risk of relapse

[98] Provided herein are methods for predicting breast cancer outcome within the context of the intrinsic subtype and optionally other clinical variables. Outcome may refer to overall or disease-specific survival, event-free survival, or outcome in response to a particular treatment or therapy. In particular, the methods may be used to predict the likelihood of long-term, disease-free survival. "Predicting the likelihood of survival of a breast cancer patient" is intended to assess the risk that a patient will die as a result of the underlying breast cancer. "Long-term, disease-free survival" is intended to mean that the patient does not die from or suffer a recurrence of the underlying breast cancer within a period of at least five years, or at least ten or more years, following initial diagnosis or treatment.

[99] In one embodiment, outcome is predicted based on classification of a subject according to subtype. This classification is based on expression profiling using the list of intrinsic genes listed in Table 1. In addition to providing a subtype assignment, the PAM50 bioinformatics model provides a measurement of the similarity of a test sample to all four subtypes which is translated into a Risk of Relapse (ROR) score that can be used in any patient population regardless of disease status and treatment options. The intrinsic subtypes and ROR also have value in the prediction of pathological complete response in women treated with, for example, neoadjuvant taxane and anthracycline chemotherapy (Rouzier et ah , J Clin Oncol 23:8331-9 (2005), incorporated herein by reference in its entirety). Thus, in various embodiments of the present disclosure, a risk of relapse (ROR) model is used to predict outcome. Using these risk models, subjects can be stratified into low, medium, and high risk of relapse groups. Calculation of ROR can provide prognostic information to guide treatment decisions and/or monitor response to therapy.

[100] In some embodiments described herein, the prognostic performance of the PAM50- defined intrinsic subtypes and/or other clinical parameters is assessed utilizing a Cox Proportional Hazards Model Analysis, which is a regression method for survival data that provides an estimate of the hazard ratio and its confidence interval. The Cox model is a well- recognized statistical technique for exploring the relationship between the survival of a patient and particular variables. This statistical method permits estimation of the hazard {i.e. , risk) of individuals given their prognostic variables {e.g. , intrinsic gene expression profile with or without additional clinical factors, as described herein). The "hazard ratio" is the risk of death at any given time point for patients displaying particular prognostic variables. See generally Spruance et al , Antimicrob. Agents & Chemo. 48:2787-92 (2004).

[101] The PAM50 classification model described herein can be trained for risk of relapse using subtype distances (or correlations) alone, or using subtype distances with clinical variables as discussed supra. In one embodiment, the risk score for a test sample is calculated using intrinsic subtype distances alone using the following equation:

[102] ROR = 0.05*Basal + 0.1 l*Her2 + -0.25*LumA + 0.07*LumB + -0.1 l*Normal, where the variables "Basal," "Her2," "LumA," "LumB," and "Normal" are the distances to the centroid for each respective classifier when the expression profile from a test sample is compared to centroids constructed using the gene expression data deposited with the Gene Expression Omnibus (GEO) as accession number GSE2845. [103] Risk score can also be calculated using a combination of breast cancer subtype and the clinical variables tumor size (T) and lymph nodes status (N) using the following equation: ROR (full) = 0.05*Basal + 0.1*Her2 + -0.19*LumA + 0.05*LumB + - 0.09*Normal + 0.16*T + 0.08*N, again when comparing test expression profiles to centroids constructed using the gene expression data deposited with GEO as accession number GSE2845.

[104] In yet another embodiment, risk score for a test sample is calculated using intrinsic subtype distances alone using the following equation:

[105] ROR-S = 0.05*Basal + 0.12*Her2 + -0.34*LumA + 0.0.23*LumB, where the variables "Basal," "Her2," "LumA," and "LumB" are as described supra and the test expression profiles are compared to centroids constructed using the gene expression data deposited with GEO as accession number GSE2845. In yet another embodiment, risk score can also be calculated using a combination of breast cancer subtype and the clinical variable tumor size (T) using the following equation (where the variables are as described supra): ROR-C = 0.05*Basal + 0.1 l*Her2 + -0.23*LumA + 0.09*LumB + 0.17*T.

[106] In yet another embodiment, risk score for a test sample is calculated using intrinsic subtype distances in combination with the proliferation signature ("Prolif ') using the following equation:

[107] ROR-P = -0.001 *Basal + 0.7*Her2 + -0.95*LumA + 0.49*LumB + 0.34*Prolif, where the variables "Basal," "Her2," "LumA," "LumB" and "Prolif are as described supra and the test expression profiles are compared to centroids constructed using the gene expression data deposited with GEO as accession number GSE2845.

[108] In yet another embodiment, risk score can also be calculated using a combination of breast cancer subtype, proliferation signature and the clinical variable tumor size (T) using the ROR-PT described in conjunction with Table 3, supra.

[109] Detection of Subtypes

[110] Immunohistochemistry for estrogen (ER), progesterone (PgR), HER2, and Ki67 can be performed concurrently on serial sections with the standard streptavidin-biotin complex method with 3,3'-diaminobenzidine as the chromogen. Staining for ER, PgR, and HER2 interpretation can be performed as described previously (Cheang et ah , Clin Cancer Res. 2008;14(5): 1368-1376.), however any method known in the art may be used.

[Ill] For example, a Ki67 antibody (clone SP6; ThermoScientific, Fremont, CA) can be applied at a 1:200 dilution for 32 minutes, by following the Ventana Benchmark automated immunostainer (Ventana, Tucson AZ) standard Cell Conditioner 1 (CC1, a proprietary buffer) protocol at 98°C for 30 minutes. An ER antibody (clone SP1; ThermoFisher Scientific, Fremont CA) can be used at 1:250 dilution with 10-minute incubation, after an 8- minute microwave antigen retrieval in 10 mM sodium citrate (pH 6.0). Ready-to-use PR antibody (clone 1E2; Ventana) can be used by following the CC1 protocol as above. HER2 staining can be done with a SP3 antibody (ThermoFisher Scientific) at a 1: 100 dilution after antigen retrieval in 0.05 M Tris buffer (pH 10.0) with heating to 95°C in a steamer for 30 minutes. For HER2 fluorescent in situ hybridization (FISH) assay, slides can be hybridized with probes to LSI (locus-specific identifier) HER2/neu and to centromere 17 by use of the PathVysion HER- 2 DNA Probe kit (Abbott Molecular, Abbott Park, IL) according to manufacturer's instructions, with modifications to pretreatment and hybridization as previously described (Brown LA, Irving J, Parker R, et al. Amplification of EMSY, a novel oncogene on l lql3, in high grade ovarian surface epithelial carcinomas. Gynecol Oncol. 2006;100(2):264-270). Slides can then be counterstained with 4',6-diamidino-2- phenylindole, stained material was visualized on a Zeiss Axioplan epifluorescent microscope, and signals were analyzed with a Metafer image acquisition system (Metasystems,

Altlussheim, Germany). Biomarker expression from immunohistochemistry assays can then be scored by two pathologists, who were blinded to the clinicopathological characteristics and outcome and who used previously established and published criteria for biomarker expression levels that had been developed on other breast cancer cohorts.

[112] Tumors are considered positive for ER or PR if immunostaining is observed in more than 1% of tumor nuclei, as described previously. Tumors are considered positive for HER2 if immunostaining is scored as 3+ according to HercepTest criteria, with an amplification ratio for fluorescent in situ hybridization of 2.0 or more being the cut point that can be used to segregate immunohistochemistry equivocal tumors (scored as 2+) (Yaziji, et al. , JAMA, 291(16): 1972-1977 (2004)). Ki67 can be visually scored for percentage of tumor cell nuclei with positive immunostaining above the background level.

[113] Other methods can also be used to detect the Her2+ subtype. These techniques include ELISA, Western blots, Northern blots, or FACS analysis.

[114] Kits

[115] The present disclosure also describes kits useful for classifying breast cancer intrinsic subtypes and/or providing prognostic information to identify breast cancers that are more responsive to gemcitabine. These kits comprise a set of capture probes and/or primers specific for the intrinsic genes listed in Table 1 and can further include instructions for detecting the genes in Table 1 and classifying breast cancer intrinsic subtypes and/or providing prognostic information to identify breast cancers that are more responsive to gemcitabine. The kits may also contain reagents sufficient to facilitate detection and/or quantitation of Her2, in order to classify cells as Her2+. Preferably, the kit comprises a set of capture probes and/or primers specific for at least 10, at least 15, at least 20, at least 25 of the intrinsic genes or all 50 intrinsic genes listed in Table 1. The kit may further comprise a computer readable medium.

[116] In one embodiment of the present disclosure, the capture probes are immobilized on an array. By "array" is intended a solid support or a substrate with peptide or nucleic acid probes attached to the support or substrate. Arrays typically comprise a plurality of different capture probes that are coupled to a surface of a substrate in different, known locations. The arrays of the disclosure comprise a substrate having a plurality of capture probes that can specifically bind an intrinsic gene expression product. The number of capture probes on the substrate varies with the purpose for which the array is intended. The arrays may be low- density arrays or high-density arrays and may contain 4 or more, 8 or more, 12 or more, 16 or more, 32 or more addresses, but will minimally comprise capture probes for at least 10, at least 15, at least 20, at least 25 of the intrinsic genes or all 50 intrinsic genes listed in Table 1.

[117] Techniques for the synthesis of these arrays using mechanical synthesis methods are described in, e.g. , U.S. Patent No. 5,384,261, incorporated herein by reference in its entirety for all purposes. The array may be fabricated on a surface of virtually any shape or even a multiplicity of surfaces. Arrays may be probes (e.g. , nucleic-acid binding probes) on beads, gels, polymeric surfaces, fibers such as fiber optics, glass or any other appropriate substrate, see U.S. Pat. Nos. 5,770,358, 5,789,162, 5,708,153, 6,040,193 and 5,800,992, each of which is hereby incorporated in its entirety for all purposes. Arrays may be packaged in such a manner as to allow for diagnostics or other manipulation on the device. See, for example, U.S. Pat. Nos. 5,856,174 and 5,922,591 herein incorporated by reference.

[118] In another embodiment, the kit comprises a set of oligonucleotide primers sufficient for the detection and/or quantitation of each of the intrinsic genes listed in Table 1.

Preferably, the kit comprises a set of oligonucleotide primers sufficient for the detection and/or quantitation of at least 10, at least 15, at least 20, at least 25 of the intrinsic genes or all 50 intrinsic genes listed in Table 1. The oligonucleotide primers may be provided in a lyophilized or reconstituted form, or may be provided as a set of nucleotide sequences. In one embodiment, the primers are provided in a microplate format, where each primer set occupies a well (or multiple wells, as in the case of replicates) in the microplate. The microplate may further comprise primers sufficient for the detection of one or more housekeeping genes as discussed infra. The kit may further comprise reagents and instructions sufficient for the amplification of expression products from the genes listed in Table 1.

[119] In order to facilitate ready access, e.g. , for comparison, review, recovery, and/or modification, the molecular signatures/expression profiles are typically recorded in a database. Most typically, the database is a relational database accessible by a computational device, although other formats, e.g. , manually accessible indexed files of expression profiles as photographs, analogue or digital imaging readouts, spreadsheets, etc. can be used.

Regardless of whether the expression patterns initially recorded are analog or digital in nature, the expression patterns, expression profiles (collective expression patterns), and molecular signatures (correlated expression patterns) are stored digitally and accessed via a database. Typically, the database is compiled and maintained at a central facility, with access being available locally and/or remotely.

[120] In certain embodiments, the kit also includes a substance that is used to find the expression level of Her-2. This substance can be an antibody or a nucleic acid probe. These substances can be used to detect Her-2 using FISH, IHC, ELISA, Western blots, Northern blots, or FACS analysis. Optionally, the kit also includes reagents that allows for the detection of the detecting substance and the quantitation of Her-2 expression in a sample.

EXAMPLES

Example 1. Classification of tumors using PAM50

[121] The patient study cohort

[122] The current study is based upon a patient cohort enrolled in a randomized trial comparing the efficacy of single agent docetaxel (D) versus gemcitabine plus docetaxel (GD) in 337 women with locally advanced or metastatic disease (3). Patients were randomly assigned to docetaxel (100 mg/m 2 ) day 1, every 21 days or gemcitabine (1000 mg/m 2 ) days 1 and 8 plus docetaxel (75 mg/m 2 ) day 8. Patients were either previously untreated, had prior anthracycline -based (neo)adjuvant chemotherapy or had received a single prior

anthracyc line -based chemotherapy regimen for metastatic breast cancer. The Danish Breast Cancer Cooperative Group (DBCG) prepared the original protocol as well as the biomarker supplement, and the Danish National Committee on Biomedical Research Ethics has approved the original protocol as well as the add-on (KF 02-045-01 and H-KF-02-045-01) before their activation.

[123] Macro-dissection and RNA isolation

[124] Hematoxylin and eosin stained sections from archival formalin-fixed, paraffin- embedded (FFPE) primary breast tumor tissue were reviewed by a biologist (CLTJ) under supervision of a pathologist (TON). Areas containing representative invasive breast carcinoma were outlined on the slide. Depending on the tumor surface area, 1-6 unstained tissue sections of 10-15 μιη thickness were mounted on positively charged glass microscope slides and baked overnight at 45 °C. The unstained tissue sections were deparaffinized with CitroSolv, rinsed in ethanol and left to dry. The tissue was rehydrated with 3% glycerol, before manual macro-dissection to remove the surrounding normal tissue outside the outlined area.

[125] Total RNA was extracted using the High Pure RNA Paraffin Kit (Roche Applied Science, Indianapolis IN, cat# 03270289001), according to the manufacturer's protocol. RNA yield and purity were assessed using the NanoDrop ND-1000 Spectrophotometer (NanoDrop Technologies, Rockland, DE). RNA samples used in downstream analysis met prespecified quality criteria of an initial concentration of total RNA > 12.5 ng/μΐ, a minimum total yield of 250ng, and a purity ratio in the range 1.7-2.5.

[126] The PAM50 nCounter System assay

[127] Gene expression was measured on the NanoString nCounter Analysis System which delivers direct, multiplexed measurements through digital readouts of the relative abundance of hundreds of mRNA transcripts. In brief, the expression of the fifty target genes of Table 1 (PAM50) as well as normalizing "housekeeping" genes (for example MRPL19, PSMC4, SF3A1, PUM1, ACTB, GAPDH, GUSB, RPLPO, and TFRC) was measured in a single hybridization reaction without the use of any enzymatic reactions. An nCounter CodeSet with gene-specific probe-pairs to the PAM50 targets as well as exogenous positive and negative controls was hybridized in solution to 125-500ng total RNA (nominally 250ng). After overnight hybridization, the samples were processed using the NanoString nCounter Prep Station and Digital Analyzer according to the instructions and kits provided by NanoString Technologies. Data from each sample were qualified using prospectively defined quality control metrics for the positive and negative controls included in each reaction.

[128] Intrinsic subtype classification of qualified patient samples was based upon the PAM50 gene expression signature. Reporter-code-count files, containing the digital abundance or "counts" of each target mRNA molecule for every sample, were sent to NanoString Technologies for PAM50 subtype calling using a prospectively defined and locked proprietary algorithm. Assignment of subtypes was performed in a blinded fashion, by researchers with no access to information regarding the clinical parameters or outcomes. [129] Results

[130] The original trial of GD versus D recruited 337 participants; archival tumor tissue was available from 273 (81%) patients (CONSORT diagram, Figure 1). In the CONSORT diagram, patients were withdrawn for one of the following reasons: archival tissue not available (n=38), no tumor cells in available samples (n=12), only needle biopsies available/tissue unsuited for PAM50 (n=l l), tissue samples available of metastasis only (n=3).

[131] The assessable 270 patients differed from the 67 non-assessable patients (P<.05) with regard to prior (neo)adjuvant chemotherapy, adjuvant hormonal therapy, and adjuvant radiotherapy, but not for other assessed parameters (Table 5). These differences are considered reflections of a higher number of locally advanced cases in the excluded cohort. Primary tumor samples from locally advanced patients were in general more often either unavailable or had insufficient tissue for subtype analysis (i.e. needle biopsy only).

[132] Sufficient high quality RNA was obtained from the 270 patients allowing accurate estimation of the PAM50 algorithm. Based on the nearest PAM50 centroid algorithm, intrinsic breast cancer subtypes were assigned using gene expression as follows: 84 samples (31.1%) were luminal A, 97 samples (35.9%) luminal B, 43 (15.9%) basal-like, and 46 (17.1%) HER2-enriched. Patient and baseline characteristics of the 270 cases according to intrinsic subtypes are summarized in Table 6.

[133] Statistical Considerations

[134] The association between PAM50 subtypes and prognostic and demographic variables of the main study was assessed (Nielsen et al., JCO 2011 ; 29:4748-4754). Associations between PAM50 subtypes and categorical variables (regimen, hormone receptor status, HER2 status, type of metastatic site, stage of disease, and previous chemo-, hormonal-, and radiotherapy) were evaluated by Fisher's exact test, while associations between PAM50 subtypes and ordinal and interval variables (WHO performance status, age at randomization, number of metastatic sites, and disease-free interval) were evaluated by the Kruskal-Wallis test.

[135] Time to progression (TTP) was the primary endpoint for the original trial as well as this biomarker sub-study (Nielsen et al., JCO 2011; 29:4748-4754). Overall survival (OS) and response rate (RR) were secondary endpoints. TTP was measured from random assignment to date of documented progression with censoring at date of last visit or of death. OS was calculated from date of random assignment to date of death with censoring for surviving patients at last visit date. Time-to-event endpoints (TTP and OS) were estimated by the Kaplan-Meier method, and PAM50 subtypes were compared using the log-rank test. Analyses of PAM50 subtypes were done unadjusted and adjusted for preselected covariates in multivariate Cox proportional hazards models. The preselected covariates were those found to be significant in the previous analysis of the main study (Nielsen et al., JCO 2011 ; 29:4748-4754): regimen, disease type, and stage of disease, or were included due to their molecular association with PAM50 subtypes: hormone receptor status (positive/unknown vs. negative) and HER2 status (amplified vs. normal/deleted/unknown). The adjusted model was further stratified for previous chemotherapy (Nielsen et al., JCO 2011; 29:4748-4754). The assumption of proportional hazards was assessed by Schoenfeld residuals.

[136] Analyses were done to assess whether treatment effects on TTP and OS varied according to PAM50 subtypes or the levels of preselected variables. The multivariate Cox proportional hazards model was extended by one interaction term at a time. The interaction terms were tested using the Wald test and results were given in a Forest plot. RR was evaluated for patients with measurable disease. The overall RR was defined as a complete or partial response according to RECIST criteria, version 1.0. RRs were compared by using Fisher' s exact test.

[137] Statistical analyses were conducted using the SAS System (version 9.2). All statistical tests are two sided, and P<.05 was considered statistically significant. Results of this study are presented according to reporting recommendations for tumor marker prognostic studies (McShane et al., Breast Cancer Res Treat 2006; 100:229-235). The design of the study is prospective-retrospective as described in Simon et al (JNCI commentaries 2009; 101 : 1446- 1452).

[138] Results

[139] Recurrence patterns were significantly different between molecular subtypes. Visceral metastasis was more common in luminal B and HER2-enriched subtypes, and non- visceral metastasis more frequent in luminal A and basal-like subtypes. The luminal B and HER2- enriched showed a roughly similar pattern in terms of preferred sites for systemic relapse, however, luminal cases presented more often with bone metastases compared to both basal- like and HER2-enriched subtypes. Less frequently the luminal A subtype metastasized to lung, whereas metastases in the liver were less observed in the basal-like patients, however not statistically significant.

[140] Median disease-free (MDF) interval (time interval from diagnosis of primary cancer to recurrence) differed significantly between subtypes (P<.001), with the luminal A and B subtypes demonstrating the longest MDF interval (45 and 37 months respectively), compared to the HER2-enriched and basal-like groups who had significantly shorter MDF intervals (20 and 15 months respectively).

[141] Intrinsic subtypes and univariate analysis

[142] In Kaplan-Meier analyses, the intrinsic biological subtypes were significantly associated with TTP (P=.0006) and OS (P= .0083) (Figures 2A and 2B, respectively). Those assigned a luminal A subtype by the PAM50 assay had a significantly better outcome in terms of median time to progression and overall survival in months (median TTP: 12.8, 95% CI, 10.7-16.9; median OS: 24.0, 95% CI, 19.4-29.6) than luminal B (median TTP: 9.2, 95% CI, 7.3-11.2; median OS: 18.1, 95% CI, 15.9-22.2), HER2-enriched (median TTP: 8.2, 95% CI, 6.1-11.8; median OS: 17.6, 95% CI, 14.5-22.0), or basal-like tumors (median TTP: 6.2, 95% CI, 4.1-8.2; median OS: 12.4, 95% CI, 8.6-17.6).

[143] The Cox univariate proportional hazards model for TTP and OS (Table 7) confirmed this result (TTP, P=.0008; OS, P=.009).

[144] Furthermore, a significant difference in outcome was evident when comparing the luminal A subtype versus non-luminal A subtypes (TTP, HR, 0.56; 95% CI, 0.40-0.79; P= .001; OS, HR, 0.71; 95% CI, 0.54-0.94; P= .02), and the basal-like versus the non-basal-like subtypes (TTP, HR, 1.80; 95% CI, 1.23-2.64; P= .003; OS, HR, 1.65; 95% CI, 1.18-2.31 ; P=.004).

[145] Multivariate analysis

[146] To test the independent value of PAM50 subtyping against standard clinical and pathologic factors multivariable Cox models were constructed. The intrinsic biological subtype remained a significant independent prognostic factor for both TTP and OS (Table 8).

[147] The treatment effect was similar to the effect observed in the original study (HR=0.68 for TTP, HR=0.94 for OS) (3), with an HR favoring GD for TTP (adjusted HR 0.57, p=0.0007) but not for OS (adjusted HR 0.81, P=.13).

[148] Interaction tests for treatment effect on TTP and OS

[149] In multivariate Cox regression analyses, heterogeneity of treatment according to HER2 status and PAM50 intrinsic subtype was further examined. TTP seemed equally improved in PAM50 intrinsic subtypes (Figure 3A), while a significant interaction was observed between HER2 status and chemotherapy regimen (Wald's test, P=.0019). In contrast, for OS a significant heterogeneity was observed according to PAM50 subtype

(Figure 3B; P=.0008). Among patients with basal-like breast cancer, GD significantly improved OS, whereas the addition of gemcitabine significantly worsened OS among patients with a HER2-enriched subtype. In this model a significant interaction was furthermore observed between HER2 status and chemotherapy regimen (P=.019). Thus, PAM50 intrinsic subtype classification was a highly significant predictor of overall survival by treatment arm (P = 0.0016). Patients with a basal-like subtype had a 71% relative reduction in mortality from the addition of gemcitabine to docetaxel compared to docetaxel alone (Figures 3A and 3B).

[150] Kaplan-Meier estimates revealed a gain in median overall survival of 10 months for the basal-like patients in the doublet arm compared to the monotherapy arm, hence reaching the same level of median overall survival as the non-basal-like patients (Figure 4). A similar significant reduction in time to progression events was not demonstrated. No support was found for a more general benefit from adding gemcitabine to docetaxel in patients with highly proliferative subtypes (non-luminal A).

[151] Intrinsic subtypes and response rate

[152] Overall RR (complete response plus partial response) among patients with measurable disease (n=168) did not differ significantly among the four subtypes (luminal A 37.5%, luminal B 42.0%, basal-like 24.1%, HER2-enriched 43.3%; P=.36; Table 9), nor between the basal-like versus non-basal-like (P=.10) nor luminal A versus non-luminal A (P=1.00) pre- specified subtype groupings.

[153] Discussion

[154] Disease segmentation into breast cancer intrinsic subtypes can offer insight into personalized treatment. Thus, to test the hypothesis that molecular subtypes differ in their response to therapeutic agents, the relationship between molecular subtypes classified by the PAM50 assay and the effect of gemcitabine was evaluated, in patients with available tumor blocks enrolled in a randomized trial of docetaxel alone versus gemcitabine and docetaxel doublet for advanced breast cancer. Although the clinical trial, when analyzed as a whole, failed to demonstrate any clinically meaningful difference between the docetaxel versus gemcitabine plus docetaxel arms, the present invention demonstrates that when assessed by subtype, wider differences in TTP and OS between the two treatment arms are found. By PAM50 intrinsic subtype classification, in patients with a basal-like subtype, a 73% relative reduction in mortality from the addition of gemcitabine to docetaxel compared to docetaxel alone was demonstrated. In contrast, patients with non-basal-like subtypes had no significant incremental survival benefit from gemcitabine plus docetaxel compared with docetaxel monotherapy. The test for interaction between basal-like subtype and addition of gemcitabine was highly significant for OS A similar trend was observed for TTP with a relative 63% reduction for patients with basal-like and a 37% reduction for patients with other subtypes, although this difference was not statistically significant 19). No support was found for a more general benefit from adding gemcitabine to docetaxel in patients with highly proliferative subtypes (non-luminal A). An unexpected finding among patients with HER2 amplified tumors was a higher risk of TTP events and mortality in the doublet arm compared to single agent docetaxel. A similar trend was noticed for patients with a HER2-enriched subtype by PAM50.

[155] This study furthermore ascertains intrinsic molecular subtypes among primary tumors from patients who went on to have advanced breast cancer. All subtypes were represented and as expected luminal subtypes were the most frequent (67%), though in contrast to most published literature the luminal B subtype was more common than luminal A (33-36).

Luminal B subtype is associated with an higher risk of recurrence compared to luminal A and this may explain a higher frequency of luminal B in patients with advanced breast cancer compared to other published series of patients with early breast cancer. Nevertheless a significant proportion of patients with recurrent disease had a luminal A subtype in their primary tumor.

[156] In agreement with previous studies PAM50 intrinsic subtypes were associated with significant differences in the timing of distant recurrences. Recent studies described site- specific recurrence patterns according to subtypes supporting previous publications suggesting distinct patterns of metastatic spread and survival. This study supports a distinct metastatic pattern by PAM50 intrinsic subtypes as well as supports that subtype in addition influences survival after relapse.

[157] In summary, this retrospective subtype analysis applied to a prospective clinical trial demonstrates that subtype classification reveals predictive capacity not evident in an unselected patient cohort. A more substantial reduction in mortality was demonstrated by gemcitabine and docetaxel compared to docetaxel in patients with basal-like tumors.

However, a similar significant reduction in TTP events was not evident.

Abbreviations: HR: Hazard ratio. * ChiSq=16.68, df=3, P = 0.0008. b ChiSq=11.51, df=3, P = 0.009.

Abbreviations: D: Docetaxel; GD: gemcitabine plus docetaxel; HR: Hazard ratio.

* Models stratified for previous chemotherapy (none, n=79; adjuvant, n=88; locally advanced or metastatic, n=103).

** Estimate in multivariable models adjusted for the effects of regimen, hormone receptor status, HER2 status, visceral disease, and stage of disease and stratified for previous chemotherapy.

Measurable disease (n = 168).

b Total responses, Fishers exact test P = .36.

c Total responses, Luminal A v non-Luminal A, Fishers exact test P d Total responses, Basal-like v non-Basal-like, Fishers exact test P =