Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
ANALYTICAL STANDARDS AND METHODS OF USING SAME
Document Type and Number:
WIPO Patent Application WO/2017/192974
Kind Code:
A1
Abstract:
Analytical standards can allow one to detect and/or measure sampling, processing, and/or amplification errors in a sample that includes a plurality of polynucleotide molecules. The analytical standards can provide an internal control to detect errors in the representation of the original sample reflected in data obtained after manipulating and/or processing of sample molecules.

Inventors:
BECKMAN KENNETH (US)
GOHL DARYL (US)
Application Number:
PCT/US2017/031271
Publication Date:
November 09, 2017
Filing Date:
May 05, 2017
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV MINNESOTA (US)
BECKMAN KENNETH (US)
GOHL DARYL (US)
International Classes:
C12Q1/68
Foreign References:
US20150017652A12015-01-15
Other References:
SABAT ET AL.: "Selective and sensitive method for PCR amplification of Escherichia coli 16S rRNA genes in soil", APPL ENVIRON MICROBIOL, vol. 66, no. 2, February 2000 (2000-02-01), pages 844 - 849, XP055438379
AIRD ET AL.: "Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries", GENOME BIOL, vol. 12, no. 2, 21 February 2011 (2011-02-21), pages R18 1 - 14, XP055123443
See also references of EP 3452618A4
Attorney, Agent or Firm:
GRAM, Christopher, D. (US)
Download PDF:
Claims:
What is claimed is:

1. A method for measuring and correcting amplification bias in a sample comprising a plurality of polynucleotides, the method comprising:

obtaining a sample comprising at least a first sample polynucleotide and a second sample polynucleotide;

spiking the sample with at least one synthetic standard designed to detect amplification bias between two sample polynucleotides;

amplifying polynucleotides in the spiked sample;

sequencing a first sample polynucleotide, a second sample polynucleotide, and the standard;

measuring the frequency of occurrence of the synthetic standard polynucleotide, the first sample polynucleotide, and the second sample polynucleotide;

comparing the measured occurrence of the synthetic standard polynucleotide to an expected frequency of occurrence of the synthetic standard polynucleotide, thereby generating a synthetic standard polynucleotide bias value; and

correcting the frequency of occurrence of the first sample polynucleotide and the second sample polynucleotide according to the synthetic standard polynucleotide bias value. 2. The method of claim 1 wherein the synthetic standard polynucleotide comprises 16S rRNA gene nucleotides.

3. The method of claim 1 or claim 2 wherein the synthetic standard polynucleotide comprises a plurality of different synthetic standard polynucleotides.

4. The method of claim 3 wherein the different synthetic standard polynucleotides comprise differences designed to detect different biases in amplification.

5. The method of claim 3 or claim 4 wherein a first synthetic standard polynucleotide and a second synthetic standard polynucleotide differ in G-C content, secondary structure, amplicon size, or degree of mismatch to a primer sequence.

6. The method of claim any preceding claim wherein the synthetic standard polynucleotide comprises a primer editing standard. 7. The method of any preceding claim wherein the synthetic standard polynucleotide comprises a polynucleotide obtained from a biological standard organism that is added to the sample.

8. The method of any preceding claim wherein the synthetic standard polynucleotide comprises a circular polynucleotide.

9. The method of any preceding claim wherein the synthetic standard polynucleotide is spiked into a sample at a defined level in order to measure the absolute or relative abundance of polynucleotides in the sample.

10. The method of any preceding claim wherein a plurality of synthetic standard

polynucleotides are spiked into a sample at a plurality of defined concentrations in order to measure a limit of detection. 11. The method of claim 9 or claim 10 wherein the synthetic standard polynucleotide is spiked into the sample at a standard polynucleotide: sample polynucleotide ratio of at least 1 : 10,000 and no more than 100: 1.

12. The method of claim 11 wherein the synthetic standard polynucleotide is spiked into the sample at a standard polynucleotide: sample polynucleotide ratio of at least 1 :3 and no more than

3 : 1.

13. The method of claim 9 or claim 10 wherein the synthetic standard polynucleotide is spiked into the sample in an amount of from one molecule to 100,000 molecules.

14. The method of any preceding claim wherein amplifying the polynucleotides comprises using a single set of primers.

15. The method of any preceding claim wherein the synthetic standard polynucleotide comprises a feature allowing PCR-free quantitation of the synthetic standard.

16. The method of claim 15 wherein the feature allowing PCR-free quantitation of the synthetic standard comprises a barcode. 17. The method of any preceding claim wherein:

the first sample polynucleotide comprises a polynucleotide from a first microbe; and the second sample polynucleotide comprises a polynucleotide from a second microbe.

18. The method of claim 17 wherein the first microbe and the second microbe are members of a microbiome sample.

19. A method of determining amplification bias among a plurality of polynucleotides, the method comprising:

amplifying a plurality of polynucleotides in a sample, the sample comprising:

a first sample polynucleotide;

a second sample polynucleotide; and

a plurality of synthetic standard polynucleotides comprising:

a first synthetic standard polynucleotide; and

a second synthetic standard polynucleotide that differs from the first synthetic standard polynucleotide in G-C content, secondary structure, amplicon size, or degree of mismatch to a primer sequence;

sequencing the first sample polynucleotide, the second sample polynucleotide, and the plurality of synthetic standard polynucleotides;

measuring the frequency of occurrence of the first sample polynucleotide, the second sample polynucleotide, the first synthetic standard polynucleotide, and the second synthetic standard polynucleotide; comparing the measured occurrence of the first synthetic standard polynucleotide with an expected frequency of occurrence of the first synthetic standard polynucleotide, thereby generating a first synthetic standard value;

comparing the measured occurrence of the second synthetic standard polynucleotide with an expected frequency of occurrence of the second synthetic standard polynucleotide, thereby generating a second synthetic standard value; and

detecting amplification bias if the first synthetic standard value differs from the second synthetic standard value. 20. The method of claim 19 wherein the plurality of synthetic standard polynucleotides are provided at a plurality of concentrations; and

failing to detect a synthetic standard defines a limit of detection.

21. A method of measuring abundance values among a plurality of polynucleotides, the method comprising:

obtaining a plurality of polynucleotides in a sample, the sample comprising:

a first synthetic polynucleotide comprising a first PCR-free quantitation tag, the first PCR-free quantitation tag positioned between cut sites of a restriction enzyme; and

a second synthetic polynucleotide comprising a second PCR-free quantitation tag, the second PCR-free quantitation tag positioned between cut sites of a restriction enzyme;

digesting the first synthetic polynucleotide to liberate the first PCR-free quantitation tag; digesting the second synthetic polynucleotide to liberate the second PCR-free

quantitation tag;

sequencing the first PCR-free quantitation tag and the second PCR-free quantitation tag; and

measuring the abundance of the first PCR-free quantitation tag and the second PCR-free quantitation tag.

22. The method of claim 21 further comprising comparing the abundance of the first PCR- free quantitation tag and the abundance of the second PCR-free quantitation tag; and determining the relative abundance of the first synthetic polynucleotide and the second synthetic polynucleotide.

23. The method of claim 21 or claim 22 wherein the first PCR-free quantitation tag is positioned between cut sites of the same restriction enzyme as the restriction enzyme whose cut sites the second PCR-free quantitation tag is positioned.

24. The method of claim 21 or claim 22 wherein:

the first PCR-free quantitation tag is positioned between cut sites of a first restriction enzyme; and

the second PCR-free quantitation tag is positioned between cut sites of a second restriction enzyme that differs from the first restriction enzyme.

25. A method for detecting sub-sampling error in a sample comprising a plurality of polynucleotides, the method comprising:

obtaining a sample comprising at least a first sample polynucleotide and a second sample polynucleotide;

spiking the sample with at least one synthetic diversity standard designed to detect sub- sampling error;

amplifying polynucleotides in the spiked sample;

sequencing a first sample polynucleotide, a second sample polynucleotide, and at least one synthetic diversity standard;

measuring the diversity of unique sequences of the synthetic diversity standard polynucleotide;

comparing the diversity of unique sequences of the synthetic diversity standard polynucleotide to an expected diversity of unique sequences of the synthetic diversity standard polynucleotide; and

detecting and quantifying sub-sampling error in the sample if the measured diversity of the synthetic diversity standard is less than the expected diversity of the synthetic diversity standard polynucleotide.

26. The method of claim 25 wherein the at least one synthetic diversity standard comprises a set of synthetic diversity standards.

Description:
ANALYTICAL STANDARDS AND METHODS OF USING SAME

CROSS-REFERENCE TO RELATED APPLICATION

This application claims priority to U.S. Provisional Patent Application No. 62/332,879, filed May 6, 2016, which is incorporated herein by reference.

SEQUENCE LISTING

This application contains a Sequence Listing electronically submitted via EFS-Web to the United States Patent and Trademark Office as an ASCII text file entitled "2017-05-05- SequenceListing_ST25.txt" having a size of 313 kilobytes and created on May 5, 2017. The information contained in the Sequence Listing is incorporated by reference herein.

SUMMARY

This disclosure describes, generally, analytical standards that allow one to detect and/or measure sampling, processing, and/or amplification errors in a sample that includes a plurality of polynucleotide molecules.

In one aspect, this disclosure describes a method for measuring and correcting amplification bias in a sample. Generally, the method includes obtaining that includes polynucleotide molecules; spiking the sample with at least one synthetic standard that detects amplification bias between two sample polynucleotides; amplifying polynucleotides in the spiked sample; sequencing a first sample polynucleotide, a second sample polynucleotide, and the standard; measuring the frequency of occurrence of the synthetic standard polynucleotide, the first sample polynucleotide, and the second sample polynucleotide; comparing the measured occurrence of the synthetic standard polynucleotide to an expected frequency of occurrence of the synthetic standard polynucleotide, thereby generating a synthetic standard polynucleotide bias value; and correcting the frequency of occurrence of the first sample polynucleotide and the second sample polynucleotide according to the synthetic standard polynucleotide bias value.

In another aspect, this disclosure describes another method of determining amplification bias among a plurality of polynucleotides. Generally, the method includes amplifying a plurality of polynucleotides in a sample that includes a first sample polynucleotide, a second sample polynucleotide, and a plurality of synthetic standard polynucleotides, wherein the plurality of synthetic standards include a first synthetic standard polynucleotide and a second synthetic standard polynucleotide that differs from the first synthetic standard polynucleotide in G-C content, secondary structure, amplicon size, or degree of mismatch to a primer sequence;

sequencing the first sample polynucleotide, the second sample polynucleotide, and the plurality of synthetic standard polynucleotides; measuring the frequency of occurrence of the first sample polynucleotide, the second sample polynucleotide, the first synthetic standard polynucleotide, and the second synthetic standard polynucleotide; comparing the measured occurrence of the first synthetic standard polynucleotide with an expected frequency of occurrence of the first synthetic standard polynucleotide, thereby generating a first synthetic standard value; comparing the measured occurrence of the second synthetic standard polynucleotide with an expected frequency of occurrence of the second synthetic standard polynucleotide, thereby generating a second synthetic standard value; and detecting amplification bias if the first synthetic standard value differs from the second synthetic standard value.

In another aspect, this disclosure describes another method of determining amplification bias among a plurality of polynucleotides. Generally, the method includes amplifying a plurality of polynucleotides in a sample that includes a first synthetic polynucleotide having a first PCR- free quantitation tag and a second synthetic polynucleotide comprising a second PCR-free quantitation tag; digesting the first synthetic polynucleotide to liberate the first PCR-free quantitation tag; digesting the second synthetic polynucleotide to liberate the second PCR-free quantitation tag; sequencing the first PCR-free quantitation tag and the second PCR-free quantitation tag; and measuring the abundance of the first PCR-free quantitation tag and the second PCR-free quantitation tag.

In another aspect, this disclosure describes a method for detecting sub-sampling error in a sample that includes a plurality of polynucleotides. Generally, the method includes obtaining a sample that includes at least a first sample polynucleotide and a second sample polynucleotide; spiking the sample with at least one synthetic diversity standard designed to detect sub-sampling error; amplifying polynucleotides in the spiked sample; sequencing a first sample polynucleotide, a second sample polynucleotide, and the at least one synthetic diversity standard; measuring the frequency of occurrence of the synthetic diversity standard polynucleotide; comparing the measured occurrence of the synthetic diversity standard polynucleotide to an expected frequency of occurrence of the synthetic diversity standard polynucleotide; and detecting sub-sampling error in the sample if the measured occurrence of the synthetic diversity standard is less than the expected frequency of occurrence of the synthetic diversity standard polynucleotide.

In various embodiments of the various methods summarized above, the synthetic standard polynucleotide can include 16S rRNA gene nucleotides.

In various embodiments of the various methods summarized above, the synthetic standard polynucleotide can include a plurality of different synthetic standard polynucleotides. In some of these embodiments, the different synthetic standard polynucleotides can include differences designed to detect different biases in amplification. For example, a first synthetic standard polynucleotide and a second synthetic standard polynucleotide can differ in G-C content, secondary structure, amplicon size, or degree of mismatch to a primer sequence.

In various embodiments of the various methods summarized above, the synthetic standard polynucleotide can include a primer editing standard.

In various embodiments of the various methods summarized above, the synthetic standard polynucleotide can include a polynucleotide obtained from a biological standard organism that is added to the sample.

In various embodiments of the various methods summarized above, the synthetic standard polynucleotide can include a circular polynucleotide.

In various embodiments of the various methods summarized above, the synthetic standard polynucleotide is spiked into a sample at a defined level in order to measure the absolute or relative abundance of polynucleotides in the sample.

In various embodiments of the various methods summarized above, a plurality of synthetic standard polynucleotides are spiked into a sample at a plurality of defined

concentrations in order to measure a limit of detection.

In various embodiments of the various methods summarized above, amplifying the polynucleotides can include using a single set of primers.

In various embodiments of the various methods summarized above, the synthetic standard polynucleotide can include a feature allowing PCR-free quantitation of the synthetic standard. For example, the feature allowing PCR-free quantitation of the synthetic standard can include a barcode. In various embodiments of the various methods summarized above, the first sample polynucleotide can be from a first microbe and the second sample polynucleotide can be from a second microbe.

The above summary of the present invention is not intended to describe each disclosed embodiment or every implementation of the present invention. The description that follows more particularly exemplifies illustrative embodiments. In several places throughout the application, guidance is provided through lists of examples, which examples can be used in various combinations. In each instance, the recited list serves only as a representative group and should not be interpreted as an exclusive list.

BRIEF DESCRIPTION OF THE FIGURES FIG. 1. Design of an exemplary standard construct containing a standard molecule, homing endocuclease I-Scel site for plasmid linearization, and /yl-flanked PCR-free Illumina sequencing cassette for sequencing-based quantification of standards and standard pools.

FIG. 2. Design of the 16S rRNA gene V4 515F region primer editing standards (SEQ ID

NOs:288-318). Primer mismatches are shaded.

FIG. 3. Introducing a phosphorothiol bond (ITS* primer set) can reduce the formation of adapter dimers (an undesired side-effect of the primer editing process), indicating that the extent of primer editing can be modulated using phosphorothiol modifications.

FIG. 4. Digestion with /yl can liberate a 148 bp molecule containing the quantification barcodes.

FIG. 5. Illumina-adapter flanked /yl-liberated quantification barcodes can be directly sequenced and used to quantify the abundance of the standard molecules in a complex pool.

FIG. 6. Percentages of reads corresponding to the spike-in synthetic standards in mock community sequencing datasets correlate well with expected (targeted) percentages.

FIG. 7. Abundance analyses of an exemplary mock community. Top: Uncorrected (raw) abundance measurements for the HMP even mock community (HM-276D) amplified using either KAPA HiFi (left) or Taq (right). Bottom: Abundance measurements with linearized V4 synthetic standard-based correction factors applied. FIG. 8. Amount of P. acnes reads observed in the mock community data (left bar) or the linearized V4 synthetic standard data (right bar) for three different concentrations of synthetic standard spike-ins.

FIG. 9. Comparison of uncorrected and corrected data for the HMP even mock community (HM-276D) using either circular (uncut) or linearized V4 synthetic standard-based correction factors. Black boxes indicate cases where linearized standards "overcorrect" samples.

FIG. 10. Aggregate error (RMSD) measurements for the HMP even (HM-276D, left) or staggered (HM-277D, right) mock community with or without V4 synthetic standard-based correction using circularized (top) or linearized (bottom) standards.

FIG. 11. Comparison of uncorrected and corrected data using circular (top) or linear V4 synthetic standards (bottom), with (right) or without (left) shearing the template DNA.

FIG. 12. Plots showing the variance in accuracy of quantification of the HMP mock community when correction factors have been applied using circular (top) or linear (bottom) V4 synthetic standard reads subsampled down to various read depths.

FIG. 13. The circular standard molecules exhibit increased primer editing, detected here as the proportion of reads corresponding to P. acnes, a species whose detection depends upon primer editing.

FIG. 14. Linearized V4 synthetic standard-based correction factors

(KAPA.HM.276D.2500L and KAPA.HM.276D. lkb.2500L) inflate the estimation of the abundance of P. acnes (poor primer editing of the standards leads to a spuriously high correction factor), while circular V4 synthetic standard-based correction factors (KAPA.HM.276D.2500C and KAPA.HM.276D. lkb.2500C) improve quantitative accuracy for 5 , acnes.

FIG. 15. The absence of reads for a synthetic standard molecule can be used to flag drop out of taxa due to amplification artifacts (in this case, the failure of Taq polymerase to amplify P. acnes template).

FIG. 16. Primer editing by error-correcting polymerases allow recovery of organisms with mismatches to the amplification primers. (A) Alignment of the 16S rRNA gene V4 primer region of the Propionibacterium acnes 16S rRNA gene (SEQ ID NO:319 and SEQ ID NO:321) to the V4 515F (SEQ ID NO:320) and V4 806R primer (SEQ ID NO:322) sequences. Positions with mismatches to the V4 515F and V4 806R primers are shaded. (B) Percentage of reads mapped to Propionibacterium acnes in the indicated methods. Error bars are +/- SEM. *** p < 0.01, * p < 0.05 determined by ANOVA with Tukey HSD post-hoc test. (C) Mean percentage of edited bases in the V4 515F primer (SEQ ID NO:320) region in HM-276D even mock community data measured with the DI protocol with Q5 polymerase. Error bars are +/- SEM, n = 4. (D) Mean percentage of edited bases in the V4 806R primer (SEQ ID NO:322) region in HM- 276D even mock community data measured with the DI protocol with Q5 polymerase. Error bars are +/- SEM, n = 4. (E) Mean percentage of edited bases in the V4 515F primer (SEQ ID

NO:320) region in HM-276D even mock community data measured with the DI protocol with KAPA HiFi polymerase. Error bars are +/- SEM, n = 4. (F) Mean percentage of edited bases in the V4 806R primer (SEQ ID NO:322) region in HM-276D even mock community data measured with the DI protocol with KAPA HiFi polymerase. Error bars are +/- SEM, n = 4. (G) Mean percentage of edited bases in the V4 515F primer (SEQ ID NO:320) region in HM-276D even mock community data measured with the DI protocol with Taq polymerase. Error bars are +/- SEM, n = 4. (H) Mean percentage of edited bases in the V4 806R primer (SEQ ID NO:322) region in HM-276D even mock community data measured with the DI protocol with Taq polymerase. Error bars are +/- SEM, n = 4.

FIG. 17. The effect of KAPA HiFi enzyme concentration on accuracy, chimera formation, sample balance, and adapter dimer formation. Plots for the HM-276D even mock community at 5 different starting template concentrations amplified for 20, 25, 30, or 35 cycles using 0.25x, 0.5x, lx KAPA HiFi polymerase, or KAPA ReadyMix showing: (A-D) RMSD; (E- H) Percentage of chimeric reads; (I- J) Total number of reads; (M-P) Percentage of adapter dimers.

FIG. 18. The effect of KAPA HiFi enzyme concentration primer editing efficiency and the occurrence of primer editing artifacts. (A) Distribution of edited bases in the V4 515F primer (SEQ ID NO:320) region in data from a pure isolate of Campylobacter jejuni measured with the DI protocol with KAPA ReadyMix. (B) Distribution of edited bases in the V4 806R primer (SEQ ID NO:322) region in data from a pure isolate of Campylobacter jejuni measured with the DI protocol with KAPA ReadyMix. (C) Schematic of 16S V3-V5 amplification from a pure isolate of Campylobacter jejuni. This amplicon contains the V4 515F primer sequence, allowing assessment of the endogenous sequence. (D) Percentage of each base observed at position 6 of the sequence

corresponding to the V4 515F primer sequence in a V3-V5 amplicon from a pure isolate of Campylobacter jejuni.

FIG. 19. Distribution of 16S rRNA gene V4 region %GC and predicted secondary structure (dG), black open circles, and designed process control standards, light filled circles. The black smear is the result of densely overlapping black open circles.

FIG. 20. Natural distribution of 16S rRNA gene V4 region sizes. Process control standards range from 193 bp to 313 bp.

FIG. 21. Analysis of reads using KAPA HiFi polymerase. (A-D) Proportion of P. acnes reads recovered with different concentrations of KAPA HiFi polymerase across a range of PCR cycle numbers and starting template concentrations. (E) Proportion of V4 515F primer (SEQ ID NO:320) bases that have been edited to guanine. (F) Proportion of V4 806R primer (SEQ ID NO:320) bases that have been edited to guanine. (SEQ ID NO:322)

FIG. 22. The design and use of diversity standards. (A) An example of the design of one diversity standard, in which a 10-bp sequence tag (SEQ ID NO:323) that is unique to the standard is flanked by a sequence region common to other diversity standards, to be used to recover the tag from a sample by, for example, PCR amplification). (B) The construction of a pool of diversity standards of complexity equal to 1000 tags, with each tag present at 0.1% of the total population. Tag sequences are shown for tags 1-3 and tag 1000 (SEQ ID NOs:323-326 illustrated) for purposes of illustration, but the entire pool contains 1000 unique tag sequences. (C) An illustration of the loss of sequence tag diversity experienced as a result of constricting the population size of molecules of the pool of diversity standards. (D) An illustration of the use of a diversity standard pool to detect and estimate a population bottleneck of a sample carried through several molecular steps. All that is required for such an estimation is a reliable measurement of the frequency of each tag in the final data.

FIG. 23. The design and use of diversity standard sets. (A) An example of the design of five diversity standard sets, each containing an equimolar pool of nine diversity standards (FIG. 22). Each diversity standard set is combined at a different concentration into a super pool of all five sets, such that the frequency of each tag in each set is known. For example, all tags in Set A are present at 10%, all tags in Set B are present at 1%, and so on. (B). An illustration of the loss of sequence tag diversity experienced as a result of constricting the population size of molecules of the pool of diversity standard sets. FIG. 24. Design and analysis of primer editing standards. (A) Design of V4 515F region of primer editing standards (SEQ ID NOs:288-318). Primer mismatches are shaded. (B) Primer editing standard (SEQ ID NO:288) used to demonstrate that many proofreading polymerases can support primer editing, though to varying extents. (C) Primer editing standard used to

demonstrate that there is little apparent sequence specificity to primer editing of template sequence (SEQ ID NO: 327). (D) Primer editing standards used to demonstrate that primer editing of template sequence (SEQ ID NO:327) by KAPA HiFi polymerase is concentration dependent. (E) Primer editing can result in edits that change a wildtype E. coli primer to match a mutant template (light curve) or a mutant primer to match a wildtype E. coli template (dark curve) with similar efficiency at a given primer position.

FIG. 25. Primer editing standards used to demonstrate that primer editing by KAPA HiFi polymerase can be blocked at a specified position by a single phosphorothiol linkage.

FIG. 26. Analysis of bias using biophysical standards and size standards. (A) Distribution of 16S rRNA gene V4 region %GC and predicted secondary structure (dG) based on sequences in the Greengenes database, black open circles, and designed biophysical standards, light filled circles. As in FIG. 19, the black smear is the result of densely overlapping black open circles. (B) Distribution of 16S rRNA gene V4 region sizes based on sequences in the Greengenes database. V4 size standards range from 193 bp to 313 bp. (C-D) Biophysical standards and size standards used to report on bias due to sequence properties when amplified with KAPA HiFi polymerase. (E-F) Biophysical standards and size standards used to report on bias due to sequence properties when amplified with Accuprime Taq polymerase.

FIG. 27. Analysis of PCR bias using a spike-in control. (A) A synthetic spike-in control is amplified together with a sample of unknown composition. The spike-in control is used to measure the PCR biases specific to different template molecules and used to generate a correction matrix, which is used to infer the starting abundances of different species in the sample. (B) Measured abundance of a mock community comprised of eight bacterial species (five of which, dark dots, are targeted by synthetic standards and three of which, light dots, are not) using the 16S V4 region. (C) Measured abundance the same mock community using the 16S VI -V3 regions. (D-E) Abundance measurements when correction factors from the synthetic spike-in standards are used to correct for amplification bias. Abundances for corrected taxa (dark dots) are closer to expected values (diagonal dashed line). FIG. 28. Design and analysis of PCR-free barcodes. (A) Design of a synthetic DNA standard plasmid incorporating a PCR-free quantification barcode construct. (B) Quantification of a plasmid standard pool using the PCR-free quantification barcodes. This same set of standards was re-pooled using this abundance information and the second pool was quantified using the PCR-free barcode quantification method. (C) Three independent technical replicates of the even re-pooled mixture measured with the PCR-free barcode quantification method, demonstrating that the measurements obtained with this technique are highly precise.

FIG. 29. Abundance analysis using PCR-free barcodes. (A-B) Comparison of

quantification of an evenly mixed plasmid standard pool using either the PCR-free barcode quantification method or different amounts of PCR cycles to prepare libraries for Illumina sequencing (dashed line indicated expected values). (C) Root mean squared deviation from expected values for the evenly mixed plasmid pool measured with PCR-free barcodes or with different numbers of PCR cycles. (D) Root mean squared deviation from expected values for the staggered mixture of plasmids measured with PCR-free barcodes or with different numbers of PCR cycles. (E) PCR-free barcode measurements correlate very well with droplet digital PCR data. Increased cycles of PCR result in decreased correlation of the measured abundances with droplet digital PCR data.

FIG. 30. PCR-free barcode constructs. (A) Tn5 transposon construct containing PCR-free barcode construct (SEQ ID NO:328) that has been used to generate a library of more than 13,000 insertion transposons in E. coli. (B) Testing the barcode diversity of the library by liberating the PCR-free barcode construct with /yl prior to transforming into E. coli (Pre-transposed library, black curve), and demonstrating that the PCR-free barcode construct can be liberated from the E. coli genome and sequenced directly (Transposed library, gray curve). (C) A strategy for using multiple enzymes to allow multiplexing of PCR-free quantification barcode constructs. (D) Test of specificity of clustering of an evenly pooled mixture of these four plasmids, cut with Mlyl. The fact that there is bleed-through of the other barcodes suggests that size selection may be required to improve specificity. (E) Concatamerized PCR-free barcode constructs can be cloned into a transposon or plasmid vector backbone to potentially allow multiplexing of PCR-free barcode measurements.

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS This disclosure describes several analytical standards for quantifying and correcting errors and biases in amplicon-based analyses that include an amplification component such as, for example, microbiome experiments and/or quantification experiments that have an

amplification component, such as Tn-Seq or pooled RNA interference or CRISPR-Cas9 screens.

The analytical standards used in a particular application can be synthetic nucleic acid standards or biological (recombinant organism-based) standards. The analytical standard can be a synthetic standard or a biological standard. A biological standard can be a recombinant organism that includes any type of synthetic standard sequence. The biological standard can further include an additional synthetic sequence designed specifically to permit one to measure the efficiency of extracting and recovering nucleic acids from the biological standard cells.

A synthetic standard can be a standalone reagent that is amplified in isolation, or it can be a "spike-in" standard that is added to a sample to monitor and/or control errors and biases that occur during the amplification and subsequent processing of the sample. For example, a synthetic spike-in standard can include modified 16S rRNA gene nucleotides that are designed to be spiked into amplification reactions. While discussed below in the context of an exemplary embodiment in which the synthetic standard includes modified 16S rRNA gene nucleotides, the synthetic standard can include nucleotides from any suitable marker gene such as, for example, 18S rRNA or internal transcribed spacer (ITS) for eukaryotes.

In some embodiments when used as a "spike-in" standard, the synthetic standard molecules may be added to a sample to provide ratio of standard polynucleotide to sample polynucleotide (standard polynucleotide: sample polynucleotide ratio) of, for example, from 1 : 10,000 to 100: 1. For example, the synthetic standard can be added to a sample to provide a minimum standard polynucleotide: sample polynucleotide ratio of at least 1 : 10,000, at least 1 :5,000, at least 1 : 1000, at least 1 :500, 1 : 100, at least 1 :50, at least 1 : 10, at least 1 :5, at least 1 : 1, at least 5: 1, at least 10: 1, or at least 50: 1. The synthetic standard can be added to a sample to provide a maximum standard polynucleotide: sample polynucleotide ratio of no more than 100: 1, no more than 50: 1, no more than 10: 1, no more than 5: 1, no more than 1 : 1, no more than 1 :5, no more than 1 : 10, or no more than 1 :50. The synthetic standard can be added to a sample to provide a standard polynucleotide: sample polynucleotide ratio defined by a range having as endpoints any minimum standard polynucleotide: sample polynucleotide ratio set forth above and any maximum standard polynucleotide: sample polynucleotide ratio set forth above that is greater than the minimum standard polynucleotide: sample polynucleotide ratio.

In other embodiments when used as a spike-in standard, the synthetic standard molecule (or molecules) may be added to a sample in an amount of from one molecule to 100,000 molecules. For example, the synthetic standard molecule (or molecules) may be provided in a minimum amount of at least one molecule, at least ten molecules, at least 100 molecules, at least 500 molecules, at least 1000 molecules, at least 5000 molecules, or at least 10,000 molecules. The synthetic standard molecule (or molecules) may be provided in a maximum amount of no more than 100,000 molecules, no more than 50,000 molecules, no more than 10,000 molecules, no more than 5000 molecules, no more than 1000 molecules, no more than 500 molecules, no more than 100 molecules, no more than 50 molecules, or no more than 10 molecules. The synthetic standard can be added to a sample to provide the synthetic standard molecule (or molecules) within a range having as endpoints any minimum amount of standard synthetic molecule (or molecules) set forth above and any maximum amount of synthetic standard molecules set forth above that is greater than the minimum amount of synthetic standard molecule (or molecules).

Regardless of whether a synthetic standard is designed to be a standalone reagent or a spike-in standard, a synthetic standard can be one or more of the following types of sequence- specific standard: a quantitative bias standard, a process standard, a primer editing standard, and/or a diversity standard. As used herein, a quantitation bias standard is designed to measure sequence-specific quantitative amplification errors and biases that can differentially affect the amplification efficiency of sequences from different biological species. As used herein, a process standard is designed to assess the effect of sequence characteristics on amplification bias. As used herein, a primer editing standard is designed to measure the occurrence and extent of primer editing by DNA polymerase during amplification. As used herein, a diversity standard is designed to measure bottlenecks in populations of molecules during laboratory processing.

Process standards can include a collection of molecules that vary systematically in many different sequence properties that can affect amplification. Exemplary properties that can affect amplification include, for example, GC content, secondary structure, amplicon size, and/or the extent of mismatches to primer sequences. Process standards can be designed to be run in parallel to experimental samples in order to detect systematic biases in the amplification process. Primer editing standards can include 16S rRNA gene nucleotide sequences that are modified to differ systematically in their primer binding sites and report on the efficacy of primer editing in the PCR reaction. Primer editing standards can be spiked into an amplification reaction. Again, while discussed below in the context of an exemplary embodiment in which the synthetic standard includes modified 16S rRNA gene nucleotides, the synthetic standard can include nucleotides from any suitable marker gene such as, for example, 18S rRNA gene or internal transcribed spacer (ITS) for eukaryotes.

Diversity standards can include a population of unique sequence tags at known concentrations in a mixture, such that these standards can be used to report on the absolute size (i.e., number of molecules) of a population of molecules, as well as constrictions ("bottlenecks") that occur in that population during its manipulation. If the population size (number of molecules) is reduced to a number that is below the number of diversity tags, the diversity of tags will be permanently reduced by the stochastic loss of some of the tags from the mixture. The likelihood of "drop-out" of tags will increase as the population size approaches the tag diversity.

Furthermore, diversity tag sets can be designed to permit the measurement of molecular population size across a broad range, by mixing such diversity tag sets across a range of relative concentration (e.g., two-fold dilutions in concentration for each set), such that the loss of diversity is observed first for sets at lower relative concentration.

When diversity standards or diversity standard sets are spiked into a sample that is subjected to serial manipulation, they permit the integrative assessment of population

"bottlenecking" during those manipulations by measuring the recovery of the diversity standards or diversity standard sets at a final point following the manipulation, for example, by next- generation sequencing.

Abundance standards can be a collection of molecules that are spiked into a sample to allow for absolute or relative quantification of sample template molecules.

Biological standards can be used to detect biases in extraction and can be spiked into samples prior to extraction to monitor the efficiency of DNA extraction from different types of microbes, including gram negative bacteria, gram positive bacteria, fungi, or other

microorganism. A biological standard can include one or more organisms with distinct membrane properties that are designed, for example, to include unique sequence tags that can be amplified and quantified. In these embodiments, the sequence tag can be, for example, an edited 16S rRNA gene polynucleotide or a distinct sequence. In other embodiments, such unique tags could be diversity standards or diversity standard sets designed to measure population sizes and bottlenecks in population size, allow for absolute or relative quantification, or to assess limits of detection. A biological standard may be replication-defective or otherwise inactivated so that they cannot be "re-grown" by a consumer when provided in a commercial analytical kit. In other instances, a biological standard can be replication competent and designed to report on bacterial growth that occurred in transit or storage of samples.

Various embodiments of the standards and methods described herein can provide one or more of the following properties. First, certain standards and methods can correct biases due to differences in amplification efficiency between different primer sets for known targets. The standards and methods can correct for biases due to amplicon properties using a single set of primers. Second, by incorporating more than one type of standard, certain standards and methods described herein allow one to measure and correct biases due to intrinsic biophysical properties of the template molecules and/or additional types of PCR artifact— e.g., such as drop out due to primer mismatches. Third, certain synthetic standards incorporate PCR-free quantification barcodes that allow for, for example, accurate quantification of the standard molecules. Fourth, diversity standards and standard sets allow for the detection and semi -quantitative measurement of artifacts introduced by bottlenecks in the molecular population size during sample processing.

The design of constructs for exemplary nucleotide standards is illustrated in FIG. 1. After synthesis, the standards may be cloned into a plasmid and transformed into a host cell (e.g., E. coli) for propagation. The constructs include (1) the standard sequence, (2) a restriction site for optionally linearizing the plasmid prior to amplification, and (3) a barcode. The standard sequence can include any one or more of the nucleotide standards summarized above— i.e., a synthetic amplification bias standard, process standard, primer editing standard, and/or diversity standard. In some embodiments, the restriction site can include the recognition sequence for a homing endonuclease such as, for example, I-Scel. When the restriction site is present, the site can be recognized by any suitable restriction endonuclease, so long as the recognition sequence for the restriction endonuclease is not present between the amplification primers.

In the embodiment illustrated in FIG. 1, the construct includes a 12 bp i7 barcode:

AATCAGTCTCGT (SEQ ID NO:7). The particular barcode sequence can be arbitrary. In addition, for embodiments that allow PCR-free quantification, the construct can include an additional barcode sequence that enables direct PCR-free quantification of the standard molecules. In some embodiments, the PCR-free quantification barcode can be, for example, a /yl-flanked Illumina adapter-tagged 20 bp barcode so that the standards can be directly quantified, without PCR amplification, using Illumina sequencing. The PCR-free quantification allows one to improve the accuracy of pools of the synthetic standards.

Conventional methods for quantifying the standard pools would be to perform quantitative PCR, which can introduce bias into the analysis and, therefore, can result in an inaccurate

concentration measurement. The exemplary embodiment illustrated in FIG. 1 includes a 148 bp barcode sequence, which is typically distinct for each synthetic standard, to be liberated by digestion with an appropriate endonuclease— e.g., Mlyl as illustrated in FIG. 1. This barcode can, however, be of any suitable length. The liberated molecule can be directly sequenced (e.g., using an Illumina sequencer) with no intervening library preparation or PCR. Synthetic Spike-in Standards

In some embodiments, a spike-in synthetic standard can include a nucleotide present in all organisms of the sample being subject to the analysis. For example, in some embodiments, the spike-in standard can include a nucleotide that encodes the V4 variable region of the 16S rRNA gene. Synthetic standard molecules were designed for a defined bacterial mock

community (made by the Human Microbiome Project) consisting of 20 different organisms present either in equal abundances (an "even mock community") or in varying abundances (a "staggered mock community"). Synthetic standards for each of the unique 16S-V4-encoding regions present in the genomes of the organisms that make up the mock communities were synthesized (see synthetic standards 01-23, below; SEQ ID NO:8 through SEQ ID NO:30). After synthesis, the standards were cloned into a plasmid and transformed into E. coli. The 16S V4 region (+ 20 bp on either side outside of primer sites) was modified to have "TCT" tag at an analogous position for each molecule present in HMP mock community. The modification was made at a highly-conserved position that was identified by aligning 500 16S genes from the Greengenes database using ClustalW. A highly-conserved site within a predicted stem-loop region was chosen to minimize any effects that the "TCT" insertion might have on secondary structure of the synthetic standard molecule. In addition, several molecules were designed to test whether the sequence composition or length of the tag added to the 16S V4 region affects amplification kinetics (testing the following 3 bp tag sequences "TTT", "TCA", "CCC", "GGG", and tags of 1, 2, 5, 7, and 10 bp; see synthetic standards 24-32, below; SEQ ID NO:31 through SEQ ID NO:39).

In other embodiments, the spike-in synthetic standard can include a full-length nucleotide present in all organisms being subject to analysis. Thus, in one embodiment, the synthetic spike- in standard can include a full-length 16S rRNA nucleotide sequence from each of the organisms present in the sample being analyzed. One can assess how closely related the molecules are within a species by, for example, calculating the pair-wise Hamming distances of both the full- length 16S rRNA coding sequence, as well as the V3-V6 variable coding regions. In the exemplary case of 16S rRNA, the Hamming distances indicated that within a single species, the 16S rRNA genes varied by less than 1%, which is typically used as a stringent cut-off for a sequence similarity in defining Operational Taxonomic Units (OTUs). Thus, standards can be designed based on one representative sequence per organism (e.g., a sequence with the lowest cumulative Hamming distance from all other 16S rRNA sequences from a given organism) as the basis for the full-length standards.

To use these full-length standards to assess the effect of the primary sequence or position of the 3 bp exogenous sequence tag on the ability of the standards to model the template-specific PCR biases, three different 3 bp tags, "TAG", "TCT", and "CAT", were inserted into highly- conserved segments of the V3, V4, and V5 regions, respectively (Synthetic standards 78-97, below; SEQ ID NO:85 through SEQ ID NO: 104). In addition to the tagged full-length synthetic standards targeting the HMP mock community organisms, another 25 tagged full-length synthetic standards for common human gut microbes were made (Synthetic standards 208-232, below, SEQ ID NO:215 through SEQ ID NO:239).

To test the efficacy of using synthetic standards to correct for amplification biases, even mock community DNA and staggered mock community DNA were amplified using a range of template concentrations and two different enzymes (KAPA HiFi and 5 PRIME Taq). Different amounts and different relative abundances of synthetic standard DNA were spiked into mock community samples (0, 25, 250, and 2500 standard molecules per organism). Samples were amplified using primers that amplify the 16S rRNA gene V4 region and also contain adapter tails. Following the primary amplification, the amplicons were diluted 1 : 100 in nuclease free water and amplified for an additional 10 cycles using indexing primers that target the adapter tails and add the flow cell adapters and indices required for Illumina sequencing. After the indexing PCR, the reactions were normalized using SequalPrep plates, pooled, and cleaned up and concentrated with 1.8X AmPure XP beads. The pool was then quantified with PicoGreen, diluted to 8 pM, and sequenced on a portion of a MiSeq 2 x 300 bp run.

After sequencing, the reads for each sample were split into two files using a custom script. One file contained the synthetic standard reads which were identified by the "TCT" tag that was added, and was mapped to a reference file containing the standard sequences. The other file contained the remaining reads and was mapped to a reference file containing the mock community sequences.

Based on the relative number of reads assigned to the standard file and mock community file, the relative concentration of spike-in molecules to mock community molecules was well targeted (FIG. 6). The ability to target this ratio correctly allows one to sequence the standard molecules deeply enough to accurately quantify the abundances of the standard molecules, but not so deeply that the standard reads swamp the sample being investigated.

In some applications, the standard molecules can be used to correct for amplification biases in the mock community data. Species-specific correction factors were generated based on the ratio of observed to expected standard molecules (expected values were measured above using the PCR-free quantification barcodes, described in more detail, below). These correction factors were then applied to the mock community data and the accuracy of the data, relative to the known starting abundances, was compared before and after the correction is applied (FIG. 7). For some species (e.g., Actinomyces odontolyticus and Lactobacillus gasseri), the use of the synthetic standard-based correction factor improved the accuracy of quantification. However, for other species (most prominently Proprionibacterium acnes (P. acnes)), the use of a synthetic standard-based correction factor decreased the accuracy of quantification.

P. acnes is the only organism in the HMP mock community that has a mismatch in its 16S rRNA gene to the V4 amplification primers. Reads from this organism are only seen in the sequencing data when a proofreading polymerase is used, allowing editing of the primer sequences to match the P. acnes template (FIG. 16). When the extent of recovery of P. acnes sequences from either the mock community or the synthetic standards were evaluated, the standard-based correction factors were inflating the abundance of P. acnes was that the standard molecules were not accurately reporting on the primer editing that was occurring with the P. acnes template (FIG. 8).

To troubleshoot the misestimation of several species when using the synthetic standard- based correction factors, the effect of linearizing the plasmid on (a) amplification of the standard sequences and (b) the extent of primer correction was evaluated. Again, both the even mock community DNA and the staggered mock community DNA were amplified using a range of template concentrations and two different enzymes (KAPA HiFi and 5 PRXME Taq). Different amounts and different relative abundances of synthetic standard DNA were spiked into the mock community DNA samples (0, 25, 250, and 2500 standard molecules per organism).

Using a circular (uncut) plasmid improved the accuracy of standard-based correction, including for 5 , acnes (FIG. 9). In addition, the overall accuracy across the whole even mock community (as measured by the root mean squared deviation (RMSD) from expected values) was improved with the circular standard based correction relative to both the uncorrected data and to the linear standard corrected data (which was less accurate than uncorrected data). Both the linear and circular standards improved the accuracy of quantification for the staggered mock community, though this effect is likely driven by several of the high abundance organisms in the community (FIG. 10).

Also, the effect of shearing the template DNA (to make it more closely resemble the linear standard molecules) on accuracy was tested. There was not a substantial difference in the measurements or corrections with either circular or linear standards between unsheared template and templates sheared to average sizes of 300 bp, 1 kb, or 5 kb (FIG. 11).

In addition, the depth to which the standard pool needs to be sequenced to get an accurate measurement of the relative abundances of the standard molecules was assessed. The standard reads were subsampled to different levels, correction factors were calculated, the correction factors were applied to the mock community data, and the variance in overall accuracy of quantification was examined. At low subsampling depths (<1,000 reads), the variance of the calculated RMSD values was high. 2,500 reads, however, produced a robust quantification (FIG. 12). This demonstrates that the standards should be able to be spiked in at a small fraction of the sample concentration and still be accurately quantified.

The improvement in accuracy seen with the circular standards may be due, at least in part, to the circular standards more effectively reporting on primer editing. The circular standards recovered a much larger amount of standard reads corresponding to P. acnes (FIG. 13), which led to an improved standard-based correction for this organism (FIG. 14). The mechanism for this difference in primer editing between the circular and linearized standard templates is currently unknown and under investigation.

Process Standards

Exemplary process control standards were designed to report on amplification biases that can arise through the interaction of amplification conditions and reagents with the biophysical properties of the template molecules— e.g., GC content, amplicon size, and/or secondary structure. The parameter space encompassed by the natural genetic variation in the V4 region of the 16S rRNA gene was evaluated by assessing the GC content and predicted secondary structure of all of the identifiable V4 regions in the Greengenes database. Next, the sequence of the E. coli 16 rRNA gene was varied in silco, adding different amounts of GC or AT bias and generating an in silico library of millions of variant sequences. Then, secondary structure predictions were generated for these sequences, sequences that were >97% identical to a sequence in the

Greengenes database were filtered out, a set of molecules that tiled the extent of natural GC content and secondary structure variation were chosen (FIG. 19). The naturally-occurring distribution of V4 region sizes were characterized and a set of standards to cover this size distribution were made, which could be used to detect biases in amplicon size due to size selection or amplification (FIG. 20). (See process control standards 33-96, below, SEQ ID

NO:40 through SEQ ID NO: 103). An additional set of process control standards was made for the eukaryotic ITS2 region, using similar design considerations (Synthetic standards 160-207, below, SEQ ID NO: 167 through SEQ ID NO:214). Primer Editing Standards

The generation of primer editing standards resulted from an unexpected phenomenon. An error correcting polymerase can edit primer sequences during amplification to correct mismatches between the primer sequence and a template molecule (FIG. 16). The V4 515F and V4 806R primers typically perform poorly for detecting P. acnes due to mismatches with the P. acnes 16S rRNA gene (FIG. 16). Low levels of P. acnes are detected using the EMP protocol, or with the DI (Taq) protocol; the species was also effectively absent in a published mock community EMP dataset (FIG. 16; Nelson et al., 2014, PLoS One 9:e94249). Surprisingly, relatively high levels of P. acnes were observed with the DI (Q5) and DI (KAPA) protocols (FIG. 16). When we the portion of reads corresponding to the amplification primers for the DI (Q5) and DI (KAPA) datasets were examined, approximately 4% of the V4 515F primer sequences had been edited from A>G at position 18 and approximately 4% of the V4 806R primer sequences had been edited from T>G at position 20, modifications matching the P. acnes template sequence (FIG. 16). No such modifications were observed in the DI (Taq) dataset (FIG. 16). These results demonstrate that proofreading polymerases can edit amplification primers in a PCR reaction, permitting the amplification of sequences from organisms whose templates contain primer mismatches. The efficiency of primer editing is noteworthy since the editing must occur in essentially every PCR cycle in order to be observed in the final sequencing reads.

The extent of adapter dimer formation, and therefore the overall sensitivity of the assay, can also be modulated by adjusting polymerase concentration (FIG. 21). Samples from this dilution series were amplified with KAPA HiFi polymerase either at the manufacturer' s recommended enzyme concentration (lx), at 0.5x, or at 0.25x the recommended enzyme concentration. In addition, these datasets were compared to samples amplified with KAPA ReadyMix (a pre-made 2x master mix). There was a strong correlation between the enzyme concentration used and the extent of adapter dimer formation. Samples amplified with the KAPA ReadyMix had by far the largest extent of adapter dimer formation, while samples amplified with 0.25x KAPA HiFi had negligible amounts of adapter dimer formation, even at very low template concentrations (FIG. 17). Except for the KAPA ReadyMix samples, which had very few reads for the samples with lowest template concentrations, accuracy was only moderately reduced with decreased concentrations of KAPA HiFi, particularly at lower PCR cycle numbers (FIG. 17). Decreasing KAPA HiFi concentration and thus the levels of adapter dimer contamination greatly improved sample balance (FIG. 17). Excessive enzyme concentration may mediate adapter dimer formation through, for example, chew-back and extension of primer heterodimers. Thus, optimizing the enzyme concentration can involve a tradeoff between on-target/off-target editing (FIG. 18), sensitivity, and overall accuracy. The extent of primer editing also varies as a function of polymerase concentration (FIG. 21). Since drop-out of a template molecule due to non-amplification represents the most severe form of PCR bias— i.e., it is a qualitative error as opposed to a quantitative

misestimation— it be desirable to have standard molecules that can report on the efficacy of primer editing. A set of standards were designed in which the V4 515F primer site from an E. coli 16S rRNA gene template has been modified with every possible single base mismatch in the most 3' 10 bp of the primer binding site (FIG. 2). These standards can be used to assess the extent of editing and any base preferences associated with primer editing (see primer editing standards 97-128, below; SEQ ID NO: 104 through SEQ ID NO: 135). An additional set of primer editing standards was made for the eukaryotic ITS2 region, using similar design considerations (Synthetic standards 129-159, below, SEQ ID NO: 136 through SEQ ID NO: 166). In addition, FIG. 3 presents data suggesting that the extent of primer editing can be controlled using exonuclease-protecting phosphorothiol modifications in the amplification primers.

Organisms with primer mismatches, such as P. acnes, are only amplified and present in the sequencing data at appreciable levels when a proofreading polymerase is used. Thus, synthetic standards that can report on the efficacy of primer editing and flag the potential drop out of taxa due to primer mismatches will help to identify qualitative errors in amplicon-based microbiome sequencing. A synthetic standard molecule can be used to identify such a taxon drop out. When the UMP mock community is amplified with standard Taq polymerase prior to sequencing, primer editing does not occur and P. acnes is not detected. Similarly, a drop out of the P. acnes standard molecule containing the corresponding primer mismatches is also observed (FIG. 15). This demonstrates that in addition to correcting for quantitative errors, synthetic standards can be used to flag qualitative errors (taxa drop out).

PCR-free quantification barcodes

As mentioned above, in some embodiments, the synthetic standard can include a barcode that allows PCR-free quantitation. PCR-free quantitation eliminated bias PCR-mediated amplification bias. A collection of 20 16S rRNA gene V4 synthetic standards were synthesized, cloned into a pTOPO vector, transformed into . coli (DH5alpha), and individual clones were sequence verified by Sanger sequencing. Plasmids were purified from each of the 20 sequence- verified clones using a Qiagen MiniPrep kit, and the plasmid DNA was quantified using a

PicoGreen assay. Plasmid DNA from the 20 clones was pooled at an equimolar ratio, such that each plasmid was expected to make up 5% of the standard pool. Next, the plasmid pool was cut with /yl to liberate the quantification barcodes (FIG. 4; see also FIG. 1). The /yl digested plasmid material was then directly sequenced using a fraction of a 100 bp single-read Illumina HiSeq lane, generating 592,728 reads. The Illumina adapters were trimmed from the resulting reads, and the reads were mapped to a reference file containing the known barcode sequences using a custom script. Using these counts, one can measure the actual abundances of the standard molecules in the standard pool, which must be accurately assessed for the standards to provide quantitative corrections. The actual abundances for the standard molecules were mostly close to, but not exactly 5%, with a range between 3.4% and 6.3% (FIG. 5).

The PCR-free barcode quantification technology described herein has numerous practical applications and can be used to make reliable measurements of essentially any mixture of engineered DNA constructs where PCR-free barcodes could be inserted. Exemplary applications include, for example, quantifying plasmid pools; quantifying pools of shRNA, CRISPR sgRNA plasmids, or viral vectors (such as would be used on large-scale genetic screening); quantifying transposon or other insertion libraries— e.g., Tn-Seq and related methods.

To demonstrate an exemplary application of this technology, a Tn5 transposon library was constructed containing random barcode-containing PCR-free barcode constructs within the transposon. This Tn5 element was cloned and transformed into E. coli together with the Tn5 transposase by electroporation to generate a library of >13,000 insertion strains (FIG. 30A). Prior to transformation, the barcode composition of the Tn5 library was confirmed to be highly diverse by digesting with /yl, purifying, and sequencing the PCR-free barcode construct (FIG. 30B). In order to use the PCR-free barcodes in the transposon to quantify Tn-Seq libraries, the barcodes are mapped to transposon junctions by fragmenting the library DNA, ligating on adapters, and enriching for the Tn5:genome junction by amplifying with a primer in the transposon and one in the adapter. Once barcode/genome associations have been made, then the library can be quantified in the absence of PCR by simply liberating the PCR-free barcodes using Mlyl and sequencing them.

It is possible to sequence the PCR-free quantification barcode cassette in the transposon construct from purified E. coli genomic DNA. Because the barcode cassette on the integrated transposon accounts for only a small fraction of the E. coli genome (around l/40,000 th ), it was unclear whether Illumina sequencing of the digested material would be possible in the context of the large amount of non-functional background DNA. Moreover, since the PCR-free quantification barcode molecules are sequenced directly, without any intervening amplification, the quantity of material that can be recovered will in most cases be below the recommended concentrations for loading an Illumina sequencer. Thus, to sequence the PCR-free quantification barcode cassette in the transposon construct from purified E. coli genomic DNA, after digestion of the genomic DNA with Mly the amount of transposon in the digested sample was quantified. Starting with more than 2 μg of genomic DNA, the PCR-free quantification barcode construct was recovered at a concentration of 112 pM as assessed by qPCR, roughly l/20 th of the recommended concentration for loading an Illumina MiSeq. Sequencing these libraries required a modified denaturation protocol in which the NaOH used to denature the DNA prior to sequencing was neutralized with an equal amount of HC1 so that excess NaOH in the sample did not interfere with clustering and sequencing. More than three million reads corresponding to the PCR-free quantification barcode construct from the transposon were obtained, which represented approximately 15,000 unique abundant barcodes, consistent with our estimates of transposon library complexity based on colony counts (FIG. 30B).

In some applications, such as, for example, those in which one would like to assess the same library across many experimental perturbations, it may be desirable to multiplex these measurements in a single sequencing lane. Constructs can be designed to test whether enzymes that leave small single strand overhangs can be used to liberate PCR-free barcode constructs so that multiple tags could be placed into a single concatamerized construct (FIG. 30C-E). The specificity of multiplexed PCR-free barcode constructs was tested by making an even pool of the four plasmids in FIG. 30C and cutting with /yl to liberate one of the four barcodes. FIG. 30D shows the successful liberation of barcodes by Mlyl.

To demonstrate another exemplary application of this technology, size standards were made to characterize the clustering efficiency of molecules of various sizes on different sequencing platforms. These standard molecules contain two PCR-free quantification barcode constructs on the same plasmid, ensuring that each pair is present in a truly equal molar ratio. Each plasmid contains a 164 bp M-liberatable PCR-free barcode construct and a second Mlyl- liberatable PCR-free barcode construct of variable size ranging from 150 bp to 1500 bp in 150 bp increments. The ratio of the variably sized construct to the 164 bp normalization control can be used to quantify and compare the number of reads resulting from each standard molecule, allowing direct measurement of sequencing platform-specific size biases (Synthetic standards 233-262, below, SEQ ID NO:240 through SEQ ID NO:269).

In the preceding description and following claims, the term "and/or" means one or all of the listed elements or a combination of any two or more of the listed elements; the terms "comprises," "comprising," and variations thereof are to be construed as open ended— i.e., additional elements or steps are optional and may or may not be present; unless otherwise specified, "a," "an," "the," and "at least one" are used interchangeably and mean one or more than one; and the recitations of numerical ranges by endpoints include all numbers subsumed within that range (e.g., 1 to 5 includes 1, 1.5, 2, 2.75, 3, 3.80, 4, 5, etc.).

In the preceding description, particular embodiments may be described in isolation for clarity. Unless otherwise expressly specified that the features of a particular embodiment are incompatible with the features of another embodiment, certain embodiments can include a combination of compatible features described herein in connection with one or more

embodiments.

For any method disclosed herein that includes discrete steps, the steps may be conducted in any feasible order. And, as appropriate, any combination of two or more steps may be conducted simultaneously.

The present invention is illustrated by the following examples. It is to be understood that the particular examples, materials, amounts, and procedures are to be interpreted broadly in accordance with the scope and spirit of the invention as set forth herein.

EXAMPLES

EXAMPLE 1

Samples and standards

The mock community DNA was obtained through BEI Resources, NIAID, NIH, as part of the Human Microbiome Project: Genomic Mock Community B (HM-276D, Even, High Concentration, v5.1H, and HM-277D, Staggered, High Concentration, v5.2H).

16S V4 synthetic standards were synthesized using an SGI-DNA BioXP 3200. These constructs were 3' adenylated by incubating with Taq polymerase and dATP at 72°C for 10 minutes. Next, the synthetic DNA was cloned into a pTOPO vector (Invitrogen) according to the manufacturer's protocol, transformed into . coli (DH5alpha), and individual clones were sequence verified by Sanger sequencing. Plasmids were purified from each of the 20 sequence- verified clones using a Qiagen MiniPrep kit, and the plasmid DNA was quantified using a PicoGreen assay and pooled as described above.

Full-length 16S rRNA standards, process control standards, and primer editing standards were synthesized as full plasmids (in the pUCGA backbone) using an SGI-DNA BioXP 3200.

DI method

The V4 region of the 16S rRNA was amplified using a two-step PCR protocol. The primary amplification was done in a qPCR reaction, using the AB 17900 so that the dynamics of the PCR reactions could be monitored. The following recipe was used: 3 μΐ template DNA, 0.48 μΐ nuclease-free water, 1.2 μΐ 5x KAPA HiFi buffer (Kapa Biosystems, Woburn, MA), 0.18 μΐ 10 mM dNTPs (Kapa Biosystems, Woburn, MA), 0.3 μΐ DMSO (Fisher Scientific, Waltham, MA), 0.12 μΐ ROX (25 μΜ) (Life Technologies, Carlsbad, CA), 0.003 μΐ ΙΟΟΟχ SYBR Green, 0.12 μΐ KAPA HiFi Polymerase (Kapa Biosystems, Woburn, MA), 0.3 μΐ forward primer (10 μΜ), 0.3 μΐ reverse primer (10 μΜ). Cycling conditions were: 95°C for 5 minutes, followed by 20 cycles of 98°C for 20 seconds, 55°C for 15 seconds, and 72°C for 1 minute. The primers for the primary amplification contained both 16S-specific primers (V4 515F and V4 806R), as well as adapter tails for adding indices and Illumina flow cell adapters in a secondary amplification.

The following primers were used (16S-specific sequences in bold):

V4_515F_Nextera:

TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGGTGCCAGCMGCCGCGGTAA

(SEQ ID NO: l)

V4_806R_Nextera:

GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGGACTACHVGGGTWTCTAAT

(SEQ ID NO:2)

The amplicons from the primary PCR were diluted 1 : 100 in sterile, nuclease-free water, and a second PCR reaction was set up to add the Illumina flow cell adapters and indices. The secondary amplification was done using the following recipe: 5 μΐ template DNA, 1 μΐ nuclease- free water, 2 μΐ 5x KAPA HiFi buffer (Kapa Biosystems, Woburn, MA), 0.3 μΐ 10 mM dNTPs (Kapa Biosystems, Woburn, MA), 0.5 μΐ DMSO (Fisher Scientific, Waltham, MA) 0.2 μΐ KAPA HiFi Polymerase (Kapa Biosystems, Woburn, MA), 0.5 μΐ forward primer (10 μΜ), 0.5 μΐ reverse primer (10 μΜ). Cycling conditions were: 95°C for 5 minutes, followed by 10 cycles of 98°C for 20 seconds, 55°C for 15 seconds, 72°C for 1 minute, followed by a final extension at 72°C for 10 minutes. The following indexing primers were used (X indicates the positions of the 8 bp indices):

Forward indexing primer:

AATGATACGGCGACCACCGAGATCTACACXXXXXXXXTCGTCGGCAGCGTC (SEQ ID NO:3)

Reverse indexing primer:

C A AGC AGA AGACGGC AT AC GAGATXXXXXXXXGTC TC GTGGGCTC GG (SEQ ID NO:4)

Dilution series experiments

For the dilution series experiments, the DI method primers (V4 515F_Nextera and V4_806R_Nextera, see above) were used for all of the comparisons. A ten-fold dilution series of the FDVI-276D mock community DNA was amplified for 20, 25, 30, or 35 cycles, using one of two different polymerases: Kapa HiFi HotStart (Kapa Biosystems, Woburn, MA), or 5 PRFME HotMasterMix (5 PRIME, Gaithersberg, MD). PCR recipes and cycling conditions for the primary amplifications were as follows:

KAPA HiFi primary PCR recipe: 2.5 μΐ DNA template, 0.48 μΐ nuclease-free water, 2 μΐ 5x KAPA HiFi buffer (Kapa Biosystems, Woburn, MA), 0.3 μΐ 10 mM dNTPs (Kapa

Biosystems, Woburn, MA), 0.5 μΐ DMSO (Fisher Scientific, Waltham, MA), 0.2 μΐ KAPA HiFi Polymerase (Kapa Biosystems, Woburn, MA), 0.5 μΐ forward primer (10 μΜ), 0.5 μΐ reverse primer (10 μΜ).

KAPA HiFi cycling conditions: 95°C for 5 minutes, followed by 20, 25, 30, or 35 cycles of 98°C for 20 seconds, 55°C for 15 seconds, 72°C for 1 minute, followed by 72°C for 5 minutes.

5 PRFME Taq cycling conditions: 94°C for 3 minutes, followed by 20, 25, 30, or 35 cycles of 94°C for 20 seconds, 55°C for 15 seconds, 72°C for 1 minute, followed by 72°C for 5 minutes. Primary PCRs were then diluted 1 : 100 in sterile, nuclease-free water, and a second PCR reaction was set up to add the Illumina flow cell adapters and indices. For these reactions the following recipes were used (polymerase-specific cycling conditions were the same as above, but using 10 cycles in the indexing step):

KAPA HiFi indexing PCR recipe: 5 μΐ 1 : 100 DNA template, 5 μΐ template DNA, 1 μΐ nuclease-free water, 2 μΐ 5x KAPA HiFi buffer (Kapa Biosystems, Woburn, MA), 0.3 μΐ 10 mM dNTPs (Kapa Biosystems, Woburn, MA), 0.5 μΐ DMSO (Fisher Scientific, Waltham, MA) 0.1 μΐ KAPA HiFi Polymerase (Kapa Biosystems, Woburn, MA), 0.5 μΐ forward primer (10 μΜ), 0.5 μΐ reverse primer (10 μΜ).

5 PRFME Taq indexing PCR recipe: 5 μΐ 1 : 100 DNA template, 4 μΐ 2x 5 PRIME Hot

Start High-Fidelity Master Mix, 1 μΐ sterile, nuclease-free water, dried-down indexing primers (final concentration of 0.5 μΜ for each primer).

KAPA HiFi concentration tests

For the KAPA HiFi concentration tests, amplifications were performed using the KAPA

HiFi primary PCR recipe and cycling conditions described in the dilution series experiment section above, but the amount of KAPA HiFi Polymerase added to the 0.5x reactions was cut in half (0.1 μΐ per 10 μΐ reaction) and the amount added to the 0.25x reactions was one fourth the lx concentration (0.05 μΐ per 10 μΐ reaction); nuclease-free water was added to compensate for the missing volume. The indexing reactions for each of these conditions was carried out with the 0.5x concentration of KAPA HiFi polymerase, so the differences observed between these conditions are a result of the differing KAPA HiFi polymerase concentrations in the primary PCR reaction. KAPA HiFi Readymix amplifications

KAPA HiFi ReadyMix PCRs were carried out as described above, using the DI primers (V4_515F_Nextera and V4_806R_Nextera, see above) using the following recipes:

KAPA HiFi Readymix PCR recipe: 2.5 μΐ DNA template, 5 μΐ 2x Kapa HiFi HotStart

Readymix, 0.5 μΐ forward primer (10 μΜ), 0.5 μΐ reverse primer (10 μΜ), 1.5 μΐ sterile, nuclease-free water. KAPA HiFi ReadyMix indexing PCR recipe: 5 μΐ 1 : 100 DNA template, 5 μΐ 2x Kapa HiFi HotStart Readymix, dried-down indexing primers (final concentration of 0.5 μΜ for each primer). Amplifying C. jejuni V4 and V3-V5 variable regions

DNA from a pure isolate of C. jejuni (81-176) was amplified using the V4 515F and V4 806R primers and the KAPA ReadyMix protocol described above, or using the KAPA HiFi (lx) protocol with primers for the V3-V5 variable region. The primer sequences for the primary amplification for the V3-V5 variable region were as follows (16S-specific sequences in bold): V3F_Nextera:

TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCCTACGGGAGGCAGCAG (SEQ

ID NO: 5)

V5R_Nextera:

GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGCCGTCAATTCMTTTRAGT (SEQ ID NO:6)

Normalization and pooling of sequencing libraries

For sample normalization prior to sequencing, for experiments not including the synthetic standard molecules, PCR products were quantified using a PicoGreen dsDNA assay (Life Technologies, Carlsbad, CA), and the samples were normalized, pooled, and approximately 1 μg of material was concentrated to 10 μΐ using 1.8x AMPureXP beads (Beckman Coulter, Inc.,

Brea, CA). The pooled sample was then size selected at 427 bp +/- 20% for the DI pools, or at

368 bp +/- 20% for the EMP pools, on a Caliper XT DNA 750 chip (Caliper Life Science,

Hopkinton, MA). The size-selected material was cleaned up using AMPureXP beads, and eluted in 20 μΐ of EB buffer (10 mM Tris-HCl, pH 8.5). The final pooled sample was quantified using the PicoGreen dsDNA assay.

For experiments containing the synthetic standards, samples were normalized prior to sequencing using a SequalPrep normalization plate kit (Therm oFisher) according to

manufacturer's instructions.

The libraries containing the PCR-free quantification barcodes were prepared by treating the standard plasmid pools with /yl (New England Biolabs, Inc., Ipswich, MA), following manufacturer's recommendations for the digest. The resulting digest was purified using

AmPureXP beads, and quantified with the PicoGreen assay.

Sequencing

The sample pools were diluted to 2 nM based on the PicoGreen measurements, and 10 μΐ of the 2 nM pool was denatured with 10 μΐ of 0.2 N NaOH, diluted to 8 pM in Illumina's HT1 buffer, spiked with 15% PhiX, heat denatured at 96°C for 2 minutes, and sequenced using a MiSeq 600 cycle v3 kit (Illumina, San Diego, CA). Analysis

The mock community samples were subsampled to a depth of 10,000 reads per sample. Sequencing adapter sequences were then trimmed using Trimmomatic (Bolger et al., 2014, Bioinformatics btul70) and PANDAseq (Masella et al., 2012, BMC Bioinformatics 13 :31) was used to remove primer sequences (where applicable) and join paired end reads. Fastq files were converted to QIIME (Caporaso et al., 2010, Nat. Methods 7:335-336) fastq format using a custom script. Next, individual sample fasta files were concatenated into one fasta file and chimera detection and removal was run using ChimeraSlayer's usearch61 method (Haas et al., 2011, Genome Res. 21 :494-504). The resulting reads were mapped to an HMP mock community reference file (Salipante et al., 2014, Appl. Environ. Microbiol. AEM.02206-14-;

doi: 10.1128/AEM.02206- 14) for the calculation of the percent abundance, RMSD, and MAPE values. The distribution of primer corrections was analyzed by cataloging mismatches to the V4 primer sequences using custom Python scripts and BioPython (Cock et al., 2009, Bioinformatics 25: 1422-1423). Illumina adapters were trimmed using cutadapt (Martin, M., 2011,

EMBnet.journal 17: 10-12) and paired reads were merged using PANDAseq (Masella et al., 2012, BMC Bioinformatics 13 :31). In order to filter out noise from indels in the primer regions, a threshold of a maximum of three mismatches per primer sequence was used for this analysis. The primer sequences associated with the differentially abundant OTUs in the NHP and human datasets were analyzed by searching for exact matches to the rep set sequences from these OTUs in the untrimmed subsampled fastq files. The analysis of the PCR-free quantification barcodes and synthetic standard experiments were carried out using custom Python scripts. EXAMPLE 2

Primer editing standards

Synthetic standards were designed that allow primer editing to be studied in greater detail and monitored for the purposes of process quality control/quality assurance. These standards are based on the V4 515F primer region of E. coli and include 30 plasmids containing the E. coli 16S rRNA gene V4 (variable region 4) with every possible single base mismatch in the last 10 bp of the primer sequence and one wild-type plasmid (FIG. 27A). Similar standards could be designed for any primer region of interest, such as the additional set of standards developed for ITS2. The synthetic primer editing standards were synthesized, cloned into the pUCGAl .O cloning vector, and transformed into NEB 5 Alpha E. coli competent cells. Multiple clones were picked for each construct, DNA was extracted and the constructs were sequenced by Sanger sequencing to verify that the sequence of the synthetic DNA was correct. The following primers were used for Sanger verification of these constructs: pUCGA1.0_Sanger_For: C GAC T C T AGAG GAT C GAG C AC A (SEQ ID NO:270)

pUCGA1.0_Sanger_Rev: T TCGAGCTCGGTACCCGCAT (SEQ ID NO:271)

DNA from the 31 standard plasmids was quantified using the Quant-iT PicoGreen dsDNA quantitation assay (Thermo Fisher Scientific, Inc., Wa!tham, MA) and the plasmids were pooled at equal masses. The PCR-free quantification barcode constructs in the plasmids were used to verify that each construct was present in the pool and to determine the exact ratios of construct, abundances. The following restriction digest was used to liberate PCR-free quantification barcodes: 17 μΐ primer editing standard pool DNA (10 ng/μΐ), 2 μΐ Cutsmart buffer (New

England Biolabs Inc., Ipswich, MA), 1 μΐ Mlyl (New England Biolabs Inc., Ipswich, MA). The digests were incubated at 37°C for one hour, then 30 μΐ of water was added to the digest (to bring volume up to 50 μΐ), then 30 μΐ of magnetic beads (0.6x AMPure XP, Beckman Coulter, Inc., Brea, CA) were added and the supernatant added transferred to new tube (discarded beads). The restriction digest (supernatant from 0.6x binding) was purified using magnetic beads (1.8x AmpureXP beads, Beckman Coulter, Inc., Brea, CA) and eluted in 25 μΐ of elution buffer.

The eluted DNA was quantified using both Quant-iT PicoGreen dsDNA quantitation assay (Thermo Fisher Scientific, Inc., Waltham, MA) and Bioanalyzer HS analysis (Agilent Technologies, Santa Clara, CA). The pool was diluted to 2 nM and sequenced on a fraction of an MISEQ 2 x 300 bp lane (Illumina, Inc., San Diego, CA) following the manufacturer's instructions (8 pM clustering concentration). Composition of the plasmid pool (barcode counts and percentages) was determined using a custom python script.

In order to assess the ability of these standards to report on primer editing, and to compare the editing abilities of different enzymes, the primer editing standard pool was amplified using eight different polymerases: KAPA HiFi (KAPA Biosystems, Woburn, MA), Qiagen Taq (Qiagen USA, Germantown, MD), Q5 (New England Biolabs, Inc., Ipswich, MA), PHUSION (Thermo Fisher Scientific, Inc., Waltham, MA), VENT (New England Biolabs, Inc., Ipswich, MA), Pfu DNA polymerase (Promega Corp., Madison, WI), ACCUPRIME Taq (Invitrogen, Thermo Fisher Scientific, Carlsbad, CA), and Taq (New England Biolabs, Inc., Ipswich, MA) at four different concentrations (0.25x, 0.5x, lx, or 2x manufacturer's

recommended concentration) and the primer editing standard pool at four different template concentrations (250,000 template molecules, 25,000 template molecules, 2,500 template molecules, or 250 template molecules per standard). E. coli specific primers (non-degenerate V4 515F/V4 806R) were used for these amplifications:

E_coli_V4_515F:

TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGGTGCCAGCAGCCGCGGTAA (SEQ ID NO:272) E_coli_V4_806R:

GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGGACTACCAGGGTATCTAAT (SEQ ID NO:273)

PCR recipes and conditions are listed in Table 1, below (volumes are in microliters, temperatures are in degrees Celsius, all amplifications were done for 30 PCR cycles).

Table 1

These amplicons were then diluted 1 : 100, and amplified with 10 cycles of PCR (using KAPA HiFi 0.5x conditions) with indexing primers to add sample specific indices and Illumina flow cell adapters. Indexing primers had the following sequence ([i5] and [i7] refer to the index sequence codes used by Illumina, the p5 and p7 flow cell adapters are in bold):

Forward indexing primer:

AATGATACGGCGACCACCGAGATCTACAC [ i5 ] TCGTCGGCAGCGTC (SEQ ID NO:274) Reverse indexing primer:

CAAGCAGAAGACGGCATACGAGAT [ i7 ] GTCTCGTGGGCTCGG (SEQ ID NO:275)

Indexed samples were normalized using normalization plates (SEQUALPREP, Thermo

Fisher Scientific, Waltham, MA), an equal volume of each sample was pooled, and the sample pool was purified and concentrated using magnetic beads (lx AmPureXP, Beckman Coulter, Inc., Brea, CA), and eluted in 25 μΐ of elution buffer. The eluted DNA was quantified using both Quant-iT PicoGreen dsDNA quantitation assay (Thermo Fisher Scientific, Inc., Wa!tham, MA) and Bioanalyzer HS analysis (Agilent Technologies, Santa Clara, CA). The pool was diluted to 2 nM and sequenced on a fraction of an MISEQ 2 χ 300 bp lane (Illumina, Inc., San Diego, CA) following the manufacturer's instructions (8 pM clustering concentration). Composition of the plasmid pool (barcode counts and percentages) was determined using a custom python script.

Primer editing was not observed with non-proofreading polymerases (e.g., NEB Taq, Qiagen Taq, or Accuprime Taq). The proofreading polymerases tested were all able to edit the amplification primers to match the primer editing standard templates, though the extent to which the editing took place was variable between the different enzymes at the manufacturer's recommended enzyme concentration, lx (FIG. 24B). There was little apparent sequence specificity to the editing observed by proofreading polymerases, with the possibly exception of a slight bias towards G/C edits in the two 3' terminal positions (FIG. 24C). The extent of primer editing observed was dependent on the concentration of enzyme used (FIG. 24D), though the magnitude of concentration dependence also varied among the different enzymes studied (data not shown). These data provide direct evidence of enzyme concentration dependent primer editing. Template concentration had little discernable effect on primer editing at the

concentrations that could be reliably examined. These results demonstrate that the primer editing standards can be used to assess the amount of primer editing that occurs under various reaction conditions.

Next, the wildtype E. coli plasmid standard was amplified with a mixed pool of primers containing the 31 possible sequences encoded in the primer editing plasmid pool using KAPA HiFi polymerase (lx reaction condition). The mutant primers were edited to match the wildtype template sequence with a similar extent and frequency as the edits seen in the previous experiments with the primer editing standards. This demonstrates that the primer editing standards accurately report on the extent and frequency of primer editing.

In order to determine whether introduction of a phosphorothiol bond at a specific position in the primer sequence could limit the extent of primer editing, the primer editing standard pool was amplified with KAPA HiFi polymerase (lx reaction condition) using E. coli V4_5 \5 ~ F derivatives containing a single phosphorothiol bond at position 15, 16, 17, 18, or 19, together with the E. co// ' _V4_806R primer. These amplicons were indexed and sequenced as described above. Introduction of the phosphorothiol bond at a specific position caused a truncation of any primer editing activity 5' of the position of the phosphorothiol bond (FIG. 25). These results demonstrate that primer editing is tunable by using phosphorothiol modified primers.

EXAMPLE 3

Biophysical standards

A set of biophysical process control standards were designed to report on amplification biases that arise through the interaction of amplification conditions and reagents with the biophysical properties of the template molecules such as, for example, GC content, amplicon size, and/or secondary structure). These controls were designed to tile the parameter space encompassed by the natural genetic variation in the V4 region of the 16S rRNA gene, as assessed by the GC content and predicted secondary structure of all of the identifiable V4 regions in the Greengenes database (DeSantis et al., 2006. Appl Environ Microbiol 72:5069-72). (FIG. 26A). Next, the sequence of the E. coli 16 rRNA molecule was varied in silco, adding different amounts of GC or AT bias and generating an in silico library of millions of variant sequences. Then, secondary structure predictions were generated for these sequences, sequences that were >97% identical to a sequence in the Greengenes database were filtered out, and a set of molecules that tiled the extent of natural GC content and secondary structure variation (FIG. 26A) were selected. The naturally occurring distribution of V4 region sizes also were

characterized and a set of standards was made to cover this size distribution, which could be used to detect biases in amplicon size due to size selection or amplification bias (FIG. 26B).

The biophysical process control standards were synthesized, cloned, transformed, and sequence verified as described for the primer editing standards in EXAMPLE 2. They were normalized, pooled, and the PCR-free quantification barcodes were used to determine exact pool composition as described for the primer editing standards in EXAMPLE 2.

In order to assess the ability of these standards to report on amplification bias, and to compare the biases of different enzymes, the biophysical standard pool was amplified using eight different polymerases: KAPA HiFi (KAPA Biosy stems, Woburn, MA), Qiagen Taq (Qiagen USA, Germantown, MD), Q5 (New England Biolabs, Inc., Ipswich, MA), PHUSION (Thermo Fisher Scientific, Inc., Waltham, MA), VENT (New England Biolabs, Inc., Ipswich, MA), Pfu DNA polymerase (Promega Corp., Madison, WI), ACCUPRIME Taq (Invitrogen, Thermo Fisher Scientific, Carlsbad, CA), and Taq (New England Biolabs, Inc., Ipswich, MA) at four different concentrations (0.25x, 0.5x, lx, or 2x manufacturer' s recommended concentration) and the biophysical standard pool at four different template concentrations (250,000 template molecules, 25,000 template molecules, 2,500 template molecules, or 250 template molecules per standard).

Standard degenerate V4 515F/V4 806R were used for these amplifications:

V4_515F_Nextera:

TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGGTGCCAGCMGCCGCGGTAA (SEQ ID NO:276) V4_806R_Nextera:

GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGGACTACHVGGGTWTCTAAT (SEQ ID NO:277)

PCR recipes and conditions are listed in Table 1, above, in EXAMPLE 2. These amplicons were indexed and sequenced as described above. Sequence data was trimmed of adapters and primer sequences using cutadapt (Martin, M., 201 1 . E&iBneL journal 17(1): 10-12) paired end reads were merged using pandaseq (Masella et al., 2012. BMC Bioinformatics 13 :31) or PEAR (Zhang et al., 2014. Bioinformatics 30(5):614-620) and reads were mapped to a biophysical standards reference file using bowtie2 (Langmead et al., 2012. 9(4):357-359). Size standards were analyzed by counting sequences of various sizes after read merging using a custom python script. Different polymerases produced data that had distinctive patterns with respect to GC content and amplicon size (FIG. 26C-F). These results demonstrate that the biophysical standards can be used to report on bias due to amplicon biophysical properties in an amplification reaction.

EXAMPLE 4

Full-length 16S rRNA gene synthetic spike-in standards

After seeing inconsistent results with synthetic standards targeting just the 16S rRNA gene variable region V4, a set of 20 full-length 16S rRNA gene standards were designed with three independent 3 bp tags in variable regions V3, V4, and V5.

The tagged spike-in standards were synthesized, cloned, transformed, and sequence verified as described above in EXAMPLE 2. They were normalized, pooled, and the PCR-free quantification barcodes were used to determine exact pool composition as described above in EXMAPLE 2.

In order to test the ability of these full-length 16S rRNA gene synthetic standards to correct for amplification bias, a commercially available mock community reference standard was amplified with or without the pool of tagged synthetic spike-in standards. The pool of tagged synthetic spike-in standards contained plasmids corresponding to five of eight bacterial strains in the mock microbial community. These samples were amplified with primers targeting the bacterial 16S rRNA gene variable regions V1-V3, V3-V4, V4, and V5-V6, using either the KAPA HiFi lx, or Qiagen Taq lx reaction conditions described above, and the primer sets shown in Table 2.

Table 2

These amplicons were indexed and sequenced as described above. A custom python script was used to identify reads containing the 3 bp sequence tag that marks a read as corresponding to a spike-in standard, and spike-in standard and non-spike-in standard reads were split into separate fastq files. These reads were then trimmed, merged, and mapped to their respective reference files as described above in EXAMPLE 3. The abundance values for the reads corresponding to the tagged synthetic spike-in standards were then compared to their expected values, determined using the PCR-free barcode counts from the standard pool, in order to determine the extent of amplification bias observed for each construct. The ratio of observed to expected abundance for each construct was used to calculate a correction factor, and applied this correction factor to the mock community data for each sample (for the 5 strains targeted by a tagged spike-in standard), while normalizing the data to keep the total percentage for all organisms at 100% (FIG. 27A). As can be seen in FIG. 27B-E, measurements for strains targeted by a tagged spike-in standard (dark dots) were in general considerably more accurate when the calculated correction factor was applied to the data. In addition, the aggregate accuracy of the measurements for the whole mock community (including the data points for the non-targeted strains, light dots) was also improved when the correction factors were applied (FIG. 27D-E).

EXAMPLE 5

The precision and accuracy of the PCR-free barcode quantification technology was tested using a standard pool described above in EXAMPLE 1 consisting of 20 tagged synthetic spike-in constructs targeting 16S rRNA gene variable region V4 and each containing a distinct 20 bp PCR-free quantification barcode construct.

Precision of PCR-free quantification barcode measurements

The 20-construct standard pool was cut with /yl as follows:

10 μΐ plasmid DNA (from pooled sample - 50 ng/μΐ)

2 μΐ Cutsmart buffer (New England Biolabs Inc., Ipswich, MA)

7 μΐ water

1 μΐ Mlyl restriction enzyme (New England Biolabs Inc., Ipswich, MA)

The reaction was incubated at 37°C for one hour. Next, 14 μΐ of solid phase reversible immobilization beads (SPRI 0.7x, Beckman Coulter, Inc., Brea, CA) were added. The supernatant (35 μΐ) was transferred to a tube with 70 μΐ of SPRI beads (2x), washed twice with 80%) ethanol, air-dried for 10 minutes, then eluted in 20 μΐ elution buffer. The eluted DNA was quantified using both Quant-iT PieoGreen dsDNA quantitation assay (Thermo Fisher Scientific, Inc., Waltham, MA) and Bioanalyzer HS analysis (Agilent Technologies, Santa Clara, CA). The pool was diluted to 2 nM and sequenced on a fraction of an MISEQ 2 χ 300 bp lane (Illumina, Inc., San Diego, CA) following the manufacturer's instructions (8 pM clustering concentration). Barcode counts were determined using a custom python script (FIG. 28A).

Based on this initial sequencing data, two additional pools of these standards were made: a re-pooled even pool (targeting 5%> abundance for each construct) and a staggered pool (with a range of targeted abundances for each construct spanning roughly four logs). The even re-pooled sample was processed and sequenced as above and yielded data that showed that construct balance was improved in the pool (FIG. 28B).

To test the precision of the PCR-free barcode quantification technique, three independent digests of the re-pooled even standard pool were performed, purified, and sequenced as above. The three technical replicates yielded nearly identical data, demonstrating that this PCR-free barcode quantification technique is highly precise (FIG. 28C).

Accuracy of PCR-free quantification barcode measurements

Next, the accuracy of the PCR-free barcode quantification method was assessed by first comparing these measurements to those obtained by using PCR to amplify the barcode cassette, followed by comparison of the PCR-free and PCR measurements to droplet digital PCR measurements.

PCR-free barcode measurements of the initial re-pooled even mixture and the staggered pool were made as described above, with the exception that in the case of the staggered mixture 197.2 ng, as opposed to 500 ng of DNA was digested with /yl. To set up the PCR reactions, pooled DNA was diluted to 1 ng/μΐ, to which 1 ng of DNA (1 μΐ diluted in 24 μΐ of water) per 50 μΐ PCR reaction was added, lx Qiagen Taq conditions were used to amplify for 10 cycles, 20 cycles, 30 cycles, or 40 cycles, with the following primers (that target the ends of the PCR-free barcode construct): p5: AAT GAT AC G G C GAC C AC C GA (SEQ ID NO:286)

p7: CAAG C AGAAGAC G G CAT AC GA (SEQ ID NO:287)

The mixtures were amplified as follows:

95°C - 5 minutes

X cycles

94°C - 30 seconds

60°C - 30 seconds

72°C - 30 seconds 72°C - 10 minutes

4°C - hold

The PCR reactions were purified using magnetic beads (0.8x AmpureXP beads, Beckman Coulter, Inc., Brea, CA) and eluted purified DNA in 25 μΐ of elution buffer. The eluted DNA was quantified using both Quant-iT PicoGreen dsD A quantitation assay (Thermo Fisher Scientific, inc., WaStham, MA) and Bioanalyzer HS analysis (Agilent Technologies, Santa Clara, CA). The pool was diluted to 2 nM and sequenced on a fraction of an MISEQ 2 χ 300 bp lane (Illumina, Inc., San Diego, CA) following the manufacturer's instructions (8 pM clustering concentration). Barcode counts were determined using a custom python script.

Increasing the number of PCR cycles led to increased quantitative deviation from the expected values as well as the values measured with the PCR free barcode quantification method for both the even and staggered plasmid pool (FIG. 29A-D).

To further confirm the accuracy of the PCR free barcode quantification method, these measurements we compared to droplet digital PCR (ddPCR) measurements, a gold standard for accurate quantification. To measure the relative amount of each barcode in the even and staggered pools by ddPCR, a collection of 40 primer sets were designed that amplified between the plasmid backbone and each of the unique 20 bp barcode sequences in both the forward and reverse orientations. The specificity of these primer sets was determined by amplifying each individual plasmid construct with all 40 possible primer sets by qPCR. ddPCR reactions were carried out using a QX200 droplet digital PCR system (Bio-Rad Laboratories, Inc., Hercules, CA) following the manufacturer's instructions. The following reaction recipe was used:

5 μΐ - template (1 : 10,000 dilution of 1 ng/μΐ plasmid pool template. Note: for staggered template, different dilutions were made for different assays in order to make sure that all measurements were in the quantitative range of the instrument)

0.44 μΐ primer 1

0.44 μΐ primer 2

5.12 μΐ water

11 μΐ dye (EVAGREEN, Biotium, Fremont, CA)

2 μΐ of I-Scel (New England Biolabs, Inc., Ipswich, MA) to linearize the plasmids was added to the reaction master mix. The reactions were partitioned into emulsions and then cycled using the following PCR conditions (lid temp = 105°C): 95°C - 10 minutes

40 cycles of:

95°C 30 seconds

55°C 1 minute

72°C 5 minutes

12°C hold

Results were analyzed using QuantaSoft software (Bio-Rad Laboratories, Inc., Hercules, CA). In cases where there was clear separation of positive and negative droplet signals, a threshold was drawn that separated these populations of droplets in order to generate a molecule count for each assay. Assays that did not show a clear separation of positive and negative droplet signals were not analyzed. Data was averaged for any replicates and for the forward and reverse orientation assays for each construct to produce one measurement for each barcode construct.

The ddPCR measurements of the even plasmid pool correlated very well with the PCR- free barcode measurements, but did not correlate well with the measurements of the barcode constructs made with 10-40 PCR cycles (FIG. 29E). Similar results were obtained for the staggered plasmid pool. The strong correlation of ddPCR measurements with the PCR-free barcode measurements demonstrates that the PCR-free barcode quantitation method is highly accurate.

The complete disclosure of all patents, patent applications, and publications, and electronically available material (including, for instance, nucleotide sequence submissions in, e.g., GenBank and RefSeq, and amino acid sequence submissions in, e.g., SwissProt, PIR, PRF, PDB, and translations from annotated coding regions in GenBank and RefSeq) cited herein are incorporated by reference in their entirety. In the event that any inconsistency exists between the disclosure of the present application and the disclosure(s) of any document incorporated herein by reference, the disclosure of the present application shall govern. The foregoing detailed description and examples have been given for clarity of understanding only. No unnecessary limitations are to be understood therefrom. The invention is not limited to the exact details shown and described, for variations obvious to one skilled in the art will be included within the invention defined by the claims.

Unless otherwise indicated, all numbers expressing quantities of components, molecular weights, and so forth used in the specification and claims are to be understood as being modified in all instances by the term "about." Accordingly, unless otherwise indicated to the contrary, the numerical parameters set forth in the specification and claims are

approximations that may vary depending upon the desired properties sought to be obtained by the present invention. At the very least, and not as an attempt to limit the doctrine of equivalents to the scope of the claims, each numerical parameter should at least be construed in light of the number of reported significant digits and by applying ordinary rounding techniques.

Notwithstanding that the numerical ranges and parameters setting forth the broad scope of the invention are approximations, the numerical values set forth in the specific examples are reported as precisely as possible. All numerical values, however, inherently contain a range necessarily resulting from the standard deviation found in their respective testing measurements.

All headings are for the convenience of the reader and should not be used to limit the meaning of the text that follows the heading, unless so specified.

Sequence Listing Free Text

Synthetic standard sequences - initial 16S rRNA gene V4 region tests (HMP mock community)

Sequence tag in bold

lllumina adapters in lower case

>0 l_ABa_v4_TCT (SEQ ID NO:8)

AATAAGCACCGGCTAACTCTGTGCCAGCAGCCGCGGTAATACAGAGGGTGCGAGCGTTAA TCGGATTTAC TGGGCGTAAATCTGCGTGCGTAGGCGGCTTATTAAGTCGGATGTGAAATCCCCGAGCTTA ACTTGGGAAT TGCATTCGATACTGGTGAGCTAGAGTATGGGAGAGGATGGTAGAATTCCAGGTGTAGCGG TGAAATGCGT AGAGATCTGGAGGAATACCGATGGCGAAGGCAGCCATCTGGCCTAATACTGACGCTGAGG TACGAAAGCA TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCATGCCGTAAACGATGTCTACTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tGCACATTGTAGCGTTGATAAagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>02_BCe_v4_TCT (SEQ ID NO:9)

AGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGCGTTAT CCGGAATTAT TGGGCGTAAATCTGCGCGCGCAGGTGGTTTCTTAAGTCTGATGTGAAAGCCCACGGCTCA ACCGTGGAGG GTCATTGGAAACTGGGAGACTTGAGTGCAGAAGAGGAAAGTGGAATTCCATGTGTAGCGG TGAAATGCGT AGAGATATGGAGGAACACCAGTGGCGAAGGCGACTTTCTGGTCTGTAACTGACACTGAGG CGCGAAAGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAGTGCTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tCGGAGGAGCTATAAATGACCagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>03_BVU_V4_TCT (SEQ ID NCv lO)

AATAAGGATCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGATCCGAGCGTTAT CCGGATTTAT TGGGTTTAAATCTGGGAGCGTAGATGGATGTTTAAGTCAGTTGTGAAAGTTTGCGGCTCA ACCGTAAAAT TGCAGTTGATACTGGATATCTTGAGTGCAGTTGAGGCAGGCGGAATTCGTGGTGTAGCGG TGAAATGCTT AGATATCACGAAGAACTCCGATTGCGAAGGCAGCCTGCTAAGCTGCAACTGACATTGAGG CTCGAAAGTG TGGGTATCAAACAGGATTAGATACCCTGGTAGTCCACACGGTAAACGATGAA AC AGGGATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tTAATCGTACAAATTTCGAAAagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>04_BVU_V4_TCT (SEQ ID NCv l l)

AATAAGGATCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGATCCGAGCGTTAT CCGGATTTAT TGGGTTTAAATCTGGGAGCGTAGATGGATATTTAAGTCAGTTGTGAAAGTTTGCGGCTCA ACCGTAAAAT TGCAGTTGATACTGGATATCTTGAGTGCAGTTGAGGCAGGCGGAATTCGTGGTGTAGCGG TGAAATGCTT AGATATCACGAAGAACTCCGATTGCGAAGGCAGCCTGCTAAGCTGCAACTGACATTGAGG CTCGAAAGTG TGGGTATCAAACAGGATTAGATACCCTGGTAGTCCACACGGTAAACGATGAA AC AGGGATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tAACATTCAGGCGCACCTACAagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>05_BVU_V4_TCT (SEQ ID NO: 12)

AATAAGGATCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGATCCGAGCGTTAT CCGGATTTAT TGGGTTTAAATCTGGGAGCGTAGATGGATGTTTAAGTCAGTTGTGAAAGTTTGCGGCTCA ACCGTAAAAT TGCAGTTGATACTGGATATCTTGAGTGCAGTTGAGGCAGGCGGAATTCGTGGTGTAGCGG TGAAATGCTT AGATATCACGAGGAACTCCGATTGCGAAGGCAGCCTGCTAAGCTGCAACTGACATTGAGG CTCGAAAGTG TGGGTATCAAACAGGATTAGATACCCTGGTAGTCCACACGGTAAACGATGAA AC AGGGATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tTATGCCGCTCGAGACCCATTagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>06_CBe_v4_TCT (SEQ ID NO: 13)

AGGAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGCGTTGT CCGGATTTAC TGGGCGTAAATCTGGGAGCGTAGGTGGATATTTAAGTGGGATGTGAAATACTCGGGCTTA ACCTGGGTGC TGCATTCCAAACTGGATATCTAGAGTGCAGGAGAGGAAAGTAGAATTCCTAGTGTAGCGG TGAAATGCGT AGAGATTAGGAAGAATACCAGTGGCGAAGGCGACTTTCTGGACTGTAACTGACACTGAGG CTCGAAAGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAA AC AGGGATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tTTGGATCACTACGCCAGGACagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>07_CBe_v4_TCT (SEQ ID NO: 14)

AGGAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGCGTTGT CCGGATTTAC TGGGCGTAAATCTGGGAGCGTAGGTGGATATTTAAGTGGGATGTGAAATACTCGGGCTTA ACCTGGGTGC TGCATTCCAAACTGGATATCTAGAGTGCAGGAGAGGAAAGTAGAATTCTTAGTGTAGCGG TGAAATGCGT AGAGATTAGGAAGAATACCAGTGGCGAAGGCGACTTTCTGGACTGTAACTGACACTGAGG CTCGAAAGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAA AC AGGGATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tCATCCTACAGGCTTGTAGCGagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>08_DRa_v4_TCT (SEQ ID NO: 15)

TAATAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTAC CCGGAATCAC TGGGCGTAAATCTGGGCGTGTAGGCGGAAATTTAAGTCTGGTTTTAAAGACCGGGGCTCA ACCTCGGGGA TGGACTGGATACTGGATTTCTTGACCTCTGGAGAGGTAACTGGAATTCCTGGTGTAGCGG TGGAATGCGT AGATACCAGGAGGAACACCAATGGCGAAGGCAAGTTACTGGACAGAAGGTGACGCTGAGG CGCGAAAGTG TGGGGAGCAAACCGGATTAGATACCCGGGTAGTCCACACCCTAAACGATGTACGTTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tAGGCGTTTAGGTACCTGTTCagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>09_EFa_v4_TCT (SEQ ID NO: 16)

AGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGCGTTGT CCGGATTTAT TGGGCGTAAATCTGCGAGCGCAGGCGGTTTCTTAAGTCTGATGTGAAAGCCCCCGGCTCA ACCGGGGAGG GTCATTGGAAACTGGGAGACTTGAGTGCAGAAGAGGAGAGTGGAATTCCATGTGTAGCGG TGAAATGCGT AGATATATGGAGGAACACCAGTGGCGAAGGCGGCTCTCTGGTCTGTAACTGACGCTGAGG CTCGAAAGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAGTGCTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tTCACAAGTTTCGCAATCGAGagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>10_ECO_V4_TCT (SEQ ID NO: 17)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAATCTGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCA ACCTGGGAAC TGCATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGG TGAAATGCGT AGAGATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGG TGCGAAAGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tTATTGCTAAGGCTATGGAGAagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>ll_HPy_v4_TCT (SEQ ID NO: 18)

AATAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTAC TCGGAATCAC TGGGCGTAAATCTGAGCGCGTAGGCGGGATAGTCAGTCAGGTGTGAAATCCTATGGCTTA ACCATAGAAC TGCATTTGAAACTACTATTCTAGAGTGTGGGAGAGGTAGGTGGAATTCTTGGTGTAGGGG TAAAATCCGT AGAGATCAAGAGGAATACTCATTGCGAAGGCGACCTGCTGGAACATTACTGACGCTGATT GCGCGAAAGC GTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCCTAAACGATGGATGCTAGG GATAACAGGG TAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacg ctcttccgat ctCTGACGGGACAAACGGATCTagatcggaagagcacacgtctgaactccagtcacAATC AGTCTCGTat ctcgtatgccgtcttctgcttgTTGTCGACTC

>l2_LGa_v4_TCT (SEQ ID NO: 19)

AGAAAGTCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGCGTTGT CCGGATTTAT TGGGCGTAAATCTGCGAGTGCAGGCGGTTCAATAAGTCTGATGTGAAAGCCTTCGGCTCA ACCGGAGAAT TGCATCAGAAACTGTTGAACTTGAGTGCAGAAGAGGAGAGTGGAACTCCATGTGTAGCGG TGGAATGCGT AGATATATGGAAGAACACCAGTGGCGAAGGCGGCTCTCTGGTCTGCAACTGACGCTGAGG CTCGAAAGCA TGGGTAGCGAACAGGATTAGATACCCTGGTAGTCCATGCCGTAAACGATGAGTGCTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tCATCGCGGACAACGCCAACCagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>l3_NMe_v4_TCT (SEQ ID NO:20)

AATAAGCACCGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTAA TCGGAATTAC TGGGCGTAAATCTGCGGGCGCAGACGGTTACTTAAGCAGGATGTGAAATCCCCGGGCTCA ACCCGGGAAC TGCGTTCTGAACTGGGTGACTCGAGTGTGTCAGAGGGAGGTAGAATTCCACGTGTAGCAG TGAAATGCGT AGAGATGTGGAGGAATACCGATGGCGAAGGCAGCCTCCTGGGACAACACTGACGTTCATG CCCGAAAGCG TGGGTAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCCTAAACGATGTCAATTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tAAGGGTCACACAGTATCATCagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>14_PAC_V4_TCT (SEQ ID NO:21)

AAGAAGCACCGGCTAACTACGTGCCAGCAGCCGCGGTGATACGTAGGGTGCGAGCGTTGT CCGGATTTAT TGGGCGTAAATCTGGGCTCGTAGGTGGTTGATCGCGTCGGAAGTGTAATCTTGGGGCTTA ACCCTGAGCG TGCTTTCGATACGGGTTGACTTGAGGAAGGTAGGGGAGAATGGAATTCCTGGTGGAGCGG TGGAATGCGC AGATATCAGGAGGAACACCAGTGGCGAAGGCGGTTCTCTGGGCCTTTCCTGACGCTGAGG AGCGAAAGCG TGGGGAGCGAACAGGCTTAGATACCCTGGTAGTCCACGCTGTAAACGGTGGGTACTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tGAAGTAGAGAGCCGGCTAACagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>l5_PAr_v4_TCT (SEQ ID NO:22)

AATAAGCACCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAATCTGCGCGCGTAGGTGGTTCAGCAAGTTGGATGTGAAATCCCCGGGCTCA ACCTGGGAAC TGCATCCAAAACTACTGAGCTAGAGTACGGTAGAGGGTGGTGGAATTTCCTGTGTAGCGG TGAAATGCGT AGA A AGGAAGGAACACCAGTGGCGAAGGCGACCACCTGGACTGATACTGACACTGAGGTGCGAA AGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tCCCGGCGGTATACGTTCAAGagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>l 6_RS P _V4_TCT (SEQ ID NO:23)

AAGAAGCCCCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGGGCTAGCGTTAT TCGGAATTAC TGGGCGTAAATCTGCGCACGTAGGCGGATCGGAAAGTCAGAGGTGAAATCCCAGGGCTCA ACCCTGGAAC TGCCTTTGAAACTCCCGATCTTGAGGTCGAGAGAGGTGAGTGGAATTCCGAGTGTAGAGG TGAAATTCGT AGATATTCGGAGGAACACCAGTGGCGAAGGCGGCTCACTGGCTCGATACTGACGCTGAGG TGCGAAAGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAATGCTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tGGATGATGCGTTCGTACACAagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>17_SAU_V4_TCT (SEQ ID NO:24)

AGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGCGTTAT CCGGAATTAT TGGGCGTAAATCTGCGCGCGTAGGCGGTTTTTTAAGTCTGATGTGAAAGCCCACGGCTCA ACCGTGGAGG GTCATTGGAAACTGGAAAACTTGAGTGCAGAAGAGGAAAGTGGAATTCCATGTGTAGCGG TGAAATGCGC AGAGATATGGAGGAACACCAGTGGCGAAGGCGACTTTCTGGTCTGTAACTGACGCTGATG TGCGAAAGCG TGGGGATCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAGTGCTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tCGCCACGAGGATTAGAAATTagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>18_SE P _V4_TCT (SEQ ID NO:25)

AGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGCGTTGT CCGGAATCAT TGGGCGTAAATCTGCGCGCGTAGGCGGTTTCTTAAGTCTGATGTGAAAGCCCACGGCTCA ACCGTGGAGG GTCATTGGAAACTGGAGGACTTGAGTGCAGAAGAGGAAAGTGGAATTCCATGTGTAGCGG TGAAATGCGC AGAGATATGGAGGAACACCAGTGGCGAAGGCGACTTTCTGGTCTGTAACTGACGCTGATG TGCGAAAGCG TGGGGATCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAGTGCTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tTTTCGGTGCTAAATCACACTagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>l 9_SAg_v4_TCT (SEQ ID NO:26)

AGAAAGGGACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTCCCGAGCGTTGT CCGGATTTAT TGGGCGTAAATCTGCGAGCGCAGGCGGTTCTTTAAGTCTGAAGTTAAAGGCAGTGGCTTA ACCATTG AC GCTTTGGAAACTGGAGGACTTGAGTGCAGAAGGGGAGAGTGGAATTCCATGTGTAGCGGT GAAATGCG A GATATATGGAGGAACACCGGTGGCGAAAGCGGCTCTCTGGTCTGTAACTGACGCTGAGGC TCGAAAGCGT GGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAGTGCTAGGGA TAACAGGGTA ATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgct cttccgatct CCTGCGCATTGCAATGGCGTagatcggaagagcacacgtctgaactccagtcacAATCAG TCTCGTatct cgtatgccgtcttctgcttgTTGTCGACTC

>20_SMU_V4_TCT (SEQ ID NO:27)

AGAAAGGGACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTCCCGAGCGTTGT CCGGATTTAT TGGGCGTAAATCTGGGAGCGCAGGCGGTCAGGAAAGTCTGGAGTAAAAGGCTATGGCTCA ACCATAGTGT GCTCTGGAAACTGTCTGACTTGAGTGCAGAAGGGGAGAGTGGAATTCCATGTGTAGCGGT GAAATGCG A GATATATGGAGGAACACCAGTGGCGAAAGCGGCTCTCTGGTCTGTCACTGACGCTGAGGC TCGAAAGCGT GGGTAGCGAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAGTGCTAGGGA TAACAGGGTA ATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgct cttccgatct TCTGGCGGGCGTATCGGAGAagatcggaagagcacacgtctgaactccagtcacAATCAG TCTCGTatct cgtatgccgtcttctgcttgTTGTCGACTC

>2 l_SPn_v4_TCT (SEQ ID NO:28)

AGAAAGGGACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTCCCGAGCGTTGT CCGGATTTAT TGGGCGTAAATCTGCGAGCGCAGGCGGTTAGATAAGTCTGAAGTTAAAGGCTGTGGCTTA ACCATAGTAG GCTTTGGAAACTGTTTAACTTGAGTGCAAGAGGGGAGAGTGGAATTCCATGTGTAGCGGT GAAATGCGTA GATATATGGAGGAACACCGGTGGCGAAAGCGGCTCTCTGGCTTGTAACTGACGCTGAGGC TCGAAAGCGT GGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCTGTAAACGATGAGTGCTAGGGA TAACAGGGTA ATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgct cttccgatct ATAGTTAAATAAAGAGCCAAagatcggaagagcacacgtctgaactccagtcacAATCAG TCTCGTatct cgtatgccgtcttctgcttgTTGTCGACTC

>22_AOd_v4_TCT (SEQ ID NO:29)

AAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGCGCGAGCGTTGT CCGGAATTAT TGGGCGTAAATCTGGGCTTGTAGGCGGTTGGTCGCGTCTGCCGTGAAATCCTCTGGCTTA ACTGGGGGCG TGCGGTGGGTACGGGCTGACTTGAGTGCGGTAGGGGAGACTGGAACTCCTGGTGTAGCGG TGGAATGCGC AGATATCAGGAAGAACACCGGTGGCGAAGGCGGGTCTCTGGGCCGTTACTGACGCTGAGG AGCGAAAGCG TGGGGAGCGAACAGGATTAGATACCCTGGTAGTCCACGCTGTAAACGTTGGGCACTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tCTTCCTGGCTCCTAAATTACagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>23_LMO_V4_TCT (SEQ ID NO:30)

AGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGCGTTGT CCGGATTTAT TGGGCGTAAATCTGCGCGCGCAGGCGGTCTTTTAAGTCTGATGTGAAAGCCCCCGGCTTA ACCGGGGAGG GTCATTGGAAACTGGAAGACTGGAGTGCAGAAGAGGAGAGTGGAATTCCACGTGTAGCGG TGAAATGCGT AGATATGTGGAGGAACACCAGTGGCGAAGGCGACTCTCTGGTCTGTAACTGACGCTGAGG CGCGAAAGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAGTGCTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tTGTGTTGGCATATTTAAGTAagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>24_PAr_v4_TTT (SEQ ID NO:31)

AATAAGCACCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAATTTGCGCGCGTAGGTGGTTCAGCAAGTTGGATGTGAAATCCCCGGGCTCA ACCTGGGAAC TGCATCCAAAACTACTGAGCTAGAGTACGGTAGAGGGTGGTGGAATTTCCTGTGTAGCGG TGAAATGCGT AGA A AGGAAGGAACACCAGTGGCGAAGGCGACCACCTGGACTGATACTGACACTGAGGTGCGAA AGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tCCGAATGAAAGTACCCGAAAagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>25_PAr_v4_TCA (SEQ ID NO:32)

AATAAGCACCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAATCAGCGCGCGTAGGTGGTTCAGCAAGTTGGATGTGAAATCCCCGGGCTCA ACCTGGGAAC TGCATCCAAAACTACTGAGCTAGAGTACGGTAGAGGGTGGTGGAATTTCCTGTGTAGCGG TGAAATGCGT AGA A AGGAAGGAACACCAGTGGCGAAGGCGACCACCTGGACTGATACTGACACTGAGGTGCGAA AGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tGGTCGTGCTATCAATCCAACagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>26_PAr_v4_ccc (SEQ ID NO:33)

AATAAGCACCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAACCCGCGCGCGTAGGTGGTTCAGCAAGTTGGATGTGAAATCCCCGGGCTCA ACCTGGGAAC TGCATCCAAAACTACTGAGCTAGAGTACGGTAGAGGGTGGTGGAATTTCCTGTGTAGCGG TGAAATGCGT AGA A AGGAAGGAACACCAGTGGCGAAGGCGACCACCTGGACTGATACTGACACTGAGGTGCGAA AGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tCAGTTACTCGAAGGTATAGGagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>27_PAr_V4_GGG (SEQ ID NO:34)

AATAAGCACCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGGGGCGCGCGTAGGTGGTTCAGCAAGTTGGATGTGAAATCCCCGGGCTCA ACCTGGGAAC TGCATCCAAAACTACTGAGCTAGAGTACGGTAGAGGGTGGTGGAATTTCCTGTGTAGCGG TGAAATGCGT AGA A AGGAAGGAACACCAGTGGCGAAGGCGACCACCTGGACTGATACTGACACTGAGGTGCGAA AGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tGTGATAGCCGGGCGTTACATagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>28_PAr_v4_l (SEQ ID NO:35)

AATAAGCACCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAATGCGCGCGTAGGTGGTTCAGCAAGTTGGATGTGAAATCCCCGGGCTCAAC CTGGGAACTG CATCCAAAACTACTGAGCTAGAGTACGGTAGAGGGTGGTGGAATTTCCTGTGTAGCGGTG AAATGCGTAG A A AGGAAGGAACACCAGTGGCGAAGGCGACCACCTGGACTGATACTGACACTGAGGTGCGAA AGCGTG GGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGAT AACAGGGTAA TGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctc ttccgatctT TTGATAGTGCGCGCATAGCagatcggaagagcacacgtctgaactccagtcacAATCAGT CTCGTatctc gtatgccgtcttctgcttgTTGTCGACTC

>29_PAr_v4_2 (SEQ ID NO:36)

AATAAGCACCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAATCGCGCGCGTAGGTGGTTCAGCAAGTTGGATGTGAAATCCCCGGGCTCAA CCTGGGAACT GCATCCAAAACTACTGAGCTAGAGTACGGTAGAGGGTGGTGGAATTTCCTGTGTAGCGGT GAAATGCGTA GATATAGGAAGGAACACCAGTGGCGAAGGCGACCACCTGGACTGATACTGACACTGAGGT GCGAAAGCGT GGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGA TAACAGGGTA ATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgct cttccgatct GTTCTATGCCTTACCTAAAGagatcggaagagcacacgtctgaactccagtcacAATCAG TCTCGTatct cgtatgccgtcttctgcttgTTGTCGACTC

>30_PAr_v4_5 (SEQ ID NO:37)

AATAAGCACCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAATCTGAGCGCGCGTAGGTGGTTCAGCAAGTTGGATGTGAAATCCCCGGGCT CAACCTGGGA ACTGCATCCAAAACTACTGAGCTAGAGTACGGTAGAGGGTGGTGGAATTTCCTGTGTAGC GGTGAAATGC GTAGA A AGGAAGGAACACCAGTGGCGAAGGCGACCACCTGGACTGATACTGACACTGAGGTGCGAA AG CGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAG GGATAACAGG GTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgac gctcttccga tctGCAGCTTCTTGATGAGGCTTagatcggaagagcacacgtctgaactccagtcacAAT CAGTCTCGTa tctcgtatgccgtcttctgcttgTTGTCGACTC

>3l_PAr_v4_7 (SEQ ID NO:38)

AATAAGCACCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAATCTGACTGCGCGCGTAGGTGGTTCAGCAAGTTGGATGTGAAATCCCCGGG CTCAACCTGG GAACTGCATCCAAAACTACTGAGCTAGAGTACGGTAGAGGGTGGTGGAATTTCCTGTGTA GCGGTGAAAT GCGTAGA A AGGAAGGAACACCAGTGGCGAAGGCGACCACCTGGACTGATACTGACACTGAGGTGCGAA AGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACT AGGGATAACA GGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacg acgctcttcc gatctTTTCATTACGCTCCACTTCTagatcggaagagcacacgtctgaactccagtcacA ATCAGTCTCG TatctcgtatgccgtcttctgcttgTTGTCGACTC

>32_PAr_v4_lO (SEQ ID NO:39)

AATAAGCACCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAATCTGACTAAGGCGCGCGTAGGTGGTTCAGCAAGTTGGATGTGAAATCCCC GGGCTCAACC TGGGAACTGCATCCAAAACTACTGAGCTAGAGTACGGTAGAGGGTGGTGGAATTTCCTGT GTAGCGGTGA AATGCGTAGATATAGGAAGGAACACCAGTGGCGAAGGCGACCACCTGGACTGATACTGAC ACTGAGGTGC GAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCG ACTAGGGATA ACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctac acgacgctct tccgatctGCTCCTATTTAACCTGGACCagatcggaagagcacacgtctgaactccagtc acAATCAGTC TCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

16S rRNA Gene V4 process standard sequences lllum ina adapters in lower case

>33_GC_dG_35-40_V4 (SEQ ID NO:40)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATATGAAGGTTGAAAGCGTAAT TAGGAATTAT TGATAGTAAAGTGCACACTAGCGTTTTGTTAAATCTTAAGTGTAATCCCCGAGCTAAAAC AAGGAATAAC ATCTGATAATTACAAGATTGAAAATCGTATTTGGAGGTAGAATTCCAGGAGTAAAGGAGA AATTAATAGT GTTCTGTAATAATACAAGTATCGTATGCAGCAACTAGGTCGAAGACTGATGATCAGGTGA GAAAGTGTTG GGAGCTAACTGAATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGA TTTGGCACTGTAGGTACTagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>34_GC_dG_35-50_V4 (SEQ ID NO:41)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATATGGAGGATTCAAACATAAA TCAGAATTAC TGGGCTTAAAATAATCGCATTCTGTTTGTTAAGTAATATGTGTAATCCCCGGGCTAATCC TGGGAAATGC ATTTAATACTGGCAATCTAGAGTATAATAAAGGAGAATAGTATTTTAGTAGAAACAGAGA ATTGTTTAGA TATTTGGAGGAATAAAGTTAGCATTTGCTGCCCCATGGACGAAAAATGATGCTCATATGC AAAAGCGTGG TGTACAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGG TGCTATAGGCCCTCTTTGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>35_GC_dG_35-60_V4 (SEQ ID NO:42)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAAAATGAAGAATGAATGCTTTAA TCGGTATTAT TGGTCGTAAAACGCACGCAGGAGGATTGTTAAATTAGATGTGAAATCCACTGGCTTAACA TTAGATAAGC ATCTGATACAGGATAGCTTGATTTTCATATAAGAGGTTTGAAATCCAGATATAGCTTTGT AATTCGTAGA AATCTGGATGATTACCGGTTATGAAGGCGGTCTCATGGATGAAATCTGATGCTAAAATAC GAATGCGTGG TTATCAAATAATATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAT CACATGCTGCTGCGTCCAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>36_GC_dG_36-70_V4 (SEQ ID NO:43) AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAAAACTGATGGTGTAAACGTTTT TCGGTTTAAC TGAACATAATGTTCACGAAAGATAAATGTTATTACATTTGATATTTATCTTGACTCAACA TGGGAACAGC A ATGATACAGGAAAACTTGAGTCTCGTAGAAGGGAGTTTAATTCCAGGTTAAGCTATTATA TGATAA A GAACTGGAGAAAATCCGGTGTTGATGGCGGTTACTTGGATTTAGACTTACGTTCAGGAAC AAAATCTTGT GGTGCTAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAG GCATGGGATCATGTCAGAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>37_GC_dG_40-40_V4 (SEQ ID NO:44)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGATAGTGTTAA TCGTAATTAC TAGGCGTTAAGCACAAACAGGTGGATTGTTAAGACAGATATGATATCCCAGGGCTTAAAC AGGAAACTGC AAATGA AC AGCTAGCTTGAGACTCGAATATGGGGGTAGAATACCAGGATTAAAGATGATTTACGTAGA GA AAGGAGTATTACCGTTGTTAAAGGCGGCAACCTGAATTAATACTAACTAACAGGAAAGAA AGCGTGG TAAGAAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCA GGACCAGATCATGTGATCagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>38_GC_dG_40-50_V4 (SEQ ID NO:45)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAATTTTAA TCGGAATTAC TGGTAATAAAGCGCACGTAGTCGGTTTGTTAATTCATATTTGAATTCCTAAGTCTAAACC TAGTAACTAC ATCTGATACTGGTAAACTTGAGTCTCTTAGAGGGGGATAGAATTATAGTTGTAGCGGTGA AATTCGAAGA GTTCTGGAGTAATACCGGTAGCAAAGACGACCAACTGGACGAAGTCTGACGTTAAGATAA GAAAGTATGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCA TGCATGTAAGACGCTCCGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>39_GC_dG_40-60_V4 (SEQ ID NO:46)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACAGAGGGTGTAAGCGTTAA TAGAAATTAC TGGGCGTAAAAAGCACGCAGGCGGTTTGTAAAGTTAGATATAAATTCATTGGACTCTAAC TGAGAACTGC ATTTGATACTTTCAAGCTTTTGTCTCGTTGAGGAGGGTAGAAATTCAGGAGTTGCGATGA TATGCTTAGA GATCTTGAGGAATTCCGGTGTCGAATGCAAACTCCTGGACGAAGACTAACGTTCAGTTGC AAAAGAGTGG AAATTAAACATTATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCG GACAGTCGCTTATCTGCAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>40_GC_dG_40-70_V4 (SEQ ID NO:47)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATAAGGAATGAGCTAGCGTTAA TCGAAATTAC TGGGTGTTTTGCTCATGCAGGAGATTAGTTTATTCAGATGTAAAAAACCCGGGTACAACC TGGGAATTGC ATCTGATACTTATAAGCTAAATACTCGTAGAGGGAGGTAGTATTCCTGGTGTTGTGGTGA AATGTGTAGA GATCTATATAATTACATGTTGCGAAGGCGGACCCAAGGACGAAGACTGATGCTCAGAAAT TAAAACGTGG AAATCAAACTTGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGA TCGTAATTGCCTATGAGTagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>41_GC_dG_40-80_V4 (SEQ ID NO:48) AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATTCAGAATGTGTAAGCGTAAT TCGAAATTTC TGAGAGTAAAGCGAATGCAGATGGTTTATTTTGTTAGAAGTGAAATCCCCGGGCTATACC TGGTTACTGC ATCTGTTACTGGTAAACTTGAAACTCGAAGAGGGTGATAATATTCCAGGTATTTAGGTTA AATGTGTAGA TATCTGGATGAATACTAGTGTCTAAGGCAGTCCACTGGACGTAGACTTACTCTCAGGTTC GAAAGCGTGG GGAACATTCATAATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCG AAGGTCGCGGCGGATATGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>42_GC_dG_ 5-50_V4 (SEQ ID NO:49)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATAAGGAGGGTGCTTGCGTTAA ACAGAAA AC TGGGTGTAAAACGTACGTAGACGGTTTATTAAGTAAGATGTGAAATCCCCGTGCTCAACC TAGGAACTGC ATATGATATTGGAAAACTTGAGACATATAGAGGGAGATAGAATACTTGGAGTAGCGTTGT AATGCGTATA GATTTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACAGACGCTCAGGTGC GAAATCGTTG TGATCAAATAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGC CCGCAGCTTCGCTCTAAGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>43_GC_dG_45-60_V4 (SEQ ID NO:50)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGAAGTGTGAAAACGTTAT TAAGAATTAC TGGGCGTAAAGCGTTCGCAGGCGGTTTGTTAAGTCATAAGTGAAATCCCCGGGCACAAAC TGGGAACTGA ATCTGTAACTGACAAGCTTGAGTATCTTATAATGGGATAGAATTTAATGTGTAGCTGTGA AATGCGTAGA GATCTGGAGATATACCGGTGACTAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC TAAATCGTGG GGAACAAAAAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAA TGATTTAAAGTCAAGAGAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>44_GC_dG_45-70_V4 (SEQ ID NO:51)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAAAACTGAGGGTGCAAGCGTTAA TCGGAATTAC TGTGCATAAAGCTCACGCATTCGTTTTGTTAAGTCAGATAATAAATCCTCGAGTTAAACC TGGGAAATGC ATCTGATACTGAAAAGCTTGATTCTCGTAGAGGGGTGTAGAATTCCAGGTGTAGCAGTAA AATACGTAGA GATCAGAATGAATTCCGGTGGTGAAGTCGGCCTACTGGACGAAGACTGACGCTAAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAC GTAGGTGATCGGTACCACagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>45_GC_dG_45-80_V4 (SEQ ID NO:52)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGTGTTAAATCGTTAA TAGGAATTAC TGGGCGAAAAGTGCTAGCAGGCGGTTTGTTAAATCAGTTGTGAAATCCCTGGGCACAACC TGGGATCTGC ATCTGATTTTGGCAAGCTTTAGTCTATTAGAGGGGGGTAAATTTCCATGTGTAGATTTGA AATGCGTTTA GATCTGGAGGAATACCGGAGGTGAAGGCGATCCCCTGGACGTAGACTGAAGCTCAAGTGA GAAAGCTTGG AGTGCAAACTAGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAG AACACCACTGGTGACCCAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>46_GC_dG_45-90_V4 (SEQ ID NO:53) AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGATAA TAGTATTTAC TGTGCGAAAAGCATACTTAGGAAGATTTTTATGTCAGATGTGAAATCCCCGGGCTTAACC TGGGAACTGC ATCTGATACTGACAAGTTTGAGACTCG A AGGGGGGTAGAATTCCAGGTGTTGCAGTGAAAAGTGTAGA GATCTGGAAGAATACCGGTGGCGAAGGTTGCCCCCTGTACGAATAATGACGCTATGGTGC GAAAGCATTG TGTGCAAACAAGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAC AGAGGGCAATGACGTACAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>47_GC_dG_50-60_V4 (SEQ ID NO:54)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAAGGTGCAAGTGTTAA TCGGAATTAC TGGGCATAAAGCGCACGAAGGCGGTATGTTAAGTTAGATGTGAAATCCCCGGGCTCAATC TGTGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCAAGGTGTAGCGGTGA AATGCGTAGA GATCTAGATGAATACCGGTGGCGAAGGAGGTCCCCTGGACGAAGACTGACACTCTGGTGC GAAATAGTGG GGAGCAAACAGAATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTC ATTGACAGGTTGGGTTAGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>48_GC_dG_50-70_V4 (SEQ ID NO:55)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAATCGTTAA TCGGATTTAC TGGGCTTAAATCGCACGCAGGCGGTTTGTTAAGTCATATGTGAAAACCCCGGGCTCAACC TGGGAACTGC ATCTGATACTTGCAAGCTTGAGTCTCGTATAGGGAGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGAAGTAATACCGGTAGCTAATACGGCCCACTGGACGAAGACTGACGCACAGGTGC TAAAGCGTGT GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAC ACTGTATGGACCGGTCACagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>49_GC_dG_50-80_V4 (SEQ ID NO:56)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGTGTGCAAGCGTTAA TCGGAATTAC TGGTCGTAAAGCGCACGTAGACGGTTTGTTAATTCAGATGTTAAATTCCAGGGCAAAACC TGGGAACTGC ATCTTATACTGGCAAGCTTGAGTCTCGTAGAGGGGGTTAGAATTCCAGGTGTAGCGGTGA AATGTGTAAA GATCTGGAGGAATACCGGTGTTGAAGGCGGCCTCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCG GCCTGGCTCTTATGTAGAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>50_GC_dG_50-90_V4 (SEQ ID NO:57)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGAACAAGCGTTAA TCGAAATAAC TGGGCGTAAAGCGCTCGTAGGTTGTTTGTTAAGTCAGATGTGAAATCCCCGGGTTCAACC TGGGAACTGC ATCTGATTATGGCAAGCTTGAGTCTCGTAGAGGGGGGTATAATTCCAGTTGAAGCGGTAA AATGCGTTGA GATCTGGAGGTATACCGGTGGCGAAAGCGGCCCCATGGACGAAGACTGACGCTCATTTGC GAAATCGTTG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAA TAGGCCCGCTCATCCCGGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>51_GC_dG_49-100_V4 (SEQ ID NO:58) AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATTCTTCGGGTACTAGCGTTAA TTAGGATTTC TGCGCGTAAAGTGAACGCAGGCTTGTTGGTCAGTGAGATGTGTATTACAGGTACTTAACC TGTGAACCGC ATCTGATACTCGCAAGCCTGAGGCTCCTAGTGGGGGGTAGAAATCTATGTGTATCGTTGG AACCCG AAA CATCTGTAGGATGGCATGTGTCCAAGGCAGCCCCCTGGTCTGAGACTGACAATCAGTTTC GAAAGCGTGG GGAGCAAACAGCATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCT TAGAGACACTCTTACCGGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>52_GC_dG_55-70_V4 (SEQ ID NO:59)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAAGACGGAGGGTGCAAGCGTTAA TCGGAATAAC TGGGCGTAAAGCGGTCGCAGGCGGTTTGTTAAGTCAGATGTGAATTCCCCGGGCTCAACG TGGGAACTTC ACCTAATACGGGCAAGCTTTAGAATCGTAGAGGGGGGTAGAAT A AGG A AGCGGTGCAATGCGAAGA GAGCTGGAGGAATCCCGGTGGAGAAGGCAGCCCCCTGGACGAAGACAGAAGCTCAGGGGC GAAACCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTA TCGGGCGCTCTCCGGTACagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>53_GC_dG_55-80_V4 (SEQ ID NO:60)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTTA TCGGAATTAC TGGGCGTAAAGCGCACGTAGGCGGTTTGTTAAGTCAGAAGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCTAGCATGAGTATCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGT AATGCGTAGA GATCTAGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGT GAAAGCGAGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTA CCTATCGCAGCGCGTATAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>54_GC_dG_55-90_V4 (SEQ ID NO:61)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACTGAGGGTGCAAGCGTAAA TCGGAATTAC TGGGCGTAAAGCGCAAGCAGGCAGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCAGATACTGGCAAGCTTGAGTCTCGTTGAGGGGGGTAGAATTCCATGTGTAGCGGTGA ATTGCGTAGA GAACTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGTGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCC CGCGTACAGGATTGTTCGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>55_GC_dG_55-100_V4 (SEQ ID NO:62)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACTGAGGGTGCAAGCGTTAG TCGGAATTAC TGGGCGTAAAGCGCACTCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGGACTGC ATCTGATACTGGCAAGCCTGAGTCTCGTAGTGGGGGGTATAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCAGGAAGAAGTCCAGTTGTGAAGGCGGCCCCCTGGACGAAGACTGAGGCTCAGGTGC GAAAGCGTGG GGAGCAAACAAGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGA AGCAAAGGGAGAGCGCGGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>56_GC_dG_55-110_V4 (SEQ ID NO:63) AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACCGAGGGTGCAAGAGTTAA GCGGAATTGC TGGCCGTAAAGCGCACACAGGCGCTTTGTCAAGTTAGATGCGAAATCCCCAGGTTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTAACTCTCGTAGAGGGGGTTACAATTCCAGGTGGAGCGCTGA AATGCGTAGA CATCTGGAGGAATACCGGTGGCGAAGGCGACCCCCTGGACGAAGACTCCCGCTTAGGTTC GCAAGCGGGG GGAGCAAACAGAATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAT AGTCGTTTGCGCACCCGCagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>57_GC_dG_60-80_V4 (SEQ ID NO:64)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGGGCAAGCGTTGA TCGGAATTAC CCGGCGTGAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACG TGGGAACTGC ATGTGATACTGGCACGCTTGAGTCTCGCAGAGGGGGGGAGAATTGCAGGGGTAGCGGTGA AAGGCGTAGA GATCTGGAGGAATACCGGTGGCGAGGGCGGCCCCCTGGAGGAAGACTGACGCTCAGGTGC GAAAGCGTGG CGAGCAAACACGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCC TTTCCTGCTCCCGCCTGGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>58_GC_dG_60-90_V4 (SEQ ID NO:65)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATGAC TGGGCGTAAAGCGCACGCAGGCGCTGTGTTAAGTCAGATGTGGAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGGTTGAGTCTGGTGGAGGGGGGGAGAATCCCAGGTGTGGCGGTGA AATGCGGAGA GAGCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGGAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCA GATTCGTGTCCTCCACATagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>59_GC_dG_60-100_V4 (SEQ ID NO:66)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGGTAA TCGGAATTAC TGGGCGTAAACCGCACGCAGGCGGTCTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC GGGGAACTGC ATCTGATACTGGCAAGCTTGAGCCTCGTAGAGGGGGGTAGAAGTCCGGGTGTAGCGGTGA ACTGCGTAGA CATCTGGAGGAATACCGGGGGCGAAGGCGGCCCCCTGGACGAAGACTGACGGGCAGGTGC GACAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAC CTTCTAAACGTGCGAAGCagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>60_GC_dG_60-110_V4 (SEQ ID NO:67)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATCCGGAGGGTGCGAGCGTTAA TCGGAATTAC TGCGCGTAAAGCGCACGCAGGCGGTTTCTTAAGTCAGCTGTGAAATCCCCGGGCTCACCC CGGGAACTGC ATCTGATACTCGCAACCTTGAGTCTCGTAGAGGGGGCCAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCCGGAGGAATACCGGGGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGCGG GGAGCAAACCGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGG CTTCGCTGTGCCTATGACagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>61_GC_dG_60-120_V4 (SEQ ID NO: 68) AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATCCGGAGCGTGCAAGCGTCAA TCCGCATTAG TGGGCGTAAAGCGCACGCAGGCGGGTTGTTAAGTCAGATGTGAAATCCCGGGGCTCAACC TGGGAACTGC ATCTGAGACTGGCAAGCTTGAGTCTCGTACAGGGGGGTAGAATTCCAGGTCTGGCGCTGA AATGCGTAGA GATCTGGAGGCAGACCGGTCGCGAAGGCGGCCCCCTGCACGACGAGTGACCCTCAGGCGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAA ATAAGACCGCATAGTTATagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>62_GC_dG_65-90_V4 (SEQ ID NO:69)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGCAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGGAAGGCGCACGCACGCGGCCTGTCAAGTGAGATGTGACATCCCCGGGCTCAACC TCGGAACTGC GTCTGATACTGGCACGCTTGCGTCGCGTACAGGGGGCGAGAATTCCAGGGGGAGGGGTGA AATGCGTGGC GATCCGGAGGAATACCGGTGGCGAAGGCGGCCCCCGGGACGAAGACGGCCGCTCAGGGGC CAAAGCGTGG GGGGCAGACACGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCG AACTACAGGGACGAGAGTagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>63_GC_dG_65-100_V4 (SEQ ID NO:70)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGGGGGTGCAAGCGTGGA TCGGAATTGC TGGGCGTCAAGGGCACGCAGGCGGTTTGTGAAGTCAGACGCGAGAGCCCCGGGCTCCACC GGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGACGGGGGCAGAATCCCGGGTGTGGCGGGGA AATGCGTAGA GATCTGGAGGGATCCCGGTGGCGAAGGCGGCCCCCTGGACGGAGACTGACGCTCAGGTGC GGAAGCGGGG GGACCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAT CGGTTCTTGGTCCGCTTAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>64_GC_dG_65-110_V4 (SEQ ID NO:71)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGCAGGGTGCCAGCGTTAA TCGGAATTAC TGGGCGTGAAGCGCACGCGGGCGGTTTGTTAAGTCAGACGTGAAATCCCCGCGCTCAACC TGGGAGCCGC ACCTGAGAGTGGCGAGCTTGGGTGTCGTAGAGGGGGGTAGACTTCCAGGTGTAGCGGTGA AATGCGCAGC GCTCTGGCGGGATACCGCTGGCGAAGGCGGCCCCCCGGGCGAAGCCTGCCGCTCAGGGGC GAAAGCGTGG GGAGCACACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCC GCCTGATGTCACGGCCTTagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>65_GC_dG_65-120_V4 (SEQ ID NO:72)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGCGTGCAAGCGCTAA CCGGAATTAC TGCGCGTCAAGCGGACGCAGGCGGTTCGTTAAGTCAGGTGTGAGATCCCCGGGGTCAACC GGGGACCTGC ATCTGACACCGGCACGCTTGAGTCCCGTAGGGGCGGGTAGAATCCCCGGTGTAGCGGTGC GAGCCGTAGC GATCCGGAGGAATACCGGTGGCGACGGCCGCCCCCTGGACGAAGGCTGACGCTGAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCG TCACTTAAACACTTCAATagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>66_GC_dG_65-130_V4 (SEQ ID NO:73) AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACCGCCGGTGCACGCGTTAA TCGGAATTAC TGGGGGTACAGCGCGCGCGGGCGGTTTGTTAGCTCCGGTGTGAAGTCCCGGGGCTCAACC TGGGAACTCC AGCGGACACTGGCAAGCCTGAGTCTCGTCCCGGGGGGGAGAGTTCCAGCTGTAGCGGTGA CGTCCCTGGA GATCTCGGGGAATACGGGTGGCCAAGGCCGCCCCCTCGAGGAGGAGTCACGCTGAGGCGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTC CTGCCAGCGTCGGCAGACagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>67_GC_dG_70-100_V4 (SEQ ID NO:74)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATGCGGGGGGTGGAAGCGGCGA GCGGACTGGC GGGGCGCCAAGCGCGCGCCGGCGGCTTGCTAGGTCAGATGTGAGGTGCCCGGCCTCAACC TGGGAACTGC AGGTGATACTGGGCAGCCGGAGTCGGGTAGACGGGGGTACAATGCCAGGTGTAGCGGGGC AACGGGTAGC GATGTGGGGGAATACCGGTGGCGAACGGGGCCCCCCGGACGAAGGCTGGCGCTCGGGTGC CACAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTC GAAGGATTTATAACGATGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>68_GC_dG_70-110_V4 (SEQ ID NO: 75)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATGCGGAGGGTGCACGGGGTAA CGGGAATGCC TGGGCGCCCAGCCGACGGAGCCGGTTCGGTAAGTCAGAGGTGAACGCGCCGGGCTCAACC CGCGAACTGC CGCTGATACCGGGGCGCTTCCGTCTCGTAGAGGGGGGTCGAATTCCAGGTGTGGCGCTGA AGTCCCGAGA GCTCTGGAGGAAGCGCGGTGGCGAGGGCGCCCGCCCGGACCAAGACTGGCGGCCAGGTGC GAAAGCGCGG GGAGCGAACGGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTG CTGTCTGCGATCCGGAACagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>69_GC_dG_70-120_V4 (SEQ ID NO: 76)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAACACGGAGGGCGCAAGCGTTAA TCGGAACCAC TGGCCCTAAAGCGGCCCCAGGCGGGTCGTGAGGTCAGATGTGAAACCGCCGGGGTCAACC GGGGGGGGGC GGCTGACACTGGCGAGCCTGGGTCTCGTACACGGGGGCAGACCTCCAGGTGTCCCGCTGA GGCGCGTGGA GATCCGGAGGAGTACCGGTGGGGACGCCGGCCCCCTCGAGGCAGACTGACGCGCAGGTGC GAAAGCGCGG GGAGCAAACGGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGT CTGATCGTTATATGCCGTagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>70_GC_dG_70-130_V4 (SEQ ID NO: 77)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGCGGGTGCCAGCCTTAA TCGGAATGCC CGGGCGCACAGCGGACGCGGGCGGTTCGTTAAGCCGCAGGCGAGATCCCCGGGCCCAACC TCGGCACGGC GTCTGACACTGGCGAGGTTGAGTCTCGGAGAGGGGGGTAGGATTCCAGGTCCACCGGTGG AACCCCTAGA GCTCTGGGGGACTACCGGTGGCCCAGGCCGCGGCCTGGACGAACGCTGGCGCTCAGGTCC GCAAGCCTGC GGCGCACACGGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCA TATCGCATCCGCAGAAATagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>71_GC_dG_70-140_V4 (SEQ ID NO: 78) AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACCGCGGGTGCAAGCGTTAA TGCGGCTTAC TGGGCGTAAAGCGGACCCCGGCGGTTTGTGAGGTCACATGTGAAGCCCCCGCCCTCCGCC TGGGAACTGC GTCTGATACTGGCGGGCTCGGGGCCCGTACAGGGGGGTAGAATCCCAGGTGGAGGGCGGA ACCGGGTGCC GAGCTGCAGGAAGGCCGGCGGCGAAGCCGGCCCCCCGGGCGGAGACTGACGCCCAGGGGC GCGACCGTGG GGAGCAAGCAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAT TCCCTTCTACATGAGTGGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>72_Size_233_V4 (SEQ ID NO:79)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACTGAGGGTGCAAGCGTAAA GCGCAAGCAG GCAGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACCTGGGAACTGCATCAGATACT GGCAAGCTTG AGTCTCGTTGAGGGGGGTAGAATTCCATGTGTAGCGGTGAATTGCGTAGAGAACTGGAGG AATACCGGTG GCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGCGAAAGCGTGGGGTGCAAACA GGATTAGA A CCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATAACAGGGTAATGAGTCGACAA aatgatacgg cgaccaccgagatctacactctttccctacacgacgctcttccgatctCAAAGACTAACG AATCCGCAag atcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtct tctgcttgTT GTCGACTC

>73_Size_213_V4 (SEQ ID NO:80)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACTGAGGGTGCAAGCGTAAA AAGTCAGATG TGAAATCCCCGGGCTCAACCTGGGAACTGCATCAGATACTGGCAAGCTTGAGTCTCGTTG AGGGGGGTAG AATTCCATGTGTAGCGGTGAATTGCGTAGAGAACTGGAGGAATACCGGTGGCGAAGGCGG CCCCCTGGAC GAAGACTGACGCTCAGGTGCGAAAGCGTGGGGTGCAAACAGGATTAGATACCCTGGTAGT CCACGCCGTA AACGATGTCGACTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccga gatctacact ctttccctacacgacgctcttccgatctGTCTGAGGTGATAAGGGCATagatcggaagag cacacgtctg aactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>74_Size_193_V4 (SEQ ID NO:81)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATAAGTCAGATGTGAAATCCCC GGGCTCAACC TGGGAACTGCATCAGATACTGGCAAGCTTGAGTCTCGTTGAGGGGGGTAGAATTCCATGT GTAGCGGTGA ATTGCGTAGAGAACTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGAC GCTCAGGTGC GAAAGCGTGGGGTGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCG ACTAGGGATA ACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctac acgacgctct tccgatctTTGATGGTGAGTGGTGAATGagatcggaagagcacacgtctgaactccagtc acAATCAGTC TCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>75_Size_273_V4 (SEQ ID NO:82)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACTGAGGGTGCAAGCGTAAA TCGGAATTAC TGGGCGTAAAGTAGGGATGGCGATGCATTGGCGCAAGCAGGCAGTTTGTTAAGTCAGATG TGAAATCCCC GGGCTCAACCTGGGAACTGCATCAGATACTGGCAAGCTTGAGTCTCGTTGAGGGGGGTAG AATTCCATGT GTAGCGGTGAATTGCGTAGAGAACTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGAC GAAGACTGAC GCTCAGGTGCGAAAGCGTGGGGTGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTA AACGATGTCG ACTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacact ctttccctac acgacgctcttccgatctCGGTATACTACCTTACCAGAagatcggaagagcacacgtctg aactccagtc acAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>76_Size_293_V4 (SEQ ID NO:83)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACTGAGGGTGCAAGCGTAAA TCGGAATTAC TGGGCGTAAAGTAGGGATGGCGATGCATTGTCCTGTGTCGCCAACCTTGAGCGCAAGCAG GCAGTTTGTT AAGTCAGATGTGAAATCCCCGGGCTCAACCTGGGAACTGCATCAGATACTGGCAAGCTTG AGTCTCGTTG AGGGGGGTAGAATTCCATGTGTAGCGGTGAATTGCGTAGAGAACTGGAGGAATACCGGTG GCGAAGGCGG CCCCCTGGACGAAGACTGACGCTCAGGTGCGAAAGCGTGGGGTGCAAACAGGATTAGATA CCCTGGTAGT CCACGCCGTAAACGATGTCGACTAGGGATAACAGGGTAATGAGTCGACAAaatgatacgg cgaccaccga gatctacactctttccctacacgacgctcttccgatctACATATCACAACCAGGCCTCag atcggaagag cacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTT GTCGACTC

>77_Size_313_V4 (SEQ ID NO:84)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACTGAGGGTGCAAGCGTAAA TCGGAATTAC TGGGCGTAAAGTAGGGATGGCGATGCATTGTCCTGTGTCGCCAACCTTGACTGTACCGCT GTGATAACGC GCGCAAGCAGGCAGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACCTGGGAACTGC ATCAGATACT GGCAAGCTTGAGTCTCGTTGAGGGGGGTAGAATTCCATGTGTAGCGGTGAATTGCGTAGA GAACTGGAGG AATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGCGAAAGCGTGG GGTGCAAACA GGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATAACAGGGTAAT GAGTCGACAA

aatgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatc tCCCGTAGACTGG GTCGACGTagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcg tatgccgtct tctgcttgTTGTCGACTC

Synthetic standard sequences - Full-length 16S rRNA gene tests (HMP mock community)

Sequence tags in bold

lllumina adapters in lower case

>78_ABa_Full_TAG_TCT_CAT (SEQ ID NO:85)

AGAGTTTGATCATGGCTCAGATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAGC GGGGGAAGGT AGCTTGCTACTGGACCTAGCGGCGGACGGGTGAGTAATGCTTAGGAATCTGCCTATTAGT GGGGGACAAC ATCTCGAAAGGGATGCTAATACCGCATACGTCCTACGGGAGAAAGCAGGGGATCTTCGGA CCTTGCGC A A AGATGAGCCTAAGTCGGATTAGCTAGTTGGTGGGGTAAAGGCCTACCAAGGCGACGATCT GTAGCGGG TCTGAGAGGATGATCCGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCA GCAGTGGGGA ATATTGGACAATTAGGGGGGGAACCCTGATCCAGCCATGCCGCGTGTGTGAAGAAGGCCT TATGGTTGTA AAGCACTTTAAGCGAGGAGGAGGCTACTTTAGTTAATACCTAGAGATAGTGGACGTTACT CGCAGAATAA GCACCGGCTAACTCTGTGCCAGCAGCCGCGGTAATACAGAGGGTGCGAGCGTTAATCGGA TTTACTGGGC GTAAATCTGCGTGCGTAGGCGGCTTATTAAGTCGGATGTGAAATCCCCGAGCTTAACTTG GGAATTGCAT TCGATACTGGTGAGCTAGAGTATGGGAGAGGATGGTAGAATTCCAGGTGTAGCGGTGAAA TGCGTAGAGA TCTGGAGGAATACCGATGGCGAAGGCAGCCATCTGGCCTAATACTGACGCTGAGGTACGA AAGCATGGGG AGCAAACAGGATTAGATACCCTGGTAGTCCATGCCGTAAACGATGTCTACTAGCCGTTGG GGCCTTTGAG GCTTTAGTGGCGCAGCTAACGCGATAAGTAGACCGCCTGGCATGGAGTACGGTCGCAAGA CTAAAACTCA AATGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGATGCAACGCG AAGAACCT A CCTGGCCTTGACATACTAGAAACTTTCCAGAGATGGATTGGTGCCTTCGGGAATCTAGAT ACAGGTGCTG CATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACC CTTTTCCTTA CTTGCCAGCATTTCGGATGGGAACTTTAAGGATACTGCCAGTGACAAACTGGAGGAAGGC GGGGACGACG TCAAGTCATCATGGCCCTTACGGCCAGGGCTACACACGTGCTACAATGGTCGGTACAAAG GGTTGCTACA CAGCGATGTGATGCTAATCTCAAAAAGCCGATCGTAGTCCGGATTGGAGTCTGCAACTCG ACTCCATGAA GTCGGAATCGCTAGTAATCGCGGATCAGAATGCCGCGGTGAATACGTTCCCGGGCCTTGT ACACACCGCC CGTCACACCATGGGAGTTTGTTGCACCAGAAGTAGCTAGCCTAACTGCAAAGAGGGCGGT TACCACGGTG TGGCCGATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTAGGGGAACCTGCGGCTGGATC ACCTCCTTTA GGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactcttt ccctacacga cgctcttccgatctGCTTGTCTGTTCAAGCATCCagatcggaagagcacacgtctgaact ccagtcacAA TCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC >79_BCe_Full_TAG_TCT_CAT (SEQ ID NO:86)

GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAATGGATTAAGAGCTTGCT CTTATGAAGT TAGCGGCGGACGGGTGAGTAACACGTGGGTAACCTGCCCATAAGACTGGGATAACTCCGG GAAACCGGGG CTAATACCGGATAACATTTTGAACCGCATGGTTCGAAATTGAAAGGCGGCTTCGGCTGTC ACTTATGGAT GGACCCGCGTCGCATTAGCTAGTTGGTGAGGTAACGGCTCACCAAGGCAACGATGCGTAG CCGACCTGAG AGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTA GGGAATCTTC CGCAATTAGGGACGAAAGTCTGACGGAGCAACGCCGCGTGAGTGATGAAGGCTTTCGGGT CGTAAAACTC TGTTGTTAGGGAAGAACAAGTGCTAGTTGAATAAGCTGGCACCTTGACGGTACCTAACCA GAAAGCCACG GCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGCGTTATCCGGAATTATT GGGCGTAAAT CTGCGCGCGCAGGTGGTTTCTTAAGTCTGATGTGAAAGCCCACGGCTCAACCGTGGAGGG TCATTGGAAA CTGGGAGACTTGAGTGCAGAAGAGGAAAGTGGAATTCCATGTGTAGCGGTGAAATGCGTA GAGATATGGA GGAACACCAGTGGCGAAGGCGACTTTCTGGTCTGTAACTGACACTGAGGCGCGAAAGCGT GGGGAGCAAA CAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAGTGCTAAGTGTTAGAGGGTT TCCGCCCTTT AGTGCTGAAGTTAACGCATTAAGCACTCCGCCTGGCATGGAGTACGGCCGCAAGGCTGAA ACTCAAAGGA ATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGAAGCAACGCGAAGAA CCTTACCAGG TCTTGACATCCTCTGACAACCCTAGAGATAGGGCTTCTCCTTCGGGAGCAGAGTGACAGG TGGTGCATGG TTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGA TCTTAGTTGC CATCATTCAGTTGGGCACTCTAAGGTGACTGCCGGTGACAAACCGGAGGAAGGTGGGGAT GACGTCAAAT CATCATGCCCCTTATGACCTGGGCTACACACGTGCTACAATGGACGGTACAAAGAGCTGC AAGACCGCGA GGTGGAGCTAATCTCATAAAACCGTTCTCAGTTCGGATTGTAGGCTGCAACTCGCCTACA TGAAGCTGGA ATCGCTAGTAATCGCGGATCAGCATGCCGCGGTGAATACGTTCCCGGGCCTTGTACACAC CGCCCGTCAC ACCACGAGAGTTTGTAACACCCGAAGTCGGTGGGGTAACCTTTTGGAGCCAGCCGCCTAA GGTGGGACAG ATGATTGGGGTGAAGTCGTAACAAGGTAGCCGTATCGGAAGGTGCGGTAGGGATAACAGG GTAATGAGTC GACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctcttccga tctTCTCCCT GTGATTAATGAACagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTa tctcgtatgc cgtcttctgcttgTTGTCGACTC

>80_BVu_Full_TAG_TCT_CAT (SEQ ID NO:87)

TATTACAATGAAGAGTTTGATCCTGGCTCAGGATGAACGCTAGCTACAGGCTTAACACAT GCAAGTCGAG GGGCAGCATGGTCTTAGCTTGCTAAGGCCGATGGCGACCGGCGCACGGGTGAGTAACACG TATCCAACCT GCCGTCTACTCTTGGACAGCCTTCTGAAAGGAAGATTAATACAAGATGGCATCATGAGTC CGCATGTTCA CATGATTAAAGGTATTCCGGTAGACGATGGGGATGCGTTCCATTAGATAGTAGGCGGGGT AACGGCCCAC CTAGTCTTCGATGGATAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACG GTCCAAACTC CTACGGGAGGCAGCAGTGAGGAATATTGGTCAA TAGGGGCGAGAGCCTGAACCAGCCAAGTAGCGTGAA GGATGACTGCCCTATGGGTTGTAAACTTCTTTTATAAAGGAATAAAGTCGGGTATGGATA CCCGTTTGCA TGTACTTTATGAATAAGGATCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAT CCGAGCGT A TCCGGATTTATTGGGTTTAAATCTGGGAGCGTAGATGGATGTTTAAGTCAGTTGTGAAAG TTTGCGGCTC AACCGTAAAATTGCAGTTGATACTGGATATCTTGAGTGCAGTTGAGGCAGGCGGAATTCG TGGTGTAGCG GTGAAATGCTTAGATATCACGAAGAACTCCGATTGCGAAGGCAGCCTGCTAAGCTGCAAC TGACATTGAG GCTCGAAAGTGTGGGTATCAAACAGGATTAGATACCCTGGTAGTCCACACGGTAAACGAT GAATACTCGC TGTTTGCGA A ACGGCAAGCGGCCAAGCGAAAGCGTTAAGTATTCCACCTGGCATGGAGTACGCCGGCA ACGGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGAGGAACATGTGGTTTAAT TCGATGA AC GCGAGGAACCTTACCCGGGCTTAAATTGCAGATGAATTACGGTGAAAGCCGTAAGCCGCA AGGCATCTGT GAAGGTGCTGCATGGTTGTCGTCAGCTCGTGCCGTGAGGTGTCGGCTTAAGTGCCATAAC GAGCGCAACC CTTGTTGTCAGTTACTAACAGGTCATGCTGAGGACTCTGACAAGACTGCCATCGTAAGAT GTGAGGAAGG TGGGGATGACGTCAAATCAGCACGGCCCTTACGTCCGGGGCTACACACGTGTTACAATGG GGGGTACAGA GGGCCGCTACCACGCGAGTGGATGCCAATCCCCAAAACCTCTCTCAGTTCGGACTGGAGT CTGCAACCCG ACTCCACGAAGCTGGATTCGCTAGTAATCGCGCATCAGCCACGGCGCGGTGAATACGTTC CCGGGCCTTG TACACACCGCCCGTCAAGCCATGGGAGCCGGGGGTACCTGAAGTGCGTAACCGCGAGGAG CGCCCTAGGG TAAAACTGGTGACTGGGGCTAAGTCGTAACAAGGTAGCCGTACCGGAAGTAGGGATAACA GGGTAATGAG TCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctcttcc gatctTAGAG CCATAGACTGCTGTCagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCG Tatctcgtat gccgtcttctgcttgTTGTCGACTC

>81_CBe_Full_TAG_TCT_CAT (SEQ ID NO:88)

AGAGTTTGATCCTGGCTCAGGACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAGC GATGAAGCTC CTTCGGGAGTGGATTAGCGGCGGACGGGTGAGTAACACGTGGGTAACCTGCCTCATAGAG GGGAATAGCC TTTCGAAAGGAAGATTAATACCGCATAAGATTGTAGTGCCGCATGGCATAGCAATTAAAG GAGTAATCCG CTATGAGATGGACCCGCGTCGCATTAGCTAGTTGGTGAGGTAACGGCTCACCAAGGCGAC GATGCGTAGC CGACCTGAGAGGGTGATCGGCCACATTGGGACTGAGACACGGCCCAGACTCCTACGGGAG GCAGCAGTGG GGAATATTGCACAATTAGGGGGGAAACCCTGATGCAGCAACGCCGCGTGAGTGATGACGG TCTTCGGATT GTAAAGCTCTGTCTTCAGGGACGATAATGACGGTACCTGAGGAGGAAGCCACGGCTAACT ACGTGCCAGC AGCCGCGGTAATACGTAGGTGGCAAGCGTTGTCCGGATTTACTGGGCGTAAATCTGGGAG CGTAGGTGGA TATTTAAGTGGGATGTGAAATACTCGGGCTTAACCTGGGTGCTGCATTCCAAACTGGATA TCTAGAGTGC AGGAGAGGAAAGTAGAATTCCTAGTGTAGCGGTGAAATGCGTAGAGATTAGGAAGAATAC CAGTGGCGAA GGCGACTTTCTGGACTGTAACTGACACTGAGGCTCGAAAGCGTGGGGAGCAAACAGGATT AGATACCCTG GTAGTCCACGCCGTAAACGATGAATACTAGGTGTAGGGGTTGTCATGACCTCTGTGCCGC CGCTAACGCA TTAAGTATTCCGCCTGGCATGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATTGACGG GGGCCCGCAC AAGCAGCGGAGCATGTGGTTTAATTCGAAGCAACGCGAAGAACCTTACCTAGACTTGACA TCTCCTGAAT TACCCTTAATCGGGGAAGCCCTTCGGGGCAGGAAGACAGGTGGTGCATGGTTGTCGTCAG CTCGTGTCGT GAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTATTGTTAGTTGCTACCATTTAG TTGAGCACTC TAGCGAGACTGCCCGGGTTAACCGGGAGGAAGGTGGGGATGACGTCAAATCATCATGCCC CTTATGTC A GGGCTACACACGTGCTACAATGGCTGGTACAGAGAGATGCTAAACCGCGAGGTGGAGCCA AACTTTAAAA CCAGTCTCAGTTCGGATTGTAGGCTGAAACTCGCCTACATGAAGCTGGAGTTGCTAGTAA TCGCGAATCA GAATGTCGCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGAGAGT TGGCAATACC CAAAGTTCGTGAGCTAACGCGTAAGCGGGGCAGCGACCTAAGGTAGGGTCAGCGATTGGG GTGAAGTCGT AACAAGGTAGCCGTAGGAGAACCTGCGGCTGGATCACCTCCTTTTAGGGATAACAGGGTA ATGAGTCGAC AAaatgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatct TAAAGATTAT TTGCAGCCACagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatct cgtatgccgt cttctgcttgTTGTCGACTC

>82_DRa_Full_TAG_TCT_CAT (SEQ ID NO:89)

TTTATGGAGAGTTTGATCCTGGCTCAGGGTGAACGCTGGCGGCGTGCTTAAGACATGCAA GTCGAACGCG GTCTTCGGACCGAGTGGCGCACGGGTGAGTAACACGTAACTGACCTACCCAGAAGTCACG AATAACTGGC CGAAAGGTCCGCTAATACGTGATGTGGTGATGCACCGTGGTGCATCACTAAAGATTTATC GCTTCTGGAT GGGGTTGCGTTCCATCAGCTGGTTGGTGGGGTAAAGGCCTACCAAGGCGACGACGGATAG CCGGCCTGAG AGGGTGGCCGGCCACAGGGGCACTGAGACACGGGTCCCACTCCTACGGGAGGCAGCAGTT AGGAATCTTC CACAATTAGGGGCGCAAGCCTGATGGAGCGACGCCGCGTGAGGGATGAAGGTTTTCGGAT CGTAAACCTC TGAATCTGGGACGAAAGAGCCTTAGGGCAGATGACGGTACCAGAGTAATAGCACCGGCTA ACTCCGTGCC AGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTACCCGGAATCACTGGGCGTAAATCTGG GCGTGTAGGC GGAAATTTAAGTCTGGTTTTAAAGACCGGGGCTCAACCTCGGGGATGGACTGGATACTGG ATTTCTTGAC CTCTGGAGAGG AACTGGAATTCCTGGTGTAGCGG GGAATGCG AGA ACCAGGAGGAACACCAATGGC GAAGGCAAGTTACTGGACAGAAGGTGACGCTGAGGCGCGAAAGTGTGGGGAGCAAACCGG ATTAGATACC CGGGTAGTCCACACCCTAAACGATGTACGTTGGCTAAGCGCAGGATGCTGTGCTTGGCGA AGCTAACGCG ATAAACGTACCGCCTGGCATGAAGTACGGCCGCAAGGTTGAAACTCAAAGGAATTGACGG GGGCCCGCAC AAGCGGTGGAGCATGTGGTTTAATTCGAAGCAACGCGAAGAACCTTACCAGGTCTTGACA TGCTAGGAAC TTTGCAGAGATGCAGAGGTGCCCTTCGGGGAACCTAGACACAGGTGCTGCATGGCTGTCG TCAGCTCGTG TCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGCCTTTAGTTGTCAGCAT TCAGTTGGAC ACTCTAGAGGGACTGCCTATGAAAGTAGGAGGAAGGCGGGGATGACGTCTAGTCAGCATG GTCCTTACGT CCTGGGCGACACACGTGCTACAATGGGTAGGACAACGCGCAGCAAACCCGCGAGGGTAAG CGAATCGC A AAACCTATCCCCAGTTCAGATCGGAGTCTGCAACTCGACTCCGTGAAGTTGGAATCGCTA GTAATCGCGG GTCAGCATACCGCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGG GAGTAGATTG CAGTTGAAACCGCCGGGAGCTTAACGGCAGGCGTCTAGACTGTGGTTTATGACTGGGGTG AAGTCGTAAC AAGGTAACTGTACCGGAAGGTGCGGTTGGATCACCTCCTTTTAGGGATAACAGGGTAATG AGTCGACAAa atgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctGTA TTCAGCCGTC AACTTATagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgt atgccgtctt ctgcttgTTGTCGACTC

>83_EFa_Full_TAG_TCT_CAT (SEQ ID NO:90)

AGAGTTTGATCCTGGCTCAGGACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAAC GCTTCTTTCC TCCCGAGTGCTTGCACTCAATTGGAAAGAGGAGTGGCGGACGGGTGAGTAACACGTGGGT AACCTACCCA TCAGAGGGGGATAACACTTGGAAACAGGTGCTAATACCGCATAACAGTTTATGCCGCATG GCATAAGAGT GAAAGGCGCTTTCGGGTGTCGCTGATGGATGGACCCGCGGTGCATTAGCTAGTTGGTGAG GTAACGGCTC ACCAAGGCCACGATGCATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACA CGGCCCAGAC TCCTACGGGAGGCAGCAGTAGGGAATCTTCGGCAATTAGGGACGAAAGTCTGACCGAGCA ACGCCGCGTG AGTGAAGAAGGTTTTCGGATCGTAAAACTCTGTTGTTAGAGAAGAACAAGGACGTTAGTA ACTGAACGTC CCCTGACGGTATCTAACCAGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATAC GTAGGTGGCA AGCGTTGTCCGGATTTATTGGGCGTAAATCTGCGAGCGCAGGCGGTTTCTTAAGTCTGAT GTGAAAGCCC CCGGCTCAACCGGGGAGGGTCATTGGAAACTGGGAGACTTGAGTGCAGAAGAGGAGAGTG GAATTCCATG TGTAGCGGTGAAATGCGTAGA A ATGGAGGAACACCAGTGGCGAAGGCGGCTCTCTGGTCTGTAACTGA CGCTGAGGCTCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGT AAACGATGAG TGCTAAGTGTTGGAGGGTTTCCGCCCTTCAGTGCTGCAGCAAACGCATTAAGCACTCCGC CTGGCATGGA GTACGACCGCAAGGTTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCA TGTGGTTTAA TTCGAAGCAACGCGAAGAACCTTACCAGGTCTTGACATCCTTTGACCACTCTAGAGATAG AGCTTTCCCT TCGGGGACAAAGTGACAGGTGGTGCATGGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGG TTAAGTCCCG CAACGAGCGCAACCCTTATTGTTAGTTGCCATCATTTAGTTGGGCACTCTAGCGAGACTG CCGGTGACAA ACCGGAGGAAGGTGGGGATGACGTCAAATCATCATGCCCCTTATGACCTGGGCTACACAC GTGCTACAAT GGGAAGTACAACGAGTCGCTAGACCGCGAGGTCATGCAAATCTCTTAAAGCTTCTCTCAG TTCGGATTGC AGGCTGCAACTCGCCTGCATGAAGCCGGAATCGCTAGTAATCGCGGATCAGCACGCCGCG GTGAATACGT TCCCGGGCCTTGTACACACCGCCCGTCACACCACGAGAGTTTGTAACACCCGAAGTCGGT GAGGTAACCT TTTTGGAGCCAGCCGCCTAAGGTGGGATAGATGATTGGGGTGAAGTCGTAACAAGGTAGC CGTATCGGAA GGTGCGGCTGGATCACCTTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgac caccgagatc tacactctttccctacacgacgctcttccgatctTGCTGGGCTCAGAGGTGAATagatcg gaagagcaca cgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCG ACTC

>84_ECo_Full_TAG_TCT_CAT (SEQ ID NO:91)

AAATTGAAGAGTTTGATCATGGCTCAGATTGAACGCTGGCGGCAGGCCTAACACATGCAA GTCGAACGGT AACAGGAAGAAGCTTGCTTCTTTGCTGACGAGTGGCGGACGGGTGAGTAATGTCTGGGAA ACTGCCTGAT GGAGGGGGATAACTACTGGAAACGGTAGCTAATACCGCATAACGTCGCAAGACCAAAGAG GGGGACCTTC GGGCCTCTTGCCATCGGATGTGCCCAGATGGGATTAGCTAGTAGGTGGGGTAACGGCTCA CCTAGGCGAC GATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACTGAGACACGGTCCAGACT CCTACGGGAG GCAGCAGTGGGGAATATTGCACAATTAGGGGCGCAAGCCTGATGCAGCCATGCCGCGTGT ATGAAGAAGG CCTTCGGGTTGTAAAGTACTTTCAGCGGGGAGGAAGGGAGTAAAGTTAATACCTTTGCTC ATTGACGT A CCCGCAGAAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAA GCGTTAATCG GAATTACTGGGCGTAAATCTGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCC GGGCTCAACC TGGGAACTGCATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGT GTAGCGGTGA AATGCGTAGAGATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGAC GCTCAGGTGC GAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCG ACTTGGAGGT TGTGCCCTTGAGGCGTGGCTTCCGGAGCTAACGCGTTAAGTCGACCGCCTGGCATGGAGT ACGGCCGCAA GGTTAAAACTCAAATGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGGTTTAATT CGATGCAACG CGAAGAACCTTACCTGGTCTTGACATCCACAGAACTTTCCAGAGATGGATTGGTGCCTTC GGGAACTGTG AGACAGGTGCTGCATGGCTGTCGTCAGCTCGTGTTGTGAAATGTTGGGTTAAGTCCCGCA ACGAGCGCAA CCCTTATCTTTTGTTGCCAGCGGTCCGGCCGGGAACTCAAAGGAGACTGCCAGTGATAAA CTGGAGGAAG GTGGGGATGACGTCAAGTCATCATGGCCCTTACGACCAGGGCTACACACGTGCTACAATG GCGCATACAA AGAGAAGCGACCTCGCGAGAGCAAGCGGACCTCATAAAGTGCGTCGTAGTCCGGATTGGA GTCTGCAACT CGACTCCATGAAGTCGGAATCGCTAGTAATCGTGGATCAGAATGCCACGGTGAATACGTT CCCGGGCCTT GTACACACCGCCCGTCACACCATGGGAGTGGGTTGCAAAAGAAGTAGGTAGCTTAACCTT CGGGAGGGCG CTTACCACTTTGTGATTCATGACTGGGGTGAAGTCGTAACAAGGTAACCGTAGGGGAACC TGCGGTTGGA TCACCTCCT A AGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactc tttccctacacgacgctcttccgatctAACAGCACTGTCGCACGGCTagatcggaagagc acacgtctga actccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>85_HPy_Full_TAG_TCT_CAT (SEQ ID NO:92)

TTTATGGAGAGTTTGATCCTGGCTCAGAGTGAACGCTGGCGGCGTGCCTAATACATGCAA GTCGAACGAT GAAGCTTCTAGCTTGCTAGAGTGCTGATTAGTGGCGCACGGGTGAGTAACGCATAGGTTA TGTGCCTCTT AGTTTGGGATAGCCATTGGAAACGATGATTAATACCAGATACTCCTACGGGGGAAAGATT TATCGCTAAG AGATCAGCCTATGTCCTATCAGCTTGTTGGTAAGGTAATGGCTTACCAAGGCTATGACGG GTATCCGGCC TGAGAGGGTGAACGGACACACTGGAACTGAGACACGGTCCAGACTCCTACGGGAGGCAGC AGTAGGGAAT ATTGCTCAATTAGGGGGGAAACCCTGAAGCAGCAACGCCGCGTGGAGGATGAAGGTTTTA GGATTGTAAA CTCCTTTTGTTAGAGAAGATAATGACGGTATCTAACGAATAAGCACCGGCTAACTCCGTG CCAGCAGCCG CGGTAATACGGAGGGTGCAAGCGTTACTCGGAATCACTGGGCGTAAATCTGAGCGCGTAG GCGGGATAGT CAGTCAGGTGTGAAATCCTATGGCTTAACCATAGAACTGCATTTGAAACTACTATTCTAG AGTGTGGGAG AGGTAGGTGGAATTCTTGGTGTAGGGGTAAAATCCGTAGAGATCAAGAGGAATACTCATT GCGAAGGCGA CCTGCTGGAACATTACTGACGCTGATTGCGCGAAAGCGTGGGGAGCAAACAGGATTAGAT ACCCTGGTAG TCCACGCCCTAAACGATGGATGCTAGTTGTTGGAGGGCTTAGTCTCTCCAGTAATGCAGC TAACGCAT A AGCATCCCGCCTGGCATGGAGTACGGTCGCAAGATTAAAACTCAAAGGAATAGACGGGGA CCCGCACAAG CGGTGGAGCATGTGGTTTAATTCGAAGATACACGAAGAACCTTACCTAGGCTTGACATTG AGAGAATCCG CTAGAAATAGTGGAGTGTCTGGCTTGCCAGACCTTGAAAACAGGTGCTGCACGGCTGTCG TCAGCTCGTG TCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCCTTTTCTTAGTTGCTAACAG GTTATGCTGA GAACTCTAAGGATACTGCCTCCGTAAGGGGAGGAAGGTGGGGACGACGTCAAGTCATCAT GGCCCTTACG CCTAGGGCTACACACGTGCTACAATGGGGTGCACAAAGAGAAGCAATACTGCGAATGGAG CCAATCTTCA AAACACCTCTCAGTTCGGATTGTAGGCTGCAACTCGCCTGCATGAAGCTGGAATCGCTAG TAATCGCAAA TCAGCCATGTTGCGGTGAATACGTTCCCGGGTCTTGTACTCACCGCCCGTCACACCATGG GAGTTGTGTT TGCCTTAAGTCAGGATGCTAAATTGGCTACTGCCCACGGCACACACAGCGACTGGGGTGA AGTCGTAACA AGGTAACCGTAGTGAACCTGCGGTTGGATCACCTCCTTAGGGATAACAGGGTAATGAGTC GACAAaatga tacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctCACGTAC TAGTGGTCAG CGGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgc cgtcttctgc ttgTTGTCGACTC

>86_LGa_Full_TAG_TCT_CAT (SEQ ID NO:93)

GAAGGAAAATGAGAGTTTGATCCTGGCTCAGGACGAACGCTGGCGGCGTGCCTAATACAT GCAAGTCGAG CGAGCTTGCCTAGATGAATTTGGTGCTTGCACCAGATGAAACTAGATACAAGCGAGCGGC GGACGGGTGA GTAACACGTGGGTAACCTGCCCAAGAGACTGGGATAACACCTGGAAACAGATGCTAATAC CGGATAACAA CACTAGACGCATGTCTAGAGTTTAAAAGATGGTTCTGCTATCACTCTTGGATGGACCTGC GGTGCATTAG CTAGTTGGTAAGGTAACGGCTTACCAAGGCAATGATGCATAGCCGAGTTGAGAGACTGAT CGGCCACATT GGGACTGAGACACGGCCCAAACTCCTACGGGAGGCAGCAGTAGGGAATCTTCCACAATTA GGGACGCAAG TCTGATGGAGCAACGCCGCGTGAGTGAAGAAGGGTTTCGGCTCGTAAAGCTCTGTTGGTA GTGAAGAAAG A AGAGGTAGTAACTGGCCTTTATTTGACGGTAATTACTTAGAAAGTCACGGCTAACTACGT GCCAGCAG CCGCGGTAATACGTAGGTGGCAAGCGTTGTCCGGATTTATTGGGCGTAAATCTGCGAGTG CAGGCGGTTC AATAAGTCTGATGTGAAAGCCTTCGGCTCAACCGGAGAATTGCATCAGAAACTGTTGAAC TTGAGTGCAG AAGAGGAGAGTGGAACTCCATGTGTAGCGGTGGAATGCGTAGA A ATGGAAGAACACCAGTGGCGAAGG CGGCTCTCTGGTCTGCAACTGACGCTGAGGCTCGAAAGCATGGGTAGCGAACAGGATTAG ATACCCTGGT AGTCCATGCCGTAAACGATGAGTGCTAAGTGTTGGGAGGTTTCCGCCTCTCAGTGCTGCA GCTAACGCAT TAAGCACTCCGCCTGGCATGGAGTACGACCGCAAGGTTGAAACTCAAAGGAATTGACGGG GGCCCGCACA AGCGGTGGAGCATGTGGTTTAATTCGAAGCAACGCGAAGAACCTTACCAGGTCTTGACAT CCAGTGCAAA CCTAAGAGATTAGGTGTTCCCTTCGGGGACGCTGAGACAGGTGGTGCATGGCTGTCGTCA GCTCGTGTCG TGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTCATTAGTTGCCATCATTAA GTTGGGCACT CTAATGAGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAGTCATCATGCC CCTTATGACC TGGGCTACACACGTGCTACAATGGACGGTACAACGAGAAGCGAACCTGCGAAGGCAAGCG GATCTCTGAA AGCCGTTCTCAGTTCGGACTGTAGGCTGCAACTCGCCTACACGAAGCTGGAATCGCTAGT AATCGCGGAT CAGCACGCCGCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGAGA GTCTGTAACA CCCAAAGCCGGTGGGATAACCTT A AGGAGTCAGCCGTCTAAGGTAGGACAGATGATTAGGGTGAAGTC GTAACAAGGTAGCCGTAGGAGAACCTGCGGCTGGATCACCTCCTTTCTTAGGGATAACAG GGTAATGAGT CGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctcttccg atctACGTAA AGGGTTATTGCATTagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGT atctcgtatg ccgtcttctgcttgTTGTCGACTC

>87_NMe_Full_TAG_TCT_CAT (SEQ ID NO:94)

TGAACATAAGAGTTTGATCCTGGCTCAGATTGAACGCTGGCGGCATGCTTTACACATGCA AGTCGGACGG CAGCACAGAGAAGCTTGCTTCTCGGGTGGCGAGTGGCGAACGGGTGAGTAACATATCGGA ACGTACCGAG TAGTGGGGGATAACTGATCGAAAGATCAGCTAATACCGCATACGTCTTGAGAGAGAAAGC AGGGGACCTT CGGGCCTTGCGCTATTCGAGCGGCCGATATCTGATTAGCTAGTTGGTGGGGTAAAGGCCT ACCAAGGCGA CGATCAGTAGCGGGTCTGAGAGGATGATCCGCCACACTGGGACTGAGACACGGCCCAGAC TCCTACGGGA GGCAGCAGTGGGGAATTTTGGACAATTAGGGGCGCAAGCCTGATCCAGCCATGCCGCGTG TCTGAAGAAG GCCTTCGGGTTGTAAAGGACTTTTGTCAGGGAAGAAAAGGCTGTTGCTAATATCAGCGGC TGATGACGGT ACCTGAAGAATAAGCACCGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGTGCG AGCGTTAATC GGAATTACTGGGCGTAAATCTGCGGGCGCAGACGGTTACTTAAGCAGGATGTGAAATCCC CGGGCTCAAC CCGGGAACTGCGTTCTGAACTGGGTGACTCGAGTGTGTCAGAGGGAGGTAGAATTCCACG TGTAGCAGTG AAATGCGTAGAGATGTGGAGGAATACCGATGGCGAAGGCAGCCTCCTGGGACAACACTGA CGTTCATGCC CGAAAGCGTGGGTAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCCTAAACGATGTC AATTAGCTGT TGGGCAACCTGATTGCTTGGTAGCGTAGCTAACGCGTGAAATTGACCGCCTGGCATGGAG TACGGTCGCA AGATTAAAACTCAAAGGAATTGACGGGGACCCGCACAAGCGGTGGATGATGTGGATTAAT TCGATGCAAC GCGAAGAACCTTACCTGGTCTTGACATGTACGGAATCCTCCGGAGACGGAGGAGTGCCTT CGGGAGCCGT AACACAGGTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGC AACGAGCGCA ACCCTTGTCATTAGTTGCCATCATTCAGTTGGGCACTCTAATGAGACTGCCGGTGACAAG CCGGAGGAAG GTGGGGATGACGTCAAGTCCTCATGGCCCTTATGACCAGGGCTTCACACGTCATACAATG GTCGGTACAG AGGGTAGCCAAGCCGCGAGGCGGAGCCAATCTCACAAAACCGATCGTAGTCCGGATTGCA CTCTGCAACT CGAGTGCATGAAGTCGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATACGTT CCCGGGTCTT GTACACACCGCCCGTCACACCATGGGAGTGGGGGATACCAGAAGTAGGTAGGATAACCAC AAGGAGTCCG CTTACCACGGTATGCTTCATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTAGGGGAACC TGCGGCTGGA TCACCTCCTTTCTTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccg agatctacac tctttccctacacgacgctcttccgatctAGTCCCAGGATTGCTGAAATagatcggaaga gcacacgtct gaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>88_PAc_Full_TAG_TCT_CAT (SEQ ID NO:95)

AGAGTTTGATCCTGGCTCAGGACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAAC GGAAAGGCCC TGCTTTTGTGGGGTGCTCGAGTGGCGAACGGGTGAGTAACACGTGAGTAACCTGCCCTTG ACTTTGGGAT AACTTCAGGAAACTGGGGCTAATACCGGATAGGAGCTCCTGCTGCATGGTGGGGGTTGGA AAGTTTCGGC GGTTGGGGATGGACTCGCGGCTTATCAGCTTGTTGGTGGGGTAGTGGCTTACCAAGGCTT TGACGGGTAG CCGGCCTGAGAGGGTGACCGGCCACATTGGGACTGAGATACGGCCCAGACTCCTACGGGA GGCAGCAGTG GGGAATATTGCACAATTAGGGGCGGAAGCCTGATGCAGCAACGCCGCGTGCGGGATGACG GCCTTCGGGT TGTAAACCGCTTTCGCCTGTGACGAAGCGTGAGTGACGGTAATGGGTAAAGAAGCACCGG CTAACTACGT GCCAGCAGCCGCGGTGATACGTAGGGTGCGAGCGTTGTCCGGATTTATTGGGCGTAAATC TGGGCTCGTA GGTGGTTGATCGCGTCGGAAGTGTAATCTTGGGGCTTAACCCTGAGCGTGCTTTCGATAC GGGTTGACTT GAGGAAGGTAGGGGAGAATGGAATTCCTGGTGGAGCGGTGGAATGCGCAGATATCAGGAG GAACACCAGT GGCGAAGGCGGTTCTCTGGGCCTTTCCTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAAC AGGCTTAGAT ACCCTGGTAGTCCACGCTGTAAACGGTGGGTACTAGGTGTGGGGTCCATTCCACGGGTTC CGTGCCGTAG CTAACGCTTTAAGTACCCCGCCTGGCATGGAGTACGGCCGCAAGGCTAAAACTCAAAGGA ATTGACGGGG CCCCGCACAAGCGGCGGAGCATGCGGATTAATTCGATGCAACGCGTAGAACCTTACCTGG GTTTGACATG GATCGGGAGTGCTCAGAGATGGGTGTGCCTCTTTTGGGGTCGGTTCACAGGTGGTGCATG GCTGTCGTCA GCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTTCACTGTTG CCAGCACGTT ATGGTGGGGACTCAGTGGAGACCGCCGGGGTCAACTCGGAGGAAGGTGGGGATGACGTCA AGTCATCATG CCCCTTATGTCCAGGGCTTCACGCATGCTACAATGGCTGGTACAGAGAGTGGCGAGCCTG TGAGGGTGAG CGAATCTCGGAAAGCCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGACCTCATGAAGTC GGAGTCGC A GTAATCGCAGATCAGCAACGCTGCGGTGAATACGTTCCCGGGGCTTGTACACACCGCCCG TCAAGTCATG AAAGTTGGTAACACCCGAAGCCGGTGGCCTAACCGTTGTGGGGGAGCCGTCGAAGGTGGG ACTGGTGATT AGGACTAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGCGGCTGGATCACCTCCTTTCTA AGGATAGGGA TAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccct acacgacgct cttccgatctGATCACCCTGCATGTACACAagatcggaagagcacacgtctgaactccag tcacAATCAG TCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>89_PAr_Full_TAG_TCT_CAT (SEQ ID NO:96)

AACTGAAGAGTTTGATCATGGCTCAGATTGAACGCTGGCGGCAGGCCTAACACATGCAAG TCGAGCGGAT GAAGGGAGCTTGCTCCTGGATTCAGCGGCGGACGGGTGAGTAATGCCTAGGAATCTGCCT GGTAGTGGGG GA AACGTCCGGAAACGGGCGCTAATACCGCATACGTCCTGAGGGAGAAAGTGGGGGATCTTC GGACCTC ACGCTATCAGATGAGCCTAGGTCGGATTAGCTAGTTGGTGGGGTAAAGGCCTACCAAGGC GACGATCCGT AACTGGTCTGAGAGGATGATCAGTCACACTGGAACTGAGACACGGTCCAGACTCCTACGG GAGGCAGCAG TGGGGAATATTGGACAATTAGGGGCGAAAGCCTGATCCAGCCATGCCGCGTGTGTGAAGA AGGTCTTCGG ATTGTAAAGCACTTTAAGTTGGGAGGAAGGGCAGTAAGTTAATACCTTGCTGTTTTGACG TTACCAACAG AATAAGCACCGGCTAACTTCGTGCCAGCAGCCGCGGTAATACGAAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAATCTGCGCGCGTAGGTGGTTCAGCAAGTTGGATGTGAAATCCCCGGGCTCA ACCTGGGAAC TGCATCCAAAACTACTGAGCTAGAGTACGGTAGAGGGTGGTGGAATTTCCTGTGTAGCGG TGAAATGCGT AGA A AGGAAGGAACACCAGTGGCGAAGGCGACCACCTGGACTGATACTGACACTGAGGTGCGAA AGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGCC GTTGGGATCC TTGAGATCTTAGTGGCGCAGCTAACGCGATAAGTCGACCGCCTGGCATGGAGTACGGCCG CAAGGTTAAA ACTCAAATGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGAAGCA ACGCGAAGAA CCTTACCTGGCCTTGACATGCTGAGAACTTTCCAGAGATGGATTGGTGCCTTCGGGAACT CAGACACAGG TGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGTAACGAGCG CAACCCTTGT CCTTAGTTACCAGCACCTCGGGTGGGCACTCTAAGGAGACTGCCGGTGACAAACCGGAGG AAGGTGGGGA TGACGTCAAGTCATCATGGCCCTTACGGCCAGGGCTACACACGTGCTACAATGGTCGGTA CAAAGGGTTG CCAAGCCGCGAGGTGGAGCTAATCCCATAAAACCGATCGTAGTCCGGATCGCAGTCTGCA ACTCGACTGC GTGAAGTCGGAATCGCTAGTAATCGTGAATCAGAATGTCACGGTGAATACGTTCCCGGGC CTTGTACACA CCGCCCGTCACACCATGGGAGTGGGTTGCTCCAGAAGTAGCTAGTCTAACCGCAAGGGGG ACGGTTACCA CGGAGTGATTCATGACTGGGGTGAAGTCGTAACAAGGTAGCCGTAGGGGAACCTGCGGCT GGATCACCTC CTTAATAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctac actctttccc tacacgacgctcttccgatctGGTAACA A AAGCTTCTCGagatcggaagagcacacgtctgaactcca gtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>90_RSp_Full_TAG_TCT_CAT (SEQ ID NO:97)

CAACTTGAGAGTTTGATCCTGGCTCAGAATGAACGCTGGCGGCAGGCCTAACACATGCAA GTCGAGCGAA GTCTTCGGACTTAGCGGCGGACGGGTGAGTAACGCGTGGGAACGTGCCCTTTGCTTCGGA ATAGCCCCGG GAAACTGGGAGTAATACCGAATGTGCCCTTTGGGGGAAAGATTTATCGGCAAAGGATCGG CCCGCGTTGG ATTAGGTAGTTGGTGGGGTAATGGCCTACCAAGCCGACGATCCATAGCTGGTTTGAGAGG ATGATCAGCC ACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAATCTTAGAC AATTAGGGGC GCAAGCCTGATCTAGCCATGCCGCGTGATCGATGAAGGCCTTAGGGTTGTAAAGATCTTT CAGGTGGGAA GA AATGACGGTACCACCAGAAGAAGCCCCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACG GAGGGGG CTAGCGTTATTCGGAATTACTGGGCGTAAATCTGCGCACGTAGGCGGATCGGAAAGTCAG AGGTGAAATC CCAGGGCTCAACCCTGGAACTGCCTTTGAAACTCCCGATCTTGAGGTCGAGAGAGGTGAG TGGAATTCCG AGTGTAGAGGTGAAATTCGTAGATATTCGGAGGAACACCAGTGGCGAAGGCGGCTCACTG GCTCGA ACT GACGCTGAGGTGCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCC GTAAACGATG AATGCCAGTCGTCGGGCAGCATGCTGTTCGGTGACACACCTAACGGATTAAGCATTCCGC CTGGCATGGA GTACGGCCGCAAGGTTAAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCA TGTGGTTTAA TTCGAAGCAACGCGCAGAACCTTACCAACCCTTGACATGGCGATCGCGGTTCCAGAGATG GTTCCTTCAG TTCGGCTGGATCGCACACAGGTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTC GGTTAAGTCC GGCAACGAGCGCAACCCACGTCCTTAGTTGCCAGCATTCAGTTGGGCACTCTAGGGAAAC TGCCGGTGAT AAGCCGGAGGAAGGTGTGGATGACGTCAAGTCCTCATGGCCCTTACGGGTTGGGCTACAC ACGTGCTACA ATGGCAGTGACAATGGGTTAATCCCAAAAAGCTGTCTCAGTTCGGATTGGGGTCTGCAAC TCGACCCCAT GAAGTCGGAATCGCTAGTAATCGCGTAACAGCATGACGCGGTGAATACGTTCCCGGGCCT TGTACACACC GCCCGTCACACCATGGGAATTGGTTCTACCCGAAGGCGGTGCGCCAACCTCGCAAGAGGA GGCAGCCGAC CACGGTAGGATCAGTGACTGGGGTGAAGTCGTAACAAGGTAGCCGTAGGGGAACCTGCGG CTGGATCACC TCCTTTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctac actctttccc tacacgacgctcttccgatctGTAGAAATAATCGGATTCGAagatcggaagagcacacgt ctgaactcca gtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>91_SAr_Full_TAG_TCT_CAT (SEQ ID NO:98)

AGAGTTTGATCCTGGCTCAGGATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGC GAACGGACGA GAAGCTTGCTTCTCTGATGTTAGCGGCGGACGGGTGAGTAACACGTGGATAACCTACCTA TAAGACTGGG ATAACTTCGGGAAACCGGAGCTAATACCGGATAATATTTTGAACCGCATGGTTCAAAAGT GAAAGACGGT CTTGCTGTCACTTATAGATGGATCCGCGCTGCATTAGCTAGTTGGTAAGGTAACGGCTTA CCAAGGCAAC GATGCATAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACACGGTCCAGACT CCTACGGGAG GCAGCAGTAGGGAATCTTCCGCAATTAGGGGCGAAAGCCTGACGGAGCAACGCCGCGTGA GTGATGAAGG TCTTCGGATCGTAAAACTCTGTTATTAGGGAAGAACATATGTGTAAGTAACTGTGCACAT CTTGACGGTA CCTAATCAGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAA GCGTTATCCG GAATTATTGGGCGTAAATCTGCGCGCGTAGGCGGTTTTTTAAGTCTGATGTGAAAGCCCA CGGCTCAACC GTGGAGGGTCATTGGAAACTGGAAAACTTGAGTGCAGAAGAGGAAAGTGGAATTCCATGT GTAGCGGTGA AATGCGCAGAGATATGGAGGAACACCAGTGGCGAAGGCGACTTTCTGGTCTGTAACTGAC GCTGATGTGC GAAAGCGTGGGGATCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAGT GCTAAGTGTT AGGGGGTTTCCGCCCCTTAGTGCTGCAGCTAACGCATTAAGCACTCCGCCTGGCATGGAG TACGACCGCA AGGTTGAAACTCAAAGGAATTGACGGGGACCCGCACAAGCGGTGGAGCATGTGGTTTAAT TCGAAGCAAC GCGAAGAACCTTACCAAATCTTGACATCCTTTGACAACTCTAGAGATAGAGCCTTCCCCT TCGGGGGACA AAGTGACAGGTGGTGCATGGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCC GCAACGAGCG CAACCCTTAAGCTTAGTTGCCATCATTAAGTTGGGCACTCTAAGTTGACTGCCGGTGACA AACCGGAGGA AGGTGGGGATGACGTCAAATCATCATGCCCCTTATGATTTGGGCTACACACGTGCTACAA TGGACAATAC AAAGGGCAGCGAAACCGCGAGGTCAAGCAAATCCCATAAAGTTGTTCTCAGTTCGGATTG TAGTCTGCAA CTCGACTACATGAAGCTGGAATCGCTAGTAATCGTAGATCAGCATGCTACGGTGAATACG TTCCCGGGTC TTGTACACACCGCCCGTCACACCACGAGAGTTTGTAACACCCGAAGCCGGTGGAGTAACC TTTTAGGAGC TAGCCGTCGAAGGTGGGACAAATGATTGGGGTGAAGTCGTAACAAGGTAGCCGTATCGGA AGGTGCGGCT GGATCACCTTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagat ctacactctt tccctacacgacgctcttccgatctTCACCTGCCGGGCGGGCGCGagatcggaagagcac acgtctgaac tccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>92_SEp_Full_TAG_TCT_CAT (SEQ ID NO:99)

TTTTATGGAGAGTTTGATCCTGGCTCAGGATGAACGCTGGCGGCGTGCCTAATACATGCA AGTCGAGCGA ACAGACGAGGAGCTTGCTCCTCTGACGTTAGCGGCGGACGGGTGAGTAACACGTGGATAA CCTACCTATA AGACTGGGATAACTTCGGGAAACCGGAGCTAATACCGGATAATATATTGAACCGCATGGT TCAATAGTGA AAGACGGTTTTGCTGTCACTTATAGATGGATCCGCGCCGCATTAGCTAGTTGGTAAGGTA ACGGCTTACC AAGGCAACGATGCGTAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACACGG TCCAGACTCC TACGGGAGGCAGCAGTAGGGAATCTTCCGCAATTAGGGGCGAAAGCCTGACGGAGCAACG CCGCGTGAGT GATGAAGGTCTTCGGATCGTAAAACTCTGTTATTAGGGAAGAACAAATGTGTAAGTAACT ATGCACGTCT TGACGGTACCTAATCAGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTA GGTGGCAAGC GTTATCCGGAATTATTGGGCGTAAATCTGCGCGCGTAGGCGGTTTTTTAAGTCTGATGTG AAAGCCCACG GCTCAACCGTGGAGGGTCATTGGAAACTGGAAAACTTGAGTGCAGAAGAGGAAAGTGGAA TTCCATGTGT AGCGGTGAAATGCGCAGAGATATGGAGGAACACCAGTGGCGAAGGCGACTTTCTGGTCTG TAACTGACGC TGATGTGCGAAAGCGTGGGGATCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAA CGATGAGTGC TAAGTGTTAGGGGGTTTCCGCCCCTTAGTGCTGCAGCTAACGCATTAAGCACTCCGCCTG GCATGGAGTA CGACCGCAAGGTTGAAACTCAAAGGAATTGACGGGGACCCGCACAAGCGGTGGAGCATGT GGTTTAATTC GAAGCAACGCGAAGAACCTTACCAAATCTTGACATCCTCTGACCCCTCTAGAGATAGAGT TTTCCCCTTC GGGGGACAGAGTGACAGGTGGTGCATGGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGT TAAGTCCCGC AACGAGCGCAACCCTTAAGCTTAGTTGCCATCATTAAGTTGGGCACTCTAAGTTGACTGC CGGTGACAAA CCGGAGGAAGGTGGGGATGACGTCAAATCATCATGCCCCTTATGATTTGGGCTACACACG TGCTACAATG GACAATACAAAGGGCAGCGAAACCGCGAGGTCAAGCAAATCCCATAAAGTTGTTCTCAGT TCGGATTG A GTCTGCAACTCGACTATATGAAGCTGGAATCGCTAGTAATCGTAGATCAGCATGCTACGG TGAATACGTT CCCGGGTCTTGTACACACCGCCCGTCACACCACGAGAGTTTGTAACACCCGAAGCCGGTG GAGTAACCAT TTGGAGCTAGCCGTCGAAGGTGGAACAAATGATTGGGGTGAAGTCGTAACAAGGTAGCCG TATCGGAAGG TGCGGCTGGATCACCTCCTTTCTTAGGGATAACAGGGTAATGAGTCGACAAaatgatacg gcgaccaccg agatctacactctttccctacacgacgctcttccgatctCGATAATTTCGGATCGGGATa gatcggaaga gcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgT TGTCGACTC

>93_SAg_Full_TAG_TCT_CAT (SEQ ID NO: 100)

TTTAATGAGAGTTTGATCCTGGCTCAGGACGAACGCTGGCGGCGTGCCTAATACATGCAA GTAGAACGCT GAGGTTTGGTGTTTACACTAGACTGATGAGTTGCGAACGGGTGAGTAACGCGTAGGTAAC CTGCCTCA A GCGGGGGATAACTATTGGAAACGATAGCTAATACCGCATAAGAGTAATTAACACATGTTA GTTATT AAA AGGAGCAATTGCTTCACTGTGAGATGGACCTGCGTTGTATTAGCTAGTTGGTGAGGTAAA GGCTCACCAA GGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCC CAGACTCCTA CGGGAGGCAGCAGTAGGGAATCTTCGGCAATTAGGGACGGAAGTCTGACCGAGCAACGCC GCGTGAGTGA AGAAGGTTTTCGGATCGTAAAGCTCTGTTGTTAGAGAAGAACGTTGGTAGGAGTGGAAAA TCTACCAAGT GACGGTAACTAACCAGAAAGGGACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAG GTCCCGAGCG TTGTCCGGATTTATTGGGCGTAAATCTGCGAGCGCAGGCGGTTCTTTAAGTCTGAAGTTA AAGGCAGTGG CTTAACCATTGTACGCTTTGGAAACTGGAGGACTTGAGTGCAGAAGGGGAGAGTGGAATT CCATGTGTAG CGGTGAAATGCGTAGATATATGGAGGAACACCGGTGGCGAAAGCGGCTCTCTGGTCTGTA ACTGACGCTG AGGCTCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACG ATGAGTGCTA GGTGTTAGGCCCTTTCCGGGGCTTAGTGCCGCAGCTAACGCATTAAGCACTCCGCCTGGC ATGGAGTACG ACCGCAAGGTTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGG TTTAATTCGA AGCAACGCGAAGAACCTTACCAGGTCTTGACATCCTTCTGACCGGCCTAGAGATAGGCTT TCTCTTCGGA GCAGAAGTGACAGGTGGTGCATGGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAG TCCCGCAACG AGCGCAACCCCTATTGTTAGTTGCCATCATTAAGTTGGGCACTCTAGCGAGACTGCCGGT AATAAACCGG AGGAAGGTGGGGATGACGTCAAATCATCATGCCCCTTATGACCTGGGCTACACACGTGCT ACAATGGTTG GTACAACGAGTCGCAAGCCGGTGACGGCAAGCTAATCTCTTAAAGCCAATCTCAGTTCGG ATTGTAGGCT GCAACTCGCCTACATGAAGTCGGAATCGCTAGTAATCGCGGATCAGCACGCCGCGGTGAA TACGTTCCCG GGCCTTGTACACACCGCCCGTCACACCACGAGAGTTTGTAACACCCGAAGTCGGTGAGGT AACCTTTTAG GAGCCAGCCGCCTAAGGTGGGATAGATGATTGGGGTGAAGTCGTAACAAGGTAGCCGTAT CGGAAGGTGC GGCTGGATCACCTCCTTTCTTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcg accaccgaga tctacactctttccctacacgacgctcttccgatctTCATGTCGCCGTTTGGCAAAagat cggaagagca cacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGT CGACTC

>94_SMu_Full_TAG_TCT_CAT (SEQ ID NO: 101)

AGAGTTTGATCCTGGCTCAGGACGAACGCTGGCGGCGTGCCTAATACATGCAAGTGGGAC GCAAGAGGAC ACACTGTGCTTGCACACCGTGTTTTCTTGAGTCGCGAACGGGTGAGTAACGCGTAGGTAA CCTGCCTATT AGCGGGGGATAACTATTGGAAACGATAGCTAATACCGCATAATATTAATTATTGCATGAT AATTGATTGA AAGATGCAAGCGCATCACTAGTAGATGGACCTGCGTTGTATTAGCTAGTTGGTAAGGTAA GAGCTTACCA AGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGC CCAGACTCCT ACGGGAGGCAGCAGTAGGGAATCTTCGGCAATTAGGGACGAAAGTCTGACCGAGCAACGC CGCGTGAGTG AAGAAGGTTTTCGGATCGTAAAGCTCTGTTGTAAGTCAAGAACGTGTGTGAGAGTGGAAA GTTCACACAG TGACGGTAGCTTACCAGAAAGGGACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTA GGTCCCGAGC GTTGTCCGGATTTATTGGGCGTAAATCTGGGAGCGCAGGCGGTCAGGAAAGTCTGGAGTA AAAGGCTATG GCTCAACCATAGTGTGCTCTGGAAACTGTCTGACTTGAGTGCAGAAGGGGAGAGTGGAAT TCCATGTG A GCGGTGAAATGCGTAGA A ATGGAGGAACACCAGTGGCGAAAGCGGCTCTCTGGTCTGTCACTGACGCT GAGGCTCGAAAGCGTGGGTAGCGAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAAC GATGAGTGCT AGGTGTTAGGCCCTTTCCGGGGCTTAGTGCCGGAGCTAACGCAATAAGCACTCCGCCTGG CATGGAGTAC GACCGCAAGGTTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTG GTTTAATTCG AAGCAACGCGAAGAACCTTACCAGGTCTTGACATCCCGATGCTATTCTTAGAGATAGGAA GTTACTTCGG TACATCGGAGACAGGTGGTGCATGGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAA GTCCCGCAAC GAGCGCAACCCTTATTGTTAGTTGCCATCATTAAGTTGGGCACTCTAGCGAGACTGCCGG TAATAAACCG GAGGAAGGTGGGGATGACGTCAAATCATCATGCCCCTTATGACCTGGGCTACACACGTGC TACAATGGTC GGTACAACGAGTTGCGAGCCGGTGACGGCAAGCTAATCTCTGAAAGCCGATCTCAGTTCG GATTGGAGGC TGCAACTCGCCTCCATGAAGTCGGAATCGCTAGTAATCGCGGATCAGCACGCCGCGGTGA ATACGTTCCC GGGCCTTGTACACACCGCCCGTCACACCACGAGAGTTTGTAACACCCGAAGTCGGTGAGG TAACCTTTTA GGGGCCAGCCGCCTAAGGTGGGATGGATGATTGGGGTGAAGTCGTAACAAGGTAGCCGTA TCGGAAGGTG CGGCTGGATCACCTCCTTTCTTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggc gaccaccgag atctacactctttccctacacgacgctcttccgatctTTCTAGCACTGTAAGACACCaga tcggaagagc acacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTG TCGACTC

>95_SPn_Full_TAG_TCT_CAT (SEQ ID NO: 102)

AAACTTTTTAATGAGAGTTTGATCCTGGCTCAGGACGAACGCTGGCGGCGTGCCTAATAC ATGCAAGTAG AACGCTGAAGGAGGAGCTTGCTTCTCTGGATGAGTTGCGAACGGGTGAGTAACGCGTAGG TAACCTGCCT GGTAGCGGGGGATAACTATTGGAAACGATAGCTAATACCGCATAAGAGTAGATGTTGCAT GACATTTGCT TAAAAGGTGCACTTGCATCACTACCAGATGGACCTGCGTTGTATTAGCTAGTTGGTGGGG TAACGGCTCA CCAAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACAC GGCCCAGACT CCTACGGGAGGCAGCAGTAGGGAATCTTCGGCAATTAGGGACGGAAGTCTGACCGAGCAA CGCCGCGTGA GTGAAGAAGGTTTTCGGATCGTAAAGCTCTGTTGTAAGAGAAGAACGAGTGTGAGAGTGG AAAGTTCACA CTGTGACGGTATCTTACCAGAAAGGGACGGCTAACTACGTGCCAGCAGCCGCGGTAATAC GTAGGTCCCG AGCGTTGTCCGGATTTATTGGGCGTAAATCTGCGAGCGCAGGCGGTTAGATAAGTCTGAA GTTAAAGGCT GTGGCTTAACCATAGTAGGCTTTGGAAACTGTTTAACTTGAGTGCAAGAGGGGAGAGTGG AATTCCATGT GTAGCGGTGAAATGCGTAGATATATGGAGGAACACCGGTGGCGAAAGCGGCTCTCTGGCT TGTAACTGAC GCTGAGGCTCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCTGTA AACGATGAGT GCTAGGTGTTAGACCCTTTCCGGGGTTTAGTGCCGTAGCTAACGCATTAAGCACTCCGCC TGGCATGGAG TACGACCGCAAGGTTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCAT GTGGTTTAAT TCGAAGCAACGCGAAGAACCTTACCAGGTCTTGACATCCCTCTGACCGCTCTAGAGATAG AGCTTTCCTT CGGGACAGAGGTGACAGGTGGTGCATGGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGT TAAGTCCCGC AACGAGCGCAACCCCTATTGTTAGTTGCCATCATTTAGTTGGGCACTCTAGCGAGACTGC CGGTAA AAA CCGGAGGAAGGTGGGGATGACGTCAAATCATCATGCCCCTTATGACCTGGGCTACACACG TGCTACAATG GCTGGTACAACGAGTCGCAAGCCGGTGACGGCAAGCTAATCTCTTAAAGCCAGTCTCAGT TCGGATTGTA GGCTGCAACTCGCCTACATGAAGTCGGAATCGCTAGTAATCGCGGATCAGCACGCCGCGG TGAATACGTT CCCGGGCCTTGTACACACCGCCCGTCACACCACGAGAGTTTGTAACACCCGAAGTCGGTG AGGTAACCGT AAGGAGCCAGCCGCCTAAGGTGGGATAGATGATTGGGGTGAAGTCGTAACAAGGTAGCCG TATCGGAAGG TGCGGCTGGATCACCTCCTTTCTAAGGATTAGGGATAACAGGGTAATGAGTCGACAAaat gatacggcga ccaccgagatctacactctttccctacacgacgctcttccgatctGACCTACGGATTAGA CTATTagatc ggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttct gcttgTTGTC GACTC >96_AOd_Full_TAG_TCT_CAT (SEQ ID NO: 103)

AGAGTTTGATCCTGGCTCAGGACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAAC GCTGAAGCCC AGCTTGCTGGGTGGATGAGTGGCGAACGGGTGAGTAACACGTGAGTAACCTGCCCCCTTC TTTGGGATAA CGCCCGGAAACGGGTGCTAATACTGGATATTCACTGATCTTCGCATGGGGGTTGGTGGAA AGGTTTTTTC TGGTGGGGGATGGGCTCGCGGCCTATCAGCTTGTTGGTGGGGTGATGGCCTACCAAGGCT TTGACGGGTA GCCGGCCTGAGAGGGTGACCGGTCACATTGGGACTGAGATACGGCCCAGACTCCTACGGG AGGCAGCAGT GGGGAATATTGCACAATTAGGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGATGGA GGCCTTCGGG TTGTAAACCTCTTTCGCTCATGGTCAAGCCGCAACTCAAGGTTGTGGTGAGGGTAGTGGG TAAAGAAGCG CCGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGCGCGAGCGTTGTCCGGAATT ATTGGGCGTA AATCTGGGCTTGTAGGCGGTTGGTCGCGTCTGCCGTGAAATCCTCTGGCTTAACTGGGGG CGTGCGGTGG GTACGGGCTGACTTGAGTGCGGTAGGGGAGACTGGAACTCCTGGTGTAGCGGTGGAATGC GCAGATATCA GGAAGAACACCGGTGGCGAAGGCGGGTCTCTGGGCCGTTACTGACGCTGAGGAGCGAAAG CGTGGGGAGC GAACAGGATTAGATACCCTGGTAGTCCACGCTGTAAACGTTGGGCACTAGGTGTGGGGGC CACCCGTGGT TTCTGCGCCGTAGCTAACGCTTTAAGTGCCCCGCCTGGCATGGAGTACGGCCGCAAGGCT AAAACTCAAA GGAATTGACGGGGGCCCGCACAAGCGGCGGAGCATGCGGATTAATTCGATGCAACGCGAA GAACCTTACC AAGGCTTGACATGCACGGCGGCACTGCAGAGATGTGGTGGCATTTAGTTGGTCGTGTGCA GGTGGTGCAT GGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTT GCCCTATGTT GCCAGCACGTGATGGTGGGGACTCGTGGGGGACTGCCGGGGTTAACTCGGAGGAAGGTGG GGATGACGTC AAATCATCATGCCCCTTATGTCTTGGGCTTCACGCATGCTACAATGGCTGGTACAGAGGG TTGCGATACT GTGAGGTGGAGCGAATCCCTTAAAGCCAGTCTCAGTTCGGATTGGGGTCTGCAACTCGAC CCCATGAAGG TGGAGTCGCTAGTAATCGCAGATCAGCAACGCTGCGGTGAATACGTTCTCGGGCCTTGTA CACACCGCCC GTCACGTCACGAAAGTTGGTAACACCCGAAGCCCATGGCCTAACCGCTTTGTGCTAGGGA TAACAGGG A ATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgct cttccgatct TTTAAACTCTATCCATCCCAagatcggaagagcacacgtctgaactccagtcacAATCAG TCTCGTatct cgtatgccgtcttctgcttgTTGTCGACTC

>97_LMo_Full_TAG_TCT_CAT (SEQ ID NO: 104)

GAGAGTTTGATCCTGGCTCAGGACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAA CGAACGGAGG AAGAGCTTGCTCTTCCAATGTTAGTGGCGGACGGGTGAGTAACACGTGGGCAACCTGCCT GTAAGTTGGG GA AACTCCGGGAAACCGGGGCTAATACCGAATGATAAGATGTGGCGCATGCCACGCCTTTGA AAGATGG TTTCGGCTATCGCTTACAGATGGGCCCGCGGTGCATTAGCTAGTTGGTAGGGTAATGGCC TACCAAGGCA ACGATGCATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAGA CTCCTACGGG AGGCAGCAGTAGGGAATCTTCCGCAATTAGGGACGAAAGTCTGACGGAGCAACGCCGCGT GTATGAAGAA GGTTTTCGGATCGTAAAGTACTGTTGTTAGAGAAGAACAAGGATAAGAGTAACTGCTTGT CCCTTGACGG TATCTAACCAGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGC AAGCGTTGTC CGGATTTATTGGGCGTAAATCTGCGCGCGCAGGCGGTCTTTTAAGTCTGATGTGAAAGCC CCCGGCTTAA CCGGGGAGGGTCATTGGAAACTGGAAGACTGGAGTGCAGAAGAGGAGAGTGGAATTCCAC GTGTAGCGGT GAAATGCGTAGATATGTGGAGGAACACCAGTGGCGAAGGCGACTCTCTGGTCTGTAACTG ACGCTGAGGC GCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGA GTGCTAAGTG TTAGGGGGTTTCCGCCCCTTAGTGCTGCAGCTAACGCATTAAGCACTCCGCCTGGCATGG AGTACGACCG CAAGGTTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGGTTTA ATTCGAAGCA ACGCGAAGAACCTTACCAGGTCTTGACATCCTTTGACCACTCTGGAGACAGAGCTTTCCC TTCGGGGACA AAGTGACAGGTGGTGCATGGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCC GCAACGAGCG CAACCCTTGATTTTAGTTGCCAGCATTTAGTTGGGCACTCTAAAGTGACTGCCGGTGCAA GCCGGAGGAA GGTGGGGATGACGTCAAATCATCATGCCCCTTATGACCTGGGCTACACACGTGCTACAAT GGATAGTACA AAGGGTCGCGAAGCCGCGAGGTGGAGCTAATCCCATAAAACTATTCTCAGTTCGGATTGT AGGCTGCAAC TCGCCTACATGAAGCCGGAATCGCTAGTAATCGTGGATCAGCATGCCACGGTGAATACGT TCCCGGGCCT TGTACACACCGCCCGTCACACCACGAGAGTTTGTAACACCCGAAGTCGGTAGGGTAACCT TTATGGAGCC AGCCGCCGAAGGTGGGACAGATAATTGGGGTGAAGTCGTAACAAGGTAGCCGTATCGGAA GGTGCGGCTG GATCACCTCCTTTCTTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccac cgagatctac actctttccctacacgacgctcttccgatctCTAACTTATGTGTCGTCGGTagatcggaa gagcacacgt ctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACT C

16S rRNA Gene V4 Primer editing standard sequences lllum ina adapters in lower case

>98_ECO_V4_WT (SEQ ID NO: 105)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAT GACATCAGAATTGAGTGCagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>99_Eco_V4_10_A (SEQ ID NO: 106)

AAGAAGCACCGGCTAACTCCGTGCCAGCAACCGCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCG ACACTGCCAGTGTCACTCagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>100_ECO_V4_10_T (SEQ IDNO:107)

AAGAAGCACCGGCTAACTCCGTGCCAGCATCCGCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGA AGTGGACTTGCTTATACGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>101_ECO_V4_10_C (SEQIDNO:108)

AAGAAGCACCGGCTAACTCCGTGCCAGCACCCGCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGG AGGCGTTGATTGGCGGCTagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>102_ECO_V4_11_A(SEQIDNO:109)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGACGCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGA CGACCAATAATGAACTTGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>103_ECO_V4_11_T (SEQ ID NO:110)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGTCGCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCC CGGAAACAAATCCGGGCTagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>104_ECO_V4_11_G(SEQIDNO:111)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGGCGCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatctAC TTAAAGACCATTTGATGAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>105_Eco_V4_12_A (SEQ ID NO: 112)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCAGCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCG GTGCCAGACTTAAGTTTGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>106_ECO_V4_12_T (SEQ ID NO: 113)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCTGCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAG CTCTCTGCTTAGATGACGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>107_Eco_V4_12_G (SEQ ID NO:l14)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCGGCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGG TTATTAGGATATGCCGTTagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>108_ECO_V4_13_A(SEQIDNO:115)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCACGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatctAC CAAATGCCGAGGTTTGACagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>109_ECO_V4_13_T (SEQ ID NO: 116)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCTCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCG ATATATAAAGGTAACCAAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>110_ECO_V4_13_C(SEQIDNO:117)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCCCGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatctGC TGTGGTCAGCTTATCATAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>111_ECO_V4_14_A(SEQIDNO:118)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGAGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatctCA GTGGTTACTCCAGCCCGAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>112_ECO_V4_14_T (SEQ ID NO:119)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGTGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAG TAATTGCACTAGAGGCGGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>113_Eco_V4_14_G (SEQ ID NO:120)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGGGGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTG CGCGGGTAAGCCCATAGAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>114_Eco_V4_15_A (SEQ ID NO:121)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCAGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTC ACACGGAGCGTGTTATACagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>115_ECO_V4_15_T (SEQ ID NO: 122)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCTGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTG ATGATGATCACACTACCTagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>116_ECO_V4_15_C(SEQIDNO:123)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCCGTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatctCG GTTCCAGGCTAAATGTCCagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>117_Eco_V4_16_A (SEQ ID NO:124)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGATAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGG AGCAGGAAGGATGGCGAGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>118_E CO_V4_16_T (SEQ ID NO: 125)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGTTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAC GTCCCTGAGCTACGTGTGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>l 19_Eco_v4_l 6_c (SEQ ID NO: 126)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGCTAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCA TTCCCTCACGGACCGGTAagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>120_Eco_V4_17_A (SEQ ID NO: 127)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGAAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTA CTCTCAGACGAGCGGCCCagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>121_Eco_V4_17_G (SEQ ID NO: 128)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGGAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAG GCAGGTCACATCGCTGACagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>122_E CO_V4_17_C (SEQ ID NO: 129)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGCAATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGA AACCTAGCTGTTGCAGGCagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>123_ECO_V4_18_T (SEQ ID NO: 130)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGT A ACGGAGGGTGCAAGCGTTAATCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTT AGCACTACGGTTCCGAACagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>124_Eco_V4_18_G(SEQ IDNO:131)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTGATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCT GCTCTCTGGTTTACAGGTagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>125_ECO_V4_18_C(SEQIDNO:132)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTCATACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatctCC CGTACACTACATCGGGTTagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>126_ECO_V4_19_T (SEQ ID NO: 133)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTATTACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTT CGCTCGACTTCCATTCGCagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>127_Eco_V4_19_G (SEQ ID NO:134)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAGTACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGT ATGCCTTGAGGCCCATAGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

>128_ECO_V4_19_C (SEQ ID NO:135)

AAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTACTACGGAGGGTGCAAGCGTTAA TCGGAATTAC TGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACC TGGGAACTGC ATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGA AATGCGTAGA GATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGC GAAAGCGTGG GGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGTCGACTAGGGATA ACAGGGTAAT GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGT ACTTTCAAGTAGATTGAGagatcggaagagcacacgtctgaactccagtcacAATCAGTC TCGTatctcg tatgccgtcttctgcttgTTGTCGACTC

ITS2 Primer editing standard sequences lllumina adapters in lower case

>129_Sce_ITS2_WT

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctTATA TCATCACTAT GGTAACagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>130_Sce_ITS2_09_A

CGGATCTCTTGGTTCTCGCATCGATGAAAAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctGGTT CATAATCGGA TACGAGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>131_Sce_ITS2_09_T

CGGATCTCTTGGTTCTCGCATCGATGAATAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctACTC ATGGTAAACC AGGCTGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>132_Sce_ITS2_09_C

CGGATCTCTTGGTTCTCGCATCGATGAACAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctCTGA AACCTTTCAC TGGCCCagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>133_Sce_ITS2_10_T

CGGATCTCTTGGTTCTCGCATCGATGAAGTACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctTTCC GAGCCCTGCA CTCTTGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>13 _Sce_ITS2_10_G

CGGATCTCTTGGTTCTCGCATCGATGAAGGACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctTTCA TTCCTGGTGA GAAGATagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>135_Sce_ITS2_10_C

CGGATCTCTTGGTTCTCGCATCGATGAAGCACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctGGTA CTGATTCGAA ACCAGTagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>136 See ITS2 11 T CGGATCTCTTGGTTCTCGCATCGATGAAGATCGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctAAGC ATAGCCGGCC CGAAGAagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>137_Sce_ITS2_ll_G

CGGATCTCTTGGTTCTCGCATCGATGAAGAGCGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctGTGC ATTAGTGGCT TCGACAagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>138_Sce_ITS2_ll_C

CGGATCTCTTGGTTCTCGCATCGATGAAGACCGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctATAC AGTCCTGCTC CCGTGCagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>139_Sce_ITS2_12_A

CGGATCTCTTGGTTCTCGCATCGATGAAGAAAGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctTCCC AAGTGGGATC AGTTTAagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>1 0_Sce_ITS2_12_T

CGGATCTCTTGGTTCTCGCATCGATGAAGAATGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctTCAA CACCCAGTGG ACGCATagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>1 1_Sce_ITS2_12_G

CGGATCTCTTGGTTCTCGCATCGATGAAGAAGGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctACTA GGAAGTCCGG ACCTATagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>1 2_Sce_ITS2_13_A

CGGATCTCTTGGTTCTCGCATCGATGAAGAACACAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctTCCT GGGAGGGAGG TCTCGTagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>1 3_Sce_ITS2_13_T

CGGATCTCTTGGTTCTCGCATCGATGAAGAACTCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctGAAG AATATACCTA CCGGAAagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>1 4_Sce_ITS2_13_C

CGGATCTCTTGGTTCTCGCATCGATGAAGAACCCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctCGGG CAGAGCGCTT ACGTACagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>1 5_Sce_ITS2_14_A

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGAAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctCGTG GAATATTTGG GTTCGGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>1 6_Sce_ITS2_14_T

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGTAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctGCAC CCGACAAGGG TTCGGGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>147_Sce_ITS2_14_G

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGGAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctAGAG CGTTCGTAAT ACCGGAagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>148_Sce_ITS2_15_T

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCTGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctTACG CCTGTCATCA TGACTAagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>149 See ITS2 15 G CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCGGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctACGC ATCACGCCTA CGACGGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>150_Sce_ITS2_15_C

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCCGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctGAGG GAGGATGACC GTAGGTagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>151_Sce_ITS2_16_A

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAACGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctCGAT AATATCATCC CGGACTagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>152_Sce_ITS2_16_T

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCATCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctATGA CGATCACTTT CTAGCTagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>153_Sce_ITS2_16_C

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCACCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctAGTA CTCGGTCCCT TCCTAGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>154_Sce_ITS2_17_A

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGAGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctATTC ATACCAGATA TCCCTCagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>155_Sce_ITS2_17_T

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGTGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctGACT GTGTGTTACT GCTGACagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>156_Sce_ITS2_17_G

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGGGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctAAAT ACTGTTTATA CGGTTGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>157_Sce_ITS2_18_A

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCAAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctGTAA ACAGTAAGGA GGCATCagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>158_Sce_ITS2_18_T

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCTAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctTTGC AGACAGGTGC GGGATGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

>159_Sce_ITS2_18_C

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCCAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGAGTACCCG CTGAACTTAA GCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGA GTCGACAAaa tgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctCTCA TTCAGCTTTG TTAAAGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgta tgccgtcttc tgcttgTTGTCGACTC

ITS2 process standard sequences lllumina adapters in lower case

>160_GC_dG_35-40_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTATGAAGGTTGAAAGCGTAATT AGGAATTATT GATAGTAAAGTGCACACTAGCGTTTTGTTAAATCTTAAGTGTAATCCCCGAGCTAAAACA AGGAATAACA TCTGATAATTACAAGATTGAAAATCGTATTTGGAGGTAGAATTCCAGGAGTAAAGGAGAA ATTAATAGTG TTCTGTAATAATACAAGTATCGTATGCAGCAACTAGGTCGAAGACTGATGATCAGGTGAG AAAGTGTTGG GAGCTAACTGAGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctGGC CATACTCGCAGTGGCTCagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>161_GC_dG_35-50_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTATGGAGGATTCAAACATAAAT CAGAATTACT GGGCTTAAAATAATCGCATTCTGTTTGTTAAGTAATATGTGTAATCCCCGGGCTAATCCT GGGAAATGCA TTTAATACTGGCAATCTAGAGTATAATAAAGGAGAATAGTATTTTAGTAGAAACAGAGAA TTGTTTAGAT ATTTGGAGGAATAAAGTTAGCATTTGCTGCCCCATGGACGAAAAATGATGCTCATATGCA AAAGCGTGGT GTACAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctGTG TCCTACCGCTATACCGCagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC >162_GC_dG_35-60_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAATGAAGAATGAATGCTTTAAT CGGTATTATT GGTCGTAAAACGCACGCAGGAGGATTGTTAAATTAGATGTGAAATCCACTGGCTTAACAT TAGATAAGCA TC GA ACAGGA AGC GA TTTCATATAAGAGGTTTGAAATCCAGATATAGCTTTGTAATTCGTAGAA ATCTGGATGATTACCGGTTATGAAGGCGGTCTCATGGATGAAATCTGATGCTAAAATACG AATGCGTGGT TATCAAATAATGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctGCT CGAACACACAGCCGGTAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>163_GC_dG_36-70_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAACTGATGGTGTAAACGTTTTT CGGTTTAACT GAACATAATGTTCACGAAAGATAAATGTTATTACATTTGATATTTATCTTGACTCAACAT GGGAACAGCA TATGATACAGGAAAACTTGAGTCTCGTAGAAGGGAGTTTAATTCCAGGTTAAGCTATTAT ATGATAATAG AACTGGAGAAAATCCGGTGTTGATGGCGGTTACTTGGATTTAGACTTACGTTCAGGAACA AAATCTTGTG GTGCTAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctCCA GGTGGGTAGGTCTTTGGagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>164_GC_dG_40-40_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAGGGTGATAGTGTTAAT CGTAATTACT AGGCGTTAAGCACAAACAGGTGGATTGTTAAGACAGATATGATATCCCAGGGCTTAAACA GGAAACTGCA AATGATACTAGCTAGCTTGAGACTCGAATATGGGGGTAGAATACCAGGATTAAAGATGAT TTACGTAGAG ATAAGGAGTATTACCGTTGTTAAAGGCGGCAACCTGAATTAATACTAACTAACAGGAAAG AAAGCGTGGT AAGAAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctGTC TCACTCAGTCAGTACGAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>165_GC_dG_40-50_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAGGGTGCAAATTTTAAT CGGAATTACT GGTAATAAAGCGCACGTAGTCGGTTTGTTAATTCATATTTGAATTCCTAAGTCTAAACCT AGTAACTACA TCTGATACTGGTAAACTTGAGTCTCTTAGAGGGGGATAGAATTATAGTTGTAGCGGTGAA ATTCGAAGAG TTCTGGAGTAATACCGGTAGCAAAGACGACCAACTGGACGAAGTCTGACGTTAAGATAAG AAAGTATGGG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctTGA TCTCGAGTGTCGTCACAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>166_GC_dG_40-60_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACAGAGGGTGTAAGCGTTAAT AGAAATTACT GGGCGTAAAAAGCACGCAGGCGGTTTGTAAAGTTAGATATAAATTCATTGGACTCTAACT GAGAACTGCA TTTGATACTTTCAAGCTTTTGTCTCGTTGAGGAGGGTAGAAATTCAGGAGTTGCGATGAT ATGCTTAGAG ATCTTGAGGAATTCCGGTGTCGAATGCAAACTCCTGGACGAAGACTAACGTTCAGTTGCA AAAGAGTGGA AATTAAACATTGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctGTG TAGCTAACTTAAGGTGGagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>167 GC dG 40-70 ITS2 CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTAAGGAATGAGCTAGCGTTAAT CGAAATTACT GGGTGTTTTGCTCATGCAGGAGATTAGTTTATTCAGATGTAAAAAACCCGGGTACAACCT GGGAATTGCA TCTGATACT A AAGCTAAATACTCGTAGAGGGAGGTAGTATTCCTGGTGTTGTGGTGAAATGTGTAGAG ATCTATATAATTACATGTTGCGAAGGCGGACCCAAGGACGAAGACTGATGCTCAGAAATT AAAACGTGGA AATCAAACTTGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctAAC GTTGTCCAGCCGTATGTagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>168_GC_dG_40-80_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTTCAGAATGTGTAAGCGTAATT CGAAATTTCT GAGAGTAAAGCGAATGCAGATGGTTTATTTTGTTAGAAGTGAAATCCCCGGGCTATACCT GGTTACTGCA TCTGTTACTGGTAAACTTGAAACTCGAAGAGGGTGATAATATTCCAGGTATTTAGGTTAA ATGTGTAGAT ATCTGGATGAATACTAGTGTCTAAGGCAGTCCACTGGACGTAGACTTACTCTCAGGTTCG AAAGCGTGGG GAACATTCATAGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctACC CGGACGGCGGTCAATAAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>169_GC_dG_45-50_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTAAGGAGGGTGCTTGCGTTAAA CAGAAATACT GGGTGTAAAACGTACGTAGACGGTTTATTAAGTAAGATGTGAAATCCCCGTGCTCAACCT AGGAACTGCA TATGATATTGGAAAACTTGAGACATATAGAGGGAGATAGAATACTTGGAGTAGCGTTGTA ATGCGTATAG ATTTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACAGACGCTCAGGTGCG AAATCGTTGT GATCAAATAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctTCA CTCGGGTTCTCGGCACGagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>170_GC_dG_45-60_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGAAGTGTGAAAACGTTATT AAGAATTACT GGGCGTAAAGCGTTCGCAGGCGGTTTGTTAAGTCATAAGTGAAATCCCCGGGCACAAACT GGGAACTGAA TCTGTAACTGACAAGCTTGAGTATCTTATAATGGGATAGAATTTAATGTGTAGCTGTGAA ATGCGTAGAG ATCTGGAGATATACCGGTGACTAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGCT AAATCGTGGG GAACAAAAAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctAAA CAACTCTGGCTCGATAAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>171_GC_dG_45-70_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAACTGAGGGTGCAAGCGTTAAT CGGAATTACT GTGCATAAAGCTCACGCATTCGTTTTGTTAAGTCAGATAATAAATCCTCGAGTTAAACCT GGGAAATGCA TCTGATACTGAAAAGCTTGATTCTCGTAGAGGGGTGTAGAATTCCAGGTGTAGCAGTAAA ATACGTAGAG ATCAGAATGAATTCCGGTGGTGAAGTCGGCCTACTGGACGAAGACTGACGCTAAGGTGCG AAAGCGTGGG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctCGA CAGTCCGACCGCAACAGagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>172_GC_dG_45-80_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAGTGTTAAATCGTTAAT AGGAATTACT GGGCGAAAAGTGCTAGCAGGCGGTTTGTTAAATCAGTTGTGAAATCCCTGGGCACAACCT GGGATCTGCA TCTGATTTTGGCAAGCTTTAGTCTATTAGAGGGGGGTAAATTTCCATGTGTAGATTTGAA ATGCGTTTAG ATCTGGAGGAATACCGGAGGTGAAGGCGATCCCCTGGACGTAGACTGAAGCTCAAGTGAG AAAGCTTGGA GTGCAAACTAGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctACA GATACGAGGGAGCAGGTagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>173_GC_dG_45-90_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAGGGTGCAAGCGATAAT AGTATT ACT GTGCGAAAAGCATACTTAGGAAGATTTTTATGTCAGATGTGAAATCCCCGGGCTTAACCT GGGAACTGCA TCTGATACTGACAAGTTTGAGACTCGTATAGGGGGGTAGAATTCCAGGTGTTGCAGTGAA AAGTGTAGAG ATCTGGAAGAATACCGGTGGCGAAGGTTGCCCCCTGTACGAATAATGACGCTATGGTGCG AAAGCATTGT GTGCAAACAAGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctAGT TTCCAGTCGGTTCTCACagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>174_GC_dG_50-60_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAAGGTGCAAGTGTTAAT CGGAATTACT GGGCATAAAGCGCACGAAGGCGGTATGTTAAGTTAGATGTGAAATCCCCGGGCTCAATCT GTGAACTGCA TCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTAGAATTCAAGGTGTAGCGGTGAA ATGCGTAGAG ATCTAGATGAATACCGGTGGCGAAGGAGGTCCCCTGGACGAAGACTGACACTCTGGTGCG AAATAGTGGG GAGCAAACAGAGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctCAC GTTTACTGACACGAAGCagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>175_GC_dG_50-70_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAGGGTGCAATCGTTAAT CGGATTTACT GGGCTTAAATCGCACGCAGGCGGTTTGTTAAGTCATATGTGAAAACCCCGGGCTCAACCT GGGAACTGCA TCTGATACTTGCAAGCTTGAGTCTCGTATAGGGAGGTAGAATTCCAGGTGTAGCGGTGAA ATGCGTAGAG ATCTGAAGTAATACCGGTAGCTAATACGGCCCACTGGACGAAGACTGACGCACAGGTGCT AAAGCGTGTG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctGAT ATAAGCAGCCTCCGCAAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>176_GC_dG_50-80_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAGTGTGCAAGCGTTAAT CGGAATTACT GGTCGTAAAGCGCACGTAGACGGTTTGTTAATTCAGATGTTAAATTCCAGGGCAAAACCT GGGAACTGCA TCTTATACTGGCAAGCTTGAGTCTCGTAGAGGGGGTTAGAATTCCAGGTGTAGCGGTGAA ATGTGTAAAG ATCTGGAGGAATACCGGTGTTGAAGGCGGCCTCCTGGACGAAGACTGACGCTCAGGTGCG AAAGCGTGGG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctGAA TTTAGTGAGCACGAAGGagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>177_GC_dG_50-90_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAGGGAACAAGCGTTAAT CGAAATAACT GGGCGTAAAGCGCTCGTAGGTTGTTTGTTAAGTCAGATGTGAAATCCCCGGGTTCAACCT GGGAACTGCA TCTGATTATGGCAAGCTTGAGTCTCGTAGAGGGGGGTATAATTCCAGTTGAAGCGGTAAA ATGCGTTGAG ATCTGGAGGTATACCGGTGGCGAAAGCGGCCCCATGGACGAAGACTGACGCTCATTTGCG AAATCGTTGG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctATT TAAGTGCAGCTATGTCCagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>178_GC_dG_49-100_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTTCTTCGGGTACTAGCGTTAAT TAGGATTTCT GCGCGTAAAGTGAACGCAGGCTTGTTGGTCAGTGAGATGTGTATTACAGGTACTTAACCT GTGAACCGCA TCTGATACTCGCAAGCCTGAGGCTCCTAGTGGGGGGTAGAAATCTATGTGTATCGTTGGA ACCCGTAAAC ATCTGTAGGATGGCATGTGTCCAAGGCAGCCCCCTGGTCTGAGACTGACAATCAGTTTCG AAAGCGTGGG GAGCAAACAGCGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctAAG AGGATAACTCCAGTCTGagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>179_GC_dG_55-70_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGGACGGAGGGTGCAAGCGTTAAT CGGAATAACT GGGCGTAAAGCGGTCGCAGGCGGTTTGTTAAGTCAGATGTGAATTCCCCGGGCTCAACGT GGGAACTTCA CCTAATACGGGCAAGCTTTAGAATCGTAGAGGGGGGTAGAATTATAGGTATAGCGGTGCA ATGCGAAGAG AGCTGGAGGAATCCCGGTGGAGAAGGCAGCCCCCTGGACGAAGACAGAAGCTCAGGGGCG AAACCGTGGG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctTGG GCAAGGTAATCCGTGCAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>180_GC_dG_55-80_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAGGGTGCAAGCGTTTAT CGGAATTACT GGGCGTAAAGCGCACGTAGGCGGTTTGTTAAGTCAGAAGTGAAATCCCCGGGCTCAACCT GGGAACTGCA TCTGATACTGGCTAGCATGAGTATCGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGTA ATGCGTAGAG ATCTAGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGTG AAAGCGAGGG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctCTT CCGTCTTTATCTAACTAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>181_GC_dG_55-90_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACTGAGGGTGCAAGCGTAAAT CGGAATTACT GGGCGTAAAGCGCAAGCAGGCAGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACCT GGGAACTGCA TCAGATACTGGCAAGCTTGAGTCTCGTTGAGGGGGGTAGAATTCCATGTGTAGCGGTGAA TTGCGTAGAG AACTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGCG AAAGCGTGGG GTGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctATG TAGGTGGAATACCCAAAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>182_GC_dG_55-100_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACTGAGGGTGCAAGCGTTAGT CGGAATTACT GGGCGTAAAGCGCACTCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACCT GGGGACTGCA TCTGATACTGGCAAGCCTGAGTCTCGTAGTGGGGGGTATAATTCCAGGTGTAGCGGTGAA ATGCGTAGAG ATCAGGAAGAAGTCCAGTTGTGAAGGCGGCCCCCTGGACGAAGACTGAGGCTCAGGTGCG AAAGCGTGGG GAGCAAACAAGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctTGG ATACCTAGACCCGAGGAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>183_GC_dG_55-110_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACCGAGGGTGCAAGAGTTAAG CGGAATTGCT GGCCGTAAAGCGCACACAGGCGCTTTGTCAAGTTAGATGCGAAATCCCCAGGTTCAACCT GGGAACTGCA TCTGATACTGGCAAGCTTAACTCTCGTAGAGGGGGTTACAATTCCAGGTGGAGCGCTGAA ATGCGTAGAC ATCTGGAGGAATACCGGTGGCGAAGGCGACCCCCTGGACGAAGACTCCCGCTTAGGTTCG CAAGCGGGGG GAGCAAACAGAGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctTTG TGCTAACAAGCCGCGGAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>184_GC_dG_60-80_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAGGGGGCAAGCGTTGAT CGGAATTACC CGGCGTGAAGCGCACGCAGGCGGTTTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACGT GGGAACTGCA TGTGATACTGGCACGCTTGAGTCTCGCAGAGGGGGGGAGAATTGCAGGGGTAGCGGTGAA AGGCGTAGAG ATCTGGAGGAATACCGGTGGCGAGGGCGGCCCCCTGGAGGAAGACTGACGCTCAGGTGCG AAAGCGTGGC GAGCAAACACGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctGAA TTTGTCCAGTCACGCATagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>185_GC_dG_60-90_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAGGGTGCAAGCGTTAAT CGGAATGACT GGGCGTAAAGCGCACGCAGGCGCTGTGTTAAGTCAGATGTGGAATCCCCGGGCTCAACCT GGGAACTGCA TCTGATACTGGCAAGGTTGAGTCTGGTGGAGGGGGGGAGAATCCCAGGTGTGGCGGTGAA ATGCGGAGAG AGCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGACGGAGACTGACGCTCAGGTGCG AAAGCGTGGG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctAGG GAAGATAGGAGGCTCCCagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>186_GC_dG_60-100_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAGGGTGCAAGCGGTAAT CGGAATTACT GGGCGTAAACCGCACGCAGGCGGTCTGTTAAGTCAGATGTGAAATCCCCGGGCTCAACCG GGGAACTGCA TCTGATACTGGCAAGCTTGAGCCTCGTAGAGGGGGGTAGAAGTCCGGGTGTAGCGGTGAA CTGCGTAGAC ATCTGGAGGAATACCGGGGGCGAAGGCGGCCCCCTGGACGAAGACTGACGGGCAGGTGCG ACAGCGTGGG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctTAT TGTACGGTACAGGTTCAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>187_GC_dG_60-110_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTCCGGAGGGTGCGAGCGTTAAT CGGAATTACT GCGCGTAAAGCGCACGCAGGCGGTTTCTTAAGTCAGCTGTGAAATCCCCGGGCTCACCCC GGGAACTGCA TCTGATACTCGCAACCTTGAGTCTCGTAGAGGGGGCCAGAATTCCAGGTGTAGCGGTGAA ATGCGTAGAG ATCCGGAGGAATACCGGGGGCGAAGGCGGCCCCCTGGACGAAGACTGACGCTCAGGTGCG AAAGCGCGGG GAGCAAACCGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctAAG AAGGCACCTGAAGCTCAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC >188_GC_dG_60-120_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTCCGGAGCGTGCAAGCGTCAAT CCGCATTAGT GGGCGTAAAGCGCACGCAGGCGGGTTGTTAAGTCAGATGTGAAATCCCGGGGCTCAACCT GGGAACTGCA TCTGAGACTGGCAAGCTTGAGTCTCGTACAGGGGGGTAGAATTCCAGGTCTGGCGCTGAA ATGCGTAGAG ATCTGGAGGCAGACCGGTCGCGAAGGCGGCCCCCTGCACGACGAGTGACCCTCAGGCGCG AAAGCGTGGG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctTGC CGACGTTCGACCCGTTAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>189_GC_dG_65-90_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGCAGGGTGCAAGCGTTAAT CGGAAT ACT GGGCGGAAGGCGCACGCACGCGGCCTGTCAAGTGAGATGTGACATCCCCGGGCTCAACCT CGGAACTGCG TCTGATACTGGCACGCTTGCGTCGCGTACAGGGGGCGAGAATTCCAGGGGGAGGGGTGAA ATGCGTGGCG ATCCGGAGGAATACCGGTGGCGAAGGCGGCCCCCGGGACGAAGACGGCCGCTCAGGGGCC AAAGCGTGGG GGGCAGACACGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctTAA ATGATCCGCCTGGTCAGagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>190_GC_dG_65-100_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGGGGGTGCAAGCGTGGAT CGGAATTGCT GGGCGTCAAGGGCACGCAGGCGGTTTGTGAAGTCAGACGCGAGAGCCCCGGGCTCCACCG GGGAACTGCA TCTGATACTGGCAAGCTTGAGTCTCGTAGACGGGGGCAGAATCCCGGGTGTGGCGGGGAA ATGCGTAGAG ATCTGGAGGGATCCCGGTGGCGAAGGCGGCCCCCTGGACGGAGACTGACGCTCAGGTGCG GAAGCGGGGG GACCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctGCA GCTCCGCACTAAGCGACagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>191_GC_dG_65-110_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGCAGGGTGCCAGCGTTAAT CGGAATTACT GGGCGTGAAGCGCACGCGGGCGGTTTGTTAAGTCAGACGTGAAATCCCCGCGCTCAACCT GGGAGCCGCA CCTGAGAGTGGCGAGCTTGGGTGTCGTAGAGGGGGGTAGACTTCCAGGTGTAGCGGTGAA ATGCGCAGCG CTCTGGCGGGATACCGCTGGCGAAGGCGGCCCCCCGGGCGAAGCCTGCCGCTCAGGGGCG AAAGCGTGGG GAGCACACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctTAC TTGGGTCCTAGGGACCAagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>192_GC_dG_65-120_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGAGCGTGCAAGCGCTAAC CGGAATTACT GCGCGTCAAGCGGACGCAGGCGGTTCGTTAAGTCAGGTGTGAGATCCCCGGGGTCAACCG GGGACCTGCA TCTGACACCGGCACGCTTGAGTCCCGTAGGGGCGGGTAGAATCCCCGGTGTAGCGGTGCG AGCCGTAGCG ATCCGGAGGAATACCGGTGGCGACGGCCGCCCCCTGGACGAAGGCTGACGCTGAGGTGCG AAAGCGTGGG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctCGG ATTTACGATAGTAGGACagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>193 GC dG 65-130 ITS2 CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACCGCCGGTGCACGCGTTAAT CGGAATTACT GGGGGTACAGCGCGCGCGGGCGGTTTGTTAGCTCCGGTGTGAAGTCCCGGGGCTCAACCT GGGAACTCCA GCGGACACTGGCAAGCCTGAGTCTCGTCCCGGGGGGGAGAGTTCCAGCTGTAGCGGTGAC GTCCCTGGAG ATCTCGGGGAATACGGGTGGCCAAGGCCGCCCCCTCGAGGAGGAGTCACGCTGAGGCGCG AAAGCGTGGG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctTTA TCACTGTAGACGGGAATagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>194_GC_dG_70-100_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTGCGGGGGGTGGAAGCGGCGAG CGGACTGGCG GGGCGCCAAGCGCGCGCCGGCGGCTTGCTAGGTCAGATGTGAGGTGCCCGGCCTCAACCT GGGAACTGCA GGTGATACTGGGCAGCCGGAGTCGGGTAGACGGGGGTACAATGCCAGGTGTAGCGGGGCA ACGGGTAGCG ATGTGGGGGAATACCGGTGGCGAACGGGGCCCCCCGGACGAAGGCTGGCGCTCGGGTGCC ACAGCGTGGG GAGCAAACAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctTAA GTTCTTATGCAGCTATTagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>195_GC_dG_70-110_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTGCGGAGGGTGCACGGGGTAAC GGGAATGCCT GGGCGCCCAGCCGACGGAGCCGGTTCGGTAAGTCAGAGGTGAACGCGCCGGGCTCAACCC GCGAACTGCC GCTGATACCGGGGCGCTTCCGTCTCGTAGAGGGGGGTCGAATTCCAGGTGTGGCGCTGAA GTCCCGAGAG CTCTGGAGGAAGCGCGGTGGCGAGGGCGCCCGCCCGGACCAAGACTGGCGGCCAGGTGCG AAAGCGCGGG GAGCGAACGGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctTGA CAGAGAGACCTCCCTACagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>196_GC_dG_70-120_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGCACGGAGGGCGCAAGCGTTAAT CGGAACCACT GGCCCTAAAGCGGCCCCAGGCGGGTCGTGAGGTCAGATGTGAAACCGCCGGGGTCAACCG GGGGGGGGCG GCTGACACTGGCGAGCCTGGGTCTCGTACACGGGGGCAGACCTCCAGGTGTCCCGCTGAG GCGCGTGGAG ATCCGGAGGAGTACCGGTGGGGACGCCGGCCCCCTCGAGGCAGACTGACGCGCAGGTGCG AAAGCGCGGG GAGCAAACGGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctCCT ACGTTAATGCGCAAATTagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>197_GC_dG_70-130_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACGGCGGGTGCCAGCCTTAAT CGGAATGCCC GGGCGCACAGCGGACGCGGGCGGTTCGTTAAGCCGCAGGCGAGATCCCCGGGCCCAACCT CGGCACGGCG TCTGACACTGGCGAGGTTGAGTCTCGGAGAGGGGGGTAGGATTCCAGGTCCACCGGTGGA ACCCCTAGAG CTCTGGGGGACTACCGGTGGCCCAGGCCGCGGCCTGGACGAACGCTGGCGCTCAGGTCCG CAAGCCTGCG GCGCACACGGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctAAC CCATGTAAAGAGTTATCagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>198_GC_dG_70-140_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGTACCGCGGGTGCAAGCGTTAAT GCGGCTTACT GGGCGTAAAGCGGACCCCGGCGGTTTGTGAGGTCACATGTGAAGCCCCCGCCCTCCGCCT GGGAACTGCG TCTGATACTGGCGGGCTCGGGGCCCGTACAGGGGGGTAGAATCCCAGGTGGAGGGCGGAA CCGGGTGCCG AGCTGCAGGAAGGCCGGCGGCGAAGCCGGCCCCCCGGGCGGAGACTGACGCCCAGGGGCG CGACCGTGGG GAGCAAGCAGGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctCGT GTGTATCTCTAGCCTTCagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

>199_Size_213_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCATATCAATAAGCGGAGG AAAAGAAACC AACCGGGATTGTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgag atctacactc tttccctacacgacgctcttccgatctGTTGTCTCTTAGGCCCTCAGagatcggaagagc acacgtctga actccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>200_Size_243_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATG AGTCGACAAa atgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctACG CAATGTTTCG ATGAGCTagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgt atgccgtctt ctgcttgTTGTCGACTC

>201_Size_273_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTGCATATCAATAAGCGGAGGAAAAGAAACC AACCGGGATT GTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactc tttccctaca cgacgctcttccgatctCACCTTTGGAGAATGTCACCagatcggaagagcacacgtctga actccagtca cAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>202_Size_303_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TGCATATCAA TAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGAGTCGACAAa atgatacggc gaccaccgagatctacactctttccctacacgacgctcttccgatctAGGACCCGACAAG CACACGTaga tcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtctt ctgcttgTTG TCGACTC

>203_Size_333_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATT GTAGGGATAA CAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctaca cgacgctctt ccgatctTTTAGCGGTGCGAGTGATCAagatcggaagagcacacgtctgaactccagtca cAATCAGTCT CGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>204_Size_363_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGGCATATCAA TAAGCGGAGG AAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggc gaccaccgag atctacactctttccctacacgacgctcttccgatctTCTCGCAATCGACATGACCGaga tcggaagagc acacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTG TCGACTC

>205_Size_403_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGCCTTAGTAA CGGCGAGTGA AGCGGCAAAAGCTCAAATTTGGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATT GTAGGGATAA CAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctaca cgacgctctt ccgatctAGTGACAGTGTTGAGAGGATagatcggaagagcacacgtctgaactccagtca cAATCAGTCT CGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>206_Size_ 33_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGCCTTAGTAA CGGCGAGTGA AGCGGCAAAAGCTCAAATTTGAAATCTGGTACCTTCGGTGCCCGAGTTGTAGCATATCAA TAAGCGGAGG AAAAGAAACCAACCGGGATTGTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggc gaccaccgag atctacactctttccctacacgacgctcttccgatctCCAAAGCACGTACGGGAAGTaga tcggaagagc acacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTG TCGACTC

>207_Size_463_ITS2

CGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAAT TGCAGAATTC CGTGAATCATCGAATCGCCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATG CCTGTTTGAG CGTCATTTCCTTCTCAAACATTCTGTTTGGTAGTGAGTGATACTCTTTGGAGTTAACTTG AAATTGCTGG CCTTTTCATTGGATGTTTTTTTTCCAAAGAGAGGTTTCTCTGCGTGCTTGAGGTATAATG CAAGTACGGT CGTTTTAGGTTTTACCAACTGCGGCTAATCTTTTTTTATACTGAGCGTATTGGAACGTTA TCGATAAGAA GAGAGCGTCTAGGCGAACAATGTTCTTAAAGTTTGACCTCAAATCAGGTAGCCTTAGTAA CGGCGAGTGA AGCGGCAAAAGCTCAAATTTGAAATCTGGTACCTTCGGTGCCCGAGTTGTAATTTGGAGA GGGCAACTTT GGGGCCGTTCCGCATATCAATAAGCGGAGGAAAAGAAACCAACCGGGATTGTAGGGATAA CAGGGTAATG AGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctctt ccgatctGTA CCCGAGGTTCCATACTCagatcggaagagcacacgtctgaactccagtcacAATCAGTCT CGTatctcgt atgccgtcttctgcttgTTGTCGACTC

Synthetic standard sequences - Full-length 16S rRNA gene tests (select human gut microbes) lllumina adapters in lower case

>208_Bov_Full_TAG_TCT_CAT

TTACAATGAAGAGTTTGATCCTGGCTCAGGATGAACGCTAGCTACAGGCTTAACACATGC AAGTCGAGGG GCAGCATTTTAGTTTGCTTGCAAACTGAAGATGGCGACCGGCGCACGGGTGAGTAACACG TATCCAACCT GCCGATAACTCCGGAATAGCCTTTCGAAAGAAAGATTAATACCGGATAGCATACGAATAT CGCATGA AT TTTTATTAAAGAATTTCGGTTATCGATGGGGATGCGTTCCATTAGTTTGTTGGCGGGGTA ACGGCCCACC AAGACTACGATGGATAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGG TCCAAACTCC TACGGGAGGCAGCAGTGAGGAATATTGGTACAATAGTGGGCGAGAGCCTGAACCAGCCAA GTAGCGTGAA GGATGAAGGCTCTATGGGTCGTAAACTTCTTTTATATGGGAATAAAGTTTTCCACGTGTG GAATTTTG A TGTACCATATGAATAAGGATCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAT CCGAGCGT A TCCGGATTTATTGGGTTTAAATCTGGGAGCGTAGGTGGATTGTTAAGTCAGTTGTGAAAG TTTGCGGCTC AACCGTAAAATTGCAGTTGAAACTGGCAGTCTTGAGTACAGTAGAGGTGGGCGGAATTCG TGGTGTAGCG GTGAAATGCTTAGATATCACGAAGAACTCCGATTGCGAAGGCAGCTCACTAGACTGTTAC TGACACTGAT GCTCGAAAGTGTAGGTATCAAACAGGATTAGATACCCTGGTAGTCCACACAGTAAACGAT GAATACTCGC TGTTTGCGA A ACAGTAAGCGGCCAAGCGAAAGCATTAAGTATTCCACCTGGCATGGTACGCCGGCAAC GGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGAGGAACATGTGGTTTAATTC GATGATACGC GAGGAACCTTACCCGGGCTTAAATTGCAACAGAA A ATTGGAAACAGTATAGCCGTAAGGCTGTTGTGA AGGTGCTGCATGGTTGTCGTCAGCTCGTGCCGTGAGGTGTCGGCTTAAGTGCCATAACGA GCGCAACCCT TATCTTTAGTTACTAACAGGTTATGCTGAGGACTCTAGAGAGACTGCCGTCGTAAGATGT GAGGAAGGTG GGGATGACGTCAAATCAGCACGGCCCTTACGTCCGGGGCTACACACGTGTTACAATGGGG GGTACAGAAG GCAGCTACACGGCGACGTGATGCTAATCCCAAAAACCTCTCTCAGTTCGGATCGAAGTCT GCAACCCGAC TTCGTGAAGCTGGATTCGCTAGTAATCGCGCATCAGCCATGGCGCGGTGAATACGTTCCC GGGCCTTG A CACACCGCCCGTCAAGCCATGAAAGCCGGGGGTACCTGAAGTACGTAACCGCAAGGAGCG TCCTAGGGTA AAACTGGTAATTGGGGCTAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGCGGCTGGAAC ACCTCCTTTC TTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactc tttccctaca cgacgctcttccgatctCTCAGCCAATGAGAAGGAGCagatcggaagagcacacgtctga actccagtca cAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>209_Pdi_Full_TAG_TCT_CAT

ACAACGAAGAGTTTGATCCTGGCTCAGGATGAACGCTAGCGACAGGCTTAACACATGCAA GTCGAGGGGC AGCGGGGTGTAGCAATACACCGCCGGCGACCGGCGCACGGGTGAGTAACGCGTATGCAAC TTGCCTATCA GAGGGGGATAACCCGGCGAAAGTCGGACTAATACCGCATGAAGCAGGGATCCCGCATGGG AATATTTGCT AAAGATTCATCGCTGATAGATAGGCATGCGTTCCATTAGGCAGTTGGCGGGGTAACGGCC CACCAAACCG ACGATGGATAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGTACTGAGACACGGACCAAA CTCCTACGGG AGGCAGCAGTGAGGAATATTGGTACAATAGTGGGCGTAAGCCTGAACCAGCCAAGTCGCG TGAGGGATGA AGGTTCTATGGATCGTAAACCTCTTTTATAAGGGAATAAAGTGCGGGACGTGTCCCGTTT TGTATGTACC TTATGAATAAGGATCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGATCCGAGC GTTATCCGGA TTTATTGGGTTTAAATCTGGGTGCGTAGGCGGCCTTTTAAGTCAGCGGTGAAAGTCTGTG GCTCAACCAT AGAATTGCCGTTGAAACTGGGGGGCTTGAGTATGTTTGAGGCAGGCGGAATGCGTGGTGT AGCGGTGAAA TGCATAGATATCACGCAGAACCCCGATTGCGAAGGCAGCCTGCCAAGCCATTACTGACGC TGATGCACGA AAGCGTGGGGATCAAACAGGATTAGATACCCTGGTAGTCCACGCAGTAAACGATGATCAC TAGCTGTTTG CGATACACTGTAAGCGGCACAGCGAAAGCGTTAAGTGATCCACCTGGCATGGTACGCCGG CAACGGTGAA ACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGAGGAACATGTGGTTTAATTCGATGAT ACGCGAGGAA CCTTACCCGGGTTTGAACGCATTCGGACCGAGGTGGAAACACCTTTTCTAGCAATAGCCG TTTGCGAGGT GCTGCATGGTTGTCGTCAGCTCGTGCCGTGAGGTGTCGGCTTAAGTGCCATAACGAGCGC AACCCTTGCC ACTAGTTACTAACAGGTTAGGCTGAGGACTCTGGTGGGACTGCCAGCGTAAGCTGCGAGG AAGGCGGGGA TGACGTCAAATCAGCACGGCCCTTACATCCGGGGCGACACACGTGTTACAATGGCGTGGA CAAAGGGAGG CCACCTGGCGACAGGGAGCGAATCCCCAAACCACGTCTCAGTTCGGATCGGAGTCTGCAA CCCGACTCCG TGAAGCTGGATTCGCTAGTAATCGCGCATCAGCCATGGCGCGGTGAATACGTTCCCGGGC CTTGTACACA CCGCCCGTCAAGCCATGGGAGCCGGGGGTACCTGAAGTCCGTAACCGCGAGGATCGGCCT AGGGTAAAAC TGGTGACTGGGGCTAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGCGGCTGGAACACCT CCTTTAGGGA TAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccct acacgacgct cttccgatctCCCACAAAGATCGCGCCGGCagatcggaagagcacacgtctgaactccag tcacAATCAG TCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>210_Fpr_Full_TAG_TCT_CAT

GAGAGTTTGATTCTGGCTCAGGACGAACGCTGGCGGCGCGCCTAACACATGCAAGTCGAA CGAGCGAGAG AGGAGCTTGCTTTCTCAAGCGAGTGGCGAACGGGTGAGTAACGCGTGAGGAACCTGCCTC AAAGAGGGGG ACAACAGTTGGAAACGACTGCTAATACCGCATAAGCCCACGACCTGGCATCGGGTAGAGG GAAAAGGAGC AATCCGCTTTGAGATGGCCTCGCGTCCGATTAGCTAGTTGGTGAGGTAACGGCCCACCAA GGCGACGATC GGTAGCCGGACTGAGAGGTTGAACGGCCACATTGGGACTGAGACACGGCCCAGACTCCTA CGGGAGGCAG CAGTGGGGAATATTGCAACAATAGTGGGGGAAACCCTGATGCAGCGACGCCGCGTGGAGG AAGAAGGTCT TCGGATTGTAAACTCCTGTTGTTGAGGAAGATAATGACGGTACTCAACAAGGAAGTGACG GCTAACTACG TGCCAGCAGCCGCGGTAAAACGTAGGTCACAAGCGTTGTCCGGAATTACTGGGTGTAAAT CTGGGAGCGC AGGCGGGAAGGCAAGTTGGAAGTGAAATCCATGGGCTCAACCCATGAACTGCTTTCAAAA CTGTTTTTCT TGAGTAGTGCAGAGGTAGGCGGAATTCCCGGTGTAGCGGTGGAATGCGTAGATATCGGGA GGAACACCAG TGGCGAAGGCGGCCTACTGGGCACCAACTGACGCTGAGGCTCGAAAGTGTGGGTAGCAAA CAGGATTAGA TACCCTGGTAGTCCACACTGTAAACGATGATTACTAGGTGTTGGAGGATTGACCCCTTCA GTGCCGCAGT TAACACAATAAGTAATCCACCTGGCATGGTACGACCGCAAGGTTGAAACTCAAAGGAATT GACGGGGGCC CGCACAAGCAGTGGAGTATGTGGTTTAATTCGACGCAACGCGAAGAACCTTACCAAGTCT TGACATCCTG CGACGCGCATAGAAATATGTGTTTCTTCGGGACCAGAGACAGGTGGTGCATGGTTGTCGT CAGCTCGTGT CGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTATGGTCAGTTACTACGCAA GAGGACTCTG GCCAGACTGCCGTTGACAAAACGGAGGAAGGTGGGGATGACGTCAAATCATCATGCCCTT TATGACTTGG GCTACACACGTACTACAATGGCGTTAAACAAAGAGAAGCAAGACCGCGAGGCGAGCAAAA CTCAGAAACT TCGTCCCAGTTCGGACTGCAGGCTGCAACTCGCCTGCACGAAGTCGGAATTGCTAGTAAT CGCAGATCAG CATGCTGCGGTGAATACGTTCCCGGGCCTGTACACACCGCCCGTCACACCATGAGAGCCG GGGGGACCCG AAGTCGGTAGTCTAACCGCAAGGAGGACGCCGCCGAAGTAAAACTGGTGATTGGGGTGAA GTCGTAACAA GGTAGCCGTAGAGAACCTGCGGCTGGATCACCTCCTTTAGGGATAACAGGGTAATGAGTC GACAAaatga tacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctCAAGTCA ATGAAAGCGC ATGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgc cgtcttctgc ttgTTGTCGACTC

>211_Rin_Ful1_ AG_ CT_CAT

TTGATCCTGGCTCAGGATGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGAAGC ACTTTTACAG ATTTCTTCGGAATGAAGTTTTAGTGACTGAGTGGCGGACGGGTGAGTAACGCGTGGGTAA CCTGCCTCAC ACAGGGGGATAACAGTTGGAAACGGCTGCTAATACCGCATAAGCGCACAGTACCGCATGG TACAGTGTGA AAAACTCCGGTGGTGTGAGATGGACCCGCGTCTGATTAGCTAGTTGGCAGGGCAACGGCC TACCAAGGCG ACGATCAGTAGCCGACCTGAGAGGGTGACCGGCCACATTGGGACTGAGACACGGCCCAAA CTCCTACGGG AGGCAGCAGTGGGGAATATTGCAACAATAGTGGGGGAAACCCTGATGCAGCGACGCCGCG TGAGCGAAGA AGTATTTCGGTATGTAAAGCTCTATCAGCAGGGAAGAAGAAATGACGGTACCTGACTAAG AAGCACCGGC TAAATACGTGCCAGCAGCCGCGGTAATACGTATGGTGCAAGCGTTATCCGGATTTACTGG GTGTAAATCT GGGAGCGCAGGCGGAAGGCTAAGTCTGATGTGAAAGCCCGGGGCTCAACCCCGGTACTGC ATTGGAAACT GGTCATCTAGAGTGTCGGAGGGGTAAGTGGAATTCCTAGTGTAGCGGTGAAATGCGTAGA TATTAGGAGG AACACCAGTGGCGAAGGCGGCTTACTGGACGATAACTGACGCTGAGGCTCGAAAGCGTGG GGAGCAAACA GGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAATACTAGGTGTCGGAAAGCACA GCTTTTCGGT GCCGCCGCAAACGCATTAAGTATTCCACCTGGCATGGTACGTTCGCAAGAATGAAACTCA AAGGAATTGA CGGGGACCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGAAGCAACGCGAAGAACCTTA CCAAGTCTTG ACATCCTTCTGACCGGACAGTAATGTGTCCTTTCCTTCGGGACAGAAGTGACAGGTGGTG CATGGTTGTC GTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTATCCCCA GTAGCCAGCG GTTCGGACGGGCACTCTGAGGAGACTGCCAGGGATAACCTGGAGGAAGGTGGGGATGACG TCAAATCATC ATGCCCCTTATGACTTGGGCTACACACGTGCTACAATGGCGTAAACAAAGGGAAGCGAGA CCGTGAGGTG GAGCAAATCCCAAAAATAACGTCTCAGTTCGGACTGTAGTCTGCAACCCGACTACACGAA GCTGGAATCG CTAGTAATCGCAGATCAGAATGCTGCGGTGAATACGTTCCCGGGTCTTGTACACACCGCC CGTCACACCA TGGGAGTTGGAAATGCCCGAAGTCAGTGACCCAACCGCAAGGAGGGAGCTGCGAAGGCAG GTTAGGGA A ACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctac acgacgctct tccgatctGTTAGGGAGCTAGTTTAGGCagatcggaagagcacacgtctgaactccagtc acAATCAGTC TCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>212_Bbi_Full_TAG_TCT_CAT

TTTTTGTGGAGGGTTCGATTCTGGCTCAGGATGAACGCTGGCGGCGTGCTTAACACATGC AAGTCGAACG GGATCCATCAAGCTTGCTTGGTGGTGAGAGTGGCGAACGGGTGAGTAATGCGTGACCGAC CTGCCCCATG CTCCGGAATAGCTCCTGGAAACGGGTGGTAATGCCGGATGTTCCACATGATCGCATGTGA TTGTGGGAAA GATTCTATCGGCGTGGGATGGGGTCGCGTCCTATCAGCTTGTTGGTGAGGTAACGGCTCA CCAAGGCTTC GACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACATTGGGACTGAGATACGGCCCAGACT CCTACGGGAG GCAGCAGTGGGGAATATTGCAACAATAGTGGGCGCAAGCCTGATGCAGCGACGCCGCGTG AGGGATGGAG GCCTTCGGGTTGTAAACCTCTTTTGTTTGGGAGCAAGCCTTCGGGTGAGTGTACCTTTCG AATAAGCGCC GGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGCGCAAGCGTTATCCGGATTTAT TGGGCG AAA TCTGGGCTCGTAGGCGGCTCGTCGCGTCCGGTGTGAAAGTCCATCGCTTAACGGTGGATC TGCGCCGGGT ACGGGCGGGCTGGAGTGCGGTAGGGGAGACTGGAATTCCCGGTGTAACGGTGGAATGTGT AGATATCGGG AAGAACACCGATGGCGAAGGCAGGTCTCTGGGCCGTCACTGACGCTGAGGAGCGAAAGCG TGGGGAGCGA ACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGGTGGACGCTGGATGTGGGGCACG TTCCACGTGT TCCGTGTCGGAGCTAACGCGTTAAGCGTCCCGCCTGGCATGGTACGGCCGCAAGGCTAAA ACTCAAAGAA ATTGACGGGGGCCCGCACAAGCGGCGGAGCATGCGGATTAATTCGATGCAACGCGAAGAA CCTTACCTGG GCTTGACATGTTCCCGACGACGCCAGAGATGGCGTTTCCCTTCGGGGCGGGTTCACAGGT GGTGCATGGT CGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGCC CCGTGTTGCC AGCACGTTATGGTGGGAACTCACGGGGGACCGCCGGGGTTAACTCGGAGGAAGGTGGGGA TGACGTCAGA TCATCATGCCCCTTACGTCCAGGGCTTCACGCATGCTACAATGGCCGGTACAGCGGGATG CGACATGGCG ACATGGAGCGGATCCCTGAAAACCGGTCTCAGTTCGGATCGGAGCCTGCAACCCGGCTCC GTGAAGGCGG AGTCGCTAGTAATCGCGGATCAGCAACGCCGCGGTGAATGCGTTCCCGGGCCTTGTACAC ACCGCCCGTC AAGTCATGAAAGTGGGCAGCACCCGAAGCCGGTGGCCTAACCCCTTGTGGGATGGAGCCG TCTAAGGTGA GGCTCGTGATTGGGACTAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGCGGCTGGATCA CCTCCTTTCT ACGGAGTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatcta cactctttcc ctacacgacgctcttccgatctGAGCACGGGAATTACTCCGAagatcggaagagcacacg tctgaactcc agtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>213_Hpa_Full_TAG_TCT_CAT

CAGATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGGTAACAGGAAGAAGCT TGCTTCTTTG CTGACGAGTGGCGGACGGGTGAGTAATGCTTGGGTATCTGGCTTATGGAGGGGGATAACT ACGGGAAACT GTAGCTAATACCGCGTAGTATCGGAAGATGAAAGTGTGGGACCGCAAGGCCACATGCCAT AGGATGAGCC CAAGTGGGATTAGGTAGTTGGTGAGGTAATGGCTCACCAAGCCGACGATCTCTAGCTGGT CTGAGAGGAT GACCAGCCACACCGGGACTGAGACACGGCCCGGACTCCTACGGGAGGCAGCAGTGGGGAA TATTGCGACA A AGTGGGGGCAACCCTGACGCAGCCATGCCGCGTGAATGAAGAAGGCCTTCGGGTTGTAAA GTTCTTTC GGTAGCGAGGAAGGCATTTAGTTTAATAGACTAGATGATTGACGTTAACTACAGAAGAAG CACCGGCTAA CTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCGAGCGTTAATCGGAATAACTGGGCG TAAATCTGGG CACGCAGGCGGACTTTTAAGTGAGGTGTGAAAGCCCCGGGCTTAACCTGGGAATTGCATT TCAGACTGGG AGTCTAGAGTACTTTAGGGAGGGGTAGAATTCCACGTGTAGCGGTGAAATGCGTAGAGAT GTGGAGGAAT ACCGAAGGCGAAGGCAGCCCCTTGGGAATGTACTGACGCTCATGTGCGAAAGCGTGGGGA GCAAACAGGA TTAGATACCCTGGTAGTCCACGCTGTAAACGCTGTCGATTTGGGGATTGGGCTTAATGCT TGGTGCCCGT AGCTAACGTGATAAATCGACCGCCTGGCATGGTACGGCCGCAAGGTTAAAACTCAAATGA ATTGACGGGG GCCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCTAC TCTTGACATC CAGAGAACTTTCCAGAGATGGATTGGTGCCTTCGGGAGCTCTGAGACAGGTGCTGCATGG CTGTCGTCAG CTCGTGTTGTGAAATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTATCCTTTGTTGC CAGCGATTTG GTCGGGAACTCAAAGGAGACTGCCGGTGATAAACCGGAGGAAGGTGGGGATGACGTCAAG TCATCATGGC CCTTACGAGTAGGGCTACACACGTGCTACAATGGCGTATACAGAGGGAGGCGAAGCAGCG ATGTGGAGCG AATCCCAGAAAGTGCGTCTAAGTCCGGATTGGAGTCTGCAACTCGACTCCATGAAGTCGG AATCGCTAGT AATCGCGAATCAGAATGTCGCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCA CACCATGGGA GTGGGTTGTACCAGAAGTAGATAGCTTAACCTTCGGGAGGGCGTTTACCACGGTATGATT CATGACTGGG GTGATAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctaca ctctttccct acacgacgctcttccgatctGAGCCATAAACCAGCTCGGAagatcggaagagcacacgtc tgaactccag tcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>214_Ssa_Full_TAG_TCT_CAT

TGATCCTGGCTCAGGACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAA GAGAGGAGCT TGCTCTTCTTGGATGAGTTGCGAACGGGTGAGTAACGCGTAGGTAACCTGCCTGGTAGCG GGGGATAACT ATTGGAAACGATAGCTAATACCGCATAAAATTGATTATTGCATGATAATTAATTGAAAGA TGCAATTGCA TCACTACCAGATGGACCTGCGTTGTATTAGCTAGTTGGTGAGGTAACGGCTCACCAAGGC GACGATACAT AGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGG GAGGCAGCAG TAGGGAATCTTCGGACAATAGTGGGGGGAACCCTGACCGAGCAACGCCGCGTGAGTGAAG AAGGTTTTCG GATCGTAAAGCTCTGTTGTAAGAGAAGAACGGGTGTGAGAGTGGAAAGTTCACACTGTGA CGGTATCT A CCAGAAAGGGACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTCCCGAGCGTT GTCCGGATTT ATTGGGCGTAAATCTGCGAGCGCAGGCGGTTAGATAAGTCTGAAGTTAAAGGCTGTGGCT TAACCATAGT ATGCTTTGGAAACTGTTTAACTTGAGTGCAGAAGGGGAGAGTGGAATTCCATGTGTAGCG GTGAAATGCG TAGA A ATGGAGGAACACCGGTGGCGAAAGCGGCTCTCTGGTCTGTAACTGACGCTGAGGCTCGAA AGC GTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAGTGCTAGG TGTTAGGCCC TTTCCGGGGCTTAGTGCCGCAGCTAACGCATTAAGCACTCCGCCTGGCATGGTACGACCG CAAGGTTGAA ACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGAAGCA ACGCGAAGAA CCTTACCAGGTCTTGACATCCCTCTGACCGCTCTAGAGATAGAGTTTTCCTTCGGGACAG AGGTGACAGG TGGTGCATGGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCG CAACCCCTAT TGTTAGTTGCCATCATTCAGTTGGGCACTCTAGCGAGACTGCCGGTAATAAACCGGAGGA AGGTGGGGAT GACGTCAAATCATCATGCCCCTTATGACCTGGGCTACACACGTGCTACAATGGCTGGTAC AACGAGTCGC AAGCCGGTGACGGCAAGCTAATCTCTGAAAGCCAGTCTCAGTTCGGATTGTAGGCTGCAA CTCGCCTACA TGAAGTCGGAATCGCTAGTAATCGCGGATCAGCACGCCGCGGTGAATACGTTCCCGGGCC TTGTACACAC CGCCCGTCACACCACGAGAGTTTGTAACACCCGAAGTCGGTGAGGTAACCGTAAGGAGCC AGCCGCCTTA GGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactcttt ccctacacga cgctcttccgatctTTGGCCGGAGTACAGTATCAagatcggaagagcacacgtctgaact ccagtcacAA TCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>215_Vpa_Full_TAG_TCT_CAT

GCTGGCGGCGTGCTTAACACATGCAAGTCGAACGAAGAGCGATGGAAGCTTGCTTCTATC AATCTTAGTG GCGAACGGGTGAGTAACGCGTAATCAACCTGCCCTTCAGAGGGGGACAACAGTTGGAAAC GACTGCTAAT ACCGCATACGATCTAACCTCGGCATCGAGGAAAGATGAAAGGTGGCCTCTATT A AAGCTATCACTGAA GGAGGGGATTGCGTCTGATTAGCTAGTTGGAGGGGTAACGGCCCACCAAGGCGATGATCA GTAGCCGGTC TGAGAGGATGAACGGCCACATTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGC AGTGGGGAAT CTTCCGACAATAGTGGACGAAAGTCTGACGGAGCAACGCCGCGTGAGTGATGACGGCCTT CGGGTTGTAA AGCTCTGTTAATCGGGACGAAAGGCCTTCTTGCGAACAGTTAGAAGGATTGACGGTACCG GAATAGAAAG CCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGCGTTGTCCGGAA TTATTGGGCG TAAATCTGCGCGCGCAGGCGGATCAGTCAGTCTGTCTTAAAAGTTCGGGGCTTAACCCCG TGATGGGATG GAAACTGCTGATCTAGAGTATCGGAGAGGAAAGTGGAATTCCTAGTGTAGCGGTGAAATG CGTAGATATT AGGAAGAACACCAGTGGCGAAGGCGACTTTCTGGACGAAAACTGACGCTGAGGCGCGAAA GCCAGGGGAG CGAACGGGATTAGATACCCCGGTAGTCCTGGCCGTAAACGATGGGTACTAGGTGTAGGAG GTATCGACCC CTTCTGTGCCGGAGTTAACGCAATAAGTACCCCGCCTGGCATGGTACGACCGCAAGGTTG AAACTCAAAG GAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGACGCAACGCGAAG AACCTTACCA GGTCTTGACATTGATGGACAGAACCAGAGATGGTTCCTCTTCTTCGGAAGCCAGAAAACA GGTGGTGCAC GGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCCT ATCTTATGTT GCCAGCACTTTGGGTGGGAACTCATGAGAGACTGCCGCAGACAATGCGGAGGAAGGCGGG GATGACGTCA AATCATCATGCCCCTTATGACCTGGGCTACACACGTACTACAATGGGAGTTAATAGACGG AAGCGAGATC GCGAGATGGAGCAAACCCGAGAAACACTCTCTCAGTTCGGATCGTAGGCTGCAACTCGCC TACGTGAAGT CGGAATCGCTAGTAATCGCAGGTCAGCATACTGCGGTGAATACGTTCCCGGGCCTTGTAC ACACCGCCCG TCACACCACGAAAGTCGGAAGTGCCCAAAGCCGGTGGGGTAACCTTCGGGAGCCAGCCGT CTAAGG AAA GTCGATGATTGGGGTGAAGTCGTAACAAGGTAGCCTAGGGATAACAGGGTAATGAGTCGA CAAaatgata cggcgaccaccgagatctacactctttccctacacgacgctcttccgatctGTCCCGCTA TTCGGCTTGT CagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccg tcttctgctt gTTGTCGACTC

>216_Ele_Full_TAG_TCT_CAT

AGAGTTTGATCCTGGCTCAGGATGAACGCTGGCGGCGTGCCTAACACATGCAAGTCGAAC GATGAAACCG CCCTCGGGCGGACATGAAGTGGCGAACGGGTGAGTAACACGTGACCAACCTGCCCCTTGC TCCGGGACAA CCTTGGGAAACCGAGGCTAATACCGGATACTCCTCGCCCCCCTCCTGGGGGGCCCGGGAA AGCCCAGACG GCAAGGGATGGGGTCGCGGCCCATTAGGTAGTAGGCGGGGTAACGGCCCACCTAGCCCGC GATGGGTAGC CGGGTTGAGAGACCGACCGGCCACATTGGGACTGAGATACGGCCCAGACTCCTACGGGAG GCAGCAGTGG GGAATTTTGCGACAATAGTGGGGGAAACCCTGACGCAGCAACGCCGCGTGCGGGACGACG GCCTTCGGGT TGTAAACCGCTTTCAGCAGGGAAGAAATTCGACGGTACCTGCAGAAGAAGCTCCGGCTAA CTACGTGCCA GCAGCCGCGGTAATACGTAGGGAGCGAGCGTTATCCGGATTCATTGGGCGTAAATCTGAG CGCGTAGGCG GCCTCTCAAGCGGGATCTCTAATCCGAGGGCTCAACCCCCGGCCGGATCCCGAACTGGGA GGCTCGAGTT CGGTAGAGGCAGGCGGAATTCCCGGTGTAGCGGTGGAATGCGCAGATATCGGGAAGAACA CCGATGGCGA AGGCAGCCTGCTGGGCCGCAACTGACGCTGAGGCGCGAAAGCTAGGGGAGCGAACAGGAT TAGATACCCT GGTAGTCCTAGCCGTAAACGATGGATACTAGGTGTGGGGCTCCGCCCTCCGTGCCGCAGC CAACGCATTA AGTATCCCGCCTGGCATGGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGACGGGGGCC CGCACAAGCA GCGGAGCATGTGGCTTAATTCGAAGCAACGCGAAGAACCTTACCAGGGCTTGACATGGAC GTGAAGCCGG GGAAACCCGGTGGCCGAGAGGAGCGTCCGCAGGTGGTGCATGGCTGTCGTCAGCTCGTGT CGTGAGATGT TGGGTTAAGTCCCGCAACGAGCGCAACCCCTGCCCCATGTTGCCAGCATTAGGTTGGGGA CTCATGGGGG ACTGCCGGCGTCAAGCCGGAGGAAGGTGGGGACGACGTCAAGTCATCATGCCCTTTATGC CCTGGGCTGC ACACGTGCTACAATGGCCGGTACAACGGGCTGCGAGACCGCGAGGTCGAGCGAATCCCTC AAAGCCGGCC CCAGTTCGGATCGGAGGCTGCAACCCGCCTCCGTGAAGTCGGAGTTGCTAGTAATCGCGG ATCAGCATGC CGCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCACACCACCCGAGTCGTATG CACCCGAAGC CGCCGGCCGAACCCGCAAGGGGCGGAGGCGTCGAAGGTGTGGAGGGTAAGGGGGGTGAAG TCGTAACAAG GTAGCCGTACCGGAAGGTGCGGCTGGATCACCTCCTTTTAGGGATAACAGGGTAATGAGT CGACAAaatg atacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctCTTCGT GTTGGTGCCG GTCTagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatg ccgtcttctg cttgTTGTCGACTC

>217_Dde_Full_TAG_TCT_CAT

TGAACTGGAGAGTTTGATTCTGGCTCAGATTGAACGCTGGCGGCGTGCTTAACACATGCA AGTCGAACGC GAAAGGGACTTCGGTCCTGAGTAAAGTGGCGCACGGGTGAGTAACGCGTGGATAATCTGC CCTTATGATC GGGATAACAGTTGGAAACGGCTGCTAATACCGGATACGCTCAAAATGAACTTTTTGAGGA AAGATGGCCT CTGCTTGCATGCTATCACGTAAGGATGAGTCCGCGTCCCATTAGCTTGTTGGCGGGGTAA CGGCCCACCA AGGCATCGATGGGTAGCCGATTTGAGAGGATGATCGGCCACACTGGAACTGAAACACGGT CCAGACTCCT ACGGGAGGCAGCAGTGGGGAATATTGCGACAATAGTGGGCGAAAGCCTGACGCAGCGACG CCGCGTGAGG GATGAAGGTTTTCGGATCGTAAACCTCTGTCAGAAGGGAAGAAACTACGTTGTGCTAATC AGCAGCGTAC TGACGGTACCTTCAAAGGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGA GGGTGCAAGC GTTAATCGGAATTACTGGGCGTAAATCTGCGCACGTAGGCTGTAGTGTAAGTCAGGGGTG AAATCCCACG GCTCAACCGTGGAACTGCCTTTGATACTGCACAACTTGAATCCGGGAGAGGGTGGCGGAA TTCCAGGTGT AGGAGTGAAATCCGTAGATATCTGGAGGAACATCAGTGGCGAAGGCGGCCACCTGGACCG GTATTGACGC TGAGGTGCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCTGTAAA CGATGGATGC TAGATGTCGGGGAGTATTCTTCGGTGTCGTAGTTAACGCGTTAAGCATCCCGCCTGGCAT GGTACGGTCG CAAGGCTGAAACTCAAAGAAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTA ATTCGATGCA ACGCGAAGAACCTTACCTAGGTTTGACATCCACGGAACCCTCCCGAAAAGGAGGGGTGCC CTTCGGGGAG CCGTGAGACAGGTGCTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTC CCGCAACGAG CGCAACCCCTATGGATAGTTGCCAGCAAGTAATGTTGGGCACTCTATTCAGACTGCCCGG GTTAACCGGG AGGAAGGTGGGGACGACGTCAAGTCATCATGGCCCTTACGCCTAGGGCTACACACGTACT ACAATGGCGC GCACAAAGGGGAGCGAGACCGCGAGGTGGAGCCAATCCCAAAAAACGCGTCCCAGTCCGG ATTGCAGTCT GCAACTCGACTGCATGAAGTTGGAATCGCTAGTAATTCGAGATCAGCATGCTCGGGTGAA TGCGTTCCCG GGCCTTGTACACACCGCCCGTCACACCACGAAAGTCGGTTTTACCCGAAGCCGGTGAGCC AACCAGCAAT GGAGGCAGCCGTCTACGGTAGGGCCGATGATTGGGGTGAAGTCGTAACAAGGTAGCCGTA GGGGAACCTG CGGCTGGATCACCTCCTTTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgac caccgagatc tacactctttccctacacgacgctcttccgatctGTCCGATCAGTCGCGTGCACagatcg gaagagcaca cgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCG ACTC

>218_Apa_Full_TAG_TCT_CAT

ATGGAGAGTTCGATCCTGGCTCAGGATGAACGCTGGCGGCGCGCCTAACACATGCAAGTC GAACGATTAA AGCACCTTCGGGTGTGTATAAAGTGGCGAACGGCTGAGTAACACGTGGGCAACCTGCCCC TTTCATTGGG A AGCCACGGGAAACCGTGGATAATACCGAATACTTCGAGACTTCCGCATGGAAGACTCGAG AAAGCTCC GGCGGAGAGGGATGGGCCCGCGGCCTGTTAGCTTGTTGGTGGGGTAACGGCCTACCAAGG CAATGATGGG TAGCTGGGTTGAGAGACCGACCAGCCAGATTGGGACTGAGACACGGCCCAGACTCCTACG GGAGGCAGCA GTGGGGAATCTTGCAACAATAGTGGGCGAAAGCCTGATGCAGCGACGCCGCGTGCGGGAT GAAGGCCTTC GGGTTGTAAACCGCTTTCAGCAGGGACGAGGCGAAAGTGACGGTACCTGCAGAAGAAGCC CCGGCTAACT ACGTGCCAGCAGCCGCGGTAATACGTAGGGGGCAAGCGTTATCCGGATTCATTGGGCGTA AATCTGCGCT CGTAGGCGGTCTGTTAGGTCGGGAGTTAAATCCGGGGGCTCAACCCCCGCTCGCTCTCGA TACCGGCAGA CTTGAGTTTGGTAGGGGAAGGTGGAATTCCTAGTGTAGCGGTGGAATGCGCAGA AT AGGAAGAACACC AGTGGCGAAGGCGGCCTTCTGGGCCATAACTGACGCTGAGGAGCGAAAGCTAGGGGAGCA AACAGGAT A GATACCCTGGTAGTCCTAGCCGTAAACGATGGACACTAGGTGTGGGGGAGTATTTCTTCC GTGCCGCAGC TAACGCATTAAGTGTCCCGCCTGGCATGGTACGGCCGCAAGGCTAAAACTCAAAGGAATT GACGGGGGCC CGCACAAGCAGCGGAGCATGTGGTTTAATTCGAAGCAACGCGAAGAACCTTACCAGGGCT TGACATTTAG GTGAAGCGGCGGAAACGTCGTGGCCGAAAGGAGCCTAAACAGGTGGTGCATGGCTGTCGT CAGCTCGTGT CGTGAGATGTTAGGTTAAGTCCTGCAACGAGCGCAACCCTCGTCGTATGTTGCCAGCGGT TAGGCCGGGC ACCCATACGAGACCGCCGGCGTCAAGCCGGAGGAAGGTGGGGACGACGTCAAGTCATCAT GCCCCTTATG TCCTGGGCTACACACGTGCTACAATGGCCGGCACAATGGGCTGCCAACCCGCGAGGGTGA GCGAATCCCT AAAGCCGGTCCCAGTTCGGATTGGAGGCTGCAACCCGCCTCCATGAAGTCGGAGTTGCTA GTAATCGCGG ATCAGCACGCCGCGGTGAATGCGTTCCCGGGCCTTGTACACACCGCCCGTCACACCACCC GAGTCGATTG CACCCGAAGTCGTCGGCCTAACCTTTTAGGAGGGAGACGCCGAAGGTGTGGTTGGTAAGG GGGGTGAAGT CGTAACAAGGTAGCCGTACCGGAAGGTGCGGCTGGATCACCTCCTTTCTAGGGAGTAGGG ATAACAGGGT AATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgc tcttccgatc tACACGTCAAGCAGCAGTGGAagatcggaagagcacacgtctgaactccagtcacAATCA GTCTCGTatc tcgtatgccgtcttctgcttgTTGTCGACTC

>219_Gad_Full_TAG_TCT_CAT

AGTTTGATCATGGCTCAGGACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGA GAGCGACCGG TGCTTGCACTGGTCAATCTAGTGGCGAACGGGTGAGTAACACGTGGGTAACCTGCCCATC AGAGGGGGAT AACATCCGGAAACGGATGCTAAAACCGCATAGGTCTTCGAGCCGCATGGCTTGAAGAGGA AAAGAGGCGC AAGCTTCTGCTGATGGATGGACCCGCGGTGCATTAGCTAGTTGGTGAGGTAACGGCTCAC CAAGGCCGTG ATGCATAGCCGACCTGAGAGGGTGATCGGCCACATTGGGACTGAGACACGGCCCAAACTC CTACGGGAGG CAGCAGTAGGGAATCTTCCGACAATAGTGGACGCAAGTCTGACGGAGCAACGCCGCGTGA GTGAAGAAGG TTTTCGGATCGTAAAACTCTGTTGTTAGAGAAGAACAAGTGCTAGAGTAACTGTTAGCGC CTTGACGGTA TCTAACCAGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAA GCGTTGTCCG GATTTATTGGGCGTAAATCTGCGAGCGCAGGCGGTTCCTTAAGTCTGATGTGAAAGCCCC CGGCTCAACC GGGGAGGGTCATTGGAAACTGGGGAACTTGAGTGCAGAAGAGGAGAGTGGAATTCCATGT GTAGCGGTGA AATGCGTAGA A ATGGAGGAACACCAGTGGCGAAGGCGACTCTCTGGTCTGTAACTGACGCTGAGGCTC GAAAGCGTGGGTAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAGT GCTAAGTGTT GGAGGGTTTCCGCCCTTCAGTGCTGCAGTTAACGCATTAAGCACTCCGCCTGGCATGGTA CGACCGCAAG GTTGAAACTCAAAGGAATTGACGGGGACCCGCACAAGCGGTGGAGCATGTGGTTTAATTC GAAGCAACGC GAAGAACCTTACCAAGTCTTGACATCCTTTGACCACTCTAGAGATAGAGCTTTCCCTTCG GGGACAAAGT GACAGGTGGTGCATGGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAA CGAGCGCAAC CCTTATTACTAGTTGCCAGCATTCAGTTGGGCACTCTAGTGAGACTGCCGGTGACAAACC GGAGGAAGGT GGGGATGACGTCAAATCATCATGCCCCTTATGACTTGGGCTACACACGTGCTACAATGGA TGGTACAACG AGCAGCGAACTCGCGAGGGTAAGCGAATCTCTTAAAGCCATTCTCAGTTCGGATTGTAGG CTGCAACTCG CCTACATGAAGCCGGAATCGCTAGTAATCGCGGATCAGCACGCCGCGGTGAATACGTTCC CGGGTCTTGT ACACACCGCCCGTCACACCACGAGAGTTTGTAACACCCAAAGTCGGTGAGGTAACCATTT GGAGCCAGCC GCCTAAGGTGGGATAGATGATTGGGGTGAAGTCGTAACAAGGTAGCCGTATAGGGATAAC AGGGTAATGA GTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctcttc cgatctATCC TTGCGCAGGTCACCTGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTC GTatctcgta tgccgtcttctgcttgTTGTCGACTC

>220_Aca_Ful1_ AG_ C _CAT

AGAGTTTGATCCATGGCTCAGGATGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAA CGAAGCATTT AGGATTGAAGTTTTCGGATGGATTTCC A ATGACTGAGTGGCGGACGGGTGAGTAACGCGTGGGGAACC TGCCCTATACAGGGGGATAACAGCTGGAAACGGCTGCTAATACCGCATAAGCGCACAGAA TCGCATGATT CAGTGTGAAAAGCCCTGGCAG A AGGATGGTCCCGCGTCTGATTAGCTGGTTGGTGAGGTAACGGCTCA CCAAGGCGACGATCAGTAGCCGGCTTGAGAGAGTGAACGGCCACATTGGGACTGAGACAC GGCCCAAACT CCTACGGGAGGCAGCAGTGGGGAATATTGCAACAATAGTGGGGGAAACCCTGATGCAGCG ACGCCGCGTG AGTGAAGAAGTATTTCGGTATGTAAAGCTCTATCAGCAGGGAAGAAAACAGACGGTACCT GACTAAGAAG CCCCGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGGGGCAAGCGTTATCCGGAA TTACTGGGTG TAAATCTGGGTGCGTAGGTGGCATGGTAAGTCAGAAGTGAAAGCCCGGGGCTTAACCCCG GGACTGCTTT TGAAACTGTCATGCTGGAGTGCAGGAGAGGTAAGCGGAATTCCTAGTGTAGCGGTGAAAT GCGTAGA AT TAGGAGGAACACCAGTGGCGAAGGCGGCTTACTGGACTGTCACTGACACTGATGCACGAA AGCGTGGGGA GCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAA AC AGGTGTCGGGGCCGTAGAGG CTTCGGTGCCGCAGCAAACGCAGTAAGTATTCCACCTGGCATGGTACGTTCGCAAGAATG AAACTCAAAG GAATTGACGGGGACCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGAAGCAACGCGAAG AACCTTACCT GGTCTTGACATCCCAATGACCGAACCTTAACCGGTTTTTTCTTTCGAGACATTGGAGACA GGTGGTGCAT GGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCCT ATCTTTAG A GCCAGCATTTGAGGTGGGCACTCTAGAGAGACTGCCAGGGATAACCTGGAGGAAGGTGGG GACGACGTCA AATCATCATGCCCCTTATGGCCAGGGCTACACACGTGCTACAATGGCGTAAACAAAGGGA AGCGAAGTCG TGAGGCGAAGCAAATCCCAGAAATAACGTCTCAGTTCGGATTGTAGTCTGCAACTCGACT ACATGAAGCT GGAATCGCTAGTAATCGTGAATCAGAATGTCACGGTGAATACGTTCCCGGGTCTTGTACA CACCGCCCGT CACACCATGGGAGTCAGTAACGCCCGAAGTCAGTGACCCAACCGCAAGGAGGGAGCTGCC GAAGGTGGGA CCGATAACTGGGGTGAAGTCGTAACAAGGTAGCCGTATCGGTAGGGATAACAGGGTAATG AGTCGACAAa atgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctGGC ACCTAGAATA GCCGTTGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgt atgccgtctt ctgcttgTTGTCGACTC

>22 l_Rmu_Full_TAG_TCT_CAT

AGAGTTTGATCATGGCTCAGGACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAAC GATGAAGCCT AGCTTGCTAGGTGGATTAGTGGCGAACGGGTGAGTAATACGTGAGTAACCTACCTTTAAC TCTGGGATAA GCCTGGGAAACTGGGTCTAATACCGGATACGACCAATCTCCGCATGGGGTGTTGGTGGAA AGCGTTATGT AGTGGTTATAGATGGGCTCACGGCCTATCAGCTCGTTGGTGAGGTAACGGCTCACCAAGG CGACGACGGG TAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAGACTCCTACG GGAGGCAGCA GTGGGGAATATTGCAACAATAGTGGGCGCAAGCCTGATGCAGCGACGCCGCGTGAGGGAT GACGGCCTTC GGGTTGTAAACCTCTGTTAGCAGGGAAGAAGAGAGATTGACGGTACCTGCAGAGAAAGCG CCGGCTAACT ACGTGCCAGCAGCCGCGGTAATACGTAGGGCGCGAGCGTTGTCCGGAATTATTGGGCGTA AATCTGAGCT TGTAGGCGGTTTGTCGCGTCTGCTGTGAAAGGCCGGGGCTTAACTCCGTGTATTGCAGTG GGTACGGGCA GACTAGAGTGCAGTAGGGGAGACTGGAATTCCTGGTGTAGCGGTGGAATGCGCAGATATC AGGAGGAACA CCGATGGCGAAGGCAGGTCTCTGGGCTGTAACTGACGCTGAGAAGCGAAAGCATGGGGAG CGAACAGGAT TAGATACCCTGGTAGTCCATGCCGTAAACGTTGGGCACTAGGTGTGGGGGACATTCCACG TTTTCCGCGC CGTAGCTAACGCATTAAGTGCCCCGCCTGGCATGGTACGGCCGCAAGGCTAAAACTCAAA GAAATTGACG GGGGCCCGCACAAGCGGCGGAGCATGCGGATTAATTCGATGCAACGCGAAGAACCTTACC AAGGCTTGAC ATATACTGGACCGCATCAGAGATGGTGTTTCCCTTCGGGGCTGGTATACAGGTGGTGCAT GGTTGTCGTC AGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTCGTTCTATGTT GCCAGCACGT TATGGTGGGGACTCATAGGAGACTGCCGGGGTCAACTCGGAGGAAGGTGGGGATGACGTC AAATCATCAT GCCCCTTATGTCTTGGGCTTCACGCATGCTACAATGGCCGGTACAGAGGGTTGCGATACT GTGAGGTGGA GCTAATCCCTAAAAGCCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGACCCCATGAAGT CGGAGTCGCT AGTAATCGCAGATCAGCAACGCTGCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCC GTCAAGTCAC GAAAGTTGGTAACACCCAAAGCCGGTGGCCTAACCCTTTTGGGAGGGAGCCGTCTAAGGT GGGATTGGCG ATTGGGACTAAGTCGTAACAAGGTAGCCTAGGGATAACAGGGTAATGAGTCGACAAaatg atacggcgac caccgagatctacactctttccctacacgacgctcttccgatctACACCTATTAGAGGTC AGACagatcg gaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctg cttgTTGTCG ACTC

>222_Kpn_Fu11_ AG_ CT_CAT

TGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAGCGGTAGCACAGAGAGCTTGCTCT CGGGTGACGA GCGGCGGACGGGTGAGTAATGTCTGGGAAACTGCCTGATGGAGGGGGATAACTACTGGAA ACGGTAGC A A ACCGCATAACGTCGCAAGACCAAAGTGGGGGACCTTCGGGCCTCATGCCATCAGATGTGC CCAGATGG GATTAGCTAGTAGGTGGGGTAACGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAG GATGACCAGC CACACTGGAACTGAGACACGGTCCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGCA ACAATAGTGG GCGCAAGCCTGATGCAGCCATGCCGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACT TTCAGCGGGG AGGAAGGCGATAAGGTTAATAACCTTGTCGATTGACGTTACCCGCAGAAGAAGCACCGGC TAACTCCGTG CCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTAATCGGAATTACTGGGCGTAAATCT GCGCACGCAG GCGGTCTGTCAAGTCGGATGTGAAATCCCCGGGCTCAACCTGGGAACTGCATTCGAAACT GGCAGGCTAG AGTCTTGTAGAGGGGGGTAGAATTCCAGGTGTAGCGGTGAAATGCGTAGAGATCTGGAGG AATACCGGTG GCGAAGGCGGCCCCCTGGACAAAGACTGACGCTCAGGTGCGAAAGCGTGGGGAGCAAACA GGATTAGA A CCCTGGTAGTCCACGCCGTAAACGATGTCGATTTGGAGGTTGTGCCCTTGAGGCGTGGCT TCCGGAGCTA ACGCGTTAAATCGACCGCCTGGCATGGTACGGCCGCAAGGTTAAAACTCAAATGAATTGA CGGGGGCCCG CACAAGCGGTGGAGCATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCTGGTCTTG ACATCCACAG AACTTAGCAGAGATGCTTTGGTGCCTTCGGGAACTGTGAGACAGGTGCTGCATGGCTGTC GTCAGCTCGT GTTGTGAAATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTATCCTTTGTTGCCAGCG GTTAGGCCGG GAACTCAAAGGAGACTGCCAGTGATAAACTGGAGGAAGGTGGGGATGACGTCAAGTCATC ATGGCCCT A CGACCAGGGCTACACACGTGCTACAATGGCATATACAAAGAGAAGCGACCTCGCGAGAGC AAGCGGACCT CATAAAGTATGTCGTAGTCCGGATTGGAGTCTGCAACTCGACTCCATGAAGTCGGAATCG CTAGTAATCG TAGATCAGAATGCTACGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCACACCA TGGGAGTGGG TTGCAAAAGAAGTAGGTAGCTTAACCTTCGGGAGGGCGCTTACCACTTTGTGATTCATGA TAGGGATAAC AGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacac gacgctcttc cgatctTTTGTATGCACCTTGTAAATagatcggaagagcacacgtctgaactccagtcac AATCAGTCTC GTatctcgtatgccgtcttctgcttgTTGTCGACTC

>223_Pco_Full_TAG_TCT_CAT

TACAATGGAGAGTTTGATCCTGGCTCAGGATGAACGCTAGCTACAGGCTTAACACATGCA AGTCGAGGGG AAACGACATCGAAAGCTTGCTTTTGATGGGCGTCGACCGGCGCACGGGTGAGTAACGCGT ATCCAACCTG CCCACCACTTGGGGATAACCTTGCGAAAGTAAGACTAATACCCAATGATATCTCTAGAAG ACATCTGAAA GAGATTAAAGATTTATCGGTGATGGATGGGGATGCGTCTGATTAGCTTGTTGGCGGGGTA ACGGCCCACC AAGGCGACGATCAGTAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGG TCCAAACTCC TACGGGAGGCAGCAGTGAGGAATATTGGTACAATAGTGGGCGAGAGCCTGAACCAGCCAA GTAGCGTGCA GGATGACGGCCCTATGGGTTGTAAACTGCTTTTATAAGGGAATAAAGTTAGTCTCGTGAG ACTTTTTGCA TGTACCTTATGAATAAGGACCGGCTAATTCCGTGCCAGCAGCCGCGGTAATACGGAAGGT CCGGGCGT A TCCGGATTTATTGGGTTTAAATCTGGGAGCGTAGGCCGGAGATTAAGCGTGTTGTGAAAT GTAGGCGCTC AACGTCTGCACTGCAGCGCGAACTGGTTTCCTTGAGTACGCACAAAGTGGGTGGAATTCG TGGTGTAGCG GTGAAATGCTTAGATATCACGAAGAACTCCGATTGCGAAGGCAGCTCACTGGAGCGCAAC TGACGCTGAA GCTCGAAAGTGCGGGTATCGAACAGGATTAGATACCCTGGTAGTCCGCACGGTAAACGAT GGATGCCCGC TGTTGGTCTGAACAGGTCAGCGGCCAAGCGAAAGCATTAAGCATCCCACCTGGCATGGTA CGCCGGCAAC GGTGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGAGGAACATGTGGTTTAATTC GATGATACGC GAGGAACCTTACCCGGGCTTGAATTGCAGAGGAAGGATTTGGAGACAATGACGCCCTTCG GGGCCTCTGT GAAGGTGCTGCATGGTTGTCGTCAGCTCGTGCCGTGAGGTGTCGGCTTAAGTGCCATAAC GAGCGCAACC CCTCTCCTTAGTTGCCATCAGGTTATGCTGGGCACTCTGGGGACACTGCCACCGTAAGGT GTGAGGAAGG TGGGGATGACGTCAAATCAGCACGGCCCTTACGTCCGGGGCTACACACGTGTTACAATGG CAGGTACAGA GAGACGGTTGTACGTAAGTACGATCAAATCCTTAAAGCCTGTCTCAGTTCGGATTGGGGT CTGCAACCCG ACCCCACGAAGCTGGATTCGCTAGTAATCGCGCATCAGCCATGGCGCGGTGAATACGTTC CCGGGCCTTG TACACACCGCCCGTCAAGCCATGAAAGCCGGGGGCGCCTAAAGTCCGTGACCGTAAGGAG CGGCCTAGGG CGAAACTGGTAATTGGGGCTAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGCGGCTGGA ACACCTCCTT TAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactct ttccctacac gacgctcttccgatctGGAACGGGTGTTGCCCAGATagatcggaagagcacacgtctgaa ctccagtcac AATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>224_Apu_Full_TAG_TCT_CAT

GAGAGTTTGATCCTGGCTCAGGATGAACGCTAGCGGCAGGCTTAACACATGCAAGTCGAG GGGCAGCA A ATGGATAGCAATATCTATGGTGGCGACCGGCGCACGGGTGCGTAACGCGTATGCAACCTA CCTTTAACAG GGGGATAACACTGAGAAATTGGTACTAATACCCCATAATATCATAGAAGGCATCTTTTAT GGTTGAAAAT TCCGATGGTTAGAGATGGGCATGCGTTGTATTAGCTAGTTGGTGGGGTAACGGCTCACCA AGGCGACGAT ACATAGGGGGACTGAGAGGTTAACCCCCCACACTGGTACTGAGACACGGACCAGACTCCT ACGGGAGGCA GCAGTGAGGAATATTGGTACAATAGTGGACGCAAGTCTGAACCAGCCATGCCGCGTGCAG GATGACGGCT CTATGAGTTGTAAACTGCTTTTGTACGAGGGTAAACGCAGATACGTGTATCTGTCTGAAA GTATCGTACG AATAAGGATCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGATTCAAGCGTTAT CCGGATTTAT TGGGTTTAAATCTGGGTGCGTAGGCGGTTTGATAAGTTAGAGGTGAAATTTCGGGGCTCA ACCCTGAACG TGCCTCTAATACTGTTGAGCTAGAGAGTAGTTGCGGTAGGCGGAATGTATGGTGTAGCGG TGAAATGCTT AGAGATCATACAGAACACCGATTGCGAAGGCAGCTTACCAAACTATATCTGACGTTGAGG CACGAAAGCG TGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCAGTAAACGATGATAACTCGTT GTCGGCGA A CACAGTCGGTGACTAAGCGAAAGCGATAAGTTATCCACCTGGCATGGTACGTTCGCAAGA ATGAAACTCA AAGGAATTGACGGGGGCCCGCACAAGCGGAGGAACATGTGGTTTAATTCGATGATACGCG AGGAACCT A CCCGGGCTTGAAAGTTAGCGACGATTCTTGAAAGAGGATTTCCCTTCGGGGCGCGAAACT AGGTGCTGCA TGGTTGTCGTCAGCTCGTGCCGTGAGGTGTCGGGTTAAGTCCCATAACGAGCGCAACCCC TACCGTTAGT TGCCATCAGGTGAAGCTGGGCACTCTGGCGGGACTGCCGGTGTAAGCCGAGAGGAAGGTG GGGATGACGT CAAATCAGCACGGCCCTTACGTCCGGGGCTACACACGTGTTACAATGGTAGGTACAGAGG GCAGCTACCC AGCGATGGGATGCGAATCTCGAAAGCCTATCTCAGTTCGGATTGGAGGCTGAAACCCGCC TCCATGAAGT TGGATTCGCTAGTAATCGCGCATCAGCCATGGCGCGGTGAATACGTTCCCGGGCCTTGTA CACACCGCCC GTCAAGCCATGGGAGCCGGGGGTGCCTGAAGTTCGTGACCGCAAGGAGCGACCTAGGGCA AAACTGGTGA CTGGGGCTAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGCGGCTGGAACACCTCCTTTC TTAGGGATAA CAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctaca cgacgctctt ccgatctCAGTGCTCGACCCGACACCCagatcggaagagcacacgtctgaactccagtca cAATCAGTCT CGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>225_Ere_Full_TAG_TCT_CAT

AGAGTTTGATCCTGGCTCAGGATGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAAC GAAGCACTTT ATTTGATTTCCTTCGGGACTGATTATTTTGTGACTGAGTGGCGGACGGGTGAGTAACGCG TGGGTAACCT GCCTTGTACAGGGGGATAACAGTTGGAAACGACTGCTAATACCGCATAAGCGCACAGTAT CGCATGATGC AGTGTGAAAAACTCCGGTGGTATAAGATGGACCCGCGTTGGATTAGCTAGTTGGTGAGGT GACGGCCCAC CAAGGCGACGATCCATAGCCGACCTGAGAGGGTGACCGGCCACATTGGGACTGAGACACG GCCCAAACTC CTACGGGAGGCAGCAGTGGGGAATATTGCAACAATAGTGGGCGAAAGCCTGATGCAGCGA CGCCGCGTGA GCGAAGAAGTATTTCGGTATGTAAAGCTCTATCAGCAGGGAAGATAATGACGGTACCTGA CTAAGAAGCA CCGGCTAAATACGTGCCAGCAGCCGCGGTAATACGTATGGTGCAAGCGTTATCCGGATTT ACTGGGTG A AATCTGGGAGCGCAGGCGGTGCGGCAAGTCTGATGTGAAAGCCCGGGGCTCAACCCCGGT ACTGCATTGG AAACTGTCGTACTAGAGTGTCGGAGGGGTAAGCGGAATTCCTAGTGTAGCGGTGAAATGC GTAGA AT A GGAGGAACACCAGTGGCGAAGGCGGCTTACTGGACGATAACTGACGCTGAGGCTCGAAAG CGTGGGGAGC AAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGAA AC AGGTGTTGGGAAGCATTGCTTC TCGGTGCCGTCGCAAACGCAGTAAGTATTCCACCTGGCATGGTACGTTCGCAAGAATGAA ACTCAAAGGA ATTGACGGGGACCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGAAGCAACGCGAAGAA CCTTACCAAG TCTTGACATCCTTCTGACCGGTACTTAACCGTACCTTCTCTTCGGAGCAGGAGTGACAGG TGGTGCATGG TTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTAT CTTTAGTAGC CAGCGGTTCGGCCGGGCACTCTAGAGAGACTGCCAGGGATAACCTGGAGGAAGGCGGGGA TGACGTCAAA TCATCATGCCCCTTATGACTTGGGCTACACACGTGCTACAATGGCGTAAACAAAGGGAAG CAAAGCTGTG AAGCCGAGCAAATCTCAAAAATAACGTCTCAGTTCGGACTGTAGTCTGCAACCCGACTAC ACGAAGCTGG AATCGCTAGTAATCGCAGATCAGAATGCTGCGGTGAATACGTTCCCGGGTCTTGTACACA CCGCCCGTCA CACCATGGGAGTTGGGAATGCCCGAACCAGTGACCTAACCGTAAGGAAGGAGCTGTCGAA GGCAGGCTCG A AACTGGGGTGAAGTCTAACAAGGTAACCTAGGGATAACAGGGTAATGAGTCGACAAaatg atacggcg accaccgagatctacactctttccctacacgacgctcttccgatctGAGTTTACCTGCGC CCAGTTagat cggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttc tgcttgTTGT CGACTC

>226_Rbr_Full_TAG_TCT_CAT

GACGAACGCTGGCGGCGTGCCTAACACATGCAAGTCGAACGGAACTGTTTTGAAAGATTT CTTCGGAATG AATTTGATTTAGTTTAGTGGCGGACGGGTGAGTAACGCGTGAGTAACCTGCCTTCAAGAG GGGGATAACA TTCTGAAAAGAATGCTAATACCGCATGACATATCGGAACCACATGGTTTTGATATCAAAG ATTTTATCGC TTGAAGATGGACTCGCGTCCGATTAGTTAGTTGGTGAGGTAACGGCTCACCAAGACCGCG ATCGGTAGCC GGACTGAGAGGTTGAACGGCCACATTGGGACTGAGACACGGCCCAGACTCCTACGGGAGG CAGCAGTGGG GGATATTGCGACAATAGTGGGGGCAACCCTGACGCAGCAACGCCGCGTGAAGGATGAAGG TTTTCGGATT GTAAACTTCTTTTATTAAGGACGAAACTTGACGGTACTTAATGAATAAGCTCCGGCTAAC TACGTGCCAG CAGCCGCGGTAATACGTAGGGAGCAAGCGTTGTCCGGATTTACTGGGTGTAAATCTGGGT GCGTAGGCGG CTTTGCAAGTCAGATGTGAAATCTATGGGCTCAACCCATAAACTGCATTTGAAACTGTAG AGCTTGAGTG AAGTAGAGGCAGGCGGAATTCCCCGTGTAGCGGTGAAATGCGTAGAGATGGGGAGGAACA CCAGTGGCGA AGGCGGCCTGCTGGGCTTTAACTGACGCTGAGGCACGAAAGCGTGGGTAGCAAACAGGAT TAGATACCCT GGTAGTCCACGCTGTAAACGATGATTACTAGGTGTGGGGGGTCTGACCCCTTCCGTGCCG GAGTTAACAC AATAAGTAATCCACCTGGCATGGTACGGCCGCAAGGTTGAAACTCAAAGGAATTGACGGG GGCCCGCACA AGCAGTGGAGTATGTGGTTTAATTCGAAGCAACGCGAAGAACCTTACCAGGTCTTGACAT CCAACTAACG AAGTAGAGATACATTAGGTGCCCTTCGGGGAAAGTTGAGACAGGTGGTGCATGGTTGTCG TCAGCTCGTG TCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGCTATTAGTTGCTACGCA AGAGCACTCT AATAGGACTGCCGTTGACAAAACGGAGGAAGGTGGGGACGACGTCAAATCATCATGCCCC TTATGACCTG GGCTACACACGTACTACAATGGATGTTAACAGAGGGAAGCAAGACAGCGATGTGGAGCAA ACCCCTAAAA ACATTCTCAGTTCAGATTGCAGGCTGCAACCCGCCTGCATGAAGATGGAATTGCTAGTAA TCGCGGATCA GCATGCCGCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCACACCATGGGAGC CGGTAATACC CGAAGTCAGTAGTCCAACCTCGTGAGGACGCTGCCGAAGGTAGGATTGGCGACTGGGGTG TAGGGATAAC AGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacac gacgctcttc cgatctAAGATGCATACGAGGAGCAAagatcggaagagcacacgtctgaactccagtcac AATCAGTCTC GTatctcgtatgccgtcttctgcttgTTGTCGACTC

>227_Osp_Full_TAG_TCT_CAT

AGAGTTTGATCCTGGCTCAGGATAACGCTAGCGACAGGCTTAACACATGCAAGTCGAGGG GCATCATGAG GTAGCAATACCTTGATGGCGACCGGCGCACGGGTGAGTAACGCGTATGCAACCTGCCTGA TACCGGGG A TAGCCCATGGAAACGTGGATTAACACCCCATAGTACTTTTATCCTGCATGGGATGTGAGT TAAATGTTCA AGGTATCGGATGGGCATGCGTCCTATTAGTTAGTTGGCGGGGTAACAGCCCACCAAGACG ATGATAGGTA GGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGGTCCAAACTCCTACGGG AGGCAGCAGT GAGGAATATTGGTACAATAGTGGACGTAAGTCTGAACCAGCCAAGTCGCGTGAGGGAAGA CTGCCCTATG GGTTGTAAACCTCTTTTATAAGGGAAGAATAAGTTCTACGTGTAGAATGATGCCTGTACC TTATGAATAA GCATCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGATGCGAGCGTTATCCGGA TTTATTGGGT TTAAATCTGGGTGCGTAGGCGGTTTATTAAGTTAGTGGTTAAATATTTGAGCTAAACTCA ATTGTGCCAT TAATACTGGTAAACTGGAGTACAGACGAGGTAGGCGGAATAAGTTAAGTAGCGGTGAAAT GCATAGA AT AACTTAGAACTCCGATAGCGAAGGCAGCTTACCAGACTGTAACTGACGCTGAAGCACGAG AGCGTGGG A GCGAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGCTCACTGGTTCTGTGC GA A ATTGT ACGGGATTAAGCGAAAGTATTAAGTGAGCCACCTGGCATGGTACGTCGGCAACGATGAAA CTCAAAGGAA TTGACGGGGGCCCGCACAAGCGGAGGAACATGTGGTTTAATTCGATGATACGCGAGGAAC CTTACCTGGG TTTAAATGGGAAATGTCGTATTTGGAAACAGATATTCTCTTCGGAGCGTTTTTCAAGGTG CTGCATGGTT GTCGTCAGCTCGTGCCGTGAGGTGTCGGGTTAAGTCCCATAACGAGCGCAACCCTTACCG TTAGTTGC A GCATGTAATGATGAGCACTCTAACGGGACTGCCACCGTAAGGTGAGAGGAAGGCGGGGAT GACGTCAAAT CAGCACGGCCCTTACACCCAGGGCTACACACGTGTTACAATGGCCGGTACAGAGGGCCGC TACCAGGTGA CTGGATGCCAATCTCAAAAGCCGGTCGTAGTTCGGATTGGAGTCTGTAACCCGACTCCAT GAAGTTGGAT TCGCTAGTAATCGCGCATCAGCCATGGCGCGGTGAATACGTTCCCGGGCCTTGTACACAC CGCCCGTCAA GCCATGGAAGCCGGGGGTGCCTGAAGTCCGTAACCGCGAGGATCGGCCTAGGGCAAAACT GGTAACTGGG GCTAAGTCGTAACATAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccacc gagatctaca ctctttccctacacgacgctcttccgatctATCATGATACTGCCGTTTCGagatcggaag agcacacgtc tgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>228_Amu_Full_TAG_TCT_CAT

AACGAACGCTGGCGGCGTGGATAAGACATGCAAGTCGAACGAGAGAATTGCTAGCTTGCT AATAATTCTC TAGTGGCGCACGGGTGAGTAACACGTGAGTAACCTGCCCCCGAGAGCGGGATAGCCCTGG GAAACTGGGA TTAATACCGCATAGTATCGAAAGATTAAAGCAGCAATGCGCTTGGGGATGGGCTCGCGGC CTATTAGTTA GTTGGTGAGGTAACGGCTCACCAAGGCGATGACGGGTAGCCGGTCTGAGAGGATGTCCGG CCACACTGGA ACTGAGACACGGTCCAGACACCTACGGGTGGCAGCAGTCGAGAATCATTCAACAATAGTG GGGGAAACCC TGATGGTGCGACGCCGCGTGGGGGAATGAAGGTCTTCGGATTGTAAACCCCTGTCATGTG GGAGCAAATT AAAAAGATAGTACCACAAGAGGAAGAGACGGCTAACTCTGTGCCAGCAGCCGCGGTAATA CAGAGGTCTC AAGCGTTGTTCGGAATCACTGGGCGTAAATCTGCGTGCGTAGGCTGTTTCGTAAGTCGTG TGTGAAAGGC GCGGGCTCAACCCGCGGACGGCACATGATACTGCGAGACTAGAGTAATGGAGGGGGAACC GGAATTCTCG GTGTAGCAGTGAAATGCGTAGATATCGAGAGGAACACTCGTGGCGAAGGCGGGTTCCTGG ACATTAACTG ACGCTGAGGCACGAAGGCCAGGGGAGCGAAAGGGATTAGATACCCCTGTAGTCCTGGCAG TAAACGGTGC ACGCTTGGTGTGCGGGGAATCGACCCCCTGCGTGCCGGAGTAACGCGTTAAGCGTGCCGC CTGGCATGGT ACGGTCGCAAGATTAAAACTCAAAGAAATTGACGGGGACCCGCACAAGCGGTGGAGTATG TGGCTTAATT CGATGCAACGCGAAGAACCTTACCTGGGCTTGACATGTAATGAACAACATGTGAAAGCAT GCGACTCTTC GGAGGCGTTACACAGGTGCTGCATGGCCGTCGTCAGCTCGTGTCGTGAGATGTTTGGTTA AGTCCAGCAA CGAGCGCAACCCCTGTTGCCAGTTACCAGCACGTGAAGGTGGGGACTCTGGCGAGACTGC CCAGATCAAC TGGGAGGAAGGTGGGGACGACGTCAGGTCAGTATGGCCCTTATGCCCAGGGCTGCACACG TACTACAATG CCCAGTACAGAGGGGGCCGAAGCCGCGAGGCGGAGGAAATCCTAAAAACTGGGCCCAGTT CGGACTGTAG GCTGCAACCCGCCTACACGAAGCCGGAATCGCTAGTAATGGCGCATCAGCTACGGCGCCG TGAATACGTT CCCGGGTCTTGTACACACCGCCCGTCACATCATGGAAGCTGGTCGCACCCGAAGTATCTG AAGCCAACCG CAAGGAGGCAGGGTCCTAAGGTGAGACTGGTAACTGGGATGTAGGGATAACAGGGTAATG AGTCGACAAa atgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatctTGT CTTATCTGAA TACAGAGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgt atgccgtctt ctgcttgTTGTCGACTC

>229_Din_Full_TAG_TCT_CAT

GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGAAAAGACGGAAAGAGCTTG CTCTTTTCAG AATTGAGTGGCAAACGGGTGAGTAACACGTAAACAACCTGCCTTCAGGATGGGGACAACA GACGGAAACG ACTGCTAATACCGAATAAGTTCCAAGAGCCGCATGGCCCATGGAAGAAAAGGTGGCCTCT ACCTGTAAGC TATCGCCTGAAGAGGGGTTTGCGTCTGATTAGCTGGTTGGAGGGGTAACGGCCCACCAAG GCGACGATCA GTAGCCGGTCTGAGAGGATGAACGGCCACACTGGAACTGAGACACGGTCCAGACTCCTAC GGGAGGCAGC AGTGGGGAATCTTCCGACAATAGTGGGCGAAAGCCTGACGGAGCAACGCCGCGTGAGTGA TGACGGCCTT CGGGTTGTAAAACTCTGTGATCCGGGACGAAAAGGCAGAGTGCGAAGAACAAACTGCATT GACGGTACCG GAAAAGCAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGCG TTGTCCGGAA TTATTGGGCGTAAATCTGCGCGCGCAGGCGGCTTCCCAAGTCCCTCTTAAAAGTGCGGGG CTTAACCCCG TGATGGGAAGGAAACTGGGAAGCTGGAGTATCGGAGAGGAAAGTGGAATTCCTAGTGTAG CGGTGAAATG CGTAGAGATTAGGAAGAACACCGGTGGCGAAGGCGACTTTCTGGACGAAAACTGACGCTG AGGCGCGAAA GCGTGGGGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGATGGATACTAGG TGTAGGAGGT ATCGACCCCTCCTGTGCCGGAGTTAACGCAATAAGTATCCCGCCTCATGGGAAGTACGAT CGCAAGAT A AAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGTATGTGGTTTAATTCGACG CAACGCGAAG AACCTTACCAGGTCTTGACATTGATCGCGATCTGCAGAAATGCGGAGTTCTTCTTCGGAA GACGAGAAAA CAGGTGGTGCACGGCTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACG AGCGCAACCC CTATCATTTGTTACCAGCACGTAAAGGTGGGGACTCAAATGAGACCGCCGCGGACAACGC GGAGGAAGGC GGGGACGACGTCAAGTCATCATGCCCCTTATGACCTGGGCTACACACGTACTACAATGGG TGTCAACAAA GAGAAGCGAACCCGCGAGGAAGAGCAAACCTCAAAAACACACCCCCAGTTCAGATCGCAG GCTGCAACCC GCCTGCGTGAAGTAGGAATCGCTAGTAATCGCGGGTCAGCATACCGCGGTGAATACGTTC CCGGGCCTTG TACACACCGCCCGTCACACTATGAGAGTCAGAAACACCCGAAGCCGGTGAGGTAACCGCA AGGAGCCAGC CGTCGAAGGCGGAGCTGATGATTGGAGTGAAGTCGTAACAAGGTAGCCGTATCGGAAGGT GCTAGGGA A ACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctac acgacgctct tccgatctTGGCAGCCATAACATAGTCCagatcggaagagcacacgtctgaactccagtc acAATCAGTC TCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>230_Bun_Full_TAG_TCT_CAT

TTTGATCATGGCTCAGGATGAACGCTAGCTACAGGCTTAACACATGCAAGTCGAGGGGCA GCATGAACTT AGCTTGCTAAGTTTGATGGCGACCGGCGCACGGGTGAGTAACACGTATCCAACCTGCCGA TGACTCGGGG A AGCCTTTCGAAAGAAAGATTAATACCCGATGGCATAGTTCTTCCGCATGGTGGAACTATT AAAGAATT TCGGTCATCGATGGGGATGCGTTCCATTAGGTTGTTGGCGGGGTAACGGCCCACCAAGCC TTCGATGGAT AGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGGTCCAAACTCCTACGG GAGGCAGCAG TGAGGAATATTGGTACAATAGTGGACGAGAGTCTGAACCAGCCAAGTAGCGTGAAGGATG ACTGCCCTAT GGGTTGTAAACTTCTTT A ACGGGAATAAAGTGAGGCACGTGTGCCTTTTTGTATGTACCGTATGAA A AGGATCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGATCCGAGCGTTATCCGG ATTTATTGGG TTTAAATCTGGGAGCGTAGGCGGACGCTTAAGTCAGTTGTGAAAGTTTGCGGCTCAACCG TAAAATTGCA GTTGATACTGGGTGTCTTGAGTACAGTAGAGGCAGGCGGAATTCGTGGTGTAGCGGTGAA ATGCTTAGAT ATCACGAAGAACTCCGATTGCGAAGGCAGCTTGCTGGACTGTAACTGACGCTGATGCTCG AAAGTGTGGG TATCAAACAGGATTAGATACCCTGGTAGTCCACACAGTAAACGATGAATACTCGCTGTTT GCGA A ACA GTAAGCGGCCAAGCGAAAGCGTTAAGTATTCCACCTGGCATGGTACGCCGGCAACGGTGA AACTCAAAGG AATTGACGGGGGCCCGCACAAGCGGAGGAACATGTGGTTTAATTCGATGATACGCGAGGA ACTTACCCGG GCTTGAATTGCAACTGAATGATGTGGAGACATGTCAGCCGCAAGGCAGTTGTGAAGGTGC TGCATGGTTG TCGTCAGCTCGTGCCGTGAGGTGTCGGCTTAAGTGCCATAACGAGCGCAACCCTTATCGA TAGTTACCAT CAGGTGATGCTGGGGACTCTGTCGAGACTGCCGTCGTAAGATGTGAGGAAGGTGGGGATG ACGTCAAATC AGCACGGCCCTTACGTCCGGGGCTACACACGTGTTACAATGGGGGGTACAGAAGGCAGCT ACACGGCGAC GTGATGCTAATCCCTAAAGCCTCTCTCAGTTCGGATTGGAGTCTGCAACCCGACTCCATG AAGCTGGATT CGCTAGTAATCGCGCATCAGCCACGGCGCGGTGAATACGTTCCCGGGCCTTGTACACACC GCCCGTCAAG CCATGAAAGCCGGGGGTACCTGAAGTGCGTAACCGCAAGGAGCGCCCTAGGGTAAAACTG GTGATTGGGG CTAATAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctaca ctctttccct acacgacgctcttccgatctGGCACGCGGGTCCGACAGCCagatcggaagagcacacgtc tgaactccag tcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>231 Bth Full TAG TCT CAT TTACAATGAAGAGTTTGATCCTGGCTCAGGATGAACGCTAGCTACAGGCTTAACACATGC AAGTCGAGGG GCAGCATTTCAGTTTGCTTGCAAACTGGAGATGGCGACCGGCGCACGGGTGAGTAACACG TATCCAACCT GCCGATAACTCGGGGATAGCCTTTCGAAAGAAAGATTAATACCCGATGG A AATCAGACCGCATGGTCT TGTTATTAAAGAATTTCGGTTATCGATGGGGATGCGTTCCATTAGGCAGTTGGTGAGGTA ACGGCTCACC AAACCTTCGATGGATAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGG TCCAAACTCC TACGGGAGGCAGCAGTGAGGAATATTGGTACAATAGTGGGCGCAGGCCTGAACCAGCCAA GTAGCGTGAA GGATGACTGCCCTATGGGTTGTAAACTTCTTTTATATGGGAATAAAGTTTTCCACGTGTG GAATTTTG A TGTACCATATGAATAAGGATCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGAT CCGAGCGT A TCCGGATTTATTGGGTTTAAATCTGGGAGCGTAGGTGGACAGTTAAGTCAGTTGTGAAAG TTTGCGGCTC AACCGTAAAATTGCAGTTGATACTGGCTGTCTTGAGTACAGTAGAGGTGGGCGGAATTCG TGGTGTAGCG GTGAAATGCTTAGATATCACGAAGAACTCCGATTGCGAAGGCAGCTCACTGGACTGCAAC TGACACTGAT GCTCGAAAGTGTGGGTATCAAACAGGATTAGATACCCTGGTAGTCCACACAGTAAACGAT GAATACTCGC TGTTTGCGA A ACAGTAAGCGGCCAAGCGAAAGCATTAAGTATTCCACCTGGCATGGTACGCCGGCAAC GGGTGAAACTCAAAGGAATTGACGGGGGCCCGTACAAGCGGAGGAACATGTGGTTTAATT CGATGATACG CGAGGAACCTTACCCGGGCTTAAATTGCATTTGAA A ATTGGAAACAG A AGCCGTAAGGCAAATGTG AAGGTGCTGCATGGTTGTCGTCAGCTCGTGCCGTGAGGTGTCGGCTTAAGTGCCATAACG AGCGCAACCC TTATCTTTAGTTACTAACAGGTCATGCTGAGGACTCTAGAGAGACTGCCGTCGTAAGATG TGAGGAAGGT GGGGATGACGTCAAATCAGCACGGCCCTTACGTCCGGGGCTACACACGTGTTACAATGGG GGGTACAGAA GGCAGCTACCTGGTGACAGGATGCTAATCCCAAAAGCCTCTCTCAGTTCGGATCGAAGTC TGCAACCCGA CTTCGTCAAGCTGGATTCGCTAGTAATCGCGCATCAGCCATGGCGCGGTGAATACGTTCC CGGGCCTTGT ACACACCGCCCGTCAAGCCATGAAAGCCGGGGGTACCTGAAGTACGTAACCGCAAGGAGC GTCCTAGGGT AAAACTGGTAATTGGGGCTAAGTCGTAACAAGGTAGCCGTACCGGAAGGTGCGGCTGGAA CACCTCCTTT CTTAGGGATAACAGGGTAATGAGTCGACAAaatgatacggcgaccaccgagatctacact ctttccctac acgacgctcttccgatctCAAGTCGCATCCGAATATTAagatcggaagagcacacgtctg aactccagtc acAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC

>232_Bfr_Ful1_ AG_ CT_CAT

ATGAACGCTAGCTACAGGCTTAACACATGCAAGTCGAGGGGCATCAGGAAGAAAGCTTGC TTTCTTTGCT GGCGACCGGCGCACGGGTGAGTAACACGTATCCAACCTGCCCTTTACTCGGGGATAGCCT TTCGAAAGAA AGATTAATACCCGATAGCATAATGATTCCGCATGGTTTCATTATTAAAGGATTCCGGTAA AGGATGGGGA TGCGTTCCATTAGGTTGTTGGTGAGGTAACGGCTCACCAAGCCTTCGATGGATAGGGGTT CTGAGAGGAA GGTCCCCCACATTGGAACTGAGACACGGTCCAAACTCCTACGGGAGGCAGCAGTGAGGAA TATTGGTACA A AGTGGGCGCTAGCCTGAACCAGCCAAGTAGCGTGAAGGATGAAGGCTCTATGGGTCGTAA ACTTCTTT TA A AAGAATAAAGTGCAGTATGTATACTGTTTTGTATGTAT A ATGAATAAGGATCGGCTAACTCCG TGCCAGCAGCCGCGGTAATACGGAGGATCCGAGCGTTATCCGGATTTATTGGGTTTAAAT CTGGGAGCGT AGGTGGACTGGTAAGTCAGTTGTGAAAGTTTGCGGCTCAACCGTAAAATTGCAGTTGATA CTGTCAGTCT TGAGTACAGTAGAGGTGGGCGGAATTCGTGGTGTAGCGGTGAAATGCTTAGATATCACGA AGAACTCCGA TTGCGAAGGCAGCTCACTGGACTGCAACTGACACTGATGCTCGAAAGTGTGGGTATCAAA CAGGATTAGA TACCCTGGTAGTCCACACAGTAAACGATGAATACTCGCTGTTTGCGA A ACAGTAAGCGGCCAAGCGAA AGCATTAAGTATTCCACCTGGCATGGTACGCCGGCAACGGTGAAACTCAAAGGAATTGAC GGGGGCCCGC ACAAGCGGAGGAACATGTGGTTTAATTCGATGATACGCGAGGAACCTTACCCGGGCTTAA ATTGCAGTGG AATGATGTGGAAACATGTCAGTGAGCAATCACCGCTGTGAAGGTGCTGCATGGTTGTCGT CAGCTCGTGC CGTGAGGTGTCGGCTTAAGTGCCATAACGAGCGCAACCCTTATCTTTAGTTACTAACAGG TTATGCTGAG GACTCTAGAGAGACTGCCGTCGTAAGATGTGAGGAAGGTGGGGATGACGTCAAATCAGCA CGGCCCTTAC GTCCGGGGCTACACACGTGTTACAATGGGGGGTACAGAAGGCAGCTAGCGGGTGACCGTA TGCTAATCCC AAAATCCTCTCTCAGTTCGGATCGAAGTCTGCAACCCGACTTCGTGAAGCTGGATTCGCT AGTAATCGCG CATCAGCCACGGCGCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCAAGCCAT GGGAGCCGGG GGTACCTGAAGTACGTAACCGCAAGGATCGTCCTAGGGTAAAACTAGGGATAACAGGGTA ATGAGTCGAC AAaatgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatct CGCGCCGCGG CAGCAATCCAagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatct cgtatgccgt cttctgcttgTTGTCGAC C Size standard sequences lllumina adapters in lower case

>233_ILL_size_150_ECO_16S

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGA GCATGCCGATGGTTTGTTAAagatcggaagagcacacgtctgaactccagtcacAATCAG TCTCGTatct cgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGAGTCGACAAca agcagaagac ggcatacgagatTGGTCAACGATAgtctcgtgggctcggagatgtgtataagagacagGC CGCCCGTCAC AGCACGTActgtctcttatacacatctgacgctgccgacgaATCACCAGGTGTgtgtaga tctcggtggt cgccgtatcattTTGTCGACTC

>234_ILL_size_300_ECO_16S

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAG ACTATCGCCTTTAGCCTCAAATTGAAGAGTTTGATCATGGCTCAGATTGAACGCTGGCGG CAGGCCTAAC ACATGCAAGTCGAACGGTAACAGGAAGAAGCTTGCTCTTTGCTGACGAGTGGCGGACGGG TGAGTAATGT CTGGGAAACTGCCTGATGGAGGGGGATAACagatcggaagagcacacgtctgaactccag tcacAATCAG TCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGA GTCGACAAca agcagaagacggcatacgagatTGGTCAACGATAgtctcgtgggctcggagatgtgtata agagacagGC AGCTGTTAGAGACGAATCctgtctcttatacacatctgacgctgccgacgaATCACCAGG TGTgtgtaga tctcggtggtcgccgtatcattTTGTCGACTC

>235_ILL_size_ 50_ECO_16S

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTG ATGTATATAGCCGGCGGCAAATTGAAGAGTTTGATCATGGCTCAGATTGAACGCTGGCGG CAGGCCTAAC ACATGCAAGTCGAACGGTAACAGGAAGAAGCTTGCTCTTTGCTGACGAGTGGCGGACGGG TGAGTAATGT CTGGGAAACTGCCTGATGGAGGGGGATAACTACTGGAAACGGTAGCTAATACCGCATAAC GTCGCAAGAC CAAAGAGGGGGACCTTCGGGCCTCTTGCCATCGGATGTGCCCAGATGGGATTAGCTAGTA GGTGGGGTAA CGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGAagatcggaagagcacacgtc tgaactccag tcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAAC AGGGTAATGA GTCGACAAcaagcagaagacggcatacgagatTGGTCAACGATAgtctcgtgggctcgga gatgtgtata agagacagGGACAAACAGAAATATCACGctgtctcttatacacatctgacgctgccgacg aATCACCAGG TGTgtgtagatctcggtggtcgccgtatcattTTGTCGACTC

>236_ILL_size_600_ECO_16S

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCA ACGGAACGTGCACTGCAGAAATTGAAGAGTTTGATCATGGCTCAGATTGAACGCTGGCGG CAGGCCTAAC ACATGCAAGTCGAACGGTAACAGGAAGAAGCTTGCTCTTTGCTGACGAGTGGCGGACGGG TGAGTAATGT CTGGGAAACTGCCTGATGGAGGGGGATAACTACTGGAAACGGTAGCTAATACCGCATAAC GTCGCAAGAC CAAAGAGGGGGACCTTCGGGCCTCTTGCCATCGGATGTGCCCAGATGGGATTAGCTAGTA GGTGGGGTAA CGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACT GAGACACGGT CCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGCACAATGGGCGCAAGCCTGATGCA GCCATGCCGC GTGTATGAAGAAGGCCTTCGGGTTGTAAAGTACTTTCAGCGGGGAGGAAGagatcggaag agcacacgtc tgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC TAGGGATAAC AGGGTAATGAGTCGACAAcaagcagaagacggcatacgagatTGGTCAACGATAgtctcg tgggctcgga gatgtgtataagagacagGGCCACCGTAAACAGTGCGActgtctcttatacacatctgac gctgccgacg aATCACCAGGTGTgtgtagatctcggtggtcgccgtatcattTTGTCGACTC

>237 ILL size 750 ECO 16S GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAA TGTGAGCGTATCAGGAGAAAATTGAAGAGTTTGATCATGGCTCAGATTGAACGCTGGCGG CAGGCCTAAC ACATGCAAGTCGAACGGTAACAGGAAGAAGCTTGCTCTTTGCTGACGAGTGGCGGACGGG TGAGTAATGT CTGGGAAACTGCCTGATGGAGGGGGATAACTACTGGAAACGGTAGCTAATACCGCATAAC GTCGCAAGAC CAAAGAGGGGGACCTTCGGGCCTCTTGCCATCGGATGTGCCCAGATGGGATTAGCTAGTA GGTGGGGTAA CGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACT GAGACACGGT CCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGCACAATGGGCGCAAGCCTGATGCA GCCATGCCGC GTGTATGAAGAAGGCCTTCGGGTTGTAAAGTACTTTCAGCGGGGAGGAAGGGAGTAAAGT TAATACCTTT GCTCATTGACGTTACCCGCAGAAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAA TACGGAGGGT GCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGA agatcggaag agcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttg TTGTCGACTC TAGGGATAACAGGGTAATGAGTCGACAAcaagcagaagacggcatacgagatTGGTCAAC GATAgtctcg tgggctcggagatgtgtataagagacagTAGCGCCCACAGCAAGTGA ctgtctcttatacacatctgac gctgccgacgaATCACCAGGTGTgtgtagatctcggtggtcgccgtatcattTTGTCGAC TC

>238_ILL_size_900_ECO_16S

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCG GCGGGTAGTACCTGTACCAAATTGAAGAGTTTGATCATGGCTCAGATTGAACGCTGGCGG CAGGCCTAAC ACATGCAAGTCGAACGGTAACAGGAAGAAGCTTGCTCTTTGCTGACGAGTGGCGGACGGG TGAGTAATGT CTGGGAAACTGCCTGATGGAGGGGGATAACTACTGGAAACGGTAGCTAATACCGCATAAC GTCGCAAGAC CAAAGAGGGGGACCTTCGGGCCTCTTGCCATCGGATGTGCCCAGATGGGATTAGCTAGTA GGTGGGGTAA CGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACT GAGACACGGT CCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGCACAATGGGCGCAAGCCTGATGCA GCCATGCCGC GTGTATGAAGAAGGCCTTCGGGTTGTAAAGTACTTTCAGCGGGGAGGAAGGGAGTAAAGT TAATACCTTT GCTCATTGACGTTACCCGCAGAAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAA TACGGAGGGT GCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGAT GTGAAATCCC CGGGCTCAACCTGGGAACTGCATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTA GAATTCCAGG TGTAGCGGTGAAATGCGTAGAGATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGA CGAAGACTGA

agatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgc cgtcttctgcttg TTGTCGACTCTAGGGATAACAGGGTAATGAGTCGACAAcaagcagaagacggcatacgag atTGGTCAAC GATAgtctcgtgggctcggagatgtgtataagagacagACAAGCCCTAATGATGATAGct gtctcttata cacatctgacgctgccgacgaATCACCAGGTGTgtgtagatctcggtggtcgccgtatca ttTTGTCGAC TC

>239_ILL_size_l050_ECO_16S

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGT TCTCCTGCTACAGAGGTTAAATTGAAGAGTTTGATCATGGCTCAGATTGAACGCTGGCGG CAGGCCTAAC ACATGCAAGTCGAACGGTAACAGGAAGAAGCTTGCTCTTTGCTGACGAGTGGCGGACGGG TGAGTAATGT CTGGGAAACTGCCTGATGGAGGGGGATAACTACTGGAAACGGTAGCTAATACCGCATAAC GTCGCAAGAC CAAAGAGGGGGACCTTCGGGCCTCTTGCCATCGGATGTGCCCAGATGGGATTAGCTAGTA GGTGGGGTAA CGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACT GAGACACGGT CCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGCACAATGGGCGCAAGCCTGATGCA GCCATGCCGC GTGTATGAAGAAGGCCTTCGGGTTGTAAAGTACTTTCAGCGGGGAGGAAGGGAGTAAAGT TAATACCTTT GCTCATTGACGTTACCCGCAGAAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAA TACGGAGGGT GCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGAT GTGAAATCCC CGGGCTCAACCTGGGAACTGCATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTA GAATTCCAGG TGTAGCGGTGAAATGCGTAGAGATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGA CGAAGACTGA CGCTCAGGTGCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGT AAACGATGTC GACTTGGAGGTTGTGCCCTTGAGGCGTGGCTTCCGGAGCTAACGCGTTAAGTCGACCGCC TGGGGAGTAC GGCCGCAAGGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatct cgtatgccgt cttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGAGTCGACAAcaagcagaagac ggcatacgag atTGGTCAACGATAgtctcgtgggctcggagatgtgtataagagacagACGCTGATAAAT ATCGAGTTct gtctcttatacacatctgacgctgccgacgaATCACCAGGTGTgtgtagatctcggtggt cgccgtatca ttTTGTCGACTC

>240_ILL_size_l200_ECO_16S

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCC CACATGCCGGAACGCACCAAATTGAAGAGTTTGATCATGGCTCAGATTGAACGCTGGCGG CAGGCCTAAC ACATGCAAGTCGAACGGTAACAGGAAGAAGCTTGCTCTTTGCTGACGAGTGGCGGACGGG TGAGTAATGT CTGGGAAACTGCCTGATGGAGGGGGATAACTACTGGAAACGGTAGCTAATACCGCATAAC GTCGCAAGAC CAAAGAGGGGGACCTTCGGGCCTCTTGCCATCGGATGTGCCCAGATGGGATTAGCTAGTA GGTGGGGTAA CGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACT GAGACACGGT CCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGCACAATGGGCGCAAGCCTGATGCA GCCATGCCGC GTGTATGAAGAAGGCCTTCGGGTTGTAAAGTACTTTCAGCGGGGAGGAAGGGAGTAAAGT TAATACCTTT GCTCATTGACGTTACCCGCAGAAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAA TACGGAGGGT GCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGAT GTGAAATCCC CGGGCTCAACCTGGGAACTGCATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTA GAATTCCAGG TGTAGCGGTGAAATGCGTAGAGATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGA CGAAGACTGA CGCTCAGGTGCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGT AAACGATGTC GACTTGGAGGTTGTGCCCTTGAGGCGTGGCTTCCGGAGCTAACGCGTTAAGTCGACCGCC TGGGGAGTAC GGCCGCAAGGTTAAAACTCAAATGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTG GTTTAATTCG ATGCAACGCGAAGAACCTTACCTGGTCTTGACATCCACGGAAGTTTTCAGAGATGAGAAT GTGCCTTCGG GAACCGTGAGACAGGTGCTGagatcggaagagcacacgtctgaactccagtcacAATCAG TCTCGTatct cgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGAGTCGACAAca agcagaagac ggcatacgagatTGGTCAACGATAgtctcgtgggctcggagatgtgtataagagacagTC GTTCTAAGAG GGTGCCAGctgtctcttatacacatctgacgctgccgacgaATCACCAGGTGTgtgtaga tctcggtggt cgccgtatcattTTGTCGACTC

>2 1_ILL_size_l350_ECO_16S

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCC CAAACGTCGGAAAGGTCTAAATTGAAGAGTTTGATCATGGCTCAGATTGAACGCTGGCGG CAGGCCTAAC ACATGCAAGTCGAACGGTAACAGGAAGAAGCTTGCTCTTTGCTGACGAGTGGCGGACGGG TGAGTAATGT CTGGGAAACTGCCTGATGGAGGGGGATAACTACTGGAAACGGTAGCTAATACCGCATAAC GTCGCAAGAC CAAAGAGGGGGACCTTCGGGCCTCTTGCCATCGGATGTGCCCAGATGGGATTAGCTAGTA GGTGGGGTAA CGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACT GAGACACGGT CCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGCACAATGGGCGCAAGCCTGATGCA GCCATGCCGC GTGTATGAAGAAGGCCTTCGGGTTGTAAAGTACTTTCAGCGGGGAGGAAGGGAGTAAAGT TAATACCTTT GCTCATTGACGTTACCCGCAGAAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAA TACGGAGGGT GCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGAT GTGAAATCCC CGGGCTCAACCTGGGAACTGCATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTA GAATTCCAGG TGTAGCGGTGAAATGCGTAGAGATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGA CGAAGACTGA CGCTCAGGTGCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGT AAACGATGTC GACTTGGAGGTTGTGCCCTTGAGGCGTGGCTTCCGGAGCTAACGCGTTAAGTCGACCGCC TGGGGAGTAC GGCCGCAAGGTTAAAACTCAAATGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTG GTTTAATTCG ATGCAACGCGAAGAACCTTACCTGGTCTTGACATCCACGGAAGTTTTCAGAGATGAGAAT GTGCCTTCGG GAACCGTGAGACAGGTGCTGCATGGCTGTCGTCAGCTCGTGTTGTGAAATGTTGGGTTAA GTCCCGCAAC GAGCGCAACCCTTATCCTTTGTTGCCAGCGGTCCGGCCGGGAACTCAAAGGAGACTGCCA GTGATAAACT GGAGGAAGGTGGGGATGACGTCAAGTCATCagatcggaagagcacacgtctgaactccag tcacAATCAG TCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGA GTCGACAAca agcagaagacggcatacgagatTGGTCAACGATAgtctcgtgggctcggagatgtgtata agagacagTG CAACTCAACGGTCCCAGGctgtctcttatacacatctgacgctgccgacgaATCACCAGG TGTgtgtaga tctcggtggtcgccgtatcattTTGTCGACTC

>242 ILL size 1500 ECO 16S GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAA CCAGAGGATGAGACACGTAAATTGAAGAGTTTGATCATGGCTCAGATTGAACGCTGGCGG CAGGCCTAAC ACATGCAAGTCGAACGGTAACAGGAAGAAGCTTGCTCTTTGCTGACGAGTGGCGGACGGG TGAGTAATGT CTGGGAAACTGCCTGATGGAGGGGGATAACTACTGGAAACGGTAGCTAATACCGCATAAC GTCGCAAGAC CAAAGAGGGGGACCTTCGGGCCTCTTGCCATCGGATGTGCCCAGATGGGATTAGCTAGTA GGTGGGGTAA CGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACT GAGACACGGT CCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGCACAATGGGCGCAAGCCTGATGCA GCCATGCCGC GTGTATGAAGAAGGCCTTCGGGTTGTAAAGTACTTTCAGCGGGGAGGAAGGGAGTAAAGT TAATACCTTT GCTCATTGACGTTACCCGCAGAAGAAGCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAA TACGGAGGGT GCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGCACGCAGGCGGTTTGTTAAGTCAGAT GTGAAATCCC CGGGCTCAACCTGGGAACTGCATCTGATACTGGCAAGCTTGAGTCTCGTAGAGGGGGGTA GAATTCCAGG TGTAGCGGTGAAATGCGTAGAGATCTGGAGGAATACCGGTGGCGAAGGCGGCCCCCTGGA CGAAGACTGA CGCTCAGGTGCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGT AAACGATGTC GACTTGGAGGTTGTGCCCTTGAGGCGTGGCTTCCGGAGCTAACGCGTTAAGTCGACCGCC TGGGGAGTAC GGCCGCAAGGTTAAAACTCAAATGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTG GTTTAATTCG ATGCAACGCGAAGAACCTTACCTGGTCTTGACATCCACGGAAGTTTTCAGAGATGAGAAT GTGCCTTCGG GAACCGTGAGACAGGTGCTGCATGGCTGTCGTCAGCTCGTGTTGTGAAATGTTGGGTTAA GTCCCGCAAC GAGCGCAACCCTTATCCTTTGTTGCCAGCGGTCCGGCCGGGAACTCAAAGGAGACTGCCA GTGATAAACT GGAGGAAGGTGGGGATGACGTCAAGTCATCATGGCCCTTACGACCAGGGCTACACACGTG CTACAATGGC GCATACAAAGAGAAGCGACCTCGCGAGAGCAAGCGGACCTCATAAAGTGCGTCGTAGTCC GGATTGGAGT CTGCAACTCGACTCCATGAAGTCGGAATCGCTAGTAATCGagatcggaagagcacacgtc tgaactccag tcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAAC AGGGTAATGA GTCGACAAcaagcagaagacggcatacgagatTGGTCAACGATAgtctcgtgggctcgga gatgtgtata agagacagTGGCTCCTTCTGTTAAGGCActgtctcttatacacatctgacgctgccgacg aATCACCAGG TGTgtgtagatctcggtggtcgccgtatcattTTGTCGACTC

>243_ILL_size_150_GAPDH

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCC ACCTAACAGACACTTGTTAGagatcggaagagcacacgtctgaactccagtcacAATCAG TCTCGTatct cgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGAGTCGACAAca agcagaagac ggcatacgagatTGGTCAACGATAgtctcgtgggctcggagatgtgtataagagacagGG TAAGTAGTGC GTGAGGGTctgtctcttatacacatctgacgctgccgacgaATCACCAGGTGTgtgtaga tctcggtggt cgccgtatcattTTGTCGACTC

>244_ILL_size_300_GAPDH

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGC GCCCTGGCGCCGGCCAGGTTTTTAAGGCGCT A A AATCAAACCCTTTGTAAAAATTAAAGTTTTAAAT GGAATTCTAATCGATTTATTTCACATTAGCTTTATTTAAGTGTGACCTACGCAGAAAGCT AGCGAAA AC TCATCAACCCTCCCCCGCCATCGCAGCGCCagatcggaagagcacacgtctgaactccag tcacAATCAG TCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGA GTCGACAAca agcagaagacggcatacgagatTGGTCAACGATAgtctcgtgggctcggagatgtgtata agagacagAC GGAGTAGTACGGTCAAATctgtctcttatacacatctgacgctgccgacgaATCACCAGG TGTgtgtaga tctcggtggtcgccgtatcattTTGTCGACTC

>245_ILL_size_450_GAPDH

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCT ACTAAACAATAATGGGAATTTTTAAGGCGCTTATATAATCAAACCCTTTGTAAAAATTAA AGTTTTAAAT GGAATTCTAATCGATTTATTTCACATTAGCTTTATTTAAGTGTGACCTACGCAGAAAGCT AGCGAAATAC TCATCAACCCTCCCCCGCCATCGCAGCGCCATTCTCCTAATTTGCGAAAAAAGCTCCGGG AAAAGGAAAA AGCGGCAGTCGTAATAGCGAACTGAAACTGAACGAGAGTAAAAGTGAAAAGACAGCAGGA ACTCAGCCAT GTCGAAGATCGGAATTAACGGATTTGGCCGCATCGGCCGCagatcggaagagcacacgtc tgaactccag tcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAAC AGGGTAATGA GTCGACAAcaagcagaagacggcatacgagatTGGTCAACGATAgtctcgtgggctcgga gatgtgtata agagacagGGATTGCCACACGCGATAGActgtctcttatacacatctgacgctgccgacg aATCACCAGG TGTgtgtagatctcggtggtcgccgtatcattTTGTCGACTC

>246_ILL_size_600_GAPDH

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCT GTGTTCGGCCTTCGAGATTTTTTAAGGCGCT A A AATCAAACCCTTTGTAAAAATTAAAGTTTTAAAT GGAATTCTAATCGATTTATTTCACATTAGCTTTATTTAAGTGTGACCTACGCAGAAAGCT AGCGAAA AC TCATCAACCCTCCCCCGCCATCGCAGCGCCATTCTCCTAATTTGCGAAAAAAGCTCCGGG AAAAGGAAAA AGCGGCAGTCGTAATAGCGAACTGAAACTGAACGAGAGTAAAAGTGAAAAGACAGCAGGA ACTCAGCCAT GTCGAAGATCGGAATTAACGGATTTGGCCGCATCGGCCGCTTGGTGCTCCGCGCCGCCAT CGATAAGGGC GCCTCCGTGGTGGCCGTCAACGATCCCTTCATCGATGTCAACTACATGGTTTACCTGTTT AAATTCGACT CGACTCACGGTCGTTTCAAGGGCACCGTTGCGGCTGAGGGCGGATTCCTGagatcggaag agcacacgtc tgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC TAGGGATAAC AGGGTAATGAGTCGACAAcaagcagaagacggcatacgagatTGGTCAACGATAgtctcg tgggctcgga gatgtgtataagagacagTAGTGTTTAAGTGCGAACCTctgtctcttatacacatctgac gctgccgacg aATCACCAGGTGTgtgtagatctcggtggtcgccgtatcattTTGTCGACTC

>247_ILL_size_750_GAPDH

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTA TGAAATCGGAGTATCAGTTTTTTAAGGCGCT A A AATCAAACCCTTTGTAAAAATTAAAGTTTTAAAT GGAATTCTAATCGATTTATTTCACATTAGCTTTATTTAAGTGTGACCTACGCAGAAAGCT AGCGAAA AC TCATCAACCCTCCCCCGCCATCGCAGCGCCATTCTCCTAATTTGCGAAAAAAGCTCCGGG AAAAGGAAAA AGCGGCAGTCGTAATAGCGAACTGAAACTGAACGAGAGTAAAAGTGAAAAGACAGCAGGA ACTCAGCCAT GTCGAAGATCGGAATTAACGGATTTGGCCGCATCGGCCGCTTGGTGCTCCGCGCCGCCAT CGATAAGGGC GCCTCCGTGGTGGCCGTCAACGATCCCTTCATCGATGTCAACTACATGGTTTACCTGTTT AAATTCGACT CGACTCACGGTCGTTTCAAGGGCACCGTTGCGGCTGAGGGCGGATTCCTGGTGGTGAACG GCCAGAAGAT CACCGTGTTCAGCGAGCGCGACCCGGCCAACATCAACTGGGCCAGTGCTGGAGCCGAGTA TGTGGTGGAG TCCACCGGAGTGTTCACCACCATTGACAAGGCGTCCACCCACTTGAAGGGCGGCGCCAAG agatcggaag agcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttg TTGTCGACTC TAGGGATAACAGGGTAATGAGTCGACAAcaagcagaagacggcatacgagatTGGTCAAC GATAgtctcg tgggctcggagatgtgtataagagacagAAGAGCCCTGCCTCAAGTCCctgtctcttata cacatctgac gctgccgacgaATCACCAGGTGTgtgtagatctcggtggtcgccgtatcattTTGTCGAC TC

>248_ILL_size_900_GAPDH

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAG CCAAACGTCTGAACAGATTTTTTAAGGCGCT A A AATCAAACCCTTTGTAAAAATTAAAGTTTTAAAT GGAATTCTAATCGATTTATTTCACATTAGCTTTATTTAAGTGTGACCTACGCAGAAAGCT AGCGAAA AC TCATCAACCCTCCCCCGCCATCGCAGCGCCATTCTCCTAATTTGCGAAAAAAGCTCCGGG AAAAGGAAAA AGCGGCAGTCGTAATAGCGAACTGAAACTGAACGAGAGTAAAAGTGAAAAGACAGCAGGA ACTCAGCCAT GTCGAAGATCGGAATTAACGGATTTGGCCGCATCGGCCGCTTGGTGCTCCGCGCCGCCAT CGATAAGGGC GCCTCCGTGGTGGCCGTCAACGATCCCTTCATCGATGTCAACTACATGGTTTACCTGTTT AAATTCGACT CGACTCACGGTCGTTTCAAGGGCACCGTTGCGGCTGAGGGCGGATTCCTGGTGGTGAACG GCCAGAAGAT CACCGTGTTCAGCGAGCGCGACCCGGCCAACATCAACTGGGCCAGTGCTGGAGCCGAGTA TGTGGTGGAG TCCACCGGAGTGTTCACCACCATTGACAAGGCGTCCACCCACTTGAAGGGCGGCGCCAAG AAGGTCATCA TCTCGGCCCCATCCGCCGATGCGCCCATGTTCGTGTGCGGCGTTAACCTGGACGCCTACA GCCCCGACAT GAAGGTGGTCTCCAACGCCTCGTGCACCACCAACTGCCTGGCTCCCCTGGCCAAGGTCAT CAATGACAAC

agatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgc cgtcttctgcttg TTGTCGACTCTAGGGATAACAGGGTAATGAGTCGACAAcaagcagaagacggcatacgag atTGGTCAAC GATAgtctcgtgggctcggagatgtgtataagagacagCCGTGTCGAACGCCACTCGAct gtctcttata cacatctgacgctgccgacgaATCACCAGGTGTgtgtagatctcggtggtcgccgtatca ttTTGTCGAC TC >249_ILL_size_1050_GAPDH

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCA AGTCCAGGGCACTCGCCTTTTTTAAGGCGCT A A AATCAAACCCTTTGTAAAAATTAAAGTTTTAAAT GGAATTCTAATCGATTTATTTCACATTAGCTTTATTTAAGTGTGACCTACGCAGAAAGCT AGCGAAA AC TCATCAACCCTCCCCCGCCATCGCAGCGCCATTCTCCTAATTTGCGAAAAAAGCTCCGGG AAAAGGAAAA AGCGGCAGTCGTAATAGCGAACTGAAACTGAACGAGAGTAAAAGTGAAAAGACAGCAGGA ACTCAGCCAT GTCGAAGATCGGAATTAACGGATTTGGCCGCATCGGCCGCTTGGTGCTCCGCGCCGCCAT CGATAAGGGC GCCTCCGTGGTGGCCGTCAACGATCCCTTCATCGATGTCAACTACATGGTTTACCTGTTT AAATTCGACT CGACTCACGGTCGTTTCAAGGGCACCGTTGCGGCTGAGGGCGGATTCCTGGTGGTGAACG GCCAGAAGAT CACCGTGTTCAGCGAGCGCGACCCGGCCAACATCAACTGGGCCAGTGCTGGAGCCGAGTA TGTGGTGGAG TCCACCGGAGTGTTCACCACCATTGACAAGGCGTCCACCCACTTGAAGGGCGGCGCCAAG AAGGTCATCA TCTCGGCCCCATCCGCCGATGCGCCCATGTTCGTGTGCGGCGTTAACCTGGACGCCTACA GCCCCGACAT GAAGGTGGTCTCCAACGCCTCGTGCACCACCAACTGCCTGGCTCCCCTGGCCAAGGTCAT CAATGACAAC TTCGAGATCGTCGAGGGTCTGATGACCACCGTGCACGCCACCACTGCCACCCAGAAGACC GTCGACGGTC CCTCTGGCAAACTGTGGCGCGATGGACGTGGCGCCGCCCAGAACATCATCCCGGCCGCCA CCGGAGCCGC CAAGGCTGTGagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatct cgtatgccgt cttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGAGTCGACAAcaagcagaagac ggcatacgag atTGGTCAACGATAgtctcgtgggctcggagatgtgtataagagacagTATGCCATGGGC TTTCGAACct gtctcttatacacatctgacgctgccgacgaATCACCAGGTGTgtgtagatctcggtggt cgccgtatca ttTTGTCGACTC

>250_ILL_size_1200_GAPDH

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAT TTCTACTCATAGGTTCATTTTTTAAGGCGCT A A AATCAAACCCTTTGTAAAAATTAAAGTTTTAAAT GGAATTCTAATCGATTTATTTCACATTAGCTTTATTTAAGTGTGACCTACGCAGAAAGCT AGCGAAA AC TCATCAACCCTCCCCCGCCATCGCAGCGCCATTCTCCTAATTTGCGAAAAAAGCTCCGGG AAAAGGAAAA AGCGGCAGTCGTAATAGCGAACTGAAACTGAACGAGAGTAAAAGTGAAAAGACAGCAGGA ACTCAGCCAT GTCGAAGATCGGAATTAACGGATTTGGCCGCATCGGCCGCTTGGTGCTCCGCGCCGCCAT CGATAAGGGC GCCTCCGTGGTGGCCGTCAACGATCCCTTCATCGATGTCAACTACATGGTTTACCTGTTT AAATTCGACT CGACTCACGGTCGTTTCAAGGGCACCGTTGCGGCTGAGGGCGGATTCCTGGTGGTGAACG GCCAGAAGAT CACCGTGTTCAGCGAGCGCGACCCGGCCAACATCAACTGGGCCAGTGCTGGAGCCGAGTA TGTGGTGGAG TCCACCGGAGTGTTCACCACCATTGACAAGGCGTCCACCCACTTGAAGGGCGGCGCCAAG AAGGTCATCA TCTCGGCCCCATCCGCCGATGCGCCCATGTTCGTGTGCGGCGTTAACCTGGACGCCTACA GCCCCGACAT GAAGGTGGTCTCCAACGCCTCGTGCACCACCAACTGCCTGGCTCCCCTGGCCAAGGTCAT CAATGACAAC TTCGAGATCGTCGAGGGTCTGATGACCACCGTGCACGCCACCACTGCCACCCAGAAGACC GTCGACGGTC CCTCTGGCAAACTGTGGCGCGATGGACGTGGCGCCGCCCAGAACATCATCCCGGCCGCCA CCGGAGCCGC CAAGGCTGTGGGCAAGGTCATCCCCGCCCTGAACGGCAAGCTGACCGGCATGGCTTTCCG CGTGCCCACG CCCAATGTCTCCGTTGTGGATCTTACCGTCCGCTTGGGCAAGGGAGCCACCTATGACGAA ATCAAGGC A AGGTCGAGGAGGCCTCCAAGagatcggaagagcacacgtctgaactccagtcacAATCAG TCTCGTatct cgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGAGTCGACAAca agcagaagac ggcatacgagatTGGTCAACGATAgtctcgtgggctcggagatgtgtataagagacagTT AGATCAGATA GAAGGTACctgtctcttatacacatctgacgctgccgacgaATCACCAGGTGTgtgtaga tctcggtggt cgccgtatcattTTGTCGACTC

>251_ILL_size_1350_GAPDH

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTT AAGACTGTTAGTTCGAGGTTTTTAAGGCGCT A A AATCAAACCCTTTGTAAAAATTAAAGTTTTAAAT GGAATTCTAATCGATTTATTTCACATTAGCTTTATTTAAGTGTGACCTACGCAGAAAGCT AGCGAAA AC TCATCAACCCTCCCCCGCCATCGCAGCGCCATTCTCCTAATTTGCGAAAAAAGCTCCGGG AAAAGGAAAA AGCGGCAGTCGTAATAGCGAACTGAAACTGAACGAGAGTAAAAGTGAAAAGACAGCAGGA ACTCAGCCAT GTCGAAGATCGGAATTAACGGATTTGGCCGCATCGGCCGCTTGGTGCTCCGCGCCGCCAT CGATAAGGGC GCCTCCGTGGTGGCCGTCAACGATCCCTTCATCGATGTCAACTACATGGTTTACCTGTTT AAATTCGACT CGACTCACGGTCGTTTCAAGGGCACCGTTGCGGCTGAGGGCGGATTCCTGGTGGTGAACG GCCAGAAGAT CACCGTGTTCAGCGAGCGCGACCCGGCCAACATCAACTGGGCCAGTGCTGGAGCCGAGTA TGTGGTGGAG TCCACCGGAGTGTTCACCACCATTGACAAGGCGTCCACCCACTTGAAGGGCGGCGCCAAG AAGGTCATCA TCTCGGCCCCATCCGCCGATGCGCCCATGTTCGTGTGCGGCGTTAACCTGGACGCCTACA GCCCCGACAT GAAGGTGGTCTCCAACGCCTCGTGCACCACCAACTGCCTGGCTCCCCTGGCCAAGGTCAT CAATGACAAC TTCGAGATCGTCGAGGGTCTGATGACCACCGTGCACGCCACCACTGCCACCCAGAAGACC GTCGACGGTC CCTCTGGCAAACTGTGGCGCGATGGACGTGGCGCCGCCCAGAACATCATCCCGGCCGCCA CCGGAGCCGC CAAGGCTGTGGGCAAGGTCATCCCCGCCCTGAACGGCAAGCTGACCGGCATGGCTTTCCG CGTGCCCACG CCCAATGTCTCCGTTGTGGATCTTACCGTCCGCTTGGGCAAGGGAGCCACCTATGACGAA ATCAAGGC A AGGTCGAGGAGGCCTCCAAGGGACCCCTGAAGGGAATCCTGGGCTACACCGATGAGGAGG TGGTCTCCAC CGACTTCTTCAGCGACACCCATTCGTCTGTGTTCGACGCCAAGGCTGGCATTTCGCTGAA CGATAAGTTC GTCAAGCTAATCTCGTGGTACGACAACGAGagatcggaagagcacacgtctgaactccag tcacAATCAG TCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGA GTCGACAAca agcagaagacggcatacgagatTGGTCAACGATAgtctcgtgggctcggagatgtgtata agagacagTT TATATTGTTCTGCCTCACctgtctcttatacacatctgacgctgccgacgaATCACCAGG TGTgtgtaga tctcggtggtcgccgtatcattTTGTCGACTC

>252_ILL_size_1500_GAPDH

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAG TTACTGGCTTTGTAGGATTTTTTAAGGCGCT A A AATCAAACCCTTTGTAAAAATTAAAGTTTTAAAT GGAATTCTAATCGATTTATTTCACATTAGCTTTATTTAAGTGTGACCTACGCAGAAAGCT AGCGAAA AC TCATCAACCCTCCCCCGCCATCGCAGCGCCATTCTCCTAATTTGCGAAAAAAGCTCCGGG AAAAGGAAAA AGCGGCAGTCGTAATAGCGAACTGAAACTGAACGAGAGTAAAAGTGAAAAGACAGCAGGA ACTCAGCCAT GTCGAAGATCGGAATTAACGGATTTGGCCGCATCGGCCGCTTGGTGCTCCGCGCCGCCAT CGATAAGGGC GCCTCCGTGGTGGCCGTCAACGATCCCTTCATCGATGTCAACTACATGGTTTACCTGTTT AAATTCGACT CGACTCACGGTCGTTTCAAGGGCACCGTTGCGGCTGAGGGCGGATTCCTGGTGGTGAACG GCCAGAAGAT CACCGTGTTCAGCGAGCGCGACCCGGCCAACATCAACTGGGCCAGTGCTGGAGCCGAGTA TGTGGTGGAG TCCACCGGAGTGTTCACCACCATTGACAAGGCGTCCACCCACTTGAAGGGCGGCGCCAAG AAGGTCATCA TCTCGGCCCCATCCGCCGATGCGCCCATGTTCGTGTGCGGCGTTAACCTGGACGCCTACA GCCCCGACAT GAAGGTGGTCTCCAACGCCTCGTGCACCACCAACTGCCTGGCTCCCCTGGCCAAGGTCAT CAATGACAAC TTCGAGATCGTCGAGGGTCTGATGACCACCGTGCACGCCACCACTGCCACCCAGAAGACC GTCGACGGTC CCTCTGGCAAACTGTGGCGCGATGGACGTGGCGCCGCCCAGAACATCATCCCGGCCGCCA CCGGAGCCGC CAAGGCTGTGGGCAAGGTCATCCCCGCCCTGAACGGCAAGCTGACCGGCATGGCTTTCCG CGTGCCCACG CCCAATGTCTCCGTTGTGGATCTTACCGTCCGCTTGGGCAAGGGAGCCACCTATGACGAA ATCAAGGC A AGGTCGAGGAGGCCTCCAAGGGACCCCTGAAGGGAATCCTGGGCTACACCGATGAGGAGG TGGTCTCCAC CGACTTCTTCAGCGACACCCATTCGTCTGTGTTCGACGCCAAGGCTGGCATTTCGCTGAA CGATAAGTTC GTCAAGCTAATCTCGTGGTACGACAACGAGTTCGGTTACTCCAACCGCGTCATCGACCTG ATCAAG A A TGCAGAGCAAGGACTAAACTAGCCAAAACTATCGTACAAACCCGGCGCCCAGCAGCTGGT CGGGAATCAC TGTTGCATAATCCGCAAGGGGCGCAATTGAGGATGCTTTTagatcggaagagcacacgtc tgaactccag tcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAAC AGGGTAATGA GTCGACAAcaagcagaagacggcatacgagatTGGTCAACGATAgtctcgtgggctcgga gatgtgtata agagacagTGTGGCTCCAATTGCTGCAActgtctcttatacacatctgacgctgccgacg aATCACCAGG TGTgtgtagatctcggtggtcgccgtatcattTTGTCGACTC

>253_ILL_size_l50_TUB

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctAA CGGTTGATGGGCCTGGTATCagatcggaagagcacacgtctgaactccagtcacAATCAG TCTCGTatct cgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGAGTCGACAAca agcagaagac ggcatacgagatTGGTCAACGATAgtctcgtgggctcggagatgtgtataagagacagCA CGCGTACGTG CTATCTTCctgtctcttatacacatctgacgctgccgacgaATCACCAGGTGTgtgtaga tctcggtggt cgccgtatcattTTGTCGACTC >254_ILL_size_300_TUB

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCG TTGTGTTTGCTGGCGCAATCATATTCGTTTTACGTTTGTCAAGCCTCATAGCCGGCAGTT CGAACGTATA CGCTCTCTGAGTCAGACCTCGAAATCGTAGCTCTACACAATTCTGTGAATTTTCCTTGTC GCGTGTGAAA CACTTCCAATAAAAACTCAATATGCGTGAAagatcggaagagcacacgtctgaactccag tcacAATCAG TCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGA GTCGACAAca agcagaagacggcatacgagatTGGTCAACGATAgtctcgtgggctcggagatgtgtata agagacagTC GCGACCAAATGGTCAGTCctgtctcttatacacatctgacgctgccgacgaATCACCAGG TGTgtgtaga tctcggtggtcgccgtatcattTTGTCGACTC

>255_ILL_size_ 50_TUB

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGG CTGACGGTTGAGAGGGATTCATATTCGTTTTACGTTTGTCAAGCCTCATAGCCGGCAGTT CGAACGTATA CGCTCTCTGAGTCAGACCTCGAAATCGTAGCTCTACACAATTCTGTGAATTTTCCTTGTC GCGTGTGAAA CACTTCCAATAAAAACTCAATATGCGTGAATGTATCTCTATCCATGTTGGTCAGGCTGGT GTCCAGATTG GAAACGCCTGCTGGGAGCTCTACTGCTTGGAGCACGGCATCCAGCCCGATGGCCAGATGC CGTCTGACAA GACCGTGGGCGGAGGTGATGACTCGTTCAACACCTTCTTCagatcggaagagcacacgtc tgaactccag tcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAAC AGGGTAATGA GTCGACAAcaagcagaagacggcatacgagatTGGTCAACGATAgtctcgtgggctcgga gatgtgtata agagacagTTCGGCAATCAGAAAGGGTActgtctcttatacacatctgacgctgccgacg aATCACCAGG TGTgtgtagatctcggtggtcgccgtatcattTTGTCGACTC

>256_ILL_size_600_TUB

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTT CACATCGCCTTGACCTTATCATATTCGTTTTACGTTTGTCAAGCCTCATAGCCGGCAGTT CGAACGTATA CGCTCTCTGAGTCAGACCTCGAAATCGTAGCTCTACACAATTCTGTGAATTTTCCTTGTC GCGTGTGAAA CACTTCCAATAAAAACTCAATATGCGTGAATGTATCTCTATCCATGTTGGTCAGGCTGGT GTCCAGATTG GAAACGCCTGCTGGGAGCTCTACTGCTTGGAGCACGGCATCCAGCCCGATGGCCAGATGC CGTCTGACAA GACCGTGGGCGGAGGTGATGACTCGTTCAACACCTTCTTCAGCGAGACTGGAGCTGGCAA GCACGTGCCC CGCGCCGTGTTTGTGGATCTGGAACCCACTGTGGTCGATGAGGTCCGTACCGGAACCTAC CGTCAGCTGT TCCACCCCGAGCAGCTGATCACTGGTAAGGAGGATGCGGCCAACAACTACagatcggaag agcacacgtc tgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTC TAGGGATAAC AGGGTAATGAGTCGACAAcaagcagaagacggcatacgagatTGGTCAACGATAgtctcg tgggctcgga gatgtgtataagagacagTTAGGACCGGATTAGGTTCActgtctcttatacacatctgac gctgccgacg aATCACCAGGTGTgtgtagatctcggtggtcgccgtatcattTTGTCGACTC

>257_ILL_size_750_TUB

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTG GTACTGCCTCCTGGCCTCTCATATTCGTTTTACGTTTGTCAAGCCTCATAGCCGGCAGTT CGAACGTATA CGCTCTCTGAGTCAGACCTCGAAATCGTAGCTCTACACAATTCTGTGAATTTTCCTTGTC GCGTGTGAAA CACTTCCAATAAAAACTCAATATGCGTGAATGTATCTCTATCCATGTTGGTCAGGCTGGT GTCCAGATTG GAAACGCCTGCTGGGAGCTCTACTGCTTGGAGCACGGCATCCAGCCCGATGGCCAGATGC CGTCTGACAA GACCGTGGGCGGAGGTGATGACTCGTTCAACACCTTCTTCAGCGAGACTGGAGCTGGCAA GCACGTGCCC CGCGCCGTGTTTGTGGATCTGGAACCCACTGTGGTCGATGAGGTCCGTACCGGAACCTAC CGTCAGCTGT TCCACCCCGAGCAGCTGATCACTGGTAAGGAGGATGCGGCCAACAACTACGCCCGTGGCC ACTACACCAT CGGCAAGGAGATCGTCGATCTGGTTCTGGACAGGATCCGCAAGCTGGCCGATCAGTGCAC CGGTCTGCAG GGCTTCCTCATCTTCCACTCGTTCGGTGGAGGTACCGGCTCCGGCTTCACCTCGCTGCTG agatcggaag agcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgccgtcttctgcttg TTGTCGACTC TAGGGATAACAGGGTAATGAGTCGACAAcaagcagaagacggcatacgagatTGGTCAAC GATAgtctcg tgggctcggagatgtgtataagagacagAGACCTCGGACGAGGCTCACctgtctcttata cacatctgac gctgccgacgaATCACCAGGTGTgtgtagatctcggtggtcgccgtatcattTTGTCGAC TC >258_ILL_size_900_TUB

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCG ACCGTCATTGACGGCCCTTCATATTCGTTTTACGTTTGTCAAGCCTCATAGCCGGCAGTT CGAACGTATA CGCTCTCTGAGTCAGACCTCGAAATCGTAGCTCTACACAATTCTGTGAATTTTCCTTGTC GCGTGTGAAA CACTTCCAATAAAAACTCAATATGCGTGAATGTATCTCTATCCATGTTGGTCAGGCTGGT GTCCAGATTG GAAACGCCTGCTGGGAGCTCTACTGCTTGGAGCACGGCATCCAGCCCGATGGCCAGATGC CGTCTGACAA GACCGTGGGCGGAGGTGATGACTCGTTCAACACCTTCTTCAGCGAGACTGGAGCTGGCAA GCACGTGCCC CGCGCCGTGTTTGTGGATCTGGAACCCACTGTGGTCGATGAGGTCCGTACCGGAACCTAC CGTCAGCTGT TCCACCCCGAGCAGCTGATCACTGGTAAGGAGGATGCGGCCAACAACTACGCCCGTGGCC ACTACACCAT CGGCAAGGAGATCGTCGATCTGGTTCTGGACAGGATCCGCAAGCTGGCCGATCAGTGCAC CGGTCTGCAG GGCTTCCTCATCTTCCACTCGTTCGGTGGAGGTACCGGCTCCGGCTTCACCTCGCTGCTG ATGGAGCGTC TCTCCGTGGACTACGGCAAGAAGTCCAAGCTGGAGTTCGCCATCTACCCAGCCCCCCAGG TGTCCACTGC CGTGGTCGAGCCCTACAACTCCATCCTGACCACCCACACCACCCTGGAGCATTCCGACTG CGCCTTCATG

agatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatctcgtatgc cgtcttctgcttg TTGTCGACTCTAGGGATAACAGGGTAATGAGTCGACAAcaagcagaagacggcatacgag atTGGTCAAC GATAgtctcgtgggctcggagatgtgtataagagacagACCTGAAATACACAGTAACCct gtctcttata cacatctgacgctgccgacgaATCACCAGGTGTgtgtagatctcggtggtcgccgtatca ttTTGTCGAC TC

>259_ILL_size_l050_TUB

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTA TCACCAGGGATGCATTGATCATATTCGTTTTACGTTTGTCAAGCCTCATAGCCGGCAGTT CGAACGTATA CGCTCTCTGAGTCAGACCTCGAAATCGTAGCTCTACACAATTCTGTGAATTTTCCTTGTC GCGTGTGAAA CACTTCCAATAAAAACTCAATATGCGTGAATGTATCTCTATCCATGTTGGTCAGGCTGGT GTCCAGATTG GAAACGCCTGCTGGGAGCTCTACTGCTTGGAGCACGGCATCCAGCCCGATGGCCAGATGC CGTCTGACAA GACCGTGGGCGGAGGTGATGACTCGTTCAACACCTTCTTCAGCGAGACTGGAGCTGGCAA GCACGTGCCC CGCGCCGTGTTTGTGGATCTGGAACCCACTGTGGTCGATGAGGTCCGTACCGGAACCTAC CGTCAGCTGT TCCACCCCGAGCAGCTGATCACTGGTAAGGAGGATGCGGCCAACAACTACGCCCGTGGCC ACTACACCAT CGGCAAGGAGATCGTCGATCTGGTTCTGGACAGGATCCGCAAGCTGGCCGATCAGTGCAC CGGTCTGCAG GGCTTCCTCATCTTCCACTCGTTCGGTGGAGGTACCGGCTCCGGCTTCACCTCGCTGCTG ATGGAGCGTC TCTCCGTGGACTACGGCAAGAAGTCCAAGCTGGAGTTCGCCATCTACCCAGCCCCCCAGG TGTCCACTGC CGTGGTCGAGCCCTACAACTCCATCCTGACCACCCACACCACCCTGGAGCATTCCGACTG CGCCTTCATG GTCGACAACGAGGCTATCTACGACATCTGCCGCCGCAATCTGGACATTGAGCGCCCCACG TACACCAACC TGAACCGTCTGATTGGCCAGATCGTGTCCTCGATTACCGCCTCTCTGCGATTCGATGGTG CCCTTAACGT GGATCTGACTagatcggaagagcacacgtctgaactccagtcacAATCAGTCTCGTatct cgtatgccgt cttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGAGTCGACAAcaagcagaagac ggcatacgag atTGGTCAACGATAgtctcgtgggctcggagatgtgtataagagacagTATTGTGTAAGA CATTACCGct gtctcttatacacatctgacgctgccgacgaATCACCAGGTGTgtgtagatctcggtggt cgccgtatca ttTTGTCGACTC

>260_ILL_size_1200_TUB

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctGG ACCATATTTAGTTATGACTCATATTCGTTTTACGTTTGTCAAGCCTCATAGCCGGCAGTT CGAACGTATA CGCTCTCTGAGTCAGACCTCGAAATCGTAGCTCTACACAATTCTGTGAATTTTCCTTGTC GCGTGTGAAA CACTTCCAATAAAAACTCAATATGCGTGAATGTATCTCTATCCATGTTGGTCAGGCTGGT GTCCAGATTG GAAACGCCTGCTGGGAGCTCTACTGCTTGGAGCACGGCATCCAGCCCGATGGCCAGATGC CGTCTGACAA GACCGTGGGCGGAGGTGATGACTCGTTCAACACCTTCTTCAGCGAGACTGGAGCTGGCAA GCACGTGCCC CGCGCCGTGTTTGTGGATCTGGAACCCACTGTGGTCGATGAGGTCCGTACCGGAACCTAC CGTCAGCTGT TCCACCCCGAGCAGCTGATCACTGGTAAGGAGGATGCGGCCAACAACTACGCCCGTGGCC ACTACACCAT CGGCAAGGAGATCGTCGATCTGGTTCTGGACAGGATCCGCAAGCTGGCCGATCAGTGCAC CGGTCTGCAG GGCTTCCTCATCTTCCACTCGTTCGGTGGAGGTACCGGCTCCGGCTTCACCTCGCTGCTG ATGGAGCGTC TCTCCGTGGACTACGGCAAGAAGTCCAAGCTGGAGTTCGCCATCTACCCAGCCCCCCAGG TGTCCACTGC CGTGGTCGAGCCCTACAACTCCATCCTGACCACCCACACCACCCTGGAGCATTCCGACTG CGCCTTCATG GTCGACAACGAGGCTATCTACGACATCTGCCGCCGCAATCTGGACATTGAGCGCCCCACG TACACCAACC TGAACCGTCTGATTGGCCAGATCGTGTCCTCGATTACCGCCTCTCTGCGATTCGATGGTG CCCTTAACGT GGATCTGACTGAGTTCCAGACCAACTTGGTGCCCTACCCACGTATTCACTTCCCTCTGGT GACCTACGCC CCCGTTATCTCCGCCGAGAAGGCCTACCACGAGCAGCTGTCGGTGGCTGAGATCACCAAC GCCTGCTTCG AGCCGGCCAACCAGATGGTCagatcggaagagcacacgtctgaactccagtcacAATCAG TCTCGTatct cgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGAGTCGACAAca agcagaagac ggcatacgagatTGGTCAACGATAgtctcgtgggctcggagatgtgtataagagacagTG GAGGTATTGC TAATAATGctgtctcttatacacatctgacgctgccgacgaATCACCAGGTGTgtgtaga tctcggtggt cgccgtatcattTTGTCGACTC

>261_ILL_size_1350_TUB

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctCG ATTTCTAGGTGTTACTTGTCATATTCGTTTTACGTTTGTCAAGCCTCATAGCCGGCAGTT CGAACGTATA CGCTCTCTGAGTCAGACCTCGAAATCGTAGCTCTACACAATTCTGTGAATTTTCCTTGTC GCGTGTGAAA CACTTCCAATAAAAACTCAATATGCGTGAATGTATCTCTATCCATGTTGGTCAGGCTGGT GTCCAGATTG GAAACGCCTGCTGGGAGCTCTACTGCTTGGAGCACGGCATCCAGCCCGATGGCCAGATGC CGTCTGACAA GACCGTGGGCGGAGGTGATGACTCGTTCAACACCTTCTTCAGCGAGACTGGAGCTGGCAA GCACGTGCCC CGCGCCGTGTTTGTGGATCTGGAACCCACTGTGGTCGATGAGGTCCGTACCGGAACCTAC CGTCAGCTGT TCCACCCCGAGCAGCTGATCACTGGTAAGGAGGATGCGGCCAACAACTACGCCCGTGGCC ACTACACCAT CGGCAAGGAGATCGTCGATCTGGTTCTGGACAGGATCCGCAAGCTGGCCGATCAGTGCAC CGGTCTGCAG GGCTTCCTCATCTTCCACTCGTTCGGTGGAGGTACCGGCTCCGGCTTCACCTCGCTGCTG ATGGAGCGTC TCTCCGTGGACTACGGCAAGAAGTCCAAGCTGGAGTTCGCCATCTACCCAGCCCCCCAGG TGTCCACTGC CGTGGTCGAGCCCTACAACTCCATCCTGACCACCCACACCACCCTGGAGCATTCCGACTG CGCCTTCATG GTCGACAACGAGGCTATCTACGACATCTGCCGCCGCAATCTGGACATTGAGCGCCCCACG TACACCAACC TGAACCGTCTGATTGGCCAGATCGTGTCCTCGATTACCGCCTCTCTGCGATTCGATGGTG CCCTTAACGT GGATCTGACTGAGTTCCAGACCAACTTGGTGCCCTACCCACGTATTCACTTCCCTCTGGT GACCTACGCC CCCGTTATCTCCGCCGAGAAGGCCTACCACGAGCAGCTGTCGGTGGCTGAGATCACCAAC GCCTGCTTCG AGCCGGCCAACCAGATGGTCAAGTGCGATCCCCGTCACGGCAAGTACATGGCCTGCTGCA TGCTGTACCG CGGTGATGTTGTGCCCAAGGACGTCAACGCCGCTATTGCCACCATCAAGACCAAGCGCAC CATTCAATTC GTCGACTGGTGCCCCACTGGCTTCAAGGTTagatcggaagagcacacgtctgaactccag tcacAATCAG TCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAACAGGGTAATGA GTCGACAAca agcagaagacggcatacgagatTGGTCAACGATAgtctcgtgggctcggagatgtgtata agagacagCC TGCGTGTGCCGTGTAGGActgtctcttatacacatctgacgctgccgacgaATCACCAGG TGTgtgtaga tctcggtggtcgccgtatcattTTGTCGACTC

>262_ILL_size_1500_TUB

GAGTCGACAAaatgatacggcgaccaccgagatctacactctttccctacacgacgctct tccgatctTT TGAGGGTCGCTACAGAATTCATATTCGTTTTACGTTTGTCAAGCCTCATAGCCGGCAGTT CGAACGTATA CGCTCTCTGAGTCAGACCTCGAAATCGTAGCTCTACACAATTCTGTGAATTTTCCTTGTC GCGTGTGAAA CACTTCCAATAAAAACTCAATATGCGTGAATGTATCTCTATCCATGTTGGTCAGGCTGGT GTCCAGATTG GAAACGCCTGCTGGGAGCTCTACTGCTTGGAGCACGGCATCCAGCCCGATGGCCAGATGC CGTCTGACAA GACCGTGGGCGGAGGTGATGACTCGTTCAACACCTTCTTCAGCGAGACTGGAGCTGGCAA GCACGTGCCC CGCGCCGTGTTTGTGGATCTGGAACCCACTGTGGTCGATGAGGTCCGTACCGGAACCTAC CGTCAGCTGT TCCACCCCGAGCAGCTGATCACTGGTAAGGAGGATGCGGCCAACAACTACGCCCGTGGCC ACTACACCAT CGGCAAGGAGATCGTCGATCTGGTTCTGGACAGGATCCGCAAGCTGGCCGATCAGTGCAC CGGTCTGCAG GGCTTCCTCATCTTCCACTCGTTCGGTGGAGGTACCGGCTCCGGCTTCACCTCGCTGCTG ATGGAGCGTC TCTCCGTGGACTACGGCAAGAAGTCCAAGCTGGAGTTCGCCATCTACCCAGCCCCCCAGG TGTCCACTGC CGTGGTCGAGCCCTACAACTCCATCCTGACCACCCACACCACCCTGGAGCATTCCGACTG CGCCTTCATG GTCGACAACGAGGCTATCTACGACATCTGCCGCCGCAATCTGGACATTGAGCGCCCCACG TACACCAACC TGAACCGTCTGATTGGCCAGATCGTGTCCTCGATTACCGCCTCTCTGCGATTCGATGGTG CCCTTAACGT GGATCTGACTGAGTTCCAGACCAACTTGGTGCCCTACCCACGTATTCACTTCCCTCTGGT GACCTACGCC CCCGTTATCTCCGCCGAGAAGGCCTACCACGAGCAGCTGTCGGTGGCTGAGATCACCAAC GCCTGCTTCG AGCCGGCCAACCAGATGGTCAAGTGCGATCCCCGTCACGGCAAGTACATGGCCTGCTGCA TGCTGTACCG CGGTGATGTTGTGCCCAAGGACGTCAACGCCGCTATTGCCACCATCAAGACCAAGCGCAC CATTCAATTC GTCGACTGGTGCCCCACTGGCTTCAAGGTTGGCATCAACTACCAGCCACCCACCGTGGTG CCTGGAGGTG ATTTGGCCAAGGTGCAGCGTGCCGTGTGCATGTTGTCCAACACCACGGCCATCGCCGAGG CCTGGGCCCG TCTGGACCACAAGTTCGATCTGATGTACGCCAAGCGTGCCagatcggaagagcacacgtc tgaactccag tcacAATCAGTCTCGTatctcgtatgccgtcttctgcttgTTGTCGACTCTAGGGATAAC AGGGTAATGA GTCGACAAcaagcagaagacggcatacgagatTGGTCAACGATAgtctcgtgggctcgga gatgtgtata agagacagTGGTTTCACCTCACGACAAGctgtctcttatacacatctgacgctgccgacg aATCACCAGG TGTgtgtagatctcggtggtcgccgtatcattTTGTCGACTC