Method For Identifying Or Detecting Genomic Rearrangements In A Biological Sample

KOMATSU; Jun ;   et al.

Patent Application Summary

U.S. patent application number 15/845543 was filed with the patent office on 2017-12-18 and published on 2018-04-12 for method for identifying or detecting genomic rearrangements in a biological sample. This patent application is currently assigned to GENOMIC VISION. The applicant listed for this patent is GENOMIC VISION. Invention is credited to Maurizio CEPPI, Emmanuel CONSEILLER, Jun KOMATSU, Pierre WALRAFEN.

Application Number 20180100202 15/845543
Document ID /
Family ID47559567
Filed Date 2017-12-18

United States Patent Application 20180100202
Kind Code A1
KOMATSU; Jun ;   et al. April 12, 2018

METHOD FOR IDENTIFYING OR DETECTING GENOMIC REARRANGEMENTS IN A BIOLOGICAL SAMPLE

Abstract

A method for detection, visualization and/or comparison of polynucleotide sequences of interest using specially designed sets of long and short probes that enhance resolution and simplify visualization and detection. Probe compositions useful for practicing this method and procedures for identifying useful probes and probe combinations. These methods are useful for the detection of genomic rearrangements, especially those associated with various diseases, disorders and conditions including cancer or for assessment of genomic rearrangements associated with therapy. The probe compositions may be used in kits for detection of genetic rearrangements or in companion diagnostic products or kits, such as kits for the diagnosis or assessment of predisposition to cancer such as colorectal cancer.


Inventors: KOMATSU; Jun; (Bagneux, FR) ; WALRAFEN; Pierre; (Montrouge, FR) ; CEPPI; Maurizio; (Issy-Les-Moulineaux, FR) ; CONSEILLER; Emmanuel; (Paris, FR)
Applicant:
Name City State Country Type

GENOMIC VISION

Bagneux

FR
Assignee: GENOMIC VISION
Bagneux
FR

Family ID: 47559567
Appl. No.: 15/845543
Filed: December 18, 2017

Related U.S. Patent Documents

Application Number Filing Date Patent Number
14816397 Aug 3, 2015
15845543
13665440 Oct 31, 2012 9133514
14816397
PCT/IB2012/002423 Oct 30, 2012
13665440
61553889 Oct 31, 2011

Current U.S. Class: 1/1
Current CPC Class: C12Q 2565/102 20130101; C12Q 1/6841 20130101; C12Q 2600/156 20130101; C12Q 1/6827 20130101; C12Q 1/6883 20130101; C12Q 1/6886 20130101; C12Q 1/6881 20130101; C12Q 1/6827 20130101; C12Q 2523/303 20130101; C12Q 2537/143 20130101; C12Q 2565/102 20130101
International Class: C12Q 1/6886 20060101 C12Q001/6886; C12Q 1/6881 20060101 C12Q001/6881; C12Q 1/6841 20060101 C12Q001/6841; C12Q 1/6827 20060101 C12Q001/6827; C12Q 1/6883 20060101 C12Q001/6883

Claims



1-48. (canceled)

49. A kit comprising a set of short probes hybridizing specifically on the MSH2 gene or on the MLH1 gene, and suitable for the detection of rearrangements within said MSH2 gene or MLH1 gene, wherein at least one short probe comprises a label for detection and wherein, for each of detection, (i) the set of short probes comprises a set of probes that taken together hybridize to a continuous stretch of more than 12 kb of the MSH2 gene or of the MLH1 gene; or (ii) the kit further comprises a set of long probes, wherein the long probes bind to sequences outside the MSH2 gene or the MLH1 gene and do not overlap the short probe sequences, wherein the short probe sequence(s) specific of the MSH2 gene are obtained by amplification on human genomic DNA using primer pairs, wherein the primer pairs are selected from the group consisting of the sequences of SEQ ID NO: 21 and SEQ ID NO: 22, the sequences of SEQ ID NO: 23 and SEQ ID NO: 24, the sequences of SEQ ID NO: 25 and SEQ ID NO: 26, the sequences of SEQ ID NO: 27 and SEQ ID NO: 28, the sequences of SEQ ID NO: 29 and SEQ ID NO: 30, the sequences of SEQ ID NO: 31 and SEQ ID NO: 32, the sequences of SEQ ID NO: 33 and SEQ ID NO: 34, the sequences of SEQ ID NO: 35 and SEQ ID NO: 36, the sequences of SEQ ID NO: 37 and SEQ ID NO: 38, the sequences of SEQ ID NO: 39 and SEQ ID NO: 40, the sequences of SEQ ID NO: 41 and SEQ ID NO: 42, the sequences of SEQ ID NO: 43 and SEQ ID NO: 44, the sequences of SEQ ID NO: 45 and SEQ ID NO: 46, the sequences of SEQ ID NO: 47 and SEQ ID NO: 48, the sequences of SEQ ID NO: 49 and SEQ ID NO: 50, the sequences of SEQ ID NO: 51 and SEQ ID NO: 52, the sequences of SEQ ID NO: 53 and SEQ ID NO: 54, the sequences of SEQ ID NO: 55 and SEQ ID NO: 56, the sequences of SEQ ID NO: 57 and SEQ ID NO: 58, the sequences of SEQ ID NO: 59 and SEQ ID NO: 60, the sequences of SEQ ID NO: 163 and SEQ ID NO: 164, the sequences of SEQ ID NO: 165 and SEQ ID NO: 166, the sequences of SEQ ID NO: 167 and SEQ ID NO: 168, the sequences of SEQ ID NO: 169 and SEQ ID NO: 170, the sequences of SEQ ID NO: 171 and SEQ ID NO: 172, the sequences of SEQ ID NO: 185 and SEQ ID NO: 186, the sequences of SEQ ID NO: 187 and SEQ ID NO: 188, the sequences of SEQ ID NO: 189 and SEQ ID NO: 190, the sequences of SEQ ID NO: 191 and SEQ ID NO: 192, the sequences of SEQ ID NO: 193 and SEQ ID NO: 194, the sequences of SEQ ID NO: 195 and SEQ ID NO: 196, the sequences of SEQ ID NO: 197 and SEQ ID NO: 198, the sequences of SEQ ID NO: 199 and SEQ ID NO: 200, and the sequences of SEQ ID NO: 201 and SEQ ID NO: 202; and wherein the short probe sequence(s) specific of the MLH1 gene are obtained by amplification on human genomic DNA using primer pairs, wherein the primer pairs are selected from the group consisting of the sequences of SEQ ID NO: 95 and SEQ ID NO: 96, the sequences of SEQ ID NO: 97 and SEQ ID NO: 98, the sequences of SEQ ID NO: 99 and SEQ ID NO: 100, the sequences of SEQ ID NO: 101 and SEQ ID NO: 102, the sequences of SEQ ID NO: 103 and SEQ ID NO: 104, the sequences of SEQ ID NO: 105 and SEQ ID NO: 106, the sequences of SEQ ID NO: 107 and SEQ ID NO: 108, the sequences of SEQ ID NO: 109 and SEQ ID NO: 110, the sequences of SEQ ID NO: 111 and SEQ ID NO: 112, the sequences of SEQ ID NO: 113 and SEQ ID NO: 114, the sequences of SEQ ID NO: 115 and SEQ ID NO: 116, the sequences of SEQ ID NO: 117 and SEQ ID NO: 118, the sequences of SEQ ID NO: 119 and SEQ ID NO: 120, the sequences of SEQ ID NO: 121 and SEQ ID NO: 122, the sequences of SEQ ID NO: 227 and SEQ ID NO: 228, the sequences of SEQ ID NO: 229 and SEQ ID NO: 230, the sequences of SEQ ID NO: 231 and SEQ ID NO: 232, the sequences of SEQ ID NO: 233 and SEQ ID NO: 234, the sequences of SEQ ID NO: 235 and SEQ ID NO: 236, the sequences of SEQ ID NO: 237 and SEQ ID NO: 238, the sequences of SEQ ID NO: 239 and SEQ ID NO: 240, the sequences of SEQ ID NO: 241 and SEQ ID NO: 242, the sequences of SEQ ID NO: 243 and SEQ ID NO: 244, the sequences of SEQ ID NO: 245 and SEQ ID NO: 246, and the sequences of SEQ ID NO: 247 and SEQ ID NO: 248; and wherein the long probe sequence(s) specific of the MSH2 gene are obtained by amplification on human genomic DNA using primer pairs, wherein the primer pairs are selected from the group consisting of the sequences of SEQ ID NO: 61 and SEQ ID NO: 62, the sequences of SEQ ID NO: 63 and SEQ ID NO: 64, the sequences of SEQ ID NO: 65 and SEQ ID NO: 66, the sequences of SEQ ID NO: 67 and SEQ ID NO: 68, the sequences of SEQ ID NO: 69 and SEQ ID NO: 70, the sequences of SEQ ID NO: 71 and SEQ ID NO: 72, the sequences of SEQ ID NO: 73 and SEQ ID NO: 74, and the sequences of SEQ ID NO: 75 and SEQ ID NO: 76; and wherein the long probe sequence(s) specific of the MLH1 gene are obtained by amplification on human genomic DNA using primer pairs, wherein the primer pairs are selected from the group consisting of the sequences of SEQ ID NO: 123 and SEQ ID NO: 124, the sequences of SEQ ID NO: 125 and SEQ ID NO: 126, the sequences of SEQ ID NO: 127 and SEQ ID NO: 128, the sequences of SEQ ID NO: 129 and SEQ ID NO: 130, the sequences of SEQ ID NO: 131 and SEQ ID NO: 132, the sequences of SEQ ID NO: 133 and SEQ ID NO: 134, the sequences of SEQ ID NO: 135 and SEQ ID NO: 136, and the sequences of SEQ ID NO: 137 and SEQ ID NO: 138.

50. The kit according to claim 49 for the detection of genomic rearrangements associated with a condition selected from the group consisting of: colorectal cancer or genetic predisposition to colorectal cancer, breast cancer or genetic predisposition to breast cancer, ovarian cancer or genetic predisposition to ovarian cancer, and lung cancer or genetic predisposition to lung cancer.

51. The kit according to claim 49, wherein the kit comprises a set of long probes, wherein the long probes bind to sequences outside the MSH2 gene or the MLH1 gene and do not overlap the short probe sequences.

52. The kit according to claim 49, wherein different components of the probe sets are tagged with different labels for detection.

53. The kit according to claim 49, wherein the set of short probes comprises a set of probes that taken together hybridize to a continuous stretch of more than 12 kb of the MSH2 gene or of the MLH1 gene and at least one short probe comprises a label for detection.

54. The kit according to claim 49, wherein the short probe sequences hybridize specifically on the MSH2 gene.

55. The kit according to claim 49, wherein the short probe sequences hybridize specifically on the MLH1 gene.

56. The kit according to claim 49, wherein the long probe sequences hybridize specifically on the MSH2 gene.

57. The kit according to claim 49, wherein the long probe sequences hybridize specifically on the MLH1 gene.
Description



CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] The present application is a continuation of U.S. Ser. No. 14/816,397, filed Aug. 3, 2015, which is a continuation of U.S. Ser. No. 13/665,440, filed Oct. 31, 2012, which claims priority to U.S. Provisional Application No. 61/553,889, filed Oct. 31, 2011, the entire contents of which are incorporated herein by reference. On Oct. 30, 2012, International Application PCT/IB2012/002423 was also filed with the same title, the entire contents of which are incorporated herein by reference.

BACKGROUND OF THE INVENTION

Field of the Invention

[0002] The invention relates to a high-resolution, precise method for detecting genomic rearrangements in vitro using specially designed combinations of polynucleotide probes. The invention concerns accurate methods of detection and diagnosis of conditions, disorders and diseases associated with rearrangement of genomic DNA.

Description of the Related Art

The Multigenic Paradigm of Human Diseases

[0003] Advances in genetic analysis of human diseases have provided better insights into the molecular mechanisms contributing to disease initiation and progression. Earlier work linked particular diseases, through association and/or linkage disequilibrium, to single base mutations in somatic genetic sequences or to particular single nucleotide polymorphisms ("SNPs") in genomic DNA. Newer technologies have provided evidence that larger genetic alterations and rearrangements are associated with, or can constitute major causes of, diseases, disorders or conditions having a genetic origin or basis. Disease associations have now moved from a monogenic to a multigenic paradigm in which a disease's origins and progression are mainly linked to more than one genetic mutation or origin. While these new insights provide better avenues for disease detection and treatment, they also highlight the need for combinatorial genetic analysis that goes beyond detection of single mutational events or SNPs by assessing disease associations with larger genomic rearrangements. Such combinatorial genetic analysis would not only provide a better, more precise and accurate diagnosis of a particular condition, disorder, disease or pathology, but would also help establish more appropriate medical surveillance and more accurate therapeutic decisions and interventions, and help assess the efficacy of such therapies and interventions.

Multigenic Causes of Genetic Disease

[0004] Genetic disorders manifesting the same or similar clinical signs and consequences can arise from single and exclusive, or combined, mutations in various genes. Such mutations can be single base alterations and/or large genetic rearrangements. A few examples of such genetic disorders are Fragile X syndrome (mutations and expansions in the FMR1 gene), Ataxia Telangiectasia (single base pair mutations in either intronic or exonic sequences as well as deletions and translocations of the ATM gene), Seckel syndrome (mutations as well as large rearrangements in SCKL1, SCKL2, SCKL3, PCTN and ATR), autism (mutations as well as large rearrangements in GLO1, MTF1 and SLC11A3), Spinal Muscular Atrophy (mutations, deletions, transconversions as well as cis-duplications involving the SMN1 and SMN2 genes) and myotonic dystrophy (trinucleotide/tetranucleotide expansions in DM1 and DM2).

Multigenic Causes of Cancer Predisposition

[0005] In the case of cancer predisposition, there are several examples of familial cancer predisposition syndromes for which one can nominate several causative genes in which single base alterations and/or large rearrangements have been identified.

[0006] Breast and Ovary Cancer. Causative genes: BRCA1, BRCA2, ATM . . . mutation type: higher proportion of point mutations identified so far.

[0007] Hereditary nonpolyposis colorectal cancer (Lynch syndrome). Causative genes: MSH2, MLH1, MSH6, EPCAM, . . . mutation type: equivalent proportion of point mutations has also been identified.

Multigenic Causes of Cancer Progression

[0008] Cancer progression is surely the human disease domain in which the monogenic causative hypothesis has been definitively ruled out for several years. First, the disease's initiation is strictly dependent on two molecular events (immortalizing and transforming) due to genetic alterations in at least two independent genes classified as either oncogenes or tumor suppressor genes. Second, the disease's progression is linked to additional genetic alterations independent from the causative ones. Not only do these additional alterations play a role in cancer progression, they were also demonstrated to be the basis for the appearance of resistance to therapy during treatment. Strikingly, in the list of cancer related genes, while extremely rare examples are subject only to discrete single base mutations (e.g., KRas or BRaf), the large majority are subject either to large rearrangements only (e.g., HER2, ALK . . . ) or to both single base mutations and large rearrangements (p53, c-myc, c-Met, EGFR . . . ).

[0009] The identification and characterization of multigenic conditions, disorders and diseases, including cancer, cardiovascular disease, diabetes and other heritable genetic conditions, has been made difficult in part by the imprecision of existing methods of molecular diagnosis. Molecular Combing is probably the sole approach allowing detection of all types of large genetic rearrangements (deletions, amplifications, expansions, inversions, translocations . . . ) even in a complex and heterogeneous population (such as tumors).

[0010] High resolution barcodes allowing multiplex analysis of patients could help diagnosis at different levels, such as patient stratification/classification and/or prognosis.

Multiplex High Resolution Barcodes for Identifying the Right Genetic Alterations as a Key Driver for Therapeutic Intervention

The Example of Myotonic Dystrophy

[0011] Myotonic Dystrophy (DM1) and Myotonic Dystrophy 2 (DM2) are two muscular dystrophies characterized by trinucleotide/tetranucleotide expansions in two different genes. While severe forms of DM1 can be clinically differentiated from DM2, milder DM1 forms display clinical signs extremely similar to those of DM2. There is currently no cure for or treatment specific to myotonic dystrophy. However, DM1 patients exhibit complications of the disease (heart problems, cataracts . . . ) not existing in DM2 that can be treated but not cured. Differentiating DM1 and DM2 by the use of a multiplex assay of high resolution barcodes could thus help prevent and treat secondary effects.

The Example of Hereditary Breast and Ovary Cancer

[0012] In certain countries (U.S.), detecting constitutional alterations in BRCA1/2 leads to therapeutic intervention (surgery/reconstitution). Thus, there is a clear need for an accurate diagnostic covering all the potentially involved genes. Such a test could be made on the basis of a multiplex assay of high resolution barcodes comprising large chromosomal regions around genes known to be involved in this syndrome: BRCA1, BRCA2, ATM, ATR . . .

DNA Damage and Response Inhibitors Example

[0013] Synthetic lethality has become an established basis for therapeutic decisions to include cancer patients in specific protocols/regimens. One of the first examples was the demonstration that breast cancer patients with BRCA deficiency exhibit a higher sensitivity to PARP inhibitors, a new category of drugs acting on the DNA Damage and Response pathway. More recently, this was extended to other types of inhibitors in this category, such as ATM inhibitors, but also to more traditional anti-cancer drugs including all types of DNA polymerase and replication inhibitors.

[0014] Not only has this concept been extended to other inhibitors, but it was also demonstrated that it could be extended to other types of cancers such as lung cancer and metastatic melanoma.

[0015] Here, a multiplex high resolution barcode will allow detection of genetic alterations in genes involved in DNA damage and response, which could help predict sensitivity to this class of inhibitors. A list of such genes could include BRCA1, BRCA2, ATM, ATR, MSH2, MLH1, MSH6, EPCAM . . .

The Lung Cancer Example

[0016] Numerous alterations involved in lung cancer could be multiplexed for a better patient classification, such as:

[0017] LOH/Deletion (P53, STK11, LKB1, BRG1, KLF6);

[0018] Amplification (FGFR1, MET, EGFR, HER2 . . . );

[0019] Translocation (ALK).

All these genetic alterations are associated with therapeutic treatments:

[0020] P53: Nutlin (low doses of Actinomycin D produce similar effects)

[0021] FGFR1: Masitinib, PD173074, SU5402, TK1258, AZD4547 . . .

[0022] MET: GSK1363089, ARQ197, SGX523, XL184 . . .

[0023] EGFR: Tarceva, Erbitux, Vectibix . . .

[0024] HER2: Herceptin, Lapatinib . . .

[0025] ALK: Crizotinib

[0026] As at least 30% of NSCLCs were demonstrated to be dependent on at least one of these mutations, defining the genetic profile of the tumor could help drive therapeutic options. This could be made possible by designing multiplex assays combining high resolution barcodes covering these major genetic loci.

Localization of (Genetic) Sequences of Interest

[0027] The genetic sequence is the most fundamental information needed to synthesize a functional protein. Alteration of the genetic sequence sometimes results in loss of functional protein synthesis. In addition to sequence alteration, loss or gain of genetic sequence (copy number variation, CNV) can also be problematic for the homeostasis of cellular activity. For example, loss of a (functional) anti-tumor protein (p53) or gain of a proto-oncogene (c-myc) results in a cancer-prone cell. When such a mutation occurs (or exists) in a germ cell, it is present in every cell of the resulting individual, who is then either a carrier of or a patient with a genetic disease, or has a predisposition to cancer. A germline mutation can be heritable. CNV is becoming more and more important to understand in the field of genetics (ref 1). However, copy number count alone is not always sufficient and it is often critical to establish the actual location of sequence elements. This is strikingly the case for, e.g., balanced translocations. DNA sequencing and CNV detection methods such as array-based comparative genomic hybridization (aCGH) and quantitative PCR generally cannot detect these balanced mutations because they assess only whether the sequence and the copy number are correct, not where the sequence is located. FISH and its extended forms such as fiber-FISH or molecular combing can address these balanced mutations with different resolutions and precisions depending on the method.

Resolution and Precision

[0028] BAC/PAC/cosmid probes on targeted regions have been used successfully to detect large (a few kb to tens of kb) genomic rearrangements (ref 2). In these approaches, the minimum size of detectable events (e.g., the size of the deleted or amplified sequence), hereafter designated as the "resolution" of such an assay, is limited by the large standard deviation involved in measuring probes or gaps of tens of kilobases. Indeed, in such assays the standard deviation of measurements increases with the length of the measured element. For example, a 40 kb-probe is measured with a standard deviation of ~5 kb. Thus, if 16 measurements of a given probe are made on a slide, the precision on the size of the probe, obtained as the mean value of the measurements, is on the order of 2.5 kb (considering that the distribution is Gaussian and that the precision is the half-width of the confidence interval, i.e. $2\,\mathrm{sd}/\sqrt{n}$, where sd = standard deviation and n = number of measurements). For a 10 kb-probe, where the standard deviation is ~2 kb, the precision would be ~1 kb. This illustrates the fact that shorter probes allow for better (lower) resolution.
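
The arithmetic in the paragraph above can be checked with a short sketch. The function name `mean_precision_kb` is hypothetical and introduced only for illustration; the formula is the half-width 2·sd/√n stated in the text, and the inputs are the two worked cases given there (sd of ~5 kb and ~2 kb, 16 measurements each).

```python
import math


def mean_precision_kb(sd_kb: float, n: int) -> float:
    """Half-width of the confidence interval on a mean size, 2*sd/sqrt(n).

    Assumes Gaussian-distributed measurements, as the text does.
    The function name is illustrative, not taken from the source.
    """
    if n <= 0:
        raise ValueError("n must be a positive number of measurements")
    return 2.0 * sd_kb / math.sqrt(n)


# 40 kb probe, sd ~5 kb, 16 measurements
print(mean_precision_kb(5.0, 16))  # 2.5
# 10 kb probe, sd ~2 kb, 16 measurements
print(mean_precision_kb(2.0, 16))  # 1.0
```

Because the precision scales with 1/√n, halving it for a given probe requires four times as many measurements, whereas a shorter probe reaches the same precision from the same number of signals.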

[0029] Besides, the location of such an event (the position of the extremities of the event) may be defined with a precision (hereafter the location precision) limited by the size of the probe or gap within which it occurs: e.g. if a 40 kb probe is estimated to measure 39 kb in a sample, one can conclude that a 1 kb deletion occurred somewhere within the probe, with no further precision--thus, somewhere in a 40 kb genomic region. If the same 1 kb deletion had occurred within a 10 kb probe, the location of that deletion would be known with a better precision, as the range would be reduced to a 10 kb genomic region. Therefore, the smaller the probes and gaps, the better the location precision.

[0030] There are limits to small probes: (i) below a certain size, they become difficult to detect; (ii) they involve more complex color schemes (as there are relatively more probes); (iii) there are more distinct probes to cover a given region, and the experiments are therefore more expensive and time-consuming; (iv) most importantly, fast and reliable identification of probes, whether by a human operator or a piece of software, is easier with longer probes, as they are more readily distinguished from background. Indeed, background is mainly constituted of roughly circular fluorescent spots. When probes are large enough, the shape of these spots allows one to easily distinguish them from probes. However, when probes are small enough, the spots become difficult to distinguish from them.

[0031] In operating conditions according to the invention, probes shorter than ~3 kb are detected with a diminished efficiency. Within the 3-10 kb range, the standard deviation of measurements varies little, and there is therefore little benefit in resolution with the shorter probes within this range. Therefore, this range is usually considered to be a good compromise for probe size. However, in cases where probes are close enough (less than 10 kb gaps), smaller probes (within the 500-3000 bp range) are still useful, as they will be detected in at least a fraction of signals and the presence of the corresponding sequences may therefore be established with certainty. It was also found that detection of isolated probes longer than 12 kb (preferably longer than 14 kb) is more reliable, whether for a human operator or for automatic detection software.
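
As a rough illustration of the size ranges discussed above, the following sketch sorts a probe length into the categories named in the text. The function name and the category labels are hypothetical, chosen only for this example; the thresholds (500 bp, 3 kb, 10 kb, 12 kb) are the approximate values quoted in the paragraph, and the text does not characterize probes between 10 and 12 kb or below 500 bp.

```python
def probe_size_class(length_bp: int) -> str:
    """Map a probe length to the size ranges described in the text.

    Labels are illustrative; thresholds are the approximate values
    quoted in the paragraph above.
    """
    if length_bp <= 0:
        raise ValueError("probe length must be positive")
    if length_bp < 500:
        return "below described range"
    if length_bp < 3000:
        # Detected with diminished efficiency; still useful when
        # neighbouring probes are less than 10 kb away.
        return "small"
    if length_bp <= 10000:
        # Range described as a good compromise for probe size.
        return "compromise"
    if length_bp <= 12000:
        return "not characterized"
    # Isolated probes longer than 12 kb are described as more reliably detected.
    return "long"


for length in (800, 6000, 14000):
    print(length, probe_size_class(length))
```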

Exclusion of Repeats

[0032] Eukaryotic genomic DNA contains various repetitive sequences, i.e., sequences that appear more than once (and more than statistically predicted based on their length and base content) in a normal haploid genome. Among these, some appear with very high frequency (tens of thousands to millions of copies). In human genomic DNA, the most abundant of these is the Alu family, which has ~1,000,000 copies constituting ~10% of the genome. In any hybridization procedure involving human genomic DNA, it is expected that probes carrying such repeats would hybridize on numerous targets, generating non-specific signal from regions throughout the genome. Other types of repetitive sequences exist, with lower frequency and often more specific localization. The number of copies and repeat sequence length may vary widely, as may the degree of homology. Beta-satellite sequences, for example, are present in multiple copies (hundreds to thousands), usually as tandem repeat arrays comprising hundreds of copies of the same 50-100 bp long sequence, specifically localized in a limited number of loci. Strategies to get rid of the non-specific signals depend on the type of procedure and probe. Schematically, when probes are very short sequences of DNA (oligonucleotides, typically less than 100 bp), as in aCGH procedures, the sequence of the oligonucleotides is chosen to be free of repetitive sequences, by comparison with repetitive sequences found in databases. This strategy is only practical for very short probes, as short sequences free of repetitive sequences are relatively abundant, but impractical for longer probes, as long stretches completely devoid of repetitive elements are rare (although this has been adapted to longer FISH probes, in an approach that suffers multiple drawbacks, see below). Besides, even for short probes, it constrains the design of probes heavily, and some genomic regions rich in repetitive sequences have a lower density of coverage (and thus lower resolution of events) due to this constraint.

[0033] When probes are longer (typically PCR products or cloned DNA inserts--1 to 150 kb), in Southern Blot or in FISH procedures, non-labeled competitor DNA enriched in repetitive elements such as Alu repeats (usually Cot-1 DNA) is added in large excess along with the labeled probe. Competition from the unlabeled DNA on the repetitive sequences minimizes hybridization of the labeled probes to those sequences. This strategy is expensive and, since the competitor DNA is not purely made of repetitive sequences, competition also occurs on the unique sequences for which the probes were designed, thus limiting the amount of competitor DNA that may be used. Therefore, the efficiency of this approach is limited.

[0034] An alternative approach for longer probes has been proposed by Knoll and collaborators (U.S. Pat. No. 7,014,997), resembling the strategy usually adopted for oligonucleotides: probes are chosen within sequence intervals devoid of repetitive elements. This strategy is based on bioinformatics analysis of the regions of interest and exclusion of known repetitive sequences by comparison with sequence databases. However, this approach has several limitations: prior knowledge of the repetitive sequences is required, which can be a problem, e.g., in species where such knowledge is unavailable. More importantly, intervals longer than 2 kb devoid of repetitive sequences appear only once in 20-30 kb on average and are unevenly distributed, so the design of probes would be highly constrained, impairing the possibility to design a high-resolution code. This would prove especially difficult in repeat-rich regions, and/or regions where pseudogenes are located next to homologous genes of interest--such low-copy repetitive sequences being also excluded with the strategy from Knoll and collaborators (ref. 3). Since regions targeted in rearrangement tests, e.g., for diagnostics purposes, often display these features, this approach is not suitable for the design of high-resolution barcodes and especially not if such a code is to be used for diagnostics purposes. Distinctions between this approach and the invention are disclosed in more detail below.
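
To make the constraint concrete, the interval-selection step of the prior-art strategy described above can be sketched as follows. This is only an illustration of the general idea (keeping stretches between annotated repeats that exceed a minimum length), not the procedure of the cited patent; the function name, the half-open coordinate convention and the example coordinates are all assumptions made for this sketch.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # half-open (start, end) in bp; a convention assumed here


def repeat_free_intervals(region_start: int, region_end: int,
                          repeats: List[Interval],
                          min_len: int = 2000) -> List[Interval]:
    """Return stretches of the region not covered by any annotated repeat
    and at least min_len bp long."""
    free = []
    cursor = region_start
    for start, end in sorted(repeats):
        # Clip each repeat to the region; skip repeats lying outside it.
        start, end = max(start, region_start), min(end, region_end)
        if end <= start:
            continue
        if start - cursor >= min_len:
            free.append((cursor, start))
        cursor = max(cursor, end)
    if region_end - cursor >= min_len:
        free.append((cursor, region_end))
    return free


# Made-up repeat annotation for a 30 kb region (illustrative coordinates only).
repeats = [(1000, 1300), (1500, 1800), (4500, 4800), (5000, 12000), (12500, 30000)]
print(repeat_free_intervals(0, 30000, repeats))  # [(1800, 4500)]
```

In this made-up region only one interval of 2 kb or more survives, which is the kind of sparse and uneven coverage the paragraph describes.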

BRIEF SUMMARY OF THE INVENTION

[0035] The present invention concerns the field of in vitro diagnosis and detection of genetic rearrangements and relates to a method to identify or detect, in a biological sample to be tested, genetic rearrangements which are already known or which are new and which provide markers of, for example, diseases such as cancers or metabolic or foetal genetic diseases. The invention is characterized by the use of compositions containing purified or synthesized nucleic acid molecules (polynucleotides) having nucleotide sequences selected as short sequences with a length of less than 10 kb, associated in the said method with other, different nucleic acid molecules (polynucleotides) having nucleotide sequences that do not overlap with the former and have a size longer than 12 kb. The selected nucleotide sequences (polynucleotides) used as probes have part of their naturally occurring, frequently repeated sequences deleted. The present invention also concerns improvements to the design of sets of probe sequences for the detection of genetic rearrangements by hybridization, as with fiber-FISH-like technologies such as Molecular Combing. The improvements described herein allow for high precision/high-resolution detection of rearrangements in time- and cost-efficient assays. This invention also relates to the use of probe sequences for diagnostics applications and companion diagnostics tests, to a method of detecting the presence or absence of alterations in sequences and to a kit for the above uses. This is illustrated hereinafter with sets of nucleotide sequences corresponding to parts of at least two genes, MSH2 and MLH1, or to the regions of MSH2 and MLH1, whose mutations increase the risk of occurrence of human colorectal cancer.

[0036] The invention is related to the sets of polynucleotides or probes labeled or not which are specific of said genes. Presently, the detection of genetic rearrangements using current technologies is often insufficiently reliable for diagnostics use. Unlike most technologies used to detect genetic alterations, which suffer strong intrinsic limitations towards some types of rearrangements, direct technologies such as FISH or Fiber-FISH can intrinsically detect any type of rearrangements. Their use is mainly limited by their resolution. Molecular Combing, on the other hand, may reach sufficient resolution, but probe designs currently used fail to allow cost- and time-efficient high resolution analysis of rearrangements.

[0037] These improvements involve the combination, within the same sets of probes, of--typically shorter--probes designed to optimize the sensitive detection and precise measurement of rearrangements and--typically longer--probes to allow for fast and reliable detection of signals of interest when analyzing results. Alternative designs where the longer probes are replaced with a combination of shorter probes having equivalent functions and effects are also disclosed.

[0038] Specific aspects of the invention, based on the concept of combining small probes for resolution and long probes for ease of detection on one or more genomic region(s) of interest, are disclosed in more detail below.

[0039] The invention thus concerns a method for detecting a mutated or rearranged genomic polynucleotide (target) sequence comprising:

[0040] (a1) hybridizing a target genomic polynucleotide comprising one or more genomic region(s) of interest, where mutations or rearrangements are sought, to a set of short probes that bind to each region of interest without long gaps between the portions of the target sequence bound by the set of short probes, where on each genomic region a subset of short probes are selected so that when taken together they form a long contiguous stretch inside or outside the region of interest, and wherein the probes may optionally have frequent repetitive sequences removed and thus more generally are optionally devoid of such repetitive sequences; or

[0041] (a2) hybridizing a target genomic polynucleotide comprising one or more genomic region(s) of interest, where mutations or rearrangements are sought, to a set of short probes that bind to each region of interest without long gaps between the portions of the target sequence bound by the set of short probes and to one or more long (docking) probe(s) that bind to sequences near but outside of the region(s) of interest; wherein the sequence(s) of the long probe(s) does not overlap that of the short probes and wherein the short and/or long probes may optionally have frequent repetitive sequences removed and thus more generally are optionally devoid of such repetitive sequences;

[0042] (b) detecting the locations of hybridized probes on the genomic region(s) of interest; optionally,

[0043] (c) comparing the location of the hybridized probes on the target genomic polynucleotide sequence with one or more motifs based on the hybridization of said probes to a reference, control, normal, not mutated, or not rearranged genomic polynucleotide sequence; and optionally,

[0044] (d) correlating the presence of a mutated or rearranged genomic polynucleotide with a specific phenotype, disease, disorder, or condition.
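The comparison in step (c) can be illustrated with a minimal sketch. It assumes, purely for illustration, that each motif is recorded as an ordered list of (label color, measured length in kb) segments and that a fixed measurement tolerance applies; the function name `compare_to_reference`, the data layout and the 2 kb tolerance are hypothetical and are not taken from the disclosure.

```python
from typing import List, Tuple

# Hypothetical representation: (label color, measured length in kb)
Segment = Tuple[str, float]


def compare_to_reference(
    observed: List[Segment],
    reference: List[Segment],
    tolerance_kb: float = 2.0,  # illustrative tolerance, not a disclosed value
) -> List[str]:
    """List the differences between an observed motif and the reference motif."""
    differences: List[str] = []
    # A change in the order or number of colored segments is reported as is.
    if [color for color, _ in observed] != [color for color, _ in reference]:
        differences.append("color order differs from reference")
        return differences
    # Otherwise compare the measured length of each segment with the reference.
    for index, ((color, obs_len), (_, ref_len)) in enumerate(zip(observed, reference)):
        delta = obs_len - ref_len
        if abs(delta) > tolerance_kb:
            kind = "shorter" if delta < 0 else "longer"
            differences.append(
                f"segment {index} ({color}) is {abs(delta):.1f} kb {kind} than reference"
            )
    return differences


reference_motif = [("green", 20.0), ("red", 8.0), ("green", 6.0)]
observed_motif = [("green", 20.5), ("red", 3.0), ("green", 6.2)]
print(compare_to_reference(observed_motif, reference_motif))
```

In this sketch a shortened segment would be a candidate deletion and a lengthened one a candidate insertion or duplication; the correlation with a phenotype in step (d) is outside its scope.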

[0045] The mutated or rearranged genomic polynucleotide sequence can be obtained from a subject who has cancer or who is suspected of having cancer, for example, from a subject who has colorectal cancer or who is suspected of having colorectal cancer. In such a case, the short and long probes identify mutations or genomic rearrangements associated with colorectal cancer and a control or reference sample would not contain these mutations or rearrangements. The presence or risk of developing colorectal cancer is assessed by comparing a target genomic polynucleotide sequence with the reference and determining whether a mutation or rearrangement associated with colorectal cancer is present. This method can be practiced with specific probes corresponding to or derived from Probe sets 1, 2, 3 and 4. For colorectal cancer, a genomic region of interest can be selected from genes associated with this disease, such as MSH2, MLH1, MSH6, PMS2 or EPCAM.

[0046] Similarly, the method may be applied to samples obtained from subjects having or at risk of developing other kinds of cancer, such as breast cancer, ovarian cancer, or lung cancer. The method may also be applied to samples obtained from subjects having or at risk of other kinds of diseases, disorders, or conditions, including cardiovascular disease, diabetes, and neuromuscular disorders such as myotonic dystrophy or spinal muscular atrophy, or to samples obtained from a subject who has, is suspected of having, or is suspected of being a carrier for a genetic or hereditary disease, disorder or condition, including known or unknown foetal genetic alterations. The sample can be obtained from a subject having a multigenic genetic or hereditary disease, disorder or condition or a genetic or hereditary disease, disorder or condition associated with rearrangement of genomic DNA.

[0047] In some aspects of the invention, the sample will be obtained from a subject undergoing treatment for a disease, disorder or condition associated with a genomic or somatic genetic rearrangement and the results obtained are compared to results obtained at other time points before, during or after the termination of treatment. A companion test for evaluating the efficiency of a therapeutic drug on the mutated or rearranged nucleotide sequences of the gene or the region of the gene of interest can be performed using the short and long probes according to the invention.

[0048] Preferably, in the method described above, the hybridizing with the short and long probes in step a) will be performed simultaneously.

[0049] Preferably, the short probes range in length from 0.5 kb to 10 kb and the maximum size of the gaps between the short probes when they are bound to the target is 15 kb, preferably 12 kb and more preferably 10 kb.

[0050] The number of short probes employed in the method described above can range from 1, 2, 3 to 10, 15 or more.

[0051] The maximum size for the long probes is 150 kb and these probes preferably range from 12 kb to 40 kb in length. Preferably, in order to have "long probe(s) that bind to sequences near but outside of the region of interest", the distance between the long probes and the region of interest is no greater than 150 kb, more preferably no greater than 75 kb and even more preferably no greater than 25 kb. The minimum size for a genomic region to be tested or targeted is 50 kb. The minimum number of regions of interest is one for a singleplex test and two or more for a multiplex test. Examples of combinations of short and/or long probes include at least one short (less than 10 kb) sequence and at least one non-overlapping long sequence (more than 15 kb), or at least one group of at least two short sequences, less than 10 kb each, which total group length is longer than 14 kb and less than 150 kb, hybridizing continuously on the mutated or rearranged polynucleotide sequence. The short probes can comprise a set of contiguous probes that span a stretch of the genomic polynucleotide sequences inside or outside the region of interest that is at least 15 kb.
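The two example combinations given above can be expressed as a simple check. The following is only a sketch under stated assumptions: probes are represented as hypothetical (start, end) coordinates in bp on the target, and "hybridizing continuously" is simplified to mean that each short probe starts exactly where the previous one ends. The function names are illustrative and not part of the disclosure.

```python
from typing import List, Tuple

# Hypothetical representation: (start, end) in bp, end exclusive
Probe = Tuple[int, int]


def length(probe: Probe) -> int:
    return probe[1] - probe[0]


def overlaps(a: Probe, b: Probe) -> bool:
    return a[0] < b[1] and b[0] < a[1]


def has_short_plus_long(probes: List[Probe],
                        short_max: int = 10_000,
                        long_min: int = 15_000) -> bool:
    """At least one short probe and one non-overlapping long probe."""
    shorts = [p for p in probes if length(p) < short_max]
    longs = [p for p in probes if length(p) > long_min]
    return any(not overlaps(s, l) for s in shorts for l in longs)


def has_contiguous_short_group(probes: List[Probe],
                               short_max: int = 10_000,
                               total_min: int = 14_000,
                               total_max: int = 150_000) -> bool:
    """At least two adjacent short probes whose total length is in range."""
    shorts = sorted(p for p in probes if length(p) < short_max)
    count, total, prev_end = 0, 0, None
    for start, end in shorts:
        if prev_end is not None and start == prev_end:
            # Assumption: contiguous means exactly adjacent coordinates.
            count += 1
            total += end - start
        else:
            count, total = 1, end - start
        prev_end = end
        if count >= 2 and total_min < total < total_max:
            return True
    return False


docking_design = [(0, 5_000), (20_000, 40_000)]
stretch_design = [(0, 8_000), (8_000, 15_000), (40_000, 45_000)]
print(has_short_plus_long(docking_design), has_contiguous_short_group(stretch_design))
```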

[0052] The long probes may have repetitive DNA sequences excluded. These repetitive sequences to be excluded would ordinarily appear more than once and more often than statistically predicted based on their length and base content, for example, repetitive sequences between 50 and 400 bp can be excluded, though shorter or longer repetitive sequences that decrease sensitivity or specificity of the method can be identified and excluded. An example of such a sequence is the repetitive Alu family DNA sequences.

[0053] According to an embodiment of the invention, in order for the probes, either short probes or long probes, to have repetitive sequences excluded, these probes are designed to hybridize in regions of the genome which are free of such repetitive sequences, i.e. which have less than 10%, preferably less than 2%, of the selected type(s) of repetitive sequences to be excluded.
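As an illustration of the 10% and 2% criteria, the fraction of a candidate probe covered by annotated repeats can be computed as sketched below. The sketch assumes that repeat annotations are available as hypothetical (start, end) intervals in bp, for example derived from a repeat-annotation tool; the function name `repeat_fraction` and the example coordinates are illustrative only.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # (start, end) in bp, end exclusive


def repeat_fraction(probe: Interval, repeats: List[Interval]) -> float:
    """Fraction of the probe covered by repeats, counting overlaps once."""
    start, end = probe
    # Keep only repeats that intersect the probe, clipped to its boundaries.
    clipped = sorted(
        (max(s, start), min(e, end)) for s, e in repeats if s < end and e > start
    )
    covered = 0
    cursor = start  # rightmost position already counted
    for s, e in clipped:
        if e > cursor:
            covered += e - max(s, cursor)
            cursor = e
    return covered / (end - start)


candidate = (1_000, 6_000)
annotated_repeats = [(900, 1_100), (2_000, 2_300), (2_200, 2_400), (7_000, 7_300)]
fraction = repeat_fraction(candidate, annotated_repeats)
print(f"{fraction:.1%} repeats; below 10%: {fraction < 0.10}; below 2%: {fraction < 0.02}")
```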

[0054] In the method described above, the short and long probes are preferably fluorescently tagged and different components of the probe sets may be tagged with different labels, such as labels with different colors. Tagging provides one means to identify motifs or submotifs characteristic of a mutated or rearranged sequence.

[0055] Compositions or kits comprising a set of short probes or a combination of short and long probes as described herein and optionally one or more components for binding said probes to a polynucleotide, for performing molecular combing, and/or for detecting whether hybridization has occurred are also contemplated. For example, a composition containing the short and long probe(s) described above, wherein at least two of said probe sequences detect a genetic rearrangement by using Molecular Combing, said composition comprising either at least one short (<12 kb) sequence and at least one non-overlapping long sequence (>14 kb), or at least one group of at least two short sequences, less than 10 kb each, which total length is longer than 14 kb and less than 150 kb, hybridizing contiguously on the genetic target. The short probe(s) in such a composition may preferably range from 0.5 kb to 12 kb and the long probe(s) range from 14 kb to 40 kb. Frequent repetitive sequences described above may be removed from the probes. Examples of probe sequences are those that hybridize specifically on the MSH2 gene or in the region of the MSH2 gene or on the MLH1 gene or in the region of the MLH1 gene. Specific kinds of short probe sequence(s) where repetitive sequences have been removed include those selected from the group consisting of or comprising the sequences obtained by PCR amplification on human genomic DNA using the primer pairs described in Table 1 in the lines:

[0056] MSH2-v1

[0057] P3 (primer pairs P3a_MSH2-v1 to P3c_MSH2-v1, SEQ ID NO:21-26)

[0058] P4 (primer pairs P4a_MSH2-v1 to P4b_MSH2-v1, SEQ ID NO:27-30)

[0059] P5 (primer pairs P5a_MSH2-v1 to P5c_MSH2-v1, SEQ ID NO:31-36)

P6 (primer pairs P6a_MSH2-v1 to P6b_MSH2-v1, SEQ ID NO:37-40)

[0060] P7 (primer pairs P7a_MSH2-v1 to P7c_MSH2-v1, SEQ ID NO:41-46)

[0061] P8 (primer pairs P8a_MSH2-v1 to P8b_MSH2-v1, SEQ ID NO:47-50)

[0062] P9 (primer pairs P9a_MSH2-v1 to P9c_MSH2-v1, SEQ ID NO:51-56)

[0063] P10 (primer pairs P10a_MSH2-v1 to P10b_MSH2-v1, SEQ ID NO:57-60)

[0064] MLH1-v1

[0065] P3 (primer pairs P3a_MLH1-v1 to P3d_MLH1-v1, SEQ ID NO:95-102)

[0066] P4 (primer pairs P4a_MLH1-v1 to P4b_MLH1-v1, SEQ ID NO:103-106)

[0067] P5 (primer pairs P5a_MLH1-v1 to P5b_MLH1-v1, SEQ ID NO:107-110)

[0068] P6 (primer pair P6a_MLH1-v1, SEQ ID NO:111-112)

[0069] P7 (primer pair P7a_MLH1-v1, SEQ ID NO:113-114)

[0070] P8 (primer pairs P8a_MLH1-v1 to P8d_MLH1-v1, SEQ ID NO:115-122)

[0071] and the short probes may be used in combination with the long probe sequence(s) selected from the group consisting of or comprising the sequences obtained by PCR amplification on human genomic DNA using the primer pairs described in Table 1 in the lines

[0072] MSH2-v1

[0073] P11 (primer pairs P11a_MSH2-v1 to P11c_MSH2-v1, SEQ ID NO:61-66)

[0074] P12 (primer pairs P12a_MSH2-v1 to P12e_MSH2-v1, SEQ ID NO:67-76)

[0075] MLH1-v1

[0076] P9 (primer pairs P9a_MLH1-v1 to P9c_SEQ ID NO:123-128)

[0077] P10 (primer pairs P10a_MLH1-v1 to P10e_MLH1-v1, SEQ ID NO:129-138).

[0078] Specific kinds of contiguous short probe sequence(s) forming long stretches include those selected from the group consisting of or comprising the sequences obtained by PCR amplification on human genomic DNA using the primer pairs described in Table 1 in the lines:

[0079] MSH2-v2

[0080] PE1-2 (primer pairs PE1_MSH2-v2 to PE2_MSH2-v2, SEQ ID NO:163-166) and

[0081] PE3-6 (primer pairs PE3_MSH2-v2 to PE5-6_MSH2-v2, SEQ ID NO:167-172), together forming one stretch;

[0082] PE9 (primer pairs E9_MSH2-v2 and I9-10_MSH2-v2, SEQ ID NO:185-188),

[0083] PE10 (primer pair E10_MSH2-v2, SEQ ID NO:189-190),

[0084] PE11 (primer pairs E11_MSH2-v2 and I11-12_MSH2-v2, SEQ ID NO:191-194),

[0085] PE12-14 (primer pairs E12_MSH2-v2 and E13-14_MSH2-v2, SEQ ID NO:195-198) and

[0086] PE15-16 (primer pairs E15_MSH2-v2 and E16_MSH2-v2, SEQ ID NO:199-202), together forming one stretch;

[0087] MLH1-v2

[0088] PE1-2 (primer pairs E1_MLH1-v2 and E2_MLH1-v2, SEQ ID NO:227-230),

[0089] PE3-4 (primer pairs I23_MLH1-v2, E3_MLH1-v2 and E4_MLH1-v2, SEQ ID NO:231-236),

[0090] PE5-6 (primer pairs E5_MLH1-v2 and E6_MLH1-v2, SEQ ID NO:237-240),

[0091] PE7-9 (primer pairs E7-8_MLH1-v2 and E9_MLH1-v2, SEQ ID NO:241-244) and

[0092] PE10-11 (primer pairs E10_MLH1-v2 and E11_MLH1-v2, SEQ ID NO:245-248), together forming one stretch;

The primers designed for the purpose of preparing short probes of the invention may have a sequence of 20 to 40 nucleotides and comprise in their 3' end a sequence of at least 20 contiguous nucleotides that base pairs with the target. The primer sequence thus may also comprise additional nucleotides that do not base pair with the target in its 5' end. The nucleotides which do not base pair may be useful for the construction of the primers or for the cloning of the amplified sequence resulting from polymerization starting from the primers. In a particular embodiment the sequence of the primer that hybridizes to the target is longer than 20 nucleotides.

Molecular Combing is a powerful FISH-based technique for direct visualization of single DNA molecules that are attached, uniformly and irreversibly, to specially treated glass surfaces (Herrick and Bensimon, 2009; Schurra and Bensimon, 2009). This technology considerably improves the structural and functional analysis of DNA across the genome and is capable of visualizing the entire genome at high resolution (in the kb range) in a single analysis.

Another embodiment of the invention is a method for designing a set of short probes or set of short and long probes as described above comprising:

[0093] identifying a polynucleotide containing a genomic region of interest,

[0094] selecting long probe sequences outside of the genomic region of interest but within 100 kb of the closest probe in the region of interest, and preferably within 30 kb of the closest probe in the region of interest and optionally removing frequently repeated sequences from said long probe sequences,

[0095] selecting short probe sequences from within the genomic region of interest so that no gaps longer than 20 kb, and preferably no gaps longer than 12 kb, appear between the short probes; or selecting a series of short probes that together form a long continuous stretch that covers the genomic region of interest;

[0096] hybridizing the probes to a genomic polynucleotide comprising the genomic region of interest,

[0097] detecting the hybridized probes, and

[0098] determining which sets of probes form motifs that specifically identify the genomic sequence of interest from a reference genomic sequence.
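The selection criteria of the design method above (gaps between short probes, and distance from each long probe to the closest short probe) can be checked computationally before any hybridization is performed. The following is a minimal sketch assuming probes are given as hypothetical (start, end) coordinates in bp; the function names and example coordinates are illustrative, and the default thresholds are the 20 kb and 100 kb limits stated above (the preferred 12 kb and 30 kb limits can be passed instead).

```python
from typing import List, Tuple

Probe = Tuple[int, int]  # (start, end) in bp, end exclusive


def largest_gap(short_probes: List[Probe]) -> int:
    """Largest uncovered distance between consecutive short probes."""
    ordered = sorted(short_probes)
    largest, reach = 0, ordered[0][1]
    for start, end in ordered[1:]:
        largest = max(largest, start - reach)
        reach = max(reach, end)
    return largest


def distance_to_closest_short(long_probe: Probe, short_probes: List[Probe]) -> int:
    """Distance in bp from a long probe to the nearest short probe (0 if touching)."""
    long_start, long_end = long_probe
    return min(max(0, s - long_end, long_start - e) for s, e in short_probes)


def check_design(short_probes: List[Probe],
                 long_probes: List[Probe],
                 max_gap_bp: int = 20_000,
                 max_long_distance_bp: int = 100_000) -> List[str]:
    """Return a list of violated criteria; an empty list means none were found."""
    problems: List[str] = []
    gap = largest_gap(short_probes)
    if gap > max_gap_bp:
        problems.append(f"gap of {gap} bp between short probes exceeds {max_gap_bp} bp")
    for probe in long_probes:
        distance = distance_to_closest_short(probe, short_probes)
        if distance > max_long_distance_bp:
            problems.append(f"long probe {probe} is {distance} bp from the closest short probe")
    return problems


shorts = [(100_000, 105_000), (112_000, 118_000), (125_000, 130_000)]
longs = [(60_000, 90_000)]
print(check_design(shorts, longs))                    # stated limits
print(check_design(shorts, longs, 12_000, 30_000))    # preferred limits
```

Such a check only covers the selection steps; whether the resulting motifs actually distinguish the sequence of interest from a reference still has to be determined experimentally, as in the last three steps.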

[0099] The location of the hybridized probes on the target genomic polynucleotide sequence is compared with one or more motifs based on the hybridization of said probes to a reference, control, normal, not mutated, or not rearranged genomic polynucleotide sequence, as disclosed in the databanks or experimentally obtained on samples.

[0100] The techniques disclosed herein may be applied to diagnosis of disease as well as for the identification of genetic rearrangements associated with a disease, disorder or condition. They may also be used as companion diagnostics to study the responses of a subject or group of subjects who have particular rearrangements to therapy, responses to environmental agents, or the effects of lifestyle choices. Specifically, the diagnostic products and methods of the invention are useful for diagnosis and assessments for subjects having or at risk of developing colorectal cancer. High resolution barcodes allow multiplex analysis of patients for extended or expanded diagnosis at the levels of patient stratification/classification and prognosis. Thus, the techniques disclosed herein can also be used to predict the course and probable outcome of a disease, disorder or condition as well as the likelihood of progression, stability, or recovery. Multiplex high resolution barcodes also permit the identification of key genetic alterations in a subject that would benefit from a particular kind of therapy as well as a way to assess the reaction of a subject to a particular kind of therapy or therapeutic intervention. Specific embodiments of the invention include the following, which embodiments are especially carried out in vitro.

[0101] A method for detecting mutated or rearranged genomic polynucleotide sequence comprising: (a1) hybridizing a target genomic polynucleotide comprising one or more genomic region(s) of interest, where mutations or rearrangements are sought, to a set of short probes that bind to each region of interest without long gaps between the portions of the target sequence bound by the set of short probes, said set of short probes optionally including or being in combination with a (sub)set of short probes selected so that on each genomic region some of the short probes when taken together form a long contiguous stretch inside or outside the region of interest and where the short probes may optionally have frequent repetitive sequences removed; or (a2) hybridizing a target genomic polynucleotide comprising one or more genomic region(s) of interest, where mutations or rearrangements are sought, to a set of short probes that bind to each region of interest without long gaps between the portions of the target sequence bound by the set of short probes and to one or more long (docking) probe(s) that bind to sequences near but outside of the region(s) of interest; wherein the sequence(s) of the long probe(s) does not overlap that of the short probes and wherein the short and/or long probes may optionally have some or all of the frequently repeating sequences removed; (b) detecting the locations of hybridized probes on the genomic region(s) of interest; optionally, (c) comparing the location of the hybridized probes on the target genomic polynucleotide sequence with one or more motifs based on the hybridization of said probes to a reference, control, normal, not mutated, or not rearranged genomic polynucleotide sequence; and optionally, (d) correlating the presence of a mutated or rearranged genomic polynucleotide with a specific phenotype, disease, disorder, or condition.

[0102] The invention relates in particular to the method herein described wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has cancer or who is suspected of having cancer or who is susceptible to have a genetic predisposition to cancer.

[0103] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has colorectal cancer or who is suspected of having colorectal cancer or who is susceptible to have a genetic predisposition to colorectal cancer, wherein said short and long probes identify mutations or genomic rearrangements associated with colorectal cancer, wherein said control, not mutated or not rearranged genomic sequence is obtained from a subject not at risk for colorectal cancer, and wherein the presence of or risk of developing colorectal cancer is assessed when said genomic rearrangement is detected. In this method the probes can hybridize specifically on the MSH2 gene, in the region of the MSH2 gene, on the MLH1 gene, or in the region of the MLH1 gene.

[0104] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has breast cancer or who is suspected of having breast cancer or who is susceptible to have a genetic predisposition to breast cancer.

[0105] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has ovarian cancer or who is suspected of having ovarian cancer or who is susceptible to have a genetic predisposition to ovarian cancer.

[0106] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has lung cancer or who is suspected of having lung cancer or who is susceptible to have a genetic predisposition to lung cancer.

[0107] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has a cardiovascular disease, disorder or condition or who is suspected of having cardiovascular disease, disorder or condition or who is susceptible to have a genetic predisposition to cardiovascular disease, disorder or condition.

[0108] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has diabetes or who is suspected of having diabetes or who is susceptible to have a genetic predisposition to diabetes.

[0109] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has a neuromuscular disorder or who is suspected of having a neuromuscular disorder.

[0110] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has, is suspected of having, or is suspected of being a carrier for a genetic or hereditary disease, disorder or condition.

[0111] The invention also relates in a particular embodiment to a method wherein the short and long probe sequences are specific to human genes or to human genomic regions associated with cancer, colorectal cancer or a foetal genetic alteration known or unknown when said region or gene is mutated or genetically rearranged.

[0112] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has, is suspected of having, or is suspected of being a carrier for a multigenic genetic or hereditary disease, disorder or condition or for a genetic or hereditary disease, disorder or condition associated with rearrangement of genomic DNA.

[0113] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject undergoing treatment for a disease, disorder or condition associated with a genomic inherited or acquired rearrangement and the results obtained are compared to results obtained at other time points before, during or after the termination of treatment.

[0114] The invention relates to the method of any of the embodiments described herein, characterized by the following features taken individually or in any combination: the hybridizing with the short and long probes in (a2) is performed simultaneously; the short probes are 10 kb or less; and/or the short probe(s) comprise at least one short (less than 10 kb) sequence and at least one non-overlapping long sequence (more than 12 kb), or at least one group of at least two short sequences, less than 5, 6, 7, 8, 9 or 10 kb each, which total group length is longer than 12 kb and less than 150 kb, hybridizing continuously on the mutated or rearranged polynucleotide sequence. In these methods the short probes may comprise a set of contiguous probes that span a stretch of the genomic polynucleotide sequences inside or outside the region of interest that is at least 14 kb; and/or the long probe(s) may comprise one or more docking probes of more than 14 kb and less than 40 kb. The long probe(s) may have a length of at least 14 kb and bind to a polynucleotide sequence outside the region of interest.

[0115] Both the long and short probes may be designed to exclude frequently occurring repetitive DNA sequences. These repetitive DNA sequences, which may be excluded from the long and short probes, will generally appear more than once and more often than statistically predicted based on their length and base content. For example, a repetitive DNA sequence between 50 and 400 contiguous nucleotides in length, which appears more than once and more often than statistically predicted based on its length and base content, can be excluded from the short and/or long probe(s). One example of repetitive sequences that can be excluded from the short and long probes is the members of the repetitive Alu family of DNA sequences.

[0116] In some embodiments of the invention the probes in (b) of the first embodiment are fluorescently tagged so that they can be detected fluorometrically. In other embodiments, in (b) each probe is tagged with one of two or more fluorescent tags.

[0117] According to other embodiments of the methods above, motifs or easily identifiable subsets of the probes are detected and compared instead of every probe sequence.

[0118] The methods described above may employ at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more short probes. These short probes may each have a length of at least 500, 600, 700, 800, 900 or more base pairs (bp). In some embodiments of the methods above, the probes will be selected so that the gaps between short probes in the genomic region of interest are no more than 12 kb each. In further embodiments the short probes will bind to a single contiguous genomic region of interest or the short probes can be selected to bind to more than one non-contiguous genomic region of interest. The long probes used in the method above may be selected so as to be no more than 20, 30 or 40 kb. The or each of the genomic region(s) of interest in the methods described above can be selected to be longer than 50 kb.

[0119] Another embodiment of the invention is a kit comprising a set of short probes or a set of short and a set of long probe(s); and optionally one or more components for binding said probes to a polynucleotide, for performing molecular combing, and/or for detecting whether hybridization has occurred; (i) wherein the short probes comprise a set of probes that taken together bind to a long continuous stretch of the genomic region of interest; or (ii) wherein the long probes bind to sequences outside the genomic region of interest, do not overlap the short probe sequences; and optionally, where the repetitive sequences have been removed from the long and/or short probes. A kit of the invention is suitable and/or is specific for use in a method of the invention as disclosed herein. In a particular embodiment its short and/or long probes are characterized by the features described herein in relation with the methods. Such a kit may be employed for or contain instructions for the detection of genomic rearrangements associated with colorectal cancer or genetic predisposition to colorectal cancer; for the detection of genomic rearrangements associated with breast cancer or genetic predisposition to breast cancer; for the detection of genomic rearrangements associated with ovarian cancer or genetic predisposition to ovarian cancer; for the detection of genomic rearrangements associated with lung cancer or genetic predisposition to lung cancer.

[0120] Another embodiment of the invention is a composition containing the short, or short and long probe(s) described by the first embodiment above, wherein at least two of said probe sequences detect a genetic rearrangement by using Molecular Combing, said composition comprising either (a) at least one short (less than 10 kb) sequence and at least one non-overlapping long sequence (more than 14 kb), or (b) at least one group of at least two short sequences, less than 10 kb each, which total length is longer than 14 kb and less than 150 kb, hybridizing contiguously on the genetic target. In this composition the short probe(s) can range from 0.5 kb to 9 kb and the long probe(s) can range from 14 kb to 40 kb. The size of the short probes may range from 0.5 to 9 kb and at least 90% of the frequent repetitive sequences can be removed from the short probe sequences. This composition may contain probe sequences that hybridize specifically on the MSH2 gene or in the region of the MSH2 gene or on the MLH1 gene or in the region of the MLH1 gene.

[0121] In yet another embodiment the invention involves a method for designing short and long probes described herein in relation to methods comprising (a) identifying a polynucleotide containing a genomic region of interest, (b) selecting long probe sequences outside of the genomic region of interest but within 100 kb of the closest probe within the region of interest and optionally removing frequently repeated sequences from the long probe sequences, (c) selecting a set of short probe sequences from within the genomic region of interest so that no gaps longer than 15 kb appear between the short probes; or selecting a series of short probes that together form a long continuous stretch that covers the genomic region of interest; (d) hybridizing the probes to a genomic polynucleotide comprising the genomic region of interest, (e) detecting the hybridized probes, and (f) determining which sets of probes form motifs that distinguish the genomic sequence of interest from a reference genomic sequence.

BRIEF DESCRIPTION OF THE DRAWINGS

[0122] FIG. 1, which includes sub-parts identified as FIG. 1A, FIG. 1B, and FIG. 1C. (A) FIG. 1A: Dot-plot of MSH2 gene sequence on RP11-1084A21 BAC clone. (B) FIG. 1B: probe code v1 (without repetitive element) on RP11-1084A21. (C) FIG. 1C: probe code-v2 on RP11-1084A21. Diagonal lines are perfectly matched regions of DNA between two sequences. Dots represent repetitive elements. A higher density of dots (or a grey band) indicates a higher density of repetitive elements.

[0123] FIG. 2, which includes sub-parts identified as FIG. 2A, FIG. 2B, and FIG. 2C. Dot plot analysis of MLH1 region. (A) FIG. 2A: Dot-plot of MLH1 gene sequence on RP11-426N19 BAC clone. (B) FIG. 2B: probe code v1 (without repetitive element) on RP11-426N19. (C) FIG. 2C: probe code-v2 on RP11-426N19.

[0124] FIG. 3, which includes sub-parts identified as FIG. 3A and FIG. 3B. Designed probe set for MSH2 by exclusion of repetitive element. (A) FIG. 3A: theoretical probe set (labeled in red and green in microscopy experiments, represented here in grey and black, respectively), and position of exon (small numbered dots). (B) FIG. 3B: actual hybridization image corresponding to MSH2-v1 probe set. Original microscopy images consist of three channel images where each channel is the signal from a given fluorophore; these are acquired separately in the microscopy procedure. These channels are represented here as different shades on a grayscale: green probes are shown in black and red probes in gray, while the background (absence of signal) is white. The aspect ratio was not preserved: signals have been "widened" (i.e. stretched perpendicularly to the direction of the DNA fiber) in order to improve the visibility of the probes.

[0125] FIG. 4, which includes sub-parts identified as FIG. 4A and FIG. 4B. Designed probe set for MLH1 by exclusion of repetitive element. (A) FIG. 4A: theoretical probe set (red and green), and position of exon (purple dot). (B) FIG. 4B: actual hybridization image corresponding to MLH1-v1 probe set. The same color conventions are used for diagrams and microscopy images as in panels A and B of FIG. 3.

[0126] FIG. 5, which includes sub-parts identified as FIG. 5A and FIG. 5B. Designed probe set for MSH2 with docking probes (v2). (A) FIG. 5A: theoretical probe set. (B) FIG. 5B: actual hybridization image corresponding to MSH2-v2 probe set. The color conventions in this and the other 3-color microscopy images (and corresponding diagrams) are as follows: blue probes are represented in black, green probes in dark gray, red probes in light gray and the background is white.

[0127] FIG. 6, which includes sub-parts identified as FIG. 6A and FIG. 6B. Designed probe set for MLH1 with docking probes (v2). (A) FIG. 6A: theoretical probe set. (B) FIG. 6B: actual hybridization image corresponding to MLH1-v1 probe set. The same color conventions are used for diagrams and microscopy images as in FIG. 5.

[0128] FIG. 7, which includes sub-parts identified as FIG. 7A, FIG. 7B, and FIG. 7C. Validation of genomic rearrangement in MSH2 in LoVo cell line with v2 probe set. Sketches of both theoretical probe set (top; FIG. 7A) and validated rearrangement (middle; FIG. 7B) by molecular combing. The photo (bottom; FIG. 7C) is the recurrent abnormal signal set which corresponds to the deletion from exon 3 to exon 8 of MSH2 (as in middle). The same color conventions are used for diagrams and microscopy images as in FIG. 5.

[0129] FIG. 8, which includes sub-parts identified as FIG. 8A, FIG. 8B, and FIG. 8C. Validation of genomic rearrangement in MLH1 in SK-OV-3 cell line with v2 probe set. Sketches of both theoretical probe set (top; FIG. 8A) and validated rearrangement (middle; FIG. 8B) by molecular combing. The photo (bottom; FIG. 8C) is the representative signal set (observed in few cases) corresponding to the upstream part of the MLH1 probe set (left side of theoretical probe set). The difference in the number of observations between the MSH2 probe signal (normal) and the MLH1 signal (a part of the left side) clearly demonstrates that the deletion of exons 4 to 19 in MLH1 is homozygous (consistent with reference 7). The molecular combing test also revealed that the deletion is larger than previously reported (probes downstream of exon 19 are all deleted). The same color conventions are used for diagrams and microscopy images as in FIG. 5.

[0130] Table 1 describes primer sequences and coordinates on human genomic DNA used for hybridization fragment synthesis to design the probes of the invention. These primers, or variants thereof obtained by adding nucleotides at the ends of the described sequences and having up to 40 nucleotides, are part of the invention.

[0131] Table 2. Analysis of sequence of probe sets and their covering region. These sequences and the sets of probes that are disclosed in particular, are part of the invention.

[0132] The sequence of each probe set or region was subjected to the RepeatMasker test and some representative values are shown in the table. Sum length: sum of the sequence lengths of all probes in each set. For MLH1 and MSH2 regions, this is the total length of each region. Repeat length: sum of the sequences recognized as any sort of repeat in the human genome. This includes sequences other than SINE. Total repeat: % of repeat length in sum length. SINE: % of sequences categorized as SINE in sum length. ALUs: % of sequences categorized as Alu family sequences in sum length.

DETAILED DESCRIPTION OF THE INVENTION

[0133] The above described strategies, for the reasons mentioned, are unsuitable to design a high-resolution code for diagnostics applications using technologies such as molecular combing.

[0134] In the present invention, the probes are defined as follows: a short probe is a nucleic acid sequence complementary to a genomic sequence, which probe can be detected with a given marker (such as a fluorochrome) once hybridized on the genomic sequence. One probe may be either made of (i) one single fragment covering the whole sequence, or of (ii) several exactly contiguous fragments, and/or (iii) slightly overlapping fragments (with an overlap less than 250 bp) and/or (iv) fragments separated by a very short gap (less than 1000 bp). With such short overlaps or gaps, using Molecular Combing in our current setup, the fragments appear almost contiguous. The distance may be adjusted depending on the specific technique and experimental conditions. For example, with less resolutive conditions, longer gaps (less than 2 kb) or overlaps may be tolerated, provided fragments separated by such a gap still appear contiguous. Under more resolutive conditions, gaps should be shorter (less than 200 bp) in order for the fragments to appear contiguous. Short probes range in size from 500 bp to 10 kb.

[0135] A long probe is a nucleic acid sequence complementary to a genomic sequence, which probe can be detected with a given marker (such as a fluorochrome) once hybridized on the genomic sequence. One probe may be either made of (i) one single fragment covering the whole sequence, or of (ii) several exactly contiguous fragments, and/or (iii) slightly overlapping fragments (with an overlap less than 250 bp) and/or (iv) fragments separated by a gap (less than 3.5 kb), provided that more than 70% of the target sequence stretch is covered by probes (i.e. provided the gaps represent less than 30% of the target sequence). With such overlaps or gaps, using Molecular combing in our current setup, the fragments are efficiently detected. The distance may be adjusted depending on the specific technique and experimental conditions. For example, with less resolutive conditions, longer gaps (less than 5 kb each, representing in total less than 50% of the sequence) or overlaps may be tolerated, provided fragments separated by such gaps are still detected efficiently. Also, under such conditions, longer probes should be used (more than 20 kb) to allow for efficient detection. Under more resolutive conditions, gaps should be shorter (less than 2 kb) in order for the fragments to be efficiently detected, and probes may still be efficiently detected with shorter size (more than 10 kb). Long probes range in size from 12 kb to 150 kb.
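By way of illustration only, the short-probe and long-probe definitions above can be expressed as a simple check on the fragments making up a probe. The sketch below assumes each fragment is given as a (start, end) pair of coordinates in bp on the target sequence and that no fragment lies entirely inside another; the function names and the representation are hypothetical and are not part of the disclosed probe sets. The thresholds are those stated for the current setup (overlap less than 250 bp; gap less than 1000 bp for short probes, less than 3.5 kb with more than 70% coverage for long probes).

```python
from typing import List, Tuple

# Hypothetical representation: (start, end) in bp on the target sequence, end exclusive.
Fragment = Tuple[int, int]


def junctions(fragments: List[Fragment]) -> List[int]:
    """Signed distance between consecutive fragments: > 0 is a gap, < 0 an overlap."""
    frags = sorted(fragments)
    return [nxt[0] - cur[1] for cur, nxt in zip(frags, frags[1:])]


def span(fragments: List[Fragment]) -> int:
    """Length of the genomic stretch covered from the first start to the last end."""
    return max(end for _, end in fragments) - min(start for start, _ in fragments)


def is_short_probe(fragments: List[Fragment],
                   max_overlap: int = 250, max_gap: int = 1000) -> bool:
    """Short probe: 500 bp to 10 kb, overlaps < 250 bp, gaps < 1000 bp."""
    joined = all(-max_overlap < d < max_gap for d in junctions(fragments))
    return joined and 500 <= span(fragments) <= 10_000


def is_long_probe(fragments: List[Fragment], max_overlap: int = 250,
                  max_gap: int = 3500, min_coverage: float = 0.70) -> bool:
    """Long probe: 12 kb to 150 kb, gaps < 3.5 kb, more than 70% of the stretch covered."""
    dists = junctions(fragments)
    joined = all(-max_overlap < d < max_gap for d in dists)
    total_gap = sum(d for d in dists if d > 0)
    coverage = 1 - total_gap / span(fragments)
    return joined and coverage > min_coverage and 12_000 <= span(fragments) <= 150_000
```

For instance, five exactly contiguous 3 kb fragments form a 15 kb stretch that satisfies the long-probe check but not the short-probe check. Under less or more resolutive conditions, the default thresholds would be adjusted as described above.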

[0136] In the present invention, the size of probes reflects the length of the genomic sequence where the probe hybridizes, independently of the number of strands in the DNA molecules. Therefore, a probe may be described as 1 kb (1 kilobase=1000 bases) or, indifferently, as 1000 bp (base pairs): in both cases, the probe hybridizes over 1000 bases of one of the strands of the target DNA molecule (and, if the probe is double stranded, also on the 1000 complementary bases of the other strand of the target molecule).

[0137] In the present invention, a "barcode" designates a specific motif formed by a set of probes labeled with different markers, where the motif characteristics are the lengths of the probes in the set, the lengths of the gaps separating successive probes and the colors in which the probes are detected (or, more generally, the markers with which the probes are labeled).
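As a purely illustrative sketch, a barcode as defined above can be represented as an ordered list of labeled segments, each probe contributing its color and length and each space between successive probes contributing a gap. The `Probe` class, the `barcode_motif` function and the coordinates used below are hypothetical and serve only to make the definition concrete.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Probe:
    start: int   # bp, position on the target sequence
    end: int     # bp, exclusive
    color: str   # marker with which the probe is detected


def barcode_motif(probes: List[Probe]) -> List[Tuple[str, int]]:
    """Return the motif as (label, length) pairs, where label is a color or 'gap'."""
    ordered = sorted(probes, key=lambda p: p.start)
    motif: List[Tuple[str, int]] = []
    previous = None
    for probe in ordered:
        # A gap is recorded only when successive probes are not adjacent.
        if previous is not None and probe.start > previous.end:
            motif.append(("gap", probe.start - previous.end))
        motif.append((probe.color, probe.end - probe.start))
        previous = probe
    return motif
```

With three hypothetical probes, red (0-3000), green (5000-9000) and blue (9000-12000), the motif reads red 3000, gap 2000, green 4000, blue 3000.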

[0138] If a high coverage barcode is to be designed for high resolution, probe and space lengths need to be roughly in the 0.5 kb to 10 kb range (see above). This makes it impractical to design probes that completely exclude rearrangements, and yet are spaced closely enough for the code to allow high location precision. On the other hand, some non-specific hybridization (i.e. hybridization of [parts of] a probe on genomic regions that are not the designed target of that probe) of a probe is acceptable when using a code strategy for the reading of signals. Indeed, in applications such as Southern blot where the hybridization of a single probe is assessed or aCGH where hybridization of every probe is considered separately, the non-specific hybridization of probes on even a very limited number of regions may lead to completely unusable results. To a lesser extent, this is also the case with multiple-probe applications such as FISH, since the resolution of FISH is insufficient to distinguish genomic regions as far apart as several tens of megabases: a single non-specific hybridization would lead to unusable results if it were located close enough to the targeted region.

[0139] In molecular combing and other similar applications using a code strategy, the quantity of non-specifically hybridized probes is not an issue per se. If a probe (or fragments of a probe) hybridizes even multiple times outside the region of interest, it is unlikely it will recreate a motif sufficiently similar to the code to be confusing. Also, non-specific hybridization over short sequences (<<1 kb), even within the region of interest, would most likely not be detected, unless they are sufficiently clustered to generate a long (>1 kb) stretch of non-specific hybridization. For the above reasons, the inventors have developed an alternative approach for the design of probes when the main issue is the design of one or several high resolution codes in one or several given genomic regions. The main step of this approach relies only on the knowledge of the sequence of the region(s) themselves. When designing such a code, the major issue is to avoid significant non-specific hybridization within the region(s) of interest. Non-specific hybridization becomes an issue only if several probes display non-specific hybridization on neighboring sequences outside the region of interest. In the latter case, there is a risk that the pattern of probes resembles the original code, or a rearranged version of it, and this would likely lead to false conclusions. Although the invention described herein does not allow excluding such occurrences, this is relatively easily done once the method described herein has been used to exclude other non-specific hybridizations (see below).

[0140] The basis for this approach is the detection and exclusion of sequences that are repetitive within the region(s) of interest. For this, only the corresponding sequence(s) (the target sequence(s)) have to be known. One easy way to detect such repeats is the search for local sequence alignments within the target sequence(s), which can be done with e.g. a dot-plot comparison of each target sequence with itself and the other target sequences. A dot-plot is a graph with the two (sets of) sequences that are being compared forming the two axes, while dots are printed at every point where the coordinates correspond to a local homology. For example, if nucleotide x from sequence A (horizontal axis) matches nucleotide y from sequence B (vertical axis), then a dot will appear at the point with (x; y) coordinates. Graphically, local alignments appear as diagonal lines. Some more elaborate tools inspired by dot-plots are available, that compare short sequences ("words", typically a few nucleotides/tens of nucleotides long) rather than single nucleotides, and display dots in various shades of gray depending on the extent of homology, thus allowing a direct visual reading of relaxed homologies (non-specific hybridization may well appear with incomplete homology). The comparison may also be done directly on both strands for one of the sequences, so homologies appear for both sense and reverse complement orientations. An example of such a tool is "Dotter" (ref 4).
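The word-based dot-plot principle described above can be sketched as follows. This is a minimal illustration using exact word matches only; it is not the Dotter program, it does not score relaxed homologies in shades of gray, and the function names and the default word size are assumptions made for the example.

```python
from collections import defaultdict
from typing import Set, Tuple

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def word_dotplot(seq_a: str, seq_b: str, word: int = 11,
                 both_strands: bool = True) -> Set[Tuple[int, int, str]]:
    """Return dots (x, y, strand): the word starting at x in seq_a matches exactly
    the word starting at y in seq_b ('+') or its reverse complement ('-')."""
    index = defaultdict(list)
    for y in range(len(seq_b) - word + 1):
        index[seq_b[y:y + word]].append(y)

    dots = set()
    for x in range(len(seq_a) - word + 1):
        w = seq_a[x:x + word]
        for y in index.get(w, ()):
            dots.add((x, y, "+"))
        if both_strands:
            for y in index.get(reverse_complement(w), ()):
                dots.add((x, y, "-"))
    return dots
```

When a sequence is plotted against itself, the main diagonal (x = y) is always present; runs of dots parallel to it, away from the diagonal, correspond to the local alignments that reveal repeats within the target sequence.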

[0141] With these tools, very frequent repetitive sequences, such as Alu sequences in the Human genome, appear quite clearly, as they have local homologies with numerous other sequences within the target regions. Therefore, stretches with a high frequency of these sequences appear as a gray band (horizontal or vertical depending on whether the stretch is located on the vertical or horizontal axis). The exact appearance of these stretches with dot-plot display tools will depend on settings, and possibly word size. Settings were selected such that sequence stretches longer than 200 bp with more than 80% homology appear clearly and can be located with a roughly 10 bp precision.

[0142] A sequence of 200 bp or more that contains more than 10 significant homologous sequences (less than 1, 2, 3, 4, 5, 10, 15 or 20% nucleotide mismatch or insertion/deletion) within the regions of interest is a frequent repetitive sequence, prone to generate significant non-specific hybridization. It is generally possible to design probes in such a way that they are void of these frequent repetitive sequences, thus increasing the specificity and the high resolution of the present technology compared to the published previous methods.
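As a rough illustration of the criterion above (a stretch of 200 bp or more having more than 10 homologous copies within the regions of interest), the sketch below flags stretches in which every word occurs more than a given number of times in the target sequence. Exact word counting is used here as a simplified stand-in for the homology search with mismatches described above; it is an assumption of this example, as are the function name and the default word size.

```python
from collections import Counter
from typing import List, Tuple


def frequent_repeat_stretches(seq: str, word: int = 12, min_copies: int = 10,
                              min_len: int = 200) -> List[Tuple[int, int]]:
    """Return (start, end) stretches, end exclusive, of at least min_len bp in which
    every word of the given size occurs more than min_copies times in seq."""
    n_words = len(seq) - word + 1
    counts = Counter(seq[i:i + word] for i in range(n_words))

    stretches = []
    start = None
    for i in range(n_words):
        repeated = counts[seq[i:i + word]] > min_copies
        if repeated and start is None:
            start = i
        elif not repeated and start is not None:
            end = i - 1 + word  # the last repeated word started at i - 1
            if end - start >= min_len:
                stretches.append((start, end))
            start = None
    if start is not None and len(seq) - start >= min_len:
        stretches.append((start, len(seq)))
    return stretches
```

Stretches returned by such a screen would be candidates for exclusion from probe design; a tolerance for mismatches or insertions/deletions, as contemplated above, would require an alignment-based comparison instead of exact words.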

"Docking" Probes

[0143] Although, as shown above, shorter probes make for more precise localization of breakpoints and measurement of deleted or amplified sequences, they are, generally speaking, more difficult to detect with fiber-FISH techniques and molecular combing, as they appear as shorter stretches of signal, i.e., they are both smaller and less easy to distinguish from noise (fluorescent spots either unrelated to probes or to hybridization of probes). This is particularly true when considering automatic (computer-based) detection of signals.

[0144] It is therefore desirable to include longer probes in the code (for example, more than 12 kb and less than 150 kb, preferably more than 14 kb and less than 40 kb, in particular for the detection of genetic rearrangements in the regions of MSH2 or MLH1 genes). These probes would appear as actual lines (rather than spots), readily distinguishable from noise and easily detectable due to their size. Once the signals of interest are detected, the detection of other probes located on the same DNA fiber is easier.

[0145] This is especially true using technologies such as Molecular Combing where the linearity of the fibers implies the other probes, if any, are located in the alignment of the first probe. Therefore, the invention provides that the inclusion of longer (>12 kb, preferably >14 kb) probes in the set of probes is a step towards easier detection of signals of interest. Not all probes in the set need to be that long: in a fast and "rough" detection step, the long probes are sought, which allows the localization of signals of interest. These probes are called "docking probes" as they allow one to "land" on the regions of interest efficiently. In a second step, the shorter probes are sought in the neighborhood of the docking probes (and more specifically in the case of Molecular Combing or related technologies, in the alignment of these probes). Although a human operator can hardly execute these steps formally one after the other, an operator who limits the initial search to longer probes can browse through images more rapidly (at a pace that only allows detection of these probes) and spend more time on images where a docking probe is seen, in order to look for the other, shorter probes. As the longer docking probes would locally diminish the location precision and the resolution of the code, it is preferable for them not to be located in the region where rearrangements are sought. This is possible if the probes are located near, but not in, the region of interest, e.g. at either end of this region.

[0146] If it is desirable to only consider complete signals in the analysis of a given region (i.e. signals covering the entire contiguous region), these longer probes may also be used to assess the integrity of the region: if there is a probe located at each end and both probes are present, no breakage of the fiber has occurred during the DNA preparation or stretching step. In cases where several non contiguous regions are analyzed in a single test, obviously each region has to have its "docking" probes in order to be correctly detected.
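The use of docking probes for a first "rough" detection step and for the integrity check described above may be sketched as follows. The sketch assumes a detected signal is already available as an ordered list of (label, length in kb) segments along one fiber, with gaps labeled "gap"; the function names and this representation are hypothetical.

```python
from typing import List, Tuple

Segment = Tuple[str, float]  # (color or "gap", measured length in kb)


def find_docking(signal: List[Segment], min_len_kb: float = 12.0) -> List[int]:
    """Indices of segments long enough to be considered docking probes."""
    return [i for i, (label, length) in enumerate(signal)
            if label != "gap" and length >= min_len_kb]


def is_complete(signal: List[Segment], min_len_kb: float = 12.0) -> bool:
    """A signal is treated as complete when it both starts and ends with a
    docking probe, i.e. no fiber breakage occurred inside the region."""
    docking = find_docking(signal, min_len_kb)
    return len(signal) >= 2 and 0 in docking and (len(signal) - 1) in docking
```

In a two-step reading, only signals for which `find_docking` returns at least one index would be examined further for the shorter probes, and only those for which `is_complete` is true would be kept when the analysis is restricted to complete signals.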

Continuous Stretch of Short Probes

[0147] An alternative to the "docking probes" approach above is to design the set of probes in such a way that at least some groups of shorter probes form a continuous stretch of signal. This is possible if probe sequences are adjacent. In that case, several probes, although short enough (less than 10 kb) to provide for sufficient resolution, may well combine to form a long enough (more than 14 kb) signal for fast and reliable detection. Indeed, if the operator may combine color channels to view images, this stretch would still appear as a long line rather than a spot, allowing its distinction from background noise. This is possible by using either common optical setups such as tri-color filters in fluorescence microscopy, or by using common image viewing software. In the case of automatic detection, it is also possible to use combined color information and therefore to make use of the very characteristic aspect of a multicolor line relative to background spot-like noise.
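For illustration, the combination of adjacent short probes into a continuous multicolor stretch can be sketched as a merge of adjacent intervals, ignoring color. The probe coordinates, the function names and the `max_gap` parameter are assumptions made for this example; the 14 kb threshold is the one mentioned above.

```python
from typing import List, Tuple

ColoredProbe = Tuple[int, int, str]  # (start bp, end bp exclusive, color)


def continuous_stretches(probes: List[ColoredProbe],
                         max_gap: int = 0) -> List[Tuple[int, int]]:
    """Merge probes whose sequences are adjacent (or overlapping), whatever their color."""
    ordered = sorted(probes)
    if not ordered:
        return []
    stretches = []
    cur_start, cur_end = ordered[0][0], ordered[0][1]
    for start, end, _color in ordered[1:]:
        if start - cur_end <= max_gap:
            cur_end = max(cur_end, end)
        else:
            stretches.append((cur_start, cur_end))
            cur_start, cur_end = start, end
    stretches.append((cur_start, cur_end))
    return stretches


def detectable_stretches(probes: List[ColoredProbe],
                         min_len: int = 14_000) -> List[Tuple[int, int]]:
    """Stretches long enough (more than min_len bp) to appear as a line."""
    return [(s, e) for s, e in continuous_stretches(probes) if e - s > min_len]
```

Three adjacent probes of 5, 6 and 5 kb in different colors thus yield a single 16 kb stretch, although each individual probe stays below 10 kb.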

Measurements

[0148] The probe designs described above likely lead to a large number of probes to be measured in a test. The usual approach for probe measurement is to measure all of the probes constituting a signal, as well as the gaps separating them. In a test with a large number of probes, the amount of work required for analyzing results is increased. In order to balance this, the invention relates to a more efficient designed approach for signal measurement. This approach consists in the measurement of subgroups of probes constituting easily recognizable motifs. The subgroups are two or several consecutive probes and the gaps between them, and possibly gaps at either end, chosen in order for their total length to remain within reasonably precise measurement range (10-30 kb).
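One possible way of forming such subgroups is sketched below, under the assumption that a code is given as an ordered list of (label, length in kb) segments. The grouping is a simple greedy pass that closes a subgroup on a gap as soon as its total length reaches the lower bound of the measurement range; this particular rule and the function names are hypothetical choices made for the example, not the design procedure used for the disclosed probe sets.

```python
from typing import List, Tuple

Segment = Tuple[str, float]  # (color or "gap", length in kb)


def group_into_motifs(segments: List[Segment],
                      min_kb: float = 10.0) -> List[List[Segment]]:
    """Greedily group consecutive segments; a motif is closed on a gap once
    its total length reaches min_kb. Leftover segments form a last motif."""
    motifs, current, total = [], [], 0.0
    for label, length in segments:
        current.append((label, length))
        total += length
        if label == "gap" and total >= min_kb:
            motifs.append(current)
            current, total = [], 0.0
    if current:
        motifs.append(current)
    return motifs


def out_of_range(motifs: List[List[Segment]], min_kb: float = 10.0,
                 max_kb: float = 30.0) -> List[List[Segment]]:
    """Motifs whose total length falls outside the preferred 10-30 kb range."""
    return [m for m in motifs
            if not min_kb <= sum(length for _, length in m) <= max_kb]
```

Motifs reported by `out_of_range` would need to be regrouped by hand, for example by attaching a short trailing probe to the preceding motif.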

[0149] There is likely to be a systematic bias in the measurement of digitalized images of fluorescent segments. Indeed, at the extremity of such a fragment, the intensity of the signal decreases gradually when moving away from the center, to reach the level of the background. Depending on where the operator/the software sets the threshold for the determination of the actual end there may be a systematic over- or under-estimation of the lengths. This bias is compensated for if the measured motifs have a probe at one end and a gap at the other. Therefore, it is preferable to design motifs in this way.
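The compensation described above can be checked with a small numerical sketch. It assumes, for the purpose of the example, that the thresholding bias shifts every probe extremity outwards by the same amount; the coordinates and function names are hypothetical.

```python
from typing import List, Tuple

Edges = Tuple[float, float]  # (start, end) of a probe signal, in kb


def measured_edges(probes: List[Edges], bias: float) -> List[Edges]:
    """Each probe extremity is detected 'bias' kb further out than its true position."""
    return [(start - bias, end + bias) for start, end in probes]


def probe_length(edges: List[Edges], i: int) -> float:
    """Length of probe i alone (probe at both ends)."""
    return edges[i][1] - edges[i][0]


def gap_length(edges: List[Edges], i: int) -> float:
    """Length of the gap between probe i and probe i + 1 (gap at both ends)."""
    return edges[i + 1][0] - edges[i][1]


def probe_plus_gap(edges: List[Edges], i: int) -> float:
    """Motif with probe i at one end and the following gap at the other:
    measured from the start of probe i to the start of probe i + 1."""
    return edges[i + 1][0] - edges[i][0]
```

With true probes at 0-4 kb and 7-12 kb and a bias of 0.5 kb, the first probe is measured as 5 kb instead of 4 kb and the gap as 2 kb instead of 3 kb, whereas the probe-plus-gap motif is measured as 7 kb, its true length, because both of its limits are shifted in the same direction.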

[0150] If a motif is found to have an abnormal length (different from the expected theoretical length) in a given sample, it remains possible to measure the probes and gaps within this motif in order to further refine the location of the rearrangement. With this approach, it is possible to measure in a fast and efficient way all of the signals for initial screening, while keeping the location precision allowed by small probes. The somewhat lower precision on measurements due to the larger size of the subgroups compared to the probes is essentially compensated for by the higher number of signals that can be measured within the same operator time.

Application to HNPCC--Rationale

[0151] Colorectal cancer is the 4th most frequent form of cancer in humans, and around 5% of cases are considered hereditary. The most frequent form of hereditary colorectal cancer is known as Lynch syndrome, or HNPCC (hereditary non-polyposis colorectal cancer). HNPCC increases the lifetime risk of cancer development to up to 80% (lifetime risk is around 7% in the normal US population). HNPCC also increases the risk of other cancers (endometrial, ovarian, stomach).

[0152] The genetic basis of HNPCC is known to be mutation in some of the Mismatch Repair (MMR) genes such as MSH2, MLH1, MSH6, PMS2, etc. MSH2 and MLH1 mutations account for more than 80% of all mutations of MMR genes in HNPCC. Both point mutations and large rearrangements are reported in those genes, and an especially high percentage of large mutations is observed in MSH2 because of the high level of small repetitive elements in its genetic sequence. Today the molecular diagnosis is done after study of familial cancer history and tumor characterization by a microsatellite instability test.

[0153] Normally, mutation of one allele of one of the MMR genes is sufficient for molecular diagnosis of HNPCC. All HNPCC individuals have both wild-type and mutated genes. Point mutations of the targeted MMR genes can be detected by sequencing, and the current sequencing test investigates only the sequence of exons. Large rearrangements such as deletion and amplification (loss and gain of genetic elements, respectively) are not detected by sequencing because the altered sequences do not exist and the primer binding regions for sequencing are frequently deleted. As a result, sequence information comes only from the wild-type allele and gives a false negative. Indeed, the MSH2 and MLH1 genes have a higher percentage of SINE repetitive elements in their genetic sequence. To address these large rearrangements, the test should detect the presence of deletion or amplification in the MMR genes. One approach is cartography of MMR genes with designed hybridization probes. Causal large rearrangements range widely in size, from sub-kb to loss of the total gene (up to 100 kb). A given cartography has to be sensitive to this wide dynamic range of mutation. To cope with it, a specific probe design was done for the MSH2 and MLH1 loci.

[0154] The present invention is also related to the detection of known or unknown genomic rearrangements. It is also related to kits containing probes according to the invention, for the detection of known or unknown genomic rearrangements and the associated pathologies, or associated predispositions to pathologies such as cancers or cardiovascular diseases for example.

EXAMPLES

Application to HNPCC--Materials and Method

Probe Design v1

[0155] Each probe on a region of the gene sequence itself has a length between 3 and 6 kb. Here, "probe" means a continuous hybridization signal, which can consist of multiple cloned DNA fragments: e.g., probe 1 of MSH2-v2 covers a 15 kb stretch and consists of five cloned DNA fragments of 3 kb. Since the gap or overlap at each junction of these five fragments is smaller than the resolution (<50 bp), they are considered as, and indeed look like, a continuous single probe of 15 kb. In case of a rearrangement larger than the probe or gap size, an obvious change in the color pattern of the designed probes will be observed. Like large rearrangements in probe regions, such rearrangements are also detectable in gap regions, meaning that any rearrangement larger than 1 kb at any position in the target genes is detectable. This is unique to the cartography method with high-resolution probe hybridization. Other techniques (MLPA, aCGH) can detect only rearrangements involving a probe sequence. For genes with a high frequency of large rearrangements such as MSH2 and MLH1, the presence of repetitive elements in their genetic sequence limits the freedom of probe design for the other technologies. Because inclusion of repetitive element sequences in their probe design greatly increases false detection, their probe design has to be free of repetitive elements in principle.

[0156] Probe sequences were chosen by a dot-plot analysis. The BAC clone sequence of each gene (RP11-1084A21 (Ch2: 47,574,044-47,785,729) for MSH2, RP11-426N19 (Ch3: 36,992,516-37,161,490) for MLH1) was self-plotted and all grey band regions were excluded from the target region of PCR primer design. PCR primer sets were designed in the target regions with the Primer3Plus PCR primer design tool (ref 6). A list of the primer sequences is shown in Table 1A and B. Exclusion of Alu repeats was verified by both dot-plot analysis and RepeatMasker (http://www.repeatmasker.org). FIG. 1B and FIG. 2B show far fewer grey bands on the dot-plot of the probe fragment sequences against the BAC clone than on the dot-plot of the gene (containing Alu repeats) against the BAC clone. This indicates that the sequence of the designed probes does not include recurrent repetitive sequences in these target regions. RepeatMasker analysis (with the default settings of the web server) also clearly shows a dramatic reduction of the % of Alu sequence in the designed probe sequences (Table 2).

Probe Design v2

[0157] To facilitate "recognition" of barcodes on hybridization images, an alternative design of probe set (called v2) was done as described in the "Docking" Probes section. The design process is the same as for v1, except that there is no exclusion of repetitive elements based on dot-plot. For the v2 probe design, each probe was designed to have more than 3 kb length, close to the limit to be recognized as a "line", and all exon sequences are covered by a probe stretch (no exons fall in gaps). Docking probes of 15-20 kb length were designed at both extremities of each gene. For the MSH2-v2 code, specific probes covering the EPCAM gene (see rationale part) were also included between two docking probes. The DNA sequence of the designed code v2 was subjected to dot-plot analysis to make sure that there are no segmental repeats inside the designed region (FIGS. 1C and 2C).

Cloning of Probe Fragments and Labeling for Hybridization Probe

[0158] Each probe fragment was amplified by PCR, then ligated into a plasmid vector (pNEB193, pCR2.1-TOPO, pCRXL-TOPO). The ligation product was transformed into E. coli competent cells and the end-sequences of the cloned fragment were verified. The purified plasmid DNA set of each gene was separated into two (v1) or three (v2) groups according to the colors of the theoretical barcodes (FIG. 3A and FIG. 4A for v1, FIG. 5 and FIG. 6 for v2 probe sets). Each group of plasmid DNA was labeled by the random priming method. Either whole plasmids containing the probe fragment sequences or PCR-amplified probe fragments were used as a template for random priming. Three haptens were used for three-color detection: biotin (Biot), digoxigenin (Dig) and Alexa Fluor 488 (A488). Biot-labeling was done with the BioPrime DNA labeling system (Invitrogen) according to the manufacturer's instructions. For Dig and A488 labeling, the dNTP mixture in the kit was replaced with home-blend dNTP mixtures (either 0.1 mM Digoxigenin-11-dUTP (Roche Applied Science) for Dig labeling or 0.1 mM ChromaTide® Alexa Fluor® 488-7-OBEA-dCTP (Invitrogen) for A488 labeling, 0.1 mM unmodified equivalent (dTTP or dCTP) and 0.2 mM each of the other three deoxynucleotides in the final labeling reaction solution).

Sample DNA Preparation

[0159] Three human cell lines were used for validation of large rearrangement detection in either MSH2 or MLH1. Cell line GM17939 was used as the non-mutated sample. Cell line LoVo, which is homozygous for deletion of exon 3 to exon 8 in MSH2, was used for MSH2 rearrangement validation. Another cell line, SK-OV-3, which was reported as having a homozygous deletion of exon 4 to exon 19 in MLH1, was used for rearrangement validation of MLH1. For each cell line, the cell culture was prepared according to the cell bank's instructions. Cultured cells were harvested (for LoVo and SK-OV-3, at 50-70% confluency) or collected by centrifugation (for GM17939, at between 300,000 and 400,000 cells/ml of medium). The cell pellet was resuspended in 1× PBS/Trypsin mixture to have 1,000,000 cells in 45 µl. The cell suspension was mixed with an equal volume of 1.2% (w/v) NuSieve GTG agarose solution in 1× PBS (melted and equilibrated at 50°C in advance). The cell/agarose mixture was poured into a well of a gel plug mold, followed by gelification at 4°C for 30 min. The gelified agarose plug was immersed in a mixture of 2 mg/ml of Proteinase K and 1% (w/v) of sarcosyl in 0.5 M EDTA (pH 8.0, 250 µl for each plug). The agarose plug was incubated at 50°C overnight.

[0160] The next day, the incubated plug was washed in 1× TE (10 mM Tris-HCl, 1 mM EDTA, pH 8.0) 3 times for 1 hour each. The DNA plug can be stored in 0.5 M EDTA at 4°C. The washed plug was stained in 100 µl of 33 µM YOYO-1 (Invitrogen) in TE40.2 (40 mM Tris-HCl, 2 mM EDTA, pH 8.0) for 1 hour in the dark. The stained plug was heated at 68°C in 1 ml of combing buffer (0.5 M MES pH 5.5) for 20 min, then cooled at 42°C for 10 min prior to adding 1.5 units of beta agarase I (NEB). Beta agarase treatment was carried out overnight at 42°C in the dark.

[0161] The following day the treated DNA solution was poured into a combing reservoir and a level of the solution in the reservoir was adjusted with additional combing buffer.

Molecular Combing

[0162] The DNA solution was set on a Molecular Combing Machine (MCS, Genomic Vision). Molecular combing was performed on silanized coverslips (Combicoverslips, Genomic Vision). The combed coverslips were fixed at 68°C for 4 hours, then used for hybridization (or stored at -20°C until use).

Hybridization and Detection of Probe

[0163] For one hybridization, 5 µl of each of the labeled probe solutions (of both MSH2 and MLH1) were combined together with 10 µg of sonicated herring or salmon sperm DNA and 10 µg of human Cot1-DNA (only for v2 probe sets), then purified by standard ethanol precipitation. The precipitate was resuspended in 20 µl of hybridization buffer (50% formamide, 2× SSC, 1% SDS and BlockAid blocking solution (Invitrogen)). The resuspended probe solution was placed on a clean glass slide and covered with a DNA-combed coverslip. The slide was heated at 90°C for 5 min for co-denaturation of both probe and combed DNA, then incubated at 37°C overnight in a humid atmosphere for hybridization between the labeled probes and the combed DNA.

[0164] The hybridized coverslips were washed in 50% formamide/2× SSC solution 3 times for 5 min each, followed by another 3 washes with 2× SSC for 5 min each. The washed coverslips were then developed with two or three layers of fluorescently labeled antibodies or streptavidin. For each layer, antibodies for all haptens were diluted 25 times in BlockAid blocking solution (20 µl in final volume) and incubated for 20 min at 37°C. For Biot, Streptavidin Alexa Fluor 594 (Invitrogen) was used for the 1st and the 3rd layers, and biotin-conjugated goat anti-streptavidin antibody was used for the 2nd layer. For Dig, AMCA-conjugated mouse anti-Digoxin (Jackson ImmunoResearch) was used for the 1st layer, AMCA-conjugated rat anti-mouse (Jackson ImmunoResearch) for the 2nd layer, and Alexa Fluor 350-conjugated goat anti-rat (Invitrogen) for the 3rd layer. For A488, rabbit anti-Alexa Fluor 488 (Invitrogen) was used for the 1st layer and Alexa Fluor 488-conjugated goat anti-rabbit was used for the 2nd layer (no third antibody for A488). After the 20 min incubation of each antibody layer, the coverslip was washed in 2× SSC/1% Tween 20 washing solution 3 times for 5 min each at room temperature. After the washing of the 3rd layer, the coverslip was rinsed in 1× PBS, followed by successive baths of 70, 90 and 100% ethanol for 1 min each. The coverslip was dried at room temperature prior to microscopy.

Signal Acquisition and Measurement

[0165] Fluorescent signal of developed antibody on the coverslip was obtained by standard epi-fluorescent microscope system or automated fluorescent microscope system (Image Xpress Micro, Molecular Devices) with custom scanning configuration for molecular combing signal. Every set of linearly aligned fluorescent signals and gaps was measured by ImageJ. Each measured set of signals (with color information) was subjected to pattern matching to determine position (if the set is a part of one of probe set) and orientation by comparison with the theoretical probe sets. All unclassified sets (did not match with any positions and orientations of theoretical probe sets) were subjected to similarity check between them to find whether recurrent abnormal pattern appears or not.
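The pattern-matching step described above may be sketched, in a much simplified form, as a sliding comparison of the measured set against the theoretical code read in both orientations. The representation of signals as (label, length in kb) segments, the tolerance value and the function name are assumptions made for this example; the sketch does not account for probes truncated at the ends of a broken fiber, and it is not the analysis software actually used.

```python
from typing import List, Tuple

Segment = Tuple[str, float]  # (color or "gap", length in kb)


def match_signal(measured: List[Segment], theoretical: List[Segment],
                 tol_kb: float = 1.0) -> List[Tuple[str, int]]:
    """Return every (orientation, offset) at which the measured segments agree with
    the theoretical code in label and in length (within tol_kb)."""
    hits = []
    n = len(measured)
    for orientation, code in (("forward", theoretical),
                              ("reverse", theoretical[::-1])):
        for offset in range(len(code) - n + 1):
            window = code[offset:offset + n]
            if all(m_label == t_label and abs(m_len - t_len) <= tol_kb
                   for (m_label, m_len), (t_label, t_len) in zip(measured, window)):
                hits.append((orientation, offset))
    return hits
```

A measured set returning an empty list would be treated as unclassified and passed on to the similarity check between unclassified sets, in order to look for a recurrent abnormal pattern.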

Application to HNPCC--Results

[0166] FIGS. 3B and 4B are representative images of signals from hybridized DNA. Some of the probes look like a "dot" rather than a "line", as expected from their length. There are some "random" spots on the hybridization images, but these spots do not interfere with recognition of the designed code. Although for some small probes (arrowed in FIG. 3B, for example) it is not straightforward to measure the "length" of the probe signal for size evaluation, measurement of the "distance" between probe signals is possible and is equivalent to measurement of the length of probes and gaps in normal probe set hybridization.

[0167] FIGS. 5B and 6B are representative images of the hybridization signal of barcodes-v2. The fluorescent signals are more continuous than the signals of barcodes-v1, and it is easier to find docking probes and measure the length of each probe and gap. These barcodes-v2 were used to visualize large genomic rearrangements of characterized cancer cell lines, LoVo and SK-OV-3 (ref. 5).

[0168] FIG. 7 is the result of hybridization of barcodes-v2 on combed DNA from the LoVo cell line, which is homozygous for a deletion in MSH2 (from exon 3 to 8). The hybridization slide had many normal (identical to the theoretical code) signals of the MLH1 gene but no normal MSH2 signals. Instead, there was a recurrent signal of a truncated form of the normal MSH2 signal (FIG. 7B). By deduction from the truncated signals, this truncation results from loss of the probes and gaps corresponding to exons 3 to 8 of the MSH2 gene.

[0169] FIG. 8 is the result of barcodes-v2 on SK-OV-3 cell line DNA, homozygous for a deletion in MLH1 (from exon 4 to 19). Among many normal MSH2 signals, only a few signals of part of MLH1 (from probe 1 to probe 3) were observed. This indicates that the following sequence of MLH1 is missing, which is consistent with the reference. Moreover, the absence of the right (downstream of MLH1) docking probe indicates that this deletion extends beyond exon 19 of MLH1.

[0170] The sequences selected to detect predisposition to colorectal cancer linked to rearrangements in the MSH2 genomic region or the MLH1 genomic region are preferably chosen among the following nucleotide sequences and their corresponding complementary sequences and are described as:

[0171] The short probes covering the MSH2 gene region and constituting contiguous stretches (PE1-2 and PE3-6 (SEQ ID NO:354-358); PE9 to PE15-16 (SEQ ID NO:365-373) in table 1 under the header MSH2-v2) and the other short probes covering the MSH2 gene region (PE7 and PE8, SEQ ID NO:359-364 in table 1 under the header MSH2-v2); the long probes neighboring the MSH2 gene (tPP1, EPCAM5', EPCAM3' (SEQ ID NO:342-353) and cPP1 (SEQ ID NO:374-378) in table 1 under the header MSH2-v2); the short probes covering the MLH1 gene region and constituting a contiguous stretch (PE1-2 to PE10-11, SEQ ID NO:386-396, in table 1 under the header MLH1-v2) and the other short probes covering the MLH1 gene region (PE12-13, PE14-15 and PE16-19, SEQ ID NO:397-401, in table 1 under the header MLH1-v2); the long probes neighboring the MLH1 gene (tPP1 (SEQ ID NO:379-385) and cPP1 (SEQ ID NO:402-408) in table 1 under the header MLH1-v2). For example, these probes may be obtained by amplification of the fragments using the primers listed in Table 1 under the headers MSH2-v2 (SEQ ID NO:139-212) and MLH1-v2 (SEQ ID NO:213-272).

Incorporation by Reference

[0172] Each document, patent, patent application or patent publication cited by or referred to in this disclosure is incorporated by reference in its entirety, especially with respect to the specific subject matter surrounding the citation of the reference in the text. However, no admission is made that any such reference constitutes background art and the right to challenge the accuracy and pertinence of the cited documents is reserved.

TABLE-US-00001 TABLE 1 MSH2-v1 Name SEQ ID SEQ ID of Name of NO For / NO probe fragment (fragment) Rev (primer) Sequence (5'-3') start end P1 P1a_MSH2-v1 273 forward 1 TTCTTCCCAAGAGAGCCAAG 47595911 47595930 reverse 2 CTGTTTTGGAACCCCAAGTC 47597074 47597093 P1b_MSH2-v1 274 forward 3 GGCTTCAATCTGGGACTACG 47598716 47598735 reverse 4 GCTGTCACCGCCTCTTTTAC 47599478 47599497 P1c_MSH2-v1 275 forward 5 GCCAGGCACTTAGGCAGTAG 47600433 47600452 reverse 6 TTGGTCCTGACATCCTTTCC 47601671 47601690 P1d_MSH2-v1 276 forward 7 TTAGTTGAACAGGGCATGACAC 47602097 47602118 reverse 8 GGTAAAGGGGCCTGATGTC 47602743 47602761 P1e_MSH2-v1 277 forward 9 GAGCCTTGATGTTCCCTCTTAAC 47603695 47603717 reverse 10 ACCCAGATCCGAAACTGTTG 47604324 47604343 P1f_MSH2-v1 278 forward 11 CCGGCCTTACCTTTCATTTC 47605735 47605754 reverse 12 CCAGGATCCAGATCCAGTTG 47606965 47606984 P2 P2a_MSH2-v1 279 forward 13 GAGTTCCATGGCAGATCACC 47612521 47612540 reverse 14 GCAGCTTTCAATCACAAATCAG 47614067 47614088 P2b_MSH2-v1 280 forward 15 GAAGGGTTGGTCTTGCTGTC 47615115 47615134 reverse 16 ACCCTTTGCACCTCTCTGTG 47615632 47615651 P2c_MSH2-v1 281 forward 17 CCCGGTGTTGAATCATTTG 47616079 47616097 reverse 18 TTCAGCCCTGAAGGTAGAGG 47617513 47617532 P2d_MSH2-v1 282 forward 19 CTGGCCACTTTTTGGAAGAG 47618884 47618903 reverse 20 TGGGACGCAGAGTGATACAG 47619394 47619413 P3 P3a_MSH2-v1 283 forward 21 TTACTGGCGATCCTCAGAGC 47629651 47629670 reverse 22 AACGCCTCTTCCGTTGTATG 47631623 47631642 P3b_MSH2-v1 284 forward 23 GAAAGGACAGACCAAGTGCAG 47632605 47632625 reverse 24 AGCCTGTGCAGGGAAACTC 47633083 47633101 P3c_MSH2-v1 285 forward 25 AGTGGGATGCAGCTGAAAAG 47633591 47633610 reverse 26 CAACAGCATGGGAAAGATCC 47635238 47635257 P4 P4a_MSH2-v1 286 forward 27 TTGAAAGTTGGTCTTAGGAAGAGG 47643286 47643309 reverse 28 CCCAACAAACCTGGCTTTAG 47644179 47644198 P4b_MSH2-v1 287 forward 29 AGACGCCCAAAATCAACAAC 47645155 47645174 reverse 30 CCGCTTGCTGCTAAAAATTG 47646042 47646061 P5 P5a_MSH2-v1 288 forward 31 TGATTGCCAAGGAAGATTCAC 47657647 47657667 reverse 32 TGGAAGTAAATGCAGGTGCTC 
47658763 47658783 P5b_MSH2-v1 289 forward 33 TCATTCTTGGGTGTTTCTCG 47659578 47659597 reverse 34 ATGGCGGTTTTGTGGAATAG 47660015 47660034 P5c_MSH2-v1 290 forward 35 GAGGGAGAGGGAACCTTTTG 47661699 47661718 reverse 36 GGGGACTATACCGCATTCAC 47662243 47662262 P6 P6a_MSH2-v1 291 forward 37 TGTTGATTCATGGGCATTTG 47669651 47669670 reverse 38 GCTGGGGAATCATGTATGAAG 47671879 47671899 P6b_MSH2-v1 292 forward 39 CATCAAGCACAGTTCCATTG 47672243 47672262 reverse 40 TTCTCTTTCCGTTTCCAGTG 47673113 47673132 P7 P7a_MSH2-v1 293 forward 41 GGAGCTTGGGAATTCAACTG 47678126 47678145 reverse 42 AGAAACGGGCATGTCATAGG 47679330 47679349 P7b_MSH2-v1 294 forward 43 CAGCCTACGTGCCCATTTC 47679649 47679667 reverse 44 TCAAAAGATGGCCAAAATGC 47681179 47681198 P7c_MSH2-v1 295 forward 45 GTGTTGCACCCATTAACTCG 47681915 47681934 reverse 46 AGCCTGGTGAGAGGTGACTG 47684723 47684742 P8 P8a_MSH2-v1 296 forward 47 CACGATGCCAGTCCAATTC 47689478 47689496 reverse 48 AAGGTGGACTTTAATGCAAAGG 47690835 47690856 P8b_MSH2-v1 297 forward 49 GGAGTGAGAGCGACACCTTG 47691634 47691653 reverse 50 CGACAGCTGACTGCTCTATGG 47694068 47694088 P9 P9a_MSH2-v1 298 forward 51 CACAATGGGAAAGGATGTAGC 47701939 47701959 reverse 52 CAGAGAAAAACACCCATGACC 47704112 47704132 P9b_MSH2-v1 299 forward 53 CACCGTGATCCTCCTTATTTC 47704395 47704415 reverse 54 GAACAAACAACGGATGAAAGG 47704945 47704965 P9c_MSH2-v1 300 forward 55 GTGGCATATCCTTCCCAATG 47705311 47705330 reverse 56 CCCCCAGACTGTGAATTAAGG 47705787 47705807 P10 P10a_MSH2-v1 301 forward 57 GATGCAGATCAGGGAAATGC 47711630 47711649 reverse 58 ATCTTGCTGGATGGACAAGG 47715272 47715291 P10b_MSH2-v1 302 forward 59 CTTAATCCTGAAAGGCAGGTG 47715788 47715808 reverse 60 TGTTTCTCAGGCAACCACAG 47717266 47717285 P11 P11a_MSH2-v1 303 forward 61 GAAACCACAGAATCGCCTTC 47731087 47731106 reverse 62 ACCTGGACAGTCCCACAGAC 47733482 47733501 P11b_MSH2-v1 304 forward 63 CAGTGCTTTTGCATCCTTCC 47734903 47734922 reverse 64 ATTTAATCCCCTGGCCAATC 47741649 47741668 P11c_MSH2-v1 305 forward 65 CACCTGTGCCCATCACATAG 47742239 47742258 reverse 66 
GAGTCCCCTCTTGGAGAACC 47747829 47747848 P12 P12a_MSH2-v1 306 forward 67 AAAGCCATTTCCAGTGTCG 47753989 47754007 reverse 68 ATTGTGCAGCCAGAATTGAG 47758158 47758177 P12b_MSH2-v1 307 forward 69 TTCACAGCAAAGTGGCTCAG 47760593 47760612 reverse 70 GCTATTATGGGCTGCAAAGC 47764302 47764321 P12c_MSH2-v1 308 forward 71 TTCACTCCCAACAAGCACTG 47764863 47764882 reverse 72 TGCCCAGTCCTTTTTCACT 47765618 47765636 P12d_MSH2-v1 309 forward 73 AATCCCTCCTGCACACTTTC 47765925 47765944 reverse 74 AATGGATGCTTCCACTGTCC 47767687 47767706 P12e_MSH2-v1 310 forward 75 CCATCTGTGCAATTCCTTCC 47768105 47768124 reverse 76 GTTCAAAGGCAGAAGCCATC 47769886 47769905 MLH1-v1 SEQ ID Name of Name of NO For / SEQ ID NO probe fragment (fragment) Rev (primer) sequence (5'-3') start end P1 P1a_MLH1-v1 311 forward 77 GTCTGGATTCTTTCACAATGTAGC 37005551 37005576 reverse 78 TGCCAATCTTCTCCTCTGTTC 37006562 37006582 P1b_MLH1-v1 312 forward 79 AACCACCCAATGTGTTCACC 37006815 37006836 reverse 80 GTTCATTCCTGCGAGTAGGC 37007422 37007441 P1c_MLH1-v1 313 forward 81 GCCAAAGGTGGAAAATGTTG 37008987 37009008 reverse 82 GCCTTCTTCATGAAAGCACTG 37009873 37009893 P1d_MLH1-v1 314 forward 83 CCAGAAGGTGGAAGCTACAG 37011079 37011100 reverse 84 TGGGGTCAATGAAGCAAG 37011830 37011847 P1e_MLH1-v1 315 forward 85 ACATCGACCCAGAAAGTTCC 37012314 37012335 reverse 86 AATGTGCTTCGTACCACTGC 37012867 37012886 P1f_MLH1-v1 316 forward 87 AGCGTGCCATTGTACTCTCC 37013822 37013843 reverse 88 TTTCTGAGCCCATGATTTCC 37015267 37015286 P2 P2a_MLH1-v1 317 forward 89 GTGCCCAGCTAGTTCCATTC 37023623 37023644 reverse 90 TCAAGAGCGCTAATCCCATC 37025002 37025021 P2b_MLH1-v1 318 forward 91 TGCACATGCTCACTGAAAGAC 37026505 37026527 reverse 92 TTTTGCCTGCAAACTGACC 37027818 37027836 P2c_MLH1-v1 319 forward 93 CAGCAAGCACCAAATCACTG 37028305 37028326 reverse 94 AGTACCAGCCGTCCAAACTG 37032621 37032640 P3 P3a_MLH1-v1 320 forward 95 CCTGGCCAGAAAATTCATTG 37037607 37037628 reverse 96 ACCCTGCATTCCAAACTCAC 37039199 37039218 P3b_MLH1-v1 321 forward 97 GCAGTCCTTTGAGGATTTAGC 37042493 37042515 reverse 98 
GAAAGATATCCAACAGGAAGTGAG 37043300 37043323 P3c_MLH1-v1 322 forward 99 TGGCCTTGTTTAAGGTCCTG 37043746 37043767 reverse 100 ATGGTCCTGCTGCTTCAGAG 37044723 37044742 P3d_MLH1-v1 323 forward 101 ACCCCGTCATAGCACAGTTC 37045295 37045316 reverse 102 CAAAGGCCATTCATCAGTTTC 37046439 37046459 P4 P4a_MLH1-v1 324 forward 103 GTGGCGTGATATCCTTGATTC 37053034 37053056 reverse 104 CTCTGGAATGACTGCTGCTG 37054289 37054308 P4b_MLH1-v1 325 forward 105 TGTGCTAGATGCCTCACTGG 37055182 37055203 reverse 106 TTGCCAAGAAGCACAACAAG 37058326 37058345 P5 P5a1_MLH1-v1 326 forward 107 CGGAGGCTCTACTGTTGGAC 37062345 37062366 reverse 108 TGCTGTCCACTCTGGAACTG 37064753 37064772 P5b_MLH1-v1 327 forward 109 ACATCAGAAGCCCTGGTTTG 37064571 37064592 reverse 110 GCTGGGAGTTCAAGCATCTC 37067377 37067396 P6 P6a_MLH1-v1 328 forward 111 TCGGTCTCAGTCACCATTTG 37072097 37072118 reverse 112 AACGCACCTGGCTGAAATAC 37075920 37075939 P7 P7a_MLH1-v1 329 forward 113 TGAACCTGCAATATCTCAGAGG 37079607 37079630 reverse 114 CTTACCGATAACCTGAGAACACC 37083805 37083827 P8 P8a_MLH1-v1 330 forward 115 CCCAGCCCATATATTTTAAAGC 37088387 37088410 reverse 116 CCAGCCACTCTCTGGACTATC 37089049 37089069 P8b_MLH1-v1 331 forward 117 GACATGGAGAGCCGAATCC 37089669 37089689 reverse 118 CCATTAAAATCGGGTCTGAAAG 37091446 37091467 P8c_MLH1-v1 332 forward 119 TCCAGACCCAGTGCACATC 37091887 37091907 reverse 120 CATGGTCAGTGCCATCAGAG 37092412 37092431 P8d_MLH1-v1 333 forward 121 AGCCTCCCAAAGTTAAGTGC 37092788 37092809 reverse 122 CCCAGCTAAAACCAACACAC 37093346 37093365 P9 P9a_MLH1-v1 334 forward 123 TGCCCTCAGCTACTCACTCC 37103285 37103306 reverse 124 AGGGCTCAGCCTTTAGGAAC 37105620 37105639 P9b_MLH1-v1 335 forward 125 GCCAGACTCTCGTTCCATTC 37106390 37106411 reverse 126 ACTCCCCATTCAGTCCCTTC 37111053 37111072 P9c_MLH1-v1 336 forward 127 AGGCACAACGTCAGGTTTTC 37114109 37114130 reverse 128 TTGGAATTTGTCCTGGTGTG 37117519 37117538 P10 P10a_MLH1-v1 337 forward 129 CACCATTGCCAACACTTCTG 37132898 37132919 reverse 130 GCCATTGGTTTGAAGGTGAC 37134201 37134220 P10b_MLH1-v1 338 forward 131 
CTTAGTCACCGCCTGTCCTC 37134738 37134759 reverse 132 TAGCTGCATGTGGCTAATCG 37136986 37137005 P10c_MLH1-v1 339 forward 133 TGTGGCTCGCATTACATTTC 37137579 37137600 reverse 134 CGCTGTCATTACCTGCTTTG 37139742 37139761 P10d_MLH1-v1 340 forward 135 TGACCTCCAAAATCATCCAG 37140449 37140470 reverse 136 TTCTGAGCTAGGAGGTGCTG 37141321 37141340 P10e_MLH1-v1 341 forward 137 CCAGATTTGTAAATCCCTGTTC 37142008 37142031 reverse 138 TGTGTGGTTCTTAAGCATTCC 37142420 37142440 MSH2-v2 Name of SEQ ID NO For / SEQ ID NO probe Name of fragment (fragment) Rev (primer) sequence (5'-3') start end tPP1 tPP1a_MSH2-v2 342 forward 139 CTCAGTCCATCAGCCTCCTC 47574824 47577784 reverse 140 TGCTGTGCCCTGAGATTAAG 47574823 47577783 tPP1b_MSH2-v2 343 forward 141 AACTTAATCTCAGGGCACAGC 47577763 47580677 reverse 142 TGCAGCTTCAGCCTCTTG 47577762 47580676 tPP1c_MSH2-v2 344 forward 143 GCGTGGTGTTTCGTACCAG 47580604 47583785 reverse 144 GCTACTGGCCAGAAATCTTCC 47580603 47583784 tPP1d_MSH2-v2 345 forward 145 GCCCAGCCCTACTAAGGAAG 47583750 47586723 reverse 146 CTGTGCTCCCCTGCTAGAAC 47583749 47586722 tPP1e_MSH2-v2 346 forward 147 GTCGTCCTCTTCGACCTAGC 47586769 47589967 reverse 148 CAGCGCCTATTCTACAGCAG 47586768 47589966 EPCAM5' EPCa_MSH2-v2 347 forward 149 TTCTTCCCAAGAGAGCCAAG 47595912 47598965 reverse 150 CCACCTTTAATCTGCCCAAC 47595911 47598964 EPCb_MSH2-v2 348 forward 151 GTGTTGGGCAGATTAAAGGTG 47598944 47602122 reverse 152 GCAGTGTCATGCCCTGTTC 47598943 47602121 EPCc_MSH2-v2 349 forward 153 CTCTTTGTGCCCTTTCTTTTG 47601745 47604931 reverse 154 AGTTCCTTAAAGCAGAGAAGATGG 47601744 47604930 EPCAM3' EPCd_MSH2-v2 350 forward 155 AACCTGTCCCTGTGGATGAG 47604796 47607923 reverse 156 CCGAAGCATCCTTACATTCC 47604795 47607922 EPCe_MSH2-v2 351 forward 157 AATACCTGAACCCCCAAACC 47607722 47609876 reverse 158 CTCAGGCTATTTTCCAGATTCAC 47607721 47609875 EPCf_MSH2-v2 352 forward 159 GCATGCCTGTCATTCTGG 47609695 47612812 reverse 160 TCCAAGGGACTGAAACACAC 47609694 47612811 EPCg_MSH2-v2 353 forward 161 TTAGTGTGTTTCAGTCCCTTGG 47612790 47615135 reverse 162 
GACAGCAAGACCAACCCTTC 47612789 47615134 PE1-2 E1_MSH2-v2 354 forward 163 GCACATTACGAGCTCAGTGC 47629942 47633045 reverse 164 CTACCAGGAGAACAGCACAGG 47629941 47633044 E2_MSH2-v2 355 forward 165 TGGGTTAGCATTGTGTTAGGTG 47632899 47636029 reverse 166 CCACAGGTGTGTGCCAATAG 47632898 47636028 PE3-6 E3_MSH2-v2 356 forward 167 AAGTTGCAGTTTGGCTGGTC 47635845 47638929 reverse 168 TTATCTCCAGCGGTGCTTATG 47635844 47638928 E4_MSH2-v2 357 forward 169 TACCATAAGCACCGCTGGAG 47638906 47642053 reverse 170 ACTCCACCAAGCCCAGTCTC 47638905 47642052 E5-6_MSH2-v2 358 forward 171 TTTAGAGACTGGGCTTGGTG 47642030 47644205 reverse 172 CTCTTCCCCAACAAACCTG 47642029 47644204 PE7 I6-7_MSH2-v2 359 forward 173 CCCAGTTTCAAGCGATTAAG 47651443 47654570 reverse 174 AGGAAAAGCATGTTATCTCCAG 47651442 47654569 E7_MSH2-v2 360 forward 175 TTCCGTAGCAGTAGGCATCC 47654026 47657170 reverse 176 TCACCACCACCAACTTTATGAG 47654025 47657169 I7-8_MSH2-v2 361 forward 177 TCCCAGATCTTAACCGACTTG 47656956 47660035 reverse 178 ATGGCGGTTTTGTGGAATAG 47656955 47660034 PE8 E8_MSH2-v2 362 forward 179 CCCAAACAACAGCATTAGCC 47670887 47673915 reverse 180 ACATCAGCCTCGGGACAAG 47670886 47673914 I8-9a_MSH2-v2 363 forward 181 TGAGCCCGTTGAATATAGTGG 47673830 47675514 reverse 182 AGTTTTCCTAAACGGGATGATG 47673829 47675513 I8-9b_MSH2-v2 364 forward 183 ATGGGTGTGCACGTGTGTAG 47675368 47678365 reverse 184 GCCATGTGCAATTGTGAGTC 47675367 47678364 PE9 E9_MSH2-v2 365 forward 185 CCTTGCATAGTTTGCTTCTGG 47688375 47690450 reverse 186 ATCATACAAGGGCCTGTTGG 47688374 47690449 I9-10_MSH2-v2 366 forward 187 AAACAGAAATCGCCCAACAG 47690418 47692377 reverse 188 TAGAGACCCACCCAGAAACG 47690417 47692376 PE10 E10_MSH2-v2 367 forward 189 CAGTCCGATTTCGTTTCTGG 47692347 47695506 reverse 190 CACACCTAGATTTGGCAATGG 47692346 47695505 PE11 E11_MSH2-v2 368 forward 191 TTCCATTGCCAAATCTAGGTG 47695484 47698468 reverse 192 GGCCCTAGTGTTTCCTTTCC 47695483 47698467 I11-12_MSH2-v2 369 forward 193 AAGGAAACACTAGGGCCTACAAC 47698452 47700589 reverse 194 CCTGGCCTCAGTACACTTTTG 47698451 47700588 PE12-14 
E12_MSH2-v2 370 forward 195 AGGGATTCTCCCCACTTAGC 47700228 47702718 reverse 196 ATTGGAGGACTGGCTCAAAG 47700227 47702718 E13-14_MSH2-v2 371 forward 197 GCTTACCTTTGAGCCAGTCC 47702694 47705819 reverse 198 ACATGTTCCTACCCCCAGAC 47702693 47705818

PE15-16 E15_MSH2-v2 372 forward 199 TTTCTGCATCAGTTGGTTGC 47706613 47709532 reverse 200 GCCAAGTTATTGCTGCTTCAG 47706612 47709531 E16_MSH2-v2 373 forward 201 AGCCCTGTGAGGTTGGTAAC 47709413 47712504 reverse 202 TCAACAACAGCTGGAACTGC 47709412 47712503 cPP1 cPP1a_MSH2-v2 374 forward 203 CCTCTCAGGTCAGGCTTCTG 47730898 47733882 reverse 204 GCTCCCGCTAGAGAAACTCC 47730897 47733881 cPP1b_MSH2-v2 375 forward 205 GAGCGAAGCACCTAAAGCAC 47733879 47736946 reverse 206 AATTGGAGGGGGTGGAGTAG 47733878 47736945 cPP1c_MSH2-v2 376 forward 207 TGTCACCCAGTCAGGTCATC 47736760 47739876 reverse 208 TTGGAAGGAATCCAACAAGG 47736759 47739875 cPP1d_MSH2-v2 377 forward 209 TTCCCAGAACTCCTTGTTGG 47739846 47742962 reverse 210 TGCAAACCCCTTCTTTTCAG 47739845 47742961 cPP1e_MSH2-v2 378 forward 211 ACCCCATGCAGAAGCAATAG 47743027 47746218 reverse 212 AAATCCTGAAGGTGGGTTCC 47743026 47746217 MLH1-v2 Name of Name of SEQ ID NO For / SEQ ID NO probe fragment (fragment) Rev (primer) sequence (5'-3') start end tPP1 tPP1b_MLH1-v2 379 forward 213 AGTTTCAGCCATGTTGCAG 37005587 37005605 reverse 214 TTGGCAAAATTGTGACTGAG 37007511 37007530 tPP1c_MLH1-v2 380 forward 215 CAGTCACAATTTTGCCAAGG 37007513 37007532 reverse 216 AGTTCGTGGCATCTAACTATCG 37009688 37009709 tPP1d_MLH1-v2 381 forward 217 GGTCCATGTGCTCCAAAAAG 37009460 37009479 reverse 218 TCCAAAACTGGGAACAAACC 37012624 37012643 tPP1e_MLH1-v2 382 forward 219 TGGTTTGTTCCCAGTTTTGG 37012623 37012642 reverse 220 TAGTGCACCACAGCCTCAAG 37015706 37015725 tPP1f_MLH1-v2 383 forward 221 GGATCACTTGAGGCTGTGGT 37015700 37015719 reverse 222 TCCAACAACTGCTGTGAAGG 37018677 37018696 tPP1g_MLH1-v2 384 forward 223 CACCACTGACCTTCCCTTCC 37018492 37018511 reverse 224 GCACAGAAAGACAAATATCACATGC 37020534 37020558 tPP1h_MLH1-v2 385 forward 225 CTCTTCCTCGTCTCCTCCTG 37020430 37020449 reverse 226 CCAATTCAATGCAAAACCTG 37022464 37022483 PE1-2 E1_MLH1-v2 386 forward 227 CGAGCAGCTCTCTCTTCAGG 37034273 37034292 reverse 228 AGCCTATAAGCACAGACCAACTG 37037250 37037272 E2_MLH1-v2 387 forward 229 TTCTCTAGCAGTTGGTCTGTGC 
37037242 37037263 reverse 230 ACCCTGCATTCCAAACTCAC 37039199 37039218 PE3-4 I23_MLH1-v2 388 forward 231 GTTCATTTTGGGGCATGTTC 37039148 37039167 reverse 232 CTGCAACCTCCTTTGAGACAG 37042218 37042238 E3_MLH1-v2 389 forward 233 TGTCTCAAAGGAGGTTGCAG 37042219 37042238 reverse 234 CCAAAATGAAACTGCCTTCC 37044367 37044386 E4_MLH1-v2 390 forward 235 AGTTCCCTGGGTCATTTTCC 37044393 37044412 reverse 236 TTGTGGGAAGGCAAACTAGC 37046381 37046400 PE5-6 E5_MLH1-v2 391 forward 237 CCTGTGCTAGTTTGCCTTCC 37046376 37046395 reverse 238 GGTGGTCACCGTGGTAAAAG 37049553 37049572 E6_MLH1-v2 392 forward 239 GACCACCATGTGATTTCCAAG 37049566 37049586 reverse 240 TTGGTTGGCGGTTATTTCTC 37052510 37052529 PE7-9 E7-8_MLH1-v2 393 forward 241 TAACCGCCAACCAAGAAAAG 37052516 37052535 reverse 242 TGTCTGGAGACCTTCCCAAG 37055360 37055379 E9_MLH1-v2 394 forward 243 TGTGCTAGATGCCTCACTGG 37055182 37055201 reverse 244 ACTTGCCTACATTGCCCATC 37058175 37058194 PE10-11 E10_MLH1-v2 395 forward 245 ATGGGCAATGTAGGCAAGTC 37058176 37058195 reverse 246 TCTGCAGCCATGAATAAGTCC 37061070 37061090 E11_MLH1-v2 396 forward 247 CAGAGCTGAGGCGATAAATTG 37060960 37060980 reverse 248 TGCTCCTCTCCAATCCATTC 37063973 37063992 PE12-13 E12_MLH1-v2 397 forward 249 ATACTTTCCCAGCCCAAACC 37066434 37066453 reverse 250 TGATGGGGAAATGAGAGGAG 37069438 37069457 E13_MLH1-v2 398 forward 251 AGTGGCCTTTGTCCATTGAG 37069405 37069424 reverse 252 GACAGAGGTGAGAGCCTAGGAG 37071540 37071561 PE14-15 E14-15_MLH1-v2 399 forward 253 AATGTGTTGGGGAAGTGGTC 37081262 37081281 reverse 254 TTTGGACCACGGCTTTAGAC 37084405 37084424 PE16-19 E16-18_MLH1-v2 400 forward 255 AAGCTGAGGTCACGGATTTG 37087522 37087541 reverse 256 GATGGGCAAGTTTCATCTCC 37090568 37090587 E19_MLH1-v2 401 forward 257 TGGGACGAAGAAAAGGAATG 37090401 37090420 reverse 258 CACCGTGCCTCAGCCTATAC 37093446 37093465 cPP1 cPP1a_MLH1-v2 402 forward 259 GGACTAACCCACCTCCCTTC 37103239 37103258 reverse 260 GCTATAGGCAGCCCAGAGTG 37106372 37106391 cPP2a_MLH1-v2 403 forward 261 GCCAGACTCTCGTTCCATTC 37106390 37106409 reverse 262 
AGGATTTGCCGTATGGACTC 37109450 37109469 cPP3a_MLH1-v2 404 forward 263 TCGCCCAAAGTCACAGTAAG 37109303 37109322 reverse 264 GATCTGTAGGCCCAGGATTTC 37112356 37112376 cPP4a_MLH1-v2 405 forward 265 AGGGGTTTCTATGGCTGGTC 37112314 37112333 reverse 266 CCTCCCTCAAACCTCCTCTC 37114423 37114442 cPP5a_MLH1-v2 406 forward 267 TTCTCCTGCAGAGGAAGAGG 37114369 37114388 reverse 268 TTGGAATTTGTCCTGGTGTG 37117519 37117538 cPP6a_MLH1-v2 407 forward 269 AAAGCCAGGGAGTGAATGG 37117566 37117584 reverse 270 ATGTGCATCTCCCTGGTGAC 37120703 37120722 cPP7a_MLH1-v2 408 forward 271 TGTGGGGAAATCAAAACCTG 37120784 37120803 reverse 272 GGGTAGACTGTGCGTGTGTG 37123930 37123949
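In the MSH2-v1 section of Table 1, the "start" of the forward primer and the "end" of the reverse primer delimit the amplified fragment, so the expected fragment length is end - start + 1 (inclusive coordinates). The Python sketch below, given for illustration only, applies this to five MSH2-v1 fragments; the function name `amplicon_length` is hypothetical, and the inclusive-coordinate convention is an assumption.

```python
# (fragment name, forward-primer start, reverse-primer end), copied from
# Table 1, section MSH2-v1.
FRAGMENTS = [
    ("P1b_MSH2-v1", 47598716, 47599497),
    ("P1c_MSH2-v1", 47600433, 47601690),
    ("P1d_MSH2-v1", 47602097, 47602761),
    ("P1e_MSH2-v1", 47603695, 47604343),
    ("P1f_MSH2-v1", 47605735, 47606984),
]

def amplicon_length(start, end):
    """Length in bp of a fragment spanning start..end, both ends inclusive."""
    return end - start + 1

lengths = {name: amplicon_length(start, end) for name, start, end in FRAGMENTS}
for name, length in lengths.items():
    print(f"{name}: {length} bp")
```

For these five fragments the computed values agree with the lengths given in the sequence listing for SEQ ID NO:274 to 278, which supports the inclusive-coordinate reading.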

TABLE-US-00002 TABLE 2
                      MLH1-v2   MLH1-v1   MLH1      MSH2-v2   MSH2-v1   MSH2
                      probe     probe     region    probe     probe     region
sum length (bp)       86366     55582     121536    106534    73609     171394
repeat length (bp)    44684     18525     64712     53243     22133     94584
total repeat (%)      51.74     33.33     53.25     49.98     30.07     55.19
SINE (%)              24.93     2.58      23.85     34.68     5.03      35.95
ALUs (%)              22.38     0.09      21.85     32.85     0.76      34.15
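The "total repeat" percentages of Table 2 follow from the two length rows: total repeat (%) = 100 x repeat length / sum length. The following Python sketch, offered as an illustration, recomputes them from the values of Table 2; the dictionary layout and the function name `repeat_percent` are assumptions made for the example.

```python
# (sum length in bp, repeat length in bp) for each column of Table 2.
TABLE_2 = {
    "MLH1-v2 probe": (86366, 44684),
    "MLH1-v1 probe": (55582, 18525),
    "MLH1 region": (121536, 64712),
    "MSH2-v2 probe": (106534, 53243),
    "MSH2-v1 probe": (73609, 22133),
    "MSH2 region": (171394, 94584),
}

def repeat_percent(sum_length, repeat_length):
    """Share of repeated sequence, in percent, rounded to two decimals."""
    return round(100.0 * repeat_length / sum_length, 2)

percentages = {name: repeat_percent(*values) for name, values in TABLE_2.items()}
for name, pct in percentages.items():
    print(f"{name}: {pct:.2f} %")
```

The recomputed values match the "total repeat" row of Table 2 for all six columns.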

REFERENCES

[0173] 1. "Gene copy number variation and common human disease", Fanciulli, et al., Clinical Genetics, 2010, 77, 201-213.
[0174] 2. "Dynamic molecular combing: stretching the whole human genome for high-resolution studies", Michalet, et al., Science, 1997, 277, 1518-1523; and "Bar code screening on combed DNA for large rearrangements of the BRCA1 and BRCA2 genes in French breast cancer families", Gad, et al., J. Medical Genetics, 2002, 39, 817-821.
[0175] 3. "Sequence-based design of single-copy genomic DNA probes for fluorescence in situ hybridization", Rogan, et al., Genome Res., 2001, 11, 1086-94.
[0176] 4. "A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis", Erik L. L. Sonnhammer and Richard Durbin, Gene, 1995, 167, GC1-10.
[0177] 5. "Microsatellite instability, mismatch repair deficiency and genetic defects in human cancer cell lines", Boyer J. C., et al., Cancer Research, 1995, 55, 6063-6070.
[0178] 6. "Primer3Plus, an enhanced web interface to Primer3", Untergasser A., et al., Nucleic Acids Research, 2007, 35, W71-W74.

Sequence CWU 1

1

408120DNAHomo sapiens 1ttcttcccaa gagagccaag 20220DNAHomo sapiens 2ctgttttgga accccaagtc 20320DNAHomo sapiens 3ggcttcaatc tgggactacg 20420DNAHomo sapiens 4gctgtcaccg cctcttttac 20520DNAHomo sapiens 5gccaggcact taggcagtag 20620DNAHomo sapiens 6ttggtcctga catcctttcc 20722DNAHomo sapiens 7ttagttgaac agggcatgac ac 22819DNAHomo sapiens 8ggtaaagggg cctgatgtc 19923DNAHomo sapiens 9gagccttgat gttccctctt aac 231020DNAHomo sapiens 10acccagatcc gaaactgttg 201120DNAHomo sapiens 11ccggccttac ctttcatttc 201220DNAHomo sapiens 12ccaggatcca gatccagttg 201320DNAHomo sapiens 13gagttccatg gcagatcacc 201422DNAHomo sapiens 14gcagctttca atcacaaatc ag 221520DNAHomo sapiens 15gaagggttgg tcttgctgtc 201620DNAHomo sapiens 16accctttgca cctctctgtg 201719DNAHomo sapiens 17cccggtgttg aatcatttg 191820DNAHomo sapiens 18ttcagccctg aaggtagagg 201920DNAHomo sapiens 19ctggccactt tttggaagag 202020DNAHomo sapiens 20tgggacgcag agtgatacag 202120DNAHomo sapiens 21ttactggcga tcctcagagc 202220DNAHomo sapiens 22aacgcctctt ccgttgtatg 202321DNAHomo sapiens 23gaaaggacag accaagtgca g 212419DNAHomo sapiens 24agcctgtgca gggaaactc 192520DNAHomo sapiens 25agtgggatgc agctgaaaag 202620DNAHomo sapiens 26caacagcatg ggaaagatcc 202724DNAHomo sapiens 27ttgaaagttg gtcttaggaa gagg 242820DNAHomo sapiens 28cccaacaaac ctggctttag 202920DNAHomo sapiens 29agacgcccaa aatcaacaac 203020DNAHomo sapiens 30ccgcttgctg ctaaaaattg 203121DNAHomo sapiens 31tgattgccaa ggaagattca c 213221DNAHomo sapiens 32tggaagtaaa tgcaggtgct c 213320DNAHomo sapiens 33tcattcttgg gtgtttctcg 203420DNAHomo sapiens 34atggcggttt tgtggaatag 203520DNAHomo sapiens 35gagggagagg gaaccttttg 203620DNAHomo sapiens 36ggggactata ccgcattcac 203720DNAHomo sapiens 37tgttgattca tgggcatttg 203821DNAHomo sapiens 38gctggggaat catgtatgaa g 213920DNAHomo sapiens 39catcaagcac agttccattg 204020DNAHomo sapiens 40ttctctttcc gtttccagtg 204120DNAHomo sapiens 41ggagcttggg aattcaactg 204220DNAHomo sapiens 42agaaacgggc atgtcatagg 204319DNAHomo sapiens 43cagcctacgt gcccatttc 194420DNAHomo 
sapiens 44tcaaaagatg gccaaaatgc 204520DNAHomo sapiens 45gtgttgcacc cattaactcg 204620DNAHomo sapiens 46agcctggtga gaggtgactg 204719DNAHomo sapiens 47cacgatgcca gtccaattc 194822DNAHomo sapiens 48aaggtggact ttaatgcaaa gg 224920DNAHomo sapiens 49ggagtgagag cgacaccttg 205021DNAHomo sapiens 50cgacagctga ctgctctatg g 215121DNAHomo sapiens 51cacaatggga aaggatgtag c 215221DNAHomo sapiens 52cagagaaaaa cacccatgac c 215321DNAHomo sapiens 53caccgtgatc ctccttattt c 215421DNAHomo sapiens 54gaacaaacaa cggatgaaag g 215520DNAHomo sapiens 55gtggcatatc cttcccaatg 205621DNAHomo sapiens 56cccccagact gtgaattaag g 215720DNAHomo sapiens 57gatgcagatc agggaaatgc 205820DNAHomo sapiens 58atcttgctgg atggacaagg 205921DNAHomo sapiens 59cttaatcctg aaaggcaggt g 216020DNAHomo sapiens 60tgtttctcag gcaaccacag 206120DNAHomo sapiens 61gaaaccacag aatcgccttc 206220DNAHomo sapiens 62acctggacag tcccacagac 206320DNAHomo sapiens 63cagtgctttt gcatccttcc 206420DNAHomo sapiens 64atttaatccc ctggccaatc 206520DNAHomo sapiens 65cacctgtgcc catcacatag 206620DNAHomo sapiens 66gagtcccctc ttggagaacc 206719DNAHomo sapiens 67aaagccattt ccagtgtcg 196820DNAHomo sapiens 68attgtgcagc cagaattgag 206920DNAHomo sapiens 69ttcacagcaa agtggctcag 207020DNAHomo sapiens 70gctattatgg gctgcaaagc 207120DNAHomo sapiens 71ttcactccca acaagcactg 207219DNAHomo sapiens 72tgcccagtcc tttttcact 197320DNAHomo sapiens 73aatccctcct gcacactttc 207420DNAHomo sapiens 74aatggatgct tccactgtcc 207520DNAHomo sapiens 75ccatctgtgc aattccttcc 207620DNAHomo sapiens 76gttcaaaggc agaagccatc 207724DNAHomo sapiens 77gtctggattc tttcacaatg tagc 247821DNAHomo sapiens 78tgccaatctt ctcctctgtt c 217920DNAHomo sapiens 79aaccacccaa tgtgttcacc 208020DNAHomo sapiens 80gttcattcct gcgagtaggc 208120DNAHomo sapiens 81gccaaaggtg gaaaatgttg 208221DNAHomo sapiens 82gccttcttca tgaaagcact g 218320DNAHomo sapiens 83ccagaaggtg gaagctacag 208418DNAHomo sapiens 84tggggtcaat gaagcaag 188520DNAHomo sapiens 85acatcgaccc agaaagttcc 208620DNAHomo sapiens 86aatgtgcttc gtaccactgc 208720DNAHomo 
sapiens 87agcgtgccat tgtactctcc 208820DNAHomo sapiens 88tttctgagcc catgatttcc 208920DNAHomo sapiens 89gtgcccagct agttccattc 209020DNAHomo sapiens 90tcaagagcgc taatcccatc 209121DNAHomo sapiens 91tgcacatgct cactgaaaga c 219219DNAHomo sapiens 92ttttgcctgc aaactgacc 199320DNAHomo sapiens 93cagcaagcac caaatcactg 209420DNAHomo sapiens 94agtaccagcc gtccaaactg 209520DNAHomo sapiens 95cctggccaga aaattcattg 209620DNAHomo sapiens 96accctgcatt ccaaactcac 209721DNAHomo sapiens 97gcagtccttt gaggatttag c 219824DNAHomo sapiens 98gaaagatatc caacaggaag tgag 249920DNAHomo sapiens 99tggccttgtt taaggtcctg 2010020DNAHomo sapiens 100atggtcctgc tgcttcagag 2010120DNAHomo sapiens 101accccgtcat agcacagttc 2010221DNAHomo sapiens 102caaaggccat tcatcagttt c 2110321DNAHomo sapiens 103gtggcgtgat atccttgatt c 2110420DNAHomo sapiens 104ctctggaatg actgctgctg 2010520DNAHomo sapiens 105tgtgctagat gcctcactgg 2010620DNAHomo sapiens 106ttgccaagaa gcacaacaag 2010720DNAHomo sapiens 107cggaggctct actgttggac 2010820DNAHomo sapiens 108tgctgtccac tctggaactg 2010920DNAHomo sapiens 109acatcagaag ccctggtttg 2011020DNAHomo sapiens 110gctgggagtt caagcatctc 2011120DNAHomo sapiens 111tcggtctcag tcaccatttg 2011220DNAHomo sapiens 112aacgcacctg gctgaaatac 2011322DNAHomo sapiens 113tgaacctgca atatctcaga gg 2211423DNAHomo sapiens 114cttaccgata acctgagaac acc 2311522DNAHomo sapiens 115cccagcccat atattttaaa gc 2211621DNAHomo sapiens 116ccagccactc tctggactat c 2111719DNAHomo sapiens 117gacatggaga gccgaatcc 1911822DNAHomo sapiens 118ccattaaaat cgggtctgaa ag 2211919DNAHomo sapiens 119tccagaccca gtgcacatc 1912020DNAHomo sapiens 120catggtcagt gccatcagag 2012120DNAHomo sapiens 121agcctcccaa agttaagtgc 2012220DNAHomo sapiens 122cccagctaaa accaacacac 2012320DNAHomo sapiens 123tgccctcagc tactcactcc 2012420DNAHomo sapiens 124agggctcagc ctttaggaac 2012520DNAHomo sapiens 125gccagactct cgttccattc 2012620DNAHomo sapiens 126actccccatt cagtcccttc 2012720DNAHomo sapiens 127aggcacaacg tcaggttttc 2012820DNAHomo sapiens 128ttggaatttg 
tcctggtgtg 2012920DNAHomo sapiens 129caccattgcc aacacttctg 2013020DNAHomo sapiens 130gccattggtt tgaaggtgac 2013120DNAHomo sapiens 131cttagtcacc gcctgtcctc 2013220DNAHomo sapiens 132tagctgcatg tggctaatcg 2013320DNAHomo sapiens 133tgtggctcgc attacatttc 2013420DNAHomo sapiens 134cgctgtcatt acctgctttg 2013520DNAHomo sapiens 135tgacctccaa aatcatccag 2013620DNAHomo sapiens 136ttctgagcta ggaggtgctg 2013722DNAHomo sapiens 137ccagatttgt aaatccctgt tc 2213821DNAHomo sapiens 138tgtgtggttc ttaagcattc c 2113920DNAHomo sapiens 139ctcagtccat cagcctcctc 2014020DNAHomo sapiens 140tgctgtgccc tgagattaag 2014121DNAHomo sapiens 141aacttaatct cagggcacag c 2114218DNAHomo sapiens 142tgcagcttca gcctcttg 1814319DNAHomo sapiens 143gcgtggtgtt tcgtaccag 1914421DNAHomo sapiens 144gctactggcc agaaatcttc c 2114520DNAHomo sapiens 145gcccagccct actaaggaag 2014620DNAHomo sapiens 146ctgtgctccc ctgctagaac 2014720DNAHomo sapiens 147gtcgtcctct tcgacctagc 2014820DNAHomo sapiens 148cagcgcctat tctacagcag 2014920DNAHomo sapiens 149ttcttcccaa gagagccaag 2015020DNAHomo sapiens 150ccacctttaa tctgcccaac 2015121DNAHomo sapiens 151gtgttgggca gattaaaggt g 2115219DNAHomo sapiens 152gcagtgtcat gccctgttc 1915321DNAHomo sapiens 153ctctttgtgc cctttctttt g 2115424DNAHomo sapiens 154agttccttaa agcagagaag atgg 2415520DNAHomo sapiens 155aacctgtccc tgtggatgag 2015620DNAHomo sapiens 156ccgaagcatc cttacattcc 2015720DNAHomo sapiens 157aatacctgaa cccccaaacc 2015823DNAHomo sapiens 158ctcaggctat tttccagatt cac 2315918DNAHomo sapiens 159gcatgcctgt cattctgg 1816020DNAHomo sapiens 160tccaagggac tgaaacacac 2016122DNAHomo sapiens 161ttagtgtgtt tcagtccctt gg 2216220DNAHomo sapiens 162gacagcaaga ccaacccttc 2016320DNAHomo sapiens 163gcacattacg agctcagtgc 2016421DNAHomo sapiens 164ctaccaggag aacagcacag g 2116522DNAHomo sapiens 165tgggttagca ttgtgttagg tg 2216620DNAHomo sapiens 166ccacaggtgt gtgccaatag 2016720DNAHomo sapiens 167aagttgcagt ttggctggtc 2016821DNAHomo sapiens 168ttatctccag cggtgcttat g 2116920DNAHomo sapiens 169taccataagc 
accgctggag 2017020DNAHomo sapiens 170actccaccaa gcccagtctc 2017120DNAHomo sapiens 171tttagagact gggcttggtg 2017219DNAHomo sapiens 172ctcttcccca acaaacctg 1917320DNAHomo sapiens 173cccagtttca agcgattaag 2017422DNAHomo sapiens 174aggaaaagca tgttatctcc ag 2217520DNAHomo sapiens 175ttccgtagca gtaggcatcc 2017622DNAHomo sapiens 176tcaccaccac caactttatg ag 2217721DNAHomo sapiens 177tcccagatct taaccgactt g 2117820DNAHomo sapiens 178atggcggttt tgtggaatag 2017920DNAHomo sapiens 179cccaaacaac agcattagcc 2018019DNAHomo sapiens 180acatcagcct cgggacaag 1918121DNAHomo sapiens 181tgagcccgtt gaatatagtg g 2118222DNAHomo sapiens 182agttttccta aacgggatga tg 2218320DNAHomo sapiens 183atgggtgtgc acgtgtgtag 2018420DNAHomo sapiens 184gccatgtgca attgtgagtc 2018521DNAHomo sapiens 185ccttgcatag tttgcttctg g 2118620DNAHomo sapiens 186atcatacaag ggcctgttgg 2018720DNAHomo sapiens 187aaacagaaat cgcccaacag 2018820DNAHomo sapiens 188tagagaccca cccagaaacg 2018920DNAHomo sapiens 189cagtccgatt tcgtttctgg

2019021DNAHomo sapiens 190cacacctaga tttggcaatg g 2119121DNAHomo sapiens 191ttccattgcc aaatctaggt g 2119220DNAHomo sapiens 192ggccctagtg tttcctttcc 2019323DNAHomo sapiens 193aaggaaacac tagggcctac aac 2319421DNAHomo sapiens 194cctggcctca gtacactttt g 2119520DNAHomo sapiens 195agggattctc cccacttagc 2019620DNAHomo sapiens 196attggaggac tggctcaaag 2019720DNAHomo sapiens 197gcttaccttt gagccagtcc 2019820DNAHomo sapiens 198acatgttcct acccccagac 2019920DNAHomo sapiens 199tttctgcatc agttggttgc 2020021DNAHomo sapiens 200gccaagttat tgctgcttca g 2120120DNAHomo sapiens 201agccctgtga ggttggtaac 2020220DNAHomo sapiens 202tcaacaacag ctggaactgc 2020320DNAHomo sapiens 203cctctcaggt caggcttctg 2020420DNAHomo sapiens 204gctcccgcta gagaaactcc 2020520DNAHomo sapiens 205gagcgaagca cctaaagcac 2020620DNAHomo sapiens 206aattggaggg ggtggagtag 2020720DNAHomo sapiens 207tgtcacccag tcaggtcatc 2020820DNAHomo sapiens 208ttggaaggaa tccaacaagg 2020920DNAHomo sapiens 209ttcccagaac tccttgttgg 2021020DNAHomo sapiens 210tgcaaacccc ttcttttcag 2021120DNAHomo sapiens 211accccatgca gaagcaatag 2021220DNAHomo sapiens 212aaatcctgaa ggtgggttcc 2021319DNAHomo sapiens 213agtttcagcc atgttgcag 1921420DNAHomo sapiens 214ttggcaaaat tgtgactgag 2021520DNAHomo sapiens 215cagtcacaat tttgccaagg 2021622DNAHomo sapiens 216agttcgtggc atctaactat cg 2221720DNAHomo sapiens 217ggtccatgtg ctccaaaaag 2021820DNAHomo sapiens 218tccaaaactg ggaacaaacc 2021920DNAHomo sapiens 219tggtttgttc ccagttttgg 2022020DNAHomo sapiens 220tagtgcacca cagcctcaag 2022120DNAHomo sapiens 221ggatcacttg aggctgtggt 2022220DNAHomo sapiens 222tccaacaact gctgtgaagg 2022320DNAHomo sapiens 223caccactgac cttcccttcc 2022425DNAHomo sapiens 224gcacagaaag acaaatatca catgc 2522520DNAHomo sapiens 225ctcttcctcg tctcctcctg 2022620DNAHomo sapiens 226ccaattcaat gcaaaacctg 2022720DNAHomo sapiens 227cgagcagctc tctcttcagg 2022823DNAHomo sapiens 228agcctataag cacagaccaa ctg 2322922DNAHomo sapiens 229ttctctagca gttggtctgt gc 2223020DNAHomo sapiens 230accctgcatt ccaaactcac 
2023120DNAHomo sapiens 231gttcattttg gggcatgttc 2023221DNAHomo sapiens 232ctgcaacctc ctttgagaca g 2123320DNAHomo sapiens 233tgtctcaaag gaggttgcag 2023420DNAHomo sapiens 234ccaaaatgaa actgccttcc 2023520DNAHomo sapiens 235agttccctgg gtcattttcc 2023620DNAHomo sapiens 236ttgtgggaag gcaaactagc 2023720DNAHomo sapiens 237cctgtgctag tttgccttcc 2023820DNAHomo sapiens 238ggtggtcacc gtggtaaaag 2023921DNAHomo sapiens 239gaccaccatg tgatttccaa g 2124020DNAHomo sapiens 240ttggttggcg gttatttctc 2024120DNAHomo sapiens 241taaccgccaa ccaagaaaag 2024220DNAHomo sapiens 242tgtctggaga ccttcccaag 2024320DNAHomo sapiens 243tgtgctagat gcctcactgg 2024420DNAHomo sapiens 244acttgcctac attgcccatc 2024520DNAHomo sapiens 245atgggcaatg taggcaagtc 2024621DNAHomo sapiens 246tctgcagcca tgaataagtc c 2124721DNAHomo sapiens 247cagagctgag gcgataaatt g 2124820DNAHomo sapiens 248tgctcctctc caatccattc 2024920DNAHomo sapiens 249atactttccc agcccaaacc 2025020DNAHomo sapiens 250tgatggggaa atgagaggag 2025120DNAHomo sapiens 251agtggccttt gtccattgag 2025222DNAHomo sapiens 252gacagaggtg agagcctagg ag 2225320DNAHomo sapiens 253aatgtgttgg ggaagtggtc 2025420DNAHomo sapiens 254tttggaccac ggctttagac 2025520DNAHomo sapiens 255aagctgaggt cacggatttg 2025620DNAHomo sapiens 256gatgggcaag tttcatctcc 2025720DNAHomo sapiens 257tgggacgaag aaaaggaatg 2025820DNAHomo sapiens 258caccgtgcct cagcctatac 2025920DNAHomo sapiens 259ggactaaccc acctcccttc 2026020DNAHomo sapiens 260gctataggca gcccagagtg 2026120DNAHomo sapiens 261gccagactct cgttccattc 2026220DNAHomo sapiens 262aggatttgcc gtatggactc 2026320DNAHomo sapiens 263tcgcccaaag tcacagtaag 2026421DNAHomo sapiens 264gatctgtagg cccaggattt c 2126520DNAHomo sapiens 265aggggtttct atggctggtc 2026620DNAHomo sapiens 266cctccctcaa acctcctctc 2026720DNAHomo sapiens 267ttctcctgca gaggaagagg 2026820DNAHomo sapiens 268ttggaatttg tcctggtgtg 2026919DNAHomo sapiens 269aaagccaggg agtgaatgg 1927020DNAHomo sapiens 270atgtgcatct ccctggtgac 2027120DNAHomo sapiens 271tgtggggaaa tcaaaacctg 2027220DNAHomo 
sapiens 272gggtagactg tgcgtgtgtg 202731133DNAHomo sapiens 273ttcttcccaa gagagccaag atttcttctt tcctcttctt tctttttttt ttctttctaa 60tttcaaagga gtataattaa attgccaggt aaaagctcaa aggtcttttt tatagtgttc 120tggaaggttc tctgcctgtg tttgtatttc ctttagcctc cacgttcctc tatccagttc 180ccgcaccctt ccccccaggc cccattcttc aaggcttcag agcagcgctc ctccggttaa 240aaggaagtct cagcacagaa tcttcaaacc tcctcggagg ccaccaaaga tccctaacgc 300cgccatggag acgaagcacc tggggcgggg cggagcgggg cgcgcgggcc cacacctgtg 360gagagggccg cgccccaact gcagcgccgg ggctggggga ggggagccta ctcactcccc 420caactcccgg gcggtgactc atcaacgagc accagcggcc agaggtgagc agtcccggga 480aggggccgag aggcggggcc gccaggtcgg gcaggtgtgc gctccgcccc gccgcgcgca 540cagagcgcta gtccttcggc gagcgagcac cttcgacgcg gtccggggac cccctcgtcg 600ctgtcctccc gacgcggacc cgcgtgcccc aggcctcgcg ctgcccggcc ggctcctcgt 660gtcccactcc cggcgcacgc cctcccgcga gtcccgggcc cctcccgcgc ccctcttctc 720ggcgcgcgcg cagcatggcg cccccgcagg tcctcgcgtt cgggcttctg cttgccgcgg 780cgacggcgac ttttgccgca gctcaggaag gtgaggcgcg gattggagca gagttgtgga 840gctgggctgg gctggggggc agcggccccc ggccctcggc ccccgaaacg ggcataatag 900ggaggggacc aagaggccgc gctttccagc gtggagaccg gacggtgcgg ccgtgctccg 960gctcaggccc tccgcgcggt aggaaacggc gagggccgtc ccggggagca gcctcacttc 1020gcagctttgc tcgccttggt agggaaatgg ccttgggcgg aggcggggga caggcaggga 1080acggagtggc cacgtccagg tttcctgcgg ccaccgaacc ggtgcctcgc gcc 1133274782DNAHomo sapiens 274ggcttcaatc tgggactacg tacttaatgt taaattgctt taaagtggtc atagctgcta 60caggtttgtg ctcagaaagt ctgcacctga ctggtctgat ttaaatttta cgccccttag 120gtatgaacag tgtgttttaa acaagtacag gatggggctg cagaagattt aaacgcttga 180gaacaagtgc tgtattttcc ccttttgtga ccccagtatt gagtttagtg ttgggcagat 240taaaggtggt tcatatcgac tataacttga acagggaaaa attgaaatca acttagggta 300cttgggatac gaaggatcaa tataaaaact ctggtttgtc atgctagctt tttctttttt 360ttcctcttca gttgaactga ggagatagtt tttgttttta atgattgtgc tcttttaact 420agacaaaagg aattagatag tcttgcctat tcgaagttaa atgaactttt gaggttgtta 480aggacaaaac tattaaactg acatcaataa tacagaatgg gctgcttagt 
atcactttcc 540ttatcaggta ctaggattta atttagttag gaaactcact taaagggagg actataactg 600cagttgaaag tgtaattttt ccaagatata aaattgttta aagattgaat atattcctgt 660taagccccaa aggaaacatc cctcatttaa gaaaatgggg tgggagagca agagaaggtg 720aggattcaca gatcctagaa ttggaatagt tgattttttt ttgtaaaaga ggcggtgaca 780gc 7822751258DNAHomo sapiens 275gccaggcact taggcagtag tctatagctg aaaaataaaa cattcagaac cactttttaa 60ggttttgtgt ccttgtaact ttaggcatta ttattacaat ataacttagc tgggacatga 120gagttaatag atccacattt taaagtagat ttttttttta attttctaga atgtgtctgt 180gaaaactaca agctggccgt aaactgcttt gtgaataata atcgtcaatg ccagtgtact 240tcagttggtg cacaaaatac tgtcatttgc tcaaagcgtg agtaaaatat cctaattacc 300tgtaagcttt attttgactt aatacttctt taattgatgt gccttgagtt ggaaagagtt 360ttattggctt aaatctgaat catgttacaa agtaagtgtg ggaacacata aatttcaaat 420aatctttgac cctggaactt tagagttaat tttttttttc ccgtaatcat gaaatcagtt 480atttttcagt ttggcattaa ggtttctttt tcagtggctg ccaaatgttt ggtgatgaag 540gcagaaatga atggctcaaa acttgggaga agagcaaaac ctgaaggggc cctccagaac 600aatgatgggc tttatgatcc tgactgcgat gagagcgggc tctttaaggc caagcagtgc 660aacggcacct ccatgtgctg gtgtgtgaac actgctgggg tcagaagaac agacaaggac 720actgaaataa cctgctctga gcgagtgaga acctagtgag tggggctgcc tatactactt 780gttttcatgc tgttcagatt catttaatta aatttatttt tgattatgta atatgatttc 840atggtttaga attcagaaga tatgagtgtc cagtgaaaag cttccttctc attccagtcc 900ccctcgctac ccattggacc tccacagaat tgatgttatt gattattcta taaccttcca 960gagatagttg atgaatttgt tatatatctg ttttattatt tttacataaa tgatagcata 1020ctaggtataa tttttctttt atatctttac ttaacattat tcagtatttc attgttgcat 1080tagtagtaaa tgtatgtaat ttaacctatg tatttgctta ttgattgtgt tttaaaagtg 1140agatatgctt gttttaggga ttgtttaatg aaaaggcaca gaaacccact caagctagct 1200taagcaaaaa aagacttcat tggaagggac tagaaactgg aaaggatgtc aggaccaa 1258276665DNAHomo sapiens 276ttagttgaac agggcatgac actgccagct aaactttgac ttaatgtgac tttatgtatt 60gtgtccagag aacagagggt caatattaga aaaggtgttc cctcctgggt gtgtccttta 120tgaaggatgt gtaagggaag aaattatagg aatagctact gcataaattt 
tttttctctt 180agtccttata attcgagaat tttaggatta gcttattagg aaaatagtat ggaagactga 240gttatagtca actgacattg tctttttact ttatagctgg atcatcattg aactaaaaca 300caaagcaaga gaaaaacctt atgatagtaa aagtttgcgg acgtaagtgc aattaaatgc 360atcatattct tgcacagttg gtggctcaaa tcttccatcc tacaccatta gaaaaagcaa 420gtctaaatgc ttttttatat ttctgaaaaa taaagttact tgaaatagag ttgcaagaat 480agcacagaga ttctgggaat acacttcact cagattcacc aattaacatt ttggcacatt 540tgctttttat atgtgtatgt gtggatgaat atgtgtgtgt gctttacatc agtgtatcta 600tgcatgtata aatatttttc ccagaagcac atgagagcaa gttgtagaca tcaggcccct 660ttacc 665277649DNAHomo sapiens 277gagccttgat gttccctctt aactaaaagc aggttatgca tttttgacag gaaaactact 60taagcgatct tgtgtccttt ataatacttc acattaggag ttgcatgatg tcagcttgtc 120cctttactag taaagtaaac tttggttaaa gtggtatcca ccaggttttt ccactgtgaa 180gttaccattc tccctttgta atccataaat aatctatggg cagatacttg gatactaagt 240aaatgttctt tttctaatta aactggtacc cagcagtttg aatatcaatg gatgattcca 300gcctgaatca attattatta tgatagttgc aaaatggcag aaaaatttta actttaatga 360cagttttaga ccctgagctg tctgcttaaa gagtagtgct tcttactgtt gtgtggtaca 420aacatttttt tttaatacag attttaaatt ctttacagtg cacttcagaa ggagatcaca 480acgcgttatc aactggatcc aaaatttatc acgagtattt tggtatgatt ttttaataag 540tgagctttag cagacagttg gtgagacagt atgttttgag tataaggaca gccagtgatt 600taagtggtgg ttaaatgcac ttactggagc aacagtttcg gatctgggt 6492781250DNAHomo sapiens 278ccggccttac ctttcatttc tttagtaatt tagttttaaa gtagttctaa tccaaataaa 60atactttcat atcttattta aaaatctttt caatataaga aaatcctctt aggaaaaatt 120gtacattgta attatgtttg gttgcatggc tgtcttattt ccctttgata gatttagaga 180cctcccaaag atttcttgat tagtgataaa cttagttatc cactaatgga aaggaacagt 240gatgcatgta gattatagaa aatcaaacac tgaatattct gattctcaat taatgttatt 300ttcaaatgat tttgattata ttagtattaa tttgtattat tcaatttttt tccccagtat 360gagaataatg ttatcactat tgatctggtt caaaattctt ctcaaaaaac tcagaatgat 420gtggacatag ctgatgtggc ttattatttt gaaaaagatg tgagtatcat cttctttatt 480cctgtgttca ggaatgtagt ctatcatgcc tcaatgaatt aaatatattt catcaccttt 
540ttatccactt acagatcaac caaatggttc gctgctgccg ttaattttgt cctccctgtc 600actcacatgc atcttgcttg tttgtatatt tatgcctctt atcaaattgt tctgcctaaa 660atatctcccc tctttcttat aattcttatt tattatctac ttggtggtta cttagtttgt 720gcatatatgc tcccctatga tatttataat ttacacaaat aaaagtctgt taaaaaagac 780tgtaactgat atgattaaaa tattttgttg aaactttaat atattatagt gaggtatttt 840ctgctgaaat atgaggtttg cttcaaaata atctgggcgg gggtgaaagg atgaaaggaa 900gaaaagatga agtaagagag gctatgtgtt gttggccttg catctgggtg ataggtacat 960gggcatcatt gcactactct ttctactttc gtgtatgttg aaaggttcct gtaataaaca 1020gttttttaaa gttccaataa attagattgt tatcactaaa accataaaga ttcttggcag 1080cggttctttt ggcatacaat ttgtatgtaa ttatatgtgg ccatggttgg tttccttaaa 1140tatttttaat tccttttctc cttttcaata caggttaaag gtgaatcctt gtttcattct 1200aagaaaatgg acctgacagt aaatggggaa caactggatc tggatcctgg 12502791568DNAHomo sapiens 279gagttccatg gcagatcacc atctgttttc tgcctcatag aagagtggaa tgggaagcct 60atggttttta ttctacaaag agtcaacatc taacagaatc ttctgaaggc atactccagt 120ggattcacct tggagaaact cattgtgact gatgatctga tttattatct ctatgccagt 180gaaataatca tttaatatga acttaatttg tcataatcta ttgtgtacta actagtctat 240actagtgtga catcaaagtg tcagattgtt agtgtgtttc agtcccttgg aattgaatat 300gaacacttat ccttgaaccc tatcaataac atttttcaca tatctcaatt tttgtgtgtc 360tttgtagttg tatgtgggcc acttactaat attttagcaa gtaataaaaa tagaaacgta 420aaggaatatt ggaaaaagtc taatggaacc agaaagttct agcatttttt tcccattctg 480tagtaggtca tctggtttat ttggtttggt gaccgcaagt ctagaagact aaccctgaat 540tgaatggtaa cagacaggca gaatgacaat gtagtgttgc agtgcagagc agtacagacc 600tgggtttggc tgggcaaaat tatataactt ctttaagcct ccatgtttcc tcatctgtaa 660aatgaggata atagatagta tggacctgtt gcaaggatta aacataatca gtgtaaagtg 720ttggtcccat gcttgccaca taagaaaata tttgtcaaca gagtggtagt tgtcattatc 780attgtctcag tttgcctgta actagttgtg tgatctgaga caaacactaa ttttgaactt 840gagtttcccc acatgtaaaa tgaaagattg ataatagaaa gtaaatcaat tttttctagc 900attaaaaata gtatgcattt aataaaaatc ttattcttaa tgatctagct tacctccaac 960ttgccctagt cactttggcg atcttgtctc taaatagaac 
cttgaaaaca cttaaatgtg 1020tgtttccttg caatataact ttttcttttt ttatttaaat aagtcttata aatgtgggaa 1080aaaattatct tgtgttcctt taatttcatt tttatttaat actattttca gaatgaacaa 1140aagattgaaa aattatttag aatttttttc tgtgcttttt cctgtttcag ataaaggaga 1200tgggtgagat gcatagggaa ctcaatgcat aactatataa tttgaagatt atagaagaag 1260ggaaatagca aatggacaca aattacaaat gtgtgtgcgt gggacgaaga catctttgaa 1320ggtcatgagt ttgttagttt aacatcatat atttgtaata gtgaaacctg tactcaaaat 1380ataagcagct tgaaactggc tttaccaatc ttgaaatttg accacaagtg tcttatatat 1440gcagatctaa tgtaaaatcc agaacttgga ctccatcgtt aaaattattt atgtgtaaca 1500ttcaaatgtg tgcattaaat atgcttccac agtaaaatct gaaaaactga tttgtgattg 1560aaagctgc 1568280537DNAHomo sapiens 280gaagggttgg tcttgctgtc tctgggtcgc agaggcagaa gaggtggagg aggtagaagg 60gaggcaggtg cacactgggt gtaactttta ttgaaaaaaa atgtgttcaa atatacccgc 120acaattcaaa cccatgttca gggtcaattg tagttgtgac agacccaatg acccacagag 180tctaaaatgt ttattggaaa tgtttgctga cccctgctct aggatgctgg gggaaagcta 240ttcctaggta ggtgtctcag cagacatgga aagcagccta taatattgcc ccagcaggtg 300gggtatggaa cagatgctca gggaggctgc tggctgctgt ccactgcagg cccagaagcg 360tcctggagaa gccaccccat gctgcaagag ccagatcatg gagcagccct ggatgctgca 420ggagcctgtc aagccaggac accagagcta ggaaacaaaa ccttcctttt cagtgcctct 480ctagcaccct ctgctgacag agcttcagat ccattttcac agagaggtgc aaagggt 5372811454DNAHomo sapiens 281cccggtgttg aatcatttgc tttctctgca gcctgtcacc aggatgactt ccccattctc 60taccccactg tgttttcttt tttttccttt ggcttcactg gtatgaatct ctcctgattt 120tttccctctt ctactgattg
tgtaatactg atgttcccag gatttgtcct tagttctatt 180tttcctccct ttttcttcct ggaggatttc atccactttc ttgcctttat taccttctgt 240gaggctcaat gagaacagaa gcaccatctc catttctgtt cttttttcct gagatctata 300gtaaagtatg tatatttcac ataactagtt tttaatgatt tgatcatctt catccacaaa 360catattttat ggcttgtatt ctcagaatca attgatggca ttgccatgaa ccaagtccca 420aattccttct cttttacttc ctccattgtt ttagtcacca gagcctgtga tcctccccga 480gaaacaaatg ctcctctaat ctgccttctc ttacacactt tccctggtct ctttataaac 540aatctactcc ctctccagcc caaacctcat attgctccca ggattatttg cctaaagttg 600atcacagcac ttttttacaa cataagatgt acccctttcc cctgccagag gggcatatat 660aaagcctttc agtgtggccc tgtagtgtga accttttctc cttttgtctg tctggcatcc 720catttcctct gcaatttggg aagaattgtc tgcatgtttt gtaagagcca ggtctctacc 780tccctcttta gaagtttaag ggcagagatc ctctcatggc tctcagcatc cagggcacag 840gcacccccac atagtacttg gaatcttgtg atggaaagaa gtaagaacag ttgagaaaac 900acccatgggc agtggtggca tgtgaccaat ggcagttata ggcatgttgc agtggtggca 960tcccagccct tggtcatctc tgccctacat gaccctgtgg ccatccagct gtggttgctc 1020tgcttggcct cccttggctg ctgattcctc tcccctccca ttgccaatac cctgctccag 1080agctttatca catgcctgga ttaatttaat ttccgacttt cccaatacca aattctcccc 1140actctcaatc ctttcccaac gtcattccca gaataactaa aatctagtca cagtctacct 1200tatttctcac tgtacctcca atacatcaaa ttttcctgca tttaagattt cccttcccat 1260tcctccaatg ttccttcttt ccttgcttcc ttccttcctt ccttccttcc tccctccctc 1320cctccctccc tccctatctt cctttcttcc cttctttttt tctttttttc cctggacagg 1380tcttggtctc ttgcccagtc tggagtgcag tggcacaaac acagctcact gcaacctcta 1440ccttcagggc tgaa 1454282530DNAHomo sapiens 282ctggccactt tttggaagag gaaatgtgtg agcaagcagt gtgggaacca gagtgagcga 60atgctggaac tggttggtcg gtcctctctg gcaggagcag gctctgtaca gccctggcag 120cagcatccaa gcccctgccc tcttggcacc cgggttcttg tctggcatcc aggaagaatc 180aggacaaatg aatggattga aggggtagtg tatgtggagg attttactgg gtgatggaaa 240tggccctcag tgggatggca gctagaatgg ggatggtgca ggaggtaccc tccgaagtta 300gctgcgtctg cagtctctga cactcagtag cttctctgct cgctgcccag gcgcttatgt 360tgctctgcca gctgaagtca ttatgggcac 
aggatagggg catggcaggc caaaaaggta 420acattgggca gaaaaatggg gtcagctgtt ttcacttagg gccatggttc caggccttag 480gggtggggtt tggccagtag cccagccatt ctgtatcact ctgcgtccca 5302831992DNAHomo sapiens 283ttactggcga tcctcagagc caagaagagt ctgggacata gcaggccata taaatgtttt 60cgaatgagtg aatcatcaac gagtggatga aacgataatg tggctaacag gcagcagtaa 120ggaggctgtg tagaataaac ccgtaatccc gatgttggca gtttgcttag aaagaaaaag 180ggaggcagtc ggagaggggc acacgtttta acaaaatact gggaggagga ggaaggctag 240tttttttttt gttttcaagt ttccttctga tgttactccc atgcttccgg gcacattacg 300agctcagtgc ctgccggaaa tctcccacct ggtggcaacc tacccttgca tacaccccac 360ccaggggctt caagccttgc agctgagtaa acacagaaag gagctctact aaggatgcgc 420gtctgcgggt ttccgcgcga cctaggcgca ggcatgcgca gtagctaaag tcaccagcgt 480gcgcgggaag ctgggccgcg tctgcttatg attggttgcc gcggcagact cccacccacc 540gaaacgcagc cctggaagct gattgggtgt ggtcgccgtg gccggacgcc gctcggggga 600cgtgggaggg gaggcgggaa acagcttagt gggtgtgggg tcgcgcattt tcttcaacca 660ggaggtgagg aggtttcgac atggcggtgc agccgaagga gacgctgcag ttggagagcg 720cggccgaggt cggcttcgtg cgcttctttc agggcatgcc ggagaagccg accaccacag 780tgcgcctttt cgaccggggc gacttctata cggcgcacgg cgaggacgcg ctgctggccg 840cccgggaggt gttcaagacc cagggggtga tcaagtacat ggggccggca ggtgagggcc 900gggacggcgc gtgctgggga gggacccggg gccttgtggc gcggctcctt tcccgcctca 960gagagtgggc ggtgagcagc ctctccagtg cggaggcacg ggggcggaac gttggtgctt 1020gtgcggattc cgccgtcccc aggttctgct tggctccgga gggacgcccc cctcagccct 1080gaaacccgtg cctctccagc cgccccggat ctgaacttgt gatcacggag tgtttacgtc 1140gtgccaggca ttttaatgca ttgttctagt tcattttcca gcagtcgcat tcctcgcctt 1200ggccctacat gtagcgctca ttacaaacac ggccagaatc tcttattaac aaacagcagc 1260caggagtgag atttaaaata gactgggggt ttaggagacc cttttatgac acgtaattct 1320gctcccacga cgctcccatt tataccgccg gtccagctaa gggtctggta atggagcgcc 1380gttgaagagc agtatgatga agtggtcagg accaacggac tctggagctg ggctgcttgg 1440gatcaagtcg ctgcccctct gcttattaac gtgtgacctt gggccagtca tggacgctat 1500ctgcttcagc tcagcattca gtgctctccg tcacccgacc ccatctatcc aggattatct 
1560ctccctggaa agctacaaac gtctcaccct atgtgggcca aatgttctgg ataggcctag 1620ttaacctctt ctctccctgt tttctttgcg ctttcttgca gctatgtagt tatgctaatg 1680aaaagagcat cctaggggga gcagagttgt ggattctagt cctgactaga ggactagtgc 1740aaatgcgata ctcctgatga aaaatgtttc attcgttaga tataaatgtg ttaggcaggg 1800ttatggacac tagatgaaaa aagaaatacc tctactttca tagagatcac tattggacag 1860caaggcagaa ataattacaa ttcaagttgg aggcttatgg aggtgagctt gtaagaggtt 1920acaagaggcg ccaaggcagg atcgccaaag acggaagact ttggaagagt ctcatacaac 1980ggaagaggcg tt 1992284497DNAHomo sapiens 284gaaaggacag accaagtgca gtggttcgtt ccagcactta gggatgccaa ggtgggagga 60ttgcttgatg ctaggagttg aagactagcc tgtgtaacat agcgagaccc atctctacaa 120aaaaattaaa aagttacctt tagaacttac gatttttatg tgtagactcc atataagcag 180agggtctatg cttattcact atttattacc ttccatagtc cctgcacata taataggtgc 240ttcataaaca atttaatgaa tgaataaatt actgagaaaa cactggaagt ttttgggtta 300gcattgtgtt aggtgcttga tatggtctgg ctgtgttccc acccttatct catcttgaat 360tcccatgttt tgtgggaggt acctggtggg acataattga atcatgtggg caggtttttc 420ctgtgctgtt ctcctggtag tgaataagcc tcacaagatc tgatggtttt aaaaatggga 480gtttccctgc acaggct 4972851667DNAHomo sapiens 285agtgggatgc agctgaaaag atacccgaaa atatggaagc aactttggag ctgggtaaca 60ggcagaggtc agagcagttt agagggctca gaagaagacc agaaaatgtg ggaaagtttg 120gaacttccta gagacttgtt caatggcttt gaccaaaatc ctgataatga tatggacaat 180gaaatccagg ctcatgtggt ctcagatgga gatgaggaac ttgttgggaa ctggagcaaa 240ggtgacactt gttatgtttt agtaaagaga ctggtggcat tttgccctgc cctagagatt 300tgtggagctt tgaacttgag agaaatgatt ttgggtatct ggtgggagaa atttctaagc 360agcaaagcat tcaagaggtg acttgggtgc tgttaaaggc attcagtttt aaaagggaaa 420cagcatgaaa gtttggaaaa tttgcagcct gacaatgtga tagaaaagaa aatcccgttt 480tctgaggaga aattcaagct agctacagaa atttgcataa gtaatgagga tcccaatgtt 540aatccccaag acaatgggaa aaatgtttcc agggcatgtc agaggccttc atggcagccc 600ctctcatcac aagcctagag gcctaggaga aaaaagtgat ttcatgggcc agcccggggt 660ccccatgctg tgtgcagcct agtgacttgg tgccctgcat cccagctgcc ccagctgtgg 720ctgaaagggg ccaacctaga gctcaggcca 
tggcttcaga gggtgcaagc ctgaaacctt 780gacagcttcc aggtggtgtt gagcctgcag gtgcacagaa atcaataatt gaggtttgag 840aatctctgcc taggtttcaa agatgtatgg aaacgcctgc atgtccaggc agaagtttgc 900tgcaggggtg gggtgctcat tgagttcctc tgctagggca atgtagaagg gaaatgtagg 960gtcagagccc ccccacagag tccctactgg ggcaccacct agtggagctg tgaaaagagg 1020gctaccattc tccagacctc agaatggtag atccacagac agcttgcacc atgtgcctgg 1080aaaagctgta gacacttaac gccatctcat gaaagcaacc aggcagtgtg ctgtaccctg 1140caaagccaca ggggcagagc tgtccaaggc tgtggttgcc cagctcttgc atccgcatga 1200cctggacatg agacatagag tcaaaggaga tcattttgga gctttaagat ttgactgcca 1260tgctggattt tggacttgca tggggcctgt agcccctttg ttttggccaa tttctcccat 1320ttggaatggc tgtatttacc caattcctat accccattgt atctgggaag taactaactt 1380gcttttgatt tgacaggctc atatgcggaa aggacttacc ttgtcttgaa tgagactttg 1440gactggaatt ttgaattaat gctgaaatga gttaaggctt tgggggactg ttgggaatgc 1500atgattggtt ttgaaatgtg aggacatgag atttgggagg ggtcatggca gaatgatatg 1560gtttggctat gtccccacct aaatcccatc ttgaattccc atgtattgtg ggagggacct 1620ggtgggagat agttgaatca tggggatgga tctttcccat gctgttg 1667286913DNAHomo sapiens 286ttgaaagttg gtcttaggaa gaggaacttt ttgtggaaat ttcttaatat ttgaagaata 60ttatgttatt gttcctctgt ttttcatggc gtagtaaggt tttcactaat gagcttgcca 120ttctttctat tttatttttt gtttactagg gttctgttga agataccact ggctctcagt 180ctctggctgc cttgctgaat aagtgtaaaa cccctcaagg acaaagactt gttaaccagt 240ggattaagca gcctctcatg gataagaaca gaatagagga gaggtatgtt attagtttat 300actttcgtta gttttatgta acctgcagtt acccacatga ttataccact tattgtaata 360tgcagttttg gaagtatatg ttaccattta actgtacaga gtacatagta atagagtggt 420aattatttag attgattaaa gaactcattt ttttaaataa gttttttttt tttcactata 480aaagtttatt ttatttgaga tggtatggta tcgaacatgt tcatattgtg tgtaatcgtg 540ggtaaattac tcaaccttta tgtcatagtt tcttcacctt taaaatgaca ttaataaaag 600agctacttaa taggattata agcatgagat gatttaatat acataaaata cttacagtct 660gatatatagg aagcacttaa ctctttatcc tagaaaagat ttaaggtgac cttaacatat 720atgtcagaaa atctttaaaa ttgtggaaat aaaaggttgt ataattctgc tatcctaaaa 
780ttactagtat ttcaatatat tttattttag tcttttcttt tagatacaag ttttaaaact 840tttaagtgaa gtgtaatata cgtaagtact gcttgatgaa tttaaggtga tttctaaagc 900caggtttgtt ggg 913287907DNAHomo sapiens 287agacgcccaa aatcaacaac aacaaaaacg atattggaat gattggatcc ccaaagataa 60atgtttgagg tgatggatat ctcagttacc ctgagttaag tattatacat tgtatacgtg 120tattaaaata ttacaaaccc ccaaatgtgt acaattatga ggtatcaata aaagagattg 180gaaggactgg gtaatttgca agtaattaag gcaatttaca atttttaatt tttatttgtg 240aataagtagt tatacgtgtc aaaattcaaa aaggacaggt ggatatacag tgataagtca 300tccccccttc tctgtcagct ccataaagag cccctgtctt gcatggctcc agggtcacat 360ttcctattgt attttgccac cacctgccct gggagcaaca gtgttagttt cttgaacatc 420cttccaagca gagtctgggc ctacacaagc aaaacaagta tgtctattct ctctcctctt 480taattttttt aaaggaagtg attgataatt taacactcaa gctataggtc attggttata 540tttttaattt ccaatttatg ggaatagagg aagtgtcagt gatccccttc tggtttaaga 600actggaggat gcatgtgttt agacccttta gaaacctgaa atgtcaccta atataattat 660cagagtaaca ctttttagta agcaagctat ctatcaaaag taggtttttg aagaagaggg 720taaggaaagg ttactttcat gggacatagc aataatttct aaaatctaat ggttttacaa 780gacttgttca ttagaagtaa catctgtgag gatggcttta tgagtcaaaa tattatctgc 840ttaatacccc acctgtaggg taagaagaaa tgtttttttc ttggtgacaa tttttagcag 900caagcgg 9072881137DNAHomo sapiens 288tgattgccaa ggaagattca cagggcctag aatggcagtg gttatgcatc tacagtttat 60tacaggagaa ggatacaatc cagtagcagg attatggtaa ggatatgcat cacagtcaaa 120ggctgtcata gcaagtcatc cagagagttc gggtgcaagt tccagttttc ctttgttgtg 180taaagtctgt ggtggggtgc attttctctc tcagagcagg atgtgtgcac aggacacctt 240ggaacctagg agcccaaaat agagtcttca ctggactttt taatattttt cttgtcaagc 300ggacatgttc ctgttctcta actagcctct tcagtggagg tcagaggaag agcctcattg 360agaccaagtg caactcatca atcacatgaa acaatgctga taaataaacc acctaaatat 420cccctgaccc acaaatacaa aacaacacca ttcaatcagt atttttcatg ccttgatcag 480gggtcattgc catgcaggaa ctttaacaaa acagtacagg ctaataatag aattgttgga 540attaactcac acagcacacc tatgagagag agttaagata gagggtcttg gtggtctcta 600acagttgaat tcaaagtgaa gttaccagag taaagtgagc aaagacacat 
attagtacaa 660tattggtaga taaaatcacg ttgctctaat aagcatagtt ttaaacttta accatgtttc 720tccagtaatt ttagtaatta tattgttgtt atgtctaata cataaagcat tttttacttt 780tttaaaaaat ttttaggcaa tgtggggtcc aaagtaatta aaaaaaaatt tttttaacat 840aaagcatctt aaaattttac ttaatcatga tcacttagaa ccattaaaac atacgttttg 900atattatggg gaagcttcgt tgttcctttg tagacagact taaagaaata caactttatg 960atgacaagat ataagataat tatagattta aattttatag aaaccttttc ccttatctag 1020tgcaagaggt agctaagtgc ttattttctc aaagtactgt gttataaaaa gtattcctag 1080tgtagtcaaa gcttctcttt agactgataa aacttagagc acctgcattt acttcca 1137289457DNAHomo sapiens 289tcattcttgg gtgtttctcg cagaggggga tttggcaggg tcacaggaca atagtggagg 60gaaggtcagc agataaacaa gtgaacaaag gtctctggtt ttcctaggca gaggaccctg 120cggccttccg cagtgtttgt gtccctgggt acttgagatt agggagtggt gatgactctt 180aatgagcgtg ctgccttcaa gcatctgttt aacaaagcac atcttgcacc acccttaatc 240cgttcaaccc tgagtggaca cagcacatgt ttcagagagc acagggttgg gggtaaggtc 300acagatcaac aggatcccaa ggcagaataa tttttcgtag tacagaacaa aatgaaaagt 360ctcccacgtc tacctctttc tacacagaca cggcaaccat ccgatttctc aatcttttcc 420ccacctttcc cccctttcta ttccacaaaa ccgccat 457290564DNAHomo sapiens 290gagggagagg gaaccttttg ttttattcca gtaggaccag ctagaaacag aaggtgattg 60accagtatta gggatggaat cagggtacaa ttatggagac aggctatcta aacaattcac 120tctcaccatt taaatcagct gtttgatcat tttttttcca tatatcttta ccatcgcata 180gtaaataata tcctttttat tttcaagagg gagtattggc cttaagttag gaactctctt 240aatttttttc ccccatcatc ccacccgcac ttcttactcc ttacttccta cttgctttta 300ttctttactg gctctttacc actgcgtatt tttaggtgca tacatctatt ttttaaaaaa 360gcacccttgt tcctgggtcc tcttccagta ccatctatta atatatctct ctccctcttt 420ccactcccag ctgggtttct gaaagcgtgc acttcccatc ttccattcat tcatctggtt 480tccagccctg accacagtac tgaaatggca tttgctaggt gacctttatt tttttttaaa 540tccagtgaat gcggtatagt cccc 5642912249DNAHomo sapiens 291tgttgattca tgggcatttg ggttggttcc acgtttttgc aattgtgaat tgtgctgcta 60taaacatccg tgtgcaagta tcttttttgt ataatgatat cttttcccct gggtagatac 120ccagtagtgg gattgctgga tcaactggta gttctacttt 
taaggaatct ccacactgtt 180ttccttagtg gttatactgg tttacattcc caccagaagt gtagaagtgt tccctgttca 240ctgcatccac accaacatct atttttgatt ttttgattat ggccattctt gcaggagtaa 300ggtggtatcg cattgtggtt ttgatttaca tttccctgat cattagtgat gttgagcatt 360tttttatgtt tgtttgccat ttgtatatct tcttgagaat tgtctattca tgtccttagc 420ccattttttg ataggattgt ttgttttttt tcttgctagt ttgtttgagc ttgttgtaga 480ttctggttat tagtcctttg tcagatttat agattgtgaa gatttttttc ccactctgtg 540ggttgtctgt ttttgtctgt ttccttctgc tgactgttcc ttttgccatg caaaagctct 600ttttttttga gacagaatct cgctctgtcg gccaggctgg taacaaagac acaggtactg 660gtaataactg ccatggctta ttgcctacat taatgatgaa agcaaatgct aaatttcagc 720tagaggctag agaaaataag cctggaattt tcttttatgt ttatatactg ctatgaatac 780caggagtcct tgggttaaga ctgtagggct ttctaaagcc tgtgatcact agtggagaat 840gtagctttac aaagtctagt tggaaattgg caactggggg ttagtacaag ttacaaggaa 900gggatggaat ttaagatgct agtgaaagct tggaggataa gggagcaggt gaactcataa 960ggaagtttat gaactgagaa gggctgcagc aaagtgggct catgtgcttg aggagccaga 1020ggacatgttg agggtgacat aggttctgaa gttcgtacag atacttatgc agtatggatt 1080cttggaaaac cttctttagt catgtgatag aaaaataaca gcttatggaa aaaacagggt 1140tgaggcagac ctgaaaatac atgaaatttt aaaaaccgct tctaacagaa gcataacaga 1200ctgtaataaa aactgtggcc ttcctggcat ttgcacccaa acaacagcat tagccaactc 1260tttgaagcct tagatctgtg gctcttgttt tctcctttga ggtgtaggtc cttgagggca 1320tttgcttcta atagaggcta gtttcatcag aattaaaaat ctgaaccatg gtatgaaatt 1380caattctttt ttttttttct tttttgaaaa cactggcaaa tgttttgtat ccttgagctt 1440tcccacatat cttaacatag tgagtggaaa gtacagtggc tgttaagcca actactctga 1500ggtcttcact gctaaggctt actcttaatt gtgtgagagc ttaaccttga tccctttaaa 1560acattaatgg gctagaaaaa aaaccattca taaaccagtg ccacctctga attttgctac 1620cacaattccc ttatttacca atagtgcatg agctaatttg gaataaagaa ctaggcattg 1680tagcacaaca gacattatgt gggcaaagtg ttgtttatat tctgtctaaa tagtgcttca 1740catgtatgta ctattttcta aatatgtata gatgcttttg tgattaataa taaaacatga 1800attcttaaaa caattttgct gacttcatag tagcttttca ccgttttttc agtagctgct 1860aaaatttctg gagaagtttg 
ggaactattg ttttggagtg aaatgcagtg tgttagatat 1920cacttgcaga attcttctaa gggtatttat tggcgattag aaaaaaaatc cttgtgttat 1980accagtagta atacaaagta attgttcagc ttctgttaag tgtaaaggac tatacaagta 2040ttgtgtatag ttatctcatt tattattttc tgggtagcta ttgttattat tacttcgtac 2100aaaaagggaa aaggaggctc aaagtatcat gctccagata acagagccag taggtagcag 2160agctgggatt gctacccagg tctctagtcc tgctttttca cactatatac tcattgcttc 2220acttactcct tcatacatga ttccccagc 2249292890DNAHomo sapiens 292catcaagcac agttccattg tgtaaaaact tggcttgatt taacctgtta attggaacac 60tgtcattaat ggaaattagg aatatgaggt aagctagagg ttttatttta atgactttgg 120gttattaaat ctataagaaa tgaaattcat ttagtcataa ttaatgtcat gtttctgcat 180ctatattact tgttgggttt acagacgagg tagtgtatta ttagtgggaa gctttgagtg 240ctacatcatc tccctttcta taaaataaat tgagtacgaa acaatttgaa ttaaaacacc 300tgagtaaata gtaactttgg agacctgctg tactatttgt accttttgga tcaaatgatg 360cttgtttatc tcagtcaaaa ttttatgatt tgtattctgt aaaatgagat ctttttattt 420gtttgtttta ctactttctt ttaggaaaac accagaaatt attgttggca gtttttgtga 480ctcctcttac tgatcttcgt tctgacttct ccaagtttca ggaaatgata gaaacaactt 540tagatatgga tcaggtatgc aatatacttt ttaatttaag cagtagttat ttttaaaaag 600caaaggccac tttaagaaag tttgtagatt tttcttttta gtatctaatt gtagcacctt 660tgtggacagt ggatgtaata ttaagtgaca gatgggaaaa ggatttttaa aaaaatagca 720actgtttcag tggatgaaat aaagattatt agcagagaaa atgaatattg ggcataactg 780tcctggtgaa agacaatctc ataaatgaac aatttcataa tttcgtaaat gcaactgcat 840tttattttca aagagaagga aaattatagt cactggaaac ggaaagagaa 8902931224DNAHomo sapiens 293ggagcttggg aattcaactg acacacgaca gatttacagg agaaaagttt tatttcaagt 60acacatgaga gcttcataga aaagaagtga agacctaaag aaacagactg gagagttcat 120atgccatttt aataaaggat aatgtattag tctgttctca tgctgctaat aaatacatac 180ccaagactgg gtaatttata aagaaaaaga ggtttaatcg actcacaatt gcacatggct 240ggggaggcct tacaatcatg gcagaaggta aaggaggagc aaaggcacat attacatggt 300gtcaggcaag agagtgtgtg caggggaact gccctttata aaaccatcag atctcgagag 360acttattcac catcacaaga acggcatggg aaaaacctgc ccccgtgatt caattacctc 
420ccaccgggtc cctcccatga cacatgggga ttatgggagc tacaactcaa gatgagattt 480gggtggggac acagccaaga catatcagat aataaattgt ggagaggcag taagattgaa 540gaaaagaggt ttgagcttcg aggggtggta aattgtggga aggtaattat ttggggcaaa 600ctaatggcac ataaggattg ttttagtaag gcttgttatg catacccaaa acaagtgcca 660tctccagtaa tttaagagtc tatggtgatc aagagtagtt ctcttcctgc tagaagaggg 720gtgggagaga acaccttcac aaagggaaat ttatattctg ccttcatgca gaaagggggc 780gagcagagag ttcctacgta tactgtttct tcattatctt cctctcaaaa gaatacttag 840gctaaagtgg catgatttgg ggtgacattc tgatcctctt cagtgacaat ccttgatatt 900tttccttctt tctctccagg taaacagtgt taacatcctg gtatgcttcc cccaattcca 960ttatactaac tctgtattgt gggttaaaga ttttttactt tgatcagcag tatttgaaac 1020atacctgtta tactagatgt actctgactg taaaatagtg gtcagtgtta cttctttaat 1080gatgctgtgg gattaaagga ttttattata aatgctggga agagcctgga tttgaggaag 1140gtaagcagtg cagttaggtg gatgtagact agaagaggtc atttgttctc atttcattgt 1200tgcccctatg acatgcccgt ttct 12242941550DNAHomo sapiens 294cagcctacgt gcccatttct taaagtagaa aatttagtag
ttgatgatgt cagggaagaa 60aagctttttc tctgccttac gttaagtagt tgggggcaaa ttaaattaat aaaagacaga 120ttagtgagag aaaaggctgt aagaatttgg actttatata ccatcataat agaggaagta 180aagggagatg aagggcactt aagggaaaac agatgacttg taggaaagat aaatgaaccc 240ttaagagaat agatgagaaa tatgaaggtt ttgtgacaat gtctgtttag gtggttactt 300ctcttcttgt tatgagagtc agtcttctgg ttgctggaaa ctgctaggag atttataaca 360attgggctct ttcgagaggc tcttctttta agcagataag ggagttcaca aaaaagcctg 420ttctcaaatg atttcagcac acacacacac acacttacga cacagttaag tactgtgcca 480gtaagatgtg agttgtgcat ttcttttttt tctctgagta gactgtttga ggttatttat 540atcaggactt gttatgcagg taactgaaaa ctcaacataa tctaggttat gtagttaaaa 600gtatggagag aaggtaggct ttatttagag ttgcttgatc ctgtagatct cttctacctt 660ttgtaatttt aatttcaacc aaggatggtt ccctttttgg ccctaggacc agttagcagt 720tggggcacca ttccaagcaa gagtcctgca gtttgtagtg atgggaccat ttaaacagtg 780attgtgacca gggacataga atgggatgat tggcccgagg taaccatggt tgggtatgga 840gtcagcttcc ctgtaggaag agatagacaa agtctgagca ctcctgggaa gggggaggaa 900gggaataact gttgtgtaaa tcatcagcag tgtctactaa gataccatct gtaaccatag 960gcttctatgt tttataatat aaggctgtct tttaaataaa tcagattccc tgttaaagat 1020ctgttctaga ttccctaggg ggttgacctc atatagtatc ttctttttct ttggttacaa 1080acttttaaac ttgtctgagg ttataaggtg aattcaactg tccactgtca atgtagatat 1140ttttaatgga tttagggatt taaattacat gattcagaac cactttgagg aagtctaggg 1200aatatcagtt gtttctgtat aatttctgaa agcttcactg ttttctaggt gtgcacttaa 1260ttcatgtgat gaagggaaca gtatttacat gagtggtttg gttaattttt cccctcctaa 1320gcttagcttt gtgtatcgtg cgtgcttcca gtgtttttgt ggctgcttta cataagtctt 1380ttagaagtat tttctatttt tgaagtaaat gtggatcaaa accaccccaa gacaggattg 1440aaaaaaagac agtttttcgc aagaaagtaa ataattttat ttagcttggg actttaaatg 1500atatgtctta aatgtaaaca tttctatact gcattttggc catcttttga 15502952828DNAHomo sapiens 295gtgttgcacc cattaactcg tcatttacat taggtatatc tcctgatgct atccctcccc 60cctccctcca cccctcaaca ggtgtgtgat gttccccttc ctgtgtccaa gtgtcctcat 120tgttcaattc ccacctatga ctgagaacat gcggtgtttg gttttttgtc cttgcgatag 180tttgctgaga 
atgatagttt ccagcttcat ccatttccct acaaaggaca tgaactcatc 240attttttatg gctgcatagt attccatggt gtatatgtgc cacattttct taatctagtc 300tgtcattgtt ggacatttgg gttgattcca agtctttgat gttgtgaata gtgccgcaat 360aaacacacgt gtgcgtgagc ctttatagca gcatgattta taatcctttg ggtatatacc 420cagtaatggg atggctgggt ccaatggtat ttctacttct agatccctga ggaatcgcca 480cagtcttcca caatggttga actagtttac agtcccacca acagtgtaaa agtgttccta 540tttctccaca tcgtctccag cacctgtcgt ttcctgactt tttaatgatc gccattctaa 600ctggtgtgag atggtatctc attgtggttt tgatttgcat ttctctgatg gccagtgatg 660atgagctata gaaatccttt ttagaaacaa cagagccttg ttgtaaaaca ggtaaatgta 720cgtgaggact tcaaaaagtt tgtggaaaaa tggaattaaa agataaaatt taaaaacaca 780ttttaaattt atttcccaac ataagctcct caagttcaag acacttttat aaatgatgat 840ctcagctgtt tagttcatcc gtaaagaact gagggtacta gaaattttac catgtcaatg 900cagtctcttt acattactaa ctaaagaaaa ataggtgctc tttaaagatc ttttaagatt 960aggaacaaaa agaagtcaga agaagccaaa tcaaggtgga tgcttaacga ctttccatag 1020aaacttacaa aattggcctt gtttgatgag aagagcgtgc aggaacgttg tcatggtgga 1080aaaggacttt gatgatgctt tccctggcat ttttctgcaa aaacttggga taactttctc 1140aaaacactct aataataagc agagcttatg ttctttatcc ccccagaaca tcagcaagca 1200aaatgcctga acatcccaaa aaactgttgc catgaccttt gcccttgact ggtccacttt 1260tgcttcgact ggaccacttc catttttggt agccattgct ttgattgtgc tttgtcttca 1320ggatggcatt ggtaaagcca tgttttggct cctgttacag ttctttgaag aaatgcttca 1380ggatcttgat cccttgttta aatttctatg gaaagctctg ctcctgtctg cagttaatct 1440gggtgcaaca gttttgtcac ccatcaagtg aaaagtttgt tcagctttaa tttttcagtc 1500agaattgtgt aaactggacc aattgttgag atgcctgtag tgttggctat tgtttctgct 1560gttagtcatt agttctcttc aattagggaa tgaacaaaat taatttttcc tgaaaaattg 1620atgtggatgg tctgccgctg tgggcttcat cttcgacatg gtctcatccc ttgttagaac 1680aagttatccg tttgtaaact gctgatttcc taggagcatt gaccccataa aattttcata 1740aagcatcagt tatttcatta ttcttctatg cagacttcac tataaatttg ctgtttggtc 1800ttacttcaat tttagcagaa ctcatactgc tctgacatct aaactgatgt cttagccttc 1860atagtgtctc tgactagatc ctattcagac gtgttatagc aaattagtaa 
agtttatttt 1920ggtgccaaaa acttttgaat ccacgcatag ttttttcaca acacattttc catgaacttt 1980ttgaagaccc ttcatatatt ataagaagaa agttaaaaat atcccctgca tctactactc 2040agaaataacc actgttaaca ttaagtctgt tctcaactct aggcattatt gagggttttg 2100aggacaggtc ttgaaaattt ctatggctac cttttactgg gtggagacta gcatgtatag 2160ttgaccgcat aggttaatcc ctccactcaa aaagccacaa ttttaaagtg tagtattcac 2220tagcatttag tatattcaca gtgttgtgaa atgaccacca ccatctagtt tgaaaatatt 2280tcatcacaac caaaagaaaa cctcatatct attagcggtc tctcctgttt ccccagacac 2340cggcaaccac taatgtactt tttgtctctg tggacttgtc agttctggac attttatata 2400aatggaatca tgtgaccttt tatgattgac ctctttcact tagtataatg ttttagaggc 2460tcatccacat tgtagcatgt gtcagtactt catttccttg tgtattggtc cattcttgta 2520ctgctataaa gaaatagctg agactgagta atttataaag aaaagaggtt taattggctc 2580atggttctgc aggccgtaca ggaagcatga tgatggcacc tgctcagctt ctggggaggc 2640ctcaggaaat ttaaaatcat ggcagaaggg gaagtgcggc gtcttacatg gtggagcagg 2700agcaagagag agaaggggga ggtgctccac acttttaaac aactagatct caggacaagt 2760cagtcactat aatgagaaca gcacccaggg gaaaccgccc tcacgatcca gtcacctctc 2820accaggct 28282961379DNAHomo sapiens 296cacgatgcca gtccaattct tgtgtagttt tttaatcagc tgaatttaac attcaaattc 60ttcttttaaa tcttccaata ggcagttatc tttataaaga tcctatataa tcaagacttt 120gtttctgaat attttatgta tgtttttgct actgtaaatg agatctattt ctcattgtgg 180tttcttgctg ttattactgg taagaattta gtgaaacaaa gtacttaaga gtatgtcttt 240aaattgtgag attttgatga acttttaaga aataaaattc tttagtttct tagagctttt 300tgagatttct aaggtagatc cttggtttgg gcaacatata actattacaa gttttgcaca 360ttgaacgtta tttggtaatt tttagagagg acattttaaa tgtttaggaa aaatataaat 420aaaatgtaga atactattgg gggcatatac atcatcagca ctgtaactgt ttcatatgaa 480tcatttttgt acatatagaa ctctaaagtc ctaatgaaca gaattttaca tttctataaa 540tagaaagtcc ttaatagttg tgactgaata acttatggat agcaaattat ttaactgaaa 600acagtaaaat ttaagtggga ggaaatattt gctttataat ttctgtcttt acccattatt 660tataggattt tgtcactttg ttctgtttgc aggtggaaaa ccatgaattc cttgtaaaac 720cttcatttga tcctaatctc agtgaattaa gagaaataat gaatgacttg gaaaagaaga 
780tgcagtcaac attaataagt gcagccagag atcttggtaa gaatgggtca ttggaggttg 840gaataattct tttgtctata cactgtatag acaaaatatt gatgccagaa ttattttata 900agttccctgt ccccaagatg atgacttcac atctctgtca aacagaaatc gcccaacagg 960cccttgtatg atgtcattta aacaagccct attttaaatg tcacctccac tggtaacagg 1020atactcctag gaggatcacc aagcccaatt cttctaggag tagtgcattg attaggcttt 1080ggggtttcca agcagttcat taatgtcact tttggaaaaa gtctgtcttt cataccagct 1140tattaattcc ctatgggttc acacggtttt ttttcctgga ttttcatcaa acatgtgtaa 1200ggtactcagt acaaagaagt ttagaaatcc agaacaaagc agtgtattta agtagtagta 1260aacttccaga taatctgatg cccatatcta catatataaa aaatttgcaa atagttctgt 1320agagagtcca aacatggagt agatccctaa ttaagagcct ttgcattaaa gtccacctt 13792972455DNAHomo sapiens 297ggagtgagag cgacaccttg tctttaaaaa aaaaaacaga ggaatgcatc atagtatata 60ttaaattatt gcctattttt ttatctattt tattgagtgc taataagaaa attaatggca 120aaaacttgtt ttttacagta taaattaagt ttaatttcat tttaaaatta agtaaatttg 180ttttattaaa aagtatgttg aaagcaacat aaatagcact caaattgaga cagaaactgt 240aactgtagta taagaagcat taggctggga attgggaaac acgagttcta gttgcagctt 300ggaaactttt tctgaagctc tttacaaatt acttaatttc tctggttttc accacattgt 360tctatagcat taacatgttg gattcattgc tttaattctt agacctacgt gtcatcagaa 420atgccattac actttgagga tttgagcctt attttaaata aagttgtgat cctcatggca 480gcctaggttt acatgtgtta aataaacagt attctgtaaa taccattgtc tttcatgttt 540agtgatgttg ctgttgttaa cactgcagtg aaatgcatat ataagcaaac tacattacat 600actcatgaac atggtccttt gttttgaaac tttgatcact gattgttcgc agtctttcat 660tgtggaacta ctctttcact ttgaatgttt tgagaggttc ctttgttcag atcagtccga 720tttcgtttct gggtgggtct ctactttccc ttttctcact ggtcaagcga ggtctgtcta 780attgtttgct actactaaca tttgatggcc acgcttcagc aagtacattt gtagattctc 840tctctctgtc tctcttaatt tgtggtctag agatcatatt ggttaatgaa attatgaaga 900gggaatgtat ttataaaaac tcaaattctt gatgcagaag gtctagctga ttgtgaaccc 960aaaatatccg agacaggtca caaccaattt agaaacttta ttttgccaag gttaaggatg 1020catccatgac atagtctcac aaggttctaa tgacacatgc gcaaggtggt tagggtacag 1080cttggtttta tacattttag 
ggagacatga gacatcagtc aacatgtgta agatgtacat 1140tgattctatc cagaaaggca ggacaacttg aagcaagggg ctttcaggta ataagtagat 1200aagagacaaa aggttgcata cttttgagtc cttgatcagc ctttcactga ataaacaagc 1260ttagtcttgt tagtgaatct gcgtttttac ataaacagta ggtcagagga agcaatcaga 1320aatgcatttg tgtcaggtga gccgagggat gactttctgt ccctcacctg tgaagataag 1380ctatcagttt ccattgctag ggtgaaattc aacagaattg tttgagagtg aacatctgga 1440ggcccacaag gactttcctt gtggagggga agtatgtagt gagggaagta tgtagttttt 1500aaatctttgt cgctatctta tttagaaata agatggaagg caggtttgtc tgacatagtt 1560cccagcttga cttttccctc ggcttagtga ttttgcggtt ccgagattta ttttcctttc 1620acatatcagt cagatcattt ggtttgtgaa gtttcctatg cttaacagaa aatatgtgca 1680ctagttttcc tagagtttca ttgtcagagt ctcaagtttt tgtttggaaa ttgtatttgg 1740tcacattaat tatactctat gttagttcca aagaaatacc tttggttaag aaaagaattc 1800tcatgcataa ctcctcgagg gtggggttac accttaatcc atcctcaggt gctcatggta 1860attggggcaa atatgttgcc cagtgctggt gctctgcagc cttggatggg tttacccaga 1920aagcagcttt caagtcagaa actaacattc ataagggagt taaggatttt ataaatagat 1980atccataatt catgtagttt tcaagtaagt agtatttgaa tcttttctgg ttagataata 2040attgtgagta tgttgtcata taataacagt atgtttttca ctatttaaat aattttagaa 2100ttacattgaa aaatggtagt aggtatttat ggaatacttt ttcttttctt cttgattatc 2160aaggcttgga ccctggcaaa cagattaaac tggattccag tgcacagttt ggatattact 2220ttcgtgtaac ctgtaaggaa gaaaaagtcc ttcgtaacaa taaaaacttt agtactgtag 2280atatccagaa gaatggtgtt aaatttacca acaggtttgc aagtcgttat tatattttta 2340accctttatt aattccctaa atgctctaac atgatgtgaa tgttctatga taagttttac 2400taatgtagtc atcaggtaag agtcaagctt tcttccatag agcagtcagc tgtcg 24552982194DNAHomo sapiens 298cacaatggga aaggatgtag caacacattt taaccctatg ttgagtttta ggtgggttcc 60tttgaaattt tgttaaggct aacttttgtt aattttttta aaaaagtgta aattaggaaa 120tgggttttga attcccaaat ggggggatta aatgtatttt tacggcttat atctgtttat 180tattcagtat tcctgtgtac attttctgtt tttattttta tacaggctat gtagaaccaa 240tgcagacact caatgatgtg ttagctcagc tagatgctgt tgtcagcttt gctcacgtgt 300caaatggagc acctgttcca tatgtacgac cagccatttt 
ggagaaagga caaggaagaa 360ttatattaaa agcatccagg catgcttgtg ttgaagttca agatgaaatt gcatttattc 420ctaatgacgt atactttgaa aaagataaac agatgttcca catcattact ggtaaaaaac 480ctggtttttg ggctttgtgg gggtaacgtt ttgttttttt tttttttttt ttaatcttgg 540agtagaaata tatttaaaat tgatggagaa aattcccagt tcttaacatt agaaagggaa 600tatattattc ttaccagtta gtaatctatt cacatttggt ttagagggaa gatttagaag 660gtgagataaa agcttgtgag agaatagtgt attcatgtga aacttcttcc atgggttcag 720agcatttaga aacaaacatc ccttcacact caaagcttac ctttgagcca gtcctccaat 780agtgaggtct ttgaaggtca ggccaaattg gctgtgggag gacctcaggt taggatagga 840attattttaa gacatggcac tatattcatg tgaaactcgc aaaaactagc cttgcatata 900ggctcatgta tcatgtctca gctgagatgt ttgagagatc ttaactagat tctagaaaac 960aaaaaaggaa gtagttttgg ggcaaatata tttgggaaac agtttattgt atttcctttc 1020cccaaatgga ttttcaagtt cttcatataa tctaacccca acaaataaat tgcctgtttt 1080tcaaaagaaa gatcatgtct tcaggttttt gtgtggggtt taaatgattc gaaagatttg 1140accatactga tacattcact agtaacctta gttactaatg agtaatggtt ttgagttaat 1200cagttaggcc tgaactactt ttctggaagt tagtaaatta tctcacaggc agccctgtga 1260gccatgggaa aatgtgtata tggtctttct aggccacagt caaattacag gtatatttgt 1320catggcttct cttgatgaaa ggcccagtat cggtttgtct gaagatatat aatagcattg 1380cttttggggg taatatgggc agtaactctg tccacatctt tgggcaggct gtggttctgc 1440ctttatatgc tatgtcagtg taaacctacg cgattaatca tcagtgtaca gtttaggact 1500aacaatccat ttattagtag cagaaagaag tttaaaatct tgctttctga tataatttgt 1560tttgtaggcc ccaatatggg aggtaaatca acatatattc gacaaactgg ggtgatagta 1620ctcatggccc aaattgggtg ttttgtgcca tgtgagtcag cagaagtgtc cattgtggac 1680tgcatcttag cccgagtagg ggctggtgac agtcaattga aaggagtctc cacgttcatg 1740gctgaaatgt tggaaactgc ttctatcctc aggtaagtgc atctcctagt cccttgaaga 1800tagaaatgta tgtctctgtc ctgtgagaag gaaaagtata tttgcagatt ctcatgtaaa 1860aacatctgag aatgtttgtc ttagtttaat agttgttttc ctgtggactt tatatacttt 1920gtattgtctt aaaagagtga ttgatggtag ctacggaaaa ctttgatttt taaaattgtc 1980tctttaagta gacaatttat aagctactgg tacgagttca ccttataaat ctccactacc 2040atgtttttgc ttggactgtt 
cacacttcct ggaatggtcc ttcttgccgt ttatccaact 2100tctttctaat ttttaagtcc ctaatgatgg gaattctatt tctgtagtga tttttctggt 2160catacgaccg taaggtcatg ggtgtttttc tctg 2194299571DNAHomo sapiens 299caccgtgatc ctccttattt cttagtatct tctaaagaac attaaatata gtaggtgcct 60agtaaattat gtattgattt aacttctttg aggttctgtt gtttgtgaag aattataaaa 120gcaatacaaa tgtttgtata gtaattaagc aacaggttaa tattcatgac ttaaaagatt 180aaagaaataa gcaaaacatg ttagctggca actcacagaa aaagaattaa attgccaatg 240agcacacgag cacatgaaaa attagcaaaa gtttcacccc tttacatata tttggttaaa 300attgagaaaa gaatagtaat agatggtatt ggtaggactg tggcaggcac acaatttaca 360tgaccaccaa aagtgtatgc aggtatccat gtcaccacac cctggtctca tcttcattca 420gttttattta ttttttttaa tctcggccta tttgattggc acgaaatgaa tgatagctgc 480cttatttgga attcctttga ttactactag tgtgcttgat aatgtaaaac aatattcaaa 540atctgttttt cctttcatcc gttgtttgtt c 571300497DNAHomo sapiens 300gtggcatatc cttcccaatg tattgtctta attttgtttt tgtatgtgta tgttaccaca 60ttttatgtga tgggaaattt catgtaatta tgtgcttcag gtctgcaacc aaagattcat 120taataatcat agatgaattg ggaagaggaa cttctaccta cgatggattt gggttagcat 180gggctatatc agaatacatt gcaacaaaga ttggtgcttt ttgcatgttt gcaacccatt 240ttcatgaact tactgccttg gccaatcaga taccaactgt taataatcta catgtcacag 300cactcaccac tgaagagacc ttaactatgc tttatcaggt gaagaaaggt atgtactatt 360ggagtactct aaattcagaa cttggtaatg ggaaacttac tacccttgaa atcatcagta 420attgccttat tctaagttag tataaattat tgatgttgtt atagaaccca tttacccctt 480aattcacagt ctggggg 4973013662DNAHomo sapiens 301gatgcagatc agggaaatgc aagtcaaaac cacaatgagc tacaacttca cactgattac 60gatagttaaa atcaaaaagt cagatggtaa gtactggcaa ggaagtggag aaattgaaac 120tgtcatgcgc tcttggtgcg aatgtaaaat ggtgcagctg ctttggaaaa cagtctggca 180gttcctcaga caattccact ccaacgtata tccaagtgga atcacaacat atgtccccac 240aaacttgtac ataaatgttt atagcaggat tattcataat agccaaaagg tggaaacaac 300ccgaatgtcc atcagcagat gaatgcataa atgaaacgtg gtctatccat acaatggagt 360atattattga gccattaaag gaatgaagta ctggtacatg gtgcagctta gatgaacctt 420ggaaacattg tgctaaatga aagaagctgg ttacaagagt 
caacacgtat gatttcattc 480atgtgaaagt tcagaataga gacagcagta gagacaaagt agcagttcag ggttggtgcc 540agggaatagg gggtaggtgg ggtgaaagct aaaggatacg gtgtttcttt gtgagatgga 600aattctaaaa taggtgatgt ttatacatgt ctgtgaatat actaaaaacc attgaattgt 660acacattaaa tggatgaatt gtataggaat tatattttaa taaagctatt taaaaaaatc 720cagacacttc acccaagagg aaatctaagt ggtccataaa catgaaaagg tctttaatca 780ccagtcagaa aaatgaaaat gaaaaccatg ccaggccacc tcccaccacc atagtgacaa 840gcatttcaag tgtggcagtt ccagctgttg ttgaggatgt ggaataacac tggtaggggt 900gttaagatta tctggtgaaa ttgaaaagac gcatacgacc cagcaattct gctcttaagt 960gcatactctg gagatgcttt tgcccattgt gctgcgagat gtatacaaga atgttcctaa 1020tacctccaca ctggaaacaa ctcatcagtg aaaatgaact acagctacac aaaatgacat 1080agatggaatc ttaaaacgtt tagtaaaaga aatgatacaa aaggatacag tttttttttc 1140atttatgtga agtttaagaa taggtggtat tgtttaggga tgcagtcttt gggatggcaa 1200ctgtaaagaa aaagtgattg tgttaatcag agtgattgtc tttagggaaa tggagtgctg 1260atggggaggg ggcacattag ggcttctgga gggccacagt tctggttttt aacctgagtg 1320gtggttttgc acgtgcttgc tttatagtta gctgcaattt ttttttttaa tgcagttaaa 1380gtttggtatg agaacaaatg tatgaccgat gagtcctttc agtttaccaa gttctttttc 1440gtcatcgtta atttagagtg ggttacatca gtttttcttt tctggctgcc aaaggcttag 1500gaaaaaggca aactgacaga ggaagatttt aaatgtagaa atatttattg gtttacaaat 1560ccttttaatc acttatacat gaaaagcttt catataattc aaaaagcaaa ttttaaattc 1620caatgaaata gttcatcccg tggttgtgaa agagtgtttt tagattgctg cacagaagca 1680tgtttaacgt ggaaatcagc tcatggtttt agttgttagg gctacaagaa attgggggag 1740acttcattcc aagaaaacat gtagtctgtc aggctgtttt cattcctcta aaagagacag 1800ttttctaaga tgtttttgaa aatgagaaaa tacgtaatag atctgcttaa gaagtttcaa 1860acttaatctg tgcttattac atgaatatgc taatgtaaaa ccaggccttc agttagtgtt 1920tccttccttt tagaatggtg tatgtaaagc aaaatataaa ctaatttctg acctgtcaaa 1980ggttttttct taaaatttaa atttataatg tggtttggtt tttctttccc actcaaacat 2040gaatttgggt aataccagaa taaagctgga tatataaatt ttatccaaaa tttagaactc 2100tgttgttaag aaatctgttg accacataac catgtttctg agaaaataca tgattttttg 2160catctttaaa aaaaattagc 
actaagaagc taagatgaag ttgtttttgt aatttgattt 2220ttttttcctt aaaatactgt tttggagtta aaagttgtag caaaactggt ataagaaaga 2280tgttttaaga tatatttaag tcttgtctca tactctattg actaagctag cccggtgact 2340agggtagatg tatttaaaga ataacttttc ccccttaaaa tctcaatatt ccacatcctg 2400ttagacttct tgagtattaa atacatcttc tatccttggt ctttctgcat ttagcttttt 2460tgggaagtat gtttttaccc aagcatatgg tatgagctgc tgattcagta ttgagtggct 2520ctttaagctt gttagttaca ttctgctgat taaaatggtg tacagaatag tcaggaaaaa 2580ccagtccctg gtctgaaata aacaatgtta attagcttat ggggaagaac aaatgagtaa 2640ggagaatttt catatacaaa ggaaatctct gattgtcttt ctggactcag tgtgtttggg 2700ttaaggagat agggtgcggc tggagaaaat gatgaaaatg ttcagaatgt tacatgtatt 2760tttacactga aactggaagt ggaagcccag tgtgatagtt ttctgcccga tgttggcctg 2820tcttcacacc cacaccactt atcttgattg atagagctac tacttcctct tatactgctt 2880cagaagttaa cctctgtggt gcagtgctag gatatcacag aggaaataat cccttgtaga 2940cagtgtcttg ttgctgggag ttatcagtgc ctcctgttct ctctaaggag ggcaatggga 3000agccctttcc ttgcatttgc tacagccgtt tccttgacct ccctggaaga acagtatttc 3060atggtgtcag acaacattca gaacatgctc aaattaaatg tatgtcagta tgcatttgct 3120ttggtgtgtg gtttgtccaa accaaagtgc ccatacatgt ctctggtcca gccatgtggg 3180aaattcagca gtggggtgaa ccatatggaa atggcaggtg ttgggcagcc ttgactggac 3240tgccctggtc ttacttctgg atttggtgag tagaagatca cactgttgct tgctgccctg
3300ggttcacctc aaagagggaa agaagaatta gcaacttaaa tggttaaatt tagaaacaag 3360aaaaagttct gtcagtgggc agtttcttac gtctaacaaa aaaaacaaca gcagtgaatt 3420cttttgtgtt cagaattaac cagtaaacac aaccttagca attaacctat cactatcacg 3480attgtttatt tcctggaatt ttgttgacag aattagtcca ataaatgtta tgaataataa 3540ttttatgaat agagataggt taatgccaaa ttaagtataa ttgaatctga gacattaatg 3600caactgtttt aaattcaacc actgacctgc aatttttata tgccttgtcc atccagcaag 3660at 36623021498DNAHomo sapiens 302cttaatcctg aaaggcaggt gcttttatta tcttttatct tattatctac attttccaga 60tgaggaaacg taggtacaga ggtttagtaa cttgcccagg tcacatagcc agtaagtggc 120agagctggga tttgaacccc agtaccctat ctccagcgaa tctgagatgt acatgtgata 180aatttaatct ttctcaataa attattaagt gtcaaagcaa gtggtatggg caatgcacca 240ggattaagaa aaacagtgtg tggtaaagat gtaaaatatt tctaattctg ttgtgggctg 300tggcactccc gtggaaggct tgccacagac acagccagag gcatccacgt gggcccctgc 360tgcacacctg gtttgctgct accaaggctg ctctcccgag gcttgttcac acaaaggaaa 420gtgagcagct aggaagctgc atatttgaaa gttgactagt caccaaatgc tggcatccaa 480ccaagtgatt gcattgtacc ctgtttggat gaaagattgt gtttaaatga aaagagagat 540gatgagccag aagtgtggca aatgagttaa aataaattgt cagcagtgtt tgaagcaggt 600tgctgagggc tggtgtcctg aaatccggtc acttggagga tgtatatgtt ccatcagggg 660ccggaaatgt tttatccaag ctttagggaa taaccctgga gattctcttc gttactctac 720tgttaagtac gtgcttacgg agtaaacttc gcatgactaa ggtttacagg cctgaatgtg 780caactgagtt caagtaagca gcaatgtggt gtattaggaa gactgcttga cttgggttct 840aatccttgct tcaccaccta gctgtgtgac tttaaacatc actgtttttc ctcctgcctt 900ccttctgtaa cttaaggggg ttggattatt agagttctca aatgccatac cttcaaggcc 960aggtgcagga tgcagagaat agtgggttaa agtgaacacc tcaatgtaaa atcattcaaa 1020aatttaaaaa catcacggac caaacaaata tgtctttaaa tctgaatttg gttaaaggtc 1080acaagtttat gccctttgga gtactctctg acattttcat gatgatatga aaggattttt 1140ccatacatac tcaaaaggcg ctcacgcctc tgttgcagtc agtctggcca cttccaaata 1200gccaccccat gttggtctcc acttcttccc tccctcttta agtgctatgt taataatcta 1260gcttataatt ctctaatcag cagtagagca ctttgctact ttatttttta ttgttagggg 1320tgatcttagc 
agcccaagta tgctaagtct tagaaatatt catcagtgat gtttttccct 1380gaagctcgtt tggtgactgc taaactagaa ccagaattgg agaaaaacga ccctgtgaat 1440tccaagccaa caaagccggg gaagaggcat tgagcaacct gtggttgcct gagaaaca 14983032415DNAHomo sapiens 303gaaaccacag aatcgccttc ctccccagtt atttatactt caagtcatat tgtagagaga 60aaatttctgt cagcaaaaat ctcaggaatc ctcctcattt ctatttgtat ggctttcaat 120cgttgacatg attttttcac atatgtcatc ttctggggat ggattcgtat aaccctgctt 180cacttgcttc cctgtgggag gctcacttgc ttctcgacag gctctggaag aactaggcag 240tctggtacat ggttgtgcaa gaacccttga gggggccttg gagtgtgtgc ttgggccctg 300gaactcatgc ctaggatgga gggctgagat tgccccttcc catccaccag ggagttgaca 360agggggagaa gaaacttctt gtgagcttgc gatgacttgt ggcacttgca tcagaccttg 420gagttccctg gggagaggca ctcttgggta tgacactgta tagtgccacc tgattgccat 480ttgacccagt ttggccctgg atccttgagc aagagggctg gaaagaaaga caggcccact 540ttttgggaca ctattagggt ctgtagcatt ggtggggaga gaattccccc aacccccaaa 600agagctgaaa atgagacacg cgtggagggg tgaaagtgga gtgtggtcaa cagtgtggtt 660acagagatgt gtgtcggggc cactcccact caccagggag actcatgaag cagaagggat 720ggggcacaat gtggcttcca taggcacacc aagccacctg gagagcgcat cagccctttg 780ggtaccccca agcggaagga ggttgggtct ttgggtctgg gaactttggt gcttgttctg 840gtgggaaggg cagggagtca agaccagctg tgtcttccac tgctcttctt gtccactttg 900gttactggcc tctgttggca tgaactgggg aggcagaggc tacctacaga cgaggaactg 960tgtggagtgc gagtgtatgc agtaaagggt tagcttagct gacttgaggt actcacaccc 1020atattccgaa gaaaagactg gccctcagcc tgagcctccg aaataatctc taagccctta 1080gaataccctg ctttgtattc aaagagtatc tttgaatgct gaacttagaa ccactctaga 1140aaatgtatgc taacaatgcg atttatgatg aacacttgtc tttgttcccc tggggccctg 1200ggccacattg tatcagtttg agccctagag ggacagagaa tgagaaacta agatcagtca 1260tgcaggtgct ccaggcctat gtgaccaacc accaataaaa accctgaaca tcaaggctca 1320agtgagcaat acagctggtc ccaacttaca gtggttcaac ttgtgagttt tgcactctac 1380aatgggttta ttgggacata acccagtgga ggaggatctg tacttcattc acatgtgttg 1440tcacatcatt actgggagaa ttaagcactg tccacgtgaa tccactggga gaggataact 1500ggaagcttgc acctggcttc tcctggattc tgctctgtac 
gcctttttcc cttgttaatt 1560ttaatctgta ttctttcact gtagtaatct acaactataa gcagaatagc ttttctgagt 1620tctgtgagtc tttctagtga atcattgaat ccaaggtggt cttggggacc tctaacaaaa 1680gatgtctgga cctgaacttc ctgttgtttc aaagatccta tagcaggctg tcttaccaac 1740tttcagcatc aagaagctgg tggagagtgg gttagtttaa aaatgaaact ggggagagag 1800atgaagccgg gggaagatgc cgtgaaatct caccttatag gcagcctctg attcacctga 1860gggtttttcc ttgaatactt tctgggtaca agtatttgag acaggtgatg tgctggtcac 1920tttattctca gctgcttgtg gcctagccct aacatgggca ctggaaacaa tgggggtagg 1980ggttgatgat ggagaaatgg ggagtaaagg gatttaaaac tttgaaaaac tgagctgttt 2040ccatgatttg tctcttttga ttctcacaaa acctttatga aatatgtgct gacattttaa 2100gctctcactt atagtgagaa aagcaatctt cagcaaggtg atgacttgtc caagggaaga 2160catggtcgcc cttgttcctt gggagatttt gtgctcccag gggaaagcat aagccctcag 2220gagccatgat gagaacagct gtagaacagc aagtgaacag gtgtgtatca gtcaggatag 2280gcaaggctaa gctgcagtaa taaataatcc ccggatctca gtggcggaac attgaggagg 2340tttatttctt ctttatacaa atatgctgtg gatcaggatg actctccagg caactgtctg 2400tgggactgtc caggt 24153046766DNAHomo sapiens 304cagtgctttt gcatccttcc tgttaacttg tgtaggaata aaacattgtc acaataagat 60ttttttcctt tttattgttt tgatttttta gccaatgaga aggaaaattc cttattaggg 120agggcgaggg tgaggatatg tggggtgggg agaagcgaac gttccaagtt tcgaaaacag 180cgactctctc ttggactctc tagccagtag aaacctccct cccactctct tgccccaaga 240tctggtgctt agaagagaat caagggaagt tggaacccag aagacggaga cagattgagg 300gactgctgtg aaatgttggg gtgtttggtg aataatatta gaagttgggc tggcagagac 360cctgtcacat aaacattaaa tcaacactgg agactgagca tttgttagaa atgtaagcgg 420gaatggcaga aaacttgttt ttaagggaaa gcatgttacg gcttatgttc agcctccatc 480ctctgaaggc aaaagttagc aaagttgatg tatggcgttg ctttttctgg gaactttatc 540tcgtttggtg gggttcccat ctctgtctcc caggagccaa gactttcccc tccctctgct 600ccagcagaag ccagtctcag gcaaggctcc ctgtacctca tttacacttt ggtgtgaata 660tgttattgta acctctctcc tggaggtgtc tgcattccaa gactgaactt ttctgtgaaa 720gttactgtca ctgtgaaagg cagttcagcc cccagggatt gaaaaaggaa atcattttgg 780gtaaggggac agttagtcca gattttttca gttgcaagta 
aacctaactc agccagtagg 840caaaggggga aattgctggt ttgaactggt gggaagaaag ctgaggaaac tcctacactt 900gggggaagaa ctgcaggtgc ctggctgcag ggaacgcagc gggggctcag gaccaggcag 960atgccctgcc tctgcttccc ttggcacagt ggcctccttc tcccttcaag taggcagatg 1020ctgcctgtgg cagaggacag cagctgattg gcagcccagc agggaggatg tggtagacag 1080gcactgagca tctcttctac cctccttcta gagggctatc ctgtactgtt gaggctaaaa 1140gactgaaaac cacatttccc agcctctctt gcagctacca atctggatga gagttagatt 1200ctacacatta gatgcacttt agcaagattt tcaaaagcag attggagaag gagcccatgc 1260ttctgctggt tttttttgct ggcaagtgag gggttctgtt tttcctggag tgactttatc 1320atggtggcat ctgaaaaagg ctatttcttg atcagagaga cagcaaccct ctcagtgacc 1380tagttctgtg ggtgtgtctc tcctgagagt taatcccaga gctcaaacta gagctcaacc 1440ctagagtctc ttcaggcttc ccaggggtgg gggtgcattt aacagtccaa gttaaagaga 1500aaataaaggc cattaaagac caaacattga gcactgagtg aaaaagtttt attgccaaac 1560aggaaacctg attcaggcca gggtcttgga aggttgttca ggatgagatg ggggaggtga 1620aatggggtag gtctttgaaa accaacagat tgcaaattct ctgtcccata gcaggaaacc 1680acagtctctg atgtcagctg gctgccaaca cgtcagttgt atcagcatta gctggctgga 1740ggtggcctgc tgtgtgcaga tggtacctgg tgcaggattg tggtgtccag gtgtctctcc 1800ttagcacata agaccctgtc cgaggactgt ggcatgacgt gctggagtca cgattctgtc 1860acccagtcag gtcatcagtg tcagagagct aggtggccag gttggagttg attgccaatg 1920ataggtcttt ttctgcttaa atcagctgga ctggattcta ttgcattaac ttgaccctga 1980ctcatgccgc caggcctaat ttataaacca agacaagaaa gggctactcc accccctcca 2040atttgtgtaa ggccagggga cttccccccc actccccaac ctgaggcatg caccctccct 2100tagatcaatg gctgtttctc tgagaatgcg gaaccgtgat taatccagcc ttgatgggga 2160ggcagcagga actgtaggca ttctcacttc acacccatcc caatcccctc ccccttgctg 2220tcctcttgta cagaggactg aaagcacaac actctctccc tccctccctt ataggtggtg 2280acgatcatgt gactctcttc tggtcaatga gatgcagcag aaagtcctag ggaggtctag 2340gaaaagtcct gttgggagag agcatttttt accttctccc tgctacttct tgctactagt 2400aacatggatg tgagccttgg aggggtagct accatctggc acctggggtg gcaagccaac 2460atggaaagga tggcagagcg ggaaggagga gccagcctta ccgatggcat cactgtcact 2520gcgctagccc 
cagaccacct gctccagagt tctggttatg gtaatgaaat aaaccttgat 2580ttttattcct taaaactacc cttcaatggg ttttctgttc attacagttg aatgctttca 2640taactgatac aggagggacc ctgtgattgg cagttccact agactgcatg gagatgggtg 2700gagttatcta aaagaacaga gatagtgtcc ctagaagaag gggacaggaa agcatcctgg 2760gtacacaaaa gtcaaggctc caggatctgc cctgggggct atctcaacac ccctacactc 2820tcaccgcacg tatttggtca gctatgaata tgaccaactc tcgtcgttta tctctattca 2880gtggaacaca gcagcactgt gacctgccca cgagaagaag gatttttaga acttatctta 2940gggcaatttt aggtagagga gcagacaaga tggtgtacag gagaaacagg tctattaacc 3000ctggtattaa tattaactgg ctgcccagaa taaatgaaga atagcttatt ctttgccagg 3060ttgaagatag aaaaggaatg aagggccgga gaagtacagc tgggtgaagc acagagcagc 3120ctagtgcttg gcatgggact cagatctgaa gcagcctctc cgggacttct ctgagcctgc 3180ccctggtggt atgactgtga tatccctgct tctatagttg gcaaccaaca tgtcctagct 3240cctagaccat agagggccag attcatgtct cattgactgt gtaatctctg tgtggcccag 3300tacagagcat gcacaccgta ggttctcaca tatgtttgtt gagtgaatga atacaatacc 3360aaacgaatgg acaggacaga gctgtgggct agcaggaagg atatctggct tttgcttgaa 3420ttagctagtg aattgctgtg tggcctcctt actgagcctc atttccctct gtctgcagag 3480tcaagcaaat cttccatttt ttgttcccct gctgccagag catggcagag taaatgtgtg 3540agttgaaggg agcaacctca tgaggttttg ctttgtgtct taattacagc catttgtgga 3600attaggcttt taatataaat atttgtgtgc ctgcgcctgc atatatgtat ttggaccaat 3660gctctcatgt gtgcaaatac atgtattcta aagaaatctg tccagaaccc cagcatctgt 3720ggtgtctgtg gtgggagggg cttccatatt acagagagat gcccacagtg catgacgtta 3780cccgcacagg tgtgacatca cagggtaacc aaatgctttt gccctggggg tgggagaggg 3840atgggtgcac ggtgaacagc aggtgggggt ctttccatag gggatgagga agacaaggcc 3900acttggaggc agaggagacc acagtggggc atgatggttg gggaaggcct tttacttctg 3960ccccttaagg atgccctgga attcaggctt tcggatccca gagctctcat tagagcagcc 4020ctgcgttgta gacttttctg cagtgacaga aatgttctat atctgtgcta tccaatatgg 4080tagccacaag ttacatgtgg ctattgaaca cttgaaatgg ggttagtgca attgacgagc 4140tgaaaatgta gtttaaattc acttacattt aaatagctgt gtgtggcttg tggctgccta 4200ttggactgtg cagttctgga gaatggtact ttacttgtcc 
ttggggaagc agaaacaaat 4260gaaaacgagg atctggagct catgaagttt ctcatggggt ggggtatgtg tgttgaagct 4320gcaccttcag caggaacctg gccagtcctt agtggaggac atttctttcc atcctgcatc 4380cagatggctg gtcctgctcc tcccagtcca tggagaaaaa agaattgaac aaactgtcta 4440agctgggtca ggtactctgc agatgtttgc tgagtatcgt tcttgatgga aatccccgtg 4500gaactcctac attttctcct ctcttctcct tcctttcaga acctcagagt gacagagcca 4560aaagaccagt gcctcatttt gctgacatgg aaaaggaaac ttcgtggggg aaagagatct 4620gcttgcagtc ggccagagag acagaaccag ggcagtggtg agctctcatg acctggtgtc 4680tgttgccttc tggttaagtt tttcatttgt aattctacaa acatcccttc tgtaaacatt 4740tccctcaaaa tggagcagga agctctcaaa aatggaccag aaaggggtca ggaatataac 4800tttctctgcc cagattccag gacttacagt gagaaagcgc cttctgggaa cttcacaatg 4860gctaaagtgt gctaatggga tgatgtgccc ttgtacaccc actgcctctg aactctgctc 4920tgcattgctg agcaaactac atttcccaga actccttgtt ggattccttc caaacaggtt 4980taccactggg agagcctgtt ggttggggag ggcaggaaga gggaggaaag aggaagggac 5040tcacttcctg tttccagctg aagtctaaat caatccacta tcaacaggta gctatcatac 5100taccctcatt gtcacccctc agaggtccca ctgcagctgc ataatgtccc ctcagtggcc 5160tgaacatgag atgaacaaca ctcttcttgg gagtaccagc cttgcttggt tcatggccac 5220ttttcctgat tatcttgcag ctatattagg tcatgtgaca aagttctggc cagtggcaag 5280ggaacacaag tgataggtac agatagaagt gtctgatact acatagatta tgcttgcact 5340cactcttaag agagagacat gaacttttac caacggaagc cagtattatt ttgaacctct 5400gttagagtgg cttgaatctg tatcctaact tgtatcccta atgtgtgacc catgaaaatt 5460agccaggcag caccagttcc aaagaagctc acactcccct gcggctgctt ctgccaaggt 5520cactgatatt tccctttgct aaatcttgtg ggtgttttct tcagtccttg tcttaatcac 5580tcagtggcac ttggcactta ttccttcttg aaacccttgt ttcccttggc tttgtggcat 5640cctgtgctct tggttttctc ccatatctct gaccctcttt ccttagtctt ttttcttctt 5700cctcctgtcc cttaaatgct ggttgtgatc ctctttttat ctcattctac acactcacag 5760cctgagtaat tcacaccatc ttgatgctga gaacttccaa aatgttggtc tagcctgggt 5820cattgttatg agctctagac tcacaaggcc aattgcttgg tgggaacccc tcccccatgg 5880ttatctcatg ggtccctgaa gtccaacttc tccttcattg aactcatcac ctcttctgtt 5940cctcctcctg 
ggttcccagg ctcagtggtg gcaccactgt ctacctggct gcttagcctg 6000agacctggct ccgtcccaat tcctctctct cagtcttatc atccccatcc aggcaaatca 6060ttgattctgt ggacctactc tttcgggtgt ccctcaaatc tctccacgtc tctgtgttct 6120cactagcact accttggtcc accctgccat ctgctttcct cctccactcc tgcattctga 6180gtcattttcg gcagcacacg catccttaaa acccctccac tggcttgcca gtgtcctcag 6240gattaggcga aaagtctttg ctttgtttta caaggccctt cgctatctgg ccccctcatt 6300acctcccttg ctctgcatgc tccagtcctg cagaactaca cacagttccc ccaacaaggc 6360cctgctctgt tcttcccaca cactgctcct ctgcctgggc cactcttcct gctccttgtc 6420agcaggcttg ctgctctcag gctcagcatg gacagctgct tctgagagcc ttctctgcct 6480acccaggctg ggtggctgcc tctctttggt gtgcccatgg cagcccagaa tgcctggtgg 6540acagggagcc ctcagcaggc cgtactgcag cgccctgccc ccgtcagcct ccaggagcct 6600ggagtccagg gacatcaagg gcggtcctgt ctttctcacc cttgtctctc cagcccctaa 6660cacaggggat gcctgacccc aaactagacg agttacttga cctctctgac ccaagacaaa 6720atgggaggaa agtgccaaat ttccaagatt ggccagggga ttaaat 67663055610DNAHomo sapiens 305cacctgtgcc catcacatag ctggggcaca gctggagacc ccaacagaga ggagagctga 60tgggtgacga gaaatcaggc ctctccgcca cggcagccta gctaatgggt cttggctgga 120agctaacagg aaggcctctt tccagaaaca ctgtaagcca gtgtttctca gattgctggg 180tgtaattcat aggcagatca tgaaatcagt ttaatagctt tgaccagcat taacctattt 240atgcctagcg ttcccttatt ggaacactaa gtctgtgaga gttatttaca tcctactgct 300taaggtcatc gccaaaatct gattttttac acaaaaaatt tgcaacctcc agcataaatg 360ggttaaaaca agacaaaaca aaacaatacc agaatggaaa atagtgcatg atctgtacag 420tatagttgta gaaaacttct tgttttatca tttgatgtca tgaaagtccc tgctgtagat 480aaaagatgga gcttgtgctt ctgagtggtc atgctcaaca gggtggggag cccaggggag 540tggggagtga tcgtatagac agaggtgggt ggggccagtg tgagcctgat ggtcaattac 600ttctcatttc tagggaaaat tgaaggaaaa gaaggagggg gatgtggagg ggagagaagg 660cctcagtaga gtttgcacta ttattagggc aagtaagctg cttctgaaaa gaaggggttt 720gcaaagccaa cccaggcaaa agcaatctgc tggaagaact tcatccccag ctgacactgt 780gggaaggacc ccatgcagaa gcaatagggc agcctggtcc catatcctca tgaaatgcct 840cttataattg tgacatcttg caattgtgga ggactttaca cttttcggag 
ttcctagccc 900ctcacttatt tctcgtaaga ccgctgggag gtggggggat ggtatcatca tcccacttta 960gagatgagga aacaggatca gagtgagcta aatgactgcc agatccaaaa ctagaattca 1020gacctcctag tttctaagtg gacgctcttt ctacaccacc ataatgtgag tgttctgtgt 1080ttacagggtg tattcaagtc catgactgcc cattagaatc cccccaaaaa attccaggac 1140tggcctgagt tgctccttag accaatgaaa tcagactcct gggagtacgg cccgggcctc 1200gggatccttt aaagctccat ttggagagcc tcgggcacag ccaggttgga tccatctccc 1260agtcccccag ccttggctca gcctggccaa gctgcccagg aggtcccttg gtgccctggg 1320ctctgtttca ctgttgtttt gtagagcaac ttcccagtga tgctgccact gggccccatc 1380ctaacagtga agtcccccgg gccctcctga gaggaggtgt gaactggaag atggggaggc 1440aggcggctct gacagacaga aagcaaacag ctcagagggg tggcaggctg cattttattc 1500atcgttaatt taaacaccct tcaagtcctc tcttggaatg ctgctcagaa aaatagatgt 1560attgtttgag aaaccctgca ggcttgtccc gcatgctcta gccccctcct gagagaacag 1620atagcataaa aaatgatttg taaagcaagg gggagcttcc ttagggaaga aggggaaggg 1680gaagagggtt tggggccagg tccgagtgca gaaatcctca atgcatgaga ctagcgtgga 1740aggtgtagca attgtgctct ggggtgcctg aaagtgccag agctgcttca ggggcaagag 1800tccaggcccc aagtccatgc tgatgagccc accctggggg tcaggaatgg cctcagcagg 1860ccctccctcc ctccctctcc accctacaaa gtgaggagcc ttgagtcacc accagcacat 1920tatacaacaa tacaagaacc ctgcaacaga taaagcccca gcgcctcttc tggactcaga 1980tgccctaggc tggctgtctg gctgtgcttt ccagacagtg tgtatgtgga attgtgcttt 2040ttgtttttta agaatgtaaa aagttacagt aagatcgaac cacagggccc gtcgctccta 2100tggtctctgc ctgactgggc tgccgtctgc ctcagttccc cagaagcttc tcctttggcc 2160atgagggctc agtcatccct caccccagag tccacaggaa gagggggtct gctgggaggc 2220ctgtctgaag gacggaggat cctgggtcaa tttagcagct attttccagg gtttggcttg 2280ggtttggatg ctggcttctg tgtgaaacct gaatacatgc aaattgtaca taaaactccc 2340ccaaggcaga gagggatttt ccaggccctg gtacatctct agagagttaa aaatgggaaa 2400tctttcttct taaagtggcc cagactgaga cttttccttg gggaaaaggg ttagtagctc 2460tttgtaaggc tggtgtgtat gtgtgtgtgt atatatatat acatatatgc atgatgctgt 2520gcaaatgccc agggctgtct ggcattttcc acaaaatgag agcctgagat tgcctaagcc 2580ttctgatgcc ttctccaggc 
ctggaggcac tgcttcattc agaggacaca aaggcctgac 2640cacctggctt tagcaagcta ggacacccag ggtggcttct ttacctttct cctcagctct 2700gagaaggctg ctagccaaga ctctggattc tctgtggcca cagtcatatg gtgagggcct 2760cttggagttc attcaaactt taagggagcc ccacagcacc ggcatgatgg gtaagtccag 2820gcctaaggtt aggaagcaaa tcctggagca tgaggaaatt gtaggctaca gtgagctacc 2880agtggtgtgc aaactggaga cccccaagac agtgagagag gccacagcat ctgagggaat 2940ggagctcttt cttggcctga ggttcagaag aacctgcacc aaagaaaggc atccctatca 3000atgtcactgt tcctgaaatg atgggagaac cacatccctg cttcagggaa gcagtccctg 3060tcgtctgggg cgctgagccc tttggcctga gatgaaggat gatggtgtga tgtatcatgg 3120cagtgtgact gagactggat tgggggatgg ggacagggga acataggcaa aaatacacat 3180gtgccactgg atcctgagct gccattgtac cttggaggac tggcgtttct ctgggaagtt 3240gggaggtggg aagaggaagg gtctcatttt cctgcccctt gaaaccatgc ttaccattcc 3300tttagaagat tgctcaagct gcctccaatt gcctctttcc aaaaccaaag cataggaaaa 3360caagtaaaaa cagctgaggc tgcagcataa gcaacttagg atagagtcta ggaagcaccg 3420ccaacagaga agactgccaa gaaacatttt gagtttttct tctctggagg tgggtcctgg 3480ttcctcccat ggagaccacg attctgtgta gtcctgcacg ctgggcgggg gattgcctgg 3540aggtttcttt agacctgtct agctcacaca gtcttgatgc ctgggtttta ggctgctgta 3600ctgttgctgg ggctcacttc ctgtgggtag gctgttattt tgcccgcaga tcaagtcctc 3660actgtctaga tgcctctatc atggggatct cttcttccct ctctggatgg ctctgatccc 3720caagttattt cctgttgcct aggtaacacc tctaattgga tgccttttaa tcgttccctt 3780ttttaaaggg
ataaatgtgg attttatttc caggtcctgt cagagggccc tgccctagag 3840aacacgtgcg cccctgcgtg ggcaatccct tcactgtgac cgcaaccatg ggttggatgg 3900ggggcactca ctgggctggc ctgacagtca cagtgaatcc tgaaagcatg gttttcacag 3960gaacccacct tcaggattta gcaagactga cgtctctcct ggccagcgct gcttcactgg 4020cttcacccca gattagggcc tgtgtttaaa aaccaatccc aactcaaatc agaaattacc 4080caaaatagct ggagagtcac tgagatctca ggtaagcttt cccttcctgc catgaactag 4140aaggggaaag aagagtttga cattcaagtt tgactctaat gctgggtgcg tgagcgcatg 4200cgtgcatgtt tgtgtgtgtg tgtgttccac gcacatttgc cagggagaga gatttcacag 4260catggctcca gctggaggcg gtgaggcggt gcttttctaa gacttcctat cagaagctgt 4320gcatactggt gggtcacgcc gtgcctgtat aaactctggc acctgtcctt gccctcatca 4380tatatgagaa aaatgggcag agagagtgtt cgtttacacc cccagaccac tatcctttca 4440atgaagcctg ggtatctggc cttcctccag gtcagggacc ccctatgctg cagaaggcaa 4500gtctgggaga atctgtccct cagcccgaga gcaaaactgt aatcctaaca ttacttccat 4560ccaccagttt caccagctac ctccctcctg ccttcctctg cctccaatag gctgtgcatg 4620gagaagacaa atcctcttga taaacaatat ttagaaaggg attctatctt tcctgacccc 4680aaacacatca tggcctctgg agccaaatac cctgacattt gcaagatggc ttcttttggg 4740ttcctggtgc tgcaggccct ggttcccaag gacgcagctg gcagaggtgc ctccttcaga 4800ggaggaggag gagaagctgg agggctgcgc ccggcaaccc catgatctct taaaggggga 4860aaagttgaac tgatcaacag tagttaagaa aaaaaaaatc cacaccaaca aataaatatc 4920ttgtctgaga agactcagat attcctggtt aatattgaaa agcactgctg tggatgagct 4980tgtgaaagaa aggacggttg ggggattcaa gatctgccga tccgagcctg gagatcagcc 5040agctaaaagc ccagcagggc tcctgcagcc tcaccgctcc cctcctcaca ggtgccctgg 5100accgcccacc attagaagta gctgccctgt gctctgtgct aaatggacta actctgagct 5160gagaaaggcc agctaagccc ctcaccactg caatttccaa atctggggga aatgccacag 5220tccgcaagtt ggtgctatgt ttcatctcat tgcataatac tacaccattc tctgtgtgta 5280gtggctgttc tatatatata catcgggagg caacatatgg ctgtcccaac ccccacctgt 5340caaaactgtg actatatcac ttctgacgac cagaaggaag ctgctaggct gggccaggat 5400tctaaatgct gaggaggtaa ttcagagcca cgaaaagttg caccatatgc tttggggttg 5460ccggctgctt ctgtgcatgg ggacggggtt tagtgccagt 
ctgcaaaacc ctcctcgctg 5520cggtatgccc tgggtgtggg cctggggggc cacgttttct ctccctgaag gagaatctgc 5580tgggggccac ggttctccaa gaggggactc 56103064189DNAHomo sapiens 306aaagccattt ccagtgtcgg cccatttaat aaactggttc cacctggatt ttctcttcat 60tgtggtgaaa gccaccacct aacaatgctg gccctgcctg catcatcgca tgtcatcatg 120acaacctatg ggggagaaca gctccattca acagatgcag aaactgcagg ttaaagggtg 180aaaagcagtt gcgaagtccc cacagcttgg aaatggtgga gccgagactg aaacccaggt 240gtgctagcat ctaaagttca tgctctttcc accacattag actgtattct gagggcacca 300aggaagctcc atttttctta agaaaccaaa ttgcagtcct ccaggaccac agccagggga 360gcatcttcgt gggagagtgg ctgctgctca gagttgtgac tcccatcctt aagagtcctc 420tgtcctctct ggcctccttt ctactgatca ttgctgtccc cttcacaggg gagaggggcc 480atggcctatc ccctaaagag tctgccaagg tagactcata acctccccgt ggcacagctc 540agacaagctg ggctatttac ataagacttg acccagggct tgaggacagc gcgaggaatg 600aggtgcagag gagactgctg cttctgggtg acagtctgcc tggctgacca cagctggggt 660actcattggc ctcttgaggc cccccacagg cctgccctgc ctgacctact cttgtgaggc 720caaggccatc tcctccactc tctgggggcc tcttctgact cctccaaact cttccatgtc 780tggactcctg gcttctgccc aaggccatct attggagttt gggtttccag ttgaggatct 840gcctttttcc tggatgacca aacctagaat gtgaccggcc tcatgctccc tcctcacaag 900ggtgtggctt tatccgaggg cctcagcaag gcaaccaaca ccagacatga agctggtaag 960accaacatcc acctattcat tccttcagca aacatttact gtggacagca tcaggttggc 1020agtatcatgc aaggctcaga aatatggtgg aaaaccagac agatgcagtt cctgtcctca 1080gtctctaggc tgccttccta gaggcccctc actgggtttc ttagcagttt tgtacagatc 1140ctaccccctt tgctgccagc aggctcacct ctgggacagg gcgcatatag tctgggccag 1200aacttgtcct ggggtcttct tacggccctg gttcagcttg cattcagcat aacaacttag 1260ctagggagtg ctgcaggccc caaatgatgc taaatactag actagctgtg taacaccatc 1320tgcccagaat gaagggacag gtgaggcaga agggtctccg acagcgcaca gggcaaccag 1380tgaaagcgtc cttactgtcc tgttcctgag gtctctctgt gcctgcttta ctgcccttcg 1440ctttcctaca gagcacactc agctcatcct gggagacaag gtgggggtgg aggatggtcc 1500atccttcttc cgcatcaagg tcagtaggtt cagagctctg ggggggtgct gagaccctgg 1560gacaggcttc ctgctgaggg cactggggcc 
tatgcttgtg ccactgccta gccagttgcc 1620tcccagagta gagaagcagt ctcccaagct cttgcaattt gtggggagcc aagctgctct 1680ggagaggggc ctcaaagctt cagccagaga aaaggcaaac ccagccaccc tgagaatctc 1740ctcctccccc tcaatcacac cctgcagagg cgtgatctgt ccctgggttc gcaccaagcc 1800tgctattttg tttatgccac aattgatctg ccatcccagt ttgcaaagag cagacacttg 1860ggggctttat tatgccactt tgacaaaagc tgtgaagctc gttcccacag cctgtctggt 1920gccgccttcg caaatggggc cctggtgatg gggccttcgg agttcagctc agagagcatg 1980gaagtgagat ggagaggcca gcactgatct gtatcgtgca gccctgggcg gcagcctcgg 2040ttgggccctt gacacactcc tcccatccag gcccccagcc accctgtgag ggaggcacta 2100ttacgcccaa aaatgcaggc aaggaaatgg gctgagggag gggaagcatt cactgaagtt 2160agtttgtacg tggctgagct ggccctgaag cccatgccct ttccacttgc cagacagatg 2220ggaagtcttg actcattacc cgctggagac ttttcctgct gggctctgca ctgtcaactg 2280tgagagaggg aaataaaacg tactgtacag tccaatctgg gatacttctg agagtgaaag 2340tggctctact agtaattacc ccaggacagc atgtataaac cagggctgtt ccaagcaact 2400gggacacatg attaaaatgc agattccatg gcaggtcccg ctcagaggtt tatttagtgg 2460gtcagaaaat gggcccagga atttcatttt aacaaatgtc tccagataaa tctgatgtaa 2520atggaatatt cctttaagaa tgccattcct ttaagaaata atgttaataa ggtattccag 2580atgaccctat tggttggaat ctgtccaact acaaatattt tgatttaaat ttccattgac 2640ctaaaatttt tgtggtgggc actgcagctc tctccttacc aatcattctc ccagcctgta 2700ctatattgag tagcagccag gctacttgga gaacagactg aactccagga atgggctcta 2760ttagtctagg ccaatcagga taatccagtt ccttaccatg actggctcaa gaatgggtag 2820acctaagtca atcagtgcag agcattttca tgactacaga gaaaccatgg gaaggctgag 2880gcgtggactc agtgggcagg gatggaaaaa gactcagaaa tactgggcat ggcccatggc 2940tcattagggt tgccaggtaa aatacagaac acccagttaa atatgacttg ggtaaacaag 3000aaatcatttt ttaagtataa gtatgtccca aatattgcat aggacatact tatactaaaa 3060tatagttgat tatctgaaat tcaaatttca ctgggcatgc tgtcttttta tttcctaaac 3120ctggcaaccc tactcatgac tccacttgtc agcttggtac actcctctga gaagcttctc 3180ccaatattcc catctcattt gatccttcct gactggccag tgttgggatt aagagcctca 3240ctttaatcaa ggaacccaag gctcacacaa ggctgagccc tgcccagcca ggccagcggc 
3300caggcctccc catgtccctc tctctcctaa cagggaggag tgggagatgg gggagggctg 3360tgggatggag ggcggggctg ccaacagcct gctccgtggc tggaactgcg acatcgcctc 3420tctgggagta ggagtgggct tctggccaga ctcagtgggg gaggactgga cacttgagag 3480ggcactgggc caaagacttc cctgggacat gtgccccagc cctggtacct cagggctgca 3540catgtcagcc atctccatgt cacacccccg gggaggacaa ccgaccaccg tggacaagcc 3600ctgaaccttt tgggaaagct ggtgctaaaa gaatagctgg agaagtcact actgaggact 3660gaaaatgcgg agggtataat aactgctgtt gtgtatcgag cctcactatt tccacgcacc 3720gtgccaagtg ctgcacgcgt atcaattcat ttaatcctca caacaactgc atgaggctcc 3780atgttttcac tgattccaag tcacagcttt tttcctctca tgttaacatc tctaaaatcg 3840tgcatcttac agtcaatggc ctgatagttt atttggcagt attttaaatt cctaatggta 3900cgtaccatga tggtgtgttc ataaccaata gtgtcttaga tttgatgaac tatagtgggg 3960atgattattg tccccagttg tgagattagt aaactgagtt gcatttaaca gataaagaca 4020ttgaagttca gtaacttgcc caagatcaca aagcaagcac atggcagaga tttgaaacta 4080gaggaaggag cttgcaatgt gataaagcag caaatgtaca agagtttgga agaaggagag 4140gtaggttgct tttggcaaga gcagcaattc tcaattctgg ctgcacaat 41893073729DNAHomo sapiens 307ttcacagcaa agtggctcag gtgaggcagg caaggaatgg gcaaatcacg acatgacata 60tggatttcca tggcagggaa atgcccccgt aggcacagtc aagcctggct ctaccattgg 120ctcgccgttc tcctacctcg ctgggcctcc atctccccac ctctggctca cttcctgctt 180tggcccctac gctgggtagg aggcccggct agaggttagg caccatcttt tccagtcccc 240aaagtgagag tgtgtgtgtg tgggagagat atttttaaat ggggctgttg tggaaaagct 300gagaccgtgg gctgctctat ttgttggcgc ttgctggttt gtctgatttg cagagctgga 360tggactgctc cctgaggaca gaagctcttg gttttcttcc ctccgaagcc aggcgtgggg 420tgggagcatc cagtgcaccc ctcttgcatt gggtgcgcag tgatccggac agagaggctc 480cagtcagcca ggcacagaga aaatggccct ctgcccctgt tctgcttgtt tttgtcttgt 540tctctggggg cctttgaggt gactttcttc atttgatgac aacaagatgg gaggcgggga 600cagctgaggt ggcaggagta ggggagctag ggacagagga tgaaccccac aggctcaggc 660cagtgacttc taacattaga gaggttttgg ttaactggga gcaaatgcaa gtgacttctt 720tgaatcgact ttgtacctcg gcacagcctt ccttgctagc agggctgact tcaaccaccc 780cccactctgt gctttatctc tgggattaag 
gttttctctc ctcaccagaa atcattcagc 840aaaatgagtt attaaaagcc ggttaaccac tcctgcctcc gggtagctcc cgtttaacaa 900cctctcctgg ggagcagctg tcaagctcgg ccctgagctg gcgggaagat gactcattta 960catacagccc gtctccaggc cccccccacc gccaccccaa gatctgtccc tgtctccctg 1020atgactaatc ctttccaggg atgagatcac tgccccttct aacccccccc cccgccccca 1080cacacagaaa gagcagagcc ctcatctcag cccagaattt tgggagaaga ctaaatccaa 1140gaccaaggga ggcctttgat gggacaaaga cgtgactgat gaacccggag tgaggagcaa 1200tgagatgaag aaagctctgc ccacctaccc cgtccctcac tcctccctcc cacctcaggg 1260cgctcatgtg gggcttgtgt ggggaacagc tccagggtca taccacctct cagaagggag 1320acagaccagc caggcgtgag gtgacagacc agcgggcagc tcagagcagc aagacaatgt 1380caattcaatc actttacctc aattcctcta tcacacagga ggagatttta aaaggaagtc 1440tctggtggtt tgtaaagcaa caaatcctgc tctcaagtgg atagttccaa gccctctcaa 1500tgaattcagt tttatacacc tggagaagca cagcctcgtc ctttccatgg agctacaagc 1560cacatctggg ggcgctcagt gcccaggctg agggggcacg cagagccctc ggggacgact 1620caatgcacag aggccactcc ttaagggccc ggctccctca aactgaggtg tccccatgct 1680tggtcttccc acagaagcca gcctggttgg ctgcttcaaa ggaggaataa agatgaggag 1740ccatgatgca aacaaaccca cacctttcag ctgcagccag ggaggtgctc tagaggccca 1800cggagagctg tgtgtctgct ctgctagccc gacctgcacc tgccctatgg gctggtaaag 1860gggctgccca cagcacctca gcacatgggt ctctctctct tttcatccag cccaaaatgt 1920caaagcacaa gggtctctgt cagggcctgg ctgtggtcac tggactgcgg ctgaggggta 1980aggtgcaccc ctcctctaat gggggcgcac ccctcctcta atgggggtgg ggctggagct 2040aatggcacat tccactctca gctgccacac acagatgggg aggttgatgg cccgcacaca 2100ggaagtgagg gatggtgggg actgaattta tggagcccct atcccagacc aagcactctg 2160ctggtacttt cacaggtgta atccccagag cagctcctgg gggaggtgtc attatgccgg 2220tgaggaaacc aaggctcaga gaagtaaggc agcacaggtc ccagccccac tccacctctt 2280gaggcctgac tcagccttag gttcagagaa tgcaactgtg atttttccct gagatgagca 2340ttcaatcata ctgcgccagg gtacttgctg tggccaaaac gcctgccctg gatctgtggc 2400atgactttgt gtcagaccct gtgcgataga aggagatgga agctgaggtt cagagacgtt 2460aatgaccttg aaggtcaggg tcacagaaaa tggcacaacc gggattgcaa ctcagttctg 
2520cccaatttca aatctacctt cctgccacct ccttgccttg cctattgtcc ccctcccttc 2580atatgacctg ggacccagac tccctggttt tctggaaata ttctctctcc ttttggctca 2640ttctgtgcac tgtcccagtt ggtggtattg aggcacagct ctgcccagat cactctccag 2700ctcagctgcc ctgagctccc cagccccacc tttcaaggtc aggaatgact tatttccttt 2760tctcttcgcc tgtctgaatc ctcgccatca tctgacagca ctttcagctc accaggcatt 2820taactgtgtg ctgtcttgtg acagcccctg tcctgaccat tgtcccagag atttaaccct 2880ttgtgttttt ctatgttagc ttgccagtga ggacatggcc catgtcttgg acttcttgcc 2940accctccaga atgcaggtcc tcagggagca gctgctaaat tcccaacaga actgacagtt 3000tgtccagtga tcaccagcag acactgtacc agaaacattg tctccatctt agtttcagtt 3060aactcaacat gtttttgaga cctgctgtgt gccagacatg agggcagggg aaggaagact 3120atggtgaata agattgccca cagacctcca ggaacacacc tgtagtcaaa acacaattga 3180gcttactggc tcactgcaat gaggaagctt ggccaccatg gattctgggg catctcagtc 3240agagggtatc aaagagggct aattctagga gttgggcttg agttcttgag ttaggtgatt 3300tatggaggac ttaaggaagc aggcttcgct ctggatggga tgctgccaaa gagcggaagt 3360acttctatga ctgggcatct taattcttag ctggaagatg ggaacgacat agcgaggcca 3420agctgtgatt ggccaagaag cagcagttgc tcatactaac cagatgaggg atgtttggtc 3480tttttagtgg tttggacgat gttcttgttt tggtctgtgt ttggacatga ttatggggtg 3540aatggtcttg tttttctctc actccatctt ggtcataagg cggccttgcc tgatcagggg 3600ttctgtgaaa tgcttatgct ccatggcaga tcctcccagc cccactgtga gtgccaggcc 3660agctctcggc gctcaggggc tgccttcatc tttctcaaaa tgctgctgtg ctttgcagcc 3720cataatagc 3729308774DNAHomo sapiens 308ttcactccca acaagcactg ttaagtttat aaataaaaat gaataagata tgatccctgt 60cctcatacat aatgaccaca atgccctatg aaaatgacag taagggagat gtcaggggat 120tccgggaagc aagagaagcg gtttgggagg gtgccccaga gcaggtgcca ttggaactga 180gtagtgaaga tgccttagcc aggagatgga ggcagttggg cacaaggtct gcaaggtcca 240ggttggcatt caagggttca ggtgtcatgt ggataagact ttccagggag cacgtgtggt 300atgagaatga aggcccctga ggaacttcag tatctaaagg gcagaagagg agtctatgaa 360ggagaccaca ttcattcaac aaacatttat tgagggccta ctgtggccag gaactgtgct 420aggcccttgg gattcaacag tgatctaaaa gacaaagtcc cctaccctca agaagcttac 
480atttcagtat gggagatgga ataataaact acaaacaatt acgtggcctg tgagaaagtg 540gcaagtacta caaaaaataa aaaatagagc aggctaagag ggtgaggagt gtgcgggcag 600ggaggaggca ggtcatggtt ttaagtgggg tgggtccagg gaggcctcgt taaggaacgg 660atgtttgata aaggactagg gtaagagtgt ttcaggctag tcaacagcac gagcaaaggc 720cctgaggcag aggggctgga aacagagaag aaaagagtga aaaaggactg ggca 7743091782DNAHomo sapiens 309aatccctcct gcacactttc agggcaaagt ttaggtgata taaatgtccc tgaaatgaga 60aaaaccatga ctttcatttg attttaatgt gagggagaaa cataaactag tagttttaca 120aaaagaaaaa gaaatataat attcaagtag atttcaagca acagcagata tgctgaattt 180atttgataac tgtcttcttt ttctctgtca gcagagtctc atgcaatttt aaaaggaaat 240tcgatgaaac gaacacccat caacatcttt aatagctgca agcaatgtgg agcaaatttt 300ttgtcttatt taatgtggtc atcaccataa cccagtaaag acaatatcat cattgctccc 360attttgtaga cagggaaact gaatccagga taaataatgt agcttgcatg acaccaatct 420tcctcaagtc tgagccagaa tttatatctc ccatttctca acctcatctc tcaagcctat 480aatctttcag ttataagaag ggaaacactt gagggtgtat cagtttgtgt tttgttcata 540gtgtttatat gctctcaatc aaggactgtt tattaaaaaa ttttaggagg tggtagtcaa 600aaagtgtctc tggctgcagt actggggaca gactgcaggg gtgagttcag cgagtctagt 660tcagaggctg tggatcaaac aggtggggtg gcccagacca ggagagtagc caaaagggga 720ctaaggaagg gaggccaaag ggaggccaca aacgcaggag aggatgaagg tgctgaagct 780agaggccatt caggaaggaa ggaatgacgg ggcaaggggt cagaaatttc tagaagaatc 840tagtaagatg gaaacctaac agtcctactg ggatttggca actgggagga agctggctct 900gtgcaggaaa gaagggggca ccgctgtgcg gacgccagac tgcgaagggc tgcagaagga 960gccgaaaggg gaagaaacgg acgcaggtag gggtggctgc tgttaaagcc gcttcccggg 1020gaggccaagg acatccacag ctgaagtgct caggaccatc cacagctgaa gtgctcagac 1080actgcgtttt ctttatctca gagaggctgt gtgacttgcc cacgtatgag tacagtggct 1140aaatcacaag ccctggagtc aagggtttag gttgatccag cccccactac tcactggtgg 1200ctgtcagcaa gctactcgct gtgcctcagt ttccccatct atcaagtaga cagcactgcc 1260ttacagatgg ttgtggggat cagaggggag gggacagctg gcggatttag cagagtacgt 1320ggcacagagg aaacactaaa tatgcttctt cagctcctta tcaaggttag gcctccacaa 1380agggtggagc agggaagaga aggcctcacc 
gggcagacct atcttggaga agatacaagc 1440aatggtgctg aagtttcaca acagtgtcaa ccccctccct catgtgtgta ctcacagcta 1500ctcactttcc tactctgtgc cagccatgag gtgtagtcac tgtgccaggg ggctgagtgt 1560ccggcctggg acgtgagagg gcatgggctc acctgctcag ggtttgaatg agaccccggt 1620aaccgcagca gtaaagaccc ctcaaatgcc atctctaaat taaaatgggt gatcagaaaa 1680tagcaggtga acgatagtgc cctcactgcc cacagaagtg ccttcagtca gatttagcgc 1740tccatcttct gcctttctga agggacagtg gaagcatcca tt 17823101801DNAHomo sapiens 310ccatctgtgc aattccttcc tcctagaatt cagaatctga ggtgctggtt tcctgaggac 60acttgtgact tgctgccttt tattgaactc tgagtgccct attgcccagt ttgagtgttc 120caatgggaag tgcagagcca ccgtggccat tcattgctgt agagctgcgc cccagtacct 180gatacatccc tcaccctttt ccaattgatt tttagcttcc ttcatccctc cctctttccc 240ttgtcctctt cgtgtccaca ggaagcctgt tgggagcctg ctatggcaag tgctgtgcta 300ggacacggtc ctgcactctt agagtttgtg gttcagttat tccagtttca gcacttacat 360tcattcaaat gctttgtgga agcaagctgg cttttagtca ccagcaatag caatttctga 420aaatcaccaa gccacaccaa atatatgaaa tatctttctc taaggtggtc tttaaaattt 480gggctgactc tcctccctct aggaatgttc tgatgagttt cagtctgaag gcagggagat 540ggtctcggtg acctcctggg cccctgttct gcactgaact gtatgcccat acattcatag 600gttgagatcg taacactcca gtacctcaga atgttactgc attggtagaa aggcttttta 660aaaaagggaa tcaaggtaaa acgaggccat tagggtgagc cctaatccaa tatggctggt 720gtcctcacag gaagagtgta ttaagataca gacatacaca aggaaaacca cgtgaagata 780tggagaaggt ggttgtctgc aagccaagga gagagtcctc aggagaaacc aaccctgcca 840gccccttgat cttagacttc tggcctccag aattgtgaga aaatacattt ccattgttta 900agtcccccag tccgtggtac tttgttatgg cagccggaag gagactgggg ccgcctgttt 960gcttggctgc agaagcccca cgtggctgca ccctggctca ttctgttttc tgtagcagca 1020gcagcagcag cagcggcagc agggagccca ggatgcaaag cttggtttct gagccctgat 1080caggaggctg tgtttatatt tatcctgcta actgcagggg actgtttatt cccagagaaa 1140taacctcctg ggcaggatag gggcagccaa ggaaccagct gcttccatca ggcctgctgg 1200gctcctccag gttctcatca taccacttct gtcgaggctc tctctgacgc agctctcctc 1260actccacacc aggcttgggc ccaggggcac agcctggtct tcctgaggat gctcagacgc 1320agggaccgac 
tgctcctcac aagcaccctg gcacatgcac agcccaggga ctggagcctt 1380cgcaaacaag tcacagtcct agtctgagat tcagtgcaac actaggcgct tagtagatgc 1440tcagtaaaca gaacaacaag gattttcttt tttagtttta aaacattagt ctacccatgc 1500cttgataaac tgtaaaatgc ctctgccacc cattctccct tcttgctccc tttcatggga 1560gctctgaggg gaaggtctct ggggtgggtt ccagcaaccc tgggcctgtt ctggggtcct 1620gcagccaggt tgggctttca ggagcctata tttcatctgg gccccagtca cactacatag 1680atttttgttt tatcacagaa atcactgcca cactgtgacc cttaaggtcc tcagcaggga 1740tggcgcgagg tgagagtatc aaagccaggt gagagcactc agatggcttc tgcctttgaa 1800c 18013111032DNAHomo sapiens 311gtctggattc tttcacaatg tagcataatg ctctggagtt tcagccatgt tgcagcatgc 60atcagtactt catttctttt tatagctgaa taatattcca tagtatttat atatcaaaat 120ttgtttatcc attaacctgt ggagggacat ttaggctgtt tccacctttt ggctattgtg 180aatggtgcta ctataaacat gtgtacacat gcctgtttaa gtatatgttt tcagttcttt 240ggggtatata cctaggagtg gaattgtaga atcatgtggt aattttgttt aactttttgg 300aaaaatatca agctgtaccc aaagtggttg caccattttg catttccacc agcaaaatgt 360gagagttcca gtttctccat atccttgcca atacttattt ttctttttaa aaaatagcta 420tcctagtaca tgggaagtga cattcattgt ggttttaatt tgcatttccc taatgattag 480tgatgttgag catcttttca tgtgtttatt agtcatctgg atatctttgg agaaatggct 540attcaagccc tttgtccatt tttaactggg ttgttcggtt ttgttgttga gttgtaggag 600ttcattatgt attctggata ttaatcactt acctgataca
tgatttgcaa atattttctc 660ccattctgtg ggatgccttt tcattctctt catagtgtcc tttgatacac aaaagttttt 720cattttgatg aagtccaatt cacctgtttt tttcttgacc aaaaagtaga aacaactgaa 780atgtccacca actcatgaac agataaacaa aatgtgtata taatgggata tattcagcca 840taaaatgaat gaagtacaaa cacatacaac atggatgaac cttggaaact ttatgctaag 900tgaatacagt cagatacaaa aagggaacta ttgtataatt ctatgcatgt gaggtacaca 960gaatagtcat tttcataagg acaggaaatg gaatagtggt tagcaggggc tgaacagagg 1020agaagattgg ca 1032312627DNAHomo sapiens 312aaccacccaa tgtgttcacc ttgcccgctg cctagacaga gccgatttat caagacagga 60taactgcaat ggagaaagag taattcacac agagctggct gtgcaggaaa ccggagtttt 120attattactc aaatcagtct ccccaagcat tcggggatca gggtttttaa agataatttg 180gcaggtagga gtttgggaag tggggagtgc tgattggtca ggttagagat ggaatcatag 240gtggttgaag tgagtttttc ttgctgtctt ctgttcttgg gtgtgatggc agaactggtt 300gagccagatt cctggtctga gtggtgtcag ctgatccatt gagtgtaggg tctgcaaata 360tctcaagcac tgatcttagg ttttacaata gtgatgttat ccccagaagc aattagggga 420agttcagact ctaggcgcca gaggtggcat gatccctaaa ctgtaatttc taatcttgta 480gctaatttgt tagttcgcaa aggcagactg gtccccaggc aagaaggggg tcttttcagg 540aaagggctgt tattaatttt gtttcagagt caaaccatga actgaattcc ttcccaaggt 600tagtttggcc tactcgcagg aatgaac 627313907DNAHomo sapiens 313gccaaaggtg gaaaatgttg atgtagactt ctaagatttt gacaaaattt tgttttatgg 60cctggtggtt atataaatat ttactgtata acaattcatt aagatacaca tttgtgtttt 120ttgtatatat gtgttctatt tcacaatctt aaatgttcct taattaatta atggagcaca 180ccttcagagt tgggtgggaa aataattctg cctagaaatc caaacttaga caagctagct 240atcaagactg aggacaaact aaagccattc ttacacctgt aaggattcag ggtttatcta 300ctatttatgc tatctgaagg agacaattga atatgttggc caggaaacca agtgtgagga 360gtatgtagaa aacagaagat gatagtacta accctgttaa tctaataaaa agaaacccca 420ggatgactgc ttgcagtggg gtttgaaaga aatctattca aattaaaaca ggaggtccat 480gtgctccaaa aagatattct ttttttttaa atatatatat atcttttatt atactttaag 540ttctagggta catgtacaca acgtgcaggt ttgttacata tgcatacatg tgccatgttg 600gtgtgctgca cccattaact cctcatttac attaggtatg tctcctaatg ctatccctcc 660cccctcccct 
accccataac aggccccagt gtgtgaaaaa acgatagtta gatgccacga 720actaggtggc aatgccttaa ccgtatgtgt gttgtcaggc ctgagggcct cttccatcct 780tgtcaagggg agtactaacc ttctcccctt tcatacaaca caaagatatt cttaagactt 840ctagaataga ccctgaacaa ttttagagta aggaactaat agatatcagt gctttcatga 900agaaggc 907314769DNAHomo sapiens 314ccagaaggtg gaagctacag tgagccaaca gagtgagacc atctcaaaaa aaatttaaaa 60aaatgaagaa ggaaggaagg aagagaggga gggagggagc gtgggcgggg gggggggggt 120ggaggaggag gagagaagga gtgggaggag tggagaagga gggggaggag gagaaggata 180aaaggttaca agtggttgtt actaggaatg ggggagaaga gaagtgggta atggcactga 240agctttttat tatgtctttc agcattctct gattgttctt aaaccatcaa cagatctcag 300tatgtagact aaaagggaat atttggtgaa gagatcttct ttcactattg tacacttgct 360atggacatgt ccatgcctgc tgcctggcag gcaccattca ttaagtaggc ccctgttgcc 420aaggaaacca gctcttcact gataccaaag ataatgcaga ggcctgccgc tcaccaagca 480accttcctca tgagctatgc ccccaccttc ctgaactgtc tcttgctcct gtttgatact 540gtcatgctgc acgaagctta cacttgctat ctctcacttc cctcttagtc atctgtgatg 600ctggctaagg gagctaggcc agtcagcagt gacctgttgc ccttggttta ttataagcaa 660actgttcaca agaaatgaac ttctgttgtt ttataaatga tatgcatcac agaacacaga 720ataatatcaa aaccacatta gttttttcat acttgcttca ttgacccca 769315573DNAHomo sapiens 315acatcgaccc agaaagttcc ttctgtcagt agcagttcac ccccccatgc ccccaaccct 60tggcctccct gccttcccat ctccactccc aaccctcact gctctgattc tatcaccatt 120gttttgattc ttctgctgtt gatcttcata aaaccagtat atttcctttt gtgtctggtt 180tattttcctc agaataatgt ttttaacatt tatccatatt gttatgtgta tcagtcgttt 240cttccagatt agtactctat tgtatggata gagcctattt tgtttaccca tttcctgttg 300acagacattt ggtttgttcc cagttttgga ttataatgaa taaagctgct atgaacattc 360ttgaacgatg aacatttttg tggacatatg ttttgatttt tttgtgtaaa tacctaggag 420tgaaattatt gaggtatggt ataggtttat gcttaatttt atagagtact taaacttgat 480tcttttattt aaaattgtga taaaatacac ataacataaa atgaaccgtc ttaactgttt 540ttaactgtac agtgcagtgg tacgaagcac att 5733161465DNAHomo sapiens 316agcgtgccat tgtactctcc ctgggtgaca aagcaaggcc ctatctaaaa caaacaaaca 60agcaaacaaa aaaccccaaa actggaactc 
tgtatctatt aaacagtaat ctctcattga 120gtggtgttaa gagtaaaatt ttttttaaca aaagaaaaaa gtaaaaagta aattttgaaa 180aaagaattaa aaacaaaaaa tctccattac cccctccccc agcccctggc aaccaccatt 240ctactttctg tctttctgaa tttgactact gcacataacc ttatataggt ggaatcaaac 300agtatttgtc tttttgtgac tgacttattt cacttaggat agtgccctca gcttttaaaa 360ggaaagacat tttgatatat gctacaacat aatattccat tgtatgtaca taccaaattt 420tattaacgat ttcatctgtc aatgaacatt tgggttgctt ccaccttttg tctattgtga 480ataatgctgc cgcgaacatg tttaagtcct tgctttcact tttttgtgta tacacccaga 540agttgaaatg ctggattata tgtaattcta tttttaatat gagtgactgc catactgttt 600tctatagtgg ctgtaccgtt ttacgttccc actaagagaa catgagtgtt ccagtttcac 660catatcctca ccaacactta ttttctgttt tgttggtggt agccatccta ctggatgtaa 720actttattca tttttcgaac ctttttaata tggaattttc aaacacacac aaaagatgag 780agatctccag gtacccacca caagctttaa taatgattaa catttggtag caggtggaca 840aagatatacc ttctctatag cagctataag atcagggaca aacaaagatc tatttggaac 900tccaactaag aatggtgttt tgtaggctgc ctgatgaata aggttagata actaatggcc 960agtctttcag cctgtgctca agggatagga taacaataaa gcatagttgg tgaaggagca 1020gcagataaag gtcacaatag ataggccata agagaaccct cactatcact taccattcag 1080accattcgct tcatattcta acaagttatt ttcctttcat aaaaggaagc tgaagctttt 1140atttgtgttt gtggtgcatg tgatccatga gaggggactc aaccaggtgc tatgtgtgag 1200tagtacttaa tccgacagta ttagtgggct ggtgggcttt cctggttaca tgggaaccct 1260agaaacccaa gccaagcaca aaagccaaga ctgaattctc cagtaagtca cctggtagcc 1320ttgacatgct catgcttaaa aaagagccag tgacctatta ataggaagct cctgaaatga 1380gtcctctgaa catctgcaag tatggtcagc tacacctgag ctgagacttg cctgtttccc 1440tgccaggaaa tcatgggctc agaaa 14653171399DNAHomo sapiens 317gtgcccagct agttccattc ttttcagata aattttttca aatcctctag aaacaggtaa 60tatttgtgct ttttaaacag ttcagaatac acacaaaata taaaatgttt ctaatattta 120ttatatatct aacatattga taatctaata gaattccaga ttcctaaagg atttctgtat 180aagcactgag agataacctg tcttagccat tgctagtcag aaccaaagaa aataccatga 240agattctgag gtcttccacc aaaaaaagtt tcttaaaaga aatggaggca tgaaggcagt 300caggtaatga cagcaatgac agaatgagaa 
aagtactgca acagttcaaa aaactgcttt 360ttcttcctgg ttctgctaca taattcaaga taactttaac cacctctctg gggcccaatt 420tctttacatt gcaaagaagt tatggaccct ttaatactca gttccacaaa ttctgactca 480gagggttcag tgagaactcc aataattggg aggcaataaa ctcactggat agctttgagt 540aagacgactt ttggtgtgcc tgtcagttca tatcctccta taaagtctct aacctcaacc 600catcccaacc acaggcctgg gggcctgtag ctatgtatta tggatccttt taggaaaaag 660tatcttgcta gtcacaacta tgttctccct tgaagaaaaa tgagcaggtg aagctgctgt 720tcagacagaa tgaagcggat gtgcaaaggg accacagaca accatcacgg taggaaatac 780cgcttgcttt actgctgaat ctccagtgcc tagatcagtg cctggcccta gcaggttttc 840atcaaataat tattgaagga ccactgaatt tcattccctc atgtggtttc catgagatac 900ttctgtattt ctctaatcat tcaattattc ctccccctta agctagcaca agtttctttc 960ttacaaccag aaagcccttc caaatacatt atgatattct ccccttcata gccaccactt 1020acttcactac aggtatatgt cagacctcag gaaagacacc accgaagact ggatcacatg 1080tccccactca ggaatacaga attggcacat gagattaggt cagttggtca gcagcactaa 1140aggtggtgat agacaccaat gcagcgcata aaggctggcc ggcaggcgaa gtgataagaa 1200agcagacaca aacaggaaag tagacaatgg tggttctgag acatccctat attttcctgc 1260tatggactga atgtttgtgt ctcccacacc cccattcata tgttgaaatg ctaacatcca 1320acagtatttc gaggtggggc ctttgagagg caactagttc ataaatgtgg agccctcatg 1380atgggattag cgctcttga 13993181332DNAHomo sapiens 318tgcacatgct cactgaaaga cagatgtgat cattttcata gtaactttat tcataatgac 60cacaagctgg aaacaaccta aatttctatc aacggtagaa tgggtaagtt gtgatgtagt 120cacacaatgg aatactacac agaagtaaaa catgaactgc tactacatat aacatggatt 180actctcacag atacaatgtt gaacaagaca cggaagagtg catataatta tttcacttac 240ataaagtttt aaaacaggca aaactaatcc gtggtgataa aaggggttgg tgggggcctg 300gggaacataa ggacacctac tgaagtgcta gaaatattct atattttgac ctgggtggtg 360cattcaaggg catatataaa aacccactga ggtgtacact taatatcggt acacttttaa 420attttacctc agtaacaagt tgaaaatata ttggaagaaa gcatataaat gaagatgtta 480atcagtggtg ttgtcttcca aatatttctg gtttttctcc caggtatatg caaggatcac 540actcctccta gaactttaat gtggctatgt gatttgcttt ggccaatgaa agtcctttaa 600gagtccttaa gtgatttgcc atactctttt ttcatttccc 
aaagttagca cccacactcc 660tgatggtatc tgctttgtca gcctgagacc cagaatgaag gcaacttaga gcacaaattg 720gtgaactttc tctgtaaagg gccacagaat aagtatttta ggttttgtga gatgtacaca 780ctctgtagca actactcagc tgtgccattg tagaaccaaa gtagccacag ataataaata 840aatggacaag gctatattcc agtaaaactt tatttagata aataaggagc tagacagatt 900ttgcccatgg accatagttt gatcatggcc aacctatgat ctaaaccaaa gtccctaggc 960aagttgcaat agaaaagttg tgtgagacag tgagattttg cctttgttat tcaatggcaa 1020gctagcccat gctgacacag aaattggtcc cttgtttttt aaaatgctaa aatactgaac 1080actggcttaa tggttagctg gcaggcattc aggaaattgc tatcagaaga tggaagaaat 1140aacttacaca agggtaattt atattatgga aaggtgaaac tgccccttga gataacctga 1200aaggcagatc atctaccctt tactaaagaa aaagacagaa aaacaaaata ttttgtagtt 1260gctgttcact gcacttgaca aaacgtacag aagagatgaa ctcagaaaag actggtcagt 1320ttgcaggcaa aa 13323194336DNAHomo sapiens 319cagcaagcac caaatcactg gatgactatg acacctactg ttaagatctc tgaatgactt 60aatatgcctc agactaaaac tccaactaag attatactag ttaatcccat aaattggtca 120aaatggctta aggaaacaga ctaagggtgt ggctttccca cctaagcctg aaaagcctca 180gttacccaga aattaagttc agagaggcat gggaacaaga aggcaaagga atacagcaaa 240ctttagaagt atacattaac tagatgtcac ttttgggtca tctccctttg ggtgataagg 300acattctgca ggtgtaaaaa caagtgaact gaatatatag tgaccagaag ggctgagtca 360gtcttctact ttgttaaatt tacttacttt taaatcccag aagagccaaa aatcatctgt 420aaaggaaagt agcaagtaaa tagcatgcca ttttctccta gtgttgatgc ctacaaaagg 480aaaaatgatt attaactctt aggcggcatt atctttttcc aatactaaat gtaacattta 540gtaaaaacat attgttttag ttcactaagt agttgtctaa tcttttccct ctatatgtag 600gttcctttgg aaatttaaaa aaaaaaaaaa aatatatata tatatatgac actgtttaca 660aagggtaaaa aaaaataaga ttctatacat ggatatgcaa acttaatatt acatagtgga 720tttggtgtgt atatttggtt ttgaatcctg agtttactac ttactgtgtt taccagggaa 780aaccgcaatt tgttatcctt ctctcctttc atacaacaga gaaaaccaca atttgtttaa 840cagtcacata atacatattt aagtcacatg ctaagcacta cactaaattc tgagaataca 900acgaggtccc tagttacaaa gaacttgtct tcatttttca attagtaata tgtggataaa 960agttacccaa tggacagtct aaggcagaat gactgtgaag gtcaaataag 
actgtgaaag 1020agcttcaaaa attgtaaaac actacacaaa tattcgtttg tccaaacatt tattgaaatg 1080ccaggcattg tgctaagcac tagagatata acagtgaaca aggcttatat ggtccctgcc 1140cttacaaagc ttacagtcta gcagtgatca ataagcagta acaataaagt gtgccaagtg 1200tatgtctggg aaagaacagg gtgtataggg aatggatagt aagggcacct aatctagagg 1260gcatcaatga aggtttccta gatgaagtgg catactgaga ccttaaagat gaagataaat 1320ttgtattgta ccctaagagc aatggtgaaa gcaatgcagt gacatgatca gtaagtcttt 1380tggagcaatt tggttgtagt gtagaaagga ataaaaataa aaaacaggga gactaataag 1440gaggctgttg ctataattta ggtaggttga tggcctgaat taaaatggca gcattggaga 1500attggtaaaa aggacaaatg aatggttggt agtggtaatg ccatttagtc aaagaggaaa 1560catgagagga ggagcaggat tgggggcaag atcaatgaca tgtagtgcct gagacagcaa 1620agagcatttg ttagcaatta gatacatcaa ttagggaaga tctagaaagg agatatgaat 1680ctgacagtca tttgcatata aatggtaagg aaaccatgga aggaaatgag atcagctagc 1740gagctgacac agaacaaggc agtctaaaac aaattttttt aaaaatacga agaacagata 1800ttgaagggaa gaggtgcctg caaagactaa gaaagcacac ctggagatgg tatctcctca 1860aagctaaagt catcaagtgt tcaagtgttt caaggagggt aagactatta acaaggactt 1920agcatagtag agcaatttga gtggcaatac gggacactgg gaatacaaat ctgtcaagaa 1980aactagtagg aatgagctat aggacagtaa ctggtaagga cctaataatt ttttttttaa 2040tgtacgtatt ttaactatat tcactgctac aacaggacca gtaacaacta tatttattta 2100aaaaaaaaaa gactgccatg cagttacaga attacttaat acagaaaaca gtaaaataca 2160cttttttctt tttctttttt tttttttttt tttacaaaca agactagctt atagcaaatt 2220ctctatagct aagggtcaat ttaaaatcct tggcttatat ctccccctca ctcaatgact 2280acatgatgca aactaatttt attaacacct taagcaaaac atactggaat ttcacaaaat 2340gtacaagatt tcaatattta aggaactggg gttagaaagc agaagtggct ttcaggtctt 2400ccagtctttc tctcaagtaa taaagctctg ctgtgaatat tcaaagctat tgggaaatta 2460ccggtagatt tttctgtttt tttttttcgg ttttccacta tgttgtttct ctagatatgt 2520aagcttactc tattaaccaa aatctcagct tgaccattct tgataagtac ctaatcgaca 2580tgtaactttt tttctgcctt aaatatgtat aacaggacag agcccttaaa tctgattcaa 2640ttattaattc ctgatttaca agtgctatgg tgagctaaca gaacttatca atgcctttat 2700tgcactttac tagccaaatt 
tagaaggttg gaattagtct ctcctatcta gtattctgtc 2760agtttgccca gcttgtactt ttaattttgc ttctaatggt aatctgccct atcccttgaa 2820ataaaataat ctacattttg ggagggctaa ttcttcattg tgccaggctg tcccatgcac 2880tgcaggggtg agtgtcttta ggcttaaatg ccaacagaag cccctagtaa atatgacaac 2940caaaaaagtg cccctacaca tttctcagca tcctctggaa tgacaggtta ctgcctctag 3000ttgaaagcca ctggcacaac tttggttttt aagctcttat gccatttatt ttaattgccc 3060agacatcaat tccacctaaa ttcttagtca tagcctggtt ccttgaattt gctggattag 3120taaccacaga ttaaggtgtt tcaatagtta agacaggact ttggaacaag agtttttaaa 3180ttgtataata cttgagagga tctatgaata taaattgggt cctgtttata attagtttta 3240cataatgaac tttaagattg ccttttcatg gtgaacagaa gtttggaaat tactgttttg 3300gcacaaagca gattatctta gtagaaatac agaattactg caatctgtga ataagactgc 3360ttttaaatat ttctacttgt gtgctatctt acatatagaa tgtgtacgac agttccaaat 3420tttagaataa atccatttct agcatctaac aaaatctgat actgtatcat tttaaaacaa 3480agtgtttact ttaggcagga ttttttaaaa taaagcagca atacccacgc agataagaca 3540aaaaagctaa aatatctcac acctcctaat cctggagtgc aatctttttt cctcatcgtt 3600tttgataggg ccaaacttgt gtctacagta aaaaaaaaaa aaaaaagaat tactaactgg 3660caaccattaa gattctatac ttaccatagt cctttaatag gcaagctgat aaaatagccc 3720ccagttatta aaaaaaaaat ccaaggaaaa cccccaataa ttagtcttat ctccaaattg 3780catgaagtct cctatatctg aaacttaaaa atgattctaa tgacttcctc tatcagtaat 3840gtgttatcac tgaggtgggt gatggggagg gaagagggaa gaaatctgtc agtattacct 3900tcgaactcag aaatgtttaa aaaaaagtct caaacatttt gatggttaga caaaacacct 3960ccactgttat gtatgggctt cctttttgga aacttatgaa cttgctatgt gagcttctgc 4020aaattggttc aaaagcacat ttaaggagtt gataatttaa gactatatga atcagaattt 4080taacactcca ttaaaataag agctgaaatt tttggcattt atcttcagaa cacctaaaaa 4140acagactgca aattcaactc acattaatac taaatctctt taaaattaac tatatcataa 4200aagacaatga ctttgtcact aaactaagtt ttaaaaaagg tggcattctc atgtttcagt 4260cccatgctgc catttgagat gaaaaaaaag gcaactgtca gaattttaat tgtgatcagt 4320ttggacggct ggtact 43363201612DNAHomo sapiens 320cctggccaga aaattcattg acttcctaaa gatttattaa ctttctgcat tacttttttt 60tttcccctcc 
atcgtaaata taaaagggaa tagtagagaa aatcattcag aattttattt 120tttagtgaca ttatttagtg acattttatt agagtcactt aggaacctga ggctgaataa 180agttcaggta aaagtaaaat tagttgagaa gagacatctg ccaaaagaaa tctattttta 240acttcacttg ctgtctttcc tagaggaaca gaaatagtgc tgaatgtcct attagaaatg 300atggttgctc tgcccgtctc ttccctctct ctcacacaat atgtaaactc atacagtgta 360tgagcctgta agacaaagga aaaacacgtt aatgaggcac tattgtttgt atttggagtt 420tgttatcatt gcttggctca tattaaaata tgtacattag agtagttgca gactgataaa 480ttattttctg tttgatttgc cagtttagat gcaaaatcca caagtattca agtgattgtt 540aaagagggag gcctgaagtt gattcagatc caagacaatg gcaccgggat cagggtaagt 600aaaacctcaa agtagcagga tgtttgtgcg cttcatggaa gagtcaggac ctttctctgt 660tctggaaact aggcttttgc agatgggatt ttttcactga aaaattcaac accaacaata 720aatatttatt gagtacctat tatttgctgg gcactgttca ggggatgtgt cagtgaataa 780aatagattaa aatctattct cttctgatgc ttacattata gtggtgggag acaaaatggg 840tataataaat attatattag atagcattaa gtgctgtgga gaaaactaaa gcagggagga 900agataggagt gtgcaagcca gaaaggttgc aattaaattg agtagttcag gaaggcttca 960atatggatgt gatatttgag agaccggtgg aagtcaagga gcaagttgtg aggctattta 1020aaggtattct tggcttacag aacaatatac gcaaagacta ttaaatggaa gcatacctga 1080catgttaaag gactatcaag gaggccagtt tgtctagagg ctgaaaagga aagagtaata 1140ggagatgagg tctgagtgaa aacacgtaaa tccttgtggg ccaaggtaaa atctttagct 1200ttttttctga atatggtggg atactgttag agggttttaa gcagaggtta cgtggtgtgg 1260tgagtttttt ttttttaatc ctttgtcttt ctgtgtggaa aatagcagga cagggcagaa 1320gcagtctgtc ctgcagactg cttggtcgca gtagagatgt aagaagcagt gagattctgg 1380gttaattatg gaggcaaagt tctcagaatt tgctgatata gggtatgaga gaaagaggaa 1440tcaggaatga tttcaaggtt ttggtctgct aaatggaagg agttgccatt tactaagatg 1500ggaaagacta tgaaagaagc agattttcag agagatcaga agttcatttt ggggcatgtt 1560caatttaaga tgcctgttag ttggatgttt atgtgagttt ggaatgcagg gt 1612321831DNAHomo sapiens 321gcagtccttt gaggatttag ccagtatttc tacctatggc tttcgaggtg aggtaagcta 60aagattcaag aaatgtgtaa aatatcctcc tgtgatgaca ttgtctgtca tttgttagta 120tgtatttctc aacatagata aataaggttt ggtacctttt 
acttgttaaa tgtatgcaaa 180tctgagcaaa cttaatgaac tttaactttc aaagactgag aattgttcat aaataaacta 240ttttacctgc agagacctct gatatatgtt tcttgatgga agtacccagt accacctatg 300aagttttctt gtcaaaaaat caaatgtgaa tctgatcatt acttagatct aagtaccaat 360atatgaaaaa tataggagac aaggaagcat ggtaaatgat actgagattg ggagactaca 420tggaaaaaga cttgttccct tcaacagata gacagcaggg aaaaaagaat agagaaagga 480gtaaagaacc tgtagattaa aagacattta agggacatat gaaccaggtc cagtgtatag 540atcttaccta aatcctgatg gagcaaacta taaaaaaatt tttttgagac aaatgtttga 600atacaggttg actatttgat ggcattaagg agaaattatg aattatcttg gtataagaat 660attgtcatgg gttttttttt ttgagtcctt acctgttaag atacatacta aaatatttgt 720gggtaaaatt atatgacgta taggagtata tgatttagaa aacggattaa aatataaaag 780gataaaatag gatcttatat tttgtgactc acttcctgtt ggatatcttt c 831322997DNAHomo sapiens 322tggccttgtt taaggtcctg atgagtattc ttataggtac actgtgtttc gtttaattat 60ttccttagga taaatttata gaaataacat tccttggtaa aagaatacat attttaaaaa 120ctgtattagt ttcctgttgc tgtcaaaaaa tttccagaaa cttagtggca ttaaacaata 180caaattaatt attctacagt tctggagatc agaagatacg
ggtcttacta ggcctcacta 240ggctaaaatc aaggttttgg cagggctgtg ttcctctatg gaggttccaa gggaccagag 300aaactacttt acagtagtta ttttaaggga atgaaagtga agatggggtt gggcagtcaa 360agaggctgtt acttttcatt tttggccttt cagtagtttg aattttttta tcatatacat 420gtattacttt aatttttaaa aagtaaaaag cagctgtgat tcagtctctg taatttagat 480caatttacat caaactaggg tggtctcatg tgttgtcttg ctcacagtga ccactagatt 540attccaagaa gggacaattt ccaagacttg gtttacactg agacggctcc tgattttaag 600gataccttag atcaaactct aggaaggcag tttcattttg gccttgcagt tccctgggtc 660attttccaag cccatggcct cctggagtct tcgcctagct gtaggttatc tttgtggcta 720ttatttcact gtaattatac aggaagattt attgagggat ttctgtgtac cagccgtggt 780tctcagcact ttgtatactt tgtattaact ctgactcctg acagtaactc tacagaggtt 840ctgctgttac ccagttttac atagaaacat ggccagcgga cgcagttaga aaatggcaaa 900gtggggatta gaaactaggc agtttgactc cagagtctgt gcccctgtcc acttggctcc 960actgctgggg aagaggcctc tgaagcagca ggaccat 9973231165DNAHomo sapiens 323accccgtcat agcacagttc ctgagttaca tctttacata ctgtagtatc cttcttgtga 60aaaaagatac agattccaaa ggtctgagaa accaatcttg gttataaagg ggaaaaatgg 120tcatgggttt ttaaaatttg ttttgtctta attgcatttc aaatttacat ttctaaatga 180ataattgctt atataaagca gttttgatta acaatataaa acactatcta tttggagtga 240ttcctttacc catttctgaa ggcaagtttt aaaaattact agaagacact tcattgagaa 300tattattaaa catgcctata gttctaccac ctcaacacaa ttgcttatta acacattaat 360gttttggtgt gttttggact ttttaatatg tatttttcac ttgttctagt aattatgcta 420cagattgatc atttcttttt caacatgtca tcaaagcaag tgagcaaagt gctcatcgtt 480gccacatatt aatacaaaat ggaagcagca gttcagataa cctttccctt tggtgaggtg 540acagtgggtg acccagcagt gagtttttct ttcagtctat tttcttttct tccttaggct 600ttggccagca taagccatgt ggctcatgtt actattacaa cgaaaacagc tgatggaaag 660tgtgcataca ggtatagtgc tgacttcttt tactcatata tattcattct gaaatgtatt 720ttttgcctag gtctcagagt aatcctgtct caacaccagt gttatctttt ttggcagaga 780tcttgagtac gttttctttt ctccttattg ataaattgat aatcctcaag gatgattatt 840aggtgatact cttacttcat ggattcttaa aagatatgat ttaacatatt acaagtgcct 900agcaaggtgt ctgttacacg taggtatttt 
aagtaaatgg tagctgctga tgtaatttct 960gcccctttgc ccttcagttg gggtattgct ttggaccgat tagagggctg tggctgggat 1020gctaaaggtt catgtttcct tagctggctc ctgagccacc agctcccacc acctgtgtat 1080acctgtgcta gtttgccttc ccacaagtag ctgctggcta tctgttatgc tggtacagtt 1140ttcagaaact gatgaatggc ctttg 11653241275DNAHomo sapiens 324gtggcgtgat atccttgatt ctatcagcaa cctataaaag tagagaggag tctgtgtttt 60gattcagtca cctttagcat ttttatttcc atgaagtttc tgctggttta tttttctgtg 120ggtaaaatat taataggctg tatggagata tttttcttta tatgtacctt tgtttagatt 180actcaactcc actaatttat ttaactaaaa gggggctctg acatctagtg tgtgtttttg 240gcaactcttt tcttactctt ttgtttttct tttccaggta ttcagtacac aatgcaggca 300ttagtttctc agttaaaaaa gtaagttctt ggtttatggg ggatggtttt gttttatgaa 360aagaaaaaag gggattttta atagtttgct ggtggagata aggttatgat gtttcagtct 420cagccatgag acaataaatc cttgtgtctt ctgctgtttg tttatcagca aggagagaca 480gtagctgatg ttaggacact acccaatgcc tcaaccgtgg acaatattcg ctccatcttt 540ggaaatgctg ttagtcggta tgtcgataac ctatataaaa aaatctttta catttattat 600cttggtttat cattccatca cattattttg gaacctttca agatattatg tgtgttaaga 660gtttgcttta gtcaaataca caggcttgtt ttatgcttca gatttgttaa tggagttctt 720atttcacgta atcaacactt tctaggtgta tgtaatctcc tagattctgt ggcgtgaatc 780atgtgttctt tcaaggtctt agtcttgaaa atatttatag tgtagtagaa ctattttatc 840ctccaatgct ccttcttttc cttgtatttc cattatcatc actttaggat ttcacttatt 900tatcattcaa catttattaa ttgcctctca tattccaggc tttgtgctag aagttaggga 960tataaagaca aataagatat ttcctgccct taaagactag attcgtgttg ctaagtcttc 1020attatcaaga aaagcataag tggggaaaag tgcttgcatt atggattcct catagttgct 1080cccctctgca tgtaaaaatc accatttcca tcatagattc ctagcggtct caggacttta 1140taaagcccaa agtgcctatg tcataatatg aggaaaaata ctgagaccct tccatatatg 1200ggaggtatat ggatgagaca gctcctgact tcacttttcc cagaaatctg aaaagcagca 1260gcagtcattc cagag 12753253164DNAHomo sapiens 325tgtgctagat gcctcactgg aaaaataaag gacatgatgg aaaactctgt agggtcagag 60aaagggatca ttagagaagg ttctttgaag aaatattttt tgaaatatga aggataaata 120ggaattaact aggtaccaat aggttaggag tagagctttc cagacagagg 
gactagttct 180tgggaaggtc tccagacaga aataagtgtg gcttgtctga ggacctctta ttcgcctatt 240aaccttccct ccccagtaaa cactcctggg aacaacacac attgtagaac cacgttgtgg 300tgctgttcag tatagcaagt aattcagcag agataagttc ttggaatctc atctttggga 360tttagttact aagatacatt caagtttgag caaaataagg tctcagagct tggattcatt 420gttctgttcc agcaattaga gcagtacctg gcacatagca caagtgcttg aaaacactga 480ctgagtaggg taggtgggtg agtgggtggg tgggtgggtg ggtggatgga tggatgggag 540gatgggtggg tgaatgggtg aacagacaaa tggatggatg aatggacagg cacaggagga 600cctcaaatgg accaagtctt cggggccctc atttcacaaa gttagtttat gggaaggaac 660cttgtgtttt taaattctga ttcttttgta atgtttgagt tttgagtatt ttcaaaagct 720tcagaatctc ttttctaata gagaactgat agaaattgga tgtgaggata aaaccctagc 780cttcaaaatg aatggttaca tatccaatgc aaactactca gtgaagaagt gcatcttctt 840actcttcatc aaccgtaagt taaaaagaac cacatgggaa atccactcac aggaaacacc 900cacagggaat tttatgggac catggaaaaa tttctgatcc ataggtttga ttaaacatgg 960agaaacctca tggcaaagtt tggttttatt gggaagcatg tataattttt gtcctaagtc 1020tgtgctcagc cctcccacat gtgctcattg ctggttgact gttggagtct ggttcttacc 1080tctaagagga agcccaggag agggcataaa gccagcacac tgtcctcacc tgatggtgtc 1140agagtcctta cgagtaagcc ctagccagaa cattgctgga agagatcaag ggccactgtt 1200tgaaattgca cagcaggata cggaaaaggg gtaccttagg tataggcatt gtcattaaag 1260aaattgctaa gatacttgag attttcctgt ttaaggaatg agctttatga tacaaagagc 1320agttctaaaa attagggagg gaattaacta aattaattag gatatttctc aaattccttt 1380acagtttttg tctctctgct gatatagtgt ttacatgatt gttatttact aaacaaatgc 1440tattttgtat tgtgctcctt ataacttaat tgtttattac aaggttttga tggtgaccta 1500ccaacaacaa gtaatcccaa acacagtctg aattttttgt tttccatcca gaaataagat 1560gaatctttcc atttccgtgt tttcagtttt catcattttt atcctatagg ttacttatct 1620ttattttaaa gcatttcata ataattttat agtttttgtt ttgtttgctt gtttgctgtt 1680ggaaatggaa tattccctcc ttccatttag actgctaacc agctgtaaat gtttcaaaat 1740atgcatgttt tacagcagtt gttcaaagca atacaggaac agtaaggaca gagccagtca 1800ttttacaacc acattctgtt aaactgatgt ctattagcag ggtttttcct attttattag 1860gaaggactta cacctgatat ataacaaagc 
ttgttttaat caaggctcag aaaatgtttt 1920tcattagttt ttttcctaac catgaagaat aactgctttg taacacacat gctggctata 1980aagcagacaa aaaattcact gtaggtgctg cctgactggc ctctgtccgt gtttctgttg 2040gggctgctta ccacagcctc tgcattatca ttagctagtg tgttcacaat accaagttcc 2100cagtagcaaa gaaaggtcaa gctcttacgc atgccattca tttatctaca ctgtgcaggc 2160gcactcaggt ggcagggaca aagaccactc ctttggcgca tctcaagttc agaattctca 2220gtagaggggc tccagctgtc cttttgtcag gtgcccatgc ctgctccagg cctgtgtggt 2280caggacacgt gttacagagt acagtgacat taatgatggg gccatggata tggtcagcac 2340tcagaggatg ttagtctctt cattgataaa gtcacaacca cttttcctgt tggaaataaa 2400aagatttgac gtatccttgt ctacagcaac acaggacaac agataatcag caggtcatct 2460aaatctgttc agagagaaag gagagctgtt tcctgaaaat acatcttccc ctgattttag 2520tcttattttt ttctgccttt attgctttct accctcttca aaccagcctc atttcctaaa 2580ttaccttgaa tatgcattga cacttgtact gcctgaaatt ctggaaaact cagtatggct 2640actccaccgt cagaacttcc tgagcaaagt tagttgctct ctcggctcac tgttttgttt 2700tgttttgttt tcctgcctca ggtttatttg tacaaatagc acaggaggac cagccccatg 2760cagatggtag cccaggggcg ggggtagggg gtcacaccag tccttctgtc ctcatgttgg 2820cagagatatc tactctgaag cctttgtagg ggcctgggca cctttgggag cctgagctgg 2880aactgaaggt ggagctgcag cctgggcctt ggtttgatcc ttggccttgg cctttggccg 2940gcacagcctg agccccttgg caatacgggc acgagcacgc ttcccaagct tgggatgggc 3000aatgtaggca agtcgatcga gcttgcggct gacacccttt gggatcttgg gcttaacctc 3060cttgggcttt acgagggcct tgatagcctc ggcacgtgca ctcatggcct tggcattgtt 3120ggcctgcatc ttctttaggc ccttcttgtt gtgcttcttg gcaa 31643262468DNAHomo sapiens 326cggaggctct actgttggac tgctgtccac tctggaactg cggaggctct actgttggac 60agacctgggt taccagccgt gtgactagcc ttccctggcc tccatatccc cctcagtaat 120gaaggaatgt gtcatcccca aatccaggga cagttacaag cagtcagtga acagaaagtg 180tctggtacag gttctaagtg cttattattc taagtcactt cacttacctg agttctcagt 240tttcctatct ataagataag caggttggat aaaatgttct ccaatatact cctggtcctg 300agatgatgtg attgtgggca gccctttaat catggtgaag atgttcatca taagcacact 360gaaactacaa aataggaata taaatatttt ctccattaaa ttatgctgga tcctagaagc 
420aaaaactgga actgtgaaac cctacttcac agaaaactta aaattcccaa gcagatgaat 480gcttctcgga aggacactga cagttaccta cctggaaaga atctagatgg aggtggcatg 540ggcactaagc ggtgagatta aacccagtta gggcagcccc accagccttg gaacccacac 600atctggagat tgttgatgca gagagaaagg ttcctactgg tgagacctga aagggatatg 660tggcaggtgg gaggaagaag ttctgtctgg aaaccaaccc ttgttcctcc gttattgatt 720gactcctggt accaacatga gccctaggtc ttatagaggc cataagtccc tatgccttat 780agtgcccatg gatgagatga ggccacacat gcccccagtg ggttaacatg tctagcgtgg 840gtaaggctct tggagcacta tgatacacag gaaatgccca gtaactctta gttggtttga 900tatctgttcc cattgctcac ttaagctcag tgccccttta ctgatccttt tattctgcct 960ccctctgcac atgtgcattg agactcctat ctgagacaca cactgtgttg ggtgcccagg 1020gatgcagcat agatgttgct gccttccaca gaagcgctca tggtctgcta gagaatatat 1080cccatgggag agaaaaacag actcgggaga atatagcagg ggcccttgtc ctggactttg 1140gcagttagga aagggaggga agagacatgg aggctgggac ccaaaggcta aataggaatt 1200tgctgggcca aaggggaggg ggaatgaaaa gagtgtttct ggcagaggaa atggcaagga 1260taaaggcctg gaggcgcaag agaatatgtg tttgaggatc tgaaagttga gtgcagtggg 1320tccagtgttc tctaccctgg ctgccattag aattacctgg gaaactttta gaaaattcca 1380gtgtctgggc cctccctaaa acaataaatc attcttgggt ggtggggtct gggcatcagg 1440attgtttaaa accctcccca ggtactgtca tgtgcagctg gggttaagct gtgctggggt 1500ctgagtatgg atctgttagg gcaagtggcg gtgatggagt tgaggctgca gaattcaggc 1560caaatagaga ggttttcatc aggatattaa agagtttaga tttcaatttg gtgggaatgg 1620atgggatctt atttgcattt tatgaagagc tccctggttg caatatcaga atggattgga 1680gaggagcaag atggaagcct acagtgattt gggagaagtg gtgagggact tgagacacag 1740gaagtagccc cattcactaa tagttgagta tgtagatttg ctaggacctg gaaatggttt 1800ggctggtggg gagtgggaag aaaggcccaa agtgtgaaat gaagatggag agcacattgc 1860ctagcccaga gtgattgcca tttgctctgt cccagttgag gtccaagggg ttggccagag 1920atcatggagt ctgtggctcc atggggagaa gaacctctca gcatgcctcc ttgtcttatc 1980ctgggttagt cagattcatt ttgttagatt acattttttt tccagtggaa ctctgcttaa 2040gtcctgacca gtatgttttc agaaggatca gagggcctgc ccttgtccat tggtgcatga 2100caccagcttg gtgggttcct tgctgctccc tgttttcata 
gggttatcag aataccttct 2160ctccctgcca ccagcaggtc acactggctc ctgacttttt ggcccatgga accaccatct 2220ttctgcttct tagattgtgc cttgtactcc actgatcatg gccagtacat cagaagccct 2280ggtttgcagt gaatgcattt gatatggaaa tcaggaaccc tggggatacc actcatcata 2340tttggttgct gtgtttttcc tccaatcttt caccataaca acaatcaact caaaagattt 2400ctataaccac ttgtgtgggg gtttctcccc acacactaaa caagcagtca gttccagagt 2460ggacagca 24683272826DNAHomo sapiens 327acatcagaag ccctggtttg cagtgaatgc atttgatatg gaaatcagga accctgggga 60taccactcat catatttggt tgctgtgttt ttcctccaat ctttcaccat aacaacaatc 120aactcaaaag atttctataa ccacttgtgt gggggtttct ccccacacac taaacaagca 180gtcagttcca gagtggacag cagctggtct cctccaattt aattccaaca ctgtctactt 240ggagatagca ttagatccca caggttgagg gtgcagtccc ctagactgcc cccagtctcc 300tgcttcagac accagtcaca agtccaggac tctagaagtt ctgaccagtt tcaagttggg 360gttcccacaa ccccccactt tatttttgat taatttgctg gagtggctca tagaactcag 420ggaaacactt agttttctgg acttattaca aagatttaaa aagataccaa taaatagcca 480aataaagaga tatacagggc tagatctgga agggtctgga gcgcaggagc ttctgtcccc 540atctacttgg ctcccagcag atggatgagt tcttattcat tttcttgtca gcttcgacat 600gttcagctct ctggaagccc gcaaactctt gtcttcttgg gccttttatg gagacgtcgt 660taggcaggca tgattgaaac atggacaact gtgtcgaaat atgattggac ataaaggggt 720ctaaactcag tgaggcctgt ttgttcagat tcttcttggc ctctctgtgg ccattctttc 780ctccaggata tggggcagga cccctatgga atgagggtct tatgacccac aatcaaatta 840gagtcctgcc ttgggcaagt gaaaggaaag caggagaagg taagagaaat tctgttgcct 900aagaccttct gaggcctaaa gcaccccaac attataacag aagacgataa caggactatg 960ggagttatga gctgggaacc ttggacaaaa atatatacat attaaataaa tattaagtgt 1020atatatatac ttacgtatat taagtgtatg tgtgtgtgtg tatatatata tttttttaat 1080ttactggttg gttttgggaa gcagaaatta ccataactac tcttaaaaat cttttaagtc 1140tctttgaagt tagaaaagtc actgtacctt tttgtttcca ttggccctgt acttcttatt 1200ataccccagc aggaggagca taatgtgttg ttatatcatt ctggtgataa gattcataag 1260tgggttcagc tggtgacagc ctgattccct cattgtaaac ttatccatca acatgtagct 1320taatcgtttc accttttgtg atgaccatta cctgaatcag ttatttcatt 
agattgcaag 1380attatgcttt tctgatttta tcatttcttc tgtattgact gtaattcttt ggtatagaag 1440aactttccct tgttaatagc tatttggttg tcctgaagta cagttcttac tagaaagtaa 1500gaccaaatgc tgaattatat ccctctagct atcaattttc gaaggaatga atggtgtcct 1560agtaatttcc agtggtgttt aattacgttt tcccttctct ttctccttct cttattccct 1620ccctctccat ctcctccctc ctcactttca gttttttgct ctttcagtat tttgtcatag 1680ctgttaacag agcaacatat tttaatcaat tgtagtcatt tttctttttg gtgctcaaat 1740tatcccgtct tagtcccatg gaagcaagcc cttggagcta gggccctcta ccttttgatg 1800gatttccatt tgtcttgata atttccttgt ttctgacaag acaagatgtt gcaggcacat 1860tttatacttt cccagcccaa accctggaat aggccttttc tccgaggagc tctagttcat 1920tttagtggga aatggtattt agagactata atctgggatc tgggagtcct cattgctact 1980gagtagtcat tacttttagg cttttccagt ggtcagagct aggaaatatg tatatttaaa 2040aatggacagt tgaatggttg ttgccaggag ctgggaggaa ggggaagtga gaaattgttt 2100aatgggcaca gagtttcagt ttggggaaga tgaaaaagtt ctagagatag ctggtggtga 2160tggttgcgca acaatgtaaa tgccactgag ctctcattta aaaatggtta aaatggtaaa 2220ttttatatat attttaccac aataaaaaaa agtcttcttc tgggagcacc cccccaagac 2280aaaaatatga aaattttaca ctgatacttc catttcaaga taattttaag attataagga 2340ttttgcttaa ttcttgaatt ttatacctgt aaacctttta tacttcaaat ttcgggcaga 2400attgcttcta taacaatgat aattatacct catactagct tctttcttag tactgctcca 2460tttggggacc tgtatatcta tacttcttat tctgagtctc tccactatat atatatatat 2520atatatatat tttttttttt tttttttttt aatacagact ttgctaccag gacttgctgg 2580cccctctggg gagatggtta aatccacaac aagtctgacc tcgtcttcta cttctggaag 2640tagtgataag gtctatgccc accagatggt tcgtacagat tcccgggaac agaagcttga 2700tgcatttctg cagcctctga gcaaacccct gtccagtcag ccccaggcca ttgtcacaga 2760ggataagaca gatatttcta gtggcagggc taggcagcaa gatgaggaga tgcttgaact 2820cccagc 28263283843DNAHomo sapiens 328tcggtctcag tcaccatttg tctaagcaaa ttcaggcagg cttcaccttg cctttctaca 60tttgttccct tttcttagca ttttgggcct ttgtttacac gtgggaaaag acccacaggt 120cgtctctccc tttgggcagg atacaggctt cctgtgactg aggttttgct agctgtagaa 180gtggctgcca attggcttct ggtttttatt tccatgattt gctccagtgg 
ctcttccctt 240ccatcattgt tagctttcaa gctaggaact tttaaaatgc ttttaaataa aagtgagctg 300ttacttgatg catttagcag tcttcctcac agtggttttg atagacagac tccctcagtt 360tggaatttat gagttttctt taagggtttg tctccctcat gtatagcagg ctgttgaaag 420ttacaatgtc aataactttc tgaatagtat caaactgttt tcagtgcagt gtattaacaa 480aactaacctg cctcaagttt ggtcagcttt ggagtcttac tgaggctaaa atgataaatc 540taaatgattt aaaattgtgt attcctacac agtatctcac ttaattatgt aatagtcttg 600tgagtgaggc agagcagatg ccgttttctc tattttaaag atgaggaaaa tggaatggaa 660aatggaaagg acagactaat tgcaacatcc tcgcaatcaa aaacaggccc aggttcatgc 720cttgttggca gtgggttgct actggctgtg gccttcatgc aggaaggcta gatgcataac 780caggtcaaca gcccgtgcag gacaagcacg ccatgtaatt ctgattccat cgactgaggc 840tggtgttttc aaacgtgctg gtgtagggtc ttacagacag agtcatctgt gctatgggga 900atggaatgtg ctcttgcttt ggagccagaa ctcctctgaa gctcccacca cctacaccat 960tcagaggcca gacagaaatt tgttcaccat tttgggcatg attttcgtgc ttttgtaaaa 1020tgtgcttcac tgcagccctt actgggctgt ggtgatgaac acttaagata ctgtgtgtgt 1080gctttataat ctgtaaggca ctgttcaagg ggagggacct ctgccatgag cccctaccca 1140ctggtatctg gttgacatcc aaagccccag cctgggagaa gctgattctc tagttgaatg 1200ctgtataggg atttgactga ggctcagatt tggtgaggaa gaccactaac cttaacagac 1260caacaggctg gctactccct gatgaagttc cccaggccat gaaagaagta agagatacat 1320tccttgtaac agctttctta gttgcacctg tatgattatt tgatcagtgt gttgtctgtg 1380cagggatcat gtctgtggag ctcaccacct cgtcctcggt gctgagcaga gtgcctggca 1440tgtgtactca gtagatattt gctaagggag cgagtcagtg attgagagga gcagcctggg 1500aggtaaagcc ctagaatctt tattttaaag ggatatcaaa gttgaacatt cagttagaca 1560gttctcttga gtccagggat ttacccatcc atggtggaca cactttcagt taaaaagtaa 1620ggttaatttt gacaggttgc agtatccagg caagcattct atggaataag gctcatctca 1680gggattagta atgactgaat taacttactg ctagtcccat aattttgacg ttaattaatg 1740gggttaagaa atgtcataag ctatttggta ccatttaaag tgaaaatacc cttaacgttt 1800tttgcctcca gatatccaca cttaatttca ttttcttgct ctttggtgaa cagtcctggg 1860tctgaatgta tatatccatg gtttgtcact aggtgacagg tttttttgga acaagaaatc 1920agttcagtga acatttgtca agtatcttct 
ctgtaaaaag tgtaatgtgc caagctcaga 1980agtaggaagt gaaatggata aactatgacc cctgccttaa agaacaccat ggtgttgtat 2040gggaattgtt taggtagaat gaaagaaatc ctctaataga gatatgaggc cagttcagca 2100gaaagccagg gtgagatctc ctgagaggga tggaagggtg tcttgatcat ctctggtagc 2160agcaaaggca ctggcataca gtggccactg gaagacaacc agcaggggat gggggcgttt 2220acccttgcaa gtgagcatta ggaactagag gactgattgc cctttcttca gctttggttt 2280cccttgctgc agaaaaagat gctgagactc atggcctcgg ttatgaactc agatatgtgg 2340tttggctttg aagcacagat ggattttgtc cgattttggc agggaaatgc ctacagacag 2400cactatgggc atatttaggt tagggacgaa atgcaagttg attaagtcct gataagaggc 2460tgtgaagagg tccaagaagc ctcacaatgc ccaatgaaga aaagccctgt gcttggtgct 2520gccgcctccc ttccccgtcc tgctggcagg gctgcgcttc agtagctctg gatgcgtcag 2580agcagtccat gaacattctg tgtggaaaat ctctgactgt tttagtggat tacactgctc 2640tccctttcct ccagtgcctc gttattcagt attatttgat gttctccagc ttttaaaata 2700atcattttcc gcctacgcag aacatcctgt agagacgttg aggttccagt gggaacagag 2760aggaatactt attctaaaaa tgaagaaaat aaaccttttt ttatggagtg ggtgatagta 2820ttgcagaact tctataatag tatgagaatt cacttgtggt gccaaagctt aaaaaaaaag 2880tatagtaaaa acataatgta taggcttatt gctgtgctat gacccatgcc ccgttttctc 2940caacctctct tgtcctcact cttccttttt gctggtgata tttttactta tttcatgaaa 3000aaaaagataa catatacaca cacatagata
tatgcacaag tatatgtata tatgtgtgca 3060taacacacat aaacatatac attggtaaat ttaaaaacat atttatgaaa tatatgtagc 3120atctacagaa aaacatgaac acttgtgaga atagcatctg cctaaaaaat aggacatcac 3180catcaccttt gaggctctta tgtgctgctc ccctgtgcca ttcccttccc ttcttcctta 3240gaggtgatta ctattctaaa ttttgggatt attatttcct ttttttatta tagtgtttta 3300attacagttt tattacctgt atttgtattc ctaaaaattt gtttactttt gcaagcttta 3360gattttataa aagtagaatt acactgtaag tttaattttt ctgtaattta tatatagcta 3420cacatatatt cctaagattc atccatcttg ttacatatag ctctggttta ccttttctgt 3480ataatataga ttctgcttcg tgaatttaca gttcattcat tcttctgtta aaggacagtt 3540ggaggactca tatggcctca gtctctgtgt ccccacatgc caccctgctt cccagcctca 3600tatgagttga ttggtggcct ggcatactgg atgagaagct ctaggtcata tatttaagag 3660agttattgct gggtcataaa atgacagatt gttttccaga ggggtcatat tgatttaaat 3720tatcaccaac aattatattg tcagattttt accagtttgg tgattgtgaa acagtgtctg 3780atggtagttt ttatttgcat tttcttggtt gaaataaagt tgtgtatttc agccaggtgc 3840gtt 38433294221DNAHomo sapiens 329tgaacctgca atatctcaga ggtatgcctg tatctacttg ttctgtgata cttgttattg 60tcagtttgtt tggatttacc acatattatt tgatcataat tctttcctgt agatgtttta 120tggtctgcct aaacctttag tggggccttt gatggcttag tcctttcagg cttaagacaa 180tagaagttta tttctcagag ttctaaaagc tgggaagtcc aagatcaagg caccgacaga 240tttagtgtct agtgaaggcc cgcttcctca tacatggcac cttctagctg tatccttaca 300tagtggaagg gaatagctag ctctctggag tttctttcat aagggctaat cccactaatc 360ccaattatga gggaagacct aatcacctcc caaaggcccc acctcctaat agtatcacct 420tgggggttag gatttaacat atgaattttg tggggacaca gacattcaaa caatagccat 480ggcaaacttt tttgctttgt ctaattcact cttattttga aaagtatttg tgttgggttt 540aaaactccag attggtaatt attttttctt agtgcattga aggtaatagt gtatcatttt 600ctgatttcta ctcttgctct tgaaaattca gctatcaatc ttaaaattta ttacctgttg 660aaaatccagc taccagtctt atattttatt tacttagtgg gtaatctctc ttctgagtac 720ctttaagatc tcctttcaga aataccatgt agtaaccctg tgtgtcacgt gtggattttg 780ttgggcttgc tagctgagac ttgacagttt tcatcacttc tgggatattc tcaggtattt 840tgtcttcaaa gtcttcagat attgtcctct tcctgccctc 
tctccgactc cttctggaac 900atgagttatg tatttattat ctcccatgtg cataagttat ctttacatat tttcaatttc 960tttatctttc tgtgctacat tctggataat tttgttgatc taccttccag ttaattagct 1020tgttaacttt gtcaaatctc tttttaagtc tatcttgatt tttcttttca attattgtat 1080ttttcatttt taaaaacttt atgtgctctt ttggaaatct tgatcccagg agatagtgga 1140tagtgtcctg ctgcttactc atggttttaa tagttcttga gcatgctgaa catacttatt 1200ttatgttatt tgctaatctt tccaattcct gaaaccttta cagatctcat tctgtggatt 1260cttctggatt ctaattcatg gggcattttt tttgtttttt gttaattcct catactttat 1320ctgtggggaa ttacttgaag cctgggttga caatgaaatt ctgcagagag aatttgcatt 1380tgattctact ggaggaacag tcagccccga tatcagttta aattaaaatc tctgcttaag 1440gttttcaggc aacctgctta gcatgaatcc tggctggaaa agcatgtgag gaccagttta 1500tgattacaca ttcacagggt gtcatgtttt cttccaacac caatgctaga ggtggcagtt 1560ttgcttactg cccttggagg gacaggggag tgggcatggg catagtagta tggttttcct 1620tttcactggg ggtgcagccc ttggagtctc agcttaatgt gttggggaag tggtctccta 1680ttagactctc catttcaaac cattccatga ttttgtcctc cttttgccac cttccgagcc 1740tgtaaaaact aatgtttgtg attcctgagg tttctctaat gtcttttaat aaagttgacc 1800tcagagatct cgttacctct ctgagttcct gctttgtctt agattttgat ccttgagtgt 1860tctttaatct tttagcaatt ccttgttgca tgttaaaaga ttagttatat tttattcctc 1920atttgtgttc gttttcacca ggaggctcaa ttcaggcttc tttgcttact tggtgtctct 1980agttctggtg cctggtgctt tggtcaatga agtggggttg gtaggattct attacttacc 2040tgttttttgg ttttattttt tgttttgcag ttctccggga gatgttgcat aaccactcct 2100tcgtgggctg tgtgaatcct cagtgggcct tggcacagca tcaaaccaag ttataccttc 2160tcaacaccac caagcttagg taaatcagct gagtgtgtga acaagcagag ctactacaac 2220aatggtccag ggagcacagg cacaaaagct aaggagagca gcatgaggta gttgggaggg 2280cacaggcttt ggagtcagac acatgtggtt tcaaatccaa gttcgaccat ttcccattta 2340tttgactgta gacaagttac attcctaaac tatgtctcag atttctcatc tgtaagttgt 2400ggtattacta gttaacatgc aggggttttg tttgtttgtt tgtttgtttg tttgtgaggg 2460taagaaataa cccaagaagc ctagtccttg gtagttgctc agtgccctat aaatgttgtg 2520aaccaggtgg tgagggtttg gtgctgctag agaattctgg tatctgctct gtgcaacaga 2580gtactgtagg 
tgatgcaaga gaaagaagac ctgatgcctt ctttcctccc agctttgaga 2640atggagcaaa ggcctacccc agccaccaag tgagccagtg ggcttgatca gcacaggaaa 2700ggtgaccccg gcagtttcat ttgactattg catggctggc aacatttcta ttgattgttt 2760ccagggacct tggcggatga gctcctgttg agtctagcat ctctgttaaa tctgttctca 2820aataggtaat gcatatggga ggatgctgcc accttgcatc tactagacat cacctatcta 2880ctgtgagact ctccctctaa gccctgctgt ggcctcagag tgcttattgg ccctgtgagt 2940ggggcagcca ctatacattg catggagttg gtacatgaga tagaaaccta ttcgccatcc 3000cttgaaactg ccccagtcca gaagcttcct gttagcacat gtacctcctt gtatgtattc 3060agaactcatt ccatttaggc ttggaaaccc gtttggtgca actctgttca agttccattg 3120tctgctttga gaatgcttgg gcttgtatag tgagctgtca ctttttaatt tgttaggaat 3180tctactcgcc ttgctttttc ttttccagca tgtttaaggg aatgacctcc aaggccccaa 3240atcacagttg tattcatgtt ctttcatttc acagatacaa tccaggccag tcccagattt 3300gcagctgtta ataaatgtga atggttttcc agtaaggggg tagaaaaaca tagggagaga 3360accgggttca gagttcaata tctggattca agtccttcct ttagcacttt actaactgat 3420gtagaataag tcagctactc aataggtgcc tcagtttccc caccaaaatg cagacataga 3480aggtgctttg tctgctttga tgagaagtct ttaagcaagt ctatggggtt caatgtgttt 3540taagaactat aaagtaccat ataaatgtgg cctttattcc cattgtgttc ttggaagtaa 3600ttcaatatag tgtgtacttc atagctgctt ttggactatt gccagccagt gtatcatcct 3660aaactacatg tcagcatagt ataatcctgc cttaggtcta cttttgatta tttaggaaga 3720ctccctgccc ttcctataca tttcacataa tttttaataa gttgtaaaaa agtgatttat 3780aggattcttt gtaagtgggg gaagttaagc agacaaaaag tttttaaatc ttactgcaga 3840gtgtcaggaa ccttttatag caccagacag gtagggacag aacatgagtg gcagcaagcc 3900agacttggtc ttagtgctct aacctgtctg ttagaggctg gccagtcaga cccctggttg 3960aagacgttgg gaatcccagc tctttggagg ggtaagagat tttgttagac tgttaaccag 4020attccacagc caggcagaac tatttctgtc tcatccatgt ttcagggatt acttctccca 4080ttttgtccca actggttgta tctcaagcat gaattcagct tttccttaaa gtcacttcat 4140ttttattttc agtgaagaac tgttctacca gatactcatt tatgattttg ccaattttgg 4200tgttctcagg ttatcggtaa g 4221330683DNAHomo sapiens 330cccagcccat atattttaaa gctctgttat tgggtacata aacatttagg attgttatat 
60ccttttgata atggactctt ctattatgaa aagataatat actgtgggtt tataacatat 120gtaaaagtat gagtaacata ttatcagaag gggagaaatg gaagataact taggcatctt 180atttttaagc atagttttcc ctttgtttct gcattagatg atttacctga aatgtcattc 240aatttaactt actctccatc ctcacccgcc cagctttggt tatgaggcag tagaaagaaa 300tgatctgcct gtggttttct agaaatacga aagttgagtc cttaaggcta cacagaaaga 360aagtacctcc ccagggcttc acccttccca tcctttcagc aggctttttg tctgtcgtat 420cttctctgtt gaaatggcca ttgacaagag gaggaaaggg gttttgttgt ggattgttca 480ggcacttcct ttggggtata tgggggatga gtgttacatt tatggtttct cacctgccat 540tctgatagtg gattcttggg aattcaggct tcatttggat gctccgttaa agcttgctcc 600ttcatgttct tgcttcttcc taggagccag caccgctctt tgaccttgcc atgcttgcct 660tagatagtcc agagagtggc tgg 6833311799DNAHomo sapiens 331gacatggaga gccgaatccc tgcaggccat tataaatgag attatgccat ttgctcccat 60ttcttcttat tctttcattt ttggggctct ccatcttgat gtgttctttg gatcgtgaac 120agatccaaag aaaaggttgt tctgccgtgc tgtttgtcag gatgaaaaac tcttttttaa 180gtgtttaggt ctgcccccag tgcccagccc aatcaagtaa cgtggtcacc cagagtggca 240gataggagca caaggcctgg gaaagcactg gagaaatggg atttgtttaa actatgacag 300cattatttct tgttcccttg tcctttttcc tgcaagcagg aagggaacct gattggatta 360ccccttctga ttgacaacta tgtgccccct ttggagggac tgcctatctt cattcttcga 420ctagccactg aggtcagtga tcaagcagat actaagcatt tcggtacatg catgtgtgct 480ggagggaaag ggcaaatgac caccctttga tctggaatga taaagatgat aagggtggga 540tagctgaagg cctgctctca tccccactaa tattcattcc cagcaatatt cagcagtccc 600atttacagtt ttaacgccta aagtatcaca tttcgttttt tagctttaag tagtctgtga 660tctccgttta gaatgagaat gtttaaattc gtacctattt tgaggtattg aatttctttg 720gaccaggtga attgggacga agaaaaggaa tgttttgaaa gcctcagtaa agaatgcgct 780atgttctatt ccatccggaa gcagtacata tctgaggagt cgaccctctc aggccagcag 840gtacagtggt gatgcacact ggcaccccag gactaggaca ggacctcata caatctttag 900gagatgaaac ttgcccatct ctaaaatttc gggatttctt tgtacccaac aaggttcaaa 960cacaacagtc agcttttatt catgattttt acttccatct gctgatgtag aacatacctc 1020cagagtgacc tcagaaattg tcaaatgtga aaacacaagc catcacagtg agaaatggga 
1080ggttgagtta gattgtctaa ggctggagag tccatatact cccactgtta gctctgaagt 1140gtgtagccag tcttcagatt ctgggtcagt tgcctcagtc tctcttagct tttgccttac 1200tctttatccg accactgccc tgccaggaaa acaaggctct ataactcctc ttacaggtca 1260gcttgacaca aaaagggtgc ctggattcct aatgtttcat tgtcactttt cccagtcaga 1320tgataatgct tttcaaatca acatatattt tgggggaggt tggaagggag agttgaaata 1380ttctaagaat caaagagtag cccactttaa tcagagtatg acccctgatt gctcacagtc 1440atctcctgag cagtgtgagc gagtttcaga tgaggaggct gaaggccagt caggcatgct 1500cgaggattcc aagtctgtag gtgggagggc agagatttag tcctgttggc caaagcctct 1560agggaatttc tcactccagt ggagaaggca acacacttac caaactgtgt ggaaactatc 1620tcatttgatt agaaatttta cctcaagaag aggaaggaca gttgagaaag aacattttct 1680tacacatgag acagctaagg cttacaagaa ggagaggaat aatgaggcaa aataatcctc 1740attaatattt tcattcctcc cctggggatt agaactactt tcagacccga ttttaatgg 1799332545DNAHomo sapiens 332tccagaccca gtgcacatcc catcagccag gacaccagtg tatgttggga tgcaaacagg 60gaggcttatg acatctaatg tgttttccag agtgaagtgc ctggctccat tccaaactcc 120tggaagtgga ctgtggaaca cattgtctat aaagccttgc gctcacacat tctgcctcct 180aaacatttca cagaagatgg aaatatcctg cagcttgcta acctgcctga tctatacaaa 240gtctttgaga ggtgttaaat atggttattt atgcactgtg ggatgtgttc ttctttctct 300gtattccgat acaaagtgtt gtatcaaagt gtgatataca aagtgtacca acataagtgt 360tggtagcact taagacttat acttgccttc tgatagtatt cctttataca cagtggattg 420attataaata aatagatgtg tcttaacata atttcttatt taattttatt atgtatatat 480tgtgtcagtt cagatgccaa aaagaggtct tgaacatgtc acaggctctg atggcactga 540ccatg 545333578DNAHomo sapiens 333agcctcccaa agttaagtgc tgggattaca ggcatgagcc actgcggccg gcattaagta 60tgagttttta agttagccca ctttgttaat gactatgagt actaatagct taagataaag 120aagtttctag gtaatcttgt ttgaaggatg atgtaaaaat ataaatttaa actgtgagtg 180acaaaataaa cttccttaat atttgcctac atttagagaa atggagcatt cagctcagaa 240aggaagaatg tctgtggttt taaggtaaaa tccatattcc aagactcagt gaagaaagtt 300cagtgataaa gaacagacta ctctcatctt atgaagaaat ggagcaattt cacttggaaa 360gactaggaag acaaaatgtt acagacgtat ttgttgtgcc acaaaatagg 
caaggtcagt 420tttgaacaat aagaactcca taaagtagac cagggcatct cagaagtgag gttccatgag 480cccaggtggg gcacaggctg ggtgatcttg agtggagagg aagaggggtt ttctgagctt 540caagagctgg gccacacagt gtgttggttt tagctggg 5783342355DNAHomo sapiens 334tgccctcagc tactcactcc cacagtcata taccagatct catcattaga caattgtaat 60ccctacacaa tttagttcca tgtatcctct ctctaaccac tattcctcat ctttccaggt 120cattctctct agacccgaat tccaacaacc cttcaaccac actggtacca ctaatctaca 180gattacatct tctttctact ataccttgat gtgttcctga atatctcccg aatcctcttc 240atccagttta atttcaaggt ccatcattat aatcattttc ttacatactc cctcacctct 300cctgccccat taatactgtc ctagtaaaat ctagctctct acccactcca tgcctgcccc 360tatgctgctg taagtagcca gagaaacaca tataataaat gcattcacac aaaccttcta 420acatatcata taatattgtc tgatgtcttc ctactagaat gcctctcagg caggaatttt 480ttttttctaa actaatttat tcactgaaat atcccagtgc ctagaatagt gcatgttaaa 540tagtagaatc tcactcaaca tttgttgaat gactgaatag gagttccaaa atagagaaca 600cagcatatgg gaggggaaaa aaatcagtaa caaaatcatt caagaaattt tcccagaact 660aaaggatggg agctcctaga attgacaggg gcccagcatc acacatgaaa acttcaaatc 720acatgactat cttcaaatta caccagaatg ctagagagaa agagaatagg atacaagctt 780ccacaaagag gagaaaaata gatcacaaat cagaaaagat cagaactcaa aatgttcatg 840aaaactcaac agccatgctc gaagtcacag cacaatgaag aaatgtcctt ttaaaaaatc 900ttaaggagaa ccatggcaac tcaggattct ctacccagcc aaactatttt aatcaagtga 960gagggtagaa tgaagacatc ttcaggcctg caaggtcatg aaaaattaac aatccacaaa 1020ccctcttctc aggaagctac tggaagatgt accaaaataa gagaataaat aaggagaaag 1080gcatgagaca ccggaaaaag ggaacccaac ctaaatcaca tgcaaagaaa atctccagat 1140gccaatgaag ggtgaccaca tctatgtacc gagagggcaa gtcactagtt tagaaaggga 1200caagtcagat gcaccaagat tcaacaaact ggaactgaaa taacaccaga tgcatctgaa 1260aatactgagt gggattaatc tactcttgga gattctgtgg ctaaattgat gatagaaaac 1320caagcaaata caaagaaaaa ccataacatt aactttagag gaaactaata gttctgaggg 1380agatgatcct agaatgcaac ctggctccac tgtgtgagta gtgtttagag ggtcctaatg 1440acacaagcag gctggaatta cactgttcct ttattaggag gatataagag tggaaaataa 1500gtatgtgtgt ggcagggaca aaggatgaaa aacagctaaa 
tcctcatctt ccataaaagg 1560atgtcaatat agaatgcctg aagcagaaca atcaagatgc aacataagta tgttatacag 1620agatacaagg acagtacaca agaatcagct aaaagtattt aacagaaatg gtcaggggcg 1680aggtcagagg agccagggca ggggactgct gtgttcataa caagctttgt aaaaaactat 1740atgactcctt aaactatgtg tccttaaaaa aatgttttaa gaacagaaaa taacaaagag 1800gtaaaatatg aattatctat ccttcatatc tcacttgagt actgatgttt gaaagaagca 1860tattttttta atgaacattt caattagcca gtattttacc atgtaacttt gttaaaatta 1920tattacactc caataagaat gcctttacct gtgacagtag ttcttccttc tctccagcaa 1980gttttcgtag ccttacatct aaaacaaatg aaaaagatca taaactaaat atgtgatgat 2040atagtacata aacaattaaa aatttttcaa actcataaac agctaatatt atctgataaa 2100ttacattact tacagctctg aatatctaaa gaaataaagg tgttaatagc attacagaaa 2160agttcttaac tatctaaaaa gtatttccac acaactgata tttatcaggg caccaaatcc 2220aacatttgtt ccccacagca gtgatttgcc acttaaagac aaacagaagt acaaaggagg 2280tcatttcctt gtttcaagct ttcactagta gacagacaac tcaaatgtca agtgtgttcc 2340taaaggctga gccct 23553354683DNAHomo sapiens 335gccagactct cgttccattc tccagatctc tcttgctcac ccagcatcct gttttattca 60aagtgcccta caatcacatt tctggaatgc acattagaga atgtgcttac taactttcaa 120aatgtttttc agtttgcttc acacttgtat ctctcactcc tctaagaagc ttacacatat 180atgaaaacaa gatgaaaaac aaaaaaattg ttttttttta aaataaaagt gagctaatga 240tacagtatct atctgtgcca tttttcttcc tctagagtag atttctttgt ggggtcaatg 300gatgggtgac tttgatttct cagacagagg tgtcagcaac tttgtggttt cctggagaga 360ggtgtcagat tctcaaaggg ttaaatttaa gaggtttaga ctttaagagt ctgggaagcc 420ctgctctgga agtcatactt ctctgatatc tttttggtca tctgtttctt ggcttaagaa 480atgtggtgga aaagaggtac agaaccctgg ggtaagcagt ggaacataaa accagatgtt 540ccaaggatga gaaacttata acacacttga gaagtctcct gctagcctac tgctccccta 600gcacaggtat actagactat ctctttgcag aacagtttgt agttaagtaa aaaccgatgt 660gtataggccc atagtacttc catccacagg ccttacagtt acacttattg ccttacagtg 720acccagatgc tgatttccca aggtcaagga tgtctgaaga caatgtgcca atgtgcccag 780attcttctag ttaaggatct acttgagtct cagcccttat gctgtttttg ttttccaagc 840tgggatatga aaaagcagaa aacccaatag ggtaacatta 
atccaagtca acatagcaac 900cagtatctta cctaatggcc cttctcctgc tgactccaag acctgagcag cttcctgaga 960cacaacagtg atggctccag ccactggttc atgactgaca tcaccattgg gagtgccatc 1020ggggattata actaagccat gtttctgcag gggggaaaaa cccaccatca caaaaggccc 1080gtatggaagc tgtaagctct gtgaggtcac tctgcaacaa tacatgtttg ctacaggtaa 1140aacctggtta gaatcagtta catgaaatat agctctgtgt aagaaatagc ttcaacctac 1200caaatctgga ttagagaata aacactgtag tttgtattta ggctaggaaa gatggcagga 1260tgaaaggaag gaagatagag agtaaaacag tgagggacct gaattccagg ctaatgctaa 1320catacctctc ccgtcttcac tgtctcctgc aggtcagcca gctcctctct gagcatatct 1380cgctcattcc taaggcaggc aatgtattct ttctgtttct ctagggcctg gttttaggta 1440aggtagcaag ggaaacaatg gcacagaaaa agagcaggtg aaaggtagca gagaagtacc 1500taattcaaat aagcaaagat aaaggcataa aaagcaagaa agcagtcaaa agattggaaa 1560caaacagtca gatatgggag gaaatacaga gttacatgga tatacatctc cagaagagac 1620ttctcataga aactggttct catgcatcaa tttggcaaaa catgtttaat cacatcaagc 1680agggaaataa atcttttcca gtcaatgaaa aaaataaaac aggaaaagga agataaagag 1740agaagccaga gtaaaataaa gctttcctta ctgactgcct aagtgcattt ttatttggtg 1800aacaaaaaaa accccacatt tcatgtttaa ctaaactagt ttattcaaga atacagttga 1860ttttttaaaa aatagttctg gaataaaaat aactattata cataggtatt ttaatttaat 1920attggctgta gatttttctc caagtagtgt ggcaaaatac tcaaatacca cttaattcaa 1980aatagttaac ctccaaaagg attcaaagat caacttctga caacttaatt aaatataact 2040gagactcatt tggctttctg ttatactccc aaaatgtgaa aaacaaaaat aaacactgac 2100aaaataaata cagccaagct atgaagagtt acagaatatg gatttcagaa tcaggctttt 2160gggttctggc acatacttgt cctatgcctc agtttcctca ctggaaaaac agaagggata 2220atagcaccca tcccaagggc agaggcataa atcaaggtaa agcattgcct gtaatgccta 2280gatagcaggg acagttcagg agaatcaggt tggtgatttc atttgtaaat tccctgccat 2340ttccttaatc tcacaactgt cagctgagga caatgcagaa gcaggaacat actttggtca 2400tcaatgaaaa ataaaatcta ctatgaaaaa ataaaatcta ttgtaaaaga aaataaccca 2460gaattaaaaa tacacccaag gtaagtagtc tatgcaggaa tctgattact ggcctatttg 2520aaaaagcctt tccccaaata tttttgttca tatatttaat gtcttctgtt agcattccca 2580ttaatccaag 
aagttaaact atatcaggta actttcctct cagttcactg ggtttggaag 2640tgggacagcg aattgctgag aaattgatag ctgaatagct gggcaattca aaaaatcatt 2700ataatcctgt tttgcaacca aatagggagc aagtaaataa gggatgatag caactacgat 2760ttgtatagca caaattatat ggcaggcact attttatata atttctctct tatacattat 2820tttacatttg aaacctctac atatcctgtg aggtacttgt attatcccca tttaacagat 2880cagaaaattg aggctcacag tggttatatt ttttcgccca aagtcacagt aagtggcaaa 2940accagaaaat gaatctggtt gtttttgttt ccaaagccct taaatagttt tttaaatatc 3000acagctctat gaaggccaca ttatattccc ttattgttag cccagatgat gctaggaaag 3060gagtccatac ggcaaatcct actctttact tatccaaact gcaatgtcaa tatctgactt 3120cttttcaaca atttacattc acactatatg atgtgtctca agtctgcctg tgaattaaca 3180atgtgcattt ctagcaccat ctagctagtg ttaacactcc attatgttaa taattaataa 3240taactgaaac attgggaaaa caaagcacaa caatactttc ccatgtgttg agtgtcactt 3300tatggattag gtatttttgg ttactggtat ctgcatgcat agttatgtca tgtatcacca 3360catataagtg ggtaaatgat cactgtcaca acatgctcta cataaacaac aacactgaat 3420aaaaaagacc tctgaggaac aggccaattt gaaactagga attctagcaa atgatataca 3480tgacatttgc tcttcttcca catcgtattg cactgggttt tatttttacc ttcggacttt 3540ttaatttcct cttcccataa ttacagatga gaaaataaaa tacatcctgt aaattcaccc 3600acttcaccac aaagtttgaa gactactaaa ataccttata attggatcaa atgtattcaa
3660gctggatcta aaaccctctg tattacctga ccatataacc actacccttg tgtttgtgtg 3720caacaatagc tcctacagta gatttttttt agggtaaaaa gtacacgctt gtagagttca 3780aaataactct ttatccctga cctaacctca aatcctacca cccggaagcc aaaaggatgt 3840gtataatggg ctgaactttt gggcaagggg ttaattctcc acataattgt actggggaac 3900aaatatcttt ggtcagaatg gaagtgagtt tatgctgggc tatagagata cgcaagttct 3960tcatacgcac ctattctata catgggctcc tggtgtttag aaccgcagtg gagctagagg 4020caagaccact aatgaactga actttaacct gggaataatg gacatatttc ttcattaagt 4080tactaaatgt aaatcttaaa aatgaagcta gagacaagta gttactgacc atactgaaaa 4140tgtgtcttaa aagtcaaggg aggaccactg cccttgtatt ataatgataa caaatgttgg 4200caaggacatg gagaaattgg aacccttgtt cactagtggt gggaatgtaa aatggtacat 4260ctgctacaga acacagtata actgttactc aaaaaaatta aacacagaat taccatatga 4320tccagcaatt ccacttctgg gtacataccg aaaacaactg aaggcagagt cttgaagagt 4380tatttgaata cccatgttca cagcagcatt attcacaatg gccaaaaggt agatgtgttg 4440atatatcaac agaagaatgt ggtatataca tacaatggaa tatgattcag ccttaaaagg 4500gatggacatt ctgacatatg ctgcaaaatg aaccttgagg gcataatgcc aagtgaaata 4560aatcagatac tgtatgattc cacttacatg aagtacctag agcagtcaaa ttcacagaga 4620cagaaggtgg aatggtagtt gccattccac caggggtttg ggagaaggga ctgaatgggg 4680agt 46833363430DNAHomo sapiens 336aggcacaacg tcaggttttc ctatggaagt ctctgtctcc tactgactca tttttcatac 60tgtgtaaatg ctcaagaaga atcaaaagga caggtttttt caatctctag gttaaattct 120actgtagtcc tcatcaatga gcttctaacc aaagcccaat ttcatttcat accccaattt 180ttttatcttt ccaaagaagt gtctcctgga ggtcaaacac ctcttttgtc atggtgtcta 240ttttctgctg catgcgctgc ttctcctgca gaggaagagg ggaagagaag taataaaaga 300gcagaaagaa aagggagagg aggtttgagg gaggaaacaa aaataaagcc gataaagaaa 360cttaaccaaa agggaaagtc tgtgatgaac aggaaaagca aaattggtct gccaaaagaa 420aagatgacat tcacagtctt ggccacaaga ttcttattgg cttgccccta caaaagtaag 480caaaggaacc aggaataatt gttccaacca cagctacgtg gcagcaagcc agctagaatt 540tctgtgtaca tacagctcca tatgtatatt ctttctttga taactgcctt tttaccaaac 600aagaacttac attcctagag agggaaattt aggtttgctt atgaacaaat gatctttcat 660cttagagaac 
aagcagtttt gaattttatt ttttaagcag aactgatcat tttgaatttc 720tgttagcaaa atctatgaca gcaagaacac catgaatttt gtattatttt aaaattatat 780tattttgaaa catttaaatt tagcatttaa caatccttaa atgacctttc taattaggca 840atggtgctta acaggttttc ttcttatgca ttattggtaa attattatgt cctcctttcc 900ctactcatac attaggtact ttaccatgga attttcaatt ccaaagacca aaaaacatta 960tttgtaatat ttaaagtttt tcagcataac catagatact aacatctaaa agatgttcat 1020tctagatgta aaaaacatct aaaactatag ttctcaaagt ttgtatacct agcaccctaa 1080gcttttaaag aagccacagt gatgaactat agaaatcaag cattatattc ttcttaaatg 1140caattacaat taattactag aacactttac cagtcctaac ttaagctatt gaatttgaga 1200agcagccccc aaagcaggtt tattatttta tgtggttggc attttggcac aaaaagataa 1260aagaacaaaa agggaaagaa tttcacatta ttttaaaata ccagcaggat acagattctg 1320gaaaatatgc ttcctacctt atatggagaa aaaccaagaa aattaacttc acatgtaatc 1380tgatagatcc aaaaggttat ctgtatctgc acttgaaatc cacaaattct gagtatgttc 1440aattattctt aatgatgaca aaaattaaca cgtcttcaaa tttaaagtca tttctttttc 1500tctattaaat ggtttttaaa aatcatttgt agagagacat attaagaggt aggtccgagg 1560ggaaagagag aaagagggag agaaaaagaa aggctaaggt ctgagtagcc aggaatgtgg 1620acaagtgtgg ttgtgagatc tctctcctgg gatcattaac aatctatgct tcctgacatc 1680tctggcgtgt caacactaac ttaacattag atgcctttga tagccacacc tagatagtgg 1740gcaggatccc ccttcaaact tatttccata tttatctaaa aacatcgtct caggagggaa 1800aaccacattt aaagaaaaaa gatgcatgca atgtagcagg cctgcaagga tgactaatgt 1860tttcaaagag ttcttggtag actatgcttc attccattcc taagatgttg ccagcaatgt 1920ggcagagtcc cttcgcttgc agaaacctga accttcagac taaccattct ttaccttttt 1980gtacagaacg tatcttgatg tttcttcttt tttcatttag ccacctgaga aatgtattta 2040cctgagtgaa aatcaaactt attccccaag aatcatgtcc caaaagatgg cattcactaa 2100ttccaaagaa taatgttatt ctataatttt tccttttgcc catttcctaa gatatctgta 2160ggaaacagtg tgcttaggaa taaaagacac aaaaatttct gctaccaaag tggggtaatg 2220tttataggat ttatagtatt aatttttaag cataatctgg tttatgtttg aaaatttgta 2280gtgtacagtc aaatataaag agacaaactc tgatgcatct taactctcct tccctcccaa 2340cacatcctca tcccattcaa ctcatttttt ttcaaaatta agtattccca 
cagttcatgt 2400acatacctca ataagctcat ctctttgccg caggccttct ttaagttctt ccatcttatg 2460ctgcagcaca ctacacatat gtttctgcct ttctaactcc tgttattaaa caaataatat 2520catttacaca ggtcatggca cacaagaaat ttgaacatac acaatacaac acagaggtta 2580agtatgacct ccagaaacat gcccaaactc ctgattcata gtaacttaga aaaattgtgt 2640attctataga aaagttaaga aaattttaaa attccatctt gtataattat caggaaaacc 2700tgaactaatc aatggcaaaa ttattaaaaa caaaagataa tttagtaaag taacaggtta 2760taaaatgaac atatacaatt caatgacatt catatacaaa taaaattcaa agaaggaata 2820ataaatgcaa tatcaaaata aaatcaatat taaataaaaa acatacatgt aaacttacaa 2880aatatatcaa aaacctatat gaggaaaatt atataaagca ttcccaaaag acagagaaat 2940aggattgaat aaatggaaag gcataccgtc ttcttggatt aaaagtctca caacattata 3000aaaatgccag ttctccctaa attaatctat acatttaatg tagtacaaat aaaaatacca 3060tcaggttttt cttttatcat catcagagca agttgatttg aaagaaaaac acaagaaaaa 3120gtagccagaa aaatacatac tgaaaaagaa gaaagccggc cttattaggt attaaaacat 3180attataaagc ttctataatt aaaacaatgt tgttatggca catgaatata gaccaaggga 3240gcagaataga gaattcagga aaaacccact taaatataca aatatattta aaaacaataa 3300aaataagagc atctcaaatc aatgagaagg aaagactttt aaattagtaa tgttgggata 3360actggatatc catttggaaa aagataaaat tggaactata cctcatacca cacaccagga 3420caaattccaa 34303371323DNAHomo sapiens 337caccattgcc aacacttctg catacagagc atgcttgggc tgcagaatgg gccctgatac 60ctttagttct ttaagcccct gcatgtatct cccttctaca tcctgtatct ggtccttaag 120gtcatagata tcctgcagga cataggaatg aaccattgca taaaaccatg cacaaacgta 180tcttaaatcg caaaggattc agatgaaatg tgactcactt ttgtatattc cagactaaaa 240gcagagaaat caaagtacaa aaacataact cccactccca accactgaaa agggcaaata 300ggtcagggga tagtgggact gggggaaggt ctagagtaat caatctaatg ttaaatattt 360tcttggcatt aaattctgtt aataacagtg tagcaaatgg ggacagggct atatatggag 420gaaaaagcta tataaattat aatatttaaa atcatacaac ttttaatatt ttataaatca 480cttaaaattt ttttagcaca atgcttcacc tagaactagt aaaatataca gtaaattaat 540aagaggtggc aaattttaat gattcatgca aaagtttttt aaaaatataa taaatgaata 600tgaacaaagt tttctttcaa tgacttggtt actggccaca attaacttga 
gagaaaggga 660gtaaagggag ggaagttaaa cattttgaac agaatgtcaa atgagatatt ctatcctgag 720gataccattt aaatgatgag aaaagcactt gctccaaatg ttactataat ccttataaga 780aaagtgaaac aggtcaaatt ttaaatggaa aatactgttt ccctgtgtct gaccttgtta 840tataaggtct attctgaatg ctcgatttat gtccgaaata actgcacagg gccctaaata 900caattctgca attacaagca ggatcaatta ttaaaggctg attatacaca tttttggtat 960tatattttcc ctgcctcctt cattgcctct ccagtaggtt tgactgtctt ttacttatcg 1020attcattcaa tacacattta tgaaggacct atttaggagg cagatggtag gatacaaaaa 1080taacacttcc ttcaagaagg tcattctctc aaggaagaaa aaacaagcaa caagtagtat 1140gacaaaagaa agctaagtct aaggctggaa ggccttgtcc tctcatatcc ttttgtccca 1200ttagaatgca gtagctaaag gcaagagttt atcttattca actttccacc ctggcattta 1260tgtctggtac atagtaagag ctggtaagag ctcaattaat actgtcacct tcaaaccaat 1320ggc 13233382268DNAHomo sapiens 338cttagtcacc gcctgtcctc tacctcccct ccagtgaaga ggggacctcc agtccaggca 60agctcatcca tctgagtcct ggactctcct ctcacccacg caggtatttt gatctgtcag 120ttacccctct ctccttcctc aaccttcctc ctctctctac tggttcaacc caaccaaaga 180aaataataat cactctcacc tttaaaagaa agctaaaaac cttttcttaa atttctctct 240ccctaccaat tgcccaacca attcccagtc tctcgtattc actccagagc aagtttctca 300tattttctcc ccttgcagtt ttgacttgct cacctcatcc tcactgttga accacagggt 360tactgaactt ctggcctgca atgcttcctc tgactgtgat caaagttatt ttttaagaat 420gcaaagcaga tagtattgct ctcctattta atatcccaca gttgtcaggg taaaactgaa 480ttcctttcag gctagcatac aagatgcttt gtgatttggc ccctcactac ttttccagcc 540ttctctctta cacgctacta ttcttcactt ctcatgccac cctctggcaa ctttgctata 600caggtagctt tctgggttgt ctgaatgcat ctctttgagg aagccttccc ccttcaatta 660ctttccttgg ttaattttta tttttccttc aaactttagc tcagagtttg tcttttctga 720aaagcctttt gttacctctt ctcctagtcc aatttaaatg atgctttctt gaattacctt 780aacagcccca ttcttgctac taccattgta taacaccacc tggttttatt ggtttgtatc 840agtttgtctt ccctatgaaa aagggaaatc ctgaaggcca gaacatattt tgcctcttaa 900atctccaagg ccaagtgctc aataaatctt gaatgaacaa atgtatgcaa tttctctcac 960tatcccctgc ctcacaaaca gcaggttcag caaataaaat atatattttt taaagttctc 1020tttttccaag 
taaaggggtt aataaggatg gagacttaat gagtcaaata agcttttcat 1080tatctcaggt atttctgtta cccaaatatt attacacttc tattactatg aaaaactgag 1140agataactat attcaagcag ctgaatacca tcaaggctca aaagactaat aattcccaca 1200atctagttct ttgtacactc acatattttc catttaaaac atggccagtt gtttctgaga 1260gaattacttt tacatacatt acaaagtggg ttttttttgg catttgtaaa ttagcagaaa 1320ttccaacacc attaaattta caggaaagag gacagaaaaa agaactatct aatctccttc 1380ctttcaagct ctgaaacact tgagtcacac ttttgtacag tacttatttt ttaaaaagaa 1440atcatattcc ttctttctac taagtcacat taaaggttaa aagttccagt atattaccaa 1500gtcaaaaaac tgcttttaaa aagaaaacca aaaacttcag tttacccgca attcacttaa 1560tgaagtgtct ggatctatta agctgctggt gtccccactt cctcgtctgg atgagtttcc 1620acttagaggg gttgttgctg aggcagaatt tcgagatgaa ggctagaaag agaaatatga 1680cccttctaag tctcatttct ttaaaaatat gtttgctgtg gaaactaaaa acagggcagg 1740gtggttgggg gaagatgtgg gtttcagatg aagaagttac catgtgtaat caatggtcaa 1800tttagattgg ctttgcaggt atccaggtat acagagaagg gcccaagcat actaagcaaa 1860atttcatgac acaaatacct tatcaaaatc tataattcgt aataaccaaa aagaaaatcg 1920tttttatgag ggttattttc taagtttatc tttgctatcc tgtaaacaat acttaagtga 1980gcacatcaga ttaacttaaa gcagttttag aattttataa ctgatatgct aatcagcact 2040tccccttttt aatttctgaa ctgcttatca atctcttctt tctctaatac tctgttgagt 2100tgataaaggt gaagatacta tttgcagcag agagtagcaa cactggatat cactatgaat 2160gtagactgtt cttcccaaaa ttcatttaaa cgtgagatta tgctattgat taataaactg 2220catgaatctg tgctgtccaa tatacttgcg attagccaca tgcagcta 22683392183DNAHomo sapiens 339tgtggctcgc attacatttc tactggacag agttaattta tctttcttgg gtatagcata 60aaattaatct tgataaatca tctccctgac tttctaataa tactttcaga agaaagagac 120tttaacacaa agcaaattct agaaaggagt tattttactt caagaaacaa tttatatata 180taaaaggaaa aaatatctca gagtttgtca ccaccattcc taaagaaact tcattttcag 240tgtttactgg cttcttctca ggatcaattt ctagaggaat gaaaatcaaa agagcagcca 300aaagctaaag aggtgacatt caaggaaata aaatgcagtg cctcttctct tcccagggcc 360tcacatgatt taactccgaa ggaattttgg agggtcaagg acaggaaata gatgaaggga 420ggaagagcag ctacatcaga ccaattgttg taaaataaat 
aatgaacaag ttacagatag 480caagtttaat tgaccttgag aatttccctt aggaatttac atactcactc ttgtataatt 540ttcagcatac tgtttgtcag atttttcatc caactagaaa agaacaaaga taaaaatcag 600taaagatgct tagcaagttg actttttttg ggaggaaaaa agtgatcttg agttttccaa 660gttcaatgat gtttgaagcc taaaaataaa catattaaaa atgaaattaa aattccagtt 720taacataaga taaaatgtca gcaactttaa tttttatcct aagggagaca ctgagtaagc 780aatctatgtt tactcaaaag tttagaactt ttgggtaaat tttctgttta aggtatttaa 840aacacattaa agaaggctac atataaagtt ttagcactgc aaatgttaat acaaaaaaag 900cttgtacata tcaaaaagaa aagtctgata aaaccaaaag cgaaaaactt ttatttctaa 960acattattag gacacaggac acacagataa agactagtac actcagccaa aaaagttcag 1020attagattaa agcatgttaa atatgtcacc aaaatatttt cccccacatc aaggattttt 1080tgttttctca gtaatacatg cacatgacca aaaaaaaaaa aaaaatcaaa cagcacaaaa 1140gagtatgagt agggctctct ccctaccatg ccataccctg accaccagtc tctaatagca 1200accaaccaca aagtctttaa aaaaattcta gtaagtgtcc catatatgtt ctacatcttc 1260tttttcccct taccaagtct cagagatttg agaaattact tcatctttct tcatagcatt 1320tatcactccc tgacactacg gtatttatcc atttatttat tgtctgctct ctttctataa 1380tgtaagctct atatcagtat gttcttttct gtactcctag tatcaagaat agtgtctggc 1440atagagaagg gctcaaaatt tgttggatga atgaataaat cataccccat ctatactaat 1500agacttccct catttatttt aaacagcttc atagtagcct gttatctcaa cttgccacaa 1560ttaaattaac ctatcttctg tagataggcc ttaaggttgt ttcagacagt atttcaagaa 1620gggaggtacc ttaaaagtct agtccaaacc ctgagtttta tagctaagga aaaagaaatt 1680cagaggtgac agattttgct gaaggttgca gagctgaatc caacccttag ttttgtaaat 1740tttcaattca gtgttcatat tgtttacctg aatatatatg ttattccaat tatcaccagg 1800agaaaaaagg cacataggga tgtaactaaa tgaaaataac tccaactcct acacaaacct 1860tctagaccca atctagagaa gagatctgaa aaccttctca gttaattatt aaaacatgaa 1920acatcatttc tggctctact cagaacactt caccctaact tccacatatc tgaaagtttt 1980tagtgttctt tttattaccc acacttatgt ttataaatat tctttattat gatgatagta 2040ttacatcagt aatataaaca ctatttttag acatttaact ctttgttact tcttgctgta 2100aaaactctga aaaagaaact aattttagtt cattttaatt tttaaaacaa catataaatg 2160tttcaaagca ggtaatgaca 
gcg 2183340892DNAHomo sapiens 340tgacctccaa aatcatccag tttttaatag actgtccatg gaataatctt atatttcaga 60ggttgcatca ctgaactaat ccatcacaat gttgactcta ttgctttcac catcttctga 120gagatcccaa ttgtcgcaac ccaaactgat catgcaccaa acctcggaga ttgtcaagcc 180agcaaaaatt acccaaagca accatgccat agttctctta ttttggctac actaacttgg 240caagttctca cttcttagac ttgtactatc ttgtgtgcta tcttccattt agacaaaaac 300ctagctgaaa cttggtattc taaaagccca ataggatcca acagccaaga gttctacatg 360gctgcaatgc aataacaagc taatgttcat aatgatgatc cagagtctgc cagatccaga 420aaaaagcttt ttaaacagaa taaaatttaa tcatatttaa tatattaagc tgtaaaactg 480tagtttaata aagtatttac agtgtagttt cttcaatgca gtcattggat cttgatcttt 540tgttaatttc taatttatat atatactaac ttgataatgc tactcaaaaa ttgtttgaaa 600aaatatctga tttccacaac cacagaaggg aagacatcaa tcaattttaa caatttccta 660aagcaaaact ggctaaggat tccatttaga aatgggttta attattaaca atcagcaaaa 720tttcagtttt gcttaaaaaa catttacatt tgtttgcttt attgtagcct gacttacaaa 780acagaaataa agccaatgta gaagaaaata agaggctaga acccccaaaa atattatata 840ggcttgcaga aacatgacct ggcacccttt ctcagcacct cctagctcag aa 892341433DNAHomo sapiens 341ccagatttgt aaatccctgt tctagagaac tttttcctta tgttaaagag ttacagttaa 60ctagtagggg cagggggaga ttttttaaac tgtcagcact ttcacttata tgtaagtact 120aaagtaaatc atcacaaaat aaatataagg aagacataac tattaattca gatattataa 180attaagaggc tgacagaatt cagtttttaa gagaacagat gtaaacagta agtttaattg 240tccatactta tgggggaaaa acggggattg ctaggcaatt taatataaag ggcaatattt 300caggggaatg acttggaaac cagtcttcta actttgtatg cagagagagt tattgattag 360acccacacca agatgatact gttgcatagc cagatttgag aaccaatgat ctggaatgct 420taagaaccac aca 4333422961DNAHomo sapiens 342tcagtccatc agcctcctcc agcttccttg tacctcctct ccaatctata cctcactgaa 60tatcacctta cccacttgcc tgccaggagc ttcggtgctg ttaagaaaaa ttattggctg 120ggcacagcgg ctcatgcctg taatcccaac acgttgggag gccgaggcag gcggatcacc 180tgaggtcagg cattcaagac cagcctggcc aacatgatga aaccctgtct ctactaaaag 240tacgaaaaat tagccgggct tggtggcaga tgcctgtaat cccagctact tgagaggctg 300aggcaggaga atcgcttgaa cccaggaggc ggaggttgca 
gtgagccaag atcgtaccac 360tgcactgcag cctgggagac aagagaaaaa ctgtgttaaa aaaaaaaaaa aaaagagaga 420gagtccgggc acagtggctc atgcctgtaa tcccagcact ttgggaggcc aaggtgggtg 480gatcacctaa ggtcaggagc tcaagaccag cttgaccagc atggagaaac cccgtctcta 540ctaaaaaaat acaaaattag ccaggtgtgc tggtggtcgc ctgtagtccc agatgctcag 600gaggctgagg caggagaatt gcttgaacct gggaggtgga ggttgtggtg agccgagatc 660gtgccattgc actccaggtt gggcaagaag agtgtgaaac tctgtctcaa aaaaaaaaaa 720aaaaaaatgt atccaaaact tggaaacaag aaaagaggat gactgcatgt cctttcaaag 780ttgtttaatg tggtttacag gctcactcaa ggattattta attatttaga agaataaatc 840ctcttagaac cctgggtttt taggccaaag tctcacctat ttattttctt tttttttctt 900ttcttttttt tttttttttt gaggtggaat ctcgctctgt tgcccaggct ggagtgcagt 960ggcacaatct cagctcactg caacctctgc ctcccgggtt caagcaattc tcctgcctca 1020gcctccggag tagctgggac tacaggcaca tgccaccatg cttggctaat gtttctattt 1080ttagtagata cagggtttca ccatgttggc caggatggtc tcaatctctt gacctcgtga 1140tccaaccatc tcgacctacc aaagtgctgg gattacaggc gtgaggcact gcgcctggcc 1200agcctcacct atttcattat attcccttaa atgtacagtc atgggtcact tcacaatggg 1260gatacattct gagaaatgtg ttgttaggtg attttgtggt tgtgcaagca tcatacagtg 1320tgcttaccca aacttggatg atgtagcctg ctatacaact aggctatatg gtgtagctta 1380ttgttcctag gctataacct gcataacatg ttactgtact aagtactgta gacagttgta 1440gcacaatggt aagtatttgt gtatctaaac atacttcaac agaaaaggta cagtaaaaat 1500atggtataac agattaaaaa tggtacatcc atatagggaa gttaccatga atggaacttg 1560caggactgga agtgagtgag tgagtgggtg agtgaatgtg aagtcctagg atatgatcat 1620acactactat agactttata aatgctgtac acttaggcta catgaaatta aaaatatatc 1680tttttcttgg acgggcatgg tagctcatgt ctgtaatccc agcactttgg gaggccgagg 1740tgggtgggtc acttgagatc agcaattcaa gaccagcctg gccaacatgg tgaaaccctg 1800tctctaccaa aatatacaaa aattagccac atgtgatggc gtgtgtatgt agtcccagct 1860actcgggagg ctgaggtagg agaatcgctt gaacctggga ggcagaggtt gcagtgagca 1920gagatagcac cattgcaccc cagcctgggc aacagagtga gaccctgtct caaaataaat 1980aaacaaataa aaataaaaag atgtccagtg cctaatctaa tcaacattga ttggaaaggc 2040agatgattaa gcacaaataa 
agcctgttta gtcaagaaat ttactattgg atacataaga 2100tgaatctata tataaaaatg aaaaggcaat gttagagcta tgtaggaatg aatgcaaatt 2160aataaaatgt cattggagta agaacagaac ttactgacaa gacaagattt ccattgaaat 2220cttagtgtgt gtggcttttt ttttttttag atggagtctt gctccgttgc ccagtctgaa 2280gtgcaacggc tcgatctctg ctcactgcaa cctccacctc ccgggcttaa gcaattctcc 2340tgcctcagcc tcctgagtag ctgggattac aggcacccat cactacgtct ggctaatttt 2400tgtattttta gtagagacgg gttttcacca tgttgcccag actggtctta aactcctgac 2460ttcaaatgat ccaccagcct cagcctccca aagtgctggg attacaggca tgagccactg 2520tgcccagcca tcctagtgat ttttttaatc tatatatata tatatatata tatatatttt 2580tttttttttg agacaggtct cgctctatca cccaggctga agtacaatgg cacaatcatg 2640gctcattgca gcctcgatct cccaagctaa agcgatcctt ccacctcagc gtcccaagta 2700gctgaggcta caggtgtgtg ccaccatacc catttagttt ttttaaattt atttttcttt 2760tgtagagaca gggtctcacc atgttaccca ggttggtctt gaactcctgg gctcaagcga 2820tcctccaagg cctcccacct cagcctccca aagttctagg attataggta tgagccactg 2880tggcccctct agatttttaa atgatgttat tatttgcagt aattcttcaa ttcataggaa 2940cttaatctca gggcacagca a 29613432915DNAHomo sapiens 343acttaatctc agggcacagc aagagacaga acttcaatgc attttttttt tttttttttt 60tttgagacag agtctcaccc
tgtcgcccag gctggagtgc aatggtgcga tctcagctca 120ctgcaagctc cgcctcctgg gttcacactg ttctcctgcc tcagcctccc gagtagctgg 180gactacaggc atctgccacc acgaccagct aatttttttt ttgtattttt ggtagagaca 240gggtttcgcc ctgttagcaa gaatggtctt gatctcctga cctccttcgt gatccaccca 300cctcggcctc ccaaagtgct gggattacag gcatgagcca ccacgcctgg caatgcattt 360tttaaataac cattttttcc cataccattg aatgaattat ccatctcttt ccaggggaat 420gaaattcccc tgtaaagatg agcccttgac tcacacctct aatcccagca ctttgggagg 480ctaaggcggg cggatcactg gaggccagga gtttgagacc atcctggcca acatggtgga 540aacctgtctc tgctaaaaat gcaaaaagca gtcaggcata gtgcatgcac ctgtagtctc 600agctacccgg gagtctgagg cactagaatc acttgaacct gggaggtaga ggttgcagtg 660agccaagaca gcactactgc actccaatct aggtgtggag agacactctg tctcattaaa 720aaaaaaaaaa aaaaaagatg agccctcaat tacaaacttc ttttgggatc aatatcaatc 780agaagttatt aagtgctata gtttgtctga tgcagaagta aacatttaaa gttttgacat 840aaactttagg gttggcaaga gcattaagtg agttaatgca tacatgttag ctattacatc 900acaaatcact gaaatgttgt agtttaatgt caaattatta caagttgcta aaatagactt 960gcatgggaat ctaaagtaca gtaaaaataa tgcttaatta ttagccaaag tgctctctca 1020gctaaaatgt ttactcattg gtctgccatg aatgctttca aataacaatc attctttttg 1080gtgttcagga aaagatcagt atccagactt aaatttggga gctctgaaag gaaagcaatg 1140aactttccct ccaacacttt tggatgtttt tatgtactcc ctcaacccca tgggccccca 1200cggtaccctt atgatacttt tgggaaccat gatgtctctt tcttctgaaa agctcaatgc 1260tgcactctgg tactattgcc tgatattaat atgtagttat ttatttttta actttatgtg 1320tatgtctgat ccctcccaac tgggatggaa gcttctcaag aacaggatct gacttggcac 1380agtggctcac gcctgtcatc ccagcacttt ggaaggctta ggcaggagga tcacttgcac 1440tcagcagttt gagatctgcc tggacaacat gacggaacca cgtctccaca aaaaaacaca 1500aaaattagct gggtgtggta gcgcttgcct gtaatcccag ctactcagga ggctgaggtg 1560ggaagattgg ttaagcctgg gaggttgagg ctgtagggag ccatgattgt gccattgcac 1620ttccagcctg ggcatgactc tgactaaaaa aaaaaaaaaa aagtaaataa gttacaaatt 1680aataccatac gagcctgtgc ttcattacag tgattagaaa ggtcctaaaa ggctctgatg 1740cctgaaatat gtctctcaaa gacatgcttc tgtgccttag agcctccact attgcttact 
1800tcttttattt ttatttacgt atgtatgtat gtatgtatgt atgtatttat ttatttttga 1860gacagaatct tgctcttgtt gcccaggctg gagtgcagtg gcgtgatctc agctcactgc 1920aacctctgcc tcccaggttc aagcaattct cctgcttcag cctcccaagt agctgggatt 1980acaggtgtcc gccaccatgc gtggctaatt tttttgtatt tttagtagag atgggttttc 2040accatgttgg ccaggctggt cttaaactcc tgacctcagg tgattgccca cctcggcctc 2100ccaaagtgct gggattacag gtgtgagcca ccgtgcccgg ccatgtattt atttttttga 2160gacagggtct cgctctcttg cccaggctgg agtgcagtgg tgtgattatg gttcatggcg 2220gcctcagcct cctaggctca agagatcctc ctacctcagc ctcctgagta gctgggacca 2280caggcaccac cacattagcc accacgcctg gatgattttt tatttttatt ttttgagaca 2340gagttttgct cttgttgccc aggctggagt gcaatggcgt gatcttggct cactgcaacc 2400tccacctcct gggttcaagc gattctcctg cctcagccac cctagtagct gagattacag 2460gcatgtgcca ccatgcccag ctaattttgt atttttaata gagatggggt ttctccatgt 2520tggtcaggct ggtcttaaac tcccgacctc aggtgatttg cccacctcgg cctcccaaag 2580tgctgggatt ataggcgtga gccactgctc ttggctgatt tttgtatttt ttgtagagat 2640ggggttccac cgtgttgccc gggcttcgct tgcttttttt agataaaatg ttgtctccag 2700gccagatgtg gtggctcaca cttgtaatcc cagcgttttg agaggtcgag gggggaggat 2760cacttgagcc taggagtttg agaccagcct gggcagtata gtgagacccc tgtttctaca 2820aaaaataaaa aattagccag gcgtggtgtt tcgtaccagc tacttgggag gctgaggcag 2880gaggattgct tgagcccaag aggctgaagc tgcag 29153443182DNAHomo sapiens 344cgtggtgttt cgtaccagct acttgggagg ctgaggcagg aggattgctt gagcccaaga 60ggctgaagct gcagtgagct gtgagtgtgc cactggactc cggcctgggc aagagagtga 120gactctgtct caaaaacaaa caaacgtggc cgggtgcggt ggctcatacc tgtaatcccg 180atactttggg atgctgaggc gggcggatca cttgaggcca ggagttcaag accagcccgg 240ccaacatggc gaaactccat ctctactaaa aattcaaaaa tcagccagat gtggtggtgt 300gtgcctgtag tcccagctac tcgggaggct gaggcacgag aatcacttga acccaatagg 360tgggggttgc tgtgagccaa gatcatacga atgtattcca gcctgggtga taaggcaaga 420tcttgtctca aaacaaaaaa caataacaac aacaacgaaa acacaaagaa caaaacaaaa 480ccaaagaaac acaaactttg tctccagaag gcctctatta gaatctaaat acctaacctt 540cgaggtgtaa ctcactagca cgttgtctct ctaacagttt 
cctagcagac agttcaggtc 600taggattgta tccagggaca gagctagaga agccggagcc ccactgtggg gatgctgatg 660aggcagaccc ctcagtgagg ccagtgaaca gatgagtcca ctgggctggg cacctgtgag 720atggggcaga ggaacaccca gataggttaa agggcatctt gacacaacca gagtttatct 780gtagcatagt cttcacaaac caagccagaa cccaagccag agccgcatga gagtgaattc 840ccatctggct ttgggggaca aatgactcat ccaaggctac actcagcgct gagtggtgac 900tgggagccag tgcgctctgc tgactgctcc actttcagaa atacttgcag atctcaatta 960tctaattgca attgcaacga gaaccaaagc aggggagcag agacaaacaa tttctgaggt 1020aaccagatgg ctttattaac tcaagttctc acctaaaatt gccctcaaga atcctgtggg 1080aatgggttgc agtggtgtgg ccctggattc acaaccgaca gagcttctga attctgagtg 1140atctgtacac aaacacacct ctgcctgggt tacacggtaa gggcctcatg tacataatcg 1200cagcatgctt tcctagaaat cgcttggtag cgtgatgggt gggattcaga agtcagcagg 1260aacccaaagt gagtggagag gtcatggcca tgagtcagag gcctctatcc ttcagcagcc 1320tccaacagga agcagacagg gaaggttcct atagttacaa gggcttggct ggtttattac 1380tttcattcta atgggcgttt ttataagaca taagcaaagt acgaaatatt ttatagccat 1440tcggagagga agtccgccac acatttcaaa gaatgaatgc cctctgtaag gataagcagc 1500taactacaag cttcttttgg aattgactag aagttattaa gtaccaaagc ttatctgata 1560ctcaaataaa tatttctcta agttttgact cttgagaggt aaacttcaag gttgacagca 1620ccgaggctat gtggtatact aaaagggtgt gggatttaga gtcccacaga cctaggatca 1680agatttatgg ccaccattca ccaacaccaa caataggatc ttgggtttta cttaattttg 1740gttaactagt tttctcattt gtaaaataaa aataaatgat acctagctca cagtttctgt 1800gaagattaag tgagataata tgaaagaaat cacattgtac ttaccaaatg tttcgtgttc 1860ttttctcttc cttcatatgt gttaagctga attaacaaac tactaagtaa gatttctttt 1920tttttttttt ttcccgagac agggtcttgc tctgccgccc aggctggagt gcagtgacat 1980gatcatggct cactacagcc ttgaccttca cctcccaggt gcaagccatc ctctcgcctc 2040agcctctcat gtagctggga ccacaggcat aaaccaccat gcctggccaa tttttttaat 2100ttttagtaaa gacagggtct cactgtgttg cccaggctgg cctcaaattc ctgggctctc 2160caatcacatt tgggattagg taaaaaatta aaaacaaaaa aaaataaaaa aaaacttctg 2220ggctcaagtg ttcctcccac atcagcctcc caaagtgctg gaattacagg gatgagtcat 2280aatgcctggc 
ctaaaccact tactcttttt tttttttttt ttgagacgga gttttgctct 2340tgttgcccag actggagtgc aacggcacaa tcttggctca ctgcaacctc tgccttccta 2400gttcaagtga ttctcctgcc tcagcctcct gagtagctgg gattacaggt gtgtgccacc 2460acggccagct aattttgtat tttcagtaga gacggggttt ctccatgttg gtaaggctgg 2520tctcgactcc tgacctcagg taatccgcct gcctaggcct cccaaagtgc tgggattaca 2580ggcatgagcc accatgccca gcctttttgt tgttgttgtt gtttttgaga cagggtcttg 2640ctgtgttgcc caggctggag tgcagtggta cgaacttggc tcactgcaac ctcttcctcc 2700caggttcaag ccattctcct gcttcagcct tcccagtagc tgggactaca tgtgggcacc 2760accgcacctg actaattttt gtgtttttag tagagacagg gttttaccat gttggccagg 2820ctggtctcaa acttcttctt tctttctttc tttctttttt tttttttcct tgagacagag 2880tctcactctg tcacccaggc tggagtgcag tggcgtgatc tcggctcact gcaacctcca 2940cctcctggga tcaagcaatt cttctgcctc agcctcccga gtagctggga ctatgggcgc 3000acgccaccac atccagctaa tttttgtatt tttagtagag atggggtttc accatattgg 3060ctaggctggt cttgaacttc tgacctcaag tgatccaccc gcctcagcct cccaaagtgc 3120tgggattaca ggcatgagcc actgagccca gccctactaa ggaagatttc tggccagtag 3180cc 31823452974DNAHomo sapiens 345cccagcccta ctaaggaaga tttctggcca gtagccaaat aggactttga aaatctttaa 60gaataggaag aatctgaaaa ataatcttca aaaaagaaag cagcatgttt catgaaaatg 120tgtaattatg tataactggt agtggccagc caatgctaat tctactaatt ctgtgtacta 180gagttatcta tgtggatatc tagaaactcc tcaagggaat gtgtgaggat ggagaattca 240tcttgttgac catgaatctc tacagaacta agtacagagc cttatagttt gataattcat 300tgaaacagaa tcattttata tccccttcgc actaactggt tctgaaatac cattctcctt 360gggtaatttt gtttttttgt tttttgtttg tttgagacag tctcactccg ttgcccaggc 420tggagtacag tggtgggcac tatatcgtct cactgcaacc tccacctcct gggttcaagt 480gatgctcctg cctcagccaa ccgagtagct ggaattacag gcatgtgcca gcacgcccgg 540ccaatttttg tttttttagt acagacgaag tttcaccatg ctgggcaagc tggtcttgaa 600ctcctggcct caagagatct gattgctttg acctcccaaa gtgcaaggat tacaggcatg 660agccactgtg cctggcctct tcaggtaatt ttggatcccc taaaggctca ctcacaggcc 720ggctctcaca tttttgccca cactttatgt tcaaaacatg tatcagtggt tacctatgct 780ttcggacaga atattcctac 
aagagtgagc cagcttgcac cacagacaag ccaaactatg 840cctgtgtcct tatctatcgc tgcataatcc caatggttag tgatctccat tccacggacc 900ccgtgctgtc tcatacaaag catttcggac ttaaggatag aagcaaactg ccatgtcctc 960tatgccatga tgcttactaa tcctttacca ctctgagatt ttcttggagc tatttatagc 1020tgattttcct gggctgactt tcgaccaaag aggagatgga aactttgttc ttaacagtgc 1080tccaactgtg tgattcaact tggctgcatt ccagcaagtt ctgtgagttg ttaatggagg 1140tgagaaagga gtggggtggg gagtcacagg gatgctaact gtagatctgc tttttctctt 1200ttttttaaat gtttgttttt agagacaggg tcttgctctg ttgctgagcc tggagtgtag 1260tggcataatc atggttcgtt gaagcctcaa actcctgggc taaaacgatc ctcccacctc 1320agcctctcaa gtagctggaa ctacaggtat gcatcaccag gcctggctaa ttaaaaaaaa 1380aaaatttata gagacagggg tcttgctatg ttcctcaggc tggtctcaac tcctgtcctc 1440aagcaatcct ctgaccttag cctcccaaag tgctgcaatt tcagttgtaa gccaccatgc 1500ccagccctgc agatttgctt tttttttttt ttttttttga gacggagttt cgctgttgtt 1560gtctaggctg gaatgcaatg gtgcgatctc tgctcaccgc aacctctgcc tctggggttc 1620aagtgattct cctgccttag cctcctgagc agctgggatt acaggcatgc accaccacgc 1680tcggctaatt ttgtgttttt agtagagacg gggtttctcc atgttggtca ggctggtctt 1740gaactctcaa cctcaggtga tctgcccacc tcagcctccc aaagtgctgg gattgcaggc 1800gtgagtcaca gcgcccagcc tagatttgct ttctatagga ctttatattg tcatcctcat 1860caccactatt ttaacaagct gctagtttac ctagtaaatc ctacatgaaa tagaaatgtg 1920gtcattattg gctggtgcag tggctcacgc ctgtagtccc agcactttag gaggccgaag 1980cgggtcgatc acaaggtcag gagttcgaga ccagcctggc caacatggtg aaacctcgtc 2040tctactaaaa atacaaaaat tagccaggtg tggtggtgcg cacctgcaat cccagctact 2100ggggaggctg aggcaggaga attgcttgaa cccaggaggc agaggttgca gtgagctgag 2160atcgcgccac tgcactccag cctgggggac agagcaagac tctgtctgcg tgggggggaa 2220aaggaagaag tttgagacca gcctggacaa catggtgaaa tgctgtccct gctaaaaata 2280caaaaattag ccaggcgtag gccgggtgcg gtggctcaca cctgtaatcc cagcactttg 2340ggaggccaag gcaggcggat cacaaggtca ggagattgag accatcttgg ctaacactgt 2400gaaacgccgt ctctactaaa aatacaaaaa aattagccag gtgtagtggc gggcgcctgt 2460agtcccagct gctggggagg ctgaggcagg agaatggcgt gaacccagga 
ggcagagctt 2520gcagtgagcc aagatcatgc cactgcactc cagcctgggc aacagagcga gactgtctca 2580aaaaaaaaaa aaaaaaaaaa aaaaattagc caggcgtggt ggctggcggc tgcaatccca 2640atcccagcta cttgggaggc tgaggcagga gaatcacttg aacccaggag gcagaggctg 2700cagtgagcca cgatcacacc actgcgctcc agcctgggtg acagagcaag actccatctc 2760aaaaaaaaaa aatgtggtta ttactttatc tattcacaac acttccctac agactcctgg 2820agttcacctt ctttccgtaa acagggaacc aaccaacaga cacgacatat cctccctctc 2880ccactactct atccacattc ttggtttcct tttttctttc acttccttct ggaacttgag 2940agcttgtttg gaggttctag caggggagca cagc 29743463199DNAHomo sapiens 346tcgtcctctt cgacctagca tgcagctttg ggagggacgc acatggagcg gtgagagagg 60aaggagacac ctacctatcc agccagatca gctgaatcaa ccctggcgat caatggggtg 120acagatgtcg taggaacctt atcaatctgg gtattctgag tcagtttcgt gtacagtgat 180gatgatgatt atgtatagct cagccagact atgacacttg acaactccct catcctgagt 240aggagtacaa ataaaattaa gtttgtgaca tttagttcat tctttttttt ttttttgagg 300tggagtctgg ctctgtcacc caggctggag tacagtggtg caatctcagc tcactgcaac 360ctctgcctgc tgggttcaaa tgattctcct gcctagcctc ccaagtacct gggattacag 420gcacacacca ctatgcccgg ctaatttttt ttgtattttt agtagagacg gggttttgcc 480atgttggcca ggctggtcaa gtgatccaac tgtctaggtc tcccaaagtg ctgggattac 540aggcatgagc caccacgcca ggcccgttta gttcattctt actacacacc ttgattttcc 600atgaacatct caggaatcgg aacatacaga taattccaga aaggagagga atctgtgtat 660ttttcttctt ttgtttcctt attatgcctt gtgagaggcc aatgcatgag tttttaacta 720ggtccatgag aacccacaga gacagcctcg tttgacccag tctggttatc agaagaggga 780agttccttat aattgtgtat gtatacctgg ttggttcaca gatgtcctta aacatgagaa 840cgactatgtc tgaaaaaaac tctcaagttt caccggggct gttgcacacc ctataaatga 900cccatcataa agacctcacc cctctctgat aggataaggc aaaggttaag gtccatcctg 960ttagccacac tctattttcc ttctagctag gccagaacat aatatctgga accaactgtt 1020ctctctctca gctggctgta agaatgctgt atgctttttt tttttttttt ttttgagaca 1080ggctcttgct ctgtcgccca ggctggagtg cagtggtgtg atctcggctc actgcagcct 1140ctgcctcctg ggttcaaacg attctcctgc cccagcctcc tgagtagctg ggattacagg 1200cacacgccac catgccaggc taatttttat atttttagta 
gagacagggt ttcaccatgt 1260tggccaggct ggtctcgaac tcccgacttc aggtgatccg cccccctcgg cctcccaaag 1320ggctggggtt acaggctgta tgctttttat agtgttgggt ggttaagtct tacacaaagt 1380aaatgcccag taaatactta ttactggtca tgactcaacc attcaggttg ttactaagct 1440aagaccagtc accccatagt ccctgccata ccatatgctc ccagagagag cacttctggc 1500cctccctatg atggctgcca ccaccactac tttgtgggga agaatagtca tcctgacggt 1560tagtcatccc taacctttgg actaactatt cacaattcag tttaggctga tttctctttg 1620caccttatat tcctatgtgc ctcagtcact agaagaataa gccttctaga tcatccaaca 1680tggatagatc atcaacagtg gatactatcc cagtaccctg agtccactgc taatctgatc 1740aagcccctct ccctctcctt cccaaattct tcaatgtgcc tttgcaactc cagatctgtc 1800gccatcaaat gtctttgtag cctcgtcctc ttctttgaat gttcccttca ccacttggca 1860ataaatgaac ctggctgtcc ctgagcagcc catctcctga gcagtcctct gaggtagaag 1920ctgctttact tttcccctga catttcaggc tcctaagggc cagggggtat agtaggtttc 1980ctacttgcca tttccaaact gttccttgcc tctcctcctt cagacacgca gcttctttga 2040agcctctctg atgacctcct aaccttccag ctcacttact caagaactcc cactgtctca 2100gttcttcaac tgtatctgac accatttctc tcttctctta tcttcacttc ccaacctcac 2160ttaagttcca gggcccagca tttttattcc acatttgcaa atactgtcca caacaaactt 2220atccttctct cttctatcgt attttatttc taaacagagt ctcgctctac aatagtcact 2280tttaaaatta tttattagcc aggcgaggtg gttcatgcct gtaatcccag cattttggga 2340ggccaaagcg ggcagatcac ctgaggtcag gagtttgaga ccagcctggc caacattgcg 2400aaaccccatc tctaccaaaa atacaacaat tagccaggtg tggtggcacg tgcctataat 2460cccagctact ctggaggctg aggcaggaga attgcttgaa cctgggaggc ggaggttaca 2520ctgagctgag atcacgccac tgcactccag cctgggcaac agagcgggac tctgtctcaa 2580aaaaaaagac taccattcag accattcagc aaactccagt gcccaggcgg cctggtcctc 2640catctcaccc ttcgctccat cttgccctta gcctgagcaa ccctgcaccc tgctccttct 2700ccccctggtc atttgcggtc acagtgcacc aagagaagag gacgccacct tcctggtctc 2760atccctactc aggtgtgcac cctttgctag ggcccgtgcc tccacccagg tcagagcttg 2820gagattcacc ctcttgcttt cacgtttaaa taagatgcaa gcaagggccg ggcgcagtgg 2880ctcactcctg taatcccagc acattgggag gccgaggcgg gtggatcacg aggtcaggag 2940atcaagacca 
tcctggctaa cacggtgaaa ctccgtctct actaaaaata caaaaaatta 3000gctgggcgtg gttgtgggcg cctgtatatt cccagctact caggaggctg aggcaagaga 3060atggcgtgaa cccaggaggc ggagcttgca gtgagccaag atcacgccac tgcactccag 3120cttgggcaac agagccagac tccgtcccaa aaagaaaaaa aaaagatgca ggcaaaggct 3180gctgtagaat aggcgctgc 31993473054DNAHomo sapiens 347tcttcccaag agagccaaga tttcttcttt cctcttcttt cttttttttt tctttctaat 60ttcaaaggag tataattaaa ttgccaggta aaagctcaaa ggtctttttt atagtgttct 120ggaaggttct ctgcctgtgt ttgtatttcc tttagcctcc acgttcctct atccagttcc 180cgcacccttc cccccaggcc ccattcttca aggcttcaga gcagcgctcc tccggttaaa 240aggaagtctc agcacagaat cttcaaacct cctcggaggc caccaaagat ccctaacgcc 300gccatggaga cgaagcacct ggggcggggc ggagcggggc gcgcgggccc acacctgtgg 360agagggccgc gccccaactg cagcgccggg gctgggggag gggagcctac tcactccccc 420aactcccggg cggtgactca tcaacgagca ccagcggcca gaggtgagca gtcccgggaa 480ggggccgaga ggcggggccg ccaggtcggg caggtgtgcg ctccgccccg ccgcgcgcac 540agagcgctag tccttcggcg agcgagcacc ttcgacgcgg tccggggacc ccctcgtcgc 600tgtcctcccg acgcggaccc gcgtgcccca ggcctcgcgc tgcccggccg gctcctcgtg 660tcccactccc ggcgcacgcc ctcccgcgag tcccgggccc ctcccgcgcc cctcttctcg 720gcgcgcgcgc agcatggcgc ccccgcaggt cctcgcgttc gggcttctgc ttgccgcggc 780gacggcgact tttgccgcag ctcaggaagg tgaggcgcgg attggagcag agttgtggag 840ctgggctggg ctggggggca gcggcccccg gccctcggcc cccgaaacgg gcataatagg 900gaggggacca agaggccgcg ctttccagcg tggagaccgg acggtgcggc cgtgctccgg 960ctcaggccct ccgcgcggta ggaaacggcg agggccgtcc cggggagcag cctcacttcg 1020cagctttgct cgccttggta gggaaatggc cttgggcgga ggcgggggac aggcagggaa 1080cggagtggcc acgtccaggt ttcctgcggc caccgaaccg gtgcctcgcg ccctggcgca 1140cccacgtcct cggttcgggg tggacttggg gttccaaaac agccccagcc ggtggcggag 1200tctttacgac agggaccagc gggctcgccc ttgtccttgc agcgggcccc ggatgtgggc 1260ctcaggcggg gacaggcgcc cgcagggagg cctccagggc cgctatgcac ctgcgcgcgg 1320caggcggccc ggaccacaca gggcgtgtgg gtgttttccc ttttctaagg atcatatgag 1380taatgccagg cttattgtag ggaacgcaga aataataacc gtaaagagta aaaacatata 1440atcccagcat 
tttgagaatc ccataattag taattaggtg tatctttctt tctttttatt 1500tatttattta attttttgag actgagtctt gctctgtcgc ccaagctgga gtgcaatggc 1560gcgatctcgg ctcactgcaa ctttcgcctc ccgggttcaa gtgattctcc tgcctcagcc 1620tcctgagtag agtagctggg attacaggcg cgcgccacca ccccccgcta atttttgtat 1680tgttagtaga gacggggttt ctccatgttg gtcaggctgg tctctaactc ctgagctcgt 1740gatccgcccg cctcggcctc ccaaagtgct gtgattacag gcgtgagcca ccgtgcccgg 1800cctattttat ttttttattt gaaacagcct tgttctgtca cccaggctgg agtgcaatgg 1860caagatcttg actcattgta gactacgcct cccggcctca gaccatcctt ctgcgtcagc 1920ctttatgcct ggctaatttt tgtatttatt atttattatt attattatta tttttgagac 1980agagtttcgc tcttgttgcc caggctggag tacaacggcg cgatctcatc tcactgcaat 2040tcaggcgatt ctcctgcctc agcctcccga gtagctggga ctacaggcat gcaccaccac 2100ggtcagctaa tttgtatttt ttgtagagag gggtttcgcc atgttggcca ggctggtctc 2160gaactcctga cctcaggtga tccaccgacc ttggcctccc aaagtgctgg gattacagac 2220gtcagccaca gtgccagccg aatatttgta tttgtagaga cgacatctca ctatgttgcc 2280caggctggtc tcgaactcct gggctcaagt gatcactccg tctgggcctc ccagagtgct 2340gggattacag gcgtgcatca ccacacccgg ccttaaaaac aagatttaaa atggtgactg 2400gtatgttgca ccgttattca aatgttagac atgtagtttg atttcagttt ctcttaactg 2460tggaataaac aacttggctg ccgtctctct ctctctcttt tttttggaaa cagtgtctcc 2520gtctgtcgtt cagcctggag tgcagtggca catttacatg tcactgcgtc ctccatttcc 2580caggctcaag cgatgctctt acttggacct cccaaagtgc tgggattaca ggcatgagcc
2640accggtccgg catctcttgg tttatttgta agatggtgcc tagaagtgga gtggcgtttg 2700ccaaaggtct ctggaagggc ttttacactt tcaccaatgg agtggcctaa attcagtaat 2760tatactctca aagtaatgca gttttagtca actcatgttt ttctggcttc aatctgggac 2820tacgtactta atgttaaatt gctttaaagt ggtcatagct gctacaggtt tgtgctcaga 2880aagtctgcac ctgactggtc tgatttaaat tttacgcccc ttaggtatga acagtgtgtt 2940ttaaacaagt acaggatggg gctgcagaag atttaaacgc ttgagaacaa gtgctgtatt 3000ttcccctttt gtgaccccag tattgagttt agtgttgggc agattaaagg tggt 30543483179DNAHomo sapiens 348tgttgggcag attaaaggtg gttcatatcg actataactt gaacagggaa aaattgaaat 60caacttaggg tacttgggat acgaaggatc aatataaaaa ctctggtttg tcatgctagc 120tttttctttt ttttcctctt cagttgaact gaggagatag tttttgtttt taatgattgt 180gctcttttaa ctagacaaaa ggaattagat agtcttgcct attcgaagtt aaatgaactt 240ttgaggttgt taaggacaaa actattaaac tgacatcaat aatacagaat gggctgctta 300gtatcacttt ccttatcagg tactaggatt taatttagtt aggaaactca cttaaaggga 360ggactataac tgcagttgaa agtgtaattt ttccaagata taaaattgtt taaagattga 420atatattcct gttaagcccc aaaggaaaca tccctcattt aagaaaatgg ggtgggagag 480caagagaagg tgaggattca cagatcctag aattggaata gttgattttt ttttgtaaaa 540gaggcggtga cagccgggca tggtggctca cgtctgtaat cccagcactt taggaggccg 600aggtgggtgg gttacctgag gtcaggagtc ctagaccagc ctgaccaaca tggtgaaaac 660ccgtctctac taaaaataga aaaaaaaagc cgggcgcggt ggctgacacc tgtaatccca 720gcactttggt aggccgaggc ggacggatca tgaggtcagg agtttgagat cagcctggcc 780attatgctga aaccccgtct ctactaaaaa tacaaaaatt agccaggtgt ggtggcatac 840ccctgtagtc ccagctactt gggaggctga ggcaggagaa tcgcttgatc ctgggagatg 900gatgttgcag tgagctgcga ttgtaccact gcaatccagc ctgcacgaca gagtgagact 960ctgtctcaag aagaaaaaca aaaaaaggca gtgactaaca gggatgttac ttagcaggac 1020aggactgtgg aaggagctaa gactgggagt ttcacaaaga caaagctaga aatgatactt 1080ggagagctgt gttcttgttt taaaaaaatt gtaacaggag gccaggcaca gtggctcatg 1140cctgtaatcc cagcactttg ggaggctgag gcaggaggat tgcttgaggc caggagttca 1200aaaccagcct gggcaacatg gcgaaacccc gtatctacaa aaagttaaaa attagccagg 1260catggtggtg catggctgta gtcccagcta 
cttgggaggc tgagacagga ggatcacttg 1320agccctgtag gtccatgctg cagtaaacca agattgtgcc actgcattcc agcctgggcc 1380acagagtgag accctatctt taaaaaaaaa aaaaaaaaaa aaaaaaaaaa acaggaatgc 1440atgcagatta aactatgtgt ctgtatacag tatgcaaact ttagcaagtg ccaggcactt 1500aggcagtagt ctatagctga aaaataaaac attcagaacc actttttaag gttttgtgtc 1560cttgtaactt taggcattat tattacaata taacttagct gggacatgag agttaataga 1620tccacatttt aaagtagatt ttttttttaa ttttctagaa tgtgtctgtg aaaactacaa 1680gctggccgta aactgctttg tgaataataa tcgtcaatgc cagtgtactt cagttggtgc 1740acaaaatact gtcatttgct caaagcgtga gtaaaatatc ctaattacct gtaagcttta 1800ttttgactta atacttcttt aattgatgtg ccttgagttg gaaagagttt tattggctta 1860aatctgaatc atgttacaaa gtaagtgtgg gaacacataa atttcaaata atctttgacc 1920ctggaacttt agagttaatt ttttttttcc cgtaatcatg aaatcagtta tttttcagtt 1980tggcattaag gtttcttttt cagtggctgc caaatgtttg gtgatgaagg cagaaatgaa 2040tggctcaaaa cttgggagaa gagcaaaacc tgaaggggcc ctccagaaca atgatgggct 2100ttatgatcct gactgcgatg agagcgggct ctttaaggcc aagcagtgca acggcacctc 2160catgtgctgg tgtgtgaaca ctgctggggt cagaagaaca gacaaggaca ctgaaataac 2220ctgctctgag cgagtgagaa cctagtgagt ggggctgcct atactacttg ttttcatgct 2280gttcagattc atttaattaa atttattttt gattatgtaa tatgatttca tggtttagaa 2340ttcagaagat atgagtgtcc agtgaaaagc ttccttctca ttccagtccc cctcgctacc 2400cattggacct ccacagaatt gatgttattg attattctat aaccttccag agatagttga 2460tgaatttgtt atatatctgt tttattattt ttacataaat gatagcatac taggtataat 2520ttttctttta tatctttact taacattatt cagtatttca ttgttgcatt agtagtaaat 2580gtatgtaatt taacctatgt atttgcttat tgattgtgtt ttaaaagtga gatatgcttg 2640ttttagggat tgtttaatga aaaggcacag aaacccactc aagctagctt aagcaaaaaa 2700agacttcatt ggaagggact agaaactgga aaggatgtca ggaccaaagt gggcactttg 2760tttttctgtt ctggtcttct ggagcctcgt tgtcagtttt ctctttgtgc cctttctttt 2820gttttttctt ttttcttttc ttttcttttt ttttcgagat ggaatttcca ctcttgttgc 2880ccaggttgga gtgcagtggc acaatctcag ctcactgcaa cctctgcctc ccgggttcaa 2940gcaactctcc tgccttagcc tcctgagtag ctgggactac agctatacca cacctgacta 
3000atttttgtat tttagtagag atggggtttc accatgttgg ccaggctggt ctccaactcc 3060tgacctcagg caatccaccc acctccacct cccaaagtgt tggattacag ttgtgagcca 3120ccatgcccgg gcctttcatg ccttttcatc tttttagttg aacagggcat gacactgcc 31793493187DNAHomo sapiens 349tctttgtgcc ctttcttttg ttttttcttt tttcttttct tttctttttt tttcgagatg 60gaatttccac tcttgttgcc caggttggag tgcagtggca caatctcagc tcactgcaac 120ctctgcctcc cgggttcaag caactctcct gccttagcct cctgagtagc tgggactaca 180gctataccac acctgactaa tttttgtatt ttagtagaga tggggtttca ccatgttggc 240caggctggtc tccaactcct gacctcaggc aatccaccca cctccacctc ccaaagtgtt 300ggattacagt tgtgagccac catgcccggg cctttcatgc cttttcatct ttttagttga 360acagggcatg acactgccag ctaaactttg acttaatgtg actttatgta ttgtgtccag 420agaacagagg gtcaatatta gaaaaggtgt tccctcctgg gtgtgtcctt tatgaaggat 480gtgtaaggga agaaattata ggaatagcta ctgcataaat tttttttctc ttagtcctta 540taattcgaga attttaggat tagcttatta ggaaaatagt atggaagact gagttatagt 600caactgacat tgtcttttta ctttatagct ggatcatcat tgaactaaaa cacaaagcaa 660gagaaaaacc ttatgatagt aaaagtttgc ggacgtaagt gcaattaaat gcatcatatt 720cttgcacagt tggtggctca aatcttccat cctacaccat tagaaaaagc aagtctaaat 780gcttttttat atttctgaaa aataaagtta cttgaaatag agttgcaaga atagcacaga 840gattctggga atacacttca ctcagattca ccaattaaca ttttggcaca tttgcttttt 900atatgtgtat gtgtggatga atatgtgtgt gtgctttaca tcagtgtatc tatgcatgta 960taaatatttt tcccagaagc acatgagagc aagttgtaga catcaggccc ctttacccct 1020aagtacttca gtgtatgttt tcctaagaac aaaaggcatt cttttatata aaccactata 1080caacgatcaa atttaggaaa aatttttttt ttttttttta gacggagtct cgctctgtca 1140cccaggctgg cgtgcagtgg cgtgatctca gctcactgca acctgcgcct gccggtttca 1200agcgattctc ctgcctcagc cttccaagta gctgggacta caggtgcctg ccactacgcc 1260ctgctaattt ttgtagtttt agtagaaaca gggtttcacc atattggcca ggctggtctc 1320gaactcctga acttgtgatc ctcccgcctc tgcctcccaa agtgctgcaa ttacaggtgt 1380gagcttccgc gcccggccag gaaatttaac gttatatcac gttgtgccca ttttcccaat 1440attgtccttt gtagtaattt ttcccctctg attcaggacc cagtccaaga tccatgtatc 1500acatttagtt gtcatgactc 
tttagtctct taatatcgaa cagtttcttg gcctttcttt 1560gtcttccatg aacttgctat ttttaaagag catgggcaag tcattatata taatgtccct 1620caaattttga tttgtctgat atttcctcct tttttttttt tttttttgag ttggagtttt 1680cccttttgtt gcccaggctg gagtgcaatg gtgcaatcac ggctcaccgc aacctctgct 1740tcccggattc aagcgattct cctgcctcag cctcctgagt agctgggatt acaggcgtgc 1800gccaccatgc ctggctaatt ttttttgtat ttttagtaga cacggggttt ctccacgttg 1860gtcaggctgg tctcgaactc ccaacctcag gtgatctgcc cacctcagca tcccaaagtg 1920ctgggattac aggcatgagc cacctcaccc gagccttgat gttccctctt aactaaaagc 1980aggttatgca tttttgacag gaaaactact taagcgatct tgtgtccttt ataatacttc 2040acattaggag ttgcatgatg tcagcttgtc cctttactag taaagtaaac tttggttaaa 2100gtggtatcca ccaggttttt ccactgtgaa gttaccattc tccctttgta atccataaat 2160aatctatggg cagatacttg gatactaagt aaatgttctt tttctaatta aactggtacc 2220cagcagtttg aatatcaatg gatgattcca gcctgaatca attattatta tgatagttgc 2280aaaatggcag aaaaatttta actttaatga cagttttaga ccctgagctg tctgcttaaa 2340gagtagtgct tcttactgtt gtgtggtaca aacatttttt tttaatacag attttaaatt 2400ctttacagtg cacttcagaa ggagatcaca acgcgttatc aactggatcc aaaatttatc 2460acgagtattt tggtatgatt ttttaataag tgagctttag cagacagttg gtgagacagt 2520atgttttgag tataaggaca gccagtgatt taagtggtgg ttaaatgcac ttactggagc 2580aacagtttcg gatctgggta cttaatgtga atttcctgtt actgtttttt tttgtttgtt 2640tgtttcttta agacagacta ttgctctctt ccccaggctg gagtgtcatg gcaagatctc 2700ggctcaatgt aacctctgct tccaaggttc aagcaattct catgcctcag cctcccgagg 2760agctgggact acaggcacat gtcaccatgc ccagctaatt tttgtatttt tagtgtcggc 2820ggggttttgc tatgttggcc aggctggtct cgaactcctg gcctcaagtg atctgtctgc 2880ctcagcctct caaagtgttg ggattacagg tgtgagccac cacgcccggc ccattgtttt 2940tggttatcgt tgttttcctt ccatagcctt tgaaaagcct agttttactc ctaaagaaaa 3000cgtagtatct cttagtatcc ctaaaacatt tgagttttct tatcctggag aacctgtccc 3060tgtggatgag ctccagtaac atcttaaagt aaatatgcac caaaattact tttggtaaat 3120acagttttgg tgcatattta ctttaggatg ttactggagc tcccatcttc tctgctttaa 3180ggaacta 31873503128DNAHomo sapiens 350acctgtccct gtggatgagc 
tccagtaaca tcttaaagta aatatgcacc aaaattactt 60ttggtaaata cagttttggt gcatatttac tttaggatgt tactggagct cccatcttct 120ctgctttaag gaactagtcc ttaactagtt agcccttact taactcttta aactctggtt 180taaaaaataa aaagaagctt gaatagtgtg acggaactct ttaaaggtag tatgaattta 240ttcaagagtc tttagaaaga atgtactttt tttactcttt aaaaacaaaa tgatggccgg 300gcacggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcagg tggatcacaa 360gatcaggaga tcgagaccat cgtggctaac acagtgaaac cctgtctcta ctaaaaacat 420acaaaatagc cgaatgtggt ggtgggcacc tgtagtccca gctactcggg aggcttgagg 480caggagtatg gcgtgaacct gagaggcgga gcttgcagtc agctgagatt gtgccactgc 540actccagcct gggcgacaca gcaagactcc gtctcaaaaa caaaacaaaa aaacaacatg 600gaaaatgcat gctgcgtttt accttgcatt tctttttctt ttcttttttt tttttttttt 660ttgagacgga gtttcgctct tgttgcccag gctggagtgc aatggcgcca tctcggctca 720ccacgacttt tgcctcccag gttcaagcga ttctcctgcc tcagcttccc tggtagctgg 780gattacaggc aatgtgtcac cacgcctggc taattttgta tttttagtag agatggggtt 840tctccatgtt ggtcaggctg gtcttgaact ccggacctca ggtgatccgc ccacctcagc 900ctcccaaagt gctgggatta caggcatgag ccactgcacc cggccttacc tttcatttct 960ttagtaattt agttttaaag tagttctaat ccaaataaaa tactttcata tcttatttaa 1020aaatcttttc aatataagaa aatcctctta ggaaaaattg tacattgtaa ttatgtttgg 1080ttgcatggct gtcttatttc cctttgatag atttagagac ctcccaaaga tttcttgatt 1140agtgataaac ttagttatcc actaatggaa aggaacagtg atgcatgtag attatagaaa 1200atcaaacact gaatattctg attctcaatt aatgttattt tcaaatgatt ttgattatat 1260tagtattaat ttgtattatt caattttttt ccccagtatg agaataatgt tatcactatt 1320gatctggttc aaaattcttc tcaaaaaact cagaatgatg tggacatagc tgatgtggct 1380tattattttg aaaaagatgt gagtatcatc ttctttattc ctgtgttcag gaatgtagtc 1440tatcatgcct caatgaatta aatatatttc atcacctttt tatccactta cagatcaacc 1500aaatggttcg ctgctgccgt taattttgtc ctccctgtca ctcacatgca tcttgcttgt 1560ttgtatattt atgcctctta tcaaattgtt ctgcctaaaa tatctcccct ctttcttata 1620attcttattt attatctact tggtggttac ttagtttgtg catatatgct cccctatgat 1680atttataatt tacacaaata aaagtctgtt aaaaaagact gtaactgata tgattaaaat 
1740attttgttga aactttaata tattatagtg aggtattttc tgctgaaata tgaggtttgc 1800ttcaaaataa tctgggcggg ggtgaaagga tgaaaggaag aaaagatgaa gtaagagagg 1860ctatgtgttg ttggccttgc atctgggtga taggtacatg ggcatcattg cactactctt 1920tctactttcg tgtatgttga aaggttcctg taataaacag ttttttaaag ttccaataaa 1980ttagattgtt atcactaaaa ccataaagat tcttggcagc ggttcttttg gcatacaatt 2040tgtatgtaat tatatgtggc catggttggt ttccttaaat atttttaatt ccttttctcc 2100ttttcaatac aggttaaagg tgaatccttg tttcattcta agaaaatgga cctgacagta 2160aatggggaac aactggatct ggatcctggt caaactttaa tttattatgt tgatgaaaaa 2220gcacctgaat tctcaatgca gggtctaaaa gctggtgtta ttgctgttat tgtggttgtg 2280gtgatagcag ttgttgctgg aattgttgtg ctggtgagta cagaacaagt aaaatttcat 2340ttaagggtat attttttcaa gaaaaagtaa tagtggctgg gcgcggtggc tcaccacacc 2400tgttatccct acactttggg aggctgagac aggtggatca cttgagccca ggagtttgag 2460accacactgg gcaacatggt gaaaccttat ctgtagtaaa aatacaaaaa ttagtcagat 2520gtgatggctt gcacctgtgg tcccatctac ttaggaggct gatgtgggag tggtcagttg 2580agtccaggag gtcaaagctg cagtgagcca tgatcacacc actgcactcc agcctgggca 2640acacagcagg accctgtctc aaaaagaaga aaaaaggaaa tatgaaaaag taacatccat 2700attccaaaac attcagggaa aaaaatcttc atttttaaat aattttttta tggtgaatga 2760atctattgta tctctggtct ctttttacaa aagtcatttt atgaagcaag aaaggatgct 2820aatattaaaa agcttgtggc tgtgcacctc acaggccagt taaattgcca tctagcagca 2880agcgtctttc agttgtcact gcaaacaatt caacacctag tgcaaaatac ctgaaccccc 2940aaaccactca ataagatgga acaacagaac acaaagttaa cgttagccat acaaaagagt 3000taaaagtgat atgtgaatca atacttccaa gtaaagatga gcaaattgaa tttaacagtg 3060cttcagcaaa agaatgtatt gcttgaagaa gtgaaaggtt tattttagga atgtaaggat 3120gcttcggt 31283512155DNAHomo sapiens 351atacctgaac ccccaaacca ctcaataaga tggaacaaca gaacacaaag ttaacgttag 60ccatacaaaa gagttaaaag tgatatgtga atcaatactt ccaagtaaag atgagcaaat 120tgaatttaac agtgcttcag caaaagaatg tattgcttga agaagtgaaa ggtttatttt 180aggaatgtaa ggatgcttcg gtatcaagaa atcttactaa cactggccag gtgtgatggc 240tcaggcctgt aatcgcagca ctttggaagg ctgaggcggg tagatcactt gagatcagaa 
300gttcgagacc agcctggcca acatggtgaa accctgtctc tactgaacat acaaaaaaat 360tagctgggcg tggtggcaca tgcctgtaat ctattcggga ggctgaggca ggagaatagc 420ttgaacctgg gaagcagagg ttgtagtgcg ccaagatcat gccactgcac tctaatctgg 480gtgacagagc aagactctgt ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aaggccaggc 540acagtggctc atgcctgtaa tcccagcact ttgggaggct gaggcgggtg gatcacccga 600ggtcgggagt tcgagaccag cctgaccaac gtggagaaac cccatctcta ctaaaaatac 660aaaattatcc gggcatggtg tctcatgcct gtaatcccag ctactcagga ggctgaggca 720ggagaatcac ttgaacccag gaggtggagg ttgcagctga gatcatgcca ttgcactcca 780gcctgagcaa caagagtgaa actccgtctc aaaaaaaaaa aaaaaaaaaa gaaatcttac 840taacacaaca gaattcagaa agaggtttga gggtatttag gaacttagat ttccagttca 900atcaaccatg tttggctatc catctggaac aaaatgaaag ttgaattcct atttcactcc 960accaggctgg ccatattgcc cagctgtgtg agggtggcat gtccagagca cagtagtagg 1020aaaggcgttg ggcagtgtat ccattttcaa agacatttac atatttaaaa atacaaaaaa 1080gtaaactccc aagaaaatta attgagggaa tgtttgtaca accttgtggt aggggaaatt 1140atgtaaggca agaaatctgg aatccatgaa agaaaagata catatatgtg tatgtatatt 1200ttgagagagg gtcttgctgt gtcacccagg ctggagtgca gtagcatgat cataactcac 1260tgcaacctcc aattcctgga cttaagtaat cctcctgacc tatcctccca agtagcaagg 1320actacaggta tgtgccacta tacctggcta attttttaat ttttagtaga gacgaattct 1380tgctatggct gccgaggctg ggcttgaact cctaggctca agcagttctt ttggcttagc 1440ctcccaaact gctgggatta caggcatgag ccattgcacc tagtcctata tatatatatt 1500ttggcttcat taaaattaag cattttatat ggcaaagaaa ctgtaaagta aaaaataacg 1560atgggcatga aaaaaatatg gcgcataaag caaaaatgga tattatacat aatatacaaa 1620gagttcttac aaattgatga ggaaacctaa agaaagaatg acaacaggta gggatagaca 1680gttaatagaa atttcagatg gcaaatgaac acaagtggtt aatgctggaa gtctaattgt 1740tctgtagaaa taaatgaaaa cacaagtgca ataaggaagc acattgttat tgtatcatag 1800cattgcttgt aaaggtgaat ctggccaggc gtggtggctt acgcctataa tcccagcact 1860ttgggaggct gaggtgggca gatcacctga ggctgggagt ccgagaccag cctgaccaac 1920acggagaaac cccgtctcta ctaaaaatac aaaatgagcc aggcatggtg gtgcatgcct 1980gtcattctgg ctactcagga ggctgaggca ggagagtcac 
ttgaacccag gaggcagagg 2040ttgtagtgag ccgagatcat gtcattgcac tccagcctgg gcaacgagag caaaactctg 2100tctcaaaaaa tgaataaaaa caacaacaaa agtgaatctg gaaaatagcc tgagt 21553523118DNAHomo sapiens 352catgcctgtc attctggcta ctcaggaggc tgaggcagga gagtcacttg aacccaggag 60gcagaggttg tagtgagccg agatcatgtc attgcactcc agcctgggca acgagagcaa 120aactctgtct caaaaaatga ataaaaacaa caacaaaagt gaatctggaa aatagcctga 180gtgtgtatca gtaagagagt aaattatgtt tattgtatct acgataggga ataatgtgaa 240tggtgaatga gttcgatctt tatctttgga tctggaatgg ttgctatgat gttgatacaa 300gctgtgcaca ggtggtgatg atactgcatg gtcccatttt tagaccccaa aacttagatg 360catgtgttta tatatgatat ttgtattagt gtggaaaagg aggatgtgga agaatgcaca 420ccaaactgtt aaatttcttt cttttttttt tttggaatgg agtctcgctg gccggacgtg 480gtggctcact gctgtaatcc cagcactttg ggaggccaag gcagctgggt cacgaggtca 540ggagatcgag gccatcctgg ctaacacggt gaaaccctgt ctctactaaa aatacaaaaa 600attagccagg tgtggtggcg ggcacctgta atcccagcta cttgggaggc tgaggcagga 660gaatggcgtg aacctgggag gcggagcttg cagtgagccg agattgcacc actgcactcc 720agcctgggtg acagagcgag actccatctg aaaaaaaaaa aaaaaagaaa aggagtctct 780ctgtgttgcc ctggctggag tgcagtgtca tgatctcggc tcactgcagc ctccacctgc 840cgggttcaat tgattctcct gcctcaccct cccgagtagc cgggactaca tgcagaagcc 900accatgtcca gctaattttt gtattttttg gtagagacag ggtttcacca tattggccag 960gctggtctcg aactcatcac ctcgtgatcc gcctgcctcg gcctctcaaa gtgctaggat 1020tacaggcatg agccactgtg cccggcttct tctttttttt tttttttttt tttttttttt 1080tttttttttt tttttttgag atggagtctt gctctgttgc ccaggctgga gtgcagtggc 1140acgatctcgg ctcactgcaa cctccatctc ccaggttcaa gccattttct tgcctcagct 1200tcccaagtag ctgggactac aggcgtgcac caccatacct ggctaatttt tttgtatttc 1260tagtagagat agggtttcac catgttggcc aggctgatct cgaaatcctg atgtcaggtg 1320atctgctcac ttcggcctcc caaagtgctg tgattatagg cgtgaaccac catgcctggc 1380ctaaactgtt aaatttcttt aaagattatt cattgtttcc tttttttctt tctctttctt 1440ttctgttgtc ccattggatc cagcattgtt tttgattttg atttttgttt gtttgtttca 1500cttgtcgtgg tagacttttt tttgtttagt agtgaaagtt tttattttat tttatttatt 
1560tatggagaca gagtctcctt ctgttgccca ggctggagtg caatggtgca tgatcttggc 1620tcactgcaac ctctgccccc caggttcaag ctattctcct gcctcagcct cccgagtaga 1680tgggattaca ggcgcctgcc accacgcctg gctaattttt gtatctttag tagagatgag 1740gtttcacaat attggccagg ctggtcttga actcctgacc taaagtgatc cacccacctc 1800agcctctgaa agtggtaaga ttacaggcat gagccatcat gcctgaccta ttttatttta 1860ttttaatttt tttttagaga tggagtccca ctctgtcgcc caggctggag tgcaatggcg 1920ccatctcggc tcactgcaac ctctgcctct cgggttcaag tgattttcct gcttcagtct 1980cccaagtagc tgggattaca ggcgaccacc accgcgcctg gctaattttt ttgttttttt 2040agtagagtcg gggggtttca tcatgttggc caggctggtc ttgacctcct gacctcaagt 2100gatccgccca cctcggcccc acaaagtacc ggtgagccac cacgcccagc ccaccttatt 2160tatttttaag agacagggtc ttactctgta gcccaggctg gagagcagtg atgccatctc 2220cactcactgc aacctctgcc tcctgggttc aagcaattct ggtgccttag cctcctgagt 2280agctgggact acaggtgcgt gccatgacac ctggctaatt tttgtatttg tagtagagat 2340ggggtttcac cgtgttggct gggctggtct gaaactcctg acctcagatg atcttcccgc 2400ttcggcctcc caaagtgccg ggattacagg catgagccac tccactggtg tgaaattttt 2460aatttaagaa gcaataaatg tttatggata gatgttaaaa ttagtttttt ttcagatcaa 2520aattatgtcc attaaaagca tatatgtctg tttagataat ctttttttga atagcagtcc 2580taaaacaata gttgtctttc ttccactcag gttatttcca gaaagaagag aatggcaaag 2640tatgagaagg ctgaggtaaa tggattactt acctaaatag aaaggccctg ttgaatctct 2700tactcctaat cactctacct tcctacacac tgatgcattt cagttatact ggagtccctt
2760tatactgttg tctttagggt cttagggaca gtcttagaat gtactcttac ctaaatattc 2820ttgcgtgagt tccatggcag atcaccatct gttttctgcc tcatagaaga gtggaatggg 2880aagcctatgg tttttattct acaaagagtc aacatctaac agaatcttct gaaggcatac 2940tccagtggat tcaccttgga gaaactcatt gtgactgatg atctgattta ttatctctat 3000gccagtgaaa taatcattta atatgaactt aatttgtcat aatctattgt gtactaacta 3060gtctatacta gtgtgacatc aaagtgtcag attgttagtg tgtttcagtc ccttggaa 31183532346DNAHomo sapiens 353tagtgtgttt cagtcccttg gaattgaata tgaacactta tccttgaacc ctatcaataa 60catttttcac atatctcaat ttttgtgtgt ctttgtagtt gtatgtgggc cacttactaa 120tattttagca agtaataaaa atagaaacgt aaaggaatat tggaaaaagt ctaatggaac 180cagaaagttc tagcattttt ttcccattct gtagtaggtc atctggttta tttggtttgg 240tgaccgcaag tctagaagac taaccctgaa ttgaatggta acagacaggc agaatgacaa 300tgtagtgttg cagtgcagag cagtacagac ctgggtttgg ctgggcaaaa ttatataact 360tctttaagcc tccatgtttc ctcatctgta aaatgaggat aatagatagt atggacctgt 420tgcaaggatt aaacataatc agtgtaaagt gttggtccca tgcttgccac ataagaaaat 480atttgtcaac agagtggtag ttgtcattat cattgtctca gtttgcctgt aactagttgt 540gtgatctgag acaaacacta attttgaact tgagtttccc cacatgtaaa atgaaagatt 600gataatagaa agtaaatcaa ttttttctag cattaaaaat agtatgcatt taataaaaat 660cttattctta atgatctagc ttacctccaa cttgccctag tcactttggc gatcttgtct 720ctaaatagaa ccttgaaaac acttaaatgt gtgtttcctt gcaatataac tttttctttt 780tttatttaaa taagtcttat aaatgtggga aaaaattatc ttgtgttcct ttaatttcat 840ttttatttaa tactattttc agaatgaaca aaagattgaa aaattattta gaattttttt 900ctgtgctttt tcctgtttca gataaaggag atgggtgaga tgcataggga actcaatgca 960taactatata atttgaagat tatagaagaa gggaaatagc aaatggacac aaattacaaa 1020tgtgtgtgcg tgggacgaag acatctttga aggtcatgag tttgttagtt taacatcata 1080tatttgtaat agtgaaacct gtactcaaaa tataagcagc ttgaaactgg ctttaccaat 1140cttgaaattt gaccacaagt gtcttatata tgcagatcta atgtaaaatc cagaacttgg 1200actccatcgt taaaattatt tatgtgtaac attcaaatgt gtgcattaaa tatgcttcca 1260cagtaaaatc tgaaaaactg atttgtgatt gaaagctgcc tttctattta cttgagtctt 1320gtacatacat acttttttat 
gagctatgaa ataaaacatt ttaaactgaa tttcttaact 1380ttgacatttc aaatttcttc ttctttttct tttctttttt tttttttttt gagatggagt 1440cccactctgt tgccaggctg gagtgcagtg gcacaatctc ggctcactgc aacttctgcc 1500tcctaggttc aagcgattct tctgcctcag cctcccgagt agctgactac aggcgcccac 1560caccattcct ggctaatttt tgtattttta gtagagacaa agtttcacca tattggccac 1620gctagtctcg aactcctgac ctcacgatcc acccacctct acctcccata gtgctggggt 1680tacaggcgtg agccaccgcg cccggcctct ttttttcttt ttgttttgtt ttttcttttt 1740ttttttgaga caggatcttg ctctgtggcc taggctggag tgtagtggtg cgatctcagc 1800tcactgcagg attcaagcga ttctcctgcc tcagcctacc aagtagctag gattacaggc 1860tcccactacc atgcccggct aatttttgta tttttagtag agaaaaggtt tctttttctt 1920ttttcttttc tttttttctt tttttttttt tgggggggtg agacagagcc taactctgtt 1980gcccaggctg gagtgcagtg gcacaatctc agctcactgc aacttctgcc tcctgggttc 2040aagcaattct cctgcctcag cttcccaagt agctgggact acagatgtgc accaccatgc 2100ccggattatt tttgtatttg tagtagagac agagtttcgc catgttggcc aggctgatct 2160cgaactcctg acctcaagtg atccacccac cttggtctcc caaagtgctg ggattacatg 2220tgtgagccac catgcctggt cctatttact ctttgttaag tggaagtgga tcatcataaa 2280ggtcttgatc ctcatagttt tcactttgag taggctgagg aagaggaagg gttggtcttg 2340ctgtct 23463543104DNAHomo sapiens 354cacattacga gctcagtgcc tgccggaaat ctcccacctg gtggcaacct acccttgcat 60acaccccacc caggggcttc aagccttgca gctgagtaaa cacagaaagg agctctacta 120aggatgcgcg tctgcgggtt tccgcgcgac ctaggcgcag gcatgcgcag tagctaaagt 180caccagcgtg cgcgggaagc tgggccgcgt ctgcttatga ttggttgccg cggcagactc 240ccacccaccg aaacgcagcc ctggaagctg attgggtgtg gtcgccgtgg ccggacgccg 300ctcgggggac gtgggagggg aggcgggaaa cagcttagtg ggtgtggggt cgcgcatttt 360cttcaaccag gaggtgagga ggtttcgaca tggcggtgca gccgaaggag acgctgcagt 420tggagagcgc ggccgaggtc ggcttcgtgc gcttctttca gggcatgccg gagaagccga 480ccaccacagt gcgccttttc gaccggggcg acttctatac ggcgcacggc gaggacgcgc 540tgctggccgc ccgggaggtg ttcaagaccc agggggtgat caagtacatg gggccggcag 600gtgagggccg ggacggcgcg tgctggggag ggacccgggg ccttgtggcg cggctccttt 660cccgcctcag agagtgggcg gtgagcagcc 
tctccagtgc ggaggcacgg gggcggaacg 720ttggtgcttg tgcggattcc gccgtcccca ggttctgctt ggctccggag ggacgccccc 780ctcagccctg aaacccgtgc ctctccagcc gccccggatc tgaacttgtg atcacggagt 840gtttacgtcg tgccaggcat tttaatgcat tgttctagtt cattttccag cagtcgcatt 900cctcgccttg gccctacatg tagcgctcat tacaaacacg gccagaatct cttattaaca 960aacagcagcc aggagtgaga tttaaaatag actgggggtt taggagaccc ttttatgaca 1020cgtaattctg ctcccacgac gctcccattt ataccgccgg tccagctaag ggtctggtaa 1080tggagcgccg ttgaagagca gtatgatgaa gtggtcagga ccaacggact ctggagctgg 1140gctgcttggg atcaagtcgc tgcccctctg cttattaacg tgtgaccttg ggccagtcat 1200ggacgctatc tgcttcagct cagcattcag tgctctccgt cacccgaccc catctatcca 1260ggattatctc tccctggaaa gctacaaacg tctcacccta tgtgggccaa atgttctgga 1320taggcctagt taacctcttc tctccctgtt ttctttgcgc tttcttgcag ctatgtagtt 1380atgctaatga aaagagcatc ctagggggag cagagttgtg gattctagtc ctgactagag 1440gactagtgca aatgcgatac tcctgatgaa aaatgtttca ttcgttagat ataaatgtgt 1500taggcagggt tatggacact agatgaaaaa agaaatacct ctactttcat agagatcact 1560attggacagc aaggcagaaa taattacaat tcaagttgga ggcttatgga ggtgagcttg 1620taagaggtta caagaggcgc caaggcagga tcgccaaaga cggaagactt tggaagagtc 1680tcatacaacg gaagaggcgt tatatgagac accaaagtcc acgttgagtc ttggtggact 1740agaagtttgc tagggagagg gcttgaaacg aggtagattg gcgttgctgg tgtagaaaag 1800gaaggagact ggcccaggtg ggtggggtta gatgaccaaa ggcttttagt gtggtgttga 1860gctgttgaaa ttttatgctg tagccaatga aaagtctgaa atgttttttt tttttttttt 1920tctgagacgg agtcttactg tgtcgcccag gttggagttc agtggtgtaa tcctggctca 1980ctgcaacctc cacctcctgg gttcaagcga ttctcctgcc tcagccaccg gagtagctgg 2040gattacaggc acgtgccacc acgcctagct aatttttgta tttttagtag agatggggtt 2100tcaccatgtt ggccaggctg gtctcaaact cctgacctca agtgatccac ccaccttggc 2160ctcccgatgt gctgggatta caggtattag ccactgcacc tgacctacat agattttaca 2220taagacttta aaacagggcg ggcgcagtga ctcacgcttg taatcccagc actttgggag 2280gctgaggtgg gcggatcaca aggtcaggag atcaagacca tcctggctaa catggtgaaa 2340ccctgtctac actaaaaata caaaaatccc agcactttgg gaggctgagg tgggcggatc 
2400acgaggtcag gagatccaga ccatcctggt taacactgtg aaaccctgtc tctactaaaa 2460atacaaaaaa ttagctgggt gcggtggcag gtgtctgtag tcccagctac ttgggaggct 2520gaggcaggag aatggtgtga acccggaagg cagagcttgc agtgagccga gattgtgcca 2580ctgcactcca gcctgggcaa cagagcgaga ctccatctca aaaaaaaaaa aaaaaaaaaa 2640aaagacttta aaaaaaatta taagaaagga cagaccaagt gcagtggttc gttccagcac 2700ttagggatgc caaggtggga ggattgcttg atgctaggag ttgaagacta gcctgtgtaa 2760catagcgaga cccatctcta caaaaaaatt aaaaagttac ctttagaact tacgattttt 2820atgtgtagac tccatataag cagagggtct atgcttattc actatttatt accttccata 2880gtccctgcac atataatagg tgcttcataa acaatttaat gaatgaataa attactgaga 2940aaacactgga agtttttggg ttagcattgt gttaggtgct tgatatggtc tggctgtgtt 3000cccaccctta tctcatcttg aattcccatg ttttgtggga ggtacctggt gggacataat 3060tgaatcatgt gggcaggttt ttcctgtgct gttctcctgg tagt 31043553131DNAHomo sapiens 355gggttagcat tgtgttaggt gcttgatatg gtctggctgt gttcccaccc ttatctcatc 60ttgaattccc atgttttgtg ggaggtacct ggtgggacat aattgaatca tgtgggcagg 120tttttcctgt gctgttctcc tggtagtgaa taagcctcac aagatctgat ggttttaaaa 180atgggagttt ccctgcacag gctctctctc tttgcctgcc gccatccatg taagatgtaa 240cttgctcctc cttgccttcc tcaatgattg tgaggcctcc tcagccatgt ggaactggct 300gcagagtcat taaacttcgt tcttttgtaa attgcccagt ctcaggtatg tcttttttta 360tttttttttg agacagagtc tggctctgtg gctaggctgg agtgcagtgg tgcgatctcg 420actcactgca gcctccgcct cccgggttca agcgattctc ctgcctcagc ctcccaagta 480gctgggacta taggtgcacg ccaccatgcc cagctaattt ttgtattttt aatagagacg 540gagtttcact gtgttggcca ggatggtctt gatctcttga cctcgtgatc ttcccgcctc 600cgccttccaa agagctggga ttacctaccc agctgggtat gtctttatta gcagcgtgaa 660aacagactaa aacagtaaac tgataccaat agagtgggat gcagctgaaa agatacccga 720aaatatggaa gcaactttgg agctgggtaa caggcagagg tcagagcagt ttagagggct 780cagaagaaga ccagaaaatg tgggaaagtt tggaacttcc tagagacttg ttcaatggct 840ttgaccaaaa tcctgataat gatatggaca atgaaatcca ggctcatgtg gtctcagatg 900gagatgagga acttgttggg aactggagca aaggtgacac ttgttatgtt ttagtaaaga 960gactggtggc attttgccct gccctagaga tttgtggagc 
tttgaacttg agagaaatga 1020ttttgggtat ctggtgggag aaatttctaa gcagcaaagc attcaagagg tgacttgggt 1080gctgttaaag gcattcagtt ttaaaaggga aacagcatga aagtttggaa aatttgcagc 1140ctgacaatgt gatagaaaag aaaatcccgt tttctgagga gaaattcaag ctagctacag 1200aaatttgcat aagtaatgag gatcccaatg ttaatcccca agacaatggg aaaaatgttt 1260ccagggcatg tcagaggcct tcatggcagc ccctctcatc acaagcctag aggcctagga 1320gaaaaaagtg atttcatggg ccagcccggg gtccccatgc tgtgtgcagc ctagtgactt 1380ggtgccctgc atcccagctg ccccagctgt ggctgaaagg ggccaaccta gagctcaggc 1440catggcttca gagggtgcaa gcctgaaacc ttgacagctt ccaggtggtg ttgagcctgc 1500aggtgcacag aaatcaataa ttgaggtttg agaatctctg cctaggtttc aaagatgtat 1560ggaaacgcct gcatgtccag gcagaagttt gctgcagggg tggggtgctc attgagttcc 1620tctgctaggg caatgtagaa gggaaatgta gggtcagagc ccccccacag agtccctact 1680ggggcaccac ctagtggagc tgtgaaaaga gggctaccat tctccagacc tcagaatggt 1740agatccacag acagcttgca ccatgtgcct ggaaaagctg tagacactta acgccatctc 1800atgaaagcaa ccaggcagtg tgctgtaccc tgcaaagcca caggggcaga gctgtccaag 1860gctgtggttg cccagctctt gcatccgcat gacctggaca tgagacatag agtcaaagga 1920gatcattttg gagctttaag atttgactgc catgctggat tttggacttg catggggcct 1980gtagcccctt tgttttggcc aatttctccc atttggaatg gctgtattta cccaattcct 2040ataccccatt gtatctggga agtaactaac ttgcttttga tttgacaggc tcatatgcgg 2100aaaggactta ccttgtcttg aatgagactt tggactggaa ttttgaatta atgctgaaat 2160gagttaaggc tttgggggac tgttgggaat gcatgattgg ttttgaaatg tgaggacatg 2220agatttggga ggggtcatgg cagaatgata tggtttggct atgtccccac ctaaatccca 2280tcttgaattc ccatgtattg tgggagggac ctggtgggag atagttgaat catggggatg 2340gatctttccc atgctgttgt gatagtgaat aagcctcatg agatctgatg gttttaaaaa 2400cggaagtcta cctgcacaag ctctttcttt gcctgctgcc atccatgtaa gacatgactt 2460gttcctcctt gccttctgcc atgattgtga gacctcccca gccatgtgga actataagtc 2520cagtaagcct ctttttcttc ccagtctcgg gtatgtcttt atcagcagca tgaagtccag 2580ctaatacagt gcttgaacat gtaatatctc aaatctgtaa tgtacttttt ttttttttaa 2640ggagcaaaga atctgcagag tgttgtgctt agtaaaatga attttgaatc ttttgtaaaa 2700gatcttcttc 
tggttcgtca gtatagagtt gaagtttata agaatagagc tggaaataag 2760gcatccaagg agaatgattg gtatttggca tataaggtaa ttatcttcct ttttaattta 2820cttatttttt taagagtaga aaaataaaaa tgtgaagaat ttaattgtgt tttagtattt 2880taagtagatt gtgatagtag aatggtttga gacactttaa tagcaattag catgtggttt 2940ttaaaaagtt gcagtttggc tggtcgcagt ggctcatgct tgtaatccca gtattttggg 3000aggctgaggc aggtaggttg cctgagccca ggagttcaag accagcctgc ccaacgtggt 3060aaagccccat ctctactgaa gataaaaaaa tttaaaaaaa ttagctgggg ctattggcac 3120acacctgtgg t 31313563085DNAHomo sapiens 356agttgcagtt tggctggtcg cagtggctca tgcttgtaat cccagtattt tgggaggctg 60aggcaggtag gttgcctgag cccaggagtt caagaccagc ctgcccaacg tggtaaagcc 120ccatctctac tgaagataaa aaaatttaaa aaaattagct ggggctattg gcacacacct 180gtggtcccag ctaatcaaga ggatgaggtt agaggatcac ttgagcccag gaggttgagg 240ttacagttta actttcagag gccaaggcag gaggattgct tgagtccagg agtttgagac 300caccctgggg aatgtaggga gatcccatct ctatagaggg atagattaga tagataattt 360ctgaggggag gggaggggga gggccaggga aggggaggga aaggggaggg gagggcaggg 420ccagcagtaa ggtcataata gagacatgta tctgtaagat ccttataata ggtgaggatg 480gccacaaatt agcgccacag atttgtattt ttagtagaga caaggtttta ccatgttggc 540caggctggtc ttgaactcct gacctcaagt gatccgcctg ccttggcctc ccaaagtgct 600gagattacag atgtgagcca ccatgcccaa ccacaagcat ttatttattt atttatttat 660ttatttattt atttatttag agacagtctt gctctgtcgc caggctggag tgcagtggcg 720ccatctgggc tcactgcaaa ctctgactcc ctggttcaag cttttctccc gcctcagcct 780cccgagtagc tgggattaca ggtgcatgct gcaacacccg gctaattttt gtatttttag 840tagagatggg gtttcaccat gttggccagg acggtctcga tctcctgacc tcgtgatccg 900cctgccttgg cctcccaaag tgttgggatt acaggcgtga gccacagcac tcagccagtt 960atttttttat aagaaaacat tttactggcc aggcctggtg gctcacacct gtaatcccag 1020cactttggga ggccgaggca ggcggatcac gaggtcagga gttcgagacc agcctggcca 1080acatggtgaa accccatctc tactaaaaat acaaaaatta gccaggcgtg gtggtgtgcg 1140cctgtattcc cagctactgg ggaggctgaa gcaggagaat cgattgaacc cttgaggcag 1200aggttgcagt gagttgagat cgcaccattg cactctagcc tgggtgacag agcaagactt 1260catctcaaaa aaaagagaaa 
acattttatt aataaggttc atagagtttg gatttttcct 1320ttttgcttat aaaattttaa agtatgttca agagtttgtt aaatttttaa aattttattt 1380ttacttaggc ttctcctggc aatctctctc agtttgaaga cattctcttt ggtaacaatg 1440atatgtcagc ttccattggt gttgtgggtg ttaaaatgtc cgcagttgat ggccagagac 1500aggttggagt tgggtatgtg gattccatac agaggaaact aggactgtgt gaattccctg 1560ataatgatca gttctccaat cttgaggctc tcctcatcca gattggacca aaggaatgtg 1620ttttacccgg aggagagact gctggagaca tggggaaact gagacaggta agcaaattga 1680gtctagtgat agaggagatt ccaggcctag gaaaggctct ttaattgaca tgatactgtt 1740tcatttaagg aaaaataata aaaaaactct tttttttgta tctaattaaa ataatgttct 1800gatgtttaca gaaactttgt atatttaatt ggacattaga acaagctgtt tgttgtgtaa 1860gatttatttt acctcagatc ttttctcccc cctttccttt ctgtcttgtg ttccaaaaga 1920gtaattatta cggtaaatat tactgtaatt atggatttat caaataagat gcagttcttt 1980agcatttttt gataaatcga gtggaacttt agcctgttat tttactattt gttttatttt 2040aactaaattc tgattgtgtc attttttttt tttttttttg ggaccgagtc tcgctctgtc 2100gcccaggctg gagtgcagtg gtgcgatctc ggctcactgc aacctctgcc tcccaggttc 2160aagcaattct tctgcctcag cctcctgagt agctgggatt acaggtgtgt accaccacac 2220ccagctaatt tttgtatttt tagtagaggt gaggtttcac catcttggcc aggctggtct 2280tgaactcctc acctcgtgat ccacccacct gggcctccca aagtgctggg attacagcca 2340tgagccacca tgctcggctt tgattgtgtc atttgtatag gcatgtggtt tattatttag 2400ttattttttt ttttttcttt gaggtggagt atcactcttg gtgcccaggc tggagtgtaa 2460tggcgtgatc tcagctcact gcaacctcta cctcctgggt tcaagcaatt ctcctgcccc 2520agcaggagta gcttgggatt acaggcatgc cccaccacac ctggccaatt ttgtgttttt 2580agtagagaca gggttccacc atgttggtca ggctggtctt gaactcctga cctcaggtga 2640tctgcccacc tcagcctccc agagtgctgg gattataggc atgagccacg gtgcccagca 2700tatttagatt tttttttttt tgagactgag tctgactctg tcacccaggc tagagtgcag 2760tggcacgatc cacgatcttg gctcactgca gcctccacct tatgggttca agcgattctt 2820ctgcctcagc ctcccaagta gctgggactg caggcacatg ccaacacgcc cggcttattt 2880ttgtattttt atagagacgg ggtttcatca tattggtcag gctggtctct aactcctgac 2940cttgtgatcc acccgccttg gcctcccata gttctgggat tacaggcatg 
agccacagcg 3000ccaggcctag atgtttctta aggtatgtat ctcccaaaga ttctttttgt ggtcctcaag 3060taccataagc accgctggag ataac 30853573148DNAHomo sapiens 357accataagca ccgctggaga taacacatgt gatgggcatt tttagcatag attgtatcta 60agcaactttc cacaagtaat agttctgtta agggttgtta ttgtggccgg gcgcggtggc 120tcacacctgt aatcctggca ctttgggaag ctgaggcggc cggatcacct gaggtcaggg 180attcgagacc agcctgtcca atgtgctgaa accctgtctc tactaaaaat gcaaagaaaa 240aaaaaatcta gccaagcatg gtggcttgct cctgtaatcc tagctacttg ggaggctgag 300gcaggagaat tgcttgaacc tgggaggcag aggtagcagt gagccaagat cgtgtcaccg 360cattccatcc tgggcgacag tgagactctg tctcaaaaca aaaaaagagt tgttaccgtt 420gggactattt tttgaaagct ttatgtgaac gtaattttat attttgatga aaatttagtt 480tattgatgta aaaagtgtat cagtacatca tatcagtgtc ttgcacattg tataaacatt 540taatgtaggt gaatctgtta tcactatagt tatcaatgtt ataattttca tttttgcttt 600tcttattcct tttctcatag tagtttaaac tatttctttc aaaatagata attcaaagag 660gaggaattct gatcacagaa agaaaaaaag ctgacttttc cacaaaagac atttatcagg 720acctcaaccg gttgttgaaa ggcaaaaagg gagagcagat gaatagtgct gtattgccag 780aaatggagaa tcaggtacat ggattataaa tgtgaattac aatatatata atgtaaatat 840gtaatatata ataaataata tgtaaactat agtgactttt tagaaggata tttctgtcat 900atttatctca aaacctaaac tgtgtatcaa tgatattaag cttttttttt tttttgagac 960agagtttcac ttttgttgcc caggctggag tacaatggcg cgatcttggc tcaccacatc 1020ctctgcctcc caggttcaag tgatcctcct gccttggcct cctgagtagc tgggattaca 1080ggcatgtgcc accacgcctg gctcatcttt tttgtatttt tagtagagat ggggtttctc 1140tatgttggtc aggctggtct caaactcctg aacctcaggt gatccgcccg cctcgggctt 1200ccaaagcgct gagattgcag gcatgagcca ctgtgtctgg cctattttta tagtttatgt 1260acttggaatt atataatata ttctgcctag cttctttcat tcaatatttg taagatttat 1320ccatattatt gagtgtagtt gtggattttt gcatttatat ttcatagcac gagcatgtca 1380gaatttatcc attttacttc ccttctgccc gccactgcta ctctccccat tttacctttt 1440tttttgtttt tttgagatgg agtctcagaa tttcgctctg tcgcccaggc tggagtgctg 1500tggcacggtc tcagctcact gcaacttctg cctctgggtt cagctgcacg ccaccatgcc 1560tggctaattt ttgtattttc agtagagggg attttgctat 
gttggccagg ctggtcttga 1620actcctgacc tcaggtgatc cacccacctt ggcctgccag agtgctgtga ttacaggcgt 1680gaaccaccgt gcccgacccc cattctaatt ttgatggaca tttgggtaat tttcattttt 1740ggctgttata aatactgctg caattacagt taattttcac agtttttttt tttttttttt 1800tttttttttt tttttgaggt gagtttcgct cttgttgctc aggctggagt gcagtggtgc 1860gatctcagcc cactgcaacc ttcaccttct ggattcaagc aattctcctt tctcatctcc 1920taagtagctg gggtttacag gcatgtgcca ccatgcccag ctaatttttg tattttaatt 1980tcacagttct ggaggctggg aagttcagaa ttaaggcact ggctgatctg ttgtctggtg 2040agggcccact tgttcataga taaccatttt ctcactctaa cctcacaagg ttgaaagggc 2100ctaatttttg tgtttttagt agagacgggg tttcactatg ttggctaggc tggtctcaaa 2160ctcctagcct cgagtcatcc acccgcctcg tcctcccgga gtgcttggat tacagcatga 2220gccactgcgc ccggccccca ttttagtttt gatggacatt tgggtaattt tcttttttgg 2280ctattctaaa taatgctgca attactgtta attttcacct tgtaaaaacc attttcaaat 2340ctcaagagat taacctttag ttttcttggt ttggattggg aaggaacacc aaggaaaatg 2400agggacttca gaatttattt tcattttgca tttgtttttt aaaatcttta gaactggatc 2460cagtggtata gaaatcttcg atttttaaat tcttaatttt aggttgcagt ttcatcactg 2520tctgcggtaa tcaagttttt agaactctta tcagatgatt ccaactttgg acagtttgaa 2580ctgactactt ttgacttcag ccagtatatg aaattggata ttgcagcagt cagagccctt 2640aacctttttc aggtaaaaaa aaaaaaaaaa aaaaaaaaaa agggttaaaa atgttgaatg 2700gttaaaaaat gttttcattg acatatactg aagaagctta taaaggagct aaaatatttt 2760gaaatattat tatacttgga
ttagataact agctttaaat ggctgtattt ttctctcccc 2820tcctccactc cactttttaa cttttttttt tttaagtcag agtctcactt gttccctagg 2880ccagagtgca gtggcacaat ctcagcccac tctaacctcc acctcccaag tagttgggat 2940tacagttgcc tgccaccatg cctggttaat ttttatattt ttagtagggt tgcggggaca 3000gggtttcacc atgttggcca ggttggtctc aaacttctga ccttaggtga tcctcccacc 3060tcggcttccc aaagtgctgg gattacaggc ttgagccatc gtgcccagcc tactttttac 3120ttttttagag actgggcttg gtggagtg 31483582176DNAHomo sapiens 358ttagagactg ggcttggtgg agtgaagtgg caagatcata gctcactgca gtattgaact 60cctgggctca agcgatcttc ctgcttcaac ctcatgagta gctgggtcta caggcacaag 120ccaccatgct tgcctaattt taaaattttt gcagagttgg agtttcacag tgttgcccag 180gatgttcgct cactcctgac ttcaagtgat tcttctgcct tagcctctag agtggtagct 240gggattacag gcatgaacca ccatgctctg ctattttttt tcaaggtttt tttttttttt 300tttttttttg agagactggt atgactatgt atgctcccta ggctggagtg cagtggctat 360tcacaggaag tgccatcaga gtgtactaca gcttcaaact cctgggctca agcacttcta 420tcatagtctc caaagtagct gggactacga gtgtgtctca ttgtgccttg ctctcgaatt 480gctttttttt tttttttctg gtttcaagct atctatgtgg tattagtcct cactttatga 540ataattttgt atactactaa tagcaatttt tttttttttt ttttttttga gacggagtct 600cattcttgtc gcccaggctg gagtgcagtg gtgtgatctt agctcactgc aacctctgcc 660tctccggttt gggcaattag ctgggattag aggcgcctgc caccatgccc agctaatttt 720tgtattttta gtagacatgg ggtttcatct tgttggctag gctggactct aactccaggt 780gatctgcctg cctcggcctc ccaaattgat gggattacag gtgtaaacca ctgggcctgg 840cctagcaatt taaaatgaca ttctaagaag ttttatgtct aaatctgcag taagtggctg 900ggtgacgtgg ctcatgcctg taatcccaac gctttgggag tccagggtgg gaggatgact 960tgaggccagg agttgagacc agcctgggca acatagtgag actctgtctc tacaaaagaa 1020aaaattagcg gggcttagtg gcgtgcgcct gtagtctcag ctactcgaaa ggctgaagtg 1080ggaggattct ttgagcccca agggttctgg cttgccgtga gccaggatgg caccactgca 1140ctccagtctg ggcaatagag tcagaccctg tctcaacaaa taaaataaaa ctgtagtaat 1200tataaagtgg ttttggctgg gggagaaatg tacagttgaa catacggatt aagaggttga 1260aagttggtct taggaagagg aactttttgt ggaaatttct taatatttga agaatattat 1320gttattgttc 
ctctgttttt catggcgtag taaggttttc actaatgagc ttgccattct 1380ttctatttta ttttttgttt actagggttc tgttgaagat accactggct ctcagtctct 1440ggctgccttg ctgaataagt gtaaaacccc tcaaggacaa agacttgtta accagtggat 1500taagcagcct ctcatggata agaacagaat agaggagagg tatgttatta gtttatactt 1560tcgttagttt tatgtaacct gcagttaccc acatgattat accacttatt gtaatatgca 1620gttttggaag tatatgttac catttaactg tacagagtac atagtaatag agtggtaatt 1680atttagattg attaaagaac tcattttttt aaataagttt tttttttttc actataaaag 1740tttattttat ttgagatggt atggtatcga acatgttcat attgtgtgta atcgtgggta 1800aattactcaa cctttatgtc atagtttctt cacctttaaa atgacattaa taaaagagct 1860acttaatagg attataagca tgagatgatt taatatacat aaaatactta cagtctgata 1920tataggaagc acttaactct ttatcctaga aaagatttaa ggtgacctta acatatatgt 1980cagaaaatct ttaaaattgt ggaaataaaa ggttgtataa ttctgctatc ctaaaattac 2040tagtatttca atatatttta ttttagtctt ttcttttaga tacaagtttt aaaactttta 2100agtgaagtgt aatatacgta agtactgctt gatgaattta aggtgatttc taaagccagg 2160tttgttgggg aagagg 21763593128DNAHomo sapiens 359ccagtttcaa gcgattaagt gattctcctt cctcagcctc ccgagtagct gggattacag 60gcgtgtgcca ccacgtcgcc accacgtctg gctaattttt gtatttttag tagagacagg 120gtttcgccat gttggccagg ctggtcatga actcctgacc tcaggtgatc cacctgcttg 180gcctcccaaa gtgctaggat tacaggtgtg agccactgtg cctggcttaa gttttgtatt 240tttagtagag acggtgttcc atcatgttgg tcaggctggt gtcaaactcc tgaccatgcg 300atccgcctgc ctcggcctcc caaagtgctg agattacagg cgagagccac cgtgcccggc 360ctgtttgagt atcttttaaa accagtaagg acaaactaga ggtgtcagct ctcttcatgg 420gctttggaga aacaagacaa aaaggaaaga gatgtttcgc cgggcgcggt ggctcactcc 480tgtaatccca gcactttggg aggctgagga cagcggatca cccgaggtca ggagtcaaga 540ccagagccat tgcactccag cctgggcaac aagagcacaa ctttatctca aaaaaaaaaa 600aaccaaaaaa gaaacaggaa agagatgttt tgatttttta agtctagagt gttctgttct 660tactctacag cacttagcag tagtccatct atcctccttg tttgttcttt acaacaaaac 720cccattggtt ctctcttacc aagtttgctt tattcttggt ttatcctttg taagatgtga 780aagggatatg aagagcaaat aggaagtgtt actcttgctg cttgagagaa agctgtttta 840caatttgttg 
gcaaacaatt tgtaaaagta caacaaaagt gtgcattttt ggcttcttat 900ttatgtttta tcattgctat atctcataat ttgtgatttt taaaataact ttttatttga 960aaagcactac agggtcacgt catgttttta aaaaataaat taagaaggta aacacccgta 1020cttctacttt acctctagtc ctagtctatg gtggtaatca gtgttaacag tttagtttgt 1080gttcttaccc ttccaggggt tttttttctc tatgtataca gatatatgca tttttaaaaa 1140catagttaac acttaaaaac aatatgggat cgtattagga atacaatctg tattccttcc 1200caacagtata tacagttttt ttccatttca ctatgtatct atttataaat tttttatttc 1260taataatttc tcttgaatag gtgagacatc atatagtata aaattcagta gaaaatcagt 1320ttttcagagg tacaaaattg gctgactttg cacagactcc tttcatttca caggtaggga 1380tgcacagcca cctcttccac cgacgagagg aaaggatatg tgtgcctgtg ggctcttcaa 1440ctctgttgat tagttatgat ttattttctg gtcagtttga gaggaaacag tgataaaata 1500ctgggaacag ggaagaagca taagattatt attgtttttt tttttttttt tgagacagag 1560tcttgctcag ttgcccaggc tggagtgcag tggtgcgatc tttgctcact gcaagctccg 1620cctcccgggt ccatgccatt ctcctgcctc agcctcccga gtagctggga ctacaggcgc 1680ccgccaccac gccctgctaa tttttttttt gtatttttag tagagacagg gtttcaccat 1740gttagccagg atggtctcga tttcctgacc tcgtgatcca cccgcctcgg cctcccaaag 1800tgctgggatt ataggcgtga accaccgcgc ccagctttaa tttttttttt tttttttttt 1860ttgagacaga gtcttgctct gtcgcccagg ctgaagtgca gtggcgcgat ctcggcttgc 1920tgcaagctcc gcctcccagg ttcacgccat tctcctgcct cagcctcctg agtagctggg 1980attacaggca cccgtcacca tgcccagcta attacgggac ctcgctctgt cgcccgggct 2040ggagtgcagt ggcacagtct cgctcactgc aatctggcaa gtgattctct tgcctcagcc 2100tccagagtag ctgggactac aggtgtgcgc cgctacgccc agctaatttt tgtattttta 2160gtagagatag ggtttcgcca tgttggttgg ccaggatggt ctcgatctct tgacctcgtg 2220atccgccctt ctcggcctcc caaagtgctg ggattaccgg tgtgagccat cgcacctggc 2280cttcctactt tattaagata cctaagggat ttctgtgatt gttaggattc aaatttctgt 2340gagcataaga atcaagctgt gtgcataata attgcatggg atttcacagc tgggccccat 2400tcccagggat tttgtattat ctacctccaa gtgattttga tgctggtgat ccttggacca 2460gacttggtga agctcaatgc ttagctagga aagccccaaa aatttgcttt attggattgt 2520gtaatttgac tacatccatt gtttcttttt tcaaatgtag 
agttatatgc cacaaaaata 2580ttttccgtag cagtaggcat cctaattaat ctcgatgttt gtttatagcc ccattgatgg 2640ggctataaac ttggcagcaa attgttttcc cactaatttg gcattttcca taaatgtttg 2700tttatagccc cattgatggg gctataaact tggcagcaaa ttgttttccc actaatttgg 2760cattttccat aaaaaacacg tatctgttgt tagctgccta gacgttagct ggacatggtt 2820taggttactt ttctcttaaa aagtaaattt taattcaagt tcctttaagc cagcagtctc 2880aacctggggc agtttttccc tccaggggac attcagcagt gtctagagac atttttggtt 2940gtcatgctga ggaagagagt gtatagtggg tagaatccag ggatgctgtt aagcatggaa 3000cagcccctta caacaaaaaa ttatgtagcc taaaatggca gtgttgccaa gattgagaaa 3060ttatgcttta aatgtgtttt tatatatggc cattttgtgt ttactctgga gataacatgc 3120ttttcctc 31283603145DNAHomo sapiens 360tccgtagcag taggcatcct aattaatctc gatgtttgtt tatagcccca ttgatggggc 60tataaacttg gcagcaaatt gttttcccac taatttggca ttttccataa atgtttgttt 120atagccccat tgatggggct ataaacttgg cagcaaattg ttttcccact aatttggcat 180tttccataaa aaacacgtat ctgttgttag ctgcctagac gttagctgga catggtttag 240gttacttttc tcttaaaaag taaattttaa ttcaagttcc tttaagccag cagtctcaac 300ctggggcagt ttttccctcc aggggacatt cagcagtgtc tagagacatt tttggttgtc 360atgctgagga agagagtgta tagtgggtag aatccaggga tgctgttaag catggaacag 420ccccttacaa caaaaaatta tgtagcctaa aatggcagtg ttgccaagat tgagaaatta 480tgctttaaat gtgtttttat atatggccat tttgtgttta ctctggagat aacatgcttt 540tcctcatata acatgcttga taaacatttt ggtaacacag gaattgtaaa tgctggtgat 600gtcagtaaat agttaagaaa tttagggctg tgcgcggtgg ctcacgcctg taatcccagc 660actttgggag gccgaggcgg gtggatcccg aggtcaggag atcgagacca tcctggctaa 720catggtgaaa ccccgtctct actaaaaata caaaaaaatg agccgggtgt ggtggcaggc 780acctgtagtc ccagctactc aatttagaaa gcagatttgt ttcctttcta tacctgtgta 840atttgaggtt tagtttactg tcacatcgtt tataaacata aggaagatcg ttgctcatct 900gatagcattc cgaaccttga gtcatctgta atgcctatgg cctccagaaa agcttctcta 960atactgtact tagagatgtg taaaatatgt aggaacattt tcccaccttc gattgttagt 1020ttacctttca gcttcagtaa tttacctttc agctattact ttagtaacat cttcaacatt 1080gtttttcaaa ctgcaaggtg tgacccagta gtgggtcgtt aaattagtag 
gtgacagagc 1140atttttgaag aattaaatac aatagaacat agcagagtgg gctcacgcct gtaatcccag 1200cactttggga ggcgaggctg gcaggtcaca aggtcggcag gtcacaaggt cagaagatcg 1260agaccttcct ggctctaaca tggtgaaacc ccgtctctac taatagtaca aaaaattagc 1320ggggtgtggt ggcatgcgtc tctagtccca gctactcagg aggctgaggc acgagaatca 1380cttgaatccg ggagctggag gttgcagtga gccgagattg caccactgca ctccagcctc 1440agcaacagag caagactatt tcaaaaaaaa aaaaaaaaaa aaagaaagaa agaaaaaaag 1500aaaatagagt gtatcacata attagagtag caagtattga tttgtgaaac ctattttaat 1560catagatcta tgtatgtatg tgctggattg tgatgtaaag acatttcttg ctgtggttac 1620actgaaaaaa atgaaaagtc actgatttcc aataacttac agaagcagta tgaactacat 1680attctgtcgt tcttgaaaca agctgagatt ttattgactt tgggaagcag tagaattatt 1740ttagtttttt aattaacagt ttttggcttt gtactgtcaa gaggtaattt tagaaagcat 1800tctaaaaatg taagtactgg atttggcaac attcttgaac tgtaattctg tttcgttaaa 1860catcactatt tacatgtgca acagcgtgtc tgtaacaatg tcccagtaat gaaattcttt 1920cttctattta aggcatgtct gtttgataaa agtcaaacaa aattgggtat atgtcagtgt 1980cttatgatac tgcttaatta aacattaatt tgactcttag ctaatcagga aatgtttgcc 2040tcacagtctt acagagcttt ccaccttcta aaaaagctaa cgtttcagaa tagattcagg 2100attcaacctt ctttctgtct tttttttttt ttgtttgaga cagagtcttg ctctgttgcc 2160caggctggag tacagtggcg ctatctcggc tcactgcaac ctccgcctcc tgggttcaag 2220caattctcct gcctcagcct cccgagtagc cggggttaca ggcgtgcgcc accatgccca 2280gctaattttt ttgtattttt agtagagaca gggtttcacc atgctgggtg gccaggcggg 2340tctcaaactt ctgaccttga gatctgccca ccgtggcttc ccaaaatgct gggattatag 2400gcgtgagcca ccgcacctag cctagattca ggctgcttct tttttttttt ttttttgaga 2460cagagtcttg ctcttgttgc ccaggctgga gtgccatggc atgatctcag tgcaccacaa 2520tctctgcttc ccaggtttaa gcgattctcc tgcctcagcc tcccaagtag atgggatcac 2580aggcatgagc caccatgcct ggctaatttt gtattttttg tacagacggg gtttctccat 2640gttggtcagg ccagtctcga actccctacc tcaggtgatc tgcctgcctc ggcctctcaa 2700agtgctggga ttacaggtgt gagccactgc gcccagcaga ttcaagcttt ttaaatggaa 2760ttttgagctg atttagttga gacttacgtg cttagttgat aaattttaat tttatactaa 2820aatattttac attaattcaa 
gttaatttat ttcagattga atttagtgga agcttttgta 2880gaagatgcag aattgaggca gactttacaa gaagatttac ttcgtcgatt cccagatctt 2940aaccgacttg ccaagaagtt tcaaagacaa gcagcaaact tacaagattg ttaccgactc 3000tatcagggta taaatcaact acctaatgtt atacaggctc tggaaaaaca tgaaggtaac 3060aagtgatttt gtttttttgt tttccttcaa ctcatacaat atatacttgg caatgtgctg 3120tcctcataaa gttggtggtg gtgac 31453613080DNAHomo sapiens 361cccagatctt aaccgacttg ccaagaagtt tcaaagacaa gcagcaaact tacaagattg 60ttaccgactc tatcagggta taaatcaact acctaatgtt atacaggctc tggaaaaaca 120tgaaggtaac aagtgatttt gtttttttgt tttccttcaa ctcatacaat atatacttgg 180caatgtgctg tcctcataaa gttggtggtg gtgactcact cttaggacac attcagattt 240cttttttttt tttttttgag aaggagtctt gctccgttgc caaggctaga gtgcagtggc 300acaatctcag ctcactgcaa cctctgcctc ctgggttcaa gcgattctcc tgcctcagct 360tcctgagtgg ctgggattac aggcatgtgc caccatgccc ggctaatttt tgtactttta 420gttttaccat gttggccagg ttcgtctgga actcccaatc tcaggtgacc cacctgcctc 480ggcctcccaa agtgctggga gtacaggcgt gagccacaga gcctggccat gttcagactt 540ctaataacag gtttgtattg actcttagcc tcatggcaga agccaagaga catgagacag 600cttagaaatt tttgcttttt ggaaatgaat gttagagtta ctggtttgtg attaaggcct 660attgcactga cagaggcagt gaaaaagggt ttgattgcca aggaagattc acagggccta 720gaatggcagt ggttatgcat ctacagttta ttacaggaga aggatacaat ccagtagcag 780gattatggta aggatatgca tcacagtcaa aggctgtcat agcaagtcat ccagagagtt 840cgggtgcaag ttccagtttt cctttgttgt gtaaagtctg tggtggggtg cattttctct 900ctcagagcag gatgtgtgca caggacacct tggaacctag gagcccaaaa tagagtcttc 960actggacttt ttaatatttt tcttgtcaag cggacatgtt cctgttctct aactagcctc 1020ttcagtggag gtcagaggaa gagcctcatt gagaccaagt gcaactcatc aatcacatga 1080aacaatgctg ataaataaac cacctaaata tcccctgacc cacaaataca aaacaacacc 1140attcaatcag tatttttcat gccttgatca ggggtcattg ccatgcagga actttaacaa 1200aacagtacag gctaataata gaattgttgg aattaactca cacagcacac ctatgagaga 1260gagttaagat agagggtctt ggtggtctct aacagttgaa ttcaaagtga agttaccaga 1320gtaaagtgag caaagacaca tattagtaca atattggtag ataaaatcac gttgctctaa 1380taagcatagt 
tttaaacttt aaccatgttt ctccagtaat tttagtaatt atattgttgt 1440tatgtctaat acataaagca ttttttactt ttttaaaaaa tttttaggca atgtggggtc 1500caaagtaatt aaaaaaaaat ttttttaaca taaagcatct taaaatttta cttaatcatg 1560atcacttaga accattaaaa catacgtttt gatattatgg ggaagcttcg ttgttccttt 1620gtagacagac ttaaagaaat acaactttat gatgacaaga tataagataa ttatagattt 1680aaattttata gaaacctttt cccttatcta gtgcaagagg tagctaagtg cttattttct 1740caaagtactg tgttataaaa agtattccta gtgtagtcaa agcttctctt tagactgata 1800aaacttagag cacctgcatt tacttccaac aaagcagaat taaagaaaat gagacttggc 1860cgggtacgtt tgtaatccca gcactttggg aggccgaggc aggtggatca tgaggttagg 1920agatcaagac cattctggct aacatggtga aaccctgtct ctaccaaaaa tacaaaaaat 1980tagctgacat ggtggtgcgc acctgtagtc ccagcttctc aggtggctga ggcaggagaa 2040tcgcttgaac ccaggaggtg gaggttgcag tgagctgaga tcacaccact gcgctccagc 2100ttgggcaaca aaaaaaaaaa aaaaaaaaag aaaaagaaaa tgagtcttta ctggctgggc 2160acagtggctc acacctgtaa tcccagcact ttgggagacc gagacgggca gatcacctga 2220ggtcgggcat tcgagaccag cctgaccaat atggagaaac cccatttgta ctaaaaatac 2280aaaattagcg gggcgtggtg gcgcatgcct gtaatcccag ctattcggga ggctgaggca 2340ggagaattgc ctgaacccgg gaggcggagg ttgcggtgag cagagatcgt gccgttgcac 2400tccattctgg gcaacaagag cgaaactctc catctcaaaa aaaagaaaat gagtctatac 2460tttgctgttt tcatactctc ttagtgtggt gtaggcagcc atgtatcccc cttgtgcctc 2520tatttctcca ttctgtgaat gagtgtcttc cactgctgtg cttttctgat tccgtaacct 2580ttgtttgttt gtttgtttgt ttgtttgttt gttttttatt gatcattctt gggtgtttct 2640cgcagagggg gatttggcag ggtcacagga caatagtgga gggaaggtca gcagataaac 2700aagtgaacaa aggtctctgg ttttcctagg cagaggaccc tgcggccttc cgcagtgttt 2760gtgtccctgg gtacttgaga ttagggagtg gtgatgactc ttaatgagcg tgctgccttc 2820aagcatctgt ttaacaaagc acatcttgca ccacccttaa tccgttcaac cctgagtgga 2880cacagcacat gtttcagaga gcacagggtt gggggtaagg tcacagatca acaggatccc 2940aaggcagaat aatttttcgt agtacagaac aaaatgaaaa gtctcccacg tctacctctt 3000tctacacaga cacggcaacc atccgatttc tcaatctttt ccccaccttt cccccctttc 3060tattccacaa aaccgccatt 30803623029DNAHomo sapiens 
362ccaaacaaca gcattagcca actctttgaa gccttagatc tgtggctctt gttttctcct 60ttgaggtgta ggtccttgag ggcatttgct tctaatagag gctagtttca tcagaattaa 120aaatctgaac catggtatga aattcaattc tttttttttt ttcttttttg aaaacactgg 180caaatgtttt gtatccttga gctttcccac atatcttaac atagtgagtg gaaagtacag 240tggctgttaa gccaactact ctgaggtctt cactgctaag gcttactctt aattgtgtga 300gagcttaacc ttgatccctt taaaacatta atgggctaga aaaaaaacca ttcataaacc 360agtgccacct ctgaattttg ctaccacaat tcccttattt accaatagtg catgagctaa 420tttggaataa agaactaggc attgtagcac aacagacatt atgtgggcaa agtgttgttt 480atattctgtc taaatagtgc ttcacatgta tgtactattt tctaaatatg tatagatgct 540tttgtgatta ataataaaac atgaattctt aaaacaattt tgctgacttc atagtagctt 600ttcaccgttt tttcagtagc tgctaaaatt tctggagaag tttgggaact attgttttgg 660agtgaaatgc agtgtgttag atatcacttg cagaattctt ctaagggtat ttattggcga 720ttagaaaaaa aatccttgtg ttataccagt agtaatacaa agtaattgtt cagcttctgt 780taagtgtaaa ggactataca agtattgtgt atagttatct catttattat tttctgggta 840gctattgtta ttattacttc gtacaaaaag ggaaaaggag gctcaaagta tcatgctcca 900gataacagag ccagtaggta gcagagctgg gattgctacc caggtctcta gtcctgcttt 960ttcacactat atactcattg cttcacttac tccttcatac atgattcccc agcatgtact 1020cttttttttt tttttttttt tttgtttgag atagaatctc gctctctgtt gcccaggctg 1080gcaggcagta gtgtgatctt gggctaactg caacctccat ctcctgcatt caagcagttc 1140tcctgcttca acctcctgag tagctgagat tataagccta tgctaccacg cctggctaat 1200ttttgtattt ttagcagaga tgaggtttcg ccttgttggc caggctggtc tcaaactcct 1260gaactcaagt gatctgccca cctcagcctc cgaaagtgct gggattatag gcatgagcca 1320tcatgtccgg cctccccatc atgtaccctt aaataccatc aagcacagtt ccattgtgta 1380aaaacttggc ttgatttaac ctgttaattg gaacactgtc attaatggaa attaggaata 1440tgaggtaagc tagaggtttt attttaatga ctttgggtta ttaaatctat aagaaatgaa 1500attcatttag tcataattaa tgtcatgttt ctgcatctat attacttgtt gggtttacag 1560acgaggtagt gtattattag tgggaagctt tgagtgctac atcatctccc tttctataaa 1620ataaattgag tacgaaacaa tttgaattaa aacacctgag taaatagtaa ctttggagac 1680ctgctgtact atttgtacct tttggatcaa atgatgcttg 
tttatctcag tcaaaatttt 1740atgatttgta ttctgtaaaa tgagatcttt ttatttgttt gttttactac tttcttttag 1800gaaaacacca gaaattattg ttggcagttt ttgtgactcc tcttactgat cttcgttctg 1860acttctccaa gtttcaggaa atgatagaaa caactttaga tatggatcag gtatgcaata 1920tactttttaa tttaagcagt agttattttt aaaaagcaaa ggccacttta agaaagtttg 1980tagatttttc tttttagtat ctaattgtag cacctttgtg gacagtggat gtaatattaa 2040gtgacagatg ggaaaaggat ttttaaaaaa atagcaactg tttcagtgga tgaaataaag 2100attattagca gagaaaatga atattgggca taactgtcct ggtgaaagac aatctcataa 2160atgaacaatt tcataatttc gtaaatgcaa ctgcatttta ttttcaaaga gaaggaaaat 2220tatagtcact ggaaacggaa agagaagtta gaggtaaaca taggacacac aagaaaactt 2280tcattttgtt tattttcttg tttttctttt gagacagggt ttccctctgt tacccaggct 2340taagtgcagt gacactatca tagttcacta acccctcaaa ttcctgggtt caagtaatcc 2400tcctgcctta gccttagtag gtgtaaatac aggtgtgtac caccatgcct ggcgaatttt 2460aaaaaaactt ttttatagag atgagctctc gccgtgttgc ccaagctggt cctaaaacgc 2520tggcctcaag ctatcctccg gcctcagtct tagcctccca aaatgctggg gtttcagtag 2580aagccaccat gccgggccac ttctgtttct tttccatgta gagttctttg caggaggagg 2640ttagaatagg tgtgcatctc ctaaatagtt gtcgaatata actaaaaagt taaccaggac 2700tctaaatact atttacttct aaaatttgtt aattgggaac atttagggtt taactgatct 2760atatcttatg tctttaacaa ttttgaatga taattatatg taaagtaaga acagtttgtg 2820aaatagttga aaatatcctt acatgaaagt gaattttaaa
gcacagttta tgtaatgtta 2880atgttttgtt ttgtatctgt taaaaatttg tttatatgaa caagtttaca ggtttactgt 2940ggtgagcccg ttgaatatag tgggtttttt ttgtttgttt tgtttttgtt tttgagatga 3000agtctcactc ttgtcccgag gctgatgtg 30293631685DNAHomo sapiens 363gagcccgttg aatatagtgg gttttttttg tttgttttgt ttttgttttt gagatgaagt 60ctcactcttg tcccgaggct gatgtgcaat ggcgcgatct tggctcactg caacctctgc 120ctcctgggtt caagcgattc tcctgcctta gcctcccgag tagctgggat tataggcacc 180tgtcaccaaa cccggctaag ttttgtattt ttggtagaga tgggatctca gcatgttggc 240caggctggac tcaggtgatc cgtctgcctc ggcctcccaa gtgctgggat tacaggtgtg 300agccaccatg ccgagcctga atatagtgtt tttaagttgc aggactttaa aaataatatt 360ttgaaatttt tctaagttaa attccctgtt aaaatggtca tgcaggaata tacgcttgca 420ttattcatat tagggtaact gtttggtttg ctagttgtta gattctttgc attccttttt 480tttttttttt tttttttttt ttttgagacg gagtttcact ctttttgaca aggctggagt 540gcaatggcgc tatctcggct cacctcaacc tccgcctcct gggttcaagc gattctcctg 600cctcagcctc ccaagtagct ggaattacag gaatacgcca ccaagcccgg ctaattttgt 660atttttagta gagatggggt ttctccatgt tggtcaggct ggtctcaaac tcccagtctc 720aggtgatcag cccacctcgg cctcccaaag tgctgggatt acaggagtaa tcccccaccc 780ttttaaaaaa atgagacaga gttttattct gtcacccagg gtggagtgca gtggtgcgat 840catggttcac cgcagccttg aatctgggct caagtgatcc tcccacttca gcctcccaag 900tagttggaac catagatgtg catcaccaca cctggctgat ttttaaatta tttgtagaga 960tgaggtcttg cttgttgtct aggctggtct taaacttctg ggcttcagca gtcctcctgc 1020ctcagcctcc cagagtgctg agatgataga catgggccac tgcccctggc cgcatttttc 1080ttttcttttc ctttcttttt tttttttttt ttttgaaacg gagttttgcc attgtcgccc 1140aggctggagt gcagtggcac gatctctgct cactgcaacc tctgcctccc gagttcaagc 1200cattcttctg cctcagcctt ccagttatct gggattacag tcatgtgcca ccacgcccag 1260ctaatttttg tatttttagt agaaacaggg tttctctatg ttggtcaggc ttgtcccaaa 1320ctcctgacct cagatgatcc acctgcgtct gcctcccaaa gtgctgggat tataggcgtg 1380agccaccatg cccggcccta actgcatttt tcttagtatt tgtggtttga gttaatactt 1440gccctatgtg atgttgattt attattactg gatcattaag tgaggtttaa agaagctaaa 1500tgccatttgc tctatgccct ctggatttta 
aaagtgcatg ggtgtgcacg tgtgtaggta 1560taaatgtttc catattctag tatattctgt gtcagtgata gagcagtctt agagctgtct 1620tttccattta cttgtaggtt aagaagccaa aaaaagttgt gtcatcatcc cgtttaggaa 1680aactt 16853642998DNAHomo sapiens 364tgggtgtgca cgtgtgtagg tataaatgtt tccatattct agtatattct gtgtcagtga 60tagagcagtc ttagagctgt cttttccatt tacttgtagg ttaagaagcc aaaaaaagtt 120gtgtcatcat cccgtttagg aaaacttaca ttttggctat tgtttcctct agtgctgcta 180ttagtggaat gattttaggt gttcaacttt cagatcaatg ggagacagaa atattgttct 240gagacatctg gaagccgaat gtgttttatt cctgcctgtc tgaggatgtg gtcttgcctt 300tgatagggca aagttatttg taaacattgc tttaaataaa aacatgtaaa ggtgtttttg 360atggttaaca aaaactatga gtataataga gcctagtccc tattacggac tggtattgat 420ctggtgtggg aagagtattg agcttttcag tgtcacctac ctgtattccc ttgaagggac 480ccagagccca ggcaaagctc tgctgaggtc gggcgtggtg gttcacgcct gtaatcctag 540cattttagga gaccaaggcg ggtggatcac ctgaggtcag gagttcaaga ccagcctagc 600caacatggtg aaaccctgtc tctactaaaa atacaaaaat tagctgggtg tggtggtgca 660tgcctgtaat cccagctatc tgggaggctg aggcaagaga attgcttgaa cccaggagac 720ggaggttgca atgagccgag atcatgccac tgcactctag gtgggtccct gagtgagact 780ccatctcaaa aaaaaaaaca aacaaaaaaa aaaaaaaaaa aaaacctctg ctgaaatgct 840acagttaatt ttgccatttg tggtcagcat tcttcttcta aattgctata atcttgcctt 900catattatgt gtctcaaatt taagcaggta tcagaatgtc cacgggaaca aattgccatg 960gctctaagcc cagaatcaga ttcttcagat ctggagtagg gctggggaat ttgcatttct 1020aacacacaag tttgttgatg ctgtttgtct ggggtccacg cttgcctaac ttctgatgtg 1080atttatttct gccagtttct ttttttgttg ttgttttatt tttttgagat ggagtctcgc 1140tctgtcactc aggctagggt gcagtggcat gatcttggct cactgcaacc cctgcctcct 1200gggttcaagc gattctcctg cctcagcctc ctgagtagct ggggttatag gcacactgca 1260ccacacccag ctaatttttg tatttttcgt agagacaggg tttcaccatg ttggccaggc 1320tggtcttgaa ctcctgacct caggtgatcc attggcctcg gcctcccaaa gtgctcggat 1380tacaggtgtg agccacccac catgcctggc ccttcctacc aatttctatc ctccctgaaa 1440tgctgcacac ttaggcagtc actggacaat atctgcccca aaattggttt gtataattga 1500gaatatttaa gaggttgtta aaatttgaac cactttctat 
tcttctatta agtgtacaca 1560tctattaaag atccccttgt agctcttttt atctgggcca tcacatttct gcccagcaga 1620tgcagaggcc ctgtcctctc ttccacctcc ccactacctc tccttcccta cttttggact 1680gtaaaagctg tctttctgca gttaattgtt ttattctttg taggttctac tcgttgataa 1740tgttatctac tgctataata attacagacg gcaacaggat gatcaaatct tggatatttt 1800aaatttacat tatgcctttt ttattttatt tttttaaagt ctctgcttga cagcaaataa 1860gcctaacgtt ccctaacaaa tgatgatgtc ccattaatga tttgatgact tcctgtttgt 1920agtttttatt tagagtgctt gtgggtagtt tttcataacg acatttaaaa atcaggatat 1980aaataatttt ttaagttttt tttttaggcg gggcacagtg gctcacacct gtaattccag 2040cattttggga ggctgaggtg ggcagatctt gtgaggtcag gagttcaaca ccagcctggc 2100caacagggcg acaccccatt tctactaaaa atacaaaaat taggccgggt gcggtggctc 2160acacctgtaa tcccagcact ttgggaggcc gaggcaggca gatcacaagg tcaggagatc 2220gagaccatcc tggctaacac ggtgaaaccc catctctact aaaaatgcaa aaaattagcc 2280gggcatggtg gcaggcgcct atagtcccag ctactcggaa ggctgaggca ggagaatggc 2340ttgaacccag gaggtggagc ttgcagtgag ccgagatggc gctgctgcac tccaacctgg 2400gcgagagtgc gagactctgt ctcaaaaaaa taaacaaata aaaaataaaa aaattaacca 2460ggcatggtgg cgcatacctg tagtcccagc tacttgggag gctgggacag tagaatcgct 2520tgaactcggg aggtggaggt tgcagtgagc tgagatcacc cactgaactc cagcctgggc 2580aacagagcaa gactctgtct ccaaaaaaaa aaaatgtatt tttctttgaa gcttttctac 2640ttttaaatgt aatgtatagt attataacaa gtgaacaaaa tgatacaaag aagtatggcg 2700ggaaaggtgt ggtagagatg ggaaaacata tttcctccag cctcttaggt tcattggagg 2760agcttgggaa ttcaactgac acacgacaga tttacaggag aaaagtttta tttcaagtac 2820acatgagagc ttcatagaaa agaagtgaag acctaaagaa acagactgga gagttcatat 2880gccattttaa taaaggataa tgtattagtc tgttctcatg ctgctaataa atacataccc 2940aagactgggt aatttataaa gaaaaagagg tttaatcgac tcacaattgc acatggct 29983652076DNAHomo sapiens 365cttgcatagt ttgcttctgg tatgttaaag tgtgctctct ctaagtgggt agtaattagg 60aacaatttat ctcaacctca tttattgaat gttttaaatc aagagaacgg actctgttat 120attaagcttc tatatataat tgtctgtttc actgtaatgc ctagtaagga tacacttcat 180tcttttttta gatgttcttt cacaatttca tgtaaatttt agttgttttg 
tttcaaaaaa 240caattcctat tgaacaatct ctaggaatag atagcttaat aataatatta gatctagttt 300tctcttttca tagtttacct cttctttctt ttctttcttt tttttttttt tttgagacgg 360agtctcgctg tgtcgcccag gctggagtgc agtggtgcga tctctgctca cagcaagctc 420cgcctcccag gttcgcgcca ttctcctgcc tcagcctccc aactagctgg gactacaggt 480gccccccacc actcctggct aatttttttt tttttttttt tttttttttt tttgtatttt 540tagtagagac agggtttcat tgtgttagcc aggatggtct caatctcctg acctcgtgat 600ccgcccacct cggcctccca aagtggatta caggcgtgag ccaccgcgcc cagcctctgt 660ctctcttttc tttttctttt tcttttcttt tcttttcttt tcttttcttt tcttttcttt 720tcttttcttt tctttccttt cctttccttt cctttccttt cctttccttt ccttttcttt 780tctttctttt cttttcttct ctcttctctt ctcttctctt ctcttctctg tctttttttg 840acgagtctca gtatgtcacc taggctggag tacagttgca caatgttggc tcattgcaac 900ctctgcctcc cttgttcaag tgattgtcct gcctcagcct gccaaatagc tgggactaca 960ggtgcgcact gctacgcccg gctaattttg tatttttagt agagatgggg tttcaccatg 1020ttggccaagc cggtctcaaa ctcctgacct caagagatcc acctgcctcg gcctcccaaa 1080gtgctgggat tacaagtatg agccacgatg ccagtccaat tcttgtgtag ttttttaatc 1140agctgaattt aacattcaaa ttcttctttt aaatcttcca ataggcagtt atctttataa 1200agatcctata taatcaagac tttgtttctg aatattttat gtatgttttt gctactgtaa 1260atgagatcta tttctcattg tggtttcttg ctgttattac tggtaagaat ttagtgaaac 1320aaagtactta agagtatgtc tttaaattgt gagattttga tgaactttta agaaataaaa 1380ttctttagtt tcttagagct ttttgagatt tctaaggtag atccttggtt tgggcaacat 1440ataactatta caagttttgc acattgaacg ttatttggta atttttagag aggacatttt 1500aaatgtttag gaaaaatata aataaaatgt agaatactat tgggggcata tacatcatca 1560gcactgtaac tgtttcatat gaatcatttt tgtacatata gaactctaaa gtcctaatga 1620acagaatttt acatttctat aaatagaaag tccttaatag ttgtgactga ataacttatg 1680gatagcaaat tatttaactg aaaacagtaa aatttaagtg ggaggaaata tttgctttat 1740aatttctgtc tttacccatt atttatagga ttttgtcact ttgttctgtt tgcaggtgga 1800aaaccatgaa ttccttgtaa aaccttcatt tgatcctaat ctcagtgaat taagagaaat 1860aatgaatgac ttggaaaaga agatgcagtc aacattaata agtgcagcca gagatcttgg 1920taagaatggg tcattggagg ttggaataat 
tcttttgtct atacactgta tagacaaaat 1980attgatgcca gaattatttt ataagttccc tgtccccaag atgatgactt cacatctctg 2040tcaaacagaa atcgcccaac aggcccttgt atgatg 20763661960DNAHomo sapiens 366aacagaaatc gcccaacagg cccttgtatg atgtcattta aacaagccct attttaaatg 60tcacctccac tggtaacagg atactcctag gaggatcacc aagcccaatt cttctaggag 120tagtgcattg attaggcttt ggggtttcca agcagttcat taatgtcact tttggaaaaa 180gtctgtcttt cataccagct tattaattcc ctatgggttc acacggtttt ttttcctgga 240ttttcatcaa acatgtgtaa ggtactcagt acaaagaagt ttagaaatcc agaacaaagc 300agtgtattta agtagtagta aacttccaga taatctgatg cccatatcta catatataaa 360aaatttgcaa atagttctgt agagagtcca aacatggagt agatccctaa ttaagagcct 420ttgcattaaa gtccaccttc ctcatttcat agctaaggat attgaggctc agagagttta 480tgtgtctgga gttaaagtta ttttgtgttt ccttaatttt tgacttacta gaaagttaaa 540gtacctacag atttctgtgt ttcactatat gttaacttgc ttggctggaa gtttttctgc 600tgataattgg ttttatgaag gaagaatcct gttaagaatg catcattgga ctgggtgtgg 660tggctcacgc ctgtagtgat cctagcagtt tgagagaccg aggtgggcag attgcttgag 720tccaggagtt tgacactaac ctgggcaaca tgatgaaacc ctgtctctac aacaaataca 780aaaattggcc atacatggtg gcacgcacct gtggtcccag ctactcagga ggctgaggtg 840agaggatcac ttgagccagg gaggttgagg ctataatgag ccataattgc actactgcac 900tccagcctgg gtgacagggt gagatcctgt ctcaaaataa gaaaagagaa tgcatcattg 960gccaggcaca gtgactcatg cctataatcc caatacttta ggaggatcac ttcagcccag 1020gagttcaaga ctagcctgtg ccacatagac cacatttcta ccaaaaatca aaaggaaaaa 1080acttgctggg tgtggtgatg cacacctgtg gtcccagcta ctcgggaggc tgaggtgaga 1140ggattgcttt agcttaggtg gttgaggctg cagtgagcca tgatagcacc actgcattcc 1200atccagcctg agggacggag tgagagcgac accttgtctt taaaaaaaaa aacagaggaa 1260tgcatcatag tatatattaa attattgcct atttttttat ctattttatt gagtgctaat 1320aagaaaatta atggcaaaaa cttgtttttt acagtataaa ttaagtttaa tttcatttta 1380aaattaagta aatttgtttt attaaaaagt atgttgaaag caacataaat agcactcaaa 1440ttgagacaga aactgtaact gtagtataag aagcattagg ctgggaattg ggaaacacga 1500gttctagttg cagcttggaa actttttctg aagctcttta caaattactt aatttctctg 1560gttttcacca 
cattgttcta tagcattaac atgttggatt cattgcttta attcttagac 1620ctacgtgtca tcagaaatgc cattacactt tgaggatttg agccttattt taaataaagt 1680tgtgatcctc atggcagcct aggtttacat gtgttaaata aacagtattc tgtaaatacc 1740attgtctttc atgtttagtg atgttgctgt tgttaacact gcagtgaaat gcatatataa 1800gcaaactaca ttacatactc atgaacatgg tcctttgttt tgaaactttg atcactgatt 1860gttcgcagtc tttcattgtg gaactactct ttcactttga atgttttgag aggttccttt 1920gttcagatca gtccgatttc gtttctgggt gggtctctac 19603673160DNAHomo sapiens 367agtccgattt cgtttctggg tgggtctcta ctttcccttt tctcactggt caagcgaggt 60ctgtctaatt gtttgctact actaacattt gatggccacg cttcagcaag tacatttgta 120gattctctct ctctgtctct cttaatttgt ggtctagaga tcatattggt taatgaaatt 180atgaagaggg aatgtattta taaaaactca aattcttgat gcagaaggtc tagctgattg 240tgaacccaaa atatccgaga caggtcacaa ccaatttaga aactttattt tgccaaggtt 300aaggatgcat ccatgacata gtctcacaag gttctaatga cacatgcgca aggtggttag 360ggtacagctt ggttttatac attttaggga gacatgagac atcagtcaac atgtgtaaga 420tgtacattga ttctatccag aaaggcagga caacttgaag caaggggctt tcaggtaata 480agtagataag agacaaaagg ttgcatactt ttgagtcctt gatcagcctt tcactgaata 540aacaagctta gtcttgttag tgaatctgcg tttttacata aacagtaggt cagaggaagc 600aatcagaaat gcatttgtgt caggtgagcc gagggatgac tttctgtccc tcacctgtga 660agataagcta tcagtttcca ttgctagggt gaaattcaac agaattgttt gagagtgaac 720atctggaggc ccacaaggac tttccttgtg gaggggaagt atgtagtgag ggaagtatgt 780agtttttaaa tctttgtcgc tatcttattt agaaataaga tggaaggcag gtttgtctga 840catagttccc agcttgactt ttccctcggc ttagtgattt tgcggttccg agatttattt 900tcctttcaca tatcagtcag atcatttggt ttgtgaagtt tcctatgctt aacagaaaat 960atgtgcacta gttttcctag agtttcattg tcagagtctc aagtttttgt ttggaaattg 1020tatttggtca cattaattat actctatgtt agttccaaag aaataccttt ggttaagaaa 1080agaattctca tgcataactc ctcgagggtg gggttacacc ttaatccatc ctcaggtgct 1140catggtaatt ggggcaaata tgttgcccag tgctggtgct ctgcagcctt ggatgggttt 1200acccagaaag cagctttcaa gtcagaaact aacattcata agggagttaa ggattttata 1260aatagatatc cataattcat gtagttttca agtaagtagt atttgaatct 
tttctggtta 1320gataataatt gtgagtatgt tgtcatataa taacagtatg tttttcacta tttaaataat 1380tttagaatta cattgaaaaa tggtagtagg tatttatgga atactttttc ttttcttctt 1440gattatcaag gcttggaccc tggcaaacag attaaactgg attccagtgc acagtttgga 1500tattactttc gtgtaacctg taaggaagaa aaagtccttc gtaacaataa aaactttagt 1560actgtagata tccagaagaa tggtgttaaa tttaccaaca ggtttgcaag tcgttattat 1620atttttaacc ctttattaat tccctaaatg ctctaacatg atgtgaatgt tctatgataa 1680gttttactaa tgtagtcatc aggtaagagt caagctttct tccatagagc agtcagctgt 1740cgcaacacca tttgttaaat agtccgtctg ttctccattg actgaagtgg tactttgggt 1800ctattttaaa gactctactt ttacctcgtc tcaccattct tttgtctaca caaaatatat 1860tttatcgctt attctgtgtt accatatcta ttagagctag ttccccctca tatctctgct 1920ttagttattt tcacatgttt cttttatctt tttttttttt ggagatggag tctcgctctg 1980ttgcccaggc tggagtgcag tggcatgatc tcggctcact gcaagctccg ccttccgggt 2040tcacgccatt ctcctgcctc agcctcccga gtagctggga ctacaggcgc ccgccactgc 2100gcccagctaa ttttttgtat ttttagtaga gacggggttt caccgtggtc tcgatctcct 2160gacctcgtga tccgcctgcc tctgcctccc aaagtactgg gattacaggt gtgagccacc 2220gcgcccagcc ttatcttttt tttttttccc cctgagacag agtcttgctg tgtcgcccag 2280gctggagtgc agtgacgcgc agtcttgact cactgcagcc tccacctccc ggattcaagc 2340gattctcatg cttcagcttc ctgagtagct aggattatag gcatgcacca ccacgcctag 2400ttcatttttg tatttttagt agagatgggt tttcaccatg ttggacaggc tggtctcgga 2460ctcctggcct caagtgatcc acctgcctca gcttcccaaa gtgctgagat tacaggtgtg 2520agccaccgtg cctgacccac atgtttattt tttctaagaa aactttacta tcatttatca 2580agttaagaaa attattctga tatttcaatt gggtgtttaa attagttgag ggaaatatga 2640ggccattcac tagatgatag gttttttttg ttttaatcat gtttcatgtt gaaacaaaaa 2700agttttttcc tgccagtttt ctggctaatc tcaggaagtc cctgaaacaa attattgata 2760agtaaaaaaa attatttaaa aaattttaaa ttatatttaa aatcttctgt gacttatggt 2820ggggggaggc taaagccttt ctccttctgt actgttctgg aaactatggc ctgttctact 2880ccctcccctc ctgaattttc ccagaacttt acaggtagct tttatatata tgatcccctg 2940tcgtctgttt aacaagtact ttgagtgtct attatatgca gacattctag gtgttcagac 3000accctagtaa ttagtttgtt 
cctcataatt ctcagtaaag aagacatgta tatttctcat 3060tttataggtg aagaagctaa gactttactt ttcctcagtt agacagctag tgctggtggg 3120tgcctaaact tagatcttcc attgccaaat ctaggtgtgt 31603682985DNAHomo sapiens 368tccattgcca aatctaggtg tgttgttttt ccagcacact agaatcctcc tggttcaaga 60aatgtatata ttttagcttg gataagatac aacttttgga gtgttctaat catcttcaag 120tttttcgtgg attagttata acatatgaaa aaagataggg ctgaatgggc cacatgatgc 180caaaagtgaa aaagtcactc actagattat gacctgcaga atctggtcct tgcctgcctc 240tgcttttata ttttgcagct tgtcccttca cacagtggtc tcacttttat aatgtctttc 300cctcatgcat ttctttaatt ctttttattt gcctgttcca tagtagtctg tttgtgctgc 360tacttgcccc tgtactgttc ttgagctata catatacatg tctgctgtgc cattgagtga 420ttccatcaag gccacaatta tcatcttgat gaactgattt tctcccactg ctgataatta 480cttctctctc ctttctttct cctttacatc acctcttttt gttcttaatt tcattccctc 540cttgatgcca gtgagtattt ttttcttatt ttattctcat cttccttgag tattgtttat 600ttcaacctct tttttttttt ttttttttgg agaagggttt ggctttgtcg ctcaggctgg 660agtgcagtgg cacaattttg gcccactgca acctccacct cctgggctca agccatccca 720cctcagccac ccaagtagct gggactacag gtgttgccca ctgctttgta tttttaatag 780acacaggatt tccccatgtt gctcaggctg gtctcgaact cctgggctca agcagtccac 840ctgccttgcc ctcccaaagt tctgggatta caggattaca gatgctgtgc ccggcccaac 900ctctaatttt aattttctct tcaaattgtt caataagatt tagtttcaag acattttcct 960ggccgggcat ggtggcttac gcctataatt tcaacacttt gggaggccga ggcaggtgga 1020tcacttgagg tcaagagttc aagaccagcc tggccagcgt ggtgaaaccc catctctact 1080aaaaaataca aaaattagcc gggtgtggtg gtacatgcct gtaatcgtag ctattgtgga 1140ggccgaggca tgagaatcgc ttgagcccgg gaagcagagg ttgcagtgag ttgagatgac 1200accactgaaa tccagcccgg gcaacagagt cagactacgt ctcaaaaaaa acaaaacaag 1260ctgggcgccg tggctcacgc ctgtaatccc agcactttgg gaggccgagg ccggtggatc 1320acgaggtcag gagatcgaga ccatcctggc taacacggtg gtgaaaccct acctctagta 1380aaaatataaa acattagccg ggcgtagtgg ttggtgcctg tagtcccagc tactcaggag 1440gctgaggcag gagaatggtg tgaagccggg aggcagaggt tgcagtgagc ctagatcgcg 1500ccactgcact ttagcctggg tgacagaaca agactccgtc tcaaaaaaaa aaccattttt 
1560cttattttga aaacttttgg tattgaaaga tatttatact acagtaatga gaaatactgt 1620gtgtgtgtat atatgtttgt gttttttttt ttgttttttt ctttctctct ctctcttttt 1680tttttttttg acagagtttt gctcctgttg tccaggctgg agtgcagtgg tgctatctcg 1740actcaccaca acctctgcct cccgggttca agtgattctc ctccctcagc ctcccgaata 1800gctgggatta caggaatgtg ccaccacacc taactttgta tttttagtag agacgggttt 1860tccccatgtt ggtcaggctg gtcttgaact cctgacctca ggtgatccac ctgcctcggc 1920ctcccaaagt gctgggatta caggcaccct gcctgtgttt gtgttttaaa aggggtaata 1980gcttcagtct tttttttctt tctctgagac ggagttttag ttttgttgcc caggctggaa 2040tgcaatggtg tgttcttggc tcaccacaac ctccatttcc tgggttcaag cgattctcct 2100gcctcagcct cctgagaagc tgggattaca agcacgcgcc accatgctgg gctaattttt 2160gtatttttag tagagacggg gtttctccat gttggtcagg ctggtctcga actcctgacc 2220tcaggcaatc caccgacctc aggtgatcca cccgcctcag cctcccaaag ttctggggtt 2280acaggcgtga gccaccacgc ccggctgtct tcaatcttaa ataaggattc catttaaata 2340ttttgtaaaa ggacacagat cacagtttta ctcaggggaa tataattgtt atagcaggaa 2400ttgtgccatt gcgctattcc aaacagtgta aaagaacatt aataaattga attctaacta 2460catttgtccc taaggagttg ttcgttttcc acttgtattt ccattttaat tatcattatt 2520tggatgtttc ataggatact ttggatatgt ttcacgtagt acacattgct tctagtacac 2580attttaatat ttttaataaa actgttattt cgatttgcag caaattgact tctttaaatg 2640aagagtatac caaaaataaa acagaatatg aagaagccca ggatgccatt gttaaagaaa 2700ttgtcaatat ttcttcaggt aaacttaata gaactaataa
tgttctgaat gtcacctggc 2760ttttggtaac agaagaaaaa tcatgatatt tgaagtgtgt tttgttattt tcgcaagcca 2820ttacattctg actatttaat atgttaggtt tcctatataa aataaggcat ggtatgttac 2880agtaggacac ataactggaa gttactcttg cacatagaaa caaaaaatgg cagaaaagca 2940caaaacttac tatagttgta acagggaaag gaaacactag ggcct 29853692138DNAHomo sapiens 369aggaaacact agggcctaca acgtactaat gtcttgggtc atctatgggc tcatgaggct 60ctaggttatg gaagtaaata ccactgaaaa gcaaatatta attacacatg aggcaagcct 120ttttgagttc tgtatgtcat tttgtagatt ttgagttcat tctagtggca ccatttgaga 180tcattttcat gtaattaaag gaacacagca acctggcact gtgttattgc ccttagaatg 240gaatgaatat atgtttagca caaggtagga agtgatgcgt taagttggaa ggctttgccg 300atcatggtgt gtatgttgac taacctttat tgtgccttta aaaaatatac tcaagaacta 360ccttaaccaa gtaattaaag tcaagattac cagttgtggg acaaatgaca tgtacttcct 420ggtgtgatat agaaggaagg acacagtatc acctatatag tattcttgac cagaatattt 480aacctgattt taaacaagaa gtaaaaattc aaataaattt agattgtggt gcattcaagg 540cctgaacttt aataaatgtc catgtcacgg cagcaaaaaa gaaatcaaca ggtcttaaag 600agacagggca accaaacgca gtaggcagta gttgattaga tcccaattta gaggttggag 660ttggggaata gctatagagg acactattgg ggcgaattga gaaagtttaa tatgagacaa 720tatggtgtta gtgtcagatt tcttgtgtga aatggtagtg ttatgattag gagaatgtcc 780ttgttctcag gatatgcatg ctaaattatt taaggacaaa tattttttta aaaggttatg 840tgcatgagta attctataaa ttgtgttgct attatgaatt gtcatggtaa atcaaaagga 900aacataaaac tcaaaaggtt ttattttaat acactttatg tattgaaatg aatggaattg 960atttgtaaag attacatttt tgcttgttgg tgtcagataa ctgtgacgta ataatctttt 1020gctgaattat gtttcttagg ctagatttca ttttaaagaa ccctgtaaat accatttatt 1080tgaactgtgg atcttcctta aaaaataata tttattaagc acctagcagg gtaaagtttt 1140tagattttaa catttaaatt gaaggtttta tattagaagt caacctgaat ttaaatgaaa 1200cttcttcttg gtctgatatt acatattatg agctattttt atttaaaaat gtaatggcgg 1260ccagacatgg tgattcacac ctgtaatccc agcactttgg gaggctgagc tgggaggatt 1320gcttaagccc agaagtttga gaccagccta gccaacatag ggggacccca actctacaaa 1380aaaatccaaa aaatattagc cggctgtggt ggtacatgcc tgtagtccca gctactcagg 1440aggctgaggc 
aggagaatca cttgaaccca ggaggtcgag ggtgtggtga gccataatta 1500tgctactgta cttcagcctg ggcgacagag caagactccc atctcaaaaa gtgtaatgga 1560tcactttaat aattttctat catacaatta agtcataaaa ggtcatgcta ttaagagcca 1620gttatgtgac atgccaagta tagactctta attaagatgc tttggtttgc tttttattta 1680tttatttatt tttcagatgg ggtcttacca tgttgcccag gctttagtgc agtgatgcga 1740tcatgactca ctgcagcctc aacctcctag gttcaaggga ttctccccac ttagcctccc 1800aagtagcttg ggactactac atgtagtagt gccaccacac ctggttaatt tttttttaat 1860tatcttttgt ggagatgaag tctcactctg ttgcccaggc cagactcaag cagtcttcct 1920gccttggcct ccgaaagtgt tgggattaca ggcgtgagcc accctgccca gcctagtttt 1980ctttttttta ctataaactt attcttgtca gtatgctagc aattttacaa gttttaaagt 2040agttatagca agtacttcac tcatgtttaa ttcttaaagg cttctattgc tatataatag 2100ggtagtctga attcttcaaa agtgtactga ggccaggt 21383702491DNAHomo sapiens 370gggattctcc ccacttagcc tcccaagtag cttgggacta ctacatgtag tagtgccacc 60acacctggtt aatttttttt taattatctt ttgtggagat gaagtctcac tctgttgccc 120aggccagact caagcagtct tcctgccttg gcctccgaaa gtgttgggat tacaggcgtg 180agccaccctg cccagcctag ttttcttttt tttactataa acttattctt gtcagtatgc 240tagcaatttt acaagtttta aagtagttat agcaagtact tcactcatgt ttaattctta 300aaggcttcta ttgctatata atagggtagt ctgaattctt caaaagtgta ctgaggccag 360gtgcagtagc tcacacctat aatcccagca ttttgggagg ccgaggcggg tggatcacct 420gaggtcagga gttcgaaact ggcctaacca acatgttgaa accctgtctt tactaaaagt 480acaaaaatta gctgggtatg gtggcaggtg cctgtaatcc cagctactca ggaggctgag 540gcaggagaat cgcttgaacc caggaggcgg aggttgcagt gagccaagat cacaccattg 600cactccagcc tgggcgacag agcaagactc tgtctccaaa aaaaaaaaaa aaaaaaaaaa 660aaaaaagtat actgaaacag aggaagataa ttaggtctgc ttggccattg ttaagttgat 720ttttattttc aaaacatttg atcactgttg tggggaacaa gggaataaaa aataagttaa 780atttccagcc cctagattaa actaataatt tttggttttc ctagaattaa atgcttttat 840cttgaatgtt ctgtgaagct tttgacatga ttgatagctg tatgatagtc tgaatgacat 900gtgggtcatg caccagcccc tccaacctgt taacatttag aatctattca gaaaaattta 960agcattgtta atttcctttg ttttttgtct agcatgtgtc agattttttt aaatgtattt 
1020attaatagct tttaatgtta atactctaga acagtagaat cttgaaaatg ttttaagtga 1080caattagaga tttaaattta tgctgacatc ctctgcatgt gatactgatg aggaaagaaa 1140gccaaactgt cttacggtca gttcgtacaa tataccaggc cttgatggtc acatttcaac 1200ttgctacctt tttgcttaca tttttcttat ggtgattttg aggtgtcatt ctggtttctc 1260agatacttaa aatataggaa aaggtgtgtc ttaaaattga gagaatgtct tggataagca 1320gctgtgtagt tttatatttt gctgataagg gaaggtactc tatttttgtt ttttgtgtgt 1380ttttgtttgt ttgtttttga gacagaattg cccaggctgg agtgctgtgg cgcaatctca 1440gcttactgca acttccacct tctgggttca tgcaattctg gtgcctcagc ctcccaagta 1500tctgggttta cagacatgca ccaccatacc tggctaattt ttgtattttt ggtagagatg 1560gggtttcgcc gtgttaccag gctggtcttg aattcctggc cccatgtgat cccccggcct 1620catgcgatct gcccgcctca gcctccctaa gtgctgggat tataggcgtg agccacccaa 1680cccagccagt actctgtttt tgatagctat tcacaatggg aaaggatgta gcaacacatt 1740ttaaccctat gttgagtttt aggtgggttc ctttgaaatt ttgttaaggc taacttttgt 1800taattttttt aaaaaagtgt aaattaggaa atgggttttg aattcccaaa tggggggatt 1860aaatgtattt ttacggctta tatctgttta ttattcagta ttcctgtgta cattttctgt 1920ttttattttt atacaggcta tgtagaacca atgcagacac tcaatgatgt gttagctcag 1980ctagatgctg ttgtcagctt tgctcacgtg tcaaatggag cacctgttcc atatgtacga 2040ccagccattt tggagaaagg acaaggaaga attatattaa aagcatccag gcatgcttgt 2100gttgaagttc aagatgaaat tgcatttatt cctaatgacg tatactttga aaaagataaa 2160cagatgttcc acatcattac tggtaaaaaa cctggttttt gggctttgtg ggggtaacgt 2220tttgtttttt tttttttttt tttaatcttg gagtagaaat atatttaaaa ttgatggaga 2280aaattcccag ttcttaacat tagaaaggga atatattatt cttaccagtt agtaatctat 2340tcacatttgg tttagaggga agatttagaa ggtgagataa aagcttgtga gagaatagtg 2400tattcatgtg aaacttcttc catgggttca gagcatttag aaacaaacat cccttcacac 2460tcaaagctta cctttgagcc agtcctccaa t 24913713126DNAHomo sapiens 371cttacctttg agccagtcct ccaatagtga ggtctttgaa ggtcaggcca aattggctgt 60gggaggacct caggttagga taggaattat tttaagacat ggcactatat tcatgtgaaa 120ctcgcaaaaa ctagccttgc atataggctc atgtatcatg tctcagctga gatgtttgag 180agatcttaac tagattctag aaaacaaaaa aggaagtagt 
tttggggcaa atatatttgg 240gaaacagttt attgtatttc ctttccccaa atggattttc aagttcttca tataatctaa 300ccccaacaaa taaattgcct gtttttcaaa agaaagatca tgtcttcagg tttttgtgtg 360gggtttaaat gattcgaaag atttgaccat actgatacat tcactagtaa ccttagttac 420taatgagtaa tggttttgag ttaatcagtt aggcctgaac tacttttctg gaagttagta 480aattatctca caggcagccc tgtgagccat gggaaaatgt gtatatggtc tttctaggcc 540acagtcaaat tacaggtata tttgtcatgg cttctcttga tgaaaggccc agtatcggtt 600tgtctgaaga tatataatag cattgctttt gggggtaata tgggcagtaa ctctgtccac 660atctttgggc aggctgtggt tctgccttta tatgctatgt cagtgtaaac ctacgcgatt 720aatcatcagt gtacagttta ggactaacaa tccatttatt agtagcagaa agaagtttaa 780aatcttgctt tctgatataa tttgttttgt aggccccaat atgggaggta aatcaacata 840tattcgacaa actggggtga tagtactcat ggcccaaatt gggtgttttg tgccatgtga 900gtcagcagaa gtgtccattg tggactgcat cttagcccga gtaggggctg gtgacagtca 960attgaaagga gtctccacgt tcatggctga aatgttggaa actgcttcta tcctcaggta 1020agtgcatctc ctagtccctt gaagatagaa atgtatgtct ctgtcctgtg agaaggaaaa 1080gtatatttgc agattctcat gtaaaaacat ctgagaatgt ttgtcttagt ttaatagttg 1140ttttcctgtg gactttatat actttgtatt gtcttaaaag agtgattgat ggtagctacg 1200gaaaactttg atttttaaaa ttgtctcttt aagtagacaa tttataagct actggtacga 1260gttcacctta taaatctcca ctaccatgtt tttgcttgga ctgttcacac ttcctggaat 1320ggtccttctt gccgtttatc caacttcttt ctaattttta agtccctaat gatgggaatt 1380ctatttctgt agtgattttt ctggtcatac gaccgtaagg tcatgggtgt ttttctctga 1440attcctcttg agatgcctgt aacttgaacc acgtttttat tctagacatt actgaaatgt 1500tttgtcttta tttcactttt taggagcttc cttgaaggta gggactatac cttctatttc 1560ttggtatctt tttctttctt tttttaaaag ttttttagag agacagggtc tcactctttt 1620gcccagactg gtctcgaact cctgggctca ggtgatcttc ctgccttggc ttcccagagt 1680gctgggatta caggcatgaa ccaccgtgat cctccttatt tcttagtatc ttctaaagaa 1740cattaaatat agtaggtgcc tagtaaatta tgtattgatt taacttcttt gaggttctgt 1800tgtttgtgaa gaattataaa agcaatacaa atgtttgtat agtaattaag caacaggtta 1860atattcatga cttaaaagat taaagaaata agcaaaacat gttagctggc aactcacaga 1920aaaagaatta aattgccaat 
gagcacacga gcacatgaaa aattagcaaa agtttcaccc 1980ctttacatat atttggttaa aattgagaaa agaatagtaa tagatggtat tggtaggact 2040gtggcaggca cacaatttac atgaccacca aaagtgtatg caggtatcca tgtcaccaca 2100ccctggtctc atcttcattc agttttattt atttttttta atctcggcct atttgattgg 2160cacgaaatga atgatagctg ccttatttgg aattcctttg attactacta gtgtgcttga 2220taatgtaaaa caatattcaa aatctgtttt tcctttcatc cgttgtttgt tcatgttcat 2280gacctttttt tttttttcct attctcctcc ctccctccct ccctccctcc cttccttcct 2340tccctccttc cctccttccc tccctccctc ccacacaaag gtgtgtgcta ccatacctgg 2400ctagttttta attttttttt tttttttttt ttttagaggc aaggtctcac tatgttgctc 2460aggctggtct gggctcaagt gatcctccca cctccgcctt ccaaagtgct gggattacag 2520acgtgagcca tcatgcctgg cccttgccca tttttctatt gaagttttag tgctttttat 2580tgactttgtt tatatattaa gataatccat tatgtttgtg gcatatcctt cccaatgtat 2640tgtcttaatt ttgtttttgt atgtgtatgt taccacattt tatgtgatgg gaaatttcat 2700gtaattatgt gcttcaggtc tgcaaccaaa gattcattaa taatcataga tgaattggga 2760agaggaactt ctacctacga tggatttggg ttagcatggg ctatatcaga atacattgca 2820acaaagattg gtgctttttg catgtttgca acccattttc atgaacttac tgccttggcc 2880aatcagatac caactgttaa taatctacat gtcacagcac tcaccactga agagacctta 2940actatgcttt atcaggtgaa gaaaggtatg tactattgga gtactctaaa ttcagaactt 3000ggtaatggga aacttactac ccttgaaatc atcagtaatt gccttattct aagttagtat 3060aaattattga tgttgttata gaacccattt accccttaat tcacagtctg ggggtaggaa 3120catgta 31263722920DNAHomo sapiens 372ttctgcatca gttggttgca catgagtgag ataatcttgg ttctttatcc tttgttattt 60gtacttcatt gggaatcctt ttgagttagt atatttgagt cattattatt attgctgtag 120aattcaggaa cttttagtag atctggcagc ataaaatttt gcttttaaat cattgtttgt 180gttttgtatg ctatagaaat gggttcagaa tattttttaa aaggccagat gaagtgtgaa 240gatagaaaaa cttcatcctt cactgtgaat gtttaacaaa catttgcttc tactttattt 300ttgtttgctt cctttagttg tgcaaagtat tcagttctag aatgcatgag atatatgaca 360aagccaaaaa attctttata gttgataaat aattgtggca aaaacagctg tatagtaact 420ttgcaagcat catttgatta aatgcttaaa aagtcttgac tcagttttaa ctatttcctg 480caaataatca atatttaatt aaagctactc 
caaattagtg acactttacg tgtctgtctt 540tctccctccc cttctccctt ctcccttccc ccttctccca ttctcccatt ctcccttctc 600tcttcttcct ttcctcttcc cttcccttcc cctttccctt cccccttccc tcttctcttc 660ccctccccct tcccatcccc catcccttcc cttcccccat ccctttcctt tccccttccc 720ttccctcctc ttcctccttc ccttccccct tcctccttcc cttccccctt cctccttccc 780tttcctcttc cctttcccct tcccttcccc cttcccttcc ctcttccctt ccccttcccc 840tttcccctcc ccctctcctc ccctccctta ccttcccatg aaatgagaaa gcctcagaga 900tagtggcttg attaattttt ctttagatta agatatttgt ctaagccttt aaggtttatc 960tattgagctt ttttgtctcc tatttttatt tttcctacta tgtttgtcga ggataaaata 1020cagcactgtg tgccaagtca taatcacttt tcatttgaga cttaattaaa atgcctttat 1080tttaatgata tatttggcta atgtatttga agtaatccga aattaagttt tctaatgaca 1140aggtgagaag gataaattcc atttacataa attgctgtct cttctcatgc tgtcccctca 1200cgcttcccca aatttcttat aggtgtctgt gatcaaagtt ttgggattca tgttgcagag 1260cttgctaatt tccctaagca tgtaatagag tgtgctaaac agaaagccct ggaacttgag 1320gagtttcagt atattggaga atcgcaagga tatgatatca tggaaccagc agcaaagaag 1380tgctatctgg aaagagaggt ttgtcagttt gttttcatag tttaacttag cttctctatt 1440attacataaa caggacacta agatgaaggt tttttgttgt tgtttgtttt cctctgtgtt 1500tctagtgctt attttttaat cagttttttt gatggcaaag aatctatctc tgtgttattt 1560tgatttctgc agtatataca tctgcatgat caatattcga tttcaagtac caaagtagga 1620gtaaaggaat attaacctag gtttaaaatt agtcatttca ctaaaattag ttattatgga 1680cgatagatgt ctaggtatat ctttgttcat aaacgaatat atcaagttca gttattaaat 1740tacacattag gtaagaaaag gacaaagaaa taaaaaagca tgattcataa ttcctgccct 1800ctatttgtct agaatttagt tgggaagata agaataacga acgtgacaca gagaataaag 1860tggcatatga caaatattta ttcaagaaag ctatatgtgg acgggatgtt tcagttctca 1920tgggagaagt ggattttatg gtgcctttga gtaatgggtc atatttgggc gttcacacag 1980aaagacccaa gcatatgcct aattttttat tattattatt ttttatttat ttatttattt 2040tttagacgga gtctcgctct gtcgcccagg ctggagagca ggggcgcgcg atctcggctc 2100actgcaaact ctgcctcctg ggttcacacc attctcctgc ctcaggctcc cgagcagctg 2160ggactacagg cgcctgccac cacgcccggc taaatttttt gtatttttta gtagagatgg 2220ggtttcaccg 
tgttagccag gatggtctcg atctcctgac ctcatgatct gcctgccttg 2280gcctcccaaa gtgccgggat tacaggagtg ggccactgtg cccggccctt tttttttttt 2340ttttttttaa attagaggat tactagttct cttcaattat aaaaataaaa gaatcttatt 2400tcactgcctg gtcctggaaa catgtactgc aatatacatt gtgacaactt tttacctgtc 2460atgtttttag cttttacctg tgaatgtctt atcattgttc ttatctgaag gatagatagt 2520tgctacaata ataatagatg gtgtgtatgg tttttgagcc taaaaagtgt agttttatct 2580gttgtaccta tacaagcagg agaaatataa cttgttaata attttaggta tggcaggctg 2640ccatcctaaa tatgaagtgg tctttgtatt tgcactttaa tgtgttgaaa tcatagcttt 2700cagtgatcca ggattaggca gactctttta tgcaatctct tgtttccagt tagaatagaa 2760gtcgtgtact tttgataaca ttaattataa tatattttga gccctgtgag gttggtaaca 2820ttattcccat tttatgaatg aggaatgtgt gttaaggagt ttgcccaaga gtcacatagc 2880aagtcatagt catgctctct gaagcagcaa taacttggca 29203733092DNAHomo sapiens 373gccctgtgag gttggtaaca ttattcccat tttatgaatg aggaatgtgt gttaaggagt 60ttgcccaaga gtcacatagc aagtcatagt catgctctct gaagcagcaa taacttggca 120ataaaataaa aatgaagcat cttctgtatg tgttaacttt tcagtgactg tttatgcctt 180ccagtattct ttgtaaacct tgaattcttt ttttcacaga tgattaaagt ttatcaattg 240taaaggtgga ggaatttggg aactagacag tgcacacata aataataaat atgttcttca 300aatattgggt gggctaatgt gggaggagtt tgagaccagc ctgggcaaca tagtgagacc 360ctcgtctcta aaaatatgaa aaataaaaaa aaaatttttt aaatgtgtga tatgtttaga 420tggaaatgaa acaatttgtc actgtctaac atgactttta gaaaagatat tttaattact 480aatgggacat tcacatgtgt ttcagcaagg tgaaaaaatt attcaggagt tcctgtccaa 540ggtgaaacaa atgcccttta ctgaaatgtc agaagaaaac atcacaataa agttaaaaca 600gctaaaagct gaagtaatag caaagaataa tagctttgta aatgaaatca tttcacgaat 660aaaagttact acgtgaaaaa tcccagtaat ggaatgaagg taatattgat aagctattgt 720ctgtaatagt tttatattgt tttatattaa ccctttttcc atagtgttaa ctgtcagtgc 780ccatgggcta tcaacttaat aagatattta gtaatatttt actttgagga cattttcaaa 840gatttttatt ttgaaaaatg agagctgtaa ctgaggactg tttgcaattg acataggcaa 900taataagtga tgtgctgaat tttataaata aaatcatgta gtttgtggaa tttgagatgc 960attgtagttc ttcgcagtgt gacttcaaat attttggaag aaacaaatag 
ctcagagacc 1020tcgtaaaata tcttaaactg gagggctcca tggagatcat tgcgagtgac tcccccagaa 1080tgtccatctg ttgacaggag ccaggctggc tgcatacgaa ttagctaagg agcttattat 1140atatccagag tcctaccgtg agcctccatc ccgtctgcca ttctcccatc cctggtctat 1200gataagactt agaaatctgg attttaacaa aacgtttcag attgagaacc ttgatttagt 1260ctacttctcc tattttacaa taaagagatg aagcggttaa gaattagcta atcctacgca 1320aagtgaggga aaaaggacag tctttttaat aaatgcggcg ggctggtggg gtatccatat 1380aggaagaaat gacattggac ccctactcca tgtcatatat aaaaacctcc actttgggag 1440gcgaagcagg caatcacttg aactcaggag atcaagacca gcctggacaa catgacgaaa 1500ccccatctct acaaaaataa atgcaaaaat tagccgggca tagtggtgct tgcctgtagt 1560cccagctact caggaggctg aggtgggagg atcacgtgat ctgggagagg ttgaggttac 1620agtgagctgc actccatcct gggtaataca gtgataactg tgtctcaaac aaaacaaaac 1680aaatcacctt cagtgatttt tagaccaaat gtacaaggta atactctcaa ggttttaatg 1740ttttatagtt ctgcagaaga taacatagga aaatattttt atgtccttgg ctttgggaag 1800aatttaagtc acagaaaaac accatccata aagtttgact tatttagcta tttgaaatta 1860acaacttcta ttaaaaggca ccacaagtga aaagacatga atcgtaatgg aagaacatac 1920tggtacgtta taaaatatca aagagttggg catggtgtcc catgcttgta gtcccagcta 1980ctcaggaggc tgaggcagga ggatcacttg agcccagcag ttcaagtctc agcagttcaa 2040gtccagcctg ggcaatatag caagactgca tttcttttct tcttcttttt taatacctgg 2100aataaagaac tcctataaaa tcactaagaa aaggggtcac ttaagaatct cattaacaaa 2160aagaatttga atatttttcc aaggaagata tgcaaatgga ctgtaagcac atgaaaagat 2220gcagatcagg gaaatgcaag tcaaaaccac aatgagctac aacttcacac tgattacgat 2280agttaaaatc aaaaagtcag atggtaagta ctggcaagga agtggagaaa ttgaaactgt 2340catgcgctct tggtgcgaat gtaaaatggt gcagctgctt tggaaaacag tctggcagtt 2400cctcagacaa ttccactcca acgtatatcc aagtggaatc acaacatatg tccccacaaa 2460cttgtacata aatgtttata gcaggattat tcataatagc caaaaggtgg aaacaacccg 2520aatgtccatc agcagatgaa tgcataaatg aaacgtggtc tatccataca atggagtata 2580ttattgagcc attaaaggaa tgaagtactg gtacatggtg cagcttagat gaaccttgga 2640aacattgtgc taaatgaaag aagctggtta caagagtcaa cacgtatgat ttcattcatg 2700tgaaagttca gaatagagac 
agcagtagag acaaagtagc agttcagggt tggtgccagg 2760gaataggggg taggtggggt gaaagctaaa ggatacggtg tttctttgtg agatggaaat 2820tctaaaatag gtgatgttta tacatgtctg tgaatatact aaaaaccatt gaattgtaca 2880cattaaatgg atgaattgta taggaattat attttaataa agctatttaa aaaaatccag 2940acacttcacc caagaggaaa tctaagtggt ccataaacat gaaaaggtct ttaatcacca 3000gtcagaaaaa tgaaaatgaa aaccatgcca ggccacctcc caccaccata gtgacaagca 3060tttcaagtgt ggcagttcca gctgttgttg ag 30923742985DNAHomo sapiens 374ctctcaggtc aggcttctga caagccacaa tgtgggtgag cctttgtgca ctgcctgccc 60acctctcacc aggagccctc tctccccatg gcctcaaggt accagtgagg cttttttctg 120tctcagcctg gccataagca gccctctgca agagttccgt taccagtcat ttgcattgta 180gtataagtgg aaaccacaga atcgccttcc tccccagtta tttatacttc aagtcatatt 240gtagagagaa aatttctgtc agcaaaaatc tcaggaatcc tcctcatttc tatttgtatg 300gctttcaatc gttgacatga ttttttcaca tatgtcatct tctggggatg gattcgtata 360accctgcttc acttgcttcc ctgtgggagg ctcacttgct tctcgacagg ctctggaaga 420actaggcagt ctggtacatg gttgtgcaag aacccttgag ggggccttgg agtgtgtgct 480tgggccctgg aactcatgcc taggatggag ggctgagatt gccccttccc atccaccagg 540gagttgacaa gggggagaag aaacttcttg tgagcttgcg atgacttgtg gcacttgcat 600cagaccttgg agttccctgg ggagaggcac tcttgggtat gacactgtat agtgccacct 660gattgccatt tgacccagtt tggccctgga tccttgagca agagggctgg aaagaaagac 720aggcccactt tttgggacac
tattagggtc tgtagcattg gtggggagag aattccccca 780acccccaaaa gagctgaaaa tgagacacgc gtggaggggt gaaagtggag tgtggtcaac 840agtgtggtta cagagatgtg tgtcggggcc actcccactc accagggaga ctcatgaagc 900agaagggatg gggcacaatg tggcttccat aggcacacca agccacctgg agagcgcatc 960agccctttgg gtacccccaa gcggaaggag gttgggtctt tgggtctggg aactttggtg 1020cttgttctgg tgggaagggc agggagtcaa gaccagctgt gtcttccact gctcttcttg 1080tccactttgg ttactggcct ctgttggcat gaactgggga ggcagaggct acctacagac 1140gaggaactgt gtggagtgcg agtgtatgca gtaaagggtt agcttagctg acttgaggta 1200ctcacaccca tattccgaag aaaagactgg ccctcagcct gagcctccga aataatctct 1260aagcccttag aataccctgc tttgtattca aagagtatct ttgaatgctg aacttagaac 1320cactctagaa aatgtatgct aacaatgcga tttatgatga acacttgtct ttgttcccct 1380ggggccctgg gccacattgt atcagtttga gccctagagg gacagagaat gagaaactaa 1440gatcagtcat gcaggtgctc caggcctatg tgaccaacca ccaataaaaa ccctgaacat 1500caaggctcaa gtgagcaata cagctggtcc caacttacag tggttcaact tgtgagtttt 1560gcactctaca atgggtttat tgggacataa cccagtggag gaggatctgt acttcattca 1620catgtgttgt cacatcatta ctgggagaat taagcactgt ccacgtgaat ccactgggag 1680aggataactg gaagcttgca cctggcttct cctggattct gctctgtacg cctttttccc 1740ttgttaattt taatctgtat tctttcactg tagtaatcta caactataag cagaatagct 1800tttctgagtt ctgtgagtct ttctagtgaa tcattgaatc caaggtggtc ttggggacct 1860ctaacaaaag atgtctggac ctgaacttcc tgttgtttca aagatcctat agcaggctgt 1920cttaccaact ttcagcatca agaagctggt ggagagtggg ttagtttaaa aatgaaactg 1980gggagagaga tgaagccggg ggaagatgcc gtgaaatctc accttatagg cagcctctga 2040ttcacctgag ggtttttcct tgaatacttt ctgggtacaa gtatttgaga caggtgatgt 2100gctggtcact ttattctcag ctgcttgtgg cctagcccta acatgggcac tggaaacaat 2160gggggtaggg gttgatgatg gagaaatggg gagtaaaggg atttaaaact ttgaaaaact 2220gagctgtttc catgatttgt ctcttttgat tctcacaaaa cctttatgaa atatgtgctg 2280acattttaag ctctcactta tagtgagaaa agcaatcttc agcaaggtga tgacttgtcc 2340aagggaagac atggtcgccc ttgttccttg ggagattttg tgctcccagg ggaaagcata 2400agccctcagg agccatgatg agaacagctg tagaacagca agtgaacagg tgtgtatcag 
2460tcaggatagg caaggctaag ctgcagtaat aaataatccc cggatctcag tggcggaaca 2520ttgaggaggt ttatttcttc tttatacaaa tatgctgtgg atcaggatga ctctccaggc 2580aactgtctgt gggactgtcc aggtgggctt ggatcacctg gtgttgggcc ttgaagtcgg 2640taatggagag gacatgttag aagagaagga acttacaagc agtgggagtg cagcgccctt 2700ttgtggatag gggtcaaggc aatgctttcc aaggctatga cttggtgtgg tcgaaaaagt 2760caagcagtct tcactttttg ctgtggtccc agcaaatctg cttccaatcc aggcttctcc 2820catataaaaa gcctcctttg tgtacagtga gtgaactaga acagggagga gatgccagtg 2880gagcttggct tgctccttct gtggccagct ggcttgtttt accactgcct ttggggtaca 2940gtggcagctg tggcaaatct ctctggagtt tctctagcgg gagcg 29853753068DNAHomo sapiens 375agcgaagcac ctaaagcaca tgggtgcagg agcagccagg cctgcaccca tagacatggt 60acagagagga gcagggaagc ccgctgcctg cagacttcag gaggagagag gtaggggtgg 120tgcaggggag agggccttaa tgccttcagg gaaaggagtc aaagaggaat acccaggaga 180caactagact ttagaattct tggggccaga aacttgattc cacctctagt gctttctttt 240agatttcttt ctctctttac tttctttctt tctttctctt tctttctctc tctctctctc 300tctccctccc tccctccctc cctccctctc tctctctctc tctctctctc tctctccctc 360cctctctctc cctctctctc tctctctcct ttctctctct ctcttcttaa gactgggtct 420cgcagttggg cacagtggct catacctgta atcccagcac tttaggaggc tgaggtgggt 480acatcacatg aggccaggag ttcaagacca ggctgggcaa cactgtgaaa cccatctcta 540ctaaaaacac aaaaatttgc caggcatggt ggcagatgcc tgtaatccca gctactcagg 600aggctgaggc aggagaatcg cttgaacctg gcaggtggaa gttgcagtga gccgagattg 660cactactgca ctctagcctg ggtaacagaa caagactcta tctcaaaaaa aaaataaata 720aataaaataa aagggatacc gggtcttgct ctgtgtccta ggctggagta ccatggtgtg 780atcatggctc actgcagcct ccacctcccg ggttcaagca attctcctgt ctcagcctcc 840caagtgagta cctgggacca caggcatgtg ccaccatgcc tggctaattt ttaaattttt 900tgtagagatg aggtcttgat acgttgtcca ggctggtctt gaactcctgg gcccaagcag 960tcctcccact ttggcctcct gaagtgctgg gggtacaggc gtgagcctcc acctggccag 1020cctccagtgc ttttgcatcc ttcctgttaa cttgtgtagg aataaaacat tgtcacaata 1080agattttttt cctttttatt gttttgattt tttagccaat gagaaggaaa attccttatt 1140agggagggcg agggtgagga tatgtggggt 
ggggagaagc gaacgttcca agtttcgaaa 1200acagcgactc tctcttggac tctctagcca gtagaaacct ccctcccact ctcttgcccc 1260aagatctggt gcttagaaga gaatcaaggg aagttggaac ccagaagacg gagacagatt 1320gagggactgc tgtgaaatgt tggggtgttt ggtgaataat attagaagtt gggctggcag 1380agaccctgtc acataaacat taaatcaaca ctggagactg agcatttgtt agaaatgtaa 1440gcgggaatgg cagaaaactt gtttttaagg gaaagcatgt tacggcttat gttcagcctc 1500catcctctga aggcaaaagt tagcaaagtt gatgtatggc gttgcttttt ctgggaactt 1560tatctcgttt ggtggggttc ccatctctgt ctcccaggag ccaagacttt cccctccctc 1620tgctccagca gaagccagtc tcaggcaagg ctccctgtac ctcatttaca ctttggtgtg 1680aatatgttat tgtaacctct ctcctggagg tgtctgcatt ccaagactga acttttctgt 1740gaaagttact gtcactgtga aaggcagttc agcccccagg gattgaaaaa ggaaatcatt 1800ttgggtaagg ggacagttag tccagatttt ttcagttgca agtaaaccta actcagccag 1860taggcaaagg gggaaattgc tggtttgaac tggtgggaag aaagctgagg aaactcctac 1920acttggggga agaactgcag gtgcctggct gcagggaacg cagcgggggc tcaggaccag 1980gcagatgccc tgcctctgct tcccttggca cagtggcctc cttctccctt caagtaggca 2040gatgctgcct gtggcagagg acagcagctg attggcagcc cagcagggag gatgtggtag 2100acaggcactg agcatctctt ctaccctcct tctagagggc tatcctgtac tgttgaggct 2160aaaagactga aaaccacatt tcccagcctc tcttgcagct accaatctgg atgagagtta 2220gattctacac attagatgca ctttagcaag attttcaaaa gcagattgga gaaggagccc 2280atgcttctgc tggttttttt tgctggcaag tgaggggttc tgtttttcct ggagtgactt 2340tatcatggtg gcatctgaaa aaggctattt cttgatcaga gagacagcaa ccctctcagt 2400gacctagttc tgtgggtgtg tctctcctga gagttaatcc cagagctcaa actagagctc 2460aaccctagag tctcttcagg cttcccaggg gtgggggtgc atttaacagt ccaagttaaa 2520gagaaaataa aggccattaa agaccaaaca ttgagcactg agtgaaaaag ttttattgcc 2580aaacaggaaa cctgattcag gccagggtct tggaaggttg ttcaggatga gatgggggag 2640gtgaaatggg gtaggtcttt gaaaaccaac agattgcaaa ttctctgtcc catagcagga 2700aaccacagtc tctgatgtca gctggctgcc aacacgtcag ttgtatcagc attagctggc 2760tggaggtggc ctgctgtgtg cagatggtac ctggtgcagg attgtggtgt ccaggtgtct 2820ctccttagca cataagaccc tgtccgagga ctgtggcatg acgtgctgga gtcacgattc 
2880tgtcacccag tcaggtcatc agtgtcagag agctaggtgg ccaggttgga gttgattgcc 2940aatgataggt ctttttctgc ttaaatcagc tggactggat tctattgcat taacttgacc 3000ctgactcatg ccgccaggcc taatttataa accaagacaa gaaagggcta ctccaccccc 3060tccaattt 30683763117DNAHomo sapiens 376gtcacccagt caggtcatca gtgtcagaga gctaggtggc caggttggag ttgattgcca 60atgataggtc tttttctgct taaatcagct ggactggatt ctattgcatt aacttgaccc 120tgactcatgc cgccaggcct aatttataaa ccaagacaag aaagggctac tccaccccct 180ccaatttgtg taaggccagg ggacttcccc cccactcccc aacctgaggc atgcaccctc 240ccttagatca atggctgttt ctctgagaat gcggaaccgt gattaatcca gccttgatgg 300ggaggcagca ggaactgtag gcattctcac ttcacaccca tcccaatccc ctcccccttg 360ctgtcctctt gtacagagga ctgaaagcac aacactctct ccctccctcc cttataggtg 420gtgacgatca tgtgactctc ttctggtcaa tgagatgcag cagaaagtcc tagggaggtc 480taggaaaagt cctgttggga gagagcattt tttaccttct ccctgctact tcttgctact 540agtaacatgg atgtgagcct tggaggggta gctaccatct ggcacctggg gtggcaagcc 600aacatggaaa ggatggcaga gcgggaagga ggagccagcc ttaccgatgg catcactgtc 660actgcgctag ccccagacca cctgctccag agttctggtt atggtaatga aataaacctt 720gatttttatt ccttaaaact acccttcaat gggttttctg ttcattacag ttgaatgctt 780tcataactga tacaggaggg accctgtgat tggcagttcc actagactgc atggagatgg 840gtggagttat ctaaaagaac agagatagtg tccctagaag aaggggacag gaaagcatcc 900tgggtacaca aaagtcaagg ctccaggatc tgccctgggg gctatctcaa cacccctaca 960ctctcaccgc acgtatttgg tcagctatga atatgaccaa ctctcgtcgt ttatctctat 1020tcagtggaac acagcagcac tgtgacctgc ccacgagaag aaggattttt agaacttatc 1080ttagggcaat tttaggtaga ggagcagaca agatggtgta caggagaaac aggtctatta 1140accctggtat taatattaac tggctgccca gaataaatga agaatagctt attctttgcc 1200aggttgaaga tagaaaagga atgaagggcc ggagaagtac agctgggtga agcacagagc 1260agcctagtgc ttggcatggg actcagatct gaagcagcct ctccgggact tctctgagcc 1320tgcccctggt ggtatgactg tgatatccct gcttctatag ttggcaacca acatgtccta 1380gctcctagac catagagggc cagattcatg tctcattgac tgtgtaatct ctgtgtggcc 1440cagtacagag catgcacacc gtaggttctc acatatgttt gttgagtgaa tgaatacaat 1500accaaacgaa 
tggacaggac agagctgtgg gctagcagga aggatatctg gcttttgctt 1560gaattagcta gtgaattgct gtgtggcctc cttactgagc ctcatttccc tctgtctgca 1620gagtcaagca aatcttccat tttttgttcc cctgctgcca gagcatggca gagtaaatgt 1680gtgagttgaa gggagcaacc tcatgaggtt ttgctttgtg tcttaattac agccatttgt 1740ggaattaggc ttttaatata aatatttgtg tgcctgcgcc tgcatatatg tatttggacc 1800aatgctctca tgtgtgcaaa tacatgtatt ctaaagaaat ctgtccagaa ccccagcatc 1860tgtggtgtct gtggtgggag gggcttccat attacagaga gatgcccaca gtgcatgacg 1920ttacccgcac aggtgtgaca tcacagggta accaaatgct tttgccctgg gggtgggaga 1980gggatgggtg cacggtgaac agcaggtggg ggtctttcca taggggatga ggaagacaag 2040gccacttgga ggcagaggag accacagtgg ggcatgatgg ttggggaagg ccttttactt 2100ctgcccctta aggatgccct ggaattcagg ctttcggatc ccagagctct cattagagca 2160gccctgcgtt gtagactttt ctgcagtgac agaaatgttc tatatctgtg ctatccaata 2220tggtagccac aagttacatg tggctattga acacttgaaa tggggttagt gcaattgacg 2280agctgaaaat gtagtttaaa ttcacttaca tttaaatagc tgtgtgtggc ttgtggctgc 2340ctattggact gtgcagttct ggagaatggt actttacttg tccttgggga agcagaaaca 2400aatgaaaacg aggatctgga gctcatgaag tttctcatgg ggtggggtat gtgtgttgaa 2460gctgcacctt cagcaggaac ctggccagtc cttagtggag gacatttctt tccatcctgc 2520atccagatgg ctggtcctgc tcctcccagt ccatggagaa aaaagaattg aacaaactgt 2580ctaagctggg tcaggtactc tgcagatgtt tgctgagtat cgttcttgat ggaaatcccc 2640gtggaactcc tacattttct cctctcttct ccttcctttc agaacctcag agtgacagag 2700ccaaaagacc agtgcctcat tttgctgaca tggaaaagga aacttcgtgg gggaaagaga 2760tctgcttgca gtcggccaga gagacagaac cagggcagtg gtgagctctc atgacctggt 2820gtctgttgcc ttctggttaa gtttttcatt tgtaattcta caaacatccc ttctgtaaac 2880atttccctca aaatggagca ggaagctctc aaaaatggac cagaaagggg tcaggaatat 2940aactttctct gcccagattc caggacttac agtgagaaag cgccttctgg gaacttcaca 3000atggctaaag tgtgctaatg ggatgatgtg cccttgtaca cccactgcct ctgaactctg 3060ctctgcattg ctgagcaaac tacatttccc agaactcctt gttggattcc ttccaaa 31173773117DNAHomo sapiens 377tcccagaact ccttgttgga ttccttccaa acaggtttac cactgggaga gcctgttggt 60tggggagggc aggaagaggg 
aggaaagagg aagggactca cttcctgttt ccagctgaag 120tctaaatcaa tccactatca acaggtagct atcatactac cctcattgtc acccctcaga 180ggtcccactg cagctgcata atgtcccctc agtggcctga acatgagatg aacaacactc 240ttcttgggag taccagcctt gcttggttca tggccacttt tcctgattat cttgcagcta 300tattaggtca tgtgacaaag ttctggccag tggcaaggga acacaagtga taggtacaga 360tagaagtgtc tgatactaca tagattatgc ttgcactcac tcttaagaga gagacatgaa 420cttttaccaa cggaagccag tattattttg aacctctgtt agagtggctt gaatctgtat 480cctaacttgt atccctaatg tgtgacccat gaaaattagc caggcagcac cagttccaaa 540gaagctcaca ctcccctgcg gctgcttctg ccaaggtcac tgatatttcc ctttgctaaa 600tcttgtgggt gttttcttca gtccttgtct taatcactca gtggcacttg gcacttattc 660cttcttgaaa cccttgtttc ccttggcttt gtggcatcct gtgctcttgg ttttctccca 720tatctctgac cctctttcct tagtcttttt tcttcttcct cctgtccctt aaatgctggt 780tgtgatcctc tttttatctc attctacaca ctcacagcct gagtaattca caccatcttg 840atgctgagaa cttccaaaat gttggtctag cctgggtcat tgttatgagc tctagactca 900caaggccaat tgcttggtgg gaacccctcc cccatggtta tctcatgggt ccctgaagtc 960caacttctcc ttcattgaac tcatcacctc ttctgttcct cctcctgggt tcccaggctc 1020agtggtggca ccactgtcta cctggctgct tagcctgaga cctggctccg tcccaattcc 1080tctctctcag tcttatcatc cccatccagg caaatcattg attctgtgga cctactcttt 1140cgggtgtccc tcaaatctct ccacgtctct gtgttctcac tagcactacc ttggtccacc 1200ctgccatctg ctttcctcct ccactcctgc attctgagtc attttcggca gcacacgcat 1260ccttaaaacc cctccactgg cttgccagtg tcctcaggat taggcgaaaa gtctttgctt 1320tgttttacaa ggcccttcgc tatctggccc cctcattacc tcccttgctc tgcatgctcc 1380agtcctgcag aactacacac agttccccca acaaggccct gctctgttct tcccacacac 1440tgctcctctg cctgggccac tcttcctgct ccttgtcagc aggcttgctg ctctcaggct 1500cagcatggac agctgcttct gagagccttc tctgcctacc caggctgggt ggctgcctct 1560ctttggtgtg cccatggcag cccagaatgc ctggtggaca gggagccctc agcaggccgt 1620actgcagcgc cctgcccccg tcagcctcca ggagcctgga gtccagggac atcaagggcg 1680gtcctgtctt tctcaccctt gtctctccag cccctaacac aggggatgcc tgaccccaaa 1740ctagacgagt tacttgacct ctctgaccca agacaaaatg ggaggaaagt gccaaatttc 
1800caagattggc caggggatta aataagataa atatgcaagt ctcttatctg ggggtctggc 1860ttggtaaata taaagttctt ttttcttttc ttcctttttc tttttttttt ttctttcttt 1920ttgagacagg gtcttactct gtcaccaagg ctggagtgca gtggcatgat catggctcaa 1980tgaaacatcg acttcctggg ctcaggcgat cctcccacct cagccccctg agtctcttgg 2040actccaggcg tgcaccacca tgactggcta attttttgta tttttagtaa agacagggtt 2100tcgacatgtt gcctaggctg gtctcgaact cctaggctaa agtgatccac ttgtctcagc 2160ctcccaaagt gctgggatta tagacatgag ccaccatgcc cagctaaaag ttccttttta 2220aaatctgctt gttagataca ctcatagaaa ggtaactggc cacagaaggg agaggaatgg 2280cagtccatcc agggatcact ggagtgtcat atgaaatgtt ataggaatca caggccttag 2340aacttgaaag gaacccaagg atcatctagg ctactttatg caggtaaaac agccacctgt 2400gcccatcaca tagctggggc acagctggag accccaacag agaggagagc tgatgggtga 2460cgagaaatca ggcctctccg ccacggcagc ctagctaatg ggtcttggct ggaagctaac 2520aggaaggcct ctttccagaa acactgtaag ccagtgtttc tcagattgct gggtgtaatt 2580cataggcaga tcatgaaatc agtttaatag ctttgaccag cattaaccta tttatgccta 2640gcgttccctt attggaacac taagtctgtg agagttattt acatcctact gcttaaggtc 2700atcgccaaaa tctgattttt tacacaaaaa atttgcaacc tccagcataa atgggttaaa 2760acaagacaaa acaaaacaat accagaatgg aaaatagtgc atgatctgta cagtatagtt 2820gtagaaaact tcttgtttta tcatttgatg tcatgaaagt ccctgctgta gataaaagat 2880ggagcttgtg cttctgagtg gtcatgctca acagggtggg gagcccaggg gagtggggag 2940tgatcgtata gacagaggtg ggtggggcca gtgtgagcct gatggtcaat tacttctcat 3000ttctagggaa aattgaagga aaagaaggag ggggatgtgg aggggagaga aggcctcagt 3060agagtttgca ctattattag ggcaagtaag ctgcttctga aaagaagggg tttgcaa 31173783192DNAHomo sapiens 378ccccatgcag aagcaatagg gcagcctggt cccatatcct catgaaatgc ctcttataat 60tgtgacatct tgcaattgtg gaggacttta cacttttcgg agttcctagc ccctcactta 120tttctcgtaa gaccgctggg aggtgggggg atggtatcat catcccactt tagagatgag 180gaaacaggat cagagtgagc taaatgactg ccagatccaa aactagaatt cagacctcct 240agtttctaag tggacgctct ttctacacca ccataatgtg agtgttctgt gtttacaggg 300tgtattcaag tccatgactg cccattagaa tccccccaaa aaattccagg actggcctga 360gttgctcctt 
agaccaatga aatcagactc ctgggagtac ggcccgggcc tcgggatcct 420ttaaagctcc atttggagag cctcgggcac agccaggttg gatccatctc ccagtccccc 480agccttggct cagcctggcc aagctgccca ggaggtccct tggtgccctg ggctctgttt 540cactgttgtt ttgtagagca acttcccagt gatgctgcca ctgggcccca tcctaacagt 600gaagtccccc gggccctcct gagaggaggt gtgaactgga agatggggag gcaggcggct 660ctgacagaca gaaagcaaac agctcagagg ggtggcaggc tgcattttat tcatcgttaa 720tttaaacacc cttcaagtcc tctcttggaa tgctgctcag aaaaatagat gtattgtttg 780agaaaccctg caggcttgtc ccgcatgctc tagccccctc ctgagagaac agatagcata 840aaaaatgatt tgtaaagcaa gggggagctt ccttagggaa gaaggggaag gggaagaggg 900tttggggcca ggtccgagtg cagaaatcct caatgcatga gactagcgtg gaaggtgtag 960caattgtgct ctggggtgcc tgaaagtgcc agagctgctt caggggcaag agtccaggcc 1020ccaagtccat gctgatgagc ccaccctggg ggtcaggaat ggcctcagca ggccctccct 1080ccctccctct ccaccctaca aagtgaggag ccttgagtca ccaccagcac attatacaac 1140aatacaagaa ccctgcaaca gataaagccc cagcgcctct tctggactca gatgccctag 1200gctggctgtc tggctgtgct ttccagacag tgtgtatgtg gaattgtgct ttttgttttt 1260taagaatgta aaaagttaca gtaagatcga accacagggc ccgtcgctcc tatggtctct 1320gcctgactgg gctgccgtct gcctcagttc cccagaagct tctcctttgg ccatgagggc 1380tcagtcatcc ctcaccccag agtccacagg aagagggggt ctgctgggag gcctgtctga 1440aggacggagg atcctgggtc aatttagcag ctattttcca gggtttggct tgggtttgga 1500tgctggcttc tgtgtgaaac ctgaatacat gcaaattgta cataaaactc ccccaaggca 1560gagagggatt ttccaggccc tggtacatct ctagagagtt aaaaatggga aatctttctt 1620cttaaagtgg cccagactga gacttttcct tggggaaaag ggttagtagc tctttgtaag 1680gctggtgtgt atgtgtgtgt gtatatatat atacatatat gcatgatgct gtgcaaatgc 1740ccagggctgt ctggcatttt ccacaaaatg agagcctgag attgcctaag ccttctgatg 1800ccttctccag gcctggaggc actgcttcat tcagaggaca caaaggcctg accacctggc 1860tttagcaagc taggacaccc agggtggctt ctttaccttt ctcctcagct ctgagaaggc 1920tgctagccaa gactctggat tctctgtggc cacagtcata tggtgagggc ctcttggagt 1980tcattcaaac tttaagggag ccccacagca ccggcatgat gggtaagtcc aggcctaagg 2040ttaggaagca aatcctggag catgaggaaa ttgtaggcta cagtgagcta 
ccagtggtgt 2100gcaaactgga gacccccaag acagtgagag aggccacagc atctgaggga atggagctct 2160ttcttggcct gaggttcaga agaacctgca ccaaagaaag gcatccctat caatgtcact 2220gttcctgaaa tgatgggaga accacatccc tgcttcaggg aagcagtccc tgtcgtctgg 2280ggcgctgagc cctttggcct gagatgaagg atgatggtgt gatgtatcat ggcagtgtga 2340ctgagactgg attgggggat ggggacaggg gaacataggc aaaaatacac atgtgccact 2400ggatcctgag ctgccattgt accttggagg actggcgttt ctctgggaag ttgggaggtg 2460ggaagaggaa gggtctcatt ttcctgcccc ttgaaaccat gcttaccatt cctttagaag 2520attgctcaag ctgcctccaa ttgcctcttt ccaaaaccaa agcataggaa aacaagtaaa 2580aacagctgag gctgcagcat aagcaactta ggatagagtc taggaagcac cgccaacaga 2640gaagactgcc aagaaacatt ttgagttttt cttctctgga ggtgggtcct ggttcctccc 2700atggagacca cgattctgtg tagtcctgca cgctgggcgg gggattgcct ggaggtttct 2760ttagacctgt ctagctcaca cagtcttgat gcctgggttt taggctgctg tactgttgct 2820ggggctcact tcctgtgggt aggctgttat tttgcccgca gatcaagtcc tcactgtcta 2880gatgcctcta tcatggggat ctcttcttcc ctctctggat ggctctgatc cccaagttat 2940ttcctgttgc ctaggtaaca cctctaattg gatgcctttt aatcgttccc ttttttaaag 3000ggataaatgt ggattttatt tccaggtcct gtcagagggc cctgccctag agaacacgtg 3060cgcccctgcg tgggcaatcc cttcactgtg accgcaacca tgggttggat ggggggcact 3120cactgggctg gcctgacagt cacagtgaat cctgaaagca tggttttcac aggaacccac 3180cttcaggatt ta 31923791944DNAHomo sapiens 379agtttcagcc atgttgcagc atgcatcagt acttcatttc tttttatagc tgaataatat 60tccatagtat ttatatatca
aaatttgttt atccattaac ctgtggaggg acatttaggc 120tgtttccacc ttttggctat tgtgaatggt gctactataa acatgtgtac acatgcctgt 180ttaagtatat gttttcagtt ctttggggta tatacctagg agtggaattg tagaatcatg 240tggtaatttt gtttaacttt ttggaaaaat atcaagctgt acccaaagtg gttgcaccat 300tttgcatttc caccagcaaa atgtgagagt tccagtttct ccatatcctt gccaatactt 360atttttcttt ttaaaaaata gctatcctag tacatgggaa gtgacattca ttgtggtttt 420aatttgcatt tccctaatga ttagtgatgt tgagcatctt ttcatgtgtt tattagtcat 480ctggatatct ttggagaaat ggctattcaa gccctttgtc catttttaac tgggttgttc 540ggttttgttg ttgagttgta ggagttcatt atgtattctg gatattaatc acttacctga 600tacatgattt gcaaatattt tctcccattc tgtgggatgc cttttcattc tcttcatagt 660gtcctttgat acacaaaagt ttttcatttt gatgaagtcc aattcacctg tttttttctt 720gaccaaaaag tagaaacaac tgaaatgtcc accaactcat gaacagataa acaaaatgtg 780tatataatgg gatatattca gccataaaat gaatgaagta caaacacata caacatggat 840gaaccttgga aactttatgc taagtgaata cagtcagata caaaaaggga actattgtat 900aattctatgc atgtgaggta cacagaatag tcattttcat aaggacagga aatggaatag 960tggttagcag gggctgaaca gaggagaaga ttggcagtta ttatttaatg gacatagagt 1020gtttttcttt gaatgattaa taagttatgg aactagatag tgataatcat gaatgtactt 1080aataccactg aattgtacat tttaaaattg ttaaaatggg gctgggaaca gtggctcatg 1140cctgtaatct aatcctagca ctttgggagg acaaggaggg aggatggcat gagccttgga 1200gttcgaagtt acagtgaact ctgattgtaa ccacccaatg tgttcacctt gcccgctgcc 1260tagacagagc cgatttatca agacaggata actgcaatgg agaaagagta attcacacag 1320agctggctgt gcaggaaacc ggagttttat tattactcaa atcagtctcc ccaagcattc 1380ggggatcagg gtttttaaag ataatttggc aggtaggagt ttgggaagtg gggagtgctg 1440attggtcagg ttagagatgg aatcataggt ggttgaagtg agtttttctt gctgtcttct 1500gttcttgggt gtgatggcag aactggttga gccagattcc tggtctgagt ggtgtcagct 1560gatccattga gtgtagggtc tgcaaatatc tcaagcactg atcttaggtt ttacaatagt 1620gatgttatcc ccagaagcaa ttaggggaag ttcagactct aggcgccaga ggtggcatga 1680tccctaaact gtaatttcta atcttgtagc taatttgtta gttcgcaaag gcagactggt 1740ccccaggcaa gaagggggtc ttttcaggaa agggctgtta ttaattttgt ttcagagtca 
1800aaccatgaac tgaattcctt cccaaggtta gtttggccta ctcgcaggaa tgaacaaaga 1860cagcttaaag gttagaagca agatggagtt atttaggtct gattgctttc attgtcataa 1920tttcctcagt cacaattttg ccaa 19443802197DNAHomo sapiens 380cagtcacaat tttgccaagg cggtttcatg atcatgcaac tgcactccag cctgggcaac 60agagcaagac cttgcctcta aaaaaagtaa ataaaatggt taaaatggaa atttttatat 120tatgtgtatt ttaccatgat aaaaaaaatg aaagaaaact ggtctagctt tattaatatg 180agacaaaaca gaatttagga caaaaaaatt agagaggacc acttaattat gataaaagct 240tcaagtcatc aggaataatt aacattggta caaaatatgt atgtaccaaa tattattgcc 300ttgacatgta taaagcaaaa gctgtcagaa tcacagagaa actcacaatc cttgcgggag 360atttgaacaa aattatctca gtaactgata gaacaagcag tcaaaaattt tctttcggcc 420gggcgcggtg gctcacgcct gtaatcccag cactttggga ggccgaggcg ggcagttcac 480gaagtcagga gttcgagacc agcctggcca acacagcgaa accctgtctc tactaaaaaa 540tacaaaaaat tagctggtca tggtggcggg cacctgtaat cccagctact cgggaggctg 600aggcaagaga attgcttgaa cccgggaggc agaggttgca aggagcctag atcacgccat 660tacgctacag cccaggcgac agtgcgagac tctgtctcaa aaaaaaaaaa aattttcttt 720cacatcaggg tgagaaaact catacaaaga tcttcctagc agcattattc atgacagcct 780caaactggaa ccgacctatt aataaatatc tatcactagt agaagagata aacacattgt 840attagattaa tccaatgtaa tactgaacag caatggaaat gaaatgaact gtaggtacat 900ccaacaacat ggatgaattt caaaacataa tgctaagcaa ataaagccag actcaaaata 960atatatgctg tattattcca tttacgtgaa gctcaaaaat aagcaaacta aattatatgt 1020gtagagaagc atatttattt gataacatta tttttataaa gcaagaaagt tatttccata 1080aaattcagaa ttgtagattt tttttttttt tttgaaacag agtctcattc tgtcgccagg 1140ctggagtgca gtggcatgat ctcagctcac tgcaatctcc gcctccaggt tcgagtgatt 1200accctgcctc agcctcccta gtagctggga ttacaggtgt gcaccaccac gcccagctaa 1260ttttttgtat tttagtagag acagggtttc accatgttgg ccaggatggt ctcgatctcc 1320tgacctcgtg atccgcccac ctcggcctcc caaagtgctg ggattacagg catgagccac 1380tgtggctggc ctcttttttt ttgagacaga gtctcgtaat tgtggatatt tctaagagga 1440aagaggaaca ttggaattgg aaagagacca gtgggccaaa ggtggaaaat gttgatgtag 1500acttctaaga ttttgacaaa attttgtttt atggcctggt ggttatataa atatttactg 
1560tataacaatt cattaagata cacatttgtg ttttttgtat atatgtgttc tatttcacaa 1620tcttaaatgt tccttaatta attaatggag cacaccttca gagttgggtg ggaaaataat 1680tctgcctaga aatccaaact tagacaagct agctatcaag actgaggaca aactaaagcc 1740attcttacac ctgtaaggat tcagggttta tctactattt atgctatctg aaggagacaa 1800ttgaatatgt tggccaggaa accaagtgtg aggagtatgt agaaaacaga agatgatagt 1860actaaccctg ttaatctaat aaaaagaaac cccaggatga ctgcttgcag tggggtttga 1920aagaaatcta ttcaaattaa aacaggaggt ccatgtgctc caaaaagata ttcttttttt 1980ttaaatatat atatatcttt tattatactt taagttctag ggtacatgta cacaacgtgc 2040aggtttgtta catatgcata catgtgccat gttggtgtgc tgcacccatt aactcctcat 2100ttacattagg tatgtctcct aatgctatcc ctcccccctc ccctacccca taacaggccc 2160cagtgtgtga aaaaacgata gttagatgcc acgaact 21973813184DNAHomo sapiens 381ggtccatgtg ctccaaaaag atattctttt tttttaaata tatatatatc ttttattata 60ctttaagttc tagggtacat gtacacaacg tgcaggtttg ttacatatgc atacatgtgc 120catgttggtg tgctgcaccc attaactcct catttacatt aggtatgtct cctaatgcta 180tccctccccc ctcccctacc ccataacagg ccccagtgtg tgaaaaaacg atagttagat 240gccacgaact aggtggcaat gccttaaccg tatgtgtgtt gtcaggcctg agggcctctt 300ccatccttgt caaggggagt actaaccttc tcccctttca tacaacacaa agatattctt 360aagacttcta gaatagaccc tgaacaattt tagagtaagg aactaataga tatcagtgct 420ttcatgaaga aggcttttgc ttctcctgat gagggaaaaa ttataaaaat tctaagacag 480gaaagttatg atccaaactt gaaataaaca aatgtggtat gaatttgggc aactgtggtt 540ctttaagaaa agagaatcca ccatgcaatt ttctttttct tttttttttt tttttttttt 600tgagacaggg tctcactctg tcacccaggc tggagtgcag tggtgatctc agttcactgc 660aacctctgct tcccggggtc aagtgattct catgcctcag cgtcccaagc agctgggatt 720acaggtgccc gccacaacac ctggctaatt tttgtatttt ttagtagaga caggatttca 780ccctgtttgt caggctggtc ttgaactccc gaactcaagt gatctgccaa cctcagcctc 840ccaaagtgct gggattatag gcgtgagcca ccgcgcctgg cccatcatag aattttctag 900gaatattgtc ctttgagagg tctagggtga tgacataatt atacaagaaa acataatgtc 960ataacaattt aatattttta gtaattttaa atttgtgtca tcaacctaca gacaaaggat 1020gggggttcag gtttctgaac agaatgtaaa ttttcaacct 
caacaatgta aatatcaaag 1080tgaagctcac agaaaccaga aggtagaagt aggaaaagag atggaggcaa gggtagggga 1140aaaaagtcaa gagacgttag tgaaaattga cagaattaaa aacaaatagt ttaagaacag 1200aatctaaatg tataaaagta accaatggaa agaaaaccaa tgatacaaca aaagtcatgg 1260taaaaagaag aaaaggagaa atggggtggt attaattagt taaatcctta ttaatcataa 1320gcaataagta gacaatgcct acagttgata aattaagaat tagcaatata cagaaatata 1380tatgaaacca aaataactaa tgaaagaaaa ggaggctggg cacggtggct caggcctgta 1440atcccagcac tttgggaggc agaggcaggc agatcatttg agtcccggag tttgagacca 1500gcctaagcaa cgtagtgaga cctcatcgct acaaaaaaac agaaaaatta gctgggtgtg 1560gtggtgtatg cctgtattcc cagctacttc agaggctgag gcaggagaat cagttgagcc 1620cagaaggtgg aagctacagt gagccaacag agtgagacca tctcaaaaaa aatttaaaaa 1680aatgaagaag gaaggaagga agagagggag ggagggagcg tgggcggggg ggggggggtg 1740gaggaggagg agagaaggag tgggaggagt ggagaaggag ggggaggagg agaaggataa 1800aaggttacaa gtggttgtta ctaggaatgg gggagaagag aagtgggtaa tggcactgaa 1860gctttttatt atgtctttca gcattctctg attgttctta aaccatcaac agatctcagt 1920atgtagacta aaagggaata tttggtgaag agatcttctt tcactattgt acacttgcta 1980tggacatgtc catgcctgct gcctggcagg caccattcat taagtaggcc cctgttgcca 2040aggaaaccag ctcttcactg ataccaaaga taatgcagag gcctgccgct caccaagcaa 2100ccttcctcat gagctatgcc cccaccttcc tgaactgtct cttgctcctg tttgatactg 2160tcatgctgca cgaagcttac acttgctatc tctcacttcc ctcttagtca tctgtgatgc 2220tggctaaggg agctaggcca gtcagcagtg acctgttgcc cttggtttat tataagcaaa 2280ctgttcacaa gaaatgaact tctgttgttt tataaatgat atgcatcaca gaacacagaa 2340taatatcaaa accacattag ttttttcata cttgcttcat tgaccccagg ggaagagggg 2400agagcaggga gaggactttc tcttttttta aatactaatt atattgaggt ataaagaaca 2460tatagtaagt tcacagacct taagtataca gtttgatgag ttttggcaaa tatgtatacc 2520tgtggaacca acacctcagt caagatataa atacttacat cagccgggcg cagtggctca 2580tgcccgtaat cctagcactt tgggaggcca aggcaggtgg atcatgaggt caggagatcg 2640agaccatcct ggctgtccac taaaaataca aaaaattagc caggcatggt ggcacatgcc 2700tgttgtccca gctactcagg aggttgaggc aggaaaatcg cttgaacccg ggaggcagag 2760gttgcattga 
gccgagatag caccactgca ctccagcctg ggcaacagag agagactccg 2820tctcaaaaaa acaaaaaaca acaaaaaaaa ccatacatcg acccagaaag ttccttctgt 2880cagtagcagt tcaccccccc atgcccccaa cccttggcct ccctgccttc ccatctccac 2940tcccaaccct cactgctctg attctatcac cattgttttg attcttctgc tgttgatctt 3000cataaaacca gtatatttcc ttttgtgtct ggtttatttt cctcagaata atgtttttaa 3060catttatcca tattgttatg tgtatcagtc gtttcttcca gattagtact ctattgtatg 3120gatagagcct attttgttta cccatttcct gttgacagac atttggtttg ttcccagttt 3180tgga 31843823103DNAHomo sapiens 382tggtttgttc ccagttttgg attataatga ataaagctgc tatgaacatt cttgaacgat 60gaacattttt gtggacatat gttttgattt ttttgtgtaa atacctagga gtgaaattat 120tgaggtatgg tataggttta tgcttaattt tatagagtac ttaaacttga ttcttttatt 180taaaattgtg ataaaataca cataacataa aatgaaccgt cttaactgtt tttaactgta 240cagtgcagtg gtacgaagca cattcacatt gttgcacaac catcaccacc atccatctgc 300agaattattt ttatcttgca aaactggaac tctgaaccag gtgcggtggc tcacgactgt 360aatctcagca ctttgagagg ccgaagcagg aggatcgctt cagcccagga gtttgagacc 420agctggggca atatagtgag acaccgtctc tataaaaaca aaataaaaat agaccaggcg 480cgatggctca tgcctgtaat cccagcactt tgggaggcca tggtgggcag attgcctgag 540ctcaggagtt caagaccagc ctgcccaaca tggtgaaacc ccatctgtac taaaaataca 600aaaaattacc tgggcatggt ggcgcgcacc tgtagtcccg gttactctgg aggctgcagc 660aggagaatcg tttgaacctg ggaggcggag gttgcagtga gccaagatcg tgccactgca 720ctacagcctg ggcaacagag tgagactcta tctcagaaaa ataaaatagc tgggcgcggt 780ggctcatgcc tgtaatccca gcactttggg aggctgaggc gggcggatca cgaggtcagg 840agattgagac catcctggct aacacgtgaa accccgtctc tactaaaaat acaaaaaaat 900tagctgggag tagtggcggg cgcctgtagt cccagctact caggaggctg aggcaggtga 960atggcatgaa cccgtgaggt ggagcttgca gtgagccgag atcatgccac tgcactccag 1020cctgggcgac agagcgagac tccatctcaa aataaataaa taaataaatg aaatgaaata 1080aaataaaata aaataaaaat agccaagtat agtgatacac atctgtagtc ccagctactc 1140aggaagctga ggtgggaggg tcacttgagc ccaggagttc aaggctgcag tgagctttga 1200gcgtgccatt gtactctccc tgggtgacaa agcaaggccc tatctaaaac aaacaaacaa 1260gcaaacaaaa aaccccaaaa ctggaactct 
gtatctatta aacagtaatc tctcattgag 1320tggtgttaag agtaaaattt tttttaacaa aagaaaaaag taaaaagtaa attttgaaaa 1380aagaattaaa aacaaaaaat ctccattacc ccctccccca gcccctggca accaccattc 1440tactttctgt ctttctgaat ttgactactg cacataacct tatataggtg gaatcaaaca 1500gtatttgtct ttttgtgact gacttatttc acttaggata gtgccctcag cttttaaaag 1560gaaagacatt ttgatatatg ctacaacata atattccatt gtatgtacat accaaatttt 1620attaacgatt tcatctgtca atgaacattt gggttgcttc caccttttgt ctattgtgaa 1680taatgctgcc gcgaacatgt ttaagtcctt gctttcactt ttttgtgtat acacccagaa 1740gttgaaatgc tggattatat gtaattctat ttttaatatg agtgactgcc atactgtttt 1800ctatagtggc tgtaccgttt tacgttccca ctaagagaac atgagtgttc cagtttcacc 1860atatcctcac caacacttat tttctgtttt gttggtggta gccatcctac tggatgtaaa 1920ctttattcat ttttcgaacc tttttaatat ggaattttca aacacacaca aaagatgaga 1980gatctccagg tacccaccac aagctttaat aatgattaac atttggtagc aggtggacaa 2040agatatacct tctctatagc agctataaga tcagggacaa acaaagatct atttggaact 2100ccaactaaga atggtgtttt gtaggctgcc tgatgaataa ggttagataa ctaatggcca 2160gtctttcagc ctgtgctcaa gggataggat aacaataaag catagttggt gaaggagcag 2220cagataaagg tcacaataga taggccataa gagaaccctc actatcactt accattcaga 2280ccattcgctt catattctaa caagttattt tcctttcata aaaggaagct gaagctttta 2340tttgtgtttg tggtgcatgt gatccatgag aggggactca accaggtgct atgtgtgagt 2400agtacttaat ccgacagtat tagtgggctg gtgggctttc ctggttacat gggaacccta 2460gaaacccaag ccaagcacaa aagccaagac tgaattctcc agtaagtcac ctggtagcct 2520tgacatgctc atgcttaaaa aagagccagt gacctattaa taggaagctc ctgaaatgag 2580tcctctgaac atctgcaagt atggtcagct acacctgagc tgagacttgc ctgtttccct 2640gccaggaaat catgggctca gaaatggcag gtaccatgtg tattaactat atttccttac 2700tttctgtctt cttgatgttc tagcatcagg tgcctctttg acctaagaga cttcccctcc 2760taggactagc taattcctag aaatatcaaa ccactcccct gtaagcatgc cattcctatg 2820caaaccaacc aatccagagc ccatactcga aaccacttcc tttacctggc tcttccacac 2880cagagggcaa tgttcctctg tcctaatcat tctcagggct agatatcaga taactacaaa 2940tgctccttga cttatggtgg ggttacatcc taataaaccc atcataagtt gaaaatatca 
3000taagtcaaaa gtacatttaa atcaggtgtg gtggcacatg cctgtagtcc cagctatctt 3060gggaggctga gacaggagga tcacttgagg ctgtggtgca cta 31033832997DNAHomo sapiens 383ggatcacttg aggctgtggt gcactatgat catgcctgtg aatagccact gcaccccagc 60ctgggcaaca gagtgagacc tcatacctta aaaaaaaatt aaaaaacaag ctctcaccag 120tcgaagatga ggcaacataa gcaataagca ctaaaaagaa taatgactgc aaaccaaaat 180aaataagaag ataaaagtcc acaagtttat aaataaaagg ttttatttaa aagcccacaa 240gtaaaaaaga ggagagaaac gctaactcct aactctgaaa attagtaatt aaagggaaag 300aagccttcaa caattttttt ttttctgaga cagagtcttg ctctgtcact caggctgaag 360tacagtggca taatcacagc tcactgcacc ctcaaactcc tgggcttaag caattctcct 420gcctcagcct cccaagtggc ttggactaca ggtgtgtgcc tccacactcg gctgatttta 480aatgttttgt agagatgggg tctcactata ttgcccaggc tggtctccaa ctcctgagct 540caagcaaaac tcccacctca gcctcccaag tagccagggc cataggtgtg caccaccatg 600cccagctaat tttccttttt tccattttgt agagatgggc tctcgccatg ttgcccaagc 660tgttctccaa ctcctgagct cagacaatcc tcccgcctcg gcctcccagg tgtgagccac 720tatgcctggc caaaaatttt ttaatgaagt ccccctgggt ccaggcactg gtttgggcac 780tgaagattca gcagtaaagt aaaataaatt tcctcattga tcttgtcagt gactcctttg 840catgcttgct tcacactata tttcaaggta accaaatagt tgtagttaga aaaagttcca 900tcttacagaa aactttcagc taataaatgt agagggaatg ataaagttag aaaaataact 960atattttaag tcctaatgaa acaacagacc cacacaacaa tgaccaacgg atgaaaaata 1020tcaggtgaaa cacttatacg gaacctgtca gtggcaagat tgggctgtaa ccacctgaaa 1080ccactgacca atctcggcat tactaaaaca gggctgacca gatagtctgt gattctgatg 1140taaagcaata aggagtacat agcaccacct tttcagtagc caaaacagtt aaacctgaat 1200ctaatcaaga ctttagaatt acctttacga ttggatgaaa tatggagagc agaagaacaa 1260attcaacagc acaaaaagga agaaaacaga taaatctaga gtgggccacg ttctacaaaa 1320ctgagctggt ttcttggtca agacaatagc atggaaaaaa atgggaagta ggcaaagaga 1380ctcctctgga ttgaatgaaa tttaagagac acaatagcca agtatgatgt gtggaccttg 1440tttggatcta gatttgaaga aatcgattgt aaaaagtaat ttttgaaaac aaatagggaa 1500atctgaatat gggctaggca ttagttttta ccaaagaatt attattaggt gtgataatgg 1560tactgtaatt acgtaagatt gtaattattt atattgtttt 
tagagataaa aataagacct 1620tcagtgctga agaagggcac agtggcacat ggcacagcac agcatctaca tcatcagtca 1680aataagaatt tttttttttt tttttgagac agagtctcgc tctgtcggcc aggttggagt 1740gcagtggcaa gatctcgact cactgcaacc cctgcctccc gggctcaagc aattctcctg 1800cctcagcctc ccgagtagct gggattacag gggtgtgcca ccatgcccgg ctaatttttt 1860ttgtgttttt agtagagacg gggtttcacc atgttggcca ggctggtctt gaactcctga 1920cctcaggtaa tccacccgcc tcggcctccc aaagtgctgg gattataggc gtgagccacc 1980gtgcccggcc cagtcaaata attcaaggca gctgtcaggc taaagttcgg cgagcgacac 2040gcggctgggc ggcgggagga aacgcggggc cgggccgggc gctggagatg gtccccggcg 2100ccgcgggctg gtgttgtctc gtgctctggc ttcccacggc ttccgtatcc atgattattt 2160gtactttcaa gtgctgagtc ctggggacat tcgatacatc ttcacagcca cacctgccaa 2220ggacttcggt ggtatctttc acacaaggta tgagcagatt caccttgtcc ctgctgaacc 2280tccagaggcc tgcggggaac tcagcaggtt tcttcatcca ggaccagatc gctctggtgg 2340agagtggggg ctgctccctc ctctccaaga ctcgggtggt ccaagagcat ggcgggcggg 2400ccgtgatcat ctctgacaat gcggttgaca atgacagctt ctatgtggcg atgatccagg 2460acagtaccca gcgcacagct gacatctccg ccctctttct tctcagccga gaggctacat 2520gatccgccgc tccctggaac agcctgggct gccatgggcc atcatttcca tcccagtcaa 2580tgtcaccagt atccccacct ttgagctgca gcaaccgtcc tggtccttct ggtagaagag 2640tttgtcccac attccagcca taagtgactc tgagatggta aggggaaacc caggaatttt 2700gctatttaga atttgggaat agcatttggg gacaagtgga gccaggtaga ggaaaaggat 2760ttgggcgttg ctaggctgaa agagggaaac cacaccactg accttccctt ccccagggcc 2820cccaagggtg tcccagaaga ggtaagagac aggccccagg gcttctggat agaacctgaa 2880acaaaaggtg ctgaaggtag gtggcctgag agccatctgt gacctgtcac atctcacctg 2940gctccagcct cccctaccca gggtctctgc acagtgacct tcacagcagt tgttgga 29973842067DNAHomo sapiens 384caccactgac cttcccttcc ccagggcccc caagggtgtc ccagaagagg taagagacag 60gccccagggc ttctggatag aacctgaaac aaaaggtgct gaaggtaggt ggcctgagag 120ccatctgtga cctgtcacat ctcacctggc tccagcctcc cctacccagg gtctctgcac 180agtgaccttc acagcagttg ttggagtggt ttaaagagcc ggtgtttggg gactcaataa 240accctcattg cctttttagc aattaaaaaa aaaaggcaat aaaaggcata atataggttt 
300tagaaattta tatttataat gggtttgatg tacaataaag atacattagt tattaaacaa 360ggtataaaaa tactcaattc aaggatatgg aaaaataatg aaaaaaataa gaaaatagga 420agaattaatt ttaaaaagca gaagtcaatg aaatagaaaa taataatact gatatatagg 480ctgggtgtgg tggctcatgc ctgtaatccc agcaatttag gaggccaagg caggaggatt 540gcttgagcct aggagttgga gaacagcctg ggcaatatag gaagacccca tctctacaaa 600aaatttaaaa tcagccagac atggaggtgt gcgcctgtag acccagctgc aggggaggat 660cacttgagcc caggatcctg aagctgcagt gtgccatgtt tgcaccactg cactccagcc 720tgggtgacag agggagaccc tgtcaggaag gaaagaagag aggaaggaag gaaataataa 780taataatata taaatgcagg aataaattct tttaaaaaga caaaaataat ctgtggtgag 840cctaattaag aaaaagagaa agcccatgag agagggagca taacctgaga tacagagaaa 900acaaaaatgc taaaaataac tcaataaatt tgaaaacctt aatgaaaaac tccctaggaa 960aatttgttaa aattgaaatt aattcaatat gtgtaagata gaagaaatgg aaaagttgtc 1020agagaactac ctaaagtgaa gctgggtgcg gtggctgaca cctgtaatcc cagcactttg 1080ggaggttggg ggcgagagga tcatttgagc tcaggagttc aagaccagcc tgggcaacag 1140ggcaaaaacc ccatctccac caaaaaaaaa cattaaaatt agccgggtgt ggtagagtgt 1200gcctgtagtt ccagctacta aggaggctgt ggtgggagga tcacttgaac ctggaggtca 1260aggctgcagt gagttgtgat tatcccaccg cacagcctgg gtgacagagt gagaccctgt 1320ctcaaaaaaa ccaaaccaaa ataaaccgaa aaaaaaaaaa aacctaaagt gacaccatcc 1380tcattctttc ttaaaaaatg aattattggc cgggtgcggt ggctcacgcc tgtaatccca 1440gcactttggg aggccaagtc gggtggatca cgaggtcagg
agatcgagac catcctggct 1500aacacggtga aaccccatct ctactaaaaa tacaaaaaat tagctgggcg tggtggtgga 1560cacctgtagt cccagatact cgggaggctg agacaggaga atggcgtgaa cccgggaggc 1620ggagcttgca gtgagccaag atcatgccac tgcactccag cctgggcgac agagcaagac 1680tccgtctcaa aaaaaaaaaa aaattatttt actgatgtat aataggtaca catagatttg 1740gagtacatgg gattaataaa gttcaaattg gtgtacttgg gacatccatc accttaaata 1800tttgtctttt ctttacactg gaaacatcca agctattctc ttctagctac tttgaaatgt 1860acaagattac tgtaaactat caaacactag gtcatatttc ttctataaaa ccatatattt 1920gtatcagttg atcaacttct cttcctcgtc tcctcctgat acctttcctg gcctctggta 1980accataaatc tactctctat cttcatgaga tccaattttt tagtttccac atatgagtaa 2040gagcatgtga tatttgtctt tctgtgc 20673852054DNAHomo sapiens 385ctcttcctcg tctcctcctg atacctttcc tggcctctgg taaccataaa tctactctct 60atcttcatga gatccaattt tttagtttcc acatatgagt aagagcatgt gatatttgtc 120tttctgtgct tgacttattt catttagcat gatgaccttt aattccatgt tgctacaaat 180gacaggattt catttttatg gctgaataat attctatttt gtatatgtac cacatacaca 240ttttcttttt ccttttcttt tttttttttt ttttttttga gatggagtct cgctctgttg 300cccaggctgg agtgcagtgg ttccatctcg gctcactgca agctctgcct cctgggttca 360tgccattctc ctgccttagc ctcccgagta gctgggacta caggcgcccg ccacaacgcc 420cggctagttt tttttgtttt gtttttgttt tctgtatttt tagtagagat ggggtttcac 480cgtgttagcc aggatggtct cgatctcctg accttgtgat ctgcccacct tggcctccca 540aagtgctggg attacaggcg tgagccaccg tgcctggccc acatccacat tttctttacc 600tattcatccg ttgatgagca ctttgattcc atatttgagc tattgtgagt agtgctgcaa 660caaacatgag agtgcagata cctctttcgt atactgattt tctttctttt ggatatacac 720tcagtagtgg aattgctgga tcatatggta gttctagatt tatgaagaaa cgccatactg 780ttctccatag tgactgtact aatttacatt cccaccaaca gtgtacaagg gttccccttt 840ctccacatcc tcaccagcat ccgttattgc ctgttgtttt gataaaagcc attttaactg 900gggtaagctg acatctcatt gtagttttga tttgcattta tctaatgatt agtgatgttg 960agcacttctt catgtacctg ttggccattt gtgtgtcttc ttttgagaac tgtctattca 1020gatcttttgt ccatttttaa atcggatttt ttttctattt gtttgagctc cttgtatatt 1080ctggtcacta actccttgtt agatgggtag 
tttgcaaata ttttctccta ttctgtgggt 1140tgtctcttta gtctgctgat tgtttccttt actgtgccgc ttcttagctt gatgtaagct 1200cacttgtcta ctttcgcttt ggttgcctgt gccgttgagg tcttacacaa aaaatttgcc 1260cagatcactg tcctgaagaa gaaactgtct ccagtttctt ctaacagttt cacattagag 1320ttaagtcttt tttttttttc tttaagacag aatctcgccc tgttgcccag gctggagtcc 1380aatggtgcga tctcggctca ctgcaaccac agcctgtggg ttcacgccat tctcctgcct 1440cagcctcccg agtagctggg actacaggtg tacgccatca tgcctggata attttttgta 1500ttttcagtag agatgggttt tcaccatgct ggccaggctg gtctcgaact cctgacatcg 1560tgatctgccc gcctccgcgt cccaaagtgc tgggattaca ggtgtgagcc accgcgccta 1620gcccagactt aggtctttaa tcaattttga tgtgattttt ttttttgtat ggtgagagat 1680agtttagttt atttcttctg catatagtta tccagttttc ccagtaacac ttactgaaga 1740gactgtcttt ttcccattgt atattcttgg tacctttgtc aaagatgagt tggctgggtg 1800gatttacatg agttctctat tctgttccat tggtctatgt ctctattttt atgccagtac 1860catgctaatt tggttactac agctttgcag taaattttga agtcaggtag tgaaatgcct 1920tcagctttat tctttttgct caggattgtt ttgtctatta ggggtctttt ctagttccac 1980ataaatttaa ggattttttt tctatttctg tgaagaatgt cgttggtatt ttcacaggtt 2040ttgcattgaa ttgg 20543863000DNAHomo sapiens 386cgagcagctc tctcttcagg agtgaaggag gccacgggca agtcgccctg acgcagacgc 60tccaccaggg ccgcgcgctc gccgtccgcc acataccgct cgtagtattc gtgctcagcc 120tcgtagtggc gcctgacgtc gcgttcgcgg gtagctacga tgaggcggcg acagaccagg 180cacagggccc catcgccctc cggaggctcc accaccaaat aacgctgggt ccactcgggc 240cggaaaacta gagcctcgtc gacttccatc ttgcttcttt tgggcgtcat ccacattctg 300cgggaggcca caagagcagg gccaacgtta gaaaggccgc aaggggagag gaggagcctg 360agaagcgcca agcacctcct ccgctctgcg ccagatcacc tcagcagagg cacacaagcc 420cggttccggc atctctgctc ctattggctg gatatttcgt attccccgag ctcctaaaaa 480cgaaccaata ggaagagcgg acagcgatct ctaacgcgca agcgcatatc cttctaggta 540gcgggcagta gccgcttcag ggagggacga agagacccag caacccacag agttgagaaa 600tttgactggc attcaagctg tccaatcaat agctgccgct gaagggtggg gctggatggc 660gtaagctaca gctgaaggaa gaacgtgagc acgaggcact gaggtgattg gctgaaggca 720cttccgttga gcatctagac gtttccttgg 
ctcttctggc gccaaaatgt cgttcgtggc 780aggggttatt cggcggctgg acgagacagt ggtgaaccgc atcgcggcgg gggaagttat 840ccagcggcca gctaatgcta tcaaagagat gattgagaac tggtacggag ggagtcgagc 900cgggctcact taagggctac gacttaacgg gccgcgtcac tcaatggcgc ggacacgcct 960ctttgcccgg gcagaggcat gtacagcgca tgcccacaac ggcggaggcc gccgggttcc 1020ctgacgtgcc agtcaggcct tctccttttc cgcagaccgt gtgtttcttt accgctctcc 1080cccgagacct tttaagggtt gtttggagtg taagtggagg aatatacgta gtgttgtctt 1140aatggtaccg ttaactaagt aaggaagcca cttaatttaa aattatgtat gcagaacatg 1200cgaagttaaa agatgtataa aagcttaaga tggggagaaa aacctttttt cagagggtac 1260tgtgttactg ttttcttgct tttcattcat tccagaaatc atctgttcac atccaaaggc 1320acaattcatt ttgagtttct ttcaaaacaa atcgtttgta gttttaggac aggctgatgc 1380actttgggct tgacttctga ttaccctatt gttaaattag tgacccctct tagtgttttc 1440ctgtccttta tttcggagga cgcacttcga agataccaga ttttatgggt catccttgga 1500ttttgaagct tataactgtg acaaaaaatg tgaagggaag agatttgaaa catgtggaag 1560gaaaagtgag tgcagactat aaacttccaa aaagacaagc ccaaaataca cctaaacgtt 1620atgtcagatt attttgttaa aatcagttgt tagtgacgtc cgtacgttaa tagaaaaaag 1680aatgcttcag tttggagtgg taggtttcta gagggattta ttgtgaaagt ataaactatt 1740cagggcaatg ggactgagag aacagtgggt agaaaggacc actgaaggaa aggaagagaa 1800ttggaaggta gatgaaagaa ggagcaagaa cctggggatg ttttttcctt ttcacttgta 1860atagtagtaa cagaagcaat ggcagactgg cttttgtttc tactgtgtta gaatgaattg 1920acaggacaac tgggcctatt attgtactgt gccagaatac tgtaaaacaa aactaaacat 1980actagcttgg tggcttgtaa ttaattactt aagtggagat ttttattttt tttttatttt 2040ttttttagac ggagtctcac tttgtcaccc aggctggagt gcagtggcgc gatctcagct 2100gactgcaacc tcctcctcac aggttcaagg gagattctcc tgcctcagcc tcccgagtag 2160ctaggactat aggcatgtgc caccacacct ggctaatttt gtatttttag tagagatggg 2220atttctccat gttggtcagg ctggtgtcaa aactctcgat ctcaggtgaa ccgcctgcct 2280cagccttcca aagtgctggg attacaggcg tgagccaccg cgccctgcag ttttttgtat 2340ttttaataga gacagggttt caccatgtta gccaggatgg tctcgatttc ctgacctcag 2400gtgatctgcc cgctttggcc tcccaaagtg ctgggattac aagcatgagc caccgcgccc 
2460ggctcaagtg gagattttta tatggagtcc agttatactc tttttaatat ataagttgag 2520atgactaata caacttcaat acaggggctc atgagaaatg tctgtaatat ttaagtaact 2580tattgtcttc tttctttttt ttttaagatg aagtcttact ctgttgccca ggcggaagtg 2640cagtggcgtg atcttggctc agggcaacct ctgcctcctg gtttcaagcg atcttcctgc 2700ctcagcctcc cgagtagctg ggagtacagg cgtgcatgac cacacccggc taattttttt 2760atttttagta gagacggggt ttctccatgt tggccgggct ggtcttgaac tcctgacctc 2820aggtgatccg cccacctcag cctccccaag tgttgggatt acaggtgtga gcccccgtgc 2880ccagcctatt atcttatttc tgaataaaga attgtctgtg tggggaatag ataactcttt 2940ctcatgcagc ccctgctaga aaatttgttt tctctagcag ttggtctgtg cttataggct 30003871977DNAHomo sapiens 387ttctctagca gttggtctgt gcttataggc tactctttga aagcacaaaa aatttattga 60cttctttttt ttgggttttt tttttttttt gagacagagt tttgcccttg ttgcccaggt 120tggagtgcaa tggcgcgatc tcagctcacc gcaacctcca cctcctgggt tcaagtgatt 180ctcctgcctt agcctcctga gtagctggga ttacaggcat gcgtcaccat gcctggctaa 240ttttgtattt ttagtacaaa tggggtttct ccatgttggt caggctggtc tcaaactcct 300gacctcaggt gatccacccg ccttggcctc ccaaagtgct gggattatgg gtgtgagcca 360ttgcgcctgg ccagaaaatt cattgacttc ctaaagattt attaactttc tgcattactt 420ttttttttcc cctccatcgt aaatataaaa gggaatagta gagaaaatca ttcagaattt 480tattttttag tgacattatt tagtgacatt ttattagagt cacttaggaa cctgaggctg 540aataaagttc aggtaaaagt aaaattagtt gagaagagac atctgccaaa agaaatctat 600ttttaacttc acttgctgtc tttcctagag gaacagaaat agtgctgaat gtcctattag 660aaatgatggt tgctctgccc gtctcttccc tctctctcac acaatatgta aactcataca 720gtgtatgagc ctgtaagaca aaggaaaaac acgttaatga ggcactattg tttgtatttg 780gagtttgtta tcattgcttg gctcatatta aaatatgtac attagagtag ttgcagactg 840ataaattatt ttctgtttga tttgccagtt tagatgcaaa atccacaagt attcaagtga 900ttgttaaaga gggaggcctg aagttgattc agatccaaga caatggcacc gggatcaggg 960taagtaaaac ctcaaagtag caggatgttt gtgcgcttca tggaagagtc aggacctttc 1020tctgttctgg aaactaggct tttgcagatg ggattttttc actgaaaaat tcaacaccaa 1080caataaatat ttattgagta cctattattt gctgggcact gttcagggga tgtgtcagtg 1140aataaaatag attaaaatct 
attctcttct gatgcttaca ttatagtggt gggagacaaa 1200atgggtataa taaatattat attagatagc attaagtgct gtggagaaaa ctaaagcagg 1260gaggaagata ggagtgtgca agccagaaag gttgcaatta aattgagtag ttcaggaagg 1320cttcaatatg gatgtgatat ttgagagacc ggtggaagtc aaggagcaag ttgtgaggct 1380atttaaaggt attcttggct tacagaacaa tatacgcaaa gactattaaa tggaagcata 1440cctgacatgt taaaggacta tcaaggaggc cagtttgtct agaggctgaa aaggaaagag 1500taataggaga tgaggtctga gtgaaaacac gtaaatcctt gtgggccaag gtaaaatctt 1560tagctttttt tctgaatatg gtgggatact gttagagggt tttaagcaga ggttacgtgg 1620tgtggtgagt tttttttttt taatcctttg tctttctgtg tggaaaatag caggacaggg 1680cagaagcagt ctgtcctgca gactgcttgg tcgcagtaga gatgtaagaa gcagtgagat 1740tctgggttaa ttatggaggc aaagttctca gaatttgctg atatagggta tgagagaaag 1800aggaatcagg aatgatttca aggttttggt ctgctaaatg gaaggagttg ccatttacta 1860agatgggaaa gactatgaaa gaagcagatt ttcagagaga tcagaagttc attttggggc 1920atgttcaatt taagatgcct gttagttgga tgtttatgtg agtttggaat gcagggt 19773883091DNAHomo sapiens 388gttcattttg gggcatgttc aatttaagat gcctgttagt tggatgttta tgtgagtttg 60gaatgcaggg tagagattta gggatgaata tttggtagtt gtctgcattt taatggtatt 120aaaagccacg agaaggatgg gcatggtggc tcacacctgt aatcccagca ctttgggagg 180ccaaggcggg cagatcacct gaggtcggga gttcgagacc agcctgacca acatggagaa 240accccatctc tactaaaaat atataattag ccgggcgtgg tggcacatgc ctgtaatccc 300agctactcgg gaggctgagg caggagaatc gcttgaacct gggaggtgga ggttgcgatg 360agccgagatc gcaccgttgc actccagctt gggcaacaag agcaaaactc catcaaaaaa 420aaaaaaaaaa aaaaaaaaaa gccttgagac tcacctgaaa agatgctcaa cattattggt 480cattaggaaa atgaatgaaa accacaatga gataccactt cacacctatt aggatggcta 540ttatcaaaaa caaaaacaag tgtttgcaag gatgtagaga ttggaattct tgtgtattgc 600tagagggaat gtaaaatagt gcagggtgct gtggaaaatg ctgtggtgat tcctcaaaaa 660attaaacata attatataat ccagtaattc cacttctgag ttattcccaa aagaagggat 720gcaagcagat atttgtacac tcatattcat ggcagcatta tttacagtag ccaaaaggtg 780aaagcaacct aagtgtccgt cagtggatga atggataaac aaaatggaat aatttcagcc 840ttaaatagaa ataaaatgtt gacacatgtt gcaacatata 
cgaaccttga agacatcatg 900ttaagttaaa taagttggtc actaaaggac aaatattgta tgattcccct tatgaggttc 960ctagagtagt cacattcata gagacagtag agtggtggtt gcccagggcc ggggggagcg 1020aggagaatgg aaattattgt ttattgggta cagagtttct gtttggggaa gatgaaaaaa 1080ttctggagat ggatcatgat gatagttaac acagcagtgt gaatatagtt aatggcacag 1140aactgtacat ttaaaaatgg ttaagatgga aaattttctg ttacatatat tttactgcaa 1200tttttttaaa ttttattatt atactttaag ttttagggta catgtgcaca acatgcaggt 1260ttgttacata tgtatacatg tgccatgttg gtgtgctgca cccattaagt catcatttag 1320cattaggtat atctcctaat gctatccctc ccccctcccc caccccacaa cagtccccag 1380tgtgtgatgt tcccctttct gtgtccatgt gttctcattg ttcaattccc acctatgagt 1440gagcacatgc agtgtttggt tttttgtcct tgtgatagtt tgctgagaat gatggtttcc 1500agcttcatcc atgtccctgc aaaggacatg aactcatcat tttttgtggc tgcatagtat 1560tccatggtgt atatgtgcca ccttttctta atccagtcta tcattgttgg acatttgggt 1620tggttccaag tctttgctgt tgcgaatagt gctgcagtaa acatacgtgt gcatgtgtct 1680ttatagcagc atgatttata atcctttggg tatataccca gtaatgggat ggctgggtca 1740aatggtattt ctagttctag atccctgagg aattgccaca ctgacttcca caatggttga 1800actagtttac agtcccacca acagtgtaaa agtgttccta tttctccaca tcctctccag 1860cacctgttgt ttcctgactt tttaagatcg ccattctaac tggtgtgaga tggtatctca 1920ttgtggtttt gatttgcatt tctctgatgg ccagtgatga tgagcatttc ttcatgtgtt 1980ttttggctgc ataaatgtct tctttcgaga agtgtctgtt catatccttc actcactttt 2040tgatggggtt gtttgttttt ttcttgtaaa tttgagttca ttgaaaaatt agaatttttt 2100tttttttccc ttttttagag gcaaggtctc actctgtcgc ccacactgga gtgcagtagt 2160gtaagcatag ctcactgtaa ccttgaactc ctgggctcaa gcaattctgt catctcagcc 2220agctgaagta gtaactgtag gttcacacca ccatgcctat ttttgttttt gtagaaatag 2280ggccttgctt tgttgccaag gctggtcttg aactcctgac ctcaagcagt cctcctgtct 2340cagcctccca aagtgctggg attataggtg tgagccactg cacccagcct tggagatttt 2400taataaagaa gcttgtcaat taaacaaaca acaaaaagcc ctgagactga atgagataat 2460caagagagta tgtgtagata gagaagaggt ccaaggaagg agtcttgggt gactctgatg 2520tcaagtgagg acatgaggca gaaacagcag tgactgagaa ggagccacct agtaagaaag 2580gaggaacacc 
aggacagtgt ggtattctgg attccaaaca aggaagttac tgctaatttt 2640aaagctcttc tcaggctggg catggtggct cacacctgta gtcccagcac ttcgggaggc 2700tgaggtaggt aaatcacttg agctcatgtg tttgagacca gcttgggcaa catggtgaaa 2760cctcatctct actaaaaata taagaaatta aggccaggtg tggtagttca tgcctgtaat 2820cccagtgctt tgggaggtca aggcagccag atcatttgag atcaggagtt cgagaccagc 2880atggccagca tagtgaagcc ccatctctac taaaaataca agaaaaaatt aaccaagcat 2940ggtggcgcat acctgtaatc ccagccactc tggaggctga gacatgaaaa ttgcttgaac 3000ccgggaggcg gaggttgcag tgagctgaga tctcgccact gcacttcagc ctgggtgaca 3060gagcaagact ctgtctcaaa ggaggttgca g 30913892168DNAHomo sapiens 389tgtctcaaag gaggttgcag tgagctgaga tctcgccact gcacttcagc ctgggtgaca 60gagcaagact ctgtctcaaa aaaaaaaaaa acaaaaacca agaaaagaaa aaaaaactct 120tctaagagga tttttttttc ctggattaaa tcaagaaaat gggaattcaa agagatttgg 180aaaaatgagt aacatgatta tttactcatc tttttggtat ctaacagaaa gaagatctgg 240atattgtatg tgaaaggttc actactagta aactgcagtc ctttgaggat ttagccagta 300tttctaccta tggctttcga ggtgaggtaa gctaaagatt caagaaatgt gtaaaatatc 360ctcctgtgat gacattgtct gtcatttgtt agtatgtatt tctcaacata gataaataag 420gtttggtacc ttttacttgt taaatgtatg caaatctgag caaacttaat gaactttaac 480tttcaaagac tgagaattgt tcataaataa actattttac ctgcagagac ctctgatata 540tgtttcttga tggaagtacc cagtaccacc tatgaagttt tcttgtcaaa aaatcaaatg 600tgaatctgat cattacttag atctaagtac caatatatga aaaatatagg agacaaggaa 660gcatggtaaa tgatactgag attgggagac tacatggaaa aagacttgtt cccttcaaca 720gatagacagc agggaaaaaa gaatagagaa aggagtaaag aacctgtaga ttaaaagaca 780tttaagggac atatgaacca ggtccagtgt atagatctta cctaaatcct gatggagcaa 840actataaaaa aatttttttg agacaaatgt ttgaatacag gttgactatt tgatggcatt 900aaggagaaat tatgaattat cttggtataa gaatattgtc atgggttttt ttttttgagt 960ccttacctgt taagatacat actaaaatat ttgtgggtaa aattatatga cgtataggag 1020tatatgattt agaaaacgga ttaaaatata aaaggataaa ataggatctt atattttgtg 1080actcacttcc tgttggatat ctttctaccc agtaaatata gtcctatcta ggttttaatg 1140gctacatgta tgtactgtag tttgtttaaa tggtttccta ttgaacattt atgctctttg 
1200ccattttttc ctgtttaacg ttctgttttt ttttttgttt tttttttttt ttgagacagt 1260cttgctctgt tatccagact ggagtgcagt gacatgatct cagctcactg caacctctgc 1320cttctgggtt caagctattc tcctgcctca gcctcctgaa tagctgtgat tacaggcgtg 1380caccactatg cccagctaat ttttgtattt tgggtagaga cagggtttgg ccatgttggc 1440caggctggtc ttgaactcct gaccttgaat gatctgcccg ccttggcctt gcaaagtgct 1500ggggttacag gcatgagcca ccacgtctgg ccttgtttaa ggtcctgatg agtattctta 1560taggtacact gtgtttcgtt taattatttc cttaggataa atttatagaa ataacattcc 1620ttggtaaaag aatacatatt ttaaaaactg tattagtttc ctgttgctgt caaaaaattt 1680ccagaaactt agtggcatta aacaatacaa attaattatt ctacagttct ggagatcaga 1740agatacgggt cttactaggc ctcactaggc taaaatcaag gttttggcag ggctgtgttc 1800ctctatggag gttccaaggg accagagaaa ctactttaca gtagttattt taagggaatg 1860aaagtgaaga tggggttggg cagtcaaaga ggctgttact tttcattttt ggcctttcag 1920tagtttgaat ttttttatca tatacatgta ttactttaat ttttaaaaag taaaaagcag 1980ctgtgattca gtctctgtaa tttagatcaa tttacatcaa actagggtgg tctcatgtgt 2040tgtcttgctc acagtgacca ctagattatt ccaagaaggg acaatttcca agacttggtt 2100tacactgaga cggctcctga ttttaaggat accttagatc aaactctagg aaggcagttt 2160cattttgg 21683902008DNAHomo sapiens 390agttccctgg gtcattttcc aagcccatgg cctcctggag tcttcgccta gctgtaggtt 60atctttgtgg ctattatttc actgtaatta tacaggaaga tttattgagg gatttctgtg 120taccagccgt ggttctcagc actttgtata ctttgtatta actctgactc ctgacagtaa 180ctctacagag gttctgctgt tacccagttt tacatagaaa catggccagc ggacgcagtt 240agaaaatggc aaagtgggga ttagaaacta ggcagtttga ctccagagtc tgtgcccctg 300tccacttggc tccactgctg gggaagaggc ctctgaagca gcaggaccat ctgctgtgcc 360gtgtgtagtg gtactctatc ttcctggtgt gatgttgtgt tctactttgc attttcatgt 420ctttccttat acaggtctca aaatcattta cttttttttt tttttttttg agacggagtc 480tcactctgtt gcccaggcta gagtgtagtg gcatagtctc actcactgca acctccgcct 540ccgaggttca agtaattctc ctgcctcagc ctcccaagta gctcggatta caggcacatg 600ccaccacagc tagcaaattt ttgtattttt agtagagatt ggtgtttcac catgttggcc 660aggctgttct tgaactcctg acctcaggtg atccacccac ctaggcctcc caaagtgctg 
720ggattacagg cgtgagccac cccacccagc cttatatttt ttaatgatgc acattagctc 780aattacataa accagggaaa tccagctagg acctggtgat ttctgagcct gacccatgtg 840actttcaatg aactgaactt gccacagctg tatttactgt ctactgagat gctgtcacac 900agaccccgtc atagcacagt tcctgagtta catctttaca tactgtagta tccttcttgt 960gaaaaaagat acagattcca aaggtctgag aaaccaatct tggttataaa ggggaaaaat 1020ggtcatgggt ttttaaaatt tgttttgtct taattgcatt tcaaatttac atttctaaat 1080gaataattgc ttatataaag cagttttgat taacaatata aaacactatc tatttggagt 1140gattccttta cccatttctg aaggcaagtt ttaaaaatta ctagaagaca cttcattgag 1200aatattatta aacatgccta tagttctacc acctcaacac aattgcttat taacacatta 1260atgttttggt gtgttttgga ctttttaata tgtatttttc acttgttcta gtaattatgc 1320tacagattga tcatttcttt ttcaacatgt catcaaagca agtgagcaaa gtgctcatcg 1380ttgccacata ttaatacaaa atggaagcag cagttcagat aacctttccc tttggtgagg 1440tgacagtggg tgacccagca gtgagttttt ctttcagtct attttctttt cttccttagg 1500ctttggccag cataagccat gtggctcatg ttactattac aacgaaaaca gctgatggaa 1560agtgtgcata caggtatagt gctgacttct tttactcata tatattcatt ctgaaatgta 1620ttttttgcct aggtctcaga gtaatcctgt ctcaacacca gtgttatctt ttttggcaga 1680gatcttgagt acgttttctt ttctccttat tgataaattg ataatcctca aggatgatta 1740ttaggtgata ctcttacttc atggattctt aaaagatatg atttaacata ttacaagtgc 1800ctagcaaggt gtctgttaca cgtaggtatt ttaagtaaat ggtagctgct gatgtaattt 1860ctgccccttt gcccttcagt
tggggtattg ctttggaccg attagagggc tgtggctggg 1920atgctaaagg ttcatgtttc cttagctggc tcctgagcca ccagctccca ccacctgtgt 1980atacctgtgc tagtttgcct tcccacaa 20083913197DNAHomo sapiens 391cctgtgctag tttgccttcc cacaagtagc tgctggctat ctgttatgct ggtacagttt 60tcagaaactg atgaatggcc tttgaacaga acaaaaatga gattcagaat aacaaaattg 120cacctttgtt tttataagca ctggccattc actagttgaa gactggtagg aatacctaat 180tcatgccaaa agaaagataa tttttaaaaa tcacacaggt tgtttgtaga ttaaaaggga 240aaataggcta ggtatagtgg ctttgcctgt gagtttggga ggctgaagtg ggaggattgc 300ttgaagtcag gagtttgaga ccagcctggg aaacagagca agaccccgtc tctacagaaa 360atttttaaaa aattagctgg gcatggtgat gcatatctgt agtcttagct actccggagg 420tgggaagatt gcttgagccc agcagtttga ggctgcagtg agctgtgatt acaccactgt 480actccaacct taaaataaat aaataaataa gggaaaatat cttcaacaaa ggatagttct 540gtctgtttct cagtcttcct caacagataa atgtgtgaag taatggaagg tggagatttc 600agattacaca acattaatgc taagggcgtt tgactctgtg tgaattctaa ttgccctaga 660tctagacggg ctgatactat tagaatcccc tgtcactaac tgaagacaga gttgtaagtt 720aatgccttcc tagatagcct agattgtggt atgctgctgc atgctaaaat ggctcccctt 780ccatagcagg atgaaataga gtcattatct tggcaaccag cccctgccaa tgtgctctca 840gtctgccttt ccagcccctt ctctctacct attcccagct gccatgtatt ctaaagcctc 900tatgctttca tttttgtttt tgccttcctg gatggtcttt cctgctgtct ccacctgaaa 960ctattcctct ctaaagaaca gatgaattgc catctctctg ggatgctttt acccaccctc 1020actcccacct caggctgaat ggacccttct ctagatcgct tagcatattg ttctacagtt 1080aggtaaaaag tctacctatc actagatcaa gagctttgtt tttttttatt aatttaattt 1140tctttttttt ttttcttttt tttttgagac agagtctcgc tctgtcgccc aggctggagt 1200gcagtgcaca atcttggctc actgcaagct ccgcctccca ggttcacacc attctcctgc 1260ctcagcctcc cgagtagccg ggactacagg cgcccaccac cacgcccagc taattttttg 1320tatttttagt agagacgggg tttcaccatg ttagttagcc aggatggtct cgatctcctg 1380acctcgtgat ccacccacct cggcctccca aagcactggg attacaggca tgagccaccg 1440cgccgagccc caagaccttt ctttattacc agggcttcca cagacctgac acatggtagt 1500tcctcaataa ataattgcag aattactgaa aaattttact gttaacttag gcagtggtaa 1560aaccattgtt 
tggtagctca gaactcagca agtaaatagc aacatttgct ggaagaacag 1620atagtttttc aaatccaatt caaggactgg gtatggtggc tcatgcctgt aatcccagca 1680ctttgggagg ccgaggcagg cgtatccagg agttcgagac tagcctgacc aacatggtga 1740aactccgtct ctactaaaaa tacaaaatta gccaggtgtg gtggtgggca cctgtaatct 1800cagctacttg ggaggctgag gcaggagaat cgcttgaacc tggtaggcgg aggttgtagt 1860gagctgagat tgtgccattg ctctccagcc tgggaaacaa gagcaaaact ccgtctcaaa 1920aaaaaaaaaa atccaattca aatgattatg gaagtagtgg agaaataaac aggaaaatga 1980taaataatta agataatata taatatggct atattttaat ctattgttga tatgattttc 2040tcttttcccc ttgggattag tatctatctc tctactggat attaatttgt tatattttct 2100cattagagca agttactcag atggaaaact gaaagcccct cctaaaccat gtgctggcaa 2160tcaagggacc cagatcacgg taagaatggt acatgggaga gtaaattgtt gaagctttgt 2220ttgtataaat attggaataa aaaataaaat tgcttctaag ttttcagggt aataataaaa 2280tgaatttgca ctagttaatg gaggtcccaa gatatcctct aagcaagata aatgactatt 2340ggcttttgtg gcatggcagc ctgccacgtc cttgtctttt ttaagggcta ggagattctt 2400tattgggatg gcaaaagtca atggcagggt agttgtcatt gaaagaagat taagcttgac 2460cccagaaggc atgggttaga gcccagcctt gtcactcaat ggttgtatgt ccagaggcaa 2520gtcacttaac atcccttaac cccagttttc tcatctgtca aatgaagcaa agaatacttg 2580ccctcttgac ttaaagggtg tctgatgaga catatgactg tatcattagc tgggagaaag 2640tccatcgtgc tgcctatgta tagtgcctca agttggtctc tttcccttct atgattacac 2700aaagcactcc gctgtcatgt tatccatccc gcccctccat tccaagtccc atctagagca 2760catcttcttg aagtccactg taacctgcct aatcctggat gtgacgagcc aggcaggagg 2820cagaaaagaa tgtgtgtttt gcaatacatg ttaagagaca tcttgggctg ggcacggtgg 2880ctcacacctg taatctcagc actttgggag gctgaggagg gcggatcatc tgaggttggg 2940agttcgagac cagcctgacc aacatggaga aaccccatct ctactaaaaa tacaaaatta 3000gccaggcgtg atggcgcatg cctgtaatcc cagctactca ggaaggctga ggcaggagaa 3060ttgcttgaac ccgggaggca gaggttgtgg tgagttgaga tcatgccact gcactccagc 3120ctgggcaaca agagtgaaac agggtctcaa aaacaaaaac aaacaaacaa aaaaaatctt 3180ttaccacggt gaccacc 31973922964DNAHomo sapiens 392gaccaccatg tgatttccaa gaacttcaaa tgatctaaga aattttgtga ttattactag 
60tttgaaaaat actttttttt tttttgagac aaagtctcac tctgttgccc aggctgaagt 120gcagtggtgt gatctcagct cactgcaatc actacctctt gagttcaagc agttgtcctg 180cctcagcctc ttgagtacct gggattacag gcatgcgtca ccatgcccgg ctaatttttg 240tatttttagt agagacaggg tttcaccatg ttggccaggc tggtctcgaa ctcctgacct 300caggtgaccc acccaccttg gcctcccaaa gttctgggat tacagacgtg agccactgca 360cccagcctga aaaatatctt tgaatgccat gtgatactat acttgtcagt ttacatgtgt 420gtcccactaa atcatgtact ctcctgagca ggatcatgct ttgtcttcat attttctgta 480caaagcaaag actctgacac aaagctagcc cccagtgcat agttgagaaa tcagtgaatg 540aatgtgggag gcaggaaaaa tgtcctttaa ttcttctgtt aatgctgtct tatccctggc 600cccagtcagt gcttagaact gtgctgttgg taaatataat tggattcact atcttaagac 660ctcgcttttg ccaggacatc ttgggtttta ttttcaagta cttctatgaa tttacaagaa 720aaatcaatct tctgttcagg tggaggacct tttttacaac atagccacga ggagaaaagc 780tttaaaaaat ccaagtgaag aatatgggaa aattttggaa gttgttggca ggtacagtcc 840aaaatctggg agtgggtctc tgagatttgt catcaaagta atgtgttcta gtgctcatac 900attgaacagt tgctgagcta gatggtgaaa agtaaaacta gcttacagat agtttctggt 960caaggtttag ccaccaattt tgcagtttct ctcatctccc caggaaagag cagttggtct 1020ttagatcaat gagagctctt ttatggcaga caaaacaaag tgactctagc caacttgagc 1080taaaaagaaa tttagtggaa ggctaggagt taccacatga agtgtgtgca gctgcccctt 1140ggagagaata agaaccaggg tgcctctggg acttaacatc attactgtac tccagttgtt 1200ttcattcttt tcctgacttt gctctagagt cagtttccta acagagtaca ttcgatgatc 1260atgtgcccat atctgtgggg agaagatttc ttgattggca gtcttactaa gggtgcatat 1320caagtagaat ggaatagagg tagtttccta aaggaagatg agaggctgtt accaggagga 1380ggagaaggga ttcaggacag atgaaaacaa cgttatatcc atgatagact tacgctgctg 1440gtacagatgg tacaggtggc ttcagtatag gctctccgaa cccacatatc attgattatg 1500atagggatat gttaactatt tttcagtgta tatatgtata tgtgtgtgtg tatatatatg 1560tatatgtata tatatatgta tgtgtatata tgtatatgta tatatttata tatgtatatg 1620tatatattta tatatgtata tgtatatatt tatatatgta tatgtatata tatttatata 1680tgtatatgtg tgtatatata tatttatata tatgtatatg tgtgtatata tatatatttt 1740tttttgaaac ggaatttcgc tcttgttgcc caggctggag 
tgcaatggtg cgatctcagc 1800tcactgcaac ctctgcctcc tgggttcaag cgattctcct gtctcagcct cccgagtagc 1860tgggattaca ggcacttgcc accatgcccg gcaatttttt ttttgttttt ttttagtaga 1920gagggggttt aatcattttg gccaggctgg tcttgaactc ctgacctcag gtgatctgcc 1980tgccttggcc tcctaaagtg ctgggattac aggcgtgagc caccatgcct ggccattttt 2040cagtatttct tttttttttt tttttttttt tttttttgag acagagtttc actcttgttg 2100cacaggctgg agtacaatgg tgtgatctcg gctcaccgca acctctactt cccaggttca 2160agcaattcgc ctgcctcagc cttctcaagt agctgggatt acaggcatat gccaccatgc 2220ccggctaatt ttgtgttttt agtagagatg gggtttctcc atgttggtca ggctagtctc 2280aaactcccga cctcagatga tcctcccgcc ttggcctccc agagtgctgg gattactggc 2340atgagccagc gctcctggcc catttttcag tatttctaaa aaaaatctaa agtgggtcaa 2400acatttcacc ttaatagaat gacaggtttg tacatcaagt ttctttgctt tttcttggaa 2460ttttatactt tttttttttt tttggagaca gagtcttgct gtgttaccca ggctggagtg 2520cagtggtgcg atctcagctc accacaacct ccacctccag gttgaagcaa ttctcctacc 2580tcagcctcct gagtagctgg gattacaggc acatgccacc acacccggct aatttttttt 2640ttttttttgt atttttagta gagacagggt ttcaccatgt tgtccaggct ggtctcgaac 2700tcctgacctc aggtgatccg cccatctcgg cccaccaaag tgctgggatt acaggcgtga 2760gccactgcac ccggcctttt tcttggaatt ttatcaatca gtgtcagaat attcattacc 2820tcctaaaaat aaaggagttc tagttggctg ttttgattct aggtgtggta aagtgaaata 2880ttgttactta ataaatgcat tttgctagac acaatccttc ggttcacgag ctctgtagag 2940aaaagagaaa taaccgccaa ccaa 29643932864DNAHomo sapiens 393taaccgccaa ccaagaaaag attgggagat actagaataa gacccagggg caggaagaag 60ccagtgagaa ggagggcatg ttgagagctc tgagagagaa taaaagcagg ggttgttgga 120gctagcttct caagatgtcc ttgaggcaaa ccagaccttt gggacactct gaaaataaaa 180ctgaaagtga agagattgtg ggccgaatgt ggtggctcac gcctgtaatc ccagcacttt 240gggaggtcga ggcgggtgga tcacctgaga tcaggagttc gataccagcc tggccaacat 300ggcgaaacgc catctctact aaaaatacaa aaaaaattag ctgggcctgg tggcaggcgc 360ctataatccc agctactcgg gaggctgagg cgggagaatc gcttgagtcc aggaggcgga 420ggttgcagtg agctgagatc gtgccattgc actccagcct gggcaacaag agcaaaactc 480tgtctcaaaa ataaataaaa ataaataaaa 
aagagatagt ggcgtgatat ccttgattct 540atcagcaacc tataaaagta gagaggagtc tgtgttttga ttcagtcacc tttagcattt 600ttatttccat gaagtttctg ctggtttatt tttctgtggg taaaatatta ataggctgta 660tggagatatt tttctttata tgtacctttg tttagattac tcaactccac taatttattt 720aactaaaagg gggctctgac atctagtgtg tgtttttggc aactcttttc ttactctttt 780gtttttcttt tccaggtatt cagtacacaa tgcaggcatt agtttctcag ttaaaaaagt 840aagttcttgg tttatggggg atggttttgt tttatgaaaa gaaaaaaggg gatttttaat 900agtttgctgg tggagataag gttatgatgt ttcagtctca gccatgagac aataaatcct 960tgtgtcttct gctgtttgtt tatcagcaag gagagacagt agctgatgtt aggacactac 1020ccaatgcctc aaccgtggac aatattcgct ccatctttgg aaatgctgtt agtcggtatg 1080tcgataacct atataaaaaa atcttttaca tttattatct tggtttatca ttccatcaca 1140ttattttgga acctttcaag atattatgtg tgttaagagt ttgctttagt caaatacaca 1200ggcttgtttt atgcttcaga tttgttaatg gagttcttat ttcacgtaat caacactttc 1260taggtgtatg taatctccta gattctgtgg cgtgaatcat gtgttctttc aaggtcttag 1320tcttgaaaat atttatagtg tagtagaact attttatcct ccaatgctcc ttcttttcct 1380tgtatttcca ttatcatcac tttaggattt cacttattta tcattcaaca tttattaatt 1440gcctctcata ttccaggctt tgtgctagaa gttagggata taaagacaaa taagatattt 1500cctgccctta aagactagat tcgtgttgct aagtcttcat tatcaagaaa agcataagtg 1560gggaaaagtg cttgcattat ggattcctca tagttgctcc cctctgcatg taaaaatcac 1620catttccatc atagattcct agcggtctca ggactttata aagcccaaag tgcctatgtc 1680ataatatgag gaaaaatact gagacccttc catatatggg aggtatatgg atgagacagc 1740tcctgacttc acttttccca gaaatctgaa aagcagcagc agtcattcca gagcccagtt 1800tctactttga agggcagatt atttattctt tgagctaacc tgactgagga acaattagtt 1860tgcttttaat ttactatttt ctttttcttt tcttttcttt tttgagacag agtctcactc 1920tgttgcctag gctggagtgc agtggctcaa acttggctca ctgcaagctc cgcctcccgg 1980gttcacgcca ttctcctgcc tcagcctccc gagtagctgg gactacaggc gcctgtcacc 2040acacccagct aattttttgt attttttagt agagacgggg tttcatcgtg ttagccagga 2100tgatctcgat ctccagacct cgtgatccac ccacctcggc ctcccaaagt gctgggatta 2160caggcgtgag ccaccgtgcc cagccactat tttctttcta attgttaatg aattaatttt 2220ttaaaactgt 
gctcctagag cgaagggaga gctctgttta cagtgtaact tttcagagct 2280tctttaacta gattttaaga tcagaattag ttgttgtgaa atcttaggga ctgtacaaga 2340ttagaaatcc tctatagcag catttcccaa agcaggcttc cagaacacta gcctcatgag 2400gcattttggg aaaaaagagt ttgctggttc agtgtgtatg ggcagtgcca caagccgtac 2460cctccgttga agacactcat tccacacatt actgcataaa aagcttccac cagccattcg 2520gcaaacttat tgagtgtctg ctatttcctg ggtattgtgc tatatggtag ggttatagta 2580gtgaacaaag aagaaatgat gcctgctctc agctgacttt gcagttggaa agacacatga 2640aataattacg ccattcatta gcagattgtg ctagatgcct cactggaaaa ataaaggaca 2700tgatggaaaa ctctgtaggg tcagagaaag ggatcattag agaaggttct ttgaagaaat 2760attttttgaa atatgaagga taaataggaa ttaactaggt accaataggt taggagtaga 2820gctttccaga cagagggact agttcttggg aaggtctcca gaca 28643943013DNAHomo sapiens 394tgtgctagat gcctcactgg aaaaataaag gacatgatgg aaaactctgt agggtcagag 60aaagggatca ttagagaagg ttctttgaag aaatattttt tgaaatatga aggataaata 120ggaattaact aggtaccaat aggttaggag tagagctttc cagacagagg gactagttct 180tgggaaggtc tccagacaga aataagtgtg gcttgtctga ggacctctta ttcgcctatt 240aaccttccct ccccagtaaa cactcctggg aacaacacac attgtagaac cacgttgtgg 300tgctgttcag tatagcaagt aattcagcag agataagttc ttggaatctc atctttggga 360tttagttact aagatacatt caagtttgag caaaataagg tctcagagct tggattcatt 420gttctgttcc agcaattaga gcagtacctg gcacatagca caagtgcttg aaaacactga 480ctgagtaggg taggtgggtg agtgggtggg tgggtgggtg ggtggatgga tggatgggag 540gatgggtggg tgaatgggtg aacagacaaa tggatggatg aatggacagg cacaggagga 600cctcaaatgg accaagtctt cggggccctc atttcacaaa gttagtttat gggaaggaac 660cttgtgtttt taaattctga ttcttttgta atgtttgagt tttgagtatt ttcaaaagct 720tcagaatctc ttttctaata gagaactgat agaaattgga tgtgaggata aaaccctagc 780cttcaaaatg aatggttaca tatccaatgc aaactactca gtgaagaagt gcatcttctt 840actcttcatc aaccgtaagt taaaaagaac cacatgggaa atccactcac aggaaacacc 900cacagggaat tttatgggac catggaaaaa tttctgatcc ataggtttga ttaaacatgg 960agaaacctca tggcaaagtt tggttttatt gggaagcatg tataattttt gtcctaagtc 1020tgtgctcagc cctcccacat gtgctcattg ctggttgact gttggagtct 
ggttcttacc 1080tctaagagga agcccaggag agggcataaa gccagcacac tgtcctcacc tgatggtgtc 1140agagtcctta cgagtaagcc ctagccagaa cattgctgga agagatcaag ggccactgtt 1200tgaaattgca cagcaggata cggaaaaggg gtaccttagg tataggcatt gtcattaaag 1260aaattgctaa gatacttgag attttcctgt ttaaggaatg agctttatga tacaaagagc 1320agttctaaaa attagggagg gaattaacta aattaattag gatatttctc aaattccttt 1380acagtttttg tctctctgct gatatagtgt ttacatgatt gttatttact aaacaaatgc 1440tattttgtat tgtgctcctt ataacttaat tgtttattac aaggttttga tggtgaccta 1500ccaacaacaa gtaatcccaa acacagtctg aattttttgt tttccatcca gaaataagat 1560gaatctttcc atttccgtgt tttcagtttt catcattttt atcctatagg ttacttatct 1620ttattttaaa gcatttcata ataattttat agtttttgtt ttgtttgctt gtttgctgtt 1680ggaaatggaa tattccctcc ttccatttag actgctaacc agctgtaaat gtttcaaaat 1740atgcatgttt tacagcagtt gttcaaagca atacaggaac agtaaggaca gagccagtca 1800ttttacaacc acattctgtt aaactgatgt ctattagcag ggtttttcct attttattag 1860gaaggactta cacctgatat ataacaaagc ttgttttaat caaggctcag aaaatgtttt 1920tcattagttt ttttcctaac catgaagaat aactgctttg taacacacat gctggctata 1980aagcagacaa aaaattcact gtaggtgctg cctgactggc ctctgtccgt gtttctgttg 2040gggctgctta ccacagcctc tgcattatca ttagctagtg tgttcacaat accaagttcc 2100cagtagcaaa gaaaggtcaa gctcttacgc atgccattca tttatctaca ctgtgcaggc 2160gcactcaggt ggcagggaca aagaccactc ctttggcgca tctcaagttc agaattctca 2220gtagaggggc tccagctgtc cttttgtcag gtgcccatgc ctgctccagg cctgtgtggt 2280caggacacgt gttacagagt acagtgacat taatgatggg gccatggata tggtcagcac 2340tcagaggatg ttagtctctt cattgataaa gtcacaacca cttttcctgt tggaaataaa 2400aagatttgac gtatccttgt ctacagcaac acaggacaac agataatcag caggtcatct 2460aaatctgttc agagagaaag gagagctgtt tcctgaaaat acatcttccc ctgattttag 2520tcttattttt ttctgccttt attgctttct accctcttca aaccagcctc atttcctaaa 2580ttaccttgaa tatgcattga cacttgtact gcctgaaatt ctggaaaact cagtatggct 2640actccaccgt cagaacttcc tgagcaaagt tagttgctct ctcggctcac tgttttgttt 2700tgttttgttt tcctgcctca ggtttatttg tacaaatagc acaggaggac cagccccatg 2760cagatggtag cccaggggcg 
ggggtagggg gtcacaccag tccttctgtc ctcatgttgg 2820cagagatatc tactctgaag cctttgtagg ggcctgggca cctttgggag cctgagctgg 2880aactgaaggt ggagctgcag cctgggcctt ggtttgatcc ttggccttgg cctttggccg 2940gcacagcctg agccccttgg caatacgggc acgagcacgc ttcccaagct tgggatgggc 3000aatgtaggca agt 30133952915DNAHomo sapiens 395atgggcaatg taggcaagtc gatcgagctt gcggctgaca ccctttggga tcttgggctt 60aacctccttg ggctttacga gggccttgat agcctcggca cgtgcactca tggccttggc 120attgttggcc tgcatcttct ttaggccctt cttgttgtgc ttcttggcaa agtgcatgtt 180cctcaggaac ttggggtcca cccccttaag agattcgtat ctttgtgatc ggggtttctt 240gataccattt ctgtgccatt ttcgggactg gttgtgtgtg gtgtggttct tggacttcgc 300catgtctaca ccttaagccg cggctcccga agcacctaga accggaagag ttggctcact 360atttagcaca cacacacgtc tataatagtg ctggccactt ggggttggaa ttagtttatt 420tatcagcatg ttgtctccca gcacttggtg tgtgtgatat gcagtatgta tttgcagaat 480gaaaagtctg agggctgaca tcatatttcc cactgtgccc agaaagagca cagttagtcc 540acatgagcta atgggggcaa agggaagtga ggagggagaa tgtactgcct tatcatgttt 600tctattactt ggctgaagta aaacagtccc aagccgatag taagatagtg ggctggaaag 660tggcgacagg taaaggtgca cctttcttcc tggggatgtg atgtgcatat cactacagaa 720atgtctttcc tgaggtgatt tcatgacttt gtgtgaatgt acacctgtga cctcacccct 780caggacagtt ttgaactggt tgctttcttt ttattgttta gatcgtctgg tagaatcaac 840ttccttgaga aaagccatag aaacagtgta tgcagcctat ttgcccaaaa acacacaccc 900attcctgtac ctcaggtaat gtagcaccaa actcctcaac caagactcac aaggaacaga 960tgttctatca ggctctcctc tttgaaagag atgagcatgc taatagtaca atcagagtga 1020atcccataca ccactggcaa aaggatgttc tgtcccttct tacaggtaca aggcacagtt 1080ttccttcatt tattcactaa tttagcagaa cctcactaag agcctcctat atgccaggct 1140ctgcgttagc aataaaagga atgccatgcc tcaccccatc aggaggtgct gatagcttgt 1200aggcggagtg gaaacagatg tgctctagag gctctaaata ttacttctgc tggggtcagt 1260tgggaagcca caacagctac tgttcatctt ccataaaaga caatcagccg ggcacagtgg 1320ctcacacctg taaatcccag cactttggga ggctgaggtg ggtggatcac aaggtcaggt 1380gtttgagacc agcctggcca acgtggcgaa accctgtctc tactaaaaat acaaaaatta 1440gccaggcatg gtggcgggcg cctgtagtcc 
cagctactcg ggaggctgag gcaggagaat 1500cgcttgaacc taggaggtgg aggttgcagt gagctgagac tgtaccactg cactccagcc 1560tgggcgacag agcgagactc catctcaaaa aaaaaaaaaa aaagactggg ttctgttctg 1620tggaggttct tgtcttaaca tatccactgt tgattgccca gatgttgatg taattaattt 1680agcagtcgta aatagtttag cacttgcatt aaatagacca aaccccatag taggtatttg 1740aaatacagaa taaatgtgag gtacccctgc tctaaaggag tttatagtcc agagctgact 1800tatggaggat ttctttctat tatttctggg tctgctacta atttgtctat ttcatatcct 1860aattatcctt gttttcattt tgattgaaag ggggagagca tagaaattgt ggtaaaaggt 1920agttttattt tttatttgag atggagtctt gctctgtcac ccaggctgga gtgcagtggc 1980acaatctcat ctcattgcaa cctccacctc ccgcgttcaa gcaattctcc tgcctcagcc 2040tcccgagtag ctgggattac aggtgtgcac caccacgccc agctaatttt tgtattttta 2100gtagagatgg attttaccat gttggccagt ctggtcttga actcccgacc tcaggtgatc 2160ctctcacttt ggcctcccaa agtgttagga ttacaggcct cagccactgc acccagccta 2220aagttagttt tagattaagt gttttcatgt tttcccttgc aaagtaataa actggtcaag 2280ttatcacctt gttccatctc catattaatc agggtccaaa caggagatag aaaccatgca 2340acaatttgag tagttgaata aagaattata aacaggagat tagagtaata ggggattaga 2400tagtaagagg tgaagagata ggaacagcag atataaagaa caaccatttc ctcctatggc 2460tgagatacca tcccctcacc acactccccc acctactcac tgagatgcag accttattga 2520agagaatgta actggcttgc tgcgaggtaa agtcaatgag gcgctcccca gtaccactct 2580gaggggatgc tggggaaaac tgcccatgag aagagggcac
atgctgctgg ccacttgtgc 2640taaagaactt gaagtctgat aggagtgcac cctaacctgg catagaaacc ctttcttcct 2700gctgagtccc tctagcacct tatactggca aagctttaca ttgcaaacct ccattatcac 2760agagcaagca atgaaagatg gactcagagc tgaggcgata aattgatagc tagcatagcc 2820tctaaactga cttttatgac tacattttat ggatagaaag tgttcttata tatattgttt 2880ctttacataa taggggactt attcatggct gcaga 29153963033DNAHomo sapiens 396cagagctgag gcgataaatt gatagctagc atagcctcta aactgacttt tatgactaca 60ttttatggat agaaagtgtt cttatatata ttgtttcttt acataatagg ggacttattc 120atggctgcag atgagaaaac agatcctaag aagttaagtg acttgcccaa ggtcacacaa 180agaattccac tagttctaaa atgacagtaa ttacagttaa catacattgt atgtggcaga 240tacatataaa gcacatggca ttaatttttt tttttgagat ggagtcttgc tctgtcgcca 300agctggagtg cagtggcacg atctcggctt actgcaacct ctgactccct ggttgaaggg 360attctcctcc ctcagcctcc cgagtacctg ggattacagg catgcgccac cacgcccagc 420taatttttgt atttttagta gagacgtggt ttcatcatgt tggccaggat ggtctcgatc 480tcctgacctt gtgatccacc cgcctcggcc tccccaaatg ctgggattac aggcgtgagc 540caccacgccc ggccacttgg catgaattta attcccgcca taaacctgtg agataggtaa 600ttctgttata tccactttac aaatgaagag actgaggcaa agaaagatga tgtaacttac 660gcaaagctac acagctctta agtagcagtg ccaatatttg aacacactca gactcgatcc 720tgaggttttg accactgtgt catctggcct caaatcttct ggccaccaca tacaccatat 780gtgggctttt tctccccctc ccactatcta aggtaattgt tctctcttat tttcctgaca 840gtttagaaat cagtccccag aatgtggatg ttaatgtgca ccccacaaag catgaagttc 900acttcctgca cgaggagagc atcctggagc gggtgcagca gcacatcgag agcaagctcc 960tgggctccaa ttcctccagg atgtacttca cccaggtcag ggcgcttctc atccagctac 1020ttctctgggg cctttgaaat gtgcccggcc agacgtgaga gcccagattt ttgcctgtta 1080tttaggaact ttctttgcaa gtattacctg gatagtttta acattttctt ctttgaacct 1140agttataaag gtattgtgct gttgttccta ggcttagagt cataaggcct gagctcactt 1200cctcactttg cctccatctg gaaccttaga ccaacttcct aggaaaacga gctgtctgaa 1260aacagaatag ggtgcctctt caatgtgctc ttcactggag atgttcagga ggaggctact 1320cccacctaca cagggtgcag tggagggtct gggccccagg gaggcagcag gaagagtgga 1380aagagcggag gctctactgt 
tggacagacc tgggttacca gccgtgtgac tagccttccc 1440tggcctccat atccccctca gtaatgaagg aatgtgtcat ccccaaatcc agggacagtt 1500acaagcagtc agtgaacaga aagtgtctgg tacaggttct aagtgcttat tattctaagt 1560cacttcactt acctgagttc tcagttttcc tatctataag ataagcaggt tggataaaat 1620gttctccaat atactcctgg tcctgagatg atgtgattgt gggcagccct ttaatcatgg 1680tgaagatgtt catcataagc acactgaaac tacaaaatag gaatataaat attttctcca 1740ttaaattatg ctggatccta gaagcaaaaa ctggaactgt gaaaccctac ttcacagaaa 1800acttaaaatt cccaagcaga tgaatgcttc tcggaaggac actgacagtt acctacctgg 1860aaagaatcta gatggaggtg gcatgggcac taagcggtga gattaaaccc agttagggca 1920gccccaccag ccttggaacc cacacatctg gagattgttg atgcagagag aaaggttcct 1980actggtgaga cctgaaaggg atatgtggca ggtgggagga agaagttctg tctggaaacc 2040aacccttgtt cctccgttat tgattgactc ctggtaccaa catgagccct aggtcttata 2100gaggccataa gtccctatgc cttatagtgc ccatggatga gatgaggcca cacatgcccc 2160cagtgggtta acatgtctag cgtgggtaag gctcttggag cactatgata cacaggaaat 2220gcccagtaac tcttagttgg tttgatatct gttcccattg ctcacttaag ctcagtgccc 2280ctttactgat ccttttattc tgcctccctc tgcacatgtg cattgagact cctatctgag 2340acacacactg tgttgggtgc ccagggatgc agcatagatg ttgctgcctt ccacagaagc 2400gctcatggtc tgctagagaa tatatcccat gggagagaaa aacagactcg ggagaatata 2460gcaggggccc ttgtcctgga ctttggcagt taggaaaggg agggaagaga catggaggct 2520gggacccaaa ggctaaatag gaatttgctg ggccaaaggg gagggggaat gaaaagagtg 2580tttctggcag aggaaatggc aaggataaag gcctggaggc gcaagagaat atgtgtttga 2640ggatctgaaa gttgagtgca gtgggtccag tgttctctac cctggctgcc attagaatta 2700cctgggaaac ttttagaaaa ttccagtgtc tgggccctcc ctaaaacaat aaatcattct 2760tgggtggtgg ggtctgggca tcaggattgt ttaaaaccct ccccaggtac tgtcatgtgc 2820agctggggtt aagctgtgct ggggtctgag tatggatctg ttagggcaag tggcggtgat 2880ggagttgagg ctgcagaatt caggccaaat agagaggttt tcatcaggat attaaagagt 2940ttagatttca atttggtggg aatggatggg atcttatttg cattttatga agagctccct 3000ggttgcaata tcagaatgga ttggagagga gca 30333973024DNAHomo sapiens 397atactttccc agcccaaacc ctggaatagg ccttttctcc gaggagctct 
agttcatttt 60agtgggaaat ggtatttaga gactataatc tgggatctgg gagtcctcat tgctactgag 120tagtcattac ttttaggctt ttccagtggt cagagctagg aaatatgtat atttaaaaat 180ggacagttga atggttgttg ccaggagctg ggaggaaggg gaagtgagaa attgtttaat 240gggcacagag tttcagtttg gggaagatga aaaagttcta gagatagctg gtggtgatgg 300ttgcgcaaca atgtaaatgc cactgagctc tcatttaaaa atggttaaaa tggtaaattt 360tatatatatt ttaccacaat aaaaaaaagt cttcttctgg gagcaccccc ccaagacaaa 420aatatgaaaa ttttacactg atacttccat ttcaagataa ttttaagatt ataaggattt 480tgcttaattc ttgaatttta tacctgtaaa ccttttatac ttcaaatttc gggcagaatt 540gcttctataa caatgataat tatacctcat actagcttct ttcttagtac tgctccattt 600ggggacctgt atatctatac ttcttattct gagtctctcc actatatata tatatatata 660tatatatttt tttttttttt tttttttaat acagactttg ctaccaggac ttgctggccc 720ctctggggag atggttaaat ccacaacaag tctgacctcg tcttctactt ctggaagtag 780tgataaggtc tatgcccacc agatggttcg tacagattcc cgggaacaga agcttgatgc 840atttctgcag cctctgagca aacccctgtc cagtcagccc caggccattg tcacagagga 900taagacagat atttctagtg gcagggctag gcagcaagat gaggagatgc ttgaactccc 960agcccctgct gaagtggctg ccaaaaatca gagcttggag ggggatacaa caaaggggac 1020ttcagaaatg tcagagaaga gaggacctac ttccagcaac cccaggtatg gccttttggg 1080aaaagtacag cctacctcct ttattctgta ataaaactgc cttctaactt tggcttttca 1140tgaatcactt gcatcttctc tctgcctgac ttgccctctg gaatggtgct ggaatggtcc 1200tgtggccttg tccactgtct gcctttgacc ataacttgaa agtcacccac catagtgtcc 1260tttgaaataa cttaaatgtc cacagttcca agcatgagtt aaaaacactt cagaatgtag 1320agtagttgtt caattgaata aacacacaca ccagaaaaaa aagcaagttt atcttttatt 1380tttagtaaag aattttgata gagcctcaac accagaaatg gctagagaga gaagcctaac 1440atatctggag gattattttt catcctactt aaagctgctt tcactttttt caggaaaaaa 1500cacacgttct gaatctaatt tataaaactc cctggccggg tgctgtggct cacacctata 1560atcccagcac tttgggaggc tgaggcaggt ggatcacctg aaatcaagag ttcaagacca 1620gcctgaccaa catggtgaaa ccccatctct actaaaaata caaaattagc cagacgtggt 1680ggcgcatgcc tgtaatcccc gctactcggg aggctgagac aggagaatga cttgaacccg 1740ggaggcggag gttgcagtga gccgagatcg 
cgccattgca ctccagcctg ggcaacaaga 1800gcgaaactcc gtctcaaaac aaacgaacaa acaaaaaccc caaaaatccc tgaagtacgt 1860gagctagtgg tgaaagaaag ctggagaaaa ggagcaggaa taataataat aataataata 1920ataataaaga ttgtcattta attttgagta cttccagtgt acactttgca ggtactctaa 1980gacattacct cactgaaatc tctaaggtag atattcttta tttaaagtgt acttgtatga 2040aacctggagc tcaaggtgaa ggaatttgcc caaggctgca cttgcactat cgtggcacta 2100attagccgtg tgaactggga cacgttactt cagtttgctc atttctgagt cagcctagca 2160agatgacttc taagaatttt ttccagccgg gtacattggc ctgtaatccc agcacttcga 2220gaggccaagg tggaagggtc acttgagtct aggagttaca cacaacacac acacacacac 2280acacacacac actagccagg catggtggca aatgcctgta gtctcagcta ctccggaggc 2340tcaggtggaa ggatcacttg agcccaggag gttggggctg cagtgagcca tgatcacgcc 2400actgcactcc agcctggctg acagagtgag atcctctgtc tcaaaaaaag aaaaaaaaaa 2460agattttttt ccagggaata ataaaggaag ctaatattta tggagcatct acggtgtgcc 2520aaatactttg catacgttat ctcatttaat gctcttatcc ctgcagggaa agtattaaca 2580tttgtttatc acttgcagaa ctaagtgata tttaccacag agtagacaaa tattttcaag 2640cccaaaatca agtggtatca cttttctgct gagaatgttt cagtggtttc ctttgctctt 2700gggataaaac ttaaatccct caccctaccc ttgctccaac cctccacttt ccttctccca 2760tgtggtgatt tggccataca gctcttgtgg ctgatctgaa ctgactgagc tttttaccct 2820tttgctcttg ctgttcttac agcctgggaa ccccctggtt acctcttggc ttggtgtggt 2880ggcttacatc tgtaatccca gcactctggg aggccaaggc ggacggatca cctgaggttg 2940ggagtatgag accagcaagt cacctcttgc cagtggcctt tgtccattga gtctgaagtt 3000ctttctcctc tcatttcccc atca 30243982157DNAHomo sapiens 398agtggccttt gtccattgag tctgaagttc tttctcctct catttcccca tcattctatt 60atgctacctt gttttatttt cttcattgtg tttattgata cttaaaatga tctcttttct 120gttgctgttt gactctccca ctagaaagta agcattgtag atcgggcact gtggctcaca 180cctgtaatcc cagcactttg tggggcagag gcgggtggat cacctgaggt caggagttcg 240agaccagcct ggccaacacg gtgaaacccc atctctacta aaaatacaaa aaatagctgg 300gtatggtggc tcgtacctgt aatcccagct actcaggagg ctgagacatg agactcactt 360gaacctggga ggcagaggct gcagtgagct gagatcacac cacagcactc cagcctggaa 420gacatagtga gactctctct 
caaaaaaaaa aaaaaaaaaa aaggaagtaa gcattgtgag 480ggcaggtacc ttctctgttt tgttcattgc tggatgtagt tagtatacag cagtatctga 540tggatggata gatggaggaa tgaatgaatg agacttcaca aattcagctc acttgctcaa 600ggccctgcag ctctacggga tgaagctata ctccagagtc ctgctacatt ggctgtgtgg 660ccagctgctg ggatctgagg gttgtcagat aagcagtcta ccagagaaca gactgatctt 720gttggccttc tgccagcaca ggggttcatt cacagctctg tagaaccagc acagagaagt 780tgcttgctcc tccaaaatgc aacccacaaa atttggctaa gtttaaaaac aagaataata 840atgatctgca cttccttttc ttcattgcag aaagagacat cgggaagatt ctgatgtgga 900aatggtggaa gatgattccc gaaaggaaat gactgcagct tgtacccccc ggagaaggat 960cattaacctc actagtgttt tgagtctcca ggaagaaatt aatgagcagg gacatgaggg 1020tacgtaaacg ctgtggcctg cctgggatgc atagggcctc aactgccaag gttttggaaa 1080tggagaaagc agtcatgttg tcagagtggc cactacagtt ttgctgggca agctcctctt 1140cctttactaa cccacaatag catcagctta aagacaattt ttgattggga gaaaagggag 1200aaaaataatc tctgtttatt ttaattagca ttaattggta ttcttgttaa accataggag 1260tcagagtaaa tcagccattt caccaatttt cagtttgttt ctgtcttagc taacagcagt 1320gtaatggtca gcaaaattct tatcttgtgt actgaatggc atgtcctgtt gctgaaagtg 1380cacaggcttg ggaggtagcc atgagctcaa atcctggcac taccacctct cttgtgtgac 1440cttagactcc tgacctttct atgcctcagt tctttcttac ctataaaatg aaattaattt 1500tacccttaaa gatcatcgtg ctgattagag ataaaatata aataataaca cttgttacag 1560agcaaggagt tgacactttt atattctgaa gacaaagtgg taaatcatta tcatctatgt 1620cagaaatagc ttttgagaat acctgagtat agaactatct tgatccctgt tacttcaaaa 1680ctaaaataat ggttttagga attaaaaggt gaggctagtc acctccaagg gatgaactga 1740ctcagggatt gaggtatata acagtgaact ggtccaaaca acagtcctga ccccacttta 1800tgagtgagac tatgagtaat ggtctaagtg tagacatcat tgtccagggc tccagtaggc 1860agctctgtac ttgagaattt agcagtgacc cttctatttt tcatctatta tacctttttt 1920tttttttttt tttgacacag ggtctcactt tgtcacccag ctggagtgtg gtggtgcaat 1980catggcccac tgcagcctca acctccctgg gcttaggtga tcctcccacc tcagcttcct 2040gagtagctgt aattacaggc atgtgccatc atgcccagct aatttttctt ttcttagagg 2100tggggttttg ccatgtttcc caggctggtc ttgaactcct aggctctcac ctctgtc 
21573993163DNAHomo sapiens 399aatgtgttgg ggaagtggtc tcctattaga ctctccattt caaaccattc catgattttg 60tcctcctttt gccaccttcc gagcctgtaa aaactaatgt ttgtgattcc tgaggtttct 120ctaatgtctt ttaataaagt tgacctcaga gatctcgtta cctctctgag ttcctgcttt 180gtcttagatt ttgatccttg agtgttcttt aatcttttag caattccttg ttgcatgtta 240aaagattagt tatattttat tcctcatttg tgttcgtttt caccaggagg ctcaattcag 300gcttctttgc ttacttggtg tctctagttc tggtgcctgg tgctttggtc aatgaagtgg 360ggttggtagg attctattac ttacctgttt tttggtttta ttttttgttt tgcagttctc 420cgggagatgt tgcataacca ctccttcgtg ggctgtgtga atcctcagtg ggccttggca 480cagcatcaaa ccaagttata ccttctcaac accaccaagc ttaggtaaat cagctgagtg 540tgtgaacaag cagagctact acaacaatgg tccagggagc acaggcacaa aagctaagga 600gagcagcatg aggtagttgg gagggcacag gctttggagt cagacacatg tggtttcaaa 660tccaagttcg accatttccc atttatttga ctgtagacaa gttacattcc taaactatgt 720ctcagatttc tcatctgtaa gttgtggtat tactagttaa catgcagggg ttttgtttgt 780ttgtttgttt gtttgtttgt gagggtaaga aataacccaa gaagcctagt ccttggtagt 840tgctcagtgc cctataaatg ttgtgaacca ggtggtgagg gtttggtgct gctagagaat 900tctggtatct gctctgtgca acagagtact gtaggtgatg caagagaaag aagacctgat 960gccttctttc ctcccagctt tgagaatgga gcaaaggcct accccagcca ccaagtgagc 1020cagtgggctt gatcagcaca ggaaaggtga ccccggcagt ttcatttgac tattgcatgg 1080ctggcaacat ttctattgat tgtttccagg gaccttggcg gatgagctcc tgttgagtct 1140agcatctctg ttaaatctgt tctcaaatag gtaatgcata tgggaggatg ctgccacctt 1200gcatctacta gacatcacct atctactgtg agactctccc tctaagccct gctgtggcct 1260cagagtgctt attggccctg tgagtggggc agccactata cattgcatgg agttggtaca 1320tgagatagaa acctattcgc catcccttga aactgcccca gtccagaagc ttcctgttag 1380cacatgtacc tccttgtatg tattcagaac tcattccatt taggcttgga aacccgtttg 1440gtgcaactct gttcaagttc cattgtctgc tttgagaatg cttgggcttg tatagtgagc 1500tgtcactttt taatttgtta ggaattctac tcgccttgct ttttcttttc cagcatgttt 1560aagggaatga cctccaaggc cccaaatcac agttgtattc atgttctttc atttcacaga 1620tacaatccag gccagtccca gatttgcagc tgttaataaa tgtgaatggt tttccagtaa 1680gggggtagaa aaacataggg 
agagaaccgg gttcagagtt caatatctgg attcaagtcc 1740ttcctttagc actttactaa ctgatgtaga ataagtcagc tactcaatag gtgcctcagt 1800ttccccacca aaatgcagac atagaaggtg ctttgtctgc tttgatgaga agtctttaag 1860caagtctatg gggttcaatg tgttttaaga actataaagt accatataaa tgtggccttt 1920attcccattg tgttcttgga agtaattcaa tatagtgtgt acttcatagc tgcttttgga 1980ctattgccag ccagtgtatc atcctaaact acatgtcagc atagtataat cctgccttag 2040gtctactttt gattatttag gaagactccc tgcccttcct atacatttca cataattttt 2100aataagttgt aaaaaagtga tttataggat tctttgtaag tgggggaagt taagcagaca 2160aaaagttttt aaatcttact gcagagtgtc aggaaccttt tatagcacca gacaggtagg 2220gacagaacat gagtggcagc aagccagact tggtcttagt gctctaacct gtctgttaga 2280ggctggccag tcagacccct ggttgaagac gttgggaatc ccagctcttt ggaggggtaa 2340gagattttgt tagactgtta accagattcc acagccaggc agaactattt ctgtctcatc 2400catgtttcag ggattacttc tcccattttg tcccaactgg ttgtatctca agcatgaatt 2460cagcttttcc ttaaagtcac ttcattttta ttttcagtga agaactgttc taccagatac 2520tcatttatga ttttgccaat tttggtgttc tcaggttatc ggtaagttta gatccttttc 2580acttctgaaa tttcaactga tcgtttctga aaatagtagc tctccactaa tatcttattt 2640gtagtatgtt aaatttttct aaaacttcta aggatagttg ctgtattgta tgatttgcat 2700atggaggtat ctataagaag ttttatactt tttagcaaaa tagtcatttg gtagccaact 2760taaacaaatg tttattaata tagaagttaa taatatctac tgatactcgg ccgggtgcgg 2820tggctcatgc ctgtaatccc accactttgg gaggctgagg cgggcagatc atttgaggtc 2880aggagttcaa gaccagcctg accaatatga tgaaaccctg tctctactaa attacaaata 2940ttagcagggt atggtggtgg gcgcctgtaa tcccagctac tcaggaggct aaggcaggag 3000aatcatttga acccaggagg cagaggttgc aatgagctga gatcacgcca ctgcactcca 3060gcctgggcaa cagagcaaga ttccctcaaa aaaataaata tctactgaca cttaatactt 3120ggaaagggat aaaaataaac attgtctaaa gccgtggtcc aaa 31634003066DNAHomo sapiens 400aagctgaggt cacggatttg agacctttct tcttttctaa tacaggtgtt aagtgctaca 60aatatccctt aagcactgct tcaacagcat cccacaaatt ttgatagttt gttttcattt 120tcattcagtt caaaatacct tctaatttcc cttttgattt cgtctttgac ctacaggttt 180tttagaactg tgttatttag tttccaatct cttgaggatt tttaaaacaa 
tatgttattg 240atttctaatt tatttccatc tcagtcaaag aacatacttg ccttttttta tacatttatt 300gaaacttttt ttatggccca gaatatggtc tgtgttggta aatgttccat gtgtacttga 360aaataatttg tattctgatc tcattgagtt gaatgttcta ggtatatcaa gttgatagtg 420atgcccaagt ctcctgtatc tttactgatt ttctgcctgt tctgttattg agaaaggggt 480attgaaactt ccaactataa ttatgatttg tctgttctct ttgcagttct cttagttttt 540gccttcatat atatatacat atatatgtat atatatatat attttttttt ttttgagatg 600gagtcttgct ctgttgccca ggctggagtg cagtggtgtg atcttggctc actgcaagct 660ccgcctccca ggttcacgcc attctcctgc ctcagcctcc cgaatagctg ggactacagg 720cgcccaccac cacgcccagc taattttttg tatttttagt agagacaggg tttcaccatg 780ttagcaagga tggtctcgat ctgacctcgt gatccgccca gcttagcctc ccaaagtgct 840gggattacag gcatgagcca ctgcacccag cccatatatt ttaaagctct gttattgggt 900acataaacat ttaggattgt tatatccttt tgataatgga ctcttctatt atgaaaagat 960aatatactgt gggtttataa catatgtaaa agtatgagta acatattatc agaaggggag 1020aaatggaaga taacttaggc atcttatttt taagcatagt tttccctttg tttctgcatt 1080agatgattta cctgaaatgt cattcaattt aacttactct ccatcctcac ccgcccagct 1140ttggttatga ggcagtagaa agaaatgatc tgcctgtggt tttctagaaa tacgaaagtt 1200gagtccttaa ggctacacag aaagaaagta cctccccagg gcttcaccct tcccatcctt 1260tcagcaggct ttttgtctgt cgtatcttct ctgttgaaat ggccattgac aagaggagga 1320aaggggtttt gttgtggatt gttcaggcac ttcctttggg gtatatgggg gatgagtgtt 1380acatttatgg tttctcacct gccattctga tagtggattc ttgggaattc aggcttcatt 1440tggatgctcc gttaaagctt gctccttcat gttcttgctt cttcctagga gccagcaccg 1500ctctttgacc ttgccatgct tgccttagat agtccagaga gtggctggac agaggaagat 1560ggtcccaaag aaggacttgc tgaatacatt gttgagtttc tgaagaagaa ggctgagatg 1620cttgcagact atttctcttt ggaaattgat gaggtgtgac agccattctt atacttctgt 1680tgtattcttc aaataaaatt tccagccggg tgcggtggct catggctgta atcccagcac 1740tttgggaggc tgaggtgggc agataacttg gggtcaggag ttcaaaacca gctggccaac 1800atgatgaaac cccgtctcta ctaaaaaaat agaaaaatta gccaggcgtg gtggcgggta 1860cctgtaatcc aagctgctca ggaggctgag gcagaagaat cacttaaacc caagaggtag 1920aagttgcagt gagccgagat tgcaccactg 
cactctagcc taggcgacag cgagactgcg 1980tctcaaaaaa aaaaaaaaag aacgttccaa ggtcaggact aggcctcccc tcagaagcag 2040caagtgacat atgtgacatc ctctccactc cctatttgca tttctaggtt atataactgt 2100actactatcc atgcatgcct actcttgttc ccagggtgaa ggacccagac atggagagcc 2160gaatccctgc aggccattat aaatgagatt atgccatttg ctcccatttc ttcttattct 2220ttcatttttg gggctctcca tcttgatgtg ttctttggat cgtgaacaga tccaaagaaa 2280aggttgttct gccgtgctgt ttgtcaggat gaaaaactct tttttaagtg tttaggtctg 2340cccccagtgc ccagcccaat caagtaacgt ggtcacccag agtggcagat aggagcacaa 2400ggcctgggaa agcactggag aaatgggatt tgtttaaact atgacagcat tatttcttgt 2460tcccttgtcc tttttcctgc aagcaggaag ggaacctgat tggattaccc cttctgattg 2520acaactatgt gccccctttg gagggactgc ctatcttcat tcttcgacta gccactgagg 2580tcagtgatca agcagatact aagcatttcg gtacatgcat gtgtgctgga gggaaagggc 2640aaatgaccac cctttgatct ggaatgataa agatgataag ggtgggatag ctgaaggcct 2700gctctcatcc ccactaatat tcattcccag caatattcag cagtcccatt tacagtttta 2760acgcctaaag tatcacattt cgttttttag ctttaagtag tctgtgatct ccgtttagaa 2820tgagaatgtt taaattcgta cctattttga ggtattgaat ttctttggac caggtgaatt 2880gggacgaaga aaaggaatgt tttgaaagcc tcagtaaaga atgcgctatg ttctattcca 2940tccggaagca gtacatatct gaggagtcga ccctctcagg ccagcaggta cagtggtgat 3000gcacactggc accccaggac taggacagga cctcatacaa tctttaggag atgaaacttg 3060cccatc 30664013065DNAHomo sapiens 401tgggacgaag aaaaggaatg
ttttgaaagc ctcagtaaag aatgcgctat gttctattcc 60atccggaagc agtacatatc tgaggagtcg accctctcag gccagcaggt acagtggtga 120tgcacactgg caccccagga ctaggacagg acctcataca atctttagga gatgaaactt 180gcccatctct aaaatttcgg gatttctttg tacccaacaa ggttcaaaca caacagtcag 240cttttattca tgatttttac ttccatctgc tgatgtagaa catacctcca gagtgacctc 300agaaattgtc aaatgtgaaa acacaagcca tcacagtgag aaatgggagg ttgagttaga 360ttgtctaagg ctggagagtc catatactcc cactgttagc tctgaagtgt gtagccagtc 420ttcagattct gggtcagttg cctcagtctc tcttagcttt tgccttactc tttatccgac 480cactgccctg ccaggaaaac aaggctctat aactcctctt acaggtcagc ttgacacaaa 540aagggtgcct ggattcctaa tgtttcattg tcacttttcc cagtcagatg ataatgcttt 600tcaaatcaac atatattttg ggggaggttg gaagggagag ttgaaatatt ctaagaatca 660aagagtagcc cactttaatc agagtatgac ccctgattgc tcacagtcat ctcctgagca 720gtgtgagcga gtttcagatg aggaggctga aggccagtca ggcatgctcg aggattccaa 780gtctgtaggt gggagggcag agatttagtc ctgttggcca aagcctctag ggaatttctc 840actccagtgg agaaggcaac acacttacca aactgtgtgg aaactatctc atttgattag 900aaattttacc tcaagaagag gaaggacagt tgagaaagaa cattttctta cacatgagac 960agctaaggct tacaagaagg agaggaataa tgaggcaaaa taatcctcat taatattttc 1020attcctcccc tggggattag aactactttc agacccgatt ttaatggtaa gttaggtact 1080tcctacagtt gccatccaaa tatcagtcag gatcagacat gatgttagct cctgctacaa 1140taaaaccatt ttctccctga atgaaaacaa aggttccaca ggagacagtc ccacagagca 1200gtggcttctt ttcctccctt taaaacctca tgttggctgg acacagtggc tcacacctgt 1260aatcccagca ttttaggagg ctgaggtggg aagatggctt aagcccagga gtttgaggct 1320gtagagctat gatcacacca ctgcccttca gcctgggtga cagagcaaga ccttgtctct 1380aaataaacaa acaaacaaaa aatcctcttg tgttcaggcc tgtgggatcc cctgagaggc 1440tagcccacaa gatccacttc aaaagcccta gataacacca agtctttcca gacccagtgc 1500acatcccatc agccaggaca ccagtgtatg ttgggatgca aacagggagg cttatgacat 1560ctaatgtgtt ttccagagtg aagtgcctgg ctccattcca aactcctgga agtggactgt 1620ggaacacatt gtctataaag ccttgcgctc acacattctg cctcctaaac atttcacaga 1680agatggaaat atcctgcagc ttgctaacct gcctgatcta tacaaagtct ttgagaggtg 
1740ttaaatatgg ttatttatgc actgtgggat gtgttcttct ttctctgtat tccgatacaa 1800agtgttgtat caaagtgtga tatacaaagt gtaccaacat aagtgttggt agcacttaag 1860acttatactt gccttctgat agtattcctt tatacacagt ggattgatta taaataaata 1920gatgtgtctt aacataattt cttatttaat tttattatgt atatattgtg tcagttcaga 1980tgccaaaaag aggtcttgaa catgtcacag gctctgatgg cactgaccat ggagaaagct 2040tgatttgatc atctggtgtc tacaataacc aaagctaatt attaaggaaa aaaacttgaa 2100gaaagaaaat agtccttact tcatctataa tgaggttttt gtttttttgt tttgagacgg 2160agtcttgctt tgttgcccag gccggagtgc agtggcgcga tattggctca ctgcaacctc 2220cgcttaccgg gttcaagcaa ttctcctgcc tcagcttcct gagtagctgg gattacaggc 2280acctgccacc acgcccggct aatttttgta tttttagtag agatgtggtt tcacgatgct 2340ggccaagctg gtctcaaact cctgacctca ggtgatcctc ccacctcagc ctcccaaagt 2400taagtgctgg gattacaggc atgagccact gcggccggca ttaagtatga gtttttaagt 2460tagcccactt tgttaatgac tatgagtact aatagcttaa gataaagaag tttctaggta 2520atcttgtttg aaggatgatg taaaaatata aatttaaact gtgagtgaca aaataaactt 2580ccttaatatt tgcctacatt tagagaaatg gagcattcag ctcagaaagg aagaatgtct 2640gtggttttaa ggtaaaatcc atattccaag actcagtgaa gaaagttcag tgataaagaa 2700cagactactc tcatcttatg aagaaatgga gcaatttcac ttggaaagac taggaagaca 2760aaatgttaca gacgtatttg ttgtgccaca aaataggcaa ggtcagtttt gaacaataag 2820aactccataa agtagaccag ggcatctcag aagtgaggtt ccatgagccc aggtggggca 2880caggctgggt gatcttgagt ggagaggaag aggggttttc tgagcttcaa gagctgggcc 2940acacagtgtg ttggttttag ctgggatgga gttctagaac aaacctgcac tttagaacac 3000ctttctaccc acccccaacc acacaacttg ctactattag taaatgtata ggctgaggca 3060cggtg 30654023153DNAHomo sapiens 402ggactaaccc acctcccttc caaggatctt gctatctatc ccttcttgcc ctcagctact 60cactcccaca gtcatatacc agatctcatc attagacaat tgtaatccct acacaattta 120gttccatgta tcctctctct aaccactatt cctcatcttt ccaggtcatt ctctctagac 180ccgaattcca acaacccttc aaccacactg gtaccactaa tctacagatt acatcttctt 240tctactatac cttgatgtgt tcctgaatat ctcccgaatc ctcttcatcc agtttaattt 300caaggtccat cattataatc attttcttac atactccctc acctctcctg ccccattaat 
360actgtcctag taaaatctag ctctctaccc actccatgcc tgcccctatg ctgctgtaag 420tagccagaga aacacatata ataaatgcat tcacacaaac cttctaacat atcatataat 480attgtctgat gtcttcctac tagaatgcct ctcaggcagg aatttttttt ttctaaacta 540atttattcac tgaaatatcc cagtgcctag aatagtgcat gttaaatagt agaatctcac 600tcaacatttg ttgaatgact gaataggagt tccaaaatag agaacacagc atatgggagg 660ggaaaaaaat cagtaacaaa atcattcaag aaattttccc agaactaaag gatgggagct 720cctagaattg acaggggccc agcatcacac atgaaaactt caaatcacat gactatcttc 780aaattacacc agaatgctag agagaaagag aataggatac aagcttccac aaagaggaga 840aaaatagatc acaaatcaga aaagatcaga actcaaaatg ttcatgaaaa ctcaacagcc 900atgctcgaag tcacagcaca atgaagaaat gtccttttaa aaaatcttaa ggagaaccat 960ggcaactcag gattctctac ccagccaaac tattttaatc aagtgagagg gtagaatgaa 1020gacatcttca ggcctgcaag gtcatgaaaa attaacaatc cacaaaccct cttctcagga 1080agctactgga agatgtacca aaataagaga ataaataagg agaaaggcat gagacaccgg 1140aaaaagggaa cccaacctaa atcacatgca aagaaaatct ccagatgcca atgaagggtg 1200accacatcta tgtaccgaga gggcaagtca ctagtttaga aagggacaag tcagatgcac 1260caagattcaa caaactggaa ctgaaataac accagatgca tctgaaaata ctgagtggga 1320ttaatctact cttggagatt ctgtggctaa attgatgata gaaaaccaag caaatacaaa 1380gaaaaaccat aacattaact ttagaggaaa ctaatagttc tgagggagat gatcctagaa 1440tgcaacctgg ctccactgtg tgagtagtgt ttagagggtc ctaatgacac aagcaggctg 1500gaattacact gttcctttat taggaggata taagagtgga aaataagtat gtgtgtggca 1560gggacaaagg atgaaaaaca gctaaatcct catcttccat aaaaggatgt caatatagaa 1620tgcctgaagc agaacaatca agatgcaaca taagtatgtt atacagagat acaaggacag 1680tacacaagaa tcagctaaaa gtatttaaca gaaatggtca ggggcgaggt cagaggagcc 1740agggcagggg actgctgtgt tcataacaag ctttgtaaaa aactatatga ctccttaaac 1800tatgtgtcct taaaaaaatg ttttaagaac agaaaataac aaagaggtaa aatatgaatt 1860atctatcctt catatctcac ttgagtactg atgtttgaaa gaagcatatt tttttaatga 1920acatttcaat tagccagtat tttaccatgt aactttgtta aaattatatt acactccaat 1980aagaatgcct ttacctgtga cagtagttct tccttctctc cagcaagttt tcgtagcctt 2040acatctaaaa caaatgaaaa agatcataaa ctaaatatgt 
gatgatatag tacataaaca 2100attaaaaatt tttcaaactc ataaacagct aatattatct gataaattac attacttaca 2160gctctgaata tctaaagaaa taaaggtgtt aatagcatta cagaaaagtt cttaactatc 2220taaaaagtat ttccacacaa ctgatattta tcagggcacc aaatccaaca tttgttcccc 2280acagcagtga tttgccactt aaagacaaac agaagtacaa aggaggtcat ttccttgttt 2340caagctttca ctagtagaca gacaactcaa atgtcaagtg tgttcctaaa ggctgagccc 2400ttagcgggag agatccaaat atgtgaaaga agatggggta agagcaggac tgggcaaagg 2460aagctagaga agagaaaaga gaggagcata atgctggaag aagcaaagtc cccaaaagct 2520agtagggagg gaaggggacc cactccaaga tgtgggcagc caggccaggt gtggtggctc 2580acgcctgtaa tcccagcact ttgggaggcc taggtgggtg gatcacttga ggtcaggagt 2640tcaagaccag cctggccaaa atggtgaaac cacgtctcta ctaaacaaat acaaaaatta 2700accaagcgtg gtggcaggcg cctgtaatcc cagctactcg ggaagccgtc tgaacctggg 2760agatagaggt tgtatgttgc agtgaactga gatcgcgcca ctacaatcct gcctgggcga 2820ccgagtgaga cttcgtctca aaaaaaaaaa aaaaaaagat gtgggcatcc atgggtagat 2880ctgcggcttg gtggcagcac cattagggct cactcctagc ctgtggaggt ttgactcttc 2940ccaagcctgc attaaaatag gccccttcag ctgaggagat accatggttc acttaaaaaa 3000gcagctgata cctcgccaac cactaccctc tgtaaacagg gtcatagcca aataaagatt 3060ttggtttctt ctgcaccttc caagcagatc tgcctgcttg gacctgcaga cgagggaaaa 3120aacagcaggg aaacactctg ggctgcctat agc 31534033080DNAHomo sapiens 403gccagactct cgttccattc tccagatctc tcttgctcac ccagcatcct gttttattca 60aagtgcccta caatcacatt tctggaatgc acattagaga atgtgcttac taactttcaa 120aatgtttttc agtttgcttc acacttgtat ctctcactcc tctaagaagc ttacacatat 180atgaaaacaa gatgaaaaac aaaaaaattg ttttttttta aaataaaagt gagctaatga 240tacagtatct atctgtgcca tttttcttcc tctagagtag atttctttgt ggggtcaatg 300gatgggtgac tttgatttct cagacagagg tgtcagcaac tttgtggttt cctggagaga 360ggtgtcagat tctcaaaggg ttaaatttaa gaggtttaga ctttaagagt ctgggaagcc 420ctgctctgga agtcatactt ctctgatatc tttttggtca tctgtttctt ggcttaagaa 480atgtggtgga aaagaggtac agaaccctgg ggtaagcagt ggaacataaa accagatgtt 540ccaaggatga gaaacttata acacacttga gaagtctcct gctagcctac tgctccccta 600gcacaggtat actagactat 
ctctttgcag aacagtttgt agttaagtaa aaaccgatgt 660gtataggccc atagtacttc catccacagg ccttacagtt acacttattg ccttacagtg 720acccagatgc tgatttccca aggtcaagga tgtctgaaga caatgtgcca atgtgcccag 780attcttctag ttaaggatct acttgagtct cagcccttat gctgtttttg ttttccaagc 840tgggatatga aaaagcagaa aacccaatag ggtaacatta atccaagtca acatagcaac 900cagtatctta cctaatggcc cttctcctgc tgactccaag acctgagcag cttcctgaga 960cacaacagtg atggctccag ccactggttc atgactgaca tcaccattgg gagtgccatc 1020ggggattata actaagccat gtttctgcag gggggaaaaa cccaccatca caaaaggccc 1080gtatggaagc tgtaagctct gtgaggtcac tctgcaacaa tacatgtttg ctacaggtaa 1140aacctggtta gaatcagtta catgaaatat agctctgtgt aagaaatagc ttcaacctac 1200caaatctgga ttagagaata aacactgtag tttgtattta ggctaggaaa gatggcagga 1260tgaaaggaag gaagatagag agtaaaacag tgagggacct gaattccagg ctaatgctaa 1320catacctctc ccgtcttcac tgtctcctgc aggtcagcca gctcctctct gagcatatct 1380cgctcattcc taaggcaggc aatgtattct ttctgtttct ctagggcctg gttttaggta 1440aggtagcaag ggaaacaatg gcacagaaaa agagcaggtg aaaggtagca gagaagtacc 1500taattcaaat aagcaaagat aaaggcataa aaagcaagaa agcagtcaaa agattggaaa 1560caaacagtca gatatgggag gaaatacaga gttacatgga tatacatctc cagaagagac 1620ttctcataga aactggttct catgcatcaa tttggcaaaa catgtttaat cacatcaagc 1680agggaaataa atcttttcca gtcaatgaaa aaaataaaac aggaaaagga agataaagag 1740agaagccaga gtaaaataaa gctttcctta ctgactgcct aagtgcattt ttatttggtg 1800aacaaaaaaa accccacatt tcatgtttaa ctaaactagt ttattcaaga atacagttga 1860ttttttaaaa aatagttctg gaataaaaat aactattata cataggtatt ttaatttaat 1920attggctgta gatttttctc caagtagtgt ggcaaaatac tcaaatacca cttaattcaa 1980aatagttaac ctccaaaagg attcaaagat caacttctga caacttaatt aaatataact 2040gagactcatt tggctttctg ttatactccc aaaatgtgaa aaacaaaaat aaacactgac 2100aaaataaata cagccaagct atgaagagtt acagaatatg gatttcagaa tcaggctttt 2160gggttctggc acatacttgt cctatgcctc agtttcctca ctggaaaaac agaagggata 2220atagcaccca tcccaagggc agaggcataa atcaaggtaa agcattgcct gtaatgccta 2280gatagcaggg acagttcagg agaatcaggt tggtgatttc atttgtaaat tccctgccat 
2340ttccttaatc tcacaactgt cagctgagga caatgcagaa gcaggaacat actttggtca 2400tcaatgaaaa ataaaatcta ctatgaaaaa ataaaatcta ttgtaaaaga aaataaccca 2460gaattaaaaa tacacccaag gtaagtagtc tatgcaggaa tctgattact ggcctatttg 2520aaaaagcctt tccccaaata tttttgttca tatatttaat gtcttctgtt agcattccca 2580ttaatccaag aagttaaact atatcaggta actttcctct cagttcactg ggtttggaag 2640tgggacagcg aattgctgag aaattgatag ctgaatagct gggcaattca aaaaatcatt 2700ataatcctgt tttgcaacca aatagggagc aagtaaataa gggatgatag caactacgat 2760ttgtatagca caaattatat ggcaggcact attttatata atttctctct tatacattat 2820tttacatttg aaacctctac atatcctgtg aggtacttgt attatcccca tttaacagat 2880cagaaaattg aggctcacag tggttatatt ttttcgccca aagtcacagt aagtggcaaa 2940accagaaaat gaatctggtt gtttttgttt ccaaagccct taaatagttt tttaaatatc 3000acagctctat gaaggccaca ttatattccc ttattgttag cccagatgat gctaggaaag 3060gagtccatac ggcaaatcct 30804043074DNAHomo sapiens 404tcgcccaaag tcacagtaag tggcaaaacc agaaaatgaa tctggttgtt tttgtttcca 60aagcccttaa atagtttttt aaatatcaca gctctatgaa ggccacatta tattccctta 120ttgttagccc agatgatgct aggaaaggag tccatacggc aaatcctact ctttacttat 180ccaaactgca atgtcaatat ctgacttctt ttcaacaatt tacattcaca ctatatgatg 240tgtctcaagt ctgcctgtga attaacaatg tgcatttcta gcaccatcta gctagtgtta 300acactccatt atgttaataa ttaataataa ctgaaacatt gggaaaacaa agcacaacaa 360tactttccca tgtgttgagt gtcactttat ggattaggta tttttggtta ctggtatctg 420catgcatagt tatgtcatgt atcaccacat ataagtgggt aaatgatcac tgtcacaaca 480tgctctacat aaacaacaac actgaataaa aaagacctct gaggaacagg ccaatttgaa 540actaggaatt ctagcaaatg atatacatga catttgctct tcttccacat cgtattgcac 600tgggttttat ttttaccttc ggacttttta atttcctctt cccataatta cagatgagaa 660aataaaatac atcctgtaaa ttcacccact tcaccacaaa gtttgaagac tactaaaata 720ccttataatt ggatcaaatg tattcaagct ggatctaaaa ccctctgtat tacctgacca 780tataaccact acccttgtgt ttgtgtgcaa caatagctcc tacagtagat tttttttagg 840gtaaaaagta cacgcttgta gagttcaaaa taactcttta tccctgacct aacctcaaat 900cctaccaccc ggaagccaaa aggatgtgta taatgggctg aacttttggg caaggggtta 
960attctccaca taattgtact ggggaacaaa tatctttggt cagaatggaa gtgagtttat 1020gctgggctat agagatacgc aagttcttca tacgcaccta ttctatacat gggctcctgg 1080tgtttagaac cgcagtggag ctagaggcaa gaccactaat gaactgaact ttaacctggg 1140aataatggac atatttcttc attaagttac taaatgtaaa tcttaaaaat gaagctagag 1200acaagtagtt actgaccata ctgaaaatgt gtcttaaaag tcaagggagg accactgccc 1260ttgtattata atgataacaa atgttggcaa ggacatggag aaattggaac ccttgttcac 1320tagtggtggg aatgtaaaat ggtacatctg ctacagaaca cagtataact gttactcaaa 1380aaaattaaac acagaattac catatgatcc agcaattcca cttctgggta cataccgaaa 1440acaactgaag gcagagtctt gaagagttat ttgaataccc atgttcacag cagcattatt 1500cacaatggcc aaaaggtaga tgtgttgata tatcaacaga agaatgtggt atatacatac 1560aatggaatat gattcagcct taaaagggat ggacattctg acatatgctg caaaatgaac 1620cttgagggca taatgccaag tgaaataaat cagatactgt atgattccac ttacatgaag 1680tacctagagc agtcaaattc acagagacag aaggtggaat ggtagttgcc attccaccag 1740gggtttggga gaagggactg aatggggagt tgtttaatgg gtacagattt cagctgggga 1800agactaaaaa gttctatggt ggtgacagta gcacaacaac atgaatgtac tcaatgccac 1860tgaactgtac acttaaaaat agttaaaatg gtaaatttta tgttatttgt actttagcac 1920aatttttcaa attaaaaaag agtcaactcg tgattcaata acttggaaga atcttgaggg 1980acttatacag agtgaaaagg gataattcca aaaggttaac atatactata taattccatt 2040tttataacat tcttaaaaga gcaaaactac acaaatgaag aaaagattag tggttcttag 2100ggcttgggag gggaagggga gattaaggct atgactataa aaaggcaaga ggaggagaaa 2160tcccttatat tgatggaaat gttctgtatc ttcaccatat caagggcaat atcctggttg 2220tgatattgta ctatagtttt gtaagatgtt acatttgggg aagattgagc aaagaatata 2280taggatctct gttaaatttc ctcttttttt tttttttttt gaaacagggt tttggtctgt 2340tgcccaggct ggactgcagt gacatgatct cagctcactg caaccttggc ctcccggatt 2400caagtgattc tcatgcctca gcctcccaag tagctgggat tacaggtgtg caccaccatg 2460cctggctaat ttttgtattt ttagtagaga cagcgtttta ccatgttggt caggctggtc 2520tcgaactctt gacctcaagg gatccaccct ctttggcctc ccaaagtgct gggattaaag 2580gcataagcca ccatgcccag ccctgtttaa tttcttacaa ctgcatgtaa atctaaaatt 2640ctggccgggc acagtggctc atgcctgtag 
taccaccact ttggaaggcc gaggtgggtg 2700gatcacttga ggtcaggagt tcgagaccag cctggccaac atggtgaaac cccatctcta 2760ctaaaaatac aaaaattagc cggacgtggt ggtgcacacc tgtagtccca gctactcagg 2820aggctgaggc aggagaattg cttgaaccca ggaggttgca gtgagctgag atcgtgccac 2880tatactccag ccttggggag agagagagat tccatctcaa aaaaagaata ctaaaataaa 2940atattttcca attaaaagcc aaataattta tattttaaac tgagacatct gaggggtttc 3000tatggctggt ccaagattat cagtttaaaa tattaaggca ctcatacaag ctagaaatcc 3060tgggcctaca gatc 30744052129DNAHomo sapiens 405caagattatc agtttaaaat attaaggcac tcatacaagc tagaaatcct gggcctacag 60atctgtgtta aagaaaatta tgtgaagtcc taaaagaagc ccattctaga cagtgaccag 120attttaacta aaaattttaa attacctact ttgccccacg ttttttcacc tcttatattt 180cccaagcaaa aatttaaatc aaatcaatgg ttcaacaatc aaatttcact tttcaatttc 240aaatttcaat tttaaaatca tagtttcaaa acacctaaca gaattattga tttattcccc 300agaaggcttt tctaggttta gatggaattt ttaatactca gcaatttgaa agtcagagaa 360ttatctataa agtagctttt gttctttaaa tttttggtct acaaactttt ttaaagaaag 420ggtatcactc tattgcttag gctggagtgc catggcacga tcatagctca ctgcagcctc 480catcgtgtgg gctccagtga tcctcccacc tcagcctcct aagtagctgg ggcaggtgca 540tgctctgcaa attttaaaat tcttttgcag agacagggtg ggtctcacta tgttgcccaa 600gctggtctca agctcctgac ctcaagcaat cctctggcct caagcgatcc tccgtgctag 660gattccaggc atgagcctac aaactcttaa gaggtaatgt aatcttccca tgtgtatatt 720aatgagagag gtccttgaag tgatgaaaaa gactggatcc tgctgactac tggttgggct 780tcagaatgtt cctaacaaca ttctgagggt ataatccaca ggatttcata tccaggcctg 840tctctcaaga gatgttctca gggatcttta acaactattc cctactcccc ctaaccttaa 900gcagaacaag acttttctta catgttctat ttcctctgcc cttccctgac aaggtaagcc 960tctggcaact atggctaagt ggttccccta ctgtagaaca gagagctcag ccaggtgatg 1020ggactgccaa tcaaaggcca catgagatga actggaggga attttttcca gcttttggtg 1080tacatggaat ctacctgcaa ggcttagcaa aacagcaatg aagacatttc gtttatctgg 1140gcccttactt gggggagttc tgtggttata attacagaca gccaccctag aaagtcttac 1200attcctatcc atttctgtaa ttgaattgat tttaatctct tcctatttta tacaccaagg 1260atttatagga tgctaataac tttctcccca ccactaccct 
cttcttatcc aaattcctgt 1320aacgtaagga tatcaagtta accacagagt ttgaattgaa tgcctgtggc tgtttctgga 1380taagaatctg aagggaggcc aggcatggtg gctcacgcct ataatcctag cactttggga 1440ggccaaggtg ggtggattac ctgaggtcag gagttggaga ccagcctggc taacatggtg 1500aaaccccgtc tctaataaaa agacaaaaaa ttagctgggc atggttgtgt gtgcctgtaa 1560tcccagctac tcgggaggct gaggcaggag aatcacttga acccaggagg cagaggttgt 1620agtgagccga gatcatgcca ctgcactcca gcctgggcaa cagagtgaga ctccgtctca 1680aaaaaaaaaa aaaaaaaaaa aaaagaatcc taagggaata cagagaaact tcctttaaaa 1740gttcctactt atacatttta caagcctagt gtttgctgaa aagaagagtt cttccaggca 1800caacgtcagg ttttcctatg gaagtctctg tctcctactg actcattttt catactgtgt 1860aaatgctcaa gaagaatcaa aaggacaggt tttttcaatc tctaggttaa attctactgt 1920agtcctcatc aatgagcttc taaccaaagc ccaatttcat ttcatacccc aattttttta 1980tctttccaaa gaagtgtctc ctggaggtca aacacctctt ttgtcatggt gtctattttc 2040tgctgcatgc gctgcttctc ctgcagagga agaggggaag agaagtaata aaagagcaga 2100aagaaaaggg agaggaggtt tgagggagg 21294063170DNAHomo sapiens 406ttctcctgca gaggaagagg ggaagagaag taataaaaga gcagaaagaa aagggagagg 60aggtttgagg gaggaaacaa aaataaagcc gataaagaaa cttaaccaaa agggaaagtc 120tgtgatgaac aggaaaagca aaattggtct gccaaaagaa aagatgacat tcacagtctt 180ggccacaaga ttcttattgg cttgccccta caaaagtaag caaaggaacc aggaataatt 240gttccaacca cagctacgtg gcagcaagcc agctagaatt
tctgtgtaca tacagctcca 300tatgtatatt ctttctttga taactgcctt tttaccaaac aagaacttac attcctagag 360agggaaattt aggtttgctt atgaacaaat gatctttcat cttagagaac aagcagtttt 420gaattttatt ttttaagcag aactgatcat tttgaatttc tgttagcaaa atctatgaca 480gcaagaacac catgaatttt gtattatttt aaaattatat tattttgaaa catttaaatt 540tagcatttaa caatccttaa atgacctttc taattaggca atggtgctta acaggttttc 600ttcttatgca ttattggtaa attattatgt cctcctttcc ctactcatac attaggtact 660ttaccatgga attttcaatt ccaaagacca aaaaacatta tttgtaatat ttaaagtttt 720tcagcataac catagatact aacatctaaa agatgttcat tctagatgta aaaaacatct 780aaaactatag ttctcaaagt ttgtatacct agcaccctaa gcttttaaag aagccacagt 840gatgaactat agaaatcaag cattatattc ttcttaaatg caattacaat taattactag 900aacactttac cagtcctaac ttaagctatt gaatttgaga agcagccccc aaagcaggtt 960tattatttta tgtggttggc attttggcac aaaaagataa aagaacaaaa agggaaagaa 1020tttcacatta ttttaaaata ccagcaggat acagattctg gaaaatatgc ttcctacctt 1080atatggagaa aaaccaagaa aattaacttc acatgtaatc tgatagatcc aaaaggttat 1140ctgtatctgc acttgaaatc cacaaattct gagtatgttc aattattctt aatgatgaca 1200aaaattaaca cgtcttcaaa tttaaagtca tttctttttc tctattaaat ggtttttaaa 1260aatcatttgt agagagacat attaagaggt aggtccgagg ggaaagagag aaagagggag 1320agaaaaagaa aggctaaggt ctgagtagcc aggaatgtgg acaagtgtgg ttgtgagatc 1380tctctcctgg gatcattaac aatctatgct tcctgacatc tctggcgtgt caacactaac 1440ttaacattag atgcctttga tagccacacc tagatagtgg gcaggatccc ccttcaaact 1500tatttccata tttatctaaa aacatcgtct caggagggaa aaccacattt aaagaaaaaa 1560gatgcatgca atgtagcagg cctgcaagga tgactaatgt tttcaaagag ttcttggtag 1620actatgcttc attccattcc taagatgttg ccagcaatgt ggcagagtcc cttcgcttgc 1680agaaacctga accttcagac taaccattct ttaccttttt gtacagaacg tatcttgatg 1740tttcttcttt tttcatttag ccacctgaga aatgtattta cctgagtgaa aatcaaactt 1800attccccaag aatcatgtcc caaaagatgg cattcactaa ttccaaagaa taatgttatt 1860ctataatttt tccttttgcc catttcctaa gatatctgta ggaaacagtg tgcttaggaa 1920taaaagacac aaaaatttct gctaccaaag tggggtaatg tttataggat ttatagtatt 1980aatttttaag cataatctgg 
tttatgtttg aaaatttgta gtgtacagtc aaatataaag 2040agacaaactc tgatgcatct taactctcct tccctcccaa cacatcctca tcccattcaa 2100ctcatttttt ttcaaaatta agtattccca cagttcatgt acatacctca ataagctcat 2160ctctttgccg caggccttct ttaagttctt ccatcttatg ctgcagcaca ctacacatat 2220gtttctgcct ttctaactcc tgttattaaa caaataatat catttacaca ggtcatggca 2280cacaagaaat ttgaacatac acaatacaac acagaggtta agtatgacct ccagaaacat 2340gcccaaactc ctgattcata gtaacttaga aaaattgtgt attctataga aaagttaaga 2400aaattttaaa attccatctt gtataattat caggaaaacc tgaactaatc aatggcaaaa 2460ttattaaaaa caaaagataa tttagtaaag taacaggtta taaaatgaac atatacaatt 2520caatgacatt catatacaaa taaaattcaa agaaggaata ataaatgcaa tatcaaaata 2580aaatcaatat taaataaaaa acatacatgt aaacttacaa aatatatcaa aaacctatat 2640gaggaaaatt atataaagca ttcccaaaag acagagaaat aggattgaat aaatggaaag 2700gcataccgtc ttcttggatt aaaagtctca caacattata aaaatgccag ttctccctaa 2760attaatctat acatttaatg tagtacaaat aaaaatacca tcaggttttt cttttatcat 2820catcagagca agttgatttg aaagaaaaac acaagaaaaa gtagccagaa aaatacatac 2880tgaaaaagaa gaaagccggc cttattaggt attaaaacat attataaagc ttctataatt 2940aaaacaatgt tgttatggca catgaatata gaccaaggga gcagaataga gaattcagga 3000aaaacccact taaatataca aatatattta aaaacaataa aaataagagc atctcaaatc 3060aatgagaagg aaagactttt aaattagtaa tgttgggata actggatatc catttggaaa 3120aagataaaat tggaactata cctcatacca cacaccagga caaattccaa 31704073157DNAHomo sapiens 407aaagccaggg agtgaatggg ggaagaggga agggaaggga gaacaaactg tacaggaata 60aaagtaacca aggagtgggg caatctttac tgaaaaaatg actcaaaaat ccacaagcaa 120tgaggtaatg gtacaataaa ttagactaca taaaaataaa aattttgggc tgggcatggt 180agctcatgcc tgtaatccca gcactctggg aggctgaagc aggcagatcc cttgagccga 240agaattcaag accctgcctg ggcaacatgg caaaacccca tctctataaa aaaaattcaa 300aaattagcca gggttggtgg cgtgtgcctg tagtcccagg tactctggag gctgaggtgg 360gaggaccacc tgagcctggg gaggtcaagg ctatagtgag ccatgatggt gccattgcac 420tccagcctga cgacggagtg agactctgtc tccaaaaaat aaatacataa ataaataaaa 480cttttgaatg gcagagaatc tctaaaacta ggccaggcac ggtggctcac 
gcctgtaatc 540ccagcacttt gggaggccga ggtgggcgga tcacctgagg tcgggagttc gagaccagcc 600tgaccaacat agagaaaccc tgtctctact aaaaatacaa aattagtcag gcgtggtggt 660gcatgcctgt aatcccagct actcgggagg ctgaggcagg agaattgctt gaacctggga 720ggcggagatt gtggtgagcc gagattgcgc cattgcactc cagcctgggc aacaggagcg 780aaactctgtc ttaaaaaaca aacaaacaaa aactagaaag aaagaaaaca ctaactgcat 840agaataataa gctacggaaa cggacagttt acagaaaaag aaatagaaat agctctgaat 900atgaaaagat actcatacta agagaaacgg aaacaaacaa aatactagca aaagttcaaa 960aacttgacaa catattccag aacagaacta tggggaaaga aataggccct catacatttt 1020ggtgagaatg caaatggtat aatgcttaca aaggagacta cagcagtatc tgcaaaacta 1080catacctttc gacccagcaa tctcactctt catcatagat acattggcaa aaatacaaaa 1140agacctatgc agtatgttat ttctacagga ctatttttaa cagcaaaaca tgacaaactt 1200gaatgtctat taatataggg aactggtaga ataaagtgtg gtacatccat actgtggaat 1260aattatgcag tggtgaaaaa gaatgagcaa gatatctcta tacaacattc ataaggtgat 1320aaaatctaca tgcacgacag catttatatt aacaatatgc tactattttc taagaagagt 1380aagaaataca tatatttgta catatatttt gaatgattat atatacatat atatcttttt 1440agattaaaaa tggctaccta atttatcttc ttggatttaa aacatggaaa gataaaccat 1500taaaatttaa aattccctaa aggaaggaga aaatagagac agagacaggg atagaagatt 1560aacttcttca gatatattct ttgttaacgt gactccagat ctatgtaact attctagata 1620gttacaaaat tgtaaaacaa aattaaattt aaaaagcaat tcctagtgga aacaattcaa 1680gtggccattg atagatgaat gcataagcaa aatgtggtat atatacagtg gaatattatt 1740cagctttaaa aaagaaactc ttgtcacacg ctacaacatg gatgagcctt aggacattat 1800gctaaatgaa gtaaaccagt caagaaaaga tatattatta tatgattcta cttacttatg 1860agtttggtac acagagtagc caaactcaca gagacagaaa gtaggatggt agttgccaag 1920ggctggtggt agggagaaat gggtaattgt ttaatgggta cagagttcta gttttgaaag 1980ataaaaaatt tctggagatc tgatgtacac taatgtgaat atgcttcaca ctactgaact 2040ctacacttta aaatggttaa gataataaat ttatcatgtt ttttaatgat aattaaattt 2100tttaaaataa aaataatttt aaaagtaatt ccaaatattg aaaataaaat gcagtgaacc 2160taactataca tccagttaga agcgcagaaa gaaactattt caagtaactt ctaaaaacaa 2220tagtttggcc acacacagac tagtggcaaa 
aataacagcc aagcaaaaca aacaaaataa 2280aaatctttta actattttca gtaattaaat tgttggtgtt aatgttggta ttgctattct 2340aggcaacttc ggataaagca aagagaacag aacgtaacat aattactatc atccctagaa 2400actttgagaa ctaggaattg tagtgtaggg gaaacaaaca aagatacaga tgtaagacag 2460aagaggttaa ataaccctag agtcctgcat gtgaactgga actatcagca agaactcata 2520atgtattttg ttgttaaaaa caaaaaaaaa cttcacacac atatttccca aatttatcca 2580ctgaaaagac ccataaacac tgacctactt ggtggcaatg agcatctcta gcactcatac 2640taaaacagaa ctagggctcc ttggacaaat ggctgattcc aggtctgggg cagaaaatgt 2700acaagatgag actgggacat cttctgccag aaagcaaaga agctatcaaa aacaattaga 2760ctcaccagaa gtacttgaga atcaacctca agaggttcac attagccaaa gatgggacaa 2820ttttatcatc aaaaagaata aaagctgcaa tgcaactgaa gatatcaaat gcttgaattt 2880atgacttcat attgatattt taagagaaaa gtaattggtc acaaccaatt cctttttttg 2940aaaactcata aagggagaaa atatttgcga ataatatatc tgataaaatt cttatatcta 3000gaatatataa accttacaag tcaataataa taaggcaaaa tccaatttta aaatggacaa 3060aggatctgaa tagacatttt gccaaggaag atatgcaaat agccaataag cccatgaaaa 3120tatgttcaaa atcattagtc accagggaga tgcacat 31574083166DNAHomo sapiens 408tgtggggaaa tcaaaacctg catacattgc tggtgtgaga atataaaatg gtgcagccac 60ttcggaaaac agtctggcac tgttcaaatg gttaaacaca gatttatcgt atgaatcagc 120aattccactc ccaagtatat acttaagaaa aaggaaagca tatatccaga ccatgagcag 180tggctcatgc ctgtaattcc aacactttgg gaggccaagg caggaagatt gcttgaggcc 240aggagttacc agcctgggca acatagcaag gccccatctc ttagaataaa aaagaaaaag 300aaaacttacg tccaaaaaac aacctgtata caaatgttta tggaagcatt attcttaaca 360ggaaaaagta taaacaaccc aaatgtcaat caatgcacaa atgggtaaac aaaatgtagc 420atacccaaaa aaatggaata ttaataggct ataaaaagga attaagtatt gatacatggt 480ataacatgga tgaaccttga aaacatcacg ctaagcgaaa gaagccagtc acaaaagacc 540atgtattata tgactccttt catatgagtc tagaataggg aactctatag atagaaagta 600gatcagtggt tacttaagac tgaggggttt gggggaaagg aagatgatac taaagggtat 660atggtttctt tctgaggtaa tgaaaacata ctaaaagtaa ctgtgatgaa ggttacacat 720atatgtgaat atactaaaaa ccactgaatt gtacacttta aatggatgat ttgtatgtta 780tttgaattat 
atctcaataa agctgcttaa aaataacatt aaataggcca ggcgcagtgg 840ctcacacctg taatcccagc actttgggag gccgaggtgg gtggatcacc tgaggtcagt 900tcgggaccag cctgaccaac aaagtgaaac cccatctcta ctaaaaatac aaaattagcc 960aggcgtggtg gtccatgcct gtaatcccag ctactcagaa ggctgaggca ggagaatcac 1020ttgaacccgg gaggcagagg ttgcagtgag ccgagattgc gccattgcac tccaccctgg 1080gcaacaagag caaaactcgt ctaaaaaaat aaaataaata aaatttaaaa aaataaataa 1140aataacataa aataaaataa attggtaatg ataaaatcag aacatcccat tttgcagccc 1200ctagtcaatt aaaggatcta agcacacaca tacagcctaa cagtcagcca cacatctggg 1260cctcctgaag gaagacaact gcatcatcta tgaagcagac tttcaaaaaa aactgaagtt 1320atatttgatt aagcctctgt gcctaactac ctatttacag agaatacaga ggaaagggaa 1380acatggtaaa gatactatgg ggacgaaaac ggaaaaactt gtaagactgg gaaatattaa 1440gcaacccagt ttcttcaaca tatatattat aaggagaaaa aacatgaaag aggacctata 1500catgaaaaga gacttaaaag atttatcaac tcattctaag gtgtgaaact tacctggatc 1560ccgatttttt taagtgtaaa aaagaaaaat catttatgac attttgaaac tactgaaatt 1620ttaacattga ttagatatat aatatgaatt attgttaatt ttacaggtgt gaaatggtct 1680tttgattatg ctttaaaaga gaaacaatgg gctgggcaaa gtggctcatg cctgtaatcc 1740caacactttg ggaggtcaaa gaaggaggat tgggcaatac agcaaggccc catctccaca 1800aaaagatttt taaaaaacag ccaggcatgg tagcatgtgc ctacagttct agctactcca 1860gattacttaa gcgtaggaga tcaaagttac agtgggctat gatcgtgcca ctgcactcca 1920gcctggggag cagagcaaga ccctgtctgt aaaaacaata aaattaaatt aaattaaaat 1980ataaaacaaa atagttattg taattccagg aggaggcacc aaagacattt tgaaaatttt 2040tattaatact gtataattat tatggaatgg tccttctcta ctctcactac ctgcttctgt 2100aatgaaatac accagaatgc ctgacctctt tgtgtatctc tgaatataaa cattctactt 2160tataaagcaa tagttgttta gaacaagaga gctcggagta aaggtatgtt agaaagagat 2220actgtcacaa agtgttctgc atcacagtac gtgaagcaaa tttccagggt attaaaaaaa 2280acaatacatt ttaggtttga aaagtacaag acctgccacg gtggctcaca actgtaatct 2340cagcactgtg ggaagctgag gtgggtggat tgcttgagcc caggagttcg agaccagcct 2400gggcaacatg gtgaaaccgt ttctataaaa aaaaatttgt ctttaattag ccaggcgtgg 2460tggtacgtgc ctgtagtccc agctactcag gaggctgagg 
tgggaggatc acttgagcct 2520ggagacagag gctgcagtga gctatacttg cgctactcca ctccagcctg ggctacagag 2580tgagaccttg tctccaaaag aaaaaaaaaa aaaaaaaaag gacaagatta agaaaacaac 2640tctatgcata ctaaaataac tctattacct tctaaaatga tggagggaaa aactaagttg 2700gttaaaatag ctaattataa gggttttctt tatttttaaa aatgctgaag gatatttgag 2760acgatggaaa gataattcat tttagtaatg acccttaaat acttagcatg agatttttac 2820tctagtctgc tagaagcaaa gagccaaaga cagataagaa aaaatgtgtg ggacagcaaa 2880aaaaggagtc cagaggctct aggaaaagca tgatggctca gaatctccaa aaatggccta 2940tgggatatat aagaagaatc aaagatatac atatctagca tatataatgc tcttcaatgg 3000gccactgtca gtccaatatg aaaacaacta gaagcctctt tttcaccatc taaataccaa 3060tcctaaataa acaattttag agcagtttaa gattcacagt aaaattatgt ttaaagtaca 3120gagagttttc atatactccc tgtccccaca cacgcacagt ctaccc 3166

* * * * *
