Method Of Classifying Gene Expression Strength In Lung Cancer Tissues

TAKAHASHI; TAKASHI ;   et al.

Patent Application Summary

U.S. patent application number 13/549961 was filed with the patent office on 2013-11-14 for method of classifying gene expression strength in lung cancer tissues. This patent application is currently assigned to FUJIFILM Corporation. The applicant listed for this patent is Tetsuya Mitsudomi, Nobuhiko Ogura, Masato Some, TAKASHI TAKAHASHI, Shuta Tomita, Yasushi Yatabe. Invention is credited to Tetsuya Mitsudomi, Nobuhiko Ogura, Masato Some, TAKASHI TAKAHASHI, Shuta Tomita, Yasushi Yatabe.

Application Number20130303389 13/549961
Document ID /
Family ID34510569
Filed Date2013-11-14

United States Patent Application 20130303389
Kind Code A1
TAKAHASHI; TAKASHI ;   et al. November 14, 2013

METHOD OF CLASSIFYING GENE EXPRESSION STRENGTH IN LUNG CANCER TISSUES

Abstract

The present invention provides a method of confirming the gene expression, useful in the decision of a five year survival rate of a patient with lung cancer and the use of a DNA probe kit in the method. A method useful in the decision of a survival rate of a patient with non-small cell lung cancer comprising confirming the expression strength of at least one gene in lung cancer tissues isolated from the patient.


Inventors: TAKAHASHI; TAKASHI; (Nagoya-city, JP) ; Tomita; Shuta; (Nagoya-city, JP) ; Mitsudomi; Tetsuya; (Nagoya-city, JP) ; Yatabe; Yasushi; (Nagoya-city, JP) ; Ogura; Nobuhiko; (Ashigarakami-gun, JP) ; Some; Masato; (Ashigarakami-gun, JP)
Applicant:
Name City State Country Type

TAKAHASHI; TAKASHI
Tomita; Shuta
Mitsudomi; Tetsuya
Yatabe; Yasushi
Ogura; Nobuhiko
Some; Masato

Nagoya-city
Nagoya-city
Nagoya-city
Nagoya-city
Ashigarakami-gun
Ashigarakami-gun

JP
JP
JP
JP
JP
JP
Assignee: FUJIFILM Corporation
Tokyo
JP

Aichi Prefecture
Nagoya-city
JP

Family ID: 34510569
Appl. No.: 13/549961
Filed: July 16, 2012

Related U.S. Patent Documents

Application Number Filing Date Patent Number
12942770 Nov 9, 2010 8244478
13549961
11008265 Dec 10, 2004 7856318
12942770

Current U.S. Class: 506/9 ; 435/6.1
Current CPC Class: C12Q 2600/158 20130101; C12Q 1/6886 20130101; C12Q 2600/118 20130101
Class at Publication: 506/9 ; 435/6.1
International Class: C12Q 1/68 20060101 C12Q001/68

Foreign Application Data

Date Code Application Number
Dec 12, 2003 JP 2003-415119

Claims



1. (canceled)

2. A method for predicting a survival rate of a patient with non-squamous cell lung cancer comprising confirming the expression strength of at least one gene selected from the group consisting of SEQ ID NO: 9, SEQ ID NO: 1, SEQ ID NO: 6, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 20, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 11, SEQ ID NO: 61, SEQ ID NO: 7, SEQ ID NO: 62, SEQ ID NO: 2, SEQ ID NO: 63, SEQ ID NO: 14, SEQ ID NO: 21, SEQ ID NO: 64, SEQ ID NO: 65, SEQ ID NO: 66, SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, SEQ ID NO: 71, SEQ ID NO: 5, SEQ ID NO: 72, SEQ ID NO: 18, SEQ ID NO: 73 and SEQ ID NO: 14 in lung cancer tissues isolated from the patient.
Description



CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This is a divisional of U.S. patent application Ser. No. 11/008,265, filed Dec. 10, 2004 (presently allowed). The entire disclosure of the prior application is considered part of the disclosure and is hereby incorporated by reference.

TECHNICAL FIELD

[0002] The present invention relates to a method of confirming the expression of a specific gene in lung cancer tissues, used in a technique of predicting a five year survival rate of a patient with lung cancer with high accuracy.

BACKGROUND OF THE INVENTION

[0003] When various therapies are applied to patients with cancer (carcinoma), a five year survival rate is often used as a measure of cure. That is, a five year survival rate is a probability that a patient who underwent a cancer diagnosis or therapy will be survival over five years thereafter. By this probability, a progressive level (stage) of cancer, a therapeutic effect and the like are represented.

[0004] Until now, the TNM classification comprising the combination of the size of tumor (tumor meter, represented by T), the range where metastasis to lymphonodi are observed (represented by N) and the presence or absence of distant metastasis (represented by M), each of which is determined by clinical method, has been mainly used ("Cancer of the lung," written by Robert Ginsberg et al., 5th edition, pp. 858 to 910, Lippincott-Raven (1997)). For example, patients judged to be in stage I under the TNM classification means those having a progressive level such that a little over 60% of the patients could be survival for five years if cancer is resected by surgery. Patients judged to be in stage III means those having a progressive level such that at most 20% the patients could be survival even under the same condition.

[0005] Recently, focusing on one or two genes specifically expressed in cancer patients or cancer tissues, a therapeutic effect is often predicted by determining the difference in the expression of said gene(s) between patients showing superior therapeutic effect and patients showing poor therapeutic effect (Horio et al, Cancer Research, Vol. 54, pp. 1 to 4, Jan. 1, 1993).

SUMMARY OF THE INVENTION

[0006] However, the TNM classification cannot be applied unless outcomes of many clinical tests are accumulated. Thus, this classification is not be said to be simple and its accuracy is not satisfactory at all. And, in a method of predicting a therapeutic effect by confirming the expression of a specific gene, the correlation between the gene expression in patients with lung cancer and a five year survival rate of the patients has not been reported.

[0007] An object of the present invention is to accurately decide a survival rate of patients especially with lung cancer. In the present invention, the expression of a specific gene in lung Cancer tissues is confirmed.

[0008] Accordingly, the present invention relates to a method useful in the decision of a survival rate of a patient with non-small cell lung cancer comprising confirming the expression strength of at least one gene selected from the group consisting of WEE1 (AA039640), MYC (AA464600), TITF1 (T60168), FOSL1 (T82817), LYPLA1 (H00817), SSBP1 (R05693), SFTPC (AA487571), THBD (H59861), NICE-4 (AA054954), PTN (AA001449), SNRPB (AA599116), NAP1L1 (R93829) CTNND1 (AA024656), CCT3 (R60933), DSC2 (AA074677), SPRR1B (AA447835), COPB (AA598868), ARG1 (AA453673), ARCN1 (AA598401), MST1 (T47813), SERPINE1 (N75719), SERPINB1 (AA486275), EST fragment (N73201), ACTR3 (N34974), PTP4A3 (AA039851), ISLR (H62387), ANXA1 (1.163077), GJA1 (AA487623), HSPE1 (AA448396) and PSMA5 (AA598815) in lung cancer tissues isolated from the patient.

[0009] And, the present invention provides a method useful in the decision of a survival rate of a patient with squamous cell lung cancer comprising confirming the expression strength of at least one gene selected from the group consisting of FLJ20619 (R74480), SPC12 (R19183), EST fragment (R96358), KRT5 (AA160507), PTP4A3 (AA039851), SPRR1B (AA947835), LOC339324 (W23522), MYST4 (AA057313), SPARCL1 (AA990699), IGJ (T70057), EIF4A2 (H05919), EST fragment (AA115121), ID2 (H82706), THBD (H59861), MGC15476 (W72525), ZFP (H53499), COPB (AA598868), ZYG (AA453289) CACNA1I (N52765), FLJ4623 (N71473), CSTB (H22919), EPB41L1 (R71689), MGC4549 (AA455267), EST fragment (T64878), DSC2 (AA074677), EST fragment (H79007), EST fragment (W84776), IF130 (AA630800), EST fragment (T81155) and IL1RN(T72877) in lung cancer tissues isolated from the patient.

[0010] Further, the present invention provides a method useful in the decision of the survival rate of a patient with non-squamous cell lung cancer comprising confirming the expression strength of at least one gene selected from the group consisting of NICE-4 (AA054954), WEE1 (AA039640) SSBP1 (R05693), WFDC2 (AA451904), ACTA2 (AA634006), G22P1 (AA486311), MST1 (T47813), PHB (R60946), DRPLA(H08642), SNRBP (AA599116), GJA1 (AA487623), SFTPC (AA487571), ACTR1A(R40850), MYC (AA464600), RAD23B (A2489678), CCT3(R60933), SERPINE1 (N75719), LAMP1 (H29077), IRAK1 (AA683550), BIRC2 (R19628), LMAN1 (H73420), HSPE1 (AA448396), TMSB4X (AA634103), EEF1G (R43973), EST fragment (H05820), LYPLA1 (H00817), SOD1 (R52548), ARG1 (AA453673), KRT25A (W73634) and FOSL1 (T82817) in lung cancer tissues isolated from the patient.

[0011] Another aspect of the present invention relates to the use in the above method of a DNA probe comprising a nucleic acid sequence specifically hybridizing to at least one gene targeted in this method.

[0012] All genes which expression is to be confirmed in the present invention are known genes. The nucleotide sequence of each gene is registered in "UniGene", one of the public databases provided by NCBI, with its abbreviated name and its accession number represented by the combination of alphabet (such as AA) and numeral. In the present specification including claims, all of the genes to be confirmed in the method of the present invention are represented with the abbreviated names and the accession numbers registered in "UniGene" on Nov. 19, 2003. Since a gene can be specified with the abbreviated name and the accession number registered in "UniGene", those skilled in the art easily confirm a gene in question and its detailed nucleotide sequence by referring to "UniGene" and conduct the present invention. Similarly, as to a nucleic acid sequence of a DNA prove specific for each gene used in the method of the present invention, those skilled in the art can easily determine some candidate sequences for each gene based on the nucleic acid sequence registered in the above database using a homology searching program or the like. Especially, the nucleic acid sequence of the probe of the present invention is not limited unless it is selected such that the probe can be specifically hybridized to a gene corresponding therefor. It is not necessarily to restrict or limit to one nucleic acid sequence. Such a procedure can be made by those skilled in the art without having a need of any specific effort.

[0013] The present inventors studied to search for genes specifically expressed in lung cancer tissues of patients who were underwent non-small cell lung cancer diagnosis or therapy and who were dead within five years thereafter or survival over five or more years thereafter. As the result, they found that there is a specific tendency between a five year survival rate and a gene expression pattern.

[0014] Focusing on genes whose expression amounts were specifically increased or decreased in cancer tissues of the group of patients who were dead within five years after operation or diagnosis as compared with the group of patients who were survival over five years after operation or diagnosis, the present inventors selected predictive genes capable of distinguishing both groups efficiently using a signal-to-noise metrics (Golub et al., Science, Vol. 286, pp. 531 to 537 (1999)). Briefly, if a prognosis favorable patient and a prognosis fatal patient are defined to belong to class 0 and class 1 respectively, a signal-to-noise statistic (Sx) for gene x is calculated as follows:

Sx=(.mu.class 0-.mu.class 1/.delta.class 0+.delta.class 1)

As to each gene, .mu.class 0 means an average of data on total expression strength of patients belonging to class 0 (a group of prognosis favorable patients) and .delta.class 0 means a standard deviation of data on total expression strength of patients belonging to class 0 (a group of prognosis favorable patients). Using the thus-calculated absolute value of Sx, genes ranked higher, i.e. genes showing a significant difference in expression strength between the group of prognosis favorable patients and the group of prognosis fatal patients, were selected.

[0015] In order to assay a statistical significance of a marker gene specific for a different type of cancer, a temple level (prognosis favorable or fatal) of each patient used in the analysis in association with a set of data on gene expression strength were randomly labeled and then the signal-to-noise value (Sx value) was recalculated in accordance with the labels after randomizing. This procedure was repeated 10,000 times. P values were assigned to every genes based on the extent so that Sx value obtained by randomizing the labels was better than Sx value obtained actually.

[0016] When genes to be judged that they are significantly related to a survival rate of patients with a different type of lung cancer, i.e. predictive genes, were searched for among genes expressed in cancer tissues of the patients, the following correlation became clear.

[0017] Thus, an expression pattern such that in many lung cancer tissues of patients who were underwent non-small cell lung cancer diagnosis or therapy and dead within five years thereafter, the expression of each of WEE1 (AA039640), MYC (AA464600), FOSL1 (T82817), LYPLA1(H00817), SSBP1 (R05693), THEM (H59861), NICE-4 (AA054954), PTN (AA001449), SNRPB (AA599116), NAP1L1 (R93829), CTNND1 (AA024656), CCT3 (R60933), DSC2 (AA074677), SPRR1B (AA447835), COPB(AA598868), ARG1(AA453673), ARCN1(AA598401), MST1 (T47813), SERPINE1 (N75719), SERPINB1 (AA486275), ACTR3 (N34974), PTP4A3(AA039851), ISLR (H62387), ANXA1 (1163077), GJA1 (AA487623), HSPE1 (AA448396) and PSMA5 (AA598815) was significantly increased and the expression of each of TITF1 (T60168), SFTPC (AA487571) and EST fragment (N73201) was significantly lowered was observed. Hereinafter, the group comprising the above genes is referred to be a gene group 1.

[0018] Accordingly, by extracting total RNAs from cancer tissues of a patient who was underwent a non-small cell lung cancer diagnosis and confirming the expression strength of at least one gene belonging to the gene group 1, it is possible to predict a five year survival rate of the patient whether the patient would be dead within five years or survival over five or more years.

[0019] For example, when PTP4A3 (AA039851, fatal) is selected as a gene and a five year survival rate is predicted based on the outcome obtained by confirming the expression strength of this gene, an accuracy of 64% can be expected. When WEE1 (AA039640, fatal) or ACTR3 (N34974, fatal) is selected as a gene in addition to PTP4A3 (AA039851, fatal) and a five year survival rate is predicted based on the outcomes obtained by confirming the expression strength of these genes, an accuracy will be 66% or 7.4%. And, based on the outcomes obtained by confirming the expression strength of all genes constituting the gene group 1, an accuracy will reach 82%. The above outcomes have reliability higher than that of the prior method.

[0020] Although non-small cell lung cancer is further classified squamous cell cancer (SQ) and non-squamous cell cancer (non-SQ), the gene group 1 is useful as a gene group selected when a five year survival rate is decided without subdividing the type of lung cancer cells.

[0021] On the other hand, the present inventors confirmed the gene expression strength for squamous cell cancer (SQ) and non-squamous cell cancer (non-SQ) and as the result, they found that a five year survival rate can be decided more accurately by using a gene group different from the gene group 1 as targets.

[0022] Thus, an expression pattern such that in many lung cancer tissues of patients who were underwent squamous cell cancer diagnosis of therapy and dead within five years thereafter, the expression of each of KRT5 (AA160507), PTP4A3 (AA039851), SPRR1B (AA447835), MYST4 (AA057313), SPARCL1 (AA490694), IGJ (T70057), EST fragment (AA115121), ID2 (H82706), THBD (H59861), MGC15476 (W72525), COPB (AA598868), ZYG (AA453289), CACNA1I (N52765), CSTB (1122919), EPB41L1 (R71689), MGC4549 (AA455267), DSC2 (AA074677), IFI30 (AA630800), EST fragment (T81155) and IL1RN(T72877) was significantly increased and the expression of each of FLJ20619 (R74480), SPC12 (R19183), EST fragment (R96358), LOC339324 (W23522), EIF4A2 (H05919), ZFP (H53499), FLJ4623 (N71473), EST fragment (T64878), EST fragment (H79007) and EST fragment (W84776) was significantly lowered was observed. Hereinafter, the group comprising the above genes is referred to be a gene group 2.

[0023] Accordingly, by extracting total RNAs from cancer tissues of a patient who was underwent a squamous cell cancer diagnosis and confirming the expression strength of at least one gene belonging to the gene group 2, it is possible to predict a five year survival rate of the patient whether the patient would be dead within five years or survival over five or more years.

[0024] For example, when CACNAII (N52765, fatal) is selected as a gene and a five year survival rate is predicted based on the outcome obtained by confirming the expression strength of this gene, an accuracy of 81% can be expected. When FLJ20619 (R74480, favorable) is selected as gene in addition to CACNAII (N52765, fatal) and a five year survival rate is predicted based on the outcomes obtained by confirming the expression strength of these genes, an accuracy will be 75% or 81%. And, based on the outcomes obtained by confirming the expression strength of all genes constituting the gene group 2, an accuracy will reach 100%.

[0025] And, an expression pattern such that in many lung cancer tissues of patients who were underwent non-squamous cell cancer diagnosis or therapy and dead within five years thereafter, the expression of each of NICE-4 (AA054954), WEE1 (AA039640), SSBP1 (R05693), G22P1 (AA486311), MST1 (T47$13), PHB (R60946), DRPLA (H08642), SNRBP (AA59911.6), GJA1 (AA487623), ACTR1A (R40850), MYC (AA464600), RAD23B (AA489678), CCT3 (R60933), SERPINE1 (N75719), BIRC2 (R19628), LMAN1 (H73420) HSPE1 (AA448396), EEF1G (R43973), EST fragment (1405820), LYPLA1 (H00817), SOD1 (R52548), ARG1 (AA453673), KRT25A (W73634) and FOSL1 (T82817) was significantly increased and the expression of each of WFDC2 (AA451904), ACTA2 (AA634006), SFTPC (AA487571), LAMP1 (H29077), IRAK1 (AA683550) and TMSB4X (AA634103) was significantly lowered was observed. Hereinafter, the group comprising the above genes is referred to be a gene group 3.

[0026] Accordingly, by extracting total RNAs from cancer tissues of a patient who was underwent a non-squamous cell cancer and confirming the expression strength of at least one gene belonging to the gene group 3, it is possible to predict a five year survival rate of the patient whether the patient would be dead within five years or survival over five or more years.

[0027] For example, when SFTPC (AA487571, favorable) is selected as a gene and a five year survival rate is predicted based on the outcome obtained by confirming the expression strength of this gene, an accuracy of 56% can be expected. When NICE-4 (AA054954, fatal) or GJA1 (AA487623, fatal) is selected as a gene in addition to SFTPC (AA487571, favorable) and a five year survival rate is predicted based on the outcomes obtained by the expression strength of these genes, an accuracy will be 79% or 76%. And, based on the outcomes obtained by the expression strength of all genes constituting the gene group 3, an accuracy will reach 91%.

[0028] As mentioned above, it is preferable to select two or more genes, more preferably all genes belonging to each gene group as targets although only one gene may be freely selected from each gene group and used it.

[0029] Further, the present invention provides information about samples .gamma. obtained from cancer tissues of new patients for deciding whether the patients will be survival or dead based on the above correlation.

[0030] In order to decide whether new patients with lung cancer (test samples .gamma.) will be prognostic favorable or fatal after five years, Vx may be calculated for each gene contained in a set of predictive genes from the equation: Vx=Sx (Gx.sup..gamma.-bx) wherein Sx is the above-mentioned signal-to-noise statistic; Gx.sup..gamma. represents the expression strength of each gene x contained in the set of predictive genes; and bx is calculated from the equation: bx=(.mu.class 0+.mu.class 1)/2. When the sum of Vx (.SIGMA.Vx) for the genes contained in the set of predictive genes is calculated to be plus (+), the patient in question is decided to be "prognosis favorable". When .SIGMA.Vx is calculated to be minus (-), the patient in question is decided to be "prognosis fatal".

BRIEF DESCRIPTION OF DRAWINGS

[0031] FIG. 1 represents the outcomes obtained by predicting patients with non-squamous cell lung cancer using 25 predictive genes in a weighted-voting model.

[0032] FIG. 2 is a survival curve showing the prognosis "favorable" or "fatal" of patients with non-small cell lung cancer.

[0033] FIG. 3 represents the outcomes obtained by predicting patients with non-squamous cell lung cancer using 12 predictive genes in a weighted-voting model.

[0034] FIG. 4 represents the outcomes obtained by predicting patients with squamous cell lung cancer using 19 predictive genes in a weighted-voting model.

[0035] FIG. 5 is a survival curve showing the prognosis "favorable" or "fatal" of patients with non-squamous cell lung cancer.

[0036] FIG. 6 is a survival curve showing the prognosis "favorable" or "fatal" of patients with squamous cell lung cancer.

EFFECT OF THE INVENTION

[0037] By using the method of the present invention, a five year survival rate of patients with lung cancer can be predicted with high accuracy. Therefore, it is possible according to the present invention to predict whether or not a patient with a different type of lung cancer could be survival over five or more years with high accuracy by confirming that a specified gene group is expressed in cancer tissues of the patient.

DISCLOSURE OF THE INVENTION

[0038] Expression strength of each gene belonging to the gene group specified in the present invention can be confirmed by providing a specific probe every nucleotide sequence and conducting PCR or hybridization. The nucleotide sequence of each gene can be easily confirmed from the database "UniGene". And, conditions such as the design of a probe specifically hybridizing to each gene, its synthesis, hybridization and the like can be suitably determined by those skilled in the art without having a need of any specific effort.

[0039] The probe can be synthesized as a set of probes capable of subjecting to PCR reaction for each gene, i.e. PCR primers. The expression strength may be confirmed by conducting PCR reaction using these primers.

[0040] Upon practice of the present method, the expression of a gene is preferably confirmed in the so-called microarray. As an microarray, a glass substrate on which probe DNAs are spotted; a membrane on which probe DNAs are spotted; beads on which probe DNAs are spotted; a glass substrate on which probes are directly synthesized; and the like have been developed. Examples of the microarray include a membrane microarray available from Invitrogen (GeneFilters.TM., Mammalian Microarrays; Catalog #GF200 or GF201). This membrane microarray contains 11168 spots in total of probe DNA corresponding to 8644 independent genes. It is confirmed by Blast search that the sequence of each probe does not occur the so-called cross hybridization even when gene (s) closely related to each sequence is (are) present, Otherwise the expression of such gene(s) is detected erroneously.

[0041] Examples of the microarray available in the present invention include cDNA or oligo-arrays available from Affimetrix, Agilent and other companies, in addition to the membrane microarray available from Invitrogen.

[0042] It is desirable in the present invention to immediately frozen cancer tissues isolated from a patient with lung cancer during thoractomy or by biopsy with an endoscope or the like to prepare a slice, prepare a tissue section by hollowing out minutely regions rich in cancer cells in the slice, extract RNAs from the tissue section according to any standard method and transform all mRNAs expressed in the tissue into a cDNA by acting a reverse transcriptase thereto. In this case, the targeted gene group can be labeled by adding to the cDNA a suitable radioisotope such as .sup.33P and the like or a fluorochrome such as Cy3, Cy5 and the like during the preparation of the cDNA via the reaction with a reverse transcriptase.

[0043] According to the present invention, based on the information about the nucleotide sequence of the gene contained in each gene group, the expression strength of the gene to be detected can be confirmed by hybridization or real time PCR using an oligoDNA specific for each gene to be detected. Preferably the expression of each gene group to be detected is confirmed more easily by combining cDNAs prepared with a reverse transcriptase and a suitable label with a microarray.

[0044] The expression strength of a gene group targeted in the present invention can be confirmed easily by hybridizing a labeled cDNA and a microarray under suitable conditions and then confirming the expression of the genes and their amounts as an index of the label. The expression strength is confirmed by quantifying the strength of a signal produced from the label by a suitable method.

[0045] For example, when a radioactive label is used, a signal strength can be quantified by exposing a hybridized array to an imaging plate (Fuji Photo Film), scanning and imaging using a bioimaging analyzer BAS 5000 (Fuji Photo Film), processing images of the hybridized array using L Process (Fuji Photo Film) and then analyzing using an analytical soft Array Gauge (Fuji Photo Film). Alternatively, the strength of a radioactive label can be quantified using a phospho-imager (Amersham). And, the strength of a fluorescent label can be quantified using a microarray reader (Agilent) or the like.

[0046] The thus-obtained data on label strength are converted to data on hybridization strength, respectively by using, for example, the method of Tseng et al. (Nucleic Acids Res., Vol., 29, pp. 2549 to 2557). Thereafter, a reproducibility in expression is evaluated after normalization, preparation of scatter plots for each gene and the like. Thus, a significant increase or decrease in expression amount of a targeted gene may be evaluated.

EXAMPLES

[0047] The present invention will be described in more detail by referring to the following examples which are not to be construed as limiting the scope of the invention.

Example 1

[0048] In the following example, all procedures using commercially available kits were conducted under conditions as recommended by the manufactures unless otherwise stated.

1) Extraction of Total RNAs from Lung Cancer Tissue

[0049] From each of 50 patients (15 females and 35 males; between the ages of 43 and 76, average age of 63) with non-small cell lung cancer, specifically 30 patients with glandular lung cancer, 16 patients with squamous cell lung cancer and 4 patients with large cell lung cancer (23 patients with stage I, 11 patients with stage II and 16 patients with stage III), lung cancer tissues (0.5 g in average) were isolated. The tissues were embedded in OCT compound and frozen at -80.degree. C., thereby a frozen sample of 7 .mu.m in thickness was prepared. Then, a region rich in cancer cells was carefully excised from the sample to obtain a section having cancer cells accounted for 75.4% in average of cells contained therein. From this section, total RNAs (12 .mu.g in average) were extracted using RNAeasy (Quiagen) and a purity thereof was confirmed using RNA 600 nanoassay kit and 2100 Bioanalyzer (Agilent).

2) Hybridization to Microarray

[0050] 5 micrograms of the total RNAs as prepared in the above 1) was transformed into cDNA using oligo-dT primer (Invitrogen) and Superscript II reverse transcriptase (Invitrogen) by adding 10 .mu.Ci of [.sup.32P] dCTP. GeneFilters (Invitrogen) was prehybridized in 10 ml of AlkPhos DIRECT hybridization buffer (Amersham) containing 0.5 .mu.g/ml of poly-dA (Invitrogen) and 0.5 .mu.g/ml of Cot-1 DNA (Invitrogen) at 51.degree. C. for 2 hours and then hybridized with a modified radiolabeled probe cDNA for 17 hours.

[0051] After hybridizing, the microarray was washed with a solution containing 2M urea, 0.1% SDS, 50 mM sodium phosphate buffer solution (pH 7.0), 150 mM NaCl, 1 mM MgCl.sub.2 and 0.2% AlkPhos DIRECT blocking reagent (Amersham) twice, a solution containing 2 mM MgCl.sub.2, 50 mM Tris and 100 mM NaCl twice ands solution containing 2 mM MgCl.sub.2, 50 mM Tris and 15 mM NaCl twice successively. The microarray was exposed to an imaging plate (Fuji Photo Film) for 2 hours and then the imaging plate was scanned and imaged using a bioimaging analyzer BAS 5000 (Fuji Photo Film) with resolution of 25 .mu.m. The image of the hybridized array was processed with L Process (Fuji Photo Film) and then a signal strength was quantified using an analytical soft Array Gauge (Fuji Photo Film).

3) Data Processing

[0052] The data on signal strength obtained in the above 2) was converted to data on hybridization strength, respectively. First, the method of Tseng et al. (Nucleic Acids Res., Vol. 29, pp. 2549 to 2557) was employed for selecting genes used in the fitting of a non-linear normalization curve. After normalization, scatter plots of 50 sets of replication data on each gene were prepared and a reproducibility of expression between replication pairs was evaluated. Genes showing a Pearson correlation coefficient of 0.85 or higher were selected. An average of the first hybridization and the second hybridization was used for further analysis. In addition, genes not showing a double or half change at at least an expression level were excluded. Genes having a median intensity of less than 0.3 were excluded from the following analysis.

4) Isolation of Gene for Five Year Survival

[0053] Predictive genes distinguishing patients who would be dead within five years after operation or diagnosis (prognosis fatal patients) and patients who would be survival over five years after operation or diagnosis (prognosis favorable patients) most efficiently were selected using a signal-to noise metrics (Golub et al., Science, Vol. 286, pp. 531 to 537 (1999)). Briefly, if a prognosis favorable patient and a prognosis fatal patient are defined to belong to class 0 and class 1 respectively, a signal-to-noise statistic (Sx) is calculated as follows:

Sx=(.mu.class 0-.mu.class 1/.delta.class 0+.delta.class 1)

As to each gene, .mu.class 0 means an average of data on total expression strength of patients belonging to class 0 (the group of prognosis favorable patients) and .delta.class 0 means a standard deviation of data on total expression strength of patients belonging to class 0 (the group of prognosis favorable patients).

[0054] Genes ranked higher based on the absolute value of Sx were selected. In order to predict the outcomes using the thus-selected genes, a weighted-voting classification algorithm was employed. The thus-obtained outcome classifiers were tested using a leave-one-out cross validation. In this scheme, the algorithm can be employed to find decision boundaries between class average and bx=(.mu.class 0+.mu.class 1)/2 for each gene, in addition to the calculation of Sx.

5) Permutation Test

[0055] In order to assay a statistical significance of a marker gene specific for a different type of cancer, a sample level (survival or dead) of each patient used in the analysis together with a set of data on gene expression strength were labeled randomly and then the signal-to-noise value (Sx value) for each gene was recalculated in accordance with the labels after randomizing. This procedure was repeated 10,000 times. P values were assigned to every genes based on the extent so that Sx value obtained by randomizing the labels was better than $x value obtained actually.

6) Construction of Model Predicting Survival Rate of Patients with Non-Small Cell Cancer

[0056] In order to develop an outcome prediction classifier of each patient, a signal-to-noise metrics was employed for selecting a gene distinguishing prognosis favorable patients from prognosis fatal patients most clearly. As the outcomes of a non-supervised hierarchical clustering algorithm using spots ranked top 100 corresponding to unique 98 genes, two major branches representing prognosis favorable patients and prognosis fatal patients were obtained. Among 21 patients with non-small cell cancer, 19 patients (left frame), i.e. the favorable branch, were survival over five years after operation. On the other hand, among 29 patients with non-small cell cancer, 15 patients(right frame),i.e. the fatal branch, were dead within five years after operation. The Kaplan-Meier survival curve reveals statistically significant difference.

[0057] Since our final goal was to develop outcome classifiers at patient level, a supervised learning method was employed. Thus, weighted-voting outcome classifiers were constructed based on the predictive genes preselected using the signal-to-noise metrics. A learning error against each model while increasing the number of predictive genes used was calculated by a leave-one-out cross validation. Among 30 genes constituting the outcome classifiers for non-small cell cancer (Table 1), the weighted-voting model using 25 predictive genes ranked top 25 revealed the highest accuracy such that 41 patients (82%) of 50 patients revealed the outcomes as predicted individually (FIG. 1).

TABLE-US-00001 TABLE 1 Non-small cell cancer accession expression in Rank Gene Description No. lung cancer P bx Sx SEQ ID NO. 1 WEE1 WEE1 homolog AA039640 Up 0.0027 0.483 0.483 SEQ ID NO: 1 2 MYC v-myc viral oncogene homolog AA464600 Up 0.0057 0.479 0.441 SEQ ID NO: 2 3 TITF1 thyroid transcription factor 1 T60168 Down 0.0085 0.452 0.416 SEQ ID NO: 3 4 FOSL1 FOS-like antigen 1 (Fra-1) T82817 Up 0.0062 0.330 0.411 SEQ ID NO: 4 5 LYPLA1 lysophospholipase 1 H00817 Up 0.0081 0.460 0.408 SEQ ID NO: 5 6 SSBP1 single-stranded DNA binding protein R05693 Up 0.0199 0.495 0.406 SEQ ID NO: 6 7 SFTPC surfactant, pulmonary-associated protein C AA487571 Down 0.0113 0.322 0.405 SEQ ID NO: 7 8 THBD thrombomodulin H59861 Up 0.0099 0.466 0.403 SEQ ID NO: 8 9 NICE-4 NICE-4 protein AA054954 Up 0.0099 0.514 0.403 SEQ ID NO: 9 10 PTN pleiotrophin (heparin binding growth factor 8) AA001449 Up 0.0100 0.500 0.401 SEQ ID NO: 10 11 SNRPB small nuclear ribonucleoprotein polypeptides B AA599116 Up 0.0115 0.657 0.394 SEQ ID NO: 11 and B1 13 CTNND1 catenin delta 1 R93829 Up 0.0120 0.513 0.393 SEQ ID NO: 12 12 NAP1L1 nucleosome assembly protein 1-like 1 AA024656 Up 0.0131 0.483 0.384 SEQ ID NO: 13 14 CCT3 chaperonin containing TCP1, subunit 3 R60933 Up 0.0186 0.566 0.378 SEQ ID NO: 14 15 DSC2 desmocollin 2 AA074677 Up 0.0160 0.533 0.374 SEQ ID NO: 15 16 SPRR1B small proline-rich protein 1B (cornifin) AA447835 Up 0.0209 0.421 0.370 SEQ ID NO: 16 17 COPB coatomer protein complex, subunit beta AA598868 Up 0.0195 0.466 0.369 SEQ ID NO: 17 18 ARG1 arginase type I (liver) AA453673 Up 0.0193 0.581 0.369 SEQ ID NO: 18 19 ARCN1 archain 1 (coatomer protein complex, subunit delta) AA598401 Up 0.0169 0.412 0.367 SEQ ID NO: 19 20 MST1 macrophage stimulating 1 T47813 Up 0.0193 0.462 0.366 SEQ ID NO: 20 21 SERPINE1 serine (or cysteine) proteinase inhibitor, clade N75719 Up 0.0194 0.495 0.366 SEQ ID NO: 21 E member 1 22 SERPINB1 serine (or cysteine) proteinase inhibitor, clade AA486275 Up 0.0205 0.556 0.362 SEQ ID NO: 22 B member 1 23 ESTs N73201 Down 0.0205 0.494 0.360 SEQ ID NO: 23 24 ACTR3 actin-related protein 3 homolog (ARP3) N34974 Up 0.0229 0.496 0.358 SEQ ID NO: 24 25 PTP4A3 protein tyrosine phosphatase type 4A, member 3 AA039851 Up 0.0199 0.478 0.357 SEQ ID NO: 25 26 ISLR immunoglobulin superfamily containing leucine-rich H62387 Up 0.0228 0.478 0.356 SEQ ID NO: 26 repeat 27 ANXA1 annexin A1 H63077 Up 0.0262 0.367 0.354 SEQ ID NO: 27 28 GJA1 gap junction protein, alpha 1 AA487623 Up 0.0230 0.406 0.354 SEQ ID NO: 28 29 HSPE1 heat shock 10 kD protein 1 AA448396 Up 0.0273 0.444 0.352 SEQ ID NO: 29 30 PSMA5 proteasome (prosome, macropain) subunit, alpha AA598815 Up 0.0265 0.545 0.346 SEQ ID NO: 30 type, 5

[0058] As to these classifiers, 27 patients of 33 patients (82%) practically survival over five or more years after operation were decided to be "prognosis favorable" and 14 patients of 17 patients (82%) practically dead within five years after operation were decided to be "prognosis fatal". A survival curve of patients for the prediction of "prognosis favorable" or "prognosis fatal" is shown in FIG. 2. This figure reveals the difference between two groups (P=6.0.times.10.sup.-6).

[0059] With the increase in the number of the above genes, another supervised learning algorithm including Support vector machine and k-nearest neighbors was employed. The accuracy of the model is comparable with that of the weighted-voting outcome classifiers, but the latter showed the highest accuracy.

[0060] In order to decide whether new patients with lung cancer (test samples .gamma.) could be prognosis favorable or fatal after five years, Vx may be calculated for each gene contained in the set of predictive genes from the equation: Vx=Sx (Gx.sup..gamma.-bx) wherein Sx is the above-mentioned signal-to-noise statistic; GX.sup..gamma. represents an expression strength of each gene x contained in the set of predictive genes; and bx is calculated from bx=(.mu.class 0+.mu.class 1)/2. When the sum of VX (.SIGMA.Vx) for genes contained in the set of predictive genes is calculated to be plus (+), the patient in question is decided to be "prognosis favorable". When .SIGMA.Vx is calculated to be minus (-), the patient in question is decided to be "prognosis fatal".

[0061] With the increase in the number of the above genes, another supervised learning algorithm including Support vector machine and k-nearest neighbors was employed. The accuracy of the model is comparable with that of the weighted-voting outcome classifiers, but the latter showed the highest accuracy.

7) Construction of Model Predicting Survival Rate Specific for Each of Squamous Cell Cancer and Non-Squamous Cell Cancer

[0062] Squamous cell cancer and non-squamous cell cancer are recognized as diseases distinguishable clinicopathologically each other. Thus, using predictive genes for each subtype selected with the weighted-voting algorithm and the signal-to-noise metrics, outcome prediction classifiers for a different type of cancer were constructed.

[0063] Among 30 genes constituting the outcome classifiers for a different type of cancer (Tables 2 and 3), 12 genes (Table 2) for non-squamous cell cancer and 19 genes (Table 3) for squamous cell cancer revealed the highest accuracy by a leave-one-out cross validation including the increase in the number of predictive genes ranked higher.

TABLE-US-00002 TABLE 2 Non-squamous cell cancer accession expression in Rank Gene Description No. lung cancer P bx Sx SEQ ID NO. 1 NICE-4 NICE-4 protein AA054954 Up 0.0036 0.567 0.604 SEQ ID NO: 9 2 WEE1 WEE1 homolog AA039640 Up 0.0039 0.485 0.567 SEQ ID NO: 1 3 SSBP1 single-stranded DNA binding protein R05693 Up 0.0122 0.466 0.500 SEQ ID NO: 6 4 WFDC2 WAP four-disulfide core domain 2 AA451904 Down 0.0155 0.544 0.489 SEQ ID NO: 56 5 ACTA2 actin, alpha 2, smooth muscle, aorta AA634006 Down 0.0149 0.684 0.487 SEQ ID NO: 57 6 G22P1 thyroid autoantigen 70 kDa (Ku70) AA486311 Up 0.0176 0.519 0.482 SEQ ID NO: 58 7 MST1 macrophage stimulating 1 T47813 Up 0.0153 0.462 0.481 SEQ ID NO: 20 8 PHB prohibitin R60946 Up 0.0219 0.419 0.472 SEQ ID NO: 59 9 DRPLA dentatorubral-pallidoluysian atrophy H08642 Up 0.0238 0.478 0.455 SEQ ID NO: 60 10 SNRPB small nuclear ribonucleoprotein polypeptides B AA599116 Up 0.0192 0.615 0.455 SEQ ID NO: 11 and B1 11 GJA1 gap junction protein, alpha 1 AA487623 Up 0.0268 0.332 0.446 SEQ ID NO: 61 12 SFTPC surfactant, pulmonary-associated protein C AA487571 Down 0.0313 0.350 0.445 SEQ ID NO: 7 13 ACTR1A actin-related protein 1 homolog A R40850 Up 0.0256 0.626 0.444 SEQ ID NO: 62 14 MYC v-myc viral oncogene homolog AA464600 Up 0.0294 0.385 0.434 SEQ ID NO: 2 15 RAD23B RAD23 homolog B AA489678 Up 0.0276 0.495 0.434 SEQ ID NO: 63 16 CCT3 chaperonin containing TCP1, subunit 3 R60933 Up 0.0305 0.548 0.431 SEQ ID NO: 14 17 SERPINE1 serine (or cysteine) proteinase inhibitor, clade N75719 Up 0.0338 0.473 0.424 SEQ ID NO: 21 E member 1 18 LAMP1 lysosomal-associated membrane protein 1 H29077 Down 0.0374 0.382 0.418 SEQ ID NO: 64 19 IRAK1 interleukin-1 receptor-associated kinase 1 AA683550 Down 0.0355 0.199 0.414 SEQ ID NO: 65 20 BIRC2 baculoviral IAP repeat-containing 2 R19628 Up 0.0362 0.359 0.412 SEQ ID NO: 66 21 LMAN1 lectin, mannose-binding, 1 H73420 Up 0.0339 0.409 0.411 SEQ ID NO: 67 22 HSPE1 heat shock 10 kD protein 1 AA448396 up 0.0411 0.406 0.410 SEQ ID NO: 68 23 TMSB4X thymosin, beta 4, X chromosome AA634103 Down 0.0440 0.585 0.404 SEQ ID NO: 69 24 EEF1G eukaryotic translation elongation factor 1 gamma R43973 up 0.0450 0.638 0.404 SEQ ID NO: 70 25 ESTs H05820 Up 0.0492 0.570 0.403 SEQ ID NO: 71 26 LYPLA1 lysophospholipase I H00817 Up 0.0488 0.456 0.401 SEQ ID NO: 5 27 SOD1 superoxide dismutase 1 R52548 Up 0.0477 0.609 0.397 SEQ ID NO: 72 28 ARG1 arginase type I (liver) AA453673 Up 0.0454 0.541 0.396 SEQ ID NO: 18 29 KRT25A type I inner root sheath specific keratin 25 irs1 W73634 Up 0.0534 0.584 0.394 SEQ ID NO: 73 30 FOSL1 FOS-like antigen 1 (Fra-1) T82817 Up 0.0366 0.309 0.391 SEQ ID NO: 4

TABLE-US-00003 TABLE 3 Squamous cell cancer accession expression in Rank Gene Description No. lung cancer P bx Sx SEQ ID NO. 1 FLJ20619 hypothetical protein R74480 Down 0.0068 0.507 0.882 SEQ ID NO: 31 2 SPC12 signal peptidase 12 kDa R19183 Down 0.0087 0.521 0.859 SEQ ID NO: 32 3 ESTs R96358 Down 0.0034 0.448 0.835 SEQ ID NO: 33 4 KRT5 keratin 5 AA160507 Up 0.0046 0.841 0.789 SEQ ID NO: 34 5 PTP4A3 protein tyrosine phosphatase type 4A, member 3 AA039851 Up 0.0104 0.438 0.753 SEQ ID NO: 25 6 SPRR1B small proline-rich protein 1B AA447835 Up 0.0147 0.695 0.730 SEQ ID NO: 16 7 LOC339324 hypothetical protein LOC339324 W23522 Down 0.0171 0.536 0.693 SEQ ID NO: 35 8 MYST4 MYST histone acetyltransferase 4 AA057313 Up 0.0188 0.573 0.691 SEQ ID NO: 36 9 SPARCL1 SPARC-like 1 AA490694 Up 0.0210 0.454 0.682 SEQ ID NO: 37 10 IGJ immunoglobulin J polypeptide T70057 Up 0.0143 0.385 0.681 SEQ ID NO: 38 11 EIF4A2 eukaryotic translation initiation factor 4A, H05919 Down 0.0233 0.750 0.679 SEQ ID NO: 39 isoform 2 12 ESTs AA115121 Up 0.0226 0.412 0.672 SEQ ID NO: 40 13 ID2 inhibitor of DNA binding 2 H82706 Up 0.0214 0.608 0.670 SEQ ID NO: 41 14 THBD thrombomodulin H59861 Up 0.0077 0.636 0.669 SEQ ID NO: 8 15 MGC15476 Thymus expressed gene 3-like W72525 Up 0.0231 0.412 0.665 SEQ ID NO: 42 16 ZFP zinc finger protein H53499 Down 0.0217 0.632 0.659 SEQ ID NO: 43 17 COPB coatomer protein complex, subunit beta AA598868 Up 0.0272 0.527 0.648 SEQ ID NO: 17 18 ZYG ZYG homolog AA453289 Up 0.0237 0.349 0.647 SEQ ID NO: 44 19 CACNA1I calcium channel, voltage-dependent, alpha 1I N52765 Up 0.0312 0.495 0.636 SEQ ID NO: 45 subunit 20 FLJ4623 hypothetical protein N71473 Down 0.0309 0.457 0.632 SEQ ID NO: 46 21 CSTB cystatin B H22919 Up 0.0286 0.762 0.631 SEQ ID NO: 47 22 EPB41L1 erythrocyte membrane protein band 4.1-like 1 R71689 Up 0.0482 0.690 0.613 SEQ ID NO: 48 23 MGC4549 hypothetical protein AA455267 Up 0.0327 0.410 0.606 SEQ ID NO: 49 24 ESTs T64878 Down 0.0406 0.457 0.600 SEQ ID NO: 50 25 DSC2 desmocollin 2 AA074677 Up 0.0407 0.656 0.592 SEQ ID NO: 15 26 ESTs H79007 Down 0.0415 0.363 0.590 SEQ ID NO: 51 27 ESTs W84776 Down 0.0364 0.665 0.587 SEQ ID NO: 52 28 IFI30 interferon, gamma-inducible protein 30 AA630800 Up 0.0415 0.336 0.587 SEQ ID NO: 53 29 ESTs T81155 Up 0.0552 0.633 0.583 SEQ ID NO: 54 30 IL1RN interleukin 1 receptor antagonist T72877 Up 0.0431 0.573 0.578 SEQ ID NO: 55

[0064] These outcomes show that among 34 patients with non-squamous cell cancer, a five year survival rate after operation of 31 patients (91%) was accurately predicted (FIG. 3). Specifically, among 25 patients who were predicted to be "prognosis favorable", 23 patients (92%) were actually survival over five years after operation. Among 9 patients who were decided to be "prognosis fatal", only one patient was survival over five years. The difference between the survival curve of 25 patients who were decided to be "prognosis favorable" and that of 9 patients who were predicted to be "prognosis fatal" was very significant.

Sequence CWU 1

1

7314232DNAHomo sapiens 1aaaattgcgt ttgagtttgc cgcgagccgg gccaatcggt tttgccaacg catgcccacg 60tgctggcgaa caaatgtaaa cacggagatc gtgtgccggg cacttggttt cgtggtgggc 120aactgtgctg ctgtttcttt tggccgcgga caaggtcggc agaggtggac ccctgcttgg 180gagagctctt ctcgctgtgc tgacacccgc ccctaacagt cacccacccc ggggaaataa 240tggggctcgg aggcctcctc ccagccagtg tccagcctaa gcacatcggc tcccgcagtt 300cagaaaggtc ccgaggcccg agtcaccatt tccggctcag acctcgaccc ggaacgtggc 360tgcccactgc cacgcccact acgccccagt ggctcgcccc aggggacgag gggcaagaag 420cggcctccga gggcagcggc cgaaggccat tcggtccctg gctcttccca gctcgcagag 480acccggaagc gctgcccggc cgcctgcccc tcttcagatc ccccagcacc ggaggagcag 540cgagggggct gcgtccaggc cggctttcgg gtcggcttag gcgaatccag ctctcttttg 600cccctcccag aaggcccagc cccgtccggg cggtgttcgg gcggcgccgg gccgggcccc 660ccgccgcccc aggctcgctc ataggcccgg aacaccacag cccgcccaga cttggctggc 720gccgagccgg gggtggagcc agcgggttcc cgccaaaatc gcgtagctgg tccttccccc 780gcgggctacg tcgcgccctc cttttttttt caaacccgga gctgcactgg gattggtgga 840ctgggcactc acgtggttaa cggtcgcggg aagccgcgga gcccgaacct gagactggac 900ctgaggagac ctcagcctcg gtgctcgggc cgccccgcct ctgccggaaa gtccgcgccg 960ccgctgccgc caccgtccgc agcccgagcg ccccggagcc gcaggccgcc gccgcgcaga 1020gacgccgcgg ctgcgactag gcgcgcccag ccgcacgtgg cggacccgcc cccaggcccg 1080cagtgtcctg gaccccgcag gcctccgctc tcctgtcctc ggccccgtcc ccagggccgc 1140gatgagcttc ctgagccgac agcagccgcc gccaccccgc cgcgccgggg cggcctgcac 1200cttgcggcag aagctgatct tctcgccctg cagcgactgt gaggaggagg aagaagagga 1260ggaggaggag ggcagcggcc acagcaccgg ggaggactcg gcctttcaag agcccgactc 1320gccgctgccg cccgcgcgga gccccacgga gcccgggccc gagcgccgcc gctcgcccgg 1380gccggccccc gggagccccg gcgagctgga ggaggacctg ttgctgcccg gcgcctgccc 1440gggcgcggac gaggcgggcg gtggggcgga gggcgactcg tgggaggagg agggcttcgg 1500ctcctcgtcg ccggtcaagt cgccggcggc cccctacttc ctgggtagct ctttctcgcc 1560ggtgcgctgc ggcggcccag gagatgcgtc gccgcggggt tgcggggcgc gccgggcggg 1620cgaaggccgc cgctcgccgc ggccggacca cccgggcacc ccgccacaca agaccttccg 1680caagctgcga ctcttcgaca ccccgcacac gcccaagagt ttgctctcca aagctcgggg 1740aattgattcc agctctgtta aactccgggg tagttctctc ttcatggata cagaaaaatc 1800aggaaaaagg gaatttgatg tgcgacagac tcctcaagtg aatattaatc cttttactcc 1860ggattctttg ttgcttcatt cctcaggaca gtgtcgtcgt agaaagagaa cgtattggaa 1920tgattcctgt ggtgaagaca tggaagccag tgattatgag cttgaagatg aaacaagacc 1980tgctaagaga attacaatta ctgaaagcaa tatgaagtcc cggtatacaa cagaatttca 2040tgagctagag aaaatcggct ctggagaatt tggttctgta tttaagtgtg tgaagaggct 2100ggatggatgc atttatgcca ttaagcgatc aaaaaagcca ttggcgggct ctgttgatga 2160gcagaacgct ttgagagaag tatatgctca tgcagtgctt ggacagcatt ctcatgtagt 2220tcgatatttc tctgcgtggg cagaagatga tcatatgctt atacagaatg aatattgtaa 2280tggtggaagt ttagctgatg ctataagtga aaactacaga atcatgagtt actttaaaga 2340agcagagttg aaggatctcc ttttgcaagt tggccgaggc ttgaggtata ttcattcaat 2400gtctttggtt cacatggata taaaacctag taatattttc atatctcgaa cctcaatccc 2460aaatgctgcc tctgaagaag gagacgaaga tgattgggca tccaacaaag ttatgtttaa 2520aataggtgat cttgggcatg taacaaggat ctccagtcca caagttgaag agggcgatag 2580tcgttttctt gcaaatgaag ttttacagga gaattatacc catctaccaa aagcagatat 2640ttttgcgctt gccctcacag tggtatgtgc tgctggtgct gaacctcttc cgagaaatgg 2700agatcaatgg catgaaatca gacagggtag attacctcgg ataccacaag tgctttccca 2760agaatttaca gagttgctaa aagttatgat tcatccagat ccagagagaa gaccttcagc 2820aatggcactg gtaaagcatt cagtattgct gtccgcttct agaaagagtg cagaacaatt 2880acgaatagaa ttgaatgccg aaaagttcaa aaattcactt ttacaaaaag aactcaagaa 2940agcacagatg gcaaaagctg cagctgagga aagagcactc ttcactgacc ggatggccac 3000taggtccacc acccagagta atagaacatc tcgacttatt ggaaagaaaa tgaaccgctc 3060tgtcagcctt actatatact gagctactcc tttcccacct ccccctgaac actgtgacaa 3120gaggaagcta ggttgaaatc actgatagaa tccagtttgc aattactttc tcgattggtg 3180tcagtagttt tactgattag gacttttatt gtgaattaca gttgaaagct gtattttgat 3240gattgctatg tcaggctttc atctaatctt accagtctgt cttctgtagg atgtgtcact 3300gttggatgtt acaccagcct ttccagggtt aaccactgtg gtggtgtgct gcttatagtt 3360tgctgttgca ttgtaataaa aggtgtcttt ccctgtagtg acctgtaaaa agtactcaag 3420ggctttatta cagacatacc ctccctttga aaagggacat gctaaaagac tcattactac 3480tcagccttca atgtacctgt gtgtccatct tatatttctt tttttttttt aattgtgaat 3540tagacttgta tatcccactg ggagcacttt gtaggcattg catgaaccat gggatgatga 3600ttctgtggag gtattgcctt gtgaatttgc tgctatttta gttttgtctt tgctgtaaac 3660ttgtagcatt aaacaatcat tgttgttaat aggtcttctt tttgaaacaa ttatgtgaaa 3720tgtatagctg cttttgatga aaagcagcta tttgcctttt ttttttttcc tttgaacttt 3780gaagctagtg cattggaaaa atgcaccctt tccctccttt ggaatgctgt attaatgtag 3840tataataatt actggttttg taacttgttc tggtaatgtc cttcccggac tctttttaaa 3900tgtctccccc taagttttat acttgattgt attattagtc tgtttttaaa tgttttgccc 3960ggtttttctc ttcaatattt gtgtatataa accgatcttc gtgatactgt acatagctgt 4020ttgaaatgcc agaatgactt ctgacattcc aagtttttca caaaatatat tttatctgtg 4080attagccatt tgactaataa tactggctaa cagatgttga aaaaaattgt ctgtttgttt 4140tctcattaat tttggtctaa aacatgtttg cacttgtctt tgacttgtgt tttattaaca 4200ttgattggca tattaaaagt cactctgagc tt 423222189DNAHomo sapiens 2gcagagggag cgagcgggcg gccggctagg gtggaagagc cgggcgagca gagctgcgct 60gcgggcgtcc tgggaaggga gatccggagc gaataggggg cttcgcctct ggcccagccc 120tcccgctgat cccccagcca gcggtccgca acccttgccg catccacgaa actttgccca 180tagcagcggg cgggcacttt gcactggaac ttacaacacc cgagcaagga cgcgactctc 240ccgacgcggg gaggctattc tgcccatttg gggacacttc cccgccgctg ccaggacccg 300cttctctgaa aggctctcct tgcagctgct tagacgctgg atttttttcg ggtagtggaa 360aaccagcagc ctcccgcgac gatgcccctc aacgttagct tcaccaacag gaactatgac 420ctcgactacg actcggtgca gccgtatttc tactgcgacg aggaggagaa cttctaccag 480cagcagcagc agagcgagct gcagcccccg gcgcccagcg aggatatctg gaagaaattc 540gagctgctgc ccaccccgcc cctgtcccct agccgccgct ccgggctctg ctcgccctcc 600tacgttgcgg tcacaccctt ctcccttcgg ggagacaacg acggcggtgg cgggagcttc 660tccacggccg accagctgga gatggtgacc gagctgctgg gaggagacat ggtgaaccag 720agtttcatct gcgacccgga cgacgagacc ttcatcaaaa acatcatcat ccaggactgt 780atgtggagcg gcttctcggc cgccgccaag ctcgtctcag agaagctggc ctcctaccag 840gctgcgcgca aagacagcgg cagcccgaac cccgcccgcg gccacagcgt ctgctccacc 900tccagcttgt acctgcagga tctgagcgcc gccgcctcag agtgcatcga cccctcggtg 960gtcttcccct accctctcaa cgacagcagc tcgcccaagt cctgcgcctc gcaagactcc 1020agcgccttct ctccgtcctc ggattctctg ctctcctcga cggagtcctc cccgcagggc 1080agccccgagc ccctggtgct ccatgaggag acaccgccca ccaccagcag cgactctgag 1140gaggaacaag aagatgagga agaaatcgat gttgtttctg tggaaaagag gcaggctcct 1200ggcaaaaggt cagagtctgg atcaccttct gctggaggcc acagcaaacc tcctcacagc 1260ccactggtcc tcaagaggtg ccacgtctcc acacatcagc acaactacgc agcgcctccc 1320tccactcgga aggactatcc tgctgccaag agggtcaagt tggacagtgt cagagtcctg 1380agacagatca gcaacaaccg aaaatgcacc agccccaggt cctcggacac cgaggagaat 1440gtcaagaggc gaacacacaa cgtcttggag cgccagagga ggaacgagct aaaacggagc 1500ttttttgccc tgcgtgacca gatcccggag ttggaaaaca atgaaaaggc ccccaaggta 1560gttatcctta aaaaagccac agcatacatc ctgtccgtcc aagcagagga gcaaaagctc 1620atttctgaag aggacttgtt gcggaaacga cgagaacagt tgaaacacaa acttgaacag 1680ctacggaact cttgtgcgta aggaaaagta aggaaaacga ttccttctaa cagaaatgtc 1740ctgagcaatc acctatgaac ttgtttcaaa tgcatgatca aatgcaacct cacaaccttg 1800gctgagtctt gagactgaaa gatttagcca taatgtaaac tgcctcaaat tggactttgg 1860gcataaaaga acttttttat gcttaccatc tttttttttt ctttaacaga tttgtattta 1920agaattgttt ttaaaaaatt ttaagattta cacaatgttt ctctgtaaat attgccatta 1980aatgtaaata actttaataa aacgtttata gcagttacac agaatttcaa tcctagtata 2040tagtacctag tattataggt actataaacc ctaatttttt ttatttaagt acattttgct 2100ttttaaagtt gatttttttc tattgttttt agaaaaaata aaataactgg caaatatatc 2160attgagccaa aaaaaaaaaa aaaaaaaaa 218932352DNAHomo sapiens 3gaaacttaaa ggtgtttacc ttgtcatcag catgtaagct aattatctcg ggcaagatgt 60aggcttctat tgtcttgttg ctttagcgct tacgccccgc ctctggtggc tgcctaaaac 120ctggcgccgg gctaaaacaa acgcgaggca gcccccgagc ctccactcaa gccaattaag 180gaggactcgg tccactccgt tacgtgtaca tccaacaaga tcggcgttaa ggtaacacca 240gaatatttgg caaagggaga aaaaaaaagc agcgaggctt cgccttcccc ctctcccttt 300tttttcctcc tcttccttcc tcctccagcc gccgccgaat catgtcgatg agtccaaagc 360acacgactcc gttctcagtg tctgacatct tgagtcccct ggaggaaagc tacaagaaag 420tgggcatgga gggcggcggc ctcggggctc cgctggcggc gtacaggcag ggccaggcgg 480caccgccaac agcggccatg cagcagcacg ccgtggggca ccacggcgcc gtcaccgccg 540cctaccacat gacggcggcg ggggtgcccc agctctcgca ctccgccgtg gggggctact 600gcaacggcaa cctgggcaac atgagcgagc tgccgccgta ccaggacacc atgaggaaca 660gcgcctctgg ccccggatgg tacggcgcca acccagaccc gcgcttcccc gccatctccc 720gcttcatggg cccggcgagc ggcatgaaca tgagcggcat gggcggcctg ggctcgctgg 780gggacgtgag caagaacatg gccccgctgc caagcgcgcc gcgcaggaag cgccgggtgc 840tcttctcgca ggcgcaggtg tacgagctgg agcgacgctt caagcaacag aagtacctgt 900cggcgccgga gcgcgagcac ctggccagca tgatccacct gacgcccacg caggtcaaga 960tctggttcca gaaccaccgc tacaaaatga agcgccaggc caaggacaag gcggcgcagc 1020agcaactgca gcaggacagc ggcggcggcg ggggcggcgg gggcaccggg tgcccgcagc 1080agcaacaggc tcagcagcag tcgccgcgac gcgtggcggt gccggtcctg gtgaaagacg 1140gcaaaccgtg ccaggcgggt gcccccgcgc cgggcgccgc cagcctacaa ggccacgcgc 1200agcagcaggc gcagcaccag gcgcaggccg cgcaggcggc ggcagcggcc atctccgtgg 1260gcagcggtgg cgccggcctt ggcgcacacc cgggccacca gccaggcagc gcaggccagt 1320ctccggacct ggcgcaccac gccgccagcc ccgcggcgct gcagggccag gtatccagcc 1380tgtcccacct gaactcctcg ggctcggact acggcaccat gtcctgctcc accttgctat 1440acggtcggac ctggtgagag gacgccgggc cggccctagc ccagcgctct gcctcaccgc 1500ttccctcctg cccgccacac agaccaccat ccaccgctgc tccacgcgct tcgacttttc 1560ttaacaacct ggccgcgttt agaccaagga acaaaaaaac cacaaaggcc aaactgctgg 1620acgtctttct ttttttcccc ccctaaaatt tgtgggtttt tttttttaaa aaaagaaaat 1680gaaaaacaac caagcgcatc caatctcaag gaatctttaa gcagagaagg gcataaaaca 1740gctttggggt gtcttttttt ggtgattcaa atgggttttc cacgctaggg cggggcacag 1800attggagagg gctctgtgct gacatggctc tggactctaa agaccaaact tcactctggg 1860cacactctgc cagcaaagag gactcgcttg taaataccag gatttttttt tttttttgaa 1920gggaggacgg gagctgggga gaggaaagag tcttcaacat aacccacttg tcactgacac 1980aaaggaagtg ccccctcccc ggcaccctct ggccgcctag gctcagcggc gaccgccctc 2040cgcgaaaata gtttgtttaa tgtgaacttg tagctgtaaa acgctgtcaa aagttggact 2100aaatgcctag tttttagtaa tctgtacatt ttgttgtaaa aagaaaaacc actcccagtc 2160cccagccctt cacatttttt atgggcattg acaaatctgt gtatattatt tggcagtttg 2220gtatttgcgg cgtcagtctt tttctgttgt aacttatgta gatatttggc ttaaatatag 2280ttcctaagaa gcttctaata aattatacaa attaaaaaga ttctttttct gattaaaaaa 2340aaaaaaaaaa aa 23524431DNAHomo sapiensmisc_feature(353)..(353)n is a, c, g, or t 4cagcagcgga gacccatcct ctgaccccct tggctctcca accctcctcg ctttgtgagg 60cacccgagcc ttactccctg caggtgccac cctaagcaac gtctgctccc cttcccccac 120cagtccagct ggcctggaca gtatcccata cccaactcca gcagctgctt ctccatccct 180ctaatgagac taaccatatt gtgcttcaca gtagagccag cttggggcca ccaaagctgc 240ccattgtttc tctaggagct gggcctctct aggcacaatt tggcactaaa tcaggaggac 300aaaatatttt cccatttctg gccggaggaa ttccggggga ggcccaggag gantttgtta 360ggattcctta ggagggtcct ctggggaggc cctaaaccct ttccagattc attggccaca 420tttttcccnt c 4315292DNAHomo sapiensmisc_feature(286)..(286)n is a, c, g, or t 5gtttttgatg cagacataaa aatagcaatc attttaaatt gtcaaaattt ccagattact 60ggtaaaaatt atttgaaaac aaacttatgg gtaataaagg ctagtcagaa ccctatacca 120taaagtgtag ttaccataca gattaatatg tagcaaaaat gtatgcttga tatttctcaa 180ctgtgttaat ttttctgctg tattccagct gaccaaaaca atattaagaa tgcatcttta 240taaatggggt gctaattgat aatgggaaat aatttaggta atgggnctat ac 2926400DNAHomo sapiensmisc_feature(1)..(1)n is a, c, g, or t 6ngaagggata gccagcgcga aggaagtnct ggagtcgtgt gttttggctg cgcgtgatcc 60tgcgtgggtc gggaggtgtt tctgtgtagg tntctggccc tttnatcagt cgtgcggagg 120accgcgtgat ttccttccag ttctnctcgg ntttcangaa aagcctaaag attagactnt 180aagaaaagan aatagaagcc atgtttcgaa gacctgtatt acaggtactt cgtcagtttc 240taagacatga gtcccganac aactaccagt ttggttcttn gaaagatccc tggaatgcac 300tttnctttng gcccaggtng ggtcagggac cctgtctttt taggacaggn tcggaaggga 360aaaaaatccc agttcacaat antttttntc ttaggcaact 4007859DNAHomo sapiens 7acaggagagc atagcacctg cagcaagatg gatgtgggca gcaaagaggt cctgatggag 60agcccgccgg actactccgc agctccccgg ggccgatttg gcattccctg ctgcccagtg 120cacctgaaac gccttcttat cgtggtggtg gtggtggtcc tcatcgtcgt ggtgattgtg 180ggagccctgc tcatgggtct ccacatgagc cagaaacaca cggagatggt tctggagatg 240agcattgggg cgccggaagc ccagcaacgc ctggccctga gtgagcacct ggttaccact 300gccaccttct ccatcggctc cactggcctc gtggtgtatg actaccagca gctgctgatc 360gcctacaagc cagcccctgg cacctgctgc tacatcatga agatagctcc agagagcatc 420cccagtcttg aggctctcaa tagaaaagtc cacaacttcc agatggaatg ctctctgcag 480gccaagcccg cagtgcctac gtctaagctg ggccaggcag aggggcgaga tgcaggctca 540gcaccctccg gaggggaccc ggccttcttg ggcatggccg tgaacaccct gtgtggcgag 600gtgccgctct actacatcta ggacgcctcc ggtgagcagg gtcagtggaa gccccaacgg 660gaaaggaaac gccccgggca aagggtcttt tgcagctttt gcagacgggc aagaagctgc 720ttctgcccac accgcaggga caaaccctgg agaaatggga gcttggggag aggatgggag 780tgggcagagg tggcacccag gggcccggga actcctgcca caacagaata aagcagcctg 840atttgaaaag caaaaaaaa 85984050DNAHomo sapiens 8cttgcaatcc aggctttcct tggaagtggc tgtaacatgt atgaaaagaa agaaaggagg 60accaagagat gaaagagggc tgcacgcgtg ggggcccgag tggtgggcgg ggacagtcgt 120cttgttacag gggtgctggc cttccctggc gcctgcccct gtcggccccg cccgagaacc 180tccctgcgcc agggcagggt ttactcatcc cggcgaggtg atcccatgcg cgagggcggg 240cgcaagggcg gccagagaac ccagcaatcc gagtatgcgg catcagccct tcccaccagg 300cacttccttc cttttcccga acgtccaggg agggagggcc gggcacttat aaactcgagc 360cctggccgat ccgcatgtca gaggctgcct cgcaggggct gcgcgcacgg caagaagtgt 420ctgggctggg acggacagga gaggctgtcg ccatcggcgt cctgtgcccc tctgctccgg 480cacggccctg tcgcagtgcc cgcgctttcc ccggcgcctg cacgcggcgc gcctgggtaa 540catgcttggg gtcctggtcc ttggcgcgct ggccctggcc ggcctggggt tccccgcacc 600cgcagagccg cagccgggtg gcagccagtg cgtcgagcac gactgcttcg cgctctaccc 660gggccccgcg accttcctca atgccagtca gatctgcgac ggactgcggg gccacctaat 720gacagtgcgc tcctcggtgg ctgccgatgt catttccttg ctactgaacg gcgacggcgg 780cgttggccgc cggcgcctct ggatcggcct gcagctgcca cccggctgcg gcgaccccaa 840gcgcctcggg cccctgcgcg gcttccagtg ggttacggga gacaacaaca ccagctatag 900caggtgggca cggctcgacc tcaatggggc tcccctctgc ggcccgttgt gcgtcgctgt 960ctccgctgct gaggccactg tgcccagcga gccgatctgg gaggagcagc agtgcgaagt 1020gaaggccgat ggcttcctct gcgagttcca cttcccagcc acctgcaggc cactggctgt 1080ggagcccggc gccgcggctg ccgccgtctc gatcacctac ggcaccccgt tcgcggcccg 1140cggagcggac ttccaggcgc tgccggtggg cagctccgcc gcggtggctc ccctcggctt 1200acagctaatg tgcaccgcgc cgcccggagc ggtccagggg cactgggcca gggaggcgcc 1260gggcgcttgg gactgcagcg tggagaacgg cggctgcgag cacgcgtgca atgcgatccc 1320tggggctccc cgctgccagt gcccagccgg cgccgccctg caggcagacg ggcgctcctg 1380caccgcatcc gcgacgcagt cctgcaacga cctctgcgag cacttctgcg ttcccaaccc 1440cgaccagccg ggctcctact cgtgcatgtg cgagaccggc taccggctgg cggccgacca 1500acaccggtgc gaggacgtgg atgactgcat actggagccc agtccgtgtc cgcagcgctg 1560tgtcaacaca cagggtggct tcgagtgcca ctgctaccct aactacgacc tggtggacgg 1620cgagtgtgtg gagcccgtgg acccgtgctt cagagccaac tgcgagtacc agtgccagcc 1680cctgaaccaa actagctacc tctgcgtctg cgccgagggc ttcgcgccca ttccccacga 1740gccgcacagg tgccagatgt tttgcaacca gactgcctgt ccagccgact gcgaccccaa 1800cacccaggct agctgtgagt gccctgaagg ctacatcctg gacgacggtt tcatctgcac 1860ggacatcgac gagtgcgaaa acggcggctt ctgctccggg gtgtgccaca acctccccgg 1920taccttcgag tgcatctgcg ggcccgactc ggcccttgcc cgccacattg gcaccgactg 1980tgactccggc aaggtggacg gtggcgacag cggctctggc gagcccccgc ccagcccgac 2040gcccggctcc accttgactc ctccggccgt ggggctcgtg cattcgggct tgctcatagg 2100catctccatc gcgagcctgt gcctggtggt ggcgcttttg gcgctcctct gccacctgcg 2160caagaagcag ggcgccgcca gggccaagat ggagtacaag tgcgcggccc cttccaagga 2220ggtagtgctg cagcacgtgc ggaccgagcg gacgccgcag agactctgag cggcctccgt 2280ccaggagcct ggctccgtcc aggagctgtg cctcctcacc cccagctttg ctaccaaagc 2340accttagctg gcattacagc tggagaagac cctccccgca ccccccaagc tgttttcttc 2400tattccatgg ctaactggcg agggggtgat tagagggagg agaatgagcc tcggcctctt 2460ccgtgacgtc actggaccac tgggcaatga tggcaatttt gtaacgaaga cacagactgc 2520gatttgtccc aggtcctcac taccgggcgc aggagggtga gcgttattgg tcggcagcct 2580tctgggcaga ccttgacctc gtgggctagg gatgactaaa atatttattt tttttaagta 2640tttaggtttt tgtttgtttc ctttgttctt acctgtatgt ctccagtatc cactttgcac 2700agctctccgg tctctctctc tctacaaact cccacttgtc atgtgacagg taaactatct 2760tggtgaattt ttttttccta gccctctcac atttatgaag caagccccac ttattcccca 2820ttcttcctag ttttctcctc ccaggaactg ggccaactca cctgagtcac cctacctgtg 2880cctgacccta cttcttttgc tcatctagct gtctgctcag acagaacccc tacatgaaac 2940agaaacaaaa acactaaaaa taaaaatggc catttgcttt ttcaccagat ttgctaattt 3000atcctgaaat ttcagattcc cagagcaaaa taattttaaa caaagggttg agatgtaaaa 3060ggtattaaat tgatgttgct ggactgtcat agaaattaca cccaaagagg tatttatctt 3120tacttttaaa cagtgagcct gaattttgtt gctgttttga tttgtactga aaaatggtaa 3180ttgttgctaa tcttcttatg caatttcctt ttttgttatt attacttatt tttgacagtg 3240ttgaaaatgt tcagaaggtt gctctagatt gagagaagag acaaacacct cccaggagac 3300agttcaagaa agcttcaaac tgcatgattc atgccaatta gcaattgact gtcactgttc 3360cttgtcactg gtagaccaaa ataaaaccag ctctactggt cttgtggaat tgggagcttg 3420ggaatggatc ctggaggatg cccaattagg gcctagcctt aatcaggtcc tcagagaatt 3480tctaccattt cagagaggcc ttttggaatg tggcccctga acaagaattg gaagctgccc 3540tgcccatggg agctggttag aaatgcagaa tcctaggctc caccccatcc agttcatgag 3600aatctatatt taacaagatc tgcagggggt gtgtctgctc agtaatttga ggacaaccat 3660tccagactgc ttccaatttt ctggaataca tgaaatatag atcagttata agtagcaggc 3720caagtcaggc ccttattttc aagaaactga ggaattttct ttgtgtagct ttgctctttg 3780gtagaaaagg ctaggtacac agctctagac

actgccacac agggtctgca aggtctttgg 3840ttcagctaag ctaggaatga aatcctgctt cagtgtatgg aaataaatgt atcatagaaa 3900tgtaactttt gtaagacaaa ggttttcctc ttctattttg taaactcaaa atatttgtac 3960atagttattt atttattgga gataatctag aacacaggca aaatccttgc ttatgacatc 4020acttgtacaa aataaacaaa taacaatgtg 40509466DNAHomo sapiensmisc_feature(155)..(155)n is a, c, g, or t 9tttttttttt tttttttttt taagtctcct tctttattat taggaaaaca acaacaacaa 60caaacaaaaa aatggcgtca tgaatatgaa cagcattgtc agatgaatta gttgaagtgg 120tttttttttt gttttttttt ttttttttgt actgngtcct caaatttaat ggattaatgt 180gtcttgtata tataaaaaga aaacctctac cttcagcctc tgcctattct tgctccgtct 240aggacatccn caatttcgtc gatgaccagc ttggtgaata agtattactg taccaactgg 300gcctcctcta gcaggcccct gaaggcagtg gaataaaatg aaatcttcgc cctttaagaa 360ctcctgacct taatgtggta gtagtatctt gtccttgagg ggatttcctt cccctcaccc 420ctaagacttt cacaacctgg tgactggaaa gaaccaccac naatcc 46610470DNAHomo sapiensmisc_feature(208)..(208)n is a, c, g, or t 10aacaaatgct tctgccaaag tgaaagaatt ttatgtctta atgcttttct ttaaaaaaaa 60aaaaagtcaa cattgaacta ggacatgctc tgcttcccca cccccatttt gctgactaca 120ttttaaaaaa tctattggca gaaaacaaga tattttcttc aaatagagtg attatgtttt 180attgctattt tgtttagtat atattttnct caattgggaa aaaaatctag gtgaaaaaaa 240ttacctaaca agagaagtag tttacatagt cataacattt aaatttgctg cccaaaaaat 300gtaaaanaat ttnaatgtaa aatgtcacat antttcaaaa aacttacctc aattgtctat 360catttatcat gtactataag tcaacttcct aaataagatt cagtccttta ttataagccc 420ctactggtac catngtatac attaaaaacg ctnctccaaa atttcctggc 470111007DNAHomo sapiens 11aactccaggg ctagtgagct ggaccggaag taggtttcta cccgaccgca ttttacgtgg 60tgctgcattt ccggtagcgg cggcgggaaa tcggctgtgg gagagaggct aggcctctga 120ggaggcgaat ccggcgggta tcagagccat cagaaccgcc accatgacgg tgggcaagag 180cagcaagatg ctgcagcata ttgattacag gatgaggtgc atcctgcagg acggccggat 240cttcattggc accttcaagg cttttgacaa gcacatgaat ttgatcctct gtgactgtga 300tgagttcaga aagatcaagc caaagaactc caaacaagca gaaagggaag agaagcgagt 360cctcggtctg gtgctgctgc gaggggagaa tctggtctca atgacagtag agggacctcc 420tcccaaagat actggtattg ctcgagttcc acttgctgga gctgccgggg gcccagggat 480cggcagggct gctggcagag gaatcccagc tggggttccc atgccccagg ctcctgcagg 540acttgctggg ccagtccgtg gggttggcgg gccatcccaa caggtgatga ccccacaagg 600aagaggtact gttgcagccg ctgcagctgc tgccacagcc agtattgccg gggctccaac 660ccagtaccca cctggccgtg ggggtcctcc cccacctatg ggccgaggag caccccctcc 720aggcatgatg ggcccacctc ctggtatgag acctcctatg ggtcccccaa tggggatccc 780ccctggaaga gggactccaa tgggcatgcc ccctccggga atgcggcctc ctccccctgg 840gatgcgaggg ccccctcccc cgggaatgcg cccaccaagg ccctagactc atcttggccc 900tcctcagctc cctgcctgtt tcccgtaagg ctgtacatag tccttttatc tccttgtggc 960ctatgaaact ggtttataat aaactcttaa gagaacatta taattgc 1007123582DNAHomo sapiens 12ctgctcgcgg cgccgcctcc tgctcctccc gctgctgctg ccgctgccgc cctgagtcac 60tgcctgcgca gctccggccg cctggctccc catactagtc gccgatattt ggagttctta 120caacatggca gacattgaca acaaagaaca gtctgaactt gatcaagatt tggatgatgt 180tgaagaagta gaagaagagg aaactggtga agaaacaaaa ctcaaagcac gtcagctaac 240tgttcagatg atgcaaaatc ctcagattct tgcagccctt caagaaagac ttgatggtct 300ggtagaaaca ccaacaggat acattgaaag cctgcctagg gtagttaaaa gacgagtgaa 360tgctctcaaa aacctgcaag ttaaatgtgc acagatagaa gccaaattct atgaggaagt 420tcacgatctt gaaaggaagt atgctgttct ctatcagcct ctatttgata agcgatttga 480aattattaat gcaatttatg aacctacgga agaagaatgt gaatggaaac cagatgaaga 540agatgagatt tcggaggaat tgaaagaaaa ggccaagatt gaagatgaga aaaaggatga 600agaaaaagaa gaccccaaag gaattcctga attttggtta actgttttta agaatgttga 660cttgctcagt gatatggttc aggaacacga tgaacctatt ctgaagcact tgaaagatat 720taaagtgaag ttctcagatg ctggccagcc tatgagtttt gtcttagaat ttcactttga 780acccaatgaa tattttacaa atgaagtgct gacaaagaca tacaggatga ggtcagaacc 840agatgattct gatccctttt cttttgatgg accagaaatt atgggttgta cagggtgcca 900gatagattgg aaaaaaggaa agaatgtcac tttgaaaact attaagaaga agcagaaaca 960caagggacgt gggacagttc gtactgtgac taaaacagtt tccaatgact ctttctttaa 1020cttttttgcc cctcctgaag ttcctgagag tggagatctg gatgatgatg ctgaagctat 1080ccttgctgca gacttcgaaa ttggtcactt tttacgtgag cgtataatcc caagatcagt 1140gttatatttt actggagaag ctattgaaga tgatgatgat gattatgatg aagaaggtga 1200agaagcggat gaggaagggg aagaagaagg agatgaggaa aatgatccag actatgaccc 1260aaagaaggat caaaacccag cagagtgcaa gcagcagtga agcaggatgt atgtggcctt 1320gaggataacc tgcactgtaa tagcctaaac acaactctta tttacttaca gccttatgtt 1380tttgtatttt cttggtagac taggtaattt ttttttaaag gacaggaaac tgatatttta 1440aagaccaatt tgttctacct agcattttaa ctagtttttc tgccagctat gttgaatgca 1500caaattctgt cacgcatgtt cattcattgc tacataattt ggttcttctg gaatattttt 1560atgtagctct tggagtacag ctatgaaaat taacaactgt taaaggaaat accttttttt 1620tttttttgta attttttcct tgaagaacca aagtattttt tcagctggtt gttgaatagg 1680gttaagtccg cttggattag ctgtgccttt cattactttg ttacagaaat gcagtgactt 1740atactaagac aatttattgt ttaaaaaaaa aattggcaag acaactatat ggttaagaat 1800ttccagtatg accacaccca ataactgtta ttagagtgtt aatggattat tgtgttttag 1860gtgacatagt taactgtaaa gtaacctgac tcagtatagt tactggtacc acagtgaggt 1920gaataaaacg ggattttcag aagttagcct gaatttaact gtatttttaa atttaacctc 1980cattaactaa gcatcttttc tttgtggtag ggtctacctt ctgcttccct ggaaaggatg 2040aatttacatc atttgacaag cctattttca agttatttgt tgtttgtttg cttgtttttg 2100tttttgcagc taaaataaaa atttcaaata caattttagt tcttacaaga taatgtctta 2160attttgtacc aattcaggta gaagtagagg cctaccttga attaagggtt atactcagtt 2220tttaacacat tgttgaagaa aaggtaccag ctttggaacg agatgctata ctaataagca 2280agtgtaaaaa aaaaaaaaaa agaggaagaa aatcttaagt gattgatgct gttttctttt 2340aaaaaaaaaa aaaaaaattc attttctttg ggttagagct agagagaagg ccccaagctt 2400ctatggtttc ttctaattct tattgcttaa agtatgagta tgtcacttac ccgtgcttct 2460gtttactgtg taattaaaat gggtagtact gtttacctaa ctacctcatg gatgtgttaa 2520ggcatattga gttaaatctc atataatgtt tctcaatctt gttaaaagct caaaattttg 2580ggcctatttg taatgccagt gtgacactaa gcattttgtt cacaccacgc tttgataact 2640aaactggaaa acaaaggtgt taagtacctc tgttctggat ctgggcagtc agcactcttt 2700ttagatcttt gtgtggctcc tatttttata gaagtggagg gatgcactat ttcacaaggt 2760ccaagatttg ttttcagata tttttgatga ctgtattgta aatactacag ggatagcact 2820atagtattgt agtcatgaga cttaaagtgg aaataagact atttttgaca aaagatgcca 2880ttaaatttca gactgtagag ccacatttac aatacctcag gctaattact gttaattttg 2940gggttgaact ttttttgaca gtgagggtgg attattggat tgtcattaga ggaaggtcta 3000gatttcctgc tcttaataaa attacattga attgattttt agaggtaatg aaaacttcct 3060ttctgagaag ttagtgttaa ggtcttggaa tgtgaacaca ttgtttgtag tgctatccat 3120tcctctcctg agattttaac ttactactgg aaatccttaa ccaattataa tagctttttt 3180tctttatttt caaaatgatt tcctttgctt tgattagaca ctatgtgctt ttttttttta 3240accatagttc atcgaaatgc agctttttct gaacttcaaa gatagaatcc catttttaat 3300gaactgaagt agcaaaatca tctttttcat tctttaggaa atagctattg ccaaagtgaa 3360ggtgtagata atacctagtc ttgttacata aaggggatgt ggtttgcaga agaattttct 3420ttataaaatt gaagttttaa gggacgtcag tgtttatgcc atttttccag ttccaaaatg 3480attccattcc attctagaaa tttgaagtat gtaacctgaa atccttaata aaatttggat 3540ttaattttat aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 3582136232DNAHomo sapiens 13ctgccagatc agtttgtcac cacccaggct cccttgcctt tggctgggtg caacttccat 60tttaggtgtt ggatctgagg gggaaaaaaa agagagaggg agagagagag aaagaagagc 120aggaaagatc ccgaaaggag gaagaggtgg cgaaaaatca actgccctgc tggatttgtc 180tttctcagca ccttggcgaa gccttgggtt tctttcttaa aggactgatt tttagaactc 240cacatttgag gtgtgtggct tttgaagaaa atgtatgtac tgacgggaaa aggaagataa 300gcaagtcgaa tttttgtctt acgctctctc cttcctgctt cctccttgct gtggtggctg 360ggatgctcct tccatgattt tttgaatcta gactgggctg ttctctgtgt taaaccaatc 420agttgcgacc ttctcttaac agtgtgaagt gagggggtct ctctccctcc ttctccttcc 480tctgtgattc accttccttt ttaccctgcc ctgcggcggc tccgcccctt accttcatgg 540acgactcaga ggtggagtcg accgccagca tcttggcctc tgtgaaggaa caagaggccc 600agtttgagaa gctgacccgg gcgctggagg aggaacggcg ccacgtctcg gcgcagctgg 660aacgcgtccg ggtctcacca caagatgcca acccactcat ggccaacggc acactcaccc 720gccggcatca gaacggccgg tttgtgggcg atgctgacct tgaaagacag aaattttcag 780atttgaaact caacggaccc caggatcaca gtcaccttct atatagcacc atccccagga 840tgcaggagcc ggggcagatt gtggagacct acacggagga ggatcctgag ggagccatgt 900ctgtagtctc tgtggagacc tcagatgatg ggaccactcg gcgcacagag accacggtca 960agaaagtagt gaagactgtg acaacacgga cagtacagcc agtcgctatg ggaccagacg 1020ggttgcctgt ggatgcttca tcagtttcta acaactatat ccagactttg ggtcgtgatt 1080tccgcaagaa tggcaatggg ggacctggtc cctatgtggg gcaagctggc actgctaccc 1140ttcctaggaa cttccactac cctcctgatg gttatagtcg ccactatgaa gatggttatc 1200caggtggcag tgataactat ggcagtctgt cccgggtgac ccgcattgag gagcggtata 1260ggcccagcat ggaaggctac cgggcaccta gtagacagga tgtgtatggg ccccaacccc 1320aggttcgggt aggtgggagc agcgtggatc tgcatcgctt tcatccagag ccttatgggc 1380tagaggatga ccagcgtagt atgggctatg atgacctgga ttatggtatg atgtctgatt 1440atggcactgc ccgtcggact gggacaccct ctgaccctcg tcggcgcctc aggagctatg 1500aagacatgat tggtgaggag gtgccatcgg atcaatacta ctgggctcct ttggcccagc 1560atgagcgagg aagtttagca agcttggata gcctgcgcaa aggagggcct ccacctccta 1620attggagaca gccagagctg ccagaggtga tcgccatgct tggattccgc ttggatgctg 1680tcaagtccaa tgcagctgca tacctgcaac acttatgcta ccgcaatgac aaggtgaaga 1740ctgacgtgcg gaagctcaag ggcatcccag tactggtggg attgttagac catcccaaaa 1800aggaagtgca ccttggagcc tgtggagctc tcaagaatat ctcttttgga cgtgaccagg 1860ataacaagat tgccataaaa aactgtgatg gtgtgcctgc ccttgtgcga ttgcttcgaa 1920aggctcgtga tatggacctt actgaagtta ttaccggaac cctgtggaat ctttcatccc 1980atgactcaat caaaatggag attgtggacc atgcactgca tgccttgaca gatgaagtga 2040tcattcctca ttctggttgg gagcgggaac ctaatgaaga ctgtaagcca cgccatattg 2100agtgggaatc ggtgctcacc aacacagctg gctgccttag gaatgtaagc tcagagagga 2160gtgaagctcg ccggaaactt cgggaatgtg atggtttagt tgatgccctc attttcattg 2220ttcaggctga gattgggcag aaggattcag acagcaagct tgtagagaac tgtgtttgcc 2280ttcttcggaa cttatcatat caagttcacc gggagatccc acaggcagag cgttaccaag 2340aggcagctcc caatgttgcc aacaatactg ggccacatgc tgccagttgc tttggggcca 2400agaagggcaa agggaaaaaa cctatagagg atccagcaaa cgatacagtg gatttcccta 2460aaagaacgag tccagctcga ggctatgagc tcttatttca gccagaggtg gttcggatat 2520acatctcact tcttaaggag agcaagactc ctgccatcct agaagcctca gctggagcta 2580tccagaactt gtgtgctggg cgctggacgt atggtcgata catccgctct gctctgcgtc 2640aagagaaggc tctttctgcc atagctgacc tcctgactaa tgaacatgaa cgggtggtga 2700aagctgcatc tggagcactg agaaacctgg ctgtggatgc tcgcaacaaa gaattaattg 2760gtaaacatgc tattcctaac ttggtaaaga atctgccagg aggacagcag aactcctctt 2820ggaatttctc tgaggacact gtcatctcta ttttgaacac tatcaacgag gttatcgctg 2880agaacttgga ggctgccaaa aagcttcgag agacacaggg tattgagaag ctggtgttga 2940tcaacaaatc agggaaccgc tcagaaaaag aagttcgagc agcagcactt gtattacaga 3000caatctgggg atataaggaa ctgcggaagc cactggaaaa agaaggatgg aagaaatcag 3060actttcaggt gaatctaaac aatgcttccc gaagccagag cagtcattca tatgatgata 3120gtactctccc tctcattgac cggaaccaaa aatcagataa caactattcc acaccaaatg 3180agagaggaga ccacaataga acactggatc gatcggggga tctaggcgac atggagccat 3240tgaagggaac aacacccttg atgcaggacg aggggcagga atctctggag gaagagttgg 3300atgtgttggt tttggatgat gaggggggcc aagtgtctta cccctccatg cagaagattt 3360agcaccacta tctccgttcc atctgggctt atatgtactt ttattttttg gtggtgaaat 3420tgactgatga ttttcctttt tcttcgctgg actattgtgc caactgccag gctgcctcct 3480gcccttacag ccctaagtgg ctgccttctt tccatcaact cccaacttct tcctgtgaag 3540tttaattgtc tcaacgcctc cccctccccc attccctcca tttttctccc aagaaacctg 3600actcaattat ttgcatattt tgagaaactg ctgcagatta gttctttttg ccagttttcc 3660ctggaactcc tggccttttg tggaggggag ggatggagag aataggaatc ttcactagaa 3720gccgtgggaa gaattggaag ttacatgctg tatatgcaat gtccagcagt ctgataaact 3780gacgattctt aatcaagatt tttttcctga tggggaaggg acttttattt tcttttagag 3840aggggaaagt gtgagctctt cccttattcc taatggctat ttttgaagca aagaaggcca 3900gcaacattgg cacatgccac ctggcaaagg acccttgagt aagtgaaggt ctcctaaaac 3960tgggattaag aaaccttgct ctcctcatct ccaaggcagg gaccatcaag aacctacaga 4020ctccatctct tctgcaagcc tcatgccaac cctgggctat tgctgctgcc ccttaaacac 4080aggctgtcct taacccacct ctcctgccct gtgatatgtc tgctgagttg gcctggccat 4140ttccaagagg ctgtagaaag gggagaatgt caaggaagac ttttggtaga gaaggagcag 4200aaagatgtgt ttttgggaag aagaagacct ctaggaggag ctagtaggaa tgtacatgaa 4260gcaattagtc tgaaactggc ttccccactc ccccgtttct ccttttccta tccttatagg 4320cctgtccctt gcctctgccc tggattggtt ggcaaactat aggacttgat gtacataact 4380cctgtccctt ttcccttaca aggtggggat tgcccctggc tttgcctctt ctttgtgcct 4440ttggcctggg gtgcatctcc tcccgccctt ccatgtgcct ttctttgcct ctgcagtctc 4500atttctcata attttgcaaa ttatattttg ttgctttctt acctactatt ggccctaaat 4560agcagaaaga agagaagtga ccgagagaac ctcagattct tcattgagga ttggtatagc 4620catgatttca gtcatagcaa gcttttgctc aacagcatat gggtgggatt tggcaaaaat 4680cctattctga tgaatctcaa agtaaggctg gtaagagaag tgagtggtgt gactcttact 4740ccttaggtgc ccagaattta ccatcatctc tgaaggagtt acagggaagt ggtctcccca 4800attctcccct ccctccagta ttgccccctc tcactttagc atatattaat tagcaggttg 4860ggctagagaa atcagctgct atgcgggttg attattatta ttatttctaa tccttttcct 4920tatttgcctt ctactcccct taatctaatc taaaagctct gttccatgca actggagttc 4980cttatccctc tcttcccctt cccttatata ttgaggctat ggggtaggag aaaagtgcac 5040aacccaccac cccctctact cgtgcattaa aatttcttat ttaccctttt cccccttccc 5100atttcttccc actttcatct accttttctg gcaaaaagga gccttttgct ctctgtgacc 5160ctaagagcac actgcacagg gaaaattgcc ccatccagac ctggctccac tcttgatctc 5220tcttgtcctc ttctgctctt ttcctggtgc tcttttttct cggtggggtg tgggtaatag 5280aacagccgtg ggcttttggg gacctttaac ttttttttct ctcttttgtt tataaaaaac 5340actaaacatt caattccaga gaacccaaaa tcccaccttc ccaccgaaca ctactaaggg 5400gcttgtgttc tgctccatac cttttctctt ttctttctgt cttgttaatg cttttaaaaa 5460caaatgagtt ttttatataa ataaagtttt taaagtgtgt atgtgggggg tctgtgtcat 5520ttcttcactt caagctgtta tttcttccct gctttgcatc tttgttactt ccttatgtat 5580cagtgtcctt tccagagcaa ccagaaggag gttataccag gatttatttt gagctcagcc 5640ccaactcttt atcaagcaac attcttgtta actatatgtg aaacattttt tcttctgaag 5700attcttaaaa attgaatgtg gctgaagttg aacatgggag cttattgcta atttagagat 5760aggaaactga agcataaaga attaatgact tactttaatt actggaattc ttctgcaaca 5820tttgacaaaa ctaaccttga ataaggccca ctgtaatacg tagctctctt aaatataaca 5880cttaggacta gaagattaga aactaccaat cccaactacg taataggaaa atgtaggatc 5940aaaaggccca tgtatataag tactgaccac tgggccataa tgttgcttct caggctatat 6000gcagtccttt agtcagaagt caataggcct atttattaat attttacaga ccatattacc 6060tggattacca gggactatct ttgctgcaga gatcaagggt taagatctat gggaagatac 6120ttatttttct gaggtcctta tgtcctgtca tataattaaa gactcaagag aatttatgtg 6180aaatgctttc tgtatgcccc aatctttaga ttaaaattat atacctgctc ct 6232141965DNAHomo sapiens 14gtctggttct ctctctccag aaggttctgc cggttccccc agctctgggt acccggctct 60gcatcgcgtc gccatgatgg gccatcgtcc agtgctcgtg ctcagccaga acacaaagcg 120tgaatccgga agaaaagttc aatctggaaa catcaatgct gccaagacta ttgcagatat 180catccgaaca tgtttgggac ccaagtccat gatgaagatg cttttggacc caatgggagg 240cattgtgatg accaatgatg gcaatgccat tcttcgagag attcaagtcc agcatccagc 300ggccaagtcc atgatcgaaa ttagccggac ccaggatgaa gaggttggag atgggaccac 360atcagtaatt attcttgcag gggaaatgct gtctgtagct gagcacttcc tggagcagca 420gatgcaccca acagtggtga tcagtgctta ccgcaaggca ttggatgata tgatcagcac 480cctaaagaaa ataagtatcc cagtcgacat cagtgacagt gatatgatgc tgaacatcat 540caacagctct attactacca aagccatcag tcggtggtca tctttggctt gcaacattgc 600cctggatgct gtcaagatgg tacagtttga ggagaatggt cggaaagaga ttgacataaa 660aaaatatgca agagtggaaa agatacctgg aggcatcatt gaagactcct gtgtcttgcg 720tggagtcatg attaacaagg atgtgaccca tccacgtatg cggcgctata tcaagaaccc 780tcgcattgtg ctgctggatt cttctctgga atacaagaaa ggagaaagcc agactgacat 840tgagattaca cgagaggagg acttcacccg aattctccag atggaggaag agtacatcca 900gcagctctgt gaggacatta tccaactgaa gcccgatgtg gtcatcactg aaaagggcat 960ctcagattta gctcagcact accttatgcg ggccaatatc acagccatcc gcagagtccg 1020gaagacagac aataatcgca ttgctagagc ctgtggggcc cggatagtca gccgaccaga 1080ggaactgaga gaagatgatg ttggaacagg agcaggcctg ttggaaatca agaaaattgg 1140agatgaatac tttactttca tcactgactg caaagacccc aaggcctgca ccattctcct 1200ccggggggct agcaaagaga ttctctcgga agtagaacgc aacctccagg atgccatgca 1260agtgtgtcgc aatgttctcc tggaccctca gctggtgcca gggggtgggg cctccgagat 1320ggctgtggcc catgccttga cagaaaaatc caaggccatg actggtgtgg aacaatggcc 1380atacagggct gttgcccagg ccctagaggt cattcctcgt accctgatcc agaactgtgg 1440ggccagcacc atccgtctac ttacctccct tcgggccaag cacacccagg agaactgtga 1500gacctggggt gtaaatggtg agacgggtac tttggtggac atgaaggaac tgggcatatg 1560ggagccattg gctgtgaagc tgcagactta taagacagca gtggagacgg cagttctgct 1620actgcgaatt gatgacatcg tttcaggcca caaaaagaaa ggcgatgacc agagccggca 1680aggcggggct cctgatgctg gccaggagtg agtgctaggc aaggctactt caatgcacag 1740aaccagcaga gtctcccctt ttcctgagcc agagtgccag gaacactgtg gacgtctttg 1800ttcagaaggg atcaggttgg ggggcagccc ccagtccctt tctgtcccag ctcagttttc 1860caaaagacac tgacatgtaa ttcttctcta ttgtaaggtt tccatttagt ttgcttccga 1920tgattaaatc taagtcattt gaaaaaaaaa aaaaaaaaaa aaaaa 1965153454DNAHomo sapiens 15cgccaaagga aaagcccctt ggatgagagg caggcgcttc agagaagcta agaaaagcac 60ctctccgcgc gccccacctc ctccgcctcg cgctcctcct gagcagcggg cccagactgc 120gctccggccg cggccctcgc cccgcggagc cctcctaccc cggcccgacg ctcggcccgc 180gacctgcccc gagccctctc catggaggca gcccgcccct ccggctcctg gaacggagcc 240ctctgccggc tgctcctgct gaccctcgcg atcttaatat ttgccagtga tgcctgcaaa 300aatgtgacat tacatgttcc ctccaaacta gatgccgaga aacttgttgg tagagttaac 360ctgaaagagt gctttacagc tgcaaatcta attcattcaa gtgatcctga cttccaaatt 420ttggaggatg gttcagtcta tacaacaaat actattctat tgtcctcgga gaagagaagt 480tttaccatat tactttccaa cactgagaac caagaaaaga agaaaatatt tgtctttttg 540gagcatcaaa caaaggtcct aaagaaaaga catactaaag aaaaagttct aaggcgcgcc 600aagagaagat gggctccaat tccttgttcg atgctagaaa actccttggg tccttttcca 660cttttccttc aacaggttca atctgacacg gcccaaaact ataccatata ctattccata 720agaggtcctg gagttgacca agaacctcgg aatttatttt

atgtggagag agacactgga 780aacttgtatt gtactcgtcc tgtagatcgt gagcagtatg aatcttttga gataattgcc 840tttgcaacaa ctccagatgg gtatactcca gaacttccac tgcccctaat aatcaaaata 900gaggatgaaa atgataacta cccaattttt acagaagaaa cttatacttt tacaattttt 960gaaaattgca gagtgggcac tactgtggga caagtgtgtg ctactgacaa agatgagcct 1020gacacgatgc acacacgcct gaagtactcc atcattgggc aggtgccacc atcacccacc 1080ctattttcta tgcatccaac tacaggcgtg atcaccacaa catcatctca gctagacaga 1140gagttaattg acaagtacca gttgaaaata aaagtacaag acatggatgg tcagtatttt 1200ggtctacaga caacttcaac ttgtatcatt aacattgatg atgtaaatga ccacttgcca 1260acatttactc gtacttctta tgtgacatca gtggaagaaa atacagttga tgtggaaatc 1320ttacgagtta ctgttgagga taaggactta gtgaatactg ctaactggag agctaattat 1380accattttaa agggcaatga aaatggcaat tttaaaattg taacagatgc caaaaccaat 1440gaaggagttc tttgtgtagt taagcctttg aattatgaag aaaagcaaca gatgatcttg 1500caaattggtg tagttaatga agctccattt tccagagagg ctagtccaag atcagccatg 1560agcacagcaa cagttactgt taatgtagaa gatcaggatg agggccctga gtgtaaccct 1620ccaatacaga ctgttcgcat gaaagaaaat gcagaagtgg gaacaacaag caatggatat 1680aaagcatatg acccagaaac aagaagtagc agtggcataa ggtataagaa attaactgat 1740ccaacagggt gggtcaccat tgatgaaaat acaggatcaa tcaaagtttt cagaagcctg 1800gatagagagg cagagaccat caaaaatggc atatataata ttacagtcct tgcatcagac 1860caaggaggga gaacatgtac ggggacactg ggcattatac ttcaagacgt gaatgataac 1920agcccattca tacctaaaaa gacagtgatc atctgcaaac ccaccatgtc atctgcggag 1980attgttgcgg ttgatcctga tgagcctatc catggcccac cctttgactt tagtctggag 2040agttctactt cagaagtaca gagaatgtgg agactgaaag caattaatga tacagcagca 2100cgtctttcct atcagaatga tcctccattt ggctcatatg tagtacctat aacagtgaga 2160gatagacttg gcatgtctag tgtcacttca ttggatgtta cactgtgtga ctgcattacc 2220gaaaatgact gcacacatcg tgtagatcca aggattggcg gtggaggagt acaacttgga 2280aagtgggcca tccttgcaat attgttgggc atagcattgc tcttttgcat cctgtttacg 2340ctggtctgtg gggcttctgg gacgtctaaa caaccaaaag taattcctga tgatttagcc 2400cagcagaacc taattgtatc aaacacagaa gctcctggag atgacaaagt gtattctgcg 2460aatggcttca caacccaaac tgtgggcgct tctgctcagg gagtttgtgg caccgtggga 2520tcaggaatca aaaacggagg tcaggagacc atcgaaatgg tgaaaggagg acaccagacc 2580tcggaatcct gccggggggc tggccaccat cacaccctgg actcctgcag gggaggacac 2640acggaggtgg acaactgcag atacacttac tcggagtggc acagttttac tcagccccgt 2700cttggtgaaa aagtgtatct gtgtaatcaa gatgaaaatc acaagcatgc ccaagactat 2760gtcctgacat ataactatga aggaagagga tcggtggctg ggtctgtagg ttgttgcagt 2820gaacgacaag aagaagatgg gcttgaattt ttggataatt tggagcccaa atttaggaca 2880ctagcagaag catgcatgaa gagatgagtg tgttctaata agtctctgaa agccagtggc 2940tttatgactt ttaaaaaaaa ttacaaacca agaatttttt aaagcagaag atgctatttg 3000tgggggtttt tctctcatta tttggatgga atctctttgg tcaaatgcac atttacagag 3060agacactata aacaagtaca caaatttttc aatttttaca tatttttaaa ttacttatct 3120tctatccaag gaggtctaca gagaaattaa agtctgcctt atttgttaca tttgggtata 3180atgacaacag ccaatttata gtgcaataaa atgtaattaa ttcaagtcct tattatagac 3240tatttgaagc acaacctaat ggaaaattgt agagaccttg ctttaacatt atctccagtt 3300aattaagtgt tcatgtggtg cttggaaact gttgttttcc tgaacatcta aagtgtgtag 3360actgcattct tgctattatt ttattcttgt aatgtgacct tttcactgtg caaagggaga 3420tttctagcca ggcattgact attacaattt catt 345416619DNAHomo sapiens 16agcagttcta agggaccata cagagtattc ctctcttcac accaggacca gccactgttg 60cagcatgagt tcccagcagc agaagcagcc ctgcatccca ccccctcagc ttcagcagca 120gcaggtgaaa cagccttgcc agcctccacc tcaggaacca tgcatcccca aaaccaagga 180gccctgccac cccaaggtgc ctgagccctg ccaccccaaa gtgcctgagc cctgccagcc 240caagcttcca gagccatgcc accccaaggt gcctgagccc tgcccttcaa tagtcactcc 300agcaccagcc cagcagaaga ccaagcagaa gtaatgtggt ccacagccat gcccttgagg 360agccggccac cagatgctga atcccctatc ccattctgtg tatgagtccc atttgccttg 420caattagcat tctgtctccc ccaaaaaaga atgtgctatg aagctttctt tcctacacac 480tctgagtctc tgaatgaagc tgaaggtctt agtaccagag ctagttttca gctgctcaga 540attcatctga agagagactt aagatgaaag caaatgattc agctccctta tacccccatt 600aaattcactt tcaattcca 619173528DNAHomo sapiens 17agccaaggac tctggagccg ccgccgccgc tgctgcggtt catatccgga gtagacggag 60ccgcagtaga cggatccgcg gctgcaccaa accactgccc ctcggagcct ggtagtgggc 120cacaagcccc cagtcccaga ggcgtggtgg gtcgggcaga gtcggaagaa ctggctttct 180agctggaaga tgcggaaggg gagcgactag gccgcttgcg tctgggcctg gcagaaggga 240ccggattttc tggcatcctt aaatcttgtg tcaaggattg gttataatat aaccagaaac 300catgacggcg gctgagaacg tatgctacac gttaattaac gtgccaatgg attcagaacc 360accatctgaa attagcttaa aaaatgatct agaaaaagga gatgtaaagt caaagactga 420agctttgaag aaagtaatca ttatgattct gaatggtgaa aaacttcctg gacttctgat 480gaccatcatt cgttttgtgc tacctcttca ggatcacact atcaagaaat tacttctggt 540attttgggaa attgttccta aaacaactcc agatgggaga cttttacatg agatgatcct 600tgtatgtgat gcatacagaa aggatcttca acatcctaat gaatttattc gaggatctac 660tcttcgtttt ctttgcaaat tgaaagaagc agaattgcta gaacctttaa tgccagctat 720tcgtgcatgt ttggagcatc gacacagcta tgttagaaga aatgctgttt tggccatcta 780taccatctat agaaattttg aacatcttat acctgatgct cctgaactga tacatgattt 840tctggtgaat gagaaggatg caagttgcaa aaggaatgca tttatgatgc taattcatgc 900agatcaggat cgagctttgg attacttaag tacttgcatt gatcaagttc aaacatttgg 960agacattctg cagctggtta ttgttgaact gatttataag gtctgtcatg ctaatccatc 1020agaaagagct cgttttattc gctgcatcta taacttatta cagtcatcca gccctgctgt 1080aaaatatgaa gctgctggga cattagtgac actctctagt gcaccaactg caatcaaggc 1140tgctgctcag tgttacattg atttaattat taaggagagc gacaacaatg taaaactcat 1200agttttggat cgcttgatag aattaaaaga gcatcctgct catgaacgag tactacagga 1260tctggttatg gatatcctaa gagtattgag cacaccagac ttagaagtac gaaagaaaac 1320tctgcagtta gcactggatc ttgtctcttc tagaaatgtt gaagagctgg ttattgtcct 1380gaagaaggaa gtgataaaaa caaataatgt gtctgagcat gaagatactg acaaatacag 1440acaactccta gtgcgaacat tgcattcctg ttctgtccga tttccagata tggctgcaaa 1500tgttattcct gtgttaatgg aatttctcag tgacaacaac gaagcagcag ctgctgatgt 1560cttggagttt gttcgtgaag ccattcagcg ctttgataac ctgagaatgc ttattgttga 1620gaagatgctt gaagtctttc atgctattaa atctgtcaag atttaccgag gagcattatg 1680gatcctggga gaatactgta gtaccaagga agacattcag agtgtgatga ctgagatccg 1740caggtccctt ggagagatcc caattgtaga gtcagaaata aagaaagaag ctggtgaatt 1800aaaacctgaa gaagaaataa ctgtagggcc agttcagaaa ttggttactg aaatgggtac 1860ctatgcaact cagagtgccc ttagcagttc tagacccacc aagaaagagg aagacagacc 1920tcccttgaga ggattccttc tggatggaga tttctttgtt gctgcctccc ttgccacaac 1980tctgaccaag attgcattgc gctatgtagc tttggttcag gagaagaaaa agcaaaattc 2040ttttgttgct gaggctatgt tgctcatggc tactatcctg catttgggaa aatcctctct 2100tcctaagaag ccaattactg atgatgatgt ggatcgaatt tccctgtgcc tcaaggtctt 2160gtctgaatgt tcacctttaa tgaatgacat tttcaataag gaatgcagac agtccctttc 2220tcacatgtta tctgctaaac tagaagaaga gaaattatcc caaaagaaag aatctgaaaa 2280gaggaatgtg acagtacagc ctgatgaccc catttccttc atgcaactaa ctgctaagaa 2340tgaaatgaac tgcaaggaag atcagtttca gctgagttta ctggcagcaa tgggtaacac 2400acagaggaaa gaggcagcag atcccctagc atctaaactt aacaaggtca cccaattgac 2460aggtttctca gatcctgtat atgcagaagc ttacgttcat gtcaaccaat atgatattgt 2520cctggatgta cttgttgtga accaaaccag tgatactttg cagaattgca cattagaact 2580agctacacta ggggatctga aacttgtgga aaagccgtct cctttgactc ttgctcctca 2640tgacttcgca aatattaaag ctaacgtcaa agtagcatca acagaaaatg gaataatttt 2700tggtaatata gtttatgatg tctctggagc agcaagtgac agaaattgtg tggttctcag 2760tgatattcac atcgacatca tggactatat ccagcctgca acttgcactg atgcagaatt 2820ccgtcagatg tgggccgaat ttgaatggga aaacaaagtg acagttaaca ccaacatggt 2880tgatttaaat gactacttac agcacatatt aaagtcaacc aatatgaaat gcctgactcc 2940agaaaaggcc ctttctggtt actgtggctt tatggcagcc aacctttatg ctcgttccat 3000atttggtgaa gatgcacttg caaatgtcag cattgagaag ccaattcacc agggaccaga 3060tgctgctgtt accggccata taagaattcg tgcaaagagc cagggaatgg ccttaagtct 3120tggagataaa atcaacttgt cacagaagaa aactagtata taaaaataaa caaaaagtcc 3180ttgaagcttt acagttaatt taggtatggg cttactggac tccaacatct tttgtactct 3240ttcatgctta tatagaatct gagttcatgc tgaatacttt tcagccaata atttatagcc 3300tttcccttaa atcaagattg agtttaaaat tatagtttgt cttttgtctt aacagttctg 3360aatgctgtcc tcaaagtata taatgtttca tgtaccaaga cccttttcac agtacaataa 3420acagatctat tcataaattt ttgttatttt ataaataaat gattacataa ttttagttat 3480aaaaaaaaaa aaaaaaaaaa agaaaaaaaa aaaaaaaaaa aaaaaaaa 3528181447DNAHomo sapiens 18tgtcactgag ggttgactga ctggagagct caagtgcagc aaagagaagt gtcagagcat 60gagcgccaag tccagaacca tagggattat tggagctcct ttctcaaagg gacagccacg 120aggaggggtg gaagaaggcc ctacagtatt gagaaaggct ggtctgcttg agaaacttaa 180agaacaagag tgtgatgtga aggattatgg ggacctgccc tttgctgaca tccctaatga 240cagtcccttt caaattgtga agaatccaag gtctgtggga aaagcaagcg agcagctggc 300tggcaaggtg gcagaagtca agaagaacgg aagaatcagc ctggtgctgg gcggagacca 360cagtttggca attggaagca tctctggcca tgccagggtc caccctgatc ttggagtcat 420ctgggtggat gctcacactg atatcaacac tccactgaca accacaagtg gaaacttgca 480tggacaacct gtatctttcc tcctgaagga actaaaagga aagattcccg atgtgccagg 540attctcctgg gtgactccct gtatatctgc caaggatatt gtgtatattg gcttgagaga 600cgtggaccct ggggaacact acattttgaa aactctaggc attaaatact tttcaatgac 660tgaagtggac agactaggaa ttggcaaggt gatggaagaa acactcagct atctactagg 720aagaaagaaa aggccaattc atctaagttt tgatgttgac ggactggacc catctttcac 780accagctact ggcacaccag tcgtgggagg tctgacatac agagaaggtc tctacatcac 840agaagaaatc tacaaaacag ggctactctc aggattagat ataatggaag tgaacccatc 900cctggggaag acaccagaag aagtaactcg aacagtgaac acagcagttg caataacctt 960ggcttgtttc ggacttgctc gggagggtaa tcacaagcct attgactacc ttaacccacc 1020taagtaaatg tggaaacatc cgatataaat ctcatagtta atggcataat tagaaagcta 1080atcattttct taagcataga gttatccttc taaagacttg ttctttcaga aaaatgtttt 1140tccaattagt ataaactcta caaattccct cttggtgtaa aattcaagat gtggaaattc 1200taactttttt gaaatttaaa agcttatatt ttctaacttg gcaaaagact tatccttaga 1260aagagaagtg tacattgatt tccaattaaa aatttgctgg cattaaaaat aagcacactt 1320acataagccc ccatacatag agtgggactc ttggaatcag gagacaaagc taccacatgt 1380ggaaaggtac tatgtgtcca tgtcattcaa aaaatgtgat tttttataat aaactcttta 1440taacaag 1447193916DNAHomo sapiens 19gcttggggcc gccatcttgg caagaggcga agcggcagcg gttcctgtca agggggcagc 60aggtccagag ctgctggtgc tcccgttccc cagaccctac ccctatcccc agtggagccg 120gagtgcgggc gcgccccacc accgccctca ccatggtgct gttggcagca gcggtctgca 180caaaagcagg aaaggctatt gtttctcgac agtttgtgga aatgacccga actcggattg 240agggcttatt agcagctttt ccaaagctca tgaacactgg aaaacaacat acgtttgttg 300aaacagagag tgtaagatat gtctaccagc ctatggagaa actgtatatg gtactgatca 360ctaccaaaaa cagcaacatt ttagaagatt tggagaccct aaggctcttc tcaagagtga 420tccctgaata ttgccgagcc ttagaagaga atgaaatatc tgagcactgt tttgatttga 480tttttgcttt tgatgaaatt gtcgcactgg gataccggga gaatgttaac ttggcacaga 540tcagaacctt cacagaaatg gattctcatg aggagaaggt gttcagagcc gtcagagaga 600ctcaagaacg tgaagctaag gctgagatgc gtcgtaaagc aaaggaatta caacaggccc 660gaagagatgc agagagacag ggcaaaaaag caccaggatt tggcggattt ggcagctctg 720cagtatctgg aggcagcaca gctgccatga tcacagagac catcattgaa actgataaac 780caaaagtggc acctgcacca gccaggcctt caggccccag caaggcttta aaacttggag 840ccaaaggaaa ggaagtagat aactttgtgg acaaattaaa atctgaaggt gaaaccatca 900tgtcctctag tatgggcaag cgtacttctg aagcaaccaa aatgcatgct ccacccatta 960atatggaaag tgtacatatg aagattgaag aaaagataac attaacctgt ggacgagacg 1020gaggattaca gaatatggag ttgcatggca tgatcatgct taggatctca gatgacaagt 1080atggccgaat tcgtcttcat gtggaaaatg aagataagaa aggggtgcag ctacagaccc 1140atccaaatgt ggataaaaaa cttttcactg cagagtctct aattggcctg aagaatccag 1200agaagtcatt tccagtcaac agtgacgtag gggtgctaaa gtggagacta caaaccacag 1260aggaatcttt tattccactg acaattaatt gctggccctc ggagagtgga aatggctgtg 1320atgtcaacat agaatatgag ctacaagaag ataatttaga actgaatgat gtggttatca 1380ccatcccact cccgtctggt gtcggcgcgc ctgttatcgg tgagatcgat ggggagtatc 1440gacatgacag tcgacgaaat accctggagt ggtgcctgcc tgtgattgat gccaaaaata 1500agagtggcag cctggagttt agcattgctg ggcagcccaa tgacttcttc cctgttcaag 1560tttcctttgt ctccaagaaa aattactgta acatacaggt taccaaagtg acccaggtag 1620atggaaacag ccccgtcagg ttttccacag agaccacttt cctagtggat aagtatgaaa 1680ttctgtaata ccaagaagag ggagctgaaa aggaaaattt tcagattaat aaagaagacg 1740ccaatgatgg ctgaagagtt tttcccagat ttacaagcca ctggagaccc cttttttctg 1800atacaatgca cgattctctg cgcgcaagga ccctcgactc acccccatgt ttcagtgtca 1860cagagacatt ctttgataag gaaatggcac aaacataaag ggaaaggctg ctaattttct 1920ttggcagatt gtattggcca gcaggaaagc aagctctcca gagaatgccc ccagttaaat 1980acctcctcta cctttaccta agttgctcct ttatttttat tttattatta ttattattat 2040tattattttt tgagatggag tctcactttg taacccaggc tggaatgcaa tggcatgatc 2100tcagctcact gcaacctccg cctcctgggt tcaagcaagt ctcctgcctc agcctccgag 2160tagctgggac tacaggtgca cgccaccacg cctggctaat tttttgtatt ttagtagaga 2220cggggtttca ccgtgttgcc caggctggtc gcgaactcct gagctcaggc aatccgccca 2280cctcagcctc ccaaagtgtt gggattacag gcatgagcca ccatgcccag ctgctccttt 2340attttaatcc ctaaatataa tccctaaata tagttatatt tcatacttag tttgttttta 2400aaaagttttc tctgtagaaa attttaatca ttcataccct ttacctttag gtttttcttt 2460ctatacattc agtcaggcac tgggatcatc tgtttacagg cattatattt atttggcact 2520cctggaacaa gtatatctaa cccattcttg atttttggac tattcaggtg aactatttga 2580ggggtatggg gtctagaagt taaaagatac gcatgtcttc tgttcttttc ccgtatcaat 2640tcattccttc atctctttgc caagttgttt tcctttcagg gcctgtcctt ccagtttaga 2700acagtaccat gaatcccact tgtgtcaata ttaaagatag ctgagaagca cctttcaaat 2760ggcacagtcc ctcttcaaga tgtctaaaag aatggttatg tctgtccagt tagggatttc 2820acatccacat gtaatcatgt ctgctgctgt tgctacccaa attttcattt ctccacattt 2880tgggtactta agctaaaacg taatggccac agtctgtaat ccattcacat tcctcagttt 2940caccacctcc ctcttccaga ctgcactctc tgtcatcagt cccctccttt ctaacagaaa 3000tggggttatg attttgaagg ctgtgggttc agggagtctt tgccaatcct gttggcccta 3060aactatcaag gaggctccat ttcaccattt gattttttgc atttcaggag gcaactgatt 3120gtttcgatat gtacatatta ctcacgtata ccccatttcc ttccagtcag cccaacattt 3180tccaccagtc tgtccccatc tctgaaatcc ttccttctct ttccccctaa gtcttttgag 3240tgtcatcatg tactggtggt ttctcggttc catctcatcc atttcctttt caatggagac 3300tacagcgtca gccagctcag ccttggcttt taactcaata ttccagtcca taggggtggt 3360taaaagttgc tgcaaggctg caggcactgg cagtgggaag aggcagacga ctagatgact 3420tctgcacttt tagctggttg aaaagtacca ctcccactct gaacatctgg ccgtccctgc 3480aaagagtgta ctgtgcttga agcagagcac tcacacataa atggctgtgt gtggaattgc 3540ttgccaaaga agtttctagc ctttcccttt cccctaactg catcagggaa gaattcttat 3600ctctagcttg gtttccacat gaggtttttc tgagaagggc ttgggacaag aagtctgtca 3660tgttagttaa gcaggcaaga aatcctacta atccagtttt gtttgaaagt tgtttgtccg 3720tatgattttt taaaagtcaa gtttaatttc aaaaaacctt ttttttctga gattactttt 3780ggggtaatat ttaaaatgag agacattttg taaccctgta aaatacatag ggaatataac 3840attccagtgt atacaaagaa ggcaaattct ttaatcaaat aaagcgtatt ataaaatgag 3900aaaaaaaaaa aaaaaa 3916202280DNAHomo sapiens 20tccagccaga aggatggggt ggctcccact cctgctgctt ctgactcaat gcttaggggt 60ccctgggcag cgctcgccat tgaatgactt ccaagtgctc cggggcacag agctacagca 120cctgctacat gcggtggtgc ccgggccttg gcaggaggat gtggcagatg ctgaagagtg 180tgctggtcgc tgtgggccct taatggactg ccgggccttc cactacaacg tgagcagcca 240tggttgccaa ctgctgccat ggactcaaca ctcgccccac acgaggctgc ggcgttctgg 300gcgctgtgac ctcttccaga agaaagacta cgtacggacc tgcatcatga acaatggggt 360tgggtaccgg ggcaccatgg ccacgaccgt gggtggcctg ccctgccagg cttggagcca 420caagttcccg aatgatcaca agtacacgcc cactctccgg aatggcctgg aagagaactt 480ctgccgtaac cctgatggcg accccggagg tccttggtgc tacacaacag accctgctgt 540gcgcttccag agctgcggca tcaaatcctg ccgggaggcc gcgtgtgtct ggtgcaatgg 600cgaggaatac cgcggcgcgg tagaccgcac ggagtcaggg cgcgagtgcc agcgctggga 660tcttcagcac ccgcaccagc accccttcga gccgggcaag ttcctcgacc aaggtctgga 720cgacaactat tgccggaatc ctgacggctc cgagcggcca tggtgctaca ctacggatcc 780gcagatcgag cgagagttct gtgacctccc ccgctgcggg tccgaggcac agccccgcca 840agaggccaca actgtcagct gcttccgcgg gaagggtgag ggctaccggg gcacagccaa 900taccaccact gcgggcgtac cttgccagcg ttgggacgcg caaatccctc atcagcaccg 960atttacgcca gaaaaatacg cgtgcaaaga ccttcgggag aacttctgcc ggaaccccga 1020cggctcagag gcgccctggt gcttcacact gcggcccggc atgcgcgcgg ccttttgcta 1080ccagatccgg cgttgtacag acgacgtgcg gccccaggac tgctaccacg gcgcagggga 1140gcagtaccgc ggcacggtca gcaagacccg caagggtgtc cagtgccagc gctggtccgc 1200tgagacgccg cacaagccgc agttcacgtt tacctccgaa ccgcatgcac aactggagga 1260gaacttctgc cggaacccag atggggatag ccatgggccc tggtgctaca cgatggaccc 1320aaggacccca ttcgactact gtgccctgcg acgctgcgct gatgaccagc cgccatcaat 1380cctggacccc ccagaccagg tgcagtttga gaagtgtggc aagagggtgg atcggctgga 1440tcagcggcgt tccaagctgc gcgtggttgg gggccatccg ggcaactcac cctggacagt 1500cagcttgcgg aatcggcagg gccagcattt ctgcgggggg tctctagtga aggagcagtg 1560gatactgact gcccggcagt gcttctcctc ctgccatatg cctctcacgg gctatgaggt 1620atggttgggc accctgttcc agaacccaca gcatggagag ccaagcctac agcgggtccc 1680agtagccaag atggtgtgtg ggccctcagg ctcccagctt gtcctgctca agctggagag 1740atctgtgacc ctgaaccagc gtgtggccct gatctgcctg ccccctgaat ggtatgtggt 1800gcctccaggg accaagtgtg agattgcagg ctggggtgag accaaaggta cgggtaatga 1860cacagtccta aatgtggcct tgctgaatgt catctctaac caggagtgta acatcaagca 1920ccgaggacgt gtgcgggaga gtgagatgtg cactgaggga ctgttggccc ctgtgggggc 1980ctgtgagggt gactacgggg gcccacttgc ctgctttacc cacaactgct gggtcctgga 2040aggaattata atccccaacc gagtatgcgc aaggtcccgc tggccagctg tcttcacgcg 2100tgtctctgtg tttgtggact ggattcacaa ggtcatgaga ctgggttagg cccagccttg 2160atgccatatg ccttggggag gacaaaactt cttgtcagac ataaagccat gtttcctctt 2220taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaataaaaaa aaaaaaaaaa aaaaaaaaaa 2280212876DNAHomo sapiens 21gaattcctgc agctcagcag ccgccgccag agcaggacga accgccaatc gcaaggcacc 60tctgagaact tcaggatgca gatgtctcca gccctcacct gcctagtcct gggcctggcc 120cttgtctttg gtgaagggtc tgctgtgcac catcccccat cctacgtggc ccacctggcc 180tcagacttcg gggtgagggt gtttcagcag gtggcgcagg cctccaagga ccgcaacgtg 240gttttctcac cctatggggt ggcctcggtg ttggccatgc

tccagctgac aacaggagga 300gaaacccagc agcagattca agcagctatg ggattcaaga ttgatgacaa gggcatggcc 360cccgccctcc ggcatctgta caaggagctc atggggccat ggaacaagga tgagatcagc 420accacagacg cgatcttcgt ccagcgggat ctgaagctgg tccagggctt catgccccac 480ttcttcaggc tgttccggag cacggtcaag caagtggact tttcagaggt ggagagagcc 540agattcatca tcaatgactg ggtgaagaca cacacaaaag gtatgatcag caacttgctt 600gggaaaggag ccgtggacca gctgacacgg ctggtgctgg tgaatgccct ctacttcaac 660ggccagtgga agactccctt ccccgactcc agcacccacc gccgcctctt ccacaaatca 720gacggcagca ctgtctctgt gcccatgatg gctcagacca acaagttcaa ctatactgag 780ttcaccacgc ccgatggcca ttactacgac atcctggaac tgccctacca cggggacacc 840ctcagcatgt tcattgctgc cccttatgaa aaagaggtgc ctctctctgc cctcaccaac 900attctgagtg cccagctcat cagccactgg aaaggcaaca tgaccaggct gccccgcctc 960ctggttctgc ccaagttctc cctggagact gaagtcgacc tcaggaagcc cctagagaac 1020ctgggaatga ccgacatgtt cagacagttt caggctgact tcacgagtct ttcagaccaa 1080gagcctctcc acgtcgcgca ggcgctgcag aaagtgaaga tcgaggtgaa cgagagtggc 1140acggtggcct cctcatccac agctgtcata gtctcagccc gcatggcccc cgaggagatc 1200atcatggaca gacccttcct ctttgtggtc cggcacaacc ccacaggaac agtccttttc 1260atgggccaag tgatggaacc ctgaccctgg ggaaagacgc cttcatctgg gacaaaactg 1320gagatgcatc gggaaagaag aaactccgaa gaaaagaatt ttagtgttaa tgactctttc 1380tgaaggaaga gaagacattt gccttttgtt aaaagatggt aaaccagatc tgtctccaag 1440accttggcct ctccttggag gacctttagg tcaaactccc tagtctccac ctgagaccct 1500gggagagaag tttgaagcac aactccctta aggtctccaa accagacggt gacgcctgcg 1560ggaccatctg gggcacctgc ttccacccgt ctctctgccc actcgggtct gcagacctgg 1620ttcccactga ggccctttgc aggatggaac tacggggctt acaggagctt ttgtgtgcct 1680ggtagaaact atttctgttc cagtcacatt gccatcactc ttgtactgcc tgccaccgcg 1740gaggaggctg gtgacaggcc aaaggccagt ggaagaaaca ccctttcatc tcagagtcca 1800ctgtggcact ggccacccct ccccagtaca ggggtgctgc aggtggcaga gtgaatgtcc 1860cccatcatgt ggcccaactc tcctggcctg gccatctccc tccccagaaa cagtgtgcat 1920gggttatttt ggagtgtagg tgacttgttt actcattgaa gcagatttct gcttcctttt 1980atttttatag gaatagagga agaaatgtca gatgcgtgcc cagctcttca ccccccaatc 2040tcttggtggg gaggggtgta cctaaatatt tatcatatcc ttgcccttga gtgcttgtta 2100gagagaaaga gaactactaa ggaaaataat attatttaaa ctcgctccta gtgtttcttt 2160gtggtctgtg tcaccgtatc tcaggaagtc cagccacttg actggcacac acccctccgg 2220acatccagcg tgacggagcc cacactgcca ccttgtggcc gcctgagacc ctcgcgcccc 2280ccgcgccccc cgcgcccctc tttttcccct tgatggaaat tgaccataca atttcatcct 2340ccttcagggg atcaaaagga cggagtgggg ggacagagac tcagatgagg acagagtggt 2400ttccaatgtg ttcaatagat ttaggagcag aaatgcaagg ggctgcatga cctaccagga 2460cagaactttc cccaattaca gggtgactca cagccgcatt ggtgactcac ttcaatgtgt 2520catttccggc tgctgtgtgt gagcagtgga cacgtgaggg gggggtgggt gagagagaca 2580ggcagctcgg attcaactac cttagataat atttctgaaa acctaccagc cagagggtag 2640ggcacaaaga tggatgtaat gcactttggg aggccaaggc gggaggattg cttgagccca 2700ggagttcaag accagcctgg gcaacatacc aagacccccg tctctttaaa aatatatata 2760ttttaaatat acttaaatat atatttctaa tatctttaaa tatatatata tattttaaag 2820accaatttat gggagaattg cacacagatg tgaaatgaat gtaatctaat agaagc 2876221310DNAHomo sapiens 22gctcggagcc cggagcgtgc ctcggcggcc tgtcggtttt caccatggag cagctgagct 60cagcaaacac ccgcttcgcc ttggacctgt tcctggcgtt gagtgagaac aatccggctg 120gaaacatctt catctctccc ttcagcattt catctgctat ggccatggtt tttctgggga 180ccagaggtaa cacggcagca cagctgtcca agactttcca tttcaacacg gttgaagagg 240ttcattcaag attccagagt ctgaatgctg atatcaacaa acgtggagcg tcttatattc 300tgaaacttgc taatagatta tatggagaga aaacttacaa tttccttcct gagttcttgg 360tttcgactca gaaaacatat ggtgctgacc tggccagtgt ggattttcag catgcctctg 420aagatgcaag gaagaccata aaccagtggg tcaaaggaca gacagaagga aaaattccgg 480aactgttggc ttcgggcatg gttgataaca tgaccaaact tgtgctagta aatgccatct 540atttcaaggg aaactggaag gataaattca tgaaagaagc cacgacgaat gcaccattca 600gattgaataa gaaagacaga aaaactgtga aaatgatgta tcagaagaaa aaatttgcat 660atggctacat cgaggacctt aagtgccgtg tgctggaact gccttaccaa ggcgaggagc 720tcagcatggt catcctgctg ccggatgaca ttgaggacga gtccacgggc ctgaagaaga 780ttgaggaaca gttgactttg gaaaagttgc atgagtggac taaacctgag aatctcgatt 840tcattgaagt taatgtcagc ttgcccaggt tcaaactgga agagagttac actctcaact 900ccgacctcgc ccgcctaggt gtgcaggatc tctttaacag tagcaaggct gatctgtctg 960gcatgtcagg agccagagat atttttatat caaaaattgt ccacaagtca tttgtggaag 1020tgaatgaaga gggaacagag gcggcagctg ccacagcagg catcgcaact ttctgcatgt 1080tgatgcccga agaaaatttc actgccgacc atccattcct tttctttatt cggcataatt 1140cctcaggtag catcctattc ttggggagat tttcttcccc ttagaagaaa gagactgtag 1200caatacaaaa atcaagctta gtgctttatt acctgagttt ttaatagagc caatatgtct 1260tatatcttta ccaataaaac cactgtccag aaaaaaaaaa aaaaaaaaaa 131023495DNAHomo sapiensmisc_feature(488)..(488)n is a, c, g, or t 23tttgaatatt tatgtcaaat tacaaaccag tttaaagctg cctatttggc aaaatgatct 60gctgcagaat tttcattttc tgtctctaga atgcagaaaa atgtcttaaa gttccttaat 120ttgcttaatt taatgtggtt tccagaagat gtgaaaacct cctttatttt taaaatacct 180gattccacat tggtcaatag tttcctcttt aatttacctc tctcctctca ctttatctat 240aataagcagg gagaaatgaa gacacaccat caacacgttt gcttagatat gtcctcaact 300aaatttctag tgtcacttac taattctaat ttcatccaat ataacataat taagataaat 360tctataacaa gctacacata ctttccagtt ctaataccat gtttgtgatg gaaacaaagc 420aggagtgccc tctgcaaggt gatcatctga gggtccaaga tgaaggggca cacaggtatt 480ttatctgncc cacac 49524488DNAHomo sapiens 24ctgatatttt gtatattaat gaattatcca agattcgatg ggatttatca gtgtgtagat 60agctctataa tgcttgaatt gtacacttct aagtgtgcag tgcaagagct tgtttatatt 120tcatactttt tatactttga ggaaaaaaag tcaaagaaaa attgtatttg agggaaaaaa 180ccatgaccaa gtaaaggata aattcaaaaa atagcctcat gagacttggc atacacactc 240atgggattcc agttattatg gagtgcttcc atccctctcc accccttccc cccaaaaggt 300tttctttgca agtgcttttg gaactaagag ctagtatctt ggattaactg atgcctgcta 360gtgctttctg attactcgca ttctgtttct tgctttaaaa gaagagtaaa gacaagagtg 420ttggaccagt attgcagttc tgtagtgtca tttcttataa aaaacaaaac aacaacaata 480atttatca 488251396DNAHomo sapiens 25tgactatcca gctctgagag acgggagttt ggagttgccc gctttacttt ggttgggttg 60gggggggcgg cgggctgttt tgttcctttt cttttttaag agttgggttt tcttttttaa 120ttatccaaac agtgggcagc ttcctccccc acacccaagt atttgcacaa tatttgtgcg 180gggtatgggg gtgggttttt aaatctcgtt tctcttggac aagcacaggg atctcgttct 240cctcattttt tgggggtgtg tggggacttc tcaggtcgtg tccccagcct tctctgcagt 300cccttctgcc ctgccgggcc cgtcgggagg cgccatggct cggatgaacc gcccggcccc 360ggtggaggtg agctacaaac acatgcgctt cctcatcacc cacaacccca ccaacgccac 420gctcagcacc ttcattgagg acctgaagaa gtacggggct accactgtgg tgcgtgtgtg 480tgaagtgacc tatgacaaaa cgccgctgga gaaggatggc atcaccgttg tggactggcc 540gtttgacgat ggggcgcccc cgcccggcaa ggtagtggaa gactggctga gcctggtgaa 600ggccaagttc tgtgaggccc ccggcagctg cgtggctgtg cactgcgtgg cgggcctggg 660ccgggctcca gtccttgtgg cgctggcgct tattgagagc gggatgaagt acgaggacgc 720catccagttc atccgccaga agcgccgcgg agccatcaac agcaagcagc tcacctacct 780ggagaaatac cggcccaaac agaggctgcg gttcaaagac ccacacacgc acaagacccg 840gtgctgcgtt atgtagctca ggaccttggc tgggcctggt cgtcatgtag gtcaggacct 900tggctggacc tggaggccct gcccagccct gctctgccca gcccagcagg ggctccaggc 960cttggctggc cccacatcgc cttttcctcc ccgacacctc cgtgcacttg tgtccgagga 1020gcgaggagcc cctcgggccc tgggtggcct ctgggccctt tctcctgtct ccgccactcc 1080ctctggcggc gctggccgtg gctctgtctc tctgaggtgg gtcgggcgcc ctctgcccgc 1140cccctcccac accagccagg ctggtctcct ctagcctgtt tgttgtgggg tgggggtata 1200ttttgtaacc actgggcccc cagcccctct tttgcgaccc cttgtcctga cctgttctcg 1260gcaccttaaa ttattagacc ccggggcagt caggtgctcc ggacacccga aggcaataaa 1320acaggagccg tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1380aaaaaaaaaa aaaaaa 1396262294DNAHomo sapiens 26aagcagttgt tttgctggaa ggagggagtg cgcgggctgc cccgggctcc tccctgccgc 60ctcctctcag tggatggttc caggcaccct gtctggggca gggagggcac aggcctgcac 120atcgaaggtg gggtgggacc aggctgcccc tcgccccagc atccaagtcc tcccttgggc 180gcccgtggcc ctgcagactc tcagggctaa ggtcctctgt tgctttttgg ttccacctta 240gaagaggctc cgcttgacta agagtagctt gaaggaggca ccatgcagga gctgcatctg 300ctctggtggg cgcttctcct gggcctggct caggcctgcc ctgagccctg cgactgtggg 360gaaaagtatg gcttccagat cgccgactgt gcctaccgcg acctagaatc cgtgccgcct 420ggcttcccgg ccaatgtgac tacactgagc ctgtcagcca accggctgcc aggcttgccg 480gagggtgcct tcagggaggt gcccctgctg cagtcgctgt ggctggcaca caatgagatc 540cgcacggtgg ccgccggagc cctggcctct ctgagccatc tcaagagcct ggacctcagc 600cacaatctca tctctgactt tgcctggagc gacctgcaca acctcagtgc cctccaattg 660ctcaagatgg acagcaacga gctgaccttc atcccccgcg acgccttccg cagcctccgt 720gctctgcgct cgctgcaact caaccacaac cgcttgcaca cattggccga gggcaccttc 780accccgctca ccgcgctgtc ccacctgcag atcaacgaga accccttcga ctgcacctgc 840ggcatcgtgt ggctcaagac atgggccctg accacggccg tgtccatccc ggagcaggac 900aacatcgcct gcacctcacc ccatgtgctc aagggtacgc cgctgagccg cctgccgcca 960ctgccatgct cggcgccctc agtgcagctc agctaccaac ccagccagga tggtgccgag 1020ctgcggcctg gttttgtgct ggcactgcac tgtgatgtgg acgggcagcc ggcccctcag 1080cttcactggc acatccagat acccagtggc attgtggaga tcaccagccc caacgtgggc 1140actgatgggc gtgccctgcc tggcacccct gtggccagct cccagccgcg cttccaggcc 1200tttgccaatg gcagcctgct tatccccgac tttggcaagc tggaggaagg cacctacagc 1260tgcctggcca ccaatgagct gggcagtgct gagagctcag tggacgtggc actggccacg 1320cccggtgagg gtggtgagga cacactgggg cgcaggttcc atggcaaagc ggttgaggga 1380aagggctgct atacggttga caacgaggtg cagccatcag ggccggagga caatgtggtc 1440atcatctacc tcagccgtgc tgggaaccct gaggctgcag tcgcagaagg ggtccctggg 1500cagctgcccc caggcctgct cctgctgggc caaagcctcc tcctcttctt cttcctcacc 1560tccttctagc cccacccagg gcttccctaa ctcctcccct tgcccctacc aatgcccctt 1620taagtgctgc aggggtctgg ggttggcaac tcctgaggcc tgcatgggtg acttcacatt 1680ttcctacctc tccttctaat ctcttctaga gcacctgcta tccccaactt ctagacctgc 1740tccaaactag tgactaggat agaatttgat cccctaactc actgtctgcg gtgctcattg 1800ctgctaacag cattgcctgt gctctcctct caggggcagc atgctaacgg ggcgacgtcc 1860taatccaact gggagaagcc tcagtggtgg aattccaggc actgtgactg tcaagctggc 1920aagggccagg attgggggaa tggagctggg gcttagctgg gaggtggtct gaagcagaca 1980gggaatggga gaggaggatg ggaagtagac agtggctggt atggctctga ggctccctgg 2040ggcctgctca agctcctcct gctccttgct gttttctgat gatttggggg cttgggagtc 2100cctttgtcct catctgagac tgaaatgtgg ggatccagga tggcttcctt cctcttaccc 2160ttcctccctc agcctgcaac ctctatcctg gaacctgtcc tccctttctc cccaactatg 2220catctgttgt ctgctcctct gcaaaggcca gccagcttgg gagcagcaga gaaataaaca 2280gcatttctga tgcc 2294271399DNAHomo sapiens 27agtgtgaaat cttcagagaa gaatttctct ttagttcttt gcaagaaggt agagataaag 60acactttttc aaaaatggca atggtatcag aattcctcaa gcaggcctgg tttattgaaa 120atgaagagca ggaatatgtt caaactgtga agtcatccaa aggtggtccc ggatcagcgg 180tgagccccta tcctaccttc aatccatcct cggatgtcgc tgccttgcat aaggccataa 240tggttaaagg tgtggatgaa gcaaccatca ttgacattct aactaagcga aacaatgcac 300agcgtcaaca gatcaaagca gcatatctcc aggaaacagg aaagcccctg gatgaaacac 360ttaagaaagc ccttacaggt caccttgagg aggttgtttt agctctgcta aaaactccag 420cgcaatttga tgctgatgaa cttcgtgctg ccatgaaggg ccttggaact gatgaagata 480ctctaattga gattttggca tcaagaacta acaaagaaat cagagacatt aacagggtct 540acagagagga actgaagaga gatctggcca aagacataac ctcagacaca tctggagatt 600ttcggaacgc tttgctttct cttgctaagg gtgaccgatc tgaggacttt ggtgtgaatg 660aagacttggc tgattcagat gccagggcct tgtatgaagc aggagaaagg agaaagggga 720cagacgtaaa cgtgttcaat accatcctta ccaccagaag ctatccacaa cttcgcagag 780tgtttcagaa atacaccaag tacagtaagc atgacatgaa caaagttctg gacctggagt 840tgaaaggtga cattgagaaa tgcctcacag ctatcgtgaa gtgcgccaca agcaaaccag 900ctttctttgc agagaagctt catcaagcca tgaaaggtgt tggaactcgc cataaggcat 960tgatcaggat tatggtttcc cgttctgaaa ttgacatgaa tgatatcaaa gcattctatc 1020agaagatgta tggtatctcc ctttgccaag ccatcctgga tgaaaccaaa ggagattatg 1080agaaaatcct ggtggctctt tgtggaggaa actaaacatt cccttgatgg tctcaagcta 1140tgatcagaag actttaatta tatattttca tcctataagc ttaaatagga aagtttcttc 1200aacaggatta cagtgtagct acctacatgc tgaaaaatat agcctttaaa tcatttttat 1260attataactc tgtataatag agataagtcc attttttaaa aatgttttcc ccaaaccata 1320aaaccctata caagttgttc tagtaacaat acatgagaaa gatgtctatg tagctgaaaa 1380taaaatgacg tcacaagac 1399283088DNAHomo sapiens 28acaaaaaagc ttttacgagg tatcagcact tttctttcat tagggggaag gcgtgaggaa 60agtaccaaac agcagcggag ttttaaactt taaatagaca ggtctgagtg cctgaacttg 120ccttttcatt ttacttcatc ctccaaggag ttcaatcact tggcgtgact tcactacttt 180taagcaaaag agtggtgccc aggcaacatg ggtgactgga gcgccttagg caaactcctt 240gacaaggttc aagcctactc aactgctgga gggaaggtgt ggctgtcagt acttttcatt 300ttccgaatcc tgctgctggg gacagcggtt gagtcagcct ggggagatga gcagtctgcc 360tttcgttgta acactcagca acctggttgt gaaaatgtct gctatgacaa gtctttccca 420atctctcatg tgcgcttctg ggtcctgcag atcatatttg tgtctgtacc cacactcttg 480tacctggctc atgtgttcta tgtgatgcga aaggaagaga aactgaacaa gaaagaggaa 540gaactcaagg ttgcccaaac tgatggtgtc aatgtggaca tgcacttgaa gcagattgag 600ataaagaagt tcaagtacgg tattgaagag catggtaagg tgaaaatgcg aggggggttg 660ctgcgaacct acatcatcag tatcctcttc aagtctatct ttgaggtggc cttcttgctg 720atccagtggt acatctatgg attcagcttg agtgctgttt acacttgcaa aagagatccc 780tgcccacatc aggtggactg tttcctctct cgccccacgg agaaaaccat cttcatcatc 840ttcatgctgg tggtgtcctt ggtgtccctg gccttgaata tcattgaact cttctatgtt 900ttcttcaagg gcgttaagga tcgggttaag ggaaagagcg acccttacca tgcgaccagt 960ggtgcgctga gccctgccaa agactgtggg tctcaaaaat atgcttattt caatggctgc 1020tcctcaccaa ccgctcccct ctcgcctatg tctcctcctg ggtacaagct ggttactggc 1080gacagaaaca attcttcttg ccgcaattac aacaagcaag caagtgagca aaactgggct 1140aattacagtg cagaacaaaa tcgaatgggg caggcgggaa gcaccatctc taactcccat 1200gcacagcctt ttgatttccc cgatgataac cagaattcta aaaaactagc tgctggacat 1260gaattacagc cactagccat tgtggaccag cgaccttcaa gcagagccag cagtcgtgcc 1320agcagcagac ctcggcctga tgacctggag atctagatac aggcttgaaa gcatcaagat 1380tccactcaat tgtggagaag aaaaaaggtg ctgtagaaag tgcaccaggt gttaattttg 1440atccggtgga ggtggtactc aacagcctta ttcatgaggc ttagaaaaca caaagacatt 1500agaataccta ggttcactgg gggtgtatgg ggtagatggg tggagaggga ggggataaga 1560gaggtgcatg ttggtattta aagtagtgga ttcaaagaac ttagattata aataagagtt 1620ccattaggtg atacatagat aagggctttt tctccccgca aacaccccta agaatggttc 1680tgtgtatgtg aatgagcggg tggtaattgt ggctaaatat ttttgtttta ccaagaaact 1740gaaataattc tggccaggaa taaatacttc ctgaacatct taggtctttt caacaagaaa 1800aagacagagg attgtcctta agtccctgct aaaacattcc attgttaaaa tttgcacttt 1860gaaggtaagc tttctaggcc tgaccctcca ggtgtcaatg gacttgtgct actatatttt 1920tttattcttg gtatcagttt aaaattcaga caaggcccac agaataagat tttccatgca 1980tttgcaaata cgtatattct ttttccatcc acttgcacaa tatcattacc atcacttttt 2040catcattcct cagctactac tcacattcat ttaatggttt ctgtaaacat ttttaagaca 2100gttgggatgt cacttaacat tttttttttt tgagctaaag tcagggaatc aagccatgct 2160taatatttaa caatcactta tatgtgtgtc gaagagtttg ttttgtttgt catgtattgg 2220tacaagcaga tacagtataa actcacaaac acagatttga aaataatgca catatggtgt 2280tcaaatttga acctttctca tggatttttg tggtgtgggc caatatggtg tttacattat 2340ataattcctg ctgtggcaag taaagcacac tttttttttc tcctaaaatg tttttccctg 2400tgtatcctat tatggatact ggttttgtta attatgattc tttattttct ctcctttttt 2460taggatatag cagtaatgct attactgaaa tgaatttcct ttttctgaaa tgtaatcatt 2520gatgcttgaa tgatagaatt ttagtactgt aaacaggctt tagtcattaa tgtgagagac 2580ttagaaaaaa tgcttagagt ggactattaa atgtgcctaa atgaattttg cagtaactgg 2640tattcttggg ttttcctact taatacacag taattcagaa cttgtattct attatgagtt 2700tagcagtctt ttggagtgac cagcaacttt gatgtttgca ctaagatttt atttggaatg 2760caagagaggt tgaaagagga ttcagtagta cacatacaac taatttattt gaactatatg 2820ttgaagacat ctaccagttt ctccaaatgc cttttttaaa actcatcaca gaagattggt 2880gaaaatgctg agtatgacac ttttcttctt gcatgcatgt cagctacata aacagttttg 2940tacaatgaaa attactaatt tgtttgacat tccatgttaa actacggtca tgttcagctt 3000cattgcatgt aatgtagacc tagtccatca gatcatgtgt tctggagagt gttctttatt 3060caataaagtt ttaatttagt ataaacat 308829403DNAHomo sapiens 29tttcattagt tatcattagt ttattataaa agagaaatat ggaaattatt tacatgacga 60aagatttcag aacttcagtg gaatgggcag catcatgttg atgccatttc aatagtgact 120tatttcagtc tacgtacttt ccaagaatgt caccatctct aaataggaaa taatccttgt 180catctagaac tactttggtg cctccatatt ctgggagaag aactttatct ccaactttca 240cgctaactgg ttgaatctct ccaccctttc ctttagaacc cgatccaaca gcgactactg 300ttgcttgcaa tacttttcct tgagattttt ctggaagcat aatgcctcct ttggttacag 360tttcagcagc actcctttca accaatactc ggtcaaagag tgg 403301023DNAHomo sapiens 30gttggctgcc ggtgagttgg gtgccggtgg agtcgtgttg gtcctcagaa tccccgcgta 60gccgctgcct cctcctaccc tcgccatgtt tcttacccgg tctgagtacg acaggggcgt 120gaatactttt tctcccgaag gaagattatt tcaagtggaa tatgccattg aggctatcaa 180gcttggttct acagccattg ggatccagac atcagagggt gtgtgcctag ctgtggagaa 240gagaattact tccccactga tggagcccag cagcattgag aaaattgtag agattgatgc 300tcacataggt tgtgccatga gtgggctaat tgctgatgct aagactttaa ttgataaagc 360cagagtggag acacagaacc actggttcac ctacaatgag acaatgacag tggagagtgt 420gacccaagct gtgtccaatc tggctttgca gtttggagaa gaagatgcag atccaggtgc 480catgtctcgt ccctttggag tagcattatt atttggagga gttgatgaga aaggacccca 540gctgtttcat atggacccat ctgggacctt tgtacagtgt gatgctcgag caattggctc 600tgcttcagag ggtgcccaga gctccttgca agaagtttac cacaagtcta tgactttgaa 660agaagccatc aagtcttcac tcatcatcct caaacaagta atggaggaga agctgaatgc 720aacaaacatt gagctagcca cagtgcagcc tggccagaat ttccacatgt tcacaaagga 780agaacttgaa gaggttatca aggacattta aggaatcctg atcctcagaa cttctctggg 840acaatttcag ttctaataat gtccttaaat tttatttcca gctcctgttc cttggaaaat 900ctccattgta tgtgcatttt ttaaatgatg tctgtacata aaggcagttc tgaaataaag 960aaaattttaa aataaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1020aaa

102331313DNAHomo sapiensmisc_feature(1)..(1)n is a, c, g, or t 31ntcttgggct caagcaancc tcctgccctg gcttcccaaa gtgttcagat tacaagtgtg 60agccactgca cccagaccaa gaaattttaa ccctaactaa atacccaaaa aaagngtata 120tatgttccac aaaggacatg ggtaagaatg tttatagcag cagtatttgt aatagccaga 180aactggaaac aagccaaaca tctatctaca gcagaagaga ctattgttta tttatacaat 240aaactacaat ataggcaata aaatgantga ggctacaaca acaggaaatc aatttcacaa 300acatantact gag 31332358DNAHomo sapiensmisc_feature(205)..(205)n is a, c, g, or t 32tgttaagtac ttaagattta ttgaatgaga actgcattgt acaatatggt gccactagac 60acgtctattt aatttaaatt aaaatataaa actctaaaac tagccatgat tcaaaggttc 120aatagctata tgtgactagt ggctaccata taaaacattt ccatcacaaa gttccattta 180tcagatctta tataggaacc ttgantaaaa tttaatagac aagtgatttt gtatttaaca 240tttcaccttt attgaatgcc ctatagggcc atttgaatac gggtcatgtn caaggcacag 300gggaaaaaaa aactgcagcn ggtaagggtt ttncaggggg gttttccagg tcccctcc 35833326DNAHomo sapiensmisc_feature(3)..(4)n is a, c, g, or t 33ttnnatatta nttatttttt attatacttt aagttttagg gtacatgtgc acaatgtcag 60ggtttgttac atatgtatgg gcaaggactt catgtctaaa acaccaaaag caatggcaac 120aaaagccaaa attgacaaaa gtagtatcat tctattatag ctgcatggaa aaagttaatt 180tattaataca atggatgcct aaggncagaa gtactcaaac ttttggtctc agtactcctt 240tacattctta aaaatcatta nggnccccaa ngantgtttg tttacaaggg ttacttacat 300tgataattac cacatttgaa atgaaa 326342301DNAHomo sapiens 34tcgacagctc tctcgcccag cccagttctg gaagggataa aaagggggca tcaccgttcc 60tgggtaacag agccaccttc tgcgtcctgc tgagctctgt tctctccagc acctcccaac 120ccactagtgc ctggttctct tgctccacca ggaacaagcc accatgtctc gccagtcaag 180tgtgtccttc cggagcgggg gcagtcgtag cttcagcacc gcctctgcca tcaccccgtc 240tgtctcccgc accagcttca cctccgtgtc ccggtccggg ggtggcggtg gtggtggctt 300cggcagggtc agccttgcgg gtgcttgtgg agtgggtggc tatggcagcc ggagcctcta 360caacctgggg ggctccaaga ggatatccat cagcactaga ggaggcagct tcaggaaccg 420gtttggtgct ggtgctggag gcggctatgg ctttggaggt ggtgccggta gtggatttgg 480tttcggcggt ggagctggtg gtggctttgg gctcggtggc ggagctggct ttggaggtgg 540cttcggtggc cctggctttc ctgtctgccc tcctggaggt atccaagagg tcactgtcaa 600ccagagtctc ctgactcccc tcaacctgca aatcgacccc agcatccaga gggtgaggac 660cgaggagcgc gagcagatca agaccctcaa caataagttt gcctccttca tcgacaaggt 720gcggttcctg gagcagcaga acaaggttct ggacaccaag tggaccctgc tgcaggagca 780gggcaccaag actgtgaggc agaacctgga gccgttgttc gagcagtaca tcaacaacct 840caggaggcag ctggacagca tcgtggggga acggggccgc ctggactcag agctgagaaa 900catgcaggac ctggtggaag acttcaagaa caagtatgag gatgaaatca acaagcgtac 960cactgctgag aatgagtttg tgatgctgaa gaaggatgta gatgctgcct acatgaacaa 1020ggtggagctg gaggccaagg ttgatgcact gatggatgag attaacttca tgaagatgtt 1080ctttgatgcg gagctgtccc agatgcagac gcatgtctct gacacctcag tggtcctctc 1140catggacaac aaccgcaacc tggacctgga tagcatcatc gctgaggtca aggcccagta 1200tgaggagatt gccaaccgca gccggacaga agccgagtcc tggtatcaga ccaagtatga 1260ggagctgcag cagacagctg gccggcatgg cgatgacctc cgcaacacca agcatgagat 1320cacagagatg aaccggatga tccagaggct gagagccgag attgacaatg tcaagaaaca 1380gtgcgccaat ctgcagaacg ccattgcgga tgccgagcag cgtggggagc tggccctcaa 1440ggatgccagg aacaagctgg ccgagctgga ggaggccctg cagaaggcca agcaggacat 1500ggcccggctg ctgcgtgagt accaggagct catgaacacc aagctggccc tggacgtgga 1560gatcgccact taccgcaagc tgctggaggg cgaggaatgc agactcagtg gagaaggagt 1620tggaccagtc aacatctctg ttgtcacaag cagtgtttcc tctggatatg gcagtggcag 1680tggctatggc ggtggcctcg gtggaggtct tggcggcggc ctcggtggag gtcttgccgg 1740aggtagcagt ggaagctact actccagcag cagtgggggt gtcggcctag gtggtgggct 1800cagtgtgggg ggctctggct tcagtgcaag cagtggccga gggctggggg tgggctttgg 1860cagtggcggg ggtagcagct ccagcgtcaa atttgtctcc accacctcct cctcccggaa 1920gagcttcaag agctaagaac ctgctgcaag tcactgcctt ccaagtgcag caacccagcc 1980catggagatt gcctcttcta ggcagttgct caagccatgt tttatccttt tctggagagt 2040agtctagacc aagccaattg cagaaccaca ttctttggtt cccaggagag ccccattccc 2100agcccctggt ctcccgtgcc gcagttctat attctgcttc aaatcagcct tcaggtttcc 2160cacagcatgg cccctgctga cacgagaacc caaagttttc ccaaatctaa atcatcaaaa 2220cagaatcccc accccaatcc caaattttgt tttggttcta actacctcca gaatgtgttc 2280aataaaatgc ttttataata t 230135448DNAHomo sapiensmisc_feature(437)..(437)n is a, c, g, or t 35gatcatatta ttaaataata tatgcacaga catggagaga attagttttt actaaaacat 60ttatcagaaa ttttaatact ctgcataacc agtattagca ttagaaatta gccactttta 120aaatgagaaa actgtgtcac tcttcaattt ttttataagc cattgaggaa aacattaact 180cctggatttc agcttcactt ttaacctgca gactaaattt ctttctcaat tatgtcagac 240acacccaagt caatcccaac ccccttgtta ccttgggaag acccgtgctg aaaaaggaga 300tcttccacct aaacacgtgt tctcttattt gaagcaaatc tttttgagaa tttgtttact 360tgatttcttt ccacaataaa ctgacagaga acgctactaa tgattttttt ttttttttgg 420agacggggtt ttgttcntgg ttggccca 44836219DNAHomo sapiensmisc_feature(39)..(39)n is a, c, g, or t 36tgtttttttg aagtgactga ctaaaaagag aacagatana tacaagagtg tcgctggatc 60ctattttata caaggattac gcctctcctg cttggccctt actgtcaccc tgtacaggta 120caaaggctac aaaaaaggaa gcaatataaa cagacacaaa taactttttt gcttttttac 180atgcgatttg taagcttagt ttgagctatt cacaagcta 219372808DNAHomo sapiens 37cggcatgaga ggccagcctg ccagggaaat ccaggaatct gcaacaaaaa cgatgacagt 60ctgaaatact ctctggtgcc aacctccaaa ttctcgtctg tcacttcaga cccccactag 120ttgacagagc agcagaatat caactccagt agacttgaat gtgcctctgg gcaaagaagc 180agagctaacg aggaaaggga tttaaagagt ttttcttggg tgtttgtcaa acttttattc 240cctgtctgtg tgcagagggg attcaacttc aattttctgc agtggctctg ggtccagccc 300cttacttaaa gatctggaaa gcatgaagac tgggcctttt ttcctatgtc tcttgggaac 360tgcagctgca atcccgacaa atgcaagatt attatctgat cattccaaac caactgctga 420aacggtagca cctgacaaca ctgcaatccc cagtttatgg gctgaagctg aagaaaatga 480aaaagaaaca gcagtatcca cagaagacga ttcccaccat aaggctgaaa aatcatcagt 540actaaagtca aaagaggaaa gccatgaaca gtcagcagaa cagggcaaga gttctagcca 600agagctggga ttgaaggatc aagaggacag tgatggtcac ttaagtgtga atttggagta 660tgcaccaact gaaggtacat tggacataaa agaagatatg attgagcctc aggagaaaaa 720actctcagag aacactgatt ttttggctcc tggtgttagt tccttcacag attctaacca 780acaagaaagt atcacaaaga gagaggaaaa ccaagaacaa cctagaaatt attcacatca 840tcagttgaac aggagcagta aacatagcca aggcctaagg gatcaaggaa accaagagca 900ggatccaaat atttccaatg gagaagagga agaagaaaaa gagccaggtg aagttggtac 960ccacaatgat aaccaagaaa gaaagacaga attgcccagg gagcatgcta acagcaagca 1020ggaggaagac aatacccaat ctgatgatat tttggaagag tctgatcaac caactcaagt 1080aagcaagatg caggaggatg aatttgatca gggtaaccaa gaacaagaag ataactccaa 1140tgcagaaatg gaagaggaaa atgcatcgaa cgtcaataag cacattcaag aaactgaatg 1200gcagagtcaa gagggtaaaa ctggcctaga agctatcagc aaccacaaag agacagaaga 1260aaagactgtt tctgaggctc tgctcatgga acctactgat gatggtaata ccacgcccag 1320aaatcatgga gttgatgatg atggcgatga tgatggcgat gatggcggca ctgatggccc 1380caggcacagt gcaagtgatg actacttcat cccaagccag gcctttctgg aggccgagag 1440agctcaatcc attgcctatc acctcaaaat tgaggagcaa agagaaaaag tacatgaaaa 1500tgaaaatata ggtaccactg agcctggaga gcaccaagag gccaagaaag cagagaactc 1560atcaaatgag gaggaaacgt caagtgaagg caacatgagg gtgcatgctg tggattcttg 1620catgagcttc cagtgtaaaa gaggccacat ctgtaaggca gaccaacagg gaaaacctca 1680ctgtgtctgc caggatccag tgacttgtcc tccaacaaaa ccccttgatc aagtttgtgg 1740cactgacaat cagacctatg ctagttcctg tcatctattc gctactaaat gcagactgga 1800ggggaccaaa aaggggcatc aactccagct ggattatttt ggagcctgca aatctattcc 1860tacttgtacg gactttgaag tgattcagtt tcctctacgg atgagagact ggctcaagaa 1920tatcctcatg cagctttatg aagccaactc tgaacatgct ggttatctaa atgagaagca 1980gagaaataaa gtcaagaaaa tttacctgga tgaaaagagg cttttggctg gggaccatcc 2040cattgatctt ctcttaaggg actttaagaa aaactaccac atgtatgtgt atcctgtgca 2100ctggcagttt agtgaacttg accaacaccc tatggataga gtcttgacac attctgaact 2160tgctcctctg cgagcatctc tggtgcccat ggaacactgc ataacccgtt tctttgagga 2220gtgtgacccc aacaaggata agcacatcac cctgaaggag tggggccact gctttggaat 2280taaagaagag gacatagatg aaaatctctt gttttgaacg aagattttaa agaactcaac 2340tttccagcat cctcctctgt tctaaccact tcagaaatat atgcagctgt gatacttgta 2400gatttatatt tagcaaaatg ttagcatgta tgacaagaca atgagagtaa ttgcttgaca 2460acaacctatg caccaggtat ttaacattaa ctttggaaac aaaaatgtac aattaagtaa 2520agtcaacata tgcaaaatac tgtacattgt gaacagaagt ttaattcata gtaatttcac 2580tctctgcatt gacttatgag ataattaatg attaaactat taatgataaa aataatgcat 2640ttgtattgtt cataatatca tgtgcacttc aagaaaatgg aatgctactc ttttgtggtt 2700tacgtgtatt attttcaata tcttaatacc ctaataaaga gtccataaaa atccaaaaaa 2760aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 280838416DNAHomo sapiensmisc_feature(8)..(9)n is a, c, g, or t 38tttatttnnt tgaatctatt taattgctca gactgtgcta gagaatacgt accatgaaat 60acatatattt cataaggttc agttacaaaa tggattgttt caaatggcaa tttcttacac 120taacctgatt atgaaaaaaa gaagtctgta tcatctgctt ccaagtctgt tatgtccaaa 180tatattttaa ttatgcattt attttgctac ttttataaat attagagatt tcaccntaaa 240ttatttttgt aactagttct agaacatgtt tnccaattat tattnnccta atgggagaca 300tataattgac cnatggttta tggcatatat ggtcctctac acagnggaac ctntttttaa 360aaggaatagg taaaggaaaa tgcgggacgg cctgggctct ccagggccaa gggcca 41639471DNAHomo sapiensmisc_feature(6)..(6)n is a, c, g, or t 39tttttntttt tttaaagtga atatacaatt tatttaacat tcaaacttca ttaagacatg 60tgcaatatgg caattttact ggggattaaa ccctacctag gattgcttgc tggggcttag 120caacagggtc cagttcacac ttagcactaa ttaaatactt tattgaataa atacaatacc 180angcaaaatg cattcaaatg ctttctaaaa aaattttaaa ggcctttcta ctcaggctaa 240tgacaaacac aataaaggca gatatgctag tttaacataa ttgggctgat tttatacagg 300cacttatatc ttttagtcca caaggtatat tattaaatga taggggaaca tctnatacaa 360ccatttctac agnactaggg gaattaaatt tctatgggaa ggaagggttt ttacagaccc 420catctttttt tacccncccc aacagttcta actctaaggg ggttatagcc a 47140525DNAHomo sapiens 40tttttttttt tttttttttg aaattttaac attttatatg catataaagc tgaacacatg 60actaacaatc tagtggatgt gtatagaacc caacaattgc agaatatata ttcttttcaa 120gcacacattg aatatttata aaaactgatc atatactgtg ccgtaagttt catctcagca 180aatttcaaag ttttgatgcc atgaatgaaa tgaaacctga catttcaaaa ttataaacag 240aatatgccct ggagtaactt gtggtattgt ttggggatga ggagagccat ccgaatagtg 300ttttaaggaa agtctctatt cattgatctg gggtaacaag gcaggaacca ttccaatgca 360gaagctttgg ctaagcagtt gagcgttcag tagtgcatgt aaattcctgt gtgaaggctg 420tggtgtcatg gctaaaggca tagcccctgg aacccagact gtttgggttc aaatctcagt 480tctgctgctt aactcactgt gtgatggtgg gcaagttgcc taacc 525411402DNAHomo sapiens 41ggggacgaag ggaagctcca gcgtgtggcc ccggcgagtg cggataaaag ccgccccgcc 60gggctcgggc ttcattctga gccgagcccg gtgccaagcg cagctagctc agcaggcggc 120agcggcggcc tgagcttcag ggcagccagc tccctcccgg tctcgccttc cctcgcggtc 180agcatgaaag ccttcagtcc cgtgaggtcc gttaggaaaa acagcctgtc ggaccacagc 240ctgggcatct cccggagcaa aacccctgtg gacgacccga tgagcctgct atacaacatg 300aacgactgct actccaagct caaggagctg gtgcccagca tcccccagaa caagaaggtg 360agcaagatgg aaatcctgca gcacgtcatc gactacatct tggacctgca gatcgccctg 420gactcgcatc ccactattgt cagcctgcat caccagagac ccgggcagaa ccaggcgtcc 480aggacgccgc tgaccaccct caacacggat atcagcatcc tgtccttgca ggcttctgaa 540ttcccttctg agttaatgtc aaatgacagc aaagcactgt gtggctgaat aagcggtgtt 600catgatttct tttattcttt gcacaacaac aacaacaaca aattcacgga atcttttaag 660tgctgaactt atttttcaac catttcacaa ggaggacaag ttgaatggac ctttttaaaa 720agaaaaaaaa aatggaagga aaactaagaa tgatcatctt cccagggtgt tctcttactt 780ggactgtgat attcgttatt tatgaaaaag acttttaaat gccctttctg cagttggaag 840gttttcttta tatactattc ccaccatggg gagcgaaaac gttaaaatca caaggaattg 900cccaatctaa gcagactttg ccttttttca aaggtggagc gtgaatacca gaaggatcca 960gtattcagtc acttaaatga agtcttttgg tcagaaatta cctttttgac acaagcctac 1020tgaatgctgt gtatatattt atatataaat atatctattt gagtgaaacc ttgtgaactc 1080tttaattaga gttttcttgt atagtggcag agatgtctat ttctgcattc aaaagtgtaa 1140tgatgtactt attcatgcta aactttttat aaaagtttag ttgtaaactt aaccctttta 1200tacaaaataa atcaagtgtg tttattgaat ggtgattgcc tgctttattt cagaggacca 1260gtgctttgat ttttattatg ctatgttata actgaaccca aataaataca agttcaaatt 1320tatgtagact gtataagatt ataataaaac atgtctgaag tcaaaaaaaa aaaaaaaaaa 1380aaaaaaaaaa aaaaaaaaaa aa 1402422544DNAHomo sapiens 42ctcactcaga cccatgaggc cctgcctggt ctcgtctggg acctgggaca gcagctggga 60gacctgagcc tggagtctgg gggcctggaa caggagagcg ggcgtagctc gggcttctat 120gaagatccca gctctacagg aggtccagat tcaccaccct caaccttctg tggggacagt 180ggcttctctg gatccagctc ctatggtcgc ctgggtccct ctgagccccg gggcatctat 240gccagtgaga ggcccaagtc cctaggagac gccagtccca gcgctccgga ggtggtgggc 300gcgcgggcag cggtgccgcg gtccttctca gcgccctacc cgacggcagg tgggtcgccg 360gcccggaggc ctgctcctcg gcggagcggc gggcccgcgc cgggcccttt ctgacgccca 420gccccctgca cgccgtggcg atgcgcagcc cgcggccctg cggccgccct cccaccgact 480cgcccgacgc ggggggcgca gggcggcccc tggacggcta catctcggcg ctcctgcgca 540ggcgccgccg ccggggggcg ggccagcccc ggaccagtcc tgggggcgcg gacggcggcc 600cgcggcgcca gaacagcgtg cgccagcggc cgcccgacgc gtctccgtcc cccggcagcg 660cgcgacccgc gcgggagccc tcgttggagc gcgtcggggg ccaccccacc agccctgccg 720ccttgagccg cgcctgggcg tcgtcgtggg agtcggaggc ggcacccgag cccgctgcgc 780cgcccgccgc cccctcaccc cccgacagcc cggctgaggg ccgcttggtg aaggcgcagt 840acatcccggg cgcgcaggcg gccacccgag gcctccctgg ccgcgccgcc cgccgcaaac 900cgccgccact gacccgcggc cgcagcgtgg agcagtcacc accccgggag cgtccccggg 960ccgccggccg ccgtggacgc atggccgagg cttcgggccg ccgcggctcg cccagggccc 1020gcaaggcctc gcgctcccag tctgagacca gcctgctggg ccgcgcctcc gcggtccctt 1080cggggccccc taagtacccc acggcggagc gggaagagcc tcggcctcca cggccacgcc 1140gcggcccagc gcccacgctg gcggcccagg ccgcagggtc ctgccgtcgc tggcgctcca 1200ctgcggagat cgacgctgcc gatgggcgcc gcgtgcggcc ccgagcccct gcggcgcgtg 1260ttcccggccc cggcccgtcc ccgtcagctc cccagcgtcg tctgctttac ggctgcgcgg 1320gcagcgactc cgagtgctcg gctgggcgcc tggggcccct gggacgccgg gggcctgcgg 1380gaggcgtcgg cgggggttac ggggagagcg aatcgagcgc cagcgaggga gaatcgcctg 1440ccttcagctc tgcctccagc gactcagacg gcagcggtgg cctcgtgtgg ccgcagcagc 1500tggtggcggc caccgcggcc tctgggggtg gagcaggtgc aggggcgccc gcaggccccg 1560ccaaagtctt cgtgaaaatc aaagcttccc acgcgctcaa gaaaaagata ctgcgtttcc 1620gttcgggttc tctcaaggtc atgactacag tgtgagtttg gggatttgct tgggctcccc 1680cttcatggcc tctgcacctc cacactccca accactgacc cttccacatc taccttccaa 1740agaccatcgt tttctctgct tccaaagacc cccctcactc tccccactcc taacagtctt 1800ggttgaaaag gctcccccac caccaccgag aggaatgggg aggagccctg tttgacccag 1860ttcagcttct agcttggaag cccttgggca agacagttcc ccttctctgg gcgtcacttt 1920cctcatctgt acagtaagtg tccatgtatg caaaaggggt aattcggttt gaatttcccc 1980gttttagttt agaagcctag tctgtttgtt ccccttcacc gctctccctc tcattcctga 2040tgagccctct cattcctcct ttccttgccc agctatggcc ccctctcatt cacaaagtgc 2100cccctccatg tccctggacc cttaagatat ccccttggca ccctggtcag agactctgtg 2160tctgactcag gtggtccctg cagagtgccc tgggaaggga aggagcactg atttgggggt 2220tttgagggtc aagtaggggt tggtaacacc tggaaagaag gactctttca cttcgatccc 2280tggacaatta tggaggattc ggaggtagaa gaggggaagg aagatggttt ctatctcatg 2340acccccactc cctgtgagag ggaatggggg aagcctgatg accctcagct gttccaatct 2400agtatttttt ttctttttta aaattactgt atttattatg acgatggtga ctccccagtg 2460caaagggggg ccagattctg tgtgtttctc taacctcttt gtaaataaat gcacagtgta 2520acataaaaaa aaaaaaaaaa aaaa 254443374DNAHomo sapiensmisc_feature(345)..(345)n is a, c, g, or t 43aagcattaga gaagcatcag gccgccattc tagactcaac tgctcacctc ctctgatcca 60ctgaggtgtc tctggaaatc ctccaccaca gccacagcct cctcaccact ctcagggtga 120tgcagctgca cccaggtccg gagctcccca gggaggatgc tcaggaactg ctcaagcacc 180agcagctcca ggatctgctc cttggtgtgc acctctgggc atgagccacc agcggcagag 240cttcccgaag ccggctcaat gctttcctgc ggcccagaca tctcgtgggt aacacaattg 300cctgaagtgt aggccggaag attttcgcag acaggaggat agttnttttt gggagattgt 360tggccttgnc ccca 374444299DNAHomo sapiens 44tgacagcgga ggcggcggcg gctgcaggct ccgagccgta ggagccggat cgggggaggg 60gccgggccca ggagcctcag ccccgccggc agccctaagg gcaaggtaac cgccacgggg 120tccccgtcgc gaccccctcc ctcccggagc tcccgtcccc gggatcccaa gctccgcccc 180gccgaccccc gtctcccctg gaccccggct ctagcctgac gagatcccca acctcctgag 240gtgctctggc cccggattct cccgggctgc attctctgct cctcctcgcc tgcgaagcat 300cacgtccgct tcccgacgct gagggcagcc ccgtccaggg cagtggctct gccaatgatc 360ctgtgagtat tcaggaatca ctgttgcccc tggggatcct tgtcctggag tggcccacct 420gcttgccccc agcatggcgt ccgacactcc cgagtcgctg atggccctct gtactgactt 480ctgcttgcgc aacctggatg gcaccctggg ctacctgctg gacaaggaga ccctgcggct 540acatccggac atcttcttgc ccagcgagat ctgtgaccgg ctcgtcaatg agtatgtgga 600gctggtgaac gctgcctgta acttcgagcc acacgagagc ttcttcagcc tcttttcgga 660cccccgcagc acccgcctca cgcggatcca cctccgtgag gacctggtgc aggaccagga 720cctggaggcc atccgcaagc aggacctggt ggagctgtac ctgactaact gcgagaagct 780gtccgccaag agcctgcaga cactgaggag cttcagccac accctggtgt ccttgagcct 840cttcggctgt acaaacattt tctatgagga ggagaaccca gggggctgtg aagatgagta 900cctcgtcaac cccacctgcc aggtgctggt taaggatttc accttcgagg gcttcagccg 960cctccgcttc ctcaacttgg gccgcatgat tgattgggtc cctgtggagt ccctgctgcg 1020gccgcttaac tccctggctg ccttggacct ctcaggcatt cagacgagcg acgccgcctt 1080cctcacccag tggaaagaca gcctggtgtc cctcgtcctc tacaacatgg acctgtccga 1140cgaccacatc cgggtcatcg tgcagctgca caagctgcga cacctggaca tctcccgaga 1200ccgcctctcc agctactaca agttcaagct gactcgggag gtgctgagcc tctttgtgca 1260gaagctgggg aacctaatgt ccctggacat ctctggccac atgatcctag agaactgcag 1320catctccaag atggaagagg aagcggggca gaccagcatt gagccttcca agagcagcat 1380catacctttc cgggctctga agaggccgct gcagttcctc gggctctttg agaactctct 1440gtgccgcctc acgcacattc cagcctacaa agtaagtggt gacaaaaacg aagagcaggt 1500gctgaatgcc atcgaggcct acacggagca ccggcctgag atcacctcgc gggccatcaa 1560cttgcttttt gacatcgccc gcatcgagcg ttgcaaccag ctgctgcggg ccctgaagct 1620ggtcatcacg gccctcaagt gccacaaata tgacaggaac

attcaagtga caggcagcgc 1680cgctctcttc tacctaacaa attccgagta ccgctcagag cagagtgtga agctgcgccg 1740gcaggttatc caggtggtgc tgaatggcat ggaatcctac caggaggtga cggtgcagcg 1800gaactgctgc ctgacgctct gcaacttcag catccccgag gagctggaat tccagtaccg 1860ccgggtcaac gagctcctgc tcagcatcct caaccccacg cggcaggacg agtctatcca 1920gcggatcgcc gtgcacctgt gcaatgccct ggtctgccag gtagacaacg accacaagga 1980ggccgtgggc aagatgggct ttgtcgtgac catgctgaag ctgattcaga agaagctgct 2040ggacaagaca tgtgaccagg tcatggagtt ctcctggagt gccctgtgga acatcacaga 2100tgaaactcct gacaactgcg agatgttcct caatttcaac ggcatgaagc tcttcctgga 2160ctgcctgaag gaattcccag agaagcagga actgcatagg aatatgctag gacttttggg 2220gaatgtggca gaagtgaagg agctgaggcc tcaactaatg acttcccagt tcatcagcgt 2280cttcagcaac ctgttggaga gcaaggccga tgggatcgag gtttcctaca atgcctgcgg 2340cgtcctctcc cacatcatgt ttgatggacc cgaggcctgg ggcgtctgtg agccccagcg 2400tgaggaggtg gaggaacgca tgtgggctgc catccagagc tgggacataa actctcggag 2460aaacatcaat tacaggtcat ttgaaccaat tctccgcctc cttccccagg gaatctctcc 2520tgtcagccag cactgggcaa cctgggccct gtataacctc gtgtctgtct acccggacaa 2580gtactgccct ctgctgatca aagaaggggg gatgcccctt ctgagggaca taattaagat 2640ggcgaccgca cggcaggaga ccaaggaaat ggcccgcaag gtgattgagc actgcagtaa 2700ctttaaagag gagaacatgg acacgtctag atagaggcct ccgtccccat ggccgccacc 2760gctctggacc acaggcgggg aggaagcatg ctcaagcagc ccagcgggcg ggccccttcc 2820gagggagcct cccacggagt gaagagacat gggggacttt tgcacaaccg acgcttttcc 2880ttaatgttag tgagatatat atatattata tatatatatt ttttttttgg ttaggaagtg 2940tgaagttttg tgtgtatgat ttctgtgcaa aaacaaaagc aacactcctg agtccttgca 3000gcttccttgg ccattctcaa acccactcag ccttcatcgc tgacacacac actcctaccc 3060caaccagact aaatgcctat aacgctgtga gtgtccagtc cttgtccagg aaactcagat 3120cccggcctgg cttcctttca tgagaggagc aggccttgga cagcgtatcg agcatcctga 3180cccactgccc ctgcctgaga acgccatctc ggctcccggg cacagctgat ggggtttggg 3240gattagaact taccccactg ggtctcccaa aagccttggt gctcccggct gtgggccatc 3300tggggcagga aagtgagcca ttcctaggct gaggtccagg cagccctgcc cctgaagacc 3360ctctaggagc agggcaccca gtggccctgc tgctgtccag ccaggcctgc ctgaggccac 3420gctgctatgg aggctgcctc ctagtctccc accaggtccc aggctgtgga aagccccagc 3480ccagggatgg tcagaactcg ggggcagatt ccactgcccc ttctgccaaa cacatccaga 3540acctgccctc agccctggaa gctagcatct tctggggcca ggggcttgct tcctcgctcc 3600atagccctca actgcccagg cgctcccacc agcagaactg agcctgcctc ctcctcccag 3660cctgccccgc tgcccagagg accccacgcc tctcagaggc agaggtccca tgccagcctt 3720tgacccacaa cggccacaca gccgcctcca gaccagcact cggactgccc tgcagtggcc 3780gcttgggcct ccctggcggt cccgccctgc cctaggcttt accttggaag cctgagaggc 3840gccggctctc ttgctcctcc atcgatggac actgcattgc ttctcatcgg acacttgtgg 3900agcgcagggg cctggggagc agcgctaacc ctggaggcag cctttgggtg atggcttttt 3960cttccctttt cctcccgcgg gcctgttttc aggtgttcct agcatttctg cctccaggca 4020ggacggcagg ggtgagcagc tttgggagag acacctggcc tttttctcct ggagcctctc 4080cctcccggcc ctgggaagtg ggcgcagccc tgtgttcccc cagcttggca gatgggctgc 4140atgcggcgct cccttccttc ccacgctcag cggccccggc cagaccctgg cagacttcac 4200acctcattgc tttaccccct ggggcctggg gaaatgtctg tactttggga agtcacagaa 4260atacattttt gtgcaaaatg gaaaaaaaaa aaaaaaaaa 4299456990DNAHomo sapiens 45atggctgaga gcgcctcccc gccctcctca tctgcagcag ccccagccgc tgagccagga 60gtcaccacgg agcagcccgg accccggagc cccccatcct ccccgccagg cctggaggag 120cctctggatg gagctgatcc tcatgtccca cacccagacc tggcgcctat tgccttcttc 180tgcctgcgac agaccaccag cccccggaac tggtgcatca agatggtgtg caacccgtgg 240tttgaatgtg tcagcatgct ggtgatcctg ctgaactgcg tgacacttgg catgtaccag 300ccgtgcgacg acatggactg cctgtccgac cgctgcaaga tcctgcaggt ctttgatgac 360ttcatcttta tcttctttgc catggagatg gtgctcaaga tggtggccct ggggattttt 420ggcaagaagt gctacctcgg ggacacatgg aaccgcctgg atttcttcat cgtcatggca 480gggatggtcg agtactccct ggaccttcag aacatcaacc tgtcagccat ccgcaccgtg 540cgcgtcctga ggcccctcaa agccatcaac cgcgtgccca gtatgcggat cctggtgaac 600ctgctcctgg acacactgcc catgctgggg aatgtcctgc tgctctgctt ctttgtcttc 660ttcatctttg gcatcatagg tgtgcagctc tgggcgggcc tgctgcgtaa ccgctgcttc 720ctggaggaga acttcaccat acaaggggat gtggccttgc ccccatacta ccagccggag 780gaggatgatg agatgccctt catctgctcc ctgtcgggcg acaatgggat aatgggctgc 840catgagatcc ccccgctcaa ggagcagggc cgtgagtgct gcctgtccaa ggacgacgtc 900tacgactttg gggcggggcg ccaggacctc aatgccagcg gcctctgtgt caactggaac 960cgttactaca atgtgtgccg cacgggcagc gccaaccccc acaagggtgc catcaacttt 1020gacaacatcg gttatgcttg gattgtcatc ttccaggtga tcactctgga aggctgggtg 1080gagatcatgt actacgtgat ggatgctcac tccttctaca acttcatcta cttcatcctg 1140cttatcatag tgggctcctt cttcatgatc aacctgtgcc tcgttgtcat agcgacccag 1200ttctcggaga ccaagcaacg ggagcaccgg ctgatgctgg agcagcggca gcgctacctg 1260tcctccagca cggtggccag ctacgccgag cctggcgact gctacgagga gatcttccag 1320tatgtctgcc acatcctgcg caaggccaag cgccgcgccc tgggcctcta ccaggccctg 1380cagagccggc gccaggccct gggcccggag gccccggccc ccgccaaacc tgggccccac 1440gccaaggagc cccggcacta ccatgggaag actaagggtc agggagatga agggagacat 1500ctcggaagcc ggcattgcca gactttgcat gggcctgcct cccctggaaa tgatcactcg 1560ggaagagagc tgtgcccgca acatagcccc ctggatgcga cgccccacac cctggtgcag 1620cccatccccg ccacgctggc ttccgatccc gccagctgcc cttgctgcca gcatgaggac 1680ggccggcggc cctcgggcct gggcagcacc gactcgggcc aggagggctc gggctccggg 1740agctccgctg gtggcgagga cgaggcggat ggggacgggg cccggagcag cgaggacgga 1800gcctcctcag aactggggaa ggaggaggag gaggaggagc aggcggatgg ggcggtctgg 1860ctgtgcgggg atgtgtggcg ggagacgcga gccaagctgc gcggcatcgt ggacagcaag 1920tacttcaacc ggggcatcat gatggccatc ctggtcaaca ccgtcagcat gggcatcgag 1980caccacgagc agccggagga gctgaccaac atcctggaga tctgcaatgt ggtcttcacc 2040agcatgtttg ccctggagat gatcctgaag ctggctgcat ttgggctctt cgactacctg 2100cgtaacccct acaacatctt cgacagcatc attgtcatca tcagcatctg ggagatcgtg 2160gggcaggcgg acggtgggct gtcggtgctg cggaccttcc ggctgctgcg cgtgctgaaa 2220ctggtgcgct tcatgcctgc cctgcggcgc cagctcgtgg tgctcatgaa gaccatggac 2280aacgtggcca ccttctgcat gctgctcatg ctcttcatct tcatcttcag catccttggg 2340atgcatattt ttggctgcaa gttcagcctc cgcacggaca ctggagacac ggtgcccgac 2400aggaagaact tcgactccct gctgtgggcc atcgtcactg tgttccagat cctcacccag 2460gaggactgga acgtcgttct ctacaatggc atggcctcca cttctccctg ggcctccctc 2520tactttgtcg ccctcatgac cttcggcaac tatgtgctct tcaacctgct ggtggccatc 2580ctggtggagg gcttccaggc ggagggtgac gccaatcgct cctactcgga cgaggaccag 2640agctcatcca acatagaaga gtttgataag ctccaggaag gcctggacag cagcggagat 2700cccaagctct gcccaatccc catgaccccc aatgggcacc tggaccccag tctcccactg 2760ggtgggcacc taggtcctgc tggggctgcg ggacctgccc cccgactctc actgcagccg 2820gaccccatgc tggtggccct gggctcccga aagagcagtg tcatgtctct agggaggatg 2880agctatgacc agcgctccct gtccagctcc cggagctcct actacgggcc atggggccgc 2940agcgcggcct gggccagccg tcgctccagc tggaacagcc tcaagcacaa gccgccgtcg 3000gcggagcatg agtccctgct ctctgcggag cgcggcggcg gcgcccgggt ctgcgaggtt 3060gccgcggacg aggggccgcc gcgggccgca cccctgcaca ccccacacgc ccaccacatt 3120catcacgggc cccatctggc gcaccgccac cgccaccacc gccggacgct gtccctcgac 3180aacagggact cggtggacct ggccgagctg gtgcccgcgg tgggcgccca cccccgggcc 3240gcctggaggg cggcaggccc ggcccccggg catgaggact gcaatggcag gatgcccagc 3300atcgccaaag acgtcttcac caagatgggc gaccgcgggg atcgcgggga ggatgaggag 3360gaaatcgact acaccctgtg cttccgcgtc cgcaagatga tcgacgtcta taagcccgac 3420tggtgcgagg tccgcgaaga ctggtctgtc tacctcttct ctcccgagaa caggttccgg 3480gtcctgtgtc agaccattat tgcccacaaa ctcttcgact acgtcgtcct ggccttcatc 3540tttctcaact gcatcaccat cgccctggag cggcctcaga tcgaggccgg cagcaccgaa 3600cgcatctttc tcaccgtgtc caactacatc ttcacggcca tcttcgtggg cgagatgaca 3660ttgaaggtag tctcgctggg cctgtacttc ggcgagcagg cgtacctacg cagcagctgg 3720aacgtgctgg atggctttct tgtcttcgtg tccatcatcg acatcgtggt gtccctggcc 3780tcagccgggg gagccaagat cttgggggtc ctccgagtct tgcggctcct gcgcacccta 3840cgccccctgc gtgtcatcag ccgggcgccg ggcctgaagc tggtggtgga gacactcatc 3900tcctccctca agcccatcgg caacatcgtg ctcatctgct gtgccttctt catcatcttt 3960ggcatcctgg gagtgcagct cttcaagggc aagttctacc actgtctggg cgtggacacc 4020cgcaacatca ccaaccgctc ggactgcatg gccgccaact accgctgggt ccatcacaaa 4080tacaacttcg acaacctggg ccaggctctg atgtccctct ttgtcctggc atccaaggat 4140ggttgggtga acatcatgta caatggactg gatgctgttg ctgtggacca gcagcctgtg 4200accaaccaca acccctggat gctgctgtac ttcatctcct tcctgctcat cgtcagcttc 4260tttgtgctca acatgtttgt gggtgtcgtg gtggagaact tccacaagtg ccggcagcac 4320caggaggctg aagaggcacg gcggcgtgag gagaagcggc tgcggcgcct ggagaagaag 4380cgccggaagg cccagcggct gccctactat gccacctatt gtcacacccg gctgctcatc 4440cactccatgt gcaccagcca ctacctggac atcttcatca ccttcatcat ctgcctcaac 4500gtggtcacca tgtccctgga gcactacaat cagcccacgt ccctggagac agccctcaag 4560tactgcaact atatgttcac cactgtcttt gtgctggagg ctgtgctgaa gctggtggca 4620tttggtctga ggcgcttctt caaggaccga tggaaccagc tggacctggc cattgtgcta 4680ctgtcagtca tgggcatcac cctggaggag atcgagatca atgcggccct gcccatcaat 4740cccaccatca tccgcatcat gagggttctg cgcattgccc gagtgctgaa gctgttgaag 4800atggccacag gaatgcgggc cctgctggac acggtggtgc aagctttgcc ccaggtgggc 4860aacctgggcc tcctcttcat gctgctcttc ttcatctatg ctgctctcgg ggtggagctc 4920tttgggaagc tggtctgcaa cgacgagaac ccgtgcgagg gcatgagccg gcatgccacc 4980ttcgagaact tcggcatggc cttcctcaca ctcttccagg tctccacggg tgacaactgg 5040aacgggatca tgaaggacac gctgcgggac tgcacccacg acgagcgcag ctgcctgagc 5100agcctgcagt ttgtgtcgcc gctgtacttc gtgagcttcg tgctcaccgc gcagttcgtg 5160ctcatcaacg tggtggtggc tgtgctcatg aagcacctgg acgacagcaa caaggaggcg 5220caggaggacg ccgagatgga tgccgagctc gagctggaga tggcccatgg cctgggccct 5280ggcccgaggc tgcctaccgg ctccccgggc gcccctggcc gagggccggg aggggcgggc 5340ggcgggggcg acaccgaggg cggcttgtgc cggcgctgct actcgcctgc ccaggagaac 5400ctgtggctgg acagcgtctc tttaatcatc aaggactcct tggaggggga gctgaccatc 5460atcgacaacc tgtcgggctc catcttccac cactactcct cgcctgccgg ctgcaagaag 5520tgtcaccacg acaagcaaga ggtgcagctg gctgagacgg aggccttctc cctgaactca 5580gacaggtcct cgtccatcct gctgggtgac gacctgagtc tcgaggaccc cacagcctgc 5640ccacctggcc gcaaagacag caagggtgag ctggacccac ctgagcccat gcgtgtggga 5700gacctgggcg aatgcttctt ccccttgtcc tctacggccg tctcgccgga tccagagaac 5760ttcctgtgtg agatggagga gatcccattc aaccctgtcc ggtcctggct gaaacatgac 5820agcagtcaag cacccccaag tcccttctcc ccggatgcct ccagccctct cctgcccatg 5880ccagccgagt tcttccaccc tgcagtgtct gccagccaga aaggcccaga aaagggcact 5940ggcactggaa ccctccccaa gattgcgctg cagggctcct gggcatctct gcggtcacca 6000agggtcaact gtaccctcct ccggcaggcc accgggagcg acacgtcgct ggacgccagc 6060cccagcagct ccgcgggcag cctgcagacc acgctcgagg acagcctgac cctgagcgac 6120agcccccggc gtgccctggg gccgcccgcg cctgctccag gaccccgggc cggcctgtcc 6180cccgccgctc gccgccgcct gagcctgcgc ggccggggcc tcttcagcct gcgggggctg 6240cgggcgcatc agcgcagcca cagcagcggg ggctccacca gcccgggctg cacccaccac 6300gactccatgg acccctcgga cgaggagggc cgcggtggcg cgggcggcgg gggcgcgggc 6360agcgagcact cggagaccct cagcagcctc tcgctcacct ccctcttctg cccgccgccc 6420ccgccgccag cccccggcct cacgcccgcc aggaagttca gcagcaccag cagcctggcc 6480gcccccggcc gcccccacgc cgccgccctg gcccacggcc tggcccggag cccctcgtgg 6540gccgcggacc gcagcaagga cccccccggc cgggcaccgc tgcccatggg cctgggcccc 6600ttggcgcccc cgccgcaacc gctccccgga gagctggagc cgggagacgc cgccagcaag 6660aggaagagat gagggtcgca ggggcccccg gccgcccacc gcccgccccg tctcaccttc 6720tttacctcag gagccaggag cagacagcaa tacttcgtcc acacctggga tcgcgcaggg 6780cccgcagggc acaggcgccc gacagccggg ctgagcggag tctgggttag ccaggcctgc 6840gtggcccatg gtggcccttc cagtgcatat acatacatat atatatatat atgcatatat 6900atatatatat atatatatat gtgtatacac acacacatag acagacatat atatatatat 6960ttattttttt tactgagagc ttatgacttc 699046139DNAHomo sapiensmisc_feature(138)..(138)n is a, c, g, or t 46ctaatatttg catgtacaca atgagttatc ttagggaggg gatccaagtg gaaacacaaa 60atttattttt gtgtgtatac acacatacac acatcactta tatacatagc cttaaggtaa 120ttttataccg tatttttng 13947674DNAHomo sapiens 47ccccttggtt ccgcccgcgc gtcacgtgac cccagcgcct acttgggctg aggagccgcc 60gcgtcccctc gccgagtccc ctcgccagat tccctccgtc gccgccaaga tgatgtgcgg 120ggcgccctcc gccacgcagc cggccaccgc cgagacccag cacatcgccg accaggtgag 180gtcccagctt gaagagaaag aaaacaagaa gttccctgtg tttaaggccg tgtcattcaa 240gagccaggtg gtcgcgggga caaactactt catcaaggtg cacgtcggcg acgaggactt 300cgtacacctg cgagtgttcc aatctctccc tcatgaaaac aagcccttga ccttatctaa 360ctaccagacc aacaaagcca agcatgatga gctgacctat ttctgatcct gactttggac 420aaggcccttc agccagaaga ctgacaaagt catcctccgt ctaccagagc gtgcacttgt 480gatcctaaaa taagcttcat ctccgggctg tgccccttgg ggtggaaggg gcaggattct 540gcagctgctt ttgcatttct cttcctaaat ttcattgtgt tgatttcttt ccttcccaat 600aggtgatctt aattactttc agaatatttt caaaatagat atatttttaa aatccttaaa 660aaaaaaaaaa aaaa 674486276DNAHomo sapiens 48agtcggcatc catcagcggg cgggggtgtc gccgaacagg ctgctccgca gagcccgccg 60cgaccccgcg ccgccccgcc ccgcggcctg cctgccagag gagccgaggg ggccgcccct 120cgcccaacct gcccgacatg gggaaccccg ggcccaggcg tgctggtcac catgacaaca 180gagacaggcc ccgactctga ggtgaagaaa gctcaggagg aggccccgca gcagcccgag 240gctgctgccg ctgtgaccac ccctgtgacc cctgcaggcc acggccaccc agaggccaac 300tccaatgaga agcatccatc ccagcaggac acgcggcctg ctgaacagag cctagacatg 360gaggagaagg actacagtga ggccgatggc ctttcggaga ggaccacgcc cagcaaggcc 420cagaaatcgc cccagaagat tgccaagaaa tacaagagtg ccatctgccg ggtcactctg 480cttgatgcct cggagtatga gtgtgaggtg gagaaacatg gccggggcca ggtgctgttt 540gacctggtct gtgaacacct caacctccta gagaaggact acttcggcct gaccttctgt 600gatgctgaca gccagaagaa ctggctggac ccctccaagg agatcaagaa gcagatccgg 660agtagcccct ggaattttgc cttcacagtc aagttctacc cgcctgatcc tgcccagctg 720acagaagaca tcacaagata ctacctgtgc ctgcagctgc gggcagacat catcacgggc 780cggctgccat gctcctttgt cacgcatgcc ctactgggct cctacgctgt gcaggctgag 840ctgggtgact atgatgctga ggagcatgtg ggcaactatg tcagcgagct ccgcttcgcc 900cctaaccaga cccgggagct ggaggagagg atcatggagc tgcataagac atataggggg 960atgaccccgg gagaagcaga aatccacttc ttagagaatg ccaagaagct ttccatgtac 1020ggagtagacc tgcaccatgc caaggactct gagggcatcg acatcatgtt aggcgtttgt 1080gccaatggcc tgctcatcta ccgggaccgg ctgagaatca accgctttgc ctggcccaag 1140atcctcaaga tctcctacaa gaggagtaac ttctatatca agatccggcc tggggagtat 1200gagcaatttg agagcacaat tggctttaag ctcccaaacc accggtcagc caagagactg 1260tggaaggtct gcatcgagca tcatacattc ttccggctgg tgtcccctga gcccccaccc 1320aagggcttcc tggtgatggg ctccaagttc cggtacagtg ggaggaccca ggcacagact 1380cgccaggcca gcgccctcat tgaccggcct gcacccttct ttgagcgttc ttccagcaaa 1440cggtacacca tgtcccgcag ccttgatgga gcagagttct cccgcccagc ctcggtcagc 1500gagaaccatg atgcagggcc tgacggtgac aagcgggatg aggatggcga gtctgggggg 1560caacggtcag aggctgagga gggagaggtc aggactccaa ccaagatcaa ggagctaaag 1620ccggagcagg aaaccacgcc gagacacaag caggagttct tagacaagcc agaagatgtc 1680ttgctgaagc accaggccag catcaatgag ctcaaaagga ccctgaagga gcccaacagc 1740aaactcatcc accgggatcg agactgggaa cgggagcgca ggctgccctc ctcccccgcc 1800tccccctccc ccaagggcac ccctgagaaa gccaatgaga gagcagggct gagggagggc 1860tccgaggaga aagtcaaacc accacgtccc cgggccccag agagtgacac aggcgatgag 1920gaccaggacc aggagaggga cacggtgttc ctgaaggaca accacctggc cattgagcgc 1980aagtgctcca gcatcacggt cagctctacg tctagcctgg aggctgaggt ggacttcacg 2040gtcattggtg actaccatgg cagcgccttc gaagacttct cccgcagcct gcctgagctc 2100gaccgggaca aaagcgactc ggacactgag ggcctgctgt tctcccggga tctcaacaag 2160ggggccccca gccaggatga tgagtctggg ggcattgagg acagcccgga tcgaggggcc 2220tgctccaccc cggatatgcc ccagtttgag cccgtgaaaa cagaaaccat gactgtcagc 2280agtctggcca ttagaaagaa gattgagccg gaggccgtac tgcagaccag agtctccgct 2340atggataaca cccagcaggt tgatgggagt gcctcagtgg ggagggagtt catagcaacc 2400actccctcca tcaccacgga gaccatatcg accaccatgg agaacagtct caagtccggg 2460aagggggcag ctgccatgat cccaggccca cagacggtgg ccacggaaat ccgttctctt 2520tctccgatca tcgggaaaga tgtcctcacc agcacctacg gcgccactgc ggaaaccctc 2580tcaacctcca ccaccaccca tgtcaccaaa actgtgaaag gagggttttc tgagacaagg 2640atcgagaagc gaatcatcat tactggggat gaagatgtcg atcaagacca ggccctggct 2700ttggccatca aggaggccaa actgcagcat cctgatatgc tggtaaccaa agctgtcgta 2760tacagagaaa cagacccatc cccagaggag agggacaaga agccacagga atcctgacct 2820ctgtgaagag atcctggcat ttctggtcca acccaagcca gagaaccatt aagaaggggc 2880cttcattctg gattctccga cgcaacactg acgtcccagc tgcgacgtac tgtcactgat 2940gagagactgg gaagggaaaa gcatatatat atagatatat agagatatag atatatatac 3000aggaaacacc gcatccttgc actgctgctg gggctggcag agcagttggc tgacagcaac 3060aaccgacatc tgaacaccta catttccttt gcagacaaat tgaagaactg gtgggatttt 3120tttcaagaaa aaaaattata taataactat aatcccttgc tcaccccttt cccccgccaa 3180ataagaaacg caagccagac cacgatgatt gtagaagtcc ctcccgccct ggttctgcac 3240gttacagtta gcagacgagc aattccattt gttcttctcc agcatctcta aggcccactt 3300gaatgcaaag gaaaacactt gcacagcaaa gcaagagaag tcacagcagc aagacacgca 3360cagtcaacca ttttccgaga aaaaaagaaa attccccact tggaaagaaa gaggaggaac 3420actggattct tactttctgg atcttgacac tgggctgcaa aacctacctt cctctctccc 3480gcctcccctc accctcaact ctcaatgtct tgctgtcatt ttctgtctcg gctccctcct 3540cccccttccc ccttccccca ccccacaccc ttcaccctct gtgtcctggt ccttctgagg 3600gccactgcag atgactctcc tttgaaatga gaaaaagaaa agaaagcaag aacagaaaac 3660gaagccacag gaagggaagt agacattgta tgcttatggt ttctcattat gaaggtgcag 3720cttgtaggag gtttgtacgg atgtgctttg aagttatgta tattacatat aacaggaaaa 3780aatattaaaa taaacagtgc tggtaagtat gaagctgaca ttctaaaatt ataattatct 3840gactgtgatt gatgtatcct gaggttccta gatctcactg aactggccca gctaaggaga 3900cctggactct gggtgtgggt tggctcacag taggggctga cgggttcagt gtagtaatac 3960tgtgtgtggt gtttgtaatt ggttgattgg tggggagggg tggggggccc taatggagag 4020gtgtgggttt ggcaagaaag aagcaacaca gatgtcgtcc ccaaaatgcc agttcaagac 4080accttctccc tgcccccctg gtagtaacag tcagggcctg gtctgtgctc aggtactggg 4140tcccagtctg ggactctgct gctgaagttg ccacagtaga ggtccctggc ttagtcctta 4200tctccctacg gggcttgcct tggttttcag tcttctctct ctttctctct tttttttttt 4260tttgccacat tctgcccttc cctgacccca ttgtaataac caactccata tccaaaggga 4320ggtggtgctc tcagccattg tagaagatgg tggctttaac

ctgactgtct aaaaattccc 4380agctaagcct tttcctctac tctcttcctt gttctgaatc atttcttctt ctcaggccaa 4440agtagccatg gtaaggaggc ttcatggggc agaccctgaa agatcaaaac tgcatttgca 4500aagccctccc ctgtcccagg acaaagctga gactgacggg tgatgttgct cataggctcc 4560agctctgcat aagaccttgg cttggagacc tccctctcag tcaacagctg aactctgagc 4620ttgtgcccag aaattacccc aagaccacag gaacccttca agaagctccc atcacaagct 4680tggcattgct ctctgccaca cgtgggcttc ctcaggcttg tctgccacaa gctacttctc 4740tgagctcaga aagtgcccct tgatgaggga aaatgtccca ctgcactgcg aatttctcag 4800ttccatttta cctcccagtc ctccttctaa accagttaat aaattcattc cacaagtatt 4860tactgattac ctgcttgtgc cagggactat tctcaggctg aagaaggtgg gaggggaggg 4920cggaacctga ggagccacct gagccagctt tatatttcaa ccatggctgg cccatctgag 4980agcatctccc cactctcgcc aacctatcgg ggcatagccc agggatgccc ccaggcggcc 5040caggttagat gcgtcccttt ggcttgtcag tgatgacata caccttagct gcttagctgg 5100tgctggcctg aggcagggca ggaaatcaga atagcatttg cttctctggg caaatgggaa 5160gttcagcggg gcagcagaat cagtggcatt ccccctggtg caggccggtg ggtccactcc 5220aactccccct gagtgtagca gcacactttc catacaccag gttctttcta caatcctggt 5280ggaaaagcca cagaaccttc ttcctgccct tcttgagagt tccccctctt tctgggtcaa 5340gagctggagt ggtggctcca tcctctctgg gccacttcgg tctaggaact catctttgca 5400ggaaccagga gtcctgagca cactgaacac acctcagagg gaggatcctt gttgtggatt 5460ttgcacctgg ctttggggca ggggtgaagt gaccaggctt agcttgtgga gtttatgggc 5520caccagggtt tggggaaatc accatcccgc ggatgctgtg acctcccttc tacggagatg 5580caggcagtgc cacgagggag gaggggacct gcaaagctag aatctagggc actgtttcct 5640ccccatcctt ctctttgtag agaatagaga cgtttgtctt gtctgtcttc aacctacttt 5700tccttttctc ttttttgttt ctcatcctct ctgtgccacc tctccaccca ggaggccatg 5760tagcatagtg gaaaaagtcc ctgagggcgg ttaggagttc tgggtgacca tcctggctca 5820gctcctaact caccatgtga catcaggcta tccccattcc ccctcttggg cctcagtttc 5880ccgacttgca aaataagcag aaagaaccag atgctctcca gggtcttttt ctactttgct 5940atctcatggg tcttcatttt ctcttatttt gttttctctg gatcttttcc atctgagggt 6000acaggaagta ccaggacctg tttcagtttt tgaatcctgc aagcacattc caagactggc 6060ctgaaactgc atgagcaaca tcactcgaaa taattttttt tttcaaaagc accttaacaa 6120ccaattgcga tgctgtcctg ttccttttta ctcacaccct tctctccttt ctcgtcccca 6180tgctccccca cctcagtgct ccgtgctgta tgcgtgtgct ctctgttctt gtatactcaa 6240tataagtgaa ataaatgtgt ttgatgctga accata 627649982DNAHomo sapiens 49ggctcatcca cctgcagaca tggggcgcag aaagtcaaaa cgaaagccgc ctcccaagaa 60gaagatgaca ggcaccctcg agacccagtt cacctgcccc ttctgcaacc acgagaaatc 120ctgtgatgtg aaaatggacc gtgcccgcaa caccggagtc atctcttgta ccgtgtgcct 180agaggaattc cagacgccca taacgtatct gtcagaaccc gtggatgtgt acagtgattg 240gatagacgcc tgcgaggcgg ccaatcagta gcgacacaga ggacccgccc cctgagcagc 300cccgcgtact gtggatccag ctgttcggtt ctggtccaga gacattccag gggtccaggg 360tgtgggtcct gggctgtcac agccgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 420gtgtgtgtag tgggtgtgcg tgtgggtgtg ggtgtgagtg agtgtgggtg tgtgtggctg 480cacgtgtcac tggggtggcc gtgagtgtgt gctcacaggt acgcggtggt gtcgggttcc 540tgggcctgag gggcctgaac tgatctcact tggctccgaa agcctttgct gtgttccctg 600cagcccctgg ccccccagcc ttggggctct ggctcccccc ggcggaattg ggggactgtt 660tcctgacatc ctggacaagg gaagcccact agaggctgga acaggacctc tccagcctcc 720tcaccagcac cgtgcccatc tcaactggac ttcccgccct ccttctccac cttctagtgc 780ccgtggccgg ggattcaaag ccgccgttcc ccaggtccct gggctgggcc ctgacaggga 840gccgcccccc tccccatggt aaccaggaag cccgtttcat gttcagttgc ttttgtagag 900gaagcaaggg ctgggatggg gacagctgtc aatcacaagc ccttaaataa agcagccagc 960gcacaaaaaa aaaaaaaaaa aa 982501767DNAHomo sapiens 50gaaaggagca agccaggaag ccagacaaca acagcatcaa aacaaggctg tttctgtgtg 60tgaggaactt tgcctgggag ataaaattag acctagagct ttctgacagg gagtctgaag 120cgtgggacat ggaccgttca ctgggatggc aagggaattc tgtccctgag gacaggactg 180aagctgggat caagcgtttc ctggaggaca ccacggatga tggagaactg agcaagttcg 240tgaaggattt ctcaggaaat gcgagctgcc acccaccaga ggctaagacc tgggcatcca 300ggccccaagt cccggagcca aggccccagg ccccggacct ctatgatgat gacctggagt 360tcagaccccc ctcgcggccc cagtcctctg acaaccagca gtacttctgt gccccagccc 420ctctcagccc atctgccagg ccccgcagcc catggggcaa gcttgatccc tatgattcct 480ctgaggatga caaggagtat gtgggctttg caaccctccc caaccaagtc caccgaaagt 540ccgtgaagaa aggctttgac tttaccctca tggtggcagg agagtctggc ctgggcaaat 600ccacacttgt caatagcctc ttcctcactg atctgtaccg ggaccggaaa cttcttggtg 660ctgaagagag gatcatgcaa actgtggaga tcactaagca tgcagtggac atagaagaga 720agggtgtgag gctgcggctc accattgtgg acacaccagg ttttggggat gcagtcaaca 780acacagagtg ctggaagcct gtggcagaat acattgatca gcagtttgag cagtatttcc 840gagacgagag tggcctgaac cgaaagaaca tccaagacaa cagggtgcac tgctgcctgt 900acttcatctc acccttcggc catgggctcc ggccattgga tgttgaattc atgaaggccc 960tgcatcagcg ggtcaacatc gtgcctatcc tggctaaggc agacacactg acacctcccg 1020aagtggacca caagaaacgc aaaatccggg aggagattga gcattttgga atcaagatct 1080atcaattccc agactgtgac tctgatgagg atgaggactt caaattgcag gaccaagccc 1140taaaggaaag catcccattt gcagtaattg gcagcaacac tgtagtagag gccagagggc 1200ggcgagttcg gggtcgactc tacccctggg gcatcgtgga agtggaaaac ccagggcact 1260gcgactttgt gaagctgagg acaatgctgg tacgtaccca catgcaggac ctgaaggatg 1320tgacacggga gacacattat gagaactacc gggcacagtg catccagagc atgacccgcc 1380tggtggtgaa ggaacggaat cgcaacaaac tgactcggga aagtggtacc gacttcccca 1440tccctgctgt cccaccaggg acagatccag aaactgagaa gcttatccga gagaaagatg 1500aggagctgcg gcggatgcag gagatgctac acaaaataca aaaacagatg aaggagaact 1560attaactggc tttcagccct ggatatttaa atctcctcct cttcttcctg tccatgccgg 1620cccctcccag caccagctct gctcaggccc cttcagctac tgccacttcg ccttacatcc 1680ctgctgactg cccagagact cagaggaaat aaagtttaat aaatctgtag gtggctaaaa 1740aaaaaaaaaa aaaaaaaaaa aaaaaaa 176751339DNAHomo sapiensmisc_feature(1)..(1)n is a, c, g, or t 51naaatgttaa tagtaacttt tatttgaaag ttagggagat gaaaatacat ttccaaattc 60ttccaaagat atagctaaat gacaaaataa aaacttcact atgggccagg cgcggtgact 120cacgcctgta atcctagcac tttgggaggc cgaggcaggt ggatcacctg agagcaggag 180attgagacca gcctggccaa cttggtgaaa accctatctc tactaaaaaa tacaaaaact 240agccgngcat gatggcgtat gtttgtaaat ccccagctac ttngggacat taagggcaga 300agggatccgc tttgaacctc agggnggcca gaggtttac 33952453DNAHomo sapiensmisc_feature(453)..(453)n is a, c, g, or t 52ggtggggggg gggggtgttt aaaaaatccc tcaaatataa caatgaagca tgcttttcta 60acacaaagag taccaaaatg aatgtgctac tttctgttaa agttttattt ccagagcttg 120cccaagcaag aatctacttg ccctgtaaaa ttctgcttat acagaattaa aactccttta 180ttatcccaca aatacattat atatttccat agctttcttt agcccataca cttcttctta 240agtgttcaac tttcaaatct ctgataaaat gaaactcatc atgaagacca gtcaaaatgc 300taaaggaaac cttccttaat ctactttgca attactgttc ctttcagtta ctccctacct 360gcgcctgcca tgaatttttg tttttgtgtt ggtctattct ggactagtgg gctctacaat 420gagggatgcg tatctggaat accgagagct ttn 453531051DNAHomo sapiens 53ggaccgccgc ctggttaaag gcgcttattt cccaggcagc cgctgcagtc gccacacctt 60tgcccctgct gcgatgaccc tgtcgccact tctgctgttc ctgccaccgc tgctgctgct 120gctggacgtc cccacggcgg cggtgcaggc gtcccctctg caagcgttag acttctttgg 180gaatgggcca ccagttaact acaagacagg caatctatac ctgcgggggc ccctgaagaa 240gtccaatgca ccgcttgtca atgtgaccct ctactatgaa gcactgtgcg gtggctgccg 300agccttcctg atccgggagc tcttcccaac atggctgttg gtcatggaga tcctcaatgt 360cacgctggtg ccctacggaa acgcacagga acaaaatgtc agtggcaggt gggagttcaa 420gtgccagcat ggagaagagg agtgcaaatt caacaaggtg gaggcctgcg tgttggatga 480acttgacatg gagctagcct tcctgaccat tgtctgcatg gaagagtttg aggacatgga 540gagaagtctg ccactatgcc tgcagctcta cgccccaggg ctgtcgccag acactatcat 600ggagtgtgca atgggggacc gcggcatgca gctcatgcac gccaacgccc agcggacaga 660tgctctccag ccaccacacg agtatgtgcc ctgggtcacc gtcaatggga aacccttgga 720agatcagacc cagctcctta cccttgtctg ccagttgtac cagggcaaga agccggatgt 780ctgcccttcc tcaaccagct ccctcaggag tgtttgcttc aagtgatggc cggtgagctg 840cggagagctc atggaaggcg agtgggaacc cggctgcctg cctttttttc tgatccagac 900cctcggcacc tgctacttac caactggaaa attttatgca tcccatgaag cccagataca 960caaaattcca ccccatgatc aagaatcctg ctccactaag aatggtgcta aagtaaaact 1020agtttaataa gcaaaaaaaa aaaaaaaaaa a 105154340DNAHomo sapiensmisc_feature(49)..(49)n is a, c, g, or t 54ggcacgagca taccccattt ttgagctttc tttgagggcc aactttttnc tctaaaacca 60gccagggcat gcttttccct caccagctct ganttcttcc aggctaggca actggaaaag 120cctggnctta gaaactgctt tnttggctta cggcccagct gagctgacca aaatagccaa 180gagaaagact gtttgcacag tgtgaaattc ctccagggga aataccatag ncaaaaagcc 240aaganagcca gnacccacgn atggncaggg aacccacagg gcaaaaaaag gccgagttac 300ccccaaggnc cggggtttgt gggagatggg aggcctaggt 340551760DNAHomo sapiens 55atttctttat aaaccacaac tctgggcccg caatggcagt ccactgcctt gctgcagtca 60cagaatggaa atctgcagag gcctccgcag tcacctaatc actctcctcc tcttcctgtt 120ccattcagag acgatctgcc gaccctctgg gagaaaatcc agcaagatgc aagccttcag 180aatctgggat gttaaccaga agaccttcta tctgaggaac aaccaactag ttgctggata 240cttgcaagga ccaaatgtca atttagaaga aaagatagat gtggtaccca ttgagcctca 300tgctctgttc ttgggaatcc atggagggaa gatgtgcctg tcctgtgtca agtctggtga 360tgagaccaga ctccagctgg aggcagttaa catcactgac ctgagcgaga acagaaagca 420ggacaagcgc ttcgccttca tccgctcaga cagcggcccc accaccagtt ttgagtctgc 480cgcctgcccc ggttggttcc tctgcacagc gatggaagct gaccagcccg tcagcctcac 540caatatgcct gacgaaggcg tcatggtcac caaattctac ttccaggagg acgagtagta 600ctgcccaggc ctgcctgttc ccattcttgc atggcaagga ctgcagggac tgccagtccc 660cctgccccag ggctcccggc tatgggggca ctgaggacca gccattgagg ggtggaccct 720cagaaggcgt cacaagaacc tggtcacagg actctgcctc ctcttcaact gaccagcctc 780catgctgcct ccagaatggt ctttctaatg tgtgaatcag agcacagcag cccctgcaca 840aagcccttcc atgtcgcctc tgcattcagg atcaaacccc gaccacctgc ccaacctgct 900ctcctcttgc cactgcctct tcctccctca ttccaccttc ccatgccctg gatccatcag 960gccacttgat gacccccaac caagtggctc ccacaccctg ttttacaaaa aagaaaagac 1020cagtccatga gggaggtttt taagggtttg tggaaaatga aaattaggat ttcatgattt 1080ttttttttca gtccccgtga aggagagccc ttcatttgga gattatgttc tttcggggag 1140aggctgagga cttaaaatat tcctgcattt gtgaaatgat ggtgaaagta agtggtagct 1200tttcccttct ttttcttctt tttttgtgat gtcccaactt gtaaaaatta aaagttatgg 1260tactatgtta gccccataat tttttttttc cttttaaaac acttccataa tctggactcc 1320tctgtccagg cactgctgcc cagcctccaa gctccatctc cactccagat tttttacagc 1380tgcctgcagt actttacctc ctatcagaag tttctcagct cccaaggctc tgagcaaatg 1440tggctcctgg gggttctttc ttcctctgct gaaggaataa attgctcctt gacattgtag 1500agcttctggc acttggagac ttgtatgaaa gatggctgtg cctctgcctg tctcccccac 1560cgggctggga gctctgcaga gcaggaaaca tgactcgtat atgtctcagg tccctgcagg 1620gccaagcacc tagcctcgct cttggcaggt actcagcgaa tgaatgctgt atatgttggg 1680tgcaaagttc cctacttcct gtgacttcag ctctgtttta caataaaatc ttgaaaatgc 1740ctaaaaaaaa aaaaaaaaaa 176056584DNAHomo sapiens 56cacctgcacc ccgcccgggc atagcaccat gcctgcttgt cgcctaggcc cgctagccgc 60cgccctcctc ctcagcctgc tgctgttcgg cttcacccta gtctcaggca caggagcaga 120gaagactggc gtgtgccccg agctccaggc tgaccagaac tgcacgcaag agtgcgtctc 180ggacagcgaa tgcgccgaca acctcaagtg ctgcagcgcg ggctgtgcca ccttctgcct 240tctctgccca aatgataagg agggttcctg cccccaggtg aacattaact ttccccagct 300cggcctctgt cgggaccagt gccaggtgga cagccagtgt cctggccaga tgaaatgctg 360ccgcaatggc tgtgggaagg tgtcctgtgt cactcccaat ttctgaggtc cagccaccac 420caggctgagc agtgaggaga gaaagtttct gcctggccct gcatctggtt ccagcccacc 480tgccctcccc tttttcggga ctctgtattc cctcttgggc tgaccacagc ttctcccttt 540cccaaccaat aaagtaacca ctttcagcaa aaaaaaaaaa aaaa 584571330DNAHomo sapiens 57gcagcccagc caagcactgt caggaatcct gtgaagcagc tccagctatg tgtgaagaag 60aggacagcac tgccttggtg tgtgacaatg gctctgggct ctgtaaggcc ggctttgctg 120gggacgatgc tcccagggct gttttcccat ccattgtggg acgtcccaga catcaggggg 180tgatggtggg aatgggacaa aaagacagct acgtgggtga cgaagcacag agcaaaagag 240gaatcctgac cctgaagtac ccgatagaac atggcatcat caccaactgg gacgacatgg 300aaaagatctg gcaccactct ttctacaatg agcttcgtgt tgcccctgaa gagcatccca 360ccctgctcac ggaggcaccc ctgaacccca aggccaaccg ggagaaaatg actcaaatta 420tgtttgagac tttcaatgtc ccagccatgt atgtggctat ccaggcggtg ctgtctctct 480atgcctctgg acgcacaact ggcatcgtgc tggactctgg agatggtgtc acccacaatg 540tccccatcta tgagggctat gccttgcccc atgccatcat gcgtctggat ctggctggcc 600gagatctcac tgactacctc atgaagatcc tgactgagcg tggctattcc ttcgttacta 660ctgctgagcg tgagattgtc cgggacatca aggagaaact gtgttatgta gctctggact 720ttgaaaatga gatggccact gccgcatcct catcctccct tgagaagagt tacgagttgc 780ctgatgggca agtgatcacc atcggaaatg aacgtttccg ctgcccagag accctgttcc 840agccatcctt catcgggatg gagtctgctg gcatccatga aaccacctac aacagcatca 900tgaagtgtga tattgacatc aggaaggacc tctatgctaa caatgtccta tcagggggca 960ccactatgta ccctggcatt gccgaccgaa tgcagaagga gatcacggcc ctagcaccca 1020gcaccatgaa gatcaagatc attgcccctc cggagcgcaa atactctgtc tggatcggtg 1080gctccatcct ggcctctctg tccaccttcc agcagatgtg gatcagcaaa caggaatacg 1140atgaagccgg gccttccatt gtccaccgca aatgcttcta aaacactttc ctgctcctct 1200ctgtctctag cacacaactg tgaatgtcct gtggaattat gccttcagtt cttttccaaa 1260tcattcctag ccaaagctct gactcgttac ctatgtgttt tttaataaat ctgaaatagg 1320ctactggtaa 1330582743DNAHomo sapiens 58gcgggccgtt atccatttgt gttgttcgcc agctaggcct ggcctcgtcc cgcttcgctc 60ggtcggtctc gcgcgccccc atagccttgc tagagggtta gcgttagcct taagtgtgcg 120aatccgagga gcagcgacag actcgagacc acgctccttc ctcgggaagg aggcggcacc 180tcgcgtttga ggcccgcctg cgtttgaggc ccgcctgcgc ttgcggcccg cctgcgcttg 240aggcctgtct gcgtttgaga tctcattggg cgtgattgag gaatttgggg aggtttttgg 300gcggtattga ggacgagggg gtccgttagt cagcatagaa tcctggagcg ggaatccctc 360accgtctaaa tggcgtcggg ggcgggacct ccgggatctg gcttccgcgg gccgccgccg 420gccctgaaac gtgagggata gctgagatga ggcagctact gggatggccc ccatgcgcat 480ttacatgcag tccgactgcc gagctttcga ggcagcagga tttaccgtcc acattcctca 540ctactaacca agcttttaga acagatctca caagaaccta gaggtcggta ttttttcgat 600ttaaatttgc ctgttactga cgttaacgtc tttcgcctag tgagcagtag ccaacatgtc 660agggtgggag tcatattaca aaaccgaggg cgatgaagaa gcagaggaag aacaagaaga 720gaaccttgaa gcaagtggag actataaata ttcaggaaga gatagtttga tttttttggt 780tgatgcctcc aaggctatgt ttgaatctca gagtgaagat gagttgacac cttttgacat 840gagcatccag tgtatccaaa gtgtgtacat cagtaagatc ataagcagtg atcgagatct 900cttggctgtg gtgttctatg gtaccgagaa agacaaaaat tcagtgaatt ttaaaaatat 960ttacgtctta caggagctgg ataatccagg tgcaaaacga attctagagc ttgaccagtt 1020taaggggcag cagggacaaa aacgtttcca agacatgatg ggccacggat ctgactactc 1080actcagtgaa gtgctgtggg tctgtgccaa cctctttagt gatgtccaat tcaagatgag 1140tcataagagg atcatgctgt tcaccaatga agacaacccc catggcaatg acagtgccaa 1200agccagccgg gccaggacca aagccggtga tctccgagat acaggcatct tccttgactt 1260gatgcacctg aagaaacctg ggggctttga catatccttg ttctacagag atatcatcag 1320catagcagag gatgaggacc tcagggttca ctttgaggaa tccagcaagc tagaagacct 1380gttgcggaag gttcgcgcca aggagaccag gaagcgagca ctcagcaggt taaagctgaa 1440gctcaacaaa gatatagtga tctctgtggg catttataat ctggtccaga aggctctcaa 1500gcctcctcca ataaagctct atcgggaaac aaatgaacca gtgaaaacca agacccggac 1560ctttaataca agtacaggcg gtttgcttct gcctagcgat accaagaggt ctcagatcta 1620tgggagtcgt cagattatac tggagaaaga ggaaacagaa gagctaaaac ggtttgatga 1680tccaggtttg atgctcatgg gtttcaagcc gttggtactg ctgaagaaac accattacct 1740gaggccctcc ctgttcgtgt acccagagga gtcgctggtg attgggagct caaccctgtt 1800cagtgctctg ctcatcaagt gtctggagaa ggaggttgca gcattgtgca gatacacacc 1860ccgcaggaac atccctcctt attttgtggc tttggtgcca caggaagaag agttggatga 1920ccagaaaatt caggtgactc ctccaggctt ccagctggtc tttttaccct ttgctgatga 1980taaaaggaag atgcccttta ctgaaaaaat catggcaact ccagagcagg tgggcaagat 2040gaaggctatc gttgagaagc ttcgcttcac atacagaagt gacagctttg agaaccccgt 2100gctgcagcag cacttcagga acctggaggc cttggccttg gatttgatgg agccggaaca 2160agcagtggac ctgacattgc ccaaggttga agcaatgaat aaaagactgg gctccttggt 2220ggatgagttt aaggagcttg tttacccacc agattacaat cctgaaggga aagttaccaa 2280gagaaaacac gataatgaag gttctggaag caaaaggccc aaggtggagt attcagaaga 2340ggagctgaag acccacatca gcaagggtac gctgggcaag ttcactgtgc ccatgctgaa 2400agaggcctgc cgggcttacg ggctgaagag tggtctgaag aagcaggagc tgctggaagc 2460cctcaccaag cacttccagg actgaccaga ggccgcgcgt ccagctgccc ttccgcagtg 2520tggccaggct gcctggcctt gtcctcagcc agttaaaatg tgtttctcct gagctaggaa 2580gagtctaccc gacataagtc gagggacttt atgtttttga ggctttctgt tgccatggtg 2640atggtgtagc cctcccactt tgctgttctt tactttactg cctgaataaa gagccctaag 2700tttgtactaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 2743591826DNAHomo sapiens 59agtatgtgtg gttggggaat tcatgtggag gtcagagtgg aagcaggtgt gagagggtcc 60agcagaagga aacatggctg ccaaagtgtt tgagtccatt ggcaagtttg gcctggcctt 120agctgttgca ggaggcgtgg tgaactctgc cttatataat gtggatgctg ggcacagagc 180tgtcatcttt gaccgattcc gtggagtgca ggacattgtg gtaggggaag ggactcattt 240tctcatcccg tgggtacaga aaccaattat ctttgactgc cgttctcgac cacgtaatgt 300gccagtcatc actggtagca aagatttaca gaatgtcaac atcacactgc gcatcctctt 360ccggcctgtc gccagccagc ttcctcgcat cttcaccagc atcggagagg actatgatga 420gcgtgtgctg ccgtccatca caactgagat cctcaagtca gtggtggctc gctttgatgc 480tggagaacta atcacccaga gagagctggt ctccaggcag gtgagcgacg accttacaga 540gcgagccgcc acctttgggc tcatcctgga tgacgtgtcc ttgacacatc tgaccttcgg 600gaaggagttc acagaagcgg tggaagccaa acaggtggct cagcaggaag cagagagggc 660cagatttgtg gtggaaaagg ctgagcaaca gaaaaaggcg gccatcatct ctgctgaggg 720cgactccaag gcagctgagc tgattgccaa ctcactggcc actgcagggg atggcctgat 780cgagctgcgc aagctggaag ctgcagagga catcgcgtac cagctctcac gctctcggaa 840catcacctac ctgccagcgg ggcagtccgt gctcctccag ctgccccagt gagggcccac 900cctgcctgca cctccgcggg ctgactgggc cacagccccg atgattctta acacagcctt 960ccttctgctc ccaccccaga aatcactgtg aaatttcatg attggcttaa agtgaaggaa 1020ataaaggtaa aatcacttca gatctctaat tagtctatca aatgaaactc tttcattctt 1080ctcacatcca tctacttttt tatccacctc cctaccaaaa attgccaagt gcctatgcaa

1140accagcttta ggtcccaatt cggggcctgc tggagttccg gcctgggcac cagcatttgg 1200cagcacgcag gcggggcagt atgtgatgga ctggggagca caggtgtctg cctagatcca 1260cgtgtggcct ccgtcctgtc actgatggaa ggtttgcgga tgagggcatg tgcggctgaa 1320ctgagaaggc aggcctccgt cttcccagcg gttcctgtgc agatgctgct gaagagaggt 1380gccggggagg ggcagagagg aagtggtctg tctgttacca taagtctgat tctctttaac 1440tgtgtgacca gcggaaacag gtgtgtgtga actgggcaca gattgaagaa tctgcccctg 1500ttgaggtggg tgggcctgac tgttgccccc cagggtccta aaacttggat ggacttgtat 1560agtgagagag gaggcctgga ccgagatgtg agtcctgttg aagacttcct ctctaccccc 1620caccttggtc cctctcagat acccagtgga attccaactt gaaggattgc atcctgctgg 1680ggctgaacat gcctgccaaa gacgtgtccg acctacgttc ctggccccct cgttcagaga 1740ctgcccttct cacgggctct atgcctgcac tgggaaggaa acaaatgtgt ataaactgct 1800gtcaataaat gacacccaga ccttcc 1826604322DNAHomo sapiens 60cccccagagg cgccggagcc cggaatcccg ctcggagcca gccagccgtc ccgagctacc 60agcaggtttc attgaaaaca gatcctgcaa aagttccagg tgcccacact ggaaacttgg 120agatcctgct tcccagacca cagctgtggg gaacttgggg tggagcagag aagtttctgt 180attcagctgc ccaggcagag gagaatgggg tctccacagc ctgaagaatg aagacacgac 240agaataaaga ctcgatgtca atgaggagtg gacggaagaa agaggcccct gggccccggg 300aagaactgag atcgaggggc cgggcctccc ctggaggggt cagcacgtcc agcagtgatg 360gcaaagctga gaagtccagg cagacagcca agaaggcccg agtagaggaa gcctccaccc 420caaaggtcaa caagcagggt cggagtgagg agatctcaga gagtgaaagt gaggagacca 480atgcaccaaa aaagaccaaa actgaggaac tccctcggcc acagtctccc tccgatctgg 540atagcttgga cgggcggagc cttaatgatg atggcagcag cgaccctagg gatatcgacc 600aggacaaccg aagcacgtcc cccagtatct acagccctgg aagtgtggag aatgactctg 660actcatcttc tggcctgtcc cagggcccag cccgccccta ccacccacct ccactctttc 720ctccttcccc tcaaccgcca gacagcaccc ctcgacagcc agaggctagc tttgaacccc 780atccttctgt gacacccact ggatatcatg ctcccatgga gccccccaca tctcgaatgt 840tccaggctcc tcctggggcc cctccccctc acccacagct ctatcccggg ggcactggtg 900gagttttgtc tggaccccca atgggtccca aggggggagg ggctgcctca tcagtggggg 960gccctaatgg gggtaagcag caccccccac ccactactcc catttcagta tcaagctctg 1020gggctagtgg tgctccccca acaaagccgc ctaccactcc agtgggtggt gggaacctac 1080cttctgctcc accaccagcc aacttccccc atgtgacacc gaacctgcct cccccacctg 1140ccctgagacc cctcaacaat gcatcagcct ctccccctgg cctgggggcc caaccactac 1200ctggtcatct gccctctccc cacgccatgg gacagggtat cggtggactt cctcctggcc 1260cagagaaggg cccaactctg gctccttcac cccactctct gcctcctgct tcctcttctg 1320ctccagcgcc ccccatgagg tttccttatt catcctctag tagtagctct gcagcagcct 1380cctcttccag ttcttcctcc tcttcctctg cctccccctt cccagcttcc caggcattgc 1440ccagctaccc ccactctttc cctcccccaa caagcctctc tgtctccaat cagcccccca 1500agtatactca gccttctctc ccatcccagg ctgtgtggag ccagggtccc ccaccacctc 1560ctccctatgg ccgcctctta gccaacagca atgcccatcc aggccccttc cctccctcta 1620ctggggccca gtccaccgcc cacccaccag tctcaacaca tcaccatcac caccagcaac 1680agcaacagca gcagcagcag cagcagcagc agcagcatca cggaaactct gggccccctc 1740ctcctggagc atttccccac ccactggagg gcggtagctc ccaccacgca cacccttacg 1800ccatgtctcc ctccctgggg tctctgaggc cctacccacc agggccagca cacctgcccc 1860cacctcacag ccaggtgtcc tacagccaag caggccccaa tggccctcca gtctcttcct 1920cttccaactc ttcctcttcc acttctcaag ggtcctaccc atgttcacac ccctcccctt 1980cccagggccc tcaaggggcg ccctaccctt tcccaccggt gcctacggtc accacctctt 2040cggctaccct ttccacggtc attgccaccg tggcttcctc gccagcaggc tacaaaacgg 2100cctccccacc tgggccccca ccgtacggaa agagagcccc gtccccgggg gcctacaaga 2160cagccacccc acccggatac aaacccgggt cgcctccctc cttccgaacg gggaccccac 2220cgggctatcg aggaacctcg ccacctgcag gcccagggac cttcaagccg ggctcgccca 2280ccgtgggacc tgggcccctg ccacctgcgg ggccctcagg cctgccatcg ctgccaccac 2340cacctgcggc ccctgcctca gggccgcccc tgagcgccac gcagatcaaa caggagccgg 2400ctgaggagta tgagaccccc gagagcccgg tgcccccagc ccgcagcccc tcgccccctc 2460ccaaggtggt agatgtaccc agccatgcca gtcagtctgc caggttcaac aaacacctgg 2520atcgcggctt caactcgtgc gcgcgcagcg acctgtactt cgtgccactg gagggctcca 2580agctggccaa gaagcgggcc gacctggtgg agaaggtgcg gcgcgaggcc gagcagcgcg 2640cgcgcgaaga aaaggagcgc gagcgcgagc gggaacgcga gaaagagcgc gagcgcgaga 2700aggagcgcga gcttgaacgc agcgtgaagt tggctcagga gggccgtgct ccggtggaat 2760gcccatctct gggcccagtg ccccatcgcc ctccatttga accgggcagt gcggtggcta 2820cagtgccccc ctacctgggt cctgacactc cagccttgcg cactctcagt gaatatgccc 2880ggcctcatgt catgtctcct ggcaatcgca accatccatt ctacgtgccc ctgggggcag 2940tggacccggg gctcctgggt tacaatgtcc cggccctgta cagcagtgat ccagctgccc 3000gggagaggga acgggaagcc cgtgaacgag acctccgtga ccgcctcaag cctggctttg 3060aggtgaagcc tagtgagctg gaacccctac atggggtccc tgggccgggc ttggatccct 3120ttccccgaca tgggggcctg gctctgcagc ctggcccacc tggcctgcac cctttcccct 3180ttcatccgag cctggggccc ctggagcgag aacgtctagc gctggcagct gggccagccc 3240tgcggcctga catgtcctat gctgagcggc tggcagctga gaggcagcac gcagaaaggg 3300tggcggccct gggcaatgac ccactggccc ggctgcagat gctcaatgtg actccccatc 3360accaccagca ctcccacatc cactcgcacc tgcacctgca ccagcaagat gctatccatg 3420cagcctctgc ctcggtgcac cctctcattg accccctggc ctcagggtct caccttaccc 3480ggatccccta cccagctgga actctcccta accccctgct tcctcaccct ctgcacgaga 3540acgaagttct tcgtcaccag ctctttgctg ccccttaccg ggacctgccg gcctcccttt 3600ctgccccgat gtcagcagct catcagctgc aggccatgca cgcacagtca gctgagctgc 3660agcgcttggc gctggaacag cagcagtggc tgcatgccca tcacccgctg cacagtgtgc 3720cgctgcctgc ccaggaggac tactacagtc acctgaagaa ggaaagcgac aagccactgt 3780agaacctgcg atcaagagag caccatggct cctacattgg accttggagc acccccaccc 3840tccccccacc gtgcccttgg cctgccaccc agagccaaga gggtgctgct cagttgcagg 3900gcctccgcag ctggacagag agtgggggag ggagggacag acagaaggcc aaggcccgat 3960gtggtgtgca gaggtgggga ggtggcgagg atggggacag aaagcgcaca gaatcttgga 4020ccaggtctct cttccttgtc ccccctgctt ttctcctccc ccatgcccaa cccctgtggc 4080cgccgcccct cccctgcccc gttggtgtga ttatttcatc tgttagatgt ggctgttttg 4140cgtagcatcg tgtgccaccc ctgcccctcc ccgatccctg tgtgcgcgcc ccctctgcaa 4200tgtatgcccc ttgccccttc cccacactaa taatttatat atataaatat ctatatgacg 4260ctcttaaaaa aacatcccaa ccaaaaccaa ccaaacaaaa acatcctcac aactccccag 4320ga 4322613088DNAHomo sapiens 61acaaaaaagc ttttacgagg tatcagcact tttctttcat tagggggaag gcgtgaggaa 60agtaccaaac agcagcggag ttttaaactt taaatagaca ggtctgagtg cctgaacttg 120ccttttcatt ttacttcatc ctccaaggag ttcaatcact tggcgtgact tcactacttt 180taagcaaaag agtggtgccc aggcaacatg ggtgactgga gcgccttagg caaactcctt 240gacaaggttc aagcctactc aactgctgga gggaaggtgt ggctgtcagt acttttcatt 300ttccgaatcc tgctgctggg gacagcggtt gagtcagcct ggggagatga gcagtctgcc 360tttcgttgta acactcagca acctggttgt gaaaatgtct gctatgacaa gtctttccca 420atctctcatg tgcgcttctg ggtcctgcag atcatatttg tgtctgtacc cacactcttg 480tacctggctc atgtgttcta tgtgatgcga aaggaagaga aactgaacaa gaaagaggaa 540gaactcaagg ttgcccaaac tgatggtgtc aatgtggaca tgcacttgaa gcagattgag 600ataaagaagt tcaagtacgg tattgaagag catggtaagg tgaaaatgcg aggggggttg 660ctgcgaacct acatcatcag tatcctcttc aagtctatct ttgaggtggc cttcttgctg 720atccagtggt acatctatgg attcagcttg agtgctgttt acacttgcaa aagagatccc 780tgcccacatc aggtggactg tttcctctct cgccccacgg agaaaaccat cttcatcatc 840ttcatgctgg tggtgtcctt ggtgtccctg gccttgaata tcattgaact cttctatgtt 900ttcttcaagg gcgttaagga tcgggttaag ggaaagagcg acccttacca tgcgaccagt 960ggtgcgctga gccctgccaa agactgtggg tctcaaaaat atgcttattt caatggctgc 1020tcctcaccaa ccgctcccct ctcgcctatg tctcctcctg ggtacaagct ggttactggc 1080gacagaaaca attcttcttg ccgcaattac aacaagcaag caagtgagca aaactgggct 1140aattacagtg cagaacaaaa tcgaatgggg caggcgggaa gcaccatctc taactcccat 1200gcacagcctt ttgatttccc cgatgataac cagaattcta aaaaactagc tgctggacat 1260gaattacagc cactagccat tgtggaccag cgaccttcaa gcagagccag cagtcgtgcc 1320agcagcagac ctcggcctga tgacctggag atctagatac aggcttgaaa gcatcaagat 1380tccactcaat tgtggagaag aaaaaaggtg ctgtagaaag tgcaccaggt gttaattttg 1440atccggtgga ggtggtactc aacagcctta ttcatgaggc ttagaaaaca caaagacatt 1500agaataccta ggttcactgg gggtgtatgg ggtagatggg tggagaggga ggggataaga 1560gaggtgcatg ttggtattta aagtagtgga ttcaaagaac ttagattata aataagagtt 1620ccattaggtg atacatagat aagggctttt tctccccgca aacaccccta agaatggttc 1680tgtgtatgtg aatgagcggg tggtaattgt ggctaaatat ttttgtttta ccaagaaact 1740gaaataattc tggccaggaa taaatacttc ctgaacatct taggtctttt caacaagaaa 1800aagacagagg attgtcctta agtccctgct aaaacattcc attgttaaaa tttgcacttt 1860gaaggtaagc tttctaggcc tgaccctcca ggtgtcaatg gacttgtgct actatatttt 1920tttattcttg gtatcagttt aaaattcaga caaggcccac agaataagat tttccatgca 1980tttgcaaata cgtatattct ttttccatcc acttgcacaa tatcattacc atcacttttt 2040catcattcct cagctactac tcacattcat ttaatggttt ctgtaaacat ttttaagaca 2100gttgggatgt cacttaacat tttttttttt tgagctaaag tcagggaatc aagccatgct 2160taatatttaa caatcactta tatgtgtgtc gaagagtttg ttttgtttgt catgtattgg 2220tacaagcaga tacagtataa actcacaaac acagatttga aaataatgca catatggtgt 2280tcaaatttga acctttctca tggatttttg tggtgtgggc caatatggtg tttacattat 2340ataattcctg ctgtggcaag taaagcacac tttttttttc tcctaaaatg tttttccctg 2400tgtatcctat tatggatact ggttttgtta attatgattc tttattttct ctcctttttt 2460taggatatag cagtaatgct attactgaaa tgaatttcct ttttctgaaa tgtaatcatt 2520gatgcttgaa tgatagaatt ttagtactgt aaacaggctt tagtcattaa tgtgagagac 2580ttagaaaaaa tgcttagagt ggactattaa atgtgcctaa atgaattttg cagtaactgg 2640tattcttggg ttttcctact taatacacag taattcagaa cttgtattct attatgagtt 2700tagcagtctt ttggagtgac cagcaacttt gatgtttgca ctaagatttt atttggaatg 2760caagagaggt tgaaagagga ttcagtagta cacatacaac taatttattt gaactatatg 2820ttgaagacat ctaccagttt ctccaaatgc cttttttaaa actcatcaca gaagattggt 2880gaaaatgctg agtatgacac ttttcttctt gcatgcatgt cagctacata aacagttttg 2940tacaatgaaa attactaatt tgtttgacat tccatgttaa actacggtca tgttcagctt 3000cattgcatgt aatgtagacc tagtccatca gatcatgtgt tctggagagt gttctttatt 3060caataaagtt ttaatttagt ataaacat 3088622828DNAHomo sapiens 62gcgctacggc ggacccggct gggcagttcc ttccccagaa ggagagattc ctctgccatg 60gagtcctacg atgtgatcgc caaccagcct gtcgtgatcg acaacggatc cggtgtgatt 120aaagctggtt ttgctggtga tcagatcccc aaatactgct ttccaaacta tgtgggccga 180cccaagcacg ttcgtgtcat ggcaggagcc cttgaaggcg acatcttcat tggccccaaa 240gctgaggagc accgagggct gctttcaatc cgctatccca tggagcatgg catcgtcaag 300gattggaacg acatggaacg catttggcaa tatgtctatt ctaaggacca gctgcagact 360ttctcagagg agcatcctgt gctcctgact gaggcgcctt taaacccacg aaaaaaccgg 420gaacgagctg ccgaagtttt cttcgagacc ttcaatgtgc ccgctctttt catctccatg 480caagctgtac tcagccttta cgctacaggc aggaccacag gggtggtgct ggattctggg 540gatggagtca cccatgctgt gcccatctat gagggctttg ccatgcccca ctccatcatg 600cgcatcgaca tcgcgggccg ggacgtctct cgcttcctgc gcctctacct gcgtaaggag 660ggctacgact tccactcatc ctctgagttt gagattgtca aggccataaa agaaagagcc 720tgttacctat ccataaaccc ccaaaaggat gagacgctag agacagagaa agctcagtac 780tacctgcctg atggcagcac cattgagatt ggtccttccc gattccgggc ccctgagttg 840ctcttcaggc cagatttgat tggagaggag agtgaaggca tccacgaggt cctggtgttc 900gccattcaga agtcagacat ggacctgcgg cgcacgcttt tctctaacat tgtcctctca 960ggaggctcta ccctgttcaa aggttttggt gacaggctcc tgagtgaagt gaagaaacta 1020gctccaaaag atgtgaagat caggatatct gcacctcagg agagactgta ttccacgtgg 1080attgggggct ccatccttgc ctccctggac acctttaaga agatgtgggt ctccaaaaag 1140gaatatgagg aagacggtgc ccgatccatc cacagaaaaa ccttctaatg tcgggacatc 1200atcttcacct ctctctgaag ttaactccac tttaaaactc gctttcttga gtcggagtgt 1260ttgcgaggaa ctgcctgtgt gtgagtgcgt gtgtggatat gagtgtgtgc gcacatgcga 1320gtgccgtgtg gccctgggac cctgggccca gaaaggacga tgaactaccc gcagtggtga 1380tgcctgaggc ctggggttga ccactaactg gctcctgaca gggaagagcg ctggcagagg 1440ctgtgctccc tcctcaggtg gcctctggct ggctgtgggg gactccgttt actaccacag 1500ggagacagag ggaggtaagc catcccccgg gagaccttgc tgctgaccat cctaggctgg 1560gctggcccac cctcaccccc acccccaggg tgccctgagg ccccaggcag ctgctgcctc 1620cactatcgat gcctcctgac tgcacactga ggactgggac tggggttgag ttctgtctgg 1680ttttgttgcc attttggttt gggaggctgg aaaagcaccc caagaagcta ttacagagac 1740tggagtcagg agagagcagg aggccctcat gttcaccagg gaacaggacc acaccggcca 1800ctgaaggagg gcaggagcag tcctccctct gaatggctgc agagttaatg ttcccagccc 1860agtccccttt cgggggcctt gggagagttt aaggcacctg ctggttccag gacctcgctt 1920tccatctgtt cttgttgcaa tgccatcttc aaaccgtttt atttattgaa gtgtttgttc 1980agttaggggc tggagagagg gagcttgctg cctcctgcct tgctacacta atgtttacag 2040cacctaagct tagcctccag ggccccacct ctcccagctg atggtgagct gacagtgtcc 2100acaggttcca ggaccatttg agattggaag ctacactcaa agacactccc accaggctct 2160ttctcccttt tcctcttctc actgccctgg aatcaacagg ctggttgctg gttagatttt 2220ctgaaacagg aggtaaaatt tttctttggc agaggcccct aagcaaggga ggggtgttgg 2280agagccagtg cccttaagac tggagaaagc tgcaatttac caagttgcct tttgccactg 2340tagctgacca ggggactagg ttgtagaggt gggaaggccc ctctgggctg atcttgtgcc 2400attcttgacc ttggacctgc ttggttaagg agggagtggg ccagaccaga gtgccaggag 2460ctaatggagc caggcctgac acctaggagt ggtccaaagc cttcagccta gatggtgcaa 2520agctggggcc agcctgtctt caccggcacc ctcacctgtg acaccaagac ccaccccaat 2580ccagacttca cacagtattc tcccccacgc cgtctatgac caaaggcccc tgccaggtgt 2640gggtccacag cagcaggtat gtgtgaaagc aacgtagcgc cccgcggact gcagtgcgct 2700taaccaactc acctcccttc tcttagccca agcctgtccc tcgcacagcc tcgcacaaac 2760cacattgcct ggtggggccc agtgtactga aataaagtcg ttccgataga cacgtcaaaa 2820aaaaaaaa 282863415DNAHomo sapiens 63ttttttttat tgctattaag atttttcttt taatatgcca tgagatatct tgattgtata 60ttttccaaag tactttccag ccacatctcc caacccatcc aaaagacttt gccagtcttt 120ccaatgcaat aaaagatgct ggattatagt tttgtctacc atttcttttt gaaagcaata 180ttatactaat gactttaatg gtaatacact cttatctaat aaagaaacac atttacaaat 240atcagaaacc cagttttgga acaatttgca taaattttga actgaatcag cattttgtgg 300gttttttaaa aggcagcagt ttgactcacg acttgctgat aaacacgttt ctgctgaggg 360aaggggaaaa gacagggaga gtgaatgctg catttctcca ttggccccaa aagtg 415642455DNAHomo sapiens 64gaattcgggc gggcttcttc gctgccgacg tacgacgagt ggccgggctc ttgcgtctgg 60taacgcgctg tctctaacgc cagcgccgtc tcgcgcgcac tgcgcacaga ccacccgcag 120acgcccggca gtccgcaggc ccaaacgcgc acgcgacccc gctctccgca ccgtacccgg 180ccgcctcggc atggcgcccc gcagcgcccg gcgacccctg ctgctgctac tgcctgttgc 240tgctgctcgg cctcatgcat tgtcgtcagc agccatgttt atggtgaaaa atggcaacgg 300gaccgcgtgc ataatggcca acttctctgc tgccttctca gtgaactacg acaccaagag 360tggccccaag aacatgacct ttgacctgcc atcagatgcc acagtggtgc tcaaccgcag 420ctcctgtgga aaagagaaca cttctgaccc cagtctcgtg attgcttttg gaagaggaca 480tacactcact ctcaatttca cgagaaatgc aacacgttac agcgttcagc tcatgagttt 540tgtttataac ttgtcagaca cacacctttt ccccaatgcg agctccaaag aaatcaagac 600tgtggaatct ataactgaca tcagggcaga tatagataaa aaatacagat gtgttagtgg 660cacccaggtc cacatgaaca acgtgaccgt aacgctccat gatgccacca tccaggcgta 720cctttccaac agcagcttca gcaggggaga gacacgctgt gaacaagaca ggccttcccc 780aaccacagcg ccccctgcgc cacccagccc ctcgccctca cccgtgccca agagcccctc 840tgtggacaag tacaacgtga gcggcaccaa cgggacctgc ctgctggcca gcatggggct 900gcagctgaac ctcacctatg agaggaagga caacacgacg gtgacaaggc ttctcaacat 960caaccccaac aagacctcgg ccagcgggag ctgcggcgcc cacctggtga ctctggagct 1020gcacagcgag ggcaccaccg tcctgctctt ccagttcggg atgaatgcaa gttctagccg 1080gtttttccta caaggaatcc agttgaatac aattcttcct gacgccagag accctgcctt 1140taaagctgcc aacggctccc tgcgagcgct gcaggccaca gtcggcaatt cctacaagtg 1200caacgcggag gagcacgtcc gtgtcacgaa ggcgttttca gtcaatatat tcaaagtgtg 1260ggtccaggct ttcaaggtgg aaggtggcca gtttggctct gtggaggagt gtctgctgga 1320cgagaacagc acgctgatcc ccatcgctgt gggtggtgcc ctggcggggc tggtcctcat 1380cgtcctcatc gcctacctcg tcggcaggaa gaggagtcac gcaggctacc agactatcta 1440gcctggtgca cgcaggcaca gcagctgcag gggcctctgt tcctttctct gggcttaggg 1500tcctgtcgaa ggggaggcac actttctgca aacgtttctc aaatctgctt catccaatgt 1560gaagttcatc ttgcagcatt tactatgcac aacagagtaa ctatcgaaat gacggtgtta 1620attttgctaa ctgggttaaa tattttgcta actggttaaa cattaatatt taccaaagta 1680ggattttgag ggtgggggtg ctctctctga gggggtgggg gtgccgctgt ctctgagggg 1740tgggggtgcc gctgtctgag gggtgggggt gccgctctct ctgagggggt gggggtgccg 1800ctttctctga gggggtgggg gtgccgctct ctctgagggg gtgggggtgc tgctctctcc 1860gaggggtgga atgccgctgt ctctgagggg tgggggtgcc gctctaaatt ggctccatat 1920cattgagttt agggttctgg tgtttggttt cttcattctt tactgcactc agatttaagc 1980cttacaaagg gaaacctctg gccgtcacac gtaggacgca tgaaggtcac tcgtgtgagg 2040ctgacatgct cacacattac aacagtagag agggaaaatc ctaagacaga ggaactccag 2100agatgagtgt ctggagcggc ttcagttcag ctttaaaggc caggacgcgc gacacgtggc 2160tggcggcctc gttccagtgg cggcacgtcc ttggcgtctc taatgtctgc agctcaaggg 2220ctggcacttt tttaaatata aaaatggtgt tatttttatt tttttttgta aagtgatttt 2280tggtcttctg ttgacattcg ggtgatcctg ttctgcgctg tgtacaatgt gagatcggtg 2340cgttctcctg atgttttgcc gtggcttggg gattgtacac gggaccagct cacgtaatgc 2400attgcctgta acaatgtaat aaaaagcctc tttctttcaa aaaaaccccg aattc 2455653583DNAHomo sapiens 65cgcggacccg gccggcccag gcccgcgccc gccgcggccc tgagaggccc cggcaggtcc 60cggcccggcg gcggcagcca tggccggggg gccgggcccg ggggagcccg cagcccccgg 120cgcccagcac ttcttgtacg aggtgccgcc ctgggtcatg tgccgcttct acaaagtgat 180ggacgccctg gagcccgccg actggtgcca gttcgccgcc ctgatcgtgc gcgaccagac 240cgagctgcgg ctgtgcgagc gctccgggca gcgcacggcc agcgtcctgt ggccctggat 300caaccgcaac gcccgtgtgg ccgacctcgt gcacatcctc acgcacctgc agctgctccg 360tgcgcgggac atcatcacag cctggcaccc tcccgccccg cttccgtccc caggcaccac 420tgccccgagg cccagcagca tccctgcacc cgccgaggcc gaggcctgga gcccccggaa 480gttgccatcc tcagcctcca ccttcctctc cccagctttt ccaggctccc agacccattc 540agggcctgag ctcggcctgg ttccaagccc tgcttccctg tggcctccac cgccatctcc 600agccccttct tctaccaagc caggcccaga gagctcagtg tccctcctgc agggagcccg 660cccctctccg ttttgctggc ccctctgtga gatttcccgg ggcacccaca acttctcgga 720ggagctcaag atcggggagg gtggctttgg gtgcgtgtac cgggcggtga tgaggaacac 780ggtgtatgct gtgaagaggc tgaaggagaa cgctgacctg gagtggactg cagtgaagca 840gagcttcctg accgaggtgg agcagctgtc caggtttcgt cacccaaaca ttgtggactt 900tgctggctac tgtgctcaga acggcttcta ctgcctggtg tacggcttcc tgcccaacgg

960ctccctggag gaccgtctcc actgccagac ccaggcctgc ccacctctct cctggcctca 1020gcgactggac atccttctgg gtacagcccg ggcaattcag tttctacatc aggacagccc 1080cagcctcatc catggagaca tcaagagttc caacgtcctt ctggatgaga ggctgacacc 1140caagctggga gactttggcc tggcccggtt cagccgcttt gccgggtcca gccccagcca 1200gagcagcatg gtggcccgga cacagacagt gcggggcacc ctggcctacc tgcccgagga 1260gtacatcaag acgggaaggc tggctgtgga cacggacacc ttcagctttg gggtggtagt 1320gctagagacc ttggctggtc agagggctgt gaagacgcac ggtgccagga ccaagtatct 1380gaaagacctg gtggaagagg aggctgagga ggctggagtg gctttgagaa gcacccagag 1440cacactgcaa gcaggtctgg ctgcagatgc ctgggctgct cccatcgcca tgcagatcta 1500caagaagcac ctggacccca ggcccgggcc ctgcccacct gagctgggcc tgggcctggg 1560ccagctggcc tgctgctgcc tgcaccgccg ggccaaaagg aggcctccta tgacccaggt 1620gtacgagagg ctagagaagc tgcaggcagt ggtggcgggg gtgcccgggc atttggaggc 1680cgccagctgc atcccccctt ccccgcagga gaactcctac gtgtccagca ctggcagagc 1740ccacagtggg gctgctccat ggcagcccct ggcagcgcca tcaggagcca gtgcccaggc 1800agcagagcag ctgcagagag gccccaacca gcccgtggag agtgacgaga gcctaggcgg 1860cctctctgct gccctgcgct cctggcactt gactccaagc tgccctctgg acccagcacc 1920cctcagggag gccggctgtc ctcaggggga cacggcagga gaatcgagct gggggagtgg 1980cccaggatcc cggcccacag ccgtggaagg actggccctt ggcagctctg catcatcgtc 2040gtcagagcca ccgcagatta tcatcaaccc tgcccgacag aagatggtcc agaagctggc 2100cctgtacgag gatggggccc tggacagcct gcagctgctg tcgtccagct ccctcccagg 2160cttgggcctg gaacaggaca ggcaggggcc cgaagaaagt gatgaatttc agagctgatg 2220tgttcacctg ggcagatccc ccaaatccgg aagtcaaagt tctcatggtc agaagttctc 2280atggtgcacg agtcctcagc actctgccgg cagtgggggt gggggcccat gcccgcgggg 2340gagagaagga ggtggccctg ctgttctagg ctctgtgggc ataggcaggc agagtggaac 2400cctgcctcca tgccagcatc tgggggcaag gaaggctggc atcatccagt gaggaggctg 2460gcgcatgttg ggaggctgct ggctgcacag acccgtgagg ggaggagagg ggctgctgtg 2520caggggtgtg gagtagggag ctggctcccc tgagagccat gcagggcgtc tgcagcccag 2580gcctctggca gcagctcttt gcccatctct ttggacagtg gccaccctgc acaatggggc 2640cgacgaggcc tagggccctc ctacctgctt acaatttgga aaagtgtggc cgggtgcggt 2700ggctcacgcc tgtaatccca gcactttggg aggccaaggc aggaggatcg ctggagccca 2760gtaggtcaag accagccagg gcaacatgat gagaccctgt ctctgccaaa aaatttttta 2820aactattagc ctggcgtggt agcgcacgcc tgtggtccca gctgctgggg aggctgaagt 2880aggaggatca tttatgcttg ggaggtcgag gctgcagtga gtcatgattg tatgactgca 2940ctccagcctg ggtgacagag caagaccctg tttcaaaaag aaaaaccctg ggaaaagtga 3000agtatggctg taagtctcat ggttcagtcc tagcaagaag cgagaattct gagatcctcc 3060agaaagtcga gcagcaccca cctccaacct cgggccagtg tcttcaggct ttactgggga 3120cctgcgagct ggcctaatgt ggtggcctgc aagccaggcc atccctgggc gccacagacg 3180agctccgagc caggtcaggc ttcggaggcc acaagctcag cctcaggccc aggcactgat 3240tgtggcagag gggccactac ccaaggtcta gctaggccca agacctagtt acccagacag 3300tgagaagccc ctggaaggca gaaaagttgg gagcatggca gacagggaag ggaaacattt 3360tcagggaaaa gacatgtatc acatgtcttc agaagcaagt caggtttcat gtaaccgagt 3420gtcctcttgc gtgtccaaaa gtagcccagg gctgtagcac aggcttcaca gtgattttgt 3480gttcagccgt gagtcacact acatgccccc gtgaagctgg gcattggtga cgtccaggtt 3540gtccttgagt aataaaaacg tatgttccct aaaaaaaaaa aaa 3583663496DNAHomo sapiens 66gaattctatg gagtgtaatt ttgtgtatga attatatttt taaaacattg aagagttttc 60agaaagaagg ctagtagagt tgattactga tactttatgc taagcagtac ttttttggta 120gtacaatatt ttgttaggcg tttctgataa cactagaaag gacaagtttt atcttgtgat 180aaattgatta atgtttacaa catgactgat aattatagct gaatagtcct taaatgatga 240acaggttatt tagtttttaa atgcagtgta aaaagtgtgc tgtggaaatt ttatggctaa 300ctaagtttat ggagaaaata ccttcagttg atcaagaata atagtggtat acaaagttag 360gaagaaagtc aacatgatgc tgcaggaaat ggaaacaaat acaaatgata tttaacaaag 420atagagttta cagtttttga actttaagcc aaattcattt gacatcaagc actatagcag 480gcacaggttc aacaaagctt gtgggtattg acttccccca aaagttgtca gctgaagtaa 540tttagcccac ttaagtaaat actatgatga taagctgtgt gaacttagct tttaaatagt 600gtgaccatat gaaggtttta attacttttg tttattggaa taaaatgaga ttttttgggt 660tgtcatgtta aagtgcttat agggaaagaa gcctgcatat aattttttac cttgtggcat 720aatcagtaat tggtctgtta ttcaggcttc atagcttgta accaaatata aataaaaggc 780ataatttagg tattctatag ttgcttagaa ttttgttaat ataaatctct gtgaaaaatc 840aaggagtttt aatattttca gaagtgcatc cacctttcag ggctttaagt tagtattact 900caagattatg aacaaatagc acttaggtta cctgaaagag ttactacaac cccaaagagt 960tgtgttctaa gtagtatctt ggaaattcag agagatactc atcctacctg aatataaact 1020gagataaatc cagtaaagaa agtgtagtaa attctacata agagtctatc attgatttct 1080tttggtggta aaaatcttag ttcatgtgaa gaaatttcat gtgaatgttt tagctatcaa 1140acagcactgt cacctactca tgcacaaaac tgcctcccaa agacttttcc caggtccctc 1200gtatcaaaac attaagagta taatggaaga tagcacgatc ttgtcagatt ggacaaacag 1260caacaaacaa aaaatgaagt atgacttttc ctgtgaactc tacagaatgt ctacatattc 1320aactttcccc gccggggtgc ctgtctcaga aaggagtctt gctcgtgctg gtttttatta 1380tactggtgtg aatgacaagg tcaaatgctt ctgttgtggc ctgatgctgg ataactggaa 1440actaggagac agtcctattc aaaagcataa acagctatat cctagctgta gctttattca 1500gaatctggtt tcagctagtc tgggatccac ctctaagaat acgtctccaa tgagaaacag 1560ttttgcacat tcattatctc ccaccttgga acatagtagc ttgttcagtg gttcttactc 1620cagcctttct ccaaaccctc ttaattctag agcagttgaa gacatctctt catcgaggac 1680taacccctac agttatgcaa tgagtactga agaagccaga tttcttacct accatatgtg 1740gccattaact tttttgtcac catcagaatt ggcaagagct ggtttttatt atataggacc 1800tggagatagg gtagcctgct ttgcctgtgg tgggaagctc agtaactggg aaccaaagga 1860tgatgctatg tcagaacacc ggaggcattt tcccaactgt ccatttttgg aaaattctct 1920agaaactctg aggtttagca tttcaaatct gagcatgcag acacatgcag ctcgaatgag 1980aacatttatg tactggccat ctagtgttcc agttcagcct gagcagcttg caagtgctgg 2040tttttattat gtgggtcgca atgatgatgt caaatgcttt tgttgtgatg gtggcttgag 2100gtgttgggaa tctggagatg atccatgggt agaacatgcc aagtggtttc caaggtgtga 2160gttcttgata cgaatgaaag gccaagagtt tgttgatgag attcaaggta gatatcctca 2220tcttcttgaa cagctgttgt caacttcaga taccactgga gaagaaaatg ctgacccacc 2280aattattcat tttggacctg gagaaagttc ttcagaagat gctgtcatga tgaatacacc 2340tgtggttaaa tctgccttgg aaatgggctt taatagagac ctggtgaaac aaacagttca 2400aagtaaaatc ctgacaactg gagagaacta taaaacagtt aatgatattg tgtcagcact 2460tctaaatgct gaagatgaaa aaagagagga ggagaaggaa aaacaagctg aagaaatggc 2520atcagatgat ttgtcattaa ttcggaagaa cagaatggct ctctttcaac aattgacatg 2580tgtgcttcct atcctggata atcttttaaa ggccaatgta attaataaac aggaacatga 2640tattattaaa caaaaaacac agataccttt acaagcgaga gaactgattg ataccatttt 2700ggttaaagga aatgctgcgg ccaacatctt caaaaactgt ctaaaagaaa ttgactctac 2760attgtataag aacttatttg tggataagaa tatgaagtat attccaacag aagatgtttc 2820aggtctgtca ctggaagaac aattgaggag gttgcaagaa gaacgaactt gtaaagtgtg 2880tatggacaaa gaagtttctg ttgtatttat tccttgtggt catctggtag tatgccagga 2940atgtgcccct tctctaagaa aatgccctat ttgcaggggt ataatcaagg gtactgttcg 3000tacatttctc tcttaaagaa aaatagtcta tattttaacc tgcataaaaa ggtctttaaa 3060atattgttga acacttgaag ccatctaaag taaaaaggga attatgagtt tttcaattag 3120taacattcat gttctagtct gctttggtac taataatctt gtttctgaaa agatggtatc 3180atatatttaa tcttaatctg tttatttaca agggaagatt tatgtttggt gaactatatt 3240agtatgtatg tgtacctaag ggagtagtgt cactgcttgt tatgcatcat ttcaggagtt 3300actggatttg ttgttctttc agaaagcttt gaatactaaa ttatagtgta gaaaagaact 3360ggaaaccagg aactctggag ttcatcagag ttatggtgcc gaattgtctt tggtgctttt 3420cacttgtgtt ttaaaataag gatttttctc ttatttctcc ccctagtttg tgagaaacat 3480ctcaataaag tgcttt 3496672764DNAHomo sapiens 67ctctaaagct tagagccaag atggcgggat ccaggcaaag gggtctccgg gccagagttc 60ggccgctgtt ctgcgccttg ctgctgtcac tcggtcgctt cgtccggggc gacggcgtgg 120gaggagaccc cgcggtcgcg ttgccacatc gccgtttcga gtacaaatac agcttcaagg 180ggccgcacct ggtgcagagc gacgggaccg tgcccttctg ggcccacgcg gggaatgcta 240ttccaagttc agatcaaatt cgagtagcac catctttaaa aagccaaaga ggctcagtgt 300ggacaaagac aaaagcggcc tttgagaact gggaagttga ggtgacattt cgagtgactg 360gaagaggtcg aattggagct gatggcctag caatttggta tgcagaaaat caaggcttgg 420agggccctgt gtttggatca gctgatctgt ggaatggtgt tggaatattt tttgattctt 480ttgacaatga tggaaagaaa aataatcctg ctatagtaat tataggcaac aatggacaaa 540tccattatga ccatcaaaat gacggggcta gtcaagcttt ggcaagttgc cagagggact 600tccgcaacaa accctatcct gtccgagcaa agattaccta ttaccagaac acactgacag 660taatgatcaa taatggcttt acaccagata aaaatgatta tgaattttgt gccaaagtgg 720aaaatatgat tatccctgca caagggcatt ttggaatatc tgctgcaact ggaggtcttg 780cagatgacca tgatgtcctt tcttttctga ctttccagtt gactgaacct ggaaaagagc 840cgcccacacc agataaagaa atttcggaaa aggaaaaaga aaagtatcag gaggaatttg 900agcactttca acaagaattg gataaaaaaa aagaggaatt ccagaagggc caccccgacc 960tccaagggca gcctgcggag gaaatatttg agagtgtagg agatcgagag ctaagacaag 1020tctttgaagg acagaatcgt attcatcttg aaatcaagca gctgaaccgg cagttagata 1080tgattcttga tgaacagaga agatatgtct cttccttaac agaggaaatc tctaaaagag 1140gagcaggaat gcctgggcag catgggcaga ttactcaaca agaactggat actgttgtga 1200aaactcagca tgagattctg agacaagtaa atgaaatgaa aaattccatg agtgaaaccg 1260tcagactggt cagtggaatg cagcaccctg gctctgctgg aggcgtctat gagacaacac 1320agcacttcat tgacatcaaa gagcacctgc acatagtaaa gagggacata gataacttag 1380tgcagcgaaa tatgccatca aatgaaaagc cgaaatgccc agaactacca ccatttccat 1440catgtttgtc tacggtccac ttcattatat ttgttgtggt gcaaactgta ttattcattg 1500gttatatcat gtataggtct cagcaagaag cagctgccaa aaaattcttt tgactaccat 1560tttcctgtgt acttcatcta tttgtgtaca aaatgatgtc gttttgaggg aatttaagta 1620tttaaattgc ttcatagtct aaattattaa ttttcttaat aaaataactg tttaaacatt 1680gatttgcagt taagaataaa ccttaaagca aagacaacca cattttaatt tgttcacagt 1740atgtaaatct gtctaaattt cagtgaattt ctggtcagta tgatgcagcc tctgagcaga 1800tattgaccag taagagggta aataaagtgg gggcaacccc tggatatgaa tgttaccccc 1860taagtctcca atattgcagg tttccctgta taacgtaaac acacttgccc tcatgcctcc 1920cagaatatga ggtctaatta agaagtccca tcaggtttat tttgtaacca aagtcttttt 1980tagaggtcag acttcctaat caaaggcctg ggcctgcagt cctttcatct taatgcaact 2040tcctttgaaa tcaaagaata ttttgtctga gagctttaag gatctggtaa tagacttcaa 2100aatgttaagt gaaatttttt ttcctctatt tatcaatgat atatttcact tttaaaggaa 2160attttggagg aaaatatagc tgctttttgc ctaaaaaacc ttgtgggtgg aaatattcct 2220ctgagaatgg cttttatagg tattttgcct ggtaatgtat tcattcatga ttgcccatat 2280tcttgaatgt ttcttcattc caatggggtc aggtcaatat tatgaaaata atttttatat 2340ttatatttgt aactaagaat ttatttctcc ctttactaca cgatgtaaat tcacgtcaaa 2400ttcgatgatc tgaggattta aattcacaaa acctgccact acattctggt ttacattagt 2460tacttcatgc tggctggggt tagtgaccat ttgcatactc ttttaaatca aggaggctgt 2520agtagaggca gttttaagat tcttgaaggc aaaatttgaa aaacagtgaa tacttctaat 2580tgtttccttt tagtgccaga actaagacat tgtgaagcac ttgttagtaa acttaacctt 2640gaaatgtcag actggaagga gtttttatgt ctttgtgcat acttctgggt attacagaaa 2700cagtctgtaa ataacatttt aagatgcaaa tttaattctg ttcacagctg atttatactg 2760attt 276468403DNAHomo sapiens 68tttcattagt tatcattagt ttattataaa agagaaatat ggaaattatt tacatgacga 60aagatttcag aacttcagtg gaatgggcag catcatgttg atgccatttc aatagtgact 120tatttcagtc tacgtacttt ccaagaatgt caccatctct aaataggaaa taatccttgt 180catctagaac tactttggtg cctccatatt ctgggagaag aactttatct ccaactttca 240cgctaactgg ttgaatctct ccaccctttc ctttagaacc cgatccaaca gcgactactg 300ttgcttgcaa tacttttcct tgagattttt ctggaagcat aatgcctcct ttggttacag 360tttcagcagc actcctttca accaatactc ggtcaaagag tgg 40369656DNAHomo sapiens 69acaactcggt ggtggccact gcgcagacca gacttcgctc gtactcgtgc gcctcgcttc 60gcttttcctc cgcaaccatg tctgacaaac ccgatatggc tgagatcgag aaattcgata 120agtcgaaact gaagaagaca gagacgcaag agaaaaatcc actgccttcc aaagaaacga 180ttgaacagga gaagcaagca ggcgaatcgt aatgaggcgt gcgccgccaa tatgcactgt 240acattccaca agcattgcct tcttatttta cttcttttag ctgtttaact ttgtaagatg 300caaagaggtt ggatcaagtt taaatgactg tgctgcccct ttcacatcaa agaactactg 360acaacgaagg ccgcgcctgc ctttcccatc tgtctatcta tctggctggc agggaaggaa 420agaacttgca tgttggtgaa ggaagaagtg gggtggaaga agtggggtgg gacgacagtg 480aaatctagag taaaaccaag ctggcccaag gtgtcctgca ggctgtaatg cagtttaatc 540agagtgccat tttttttttt gttcaaatga ttttaattat tggaatgcac aattttttta 600atatgcaaat aaaaagttta aaaacttaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 65670361DNAHomo sapiens 70tttttttttc aatgttcagt ttcctttaat gacccccatc tccctgaagg gcaggtgcag 60gcagctaggt gatggcaaga gatgttcact tgaagatctt gccctgattg aaggctttgc 120cacatgctgg aaggccccct cccaggaaaa gtactctcga accagcgtct gggtctcctc 180gctgccagga tccagtttcc gccatgtgta tgactcgtag tccacctgcc aatctggact 240cagcggaaag gcaagctcct ggcctcggaa gacccagact ccagaaatgg agtctgctat 300tgttggttcc aaaaaggatg acactgggcg aaggcatttc ttcctcagct tgtccagttc 360g 36171455DNAHomo sapiensmisc_feature(376)..(376)n is a, c, g, or t 71ttttttttga taatttatga ttttattgtc tttcctttgt ccggccttta acatgtttct 60gtaatttaaa taaaaatcta tttactttct ccattttagc aaatggtttc tttacccaaa 120taggttgcac tatagtcccc atatggtttt ctactgttcc acaaccacta tttcacaaag 180attgacaaaa ctttaataaa agttaaattt acaggacatc ttaaggataa cttggggaaa 240tatgtaggta aaaaaggaat cgagtccaca aattaaggaa tattttgcta atatggccca 300acaccaattt caggcaaatc caatctactt aactcatata tttaatgtgg ggtaattttt 360cttaaccaaa atttangggg gggtatggan tggatattat ttatggccct tggacaaggg 420tggacngtgt ggntttgttg tggactaggg ngggg 45572645DNAHomo sapiens 72ctcctgcagc gtctggggtt tccgttgcag tcctcggaac caggacctcg gcgtggccta 60gcgagttatg gcgacgaagg ccgtgtgcgt gctgaagggc gacggcccag tgcagggcat 120catcaatttc gagcagaagg aaagtaatgg accagtgaag gtgtggggaa gcattaaagg 180actgactgaa ggcctgcatg gattccatgt tcatgagttt ggagataata cagcaggctg 240taccagtgca ggtcctcact ttaatcctct atccagaaaa cacggtgggc caaaggatga 300agagaggcat gttggagact tgggcaatgt gactgctgac aaagatggtg tggccgatgt 360gtctattgaa gattctgtga tctcactctc aggagaccat tgcatcattg gccgcacact 420ggtggtccat gaaaaagcag atgacttggg caaaggtgga aatgaagaaa gtacaaagac 480aggaaacgct ggaagtcgtt tggcttgtgg tgtaattggg atcgcccaat aaacattccc 540ttggatgtag tctgaggccc cttaactcat ctgttatcct gctagctgta gaaatgtatc 600ctgataaaca ttaaacactg taatcttaaa aaaaaaaaaa aaaaa 645731684DNAHomo sapiens 73gctttcacaa atacagctct gcaacgcgtt tgccctgata ccatgtctct tcgactttcc 60agtgcatcca ggaggtcctg tcctcgtccc accactggat cactcagact ctatggtggg 120ggaaccagct ttggtactgg aaattcttgt ggcatttcag ggattggaag tggcttctct 180agtgccttcg gaggcagctc atcgggagga aacacagggg gaggtaatcc ctgtgctggc 240ttcactgtga atgagcgggg gctcctttct ggcaatgaga aggtgaccat gcagaacctc 300aatgaccgcc tggcatccta cctggacagt gtgcatgctc tggaggaggc caacgctgac 360ctggagcaga agatcaaggg ctggtatgag aaatttgggc ctggctcttg ccgtggtctt 420gatcatgact atagcagata tttcccaata attgatgacc ttaaaaatca gatcatcgca 480tccaccacca gcaatgctaa tgctgttctg cagatcgata atgccaggct tacagctgat 540gatttcagac tcaagtatga aaatgagctg gctcttcacc agagtgtaga ggctgatgtc 600aatgggttac gaagagtttt ggatgaaata accctgtgca gaacagatct ggagattcag 660tatgaaaccc tgagtgagga gatgacttac ctcaaaaaga accataaaga ggaaatgcaa 720gttctgcagt gcgcagctgg aggcaacgtg aacgtggaga tgaacgcagc ccccggggtg 780gacctcacag ttctgctgaa caacatgcga gctgagtacg aagcccttgc agagcagaac 840cgcagggacg cggaggcctg gttcaacgag aagagcgcct ccctgcagca gcagatctct 900gaggatgtcg gagccacaac ctcagcccgg aatgagctga ctgaaatgaa gcgcactctt 960caaaccctgg aaattgaact tcagtctctc ctagccacga aacactccct ggagtgctcc 1020ttgacagaga ccgagagcaa ctactgtgcg cagctggcgc agatccaggc tcagatcggg 1080gccctggagg agcagctgca ccaggtcaga accgagaccg agggccagaa gctggagtat 1140gagcagctcc tggacatcaa gctccacctg gaaaaagaaa ttgagaccta ctgtctcctt 1200ataggaggag atgatggagc ctgtaagtct gggggttaca agtctaaaga ttatggatct 1260ggaaatgtgg gaagtcaagt caaagaccca gccaaagcca tagtggttaa gaaagttctt 1320gaggaggtag accaacgcag caaaatactt accaccaggc tccactccct ggaagagaaa 1380tctcaaagca attaatttga gatgcaacag agaacgtatg ccacatagcc cctgcgaaga 1440aaaggcatta tgtatctgtc cagaaaaatg tgcatgtcta agaaaaatgt ctaacctgtt 1500gtctttctgt tactttcttt ctgggcaatc aatgacagca tctccccatt catctagaag 1560aatgccacac acaaatatga ctcatttgat tatcctacag aaatctgttg tcaattcttt 1620gtattcaata aacctcttct ttagcaagtt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1680aaaa 1684

* * * * *


uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.

While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.

All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.

© 2024 USPTO.report | Privacy Policy | Resources | RSS Feed of Trademarks | Trademark Filings Twitter Feed