Methods for detecting expression of lnc-FANCI-2 in cervical cells

Zheng , et al. Nov

Patent Grant 10487365

U.S. patent number 10,487,365 [Application Number 15/270,774] was granted by the patent office on 2019-11-26 for methods for detecting expression of lnc-fanci-2 in cervical cells. This patent grant is currently assigned to THE UNITED STATES OF AMERICA, AS REPRESENTED BY THE SECRETARY, DEPARTMENT OF HEALTH AND HUMAN SERVICES. The grantee listed for this patent is The United States of America, as Represented by the Secretary,Department of Health and Human Services. Invention is credited to Xiaohong Wang, Junfen Xu, Yanqin Yang, Zhi-Ming Zheng, Jun Zhu.


United States Patent 10,487,365
Zheng ,   et al. November 26, 2019

Methods for detecting expression of lnc-FANCI-2 in cervical cells

Abstract

Described herein are biomarkers for HPV-associated pre-cancers and cancers such as cervical cancer and cervical intraepithelial neoplasia. The RNA binding protein (RBP) and long-noncoding RNA (lnc-RNA) biomarkers can be detected and used to diagnose HPV-associated pre-cancers and cancers. In addition, early diagnosis of HPV-associated pre-cancers and cancers can facilitate therapeutic intervention in patients, particularly in the pre-cancer stage which can delay or prevent progression to cancer.


Inventors: Zheng; Zhi-Ming (Rockville, MD), Xu; Junfen (Frederick, MD), Zhu; Jun (Potomac, MD), Yang; Yanqin (Bethesda, MD), Wang; Xiaohong (Germantown, MD)
Applicant:
Name City State Country Type

The United States of America, as Represented by the Secretary,Department of Health and Human Services

Bethesda

MD

US
Assignee: THE UNITED STATES OF AMERICA, AS REPRESENTED BY THE SECRETARY, DEPARTMENT OF HEALTH AND HUMAN SERVICES (Bethesda, MD)
Family ID: 61618416
Appl. No.: 15/270,774
Filed: September 20, 2016

Prior Publication Data

Document Identifier Publication Date
US 20180080084 A1 Mar 22, 2018

Current U.S. Class: 1/1
Current CPC Class: C12Q 1/6886 (20130101); C12Q 2600/112 (20130101); C12Q 2600/158 (20130101)
Current International Class: C12Q 1/68 (20180101); C12Q 1/6886 (20180101)

References Cited [Referenced By]

U.S. Patent Documents
7526387 April 2009 Baker et al.
7659062 February 2010 Santin
7927795 April 2011 Santin
7939261 May 2011 Baker et al.
7939263 May 2011 Clarke et al.
7943306 May 2011 Chang et al.
8110358 February 2012 Liew
8669058 March 2014 Liew
8741574 June 2014 Ried et al.
8855941 October 2014 Noguchi et al.
2003/0225528 December 2003 Baker et al.
2007/0141618 June 2007 Dressman et al.
2008/0286781 November 2008 Monahan
2009/0136486 May 2009 Pyeon
2009/0215054 August 2009 Carter et al.
2010/0316990 December 2010 Dynan et al.
2011/0244459 October 2011 Bertucci et al.
2012/0015827 January 2012 Wirtz
2012/0129705 May 2012 Iftner et al.
2013/0102488 April 2013 Barrie et al.
2013/0280258 October 2013 D'Andrea et al.
2014/0024539 January 2014 Craig et al.
2014/0162254 June 2014 Miller et al.
2014/0235479 August 2014 Depinho et al.
2014/0342946 November 2014 Kuriakose et al.
2015/0051103 February 2015 Barrie et al.

Other References

Chen et al (Biomedicine & Pharmacotherapy. May 2015. 72: 83-90 (Year: 2015). cited by examiner .
Xu et al RNA. May 2015. The Twentieth Annual Meeting of the RNA Society, Abstract 178, available via URL <masociety.org/wp-content/uploads/2015/05/RNA-2015-Abstract-Book-print- -150505.pdf> (Year: 2015). cited by examiner .
Fu et al Med Sci Monito. May 2015. 21: 1276-1287 (Year: 2015). cited by examiner .
Gibb et al Int J Gynecol Cancer. 2012. 22: 1557-1563 (Year: 2012). cited by examiner .
Camargo et al.; "GWAS Reveals New Recessive Loci Asociated with Non-syndromic Facial Clefting"; Eur J Med Genet.; 55(10); pp. 510-514; (2012). cited by applicant .
Expression of GLB1L2 in cancer--Summary--The Human Protein Atlas; printed May 6, 2015; 1 page; http://www.proteinatlas.org/ENSG00000149328-GLB1L2/cancer. cited by applicant .
Flanagan et al.; "Genomics Screen in Transformed Stem Cell Reveals RNASEH2A, PPAP2C, and ADARB1 as Putative Anticancer Drug Targets"; Mol Cancer Ther; 8(1); pp. 249-260; (2009). cited by applicant .
Itoh et al.; "Role of Growth Factor Receptor--Bound Protein 7 in Hepatocellular Carcinoma"; Mol Cancer Res; 5(7); pp. 667-673; (2007). cited by applicant .
Nadler et al.; "Growth Factor Receptor-bound Protein-7 (Grb7) as a Prognostic Marker and Therapeutic Target in Breast Cancer"; Annals of Oncology; 21; pp. 466-473; (2010). cited by applicant .
Takahashi et al.; Manuscript: Significance of Polypyrimidine Tract Binding Protein 1 Expression in Colorectal Cancer; Published OnlineFirst Apr. 22, 2015; DOI: 10.1158/1535-7163.MCT-14-0142; 50 pages (2015)_downloaded from mct.aacrjournals.org on Apr. 24, 2015. cited by applicant .
Wang et al.; "Differential Functions of Growth Factor Receptor-Bound Protein 7 (GRB7) and Its Valiant GRB7v in Ovarian Carcinogenesis"; Clin Cancer Res; 16; pp. 2529-2539; (2010). cited by applicant .
Williams et al.; "A Systems Genetics Approach Identifies CXCL14, ITGAX, and LPCAT2 as Novel Aggressive Prostate Cancer Susceptibility Genes"; PLoS Genet; 10(11): e1004809; 15 pages; (2014). cited by applicant .
Yang et al.; "Identification of Genes with Correlated Patterns ofVariations in DNA Copy Number and Gene Expression Level in Gastric Cancer"; Genomics; 89; pp. 451-459; (2007). cited by applicant .
Zhang et al.; "High Expression of Neuro-Oncological Ventral Antigen 1 Correlates with Poor Prognosis in Hepatocellular Carcinoma"; PLoS ONE; 9(3); c90955; 11 pages (2014). cited by applicant.

Primary Examiner: Myers; Carla J
Attorney, Agent or Firm: Cantor Colburn LLP

Government Interests



STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH & DEVELOPMENT

This invention was made in part with government support from the National Institutes of Health. The government has certain rights in this invention.
Claims



The invention claimed is:

1. A method of quantitating an expression level of a lnc-FANCI-2 polynucleotide in a sample containing cells from a test patient's cervix with one or more first polynucleotides that hybridizes to the lnc-FANCI-2 polynucleotide, the method comprising contacting the sample containing cells from the test patient's cervix with the one or more first polynucleotides, and detecting the level of hybridization of the one or more first polynucleotides to the lnc-FANCI-2 polynucleotide, comparing the level of hybridization in the sample containing cells from the test patient's cervix to a control level of hybridization in a control sample of normal cervical tissues, and determining differential expression of the lnc-FANCI-2 polynucleotide in the sample containing cells from the test patient's cervix when the level of hybridization for the sample containing cells from the test patient's cervix is at least about 300% of the control level of hybridization in the control sample, wherein the one or more first polynucleotides are SEQ ID NOs: 78, 79 and 80.

2. The method of claim 1, wherein detecting the level of hybridization of the one or more first polynucleotides to the Inc-FANCI-2 polynucleotide is done with real-time RT-PCR.

3. The method of claim 1, wherein the sample containing cells from the test patient's cervix comprises a PAP smear, a vaginal wash, or a cervical biopsy sample.
Description



FIELD OF THE DISCLOSURE

The present disclosure is related to novel polynucleotide biomarkers which can be detected and can be used for the diagnosis of HPV-associated pre-cancers and HPV-associated cancers such as cervical cancer and cervical intraepithelial neoplasia as well as methods of treatment of HPV-associated pre-cancers and HPV-associated cancers.

BACKGROUND

High-risk HPV persistent infection leads to the development of certain types of cancers in the cervix, anus, and oropharynx, for example. Fifteen mucosal HPV types are identified as oncogenic or high-risk (HR) HPVs, with HPV16 and HPV18 being particularly associated with invasive cervical cancer. Cervical cancer is the second most common cancer among women worldwide. Approximately 500,000 incident cases of cervical cancer and approximately 320,000 cervical cancer deaths are estimated each year and more than 80% of the cases arise in developing countries.

There is a need for diagnostic markers that can be detected and used for early diagnosis of high-risk HPV infection, HPV-associated pre-cancer and HPV-associated cancer and for the development of intervention strategies for treatment of HPV-induced cancers.

SUMMARY

In one aspect, a method of determining if a test patient has stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia or cervical cancer comprises

determining an expression level of a first polynucleotide biomarker in a sample containing cells from the test patient's cervix with one or more first polynucleotides that hybridizes to the first polynucleotide biomarker, wherein the first polynucleotide biomarker is lnc-FANCI-2, lnc-GLB1L2-1, GRB7 (SEQ ID NOs: 8-11 and 94), NOVA1 (SEQ ID NOs: 14, 15 and 95), RNASEH2A (SEQ ID NO: 19), CDKN2A (SEQ ID NOs: 1-4), ELAVL2 (SEQ ID NOs: 5-7), HSPB1 (SEQ ID NO: 12), KHSRP (SEQ ID NO: 13), PTBP1 (SEQ ID NOs: 16-18), or a combination thereof,

correlating the expression level of the first polynucleotide biomarker in the sample containing cells from the test patient's cervix to a reference expression level of the first polynucleotide biomarker in a reference sample, wherein the reference sample is a control sample from a patient or patients with no evidence of cervical cancer, a control sample from a cervical cancer patient or patients, or a control sample from a patient or patients with stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia, and

determining, based on said correlation, if the test patient has cervical cancer, or stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia.

In another aspect, the method of determining if a test patient has stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia or cervical cancer comprises

determining an expression level of a first polynucleotide biomarker in a sample containing cells from the test patient's cervix with one or more first polynucleotides that hybridizes to the first polynucleotide biomarker, wherein the first polynucleotide biomarker is GRB7 (SEQ ID NOs: 8-11 and 84), NOVA1 (SEQ ID NOs: 14, 15 and 95), RNASEH2A (SEQ ID NO: 19), or a combination thereof, and/or

determining an expression level of a second polynucleotide biomarker in the sample containing cells from the test patient's cervix with one or more second polynucleotides that hybridizes to the second polynucleotide biomarker, wherein the second polynucleotide biomarker is lnc-FANCI-2, lnc-GLB1L2-1, or a combination thereof.

In a further aspect, a method of quantitating an expression level of a first polynucleotide biomarker in a sample containing cells from a test patient's cervix with one or more first polynucleotides that hybridizes to the first polynucleotide biomarker comprises

contacting the sample containing cells from test patient's cervix with the one or more first polynucleotides, and

detecting the level of hybridization of the one or more first polynucleotides to the first polynucleotide biomarker,

wherein the first polynucleotide biomarker is lnc-FANCI-2, lnc-GLB1L2-1, GRB7 (SEQ ID NOs: 8-11 and 94), NOVA1 (SEQ ID NOs: 14, 15 and 95), RNASEH2A (SEQ ID NO: 19), CDKN2A (SEQ ID NOs: 1-4), ELAVL2 (SEQ ID NOs: 5-7), HSPB1 (SEQ ID NO: 12), KHSRP (SEQ ID NO: 13), PTBP1 (SEQ ID NOs: 16-18), or a combination thereof.

In a yet further aspect, a method of treating a test patient in need of treatment for stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia or cervical cancer comprises

determining an expression level of a first polynucleotide biomarker in a sample containing cells from the test patient's cervix with one or more first polynucleotides that hybridizes to the first polynucleotide biomarker, wherein the first polynucleotide biomarker is lnc-FANCI-2, lnc-GLB1L2-1, GRB7 (SEQ ID NOs: 8-11 and 94), NOVA1 (SEQ ID NOs: 14, 15 and 95), RNASEH2A (SEQ ID NO: 19), CDKN2A (SEQ ID NOs: 1-4), ELAVL2 (SEQ ID NOs: 5-7), HSPB1 (SEQ ID NO: 12), KHSRP (SEQ ID NO: 13), PTBP1 (SEQ ID NOs: 16-18), or a combination thereof,

correlating the expression level of the first polynucleotide biomarker in the sample containing cells from the test patient's cervix to a reference expression level of the first polynucleotide biomarker in a reference sample, wherein the reference sample is a control sample from a patient or patients with no evidence of cervical cancer, a control sample from a cervical cancer patient or patients, or a control sample from a patient or patients with stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia, and

administering a therapeutic intervention for the treatment of stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia, or cervical cancer when it is determined, based on said expression levels, that the test patient has stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia.

In a still further aspect, a method of determining if a test patient has an HPV-associated pre-cancer or an HPV-associated cancer comprises

determining an expression level of a first polynucleotide biomarker in a sample containing cells from a tissue of the test patient with one or more first polynucleotides that hybridizes to the first polynucleotide biomarker,

correlating the expression level of the first polynucleotide biomarker in the sample containing cells from the tissue of the test patient to a reference expression level of the first polynucleotide biomarker in a reference sample, wherein the reference sample is a control sample from a patient or patients with no evidence of HPV-associated pre-cancer or HPV-associated cancer, a control sample from a patient or patients with HPV-associated pre-cancer, or a control sample from a patient or patients with HPV-associated cancer, and

determining, based on said correlation, if the test patient has HPV-associated pre-cancer or HPV-associated cancer,

wherein the first polynucleotide biomarker is lnc-FANCI-2, lnc-GLB1L2-1, GRB7 (SEQ ID NOs: 8-11 and 94), NOVA1 (SEQ ID NOs: 14, 15 and 95), RNASEH2A (SEQ ID NO: 19), CDKN2A (SEQ ID NOs: 1-4), ELAVL2 (SEQ ID NOs: 5-7), HSPB1 (SEQ ID NO: 12), KHSRP (SEQ ID NO: 13), PTBP1 (SEQ ID NOs: 16-18), or a combination thereof.

In another aspect, a method of quantitating an expression level of a first polynucleotide biomarker in a sample containing cells from a tissue of the test patient with one or more first polynucleotides that hybridizes to the first polynucleotide biomarker comprises

contacting the sample containing cells from a tissue of the test patient with the one or more first polynucleotides, and

detecting the level of hybridization of the one or more first polynucleotides to the first polynucleotide biomarker,

wherein the first polynucleotide biomarker lnc-FANCI-2, lnc-GLB1L2-1, is GRB7 (SEQ ID NOs: 8-11 and 94), NOVA1 (SEQ ID NOs: 14, 15 and 95), RNASEH2A (SEQ ID NO: 19), CDKN2A (SEQ ID NOs: 1-4), ELAVL2 (SEQ ID NOs: 5-7), HSPB1 (SEQ ID NO: 12), KHSRP (SEQ ID NO: 13), PTBP1 (SEQ ID NOs: 16-18), or a combination thereof.

In a yet further aspect, a method of treating a test patient in need of treatment for an HPV-associated pre-cancer or an HPV-associated cancer comprises

determining an expression level of a first polynucleotide biomarker in a sample containing cells from a tissue of the test patient with one or more first polynucleotides that hybridizes to the first polynucleotide biomarker,

correlating the expression level of the first polynucleotide biomarker in the sample containing cells from the tissue of the test patient to a reference expression level of the first polynucleotide biomarker in a reference sample, wherein the reference sample is a control sample from a patient or patients with no evidence of HPV-associated pre-cancer or HPV-associated cancer, a control sample from a patient or patients with HPV-associated pre-cancer, or a control sample from a patient or patients with HPV-associated cancer, and

administering a therapeutic intervention for the treatment of HPV-associated pre-cancer or HPV-associated cancer when it is determined, based on said expression levels, that the test patient has HPV-associated pre-cancer or an HPV-associated cancer,

wherein the first polynucleotide biomarker is lnc-FANCI-2, lnc-GLB1L2-1, GRB7 (SEQ ID NOs: 8-11 and 94), NOVA1 (SEQ ID NOs: 14, 15 and 95), RNASEH2A (SEQ ID NO: 19), CDKN2A (SEQ ID NOs: 1-4), ELAVL2 (SEQ ID NOs: 5-7), HSPB1 (SEQ ID NO: 12), KHSRP (SEQ ID NO: 13), PTBP1 (SEQ ID NOs: 16-18), or a combination thereof.

BRIEF DESCRIPTION OF THE DRAWINGS

The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.

FIG. 1 is a flowchart of the RNA-sequencing (RNA-Seq) analyses for RNA-binding proteins (RBPs).

FIG. 2 shows Venn diagrams showing 95 differentially expressed RBP genes being identified from two separate RNA-seq analyses of cervical cancer, pre-cancer to normal cervical tissues.

FIG. 3 shows a heat map comparing 95 differentially expressed RBP genes in cervical cancer to normal cervical tissues.

FIG. 4 shows the TaqMan.RTM. RT-qPCR validation of the 8 selected RBPs.

FIG. 5 shows that high-risk HPV16 infection affects the expression of RBPs. Total RNA extracted from human vaginal keratinocyte (HVK)-derived raft cultures with (HVK16) or without (HVK) productive HPV16 infection and human foreskin keratinocyte (HFK) derived raft cultures with (HFK16) or without (HFK) productive HPV16 infection were examined by TaqMan.RTM. RT-qPCR for the expression of 8 RBPs.

FIG. 6 shows that high-risk HPV18 infection affects the expression of RBPs. Total RNA extracted from human vaginal keratinocyte (HVK)-derived raft cultures with (HVK18) or without (HVK) productive HPV18 infection and human foreskin keratinocyte (HFK) derived raft cultures with (HFK18) or without (HFK) productive HPV18 infection were examined by TaqMan.RTM. RT-qPCR for the expression of 8 RBPs.

FIG. 7 shows that both HPV16 and HPV18 increase the expression of CDKN2A and RNASEH2A, but decrease the expression of NOVA1 in HFK- and HVK-derived rafts.

FIG. 8 shows that HPV18 infection and viral E6 and/or E7 affect the expression of RNASEH2A and Nova1. The expression of RNASEH2A and NOVA1 in primary human keratinocytes (PHK)-derived raft tissues with or without HPV18 infection on day 8, day 12, and day 16 or PHK rafts transduced with a retrovirus expression HPV18 E6, E7 or E6E7 or with an empty control retrovirus were further validated by TaqMan.RTM. RT-qPCR.

FIG. 9 shows that knockdown or overexpression of RNASEH2A in HeLa or CaSki cells affects cell proliferation. Specific-siRNA knockdown or ectopic expression of RNASEH2A from a mammalian expression vector in HeLa or CaSki cells on cell proliferation was evaluated by Cell Counting Kit-8 (CCK-8) assay

FIG. 10 shows HPV oncoprotein E7 regulates the expression of RNASEH2A via E2F1. Specific-siRNA knockdown or ectopic expression of E2F1 from a mammalian expression vector in HeLa or CaSki cells on RNASEH2A was evaluated by Western blot.

FIG. 11 is a flowchart of the RNA-Seq analyses for long-noncoding RNAs (lnc-RNAs).

FIG. 12 is a heat map showing 209 overlapped, differentially expressed lnc-RNAs from cervical cancer, pre-cancer to normal cervical tissues.

FIG. 13 shows an increase of lnc-FANCI-2, and decrease of lnc-GLB1L2-1 expression along with the cervical lesion progression from normal cervix. Lnc-FANCI-2 and lnc-GLB1L2-1 RNA expression was examined by RT-qPCR in 24 normal, 25 CIN 2-3, and 23 cancer tissues.

FIG. 14 shows that HPV infection increases lnc-FANCI-2 expression in HVK- and PHK-derived rafts and viral E7 or E6 is responsible for the increase. The expression of lnc-FANCI-2 in human vaginal keratinocytes (HVK)-derived raft tissues without (HVK) or with HPV16 (HVK16) or HPV18 (HVK18) infection or primary human keratinocytes (PHK)-derived raft tissues without or with HPV18 infection.

The above-described and other features will be appreciated and understood by those skilled in the art from the following detailed description, drawings, and appended claims.

DETAILED DESCRIPTION

Using an RNA-sequencing (RNA-Seq) approach, the inventors of the present application examined seven normal cervical tissues and seven cervical cancer tissues for their expression landscapes of approximately 19,000 coding and 113,513 noncoding RNAs. 614 differentially expressed coding transcripts enriched in cancer related pathways were identified, with 95 of them encoding RNA-binding proteins (RBPs) from the analyzed 1502 human RBPs. Moreover, 209 differentially, abundantly expressed long-noncoding RNAs (lnc-RNAs) from normal cervix to cervical cancer were identified. Validation of the altered expression of 26 candidates, including 8 RBP genes by using TaqMan.RTM. real-time PCR in a cohort of 47 human cervical tissue samples, including 24 normal cervical tissues and 23 cervical cancer tissues, showed that they are broadly involved in cervical carcinogenesis. Many of the identified RBP candidates had not been previously reported. Using human vaginal keratinocyte-derived raft culture tissues with or without HPV16 and HPV18 infection, it was further corroborated that these RBP candidates, including CDKN2A, ELAVL2, GRB7, HSPB1, KHSRP, PTBP1, RNASEH2A, and NOVA1, are regulated by HPV infection. Further, the inventors found that lnc-FANCI-2 was increasingly expressed along with cervical lesion progression from cervical intraepithelial neoplasia (CIN) to cervical cancer, when compared to the normal tissues. In contrast, lncGLB1L2-1 was gradually decreased along with the lesion progression, when compared to the normal tissues. In addition, FAM83A, SEMA3F, CLDN10, ASRGL1, which are not RBPs, were also found to have altered expression in cervical cancer compared to normal tissue, with FAM83A and SEMA3F being increased in cervical cancer and CLDN10 and ASRGL1 being decreased in cervical cancer. The results presented herein provide the first comprehensive expression atlas of RBPs and lnc-RNAs in normal cervix and cervical cancer, which can be detected to provide better diagnosis and treatment of patients with cervical cancer.

More specifically, an increase of lnc-FANCI-2 RNA, including all of its 35 isoforms, and a decrease of lnc-GLB1L2-1, including its 21 isoforms, were identified in cervical cancer. Fanconi anemia (FA) frequently develops squamous cell carcinoma at sites that are associated with HPV-driven cancer including the female reproductive tract, and is caused by mutations in one of 15 genes in the FA pathway (including FANCA, FANCD2, and FANCI). Loss of FA pathway components FANCA and FANCD2 stimulates E7 protein accumulation in human keratinocytes, and loss of FANCD2 stimulates HPV DNA replication. Both FANCI and lnc-FANCI-2 are expressed from the same location at chromosome 15q26.1. Further, both GLB1L2 (galactosidase, beta 1-like 2) and lnc-GLB1L2-1 are expressed from Chromosome 11q25, with unknown function in cancer development. By using TaqMan.RTM. qRT-PCR validation of lnc-FANCI-2 and lnc-GLB1L2-1 in 24 normal, 25 CIN 2-3, and 23 cervical cancer tissues, it was confirmed that altered expression of these lnc-RNAs is remarkably related to cervical lesion progression from CIN to cancer. Moreover, the altered changes of lnc-FANCI-2 could be attributed to HPV16 and HPV18 infection in raft cultures and viral E7 expression. These lnc-RNAs are biomarkers for early diagnosis of high-risk HPV infection with high risk of progression and for development of intervention strategies to treat HPV-induced cancers.

As used herein, a non-coding RNA (ncRNA) is an RNA transcript that does not encode a protein. ncRNAs include short ncRNAs and long ncRNAs (lnc-RNAs). Short ncRNAs are ncRNAs that are generally 18-200 nucleotides (nt) in length. Examples of short ncRNAs include, but are not limited to, microRNAs (miRNAs), piwi-associated RNAs (piRNAs), short interfering RNAs (siRNAs), promoter-associated short RNAs (PASRs), transcription initiation RNAs (tiRNAs), termini-associated short RNAs (TASRs), antisense termini associated short RNAs (aTASRs), small nucleolar RNAs (snoRNAs), transcription start site antisense RNAs (TSSa-RNAs), small nuclear RNAs (snRNAs), retroposon-derived RNAs (RE-RNAs), 3'UTR-derived RNAs (uaRNAs), x-ncRNA, human Y RNA (hY RNA), unusually small RNAs (usRNAs), small NF90-associated RNAs (snaRs), vault RNAs (vtRNAs), small Cajal body-specific RNAs (scaRNAs), and telomere specific small RNAs (tel-sRNAs). lnc-RNAs are cellular RNAs, exclusive of rRNAs, greater than 200 nucleotides in length and having no obvious protein-coding capacity. Lnc-RNAs include, but are not limited to, large or long intergenic ncRNAs (lincRNAs), transcribed ultraconserved regions (T-UCRs), pseudogenes, GAA-repeat containing RNAs (GRC-RNAs), long intronic ncRNAs, antisense RNAs (aRNAs), promoter-associated long RNAs (PALRs), promoter upstream transcripts (PROMPTs), and long stress-induced non-coding transcripts (LSINCTs).

An RNA-binding protein is a protein that binds single or double stranded RNA to form ribonucleoprotein complexes. RBPs contain conserved structural motifs such as the RNA recognition motif (RRM), dsRNA binding domain, zinc finger domain, and others.

The biomarkers for detection and diagnosis of CIN and cervical cancer include the RBP and lnc-RNA biomarkers of Tables 1-3:

TABLE-US-00001 TABLE 1 RBP biomarkers SEQ ID NO: chr start end refseqID Symbol description 1 chr9 21967750 21975132 NM_000077 CDKN2A cyclin-dependent kinase inhibitor 2A (CDKN2A), transcript variant 1, mRNA. 2 chr9 21967750 21975132 NM_001195132 CDKN2A Homo sapiens cyclin-dependent kinase inhibitor 2A (CDKN2A), transcript variant 5, mRNA. 3 chr9 21967750 21994490 NM_058195 CDKN2A cyclin-dependent kinase inhibitor 2A (CDKN2A), transcript variant 4, mRNA. 4 chr9 21967750 21974826 NM_058197 CDKN2A cyclin-dependent kinase inhibitor 2A (CDKN2A), transcript variant 3, mRNA. 5 chr9 23690102 23821843 NM_001171195 ELAVL2 Homo sapiens ELAV (embryonic lethal, abnormal vision, Drosophila)-like 2 (Hu antigen B) (ELAVL2), transcript variant 2, mRNA. 6 chr9 23690102 23821478 NM_001171197 ELAVL2 Homo sapiens ELAV (embryonic lethal, abnormal vision, Drosophila)-like 2 (Hu antigen B) (ELAVL2), transcript variant 3, mRNA. 7 chr9 23690102 23826063 NM_004432 ELAVL2 ELAV (embryonic lethal, abnormal vision, Drosophila)-like 2 (Hu antigen B) (ELAVL2), transcript variant 1, mRNA. 8 chr17 37894575 37903538 NM_001030002 GRB7 growth factor receptor-bound protein 7 (GRB7), transcript variant 2, mRNA. 9 chr17 37895023 37903538 NM_001242442 GRB7 Homo sapiens growth factor receptor-bound protein 7 (GRB7), transcript variant 4, mRNA. 10 chr17 37896219 37903538 NM_001242443 GRB7 Homo sapiens growth factor receptor-bound protein 7 (GRB7), transcript variant 3, mRNA. 11 chr17 37894161 37903538 NM_005310 GRB7 growth factor receptor-bound protein 7 (GRB7), transcript variant 1, mRNA. 94 chr17 NM_001330207.1 GRB7 growth factor receptor-bound protein 7 (GRB7), transcript variant 5, mRNA. 12 chr7 75931874 75933614 NM_001540 HSPB1 heat shock 27 kDa protein 1 (HSPB1), mRNA. 13 chr19 6413118 6424822 NM_003685 KHSRP KH-type splicing regulatory protein (KHSRP), mRNA. 14 chr14 26915088 27066960 NM_002515 NOVA1 neuro-oncological ventral antigen 1 (NOVA1), transcript variant 1, mRNA. 15 chr14 26915088 27066960 NM_006489 NOVA1 neuro-oncological ventral antigen 1 (NOVA1), transcript variant 2, mRNA. 95 chr14 NM_006491.2 NOVA1 neuro-oncological ventral antigen 1 (NOVA1), transcript variant 3, mRNA. 16 chr19 797391 812327 NM_002819 PTBP1 polypyrimidine tract binding protein 1 (PTBP1), transcript variant 1, mRNA. 17 chr19 797391 812327 NM_031990 PTBP1 polypyrimidine tract binding protein 1 (PTBP1), transcript variant 2, mRNA. 18 chr19 797391 812327 NM_031991 PTBP1 polypyrimidine tract binding protein 1 (PTBP1), transcript variant 3, mRNA. 19 chr19 12917427 12924462 NM_006397 RNASEH2A ribonuclease H2, subunit A (RNASEH2A), mRNA.

TABLE-US-00002 TABLE 2 lnc-FANCI-2 isoforms SEQ ID Transcript ID NO: Location (hg19) Length lnc-FANCI-2: 1 20 chr15: 89904810-89938553 1613 lnc-FANCI-2: 10 21 chr15: 89921280-89938544 606 lnc-FANCI-2: 11 22 chr15: 89921331-89938354 551 lnc-FANCI-2: 12 23 chr15: 89921347-89939471 1877 lnc-FANCI-2: 13 24 chr15: 89921362-89938500 561 lnc-FANCI-2: 14 25 chr15: 89921794-89931745 786 lnc-FANCI-2: 15 26 chr15: 89922355-89938350 569 lnc-FANCI-2: 16 27 chr15: 89922468-89941720 3779 lnc-FANCI-2: 17 28 chr15: 89922495-89941719 3670 lnc-FANCI-2: 18 29 chr15: 89923111-89941720 3784 lnc-FANCI-2: 19 30 chr15: 89925731-89938271 779 lnc-FANCI-2: 2 31 chr15: 89904810-89938551 1611 lnc-FANCI-2: 20 32 chr15: 89929827-89939471 2718 lnc-FANCI-2: 21 33 chr15: 89930671-89941720 3723 lnc-FANCI-2: 22 34 chr15: 89904810-89941718 4778 lnc-FANCI-2: 23 35 chr15: 89911330-89941718 4113 lnc-FANCI-2: 24 36 chr15: 89911399-89941721 3936 lnc-FANCI-2: 25 37 chr15: 89912393-89941683 4026 lnc-FANCI-2: 26 38 chr15: 89921102-89941708 4334 lnc-FANCI-2: 27 39 chr15: 89921273-89941718 3868 lnc-FANCI-2: 28 40 chr15: 89922232-89941683 3978 lnc-FANCI-2: 29 41 chr15: 89923021-89941683 3837 lnc-FANCI-2: 3 42 chr15: 89905705-89922463 571 lnc-FANCI-2: 30 43 chr15: 89929880-89941721 4915 lnc-FANCI-2: 31 44 chr15: 89930027-89941721 4687 lnc-FANCI-2: 32 45 chr15: 89930389-89931372 706 lnc-FANCI-2: 33 46 chr15: 89930557-89941683 3922 lnc-FANCI-2: 34 47 chr15: 89931724-89941721 3690 lnc-FANCI-2: 35 48 chr15: 89932071-89941708 4093 lnc-FANCI-2: 4 49 chr15: 89905718-89938562 957 lnc-FANCI-2: 5 50 chr15: 89911330-89941718 2124 lnc-FANCI-2: 6 51 chr15: 89912386-89931074 576 lnc-FANCI-2: 7 52 chr15: 89918593-89941720 6547 lnc-FANCI-2: 8 53 chr15: 89921220-89941692 3814 lnc-FANCI-2: 9 54 chr15: 89921273-89941718 4198

TABLE-US-00003 TABLE 3 lnc-GLB1L2-1 isoforms SEQ ID Transcript ID NO: Location (hg19) Length lnc-GLB1L2-1: 1 55 chr11: 134306367-134337169 1402 bp lnc-GLB1L2-1: 10 56 chr11: 134350719-134372941 295 bp lnc-GLB1L2-1: 11 57 chr11: 134352524-134373110 374 bp lnc-GLB1L2-1: 12 58 chr11: 134306376-134375555 2737 bp lnc-GLB1L2-1: 13 59 chr11: 134339378-134360125 15706 bp lnc-GLB1L2-1: 14 60 chr11: 134339400-134373384 744 bp lnc-GLB1L2-1: 15 61 chr11: 134339400-134375553 1129 bp lnc-GLB1L2-1: 16 62 chr11: 134343291-134373078 1843 bp lnc-GLB1L2-1: 17 63 chr11: 134344051-134375009 1160 bp lnc-GLB1L2-1: 18 64 chr11: 134346572-134375009 572 bp lnc-GLB1L2-1: 19 65 chr11: 134349193-134375555 4435 bp lnc-GLB1L2-1: 2 66 chr11: 134306469-134308558 374 bp lnc-GLB1L2-1: 20 67 chr11: 134349983-134375009 1245 bp lnc-GLB1L2-1: 21 68 chr11: 134350411-134401542 537 bp lnc-GLB1L2-1: 3 69 chr11: 134306629-134374934 1863 bp lnc-GLB1L2-1: 4 70 chr11: 134336079-134357809 3679 bp lnc-GLB1L2-1: 5 71 chr11: 134336079-134357809 3620 bp lnc-GLB1L2-1: 6 72 chr11: 134344060-134350796 720 bp lnc-GLB1L2-1: 7 73 chr11: 134349193-134375507 4387 bp lnc-GLB1L2-1: 8 74 chr11: 134349731-134352843 1398 bp lnc-GLB1L2-1: 9 75 chr11: 134350086-134367700 939 bp

In additional aspects, the biomarker includes FAM83A (SEQ ID NO: 86; KJ895067.1), SEMA3F (SEQ ID NOs: 87-89; NM_004186.4; NM_001318800.1; NM_001318798.1), CLDN10 (SEQ ID NO: 90-91; NM_182848.3; NM_006984.4), ASRGL1 (SEQ ID NO: 92, 93; NM_001083926.1; NM_025080.3), or a combination thereof.

An RBP, lnc-RNA, or additional RNA biomarker is differentially expressed between two samples if the amount of the RBP, lnc-RNA, or additional RNA biomarker in one sample is statistically significantly different from the amount of the RBP, lnc-RNA, or additional RNA biomarker in the other sample. The expression level of an RBP, lnc-RNA, or additional RNA biomarker can be increased or decreased in a test sample relative to a reference sample. For example, an RBP gene, lnc-RNA, or additional RNA biomarker is differentially expressed in two samples if it is present at least about 120%, at least about 130%, at least about 150%, at least about 180%, at least about 200%, at least about 300%, at least about 500%, at least about 700%, at least about 900%, or at least about 1000% greater than it is present in the other sample, or if it is detectable in one sample and not detectable in the other.

Alternatively or additionally, an RBP gene, lnc-RNA, or additional RNA biomarker is differentially expressed in two sets of samples if the frequency of detecting the RBP gene, lnc-RNA, or additional RNA biomarker in samples is statistically significantly higher or lower than in the control samples. For example, an RBP gene, lnc-RNA, or additional RNA biomarker is differentially expressed in two sets of samples if it is detected at least about 120%, at least about 130%, at least about 150%, at least about 180%, at least about 200%, at least about 300%, at least about 500%, at least about 700%, at least about 900%, or at least about 1000% more frequently or less frequently observed in one set of samples than the other set of samples.

A test amount and a control amount of a biomarker can be either an absolute amount (e.g., number of copies/ml, nanogram/ml or microgram/ml) or a relative amount (e.g., relative intensity of signals).

Diagnostic samples for use in the methods described herein comprise nucleic acids suitable for providing polynucleotide, e.g., RNA, expression information. The sample contains cells from a tissue of the test patient. For example, when the HPV-associated pre-cancer or HPV-associated cancer is anal cancer, the tissue of the test patient contains anal cells; when the HPV-associated pre-cancer or HPV-associated cancer is vulvovaginal cancer, the tissue of the test patient contains vulvovaginal cells; when the HPV-associated pre-cancer or HPV-associated cancer is penile cancer, the tissue of the test patient contains penal cells; or when the HPV-associated pre-cancer or HPV-associated cancer is oropharyngeal cancer, the tissue of the test patient contains oropharyngeal cells.

In one aspect, samples for the methods disclosed herein contain cells from a patient's cervix. Exemplary test samples include a PAP smear, a vaginal wash, or a cervical biopsy sample. In certain aspects, the methods described herein include obtaining from the test patient the sample containing cells from the test patient's cervix.

In certain aspects, the test patient is a patient at risk for an HPV-associated pre-cancer or an HPV-associated cancer, such as a patient diagnosed with HPV infection or a patient at high risk for HPV infection.

In certain aspects, the test patient is a patient at high risk for cervical cancer such as a woman at high risk for HPV infection, a woman with a diagnosed HPV infection, a woman with a history of DES exposure, a woman with a previous history of gynecological cancer, a woman with an abnormal PAP test, a woman immunosuppressed due to AIDS or therapy following organ transplantation, or a woman with abnormal endometrial cells.

In certain aspects, the methods disclosed herein comprise detecting the expression level of one or more biomarkers as disclosed herein.

In addition, the methods disclosed herein include the comparison/correlation of the expression levels of biomarkers in the diagnostic sample from the test patient to a reference sample. Exemplary reference samples include a control sample from a patient or patients with no evidence of HPV-associated pre-cancer or HPV-associated cancer, a control sample from a patient or patients with HPV-associated pre-cancer, and a control sample from a patient or patients with HPV-associated cancer. Additional exemplary reference samples include a control sample from a patient or patients with no evidence of cervical cancer, a control sample from a cervical cancer patient or patients, or a control sample from a patient or patients with stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia. The reference sample can be a single sample from a control patient with a known disease state, or preferably samples from a plurality of subjects such that the reference expression level is averaged over the expression levels for a population of known disease state. Useful population sizes for a reference population are greater than 100 subjects, specifically about 500 subjects for each reference group (CIN 1, 2, 3 and cervical cancer), for example.

RNA can be extracted and purified from biological samples using suitable techniques that are known in the art, and several are commercially available (e.g., FormaPure.RTM. nucleic acid extraction kit, Agencourt.RTM. Biosciences, Beverly Mass., High Pure FFPE RNA Micro Kit, Roche Applied Science, Indianapolis, Ind.). RNA can be extracted from frozen tissue sections using TRIzol.RTM. (Invitrogen, Carlsbad, Calif.) and purified using RNeasy.RTM. Protect kit (Qiagen, Valencia, Calif.). RNA can be further purified using DNase I treatment (Ambion, Austin, Tex.) to eliminate any contaminating DNA. RNA concentrations can be made using a NanoDrop ND-1000 spectrophotometer (NanoDdrop Technologies, Rockland, Del.). RNA can be further purified to eliminate contaminants that interfere with cDNA synthesis by cold sodium acetate precipitation. RNA integrity can be evaluated by running electropherograms, and RNA integrity number (RIN, a correlative measure that indicates intactness of mRNA) can be determined using the RNA 6000 PicoAssay for the Bioanalyzer 2100 (Agilent Technologies, Santa Clara, Calif.).

Following sample collection and nucleic acid extraction, the nucleic acid portion of the sample comprising RNA that is or can be used to prepare the target polynucleotide(s) of interest can be subjected to one or more preparative reactions. These preparative reactions can include in vitro transcription (IVT), labeling, fragmentation, amplification, and other reactions. mRNA can first be treated with reverse transcriptase and a primer to create cDNA prior to detection, quantitation, or amplification; this can be done in vitro with purified mRNA or in situ, e.g., in cells or tissues affixed to a slide.

By "amplification" is meant a process of producing at least one copy of a nucleic acid, in this case an expressed RNA, and in many cases produces multiple copies. An amplification product can be RNA or DNA, and may include a complementary strand to the expressed target sequence. DNA amplification products can be produced initially through reverse transcription and then optionally from further amplification reactions. The amplification product may include all or a portion of a target sequence, and may optionally be labeled. A variety of amplification methods are suitable for use, including polymerase-based methods and ligation-based methods.

The expression level of a polynucleotide biomarker can be determined by reverse transcriptase-polymerase chain reaction (RT-PCR) methods, quantitative real-time RT-PCR (RT-qPCR), microarray, serial analysis of gene expression (SAGE), next-generation RNA sequencing (deep sequencing), gene expression analysis by massively parallel signature sequencing (MPSS), immunoassays such as ELISA, in situ hybridization (ISH) formulations that allow histopathological analysis, mass spectrometry (MS) methods, transcriptomics, RNA pull-down and chromatin isolation by RNA purification (ChiRP), proteomics-based identification of lncRNA, detection of single nucleotide polymorphisms (SNPs), measurement of DNA methylation or unmethylation, measurement of siRNA silencing or miRNA silencing, or measurement of downstream targets.

As used herein, the terms "quantitative real time polymerase chain reaction," "real-time polymerase chain reaction," and "qPCR" are synonymous and refer to a laboratory technique based on a polymerase chain reaction used to amplify and simultaneously quantify a targeted DNA molecule. Frequently, real-time PCR is combined with reverse transcription to quantify messenger RNA and non-coding RNA in cells or tissues, e.g., RT-qPCR.

Additional methods for detecting and/or quantifying a polynucleotide biomarker can comprise single-molecule sequencing (e.g., Illumina.RTM., PacBio, ABI SOLID.TM.), in situ hybridization, bead-array technologies (e.g., Luminex xMAP.RTM., Illumina.RTM. BeadChips), branched DNA technology (e.g., Affymetrix.RTM., Genisphere.RTM.), and Ion Torrent.TM.. In some instances, methods for detecting and/or quantifying a target sequence comprise transcriptome sequencing techniques. Transcription sequencing (e.g., RNA-seq, "Whole Transcriptome Shotgun Sequencing" (WTSS)) may comprise the use of high-throughput sequencing technologies to sequence cDNA in order to get information about a sample's RNA content. Transcriptome sequencing can provide information on differential expression of genes, including gene alleles and differently spliced transcripts, non-coding RNAs, post-transcriptional mutations or editing, and gene fusions.

Included herein is a method for measuring the expression levels of biomarkers for HPV-associated pre-cancers and cancers as described herein. The methods optionally include identifying HPV-associated pre-cancer or cancer status of a test subject (e.g., cervical cancer). The data obtained from the expression profiles of a population (e.g., normal, CIN1-3, or cervical cancer) can be evaluated using one or more pattern recognition algorithms. In addition, the results of imaging tests or histological evaluation may optionally be combined with expression profiles generated using the genes disclosed herein.

In one aspect, the methods include

comparing (correlating) the expression level of the first polynucleotide biomarker in the sample containing cells from a tissue of the test patient to a reference expression level of the first polynucleotide biomarker in a reference sample, wherein the reference sample is

a control sample from a patient or patients with no evidence of HPV-associated pre-cancer or HPV-associated cancer,

a control sample from a patient or patients with HPV-associated pre-cancer, or

a control sample from a patient or patients with HPV-associated cancer, and

determining, based on said correlation, if the test patient has HPV-associated pre-cancer or HPV-associated cancer

In another aspect, the methods comprise

predicting (or determining), based on the expression level of one or more polynucleotide biomarkers in the containing cells from a tissue of the test patient and a reference expression level of the one or more polynucleotide biomarkers in a reference sample that the patient has no HPV-associated pre-cancer or cancer, that the test patient has HPV-associated pre-cancer, or that the patient has HPV-associated cancer, wherein the reference sample is

a control sample from a patient or patients with no evidence of HPV-associated pre-cancer or HPV-associated cancer,

a control sample from a patient or patients with HPV-associated pre-cancer, or

a control sample from a patient or patients with HPV-associated cancer.

In a further aspect, the methods include

classifying the patient as having no cervical cancer or cervical intraepithelial neoplasia, or as having HPV-associated pre-cancer or cancer based on the expression level of one or more polynucleotide biomarkers in the sample containing cells from a tissue of the test patient and a reference expression level of the one or more polynucleotide biomarkers in a reference sample, wherein the reference sample is

a control sample from a patient or patients with no evidence of HPV-associated pre-cancer or HPV-associated cancer,

a control sample from a patient or patients with HPV-associated pre-cancer, or a control sample from a patient or patients with HPV-associated cancer.

In one aspect, the methods include

comparing (or correlating) the expression level of one or more polynucleotide biomarkers in the sample containing cells from the test patient's cervix to a reference expression level of the one or more polynucleotide biomarkers in a reference sample, wherein the reference sample is

a control sample from a patient or patients with no evidence of cervical cancer,

a control sample from a cervical cancer patient or patients, or

a control sample from a patient or patients with stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia, and

determining, based on said comparison, if the test patient has cervical cancer, or stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia.

In another aspect, the methods comprise

predicting (or determining), based on the expression level of one or more polynucleotide biomarkers in the sample containing cells from the test patient's cervix and a reference expression level of the one or more polynucleotide biomarkers in a reference sample that the patient has no cervical cancer or cervical intraepithelial neoplasia, that the test patient has cervical cancer, or that the patient has stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia, wherein the reference sample is

a control sample from a patient or patients with no evidence of cervical cancer,

a control sample from a cervical cancer patient or patients, or

a control sample from a patient or patients with stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia.

In a further aspect, the methods include

classifying the patient as having no cervical cancer or cervical intraepithelial neoplasia, as having cervical cancer, or as having stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia based on the expression level of one or more polynucleotide biomarkers in the sample containing cells from the test patient's cervix and a reference expression level of the one or more polynucleotide biomarkers in a reference sample, wherein the reference sample is

a control sample from a patient or patients with no evidence of cervical cancer,

a control sample from a cervical cancer patient or patients, or

a control sample from a patient or patients with stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia.

Analysis methods may be used to form a predictive model, and then the predictive model may be used to classify test data. For example, one convenient and particularly effective method of classification employs multivariate statistical analysis modeling, first to form a model (a "predictive mathematical model") using data ("modeling data") from samples of known class (e.g., from subjects known to have, or not have, a particular grade of CIN or cervical cancer), and second to classify an unknown sample (e.g., "test data"), according to HPV-associated (e.g., cervical) cancer status.

Pattern recognition (PR) is the use of multivariate statistics, both parametric and non-parametric, to analyze spectroscopic data, and hence to classify samples and to predict the value of some dependent variable based on a range of observed measurements. There are two main approaches. One set of methods is termed "unsupervised" and these simply reduce data complexity in a rational way and also produce display plots which can be interpreted by the human eye. The other approach is termed "supervised" whereby a training set of samples with known class or outcome is used to produce a mathematical model and is then evaluated with independent validation data sets.

Unsupervised PR methods are used to analyze data without reference to any other independent knowledge. Examples of unsupervised pattern recognition methods include principal component analysis (PCA), hierarchical cluster analysis (HCA), and non-linear mapping (NLM).

Alternatively, and in order to develop automatic classification methods, it has proved efficient to use a "supervised" approach to data analysis. Here, a "training set" of biomarker expression data is used to construct a statistical model that predicts correctly the "class" of each sample. This training set is then tested with independent data (referred to as a test or validation set) to determine the robustness of the computer-based model. These models are sometimes termed "expert systems," but may be based on a range of different mathematical procedures. Supervised methods can use a data set with reduced dimensionality (for example, the first few principal components), but typically use unreduced data, with all dimensionality. In all cases the methods allow the quantitative description of the multivariate boundaries that characterize and separate each class, for example, each class of cervical cancer in terms of its biomarker expression profile. It is also possible to obtain confidence limits on any predictions, for example, a level of probability to be placed on the goodness of fit. The robustness of the predictive models can also be checked using cross-validation, by leaving out selected samples from the analysis.

It is often useful to pre-process data, for example, by addressing missing data, translation, scaling, weighting, etc. Multivariate projection methods, such as principal component analysis (PCA) and partial least squares analysis (PLS), are so-called scaling sensitive methods. By using prior knowledge and experience about the type of data studied, the quality of the data prior to multivariate modeling can be enhanced by scaling and/or weighting. Adequate scaling and/or weighting can reveal important and interesting variation hidden within the data, and therefore make subsequent multivariate modeling more efficient. Scaling and weighting may be used to place the data in the correct metric, based on knowledge and experience of the studied system, and therefore reveal patterns already inherently present in the data.

The methods described herein may be implemented and/or the results recorded using a device capable of implementing the methods and/or recording the results. Examples of devices that may be used include but are not limited to electronic computational devices, including computers of all types. When the methods described herein are implemented and/or recorded in a computer, the computer program that may be used to configure the computer to carry out the steps of the methods may be contained in any computer readable medium capable of containing the computer program. Examples of computer readable medium that may be used include but are not limited to diskettes, CD-ROMs, DVDs, ROM, RAM, and other memory and computer storage devices. The computer program that may be used to configure the computer to carry out the steps of the methods and/or record the results may also be provided over an electronic network, for example, over the internet, an intranet, or other network.

The process of comparing a measured value and a reference value can be carried out in a convenient manner appropriate to the type of measured value and reference value for the discriminative gene at issue. "Measuring" can be performed using quantitative or qualitative measurement techniques, and the mode of comparing a measured value and a reference value can vary depending on the measurement technology employed. For example, when a qualitative colorimetric assay is used to measure expression levels, the levels may be compared by visually comparing the intensity of the colored reaction product, or by comparing data from densitometric or spectrometric measurements of the colored reaction product (e.g., comparing numerical data or graphical data, such as bar charts, derived from the measuring device). However, it is expected that the measured values used in the methods will most commonly be quantitative values. In other examples, measured values are qualitative. As with qualitative measurements, the comparison can be made by inspecting the numerical data, or by inspecting representations of the data (e.g., inspecting graphical representations such as bar or line graphs).

The process of comparing may be manual (such as visual inspection by the practitioner of the method) or it may be automated. For example, an assay device (such as a luminometer for measuring chemiluminescent signals) may include circuitry and software enabling it to compare a measured value with a reference value for a biomarker. Alternately, a separate device (e.g., a digital computer) may be used to compare the measured value(s) and the reference value(s). Automated devices for comparison may include stored reference values for the biomarker(s) being measured, or they may compare the measured value(s) with reference values that are derived from contemporaneously measured reference samples (e.g., samples from control subjects).

As will be apparent to those of skill in the art, when replicate measurements are taken, the measured value that is compared with the reference value is a value that takes into account the replicate measurements. The replicate measurements may be taken into account by using either the mean or median of the measured values as the "measured value."

When it has been determined that the test patient has HPV-pre-cancer or cancer, the methods optionally include HPV detection and or typing.

When it has been determined that the test patient has CIN 1, 2, or 3 cervical cancer, the methods optionally include HPV detection and or typing, for example, using the Cobas.RTM. HPV test marketed by Roche Diagnostics.

Also included herein are methods of treating the test patient with an interventional strategy for HPV-associated pre-cancer or cancer.

Interventional therapies for anal, vulvovaginal, penile, and oropharyngeal cancer include radiation therapy, surgery, and chemotherapy.

Further included herein are methods of treating the test patient with an interventional strategy for CIN or cervical cancer. When the patient is determined to have stage 1 CIN, the interventional strategy may include screening for further cervical changes, screening the patient for HPV infection, HPV typing, or a combination thereof. Exemplary tests for the detection of HPV infection include detection of HPV infection via DNA/RNA amplification with PCR using, for example, the Cobas.RTM. HPV test marketed by Roche Diagnostics. Advantageously, early identification of CIN 1 optionally coupled with determining the HPV infection type will provide critical information regarding the type of intervention required to treat the patient. Early diagnosis and treatment at stage CIN 1 could prevent or slow progression to later disease stages.

When the patient is determined to have stage 2 or stage 3 CIN, interventional strategies may include, in addition to monitoring, cryosurgery to freeze abnormal cells, laser therapy to remove abnormal tissue, loop electrosurgical procedure excision, surgery to remove abnormal tissue, or hysterectomy. At early stages, for example, low cost outpatient procedures such as loop electrosurgical excision are 90-95% effective. Thus, a benefit to the methods disclosed herein is the ability to use minor surgical intervention before CIN progresses to cervical cancer.

Interventional strategies for the treatment of cervical cancer include surgery, radiation therapy, chemotherapy, targeted therapy, or a combination thereof. Surgery involves removal of the cancer and may include conization to remove tissue from the cervix and/or cervical canal or hysterectomy such as total, radical, modified radical hysterectomy. Radiation therapy includes internal and external radiation therapy in addition to intensity-modulated radiation therapy. Chemotherapy involves the use of drugs to inhibit the growth of cancer calls and can involve systemic or regional chemotherapy. Drugs approved for the treatment of cervical cancer include bleomycin, cisplatin, topotecan hydrochloride, and gemcitabine-cisplatin. Targeted therapy involves the use of drugs that identify and attack specific cancer cells without harming normal cells. Targeted therapy includes antibody therapy such as bevacizumab therapy.

Further disclosed herein, is a probe set for diagnosing, predicting, and/or monitoring cervical cancer in a subject. The probe set comprises a plurality of polynucleotide probes capable of detecting an expression level of at least one biomarker for CIN or cervical cancer, wherein the expression level determines the CIN or cervical cancer status of the subject.

In one aspect, a probe set comprises

one or more polynucleotides that hybridizes to a first polynucleotide biomarker, wherein the first polynucleotide biomarker is GRB7 (SEQ ID NOs: 8-11), NOVA1 (SEQ ID Nos: 14 and 15), RNASEH2A (SEQ ID NO: 19), or a combination thereof, and

one or more polynucleotides that hybridizes to a second polynucleotide biomarker, wherein the second polynucleotide biomarker is lnc-FANCI-2, lnc-GLB1L2-1, or a combination thereof.

In certain aspects, the probe set is attached to a solid support, and/or each member of the probe set comprises a detectable moiety.

One skilled in the art understands that the nucleotide sequence of the polynucleotide probe need not be identical to its target sequence in order to specifically hybridize thereto. The polynucleotide probes, therefore, comprise a nucleotide sequence that is at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, or more identical to a region of the coding target or non-coding target. Methods of determining sequence identity are known in the art and can be determined, for example, by using the BLASTN program of the University of Wisconsin Computer Group (GCG) software or provided on the NCBI website. The nucleotide sequence of the polynucleotide probes may exhibit variability by differing (e.g. by nucleotide substitution, including transition or transversion) at one, two, three, four or more nucleotides from the sequence of the coding target or non-coding target.

Primers/probes based on the nucleotide sequences of target sequences can be used in amplification of the target sequences. For use in amplification reactions such as PCR, a pair of primers can be used. The exact composition of the primer sequences is selected so that the primers hybridize to specific sequences of the probe set under stringent conditions, particularly under conditions of high stringency. The pairs of primers are usually chosen so as to generate an amplification product of at least about 50 nucleotides, more usually at least about 100 nucleotides. Algorithms for the selection of primer sequences are generally known, and are available in commercial software packages. These primers may be used in standard quantitative or qualitative PCR-based assays to assess transcript expression levels of RNAs defined by the probe set. Alternatively, these primers may be used in combination with probes, such as molecular beacons in amplifications using real-time PCR.

The polynucleotide probes or primers can incorporate moieties useful in detection, isolation, purification, or immobilization, if desired. Such moieties are detectable labels, such as radioisotopes, fluorophores, chemiluminophores, enzymes, colloidal particles, and fluorescent microparticles, as well as antigens, antibodies, haptens, avidin/streptavidin, biotin, haptens, enzyme cofactors/substrates, enzymes, and the like. A label can optionally be attached to or incorporated into a probe or primer polynucleotide to allow detection and/or quantitation of a target polynucleotide representing the target sequence of interest.

In some embodiments, one or more polynucleotide probes/primers provided herein can be provided on a substrate. The substrate can comprise a wide range of material, either biological, nonbiological, organic, inorganic, or a combination of any of these. For example, the substrate may be a polymerized Langmuir Blodgett film, functionalized glass, Si, Ge, GaAs, GaP, SiO.sub.2, SiN.sub.4, modified silicon, or any one of a wide variety of gels or polymers such as (poly)tetrafluoroethylene, (poly)vinylidenedifluoride, polystyrene, cross-linked polystyrene, polyacrylic, polylactic acid, polyglycolic acid, poly(lactide coglycolide), polyanhydrides, poly(methyl methacrylate), poly(ethylene-co-vinyl acetate), polysiloxanes, polymeric silica, latexes, dextran polymers, epoxies, polycarbonates, or combinations thereof. Conducting polymers and photoconductive materials can be used.

Substrates can be planar crystalline substrates such as silica based substrates (e.g., glass, quartz, or the like), or crystalline substrates used in, e.g., the semiconductor and microprocessor industries, such as silicon, gallium arsenide, indium doped GaN and the like, and include semiconductor nanocrystals.

The substrate can take the form of an array, a photodiode, an optoelectronic sensor such as an optoelectronic semiconductor chip or optoelectronic thin-film semiconductor, or a biochip. The location(s) of probe(s) on the substrate can be addressable; this can be done in highly dense formats, and the location(s) can be microaddressable or nanoaddressable.

The substrate can be a plate, slide, bead, pellet, disk, particle, microparticle, nanoparticle, strand, precipitate, optionally porous gel, sheets, tube, sphere, capillary, film, chip, multiwell plate or dish, optical fiber, etc. The substrate can be a form that is rigid or semi-rigid. The substrate may contain raised or depressed regions on which an assay component is located. The surface of the substrate can be etched using known techniques to provide for desired surface features, for example trenches, v-grooves, mesa structures, or the like.

Surfaces on the substrate can be composed of the same material as the substrate or can be made from a different material, and can be coupled to the substrate by chemical or physical means. Such coupled surfaces may be composed of any of a wide variety of materials, for example, polymers, plastics, resins, polysaccharides, silica or silica-based materials, carbon, metals, inorganic glasses, membranes, or any of the above-listed substrate materials. The surface can be optically transparent and can have surface Si--OH functionalities, such as those found on silica surfaces.

The substrate and/or its optional surface can be chosen to provide appropriate characteristics for the synthetic and/or detection methods used. The substrate and/or surface can be transparent to allow the exposure of the substrate by light applied from multiple directions. The substrate and/or its surface is generally resistant to, or is treated to resist, the conditions to which it is to be exposed in use, and can be optionally treated to remove any resistant material after exposure to such conditions.

The substrate or a region thereof may be encoded so that the identity of the sensor located in the substrate or region being queried may be determined. A suitable coding scheme can be used, for example optical codes, RFID tags, magnetic codes, physical codes, fluorescent codes, and combinations of codes.

The invention is further illustrated by the following non-limiting examples.

EXAMPLES

Materials and Methods

Human patient samples: Samples for RNA sequencing, containing 7 normal cervical tissues, 7 pre-cancer tissues and 7 cervical cancer tissues, and samples for validation, including 24 normal cervical tissues, 25 CIN 2-3 tissues, and 23 cervical cancer tissues, were all collected from the Women's Hospital, School of Medicine, Zhejiang University. All the human samples were used in accordance with the Institutional Review Board procedures of the hospital. Informed consent was obtained from each participant prior to the study. Samples were snap-frozen and stored at -80.degree. C. until use.

RNA isolation: RNA was isolated from each human tissue sample by TRIzol.RTM. (Invitrogen, CA, USA) according to the instructions provided by the manufacturer. Total RNA quality and quantity were verified spectrophotometrically (NanoDrop ND-1000 spectrometer; Thermo Scientific, DE, USA) and electrophoretically (Bioanalyzer 2100; Agilent Technologies, CA, USA).

RNA sequencing and mapping: RNA-seq libraries were prepared using TruSeq.RTM. Stranded Total RNA Sample Preparation Kit with Ribo-Zero.TM. depletion and sequenced on an Illumina.RTM. HiSeq.TM.-2500 platform as paired-end reads. In brief, high-quality of human total RNA (1 .mu.g) was Ribo-Zero.TM. depleted, fragmented, and then reverse transcribed. The double-stranded cDNA were A-tailed and ligated with Illumia.RTM. sequencing adapters. Subsequently, the ligated products were enriched by PCR and size-selected by agarose gel electrophoresis. The products of approximately 200-400-bp in size were sequenced by the Illumina.RTM. HiSeq.TM.-2500 platform. The raw data in fastq format were mapped to the human reference genome (hg19, GRCh37) by Tophat v2.0.11(-g 1), which had the aligner Bowtie (v2.2.1.0) with the parameter settings (-N 0, -L 20, -i S,1,1.25, -n-ceil L,0,0.15 and -gbar 4). The mapping results were further sorted in coordination position by samtools (v0.1.19.0) (Robinson M D, Oshlack A., "A scaling normalization method for differential expression analysis of RNA-seq data," Genome Biology, 11:R25 (2010); Robinson M D, McCarthy D J and Smyth G K., "edgeR: a Bioconductor package for differential expression analysis of digital gene expression data," Bioinformatics, 26, pp. 139-140 (2010)). The latest annotation of LncRNA was downloaded from the publicly available Incipedia database version 3.0. The mapped reads in individual lncRNA region of each sample were counted by bedtools (v2.19.0). The R Bioconductor edgeR package was used to normalize raw reads by the scaling method. Differentially expressed lncRNAs were identified by one-way ANOVA method with 10% false discovery rate (FDR) and four-fold changes between the conditions. The FDR was controlled by the Benjamini-Hochberg (BH) procedure. RNA-binding protein genes were compiled from the literature (Alfredo Castello, et al., "Insights into RNA Biology from an Atlas of Mammalian mRNA-Binding Proteins," Cell, 149, pp. 1393-1406 (2012); Alfredo Castello, et al., "RNA-binding proteins in Mendelian disease," Trends in Genetics, 29, pp. 318-327 (2013)). The normalized reads from the multiple transcripts of each gene were averaged to represent composite gene expression. The expression results were clustered using unsupervised hierarchical clustering analysis, in which the Euclidean Distance is used as the similarity measure.

Human primary keratinocytes and organotypic (raft) epithelial cultures: Total RNA extracted from various raft tissues were leftovers from previous studies (Wang, X. et al., "Oncogenic HPV infection interrupts the expression of tumor-suppressive miR-34a through viral oncoprotein E6," RNA, 15, pp. 637-647 (2009); Wang, X., et al., "microRNAs are biomarkers of oncogenic human papillomavirus infections," Proc. Natl. Acad. Sci. USA, 111, pp. 4262-4267 (2014)). Briefly, primary human foreskin keratinocytes (HFK) and primary human vaginal keratinocytes (HVK) were isolated from newborn circumcision and adult vaginectomy tissue specimens, respectively, as previously described (Meyers, C., Mayer, T. J., and Ozbun, M. A., "Synthesis of infectious human papillomavirus type 18 in differentiating epithelium transfected with viral DNA," J. Virol., 71, pp, 7381-7386 (1997)). Keratinocytes were grown in monolayer culture by using epithelial (E) medium plus epidermal growth factor (5 ng/ml) in the presence of mitomycin C (4 .mu.g/ml)-treated J2 3T3 feeder cells. Keratinocyte lines stably maintaining HPV16 and HPV18 DNA following electroporation were subcloned by limiting dilutions of cells. Organotypic (raft) epithelial culture tissues derived from HPV16 and HPV18-immortalized HFK or HVK were prepared as described previously (McLaughlin-Drubin, M. E. and Meyers, C., "Propagation of infectious, high-risk HPV in organotypic "raft" culture," Methods Mol. Med., 119, pp. 171-186 (2005)). The stratified and differentiated raft culture epidermal tissues were collected free from collagen (no fibroblasts) on day 10 and frozen on dry ice for total cell RNA preparation. Additional productive HPV18 raft cultures of HFKs were obtained by Cre-loxP-mediated recombination as described (Wang, H. K., Duffy, A. A., Broker, T. R., and Chow, L. T., "Robust production and passaging of infectious HPV in squamous epithelium of primary human keratinocytes", Genes Dev., 23, pp. 181-194 (2009)), and the derived raft cultures were collected on day 8, day 12, and day 16.

Plasmid pLJd-HPV-18URR-E6, pLC-HPV-18URR-E7, and pLJd-HPV-18URR-E6E7 have been described (Cheng, S., Schmidt-Grimminger, D. C., Murant, T., Broker, T. R., and Chow, L. T., "Differentiation-dependent up-regulation of the human papillomavirus E7 gene reactivates cellular DNA replication in suprabasal differentiated keratinocytes.," Genes Dev., 9, pp. 2335-2349 (1995); Genovese, N. J., Banerjee, N. S., Broker, T. R., and Chow, L. T., "Casein kinase II motif-dependent phosphorylation of human papillomavirus E7 protein promotes p130 degradation and S-phase induction in differentiated human keratinocytes," J. Virol., 82, pp. 4862-4873 (2008)). Retroviruses derived from the above vectors were prepared as described (Banerjee, N. S., Chow, L. T., and Broker, T. R., "Retrovirus-mediated gene transfer to analyze HPV gene regulation and protein functions in organotypic "raft" cultures," Methods Mol. Med., 119, pp. 187-202 (2005)). Primary HFKs were acutely infected with the retroviruses and selected with G-418 (300 .mu.g/mL). The selected HFKs were used to establish epithelial raft cultures and harvested on day 11.

TaqMan.RTM. real-time quantitative PCR assays: Quantitative validation of genes in clinical samples and raft tissues was analyzed by real-time PCR TaqMan.RTM. gene expression assays (Applied Biosystems). In brief, 2 .mu.g of total RNA from each sample was reversely transcribed using Superscript.RTM. First-stand Synthesis kit (Invitrogen) according to the manufacturer's instructions. TaqMan.RTM. gene expression assays for RNA-binding protein gene expression were obtained from life technologies and lncRNA primers for RT-qPCR were designed as given in Example 2.

The TaqMan.RTM. assay probes that span over exon-exon junctions were designed to amplify spliced RNA products to avoid detection of any contaminated residual genomic DNA in our RNA samples. After reverse transcription, PCR products were amplified from the cDNA samples using TaqMan.RTM. gene expression Master Mix (Applied Biosystems) together with TaqMan.RTM. gene expression assays on a StepOne Plus.TM. Real-Time PCR system (Applied Biosystems). Gene enrichment was calculated using the 2.sup.-.DELTA..DELTA.Ct method in relation to the housekeeping gene GAPDH. The mean Ct value of a given gene from 24 normal cervical tissues after normalization was served as a basal level to calculate a relative level of the gene detected in each clinical sample. Data are presented as a bar graph with mean.+-.SE for each group. Significance of mRNA levels among clinical tissue groups was analyzed using the nonparametric Mann-Whitney U-test, while significance of the mRNA levels between raft culture tissue groups was analyzed by Student t-test.

Example 1: Identification of Altered Expression of RNA-Binding Protein Genes in Cervical Cancer

Using RNA-sequencing (RNA-Seq) approach, seven normal cervical tissues and seven cervical cancer tissues were examined for their expression landscapes of approximately 19,000 coding and 113,513 noncoding RNAs. We identified 614 differentially expressed coding transcripts enriched in cancer related pathways and 95 of them encoding RNA-binding proteins (RBPs) from the analyzed 1502 human RBPs. Moreover, we identified 34 differentially, abundantly expressed lnc-RNAs from normal cervix to cervical cancer. Table 4 shows the two RNA-Seq analyses of 14 different clinical cervical tissues with two different RNA-seq platforms, each containing normal cervical tissues without HPV infection and cervical cancer tissues with HPV infection. The right column of the table shows the raw reads of individual samples from each RNA-Seq platform.

TABLE-US-00004 TABLE 4 RNA-Seq detection from 14 cervical tissue samples Sample No. Age (yr) Pathology HPV infection Total reads RNA-Seq-1 1 27 N No 13,171,863 2 38 N No 12,028,762 3 42 N No 31,143,321 4 40 SCC Yes 12,422,476 5 42 SCC Yes 11,425,454 6 24 SCC Yes 22,302,605 RNA-Seq-2 7 42 N No 85,255,279 8 37 N No 83,376,820 9 52 N No 80,265,055 10 44 N No 81,954,460 11 48 SCC Yes 66,982,821 12 45 SCC Yes 74,819,347 13 47 SCC Yes 93,579,886 14 49 SCC Yes 66,891,722

FIG. 1 is a flowchart of the RNA-Seq analyses. FIG. 2 shows Venn diagrams and FIG. 3 shows a heat map showing 95 differentially expressed RNA-binding protein genes in cervical cancer (n=7) compared to normal cervical tissues (n=7). Table 5 summarizes the 8 RBPs with expression changes between normal and cancer tissues by RNA-Seq. (CPM: Counts per Million)

TABLE-US-00005 TABLE 5 RNA-Seq data of the 8 RBP genes between normal and cancer tissues Normal Cancer RNA-binding (log.sub.2 CPM, (log.sub.2 CPM, protein genes Description mean .+-. SD) mean .+-. SD) CDKN2A Cyclin-dependent -0.24 .+-. 0.88 6.3 .+-. 1.12 kinase inhibitor 2A ELAVL2 ELAV like neuron- -3.38 .+-. 1.89 0.17 .+-. 3.54 specific RNA binding protein 2 GRB7 Growth factor receptor- 0.9 .+-. 0.96 4.07 .+-. 1.22 bound protein 7 HSPB1 Heat shock 27 kDa 5.74 .+-. 1.09 8.84 .+-. 2.49 protein 1 KHSRP KH-type splicing 4.35 .+-. 0.18 5.85 .+-. 0.78 regulatory protein NOVA1 Neuro-oncological 2.82 .+-. 0.55 0.1 .+-. 1.55 ventral antigen 1 PTBP1 Polypyrimidine tract 5.74 .+-. 0.21 7.18 .+-. 0.83 binding protein 1 RNASEH2A Ribonuclease H2, 2.32 .+-. 0.47 5.01 .+-. 0.72 subunit A

Table 6 provides the TaqMan.RTM. probe information of each RBP.

TABLE-US-00006 TABLE 6 TaqMan .RTM. probe information of each RBP Company Order name Cat No ID No Applied Single Tube Cat. # 4331182 Hs00918009_g1 Biosystems .RTM. TaqMan .RTM. Assay for GRB7 Applied Single Tube Cat. # 4331182 Hs00270011_m1 Biosystems .RTM. TaqMan .RTM. Assay for ELAVL2 Applied Single Tube Cat. # 4331182 Hs00958451_g1 Biosystems .RTM. TaqMan .RTM. Assay for RNASEH2A Applied Single Tube Cat. # 4351372 Hs01100863_g1 Biosystems .RTM. TaqMan .RTM. Assay for KHSRP Applied Single Tube Cat. # 4351372 Hs01103130_m1 Biosystems .RTM. TaqMan .RTM. Assay for NOVA1 Applied Single Tube Cat. # 4351372 Hs00914687_g1 Biosystems .RTM. TaqMan .RTM. Assay for PTBP1 Applied Single Tube Cat. # 4331182 Hs00923894_m1 Biosystems .RTM. TaqMan .RTM. Assay for CDKN2A Applied Single Tube Cat. # 4331182 Hs03044127_g1 Biosystems .RTM. TaqMan .RTM. Assay for HSPB1

FIG. 4 shows the TaqMan.RTM. RT-qPCR validation confirming that all 8 RBPs significantly increased (7 RBPs) or decreased (1 RBP) in cervical cancer tissues (n=23), compared to normal cervical tissues (n=24). 7 increased RBP genes in cervical cancer were also shown higher expression in pre-cancerous lesions (CIN 2-3, n=25) when compared to the normal tissues, indicating these changes appear even at the early stage of cervical carcinogenesis. **, P<0.01; ***, P<0.001; NS, no statistics significance.

FIGS. 5 and 6 show that high-risk HPV16 and HPV18 infection affects the expression of RBPs. FIG. 5 shows Total RNA extracted from human vaginal keratinocyte (HVK)-derived raft cultures with (HVK16) or without (HVK) productive HPV16 infection and human foreskin keratinocyte (HFK) derived raft cultures with (HFK16) or without (HFK) productive HPV16 infection were examined by TaqMan.RTM. RT-qPCR for the expression of 8 RBPs. *, P<0.05; **, P<0.01; ***, P<0.001; NS, no statistics significance. FIG. 6 shows Total RNA extracted from human vaginal keratinocyte (HVK)-derived raft cultures with (HVK18) or without (HVK) productive HPV18 infection and human foreskin keratinocyte (HFK) derived raft cultures with (HFK18) or without (HFK) productive HPV18 infection were examined by TaqMan.RTM. RT-qPCR for the expression of 8 RBPs. *, P<0.05; ***, P<0.001; NS, no statistics significance. FIG. 7 shows that both HPV16 and HPV18 increase the expression of CDKN2A and RNASEH2A, but decrease the expression of NOVA1 in HFK- and HVK-derived rafts. In this experiment, total RNA was used to determine the relative levels of individual proteins by TaqMan.RTM. RT-qPCR. FIG. 8 shows that HPV18 infection and viral E6 and/or E7 affect the expression of RNASEH2A and Nova1. The expression of RNASEH2A and NOVA1 in primary human keratinocytes (PHK)-derived raft tissues with or without HPV18 infection on day 8, day 12, and day 16 or PHK rafts transduced with a retrovirus expression HPV18 E6, E7 or E6E7 or with an empty control retrovirus were further validated by TaqMan.RTM. RT-qPCR. These results demonstrate that RNASEH2A and NOVA1 respond to HPV18 infection and their altered expression in cervical cancer could be attributed to viral oncoprotein E6 and/or E7. *, P<0.05; ***, P<0.001; NS, no statistics significance.

FIG. 9 shows that knockdown or overexpression of RNASEH2A in HeLa or CaSki cells affects cell proliferation. Specific-siRNA knockdown or ectopic expression of RNASEH2A from a mammalian expression vector in HeLa or CaSki cells on cell proliferation was evaluated by Cell Counting Kit-8 (CCK-8) assay at time indicated. si-NS, non-specific siRNA; siRNASEH2A, RNASEH2A-specific siRNA; P, control vector; p-RNASEH2A, RNASEH2A-expression vector. FIG. 10 shows HPV oncoprotein E7 regulates the expression of RNASEH2A via E2F1. Specific-siRNA knockdown or ectopic expression of E2F1 from a mammalian expression vector in HeLa or CaSki cells on RNASEH2A was evaluated by Western blot using anti-RNASEH2A antibody. si-NS, non-specific siRNA; si-E2F1, E2F1-specific siRNA; P, control vector; p-E2F1, E2F1-expression vector.

Example 2: The Expression Profile of Long Noncoding RNAs Distinguishes Normal Cervix from and Cancerous Cervix

RNA was extracted from each sample using Trizol.RTM. reagent (Life technologies). RNAseq libraries were prepared using TruSeq.RTM. Stranded Total RNA Kit with Ribo-Zero depletion and sequenced on an Illumina HiSeq.TM. 2000 platform as paired-end reads. The fastq data were mapped to human reference genome (hg19, GRCh37) by Bowtie (v2.2.1.0), and the mapping results were further filtered by samtools (v0.1.19.0). The latest annotation of LncRNA was downloaded from Incipedia database version 3.0. We counted the mapped reads in individual lncRNA region of each sample by bedtools (v2.19.0). The R Bioconductor edgeR package was used to normalize raw reads by the scaling method. The differentially expressed lncRNAs were detected by one-way ANOVA method with 10% false discovery rate (FDR) and four fold changes between the conditions. FIG. 11 is a flow chart of the RNA-Seq analysis. FIG. 12 is a heat map showing 34 overlapped, differentially expressed lnc-RNAs in cervical cancer compared to normal cervical tissues. lnc-FANCI-2 and lnc-GLB1L2-1 were specifically identified as associated with cervical cancer. Tables 2 and 3 list all of the isoforms of these two lnc-RNAs.

TABLE-US-00007 Taqman .RTM. primer design for lnc-FANCI-2 Exon 6: (SEQ ID NO: 76) CTGGAAAGGAGGAGAACATGAAACATTGCTTGAAGACAATGGCCGAGACA GCAGGTCCCACCCTGCACAGCCACCAGCATCTCTCCCCTCAGCCCTGTCT CCTCTTCTGCAGTTGGGATCTGCACATTTAAGCCTGAA Exon 7: (SEQ ID NO: 77) ATTGTCCTGTGAAGTGAAGTATGATCGGACAGCCTCTTTTCAGCTTTTAT GACAATGGAGACAGAGGAATTGTGGCTCTTGCCAAGGTCACAGGATTGGA ATACAGAGCCAAGCCACCCCAGGACATGCAAGAGCCTCAGAAGGGAA Primers for RT-qPCR Forward: (SEQ ID NO: 78) 5'- ACAGCCACCAGCATCTCTC -3' Probe: (SEQ ID NO: 79) 5'- TGAAGTGAAGTATGATCGGACAGCCTC -3' Reverse: (SEQ ID NO: 80) 5'- CCACAATTCCTCTGTCTCCATT -3' TaqMan .RTM. primer design for lnc-GLB1L2-1: Last Exon 3: (SEQ ID NO: 81) TCTCTCATCTGTGTTTTCAGGGCATGGACTGGAACTCCCAATACCCCTGA CATGGGCTGAGTCAACGTGGTCATGAACATGTGACAGGAG Last Exon 2: (SEQ ID NO: 82) GCAGCAGAAGTTGCAGAGAAGAGTGAGGCACGTTTGAAAAAGGCTGAAAA ATGTTTCTGTCCAGGCAAGGGTGTGTGCTGAATGACTCAAGGATTTTTTG G Primers for RT-qPCR Forward: (SEQ ID NO: 83) 5'- CATGGACTGGAACTCCCAATA -3' Probe: (SEQ ID NO: 84) 5'- TGCAGAGAAGAGTGAGGCACGTTTG -3' Reverse: (SEQ ID NO: 85) 5'- CCTTGCCTGGACAGAAACATT -3'

FIG. 13 shows an increase of lnc-FANCI-2, and decrease of lnc-GLB1L2-1 expression along with the cervical lesion progression from normal cervix. Lnc-FANCI-2 and lnc-GLB1L2-1 RNA expression was examined by RT-qPCR in 24 normal, 25 CIN 2-3, and 23 cancer tissues. FIG. 14 shows that HPV infection increases lnc-FANCI-2 expression in HVK- and PHK-derived rafts and viral E7 or E6 is responsible for the increase. The expression of lnc-FANCI-2 in human vaginal keratinocytes (HVK)-derived raft tissues without (HVK) or with HPV16 (HVK16) or HPV18 (HVK18) infection or primary human keratinocytes (PHK)-derived raft tissues without or with HPV18 infection on day 8, day 12, and day 16 or PHK rafts transduced with a retrovirus expressing HPV18 E6, E7 or E6E7 or with an empty control retrovirus were further validated by RT-qPCR. These results demonstrate that lnc-FANCI-2 expression responds to HPV18 infection and viral oncoprotein E6 and/or E7.

In data not shown, lnc-FANCI-2 was upregulated in isolated keratinocyte lines infected by high-risk HPVs, but not low risk HPV11 and epidermodysplasia verruciformis-associated HPV5 and 10.

The term "polynucleotide" as used herein refers to a polymer of greater than one nucleotide in length of ribonucleic acid (RNA), deoxyribonucleic acid (DNA), hybrid RNA/DNA, modified RNA or DNA, or RNA or DNA mimetics, including peptide nucleic acids (PNAs). The polynucleotides may be single- or double-stranded. The term includes polynucleotides composed of naturally-occurring nucleobases, sugars, and covalent internucleoside (backbone) linkages as well as polynucleotides having non-naturally-occurring portions which function similarly. Such modified or substituted polynucleotides are well known in the art and are referred to as "analogues."

"Complementary" or "substantially complementary" refers to the ability to hybridize or base pair between nucleotides or nucleic acids, such as, for instance, between a sensor peptide nucleic acid or polynucleotide and a target polynucleotide. Complementary nucleotides are, generally, A and T (or A and U), or C and G. Two single-stranded polynucleotides or PNAs are said to be substantially complementary when the bases of one strand, optimally aligned and compared and with appropriate insertions or deletions, pair with at least about 80% of the bases of the other strand, usually at least about 90% to 95%, and more preferably from about 98 to 100%.

Alternatively, substantial complementarity exists when a polynucleotide may hybridize under selective hybridization conditions to its complement. Typically, selective hybridization may occur when there is at least about 65% complementarity over a stretch of at least 14 to 25 bases, for example at least about 75%, or at least about 90% complementarity.

The term "homologous region" refers to a region of a nucleic acid with homology to another nucleic acid region. Whether a "homologous region" is present in a nucleic acid molecule is determined with reference to another nucleic acid region in the same or a different molecule.

Hybridization conditions typically include salt concentrations of less than about 1M, more usually less than about 500 mM, for example, less than about 200 mM. In the case of hybridization between a peptide nucleic acid and a polynucleotide, the hybridization can be done in solutions containing little or no salt. Hybridization temperatures can be as low as 5.degree. C., but are typically greater than 22.degree. C., and more typically greater than about 30.degree. C., for example in excess of about 37.degree. C. Longer fragments may require higher hybridization temperatures for specific hybridization as is known in the art. Other factors may affect the stringency of hybridization, including base composition and length of the complementary strands, presence of organic solvents and extent of base mismatching, and the combination of parameters used is more important than the absolute measure of any one alone. Other hybridization conditions which may be controlled include buffer type and concentration, solution pH, presence and concentration of blocking reagents to decrease background binding such as repeat sequences or blocking protein solutions, detergent type(s) and concentrations, molecules such as polymers which increase the relative concentration of the polynucleotides, metal ion(s) and their concentration(s), chelator(s) and their concentrations, and other conditions known in the art.

As used herein, a "probe" is a polynucleotide capable of selectively hybridizing to a target sequence, a complement thereof, a reverse complement thereof, or to an RNA version of the target sequence, the complement thereof, or the reverse complement thereof. A probe may comprise ribonucleotides, deoxyribonucleotides, peptide nucleic acids, and combinations thereof. A probe may optionally comprise one or more labels. In some embodiments, a probe may be used to amplify one or both strands of a target sequence or an RNA form thereof, acting as a sole primer in an amplification reaction or as a member of a set of primers. In one aspect, probes include nucleotide sequences of 10 to 1,000 nucleotides. In other embodiments, the probes are 10-200, 10-30, 10-40, 20-50, 40-80, 50-150, or 80-120 nucleotides in length.

The use of the terms "a" and "an" and "the" and similar referents (especially in the context of the following claims) are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context. The terms first, second etc. as used herein are not meant to denote any particular ordering, but simply for convenience to denote a plurality of, for example, layers. The terms "comprising", "having", "including", and "containing" are to be construed as open-ended terms (i.e., meaning "including, but not limited to") unless otherwise noted. Recitation of ranges of values are merely intended to serve as a shorthand method of referring individually to each separate value falling within the range, unless otherwise indicated herein, and each separate value is incorporated into the specification as if it were individually recited herein. The endpoints of all ranges are included within the range and independently combinable. All methods described herein can be performed in a suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language (e.g., "such as"), is intended merely to better illustrate the invention and does not pose a limitation on the scope of the invention unless otherwise claimed. No language in the specification should be construed as indicating any non-claimed element as essential to the practice of the invention as used herein.

While the invention has been described with reference to an exemplary embodiment, it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted for elements thereof without departing from the scope of the invention. In addition, many modifications may be made to adapt a particular situation or material to the teachings of the invention without departing from the essential scope thereof. Therefore, it is intended that the invention not be limited to the particular embodiment disclosed as the best mode contemplated for carrying out this invention, but that the invention will include all embodiments falling within the scope of the appended claims. Any combination of the above-described elements in all possible variations thereof is encompassed by the invention unless otherwise indicated herein or otherwise clearly contradicted by context.

SEQUENCE LISTINGS

1

9511267DNAHomo sapiens 1cgagggctgc ttccggctgg tgcccccggg ggagacccaa cctggggcga cttcaggggt 60gccacattcg ctaagtgctc ggagttaata gcacctcctc cgagcactcg ctcacggcgt 120ccccttgcct ggaaagatac cgcggtccct ccagaggatt tgagggacag ggtcggaggg 180ggctcttccg ccagcaccgg aggaagaaag aggaggggct ggctggtcac cagagggtgg 240ggcggaccgc gtgcgctcgg cggctgcgga gagggggaga gcaggcagcg ggcggcgggg 300agcagcatgg agccggcggc ggggagcagc atggagcctt cggctgactg gctggccacg 360gccgcggccc ggggtcgggt agaggaggtg cgggcgctgc tggaggcggg ggcgctgccc 420aacgcaccga atagttacgg tcggaggccg atccaggtca tgatgatggg cagcgcccga 480gtggcggagc tgctgctgct ccacggcgcg gagcccaact gcgccgaccc cgccactctc 540acccgacccg tgcacgacgc tgcccgggag ggcttcctgg acacgctggt ggtgctgcac 600cgggccgggg cgcggctgga cgtgcgcgat gcctggggcc gtctgcccgt ggacctggct 660gaggagctgg gccatcgcga tgtcgcacgg tacctgcgcg cggctgcggg gggcaccaga 720ggcagtaacc atgcccgcat agatgccgcg gaaggtccct cagacatccc cgattgaaag 780aaccagagag gctctgagaa acctcgggaa acttagatca tcagtcaccg aaggtcctac 840agggccacaa ctgcccccgc cacaacccac cccgctttcg tagttttcat ttagaaaata 900gagcttttaa aaatgtcctg ccttttaacg tagatatatg ccttccccca ctaccgtaaa 960tgtccattta tatcattttt tatatattct tataaaaatg taaaaaagaa aaacaccgct 1020tctgcctttt cactgtgttg gagttttctg gagtgagcac tcacgcccta agcgcacatt 1080catgtgggca tttcttgcga gcctcgcagc ctccggaagc tgtcgacttc atgacaagca 1140ttttgtgaac tagggaagct caggggggtt actggcttct cttgagtcac actgctagca 1200aatggcagaa ccaaagctca aataaaaata aaataatttt cattcattca ctcaaaaaaa 1260aaaaaaa 126721464DNAHomo sapiens 2cgagggctgc ttccggctgg tgcccccggg ggagacccaa cctggggcga cttcaggggt 60gccacattcg ctaagtgctc ggagttaata gcacctcctc cgagcactcg ctcacggcgt 120ccccttgcct ggaaagatac cgcggtccct ccagaggatt tgagggacag ggtcggaggg 180ggctcttccg ccagcaccgg aggaagaaag aggaggggct ggctggtcac cagagggtgg 240ggcggaccgc gtgcgctcgg cggctgcgga gagggggaga gcaggcagcg ggcggcgggg 300agcagcatgg agccggcggc ggggagcagc atggagcctt cggctgactg gctggccacg 360gccgcggccc ggggtcgggt agaggaggtg cgggcgctgc tggaggcggg ggcgctgccc 420aacgcaccga atagttacgg tcggaggccg atccaggtca tgatgatggg cagcgcccga 480gtggcggagc tgctgctgct ccacggcgcg gagcccaact gcgccgaccc cgccactctc 540acccgacccg tgcacgacgc tgcccgggag ggcttcctgg acacgctggt ggtgctgcac 600cgggccgggg cgcggctgga cgtgcgcgat gcctggggcc gtctgcccgt ggacctggct 660gaggagctgg gccatcgcga tgtcgcacgg tacctgcgcg cggctgcggg gggcaccaga 720ggcagtaacc atgcccgcat agatgccgcg gaaggtccct cagaaatgat cggaaaccat 780ttgtgggttt gtagaagcag gcatgcgtag ggaagctacg ggattccgcc gaggagcgcc 840agagcctgag gcgccctttg gttatcgcaa gctggctggc tcactccgca ccaggtgcaa 900aagatgcctg gggatgcggg aagggaaagg ccacatcttc acgccttcgc gcctggcatt 960acatccccga ttgaaagaac cagagaggct ctgagaaacc tcgggaaact tagatcatca 1020gtcaccgaag gtcctacagg gccacaactg cccccgccac aacccacccc gctttcgtag 1080ttttcattta gaaaatagag cttttaaaaa tgtcctgcct tttaacgtag atatatgcct 1140tcccccacta ccgtaaatgt ccatttatat cattttttat atattcttat aaaaatgtaa 1200aaaagaaaaa caccgcttct gccttttcac tgtgttggag ttttctggag tgagcactca 1260cgccctaagc gcacattcat gtgggcattt cttgcgagcc tcgcagcctc cggaagctgt 1320cgacttcatg acaagcattt tgtgaactag ggaagctcag gggggttact ggcttctctt 1380gagtcacact gctagcaaat ggcagaacca aagctcaaat aaaaataaaa taattttcat 1440tcattcactc aaaaaaaaaa aaaa 146431164DNAHomo sapiens 3cgctcaggga aggcgggtgc gcgcctgcgg ggcggagatg ggcagggggc ggtgcgtggg 60tcccagtctg cagttaaggg ggcaggagtg gcgctgctca cctctggtgc caaagggcgg 120cgcagcggct gccgagctcg gccctggagg cggcgagaac atggtgcgca ggttcttggt 180gaccctccgg attcggcgcg cgtgcggccc gccgcgagtg agggttttcg tggttcacat 240cccgcggctc acgggggagt gggcagcgcc aggggcgccc gccgctgtgg ccctcgtgct 300gatgctactg aggagccagc gtctagggca gcagccgctt cctagaagac caggtcatga 360tgatgggcag cgcccgagtg gcggagctgc tgctgctcca cggcgcggag cccaactgcg 420ccgaccccgc cactctcacc cgacccgtgc acgacgctgc ccgggagggc ttcctggaca 480cgctggtggt gctgcaccgg gccggggcgc ggctggacgt gcgcgatgcc tggggccgtc 540tgcccgtgga cctggctgag gagctgggcc atcgcgatgt cgcacggtac ctgcgcgcgg 600ctgcgggggg caccagaggc agtaaccatg cccgcataga tgccgcggaa ggtccctcag 660acatccccga ttgaaagaac cagagaggct ctgagaaacc tcgggaaact tagatcatca 720gtcaccgaag gtcctacagg gccacaactg cccccgccac aacccacccc gctttcgtag 780ttttcattta gaaaatagag cttttaaaaa tgtcctgcct tttaacgtag atatatgcct 840tcccccacta ccgtaaatgt ccatttatat cattttttat atattcttat aaaaatgtaa 900aaaagaaaaa caccgcttct gccttttcac tgtgttggag ttttctggag tgagcactca 960cgccctaagc gcacattcat gtgggcattt cttgcgagcc tcgcagcctc cggaagctgt 1020cgacttcatg acaagcattt tgtgaactag ggaagctcag gggggttact ggcttctctt 1080gagtcacact gctagcaaat ggcagaacca aagctcaaat aaaaataaaa taattttcat 1140tcattcactc aaaaaaaaaa aaaa 116441235DNAHomo sapiens 4atggagccgg cggcggggag cagcatggag ccttcggctg actggctggc cacggccgcg 60gcccggggtc gggtagagga ggtgcgggcg ctgctggagg cgggggcgct gcccaacgca 120ccgaatagtt acggtcggag gccgatccag gtgggtagag ggtctgcagc gggagcaggg 180gatggcgggc gactctggag gacgaagttt gcaggggaat tggaatcagg tagcgcttcg 240attctccgga aaaaggggag gcttcctggg gagttttcag aaggggtttg taatcacaga 300cctcctcctg gcgacgccct gggggcttgg gaagccaagg aagaggaatg aggagccacg 360cgcgtacaga tctctcgaat gctgagaaga tctgaagggg ggaacatatt tgtattagat 420ggaagtcatg atgatgggca gcgcccgagt ggcggagctg ctgctgctcc acggcgcgga 480gcccaactgc gccgaccccg ccactctcac ccgacccgtg cacgacgctg cccgggaggg 540cttcctggac acgctggtgg tgctgcaccg ggccggggcg cggctggacg tgcgcgatgc 600ctggggccgt ctgcccgtgg acctggctga ggagctgggc catcgcgatg tcgcacggta 660cctgcgcgcg gctgcggggg gcaccagagg cagtaaccat gcccgcatag atgccgcgga 720aggtccctca gacatccccg attgaaagaa ccagagaggc tctgagaaac ctcgggaaac 780ttagatcatc agtcaccgaa ggtcctacag ggccacaact gcccccgcca caacccaccc 840cgctttcgta gttttcattt agaaaataga gcttttaaaa atgtcctgcc ttttaacgta 900gatatatgcc ttcccccact accgtaaatg tccatttata tcatttttta tatattctta 960taaaaatgta aaaaagaaaa acaccgcttc tgccttttca ctgtgttgga gttttctgga 1020gtgagcactc acgccctaag cgcacattca tgtgggcatt tcttgcgagc ctcgcagcct 1080ccggaagctg tcgacttcat gacaagcatt ttgtgaacta gggaagctca ggggggttac 1140tggcttctct tgagtcacac tgctagcaaa tggcagaacc aaagctcaaa taaaaataaa 1200ataattttca ttcattcact caaaaaaaaa aaaaa 123553756DNAHomo sapiens 5aacggcggga ccgcggcgcc tgggcgtcac tgaggcagta gccggccggg tgaggagggc 60ggttgccggc gcggcgcggc gcggcgcggg tggggcgggg gttccgccgg cttccagtcc 120cctttcccgc cgccgccgcc gccaccgcct ctccgcggag ctcgccccga gcgactcctc 180cgcggcagtg ctgacggcca gcggcacgag ccgtagtagc tgcagcttcg agtcacagca 240gcaggtaatt gctgccatgg aaacacaact gtctaatggg ccaacttgca ataacacagc 300caatggtcca accaccataa acaacaactg ttcgtcacca gttgactctg ggaacacaga 360agacagcaag accaacttaa tagtcaacta ccttcctcag aacatgacac aggaggaact 420aaagagtctc tttgggagca ttggtgaaat agagtcctgt aagcttgtaa gagacaaaat 480aacagggcag agcttgggat atggctttgt gaactacatt gaccccaagg atgcagagaa 540agctatcaac accctgaatg gattgagact tcaaaccaaa acaataaaag tttcctatgc 600tcgcccaagt tcagcttcta tcagagatgc aaatttatat gtcagcggac ttccaaaaac 660aatgacccag aaggagttgg aacagctttt ttcacaatat ggacgcatta ttacttctcg 720tattcttgtc gaccaggtca ctggcatatc aaggggtgta gggtttattc gatttgacaa 780gcgaattgag gcagaagaag ctatcaaagg cctaaatggc cagaaacctc ccggtgccac 840ggagccaatc actgtaaagt ttgctaataa cccaagccaa aaaaccaatc aggccatcct 900ttcccagctg taccagtctc caaacagaag gtatccagga ccgctagctc agcaggcaca 960gcgttttagg ttttctccaa tgaccattga cggaatgacc agtttggctg gaattaatat 1020ccctgggcac cctggaacag ggtggtgtat atttgtgtac aacctggctc ctgacgcaga 1080tgagagtatc ctgtggcaaa tgtttgggcc ttttggagct gtcaccaatg tgaaggtcat 1140ccgtgacttt aacaccaata aatgcaaagg ttttggattt gtgactatga caaactatga 1200tgaggctgcc atggcgatag ctagcctcaa tggataccgt ctgggagaca gagtactgca 1260ggtctccttt aagacaaaca aaacgcacaa agcctaatga gctcttgtcc tcagtccatt 1320tatatatgaa aactatacaa caaaggcaag ttaagagaaa ctttatacat tagtaaatgt 1380ctttgtaagt cagtgttgag atggggataa aatgactact tagcatccta agaaatatgt 1440gagatttttt attgctagta tttgaattaa aacttcttaa atatctttta tgtttgaata 1500tggacaagag gtacagggtt tttacctgtc acattgcatt ctattgcctt ctttgaagaa 1560ggtggacctt ttaaagtgtt tcagctaagg gaagacattt cttttctttt tacataactg 1620ccttgaacct gtgagtaaat attgaggctt tgtgttgtaa ttcttcagtt ggttgtgtct 1680tttttttccc cccttttttt cctttttctg attagctttg tgtttggttt acatttaaag 1740cattgctgtt atgtctgttt aagaaaagta ttttgaagtt tacattttta tttatgaagt 1800ttaaaacagt atttattttg taattatgat ttgggttggg gaaggggggg ctacattata 1860aacgcttatt gtaagaatac tggagaactt ttcgtaaagc agtaccttgc caaagagata 1920agagcctctt tgatgtgggt ttaaaaaaag catctatttt tataaaaaag aaaatttgga 1980gaaacttttt actggtcctg gaacaaatat tttgacttga atactttgag aaatctcttc 2040atatgacacc tagtgagctt ttaaaattta ccaggaaatt tgcagcggtt ggaaaattta 2100gaaagattta tggtgtagaa aatacttttg agatctttgt atgaaaggag tagaatcaat 2160ggggggaaac actgctggtt tcatttttgt aatcaccagt ggagcgtctg atcatcctgg 2220ttattatgtg ataggtggct cacattgatt tgtgattttg aaacaaataa aaaaaattta 2280caaaagaata tataagagca ggcaagaaat ttaaattacc gagagatggg ggaaaaaatc 2340tgttcttcct aaagaaatcc cttcagatag agctcatggt gtttagtgat gtacttgcag 2400tattgtttga agaattgttt tgtcttaagg aaaaaagacg ttgcacatga tttgtactgc 2460agcaaatcag caaaagtgat ctgagttgga tatatttgaa ggtattttga aagttacgtt 2520caaggctaac acctgagctt tgtgtaatgt aaataagacc ttgtgtttat gaacctttca 2580gctaatttaa ttttttttcc cttacatgcc aagtgatgtt caggttttga atgtttttgt 2640atcagttttt tcctttgtaa atggcattaa cattgttact tgaggtcttg cttaatcact 2700tttgttgtcc tgaggacttg aatttacagt gcatcagatt tgttgcaaat tttgtctgta 2760gatagtctag cttcagctgt ttatggtgat gctacatttt cgtttataaa tatgtttgtg 2820gtataaaaaa atgagtataa ccataggttt tgaacaaatt tccttacatt tttcatacaa 2880aaatcataaa tatctgtatg ctattgaaat ttaactttgt atgatgctta aaaaccacta 2940tttggggaaa taataaaata agtctttacc atgtatgaaa gaaattttaa aaaatacaaa 3000atattttctg attagcatct agcttataat aaattttcaa aaaagctgaa ggcaaaaatg 3060ccttcatcag gatgcactga gaactatata gttacgtcct gctttttgta taaactgaga 3120tgctcacatg cttcccctta gaacaggcaa tgtgctatgc ataacatagt tgtacattat 3180ctttgcggtt gctttgagtt ttatttttta ttatttaaaa ttgtagttat aaaatttttc 3240agtatagtac agtacatata ctgtgaggcg cgtgctaaag tgaataagcg agttttcatg 3300ctgacccact caatgctatt cagaaatcaa ttggcttagc actttctcat atccttaggt 3360gcatttagat tgccagagtt aaccttctgc gtttaaaaaa agaaaaacac taaaaaataa 3420aatacatgta tatacttaaa aaaaaataat aaggtttccc tcaagggaaa acagcagcta 3480catgcttctt tcctatacta ctgtagcaaa ccaaggcatt gatgagaggg catgcaaatt 3540gtgcttcact ttacagtgtt ttatcagagc acttaataaa atgtaaggct ggtatttatt 3600tgaagttgta cagtatgact taattcacat ctgttggaat agaaaatata ttctgttgag 3660tatttaagag gctgtacatg ttttcttttg tgtttggatt ctttgtactt tttcatgttc 3720agtacatcaa taaacaaagt tgaagggaaa aaaaaa 375663769DNAHomo sapiens 6agtccgaact ctgggcggga acactggtgg gggcggcgga ggttgtgccc gcgaagttcc 60tagagctcag cccgttgcgg cgggagtaga gagaattggg cgcctcggga ggtggcaccg 120cccctcccgt gggcacaagc aggttggggg cggcgggagc cgagcgggga cagtcgcgcc 180tggcagcgtg cacgggcgtg gacgtgcccg ggtgcggccg cgtgtagcgc aagaaggaaa 240ctgttgagac gcagcaggta attgctgcca tggaaacaca actgtctaat gggccaactt 300gcaataacac agccaatggt ccaaccacca taaacaacaa ctgttcgtca ccagttgact 360ctgggaacac agaagacagc aagaccaact taatagtcaa ctaccttcct cagaacatga 420cacaggagga actaaagagt ctctttggga gcattggtga aatagagtcc tgtaagcttg 480taagagacaa aataacaggg cagagcttgg gatatggctt tgtgaactac attgacccca 540aggatgcaga gaaagctatc aacaccctga atggattgag acttcaaacc aaaacaataa 600aagtttccta tgctcgccca agttcagctt ctatcagaga tgcaaattta tatgtcagcg 660gacttccaaa aacaatgacc cagaaggagt tggaacagct tttttcacaa tatggacgca 720ttattacttc tcgtattctt gtcgaccagg tcactggcat atcaaggggt gtagggttta 780ttcgatttga caagcgaatt gaggcagaag aagctatcaa aggcctaaat ggccagaaac 840ctcccggtgc cacggagcca atcactgtaa agtttgctaa taacccaagc caaaaaacca 900atcaggccat cctttcccag ctgtaccagt ctccaaacag aaggtatcca ggaccgctag 960ctcagcaggc acagcgtttt aggttttctc caatgaccat tgacggaatg accagtttgg 1020ctggaattaa tatccctggg caccctggaa cagggtggtg tatatttgtg tacaacctgg 1080ctcctgacgc agatgagagt atcctgtggc aaatgtttgg gccttttgga gctgtcacca 1140atgtgaaggt catccgtgac tttaacacca ataaatgcaa aggttttgga tttgtgacta 1200tgacaaacta tgatgaggct gccatggcga tagctagcct caatggatac cgtctgggag 1260acagagtact gcaggtctcc tttaagacaa acaaaacgca caaagcctaa tgagctcttg 1320tcctcagtcc atttatatat gaaaactata caacaaaggc aagttaagag aaactttata 1380cattagtaaa tgtctttgta agtcagtgtt gagatgggga taaaatgact acttagcatc 1440ctaagaaata tgtgagattt tttattgcta gtatttgaat taaaacttct taaatatctt 1500ttatgtttga atatggacaa gaggtacagg gtttttacct gtcacattgc attctattgc 1560cttctttgaa gaaggtggac cttttaaagt gtttcagcta agggaagaca tttcttttct 1620ttttacataa ctgccttgaa cctgtgagta aatattgagg ctttgtgttg taattcttca 1680gttggttgtg tctttttttt cccccctttt tttccttttt ctgattagct ttgtgtttgg 1740tttacattta aagcattgct gttatgtctg tttaagaaaa gtattttgaa gtttacattt 1800ttatttatga agtttaaaac agtatttatt ttgtaattat gatttgggtt ggggaagggg 1860gggctacatt ataaacgctt attgtaagaa tactggagaa cttttcgtaa agcagtacct 1920tgccaaagag ataagagcct ctttgatgtg ggtttaaaaa aagcatctat ttttataaaa 1980aagaaaattt ggagaaactt tttactggtc ctggaacaaa tattttgact tgaatacttt 2040gagaaatctc ttcatatgac acctagtgag cttttaaaat ttaccaggaa atttgcagcg 2100gttggaaaat ttagaaagat ttatggtgta gaaaatactt ttgagatctt tgtatgaaag 2160gagtagaatc aatgggggga aacactgctg gtttcatttt tgtaatcacc agtggagcgt 2220ctgatcatcc tggttattat gtgataggtg gctcacattg atttgtgatt ttgaaacaaa 2280taaaaaaaat ttacaaaaga atatataaga gcaggcaaga aatttaaatt accgagagat 2340gggggaaaaa atctgttctt cctaaagaaa tcccttcaga tagagctcat ggtgtttagt 2400gatgtacttg cagtattgtt tgaagaattg ttttgtctta aggaaaaaag acgttgcaca 2460tgatttgtac tgcagcaaat cagcaaaagt gatctgagtt ggatatattt gaaggtattt 2520tgaaagttac gttcaaggct aacacctgag ctttgtgtaa tgtaaataag accttgtgtt 2580tatgaacctt tcagctaatt taattttttt tcccttacat gccaagtgat gttcaggttt 2640tgaatgtttt tgtatcagtt ttttcctttg taaatggcat taacattgtt acttgaggtc 2700ttgcttaatc acttttgttg tcctgaggac ttgaatttac agtgcatcag atttgttgca 2760aattttgtct gtagatagtc tagcttcagc tgtttatggt gatgctacat tttcgtttat 2820aaatatgttt gtggtataaa aaaatgagta taaccatagg ttttgaacaa atttccttac 2880atttttcata caaaaatcat aaatatctgt atgctattga aatttaactt tgtatgatgc 2940ttaaaaacca ctatttgggg aaataataaa ataagtcttt accatgtatg aaagaaattt 3000taaaaaatac aaaatatttt ctgattagca tctagcttat aataaatttt caaaaaagct 3060gaaggcaaaa atgccttcat caggatgcac tgagaactat atagttacgt cctgcttttt 3120gtataaactg agatgctcac atgcttcccc ttagaacagg caatgtgcta tgcataacat 3180agttgtacat tatctttgcg gttgctttga gttttatttt ttattattta aaattgtagt 3240tataaaattt ttcagtatag tacagtacat atactgtgag gcgcgtgcta aagtgaataa 3300gcgagttttc atgctgaccc actcaatgct attcagaaat caattggctt agcactttct 3360catatcctta ggtgcattta gattgccaga gttaaccttc tgcgtttaaa aaaagaaaaa 3420cactaaaaaa taaaatacat gtatatactt aaaaaaaaat aataaggttt ccctcaaggg 3480aaaacagcag ctacatgctt ctttcctata ctactgtagc aaaccaaggc attgatgaga 3540gggcatgcaa attgtgcttc actttacagt gttttatcag agcacttaat aaaatgtaag 3600gctggtattt atttgaagtt gtacagtatg acttaattca catctgttgg aatagaaaat 3660atattctgtt gagtatttaa gaggctgtac atgttttctt ttgtgtttgg attctttgta 3720ctttttcatg ttcagtacat caataaacaa agttgaaggg aaaaaaaaa 376973814DNAHomo sapiens 7caataggagg gtagtctctc cgtcttttta aactcttttt taagtttccc ctcccctttc 60atattttttt tcgccatttc ttttagcatt ggactttggg gtcgaaagcg tttcttttta 120tttgcttctt ttaagccgag cacagtttag gtttcgtgct gtcttaagag aactatccag 180cagcttcttg ctcatcctta ttgggagaac tgcaccgtta ctttaaaaac acacatacac 240aaaaacctta agggagaaag caggtaattg ctgccatgga aacacaactg tctaatgggc 300caacttgcaa taacacagcc aatggtccaa ccaccataaa caacaactgt tcgtcaccag 360ttgactctgg gaacacagaa gacagcaaga ccaacttaat agtcaactac cttcctcaga 420acatgacaca ggaggaacta aagagtctct ttgggagcat tggtgaaata gagtcctgta 480agcttgtaag agacaaaata acagggcaga gcttgggata tggctttgtg aactacattg 540accccaagga tgcagagaaa gctatcaaca ccctgaatgg attgagactt caaaccaaaa 600caataaaagt ttcctatgct cgcccaagtt cagcttctat cagagatgca aatttatatg 660tcagcggact tccaaaaaca atgacccaga aggagttgga acagcttttt tcacaatatg 720gacgcattat tacttctcgt attcttgtcg accaggtcac tggcatatca aggggtgtag 780ggtttattcg atttgacaag cgaattgagg cagaagaagc tatcaaaggc ctaaatggcc 840agaaacctcc cggtgccacg gagccaatca ctgtaaagtt tgctaataac ccaagccaaa 900aaaccaatca ggccatcctt tcccagctgt accagtctcc aaacagaagg tatccaggac 960cgctagctca gcaggcacag cgttttaggt tggacaatct gctcaatatg gcttatggag 1020taaagaggtt ttctccaatg accattgacg gaatgaccag tttggctgga attaatatcc 1080ctgggcaccc tggaacaggg tggtgtatat ttgtgtacaa cctggctcct gacgcagatg 1140agagtatcct gtggcaaatg tttgggcctt ttggagctgt caccaatgtg aaggtcatcc 1200gtgactttaa caccaataaa tgcaaaggtt ttggatttgt gactatgaca aactatgatg 1260aggctgccat ggcgatagct agcctcaatg gataccgtct gggagacaga gtactgcagg 1320tctcctttaa gacaaacaaa acgcacaaag cctaatgagc tcttgtcctc agtccattta 1380tatatgaaaa ctatacaaca aaggcaagtt aagagaaact ttatacatta gtaaatgtct 1440ttgtaagtca gtgttgagat ggggataaaa tgactactta gcatcctaag aaatatgtga 1500gattttttat tgctagtatt tgaattaaaa cttcttaaat atcttttatg tttgaatatg 1560gacaagaggt acagggtttt tacctgtcac attgcattct attgccttct ttgaagaagg 1620tggacctttt aaagtgtttc agctaaggga agacatttct tttcttttta cataactgcc 1680ttgaacctgt gagtaaatat tgaggctttg tgttgtaatt cttcagttgg ttgtgtcttt 1740tttttccccc ctttttttcc tttttctgat tagctttgtg tttggtttac atttaaagca 1800ttgctgttat gtctgtttaa gaaaagtatt ttgaagttta catttttatt tatgaagttt 1860aaaacagtat ttattttgta attatgattt gggttgggga agggggggct acattataaa 1920cgcttattgt aagaatactg gagaactttt cgtaaagcag taccttgcca aagagataag 1980agcctctttg atgtgggttt aaaaaaagca tctattttta taaaaaagaa aatttggaga 2040aactttttac tggtcctgga acaaatattt tgacttgaat

actttgagaa atctcttcat 2100atgacaccta gtgagctttt aaaatttacc aggaaatttg cagcggttgg aaaatttaga 2160aagatttatg gtgtagaaaa tacttttgag atctttgtat gaaaggagta gaatcaatgg 2220ggggaaacac tgctggtttc atttttgtaa tcaccagtgg agcgtctgat catcctggtt 2280attatgtgat aggtggctca cattgatttg tgattttgaa acaaataaaa aaaatttaca 2340aaagaatata taagagcagg caagaaattt aaattaccga gagatggggg aaaaaatctg 2400ttcttcctaa agaaatccct tcagatagag ctcatggtgt ttagtgatgt acttgcagta 2460ttgtttgaag aattgttttg tcttaaggaa aaaagacgtt gcacatgatt tgtactgcag 2520caaatcagca aaagtgatct gagttggata tatttgaagg tattttgaaa gttacgttca 2580aggctaacac ctgagctttg tgtaatgtaa ataagacctt gtgtttatga acctttcagc 2640taatttaatt ttttttccct tacatgccaa gtgatgttca ggttttgaat gtttttgtat 2700cagttttttc ctttgtaaat ggcattaaca ttgttacttg aggtcttgct taatcacttt 2760tgttgtcctg aggacttgaa tttacagtgc atcagatttg ttgcaaattt tgtctgtaga 2820tagtctagct tcagctgttt atggtgatgc tacattttcg tttataaata tgtttgtggt 2880ataaaaaaat gagtataacc ataggttttg aacaaatttc cttacatttt tcatacaaaa 2940atcataaata tctgtatgct attgaaattt aactttgtat gatgcttaaa aaccactatt 3000tggggaaata ataaaataag tctttaccat gtatgaaaga aattttaaaa aatacaaaat 3060attttctgat tagcatctag cttataataa attttcaaaa aagctgaagg caaaaatgcc 3120ttcatcagga tgcactgaga actatatagt tacgtcctgc tttttgtata aactgagatg 3180ctcacatgct tccccttaga acaggcaatg tgctatgcat aacatagttg tacattatct 3240ttgcggttgc tttgagtttt attttttatt atttaaaatt gtagttataa aatttttcag 3300tatagtacag tacatatact gtgaggcgcg tgctaaagtg aataagcgag ttttcatgct 3360gacccactca atgctattca gaaatcaatt ggcttagcac tttctcatat ccttaggtgc 3420atttagattg ccagagttaa ccttctgcgt ttaaaaaaag aaaaacacta aaaaataaaa 3480tacatgtata tacttaaaaa aaaataataa ggtttccctc aagggaaaac agcagctaca 3540tgcttctttc ctatactact gtagcaaacc aaggcattga tgagagggca tgcaaattgt 3600gcttcacttt acagtgtttt atcagagcac ttaataaaat gtaaggctgg tatttatttg 3660aagttgtaca gtatgactta attcacatct gttggaatag aaaatatatt ctgttgagta 3720tttaagaggc tgtacatgtt ttcttttgtg tttggattct ttgtactttt tcatgttcag 3780tacatcaata aacaaagttg aagggaaaaa aaaa 381482130DNAHomo sapiens 8agttaagggc ctggcgtctc cctccctgaa gacgtggtcc cagccgggtg tcctgacgct 60cggggttcag gacaagggca cacaactggt tccgttaagc ccctctctcg ctcagacgcc 120atggagctgg atctgtctcc acctcatctt agcagctctc cggaagacct ttgcccagcc 180cctgggaccc ctcctgggac tccccggccc cctgataccc ctctgcctga ggaggtaaag 240aggtcccagc ctctcctcat cccaaccacc ggcaggaaac ttcgagagga ggagaggcgt 300gccacctccc tcccctctat ccccaacccc ttccctgagc tctgcagtcc tccctcacag 360agcccaattc tcgggggccc ctccagtgca agggggctgc tcccccgcga tgccagccgc 420ccccatgtag taaaggtgta cagtgaggat ggggcctgca ggtctgtgga ggtggcagca 480ggtgccacag ctcgccacgt gtgtgaaatg ctggtgcagc gagctcacgc cttgagcgac 540gagacctggg ggctggtgga gtgccacccc cacctagcac tggagcgggg tttggaggac 600cacgagtccg tggtggaagt gcaggctgcc tggcccgtgg gcggagatag ccgcttcgtc 660ttccggaaaa acttcgccaa gtacgaactg ttcaagagct ccccacactc cctgttccca 720gaaaaaatgg tctccagctg tctcgatgca cacactggta tatcccatga agacctcatc 780cagaacttcc tgaatgctgg cagctttcct gagatccagg gctttctgca gctgcggggt 840tcaggacgga agctttggaa acgctttttc tgcttcttgc gccgatctgg cctctattac 900tccaccaagg gcacctctaa ggatccgagg cacctgcagt acgtggcaga tgtgaacgag 960tccaacgtgt acgtggtgac gcagggccgc aagctctacg ggatgcccac tgacttcggt 1020ttctgtgtca agcccaacaa gcttcgaaat ggccacaagg ggcttcggat cttctgcagt 1080gaagatgagc agagccgcac ctgctggctg gctgccttcc gcctcttcaa gtacggggtg 1140cagctgtaca agaattacca gcaggcacag tctcgccatc tgcatccatc ttgtttgggc 1200tccccaccct tgagaagtgc ctcagataat accctggtgg ccatggactt ctctggccat 1260gctgggcgtg tcattgagaa cccccgggag gctctgagtg tggccctgga ggaggcccag 1320gcctggagga agaagacaaa ccaccgcctc agcctgccca tgccagcctc cggcacgagc 1380ctcagtgcag ccatccaccg cacccaactc tggttccacg ggcgcatttc ccgtgaggag 1440agccagcggc ttattggaca gcagggcttg gtagacggcc tgttcctggt ccgggagagt 1500cagcggaacc cccagggctt tgtcctctct ttgtgccacc tgcagaaagt gaagcattat 1560ctcatcctgc cgagcgagga ggagggccgc ctgtacttca gcatggatga tggccagacc 1620cgcttcactg acctgctgca gctcgtggag ttccaccagc tgaaccgcgg catcctgccg 1680tgcttgctgc gccattgctg cacgcgggtg gccctctgac caggccgtgg actggctcat 1740gcctcagccc gccttcaggc tgcccgccgc ccctccaccc atccagtgga ctctggggcg 1800cggccacagg ggacgggatg aggagcggga gggttccgcc actccagttt tctcctctgc 1860ttctttgcct ccctcagata gaaaacagcc cccactccag tccactcctg acccctctcc 1920tcaagggaag gccttgggtg gccccctctc cttctcctag ctctggaggt gctgctctag 1980ggcagggaat tatgggagaa gtgggggcag cccaggcggt ttcacgcccc acactttgta 2040cagaccgaga ggccagttga tctgctctgt tttatactag tgacaataaa gattattttt 2100tgatacaaaa aaaaaaaaaa aaaaaaaaaa 213092214DNAHomo sapiens 9tgcgggctgc ggggagatgt ggggagggcc ccctccactt tggagggcag tgaaggagag 60ggatcctcta aattgtcgag gcttcatctc tccagattgt atgcccttct cagcaacacc 120gcctccggcc ctccgatggg aaagtggagg ccgggacaag ggcacacaac tggttccgtt 180aagcccctct ctcgctcaga cgccatggag ctggatctgt ctccacctca tcttagcagc 240tctccggaag acctttgccc agcccctggg acccctcctg ggactccccg gccccctgat 300acccctctgc ctgaggaggt aaagaggtcc cagcctctcc tcatcccaac caccggcagg 360aaacttcgag aggaggagag gcgtgccacc tccctcccct ctatccccaa ccccttccct 420gagctctgca gtcctccctc acagagccca attctcgggg gcccctccag tgcaaggggg 480ctgctccccc gcgatgccag ccgcccccat gtagtaaagg tgtacagtga ggatggggcc 540tgcaggtctg tggaggtggc agcaggtgcc acagctcgcc acgtgtgtga aatgctggtg 600cagcgagctc acgccttgag cgacgagacc tgggggctgg tggagtgcca cccccaccta 660gcactggagc ggggtttgga ggaccacgag tccgtggtgg aagtgcaggc tgcctggccc 720gtgggcggag atagccgctt cgtcttccgg aaaaacttcg ccaagtacga actgttcaag 780agctccccac actccctgtt cccagaaaaa atggtctcca gctgtctcga tgcacacact 840ggtatatccc atgaagacct catccagaac ttcctgaatg ctggcagctt tcctgagatc 900cagggctttc tgcagctgcg gggttcagga cggaagcttt ggaaacgctt tttctgcttc 960ttgcgccgat ctggcctcta ttactccacc aagggcacct ctaaggatcc gaggcacctg 1020cagtacgtgg cagatgtgaa cgagtccaac gtgtacgtgg tgacgcaggg ccgcaagctc 1080tacgggatgc ccactgactt cggtttctgt gtcaagccca acaagcttcg aaatggccac 1140aaggggcttc ggatcttctg cagtgaagat gagcagagcc gcacctgctg gctggctgcc 1200ttccgcctct tcaagtacgg ggtgcagctg tacaagaatt accagcaggc acagtctcgc 1260catctgcatc catcttgttt gggctcccca cccttgagaa gtgcctcaga taataccctg 1320gtggccatgg acttctctgg ccatgctggg cgtgtcattg agaacccccg ggaggctctg 1380agtgtggccc tggaggaggc ccaggcctgg aggaagaaga caaaccaccg cctcagcctg 1440cccatgccag cctccggcac gagcctcagt gcagccatcc accgcaccca actctggttc 1500cacgggcgca tttcccgtga ggagagccag cggcttattg gacagcaggg cttggtagac 1560ggcctgttcc tggtccggga gagtcagcgg aacccccagg gctttgtcct ctctttgtgc 1620cacctgcaga aagtgaagca ttatctcatc ctgccgagcg aggaggaggg ccgcctgtac 1680ttcagcatgg atgatggcca gacccgcttc actgacctgc tgcagctcgt ggagttccac 1740cagctgaacc gcggcatcct gccgtgcttg ctgcgccatt gctgcacgcg ggtggccctc 1800tgaccaggcc gtggactggc tcatgcctca gcccgccttc aggctgcccg ccgcccctcc 1860acccatccag tggactctgg ggcgcggcca caggggacgg gatgaggagc gggagggttc 1920cgccactcca gttttctcct ctgcttcttt gcctccctca gatagaaaac agcccccact 1980ccagtccact cctgacccct ctcctcaagg gaaggccttg ggtggccccc tctccttctc 2040ctagctctgg aggtgctgct ctagggcagg gaattatggg agaagtgggg gcagcccagg 2100cggtttcacg ccccacactt tgtacagacc gagaggccag ttgatctgct ctgttttata 2160ctagtgacaa taaagattat tttttgatac aaaaaaaaaa aaaaaaaaaa aaaa 2214102275DNAHomo sapiens 10aggcaaaccc cagccttgga ctggccctct ctgatctctg aggccaggct ctaatgtgat 60ttgaatctac ttctaacccc ttccaagcac tgccctcccg aattctctgc tcctctcccc 120accccactgt tggtctgtga tttcgaggca ggcgtggccc cctgcagcct ggaatgaagt 180cactggggct gtttggagac cggggctgtt tggaggacaa gggcacacaa ctggttccgt 240taagcccctc tctcgctcag acgccatgga gctggatctg tctccacctc atcttagcag 300ctctccggaa gacctttgcc cagcccctgg gacccctcct gggactcccc ggccccctga 360tacccctctg cctgaggagg taaagaggtc ccagcctctc ctcatcccaa ccaccggcag 420gaaacttcga gaggaggaga ggcgtgccac ctccctcccc tctatcccca accccttccc 480tgagctctgc agtcctccct cacagagccc aattctcggg ggcccctcca gtgcaagggg 540gctgctcccc cgcgatgcca gccgccccca tgtagtaaag gtgtacagtg aggatggggc 600ctgcaggtct gtggaggtgg cagcaggtgc cacagctcgc cacgtgtgtg aaatgctggt 660gcagcgagct cacgccttga gcgacgagac ctgggggctg gtggagtgcc acccccacct 720agcactggag cggggtttgg aggaccacga gtccgtggtg gaagtgcagg ctgcctggcc 780cgtgggcgga gatagccgct tcgtcttccg gaaaaacttc gccaagtacg aactgttcaa 840gagctcccca cactccctgt tcccagaaaa aatggtctcc agctgtctcg atgcacacac 900tggtatatcc catgaagacc tcatccagaa cttcctgaat gctggcagct ttcctgagat 960ccagggcttt ctgcagctgc ggggttcagg acggaagctt tggaaacgct ttttctgctt 1020cttgcgccga tctggcctct attactccac caagggcacc tctaaggatc cgaggcacct 1080gcagtacgtg gcagatgtga acgagtccaa cgtgtacgtg gtgacgcagg gccgcaagct 1140ctacgggatg cccactgact tcggtttctg tgtcaagccc aacaagcttc gaaatggcca 1200caaggggctt cggatcttct gcagtgaaga tgagcagagc cgcacctgct ggctggctgc 1260cttccgcctc ttcaagtacg gggtgcagct gtacaagaat taccagcagg cacagtctcg 1320ccatctgcat ccatcttgtt tgggctcccc acccttgaga agtgcctcag ataataccct 1380ggtggccatg gacttctctg gccatgctgg gcgtgtcatt gagaaccccc gggaggctct 1440gagtgtggcc ctggaggagg cccaggcctg gaggaagaag acaaaccacc gcctcagcct 1500gcccatgcca gcctccggca cgagcctcag tgcagccatc caccgcaccc aactctggtt 1560ccacgggcgc atttcccgtg aggagagcca gcggcttatt ggacagcagg gcttggtaga 1620cggcctgttc ctggtccggg agagtcagcg gaacccccag ggctttgtcc tctctttgtg 1680ccacctgcag aaagtgaagc attatctcat cctgccgagc gaggaggagg gccgcctgta 1740cttcagcatg gatgatggcc agacccgctt cactgacctg ctgcagctcg tggagttcca 1800ccagctgaac cgcggcatcc tgccgtgctt gctgcgccat tgctgcacgc gggtggccct 1860ctgaccaggc cgtggactgg ctcatgcctc agcccgcctt caggctgccc gccgcccctc 1920cacccatcca gtggactctg gggcgcggcc acaggggacg ggatgaggag cgggagggtt 1980ccgccactcc agttttctcc tctgcttctt tgcctccctc agatagaaaa cagcccccac 2040tccagtccac tcctgacccc tctcctcaag ggaaggcctt gggtggcccc ctctccttct 2100cctagctctg gaggtgctgc tctagggcag ggaattatgg gagaagtggg ggcagcccag 2160gcggtttcac gccccacact ttgtacagac cgagaggcca gttgatctgc tctgttttat 2220actagtgaca ataaagatta ttttttgata caaaaaaaaa aaaaaaaaaa aaaaa 2275112285DNAHomo sapiens 11acccgccccc atctgcccaa gataatttta gtttccttgg gcctggaatc tggacacaca 60gggctccccc ccgcctctga cttctctgtc cgaagtcggg acaccctcct accacctgta 120gagaagcggg agtggatctg aaataaaatc caggaatctg ggggttccta gacggagcca 180gacttcggaa cgggtgtcct gctactcctg ctggggctcc tccaggacaa gggcacacaa 240ctggttccgt taagcccctc tctcgctcag acgccatgga gctggatctg tctccacctc 300atcttagcag ctctccggaa gacctttgcc cagcccctgg gacccctcct gggactcccc 360ggccccctga tacccctctg cctgaggagg taaagaggtc ccagcctctc ctcatcccaa 420ccaccggcag gaaacttcga gaggaggaga ggcgtgccac ctccctcccc tctatcccca 480accccttccc tgagctctgc agtcctccct cacagagccc aattctcggg ggcccctcca 540gtgcaagggg gctgctcccc cgcgatgcca gccgccccca tgtagtaaag gtgtacagtg 600aggatggggc ctgcaggtct gtggaggtgg cagcaggtgc cacagctcgc cacgtgtgtg 660aaatgctggt gcagcgagct cacgccttga gcgacgagac ctgggggctg gtggagtgcc 720acccccacct agcactggag cggggtttgg aggaccacga gtccgtggtg gaagtgcagg 780ctgcctggcc cgtgggcgga gatagccgct tcgtcttccg gaaaaacttc gccaagtacg 840aactgttcaa gagctcccca cactccctgt tcccagaaaa aatggtctcc agctgtctcg 900atgcacacac tggtatatcc catgaagacc tcatccagaa cttcctgaat gctggcagct 960ttcctgagat ccagggcttt ctgcagctgc ggggttcagg acggaagctt tggaaacgct 1020ttttctgctt cttgcgccga tctggcctct attactccac caagggcacc tctaaggatc 1080cgaggcacct gcagtacgtg gcagatgtga acgagtccaa cgtgtacgtg gtgacgcagg 1140gccgcaagct ctacgggatg cccactgact tcggtttctg tgtcaagccc aacaagcttc 1200gaaatggcca caaggggctt cggatcttct gcagtgaaga tgagcagagc cgcacctgct 1260ggctggctgc cttccgcctc ttcaagtacg gggtgcagct gtacaagaat taccagcagg 1320cacagtctcg ccatctgcat ccatcttgtt tgggctcccc acccttgaga agtgcctcag 1380ataataccct ggtggccatg gacttctctg gccatgctgg gcgtgtcatt gagaaccccc 1440gggaggctct gagtgtggcc ctggaggagg cccaggcctg gaggaagaag acaaaccacc 1500gcctcagcct gcccatgcca gcctccggca cgagcctcag tgcagccatc caccgcaccc 1560aactctggtt ccacgggcgc atttcccgtg aggagagcca gcggcttatt ggacagcagg 1620gcttggtaga cggcctgttc ctggtccggg agagtcagcg gaacccccag ggctttgtcc 1680tctctttgtg ccacctgcag aaagtgaagc attatctcat cctgccgagc gaggaggagg 1740gccgcctgta cttcagcatg gatgatggcc agacccgctt cactgacctg ctgcagctcg 1800tggagttcca ccagctgaac cgcggcatcc tgccgtgctt gctgcgccat tgctgcacgc 1860gggtggccct ctgaccaggc cgtggactgg ctcatgcctc agcccgcctt caggctgccc 1920gccgcccctc cacccatcca gtggactctg gggcgcggcc acaggggacg ggatgaggag 1980cgggagggtt ccgccactcc agttttctcc tctgcttctt tgcctccctc agatagaaaa 2040cagcccccac tccagtccac tcctgacccc tctcctcaag ggaaggcctt gggtggcccc 2100ctctccttct cctagctctg gaggtgctgc tctagggcag ggaattatgg gagaagtggg 2160ggcagcccag gcggtttcac gccccacact ttgtacagac cgagaggcca gttgatctgc 2220tctgttttat actagtgaca ataaagatta ttttttgata caaaaaaaaa aaaaaaaaaa 2280aaaaa 228512914DNAHomo sapiens 12gcatggggag gggcggccct caaacgggtc attgccatta atagagacct caaacaccgc 60ctgctaaaaa tacccgactg gaggagcata aaagcgcagc cgagcccagc gccccgcact 120tttctgagca gacgtccaga gcagagtcag ccagcatgac cgagcgccgc gtccccttct 180cgctcctgcg gggccccagc tgggacccct tccgcgactg gtacccgcat agccgcctct 240tcgaccaggc cttcgggctg ccccggctgc cggaggagtg gtcgcagtgg ttaggcggca 300gcagctggcc aggctacgtg cgccccctgc cccccgccgc catcgagagc cccgcagtgg 360ccgcgcccgc ctacagccgc gcgctcagcc ggcaactcag cagcggggtc tcggagatcc 420ggcacactgc ggaccgctgg cgcgtgtccc tggatgtcaa ccacttcgcc ccggacgagc 480tgacggtcaa gaccaaggat ggcgtggtgg agatcaccgg caagcacgag gagcggcagg 540acgagcatgg ctacatctcc cggtgcttca cgcggaaata cacgctgccc cccggtgtgg 600accccaccca agtttcctcc tccctgtccc ctgagggcac actgaccgtg gaggccccca 660tgcccaagct agccacgcag tccaacgaga tcaccatccc agtcaccttc gagtcgcggg 720cccagcttgg gggcccagaa gctgcaaaat ccgatgagac tgccgccaag taaagcctta 780gcccggatgc ccacccctgc tgccgccact ggctgtgcct cccccgccac ctgtgtgttc 840ttttgataca tttatcttct gtttttctca aataaagttc aaagcaacca cctgtcaaaa 900aaaaaaaaaa aaaa 914133262DNAHomo sapiens 13agagtgctcc gcggccgtgt ggagcgaggc cttgttcccg cgttgagccg ccgccgccgc 60cgccgcctcc tcagcttcag cctccgcgcc aggcccggcc ccgccgcgcc atgtcggact 120acagcacggg aggacccccg cccgggccgc cgccgcccgc cggcgggggc gggggagccg 180gaggcgccgg gggaggccct ccgccgggcc cgccaggcgc gggggaccgg ggcggcggcg 240gtcccggcgg cggcggcccg ggcggggggt cggccggggg cccctctcag ccacccggcg 300gaggcggccc gggaatccgc aaggacgctt tcgccgacgc cgtgcagcgg gcccgccaga 360ttgcagccaa aattggaggc gatgctgcca cgacagtgaa taacagcact cctgattttg 420gttttggggg ccaaaagaga cagttggaag atggagatca accggagagc aagaagctgg 480cttcccaggg agactcaatc agttctcaac ttggacccat ccatcctccc ccaaggactt 540caatgacaga agagtacagg gtcccagacg gcatggtggg cctgatcatt ggcagaggag 600gtgaacaaat taacaaaatc caacaggatt caggctgcaa agtacagatt tctccagaca 660gcggtggcct acccgagcgc agtgtgtcct tgacaggagc cccagaatct gtccagaaag 720ccaagatgat gctggatgac attgtgtctc ggggtcgtgg gggcccccca ggacagttcc 780acgacaacgc caacgggggc cagaacggca ccgtgcagga gatcatgatc cccgcgggca 840aggccggcct ggtcattggc aagggcgggg agaccattaa gcagctgcag gaacgcgctg 900gagtgaagat gatcttaatt caggacggat ctcagaatac gaatgtggac aaacctctcc 960gcatcattgg ggatccttac aaagtgcagc aagcctgtga gatggtgatg gacatcctcc 1020gggaacgtga ccaaggcggc tttggggacc ggaatgagta cggatctcgg attggcggag 1080gcatcgatgt gccagtgccc aggcattctg ttggcgtggt cattggccgg agtggagaga 1140tgatcaagaa gatccagaat gatgctggcg tgcggataca gttcaagcaa gatgacggga 1200cagggcccga gaagattgct catataatgg ggcccccaga caggtgcgag cacgcagccc 1260ggatcatcaa cgacctcctc cagagcctca ggagtggtcc cccaggtcct ccagggggtc 1320caggcatgcc cccggggggc cgaggccgag gaagaggcca aggcaattgg ggtccccctg 1380gcggggagat gaccttctcc atccccactc acaagtgtgg gctggtcatc ggccgaggtg 1440gcgagaatgt gaaagccata aaccagcaga cgggagcctt cgtagagatc tcccggcagc 1500tgccacccaa cggggacccc aacttcaagt tgttcatcat ccggggttca ccccagcaga 1560ttgaccacgc caagcagctt atcgaggaaa agatcgaggg tcctctctgc ccagttggac 1620caggcccagg tggcccaggc cctgctggcc caatggggcc cttcaatcct gggcccttca 1680accaggggcc acccggggct cccccacatg ccggggggcc ccctcctcac cagtacccac 1740cccagggctg gggcaatacc tacccccagt ggcagccgcc tgctcctcat gacccaagca 1800aagcagctgc agcggccgcg gaccccaacg ccgcgtgggc cgcctactac tcacactact 1860accagcagcc cccgggcccc gtccccggcc ccgcaccggc ccctgcggcc ccaccggctc 1920agggtgagcc ccctcagccc ccacccaccg gccagtcgga ctacactaag gcctgggaag 1980agtattacaa aaagatcggc cagcagcccc agcagcccgg agcaccccca cagcaggact 2040acacgaaggc ttgggaggag tactacaaga agcaagcgca agtggccacc ggagggggtc 2100caggagctcc cccaggctcc cagccagact acagtgccgc ctgggcggaa tattacagac 2160agcaggccgc ttactacgga cagaccccag gtcctggcgg cccccagccg ccgcccacgc 2220agcagggaca gcagcaggct caatgaatcg aatgaatgtg aacttcttca tctgtgaaaa 2280atcttttttt tttccatttt gttctgtttg ggggcttctg ttttgtttgg cgagagagcg 2340atggctgccg tggggagtac tggggagccc tcgcggcaag cagggtgggg gggacttggg 2400ggcatgccgg gccctcactc tctcgcctgt tctgtgtctc acatgctttt tctttcaaaa 2460ttgggatcct tccatgttga gccagccaga gaagatagcg agatctaaat ctctgccaaa 2520aaaaaaaaaa aacttaaaaa ttaaaaacac aaagagcaaa gcagaactta taaaattata 2580tatatatata ttaaaaagtc tctattcttc accccccagc cttcctgaac ctgcctctct 2640gaggataaag caattcattt tctcccaccc tcggccctct tgtttttaaa ataaactttt 2700aaaaaggaaa aaaaaaagtc actcttgcta tttctttttt ttagttagag gtggaacatt 2760ccttggacca ggtgttgtat tgcaggaccc cttcccccag cagccaagcc ccctcttctc 2820tccctcccgc cctggctcag ctcccgcggc cccgcccgtc ccccctccca ggactggtct 2880gttgtctttt catctgttca agaggagatt gaaactgaaa acaaaatgag aacaacaaaa 2940aaaattgtat ggcagttttt actttttatc gctcgttttt aacttcacaa ataaatgata 3000acaaaacctc cccgtctgcg ggtgctgtct gtctcccccc ctttccttcc ctccctgtag 3060ttttgaagcg gatgtttgtt ctttatagat gttgtttaaa aagcctgata atggtgattg 3120aaatttacaa actttgtgtt tttttttttt taagaaaaat ataaaatagt tttcttcagg 3180ctcaatgtgc tttcctaacc gtgccccccc cccttttttt

tttttgttaa ataaagtgct 3240ttttgtttaa aaaaaaaaaa aa 3262143918DNAHomo sapiens 14ctctcccttc tccactctct ccccctgtct cctttcttct tcttctttca ccctccgtct 60ctcacacccc ctccattccc ctgtctcctt tctgacactg cactgcagct gctcctcagc 120cctgccccct ccccagtgag aacaaaccag caacattgct ttttttccta aagagattta 180tattgatccg attaaaaaaa aaaaacctta agaaacccca aacgcaaaaa aaaaaaaaaa 240aaaaaaagaa aaaagaaaag aaaaagccaa aacaaaaggg agaaccttct cccggtagca 300gcggcaggaa ctgcaaacat gatggcggca gctcccatcc agcagaacgg gacccacact 360ggggttccca tagacctgga cccgccggac tcgcggaaaa ggccgctgga agccccccct 420gaagccggca gcaccaagag gaccaatacg ggcgaagacg gccagtattt tctaaaggtt 480ctcataccta gttatgctgc tggatctata attgggaagg gaggacagac aattgttcag 540ttgcaaaaag aaactggagc caccatcaag ctgtctaagt ccaaagattt ttacccaggt 600actactgagc gagtgtgctt gatccaggga acggttgaag cactgaatgc agttcatgga 660ttcattgcag aaaaaattcg agaaatgccc caaaatgtgg ccaagacaga accagtcagc 720attctacaac cccagaccac cgttaatcca gatcgcatca aacaaacatt gccatcttcc 780ccaactacca ccaagtcctc tccatctgat cccatgacca cctccagagc taatcaggta 840aagattatag ttcccaacag cacagcaggt ctgataatag ggaagggagg tgctactgtg 900aaggctgtaa tggagcagtc aggggcttgg gtgcagcttt cccagaaacc tgatgggatc 960aacttgcaag agagggttgt cactgtgagt ggagaacctg aacaaaaccg aaaagctgtt 1020gaacttatca tccagaagat acaagaggat ccacaaagtg gcagctgtct caatatcagt 1080tatgccaatg tgacaggtcc agtggcaaat tccaatccaa ccggatctcc ttatgcaaac 1140actgctgaag tgttaccaac tgctgcagca gctgcagggc tattaggaca tgctaacctt 1200gctggcgttg cagcctttcc agcagtttta tctggcttca caggcaatga cctggtggcc 1260atcacctctg cacttaatac attagccagc tatggatata atctcaacac tttaggttta 1320ggtctcagtc aagcagcagc aacaggggct ttggctgcag cagctgccag tgccaaccca 1380gcagcagcag cagccaattt attggccacc tatgccagtg aagcctcagc cagtggcagc 1440acagctggtg gtacggcggg gacatttgca ttaggtagcc tggctgctgc tactgctgca 1500accaatggat attttggagc tgcttctccc ctagctgcca gtgccattct aggaacagaa 1560aagtccacag atggatccaa ggatgtagtt gaaatagcag tgccagaaaa cttagttggt 1620gcaatacttg gcaaaggagg gaaaacatta gtggaatacc aggagttgac tggtgcaagg 1680atacagatct ccaaaaaagg agaattcgta cctggcacaa ggaatcggaa ggtaaccatt 1740actggaacac cagctgcaac acaggctgct caatatttaa ttacacaaag gatcacatat 1800gagcaaggag ttcgggctgc caatcctcag aaagtgggtt gagtgcccca gttacacatc 1860agattgtttt aacccctcct ttaccccatt ttcaagaagg atgtactgta ctttgcagaa 1920gtgaagtttt tctgttatta atatataatt atgcaaatga atgcgactat gttgacaatg 1980tgtatatgta aataatatgt gttttaccag atgtttcata gaaagaattt tttcttgatc 2040tgttttgttc tctatacttt gcttgtgtat atttgtcaga ggtgtttcta gtgtaagatt 2100taagcctgcc attttaccag cattattgta gtttaatgat tgaatgtaga cagggatatg 2160cgtatagttt tcagtattag ttctagataa cactaaatta actactgtta ggttgagtat 2220ggtggggtca gtgacctaaa atggagtgag gccaaagcac tgtcctgtaa gtcttacttc 2280ctgcttaggg cacagtgaag taggaaacaa tattttgaaa ataagtttta aatttaaaat 2340gatcaaaaag caatatagtt gcataaaagc actgtaaaat atttaaaagg ttaaaactgt 2400ggaaaattat attggtaagt ttacagatca ataaaagcac ctgttctcca tctgaactag 2460acaatggaaa taatgctgca tgctggccat ggcccattct tcatcatttg taagttcaac 2520aaaagttctc acatggagtc ccacctcttc agaggtttgt acatttgttt ttaagcactg 2580aattcactac tgatcccatc gcctggccag tagaacagtc attactccat taacatcctc 2640actgtttaga cacataactg tggtacagtg tattggaaat tttataaaca aaagtgaaag 2700tgccaacaaa ttattgatag ctgataatgt ttcattatct gcaactgctt gataagtatg 2760ttgcatttta agagcttata attgtgtata atttgttaac actagaaacc tattagtatt 2820gtgaatgtag attttactgt gaagctatct gtgatttagc tgtttgctcc catgatggag 2880tctttgcagc atggcgctag cagccaatgc agtttctaat actcggtaat ttgcatgttt 2940tgtggagcat ttttatgtca ccaaccagac agtatttcct gcatgcttat ttagaagagg 3000cagcttatct tgagaggtag tgttatctac ctttgtcagg ctttttgaca ggtcatttca 3060gagtaagcct ttgttcccaa gacccaacaa ctgtcaccct cttctgtacc tctcctgagt 3120gccaactgtc caggccattt gacacaccat ctgttaacct ctgagtttgc ccactcaagg 3180ccactcatag gggcatccat ccccaagcac ctcctcatgc tgtgcatgca gtcttaaatt 3240caatggacaa aaataaaatg ctggctacct ctggatcatc tggctgagca actgaattac 3300aaaagagaat tacttccatc tcaacttcaa cccattgatt acgtccatcc tagcaagcta 3360aatggcatcc cagctgctcc tttctgtgca accaattaaa gaacaatgag tgtgatgctc 3420catgtctgaa tttcgtccag cctctctctg aactgtgatc tttgtcctca tgaactttcc 3480cttttgttca ttgaactata tggactcttc atttcatatt gatttactgt gcaatttact 3540tttggacatt gagaacttga aattatttcc tgatcccttc cccttccact attaataatt 3600catttctgtc aaactgtaag agtagactca tttttttttt tttagttttt aacattggac 3660tgttatttca tttagagttc tctatctcta aatatttatt tagagaatga ttttaaaagg 3720gaatgatatg cttgtttaaa tgaaagagaa aagctgtagt aaactgtgtt aattggtaat 3780gactatttat cgtcgatact ctgtagctgt gtaagttttg acaaatagtg tatctcgtgg 3840aatcagtggt tagcattgcc gctattatat ttactcattt tatcattata aatgtgctta 3900gttcatcatg tagcatca 3918153846DNAHomo sapiens 15ctctcccttc tccactctct ccccctgtct cctttcttct tcttctttca ccctccgtct 60ctcacacccc ctccattccc ctgtctcctt tctgacactg cactgcagct gctcctcagc 120cctgccccct ccccagtgag aacaaaccag caacattgct ttttttccta aagagattta 180tattgatccg attaaaaaaa aaaaacctta agaaacccca aacgcaaaaa aaaaaaaaaa 240aaaaaaagaa aaaagaaaag aaaaagccaa aacaaaaggg agaaccttct cccggtagca 300gcggcaggaa ctgcaaacat gatggcggca gctcccatcc agcagaacgg gacccacact 360ggggttccca tagacctgga cccgccggac tcgcggaaaa ggccgctgga agccccccct 420gaagccggca gcaccaagag gaccaatacg ggcgaagacg gccagtattt tctaaaggtt 480ctcataccta gttatgctgc tggatctata attgggaagg gaggacagac aattgttcag 540ttgcaaaaag aaactggagc caccatcaag ctgtctaagt ccaaagattt ttacccaggt 600actactgagc gagtgtgctt gatccaggga acggttgaag cactgaatgc agttcatgga 660ttcattgcag aaaaaattcg agaaatgccc caaaatgtgg ccaagacaga accagtcagc 720attctacaac cccagaccac cgttaatcca gatcgcatca aacaagtaaa gattatagtt 780cccaacagca cagcaggtct gataataggg aagggaggtg ctactgtgaa ggctgtaatg 840gagcagtcag gggcttgggt gcagctttcc cagaaacctg atgggatcaa cttgcaagag 900agggttgtca ctgtgagtgg agaacctgaa caaaaccgaa aagctgttga acttatcatc 960cagaagatac aagaggatcc acaaagtggc agctgtctca atatcagtta tgccaatgtg 1020acaggtccag tggcaaattc caatccaacc ggatctcctt atgcaaacac tgctgaagtg 1080ttaccaactg ctgcagcagc tgcagggcta ttaggacatg ctaaccttgc tggcgttgca 1140gcctttccag cagttttatc tggcttcaca ggcaatgacc tggtggccat cacctctgca 1200cttaatacat tagccagcta tggatataat ctcaacactt taggtttagg tctcagtcaa 1260gcagcagcaa caggggcttt ggctgcagca gctgccagtg ccaacccagc agcagcagca 1320gccaatttat tggccaccta tgccagtgaa gcctcagcca gtggcagcac agctggtggt 1380acggcgggga catttgcatt aggtagcctg gctgctgcta ctgctgcaac caatggatat 1440tttggagctg cttctcccct agctgccagt gccattctag gaacagaaaa gtccacagat 1500ggatccaagg atgtagttga aatagcagtg ccagaaaact tagttggtgc aatacttggc 1560aaaggaggga aaacattagt ggaataccag gagttgactg gtgcaaggat acagatctcc 1620aaaaaaggag aattcgtacc tggcacaagg aatcggaagg taaccattac tggaacacca 1680gctgcaacac aggctgctca atatttaatt acacaaagga tcacatatga gcaaggagtt 1740cgggctgcca atcctcagaa agtgggttga gtgccccagt tacacatcag attgttttaa 1800cccctccttt accccatttt caagaaggat gtactgtact ttgcagaagt gaagtttttc 1860tgttattaat atataattat gcaaatgaat gcgactatgt tgacaatgtg tatatgtaaa 1920taatatgtgt tttaccagat gtttcataga aagaattttt tcttgatctg ttttgttctc 1980tatactttgc ttgtgtatat ttgtcagagg tgtttctagt gtaagattta agcctgccat 2040tttaccagca ttattgtagt ttaatgattg aatgtagaca gggatatgcg tatagttttc 2100agtattagtt ctagataaca ctaaattaac tactgttagg ttgagtatgg tggggtcagt 2160gacctaaaat ggagtgaggc caaagcactg tcctgtaagt cttacttcct gcttagggca 2220cagtgaagta ggaaacaata ttttgaaaat aagttttaaa tttaaaatga tcaaaaagca 2280atatagttgc ataaaagcac tgtaaaatat ttaaaaggtt aaaactgtgg aaaattatat 2340tggtaagttt acagatcaat aaaagcacct gttctccatc tgaactagac aatggaaata 2400atgctgcatg ctggccatgg cccattcttc atcatttgta agttcaacaa aagttctcac 2460atggagtccc acctcttcag aggtttgtac atttgttttt aagcactgaa ttcactactg 2520atcccatcgc ctggccagta gaacagtcat tactccatta acatcctcac tgtttagaca 2580cataactgtg gtacagtgta ttggaaattt tataaacaaa agtgaaagtg ccaacaaatt 2640attgatagct gataatgttt cattatctgc aactgcttga taagtatgtt gcattttaag 2700agcttataat tgtgtataat ttgttaacac tagaaaccta ttagtattgt gaatgtagat 2760tttactgtga agctatctgt gatttagctg tttgctccca tgatggagtc tttgcagcat 2820ggcgctagca gccaatgcag tttctaatac tcggtaattt gcatgttttg tggagcattt 2880ttatgtcacc aaccagacag tatttcctgc atgcttattt agaagaggca gcttatcttg 2940agaggtagtg ttatctacct ttgtcaggct ttttgacagg tcatttcaga gtaagccttt 3000gttcccaaga cccaacaact gtcaccctct tctgtacctc tcctgagtgc caactgtcca 3060ggccatttga cacaccatct gttaacctct gagtttgccc actcaaggcc actcataggg 3120gcatccatcc ccaagcacct cctcatgctg tgcatgcagt cttaaattca atggacaaaa 3180ataaaatgct ggctacctct ggatcatctg gctgagcaac tgaattacaa aagagaatta 3240cttccatctc aacttcaacc cattgattac gtccatccta gcaagctaaa tggcatccca 3300gctgctcctt tctgtgcaac caattaaaga acaatgagtg tgatgctcca tgtctgaatt 3360tcgtccagcc tctctctgaa ctgtgatctt tgtcctcatg aactttccct tttgttcatt 3420gaactatatg gactcttcat ttcatattga tttactgtgc aatttacttt tggacattga 3480gaacttgaaa ttatttcctg atcccttccc cttccactat taataattca tttctgtcaa 3540actgtaagag tagactcatt tttttttttt tagtttttaa cattggactg ttatttcatt 3600tagagttctc tatctctaaa tatttattta gagaatgatt ttaaaaggga atgatatgct 3660tgtttaaatg aaagagaaaa gctgtagtaa actgtgttaa ttggtaatga ctatttatcg 3720tcgatactct gtagctgtgt aagttttgac aaatagtgta tctcgtggaa tcagtggtta 3780gcattgccgc tattatattt actcatttta tcattataaa tgtgcttagt tcatcatgta 3840gcatca 3846163340DNAHomo sapiens 16tgcgggcgtc tccgccattt tgtgagtcta taactcggag ccgttgggtc ggttcctgct 60attccggcgc ctccactccg tcccccgcgg gtctgctctg tgtgccatgg acggcattgt 120cccagatata gccgttggta caaagcgggg atctgacgag cttttctcta cttgtgtcac 180taacggaccg tttatcatga gcagcaactc ggcttctgca gcaaacggaa atgacagcaa 240gaagttcaaa ggtgacagcc gaagtgcagg cgtcccctct agagtgatcc acatccggaa 300gctccccatc gacgtcacgg agggggaagt catctccctg gggctgccct ttgggaaggt 360caccaacctc ctgatgctga aggggaaaaa ccaggccttc atcgagatga acacggagga 420ggctgccaac accatggtga actactacac ctcggtgacc cctgtgctgc gcggccagcc 480catctacatc cagttctcca accacaagga gctgaagacc gacagctctc ccaaccaggc 540gcgggcccag gcggccctgc aggcggtgaa ctcggtccag tcggggaacc tggccttggc 600tgcctcggcg gcggccgtgg acgcagggat ggcgatggcc gggcagagcc ccgtgctcag 660gatcatcgtg gagaacctct tctaccctgt gaccctggat gtgctgcacc agattttctc 720caagttcggc acagtgttga agatcatcac cttcaccaag aacaaccagt tccaggccct 780gctgcagtat gcggaccccg tgagcgccca gcacgccaag ctgtcgctgg acgggcagaa 840catctacaac gcctgctgca cgctgcgcat cgacttttcc aagctcacca gcctcaacgt 900caagtacaac aatgacaaga gccgtgacta cacacgccca gacctgcctt ccggggacag 960ccagccctcg ctggaccaga ccatggccgc ggccttcggt gcacctggta taatctcagc 1020ctctccgtat gcaggagctg gtttccctcc cacctttgcc attcctcaag ctgcaggcct 1080ttccgttccg aacgtccacg gcgccctggc ccccctggcc atcccctcgg cggcggcggc 1140agctgcggcg gcaggtcgga tcgccatccc gggcctggcg ggggcaggaa attctgtatt 1200gctggtcagc aacctcaacc cagagagagt cacaccccaa agcctcttta ttcttttcgg 1260cgtctacggt gacgtgcagc gcgtgaagat cctgttcaat aagaaggaga acgccctagt 1320gcagatggcg gacggcaacc aggcccagct ggccatgagc cacctgaacg ggcacaagct 1380gcacgggaag cccatccgca tcacgctctc gaagcaccag aacgtgcagc tgccccgcga 1440gggccaggag gaccagggcc tgaccaagga ctacggcaac tcacccctgc accgcttcaa 1500gaagccgggc tccaagaact tccagaacat attcccgccc tcggccacgc tgcacctctc 1560caacatcccg ccctcagtct ccgaggagga tctcaaggtc ctgttttcca gcaatggggg 1620cgtcgtcaaa ggattcaagt tcttccagaa ggaccgcaag atggcactga tccagatggg 1680ctccgtggag gaggcggtcc aggccctcat tgacctgcac aaccacgacc tcggggagaa 1740ccaccacctg cgggtctcct tctccaagtc caccatctag gggcacaggc ccccacggcc 1800gggccccctg gcgacaactt ccatcattcc agagaaaagc cactttaaaa acagctgaag 1860tgaccttagc agaccagaga ttttattttt ttaaagagaa atcagtttac ctgtttttaa 1920aaaaattaaa tctagttcac cttgctcacc ctgcggtgac agggacagct caggctcttg 1980gtgactgtgg cagcgggagt tcccggccct ccacacccgg ggccagaccc tcggggccat 2040gccttggtgg ggcctgtgtc gggcgtgggg cctgcaggtg ggcgccccga ccacgacttg 2100gcttccttgt gccttaaaaa acctgccttc ctgcagccac acacccaccc ggggtgtcct 2160ggggacccaa ggggtggggg ggtcacacca gagagaggca gggggcctgg ccggctcctg 2220caggatcatg cagctggggc gcggcggccg cggctgcgac accccaaccc cagccctcta 2280atcaagtcac gtgattctcc cttcaccccg cccccagggc cttcccttct gcccccaggc 2340gggctccccg ctgctccagc tgcggagctg gtcgacataa tctctgtatt atatactttg 2400cagttgcaga cgtctgtgcc tagcaatatt tccagttgac caaatattct aatctttttt 2460catttatatg caaaagaaat agttttaagt aactttttat agcaagatga tacaatggta 2520tgagtgtaat ctaaacttcc ttgtggtatt accttgtatg ctgttacttt tattttattc 2580cttgtaatta agtcacaggc aggacccagt ttccagagag caggcggggc cgcccagtgg 2640gtcaggcaca gggagccccg gtcctatctt agagcccctg agcttcaggg aaggggcggg 2700cgtgtcgccg cctctggcat cgcctccggt tgccttacac cacgccttca cctgcagtcg 2760cctagaaaac ttgctctcaa acttcagggt tttttcttcc ttcaaatttt ggaccaaagt 2820ctcatttctg tgttttgcct gcctctgatg ctgggacccg gaaggcgggc gctcctcctg 2880tcttctctgt gctctttcta ccgcccccgc gtcctgtccc gggggctctc ctaggatccc 2940ctttccgtaa aagcgtgtaa caagggtgta aatatttata attttttata cctgttgtga 3000gacccgaggg gcggcggcgc ggttttttat ggtgacacaa atgtatattt tgctaacagc 3060aattccaggc tcagtattgt gaccgcggag ccacagggga ccccacgcac attccgttgc 3120cttacccgat ggcttgtgac gcggagagaa ccgattaaaa ccgtttgaga aactcctccc 3180ttgtctagcc ctgtgttcgc tgtggacgct gtagaggcag gttggccagt ctgtacctgg 3240acttcgaata aatcttctgt atcctcgctc cgttccgcct taaaaaaaaa aaaaaaaaaa 3300aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3340173319DNAHomo sapiens 17tgcgggcgtc tccgccattt tgtgagtcta taactcggag ccgttgggtc ggttcctgct 60attccggcgc ctccactccg tcccccgcgg gtctgctctg tgtgccatgg acggcattgt 120cccagatata gccgttggta caaagcgggg atctgacgag cttttctcta cttgtgtcac 180taacggaccg tttatcatga gcagcaactc ggcttctgca gcaaacggaa atgacagcaa 240gaagttcaaa ggtgacagcc gaagtgcagg cgtcccctct agagtgatcc acatccggaa 300gctccccatc gacgtcacgg agggggaagt catctccctg gggctgccct ttgggaaggt 360caccaacctc ctgatgctga aggggaaaaa ccaggccttc atcgagatga acacggagga 420ggctgccaac accatggtga actactacac ctcggtgacc cctgtgctgc gcggccagcc 480catctacatc cagttctcca accacaagga gctgaagacc gacagctctc ccaaccaggc 540gcgggcccag gcggccctgc aggcggtgaa ctcggtccag tcggggaacc tggccttggc 600tgcctcggcg gcggccgtgg acgcagggat ggcgatggcc gggcagagcc ccgtgctcag 660gatcatcgtg gagaacctct tctaccctgt gaccctggat gtgctgcacc agattttctc 720caagttcggc acagtgttga agatcatcac cttcaccaag aacaaccagt tccaggccct 780gctgcagtat gcggaccccg tgagcgccca gcacgccaag ctgtcgctgg acgggcagaa 840catctacaac gcctgctgca cgctgcgcat cgacttttcc aagctcacca gcctcaacgt 900caagtacaac aatgacaaga gccgtgacta cacacgccca gacctgcctt ccggggacag 960ccagccctcg ctggaccaga ccatggccgc ggccttcgcc tctccgtatg caggagctgg 1020tttccctccc acctttgcca ttcctcaagc tgcaggcctt tccgttccga acgtccacgg 1080cgccctggcc cccctggcca tcccctcggc ggcggcggca gctgcggcgg caggtcggat 1140cgccatcccg ggcctggcgg gggcaggaaa ttctgtattg ctggtcagca acctcaaccc 1200agagagagtc acaccccaaa gcctctttat tcttttcggc gtctacggtg acgtgcagcg 1260cgtgaagatc ctgttcaata agaaggagaa cgccctagtg cagatggcgg acggcaacca 1320ggcccagctg gccatgagcc acctgaacgg gcacaagctg cacgggaagc ccatccgcat 1380cacgctctcg aagcaccaga acgtgcagct gccccgcgag ggccaggagg accagggcct 1440gaccaaggac tacggcaact cacccctgca ccgcttcaag aagccgggct ccaagaactt 1500ccagaacata ttcccgccct cggccacgct gcacctctcc aacatcccgc cctcagtctc 1560cgaggaggat ctcaaggtcc tgttttccag caatgggggc gtcgtcaaag gattcaagtt 1620cttccagaag gaccgcaaga tggcactgat ccagatgggc tccgtggagg aggcggtcca 1680ggccctcatt gacctgcaca accacgacct cggggagaac caccacctgc gggtctcctt 1740ctccaagtcc accatctagg ggcacaggcc cccacggccg ggccccctgg cgacaacttc 1800catcattcca gagaaaagcc actttaaaaa cagctgaagt gaccttagca gaccagagat 1860tttatttttt taaagagaaa tcagtttacc tgtttttaaa aaaattaaat ctagttcacc 1920ttgctcaccc tgcggtgaca gggacagctc aggctcttgg tgactgtggc agcgggagtt 1980cccggccctc cacacccggg gccagaccct cggggccatg ccttggtggg gcctgtgtcg 2040ggcgtggggc ctgcaggtgg gcgccccgac cacgacttgg cttccttgtg ccttaaaaaa 2100cctgccttcc tgcagccaca cacccacccg gggtgtcctg gggacccaag gggtgggggg 2160gtcacaccag agagaggcag ggggcctggc cggctcctgc aggatcatgc agctggggcg 2220cggcggccgc ggctgcgaca ccccaacccc agccctctaa tcaagtcacg tgattctccc 2280ttcaccccgc ccccagggcc ttcccttctg cccccaggcg ggctccccgc tgctccagct 2340gcggagctgg tcgacataat ctctgtatta tatactttgc agttgcagac gtctgtgcct 2400agcaatattt ccagttgacc aaatattcta atcttttttc atttatatgc aaaagaaata 2460gttttaagta actttttata gcaagatgat acaatggtat gagtgtaatc taaacttcct 2520tgtggtatta ccttgtatgc tgttactttt attttattcc ttgtaattaa gtcacaggca 2580ggacccagtt tccagagagc aggcggggcc gcccagtggg tcaggcacag ggagccccgg 2640tcctatctta gagcccctga gcttcaggga aggggcgggc gtgtcgccgc ctctggcatc 2700gcctccggtt gccttacacc acgccttcac ctgcagtcgc ctagaaaact tgctctcaaa 2760cttcagggtt ttttcttcct tcaaattttg gaccaaagtc tcatttctgt gttttgcctg 2820cctctgatgc tgggacccgg aaggcgggcg ctcctcctgt cttctctgtg ctctttctac 2880cgcccccgcg tcctgtcccg ggggctctcc taggatcccc tttccgtaaa agcgtgtaac 2940aagggtgtaa atatttataa ttttttatac ctgttgtgag acccgagggg cggcggcgcg 3000gttttttatg gtgacacaaa tgtatatttt gctaacagca attccaggct cagtattgtg 3060accgcggagc cacaggggac cccacgcaca ttccgttgcc ttacccgatg gcttgtgacg 3120cggagagaac cgattaaaac cgtttgagaa actcctccct tgtctagccc tgtgttcgct 3180gtggacgctg tagaggcagg ttggccagtc tgtacctgga cttcgaataa atcttctgta 3240tcctcgctcc gttccgcctt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3300aaaaaaaaaa aaaaaaaaa 3319183262DNAHomo sapiens 18tgcgggcgtc tccgccattt tgtgagtcta taactcggag ccgttgggtc ggttcctgct 60attccggcgc ctccactccg tcccccgcgg gtctgctctg tgtgccatgg acggcattgt 120cccagatata gccgttggta caaagcgggg atctgacgag cttttctcta cttgtgtcac 180taacggaccg tttatcatga gcagcaactc ggcttctgca gcaaacggaa atgacagcaa 240gaagttcaaa ggtgacagcc gaagtgcagg cgtcccctct agagtgatcc acatccggaa

300gctccccatc gacgtcacgg agggggaagt catctccctg gggctgccct ttgggaaggt 360caccaacctc ctgatgctga aggggaaaaa ccaggccttc atcgagatga acacggagga 420ggctgccaac accatggtga actactacac ctcggtgacc cctgtgctgc gcggccagcc 480catctacatc cagttctcca accacaagga gctgaagacc gacagctctc ccaaccaggc 540gcgggcccag gcggccctgc aggcggtgaa ctcggtccag tcggggaacc tggccttggc 600tgcctcggcg gcggccgtgg acgcagggat ggcgatggcc gggcagagcc ccgtgctcag 660gatcatcgtg gagaacctct tctaccctgt gaccctggat gtgctgcacc agattttctc 720caagttcggc acagtgttga agatcatcac cttcaccaag aacaaccagt tccaggccct 780gctgcagtat gcggaccccg tgagcgccca gcacgccaag ctgtcgctgg acgggcagaa 840catctacaac gcctgctgca cgctgcgcat cgacttttcc aagctcacca gcctcaacgt 900caagtacaac aatgacaaga gccgtgacta cacacgccca gacctgcctt ccggggacag 960ccagccctcg ctggaccaga ccatggccgc ggccttcggc ctttccgttc cgaacgtcca 1020cggcgccctg gcccccctgg ccatcccctc ggcggcggcg gcagctgcgg cggcaggtcg 1080gatcgccatc ccgggcctgg cgggggcagg aaattctgta ttgctggtca gcaacctcaa 1140cccagagaga gtcacacccc aaagcctctt tattcttttc ggcgtctacg gtgacgtgca 1200gcgcgtgaag atcctgttca ataagaagga gaacgcccta gtgcagatgg cggacggcaa 1260ccaggcccag ctggccatga gccacctgaa cgggcacaag ctgcacggga agcccatccg 1320catcacgctc tcgaagcacc agaacgtgca gctgccccgc gagggccagg aggaccaggg 1380cctgaccaag gactacggca actcacccct gcaccgcttc aagaagccgg gctccaagaa 1440cttccagaac atattcccgc cctcggccac gctgcacctc tccaacatcc cgccctcagt 1500ctccgaggag gatctcaagg tcctgttttc cagcaatggg ggcgtcgtca aaggattcaa 1560gttcttccag aaggaccgca agatggcact gatccagatg ggctccgtgg aggaggcggt 1620ccaggccctc attgacctgc acaaccacga cctcggggag aaccaccacc tgcgggtctc 1680cttctccaag tccaccatct aggggcacag gcccccacgg ccgggccccc tggcgacaac 1740ttccatcatt ccagagaaaa gccactttaa aaacagctga agtgacctta gcagaccaga 1800gattttattt ttttaaagag aaatcagttt acctgttttt aaaaaaatta aatctagttc 1860accttgctca ccctgcggtg acagggacag ctcaggctct tggtgactgt ggcagcggga 1920gttcccggcc ctccacaccc ggggccagac cctcggggcc atgccttggt ggggcctgtg 1980tcgggcgtgg ggcctgcagg tgggcgcccc gaccacgact tggcttcctt gtgccttaaa 2040aaacctgcct tcctgcagcc acacacccac ccggggtgtc ctggggaccc aaggggtggg 2100ggggtcacac cagagagagg cagggggcct ggccggctcc tgcaggatca tgcagctggg 2160gcgcggcggc cgcggctgcg acaccccaac cccagccctc taatcaagtc acgtgattct 2220cccttcaccc cgcccccagg gccttccctt ctgcccccag gcgggctccc cgctgctcca 2280gctgcggagc tggtcgacat aatctctgta ttatatactt tgcagttgca gacgtctgtg 2340cctagcaata tttccagttg accaaatatt ctaatctttt ttcatttata tgcaaaagaa 2400atagttttaa gtaacttttt atagcaagat gatacaatgg tatgagtgta atctaaactt 2460ccttgtggta ttaccttgta tgctgttact tttattttat tccttgtaat taagtcacag 2520gcaggaccca gtttccagag agcaggcggg gccgcccagt gggtcaggca cagggagccc 2580cggtcctatc ttagagcccc tgagcttcag ggaaggggcg ggcgtgtcgc cgcctctggc 2640atcgcctccg gttgccttac accacgcctt cacctgcagt cgcctagaaa acttgctctc 2700aaacttcagg gttttttctt ccttcaaatt ttggaccaaa gtctcatttc tgtgttttgc 2760ctgcctctga tgctgggacc cggaaggcgg gcgctcctcc tgtcttctct gtgctctttc 2820taccgccccc gcgtcctgtc ccgggggctc tcctaggatc ccctttccgt aaaagcgtgt 2880aacaagggtg taaatattta taatttttta tacctgttgt gagacccgag gggcggcggc 2940gcggtttttt atggtgacac aaatgtatat tttgctaaca gcaattccag gctcagtatt 3000gtgaccgcgg agccacaggg gaccccacgc acattccgtt gccttacccg atggcttgtg 3060acgcggagag aaccgattaa aaccgtttga gaaactcctc ccttgtctag ccctgtgttc 3120gctgtggacg ctgtagaggc aggttggcca gtctgtacct ggacttcgaa taaatcttct 3180gtatcctcgc tccgttccgc cttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3240aaaaaaaaaa aaaaaaaaaa aa 3262191148DNAHomo sapiens 19gcgccgagac ccgctcctgc agtattagtt cttgcagctg gtggtggcgg ctgaggcggc 60atggatctca gcgagctgga gagagacaat acaggccgct gtcgcctgag ttcgcctgtg 120cccgcggtgt gccgcaagga gccttgcgtc ctgggcgtcg atgaggcggg caggggcccc 180gtgctgggcc ccatggtcta cgccatctgt tattgtcccc tgcctcgcct ggcagatctg 240gaggcgctga aagtggcaga ctcaaagacc ctattggaga gcgagcggga aaggctgttt 300gcgaaaatgg aggacacgga ctttgtcggc tgggcgctgg atgtgctgtc tccaaacctc 360atctctacca gcatgcttgg gcgggtcaaa tacaacctga actccctgtc acatgataca 420gccactgggc ttatacagta tgcattggac cagggcgtga acgtcaccca ggtattcgtg 480gacaccgtag ggatgccaga gacataccag gcgcggctgc agcaaagttt tcccgggatt 540gaggtgacgg tcaaggccaa agcagatgcc ctctacccgg tggttagtgc tgccagcatc 600tgtgccaagg tggcccggga ccaggccgtg aagaaatggc agttcgtgga gaaactgcag 660gacttggata ctgattatgg ctcaggctac cccaatgatc ccaagacaaa agcgtggttg 720aaggagcacg tggagcctgt gttcggcttc ccccagtttg tccggttcag ctggcgcacg 780gcccagacca tcctggagaa agaggcggaa gatgttatat gggaggactc agcatccgag 840aatcaggagg gactcaggaa gatcacatcc tacttcctca atgaagggtc ccaagcccgt 900ccccgttctt cccaccgata tttcctggaa cgcggcctgg agtcagcaac cagcctctag 960cagctgcctc tacgcgctct acctgcttcc ccaacccaga cattaaaatt gtttaaggag 1020aaccacacgt aggggatgta cttttgggac agaagcaagg tgggagtgtg ctctgcagcc 1080gggtccagct acttcctttt ggaaccttaa atagaatggg tgttggttga ttaattttat 1140ttaaaaaa 1148201613DNAHomo sapiens 20ggagaaacac acacgggcgg gcggagggga cccggggcga gtcatcaagg gcgcgtggtt 60cggcgtgcca ggcgcgctgc tctgcctgct ctcttggctt ctgtctccct tcgaccgatc 120gccccctatc ctgaagcttt ccaatgtcat cttggagccc caaagtttcc tggggcctcc 180gcgttgtgcg tcccagaacc ccttgcctgc ccctgaggga aacgcggagc cataggcagc 240gggacgtcgg gagccagccc aggggaggcc agattcagca tttggacagc ggctctgggg 300cgcagtcggc ccagcgagtt tgccggtgaa cagcctcggg cacatggcgg gtaggagggc 360cgcagggctg ctctgggtct tgaagaagca ggacccagcc tagagggcat ccccagctcc 420gaatgggaca cgttttcccg agataaaaga tcccttctga gctcacacgg gagccccggg 480accatccaat ccagcgtgga tatccccagc ctaaccaaca cctgtgctgg ggggaaagat 540aagacgcccc ctttcagcca ggaggtggac gaccctcatg ccctcagctc tccattcttc 600ccaaagcagc tcggatccct aagtctggag ctgccagcga ggcttccaac ccgctgcttg 660ccatcacctc ccaggtcgtt ggtggctccg attactcccc tgctggtgcc tccctccttg 720gcgcgcttcc cacctgcgat cggcgccctc ttcgcagtca cgaactcgcc agcagctagc 780agcactgact agtaggaggg cccgccggag gagagccgcg cggcccacag aagcggaacg 840cgcgtcgaga gcgccctgtc cgctcgcccc agacagatgc ccggttattc attaccgcga 900ggcctagagg aaagagtggc tgccgtcttc ctgcccacag cccgccggac cctccgtcgc 960ggctgcccgg tccccggagc cgcagccgcc gagcccggct gtgcgtgtcg tggctgctgg 1020ggagaaagag gcttccggac atgctctgga gtcagaagac agcgaaaaga gaagcagaag 1080ccccggtggc aagagtctga aggaaggatg actgtagcct gtggattgta ctgcagtagg 1140aaactgtcct agcaaggctc cactttgccc cagcttcaag ctggaaagga ggagaacatg 1200aaacattgct tgaagacaat ggccgagaca gcaggtccca ccctgcacag ccaccagcat 1260ctctcccctc agccctgtct cctcttctgc agttgggatc tgcacattta agcctgaaat 1320tgtcctgtga agtgaagtat gatcggacag cctcttttca gcttttatga caatggagac 1380agaggaattg tggctcttgc caaggtcaca ggattggaat acagagccaa gccaccccag 1440gacatgcaag agcctcagaa gggaaaaaag cccagcagga agggagaaca agtagcctct 1500gtcctgaagt tgtaacagcc aggggccagg atggaggagg aggaccccat aatctgccca 1560tctgggactt ggcaggggac ctgggaaaat gtaccccaac ccatccctta agg 161321606RNAHomo sapiens 21ccugcuggug ccucccuccu uggcgcgcuu cccaccugcg aucggcgccc ucuucgcagu 60cacgaacucg ccagcagcua gcagcacuga cuaguaggag ggcccgccgg aggagaggac 120augcucugga gucagaagac agcgaaaaga gaagcagaag ccccgguggc aagagucuga 180agcuggaaag gaggagaaca ugaaacauug cuugaagaca auggccgaga cagcaggucc 240cacccugcac agccaccagc aucucucccc ucagcccugu cuccucuucu gcaguuggga 300ucugcacauu uaagccugaa auuguccugu gaagugaagu augaucggac agccucuuuu 360cagcuuuuau gacaauggag acagaggaau uguggcucuu gccaagguca caggauugga 420auacagagcc aagccacccc aggacaugca agagccucag aagggaaaaa agcccagcag 480gaagggagaa caaguagccu cuguccugaa guuguaacag ccaggggcca ggauggagga 540ggaggacccc auaaucugcc caucugggac uuggcagggg accugggaaa auguacccca 600acccau 60622551DNAHomo sapiens 22cttcgcagtc acgaactcgc cagcagctag cagcactgac tagtaggagg gcccgccgga 60ggagaggaag ccccagagag attggtgagg gtgatttccc aggaagacgc agtgtgctct 120gacttctgtg acagtgagca acgggaccag tggatgtcca gatgctggca atgagacatg 180ctctggagtc agaagacagc gaaaagagaa gcagaagccc cggtggcaag agtctgaagg 240aaggatgact gtagcctgtg gattgtactg cagtaggaaa ctgtcctagc aaggctccac 300tttgccccag cttcaagctg gaaaggagga gaacatgaaa cattgcttga agacaatggc 360cgagacagca ggtcccaccc tgcacagcca ccagcatctc tcccctcagc cctgtctcct 420cttctgcagt tgggatctgc acatttaagc ctgaaattgt cctgtgaagt gaagtatgat 480cggacagcct cttttcagct tttatgacaa tggagacaga ggaattgtgg ctcttgccaa 540ggtcacagga t 551231877DNAHomo sapiens 23tcgccagcag ctagcagcac tgactagtag gagggcccgc cggaggagag ccgcgcggcc 60cacagaagcg gaacgcgcgt cgagagcgcc ctgtccgctc gccccagaca gatgcccggt 120tattcattac cgcgaggcct agaggaaaga gtggctgccg tcttcctgcc cacagcccgc 180cggaccctcc gtcgcggctg cccggtcccc ggagccgcag ccgccgagcc cggctgtgcg 240tgtcgtggct gctggggaga aagaggcttc cggaagcccc agagagattg gtgagggtga 300tttcccagga agacgcagtg tgctctgact tctgtgacag tgagcaacgg gaccagtgga 360tgtccagatg ctggcaatga gacatgctct ggagtcagaa gacagcgaaa agagaagcag 420aagccccggt ggcaagagtc tgaagcagga aggatgactg tagcctgtgg attgtactgc 480agtaggaaac tgtcctagca aggctccact ttgccccagc ttcaagctgg aaaggaggag 540aacatgaaac attgcttgaa gacaatggcc gagacagcag gtcccaccct gcacagccac 600cagcatctct cccctcagcc ctgtctcctc ttctgcagtt gggatctgca catttaagcc 660tgaaattgtc ctgtgaagtg aagtatgatc ggacagcctc ttttcagctt ttatgacaat 720ggagacagag gaattgtggc tcttgccaag gtcacaggat tggaatacag agccaagcca 780ccccaggaca tgcaagagcc tcagaaggga aaaaagccca gcaggaaggg agaacaagta 840gcctctgtcc tgaagttgta acagccaggg gccaggatgg aggaggagga ccccataatc 900tgcccatctg ggacttggca ggggacctgg gaaaatgtac cccaacccat cccttaaggg 960cctttgtctt tggcccattg gcctagcatc tacttcttca ccgtgtctgt tcttgtcaca 1020cctagtcagg tctgtttggg tctgaggtgc atggaacatt ctgggtaggc ctccagcaaa 1080cggaagctct tcaccgtgtt tccagcctgg gaccaagggc agcatactgg caaagttgcc 1140aaagcaaggg actccagcct cttaggagtt aatgactccc tctccccagc tgtcctcccc 1200ttggtgctcc tcttcctccc tcctcctgct cacagcaggc agggcctaga cccgggagcc 1260atgctgctgt gctgttgcca ggggagcacg gaggcagatc tgagctatgc agggaaaagg 1320cccagcctgt caaagtgtct gagatgaacc gccgccgtcc ctgtgcagct gggctcagac 1380gtgtctcagc tcttgttctg tgcctgagaa tggcgaaacc cagtgaggtt caagggcaaa 1440ctcgctattc attagtcagg ggttcttgac gtcccgtctc tcccagggat gagttccccc 1500ctcctctttc tccccctcct atgacacatt cctgggtgcc tttggtgagg actgcacacc 1560ctcctcctgc ctagccccct ctccaaaggc ccctgaataa actcccccca aggagaccag 1620gcagggcaga gacaatggct gcaggaaatc attcaggcgg gacatgctgg cctgccctcc 1680acccagtccc cctgtgggcc ccactccctt ctgattcagg gcacccttgg gcccccagcc 1740tatacaggcc tggacaggaa gaaaccactg ggaaccaccc taaggacaac atgctagtcc 1800agtgccattc ttcgctggct ctgtgggtgc ctttgtggcc tgtaccgact ggctggctaa 1860ttttgtggtt tctgtac 187724561DNAHomo sapiens 24agcactgact agtaggaggg cccgccggag gagaggacat gctctggagt cagaagacag 60cgaaaagaga agcagaagcc ccggtggcaa gagtctgaag caggaaggat gactgtagcc 120tgtggattgt actgcagtag gaaactgtcc tagcaaggct ccactttgcc ccagcttcaa 180gctggaaagg aggagaacat gaaacattgc ttgaagacaa tggccgagac agcaggtccc 240accctgcaca gccaccagca tctctcccct cagccctgtc tcctcttctg cagttgggat 300ctgcacattt aagcctgaaa ttgtcctgtg aagtgaagta tgatcggaca gcctcttttc 360agcttttatg acaatggaga cagaggaatt gtggctcttg ccaaggtcac aggattggaa 420tacagagcca agccacccca ggacatgcaa gagcctcaga agggaaaaaa gcccagcagg 480aagggagaac aagtagcctc tgtcctgaag ttgtaacagc caggggccag gatggaggag 540gaggacccca taatctgccc a 56125786DNAHomo sapiens 25ctgggagtgg cgcggctgct tcccgcccgc gcaggatcag gccggccccc gcgggcctgg 60agctggatcc agagctaggg aaactggaaa aacaggcaca aactcggaag ccgcggtacg 120gcaagagcct aagcaaagaa tcctttccaa gattcacacc tcgtctacac cagggcaccg 180cctgggccta cggccttccg aacccgaagc gcccgcagcc cagagctggc atcaggccat 240caggccggga aggtcgtcgc aggccccaga gtgcgggcgc ggggggcgcg cgcccacagg 300acgcccgggg ttgggtaggc aggagagaag ggcgccagca ggcccgcggc tgtttcccct 360cggtccgcac agcgggcccg ggaggccatt ttgagagcgc gaagaggggc ggcaagatgg 420ctgcgtgggc acccggaagg tcgccgcgcc aagggcccgc tgagcccctc ctcccattcg 480tccagccgcg cggcccacag aagcggaacg cgcgtcgaga gcgccctgtc cgctcgcccc 540agacagatgc ccggttattc attaccgcga ggcctagagg aaagagtggc tgccgtcttc 600ctgcccacag cccgccggac cctccgtcgc ggctgcccgg tccccggagc cgcagccgcc 660gagcccggct gtgcgtgtcg tggctgctgg ggagaaagag gcttccggac atgctctgga 720gtcagaagac agcgaaaaga gaagcagaag ccccggtggc aagagtctga agcaggaagg 780atgact 78626569DNAHomo sapiens 26ttaccgcgag gcctagagga aagagtggct gccgtcttcc tgcccacagc ccgccggacc 60ctccgtcgcg gctgcccggt ccccggagcc gcagccgccg agcccggctg tgcgtgtcgt 120ggctgctggg gagaaagagg cttccggaca tgctctggag tcagaagaca gcgaaaagag 180aagcagaagc cccggtggca agagtctgaa gggagaaaat aacccagttt gggaaggaca 240tttaaaaggg gaaaatatta ggaaggatga ctgtagcctg tggattgtac tgcagtagga 300aactgtccta gcaaggctcc actttgcccc agcttcaagc tggaaaggag gagaacatga 360aacattgctt gaagacaatg gccgagacag caggtcccac cctgcacagc caccagcatc 420tctcccctca gccctgtctc ctcttctgca gttgggatct gcacatttaa gcctgaaatt 480gtcctgtgaa gtgaagtatg atcggacagc ctcttttcag cttttatgac aatggagaca 540gaggaattgt ggctcttgcc aaggtcaca 569273779DNAHomo sapiens 27gtgtcgtggc tgctggggag aaagaggctt ccggacatgc tctggagtca gaagacagcg 60aaaagagaag cagaagcccc ggtggcaaga gtctgaagca ggaaggatga ctgtagcctg 120tggattgtac tgcagtagga aactgtccta gcaaggctcc actttgcccc agcttcaagc 180tggaaaggag gagaacatga aacattgctt gaagacaatg gccgagacag caggtcccac 240cctgcacagc caccagcatc tctcccctca gccctgtctc ctcttctgca gttgggatct 300gcacatttaa gcctgaaatt gtcctgtgaa gtgaagtatg atcggacagc ctcttttcag 360cttttatgac aatggagaca gaggaattgt ggctcttgcc aaggtcacag gattggaata 420cagagccaag ccaccccagg acatgcaaga gcctcagaag ggaaaaaagc ccagcaggaa 480gggagaacaa gtagcctctg tcctgaagtt gtaacagcca ggggccagga tggaggagga 540ggaccccata atctgcccat ctgggacttg gcaggggacc tgggaaaatg taccccaacc 600catcccttaa gggcctttgt ctttggccca ttggcctagc atctacttct tcaccgtgtc 660tgttcttgtc acacctagtc aggtctgttt gggtctgagg tgcatggaac attctgggta 720ggcctccagc aaacggaagc tcttcaccgt gtttccagcc tgggaccaag ggcagcatac 780tggcaaagtt gccaaagcaa gggactccag cctcttagga gttaatgact ccctctcccc 840agctgtcctc cccttggtgc tcctcttcct ccctcctcct gctcacagca ggcagggcct 900agacccggga gccatgctgc tgtgctgttg ccaggggagc acggaggcag atctgagcta 960tgcagggaaa aggcccagcc tgtcaaagtg tctgagatga accgccgccg tccctgtgca 1020gctgggctca gacgtgtctc agctcttgtt ctgtgcctga gaatggcgaa acccagtgag 1080gttcaagggc aaactcgcta ttcattagtc aggggttctt gacgtcccgt ctctcccagg 1140gatgagttcc cccctcctct ttctccccct cctatgacac attcctgggt gcctttggtg 1200aggactgcac accctcctcc tgcctagccc cctctccaaa ggcccctgaa taaactcccc 1260ccaaggagac caggcagggc agagacaatg gctgcaggaa atcattcagg cgggacatgc 1320tggcctgccc tccacccagt ccccctgtgg gccccactcc cttctgattc agggcaccct 1380tgggccccca gcctatacag gcctggacag gaagaaacca ctgggaacca ccctaaggac 1440aacatgctag tccagtgcca ttcttcgctg gctctgtggg tgcctttgtg gcctgtaccg 1500actggctggc taattttgtg gtttctgtac catcacatgc ctattttaag acactctcca 1560gcactgtcgg ttagggagtg taaattttgc aatattttct gaaatgtggc aatatcaaaa 1620tgtaaaaggc acacatactt ggtcacaaac aaatggcact atttactctg tgggcatatt 1680tgtaaaagtt gccaaagaat tatatacaag gatgttcatc agagcatttc ttttgaagag 1740taaagaaatg gacatgaacc tgtggtccgt tcatacggtg gaatacctat gcagctgtaa 1800aaatcagtgt ggtagatctc cgtatatgag ttgatgtgga aggttggcca gttcacatga 1860taaggtgaat agaataagtt acagaacagg ctgtagagta tgatcttatt tgtagatgtt 1920taaaactgag tcataagtat gcttatatac agatcgtttc tggaagtatg tactggaagt 1980ctacctctgg ggagtgggga tgggggagtg cactcttcta tactgttata ttttcttttc 2040atgctcctaa ggtactttta ttggaagatg taaagcggtt caatgtaata ggcttaactt 2100ctgtcaacta agttggcgtg ggtgctttaa gagggtggta gtgatgttgc tggagaaagt 2160atcccacagt cactggtggc ttcagccacg ggccattttg gggcctaata atcacatatc 2220atcatggttg ctagtgttaa tcgaaaacct actaagtgcc aggcttactg tctctgggtc 2280ttgcttacgt ggatgtcatt tttccagttg caccaaatcg aaagaggtta attggtttgt 2340tggagttcct ttgtaggtga agggcagagc caggagcttg gctagggaca ggggaggtga 2400gtgggggatg gtggataggt cttggctccc agtttccttc tgggcagaca ttgcccctct 2460gccctgagga cctgcttgtt tgggggaaga ggcctttaga ggcaccaggg tcatgccagg 2520tgttggacat ggtgaactgg gaagtgctcc catctggcca cagcgcagaa gtatcaccgt 2580gctgggggat ggggaacagg gctgtgaatg ggcctatttg cataagcagc atgtgtctgg 2640agagaaagac atcacagagc agaagagtgc gggtgcccag gagtgcactt gccaccccta 2700cttcatccct gaaagagtaa atggcctgga aggtgtctct gagaggtaat gccgcacacc 2760accctccctg ggggcagggt caggctacac ctgccttagg tcgggggctg cagcagcctg 2820agagctctca gtagggcctc agtagcctgg gagggagcag gggcaggggg cagggaaaga 2880ggcgtaatgg ggctgtccag aggggcctgg gaaacctggt ccctgaggcc tgggcacagc 2940tacaatcact tcaaattggc tgtggggcca gtggactggg aaggaaaaaa gcaataagag 3000tgaccaagtg cagaaggctg tcaggtccca ggtcacatgc cttagtgcag tgactcctca 3060tcattttatg gggtgtgggt gtcgttggta cacccatttt acagatgagg acaccgaggc 3120ccagaaaagt taagttacat gtcctaagtc acacagcttg taagtgccag aactgagatc 3180aaaaccaagt ctctttgact ttaaagtctg tactctgacc ccaaagagat cctgtttggc 3240cacttatagg aggtccctaa agctgcagac tccccttgcc ggcacccaca tatagagaca 3300ttaacccttc ccctgcaggg tcacctcaaa tagtctttta gctgggcttc tcctgcaatt 3360ccacctaatg ccatcccctg ggttttgccc aaacctgaac tgggcagtgg ggtgagagga 3420ggggtttaca gggttacaga gcctcataca gataggagcc catggctgct ggtcatctgc 3480attcctgcag gattggctgt tccttggggt ccttggcagg aaaatgagga ttgctccgag 3540gcctgctcca gtacttccca gaggctggcc tggtgtgggg ctctgggaag gctgaggctg 3600gagaagcgta agtaggaggg cagagatggc actcaggtag cttgaatcac caggaccctt 3660ccaagcccca caggttctga gggagtacta gggccagctc tgggagaggt ctcttcctat 3720gctgtgaacc ccctgccttt cttgcagcct acaacgaata aattttcttt gcaaaggct 3779283670DNAHomo sapiens 28cttccggaca tgctctggag tcagaagaca gcgaaaagag

aagcagaagc cccggtggca 60agagtctgaa gctggaaagg aggagaacat gaaacattgc ttgaagacaa tggccgagac 120agcaggtccc accctgcaca gccaccagca tctctcccct cagccctgtc tcctcttctg 180cagttgggat ctgcacattt aagcctgaaa ttgtcctgtg aagtgaagta tgatcggaca 240gcctcttttc agcttttatg acaatggaga cagaggaatt gtggctcttg ccaaggtcac 300aggattggaa tacagagcca agccacccca ggacatgcaa gagcctcaga agggaaaaaa 360gcccagcagg aagggagaac aagtagcctc tgtcctgaag ttgtaacagc caggggccag 420gatggaggag gaggacccca taatctgccc atctgggact tggcagggga cctgggaaaa 480tgtaccccaa cccatccctt aagggccttt gtctttggcc cattggccta gcatctactt 540cttcaccgtg tctgttcttg tcacacctag tcaggtctgt ttgggtctga ggtgcatgga 600acattctggg taggcctcca gcaaacggaa gctcttcacc gtgtttccag cctgggacca 660agggcagcat actggcaaag ttgccaaagc aagggactcc agcctcttag gagttaatga 720ctccctctcc ccagctgtcc tccccttggt gctcctcttc ctccctcctc ctgctcacag 780caggcagggc ctagacccgg gagccatgct gctgtgctgt tgccagggga gcacggaggc 840agatctgagc tatgcaggga aaaggcccag cctgtcaaag tgtctgagat gaaccgccgc 900cgtccctgtg cagctgggct cagacgtgtc tcagctcttg ttctgtgcct gagaatggcg 960aaacccagtg aggttcaagg gcaaactcgc tattcattag tcaggggttc ttgacgtccc 1020gtctctccca gggatgagtt cccccctcct ctttctcccc ctcctatgac acattcctgg 1080gtgcctttgg tgaggactgc acaccctcct cctgcctagc cccctctcca aaggcccctg 1140aataaactcc ccccaaggag accaggcagg gcagagacaa tggctgcagg aaatcattca 1200ggcgggacat gctggcctgc cctccaccca gtccccctgt gggccccact cccttctgat 1260tcagggcacc cttgggcccc cagcctatac aggcctggac aggaagaaac cactgggaac 1320caccctaagg acaacatgct agtccagtgc cattcttcgc tggctctgtg ggtgcctttg 1380tggcctgtac cgactggctg gctaattttg tggtttctgt accatcacat gcctatttta 1440agacactctc cagcactgtc ggttagggag tgtaaatttt gcaatatttt ctgaaatgtg 1500gcaatatcaa aatgtaaaag gcacacatac ttggtcacaa acaaatggca ctatttactc 1560tgtgggcata tttgtaaaag ttgccaaaga attatataca aggatgttca tcagagcatt 1620tcttttgaag agtaaagaaa tggacatgaa cctgtggtcc gttcatacgg tggaatacct 1680atgcagctgt aaaaatcagt gtggtagatc tccgtatatg agttgatgtg gaaggttggc 1740cagttcacat gataaggtga atagaataag ttacagaaca ggctgtagag tatgatctta 1800tttgtagatg tttaaaactg agtcataagt atgcttatat acagatcgtt tctggaagta 1860tgtactggaa gtctacctct ggggagtggg gatgggggag tgcactcttc tatactgtta 1920tattttcttt tcatgctcct aaggtacttt tattggaaga tgtaaagcgg ttcaatgtaa 1980taggcttaac ttctgtcaac taagttggcg tgggtgcttt aagagggtgg tagtgatgtt 2040gctggagaaa gtatcccaca gtcactggtg gcttcagcca cgggccattt tggggcctaa 2100taatcacata tcatcatggt tgctagtgtt aatcgaaaac ctactaagtg ccaggcttac 2160tgtctctggg tcttgcttac gtggatgtca tttttccagt tgcaccaaat cgaaagaggt 2220taattggttt gttggagttc ctttgtaggt gaagggcaga gccaggagct tggctaggga 2280caggggaggt gagtggggga tggtggatag gtcttggctc ccagtttcct tctgggcaga 2340cattgcccct ctgccctgag gacctgcttg tttgggggaa gaggccttta gaggcaccag 2400ggtcatgcca ggtgttggac atggtgaact gggaagtgct cccatctggc cacagcgcag 2460aagtatcacc gtgctggggg atggggaaca gggctgtgaa tgggcctatt tgcataagca 2520gcatgtgtct ggagagaaag acatcacaga gcagaagagt gcgggtgccc aggagtgcac 2580ttgccacccc tacttcatcc ctgaaagagt aaatggcctg gaaggtgtct ctgagaggta 2640atgccgcaca ccaccctccc tgggggcagg gtcaggctac acctgcctta ggtcgggggc 2700tgcagcagcc tgagagctct cagtagggcc tcagtagcct gggagggagc aggggcaggg 2760ggcagggaaa gaggcgtaat ggggctgtcc agaggggcct gggaaacctg gtccctgagg 2820cctgggcaca gctacaatca cttcaaattg gctgtggggc cagtggactg ggaaggaaaa 2880aagcaataag agtgaccaag tgcagaaggc tgtcaggtcc caggtcacat gccttagtgc 2940agtgactcct catcatttta tggggtgtgg gtgtcgttgg tacacccatt ttacagatga 3000ggacaccgag gcccagaaaa gttaagttac atgtcctaag tcacacagct tgtaagtgcc 3060agaactgaga tcaaaaccaa gtctctttga ctttaaagtc tgtactctga ccccaaagag 3120atcctgtttg gccacttata ggaggtccct aaagctgcag actccccttg ccggcaccca 3180catatagaga cattaaccct tcccctgcag ggtcacctca aatagtcttt tagctgggct 3240tctcctgcaa ttccacctaa tgccatcccc tgggttttgc ccaaacctga actgggcagt 3300ggggtgagag gaggggttta cagggttaca gagcctcata cagataggag cccatggctg 3360ctggtcatct gcattcctgc aggattggct gttccttggg gtccttggca ggaaaatgag 3420gattgctccg aggcctgctc cagtacttcc cagaggctgg cctggtgtgg ggctctggga 3480aggctgaggc tggagaagcg taagtaggag ggcagagatg gcactcaggt agcttgaatc 3540accaggaccc ttccaagccc cacaggttct gagggagtac tagggccagc tctgggagag 3600gtctcttcct atgctgtgaa ccccctgcct ttcttgcagc ctacaacgaa taaattttct 3660ttgcaaaggc 3670293784DNAHomo sapiens 29gctaacagct tcaggagaat tcagcctcac cttgacagga catgctctgg agtcagaaga 60cagcgaaaag agaagcagaa gccccggtgg caagagtctg aagcaggaag gatgactgta 120gcctgtggat tgtactgcag taggaaactg tcctagcaag gctccacttt gccccagctt 180caagctggaa aggaggagaa catgaaacat tgcttgaaga caatggccga gacagcaggt 240cccaccctgc acagccacca gcatctctcc cctcagccct gtctcctctt ctgcagttgg 300gatctgcaca tttaagcctg aaattgtcct gtgaagtgaa gtatgatcgg acagcctctt 360ttcagctttt atgacaatgg agacagagga attgtggctc ttgccaaggt cacaggattg 420gaatacagag ccaagccacc ccaggacatg caagagcctc agaagggaaa aaagcccagc 480aggaagggag aacaagtagc ctctgtcctg aagttgtaac agccaggggc caggatggag 540gaggaggacc ccataatctg cccatctggg acttggcagg ggacctggga aaatgtaccc 600caacccatcc cttaagggcc tttgtctttg gcccattggc ctagcatcta cttcttcacc 660gtgtctgttc ttgtcacacc tagtcaggtc tgtttgggtc tgaggtgcat ggaacattct 720gggtaggcct ccagcaaacg gaagctcttc accgtgtttc cagcctggga ccaagggcag 780catactggca aagttgccaa agcaagggac tccagcctct taggagttaa tgactccctc 840tccccagctg tcctcccctt ggtgctcctc ttcctccctc ctcctgctca cagcaggcag 900ggcctagacc cgggagccat gctgctgtgc tgttgccagg ggagcacgga ggcagatctg 960agctatgcag ggaaaaggcc cagcctgtca aagtgtctga gatgaaccgc cgccgtccct 1020gtgcagctgg gctcagacgt gtctcagctc ttgttctgtg cctgagaatg gcgaaaccca 1080gtgaggttca agggcaaact cgctattcat tagtcagggg ttcttgacgt cccgtctctc 1140ccagggatga gttcccccct cctctttctc cccctcctat gacacattcc tgggtgcctt 1200tggtgaggac tgcacaccct cctcctgcct agccccctct ccaaaggccc ctgaataaac 1260tccccccaag gagaccaggc agggcagaga caatggctgc aggaaatcat tcaggcggga 1320catgctggcc tgccctccac ccagtccccc tgtgggcccc actcccttct gattcagggc 1380acccttgggc ccccagccta tacaggcctg gacaggaaga aaccactggg aaccacccta 1440aggacaacat gctagtccag tgccattctt cgctggctct gtgggtgcct ttgtggcctg 1500taccgactgg ctggctaatt ttgtggtttc tgtaccatca catgcctatt ttaagacact 1560ctccagcact gtcggttagg gagtgtaaat tttgcaatat tttctgaaat gtggcaatat 1620caaaatgtaa aaggcacaca tacttggtca caaacaaatg gcactattta ctctgtgggc 1680atatttgtaa aagttgccaa agaattatat acaaggatgt tcatcagagc atttcttttg 1740aagagtaaag aaatggacat gaacctgtgg tccgttcata cggtggaata cctatgcagc 1800tgtaaaaatc agtgtggtag atctccgtat atgagttgat gtggaaggtt ggccagttca 1860catgataagg tgaatagaat aagttacaga acaggctgta gagtatgatc ttatttgtag 1920atgtttaaaa ctgagtcata agtatgctta tatacagatc gtttctggaa gtatgtactg 1980gaagtctacc tctggggagt ggggatgggg gagtgcactc ttctatactg ttatattttc 2040ttttcatgct cctaaggtac ttttattgga agatgtaaag cggttcaatg taataggctt 2100aacttctgtc aactaagttg gcgtgggtgc tttaagaggg tggtagtgat gttgctggag 2160aaagtatccc acagtcactg gtggcttcag ccacgggcca ttttggggcc taataatcac 2220atatcatcat ggttgctagt gttaatcgaa aacctactaa gtgccaggct tactgtctct 2280gggtcttgct tacgtggatg tcatttttcc agttgcacca aatcgaaaga ggttaattgg 2340tttgttggag ttcctttgta ggtgaagggc agagccagga gcttggctag ggacagggga 2400ggtgagtggg ggatggtgga taggtcttgg ctcccagttt ccttctgggc agacattgcc 2460cctctgccct gaggacctgc ttgtttgggg gaagaggcct ttagaggcac cagggtcatg 2520ccaggtgttg gacatggtga actgggaagt gctcccatct ggccacagcg cagaagtatc 2580accgtgctgg gggatgggga acagggctgt gaatgggcct atttgcataa gcagcatgtg 2640tctggagaga aagacatcac agagcagaag agtgcgggtg cccaggagtg cacttgccac 2700ccctacttca tccctgaaag agtaaatggc ctggaaggtg tctctgagag gtaatgccgc 2760acaccaccct ccctgggggc agggtcaggc tacacctgcc ttaggtcggg ggctgcagca 2820gcctgagagc tctcagtagg gcctcagtag cctgggaggg agcaggggca gggggcaggg 2880aaagaggcgt aatggggctg tccagagggg cctgggaaac ctggtccctg aggcctgggc 2940acagctacaa tcacttcaaa ttggctgtgg ggccagtgga ctgggaagga aaaaagcaat 3000aagagtgacc aagtgcagaa ggctgtcagg tcccaggtca catgccttag tgcagtgact 3060cctcatcatt ttatggggtg tgggtgtcgt tggtacaccc attttacaga tgaggacacc 3120gaggcccaga aaagttaagt tacatgtcct aagtcacaca gcttgtaagt gccagaactg 3180agatcaaaac caagtctctt tgactttaaa gtctgtactc tgaccccaaa gagatcctgt 3240ttggccactt ataggaggtc cctaaagctg cagactcccc ttgccggcac ccacatatag 3300agacattaac ccttcccctg cagggtcacc tcaaatagtc ttttagctgg gcttctcctg 3360caattccacc taatgccatc ccctgggttt tgcccaaacc tgaactgggc agtggggtga 3420gaggaggggt ttacagggtt acagagcctc atacagatag gagcccatgg ctgctggtca 3480tctgcattcc tgcaggattg gctgttcctt ggggtccttg gcaggaaaat gaggattgct 3540ccgaggcctg ctccagtact tcccagaggc tggcctggtg tggggctctg ggaaggctga 3600ggctggagaa gcgtaagtag gagggcagag atggcactca ggtagcttga atcaccagga 3660cccttccaag ccccacaggt tctgagggag tactagggcc agctctggga gaggtctctt 3720cctatgctgt gaaccccctg cctttcttgc agcctacaac gaataaattt tctttgcaaa 3780ggct 378430779DNAHomo sapiens 30gcacgacttg ttcttgcctt ctaaagcaga gaggagcttt tgtgggtagt tcctacaggg 60atacatggta gaaaattcac caaacccagt gctggagtgt ttctcttcct cagaagaaat 120cagatgctgt tcagagcacg aaggctagaa ttttaccctg gttctcatgc taccttgcac 180ccaggttgga tcctgagtac agtttttggc aggaagcccc agagagattg gtgagggtga 240tttcccagga agacgcagtg tgctctgact tctgtgacag tgagcaacgg gaccagtgga 300tgtccagatg ctggcaatga gacatgctct ggagtcagaa gacagcgaaa agagaagcag 360aagccccggt ggcaagagtc tgaaggaagg atgactgtag cctgtggatt gtactgcagt 420aggaaactgt cctagcaagg ctccactttg ccccagcttc aaggtatatc gtctcaaaat 480gcaggggact tcagatgagt tttgagcacc ctttctttta ttataaaaaa aattccagac 540agttcagcca atactgacta agggctgaga ccagttccat gcttttctgt ctccagagga 600atttgcttcc atctggatgc ctgaaacgct ggaaaggagg agaacatgaa acattgcttg 660aagacaatgg ccgagacagc aggtcccacc ctgcacagcc accagcatct ctcccctcag 720ccctgtctcc tcttctgcag ttgggatctg cacatttaag cctgaaattg tcctgtgaa 779311611DNAHomo sapiens 31ggagaaacac acacgggcgg gcggagggga cccggggcga gtcatcaagg gcgcgtggtt 60cggcgtgcca ggcgcgctgc tctgcctgct ctcttggctt ctgtctccct tcgaccgatc 120gccccctatc ctgaagcttt ccaatgtcat cttggagccc caaagtttcc tggggcctcc 180gcgttgtgcg tcccagaacc ccttgcctgc ccctgaggga aacgcggagc cataggcagc 240gggacgtcgg gagccagccc aggggaggcc agattcagca tttggacagc ggctctgggg 300cgcagtcggc ccagcgagtt tgccggtgaa cagcctcggg cacatggcgg gtaggagggc 360cgcagggctg ctctgggtct tgaagaagca ggacccagcc tagagggcat ccccagctcc 420gaatgggaca cgttttcccg agataaaaga tcccttctga gctcacacgg gagccccggg 480accatccaat ccagcgtgga tatccccagc ctaaccaaca cctgtgctgg ggggaaagat 540aagacgcccc ctttcagcca ggaggtggac gaccctcatg ccctcagctc tccattcttc 600ccaaagcagc tcggatccct aagtctggag ctgccagcga ggcttccaac ccgctgcttg 660ccatcacctc ccaggtcgtt ggtggctccg attactcccc tgctggtgcc tccctccttg 720gcgcgcttcc cacctgcgat cggcgccctc ttcgcagtca cgaactcgcc agcagctagc 780agcactgact agtaggaggg cccgccggag gagagccgcg cggcccacag aagcggaacg 840cgcgtcgaga gcgccctgtc cgctcgcccc agacagatgc ccggttattc attaccgcga 900ggcctagagg aaagagtggc tgccgtcttc ctgcccacag cccgccggac cctccgtcgc 960ggctgcccgg tccccggagc cgcagccgcc gagcccggct gtgcgtgtcg tggctgctgg 1020ggagaaagag gcttccggac atgctctgga gtcagaagac agcgaaaaga gaagcagaag 1080ccccggtggc aagagtctga aggaaggatg actgtagcct gtggattgta ctgcagtagg 1140aaactgtcct agcaaggctc cactttgccc cagcttcaag ctggaaagga ggagaacatg 1200aaacattgct tgaagacaat ggccgagaca gcaggtccca ccctgcacag ccaccagcat 1260ctctcccctc agccctgtct cctcttctgc agttgggatc tgcacattta agcctgaaat 1320tgtcctgtga agtgaagtat gatcggacag cctcttttca gcttttatga caatggagac 1380agaggaattg tggctcttgc caaggtcaca ggattggaat acagagccaa gccaccccag 1440gacatgcaag agcctcagaa gggaaaaaag cccagcagga agggagaaca agtagcctct 1500gtcctgaagt tgtaacagcc aggggccagg atggaggagg aggaccccat aatctgccca 1560tctgggactt ggcaggggac ctgggaaaat gtaccccaac ccatccctta a 1611322718DNAHomo sapiens 32atcaagcgat cctcccacct gggcctccca aagtgttgag attacagcat gagccaccac 60acccagacta aaaggcagtt tgattttaca aatcaaaata gcagtaatct atggagattt 120acttgtgaga ttggtaggaa acatcttaaa tgtaatcaaa caataactta catcttgatg 180aattcacgtg taggtttctc ttcctcagaa gaaatcagat gctgttcaga gcacgaaggc 240tagaatttta ccctggttct catgctacct tgcacccagg ttggatcctg agtacagttt 300ttggcaggtg ggcctgcata taagttagca atgggggata cccagctgcc tctcttcata 360cagctgaggt tttggggagt cattcttata gcccctgggt tgggcctagt cctgcaaatg 420aattcaccag ccctaaagcc caaattgcag cctctgtcat tcaccttcca ggagtggaaa 480gggcagtaag tttcatctta ttattattgc tattttggtg gttttgttga ggttggtgtg 540tgtatgttag taagataaag ctctcagaaa ttacatagca tttgtcaagg atataagagg 600gactgtgcca catctggctg tatagaaggt ggttccatat ctttaaatag agccccaggt 660ccttagccac cagaaaggtt ttcaggggaa gtgtgcaccc tcagcagctg ctgctggtgg 720gcaggatggg cacgcatgga acaggctttc ctctgtggcc aggtgagaag caggtggtga 780gacacagagc agtgctgggc tctgcttctg aagcctccaa cctttccttc cctaggaagc 840cccagagaga ttggtgaggg tgatttccca ggaagacgca gtgtgctctg acttctgtga 900cagtgagcaa cgggaccagt ggatgtccag atgctggcaa tgagtaggcc ttccctacgc 960tgggtggcgt ccacaccctc cggcttccat tgcctgggtc tcctggaggt ggtttgctgg 1020atgaataccg catgcacaga ggctggcctt gggtttgaat atggcagcca gtggacagca 1080tgtgcttcag ttatgagact gcccaggaga tgcttcttcc aaggcagagc acgtgcagag 1140tccagtgctg gagaggccgg gtgcgcagtt gacccatttc cagttctgtt ttccctctca 1200tgttcctctg tccccatcta ggacatgctc tggagtcaga agacagcgaa aagagaagca 1260gaagccccgg tggcaagagt ctgaagcagg aaggatgact gtagcctgtg gattgtactg 1320cagtaggaaa ctgtcctagc aaggctccac tttgccccag cttcaagctg gaaaggagga 1380gaacatgaaa cattgcttga agacaatggc cgagacagca ggtcccaccc tgcacagcca 1440ccagcatctc tcccctcagc cctgtctcct cttctgcagt tgggatctgc acatttaagc 1500ctgaaattgt cctgtgaagt gaagtatgat cggacagcct cttttcagct tttatgacaa 1560tggagacaga ggaattgtgg ctcttgccaa ggtcacagga ttggaataca gagccaagcc 1620accccaggac atgcaagagc ctcagaaggg aaaaaagccc agcaggaagg gagaacaagt 1680agcctctgtc ctgaagttgt aacagccagg ggccaggatg gaggaggagg accccataat 1740ctgcccatct gggacttggc aggggacctg ggaaaatgta ccccaaccca tcccttaagg 1800gcctttgtct ttggcccatt ggcctagcat ctacttcttc accgtgtctg ttcttgtcac 1860acctagtcag gtctgtttgg gtctgaggtg catggaacat tctgggtagg cctccagcaa 1920acggaagctc ttcaccgtgt ttccagcctg ggaccaaggg cagcatactg gcaaagttgc 1980caaagcaagg gactccagcc tcttaggagt taatgactcc ctctccccag ctgtcctccc 2040cttggtgctc ctcttcctcc ctcctcctgc tcacagcagg cagggcctag acccgggagc 2100catgctgctg tgctgttgcc aggggagcac ggaggcagat ctgagctatg cagggaaaag 2160gcccagcctg tcaaagtgtc tgagatgaac cgccgccgtc cctgtgcagc tgggctcaga 2220cgtgtctcag ctcttgttct gtgcctgaga atggcgaaac ccagtgaggt tcaagggcaa 2280actcgctatt cattagtcag gggttcttga cgtcccgtct ctcccaggga tgagttcccc 2340cctcctcttt ctccccctcc tatgacacat tcctgggtgc ctttggtgag gactgcacac 2400cctcctcctg cctagccccc tctccaaagg cccctgaata aactcccccc aaggagacca 2460ggcagggcag agacaatggc tgcaggaaat cattcaggcg ggacatgctg gcctgccctc 2520cacccagtcc ccctgtgggc cccactccct tctgattcag ggcacccttg ggcccccagc 2580ctatacaggc ctggacagga agaaaccact gggaaccacc ctaaggacaa catgctagtc 2640cagtgccatt cttcgctggc tctgtgggtg cctttgtggc ctgtaccgac tggctggcta 2700attttgtggt ttctgtac 2718333723DNAHomo sapiens 33gagagattgg tgagggtgat ttcccaggaa gacgcagtgt gctctgactt ctgtgacaga 60catgctctgg agtcagaaga cagcgaaaag agaagcagaa gccccggtgg caagagtctg 120aagctggaaa ggaggagaac atgaaacatt gcttgaagac aatggccgag acagcaggtc 180ccaccctgca cagccaccag catctctccc ctcagccctg tctcctcttc tgcagttggg 240atctgcacat ttaagcctga aattgtcctg tgaagtgaag tatgatcgga cagcctcttt 300tcagctttta tgacaatgga gacagaggaa ttgtggctct tgccaaggtc acaggattgg 360aatacagagc caagccaccc caggacatgc aagagcctca gaagggaaaa aagcccagca 420ggaagggaga acaagtagcc tctgtcctga agttgtaaca gccaggggcc aggatggagg 480aggaggaccc cataatctgc ccatctggga cttggcaggg gacctgggaa aatgtacccc 540aacccatccc ttaagggcct ttgtctttgg cccattggcc tagcatctac ttcttcaccg 600tgtctgttct tgtcacacct agtcaggtct gtttgggtct gaggtgcatg gaacattctg 660ggtaggcctc cagcaaacgg aagctcttca ccgtgtttcc agcctgggac caagggcagc 720atactggcaa agttgccaaa gcaagggact ccagcctctt aggagttaat gactccctct 780ccccagctgt cctccccttg gtgctcctct tcctccctcc tcctgctcac agcaggcagg 840gcctagaccc gggagccatg ctgctgtgct gttgccaggg gagcacggag gcagatctga 900gctatgcagg gaaaaggccc agcctgtcaa agtgtctgag atgaaccgcc gccgtccctg 960tgcagctggg ctcagacgtg tctcagctct tgttctgtgc ctgagaatgg cgaaacccag 1020tgaggttcaa gggcaaactc gctattcatt agtcaggggt tcttgacgtc ccgtctctcc 1080cagggatgag ttcccccctc ctctttctcc ccctcctatg acacattcct gggtgccttt 1140ggtgaggact gcacaccctc ctcctgccta gccccctctc caaaggcccc tgaataaact 1200ccccccaagg agaccaggca gggcagagac aatggctgca ggaaatcatt caggcgggac 1260atgctggcct gccctccacc cagtccccct gtgggcccca ctcccttctg attcagggca 1320cccttgggcc cccagcctat acaggcctgg acaggaagaa accactggga accaccctaa 1380ggacaacatg ctagtccagt gccattcttc gctggctctg tgggtgcctt tgtggcctgt 1440accgactggc tggctaattt tgtggtttct gtaccatcac atgcctattt taagacactc 1500tccagcactg tcggttaggg agtgtaaatt ttgcaatatt ttctgaaatg tggcaatatc 1560aaaatgtaaa aggcacacat acttggtcac aaacaaatgg cactatttac tctgtgggca 1620tatttgtaaa agttgccaaa gaattatata caaggatgtt catcagagca tttcttttga 1680agagtaaaga aatggacatg aacctgtggt ccgttcatac ggtggaatac ctatgcagct 1740gtaaaaatca gtgtggtaga tctccgtata tgagttgatg tggaaggttg gccagttcac 1800atgataaggt gaatagaata agttacagaa caggctgtag agtatgatct tatttgtaga 1860tgtttaaaac tgagtcataa gtatgcttat atacagatcg tttctggaag tatgtactgg 1920aagtctacct ctggggagtg gggatggggg agtgcactct tctatactgt tatattttct 1980tttcatgctc ctaaggtact tttattggaa gatgtaaagc ggttcaatgt aataggctta 2040acttctgtca actaagttgg cgtgggtgct ttaagagggt ggtagtgatg ttgctggaga 2100aagtatccca cagtcactgg tggcttcagc cacgggccat tttggggcct aataatcaca 2160tatcatcatg gttgctagtg ttaatcgaaa acctactaag tgccaggctt actgtctctg 2220ggtcttgctt acgtggatgt catttttcca gttgcaccaa atcgaaagag gttaattggt

2280ttgttggagt tcctttgtag gtgaagggca gagccaggag cttggctagg gacaggggag 2340gtgagtgggg gatggtggat aggtcttggc tcccagtttc cttctgggca gacattgccc 2400ctctgccctg aggacctgct tgtttggggg aagaggcctt tagaggcacc agggtcatgc 2460caggtgttgg acatggtgaa ctgggaagtg ctcccatctg gccacagcgc agaagtatca 2520ccgtgctggg ggatggggaa cagggctgtg aatgggccta tttgcataag cagcatgtgt 2580ctggagagaa agacatcaca gagcagaaga gtgcgggtgc ccaggagtgc acttgccacc 2640cctacttcat ccctgaaaga gtaaatggcc tggaaggtgt ctctgagagg taatgccgca 2700caccaccctc cctgggggca gggtcaggct acacctgcct taggtcgggg gctgcagcag 2760cctgagagct ctcagtaggg cctcagtagc ctgggaggga gcaggggcag ggggcaggga 2820aagaggcgta atggggctgt ccagaggggc ctgggaaacc tggtccctga ggcctgggca 2880cagctacaat cacttcaaat tggctgtggg gccagtggac tgggaaggaa aaaagcaata 2940agagtgacca agtgcagaag gctgtcaggt cccaggtcac atgccttagt gcagtgactc 3000ctcatcattt tatggggtgt gggtgtcgtt ggtacaccca ttttacagat gaggacaccg 3060aggcccagaa aagttaagtt acatgtccta agtcacacag cttgtaagtg ccagaactga 3120gatcaaaacc aagtctcttt gactttaaag tctgtactct gaccccaaag agatcctgtt 3180tggccactta taggaggtcc ctaaagctgc agactcccct tgccggcacc cacatataga 3240gacattaacc cttcccctgc agggtcacct caaatagtct tttagctggg cttctcctgc 3300aattccacct aatgccatcc cctgggtttt gcccaaacct gaactgggca gtggggtgag 3360aggaggggtt tacagggtta cagagcctca tacagatagg agcccatggc tgctggtcat 3420ctgcattcct gcaggattgg ctgttccttg gggtccttgg caggaaaatg aggattgctc 3480cgaggcctgc tccagtactt cccagaggct ggcctggtgt ggggctctgg gaaggctgag 3540gctggagaag cgtaagtagg agggcagaga tggcactcag gtagcttgaa tcaccaggac 3600ccttccaagc cccacaggtt ctgagggagt actagggcca gctctgggag aggtctcttc 3660ctatgctgtg aaccccctgc ctttcttgca gcctacaacg aataaatttt ctttgcaaag 3720gct 3723344778RNAHomo sapiens 34ggagaaacac acacgggcgg gcggagggga cccggggcga gucaucaagg gcgcgugguu 60cggcgugcca ggcgcgcugc ucugccugcu cucuuggcuu cugucucccu ucgaccgauc 120gcccccuauc cugaagcuuu ccaaugucau cuuggagccc caaaguuucc uggggccucc 180gcguugugcg ucccagaacc ccuugccugc cccugaggga aacgcggagc cauaggcagc 240gggacgucgg gagccagccc aggggaggcc agauucagca uuuggacagc ggcucugggg 300cgcagucggc ccagcgaguu ugccggugaa cagccucggg cacauggcgg guaggagggc 360cgcagggcug cucugggucu ugaagaagca ggacccagcc uagagggcau ccccagcucc 420gaaugggaca cguuuucccg agauaaaaga ucccuucuga gcucacacgg gagccccggg 480accauccaau ccagcgugga uauccccagc cuaaccaaca ccugugcugg ggggaaagau 540aagacgcccc cuuucagcca ggagguggac gacccucaug cccucagcuc uccauucuuc 600ccaaagcagc ucggaucccu aagucuggag cugccagcga ggcuuccaac ccgcugcuug 660ccaucaccuc ccaggucguu gguggcuccg auuacucccc ugcuggugcc ucccuccuug 720gcgcgcuucc caccugcgau cggcgcccuc uucgcaguca cgaacucgcc agcagcuagc 780agcacugacu aguaggaggg cccgccggag gagagccgcg cggcccacag aagcggaacg 840cgcgucgaga gcgcccuguc cgcucgcccc agacagaugc ccgguuauuc auuaccgcga 900ggccuagagg aaagaguggc ugccgucuuc cugcccacag cccgccggac ccuccgucgc 960ggcugcccgg uccccggagc cgcagccgcc gagcccggcu gugcgugucg uggcugcugg 1020ggagaaagag gcuuccggac augcucugga gucagaagac agcgaaaaga gaagcagaag 1080ccccgguggc aagagucuga aggaaggaug acuguagccu guggauugua cugcaguagg 1140aaacuguccu agcaaggcuc cacuuugccc cagcuucaag cuggaaagga ggagaacaug 1200aaacauugcu ugaagacaau ggccgagaca gcagguccca cccugcacag ccaccagcau 1260cucuccccuc agcccugucu ccucuucugc aguugggauc ugcacauuua agccugaaau 1320uguccuguga agugaaguau gaucggacag ccucuuuuca gcuuuuauga caauggagac 1380agaggaauug uggcucuugc caaggucaca ggauuggaau acagagccaa gccaccccag 1440gacaugcaag agccucagaa gggaaaaaag cccagcagga agggagaaca aguagccucu 1500guccugaagu uguaacagcc aggggccagg auggaggagg aggaccccau aaucugccca 1560ucugggacuu ggcaggggac cugggaaaau guaccccaac ccaucccuua agggccuuug 1620ucuuuggccc auuggccuag caucuacuuc uucaccgugu cuguucuugu cacaccuagu 1680caggucuguu ugggucugag gugcauggaa cauucugggu aggccuccag caaacggaag 1740cucuucaccg uguuuccagc cugggaccaa gggcagcaua cuggcaaagu ugccaaagca 1800agggacucca gccucuuagg aguuaaugac ucccucuccc cagcuguccu ccccuuggug 1860cuccucuucc ucccuccucc ugcucacagc aggcagggcc uagacccggg agccaugcug 1920cugugcuguu gccaggggag cacggaggca gaucugagcu augcagggaa aaggcccagc 1980cugucaaagu gucugagaug aaccgccgcc gucccugugc agcugggcuc agacgugucu 2040cagcucuugu ucugugccug agaauggcga aacccaguga gguucaaggg caaacucgcu 2100auucauuagu cagggguucu ugacgucccg ucucucccag ggaugaguuc cccccuccuc 2160uuucuccccc uccuaugaca cauuccuggg ugccuuuggu gaggacugca cacccuccuc 2220cugccuagcc cccucuccaa aggccccuga auaaacuccc cccaaggaga ccaggcaggg 2280cagagacaau ggcugcagga aaucauucag gcgggacaug cuggccugcc cuccacccag 2340ucccccugug ggccccacuc ccuucugauu cagggcaccc uugggccccc agccuauaca 2400ggccuggaca ggaagaaacc acugggaacc acccuaagga caacaugcua guccagugcc 2460auucuucgcu ggcucugugg gugccuuugu ggccuguacc gacuggcugg cuaauuuugu 2520gguuucugua ccaucacaug ccuauuuuaa gacacucucc agcacugucg guuagggagu 2580guaaauuuug caauauuuuc ugaaaugugg caauaucaaa auguaaaagg cacacauacu 2640uggucacaaa caaauggcac uauuuacucu gugggcauau uuguaaaagu ugccaaagaa 2700uuauauacaa ggauguucau cagagcauuu cuuuugaaga guaaagaaau ggacaugaac 2760cugugguccg uucauacggu ggaauaccua ugcagcugua aaaaucagug ugguagaucu 2820ccguauauga guugaugugg aagguuggcc aguucacaug auaaggugaa uagaauaagu 2880uacagaacag gcuguagagu augaucuuau uuguagaugu uuaaaacuga gucauaagua 2940ugcuuauaua cagaucguuu cuggaaguau guacuggaag ucuaccucug gggagugggg 3000augggggagu gcacucuucu auacuguuau auuuucuuuu caugcuccua agguacuuuu 3060auuggaagau guaaagcggu ucaauguaau aggcuuaacu ucugucaacu aaguuggcgu 3120gggugcuuua agaggguggu agugauguug cuggagaaag uaucccacag ucacuggugg 3180cuucagccac gggccauuuu ggggccuaau aaucacauau caucaugguu gcuaguguua 3240aucgaaaacc uacuaagugc caggcuuacu gucucugggu cuugcuuacg uggaugucau 3300uuuuccaguu gcaccaaauc gaaagagguu aauugguuug uuggaguucc uuuguaggug 3360aagggcagag ccaggagcuu ggcuagggac aggggaggug agugggggau gguggauagg 3420ucuuggcucc caguuuccuu cugggcagac auugccccuc ugcccugagg accugcuugu 3480uugggggaag aggccuuuag aggcaccagg gucaugccag guguuggaca uggugaacug 3540ggaagugcuc ccaucuggcc acagcgcaga aguaucaccg ugcuggggga uggggaacag 3600ggcugugaau gggccuauuu gcauaagcag caugugucug gagagaaaga caucacagag 3660cagaagagug cgggugccca ggagugcacu ugccaccccu acuucauccc ugaaagagua 3720aauggccugg aaggugucuc ugagagguaa ugccgcacac cacccucccu gggggcaggg 3780ucaggcuaca ccugccuuag gucgggggcu gcagcagccu gagagcucuc aguagggccu 3840caguagccug ggagggagca ggggcagggg gcagggaaag aggcguaaug gggcugucca 3900gaggggccug ggaaaccugg ucccugaggc cugggcacag cuacaaucac uucaaauugg 3960cuguggggcc aguggacugg gaaggaaaaa agcaauaaga gugaccaagu gcagaaggcu 4020gucagguccc aggucacaug ccuuagugca gugacuccuc aucauuuuau gggguguggg 4080ugucguuggu acacccauuu uacagaugag gacaccgagg cccagaaaag uuaaguuaca 4140uguccuaagu cacacagcuu guaagugcca gaacugagau caaaaccaag ucucuuugac 4200uuuaaagucu guacucugac cccaaagaga uccuguuugg ccacuuauag gaggucccua 4260aagcugcaga cuccccuugc cggcacccac auauagagac auuaacccuu ccccugcagg 4320gucaccucaa auagucuuuu agcugggcuu cuccugcaau uccaccuaau gccauccccu 4380ggguuuugcc caaaccugaa cugggcagug gggugagagg agggguuuac aggguuacag 4440agccucauac agauaggagc ccauggcugc uggucaucug cauuccugca ggauuggcug 4500uuccuugggg uccuuggcag gaaaaugagg auugcuccga ggccugcucc aguacuuccc 4560agaggcuggc cugguguggg gcucugggaa ggcugaggcu ggagaagcgu aaguaggagg 4620gcagagaugg cacucaggua gcuugaauca ccaggacccu uccaagcccc acagguucug 4680agggaguacu agggccagcu cugggagagg ucucuuccua ugcugugaac ccccugccuu 4740ucuugcagcc uacaacgaau aaauuuucuu ugcaaagg 4778354113RNAHomo sapiens 35gauucucaca acuucugcgu gcgagcgccc gccccaccga ccgccccggc ccggcccgca 60agagccagag gagccgagag gagcccagcg ccggcccagc ggacuccagc ucgacggagc 120ggccgcgccc cgaccaguua cuccccugcu ggugccuccc uccuuggcgc gcuucccacc 180ugcgaucggc gcccucuucg cagucacgaa cucgccagca gcuagcagca cugacuagua 240ggagggcccg ccggaggaga ggaagcccca gagagauugg ugagggugau uucccaggaa 300gacgcagugu gcucugacuu cugugacagu gagcaacggg accaguggau guccagaugc 360uggcaaugag acaugcucug gagucagaag acagcgaaaa gagaagcaga agccccggug 420gcaagagucu gaagcaggaa ggaugacugu agccugugga uuguacugca guaggaaacu 480guccuagcaa ggcuccacuu ugccccagcu ucaagcugga aaggaggaga acaugaaaca 540uugcuugaag acaauggccg agacagcagg ucccacccug cacagccacc agcaucucuc 600cccucagccc ugucuccucu ucugcaguug ggaucugcac auuuaagccu gaaauugucc 660ugugaaguga aguaugaucg gacagccucu uuucagcuuu uaugacaaug gagacagagg 720aauuguggcu cuugccaagg ucacaggauu ggaauacaga gccaagccac cccaggacau 780gcaagagccu cagaagggaa aaaagcccag caggaaggga gaacaaguag ccucuguccu 840gaaguuguaa cagccagggg ccaggaugga ggaggaggac cccauaaucu gcccaucugg 900gacuuggcag gggaccuggg aaaauguacc ccaacccauc ccuuaagggc cuuugucuuu 960ggcccauugg ccuagcaucu acuucuucac cgugucuguu cuugucacac cuagucaggu 1020cuguuugggu cugaggugca uggaacauuc uggguaggcc uccagcaaac ggaagcucuu 1080caccguguuu ccagccuggg accaagggca gcauacuggc aaaguugcca aagcaaggga 1140cuccagccuc uuaggaguua augacucccu cuccccagcu guccuccccu uggugcuccu 1200cuuccucccu ccuccugcuc acagcaggca gggccuagac ccgggagcca ugcugcugug 1260cuguugccag gggagcacgg aggcagaucu gagcuaugca gggaaaaggc ccagccuguc 1320aaagugucug agaugaaccg ccgccguccc ugugcagcug ggcucagacg ugucucagcu 1380cuuguucugu gccugagaau ggcgaaaccc agugagguuc aagggcaaac ucgcuauuca 1440uuagucaggg guucuugacg ucccgucucu cccagggaug aguucccccc uccucuuucu 1500cccccuccua ugacacauuc cugggugccu uuggugagga cugcacaccc uccuccugcc 1560uagcccccuc uccaaaggcc ccugaauaaa cuccccccaa ggagaccagg cagggcagag 1620acaauggcug caggaaauca uucaggcggg acaugcuggc cugcccucca cccagucccc 1680cugugggccc cacucccuuc ugauucaggg cacccuuggg cccccagccu auacaggccu 1740ggacaggaag aaaccacugg gaaccacccu aaggacaaca ugcuagucca gugccauucu 1800ucgcuggcuc ugugggugcc uuuguggccu guaccgacug gcuggcuaau uuugugguuu 1860cuguaccauc acaugccuau uuuaagacac ucuccagcac ugucgguuag ggaguguaaa 1920uuuugcaaua uuuucugaaa uguggcaaua ucaaaaugua aaaggcacac auacuugguc 1980acaaacaaau ggcacuauuu acucuguggg cauauuugua aaaguugcca aagaauuaua 2040uacaaggaug uucaucagag cauuucuuuu gaagaguaaa gaaauggaca ugaaccugug 2100guccguucau acgguggaau accuaugcag cuguaaaaau caguguggua gaucuccgua 2160uaugaguuga uguggaaggu uggccaguuc acaugauaag gugaauagaa uaaguuacag 2220aacaggcugu agaguaugau cuuauuugua gauguuuaaa acugagucau aaguaugcuu 2280auauacagau cguuucugga aguauguacu ggaagucuac cucuggggag uggggauggg 2340ggagugcacu cuucuauacu guuauauuuu cuuuucaugc uccuaaggua cuuuuauugg 2400aagauguaaa gcgguucaau guaauaggcu uaacuucugu caacuaaguu ggcgugggug 2460cuuuaagagg gugguaguga uguugcugga gaaaguaucc cacagucacu gguggcuuca 2520gccacgggcc auuuuggggc cuaauaauca cauaucauca ugguugcuag uguuaaucga 2580aaaccuacua agugccaggc uuacugucuc ugggucuugc uuacguggau gucauuuuuc 2640caguugcacc aaaucgaaag agguuaauug guuuguugga guuccuuugu aggugaaggg 2700cagagccagg agcuuggcua gggacagggg aggugagugg gggauggugg auaggucuug 2760gcucccaguu uccuucuggg cagacauugc cccucugccc ugaggaccug cuuguuuggg 2820ggaagaggcc uuuagaggca ccagggucau gccagguguu ggacauggug aacugggaag 2880ugcucccauc uggccacagc gcagaaguau caccgugcug ggggaugggg aacagggcug 2940ugaaugggcc uauuugcaua agcagcaugu gucuggagag aaagacauca cagagcagaa 3000gagugcgggu gcccaggagu gcacuugcca ccccuacuuc aucccugaaa gaguaaaugg 3060ccuggaaggu gucucugaga gguaaugccg cacaccaccc ucccuggggg cagggucagg 3120cuacaccugc cuuaggucgg gggcugcagc agccugagag cucucaguag ggccucagua 3180gccugggagg gagcaggggc agggggcagg gaaagaggcg uaauggggcu guccagaggg 3240gccugggaaa ccuggucccu gaggccuggg cacagcuaca aucacuucaa auuggcugug 3300gggccagugg acugggaagg aaaaaagcaa uaagagugac caagugcaga aggcugucag 3360gucccagguc acaugccuua gugcagugac uccucaucau uuuauggggu gugggugucg 3420uugguacacc cauuuuacag augaggacac cgaggcccag aaaaguuaag uuacaugucc 3480uaagucacac agcuuguaag ugccagaacu gagaucaaaa ccaagucucu uugacuuuaa 3540agucuguacu cugaccccaa agagauccug uuuggccacu uauaggaggu cccuaaagcu 3600gcagacuccc cuugccggca cccacauaua gagacauuaa cccuuccccu gcagggucac 3660cucaaauagu cuuuuagcug ggcuucuccu gcaauuccac cuaaugccau ccccuggguu 3720uugcccaaac cugaacuggg caguggggug agaggagggg uuuacagggu uacagagccu 3780cauacagaua ggagcccaug gcugcugguc aucugcauuc cugcaggauu ggcuguuccu 3840ugggguccuu ggcaggaaaa ugaggauugc uccgaggccu gcuccaguac uucccagagg 3900cuggccuggu guggggcucu gggaaggcug aggcuggaga agcguaagua ggagggcaga 3960gauggcacuc agguagcuug aaucaccagg acccuuccaa gccccacagg uucugaggga 4020guacuagggc cagcucuggg agaggucucu uccuaugcug ugaacccccu gccuuucuug 4080cagccuacaa cgaauaaauu uucuuugcaa agg 4113363936RNAHomo sapiens 36ggagccgaga ggagcccagc gccggcccag cggacuccag cucgacggag cggccgcgcc 60ccgaccaguu acuccccugc uggugccucc cuccuuggcg cgcuucccac cugcgaucgg 120cgcccucuuc gcagucacga acucgccagc agcuagcagc acugacuagu aggagggccc 180gccggaggag aggacaugcu cuggagucag aagacagcga aaagagaagc agaagccccg 240guggcaagag ucugaaggaa ggaugacugu agccugugga uuguacugca guaggaaacu 300guccuagcaa ggcuccacuu ugccccagcu ucaagcugga aaggaggaga acaugaaaca 360uugcuugaag acaauggccg agacagcagg ucccacccug cacagccacc agcaucucuc 420cccucagccc ugucuccucu ucugcaguug ggaucugcac auuuaagccu gaaauugucc 480ugugaaguga aguaugaucg gacagccucu uuucagcuuu uaugacaaug gagacagagg 540aauuguggcu cuugccaagg ucacaggauu ggaauacaga gccaagccac cccaggacau 600gcaagagccu cagaagggaa aaaagcccag caggaaggga gaacaaguag ccucuguccu 660gaaguuguaa cagccagggg ccaggaugga ggaggaggac cccauaaucu gcccaucugg 720gacuuggcag gggaccuggg aaaauguacc ccaacccauc ccuuaagggc cuuugucuuu 780ggcccauugg ccuagcaucu acuucuucac cgugucuguu cuugucacac cuagucaggu 840cuguuugggu cugaggugca uggaacauuc uggguaggcc uccagcaaac ggaagcucuu 900caccguguuu ccagccuggg accaagggca gcauacuggc aaaguugcca aagcaaggga 960cuccagccuc uuaggaguua augacucccu cuccccagcu guccuccccu uggugcuccu 1020cuuccucccu ccuccugcuc acagcaggca gggccuagac ccgggagcca ugcugcugug 1080cuguugccag gggagcacgg aggcagaucu gagcuaugca gggaaaaggc ccagccuguc 1140aaagugucug agaugaaccg ccgccguccc ugugcagcug ggcucagacg ugucucagcu 1200cuuguucugu gccugagaau ggcgaaaccc agugagguuc aagggcaaac ucgcuauuca 1260uuagucaggg guucuugacg ucccgucucu cccagggaug aguucccccc uccucuuucu 1320cccccuccua ugacacauuc cugggugccu uuggugagga cugcacaccc uccuccugcc 1380uagcccccuc uccaaaggcc ccugaauaaa cuccccccaa ggagaccagg cagggcagag 1440acaauggcug caggaaauca uucaggcggg acaugcuggc cugcccucca cccagucccc 1500cugugggccc cacucccuuc ugauucaggg cacccuuggg cccccagccu auacaggccu 1560ggacaggaag aaaccacugg gaaccacccu aaggacaaca ugcuagucca gugccauucu 1620ucgcuggcuc ugugggugcc uuuguggccu guaccgacug gcuggcuaau uuugugguuu 1680cuguaccauc acaugccuau uuuaagacac ucuccagcac ugucgguuag ggaguguaaa 1740uuuugcaaua uuuucugaaa uguggcaaua ucaaaaugua aaaggcacac auacuugguc 1800acaaacaaau ggcacuauuu acucuguggg cauauuugua aaaguugcca aagaauuaua 1860uacaaggaug uucaucagag cauuucuuuu gaagaguaaa gaaauggaca ugaaccugug 1920guccguucau acgguggaau accuaugcag cuguaaaaau caguguggua gaucuccgua 1980uaugaguuga uguggaaggu uggccaguuc acaugauaag gugaauagaa uaaguuacag 2040aacaggcugu agaguaugau cuuauuugua gauguuuaaa acugagucau aaguaugcuu 2100auauacagau cguuucugga aguauguacu ggaagucuac cucuggggag uggggauggg 2160ggagugcacu cuucuauacu guuauauuuu cuuuucaugc uccuaaggua cuuuuauugg 2220aagauguaaa gcgguucaau guaauaggcu uaacuucugu caacuaaguu ggcgugggug 2280cuuuaagagg gugguaguga uguugcugga gaaaguaucc cacagucacu gguggcuuca 2340gccacgggcc auuuuggggc cuaauaauca cauaucauca ugguugcuag uguuaaucga 2400aaaccuacua agugccaggc uuacugucuc ugggucuugc uuacguggau gucauuuuuc 2460caguugcacc aaaucgaaag agguuaauug guuuguugga guuccuuugu aggugaaggg 2520cagagccagg agcuuggcua gggacagggg aggugagugg gggauggugg auaggucuug 2580gcucccaguu uccuucuggg cagacauugc cccucugccc ugaggaccug cuuguuuggg 2640ggaagaggcc uuuagaggca ccagggucau gccagguguu ggacauggug aacugggaag 2700ugcucccauc uggccacagc gcagaaguau caccgugcug ggggaugggg aacagggcug 2760ugaaugggcc uauuugcaua agcagcaugu gucuggagag aaagacauca cagagcagaa 2820gagugcgggu gcccaggagu gcacuugcca ccccuacuuc aucccugaaa gaguaaaugg 2880ccuggaaggu gucucugaga gguaaugccg cacaccaccc ucccuggggg cagggucagg 2940cuacaccugc cuuaggucgg gggcugcagc agccugagag cucucaguag ggccucagua 3000gccugggagg gagcaggggc agggggcagg gaaagaggcg uaauggggcu guccagaggg 3060gccugggaaa ccuggucccu gaggccuggg cacagcuaca aucacuucaa auuggcugug 3120gggccagugg acugggaagg aaaaaagcaa uaagagugac caagugcaga aggcugucag 3180gucccagguc acaugccuua gugcagugac uccucaucau uuuauggggu gugggugucg 3240uugguacacc cauuuuacag augaggacac cgaggcccag aaaaguuaag uuacaugucc 3300uaagucacac agcuuguaag ugccagaacu gagaucaaaa ccaagucucu uugacuuuaa 3360agucuguacu cugaccccaa agagauccug uuuggccacu uauaggaggu cccuaaagcu 3420gcagacuccc cuugccggca cccacauaua gagacauuaa cccuuccccu gcagggucac 3480cucaaauagu cuuuuagcug ggcuucuccu gcaauuccac cuaaugccau ccccuggguu 3540uugcccaaac cugaacuggg caguggggug agaggagggg uuuacagggu uacagagccu 3600cauacagaua ggagcccaug gcugcugguc aucugcauuc cugcaggauu ggcuguuccu 3660ugggguccuu ggcaggaaaa ugaggauugc uccgaggccu gcuccaguac uucccagagg 3720cuggccuggu guggggcucu gggaaggcug aggcuggaga agcguaagua ggagggcaga 3780gauggcacuc agguagcuug aaucaccagg acccuuccaa gccccacagg uucugaggga 3840guacuagggc cagcucuggg agaggucucu uccuaugcug ugaacccccu gccuuucuug 3900cagccuacaa cgaauaaauu uucuuugcaa aggcuu 3936374026RNAHomo sapiens 37gggggucccg gccccacaca gugcuagggu cccucucgag uuucucaucu gccuucaggu 60cacuuuccac ccugaugccu uggcuugucc ugaagcucag ggccccugua gcuugggaaa 120ccucccaagc uccccagcga guggcuguag accaaggaag ggacccugcc cggcuucagg 180gaagaaagga agaaaguuac uccccugcug gugccucccu ccuuggcgcg cuucccaccu 240gcgaucggcg cccucuucgc agucacgaac ucgccagcag cuagcagcac ugacuaguag 300gagggcccgc cggaggagag gacaugcucu ggagucagaa gacagcgaaa agagaagcag 360aagccccggu ggcaagaguc ugaaggaagg augacuguag ccuguggauu guacugcagu 420aggaaacugu ccuagcaagg cuccacuuug ccccagcuuc aagcuggaaa ggaggagaac 480augaaacauu gcuugaagac aauggccgag acagcagguc ccacccugca cagccaccag 540caucucuccc cucagcccug ucuccucuuc ugcaguuggg

aucugcacau uuaagccuga 600aauuguccug ugaagugaag uaugaucgga cagccucuuu ucagcuuuua ugacaaugga 660gacagaggaa uuguggcucu ugccaagguc acaggauugg aauacagagc caagccaccc 720caggacaugc aagagccuca gaagggaaaa aagcccagca ggaagggaga acaaguagcc 780ucuguccuga aguuguaaca gccaggggcc aggauggagg aggaggaccc cauaaucugc 840ccaucuggga cuuggcaggg gaccugggaa aauguacccc aacccauccc uuaagggccu 900uugucuuugg cccauuggcc uagcaucuac uucuucaccg ugucuguucu ugucacaccu 960agucaggucu guuugggucu gaggugcaug gaacauucug gguaggccuc cagcaaacgg 1020aagcucuuca ccguguuucc agccugggac caagggcagc auacuggcaa aguugccaaa 1080gcaagggacu ccagccucuu aggaguuaau gacucccucu ccccagcugu ccuccccuug 1140gugcuccucu uccucccucc uccugcucac agcaggcagg gccuagaccc gggagccaug 1200cugcugugcu guugccaggg gagcacggag gcagaucuga gcuaugcagg gaaaaggccc 1260agccugucaa agugucugag augaaccgcc gccgucccug ugcagcuggg cucagacgug 1320ucucagcucu uguucugugc cugagaaugg cgaaacccag ugagguucaa gggcaaacuc 1380gcuauucauu agucaggggu ucuugacguc ccgucucucc cagggaugag uuccccccuc 1440cucuuucucc cccuccuaug acacauuccu gggugccuuu ggugaggacu gcacacccuc 1500cuccugccua gcccccucuc caaaggcccc ugaauaaacu ccccccaagg agaccaggca 1560gggcagagac aauggcugca ggaaaucauu caggcgggac augcuggccu gcccuccacc 1620cagucccccu gugggcccca cucccuucug auucagggca cccuugggcc cccagccuau 1680acaggccugg acaggaagaa accacuggga accacccuaa ggacaacaug cuaguccagu 1740gccauucuuc gcuggcucug ugggugccuu uguggccugu accgacuggc uggcuaauuu 1800ugugguuucu guaccaucac augccuauuu uaagacacuc uccagcacug ucgguuaggg 1860aguguaaauu uugcaauauu uucugaaaug uggcaauauc aaaauguaaa aggcacacau 1920acuuggucac aaacaaaugg cacuauuuac ucugugggca uauuuguaaa aguugccaaa 1980gaauuauaua caaggauguu caucagagca uuucuuuuga agaguaaaga aauggacaug 2040aaccuguggu ccguucauac gguggaauac cuaugcagcu guaaaaauca gugugguaga 2100ucuccguaua ugaguugaug uggaagguug gccaguucac augauaaggu gaauagaaua 2160aguuacagaa caggcuguag aguaugaucu uauuuguaga uguuuaaaac ugagucauaa 2220guaugcuuau auacagaucg uuucuggaag uauguacugg aagucuaccu cuggggagug 2280gggauggggg agugcacucu ucuauacugu uauauuuucu uuucaugcuc cuaagguacu 2340uuuauuggaa gauguaaagc gguucaaugu aauaggcuua acuucuguca acuaaguugg 2400cgugggugcu uuaagagggu gguagugaug uugcuggaga aaguauccca cagucacugg 2460uggcuucagc cacgggccau uuuggggccu aauaaucaca uaucaucaug guugcuagug 2520uuaaucgaaa accuacuaag ugccaggcuu acugucucug ggucuugcuu acguggaugu 2580cauuuuucca guugcaccaa aucgaaagag guuaauuggu uuguuggagu uccuuuguag 2640gugaagggca gagccaggag cuuggcuagg gacaggggag gugagugggg gaugguggau 2700aggucuuggc ucccaguuuc cuucugggca gacauugccc cucugcccug aggaccugcu 2760uguuuggggg aagaggccuu uagaggcacc agggucaugc cagguguugg acauggugaa 2820cugggaagug cucccaucug gccacagcgc agaaguauca ccgugcuggg ggauggggaa 2880cagggcugug aaugggccua uuugcauaag cagcaugugu cuggagagaa agacaucaca 2940gagcagaaga gugcgggugc ccaggagugc acuugccacc ccuacuucau cccugaaaga 3000guaaauggcc uggaaggugu cucugagagg uaaugccgca caccacccuc ccugggggca 3060gggucaggcu acaccugccu uaggucgggg gcugcagcag ccugagagcu cucaguaggg 3120ccucaguagc cugggaggga gcaggggcag ggggcaggga aagaggcgua auggggcugu 3180ccagaggggc cugggaaacc uggucccuga ggccugggca cagcuacaau cacuucaaau 3240uggcuguggg gccaguggac ugggaaggaa aaaagcaaua agagugacca agugcagaag 3300gcugucaggu cccaggucac augccuuagu gcagugacuc cucaucauuu uauggggugu 3360gggugucguu gguacaccca uuuuacagau gaggacaccg aggcccagaa aaguuaaguu 3420acauguccua agucacacag cuuguaagug ccagaacuga gaucaaaacc aagucucuuu 3480gacuuuaaag ucuguacucu gaccccaaag agauccuguu uggccacuua uaggaggucc 3540cuaaagcugc agacuccccu ugccggcacc cacauauaga gacauuaacc cuuccccugc 3600agggucaccu caaauagucu uuuagcuggg cuucuccugc aauuccaccu aaugccaucc 3660ccuggguuuu gcccaaaccu gaacugggca guggggugag aggagggguu uacaggguua 3720cagagccuca uacagauagg agcccauggc ugcuggucau cugcauuccu gcaggauugg 3780cuguuccuug ggguccuugg caggaaaaug aggauugcuc cgaggccugc uccaguacuu 3840cccagaggcu ggccuggugu ggggcucugg gaaggcugag gcuggagaag cguaaguagg 3900agggcagaga uggcacucag guagcuugaa ucaccaggac ccuuccaagc cccacagguu 3960cugagggagu acuagggcca gcucugggag aggucucuuc cuaugcugug aacccccugc 4020cuuucu 4026384334RNAHomo sapiens 38ugaggcgcca ccggugccca gcaaccuccc caggcugugg uugugaccug aggacgcgug 60uguccccgcc cucaggccac cgcuacgcga cccugagugc accuucaaga aggccgggca 120cguuucuggg cgggcguggg gggugccuga uaucuccgcu cuauuuuaca guuacucccc 180ugcuggugcc ucccuccuug gcgcgcuucc caccugcgau cggcgcccuc uucgcaguca 240cgaacucgcc agcagcuagc agcacugacu aguaggaggg cccgccggag gagaggaagc 300cccagagaga uuggugaggg ugauuuccca ggaagacgca gugugcucug acuucuguga 360cagugagcaa cgggaccagu ggauguccag augcuggcaa ugaguaggcc uucccuacgc 420uggguggcgu ccacacccuc cggcuuccau ugccuggguc uccuggaggu gguuugcugg 480augaauaccg caugcacaga ggcuggccuu ggguuugaau auggcagcca guggacagca 540ugugcuucag uuaugagacu gcccaggaga ugcuucuucc aaggcagagc acgugcagag 600uccagugcug gagaggccgg gugcgcaguu gacccauuuc caguucuguu uucccucuca 660uguuccucug uccccaucua ggacaugcuc uggagucaga agacagcgaa aagagaagca 720gaagccccgg uggcaagagu cugaagcugg aaaggaggag aacaugaaac auugcuugaa 780gacaauggcc gagacagcag gucccacccu gcacagccac cagcaucucu ccccucagcc 840cugucuccuc uucugcaguu gggaucugca cauuuaagcc ugaaauuguc cugugaagug 900aaguaugauc ggacagccuc uuuucagcuu uuaugacaau ggagacagag gaauuguggc 960ucuugccaag gucacaggau uggaauacag agccaagcca ccccaggaca ugcaagagcc 1020ucagaaggga aaaaagccca gcaggaaggg agaacaagua gccucugucc ugaaguugua 1080acagccaggg gccaggaugg aggaggagga ccccauaauc ugcccaucug ggacuuggca 1140ggggaccugg gaaaauguac cccaacccau cccuuaaggg ccuuugucuu uggcccauug 1200gccuagcauc uacuucuuca ccgugucugu ucuugucaca ccuagucagg ucuguuuggg 1260ucugaggugc auggaacauu cuggguaggc cuccagcaaa cggaagcucu ucaccguguu 1320uccagccugg gaccaagggc agcauacugg caaaguugcc aaagcaaggg acuccagccu 1380cuuaggaguu aaugacuccc ucuccccagc uguccucccc uuggugcucc ucuuccuccc 1440uccuccugcu cacagcaggc agggccuaga cccgggagcc augcugcugu gcuguugcca 1500ggggagcacg gaggcagauc ugagcuaugc agggaaaagg cccagccugu caaagugucu 1560gagaugaacc gccgccgucc cugugcagcu gggcucagac gugucucagc ucuuguucug 1620ugccugagaa uggcgaaacc cagugagguu caagggcaaa cucgcuauuc auuagucagg 1680gguucuugac gucccgucuc ucccagggau gaguuccccc cuccucuuuc ucccccuccu 1740augacacauu ccugggugcc uuuggugagg acugcacacc cuccuccugc cuagcccccu 1800cuccaaaggc cccugaauaa acucccccca aggagaccag gcagggcaga gacaauggcu 1860gcaggaaauc auucaggcgg gacaugcugg ccugcccucc acccaguccc ccugugggcc 1920ccacucccuu cugauucagg gcacccuugg gcccccagcc uauacaggcc uggacaggaa 1980gaaaccacug ggaaccaccc uaaggacaac augcuagucc agugccauuc uucgcuggcu 2040cugugggugc cuuuguggcc uguaccgacu ggcuggcuaa uuuugugguu ucuguaccau 2100cacaugccua uuuuaagaca cucuccagca cugucgguua gggaguguaa auuuugcaau 2160auuuucugaa auguggcaau aucaaaaugu aaaaggcaca cauacuuggu cacaaacaaa 2220uggcacuauu uacucugugg gcauauuugu aaaaguugcc aaagaauuau auacaaggau 2280guucaucaga gcauuucuuu ugaagaguaa agaaauggac augaaccugu gguccguuca 2340uacgguggaa uaccuaugca gcuguaaaaa ucaguguggu agaucuccgu auaugaguug 2400auguggaagg uuggccaguu cacaugauaa ggugaauaga auaaguuaca gaacaggcug 2460uagaguauga ucuuauuugu agauguuuaa aacugaguca uaaguaugcu uauauacaga 2520ucguuucugg aaguauguac uggaagucua ccucugggga guggggaugg gggagugcac 2580ucuucuauac uguuauauuu ucuuuucaug cuccuaaggu acuuuuauug gaagauguaa 2640agcgguucaa uguaauaggc uuaacuucug ucaacuaagu uggcgugggu gcuuuaagag 2700ggugguagug auguugcugg agaaaguauc ccacagucac ugguggcuuc agccacgggc 2760cauuuugggg ccuaauaauc acauaucauc augguugcua guguuaaucg aaaaccuacu 2820aagugccagg cuuacugucu cugggucuug cuuacgugga ugucauuuuu ccaguugcac 2880caaaucgaaa gagguuaauu gguuuguugg aguuccuuug uaggugaagg gcagagccag 2940gagcuuggcu agggacaggg gaggugagug ggggauggug gauaggucuu ggcucccagu 3000uuccuucugg gcagacauug ccccucugcc cugaggaccu gcuuguuugg gggaagaggc 3060cuuuagaggc accaggguca ugccaggugu uggacauggu gaacugggaa gugcucccau 3120cuggccacag cgcagaagua ucaccgugcu gggggauggg gaacagggcu gugaaugggc 3180cuauuugcau aagcagcaug ugucuggaga gaaagacauc acagagcaga agagugcggg 3240ugcccaggag ugcacuugcc accccuacuu caucccugaa agaguaaaug gccuggaagg 3300ugucucugag agguaaugcc gcacaccacc cucccugggg gcagggucag gcuacaccug 3360ccuuaggucg ggggcugcag cagccugaga gcucucagua gggccucagu agccugggag 3420ggagcagggg cagggggcag ggaaagaggc guaauggggc uguccagagg ggccugggaa 3480accugguccc ugaggccugg gcacagcuac aaucacuuca aauuggcugu ggggccagug 3540gacugggaag gaaaaaagca auaagaguga ccaagugcag aaggcuguca ggucccaggu 3600cacaugccuu agugcaguga cuccucauca uuuuaugggg uguggguguc guugguacac 3660ccauuuuaca gaugaggaca ccgaggccca gaaaaguuaa guuacauguc cuaagucaca 3720cagcuuguaa gugccagaac ugagaucaaa accaagucuc uuugacuuua aagucuguac 3780ucugacccca aagagauccu guuuggccac uuauaggagg ucccuaaagc ugcagacucc 3840ccuugccggc acccacauau agagacauua acccuucccc ugcaggguca ccucaaauag 3900ucuuuuagcu gggcuucucc ugcaauucca ccuaaugcca uccccugggu uuugcccaaa 3960ccugaacugg gcaguggggu gagaggaggg guuuacaggg uuacagagcc ucauacagau 4020aggagcccau ggcugcuggu caucugcauu ccugcaggau uggcuguucc uugggguccu 4080uggcaggaaa augaggauug cuccgaggcc ugcuccagua cuucccagag gcuggccugg 4140uguggggcuc ugggaaggcu gaggcuggag aagcguaagu aggagggcag agauggcacu 4200cagguagcuu gaaucaccag gacccuucca agccccacag guucugaggg aguacuaggg 4260ccagcucugg gagaggucuc uuccuaugcu gugaaccccc ugccuuucuu gcagccuaca 4320acgaauaaau uuuc 4334393868RNAHomo sapiens 39uuacuccccu gcuggugccu cccuccuugg cgcgcuuccc accugcgauc ggcgcccucu 60ucgcagucac gaacucgcca gcagcuagca gcacugacua guaggagggc ccgccggagg 120agaggacaug cucuggaguc agaagacagc gaaaagagaa gcagaagccc cgguggcaag 180agucugaagc aggaaggaug acuguagccu guggauugua cugcaguagg aaacuguccu 240agcaaggcuc cacuuugccc cagcuucaag cuggaaagga ggagaacaug aaacauugcu 300ugaagacaau ggccgagaca gcagguccca cccugcacag ccaccagcau cucuccccuc 360agcccugucu ccucuucugc aguugggauc ugcacauuua agccugaaau uguccuguga 420agugaaguau gaucggacag ccucuuuuca gcuuuuauga caauggagac agaggaauug 480uggcucuugc caaggucaca ggauuggaau acagagccaa gccaccccag gacaugcaag 540agccucagaa gggaaaaaag cccagcagga agggagaaca aguagccucu guccugaagu 600uguaacagcc aggggccagg auggaggagg aggaccccau aaucugccca ucugggacuu 660ggcaggggac cugggaaaau guaccccaac ccaucccuua agggccuuug ucuuuggccc 720auuggccuag caucuacuuc uucaccgugu cuguucuugu cacaccuagu caggucuguu 780ugggucugag gugcauggaa cauucugggu aggccuccag caaacggaag cucuucaccg 840uguuuccagc cugggaccaa gggcagcaua cuggcaaagu ugccaaagca agggacucca 900gccucuuagg aguuaaugac ucccucuccc cagcuguccu ccccuuggug cuccucuucc 960ucccuccucc ugcucacagc aggcagggcc uagacccggg agccaugcug cugugcuguu 1020gccaggggag cacggaggca gaucugagcu augcagggaa aaggcccagc cugucaaagu 1080gucugagaug aaccgccgcc gucccugugc agcugggcuc agacgugucu cagcucuugu 1140ucugugccug agaauggcga aacccaguga gguucaaggg caaacucgcu auucauuagu 1200cagggguucu ugacgucccg ucucucccag ggaugaguuc cccccuccuc uuucuccccc 1260uccuaugaca cauuccuggg ugccuuuggu gaggacugca cacccuccuc cugccuagcc 1320cccucuccaa aggccccuga auaaacuccc cccaaggaga ccaggcaggg cagagacaau 1380ggcugcagga aaucauucag gcgggacaug cuggccugcc cuccacccag ucccccugug 1440ggccccacuc ccuucugauu cagggcaccc uugggccccc agccuauaca ggccuggaca 1500ggaagaaacc acugggaacc acccuaagga caacaugcua guccagugcc auucuucgcu 1560ggcucugugg gugccuuugu ggccuguacc gacuggcugg cuaauuuugu gguuucugua 1620ccaucacaug ccuauuuuaa gacacucucc agcacugucg guuagggagu guaaauuuug 1680caauauuuuc ugaaaugugg caauaucaaa auguaaaagg cacacauacu uggucacaaa 1740caaauggcac uauuuacucu gugggcauau uuguaaaagu ugccaaagaa uuauauacaa 1800ggauguucau cagagcauuu cuuuugaaga guaaagaaau ggacaugaac cugugguccg 1860uucauacggu ggaauaccua ugcagcugua aaaaucagug ugguagaucu ccguauauga 1920guugaugugg aagguuggcc aguucacaug auaaggugaa uagaauaagu uacagaacag 1980gcuguagagu augaucuuau uuguagaugu uuaaaacuga gucauaagua ugcuuauaua 2040cagaucguuu cuggaaguau guacuggaag ucuaccucug gggagugggg augggggagu 2100gcacucuucu auacuguuau auuuucuuuu caugcuccua agguacuuuu auuggaagau 2160guaaagcggu ucaauguaau aggcuuaacu ucugucaacu aaguuggcgu gggugcuuua 2220agaggguggu agugauguug cuggagaaag uaucccacag ucacuggugg cuucagccac 2280gggccauuuu ggggccuaau aaucacauau caucaugguu gcuaguguua aucgaaaacc 2340uacuaagugc caggcuuacu gucucugggu cuugcuuacg uggaugucau uuuuccaguu 2400gcaccaaauc gaaagagguu aauugguuug uuggaguucc uuuguaggug aagggcagag 2460ccaggagcuu ggcuagggac aggggaggug agugggggau gguggauagg ucuuggcucc 2520caguuuccuu cugggcagac auugccccuc ugcccugagg accugcuugu uugggggaag 2580aggccuuuag aggcaccagg gucaugccag guguuggaca uggugaacug ggaagugcuc 2640ccaucuggcc acagcgcaga aguaucaccg ugcuggggga uggggaacag ggcugugaau 2700gggccuauuu gcauaagcag caugugucug gagagaaaga caucacagag cagaagagug 2760cgggugccca ggagugcacu ugccaccccu acuucauccc ugaaagagua aauggccugg 2820aaggugucuc ugagagguaa ugccgcacac cacccucccu gggggcaggg ucaggcuaca 2880ccugccuuag gucgggggcu gcagcagccu gagagcucuc aguagggccu caguagccug 2940ggagggagca ggggcagggg gcagggaaag aggcguaaug gggcugucca gaggggccug 3000ggaaaccugg ucccugaggc cugggcacag cuacaaucac uucaaauugg cuguggggcc 3060aguggacugg gaaggaaaaa agcaauaaga gugaccaagu gcagaaggcu gucagguccc 3120aggucacaug ccuuagugca gugacuccuc aucauuuuau gggguguggg ugucguuggu 3180acacccauuu uacagaugag gacaccgagg cccagaaaag uuaaguuaca uguccuaagu 3240cacacagcuu guaagugcca gaacugagau caaaaccaag ucucuuugac uuuaaagucu 3300guacucugac cccaaagaga uccuguuugg ccacuuauag gaggucccua aagcugcaga 3360cuccccuugc cggcacccac auauagagac auuaacccuu ccccugcagg gucaccucaa 3420auagucuuuu agcugggcuu cuccugcaau uccaccuaau gccauccccu ggguuuugcc 3480caaaccugaa cugggcagug gggugagagg agggguuuac aggguuacag agccucauac 3540agauaggagc ccauggcugc uggucaucug cauuccugca ggauuggcug uuccuugggg 3600uccuuggcag gaaaaugagg auugcuccga ggccugcucc aguacuuccc agaggcuggc 3660cugguguggg gcucugggaa ggcugaggcu ggagaagcgu aaguaggagg gcagagaugg 3720cacucaggua gcuugaauca ccaggacccu uccaagcccc acagguucug agggaguacu 3780agggccagcu cugggagagg ucucuuccua ugcugugaac ccccugccuu ucuugcagcc 3840uacaacgaau aaauuuucuu ugcaaagg 3868403978RNAHomo sapiens 40ggucgccgcg ccaagggccc gcugagcccc uccucccauu cguccagccg cgcggcccac 60agaagcggaa cgcgcgucga gagcgcccug uccgcucgcc ccagacagau gcccgguuau 120ucauuaccgc gaggccuaga ggaaagagug gcugccgucu uccugcccac agcccgccgg 180acccuccguc gcggcugccc gguccccgga gccgcagccg ccgagcccgg cugugcgugu 240cguggcugcu ggggagaaag aggcuuccgg acaugcucug gagucagaag acagcgaaaa 300gagaagcaga agccccggug gcaagagucu gaagcaggaa ggaugacugu agccugugga 360uuguacugca guaggaaacu guccuagcaa ggcuccacuu ugccccagcu ucaagcugga 420aaggaggaga acaugaaaca uugcuugaag acaauggccg agacagcagg ucccacccug 480cacagccacc agcaucucuc cccucagccc ugucuccucu ucugcaguug ggaucugcac 540auuuaagccu gaaauugucc ugugaaguga aguaugaucg gacagccucu uuucagcuuu 600uaugacaaug gagacagagg aauuguggcu cuugccaagg ucacaggauu ggaauacaga 660gccaagccac cccaggacau gcaagagccu cagaagggaa aaaagcccag caggaaggga 720gaacaaguag ccucuguccu gaaguuguaa cagccagggg ccaggaugga ggaggaggac 780cccauaaucu gcccaucugg gacuuggcag gggaccuggg aaaauguacc ccaacccauc 840ccuuaagggc cuuugucuuu ggcccauugg ccuagcaucu acuucuucac cgugucuguu 900cuugucacac cuagucaggu cuguuugggu cugaggugca uggaacauuc uggguaggcc 960uccagcaaac ggaagcucuu caccguguuu ccagccuggg accaagggca gcauacuggc 1020aaaguugcca aagcaaggga cuccagccuc uuaggaguua augacucccu cuccccagcu 1080guccuccccu uggugcuccu cuuccucccu ccuccugcuc acagcaggca gggccuagac 1140ccgggagcca ugcugcugug cuguugccag gggagcacgg aggcagaucu gagcuaugca 1200gggaaaaggc ccagccuguc aaagugucug agaugaaccg ccgccguccc ugugcagcug 1260ggcucagacg ugucucagcu cuuguucugu gccugagaau ggcgaaaccc agugagguuc 1320aagggcaaac ucgcuauuca uuagucaggg guucuugacg ucccgucucu cccagggaug 1380aguucccccc uccucuuucu cccccuccua ugacacauuc cugggugccu uuggugagga 1440cugcacaccc uccuccugcc uagcccccuc uccaaaggcc ccugaauaaa cuccccccaa 1500ggagaccagg cagggcagag acaauggcug caggaaauca uucaggcggg acaugcuggc 1560cugcccucca cccagucccc cugugggccc cacucccuuc ugauucaggg cacccuuggg 1620cccccagccu auacaggccu ggacaggaag aaaccacugg gaaccacccu aaggacaaca 1680ugcuagucca gugccauucu ucgcuggcuc ugugggugcc uuuguggccu guaccgacug 1740gcuggcuaau uuugugguuu cuguaccauc acaugccuau uuuaagacac ucuccagcac 1800ugucgguuag ggaguguaaa uuuugcaaua uuuucugaaa uguggcaaua ucaaaaugua 1860aaaggcacac auacuugguc acaaacaaau ggcacuauuu acucuguggg cauauuugua 1920aaaguugcca aagaauuaua uacaaggaug uucaucagag cauuucuuuu gaagaguaaa 1980gaaauggaca ugaaccugug guccguucau acgguggaau accuaugcag cuguaaaaau 2040caguguggua gaucuccgua uaugaguuga uguggaaggu uggccaguuc acaugauaag 2100gugaauagaa uaaguuacag aacaggcugu agaguaugau cuuauuugua gauguuuaaa 2160acugagucau aaguaugcuu auauacagau cguuucugga aguauguacu ggaagucuac 2220cucuggggag uggggauggg ggagugcacu cuucuauacu guuauauuuu cuuuucaugc 2280uccuaaggua cuuuuauugg aagauguaaa gcgguucaau guaauaggcu uaacuucugu 2340caacuaaguu ggcgugggug cuuuaagagg gugguaguga uguugcugga gaaaguaucc 2400cacagucacu gguggcuuca gccacgggcc auuuuggggc cuaauaauca cauaucauca 2460ugguugcuag uguuaaucga aaaccuacua agugccaggc uuacugucuc ugggucuugc 2520uuacguggau gucauuuuuc caguugcacc aaaucgaaag agguuaauug guuuguugga 2580guuccuuugu aggugaaggg cagagccagg agcuuggcua gggacagggg aggugagugg 2640gggauggugg auaggucuug gcucccaguu uccuucuggg cagacauugc cccucugccc 2700ugaggaccug cuuguuuggg ggaagaggcc uuuagaggca ccagggucau gccagguguu 2760ggacauggug aacugggaag ugcucccauc uggccacagc gcagaaguau caccgugcug 2820ggggaugggg aacagggcug ugaaugggcc uauuugcaua agcagcaugu gucuggagag 2880aaagacauca cagagcagaa gagugcgggu gcccaggagu gcacuugcca ccccuacuuc 2940aucccugaaa gaguaaaugg ccuggaaggu gucucugaga gguaaugccg cacaccaccc 3000ucccuggggg cagggucagg cuacaccugc cuuaggucgg gggcugcagc agccugagag 3060cucucaguag ggccucagua gccugggagg gagcaggggc agggggcagg gaaagaggcg 3120uaauggggcu guccagaggg gccugggaaa ccuggucccu gaggccuggg cacagcuaca 3180aucacuucaa auuggcugug gggccagugg acugggaagg

aaaaaagcaa uaagagugac 3240caagugcaga aggcugucag gucccagguc acaugccuua gugcagugac uccucaucau 3300uuuauggggu gugggugucg uugguacacc cauuuuacag augaggacac cgaggcccag 3360aaaaguuaag uuacaugucc uaagucacac agcuuguaag ugccagaacu gagaucaaaa 3420ccaagucucu uugacuuuaa agucuguacu cugaccccaa agagauccug uuuggccacu 3480uauaggaggu cccuaaagcu gcagacuccc cuugccggca cccacauaua gagacauuaa 3540cccuuccccu gcagggucac cucaaauagu cuuuuagcug ggcuucuccu gcaauuccac 3600cuaaugccau ccccuggguu uugcccaaac cugaacuggg caguggggug agaggagggg 3660uuuacagggu uacagagccu cauacagaua ggagcccaug gcugcugguc aucugcauuc 3720cugcaggauu ggcuguuccu ugggguccuu ggcaggaaaa ugaggauugc uccgaggccu 3780gcuccaguac uucccagagg cuggccuggu guggggcucu gggaaggcug aggcuggaga 3840agcguaagua ggagggcaga gauggcacuc agguagcuug aaucaccagg acccuuccaa 3900gccccacagg uucugaggga guacuagggc cagcucuggg agaggucucu uccuaugcug 3960ugaacccccu gccuuucu 3978413837RNAHomo sapiens 41ccaggcgugu gcauuuauau gcagagugac caagaaacuu caguaauacu aguuuguguc 60uuuggagucc cacuuuuugc cagggcuagu gcuaacagcu ucaggagaau ucagccucac 120cuugacagga caugcucugg agucagaaga cagcgaaaag agaagcagaa gccccggugg 180caagagucug aagcaggaag gaugacugua gccuguggau uguacugcag uaggaaacug 240uccuagcaag gcuccacuuu gccccagcuu caagcuggaa aggaggagaa caugaaacau 300ugcuugaaga caauggccga gacagcaggu cccacccugc acagccacca gcaucucucc 360ccucagcccu gucuccucuu cugcaguugg gaucugcaca uuuaagccug aaauuguccu 420gugaagugaa guaugaucgg acagccucuu uucagcuuuu augacaaugg agacagagga 480auuguggcuc uugccaaggu cacaggauug gaauacagag ccaagccacc ccaggacaug 540caagagccuc agaagggaaa aaagcccagc aggaagggag aacaaguagc cucuguccug 600aaguuguaac agccaggggc caggauggag gaggaggacc ccauaaucug cccaucuggg 660acuuggcagg ggaccuggga aaauguaccc caacccaucc cuuaagggcc uuugucuuug 720gcccauuggc cuagcaucua cuucuucacc gugucuguuc uugucacacc uagucagguc 780uguuuggguc ugaggugcau ggaacauucu ggguaggccu ccagcaaacg gaagcucuuc 840accguguuuc cagccuggga ccaagggcag cauacuggca aaguugccaa agcaagggac 900uccagccucu uaggaguuaa ugacucccuc uccccagcug uccuccccuu ggugcuccuc 960uuccucccuc cuccugcuca cagcaggcag ggccuagacc cgggagccau gcugcugugc 1020uguugccagg ggagcacgga ggcagaucug agcuaugcag ggaaaaggcc cagccuguca 1080aagugucuga gaugaaccgc cgccgucccu gugcagcugg gcucagacgu gucucagcuc 1140uuguucugug ccugagaaug gcgaaaccca gugagguuca agggcaaacu cgcuauucau 1200uagucagggg uucuugacgu cccgucucuc ccagggauga guuccccccu ccucuuucuc 1260ccccuccuau gacacauucc ugggugccuu uggugaggac ugcacacccu ccuccugccu 1320agcccccucu ccaaaggccc cugaauaaac uccccccaag gagaccaggc agggcagaga 1380caauggcugc aggaaaucau ucaggcggga caugcuggcc ugcccuccac ccaguccccc 1440ugugggcccc acucccuucu gauucagggc acccuugggc ccccagccua uacaggccug 1500gacaggaaga aaccacuggg aaccacccua aggacaacau gcuaguccag ugccauucuu 1560cgcuggcucu gugggugccu uuguggccug uaccgacugg cuggcuaauu uugugguuuc 1620uguaccauca caugccuauu uuaagacacu cuccagcacu gucgguuagg gaguguaaau 1680uuugcaauau uuucugaaau guggcaauau caaaauguaa aaggcacaca uacuugguca 1740caaacaaaug gcacuauuua cucugugggc auauuuguaa aaguugccaa agaauuauau 1800acaaggaugu ucaucagagc auuucuuuug aagaguaaag aaauggacau gaaccugugg 1860uccguucaua cgguggaaua ccuaugcagc uguaaaaauc agugugguag aucuccguau 1920augaguugau guggaagguu ggccaguuca caugauaagg ugaauagaau aaguuacaga 1980acaggcugua gaguaugauc uuauuuguag auguuuaaaa cugagucaua aguaugcuua 2040uauacagauc guuucuggaa guauguacug gaagucuacc ucuggggagu ggggaugggg 2100gagugcacuc uucuauacug uuauauuuuc uuuucaugcu ccuaagguac uuuuauugga 2160agauguaaag cgguucaaug uaauaggcuu aacuucuguc aacuaaguug gcgugggugc 2220uuuaagaggg ugguagugau guugcuggag aaaguauccc acagucacug guggcuucag 2280ccacgggcca uuuuggggcc uaauaaucac auaucaucau gguugcuagu guuaaucgaa 2340aaccuacuaa gugccaggcu uacugucucu gggucuugcu uacguggaug ucauuuuucc 2400aguugcacca aaucgaaaga gguuaauugg uuuguuggag uuccuuugua ggugaagggc 2460agagccagga gcuuggcuag ggacagggga ggugaguggg ggauggugga uaggucuugg 2520cucccaguuu ccuucugggc agacauugcc ccucugcccu gaggaccugc uuguuugggg 2580gaagaggccu uuagaggcac cagggucaug ccagguguug gacaugguga acugggaagu 2640gcucccaucu ggccacagcg cagaaguauc accgugcugg gggaugggga acagggcugu 2700gaaugggccu auuugcauaa gcagcaugug ucuggagaga aagacaucac agagcagaag 2760agugcgggug cccaggagug cacuugccac cccuacuuca ucccugaaag aguaaauggc 2820cuggaaggug ucucugagag guaaugccgc acaccacccu cccugggggc agggucaggc 2880uacaccugcc uuaggucggg ggcugcagca gccugagagc ucucaguagg gccucaguag 2940ccugggaggg agcaggggca gggggcaggg aaagaggcgu aauggggcug uccagagggg 3000ccugggaaac cuggucccug aggccugggc acagcuacaa ucacuucaaa uuggcugugg 3060ggccagugga cugggaagga aaaaagcaau aagagugacc aagugcagaa ggcugucagg 3120ucccagguca caugccuuag ugcagugacu ccucaucauu uuauggggug ugggugucgu 3180ugguacaccc auuuuacaga ugaggacacc gaggcccaga aaaguuaagu uacauguccu 3240aagucacaca gcuuguaagu gccagaacug agaucaaaac caagucucuu ugacuuuaaa 3300gucuguacuc ugaccccaaa gagauccugu uuggccacuu auaggagguc ccuaaagcug 3360cagacucccc uugccggcac ccacauauag agacauuaac ccuuccccug cagggucacc 3420ucaaauaguc uuuuagcugg gcuucuccug caauuccacc uaaugccauc cccuggguuu 3480ugcccaaacc ugaacugggc agugggguga gaggaggggu uuacaggguu acagagccuc 3540auacagauag gagcccaugg cugcugguca ucugcauucc ugcaggauug gcuguuccuu 3600gggguccuug gcaggaaaau gaggauugcu ccgaggccug cuccaguacu ucccagaggc 3660uggccuggug uggggcucug ggaaggcuga ggcuggagaa gcguaaguag gagggcagag 3720auggcacuca gguagcuuga aucaccagga cccuuccaag ccccacaggu ucugagggag 3780uacuagggcc agcucuggga gaggucucuu ccuaugcugu gaacccccug ccuuucu 383742571DNAHomo sapiens 42gtgcccgccc gagaaggcgg cgctgggagc cgctcagagc ccagagaagc ggcgcgcggc 60caggagcccc cgctccgcca ctgccgtgcc tgcctcccgc agctgtctgc catgcgctcg 120ccggggcagg ggcgcccgga gggcggctag agctgggcct gagcccggga acgcgcctga 180tcaggggtgg cggagccgcg gtccccacag ccgccccacc cgcgccgctg cctcgctggg 240gcccgggccc ccttcccggt ccttactccc ctgctggtgc ctccctcctt ggcgcgcttc 300ccacctgcga tcggcgccct cttcgcagtc acgaactcgc cagcagctag cagcactgac 360tagtaggagg gcccgccgga ggagagccgc gcggcccaca gaagcggaac gcgcgtcgag 420agcgccctgt ccgctcgccc cagacagatg cccggttatt cattaccgcg aggcctagag 480gaaagagtgg ctgccgtctt cctgcccaca gcccgccgga ccctccgtcg cggctgcccg 540gtccccggag ccgcagccgc cgagcccggc t 571434915DNAHomo sapiens 43ccaccacacc cagactaaaa ggcagtttga ttttacaaat caaaatagca gtaatctatg 60gagatttact tgtgagattg gtaggaaaca tcttaaatgt aatcaaacaa taacttacat 120cttgatgaat tcacgtgtag gtttctcttc ctcagaagaa atcagatgct gttcagagca 180cgaaggctag aattttaccc tggttctcat gctaccttgc acccaggttg gatcctgagt 240acagtttttg gcaggtgggc ctgcatataa gttagcaatg ggggataccc agctgcctct 300cttcatacag ctgaggtttt ggggagtcat tcttatagcc cctgggttgg gcctagtcct 360gcaaatgaat tcaccagccc taaagcccaa attgcagcct ctgtcattca ccttccagga 420gtggaaaggg cagtaagttt catcttatta ttattgctat tttggtggtt ttgttgaggt 480tggtgtgtgt atgttagtaa gataaagctc tcagaaatta catagcattt gtcaaggata 540taagagggac tgtgccacat ctggctgtat agaaggtggt tccatatctt taaatagagc 600cccaggtcct tagccaccag aaaggttttc aggggaagtg tgcaccctca gcagctgctg 660ctggtgggca ggatgggcac gcatggaaca ggctttcctc tgtggccagg tgagaagcag 720gtggtgagac acagagcagt gctgggctct gcttctgaag cctccaacct ttccttccct 780aggaagcccc agagagattg gtgagggtga tttcccagga agacgcagtg tgctctgact 840tctgtgacag tgagcaacgg gaccagtgga tgtccagatg ctggcaatga gtaggccttc 900cctacgctgg gtggcgtcca caccctccgg cttccattgc ctgggtctcc tggaggtggt 960ttgctggatg aataccgcat gcacagaggc tggccttggg tttgaatatg gcagccagtg 1020gacagcatgt gcttcagtta tgagactgcc caggagatgc ttcttccaag gcagagcacg 1080tgcagagtcc agtgctggag aggccgggtg cgcagttgac ccatttccag ttctgttttc 1140cctctcatgt tcctctgtcc ccatctagga catgctctgg agtcagaaga cagcgaaaag 1200agaagcagaa gccccggtgg caagagtctg aagcaggaag gatgactgta gcctgtggat 1260tgtactgcag taggaaactg tcctagcaag gctccacttt gccccagctt caagctggaa 1320aggaggagaa catgaaacat tgcttgaaga caatggccga gacagcaggt cccaccctgc 1380acagccacca gcatctctcc cctcagccct gtctcctctt ctgcagttgg gatctgcaca 1440tttaagcctg aaattgtcct gtgaagtgaa gtatgatcgg acagcctctt ttcagctttt 1500atgacaatgg agacagagga attgtggctc ttgccaaggt cacaggattg gaatacagag 1560ccaagccacc ccaggacatg caagagcctc agaagggaaa aaagcccagc aggaagggag 1620aacaagtagc ctctgtcctg aagttgtaac agccaggggc caggatggag gaggaggacc 1680ccataatctg cccatctggg acttggcagg ggacctggga aaatgtaccc caacccatcc 1740cttaagggcc tttgtctttg gcccattggc ctagcatcta cttcttcacc gtgtctgttc 1800ttgtcacacc tagtcaggtc tgtttgggtc tgaggtgcat ggaacattct gggtaggcct 1860ccagcaaacg gaagctcttc accgtgtttc cagcctggga ccaagggcag catactggca 1920aagttgccaa agcaagggac tccagcctct taggagttaa tgactccctc tccccagctg 1980tcctcccctt ggtgctcctc ttcctccctc ctcctgctca cagcaggcag ggcctagacc 2040cgggagccat gctgctgtgc tgttgccagg ggagcacgga ggcagatctg agctatgcag 2100ggaaaaggcc cagcctgtca aagtgtctga gatgaaccgc cgccgtccct gtgcagctgg 2160gctcagacgt gtctcagctc ttgttctgtg cctgagaatg gcgaaaccca gtgaggttca 2220agggcaaact cgctattcat tagtcagggg ttcttgacgt cccgtctctc ccagggatga 2280gttcccccct cctctttctc cccctcctat gacacattcc tgggtgcctt tggtgaggac 2340tgcacaccct cctcctgcct agccccctct ccaaaggccc ctgaataaac tccccccaag 2400gagaccaggc agggcagaga caatggctgc aggaaatcat tcaggcggga catgctggcc 2460tgccctccac ccagtccccc tgtgggcccc actcccttct gattcagggc acccttgggc 2520ccccagccta tacaggcctg gacaggaaga aaccactggg aaccacccta aggacaacat 2580gctagtccag tgccattctt cgctggctct gtgggtgcct ttgtggcctg taccgactgg 2640ctggctaatt ttgtggtttc tgtaccatca catgcctatt ttaagacact ctccagcact 2700gtcggttagg gagtgtaaat tttgcaatat tttctgaaat gtggcaatat caaaatgtaa 2760aaggcacaca tacttggtca caaacaaatg gcactattta ctctgtgggc atatttgtaa 2820aagttgccaa agaattatat acaaggatgt tcatcagagc atttcttttg aagagtaaag 2880aaatggacat gaacctgtgg tccgttcata cggtggaata cctatgcagc tgtaaaaatc 2940agtgtggtag atctccgtat atgagttgat gtggaaggtt ggccagttca catgataagg 3000tgaatagaat aagttacaga acaggctgta gagtatgatc ttatttgtag atgtttaaaa 3060ctgagtcata agtatgctta tatacagatc gtttctggaa gtatgtactg gaagtctacc 3120tctggggagt ggggatgggg gagtgcactc ttctatactg ttatattttc ttttcatgct 3180cctaaggtac ttttattgga agatgtaaag cggttcaatg taataggctt aacttctgtc 3240aactaagttg gcgtgggtgc tttaagaggg tggtagtgat gttgctggag aaagtatccc 3300acagtcactg gtggcttcag ccacgggcca ttttggggcc taataatcac atatcatcat 3360ggttgctagt gttaatcgaa aacctactaa gtgccaggct tactgtctct gggtcttgct 3420tacgtggatg tcatttttcc agttgcacca aatcgaaaga ggttaattgg tttgttggag 3480ttcctttgta ggtgaagggc agagccagga gcttggctag ggacagggga ggtgagtggg 3540ggatggtgga taggtcttgg ctcccagttt ccttctgggc agacattgcc cctctgccct 3600gaggacctgc ttgtttgggg gaagaggcct ttagaggcac cagggtcatg ccaggtgttg 3660gacatggtga actgggaagt gctcccatct ggccacagcg cagaagtatc accgtgctgg 3720gggatgggga acagggctgt gaatgggcct atttgcataa gcagcatgtg tctggagaga 3780aagacatcac agagcagaag agtgcgggtg cccaggagtg cacttgccac ccctacttca 3840tccctgaaag agtaaatggc ctggaaggtg tctctgagag gtaatgccgc acaccaccct 3900ccctgggggc agggtcaggc tacacctgcc ttaggtcggg ggctgcagca gcctgagagc 3960tctcagtagg gcctcagtag cctgggaggg agcaggggca gggggcaggg aaagaggcgt 4020aatggggctg tccagagggg cctgggaaac ctggtccctg aggcctgggc acagctacaa 4080tcacttcaaa ttggctgtgg ggccagtgga ctgggaagga aaaaagcaat aagagtgacc 4140aagtgcagaa ggctgtcagg tcccaggtca catgccttag tgcagtgact cctcatcatt 4200ttatggggtg tgggtgtcgt tggtacaccc attttacaga tgaggacacc gaggcccaga 4260aaagttaagt tacatgtcct aagtcacaca gcttgtaagt gccagaactg agatcaaaac 4320caagtctctt tgactttaaa gtctgtactc tgaccccaaa gagatcctgt ttggccactt 4380ataggaggtc cctaaagctg cagactcccc ttgccggcac ccacatatag agacattaac 4440ccttcccctg cagggtcacc tcaaatagtc ttttagctgg gcttctcctg caattccacc 4500taatgccatc ccctgggttt tgcccaaacc tgaactgggc agtggggtga gaggaggggt 4560ttacagggtt acagagcctc atacagatag gagcccatgg ctgctggtca tctgcattcc 4620tgcaggattg gctgttcctt ggggtccttg gcaggaaaat gaggattgct ccgaggcctg 4680ctccagtact tcccagaggc tggcctggtg tggggctctg ggaaggctga ggctggagaa 4740gcgtaagtag gagggcagag atggcactca ggtagcttga atcaccagga cccttccaag 4800ccccacaggt tctgagggag tactagggcc agctctggga gaggtctctt cctatgctgt 4860gaaccccctg cctttcttgc agcctacaac gaataaattt tctttgcaaa ggctt 4915444687DNAHomo sapiens 44uuccucagaa gaaaucagau gcuguucaga gcacgaaggc uagaauuuua cccugguucu 60caugcuaccu ugcacccagg uuggauccug aguacaguuu uuggcaggug ggccugcaua 120uaaguuagca augggggaua cccagcugcc ucucuucaua cagcugaggu uuuggggagu 180cauucuuaua gccccugggu ugggccuagu ccugcaaaug aauucaccag cccuaaagcc 240caaauugcag ccucugucau ucaccuucca ggaguggaaa gggcaguaag uuucaucuua 300uuauuauugc uauuuuggug guuuuguuga gguuggugug uguauguuag uaagauaaag 360cucucagaaa uuacauagca uuugucaagg auauaagagg gacugugcca caucuggcug 420uauagaaggu gguuccauau cuuuaaauag agccccaggu ccuuagccac cagaaagguu 480uucaggggaa gugugcaccc ucagcagcug cugcuggugg gcaggauggg cacgcaugga 540acaggcuuuc cucuguggcc aggugagaag caggugguga gacacagagc agugcugggc 600ucugcuucug aagccuccaa ccuuuccuuc ccuaggaagc cccagagaga uuggugaggg 660ugauuuccca ggaagacgca gugugcucug acuucuguga cagugagcaa cgggaccagu 720ggauguccag augcuggcaa ugaguaggcc uucccuacgc uggguggcgu ccacacccuc 780cggcuuccau ugccuggguc uccuggaggu gguuugcugg augaauaccg caugcacaga 840ggcuggccuu ggguuugaau auggcagcca guggacagca ugugcuucag uuaugagacu 900gcccaggaga ugcuucuucc aaggcagagc acgugcagag uccagugcug gagaggccgg 960gugcgcaguu gacccauuuc caguucuguu uucccucuca uguuccucug uccccaucua 1020ggacaugcuc uggagucaga agacagcgaa aagagaagca gaagccccgg uggcaagagu 1080cugaagcugg aaaggaggag aacaugaaac auugcuugaa gacaauggcc gagacagcag 1140gucccacccu gcacagccac cagcaucucu ccccucagcc cugucuccuc uucugcaguu 1200gggaucugca cauuuaagcc ugaaauuguc cugugaagug aaguaugauc ggacagccuc 1260uuuucagcuu uuaugacaau ggagacagag gaauuguggc ucuugccaag gucacaggau 1320uggaauacag agccaagcca ccccaggaca ugcaagagcc ucagaaggga aaaaagccca 1380gcaggaaggg agaacaagua gccucugucc ugaaguugua acagccaggg gccaggaugg 1440aggaggagga ccccauaauc ugcccaucug ggacuuggca ggggaccugg gaaaauguac 1500cccaacccau cccuuaaggg ccuuugucuu uggcccauug gccuagcauc uacuucuuca 1560ccgugucugu ucuugucaca ccuagucagg ucuguuuggg ucugaggugc auggaacauu 1620cuggguaggc cuccagcaaa cggaagcucu ucaccguguu uccagccugg gaccaagggc 1680agcauacugg caaaguugcc aaagcaaggg acuccagccu cuuaggaguu aaugacuccc 1740ucuccccagc uguccucccc uuggugcucc ucuuccuccc uccuccugcu cacagcaggc 1800agggccuaga cccgggagcc augcugcugu gcuguugcca ggggagcacg gaggcagauc 1860ugagcuaugc agggaaaagg cccagccugu caaagugucu gagaugaacc gccgccgucc 1920cugugcagcu gggcucagac gugucucagc ucuuguucug ugccugagaa uggcgaaacc 1980cagugagguu caagggcaaa cucgcuauuc auuagucagg gguucuugac gucccgucuc 2040ucccagggau gaguuccccc cuccucuuuc ucccccuccu augacacauu ccugggugcc 2100uuuggugagg acugcacacc cuccuccugc cuagcccccu cuccaaaggc cccugaauaa 2160acucccccca aggagaccag gcagggcaga gacaauggcu gcaggaaauc auucaggcgg 2220gacaugcugg ccugcccucc acccaguccc ccugugggcc ccacucccuu cugauucagg 2280gcacccuugg gcccccagcc uauacaggcc uggacaggaa gaaaccacug ggaaccaccc 2340uaaggacaac augcuagucc agugccauuc uucgcuggcu cugugggugc cuuuguggcc 2400uguaccgacu ggcuggcuaa uuuugugguu ucuguaccau cacaugccua uuuuaagaca 2460cucuccagca cugucgguua gggaguguaa auuuugcaau auuuucugaa auguggcaau 2520aucaaaaugu aaaaggcaca cauacuuggu cacaaacaaa uggcacuauu uacucugugg 2580gcauauuugu aaaaguugcc aaagaauuau auacaaggau guucaucaga gcauuucuuu 2640ugaagaguaa agaaauggac augaaccugu gguccguuca uacgguggaa uaccuaugca 2700gcuguaaaaa ucaguguggu agaucuccgu auaugaguug auguggaagg uuggccaguu 2760cacaugauaa ggugaauaga auaaguuaca gaacaggcug uagaguauga ucuuauuugu 2820agauguuuaa aacugaguca uaaguaugcu uauauacaga ucguuucugg aaguauguac 2880uggaagucua ccucugggga guggggaugg gggagugcac ucuucuauac uguuauauuu 2940ucuuuucaug cuccuaaggu acuuuuauug gaagauguaa agcgguucaa uguaauaggc 3000uuaacuucug ucaacuaagu uggcgugggu gcuuuaagag ggugguagug auguugcugg 3060agaaaguauc ccacagucac ugguggcuuc agccacgggc cauuuugggg ccuaauaauc 3120acauaucauc augguugcua guguuaaucg aaaaccuacu aagugccagg cuuacugucu 3180cugggucuug cuuacgugga ugucauuuuu ccaguugcac caaaucgaaa gagguuaauu 3240gguuuguugg aguuccuuug uaggugaagg gcagagccag gagcuuggcu agggacaggg 3300gaggugagug ggggauggug gauaggucuu ggcucccagu uuccuucugg gcagacauug 3360ccccucugcc cugaggaccu gcuuguuugg gggaagaggc cuuuagaggc accaggguca 3420ugccaggugu uggacauggu gaacugggaa gugcucccau cuggccacag cgcagaagua 3480ucaccgugcu gggggauggg gaacagggcu gugaaugggc cuauuugcau aagcagcaug 3540ugucuggaga gaaagacauc acagagcaga agagugcggg ugcccaggag ugcacuugcc 3600accccuacuu caucccugaa agaguaaaug gccuggaagg ugucucugag agguaaugcc 3660gcacaccacc cucccugggg gcagggucag gcuacaccug ccuuaggucg ggggcugcag 3720cagccugaga gcucucagua gggccucagu agccugggag ggagcagggg cagggggcag 3780ggaaagaggc guaauggggc uguccagagg ggccugggaa accugguccc ugaggccugg 3840gcacagcuac aaucacuuca aauuggcugu ggggccagug gacugggaag gaaaaaagca 3900auaagaguga ccaagugcag aaggcuguca ggucccaggu cacaugccuu agugcaguga 3960cuccucauca uuuuaugggg uguggguguc guugguacac ccauuuuaca gaugaggaca 4020ccgaggccca gaaaaguuaa guuacauguc cuaagucaca cagcuuguaa gugccagaac 4080ugagaucaaa accaagucuc uuugacuuua aagucuguac ucugacccca aagagauccu 4140guuuggccac uuauaggagg ucccuaaagc ugcagacucc ccuugccggc acccacauau 4200agagacauua acccuucccc ugcaggguca ccucaaauag ucuuuuagcu gggcuucucc 4260ugcaauucca ccuaaugcca uccccugggu uuugcccaaa ccugaacugg gcaguggggu 4320gagaggaggg guuuacaggg uuacagagcc ucauacagau aggagcccau ggcugcuggu 4380caucugcauu ccugcaggau uggcuguucc uugggguccu uggcaggaaa augaggauug 4440cuccgaggcc ugcuccagua cuucccagag gcuggccugg uguggggcuc ugggaaggcu 4500gaggcuggag aagcguaagu aggagggcag agauggcacu cagguagcuu gaaucaccag 4560gacccuucca agccccacag guucugaggg aguacuaggg ccagcucugg gagaggucuc 4620uuccuaugcu gugaaccccc ugccuuucuu gcagccuaca acgaauaaau uuucuuugca 4680aaggcuu 468745706DNAHomo sapiens 45cucagaaauu acauagcauu ugucaaggau auaagaggga cugugccaca ucuggcugua

60uagaaggugg uuccauaucu uuaaauagag ccccaggucc uuagccacca gaaagguuuu 120caggggaagu gugcacccuc agcagcugcu gcuggugggc aggaugggca cgcauggaac 180aggcuuuccu cuguggccag gugagaagca gguggugaga cacagagcag ugcugggcuc 240ugcuucugaa gccuccaacc uuuccuuccc uaggaagccc cagagagauu ggugagggug 300auuucccagg aagacgcagu gugcucugac uucugugaca gugagcaacg ggaccagugg 360auguccagau gcuggcaaug agacaugcuc uggagucaga agacagcgaa aagagaagca 420gaagccccgg uggcaagagu cugaaggugg guuccuuccu gacaugggca uugggcugcg 480cauguguguu cgcaguucuu uccagcugcu guucugaccu cuuugugcag uguauuuaug 540uggcuguaga uggauggucc aagguagauu uagguuuugg aauacuguuu uuuuuuucua 600cuucagggag aaaauaaccc aguuugggaa ggacauuuaa aaggggaaaa uauuagguau 660gauggcacac cugcaguccc agcuauucgg gaggcuaagg cuggag 706463922DNAHomo sapiens 46cacgcaugga acaggcuuuc cucuguggcc aggugagaag caggugguga gacacagagc 60agugcugggc ucugcuucug aagccuccaa ccuuuccuuc ccuaggaagc cccagagaga 120uuggugaggg ugauuuccca ggaagacgca gugugcucug acuucuguga cagugagcaa 180cgggaccagu ggauguccag augcuggcaa ugagacaugc ucuggaguca gaagacagcg 240aaaagagaag cagaagcccc gguggcaaga gucugaagca ggaaggauga cuguagccug 300uggauuguac ugcaguagga aacuguccua gcaaggcucc acuuugcccc agcuucaagc 360uggaaaggag gagaacauga aacauugcuu gaagacaaug gccgagacag caggucccac 420ccugcacagc caccagcauc ucuccccuca gcccugucuc cucuucugca guugggaucu 480gcacauuuaa gccugaaauu guccugugaa gugaaguaug aucggacagc cucuuuucag 540cuuuuaugac aauggagaca gaggaauugu ggcucuugcc aaggucacag gauuggaaua 600cagagccaag ccaccccagg acaugcaaga gccucagaag ggaaaaaagc ccagcaggaa 660gggagaacaa guagccucug uccugaaguu guaacagcca ggggccagga uggaggagga 720ggaccccaua aucugcccau cugggacuug gcaggggacc ugggaaaaug uaccccaacc 780caucccuuaa gggccuuugu cuuuggccca uuggccuagc aucuacuucu ucaccguguc 840uguucuuguc acaccuaguc aggucuguuu gggucugagg ugcauggaac auucugggua 900ggccuccagc aaacggaagc ucuucaccgu guuuccagcc ugggaccaag ggcagcauac 960uggcaaaguu gccaaagcaa gggacuccag ccucuuagga guuaaugacu cccucucccc 1020agcuguccuc cccuuggugc uccucuuccu cccuccuccu gcucacagca ggcagggccu 1080agacccggga gccaugcugc ugugcuguug ccaggggagc acggaggcag aucugagcua 1140ugcagggaaa aggcccagcc ugucaaagug ucugagauga accgccgccg ucccugugca 1200gcugggcuca gacgugucuc agcucuuguu cugugccuga gaauggcgaa acccagugag 1260guucaagggc aaacucgcua uucauuaguc agggguucuu gacgucccgu cucucccagg 1320gaugaguucc ccccuccucu uucucccccu ccuaugacac auuccugggu gccuuuggug 1380aggacugcac acccuccucc ugccuagccc ccucuccaaa ggccccugaa uaaacucccc 1440ccaaggagac caggcagggc agagacaaug gcugcaggaa aucauucagg cgggacaugc 1500uggccugccc uccacccagu cccccugugg gccccacucc cuucugauuc agggcacccu 1560ugggccccca gccuauacag gccuggacag gaagaaacca cugggaacca cccuaaggac 1620aacaugcuag uccagugcca uucuucgcug gcucuguggg ugccuuugug gccuguaccg 1680acuggcuggc uaauuuugug guuucuguac caucacaugc cuauuuuaag acacucucca 1740gcacugucgg uuagggagug uaaauuuugc aauauuuucu gaaauguggc aauaucaaaa 1800uguaaaaggc acacauacuu ggucacaaac aaauggcacu auuuacucug ugggcauauu 1860uguaaaaguu gccaaagaau uauauacaag gauguucauc agagcauuuc uuuugaagag 1920uaaagaaaug gacaugaacc ugugguccgu ucauacggug gaauaccuau gcagcuguaa 1980aaaucagugu gguagaucuc cguauaugag uugaugugga agguuggcca guucacauga 2040uaaggugaau agaauaaguu acagaacagg cuguagagua ugaucuuauu uguagauguu 2100uaaaacugag ucauaaguau gcuuauauac agaucguuuc uggaaguaug uacuggaagu 2160cuaccucugg ggagugggga ugggggagug cacucuucua uacuguuaua uuuucuuuuc 2220augcuccuaa gguacuuuua uuggaagaug uaaagcgguu caauguaaua ggcuuaacuu 2280cugucaacua aguuggcgug ggugcuuuaa gaggguggua gugauguugc uggagaaagu 2340aucccacagu cacugguggc uucagccacg ggccauuuug gggccuaaua aucacauauc 2400aucaugguug cuaguguuaa ucgaaaaccu acuaagugcc aggcuuacug ucucuggguc 2460uugcuuacgu ggaugucauu uuuccaguug caccaaaucg aaagagguua auugguuugu 2520uggaguuccu uuguagguga agggcagagc caggagcuug gcuagggaca ggggagguga 2580gugggggaug guggauaggu cuuggcuccc aguuuccuuc ugggcagaca uugccccucu 2640gcccugagga ccugcuuguu ugggggaaga ggccuuuaga ggcaccaggg ucaugccagg 2700uguuggacau ggugaacugg gaagugcucc caucuggcca cagcgcagaa guaucaccgu 2760gcugggggau ggggaacagg gcugugaaug ggccuauuug cauaagcagc augugucugg 2820agagaaagac aucacagagc agaagagugc gggugcccag gagugcacuu gccaccccua 2880cuucaucccu gaaagaguaa auggccugga aggugucucu gagagguaau gccgcacacc 2940acccucccug ggggcagggu caggcuacac cugccuuagg ucgggggcug cagcagccug 3000agagcucuca guagggccuc aguagccugg gagggagcag gggcaggggg cagggaaaga 3060ggcguaaugg ggcuguccag aggggccugg gaaaccuggu cccugaggcc ugggcacagc 3120uacaaucacu ucaaauuggc uguggggcca guggacuggg aaggaaaaaa gcaauaagag 3180ugaccaagug cagaaggcug ucagguccca ggucacaugc cuuagugcag ugacuccuca 3240ucauuuuaug gggugugggu gucguuggua cacccauuuu acagaugagg acaccgaggc 3300ccagaaaagu uaaguuacau guccuaaguc acacagcuug uaagugccag aacugagauc 3360aaaaccaagu cucuuugacu uuaaagucug uacucugacc ccaaagagau ccuguuuggc 3420cacuuauagg aggucccuaa agcugcagac uccccuugcc ggcacccaca uauagagaca 3480uuaacccuuc cccugcaggg ucaccucaaa uagucuuuua gcugggcuuc uccugcaauu 3540ccaccuaaug ccauccccug gguuuugccc aaaccugaac ugggcagugg ggugagagga 3600gggguuuaca ggguuacaga gccucauaca gauaggagcc cauggcugcu ggucaucugc 3660auuccugcag gauuggcugu uccuuggggu ccuuggcagg aaaaugagga uugcuccgag 3720gccugcucca guacuuccca gaggcuggcc uggugugggg cucugggaag gcugaggcug 3780gagaagcgua aguaggaggg cagagauggc acucagguag cuugaaucac caggacccuu 3840ccaagcccca cagguucuga gggaguacua gggccagcuc ugggagaggu cucuuccuau 3900gcugugaacc cccugccuuu cu 3922473690DNAHomo sapiens 47uuuaacagca ggaaggauga cuguagccug uggauuguac ugcaguagga aacuguccua 60gcaaggcucc acuuugcccc agcuucaagc uggaaaggag gagaacauga aacauugcuu 120gaagacaaug gccgagacag caggucccac ccugcacagc caccagcauc ucuccccuca 180gcccugucuc cucuucugca guugggaucu gcacauuuaa gccugaaauu guccugugaa 240gugaaguaug aucggacagc cucuuuucag cuuuuaugac aauggagaca gaggaauugu 300ggcucuugcc aaggucacag gauuggaaua cagagccaag ccaccccagg acaugcaaga 360gccucagaag ggaaaaaagc ccagcaggaa gggagaacaa guagccucug uccugaaguu 420guaacagcca ggggccagga uggaggagga ggaccccaua aucugcccau cugggacuug 480gcaggggacc ugggaaaaug uaccccaacc caucccuuaa gggccuuugu cuuuggccca 540uuggccuagc aucuacuucu ucaccguguc uguucuuguc acaccuaguc aggucuguuu 600gggucugagg ugcauggaac auucugggua ggccuccagc aaacggaagc ucuucaccgu 660guuuccagcc ugggaccaag ggcagcauac uggcaaaguu gccaaagcaa gggacuccag 720ccucuuagga guuaaugacu cccucucccc agcuguccuc cccuuggugc uccucuuccu 780cccuccuccu gcucacagca ggcagggccu agacccggga gccaugcugc ugugcuguug 840ccaggggagc acggaggcag aucugagcua ugcagggaaa aggcccagcc ugucaaagug 900ucugagauga accgccgccg ucccugugca gcugggcuca gacgugucuc agcucuuguu 960cugugccuga gaauggcgaa acccagugag guucaagggc aaacucgcua uucauuaguc 1020agggguucuu gacgucccgu cucucccagg gaugaguucc ccccuccucu uucucccccu 1080ccuaugacac auuccugggu gccuuuggug aggacugcac acccuccucc ugccuagccc 1140ccucuccaaa ggccccugaa uaaacucccc ccaaggagac caggcagggc agagacaaug 1200gcugcaggaa aucauucagg cgggacaugc uggccugccc uccacccagu cccccugugg 1260gccccacucc cuucugauuc agggcacccu ugggccccca gccuauacag gccuggacag 1320gaagaaacca cugggaacca cccuaaggac aacaugcuag uccagugcca uucuucgcug 1380gcucuguggg ugccuuugug gccuguaccg acuggcuggc uaauuuugug guuucuguac 1440caucacaugc cuauuuuaag acacucucca gcacugucgg uuagggagug uaaauuuugc 1500aauauuuucu gaaauguggc aauaucaaaa uguaaaaggc acacauacuu ggucacaaac 1560aaauggcacu auuuacucug ugggcauauu uguaaaaguu gccaaagaau uauauacaag 1620gauguucauc agagcauuuc uuuugaagag uaaagaaaug gacaugaacc ugugguccgu 1680ucauacggug gaauaccuau gcagcuguaa aaaucagugu gguagaucuc cguauaugag 1740uugaugugga agguuggcca guucacauga uaaggugaau agaauaaguu acagaacagg 1800cuguagagua ugaucuuauu uguagauguu uaaaacugag ucauaaguau gcuuauauac 1860agaucguuuc uggaaguaug uacuggaagu cuaccucugg ggagugggga ugggggagug 1920cacucuucua uacuguuaua uuuucuuuuc augcuccuaa gguacuuuua uuggaagaug 1980uaaagcgguu caauguaaua ggcuuaacuu cugucaacua aguuggcgug ggugcuuuaa 2040gaggguggua gugauguugc uggagaaagu aucccacagu cacugguggc uucagccacg 2100ggccauuuug gggccuaaua aucacauauc aucaugguug cuaguguuaa ucgaaaaccu 2160acuaagugcc aggcuuacug ucucuggguc uugcuuacgu ggaugucauu uuuccaguug 2220caccaaaucg aaagagguua auugguuugu uggaguuccu uuguagguga agggcagagc 2280caggagcuug gcuagggaca ggggagguga gugggggaug guggauaggu cuuggcuccc 2340aguuuccuuc ugggcagaca uugccccucu gcccugagga ccugcuuguu ugggggaaga 2400ggccuuuaga ggcaccaggg ucaugccagg uguuggacau ggugaacugg gaagugcucc 2460caucuggcca cagcgcagaa guaucaccgu gcugggggau ggggaacagg gcugugaaug 2520ggccuauuug cauaagcagc augugucugg agagaaagac aucacagagc agaagagugc 2580gggugcccag gagugcacuu gccaccccua cuucaucccu gaaagaguaa auggccugga 2640aggugucucu gagagguaau gccgcacacc acccucccug ggggcagggu caggcuacac 2700cugccuuagg ucgggggcug cagcagccug agagcucuca guagggccuc aguagccugg 2760gagggagcag gggcaggggg cagggaaaga ggcguaaugg ggcuguccag aggggccugg 2820gaaaccuggu cccugaggcc ugggcacagc uacaaucacu ucaaauuggc uguggggcca 2880guggacuggg aaggaaaaaa gcaauaagag ugaccaagug cagaaggcug ucagguccca 2940ggucacaugc cuuagugcag ugacuccuca ucauuuuaug gggugugggu gucguuggua 3000cacccauuuu acagaugagg acaccgaggc ccagaaaagu uaaguuacau guccuaaguc 3060acacagcuug uaagugccag aacugagauc aaaaccaagu cucuuugacu uuaaagucug 3120uacucugacc ccaaagagau ccuguuuggc cacuuauagg aggucccuaa agcugcagac 3180uccccuugcc ggcacccaca uauagagaca uuaacccuuc cccugcaggg ucaccucaaa 3240uagucuuuua gcugggcuuc uccugcaauu ccaccuaaug ccauccccug gguuuugccc 3300aaaccugaac ugggcagugg ggugagagga gggguuuaca ggguuacaga gccucauaca 3360gauaggagcc cauggcugcu ggucaucugc auuccugcag gauuggcugu uccuuggggu 3420ccuuggcagg aaaaugagga uugcuccgag gccugcucca guacuuccca gaggcuggcc 3480uggugugggg cucugggaag gcugaggcug gagaagcgua aguaggaggg cagagauggc 3540acucagguag cuugaaucac caggacccuu ccaagcccca cagguucuga gggaguacua 3600gggccagcuc ugggagaggu cucuuccuau gcugugaacc cccugccuuu cuugcagccu 3660acaacgaaua aauuuucuuu gcaaaggcuu 3690484093DNAHomo sapiens 48cuuuuagcca ccccagugcu gggcagccag ggugugggcu uuugacugaa ugcacuugcc 60cuccugcauu cauuacacca uugucagugu gugugucugg ggcugccucu gggugugcau 120gguuuuuuuu gugucugcgu gucaguguca ggcuaugugu gucuguuucu gucggccugu 180cuaggcgcgc ucagugcaac aaggagcugg gggagguggc gguaaagagg aagggcauuu 240caaagcccag cuguccuccu cagggaccuc aggagaugcg ugugugugug ugugugugug 300ugugugugug ugugugugua uuuuuuucca ugcugcucau uguguggggc ugcaugcgag 360ugucugacca gguguggugu gagcagccgc ugggcugggu gagccccauc ugccgugagc 420ucccagacuu gccuucuagc ccucugccgc cauccauggg gagccucucc cuucgcagcu 480caccgucucu ucucuaauuu auuagcugga aaggaggaga acaugaaaca uugcuugaag 540acaauggccg agacagcagg ucccacccug cacagccacc agcaucucuc cccucagccc 600ugucuccucu ucugcaguug ggaucugcac auuuaagccu gaaauugucc ugugaaguga 660aguaugaucg gacagccucu uuucagcuuu uaugacaaug gagacagagg aauuguggcu 720cuugccaagg ucacaggauu ggaauacaga gccaagccac cccaggacau gcaagagccu 780cagaagggaa aaaagcccag caggaaggga gaacaaguag ccucuguccu gaaguuguaa 840cagccagggg ccaggaugga ggaggaggac cccauaaucu gcccaucugg gacuuggcag 900gggaccuggg aaaauguacc ccaacccauc ccuuaagggc cuuugucuuu ggcccauugg 960ccuagcaucu acuucuucac cgugucuguu cuugucacac cuagucaggu cuguuugggu 1020cugaggugca uggaacauuc uggguaggcc uccagcaaac ggaagcucuu caccguguuu 1080ccagccuggg accaagggca gcauacuggc aaaguugcca aagcaaggga cuccagccuc 1140uuaggaguua augacucccu cuccccagcu guccuccccu uggugcuccu cuuccucccu 1200ccuccugcuc acagcaggca gggccuagac ccgggagcca ugcugcugug cuguugccag 1260gggagcacgg aggcagaucu gagcuaugca gggaaaaggc ccagccuguc aaagugucug 1320agaugaaccg ccgccguccc ugugcagcug ggcucagacg ugucucagcu cuuguucugu 1380gccugagaau ggcgaaaccc agugagguuc aagggcaaac ucgcuauuca uuagucaggg 1440guucuugacg ucccgucucu cccagggaug aguucccccc uccucuuucu cccccuccua 1500ugacacauuc cugggugccu uuggugagga cugcacaccc uccuccugcc uagcccccuc 1560uccaaaggcc ccugaauaaa cuccccccaa ggagaccagg cagggcagag acaauggcug 1620caggaaauca uucaggcggg acaugcuggc cugcccucca cccagucccc cugugggccc 1680cacucccuuc ugauucaggg cacccuuggg cccccagccu auacaggccu ggacaggaag 1740aaaccacugg gaaccacccu aaggacaaca ugcuagucca gugccauucu ucgcuggcuc 1800ugugggugcc uuuguggccu guaccgacug gcuggcuaau uuugugguuu cuguaccauc 1860acaugccuau uuuaagacac ucuccagcac ugucgguuag ggaguguaaa uuuugcaaua 1920uuuucugaaa uguggcaaua ucaaaaugua aaaggcacac auacuugguc acaaacaaau 1980ggcacuauuu acucuguggg cauauuugua aaaguugcca aagaauuaua uacaaggaug 2040uucaucagag cauuucuuuu gaagaguaaa gaaauggaca ugaaccugug guccguucau 2100acgguggaau accuaugcag cuguaaaaau caguguggua gaucuccgua uaugaguuga 2160uguggaaggu uggccaguuc acaugauaag gugaauagaa uaaguuacag aacaggcugu 2220agaguaugau cuuauuugua gauguuuaaa acugagucau aaguaugcuu auauacagau 2280cguuucugga aguauguacu ggaagucuac cucuggggag uggggauggg ggagugcacu 2340cuucuauacu guuauauuuu cuuuucaugc uccuaaggua cuuuuauugg aagauguaaa 2400gcgguucaau guaauaggcu uaacuucugu caacuaaguu ggcgugggug cuuuaagagg 2460gugguaguga uguugcugga gaaaguaucc cacagucacu gguggcuuca gccacgggcc 2520auuuuggggc cuaauaauca cauaucauca ugguugcuag uguuaaucga aaaccuacua 2580agugccaggc uuacugucuc ugggucuugc uuacguggau gucauuuuuc caguugcacc 2640aaaucgaaag agguuaauug guuuguugga guuccuuugu aggugaaggg cagagccagg 2700agcuuggcua gggacagggg aggugagugg gggauggugg auaggucuug gcucccaguu 2760uccuucuggg cagacauugc cccucugccc ugaggaccug cuuguuuggg ggaagaggcc 2820uuuagaggca ccagggucau gccagguguu ggacauggug aacugggaag ugcucccauc 2880uggccacagc gcagaaguau caccgugcug ggggaugggg aacagggcug ugaaugggcc 2940uauuugcaua agcagcaugu gucuggagag aaagacauca cagagcagaa gagugcgggu 3000gcccaggagu gcacuugcca ccccuacuuc aucccugaaa gaguaaaugg ccuggaaggu 3060gucucugaga gguaaugccg cacaccaccc ucccuggggg cagggucagg cuacaccugc 3120cuuaggucgg gggcugcagc agccugagag cucucaguag ggccucagua gccugggagg 3180gagcaggggc agggggcagg gaaagaggcg uaauggggcu guccagaggg gccugggaaa 3240ccuggucccu gaggccuggg cacagcuaca aucacuucaa auuggcugug gggccagugg 3300acugggaagg aaaaaagcaa uaagagugac caagugcaga aggcugucag gucccagguc 3360acaugccuua gugcagugac uccucaucau uuuauggggu gugggugucg uugguacacc 3420cauuuuacag augaggacac cgaggcccag aaaaguuaag uuacaugucc uaagucacac 3480agcuuguaag ugccagaacu gagaucaaaa ccaagucucu uugacuuuaa agucuguacu 3540cugaccccaa agagauccug uuuggccacu uauaggaggu cccuaaagcu gcagacuccc 3600cuugccggca cccacauaua gagacauuaa cccuuccccu gcagggucac cucaaauagu 3660cuuuuagcug ggcuucuccu gcaauuccac cuaaugccau ccccuggguu uugcccaaac 3720cugaacuggg caguggggug agaggagggg uuuacagggu uacagagccu cauacagaua 3780ggagcccaug gcugcugguc aucugcauuc cugcaggauu ggcuguuccu ugggguccuu 3840ggcaggaaaa ugaggauugc uccgaggccu gcuccaguac uucccagagg cuggccuggu 3900guggggcucu gggaaggcug aggcuggaga agcguaagua ggagggcaga gauggcacuc 3960agguagcuug aaucaccagg acccuuccaa gccccacagg uucugaggga guacuagggc 4020cagcucuggg agaggucucu uccuaugcug ugaacccccu gccuuucuug cagccuacaa 4080cgaauaaauu uuc 409349957DNAHomo sapiens 49aaggcggcgc tgggagccgc tcagagccca gagaagcggc gcgcggccag gagcccccgc 60tccgccactg ccgtgcctgc ctcccgcagc tgtctgccat gcgctcgccg gggcaggggc 120gcccggaggg cggctagagc tgggcctgag cccgggaacg cgcctgatca ggggtggcgg 180agccgcggtc cccacagccg ccccacccgc gccgctgcct cgctggggcc cgggccccct 240tcccgttact cccctgctgg tgcctccctc cttggcgcgc ttcccacctg cgatcggcgc 300cctcttcgca gtcacgaact cgccagcagc tagcagcact gactagtagg agggcccgcc 360ggaggagagg acatgctctg gagtcagaag acagcgaaaa gagaagcaga agccccggtg 420gcaagagtct gaagcaggaa ggatgactgt agcctgtgga ttgtactgca gtaggaaact 480gtcctagcaa ggctccactt tgccccagct tcaagctgga aaggaggaga acatgaaaca 540ttgcttgaag acaatggccg agacagcagg tcccaccctg cacagccacc agcatctctc 600ccctcagccc tgtctcctct tctgcagttg ggatctgcac atttaagcct gaaattgtcc 660tgtgaagtga agtatgatcg gacagcctct tttcagcttt tatgacaatg gagacagagg 720aattgtggct cttgccaagg tcacaggatt ggaatacaga gccaagccac cccaggacat 780gcaagagcct cagaagggaa aaaagcccag caggaaggga gaacaagtag cctctgtcct 840gaagttgtaa cagccagggg ccaggatgga ggaggaggac cccataatct gcccatctgg 900gacttggcag gggacctggg aaaatgtacc ccaacccatc ccttaagggc ctttgtc 957502124DNAHomo sapiens 50gattctcaca acttctgcgt gcgagcgccc gccccaccga ccgccccggc ccggcccgca 60agagccagag gagccgagag gagcccagcg ccggcccagc ggactccagc tcgacggagc 120ggccgcgccc cgaccagtta ctcccctgct ggtgcctccc tccttggcgc gcttcccacc 180tgcgatcggc gccctcttcg cagtcacgaa ctcgccagca gctagcagca ctgactagta 240ggagggcccg ccggaggaga ggaagcccca gagagattgg tgagggtgat ttcccaggaa 300gacgcagtgt gctctgactt ctgtgacagt gagcaacggg accagtggat gtccagatgc 360tggcaatgag acatgctctg gagtcagaag acagcgaaaa gagaagcaga agccccggtg 420gcaagagtct gaagcaggaa ggatgactgt agcctgtgga ttgtactgca gtaggaaact 480gtcctagcaa ggctccactt tgccccagct tcaagctgga aaggaggaga acatgaaaca 540ttgcttgaag acaatggccg agacagcagg tcccaccctg cacagccacc agcatctctc 600ccctcagccc tgtctcctct tctgcagttg ggatctgcac atttaagcct gaaattgtcc 660tgtgaagtga agtatgatcg gacagcctct tttcagcttt tatgacaatg gagacagagg 720aattgtggct cttgccaagg tcacaggatt ggaatacaga gccaagccac cccaggacat 780gcaagagcct cagaagggaa aaaagcccag caggaaggga gaacaagtag cctctgtcct 840gaagttgtaa cagccagggg ccaggatgga ggaggaggac cccataatct gcccatctgg 900gacttggcag gggacctggg aaaatgtacc ccaacccatc ccttaagggc ctttgtcttt 960ggcccattgg cctagcatct acttcttcac cgtgtctgtt cttgtcacac ctagtcaggt 1020ctgtttgggt ctgaggtgca tggaacattc tgggtaggcc tccagcaaac ggaagctctt 1080caccgtgttt ccagcctggg accaagggca gcatactggc aaagttgcca aagcaaggga 1140ctccagcctc ttaggagtta atgactccct ctccccagct gtcctcccct tggtgctcct 1200cttcctccct cctcctgctc acagcaggca gggcctagac ccgggagcca tgctgctgtg 1260ctgttgccag gggagcacgg aggcagatct gagctatgca gggaaaaggc ccagcctgtc 1320aaagtgtctg agatgaaccg ccgccgtccc tgtgcagctg ggctcagacg tgtctcagct 1380cttgttctgt gcctgagaat ggcgaaaccc agtgaggttc aagggcaaac tcgctattca 1440ttagtcaggg gttcttgacg tcccgtctct cccagggatg agttcccccc tcctctttct 1500ccccctccta tgacacattc

ctgggtgcct ttggtgagga ctgcacaccc tcctcctgcc 1560tagccccctc tccaaaggcc cctgaataaa ctccccccaa ggagaccagg cagggcagag 1620acaatggctg caggaaatca ttcaggcggg acatgctggc ctgcccttca cctcaaatag 1680tcttttagct gggcttctcc tgcaattcca cctaatgcca tcccctgggt tttgcccaaa 1740cctgaactgg gcagtggggt gagaggaggg gtttacaggg ttacagagcc tcatacagat 1800aggagcccat ggctgctggt catctgcatt cctgcaggat tggctgttcc ttggggtcct 1860tggcaggaaa atgaggattg ctccgaggcc tgctccagta cttcccagag gctggcctgg 1920tgtggggctc tgggaaggct gaggctggag aagcgtaagt aggagggcag agatggcact 1980caggtagctt gaatcaccag gacccttcca agccccacag gttctgaggg agtactaggg 2040ccagctctgg gagaggtctc ttcctatgct gtgaaccccc tgcctttctt gcagcctaca 2100acgaataaat tttctttgca aagg 212451576DNAHomo sapiens 51agtgcgtggg ggtcccggcc ccacacagtg ctagggtccc tctcgagttt ctcatctgcc 60ttcaggtcac tttccaccct gatgccttgg cttgtcctga agctcagggc ccctgtagct 120tgggaaacct cccaagctcc ccagcgagtg gctgtagacc aaggaaggga ccctgcccgg 180cttcagggaa gaaaggaaga aagttactcc cctgctggtg cctccctcct tggcgcgctt 240cccacctgcg atcggcgccc tcttcgcagt cacgaactcg ccagcagcta gcagcactga 300ctagtaggag ggcccgccgg aggagagccg cgcggcccac agaagcggaa cgcgcgtcga 360gagcgccctg tccgctcgcc ccagacagat gcccggttat tcattaccgc gaggcctaga 420ggaaagagtg gctgccgtct tcctgcccac agcccgccgg accctccgtc gcggctgccc 480ggtccccgga gccgcagccg ccgagcccgg ctgtgcgtgt cgtggctgct ggggagaaag 540aggcttccgg acatgctctg gagtcagaag acagcg 576526547DNAHomo sapiens 52gagcaatgtc ctgggaggcc tggctgagct tgtgtccagg agcactggac ttgtgttaaa 60cactgtcccc ttggatgggc ccagaagtca aacctgtcca ttagattttt tttttttttc 120ctttgggaga gctggtatgg gctggttgtc ctccaggaga gccctgttct cacccgaggt 180ctgttaatga gctggggaca ggtgagcctc acacgttcca acttggctgc cttcagcggc 240atccaggagc agtggtgagc tattaatgga aggtgccggc tttgtgctaa ttagaacttc 300ctttcagctt ccatctgtgc agacactgga gcccctcact ggtcagcctc gccgtcccaa 360cccccctcag tttgcaacct agtttttgtc ccccccaccc cccatgaatt agggggtgct 420atgagtggag ctgctttcct ctagctctgg tcaaatcccg gctctttgtg tattgcagaa 480ctgtactggg tggtatttct cagggcttct ctctcttgtt ggggtggaga tggacctgga 540agatggagtt ggaaagggat ttgggcacca tggccaccct cctgggtagg ctggacttac 600atcatcgacc tgagtttgtt ttgtgaaaga cctctctcct ctgccctctg gagactgtga 660cttcagaccc ttgtcctctc cattacccca gtcctgatgt ctcccaagtc tgatggtacc 720cacccatgtc aactacagct gccatctttg ccatctcagg cagcctacag gtgggggctg 780tgtccttgac cctcttctga aaaagaaaaa cctatttttt tccttcactt ttgcttttta 840tttctttcac ctcaggccca atggatatat atatatatat atatatatat acatatatat 900atatacatat atacatatat atatatatat atatatatat attttaatgg aggttgtctc 960ttacagaggt ttcattgaaa aagaagaaac aatgtcccat taacgtcatt taaagaaaaa 1020gcacctctca gaatggaggt tgggaaagct agggtttctt gcctgaatat cagttgggat 1080gaaatccctt gtaaggaact caagagagga gcgtttcctc agaatgcttt ctttagctcc 1140tgagtctcct taggtctcca ctggggttgt gtgtaaaaat accaagccct ccctgacaat 1200gcatctcatt ctcttctgct tgaattatcc tgataatgag agatcaccca cttctttgac 1260agtgtagact aggattttaa aaattgggag tgaattattg gacagtgtgg cacttcacca 1320gcttccctca aggttctgga tctatgctaa agaggggtgg aaatgcttct ggggtgtcca 1380gaagggctgc agaaattcct cgtcactggt catggggaga gcaggactgg cttgcctctg 1440tggcctcttc tgcctctgga ggtgacaatt cctgatttga ggcactaggg tggaagactc 1500aggactatcc aggaccaggt taataaaccg gcagtccaga ttgcagaagg gcagcagctg 1560ggggctgggg acatgcccat gcctgtggga cagagttctt ttgcatgctt tggcctttac 1620gactctgtat ccttgacaag tcacaggcat ctctgggtga aatggggaca atagtaccca 1680tacctccaag ggttatgtga gaattaagta aaatatgcaa ataaagtgcc tggcatacag 1740taggcactga gcaaacggta gctctttttt ccaggctggg gcaagggatg catataaatg 1800tctggatctg aagtttgaaa ttccacctgc tggagacagt gaacacccca gtagataccc 1860caaatcacac agaaggacgg atgaccagct gccttcttcc cccagggcat gccatacaca 1920ctgggcctga agtgggagaa tcgggacccc aaaaaaacgg cttgtggagc ggggttgcac 1980atgggtgtaa agttcccagc ttggctgcct ggggaggggg agcatgtaaa tgtctttaga 2040gatttgaagg gaccaggatc tggactgatt tgcgttgccc agggggctgg ggctgggagc 2100caagggggtg ctgccgggag gcccaggtta gcttggggta tggcatttct aacagttggc 2160gcctgcggaa aatggcctgg ggttccagct ctggaaggtt ccgaatctca gtattcacga 2220gcggcgctgt ccggagcagc cagggttgtc ccttggtggt ctcgggcagg ttctccgcga 2280tgcgcttgct gggtcgcagg tgagaacctc acggttctcc atttccggag atccagctct 2340gagcaggcag agggtcgctc ccgtcgcctg cccctgcggt agccaagcgg gtggctggaa 2400gcgtggctag ctggcaggta aggagctcca ggtgagacgg aacacgaccc ccaaccccct 2460tagccggtgc cccacccgat ttctctcctg cgtcctggga gggcatggtt gaggcgccac 2520cggtgcccag caacctcccc aggctgtggt tgtgacctga ggacgcgtgt gtccccgccc 2580tcaggccacc gctacgcgac cctgagtgca ccttcaagaa ggccgggcac gtttctgggc 2640gggcgtgggg ggtgcctgat atctccgctc tattttacag ttactcccct gctggtgcct 2700ccctccttgg cgcgcttccc acctgcgatc ggcgccctct tcgcagtcac gaactcgcca 2760gcagctagca gcactgacta gtaggagggc ccgccggagg agaggacatg ctctggagtc 2820agaagacagc gaaaagagaa gcagaagccc cggtggcaag agtctgaagg aaggatgact 2880gtagcctgtg gattgtactg cagtaggaaa ctgtcctagc aaggctccac tttgccccag 2940cttcaagctg gaaaggagga gaacatgaaa cattgcttga agacaatggc cgagacagca 3000ggtcccaccc tgcacagcca ccagcatctc tcccctcagc cctgtctcct cttctgcagt 3060tgggatctgc acatttaagc ctgaaattgt cctgtgaagt gaagtatgat cggacagcct 3120cttttcagct tttatgacaa tggagacaga ggaattgtgg ctcttgccaa ggtcacagga 3180ttggaataca gagccaagcc accccaggac atgcaagagc ctcagaaggg aaaaaagccc 3240agcaggaagg gagaacaagt agcctctgtc ctgaagttgt aacagccagg ggccaggatg 3300gaggaggagg accccataat ctgcccatct gggacttggc aggggacctg ggaaaatgta 3360ccccaaccca tcccttaagg gcctttgtct ttggcccatt ggcctagcat ctacttcttc 3420accgtgtctg ttcttgtcac acctagtcag gtctgtttgg gtctgaggtg catggaacat 3480tctgggtagg cctccagcaa acggaagctc ttcaccgtgt ttccagcctg ggaccaaggg 3540cagcatactg gcaaagttgc caaagcaagg gactccagcc tcttaggagt taatgactcc 3600ctctccccag ctgtcctccc cttggtgctc ctcttcctcc ctcctcctgc tcacagcagg 3660cagggcctag acccgggagc catgctgctg tgctgttgcc aggggagcac ggaggcagat 3720ctgagctatg cagggaaaag gcccagcctg tcaaagtgtc tgagatgaac cgccgccgtc 3780cctgtgcagc tgggctcaga cgtgtctcag ctcttgttct gtgcctgaga atggcgaaac 3840ccagtgaggt tcaagggcaa actcgctatt cattagtcag gggttcttga cgtcccgtct 3900ctcccaggga tgagttcccc cctcctcttt ctccccctcc tatgacacat tcctgggtgc 3960ctttggtgag gactgcacac cctcctcctg cctagccccc tctccaaagg cccctgaata 4020aactcccccc aaggagacca ggcagggcag agacaatggc tgcaggaaat cattcaggcg 4080ggacatgctg gcctgccctc cacccagtcc ccctgtgggc cccactccct tctgattcag 4140ggcacccttg ggcccccagc ctatacaggc ctggacagga agaaaccact gggaaccacc 4200ctaaggacaa catgctagtc cagtgccatt cttcgctggc tctgtgggtg cctttgtggc 4260ctgtaccgac tggctggcta attttgtggt ttctgtacca tcacatgcct attttaagac 4320actctccagc actgtcggtt agggagtgta aattttgcaa tattttctga aatgtggcaa 4380tatcaaaatg taaaaggcac acatacttgg tcacaaacaa atggcactat ttactctgtg 4440ggcatatttg taaaagttgc caaagaatta tatacaagga tgttcatcag agcatttctt 4500ttgaagagta aagaaatgga catgaacctg tggtccgttc atacggtgga atacctatgc 4560agctgtaaaa atcagtgtgg tagatctccg tatatgagtt gatgtggaag gttggccagt 4620tcacatgata aggtgaatag aataagttac agaacaggct gtagagtatg atcttatttg 4680tagatgttta aaactgagtc ataagtatgc ttatatacag atcgtttctg gaagtatgta 4740ctggaagtct acctctgggg agtggggatg ggggagtgca ctcttctata ctgttatatt 4800ttcttttcat gctcctaagg tacttttatt ggaagatgta aagcggttca atgtaatagg 4860cttaacttct gtcaactaag ttggcgtggg tgctttaaga gggtggtagt gatgttgctg 4920gagaaagtat cccacagtca ctggtggctt cagccacggg ccattttggg gcctaataat 4980cacatatcat catggttgct agtgttaatc gaaaacctac taagtgccag gcttactgtc 5040tctgggtctt gcttacgtgg atgtcatttt tccagttgca ccaaatcgaa agaggttaat 5100tggtttgttg gagttccttt gtaggtgaag ggcagagcca ggagcttggc tagggacagg 5160ggaggtgagt gggggatggt ggataggtct tggctcccag tttccttctg ggcagacatt 5220gcccctctgc cctgaggacc tgcttgtttg ggggaagagg cctttagagg caccagggtc 5280atgccaggtg ttggacatgg tgaactggga agtgctccca tctggccaca gcgcagaagt 5340atcaccgtgc tgggggatgg ggaacagggc tgtgaatggg cctatttgca taagcagcat 5400gtgtctggag agaaagacat cacagagcag aagagtgcgg gtgcccagga gtgcacttgc 5460cacccctact tcatccctga aagagtaaat ggcctggaag gtgtctctga gaggtaatgc 5520cgcacaccac cctccctggg ggcagggtca ggctacacct gccttaggtc gggggctgca 5580gcagcctgag agctctcagt agggcctcag tagcctggga gggagcaggg gcagggggca 5640gggaaagagg cgtaatgggg ctgtccagag gggcctggga aacctggtcc ctgaggcctg 5700ggcacagcta caatcacttc aaattggctg tggggccagt ggactgggaa ggaaaaaagc 5760aataagagtg accaagtgca gaaggctgtc aggtcccagg tcacatgcct tagtgcagtg 5820actcctcatc attttatggg gtgtgggtgt cgttggtaca cccattttac agatgaggac 5880accgaggccc agaaaagtta agttacatgt cctaagtcac acagcttgta agtgccagaa 5940ctgagatcaa aaccaagtct ctttgacttt aaagtctgta ctctgacccc aaagagatcc 6000tgtttggcca cttataggag gtccctaaag ctgcagactc cccttgccgg cacccacata 6060tagagacatt aacccttccc ctgcagggtc acctcaaata gtcttttagc tgggcttctc 6120ctgcaattcc acctaatgcc atcccctggg ttttgcccaa acctgaactg ggcagtgggg 6180tgagaggagg ggtttacagg gttacagagc ctcatacaga taggagccca tggctgctgg 6240tcatctgcat tcctgcagga ttggctgttc cttggggtcc ttggcaggaa aatgaggatt 6300gctccgaggc ctgctccagt acttcccaga ggctggcctg gtgtggggct ctgggaaggc 6360tgaggctgga gaagcgtaag taggagggca gagatggcac tcaggtagct tgaatcacca 6420ggacccttcc aagccccaca ggttctgagg gagtactagg gccagctctg ggagaggtct 6480cttcctatgc tgtgaacccc ctgcctttct tgcagcctac aacgaataaa ttttctttgc 6540aaaggct 6547533814DNAHomo sapiens 53cacgtttctg ggcgggcgtg gggggtgcct gatatctccg ctctatttta cagttactcc 60cctgctggtg cctccctcct tggcgcgctt cccacctgcg atcggcgccc tcttcgcagt 120cacgaactcg ccagcagcta gcagcactga ctagtaggag ggcccgccgg aggagaggac 180atgctctgga gtcagaagac agcgaaaaga gaagcagaag ccccggtggc aagagtctga 240agctggaaag gaggagaaca tgaaacattg cttgaagaca atggccgaga cagcaggtcc 300caccctgcac agccaccagc atctctcccc tcagccctgt ctcctcttct gcagttggga 360tctgcacatt taagcctgaa attgtcctgt gaagtgaagt atgatcggac agcctctttt 420cagcttttat gacaatggag acagaggaat tgtggctctt gccaaggtca caggattgga 480atacagagcc aagccacccc aggacatgca agagcctcag aagggaaaaa agcccagcag 540gaagggagaa caagtagcct ctgtcctgaa gttgtaacag ccaggggcca ggatggagga 600ggaggacccc ataatctgcc catctgggac ttggcagggg acctgggaaa atgtacccca 660acccatccct taagggcctt tgtctttggc ccattggcct agcatctact tcttcaccgt 720gtctgttctt gtcacaccta gtcaggtctg tttgggtctg aggtgcatgg aacattctgg 780gtaggcctcc agcaaacgga agctcttcac cgtgtttcca gcctgggacc aagggcagca 840tactggcaaa gttgccaaag caagggactc cagcctctta ggagttaatg actccctctc 900cccagctgtc ctccccttgg tgctcctctt cctccctcct cctgctcaca gcaggcaggg 960cctagacccg ggagccatgc tgctgtgctg ttgccagggg agcacggagg cagatctgag 1020ctatgcaggg aaaaggccca gcctgtcaaa gtgtctgaga tgaaccgccg ccgtccctgt 1080gcagctgggc tcagacgtgt ctcagctctt gttctgtgcc tgagaatggc gaaacccagt 1140gaggttcaag ggcaaactcg ctattcatta gtcaggggtt cttgacgtcc cgtctctccc 1200agggatgagt tcccccctcc tctttctccc cctcctatga cacattcctg ggtgcctttg 1260gtgaggactg cacaccctcc tcctgcctag ccccctctcc aaaggcccct gaataaactc 1320cccccaagga gaccaggcag ggcagagaca atggctgcag gaaatcattc aggcgggaca 1380tgctggcctg ccctccaccc agtccccctg tgggccccac tcccttctga ttcagggcac 1440ccttgggccc ccagcctata caggcctgga caggaagaaa ccactgggaa ccaccctaag 1500gacaacatgc tagtccagtg ccattcttcg ctggctctgt gggtgccttt gtggcctgta 1560ccgactggct ggctaatttt gtggtttctg taccatcaca tgcctatttt aagacactct 1620ccagcactgt cggttaggga gtgtaaattt tgcaatattt tctgaaatgt ggcaatatca 1680aaatgtaaaa ggcacacata cttggtcaca aacaaatggc actatttact ctgtgggcat 1740atttgtaaaa gttgccaaag aattatatac aaggatgttc atcagagcat ttcttttgaa 1800gagtaaagaa atggacatga acctgtggtc cgttcatacg gtggaatacc tatgcagctg 1860taaaaatcag tgtggtagat ctccgtatat gagttgatgt ggaaggttgg ccagttcaca 1920tgataaggtg aatagaataa gttacagaac aggctgtaga gtatgatctt atttgtagat 1980gtttaaaact gagtcataag tatgcttata tacagatcgt ttctggaagt atgtactgga 2040agtctacctc tggggagtgg ggatggggga gtgcactctt ctatactgtt atattttctt 2100ttcatgctcc taaggtactt ttattggaag atgtaaagcg gttcaatgta ataggcttaa 2160cttctgtcaa ctaagttggc gtgggtgctt taagagggtg gtagtgatgt tgctggagaa 2220agtatcccac agtcactggt ggcttcagcc acgggccatt ttggggccta ataatcacat 2280atcatcatgg ttgctagtgt taatcgaaaa cctactaagt gccaggctta ctgtctctgg 2340gtcttgctta cgtggatgtc atttttccag ttgcaccaaa tcgaaagagg ttaattggtt 2400tgttggagtt cctttgtagg tgaagggcag agccaggagc ttggctaggg acaggggagg 2460tgagtggggg atggtggata ggtcttggct cccagtttcc ttctgggcag acattgcccc 2520tctgccctga ggacctgctt gtttggggga agaggccttt agaggcacca gggtcatgcc 2580aggtgttgga catggtgaac tgggaagtgc tcccatctgg ccacagcgca gaagtatcac 2640cgtgctgggg gatggggaac agggctgtga atgggcctat ttgcataagc agcatgtgtc 2700tggagagaaa gacatcacag agcagaagag tgcgggtgcc caggagtgca cttgccaccc 2760ctacttcatc cctgaaagag taaatggcct ggaaggtgtc tctgagaggt aatgccgcac 2820accaccctcc ctgggggcag ggtcaggcta cacctgcctt aggtcggggg ctgcagcagc 2880ctgagagctc tcagtagggc ctcagtagcc tgggagggag caggggcagg gggcagggaa 2940agaggcgtaa tggggctgtc cagaggggcc tgggaaacct ggtccctgag gcctgggcac 3000agctacaatc acttcaaatt ggctgtgggg ccagtggact gggaaggaaa aaagcaataa 3060gagtgaccaa gtgcagaagg ctgtcaggtc ccaggtcaca tgccttagtg cagtgactcc 3120tcatcatttt atggggtgtg ggtgtcgttg gtacacccat tttacagatg aggacaccga 3180ggcccagaaa agttaagtta catgtcctaa gtcacacagc ttgtaagtgc cagaactgag 3240atcaaaacca agtctctttg actttaaagt ctgtactctg accccaaaga gatcctgttt 3300ggccacttat aggaggtccc taaagctgca gactcccctt gccggcaccc acatatagag 3360acattaaccc ttcccctgca gggtcacctc aaatagtctt ttagctgggc ttctcctgca 3420attccaccta atgccatccc ctgggttttg cccaaacctg aactgggcag tggggtgaga 3480ggaggggttt acagggttac agagcctcat acagatagga gcccatggct gctggtcatc 3540tgcattcctg caggattggc tgttccttgg ggtccttggc aggaaaatga ggattgctcc 3600gaggcctgct ccagtacttc ccagaggctg gcctggtgtg gggctctggg aaggctgagg 3660ctggagaagc gtaagtagga gggcagagat ggcactcagg tagcttgaat caccaggacc 3720cttccaagcc ccacaggttc tgagggagta ctagggccag ctctgggaga ggtctcttcc 3780tatgctgtga accccctgcc tttcttgcag ccta 3814544198DNAHomo sapiens 54ttactcccct gctggtgcct ccctccttgg cgcgcttccc acctgcgatc ggcgccctct 60tcgcagtcac gaactcgcca gcagctagca gcactgacta gtaggagggc ccgccggagg 120agagccgcgc ggcccacaga agcggaacgc gcgtcgagag cgccctgtcc gctcgcccca 180gacagatgcc cggttattca ttaccgcgag gcctagagga aagagtggct gccgtcttcc 240tgcccacagc ccgccggacc ctccgtcgcg gctgcccggt ccccggagcc gcagccgccg 300agcccggctg tgcgtgtcgt ggctgctggg gagaaagagg cttccggaag ccccagagag 360attggtgagg gtgatttccc aggaagacgc agtgtgctct gacttctgtg acagtgagca 420acgggaccag tggatgtcca gatgctggca atgagacatg ctctggagtc agaagacagc 480gaaaagagaa gcagaagccc cggtggcaag agtctgaagc aggaaggatg actgtagcct 540gtggattgta ctgcagtagg aaactgtcct agcaaggctc cactttgccc cagcttcaag 600ctggaaagga ggagaacatg aaacattgct tgaagacaat ggccgagaca gcaggtccca 660ccctgcacag ccaccagcat ctctcccctc agccctgtct cctcttctgc agttgggatc 720tgcacattta agcctgaaat tgtcctgtga agtgaagtat gatcggacag cctcttttca 780gcttttatga caatggagac agaggaattg tggctcttgc caaggtcaca ggattggaat 840acagagccaa gccaccccag gacatgcaag agcctcagaa gggaaaaaag cccagcagga 900agggagaaca agtagcctct gtcctgaagt tgtaacagcc aggggccagg atggaggagg 960aggaccccat aatctgccca tctgggactt ggcaggggac ctgggaaaat gtaccccaac 1020ccatccctta agggcctttg tctttggccc attggcctag catctacttc ttcaccgtgt 1080ctgttcttgt cacacctagt caggtctgtt tgggtctgag gtgcatggaa cattctgggt 1140aggcctccag caaacggaag ctcttcaccg tgtttccagc ctgggaccaa gggcagcata 1200ctggcaaagt tgccaaagca agggactcca gcctcttagg agttaatgac tccctctccc 1260cagctgtcct ccccttggtg ctcctcttcc tccctcctcc tgctcacagc aggcagggcc 1320tagacccggg agccatgctg ctgtgctgtt gccaggggag cacggaggca gatctgagct 1380atgcagggaa aaggcccagc ctgtcaaagt gtctgagatg aaccgccgcc gtccctgtgc 1440agctgggctc agacgtgtct cagctcttgt tctgtgcctg agaatggcga aacccagtga 1500ggttcaaggg caaactcgct attcattagt caggggttct tgacgtcccg tctctcccag 1560ggatgagttc ccccctcctc tttctccccc tcctatgaca cattcctggg tgcctttggt 1620gaggactgca caccctcctc ctgcctagcc ccctctccaa aggcccctga ataaactccc 1680cccaaggaga ccaggcaggg cagagacaat ggctgcagga aatcattcag gcgggacatg 1740ctggcctgcc ctccacccag tccccctgtg ggccccactc ccttctgatt cagggcaccc 1800ttgggccccc agcctataca ggcctggaca ggaagaaacc actgggaacc accctaagga 1860caacatgcta gtccagtgcc attcttcgct ggctctgtgg gtgcctttgt ggcctgtacc 1920gactggctgg ctaattttgt ggtttctgta ccatcacatg cctattttaa gacactctcc 1980agcactgtcg gttagggagt gtaaattttg caatattttc tgaaatgtgg caatatcaaa 2040atgtaaaagg cacacatact tggtcacaaa caaatggcac tatttactct gtgggcatat 2100ttgtaaaagt tgccaaagaa ttatatacaa ggatgttcat cagagcattt cttttgaaga 2160gtaaagaaat ggacatgaac ctgtggtccg ttcatacggt ggaataccta tgcagctgta 2220aaaatcagtg tggtagatct ccgtatatga gttgatgtgg aaggttggcc agttcacatg 2280ataaggtgaa tagaataagt tacagaacag gctgtagagt atgatcttat ttgtagatgt 2340ttaaaactga gtcataagta tgcttatata cagatcgttt ctggaagtat gtactggaag 2400tctacctctg gggagtgggg atgggggagt gcactcttct atactgttat attttctttt 2460catgctccta aggtactttt attggaagat gtaaagcggt tcaatgtaat aggcttaact 2520tctgtcaact aagttggcgt gggtgcttta agagggtggt agtgatgttg ctggagaaag 2580tatcccacag tcactggtgg cttcagccac gggccatttt ggggcctaat aatcacatat 2640catcatggtt gctagtgtta atcgaaaacc tactaagtgc caggcttact gtctctgggt 2700cttgcttacg tggatgtcat ttttccagtt gcaccaaatc gaaagaggtt aattggtttg 2760ttggagttcc tttgtaggtg aagggcagag ccaggagctt ggctagggac aggggaggtg 2820agtgggggat ggtggatagg tcttggctcc cagtttcctt ctgggcagac attgcccctc 2880tgccctgagg acctgcttgt ttgggggaag aggcctttag aggcaccagg gtcatgccag 2940gtgttggaca tggtgaactg ggaagtgctc ccatctggcc acagcgcaga agtatcaccg 3000tgctggggga tggggaacag ggctgtgaat gggcctattt gcataagcag catgtgtctg 3060gagagaaaga catcacagag cagaagagtg cgggtgccca ggagtgcact tgccacccct 3120acttcatccc tgaaagagta aatggcctgg aaggtgtctc tgagaggtaa tgccgcacac 3180caccctccct gggggcaggg tcaggctaca cctgccttag gtcgggggct gcagcagcct 3240gagagctctc agtagggcct cagtagcctg ggagggagca ggggcagggg gcagggaaag

3300aggcgtaatg gggctgtcca gaggggcctg ggaaacctgg tccctgaggc ctgggcacag 3360ctacaatcac ttcaaattgg ctgtggggcc agtggactgg gaaggaaaaa agcaataaga 3420gtgaccaagt gcagaaggct gtcaggtccc aggtcacatg ccttagtgca gtgactcctc 3480atcattttat ggggtgtggg tgtcgttggt acacccattt tacagatgag gacaccgagg 3540cccagaaaag ttaagttaca tgtcctaagt cacacagctt gtaagtgcca gaactgagat 3600caaaaccaag tctctttgac tttaaagtct gtactctgac cccaaagaga tcctgtttgg 3660ccacttatag gaggtcccta aagctgcaga ctccccttgc cggcacccac atatagagac 3720attaaccctt cccctgcagg gtcacctcaa atagtctttt agctgggctt ctcctgcaat 3780tccacctaat gccatcccct gggttttgcc caaacctgaa ctgggcagtg gggtgagagg 3840aggggtttac agggttacag agcctcatac agataggagc ccatggctgc tggtcatctg 3900cattcctgca ggattggctg ttccttgggg tccttggcag gaaaatgagg attgctccga 3960ggcctgctcc agtacttccc agaggctggc ctggtgtggg gctctgggaa ggctgaggct 4020ggagaagcgt aagtaggagg gcagagatgg cactcaggta gcttgaatca ccaggaccct 4080tccaagcccc acaggttctg agggagtact agggccagct ctgggagagg tctcttccta 4140tgctgtgaac cccctgcctt tcttgcagcc tacaacgaat aaattttctt tgcaaagg 4198551402DNAHomo sapiens 55ctgtctcaag cctccaatca acagatcaga cagcttgtac tcacaggcca aggacacgtg 60gaaagaggct caattttcta gatgggtggc aacagccatg atcttctgtc ctctgggtcc 120ccacaagcct ggatgaactc aagatctgac tcagtggcac agtgaggaga cctttgaggc 180ctcagtgacc atccttggac ttcacctctc acggctttca ggcagagagg ccctcccatg 240cccacaacag gctgagccca gccttcctcg gggtttgctt ccaggcctga cttttactcc 300cctttctaag tgaggcagcc atgactggcc acttcatgtg ctcctggaga agggcttgca 360ccagccgttt tcaggaaagt caagcagctg ttgactcctg agtctgggtg aatttgtgtg 420aagagcataa ggcgctgttt cttaaccaaa acgcttcctc ttgcagtgca gatgggatgt 480gcttctccac aggaggcccc acggcttccc cacccctcag aggagcgccg tgcgtgcgtc 540tgtgtggagg attggcagct cctgcagtcg gcccttggtc ctatttggcg acgcctctgc 600cttcccctta attatacagt catgagccgc cctggaatca cggcagctcc ggatggatcc 660tggatgccag aatgcagcct cagcacgggg ctgcaggaca ggagtgagcg aggggctgca 720gagccggcgg ccgcggtggg caccatggag ggggctgccc tgggcagcac gggcatgagt 780ctcaaggccc aggtttgagt aacaggtgtt gagagcttac ttacttttcc tgagacacag 840tttcctcatc tcgagagcac ggaaaatcat tctaacttca gaggattgtt gtgaaagtta 900aatgagatta aagaggtaaa gcccatgacg tgcttagctc gtgcttggct cttggtcaat 960gccagttagc gctgcatttt ctcccctctc cctccctcct tctctctttc ttttcttcta 1020ttctccattc ctgttttctc ccccacccca ctccccaaag ctctgcgttg agaaccagat 1080gctgtctggt gggttagggc cagaggagga aaagctgccc gccgtgggct gcacccatac 1140cctcttcatt ccaatgacat gaggggaggg gaaaggacag aggtagactg tcctccccta 1200cctcctccta atacaaatgg aattcctgga actggaaaac aaagaatacc cccataaaaa 1260taagacagta cttctggtgc ggtgtaataa aggggaaagt aaccctcaat gtcaggaaac 1320tccgcacctc ccagctcata tttgtgtgga ggaaaagtta aatattaatt tggactcaac 1380tgaatgtgga cacaaacaat gg 140256295DNAHomo sapiens 56tctctcatct gtgttttcag ggcatggact ggaactccca atacccctga catgggctga 60gtcaacgtgg tcatgaacat gtgacaggag gcagcagaag ttgcagagaa gagtgaggca 120cgtttgaaaa aggctgaaaa atgtttctgt ccaggcaagg gtgtgtgctg aatgactcaa 180ggattttttg gtgcattgaa tgaacagcgg gacattggac acctgctgat ccatcacccc 240gggcccgggc aggcccgtgg atgaagagag atggagaaga ccaggcatga gactg 29557374DNAHomo sapiens 57gcagcagaag ttgcagagaa gagtgaggca cgtttgaaaa aggctgaaaa atgtttctgt 60ccaggcaagg gtgtgtgctg aatgactcaa ggattttttg gtgcattgaa tgaacagcgg 120gacattggac acctgctgat ccatcacccc gggcccgggc aggcccgtgg atgaagagag 180atggagaaga ccaggcatga gactgtggag aagccacacc accagaaacc cctgccccat 240gcgccgtcca gcccacacct gtggatgcac gggggattgc aggcagggct cccaccgtgg 300actcaggaac aggcagggaa gctgctgcct caccaggcga aggggccagg agggggaggc 360ggagaggccc gtct 374582737DNAHomo sapiens 58gcctccaatc aacagatcag acagcttgta ctcacaggcc aaggacacgt ggaaagaggc 60tcaattttct agatgggtgg caacagccat gatcttctgt cctctgggtc cccacaagcc 120tggatgaact caagatctga ctcagtggca cagtgaggag acctttgagg cctcagtgac 180catccttgga cttcacctct cacggctttc aggcagagag gccctcccat gcccacaaca 240ggctgagccc agccttcctc ggggtttgct tccaggcctg acttttactc ccctttctaa 300gtgtgcagat gggatgtgct tctccacagg aggccccacg gcttccccac ccctcagagg 360agcgccgtgc gtgcgtctgt gtggaggatt ggcagctcct gcagtcggcc cttggtccta 420tttggcgacg cctctgcctt ccccttaatt atacagtcat gagccgccct ggaatcacgg 480cagctccgga tggatcctgg atgccagaat gcagcctcag cacggggctg caggacagga 540gtgagcgagg ggctgcagag ccggcggccg cggtgggcac catggagggg gctgccctgg 600gcagcacggg catgagtctc aaggcccagg tttgagtaac aggtgttgag agcttactta 660cttttcctga gacacagttt cctcatctcg agagcacgga aaatcattct aacttcagag 720gattgttgtg aaagttaaat gagattaaag aggtaaagcc catgacgtgc ttagctcgtg 780cttggctctt ggtcaatgcc agttagcgct gcattttctc ccctctccct ccctccttct 840ctctttcttt tcttctattc tccattcctg ttttctcccc caccccactc cccaaagctc 900tgcgttgaga accagatgct gtctggtggg ttagggccag aggaggaaaa gctgcccgcc 960gtgggctgca cccataccct cttcattcca atgacatgag gggaggggaa aggacagagg 1020tagactgtcc tcccctacct cctcctaata caaatggaat tcctggaact ggaaaacaaa 1080gaataccccc ataaaaataa gacagtactt ctggtgcggt gtaataaagg ggaaagtaac 1140cctcaatgtc aggaaactcc gcacctccca gctcatattt gtgtggagga aaagttaaat 1200attaatttgg actcaactga atgtggacac aaacaatggt caccaagtcc cggaacaggt 1260tgtgtgagcc tcttcagggg ttcatccagc gctgttttgg agaaatctct atttcaattt 1320attcctatac gttagttact gaaaaacaac agacaatcgc aaaagcaagt tgcccgtttt 1380gtgttccttg agcccaatca tgaagtgccg tcgtgactgg gcctcatgac aaacaacttg 1440taacaagtaa caacagagct caggtcccag accgcactga agctctgtga gacctctcct 1500catctgtgca tgaacgagtg tctgactctg gagcccagcc tgctgcttcc cagtctggtg 1560gtgaatcctc cgtagtctga tggaggtttg ctcttgttgc ccaggctgga gtgcaatggc 1620acaatctcgg ctcactgcag cccctgcctc ccaggctcaa gcaattctta cgcctcagcc 1680tcctgagtag atggaactac agggcatgga ctggaactcc caatacccct gacatgggct 1740gagtcaacgt ggtcatgaac atgtgacagg aggcagcaga agttgcagag aagagtgagg 1800cacgtttgaa aaaggctgaa aaatgtttct gtccaggcaa gggtgtgtgc tgaatgactc 1860aaggattttt tgggcaacac aaaccaacac gagccgtgtg aggatcaggt gacagctgcc 1920caaaagctga cacaaggaac aagcctggag gagtgaggat gggtgctgtg aaggaggttg 1980tgcagctggg cccgcagtcg gacctggtga gatcagagga gggggtgcca ccagtctgtg 2040gacgaagatg agaagctgga atagagcaga aaacaggagg ctgccactct ccatctttcc 2100caaagtcact ccaggagcaa gggtgtcatt tactgaaatg acagactctc catttcacat 2160ttttccccca agtgcagagt gcagggaagc agatgggcta aatttttaga gtcagggtta 2220ttaatgtata ctttacatag taaactttcc ccttttaagt gtgcaggcct gaggtttgcc 2280aaatatgtgt aggcatttaa tcaccaccac gatcaagatg tagaatattc ccactatcaa 2340aaagtttgct gtgtcccttg atggtcatgc cccattccac agccccagcc ccagcccctg 2400gagattgctg tctgctttat gttccagtgg ttttatcttt tccagactgt atggatgtga 2460atggaatcag atgtgattcc aaggtgtttt atcttttcca gatgtgaatg gaatcagatg 2520tacgaaatcc tatggtaggg ggtcttctga gtctagctcc ttttgtttag cgtgatgcat 2580ttgaaattaa tccatgtctc aggcatcagg agttcatttc tttttctgct gagtagtatt 2640tcattgtatg gatgtactgc aatttgccta tccattcacc tgttgatgta catttgagat 2700ttttggcaat tatgaataaa gctgctataa acagaca 27375915706DNAHomo sapiens 59cagatgctaa aattgacacc aaaagcgtag gtgatgagct tgaatcaggt aagaattata 60ttctacttcc agctaaagaa gaggatatag aagatcaata gcttaaacaa aacagaaggt 120cttctcatgg actaaaacag tgacagtgag tgcaactcta cctatcatgt tcatattcta 180ggtagaaact ggggaaaaga gaaaaggcgg gaaaaaggcg tttctctaca aagatcacac 240ataatcttct gtctgcatct cattggtcgg catttagtca catggccata actaccagca 300agggaagctg ggaaatgtag tttttcaatg gcacattgct acccttaata aaataaggat 360atgctaccaa caaagaatga atatgggaaa ggcccctagc aatgtctgct gcatctgcca 420ctgtggtttt ctctgatttc cacagacatt gctcttccac atagagcgca ttctgattcc 480caagccacca ctcccaaatc acacccagtt tctgcatcta gcttgtgtgt gggcagtctg 540ctccatcagt cccagatgtg gcttctcatg gctggtgacc tatgaactaa aagacaggcc 600atttttttcc caacctaccc taccttttct ccttgagagg aaagataaaa taaccacaat 660ttaaaaatgt caatcttcca ttcagaaaag ggaggaaaga gaaacacaga gatcactggt 720ttacagtgat ggacccaccc tgctaggcag gagtgaaaag gcctctccgt ccggcagtgg 780actgggttcc ttggctggcc catgtggtcc ccacccactg tctccaggag catcgccttg 840aactgtgtct tcatggcttt tggctccgcc ttcagaaggg tcctccctgt caatcatcct 900ccattgccac ctctgatgtg ggcactaggg agctggtcct tcccagagga tgcatgattt 960tgacaccgtg cagtgtggga tgttggacca gggaatccag catattttcg ggcttttgta 1020gtctcagact gggcctgaga tttctttaga aatatattaa cggttccttc aacactccaa 1080tgggcatata gcctatttac ttctgatcat tgtaatgtgc aaataaccac agccccattt 1140ctttgtgtcc agatgcagtt tttaggcttg aatattcatc tgttttactc actggcctct 1200gtcactgagt ctgtcccctt agctattaaa ttaatggtga ctatgataac acttttcacc 1260cgatccttgc tagtgcattg acttgcgttg tctgcatcga gttcttaaaa gatgcttggt 1320aaagtcttgg gtcacagttt taacttctgc tggggttcct gctccacagc cctcatttag 1380aactgtttca ctttggctat tgtttagctt tgggccatat ttggctgttt tttagctttg 1440taaaaatgct catttaacgg acacaaggcc tgggttagaa aatgtagtat ttccttcccc 1500ctctgtttgc gtaggagcta gcgctagtgg aagcccatct ctcccttgca ggattttgtg 1560gaaagtgacg aaaataactc aaaccacatc agtgttctgc ttgatttcca ccattcttgg 1620atgctataac ttcagaaagc acatgatcta acttcccaag ggacagcagg gcctcgacca 1680aatgccgttg tgctgagggc tgccggccgc cccgctaggc tgctgcaccc tccccgcgct 1740gccccacacc ttgcacagag ctcaggcagc attctagcac catcagacaa attcttcatg 1800agtcagaaat cgcacctgtc tcttaatagc gatctggcct caggtggctg aaacacacag 1860ggttgaaatg ctgcggtgcc tgcgctcatt gtgcggtgaa gccatagagt ccgcgctgaa 1920acccggggcg cccgcctcct gtccgcgcag gcgctgcacg ctgtcgccgg ggcagcttca 1980ctcagcttca gcctctccat ctgcagagga ggagtaatcc cgtgctgcct catggggccg 2040tttcaagaag gaaatgagat gtggtataaa gtgttagagg agtatttagt gttattatta 2100atgtgttttc tcatctggga acatgatttg cattttaagc agaacttcaa catttggact 2160caaaggagtg atgtggatgg ggagaatttc aggcatcgtg caggctggca ttagaaatgc 2220tcaccggaaa tggagcacag gcagcaactt cagcccctag aattcttcct cctatgtcac 2280ttccgtccct caaacccttt ctgcagcccc tgtcctctct cctgcaatgg cagtgagcag 2340gtggtctcct ggtgtctctg gaaaattccc atatggagtc ctggttgaga tgagccagaa 2400ggctaggaag gagcctacag tcccggtctc agcgcccagg aggtttttct aatggagttg 2460gcatgcagaa gctggggcat accctgggag cagccctgtg gctggatgag ggatggacag 2520cagcactggg cgggtggagc cgggtgcgag gtccccacag cctgcctccc agtcccactg 2580gccagcccgg ggagggtgct gaagacaggc ttcctctctg gtcaagctct ggctagcttg 2640gcaagtgcag gcaggaccct ggacctgcca tatggtgcac attttgagga catcattagc 2700aggtttggaa ggttgtgatt ctggttgtgg gagagaagga acctgggagt tgagcaggtg 2760gtgggtggag ctatgcatgc aactgagtgt gaggagtgct ttggccatgc tagggtctcc 2820tctgggctgg aagccttgtc ttcaagaggg ttggggtagg aggcttttta ttatagtggg 2880aagcctcaag cctggacctg ggaaaacact ttctgggtat gaggaattaa aataaacctt 2940gaagcactat ttaggaagaa catagtatca cttcatagat attttccccc ctaaatggtt 3000acaaagtaaa atggcagagg tacaaagagg aaaatgaaaa ccacccaatc ccttccctga 3060gcgatggccc tgttatcctt gggctctgtg gccatctgac ctctctctgg ggacagaagt 3120acatgaggag aggggccact tcataagacg gagatgctct cttcctgttc cctgcctctg 3180catcgtgctg gggatgtctg tcctcatcaa ctaagagacc catgtagacc gcagctctga 3240gatcttctat ttttgtaaac agaacagttt gtagggagat gtgagatgtt aggtggtgct 3300ggagagcctg ggctttgtct gtttggacaa ggctccatca aggcatcctt tgtccgacgg 3360tgaggcatct tgagggctct ggacccagag ctggtcctgc aagagtcctg tctgagagcc 3420caggccccac tccactagga gggacaggag ggaacagagc agagctgagt cccttcacct 3480acccaaaccc atacatgttt ctagagcaga gtaactgctt gtgaaacaga ccatcagagc 3540acagggcagc accgagctgc gttctgcagg gctcggtcta ggtgatctgg agggcttggg 3600gagctggctt ctcccctcat ccagcatgta gctacccaca gccacctgca tttcacaggg 3660ccagtgccta gggacattgg gccagaagcc agaatttctt ttttcttttt ttttttcccc 3720tctagcattc actactagcg caagctagtg catcccaaag ttttggccct gcgtggataa 3780tcaatccaca atttatcttc cgtctgttgc caaagtaatt tagctaaaat gcagatctca 3840actggtcact tccctgcgta aaagcttcaa tagctttcta tcgcctacag gcaaaagtcc 3900ctcttctgaa aagcggcttg caaagcccta gtagctggct ccatgccccc cagcaatacg 3960tggcctcctc agacgcatct tctagcaaag gagagctgct cagtgacatc cacaaaccga 4020gctgcttctc acctacctgt tttccttctc tttgggatgc cccccgccct caccttctgc 4080ttctggctaa ctcctaggca tccttgaagg cttggctcag tatcctctcc tccaggaagc 4140tgtccctcac cttctctcct ttccctccat cccagtcacc atgcaccaca cccatatccc 4200cattgcatcc tgcagctacc ttgtgcctgc acacgttggg ctgggcatgc atctccttct 4260ctctcaagac ctggatccct tcactttgtg tctctggacc ccccagtgtg ctgataatgg 4320ggttggaacc catcatattt ctcttgaacg agttaatgat gggactgcta tttctctaac 4380tgttgccttg gaggccctgt cacgtgctca tggaagagag ccaggggggt ggaggtgatt 4440cttgttaccc agaggacgtg gggtctggat acacgtttct gccatctgcc atctgccagc 4500ttctttctgg ttggtagctt tggagcctgg tgcagctggg gccagtcccg agcctggact 4560ctgctgggca gtggcaagag cactgtctgg agctctcctg aggagcccac agatccaact 4620ccctaggcca aggctgcagc ctggggcaga gatgcagagg cctggaggag cctagggcac 4680gcggcctgcg ggctggctgg ggttcagagt tcgtatgtgt gtggagtgac tgggcaggtg 4740ttcagaaatg aaggctggca ctgccaggta aggcccttcc ctccctgatg tgagagccct 4800ggagccaccg cagaggccca gtcagatctc tgttctaatt ctggcctggt gtggaggatg 4860aggagagacg gcccagaaag gaaggcagac tgtgcagacc ccatgtcttc tggcccgcga 4920ggccctcctc ctgtgcctgc ttatcttaaa gaatccggga taagaggtga cttgggcctt 4980ggccgggagg cccctcctca gcttcagaca aggagggagc tctgggcatg aggacattga 5040gcaagaggcg atggcagtgc ccacaactta ccctcagctc ggctctgttg ggtccgagaa 5100gttgcatgga aagggctcct tgggggccag ttgtcagtaa gctgcagaag cctggagccg 5160gccaggaaat aaccacgtgt aggagccttc tcagctgaga ggaaggagga ctcacgcgcg 5220gcgagcacat gcttggagcc aggcacaggt ttgaactaag ctatttcatc tcgattctta 5280tgacaagctc cacgtagttt gctcctattt tacagatgtg gaagctgagg ctcagagagg 5340ttaagcgact tgtcccaaat cgcattgtca atcagtgaag gggctgggat ttgagctagt 5400cacctgcctc taggttcagt gtgctttcta ctctgctccc ctccatgcct gcccgaccct 5460ttgctgatga cacattcctg agacctcaaa ggagtcctac tttgaatcat gaatggcctc 5520gcgtttcccc agagggcatg acgcaagctt cgccacctca ctcccaccct caccactggc 5580tggttgctct taggaagatg ctgtttacag gatcacgcag tggttcaggc caccacgatg 5640ccacttcccc tgctcttaac cccccaccaa actgcaggtg gcttccctgg gagactcggg 5700gcaacaccct ctcgtcctgt atgaagcttg tacctttctc cacccaagtg agtgacagct 5760ggcgggagtt ttgcactgtg gaacagggta cacaaagaca ctggagtgag aaggcagggg 5820catggccagt ctatgtctag gaggtggtgg ctccaccttc cttgtggact cagctttgga 5880gtcatgcagg ctgctctctg ggcactcctg tgagccattt cttctgctga tagggaggaa 5940cgtcccactg ccccaagatg ggttctgtgc agagtctccc cagggtcaag acaggagcta 6000gaatgtatgt catagaagga tgatctaaat ggtgacttct ggctggtgag gagaggctat 6060ggcacctcca cggctggtgg cctcttgctg aagtaagcat ggtcagcatc ccccctgcac 6120cctgtggagg tggcttaatg cattcccttg actgcaaagg actccttgcc aaagagacct 6180tcttccccat gaggctgagg cctcctagga ccctcagtgc tcaggaaatt ataaccagcc 6240acccccatct ccattcattg gagaaggagt gacggccgcc tcagtgccat taactctgtg 6300ctgtgattaa atggatccca aggagactcc tgctcaggga caccccctgt aggactgatg 6360ccagggctag gcttgccgca cagtgctcta ttcctttggt tatgcacctt ccttggcaga 6420atccacgctt accaagagga gtcactctga gctgcttgct gccagtcaca tgcttagcag 6480tagaagatat cttgtgcttg tcaggtgact gtgagtgaga gggaaggggg cccagcgtga 6540agccaggtgg gacggcttct gcggggcaag acaccccact ggaggaggca ggggcgcgct 6600gtgcaggcct cagcaccagc tctggctgct gtggtgggat ctcagatcag tcactttgcc 6660tttttggcct cagttttctc atctgtacat catggatttg ggattaaagg atttctgaag 6720attttactca ttctgagatt atggtccctg gaaaccttta gggaaaaggg agcttcttct 6780cttcattatt ttaacatact tgatctttta tgttctttgg tagggagaga tgataggtag 6840ttagagaaga agcagctcag tgaaaaagct gaaagccttg tggaaaaata agttaaattg 6900acccactgtg actccaggga cctggggaga ctttgatgtc cgtgtttttg attacacatc 6960tcttctctca gagtgaagat gcgcagttct aaggaattat gcccaatggc agaattggca 7020agggacaggg aactgtccag cagagaagct gctgaaaccc ttcagggaac attccattcc 7080gccagggccc ggctctcacg ctctgctctg cagcgcatcc ggtgccaagg aggggagtag 7140cgaacgttga ccttgtccct aaggagctta cacgcaggga ggtcagcaca gacactggaa 7200tatctaggac tctgctatcg aaaaacacaa ttgctggcac atgggttcag aagacagaga 7260aggaaaagag ctgtaaagag ttgggtgggc tgaggtttga aatgggcttt aattatgact 7320ggacacaatt tggtgacaaa tgggtgaaat ttcaagcaga agaggtagca tgagcaaaaa 7380ggggtgtgca gtctttgtga agagaagggg gagggagtgg caagggaata ggacagagag 7440gaggaggagg gagaggaagc cagagataag gagccagggg gggaagaaaa gggcagaaac 7500gaatgtgaag gagattctga agacaagcca ctgtggtggt ccaggtggtt ccatggtgcc 7560gcgccaagcc aggggcttgg aatttagtct gggaggatgg tgctgagtat ggatgaggaa 7620ggagaggaat ggagagggga gaaacgggaa ggtacttcac actgattaag caagctcctc 7680acagggctat gacttctccc ttctcagaag gaggtgccct gtggcatgcc tctaggccca 7740gagttaaaga gctggagcca ttaggagcag caaggggctg cctccccact tgtctggtta 7800cttctggtta cttctccctc agggtaagtc ctcatgaggg atcatctctg cccatggagc 7860tgcttccgct gcccttaggc tggtgtaaga ggaaggctgt gtaccagagg tagatcttgc 7920tcctagtcca ccagcaaaac acatccagtt atttctatgc ctcagtctcc cttccccaca 7980ctgatctctt tattccccta ctacctagaa atggaaggga gacaatgagg tgggaaatag 8040agtttttcga aaggtgtttt ttggataaga caaaggcctc tcagagcaga gctggccatt 8100ggaatggttt ccttttgtta tttaataccg gggctttcac acaggagttt agcccagctt 8160ttaagtcttc agtatcaaat agtagttggt gtcacatccc tgctgaaata cagccatgaa 8220aatgtttctt agtgatagat ttaggttgta ctccttaaga aaagcccaaa tgtcaagaat 8280tgcttcccac tggtacactt tattggggag aagggcatct caaatagaag gatggttggt 8340ctgtctagaa atggtaagaa tactacaggt taaaggcagt ggtggtctgg actaaatgac 8400ccctgaaact cggattttat ggtgttttca atctctggct gagtacaggt tcctccctct 8460ccctttgtgc ctccttgggg acctgggcta ttttctcctc cgtgaaagag aatggataca 8520tccattatga aaaccaattg atataatttg agcctgcatg caggtaaaaa ctatattaag 8580aaggtttata aaatctaacc ttctggctta acaagctgat tctcaaagtg gcctctcaga 8640ccctggctga gggatggtgg tcaggggttg cagagggacc cgcacagctg ctgaggaggc 8700tgtgggacag gaggcgctat cactgtctac ctcttccatc catgcacctg tgtgcaggtg 8760tgagaagggc accgtgctgc aaagaagggt cctttccacc ctctgcgggt tgctgccggc 8820tccccgatgc ctgcctctga ggcctgagcc tggggctggg gagtgtgctg gcctcacctc 8880tggaggatac tgcctcagtg ttacagcctg accccccacc ttgtcatcac tgcctactac 8940agaccagagg ggacggccac agagactctg caaccatggg ctctgccctt cttccttctg 9000cctgtgtatc tgtgaaaaac tttttttttt ttaaatggag aaagctacct tgacttctca 9060gagagttgaa tggggtcagg ggatagaatc tatatttttt agttatgggc acttacccat 9120attcaaaaag atttgaggag

gctggcagaa tggagctggg agaagaaaac cctcccctgg 9180gaggaagctg tcctgtgcat gttggccagg ctgcctcttt gattagggac aatggaaacc 9240ggcctgaggg cacgggtgaa agcagttgag tgtagaggag gctctgcagc agaagccaga 9300ggacacagga gccagtgaag acacacaata agtcagaaag gagggatctt tgcagcccca 9360aaagtagaaa ttcttaccat ctactgcaaa gagcaaaagt tgaaaattgg tctgatttct 9420atctaaatgt gcttacatat tcgtgttctg ttaaatactt ctgtaacctg tgtgtctcac 9480ataaatgcag ctttctttag ttttggaaat aaatcacatg aatcctgaat agtagtcttt 9540aataatttgc ttagttgtag ggcagtgttg tgttttcaga aggcaagtgt atttgctaga 9600agagtgagct gggaggtgtg aaccacatcg tcacatctgc tgtaagccta gccgttcata 9660atacggagtt acagttagga cacgtcgccc tgaagagcta ccatcgaatg tgtgctcatc 9720aaatgcctgg cagcgtcctc ggtgcttcac ctgccatagc cgacagtggc tgacctccca 9780tgcctgttgc cttttctttc tgttggatca gggatacact gccatgtgtg ttaagaaaag 9840ctggccttac ctacagggct ggccagtccc ggtcacgttt ctagtaagcc attgccttac 9900ataagggtaa cggcatggga cgctatctta gccaatgtga taaaagtgga catgaggtga 9960gaggcttcag agagaggttt taaaaaagag acaaaagcag gacgttgcct ctcttcctcc 10020tctccacgtg tcctacccgg atgtgaagcc aaaacagatg caggcttagt gcaaccatgg 10080ggaacccagc ataagcacag attcaacagc agaagagtgg cagagggaga aggtgaaagg 10140aacctaggtt ttcctgtcct tgttgagtca ttcagttaaa aatccctgga attttcctct 10200ctccggcagt gtgttttgtg ggataatgag ttgccttatt ggggttggct tgctagtcgg 10260gatgtttcgc tcccatcaac atccatacgc ttgctctgtg aaccaatgac ctgatgaggt 10320agtattagca ccaccatcat tatgctgagg atgagattta tggcacagtg gttcagtagc 10380ttgcccaagg ccatgcggct ggtaggttct ggaggagggc tcagggcacc ccctgagcta 10440cccctgctgg ccattgcacc accccataaa gctgctggca gtcacttctc tgaggggtta 10500gcatgtaaga aatgtcctcc tgaatgctgg ccagacaaat ggaaatctgc cagggttggg 10560tacccccatg acagcagcca gcctgccctc ttagtccctg acagctgcag tgacagcatc 10620tgtgattgca aagcgtgaca atttatatct ctcatttcat cacaccatct atcagcagac 10680agtcaggctt taaaaatcaa tcccacactg actcagtccc cagcagagat ggcctctgac 10740aacagtatcc acactgcagg ctggacaagg gccctattaa ttttgagact cagccaaatt 10800tccttctgac cctaagctgg tgaatccctg ctcctttgct ttggttgggg ttggtgtgag 10860ctaaggctgt gatcccattt gctcctatgg cctccaggtg gcctgggcct ccatgaatgg 10920gccacatggt catactgaat gcttgattac actcagacct agcagtcgtc tgggcgcagc 10980tggtttatgg atcactttgt cacaatgttc catccttcca ggtccccatc cccgcggtgg 11040gaaaacattg ctttaggcag tgctagagga cttcagcagg cattggcagc ttctggattc 11100aggattagaa caaagaagga ggagtcacag caaagatagg aacagaaggc agagagaaca 11160gacagatggg ggtgtttgag aaggagggcc tttgagacct cagggagtgg gagacactgg 11220ctcgagaata ataataatgg caatttctct catctgtgtt ttcagggcat ggactggaac 11280tcccaatacc cctgacatgg gctgagtcaa cgtggtcatg aacatgtgac aggaggcagc 11340agaagttgca gagaagagtg aggcacgttt gaaaaaggct gaaaaatgtt tctgtccagg 11400caagggtgtg tgctgaatga ctcaaggatt ttttggagag aattggagtg tctcaccaga 11460ggagaccacg tctgaagggc tttgcatccc tccttggaca tgtctaatac ctaacactca 11520gaaagcatcc agtaaatatt cgtggaaaga aaggagtgga gaaggggaga aaggggaaag 11580ggagtaggcg agagagaaga aagactctgc ttcttgccca gggcctggca tggggcggag 11640gcaaagcagt ggggtcctca gctatgtccc actgtgagtg cacagcgagt cctgaccttc 11700agagggtgca gcccgagggg ccctggcctg tctgaagggt gcgccagccg agtggcctgc 11760tctgaccacc aggctcaccc atgactacct gggtggctac agccagttcc tgacaatgag 11820tacagcactc agttatcggg gcccttccac ccacacgctg tccacttcct ggggtactgc 11880tgtgggcatg tgagtgcttg ctccccgggg cactgctgtg ggcatgcgag tgcttgctcc 11940ccggggcact gctgtccact tcctggggta ctgctgtggg catgcgagtg cttgctcccc 12000ggggcactgc tgtggacatg tgatagcttg ctccccagct ccactagtga cactggcggc 12060ccctcgctgg ggccttcccc gcctgctccg ctccattacc gctgccgggc tcctcacgtc 12120tctccttgct gcttcctgca ctggggtgag gagagtgggg ctggtcccct tgagaccgga 12180gaagctccag gcttttaagg aaaactgcca gggacgaaga gaagatatca cttccccacg 12240tggttggctt ccagattcag aaggaatgtc tgtccttgtg gattccgtac cagatgaccc 12300cagatgctgc ctcagtacta ggtccctgtg gctctggagc ctttgctggg tctgggcagt 12360gtctcttcct ctccagttca tccttgggtc tcttcaccct tgccaggggc aggcttcctg 12420gtgagaggtc gacctcctgc atgaaggctc tcaagaggcc agttcaaagc caagctccgg 12480gtctgtgcct gtggggctgc tcctcgatca ggagatggtc actcccctcc tggtctgtat 12540ctgtgggatt ctcctccatc aggagatggt ctctcccctc ctggtctata cccgtgggat 12600tctcctccat caggagatgg tcactcccca tcctggtcta tacccgtggg attctcctcc 12660attagatggt cactcccctc ctggtctatt cccgtggggc tgctcctcca tcaggaggtg 12720gtcactcccc ctcctggtct atacccgtgg ggctgctcct ccatcaggag atagtcactc 12780cccctcctgg tctatacccg tgggattctc ctccatctgg agatggtcac tcccctcctg 12840gtctataccc atgggattct cctccatctg gagatggtca ctcccctcct ggtctatacc 12900catgggattc tcctccatct ggagatagtc actcccctcc tggtctatac ccgtggggtt 12960ctcctcaatc aggaggtggt cactcccctc ctggtctata cccgtgggat tctcctccat 13020caggagatgg tcactctccc tcctggtcta tacccgtggg gctgctcctc catcaggaga 13080tggtcactcc cctcctggtc tatacctgtg ggattctcct ccctcagaag atggtcactc 13140cccctcctgg tctataccca tgggattctc ctccctcaga agatggtcac tccccctcct 13200ggtctatacc cgtggggctg ctcctccatc aggagatggt cactctccct ctcggttgct 13260cagtccaaaa acaacctctc tggaaaactg cgtggaattt ttttttaaag aattgaaact 13320agaactagca tttgatccag ccatctgcct actgggaata cacccaaaga aaaataaatc 13380attatatcag aaagatagaa tatgcatgtg gatgttcatt gcagcaccat ttactatagc 13440aaagatgggc agttgagcta agtgtccaac agtggtagac tggataaaga gaatgtgtta 13500cacacacagc atggaatatt actcaggcat agcaaagaat gaaatcatgc cttttgcagc 13560aacatggttg gaggtggagg tcaggagtta gagaccagcc tggccaacat ggtgaaatcg 13620tgtctctact aaaaatacaa aaattagccg ggcatggggg tgcacacctg tagtcccagc 13680tactctggag gctgaggcag gagaatcgct tgaacccagg aggtggagat ggcagtgagc 13740tgagatcaca ccactgcact ccagcctggg caacagagtg aaatcctgtc tcaaaaacaa 13800aaataaaaac aaaaaaagca tacaaaccac aggagctcct cttggtcccc ctttgtcttt 13860cattccacct ccagaaatcc cagcagaatc accttcaaaa actcctagaa tccaattttt 13920cccctccatt gctactgccc tgatctgagc ctccataacc cttacccaaa tgcttcctaa 13980acgtatcctg gctggtgctg ctgaattcca tgtctttcca gctgcccttt aaaatacggt 14040aggaggcaag tcttttctca aaaccctcca gtggcttctc tctcagagtt aagatcctgc 14100agtggccttc ctggcctcag gtagtgtctg ctgtcctgta ccctcggcca ctatactcca 14160gccacatggc tttgtgtttc ccttggacat atccagcatg tttctgcccc acggttttgg 14220cacttgctgt cctttctgcc tggagctcct tctcctccct ctgcactgaa gaccctccct 14280tcctttcagg atggcagaga cattatgctg tcataaccac accccatatt cacccttaca 14340cgatgtgtcc ctctctggcc agctaggggc tcagctccat gagccccttg tcctggcaac 14400aaagctggct ggggcggcca cctgaagtat gtctcatgga gctgactcaa tgagagacac 14460agttcattcc atgcacagtc cacgccacag taagtcacgt ggccagcgct gacttcccct 14520gcacaggaag aacctgcacc caccacccgc ggagaaggat ctagagctgg gatgactgag 14580caggatgcta acaacctcaa agttcttctt agacctcatg tcttgaacag ccctaggcaa 14640catagcaaca cacgccatga caaccccaca agaaggcaac ccgtcctctg acagcttctg 14700gtgacaaagc caccccgctt gtgacaacct caggtcacac agcagctcct cccctgacaa 14760cctaaggtca cacaacaact tctcctttta aagtctcagg tgacacagca gactctcccc 14820tgacaaactc aggtcacaca gtaacccttc agctgacctc aggtgacaca gccacccctc 14880ccctgacaac ctcaggtcac acagccaccc ctcccctgac aacctcaggt cacacagcca 14940cccctcccct gacaacctca ggtcacacag ctacccttca actgacaacc tcaggtcaca 15000cagccacccc tcccctgaca acctcaggtc acacagccac ccctcccctg acaacctcag 15060gtcacacagc cacccctctc ctgacaactt caggtcacac agccacccct cccctgacaa 15120cctcaggtca cacagccacc cctctcctga caacttcagg tcacacagcc acccctcccc 15180tgacaacctc aggccacaca gccacccctc acctgacaat ctcaggtgac acagccaccc 15240ctcccctcac aacctcaggt cacacagcca cccctcctca gacaacttct gacatagcaa 15300ctccttgcct gacaacccta ggtaacatag caaccctccc ttgacaaccc atgtgacatg 15360gcaatgcttc tcctgacagc cacatgtcag caacctctgc ctgacaaccc aggtgacata 15420acagcacccc ccgacaaccg catgttacct tgccaccctc ccataccgac tgtatgtggg 15480tatcccctcc taccccgcct tgggagcccc atgtgaggta gccagccttt ccctggccct 15540gggccctcca tttctgcttg ctgtctcctc tgttcctccc aagaactcac tgctctaccg 15600tgtaatctct tgtttctctg ctgtcttagt ccgcttgggc tgccggagga gcacaccttg 15660ggcagggagg cttagatgca cctgtgcatg gttctggagg cccagg 1570660744DNAHomo sapiens 60aagcgtaggt gatgagcttg aatcagggca tggactggaa ctcccaatac ccctgacatg 60ggctgagtca acgtggtcat gaacatgtga caggaggcag cagaagttgc agagaagagt 120gaggcacgtt tgaaaaaggc tgaaaaatgt ttctgtccag gcaagggtgt gtgctgaatg 180actcaaggat tttttggtgc attgaatgaa cagcgggaca ttggacacct gctgatccat 240caccccgggc ccgggcaggc ccgtggatga agagagatgg agaagaccag gcatgagact 300gtggagaagc cacaccacca gaaacccctg ccccatgcgc cgtccagccc acacctgtgg 360atgcacgggg gattgcaggc agggctccca ccgtggactc aggaacaggc agggaagctg 420ctgcctcacc aggcgaaggg gccaggaggg ggaggcggag aggcccgtct agcccctgcg 480gctgtcaccg tggtgcctcc tcactggcca gtgcggtcgc gcctcagctt cgttaatagg 540ggagggggcc taagagtttt cacgtccagg ctcgggcagt ggggaggcag gcaggagtgg 600ccgctggttt ttcagacctc ccagggaggc cgaggaaatg gcccgtcctg gagtgggcgt 660ggttctgtct tcagatggat gctggagggt tgggctgcgt gggaccctgg gccctgctgc 720ttcccggagg atgcgctgtc cggg 744611129DNAHomo sapiens 61aagcgtaggt gatgagcttg aatcaggtaa gaattatatt ctacttccag ctaaagaaga 60ggatatagaa gatcaatagc ttaaacaaaa cagaagggca tggactggaa ctcccaatac 120ccctgacatg ggctgagtca acgtggtcat gaacatgtga caggaggcag cagaagttgc 180agagaagagt gaggcacgtt tgaaaaaggc tgaaaaatgt ttctgtccag gcaagggtgt 240gtgctgaatg actcaaggat tttttgggca acacaaacca acacgagccg tgtgaggatc 300aggtgacagc tgcccaaaag ctgacacaag gaacaagcct ggaggagtga ggatgggtgc 360tgtgaaggag gttgtgcagc tgggcccgca gtcggacctg gtgagatcag aggagggggt 420gccaccagtc tgtggacgaa gatgagaagc tggaatagag cagaaaacag gaggctgcca 480ctctccatct ttcccaaagt cactccagga gcaagggtgt catttactga aatgacagac 540tctccatttc acatttttcc cccaagtgca gagtgcaggg aagcagatgg gctaaatttt 600tagagtcagg gttattaatg tatactttac atagtaaact ttcccctttt aagtgtgcag 660gcctgaggtt tgccaaatat gtgtaggcat ttaatcacca ccacgatcaa gatgtagaat 720attcccacta tcaaaaagtt tgctgtgtcc cttgatggtc atgccccatt ccacagcccc 780agccccagcc cctggagatt gctgtctgct ttatgttcca gtggttttat cttttccaga 840ctgtatggat gtgaatggaa tcagatgtga ttccaaggtg ttttatcttt tccagatgtg 900aatggaatca gatgtacgaa atcctatggt agggggtctt ctgagtctag ctccttttgt 960ttagcgtgat gcatttgaaa ttaatccatg tctcaggcat caggagttca tttctttttc 1020tgctgagtag tatttcattg tatggatgta ctgcaatttg cctatccatt cacctgttga 1080tgtacatttg agatttttgg caattatgaa taaagctgct ataaacaga 1129621843DNAHomo sapiens 62atttagctaa aatgcagatc tcaactggtc acttccctgc gtaaaagctt caatagcttt 60ctatcgccta caggcaaaag tccctcttct gaaaagcggc ttgcaaagcc ctagtagctg 120gctccatgcc ccccagcaat acgtggcctc ctcagacgca tcttctagca aaggagagct 180gctcagtgac atccacaaac cgagctgctt ctcacctacc tgttttcctt ctctttggga 240tgccccccgc cctcaccttc tgcttctggc taactcctag gcatccttga aggcttggct 300cagtatcctc tcctccagga agctgtccct caccttctct cctttccctc catcccagtc 360accatgcacc acacccatat ccccattgca tcctgcagct accttgtgcc tgcacacgtt 420gggctgggca tgcatctcct tctctctcaa gacctggatc ccttcacttt gtgtctctgg 480accccccagt gtgctgataa tggggttgga acccatcata tttctcttga acgagttaat 540gatgggactg ctatttctct aactgttgcc ttggaggccc tgtcacgtgc tcatggaaga 600gagccagggg ggtggaggtg attcttgtta cccagaggac gtggggtctg gatacacgtt 660tctgccatct gccatctgcc agcttctttc tggttggtag ctttggagcc tggtgcagct 720ggggccagtc ccgagcctgg actctgctgg gcagtggcaa gagcactgtc tggagctctc 780ctgaggagcc cacagatcca actccctagg ccaaggctgc agcctggggc agagatgcag 840aggcctggag gagcctaggg cacgcggcct gcgggctggc tggggttcag agttcgtatg 900tgtgtggagt gactgggcag gtgttcagaa atgaaggctg gcactgccag gtaaggccct 960tccctccctg atgtgagagc cctggagcca ccgcagaggc ccagtcagat ctctgttcta 1020attctggcct ggtgtggagg atgaggagag acggcccaga aaggaaggca gactgtgcag 1080accccatgtc ttctggcccg cgaggccctc ctcctgtgcc tgcttatctt aaagaatccg 1140ggataagagg tgacttgggc cttggccggg aggcccctcc tcagcttcag acaaggaggg 1200agctctgggc atgaggacat tgagcaagag gcgatggcag tgcccacaac ttaccctcag 1260ctcggctctg ttgggtccga gaagttgcat ggaaagggct ccttgggggc cagttgtcag 1320taagctgcag aagcctggag ccggccagga aataaccacg tgtaggagcc ttctcagctg 1380agaggaagga ggactcacgc gcggcgagca catgcttgga gccaggcaca gggcatggac 1440tggaactccc aatacccctg acatgggctg agtcaacgtg gtcatgaaca tgtgacagga 1500ggcagcagaa gttgcagaga agagtgaggc acgtttgaaa aaggctgaaa aatgtttctg 1560tccaggcaag ggtgtgtgct gaatgactca aggatttttt ggtgcattga atgaacagcg 1620ggacattgga cacctgctga tccatcaccc cgggcccggg caggcccgtg gatgaagaga 1680gatggagaag accaggcatg agactgtgga gaagccacac caccagaaac ccctgcccca 1740tgcgccgtcc agcccacacc tgtggatgca cgggggattg caggcagggc tcccaccgtg 1800gactcaggaa caggcaggga agctgctgcc tcaccaggcg aag 1843631160DNAHomo sapiens 63gagcactgtc tggagctctc ctgaggagcc cacagatcca actccctagg ccaaggctgc 60agcctggggc agagatgcag aggcctggag gagcctaggg cacgcggcct gcgggctggc 120tggggttcag agttcgtatg tgtgtggagt gactgggcag gtgttcagaa atgaaggctg 180gcactgccag gtaaggccct tccctccctg atgtgagagc cctggagcca ccgcagaggc 240ccagtcagat ctctgttcta attctggcct ggtgtggagg atgaggagag acggcccaga 300aaggaaggca gactgtgcag accccatgtc ttctggcccg cgaggccctc ctcctgtgcc 360tgcttatctt aaagaatccg ggataagagg tgacttgggc cttggccggg aggcccctcc 420tcagcttcag acaaggaggg agctctgggc atgaggacat tgagcaagag gcgatggcag 480tgcccacaac ttaccctcag ctcggctctg ttgggtccga gaagttgcat ggaaagggct 540ccttgggggc cagttgtcag taagctgcag aagcctggag ccggccagga aataaccacg 600tgtaggagcc ttctcagctg agaggaagga ggactcacgc gcggcgagca catgcttgga 660gccaggcaca gggcatggac tggaactccc aatacccctg acatgggctg agtcaacgtg 720gtcatgaaca tgtgacagga ggcagcagaa gttgcagaga agagtgaggc acgtttgaaa 780aaggctgaaa aatgtttctg tccaggcaag ggtgtgtgct gaatgactca aggatttttt 840gggcaacaca aaccaacacg agccgtgtga ggatcaggtg acagctgccc aaaagctgac 900acaaggaaca agcctggagg agtgaggatg ggtgctgtga aggaggttgt gcagctgggc 960ccgcagtcgg acctggtgag atcagaggag ggggtgccac cagtctgtgg acgaagatga 1020gaagctggaa tagagcagaa aacaggaggc tgccactctc catctttccc aaagtcactc 1080caggagcaag ggtgtcattt actgaaatga cagactctcc atttcacatt tttcccccaa 1140gtgcagagtg cagggaagca 116064572DNAHomo sapiens 64cgctctgctc tgcagcgcat ccggtgccaa ggaggggagt agcgaacgtt gaccttgtcc 60ctaaggagct tacacgcagg gagggcatgg actggaactc ccaatacccc tgacatgggc 120tgagtcaacg tggtcatgaa catgtgacag gaggcagcag aagttgcaga gaagagtgag 180gcacgtttga aaaaggctga aaaatgtttc tgtccaggca agggtgtgtg ctgaatgact 240caaggatttt ttgggcaaca caaaccaaca cgagccgtgt gaggatcagg tgacagctgc 300ccaaaagctg acacaaggaa caagcctgga ggagtgagga tgggtgctgt gaaggaggtt 360gtgcagctgg gcccgcagtc ggacctggtg agatcagagg agggggtgcc accagtctgt 420ggacgaagat gagaagctgg aatagagcag aaaacaggag gctgccactc tccatctttc 480ccaaagtcac tccaggagca agggtgtcat ttactgaaat gacagactct ccatttcaca 540tttttccccc aagtgcagag tgcagggaag ca 572654435DNAHomo sapiens 65caaatgcctg gcagcgtcct cggtgcttca cctgccatag ccgacagtgg ctgacctccc 60atgcctgttg ccttttcttt ctgttggatc agggatacac tgccatgtgt gttaagaaaa 120gctggcctta cctacagggc tggccagtcc cggtcacgtt tctagtaagc cattgcctta 180cataagggta acggcatggg acgctatctt agccaatgtg ataaaagtgg acatgaggtg 240agaggcttca gagagaggtt ttaaaaaaga gacaaaagca ggacgttgcc tctcttcctc 300ctctccacgt gtcctacccg gatgtgaagc caaaacagat gcaggcttag tgcaaccatg 360gggaacccag cataagcaca gattcaacag cagaagagtg gcagagggag aaggtgaaag 420gaacctaggt tttcctgtcc ttgttgagtc attcagttaa aaatccctgg aattttcctc 480tctccggcag tgtgttttgt gggataatga gttgccttat tggggttggc ttgctagtcg 540ggatgtttcg ctcccatcaa catccatacg cttgctctgt gaaccaatga cctgatgagg 600tagtattagc accaccatca ttatgctgag gatgagattt atggcacagt ggttcagtag 660cttgcccaag gccatgcggc tggtaggttc tggaggaggg ctcagggcac cccctgagct 720acccctgctg gccattgcac caccccataa agctgctggc agtcacttct ctgaggggtt 780agcatgtaag aaatgtcctc ctgaatgctg gccagacaaa tggaaatctg ccagggttgg 840gtacccccat gacagcagcc agcctgccct cttagtccct gacagctgca gtgacagcat 900ctgtgattgc aaagcgtgac aatttatatc tctcatttca tcacaccatc tatcagcaga 960cagtcaggct ttaaaaatca atcccacact gactcagtcc ccagcagaga tggcctctga 1020caacagtatc cacactgcag gctggacaag ggccctatta attttgagac tcagccaaat 1080ttccttctga ccctaagctg gtgaatccct gctcctttgc tttggttggg gttggtgtga 1140gctaaggctg tgatcccatt tgctcctatg gcctccaggt ggcctgggcc tccatgaatg 1200ggccacatgg tcatactgaa tgcttgatta cactcagacc tagcagtcgt ctgggcgcag 1260ctggtttatg gatcactttg tcacaatgtt ccatccttcc aggtccccat ccccgcggtg 1320ggaaaacatt gctttaggca gtgctagagg acttcagcag gcattggcag cttctggatt 1380caggattaga acaaagaagg aggagtcaca gcaaagatag gaacagaagg cagagagaac 1440agacagatgg gggtgtttga gaaggagggc ctttgagacc tcagggagtg ggagacactg 1500gctcgagaat aataataatg gcaatttctc tcatctgtgt tttcagggca tggactggaa 1560ctcccaatac ccctgacatg ggctgagtca acgtggtcat gaacatgtga caggaggcag 1620cagaagttgc agagaagagt gaggcacgtt tgaaaaaggc tgaaaaatgt ttctgtccag 1680gcaagggtgt gtgctgaatg actcaaggat tttttggtgc attgaatgaa cagcgggaca 1740ttggacacct gctgatccat caccccgggc ccgggcaggc ccgtggatga agagagatgg 1800agaagaccag gcatgagact gtggagaagc cacaccacca gaaacccctg ccccatgcgc 1860cgtccagccc acacctgtgg atgcacgggg gattgcaggc agggctccca ccgtggactc 1920aggaacaggc agggaagctg ctgcctcacc aggcgaaggg gccaggaggg ggaggcggag 1980aggcccgtct agcccctgcg gctgtcaccg tggtgcctcc tcactggcca gtgcggtcgc 2040gcctcagctt cgttaatagg ggagggggcc taagagtttt cacgtccagg ctcgggcagt 2100ggggaggcag gcaggagtgg ccgctggttt ttcagacctc ccagggaggc cgaggaaatg 2160gcccgtcctg gagtgggcgt ggttctgtct tcagatggat gctggagggt tgggctgcgt 2220gggaccctgg gccctgctgc ttcccggagg atgcgctgtc cggggctgca caggttggct 2280gtgttttttg gatgcttgat attttgtttt ttcttctctt cactctgtca tgaaactggc 2340aatagtagtt tgtaaataaa tatgtgttat agatgaatat ttgctatgag taaattaata 2400aaggagtgaa taaatgagcg attgatgtag ggcctgtcct gtctcaggga gccccacgaa 2460ggcctgcgcg ccggccagag cctgcctgcc tgccagggta ctgggacgtc actctcaaag 2520cggcgggacc cagccgctga tcttgctgag gaggcccggt ctcagaaaac tgagcggctg 2580cttctgcaga ccctgcatcc tcccctccct ggagaaagaa gctctggctg agtcctggga 2640ccgaaccctt gggtgccaca gaaacgggct ttgctgcctg tcagtcaagc ggcgggagaa 2700acagacctgg ggaggaggag gctgggaggg ctgtgttttc tgcacagcga gtagctcctt 2760agcctggtgc catttctctc

caaacaccct gaaggttgag tccagggtga agatgtagag 2820gcaagttttg gggggatgga gtgggcttgg agggatgctg gcgccttagc aggctgtgct 2880cctgaggtgc ccagtgtctg cgggcacagg aacatgttgc cgagggcatt tgggtgtggg 2940tggggtgggg aaagggagac agggctgtct cttttaatgg gtatctgcga gcatgtgatt 3000gtaagagagg aagaagtagg ggaggaagaa ggcctccttg ggaggtgcgt catcctgagg 3060aaggctgaac aatgagggtc ttggagagtc aattcagaag cacaaccttg cagagcaggc 3120aaaaacaata gggcttcttg aggctgcccg ggcactcatg caatcaccat ttcctgctgt 3180gaatgagcct acattttgtt ggggaagaga cgcaacgacg ccaaacgatg gactctgagt 3240caacgataag atgaaacaaa attaaaacaa agtaggaaat caagagtggc tgctgtgatg 3300gcgttgcgga gatgatgttt gctttgagaa ctggacaagt gagcccctga gctgcatctg 3360cacccagagg ctgagccggt gcacaggact tgcagaggga tgggcctggg cttgtagagc 3420agcacaacgg ccccaggcct ggaggagcaa gggtgggaag gggggcaggc cagctcctgc 3480caggctggag aaggactcgg acctcaggcc acctgtgcct gggtgattgt gaacttgtaa 3540caaatgtgat cttatttatg ttttgaaaaa ggcaacacaa accaacacga gccgtgtgag 3600gatcaggtga cagctgccca aaagctgaca caaggaacaa gcctggagga gtgaggatgg 3660gtgctgtgaa ggaggttgtg cagctgggcc cgcagtcgga cctggtgaga tcagaggagg 3720gggtgccacc agtctgtgga cgaagatgag aagctggaat agagcagaaa acaggaggct 3780gccactctcc atctttccca aagtcactcc aggagcaagg gtgtcattta ctgaaatgac 3840agactctcca tttcacattt ttcccccaag tgcagagtgc agggaagcag atgggctaaa 3900tttttagagt cagggttatt aatgtatact ttacatagta aactttcccc ttttaagtgt 3960gcaggcctga ggtttgccaa atatgtgtag gcatttaatc accaccacga tcaagatgta 4020gaatattccc actatcaaaa agtttgctgt gtcccttgat ggtcatgccc cattccacag 4080ccccagcccc agcccctgga gattgctgtc tgctttatgt tccagtggtt ttatcttttc 4140cagactgtat ggatgtgaat ggaatcagat gtgattccaa ggtgttttat cttttccaga 4200tgtgaatgga atcagatgta cgaaatccta tggtaggggg tcttctgagt ctagctcctt 4260ttgtttagcg tgatgcattt gaaattaatc catgtctcag gcatcaggag ttcatttctt 4320tttctgctga gtagtatttc attgtatgga tgtactgcaa tttgcctatc cattcacctg 4380ttgatgtaca tttgagattt ttggcaatta tgaataaagc tgctataaac agaca 443566374DNAHomo sapiens 66cttctgtcct ctgggtcccc acaagcctgg atgaactcaa gatctgactc agtggcacag 60tgaggagacc tttgaggcct cagtgaccat ccttggactt cacctctcac ggctttcagg 120cagagaggcc ctcccatgcc cacaacaggc tgagcccagc cttcctcggg gtttgcttcc 180aggcctgact tttactcccc tttctaagtg tgctcccggg aatgctgtct acttgttgcg 240attttactcc cgtggcctgt gctagctgcc tgcttggccg ttgggactga agggatgctc 300atccacttgg cacactgact gcaagcctgg caccggcctt gcctttgttc tcccatgagt 360cctcttgaag gcaa 374671245DNAHomo sapiens 67aaatgtcctc ctgaatgctg gccagacaaa tggaaatctg ccagggttgg gtacccccat 60gacagcagcc agcctgccct cttagtccct gacagctgca gtgacagcat ctgtgattgc 120aaagcgtgac aatttatatc tctcatttca tcacaccatc tatcagcaga cagtcaggct 180ttaaaaatca atcccacact gactcagtcc ccagcagaga tggcctctga caacagtatc 240cacactgcag gctggacaag ggccctatta attttgagac tcagccaaat ttccttctga 300ccctaagctg gtgaatccct gctcctttgc tttggttggg gttggtgtga gctaaggctg 360tgatcccatt tgctcctatg gcctccaggt ggcctgggcc tccatgaatg ggccacatgg 420tcatactgaa tgcttgatta cactcagacc tagcagtcgt ctgggcgcag ctggtttatg 480gatcactttg tcacaatgtt ccatccttcc aggtccccat ccccgcggtg ggaaaacatt 540gctttaggca gtgctagagg acttcagcag gcattggcag cttctggatt caggattaga 600acaaagaagg aggagtcaca gcaaagatag gaacagaagg cagagagaac agacagatgg 660gggtgtttga gaaggagggc ctttgagacc tcagggagtg ggagacactg gctcgagaat 720aataataatg gcaatttctc tcatctgtgt tttcagggca tggactggaa ctcccaatac 780ccctgacatg ggctgagtca acgtggtcat gaacatgtga caggaggcag cagaagttgc 840agagaagagt gaggcacgtt tgaaaaaggc tgaaaaatgt ttctgtccag gcaagggtgt 900gtgctgaatg actcaaggat tttttgggca acacaaacca acacgagccg tgtgaggatc 960aggtgacagc tgcccaaaag ctgacacaag gaacaagcct ggaggagtga ggatgggtgc 1020tgtgaaggag gttgtgcagc tgggcccgca gtcggacctg gtgagatcag aggagggggt 1080gccaccagtc tgtggacgaa gatgagaagc tggaatagag cagaaaacag gaggctgcca 1140ctctccatct ttcccaaagt cactccagga gcaagggtgt catttactga aatgacagac 1200tctccatttc acatttttcc cccaagtgca gagtgcaggg aagca 124568537DNAHomo sapiens 68aatgcttgat tacactcaga cctagcagtc gtctgggcgc agctggttta tggatcactt 60tgtcacaatg ttccatcctt ccaggtcccc atccccgcgg tgggaaaaca ttgctttagg 120cagtgctaga ggacttcagc aggcattggc agcttctgga ttcaggatta gaacaaagaa 180ggaggagtca cagcaaagat aggaacagaa ggcagagaga acagacagat gggggtgttt 240gagaaggagg gcctttgaga cctcagggag tgggagacac tggctcgaga ataataataa 300tggcaatttc tctcatctgt gttttcaggg catggactgg aactcccaat acccctgaca 360tgggctgagt caacgtggtc atgaacatgt gacaggaggc agcagaagtt gcagagaaga 420gtgaggcacg tttgaaaaag gctgaaaaat gtttctgtcc aggcaagggt gtgtgctgaa 480tgactcaagg attttttggc ctctgcctgt gtcctggccc tcactgcacc cccaaga 537691863DNAHomo sapiens 69cttcctcggg gtttgcttcc aggcctgact tttactcccc tttctaagtg tgcagatggg 60atgtgcttct ccacaggagg ccccacggct tccccacccc tcagaggagc gccgtgcgtg 120cgtctgtgtg gaggattggc agctcctgca gtcggccctt ggtcctattt ggcgacgcct 180ctgccttccc cttaattata cagtcatgag ccgccctgga atcacggcag ctccggatgg 240atcctggatg ccagaatgca gcctcagcac ggggctgcag gacaggagtg agcgaggggc 300tgcagagccg gcggccgcgg tgggcaccat ggagggggct gccctgggca gcacgggcat 360gagtctcaag gcccaggttt gagtaacagg tgttgagagc ttacttactt ttcctgagac 420acagtttcct catctcgaga gcacggaaaa tcattctaac ttcagaggat tgttgtgaaa 480gttaaatgag attaaagagg taaagcccat gacgtgctta gctcgtgctt ggctcttggt 540caatgccagt tagcgctgca ttttctcccc tctccctccc tccttctctc tttcttttct 600tctattctcc attcctgttt tctcccccac cccactcccc aaagctctgc gttgagaacc 660agatgctgtc tggtgggtta gggccagagg aggaaaagct gcccgccgtg ggctgcaccc 720ataccctctt cattccaatg acatgagggg aggggaaagg acagaggtag actgtcctcc 780cctacctcct cctaatacaa atggaattcc tggaactgga aaacaaagaa tacccccata 840aaaataagac agtacttctg gtgcggtgta ataaagggga aagtaaccct caatgtcagg 900aaactccgca cctcccagct catatttgtg tggaggaaaa gttaaatatt aatttggact 960caactgaatg tggacacaaa caatggtcac caagtcccgg aacaggttgt gtgagcctct 1020tcaggggttc atccagcgct gttttggaga aatctctatt tcaatttatt cctatacgtt 1080agttactgaa aaacaacaga caatcgcaaa agcaagttgc ccgttttgtg ttccttgagc 1140ccaatcatga agtgccgtcg tgactgggcc tcatgacaaa caacttgtaa caagtaacaa 1200cagagctcag gtcccagacc gcactgaagc tctgtgagac ctctcctcat ctgtgcatga 1260acgagtgtct gactctggag cccagcctgc tgcttcccag tctggtggtg aatcctccgt 1320agtctgatgg aggtttgctc ttgttgccca ggctggagtg caatggcaca atctcggctc 1380actgcagccc ctgcctccca ggctcaagca attcttacgc ctcagcctcc tgagtagatg 1440gaactacagg gcatggactg gaactcccaa tacccctgac atgggctgag tcaacgtggt 1500catgaacatg tgacaggagg cagcagaagt tgcagagaag agtgaggcac gtttgaaaaa 1560ggctgaaaaa tgtttctgtc caggcaaggg tgtgtgctga atgactcaag gattttttgg 1620gcaacacaaa ccaacacgag ccgtgtgagg atcaggtgac agctgcccaa aagctgacac 1680aaggaacaag cctggaggag tgaggatggg tgctgtgaag gaggttgtgc agctgggccc 1740gcagtcggac ctggtgagat cagaggaggg ggtgccacca gtctgtggac gaagatgaga 1800agctggaata gagcagaaaa caggaggctg ccactctcca tctttcccaa agtcactcca 1860gga 1863703679DNAHomo sapiens 70gaggcagcca tgactggcca cttcatgtgc tcctggagaa gggcttgcac cagccgtttt 60caggaaagtc aagcagctgt tgactcctga gtctgggtga atttgtgtga agagcataag 120gcgctgtttc ttaaccaaaa cgcttcctct tgcagtgcag atgggatgtg cttctccaca 180ggaggcccca cggcttcccc acccctcaga ggagcgccgt gcgtgcgtct gtgtggagga 240ttggcagctc ctgcagtcgg cccttggtcc tatttggcga cgcctctgcc ttccccttaa 300ttatacagtc atgagccgcc ctggaatcac ggcagctccg gatggatcct ggatgccaga 360atgcagcctc agcacggggc tgcaggacag gagtgagcga ggggctgcag agccggcggc 420cgcggtgggc accatggagg gggctgccct gggcagcacg ggcatgagtc tcaaggccca 480ggtttgagta acaggtgttg agagcttact tacttttcct gagacacagt ttcctcatct 540cgagagcacg gaaaatcatt ctaacttcag aggattgttg tgaaagttaa atgagattaa 600agaggtaaag cccatgacgt gcttagctcg tgcttggctc ttggtcaatg ccagttagcg 660ctgcattttc tcccctctcc ctccctcctt ctctctttct tttcttctat tctccattcc 720tgttttctcc cccaccccac tccccaaagc tctgcgttga gaaccagatg ctgtctggtg 780ggttagggcc agaggaggaa aagctgcccg ccgtgggctg cacccatacc ctcttcattc 840caatgacatg aggggagggg aaaggacaga ggtagactgt cctcccctac ctcctcctaa 900tacaaatgga attcctggaa ctggaaaaca aagaataccc ccataaaaat aagacagtac 960ttctggtgcg gtgtaataaa ggggaaagta accctcaatg tcaggaaact ccgcacctcc 1020cagctcatat ttgtgtggag gaaaagttaa atattaattt ggactcaact gaatgtggac 1080acaaacaatg gtcaccaagt cccggaacag gttgtgtgag cctcttcagg ggttcatcca 1140gcgctgtttt ggagaaatct ctatttcaat ttattcctat acgttagtta ctgaaaaaca 1200acagacaatc gcaaaagcaa gttgcccgtt ttgtgttcct tgagcccaat catgaagtgc 1260cgtcgtgact gggcctcatg acaaacaact tgtaacaagt aacaacagag ctcaggtccc 1320agaccgcact gaagctctgt gagacctctc ctcatctgtg catgaacgag tgtctgactc 1380tggagcccag cctgctgctt cccagtctgg tggtgaatcc tccgtagtct gatggaggtt 1440tgctcttgtt gcccaggctg gagtgcaatg gcacaatctc ggctcactgc agcccctgcc 1500tcccaggctc aagcaattct tacgcctcag cctcctgagt agatggaact acagggcatg 1560gactggaact cccaataccc ctgacatggg ctgagtcaac gtggtcatga acatgtgaca 1620ggaggcagca gaagttgcag agaagagtga ggcacgtttg aaaaaggctg aaaaatgttt 1680ctgtccaggc aagggtgtgt gctgaatgac tcaaggattt tttggagaga attggagtgt 1740ctcaccagag gagaccacgt ctgaagggct ttgcatccct ccttggacat gtctaatacc 1800taacactcag aaagcatcca gtaaatattc gtggaaagaa aggagtggag aaggggagaa 1860aggggaaagg gagtaggcga gagagaagaa agactctgct tcttgcccag ggcctggcat 1920ggggcggagg caaagcagtg gggtcctcag ctatgtccca ctgtgagtgc acagcgagtc 1980ctgaccttca gagggtgcag cccgaggggc cctggcctgt ctgaagggtg cgccagccga 2040gtggcctgct ctgaccacca ggctcaccca tgactacctg ggtggctaca gccagttcct 2100gacaatgagt acagcactca gttatcgggg cccttccacc cacacgctgt ccacttcctg 2160gggtactgct gtgggcatgt gagtgcttgc tccccggggc actgctgtgg gcatgcgagt 2220gcttgctccc cggggcactg ctgtccactt cctggggtac tgctgtgggc atgcgagtgc 2280ttgctccccg gggcactgct gtggacatgt gatagcttgc tccccagctc cactagtgac 2340actggcggcc cctcgctggg gccttccccg cctgctccgc tccattaccg ctgccgggct 2400cctcacgtct ctccttgctg cttcctgcac tggggtgagg agagtggggc tggtcccctt 2460gagaccggag aagctccagg cttttaagga aaactgccag ggacgaagag aagatatcac 2520ttccccacgt ggttggcttc cagattcaga aggaatgtct gtccttgtgg attccgtacc 2580agatgacccc agatgctgcc tcagtactag gtccctgtgg ctctggagcc tttgctgggt 2640ctgggcagtg tctcttcctc tccagttcat ccttgggtct cttcaccctt gccaggggca 2700ggcttcctgg tgagaggtcg acctcctgca tgaaggctct caagaggcca gttcaaagcc 2760aagctccggg tctgtgcctg tggggctgct cctcgatcag gagatggtca ctcccctcct 2820ggtctgtatc tgtgggattc tcctccatca ggagatggtc tctcccctcc tggtctatac 2880ccgtgggatt ctcctccatc aggagatggt cactccccat cctggtctat acccgtggga 2940ttctcctcca ttagatggtc actcccctcc tggtctattc ccgtggggct gctcctccat 3000caggaggtgg tcactccccc tcctggtcta tacccgtggg gctgctcctc catcaggaga 3060tagtcactcc ccctcctggt ctatacccgt gggattctcc tccatctgga gatggtcact 3120cccctcctgg tctataccca tgggattctc ctccatctgg agatggtcac tcccctcctg 3180gtctataccc atgggattct cctccatctg gagatagtca ctcccctcct ggtctatacc 3240cgtggggttc tcctcaatca ggaggtggtc actcccctcc tggtctatac ccgtgggatt 3300ctcctccatc aggagatggt cactctccct cctggtctat acccgtgggg ctgctcctcc 3360atcaggagat ggtcactccc ctcctggtct atacctgtgg gattctcctc cctcagaaga 3420tggtcactcc ccctcctggt ctatacccat gggattctcc tccctcagaa gatggtcact 3480ccccctcctg gtctataccc gtggggctgc tcctccatca ggagatggtc actctccctc 3540tcggttgctc agtccaaaaa caacctctct ggaaaactgc gtggaatttt tttttaaaga 3600attgaaacta gaactagcat ttgatccagc catctgccta ctgggaatac acccaaagaa 3660aaataaatca ttatatcag 3679713620DNAHomo sapiens 71gaggcagcca tgactggcca cttcatgtgc tcctggagaa gggcttgcac cagccgtttt 60caggaaagtc aagcagctgt tgactcctga gtctgggtga atttgtgtga agagcataag 120gcgctgtttc ttaaccaaaa cgcttcctct tgcagtgcag atgggatgtg cttctccaca 180ggaggcccca cggcttcccc acccctcaga ggagcgccgt gcgtgcgtct gtgtggagga 240ttggcagctc ctgcagtcgg cccttggtcc tatttggcga cgcctctgcc ttccccttaa 300ttatacagtc atgagccgcc ctggaatcac ggcagctccg gatggatcct ggatgccaga 360atgcagcctc agcacggggc tgcaggacag gagtgagcga ggggctgcag agccggcggc 420cgcggtgggc accatggagg gggctgccct gggcagcacg ggcatgagtc tcaaggccca 480ggtttgagta acaggtgttg agagcttact tacttttcct gagacacagt ttcctcatct 540cgagagcacg gaaaatcatt ctaacttcag aggattgttg tgaaagttaa atgagattaa 600agaggtaaag cccatgacgt gcttagctcg tgcttggctc ttggtcaatg ccagttagcg 660ctgcattttc tcccctctcc ctccctcctt ctctctttct tttcttctat tctccattcc 720tgttttctcc cccaccccac tccccaaagc tctgcgttga gaaccagatg ctgtctggtg 780ggttagggcc agaggaggaa aagctgcccg ccgtgggctg cacccatacc ctcttcattc 840caatgacatg aggggagggg aaaggacaga ggtagactgt cctcccctac ctcctcctaa 900tacaaatgga attcctggaa ctggaaaaca aagaataccc ccataaaaat aagacagtac 960ttctggtgcg gtgtaataaa ggggaaagta accctcaatg tcaggaaact ccgcacctcc 1020cagctcatat ttgtgtggag gaaaagttaa atattaattt ggactcaact gaatgtggac 1080acaaacaatg gtcaccaagt cccggaacag gttgtgtgag cctcttcagg ggttcatcca 1140gcgctgtttt ggagaaatct ctatttcaat ttattcctat acgttagtta ctgaaaaaca 1200acagacaatc gcaaaagcaa gttgcccgtt ttgtgttcct tgagcccaat catgaagtgc 1260cgtcgtgact gggcctcatg acaaacaact tgtaacaagt aacaacagag ctcaggtccc 1320agaccgcact gaagctctgt gagacctctc ctcatctgtg catgaacgag tgtctgactc 1380tggagcccag cctgctgctt cccagtctgg tggtgaatcc tccgtagtct gatggaggtt 1440tgctcttgtt gcccaggctg gagtgcaatg gcacaatctc ggctcactgc agcccctgcc 1500tcccaggctc aagcaattct tacgcctcag cctcctgagt agatggaact acagggcatg 1560gactggaact cccaataccc ctgacatggg ctgagtcaac gtggtcatga acatgtgaca 1620ggaggcagca gaagttgcag agaagagtga ggcacgtttg aaaaaggctg aaaaatgttt 1680ctgtccaggc aagggtgtgt gctgaatgac tcaaggattt tttggagaga attggagtgt 1740ctcaccagag gagaccacgt ctgaagggct ttgcatccct ccttggacat gtctaatacc 1800taacactcag aaagcatcca gtaaatattc gtggaaagaa aggagtggag aaggggagaa 1860aggggaaagg gagtaggcga gagagaagaa agactctgct tcttgcccag ggcctggcat 1920ggggcggagg caaagcagtg gggtcctcag ctatgtccca ctgtgagtgc acagcgagtc 1980ctgaccttca gagggtgcag cccgaggggc cctggcctgt ctgaagggtg cgccagccga 2040gtggcctgct ctgaccacca ggctcaccca tgactacctg ggtggctaca gccagttcct 2100gacaatgagt acagcactca gttatcgggg cccttccacc cacacgctgt ccacttcctg 2160gggtactgct gtgggcatgt gagtgcttgc tccccggggc actgctgtgg gcatgcgatg 2220cttgctcccc ggggcactgc tgtggacatg tgatagcttg ctccccagct ccactagtga 2280cactggcggc ccctcgctgg ggccttcccc gcctgctccg ctccattacc gctgccgggc 2340tcctcacgtc tctccttgct gcttcctgca ctggggtgag gagagtgggg ctggtcccct 2400tgagaccgga gaagctccag gcttttaagg aaaactgcca gggacgaaga gaagatatca 2460cttccccacg tggttggctt ccagattcag aaggaatgtc tgtccttgtg gattccgtac 2520cagatgaccc cagatgctgc ctcagtacta ggtccctgtg gctctggagc ctttgctggg 2580tctgggcagt gtctcttcct ctccagttca tccttgggtc tcttcaccct tgccaggggc 2640aggcttcctg gtgagaggtc gacctcctgc atgaaggctc tcaagaggcc agttcaaagc 2700caagctccgg gtctgtgcct gtggggctgc tcctcgatca ggagatggtc actcccctcc 2760tggtctgtat ctgtgggatt ctcctccatc aggagatggt ctctcccctc ctggtctata 2820cccgtgggat tctcctccat caggagatgg tcactcccca tcctggtcta tacccgtggg 2880attctcctcc attagatggt cactcccctc ctggtctatt cccgtggggc tgctcctcca 2940tcaggaggtg gtcactcccc ctcctggtct atacccgtgg ggctgctcct ccatcaggag 3000atagtcactc cccctcctgg tctatacccg tgggattctc ctccatctgg agatggtcac 3060tcccctcctg gtctataccc atgggattct cctccatctg gagatggtca ctcccctcct 3120ggtctatacc catgggattc tcctccatct ggagatagtc actcccctcc tggtctatac 3180ccgtggggtt ctcctcaatc aggaggtggt cactcccctc ctggtctata cccgtgggat 3240tctcctccat caggagatgg tcactctccc tcctggtcta tacccgtggg gctgctcctc 3300catcaggaga tggtcactcc cctcctggtc tatacctgtg ggattctcct ccctcagaag 3360atggtcactc cccctcctgg tctataccca tgggattctc ctccctcaga agatggtcac 3420tccccctcct ggtctatacc cgtggggctg ctcctccatc aggagatggt cactctccct 3480ctcggttgct cagtccaaaa acaacctctc tggaaaactg cgtggaattt ttttttaaag 3540aattgaaact agaactagca tttgatccag ccatctgcct actgggaata cacccaaaga 3600aaaataaatc attatatcag 362072720DNAHomo sapiens 72ctggagctct cctgaggagc ccacagatcc aactccctag gccaaggctg cagcctgggg 60cagagatgca gaggcctgga ggagcctagg gcacgcggcc tgcgggctgg ctggggttca 120gagttcgtat gtgtgtggag tgactgggca ggtgttcaga aatgaaggct ggcactgcca 180ggtaaggccc ttccctccct gatgtgagag ccctggagcc accgcagagg cccagtcaga 240tctctgttct aattctggcc tggtgtggag gatgaggaga gacggcccag aaaggaaggc 300agactgtgca gaccccatgt cttctggccc gcgaggccct cctcctgtgc ctgcttatct 360taaagaatcc gggataagag gtgacttggg ccttggccgg gaggcccctc ctcagcttca 420gacaaggagg gagctctggg catgaggaca ttgagcaaga ggcgatggca gtgcccacaa 480cttaccctca gctcggctct gttgggtccg agaagttgca tggaaagggc tccttggggg 540ccagttgtca gtaagctgca gaagcctgga gccggccagg aaataaccac gtgtaggagc 600cttctcagct gagaggaagg aggactcacg cgcggcgagc acatgcttgg agccaggcac 660agggcatgga ctggaactcc caatacccct gacatgggct gagtcaacgt ggtcatgaac 720734387DNAHomo sapiens 73caaatgcctg gcagcgtcct cggtgcttca cctgccatag ccgacagtgg ctgacctccc 60atgcctgttg ccttttcttt ctgttggatc agggatacac tgccatgtgt gttaagaaaa 120gctggcctta cctacagggc tggccagtcc cggtcacgtt tctagtaagc cattgcctta 180cataagggta acggcatggg acgctatctt agccaatgtg ataaaagtgg acatgaggtg 240agaggcttca gagagaggtt ttaaaaaaga gacaaaagca ggacgttgcc tctcttcctc 300ctctccacgt gtcctacccg gatgtgaagc caaaacagat gcaggcttag tgcaaccatg 360gggaacccag cataagcaca gattcaacag cagaagagtg gcagagggag aaggtgaaag 420gaacctaggt tttcctgtcc ttgttgagtc attcagttaa aaatccctgg aattttcctc 480tctccggcag tgtgttttgt gggataatga gttgccttat tggggttggc ttgctagtcg 540ggatgtttcg ctcccatcaa catccatacg cttgctctgt gaaccaatga cctgatgagg 600tagtattagc accaccatca ttatgctgag gatgagattt atggcacagt ggttcagtag 660cttgcccaag gccatgcggc tggtaggttc tggaggaggg ctcagggcac cccctgagct 720acccctgctg gccattgcac caccccataa agctgctggc agtcacttct ctgaggggtt 780agcatgtaag aaatgtcctc ctgaatgctg gccagacaaa tggaaatctg ccagggttgg 840gtacccccat gacagcagcc agcctgccct cttagtccct gacagctgca gtgacagcat 900ctgtgattgc aaagcgtgac aatttatatc tctcatttca tcacaccatc tatcagcaga 960cagtcaggct ttaaaaatca atcccacact gactcagtcc

ccagcagaga tggcctctga 1020caacagtatc cacactgcag gctggacaag ggccctatta attttgagac tcagccaaat 1080ttccttctga ccctaagctg gtgaatccct gctcctttgc tttggttggg gttggtgtga 1140gctaaggctg tgatcccatt tgctcctatg gcctccaggt ggcctgggcc tccatgaatg 1200ggccacatgg tcatactgaa tgcttgatta cactcagacc tagcagtcgt ctgggcgcag 1260ctggtttatg gatcactttg tcacaatgtt ccatccttcc aggtccccat ccccgcggtg 1320ggaaaacatt gctttaggca gtgctagagg acttcagcag gcattggcag cttctggatt 1380caggattaga acaaagaagg aggagtcaca gcaaagatag gaacagaagg cagagagaac 1440agacagatgg gggtgtttga gaaggagggc ctttgagacc tcagggagtg ggagacactg 1500gctcgagaat aataataatg gcaatttctc tcatctgtgt tttcagggca tggactggaa 1560ctcccaatac ccctgacatg ggctgagtca acgtggtcat gaacatgtga caggaggcag 1620cagaagttgc agagaagagt gaggcacgtt tgaaaaaggc tgaaaaatgt ttctgtccag 1680gcaagggtgt gtgctgaatg actcaaggat tttttggtgc attgaatgaa cagcgggaca 1740ttggacacct gctgatccat caccccgggc ccgggcaggc ccgtggatga agagagatgg 1800agaagaccag gcatgagact gtggagaagc cacaccacca gaaacccctg ccccatgcgc 1860cgtccagccc acacctgtgg atgcacgggg gattgcaggc agggctccca ccgtggactc 1920aggaacaggc agggaagctg ctgcctcacc aggcgaaggg gccaggaggg ggaggcggag 1980aggcccgtct agcccctgcg gctgtcaccg tggtgcctcc tcactggcca gtgcggtcgc 2040gcctcagctt cgttaatagg ggagggggcc taagagtttt cacgtccagg ctcgggcagt 2100ggggaggcag gcaggagtgg ccgctggttt ttcagacctc ccagggaggc cgaggaaatg 2160gcccgtcctg gagtgggcgt ggttctgtct tcagatggat gctggagggt tgggctgcgt 2220gggaccctgg gccctgctgc ttcccggagg atgcgctgtc cggggctgca caggttggct 2280gtgttttttg gatgcttgat attttgtttt ttcttctctt cactctgtca tgaaactggc 2340aatagtagtt tgtaaataaa tatgtgttat agatgaatat ttgctatgag taaattaata 2400aaggagtgaa taaatgagcg attgatgtag ggcctgtcct gtctcaggga gccccacgaa 2460ggcctgcgcg ccggccagag cctgcctgcc tgccagggta ctgggacgtc actctcaaag 2520cggcgggacc cagccgctga tcttgctgag gaggcccggt ctcagaaaac tgagcggctg 2580cttctgcaga ccctgcatcc tcccctccct ggagaaagaa gctctggctg agtcctggga 2640ccgaaccctt gggtgccaca gaaacgggct ttgctgcctg tcagtcaagc ggcgggagaa 2700acagacctgg ggaggaggag gctgggaggg ctgtgttttc tgcacagcga gtagctcctt 2760agcctggtgc catttctctc caaacaccct gaaggttgag tccagggtga agatgtagag 2820gcaagttttg gggggatgga gtgggcttgg agggatgctg gcgccttagc aggctgtgct 2880cctgaggtgc ccagtgtctg cgggcacagg aacatgttgc cgagggcatt tgggtgtggg 2940tggggtgggg aaagggagac agggctgtct cttttaatgg gtatctgcga gcatgtgatt 3000gtaagagagg aagaagtagg ggaggaagaa ggcctccttg ggaggtgcgt catcctgagg 3060aaggctgaac aatgagggtc ttggagagtc aattcagaag cacaaccttg cagagcaggc 3120aaaaacaata gggcttcttg aggctgcccg ggcactcatg caatcaccat ttcctgctgt 3180gaatgagcct acattttgtt ggggaagaga cgcaacgacg ccaaacgatg gactctgagt 3240caacgataag atgaaacaaa attaaaacaa agtaggaaat caagagtggc tgctgtgatg 3300gcgttgcgga gatgatgttt gctttgagaa ctggacaagt gagcccctga gctgcatctg 3360cacccagagg ctgagccggt gcacaggact tgcagaggga tgggcctggg cttgtagagc 3420agcacaacgg ccccaggcct ggaggagcaa gggtgggaag gggggcaggc cagctcctgc 3480caggctggag aaggactcgg acctcaggcc acctgtgcct gggtgattgt gaacttgtaa 3540caaatgtgat cttatttatg ttttgaaaaa ggcaacacaa accaacacga gccgtgtgag 3600gatcaggtga cagctgccca aaagctgaca caaggaacaa gcctggagga gtgaggatgg 3660gtgctgtgaa ggaggttgtg cagctgggcc cgcagtcgga cctggtgaga tcagaggagg 3720gggtgccacc agtctgtgga cgaagatgag aagctggaat agagcagaaa acaggaggct 3780gccactctcc atctttccca aagtcactcc aggagcaagg gtgtcattta ctgaaatgac 3840agactctcca tttcacattt ttcccccaag tgcagagtgc agggaagcag atgggctaaa 3900tttttagagt cagggttatt aatgtatact ttacatagta aactttcccc ttttaagtgt 3960gcaggcctga ggtttgccaa atatgtgtag gcatttaatc accaccacga tcaagatgta 4020gaatattccc actatcaaaa agtttgctgt gtcccttgat ggtcatgccc cattccacag 4080ccccagcccc agcccctgga gattgctgtc tgctttatgt tccagtggtt ttatcttttc 4140cagactgtat ggatgtgaat ggaatcagat gtgattccaa ggtgttttat cttttccaga 4200tgtgaatgga atcagatgta cgaaatccta tggtaggggg tcttctgagt ctagctcctt 4260ttgtttagcg tgatgcattt gaaattaatc catgtctcag gcatcaggag ttcatttctt 4320tttctgctga gtagtatttc attgtatgga tgtactgcaa tttgcctatc cattcacctg 4380ttgatgt 4387741398DNAHomo sapiens 74cgggatgttt cgctcccatc aacatccata cgcttgctct gtgaaccaat gacctgatga 60ggtagtatta gcaccaccat cattatgctg aggatgagat ttatggcaca gtggttcagt 120agcttgccca aggccatgcg gctggtaggt tctggaggag ggctcagggc accccctgag 180ctacccctgc tggccattgc accaccccat aaagctgctg gcagtcactt ctctgagggg 240ttagcatgta agaaatgtcc tcctgaatgc tggccagaca aatggaaatc tgccagggtt 300gggtaccccc atgacagcag ccagcctgcc ctcttagtcc ctgacagctg cagtgacagc 360atctgtgatt gcaaagcgtg acaatttata tctctcattt catcacacca tctatcagca 420gacagtcagg ctttaaaaat caatcccaca ctgactcagt ccccagcaga gatggcctct 480gacaacagta tccacactgc aggctggaca agggccctat taattttgag actcagccaa 540atttccttct gaccctaagc tggtgaatcc ctgctccttt gctttggttg gggttggtgt 600gagctaaggc tgtgatccca tttgctccta tggcctccag gtggcctggg cctccatgaa 660tgggccacat ggtcatactg aatgcttgat tacactcaga cctagcagtc gtctgggcgc 720agctggttta tggatcactt tgtcacaatg ttccatcctt ccaggtcccc atccccgcgg 780tgggaaaaca ttgctttagg cagtgctaga ggacttcagc aggcattggc agcttctgga 840ttcaggatta gaacaaagaa ggaggagtca cagcaaagat aggaacagaa ggcagagaga 900acagacagat gggggtgttt gagaaggagg gcctttgaga cctcagggag tgggagacac 960tggctcgaga ataataataa tggcaatttc tctcatctgt gttttcaggg catggactgg 1020aactcccaat acccctgaca tgggctgagt caacgtggtc atgaacatgt gacaggaggc 1080agcagaagtt gcagagaaga gtgaggcacg tttgaaaaag gctgaaaaat gtttctgtcc 1140aggcaagggt gtgtgctgaa tgactcaagg attttttggg tatgtcattt cccatttctc 1200accctcaaat aggactccgc ttcccatcta agcatttgta taaatattga ttattggtta 1260gtgtgtatca gagagctatt gagtaaaaat tatatcagaa aaattaagaa tctctagaga 1320tggcaaggtg tgaaacaaaa aacgccagga aggtaaatgc tcaaagttca ccacacacca 1380cagtgagaag tgttgggg 139875939DNAHomo sapiens 75acagcatctg tgattgcaaa gcgtgacaat ttatatctct catttcatca caccatctat 60cagcagacag tcaggcttta aaaatcaatc ccacactgac tcagtcccca gcagagatgg 120cctctgacaa cagtatccac actgcaggct ggacaagggc cctattaatt ttgagactca 180gccaaatttc cttctgaccc taagctggtg aatccctgct cctttgcttt ggttggggtt 240ggtgtgagct aaggctgtga tcccatttgc tcctatggcc tccaggtggc ctgggcctcc 300atgaatgggc cacatggtca tactgaatgc ttgattacac tcagacctag cagtcgtctg 360ggcgcagctg gtttatggat cactttgtca caatgttcca tccttccagg tccccatccc 420cgcggtggga aaacattgct ttaggcagtg ctagaggact tcagcaggca ttggcagctt 480ctggattcag gattagaaca aagaaggagg agtcacagca aagataggaa cagaaggcag 540agagaacaga cagatggggg tgtttgagaa ggagggcctt tgagacctca gggagtggga 600gacactggct cgagaataat aataatggca atttctctca tctgtgtttt cagggcatgg 660actggaactc ccaatacccc tgacatgggc tgagtcaacg tggtcatgaa catgtgacag 720gaggcagcag aagttgcaga gaagagtgag gcacgtttga aaaaggctga aaaatgtttc 780tgtccaggca agggtgtgtg ctgaatgact caaggatttt ttggctgatt tagtaaacaa 840acaagaatga agaaggaaac catagctgag tggcagagcg tgcctggctg tttacacagg 900actccagggc agggctcctg gagagggacg tgccagagg 93976138DNAArtificial Sequenceprimer 76ctggaaagga ggagaacatg aaacattgct tgaagacaat ggccgagaca gcaggtccca 60ccctgcacag ccaccagcat ctctcccctc agccctgtct cctcttctgc agttgggatc 120tgcacattta agcctgaa 13877147DNAArtificial Sequenceprimer 77attgtcctgt gaagtgaagt atgatcggac agcctctttt cagcttttat gacaatggag 60acagaggaat tgtggctctt gccaaggtca caggattgga atacagagcc aagccacccc 120aggacatgca agagcctcag aagggaa 1477819DNAArtificial Sequenceprimer 78acagccacca gcatctctc 197927DNAArtificial Sequenceprimer 79tgaagtgaag tatgatcgga cagcctc 278022DNAArtificial Sequenceprimer 80ccacaattcc tctgtctcca tt 228190DNAArtificial Sequenceprimer 81tctctcatct gtgttttcag ggcatggact ggaactccca atacccctga catgggctga 60gtcaacgtgg tcatgaacat gtgacaggag 9082101DNAArtificial Sequenceprimer 82gcagcagaag ttgcagagaa gagtgaggca cgtttgaaaa aggctgaaaa atgtttctgt 60ccaggcaagg gtgtgtgctg aatgactcaa ggattttttg g 1018321DNAArtificial Sequenceprimer 83catggactgg aactcccaat a 218425DNAHomo sapiens 84tgcagagaag agtgaggcac gtttg 258521DNAArtificial Sequenceprimer 85ccttgcctgg acagaaacat t 21861434DNAHomo sapiens 86gttcgttgca acaaattgat gagcaatgct tttttataat gccaactttg tacaaaaaag 60ttggcatgag ccggtcaagg cacctgggca aaatccggaa gcgtctggaa gatgtcaaga 120gccagtgggt ccggccagcc agggctgact ttagtgacaa cgagagtgcc cggctggcca 180cggacgccct cttggatggg ggttctgaag cctactggcg ggtgctcagc caggaaggcg 240aggtggactt cttgtcctcg gtggaggccc agtacatcca ggcccaggcc agggagcccc 300cgtgtccccc agacaccctg ggaggggcgg aagcaggccc taagggactg gactccagct 360ccctacagtc cggcacctac ttccctgtgg cctcagaggg cagcgagccg gccctactgc 420acagctgggc ctcagctgag aagccctacc tgaaggaaaa atccagcgcc actgtgtact 480tccagaccgt caagcacaac aacatcagag acctcgtccg ccgctgcatc acccggacta 540gccaggtcct ggtcatcctg atggatgtgt tcacggatgt ggagatcttc tgtgacattc 600tagaggcagc caacaagcgt ggggtgttcg tttgtgtgct cctggaccag ggaggtgtga 660agctcttcca ggagatgtgt gacaaagtcc agatctctga cagtcacctc aagaacattt 720ccatccggag tgtggaagga gagatatact gtgccaagtc aggcaggaaa ttcgctggcc 780aaatccggga gaagttcatc atctcggact ggagatttgt cctgtctgga tcttacagct 840tcacctggct ctgcggacac gtgcaccgga acatcctctc caagttcaca ggccaggcgg 900tggagctgtt tgacgaggag ttccgccacc tctacgcctc ctccaagcct gtgatgggcc 960tgaagtcccc gcggctggtc gcccccgtcc cgcccggagc agccccggcc aatggccgcc 1020ttagcagcag cagtggctcc gccagtgacc gcacgtcctc caaccccttc agcggccgct 1080cggcaggcag ccaccccggt acccgaagtg tgtccgcgtc ttcagggccc tgtagccccg 1140cggccccaca cccgcctcca ccgccccggt tccagcccca ccaaggccct tggggagccc 1200cgagtcccca ggcccacctc tccccgcggc cccacgacgg cccgcccgcc gctgtctaca 1260gcaacctggg ggcctacagg cccacgcggc tgcagctgga gcagctgggc ctggtgccga 1320ggctgactcc aacctggagg cccttcctgc aggcctcccc tcacttctgc ccaactttct 1380tgtacaaagt tggcattata agaaagcatt gcttatcaat ttgttgcaac gaac 1434873535DNAHomo sapiens 87gcggccgcgg cgccgatccc ggctgaggcg cagcggcgag aggtcgcggg cagggccatg 60gccccggggg gccgctagcg cggaccggcc caacgggagc cgctccgtgc cgccgccgcc 120gcccgggcgc ccaggccccg ccgctgcgga agaggtttct agagagtgga gcctgcttcc 180tgggccctag gcccctccca caatgcttgt cgccggtctt cttctctggg cttccctact 240gaccggggcc tggccatcct tccccaccca ggaccacctc ccggccacgc cccgggtccg 300gctctcattc aaagagctga aggccacagg caccgcccac ttcttcaact tcctgctcaa 360cacaaccgac taccgaatct tgctcaagga cgaggaccac gaccgcatgt acgtgggcag 420caaggactac gtgctgtccc tggacctgca cgacatcaac cgcgagcccc tcattataca 480ctgggcagcc tccccacagc gcatcgagga atgcgtgctc tcaggcaagg atgtcaacgg 540cgagtgtggg aacttcgtca ggctcatcca gccctggaac cgaacacacc tgtatgtgtg 600cgggacaggt gcctacaacc ccatgtgcac ctatgtgaac cgcggacgcc gcgcccaggc 660cacaccatgg acccagactc aggcggtcag aggccgcggc agcagagcca cggatggtgc 720cctccgcccg atgcccacag ccccacgcca ggattacatc ttctacctgg agcctgagcg 780actcgagtca gggaagggca agtgtccgta cgatcccaag ctggacacag catcggccct 840catcaatgag gagctctatg ctggtgtgta catcgatttt atgggcactg atgcagccat 900cttccgcaca cttggaaagc agacagccat gcgcacggat cagtacaact cccggtggct 960gaacgacccg tcgttcatcc atgctgagct cattcctgac agtgcggagc gcaatgatga 1020taagctttac ttcttcttcc gtgagcggtc ggcagaggcg ccgcagagcc ccgcggtgta 1080cgcccgcatc gggcgcattt gcctgaacga tgacggtggt cactgttgcc tggtcaacaa 1140gtggagcaca ttcctgaagg cgcggctcgt ctgctctgtc ccgggcgagg atggcattga 1200gactcacttt gatgagctcc aggacgtgtt tgtccagcag acccaggacg tgaggaaccc 1260tgtcatttac gctgtcttta cctcctctgg ctccgtgttc cgaggctctg ccgtgtgtgt 1320ctactccatg gctgatattc gcatggtctt caacgggccc tttgcccaca aagaggggcc 1380caactaccag tggatgccct tctcagggaa gatgccctac ccacggccgg gcacgtgccc 1440tggtggaacc ttcacgccat ctatgaagtc caccaaggat tatcctgatg aggtgatcaa 1500cttcatgcgc agccacccac tcatgtacca ggccgtgtac cctctgcagc ggcggcccct 1560ggtagtccgc acaggtgctc cctaccgcct taccactatt gccgtggacc aggtggatgc 1620agccgacggg cgctatgagg tgcttttcct gggcacagac cgcgggacag tgcagaaggt 1680cattgtgctg cccaaggatg accaggagtt ggaggagctc atgctggagg aggtggaggt 1740cttcaaggat ccagcacccg tcaagaccat gaccatctct tctaagaggc aacaactcta 1800cgtggcgtca gccgtgggtg tcacacacct gagcctgcac cgctgccagg cgtatggggc 1860tgcctgtgct gactgctgcc ttgcccggga cccttactgt gcctgggatg gccaggcctg 1920ctcccgctat acagcatcct ccaagaggcg gagccgccgg caggacgtcc ggcacggaaa 1980ccccatcagg cagtgccgtg ggttcaactc caatgccaac aagaatgccg tggagtctgt 2040gcagtatggc gtggccggca gcgcagcctt ccttgagtgc cagccccgct cgccccaagc 2100cactgttaag tggctgttcc agcgagatcc tggtgaccgg cgccgagaga ttcgtgcaga 2160ggaccgcttc ctgcgcacag agcagggctt gttgctccgt gcactgcagc tcagcgatcg 2220tggcctctac tcctgcacag ccactgagaa caactttaag cacgtcgtca cacgagtgca 2280gctgcatgta ctgggccggg acgccgtcca tgctgccctc ttcccaccac tgtccatgag 2340cgccccgcca cccccaggcg caggcccccc aacgcctcct taccaggagt tagcccagct 2400gctggcccag ccagaagtgg gcctcatcca ccagtactgc cagggttact ggcgccatgt 2460gccccccagc cccagggagg ctccaggggc accccggtct cctgagcccc aggaccagaa 2520aaagccccgg aaccgccggc accaccctcc ggacacatga ggccagctgc ctgtgcctgc 2580catgggccag cctagccctt gtccctttta atataaaaga tatatatata tatatatata 2640tataaaatat ctatattcta tacacaccct gcccctgcaa agacagtatt tattggtggg 2700ttgaatatag cctgcctcag tggcagcatc ctccaaaact tagacccatg ctggtcagag 2760acggcagaaa acagagcctg cctaaccagg cccagccagt tggtggggcc aggccaggac 2820cacacagtcc ccagactcag ctggaagtct acctgctgga cagcctccgc caagatctac 2880aggacaaagg gagggagcaa gccctactcg gatggggcac ggactgtcca ccttttctga 2940tgtgtgttgt cagcctgtgc tgtggcatag acatggatgc gaggaccact ttggagactg 3000gggtggcctc aagagcacac agagaaggga agaaggggcc atcacaggat gccagcccct 3060gcctgggttg ggggcactca gccacgacca gccccttcct gggtatttat tctctattta 3120ttggggatag gagaagaggc atcctgcctg ggtgggacag cctcttcagc cccttctccc 3180ctccccgcct ggccagggca gggccacccc actctacctc cttagctttc cctgtgccac 3240tttgactcag aggctgggag catagcagag gggccaggcc caggcagagc tgacgggagg 3300ccccagctct gaggggaggg ggtccgtggt agaggcctgg ggccggtaga ggctccccag 3360ggctccctta tgtccaccac ttcaggggat gggtgtggat gtaattagct ctggggggca 3420gttgggtaga tgggtggggg ctcctggtgg ccttctgctg cccaggccac agccgccttt 3480gggttccatc ttgctaataa acactggctc tgggactaga aaaaaaaaaa aaaaa 3535883558DNAHomo sapiens 88ctgactggtg ctccctctct tccatcttgg gctgtctgca tgtgtctcat tcccccactc 60tctcctgtgc ctcccctcta ccgtaataat caggtccagg tttctctgta ctgggagaag 120acctgtggct ggagcaggca gggatgcacc ctatctgttc cccattcctc caggtgggag 180ggagaaggag taacccactt tattggccac agatgcaggg gagaaaggag aaagcatgct 240gggagctgga aagagcccta agatcacctg gtttctagag agtggagcct gcttcctggg 300ccctaggccc ctcccacaat gcttgtcgcc ggtcttcttc tctgggcttc cctactgacc 360ggggcctggc catccttccc cacccaggac cacctcccgg ccacgccccg ggtccggctc 420tcattcaaag agctgaaggc cacaggcacc gcccacttct tcaacttcct gctcaacaca 480accgactacc gaatcttgct caaggacgag gaccacgacc gcatgtacgt gggcagcaag 540gactacgtgc tgtccctgga cctgcacgac atcaaccgcg agcccctcat tatacactgg 600gcagcctccc cacagcgcat cgaggaatgc gtgctctcag gcaaggatgt caacggcgag 660tgtgggaact tcgtcaggct catccagccc tggaaccgaa cacacctgta tgtgtgcggg 720acaggtgcct acaaccccat gtgcacctat gtgaaccgcg gacgccgcgc ccaggattac 780atcttctacc tggagcctga gcgactcgag tcagggaagg gcaagtgtcc gtacgatccc 840aagctggaca cagcatcggc cctcatcaat gaggagctct atgctggtgt gtacatcgat 900tttatgggca ctgatgcagc catcttccgc acacttggaa agcagacagc catgcgcacg 960gatcagtaca actcccggtg gctgaacgac ccgtcgttca tccatgctga gctcattcct 1020gacagtgcgg agcgcaatga tgataagctt tacttcttct tccgtgagcg gtcggcagag 1080gcgccgcaga gccccgcggt gtacgcccgc atcgggcgca tttgcctgaa cgatgacggt 1140ggtcactgtt gcctggtcaa caagtggagc acattcctga aggcgcggct cgtctgctct 1200gtcccgggcg aggatggcat tgagactcac tttgatgagc tccaggacgt gtttgtccag 1260cagacccagg acgtgaggaa ccctgtcatt tacgctgtct ttacctcctc tggctccgtg 1320ttccgaggct ctgccgtgtg tgtctactcc atggctgata ttcgcatggt cttcaacggg 1380ccctttgccc acaaagaggg gcccaactac cagtggatgc ccttctcagg gaagatgccc 1440tacccacggc cgggcacgtg ccctggtgga accttcacgc catctatgaa gtccaccaag 1500gattatcctg atgaggtgat caacttcatg cgcagccacc cactcatgta ccaggccgtg 1560taccctctgc agcggcggcc cctggtagtc cgcacaggtg ctccctaccg ccttaccact 1620attgccgtgg accaggtgga tgcagccgac gggcgctatg aggtgctttt cctgggcaca 1680gaccgcggga cagtgcagaa ggtcattgtg ctgcccaagg atgaccagga gttggaggag 1740ctcatgctgg aggaggtgga ggtcttcaag gatccagcac ccgtcaagac catgaccatc 1800tcttctaaga ggcaacaact ctacgtggcg tcagccgtgg gtgtcacaca cctgagcctg 1860caccgctgcc aggcgtatgg ggctgcctgt gctgactgct gccttgcccg ggacccttac 1920tgtgcctggg atggccaggc ctgctcccgc tatacagcat cctccaagag gcggagccgc 1980cggcaggacg tccggcacgg aaaccccatc aggcagtgcc gtgggttcaa ctccaatgcc 2040aacaagaatg ccgtggagtc tgtgcagtat ggcgtggccg gcagcgcagc cttccttgag 2100tgccagcccc gctcgcccca agccactgtt aagtggctgt tccagcgaga tcctggtgac 2160cggcgccgag agattcgtgc agaggaccgc ttcctgcgca cagagcaggg cttgttgctc 2220cgtgcactgc agctcagcga tcgtggcctc tactcctgca cagccactga gaacaacttt 2280aagcacgtcg tcacacgagt gcagctgcat gtactgggcc gggacgccgt ccatgctgcc 2340ctcttcccac cactgtccat gagcgccccg ccacccccag gcgcaggccc cccaacgcct 2400ccttaccagg agttagccca gctgctggcc cagccagaag tgggcctcat ccaccagtac 2460tgccagggtt actggcgcca tgtgcccccc agccccaggg aggctccagg ggcaccccgg 2520tctcctgagc cccaggacca gaaaaagccc cggaaccgcc ggcaccaccc tccggacaca 2580tgaggccagc tgcctgtgcc tgccatgggc cagcctagcc cttgtccctt ttaatataaa 2640agatatatat atatatatat atatataaaa tatctatatt ctatacacac cctgcccctg 2700caaagacagt atttattggt gggttgaata tagcctgcct cagtggcagc atcctccaaa 2760acttagaccc atgctggtca gagacggcag aaaacagagc ctgcctaacc aggcccagcc 2820agttggtggg gccaggccag gaccacacag tccccagact cagctggaag tctacctgct

2880ggacagcctc cgccaagatc tacaggacaa agggagggag caagccctac tcggatgggg 2940cacggactgt ccaccttttc tgatgtgtgt tgtcagcctg tgctgtggca tagacatgga 3000tgcgaggacc actttggaga ctggggtggc ctcaagagca cacagagaag ggaagaaggg 3060gccatcacag gatgccagcc cctgcctggg ttgggggcac tcagccacga ccagcccctt 3120cctgggtatt tattctctat ttattgggga taggagaaga ggcatcctgc ctgggtggga 3180cagcctcttc agccccttct cccctccccg cctggccagg gcagggccac cccactctac 3240ctccttagct ttccctgtgc cactttgact cagaggctgg gagcatagca gaggggccag 3300gcccaggcag agctgacggg aggccccagc tctgagggga gggggtccgt ggtagaggcc 3360tggggccggt agaggctccc cagggctccc ttatgtccac cacttcaggg gatgggtgtg 3420gatgtaatta gctctggggg gcagttgggt agatgggtgg gggctcctgg tggccttctg 3480ctgcccaggc cacagccgcc tttgggttcc atcttgctaa taaacactgg ctctgggact 3540agaaaaaaaa aaaaaaaa 3558893274DNAHomo sapiens 89cccgcgcggc tctgagcgcc ccgtcccgcc ggcggccgcg agaccagagc gagcgaacga 60accgcggcgg tccggagagc cccgagcgca gcgcaggacc tgggaccacc tcccggccac 120gccccgggtc cggctctcat tcaaagagct gaaggccaca ggcaccgccc acttcttcaa 180cttcctgctc aacacaaccg actaccgaat cttgctcaag gacgaggacc acgaccgcat 240gtacgtgggc agcaaggact acgtgctgtc cctggacctg cacgacatca accgcgagcc 300cctcattata cactgggcag cctccccaca gcgcatcgag gaatgcgtgc tctcaggcaa 360ggatgtcaac ggcgagtgtg ggaacttcgt caggctcatc cagccctgga accgaacaca 420cctgtatgtg tgcgggacag gtgcctacaa ccccatgtgc acctatgtga accgcggacg 480ccgcgcccag gattacatct tctacctgga gcctgagcga ctcgagtcag ggaagggcaa 540gtgtccgtac gatcccaagc tggacacagc atcggccctc atcaatgagg agctctatgc 600tggtgtgtac atcgatttta tgggcactga tgcagccatc ttccgcacac ttggaaagca 660gacagccatg cgcacggatc agtacaactc ccggtggctg aacgacccgt cgttcatcca 720tgctgagctc attcctgaca gtgcggagcg caatgatgat aagctttact tcttcttccg 780tgagcggtcg gcagaggcgc cgcagagccc cgcggtgtac gcccgcatcg ggcgcatttg 840cctgaacgat gacggtggtc actgttgcct ggtcaacaag tggagcacat tcctgaaggc 900gcggctcgtc tgctctgtcc cgggcgagga tggcattgag actcactttg atgagctcca 960ggacgtgttt gtccagcaga cccaggacgt gaggaaccct gtcatttacg ctgtctttac 1020ctcctctggc tccgtgttcc gaggctctgc cgtgtgtgtc tactccatgg ctgatattcg 1080catggtcttc aacgggccct ttgcccacaa agaggggccc aactaccagt ggatgccctt 1140ctcagggaag atgccctacc cacggccggg cacgtgccct ggtggaacct tcacgccatc 1200tatgaagtcc accaaggatt atcctgatga ggtgatcaac ttcatgcgca gccacccact 1260catgtaccag gccgtgtacc ctctgcagcg gcggcccctg gtagtccgca caggtgctcc 1320ctaccgcctt accactattg ccgtggacca ggtggatgca gccgacgggc gctatgaggt 1380gcttttcctg ggcacagacc gcgggacagt gcagaaggtc attgtgctgc ccaaggatga 1440ccaggagttg gaggagctca tgctggagga ggtggaggtc ttcaaggatc cagcacccgt 1500caagaccatg accatctctt ctaagaggca acaactctac gtggcgtcag ccgtgggtgt 1560cacacacctg agcctgcacc gctgccaggc gtatggggct gcctgtgctg actgctgcct 1620tgcccgggac ccttactgtg cctgggatgg ccaggcctgc tcccgctata cagcatcctc 1680caagaggcgg agccgccggc aggacgtccg gcacggaaac cccatcaggc agtgccgtgg 1740gttcaactcc aatgccaaca agaatgccgt ggagtctgtg cagtatggcg tggccggcag 1800cgcagccttc cttgagtgcc agccccgctc gccccaagcc actgttaagt ggctgttcca 1860gcgagatcct ggtgaccggc gccgagagat tcgtgcagag gaccgcttcc tgcgcacaga 1920gcagggcttg ttgctccgtg cactgcagct cagcgatcgt ggcctctact cctgcacagc 1980cactgagaac aactttaagc acgtcgtcac acgagtgcag ctgcatgtac tgggccggga 2040cgccgtccat gctgccctct tcccaccact gtccatgagc gccccgccac ccccaggcgc 2100aggcccccca acgcctcctt accaggagtt agcccagctg ctggcccagc cagaagtggg 2160cctcatccac cagtactgcc agggttactg gcgccatgtg ccccccagcc ccagggaggc 2220tccaggggca ccccggtctc ctgagcccca ggaccagaaa aagccccgga accgccggca 2280ccaccctccg gacacatgag gccagctgcc tgtgcctgcc atgggccagc ctagcccttg 2340tcccttttaa tataaaagat atatatatat atatatatat ataaaatatc tatattctat 2400acacaccctg cccctgcaaa gacagtattt attggtgggt tgaatatagc ctgcctcagt 2460ggcagcatcc tccaaaactt agacccatgc tggtcagaga cggcagaaaa cagagcctgc 2520ctaaccaggc ccagccagtt ggtggggcca ggccaggacc acacagtccc cagactcagc 2580tggaagtcta cctgctggac agcctccgcc aagatctaca ggacaaaggg agggagcaag 2640ccctactcgg atggggcacg gactgtccac cttttctgat gtgtgttgtc agcctgtgct 2700gtggcataga catggatgcg aggaccactt tggagactgg ggtggcctca agagcacaca 2760gagaagggaa gaaggggcca tcacaggatg ccagcccctg cctgggttgg gggcactcag 2820ccacgaccag ccccttcctg ggtatttatt ctctatttat tggggatagg agaagaggca 2880tcctgcctgg gtgggacagc ctcttcagcc ccttctcccc tccccgcctg gccagggcag 2940ggccacccca ctctacctcc ttagctttcc ctgtgccact ttgactcaga ggctgggagc 3000atagcagagg ggccaggccc aggcagagct gacgggaggc cccagctctg aggggagggg 3060gtccgtggta gaggcctggg gccggtagag gctccccagg gctcccttat gtccaccact 3120tcaggggatg ggtgtggatg taattagctc tggggggcag ttgggtagat gggtgggggc 3180tcctggtggc cttctgctgc ccaggccaca gccgcctttg ggttccatct tgctaataaa 3240cactggctct gggactagaa aaaaaaaaaa aaaa 3274902658DNAHomo sapiens 90aataaatatc cgtgtagaaa atcagaacga ctctttcagg ccatctttaa aatgtcattg 60gtaaaccata cttgatccta aattcctgta cttcctcagg ccatccgagc atgaaacgct 120gtcacctacc cacatccgct ggctgtgacg cttgtcaaag tgttctctat cggctgcatg 180cctagaccac caaagcgttc tgaccggaca gtgtcactgg agaaggcggc gcgacatgtc 240cagggcgcag atctgggctc tggtgtctgg tgtcggaggg tttggagctc tcgttgctgc 300taccacgtcc aatgagtgga aagtgaccac gcgagcctcc tcggtgataa cagccacttg 360ggtttaccag ggtctgtgga tgaactgcgc aggtaacgcg ttgggttctt tccattgccg 420accgcatttt actatcttca aagtagcagg ttatatacag gcatgtagag gacttatgat 480cgctgctgtc agcctgggct tctttggttc catatttgcg ctctttggaa tgaagtgtac 540caaagtcgga ggctccgata aagccaaagc taaaattgct tgtttggctg ggattgtatt 600catactgtca gggctgtgct caatgactgg atgttcccta tatgcaaaca aaatcacaac 660ggaattcttt gatcctctct ttgttgagca aaagtatgaa ttaggagccg ctctgtttat 720tggatgggca ggagcctcac tgtgcataat tggtggtgtc atattttgct tttcaatatc 780tgacaacaac aaaacaccca gatacacata caacggggcc acatctgtca tgtcttctcg 840gacaaagtat catggtggag aagattttaa aacaacaaac ccttcaaaac agtttgataa 900aaatgcttat gtctaaaaga gctcgctggc aagctgcctc ttgagtttgt tataaaagcg 960aactgttcac aaaatgatcc catcaaggcc ctcccataat taacactcaa aactattttt 1020aaaatatgca tttgaagcat ctgttgattg tatggatgta agtgttctta catagttagt 1080tatatactaa tcattttctg ttgtggcttt ctataaaaaa taaacagttt atttacagga 1140tttgtaaaat gttttctaca tttatataga acatgaaaag catttagtac caaaggttca 1200agaagtattc gtactctagc ctttttaatc attcatagat agaagtcttt gtacccactc 1260cttatgtttc ttttcattca taaacaggtg tataaggaac aatgtcttat aaacagcatg 1320ggggcaatct gagaatattc ctcaaaaggt gtccaggtta aatagacatg ttactggctg 1380cacacaggca aattctagtt tgtttttttt aagtattcta caacatttat ttaaaaaggt 1440aaatcttttt gttgaagcag caagttatct ggtagaactt aacttctaca ggatcagaga 1500ggatcttgct cattcatggc catatccaca tgcccatggc cactcagtag attgttgaaa 1560aagcaaagcc acaccattct ctttgatgta tgcagagagt tacgtagcag gggatgttct 1620ctgatttatt ccactggcac cattagtgaa tatttagttg ttttcataaa cgatgctgtg 1680atgaagactc atgtacatat ttagcaaatt ttggtttctt acatgtgcct gtcatgactg 1740taattcatta tgactgctcc aggaagggct aatggggcca atatattatt gcctgtcatg 1800tggcacatcc atgttaaggg gctgaggcgt ccctggcacg gaatgcagag ccctgagcta 1860gggcatcagc agaagctgag atagagatat tggtcatggt tgactgagga gccaattaaa 1920acctgtttat gcctagtgtt ccattattgg aacactaagc atgtgggagt tatttatatc 1980ctactgctca aggtcatcgc caaggtgtga ttggaaaaat tcaaaaaatt gcaacctcag 2040gcataaatgg gttaaggaca tcccaagccc aagtggtacg tgcctcactc agaactgacg 2100ggccgagttc tatctaggtg tgtcttccag aacctgttta cggctaactg gataactgag 2160agacttgtca tttctaaaga catttaagtt gctccaggga tttctgaaaa aagacacagg 2220cttcttccta gagccagccc tatataacat gcccacaagg gcaacagtta tcacagttca 2280tacacacctt tcatgtcctg tctcactcac tcctcacagc catcctagga gatacatatt 2340gttttcatcc tgcatttaca gaaaaagaaa tgaaaacaga gagcttaaat aatttgccac 2400agtaatgtcg aaactaggcc tttgaaccaa ggcagtctag ggtaaaatat agtttcaaag 2460tatgaataag aattggtatt tgtgttatct ttgagtaaga aactgtccga tatgaatcac 2520aacgtgggtg aatgtagtat tttcctgaag tgtgaaagac ttaaaaaaaa gaatcacatt 2580gttcagaggt gctcaatgga aagaaaagga aatgaacaag tttgttaaaa gataaaaaat 2640aaaaaaaatt ccatacct 2658912490DNAHomo sapiens 91gagtgcgggg gtcgcggcgc agagtgggag ccggagagcg agcgcggctg cagccggcgg 60catggctagc acggcttcgg agatcatcgc cttcatggtc tccatctcag gctgggtact 120ggtgtcctcc acgctgccca ccgactactg gaaggtgtct accatcgacg gcacggtcat 180cacaaccgcc acctattggg ccaacctgtg gaaggcgtgc gttaccgact ccacgggcgt 240ctccaactgc aaggacttcc cctccatgct ggcgctggac ggttatatac aggcatgtag 300aggacttatg atcgctgctg tcagcctggg cttctttggt tccatatttg cgctctttgg 360aatgaagtgt accaaagtcg gaggctccga taaagccaaa gctaaaattg cttgtttggc 420tgggattgta ttcatactgt cagggctgtg ctcaatgact ggatgttccc tatatgcaaa 480caaaatcaca acggaattct ttgatcctct ctttgttgag caaaagtatg aattaggagc 540cgctctgttt attggatggg caggagcctc actgtgcata attggtggtg tcatattttg 600cttttcaata tctgacaaca acaaaacacc cagatacaca tacaacgggg ccacatctgt 660catgtcttct cggacaaagt atcatggtgg agaagatttt aaaacaacaa acccttcaaa 720acagtttgat aaaaatgctt atgtctaaaa gagctcgctg gcaagctgcc tcttgagttt 780gttataaaag cgaactgttc acaaaatgat cccatcaagg ccctcccata attaacactc 840aaaactattt ttaaaatatg catttgaagc atctgttgat tgtatggatg taagtgttct 900tacatagtta gttatatact aatcattttc tgttgtggct ttctataaaa aataaacagt 960ttatttacag gatttgtaaa atgttttcta catttatata gaacatgaaa agcatttagt 1020accaaaggtt caagaagtat tcgtactcta gcctttttaa tcattcatag atagaagtct 1080ttgtacccac tccttatgtt tcttttcatt cataaacagg tgtataagga acaatgtctt 1140ataaacagca tgggggcaat ctgagaatat tcctcaaaag gtgtccaggt taaatagaca 1200tgttactggc tgcacacagg caaattctag tttgtttttt ttaagtattc tacaacattt 1260atttaaaaag gtaaatcttt ttgttgaagc agcaagttat ctggtagaac ttaacttcta 1320caggatcaga gaggatcttg ctcattcatg gccatatcca catgcccatg gccactcagt 1380agattgttga aaaagcaaag ccacaccatt ctctttgatg tatgcagaga gttacgtagc 1440aggggatgtt ctctgattta ttccactggc accattagtg aatatttagt tgttttcata 1500aacgatgctg tgatgaagac tcatgtacat atttagcaaa ttttggtttc ttacatgtgc 1560ctgtcatgac tgtaattcat tatgactgct ccaggaaggg ctaatggggc caatatatta 1620ttgcctgtca tgtggcacat ccatgttaag gggctgaggc gtccctggca cggaatgcag 1680agccctgagc tagggcatca gcagaagctg agatagagat attggtcatg gttgactgag 1740gagccaatta aaacctgttt atgcctagtg ttccattatt ggaacactaa gcatgtggga 1800gttatttata tcctactgct caaggtcatc gccaaggtgt gattggaaaa attcaaaaaa 1860ttgcaacctc aggcataaat gggttaagga catcccaagc ccaagtggta cgtgcctcac 1920tcagaactga cgggccgagt tctatctagg tgtgtcttcc agaacctgtt tacggctaac 1980tggataactg agagacttgt catttctaaa gacatttaag ttgctccagg gatttctgaa 2040aaaagacaca ggcttcttcc tagagccagc cctatataac atgcccacaa gggcaacagt 2100tatcacagtt catacacacc tttcatgtcc tgtctcactc actcctcaca gccatcctag 2160gagatacata ttgttttcat cctgcattta cagaaaaaga aatgaaaaca gagagcttaa 2220ataatttgcc acagtaatgt cgaaactagg cctttgaacc aaggcagtct agggtaaaat 2280atagtttcaa agtatgaata agaattggta tttgtgttat ctttgagtaa gaaactgtcc 2340gatatgaatc acaacgtggg tgaatgtagt attttcctga agtgtgaaag acttaaaaaa 2400aagaatcaca ttgttcagag gtgctcaatg gaaagaaaag gaaatgaaca agtttgttaa 2460aagataaaaa ataaaaaaaa ttccatacct 2490922424DNAHomo sapiens 92gttccccgcg tgccaccagg aagctcgggc cggccaagag cgtagactct tgagaggagt 60gagacaggtg cgcgccagcc ggccttcggg gctttatggg aactgggccg tgcggcggtc 120ccgccctcgt gcgcaggcgc agaaccgttg tgaccagagc ggttgcgggc tgagcggttt 180cgagccggcg tcggggagcg gcggtaccgg gcggctgcgg ggctggctcg acccagcttg 240aggtctcggc gtccgcgtcc tgcggtgccc tggggtctcc cgaggacctt gtacccgcgc 300ggcttccttg ggctggcttt ggacgacgct ttcgccttcc tgctgcctag gatccgccga 360catgaatccc atcgtagtgg tccacggcgg cggagccggt cccatctcca aggatcggaa 420ggagcgagtg caccagggca tggtcagagc cgccaccgtg ggctacggca tcctccggga 480gggcgggagc gccgtggatg ccgtagaggg agctgtcgtc gccctggaag acgatcccga 540gttcaacgca ggttgtgggt ctgtcttgaa cacaaatggt gaggttgaaa tggatgctag 600tatcatggat ggaaaagacc tgtctgcagg agcagtgtcc gcagtccagt gtatagcaaa 660tcccattaaa cttgctcggc ttgtcatgga aaagacacct cattgctttc tgactgacca 720aggcgcagcg cagtttgcag cagctatggg ggttccagag attcctggag aaaaactggt 780gacagagaga aacaaaaagc gcctggaaaa agagaagcat gaaaaaggtg ctcagaaaac 840agattgtcaa aaaaacttgg gaaccgtggg tgctgttgcc ttggactgca aagggaatgt 900agcctacgca acctccacag gcggtatcgt taataaaatg gtcggccgcg ttggggactc 960accgtgtcta ggagctggag gttatgccga caatgacatc ggagccgtct caaccacagg 1020gcatggggaa agcatcctga aggtgaacct ggctagactc accctgttcc acatagaaca 1080aggaaagacg gtagaagagg ctgcggacct atcgttgggt tatatgaagt caagggttaa 1140aggtttaggt ggcctcatcg tggttagcaa aacaggagac tgggtggcaa agtggacctc 1200cacctccatg ccctgggcag ccgccaagga cggcaagctg cacttcggaa ttgatcctga 1260cgatactact atcaccgacc ttccctaagc cgctggaaga ttgtattcca gatgctagct 1320tagaggtcaa gtacagtctc ctcatgagac atagcctaat caattagatc tagaattgga 1380aaaattgtcc cgtctgtcac ttgttttgtt gccttaataa gcatctgaat gtttggttgt 1440ggggcgggtt ctgaagcgat gagagaaatg cccgtattag gaggattact tgagcccagg 1500aggtcaaagc tgaggtgagc catgattact ccactgcact ccagcctggg caacagagcc 1560aggccctgta tcaaaaaaaa aaaaaaaaag aaaagggaaa aaagaaagaa agcagcagca 1620tgatcctgac atgacagatg tgggagaccc acagcctgca gacactgtgg gctggaaggt 1680gggaagggag gggccggtgg aggtggagct gtttgaaagt gacacagcag cagtagaagc 1740agtggtgggc gaagcccagg tgaccctcag aacgttgcac aagaacatca gggaaaagaa 1800ccagaatcct ttaaggaaaa tgttcttcat gtatgagaga ctaaagtgat ttttctaaga 1860aagttcagcc cttctctgac ttacctggac atttctagat acttccaaag gaccctctgg 1920gaatccatag cttcctaatc tggagatggg aggtcataag ggagacgctg tggggttcct 1980tgaagtttct tgggttcaca gaggagcccc ctcacttggt gttctcccgt gagccagcct 2040ccacctgcca aagacactct ggtcctcgta tagtgagtaa tggggctcag ggcctctcca 2100acaacagaga ggagctgatg ctgtagggct gaccccgtga cttcctgagt cctcaccctg 2160tccagtgctt tgagattctt cccacctccc catcctcacc agccggatcg ggcgctgtgc 2220agtgtggtca gcatggtgaa gaaagtcatt tcctcggtgg gcagtattcc tctttatctc 2280tcattacact ggaaatgtta tttctgctgt atcatccgtg ctcaacgttt tagtctgtca 2340ggctcacctt ctctctggaa agaatttgct taacttgaca ttccatgtgc cgctaataaa 2400atatattttg aaagaataaa aaaa 2424932347DNAHomo sapiens 93gttccccgcg tgccaccagg aagctcgggc cggccaagag cgtagactct tgagaggagt 60gagacaggtg cgcgccagcc ggccttcggg gctttatggg aactgggccg tgcggcggtc 120ccgccctcgt gcgcaggcgc agaaccgttg tgaccagagc ggttgcgggc tgagcggttt 180cgagccggcg tcggggagcg gcggtaccgg gcggctgcgg ggctggctcg acccagcttg 240aggtctcggc gtccgcgtcc tgcggtgccc tgggatccgc cgacatgaat cccatcgtag 300tggtccacgg cggcggagcc ggtcccatct ccaaggatcg gaaggagcga gtgcaccagg 360gcatggtcag agccgccacc gtgggctacg gcatcctccg ggagggcggg agcgccgtgg 420atgccgtaga gggagctgtc gtcgccctgg aagacgatcc cgagttcaac gcaggttgtg 480ggtctgtctt gaacacaaat ggtgaggttg aaatggatgc tagtatcatg gatggaaaag 540acctgtctgc aggagcagtg tccgcagtcc agtgtatagc aaatcccatt aaacttgctc 600ggcttgtcat ggaaaagaca cctcattgct ttctgactga ccaaggcgca gcgcagtttg 660cagcagctat gggggttcca gagattcctg gagaaaaact ggtgacagag agaaacaaaa 720agcgcctgga aaaagagaag catgaaaaag gtgctcagaa aacagattgt caaaaaaact 780tgggaaccgt gggtgctgtt gccttggact gcaaagggaa tgtagcctac gcaacctcca 840caggcggtat cgttaataaa atggtcggcc gcgttgggga ctcaccgtgt ctaggagctg 900gaggttatgc cgacaatgac atcggagccg tctcaaccac agggcatggg gaaagcatcc 960tgaaggtgaa cctggctaga ctcaccctgt tccacataga acaaggaaag acggtagaag 1020aggctgcgga cctatcgttg ggttatatga agtcaagggt taaaggttta ggtggcctca 1080tcgtggttag caaaacagga gactgggtgg caaagtggac ctccacctcc atgccctggg 1140cagccgccaa ggacggcaag ctgcacttcg gaattgatcc tgacgatact actatcaccg 1200accttcccta agccgctgga agattgtatt ccagatgcta gcttagaggt caagtacagt 1260ctcctcatga gacatagcct aatcaattag atctagaatt ggaaaaattg tcccgtctgt 1320cacttgtttt gttgccttaa taagcatctg aatgtttggt tgtggggcgg gttctgaagc 1380gatgagagaa atgcccgtat taggaggatt acttgagccc aggaggtcaa agctgaggtg 1440agccatgatt actccactgc actccagcct gggcaacaga gccaggccct gtatcaaaaa 1500aaaaaaaaaa aagaaaaggg aaaaaagaaa gaaagcagca gcatgatcct gacatgacag 1560atgtgggaga cccacagcct gcagacactg tgggctggaa ggtgggaagg gaggggccgg 1620tggaggtgga gctgtttgaa agtgacacag cagcagtaga agcagtggtg ggcgaagccc 1680aggtgaccct cagaacgttg cacaagaaca tcagggaaaa gaaccagaat cctttaagga 1740aaatgttctt catgtatgag agactaaagt gatttttcta agaaagttca gcccttctct 1800gacttacctg gacatttcta gatacttcca aaggaccctc tgggaatcca tagcttccta 1860atctggagat gggaggtcat aagggagacg ctgtggggtt ccttgaagtt tcttgggttc 1920acagaggagc cccctcactt ggtgttctcc cgtgagccag cctccacctg ccaaagacac 1980tctggtcctc gtatagtgag taatggggct cagggcctct ccaacaacag agaggagctg 2040atgctgtagg gctgaccccg tgacttcctg agtcctcacc ctgtccagtg ctttgagatt 2100cttcccacct ccccatcctc accagccgga tcgggcgctg tgcagtgtgg tcagcatggt 2160gaagaaagtc atttcctcgg tgggcagtat tcctctttat ctctcattac actggaaatg 2220ttatttctgc tgtatcatcc gtgctcaacg ttttagtctg tcaggctcac cttctctctg 2280gaaagaattt gcttaacttg acattccatg tgccgctaat aaaatatatt ttgaaagaat 2340aaaaaaa 2347942205DNAHomo sapiens 94tcccctggac ccgcccccat ctgcccaaga taattttagt ttccttgggc ctggaatctg 60gacacacagg gctccccccc gcctctgact tctctgtccg aagtcgggac accctcctac 120cacctgtaga gaagcgggag tggatctgaa ataaaatcca ggaatctggg ggttcctaga 180cggagccaga cttcggaacg ggtgtcctgc tactcctgct ggggctcctc caggacaagg 240gcacacaact ggttccgtta agcccctctc ttgctcagac gccatggagc tggatctgtc 300tccacctcat cttagcagct ctccggaaga cctttgccca gcccctggga cccctcctgg 360gactccccgg ccccctgata cccctctgcc tgaggaggta aagaggtccc agcctctcct 420catcccaacc accggcagga aacttcgaga ggaggagagg cgtgccacct ccctcccctc 480tatccccaac cccttccctg agctctgcag tcctccctca cagagcccaa ttctcggggg 540cccctccagt gcaagggggc tgctcccccg cgatgccagc cgcccccatg tagtaaaggt 600gtacagtgag gatggggcct gcaggtctgt ggaggtggca gcaggtgcca cagctcgcca 660cgtgtgtgaa atgctggtgc agcgagctca cgccttgagc gacgagacct gggggctggt 720ggagtgccac ccccacctag cactggagcg gggtttggag gaccacgagt ccgtggtgga 780agtgcaggct gcctggcccg tgggcggaga tagccgcttc gtcttccgga aaaacttcgc

840caagtacgaa ctgttcaaga gctccccaca ctccctgttc ccagaaaaaa tggtctccag 900ctgtctcgat gcacacactg gtatatccca tgaagacctc atccagaact tcctgaatgc 960tggcagcttt cctgagatcc agggctttct gcagctgcgg ggttcaggac ggaagctttg 1020gaaacgcttt ttctgcttct tgcgccgatc tggcctctat tactccacca agggcacctc 1080taaggatccg aggcacctgc agtacgtggc agatgtgaac gagtccaacg tgtacgtggt 1140gacgcagggc cgcaagctct acgggatgcc cactgacttc ggtttctgtg tcaagcccaa 1200caagcttcga aatggccaca aggggcttcg gatcttctgc agtgaagatg agcagagccg 1260cacctgctgg ctggctgcct tccgcctctt caagtacggg gtgcagctgt acaagaatta 1320ccagcaggca cagtctcgcc atctgcatcc atcttgtttg ggctccccac ccttgagaag 1380tgcctcagat aataccctgg tggccatgga cttctctggc catgctgggc gtgtcattga 1440gaacccccgg gaggctctga gtgtggccct ggaggaggcc caggcctgga ggaagaagac 1500aaaccaccgc ctcagcctgc ccatgccagc ctccggcacg agcctcagtg cagcctgttc 1560ctggtccggg agagtcagcg gaacccccag ggctttgtcc tctctttgtg ccacctgcag 1620aaagtgaagc attatctcat cctgccgagc gaggaggagg gccgcctgta cttcagcatg 1680gatgatggcc agacccgctt cactgacctg ctgcagctcg tggagttcca ccagctgaac 1740cgcggcatcc tgccgtgctt gctgcgccat tgctgcacgc gggtggccct ctgaccaggc 1800cgtggactgg ctcatgcctc agcccgcctt caggctgccc gccgcccctc cacccatcca 1860gtggactctg gggcgcggcc acaggggacg ggatgaggag cgggagggtt ccgccactcc 1920agttttctcc tctgcttctt tgcctccctc agatagaaaa cagcccccac tccagtccac 1980tcctgacccc tctcctcaag ggaaggcctt gggtggcccc ctctccttct cctagctctg 2040gaggtgctgc tctagggcag ggaattatgg gagaagtggg ggcagcccag gcggtttcac 2100gccccacact ttgtacagac cgagaggcca gttgatctgc tctgttttat actagtgaca 2160ataaagatta ttttttgata caaaaaaaaa aaaaaaaaaa aaaaa 2205951210DNAHomo sapiens 95ctctcccttc tccactctct ccccctgtct cctttcttct tcttctttca ccctccgtct 60ctcacacccc ctccattccc ctgtctcctt tctgacactg cactgcagct gctcctcagc 120cctgccccct ccccagtgag aacaaaccag caacattgct ttttttccta aagagattta 180tattgatccg attaaaaaaa aaaaacctta agaaacccca aacgcaaaaa aaaaaaaaaa 240aaaaaaagaa aaaagaaaag aaaaagccaa aacaaaaggg agaaccttct cccggtagca 300gcggcaggaa ctgcaaacat gatggcggca gctcccatcc agcagaacgg gacccacact 360ggggttccca tagacctgga cccgccggac tcgcggaaaa ggccgctgga agccccccct 420gaagccggca gcaccaagag gaccaatacg ggcgaagacg gccagtattt tctaaaggtt 480ctcataccta gttatgctgc tggatctata attgggaagg gaggacagac aattgttcag 540ttgcaaaaag aaactggagc caccatcaag ctgtctaagt ccaaagattt ttacccaggt 600actactgagc gagtgtgctt gatccaggga acggttgaag cactgaatgc agttcatgga 660ttcattgcag aaaaaattcg agaaatgccc caaaatgtgg ccaagacaga accagtcagc 720attctacaac cccagaccac cgttaatcca gatcgcatca aacaaacatt gccatcttcc 780ccaactacca ccaagtcctc tccatctgat cccatgacca cctccagagc taatcagaag 840cataatatct cctggatatc atgaagcaag atataagaga agaacaaaac aaaatccgta 900attcattgaa agaattgtaa tcatcaatct ttcatattat taatactttg taattatttt 960ctccccaaca gtattttcca gtagattcta atcatgtggt agggcagaag gaaatgtgtt 1020ttttgttgtt catttgtttc ttgtcaatag tcctgattaa tttagctttg ctatactgac 1080ttatatctgg aagtatataa ccaagataag aaaataggtt ttaatatgat catcttaagc 1140taattgtaat gaaaagaact aatggactgt caatattcag aaaaccaaaa ataaaaaata 1200cagaaaacta 1210

* * * * *

References

Patent Diagrams and Documents

D00001


D00002


D00003


D00004


D00005


D00006


D00007


D00008


D00009


D00010


D00011


S00001


XML


uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.

While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.

All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.

© 2024 USPTO.report | Privacy Policy | Resources | RSS Feed of Trademarks | Trademark Filings Twitter Feed