U.S. patent number 10,487,365 [Application Number 15/270,774] was granted by the patent office on 2019-11-26 for methods for detecting expression of lnc-fanci-2 in cervical cells.
This patent grant is currently assigned to THE UNITED STATES OF AMERICA, AS REPRESENTED BY THE SECRETARY, DEPARTMENT OF HEALTH AND HUMAN SERVICES. The grantee listed for this patent is The United States of America, as Represented by the Secretary,Department of Health and Human Services. Invention is credited to Xiaohong Wang, Junfen Xu, Yanqin Yang, Zhi-Ming Zheng, Jun Zhu.
United States Patent |
10,487,365 |
Zheng , et al. |
November 26, 2019 |
Methods for detecting expression of lnc-FANCI-2 in cervical
cells
Abstract
Described herein are biomarkers for HPV-associated pre-cancers
and cancers such as cervical cancer and cervical intraepithelial
neoplasia. The RNA binding protein (RBP) and long-noncoding RNA
(lnc-RNA) biomarkers can be detected and used to diagnose
HPV-associated pre-cancers and cancers. In addition, early
diagnosis of HPV-associated pre-cancers and cancers can facilitate
therapeutic intervention in patients, particularly in the
pre-cancer stage which can delay or prevent progression to
cancer.
Inventors: |
Zheng; Zhi-Ming (Rockville,
MD), Xu; Junfen (Frederick, MD), Zhu; Jun (Potomac,
MD), Yang; Yanqin (Bethesda, MD), Wang; Xiaohong
(Germantown, MD) |
Applicant: |
Name |
City |
State |
Country |
Type |
The United States of America, as Represented by the
Secretary,Department of Health and Human Services |
Bethesda |
MD |
US |
|
|
Assignee: |
THE UNITED STATES OF AMERICA, AS
REPRESENTED BY THE SECRETARY, DEPARTMENT OF HEALTH AND HUMAN
SERVICES (Bethesda, MD)
|
Family
ID: |
61618416 |
Appl.
No.: |
15/270,774 |
Filed: |
September 20, 2016 |
Prior Publication Data
|
|
|
|
Document
Identifier |
Publication Date |
|
US 20180080084 A1 |
Mar 22, 2018 |
|
Current U.S.
Class: |
1/1 |
Current CPC
Class: |
C12Q
1/6886 (20130101); C12Q 2600/112 (20130101); C12Q
2600/158 (20130101) |
Current International
Class: |
C12Q
1/68 (20180101); C12Q 1/6886 (20180101) |
References Cited
[Referenced By]
U.S. Patent Documents
Other References
Chen et al (Biomedicine & Pharmacotherapy. May 2015. 72: 83-90
(Year: 2015). cited by examiner .
Xu et al RNA. May 2015. The Twentieth Annual Meeting of the RNA
Society, Abstract 178, available via URL
<masociety.org/wp-content/uploads/2015/05/RNA-2015-Abstract-Book-print-
-150505.pdf> (Year: 2015). cited by examiner .
Fu et al Med Sci Monito. May 2015. 21: 1276-1287 (Year: 2015).
cited by examiner .
Gibb et al Int J Gynecol Cancer. 2012. 22: 1557-1563 (Year: 2012).
cited by examiner .
Camargo et al.; "GWAS Reveals New Recessive Loci Asociated with
Non-syndromic Facial Clefting"; Eur J Med Genet.; 55(10); pp.
510-514; (2012). cited by applicant .
Expression of GLB1L2 in cancer--Summary--The Human Protein Atlas;
printed May 6, 2015; 1 page;
http://www.proteinatlas.org/ENSG00000149328-GLB1L2/cancer. cited by
applicant .
Flanagan et al.; "Genomics Screen in Transformed Stem Cell Reveals
RNASEH2A, PPAP2C, and ADARB1 as Putative Anticancer Drug Targets";
Mol Cancer Ther; 8(1); pp. 249-260; (2009). cited by applicant
.
Itoh et al.; "Role of Growth Factor Receptor--Bound Protein 7 in
Hepatocellular Carcinoma"; Mol Cancer Res; 5(7); pp. 667-673;
(2007). cited by applicant .
Nadler et al.; "Growth Factor Receptor-bound Protein-7 (Grb7) as a
Prognostic Marker and Therapeutic Target in Breast Cancer"; Annals
of Oncology; 21; pp. 466-473; (2010). cited by applicant .
Takahashi et al.; Manuscript: Significance of Polypyrimidine Tract
Binding Protein 1 Expression in Colorectal Cancer; Published
OnlineFirst Apr. 22, 2015; DOI: 10.1158/1535-7163.MCT-14-0142; 50
pages (2015)_downloaded from mct.aacrjournals.org on Apr. 24, 2015.
cited by applicant .
Wang et al.; "Differential Functions of Growth Factor
Receptor-Bound Protein 7 (GRB7) and Its Valiant GRB7v in Ovarian
Carcinogenesis"; Clin Cancer Res; 16; pp. 2529-2539; (2010). cited
by applicant .
Williams et al.; "A Systems Genetics Approach Identifies CXCL14,
ITGAX, and LPCAT2 as Novel Aggressive Prostate Cancer
Susceptibility Genes"; PLoS Genet; 10(11): e1004809; 15 pages;
(2014). cited by applicant .
Yang et al.; "Identification of Genes with Correlated Patterns
ofVariations in DNA Copy Number and Gene Expression Level in
Gastric Cancer"; Genomics; 89; pp. 451-459; (2007). cited by
applicant .
Zhang et al.; "High Expression of Neuro-Oncological Ventral Antigen
1 Correlates with Poor Prognosis in Hepatocellular Carcinoma"; PLoS
ONE; 9(3); c90955; 11 pages (2014). cited by applicant.
|
Primary Examiner: Myers; Carla J
Attorney, Agent or Firm: Cantor Colburn LLP
Government Interests
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH &
DEVELOPMENT
This invention was made in part with government support from the
National Institutes of Health. The government has certain rights in
this invention.
Claims
The invention claimed is:
1. A method of quantitating an expression level of a lnc-FANCI-2
polynucleotide in a sample containing cells from a test patient's
cervix with one or more first polynucleotides that hybridizes to
the lnc-FANCI-2 polynucleotide, the method comprising contacting
the sample containing cells from the test patient's cervix with the
one or more first polynucleotides, and detecting the level of
hybridization of the one or more first polynucleotides to the
lnc-FANCI-2 polynucleotide, comparing the level of hybridization in
the sample containing cells from the test patient's cervix to a
control level of hybridization in a control sample of normal
cervical tissues, and determining differential expression of the
lnc-FANCI-2 polynucleotide in the sample containing cells from the
test patient's cervix when the level of hybridization for the
sample containing cells from the test patient's cervix is at least
about 300% of the control level of hybridization in the control
sample, wherein the one or more first polynucleotides are SEQ ID
NOs: 78, 79 and 80.
2. The method of claim 1, wherein detecting the level of
hybridization of the one or more first polynucleotides to the
Inc-FANCI-2 polynucleotide is done with real-time RT-PCR.
3. The method of claim 1, wherein the sample containing cells from
the test patient's cervix comprises a PAP smear, a vaginal wash, or
a cervical biopsy sample.
Description
FIELD OF THE DISCLOSURE
The present disclosure is related to novel polynucleotide
biomarkers which can be detected and can be used for the diagnosis
of HPV-associated pre-cancers and HPV-associated cancers such as
cervical cancer and cervical intraepithelial neoplasia as well as
methods of treatment of HPV-associated pre-cancers and
HPV-associated cancers.
BACKGROUND
High-risk HPV persistent infection leads to the development of
certain types of cancers in the cervix, anus, and oropharynx, for
example. Fifteen mucosal HPV types are identified as oncogenic or
high-risk (HR) HPVs, with HPV16 and HPV18 being particularly
associated with invasive cervical cancer. Cervical cancer is the
second most common cancer among women worldwide. Approximately
500,000 incident cases of cervical cancer and approximately 320,000
cervical cancer deaths are estimated each year and more than 80% of
the cases arise in developing countries.
There is a need for diagnostic markers that can be detected and
used for early diagnosis of high-risk HPV infection, HPV-associated
pre-cancer and HPV-associated cancer and for the development of
intervention strategies for treatment of HPV-induced cancers.
SUMMARY
In one aspect, a method of determining if a test patient has stage
1, stage 2, or stage 3 cervical intraepithelial neoplasia or
cervical cancer comprises
determining an expression level of a first polynucleotide biomarker
in a sample containing cells from the test patient's cervix with
one or more first polynucleotides that hybridizes to the first
polynucleotide biomarker, wherein the first polynucleotide
biomarker is lnc-FANCI-2, lnc-GLB1L2-1, GRB7 (SEQ ID NOs: 8-11 and
94), NOVA1 (SEQ ID NOs: 14, 15 and 95), RNASEH2A (SEQ ID NO: 19),
CDKN2A (SEQ ID NOs: 1-4), ELAVL2 (SEQ ID NOs: 5-7), HSPB1 (SEQ ID
NO: 12), KHSRP (SEQ ID NO: 13), PTBP1 (SEQ ID NOs: 16-18), or a
combination thereof,
correlating the expression level of the first polynucleotide
biomarker in the sample containing cells from the test patient's
cervix to a reference expression level of the first polynucleotide
biomarker in a reference sample, wherein the reference sample is a
control sample from a patient or patients with no evidence of
cervical cancer, a control sample from a cervical cancer patient or
patients, or a control sample from a patient or patients with stage
1, stage 2, or stage 3 cervical intraepithelial neoplasia, and
determining, based on said correlation, if the test patient has
cervical cancer, or stage 1, stage 2, or stage 3 cervical
intraepithelial neoplasia.
In another aspect, the method of determining if a test patient has
stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia or
cervical cancer comprises
determining an expression level of a first polynucleotide biomarker
in a sample containing cells from the test patient's cervix with
one or more first polynucleotides that hybridizes to the first
polynucleotide biomarker, wherein the first polynucleotide
biomarker is GRB7 (SEQ ID NOs: 8-11 and 84), NOVA1 (SEQ ID NOs: 14,
15 and 95), RNASEH2A (SEQ ID NO: 19), or a combination thereof,
and/or
determining an expression level of a second polynucleotide
biomarker in the sample containing cells from the test patient's
cervix with one or more second polynucleotides that hybridizes to
the second polynucleotide biomarker, wherein the second
polynucleotide biomarker is lnc-FANCI-2, lnc-GLB1L2-1, or a
combination thereof.
In a further aspect, a method of quantitating an expression level
of a first polynucleotide biomarker in a sample containing cells
from a test patient's cervix with one or more first polynucleotides
that hybridizes to the first polynucleotide biomarker comprises
contacting the sample containing cells from test patient's cervix
with the one or more first polynucleotides, and
detecting the level of hybridization of the one or more first
polynucleotides to the first polynucleotide biomarker,
wherein the first polynucleotide biomarker is lnc-FANCI-2,
lnc-GLB1L2-1, GRB7 (SEQ ID NOs: 8-11 and 94), NOVA1 (SEQ ID NOs:
14, 15 and 95), RNASEH2A (SEQ ID NO: 19), CDKN2A (SEQ ID NOs: 1-4),
ELAVL2 (SEQ ID NOs: 5-7), HSPB1 (SEQ ID NO: 12), KHSRP (SEQ ID NO:
13), PTBP1 (SEQ ID NOs: 16-18), or a combination thereof.
In a yet further aspect, a method of treating a test patient in
need of treatment for stage 1, stage 2, or stage 3 cervical
intraepithelial neoplasia or cervical cancer comprises
determining an expression level of a first polynucleotide biomarker
in a sample containing cells from the test patient's cervix with
one or more first polynucleotides that hybridizes to the first
polynucleotide biomarker, wherein the first polynucleotide
biomarker is lnc-FANCI-2, lnc-GLB1L2-1, GRB7 (SEQ ID NOs: 8-11 and
94), NOVA1 (SEQ ID NOs: 14, 15 and 95), RNASEH2A (SEQ ID NO: 19),
CDKN2A (SEQ ID NOs: 1-4), ELAVL2 (SEQ ID NOs: 5-7), HSPB1 (SEQ ID
NO: 12), KHSRP (SEQ ID NO: 13), PTBP1 (SEQ ID NOs: 16-18), or a
combination thereof,
correlating the expression level of the first polynucleotide
biomarker in the sample containing cells from the test patient's
cervix to a reference expression level of the first polynucleotide
biomarker in a reference sample, wherein the reference sample is a
control sample from a patient or patients with no evidence of
cervical cancer, a control sample from a cervical cancer patient or
patients, or a control sample from a patient or patients with stage
1, stage 2, or stage 3 cervical intraepithelial neoplasia, and
administering a therapeutic intervention for the treatment of stage
1, stage 2, or stage 3 cervical intraepithelial neoplasia, or
cervical cancer when it is determined, based on said expression
levels, that the test patient has stage 1, stage 2, or stage 3
cervical intraepithelial neoplasia.
In a still further aspect, a method of determining if a test
patient has an HPV-associated pre-cancer or an HPV-associated
cancer comprises
determining an expression level of a first polynucleotide biomarker
in a sample containing cells from a tissue of the test patient with
one or more first polynucleotides that hybridizes to the first
polynucleotide biomarker,
correlating the expression level of the first polynucleotide
biomarker in the sample containing cells from the tissue of the
test patient to a reference expression level of the first
polynucleotide biomarker in a reference sample, wherein the
reference sample is a control sample from a patient or patients
with no evidence of HPV-associated pre-cancer or HPV-associated
cancer, a control sample from a patient or patients with
HPV-associated pre-cancer, or a control sample from a patient or
patients with HPV-associated cancer, and
determining, based on said correlation, if the test patient has
HPV-associated pre-cancer or HPV-associated cancer,
wherein the first polynucleotide biomarker is lnc-FANCI-2,
lnc-GLB1L2-1, GRB7 (SEQ ID NOs: 8-11 and 94), NOVA1 (SEQ ID NOs:
14, 15 and 95), RNASEH2A (SEQ ID NO: 19), CDKN2A (SEQ ID NOs: 1-4),
ELAVL2 (SEQ ID NOs: 5-7), HSPB1 (SEQ ID NO: 12), KHSRP (SEQ ID NO:
13), PTBP1 (SEQ ID NOs: 16-18), or a combination thereof.
In another aspect, a method of quantitating an expression level of
a first polynucleotide biomarker in a sample containing cells from
a tissue of the test patient with one or more first polynucleotides
that hybridizes to the first polynucleotide biomarker comprises
contacting the sample containing cells from a tissue of the test
patient with the one or more first polynucleotides, and
detecting the level of hybridization of the one or more first
polynucleotides to the first polynucleotide biomarker,
wherein the first polynucleotide biomarker lnc-FANCI-2,
lnc-GLB1L2-1, is GRB7 (SEQ ID NOs: 8-11 and 94), NOVA1 (SEQ ID NOs:
14, 15 and 95), RNASEH2A (SEQ ID NO: 19), CDKN2A (SEQ ID NOs: 1-4),
ELAVL2 (SEQ ID NOs: 5-7), HSPB1 (SEQ ID NO: 12), KHSRP (SEQ ID NO:
13), PTBP1 (SEQ ID NOs: 16-18), or a combination thereof.
In a yet further aspect, a method of treating a test patient in
need of treatment for an HPV-associated pre-cancer or an
HPV-associated cancer comprises
determining an expression level of a first polynucleotide biomarker
in a sample containing cells from a tissue of the test patient with
one or more first polynucleotides that hybridizes to the first
polynucleotide biomarker,
correlating the expression level of the first polynucleotide
biomarker in the sample containing cells from the tissue of the
test patient to a reference expression level of the first
polynucleotide biomarker in a reference sample, wherein the
reference sample is a control sample from a patient or patients
with no evidence of HPV-associated pre-cancer or HPV-associated
cancer, a control sample from a patient or patients with
HPV-associated pre-cancer, or a control sample from a patient or
patients with HPV-associated cancer, and
administering a therapeutic intervention for the treatment of
HPV-associated pre-cancer or HPV-associated cancer when it is
determined, based on said expression levels, that the test patient
has HPV-associated pre-cancer or an HPV-associated cancer,
wherein the first polynucleotide biomarker is lnc-FANCI-2,
lnc-GLB1L2-1, GRB7 (SEQ ID NOs: 8-11 and 94), NOVA1 (SEQ ID NOs:
14, 15 and 95), RNASEH2A (SEQ ID NO: 19), CDKN2A (SEQ ID NOs: 1-4),
ELAVL2 (SEQ ID NOs: 5-7), HSPB1 (SEQ ID NO: 12), KHSRP (SEQ ID NO:
13), PTBP1 (SEQ ID NOs: 16-18), or a combination thereof.
BRIEF DESCRIPTION OF THE DRAWINGS
The patent or application file contains at least one drawing
executed in color. Copies of this patent or patent application
publication with color drawing(s) will be provided by the Office
upon request and payment of the necessary fee.
FIG. 1 is a flowchart of the RNA-sequencing (RNA-Seq) analyses for
RNA-binding proteins (RBPs).
FIG. 2 shows Venn diagrams showing 95 differentially expressed RBP
genes being identified from two separate RNA-seq analyses of
cervical cancer, pre-cancer to normal cervical tissues.
FIG. 3 shows a heat map comparing 95 differentially expressed RBP
genes in cervical cancer to normal cervical tissues.
FIG. 4 shows the TaqMan.RTM. RT-qPCR validation of the 8 selected
RBPs.
FIG. 5 shows that high-risk HPV16 infection affects the expression
of RBPs. Total RNA extracted from human vaginal keratinocyte
(HVK)-derived raft cultures with (HVK16) or without (HVK)
productive HPV16 infection and human foreskin keratinocyte (HFK)
derived raft cultures with (HFK16) or without (HFK) productive
HPV16 infection were examined by TaqMan.RTM. RT-qPCR for the
expression of 8 RBPs.
FIG. 6 shows that high-risk HPV18 infection affects the expression
of RBPs. Total RNA extracted from human vaginal keratinocyte
(HVK)-derived raft cultures with (HVK18) or without (HVK)
productive HPV18 infection and human foreskin keratinocyte (HFK)
derived raft cultures with (HFK18) or without (HFK) productive
HPV18 infection were examined by TaqMan.RTM. RT-qPCR for the
expression of 8 RBPs.
FIG. 7 shows that both HPV16 and HPV18 increase the expression of
CDKN2A and RNASEH2A, but decrease the expression of NOVA1 in HFK-
and HVK-derived rafts.
FIG. 8 shows that HPV18 infection and viral E6 and/or E7 affect the
expression of RNASEH2A and Nova1. The expression of RNASEH2A and
NOVA1 in primary human keratinocytes (PHK)-derived raft tissues
with or without HPV18 infection on day 8, day 12, and day 16 or PHK
rafts transduced with a retrovirus expression HPV18 E6, E7 or E6E7
or with an empty control retrovirus were further validated by
TaqMan.RTM. RT-qPCR.
FIG. 9 shows that knockdown or overexpression of RNASEH2A in HeLa
or CaSki cells affects cell proliferation. Specific-siRNA knockdown
or ectopic expression of RNASEH2A from a mammalian expression
vector in HeLa or CaSki cells on cell proliferation was evaluated
by Cell Counting Kit-8 (CCK-8) assay
FIG. 10 shows HPV oncoprotein E7 regulates the expression of
RNASEH2A via E2F1. Specific-siRNA knockdown or ectopic expression
of E2F1 from a mammalian expression vector in HeLa or CaSki cells
on RNASEH2A was evaluated by Western blot.
FIG. 11 is a flowchart of the RNA-Seq analyses for long-noncoding
RNAs (lnc-RNAs).
FIG. 12 is a heat map showing 209 overlapped, differentially
expressed lnc-RNAs from cervical cancer, pre-cancer to normal
cervical tissues.
FIG. 13 shows an increase of lnc-FANCI-2, and decrease of
lnc-GLB1L2-1 expression along with the cervical lesion progression
from normal cervix. Lnc-FANCI-2 and lnc-GLB1L2-1 RNA expression was
examined by RT-qPCR in 24 normal, 25 CIN 2-3, and 23 cancer
tissues.
FIG. 14 shows that HPV infection increases lnc-FANCI-2 expression
in HVK- and PHK-derived rafts and viral E7 or E6 is responsible for
the increase. The expression of lnc-FANCI-2 in human vaginal
keratinocytes (HVK)-derived raft tissues without (HVK) or with
HPV16 (HVK16) or HPV18 (HVK18) infection or primary human
keratinocytes (PHK)-derived raft tissues without or with HPV18
infection.
The above-described and other features will be appreciated and
understood by those skilled in the art from the following detailed
description, drawings, and appended claims.
DETAILED DESCRIPTION
Using an RNA-sequencing (RNA-Seq) approach, the inventors of the
present application examined seven normal cervical tissues and
seven cervical cancer tissues for their expression landscapes of
approximately 19,000 coding and 113,513 noncoding RNAs. 614
differentially expressed coding transcripts enriched in cancer
related pathways were identified, with 95 of them encoding
RNA-binding proteins (RBPs) from the analyzed 1502 human RBPs.
Moreover, 209 differentially, abundantly expressed long-noncoding
RNAs (lnc-RNAs) from normal cervix to cervical cancer were
identified. Validation of the altered expression of 26 candidates,
including 8 RBP genes by using TaqMan.RTM. real-time PCR in a
cohort of 47 human cervical tissue samples, including 24 normal
cervical tissues and 23 cervical cancer tissues, showed that they
are broadly involved in cervical carcinogenesis. Many of the
identified RBP candidates had not been previously reported. Using
human vaginal keratinocyte-derived raft culture tissues with or
without HPV16 and HPV18 infection, it was further corroborated that
these RBP candidates, including CDKN2A, ELAVL2, GRB7, HSPB1, KHSRP,
PTBP1, RNASEH2A, and NOVA1, are regulated by HPV infection.
Further, the inventors found that lnc-FANCI-2 was increasingly
expressed along with cervical lesion progression from cervical
intraepithelial neoplasia (CIN) to cervical cancer, when compared
to the normal tissues. In contrast, lncGLB1L2-1 was gradually
decreased along with the lesion progression, when compared to the
normal tissues. In addition, FAM83A, SEMA3F, CLDN10, ASRGL1, which
are not RBPs, were also found to have altered expression in
cervical cancer compared to normal tissue, with FAM83A and SEMA3F
being increased in cervical cancer and CLDN10 and ASRGL1 being
decreased in cervical cancer. The results presented herein provide
the first comprehensive expression atlas of RBPs and lnc-RNAs in
normal cervix and cervical cancer, which can be detected to provide
better diagnosis and treatment of patients with cervical
cancer.
More specifically, an increase of lnc-FANCI-2 RNA, including all of
its 35 isoforms, and a decrease of lnc-GLB1L2-1, including its 21
isoforms, were identified in cervical cancer. Fanconi anemia (FA)
frequently develops squamous cell carcinoma at sites that are
associated with HPV-driven cancer including the female reproductive
tract, and is caused by mutations in one of 15 genes in the FA
pathway (including FANCA, FANCD2, and FANCI). Loss of FA pathway
components FANCA and FANCD2 stimulates E7 protein accumulation in
human keratinocytes, and loss of FANCD2 stimulates HPV DNA
replication. Both FANCI and lnc-FANCI-2 are expressed from the same
location at chromosome 15q26.1. Further, both GLB1L2
(galactosidase, beta 1-like 2) and lnc-GLB1L2-1 are expressed from
Chromosome 11q25, with unknown function in cancer development. By
using TaqMan.RTM. qRT-PCR validation of lnc-FANCI-2 and
lnc-GLB1L2-1 in 24 normal, 25 CIN 2-3, and 23 cervical cancer
tissues, it was confirmed that altered expression of these lnc-RNAs
is remarkably related to cervical lesion progression from CIN to
cancer. Moreover, the altered changes of lnc-FANCI-2 could be
attributed to HPV16 and HPV18 infection in raft cultures and viral
E7 expression. These lnc-RNAs are biomarkers for early diagnosis of
high-risk HPV infection with high risk of progression and for
development of intervention strategies to treat HPV-induced
cancers.
As used herein, a non-coding RNA (ncRNA) is an RNA transcript that
does not encode a protein. ncRNAs include short ncRNAs and long
ncRNAs (lnc-RNAs). Short ncRNAs are ncRNAs that are generally
18-200 nucleotides (nt) in length. Examples of short ncRNAs
include, but are not limited to, microRNAs (miRNAs),
piwi-associated RNAs (piRNAs), short interfering RNAs (siRNAs),
promoter-associated short RNAs (PASRs), transcription initiation
RNAs (tiRNAs), termini-associated short RNAs (TASRs), antisense
termini associated short RNAs (aTASRs), small nucleolar RNAs
(snoRNAs), transcription start site antisense RNAs (TSSa-RNAs),
small nuclear RNAs (snRNAs), retroposon-derived RNAs (RE-RNAs),
3'UTR-derived RNAs (uaRNAs), x-ncRNA, human Y RNA (hY RNA),
unusually small RNAs (usRNAs), small NF90-associated RNAs (snaRs),
vault RNAs (vtRNAs), small Cajal body-specific RNAs (scaRNAs), and
telomere specific small RNAs (tel-sRNAs). lnc-RNAs are cellular
RNAs, exclusive of rRNAs, greater than 200 nucleotides in length
and having no obvious protein-coding capacity. Lnc-RNAs include,
but are not limited to, large or long intergenic ncRNAs (lincRNAs),
transcribed ultraconserved regions (T-UCRs), pseudogenes,
GAA-repeat containing RNAs (GRC-RNAs), long intronic ncRNAs,
antisense RNAs (aRNAs), promoter-associated long RNAs (PALRs),
promoter upstream transcripts (PROMPTs), and long stress-induced
non-coding transcripts (LSINCTs).
An RNA-binding protein is a protein that binds single or double
stranded RNA to form ribonucleoprotein complexes. RBPs contain
conserved structural motifs such as the RNA recognition motif
(RRM), dsRNA binding domain, zinc finger domain, and others.
The biomarkers for detection and diagnosis of CIN and cervical
cancer include the RBP and lnc-RNA biomarkers of Tables 1-3:
TABLE-US-00001 TABLE 1 RBP biomarkers SEQ ID NO: chr start end
refseqID Symbol description 1 chr9 21967750 21975132 NM_000077
CDKN2A cyclin-dependent kinase inhibitor 2A (CDKN2A), transcript
variant 1, mRNA. 2 chr9 21967750 21975132 NM_001195132 CDKN2A Homo
sapiens cyclin-dependent kinase inhibitor 2A (CDKN2A), transcript
variant 5, mRNA. 3 chr9 21967750 21994490 NM_058195 CDKN2A
cyclin-dependent kinase inhibitor 2A (CDKN2A), transcript variant
4, mRNA. 4 chr9 21967750 21974826 NM_058197 CDKN2A cyclin-dependent
kinase inhibitor 2A (CDKN2A), transcript variant 3, mRNA. 5 chr9
23690102 23821843 NM_001171195 ELAVL2 Homo sapiens ELAV (embryonic
lethal, abnormal vision, Drosophila)-like 2 (Hu antigen B)
(ELAVL2), transcript variant 2, mRNA. 6 chr9 23690102 23821478
NM_001171197 ELAVL2 Homo sapiens ELAV (embryonic lethal, abnormal
vision, Drosophila)-like 2 (Hu antigen B) (ELAVL2), transcript
variant 3, mRNA. 7 chr9 23690102 23826063 NM_004432 ELAVL2 ELAV
(embryonic lethal, abnormal vision, Drosophila)-like 2 (Hu antigen
B) (ELAVL2), transcript variant 1, mRNA. 8 chr17 37894575 37903538
NM_001030002 GRB7 growth factor receptor-bound protein 7 (GRB7),
transcript variant 2, mRNA. 9 chr17 37895023 37903538 NM_001242442
GRB7 Homo sapiens growth factor receptor-bound protein 7 (GRB7),
transcript variant 4, mRNA. 10 chr17 37896219 37903538 NM_001242443
GRB7 Homo sapiens growth factor receptor-bound protein 7 (GRB7),
transcript variant 3, mRNA. 11 chr17 37894161 37903538 NM_005310
GRB7 growth factor receptor-bound protein 7 (GRB7), transcript
variant 1, mRNA. 94 chr17 NM_001330207.1 GRB7 growth factor
receptor-bound protein 7 (GRB7), transcript variant 5, mRNA. 12
chr7 75931874 75933614 NM_001540 HSPB1 heat shock 27 kDa protein 1
(HSPB1), mRNA. 13 chr19 6413118 6424822 NM_003685 KHSRP KH-type
splicing regulatory protein (KHSRP), mRNA. 14 chr14 26915088
27066960 NM_002515 NOVA1 neuro-oncological ventral antigen 1
(NOVA1), transcript variant 1, mRNA. 15 chr14 26915088 27066960
NM_006489 NOVA1 neuro-oncological ventral antigen 1 (NOVA1),
transcript variant 2, mRNA. 95 chr14 NM_006491.2 NOVA1
neuro-oncological ventral antigen 1 (NOVA1), transcript variant 3,
mRNA. 16 chr19 797391 812327 NM_002819 PTBP1 polypyrimidine tract
binding protein 1 (PTBP1), transcript variant 1, mRNA. 17 chr19
797391 812327 NM_031990 PTBP1 polypyrimidine tract binding protein
1 (PTBP1), transcript variant 2, mRNA. 18 chr19 797391 812327
NM_031991 PTBP1 polypyrimidine tract binding protein 1 (PTBP1),
transcript variant 3, mRNA. 19 chr19 12917427 12924462 NM_006397
RNASEH2A ribonuclease H2, subunit A (RNASEH2A), mRNA.
TABLE-US-00002 TABLE 2 lnc-FANCI-2 isoforms SEQ ID Transcript ID
NO: Location (hg19) Length lnc-FANCI-2: 1 20 chr15:
89904810-89938553 1613 lnc-FANCI-2: 10 21 chr15: 89921280-89938544
606 lnc-FANCI-2: 11 22 chr15: 89921331-89938354 551 lnc-FANCI-2: 12
23 chr15: 89921347-89939471 1877 lnc-FANCI-2: 13 24 chr15:
89921362-89938500 561 lnc-FANCI-2: 14 25 chr15: 89921794-89931745
786 lnc-FANCI-2: 15 26 chr15: 89922355-89938350 569 lnc-FANCI-2: 16
27 chr15: 89922468-89941720 3779 lnc-FANCI-2: 17 28 chr15:
89922495-89941719 3670 lnc-FANCI-2: 18 29 chr15: 89923111-89941720
3784 lnc-FANCI-2: 19 30 chr15: 89925731-89938271 779 lnc-FANCI-2: 2
31 chr15: 89904810-89938551 1611 lnc-FANCI-2: 20 32 chr15:
89929827-89939471 2718 lnc-FANCI-2: 21 33 chr15: 89930671-89941720
3723 lnc-FANCI-2: 22 34 chr15: 89904810-89941718 4778 lnc-FANCI-2:
23 35 chr15: 89911330-89941718 4113 lnc-FANCI-2: 24 36 chr15:
89911399-89941721 3936 lnc-FANCI-2: 25 37 chr15: 89912393-89941683
4026 lnc-FANCI-2: 26 38 chr15: 89921102-89941708 4334 lnc-FANCI-2:
27 39 chr15: 89921273-89941718 3868 lnc-FANCI-2: 28 40 chr15:
89922232-89941683 3978 lnc-FANCI-2: 29 41 chr15: 89923021-89941683
3837 lnc-FANCI-2: 3 42 chr15: 89905705-89922463 571 lnc-FANCI-2: 30
43 chr15: 89929880-89941721 4915 lnc-FANCI-2: 31 44 chr15:
89930027-89941721 4687 lnc-FANCI-2: 32 45 chr15: 89930389-89931372
706 lnc-FANCI-2: 33 46 chr15: 89930557-89941683 3922 lnc-FANCI-2:
34 47 chr15: 89931724-89941721 3690 lnc-FANCI-2: 35 48 chr15:
89932071-89941708 4093 lnc-FANCI-2: 4 49 chr15: 89905718-89938562
957 lnc-FANCI-2: 5 50 chr15: 89911330-89941718 2124 lnc-FANCI-2: 6
51 chr15: 89912386-89931074 576 lnc-FANCI-2: 7 52 chr15:
89918593-89941720 6547 lnc-FANCI-2: 8 53 chr15: 89921220-89941692
3814 lnc-FANCI-2: 9 54 chr15: 89921273-89941718 4198
TABLE-US-00003 TABLE 3 lnc-GLB1L2-1 isoforms SEQ ID Transcript ID
NO: Location (hg19) Length lnc-GLB1L2-1: 1 55 chr11:
134306367-134337169 1402 bp lnc-GLB1L2-1: 10 56 chr11:
134350719-134372941 295 bp lnc-GLB1L2-1: 11 57 chr11:
134352524-134373110 374 bp lnc-GLB1L2-1: 12 58 chr11:
134306376-134375555 2737 bp lnc-GLB1L2-1: 13 59 chr11:
134339378-134360125 15706 bp lnc-GLB1L2-1: 14 60 chr11:
134339400-134373384 744 bp lnc-GLB1L2-1: 15 61 chr11:
134339400-134375553 1129 bp lnc-GLB1L2-1: 16 62 chr11:
134343291-134373078 1843 bp lnc-GLB1L2-1: 17 63 chr11:
134344051-134375009 1160 bp lnc-GLB1L2-1: 18 64 chr11:
134346572-134375009 572 bp lnc-GLB1L2-1: 19 65 chr11:
134349193-134375555 4435 bp lnc-GLB1L2-1: 2 66 chr11:
134306469-134308558 374 bp lnc-GLB1L2-1: 20 67 chr11:
134349983-134375009 1245 bp lnc-GLB1L2-1: 21 68 chr11:
134350411-134401542 537 bp lnc-GLB1L2-1: 3 69 chr11:
134306629-134374934 1863 bp lnc-GLB1L2-1: 4 70 chr11:
134336079-134357809 3679 bp lnc-GLB1L2-1: 5 71 chr11:
134336079-134357809 3620 bp lnc-GLB1L2-1: 6 72 chr11:
134344060-134350796 720 bp lnc-GLB1L2-1: 7 73 chr11:
134349193-134375507 4387 bp lnc-GLB1L2-1: 8 74 chr11:
134349731-134352843 1398 bp lnc-GLB1L2-1: 9 75 chr11:
134350086-134367700 939 bp
In additional aspects, the biomarker includes FAM83A (SEQ ID NO:
86; KJ895067.1), SEMA3F (SEQ ID NOs: 87-89; NM_004186.4;
NM_001318800.1; NM_001318798.1), CLDN10 (SEQ ID NO: 90-91;
NM_182848.3; NM_006984.4), ASRGL1 (SEQ ID NO: 92, 93;
NM_001083926.1; NM_025080.3), or a combination thereof.
An RBP, lnc-RNA, or additional RNA biomarker is differentially
expressed between two samples if the amount of the RBP, lnc-RNA, or
additional RNA biomarker in one sample is statistically
significantly different from the amount of the RBP, lnc-RNA, or
additional RNA biomarker in the other sample. The expression level
of an RBP, lnc-RNA, or additional RNA biomarker can be increased or
decreased in a test sample relative to a reference sample. For
example, an RBP gene, lnc-RNA, or additional RNA biomarker is
differentially expressed in two samples if it is present at least
about 120%, at least about 130%, at least about 150%, at least
about 180%, at least about 200%, at least about 300%, at least
about 500%, at least about 700%, at least about 900%, or at least
about 1000% greater than it is present in the other sample, or if
it is detectable in one sample and not detectable in the other.
Alternatively or additionally, an RBP gene, lnc-RNA, or additional
RNA biomarker is differentially expressed in two sets of samples if
the frequency of detecting the RBP gene, lnc-RNA, or additional RNA
biomarker in samples is statistically significantly higher or lower
than in the control samples. For example, an RBP gene, lnc-RNA, or
additional RNA biomarker is differentially expressed in two sets of
samples if it is detected at least about 120%, at least about 130%,
at least about 150%, at least about 180%, at least about 200%, at
least about 300%, at least about 500%, at least about 700%, at
least about 900%, or at least about 1000% more frequently or less
frequently observed in one set of samples than the other set of
samples.
A test amount and a control amount of a biomarker can be either an
absolute amount (e.g., number of copies/ml, nanogram/ml or
microgram/ml) or a relative amount (e.g., relative intensity of
signals).
Diagnostic samples for use in the methods described herein comprise
nucleic acids suitable for providing polynucleotide, e.g., RNA,
expression information. The sample contains cells from a tissue of
the test patient. For example, when the HPV-associated pre-cancer
or HPV-associated cancer is anal cancer, the tissue of the test
patient contains anal cells; when the HPV-associated pre-cancer or
HPV-associated cancer is vulvovaginal cancer, the tissue of the
test patient contains vulvovaginal cells; when the HPV-associated
pre-cancer or HPV-associated cancer is penile cancer, the tissue of
the test patient contains penal cells; or when the HPV-associated
pre-cancer or HPV-associated cancer is oropharyngeal cancer, the
tissue of the test patient contains oropharyngeal cells.
In one aspect, samples for the methods disclosed herein contain
cells from a patient's cervix. Exemplary test samples include a PAP
smear, a vaginal wash, or a cervical biopsy sample. In certain
aspects, the methods described herein include obtaining from the
test patient the sample containing cells from the test patient's
cervix.
In certain aspects, the test patient is a patient at risk for an
HPV-associated pre-cancer or an HPV-associated cancer, such as a
patient diagnosed with HPV infection or a patient at high risk for
HPV infection.
In certain aspects, the test patient is a patient at high risk for
cervical cancer such as a woman at high risk for HPV infection, a
woman with a diagnosed HPV infection, a woman with a history of DES
exposure, a woman with a previous history of gynecological cancer,
a woman with an abnormal PAP test, a woman immunosuppressed due to
AIDS or therapy following organ transplantation, or a woman with
abnormal endometrial cells.
In certain aspects, the methods disclosed herein comprise detecting
the expression level of one or more biomarkers as disclosed
herein.
In addition, the methods disclosed herein include the
comparison/correlation of the expression levels of biomarkers in
the diagnostic sample from the test patient to a reference sample.
Exemplary reference samples include a control sample from a patient
or patients with no evidence of HPV-associated pre-cancer or
HPV-associated cancer, a control sample from a patient or patients
with HPV-associated pre-cancer, and a control sample from a patient
or patients with HPV-associated cancer. Additional exemplary
reference samples include a control sample from a patient or
patients with no evidence of cervical cancer, a control sample from
a cervical cancer patient or patients, or a control sample from a
patient or patients with stage 1, stage 2, or stage 3 cervical
intraepithelial neoplasia. The reference sample can be a single
sample from a control patient with a known disease state, or
preferably samples from a plurality of subjects such that the
reference expression level is averaged over the expression levels
for a population of known disease state. Useful population sizes
for a reference population are greater than 100 subjects,
specifically about 500 subjects for each reference group (CIN 1, 2,
3 and cervical cancer), for example.
RNA can be extracted and purified from biological samples using
suitable techniques that are known in the art, and several are
commercially available (e.g., FormaPure.RTM. nucleic acid
extraction kit, Agencourt.RTM. Biosciences, Beverly Mass., High
Pure FFPE RNA Micro Kit, Roche Applied Science, Indianapolis,
Ind.). RNA can be extracted from frozen tissue sections using
TRIzol.RTM. (Invitrogen, Carlsbad, Calif.) and purified using
RNeasy.RTM. Protect kit (Qiagen, Valencia, Calif.). RNA can be
further purified using DNase I treatment (Ambion, Austin, Tex.) to
eliminate any contaminating DNA. RNA concentrations can be made
using a NanoDrop ND-1000 spectrophotometer (NanoDdrop Technologies,
Rockland, Del.). RNA can be further purified to eliminate
contaminants that interfere with cDNA synthesis by cold sodium
acetate precipitation. RNA integrity can be evaluated by running
electropherograms, and RNA integrity number (RIN, a correlative
measure that indicates intactness of mRNA) can be determined using
the RNA 6000 PicoAssay for the Bioanalyzer 2100 (Agilent
Technologies, Santa Clara, Calif.).
Following sample collection and nucleic acid extraction, the
nucleic acid portion of the sample comprising RNA that is or can be
used to prepare the target polynucleotide(s) of interest can be
subjected to one or more preparative reactions. These preparative
reactions can include in vitro transcription (IVT), labeling,
fragmentation, amplification, and other reactions. mRNA can first
be treated with reverse transcriptase and a primer to create cDNA
prior to detection, quantitation, or amplification; this can be
done in vitro with purified mRNA or in situ, e.g., in cells or
tissues affixed to a slide.
By "amplification" is meant a process of producing at least one
copy of a nucleic acid, in this case an expressed RNA, and in many
cases produces multiple copies. An amplification product can be RNA
or DNA, and may include a complementary strand to the expressed
target sequence. DNA amplification products can be produced
initially through reverse transcription and then optionally from
further amplification reactions. The amplification product may
include all or a portion of a target sequence, and may optionally
be labeled. A variety of amplification methods are suitable for
use, including polymerase-based methods and ligation-based
methods.
The expression level of a polynucleotide biomarker can be
determined by reverse transcriptase-polymerase chain reaction
(RT-PCR) methods, quantitative real-time RT-PCR (RT-qPCR),
microarray, serial analysis of gene expression (SAGE),
next-generation RNA sequencing (deep sequencing), gene expression
analysis by massively parallel signature sequencing (MPSS),
immunoassays such as ELISA, in situ hybridization (ISH)
formulations that allow histopathological analysis, mass
spectrometry (MS) methods, transcriptomics, RNA pull-down and
chromatin isolation by RNA purification (ChiRP), proteomics-based
identification of lncRNA, detection of single nucleotide
polymorphisms (SNPs), measurement of DNA methylation or
unmethylation, measurement of siRNA silencing or miRNA silencing,
or measurement of downstream targets.
As used herein, the terms "quantitative real time polymerase chain
reaction," "real-time polymerase chain reaction," and "qPCR" are
synonymous and refer to a laboratory technique based on a
polymerase chain reaction used to amplify and simultaneously
quantify a targeted DNA molecule. Frequently, real-time PCR is
combined with reverse transcription to quantify messenger RNA and
non-coding RNA in cells or tissues, e.g., RT-qPCR.
Additional methods for detecting and/or quantifying a
polynucleotide biomarker can comprise single-molecule sequencing
(e.g., Illumina.RTM., PacBio, ABI SOLID.TM.), in situ
hybridization, bead-array technologies (e.g., Luminex xMAP.RTM.,
Illumina.RTM. BeadChips), branched DNA technology (e.g.,
Affymetrix.RTM., Genisphere.RTM.), and Ion Torrent.TM.. In some
instances, methods for detecting and/or quantifying a target
sequence comprise transcriptome sequencing techniques.
Transcription sequencing (e.g., RNA-seq, "Whole Transcriptome
Shotgun Sequencing" (WTSS)) may comprise the use of high-throughput
sequencing technologies to sequence cDNA in order to get
information about a sample's RNA content. Transcriptome sequencing
can provide information on differential expression of genes,
including gene alleles and differently spliced transcripts,
non-coding RNAs, post-transcriptional mutations or editing, and
gene fusions.
Included herein is a method for measuring the expression levels of
biomarkers for HPV-associated pre-cancers and cancers as described
herein. The methods optionally include identifying HPV-associated
pre-cancer or cancer status of a test subject (e.g., cervical
cancer). The data obtained from the expression profiles of a
population (e.g., normal, CIN1-3, or cervical cancer) can be
evaluated using one or more pattern recognition algorithms. In
addition, the results of imaging tests or histological evaluation
may optionally be combined with expression profiles generated using
the genes disclosed herein.
In one aspect, the methods include
comparing (correlating) the expression level of the first
polynucleotide biomarker in the sample containing cells from a
tissue of the test patient to a reference expression level of the
first polynucleotide biomarker in a reference sample, wherein the
reference sample is
a control sample from a patient or patients with no evidence of
HPV-associated pre-cancer or HPV-associated cancer,
a control sample from a patient or patients with HPV-associated
pre-cancer, or
a control sample from a patient or patients with HPV-associated
cancer, and
determining, based on said correlation, if the test patient has
HPV-associated pre-cancer or HPV-associated cancer
In another aspect, the methods comprise
predicting (or determining), based on the expression level of one
or more polynucleotide biomarkers in the containing cells from a
tissue of the test patient and a reference expression level of the
one or more polynucleotide biomarkers in a reference sample that
the patient has no HPV-associated pre-cancer or cancer, that the
test patient has HPV-associated pre-cancer, or that the patient has
HPV-associated cancer, wherein the reference sample is
a control sample from a patient or patients with no evidence of
HPV-associated pre-cancer or HPV-associated cancer,
a control sample from a patient or patients with HPV-associated
pre-cancer, or
a control sample from a patient or patients with HPV-associated
cancer.
In a further aspect, the methods include
classifying the patient as having no cervical cancer or cervical
intraepithelial neoplasia, or as having HPV-associated pre-cancer
or cancer based on the expression level of one or more
polynucleotide biomarkers in the sample containing cells from a
tissue of the test patient and a reference expression level of the
one or more polynucleotide biomarkers in a reference sample,
wherein the reference sample is
a control sample from a patient or patients with no evidence of
HPV-associated pre-cancer or HPV-associated cancer,
a control sample from a patient or patients with HPV-associated
pre-cancer, or a control sample from a patient or patients with
HPV-associated cancer.
In one aspect, the methods include
comparing (or correlating) the expression level of one or more
polynucleotide biomarkers in the sample containing cells from the
test patient's cervix to a reference expression level of the one or
more polynucleotide biomarkers in a reference sample, wherein the
reference sample is
a control sample from a patient or patients with no evidence of
cervical cancer,
a control sample from a cervical cancer patient or patients, or
a control sample from a patient or patients with stage 1, stage 2,
or stage 3 cervical intraepithelial neoplasia, and
determining, based on said comparison, if the test patient has
cervical cancer, or stage 1, stage 2, or stage 3 cervical
intraepithelial neoplasia.
In another aspect, the methods comprise
predicting (or determining), based on the expression level of one
or more polynucleotide biomarkers in the sample containing cells
from the test patient's cervix and a reference expression level of
the one or more polynucleotide biomarkers in a reference sample
that the patient has no cervical cancer or cervical intraepithelial
neoplasia, that the test patient has cervical cancer, or that the
patient has stage 1, stage 2, or stage 3 cervical intraepithelial
neoplasia, wherein the reference sample is
a control sample from a patient or patients with no evidence of
cervical cancer,
a control sample from a cervical cancer patient or patients, or
a control sample from a patient or patients with stage 1, stage 2,
or stage 3 cervical intraepithelial neoplasia.
In a further aspect, the methods include
classifying the patient as having no cervical cancer or cervical
intraepithelial neoplasia, as having cervical cancer, or as having
stage 1, stage 2, or stage 3 cervical intraepithelial neoplasia
based on the expression level of one or more polynucleotide
biomarkers in the sample containing cells from the test patient's
cervix and a reference expression level of the one or more
polynucleotide biomarkers in a reference sample, wherein the
reference sample is
a control sample from a patient or patients with no evidence of
cervical cancer,
a control sample from a cervical cancer patient or patients, or
a control sample from a patient or patients with stage 1, stage 2,
or stage 3 cervical intraepithelial neoplasia.
Analysis methods may be used to form a predictive model, and then
the predictive model may be used to classify test data. For
example, one convenient and particularly effective method of
classification employs multivariate statistical analysis modeling,
first to form a model (a "predictive mathematical model") using
data ("modeling data") from samples of known class (e.g., from
subjects known to have, or not have, a particular grade of CIN or
cervical cancer), and second to classify an unknown sample (e.g.,
"test data"), according to HPV-associated (e.g., cervical) cancer
status.
Pattern recognition (PR) is the use of multivariate statistics,
both parametric and non-parametric, to analyze spectroscopic data,
and hence to classify samples and to predict the value of some
dependent variable based on a range of observed measurements. There
are two main approaches. One set of methods is termed
"unsupervised" and these simply reduce data complexity in a
rational way and also produce display plots which can be
interpreted by the human eye. The other approach is termed
"supervised" whereby a training set of samples with known class or
outcome is used to produce a mathematical model and is then
evaluated with independent validation data sets.
Unsupervised PR methods are used to analyze data without reference
to any other independent knowledge. Examples of unsupervised
pattern recognition methods include principal component analysis
(PCA), hierarchical cluster analysis (HCA), and non-linear mapping
(NLM).
Alternatively, and in order to develop automatic classification
methods, it has proved efficient to use a "supervised" approach to
data analysis. Here, a "training set" of biomarker expression data
is used to construct a statistical model that predicts correctly
the "class" of each sample. This training set is then tested with
independent data (referred to as a test or validation set) to
determine the robustness of the computer-based model. These models
are sometimes termed "expert systems," but may be based on a range
of different mathematical procedures. Supervised methods can use a
data set with reduced dimensionality (for example, the first few
principal components), but typically use unreduced data, with all
dimensionality. In all cases the methods allow the quantitative
description of the multivariate boundaries that characterize and
separate each class, for example, each class of cervical cancer in
terms of its biomarker expression profile. It is also possible to
obtain confidence limits on any predictions, for example, a level
of probability to be placed on the goodness of fit. The robustness
of the predictive models can also be checked using
cross-validation, by leaving out selected samples from the
analysis.
It is often useful to pre-process data, for example, by addressing
missing data, translation, scaling, weighting, etc. Multivariate
projection methods, such as principal component analysis (PCA) and
partial least squares analysis (PLS), are so-called scaling
sensitive methods. By using prior knowledge and experience about
the type of data studied, the quality of the data prior to
multivariate modeling can be enhanced by scaling and/or weighting.
Adequate scaling and/or weighting can reveal important and
interesting variation hidden within the data, and therefore make
subsequent multivariate modeling more efficient. Scaling and
weighting may be used to place the data in the correct metric,
based on knowledge and experience of the studied system, and
therefore reveal patterns already inherently present in the
data.
The methods described herein may be implemented and/or the results
recorded using a device capable of implementing the methods and/or
recording the results. Examples of devices that may be used include
but are not limited to electronic computational devices, including
computers of all types. When the methods described herein are
implemented and/or recorded in a computer, the computer program
that may be used to configure the computer to carry out the steps
of the methods may be contained in any computer readable medium
capable of containing the computer program. Examples of computer
readable medium that may be used include but are not limited to
diskettes, CD-ROMs, DVDs, ROM, RAM, and other memory and computer
storage devices. The computer program that may be used to configure
the computer to carry out the steps of the methods and/or record
the results may also be provided over an electronic network, for
example, over the internet, an intranet, or other network.
The process of comparing a measured value and a reference value can
be carried out in a convenient manner appropriate to the type of
measured value and reference value for the discriminative gene at
issue. "Measuring" can be performed using quantitative or
qualitative measurement techniques, and the mode of comparing a
measured value and a reference value can vary depending on the
measurement technology employed. For example, when a qualitative
colorimetric assay is used to measure expression levels, the levels
may be compared by visually comparing the intensity of the colored
reaction product, or by comparing data from densitometric or
spectrometric measurements of the colored reaction product (e.g.,
comparing numerical data or graphical data, such as bar charts,
derived from the measuring device). However, it is expected that
the measured values used in the methods will most commonly be
quantitative values. In other examples, measured values are
qualitative. As with qualitative measurements, the comparison can
be made by inspecting the numerical data, or by inspecting
representations of the data (e.g., inspecting graphical
representations such as bar or line graphs).
The process of comparing may be manual (such as visual inspection
by the practitioner of the method) or it may be automated. For
example, an assay device (such as a luminometer for measuring
chemiluminescent signals) may include circuitry and software
enabling it to compare a measured value with a reference value for
a biomarker. Alternately, a separate device (e.g., a digital
computer) may be used to compare the measured value(s) and the
reference value(s). Automated devices for comparison may include
stored reference values for the biomarker(s) being measured, or
they may compare the measured value(s) with reference values that
are derived from contemporaneously measured reference samples
(e.g., samples from control subjects).
As will be apparent to those of skill in the art, when replicate
measurements are taken, the measured value that is compared with
the reference value is a value that takes into account the
replicate measurements. The replicate measurements may be taken
into account by using either the mean or median of the measured
values as the "measured value."
When it has been determined that the test patient has
HPV-pre-cancer or cancer, the methods optionally include HPV
detection and or typing.
When it has been determined that the test patient has CIN 1, 2, or
3 cervical cancer, the methods optionally include HPV detection and
or typing, for example, using the Cobas.RTM. HPV test marketed by
Roche Diagnostics.
Also included herein are methods of treating the test patient with
an interventional strategy for HPV-associated pre-cancer or
cancer.
Interventional therapies for anal, vulvovaginal, penile, and
oropharyngeal cancer include radiation therapy, surgery, and
chemotherapy.
Further included herein are methods of treating the test patient
with an interventional strategy for CIN or cervical cancer. When
the patient is determined to have stage 1 CIN, the interventional
strategy may include screening for further cervical changes,
screening the patient for HPV infection, HPV typing, or a
combination thereof. Exemplary tests for the detection of HPV
infection include detection of HPV infection via DNA/RNA
amplification with PCR using, for example, the Cobas.RTM. HPV test
marketed by Roche Diagnostics. Advantageously, early identification
of CIN 1 optionally coupled with determining the HPV infection type
will provide critical information regarding the type of
intervention required to treat the patient. Early diagnosis and
treatment at stage CIN 1 could prevent or slow progression to later
disease stages.
When the patient is determined to have stage 2 or stage 3 CIN,
interventional strategies may include, in addition to monitoring,
cryosurgery to freeze abnormal cells, laser therapy to remove
abnormal tissue, loop electrosurgical procedure excision, surgery
to remove abnormal tissue, or hysterectomy. At early stages, for
example, low cost outpatient procedures such as loop
electrosurgical excision are 90-95% effective. Thus, a benefit to
the methods disclosed herein is the ability to use minor surgical
intervention before CIN progresses to cervical cancer.
Interventional strategies for the treatment of cervical cancer
include surgery, radiation therapy, chemotherapy, targeted therapy,
or a combination thereof. Surgery involves removal of the cancer
and may include conization to remove tissue from the cervix and/or
cervical canal or hysterectomy such as total, radical, modified
radical hysterectomy. Radiation therapy includes internal and
external radiation therapy in addition to intensity-modulated
radiation therapy. Chemotherapy involves the use of drugs to
inhibit the growth of cancer calls and can involve systemic or
regional chemotherapy. Drugs approved for the treatment of cervical
cancer include bleomycin, cisplatin, topotecan hydrochloride, and
gemcitabine-cisplatin. Targeted therapy involves the use of drugs
that identify and attack specific cancer cells without harming
normal cells. Targeted therapy includes antibody therapy such as
bevacizumab therapy.
Further disclosed herein, is a probe set for diagnosing,
predicting, and/or monitoring cervical cancer in a subject. The
probe set comprises a plurality of polynucleotide probes capable of
detecting an expression level of at least one biomarker for CIN or
cervical cancer, wherein the expression level determines the CIN or
cervical cancer status of the subject.
In one aspect, a probe set comprises
one or more polynucleotides that hybridizes to a first
polynucleotide biomarker, wherein the first polynucleotide
biomarker is GRB7 (SEQ ID NOs: 8-11), NOVA1 (SEQ ID Nos: 14 and
15), RNASEH2A (SEQ ID NO: 19), or a combination thereof, and
one or more polynucleotides that hybridizes to a second
polynucleotide biomarker, wherein the second polynucleotide
biomarker is lnc-FANCI-2, lnc-GLB1L2-1, or a combination
thereof.
In certain aspects, the probe set is attached to a solid support,
and/or each member of the probe set comprises a detectable
moiety.
One skilled in the art understands that the nucleotide sequence of
the polynucleotide probe need not be identical to its target
sequence in order to specifically hybridize thereto. The
polynucleotide probes, therefore, comprise a nucleotide sequence
that is at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, or more
identical to a region of the coding target or non-coding target.
Methods of determining sequence identity are known in the art and
can be determined, for example, by using the BLASTN program of the
University of Wisconsin Computer Group (GCG) software or provided
on the NCBI website. The nucleotide sequence of the polynucleotide
probes may exhibit variability by differing (e.g. by nucleotide
substitution, including transition or transversion) at one, two,
three, four or more nucleotides from the sequence of the coding
target or non-coding target.
Primers/probes based on the nucleotide sequences of target
sequences can be used in amplification of the target sequences. For
use in amplification reactions such as PCR, a pair of primers can
be used. The exact composition of the primer sequences is selected
so that the primers hybridize to specific sequences of the probe
set under stringent conditions, particularly under conditions of
high stringency. The pairs of primers are usually chosen so as to
generate an amplification product of at least about 50 nucleotides,
more usually at least about 100 nucleotides. Algorithms for the
selection of primer sequences are generally known, and are
available in commercial software packages. These primers may be
used in standard quantitative or qualitative PCR-based assays to
assess transcript expression levels of RNAs defined by the probe
set. Alternatively, these primers may be used in combination with
probes, such as molecular beacons in amplifications using real-time
PCR.
The polynucleotide probes or primers can incorporate moieties
useful in detection, isolation, purification, or immobilization, if
desired. Such moieties are detectable labels, such as
radioisotopes, fluorophores, chemiluminophores, enzymes, colloidal
particles, and fluorescent microparticles, as well as antigens,
antibodies, haptens, avidin/streptavidin, biotin, haptens, enzyme
cofactors/substrates, enzymes, and the like. A label can optionally
be attached to or incorporated into a probe or primer
polynucleotide to allow detection and/or quantitation of a target
polynucleotide representing the target sequence of interest.
In some embodiments, one or more polynucleotide probes/primers
provided herein can be provided on a substrate. The substrate can
comprise a wide range of material, either biological,
nonbiological, organic, inorganic, or a combination of any of
these. For example, the substrate may be a polymerized Langmuir
Blodgett film, functionalized glass, Si, Ge, GaAs, GaP, SiO.sub.2,
SiN.sub.4, modified silicon, or any one of a wide variety of gels
or polymers such as (poly)tetrafluoroethylene,
(poly)vinylidenedifluoride, polystyrene, cross-linked polystyrene,
polyacrylic, polylactic acid, polyglycolic acid, poly(lactide
coglycolide), polyanhydrides, poly(methyl methacrylate),
poly(ethylene-co-vinyl acetate), polysiloxanes, polymeric silica,
latexes, dextran polymers, epoxies, polycarbonates, or combinations
thereof. Conducting polymers and photoconductive materials can be
used.
Substrates can be planar crystalline substrates such as silica
based substrates (e.g., glass, quartz, or the like), or crystalline
substrates used in, e.g., the semiconductor and microprocessor
industries, such as silicon, gallium arsenide, indium doped GaN and
the like, and include semiconductor nanocrystals.
The substrate can take the form of an array, a photodiode, an
optoelectronic sensor such as an optoelectronic semiconductor chip
or optoelectronic thin-film semiconductor, or a biochip. The
location(s) of probe(s) on the substrate can be addressable; this
can be done in highly dense formats, and the location(s) can be
microaddressable or nanoaddressable.
The substrate can be a plate, slide, bead, pellet, disk, particle,
microparticle, nanoparticle, strand, precipitate, optionally porous
gel, sheets, tube, sphere, capillary, film, chip, multiwell plate
or dish, optical fiber, etc. The substrate can be a form that is
rigid or semi-rigid. The substrate may contain raised or depressed
regions on which an assay component is located. The surface of the
substrate can be etched using known techniques to provide for
desired surface features, for example trenches, v-grooves, mesa
structures, or the like.
Surfaces on the substrate can be composed of the same material as
the substrate or can be made from a different material, and can be
coupled to the substrate by chemical or physical means. Such
coupled surfaces may be composed of any of a wide variety of
materials, for example, polymers, plastics, resins,
polysaccharides, silica or silica-based materials, carbon, metals,
inorganic glasses, membranes, or any of the above-listed substrate
materials. The surface can be optically transparent and can have
surface Si--OH functionalities, such as those found on silica
surfaces.
The substrate and/or its optional surface can be chosen to provide
appropriate characteristics for the synthetic and/or detection
methods used. The substrate and/or surface can be transparent to
allow the exposure of the substrate by light applied from multiple
directions. The substrate and/or its surface is generally resistant
to, or is treated to resist, the conditions to which it is to be
exposed in use, and can be optionally treated to remove any
resistant material after exposure to such conditions.
The substrate or a region thereof may be encoded so that the
identity of the sensor located in the substrate or region being
queried may be determined. A suitable coding scheme can be used,
for example optical codes, RFID tags, magnetic codes, physical
codes, fluorescent codes, and combinations of codes.
The invention is further illustrated by the following non-limiting
examples.
EXAMPLES
Materials and Methods
Human patient samples: Samples for RNA sequencing, containing 7
normal cervical tissues, 7 pre-cancer tissues and 7 cervical cancer
tissues, and samples for validation, including 24 normal cervical
tissues, 25 CIN 2-3 tissues, and 23 cervical cancer tissues, were
all collected from the Women's Hospital, School of Medicine,
Zhejiang University. All the human samples were used in accordance
with the Institutional Review Board procedures of the hospital.
Informed consent was obtained from each participant prior to the
study. Samples were snap-frozen and stored at -80.degree. C. until
use.
RNA isolation: RNA was isolated from each human tissue sample by
TRIzol.RTM. (Invitrogen, CA, USA) according to the instructions
provided by the manufacturer. Total RNA quality and quantity were
verified spectrophotometrically (NanoDrop ND-1000 spectrometer;
Thermo Scientific, DE, USA) and electrophoretically (Bioanalyzer
2100; Agilent Technologies, CA, USA).
RNA sequencing and mapping: RNA-seq libraries were prepared using
TruSeq.RTM. Stranded Total RNA Sample Preparation Kit with
Ribo-Zero.TM. depletion and sequenced on an Illumina.RTM.
HiSeq.TM.-2500 platform as paired-end reads. In brief, high-quality
of human total RNA (1 .mu.g) was Ribo-Zero.TM. depleted,
fragmented, and then reverse transcribed. The double-stranded cDNA
were A-tailed and ligated with Illumia.RTM. sequencing adapters.
Subsequently, the ligated products were enriched by PCR and
size-selected by agarose gel electrophoresis. The products of
approximately 200-400-bp in size were sequenced by the
Illumina.RTM. HiSeq.TM.-2500 platform. The raw data in fastq format
were mapped to the human reference genome (hg19, GRCh37) by Tophat
v2.0.11(-g 1), which had the aligner Bowtie (v2.2.1.0) with the
parameter settings (-N 0, -L 20, -i S,1,1.25, -n-ceil L,0,0.15 and
-gbar 4). The mapping results were further sorted in coordination
position by samtools (v0.1.19.0) (Robinson M D, Oshlack A., "A
scaling normalization method for differential expression analysis
of RNA-seq data," Genome Biology, 11:R25 (2010); Robinson M D,
McCarthy D J and Smyth G K., "edgeR: a Bioconductor package for
differential expression analysis of digital gene expression data,"
Bioinformatics, 26, pp. 139-140 (2010)). The latest annotation of
LncRNA was downloaded from the publicly available Incipedia
database version 3.0. The mapped reads in individual lncRNA region
of each sample were counted by bedtools (v2.19.0). The R
Bioconductor edgeR package was used to normalize raw reads by the
scaling method. Differentially expressed lncRNAs were identified by
one-way ANOVA method with 10% false discovery rate (FDR) and
four-fold changes between the conditions. The FDR was controlled by
the Benjamini-Hochberg (BH) procedure. RNA-binding protein genes
were compiled from the literature (Alfredo Castello, et al.,
"Insights into RNA Biology from an Atlas of Mammalian mRNA-Binding
Proteins," Cell, 149, pp. 1393-1406 (2012); Alfredo Castello, et
al., "RNA-binding proteins in Mendelian disease," Trends in
Genetics, 29, pp. 318-327 (2013)). The normalized reads from the
multiple transcripts of each gene were averaged to represent
composite gene expression. The expression results were clustered
using unsupervised hierarchical clustering analysis, in which the
Euclidean Distance is used as the similarity measure.
Human primary keratinocytes and organotypic (raft) epithelial
cultures: Total RNA extracted from various raft tissues were
leftovers from previous studies (Wang, X. et al., "Oncogenic HPV
infection interrupts the expression of tumor-suppressive miR-34a
through viral oncoprotein E6," RNA, 15, pp. 637-647 (2009); Wang,
X., et al., "microRNAs are biomarkers of oncogenic human
papillomavirus infections," Proc. Natl. Acad. Sci. USA, 111, pp.
4262-4267 (2014)). Briefly, primary human foreskin keratinocytes
(HFK) and primary human vaginal keratinocytes (HVK) were isolated
from newborn circumcision and adult vaginectomy tissue specimens,
respectively, as previously described (Meyers, C., Mayer, T. J.,
and Ozbun, M. A., "Synthesis of infectious human papillomavirus
type 18 in differentiating epithelium transfected with viral DNA,"
J. Virol., 71, pp, 7381-7386 (1997)). Keratinocytes were grown in
monolayer culture by using epithelial (E) medium plus epidermal
growth factor (5 ng/ml) in the presence of mitomycin C (4
.mu.g/ml)-treated J2 3T3 feeder cells. Keratinocyte lines stably
maintaining HPV16 and HPV18 DNA following electroporation were
subcloned by limiting dilutions of cells. Organotypic (raft)
epithelial culture tissues derived from HPV16 and
HPV18-immortalized HFK or HVK were prepared as described previously
(McLaughlin-Drubin, M. E. and Meyers, C., "Propagation of
infectious, high-risk HPV in organotypic "raft" culture," Methods
Mol. Med., 119, pp. 171-186 (2005)). The stratified and
differentiated raft culture epidermal tissues were collected free
from collagen (no fibroblasts) on day 10 and frozen on dry ice for
total cell RNA preparation. Additional productive HPV18 raft
cultures of HFKs were obtained by Cre-loxP-mediated recombination
as described (Wang, H. K., Duffy, A. A., Broker, T. R., and Chow,
L. T., "Robust production and passaging of infectious HPV in
squamous epithelium of primary human keratinocytes", Genes Dev.,
23, pp. 181-194 (2009)), and the derived raft cultures were
collected on day 8, day 12, and day 16.
Plasmid pLJd-HPV-18URR-E6, pLC-HPV-18URR-E7, and
pLJd-HPV-18URR-E6E7 have been described (Cheng, S.,
Schmidt-Grimminger, D. C., Murant, T., Broker, T. R., and Chow, L.
T., "Differentiation-dependent up-regulation of the human
papillomavirus E7 gene reactivates cellular DNA replication in
suprabasal differentiated keratinocytes.," Genes Dev., 9, pp.
2335-2349 (1995); Genovese, N. J., Banerjee, N. S., Broker, T. R.,
and Chow, L. T., "Casein kinase II motif-dependent phosphorylation
of human papillomavirus E7 protein promotes p130 degradation and
S-phase induction in differentiated human keratinocytes," J.
Virol., 82, pp. 4862-4873 (2008)). Retroviruses derived from the
above vectors were prepared as described (Banerjee, N. S., Chow, L.
T., and Broker, T. R., "Retrovirus-mediated gene transfer to
analyze HPV gene regulation and protein functions in organotypic
"raft" cultures," Methods Mol. Med., 119, pp. 187-202 (2005)).
Primary HFKs were acutely infected with the retroviruses and
selected with G-418 (300 .mu.g/mL). The selected HFKs were used to
establish epithelial raft cultures and harvested on day 11.
TaqMan.RTM. real-time quantitative PCR assays: Quantitative
validation of genes in clinical samples and raft tissues was
analyzed by real-time PCR TaqMan.RTM. gene expression assays
(Applied Biosystems). In brief, 2 .mu.g of total RNA from each
sample was reversely transcribed using Superscript.RTM. First-stand
Synthesis kit (Invitrogen) according to the manufacturer's
instructions. TaqMan.RTM. gene expression assays for RNA-binding
protein gene expression were obtained from life technologies and
lncRNA primers for RT-qPCR were designed as given in Example 2.
The TaqMan.RTM. assay probes that span over exon-exon junctions
were designed to amplify spliced RNA products to avoid detection of
any contaminated residual genomic DNA in our RNA samples. After
reverse transcription, PCR products were amplified from the cDNA
samples using TaqMan.RTM. gene expression Master Mix (Applied
Biosystems) together with TaqMan.RTM. gene expression assays on a
StepOne Plus.TM. Real-Time PCR system (Applied Biosystems). Gene
enrichment was calculated using the 2.sup.-.DELTA..DELTA.Ct method
in relation to the housekeeping gene GAPDH. The mean Ct value of a
given gene from 24 normal cervical tissues after normalization was
served as a basal level to calculate a relative level of the gene
detected in each clinical sample. Data are presented as a bar graph
with mean.+-.SE for each group. Significance of mRNA levels among
clinical tissue groups was analyzed using the nonparametric
Mann-Whitney U-test, while significance of the mRNA levels between
raft culture tissue groups was analyzed by Student t-test.
Example 1: Identification of Altered Expression of RNA-Binding
Protein Genes in Cervical Cancer
Using RNA-sequencing (RNA-Seq) approach, seven normal cervical
tissues and seven cervical cancer tissues were examined for their
expression landscapes of approximately 19,000 coding and 113,513
noncoding RNAs. We identified 614 differentially expressed coding
transcripts enriched in cancer related pathways and 95 of them
encoding RNA-binding proteins (RBPs) from the analyzed 1502 human
RBPs. Moreover, we identified 34 differentially, abundantly
expressed lnc-RNAs from normal cervix to cervical cancer. Table 4
shows the two RNA-Seq analyses of 14 different clinical cervical
tissues with two different RNA-seq platforms, each containing
normal cervical tissues without HPV infection and cervical cancer
tissues with HPV infection. The right column of the table shows the
raw reads of individual samples from each RNA-Seq platform.
TABLE-US-00004 TABLE 4 RNA-Seq detection from 14 cervical tissue
samples Sample No. Age (yr) Pathology HPV infection Total reads
RNA-Seq-1 1 27 N No 13,171,863 2 38 N No 12,028,762 3 42 N No
31,143,321 4 40 SCC Yes 12,422,476 5 42 SCC Yes 11,425,454 6 24 SCC
Yes 22,302,605 RNA-Seq-2 7 42 N No 85,255,279 8 37 N No 83,376,820
9 52 N No 80,265,055 10 44 N No 81,954,460 11 48 SCC Yes 66,982,821
12 45 SCC Yes 74,819,347 13 47 SCC Yes 93,579,886 14 49 SCC Yes
66,891,722
FIG. 1 is a flowchart of the RNA-Seq analyses. FIG. 2 shows Venn
diagrams and FIG. 3 shows a heat map showing 95 differentially
expressed RNA-binding protein genes in cervical cancer (n=7)
compared to normal cervical tissues (n=7). Table 5 summarizes the 8
RBPs with expression changes between normal and cancer tissues by
RNA-Seq. (CPM: Counts per Million)
TABLE-US-00005 TABLE 5 RNA-Seq data of the 8 RBP genes between
normal and cancer tissues Normal Cancer RNA-binding (log.sub.2 CPM,
(log.sub.2 CPM, protein genes Description mean .+-. SD) mean .+-.
SD) CDKN2A Cyclin-dependent -0.24 .+-. 0.88 6.3 .+-. 1.12 kinase
inhibitor 2A ELAVL2 ELAV like neuron- -3.38 .+-. 1.89 0.17 .+-.
3.54 specific RNA binding protein 2 GRB7 Growth factor receptor-
0.9 .+-. 0.96 4.07 .+-. 1.22 bound protein 7 HSPB1 Heat shock 27
kDa 5.74 .+-. 1.09 8.84 .+-. 2.49 protein 1 KHSRP KH-type splicing
4.35 .+-. 0.18 5.85 .+-. 0.78 regulatory protein NOVA1
Neuro-oncological 2.82 .+-. 0.55 0.1 .+-. 1.55 ventral antigen 1
PTBP1 Polypyrimidine tract 5.74 .+-. 0.21 7.18 .+-. 0.83 binding
protein 1 RNASEH2A Ribonuclease H2, 2.32 .+-. 0.47 5.01 .+-. 0.72
subunit A
Table 6 provides the TaqMan.RTM. probe information of each RBP.
TABLE-US-00006 TABLE 6 TaqMan .RTM. probe information of each RBP
Company Order name Cat No ID No Applied Single Tube Cat. # 4331182
Hs00918009_g1 Biosystems .RTM. TaqMan .RTM. Assay for GRB7 Applied
Single Tube Cat. # 4331182 Hs00270011_m1 Biosystems .RTM. TaqMan
.RTM. Assay for ELAVL2 Applied Single Tube Cat. # 4331182
Hs00958451_g1 Biosystems .RTM. TaqMan .RTM. Assay for RNASEH2A
Applied Single Tube Cat. # 4351372 Hs01100863_g1 Biosystems .RTM.
TaqMan .RTM. Assay for KHSRP Applied Single Tube Cat. # 4351372
Hs01103130_m1 Biosystems .RTM. TaqMan .RTM. Assay for NOVA1 Applied
Single Tube Cat. # 4351372 Hs00914687_g1 Biosystems .RTM. TaqMan
.RTM. Assay for PTBP1 Applied Single Tube Cat. # 4331182
Hs00923894_m1 Biosystems .RTM. TaqMan .RTM. Assay for CDKN2A
Applied Single Tube Cat. # 4331182 Hs03044127_g1 Biosystems .RTM.
TaqMan .RTM. Assay for HSPB1
FIG. 4 shows the TaqMan.RTM. RT-qPCR validation confirming that all
8 RBPs significantly increased (7 RBPs) or decreased (1 RBP) in
cervical cancer tissues (n=23), compared to normal cervical tissues
(n=24). 7 increased RBP genes in cervical cancer were also shown
higher expression in pre-cancerous lesions (CIN 2-3, n=25) when
compared to the normal tissues, indicating these changes appear
even at the early stage of cervical carcinogenesis. **, P<0.01;
***, P<0.001; NS, no statistics significance.
FIGS. 5 and 6 show that high-risk HPV16 and HPV18 infection affects
the expression of RBPs. FIG. 5 shows Total RNA extracted from human
vaginal keratinocyte (HVK)-derived raft cultures with (HVK16) or
without (HVK) productive HPV16 infection and human foreskin
keratinocyte (HFK) derived raft cultures with (HFK16) or without
(HFK) productive HPV16 infection were examined by TaqMan.RTM.
RT-qPCR for the expression of 8 RBPs. *, P<0.05; **, P<0.01;
***, P<0.001; NS, no statistics significance. FIG. 6 shows Total
RNA extracted from human vaginal keratinocyte (HVK)-derived raft
cultures with (HVK18) or without (HVK) productive HPV18 infection
and human foreskin keratinocyte (HFK) derived raft cultures with
(HFK18) or without (HFK) productive HPV18 infection were examined
by TaqMan.RTM. RT-qPCR for the expression of 8 RBPs. *, P<0.05;
***, P<0.001; NS, no statistics significance. FIG. 7 shows that
both HPV16 and HPV18 increase the expression of CDKN2A and
RNASEH2A, but decrease the expression of NOVA1 in HFK- and
HVK-derived rafts. In this experiment, total RNA was used to
determine the relative levels of individual proteins by TaqMan.RTM.
RT-qPCR. FIG. 8 shows that HPV18 infection and viral E6 and/or E7
affect the expression of RNASEH2A and Nova1. The expression of
RNASEH2A and NOVA1 in primary human keratinocytes (PHK)-derived
raft tissues with or without HPV18 infection on day 8, day 12, and
day 16 or PHK rafts transduced with a retrovirus expression HPV18
E6, E7 or E6E7 or with an empty control retrovirus were further
validated by TaqMan.RTM. RT-qPCR. These results demonstrate that
RNASEH2A and NOVA1 respond to HPV18 infection and their altered
expression in cervical cancer could be attributed to viral
oncoprotein E6 and/or E7. *, P<0.05; ***, P<0.001; NS, no
statistics significance.
FIG. 9 shows that knockdown or overexpression of RNASEH2A in HeLa
or CaSki cells affects cell proliferation. Specific-siRNA knockdown
or ectopic expression of RNASEH2A from a mammalian expression
vector in HeLa or CaSki cells on cell proliferation was evaluated
by Cell Counting Kit-8 (CCK-8) assay at time indicated. si-NS,
non-specific siRNA; siRNASEH2A, RNASEH2A-specific siRNA; P, control
vector; p-RNASEH2A, RNASEH2A-expression vector. FIG. 10 shows HPV
oncoprotein E7 regulates the expression of RNASEH2A via E2F1.
Specific-siRNA knockdown or ectopic expression of E2F1 from a
mammalian expression vector in HeLa or CaSki cells on RNASEH2A was
evaluated by Western blot using anti-RNASEH2A antibody. si-NS,
non-specific siRNA; si-E2F1, E2F1-specific siRNA; P, control
vector; p-E2F1, E2F1-expression vector.
Example 2: The Expression Profile of Long Noncoding RNAs
Distinguishes Normal Cervix from and Cancerous Cervix
RNA was extracted from each sample using Trizol.RTM. reagent (Life
technologies). RNAseq libraries were prepared using TruSeq.RTM.
Stranded Total RNA Kit with Ribo-Zero depletion and sequenced on an
Illumina HiSeq.TM. 2000 platform as paired-end reads. The fastq
data were mapped to human reference genome (hg19, GRCh37) by Bowtie
(v2.2.1.0), and the mapping results were further filtered by
samtools (v0.1.19.0). The latest annotation of LncRNA was
downloaded from Incipedia database version 3.0. We counted the
mapped reads in individual lncRNA region of each sample by bedtools
(v2.19.0). The R Bioconductor edgeR package was used to normalize
raw reads by the scaling method. The differentially expressed
lncRNAs were detected by one-way ANOVA method with 10% false
discovery rate (FDR) and four fold changes between the conditions.
FIG. 11 is a flow chart of the RNA-Seq analysis. FIG. 12 is a heat
map showing 34 overlapped, differentially expressed lnc-RNAs in
cervical cancer compared to normal cervical tissues. lnc-FANCI-2
and lnc-GLB1L2-1 were specifically identified as associated with
cervical cancer. Tables 2 and 3 list all of the isoforms of these
two lnc-RNAs.
TABLE-US-00007 Taqman .RTM. primer design for lnc-FANCI-2 Exon 6:
(SEQ ID NO: 76) CTGGAAAGGAGGAGAACATGAAACATTGCTTGAAGACAATGGCCGAGACA
GCAGGTCCCACCCTGCACAGCCACCAGCATCTCTCCCCTCAGCCCTGTCT
CCTCTTCTGCAGTTGGGATCTGCACATTTAAGCCTGAA Exon 7: (SEQ ID NO: 77)
ATTGTCCTGTGAAGTGAAGTATGATCGGACAGCCTCTTTTCAGCTTTTAT
GACAATGGAGACAGAGGAATTGTGGCTCTTGCCAAGGTCACAGGATTGGA
ATACAGAGCCAAGCCACCCCAGGACATGCAAGAGCCTCAGAAGGGAA Primers for RT-qPCR
Forward: (SEQ ID NO: 78) 5'- ACAGCCACCAGCATCTCTC -3' Probe: (SEQ ID
NO: 79) 5'- TGAAGTGAAGTATGATCGGACAGCCTC -3' Reverse: (SEQ ID NO:
80) 5'- CCACAATTCCTCTGTCTCCATT -3' TaqMan .RTM. primer design for
lnc-GLB1L2-1: Last Exon 3: (SEQ ID NO: 81)
TCTCTCATCTGTGTTTTCAGGGCATGGACTGGAACTCCCAATACCCCTGA
CATGGGCTGAGTCAACGTGGTCATGAACATGTGACAGGAG Last Exon 2: (SEQ ID NO:
82) GCAGCAGAAGTTGCAGAGAAGAGTGAGGCACGTTTGAAAAAGGCTGAAAA
ATGTTTCTGTCCAGGCAAGGGTGTGTGCTGAATGACTCAAGGATTTTTTG G Primers for
RT-qPCR Forward: (SEQ ID NO: 83) 5'- CATGGACTGGAACTCCCAATA -3'
Probe: (SEQ ID NO: 84) 5'- TGCAGAGAAGAGTGAGGCACGTTTG -3' Reverse:
(SEQ ID NO: 85) 5'- CCTTGCCTGGACAGAAACATT -3'
FIG. 13 shows an increase of lnc-FANCI-2, and decrease of
lnc-GLB1L2-1 expression along with the cervical lesion progression
from normal cervix. Lnc-FANCI-2 and lnc-GLB1L2-1 RNA expression was
examined by RT-qPCR in 24 normal, 25 CIN 2-3, and 23 cancer
tissues. FIG. 14 shows that HPV infection increases lnc-FANCI-2
expression in HVK- and PHK-derived rafts and viral E7 or E6 is
responsible for the increase. The expression of lnc-FANCI-2 in
human vaginal keratinocytes (HVK)-derived raft tissues without
(HVK) or with HPV16 (HVK16) or HPV18 (HVK18) infection or primary
human keratinocytes (PHK)-derived raft tissues without or with
HPV18 infection on day 8, day 12, and day 16 or PHK rafts
transduced with a retrovirus expressing HPV18 E6, E7 or E6E7 or
with an empty control retrovirus were further validated by RT-qPCR.
These results demonstrate that lnc-FANCI-2 expression responds to
HPV18 infection and viral oncoprotein E6 and/or E7.
In data not shown, lnc-FANCI-2 was upregulated in isolated
keratinocyte lines infected by high-risk HPVs, but not low risk
HPV11 and epidermodysplasia verruciformis-associated HPV5 and
10.
The term "polynucleotide" as used herein refers to a polymer of
greater than one nucleotide in length of ribonucleic acid (RNA),
deoxyribonucleic acid (DNA), hybrid RNA/DNA, modified RNA or DNA,
or RNA or DNA mimetics, including peptide nucleic acids (PNAs). The
polynucleotides may be single- or double-stranded. The term
includes polynucleotides composed of naturally-occurring
nucleobases, sugars, and covalent internucleoside (backbone)
linkages as well as polynucleotides having non-naturally-occurring
portions which function similarly. Such modified or substituted
polynucleotides are well known in the art and are referred to as
"analogues."
"Complementary" or "substantially complementary" refers to the
ability to hybridize or base pair between nucleotides or nucleic
acids, such as, for instance, between a sensor peptide nucleic acid
or polynucleotide and a target polynucleotide. Complementary
nucleotides are, generally, A and T (or A and U), or C and G. Two
single-stranded polynucleotides or PNAs are said to be
substantially complementary when the bases of one strand, optimally
aligned and compared and with appropriate insertions or deletions,
pair with at least about 80% of the bases of the other strand,
usually at least about 90% to 95%, and more preferably from about
98 to 100%.
Alternatively, substantial complementarity exists when a
polynucleotide may hybridize under selective hybridization
conditions to its complement. Typically, selective hybridization
may occur when there is at least about 65% complementarity over a
stretch of at least 14 to 25 bases, for example at least about 75%,
or at least about 90% complementarity.
The term "homologous region" refers to a region of a nucleic acid
with homology to another nucleic acid region. Whether a "homologous
region" is present in a nucleic acid molecule is determined with
reference to another nucleic acid region in the same or a different
molecule.
Hybridization conditions typically include salt concentrations of
less than about 1M, more usually less than about 500 mM, for
example, less than about 200 mM. In the case of hybridization
between a peptide nucleic acid and a polynucleotide, the
hybridization can be done in solutions containing little or no
salt. Hybridization temperatures can be as low as 5.degree. C., but
are typically greater than 22.degree. C., and more typically
greater than about 30.degree. C., for example in excess of about
37.degree. C. Longer fragments may require higher hybridization
temperatures for specific hybridization as is known in the art.
Other factors may affect the stringency of hybridization, including
base composition and length of the complementary strands, presence
of organic solvents and extent of base mismatching, and the
combination of parameters used is more important than the absolute
measure of any one alone. Other hybridization conditions which may
be controlled include buffer type and concentration, solution pH,
presence and concentration of blocking reagents to decrease
background binding such as repeat sequences or blocking protein
solutions, detergent type(s) and concentrations, molecules such as
polymers which increase the relative concentration of the
polynucleotides, metal ion(s) and their concentration(s),
chelator(s) and their concentrations, and other conditions known in
the art.
As used herein, a "probe" is a polynucleotide capable of
selectively hybridizing to a target sequence, a complement thereof,
a reverse complement thereof, or to an RNA version of the target
sequence, the complement thereof, or the reverse complement
thereof. A probe may comprise ribonucleotides,
deoxyribonucleotides, peptide nucleic acids, and combinations
thereof. A probe may optionally comprise one or more labels. In
some embodiments, a probe may be used to amplify one or both
strands of a target sequence or an RNA form thereof, acting as a
sole primer in an amplification reaction or as a member of a set of
primers. In one aspect, probes include nucleotide sequences of 10
to 1,000 nucleotides. In other embodiments, the probes are 10-200,
10-30, 10-40, 20-50, 40-80, 50-150, or 80-120 nucleotides in
length.
The use of the terms "a" and "an" and "the" and similar referents
(especially in the context of the following claims) are to be
construed to cover both the singular and the plural, unless
otherwise indicated herein or clearly contradicted by context. The
terms first, second etc. as used herein are not meant to denote any
particular ordering, but simply for convenience to denote a
plurality of, for example, layers. The terms "comprising",
"having", "including", and "containing" are to be construed as
open-ended terms (i.e., meaning "including, but not limited to")
unless otherwise noted. Recitation of ranges of values are merely
intended to serve as a shorthand method of referring individually
to each separate value falling within the range, unless otherwise
indicated herein, and each separate value is incorporated into the
specification as if it were individually recited herein. The
endpoints of all ranges are included within the range and
independently combinable. All methods described herein can be
performed in a suitable order unless otherwise indicated herein or
otherwise clearly contradicted by context. The use of any and all
examples, or exemplary language (e.g., "such as"), is intended
merely to better illustrate the invention and does not pose a
limitation on the scope of the invention unless otherwise claimed.
No language in the specification should be construed as indicating
any non-claimed element as essential to the practice of the
invention as used herein.
While the invention has been described with reference to an
exemplary embodiment, it will be understood by those skilled in the
art that various changes may be made and equivalents may be
substituted for elements thereof without departing from the scope
of the invention. In addition, many modifications may be made to
adapt a particular situation or material to the teachings of the
invention without departing from the essential scope thereof.
Therefore, it is intended that the invention not be limited to the
particular embodiment disclosed as the best mode contemplated for
carrying out this invention, but that the invention will include
all embodiments falling within the scope of the appended claims.
Any combination of the above-described elements in all possible
variations thereof is encompassed by the invention unless otherwise
indicated herein or otherwise clearly contradicted by context.
SEQUENCE LISTINGS
1
9511267DNAHomo sapiens 1cgagggctgc ttccggctgg tgcccccggg ggagacccaa
cctggggcga cttcaggggt 60gccacattcg ctaagtgctc ggagttaata gcacctcctc
cgagcactcg ctcacggcgt 120ccccttgcct ggaaagatac cgcggtccct
ccagaggatt tgagggacag ggtcggaggg 180ggctcttccg ccagcaccgg
aggaagaaag aggaggggct ggctggtcac cagagggtgg 240ggcggaccgc
gtgcgctcgg cggctgcgga gagggggaga gcaggcagcg ggcggcgggg
300agcagcatgg agccggcggc ggggagcagc atggagcctt cggctgactg
gctggccacg 360gccgcggccc ggggtcgggt agaggaggtg cgggcgctgc
tggaggcggg ggcgctgccc 420aacgcaccga atagttacgg tcggaggccg
atccaggtca tgatgatggg cagcgcccga 480gtggcggagc tgctgctgct
ccacggcgcg gagcccaact gcgccgaccc cgccactctc 540acccgacccg
tgcacgacgc tgcccgggag ggcttcctgg acacgctggt ggtgctgcac
600cgggccgggg cgcggctgga cgtgcgcgat gcctggggcc gtctgcccgt
ggacctggct 660gaggagctgg gccatcgcga tgtcgcacgg tacctgcgcg
cggctgcggg gggcaccaga 720ggcagtaacc atgcccgcat agatgccgcg
gaaggtccct cagacatccc cgattgaaag 780aaccagagag gctctgagaa
acctcgggaa acttagatca tcagtcaccg aaggtcctac 840agggccacaa
ctgcccccgc cacaacccac cccgctttcg tagttttcat ttagaaaata
900gagcttttaa aaatgtcctg ccttttaacg tagatatatg ccttccccca
ctaccgtaaa 960tgtccattta tatcattttt tatatattct tataaaaatg
taaaaaagaa aaacaccgct 1020tctgcctttt cactgtgttg gagttttctg
gagtgagcac tcacgcccta agcgcacatt 1080catgtgggca tttcttgcga
gcctcgcagc ctccggaagc tgtcgacttc atgacaagca 1140ttttgtgaac
tagggaagct caggggggtt actggcttct cttgagtcac actgctagca
1200aatggcagaa ccaaagctca aataaaaata aaataatttt cattcattca
ctcaaaaaaa 1260aaaaaaa 126721464DNAHomo sapiens 2cgagggctgc
ttccggctgg tgcccccggg ggagacccaa cctggggcga cttcaggggt 60gccacattcg
ctaagtgctc ggagttaata gcacctcctc cgagcactcg ctcacggcgt
120ccccttgcct ggaaagatac cgcggtccct ccagaggatt tgagggacag
ggtcggaggg 180ggctcttccg ccagcaccgg aggaagaaag aggaggggct
ggctggtcac cagagggtgg 240ggcggaccgc gtgcgctcgg cggctgcgga
gagggggaga gcaggcagcg ggcggcgggg 300agcagcatgg agccggcggc
ggggagcagc atggagcctt cggctgactg gctggccacg 360gccgcggccc
ggggtcgggt agaggaggtg cgggcgctgc tggaggcggg ggcgctgccc
420aacgcaccga atagttacgg tcggaggccg atccaggtca tgatgatggg
cagcgcccga 480gtggcggagc tgctgctgct ccacggcgcg gagcccaact
gcgccgaccc cgccactctc 540acccgacccg tgcacgacgc tgcccgggag
ggcttcctgg acacgctggt ggtgctgcac 600cgggccgggg cgcggctgga
cgtgcgcgat gcctggggcc gtctgcccgt ggacctggct 660gaggagctgg
gccatcgcga tgtcgcacgg tacctgcgcg cggctgcggg gggcaccaga
720ggcagtaacc atgcccgcat agatgccgcg gaaggtccct cagaaatgat
cggaaaccat 780ttgtgggttt gtagaagcag gcatgcgtag ggaagctacg
ggattccgcc gaggagcgcc 840agagcctgag gcgccctttg gttatcgcaa
gctggctggc tcactccgca ccaggtgcaa 900aagatgcctg gggatgcggg
aagggaaagg ccacatcttc acgccttcgc gcctggcatt 960acatccccga
ttgaaagaac cagagaggct ctgagaaacc tcgggaaact tagatcatca
1020gtcaccgaag gtcctacagg gccacaactg cccccgccac aacccacccc
gctttcgtag 1080ttttcattta gaaaatagag cttttaaaaa tgtcctgcct
tttaacgtag atatatgcct 1140tcccccacta ccgtaaatgt ccatttatat
cattttttat atattcttat aaaaatgtaa 1200aaaagaaaaa caccgcttct
gccttttcac tgtgttggag ttttctggag tgagcactca 1260cgccctaagc
gcacattcat gtgggcattt cttgcgagcc tcgcagcctc cggaagctgt
1320cgacttcatg acaagcattt tgtgaactag ggaagctcag gggggttact
ggcttctctt 1380gagtcacact gctagcaaat ggcagaacca aagctcaaat
aaaaataaaa taattttcat 1440tcattcactc aaaaaaaaaa aaaa
146431164DNAHomo sapiens 3cgctcaggga aggcgggtgc gcgcctgcgg
ggcggagatg ggcagggggc ggtgcgtggg 60tcccagtctg cagttaaggg ggcaggagtg
gcgctgctca cctctggtgc caaagggcgg 120cgcagcggct gccgagctcg
gccctggagg cggcgagaac atggtgcgca ggttcttggt 180gaccctccgg
attcggcgcg cgtgcggccc gccgcgagtg agggttttcg tggttcacat
240cccgcggctc acgggggagt gggcagcgcc aggggcgccc gccgctgtgg
ccctcgtgct 300gatgctactg aggagccagc gtctagggca gcagccgctt
cctagaagac caggtcatga 360tgatgggcag cgcccgagtg gcggagctgc
tgctgctcca cggcgcggag cccaactgcg 420ccgaccccgc cactctcacc
cgacccgtgc acgacgctgc ccgggagggc ttcctggaca 480cgctggtggt
gctgcaccgg gccggggcgc ggctggacgt gcgcgatgcc tggggccgtc
540tgcccgtgga cctggctgag gagctgggcc atcgcgatgt cgcacggtac
ctgcgcgcgg 600ctgcgggggg caccagaggc agtaaccatg cccgcataga
tgccgcggaa ggtccctcag 660acatccccga ttgaaagaac cagagaggct
ctgagaaacc tcgggaaact tagatcatca 720gtcaccgaag gtcctacagg
gccacaactg cccccgccac aacccacccc gctttcgtag 780ttttcattta
gaaaatagag cttttaaaaa tgtcctgcct tttaacgtag atatatgcct
840tcccccacta ccgtaaatgt ccatttatat cattttttat atattcttat
aaaaatgtaa 900aaaagaaaaa caccgcttct gccttttcac tgtgttggag
ttttctggag tgagcactca 960cgccctaagc gcacattcat gtgggcattt
cttgcgagcc tcgcagcctc cggaagctgt 1020cgacttcatg acaagcattt
tgtgaactag ggaagctcag gggggttact ggcttctctt 1080gagtcacact
gctagcaaat ggcagaacca aagctcaaat aaaaataaaa taattttcat
1140tcattcactc aaaaaaaaaa aaaa 116441235DNAHomo sapiens 4atggagccgg
cggcggggag cagcatggag ccttcggctg actggctggc cacggccgcg 60gcccggggtc
gggtagagga ggtgcgggcg ctgctggagg cgggggcgct gcccaacgca
120ccgaatagtt acggtcggag gccgatccag gtgggtagag ggtctgcagc
gggagcaggg 180gatggcgggc gactctggag gacgaagttt gcaggggaat
tggaatcagg tagcgcttcg 240attctccgga aaaaggggag gcttcctggg
gagttttcag aaggggtttg taatcacaga 300cctcctcctg gcgacgccct
gggggcttgg gaagccaagg aagaggaatg aggagccacg 360cgcgtacaga
tctctcgaat gctgagaaga tctgaagggg ggaacatatt tgtattagat
420ggaagtcatg atgatgggca gcgcccgagt ggcggagctg ctgctgctcc
acggcgcgga 480gcccaactgc gccgaccccg ccactctcac ccgacccgtg
cacgacgctg cccgggaggg 540cttcctggac acgctggtgg tgctgcaccg
ggccggggcg cggctggacg tgcgcgatgc 600ctggggccgt ctgcccgtgg
acctggctga ggagctgggc catcgcgatg tcgcacggta 660cctgcgcgcg
gctgcggggg gcaccagagg cagtaaccat gcccgcatag atgccgcgga
720aggtccctca gacatccccg attgaaagaa ccagagaggc tctgagaaac
ctcgggaaac 780ttagatcatc agtcaccgaa ggtcctacag ggccacaact
gcccccgcca caacccaccc 840cgctttcgta gttttcattt agaaaataga
gcttttaaaa atgtcctgcc ttttaacgta 900gatatatgcc ttcccccact
accgtaaatg tccatttata tcatttttta tatattctta 960taaaaatgta
aaaaagaaaa acaccgcttc tgccttttca ctgtgttgga gttttctgga
1020gtgagcactc acgccctaag cgcacattca tgtgggcatt tcttgcgagc
ctcgcagcct 1080ccggaagctg tcgacttcat gacaagcatt ttgtgaacta
gggaagctca ggggggttac 1140tggcttctct tgagtcacac tgctagcaaa
tggcagaacc aaagctcaaa taaaaataaa 1200ataattttca ttcattcact
caaaaaaaaa aaaaa 123553756DNAHomo sapiens 5aacggcggga ccgcggcgcc
tgggcgtcac tgaggcagta gccggccggg tgaggagggc 60ggttgccggc gcggcgcggc
gcggcgcggg tggggcgggg gttccgccgg cttccagtcc 120cctttcccgc
cgccgccgcc gccaccgcct ctccgcggag ctcgccccga gcgactcctc
180cgcggcagtg ctgacggcca gcggcacgag ccgtagtagc tgcagcttcg
agtcacagca 240gcaggtaatt gctgccatgg aaacacaact gtctaatggg
ccaacttgca ataacacagc 300caatggtcca accaccataa acaacaactg
ttcgtcacca gttgactctg ggaacacaga 360agacagcaag accaacttaa
tagtcaacta ccttcctcag aacatgacac aggaggaact 420aaagagtctc
tttgggagca ttggtgaaat agagtcctgt aagcttgtaa gagacaaaat
480aacagggcag agcttgggat atggctttgt gaactacatt gaccccaagg
atgcagagaa 540agctatcaac accctgaatg gattgagact tcaaaccaaa
acaataaaag tttcctatgc 600tcgcccaagt tcagcttcta tcagagatgc
aaatttatat gtcagcggac ttccaaaaac 660aatgacccag aaggagttgg
aacagctttt ttcacaatat ggacgcatta ttacttctcg 720tattcttgtc
gaccaggtca ctggcatatc aaggggtgta gggtttattc gatttgacaa
780gcgaattgag gcagaagaag ctatcaaagg cctaaatggc cagaaacctc
ccggtgccac 840ggagccaatc actgtaaagt ttgctaataa cccaagccaa
aaaaccaatc aggccatcct 900ttcccagctg taccagtctc caaacagaag
gtatccagga ccgctagctc agcaggcaca 960gcgttttagg ttttctccaa
tgaccattga cggaatgacc agtttggctg gaattaatat 1020ccctgggcac
cctggaacag ggtggtgtat atttgtgtac aacctggctc ctgacgcaga
1080tgagagtatc ctgtggcaaa tgtttgggcc ttttggagct gtcaccaatg
tgaaggtcat 1140ccgtgacttt aacaccaata aatgcaaagg ttttggattt
gtgactatga caaactatga 1200tgaggctgcc atggcgatag ctagcctcaa
tggataccgt ctgggagaca gagtactgca 1260ggtctccttt aagacaaaca
aaacgcacaa agcctaatga gctcttgtcc tcagtccatt 1320tatatatgaa
aactatacaa caaaggcaag ttaagagaaa ctttatacat tagtaaatgt
1380ctttgtaagt cagtgttgag atggggataa aatgactact tagcatccta
agaaatatgt 1440gagatttttt attgctagta tttgaattaa aacttcttaa
atatctttta tgtttgaata 1500tggacaagag gtacagggtt tttacctgtc
acattgcatt ctattgcctt ctttgaagaa 1560ggtggacctt ttaaagtgtt
tcagctaagg gaagacattt cttttctttt tacataactg 1620ccttgaacct
gtgagtaaat attgaggctt tgtgttgtaa ttcttcagtt ggttgtgtct
1680tttttttccc cccttttttt cctttttctg attagctttg tgtttggttt
acatttaaag 1740cattgctgtt atgtctgttt aagaaaagta ttttgaagtt
tacattttta tttatgaagt 1800ttaaaacagt atttattttg taattatgat
ttgggttggg gaaggggggg ctacattata 1860aacgcttatt gtaagaatac
tggagaactt ttcgtaaagc agtaccttgc caaagagata 1920agagcctctt
tgatgtgggt ttaaaaaaag catctatttt tataaaaaag aaaatttgga
1980gaaacttttt actggtcctg gaacaaatat tttgacttga atactttgag
aaatctcttc 2040atatgacacc tagtgagctt ttaaaattta ccaggaaatt
tgcagcggtt ggaaaattta 2100gaaagattta tggtgtagaa aatacttttg
agatctttgt atgaaaggag tagaatcaat 2160ggggggaaac actgctggtt
tcatttttgt aatcaccagt ggagcgtctg atcatcctgg 2220ttattatgtg
ataggtggct cacattgatt tgtgattttg aaacaaataa aaaaaattta
2280caaaagaata tataagagca ggcaagaaat ttaaattacc gagagatggg
ggaaaaaatc 2340tgttcttcct aaagaaatcc cttcagatag agctcatggt
gtttagtgat gtacttgcag 2400tattgtttga agaattgttt tgtcttaagg
aaaaaagacg ttgcacatga tttgtactgc 2460agcaaatcag caaaagtgat
ctgagttgga tatatttgaa ggtattttga aagttacgtt 2520caaggctaac
acctgagctt tgtgtaatgt aaataagacc ttgtgtttat gaacctttca
2580gctaatttaa ttttttttcc cttacatgcc aagtgatgtt caggttttga
atgtttttgt 2640atcagttttt tcctttgtaa atggcattaa cattgttact
tgaggtcttg cttaatcact 2700tttgttgtcc tgaggacttg aatttacagt
gcatcagatt tgttgcaaat tttgtctgta 2760gatagtctag cttcagctgt
ttatggtgat gctacatttt cgtttataaa tatgtttgtg 2820gtataaaaaa
atgagtataa ccataggttt tgaacaaatt tccttacatt tttcatacaa
2880aaatcataaa tatctgtatg ctattgaaat ttaactttgt atgatgctta
aaaaccacta 2940tttggggaaa taataaaata agtctttacc atgtatgaaa
gaaattttaa aaaatacaaa 3000atattttctg attagcatct agcttataat
aaattttcaa aaaagctgaa ggcaaaaatg 3060ccttcatcag gatgcactga
gaactatata gttacgtcct gctttttgta taaactgaga 3120tgctcacatg
cttcccctta gaacaggcaa tgtgctatgc ataacatagt tgtacattat
3180ctttgcggtt gctttgagtt ttatttttta ttatttaaaa ttgtagttat
aaaatttttc 3240agtatagtac agtacatata ctgtgaggcg cgtgctaaag
tgaataagcg agttttcatg 3300ctgacccact caatgctatt cagaaatcaa
ttggcttagc actttctcat atccttaggt 3360gcatttagat tgccagagtt
aaccttctgc gtttaaaaaa agaaaaacac taaaaaataa 3420aatacatgta
tatacttaaa aaaaaataat aaggtttccc tcaagggaaa acagcagcta
3480catgcttctt tcctatacta ctgtagcaaa ccaaggcatt gatgagaggg
catgcaaatt 3540gtgcttcact ttacagtgtt ttatcagagc acttaataaa
atgtaaggct ggtatttatt 3600tgaagttgta cagtatgact taattcacat
ctgttggaat agaaaatata ttctgttgag 3660tatttaagag gctgtacatg
ttttcttttg tgtttggatt ctttgtactt tttcatgttc 3720agtacatcaa
taaacaaagt tgaagggaaa aaaaaa 375663769DNAHomo sapiens 6agtccgaact
ctgggcggga acactggtgg gggcggcgga ggttgtgccc gcgaagttcc 60tagagctcag
cccgttgcgg cgggagtaga gagaattggg cgcctcggga ggtggcaccg
120cccctcccgt gggcacaagc aggttggggg cggcgggagc cgagcgggga
cagtcgcgcc 180tggcagcgtg cacgggcgtg gacgtgcccg ggtgcggccg
cgtgtagcgc aagaaggaaa 240ctgttgagac gcagcaggta attgctgcca
tggaaacaca actgtctaat gggccaactt 300gcaataacac agccaatggt
ccaaccacca taaacaacaa ctgttcgtca ccagttgact 360ctgggaacac
agaagacagc aagaccaact taatagtcaa ctaccttcct cagaacatga
420cacaggagga actaaagagt ctctttggga gcattggtga aatagagtcc
tgtaagcttg 480taagagacaa aataacaggg cagagcttgg gatatggctt
tgtgaactac attgacccca 540aggatgcaga gaaagctatc aacaccctga
atggattgag acttcaaacc aaaacaataa 600aagtttccta tgctcgccca
agttcagctt ctatcagaga tgcaaattta tatgtcagcg 660gacttccaaa
aacaatgacc cagaaggagt tggaacagct tttttcacaa tatggacgca
720ttattacttc tcgtattctt gtcgaccagg tcactggcat atcaaggggt
gtagggttta 780ttcgatttga caagcgaatt gaggcagaag aagctatcaa
aggcctaaat ggccagaaac 840ctcccggtgc cacggagcca atcactgtaa
agtttgctaa taacccaagc caaaaaacca 900atcaggccat cctttcccag
ctgtaccagt ctccaaacag aaggtatcca ggaccgctag 960ctcagcaggc
acagcgtttt aggttttctc caatgaccat tgacggaatg accagtttgg
1020ctggaattaa tatccctggg caccctggaa cagggtggtg tatatttgtg
tacaacctgg 1080ctcctgacgc agatgagagt atcctgtggc aaatgtttgg
gccttttgga gctgtcacca 1140atgtgaaggt catccgtgac tttaacacca
ataaatgcaa aggttttgga tttgtgacta 1200tgacaaacta tgatgaggct
gccatggcga tagctagcct caatggatac cgtctgggag 1260acagagtact
gcaggtctcc tttaagacaa acaaaacgca caaagcctaa tgagctcttg
1320tcctcagtcc atttatatat gaaaactata caacaaaggc aagttaagag
aaactttata 1380cattagtaaa tgtctttgta agtcagtgtt gagatgggga
taaaatgact acttagcatc 1440ctaagaaata tgtgagattt tttattgcta
gtatttgaat taaaacttct taaatatctt 1500ttatgtttga atatggacaa
gaggtacagg gtttttacct gtcacattgc attctattgc 1560cttctttgaa
gaaggtggac cttttaaagt gtttcagcta agggaagaca tttcttttct
1620ttttacataa ctgccttgaa cctgtgagta aatattgagg ctttgtgttg
taattcttca 1680gttggttgtg tctttttttt cccccctttt tttccttttt
ctgattagct ttgtgtttgg 1740tttacattta aagcattgct gttatgtctg
tttaagaaaa gtattttgaa gtttacattt 1800ttatttatga agtttaaaac
agtatttatt ttgtaattat gatttgggtt ggggaagggg 1860gggctacatt
ataaacgctt attgtaagaa tactggagaa cttttcgtaa agcagtacct
1920tgccaaagag ataagagcct ctttgatgtg ggtttaaaaa aagcatctat
ttttataaaa 1980aagaaaattt ggagaaactt tttactggtc ctggaacaaa
tattttgact tgaatacttt 2040gagaaatctc ttcatatgac acctagtgag
cttttaaaat ttaccaggaa atttgcagcg 2100gttggaaaat ttagaaagat
ttatggtgta gaaaatactt ttgagatctt tgtatgaaag 2160gagtagaatc
aatgggggga aacactgctg gtttcatttt tgtaatcacc agtggagcgt
2220ctgatcatcc tggttattat gtgataggtg gctcacattg atttgtgatt
ttgaaacaaa 2280taaaaaaaat ttacaaaaga atatataaga gcaggcaaga
aatttaaatt accgagagat 2340gggggaaaaa atctgttctt cctaaagaaa
tcccttcaga tagagctcat ggtgtttagt 2400gatgtacttg cagtattgtt
tgaagaattg ttttgtctta aggaaaaaag acgttgcaca 2460tgatttgtac
tgcagcaaat cagcaaaagt gatctgagtt ggatatattt gaaggtattt
2520tgaaagttac gttcaaggct aacacctgag ctttgtgtaa tgtaaataag
accttgtgtt 2580tatgaacctt tcagctaatt taattttttt tcccttacat
gccaagtgat gttcaggttt 2640tgaatgtttt tgtatcagtt ttttcctttg
taaatggcat taacattgtt acttgaggtc 2700ttgcttaatc acttttgttg
tcctgaggac ttgaatttac agtgcatcag atttgttgca 2760aattttgtct
gtagatagtc tagcttcagc tgtttatggt gatgctacat tttcgtttat
2820aaatatgttt gtggtataaa aaaatgagta taaccatagg ttttgaacaa
atttccttac 2880atttttcata caaaaatcat aaatatctgt atgctattga
aatttaactt tgtatgatgc 2940ttaaaaacca ctatttgggg aaataataaa
ataagtcttt accatgtatg aaagaaattt 3000taaaaaatac aaaatatttt
ctgattagca tctagcttat aataaatttt caaaaaagct 3060gaaggcaaaa
atgccttcat caggatgcac tgagaactat atagttacgt cctgcttttt
3120gtataaactg agatgctcac atgcttcccc ttagaacagg caatgtgcta
tgcataacat 3180agttgtacat tatctttgcg gttgctttga gttttatttt
ttattattta aaattgtagt 3240tataaaattt ttcagtatag tacagtacat
atactgtgag gcgcgtgcta aagtgaataa 3300gcgagttttc atgctgaccc
actcaatgct attcagaaat caattggctt agcactttct 3360catatcctta
ggtgcattta gattgccaga gttaaccttc tgcgtttaaa aaaagaaaaa
3420cactaaaaaa taaaatacat gtatatactt aaaaaaaaat aataaggttt
ccctcaaggg 3480aaaacagcag ctacatgctt ctttcctata ctactgtagc
aaaccaaggc attgatgaga 3540gggcatgcaa attgtgcttc actttacagt
gttttatcag agcacttaat aaaatgtaag 3600gctggtattt atttgaagtt
gtacagtatg acttaattca catctgttgg aatagaaaat 3660atattctgtt
gagtatttaa gaggctgtac atgttttctt ttgtgtttgg attctttgta
3720ctttttcatg ttcagtacat caataaacaa agttgaaggg aaaaaaaaa
376973814DNAHomo sapiens 7caataggagg gtagtctctc cgtcttttta
aactcttttt taagtttccc ctcccctttc 60atattttttt tcgccatttc ttttagcatt
ggactttggg gtcgaaagcg tttcttttta 120tttgcttctt ttaagccgag
cacagtttag gtttcgtgct gtcttaagag aactatccag 180cagcttcttg
ctcatcctta ttgggagaac tgcaccgtta ctttaaaaac acacatacac
240aaaaacctta agggagaaag caggtaattg ctgccatgga aacacaactg
tctaatgggc 300caacttgcaa taacacagcc aatggtccaa ccaccataaa
caacaactgt tcgtcaccag 360ttgactctgg gaacacagaa gacagcaaga
ccaacttaat agtcaactac cttcctcaga 420acatgacaca ggaggaacta
aagagtctct ttgggagcat tggtgaaata gagtcctgta 480agcttgtaag
agacaaaata acagggcaga gcttgggata tggctttgtg aactacattg
540accccaagga tgcagagaaa gctatcaaca ccctgaatgg attgagactt
caaaccaaaa 600caataaaagt ttcctatgct cgcccaagtt cagcttctat
cagagatgca aatttatatg 660tcagcggact tccaaaaaca atgacccaga
aggagttgga acagcttttt tcacaatatg 720gacgcattat tacttctcgt
attcttgtcg accaggtcac tggcatatca aggggtgtag 780ggtttattcg
atttgacaag cgaattgagg cagaagaagc tatcaaaggc ctaaatggcc
840agaaacctcc cggtgccacg gagccaatca ctgtaaagtt tgctaataac
ccaagccaaa 900aaaccaatca ggccatcctt tcccagctgt accagtctcc
aaacagaagg tatccaggac 960cgctagctca gcaggcacag cgttttaggt
tggacaatct gctcaatatg gcttatggag 1020taaagaggtt ttctccaatg
accattgacg gaatgaccag tttggctgga attaatatcc 1080ctgggcaccc
tggaacaggg tggtgtatat ttgtgtacaa cctggctcct gacgcagatg
1140agagtatcct gtggcaaatg tttgggcctt ttggagctgt caccaatgtg
aaggtcatcc 1200gtgactttaa caccaataaa tgcaaaggtt ttggatttgt
gactatgaca aactatgatg 1260aggctgccat ggcgatagct agcctcaatg
gataccgtct gggagacaga gtactgcagg 1320tctcctttaa gacaaacaaa
acgcacaaag cctaatgagc tcttgtcctc agtccattta 1380tatatgaaaa
ctatacaaca aaggcaagtt aagagaaact ttatacatta gtaaatgtct
1440ttgtaagtca gtgttgagat ggggataaaa tgactactta gcatcctaag
aaatatgtga 1500gattttttat tgctagtatt tgaattaaaa cttcttaaat
atcttttatg tttgaatatg 1560gacaagaggt acagggtttt tacctgtcac
attgcattct attgccttct ttgaagaagg 1620tggacctttt aaagtgtttc
agctaaggga agacatttct tttcttttta cataactgcc 1680ttgaacctgt
gagtaaatat tgaggctttg tgttgtaatt cttcagttgg ttgtgtcttt
1740tttttccccc ctttttttcc tttttctgat tagctttgtg tttggtttac
atttaaagca 1800ttgctgttat gtctgtttaa gaaaagtatt ttgaagttta
catttttatt tatgaagttt 1860aaaacagtat ttattttgta attatgattt
gggttgggga agggggggct acattataaa 1920cgcttattgt aagaatactg
gagaactttt cgtaaagcag taccttgcca aagagataag 1980agcctctttg
atgtgggttt aaaaaaagca tctattttta taaaaaagaa aatttggaga
2040aactttttac tggtcctgga acaaatattt tgacttgaat
actttgagaa atctcttcat 2100atgacaccta gtgagctttt aaaatttacc
aggaaatttg cagcggttgg aaaatttaga 2160aagatttatg gtgtagaaaa
tacttttgag atctttgtat gaaaggagta gaatcaatgg 2220ggggaaacac
tgctggtttc atttttgtaa tcaccagtgg agcgtctgat catcctggtt
2280attatgtgat aggtggctca cattgatttg tgattttgaa acaaataaaa
aaaatttaca 2340aaagaatata taagagcagg caagaaattt aaattaccga
gagatggggg aaaaaatctg 2400ttcttcctaa agaaatccct tcagatagag
ctcatggtgt ttagtgatgt acttgcagta 2460ttgtttgaag aattgttttg
tcttaaggaa aaaagacgtt gcacatgatt tgtactgcag 2520caaatcagca
aaagtgatct gagttggata tatttgaagg tattttgaaa gttacgttca
2580aggctaacac ctgagctttg tgtaatgtaa ataagacctt gtgtttatga
acctttcagc 2640taatttaatt ttttttccct tacatgccaa gtgatgttca
ggttttgaat gtttttgtat 2700cagttttttc ctttgtaaat ggcattaaca
ttgttacttg aggtcttgct taatcacttt 2760tgttgtcctg aggacttgaa
tttacagtgc atcagatttg ttgcaaattt tgtctgtaga 2820tagtctagct
tcagctgttt atggtgatgc tacattttcg tttataaata tgtttgtggt
2880ataaaaaaat gagtataacc ataggttttg aacaaatttc cttacatttt
tcatacaaaa 2940atcataaata tctgtatgct attgaaattt aactttgtat
gatgcttaaa aaccactatt 3000tggggaaata ataaaataag tctttaccat
gtatgaaaga aattttaaaa aatacaaaat 3060attttctgat tagcatctag
cttataataa attttcaaaa aagctgaagg caaaaatgcc 3120ttcatcagga
tgcactgaga actatatagt tacgtcctgc tttttgtata aactgagatg
3180ctcacatgct tccccttaga acaggcaatg tgctatgcat aacatagttg
tacattatct 3240ttgcggttgc tttgagtttt attttttatt atttaaaatt
gtagttataa aatttttcag 3300tatagtacag tacatatact gtgaggcgcg
tgctaaagtg aataagcgag ttttcatgct 3360gacccactca atgctattca
gaaatcaatt ggcttagcac tttctcatat ccttaggtgc 3420atttagattg
ccagagttaa ccttctgcgt ttaaaaaaag aaaaacacta aaaaataaaa
3480tacatgtata tacttaaaaa aaaataataa ggtttccctc aagggaaaac
agcagctaca 3540tgcttctttc ctatactact gtagcaaacc aaggcattga
tgagagggca tgcaaattgt 3600gcttcacttt acagtgtttt atcagagcac
ttaataaaat gtaaggctgg tatttatttg 3660aagttgtaca gtatgactta
attcacatct gttggaatag aaaatatatt ctgttgagta 3720tttaagaggc
tgtacatgtt ttcttttgtg tttggattct ttgtactttt tcatgttcag
3780tacatcaata aacaaagttg aagggaaaaa aaaa 381482130DNAHomo sapiens
8agttaagggc ctggcgtctc cctccctgaa gacgtggtcc cagccgggtg tcctgacgct
60cggggttcag gacaagggca cacaactggt tccgttaagc ccctctctcg ctcagacgcc
120atggagctgg atctgtctcc acctcatctt agcagctctc cggaagacct
ttgcccagcc 180cctgggaccc ctcctgggac tccccggccc cctgataccc
ctctgcctga ggaggtaaag 240aggtcccagc ctctcctcat cccaaccacc
ggcaggaaac ttcgagagga ggagaggcgt 300gccacctccc tcccctctat
ccccaacccc ttccctgagc tctgcagtcc tccctcacag 360agcccaattc
tcgggggccc ctccagtgca agggggctgc tcccccgcga tgccagccgc
420ccccatgtag taaaggtgta cagtgaggat ggggcctgca ggtctgtgga
ggtggcagca 480ggtgccacag ctcgccacgt gtgtgaaatg ctggtgcagc
gagctcacgc cttgagcgac 540gagacctggg ggctggtgga gtgccacccc
cacctagcac tggagcgggg tttggaggac 600cacgagtccg tggtggaagt
gcaggctgcc tggcccgtgg gcggagatag ccgcttcgtc 660ttccggaaaa
acttcgccaa gtacgaactg ttcaagagct ccccacactc cctgttccca
720gaaaaaatgg tctccagctg tctcgatgca cacactggta tatcccatga
agacctcatc 780cagaacttcc tgaatgctgg cagctttcct gagatccagg
gctttctgca gctgcggggt 840tcaggacgga agctttggaa acgctttttc
tgcttcttgc gccgatctgg cctctattac 900tccaccaagg gcacctctaa
ggatccgagg cacctgcagt acgtggcaga tgtgaacgag 960tccaacgtgt
acgtggtgac gcagggccgc aagctctacg ggatgcccac tgacttcggt
1020ttctgtgtca agcccaacaa gcttcgaaat ggccacaagg ggcttcggat
cttctgcagt 1080gaagatgagc agagccgcac ctgctggctg gctgccttcc
gcctcttcaa gtacggggtg 1140cagctgtaca agaattacca gcaggcacag
tctcgccatc tgcatccatc ttgtttgggc 1200tccccaccct tgagaagtgc
ctcagataat accctggtgg ccatggactt ctctggccat 1260gctgggcgtg
tcattgagaa cccccgggag gctctgagtg tggccctgga ggaggcccag
1320gcctggagga agaagacaaa ccaccgcctc agcctgccca tgccagcctc
cggcacgagc 1380ctcagtgcag ccatccaccg cacccaactc tggttccacg
ggcgcatttc ccgtgaggag 1440agccagcggc ttattggaca gcagggcttg
gtagacggcc tgttcctggt ccgggagagt 1500cagcggaacc cccagggctt
tgtcctctct ttgtgccacc tgcagaaagt gaagcattat 1560ctcatcctgc
cgagcgagga ggagggccgc ctgtacttca gcatggatga tggccagacc
1620cgcttcactg acctgctgca gctcgtggag ttccaccagc tgaaccgcgg
catcctgccg 1680tgcttgctgc gccattgctg cacgcgggtg gccctctgac
caggccgtgg actggctcat 1740gcctcagccc gccttcaggc tgcccgccgc
ccctccaccc atccagtgga ctctggggcg 1800cggccacagg ggacgggatg
aggagcggga gggttccgcc actccagttt tctcctctgc 1860ttctttgcct
ccctcagata gaaaacagcc cccactccag tccactcctg acccctctcc
1920tcaagggaag gccttgggtg gccccctctc cttctcctag ctctggaggt
gctgctctag 1980ggcagggaat tatgggagaa gtgggggcag cccaggcggt
ttcacgcccc acactttgta 2040cagaccgaga ggccagttga tctgctctgt
tttatactag tgacaataaa gattattttt 2100tgatacaaaa aaaaaaaaaa
aaaaaaaaaa 213092214DNAHomo sapiens 9tgcgggctgc ggggagatgt
ggggagggcc ccctccactt tggagggcag tgaaggagag 60ggatcctcta aattgtcgag
gcttcatctc tccagattgt atgcccttct cagcaacacc 120gcctccggcc
ctccgatggg aaagtggagg ccgggacaag ggcacacaac tggttccgtt
180aagcccctct ctcgctcaga cgccatggag ctggatctgt ctccacctca
tcttagcagc 240tctccggaag acctttgccc agcccctggg acccctcctg
ggactccccg gccccctgat 300acccctctgc ctgaggaggt aaagaggtcc
cagcctctcc tcatcccaac caccggcagg 360aaacttcgag aggaggagag
gcgtgccacc tccctcccct ctatccccaa ccccttccct 420gagctctgca
gtcctccctc acagagccca attctcgggg gcccctccag tgcaaggggg
480ctgctccccc gcgatgccag ccgcccccat gtagtaaagg tgtacagtga
ggatggggcc 540tgcaggtctg tggaggtggc agcaggtgcc acagctcgcc
acgtgtgtga aatgctggtg 600cagcgagctc acgccttgag cgacgagacc
tgggggctgg tggagtgcca cccccaccta 660gcactggagc ggggtttgga
ggaccacgag tccgtggtgg aagtgcaggc tgcctggccc 720gtgggcggag
atagccgctt cgtcttccgg aaaaacttcg ccaagtacga actgttcaag
780agctccccac actccctgtt cccagaaaaa atggtctcca gctgtctcga
tgcacacact 840ggtatatccc atgaagacct catccagaac ttcctgaatg
ctggcagctt tcctgagatc 900cagggctttc tgcagctgcg gggttcagga
cggaagcttt ggaaacgctt tttctgcttc 960ttgcgccgat ctggcctcta
ttactccacc aagggcacct ctaaggatcc gaggcacctg 1020cagtacgtgg
cagatgtgaa cgagtccaac gtgtacgtgg tgacgcaggg ccgcaagctc
1080tacgggatgc ccactgactt cggtttctgt gtcaagccca acaagcttcg
aaatggccac 1140aaggggcttc ggatcttctg cagtgaagat gagcagagcc
gcacctgctg gctggctgcc 1200ttccgcctct tcaagtacgg ggtgcagctg
tacaagaatt accagcaggc acagtctcgc 1260catctgcatc catcttgttt
gggctcccca cccttgagaa gtgcctcaga taataccctg 1320gtggccatgg
acttctctgg ccatgctggg cgtgtcattg agaacccccg ggaggctctg
1380agtgtggccc tggaggaggc ccaggcctgg aggaagaaga caaaccaccg
cctcagcctg 1440cccatgccag cctccggcac gagcctcagt gcagccatcc
accgcaccca actctggttc 1500cacgggcgca tttcccgtga ggagagccag
cggcttattg gacagcaggg cttggtagac 1560ggcctgttcc tggtccggga
gagtcagcgg aacccccagg gctttgtcct ctctttgtgc 1620cacctgcaga
aagtgaagca ttatctcatc ctgccgagcg aggaggaggg ccgcctgtac
1680ttcagcatgg atgatggcca gacccgcttc actgacctgc tgcagctcgt
ggagttccac 1740cagctgaacc gcggcatcct gccgtgcttg ctgcgccatt
gctgcacgcg ggtggccctc 1800tgaccaggcc gtggactggc tcatgcctca
gcccgccttc aggctgcccg ccgcccctcc 1860acccatccag tggactctgg
ggcgcggcca caggggacgg gatgaggagc gggagggttc 1920cgccactcca
gttttctcct ctgcttcttt gcctccctca gatagaaaac agcccccact
1980ccagtccact cctgacccct ctcctcaagg gaaggccttg ggtggccccc
tctccttctc 2040ctagctctgg aggtgctgct ctagggcagg gaattatggg
agaagtgggg gcagcccagg 2100cggtttcacg ccccacactt tgtacagacc
gagaggccag ttgatctgct ctgttttata 2160ctagtgacaa taaagattat
tttttgatac aaaaaaaaaa aaaaaaaaaa aaaa 2214102275DNAHomo sapiens
10aggcaaaccc cagccttgga ctggccctct ctgatctctg aggccaggct ctaatgtgat
60ttgaatctac ttctaacccc ttccaagcac tgccctcccg aattctctgc tcctctcccc
120accccactgt tggtctgtga tttcgaggca ggcgtggccc cctgcagcct
ggaatgaagt 180cactggggct gtttggagac cggggctgtt tggaggacaa
gggcacacaa ctggttccgt 240taagcccctc tctcgctcag acgccatgga
gctggatctg tctccacctc atcttagcag 300ctctccggaa gacctttgcc
cagcccctgg gacccctcct gggactcccc ggccccctga 360tacccctctg
cctgaggagg taaagaggtc ccagcctctc ctcatcccaa ccaccggcag
420gaaacttcga gaggaggaga ggcgtgccac ctccctcccc tctatcccca
accccttccc 480tgagctctgc agtcctccct cacagagccc aattctcggg
ggcccctcca gtgcaagggg 540gctgctcccc cgcgatgcca gccgccccca
tgtagtaaag gtgtacagtg aggatggggc 600ctgcaggtct gtggaggtgg
cagcaggtgc cacagctcgc cacgtgtgtg aaatgctggt 660gcagcgagct
cacgccttga gcgacgagac ctgggggctg gtggagtgcc acccccacct
720agcactggag cggggtttgg aggaccacga gtccgtggtg gaagtgcagg
ctgcctggcc 780cgtgggcgga gatagccgct tcgtcttccg gaaaaacttc
gccaagtacg aactgttcaa 840gagctcccca cactccctgt tcccagaaaa
aatggtctcc agctgtctcg atgcacacac 900tggtatatcc catgaagacc
tcatccagaa cttcctgaat gctggcagct ttcctgagat 960ccagggcttt
ctgcagctgc ggggttcagg acggaagctt tggaaacgct ttttctgctt
1020cttgcgccga tctggcctct attactccac caagggcacc tctaaggatc
cgaggcacct 1080gcagtacgtg gcagatgtga acgagtccaa cgtgtacgtg
gtgacgcagg gccgcaagct 1140ctacgggatg cccactgact tcggtttctg
tgtcaagccc aacaagcttc gaaatggcca 1200caaggggctt cggatcttct
gcagtgaaga tgagcagagc cgcacctgct ggctggctgc 1260cttccgcctc
ttcaagtacg gggtgcagct gtacaagaat taccagcagg cacagtctcg
1320ccatctgcat ccatcttgtt tgggctcccc acccttgaga agtgcctcag
ataataccct 1380ggtggccatg gacttctctg gccatgctgg gcgtgtcatt
gagaaccccc gggaggctct 1440gagtgtggcc ctggaggagg cccaggcctg
gaggaagaag acaaaccacc gcctcagcct 1500gcccatgcca gcctccggca
cgagcctcag tgcagccatc caccgcaccc aactctggtt 1560ccacgggcgc
atttcccgtg aggagagcca gcggcttatt ggacagcagg gcttggtaga
1620cggcctgttc ctggtccggg agagtcagcg gaacccccag ggctttgtcc
tctctttgtg 1680ccacctgcag aaagtgaagc attatctcat cctgccgagc
gaggaggagg gccgcctgta 1740cttcagcatg gatgatggcc agacccgctt
cactgacctg ctgcagctcg tggagttcca 1800ccagctgaac cgcggcatcc
tgccgtgctt gctgcgccat tgctgcacgc gggtggccct 1860ctgaccaggc
cgtggactgg ctcatgcctc agcccgcctt caggctgccc gccgcccctc
1920cacccatcca gtggactctg gggcgcggcc acaggggacg ggatgaggag
cgggagggtt 1980ccgccactcc agttttctcc tctgcttctt tgcctccctc
agatagaaaa cagcccccac 2040tccagtccac tcctgacccc tctcctcaag
ggaaggcctt gggtggcccc ctctccttct 2100cctagctctg gaggtgctgc
tctagggcag ggaattatgg gagaagtggg ggcagcccag 2160gcggtttcac
gccccacact ttgtacagac cgagaggcca gttgatctgc tctgttttat
2220actagtgaca ataaagatta ttttttgata caaaaaaaaa aaaaaaaaaa aaaaa
2275112285DNAHomo sapiens 11acccgccccc atctgcccaa gataatttta
gtttccttgg gcctggaatc tggacacaca 60gggctccccc ccgcctctga cttctctgtc
cgaagtcggg acaccctcct accacctgta 120gagaagcggg agtggatctg
aaataaaatc caggaatctg ggggttccta gacggagcca 180gacttcggaa
cgggtgtcct gctactcctg ctggggctcc tccaggacaa gggcacacaa
240ctggttccgt taagcccctc tctcgctcag acgccatgga gctggatctg
tctccacctc 300atcttagcag ctctccggaa gacctttgcc cagcccctgg
gacccctcct gggactcccc 360ggccccctga tacccctctg cctgaggagg
taaagaggtc ccagcctctc ctcatcccaa 420ccaccggcag gaaacttcga
gaggaggaga ggcgtgccac ctccctcccc tctatcccca 480accccttccc
tgagctctgc agtcctccct cacagagccc aattctcggg ggcccctcca
540gtgcaagggg gctgctcccc cgcgatgcca gccgccccca tgtagtaaag
gtgtacagtg 600aggatggggc ctgcaggtct gtggaggtgg cagcaggtgc
cacagctcgc cacgtgtgtg 660aaatgctggt gcagcgagct cacgccttga
gcgacgagac ctgggggctg gtggagtgcc 720acccccacct agcactggag
cggggtttgg aggaccacga gtccgtggtg gaagtgcagg 780ctgcctggcc
cgtgggcgga gatagccgct tcgtcttccg gaaaaacttc gccaagtacg
840aactgttcaa gagctcccca cactccctgt tcccagaaaa aatggtctcc
agctgtctcg 900atgcacacac tggtatatcc catgaagacc tcatccagaa
cttcctgaat gctggcagct 960ttcctgagat ccagggcttt ctgcagctgc
ggggttcagg acggaagctt tggaaacgct 1020ttttctgctt cttgcgccga
tctggcctct attactccac caagggcacc tctaaggatc 1080cgaggcacct
gcagtacgtg gcagatgtga acgagtccaa cgtgtacgtg gtgacgcagg
1140gccgcaagct ctacgggatg cccactgact tcggtttctg tgtcaagccc
aacaagcttc 1200gaaatggcca caaggggctt cggatcttct gcagtgaaga
tgagcagagc cgcacctgct 1260ggctggctgc cttccgcctc ttcaagtacg
gggtgcagct gtacaagaat taccagcagg 1320cacagtctcg ccatctgcat
ccatcttgtt tgggctcccc acccttgaga agtgcctcag 1380ataataccct
ggtggccatg gacttctctg gccatgctgg gcgtgtcatt gagaaccccc
1440gggaggctct gagtgtggcc ctggaggagg cccaggcctg gaggaagaag
acaaaccacc 1500gcctcagcct gcccatgcca gcctccggca cgagcctcag
tgcagccatc caccgcaccc 1560aactctggtt ccacgggcgc atttcccgtg
aggagagcca gcggcttatt ggacagcagg 1620gcttggtaga cggcctgttc
ctggtccggg agagtcagcg gaacccccag ggctttgtcc 1680tctctttgtg
ccacctgcag aaagtgaagc attatctcat cctgccgagc gaggaggagg
1740gccgcctgta cttcagcatg gatgatggcc agacccgctt cactgacctg
ctgcagctcg 1800tggagttcca ccagctgaac cgcggcatcc tgccgtgctt
gctgcgccat tgctgcacgc 1860gggtggccct ctgaccaggc cgtggactgg
ctcatgcctc agcccgcctt caggctgccc 1920gccgcccctc cacccatcca
gtggactctg gggcgcggcc acaggggacg ggatgaggag 1980cgggagggtt
ccgccactcc agttttctcc tctgcttctt tgcctccctc agatagaaaa
2040cagcccccac tccagtccac tcctgacccc tctcctcaag ggaaggcctt
gggtggcccc 2100ctctccttct cctagctctg gaggtgctgc tctagggcag
ggaattatgg gagaagtggg 2160ggcagcccag gcggtttcac gccccacact
ttgtacagac cgagaggcca gttgatctgc 2220tctgttttat actagtgaca
ataaagatta ttttttgata caaaaaaaaa aaaaaaaaaa 2280aaaaa
228512914DNAHomo sapiens 12gcatggggag gggcggccct caaacgggtc
attgccatta atagagacct caaacaccgc 60ctgctaaaaa tacccgactg gaggagcata
aaagcgcagc cgagcccagc gccccgcact 120tttctgagca gacgtccaga
gcagagtcag ccagcatgac cgagcgccgc gtccccttct 180cgctcctgcg
gggccccagc tgggacccct tccgcgactg gtacccgcat agccgcctct
240tcgaccaggc cttcgggctg ccccggctgc cggaggagtg gtcgcagtgg
ttaggcggca 300gcagctggcc aggctacgtg cgccccctgc cccccgccgc
catcgagagc cccgcagtgg 360ccgcgcccgc ctacagccgc gcgctcagcc
ggcaactcag cagcggggtc tcggagatcc 420ggcacactgc ggaccgctgg
cgcgtgtccc tggatgtcaa ccacttcgcc ccggacgagc 480tgacggtcaa
gaccaaggat ggcgtggtgg agatcaccgg caagcacgag gagcggcagg
540acgagcatgg ctacatctcc cggtgcttca cgcggaaata cacgctgccc
cccggtgtgg 600accccaccca agtttcctcc tccctgtccc ctgagggcac
actgaccgtg gaggccccca 660tgcccaagct agccacgcag tccaacgaga
tcaccatccc agtcaccttc gagtcgcggg 720cccagcttgg gggcccagaa
gctgcaaaat ccgatgagac tgccgccaag taaagcctta 780gcccggatgc
ccacccctgc tgccgccact ggctgtgcct cccccgccac ctgtgtgttc
840ttttgataca tttatcttct gtttttctca aataaagttc aaagcaacca
cctgtcaaaa 900aaaaaaaaaa aaaa 914133262DNAHomo sapiens 13agagtgctcc
gcggccgtgt ggagcgaggc cttgttcccg cgttgagccg ccgccgccgc 60cgccgcctcc
tcagcttcag cctccgcgcc aggcccggcc ccgccgcgcc atgtcggact
120acagcacggg aggacccccg cccgggccgc cgccgcccgc cggcgggggc
gggggagccg 180gaggcgccgg gggaggccct ccgccgggcc cgccaggcgc
gggggaccgg ggcggcggcg 240gtcccggcgg cggcggcccg ggcggggggt
cggccggggg cccctctcag ccacccggcg 300gaggcggccc gggaatccgc
aaggacgctt tcgccgacgc cgtgcagcgg gcccgccaga 360ttgcagccaa
aattggaggc gatgctgcca cgacagtgaa taacagcact cctgattttg
420gttttggggg ccaaaagaga cagttggaag atggagatca accggagagc
aagaagctgg 480cttcccaggg agactcaatc agttctcaac ttggacccat
ccatcctccc ccaaggactt 540caatgacaga agagtacagg gtcccagacg
gcatggtggg cctgatcatt ggcagaggag 600gtgaacaaat taacaaaatc
caacaggatt caggctgcaa agtacagatt tctccagaca 660gcggtggcct
acccgagcgc agtgtgtcct tgacaggagc cccagaatct gtccagaaag
720ccaagatgat gctggatgac attgtgtctc ggggtcgtgg gggcccccca
ggacagttcc 780acgacaacgc caacgggggc cagaacggca ccgtgcagga
gatcatgatc cccgcgggca 840aggccggcct ggtcattggc aagggcgggg
agaccattaa gcagctgcag gaacgcgctg 900gagtgaagat gatcttaatt
caggacggat ctcagaatac gaatgtggac aaacctctcc 960gcatcattgg
ggatccttac aaagtgcagc aagcctgtga gatggtgatg gacatcctcc
1020gggaacgtga ccaaggcggc tttggggacc ggaatgagta cggatctcgg
attggcggag 1080gcatcgatgt gccagtgccc aggcattctg ttggcgtggt
cattggccgg agtggagaga 1140tgatcaagaa gatccagaat gatgctggcg
tgcggataca gttcaagcaa gatgacggga 1200cagggcccga gaagattgct
catataatgg ggcccccaga caggtgcgag cacgcagccc 1260ggatcatcaa
cgacctcctc cagagcctca ggagtggtcc cccaggtcct ccagggggtc
1320caggcatgcc cccggggggc cgaggccgag gaagaggcca aggcaattgg
ggtccccctg 1380gcggggagat gaccttctcc atccccactc acaagtgtgg
gctggtcatc ggccgaggtg 1440gcgagaatgt gaaagccata aaccagcaga
cgggagcctt cgtagagatc tcccggcagc 1500tgccacccaa cggggacccc
aacttcaagt tgttcatcat ccggggttca ccccagcaga 1560ttgaccacgc
caagcagctt atcgaggaaa agatcgaggg tcctctctgc ccagttggac
1620caggcccagg tggcccaggc cctgctggcc caatggggcc cttcaatcct
gggcccttca 1680accaggggcc acccggggct cccccacatg ccggggggcc
ccctcctcac cagtacccac 1740cccagggctg gggcaatacc tacccccagt
ggcagccgcc tgctcctcat gacccaagca 1800aagcagctgc agcggccgcg
gaccccaacg ccgcgtgggc cgcctactac tcacactact 1860accagcagcc
cccgggcccc gtccccggcc ccgcaccggc ccctgcggcc ccaccggctc
1920agggtgagcc ccctcagccc ccacccaccg gccagtcgga ctacactaag
gcctgggaag 1980agtattacaa aaagatcggc cagcagcccc agcagcccgg
agcaccccca cagcaggact 2040acacgaaggc ttgggaggag tactacaaga
agcaagcgca agtggccacc ggagggggtc 2100caggagctcc cccaggctcc
cagccagact acagtgccgc ctgggcggaa tattacagac 2160agcaggccgc
ttactacgga cagaccccag gtcctggcgg cccccagccg ccgcccacgc
2220agcagggaca gcagcaggct caatgaatcg aatgaatgtg aacttcttca
tctgtgaaaa 2280atcttttttt tttccatttt gttctgtttg ggggcttctg
ttttgtttgg cgagagagcg 2340atggctgccg tggggagtac tggggagccc
tcgcggcaag cagggtgggg gggacttggg 2400ggcatgccgg gccctcactc
tctcgcctgt tctgtgtctc acatgctttt tctttcaaaa 2460ttgggatcct
tccatgttga gccagccaga gaagatagcg agatctaaat ctctgccaaa
2520aaaaaaaaaa aacttaaaaa ttaaaaacac aaagagcaaa gcagaactta
taaaattata 2580tatatatata ttaaaaagtc tctattcttc accccccagc
cttcctgaac ctgcctctct 2640gaggataaag caattcattt tctcccaccc
tcggccctct tgtttttaaa ataaactttt 2700aaaaaggaaa aaaaaaagtc
actcttgcta tttctttttt ttagttagag gtggaacatt 2760ccttggacca
ggtgttgtat tgcaggaccc cttcccccag cagccaagcc ccctcttctc
2820tccctcccgc cctggctcag ctcccgcggc cccgcccgtc ccccctccca
ggactggtct 2880gttgtctttt catctgttca agaggagatt gaaactgaaa
acaaaatgag aacaacaaaa 2940aaaattgtat ggcagttttt actttttatc
gctcgttttt aacttcacaa ataaatgata 3000acaaaacctc cccgtctgcg
ggtgctgtct gtctcccccc ctttccttcc ctccctgtag 3060ttttgaagcg
gatgtttgtt ctttatagat gttgtttaaa aagcctgata atggtgattg
3120aaatttacaa actttgtgtt tttttttttt taagaaaaat ataaaatagt
tttcttcagg 3180ctcaatgtgc tttcctaacc gtgccccccc cccttttttt
tttttgttaa ataaagtgct 3240ttttgtttaa aaaaaaaaaa aa
3262143918DNAHomo sapiens 14ctctcccttc tccactctct ccccctgtct
cctttcttct tcttctttca ccctccgtct 60ctcacacccc ctccattccc ctgtctcctt
tctgacactg cactgcagct gctcctcagc 120cctgccccct ccccagtgag
aacaaaccag caacattgct ttttttccta aagagattta 180tattgatccg
attaaaaaaa aaaaacctta agaaacccca aacgcaaaaa aaaaaaaaaa
240aaaaaaagaa aaaagaaaag aaaaagccaa aacaaaaggg agaaccttct
cccggtagca 300gcggcaggaa ctgcaaacat gatggcggca gctcccatcc
agcagaacgg gacccacact 360ggggttccca tagacctgga cccgccggac
tcgcggaaaa ggccgctgga agccccccct 420gaagccggca gcaccaagag
gaccaatacg ggcgaagacg gccagtattt tctaaaggtt 480ctcataccta
gttatgctgc tggatctata attgggaagg gaggacagac aattgttcag
540ttgcaaaaag aaactggagc caccatcaag ctgtctaagt ccaaagattt
ttacccaggt 600actactgagc gagtgtgctt gatccaggga acggttgaag
cactgaatgc agttcatgga 660ttcattgcag aaaaaattcg agaaatgccc
caaaatgtgg ccaagacaga accagtcagc 720attctacaac cccagaccac
cgttaatcca gatcgcatca aacaaacatt gccatcttcc 780ccaactacca
ccaagtcctc tccatctgat cccatgacca cctccagagc taatcaggta
840aagattatag ttcccaacag cacagcaggt ctgataatag ggaagggagg
tgctactgtg 900aaggctgtaa tggagcagtc aggggcttgg gtgcagcttt
cccagaaacc tgatgggatc 960aacttgcaag agagggttgt cactgtgagt
ggagaacctg aacaaaaccg aaaagctgtt 1020gaacttatca tccagaagat
acaagaggat ccacaaagtg gcagctgtct caatatcagt 1080tatgccaatg
tgacaggtcc agtggcaaat tccaatccaa ccggatctcc ttatgcaaac
1140actgctgaag tgttaccaac tgctgcagca gctgcagggc tattaggaca
tgctaacctt 1200gctggcgttg cagcctttcc agcagtttta tctggcttca
caggcaatga cctggtggcc 1260atcacctctg cacttaatac attagccagc
tatggatata atctcaacac tttaggttta 1320ggtctcagtc aagcagcagc
aacaggggct ttggctgcag cagctgccag tgccaaccca 1380gcagcagcag
cagccaattt attggccacc tatgccagtg aagcctcagc cagtggcagc
1440acagctggtg gtacggcggg gacatttgca ttaggtagcc tggctgctgc
tactgctgca 1500accaatggat attttggagc tgcttctccc ctagctgcca
gtgccattct aggaacagaa 1560aagtccacag atggatccaa ggatgtagtt
gaaatagcag tgccagaaaa cttagttggt 1620gcaatacttg gcaaaggagg
gaaaacatta gtggaatacc aggagttgac tggtgcaagg 1680atacagatct
ccaaaaaagg agaattcgta cctggcacaa ggaatcggaa ggtaaccatt
1740actggaacac cagctgcaac acaggctgct caatatttaa ttacacaaag
gatcacatat 1800gagcaaggag ttcgggctgc caatcctcag aaagtgggtt
gagtgcccca gttacacatc 1860agattgtttt aacccctcct ttaccccatt
ttcaagaagg atgtactgta ctttgcagaa 1920gtgaagtttt tctgttatta
atatataatt atgcaaatga atgcgactat gttgacaatg 1980tgtatatgta
aataatatgt gttttaccag atgtttcata gaaagaattt tttcttgatc
2040tgttttgttc tctatacttt gcttgtgtat atttgtcaga ggtgtttcta
gtgtaagatt 2100taagcctgcc attttaccag cattattgta gtttaatgat
tgaatgtaga cagggatatg 2160cgtatagttt tcagtattag ttctagataa
cactaaatta actactgtta ggttgagtat 2220ggtggggtca gtgacctaaa
atggagtgag gccaaagcac tgtcctgtaa gtcttacttc 2280ctgcttaggg
cacagtgaag taggaaacaa tattttgaaa ataagtttta aatttaaaat
2340gatcaaaaag caatatagtt gcataaaagc actgtaaaat atttaaaagg
ttaaaactgt 2400ggaaaattat attggtaagt ttacagatca ataaaagcac
ctgttctcca tctgaactag 2460acaatggaaa taatgctgca tgctggccat
ggcccattct tcatcatttg taagttcaac 2520aaaagttctc acatggagtc
ccacctcttc agaggtttgt acatttgttt ttaagcactg 2580aattcactac
tgatcccatc gcctggccag tagaacagtc attactccat taacatcctc
2640actgtttaga cacataactg tggtacagtg tattggaaat tttataaaca
aaagtgaaag 2700tgccaacaaa ttattgatag ctgataatgt ttcattatct
gcaactgctt gataagtatg 2760ttgcatttta agagcttata attgtgtata
atttgttaac actagaaacc tattagtatt 2820gtgaatgtag attttactgt
gaagctatct gtgatttagc tgtttgctcc catgatggag 2880tctttgcagc
atggcgctag cagccaatgc agtttctaat actcggtaat ttgcatgttt
2940tgtggagcat ttttatgtca ccaaccagac agtatttcct gcatgcttat
ttagaagagg 3000cagcttatct tgagaggtag tgttatctac ctttgtcagg
ctttttgaca ggtcatttca 3060gagtaagcct ttgttcccaa gacccaacaa
ctgtcaccct cttctgtacc tctcctgagt 3120gccaactgtc caggccattt
gacacaccat ctgttaacct ctgagtttgc ccactcaagg 3180ccactcatag
gggcatccat ccccaagcac ctcctcatgc tgtgcatgca gtcttaaatt
3240caatggacaa aaataaaatg ctggctacct ctggatcatc tggctgagca
actgaattac 3300aaaagagaat tacttccatc tcaacttcaa cccattgatt
acgtccatcc tagcaagcta 3360aatggcatcc cagctgctcc tttctgtgca
accaattaaa gaacaatgag tgtgatgctc 3420catgtctgaa tttcgtccag
cctctctctg aactgtgatc tttgtcctca tgaactttcc 3480cttttgttca
ttgaactata tggactcttc atttcatatt gatttactgt gcaatttact
3540tttggacatt gagaacttga aattatttcc tgatcccttc cccttccact
attaataatt 3600catttctgtc aaactgtaag agtagactca tttttttttt
tttagttttt aacattggac 3660tgttatttca tttagagttc tctatctcta
aatatttatt tagagaatga ttttaaaagg 3720gaatgatatg cttgtttaaa
tgaaagagaa aagctgtagt aaactgtgtt aattggtaat 3780gactatttat
cgtcgatact ctgtagctgt gtaagttttg acaaatagtg tatctcgtgg
3840aatcagtggt tagcattgcc gctattatat ttactcattt tatcattata
aatgtgctta 3900gttcatcatg tagcatca 3918153846DNAHomo sapiens
15ctctcccttc tccactctct ccccctgtct cctttcttct tcttctttca ccctccgtct
60ctcacacccc ctccattccc ctgtctcctt tctgacactg cactgcagct gctcctcagc
120cctgccccct ccccagtgag aacaaaccag caacattgct ttttttccta
aagagattta 180tattgatccg attaaaaaaa aaaaacctta agaaacccca
aacgcaaaaa aaaaaaaaaa 240aaaaaaagaa aaaagaaaag aaaaagccaa
aacaaaaggg agaaccttct cccggtagca 300gcggcaggaa ctgcaaacat
gatggcggca gctcccatcc agcagaacgg gacccacact 360ggggttccca
tagacctgga cccgccggac tcgcggaaaa ggccgctgga agccccccct
420gaagccggca gcaccaagag gaccaatacg ggcgaagacg gccagtattt
tctaaaggtt 480ctcataccta gttatgctgc tggatctata attgggaagg
gaggacagac aattgttcag 540ttgcaaaaag aaactggagc caccatcaag
ctgtctaagt ccaaagattt ttacccaggt 600actactgagc gagtgtgctt
gatccaggga acggttgaag cactgaatgc agttcatgga 660ttcattgcag
aaaaaattcg agaaatgccc caaaatgtgg ccaagacaga accagtcagc
720attctacaac cccagaccac cgttaatcca gatcgcatca aacaagtaaa
gattatagtt 780cccaacagca cagcaggtct gataataggg aagggaggtg
ctactgtgaa ggctgtaatg 840gagcagtcag gggcttgggt gcagctttcc
cagaaacctg atgggatcaa cttgcaagag 900agggttgtca ctgtgagtgg
agaacctgaa caaaaccgaa aagctgttga acttatcatc 960cagaagatac
aagaggatcc acaaagtggc agctgtctca atatcagtta tgccaatgtg
1020acaggtccag tggcaaattc caatccaacc ggatctcctt atgcaaacac
tgctgaagtg 1080ttaccaactg ctgcagcagc tgcagggcta ttaggacatg
ctaaccttgc tggcgttgca 1140gcctttccag cagttttatc tggcttcaca
ggcaatgacc tggtggccat cacctctgca 1200cttaatacat tagccagcta
tggatataat ctcaacactt taggtttagg tctcagtcaa 1260gcagcagcaa
caggggcttt ggctgcagca gctgccagtg ccaacccagc agcagcagca
1320gccaatttat tggccaccta tgccagtgaa gcctcagcca gtggcagcac
agctggtggt 1380acggcgggga catttgcatt aggtagcctg gctgctgcta
ctgctgcaac caatggatat 1440tttggagctg cttctcccct agctgccagt
gccattctag gaacagaaaa gtccacagat 1500ggatccaagg atgtagttga
aatagcagtg ccagaaaact tagttggtgc aatacttggc 1560aaaggaggga
aaacattagt ggaataccag gagttgactg gtgcaaggat acagatctcc
1620aaaaaaggag aattcgtacc tggcacaagg aatcggaagg taaccattac
tggaacacca 1680gctgcaacac aggctgctca atatttaatt acacaaagga
tcacatatga gcaaggagtt 1740cgggctgcca atcctcagaa agtgggttga
gtgccccagt tacacatcag attgttttaa 1800cccctccttt accccatttt
caagaaggat gtactgtact ttgcagaagt gaagtttttc 1860tgttattaat
atataattat gcaaatgaat gcgactatgt tgacaatgtg tatatgtaaa
1920taatatgtgt tttaccagat gtttcataga aagaattttt tcttgatctg
ttttgttctc 1980tatactttgc ttgtgtatat ttgtcagagg tgtttctagt
gtaagattta agcctgccat 2040tttaccagca ttattgtagt ttaatgattg
aatgtagaca gggatatgcg tatagttttc 2100agtattagtt ctagataaca
ctaaattaac tactgttagg ttgagtatgg tggggtcagt 2160gacctaaaat
ggagtgaggc caaagcactg tcctgtaagt cttacttcct gcttagggca
2220cagtgaagta ggaaacaata ttttgaaaat aagttttaaa tttaaaatga
tcaaaaagca 2280atatagttgc ataaaagcac tgtaaaatat ttaaaaggtt
aaaactgtgg aaaattatat 2340tggtaagttt acagatcaat aaaagcacct
gttctccatc tgaactagac aatggaaata 2400atgctgcatg ctggccatgg
cccattcttc atcatttgta agttcaacaa aagttctcac 2460atggagtccc
acctcttcag aggtttgtac atttgttttt aagcactgaa ttcactactg
2520atcccatcgc ctggccagta gaacagtcat tactccatta acatcctcac
tgtttagaca 2580cataactgtg gtacagtgta ttggaaattt tataaacaaa
agtgaaagtg ccaacaaatt 2640attgatagct gataatgttt cattatctgc
aactgcttga taagtatgtt gcattttaag 2700agcttataat tgtgtataat
ttgttaacac tagaaaccta ttagtattgt gaatgtagat 2760tttactgtga
agctatctgt gatttagctg tttgctccca tgatggagtc tttgcagcat
2820ggcgctagca gccaatgcag tttctaatac tcggtaattt gcatgttttg
tggagcattt 2880ttatgtcacc aaccagacag tatttcctgc atgcttattt
agaagaggca gcttatcttg 2940agaggtagtg ttatctacct ttgtcaggct
ttttgacagg tcatttcaga gtaagccttt 3000gttcccaaga cccaacaact
gtcaccctct tctgtacctc tcctgagtgc caactgtcca 3060ggccatttga
cacaccatct gttaacctct gagtttgccc actcaaggcc actcataggg
3120gcatccatcc ccaagcacct cctcatgctg tgcatgcagt cttaaattca
atggacaaaa 3180ataaaatgct ggctacctct ggatcatctg gctgagcaac
tgaattacaa aagagaatta 3240cttccatctc aacttcaacc cattgattac
gtccatccta gcaagctaaa tggcatccca 3300gctgctcctt tctgtgcaac
caattaaaga acaatgagtg tgatgctcca tgtctgaatt 3360tcgtccagcc
tctctctgaa ctgtgatctt tgtcctcatg aactttccct tttgttcatt
3420gaactatatg gactcttcat ttcatattga tttactgtgc aatttacttt
tggacattga 3480gaacttgaaa ttatttcctg atcccttccc cttccactat
taataattca tttctgtcaa 3540actgtaagag tagactcatt tttttttttt
tagtttttaa cattggactg ttatttcatt 3600tagagttctc tatctctaaa
tatttattta gagaatgatt ttaaaaggga atgatatgct 3660tgtttaaatg
aaagagaaaa gctgtagtaa actgtgttaa ttggtaatga ctatttatcg
3720tcgatactct gtagctgtgt aagttttgac aaatagtgta tctcgtggaa
tcagtggtta 3780gcattgccgc tattatattt actcatttta tcattataaa
tgtgcttagt tcatcatgta 3840gcatca 3846163340DNAHomo sapiens
16tgcgggcgtc tccgccattt tgtgagtcta taactcggag ccgttgggtc ggttcctgct
60attccggcgc ctccactccg tcccccgcgg gtctgctctg tgtgccatgg acggcattgt
120cccagatata gccgttggta caaagcgggg atctgacgag cttttctcta
cttgtgtcac 180taacggaccg tttatcatga gcagcaactc ggcttctgca
gcaaacggaa atgacagcaa 240gaagttcaaa ggtgacagcc gaagtgcagg
cgtcccctct agagtgatcc acatccggaa 300gctccccatc gacgtcacgg
agggggaagt catctccctg gggctgccct ttgggaaggt 360caccaacctc
ctgatgctga aggggaaaaa ccaggccttc atcgagatga acacggagga
420ggctgccaac accatggtga actactacac ctcggtgacc cctgtgctgc
gcggccagcc 480catctacatc cagttctcca accacaagga gctgaagacc
gacagctctc ccaaccaggc 540gcgggcccag gcggccctgc aggcggtgaa
ctcggtccag tcggggaacc tggccttggc 600tgcctcggcg gcggccgtgg
acgcagggat ggcgatggcc gggcagagcc ccgtgctcag 660gatcatcgtg
gagaacctct tctaccctgt gaccctggat gtgctgcacc agattttctc
720caagttcggc acagtgttga agatcatcac cttcaccaag aacaaccagt
tccaggccct 780gctgcagtat gcggaccccg tgagcgccca gcacgccaag
ctgtcgctgg acgggcagaa 840catctacaac gcctgctgca cgctgcgcat
cgacttttcc aagctcacca gcctcaacgt 900caagtacaac aatgacaaga
gccgtgacta cacacgccca gacctgcctt ccggggacag 960ccagccctcg
ctggaccaga ccatggccgc ggccttcggt gcacctggta taatctcagc
1020ctctccgtat gcaggagctg gtttccctcc cacctttgcc attcctcaag
ctgcaggcct 1080ttccgttccg aacgtccacg gcgccctggc ccccctggcc
atcccctcgg cggcggcggc 1140agctgcggcg gcaggtcgga tcgccatccc
gggcctggcg ggggcaggaa attctgtatt 1200gctggtcagc aacctcaacc
cagagagagt cacaccccaa agcctcttta ttcttttcgg 1260cgtctacggt
gacgtgcagc gcgtgaagat cctgttcaat aagaaggaga acgccctagt
1320gcagatggcg gacggcaacc aggcccagct ggccatgagc cacctgaacg
ggcacaagct 1380gcacgggaag cccatccgca tcacgctctc gaagcaccag
aacgtgcagc tgccccgcga 1440gggccaggag gaccagggcc tgaccaagga
ctacggcaac tcacccctgc accgcttcaa 1500gaagccgggc tccaagaact
tccagaacat attcccgccc tcggccacgc tgcacctctc 1560caacatcccg
ccctcagtct ccgaggagga tctcaaggtc ctgttttcca gcaatggggg
1620cgtcgtcaaa ggattcaagt tcttccagaa ggaccgcaag atggcactga
tccagatggg 1680ctccgtggag gaggcggtcc aggccctcat tgacctgcac
aaccacgacc tcggggagaa 1740ccaccacctg cgggtctcct tctccaagtc
caccatctag gggcacaggc ccccacggcc 1800gggccccctg gcgacaactt
ccatcattcc agagaaaagc cactttaaaa acagctgaag 1860tgaccttagc
agaccagaga ttttattttt ttaaagagaa atcagtttac ctgtttttaa
1920aaaaattaaa tctagttcac cttgctcacc ctgcggtgac agggacagct
caggctcttg 1980gtgactgtgg cagcgggagt tcccggccct ccacacccgg
ggccagaccc tcggggccat 2040gccttggtgg ggcctgtgtc gggcgtgggg
cctgcaggtg ggcgccccga ccacgacttg 2100gcttccttgt gccttaaaaa
acctgccttc ctgcagccac acacccaccc ggggtgtcct 2160ggggacccaa
ggggtggggg ggtcacacca gagagaggca gggggcctgg ccggctcctg
2220caggatcatg cagctggggc gcggcggccg cggctgcgac accccaaccc
cagccctcta 2280atcaagtcac gtgattctcc cttcaccccg cccccagggc
cttcccttct gcccccaggc 2340gggctccccg ctgctccagc tgcggagctg
gtcgacataa tctctgtatt atatactttg 2400cagttgcaga cgtctgtgcc
tagcaatatt tccagttgac caaatattct aatctttttt 2460catttatatg
caaaagaaat agttttaagt aactttttat agcaagatga tacaatggta
2520tgagtgtaat ctaaacttcc ttgtggtatt accttgtatg ctgttacttt
tattttattc 2580cttgtaatta agtcacaggc aggacccagt ttccagagag
caggcggggc cgcccagtgg 2640gtcaggcaca gggagccccg gtcctatctt
agagcccctg agcttcaggg aaggggcggg 2700cgtgtcgccg cctctggcat
cgcctccggt tgccttacac cacgccttca cctgcagtcg 2760cctagaaaac
ttgctctcaa acttcagggt tttttcttcc ttcaaatttt ggaccaaagt
2820ctcatttctg tgttttgcct gcctctgatg ctgggacccg gaaggcgggc
gctcctcctg 2880tcttctctgt gctctttcta ccgcccccgc gtcctgtccc
gggggctctc ctaggatccc 2940ctttccgtaa aagcgtgtaa caagggtgta
aatatttata attttttata cctgttgtga 3000gacccgaggg gcggcggcgc
ggttttttat ggtgacacaa atgtatattt tgctaacagc 3060aattccaggc
tcagtattgt gaccgcggag ccacagggga ccccacgcac attccgttgc
3120cttacccgat ggcttgtgac gcggagagaa ccgattaaaa ccgtttgaga
aactcctccc 3180ttgtctagcc ctgtgttcgc tgtggacgct gtagaggcag
gttggccagt ctgtacctgg 3240acttcgaata aatcttctgt atcctcgctc
cgttccgcct taaaaaaaaa aaaaaaaaaa 3300aaaaaaaaaa aaaaaaaaaa
aaaaaaaaaa aaaaaaaaaa 3340173319DNAHomo sapiens 17tgcgggcgtc
tccgccattt tgtgagtcta taactcggag ccgttgggtc ggttcctgct 60attccggcgc
ctccactccg tcccccgcgg gtctgctctg tgtgccatgg acggcattgt
120cccagatata gccgttggta caaagcgggg atctgacgag cttttctcta
cttgtgtcac 180taacggaccg tttatcatga gcagcaactc ggcttctgca
gcaaacggaa atgacagcaa 240gaagttcaaa ggtgacagcc gaagtgcagg
cgtcccctct agagtgatcc acatccggaa 300gctccccatc gacgtcacgg
agggggaagt catctccctg gggctgccct ttgggaaggt 360caccaacctc
ctgatgctga aggggaaaaa ccaggccttc atcgagatga acacggagga
420ggctgccaac accatggtga actactacac ctcggtgacc cctgtgctgc
gcggccagcc 480catctacatc cagttctcca accacaagga gctgaagacc
gacagctctc ccaaccaggc 540gcgggcccag gcggccctgc aggcggtgaa
ctcggtccag tcggggaacc tggccttggc 600tgcctcggcg gcggccgtgg
acgcagggat ggcgatggcc gggcagagcc ccgtgctcag 660gatcatcgtg
gagaacctct tctaccctgt gaccctggat gtgctgcacc agattttctc
720caagttcggc acagtgttga agatcatcac cttcaccaag aacaaccagt
tccaggccct 780gctgcagtat gcggaccccg tgagcgccca gcacgccaag
ctgtcgctgg acgggcagaa 840catctacaac gcctgctgca cgctgcgcat
cgacttttcc aagctcacca gcctcaacgt 900caagtacaac aatgacaaga
gccgtgacta cacacgccca gacctgcctt ccggggacag 960ccagccctcg
ctggaccaga ccatggccgc ggccttcgcc tctccgtatg caggagctgg
1020tttccctccc acctttgcca ttcctcaagc tgcaggcctt tccgttccga
acgtccacgg 1080cgccctggcc cccctggcca tcccctcggc ggcggcggca
gctgcggcgg caggtcggat 1140cgccatcccg ggcctggcgg gggcaggaaa
ttctgtattg ctggtcagca acctcaaccc 1200agagagagtc acaccccaaa
gcctctttat tcttttcggc gtctacggtg acgtgcagcg 1260cgtgaagatc
ctgttcaata agaaggagaa cgccctagtg cagatggcgg acggcaacca
1320ggcccagctg gccatgagcc acctgaacgg gcacaagctg cacgggaagc
ccatccgcat 1380cacgctctcg aagcaccaga acgtgcagct gccccgcgag
ggccaggagg accagggcct 1440gaccaaggac tacggcaact cacccctgca
ccgcttcaag aagccgggct ccaagaactt 1500ccagaacata ttcccgccct
cggccacgct gcacctctcc aacatcccgc cctcagtctc 1560cgaggaggat
ctcaaggtcc tgttttccag caatgggggc gtcgtcaaag gattcaagtt
1620cttccagaag gaccgcaaga tggcactgat ccagatgggc tccgtggagg
aggcggtcca 1680ggccctcatt gacctgcaca accacgacct cggggagaac
caccacctgc gggtctcctt 1740ctccaagtcc accatctagg ggcacaggcc
cccacggccg ggccccctgg cgacaacttc 1800catcattcca gagaaaagcc
actttaaaaa cagctgaagt gaccttagca gaccagagat 1860tttatttttt
taaagagaaa tcagtttacc tgtttttaaa aaaattaaat ctagttcacc
1920ttgctcaccc tgcggtgaca gggacagctc aggctcttgg tgactgtggc
agcgggagtt 1980cccggccctc cacacccggg gccagaccct cggggccatg
ccttggtggg gcctgtgtcg 2040ggcgtggggc ctgcaggtgg gcgccccgac
cacgacttgg cttccttgtg ccttaaaaaa 2100cctgccttcc tgcagccaca
cacccacccg gggtgtcctg gggacccaag gggtgggggg 2160gtcacaccag
agagaggcag ggggcctggc cggctcctgc aggatcatgc agctggggcg
2220cggcggccgc ggctgcgaca ccccaacccc agccctctaa tcaagtcacg
tgattctccc 2280ttcaccccgc ccccagggcc ttcccttctg cccccaggcg
ggctccccgc tgctccagct 2340gcggagctgg tcgacataat ctctgtatta
tatactttgc agttgcagac gtctgtgcct 2400agcaatattt ccagttgacc
aaatattcta atcttttttc atttatatgc aaaagaaata 2460gttttaagta
actttttata gcaagatgat acaatggtat gagtgtaatc taaacttcct
2520tgtggtatta ccttgtatgc tgttactttt attttattcc ttgtaattaa
gtcacaggca 2580ggacccagtt tccagagagc aggcggggcc gcccagtggg
tcaggcacag ggagccccgg 2640tcctatctta gagcccctga gcttcaggga
aggggcgggc gtgtcgccgc ctctggcatc 2700gcctccggtt gccttacacc
acgccttcac ctgcagtcgc ctagaaaact tgctctcaaa 2760cttcagggtt
ttttcttcct tcaaattttg gaccaaagtc tcatttctgt gttttgcctg
2820cctctgatgc tgggacccgg aaggcgggcg ctcctcctgt cttctctgtg
ctctttctac 2880cgcccccgcg tcctgtcccg ggggctctcc taggatcccc
tttccgtaaa agcgtgtaac 2940aagggtgtaa atatttataa ttttttatac
ctgttgtgag acccgagggg cggcggcgcg 3000gttttttatg gtgacacaaa
tgtatatttt gctaacagca attccaggct cagtattgtg 3060accgcggagc
cacaggggac cccacgcaca ttccgttgcc ttacccgatg gcttgtgacg
3120cggagagaac cgattaaaac cgtttgagaa actcctccct tgtctagccc
tgtgttcgct 3180gtggacgctg tagaggcagg ttggccagtc tgtacctgga
cttcgaataa atcttctgta 3240tcctcgctcc gttccgcctt aaaaaaaaaa
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3300aaaaaaaaaa aaaaaaaaa
3319183262DNAHomo sapiens 18tgcgggcgtc tccgccattt tgtgagtcta
taactcggag ccgttgggtc ggttcctgct 60attccggcgc ctccactccg tcccccgcgg
gtctgctctg tgtgccatgg acggcattgt 120cccagatata gccgttggta
caaagcgggg atctgacgag cttttctcta cttgtgtcac 180taacggaccg
tttatcatga gcagcaactc ggcttctgca gcaaacggaa atgacagcaa
240gaagttcaaa ggtgacagcc gaagtgcagg cgtcccctct agagtgatcc
acatccggaa
300gctccccatc gacgtcacgg agggggaagt catctccctg gggctgccct
ttgggaaggt 360caccaacctc ctgatgctga aggggaaaaa ccaggccttc
atcgagatga acacggagga 420ggctgccaac accatggtga actactacac
ctcggtgacc cctgtgctgc gcggccagcc 480catctacatc cagttctcca
accacaagga gctgaagacc gacagctctc ccaaccaggc 540gcgggcccag
gcggccctgc aggcggtgaa ctcggtccag tcggggaacc tggccttggc
600tgcctcggcg gcggccgtgg acgcagggat ggcgatggcc gggcagagcc
ccgtgctcag 660gatcatcgtg gagaacctct tctaccctgt gaccctggat
gtgctgcacc agattttctc 720caagttcggc acagtgttga agatcatcac
cttcaccaag aacaaccagt tccaggccct 780gctgcagtat gcggaccccg
tgagcgccca gcacgccaag ctgtcgctgg acgggcagaa 840catctacaac
gcctgctgca cgctgcgcat cgacttttcc aagctcacca gcctcaacgt
900caagtacaac aatgacaaga gccgtgacta cacacgccca gacctgcctt
ccggggacag 960ccagccctcg ctggaccaga ccatggccgc ggccttcggc
ctttccgttc cgaacgtcca 1020cggcgccctg gcccccctgg ccatcccctc
ggcggcggcg gcagctgcgg cggcaggtcg 1080gatcgccatc ccgggcctgg
cgggggcagg aaattctgta ttgctggtca gcaacctcaa 1140cccagagaga
gtcacacccc aaagcctctt tattcttttc ggcgtctacg gtgacgtgca
1200gcgcgtgaag atcctgttca ataagaagga gaacgcccta gtgcagatgg
cggacggcaa 1260ccaggcccag ctggccatga gccacctgaa cgggcacaag
ctgcacggga agcccatccg 1320catcacgctc tcgaagcacc agaacgtgca
gctgccccgc gagggccagg aggaccaggg 1380cctgaccaag gactacggca
actcacccct gcaccgcttc aagaagccgg gctccaagaa 1440cttccagaac
atattcccgc cctcggccac gctgcacctc tccaacatcc cgccctcagt
1500ctccgaggag gatctcaagg tcctgttttc cagcaatggg ggcgtcgtca
aaggattcaa 1560gttcttccag aaggaccgca agatggcact gatccagatg
ggctccgtgg aggaggcggt 1620ccaggccctc attgacctgc acaaccacga
cctcggggag aaccaccacc tgcgggtctc 1680cttctccaag tccaccatct
aggggcacag gcccccacgg ccgggccccc tggcgacaac 1740ttccatcatt
ccagagaaaa gccactttaa aaacagctga agtgacctta gcagaccaga
1800gattttattt ttttaaagag aaatcagttt acctgttttt aaaaaaatta
aatctagttc 1860accttgctca ccctgcggtg acagggacag ctcaggctct
tggtgactgt ggcagcggga 1920gttcccggcc ctccacaccc ggggccagac
cctcggggcc atgccttggt ggggcctgtg 1980tcgggcgtgg ggcctgcagg
tgggcgcccc gaccacgact tggcttcctt gtgccttaaa 2040aaacctgcct
tcctgcagcc acacacccac ccggggtgtc ctggggaccc aaggggtggg
2100ggggtcacac cagagagagg cagggggcct ggccggctcc tgcaggatca
tgcagctggg 2160gcgcggcggc cgcggctgcg acaccccaac cccagccctc
taatcaagtc acgtgattct 2220cccttcaccc cgcccccagg gccttccctt
ctgcccccag gcgggctccc cgctgctcca 2280gctgcggagc tggtcgacat
aatctctgta ttatatactt tgcagttgca gacgtctgtg 2340cctagcaata
tttccagttg accaaatatt ctaatctttt ttcatttata tgcaaaagaa
2400atagttttaa gtaacttttt atagcaagat gatacaatgg tatgagtgta
atctaaactt 2460ccttgtggta ttaccttgta tgctgttact tttattttat
tccttgtaat taagtcacag 2520gcaggaccca gtttccagag agcaggcggg
gccgcccagt gggtcaggca cagggagccc 2580cggtcctatc ttagagcccc
tgagcttcag ggaaggggcg ggcgtgtcgc cgcctctggc 2640atcgcctccg
gttgccttac accacgcctt cacctgcagt cgcctagaaa acttgctctc
2700aaacttcagg gttttttctt ccttcaaatt ttggaccaaa gtctcatttc
tgtgttttgc 2760ctgcctctga tgctgggacc cggaaggcgg gcgctcctcc
tgtcttctct gtgctctttc 2820taccgccccc gcgtcctgtc ccgggggctc
tcctaggatc ccctttccgt aaaagcgtgt 2880aacaagggtg taaatattta
taatttttta tacctgttgt gagacccgag gggcggcggc 2940gcggtttttt
atggtgacac aaatgtatat tttgctaaca gcaattccag gctcagtatt
3000gtgaccgcgg agccacaggg gaccccacgc acattccgtt gccttacccg
atggcttgtg 3060acgcggagag aaccgattaa aaccgtttga gaaactcctc
ccttgtctag ccctgtgttc 3120gctgtggacg ctgtagaggc aggttggcca
gtctgtacct ggacttcgaa taaatcttct 3180gtatcctcgc tccgttccgc
cttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3240aaaaaaaaaa
aaaaaaaaaa aa 3262191148DNAHomo sapiens 19gcgccgagac ccgctcctgc
agtattagtt cttgcagctg gtggtggcgg ctgaggcggc 60atggatctca gcgagctgga
gagagacaat acaggccgct gtcgcctgag ttcgcctgtg 120cccgcggtgt
gccgcaagga gccttgcgtc ctgggcgtcg atgaggcggg caggggcccc
180gtgctgggcc ccatggtcta cgccatctgt tattgtcccc tgcctcgcct
ggcagatctg 240gaggcgctga aagtggcaga ctcaaagacc ctattggaga
gcgagcggga aaggctgttt 300gcgaaaatgg aggacacgga ctttgtcggc
tgggcgctgg atgtgctgtc tccaaacctc 360atctctacca gcatgcttgg
gcgggtcaaa tacaacctga actccctgtc acatgataca 420gccactgggc
ttatacagta tgcattggac cagggcgtga acgtcaccca ggtattcgtg
480gacaccgtag ggatgccaga gacataccag gcgcggctgc agcaaagttt
tcccgggatt 540gaggtgacgg tcaaggccaa agcagatgcc ctctacccgg
tggttagtgc tgccagcatc 600tgtgccaagg tggcccggga ccaggccgtg
aagaaatggc agttcgtgga gaaactgcag 660gacttggata ctgattatgg
ctcaggctac cccaatgatc ccaagacaaa agcgtggttg 720aaggagcacg
tggagcctgt gttcggcttc ccccagtttg tccggttcag ctggcgcacg
780gcccagacca tcctggagaa agaggcggaa gatgttatat gggaggactc
agcatccgag 840aatcaggagg gactcaggaa gatcacatcc tacttcctca
atgaagggtc ccaagcccgt 900ccccgttctt cccaccgata tttcctggaa
cgcggcctgg agtcagcaac cagcctctag 960cagctgcctc tacgcgctct
acctgcttcc ccaacccaga cattaaaatt gtttaaggag 1020aaccacacgt
aggggatgta cttttgggac agaagcaagg tgggagtgtg ctctgcagcc
1080gggtccagct acttcctttt ggaaccttaa atagaatggg tgttggttga
ttaattttat 1140ttaaaaaa 1148201613DNAHomo sapiens 20ggagaaacac
acacgggcgg gcggagggga cccggggcga gtcatcaagg gcgcgtggtt 60cggcgtgcca
ggcgcgctgc tctgcctgct ctcttggctt ctgtctccct tcgaccgatc
120gccccctatc ctgaagcttt ccaatgtcat cttggagccc caaagtttcc
tggggcctcc 180gcgttgtgcg tcccagaacc ccttgcctgc ccctgaggga
aacgcggagc cataggcagc 240gggacgtcgg gagccagccc aggggaggcc
agattcagca tttggacagc ggctctgggg 300cgcagtcggc ccagcgagtt
tgccggtgaa cagcctcggg cacatggcgg gtaggagggc 360cgcagggctg
ctctgggtct tgaagaagca ggacccagcc tagagggcat ccccagctcc
420gaatgggaca cgttttcccg agataaaaga tcccttctga gctcacacgg
gagccccggg 480accatccaat ccagcgtgga tatccccagc ctaaccaaca
cctgtgctgg ggggaaagat 540aagacgcccc ctttcagcca ggaggtggac
gaccctcatg ccctcagctc tccattcttc 600ccaaagcagc tcggatccct
aagtctggag ctgccagcga ggcttccaac ccgctgcttg 660ccatcacctc
ccaggtcgtt ggtggctccg attactcccc tgctggtgcc tccctccttg
720gcgcgcttcc cacctgcgat cggcgccctc ttcgcagtca cgaactcgcc
agcagctagc 780agcactgact agtaggaggg cccgccggag gagagccgcg
cggcccacag aagcggaacg 840cgcgtcgaga gcgccctgtc cgctcgcccc
agacagatgc ccggttattc attaccgcga 900ggcctagagg aaagagtggc
tgccgtcttc ctgcccacag cccgccggac cctccgtcgc 960ggctgcccgg
tccccggagc cgcagccgcc gagcccggct gtgcgtgtcg tggctgctgg
1020ggagaaagag gcttccggac atgctctgga gtcagaagac agcgaaaaga
gaagcagaag 1080ccccggtggc aagagtctga aggaaggatg actgtagcct
gtggattgta ctgcagtagg 1140aaactgtcct agcaaggctc cactttgccc
cagcttcaag ctggaaagga ggagaacatg 1200aaacattgct tgaagacaat
ggccgagaca gcaggtccca ccctgcacag ccaccagcat 1260ctctcccctc
agccctgtct cctcttctgc agttgggatc tgcacattta agcctgaaat
1320tgtcctgtga agtgaagtat gatcggacag cctcttttca gcttttatga
caatggagac 1380agaggaattg tggctcttgc caaggtcaca ggattggaat
acagagccaa gccaccccag 1440gacatgcaag agcctcagaa gggaaaaaag
cccagcagga agggagaaca agtagcctct 1500gtcctgaagt tgtaacagcc
aggggccagg atggaggagg aggaccccat aatctgccca 1560tctgggactt
ggcaggggac ctgggaaaat gtaccccaac ccatccctta agg 161321606RNAHomo
sapiens 21ccugcuggug ccucccuccu uggcgcgcuu cccaccugcg aucggcgccc
ucuucgcagu 60cacgaacucg ccagcagcua gcagcacuga cuaguaggag ggcccgccgg
aggagaggac 120augcucugga gucagaagac agcgaaaaga gaagcagaag
ccccgguggc aagagucuga 180agcuggaaag gaggagaaca ugaaacauug
cuugaagaca auggccgaga cagcaggucc 240cacccugcac agccaccagc
aucucucccc ucagcccugu cuccucuucu gcaguuggga 300ucugcacauu
uaagccugaa auuguccugu gaagugaagu augaucggac agccucuuuu
360cagcuuuuau gacaauggag acagaggaau uguggcucuu gccaagguca
caggauugga 420auacagagcc aagccacccc aggacaugca agagccucag
aagggaaaaa agcccagcag 480gaagggagaa caaguagccu cuguccugaa
guuguaacag ccaggggcca ggauggagga 540ggaggacccc auaaucugcc
caucugggac uuggcagggg accugggaaa auguacccca 600acccau
60622551DNAHomo sapiens 22cttcgcagtc acgaactcgc cagcagctag
cagcactgac tagtaggagg gcccgccgga 60ggagaggaag ccccagagag attggtgagg
gtgatttccc aggaagacgc agtgtgctct 120gacttctgtg acagtgagca
acgggaccag tggatgtcca gatgctggca atgagacatg 180ctctggagtc
agaagacagc gaaaagagaa gcagaagccc cggtggcaag agtctgaagg
240aaggatgact gtagcctgtg gattgtactg cagtaggaaa ctgtcctagc
aaggctccac 300tttgccccag cttcaagctg gaaaggagga gaacatgaaa
cattgcttga agacaatggc 360cgagacagca ggtcccaccc tgcacagcca
ccagcatctc tcccctcagc cctgtctcct 420cttctgcagt tgggatctgc
acatttaagc ctgaaattgt cctgtgaagt gaagtatgat 480cggacagcct
cttttcagct tttatgacaa tggagacaga ggaattgtgg ctcttgccaa
540ggtcacagga t 551231877DNAHomo sapiens 23tcgccagcag ctagcagcac
tgactagtag gagggcccgc cggaggagag ccgcgcggcc 60cacagaagcg gaacgcgcgt
cgagagcgcc ctgtccgctc gccccagaca gatgcccggt 120tattcattac
cgcgaggcct agaggaaaga gtggctgccg tcttcctgcc cacagcccgc
180cggaccctcc gtcgcggctg cccggtcccc ggagccgcag ccgccgagcc
cggctgtgcg 240tgtcgtggct gctggggaga aagaggcttc cggaagcccc
agagagattg gtgagggtga 300tttcccagga agacgcagtg tgctctgact
tctgtgacag tgagcaacgg gaccagtgga 360tgtccagatg ctggcaatga
gacatgctct ggagtcagaa gacagcgaaa agagaagcag 420aagccccggt
ggcaagagtc tgaagcagga aggatgactg tagcctgtgg attgtactgc
480agtaggaaac tgtcctagca aggctccact ttgccccagc ttcaagctgg
aaaggaggag 540aacatgaaac attgcttgaa gacaatggcc gagacagcag
gtcccaccct gcacagccac 600cagcatctct cccctcagcc ctgtctcctc
ttctgcagtt gggatctgca catttaagcc 660tgaaattgtc ctgtgaagtg
aagtatgatc ggacagcctc ttttcagctt ttatgacaat 720ggagacagag
gaattgtggc tcttgccaag gtcacaggat tggaatacag agccaagcca
780ccccaggaca tgcaagagcc tcagaaggga aaaaagccca gcaggaaggg
agaacaagta 840gcctctgtcc tgaagttgta acagccaggg gccaggatgg
aggaggagga ccccataatc 900tgcccatctg ggacttggca ggggacctgg
gaaaatgtac cccaacccat cccttaaggg 960cctttgtctt tggcccattg
gcctagcatc tacttcttca ccgtgtctgt tcttgtcaca 1020cctagtcagg
tctgtttggg tctgaggtgc atggaacatt ctgggtaggc ctccagcaaa
1080cggaagctct tcaccgtgtt tccagcctgg gaccaagggc agcatactgg
caaagttgcc 1140aaagcaaggg actccagcct cttaggagtt aatgactccc
tctccccagc tgtcctcccc 1200ttggtgctcc tcttcctccc tcctcctgct
cacagcaggc agggcctaga cccgggagcc 1260atgctgctgt gctgttgcca
ggggagcacg gaggcagatc tgagctatgc agggaaaagg 1320cccagcctgt
caaagtgtct gagatgaacc gccgccgtcc ctgtgcagct gggctcagac
1380gtgtctcagc tcttgttctg tgcctgagaa tggcgaaacc cagtgaggtt
caagggcaaa 1440ctcgctattc attagtcagg ggttcttgac gtcccgtctc
tcccagggat gagttccccc 1500ctcctctttc tccccctcct atgacacatt
cctgggtgcc tttggtgagg actgcacacc 1560ctcctcctgc ctagccccct
ctccaaaggc ccctgaataa actcccccca aggagaccag 1620gcagggcaga
gacaatggct gcaggaaatc attcaggcgg gacatgctgg cctgccctcc
1680acccagtccc cctgtgggcc ccactccctt ctgattcagg gcacccttgg
gcccccagcc 1740tatacaggcc tggacaggaa gaaaccactg ggaaccaccc
taaggacaac atgctagtcc 1800agtgccattc ttcgctggct ctgtgggtgc
ctttgtggcc tgtaccgact ggctggctaa 1860ttttgtggtt tctgtac
187724561DNAHomo sapiens 24agcactgact agtaggaggg cccgccggag
gagaggacat gctctggagt cagaagacag 60cgaaaagaga agcagaagcc ccggtggcaa
gagtctgaag caggaaggat gactgtagcc 120tgtggattgt actgcagtag
gaaactgtcc tagcaaggct ccactttgcc ccagcttcaa 180gctggaaagg
aggagaacat gaaacattgc ttgaagacaa tggccgagac agcaggtccc
240accctgcaca gccaccagca tctctcccct cagccctgtc tcctcttctg
cagttgggat 300ctgcacattt aagcctgaaa ttgtcctgtg aagtgaagta
tgatcggaca gcctcttttc 360agcttttatg acaatggaga cagaggaatt
gtggctcttg ccaaggtcac aggattggaa 420tacagagcca agccacccca
ggacatgcaa gagcctcaga agggaaaaaa gcccagcagg 480aagggagaac
aagtagcctc tgtcctgaag ttgtaacagc caggggccag gatggaggag
540gaggacccca taatctgccc a 56125786DNAHomo sapiens 25ctgggagtgg
cgcggctgct tcccgcccgc gcaggatcag gccggccccc gcgggcctgg 60agctggatcc
agagctaggg aaactggaaa aacaggcaca aactcggaag ccgcggtacg
120gcaagagcct aagcaaagaa tcctttccaa gattcacacc tcgtctacac
cagggcaccg 180cctgggccta cggccttccg aacccgaagc gcccgcagcc
cagagctggc atcaggccat 240caggccggga aggtcgtcgc aggccccaga
gtgcgggcgc ggggggcgcg cgcccacagg 300acgcccgggg ttgggtaggc
aggagagaag ggcgccagca ggcccgcggc tgtttcccct 360cggtccgcac
agcgggcccg ggaggccatt ttgagagcgc gaagaggggc ggcaagatgg
420ctgcgtgggc acccggaagg tcgccgcgcc aagggcccgc tgagcccctc
ctcccattcg 480tccagccgcg cggcccacag aagcggaacg cgcgtcgaga
gcgccctgtc cgctcgcccc 540agacagatgc ccggttattc attaccgcga
ggcctagagg aaagagtggc tgccgtcttc 600ctgcccacag cccgccggac
cctccgtcgc ggctgcccgg tccccggagc cgcagccgcc 660gagcccggct
gtgcgtgtcg tggctgctgg ggagaaagag gcttccggac atgctctgga
720gtcagaagac agcgaaaaga gaagcagaag ccccggtggc aagagtctga
agcaggaagg 780atgact 78626569DNAHomo sapiens 26ttaccgcgag
gcctagagga aagagtggct gccgtcttcc tgcccacagc ccgccggacc 60ctccgtcgcg
gctgcccggt ccccggagcc gcagccgccg agcccggctg tgcgtgtcgt
120ggctgctggg gagaaagagg cttccggaca tgctctggag tcagaagaca
gcgaaaagag 180aagcagaagc cccggtggca agagtctgaa gggagaaaat
aacccagttt gggaaggaca 240tttaaaaggg gaaaatatta ggaaggatga
ctgtagcctg tggattgtac tgcagtagga 300aactgtccta gcaaggctcc
actttgcccc agcttcaagc tggaaaggag gagaacatga 360aacattgctt
gaagacaatg gccgagacag caggtcccac cctgcacagc caccagcatc
420tctcccctca gccctgtctc ctcttctgca gttgggatct gcacatttaa
gcctgaaatt 480gtcctgtgaa gtgaagtatg atcggacagc ctcttttcag
cttttatgac aatggagaca 540gaggaattgt ggctcttgcc aaggtcaca
569273779DNAHomo sapiens 27gtgtcgtggc tgctggggag aaagaggctt
ccggacatgc tctggagtca gaagacagcg 60aaaagagaag cagaagcccc ggtggcaaga
gtctgaagca ggaaggatga ctgtagcctg 120tggattgtac tgcagtagga
aactgtccta gcaaggctcc actttgcccc agcttcaagc 180tggaaaggag
gagaacatga aacattgctt gaagacaatg gccgagacag caggtcccac
240cctgcacagc caccagcatc tctcccctca gccctgtctc ctcttctgca
gttgggatct 300gcacatttaa gcctgaaatt gtcctgtgaa gtgaagtatg
atcggacagc ctcttttcag 360cttttatgac aatggagaca gaggaattgt
ggctcttgcc aaggtcacag gattggaata 420cagagccaag ccaccccagg
acatgcaaga gcctcagaag ggaaaaaagc ccagcaggaa 480gggagaacaa
gtagcctctg tcctgaagtt gtaacagcca ggggccagga tggaggagga
540ggaccccata atctgcccat ctgggacttg gcaggggacc tgggaaaatg
taccccaacc 600catcccttaa gggcctttgt ctttggccca ttggcctagc
atctacttct tcaccgtgtc 660tgttcttgtc acacctagtc aggtctgttt
gggtctgagg tgcatggaac attctgggta 720ggcctccagc aaacggaagc
tcttcaccgt gtttccagcc tgggaccaag ggcagcatac 780tggcaaagtt
gccaaagcaa gggactccag cctcttagga gttaatgact ccctctcccc
840agctgtcctc cccttggtgc tcctcttcct ccctcctcct gctcacagca
ggcagggcct 900agacccggga gccatgctgc tgtgctgttg ccaggggagc
acggaggcag atctgagcta 960tgcagggaaa aggcccagcc tgtcaaagtg
tctgagatga accgccgccg tccctgtgca 1020gctgggctca gacgtgtctc
agctcttgtt ctgtgcctga gaatggcgaa acccagtgag 1080gttcaagggc
aaactcgcta ttcattagtc aggggttctt gacgtcccgt ctctcccagg
1140gatgagttcc cccctcctct ttctccccct cctatgacac attcctgggt
gcctttggtg 1200aggactgcac accctcctcc tgcctagccc cctctccaaa
ggcccctgaa taaactcccc 1260ccaaggagac caggcagggc agagacaatg
gctgcaggaa atcattcagg cgggacatgc 1320tggcctgccc tccacccagt
ccccctgtgg gccccactcc cttctgattc agggcaccct 1380tgggccccca
gcctatacag gcctggacag gaagaaacca ctgggaacca ccctaaggac
1440aacatgctag tccagtgcca ttcttcgctg gctctgtggg tgcctttgtg
gcctgtaccg 1500actggctggc taattttgtg gtttctgtac catcacatgc
ctattttaag acactctcca 1560gcactgtcgg ttagggagtg taaattttgc
aatattttct gaaatgtggc aatatcaaaa 1620tgtaaaaggc acacatactt
ggtcacaaac aaatggcact atttactctg tgggcatatt 1680tgtaaaagtt
gccaaagaat tatatacaag gatgttcatc agagcatttc ttttgaagag
1740taaagaaatg gacatgaacc tgtggtccgt tcatacggtg gaatacctat
gcagctgtaa 1800aaatcagtgt ggtagatctc cgtatatgag ttgatgtgga
aggttggcca gttcacatga 1860taaggtgaat agaataagtt acagaacagg
ctgtagagta tgatcttatt tgtagatgtt 1920taaaactgag tcataagtat
gcttatatac agatcgtttc tggaagtatg tactggaagt 1980ctacctctgg
ggagtgggga tgggggagtg cactcttcta tactgttata ttttcttttc
2040atgctcctaa ggtactttta ttggaagatg taaagcggtt caatgtaata
ggcttaactt 2100ctgtcaacta agttggcgtg ggtgctttaa gagggtggta
gtgatgttgc tggagaaagt 2160atcccacagt cactggtggc ttcagccacg
ggccattttg gggcctaata atcacatatc 2220atcatggttg ctagtgttaa
tcgaaaacct actaagtgcc aggcttactg tctctgggtc 2280ttgcttacgt
ggatgtcatt tttccagttg caccaaatcg aaagaggtta attggtttgt
2340tggagttcct ttgtaggtga agggcagagc caggagcttg gctagggaca
ggggaggtga 2400gtgggggatg gtggataggt cttggctccc agtttccttc
tgggcagaca ttgcccctct 2460gccctgagga cctgcttgtt tgggggaaga
ggcctttaga ggcaccaggg tcatgccagg 2520tgttggacat ggtgaactgg
gaagtgctcc catctggcca cagcgcagaa gtatcaccgt 2580gctgggggat
ggggaacagg gctgtgaatg ggcctatttg cataagcagc atgtgtctgg
2640agagaaagac atcacagagc agaagagtgc gggtgcccag gagtgcactt
gccaccccta 2700cttcatccct gaaagagtaa atggcctgga aggtgtctct
gagaggtaat gccgcacacc 2760accctccctg ggggcagggt caggctacac
ctgccttagg tcgggggctg cagcagcctg 2820agagctctca gtagggcctc
agtagcctgg gagggagcag gggcaggggg cagggaaaga 2880ggcgtaatgg
ggctgtccag aggggcctgg gaaacctggt ccctgaggcc tgggcacagc
2940tacaatcact tcaaattggc tgtggggcca gtggactggg aaggaaaaaa
gcaataagag 3000tgaccaagtg cagaaggctg tcaggtccca ggtcacatgc
cttagtgcag tgactcctca 3060tcattttatg gggtgtgggt gtcgttggta
cacccatttt acagatgagg acaccgaggc 3120ccagaaaagt taagttacat
gtcctaagtc acacagcttg taagtgccag aactgagatc 3180aaaaccaagt
ctctttgact ttaaagtctg tactctgacc ccaaagagat cctgtttggc
3240cacttatagg aggtccctaa agctgcagac tccccttgcc ggcacccaca
tatagagaca 3300ttaacccttc ccctgcaggg tcacctcaaa tagtctttta
gctgggcttc tcctgcaatt 3360ccacctaatg ccatcccctg ggttttgccc
aaacctgaac tgggcagtgg ggtgagagga 3420ggggtttaca gggttacaga
gcctcataca gataggagcc catggctgct ggtcatctgc 3480attcctgcag
gattggctgt tccttggggt ccttggcagg aaaatgagga ttgctccgag
3540gcctgctcca gtacttccca gaggctggcc tggtgtgggg ctctgggaag
gctgaggctg 3600gagaagcgta agtaggaggg cagagatggc actcaggtag
cttgaatcac caggaccctt 3660ccaagcccca caggttctga gggagtacta
gggccagctc tgggagaggt ctcttcctat 3720gctgtgaacc ccctgccttt
cttgcagcct acaacgaata aattttcttt gcaaaggct 3779283670DNAHomo
sapiens 28cttccggaca tgctctggag tcagaagaca gcgaaaagag
aagcagaagc cccggtggca 60agagtctgaa gctggaaagg aggagaacat gaaacattgc
ttgaagacaa tggccgagac 120agcaggtccc accctgcaca gccaccagca
tctctcccct cagccctgtc tcctcttctg 180cagttgggat ctgcacattt
aagcctgaaa ttgtcctgtg aagtgaagta tgatcggaca 240gcctcttttc
agcttttatg acaatggaga cagaggaatt gtggctcttg ccaaggtcac
300aggattggaa tacagagcca agccacccca ggacatgcaa gagcctcaga
agggaaaaaa 360gcccagcagg aagggagaac aagtagcctc tgtcctgaag
ttgtaacagc caggggccag 420gatggaggag gaggacccca taatctgccc
atctgggact tggcagggga cctgggaaaa 480tgtaccccaa cccatccctt
aagggccttt gtctttggcc cattggccta gcatctactt 540cttcaccgtg
tctgttcttg tcacacctag tcaggtctgt ttgggtctga ggtgcatgga
600acattctggg taggcctcca gcaaacggaa gctcttcacc gtgtttccag
cctgggacca 660agggcagcat actggcaaag ttgccaaagc aagggactcc
agcctcttag gagttaatga 720ctccctctcc ccagctgtcc tccccttggt
gctcctcttc ctccctcctc ctgctcacag 780caggcagggc ctagacccgg
gagccatgct gctgtgctgt tgccagggga gcacggaggc 840agatctgagc
tatgcaggga aaaggcccag cctgtcaaag tgtctgagat gaaccgccgc
900cgtccctgtg cagctgggct cagacgtgtc tcagctcttg ttctgtgcct
gagaatggcg 960aaacccagtg aggttcaagg gcaaactcgc tattcattag
tcaggggttc ttgacgtccc 1020gtctctccca gggatgagtt cccccctcct
ctttctcccc ctcctatgac acattcctgg 1080gtgcctttgg tgaggactgc
acaccctcct cctgcctagc cccctctcca aaggcccctg 1140aataaactcc
ccccaaggag accaggcagg gcagagacaa tggctgcagg aaatcattca
1200ggcgggacat gctggcctgc cctccaccca gtccccctgt gggccccact
cccttctgat 1260tcagggcacc cttgggcccc cagcctatac aggcctggac
aggaagaaac cactgggaac 1320caccctaagg acaacatgct agtccagtgc
cattcttcgc tggctctgtg ggtgcctttg 1380tggcctgtac cgactggctg
gctaattttg tggtttctgt accatcacat gcctatttta 1440agacactctc
cagcactgtc ggttagggag tgtaaatttt gcaatatttt ctgaaatgtg
1500gcaatatcaa aatgtaaaag gcacacatac ttggtcacaa acaaatggca
ctatttactc 1560tgtgggcata tttgtaaaag ttgccaaaga attatataca
aggatgttca tcagagcatt 1620tcttttgaag agtaaagaaa tggacatgaa
cctgtggtcc gttcatacgg tggaatacct 1680atgcagctgt aaaaatcagt
gtggtagatc tccgtatatg agttgatgtg gaaggttggc 1740cagttcacat
gataaggtga atagaataag ttacagaaca ggctgtagag tatgatctta
1800tttgtagatg tttaaaactg agtcataagt atgcttatat acagatcgtt
tctggaagta 1860tgtactggaa gtctacctct ggggagtggg gatgggggag
tgcactcttc tatactgtta 1920tattttcttt tcatgctcct aaggtacttt
tattggaaga tgtaaagcgg ttcaatgtaa 1980taggcttaac ttctgtcaac
taagttggcg tgggtgcttt aagagggtgg tagtgatgtt 2040gctggagaaa
gtatcccaca gtcactggtg gcttcagcca cgggccattt tggggcctaa
2100taatcacata tcatcatggt tgctagtgtt aatcgaaaac ctactaagtg
ccaggcttac 2160tgtctctggg tcttgcttac gtggatgtca tttttccagt
tgcaccaaat cgaaagaggt 2220taattggttt gttggagttc ctttgtaggt
gaagggcaga gccaggagct tggctaggga 2280caggggaggt gagtggggga
tggtggatag gtcttggctc ccagtttcct tctgggcaga 2340cattgcccct
ctgccctgag gacctgcttg tttgggggaa gaggccttta gaggcaccag
2400ggtcatgcca ggtgttggac atggtgaact gggaagtgct cccatctggc
cacagcgcag 2460aagtatcacc gtgctggggg atggggaaca gggctgtgaa
tgggcctatt tgcataagca 2520gcatgtgtct ggagagaaag acatcacaga
gcagaagagt gcgggtgccc aggagtgcac 2580ttgccacccc tacttcatcc
ctgaaagagt aaatggcctg gaaggtgtct ctgagaggta 2640atgccgcaca
ccaccctccc tgggggcagg gtcaggctac acctgcctta ggtcgggggc
2700tgcagcagcc tgagagctct cagtagggcc tcagtagcct gggagggagc
aggggcaggg 2760ggcagggaaa gaggcgtaat ggggctgtcc agaggggcct
gggaaacctg gtccctgagg 2820cctgggcaca gctacaatca cttcaaattg
gctgtggggc cagtggactg ggaaggaaaa 2880aagcaataag agtgaccaag
tgcagaaggc tgtcaggtcc caggtcacat gccttagtgc 2940agtgactcct
catcatttta tggggtgtgg gtgtcgttgg tacacccatt ttacagatga
3000ggacaccgag gcccagaaaa gttaagttac atgtcctaag tcacacagct
tgtaagtgcc 3060agaactgaga tcaaaaccaa gtctctttga ctttaaagtc
tgtactctga ccccaaagag 3120atcctgtttg gccacttata ggaggtccct
aaagctgcag actccccttg ccggcaccca 3180catatagaga cattaaccct
tcccctgcag ggtcacctca aatagtcttt tagctgggct 3240tctcctgcaa
ttccacctaa tgccatcccc tgggttttgc ccaaacctga actgggcagt
3300ggggtgagag gaggggttta cagggttaca gagcctcata cagataggag
cccatggctg 3360ctggtcatct gcattcctgc aggattggct gttccttggg
gtccttggca ggaaaatgag 3420gattgctccg aggcctgctc cagtacttcc
cagaggctgg cctggtgtgg ggctctggga 3480aggctgaggc tggagaagcg
taagtaggag ggcagagatg gcactcaggt agcttgaatc 3540accaggaccc
ttccaagccc cacaggttct gagggagtac tagggccagc tctgggagag
3600gtctcttcct atgctgtgaa ccccctgcct ttcttgcagc ctacaacgaa
taaattttct 3660ttgcaaaggc 3670293784DNAHomo sapiens 29gctaacagct
tcaggagaat tcagcctcac cttgacagga catgctctgg agtcagaaga 60cagcgaaaag
agaagcagaa gccccggtgg caagagtctg aagcaggaag gatgactgta
120gcctgtggat tgtactgcag taggaaactg tcctagcaag gctccacttt
gccccagctt 180caagctggaa aggaggagaa catgaaacat tgcttgaaga
caatggccga gacagcaggt 240cccaccctgc acagccacca gcatctctcc
cctcagccct gtctcctctt ctgcagttgg 300gatctgcaca tttaagcctg
aaattgtcct gtgaagtgaa gtatgatcgg acagcctctt 360ttcagctttt
atgacaatgg agacagagga attgtggctc ttgccaaggt cacaggattg
420gaatacagag ccaagccacc ccaggacatg caagagcctc agaagggaaa
aaagcccagc 480aggaagggag aacaagtagc ctctgtcctg aagttgtaac
agccaggggc caggatggag 540gaggaggacc ccataatctg cccatctggg
acttggcagg ggacctggga aaatgtaccc 600caacccatcc cttaagggcc
tttgtctttg gcccattggc ctagcatcta cttcttcacc 660gtgtctgttc
ttgtcacacc tagtcaggtc tgtttgggtc tgaggtgcat ggaacattct
720gggtaggcct ccagcaaacg gaagctcttc accgtgtttc cagcctggga
ccaagggcag 780catactggca aagttgccaa agcaagggac tccagcctct
taggagttaa tgactccctc 840tccccagctg tcctcccctt ggtgctcctc
ttcctccctc ctcctgctca cagcaggcag 900ggcctagacc cgggagccat
gctgctgtgc tgttgccagg ggagcacgga ggcagatctg 960agctatgcag
ggaaaaggcc cagcctgtca aagtgtctga gatgaaccgc cgccgtccct
1020gtgcagctgg gctcagacgt gtctcagctc ttgttctgtg cctgagaatg
gcgaaaccca 1080gtgaggttca agggcaaact cgctattcat tagtcagggg
ttcttgacgt cccgtctctc 1140ccagggatga gttcccccct cctctttctc
cccctcctat gacacattcc tgggtgcctt 1200tggtgaggac tgcacaccct
cctcctgcct agccccctct ccaaaggccc ctgaataaac 1260tccccccaag
gagaccaggc agggcagaga caatggctgc aggaaatcat tcaggcggga
1320catgctggcc tgccctccac ccagtccccc tgtgggcccc actcccttct
gattcagggc 1380acccttgggc ccccagccta tacaggcctg gacaggaaga
aaccactggg aaccacccta 1440aggacaacat gctagtccag tgccattctt
cgctggctct gtgggtgcct ttgtggcctg 1500taccgactgg ctggctaatt
ttgtggtttc tgtaccatca catgcctatt ttaagacact 1560ctccagcact
gtcggttagg gagtgtaaat tttgcaatat tttctgaaat gtggcaatat
1620caaaatgtaa aaggcacaca tacttggtca caaacaaatg gcactattta
ctctgtgggc 1680atatttgtaa aagttgccaa agaattatat acaaggatgt
tcatcagagc atttcttttg 1740aagagtaaag aaatggacat gaacctgtgg
tccgttcata cggtggaata cctatgcagc 1800tgtaaaaatc agtgtggtag
atctccgtat atgagttgat gtggaaggtt ggccagttca 1860catgataagg
tgaatagaat aagttacaga acaggctgta gagtatgatc ttatttgtag
1920atgtttaaaa ctgagtcata agtatgctta tatacagatc gtttctggaa
gtatgtactg 1980gaagtctacc tctggggagt ggggatgggg gagtgcactc
ttctatactg ttatattttc 2040ttttcatgct cctaaggtac ttttattgga
agatgtaaag cggttcaatg taataggctt 2100aacttctgtc aactaagttg
gcgtgggtgc tttaagaggg tggtagtgat gttgctggag 2160aaagtatccc
acagtcactg gtggcttcag ccacgggcca ttttggggcc taataatcac
2220atatcatcat ggttgctagt gttaatcgaa aacctactaa gtgccaggct
tactgtctct 2280gggtcttgct tacgtggatg tcatttttcc agttgcacca
aatcgaaaga ggttaattgg 2340tttgttggag ttcctttgta ggtgaagggc
agagccagga gcttggctag ggacagggga 2400ggtgagtggg ggatggtgga
taggtcttgg ctcccagttt ccttctgggc agacattgcc 2460cctctgccct
gaggacctgc ttgtttgggg gaagaggcct ttagaggcac cagggtcatg
2520ccaggtgttg gacatggtga actgggaagt gctcccatct ggccacagcg
cagaagtatc 2580accgtgctgg gggatgggga acagggctgt gaatgggcct
atttgcataa gcagcatgtg 2640tctggagaga aagacatcac agagcagaag
agtgcgggtg cccaggagtg cacttgccac 2700ccctacttca tccctgaaag
agtaaatggc ctggaaggtg tctctgagag gtaatgccgc 2760acaccaccct
ccctgggggc agggtcaggc tacacctgcc ttaggtcggg ggctgcagca
2820gcctgagagc tctcagtagg gcctcagtag cctgggaggg agcaggggca
gggggcaggg 2880aaagaggcgt aatggggctg tccagagggg cctgggaaac
ctggtccctg aggcctgggc 2940acagctacaa tcacttcaaa ttggctgtgg
ggccagtgga ctgggaagga aaaaagcaat 3000aagagtgacc aagtgcagaa
ggctgtcagg tcccaggtca catgccttag tgcagtgact 3060cctcatcatt
ttatggggtg tgggtgtcgt tggtacaccc attttacaga tgaggacacc
3120gaggcccaga aaagttaagt tacatgtcct aagtcacaca gcttgtaagt
gccagaactg 3180agatcaaaac caagtctctt tgactttaaa gtctgtactc
tgaccccaaa gagatcctgt 3240ttggccactt ataggaggtc cctaaagctg
cagactcccc ttgccggcac ccacatatag 3300agacattaac ccttcccctg
cagggtcacc tcaaatagtc ttttagctgg gcttctcctg 3360caattccacc
taatgccatc ccctgggttt tgcccaaacc tgaactgggc agtggggtga
3420gaggaggggt ttacagggtt acagagcctc atacagatag gagcccatgg
ctgctggtca 3480tctgcattcc tgcaggattg gctgttcctt ggggtccttg
gcaggaaaat gaggattgct 3540ccgaggcctg ctccagtact tcccagaggc
tggcctggtg tggggctctg ggaaggctga 3600ggctggagaa gcgtaagtag
gagggcagag atggcactca ggtagcttga atcaccagga 3660cccttccaag
ccccacaggt tctgagggag tactagggcc agctctggga gaggtctctt
3720cctatgctgt gaaccccctg cctttcttgc agcctacaac gaataaattt
tctttgcaaa 3780ggct 378430779DNAHomo sapiens 30gcacgacttg
ttcttgcctt ctaaagcaga gaggagcttt tgtgggtagt tcctacaggg 60atacatggta
gaaaattcac caaacccagt gctggagtgt ttctcttcct cagaagaaat
120cagatgctgt tcagagcacg aaggctagaa ttttaccctg gttctcatgc
taccttgcac 180ccaggttgga tcctgagtac agtttttggc aggaagcccc
agagagattg gtgagggtga 240tttcccagga agacgcagtg tgctctgact
tctgtgacag tgagcaacgg gaccagtgga 300tgtccagatg ctggcaatga
gacatgctct ggagtcagaa gacagcgaaa agagaagcag 360aagccccggt
ggcaagagtc tgaaggaagg atgactgtag cctgtggatt gtactgcagt
420aggaaactgt cctagcaagg ctccactttg ccccagcttc aaggtatatc
gtctcaaaat 480gcaggggact tcagatgagt tttgagcacc ctttctttta
ttataaaaaa aattccagac 540agttcagcca atactgacta agggctgaga
ccagttccat gcttttctgt ctccagagga 600atttgcttcc atctggatgc
ctgaaacgct ggaaaggagg agaacatgaa acattgcttg 660aagacaatgg
ccgagacagc aggtcccacc ctgcacagcc accagcatct ctcccctcag
720ccctgtctcc tcttctgcag ttgggatctg cacatttaag cctgaaattg tcctgtgaa
779311611DNAHomo sapiens 31ggagaaacac acacgggcgg gcggagggga
cccggggcga gtcatcaagg gcgcgtggtt 60cggcgtgcca ggcgcgctgc tctgcctgct
ctcttggctt ctgtctccct tcgaccgatc 120gccccctatc ctgaagcttt
ccaatgtcat cttggagccc caaagtttcc tggggcctcc 180gcgttgtgcg
tcccagaacc ccttgcctgc ccctgaggga aacgcggagc cataggcagc
240gggacgtcgg gagccagccc aggggaggcc agattcagca tttggacagc
ggctctgggg 300cgcagtcggc ccagcgagtt tgccggtgaa cagcctcggg
cacatggcgg gtaggagggc 360cgcagggctg ctctgggtct tgaagaagca
ggacccagcc tagagggcat ccccagctcc 420gaatgggaca cgttttcccg
agataaaaga tcccttctga gctcacacgg gagccccggg 480accatccaat
ccagcgtgga tatccccagc ctaaccaaca cctgtgctgg ggggaaagat
540aagacgcccc ctttcagcca ggaggtggac gaccctcatg ccctcagctc
tccattcttc 600ccaaagcagc tcggatccct aagtctggag ctgccagcga
ggcttccaac ccgctgcttg 660ccatcacctc ccaggtcgtt ggtggctccg
attactcccc tgctggtgcc tccctccttg 720gcgcgcttcc cacctgcgat
cggcgccctc ttcgcagtca cgaactcgcc agcagctagc 780agcactgact
agtaggaggg cccgccggag gagagccgcg cggcccacag aagcggaacg
840cgcgtcgaga gcgccctgtc cgctcgcccc agacagatgc ccggttattc
attaccgcga 900ggcctagagg aaagagtggc tgccgtcttc ctgcccacag
cccgccggac cctccgtcgc 960ggctgcccgg tccccggagc cgcagccgcc
gagcccggct gtgcgtgtcg tggctgctgg 1020ggagaaagag gcttccggac
atgctctgga gtcagaagac agcgaaaaga gaagcagaag 1080ccccggtggc
aagagtctga aggaaggatg actgtagcct gtggattgta ctgcagtagg
1140aaactgtcct agcaaggctc cactttgccc cagcttcaag ctggaaagga
ggagaacatg 1200aaacattgct tgaagacaat ggccgagaca gcaggtccca
ccctgcacag ccaccagcat 1260ctctcccctc agccctgtct cctcttctgc
agttgggatc tgcacattta agcctgaaat 1320tgtcctgtga agtgaagtat
gatcggacag cctcttttca gcttttatga caatggagac 1380agaggaattg
tggctcttgc caaggtcaca ggattggaat acagagccaa gccaccccag
1440gacatgcaag agcctcagaa gggaaaaaag cccagcagga agggagaaca
agtagcctct 1500gtcctgaagt tgtaacagcc aggggccagg atggaggagg
aggaccccat aatctgccca 1560tctgggactt ggcaggggac ctgggaaaat
gtaccccaac ccatccctta a 1611322718DNAHomo sapiens 32atcaagcgat
cctcccacct gggcctccca aagtgttgag attacagcat gagccaccac 60acccagacta
aaaggcagtt tgattttaca aatcaaaata gcagtaatct atggagattt
120acttgtgaga ttggtaggaa acatcttaaa tgtaatcaaa caataactta
catcttgatg 180aattcacgtg taggtttctc ttcctcagaa gaaatcagat
gctgttcaga gcacgaaggc 240tagaatttta ccctggttct catgctacct
tgcacccagg ttggatcctg agtacagttt 300ttggcaggtg ggcctgcata
taagttagca atgggggata cccagctgcc tctcttcata 360cagctgaggt
tttggggagt cattcttata gcccctgggt tgggcctagt cctgcaaatg
420aattcaccag ccctaaagcc caaattgcag cctctgtcat tcaccttcca
ggagtggaaa 480gggcagtaag tttcatctta ttattattgc tattttggtg
gttttgttga ggttggtgtg 540tgtatgttag taagataaag ctctcagaaa
ttacatagca tttgtcaagg atataagagg 600gactgtgcca catctggctg
tatagaaggt ggttccatat ctttaaatag agccccaggt 660ccttagccac
cagaaaggtt ttcaggggaa gtgtgcaccc tcagcagctg ctgctggtgg
720gcaggatggg cacgcatgga acaggctttc ctctgtggcc aggtgagaag
caggtggtga 780gacacagagc agtgctgggc tctgcttctg aagcctccaa
cctttccttc cctaggaagc 840cccagagaga ttggtgaggg tgatttccca
ggaagacgca gtgtgctctg acttctgtga 900cagtgagcaa cgggaccagt
ggatgtccag atgctggcaa tgagtaggcc ttccctacgc 960tgggtggcgt
ccacaccctc cggcttccat tgcctgggtc tcctggaggt ggtttgctgg
1020atgaataccg catgcacaga ggctggcctt gggtttgaat atggcagcca
gtggacagca 1080tgtgcttcag ttatgagact gcccaggaga tgcttcttcc
aaggcagagc acgtgcagag 1140tccagtgctg gagaggccgg gtgcgcagtt
gacccatttc cagttctgtt ttccctctca 1200tgttcctctg tccccatcta
ggacatgctc tggagtcaga agacagcgaa aagagaagca 1260gaagccccgg
tggcaagagt ctgaagcagg aaggatgact gtagcctgtg gattgtactg
1320cagtaggaaa ctgtcctagc aaggctccac tttgccccag cttcaagctg
gaaaggagga 1380gaacatgaaa cattgcttga agacaatggc cgagacagca
ggtcccaccc tgcacagcca 1440ccagcatctc tcccctcagc cctgtctcct
cttctgcagt tgggatctgc acatttaagc 1500ctgaaattgt cctgtgaagt
gaagtatgat cggacagcct cttttcagct tttatgacaa 1560tggagacaga
ggaattgtgg ctcttgccaa ggtcacagga ttggaataca gagccaagcc
1620accccaggac atgcaagagc ctcagaaggg aaaaaagccc agcaggaagg
gagaacaagt 1680agcctctgtc ctgaagttgt aacagccagg ggccaggatg
gaggaggagg accccataat 1740ctgcccatct gggacttggc aggggacctg
ggaaaatgta ccccaaccca tcccttaagg 1800gcctttgtct ttggcccatt
ggcctagcat ctacttcttc accgtgtctg ttcttgtcac 1860acctagtcag
gtctgtttgg gtctgaggtg catggaacat tctgggtagg cctccagcaa
1920acggaagctc ttcaccgtgt ttccagcctg ggaccaaggg cagcatactg
gcaaagttgc 1980caaagcaagg gactccagcc tcttaggagt taatgactcc
ctctccccag ctgtcctccc 2040cttggtgctc ctcttcctcc ctcctcctgc
tcacagcagg cagggcctag acccgggagc 2100catgctgctg tgctgttgcc
aggggagcac ggaggcagat ctgagctatg cagggaaaag 2160gcccagcctg
tcaaagtgtc tgagatgaac cgccgccgtc cctgtgcagc tgggctcaga
2220cgtgtctcag ctcttgttct gtgcctgaga atggcgaaac ccagtgaggt
tcaagggcaa 2280actcgctatt cattagtcag gggttcttga cgtcccgtct
ctcccaggga tgagttcccc 2340cctcctcttt ctccccctcc tatgacacat
tcctgggtgc ctttggtgag gactgcacac 2400cctcctcctg cctagccccc
tctccaaagg cccctgaata aactcccccc aaggagacca 2460ggcagggcag
agacaatggc tgcaggaaat cattcaggcg ggacatgctg gcctgccctc
2520cacccagtcc ccctgtgggc cccactccct tctgattcag ggcacccttg
ggcccccagc 2580ctatacaggc ctggacagga agaaaccact gggaaccacc
ctaaggacaa catgctagtc 2640cagtgccatt cttcgctggc tctgtgggtg
cctttgtggc ctgtaccgac tggctggcta 2700attttgtggt ttctgtac
2718333723DNAHomo sapiens 33gagagattgg tgagggtgat ttcccaggaa
gacgcagtgt gctctgactt ctgtgacaga 60catgctctgg agtcagaaga cagcgaaaag
agaagcagaa gccccggtgg caagagtctg 120aagctggaaa ggaggagaac
atgaaacatt gcttgaagac aatggccgag acagcaggtc 180ccaccctgca
cagccaccag catctctccc ctcagccctg tctcctcttc tgcagttggg
240atctgcacat ttaagcctga aattgtcctg tgaagtgaag tatgatcgga
cagcctcttt 300tcagctttta tgacaatgga gacagaggaa ttgtggctct
tgccaaggtc acaggattgg 360aatacagagc caagccaccc caggacatgc
aagagcctca gaagggaaaa aagcccagca 420ggaagggaga acaagtagcc
tctgtcctga agttgtaaca gccaggggcc aggatggagg 480aggaggaccc
cataatctgc ccatctggga cttggcaggg gacctgggaa aatgtacccc
540aacccatccc ttaagggcct ttgtctttgg cccattggcc tagcatctac
ttcttcaccg 600tgtctgttct tgtcacacct agtcaggtct gtttgggtct
gaggtgcatg gaacattctg 660ggtaggcctc cagcaaacgg aagctcttca
ccgtgtttcc agcctgggac caagggcagc 720atactggcaa agttgccaaa
gcaagggact ccagcctctt aggagttaat gactccctct 780ccccagctgt
cctccccttg gtgctcctct tcctccctcc tcctgctcac agcaggcagg
840gcctagaccc gggagccatg ctgctgtgct gttgccaggg gagcacggag
gcagatctga 900gctatgcagg gaaaaggccc agcctgtcaa agtgtctgag
atgaaccgcc gccgtccctg 960tgcagctggg ctcagacgtg tctcagctct
tgttctgtgc ctgagaatgg cgaaacccag 1020tgaggttcaa gggcaaactc
gctattcatt agtcaggggt tcttgacgtc ccgtctctcc 1080cagggatgag
ttcccccctc ctctttctcc ccctcctatg acacattcct gggtgccttt
1140ggtgaggact gcacaccctc ctcctgccta gccccctctc caaaggcccc
tgaataaact 1200ccccccaagg agaccaggca gggcagagac aatggctgca
ggaaatcatt caggcgggac 1260atgctggcct gccctccacc cagtccccct
gtgggcccca ctcccttctg attcagggca 1320cccttgggcc cccagcctat
acaggcctgg acaggaagaa accactggga accaccctaa 1380ggacaacatg
ctagtccagt gccattcttc gctggctctg tgggtgcctt tgtggcctgt
1440accgactggc tggctaattt tgtggtttct gtaccatcac atgcctattt
taagacactc 1500tccagcactg tcggttaggg agtgtaaatt ttgcaatatt
ttctgaaatg tggcaatatc 1560aaaatgtaaa aggcacacat acttggtcac
aaacaaatgg cactatttac tctgtgggca 1620tatttgtaaa agttgccaaa
gaattatata caaggatgtt catcagagca tttcttttga 1680agagtaaaga
aatggacatg aacctgtggt ccgttcatac ggtggaatac ctatgcagct
1740gtaaaaatca gtgtggtaga tctccgtata tgagttgatg tggaaggttg
gccagttcac 1800atgataaggt gaatagaata agttacagaa caggctgtag
agtatgatct tatttgtaga 1860tgtttaaaac tgagtcataa gtatgcttat
atacagatcg tttctggaag tatgtactgg 1920aagtctacct ctggggagtg
gggatggggg agtgcactct tctatactgt tatattttct 1980tttcatgctc
ctaaggtact tttattggaa gatgtaaagc ggttcaatgt aataggctta
2040acttctgtca actaagttgg cgtgggtgct ttaagagggt ggtagtgatg
ttgctggaga 2100aagtatccca cagtcactgg tggcttcagc cacgggccat
tttggggcct aataatcaca 2160tatcatcatg gttgctagtg ttaatcgaaa
acctactaag tgccaggctt actgtctctg 2220ggtcttgctt acgtggatgt
catttttcca gttgcaccaa atcgaaagag gttaattggt
2280ttgttggagt tcctttgtag gtgaagggca gagccaggag cttggctagg
gacaggggag 2340gtgagtgggg gatggtggat aggtcttggc tcccagtttc
cttctgggca gacattgccc 2400ctctgccctg aggacctgct tgtttggggg
aagaggcctt tagaggcacc agggtcatgc 2460caggtgttgg acatggtgaa
ctgggaagtg ctcccatctg gccacagcgc agaagtatca 2520ccgtgctggg
ggatggggaa cagggctgtg aatgggccta tttgcataag cagcatgtgt
2580ctggagagaa agacatcaca gagcagaaga gtgcgggtgc ccaggagtgc
acttgccacc 2640cctacttcat ccctgaaaga gtaaatggcc tggaaggtgt
ctctgagagg taatgccgca 2700caccaccctc cctgggggca gggtcaggct
acacctgcct taggtcgggg gctgcagcag 2760cctgagagct ctcagtaggg
cctcagtagc ctgggaggga gcaggggcag ggggcaggga 2820aagaggcgta
atggggctgt ccagaggggc ctgggaaacc tggtccctga ggcctgggca
2880cagctacaat cacttcaaat tggctgtggg gccagtggac tgggaaggaa
aaaagcaata 2940agagtgacca agtgcagaag gctgtcaggt cccaggtcac
atgccttagt gcagtgactc 3000ctcatcattt tatggggtgt gggtgtcgtt
ggtacaccca ttttacagat gaggacaccg 3060aggcccagaa aagttaagtt
acatgtccta agtcacacag cttgtaagtg ccagaactga 3120gatcaaaacc
aagtctcttt gactttaaag tctgtactct gaccccaaag agatcctgtt
3180tggccactta taggaggtcc ctaaagctgc agactcccct tgccggcacc
cacatataga 3240gacattaacc cttcccctgc agggtcacct caaatagtct
tttagctggg cttctcctgc 3300aattccacct aatgccatcc cctgggtttt
gcccaaacct gaactgggca gtggggtgag 3360aggaggggtt tacagggtta
cagagcctca tacagatagg agcccatggc tgctggtcat 3420ctgcattcct
gcaggattgg ctgttccttg gggtccttgg caggaaaatg aggattgctc
3480cgaggcctgc tccagtactt cccagaggct ggcctggtgt ggggctctgg
gaaggctgag 3540gctggagaag cgtaagtagg agggcagaga tggcactcag
gtagcttgaa tcaccaggac 3600ccttccaagc cccacaggtt ctgagggagt
actagggcca gctctgggag aggtctcttc 3660ctatgctgtg aaccccctgc
ctttcttgca gcctacaacg aataaatttt ctttgcaaag 3720gct
3723344778RNAHomo sapiens 34ggagaaacac acacgggcgg gcggagggga
cccggggcga gucaucaagg gcgcgugguu 60cggcgugcca ggcgcgcugc ucugccugcu
cucuuggcuu cugucucccu ucgaccgauc 120gcccccuauc cugaagcuuu
ccaaugucau cuuggagccc caaaguuucc uggggccucc 180gcguugugcg
ucccagaacc ccuugccugc cccugaggga aacgcggagc cauaggcagc
240gggacgucgg gagccagccc aggggaggcc agauucagca uuuggacagc
ggcucugggg 300cgcagucggc ccagcgaguu ugccggugaa cagccucggg
cacauggcgg guaggagggc 360cgcagggcug cucugggucu ugaagaagca
ggacccagcc uagagggcau ccccagcucc 420gaaugggaca cguuuucccg
agauaaaaga ucccuucuga gcucacacgg gagccccggg 480accauccaau
ccagcgugga uauccccagc cuaaccaaca ccugugcugg ggggaaagau
540aagacgcccc cuuucagcca ggagguggac gacccucaug cccucagcuc
uccauucuuc 600ccaaagcagc ucggaucccu aagucuggag cugccagcga
ggcuuccaac ccgcugcuug 660ccaucaccuc ccaggucguu gguggcuccg
auuacucccc ugcuggugcc ucccuccuug 720gcgcgcuucc caccugcgau
cggcgcccuc uucgcaguca cgaacucgcc agcagcuagc 780agcacugacu
aguaggaggg cccgccggag gagagccgcg cggcccacag aagcggaacg
840cgcgucgaga gcgcccuguc cgcucgcccc agacagaugc ccgguuauuc
auuaccgcga 900ggccuagagg aaagaguggc ugccgucuuc cugcccacag
cccgccggac ccuccgucgc 960ggcugcccgg uccccggagc cgcagccgcc
gagcccggcu gugcgugucg uggcugcugg 1020ggagaaagag gcuuccggac
augcucugga gucagaagac agcgaaaaga gaagcagaag 1080ccccgguggc
aagagucuga aggaaggaug acuguagccu guggauugua cugcaguagg
1140aaacuguccu agcaaggcuc cacuuugccc cagcuucaag cuggaaagga
ggagaacaug 1200aaacauugcu ugaagacaau ggccgagaca gcagguccca
cccugcacag ccaccagcau 1260cucuccccuc agcccugucu ccucuucugc
aguugggauc ugcacauuua agccugaaau 1320uguccuguga agugaaguau
gaucggacag ccucuuuuca gcuuuuauga caauggagac 1380agaggaauug
uggcucuugc caaggucaca ggauuggaau acagagccaa gccaccccag
1440gacaugcaag agccucagaa gggaaaaaag cccagcagga agggagaaca
aguagccucu 1500guccugaagu uguaacagcc aggggccagg auggaggagg
aggaccccau aaucugccca 1560ucugggacuu ggcaggggac cugggaaaau
guaccccaac ccaucccuua agggccuuug 1620ucuuuggccc auuggccuag
caucuacuuc uucaccgugu cuguucuugu cacaccuagu 1680caggucuguu
ugggucugag gugcauggaa cauucugggu aggccuccag caaacggaag
1740cucuucaccg uguuuccagc cugggaccaa gggcagcaua cuggcaaagu
ugccaaagca 1800agggacucca gccucuuagg aguuaaugac ucccucuccc
cagcuguccu ccccuuggug 1860cuccucuucc ucccuccucc ugcucacagc
aggcagggcc uagacccggg agccaugcug 1920cugugcuguu gccaggggag
cacggaggca gaucugagcu augcagggaa aaggcccagc 1980cugucaaagu
gucugagaug aaccgccgcc gucccugugc agcugggcuc agacgugucu
2040cagcucuugu ucugugccug agaauggcga aacccaguga gguucaaggg
caaacucgcu 2100auucauuagu cagggguucu ugacgucccg ucucucccag
ggaugaguuc cccccuccuc 2160uuucuccccc uccuaugaca cauuccuggg
ugccuuuggu gaggacugca cacccuccuc 2220cugccuagcc cccucuccaa
aggccccuga auaaacuccc cccaaggaga ccaggcaggg 2280cagagacaau
ggcugcagga aaucauucag gcgggacaug cuggccugcc cuccacccag
2340ucccccugug ggccccacuc ccuucugauu cagggcaccc uugggccccc
agccuauaca 2400ggccuggaca ggaagaaacc acugggaacc acccuaagga
caacaugcua guccagugcc 2460auucuucgcu ggcucugugg gugccuuugu
ggccuguacc gacuggcugg cuaauuuugu 2520gguuucugua ccaucacaug
ccuauuuuaa gacacucucc agcacugucg guuagggagu 2580guaaauuuug
caauauuuuc ugaaaugugg caauaucaaa auguaaaagg cacacauacu
2640uggucacaaa caaauggcac uauuuacucu gugggcauau uuguaaaagu
ugccaaagaa 2700uuauauacaa ggauguucau cagagcauuu cuuuugaaga
guaaagaaau ggacaugaac 2760cugugguccg uucauacggu ggaauaccua
ugcagcugua aaaaucagug ugguagaucu 2820ccguauauga guugaugugg
aagguuggcc aguucacaug auaaggugaa uagaauaagu 2880uacagaacag
gcuguagagu augaucuuau uuguagaugu uuaaaacuga gucauaagua
2940ugcuuauaua cagaucguuu cuggaaguau guacuggaag ucuaccucug
gggagugggg 3000augggggagu gcacucuucu auacuguuau auuuucuuuu
caugcuccua agguacuuuu 3060auuggaagau guaaagcggu ucaauguaau
aggcuuaacu ucugucaacu aaguuggcgu 3120gggugcuuua agaggguggu
agugauguug cuggagaaag uaucccacag ucacuggugg 3180cuucagccac
gggccauuuu ggggccuaau aaucacauau caucaugguu gcuaguguua
3240aucgaaaacc uacuaagugc caggcuuacu gucucugggu cuugcuuacg
uggaugucau 3300uuuuccaguu gcaccaaauc gaaagagguu aauugguuug
uuggaguucc uuuguaggug 3360aagggcagag ccaggagcuu ggcuagggac
aggggaggug agugggggau gguggauagg 3420ucuuggcucc caguuuccuu
cugggcagac auugccccuc ugcccugagg accugcuugu 3480uugggggaag
aggccuuuag aggcaccagg gucaugccag guguuggaca uggugaacug
3540ggaagugcuc ccaucuggcc acagcgcaga aguaucaccg ugcuggggga
uggggaacag 3600ggcugugaau gggccuauuu gcauaagcag caugugucug
gagagaaaga caucacagag 3660cagaagagug cgggugccca ggagugcacu
ugccaccccu acuucauccc ugaaagagua 3720aauggccugg aaggugucuc
ugagagguaa ugccgcacac cacccucccu gggggcaggg 3780ucaggcuaca
ccugccuuag gucgggggcu gcagcagccu gagagcucuc aguagggccu
3840caguagccug ggagggagca ggggcagggg gcagggaaag aggcguaaug
gggcugucca 3900gaggggccug ggaaaccugg ucccugaggc cugggcacag
cuacaaucac uucaaauugg 3960cuguggggcc aguggacugg gaaggaaaaa
agcaauaaga gugaccaagu gcagaaggcu 4020gucagguccc aggucacaug
ccuuagugca gugacuccuc aucauuuuau gggguguggg 4080ugucguuggu
acacccauuu uacagaugag gacaccgagg cccagaaaag uuaaguuaca
4140uguccuaagu cacacagcuu guaagugcca gaacugagau caaaaccaag
ucucuuugac 4200uuuaaagucu guacucugac cccaaagaga uccuguuugg
ccacuuauag gaggucccua 4260aagcugcaga cuccccuugc cggcacccac
auauagagac auuaacccuu ccccugcagg 4320gucaccucaa auagucuuuu
agcugggcuu cuccugcaau uccaccuaau gccauccccu 4380ggguuuugcc
caaaccugaa cugggcagug gggugagagg agggguuuac aggguuacag
4440agccucauac agauaggagc ccauggcugc uggucaucug cauuccugca
ggauuggcug 4500uuccuugggg uccuuggcag gaaaaugagg auugcuccga
ggccugcucc aguacuuccc 4560agaggcuggc cugguguggg gcucugggaa
ggcugaggcu ggagaagcgu aaguaggagg 4620gcagagaugg cacucaggua
gcuugaauca ccaggacccu uccaagcccc acagguucug 4680agggaguacu
agggccagcu cugggagagg ucucuuccua ugcugugaac ccccugccuu
4740ucuugcagcc uacaacgaau aaauuuucuu ugcaaagg 4778354113RNAHomo
sapiens 35gauucucaca acuucugcgu gcgagcgccc gccccaccga ccgccccggc
ccggcccgca 60agagccagag gagccgagag gagcccagcg ccggcccagc ggacuccagc
ucgacggagc 120ggccgcgccc cgaccaguua cuccccugcu ggugccuccc
uccuuggcgc gcuucccacc 180ugcgaucggc gcccucuucg cagucacgaa
cucgccagca gcuagcagca cugacuagua 240ggagggcccg ccggaggaga
ggaagcccca gagagauugg ugagggugau uucccaggaa 300gacgcagugu
gcucugacuu cugugacagu gagcaacggg accaguggau guccagaugc
360uggcaaugag acaugcucug gagucagaag acagcgaaaa gagaagcaga
agccccggug 420gcaagagucu gaagcaggaa ggaugacugu agccugugga
uuguacugca guaggaaacu 480guccuagcaa ggcuccacuu ugccccagcu
ucaagcugga aaggaggaga acaugaaaca 540uugcuugaag acaauggccg
agacagcagg ucccacccug cacagccacc agcaucucuc 600cccucagccc
ugucuccucu ucugcaguug ggaucugcac auuuaagccu gaaauugucc
660ugugaaguga aguaugaucg gacagccucu uuucagcuuu uaugacaaug
gagacagagg 720aauuguggcu cuugccaagg ucacaggauu ggaauacaga
gccaagccac cccaggacau 780gcaagagccu cagaagggaa aaaagcccag
caggaaggga gaacaaguag ccucuguccu 840gaaguuguaa cagccagggg
ccaggaugga ggaggaggac cccauaaucu gcccaucugg 900gacuuggcag
gggaccuggg aaaauguacc ccaacccauc ccuuaagggc cuuugucuuu
960ggcccauugg ccuagcaucu acuucuucac cgugucuguu cuugucacac
cuagucaggu 1020cuguuugggu cugaggugca uggaacauuc uggguaggcc
uccagcaaac ggaagcucuu 1080caccguguuu ccagccuggg accaagggca
gcauacuggc aaaguugcca aagcaaggga 1140cuccagccuc uuaggaguua
augacucccu cuccccagcu guccuccccu uggugcuccu 1200cuuccucccu
ccuccugcuc acagcaggca gggccuagac ccgggagcca ugcugcugug
1260cuguugccag gggagcacgg aggcagaucu gagcuaugca gggaaaaggc
ccagccuguc 1320aaagugucug agaugaaccg ccgccguccc ugugcagcug
ggcucagacg ugucucagcu 1380cuuguucugu gccugagaau ggcgaaaccc
agugagguuc aagggcaaac ucgcuauuca 1440uuagucaggg guucuugacg
ucccgucucu cccagggaug aguucccccc uccucuuucu 1500cccccuccua
ugacacauuc cugggugccu uuggugagga cugcacaccc uccuccugcc
1560uagcccccuc uccaaaggcc ccugaauaaa cuccccccaa ggagaccagg
cagggcagag 1620acaauggcug caggaaauca uucaggcggg acaugcuggc
cugcccucca cccagucccc 1680cugugggccc cacucccuuc ugauucaggg
cacccuuggg cccccagccu auacaggccu 1740ggacaggaag aaaccacugg
gaaccacccu aaggacaaca ugcuagucca gugccauucu 1800ucgcuggcuc
ugugggugcc uuuguggccu guaccgacug gcuggcuaau uuugugguuu
1860cuguaccauc acaugccuau uuuaagacac ucuccagcac ugucgguuag
ggaguguaaa 1920uuuugcaaua uuuucugaaa uguggcaaua ucaaaaugua
aaaggcacac auacuugguc 1980acaaacaaau ggcacuauuu acucuguggg
cauauuugua aaaguugcca aagaauuaua 2040uacaaggaug uucaucagag
cauuucuuuu gaagaguaaa gaaauggaca ugaaccugug 2100guccguucau
acgguggaau accuaugcag cuguaaaaau caguguggua gaucuccgua
2160uaugaguuga uguggaaggu uggccaguuc acaugauaag gugaauagaa
uaaguuacag 2220aacaggcugu agaguaugau cuuauuugua gauguuuaaa
acugagucau aaguaugcuu 2280auauacagau cguuucugga aguauguacu
ggaagucuac cucuggggag uggggauggg 2340ggagugcacu cuucuauacu
guuauauuuu cuuuucaugc uccuaaggua cuuuuauugg 2400aagauguaaa
gcgguucaau guaauaggcu uaacuucugu caacuaaguu ggcgugggug
2460cuuuaagagg gugguaguga uguugcugga gaaaguaucc cacagucacu
gguggcuuca 2520gccacgggcc auuuuggggc cuaauaauca cauaucauca
ugguugcuag uguuaaucga 2580aaaccuacua agugccaggc uuacugucuc
ugggucuugc uuacguggau gucauuuuuc 2640caguugcacc aaaucgaaag
agguuaauug guuuguugga guuccuuugu aggugaaggg 2700cagagccagg
agcuuggcua gggacagggg aggugagugg gggauggugg auaggucuug
2760gcucccaguu uccuucuggg cagacauugc cccucugccc ugaggaccug
cuuguuuggg 2820ggaagaggcc uuuagaggca ccagggucau gccagguguu
ggacauggug aacugggaag 2880ugcucccauc uggccacagc gcagaaguau
caccgugcug ggggaugggg aacagggcug 2940ugaaugggcc uauuugcaua
agcagcaugu gucuggagag aaagacauca cagagcagaa 3000gagugcgggu
gcccaggagu gcacuugcca ccccuacuuc aucccugaaa gaguaaaugg
3060ccuggaaggu gucucugaga gguaaugccg cacaccaccc ucccuggggg
cagggucagg 3120cuacaccugc cuuaggucgg gggcugcagc agccugagag
cucucaguag ggccucagua 3180gccugggagg gagcaggggc agggggcagg
gaaagaggcg uaauggggcu guccagaggg 3240gccugggaaa ccuggucccu
gaggccuggg cacagcuaca aucacuucaa auuggcugug 3300gggccagugg
acugggaagg aaaaaagcaa uaagagugac caagugcaga aggcugucag
3360gucccagguc acaugccuua gugcagugac uccucaucau uuuauggggu
gugggugucg 3420uugguacacc cauuuuacag augaggacac cgaggcccag
aaaaguuaag uuacaugucc 3480uaagucacac agcuuguaag ugccagaacu
gagaucaaaa ccaagucucu uugacuuuaa 3540agucuguacu cugaccccaa
agagauccug uuuggccacu uauaggaggu cccuaaagcu 3600gcagacuccc
cuugccggca cccacauaua gagacauuaa cccuuccccu gcagggucac
3660cucaaauagu cuuuuagcug ggcuucuccu gcaauuccac cuaaugccau
ccccuggguu 3720uugcccaaac cugaacuggg caguggggug agaggagggg
uuuacagggu uacagagccu 3780cauacagaua ggagcccaug gcugcugguc
aucugcauuc cugcaggauu ggcuguuccu 3840ugggguccuu ggcaggaaaa
ugaggauugc uccgaggccu gcuccaguac uucccagagg 3900cuggccuggu
guggggcucu gggaaggcug aggcuggaga agcguaagua ggagggcaga
3960gauggcacuc agguagcuug aaucaccagg acccuuccaa gccccacagg
uucugaggga 4020guacuagggc cagcucuggg agaggucucu uccuaugcug
ugaacccccu gccuuucuug 4080cagccuacaa cgaauaaauu uucuuugcaa agg
4113363936RNAHomo sapiens 36ggagccgaga ggagcccagc gccggcccag
cggacuccag cucgacggag cggccgcgcc 60ccgaccaguu acuccccugc uggugccucc
cuccuuggcg cgcuucccac cugcgaucgg 120cgcccucuuc gcagucacga
acucgccagc agcuagcagc acugacuagu aggagggccc 180gccggaggag
aggacaugcu cuggagucag aagacagcga aaagagaagc agaagccccg
240guggcaagag ucugaaggaa ggaugacugu agccugugga uuguacugca
guaggaaacu 300guccuagcaa ggcuccacuu ugccccagcu ucaagcugga
aaggaggaga acaugaaaca 360uugcuugaag acaauggccg agacagcagg
ucccacccug cacagccacc agcaucucuc 420cccucagccc ugucuccucu
ucugcaguug ggaucugcac auuuaagccu gaaauugucc 480ugugaaguga
aguaugaucg gacagccucu uuucagcuuu uaugacaaug gagacagagg
540aauuguggcu cuugccaagg ucacaggauu ggaauacaga gccaagccac
cccaggacau 600gcaagagccu cagaagggaa aaaagcccag caggaaggga
gaacaaguag ccucuguccu 660gaaguuguaa cagccagggg ccaggaugga
ggaggaggac cccauaaucu gcccaucugg 720gacuuggcag gggaccuggg
aaaauguacc ccaacccauc ccuuaagggc cuuugucuuu 780ggcccauugg
ccuagcaucu acuucuucac cgugucuguu cuugucacac cuagucaggu
840cuguuugggu cugaggugca uggaacauuc uggguaggcc uccagcaaac
ggaagcucuu 900caccguguuu ccagccuggg accaagggca gcauacuggc
aaaguugcca aagcaaggga 960cuccagccuc uuaggaguua augacucccu
cuccccagcu guccuccccu uggugcuccu 1020cuuccucccu ccuccugcuc
acagcaggca gggccuagac ccgggagcca ugcugcugug 1080cuguugccag
gggagcacgg aggcagaucu gagcuaugca gggaaaaggc ccagccuguc
1140aaagugucug agaugaaccg ccgccguccc ugugcagcug ggcucagacg
ugucucagcu 1200cuuguucugu gccugagaau ggcgaaaccc agugagguuc
aagggcaaac ucgcuauuca 1260uuagucaggg guucuugacg ucccgucucu
cccagggaug aguucccccc uccucuuucu 1320cccccuccua ugacacauuc
cugggugccu uuggugagga cugcacaccc uccuccugcc 1380uagcccccuc
uccaaaggcc ccugaauaaa cuccccccaa ggagaccagg cagggcagag
1440acaauggcug caggaaauca uucaggcggg acaugcuggc cugcccucca
cccagucccc 1500cugugggccc cacucccuuc ugauucaggg cacccuuggg
cccccagccu auacaggccu 1560ggacaggaag aaaccacugg gaaccacccu
aaggacaaca ugcuagucca gugccauucu 1620ucgcuggcuc ugugggugcc
uuuguggccu guaccgacug gcuggcuaau uuugugguuu 1680cuguaccauc
acaugccuau uuuaagacac ucuccagcac ugucgguuag ggaguguaaa
1740uuuugcaaua uuuucugaaa uguggcaaua ucaaaaugua aaaggcacac
auacuugguc 1800acaaacaaau ggcacuauuu acucuguggg cauauuugua
aaaguugcca aagaauuaua 1860uacaaggaug uucaucagag cauuucuuuu
gaagaguaaa gaaauggaca ugaaccugug 1920guccguucau acgguggaau
accuaugcag cuguaaaaau caguguggua gaucuccgua 1980uaugaguuga
uguggaaggu uggccaguuc acaugauaag gugaauagaa uaaguuacag
2040aacaggcugu agaguaugau cuuauuugua gauguuuaaa acugagucau
aaguaugcuu 2100auauacagau cguuucugga aguauguacu ggaagucuac
cucuggggag uggggauggg 2160ggagugcacu cuucuauacu guuauauuuu
cuuuucaugc uccuaaggua cuuuuauugg 2220aagauguaaa gcgguucaau
guaauaggcu uaacuucugu caacuaaguu ggcgugggug 2280cuuuaagagg
gugguaguga uguugcugga gaaaguaucc cacagucacu gguggcuuca
2340gccacgggcc auuuuggggc cuaauaauca cauaucauca ugguugcuag
uguuaaucga 2400aaaccuacua agugccaggc uuacugucuc ugggucuugc
uuacguggau gucauuuuuc 2460caguugcacc aaaucgaaag agguuaauug
guuuguugga guuccuuugu aggugaaggg 2520cagagccagg agcuuggcua
gggacagggg aggugagugg gggauggugg auaggucuug 2580gcucccaguu
uccuucuggg cagacauugc cccucugccc ugaggaccug cuuguuuggg
2640ggaagaggcc uuuagaggca ccagggucau gccagguguu ggacauggug
aacugggaag 2700ugcucccauc uggccacagc gcagaaguau caccgugcug
ggggaugggg aacagggcug 2760ugaaugggcc uauuugcaua agcagcaugu
gucuggagag aaagacauca cagagcagaa 2820gagugcgggu gcccaggagu
gcacuugcca ccccuacuuc aucccugaaa gaguaaaugg 2880ccuggaaggu
gucucugaga gguaaugccg cacaccaccc ucccuggggg cagggucagg
2940cuacaccugc cuuaggucgg gggcugcagc agccugagag cucucaguag
ggccucagua 3000gccugggagg gagcaggggc agggggcagg gaaagaggcg
uaauggggcu guccagaggg 3060gccugggaaa ccuggucccu gaggccuggg
cacagcuaca aucacuucaa auuggcugug 3120gggccagugg acugggaagg
aaaaaagcaa uaagagugac caagugcaga aggcugucag 3180gucccagguc
acaugccuua gugcagugac uccucaucau uuuauggggu gugggugucg
3240uugguacacc cauuuuacag augaggacac cgaggcccag aaaaguuaag
uuacaugucc 3300uaagucacac agcuuguaag ugccagaacu gagaucaaaa
ccaagucucu uugacuuuaa 3360agucuguacu cugaccccaa agagauccug
uuuggccacu uauaggaggu cccuaaagcu 3420gcagacuccc cuugccggca
cccacauaua gagacauuaa cccuuccccu gcagggucac 3480cucaaauagu
cuuuuagcug ggcuucuccu gcaauuccac cuaaugccau ccccuggguu
3540uugcccaaac cugaacuggg caguggggug agaggagggg uuuacagggu
uacagagccu 3600cauacagaua ggagcccaug gcugcugguc aucugcauuc
cugcaggauu ggcuguuccu 3660ugggguccuu ggcaggaaaa ugaggauugc
uccgaggccu gcuccaguac uucccagagg 3720cuggccuggu guggggcucu
gggaaggcug aggcuggaga agcguaagua ggagggcaga 3780gauggcacuc
agguagcuug aaucaccagg acccuuccaa gccccacagg uucugaggga
3840guacuagggc cagcucuggg agaggucucu uccuaugcug ugaacccccu
gccuuucuug 3900cagccuacaa cgaauaaauu uucuuugcaa aggcuu
3936374026RNAHomo sapiens 37gggggucccg gccccacaca gugcuagggu
cccucucgag uuucucaucu gccuucaggu 60cacuuuccac ccugaugccu uggcuugucc
ugaagcucag ggccccugua gcuugggaaa 120ccucccaagc uccccagcga
guggcuguag accaaggaag ggacccugcc cggcuucagg 180gaagaaagga
agaaaguuac uccccugcug gugccucccu ccuuggcgcg cuucccaccu
240gcgaucggcg cccucuucgc agucacgaac ucgccagcag cuagcagcac
ugacuaguag 300gagggcccgc cggaggagag gacaugcucu ggagucagaa
gacagcgaaa agagaagcag 360aagccccggu ggcaagaguc ugaaggaagg
augacuguag ccuguggauu guacugcagu 420aggaaacugu ccuagcaagg
cuccacuuug ccccagcuuc aagcuggaaa ggaggagaac 480augaaacauu
gcuugaagac aauggccgag acagcagguc ccacccugca cagccaccag
540caucucuccc cucagcccug ucuccucuuc ugcaguuggg
aucugcacau uuaagccuga 600aauuguccug ugaagugaag uaugaucgga
cagccucuuu ucagcuuuua ugacaaugga 660gacagaggaa uuguggcucu
ugccaagguc acaggauugg aauacagagc caagccaccc 720caggacaugc
aagagccuca gaagggaaaa aagcccagca ggaagggaga acaaguagcc
780ucuguccuga aguuguaaca gccaggggcc aggauggagg aggaggaccc
cauaaucugc 840ccaucuggga cuuggcaggg gaccugggaa aauguacccc
aacccauccc uuaagggccu 900uugucuuugg cccauuggcc uagcaucuac
uucuucaccg ugucuguucu ugucacaccu 960agucaggucu guuugggucu
gaggugcaug gaacauucug gguaggccuc cagcaaacgg 1020aagcucuuca
ccguguuucc agccugggac caagggcagc auacuggcaa aguugccaaa
1080gcaagggacu ccagccucuu aggaguuaau gacucccucu ccccagcugu
ccuccccuug 1140gugcuccucu uccucccucc uccugcucac agcaggcagg
gccuagaccc gggagccaug 1200cugcugugcu guugccaggg gagcacggag
gcagaucuga gcuaugcagg gaaaaggccc 1260agccugucaa agugucugag
augaaccgcc gccgucccug ugcagcuggg cucagacgug 1320ucucagcucu
uguucugugc cugagaaugg cgaaacccag ugagguucaa gggcaaacuc
1380gcuauucauu agucaggggu ucuugacguc ccgucucucc cagggaugag
uuccccccuc 1440cucuuucucc cccuccuaug acacauuccu gggugccuuu
ggugaggacu gcacacccuc 1500cuccugccua gcccccucuc caaaggcccc
ugaauaaacu ccccccaagg agaccaggca 1560gggcagagac aauggcugca
ggaaaucauu caggcgggac augcuggccu gcccuccacc 1620cagucccccu
gugggcccca cucccuucug auucagggca cccuugggcc cccagccuau
1680acaggccugg acaggaagaa accacuggga accacccuaa ggacaacaug
cuaguccagu 1740gccauucuuc gcuggcucug ugggugccuu uguggccugu
accgacuggc uggcuaauuu 1800ugugguuucu guaccaucac augccuauuu
uaagacacuc uccagcacug ucgguuaggg 1860aguguaaauu uugcaauauu
uucugaaaug uggcaauauc aaaauguaaa aggcacacau 1920acuuggucac
aaacaaaugg cacuauuuac ucugugggca uauuuguaaa aguugccaaa
1980gaauuauaua caaggauguu caucagagca uuucuuuuga agaguaaaga
aauggacaug 2040aaccuguggu ccguucauac gguggaauac cuaugcagcu
guaaaaauca gugugguaga 2100ucuccguaua ugaguugaug uggaagguug
gccaguucac augauaaggu gaauagaaua 2160aguuacagaa caggcuguag
aguaugaucu uauuuguaga uguuuaaaac ugagucauaa 2220guaugcuuau
auacagaucg uuucuggaag uauguacugg aagucuaccu cuggggagug
2280gggauggggg agugcacucu ucuauacugu uauauuuucu uuucaugcuc
cuaagguacu 2340uuuauuggaa gauguaaagc gguucaaugu aauaggcuua
acuucuguca acuaaguugg 2400cgugggugcu uuaagagggu gguagugaug
uugcuggaga aaguauccca cagucacugg 2460uggcuucagc cacgggccau
uuuggggccu aauaaucaca uaucaucaug guugcuagug 2520uuaaucgaaa
accuacuaag ugccaggcuu acugucucug ggucuugcuu acguggaugu
2580cauuuuucca guugcaccaa aucgaaagag guuaauuggu uuguuggagu
uccuuuguag 2640gugaagggca gagccaggag cuuggcuagg gacaggggag
gugagugggg gaugguggau 2700aggucuuggc ucccaguuuc cuucugggca
gacauugccc cucugcccug aggaccugcu 2760uguuuggggg aagaggccuu
uagaggcacc agggucaugc cagguguugg acauggugaa 2820cugggaagug
cucccaucug gccacagcgc agaaguauca ccgugcuggg ggauggggaa
2880cagggcugug aaugggccua uuugcauaag cagcaugugu cuggagagaa
agacaucaca 2940gagcagaaga gugcgggugc ccaggagugc acuugccacc
ccuacuucau cccugaaaga 3000guaaauggcc uggaaggugu cucugagagg
uaaugccgca caccacccuc ccugggggca 3060gggucaggcu acaccugccu
uaggucgggg gcugcagcag ccugagagcu cucaguaggg 3120ccucaguagc
cugggaggga gcaggggcag ggggcaggga aagaggcgua auggggcugu
3180ccagaggggc cugggaaacc uggucccuga ggccugggca cagcuacaau
cacuucaaau 3240uggcuguggg gccaguggac ugggaaggaa aaaagcaaua
agagugacca agugcagaag 3300gcugucaggu cccaggucac augccuuagu
gcagugacuc cucaucauuu uauggggugu 3360gggugucguu gguacaccca
uuuuacagau gaggacaccg aggcccagaa aaguuaaguu 3420acauguccua
agucacacag cuuguaagug ccagaacuga gaucaaaacc aagucucuuu
3480gacuuuaaag ucuguacucu gaccccaaag agauccuguu uggccacuua
uaggaggucc 3540cuaaagcugc agacuccccu ugccggcacc cacauauaga
gacauuaacc cuuccccugc 3600agggucaccu caaauagucu uuuagcuggg
cuucuccugc aauuccaccu aaugccaucc 3660ccuggguuuu gcccaaaccu
gaacugggca guggggugag aggagggguu uacaggguua 3720cagagccuca
uacagauagg agcccauggc ugcuggucau cugcauuccu gcaggauugg
3780cuguuccuug ggguccuugg caggaaaaug aggauugcuc cgaggccugc
uccaguacuu 3840cccagaggcu ggccuggugu ggggcucugg gaaggcugag
gcuggagaag cguaaguagg 3900agggcagaga uggcacucag guagcuugaa
ucaccaggac ccuuccaagc cccacagguu 3960cugagggagu acuagggcca
gcucugggag aggucucuuc cuaugcugug aacccccugc 4020cuuucu
4026384334RNAHomo sapiens 38ugaggcgcca ccggugccca gcaaccuccc
caggcugugg uugugaccug aggacgcgug 60uguccccgcc cucaggccac cgcuacgcga
cccugagugc accuucaaga aggccgggca 120cguuucuggg cgggcguggg
gggugccuga uaucuccgcu cuauuuuaca guuacucccc 180ugcuggugcc
ucccuccuug gcgcgcuucc caccugcgau cggcgcccuc uucgcaguca
240cgaacucgcc agcagcuagc agcacugacu aguaggaggg cccgccggag
gagaggaagc 300cccagagaga uuggugaggg ugauuuccca ggaagacgca
gugugcucug acuucuguga 360cagugagcaa cgggaccagu ggauguccag
augcuggcaa ugaguaggcc uucccuacgc 420uggguggcgu ccacacccuc
cggcuuccau ugccuggguc uccuggaggu gguuugcugg 480augaauaccg
caugcacaga ggcuggccuu ggguuugaau auggcagcca guggacagca
540ugugcuucag uuaugagacu gcccaggaga ugcuucuucc aaggcagagc
acgugcagag 600uccagugcug gagaggccgg gugcgcaguu gacccauuuc
caguucuguu uucccucuca 660uguuccucug uccccaucua ggacaugcuc
uggagucaga agacagcgaa aagagaagca 720gaagccccgg uggcaagagu
cugaagcugg aaaggaggag aacaugaaac auugcuugaa 780gacaauggcc
gagacagcag gucccacccu gcacagccac cagcaucucu ccccucagcc
840cugucuccuc uucugcaguu gggaucugca cauuuaagcc ugaaauuguc
cugugaagug 900aaguaugauc ggacagccuc uuuucagcuu uuaugacaau
ggagacagag gaauuguggc 960ucuugccaag gucacaggau uggaauacag
agccaagcca ccccaggaca ugcaagagcc 1020ucagaaggga aaaaagccca
gcaggaaggg agaacaagua gccucugucc ugaaguugua 1080acagccaggg
gccaggaugg aggaggagga ccccauaauc ugcccaucug ggacuuggca
1140ggggaccugg gaaaauguac cccaacccau cccuuaaggg ccuuugucuu
uggcccauug 1200gccuagcauc uacuucuuca ccgugucugu ucuugucaca
ccuagucagg ucuguuuggg 1260ucugaggugc auggaacauu cuggguaggc
cuccagcaaa cggaagcucu ucaccguguu 1320uccagccugg gaccaagggc
agcauacugg caaaguugcc aaagcaaggg acuccagccu 1380cuuaggaguu
aaugacuccc ucuccccagc uguccucccc uuggugcucc ucuuccuccc
1440uccuccugcu cacagcaggc agggccuaga cccgggagcc augcugcugu
gcuguugcca 1500ggggagcacg gaggcagauc ugagcuaugc agggaaaagg
cccagccugu caaagugucu 1560gagaugaacc gccgccgucc cugugcagcu
gggcucagac gugucucagc ucuuguucug 1620ugccugagaa uggcgaaacc
cagugagguu caagggcaaa cucgcuauuc auuagucagg 1680gguucuugac
gucccgucuc ucccagggau gaguuccccc cuccucuuuc ucccccuccu
1740augacacauu ccugggugcc uuuggugagg acugcacacc cuccuccugc
cuagcccccu 1800cuccaaaggc cccugaauaa acucccccca aggagaccag
gcagggcaga gacaauggcu 1860gcaggaaauc auucaggcgg gacaugcugg
ccugcccucc acccaguccc ccugugggcc 1920ccacucccuu cugauucagg
gcacccuugg gcccccagcc uauacaggcc uggacaggaa 1980gaaaccacug
ggaaccaccc uaaggacaac augcuagucc agugccauuc uucgcuggcu
2040cugugggugc cuuuguggcc uguaccgacu ggcuggcuaa uuuugugguu
ucuguaccau 2100cacaugccua uuuuaagaca cucuccagca cugucgguua
gggaguguaa auuuugcaau 2160auuuucugaa auguggcaau aucaaaaugu
aaaaggcaca cauacuuggu cacaaacaaa 2220uggcacuauu uacucugugg
gcauauuugu aaaaguugcc aaagaauuau auacaaggau 2280guucaucaga
gcauuucuuu ugaagaguaa agaaauggac augaaccugu gguccguuca
2340uacgguggaa uaccuaugca gcuguaaaaa ucaguguggu agaucuccgu
auaugaguug 2400auguggaagg uuggccaguu cacaugauaa ggugaauaga
auaaguuaca gaacaggcug 2460uagaguauga ucuuauuugu agauguuuaa
aacugaguca uaaguaugcu uauauacaga 2520ucguuucugg aaguauguac
uggaagucua ccucugggga guggggaugg gggagugcac 2580ucuucuauac
uguuauauuu ucuuuucaug cuccuaaggu acuuuuauug gaagauguaa
2640agcgguucaa uguaauaggc uuaacuucug ucaacuaagu uggcgugggu
gcuuuaagag 2700ggugguagug auguugcugg agaaaguauc ccacagucac
ugguggcuuc agccacgggc 2760cauuuugggg ccuaauaauc acauaucauc
augguugcua guguuaaucg aaaaccuacu 2820aagugccagg cuuacugucu
cugggucuug cuuacgugga ugucauuuuu ccaguugcac 2880caaaucgaaa
gagguuaauu gguuuguugg aguuccuuug uaggugaagg gcagagccag
2940gagcuuggcu agggacaggg gaggugagug ggggauggug gauaggucuu
ggcucccagu 3000uuccuucugg gcagacauug ccccucugcc cugaggaccu
gcuuguuugg gggaagaggc 3060cuuuagaggc accaggguca ugccaggugu
uggacauggu gaacugggaa gugcucccau 3120cuggccacag cgcagaagua
ucaccgugcu gggggauggg gaacagggcu gugaaugggc 3180cuauuugcau
aagcagcaug ugucuggaga gaaagacauc acagagcaga agagugcggg
3240ugcccaggag ugcacuugcc accccuacuu caucccugaa agaguaaaug
gccuggaagg 3300ugucucugag agguaaugcc gcacaccacc cucccugggg
gcagggucag gcuacaccug 3360ccuuaggucg ggggcugcag cagccugaga
gcucucagua gggccucagu agccugggag 3420ggagcagggg cagggggcag
ggaaagaggc guaauggggc uguccagagg ggccugggaa 3480accugguccc
ugaggccugg gcacagcuac aaucacuuca aauuggcugu ggggccagug
3540gacugggaag gaaaaaagca auaagaguga ccaagugcag aaggcuguca
ggucccaggu 3600cacaugccuu agugcaguga cuccucauca uuuuaugggg
uguggguguc guugguacac 3660ccauuuuaca gaugaggaca ccgaggccca
gaaaaguuaa guuacauguc cuaagucaca 3720cagcuuguaa gugccagaac
ugagaucaaa accaagucuc uuugacuuua aagucuguac 3780ucugacccca
aagagauccu guuuggccac uuauaggagg ucccuaaagc ugcagacucc
3840ccuugccggc acccacauau agagacauua acccuucccc ugcaggguca
ccucaaauag 3900ucuuuuagcu gggcuucucc ugcaauucca ccuaaugcca
uccccugggu uuugcccaaa 3960ccugaacugg gcaguggggu gagaggaggg
guuuacaggg uuacagagcc ucauacagau 4020aggagcccau ggcugcuggu
caucugcauu ccugcaggau uggcuguucc uugggguccu 4080uggcaggaaa
augaggauug cuccgaggcc ugcuccagua cuucccagag gcuggccugg
4140uguggggcuc ugggaaggcu gaggcuggag aagcguaagu aggagggcag
agauggcacu 4200cagguagcuu gaaucaccag gacccuucca agccccacag
guucugaggg aguacuaggg 4260ccagcucugg gagaggucuc uuccuaugcu
gugaaccccc ugccuuucuu gcagccuaca 4320acgaauaaau uuuc
4334393868RNAHomo sapiens 39uuacuccccu gcuggugccu cccuccuugg
cgcgcuuccc accugcgauc ggcgcccucu 60ucgcagucac gaacucgcca gcagcuagca
gcacugacua guaggagggc ccgccggagg 120agaggacaug cucuggaguc
agaagacagc gaaaagagaa gcagaagccc cgguggcaag 180agucugaagc
aggaaggaug acuguagccu guggauugua cugcaguagg aaacuguccu
240agcaaggcuc cacuuugccc cagcuucaag cuggaaagga ggagaacaug
aaacauugcu 300ugaagacaau ggccgagaca gcagguccca cccugcacag
ccaccagcau cucuccccuc 360agcccugucu ccucuucugc aguugggauc
ugcacauuua agccugaaau uguccuguga 420agugaaguau gaucggacag
ccucuuuuca gcuuuuauga caauggagac agaggaauug 480uggcucuugc
caaggucaca ggauuggaau acagagccaa gccaccccag gacaugcaag
540agccucagaa gggaaaaaag cccagcagga agggagaaca aguagccucu
guccugaagu 600uguaacagcc aggggccagg auggaggagg aggaccccau
aaucugccca ucugggacuu 660ggcaggggac cugggaaaau guaccccaac
ccaucccuua agggccuuug ucuuuggccc 720auuggccuag caucuacuuc
uucaccgugu cuguucuugu cacaccuagu caggucuguu 780ugggucugag
gugcauggaa cauucugggu aggccuccag caaacggaag cucuucaccg
840uguuuccagc cugggaccaa gggcagcaua cuggcaaagu ugccaaagca
agggacucca 900gccucuuagg aguuaaugac ucccucuccc cagcuguccu
ccccuuggug cuccucuucc 960ucccuccucc ugcucacagc aggcagggcc
uagacccggg agccaugcug cugugcuguu 1020gccaggggag cacggaggca
gaucugagcu augcagggaa aaggcccagc cugucaaagu 1080gucugagaug
aaccgccgcc gucccugugc agcugggcuc agacgugucu cagcucuugu
1140ucugugccug agaauggcga aacccaguga gguucaaggg caaacucgcu
auucauuagu 1200cagggguucu ugacgucccg ucucucccag ggaugaguuc
cccccuccuc uuucuccccc 1260uccuaugaca cauuccuggg ugccuuuggu
gaggacugca cacccuccuc cugccuagcc 1320cccucuccaa aggccccuga
auaaacuccc cccaaggaga ccaggcaggg cagagacaau 1380ggcugcagga
aaucauucag gcgggacaug cuggccugcc cuccacccag ucccccugug
1440ggccccacuc ccuucugauu cagggcaccc uugggccccc agccuauaca
ggccuggaca 1500ggaagaaacc acugggaacc acccuaagga caacaugcua
guccagugcc auucuucgcu 1560ggcucugugg gugccuuugu ggccuguacc
gacuggcugg cuaauuuugu gguuucugua 1620ccaucacaug ccuauuuuaa
gacacucucc agcacugucg guuagggagu guaaauuuug 1680caauauuuuc
ugaaaugugg caauaucaaa auguaaaagg cacacauacu uggucacaaa
1740caaauggcac uauuuacucu gugggcauau uuguaaaagu ugccaaagaa
uuauauacaa 1800ggauguucau cagagcauuu cuuuugaaga guaaagaaau
ggacaugaac cugugguccg 1860uucauacggu ggaauaccua ugcagcugua
aaaaucagug ugguagaucu ccguauauga 1920guugaugugg aagguuggcc
aguucacaug auaaggugaa uagaauaagu uacagaacag 1980gcuguagagu
augaucuuau uuguagaugu uuaaaacuga gucauaagua ugcuuauaua
2040cagaucguuu cuggaaguau guacuggaag ucuaccucug gggagugggg
augggggagu 2100gcacucuucu auacuguuau auuuucuuuu caugcuccua
agguacuuuu auuggaagau 2160guaaagcggu ucaauguaau aggcuuaacu
ucugucaacu aaguuggcgu gggugcuuua 2220agaggguggu agugauguug
cuggagaaag uaucccacag ucacuggugg cuucagccac 2280gggccauuuu
ggggccuaau aaucacauau caucaugguu gcuaguguua aucgaaaacc
2340uacuaagugc caggcuuacu gucucugggu cuugcuuacg uggaugucau
uuuuccaguu 2400gcaccaaauc gaaagagguu aauugguuug uuggaguucc
uuuguaggug aagggcagag 2460ccaggagcuu ggcuagggac aggggaggug
agugggggau gguggauagg ucuuggcucc 2520caguuuccuu cugggcagac
auugccccuc ugcccugagg accugcuugu uugggggaag 2580aggccuuuag
aggcaccagg gucaugccag guguuggaca uggugaacug ggaagugcuc
2640ccaucuggcc acagcgcaga aguaucaccg ugcuggggga uggggaacag
ggcugugaau 2700gggccuauuu gcauaagcag caugugucug gagagaaaga
caucacagag cagaagagug 2760cgggugccca ggagugcacu ugccaccccu
acuucauccc ugaaagagua aauggccugg 2820aaggugucuc ugagagguaa
ugccgcacac cacccucccu gggggcaggg ucaggcuaca 2880ccugccuuag
gucgggggcu gcagcagccu gagagcucuc aguagggccu caguagccug
2940ggagggagca ggggcagggg gcagggaaag aggcguaaug gggcugucca
gaggggccug 3000ggaaaccugg ucccugaggc cugggcacag cuacaaucac
uucaaauugg cuguggggcc 3060aguggacugg gaaggaaaaa agcaauaaga
gugaccaagu gcagaaggcu gucagguccc 3120aggucacaug ccuuagugca
gugacuccuc aucauuuuau gggguguggg ugucguuggu 3180acacccauuu
uacagaugag gacaccgagg cccagaaaag uuaaguuaca uguccuaagu
3240cacacagcuu guaagugcca gaacugagau caaaaccaag ucucuuugac
uuuaaagucu 3300guacucugac cccaaagaga uccuguuugg ccacuuauag
gaggucccua aagcugcaga 3360cuccccuugc cggcacccac auauagagac
auuaacccuu ccccugcagg gucaccucaa 3420auagucuuuu agcugggcuu
cuccugcaau uccaccuaau gccauccccu ggguuuugcc 3480caaaccugaa
cugggcagug gggugagagg agggguuuac aggguuacag agccucauac
3540agauaggagc ccauggcugc uggucaucug cauuccugca ggauuggcug
uuccuugggg 3600uccuuggcag gaaaaugagg auugcuccga ggccugcucc
aguacuuccc agaggcuggc 3660cugguguggg gcucugggaa ggcugaggcu
ggagaagcgu aaguaggagg gcagagaugg 3720cacucaggua gcuugaauca
ccaggacccu uccaagcccc acagguucug agggaguacu 3780agggccagcu
cugggagagg ucucuuccua ugcugugaac ccccugccuu ucuugcagcc
3840uacaacgaau aaauuuucuu ugcaaagg 3868403978RNAHomo sapiens
40ggucgccgcg ccaagggccc gcugagcccc uccucccauu cguccagccg cgcggcccac
60agaagcggaa cgcgcgucga gagcgcccug uccgcucgcc ccagacagau gcccgguuau
120ucauuaccgc gaggccuaga ggaaagagug gcugccgucu uccugcccac
agcccgccgg 180acccuccguc gcggcugccc gguccccgga gccgcagccg
ccgagcccgg cugugcgugu 240cguggcugcu ggggagaaag aggcuuccgg
acaugcucug gagucagaag acagcgaaaa 300gagaagcaga agccccggug
gcaagagucu gaagcaggaa ggaugacugu agccugugga 360uuguacugca
guaggaaacu guccuagcaa ggcuccacuu ugccccagcu ucaagcugga
420aaggaggaga acaugaaaca uugcuugaag acaauggccg agacagcagg
ucccacccug 480cacagccacc agcaucucuc cccucagccc ugucuccucu
ucugcaguug ggaucugcac 540auuuaagccu gaaauugucc ugugaaguga
aguaugaucg gacagccucu uuucagcuuu 600uaugacaaug gagacagagg
aauuguggcu cuugccaagg ucacaggauu ggaauacaga 660gccaagccac
cccaggacau gcaagagccu cagaagggaa aaaagcccag caggaaggga
720gaacaaguag ccucuguccu gaaguuguaa cagccagggg ccaggaugga
ggaggaggac 780cccauaaucu gcccaucugg gacuuggcag gggaccuggg
aaaauguacc ccaacccauc 840ccuuaagggc cuuugucuuu ggcccauugg
ccuagcaucu acuucuucac cgugucuguu 900cuugucacac cuagucaggu
cuguuugggu cugaggugca uggaacauuc uggguaggcc 960uccagcaaac
ggaagcucuu caccguguuu ccagccuggg accaagggca gcauacuggc
1020aaaguugcca aagcaaggga cuccagccuc uuaggaguua augacucccu
cuccccagcu 1080guccuccccu uggugcuccu cuuccucccu ccuccugcuc
acagcaggca gggccuagac 1140ccgggagcca ugcugcugug cuguugccag
gggagcacgg aggcagaucu gagcuaugca 1200gggaaaaggc ccagccuguc
aaagugucug agaugaaccg ccgccguccc ugugcagcug 1260ggcucagacg
ugucucagcu cuuguucugu gccugagaau ggcgaaaccc agugagguuc
1320aagggcaaac ucgcuauuca uuagucaggg guucuugacg ucccgucucu
cccagggaug 1380aguucccccc uccucuuucu cccccuccua ugacacauuc
cugggugccu uuggugagga 1440cugcacaccc uccuccugcc uagcccccuc
uccaaaggcc ccugaauaaa cuccccccaa 1500ggagaccagg cagggcagag
acaauggcug caggaaauca uucaggcggg acaugcuggc 1560cugcccucca
cccagucccc cugugggccc cacucccuuc ugauucaggg cacccuuggg
1620cccccagccu auacaggccu ggacaggaag aaaccacugg gaaccacccu
aaggacaaca 1680ugcuagucca gugccauucu ucgcuggcuc ugugggugcc
uuuguggccu guaccgacug 1740gcuggcuaau uuugugguuu cuguaccauc
acaugccuau uuuaagacac ucuccagcac 1800ugucgguuag ggaguguaaa
uuuugcaaua uuuucugaaa uguggcaaua ucaaaaugua 1860aaaggcacac
auacuugguc acaaacaaau ggcacuauuu acucuguggg cauauuugua
1920aaaguugcca aagaauuaua uacaaggaug uucaucagag cauuucuuuu
gaagaguaaa 1980gaaauggaca ugaaccugug guccguucau acgguggaau
accuaugcag cuguaaaaau 2040caguguggua gaucuccgua uaugaguuga
uguggaaggu uggccaguuc acaugauaag 2100gugaauagaa uaaguuacag
aacaggcugu agaguaugau cuuauuugua gauguuuaaa 2160acugagucau
aaguaugcuu auauacagau cguuucugga aguauguacu ggaagucuac
2220cucuggggag uggggauggg ggagugcacu cuucuauacu guuauauuuu
cuuuucaugc 2280uccuaaggua cuuuuauugg aagauguaaa gcgguucaau
guaauaggcu uaacuucugu 2340caacuaaguu ggcgugggug cuuuaagagg
gugguaguga uguugcugga gaaaguaucc 2400cacagucacu gguggcuuca
gccacgggcc auuuuggggc cuaauaauca cauaucauca 2460ugguugcuag
uguuaaucga aaaccuacua agugccaggc uuacugucuc ugggucuugc
2520uuacguggau gucauuuuuc caguugcacc aaaucgaaag agguuaauug
guuuguugga 2580guuccuuugu aggugaaggg cagagccagg agcuuggcua
gggacagggg aggugagugg 2640gggauggugg auaggucuug gcucccaguu
uccuucuggg cagacauugc cccucugccc 2700ugaggaccug cuuguuuggg
ggaagaggcc uuuagaggca ccagggucau gccagguguu 2760ggacauggug
aacugggaag ugcucccauc uggccacagc gcagaaguau caccgugcug
2820ggggaugggg aacagggcug ugaaugggcc uauuugcaua agcagcaugu
gucuggagag 2880aaagacauca cagagcagaa gagugcgggu gcccaggagu
gcacuugcca ccccuacuuc 2940aucccugaaa gaguaaaugg ccuggaaggu
gucucugaga gguaaugccg cacaccaccc 3000ucccuggggg cagggucagg
cuacaccugc cuuaggucgg gggcugcagc agccugagag 3060cucucaguag
ggccucagua gccugggagg gagcaggggc agggggcagg gaaagaggcg
3120uaauggggcu guccagaggg gccugggaaa ccuggucccu gaggccuggg
cacagcuaca 3180aucacuucaa auuggcugug gggccagugg acugggaagg
aaaaaagcaa uaagagugac 3240caagugcaga aggcugucag gucccagguc
acaugccuua gugcagugac uccucaucau 3300uuuauggggu gugggugucg
uugguacacc cauuuuacag augaggacac cgaggcccag 3360aaaaguuaag
uuacaugucc uaagucacac agcuuguaag ugccagaacu gagaucaaaa
3420ccaagucucu uugacuuuaa agucuguacu cugaccccaa agagauccug
uuuggccacu 3480uauaggaggu cccuaaagcu gcagacuccc cuugccggca
cccacauaua gagacauuaa 3540cccuuccccu gcagggucac cucaaauagu
cuuuuagcug ggcuucuccu gcaauuccac 3600cuaaugccau ccccuggguu
uugcccaaac cugaacuggg caguggggug agaggagggg 3660uuuacagggu
uacagagccu cauacagaua ggagcccaug gcugcugguc aucugcauuc
3720cugcaggauu ggcuguuccu ugggguccuu ggcaggaaaa ugaggauugc
uccgaggccu 3780gcuccaguac uucccagagg cuggccuggu guggggcucu
gggaaggcug aggcuggaga 3840agcguaagua ggagggcaga gauggcacuc
agguagcuug aaucaccagg acccuuccaa 3900gccccacagg uucugaggga
guacuagggc cagcucuggg agaggucucu uccuaugcug 3960ugaacccccu gccuuucu
3978413837RNAHomo sapiens 41ccaggcgugu gcauuuauau gcagagugac
caagaaacuu caguaauacu aguuuguguc 60uuuggagucc cacuuuuugc cagggcuagu
gcuaacagcu ucaggagaau ucagccucac 120cuugacagga caugcucugg
agucagaaga cagcgaaaag agaagcagaa gccccggugg 180caagagucug
aagcaggaag gaugacugua gccuguggau uguacugcag uaggaaacug
240uccuagcaag gcuccacuuu gccccagcuu caagcuggaa aggaggagaa
caugaaacau 300ugcuugaaga caauggccga gacagcaggu cccacccugc
acagccacca gcaucucucc 360ccucagcccu gucuccucuu cugcaguugg
gaucugcaca uuuaagccug aaauuguccu 420gugaagugaa guaugaucgg
acagccucuu uucagcuuuu augacaaugg agacagagga 480auuguggcuc
uugccaaggu cacaggauug gaauacagag ccaagccacc ccaggacaug
540caagagccuc agaagggaaa aaagcccagc aggaagggag aacaaguagc
cucuguccug 600aaguuguaac agccaggggc caggauggag gaggaggacc
ccauaaucug cccaucuggg 660acuuggcagg ggaccuggga aaauguaccc
caacccaucc cuuaagggcc uuugucuuug 720gcccauuggc cuagcaucua
cuucuucacc gugucuguuc uugucacacc uagucagguc 780uguuuggguc
ugaggugcau ggaacauucu ggguaggccu ccagcaaacg gaagcucuuc
840accguguuuc cagccuggga ccaagggcag cauacuggca aaguugccaa
agcaagggac 900uccagccucu uaggaguuaa ugacucccuc uccccagcug
uccuccccuu ggugcuccuc 960uuccucccuc cuccugcuca cagcaggcag
ggccuagacc cgggagccau gcugcugugc 1020uguugccagg ggagcacgga
ggcagaucug agcuaugcag ggaaaaggcc cagccuguca 1080aagugucuga
gaugaaccgc cgccgucccu gugcagcugg gcucagacgu gucucagcuc
1140uuguucugug ccugagaaug gcgaaaccca gugagguuca agggcaaacu
cgcuauucau 1200uagucagggg uucuugacgu cccgucucuc ccagggauga
guuccccccu ccucuuucuc 1260ccccuccuau gacacauucc ugggugccuu
uggugaggac ugcacacccu ccuccugccu 1320agcccccucu ccaaaggccc
cugaauaaac uccccccaag gagaccaggc agggcagaga 1380caauggcugc
aggaaaucau ucaggcggga caugcuggcc ugcccuccac ccaguccccc
1440ugugggcccc acucccuucu gauucagggc acccuugggc ccccagccua
uacaggccug 1500gacaggaaga aaccacuggg aaccacccua aggacaacau
gcuaguccag ugccauucuu 1560cgcuggcucu gugggugccu uuguggccug
uaccgacugg cuggcuaauu uugugguuuc 1620uguaccauca caugccuauu
uuaagacacu cuccagcacu gucgguuagg gaguguaaau 1680uuugcaauau
uuucugaaau guggcaauau caaaauguaa aaggcacaca uacuugguca
1740caaacaaaug gcacuauuua cucugugggc auauuuguaa aaguugccaa
agaauuauau 1800acaaggaugu ucaucagagc auuucuuuug aagaguaaag
aaauggacau gaaccugugg 1860uccguucaua cgguggaaua ccuaugcagc
uguaaaaauc agugugguag aucuccguau 1920augaguugau guggaagguu
ggccaguuca caugauaagg ugaauagaau aaguuacaga 1980acaggcugua
gaguaugauc uuauuuguag auguuuaaaa cugagucaua aguaugcuua
2040uauacagauc guuucuggaa guauguacug gaagucuacc ucuggggagu
ggggaugggg 2100gagugcacuc uucuauacug uuauauuuuc uuuucaugcu
ccuaagguac uuuuauugga 2160agauguaaag cgguucaaug uaauaggcuu
aacuucuguc aacuaaguug gcgugggugc 2220uuuaagaggg ugguagugau
guugcuggag aaaguauccc acagucacug guggcuucag 2280ccacgggcca
uuuuggggcc uaauaaucac auaucaucau gguugcuagu guuaaucgaa
2340aaccuacuaa gugccaggcu uacugucucu gggucuugcu uacguggaug
ucauuuuucc 2400aguugcacca aaucgaaaga gguuaauugg uuuguuggag
uuccuuugua ggugaagggc 2460agagccagga gcuuggcuag ggacagggga
ggugaguggg ggauggugga uaggucuugg 2520cucccaguuu ccuucugggc
agacauugcc ccucugcccu gaggaccugc uuguuugggg 2580gaagaggccu
uuagaggcac cagggucaug ccagguguug gacaugguga acugggaagu
2640gcucccaucu ggccacagcg cagaaguauc accgugcugg gggaugggga
acagggcugu 2700gaaugggccu auuugcauaa gcagcaugug ucuggagaga
aagacaucac agagcagaag 2760agugcgggug cccaggagug cacuugccac
cccuacuuca ucccugaaag aguaaauggc 2820cuggaaggug ucucugagag
guaaugccgc acaccacccu cccugggggc agggucaggc 2880uacaccugcc
uuaggucggg ggcugcagca gccugagagc ucucaguagg gccucaguag
2940ccugggaggg agcaggggca gggggcaggg aaagaggcgu aauggggcug
uccagagggg 3000ccugggaaac cuggucccug aggccugggc acagcuacaa
ucacuucaaa uuggcugugg 3060ggccagugga cugggaagga aaaaagcaau
aagagugacc aagugcagaa ggcugucagg 3120ucccagguca caugccuuag
ugcagugacu ccucaucauu uuauggggug ugggugucgu 3180ugguacaccc
auuuuacaga ugaggacacc gaggcccaga aaaguuaagu uacauguccu
3240aagucacaca gcuuguaagu gccagaacug agaucaaaac caagucucuu
ugacuuuaaa 3300gucuguacuc ugaccccaaa gagauccugu uuggccacuu
auaggagguc ccuaaagcug 3360cagacucccc uugccggcac ccacauauag
agacauuaac ccuuccccug cagggucacc 3420ucaaauaguc uuuuagcugg
gcuucuccug caauuccacc uaaugccauc cccuggguuu 3480ugcccaaacc
ugaacugggc agugggguga gaggaggggu uuacaggguu acagagccuc
3540auacagauag gagcccaugg cugcugguca ucugcauucc ugcaggauug
gcuguuccuu 3600gggguccuug gcaggaaaau gaggauugcu ccgaggccug
cuccaguacu ucccagaggc 3660uggccuggug uggggcucug ggaaggcuga
ggcuggagaa gcguaaguag gagggcagag 3720auggcacuca gguagcuuga
aucaccagga cccuuccaag ccccacaggu ucugagggag 3780uacuagggcc
agcucuggga gaggucucuu ccuaugcugu gaacccccug ccuuucu
383742571DNAHomo sapiens 42gtgcccgccc gagaaggcgg cgctgggagc
cgctcagagc ccagagaagc ggcgcgcggc 60caggagcccc cgctccgcca ctgccgtgcc
tgcctcccgc agctgtctgc catgcgctcg 120ccggggcagg ggcgcccgga
gggcggctag agctgggcct gagcccggga acgcgcctga 180tcaggggtgg
cggagccgcg gtccccacag ccgccccacc cgcgccgctg cctcgctggg
240gcccgggccc ccttcccggt ccttactccc ctgctggtgc ctccctcctt
ggcgcgcttc 300ccacctgcga tcggcgccct cttcgcagtc acgaactcgc
cagcagctag cagcactgac 360tagtaggagg gcccgccgga ggagagccgc
gcggcccaca gaagcggaac gcgcgtcgag 420agcgccctgt ccgctcgccc
cagacagatg cccggttatt cattaccgcg aggcctagag 480gaaagagtgg
ctgccgtctt cctgcccaca gcccgccgga ccctccgtcg cggctgcccg
540gtccccggag ccgcagccgc cgagcccggc t 571434915DNAHomo sapiens
43ccaccacacc cagactaaaa ggcagtttga ttttacaaat caaaatagca gtaatctatg
60gagatttact tgtgagattg gtaggaaaca tcttaaatgt aatcaaacaa taacttacat
120cttgatgaat tcacgtgtag gtttctcttc ctcagaagaa atcagatgct
gttcagagca 180cgaaggctag aattttaccc tggttctcat gctaccttgc
acccaggttg gatcctgagt 240acagtttttg gcaggtgggc ctgcatataa
gttagcaatg ggggataccc agctgcctct 300cttcatacag ctgaggtttt
ggggagtcat tcttatagcc cctgggttgg gcctagtcct 360gcaaatgaat
tcaccagccc taaagcccaa attgcagcct ctgtcattca ccttccagga
420gtggaaaggg cagtaagttt catcttatta ttattgctat tttggtggtt
ttgttgaggt 480tggtgtgtgt atgttagtaa gataaagctc tcagaaatta
catagcattt gtcaaggata 540taagagggac tgtgccacat ctggctgtat
agaaggtggt tccatatctt taaatagagc 600cccaggtcct tagccaccag
aaaggttttc aggggaagtg tgcaccctca gcagctgctg 660ctggtgggca
ggatgggcac gcatggaaca ggctttcctc tgtggccagg tgagaagcag
720gtggtgagac acagagcagt gctgggctct gcttctgaag cctccaacct
ttccttccct 780aggaagcccc agagagattg gtgagggtga tttcccagga
agacgcagtg tgctctgact 840tctgtgacag tgagcaacgg gaccagtgga
tgtccagatg ctggcaatga gtaggccttc 900cctacgctgg gtggcgtcca
caccctccgg cttccattgc ctgggtctcc tggaggtggt 960ttgctggatg
aataccgcat gcacagaggc tggccttggg tttgaatatg gcagccagtg
1020gacagcatgt gcttcagtta tgagactgcc caggagatgc ttcttccaag
gcagagcacg 1080tgcagagtcc agtgctggag aggccgggtg cgcagttgac
ccatttccag ttctgttttc 1140cctctcatgt tcctctgtcc ccatctagga
catgctctgg agtcagaaga cagcgaaaag 1200agaagcagaa gccccggtgg
caagagtctg aagcaggaag gatgactgta gcctgtggat 1260tgtactgcag
taggaaactg tcctagcaag gctccacttt gccccagctt caagctggaa
1320aggaggagaa catgaaacat tgcttgaaga caatggccga gacagcaggt
cccaccctgc 1380acagccacca gcatctctcc cctcagccct gtctcctctt
ctgcagttgg gatctgcaca 1440tttaagcctg aaattgtcct gtgaagtgaa
gtatgatcgg acagcctctt ttcagctttt 1500atgacaatgg agacagagga
attgtggctc ttgccaaggt cacaggattg gaatacagag 1560ccaagccacc
ccaggacatg caagagcctc agaagggaaa aaagcccagc aggaagggag
1620aacaagtagc ctctgtcctg aagttgtaac agccaggggc caggatggag
gaggaggacc 1680ccataatctg cccatctggg acttggcagg ggacctggga
aaatgtaccc caacccatcc 1740cttaagggcc tttgtctttg gcccattggc
ctagcatcta cttcttcacc gtgtctgttc 1800ttgtcacacc tagtcaggtc
tgtttgggtc tgaggtgcat ggaacattct gggtaggcct 1860ccagcaaacg
gaagctcttc accgtgtttc cagcctggga ccaagggcag catactggca
1920aagttgccaa agcaagggac tccagcctct taggagttaa tgactccctc
tccccagctg 1980tcctcccctt ggtgctcctc ttcctccctc ctcctgctca
cagcaggcag ggcctagacc 2040cgggagccat gctgctgtgc tgttgccagg
ggagcacgga ggcagatctg agctatgcag 2100ggaaaaggcc cagcctgtca
aagtgtctga gatgaaccgc cgccgtccct gtgcagctgg 2160gctcagacgt
gtctcagctc ttgttctgtg cctgagaatg gcgaaaccca gtgaggttca
2220agggcaaact cgctattcat tagtcagggg ttcttgacgt cccgtctctc
ccagggatga 2280gttcccccct cctctttctc cccctcctat gacacattcc
tgggtgcctt tggtgaggac 2340tgcacaccct cctcctgcct agccccctct
ccaaaggccc ctgaataaac tccccccaag 2400gagaccaggc agggcagaga
caatggctgc aggaaatcat tcaggcggga catgctggcc 2460tgccctccac
ccagtccccc tgtgggcccc actcccttct gattcagggc acccttgggc
2520ccccagccta tacaggcctg gacaggaaga aaccactggg aaccacccta
aggacaacat 2580gctagtccag tgccattctt cgctggctct gtgggtgcct
ttgtggcctg taccgactgg 2640ctggctaatt ttgtggtttc tgtaccatca
catgcctatt ttaagacact ctccagcact 2700gtcggttagg gagtgtaaat
tttgcaatat tttctgaaat gtggcaatat caaaatgtaa 2760aaggcacaca
tacttggtca caaacaaatg gcactattta ctctgtgggc atatttgtaa
2820aagttgccaa agaattatat acaaggatgt tcatcagagc atttcttttg
aagagtaaag 2880aaatggacat gaacctgtgg tccgttcata cggtggaata
cctatgcagc tgtaaaaatc 2940agtgtggtag atctccgtat atgagttgat
gtggaaggtt ggccagttca catgataagg 3000tgaatagaat aagttacaga
acaggctgta gagtatgatc ttatttgtag atgtttaaaa 3060ctgagtcata
agtatgctta tatacagatc gtttctggaa gtatgtactg gaagtctacc
3120tctggggagt ggggatgggg gagtgcactc ttctatactg ttatattttc
ttttcatgct 3180cctaaggtac ttttattgga agatgtaaag cggttcaatg
taataggctt aacttctgtc 3240aactaagttg gcgtgggtgc tttaagaggg
tggtagtgat gttgctggag aaagtatccc 3300acagtcactg gtggcttcag
ccacgggcca ttttggggcc taataatcac atatcatcat 3360ggttgctagt
gttaatcgaa aacctactaa gtgccaggct tactgtctct gggtcttgct
3420tacgtggatg tcatttttcc agttgcacca aatcgaaaga ggttaattgg
tttgttggag 3480ttcctttgta ggtgaagggc agagccagga gcttggctag
ggacagggga ggtgagtggg 3540ggatggtgga taggtcttgg ctcccagttt
ccttctgggc agacattgcc cctctgccct 3600gaggacctgc ttgtttgggg
gaagaggcct ttagaggcac cagggtcatg ccaggtgttg 3660gacatggtga
actgggaagt gctcccatct ggccacagcg cagaagtatc accgtgctgg
3720gggatgggga acagggctgt gaatgggcct atttgcataa gcagcatgtg
tctggagaga 3780aagacatcac agagcagaag agtgcgggtg cccaggagtg
cacttgccac ccctacttca 3840tccctgaaag agtaaatggc ctggaaggtg
tctctgagag gtaatgccgc acaccaccct 3900ccctgggggc agggtcaggc
tacacctgcc ttaggtcggg ggctgcagca gcctgagagc 3960tctcagtagg
gcctcagtag cctgggaggg agcaggggca gggggcaggg aaagaggcgt
4020aatggggctg tccagagggg cctgggaaac ctggtccctg aggcctgggc
acagctacaa 4080tcacttcaaa ttggctgtgg ggccagtgga ctgggaagga
aaaaagcaat aagagtgacc 4140aagtgcagaa ggctgtcagg tcccaggtca
catgccttag tgcagtgact cctcatcatt 4200ttatggggtg tgggtgtcgt
tggtacaccc attttacaga tgaggacacc gaggcccaga 4260aaagttaagt
tacatgtcct aagtcacaca gcttgtaagt gccagaactg agatcaaaac
4320caagtctctt tgactttaaa gtctgtactc tgaccccaaa gagatcctgt
ttggccactt 4380ataggaggtc cctaaagctg cagactcccc ttgccggcac
ccacatatag agacattaac 4440ccttcccctg cagggtcacc tcaaatagtc
ttttagctgg gcttctcctg caattccacc 4500taatgccatc ccctgggttt
tgcccaaacc tgaactgggc agtggggtga gaggaggggt 4560ttacagggtt
acagagcctc atacagatag gagcccatgg ctgctggtca tctgcattcc
4620tgcaggattg gctgttcctt ggggtccttg gcaggaaaat gaggattgct
ccgaggcctg 4680ctccagtact tcccagaggc tggcctggtg tggggctctg
ggaaggctga ggctggagaa 4740gcgtaagtag gagggcagag atggcactca
ggtagcttga atcaccagga cccttccaag 4800ccccacaggt tctgagggag
tactagggcc agctctggga gaggtctctt cctatgctgt 4860gaaccccctg
cctttcttgc agcctacaac gaataaattt tctttgcaaa ggctt 4915444687DNAHomo
sapiens 44uuccucagaa gaaaucagau gcuguucaga gcacgaaggc uagaauuuua
cccugguucu 60caugcuaccu ugcacccagg uuggauccug aguacaguuu uuggcaggug
ggccugcaua 120uaaguuagca augggggaua cccagcugcc ucucuucaua
cagcugaggu uuuggggagu 180cauucuuaua gccccugggu ugggccuagu
ccugcaaaug aauucaccag cccuaaagcc 240caaauugcag ccucugucau
ucaccuucca ggaguggaaa gggcaguaag uuucaucuua 300uuauuauugc
uauuuuggug guuuuguuga gguuggugug uguauguuag uaagauaaag
360cucucagaaa uuacauagca uuugucaagg auauaagagg gacugugcca
caucuggcug 420uauagaaggu gguuccauau cuuuaaauag agccccaggu
ccuuagccac cagaaagguu 480uucaggggaa gugugcaccc ucagcagcug
cugcuggugg gcaggauggg cacgcaugga 540acaggcuuuc cucuguggcc
aggugagaag caggugguga gacacagagc agugcugggc 600ucugcuucug
aagccuccaa ccuuuccuuc ccuaggaagc cccagagaga uuggugaggg
660ugauuuccca ggaagacgca gugugcucug acuucuguga cagugagcaa
cgggaccagu 720ggauguccag augcuggcaa ugaguaggcc uucccuacgc
uggguggcgu ccacacccuc 780cggcuuccau ugccuggguc uccuggaggu
gguuugcugg augaauaccg caugcacaga 840ggcuggccuu ggguuugaau
auggcagcca guggacagca ugugcuucag uuaugagacu 900gcccaggaga
ugcuucuucc aaggcagagc acgugcagag uccagugcug gagaggccgg
960gugcgcaguu gacccauuuc caguucuguu uucccucuca uguuccucug
uccccaucua 1020ggacaugcuc uggagucaga agacagcgaa aagagaagca
gaagccccgg uggcaagagu 1080cugaagcugg aaaggaggag aacaugaaac
auugcuugaa gacaauggcc gagacagcag 1140gucccacccu gcacagccac
cagcaucucu ccccucagcc cugucuccuc uucugcaguu 1200gggaucugca
cauuuaagcc ugaaauuguc cugugaagug aaguaugauc ggacagccuc
1260uuuucagcuu uuaugacaau ggagacagag gaauuguggc ucuugccaag
gucacaggau 1320uggaauacag agccaagcca ccccaggaca ugcaagagcc
ucagaaggga aaaaagccca 1380gcaggaaggg agaacaagua gccucugucc
ugaaguugua acagccaggg gccaggaugg 1440aggaggagga ccccauaauc
ugcccaucug ggacuuggca ggggaccugg gaaaauguac 1500cccaacccau
cccuuaaggg ccuuugucuu uggcccauug gccuagcauc uacuucuuca
1560ccgugucugu ucuugucaca ccuagucagg ucuguuuggg ucugaggugc
auggaacauu 1620cuggguaggc cuccagcaaa cggaagcucu ucaccguguu
uccagccugg gaccaagggc 1680agcauacugg caaaguugcc aaagcaaggg
acuccagccu cuuaggaguu aaugacuccc 1740ucuccccagc uguccucccc
uuggugcucc ucuuccuccc uccuccugcu cacagcaggc 1800agggccuaga
cccgggagcc augcugcugu gcuguugcca ggggagcacg gaggcagauc
1860ugagcuaugc agggaaaagg cccagccugu caaagugucu gagaugaacc
gccgccgucc 1920cugugcagcu gggcucagac gugucucagc ucuuguucug
ugccugagaa uggcgaaacc 1980cagugagguu caagggcaaa cucgcuauuc
auuagucagg gguucuugac gucccgucuc 2040ucccagggau gaguuccccc
cuccucuuuc ucccccuccu augacacauu ccugggugcc 2100uuuggugagg
acugcacacc cuccuccugc cuagcccccu cuccaaaggc cccugaauaa
2160acucccccca aggagaccag gcagggcaga gacaauggcu gcaggaaauc
auucaggcgg 2220gacaugcugg ccugcccucc acccaguccc ccugugggcc
ccacucccuu cugauucagg 2280gcacccuugg gcccccagcc uauacaggcc
uggacaggaa gaaaccacug ggaaccaccc 2340uaaggacaac augcuagucc
agugccauuc uucgcuggcu cugugggugc cuuuguggcc 2400uguaccgacu
ggcuggcuaa uuuugugguu ucuguaccau cacaugccua uuuuaagaca
2460cucuccagca cugucgguua gggaguguaa auuuugcaau auuuucugaa
auguggcaau 2520aucaaaaugu aaaaggcaca cauacuuggu cacaaacaaa
uggcacuauu uacucugugg 2580gcauauuugu aaaaguugcc aaagaauuau
auacaaggau guucaucaga gcauuucuuu 2640ugaagaguaa agaaauggac
augaaccugu gguccguuca uacgguggaa uaccuaugca 2700gcuguaaaaa
ucaguguggu agaucuccgu auaugaguug auguggaagg uuggccaguu
2760cacaugauaa ggugaauaga auaaguuaca gaacaggcug uagaguauga
ucuuauuugu 2820agauguuuaa aacugaguca uaaguaugcu uauauacaga
ucguuucugg aaguauguac 2880uggaagucua ccucugggga guggggaugg
gggagugcac ucuucuauac uguuauauuu 2940ucuuuucaug cuccuaaggu
acuuuuauug gaagauguaa agcgguucaa uguaauaggc 3000uuaacuucug
ucaacuaagu uggcgugggu gcuuuaagag ggugguagug auguugcugg
3060agaaaguauc ccacagucac ugguggcuuc agccacgggc cauuuugggg
ccuaauaauc 3120acauaucauc augguugcua guguuaaucg aaaaccuacu
aagugccagg cuuacugucu 3180cugggucuug cuuacgugga ugucauuuuu
ccaguugcac caaaucgaaa gagguuaauu 3240gguuuguugg aguuccuuug
uaggugaagg gcagagccag gagcuuggcu agggacaggg 3300gaggugagug
ggggauggug gauaggucuu ggcucccagu uuccuucugg gcagacauug
3360ccccucugcc cugaggaccu gcuuguuugg gggaagaggc cuuuagaggc
accaggguca 3420ugccaggugu uggacauggu gaacugggaa gugcucccau
cuggccacag cgcagaagua 3480ucaccgugcu gggggauggg gaacagggcu
gugaaugggc cuauuugcau aagcagcaug 3540ugucuggaga gaaagacauc
acagagcaga agagugcggg ugcccaggag ugcacuugcc 3600accccuacuu
caucccugaa agaguaaaug gccuggaagg ugucucugag agguaaugcc
3660gcacaccacc cucccugggg gcagggucag gcuacaccug ccuuaggucg
ggggcugcag 3720cagccugaga gcucucagua gggccucagu agccugggag
ggagcagggg cagggggcag 3780ggaaagaggc guaauggggc uguccagagg
ggccugggaa accugguccc ugaggccugg 3840gcacagcuac aaucacuuca
aauuggcugu ggggccagug gacugggaag gaaaaaagca 3900auaagaguga
ccaagugcag aaggcuguca ggucccaggu cacaugccuu agugcaguga
3960cuccucauca uuuuaugggg uguggguguc guugguacac ccauuuuaca
gaugaggaca 4020ccgaggccca gaaaaguuaa guuacauguc cuaagucaca
cagcuuguaa gugccagaac 4080ugagaucaaa accaagucuc uuugacuuua
aagucuguac ucugacccca aagagauccu 4140guuuggccac uuauaggagg
ucccuaaagc ugcagacucc ccuugccggc acccacauau 4200agagacauua
acccuucccc ugcaggguca ccucaaauag ucuuuuagcu gggcuucucc
4260ugcaauucca ccuaaugcca uccccugggu uuugcccaaa ccugaacugg
gcaguggggu 4320gagaggaggg guuuacaggg uuacagagcc ucauacagau
aggagcccau ggcugcuggu 4380caucugcauu ccugcaggau uggcuguucc
uugggguccu uggcaggaaa augaggauug 4440cuccgaggcc ugcuccagua
cuucccagag gcuggccugg uguggggcuc ugggaaggcu 4500gaggcuggag
aagcguaagu aggagggcag agauggcacu cagguagcuu gaaucaccag
4560gacccuucca agccccacag guucugaggg aguacuaggg ccagcucugg
gagaggucuc 4620uuccuaugcu gugaaccccc ugccuuucuu gcagccuaca
acgaauaaau uuucuuugca 4680aaggcuu 468745706DNAHomo sapiens
45cucagaaauu acauagcauu ugucaaggau auaagaggga cugugccaca
ucuggcugua
60uagaaggugg uuccauaucu uuaaauagag ccccaggucc uuagccacca gaaagguuuu
120caggggaagu gugcacccuc agcagcugcu gcuggugggc aggaugggca
cgcauggaac 180aggcuuuccu cuguggccag gugagaagca gguggugaga
cacagagcag ugcugggcuc 240ugcuucugaa gccuccaacc uuuccuuccc
uaggaagccc cagagagauu ggugagggug 300auuucccagg aagacgcagu
gugcucugac uucugugaca gugagcaacg ggaccagugg 360auguccagau
gcuggcaaug agacaugcuc uggagucaga agacagcgaa aagagaagca
420gaagccccgg uggcaagagu cugaaggugg guuccuuccu gacaugggca
uugggcugcg 480cauguguguu cgcaguucuu uccagcugcu guucugaccu
cuuugugcag uguauuuaug 540uggcuguaga uggauggucc aagguagauu
uagguuuugg aauacuguuu uuuuuuucua 600cuucagggag aaaauaaccc
aguuugggaa ggacauuuaa aaggggaaaa uauuagguau 660gauggcacac
cugcaguccc agcuauucgg gaggcuaagg cuggag 706463922DNAHomo sapiens
46cacgcaugga acaggcuuuc cucuguggcc aggugagaag caggugguga gacacagagc
60agugcugggc ucugcuucug aagccuccaa ccuuuccuuc ccuaggaagc cccagagaga
120uuggugaggg ugauuuccca ggaagacgca gugugcucug acuucuguga
cagugagcaa 180cgggaccagu ggauguccag augcuggcaa ugagacaugc
ucuggaguca gaagacagcg 240aaaagagaag cagaagcccc gguggcaaga
gucugaagca ggaaggauga cuguagccug 300uggauuguac ugcaguagga
aacuguccua gcaaggcucc acuuugcccc agcuucaagc 360uggaaaggag
gagaacauga aacauugcuu gaagacaaug gccgagacag caggucccac
420ccugcacagc caccagcauc ucuccccuca gcccugucuc cucuucugca
guugggaucu 480gcacauuuaa gccugaaauu guccugugaa gugaaguaug
aucggacagc cucuuuucag 540cuuuuaugac aauggagaca gaggaauugu
ggcucuugcc aaggucacag gauuggaaua 600cagagccaag ccaccccagg
acaugcaaga gccucagaag ggaaaaaagc ccagcaggaa 660gggagaacaa
guagccucug uccugaaguu guaacagcca ggggccagga uggaggagga
720ggaccccaua aucugcccau cugggacuug gcaggggacc ugggaaaaug
uaccccaacc 780caucccuuaa gggccuuugu cuuuggccca uuggccuagc
aucuacuucu ucaccguguc 840uguucuuguc acaccuaguc aggucuguuu
gggucugagg ugcauggaac auucugggua 900ggccuccagc aaacggaagc
ucuucaccgu guuuccagcc ugggaccaag ggcagcauac 960uggcaaaguu
gccaaagcaa gggacuccag ccucuuagga guuaaugacu cccucucccc
1020agcuguccuc cccuuggugc uccucuuccu cccuccuccu gcucacagca
ggcagggccu 1080agacccggga gccaugcugc ugugcuguug ccaggggagc
acggaggcag aucugagcua 1140ugcagggaaa aggcccagcc ugucaaagug
ucugagauga accgccgccg ucccugugca 1200gcugggcuca gacgugucuc
agcucuuguu cugugccuga gaauggcgaa acccagugag 1260guucaagggc
aaacucgcua uucauuaguc agggguucuu gacgucccgu cucucccagg
1320gaugaguucc ccccuccucu uucucccccu ccuaugacac auuccugggu
gccuuuggug 1380aggacugcac acccuccucc ugccuagccc ccucuccaaa
ggccccugaa uaaacucccc 1440ccaaggagac caggcagggc agagacaaug
gcugcaggaa aucauucagg cgggacaugc 1500uggccugccc uccacccagu
cccccugugg gccccacucc cuucugauuc agggcacccu 1560ugggccccca
gccuauacag gccuggacag gaagaaacca cugggaacca cccuaaggac
1620aacaugcuag uccagugcca uucuucgcug gcucuguggg ugccuuugug
gccuguaccg 1680acuggcuggc uaauuuugug guuucuguac caucacaugc
cuauuuuaag acacucucca 1740gcacugucgg uuagggagug uaaauuuugc
aauauuuucu gaaauguggc aauaucaaaa 1800uguaaaaggc acacauacuu
ggucacaaac aaauggcacu auuuacucug ugggcauauu 1860uguaaaaguu
gccaaagaau uauauacaag gauguucauc agagcauuuc uuuugaagag
1920uaaagaaaug gacaugaacc ugugguccgu ucauacggug gaauaccuau
gcagcuguaa 1980aaaucagugu gguagaucuc cguauaugag uugaugugga
agguuggcca guucacauga 2040uaaggugaau agaauaaguu acagaacagg
cuguagagua ugaucuuauu uguagauguu 2100uaaaacugag ucauaaguau
gcuuauauac agaucguuuc uggaaguaug uacuggaagu 2160cuaccucugg
ggagugggga ugggggagug cacucuucua uacuguuaua uuuucuuuuc
2220augcuccuaa gguacuuuua uuggaagaug uaaagcgguu caauguaaua
ggcuuaacuu 2280cugucaacua aguuggcgug ggugcuuuaa gaggguggua
gugauguugc uggagaaagu 2340aucccacagu cacugguggc uucagccacg
ggccauuuug gggccuaaua aucacauauc 2400aucaugguug cuaguguuaa
ucgaaaaccu acuaagugcc aggcuuacug ucucuggguc 2460uugcuuacgu
ggaugucauu uuuccaguug caccaaaucg aaagagguua auugguuugu
2520uggaguuccu uuguagguga agggcagagc caggagcuug gcuagggaca
ggggagguga 2580gugggggaug guggauaggu cuuggcuccc aguuuccuuc
ugggcagaca uugccccucu 2640gcccugagga ccugcuuguu ugggggaaga
ggccuuuaga ggcaccaggg ucaugccagg 2700uguuggacau ggugaacugg
gaagugcucc caucuggcca cagcgcagaa guaucaccgu 2760gcugggggau
ggggaacagg gcugugaaug ggccuauuug cauaagcagc augugucugg
2820agagaaagac aucacagagc agaagagugc gggugcccag gagugcacuu
gccaccccua 2880cuucaucccu gaaagaguaa auggccugga aggugucucu
gagagguaau gccgcacacc 2940acccucccug ggggcagggu caggcuacac
cugccuuagg ucgggggcug cagcagccug 3000agagcucuca guagggccuc
aguagccugg gagggagcag gggcaggggg cagggaaaga 3060ggcguaaugg
ggcuguccag aggggccugg gaaaccuggu cccugaggcc ugggcacagc
3120uacaaucacu ucaaauuggc uguggggcca guggacuggg aaggaaaaaa
gcaauaagag 3180ugaccaagug cagaaggcug ucagguccca ggucacaugc
cuuagugcag ugacuccuca 3240ucauuuuaug gggugugggu gucguuggua
cacccauuuu acagaugagg acaccgaggc 3300ccagaaaagu uaaguuacau
guccuaaguc acacagcuug uaagugccag aacugagauc 3360aaaaccaagu
cucuuugacu uuaaagucug uacucugacc ccaaagagau ccuguuuggc
3420cacuuauagg aggucccuaa agcugcagac uccccuugcc ggcacccaca
uauagagaca 3480uuaacccuuc cccugcaggg ucaccucaaa uagucuuuua
gcugggcuuc uccugcaauu 3540ccaccuaaug ccauccccug gguuuugccc
aaaccugaac ugggcagugg ggugagagga 3600gggguuuaca ggguuacaga
gccucauaca gauaggagcc cauggcugcu ggucaucugc 3660auuccugcag
gauuggcugu uccuuggggu ccuuggcagg aaaaugagga uugcuccgag
3720gccugcucca guacuuccca gaggcuggcc uggugugggg cucugggaag
gcugaggcug 3780gagaagcgua aguaggaggg cagagauggc acucagguag
cuugaaucac caggacccuu 3840ccaagcccca cagguucuga gggaguacua
gggccagcuc ugggagaggu cucuuccuau 3900gcugugaacc cccugccuuu cu
3922473690DNAHomo sapiens 47uuuaacagca ggaaggauga cuguagccug
uggauuguac ugcaguagga aacuguccua 60gcaaggcucc acuuugcccc agcuucaagc
uggaaaggag gagaacauga aacauugcuu 120gaagacaaug gccgagacag
caggucccac ccugcacagc caccagcauc ucuccccuca 180gcccugucuc
cucuucugca guugggaucu gcacauuuaa gccugaaauu guccugugaa
240gugaaguaug aucggacagc cucuuuucag cuuuuaugac aauggagaca
gaggaauugu 300ggcucuugcc aaggucacag gauuggaaua cagagccaag
ccaccccagg acaugcaaga 360gccucagaag ggaaaaaagc ccagcaggaa
gggagaacaa guagccucug uccugaaguu 420guaacagcca ggggccagga
uggaggagga ggaccccaua aucugcccau cugggacuug 480gcaggggacc
ugggaaaaug uaccccaacc caucccuuaa gggccuuugu cuuuggccca
540uuggccuagc aucuacuucu ucaccguguc uguucuuguc acaccuaguc
aggucuguuu 600gggucugagg ugcauggaac auucugggua ggccuccagc
aaacggaagc ucuucaccgu 660guuuccagcc ugggaccaag ggcagcauac
uggcaaaguu gccaaagcaa gggacuccag 720ccucuuagga guuaaugacu
cccucucccc agcuguccuc cccuuggugc uccucuuccu 780cccuccuccu
gcucacagca ggcagggccu agacccggga gccaugcugc ugugcuguug
840ccaggggagc acggaggcag aucugagcua ugcagggaaa aggcccagcc
ugucaaagug 900ucugagauga accgccgccg ucccugugca gcugggcuca
gacgugucuc agcucuuguu 960cugugccuga gaauggcgaa acccagugag
guucaagggc aaacucgcua uucauuaguc 1020agggguucuu gacgucccgu
cucucccagg gaugaguucc ccccuccucu uucucccccu 1080ccuaugacac
auuccugggu gccuuuggug aggacugcac acccuccucc ugccuagccc
1140ccucuccaaa ggccccugaa uaaacucccc ccaaggagac caggcagggc
agagacaaug 1200gcugcaggaa aucauucagg cgggacaugc uggccugccc
uccacccagu cccccugugg 1260gccccacucc cuucugauuc agggcacccu
ugggccccca gccuauacag gccuggacag 1320gaagaaacca cugggaacca
cccuaaggac aacaugcuag uccagugcca uucuucgcug 1380gcucuguggg
ugccuuugug gccuguaccg acuggcuggc uaauuuugug guuucuguac
1440caucacaugc cuauuuuaag acacucucca gcacugucgg uuagggagug
uaaauuuugc 1500aauauuuucu gaaauguggc aauaucaaaa uguaaaaggc
acacauacuu ggucacaaac 1560aaauggcacu auuuacucug ugggcauauu
uguaaaaguu gccaaagaau uauauacaag 1620gauguucauc agagcauuuc
uuuugaagag uaaagaaaug gacaugaacc ugugguccgu 1680ucauacggug
gaauaccuau gcagcuguaa aaaucagugu gguagaucuc cguauaugag
1740uugaugugga agguuggcca guucacauga uaaggugaau agaauaaguu
acagaacagg 1800cuguagagua ugaucuuauu uguagauguu uaaaacugag
ucauaaguau gcuuauauac 1860agaucguuuc uggaaguaug uacuggaagu
cuaccucugg ggagugggga ugggggagug 1920cacucuucua uacuguuaua
uuuucuuuuc augcuccuaa gguacuuuua uuggaagaug 1980uaaagcgguu
caauguaaua ggcuuaacuu cugucaacua aguuggcgug ggugcuuuaa
2040gaggguggua gugauguugc uggagaaagu aucccacagu cacugguggc
uucagccacg 2100ggccauuuug gggccuaaua aucacauauc aucaugguug
cuaguguuaa ucgaaaaccu 2160acuaagugcc aggcuuacug ucucuggguc
uugcuuacgu ggaugucauu uuuccaguug 2220caccaaaucg aaagagguua
auugguuugu uggaguuccu uuguagguga agggcagagc 2280caggagcuug
gcuagggaca ggggagguga gugggggaug guggauaggu cuuggcuccc
2340aguuuccuuc ugggcagaca uugccccucu gcccugagga ccugcuuguu
ugggggaaga 2400ggccuuuaga ggcaccaggg ucaugccagg uguuggacau
ggugaacugg gaagugcucc 2460caucuggcca cagcgcagaa guaucaccgu
gcugggggau ggggaacagg gcugugaaug 2520ggccuauuug cauaagcagc
augugucugg agagaaagac aucacagagc agaagagugc 2580gggugcccag
gagugcacuu gccaccccua cuucaucccu gaaagaguaa auggccugga
2640aggugucucu gagagguaau gccgcacacc acccucccug ggggcagggu
caggcuacac 2700cugccuuagg ucgggggcug cagcagccug agagcucuca
guagggccuc aguagccugg 2760gagggagcag gggcaggggg cagggaaaga
ggcguaaugg ggcuguccag aggggccugg 2820gaaaccuggu cccugaggcc
ugggcacagc uacaaucacu ucaaauuggc uguggggcca 2880guggacuggg
aaggaaaaaa gcaauaagag ugaccaagug cagaaggcug ucagguccca
2940ggucacaugc cuuagugcag ugacuccuca ucauuuuaug gggugugggu
gucguuggua 3000cacccauuuu acagaugagg acaccgaggc ccagaaaagu
uaaguuacau guccuaaguc 3060acacagcuug uaagugccag aacugagauc
aaaaccaagu cucuuugacu uuaaagucug 3120uacucugacc ccaaagagau
ccuguuuggc cacuuauagg aggucccuaa agcugcagac 3180uccccuugcc
ggcacccaca uauagagaca uuaacccuuc cccugcaggg ucaccucaaa
3240uagucuuuua gcugggcuuc uccugcaauu ccaccuaaug ccauccccug
gguuuugccc 3300aaaccugaac ugggcagugg ggugagagga gggguuuaca
ggguuacaga gccucauaca 3360gauaggagcc cauggcugcu ggucaucugc
auuccugcag gauuggcugu uccuuggggu 3420ccuuggcagg aaaaugagga
uugcuccgag gccugcucca guacuuccca gaggcuggcc 3480uggugugggg
cucugggaag gcugaggcug gagaagcgua aguaggaggg cagagauggc
3540acucagguag cuugaaucac caggacccuu ccaagcccca cagguucuga
gggaguacua 3600gggccagcuc ugggagaggu cucuuccuau gcugugaacc
cccugccuuu cuugcagccu 3660acaacgaaua aauuuucuuu gcaaaggcuu
3690484093DNAHomo sapiens 48cuuuuagcca ccccagugcu gggcagccag
ggugugggcu uuugacugaa ugcacuugcc 60cuccugcauu cauuacacca uugucagugu
gugugucugg ggcugccucu gggugugcau 120gguuuuuuuu gugucugcgu
gucaguguca ggcuaugugu gucuguuucu gucggccugu 180cuaggcgcgc
ucagugcaac aaggagcugg gggagguggc gguaaagagg aagggcauuu
240caaagcccag cuguccuccu cagggaccuc aggagaugcg ugugugugug
ugugugugug 300ugugugugug ugugugugua uuuuuuucca ugcugcucau
uguguggggc ugcaugcgag 360ugucugacca gguguggugu gagcagccgc
ugggcugggu gagccccauc ugccgugagc 420ucccagacuu gccuucuagc
ccucugccgc cauccauggg gagccucucc cuucgcagcu 480caccgucucu
ucucuaauuu auuagcugga aaggaggaga acaugaaaca uugcuugaag
540acaauggccg agacagcagg ucccacccug cacagccacc agcaucucuc
cccucagccc 600ugucuccucu ucugcaguug ggaucugcac auuuaagccu
gaaauugucc ugugaaguga 660aguaugaucg gacagccucu uuucagcuuu
uaugacaaug gagacagagg aauuguggcu 720cuugccaagg ucacaggauu
ggaauacaga gccaagccac cccaggacau gcaagagccu 780cagaagggaa
aaaagcccag caggaaggga gaacaaguag ccucuguccu gaaguuguaa
840cagccagggg ccaggaugga ggaggaggac cccauaaucu gcccaucugg
gacuuggcag 900gggaccuggg aaaauguacc ccaacccauc ccuuaagggc
cuuugucuuu ggcccauugg 960ccuagcaucu acuucuucac cgugucuguu
cuugucacac cuagucaggu cuguuugggu 1020cugaggugca uggaacauuc
uggguaggcc uccagcaaac ggaagcucuu caccguguuu 1080ccagccuggg
accaagggca gcauacuggc aaaguugcca aagcaaggga cuccagccuc
1140uuaggaguua augacucccu cuccccagcu guccuccccu uggugcuccu
cuuccucccu 1200ccuccugcuc acagcaggca gggccuagac ccgggagcca
ugcugcugug cuguugccag 1260gggagcacgg aggcagaucu gagcuaugca
gggaaaaggc ccagccuguc aaagugucug 1320agaugaaccg ccgccguccc
ugugcagcug ggcucagacg ugucucagcu cuuguucugu 1380gccugagaau
ggcgaaaccc agugagguuc aagggcaaac ucgcuauuca uuagucaggg
1440guucuugacg ucccgucucu cccagggaug aguucccccc uccucuuucu
cccccuccua 1500ugacacauuc cugggugccu uuggugagga cugcacaccc
uccuccugcc uagcccccuc 1560uccaaaggcc ccugaauaaa cuccccccaa
ggagaccagg cagggcagag acaauggcug 1620caggaaauca uucaggcggg
acaugcuggc cugcccucca cccagucccc cugugggccc 1680cacucccuuc
ugauucaggg cacccuuggg cccccagccu auacaggccu ggacaggaag
1740aaaccacugg gaaccacccu aaggacaaca ugcuagucca gugccauucu
ucgcuggcuc 1800ugugggugcc uuuguggccu guaccgacug gcuggcuaau
uuugugguuu cuguaccauc 1860acaugccuau uuuaagacac ucuccagcac
ugucgguuag ggaguguaaa uuuugcaaua 1920uuuucugaaa uguggcaaua
ucaaaaugua aaaggcacac auacuugguc acaaacaaau 1980ggcacuauuu
acucuguggg cauauuugua aaaguugcca aagaauuaua uacaaggaug
2040uucaucagag cauuucuuuu gaagaguaaa gaaauggaca ugaaccugug
guccguucau 2100acgguggaau accuaugcag cuguaaaaau caguguggua
gaucuccgua uaugaguuga 2160uguggaaggu uggccaguuc acaugauaag
gugaauagaa uaaguuacag aacaggcugu 2220agaguaugau cuuauuugua
gauguuuaaa acugagucau aaguaugcuu auauacagau 2280cguuucugga
aguauguacu ggaagucuac cucuggggag uggggauggg ggagugcacu
2340cuucuauacu guuauauuuu cuuuucaugc uccuaaggua cuuuuauugg
aagauguaaa 2400gcgguucaau guaauaggcu uaacuucugu caacuaaguu
ggcgugggug cuuuaagagg 2460gugguaguga uguugcugga gaaaguaucc
cacagucacu gguggcuuca gccacgggcc 2520auuuuggggc cuaauaauca
cauaucauca ugguugcuag uguuaaucga aaaccuacua 2580agugccaggc
uuacugucuc ugggucuugc uuacguggau gucauuuuuc caguugcacc
2640aaaucgaaag agguuaauug guuuguugga guuccuuugu aggugaaggg
cagagccagg 2700agcuuggcua gggacagggg aggugagugg gggauggugg
auaggucuug gcucccaguu 2760uccuucuggg cagacauugc cccucugccc
ugaggaccug cuuguuuggg ggaagaggcc 2820uuuagaggca ccagggucau
gccagguguu ggacauggug aacugggaag ugcucccauc 2880uggccacagc
gcagaaguau caccgugcug ggggaugggg aacagggcug ugaaugggcc
2940uauuugcaua agcagcaugu gucuggagag aaagacauca cagagcagaa
gagugcgggu 3000gcccaggagu gcacuugcca ccccuacuuc aucccugaaa
gaguaaaugg ccuggaaggu 3060gucucugaga gguaaugccg cacaccaccc
ucccuggggg cagggucagg cuacaccugc 3120cuuaggucgg gggcugcagc
agccugagag cucucaguag ggccucagua gccugggagg 3180gagcaggggc
agggggcagg gaaagaggcg uaauggggcu guccagaggg gccugggaaa
3240ccuggucccu gaggccuggg cacagcuaca aucacuucaa auuggcugug
gggccagugg 3300acugggaagg aaaaaagcaa uaagagugac caagugcaga
aggcugucag gucccagguc 3360acaugccuua gugcagugac uccucaucau
uuuauggggu gugggugucg uugguacacc 3420cauuuuacag augaggacac
cgaggcccag aaaaguuaag uuacaugucc uaagucacac 3480agcuuguaag
ugccagaacu gagaucaaaa ccaagucucu uugacuuuaa agucuguacu
3540cugaccccaa agagauccug uuuggccacu uauaggaggu cccuaaagcu
gcagacuccc 3600cuugccggca cccacauaua gagacauuaa cccuuccccu
gcagggucac cucaaauagu 3660cuuuuagcug ggcuucuccu gcaauuccac
cuaaugccau ccccuggguu uugcccaaac 3720cugaacuggg caguggggug
agaggagggg uuuacagggu uacagagccu cauacagaua 3780ggagcccaug
gcugcugguc aucugcauuc cugcaggauu ggcuguuccu ugggguccuu
3840ggcaggaaaa ugaggauugc uccgaggccu gcuccaguac uucccagagg
cuggccuggu 3900guggggcucu gggaaggcug aggcuggaga agcguaagua
ggagggcaga gauggcacuc 3960agguagcuug aaucaccagg acccuuccaa
gccccacagg uucugaggga guacuagggc 4020cagcucuggg agaggucucu
uccuaugcug ugaacccccu gccuuucuug cagccuacaa 4080cgaauaaauu uuc
409349957DNAHomo sapiens 49aaggcggcgc tgggagccgc tcagagccca
gagaagcggc gcgcggccag gagcccccgc 60tccgccactg ccgtgcctgc ctcccgcagc
tgtctgccat gcgctcgccg gggcaggggc 120gcccggaggg cggctagagc
tgggcctgag cccgggaacg cgcctgatca ggggtggcgg 180agccgcggtc
cccacagccg ccccacccgc gccgctgcct cgctggggcc cgggccccct
240tcccgttact cccctgctgg tgcctccctc cttggcgcgc ttcccacctg
cgatcggcgc 300cctcttcgca gtcacgaact cgccagcagc tagcagcact
gactagtagg agggcccgcc 360ggaggagagg acatgctctg gagtcagaag
acagcgaaaa gagaagcaga agccccggtg 420gcaagagtct gaagcaggaa
ggatgactgt agcctgtgga ttgtactgca gtaggaaact 480gtcctagcaa
ggctccactt tgccccagct tcaagctgga aaggaggaga acatgaaaca
540ttgcttgaag acaatggccg agacagcagg tcccaccctg cacagccacc
agcatctctc 600ccctcagccc tgtctcctct tctgcagttg ggatctgcac
atttaagcct gaaattgtcc 660tgtgaagtga agtatgatcg gacagcctct
tttcagcttt tatgacaatg gagacagagg 720aattgtggct cttgccaagg
tcacaggatt ggaatacaga gccaagccac cccaggacat 780gcaagagcct
cagaagggaa aaaagcccag caggaaggga gaacaagtag cctctgtcct
840gaagttgtaa cagccagggg ccaggatgga ggaggaggac cccataatct
gcccatctgg 900gacttggcag gggacctggg aaaatgtacc ccaacccatc
ccttaagggc ctttgtc 957502124DNAHomo sapiens 50gattctcaca acttctgcgt
gcgagcgccc gccccaccga ccgccccggc ccggcccgca 60agagccagag gagccgagag
gagcccagcg ccggcccagc ggactccagc tcgacggagc 120ggccgcgccc
cgaccagtta ctcccctgct ggtgcctccc tccttggcgc gcttcccacc
180tgcgatcggc gccctcttcg cagtcacgaa ctcgccagca gctagcagca
ctgactagta 240ggagggcccg ccggaggaga ggaagcccca gagagattgg
tgagggtgat ttcccaggaa 300gacgcagtgt gctctgactt ctgtgacagt
gagcaacggg accagtggat gtccagatgc 360tggcaatgag acatgctctg
gagtcagaag acagcgaaaa gagaagcaga agccccggtg 420gcaagagtct
gaagcaggaa ggatgactgt agcctgtgga ttgtactgca gtaggaaact
480gtcctagcaa ggctccactt tgccccagct tcaagctgga aaggaggaga
acatgaaaca 540ttgcttgaag acaatggccg agacagcagg tcccaccctg
cacagccacc agcatctctc 600ccctcagccc tgtctcctct tctgcagttg
ggatctgcac atttaagcct gaaattgtcc 660tgtgaagtga agtatgatcg
gacagcctct tttcagcttt tatgacaatg gagacagagg 720aattgtggct
cttgccaagg tcacaggatt ggaatacaga gccaagccac cccaggacat
780gcaagagcct cagaagggaa aaaagcccag caggaaggga gaacaagtag
cctctgtcct 840gaagttgtaa cagccagggg ccaggatgga ggaggaggac
cccataatct gcccatctgg 900gacttggcag gggacctggg aaaatgtacc
ccaacccatc ccttaagggc ctttgtcttt 960ggcccattgg cctagcatct
acttcttcac cgtgtctgtt cttgtcacac ctagtcaggt 1020ctgtttgggt
ctgaggtgca tggaacattc tgggtaggcc tccagcaaac ggaagctctt
1080caccgtgttt ccagcctggg accaagggca gcatactggc aaagttgcca
aagcaaggga 1140ctccagcctc ttaggagtta atgactccct ctccccagct
gtcctcccct tggtgctcct 1200cttcctccct cctcctgctc acagcaggca
gggcctagac ccgggagcca tgctgctgtg 1260ctgttgccag gggagcacgg
aggcagatct gagctatgca gggaaaaggc ccagcctgtc 1320aaagtgtctg
agatgaaccg ccgccgtccc tgtgcagctg ggctcagacg tgtctcagct
1380cttgttctgt gcctgagaat ggcgaaaccc agtgaggttc aagggcaaac
tcgctattca 1440ttagtcaggg gttcttgacg tcccgtctct cccagggatg
agttcccccc tcctctttct 1500ccccctccta tgacacattc
ctgggtgcct ttggtgagga ctgcacaccc tcctcctgcc 1560tagccccctc
tccaaaggcc cctgaataaa ctccccccaa ggagaccagg cagggcagag
1620acaatggctg caggaaatca ttcaggcggg acatgctggc ctgcccttca
cctcaaatag 1680tcttttagct gggcttctcc tgcaattcca cctaatgcca
tcccctgggt tttgcccaaa 1740cctgaactgg gcagtggggt gagaggaggg
gtttacaggg ttacagagcc tcatacagat 1800aggagcccat ggctgctggt
catctgcatt cctgcaggat tggctgttcc ttggggtcct 1860tggcaggaaa
atgaggattg ctccgaggcc tgctccagta cttcccagag gctggcctgg
1920tgtggggctc tgggaaggct gaggctggag aagcgtaagt aggagggcag
agatggcact 1980caggtagctt gaatcaccag gacccttcca agccccacag
gttctgaggg agtactaggg 2040ccagctctgg gagaggtctc ttcctatgct
gtgaaccccc tgcctttctt gcagcctaca 2100acgaataaat tttctttgca aagg
212451576DNAHomo sapiens 51agtgcgtggg ggtcccggcc ccacacagtg
ctagggtccc tctcgagttt ctcatctgcc 60ttcaggtcac tttccaccct gatgccttgg
cttgtcctga agctcagggc ccctgtagct 120tgggaaacct cccaagctcc
ccagcgagtg gctgtagacc aaggaaggga ccctgcccgg 180cttcagggaa
gaaaggaaga aagttactcc cctgctggtg cctccctcct tggcgcgctt
240cccacctgcg atcggcgccc tcttcgcagt cacgaactcg ccagcagcta
gcagcactga 300ctagtaggag ggcccgccgg aggagagccg cgcggcccac
agaagcggaa cgcgcgtcga 360gagcgccctg tccgctcgcc ccagacagat
gcccggttat tcattaccgc gaggcctaga 420ggaaagagtg gctgccgtct
tcctgcccac agcccgccgg accctccgtc gcggctgccc 480ggtccccgga
gccgcagccg ccgagcccgg ctgtgcgtgt cgtggctgct ggggagaaag
540aggcttccgg acatgctctg gagtcagaag acagcg 576526547DNAHomo sapiens
52gagcaatgtc ctgggaggcc tggctgagct tgtgtccagg agcactggac ttgtgttaaa
60cactgtcccc ttggatgggc ccagaagtca aacctgtcca ttagattttt tttttttttc
120ctttgggaga gctggtatgg gctggttgtc ctccaggaga gccctgttct
cacccgaggt 180ctgttaatga gctggggaca ggtgagcctc acacgttcca
acttggctgc cttcagcggc 240atccaggagc agtggtgagc tattaatgga
aggtgccggc tttgtgctaa ttagaacttc 300ctttcagctt ccatctgtgc
agacactgga gcccctcact ggtcagcctc gccgtcccaa 360cccccctcag
tttgcaacct agtttttgtc ccccccaccc cccatgaatt agggggtgct
420atgagtggag ctgctttcct ctagctctgg tcaaatcccg gctctttgtg
tattgcagaa 480ctgtactggg tggtatttct cagggcttct ctctcttgtt
ggggtggaga tggacctgga 540agatggagtt ggaaagggat ttgggcacca
tggccaccct cctgggtagg ctggacttac 600atcatcgacc tgagtttgtt
ttgtgaaaga cctctctcct ctgccctctg gagactgtga 660cttcagaccc
ttgtcctctc cattacccca gtcctgatgt ctcccaagtc tgatggtacc
720cacccatgtc aactacagct gccatctttg ccatctcagg cagcctacag
gtgggggctg 780tgtccttgac cctcttctga aaaagaaaaa cctatttttt
tccttcactt ttgcttttta 840tttctttcac ctcaggccca atggatatat
atatatatat atatatatat acatatatat 900atatacatat atacatatat
atatatatat atatatatat attttaatgg aggttgtctc 960ttacagaggt
ttcattgaaa aagaagaaac aatgtcccat taacgtcatt taaagaaaaa
1020gcacctctca gaatggaggt tgggaaagct agggtttctt gcctgaatat
cagttgggat 1080gaaatccctt gtaaggaact caagagagga gcgtttcctc
agaatgcttt ctttagctcc 1140tgagtctcct taggtctcca ctggggttgt
gtgtaaaaat accaagccct ccctgacaat 1200gcatctcatt ctcttctgct
tgaattatcc tgataatgag agatcaccca cttctttgac 1260agtgtagact
aggattttaa aaattgggag tgaattattg gacagtgtgg cacttcacca
1320gcttccctca aggttctgga tctatgctaa agaggggtgg aaatgcttct
ggggtgtcca 1380gaagggctgc agaaattcct cgtcactggt catggggaga
gcaggactgg cttgcctctg 1440tggcctcttc tgcctctgga ggtgacaatt
cctgatttga ggcactaggg tggaagactc 1500aggactatcc aggaccaggt
taataaaccg gcagtccaga ttgcagaagg gcagcagctg 1560ggggctgggg
acatgcccat gcctgtggga cagagttctt ttgcatgctt tggcctttac
1620gactctgtat ccttgacaag tcacaggcat ctctgggtga aatggggaca
atagtaccca 1680tacctccaag ggttatgtga gaattaagta aaatatgcaa
ataaagtgcc tggcatacag 1740taggcactga gcaaacggta gctctttttt
ccaggctggg gcaagggatg catataaatg 1800tctggatctg aagtttgaaa
ttccacctgc tggagacagt gaacacccca gtagataccc 1860caaatcacac
agaaggacgg atgaccagct gccttcttcc cccagggcat gccatacaca
1920ctgggcctga agtgggagaa tcgggacccc aaaaaaacgg cttgtggagc
ggggttgcac 1980atgggtgtaa agttcccagc ttggctgcct ggggaggggg
agcatgtaaa tgtctttaga 2040gatttgaagg gaccaggatc tggactgatt
tgcgttgccc agggggctgg ggctgggagc 2100caagggggtg ctgccgggag
gcccaggtta gcttggggta tggcatttct aacagttggc 2160gcctgcggaa
aatggcctgg ggttccagct ctggaaggtt ccgaatctca gtattcacga
2220gcggcgctgt ccggagcagc cagggttgtc ccttggtggt ctcgggcagg
ttctccgcga 2280tgcgcttgct gggtcgcagg tgagaacctc acggttctcc
atttccggag atccagctct 2340gagcaggcag agggtcgctc ccgtcgcctg
cccctgcggt agccaagcgg gtggctggaa 2400gcgtggctag ctggcaggta
aggagctcca ggtgagacgg aacacgaccc ccaaccccct 2460tagccggtgc
cccacccgat ttctctcctg cgtcctggga gggcatggtt gaggcgccac
2520cggtgcccag caacctcccc aggctgtggt tgtgacctga ggacgcgtgt
gtccccgccc 2580tcaggccacc gctacgcgac cctgagtgca ccttcaagaa
ggccgggcac gtttctgggc 2640gggcgtgggg ggtgcctgat atctccgctc
tattttacag ttactcccct gctggtgcct 2700ccctccttgg cgcgcttccc
acctgcgatc ggcgccctct tcgcagtcac gaactcgcca 2760gcagctagca
gcactgacta gtaggagggc ccgccggagg agaggacatg ctctggagtc
2820agaagacagc gaaaagagaa gcagaagccc cggtggcaag agtctgaagg
aaggatgact 2880gtagcctgtg gattgtactg cagtaggaaa ctgtcctagc
aaggctccac tttgccccag 2940cttcaagctg gaaaggagga gaacatgaaa
cattgcttga agacaatggc cgagacagca 3000ggtcccaccc tgcacagcca
ccagcatctc tcccctcagc cctgtctcct cttctgcagt 3060tgggatctgc
acatttaagc ctgaaattgt cctgtgaagt gaagtatgat cggacagcct
3120cttttcagct tttatgacaa tggagacaga ggaattgtgg ctcttgccaa
ggtcacagga 3180ttggaataca gagccaagcc accccaggac atgcaagagc
ctcagaaggg aaaaaagccc 3240agcaggaagg gagaacaagt agcctctgtc
ctgaagttgt aacagccagg ggccaggatg 3300gaggaggagg accccataat
ctgcccatct gggacttggc aggggacctg ggaaaatgta 3360ccccaaccca
tcccttaagg gcctttgtct ttggcccatt ggcctagcat ctacttcttc
3420accgtgtctg ttcttgtcac acctagtcag gtctgtttgg gtctgaggtg
catggaacat 3480tctgggtagg cctccagcaa acggaagctc ttcaccgtgt
ttccagcctg ggaccaaggg 3540cagcatactg gcaaagttgc caaagcaagg
gactccagcc tcttaggagt taatgactcc 3600ctctccccag ctgtcctccc
cttggtgctc ctcttcctcc ctcctcctgc tcacagcagg 3660cagggcctag
acccgggagc catgctgctg tgctgttgcc aggggagcac ggaggcagat
3720ctgagctatg cagggaaaag gcccagcctg tcaaagtgtc tgagatgaac
cgccgccgtc 3780cctgtgcagc tgggctcaga cgtgtctcag ctcttgttct
gtgcctgaga atggcgaaac 3840ccagtgaggt tcaagggcaa actcgctatt
cattagtcag gggttcttga cgtcccgtct 3900ctcccaggga tgagttcccc
cctcctcttt ctccccctcc tatgacacat tcctgggtgc 3960ctttggtgag
gactgcacac cctcctcctg cctagccccc tctccaaagg cccctgaata
4020aactcccccc aaggagacca ggcagggcag agacaatggc tgcaggaaat
cattcaggcg 4080ggacatgctg gcctgccctc cacccagtcc ccctgtgggc
cccactccct tctgattcag 4140ggcacccttg ggcccccagc ctatacaggc
ctggacagga agaaaccact gggaaccacc 4200ctaaggacaa catgctagtc
cagtgccatt cttcgctggc tctgtgggtg cctttgtggc 4260ctgtaccgac
tggctggcta attttgtggt ttctgtacca tcacatgcct attttaagac
4320actctccagc actgtcggtt agggagtgta aattttgcaa tattttctga
aatgtggcaa 4380tatcaaaatg taaaaggcac acatacttgg tcacaaacaa
atggcactat ttactctgtg 4440ggcatatttg taaaagttgc caaagaatta
tatacaagga tgttcatcag agcatttctt 4500ttgaagagta aagaaatgga
catgaacctg tggtccgttc atacggtgga atacctatgc 4560agctgtaaaa
atcagtgtgg tagatctccg tatatgagtt gatgtggaag gttggccagt
4620tcacatgata aggtgaatag aataagttac agaacaggct gtagagtatg
atcttatttg 4680tagatgttta aaactgagtc ataagtatgc ttatatacag
atcgtttctg gaagtatgta 4740ctggaagtct acctctgggg agtggggatg
ggggagtgca ctcttctata ctgttatatt 4800ttcttttcat gctcctaagg
tacttttatt ggaagatgta aagcggttca atgtaatagg 4860cttaacttct
gtcaactaag ttggcgtggg tgctttaaga gggtggtagt gatgttgctg
4920gagaaagtat cccacagtca ctggtggctt cagccacggg ccattttggg
gcctaataat 4980cacatatcat catggttgct agtgttaatc gaaaacctac
taagtgccag gcttactgtc 5040tctgggtctt gcttacgtgg atgtcatttt
tccagttgca ccaaatcgaa agaggttaat 5100tggtttgttg gagttccttt
gtaggtgaag ggcagagcca ggagcttggc tagggacagg 5160ggaggtgagt
gggggatggt ggataggtct tggctcccag tttccttctg ggcagacatt
5220gcccctctgc cctgaggacc tgcttgtttg ggggaagagg cctttagagg
caccagggtc 5280atgccaggtg ttggacatgg tgaactggga agtgctccca
tctggccaca gcgcagaagt 5340atcaccgtgc tgggggatgg ggaacagggc
tgtgaatggg cctatttgca taagcagcat 5400gtgtctggag agaaagacat
cacagagcag aagagtgcgg gtgcccagga gtgcacttgc 5460cacccctact
tcatccctga aagagtaaat ggcctggaag gtgtctctga gaggtaatgc
5520cgcacaccac cctccctggg ggcagggtca ggctacacct gccttaggtc
gggggctgca 5580gcagcctgag agctctcagt agggcctcag tagcctggga
gggagcaggg gcagggggca 5640gggaaagagg cgtaatgggg ctgtccagag
gggcctggga aacctggtcc ctgaggcctg 5700ggcacagcta caatcacttc
aaattggctg tggggccagt ggactgggaa ggaaaaaagc 5760aataagagtg
accaagtgca gaaggctgtc aggtcccagg tcacatgcct tagtgcagtg
5820actcctcatc attttatggg gtgtgggtgt cgttggtaca cccattttac
agatgaggac 5880accgaggccc agaaaagtta agttacatgt cctaagtcac
acagcttgta agtgccagaa 5940ctgagatcaa aaccaagtct ctttgacttt
aaagtctgta ctctgacccc aaagagatcc 6000tgtttggcca cttataggag
gtccctaaag ctgcagactc cccttgccgg cacccacata 6060tagagacatt
aacccttccc ctgcagggtc acctcaaata gtcttttagc tgggcttctc
6120ctgcaattcc acctaatgcc atcccctggg ttttgcccaa acctgaactg
ggcagtgggg 6180tgagaggagg ggtttacagg gttacagagc ctcatacaga
taggagccca tggctgctgg 6240tcatctgcat tcctgcagga ttggctgttc
cttggggtcc ttggcaggaa aatgaggatt 6300gctccgaggc ctgctccagt
acttcccaga ggctggcctg gtgtggggct ctgggaaggc 6360tgaggctgga
gaagcgtaag taggagggca gagatggcac tcaggtagct tgaatcacca
6420ggacccttcc aagccccaca ggttctgagg gagtactagg gccagctctg
ggagaggtct 6480cttcctatgc tgtgaacccc ctgcctttct tgcagcctac
aacgaataaa ttttctttgc 6540aaaggct 6547533814DNAHomo sapiens
53cacgtttctg ggcgggcgtg gggggtgcct gatatctccg ctctatttta cagttactcc
60cctgctggtg cctccctcct tggcgcgctt cccacctgcg atcggcgccc tcttcgcagt
120cacgaactcg ccagcagcta gcagcactga ctagtaggag ggcccgccgg
aggagaggac 180atgctctgga gtcagaagac agcgaaaaga gaagcagaag
ccccggtggc aagagtctga 240agctggaaag gaggagaaca tgaaacattg
cttgaagaca atggccgaga cagcaggtcc 300caccctgcac agccaccagc
atctctcccc tcagccctgt ctcctcttct gcagttggga 360tctgcacatt
taagcctgaa attgtcctgt gaagtgaagt atgatcggac agcctctttt
420cagcttttat gacaatggag acagaggaat tgtggctctt gccaaggtca
caggattgga 480atacagagcc aagccacccc aggacatgca agagcctcag
aagggaaaaa agcccagcag 540gaagggagaa caagtagcct ctgtcctgaa
gttgtaacag ccaggggcca ggatggagga 600ggaggacccc ataatctgcc
catctgggac ttggcagggg acctgggaaa atgtacccca 660acccatccct
taagggcctt tgtctttggc ccattggcct agcatctact tcttcaccgt
720gtctgttctt gtcacaccta gtcaggtctg tttgggtctg aggtgcatgg
aacattctgg 780gtaggcctcc agcaaacgga agctcttcac cgtgtttcca
gcctgggacc aagggcagca 840tactggcaaa gttgccaaag caagggactc
cagcctctta ggagttaatg actccctctc 900cccagctgtc ctccccttgg
tgctcctctt cctccctcct cctgctcaca gcaggcaggg 960cctagacccg
ggagccatgc tgctgtgctg ttgccagggg agcacggagg cagatctgag
1020ctatgcaggg aaaaggccca gcctgtcaaa gtgtctgaga tgaaccgccg
ccgtccctgt 1080gcagctgggc tcagacgtgt ctcagctctt gttctgtgcc
tgagaatggc gaaacccagt 1140gaggttcaag ggcaaactcg ctattcatta
gtcaggggtt cttgacgtcc cgtctctccc 1200agggatgagt tcccccctcc
tctttctccc cctcctatga cacattcctg ggtgcctttg 1260gtgaggactg
cacaccctcc tcctgcctag ccccctctcc aaaggcccct gaataaactc
1320cccccaagga gaccaggcag ggcagagaca atggctgcag gaaatcattc
aggcgggaca 1380tgctggcctg ccctccaccc agtccccctg tgggccccac
tcccttctga ttcagggcac 1440ccttgggccc ccagcctata caggcctgga
caggaagaaa ccactgggaa ccaccctaag 1500gacaacatgc tagtccagtg
ccattcttcg ctggctctgt gggtgccttt gtggcctgta 1560ccgactggct
ggctaatttt gtggtttctg taccatcaca tgcctatttt aagacactct
1620ccagcactgt cggttaggga gtgtaaattt tgcaatattt tctgaaatgt
ggcaatatca 1680aaatgtaaaa ggcacacata cttggtcaca aacaaatggc
actatttact ctgtgggcat 1740atttgtaaaa gttgccaaag aattatatac
aaggatgttc atcagagcat ttcttttgaa 1800gagtaaagaa atggacatga
acctgtggtc cgttcatacg gtggaatacc tatgcagctg 1860taaaaatcag
tgtggtagat ctccgtatat gagttgatgt ggaaggttgg ccagttcaca
1920tgataaggtg aatagaataa gttacagaac aggctgtaga gtatgatctt
atttgtagat 1980gtttaaaact gagtcataag tatgcttata tacagatcgt
ttctggaagt atgtactgga 2040agtctacctc tggggagtgg ggatggggga
gtgcactctt ctatactgtt atattttctt 2100ttcatgctcc taaggtactt
ttattggaag atgtaaagcg gttcaatgta ataggcttaa 2160cttctgtcaa
ctaagttggc gtgggtgctt taagagggtg gtagtgatgt tgctggagaa
2220agtatcccac agtcactggt ggcttcagcc acgggccatt ttggggccta
ataatcacat 2280atcatcatgg ttgctagtgt taatcgaaaa cctactaagt
gccaggctta ctgtctctgg 2340gtcttgctta cgtggatgtc atttttccag
ttgcaccaaa tcgaaagagg ttaattggtt 2400tgttggagtt cctttgtagg
tgaagggcag agccaggagc ttggctaggg acaggggagg 2460tgagtggggg
atggtggata ggtcttggct cccagtttcc ttctgggcag acattgcccc
2520tctgccctga ggacctgctt gtttggggga agaggccttt agaggcacca
gggtcatgcc 2580aggtgttgga catggtgaac tgggaagtgc tcccatctgg
ccacagcgca gaagtatcac 2640cgtgctgggg gatggggaac agggctgtga
atgggcctat ttgcataagc agcatgtgtc 2700tggagagaaa gacatcacag
agcagaagag tgcgggtgcc caggagtgca cttgccaccc 2760ctacttcatc
cctgaaagag taaatggcct ggaaggtgtc tctgagaggt aatgccgcac
2820accaccctcc ctgggggcag ggtcaggcta cacctgcctt aggtcggggg
ctgcagcagc 2880ctgagagctc tcagtagggc ctcagtagcc tgggagggag
caggggcagg gggcagggaa 2940agaggcgtaa tggggctgtc cagaggggcc
tgggaaacct ggtccctgag gcctgggcac 3000agctacaatc acttcaaatt
ggctgtgggg ccagtggact gggaaggaaa aaagcaataa 3060gagtgaccaa
gtgcagaagg ctgtcaggtc ccaggtcaca tgccttagtg cagtgactcc
3120tcatcatttt atggggtgtg ggtgtcgttg gtacacccat tttacagatg
aggacaccga 3180ggcccagaaa agttaagtta catgtcctaa gtcacacagc
ttgtaagtgc cagaactgag 3240atcaaaacca agtctctttg actttaaagt
ctgtactctg accccaaaga gatcctgttt 3300ggccacttat aggaggtccc
taaagctgca gactcccctt gccggcaccc acatatagag 3360acattaaccc
ttcccctgca gggtcacctc aaatagtctt ttagctgggc ttctcctgca
3420attccaccta atgccatccc ctgggttttg cccaaacctg aactgggcag
tggggtgaga 3480ggaggggttt acagggttac agagcctcat acagatagga
gcccatggct gctggtcatc 3540tgcattcctg caggattggc tgttccttgg
ggtccttggc aggaaaatga ggattgctcc 3600gaggcctgct ccagtacttc
ccagaggctg gcctggtgtg gggctctggg aaggctgagg 3660ctggagaagc
gtaagtagga gggcagagat ggcactcagg tagcttgaat caccaggacc
3720cttccaagcc ccacaggttc tgagggagta ctagggccag ctctgggaga
ggtctcttcc 3780tatgctgtga accccctgcc tttcttgcag ccta
3814544198DNAHomo sapiens 54ttactcccct gctggtgcct ccctccttgg
cgcgcttccc acctgcgatc ggcgccctct 60tcgcagtcac gaactcgcca gcagctagca
gcactgacta gtaggagggc ccgccggagg 120agagccgcgc ggcccacaga
agcggaacgc gcgtcgagag cgccctgtcc gctcgcccca 180gacagatgcc
cggttattca ttaccgcgag gcctagagga aagagtggct gccgtcttcc
240tgcccacagc ccgccggacc ctccgtcgcg gctgcccggt ccccggagcc
gcagccgccg 300agcccggctg tgcgtgtcgt ggctgctggg gagaaagagg
cttccggaag ccccagagag 360attggtgagg gtgatttccc aggaagacgc
agtgtgctct gacttctgtg acagtgagca 420acgggaccag tggatgtcca
gatgctggca atgagacatg ctctggagtc agaagacagc 480gaaaagagaa
gcagaagccc cggtggcaag agtctgaagc aggaaggatg actgtagcct
540gtggattgta ctgcagtagg aaactgtcct agcaaggctc cactttgccc
cagcttcaag 600ctggaaagga ggagaacatg aaacattgct tgaagacaat
ggccgagaca gcaggtccca 660ccctgcacag ccaccagcat ctctcccctc
agccctgtct cctcttctgc agttgggatc 720tgcacattta agcctgaaat
tgtcctgtga agtgaagtat gatcggacag cctcttttca 780gcttttatga
caatggagac agaggaattg tggctcttgc caaggtcaca ggattggaat
840acagagccaa gccaccccag gacatgcaag agcctcagaa gggaaaaaag
cccagcagga 900agggagaaca agtagcctct gtcctgaagt tgtaacagcc
aggggccagg atggaggagg 960aggaccccat aatctgccca tctgggactt
ggcaggggac ctgggaaaat gtaccccaac 1020ccatccctta agggcctttg
tctttggccc attggcctag catctacttc ttcaccgtgt 1080ctgttcttgt
cacacctagt caggtctgtt tgggtctgag gtgcatggaa cattctgggt
1140aggcctccag caaacggaag ctcttcaccg tgtttccagc ctgggaccaa
gggcagcata 1200ctggcaaagt tgccaaagca agggactcca gcctcttagg
agttaatgac tccctctccc 1260cagctgtcct ccccttggtg ctcctcttcc
tccctcctcc tgctcacagc aggcagggcc 1320tagacccggg agccatgctg
ctgtgctgtt gccaggggag cacggaggca gatctgagct 1380atgcagggaa
aaggcccagc ctgtcaaagt gtctgagatg aaccgccgcc gtccctgtgc
1440agctgggctc agacgtgtct cagctcttgt tctgtgcctg agaatggcga
aacccagtga 1500ggttcaaggg caaactcgct attcattagt caggggttct
tgacgtcccg tctctcccag 1560ggatgagttc ccccctcctc tttctccccc
tcctatgaca cattcctggg tgcctttggt 1620gaggactgca caccctcctc
ctgcctagcc ccctctccaa aggcccctga ataaactccc 1680cccaaggaga
ccaggcaggg cagagacaat ggctgcagga aatcattcag gcgggacatg
1740ctggcctgcc ctccacccag tccccctgtg ggccccactc ccttctgatt
cagggcaccc 1800ttgggccccc agcctataca ggcctggaca ggaagaaacc
actgggaacc accctaagga 1860caacatgcta gtccagtgcc attcttcgct
ggctctgtgg gtgcctttgt ggcctgtacc 1920gactggctgg ctaattttgt
ggtttctgta ccatcacatg cctattttaa gacactctcc 1980agcactgtcg
gttagggagt gtaaattttg caatattttc tgaaatgtgg caatatcaaa
2040atgtaaaagg cacacatact tggtcacaaa caaatggcac tatttactct
gtgggcatat 2100ttgtaaaagt tgccaaagaa ttatatacaa ggatgttcat
cagagcattt cttttgaaga 2160gtaaagaaat ggacatgaac ctgtggtccg
ttcatacggt ggaataccta tgcagctgta 2220aaaatcagtg tggtagatct
ccgtatatga gttgatgtgg aaggttggcc agttcacatg 2280ataaggtgaa
tagaataagt tacagaacag gctgtagagt atgatcttat ttgtagatgt
2340ttaaaactga gtcataagta tgcttatata cagatcgttt ctggaagtat
gtactggaag 2400tctacctctg gggagtgggg atgggggagt gcactcttct
atactgttat attttctttt 2460catgctccta aggtactttt attggaagat
gtaaagcggt tcaatgtaat aggcttaact 2520tctgtcaact aagttggcgt
gggtgcttta agagggtggt agtgatgttg ctggagaaag 2580tatcccacag
tcactggtgg cttcagccac gggccatttt ggggcctaat aatcacatat
2640catcatggtt gctagtgtta atcgaaaacc tactaagtgc caggcttact
gtctctgggt 2700cttgcttacg tggatgtcat ttttccagtt gcaccaaatc
gaaagaggtt aattggtttg 2760ttggagttcc tttgtaggtg aagggcagag
ccaggagctt ggctagggac aggggaggtg 2820agtgggggat ggtggatagg
tcttggctcc cagtttcctt ctgggcagac attgcccctc 2880tgccctgagg
acctgcttgt ttgggggaag aggcctttag aggcaccagg gtcatgccag
2940gtgttggaca tggtgaactg ggaagtgctc ccatctggcc acagcgcaga
agtatcaccg 3000tgctggggga tggggaacag ggctgtgaat gggcctattt
gcataagcag catgtgtctg 3060gagagaaaga catcacagag cagaagagtg
cgggtgccca ggagtgcact tgccacccct 3120acttcatccc tgaaagagta
aatggcctgg aaggtgtctc tgagaggtaa tgccgcacac 3180caccctccct
gggggcaggg tcaggctaca cctgccttag gtcgggggct gcagcagcct
3240gagagctctc agtagggcct cagtagcctg ggagggagca ggggcagggg
gcagggaaag
3300aggcgtaatg gggctgtcca gaggggcctg ggaaacctgg tccctgaggc
ctgggcacag 3360ctacaatcac ttcaaattgg ctgtggggcc agtggactgg
gaaggaaaaa agcaataaga 3420gtgaccaagt gcagaaggct gtcaggtccc
aggtcacatg ccttagtgca gtgactcctc 3480atcattttat ggggtgtggg
tgtcgttggt acacccattt tacagatgag gacaccgagg 3540cccagaaaag
ttaagttaca tgtcctaagt cacacagctt gtaagtgcca gaactgagat
3600caaaaccaag tctctttgac tttaaagtct gtactctgac cccaaagaga
tcctgtttgg 3660ccacttatag gaggtcccta aagctgcaga ctccccttgc
cggcacccac atatagagac 3720attaaccctt cccctgcagg gtcacctcaa
atagtctttt agctgggctt ctcctgcaat 3780tccacctaat gccatcccct
gggttttgcc caaacctgaa ctgggcagtg gggtgagagg 3840aggggtttac
agggttacag agcctcatac agataggagc ccatggctgc tggtcatctg
3900cattcctgca ggattggctg ttccttgggg tccttggcag gaaaatgagg
attgctccga 3960ggcctgctcc agtacttccc agaggctggc ctggtgtggg
gctctgggaa ggctgaggct 4020ggagaagcgt aagtaggagg gcagagatgg
cactcaggta gcttgaatca ccaggaccct 4080tccaagcccc acaggttctg
agggagtact agggccagct ctgggagagg tctcttccta 4140tgctgtgaac
cccctgcctt tcttgcagcc tacaacgaat aaattttctt tgcaaagg
4198551402DNAHomo sapiens 55ctgtctcaag cctccaatca acagatcaga
cagcttgtac tcacaggcca aggacacgtg 60gaaagaggct caattttcta gatgggtggc
aacagccatg atcttctgtc ctctgggtcc 120ccacaagcct ggatgaactc
aagatctgac tcagtggcac agtgaggaga cctttgaggc 180ctcagtgacc
atccttggac ttcacctctc acggctttca ggcagagagg ccctcccatg
240cccacaacag gctgagccca gccttcctcg gggtttgctt ccaggcctga
cttttactcc 300cctttctaag tgaggcagcc atgactggcc acttcatgtg
ctcctggaga agggcttgca 360ccagccgttt tcaggaaagt caagcagctg
ttgactcctg agtctgggtg aatttgtgtg 420aagagcataa ggcgctgttt
cttaaccaaa acgcttcctc ttgcagtgca gatgggatgt 480gcttctccac
aggaggcccc acggcttccc cacccctcag aggagcgccg tgcgtgcgtc
540tgtgtggagg attggcagct cctgcagtcg gcccttggtc ctatttggcg
acgcctctgc 600cttcccctta attatacagt catgagccgc cctggaatca
cggcagctcc ggatggatcc 660tggatgccag aatgcagcct cagcacgggg
ctgcaggaca ggagtgagcg aggggctgca 720gagccggcgg ccgcggtggg
caccatggag ggggctgccc tgggcagcac gggcatgagt 780ctcaaggccc
aggtttgagt aacaggtgtt gagagcttac ttacttttcc tgagacacag
840tttcctcatc tcgagagcac ggaaaatcat tctaacttca gaggattgtt
gtgaaagtta 900aatgagatta aagaggtaaa gcccatgacg tgcttagctc
gtgcttggct cttggtcaat 960gccagttagc gctgcatttt ctcccctctc
cctccctcct tctctctttc ttttcttcta 1020ttctccattc ctgttttctc
ccccacccca ctccccaaag ctctgcgttg agaaccagat 1080gctgtctggt
gggttagggc cagaggagga aaagctgccc gccgtgggct gcacccatac
1140cctcttcatt ccaatgacat gaggggaggg gaaaggacag aggtagactg
tcctccccta 1200cctcctccta atacaaatgg aattcctgga actggaaaac
aaagaatacc cccataaaaa 1260taagacagta cttctggtgc ggtgtaataa
aggggaaagt aaccctcaat gtcaggaaac 1320tccgcacctc ccagctcata
tttgtgtgga ggaaaagtta aatattaatt tggactcaac 1380tgaatgtgga
cacaaacaat gg 140256295DNAHomo sapiens 56tctctcatct gtgttttcag
ggcatggact ggaactccca atacccctga catgggctga 60gtcaacgtgg tcatgaacat
gtgacaggag gcagcagaag ttgcagagaa gagtgaggca 120cgtttgaaaa
aggctgaaaa atgtttctgt ccaggcaagg gtgtgtgctg aatgactcaa
180ggattttttg gtgcattgaa tgaacagcgg gacattggac acctgctgat
ccatcacccc 240gggcccgggc aggcccgtgg atgaagagag atggagaaga
ccaggcatga gactg 29557374DNAHomo sapiens 57gcagcagaag ttgcagagaa
gagtgaggca cgtttgaaaa aggctgaaaa atgtttctgt 60ccaggcaagg gtgtgtgctg
aatgactcaa ggattttttg gtgcattgaa tgaacagcgg 120gacattggac
acctgctgat ccatcacccc gggcccgggc aggcccgtgg atgaagagag
180atggagaaga ccaggcatga gactgtggag aagccacacc accagaaacc
cctgccccat 240gcgccgtcca gcccacacct gtggatgcac gggggattgc
aggcagggct cccaccgtgg 300actcaggaac aggcagggaa gctgctgcct
caccaggcga aggggccagg agggggaggc 360ggagaggccc gtct
374582737DNAHomo sapiens 58gcctccaatc aacagatcag acagcttgta
ctcacaggcc aaggacacgt ggaaagaggc 60tcaattttct agatgggtgg caacagccat
gatcttctgt cctctgggtc cccacaagcc 120tggatgaact caagatctga
ctcagtggca cagtgaggag acctttgagg cctcagtgac 180catccttgga
cttcacctct cacggctttc aggcagagag gccctcccat gcccacaaca
240ggctgagccc agccttcctc ggggtttgct tccaggcctg acttttactc
ccctttctaa 300gtgtgcagat gggatgtgct tctccacagg aggccccacg
gcttccccac ccctcagagg 360agcgccgtgc gtgcgtctgt gtggaggatt
ggcagctcct gcagtcggcc cttggtccta 420tttggcgacg cctctgcctt
ccccttaatt atacagtcat gagccgccct ggaatcacgg 480cagctccgga
tggatcctgg atgccagaat gcagcctcag cacggggctg caggacagga
540gtgagcgagg ggctgcagag ccggcggccg cggtgggcac catggagggg
gctgccctgg 600gcagcacggg catgagtctc aaggcccagg tttgagtaac
aggtgttgag agcttactta 660cttttcctga gacacagttt cctcatctcg
agagcacgga aaatcattct aacttcagag 720gattgttgtg aaagttaaat
gagattaaag aggtaaagcc catgacgtgc ttagctcgtg 780cttggctctt
ggtcaatgcc agttagcgct gcattttctc ccctctccct ccctccttct
840ctctttcttt tcttctattc tccattcctg ttttctcccc caccccactc
cccaaagctc 900tgcgttgaga accagatgct gtctggtggg ttagggccag
aggaggaaaa gctgcccgcc 960gtgggctgca cccataccct cttcattcca
atgacatgag gggaggggaa aggacagagg 1020tagactgtcc tcccctacct
cctcctaata caaatggaat tcctggaact ggaaaacaaa 1080gaataccccc
ataaaaataa gacagtactt ctggtgcggt gtaataaagg ggaaagtaac
1140cctcaatgtc aggaaactcc gcacctccca gctcatattt gtgtggagga
aaagttaaat 1200attaatttgg actcaactga atgtggacac aaacaatggt
caccaagtcc cggaacaggt 1260tgtgtgagcc tcttcagggg ttcatccagc
gctgttttgg agaaatctct atttcaattt 1320attcctatac gttagttact
gaaaaacaac agacaatcgc aaaagcaagt tgcccgtttt 1380gtgttccttg
agcccaatca tgaagtgccg tcgtgactgg gcctcatgac aaacaacttg
1440taacaagtaa caacagagct caggtcccag accgcactga agctctgtga
gacctctcct 1500catctgtgca tgaacgagtg tctgactctg gagcccagcc
tgctgcttcc cagtctggtg 1560gtgaatcctc cgtagtctga tggaggtttg
ctcttgttgc ccaggctgga gtgcaatggc 1620acaatctcgg ctcactgcag
cccctgcctc ccaggctcaa gcaattctta cgcctcagcc 1680tcctgagtag
atggaactac agggcatgga ctggaactcc caatacccct gacatgggct
1740gagtcaacgt ggtcatgaac atgtgacagg aggcagcaga agttgcagag
aagagtgagg 1800cacgtttgaa aaaggctgaa aaatgtttct gtccaggcaa
gggtgtgtgc tgaatgactc 1860aaggattttt tgggcaacac aaaccaacac
gagccgtgtg aggatcaggt gacagctgcc 1920caaaagctga cacaaggaac
aagcctggag gagtgaggat gggtgctgtg aaggaggttg 1980tgcagctggg
cccgcagtcg gacctggtga gatcagagga gggggtgcca ccagtctgtg
2040gacgaagatg agaagctgga atagagcaga aaacaggagg ctgccactct
ccatctttcc 2100caaagtcact ccaggagcaa gggtgtcatt tactgaaatg
acagactctc catttcacat 2160ttttccccca agtgcagagt gcagggaagc
agatgggcta aatttttaga gtcagggtta 2220ttaatgtata ctttacatag
taaactttcc ccttttaagt gtgcaggcct gaggtttgcc 2280aaatatgtgt
aggcatttaa tcaccaccac gatcaagatg tagaatattc ccactatcaa
2340aaagtttgct gtgtcccttg atggtcatgc cccattccac agccccagcc
ccagcccctg 2400gagattgctg tctgctttat gttccagtgg ttttatcttt
tccagactgt atggatgtga 2460atggaatcag atgtgattcc aaggtgtttt
atcttttcca gatgtgaatg gaatcagatg 2520tacgaaatcc tatggtaggg
ggtcttctga gtctagctcc ttttgtttag cgtgatgcat 2580ttgaaattaa
tccatgtctc aggcatcagg agttcatttc tttttctgct gagtagtatt
2640tcattgtatg gatgtactgc aatttgccta tccattcacc tgttgatgta
catttgagat 2700ttttggcaat tatgaataaa gctgctataa acagaca
27375915706DNAHomo sapiens 59cagatgctaa aattgacacc aaaagcgtag
gtgatgagct tgaatcaggt aagaattata 60ttctacttcc agctaaagaa gaggatatag
aagatcaata gcttaaacaa aacagaaggt 120cttctcatgg actaaaacag
tgacagtgag tgcaactcta cctatcatgt tcatattcta 180ggtagaaact
ggggaaaaga gaaaaggcgg gaaaaaggcg tttctctaca aagatcacac
240ataatcttct gtctgcatct cattggtcgg catttagtca catggccata
actaccagca 300agggaagctg ggaaatgtag tttttcaatg gcacattgct
acccttaata aaataaggat 360atgctaccaa caaagaatga atatgggaaa
ggcccctagc aatgtctgct gcatctgcca 420ctgtggtttt ctctgatttc
cacagacatt gctcttccac atagagcgca ttctgattcc 480caagccacca
ctcccaaatc acacccagtt tctgcatcta gcttgtgtgt gggcagtctg
540ctccatcagt cccagatgtg gcttctcatg gctggtgacc tatgaactaa
aagacaggcc 600atttttttcc caacctaccc taccttttct ccttgagagg
aaagataaaa taaccacaat 660ttaaaaatgt caatcttcca ttcagaaaag
ggaggaaaga gaaacacaga gatcactggt 720ttacagtgat ggacccaccc
tgctaggcag gagtgaaaag gcctctccgt ccggcagtgg 780actgggttcc
ttggctggcc catgtggtcc ccacccactg tctccaggag catcgccttg
840aactgtgtct tcatggcttt tggctccgcc ttcagaaggg tcctccctgt
caatcatcct 900ccattgccac ctctgatgtg ggcactaggg agctggtcct
tcccagagga tgcatgattt 960tgacaccgtg cagtgtggga tgttggacca
gggaatccag catattttcg ggcttttgta 1020gtctcagact gggcctgaga
tttctttaga aatatattaa cggttccttc aacactccaa 1080tgggcatata
gcctatttac ttctgatcat tgtaatgtgc aaataaccac agccccattt
1140ctttgtgtcc agatgcagtt tttaggcttg aatattcatc tgttttactc
actggcctct 1200gtcactgagt ctgtcccctt agctattaaa ttaatggtga
ctatgataac acttttcacc 1260cgatccttgc tagtgcattg acttgcgttg
tctgcatcga gttcttaaaa gatgcttggt 1320aaagtcttgg gtcacagttt
taacttctgc tggggttcct gctccacagc cctcatttag 1380aactgtttca
ctttggctat tgtttagctt tgggccatat ttggctgttt tttagctttg
1440taaaaatgct catttaacgg acacaaggcc tgggttagaa aatgtagtat
ttccttcccc 1500ctctgtttgc gtaggagcta gcgctagtgg aagcccatct
ctcccttgca ggattttgtg 1560gaaagtgacg aaaataactc aaaccacatc
agtgttctgc ttgatttcca ccattcttgg 1620atgctataac ttcagaaagc
acatgatcta acttcccaag ggacagcagg gcctcgacca 1680aatgccgttg
tgctgagggc tgccggccgc cccgctaggc tgctgcaccc tccccgcgct
1740gccccacacc ttgcacagag ctcaggcagc attctagcac catcagacaa
attcttcatg 1800agtcagaaat cgcacctgtc tcttaatagc gatctggcct
caggtggctg aaacacacag 1860ggttgaaatg ctgcggtgcc tgcgctcatt
gtgcggtgaa gccatagagt ccgcgctgaa 1920acccggggcg cccgcctcct
gtccgcgcag gcgctgcacg ctgtcgccgg ggcagcttca 1980ctcagcttca
gcctctccat ctgcagagga ggagtaatcc cgtgctgcct catggggccg
2040tttcaagaag gaaatgagat gtggtataaa gtgttagagg agtatttagt
gttattatta 2100atgtgttttc tcatctggga acatgatttg cattttaagc
agaacttcaa catttggact 2160caaaggagtg atgtggatgg ggagaatttc
aggcatcgtg caggctggca ttagaaatgc 2220tcaccggaaa tggagcacag
gcagcaactt cagcccctag aattcttcct cctatgtcac 2280ttccgtccct
caaacccttt ctgcagcccc tgtcctctct cctgcaatgg cagtgagcag
2340gtggtctcct ggtgtctctg gaaaattccc atatggagtc ctggttgaga
tgagccagaa 2400ggctaggaag gagcctacag tcccggtctc agcgcccagg
aggtttttct aatggagttg 2460gcatgcagaa gctggggcat accctgggag
cagccctgtg gctggatgag ggatggacag 2520cagcactggg cgggtggagc
cgggtgcgag gtccccacag cctgcctccc agtcccactg 2580gccagcccgg
ggagggtgct gaagacaggc ttcctctctg gtcaagctct ggctagcttg
2640gcaagtgcag gcaggaccct ggacctgcca tatggtgcac attttgagga
catcattagc 2700aggtttggaa ggttgtgatt ctggttgtgg gagagaagga
acctgggagt tgagcaggtg 2760gtgggtggag ctatgcatgc aactgagtgt
gaggagtgct ttggccatgc tagggtctcc 2820tctgggctgg aagccttgtc
ttcaagaggg ttggggtagg aggcttttta ttatagtggg 2880aagcctcaag
cctggacctg ggaaaacact ttctgggtat gaggaattaa aataaacctt
2940gaagcactat ttaggaagaa catagtatca cttcatagat attttccccc
ctaaatggtt 3000acaaagtaaa atggcagagg tacaaagagg aaaatgaaaa
ccacccaatc ccttccctga 3060gcgatggccc tgttatcctt gggctctgtg
gccatctgac ctctctctgg ggacagaagt 3120acatgaggag aggggccact
tcataagacg gagatgctct cttcctgttc cctgcctctg 3180catcgtgctg
gggatgtctg tcctcatcaa ctaagagacc catgtagacc gcagctctga
3240gatcttctat ttttgtaaac agaacagttt gtagggagat gtgagatgtt
aggtggtgct 3300ggagagcctg ggctttgtct gtttggacaa ggctccatca
aggcatcctt tgtccgacgg 3360tgaggcatct tgagggctct ggacccagag
ctggtcctgc aagagtcctg tctgagagcc 3420caggccccac tccactagga
gggacaggag ggaacagagc agagctgagt cccttcacct 3480acccaaaccc
atacatgttt ctagagcaga gtaactgctt gtgaaacaga ccatcagagc
3540acagggcagc accgagctgc gttctgcagg gctcggtcta ggtgatctgg
agggcttggg 3600gagctggctt ctcccctcat ccagcatgta gctacccaca
gccacctgca tttcacaggg 3660ccagtgccta gggacattgg gccagaagcc
agaatttctt ttttcttttt ttttttcccc 3720tctagcattc actactagcg
caagctagtg catcccaaag ttttggccct gcgtggataa 3780tcaatccaca
atttatcttc cgtctgttgc caaagtaatt tagctaaaat gcagatctca
3840actggtcact tccctgcgta aaagcttcaa tagctttcta tcgcctacag
gcaaaagtcc 3900ctcttctgaa aagcggcttg caaagcccta gtagctggct
ccatgccccc cagcaatacg 3960tggcctcctc agacgcatct tctagcaaag
gagagctgct cagtgacatc cacaaaccga 4020gctgcttctc acctacctgt
tttccttctc tttgggatgc cccccgccct caccttctgc 4080ttctggctaa
ctcctaggca tccttgaagg cttggctcag tatcctctcc tccaggaagc
4140tgtccctcac cttctctcct ttccctccat cccagtcacc atgcaccaca
cccatatccc 4200cattgcatcc tgcagctacc ttgtgcctgc acacgttggg
ctgggcatgc atctccttct 4260ctctcaagac ctggatccct tcactttgtg
tctctggacc ccccagtgtg ctgataatgg 4320ggttggaacc catcatattt
ctcttgaacg agttaatgat gggactgcta tttctctaac 4380tgttgccttg
gaggccctgt cacgtgctca tggaagagag ccaggggggt ggaggtgatt
4440cttgttaccc agaggacgtg gggtctggat acacgtttct gccatctgcc
atctgccagc 4500ttctttctgg ttggtagctt tggagcctgg tgcagctggg
gccagtcccg agcctggact 4560ctgctgggca gtggcaagag cactgtctgg
agctctcctg aggagcccac agatccaact 4620ccctaggcca aggctgcagc
ctggggcaga gatgcagagg cctggaggag cctagggcac 4680gcggcctgcg
ggctggctgg ggttcagagt tcgtatgtgt gtggagtgac tgggcaggtg
4740ttcagaaatg aaggctggca ctgccaggta aggcccttcc ctccctgatg
tgagagccct 4800ggagccaccg cagaggccca gtcagatctc tgttctaatt
ctggcctggt gtggaggatg 4860aggagagacg gcccagaaag gaaggcagac
tgtgcagacc ccatgtcttc tggcccgcga 4920ggccctcctc ctgtgcctgc
ttatcttaaa gaatccggga taagaggtga cttgggcctt 4980ggccgggagg
cccctcctca gcttcagaca aggagggagc tctgggcatg aggacattga
5040gcaagaggcg atggcagtgc ccacaactta ccctcagctc ggctctgttg
ggtccgagaa 5100gttgcatgga aagggctcct tgggggccag ttgtcagtaa
gctgcagaag cctggagccg 5160gccaggaaat aaccacgtgt aggagccttc
tcagctgaga ggaaggagga ctcacgcgcg 5220gcgagcacat gcttggagcc
aggcacaggt ttgaactaag ctatttcatc tcgattctta 5280tgacaagctc
cacgtagttt gctcctattt tacagatgtg gaagctgagg ctcagagagg
5340ttaagcgact tgtcccaaat cgcattgtca atcagtgaag gggctgggat
ttgagctagt 5400cacctgcctc taggttcagt gtgctttcta ctctgctccc
ctccatgcct gcccgaccct 5460ttgctgatga cacattcctg agacctcaaa
ggagtcctac tttgaatcat gaatggcctc 5520gcgtttcccc agagggcatg
acgcaagctt cgccacctca ctcccaccct caccactggc 5580tggttgctct
taggaagatg ctgtttacag gatcacgcag tggttcaggc caccacgatg
5640ccacttcccc tgctcttaac cccccaccaa actgcaggtg gcttccctgg
gagactcggg 5700gcaacaccct ctcgtcctgt atgaagcttg tacctttctc
cacccaagtg agtgacagct 5760ggcgggagtt ttgcactgtg gaacagggta
cacaaagaca ctggagtgag aaggcagggg 5820catggccagt ctatgtctag
gaggtggtgg ctccaccttc cttgtggact cagctttgga 5880gtcatgcagg
ctgctctctg ggcactcctg tgagccattt cttctgctga tagggaggaa
5940cgtcccactg ccccaagatg ggttctgtgc agagtctccc cagggtcaag
acaggagcta 6000gaatgtatgt catagaagga tgatctaaat ggtgacttct
ggctggtgag gagaggctat 6060ggcacctcca cggctggtgg cctcttgctg
aagtaagcat ggtcagcatc ccccctgcac 6120cctgtggagg tggcttaatg
cattcccttg actgcaaagg actccttgcc aaagagacct 6180tcttccccat
gaggctgagg cctcctagga ccctcagtgc tcaggaaatt ataaccagcc
6240acccccatct ccattcattg gagaaggagt gacggccgcc tcagtgccat
taactctgtg 6300ctgtgattaa atggatccca aggagactcc tgctcaggga
caccccctgt aggactgatg 6360ccagggctag gcttgccgca cagtgctcta
ttcctttggt tatgcacctt ccttggcaga 6420atccacgctt accaagagga
gtcactctga gctgcttgct gccagtcaca tgcttagcag 6480tagaagatat
cttgtgcttg tcaggtgact gtgagtgaga gggaaggggg cccagcgtga
6540agccaggtgg gacggcttct gcggggcaag acaccccact ggaggaggca
ggggcgcgct 6600gtgcaggcct cagcaccagc tctggctgct gtggtgggat
ctcagatcag tcactttgcc 6660tttttggcct cagttttctc atctgtacat
catggatttg ggattaaagg atttctgaag 6720attttactca ttctgagatt
atggtccctg gaaaccttta gggaaaaggg agcttcttct 6780cttcattatt
ttaacatact tgatctttta tgttctttgg tagggagaga tgataggtag
6840ttagagaaga agcagctcag tgaaaaagct gaaagccttg tggaaaaata
agttaaattg 6900acccactgtg actccaggga cctggggaga ctttgatgtc
cgtgtttttg attacacatc 6960tcttctctca gagtgaagat gcgcagttct
aaggaattat gcccaatggc agaattggca 7020agggacaggg aactgtccag
cagagaagct gctgaaaccc ttcagggaac attccattcc 7080gccagggccc
ggctctcacg ctctgctctg cagcgcatcc ggtgccaagg aggggagtag
7140cgaacgttga ccttgtccct aaggagctta cacgcaggga ggtcagcaca
gacactggaa 7200tatctaggac tctgctatcg aaaaacacaa ttgctggcac
atgggttcag aagacagaga 7260aggaaaagag ctgtaaagag ttgggtgggc
tgaggtttga aatgggcttt aattatgact 7320ggacacaatt tggtgacaaa
tgggtgaaat ttcaagcaga agaggtagca tgagcaaaaa 7380ggggtgtgca
gtctttgtga agagaagggg gagggagtgg caagggaata ggacagagag
7440gaggaggagg gagaggaagc cagagataag gagccagggg gggaagaaaa
gggcagaaac 7500gaatgtgaag gagattctga agacaagcca ctgtggtggt
ccaggtggtt ccatggtgcc 7560gcgccaagcc aggggcttgg aatttagtct
gggaggatgg tgctgagtat ggatgaggaa 7620ggagaggaat ggagagggga
gaaacgggaa ggtacttcac actgattaag caagctcctc 7680acagggctat
gacttctccc ttctcagaag gaggtgccct gtggcatgcc tctaggccca
7740gagttaaaga gctggagcca ttaggagcag caaggggctg cctccccact
tgtctggtta 7800cttctggtta cttctccctc agggtaagtc ctcatgaggg
atcatctctg cccatggagc 7860tgcttccgct gcccttaggc tggtgtaaga
ggaaggctgt gtaccagagg tagatcttgc 7920tcctagtcca ccagcaaaac
acatccagtt atttctatgc ctcagtctcc cttccccaca 7980ctgatctctt
tattccccta ctacctagaa atggaaggga gacaatgagg tgggaaatag
8040agtttttcga aaggtgtttt ttggataaga caaaggcctc tcagagcaga
gctggccatt 8100ggaatggttt ccttttgtta tttaataccg gggctttcac
acaggagttt agcccagctt 8160ttaagtcttc agtatcaaat agtagttggt
gtcacatccc tgctgaaata cagccatgaa 8220aatgtttctt agtgatagat
ttaggttgta ctccttaaga aaagcccaaa tgtcaagaat 8280tgcttcccac
tggtacactt tattggggag aagggcatct caaatagaag gatggttggt
8340ctgtctagaa atggtaagaa tactacaggt taaaggcagt ggtggtctgg
actaaatgac 8400ccctgaaact cggattttat ggtgttttca atctctggct
gagtacaggt tcctccctct 8460ccctttgtgc ctccttgggg acctgggcta
ttttctcctc cgtgaaagag aatggataca 8520tccattatga aaaccaattg
atataatttg agcctgcatg caggtaaaaa ctatattaag 8580aaggtttata
aaatctaacc ttctggctta acaagctgat tctcaaagtg gcctctcaga
8640ccctggctga gggatggtgg tcaggggttg cagagggacc cgcacagctg
ctgaggaggc 8700tgtgggacag gaggcgctat cactgtctac ctcttccatc
catgcacctg tgtgcaggtg 8760tgagaagggc accgtgctgc aaagaagggt
cctttccacc ctctgcgggt tgctgccggc 8820tccccgatgc ctgcctctga
ggcctgagcc tggggctggg gagtgtgctg gcctcacctc 8880tggaggatac
tgcctcagtg ttacagcctg accccccacc ttgtcatcac tgcctactac
8940agaccagagg ggacggccac agagactctg caaccatggg ctctgccctt
cttccttctg 9000cctgtgtatc tgtgaaaaac tttttttttt ttaaatggag
aaagctacct tgacttctca 9060gagagttgaa tggggtcagg ggatagaatc
tatatttttt agttatgggc acttacccat 9120attcaaaaag atttgaggag
gctggcagaa tggagctggg agaagaaaac cctcccctgg 9180gaggaagctg
tcctgtgcat gttggccagg ctgcctcttt gattagggac aatggaaacc
9240ggcctgaggg cacgggtgaa agcagttgag tgtagaggag gctctgcagc
agaagccaga 9300ggacacagga gccagtgaag acacacaata agtcagaaag
gagggatctt tgcagcccca 9360aaagtagaaa ttcttaccat ctactgcaaa
gagcaaaagt tgaaaattgg tctgatttct 9420atctaaatgt gcttacatat
tcgtgttctg ttaaatactt ctgtaacctg tgtgtctcac 9480ataaatgcag
ctttctttag ttttggaaat aaatcacatg aatcctgaat agtagtcttt
9540aataatttgc ttagttgtag ggcagtgttg tgttttcaga aggcaagtgt
atttgctaga 9600agagtgagct gggaggtgtg aaccacatcg tcacatctgc
tgtaagccta gccgttcata 9660atacggagtt acagttagga cacgtcgccc
tgaagagcta ccatcgaatg tgtgctcatc 9720aaatgcctgg cagcgtcctc
ggtgcttcac ctgccatagc cgacagtggc tgacctccca 9780tgcctgttgc
cttttctttc tgttggatca gggatacact gccatgtgtg ttaagaaaag
9840ctggccttac ctacagggct ggccagtccc ggtcacgttt ctagtaagcc
attgccttac 9900ataagggtaa cggcatggga cgctatctta gccaatgtga
taaaagtgga catgaggtga 9960gaggcttcag agagaggttt taaaaaagag
acaaaagcag gacgttgcct ctcttcctcc 10020tctccacgtg tcctacccgg
atgtgaagcc aaaacagatg caggcttagt gcaaccatgg 10080ggaacccagc
ataagcacag attcaacagc agaagagtgg cagagggaga aggtgaaagg
10140aacctaggtt ttcctgtcct tgttgagtca ttcagttaaa aatccctgga
attttcctct 10200ctccggcagt gtgttttgtg ggataatgag ttgccttatt
ggggttggct tgctagtcgg 10260gatgtttcgc tcccatcaac atccatacgc
ttgctctgtg aaccaatgac ctgatgaggt 10320agtattagca ccaccatcat
tatgctgagg atgagattta tggcacagtg gttcagtagc 10380ttgcccaagg
ccatgcggct ggtaggttct ggaggagggc tcagggcacc ccctgagcta
10440cccctgctgg ccattgcacc accccataaa gctgctggca gtcacttctc
tgaggggtta 10500gcatgtaaga aatgtcctcc tgaatgctgg ccagacaaat
ggaaatctgc cagggttggg 10560tacccccatg acagcagcca gcctgccctc
ttagtccctg acagctgcag tgacagcatc 10620tgtgattgca aagcgtgaca
atttatatct ctcatttcat cacaccatct atcagcagac 10680agtcaggctt
taaaaatcaa tcccacactg actcagtccc cagcagagat ggcctctgac
10740aacagtatcc acactgcagg ctggacaagg gccctattaa ttttgagact
cagccaaatt 10800tccttctgac cctaagctgg tgaatccctg ctcctttgct
ttggttgggg ttggtgtgag 10860ctaaggctgt gatcccattt gctcctatgg
cctccaggtg gcctgggcct ccatgaatgg 10920gccacatggt catactgaat
gcttgattac actcagacct agcagtcgtc tgggcgcagc 10980tggtttatgg
atcactttgt cacaatgttc catccttcca ggtccccatc cccgcggtgg
11040gaaaacattg ctttaggcag tgctagagga cttcagcagg cattggcagc
ttctggattc 11100aggattagaa caaagaagga ggagtcacag caaagatagg
aacagaaggc agagagaaca 11160gacagatggg ggtgtttgag aaggagggcc
tttgagacct cagggagtgg gagacactgg 11220ctcgagaata ataataatgg
caatttctct catctgtgtt ttcagggcat ggactggaac 11280tcccaatacc
cctgacatgg gctgagtcaa cgtggtcatg aacatgtgac aggaggcagc
11340agaagttgca gagaagagtg aggcacgttt gaaaaaggct gaaaaatgtt
tctgtccagg 11400caagggtgtg tgctgaatga ctcaaggatt ttttggagag
aattggagtg tctcaccaga 11460ggagaccacg tctgaagggc tttgcatccc
tccttggaca tgtctaatac ctaacactca 11520gaaagcatcc agtaaatatt
cgtggaaaga aaggagtgga gaaggggaga aaggggaaag 11580ggagtaggcg
agagagaaga aagactctgc ttcttgccca gggcctggca tggggcggag
11640gcaaagcagt ggggtcctca gctatgtccc actgtgagtg cacagcgagt
cctgaccttc 11700agagggtgca gcccgagggg ccctggcctg tctgaagggt
gcgccagccg agtggcctgc 11760tctgaccacc aggctcaccc atgactacct
gggtggctac agccagttcc tgacaatgag 11820tacagcactc agttatcggg
gcccttccac ccacacgctg tccacttcct ggggtactgc 11880tgtgggcatg
tgagtgcttg ctccccgggg cactgctgtg ggcatgcgag tgcttgctcc
11940ccggggcact gctgtccact tcctggggta ctgctgtggg catgcgagtg
cttgctcccc 12000ggggcactgc tgtggacatg tgatagcttg ctccccagct
ccactagtga cactggcggc 12060ccctcgctgg ggccttcccc gcctgctccg
ctccattacc gctgccgggc tcctcacgtc 12120tctccttgct gcttcctgca
ctggggtgag gagagtgggg ctggtcccct tgagaccgga 12180gaagctccag
gcttttaagg aaaactgcca gggacgaaga gaagatatca cttccccacg
12240tggttggctt ccagattcag aaggaatgtc tgtccttgtg gattccgtac
cagatgaccc 12300cagatgctgc ctcagtacta ggtccctgtg gctctggagc
ctttgctggg tctgggcagt 12360gtctcttcct ctccagttca tccttgggtc
tcttcaccct tgccaggggc aggcttcctg 12420gtgagaggtc gacctcctgc
atgaaggctc tcaagaggcc agttcaaagc caagctccgg 12480gtctgtgcct
gtggggctgc tcctcgatca ggagatggtc actcccctcc tggtctgtat
12540ctgtgggatt ctcctccatc aggagatggt ctctcccctc ctggtctata
cccgtgggat 12600tctcctccat caggagatgg tcactcccca tcctggtcta
tacccgtggg attctcctcc 12660attagatggt cactcccctc ctggtctatt
cccgtggggc tgctcctcca tcaggaggtg 12720gtcactcccc ctcctggtct
atacccgtgg ggctgctcct ccatcaggag atagtcactc 12780cccctcctgg
tctatacccg tgggattctc ctccatctgg agatggtcac tcccctcctg
12840gtctataccc atgggattct cctccatctg gagatggtca ctcccctcct
ggtctatacc 12900catgggattc tcctccatct ggagatagtc actcccctcc
tggtctatac ccgtggggtt 12960ctcctcaatc aggaggtggt cactcccctc
ctggtctata cccgtgggat tctcctccat 13020caggagatgg tcactctccc
tcctggtcta tacccgtggg gctgctcctc catcaggaga 13080tggtcactcc
cctcctggtc tatacctgtg ggattctcct ccctcagaag atggtcactc
13140cccctcctgg tctataccca tgggattctc ctccctcaga agatggtcac
tccccctcct 13200ggtctatacc cgtggggctg ctcctccatc aggagatggt
cactctccct ctcggttgct 13260cagtccaaaa acaacctctc tggaaaactg
cgtggaattt ttttttaaag aattgaaact 13320agaactagca tttgatccag
ccatctgcct actgggaata cacccaaaga aaaataaatc 13380attatatcag
aaagatagaa tatgcatgtg gatgttcatt gcagcaccat ttactatagc
13440aaagatgggc agttgagcta agtgtccaac agtggtagac tggataaaga
gaatgtgtta 13500cacacacagc atggaatatt actcaggcat agcaaagaat
gaaatcatgc cttttgcagc 13560aacatggttg gaggtggagg tcaggagtta
gagaccagcc tggccaacat ggtgaaatcg 13620tgtctctact aaaaatacaa
aaattagccg ggcatggggg tgcacacctg tagtcccagc 13680tactctggag
gctgaggcag gagaatcgct tgaacccagg aggtggagat ggcagtgagc
13740tgagatcaca ccactgcact ccagcctggg caacagagtg aaatcctgtc
tcaaaaacaa 13800aaataaaaac aaaaaaagca tacaaaccac aggagctcct
cttggtcccc ctttgtcttt 13860cattccacct ccagaaatcc cagcagaatc
accttcaaaa actcctagaa tccaattttt 13920cccctccatt gctactgccc
tgatctgagc ctccataacc cttacccaaa tgcttcctaa 13980acgtatcctg
gctggtgctg ctgaattcca tgtctttcca gctgcccttt aaaatacggt
14040aggaggcaag tcttttctca aaaccctcca gtggcttctc tctcagagtt
aagatcctgc 14100agtggccttc ctggcctcag gtagtgtctg ctgtcctgta
ccctcggcca ctatactcca 14160gccacatggc tttgtgtttc ccttggacat
atccagcatg tttctgcccc acggttttgg 14220cacttgctgt cctttctgcc
tggagctcct tctcctccct ctgcactgaa gaccctccct 14280tcctttcagg
atggcagaga cattatgctg tcataaccac accccatatt cacccttaca
14340cgatgtgtcc ctctctggcc agctaggggc tcagctccat gagccccttg
tcctggcaac 14400aaagctggct ggggcggcca cctgaagtat gtctcatgga
gctgactcaa tgagagacac 14460agttcattcc atgcacagtc cacgccacag
taagtcacgt ggccagcgct gacttcccct 14520gcacaggaag aacctgcacc
caccacccgc ggagaaggat ctagagctgg gatgactgag 14580caggatgcta
acaacctcaa agttcttctt agacctcatg tcttgaacag ccctaggcaa
14640catagcaaca cacgccatga caaccccaca agaaggcaac ccgtcctctg
acagcttctg 14700gtgacaaagc caccccgctt gtgacaacct caggtcacac
agcagctcct cccctgacaa 14760cctaaggtca cacaacaact tctcctttta
aagtctcagg tgacacagca gactctcccc 14820tgacaaactc aggtcacaca
gtaacccttc agctgacctc aggtgacaca gccacccctc 14880ccctgacaac
ctcaggtcac acagccaccc ctcccctgac aacctcaggt cacacagcca
14940cccctcccct gacaacctca ggtcacacag ctacccttca actgacaacc
tcaggtcaca 15000cagccacccc tcccctgaca acctcaggtc acacagccac
ccctcccctg acaacctcag 15060gtcacacagc cacccctctc ctgacaactt
caggtcacac agccacccct cccctgacaa 15120cctcaggtca cacagccacc
cctctcctga caacttcagg tcacacagcc acccctcccc 15180tgacaacctc
aggccacaca gccacccctc acctgacaat ctcaggtgac acagccaccc
15240ctcccctcac aacctcaggt cacacagcca cccctcctca gacaacttct
gacatagcaa 15300ctccttgcct gacaacccta ggtaacatag caaccctccc
ttgacaaccc atgtgacatg 15360gcaatgcttc tcctgacagc cacatgtcag
caacctctgc ctgacaaccc aggtgacata 15420acagcacccc ccgacaaccg
catgttacct tgccaccctc ccataccgac tgtatgtggg 15480tatcccctcc
taccccgcct tgggagcccc atgtgaggta gccagccttt ccctggccct
15540gggccctcca tttctgcttg ctgtctcctc tgttcctccc aagaactcac
tgctctaccg 15600tgtaatctct tgtttctctg ctgtcttagt ccgcttgggc
tgccggagga gcacaccttg 15660ggcagggagg cttagatgca cctgtgcatg
gttctggagg cccagg 1570660744DNAHomo sapiens 60aagcgtaggt gatgagcttg
aatcagggca tggactggaa ctcccaatac ccctgacatg 60ggctgagtca acgtggtcat
gaacatgtga caggaggcag cagaagttgc agagaagagt 120gaggcacgtt
tgaaaaaggc tgaaaaatgt ttctgtccag gcaagggtgt gtgctgaatg
180actcaaggat tttttggtgc attgaatgaa cagcgggaca ttggacacct
gctgatccat 240caccccgggc ccgggcaggc ccgtggatga agagagatgg
agaagaccag gcatgagact 300gtggagaagc cacaccacca gaaacccctg
ccccatgcgc cgtccagccc acacctgtgg 360atgcacgggg gattgcaggc
agggctccca ccgtggactc aggaacaggc agggaagctg 420ctgcctcacc
aggcgaaggg gccaggaggg ggaggcggag aggcccgtct agcccctgcg
480gctgtcaccg tggtgcctcc tcactggcca gtgcggtcgc gcctcagctt
cgttaatagg 540ggagggggcc taagagtttt cacgtccagg ctcgggcagt
ggggaggcag gcaggagtgg 600ccgctggttt ttcagacctc ccagggaggc
cgaggaaatg gcccgtcctg gagtgggcgt 660ggttctgtct tcagatggat
gctggagggt tgggctgcgt gggaccctgg gccctgctgc 720ttcccggagg
atgcgctgtc cggg 744611129DNAHomo sapiens 61aagcgtaggt gatgagcttg
aatcaggtaa gaattatatt ctacttccag ctaaagaaga 60ggatatagaa gatcaatagc
ttaaacaaaa cagaagggca tggactggaa ctcccaatac 120ccctgacatg
ggctgagtca acgtggtcat gaacatgtga caggaggcag cagaagttgc
180agagaagagt gaggcacgtt tgaaaaaggc tgaaaaatgt ttctgtccag
gcaagggtgt 240gtgctgaatg actcaaggat tttttgggca acacaaacca
acacgagccg tgtgaggatc 300aggtgacagc tgcccaaaag ctgacacaag
gaacaagcct ggaggagtga ggatgggtgc 360tgtgaaggag gttgtgcagc
tgggcccgca gtcggacctg gtgagatcag aggagggggt 420gccaccagtc
tgtggacgaa gatgagaagc tggaatagag cagaaaacag gaggctgcca
480ctctccatct ttcccaaagt cactccagga gcaagggtgt catttactga
aatgacagac 540tctccatttc acatttttcc cccaagtgca gagtgcaggg
aagcagatgg gctaaatttt 600tagagtcagg gttattaatg tatactttac
atagtaaact ttcccctttt aagtgtgcag 660gcctgaggtt tgccaaatat
gtgtaggcat ttaatcacca ccacgatcaa gatgtagaat 720attcccacta
tcaaaaagtt tgctgtgtcc cttgatggtc atgccccatt ccacagcccc
780agccccagcc cctggagatt gctgtctgct ttatgttcca gtggttttat
cttttccaga 840ctgtatggat gtgaatggaa tcagatgtga ttccaaggtg
ttttatcttt tccagatgtg 900aatggaatca gatgtacgaa atcctatggt
agggggtctt ctgagtctag ctccttttgt 960ttagcgtgat gcatttgaaa
ttaatccatg tctcaggcat caggagttca tttctttttc 1020tgctgagtag
tatttcattg tatggatgta ctgcaatttg cctatccatt cacctgttga
1080tgtacatttg agatttttgg caattatgaa taaagctgct ataaacaga
1129621843DNAHomo sapiens 62atttagctaa aatgcagatc tcaactggtc
acttccctgc gtaaaagctt caatagcttt 60ctatcgccta caggcaaaag tccctcttct
gaaaagcggc ttgcaaagcc ctagtagctg 120gctccatgcc ccccagcaat
acgtggcctc ctcagacgca tcttctagca aaggagagct 180gctcagtgac
atccacaaac cgagctgctt ctcacctacc tgttttcctt ctctttggga
240tgccccccgc cctcaccttc tgcttctggc taactcctag gcatccttga
aggcttggct 300cagtatcctc tcctccagga agctgtccct caccttctct
cctttccctc catcccagtc 360accatgcacc acacccatat ccccattgca
tcctgcagct accttgtgcc tgcacacgtt 420gggctgggca tgcatctcct
tctctctcaa gacctggatc ccttcacttt gtgtctctgg 480accccccagt
gtgctgataa tggggttgga acccatcata tttctcttga acgagttaat
540gatgggactg ctatttctct aactgttgcc ttggaggccc tgtcacgtgc
tcatggaaga 600gagccagggg ggtggaggtg attcttgtta cccagaggac
gtggggtctg gatacacgtt 660tctgccatct gccatctgcc agcttctttc
tggttggtag ctttggagcc tggtgcagct 720ggggccagtc ccgagcctgg
actctgctgg gcagtggcaa gagcactgtc tggagctctc 780ctgaggagcc
cacagatcca actccctagg ccaaggctgc agcctggggc agagatgcag
840aggcctggag gagcctaggg cacgcggcct gcgggctggc tggggttcag
agttcgtatg 900tgtgtggagt gactgggcag gtgttcagaa atgaaggctg
gcactgccag gtaaggccct 960tccctccctg atgtgagagc cctggagcca
ccgcagaggc ccagtcagat ctctgttcta 1020attctggcct ggtgtggagg
atgaggagag acggcccaga aaggaaggca gactgtgcag 1080accccatgtc
ttctggcccg cgaggccctc ctcctgtgcc tgcttatctt aaagaatccg
1140ggataagagg tgacttgggc cttggccggg aggcccctcc tcagcttcag
acaaggaggg 1200agctctgggc atgaggacat tgagcaagag gcgatggcag
tgcccacaac ttaccctcag 1260ctcggctctg ttgggtccga gaagttgcat
ggaaagggct ccttgggggc cagttgtcag 1320taagctgcag aagcctggag
ccggccagga aataaccacg tgtaggagcc ttctcagctg 1380agaggaagga
ggactcacgc gcggcgagca catgcttgga gccaggcaca gggcatggac
1440tggaactccc aatacccctg acatgggctg agtcaacgtg gtcatgaaca
tgtgacagga 1500ggcagcagaa gttgcagaga agagtgaggc acgtttgaaa
aaggctgaaa aatgtttctg 1560tccaggcaag ggtgtgtgct gaatgactca
aggatttttt ggtgcattga atgaacagcg 1620ggacattgga cacctgctga
tccatcaccc cgggcccggg caggcccgtg gatgaagaga 1680gatggagaag
accaggcatg agactgtgga gaagccacac caccagaaac ccctgcccca
1740tgcgccgtcc agcccacacc tgtggatgca cgggggattg caggcagggc
tcccaccgtg 1800gactcaggaa caggcaggga agctgctgcc tcaccaggcg aag
1843631160DNAHomo sapiens 63gagcactgtc tggagctctc ctgaggagcc
cacagatcca actccctagg ccaaggctgc 60agcctggggc agagatgcag aggcctggag
gagcctaggg cacgcggcct gcgggctggc 120tggggttcag agttcgtatg
tgtgtggagt gactgggcag gtgttcagaa atgaaggctg 180gcactgccag
gtaaggccct tccctccctg atgtgagagc cctggagcca ccgcagaggc
240ccagtcagat ctctgttcta attctggcct ggtgtggagg atgaggagag
acggcccaga 300aaggaaggca gactgtgcag accccatgtc ttctggcccg
cgaggccctc ctcctgtgcc 360tgcttatctt aaagaatccg ggataagagg
tgacttgggc cttggccggg aggcccctcc 420tcagcttcag acaaggaggg
agctctgggc atgaggacat tgagcaagag gcgatggcag 480tgcccacaac
ttaccctcag ctcggctctg ttgggtccga gaagttgcat ggaaagggct
540ccttgggggc cagttgtcag taagctgcag aagcctggag ccggccagga
aataaccacg 600tgtaggagcc ttctcagctg agaggaagga ggactcacgc
gcggcgagca catgcttgga 660gccaggcaca gggcatggac tggaactccc
aatacccctg acatgggctg agtcaacgtg 720gtcatgaaca tgtgacagga
ggcagcagaa gttgcagaga agagtgaggc acgtttgaaa 780aaggctgaaa
aatgtttctg tccaggcaag ggtgtgtgct gaatgactca aggatttttt
840gggcaacaca aaccaacacg agccgtgtga ggatcaggtg acagctgccc
aaaagctgac 900acaaggaaca agcctggagg agtgaggatg ggtgctgtga
aggaggttgt gcagctgggc 960ccgcagtcgg acctggtgag atcagaggag
ggggtgccac cagtctgtgg acgaagatga 1020gaagctggaa tagagcagaa
aacaggaggc tgccactctc catctttccc aaagtcactc 1080caggagcaag
ggtgtcattt actgaaatga cagactctcc atttcacatt tttcccccaa
1140gtgcagagtg cagggaagca 116064572DNAHomo sapiens 64cgctctgctc
tgcagcgcat ccggtgccaa ggaggggagt agcgaacgtt gaccttgtcc 60ctaaggagct
tacacgcagg gagggcatgg actggaactc ccaatacccc tgacatgggc
120tgagtcaacg tggtcatgaa catgtgacag gaggcagcag aagttgcaga
gaagagtgag 180gcacgtttga aaaaggctga aaaatgtttc tgtccaggca
agggtgtgtg ctgaatgact 240caaggatttt ttgggcaaca caaaccaaca
cgagccgtgt gaggatcagg tgacagctgc 300ccaaaagctg acacaaggaa
caagcctgga ggagtgagga tgggtgctgt gaaggaggtt 360gtgcagctgg
gcccgcagtc ggacctggtg agatcagagg agggggtgcc accagtctgt
420ggacgaagat gagaagctgg aatagagcag aaaacaggag gctgccactc
tccatctttc 480ccaaagtcac tccaggagca agggtgtcat ttactgaaat
gacagactct ccatttcaca 540tttttccccc aagtgcagag tgcagggaag ca
572654435DNAHomo sapiens 65caaatgcctg gcagcgtcct cggtgcttca
cctgccatag ccgacagtgg ctgacctccc 60atgcctgttg ccttttcttt ctgttggatc
agggatacac tgccatgtgt gttaagaaaa 120gctggcctta cctacagggc
tggccagtcc cggtcacgtt tctagtaagc cattgcctta 180cataagggta
acggcatggg acgctatctt agccaatgtg ataaaagtgg acatgaggtg
240agaggcttca gagagaggtt ttaaaaaaga gacaaaagca ggacgttgcc
tctcttcctc 300ctctccacgt gtcctacccg gatgtgaagc caaaacagat
gcaggcttag tgcaaccatg 360gggaacccag cataagcaca gattcaacag
cagaagagtg gcagagggag aaggtgaaag 420gaacctaggt tttcctgtcc
ttgttgagtc attcagttaa aaatccctgg aattttcctc 480tctccggcag
tgtgttttgt gggataatga gttgccttat tggggttggc ttgctagtcg
540ggatgtttcg ctcccatcaa catccatacg cttgctctgt gaaccaatga
cctgatgagg 600tagtattagc accaccatca ttatgctgag gatgagattt
atggcacagt ggttcagtag 660cttgcccaag gccatgcggc tggtaggttc
tggaggaggg ctcagggcac cccctgagct 720acccctgctg gccattgcac
caccccataa agctgctggc agtcacttct ctgaggggtt 780agcatgtaag
aaatgtcctc ctgaatgctg gccagacaaa tggaaatctg ccagggttgg
840gtacccccat gacagcagcc agcctgccct cttagtccct gacagctgca
gtgacagcat 900ctgtgattgc aaagcgtgac aatttatatc tctcatttca
tcacaccatc tatcagcaga 960cagtcaggct ttaaaaatca atcccacact
gactcagtcc ccagcagaga tggcctctga 1020caacagtatc cacactgcag
gctggacaag ggccctatta attttgagac tcagccaaat 1080ttccttctga
ccctaagctg gtgaatccct gctcctttgc tttggttggg gttggtgtga
1140gctaaggctg tgatcccatt tgctcctatg gcctccaggt ggcctgggcc
tccatgaatg 1200ggccacatgg tcatactgaa tgcttgatta cactcagacc
tagcagtcgt ctgggcgcag 1260ctggtttatg gatcactttg tcacaatgtt
ccatccttcc aggtccccat ccccgcggtg 1320ggaaaacatt gctttaggca
gtgctagagg acttcagcag gcattggcag cttctggatt 1380caggattaga
acaaagaagg aggagtcaca gcaaagatag gaacagaagg cagagagaac
1440agacagatgg gggtgtttga gaaggagggc ctttgagacc tcagggagtg
ggagacactg 1500gctcgagaat aataataatg gcaatttctc tcatctgtgt
tttcagggca tggactggaa 1560ctcccaatac ccctgacatg ggctgagtca
acgtggtcat gaacatgtga caggaggcag 1620cagaagttgc agagaagagt
gaggcacgtt tgaaaaaggc tgaaaaatgt ttctgtccag 1680gcaagggtgt
gtgctgaatg actcaaggat tttttggtgc attgaatgaa cagcgggaca
1740ttggacacct gctgatccat caccccgggc ccgggcaggc ccgtggatga
agagagatgg 1800agaagaccag gcatgagact gtggagaagc cacaccacca
gaaacccctg ccccatgcgc 1860cgtccagccc acacctgtgg atgcacgggg
gattgcaggc agggctccca ccgtggactc 1920aggaacaggc agggaagctg
ctgcctcacc aggcgaaggg gccaggaggg ggaggcggag 1980aggcccgtct
agcccctgcg gctgtcaccg tggtgcctcc tcactggcca gtgcggtcgc
2040gcctcagctt cgttaatagg ggagggggcc taagagtttt cacgtccagg
ctcgggcagt 2100ggggaggcag gcaggagtgg ccgctggttt ttcagacctc
ccagggaggc cgaggaaatg 2160gcccgtcctg gagtgggcgt ggttctgtct
tcagatggat gctggagggt tgggctgcgt 2220gggaccctgg gccctgctgc
ttcccggagg atgcgctgtc cggggctgca caggttggct 2280gtgttttttg
gatgcttgat attttgtttt ttcttctctt cactctgtca tgaaactggc
2340aatagtagtt tgtaaataaa tatgtgttat agatgaatat ttgctatgag
taaattaata 2400aaggagtgaa taaatgagcg attgatgtag ggcctgtcct
gtctcaggga gccccacgaa 2460ggcctgcgcg ccggccagag cctgcctgcc
tgccagggta ctgggacgtc actctcaaag 2520cggcgggacc cagccgctga
tcttgctgag gaggcccggt ctcagaaaac tgagcggctg 2580cttctgcaga
ccctgcatcc tcccctccct ggagaaagaa gctctggctg agtcctggga
2640ccgaaccctt gggtgccaca gaaacgggct ttgctgcctg tcagtcaagc
ggcgggagaa 2700acagacctgg ggaggaggag gctgggaggg ctgtgttttc
tgcacagcga gtagctcctt 2760agcctggtgc catttctctc
caaacaccct gaaggttgag tccagggtga agatgtagag 2820gcaagttttg
gggggatgga gtgggcttgg agggatgctg gcgccttagc aggctgtgct
2880cctgaggtgc ccagtgtctg cgggcacagg aacatgttgc cgagggcatt
tgggtgtggg 2940tggggtgggg aaagggagac agggctgtct cttttaatgg
gtatctgcga gcatgtgatt 3000gtaagagagg aagaagtagg ggaggaagaa
ggcctccttg ggaggtgcgt catcctgagg 3060aaggctgaac aatgagggtc
ttggagagtc aattcagaag cacaaccttg cagagcaggc 3120aaaaacaata
gggcttcttg aggctgcccg ggcactcatg caatcaccat ttcctgctgt
3180gaatgagcct acattttgtt ggggaagaga cgcaacgacg ccaaacgatg
gactctgagt 3240caacgataag atgaaacaaa attaaaacaa agtaggaaat
caagagtggc tgctgtgatg 3300gcgttgcgga gatgatgttt gctttgagaa
ctggacaagt gagcccctga gctgcatctg 3360cacccagagg ctgagccggt
gcacaggact tgcagaggga tgggcctggg cttgtagagc 3420agcacaacgg
ccccaggcct ggaggagcaa gggtgggaag gggggcaggc cagctcctgc
3480caggctggag aaggactcgg acctcaggcc acctgtgcct gggtgattgt
gaacttgtaa 3540caaatgtgat cttatttatg ttttgaaaaa ggcaacacaa
accaacacga gccgtgtgag 3600gatcaggtga cagctgccca aaagctgaca
caaggaacaa gcctggagga gtgaggatgg 3660gtgctgtgaa ggaggttgtg
cagctgggcc cgcagtcgga cctggtgaga tcagaggagg 3720gggtgccacc
agtctgtgga cgaagatgag aagctggaat agagcagaaa acaggaggct
3780gccactctcc atctttccca aagtcactcc aggagcaagg gtgtcattta
ctgaaatgac 3840agactctcca tttcacattt ttcccccaag tgcagagtgc
agggaagcag atgggctaaa 3900tttttagagt cagggttatt aatgtatact
ttacatagta aactttcccc ttttaagtgt 3960gcaggcctga ggtttgccaa
atatgtgtag gcatttaatc accaccacga tcaagatgta 4020gaatattccc
actatcaaaa agtttgctgt gtcccttgat ggtcatgccc cattccacag
4080ccccagcccc agcccctgga gattgctgtc tgctttatgt tccagtggtt
ttatcttttc 4140cagactgtat ggatgtgaat ggaatcagat gtgattccaa
ggtgttttat cttttccaga 4200tgtgaatgga atcagatgta cgaaatccta
tggtaggggg tcttctgagt ctagctcctt 4260ttgtttagcg tgatgcattt
gaaattaatc catgtctcag gcatcaggag ttcatttctt 4320tttctgctga
gtagtatttc attgtatgga tgtactgcaa tttgcctatc cattcacctg
4380ttgatgtaca tttgagattt ttggcaatta tgaataaagc tgctataaac agaca
443566374DNAHomo sapiens 66cttctgtcct ctgggtcccc acaagcctgg
atgaactcaa gatctgactc agtggcacag 60tgaggagacc tttgaggcct cagtgaccat
ccttggactt cacctctcac ggctttcagg 120cagagaggcc ctcccatgcc
cacaacaggc tgagcccagc cttcctcggg gtttgcttcc 180aggcctgact
tttactcccc tttctaagtg tgctcccggg aatgctgtct acttgttgcg
240attttactcc cgtggcctgt gctagctgcc tgcttggccg ttgggactga
agggatgctc 300atccacttgg cacactgact gcaagcctgg caccggcctt
gcctttgttc tcccatgagt 360cctcttgaag gcaa 374671245DNAHomo sapiens
67aaatgtcctc ctgaatgctg gccagacaaa tggaaatctg ccagggttgg gtacccccat
60gacagcagcc agcctgccct cttagtccct gacagctgca gtgacagcat ctgtgattgc
120aaagcgtgac aatttatatc tctcatttca tcacaccatc tatcagcaga
cagtcaggct 180ttaaaaatca atcccacact gactcagtcc ccagcagaga
tggcctctga caacagtatc 240cacactgcag gctggacaag ggccctatta
attttgagac tcagccaaat ttccttctga 300ccctaagctg gtgaatccct
gctcctttgc tttggttggg gttggtgtga gctaaggctg 360tgatcccatt
tgctcctatg gcctccaggt ggcctgggcc tccatgaatg ggccacatgg
420tcatactgaa tgcttgatta cactcagacc tagcagtcgt ctgggcgcag
ctggtttatg 480gatcactttg tcacaatgtt ccatccttcc aggtccccat
ccccgcggtg ggaaaacatt 540gctttaggca gtgctagagg acttcagcag
gcattggcag cttctggatt caggattaga 600acaaagaagg aggagtcaca
gcaaagatag gaacagaagg cagagagaac agacagatgg 660gggtgtttga
gaaggagggc ctttgagacc tcagggagtg ggagacactg gctcgagaat
720aataataatg gcaatttctc tcatctgtgt tttcagggca tggactggaa
ctcccaatac 780ccctgacatg ggctgagtca acgtggtcat gaacatgtga
caggaggcag cagaagttgc 840agagaagagt gaggcacgtt tgaaaaaggc
tgaaaaatgt ttctgtccag gcaagggtgt 900gtgctgaatg actcaaggat
tttttgggca acacaaacca acacgagccg tgtgaggatc 960aggtgacagc
tgcccaaaag ctgacacaag gaacaagcct ggaggagtga ggatgggtgc
1020tgtgaaggag gttgtgcagc tgggcccgca gtcggacctg gtgagatcag
aggagggggt 1080gccaccagtc tgtggacgaa gatgagaagc tggaatagag
cagaaaacag gaggctgcca 1140ctctccatct ttcccaaagt cactccagga
gcaagggtgt catttactga aatgacagac 1200tctccatttc acatttttcc
cccaagtgca gagtgcaggg aagca 124568537DNAHomo sapiens 68aatgcttgat
tacactcaga cctagcagtc gtctgggcgc agctggttta tggatcactt 60tgtcacaatg
ttccatcctt ccaggtcccc atccccgcgg tgggaaaaca ttgctttagg
120cagtgctaga ggacttcagc aggcattggc agcttctgga ttcaggatta
gaacaaagaa 180ggaggagtca cagcaaagat aggaacagaa ggcagagaga
acagacagat gggggtgttt 240gagaaggagg gcctttgaga cctcagggag
tgggagacac tggctcgaga ataataataa 300tggcaatttc tctcatctgt
gttttcaggg catggactgg aactcccaat acccctgaca 360tgggctgagt
caacgtggtc atgaacatgt gacaggaggc agcagaagtt gcagagaaga
420gtgaggcacg tttgaaaaag gctgaaaaat gtttctgtcc aggcaagggt
gtgtgctgaa 480tgactcaagg attttttggc ctctgcctgt gtcctggccc
tcactgcacc cccaaga 537691863DNAHomo sapiens 69cttcctcggg gtttgcttcc
aggcctgact tttactcccc tttctaagtg tgcagatggg 60atgtgcttct ccacaggagg
ccccacggct tccccacccc tcagaggagc gccgtgcgtg 120cgtctgtgtg
gaggattggc agctcctgca gtcggccctt ggtcctattt ggcgacgcct
180ctgccttccc cttaattata cagtcatgag ccgccctgga atcacggcag
ctccggatgg 240atcctggatg ccagaatgca gcctcagcac ggggctgcag
gacaggagtg agcgaggggc 300tgcagagccg gcggccgcgg tgggcaccat
ggagggggct gccctgggca gcacgggcat 360gagtctcaag gcccaggttt
gagtaacagg tgttgagagc ttacttactt ttcctgagac 420acagtttcct
catctcgaga gcacggaaaa tcattctaac ttcagaggat tgttgtgaaa
480gttaaatgag attaaagagg taaagcccat gacgtgctta gctcgtgctt
ggctcttggt 540caatgccagt tagcgctgca ttttctcccc tctccctccc
tccttctctc tttcttttct 600tctattctcc attcctgttt tctcccccac
cccactcccc aaagctctgc gttgagaacc 660agatgctgtc tggtgggtta
gggccagagg aggaaaagct gcccgccgtg ggctgcaccc 720ataccctctt
cattccaatg acatgagggg aggggaaagg acagaggtag actgtcctcc
780cctacctcct cctaatacaa atggaattcc tggaactgga aaacaaagaa
tacccccata 840aaaataagac agtacttctg gtgcggtgta ataaagggga
aagtaaccct caatgtcagg 900aaactccgca cctcccagct catatttgtg
tggaggaaaa gttaaatatt aatttggact 960caactgaatg tggacacaaa
caatggtcac caagtcccgg aacaggttgt gtgagcctct 1020tcaggggttc
atccagcgct gttttggaga aatctctatt tcaatttatt cctatacgtt
1080agttactgaa aaacaacaga caatcgcaaa agcaagttgc ccgttttgtg
ttccttgagc 1140ccaatcatga agtgccgtcg tgactgggcc tcatgacaaa
caacttgtaa caagtaacaa 1200cagagctcag gtcccagacc gcactgaagc
tctgtgagac ctctcctcat ctgtgcatga 1260acgagtgtct gactctggag
cccagcctgc tgcttcccag tctggtggtg aatcctccgt 1320agtctgatgg
aggtttgctc ttgttgccca ggctggagtg caatggcaca atctcggctc
1380actgcagccc ctgcctccca ggctcaagca attcttacgc ctcagcctcc
tgagtagatg 1440gaactacagg gcatggactg gaactcccaa tacccctgac
atgggctgag tcaacgtggt 1500catgaacatg tgacaggagg cagcagaagt
tgcagagaag agtgaggcac gtttgaaaaa 1560ggctgaaaaa tgtttctgtc
caggcaaggg tgtgtgctga atgactcaag gattttttgg 1620gcaacacaaa
ccaacacgag ccgtgtgagg atcaggtgac agctgcccaa aagctgacac
1680aaggaacaag cctggaggag tgaggatggg tgctgtgaag gaggttgtgc
agctgggccc 1740gcagtcggac ctggtgagat cagaggaggg ggtgccacca
gtctgtggac gaagatgaga 1800agctggaata gagcagaaaa caggaggctg
ccactctcca tctttcccaa agtcactcca 1860gga 1863703679DNAHomo sapiens
70gaggcagcca tgactggcca cttcatgtgc tcctggagaa gggcttgcac cagccgtttt
60caggaaagtc aagcagctgt tgactcctga gtctgggtga atttgtgtga agagcataag
120gcgctgtttc ttaaccaaaa cgcttcctct tgcagtgcag atgggatgtg
cttctccaca 180ggaggcccca cggcttcccc acccctcaga ggagcgccgt
gcgtgcgtct gtgtggagga 240ttggcagctc ctgcagtcgg cccttggtcc
tatttggcga cgcctctgcc ttccccttaa 300ttatacagtc atgagccgcc
ctggaatcac ggcagctccg gatggatcct ggatgccaga 360atgcagcctc
agcacggggc tgcaggacag gagtgagcga ggggctgcag agccggcggc
420cgcggtgggc accatggagg gggctgccct gggcagcacg ggcatgagtc
tcaaggccca 480ggtttgagta acaggtgttg agagcttact tacttttcct
gagacacagt ttcctcatct 540cgagagcacg gaaaatcatt ctaacttcag
aggattgttg tgaaagttaa atgagattaa 600agaggtaaag cccatgacgt
gcttagctcg tgcttggctc ttggtcaatg ccagttagcg 660ctgcattttc
tcccctctcc ctccctcctt ctctctttct tttcttctat tctccattcc
720tgttttctcc cccaccccac tccccaaagc tctgcgttga gaaccagatg
ctgtctggtg 780ggttagggcc agaggaggaa aagctgcccg ccgtgggctg
cacccatacc ctcttcattc 840caatgacatg aggggagggg aaaggacaga
ggtagactgt cctcccctac ctcctcctaa 900tacaaatgga attcctggaa
ctggaaaaca aagaataccc ccataaaaat aagacagtac 960ttctggtgcg
gtgtaataaa ggggaaagta accctcaatg tcaggaaact ccgcacctcc
1020cagctcatat ttgtgtggag gaaaagttaa atattaattt ggactcaact
gaatgtggac 1080acaaacaatg gtcaccaagt cccggaacag gttgtgtgag
cctcttcagg ggttcatcca 1140gcgctgtttt ggagaaatct ctatttcaat
ttattcctat acgttagtta ctgaaaaaca 1200acagacaatc gcaaaagcaa
gttgcccgtt ttgtgttcct tgagcccaat catgaagtgc 1260cgtcgtgact
gggcctcatg acaaacaact tgtaacaagt aacaacagag ctcaggtccc
1320agaccgcact gaagctctgt gagacctctc ctcatctgtg catgaacgag
tgtctgactc 1380tggagcccag cctgctgctt cccagtctgg tggtgaatcc
tccgtagtct gatggaggtt 1440tgctcttgtt gcccaggctg gagtgcaatg
gcacaatctc ggctcactgc agcccctgcc 1500tcccaggctc aagcaattct
tacgcctcag cctcctgagt agatggaact acagggcatg 1560gactggaact
cccaataccc ctgacatggg ctgagtcaac gtggtcatga acatgtgaca
1620ggaggcagca gaagttgcag agaagagtga ggcacgtttg aaaaaggctg
aaaaatgttt 1680ctgtccaggc aagggtgtgt gctgaatgac tcaaggattt
tttggagaga attggagtgt 1740ctcaccagag gagaccacgt ctgaagggct
ttgcatccct ccttggacat gtctaatacc 1800taacactcag aaagcatcca
gtaaatattc gtggaaagaa aggagtggag aaggggagaa 1860aggggaaagg
gagtaggcga gagagaagaa agactctgct tcttgcccag ggcctggcat
1920ggggcggagg caaagcagtg gggtcctcag ctatgtccca ctgtgagtgc
acagcgagtc 1980ctgaccttca gagggtgcag cccgaggggc cctggcctgt
ctgaagggtg cgccagccga 2040gtggcctgct ctgaccacca ggctcaccca
tgactacctg ggtggctaca gccagttcct 2100gacaatgagt acagcactca
gttatcgggg cccttccacc cacacgctgt ccacttcctg 2160gggtactgct
gtgggcatgt gagtgcttgc tccccggggc actgctgtgg gcatgcgagt
2220gcttgctccc cggggcactg ctgtccactt cctggggtac tgctgtgggc
atgcgagtgc 2280ttgctccccg gggcactgct gtggacatgt gatagcttgc
tccccagctc cactagtgac 2340actggcggcc cctcgctggg gccttccccg
cctgctccgc tccattaccg ctgccgggct 2400cctcacgtct ctccttgctg
cttcctgcac tggggtgagg agagtggggc tggtcccctt 2460gagaccggag
aagctccagg cttttaagga aaactgccag ggacgaagag aagatatcac
2520ttccccacgt ggttggcttc cagattcaga aggaatgtct gtccttgtgg
attccgtacc 2580agatgacccc agatgctgcc tcagtactag gtccctgtgg
ctctggagcc tttgctgggt 2640ctgggcagtg tctcttcctc tccagttcat
ccttgggtct cttcaccctt gccaggggca 2700ggcttcctgg tgagaggtcg
acctcctgca tgaaggctct caagaggcca gttcaaagcc 2760aagctccggg
tctgtgcctg tggggctgct cctcgatcag gagatggtca ctcccctcct
2820ggtctgtatc tgtgggattc tcctccatca ggagatggtc tctcccctcc
tggtctatac 2880ccgtgggatt ctcctccatc aggagatggt cactccccat
cctggtctat acccgtggga 2940ttctcctcca ttagatggtc actcccctcc
tggtctattc ccgtggggct gctcctccat 3000caggaggtgg tcactccccc
tcctggtcta tacccgtggg gctgctcctc catcaggaga 3060tagtcactcc
ccctcctggt ctatacccgt gggattctcc tccatctgga gatggtcact
3120cccctcctgg tctataccca tgggattctc ctccatctgg agatggtcac
tcccctcctg 3180gtctataccc atgggattct cctccatctg gagatagtca
ctcccctcct ggtctatacc 3240cgtggggttc tcctcaatca ggaggtggtc
actcccctcc tggtctatac ccgtgggatt 3300ctcctccatc aggagatggt
cactctccct cctggtctat acccgtgggg ctgctcctcc 3360atcaggagat
ggtcactccc ctcctggtct atacctgtgg gattctcctc cctcagaaga
3420tggtcactcc ccctcctggt ctatacccat gggattctcc tccctcagaa
gatggtcact 3480ccccctcctg gtctataccc gtggggctgc tcctccatca
ggagatggtc actctccctc 3540tcggttgctc agtccaaaaa caacctctct
ggaaaactgc gtggaatttt tttttaaaga 3600attgaaacta gaactagcat
ttgatccagc catctgccta ctgggaatac acccaaagaa 3660aaataaatca
ttatatcag 3679713620DNAHomo sapiens 71gaggcagcca tgactggcca
cttcatgtgc tcctggagaa gggcttgcac cagccgtttt 60caggaaagtc aagcagctgt
tgactcctga gtctgggtga atttgtgtga agagcataag 120gcgctgtttc
ttaaccaaaa cgcttcctct tgcagtgcag atgggatgtg cttctccaca
180ggaggcccca cggcttcccc acccctcaga ggagcgccgt gcgtgcgtct
gtgtggagga 240ttggcagctc ctgcagtcgg cccttggtcc tatttggcga
cgcctctgcc ttccccttaa 300ttatacagtc atgagccgcc ctggaatcac
ggcagctccg gatggatcct ggatgccaga 360atgcagcctc agcacggggc
tgcaggacag gagtgagcga ggggctgcag agccggcggc 420cgcggtgggc
accatggagg gggctgccct gggcagcacg ggcatgagtc tcaaggccca
480ggtttgagta acaggtgttg agagcttact tacttttcct gagacacagt
ttcctcatct 540cgagagcacg gaaaatcatt ctaacttcag aggattgttg
tgaaagttaa atgagattaa 600agaggtaaag cccatgacgt gcttagctcg
tgcttggctc ttggtcaatg ccagttagcg 660ctgcattttc tcccctctcc
ctccctcctt ctctctttct tttcttctat tctccattcc 720tgttttctcc
cccaccccac tccccaaagc tctgcgttga gaaccagatg ctgtctggtg
780ggttagggcc agaggaggaa aagctgcccg ccgtgggctg cacccatacc
ctcttcattc 840caatgacatg aggggagggg aaaggacaga ggtagactgt
cctcccctac ctcctcctaa 900tacaaatgga attcctggaa ctggaaaaca
aagaataccc ccataaaaat aagacagtac 960ttctggtgcg gtgtaataaa
ggggaaagta accctcaatg tcaggaaact ccgcacctcc 1020cagctcatat
ttgtgtggag gaaaagttaa atattaattt ggactcaact gaatgtggac
1080acaaacaatg gtcaccaagt cccggaacag gttgtgtgag cctcttcagg
ggttcatcca 1140gcgctgtttt ggagaaatct ctatttcaat ttattcctat
acgttagtta ctgaaaaaca 1200acagacaatc gcaaaagcaa gttgcccgtt
ttgtgttcct tgagcccaat catgaagtgc 1260cgtcgtgact gggcctcatg
acaaacaact tgtaacaagt aacaacagag ctcaggtccc 1320agaccgcact
gaagctctgt gagacctctc ctcatctgtg catgaacgag tgtctgactc
1380tggagcccag cctgctgctt cccagtctgg tggtgaatcc tccgtagtct
gatggaggtt 1440tgctcttgtt gcccaggctg gagtgcaatg gcacaatctc
ggctcactgc agcccctgcc 1500tcccaggctc aagcaattct tacgcctcag
cctcctgagt agatggaact acagggcatg 1560gactggaact cccaataccc
ctgacatggg ctgagtcaac gtggtcatga acatgtgaca 1620ggaggcagca
gaagttgcag agaagagtga ggcacgtttg aaaaaggctg aaaaatgttt
1680ctgtccaggc aagggtgtgt gctgaatgac tcaaggattt tttggagaga
attggagtgt 1740ctcaccagag gagaccacgt ctgaagggct ttgcatccct
ccttggacat gtctaatacc 1800taacactcag aaagcatcca gtaaatattc
gtggaaagaa aggagtggag aaggggagaa 1860aggggaaagg gagtaggcga
gagagaagaa agactctgct tcttgcccag ggcctggcat 1920ggggcggagg
caaagcagtg gggtcctcag ctatgtccca ctgtgagtgc acagcgagtc
1980ctgaccttca gagggtgcag cccgaggggc cctggcctgt ctgaagggtg
cgccagccga 2040gtggcctgct ctgaccacca ggctcaccca tgactacctg
ggtggctaca gccagttcct 2100gacaatgagt acagcactca gttatcgggg
cccttccacc cacacgctgt ccacttcctg 2160gggtactgct gtgggcatgt
gagtgcttgc tccccggggc actgctgtgg gcatgcgatg 2220cttgctcccc
ggggcactgc tgtggacatg tgatagcttg ctccccagct ccactagtga
2280cactggcggc ccctcgctgg ggccttcccc gcctgctccg ctccattacc
gctgccgggc 2340tcctcacgtc tctccttgct gcttcctgca ctggggtgag
gagagtgggg ctggtcccct 2400tgagaccgga gaagctccag gcttttaagg
aaaactgcca gggacgaaga gaagatatca 2460cttccccacg tggttggctt
ccagattcag aaggaatgtc tgtccttgtg gattccgtac 2520cagatgaccc
cagatgctgc ctcagtacta ggtccctgtg gctctggagc ctttgctggg
2580tctgggcagt gtctcttcct ctccagttca tccttgggtc tcttcaccct
tgccaggggc 2640aggcttcctg gtgagaggtc gacctcctgc atgaaggctc
tcaagaggcc agttcaaagc 2700caagctccgg gtctgtgcct gtggggctgc
tcctcgatca ggagatggtc actcccctcc 2760tggtctgtat ctgtgggatt
ctcctccatc aggagatggt ctctcccctc ctggtctata 2820cccgtgggat
tctcctccat caggagatgg tcactcccca tcctggtcta tacccgtggg
2880attctcctcc attagatggt cactcccctc ctggtctatt cccgtggggc
tgctcctcca 2940tcaggaggtg gtcactcccc ctcctggtct atacccgtgg
ggctgctcct ccatcaggag 3000atagtcactc cccctcctgg tctatacccg
tgggattctc ctccatctgg agatggtcac 3060tcccctcctg gtctataccc
atgggattct cctccatctg gagatggtca ctcccctcct 3120ggtctatacc
catgggattc tcctccatct ggagatagtc actcccctcc tggtctatac
3180ccgtggggtt ctcctcaatc aggaggtggt cactcccctc ctggtctata
cccgtgggat 3240tctcctccat caggagatgg tcactctccc tcctggtcta
tacccgtggg gctgctcctc 3300catcaggaga tggtcactcc cctcctggtc
tatacctgtg ggattctcct ccctcagaag 3360atggtcactc cccctcctgg
tctataccca tgggattctc ctccctcaga agatggtcac 3420tccccctcct
ggtctatacc cgtggggctg ctcctccatc aggagatggt cactctccct
3480ctcggttgct cagtccaaaa acaacctctc tggaaaactg cgtggaattt
ttttttaaag 3540aattgaaact agaactagca tttgatccag ccatctgcct
actgggaata cacccaaaga 3600aaaataaatc attatatcag 362072720DNAHomo
sapiens 72ctggagctct cctgaggagc ccacagatcc aactccctag gccaaggctg
cagcctgggg 60cagagatgca gaggcctgga ggagcctagg gcacgcggcc tgcgggctgg
ctggggttca 120gagttcgtat gtgtgtggag tgactgggca ggtgttcaga
aatgaaggct ggcactgcca 180ggtaaggccc ttccctccct gatgtgagag
ccctggagcc accgcagagg cccagtcaga 240tctctgttct aattctggcc
tggtgtggag gatgaggaga gacggcccag aaaggaaggc 300agactgtgca
gaccccatgt cttctggccc gcgaggccct cctcctgtgc ctgcttatct
360taaagaatcc gggataagag gtgacttggg ccttggccgg gaggcccctc
ctcagcttca 420gacaaggagg gagctctggg catgaggaca ttgagcaaga
ggcgatggca gtgcccacaa 480cttaccctca gctcggctct gttgggtccg
agaagttgca tggaaagggc tccttggggg 540ccagttgtca gtaagctgca
gaagcctgga gccggccagg aaataaccac gtgtaggagc 600cttctcagct
gagaggaagg aggactcacg cgcggcgagc acatgcttgg agccaggcac
660agggcatgga ctggaactcc caatacccct gacatgggct gagtcaacgt
ggtcatgaac 720734387DNAHomo sapiens 73caaatgcctg gcagcgtcct
cggtgcttca cctgccatag ccgacagtgg ctgacctccc 60atgcctgttg ccttttcttt
ctgttggatc agggatacac tgccatgtgt gttaagaaaa 120gctggcctta
cctacagggc tggccagtcc cggtcacgtt tctagtaagc cattgcctta
180cataagggta acggcatggg acgctatctt agccaatgtg ataaaagtgg
acatgaggtg 240agaggcttca gagagaggtt ttaaaaaaga gacaaaagca
ggacgttgcc tctcttcctc 300ctctccacgt gtcctacccg gatgtgaagc
caaaacagat gcaggcttag tgcaaccatg 360gggaacccag cataagcaca
gattcaacag cagaagagtg gcagagggag aaggtgaaag 420gaacctaggt
tttcctgtcc ttgttgagtc attcagttaa aaatccctgg aattttcctc
480tctccggcag tgtgttttgt gggataatga gttgccttat tggggttggc
ttgctagtcg 540ggatgtttcg ctcccatcaa catccatacg cttgctctgt
gaaccaatga cctgatgagg 600tagtattagc accaccatca ttatgctgag
gatgagattt atggcacagt ggttcagtag 660cttgcccaag gccatgcggc
tggtaggttc tggaggaggg ctcagggcac cccctgagct 720acccctgctg
gccattgcac caccccataa agctgctggc agtcacttct ctgaggggtt
780agcatgtaag aaatgtcctc ctgaatgctg gccagacaaa tggaaatctg
ccagggttgg 840gtacccccat gacagcagcc agcctgccct cttagtccct
gacagctgca gtgacagcat 900ctgtgattgc aaagcgtgac aatttatatc
tctcatttca tcacaccatc tatcagcaga 960cagtcaggct ttaaaaatca
atcccacact gactcagtcc
ccagcagaga tggcctctga 1020caacagtatc cacactgcag gctggacaag
ggccctatta attttgagac tcagccaaat 1080ttccttctga ccctaagctg
gtgaatccct gctcctttgc tttggttggg gttggtgtga 1140gctaaggctg
tgatcccatt tgctcctatg gcctccaggt ggcctgggcc tccatgaatg
1200ggccacatgg tcatactgaa tgcttgatta cactcagacc tagcagtcgt
ctgggcgcag 1260ctggtttatg gatcactttg tcacaatgtt ccatccttcc
aggtccccat ccccgcggtg 1320ggaaaacatt gctttaggca gtgctagagg
acttcagcag gcattggcag cttctggatt 1380caggattaga acaaagaagg
aggagtcaca gcaaagatag gaacagaagg cagagagaac 1440agacagatgg
gggtgtttga gaaggagggc ctttgagacc tcagggagtg ggagacactg
1500gctcgagaat aataataatg gcaatttctc tcatctgtgt tttcagggca
tggactggaa 1560ctcccaatac ccctgacatg ggctgagtca acgtggtcat
gaacatgtga caggaggcag 1620cagaagttgc agagaagagt gaggcacgtt
tgaaaaaggc tgaaaaatgt ttctgtccag 1680gcaagggtgt gtgctgaatg
actcaaggat tttttggtgc attgaatgaa cagcgggaca 1740ttggacacct
gctgatccat caccccgggc ccgggcaggc ccgtggatga agagagatgg
1800agaagaccag gcatgagact gtggagaagc cacaccacca gaaacccctg
ccccatgcgc 1860cgtccagccc acacctgtgg atgcacgggg gattgcaggc
agggctccca ccgtggactc 1920aggaacaggc agggaagctg ctgcctcacc
aggcgaaggg gccaggaggg ggaggcggag 1980aggcccgtct agcccctgcg
gctgtcaccg tggtgcctcc tcactggcca gtgcggtcgc 2040gcctcagctt
cgttaatagg ggagggggcc taagagtttt cacgtccagg ctcgggcagt
2100ggggaggcag gcaggagtgg ccgctggttt ttcagacctc ccagggaggc
cgaggaaatg 2160gcccgtcctg gagtgggcgt ggttctgtct tcagatggat
gctggagggt tgggctgcgt 2220gggaccctgg gccctgctgc ttcccggagg
atgcgctgtc cggggctgca caggttggct 2280gtgttttttg gatgcttgat
attttgtttt ttcttctctt cactctgtca tgaaactggc 2340aatagtagtt
tgtaaataaa tatgtgttat agatgaatat ttgctatgag taaattaata
2400aaggagtgaa taaatgagcg attgatgtag ggcctgtcct gtctcaggga
gccccacgaa 2460ggcctgcgcg ccggccagag cctgcctgcc tgccagggta
ctgggacgtc actctcaaag 2520cggcgggacc cagccgctga tcttgctgag
gaggcccggt ctcagaaaac tgagcggctg 2580cttctgcaga ccctgcatcc
tcccctccct ggagaaagaa gctctggctg agtcctggga 2640ccgaaccctt
gggtgccaca gaaacgggct ttgctgcctg tcagtcaagc ggcgggagaa
2700acagacctgg ggaggaggag gctgggaggg ctgtgttttc tgcacagcga
gtagctcctt 2760agcctggtgc catttctctc caaacaccct gaaggttgag
tccagggtga agatgtagag 2820gcaagttttg gggggatgga gtgggcttgg
agggatgctg gcgccttagc aggctgtgct 2880cctgaggtgc ccagtgtctg
cgggcacagg aacatgttgc cgagggcatt tgggtgtggg 2940tggggtgggg
aaagggagac agggctgtct cttttaatgg gtatctgcga gcatgtgatt
3000gtaagagagg aagaagtagg ggaggaagaa ggcctccttg ggaggtgcgt
catcctgagg 3060aaggctgaac aatgagggtc ttggagagtc aattcagaag
cacaaccttg cagagcaggc 3120aaaaacaata gggcttcttg aggctgcccg
ggcactcatg caatcaccat ttcctgctgt 3180gaatgagcct acattttgtt
ggggaagaga cgcaacgacg ccaaacgatg gactctgagt 3240caacgataag
atgaaacaaa attaaaacaa agtaggaaat caagagtggc tgctgtgatg
3300gcgttgcgga gatgatgttt gctttgagaa ctggacaagt gagcccctga
gctgcatctg 3360cacccagagg ctgagccggt gcacaggact tgcagaggga
tgggcctggg cttgtagagc 3420agcacaacgg ccccaggcct ggaggagcaa
gggtgggaag gggggcaggc cagctcctgc 3480caggctggag aaggactcgg
acctcaggcc acctgtgcct gggtgattgt gaacttgtaa 3540caaatgtgat
cttatttatg ttttgaaaaa ggcaacacaa accaacacga gccgtgtgag
3600gatcaggtga cagctgccca aaagctgaca caaggaacaa gcctggagga
gtgaggatgg 3660gtgctgtgaa ggaggttgtg cagctgggcc cgcagtcgga
cctggtgaga tcagaggagg 3720gggtgccacc agtctgtgga cgaagatgag
aagctggaat agagcagaaa acaggaggct 3780gccactctcc atctttccca
aagtcactcc aggagcaagg gtgtcattta ctgaaatgac 3840agactctcca
tttcacattt ttcccccaag tgcagagtgc agggaagcag atgggctaaa
3900tttttagagt cagggttatt aatgtatact ttacatagta aactttcccc
ttttaagtgt 3960gcaggcctga ggtttgccaa atatgtgtag gcatttaatc
accaccacga tcaagatgta 4020gaatattccc actatcaaaa agtttgctgt
gtcccttgat ggtcatgccc cattccacag 4080ccccagcccc agcccctgga
gattgctgtc tgctttatgt tccagtggtt ttatcttttc 4140cagactgtat
ggatgtgaat ggaatcagat gtgattccaa ggtgttttat cttttccaga
4200tgtgaatgga atcagatgta cgaaatccta tggtaggggg tcttctgagt
ctagctcctt 4260ttgtttagcg tgatgcattt gaaattaatc catgtctcag
gcatcaggag ttcatttctt 4320tttctgctga gtagtatttc attgtatgga
tgtactgcaa tttgcctatc cattcacctg 4380ttgatgt 4387741398DNAHomo
sapiens 74cgggatgttt cgctcccatc aacatccata cgcttgctct gtgaaccaat
gacctgatga 60ggtagtatta gcaccaccat cattatgctg aggatgagat ttatggcaca
gtggttcagt 120agcttgccca aggccatgcg gctggtaggt tctggaggag
ggctcagggc accccctgag 180ctacccctgc tggccattgc accaccccat
aaagctgctg gcagtcactt ctctgagggg 240ttagcatgta agaaatgtcc
tcctgaatgc tggccagaca aatggaaatc tgccagggtt 300gggtaccccc
atgacagcag ccagcctgcc ctcttagtcc ctgacagctg cagtgacagc
360atctgtgatt gcaaagcgtg acaatttata tctctcattt catcacacca
tctatcagca 420gacagtcagg ctttaaaaat caatcccaca ctgactcagt
ccccagcaga gatggcctct 480gacaacagta tccacactgc aggctggaca
agggccctat taattttgag actcagccaa 540atttccttct gaccctaagc
tggtgaatcc ctgctccttt gctttggttg gggttggtgt 600gagctaaggc
tgtgatccca tttgctccta tggcctccag gtggcctggg cctccatgaa
660tgggccacat ggtcatactg aatgcttgat tacactcaga cctagcagtc
gtctgggcgc 720agctggttta tggatcactt tgtcacaatg ttccatcctt
ccaggtcccc atccccgcgg 780tgggaaaaca ttgctttagg cagtgctaga
ggacttcagc aggcattggc agcttctgga 840ttcaggatta gaacaaagaa
ggaggagtca cagcaaagat aggaacagaa ggcagagaga 900acagacagat
gggggtgttt gagaaggagg gcctttgaga cctcagggag tgggagacac
960tggctcgaga ataataataa tggcaatttc tctcatctgt gttttcaggg
catggactgg 1020aactcccaat acccctgaca tgggctgagt caacgtggtc
atgaacatgt gacaggaggc 1080agcagaagtt gcagagaaga gtgaggcacg
tttgaaaaag gctgaaaaat gtttctgtcc 1140aggcaagggt gtgtgctgaa
tgactcaagg attttttggg tatgtcattt cccatttctc 1200accctcaaat
aggactccgc ttcccatcta agcatttgta taaatattga ttattggtta
1260gtgtgtatca gagagctatt gagtaaaaat tatatcagaa aaattaagaa
tctctagaga 1320tggcaaggtg tgaaacaaaa aacgccagga aggtaaatgc
tcaaagttca ccacacacca 1380cagtgagaag tgttgggg 139875939DNAHomo
sapiens 75acagcatctg tgattgcaaa gcgtgacaat ttatatctct catttcatca
caccatctat 60cagcagacag tcaggcttta aaaatcaatc ccacactgac tcagtcccca
gcagagatgg 120cctctgacaa cagtatccac actgcaggct ggacaagggc
cctattaatt ttgagactca 180gccaaatttc cttctgaccc taagctggtg
aatccctgct cctttgcttt ggttggggtt 240ggtgtgagct aaggctgtga
tcccatttgc tcctatggcc tccaggtggc ctgggcctcc 300atgaatgggc
cacatggtca tactgaatgc ttgattacac tcagacctag cagtcgtctg
360ggcgcagctg gtttatggat cactttgtca caatgttcca tccttccagg
tccccatccc 420cgcggtggga aaacattgct ttaggcagtg ctagaggact
tcagcaggca ttggcagctt 480ctggattcag gattagaaca aagaaggagg
agtcacagca aagataggaa cagaaggcag 540agagaacaga cagatggggg
tgtttgagaa ggagggcctt tgagacctca gggagtggga 600gacactggct
cgagaataat aataatggca atttctctca tctgtgtttt cagggcatgg
660actggaactc ccaatacccc tgacatgggc tgagtcaacg tggtcatgaa
catgtgacag 720gaggcagcag aagttgcaga gaagagtgag gcacgtttga
aaaaggctga aaaatgtttc 780tgtccaggca agggtgtgtg ctgaatgact
caaggatttt ttggctgatt tagtaaacaa 840acaagaatga agaaggaaac
catagctgag tggcagagcg tgcctggctg tttacacagg 900actccagggc
agggctcctg gagagggacg tgccagagg 93976138DNAArtificial
Sequenceprimer 76ctggaaagga ggagaacatg aaacattgct tgaagacaat
ggccgagaca gcaggtccca 60ccctgcacag ccaccagcat ctctcccctc agccctgtct
cctcttctgc agttgggatc 120tgcacattta agcctgaa 13877147DNAArtificial
Sequenceprimer 77attgtcctgt gaagtgaagt atgatcggac agcctctttt
cagcttttat gacaatggag 60acagaggaat tgtggctctt gccaaggtca caggattgga
atacagagcc aagccacccc 120aggacatgca agagcctcag aagggaa
1477819DNAArtificial Sequenceprimer 78acagccacca gcatctctc
197927DNAArtificial Sequenceprimer 79tgaagtgaag tatgatcgga cagcctc
278022DNAArtificial Sequenceprimer 80ccacaattcc tctgtctcca tt
228190DNAArtificial Sequenceprimer 81tctctcatct gtgttttcag
ggcatggact ggaactccca atacccctga catgggctga 60gtcaacgtgg tcatgaacat
gtgacaggag 9082101DNAArtificial Sequenceprimer 82gcagcagaag
ttgcagagaa gagtgaggca cgtttgaaaa aggctgaaaa atgtttctgt 60ccaggcaagg
gtgtgtgctg aatgactcaa ggattttttg g 1018321DNAArtificial
Sequenceprimer 83catggactgg aactcccaat a 218425DNAHomo sapiens
84tgcagagaag agtgaggcac gtttg 258521DNAArtificial Sequenceprimer
85ccttgcctgg acagaaacat t 21861434DNAHomo sapiens 86gttcgttgca
acaaattgat gagcaatgct tttttataat gccaactttg tacaaaaaag 60ttggcatgag
ccggtcaagg cacctgggca aaatccggaa gcgtctggaa gatgtcaaga
120gccagtgggt ccggccagcc agggctgact ttagtgacaa cgagagtgcc
cggctggcca 180cggacgccct cttggatggg ggttctgaag cctactggcg
ggtgctcagc caggaaggcg 240aggtggactt cttgtcctcg gtggaggccc
agtacatcca ggcccaggcc agggagcccc 300cgtgtccccc agacaccctg
ggaggggcgg aagcaggccc taagggactg gactccagct 360ccctacagtc
cggcacctac ttccctgtgg cctcagaggg cagcgagccg gccctactgc
420acagctgggc ctcagctgag aagccctacc tgaaggaaaa atccagcgcc
actgtgtact 480tccagaccgt caagcacaac aacatcagag acctcgtccg
ccgctgcatc acccggacta 540gccaggtcct ggtcatcctg atggatgtgt
tcacggatgt ggagatcttc tgtgacattc 600tagaggcagc caacaagcgt
ggggtgttcg tttgtgtgct cctggaccag ggaggtgtga 660agctcttcca
ggagatgtgt gacaaagtcc agatctctga cagtcacctc aagaacattt
720ccatccggag tgtggaagga gagatatact gtgccaagtc aggcaggaaa
ttcgctggcc 780aaatccggga gaagttcatc atctcggact ggagatttgt
cctgtctgga tcttacagct 840tcacctggct ctgcggacac gtgcaccgga
acatcctctc caagttcaca ggccaggcgg 900tggagctgtt tgacgaggag
ttccgccacc tctacgcctc ctccaagcct gtgatgggcc 960tgaagtcccc
gcggctggtc gcccccgtcc cgcccggagc agccccggcc aatggccgcc
1020ttagcagcag cagtggctcc gccagtgacc gcacgtcctc caaccccttc
agcggccgct 1080cggcaggcag ccaccccggt acccgaagtg tgtccgcgtc
ttcagggccc tgtagccccg 1140cggccccaca cccgcctcca ccgccccggt
tccagcccca ccaaggccct tggggagccc 1200cgagtcccca ggcccacctc
tccccgcggc cccacgacgg cccgcccgcc gctgtctaca 1260gcaacctggg
ggcctacagg cccacgcggc tgcagctgga gcagctgggc ctggtgccga
1320ggctgactcc aacctggagg cccttcctgc aggcctcccc tcacttctgc
ccaactttct 1380tgtacaaagt tggcattata agaaagcatt gcttatcaat
ttgttgcaac gaac 1434873535DNAHomo sapiens 87gcggccgcgg cgccgatccc
ggctgaggcg cagcggcgag aggtcgcggg cagggccatg 60gccccggggg gccgctagcg
cggaccggcc caacgggagc cgctccgtgc cgccgccgcc 120gcccgggcgc
ccaggccccg ccgctgcgga agaggtttct agagagtgga gcctgcttcc
180tgggccctag gcccctccca caatgcttgt cgccggtctt cttctctggg
cttccctact 240gaccggggcc tggccatcct tccccaccca ggaccacctc
ccggccacgc cccgggtccg 300gctctcattc aaagagctga aggccacagg
caccgcccac ttcttcaact tcctgctcaa 360cacaaccgac taccgaatct
tgctcaagga cgaggaccac gaccgcatgt acgtgggcag 420caaggactac
gtgctgtccc tggacctgca cgacatcaac cgcgagcccc tcattataca
480ctgggcagcc tccccacagc gcatcgagga atgcgtgctc tcaggcaagg
atgtcaacgg 540cgagtgtggg aacttcgtca ggctcatcca gccctggaac
cgaacacacc tgtatgtgtg 600cgggacaggt gcctacaacc ccatgtgcac
ctatgtgaac cgcggacgcc gcgcccaggc 660cacaccatgg acccagactc
aggcggtcag aggccgcggc agcagagcca cggatggtgc 720cctccgcccg
atgcccacag ccccacgcca ggattacatc ttctacctgg agcctgagcg
780actcgagtca gggaagggca agtgtccgta cgatcccaag ctggacacag
catcggccct 840catcaatgag gagctctatg ctggtgtgta catcgatttt
atgggcactg atgcagccat 900cttccgcaca cttggaaagc agacagccat
gcgcacggat cagtacaact cccggtggct 960gaacgacccg tcgttcatcc
atgctgagct cattcctgac agtgcggagc gcaatgatga 1020taagctttac
ttcttcttcc gtgagcggtc ggcagaggcg ccgcagagcc ccgcggtgta
1080cgcccgcatc gggcgcattt gcctgaacga tgacggtggt cactgttgcc
tggtcaacaa 1140gtggagcaca ttcctgaagg cgcggctcgt ctgctctgtc
ccgggcgagg atggcattga 1200gactcacttt gatgagctcc aggacgtgtt
tgtccagcag acccaggacg tgaggaaccc 1260tgtcatttac gctgtcttta
cctcctctgg ctccgtgttc cgaggctctg ccgtgtgtgt 1320ctactccatg
gctgatattc gcatggtctt caacgggccc tttgcccaca aagaggggcc
1380caactaccag tggatgccct tctcagggaa gatgccctac ccacggccgg
gcacgtgccc 1440tggtggaacc ttcacgccat ctatgaagtc caccaaggat
tatcctgatg aggtgatcaa 1500cttcatgcgc agccacccac tcatgtacca
ggccgtgtac cctctgcagc ggcggcccct 1560ggtagtccgc acaggtgctc
cctaccgcct taccactatt gccgtggacc aggtggatgc 1620agccgacggg
cgctatgagg tgcttttcct gggcacagac cgcgggacag tgcagaaggt
1680cattgtgctg cccaaggatg accaggagtt ggaggagctc atgctggagg
aggtggaggt 1740cttcaaggat ccagcacccg tcaagaccat gaccatctct
tctaagaggc aacaactcta 1800cgtggcgtca gccgtgggtg tcacacacct
gagcctgcac cgctgccagg cgtatggggc 1860tgcctgtgct gactgctgcc
ttgcccggga cccttactgt gcctgggatg gccaggcctg 1920ctcccgctat
acagcatcct ccaagaggcg gagccgccgg caggacgtcc ggcacggaaa
1980ccccatcagg cagtgccgtg ggttcaactc caatgccaac aagaatgccg
tggagtctgt 2040gcagtatggc gtggccggca gcgcagcctt ccttgagtgc
cagccccgct cgccccaagc 2100cactgttaag tggctgttcc agcgagatcc
tggtgaccgg cgccgagaga ttcgtgcaga 2160ggaccgcttc ctgcgcacag
agcagggctt gttgctccgt gcactgcagc tcagcgatcg 2220tggcctctac
tcctgcacag ccactgagaa caactttaag cacgtcgtca cacgagtgca
2280gctgcatgta ctgggccggg acgccgtcca tgctgccctc ttcccaccac
tgtccatgag 2340cgccccgcca cccccaggcg caggcccccc aacgcctcct
taccaggagt tagcccagct 2400gctggcccag ccagaagtgg gcctcatcca
ccagtactgc cagggttact ggcgccatgt 2460gccccccagc cccagggagg
ctccaggggc accccggtct cctgagcccc aggaccagaa 2520aaagccccgg
aaccgccggc accaccctcc ggacacatga ggccagctgc ctgtgcctgc
2580catgggccag cctagccctt gtccctttta atataaaaga tatatatata
tatatatata 2640tataaaatat ctatattcta tacacaccct gcccctgcaa
agacagtatt tattggtggg 2700ttgaatatag cctgcctcag tggcagcatc
ctccaaaact tagacccatg ctggtcagag 2760acggcagaaa acagagcctg
cctaaccagg cccagccagt tggtggggcc aggccaggac 2820cacacagtcc
ccagactcag ctggaagtct acctgctgga cagcctccgc caagatctac
2880aggacaaagg gagggagcaa gccctactcg gatggggcac ggactgtcca
ccttttctga 2940tgtgtgttgt cagcctgtgc tgtggcatag acatggatgc
gaggaccact ttggagactg 3000gggtggcctc aagagcacac agagaaggga
agaaggggcc atcacaggat gccagcccct 3060gcctgggttg ggggcactca
gccacgacca gccccttcct gggtatttat tctctattta 3120ttggggatag
gagaagaggc atcctgcctg ggtgggacag cctcttcagc cccttctccc
3180ctccccgcct ggccagggca gggccacccc actctacctc cttagctttc
cctgtgccac 3240tttgactcag aggctgggag catagcagag gggccaggcc
caggcagagc tgacgggagg 3300ccccagctct gaggggaggg ggtccgtggt
agaggcctgg ggccggtaga ggctccccag 3360ggctccctta tgtccaccac
ttcaggggat gggtgtggat gtaattagct ctggggggca 3420gttgggtaga
tgggtggggg ctcctggtgg ccttctgctg cccaggccac agccgccttt
3480gggttccatc ttgctaataa acactggctc tgggactaga aaaaaaaaaa aaaaa
3535883558DNAHomo sapiens 88ctgactggtg ctccctctct tccatcttgg
gctgtctgca tgtgtctcat tcccccactc 60tctcctgtgc ctcccctcta ccgtaataat
caggtccagg tttctctgta ctgggagaag 120acctgtggct ggagcaggca
gggatgcacc ctatctgttc cccattcctc caggtgggag 180ggagaaggag
taacccactt tattggccac agatgcaggg gagaaaggag aaagcatgct
240gggagctgga aagagcccta agatcacctg gtttctagag agtggagcct
gcttcctggg 300ccctaggccc ctcccacaat gcttgtcgcc ggtcttcttc
tctgggcttc cctactgacc 360ggggcctggc catccttccc cacccaggac
cacctcccgg ccacgccccg ggtccggctc 420tcattcaaag agctgaaggc
cacaggcacc gcccacttct tcaacttcct gctcaacaca 480accgactacc
gaatcttgct caaggacgag gaccacgacc gcatgtacgt gggcagcaag
540gactacgtgc tgtccctgga cctgcacgac atcaaccgcg agcccctcat
tatacactgg 600gcagcctccc cacagcgcat cgaggaatgc gtgctctcag
gcaaggatgt caacggcgag 660tgtgggaact tcgtcaggct catccagccc
tggaaccgaa cacacctgta tgtgtgcggg 720acaggtgcct acaaccccat
gtgcacctat gtgaaccgcg gacgccgcgc ccaggattac 780atcttctacc
tggagcctga gcgactcgag tcagggaagg gcaagtgtcc gtacgatccc
840aagctggaca cagcatcggc cctcatcaat gaggagctct atgctggtgt
gtacatcgat 900tttatgggca ctgatgcagc catcttccgc acacttggaa
agcagacagc catgcgcacg 960gatcagtaca actcccggtg gctgaacgac
ccgtcgttca tccatgctga gctcattcct 1020gacagtgcgg agcgcaatga
tgataagctt tacttcttct tccgtgagcg gtcggcagag 1080gcgccgcaga
gccccgcggt gtacgcccgc atcgggcgca tttgcctgaa cgatgacggt
1140ggtcactgtt gcctggtcaa caagtggagc acattcctga aggcgcggct
cgtctgctct 1200gtcccgggcg aggatggcat tgagactcac tttgatgagc
tccaggacgt gtttgtccag 1260cagacccagg acgtgaggaa ccctgtcatt
tacgctgtct ttacctcctc tggctccgtg 1320ttccgaggct ctgccgtgtg
tgtctactcc atggctgata ttcgcatggt cttcaacggg 1380ccctttgccc
acaaagaggg gcccaactac cagtggatgc ccttctcagg gaagatgccc
1440tacccacggc cgggcacgtg ccctggtgga accttcacgc catctatgaa
gtccaccaag 1500gattatcctg atgaggtgat caacttcatg cgcagccacc
cactcatgta ccaggccgtg 1560taccctctgc agcggcggcc cctggtagtc
cgcacaggtg ctccctaccg ccttaccact 1620attgccgtgg accaggtgga
tgcagccgac gggcgctatg aggtgctttt cctgggcaca 1680gaccgcggga
cagtgcagaa ggtcattgtg ctgcccaagg atgaccagga gttggaggag
1740ctcatgctgg aggaggtgga ggtcttcaag gatccagcac ccgtcaagac
catgaccatc 1800tcttctaaga ggcaacaact ctacgtggcg tcagccgtgg
gtgtcacaca cctgagcctg 1860caccgctgcc aggcgtatgg ggctgcctgt
gctgactgct gccttgcccg ggacccttac 1920tgtgcctggg atggccaggc
ctgctcccgc tatacagcat cctccaagag gcggagccgc 1980cggcaggacg
tccggcacgg aaaccccatc aggcagtgcc gtgggttcaa ctccaatgcc
2040aacaagaatg ccgtggagtc tgtgcagtat ggcgtggccg gcagcgcagc
cttccttgag 2100tgccagcccc gctcgcccca agccactgtt aagtggctgt
tccagcgaga tcctggtgac 2160cggcgccgag agattcgtgc agaggaccgc
ttcctgcgca cagagcaggg cttgttgctc 2220cgtgcactgc agctcagcga
tcgtggcctc tactcctgca cagccactga gaacaacttt 2280aagcacgtcg
tcacacgagt gcagctgcat gtactgggcc gggacgccgt ccatgctgcc
2340ctcttcccac cactgtccat gagcgccccg ccacccccag gcgcaggccc
cccaacgcct 2400ccttaccagg agttagccca gctgctggcc cagccagaag
tgggcctcat ccaccagtac 2460tgccagggtt actggcgcca tgtgcccccc
agccccaggg aggctccagg ggcaccccgg 2520tctcctgagc cccaggacca
gaaaaagccc cggaaccgcc ggcaccaccc tccggacaca 2580tgaggccagc
tgcctgtgcc tgccatgggc cagcctagcc cttgtccctt ttaatataaa
2640agatatatat atatatatat atatataaaa tatctatatt ctatacacac
cctgcccctg 2700caaagacagt atttattggt gggttgaata tagcctgcct
cagtggcagc atcctccaaa 2760acttagaccc atgctggtca gagacggcag
aaaacagagc ctgcctaacc aggcccagcc 2820agttggtggg gccaggccag
gaccacacag tccccagact cagctggaag tctacctgct
2880ggacagcctc cgccaagatc tacaggacaa agggagggag caagccctac
tcggatgggg 2940cacggactgt ccaccttttc tgatgtgtgt tgtcagcctg
tgctgtggca tagacatgga 3000tgcgaggacc actttggaga ctggggtggc
ctcaagagca cacagagaag ggaagaaggg 3060gccatcacag gatgccagcc
cctgcctggg ttgggggcac tcagccacga ccagcccctt 3120cctgggtatt
tattctctat ttattgggga taggagaaga ggcatcctgc ctgggtggga
3180cagcctcttc agccccttct cccctccccg cctggccagg gcagggccac
cccactctac 3240ctccttagct ttccctgtgc cactttgact cagaggctgg
gagcatagca gaggggccag 3300gcccaggcag agctgacggg aggccccagc
tctgagggga gggggtccgt ggtagaggcc 3360tggggccggt agaggctccc
cagggctccc ttatgtccac cacttcaggg gatgggtgtg 3420gatgtaatta
gctctggggg gcagttgggt agatgggtgg gggctcctgg tggccttctg
3480ctgcccaggc cacagccgcc tttgggttcc atcttgctaa taaacactgg
ctctgggact 3540agaaaaaaaa aaaaaaaa 3558893274DNAHomo sapiens
89cccgcgcggc tctgagcgcc ccgtcccgcc ggcggccgcg agaccagagc gagcgaacga
60accgcggcgg tccggagagc cccgagcgca gcgcaggacc tgggaccacc tcccggccac
120gccccgggtc cggctctcat tcaaagagct gaaggccaca ggcaccgccc
acttcttcaa 180cttcctgctc aacacaaccg actaccgaat cttgctcaag
gacgaggacc acgaccgcat 240gtacgtgggc agcaaggact acgtgctgtc
cctggacctg cacgacatca accgcgagcc 300cctcattata cactgggcag
cctccccaca gcgcatcgag gaatgcgtgc tctcaggcaa 360ggatgtcaac
ggcgagtgtg ggaacttcgt caggctcatc cagccctgga accgaacaca
420cctgtatgtg tgcgggacag gtgcctacaa ccccatgtgc acctatgtga
accgcggacg 480ccgcgcccag gattacatct tctacctgga gcctgagcga
ctcgagtcag ggaagggcaa 540gtgtccgtac gatcccaagc tggacacagc
atcggccctc atcaatgagg agctctatgc 600tggtgtgtac atcgatttta
tgggcactga tgcagccatc ttccgcacac ttggaaagca 660gacagccatg
cgcacggatc agtacaactc ccggtggctg aacgacccgt cgttcatcca
720tgctgagctc attcctgaca gtgcggagcg caatgatgat aagctttact
tcttcttccg 780tgagcggtcg gcagaggcgc cgcagagccc cgcggtgtac
gcccgcatcg ggcgcatttg 840cctgaacgat gacggtggtc actgttgcct
ggtcaacaag tggagcacat tcctgaaggc 900gcggctcgtc tgctctgtcc
cgggcgagga tggcattgag actcactttg atgagctcca 960ggacgtgttt
gtccagcaga cccaggacgt gaggaaccct gtcatttacg ctgtctttac
1020ctcctctggc tccgtgttcc gaggctctgc cgtgtgtgtc tactccatgg
ctgatattcg 1080catggtcttc aacgggccct ttgcccacaa agaggggccc
aactaccagt ggatgccctt 1140ctcagggaag atgccctacc cacggccggg
cacgtgccct ggtggaacct tcacgccatc 1200tatgaagtcc accaaggatt
atcctgatga ggtgatcaac ttcatgcgca gccacccact 1260catgtaccag
gccgtgtacc ctctgcagcg gcggcccctg gtagtccgca caggtgctcc
1320ctaccgcctt accactattg ccgtggacca ggtggatgca gccgacgggc
gctatgaggt 1380gcttttcctg ggcacagacc gcgggacagt gcagaaggtc
attgtgctgc ccaaggatga 1440ccaggagttg gaggagctca tgctggagga
ggtggaggtc ttcaaggatc cagcacccgt 1500caagaccatg accatctctt
ctaagaggca acaactctac gtggcgtcag ccgtgggtgt 1560cacacacctg
agcctgcacc gctgccaggc gtatggggct gcctgtgctg actgctgcct
1620tgcccgggac ccttactgtg cctgggatgg ccaggcctgc tcccgctata
cagcatcctc 1680caagaggcgg agccgccggc aggacgtccg gcacggaaac
cccatcaggc agtgccgtgg 1740gttcaactcc aatgccaaca agaatgccgt
ggagtctgtg cagtatggcg tggccggcag 1800cgcagccttc cttgagtgcc
agccccgctc gccccaagcc actgttaagt ggctgttcca 1860gcgagatcct
ggtgaccggc gccgagagat tcgtgcagag gaccgcttcc tgcgcacaga
1920gcagggcttg ttgctccgtg cactgcagct cagcgatcgt ggcctctact
cctgcacagc 1980cactgagaac aactttaagc acgtcgtcac acgagtgcag
ctgcatgtac tgggccggga 2040cgccgtccat gctgccctct tcccaccact
gtccatgagc gccccgccac ccccaggcgc 2100aggcccccca acgcctcctt
accaggagtt agcccagctg ctggcccagc cagaagtggg 2160cctcatccac
cagtactgcc agggttactg gcgccatgtg ccccccagcc ccagggaggc
2220tccaggggca ccccggtctc ctgagcccca ggaccagaaa aagccccgga
accgccggca 2280ccaccctccg gacacatgag gccagctgcc tgtgcctgcc
atgggccagc ctagcccttg 2340tcccttttaa tataaaagat atatatatat
atatatatat ataaaatatc tatattctat 2400acacaccctg cccctgcaaa
gacagtattt attggtgggt tgaatatagc ctgcctcagt 2460ggcagcatcc
tccaaaactt agacccatgc tggtcagaga cggcagaaaa cagagcctgc
2520ctaaccaggc ccagccagtt ggtggggcca ggccaggacc acacagtccc
cagactcagc 2580tggaagtcta cctgctggac agcctccgcc aagatctaca
ggacaaaggg agggagcaag 2640ccctactcgg atggggcacg gactgtccac
cttttctgat gtgtgttgtc agcctgtgct 2700gtggcataga catggatgcg
aggaccactt tggagactgg ggtggcctca agagcacaca 2760gagaagggaa
gaaggggcca tcacaggatg ccagcccctg cctgggttgg gggcactcag
2820ccacgaccag ccccttcctg ggtatttatt ctctatttat tggggatagg
agaagaggca 2880tcctgcctgg gtgggacagc ctcttcagcc ccttctcccc
tccccgcctg gccagggcag 2940ggccacccca ctctacctcc ttagctttcc
ctgtgccact ttgactcaga ggctgggagc 3000atagcagagg ggccaggccc
aggcagagct gacgggaggc cccagctctg aggggagggg 3060gtccgtggta
gaggcctggg gccggtagag gctccccagg gctcccttat gtccaccact
3120tcaggggatg ggtgtggatg taattagctc tggggggcag ttgggtagat
gggtgggggc 3180tcctggtggc cttctgctgc ccaggccaca gccgcctttg
ggttccatct tgctaataaa 3240cactggctct gggactagaa aaaaaaaaaa aaaa
3274902658DNAHomo sapiens 90aataaatatc cgtgtagaaa atcagaacga
ctctttcagg ccatctttaa aatgtcattg 60gtaaaccata cttgatccta aattcctgta
cttcctcagg ccatccgagc atgaaacgct 120gtcacctacc cacatccgct
ggctgtgacg cttgtcaaag tgttctctat cggctgcatg 180cctagaccac
caaagcgttc tgaccggaca gtgtcactgg agaaggcggc gcgacatgtc
240cagggcgcag atctgggctc tggtgtctgg tgtcggaggg tttggagctc
tcgttgctgc 300taccacgtcc aatgagtgga aagtgaccac gcgagcctcc
tcggtgataa cagccacttg 360ggtttaccag ggtctgtgga tgaactgcgc
aggtaacgcg ttgggttctt tccattgccg 420accgcatttt actatcttca
aagtagcagg ttatatacag gcatgtagag gacttatgat 480cgctgctgtc
agcctgggct tctttggttc catatttgcg ctctttggaa tgaagtgtac
540caaagtcgga ggctccgata aagccaaagc taaaattgct tgtttggctg
ggattgtatt 600catactgtca gggctgtgct caatgactgg atgttcccta
tatgcaaaca aaatcacaac 660ggaattcttt gatcctctct ttgttgagca
aaagtatgaa ttaggagccg ctctgtttat 720tggatgggca ggagcctcac
tgtgcataat tggtggtgtc atattttgct tttcaatatc 780tgacaacaac
aaaacaccca gatacacata caacggggcc acatctgtca tgtcttctcg
840gacaaagtat catggtggag aagattttaa aacaacaaac ccttcaaaac
agtttgataa 900aaatgcttat gtctaaaaga gctcgctggc aagctgcctc
ttgagtttgt tataaaagcg 960aactgttcac aaaatgatcc catcaaggcc
ctcccataat taacactcaa aactattttt 1020aaaatatgca tttgaagcat
ctgttgattg tatggatgta agtgttctta catagttagt 1080tatatactaa
tcattttctg ttgtggcttt ctataaaaaa taaacagttt atttacagga
1140tttgtaaaat gttttctaca tttatataga acatgaaaag catttagtac
caaaggttca 1200agaagtattc gtactctagc ctttttaatc attcatagat
agaagtcttt gtacccactc 1260cttatgtttc ttttcattca taaacaggtg
tataaggaac aatgtcttat aaacagcatg 1320ggggcaatct gagaatattc
ctcaaaaggt gtccaggtta aatagacatg ttactggctg 1380cacacaggca
aattctagtt tgtttttttt aagtattcta caacatttat ttaaaaaggt
1440aaatcttttt gttgaagcag caagttatct ggtagaactt aacttctaca
ggatcagaga 1500ggatcttgct cattcatggc catatccaca tgcccatggc
cactcagtag attgttgaaa 1560aagcaaagcc acaccattct ctttgatgta
tgcagagagt tacgtagcag gggatgttct 1620ctgatttatt ccactggcac
cattagtgaa tatttagttg ttttcataaa cgatgctgtg 1680atgaagactc
atgtacatat ttagcaaatt ttggtttctt acatgtgcct gtcatgactg
1740taattcatta tgactgctcc aggaagggct aatggggcca atatattatt
gcctgtcatg 1800tggcacatcc atgttaaggg gctgaggcgt ccctggcacg
gaatgcagag ccctgagcta 1860gggcatcagc agaagctgag atagagatat
tggtcatggt tgactgagga gccaattaaa 1920acctgtttat gcctagtgtt
ccattattgg aacactaagc atgtgggagt tatttatatc 1980ctactgctca
aggtcatcgc caaggtgtga ttggaaaaat tcaaaaaatt gcaacctcag
2040gcataaatgg gttaaggaca tcccaagccc aagtggtacg tgcctcactc
agaactgacg 2100ggccgagttc tatctaggtg tgtcttccag aacctgttta
cggctaactg gataactgag 2160agacttgtca tttctaaaga catttaagtt
gctccaggga tttctgaaaa aagacacagg 2220cttcttccta gagccagccc
tatataacat gcccacaagg gcaacagtta tcacagttca 2280tacacacctt
tcatgtcctg tctcactcac tcctcacagc catcctagga gatacatatt
2340gttttcatcc tgcatttaca gaaaaagaaa tgaaaacaga gagcttaaat
aatttgccac 2400agtaatgtcg aaactaggcc tttgaaccaa ggcagtctag
ggtaaaatat agtttcaaag 2460tatgaataag aattggtatt tgtgttatct
ttgagtaaga aactgtccga tatgaatcac 2520aacgtgggtg aatgtagtat
tttcctgaag tgtgaaagac ttaaaaaaaa gaatcacatt 2580gttcagaggt
gctcaatgga aagaaaagga aatgaacaag tttgttaaaa gataaaaaat
2640aaaaaaaatt ccatacct 2658912490DNAHomo sapiens 91gagtgcgggg
gtcgcggcgc agagtgggag ccggagagcg agcgcggctg cagccggcgg 60catggctagc
acggcttcgg agatcatcgc cttcatggtc tccatctcag gctgggtact
120ggtgtcctcc acgctgccca ccgactactg gaaggtgtct accatcgacg
gcacggtcat 180cacaaccgcc acctattggg ccaacctgtg gaaggcgtgc
gttaccgact ccacgggcgt 240ctccaactgc aaggacttcc cctccatgct
ggcgctggac ggttatatac aggcatgtag 300aggacttatg atcgctgctg
tcagcctggg cttctttggt tccatatttg cgctctttgg 360aatgaagtgt
accaaagtcg gaggctccga taaagccaaa gctaaaattg cttgtttggc
420tgggattgta ttcatactgt cagggctgtg ctcaatgact ggatgttccc
tatatgcaaa 480caaaatcaca acggaattct ttgatcctct ctttgttgag
caaaagtatg aattaggagc 540cgctctgttt attggatggg caggagcctc
actgtgcata attggtggtg tcatattttg 600cttttcaata tctgacaaca
acaaaacacc cagatacaca tacaacgggg ccacatctgt 660catgtcttct
cggacaaagt atcatggtgg agaagatttt aaaacaacaa acccttcaaa
720acagtttgat aaaaatgctt atgtctaaaa gagctcgctg gcaagctgcc
tcttgagttt 780gttataaaag cgaactgttc acaaaatgat cccatcaagg
ccctcccata attaacactc 840aaaactattt ttaaaatatg catttgaagc
atctgttgat tgtatggatg taagtgttct 900tacatagtta gttatatact
aatcattttc tgttgtggct ttctataaaa aataaacagt 960ttatttacag
gatttgtaaa atgttttcta catttatata gaacatgaaa agcatttagt
1020accaaaggtt caagaagtat tcgtactcta gcctttttaa tcattcatag
atagaagtct 1080ttgtacccac tccttatgtt tcttttcatt cataaacagg
tgtataagga acaatgtctt 1140ataaacagca tgggggcaat ctgagaatat
tcctcaaaag gtgtccaggt taaatagaca 1200tgttactggc tgcacacagg
caaattctag tttgtttttt ttaagtattc tacaacattt 1260atttaaaaag
gtaaatcttt ttgttgaagc agcaagttat ctggtagaac ttaacttcta
1320caggatcaga gaggatcttg ctcattcatg gccatatcca catgcccatg
gccactcagt 1380agattgttga aaaagcaaag ccacaccatt ctctttgatg
tatgcagaga gttacgtagc 1440aggggatgtt ctctgattta ttccactggc
accattagtg aatatttagt tgttttcata 1500aacgatgctg tgatgaagac
tcatgtacat atttagcaaa ttttggtttc ttacatgtgc 1560ctgtcatgac
tgtaattcat tatgactgct ccaggaaggg ctaatggggc caatatatta
1620ttgcctgtca tgtggcacat ccatgttaag gggctgaggc gtccctggca
cggaatgcag 1680agccctgagc tagggcatca gcagaagctg agatagagat
attggtcatg gttgactgag 1740gagccaatta aaacctgttt atgcctagtg
ttccattatt ggaacactaa gcatgtggga 1800gttatttata tcctactgct
caaggtcatc gccaaggtgt gattggaaaa attcaaaaaa 1860ttgcaacctc
aggcataaat gggttaagga catcccaagc ccaagtggta cgtgcctcac
1920tcagaactga cgggccgagt tctatctagg tgtgtcttcc agaacctgtt
tacggctaac 1980tggataactg agagacttgt catttctaaa gacatttaag
ttgctccagg gatttctgaa 2040aaaagacaca ggcttcttcc tagagccagc
cctatataac atgcccacaa gggcaacagt 2100tatcacagtt catacacacc
tttcatgtcc tgtctcactc actcctcaca gccatcctag 2160gagatacata
ttgttttcat cctgcattta cagaaaaaga aatgaaaaca gagagcttaa
2220ataatttgcc acagtaatgt cgaaactagg cctttgaacc aaggcagtct
agggtaaaat 2280atagtttcaa agtatgaata agaattggta tttgtgttat
ctttgagtaa gaaactgtcc 2340gatatgaatc acaacgtggg tgaatgtagt
attttcctga agtgtgaaag acttaaaaaa 2400aagaatcaca ttgttcagag
gtgctcaatg gaaagaaaag gaaatgaaca agtttgttaa 2460aagataaaaa
ataaaaaaaa ttccatacct 2490922424DNAHomo sapiens 92gttccccgcg
tgccaccagg aagctcgggc cggccaagag cgtagactct tgagaggagt 60gagacaggtg
cgcgccagcc ggccttcggg gctttatggg aactgggccg tgcggcggtc
120ccgccctcgt gcgcaggcgc agaaccgttg tgaccagagc ggttgcgggc
tgagcggttt 180cgagccggcg tcggggagcg gcggtaccgg gcggctgcgg
ggctggctcg acccagcttg 240aggtctcggc gtccgcgtcc tgcggtgccc
tggggtctcc cgaggacctt gtacccgcgc 300ggcttccttg ggctggcttt
ggacgacgct ttcgccttcc tgctgcctag gatccgccga 360catgaatccc
atcgtagtgg tccacggcgg cggagccggt cccatctcca aggatcggaa
420ggagcgagtg caccagggca tggtcagagc cgccaccgtg ggctacggca
tcctccggga 480gggcgggagc gccgtggatg ccgtagaggg agctgtcgtc
gccctggaag acgatcccga 540gttcaacgca ggttgtgggt ctgtcttgaa
cacaaatggt gaggttgaaa tggatgctag 600tatcatggat ggaaaagacc
tgtctgcagg agcagtgtcc gcagtccagt gtatagcaaa 660tcccattaaa
cttgctcggc ttgtcatgga aaagacacct cattgctttc tgactgacca
720aggcgcagcg cagtttgcag cagctatggg ggttccagag attcctggag
aaaaactggt 780gacagagaga aacaaaaagc gcctggaaaa agagaagcat
gaaaaaggtg ctcagaaaac 840agattgtcaa aaaaacttgg gaaccgtggg
tgctgttgcc ttggactgca aagggaatgt 900agcctacgca acctccacag
gcggtatcgt taataaaatg gtcggccgcg ttggggactc 960accgtgtcta
ggagctggag gttatgccga caatgacatc ggagccgtct caaccacagg
1020gcatggggaa agcatcctga aggtgaacct ggctagactc accctgttcc
acatagaaca 1080aggaaagacg gtagaagagg ctgcggacct atcgttgggt
tatatgaagt caagggttaa 1140aggtttaggt ggcctcatcg tggttagcaa
aacaggagac tgggtggcaa agtggacctc 1200cacctccatg ccctgggcag
ccgccaagga cggcaagctg cacttcggaa ttgatcctga 1260cgatactact
atcaccgacc ttccctaagc cgctggaaga ttgtattcca gatgctagct
1320tagaggtcaa gtacagtctc ctcatgagac atagcctaat caattagatc
tagaattgga 1380aaaattgtcc cgtctgtcac ttgttttgtt gccttaataa
gcatctgaat gtttggttgt 1440ggggcgggtt ctgaagcgat gagagaaatg
cccgtattag gaggattact tgagcccagg 1500aggtcaaagc tgaggtgagc
catgattact ccactgcact ccagcctggg caacagagcc 1560aggccctgta
tcaaaaaaaa aaaaaaaaag aaaagggaaa aaagaaagaa agcagcagca
1620tgatcctgac atgacagatg tgggagaccc acagcctgca gacactgtgg
gctggaaggt 1680gggaagggag gggccggtgg aggtggagct gtttgaaagt
gacacagcag cagtagaagc 1740agtggtgggc gaagcccagg tgaccctcag
aacgttgcac aagaacatca gggaaaagaa 1800ccagaatcct ttaaggaaaa
tgttcttcat gtatgagaga ctaaagtgat ttttctaaga 1860aagttcagcc
cttctctgac ttacctggac atttctagat acttccaaag gaccctctgg
1920gaatccatag cttcctaatc tggagatggg aggtcataag ggagacgctg
tggggttcct 1980tgaagtttct tgggttcaca gaggagcccc ctcacttggt
gttctcccgt gagccagcct 2040ccacctgcca aagacactct ggtcctcgta
tagtgagtaa tggggctcag ggcctctcca 2100acaacagaga ggagctgatg
ctgtagggct gaccccgtga cttcctgagt cctcaccctg 2160tccagtgctt
tgagattctt cccacctccc catcctcacc agccggatcg ggcgctgtgc
2220agtgtggtca gcatggtgaa gaaagtcatt tcctcggtgg gcagtattcc
tctttatctc 2280tcattacact ggaaatgtta tttctgctgt atcatccgtg
ctcaacgttt tagtctgtca 2340ggctcacctt ctctctggaa agaatttgct
taacttgaca ttccatgtgc cgctaataaa 2400atatattttg aaagaataaa aaaa
2424932347DNAHomo sapiens 93gttccccgcg tgccaccagg aagctcgggc
cggccaagag cgtagactct tgagaggagt 60gagacaggtg cgcgccagcc ggccttcggg
gctttatggg aactgggccg tgcggcggtc 120ccgccctcgt gcgcaggcgc
agaaccgttg tgaccagagc ggttgcgggc tgagcggttt 180cgagccggcg
tcggggagcg gcggtaccgg gcggctgcgg ggctggctcg acccagcttg
240aggtctcggc gtccgcgtcc tgcggtgccc tgggatccgc cgacatgaat
cccatcgtag 300tggtccacgg cggcggagcc ggtcccatct ccaaggatcg
gaaggagcga gtgcaccagg 360gcatggtcag agccgccacc gtgggctacg
gcatcctccg ggagggcggg agcgccgtgg 420atgccgtaga gggagctgtc
gtcgccctgg aagacgatcc cgagttcaac gcaggttgtg 480ggtctgtctt
gaacacaaat ggtgaggttg aaatggatgc tagtatcatg gatggaaaag
540acctgtctgc aggagcagtg tccgcagtcc agtgtatagc aaatcccatt
aaacttgctc 600ggcttgtcat ggaaaagaca cctcattgct ttctgactga
ccaaggcgca gcgcagtttg 660cagcagctat gggggttcca gagattcctg
gagaaaaact ggtgacagag agaaacaaaa 720agcgcctgga aaaagagaag
catgaaaaag gtgctcagaa aacagattgt caaaaaaact 780tgggaaccgt
gggtgctgtt gccttggact gcaaagggaa tgtagcctac gcaacctcca
840caggcggtat cgttaataaa atggtcggcc gcgttgggga ctcaccgtgt
ctaggagctg 900gaggttatgc cgacaatgac atcggagccg tctcaaccac
agggcatggg gaaagcatcc 960tgaaggtgaa cctggctaga ctcaccctgt
tccacataga acaaggaaag acggtagaag 1020aggctgcgga cctatcgttg
ggttatatga agtcaagggt taaaggttta ggtggcctca 1080tcgtggttag
caaaacagga gactgggtgg caaagtggac ctccacctcc atgccctggg
1140cagccgccaa ggacggcaag ctgcacttcg gaattgatcc tgacgatact
actatcaccg 1200accttcccta agccgctgga agattgtatt ccagatgcta
gcttagaggt caagtacagt 1260ctcctcatga gacatagcct aatcaattag
atctagaatt ggaaaaattg tcccgtctgt 1320cacttgtttt gttgccttaa
taagcatctg aatgtttggt tgtggggcgg gttctgaagc 1380gatgagagaa
atgcccgtat taggaggatt acttgagccc aggaggtcaa agctgaggtg
1440agccatgatt actccactgc actccagcct gggcaacaga gccaggccct
gtatcaaaaa 1500aaaaaaaaaa aagaaaaggg aaaaaagaaa gaaagcagca
gcatgatcct gacatgacag 1560atgtgggaga cccacagcct gcagacactg
tgggctggaa ggtgggaagg gaggggccgg 1620tggaggtgga gctgtttgaa
agtgacacag cagcagtaga agcagtggtg ggcgaagccc 1680aggtgaccct
cagaacgttg cacaagaaca tcagggaaaa gaaccagaat cctttaagga
1740aaatgttctt catgtatgag agactaaagt gatttttcta agaaagttca
gcccttctct 1800gacttacctg gacatttcta gatacttcca aaggaccctc
tgggaatcca tagcttccta 1860atctggagat gggaggtcat aagggagacg
ctgtggggtt ccttgaagtt tcttgggttc 1920acagaggagc cccctcactt
ggtgttctcc cgtgagccag cctccacctg ccaaagacac 1980tctggtcctc
gtatagtgag taatggggct cagggcctct ccaacaacag agaggagctg
2040atgctgtagg gctgaccccg tgacttcctg agtcctcacc ctgtccagtg
ctttgagatt 2100cttcccacct ccccatcctc accagccgga tcgggcgctg
tgcagtgtgg tcagcatggt 2160gaagaaagtc atttcctcgg tgggcagtat
tcctctttat ctctcattac actggaaatg 2220ttatttctgc tgtatcatcc
gtgctcaacg ttttagtctg tcaggctcac cttctctctg 2280gaaagaattt
gcttaacttg acattccatg tgccgctaat aaaatatatt ttgaaagaat 2340aaaaaaa
2347942205DNAHomo sapiens 94tcccctggac ccgcccccat ctgcccaaga
taattttagt ttccttgggc ctggaatctg 60gacacacagg gctccccccc gcctctgact
tctctgtccg aagtcgggac accctcctac 120cacctgtaga gaagcgggag
tggatctgaa ataaaatcca ggaatctggg ggttcctaga 180cggagccaga
cttcggaacg ggtgtcctgc tactcctgct ggggctcctc caggacaagg
240gcacacaact ggttccgtta agcccctctc ttgctcagac gccatggagc
tggatctgtc 300tccacctcat cttagcagct ctccggaaga cctttgccca
gcccctggga cccctcctgg 360gactccccgg ccccctgata cccctctgcc
tgaggaggta aagaggtccc agcctctcct 420catcccaacc accggcagga
aacttcgaga ggaggagagg cgtgccacct ccctcccctc 480tatccccaac
cccttccctg agctctgcag tcctccctca cagagcccaa ttctcggggg
540cccctccagt gcaagggggc tgctcccccg cgatgccagc cgcccccatg
tagtaaaggt 600gtacagtgag gatggggcct gcaggtctgt ggaggtggca
gcaggtgcca cagctcgcca 660cgtgtgtgaa atgctggtgc agcgagctca
cgccttgagc gacgagacct gggggctggt 720ggagtgccac ccccacctag
cactggagcg gggtttggag gaccacgagt ccgtggtgga 780agtgcaggct
gcctggcccg tgggcggaga tagccgcttc gtcttccgga aaaacttcgc
840caagtacgaa ctgttcaaga gctccccaca ctccctgttc ccagaaaaaa
tggtctccag 900ctgtctcgat gcacacactg gtatatccca tgaagacctc
atccagaact tcctgaatgc 960tggcagcttt cctgagatcc agggctttct
gcagctgcgg ggttcaggac ggaagctttg 1020gaaacgcttt ttctgcttct
tgcgccgatc tggcctctat tactccacca agggcacctc 1080taaggatccg
aggcacctgc agtacgtggc agatgtgaac gagtccaacg tgtacgtggt
1140gacgcagggc cgcaagctct acgggatgcc cactgacttc ggtttctgtg
tcaagcccaa 1200caagcttcga aatggccaca aggggcttcg gatcttctgc
agtgaagatg agcagagccg 1260cacctgctgg ctggctgcct tccgcctctt
caagtacggg gtgcagctgt acaagaatta 1320ccagcaggca cagtctcgcc
atctgcatcc atcttgtttg ggctccccac ccttgagaag 1380tgcctcagat
aataccctgg tggccatgga cttctctggc catgctgggc gtgtcattga
1440gaacccccgg gaggctctga gtgtggccct ggaggaggcc caggcctgga
ggaagaagac 1500aaaccaccgc ctcagcctgc ccatgccagc ctccggcacg
agcctcagtg cagcctgttc 1560ctggtccggg agagtcagcg gaacccccag
ggctttgtcc tctctttgtg ccacctgcag 1620aaagtgaagc attatctcat
cctgccgagc gaggaggagg gccgcctgta cttcagcatg 1680gatgatggcc
agacccgctt cactgacctg ctgcagctcg tggagttcca ccagctgaac
1740cgcggcatcc tgccgtgctt gctgcgccat tgctgcacgc gggtggccct
ctgaccaggc 1800cgtggactgg ctcatgcctc agcccgcctt caggctgccc
gccgcccctc cacccatcca 1860gtggactctg gggcgcggcc acaggggacg
ggatgaggag cgggagggtt ccgccactcc 1920agttttctcc tctgcttctt
tgcctccctc agatagaaaa cagcccccac tccagtccac 1980tcctgacccc
tctcctcaag ggaaggcctt gggtggcccc ctctccttct cctagctctg
2040gaggtgctgc tctagggcag ggaattatgg gagaagtggg ggcagcccag
gcggtttcac 2100gccccacact ttgtacagac cgagaggcca gttgatctgc
tctgttttat actagtgaca 2160ataaagatta ttttttgata caaaaaaaaa
aaaaaaaaaa aaaaa 2205951210DNAHomo sapiens 95ctctcccttc tccactctct
ccccctgtct cctttcttct tcttctttca ccctccgtct 60ctcacacccc ctccattccc
ctgtctcctt tctgacactg cactgcagct gctcctcagc 120cctgccccct
ccccagtgag aacaaaccag caacattgct ttttttccta aagagattta
180tattgatccg attaaaaaaa aaaaacctta agaaacccca aacgcaaaaa
aaaaaaaaaa 240aaaaaaagaa aaaagaaaag aaaaagccaa aacaaaaggg
agaaccttct cccggtagca 300gcggcaggaa ctgcaaacat gatggcggca
gctcccatcc agcagaacgg gacccacact 360ggggttccca tagacctgga
cccgccggac tcgcggaaaa ggccgctgga agccccccct 420gaagccggca
gcaccaagag gaccaatacg ggcgaagacg gccagtattt tctaaaggtt
480ctcataccta gttatgctgc tggatctata attgggaagg gaggacagac
aattgttcag 540ttgcaaaaag aaactggagc caccatcaag ctgtctaagt
ccaaagattt ttacccaggt 600actactgagc gagtgtgctt gatccaggga
acggttgaag cactgaatgc agttcatgga 660ttcattgcag aaaaaattcg
agaaatgccc caaaatgtgg ccaagacaga accagtcagc 720attctacaac
cccagaccac cgttaatcca gatcgcatca aacaaacatt gccatcttcc
780ccaactacca ccaagtcctc tccatctgat cccatgacca cctccagagc
taatcagaag 840cataatatct cctggatatc atgaagcaag atataagaga
agaacaaaac aaaatccgta 900attcattgaa agaattgtaa tcatcaatct
ttcatattat taatactttg taattatttt 960ctccccaaca gtattttcca
gtagattcta atcatgtggt agggcagaag gaaatgtgtt 1020ttttgttgtt
catttgtttc ttgtcaatag tcctgattaa tttagctttg ctatactgac
1080ttatatctgg aagtatataa ccaagataag aaaataggtt ttaatatgat
catcttaagc 1140taattgtaat gaaaagaact aatggactgt caatattcag
aaaaccaaaa ataaaaaata 1200cagaaaacta 1210
* * * * *
References