U.S. patent number 10,660,950 [Application Number 15/781,959] was granted by the patent office on 2020-05-26 for virus purification.
This patent grant is currently assigned to Valneva SE. The grantee listed for this patent is Valneva SE. Invention is credited to Jana Barbero Calzado, Jurgen Heindl-Wruss, Mario Nebenfuhr, Robert Schlegl, Michael Weber.
View All Diagrams
United States Patent |
10,660,950 |
Barbero Calzado , et
al. |
May 26, 2020 |
Virus purification
Abstract
Described herein are processes for purifying infectious virus
particles and uses of protamine in such processes.
Inventors: |
Barbero Calzado; Jana (Vienna,
AT), Nebenfuhr; Mario (Vienna, AT),
Schlegl; Robert (Siegenfeld, AT), Weber; Michael
(Vienna, AT), Heindl-Wruss; Jurgen (Vienna,
AT) |
Applicant: |
Name |
City |
State |
Country |
Type |
Valneva SE |
Nantes |
N/A |
FR |
|
|
Assignee: |
Valneva SE (Nantes,
FR)
|
Family
ID: |
57821924 |
Appl.
No.: |
15/781,959 |
Filed: |
December 23, 2016 |
PCT
Filed: |
December 23, 2016 |
PCT No.: |
PCT/EP2016/082663 |
371(c)(1),(2),(4) Date: |
June 06, 2018 |
PCT
Pub. No.: |
WO2017/109224 |
PCT
Pub. Date: |
June 29, 2017 |
Prior Publication Data
|
|
|
|
Document
Identifier |
Publication Date |
|
US 20180362937 A1 |
Dec 20, 2018 |
|
Foreign Application Priority Data
|
|
|
|
|
Dec 23, 2015 [EP] |
|
|
15202585 |
Mar 18, 2016 [EP] |
|
|
16161068 |
Jun 23, 2016 [EP] |
|
|
16176025 |
Jun 23, 2016 [EP] |
|
|
16176049 |
Aug 4, 2016 [EP] |
|
|
16182845 |
|
Current U.S.
Class: |
1/1 |
Current CPC
Class: |
C12N
7/02 (20130101); C07K 14/1825 (20130101); C07K
14/18 (20130101); A61K 39/39 (20130101); C12N
7/06 (20130101); A61K 39/12 (20130101); A61P
31/14 (20180101); C12N 7/00 (20130101); Y02A
50/383 (20180101); C12N 2770/24151 (20130101); Y02A
50/388 (20180101); A61K 2039/5258 (20130101); A61K
2039/5252 (20130101); Y02A 50/39 (20180101); C12N
2770/24163 (20130101); A61K 2039/5254 (20130101); A61K
2039/55505 (20130101); C12N 2770/24134 (20130101); Y02A
50/386 (20180101); Y02A 50/30 (20180101); Y02A
50/392 (20180101) |
Current International
Class: |
C12N
7/02 (20060101); A61P 31/14 (20060101); C07K
14/18 (20060101); C12N 7/06 (20060101); C12N
7/00 (20060101); A61K 39/39 (20060101); A61K
39/12 (20060101); A61K 39/00 (20060101) |
References Cited
[Referenced By]
U.S. Patent Documents
Foreign Patent Documents
|
|
|
|
|
|
|
105749268 |
|
Jul 2016 |
|
CN |
|
WO 1999/011762 |
|
Mar 1999 |
|
WO |
|
WO 2001/092552 |
|
Dec 2001 |
|
WO |
|
WO 2013/083726 |
|
Jun 2013 |
|
WO |
|
WO 2016/145149 |
|
Sep 2016 |
|
WO |
|
Other References
US. Appl. No. 16/062,245, filed Jun. 14, 2018, Barbero Calzado et
al. cited by applicant .
PCT/EP2016/082663, Jul. 5, 2018, International Preliminary Report
and Patentability. cited by applicant .
PCT/EP2016/082663, Apr. 19, 2017, International Search Report and
Written Opinion. cited by applicant .
PCT/EP2016/082662, Jul. 5, 2018, International Preliminary Report
and Patentability. cited by applicant .
PCT/EP2016/082662, Apr. 18, 2017, International Search Report and
Written Opinion. cited by applicant .
U.S. Appl. No. 16/702,764, filed Dec. 4, 2019, Barbero Calzado et
al. cited by applicant .
[No Author Listed] Valneva Reports Excellent Final Phase 1 Results
for its Chikungunya Vaccine Candidate, Confirms Plans. Press
release. Nov. 18, 2019. cited by applicant .
[No Author Listed] Centers for Disease Control and Prevention.
Ingredients of vaccines fact sheet; continuously updated;
https://www.cdc.gov/vaccines/vac-gen/additives.htm. cited by
applicant .
[No Author Listed] Japanese Encephalitis Vaccine. Centers for
Disease Control and Prevention, 2016. Retrieved from
https://www.cdc.gov/japaneseencephalitis/vaccine/ on Jun. 16, 2016.
cited by applicant .
[No Author Listed] Pan-American Health Organization, 2015. Number
of Reported Cases of Chikungunya Fever in the Americas, by Country
or Territory 2013-2014. Cumulative Cases (Updated Oct. 23, 2015).
cited by applicant .
[No Author Listed] Protamine sulfate. Wikimedia Foundation, Inc.,
2015. Retrieved from
https://en.wikipedia.org/wiki/Protamine_sulfate; updated Sep. 30,
2015 on Nov. 26, 2015. cited by applicant .
[No Author Listed] Valneva Announces Successful Generation of a
Highly-purified Zika Vaccine Candidate Using its FDA-EMA Approved
Japanese Encephalitis Platform. Press release. Jul. 7, 2016. cited
by applicant .
[No Author Listed] World Health Organization, 2016Zika Situation
Report Feb. 5, 2016. cited by applicant .
[No Author Listed] World Health Organization, 2016Zika Virus Fact
Sheet 2016. Retrieved from
http://www.who.int/mediacentre/factsheets/zika/en/ on Mar. 11,
2016. cited by applicant .
[No Author Listed] Zika virus, strain H/PF/2013. European virus
archive, 2016. cited by applicant .
Abbink et al., Durability and correlates of vaccine protection
against Zika virus in rhesus monkeys. Sci. Transl. Med. 9, eaao4163
(2017). cited by applicant .
Altschul et al., Basic Local Alignment Search Tool. J. Mol. Biol.
1990;215:403-410. cited by applicant .
Altschul et al., Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs. Nuc. Acids Res.
1997;25(17):3389-3402. cited by applicant .
Baronti et al., Complete Coding Sequence of Zika Virus from a
French Polynesia Outbreak in 2013. Genome Announc. May-Jun. 2014;
2(3):e00500-14. Abstract. cited by applicant .
Bender et al., Zika Virus Vaccine Candidate VLA1601: Cooperation
Valneva & Emergent. Presentation at World Vaccine Congress Apr.
4, 2018. cited by applicant .
Cohen, Infectious Disease. The race for a Zika vaccine is on.
Science. Feb. 5, 2016;351(6273):543-4. doi:
10.1126/science.351.6273.543. cited by applicant .
Cox et al., Predicting Zika virus structural biology:Challenges and
opportunities for intervention. Antivir Chem Chemother. Aug.
2015;24(3-4):118-26. doi: 10.1177/2040206616653873. Epub Jun. 13,
2016. cited by applicant .
Dowall et al., A susceptible mouse model for Zika virus infection.
PLOS Neglected Tropical Diseases.10(5):e0004658. May 5, 2016.
DOI:10.1371/journal.pntd.0004658. cited by applicant .
Fritsche et al., Vaccine hypersensitivity--update and overview.
Swiss Med Wkly. 2010;140(17-18):238-246. cited by applicant .
Gardner et al., Deliberate Attenuation of Chikungunya Virus by
Adaptation to Heparan Sulfate-Dependent Infectivity: A Model of
Rational Arboviral Vaccine Design. PLOS Neglected Tropical
Diseases. 2014;8(2):e2719. cited by applicant .
Geradin et al., Chikungunya virus-associated encephalitis: A cohort
study on La Reunion Island. 2005-2009. Neurology. 86(1):94-102.
cited by applicant .
Gubler et al., Fields Virology. Knipe DM, Howley PM, editors.
Lippincott-Raven Publishers; Philadelphia: 2007. pp. 1153-1252.
cited by applicant .
Haddow et al., Genetic Characterization of Zika Virus Strains:
Geographic Expansion of the Asian Lineage. PLoS Negl Trop Dis 6(2):
e1477. doi:10.1371/journal.pntd.0001477. cited by applicant .
Hallengard et al., Novel Attenuated Chikungunya Vaccine Candidates
Elicit Protective Immunity in C57BL/6 mice. J. Virology
88(5):2858-2866. cited by applicant .
Hallengard et al., Prime-Boost Immunization Strategies against
Chikungunya Virus. J. Virology. 88(22):13333-13343. cited by
applicant .
Hombach et al., Report on a WHO consultation on immunological
endpoints for evaluation of new Japanese encephalitis vaccines,
WHO, Geneva, Sep. 2-3, 2004. Vaccine. 2005; 23(45):5205-5211. cited
by applicant .
Hutornojs et al., Comparison of ultracentrifugation methods for
concentration of recombinant alphaviruses: sucrose and iodixanol
cushions. Environmental Experimental Biology. 2012;10:117-123.
cited by applicant .
Katoh et al., Recent developments in the MAFFT multiple sequence
alignment program. Briefings in Bioinformatics. 2008;9(4):286-298.
cited by applicant .
Kim et al., Design of Chimeric Alphaviruses with a Programmed,
Attenuated, Cell Type-Restricted Phenotype. J Virol.
2011;85(9):4363-4376. cited by applicant .
Konishi et al., Studies on structural proteins of Chikungunya
Virus. I. Separation of three species of proteins and their
preliminary characterization. Microbiol Immunol. 1980;24(5):419-28.
cited by applicant .
Larkin et al., Clustal W and Clustal X version 2.0. Bioinformatics.
2007;23(21):2947-2948. cited by applicant .
Larocca et al., Vaccine protection against Zika virus from Brazil.
Nature. 2016;536:474-478. doi:10.1038/nature18952. Methods. cited
by applicant .
Lindenbach et al., Fields Virology. Knipe DM, Howley PM, editors.
Lippincott-Raven Publishers; Philadelphia: 2007. pp. 1101-1152.
cited by applicant .
Malone et al., Zika Virus: Medical Countermeasure Development
Challenges. PLoS Negl Trop Dis. 2016;10(3):e0004530.
doi:10.1371/journal.pntd.0004530. cited by applicant .
Modjarrad et al., Preliminary aggregate safety and immunogenicity
results from three trials of a purified inactivated Zika virus
vaccine candidate: phase 1, randomised, double-blind,
placebo-controlled clinical trials. Dec. 4, 2017. cited by
applicant .
Monath, Yellow fever: an update. Lancet Infect Dis.
2001;1(1):11-20. cited by applicant .
Needleman et al., A general method applicable to the search for
similarities in the amino acid sequence of two proteins. J. Mol.
Biol. 1970;48(3):443-453. cited by applicant .
Patkar et al., Yellow Fever virus NS3 plays an essential role in
virus assembly independent of its known enzymatic functions. J
Virol. Apr. 2008;82(7):3342-52. doi: 10.1128/JVI.02447-07. Epub
Jan. 16, 2008. cited by applicant .
Pearson et al., Improved tools for biological sequence comparison.
Proc. Natl. Acad. Sci. USA. 1988;85(8):2444-8. cited by applicant
.
Pellerin, Walter Reed Scientists Test Zika Vaccine Candidate. U.S.
Department of Defense. Jun. 9, 2016. cited by applicant .
Pinto et al., A Temporal Role of Type I Interferon Signaling in
CD8+ T Cell Maturation during Acute West Nile Virus Infection. PLoS
Pathog. Dec. 2011;7(12): e1002407. cited by applicant .
Plevka et al., Maturation of flaviviruses starts from one or more
icosahedrally independent nucleation centres. EMBO reports.
2011;12(6):602-606. cited by applicant .
Reed et al., A simple method of estimating fifty percent endpoints.
American J Hygiene. May 1938;27:493-497. cited by applicant .
Rocha et al., Microcephaly: normality parameters and its
determinants in northeastern Brazil: a multicentre prospective
cohort study. Bull World Health Organ, E-pub: Feb. 8, 2016.
doi:http://dx.doi.org/10.2471/BLT.16.171215. cited by applicant
.
Rozen-Gagnon et al., Alphavirus Mutator Variants Present
Host-Specific Defects and Attenuation in Mammalian and Insect
Models, PLOS Pathogens, 10(1):e1003877. cited by applicant .
Schlegl et al., Influence of elemental impurities in aluminum
hydroxide adjuvant on the stability of inactivated Japanese
Encephalitis vaccine, IXIARO.RTM.. Vaccine. 2015;33(44):5989-5996.
cited by applicant .
Shustov et al., Efficient, trans-complementing packaging systems
for chimeric, pseudoinfectious dengue 2/yellow fever viruses.
Virology. Apr. 25, 2010;400(1):8-17. doi:
10.1016/j.virol.2009.12.015. cited by applicant .
Simizu et al., Structural Proteins of Chikungunya Virus, J Virol.
1984;51(1): 254-258. cited by applicant .
Smith et al., Comparison of Biosequences. Adv. Appl. Math.
1981;2:482-489. cited by applicant .
Srivastava et al., A purified inactivated Japanese encephalitis
virus vaccine made in vero cells. Vaccine. 2001;19:4557-4565. cited
by applicant .
Tiwari et al., Assessment of immunogenic potential of Vero adapted
formalin inactivated vaccine derived from novel ECSA genotype of
Chikungunya virus. Vaccine. Apr. 21, 2009;27(18):2513-22. doi:
10.1016/j.vaccine.2009.02.062. Epub Feb. 27, 2009. cited by
applicant .
Vega-Rua et al., Chikungunya Virus Transmission Potential by Local
Aedes Mosquitoes in the Americas and Europe. PLOS Neglected
Tropical Diseases. 2015;9(5):e000378 cited by applicant .
Waterhouse et al., Jalview Version 2--a multiple sequence alignment
editor and analysis workbench. Bioinformatics.
2009;25(9):1189-1191. cited by applicant .
Way et al., Comparative Studies of some African Arboviruses in Cell
Culture and in Mice, J Gen. Virol. 1976;30:123-130. cited by
applicant .
Weaver et al., Arrival of Chiungunya Virus in the New Word:
Prospects for Spread and Impact on Public Health. PLoS Negl Trop
Dis. 2014;8(6): e2921. doi:10.1371/journal.pntd.0002921. cited by
applicant.
|
Primary Examiner: Li; Bao Q
Attorney, Agent or Firm: Wolf, Greenfield & Sacks,
P.C.
Claims
What is claimed is:
1. A method for purifying infectious virus particles, comprising
the steps of (A) providing a crude harvest (a) comprising
infectious virus particles, non-infectious virus particles, and
impurities, wherein the impurities are generated from growing said
virus particles on a cell substrate; (B) contacting said crude
harvest (a) with an agent comprising protamine to obtain a virus
preparation (b) comprising infectious virus particles; and (C)
further purifying said virus preparation (b) by one or more size
exclusion methods wherein said one or more size exclusion methods
comprise (i) sucrose density gradient centrifugation, and/or (ii) a
solid-phase matrix packed in a column comprising a ligand-activated
core and an inactive shell comprising pores, wherein the pores
comprise a molecular weight cut-off that excludes the virus
particles from entering the ligand-activated core, and wherein a
molecule smaller than the molecular weight cut-off of the pores can
enter the ligand-activated core, and/or (iii) size exclusion
chromatography; to obtain a final virus preparation (c) comprising
the infectious virus particles, less than 100 ng/mL residual host
cell DNA, less than 1 .mu.g/mL residual host cell protein, and less
than 1 .mu.g/mL residual aggregates of virus particles are selected
from the group consisting of filovirus particles of a flavivirus or
alphavirus particles of a alphavirus but; wherein the virus
particles are not of a Japanese encephalitis virus.
2. The method according to claim 1, wherein the final virus
preparation (c) comprises less than 10 ng/mL residual host cell DNA
and less than 100 ng/mL residual host cell protein.
3. The method according to claim 1, wherein said crude harvest (a)
is subjected to one or more pre-purification step(s) prior to step
(B), wherein the one or more pre-purification step(s) comprise a)
filtration using a filter having a pore size equal to or less than
0.2 .mu.m; and/or b) digestion of host cell genomic DNA by
enzymatic treatment; and/or c) ultra/diafiltration using a hollow
fiber membrane having a pore size equal to or greater than 100
kDa.
4. The method according to claim 1, wherein the concentration of
protamine is between 0.5 mg/ml and 3 mg/ml.
5. The method according to claim 1, wherein the infectious virus
particles in said final virus preparation (c) are enriched by at
least 50% to 95% relative to total virus particles in said crude
harvest (a).
6. The method according to claim 1, wherein said virus preparation
(c) comprises less than 10% impurities.
7. The method according to claim 1, wherein said infectious virus
particles are propagated in a cell line selected from the group
consisting of an EB66 cell line, a Vero cell line, a
Vero-.alpha.His cell line, a HeLa cell line, a HeLa-S3 cell line, a
293 cell line, a PC12 cell line, a CHO cell line, a 3T3 cell line,
a PerC6 cell line, an MDSK cell line, a chicken embryonic
fibroblast cell line, a duck cell line, and a diploid avian cell
line.
8. The method according to claim 1, wherein said infectious virus
particles are selected from the group consisting of a live virus,
an attenuated live virus, a chimeric virus, a modified live virus
and a recombinant live virus.
9. The method according to claim 8, wherein said infectious virus
particles are selected from the group of viruses consisting of Zika
virus, Chikungunya virus, yellow fever virus, and Dengue virus.
10. The method according to claim 1, further comprising a step (D)
comprising inactivating the final virus preparation (c).
11. The method according to claim 9, wherein said Chikungunya virus
is an attenuated Chikungunya virus comprising a deletion mutation
in the non-structural protein 3 provided by SEQ ID NO: 77 or an
immunogenic variant thereof, wherein said immunogenic variant is
defined as having at least 80% sequence identity to SEQ ID NO:
77.
12. The method according to claim 1, wherein said protamine is
selected from the group comprising a protamine salt, a protamine
sulphate and a recombinant protamine sulphate.
13. The method according to claim 10, wherein said inactivating
step (D) is performed using formaldehyde.
14. The method according to claim 1, wherein said infectious virus
particles are selected from the group consisting of flaviviruses
and flaviviruses but not Japanese encephalitis virus.
Description
RELATED APPLICATION
This application is a national stage filing under 35 U.S.C. .sctn.
371 of International Patent Application Serial No.
PCT/EP2016/082663, filed Dec. 23, 2016, the contents of which is
incorporated herein by reference in its entirety.
FIELD OF THE INVENTION
The disclosure relates to methods for the purification of viruses
for use in vaccines.
BACKGROUND OF THE INVENTION
Regulatory agencies such as the World Health Organization establish
standards and guidelines for the production of pharmaceutical
compositions administered to humans, such as vaccines, that limit
quantity and components of the compositions. Meeting these
standards is particularly challenging with regard to production of
vaccines containing biological agents, such as viruses, which must
be propagated on cell substrates. Such vaccine preparations must be
sterile (i.e., free from independently replicating organisms) and
may contain no more than 10 ng of host cell DNA per human dose,
among other requirements. These standards are in place in order to
ensure safety of the composition for human administration, but may
introduce challenges in the development of processes used to
produce such compositions.
Protamine was originally isolated from the sperm of salmon and
other species of fish but is now produced primarily through
recombinant biotechnology. It is a highly cationic peptide that
binds to negatively charged molecules such as nucleic acids to form
a stable ion pair. Its use in removing host cell nucleic acid is
well document.
SUMMARY
During the course of routine virus purification, it was observed
that addition of protamine sulfate to a virus harvest produced on a
cell substrate removed not only contaminating DNA derived from host
cells, as expected, but surprisingly also virtually eliminated
immature and otherwise non-infectious virus particles from the
preparation. This finding provides a streamlined, gentle,
reproducible and broadly-applicable process for obtaining
highly-purified infectious virus particles for applications such as
vaccine preparation; furthermore, the process is not dependent on
the charge of the virus particle.
Disclosed herein are downstream processes for purifying virus
particles from a crude preparation. The downstream process can be
applied to either a virus which has not adapted for propagation on
a particular cell substrate or for a partial/fully cell substrate
adapted virus particle.
Aspects of the invention provide processes for the purification of
infectious virus particles comprising the steps of (a) providing a
crude harvest (a) comprising virus particles and impurities,
wherein the impurities are generated from growing said virus
particles on a cell substrate; (b) reducing impurities from the
crude harvest (a) by precipitation with an agent comprising a
protamine salt, preferably a protamine sulphate, to obtain a virus
preparation (b); and further purifying the virus preparation (b) by
method or methods selecting for size of the virus particles, such
as e.g. a sucrose density gradient centrifugation to obtain a virus
preparation (c) comprising the infectious virus particles.
In some embodiments, the concentration of protamine sulphate in
step (b) is about 1 to 10 mg/ml, more preferably about 1 to 5
mg/ml, more preferably about 1 to 2 mg/ml. In one embodiment, the
concentration of protamine sulphate in step (b) is about 2 mg/mL.
In one embodiment, the concentration of protamine sulphate is 1.2
to 1.8 mg/ml, more preferably 1.4 to 1.6 mg/ml. In a preferred
embodiment, the concentration of protamine sulphate in step (b) is
about 1.6 mg/ml or about 2 mg/ml.
In some embodiments, the residual host cell DNA of the virus
preparation (e) is less than 1 .mu.g/mL, especially less than 900,
800, 700, 600, 500, 400, 300 or 200 ng/mL, preferably less than 100
ng/mL. In a preferred embodiment, the residual host cell DNA of the
virus preparation (c) is less than 10 ng/mL. In some embodiments,
the residual host cell protein of the final virus preparation (c)
is less than 10 .mu.g/mL, especially less than 9, 8, 7, 6, 5, 4, 3
or 2 .mu.g/mL, preferably less than 1 .mu.g/mL. In a preferred
embodiment, the residual host cell protein of the virus preparation
(c) is less than 100 ng/mL. In some embodiments, the residual
non-infectious virus particles of the final virus preparation (c)
is less than 10 g/mL, especially less than 9, 8, 7, 6, 5, 4, 3 or 2
.mu.g/mL, preferably less than 1 .mu.g/mL. In a preferred
embodiment, the residual non-infectious virus particles of the
virus preparation (c) is less than 100 ng/mL.
In some embodiments, the residual protamine is less than 1
.mu.g/mL, especially less than 900, 800, 700, 600, 500, 400, 300 or
200 ng/mL, preferably less than 100 ng/mL, more preferably is below
the detection limit of HPLC, in particular below the detection
limit in the final drug substance. In some embodiments, the PS
content is tested by HPLC or size exclusion chromatography (SEC).
For example, HPLC is validated for PS determination in JEV sucrose
gradient pool samples as a routine release assay and is very
sensitive (i.e., LOQ 3 .mu.g/mL; LOD 1 .mu.g/mL). In the current
invention, PS content in virus DS samples was <LOD. In one
embodiment, the HPLC assessment of PS content can be performed on a
Superdex Peptide 10/300GL column (GE: 17-5176-01) using 30%
Acetonitrile, 0.1% Trifluoroacetic acid as solvent with a flow rate
of 0.6 ml/min at 25.degree. C. and detection at 214 nm. A more
sensitive method of measurement for residual protamine in a
purified virus preparation is mass spectrometry (MS). In some
embodiments, the residual PS levels in a virus preparation are
tested by MS or other such highly sensitive method, e.g., nuclear
magnetic resonance (NMR). With this method, residual PS, as well as
fragments and/or break-down products of PS, can be detected at
trace amounts, such as levels as low as, for example, 10.sup.6,
10.sup.7 or 10.sup.8 molecules per typical sample load. In some
embodiments, the PS levels are tested in the sucrose gradient pool.
In some embodiments, the PS levels are tested in the drug product.
In some embodiments, the PS levels are tested in the drug
substance.
In some embodiments, the crude harvest (a) comprising the virus
particles and impurities is subjected to one or more
pre-purification step(s) prior to step (b). In some embodiments,
the one or more pre-purification step(s) comprises digesting host
cell genomic DNA in the crude harvest (a) comprising the virus
particles and impurities by enzymatic treatment. In some
embodiments, the one or more pre-purification step(s) comprises
filtration, ultrafiltration, concentration, buffer exchange and/or
diafiltration. In some embodiments, the one or more
pre-purification steps is filtration using a filter having a pore
size equal to or less than 1 .mu.m. In some embodiments, the filter
has a pore size equal to or less than 0.2 .mu.m. In a preferred
embodiment, the filter has a pore size of 0.2 .mu.m. In some
embodiments, the concentration and/or ultra/diafiltration and/or
buffer exchange is performed by tangential flow filtration (TFF).
In some embodiments, ultra/diafiltration of the crude harvest (a)
comprising the virus particles and impurities is performed using a
hollow fiber membrane having a cut-off of equal to or less than 300
kDa. In a preferred embodiment, the hollow fiber membrane has a
cut-off of 100 kDa.
In some embodiments, the virus particle is a live virus, a chimeric
virus, an attenuated live virus, a modified live virus, or a
recombinant live virus. In a further step, the virus particles of
the invention may by optionally inactivated. In some embodiments,
the virus particle is an attenuated form of the virus particle. For
example, the virus may have reduced infectivity, virulence, and/or
replication in a host, as compared to a wild-type virus. In some
embodiments, the virus is a mutated or modified virus, for example
the nucleic acid of the virus may contain at least one mutation
relative to the wild-type virus. In some embodiments, the virus is
a recombinant live virus, meaning a virus that is generated
recombinantly and may contain nucleic acid from different
sources.
In some embodiments, the virus particle is a live virus, an
attenuated live virus, a modified live virus, or a recombinant live
virus. In some embodiments, the virus belongs to a virus family
selected from the group consisting of Paramyxoviridae,
Orthomyxoviridae, Flaviviridae, Filoviridae, Arenaviridae,
Rhabdoviridae, and Coronaviridae. In some embodiments, the virus
belongs to a virus family selected from the group consisting of
Togaviridae (being live or inactivated), such as alphaviruses, or
Flaviviridae (being live or inactivated). In some embodiments, the
virus is a virus of the family Flaviviridae, i.e. a flavivirus. In
other embodiments, the virus is a Zika virus or Yellow Fever virus.
In preferred embodiments, the virus is a Zika virus. In a most
preferred embodiment, the Zika virus is a Zika virus from the Asian
lineage.
In some embodiments, the relative reduction of impurity of the
final virus preparation relative to the liquid medium (a)
comprising the virus particles and impurities is in a range from 60
to 95%. In some embodiments, the residual impurity of the final
virus preparation is less than 1%. We observed a decrease in the
HCP peaks and the non-infectious aggregate peaks in the HPLC-SEC or
SDS-PAGE. An exact quantification is difficult but one can measure
the density of the SDS-PAGE bands and other methods.
In some embodiments, the filtration of step in (b)(ii) of claim 1
is performed using a filter having a pore size equal to or greater
than 1 .mu.m. In some embodiments, the filter has a pore size equal
to or greater than 0.2 .mu.m. In a preferred embodiment, the filter
has a pore size of 0.2 .mu.m.
In some embodiments, the virus is propagated in a cell line
selected from the group consisting of an EB66 cell line, a Vero
cell line, a Vero-.alpha.His cell line, a HeLa cell line, a HeLa-S3
cell line, a 293 cell line, a PC12 cell line, a CHO cell line, a
3T3 cell line, a PerC6 cell line, a MDSK cell line, a chicken
embryonic fibroblast cell line, a duck cell line, and a diploid
avian cell line. In some embodiments, said cell line is a duck cell
line. In some embodiments, said cell line is a diploid avian cell
line. In some embodiments, said cell line is EB66 cell line. In a
preferred embodiment, said cell line is a Vero cell line.
Aspects of the invention provide a use of any of the processes
described herein for manufacturing a composition for immunization
against a viral infection. In a preferred embodiment, the
composition is a vaccine. In one embodiment, the composition or
vaccine is directed against Chikungunya virus. In one embodiment,
the composition or vaccine is directed against a flavivirus. In one
embodiment, the composition or vaccine is directed against Yellow
Fever virus. In one embodiment, the composition or vaccine is
directed against Zika virus such as e.g. a Zika virus of the Asian
lineage.
Other aspects provide compositions comprising the virus particles
obtainable by any of the processes described herein for treating
and/or preventing a viral infection. In one embodiment, the viral
infection is caused by Chikungunya virus. In one embodiment, the
viral infection is caused by a flavivirus. In one embodiment, the
viral infection is caused by Yellow Fever virus. In one embodiment,
the viral infection is caused by Zika virus such as e.g. a Zika
virus of the Asian lineage.
In some embodiments, the attenuated form of ChikV is derived from
the LR2006-OPY1 ChikV infectious clone (La Reunion isolate). In
some embodiments, the attenuated form of ChikV is the .DELTA.5nsP3
mutant as described by Hallengird et al. (Novel Attenuated
Chikungunya Vaccine Candidates Elicit Protective Immunity in
C57BL/6 mice (2014) Journal of Virology 88(5):2858-2866) or an
immunogenic variant thereof. The immunogenic variant of the
.DELTA.5nsP3 ChikV mutant is herein defined as having at least 80%
sequence identity to the nucleotide sequence of the .DELTA.5nsP3
mutant sequence as provided by SEQ ID NO: 77, especially at least
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88% 89%, 90%, 91%, 92%, 93%,
94%, 95%, 96%, 97%, 98% or at least 99% sequence identity to SEQ ID
NO: 77.
In some embodiments, the Zika virus is derived from the Asian
lineage. In some embodiments, the Zika virus is a Zika virus as
described partially or fully in Sequence section of this
application, i.e. any of sequences SEQ ID Nos 2 to 69 or 78, in
particular all partly or fully described Zika viruses of the Asian
lineages or an immunogenic variant thereof. The immunogenic
variants of the Zika virus or Zika virus of the Asian lineages are
herein defined as having at least 80% sequence identity to the
nucleotide sequence of the sequences described in any of sequences
SEQ ID Nos 2 to 69 or 78, especially at least 81%, 82%, 83%, 84%,
85%, 86%, 87%, 88% 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%
or at least 99% sequence identity.
In some embodiments, the process of the invention results in an
enrichment of infectious virus particles from the crude harvest
comprising infectious virus particles and non-infectious virus
particles and other virus products such that the enrichment of the
infectious virus particles is at least 50%, at least 60%, at least
70%, at least 80%, at least 90%, at least 95%, preferably at least
80%, especially 85% relative to the total virus particle content of
the crude harvest (a) comprising the virus particles and
impurities.
In some embodiments, the residual impurity of the final virus
preparation with respect to all impurities in the crude harvest is
less than 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, preferably less
than 5% as determined by SEC-HPLC (Size Exclusion
Chromatography--HPLC).
In some embodiments, the filtration step of the virus preparation
(b) after contact with the solid-phase matrix is performed using a
filter having a pore size equal to or greater than 1 Lm. In some
embodiments, the filter has a pore size equal to or greater than
0.2 Lm. In a preferred embodiment, the filter has a pore size of
about 0.2 .mu.m, such as 0.22 .mu.m.
In some embodiments, the Zika virus, or Chikungunya virus is
propagated in a cell line selected from the group consisting of an
EB66 cell line, a Vero cell line, a Vero-.alpha.His cell line, a
HeLa cell line, a HeLa-S3 cell line, a 293 cell line, a PC12 cell
line, a CHO cell line, a 3T3 cell line, a PerC6 cell line, a MDSK
cell line, a chicken embryonic fibroblast cell line, a duck cell
line, and a diploid avian cell line. In some embodiments, said cell
line is a duck cell line. In some embodiments, said cell line is a
diploid avian cell line. In some embodiments, said cell line is
EB66 cell line. In a preferred embodiment, said cell line is a Vero
cell line.
Aspects of the invention provide a use of any of the processes
described herein for manufacturing a composition for immunization
against a Zika virus, Yellow Fever, or Chikungunya virus infection.
In a preferred embodiment, the composition is a vaccine. In
preferred embodiments, the vaccine is administered to the subject
once, twice or three or more times. In a preferred embodiment, the
vaccine is administered once or twice. In a preferred embodiment,
the vaccine is administered only once.
The herein disclosed in vivo data regarding immunogenicity of the
inactivated Zika virus vaccine of the current invention indicates
that the virus is surprisingly potently immunogenic and also highly
cross-protective (very similar immunogenicity in African and Asian
strains). Data indicate that immunogenicity was unexpectedly higher
than the recently reported inactivated Zika virus vaccine candidate
(Larocca, et. al, 2016, Nature doi:10.1038/nature18952.).
Inactivated viruses are among the safest vaccines and especially
preferred for delivery to populations where safety is especially
concerning, such as pregnant women, children and immunocompromised
individuals, which makes the herein disclosed inactivated Zika
virus particularly suitable. Obtaining a high titer of inactivated
virus is a challenge in the field. The herein disclosed process for
purifying inactivated Zika virus results in not only a high yield,
but also a very pure drug substance.
Other aspects provide compositions comprising the virus particles
obtainable by any of the processes described herein for treating
and/or preventing a Chikungunya virus infection.
BRIEF DESCRIPTION OF THE DRAWINGS
The accompanying drawings are not intended to be drawn to scale.
The figures are illustrative only and are not required for
enablement of the disclosure. For purposes of clarity, not every
component may be labeled in every drawing, alignments were
performed with the multi alignment package Jalview (Waterhouse et
al., 2009, Bioinformatics 25 (9) 1189-1191). In the drawings:
FIG. 1: Average distance tree (by % identity, nt), complete
genomes.
FIG. 2: Neighbor joining tree (by % identity, nt), complete
genomes.
FIG. 3: Pairwise alignment-Jalview (% identity, nt), complete
genomes.
FIG. 4: Average distance tree (by % identity, aa), E-protein.
FIG. 5: Neighbor joining tree (by % identity. aa), E-protein.
FIG. 6: Pairwise alignment-Jalview (% identity, aa), E-protein.
FIG. 7: Alignment (shading: % identity, aa), E-protein.
FIG. 8: An example of virus particle maturation in the host cell.
As observed in flaviviruses, full maturation of the particles
requires proteolytic cleavage of the precursor membrane
glycoprotein (prM) by the host protease furin. Not all prM
molecules are cleaved, resulting in the release of mature, mosaic
or immature-like conformations from the cells. Mosaic and immature
forms are generally not infectious-only mature virions are
infective and have hemagglutinin (HA)/TCID50 activity. (Figure
adapted from Plevka, et al., Maturation of flaviviruses starts from
one or more icosahedrally independent nucleation centres, EMBO
reports (2011) 12, 602-606).
FIG. 9: CHIKV schematic genome, including non-structural and
structural proteins (labeled "CHIKV") as well as a schematic
representation of the .DELTA.5nsP3 attenuated Chikungunya virus
used to exemplify the purification process of the current invention
(labeled ".DELTA.5nsP3"). The black triangle indicates the
approximate location of the deletion in the nsP3 coding region.
(Figure adapted from Hallengird et al. 2014, supra.)
FIG. 10: Flow-chart showing an exemplary downstream .DELTA.5nsP3
CHIK virus purification process from the crude harvest to
formulation of the (vaccine) drug substance, a preferred embodiment
of the process of the invention.
FIG. 11: Absorbance at 214 nm, 260 nm and 280 nm of individual
sucrose gradient centrifugation (SGC) fractions of a representative
purification run of the process of the invention (A); the SEC-HPLC
analysis of the final pooled fractions containing purified
infectious attenuated .DELTA.5nsP3 ChikV virus particles (B); and a
silver-stained SDS-PAGE gel showing the protein content of the
virus preparation following different steps of the process of the
invention (defined in the table below the figure) (C). The SGC
purified pool consisting of SGC fractions F7-F11 is shown in lane
12.
FIG. 12: SEC area (mAU*min; right axis) and TCID.sub.50 results
(log TCID50/mL; left axis) of attenuated .DELTA.5nsP3 ChikV
production harvests before and after PS treatment. The grey
portions of the bars indicate large losses in SEC area following PS
treatment, but no corresponding change in the total number of
infectious particles (indicated by black portions of the bars) (A);
SEC profile of virus preparation before and after PS addition,
showing a complete removal of large size virus aggregates by PS
treatment as well as a reduction in host cell proteins (HCP) and
LMW impurities (B).
FIG. 13: Electron micrographs of attenuated .DELTA.5nsP3 ChikV
harvest before and after PS treatment.
FIG. 14: Preparation of the sucrose gradient.
FIG. 15: Comparison of four different sucrose gradient
centrifugation experiments performed to empirically determine the
optimal combination of sucrose layers for CHIKV purification. The
CHIKV content in the gradient fractions was determined by SEC. The
sucrose content in the gradient fractions was determined by
refractometry (comparing the value of the refractive index of the
sucrose solution to that of sucrose standard curve the
concentration of sucrose solution can be determined with good
accuracy, this is also referred to as "Brix" scale that is
calibrated to give the percentage (w/w) of sucrose dissolved in
water, i.e. ".degree. Bx"). Protamine sulphate (PS) was determined
by SEC. PS is separated within the sucrose gradient alongside host
cell derived residual contaminants and was therefore used to assess
the quality of CHIKV separation from residual contaminants in the
tested gradients. A: CHIKV load material containing 10% sucrose was
loaded on top of one 50% (w/w) sucrose layer. Determination of
sucrose content in the fractions showed the formation of a linear
gradient. SEC showed concentration of CHIKV within a sucrose
concentration range from 40-30% (w/w) sucrose. PS SEC showed
insufficient separation of PS from CHIKV. B: CHIKV load material
containing 10% sucrose was loaded on top of a two layer system
consisting of a 50% (w/w) sucrose bottom layer and a second 35%
(w/w) sucrose layer. Determination of sucrose content in the
fractions showed the formation of a linear gradient. SEC showed
concentration of CHIKV within a sucrose concentration range from
40-30% (w/w) sucrose. PS SEC showed acceptable separation of PS
from CHIKV, however a slight overlap is still present. C: CHIKV
load material containing 10% sucrose was loaded on top of a two
layer system consisting of a 50% (w/w) sucrose bottom layer and a
second 25% (w/w) sucrose layer. Determination of sucrose content in
the fractions showed the formation of a linear gradient. SEC showed
concentration of CHIKV within a sucrose concentration range from
40-30% (w/w) sucrose. PS SEC showed a good separation of PS from
CHIKV. D: CHIKV load material containing 10% sucrose was loaded on
top of a three layer system consisting of a 50% (w/w) sucrose
bottom layer as well as a 35% and a 15% (w/w) sucrose layer.
Determination of sucrose content in the fractions showed the
formation of a linear gradient and SEC showed concentration of
CHIKV within a sucrose concentration range from 40-30% (w/w)
sucrose. PS SEC showed a very good separation of PS and residual
contaminants from CHIKV. Of the four tested sucrose layer systems
the combination of 3 layers (shown in FIG. 16D) showed the best
separation of the virus particles from residual contaminants and
was therefore used for further DSP development.
FIG. 16: Relative amounts of attenuated .DELTA.5nsP3 ChikV
particles and other components by SEC-HPLC analysis at the
different steps of the process of the invention including, from top
to bottom: crude harvest (a); 10.times. concentrated harvest;
diafiltrated concentrated harvest; PS treated material;
CC700-treated material and SGC purified pool.
FIG. 17: An exemplary downstream virus purification process from
the crude harvest to formulation of the drug substance (vaccine), a
preferred embodiment of the process of the invention (A). A
flow-chart of an exemplary virus inactivation process is shown in
(B).
FIG. 18: PS treatment results in selective removal of Zika virus
aggregates and Vero HCP and LMW impurities (SEC-HPLC of 30.times.
concentrated Zika Virus harvest day 5).
FIG. 19: SEC-HPLC of individual 30.times. concentrated Zika harvest
prior PS treatment at different time points.
FIG. 20: SEC-HPLC of individual 30.times. concentrated Zika harvest
post PS treatment at different time points.
FIG. 21: Representative SDS-PAGE from the sucrose gradient harvest
of a Zika purification is shown.
FIG. 22: Correlation between JEV Antigen content in NIV analysed by
ELISA and SEC-HPLC (Dionex Ultimate 3000, Superose 6 column).
FIG. 23: Comparison of JEV and ZikaV harvest schedules/yields.
FIG. 24: SEC-HPLC elution profile of ZikaV NIV. Data were processed
on Dionex Ultimate 3000/Superose 6 Increase column. Both panels are
from the same chromatogram. The upper graph is the complete elution
profile; the lower graph is an enlargement of the ZIKAV elution
peak.
FIG. 25: SEC-MALLS analysis of inactivated ZikaV.
FIG. 26: Cumulative particle size distribution of Zika NIV.
FIG. 27: Graphical representation of the neutralization of the Zika
virus H/PF/2013 with pooled mouse sera. The number of plaques
without serum was set to 100%. The EC50 was calculated with the
3-parameter method.
FIG. 28: Graphical representation of the neutralization of the Zika
virus MR766 with pooled mouse sera. The number of plaques without
serum was set to 100%. The EC50 was calculated with the 3-parameter
method.
FIG. 29: Change in SEC profile of Yellow fever virus peak after PS
addition according to the invention showing a complete removal of
large size aggregates and LMW impurities.
DETAILED DESCRIPTION OF THE INVENTION
Disclosed herein are processes for the purification of infectious
virus particles, i.e., mature, functional virus particles, e.g.
flavivirus particles (Yellow Fever, Zika Virus, Japanese
Encephalitis virus, Dengue virus) and/or alphavirus particles
(Chikungunya virus). The processes disclosed are characterized by
the removal of undesired by-products of virus production on host
cells, such as non-infectious virus particles and aggregated and
immature virus by-products. The processes provided herein allow the
production of highly-purified virus preparations comprising mostly
infectious virus particles. During the course of the invention, it
was observed that protamine sulphate (PS), added to remove
contaminating DNA during virus purification, resulted not only in
removal of contaminating DNA, but also in the loss of a high
percentage of total virus particles present in the preparation.
Surprisingly, however, quantification of total infectious virus
particles by TCID50 before and after PS treatment revealed that the
absolute number of infectious virus particles did not change
following this loss of total virus particles. This observation
clearly shows that treatment with PS can facilitate selective
removal of non-infectious, aggregated and immature viral
by-products, leaving behind the infectious Chikungunya virus
particles or other infectious virus particles. Because by-products
produced during virus growth on host cells may have different (and
undesirable) immunological properties or other unwanted
side-effects or safety issues, a simple and robust way to remove
these by-products is of high importance for the quality and safety
of the final product.
Protamines are small arginine-rich nuclear proteins, present in
high amounts in the sperm of fish, which have an important role in
DNA packaging during spermatogenesis. Protamine sulfate (or
"protamine" or "PS") can form a stable ion pair with heparin and is
thus commonly used during certain surgeries when the
anti-coagulation effect of heparin is no longer needed. In large
doses, protamine sulfate administered alone can also have a weak
anticoagulant effect ("Protamine sulfate". Wikipedia: The Free
Encyclopedia. Wikimedia Foundation, Inc. 30 Sep. 2015 Web. 26 Nov.
2015<https://en.wikipedia.org/wiki/Protamine_sulfate>).
Protamine Sulphate is additionally routinely used in biotechnology
applications such as DNA precipitation (e.g., removal of host cell
DNA from cell culture processes), purification of DNA binding
proteins and retroviral-mediated gene transfer.
Protamine is obtained from salmon sperm or produced recombinantly
and is used as a sulphate salt. The four major peptides, which
constitute almost the entire nitrogen-containing material in salmon
protamine, have been fully characterized and found to be
polypeptides of 30-32 amino acids in length, of which 21-22
residues are arginine. The average molecular mass is in the range
of 4250 Da for the following sequence: PRRRRSSSRP VRRRRRPRVS
RRRRRRGGRR RR (SEQ ID NO: 1). Herein, protamine is also referred to
as protamine salt, or preferably protamine sulphate.
The present invention relates to the use of protamine sulphate (PS)
in a process of purification of a live virus, wherein the protamine
sulphate facilitates the removal of impurities from a crude virus
harvest, including non-infectious virus particles and aggregates.
As seen in FIG. 8 using flaviviruses as an example, virus
production in the host cell can result in the release of virus
products which are not mature, and non-infectious particles, which
can also be considered impurities according to the present
invention. As such, the present invention also relates to the
enrichment of infectious virus particles from a crude harvest
containing a mixture of virus particles and other viral products in
various stages of maturation.
The use of protamine sulphate can follow crude cell lysis or any
further step after cell lysis (e.g. including after a
pre-purification with filtration, chromatography etc) wherein the
virus particles are further enriched or concentrated and/or other
impurities are removed and/or buffer components are exchanged. The
further steps may comprise filtration or concentration of the crude
cell lysate.
The protamine sulphate may comprise the sequence PRRRRSSSRP
VRRRRRPRVS RRRRRRGGRR RR (SEQ ID NO: 1) or a variant thereof
wherein the amino acid sequence comprises from 28-35 amino acids,
preferably 29-34, more preferably 30-33 amino acids, most
preferably 31 or 32 amino acids. The protamine sulphate preferably
comprises at least 19 arginine residues, more preferably at least
20 arginine residues, more preferably at least 21 arginine
residues, even more preferably at least 22 residues, most
preferably 20 or 21 arginine residues. Further, other protamine
sulphate-like compounds or variants thereof may be used. Therefore,
the use of the term "protamine salt" herein shall serve to
encompass natural variations on SEQ ID NO: 1, preferably, but not
limited to, the protamine sulphate forms.
The process according to the current invention may also comprise
the use of a sucrose gradient, preferably an optimized sucrose
gradient. The sucrose gradient is preferably optimized for the
removal of protamine sulfate, also for the removal of immature
viral particles or other viral particles which are non-infectious
or host cell proteins or nucleic acids (DNA, RNA, mRNA, etc) or
other host cell debris. In the current invention the optimized
sucrose gradient comprises at least two, at least three, at least
four layers of sucrose solutions with different densities. In one
embodiment, the virus preparation to be purified is provided in a
sucrose solution which has a density of about 8%, about 9%, about
10%, about 11%, about 12% sucrose (w/w), preferably about 10%. In
one embodiment, one sucrose solution in the gradient has a density
of about 45%, about 46%, about 47%, about 48%, about 49%, about
50%, about 51%, about 52%, about 53%, about 54%, about 55% sucrose
(w/w), preferably about 50%. In one embodiment, one sucrose
solution in the gradient has a density of about 30%, about 31%,
about 32%, about 33%, about 34%, about 35%, about 36%, about 37%,
about 38%, about 39%, about 40% sucrose (w/w), preferably about
35%. In one embodiment, one sucrose solution in the gradient has a
density of about 10%, about 11%, about 12%, about 13%, about 14%,
about 15%, about 16%, about 17%, about 18%, about 19%, about 20%
sucrose (w/w), preferably about 15% sucrose. In a preferred
embodiment, the sucrose gradient comprises three layers of sucrose
solutions of about 50%, about 35% and about 15% (w/w) sucrose and
the virus composition to be purified is contained in about 10%
(w/w) sucrose. Because the invention provided for means to not only
test for host cell DNA but also immature viral particles, the
skilled person in the art is able to more precisely optimize the
sucrose gradient for most efficient purification and include
additional tools such as PRNT assay to monitor purification
success.
The process comprising the use of protamine sulphate of the
invention can be applied to purification of any virus for use in
pharmaceutical compositions, for example, for a pharmaceutical
composition such as a vaccine where it is important that the virus
is in its infectious form. The virus to be purified may be any live
virus, any live attenuated virus or any live chimeric virus,
preferably a live wild type virus such as a Zika virus of the Asian
lineage. In one embodiment, the virus particle is also be later
inactivated. In a preferred embodiment, the virus is inactivated
with formaldehyde.
In a preferred embodiment, the produced Zika virus is derived from
the Asian lineage (which includes the strains found in South
America and all strains derived from any Asian lineage). In some
other embodiments, the produced Zika virus is a Zika virus as
described in the Sequence section of this application (SEQ ID NO: 2
to 69 or 78).
In another preferred embodiment, the live attenuated Chikungunya
virus is the protective ChikV-ICRES1-.DELTA.5nsP3 described by
Hallengard et al. (Novel Attenuated Chikungunya Vaccine Candidates
Elicit Protective Immunity in C57BL/6 mice (2014) J. Virology,
88(5):2858-2866). Briefly, the ChikV genome carries a
positive-sense single-stranded RNA genome of 11 Kb containing two
open reading frames encoding nonstructural proteins (nsP1 to nsP4)
and structural proteins (C, E3, E2, 6K, and E1), respectively (see
FIG. 9, top construct). The attenuated virus .DELTA.5nsP3, based on
the La Reunion ChikV strain LR2006-OPY1, was obtained by the
substitution of amino acid residues 1656 to 1717 of the P1234
polyprotein with a small linker (aa sequence AYRAAAG) in the
hypervariable region of the nsP3 protein (see FIG. 9, bottom
construct). The .DELTA.5nsP3 ChikV mutant was shown to be
infectious, highly immunogenic and protective against challenge
with wild type ChikV (Hallengard, et al., supra and Hallengard, et
al., Prime-Boost Immunization Strategies against Chikungunya Virus
(2014) J. Virology, 88(22):13333-13343). In one embodiment, the
live attenuated Chikungunya virus may be a variant of the
ChikV-ICRES1-.DELTA.5nsP3 attenuated mutant virus.
A preferred embodiment of the process of the current invention is
shown in FIG. 10 (Chikungunya virus) and FIG. 17A (Zika virus).
TABLE-US-00001 TABLE 1 Overview of process buffers and stock
solutions Final conductivity Buffer Composition Final pH [mS/cm] A
0.5M NaOH n.a. B 0.1M NaOH n.a. C 25 mM Tris, 150 mM NaCl 7.4 .+-.
0.2 16.5 D 1M Tris 7.4 .+-. 0.2 n.a. E 4.5M NaCl n.a. n.a. F 1M
NaCl n.a. n.a. G 1% SDS n.a. n.a. H 50% (w/w) Sucrose in 25 mM 7.4
.+-. 0.2 n.a. Tris, 150 mM NaCl I 35% (w/w) Sucrose in 25 mM 7.4
.+-. 0.2 n.a. Tris, 150 mM NaCl J 15% (w/w) Sucrose in 25 mM 7.4
.+-. 0.2 n.a. Tris, 150 mM NaCl K 10.times. PBS 7.4 .+-. 0.2 n.a. L
50 mg/mL Protamine sulphate 7.4 .+-. 0.2 n.a. M Drug substance
formulation 7.4 .+-. 0.2 1.3 buffer (10 mM Tris(hydroxymethyl)-
aminomethan, 5% Sucrose, 1% (10 mg/mL) rHSA)
TABLE-US-00002 TABLE 2 Abbreviations Degrees Brix = sugar content
.degree.Bx (w/w) of an aqueous solution BSA Bovine serum albumin
CC700 Capto .TM. Core 700 ChikV Chikungunya virus CPE Cytopathic
effect EtOH Ethanol EU Endotoxin units DS Drug Substance DP Drug
Product DSP Downstream Process HCP Host cell protein hcDNA Host
cell DNA hpi Hours post infection HPLC High Performance Liquid
Chromatography ID Inner diameter JEV Japanese Encephalitis virus
LAL Limulus amebocyte lysate LDS buffer Lithium dodecyl sulfate
sample loading buffer LOD Limit of detection LOQ Limit of
quantitation MALLS Multiangle light scattering mAU Milli absorbance
units MS Mass spectroscopy NIV Neutralized inactivated virus PBS
Phosphate buffered saline PD Process development PFU Plaque forming
units p.i. Post-infection PS Protamine sulphate or protamine
sulfate rcf Relative centrifugal force rHSA Recombinant human serum
albumin Rms radius Root mean square radius rMSB Research master
seed bank RSD Relative standard deviation SEC Size exclusion
chromatography SGC Sucrose gradient centrifugation SGP Sucrose
gradient purified SDS Sodium dodecyl sulphate TBS Tris buffered
saline TFF Tangential flow filtration TCID50 Tissue culture
infectious dose 50% UF/DF Ultrafiltration/diafiltration WFI Water
for injection ZikaV Zika virus
Brix:
Degrees Brix (.degree. Bx) is the sugar content of an aqueous
solution. One degree Brix is 1 gram of sucrose in 100 grams of
solution and represents the strength of the solution as percentage
by mass. .degree. Bx corresponds to the sucrose content in % (w/w),
eg. 45.degree. Bx equals 45% (w/w) sucrose.
TABLE-US-00003 TABLE A Primers for Zika virus sequencing: lower
case letters indicate bases not included in ZIKA but containing
restriction sites for later cloning when needed (therefore, two Tms
provided). Tm Primer Primer sequence (5'-3') (gene- Tm (entire
Amplicon Pair Oligoname restriction sites (lower case) specific)
primer) size [bp] 1 9320_Zika_PF_1F SEQ ID NO: 80 69.9 74.6 707
ttaggatccGTTGTTGATCTGTGTGAAT 9321_Zika_PF_1R SEQ ID NO: 81 69.3
75.6 taactcgagCGTACACAACCCAAGTT 2 9322_Zika_PF_2F SEQ ID NO: 82 70
73.9 704 ttaggatccTCACTAGACGTGGGAGTG 9323_Zika_PF_2R SEQ ID NO: 83
69.8 73.7 taactcgagAAGCCATGTCYGATATTGAT 3 9324_Zika_PF_3F SEQ ID
NO: 84 72.3 74.5 712 ttaggatccGCATACAGCATCAGGTG 9325_Zika_PF_3R SEQ
ID NO: 85 72 76.4 taactcgagTGTGGAGTTCCGGTGTCT 4 9326_Zika_PF_4F SEQ
ID NO: 86 70.9 74 712 ttaggatccGAATAGAGCGAARGTTGAGATA
9327_Zika_PF_4R SEQ ID NO: 87 70.5 73.7
taactcgAGTGGTGGGTGATCTTCTTCT 5 9328_Zika_PF_5F SEQ ID NO: 88 70.3
75 704 ttaggatcCAGTCACAGTGGAGGTACAGTAC 9329_Zika_PF_5R SEQ ID NO:
89 71.5 77.3 taactcgagCRCAGATACCATCTTCCC 6 9330_Zika_PF_6F SEQ ID
NO: 90 70.7 72.7 698 ttaggatCCCTTATGTGCTTGGCCTTAG 9331_Zika_PF_6R
SEQ ID NO: 91 70.4 76.9 taactcgagTCTTCAGCCTCCATGTG 7
9332_Zika_PF_7F SEQ ID NO: 92 71.9 75 716
ttaggatccAATGCCCACTCAAACATAGA 9333_Zika_PF_7R SEQ ID NO: 93 71 74
taactcgagTCATTCTCTTCTTCAGCCCTT 8 9334_Zika_PF_8F SEQ ID NO: 94 70.9
75.2 703 ttaggatccAAGGGTGATCGAGGAAT 9335_Zika_PF_8R SEQ ID NO: 95
71.9 73.4 taactcgagTTCCCTTCAGAGAGAGGAGC 9 9336_Zika_PF_9F SEQ ID
NO: 96 71.9 75 699 ttaggatccTCTTTTGCAAACTGCGATC 9337_Zika_PF_9R SEQ
ID NO: 97 71 74.9 taactcgagTCCAGCTGCAAAGGGTAT 10 9338_Zika_PF_10F
SEQ ID NO: 98 71.4 75.8 706 ttaggatccGTGTGGACATGTACATTGA
9339_Zika_PF_10R SEQ ID NO: 99 70.4 75.8 taactcgagCCCATTGCCATAAAGTC
11 9340_Zika_PF_11F SEQ ID NO: 100 71.6 78.1 692
ttaggatccTCATACTGTGGTCCATGGA 9341_Zika_PF_11R SEQ ID NO: 101 74 78
taactcgagGCCCATCTCAACCCTTG 12 9342_Zika_PF_12F SEQ ID NO: 102 70.9
74 707 ttaggatccTAGAGGGCTTCCAGTGC 9343_Zika_PF_12R SEQ ID NO: 103
70.2 72.2 taactcgAGATACTCATCTCCAGGTTTGTTG 13 9344_Zika_PF_13F SEQ
ID NO: 104 70.6 75.4 726 ttaggatccGAAAACAAAACATCAAGAGTG
9345_Zika_PF_13R SEQ ID NO: 105 71.9 75.6
taactcgagGAATCTCTCTGTCATGTGTCCT 14 9346_Zika_PF_14F SEQ ID NO: 106
73.1 75.6 715 ttaggatccTTGATGGCACGACCAAC 9347_Zika_PF_14R SEQ ID
NO: 107 70.8 77.9 ttaggatccGTTGTTGATCTGTGTGAAT 15 9348_Zika_PF_15F
SEQ ID NO: 108 71.9 75.4 719 taactcgagCAGGTCAATGTCCATTG
9349_Zika_PF_15R SEQ ID NO: 109 73.9 77.2
ttaggatccTGTTGTGTTCCTATTGCTGGT 16 9350_Zika_PF_16F SEQ ID NO: 110
72.3 75.4 703 taactcgaGTGATCAGRGCCCCAGC 9351_Zika_PF_16R SEQ ID NO:
111 72 76.3 ttaggatccTGCTGCCCAGAAGAGAA 17 9352_Zika_PF_17F SEQ ID
NO: 112 73.6 76 705 taactcgaGCACCAACAYGGGTTCTT 9353_Zika_PF_17R SEQ
ID NO: 113 72 75.5 ttaggatcCTCAAGGACGGTGTGGC 18 9354_Zika_PF_18F
SEQ ID NO: 114 71.7 75.8 699 taactcgagCAATGATCTTCATGTTGGG
9355_Zika_PF_18R SEQ ID NO: 115 71 74.1 ttaggatccTATGGGGGAGGACTGGT
19 9356_Zika_PF_19F SEQ ID NO: 116 73.3 75.5 711
taactcGAGCCCAGAACCTTGGATC 9357_Zika_PF_19R SEQ ID NO: 117 71.3 76.9
ttaggatcCAGACCCCCAAGAAGGC 20 9358_Zika_PF_20F SEQ ID NO: 118 71.7
75 706 taactcgagCCCCTTTGGTCTTGTCT 9359_Zika_PF_20R SEQ ID NO: 119
71.9 73.9 ttaggatccAGGAAGGATGTATGCAGATG 21 9360_Zika_PF_21F SEQ ID
NO: 120 70.4 75.7 709 taactcgagACATTTGCGCATATGATTTTG
9361_Zika_PF_21R SEQ ID NO: 121 71.8 75
ttaggatccAGGAAGGACACACAAGAGT 22 9362_Zika_PF_22F SEQ ID NO: 122 70
79.1 581 taactcgagACAGGCTGCACAGCTTT 9363_Zika_PF_22R SEQ ID NO: 123
74.8 81.1 ttaggatccTCTCTCATAGGGCACAGAC
Sequences
TABLE-US-00004 A typical form of protamine SEQ ID NO: 1 PRRRRSSSRP
VRRRRRPRVS RRRRRRGGRR RR
Provided below are examples of nucleic acid sequences of the
genomes of Zika viruses that may be used in the methods,
compositions, and/or vaccines described herein.
TABLE-US-00005 KU321639.1 Zika virus strain ZikaSPH2015, Brazil,
complete genome (SEQ ID NO: 2) SEQ ID NO: 2
GTTGTTACTGTTGCTGACTCAGACTGCGACAGTTCGAGTTTGAAGCGAAAGCTAGCAACAGTATCAACAGGTTT-
TATTTGGATT
TGGAAACGAGAGTTTCTGGTCATGAAAAACCCAAAAAAGAAATCCGGAGGATTCCGGATTGTCAATATGCTAAA-
ACGCGGAG
TAGCCCGTGTGAGCCCCTTTGGGGGCTTGAAGAGGCTGCCAGCCGGACTTCTGCTGGGTCATGGGCCCATCAGG-
ATGGTCTTG
GCAATTCTAGCCTTTTTGAGATTCACGGCAATCAAGCCATCACTGGGTCTCATCAATAGATGGGGTTCAGTGGG-
GAAAAAAGA
GGCTATGGAAATAATAAAGAAGTTCAAGAAAGATCTGGCTGCCATGCTGAGAATAATCAATGCTAGGAAGGAGA-
AGAAGAG
ACGGGGCGCAGATACTAGTGTCGGAATTGTTGGCCTCCTGCTGACCACAGCTATGGCAGCGGAGGTCACTAGAC-
GTGGGAGT
GCATACTATATGTACTTGGACAGAAACGATGCTGGGGAGGCCATATCTTTTCCAACCACATTGGGGATGAATAA-
GTGTTATAT
ACAGATCATGGATCTTGGACACATGTGTGATGCCACCATGAGCTATGAATGCCCTATGCTGGATGAGGGGGTGG-
AACCAGAT
GACGTCGATTGTTGGTGCAACACGACGTCAACTTGGGTTGTGTACGGAACCTGCCATCACAAAAAAGGTGAAGC-
ACGGAGAT
CTAGAAGAGCTGTGACGCTCCCCTCCCATTCCACTAGGAAGCTGCAAACGCGGTCGCAAACCTGGTTGGAATCA-
AGAGAATAC
ACAAAGCACTTGATTAGAGTCGAAAATTGGATATTCAGGAACCCTGGCTTCGCGTTAGCAGCAGCTGCCATCGC-
TTGGCTTTTG
GGAAGCTCAACGAGCCAAAAAGTCATATACTTGGTCATGATACTGCTGATTGCCCCGGCATACAGCATCAGGTG-
CATAGGAGT
CAGCAATAGGGACTTTGTGGAAGGTATGTCAGGTGGGACTTGGGTTGATATTGTCTTGGAACATGGAGGTTGTG-
TCACCGTA
ATGGCACAGGACAAACCGACTGTCGACATAGAGCTGGTTACAACAACAGTCAGCAACATGGCGGAGGTAAGATC-
CTACTGCT
ATGAGGCATCAATATCAGACATGGCTTCGGACAGCCGCTGCCCAACACAAGGTGAAGCCTACCTTGACAAGCAA-
TCAGACACT
CAATATGTCTGCAAAAGAACGTTAGTGGACAGAGGCTGGGGAAATGGATGTGGACTTTTTGGCAAAGGGAGTCT-
GGTGACAT
GCGCTAAGTTTGCATGCTCCAAGAAAATGACCGGGAAGAGCATCCAGCCAGAGAATCTGGAGTACCGGATAATG-
CTGTCAGT
TCATGGCTCCCAGCACAGTGGGATGATCGTTAATGACACAGGACATGAAACTGATGAGAATAGAGCGAAGGTTG-
AGATAACG
CCCAATTCACCAAGAGCCGAAGCCACCCTGGGGGGTTTTGGAAGCCTAGGACTTGATTGTGAACCGAGGACAGG-
CCTTGACTT
TTCAGATTTGTATTACTTGACTATGAATAACAAGCACTGGTTGGTTCACAAGGAGTGGTTCCACGACATTCCAT-
TACCTTGGCAC
GCTGGGGCAGACACCGGAACTCCACACTGGAACAACAAAGAAGCACTGGTAGAGTTCAAGGACGCACATGCCAA-
AAGGCAA
ACTGTCGTGGTTCTAGGGAGTCAAGAAGGAGCAGTTCACACGGCCCTTGCTGGAGCTCTGGAGGCTGAGATGGA-
TGGTGCAA
AGGGAAGGCTGTCCTCTGGCCACTTGAAATGTCGCCTGAAAATGGATAAACTTAGATTGAAGGGCGTGTCATAC-
TCCTTGTGT
ACCGCAGCGTTCACATTCACCAAGATCCCGGCTGAAACACTGCACGGGACAGTCACAGTGGAGGTACAGTACGC-
AGGGACAG
ATGGACCTTGCAAGGTTCCAGCTCAGATGGCGGTGGACATGCAAACTCTGACCCCAGTTGGGAGGTTGATAACC-
GCTAACCCC
GTAATCACTGAAAGCACTGAGAACTCTAAGATGATGCTGGAACTTGATCCACCATTTGGGGACTCTTACATTGT-
CATAGGAGTC
GGGGAGAAGAAGATCACCCACCACTGGCACAGGAGTGGCAGCACCATTGGAAAAGCATTTGAAGCCACTGTGAG-
AGGTGCC
AAGAGAATGGCAGTCTTGGGAGACACAGCCTGGGACTTTGGATCAGTTGGAGGCGCTCTCAACTCATTGGGCAA-
GGGCATCC
ATCAAATTTTTGGAGCAGCTTTCAAATCATTGTTTGGAGGAATGTCCTGGTTCTCACAAATTCTCATTGGAACG-
TTGCTGATGTG
GTTGGGTCTGAACACAAAGAATGGATCTATTTCCCTTATGTGCTTGGCCTTAGGGGGAGTGTTGATCTTCTTAT-
CCACAGCCGT
CTCTGCTGATGTGGGGTGCTCGGTGGACTTCTCAAAGAAGGAGACGAGATGCGGTACAGGGGTGTTCGTCTATA-
ACGACGTT
GAAGCCTGGAGGGACAGGTACAAGTACCATCCTGACTCCCCCCGTAGATTGGCAGCAGCAGTCAAGCAAGCCTG-
GGAAGATG
GTATCTGCGGGATCTCCTCTGTTTCAAGAATGGAAAACATCATGTGGAGATCAGTAGAAGGGGAGCTCAACGCA-
ATCCTGGAA
GAGAATGGAGTTCAACTGACGGTCGTTGTGGGATCTGTAAAAAACCCCATGTGGAGAGGTCCACAGAGATTGCC-
CGTGCCTG
TGAACGAGCTGCCCCACGGCTGGAAGGCTTGGGGGAAATCGCACTTCGTCAGAGCAGCAAAGACAAATAACAGC-
TTTGTCGT
GGATGGTGACACACTGAAGGAATGCCCACTCAAACATAGAGCATGGAACAGCTTTCTTGTGGAGGATCATGGGT-
TCGGGGTA
TTTCACACTAGTGTCTGGCTCAAGGTTAGAGAAGATTATTCATTAGAGTGTGATCCAGCCGTTATTGGAACAGC-
TGTTAAGGGA
AAGGAGGCTGTACACAGTGATCTAGGCTACTGGATTGAGAGTGAGAAGAATGACACATGGAGGCTGAAGAGGGC-
CCATCTG
ATCGAGATGAAAACATGTGAATGGCCAAAGTCCCACACATTGTGGACAGATGGAATAGAAGAGAGTGATCTGAT-
CATACCCA
AGTCTTTAGCTGGGCCACTCAGCCATCACAATACCAGAGAGGGCTACAGGACCCAAATGAAAGGGCCATGGCAC-
AGTGAAGA
GCTTGAAATTCGGTTTGAGGAATGCCCAGGCACTAAGGTCCACGTGGAGGAAACATGTGGAACAAGAGGACCAT-
CTCTGAGA
TCAACCACTGCAAGCGGAAGGGTGATCGAGGAATGGTGCTGCAGGGAGTGCACAATGCCCCCACTGTCGTTCCG-
GGCTAAAG
ATGGCTGTTGGTATGGAATGGAGATAAGGCCCAGGAAAGAACCAGAAAGCAACTTAGTAAGGTCAATGGTGACT-
GCAGGATC
AACTGATCACATGGATCACTTCTCCCTTGGAGTGCTTGTGATTCTGCTCATGGTGCAGGAAGGGCTGAAGAAGA-
GAATGACCA
CAAAGATCATCATAAGCACATCAATGGCAGTGCTGGTAGCTATGATCCTGGGAGGATTTTCAATGAGTGACCTG-
GCTAAGCTT
GCAATTTTGATGGGTGCCACCTTCGCGGAAATGAACACTGGAGGAGATGTAGCTCATCTGGCGCTGATAGCGGC-
ATTCAAAGT
CAGACCAGCGTTGCTGGTATCTTTCATCTTCAGAGCTAATTGGACACCCCGTGAAAGCATGCTGCTGGCCTTGG-
CCTCGTGTCT
TTTGCAAACTGCGATCTCCGCCTTGGAAGGCGACCTGATGGTTCTCATCAATGGTTTTGCTTTGGCCTGGTTGG-
CAATACGAGC
GATGGTTGTTCCACGCACTGATAACATCACCTTGGCAATCCTGGCTGCTCTGACACCACTGGCCCGGGGCACAC-
TGCTTGTGGC
GTGGAGAGCAGGCCTTGCTACTTGCGGGGGGTTTATGCTCCTCTCTCTGAAGGGAAAAGGCAGTGTGAAGAAGA-
ACTTACCA
TTTGTCATGGCCCTGGGACTAACCGCTGTGAGGCTGGTCGACCCCATCAACGTGGTGGGGCTGCTGTTGCTCAC-
AAGGAGTGG
GAAGCGGAGCTGGCCCCCTAGCGAAGTACTCACAGCTGTTGGCCTGATATGCGCATTGGCTGGAGGGTTCGCCA-
AGGCAGAT
ATAGAGATGGCTGGGCCCATGGCCGCGGTCGGTCTGCTAATTGTCAGTTACGTGGTCTCAGGAAAGAGTGTGGA-
CATGTACA
TTGAAAGAGCAGGTGACATCACATGGGAAAAAGATGCGGAAGTCACTGGAAACAGTCCCCGGCTCGATGTGGCG-
CTAGATGA
GAGTGGTGATTTCTCCCTGGTGGAGGATGACGGTCCCCCCATGAGAGAGATCATACTCAAGGTGGTCCTGATGA-
CCATCTGTG
GCATGAACCCAATAGCCATACCCTTTGCAGCTGGAGCGTGGTACGTATACGTGAAGACTGGAAAAAGGAGTGGT-
GCTCTATG
GGATGTGCCTGCTCCCAAGGAAGTAAAAAAGGGGGAGACCACAGATGGAGTGTACAGAGTAATGACTCGTAGAC-
TGCTAGG
TTCAACACAAGTTGGAGTGGGAGTTATGCAAGAGGGGGTCTTTCACACTATGTGGCACGTCACAAAAGGATCCG-
CGCTGAGA
AGCGGTGAAGGGAGACTTGATCCATACTGGGGAGATGTCAAGCAGGATCTGGTGTCATACTGTGGTCCATGGAA-
GCTAGATG
CCGCCTGGGACGGGCACAGCGAGGTGCAGCTCTTGGCCGTGCCCCCCGGAGAGAGAGCGAGGAACATCCAGACT-
CTGCCCG
GAATATTTAAGACAAAGGATGGGGACATTGGAGCGGTTGCGCTGGATTACCCAGCAGGAACTTCAGGATCTCCA-
ATCCTAGA
CAAGTGTGGGAGAGTGATAGGACTTTATGGCAATGGGGTCGTGATCAAAAATGGGAGTTATGTTAGTGCCATCA-
CCCAAGGG
AGGAGGGAGGAAGAGACTCCTGTTGAGTGCTTCGAGCCTTCGATGCTGAAGAAGAAGCAGCTAACTGTCTTAGA-
CTTGCATC
CTGGAGCTGGGAAAACCAGGAGAGTTCTTCCTGAAATAGTCCGTGAAGCCATAAAAACAAGACTCCGTACTGTG-
ATCTTAGCT
CCAACCAGGGTTGTCGCTGCTGAAATGGAGGAAGCCCTTAGAGGGCTTCCAGTGCGTTATATGACAACAGCAGT-
CAATGTCAC
CCACTCTGGAACAGAAATCGTCGACTTAATGTGCCATGCCACCTTCACTTCACGTCTACTACAGCCAATCAGAG-
TCCCCAACTAT
AATCTGTATATTATGGATGAGGCCCACTTCACAGATCCCTCAAGTATAGCAGCAAGAGGATACATTTCAACAAG-
GGTTGAGAT
GGGCGAGGCGGCTGCCATCTTCATGACCGCCACGCCACCAGGAACCCGTGACGCATTTCCGGACTCCAACTCAC-
CAATTATGG
ACACCGAAGTGGAAGTCCCAGAGAGAGCCTGGAGCTCAGGCTTTGATTGGGTGACGGATTATTCTGGAAAAACA-
GTTTGGTT
TGTTCCAAGCGTGAGGAACGGCAATGAGATCGCAGCTTGTCTGACAAAGGCTGGAAAACGGGTCATACAGCTCA-
GCAGAAAG
ACTTTTGAGACAGAGTTCCAGAAAACAAAACATCAAGAGTGGGACTTTGTCGTGACAACTGACATTTCAGAGAT-
GGGCGCCAA
CTTTAAAGCTGACCGTGTCATAGATTCCAGGAGATGCCTAAAGCCGGTCATACTTGATGGCGAGAGAGTCATTC-
TGGCTGGAC
CCATGCCTGTCACACATGCCAGCGCTGCCCAGAGGAGGGGGCGCATAGGCAGGAATCCCAACAAACCTGGAGAT-
GAGTATCT
GTATGGAGGTGGGTGCGCAGAGACTGACGAAGACCATGCACACTGGCTTGAAGCAAGAATGCTCCTTGACAATA-
TTTACCTCC
AAGATGGCCTCATAGCCTCGCTCTATCGACCTGAGGCCGACAAAGTAGCAGCCATTGAGGGAGAGTTCAAGCTT-
AGGACGGA
GCAAAGGAAGACCTTTGTGGAACTCATGAAAAGAGGAGATCTTCCTGTTTGGCTGGCCTATCAGGTTGCATCTG-
CCGGAATAA
CCTACACAGATAGAAGATGGTGCTTTGATGGCACGACCAACAACACCATAATGGAAGACAGTGTGCCGGCAGAG-
GTGTGGAC
CAGACACGGAGAGAAAAGAGTGCTCAAACCGAGGTGGATGGACGCCAGAGTTTGTTCAGATCATGCGGCCCTGA-
AGTCATTC
AAGGAGTTTGCCGCTGGGAAAAGAGGAGCGGCTTTTGGAGTGATGGAAGCCCTGGGAACACTGCCAGGACACAT-
GACAGAG
AGATTCCAGGAAGCCATTGACAACCTCGCTGTGCTCATGCGGGCAGAGACTGGAAGCAGGCCTTACAAAGCCGC-
GGCGGCCC
AATTGCCGGAGACCCTAGAGACCATTATGCTTTTGGGGTTGCTGGGAACAGTCTCGCTGGGAATCTTTTTCGTC-
TTGATGAGG
AACAAGGGCATAGGGAAGATGGGCTTTGGAATGGTGACTCTTGGGGCCAGCGCATGGCTCATGTGGCTCTCGGA-
AATTGAGC
CAGCCAGAATTGCATGTGTCCTCATTGTTGTGTTCCTATTGCTGGTGGTGCTCATACCTGAGCCAGAAAAGCAA-
AGATCTCCCC
AGGACAACCAAATGGCAATCATCATCATGGTAGCAGTAGGTCTTCTGGGCTTGATTACCGCCAATGAACTCGGA-
TGGTTGGAG
AGAACAAAGAGTGACCTAAGCCATCTAATGGGAAGGAGAGAGGAGGGGGCAACCATGGGATTCTCAATGGACAT-
TGACCTG
CGGCCAGCCTCAGCTTGGGCCATCTATGCTGCCTTGACAACTTTCATTACCCCAGCCGTCCAACATGCAGTGAC-
CACTTCATACA
ACAACTACTCCTTAATGGCGATGGCCACGCAAGCTGGAGTGTTGTTTGGTATGGGCAAAGGGATGCCATTCTAC-
GCATGGGAC
TTTGGAGTCCCGCTGCTAATGATAGGTTGCTACTCACAATTAACGCCCCTGACCCTAATAGTGGCCATCATTTT-
GCTCGTGGCG
CACTACATGTACTTGATCCCAGGGCTGCAGGCAGCAGCTGCGCGTGCTGCCCAGAAGAGAACGGCAGCTGGCAT-
CATGAAGA
ACCCTGTTGTGGATGGAATAGTGGTGACTGACATTGACACAATGACAATTGACCCCCAAGTGGAGAAAAAGATG-
GGACAGGT
GCTACTCATGGCAGTAGCCGTCTCCAGCGCCATACTGTCGCGGACCGCCTGGGGGTGGGGGGAGGCTGGGGCCC-
TGATCACA
GCCGCAACTTCCACTTTGTGGGAAGGCTCTCCGAACAAGTACTGGAACTCCTCTACAGCCACTTCACTGTGTAA-
CATTTTTAGG
GGAAGTTACTTGGCTGGAGCTTCTCTAATCTACACAGTAACAAGAAACGCTGGCTTGGTCAAGAGACGTGGGGG-
TGGAACAG
GAGAGACCCTGGGAGAGAAATGGAAGGCCCGCTTGAACCAGATGTCGGCCCTGGAGTTCTACTCCTACAAAAAG-
TCAGGCAT
CACCGAGGTGTGCAGAGAAGAGGCCCGCCGCGCCCTCAAGGACGGTGTGGCAACGGGAGGCCATGCTGTGTCCC-
GAGGAAG
TGCAAAGCTGAGATGGTTGGTGGAGCGGGGATACCTGCAGCCCTATGGAAAGGTCATTGATCTTGGATGTGGCA-
GAGGGGG
CTGGAGTTACTACGCCGCCACCATCCGCAAAGTTCAAGAAGTGAAAGGATACACAAAAGGAGGCCCTGGTCATG-
AAGAACCC
GTGTTGGTGCAAAGCTATGGGTGGAACATAGTCCGTCTTAAGAGTGGGGTGGACGTCTTTCATATGGCGGCTGA-
GCCGTGTG
ACACGTTGCTGTGTGACATAGGTGAGTCATCATCTAGTCCTGAAGTGGAAGAAGCACGGACGCTCAGAGTCCTC-
TCCATGGTG
GGGGATTGGCTTGAAAAAAGACCAGGAGCCTTTTGTATAAAAGTGTTGTGCCCATACACCAGCACTATGATGGA-
AACCCTGGA
GCGACTGCAGCGTAGGTATGGGGGAGGACTGGTCAGAGTGCCACTCTCCCGCAACTCTACACATGAGATGTACT-
GGGTCTCT
GGAGCGAAAAGCAACACCATAAAAAGTGTGTCCACCACGAGCCAGCTCCTCTTGGGGCGCATGGACGGGCCTAG-
GAGGCCA
GTGAAATATGAGGAGGATGTGAATCTCGGCTCTGGCACGCGGGCTGTGGTAAGCTGCGCTGAAGCTCCCAACAT-
GAAGATCA
TTGGTAACCGCATTGAAAGGATCCGCAGTGAGCACGCGGAAACGTGGTTCTTTGACGAGAACCACCCATATAGG-
ACATGGGC
TTACCATGGAAGCTATGAGGCCCCCACACAAGGGTCAGCGTCCTCTCTAATAAACGGGGTTGTCAGGCTCCTGT-
CAAAACCCT
GGGATGTGGTGACTGGAGTCACAGGAATAGCCATGACCGACACCACACCGTATGGTCAGCAAAGAGTTTTCAAG-
GAAAAAGT
GGACACTAGGGTGCCAGACCCCCAAGAAGGTACTCGTCAGGTTATGAGCATGGTCTCTTCCTGGTTGTGGAAAG-
AGCTAGGC
AAACACAAACGGCCACGAGTCTGTACCAAAGAAGAGTTCATCAACAAGGTTCGTAGCAATGCAGCATTAGGGGC-
AATATTTGA
AGAGGAAAAAGAGTGGAAGACTGCAGTGGAAGCTGTGAACGATCCAAGGTTCTGGGCTCTAGTGGACAAGGAAA-
GAGAGCA
CCACCTGAGAGGAGAGTGCCAGAGTTGTGTGTACAACATGATGGGAAAAAGAGAAAAGAAACAAGGGGAATTTG-
GAAAGGC
CAAGGGCAGCCGCGCCATCTGGTATATGTGGCTAGGGGCTAGATTTCTAGAGTTCGAAGCCCTTGGATTCTTGA-
ACGAGGATC
ACTGGATGGGGAGAGAGAACTCAGGAGGTGGTGTTGAAGGGCTGGGATTACAAAGACTCGGATATGTCCTAGAA-
GAGATGA
GTCGCATACCAGGAGGAAGGATGTATGCAGATGACACTGCTGGCTGGGACACCCGCATCAGCAGGTTTGATCTG-
GAGAATGA
AGCTCTAATCACCAACCAAATGGAGAAAGGGCACAGGGCCTTGGCATTGGCCATAATCAAGTACACATACCAAA-
ACAAAGTG
GTAAAGGTCCTTAGACCAGCTGAAAAAGGGAAAACAGTTATGGACATTATTTCGAGACAAGACCAAAGGGGGAG-
CGGACAA
GTTGTCACTTACGCTCTTAACACATTTACCAACCTAGTGGTGCAACTCATTCGGAATATGGAGGCTGAGGAAGT-
CCTAGAGATG
CAAGACTTGTGGCTGCTGCGGAGGTCAGAGAAAGTGACCAACTGGTTGCAGAGCAACGGATGGGATAGGCTCAA-
ACGAATG
GCAGTCAGTGGAGATGATTGCGTTGTGAAGCCAATTGATGATAGGTTTGCACATGCCCTCAGGTTCTTGAATGA-
TATGGGAAA
AGTTAGGAAGGACACACAAGAGTGGAAACCCTCAACTGGATGGGACAACTGGGAAGAAGTTCCGTTTTGCTCCC-
ACCACTTCA
ACAAGCTCCATCTCAAGGACGGGAGGTCCATTGTGGTTCCCTGCCGCCACCAAGATGAACTGATTGGCCGGGCC-
CGCGTCTCT
CCAGGGGCGGGATGGAGCATCCGGGAGACTGCTTGCCTAGCAAAATCATATGCGCAAATGTGGCAGCTCCTTTA-
TTTCCACAG
AAGGGACCTCCGACTGATGGCCAATGCCATTTGTTCATCTGTGCCAGTTGACTGGGTTCCAACTGGGAGAACTA-
CCTGGTCAAT
CCATGGAAAGGGAGAATGGATGACCACTGAAGACATGCTTGTGGTGTGGAACAGAGTGTGGATTGAGGAGAACG-
ACCACAT
GGAAGACAAGACCCCAGTTACGAAATGGACAGACATTCCCTATTTGGGAAAAAGGGAAGACTTGTGGTGTGGAT-
CTCTCATA
GGGCACAGACCGCGCACCACCTGGGCTGAGAACATTAAAAACACAGTCAACATGGTGCGCAGGATCATAGGTGA-
TGAAGAAA
AGTACATGGACTACCTATCCACCCAAGTTCGCTACTTGGGTGAAGAAGGGTCTACACCTGGAGTGCTGTAAGCA-
CCAATCTTA
ATGTTGTCAGGCCTGCTAGTCAGCCACAGCTTGGGGAAAGCTGTGCAGCCTGTGACCCCCCCAGGAGAAGCTGG-
GAAACCAA
GCCTATAGTCAGGCCGAGAACGCCATGGCACGGAAGAAGCCATGCTGCCTGTGAGCCCCTCAGAGGACACTGAG-
TCAAAAAA
CCCCACGCGCTTGGAGGCGCAGGATGGGAAAAGAAGGTGGCGACCTTCCCCACCCTTCAATCTGGGGCCTGAAC-
TGGAGATC AGCTGTGGATCTCCAGAAGAGGGACTAGTGGTTAGAGGAGA KU497555.1 Zika
virus isolate Brazil-ZKV2015, Brazil, complete genome SEQ ID NO: 3
CCAATCTGTGAATCAGACTGCGACAGTTCGAGTTTGAAGCGAAAGCTAGCAACAGTATCAACAGGTTTTATTTT-
GGATTTGGA
AACGAGAGTTTCTGGTCATGAAAAACCCAAAAAAGAAATCCGGAGGATTCCGGATTGTCAATATGCTAAAACGC-
GGAGTAGC
CCGTGTGAGCCCCTTTGGGGGCTTGAAGAGGCTGCCAGCCGGACTTCTGCTGGGTCATGGGCCCATCAGGATGG-
TCTTGGCG
ATTCTAGCCTTTTTGAGATTCACGGCAATCAAGCCATCACTGGGTCTCATCAATAGATGGGGTTCAGTGGGGAA-
AAAAGAGGC
TATGGAAATAATAAAGAAGTTCAAGAAAGATCTGGCTGCCATGCTGAGAATAATCAATGCCAGGAAGGAGAAGA-
AGAGACG
AGGCGCAGATACTAGTGTCGGAATCGTTGGCCTCCTGCTGACCACAGCTATGGCAGCGGAGGTCACTAGACGTG-
GGAGTGCA
TACTATATGTACTTGGACAGAAACGATGCTGGGGAGGCCATATCTTTTCCAACCACATTGGGGATGAATAAGTG-
TTATATACA
GATCATGGATCTTGGACACATGTGTGATGCCACCATGAGCTATGAATGCCCTATGCTGGATGAGGGGGTGGAAC-
CAGATGAC
GTCGATTGTTGGTGCAACACGACGTCAACTTGGGTTGTGTACGGAACCTGCCATCACAAAAAAGGTGAAGCACG-
GAGATCTA
GAAGAGCTGTGACGCTCCCCTCCCATTCCACTAGGAAGCTGCAAACGCGGTCGCAAACCTGGTTGGAATCAAGA-
GAATACACA
AAGCACTTGATTAGAGTCGAAAATTGGATATTCAGGAACCCTGGCTTCGCGTTAGCAGCAGCTGCCATCGCTTG-
GCTTTTGGG
AAGCTCAACGAGCCAAAAAGTCATATACTTGGTCATGATACTGCTGATTGCCCCGGCATACAGCATCAGGTGCA-
TAGGAGTCA
GCAATAGGGACTTTGTGGAAGGTATGTCAGGTGGGACTTGGGTTGATGTTGTCTTGGAACATGGGGGTTGTGTC-
ACCGTAAT
GGCACAGGACAAACCGACTGTCGACATAGAGCTGGTTACAACAACAGTCAGCAACATGGCGGAGGTAAGATCCT-
ACTGCTAT
GAGGCATCAATATCAGACATGGCTTCGGACAGCCGCTGCCCAACACAAGGTGAAGCCTACCTTGACAAGCAATC-
AGACACTCA
ATATGTCTGCAAAAGAACGTTAGTGGACAGAGGCTGGGGAAATGGATGTGGACTTTTTGGCAAAGGGAGCCTGG-
TGACATGC
GCTAAGTTTGCATGCTCCAAGAAAATGACCGGGAAGAGCATCCAGCCAGAGAATCTGGAGTACCGGATAATGCT-
GTCAGTTC
ATGGCTCCCAGCACAGTGGGATGATCGTTAATGACACAGGACATGAAACTGATGAGAATAGAGCGAAGGTTGAG-
ATAACGCC
CAATTCACCAAGAGCCGAAGCCACCCTGGGGGGTTTTGGAAGCTTAGGACTTGATTGTGAACCGAGGACAGGCC-
TTGACTTTT
CAGATTTGTATTACTTGACTATGAATAACAAGCACTGGTTGGTTCACAAGGAGTGGTTCCACGACATTCCATTA-
CCTTGGCACG
CTGGGGCAGACACCGGAACTCCACACTGGAACAACAAAGAAGCACTGGTAGAGTTCAAGGACGCACATGCCAAA-
AGGCAAA
CTGTCGTGGTTCTAGGGACTCAAGAAGGAGCAGTTCACACGGCCCTTGCTGGAGCTCTGGAGGCTGAGATGGAT-
GGTGCAAA
GGGAAGGCTGTCCTCTGGCCACTTGAAATGTCGCCTGAAAATGGATAAACTTAGATTGAAGGGCGTGTCATACT-
CCTTGTGTA
CCGCAGCGTTCACATTCACCAAGATCCCGGCTGAAACACTGCACGGGACAGTCACAGTGGAGGTACAGTACGCA-
GGGACAGA
TGGACCTTGCAAGGTTCCAGCTCAGATGGCGGTGGACATGCAAACTCTGACCCCAGTTGGGAGGTTGATAACCG-
CTAACCCCG
TAATCACTGAAAGCACTGAGAACTCTAAGATGATGCTGGAACTTGATCCACCATTTGGGGACTCTTACATTGTC-
ATAGGAGTCG
GGGAGAAGAAGATCACCCACCACTGGCACAGGAGTGGCAGCACCATTGGAAAAGCATTTGAAGCCACTGTGAGA-
GGTGCCA
AGAGAATGGCAGTCTTGGGAGACACAGCCTGGGACTTTGGATCAGTTGGAGGCGCTCTCAACTCATTGGGCAAG-
GGCATCCA
TCAAATTTTTGGAGCAGCTTTCAAATCATTGTTTGGAGGAATGTCCTGGTTCTCACAAATTCTCATTGGAACGT-
TGCTGATGTGG
TTGGGTCTGAACACAAAGAATGGATCTATTTCCCTTATGTGCTTGGCCTTAGGGGGAGTGTTGATCTTCTTATC-
CACAGCCGTC
TCTGCTGATGTGGGGTGCTCGGTGGACTTCTCAAAGAAGGAGACGAGATGTGGTACAGGGGTGTTCGTCTATAA-
CGACGTTG
AAGCCTGGAGGGACAGGTACAAGTACCATCCTGACTCTCCCCGTAGATTGGCAGCAGCAGTCAAGCAAGCCTGG-
GAAGATGG
TATCTGCGGGATCTCCTCTGTTTCAAGAATGGAAAACATCATGTGGAGATCAGTAGAAGGGGAGCTTAACGCAA-
TCCTGGAAG
AGAATGGAGTTCAACTGACGGTCGTTGTGGGATCTGTAAAAAACCCCATGTGGAGAGGTCCACAGAGATTGCCC-
GTGCCTGT
GAACGAGCTGCCCCACGGCTGGAAGGCTTGGGGGAAATCGTACTTCGTCAGAGCAGCAAAGACAAATAACAGCT-
TTGTCGTG
GATGGTGACACACTGAAGGAATGCCCACTCAAACATAGAGCATGGAACAGCTTTCTTGTGGAGGATCATGGGTT-
CGGGGTAT
TTCACACTAGTGTCTGGCTCAAGGTTAGAGAAGATTATTCATTAGAGTGTGATCCAGCCGTTATTGGAACAGCT-
GTTAAGGGA
AAGGAGGCTGTACACAGTGATCTAGGCTACTGGATTGAGAGTGAGAAGAATGACACATGGAGGCTGAAGAGGGC-
CCATCTG
ATCGAGATGAAAACATGTGAATGGCCAAAGTCCCACACATTGTGGACAGATGGAATAGAAGAGAGTGATCTGAT-
CATACCCA
AGTCTTTAGCTGGGCCACTCAGCCATCACAATACCAGAGAGGGCTACAGGACCCAAATGAAAGGGCCATGGCAC-
AGTGAAGA
GCTTGAAATTCGGTTTGAGGAATGCCCAGGCACTAAGGTCCACGTGGAGGAAACATGTGGAACAAGAGGACCAT-
CTCTGAGA
TCAACCACTGCAAGCGGAAGGGTGATCGAGGAATGGTGCTGCAGGGAGTGCACAATGCCCCCACTGTCGTTCCG-
GGCTAAAG
ATGGCTGTTGGTATGGAATGGAGATAAGGCCCAGGAAAGAACCAGAAAGCAACTTAGTAAGGTCAATGGTGACT-
GCAGGATC
AACTGATCACATGGATCACTTCTCCCTTGGAGTGCTTGTGATTCTGCTCATGGTGCAGGAAGGGCTGAAGAAGA-
GAATGACCA
CAAAGATCATCATAAGCACATCAATGGCAGTGCTGGTAGCTATGATCCTGGGAGGATTTTCAATGAGTGACCTG-
GCTAAGCTT
GCAATTTTGATGGGTGCCACCTTCGCGGAAATGAACACTGGAGGAGATGTAGCTCATCTGGCGCTGATAGCGGC-
ATTCAAAGT
CAGACCAGCGTTGCTGGTATCTTTCATCTTCAGAGCTAATTGGACACCCCGTGAAAGCATGCTGCTGGCCTTGG-
CCTCGTGTTT
TTTGCAAACTGCGATCTCCGCCTTGGAAGGCGACCTGATGGTTCTCATCAATGGTTTTGCTTTGGCCTGGTTGG-
CAATACGAGC
GATGGTTGTTCCACGCACTGACAACATCACCTTGGCAATCCTGGCTGCTCTGACACCACTGGCCCGGGGCACAC-
TGCTTGTGGC
GTGGAGAGCAGGCCTTGCTACTTGCGGGGGGTTTATGCTCCTCTCTCTGAAGGGAAAAGGCAGTGTGAAGAAGA-
ACTTACCA
TTTGTCATGGCCCTGGGACTAACCGCTGTGAGGCTGGTCGACCCCATCAACGTGGTGGGACTGCTGTTGCTCAC-
AAGGAGTGG
GAAGCGGAGCTGGCCCCCTAGCGAAGTACTCACAGCTGTTGGCCTGATATGCGCATTGGCTGGAGGGTTCGCCA-
AGGCAGAT
ATAGAGATGGCTGGGCCCATGGCCGCGGTCGGTCTGCTAATTGTCAGTTACGTGGTCTCAGGAAAGAGTGTGGA-
CATGTACA
TTGAAAGAGCAGGTGACATCACATGGGAAAAAGATGCGGAAGTCACTGGAAACAGTCCCCGGCTCGATGTGGCG-
CTAGATGA
GAGTGGTGACTTCTCCCTGGTGGAGGATGACGGTCCCCCCATGAGAGAGATCATACTCAAGGTGGTCCTGATGA-
CCATCTGTG
GCATGAACCCAATAGCCATACCCTTTGCAGCTGGAGCGTGGTACGTATACGTGAAGACTGGAAAAAGGAGTGGT-
GCTCTATG
GGATGTGCCTGCTCCCAAGGAAGTAAAAAAGGGGGAGACCACAGATGGAGTGTACAGAGTAATGACTCGTAGAC-
TGCTAGG
TTCAACACAAGTTGGAGTGGGAGTTATGCAAGAGGGGGTCTTTCACACTATGTGGCACGTCACAAAAGGATCCG-
CGCTGAGA
AGCGGTGAAGGGAGACTTGATCCATACTGGGGAGATGTCAAGCAGGATCTGGTGTCATACTGTGGTCCATGGAA-
GCTAGATG
CCGCCTGGGACGGGCACAGCGAGGTGCAGCTCTTGGCCGTGCCCCCCGGAGAGAGAGCGAGGAACATCCAGACT-
CTGCCCG
GAATATTTAAGACAAAGGATGGGGACATTGGAGCGGTTGCGCTGGATTACCCAGCAGGAACTTCAGGATCTCCA-
ATCCTAGA
CAAGTGTGGGAGAGTGATAGGACTTTATGGCAATGGGGTCGTGATAAAAAATGGGAGTTATGTTAGTGCCATCA-
CCCAAGGG
AGGAGGGAGGAAGAGACTCCTGTTGAGTGCTTCGAGCCTTCGATGCTGAAGAAGAAGCAGCTAACTGTCTTAGA-
CTTGCATC
CTGGAGCTGGGAAAACCAGGAGAGTTCTTCCTGAAATAGTCCGTGAAGCCATAAAAACAAGACTCCGTACTGTG-
ATCTTAGCT
CCAACCAGGGTTGTCGCTGCTGAAATGGAGGAAGCCCTTAGAGGGCTTCCAGTGCGTTATATGACAACAGCAGT-
CAATGTCAC
CCACTCTGGAACAGAAATCGTCGACTTAATGTGCCATGCCACCTTCACTTCACGTCTACTACAGCCAATCAGAG-
TCCCCAACTAT
AATCTGTATATTATGGATGAGGCCCACTTCACAGATCCCTCAAGCATAGCAGCAAGAGGATACATTTCAACAAG-
GGTTGAGAT
GGGCGAGGCGGCTGCCATCTTCATGACCGCCACGCCACCAGGAACCCGTGACGCATTTCCGGACTCCAACTCAC-
CAATTATGG
ACACCGAAGTGGAAGTCCCAGAGAGAGCCTGGAGCTCAGGCTTTGATTGGGTGACGGATCATTCTGGAAAAACA-
GTTTGGTT
TGTTCCAAGCGTGAGGAACGGCAATGAGATCGCAGCTTGTCTGACAAAGGCTGGAAAACGGGTCATACAGCTCA-
GCAGAAAG
ACTTTTGAGACAGAGTTCCAGAAAACAAAACATCAAGAGTGGGACTTTGTCGTGACAACTGACATTTCAGAGAT-
GGGCGCCAA
CTTTAAAGCTGACCGTGTCATAGATTCCAGGAGATGCCTAAAGCCGGTCATACTTGATGGCGAGAGAGTCATTC-
TGGCTGGAC
CCATGCCTGTCACACATGCCAGCGCTGCCCAGAGGAGGGGGCGCATAGGCAGGAATCCCAACAAACCTGGAGAT-
GAGTACCT
GTATGGAGGTGGGTGCGCAGAGACTGACGAAGACCATGCACACTGGCTTGAAGCAAGAATGCTCCTTGACAATA-
TTTACCTCC
AAGATGGCCTCATAGCCTCGCTCTATCGACCTGAGGCCGACAAAGTAGCAGCCATTGAGGGAGAGTTCAAGCTT-
AGGACGGA
GCAAAGGAAGACCTTTGTGGAACTCATGAAAAGAGGAGATCTTCCTGTTTGGCTGGCCTATCAGGTTGCATCTG-
CCGGAATAA
CCTACACAGATAGAAGATGGTGCTTTGATGGCACGACCAACAACACCATAATGGAAGACAGTGTGCCGGCAGAG-
GTGTGGAC
CAGACACGGAGAGAAAAGAGTGCTCAAACCGAGGTGGATGGACGCCAGAGTTTGTTCAGATCATGCGGCCCTGA-
AGTCATTC
AAGGAGTTTGCCGCTGGGAAAAGAGGAGCGGCTTTTGGAGTGATGGAAGCCCTGGGAACACTGCCAGGACACAT-
GACAGAG
AGATTCCAGGAAGCCATTGACAACCTCGCTGTGCTCATGCGGGCAGAGACTGGAAGCAGGCCTTACAAAGCCGC-
GGCGGCCC
AATTGCCGGAGACCCTAGAGACCATTATGCTTTTGGGGTTGCTGGGAACAGTCTCGCTGGGAATCTTTTTCGTC-
TTGATGAGG
AACAAGGGCATAGGGAAGATGGGCTTTGGAATGGTGACTCTTGGGGCCAGCGCATGGCTCATGTGGCTCTCGGA-
AATTGAGC
CAGCCAGAATTGCATGTGTCCTCATTGTTGTGTTCCTATTGCTGGTGGTGCTCATACCTGAGCCAGAAAAGCAA-
AGATCTCCCC
AGGACAACCAAATGGCAATCATCATCATGGTAGCAGTAGGTCTTCTGGGCTTGATTACCGCCAATGAACTCGGA-
TGGTTGGAG
AGAACAAAGAGTGACCTAAGCCATCTAATGGGAAGGAGAGAGGAGGGGGCAACCATAGGATTCTCAATGGACAT-
TGACCTG
CGGCCAGCCTCAGCTTGGGCCATCTATGCTGCCTTGACAACTTTCATTACCCCAGCCGTCCAACATGCAGTGAC-
CACTTCATACA
ACAACTACTCCTTAATGGCGATGGCCACGCAAGCTGGAGTGTTGTTTGGTATGGGCAAAGGGATGCCATTCTAC-
GCATGGGAC
TTTGGAGTCCCGCTGCTAATGATAGGTTGCTACTCACAATTAACACCCCTGACCCTAATAGTGGCCATCATTTT-
GCTCGTGGCGC
ACTACATGTACTTGATCCCAGGGCTGCAGGCAGCAGCTGCGCGTGCTGCCCAGAAGAGAACGGCAGCTGGCATC-
ATGAAGAA
CCCTGTTGTGGATGGAATAGTGGTGACTGACATTGACACAATGACAATTGACCCCCAAGTGGAGAAAAAGATGG-
GACAGGTG
CTACTCATAGCAGTAGCCGTCTCCAGCGCCATACTGTCGCGGACCGCCTGGGGGTGGGGGGAGGCTGGGGCCCT-
GATCACAG
CCGCAACTTCCACTTTGTGGGAAGGCTCTCCGAACAAGTACTGGAACTCCTCTACAGCCACTTCACTGTGTAAC-
ATTTTTAGGG
GAAGTTACTTGGCTGGAGCTTCTCTAATCTACACAGTAACAAGAAACGCTGGCTTGGTCAAGAGACGTGGGGGT-
GGAACAGG
AGAGACCCTGGGAGAGAAATGGAAGGCCCGCTTGAACCAGATGTCGGCCCTGGAGTTCTACTCCTACAAAAAGT-
CAGGCATC
ACCGAGGTGTGCAGAGAAGAGGCCCGCCGCGCCCTCAAGGACGGTGTGGCAACGGGAGGCCATGCTGTGTCCCG-
AGGAAGT
GCAAAGCTGAGATGGTTGGTGGAGCGGGGATACCTGCAGCCCTATGGAAAGGTCATTGATCTTGGATGTGGCAG-
AGGGGGC
TGGAGTTACTACGCCGCCACCATCCGCAAAGTTCAAGAAGTGAAAGGATACACAAAAGGAGGCCCTGGTCATGA-
AGAACCCG
TGTTGGTGCAAAGCTATGGGTGGAACATAGTCCGTCTTAAGAGTGGGGTGGACGTCTTTCATATGGCGGCTGAG-
CCGTGTGA
CACGTTGCTGTGTGACATAGGTGAGTCATCATCTAGTCCTGAAGTGGAAGAAGCACGGACGCTCAGAGTCCTCT-
CCATGGTGG
GGGATTGGCTTGAAAAAAGACCAGGAGCCTTTTGCATAAAAGTGTTGTGCCCATACACCAGCACTATGATGGAA-
ACCCTGGAG
CGACTGCAGCGTAGGTATGGGGGAGGACTGGTCAGAGTGCCACTCTCCCGCAACTCTACACATGAGATGTACTG-
GGTCTCTG
GAGCGAAAAGCAACACCATAAAAAGTGTGTCCACCACGAGCCAGCTCCTCTTGGGGCGCATGGACGGGCCTAGG-
AGGCCAGT
GAAATATGAGGAGGATGTGAATCTCGGCTCTGGCACGCGGGCTGTGGTAAGCTGCGCTGAAGCTCCCAACATGA-
AGATCATT
GGTAACCGCATTGAAAGGATCCGCAGTGAGCACGCGGAAACGTGGTTCTTTGACGAAAACCACCCATATAGGAC-
ATGGGCTT
ACCATGGAAGCTATGTGGCCCCCACACAAGGGTCAGCGTCCTCTCTAATAAACGGGGTTGTCAGGCTCCTGTCA-
AAACCCTGG
GATGTGGTGACTGGAGTCACAGGAATAGCCATGACCGACACCACACCGTATGGTCAGCAAAGAGTTTTCAAGGA-
AAAAGTGG
ACACTAGGGTGCCAGACCCCCAAGAAGGCACTCGTCAGGTTATGAGCATGGTCTCTTCCTGGTTGTGGAAAGAG-
CTAGGCAA
ACACAAACGACCACGAGTCTGTACCAAAGAAGAGTTCATCAACAAGGTTCGTAGCAATGCAGCATTAGGGGCAA-
TATTTGAAG
AGGAAAAAGAGTGGAAGACTGCAGTGGAAGCTGTGAACGATCCAAGGTTCTGGGCTCTAGTGGACAAGGAAAGA-
GAGCACC
ACCTGAGAGGAGAGTGCCAGAGTTGTGTGTACAACATGATGGGAAAAAGAGAAAAGAAACAAGGGGAATTTGGA-
AAGGCCA
AGGGCAGCCGCGCCATCTGGTATATGTGGCTAGGGGCTAGATTTCTAGAGTTCGAAGCCCTTGGATTCTTGAAC-
GAGGATCAC
TGGATGGGGAGAGAGAACTCAGGAGGTGGTGTTGAAGGGCTGGGATTACAAAGACTCGGATATGTCCTAGAAGA-
GATGAGT
CGCATACCAGGAGGAAGGATGTATGCAGATGACACTGCTGGCTGGGACACCCGCATCAGCAGGTTTGATCTGGA-
GAATGAAG
CTCTAATCACCAACCAAATGGAGAAAGGGCACAGGGCCTTGGCATTGGCCATAATCAAGTACACATACCAAAAC-
AAAGTGGTA
AAGGTCCTTAGACCAGCTGAAAAAGGGAAAACAGTTATGGACATTATTTCGAGACAAGACCAAAGGGGGAGCGG-
ACAAGTT
GTCACTTACGCTCTTAACACATTTACCAACCTAGTGGTGCAACTCATTCGGAATATGGAGGCTGAGGAAGTTCT-
AGAGATGCAA
GACTTGTGGCTGCTGCGGAGGTCAGAGAAAGTGACCAACTGGTTGCAGAGCAACGGATGGGATAGGCTCAAACG-
AATGGCA
GTCAGTGGAGATGATTGCGTTGTGAAGCCAATTGATGATAGGTTTGCACATGCCCTCAGGTTCTTGAATGATAT-
GGGAAAAGT
TAGGAAGGACACACAAGAGTGGAAACCCTCAACTGGATGGGACAACTGGGAAGAAGTTCCGTTTTGCTCCCACC-
ACTTCAACA
AGCTCCATCTCAAGGACGGGAGGTCCATTGTGGTTCCCTGCCGCCACCAAGATGAACTGATTGGCCGGGCCCGC-
GTCTCTCCA
GGGGCGGGATGGAGCATCCGGGAGACTGCTTGCCTAGCAAAATCATATGCGCAAATGTGGCAGCTCCTTTATTT-
CCACAGAA
GGGACCTCCGACTGATGGCCAATGCCATTTGTTCATCTGTGCCAGTTGACTGGGTTCCAACTGGGAGAACTACC-
TGGTCAATCC
ATGGAAAGGGAGAATGGATGACCACTGAAGACATGCTTGTGGTGTGGAACAGAGTGTGGATTGAGGAGAACGAC-
CACATGG
AAGACAAGACCCCAGTTACGAAATGGACAGACATTCCCTATTTGGGAAAAAGGGAAGACTTGTGGTGTGGATCT-
CTCATAGG
GCACAGACCGCGCACCACCTGGGCTGAGAACATTAAAAATACAGTCAACATGGTGCGCAGGATCATAGGTGATG-
AAGAAAAG
TACATGGACTACCTATCCACCCAAGTTCGCTACTTGGGTGAAGAAGGGTCTACACCTGGAGTGCTGTGAGCACC-
AATCTTAATG
TTGTCAGGCCTGCTAGTCAGCCACAGCTTGGGGAAAGCTGTGCAGCCTGTGACCCCTCCAGGAGAAGCTGGGTA-
ACCAAGCCT
ATAGTCAGGCCGAGAACGCCATGGCACGGAAGAAGCCATGCTGCCTGTGAGCCCCTCAGAGGACACTGAGTCAA-
AAAACCCC
ACGCGCTTGGAGGCGCAGGATGGGAAAAGAAGGTGGCGACCTTCCCCACCCTTCAATCTGGGGCCTGAACTGGA-
GATCAGCT
GTGGATCTCCAGAAGAGGGACTAGTGGTTAGAGGAGACCCCCCGGAAAACGCAAAACAGCATATTGACGCTGGG-
AAAGACC
AGAGACTCCATGAGTTTCCACCACGCTGGCCGCCAGGCACAGATCGCCGAATAGCGGCGGCCGGTGTGGGGAAA-
TCCA KU501215.1 Zika virus strain PRVABC59, Puerto Rico, complete
genome SEQ ID NO: 4
GTTGTTGATCTGTGTGAATCAGACTGCGACAGTTCGAGTTTGAAGCGAAAGCTAGCAACAGTATCAACAGGTTT-
TATTTTGGAT
TTGGAAACGAGAGTTTCTGGTCATGAAAAACCCAAAAAAGAAATCCGGAGGATTCCGGATTGTCAATATGCTAA-
AACGCGGA
GTAGCCCGTGTGAGCCCCTTTGGGGGCTTGAAGAGGCTGCCAGCCGGACTTCTGCTGGGTCATGGGCCCATCAG-
GATGGTCTT
GGCGATTCTAGCCTTTTTGAGATTCACGGCAATCAAGCCATCACTGGGTCTCATCAATAGATGGGGTTCAGTGG-
GGAAAAAAG
AGGCTATGGAAACAATAAAGAAGTTCAAGAAAGATCTGGCTGCCATGCTGAGAATAATCAATGCTAGGAAGGAG-
AAGAAGA
GACGAGGCGCAGATACTAGTGTCGGAATTGTTGGCCTCCTGCTGACCACAGCTATGGCAGCGGAGGTCACTAGA-
CGTGGGAG
TGCATACTATATGTACTTGGACAGAAACGATGCTGGGGAGGCCATATCTTTTCCAACCACATTGGGGATGAATA-
AGTGTTATAT
ACAGATCATGGATCTTGGACACATGTGTGATGCCACCATGAGCTATGAATGCCCTATGCTGGATGAGGGGGTGG-
AACCAGAT
GACGTCGATTGTTGGTGCAACACGACGTCAACTTGGGTTGTGTACGGAACCTGCCATCACAAAAAAGGTGAAGC-
ACGGAGAT
CTAGAAGAGCTGTGACGCTCCCCTCCCATTCCACCAGGAAGCTGCAAACGCGGTCGCAAACCTGGTTGGAATCA-
AGAGAATAC
ACAAAGCACTTGATTAGAGTCGAAAATTGGATATTCAGGAACCCTGGCTTCGCGTTAGCAGCAGCTGCCATCGC-
TTGGCTTTTG
GGAAGCTCAACGAGCCAAAAAGTCATATACTTGGTCATGATACTGCTGATTGCCCCGGCATACAGCATCAGGTG-
CATAGGAGT
CAGCAATAGGGACTTTGTGGAAGGTATGTCAGGTGGGACTTGGGTTGATGTTGTCTTGGAACATGGAGGTTGTG-
TCACCGTA
ATGGCACAGGACAAACCGACTGTCGACATAGAGCTGGTTACAACAACAGTCAGCAACATGGCGGAGGTAAGATC-
CTACTGCT
ATGAGGCATCAATATCAGACATGGCTTCTGACAGCCGCTGCCCAACACAAGGTGAAGCCTACCTTGACAAGCAA-
TCAGACACT
CAATATGTCTGCAAAAGAACGTTAGTGGACAGAGGCTGGGGAAATGGATGTGGACTTTTTGGCAAAGGGAGCCT-
GGTGACAT
GCGCTAAGTTTGCATGCTCCAAGAAAATGACCGGGAAGAGCATCCAGCCAGAGAATCTGGAGTACCGGATAATG-
CTGTCAGT
TCATGGCTCCCAGCACAGTGGGATGATCGTTAATGACACAGGACATGAAACTGATGAGAATAGAGCGAAAGTTG-
AGATAACG
CCCAATTCACCGAGAGCCGAAGCCACCCTGGGGGGTTTTGGAAGCCTAGGACTTGATTGTGAACCGAGGACAGG-
CCTTGACTT
TTCAGATTTGTATTACTTGACTATGAATAACAAGCACTGGTTGGTTCACAAGGAGTGGTTCCACGACATTCCAT-
TACCTTGGCAC
GCTGGGGCAGACACCGGAACTCCACACTGGAACAACAAAGAAGCACTGGTAGAGTTCAAGGACGCACATGCCAA-
AAGGCAA
ACTGTCGTGGTTCTAGGGAGTCAAGAAGGAGCAGTTCACACGGCCCTTGCTGGAGCTCTGGAGGCTGAGATGGA-
TGGTGCAA
AGGGAAGGCTGTCCTCTGGCCACTTGAAATGTCGCCTGAAAATGGATAAACTTAGATTGAAGGGCGTGTCATAC-
TCCTTGTGT
ACTGCAGCGTTCACATTCACCAAGATCCCGGCTGAAACACTGCACGGGACAGTCACAGTGGAGGTACAGTACGC-
AGGGACAG
ATGGACCTTGCAAGGTTCCAGCTCAGATGGCGGTGGACATGCAAACTCTGACCCCAGTTGGGAGGTTGATAACC-
GCTAACCCC
GTAATCACTGAAAGCACTGAGAACTCTAAGATGATGCTGGAACTTGATCCACCATTTGGGGACTCTTACATTGT-
CATAGGAGTC
GGGGAGAAGAAGATCACCCACCACTGGCACAGGAGTGGCAGCACCATTGGAAAAGCATTTGAAGCCACTGTGAG-
AGGTGCC
AAGAGAATGGCAGTCTTGGGAGACACAGCCTGGGACTTTGGATCAGTTGGAGGCGCTCTCAACTCATTGGGCAA-
GGGCATCC
ATCAAATTTTTGGAGCAGCTTTCAAATCATTGTTTGGAGGAATGTCCTGGTTCTCACAAATTCTCATTGGAACG-
TTGCTGATGTG
GTTGGGTCTGAACACAAAGAATGGATCTATTTCCCTTATGTGCTTGGCCTTAGGGGGAGTGTTGATCTTCTTAT-
CCACAGCCGT
CTCTGCTGATGTGGGGTGCTCGGTGGACTTCTCAAAGAAGGAGACGAGATGCGGTACAGGGGTGTTCGTCTATA-
ACGACGTT
GAAGCCTGGAGGGACAGGTACAAGTACCATCCTGACTCCCCCCGTAGATTGGCAGCAGCAGTCAAGCAAGCCTG-
GGAAGATG
GTATCTGCGGGATCTCCTCTGTTTCAAGAATGGAAAACATCATGTGGAGATCAGTAGAAGGGGAGCTCAACGCA-
ATCCTGGAA
GAGAATGGAGTTCAACTGACGGTCGTTGTGGGATCTGTAAAAAACCCCATGTGGAGAGGTCCACAGAGATTGCC-
CGTGCCTG
TGAACGAGCTGCCCCACGGCTGGAAGGCTTGGGGGAAATCGTATTTCGTCAGAGCAGCAAAGACAAATAACAGC-
TTTGTCGT
GGATGGTGACACACTGAAGGAATGCCCACTCAAACATAGAGCATGGAACAGCTTTCTTGTGGAGGATCATGGGT-
TCGGGGTA
TTTCACACTAGTGTCTGGCTCAAGGTTAGAGAAGATTATTCATTAGAGTGTGATCCAGCCGTTATTGGAACAGC-
TGTTAAGGGA
AAGGAGGCTGTACACAGTGATCTAGGCTACTGGATTGAGAGTGAGAAGAATGACACATGGAGGCTGAAGAGGGC-
CCATCTG
ATCGAGATGAAAACATGTGAATGGCCAAAGTCCCACACATTGTGGACAGATGGAATAGAAGAGAGTGATCTGAT-
CATACCCA
AGTCTTTAGCTGGGCCACTCAGCCATCACAATACCAGAGAGGGCTACAGGACCCAAATGAAAGGGCCATGGCAC-
AGTGAAGA
GCTTGAAATTCGGTTTGAGGAATGCCCAGGCACTAAGGTCCACGTGGAGGAAACATGTGGAACAAGAGGACCAT-
CTCTGAGA
TCAACCACTGCAAGCGGAAGGGTGATCGAGGAATGGTGCTGCAGGGAGTGCACAATGCCCCCACTGTCGTTCCG-
GGCTAAAG
ATGGCTGTTGGTATGGAATGGAGATAAGGCCCAGGAAAGAACCAGAAAGCAACTTAGTAAGGTCAATGGTGACT-
GCAGGATC
AACTGATCACATGGACCACTTCTCCCTTGGAGTGCTTGTGATCCTGCTCATGGTGCAGGAAGGGCTGAAGAAGA-
GAATGACCA
CAAAGATCATCATAAGCACATCAATGGCAGTGCTGGTAGCTATGATCCTGGGAGGATTTTCAATGAGTGACCTG-
GCTAAGCTT
GCAATTTTGATGGGTGCCACCTTCGCGGAAATGAACACTGGAGGAGATGTAGCTCATCTGGCGCTGATAGCGGC-
ATTCAAAGT
CAGACCAGCGTTGCTGGTATCTTTCATCTTCAGAGCTAATTGGACACCCCGTGAAAGCATGCTGCTGGCCTTGG-
CCTCGTGTCT
TTTGCAAACTGCGATCTCCGCCTTGGAAGGCGACCTGATGGTTCTCATCAATGGTTTTGCTTTGGCCTGGTTGG-
CAATACGAGC
GATGGTTGTTCCACGCACTGATAACATCACCTTGGCAATCCTGGCTGCTCTGACACCACTGGCCCGGGGCACAC-
TGCTTGTGGC
GTGGAGAGCAGGCCTTGCTACTTGCGGGGGGTTTATGCTCCTCTCTCTGAAGGGAAAAGGCAGTGTGAAGAAGA-
ACTTACCA
TTTGTCATGGCCCTGGGACTAACCGCTGTGAGGCTGGTCGACCCCATCAACGTGGTGGGACTGCTGTTGCTCAC-
AAGGAGTGG
GAAGCGGAGCTGGCCCCCTAGCGAAGTACTCACAGCTGTTGGCCTGATATGCGCATTGGCTGGAGGGTTCGCCA-
AGGCAGAT
ATAGAGATGGCTGGGCCCATGGCCGCGGTCGGTCTGCTAATTGTCAGTTACGTGGTCTCAGGAAAGAGTGTGGA-
CATGTACA
TTGAAAGAGCAGGTGACATCACATGGGAAAAAGATGCGGAAGTCACTGGAAACAGTCCCCGGCTCGATGTGGCG-
CTAGATGA
GAGTGGTGATTTCTCCCTGGTGGAGGATGACGGTCCCCCCATGAGAGAGATCATACTCAAGGTGGTCCTGATGA-
CCATCTGTG
GCATGAACCCAATAGCCATACCCTTTGCAGCTGGAGCGTGGTACGTATACGTGAAGACTGGAAAAAGGAGTGGT-
GCTCTATG
GGATGTGCCTGCTCCCAAGGAAGTAAAAAAGGGGGAGACCACAGATGGAGTGTACAGAGTAATGACTCGTAGAC-
TGCTAGG
TTCAACACAAGTTGGAGTGGGAGTTATGCAAGAGGGGGTCTTTCACACTATGTGGCACGTCACAAAAGGATCCG-
CGCTGAGA
AGCGGTGAAGGGAGACTTGATCCATACTGGGGAGATGTCAAGCAGGATCTGGTGTCATACTGTGGTCCATGGAA-
GCTAGATG
CCGCCTGGGATGGGCACAGCGAGGTGCAGCTCTTGGCCGTGCCCCCCGGAGAGAGAGCGAGGAACATCCAGACT-
CTGCCCG
GAATATTTAAGACAAAGGATGGGGACATTGGAGCGGTTGCGCTGGATTACCCAGCAGGAACTTCAGGATCTCCA-
ATCCTAGA
CAAGTGTGGGAGAGTGATAGGACTTTATGGCAATGGGGTCGTGATCAAAAACGGGAGTTATGTTAGTGCCATCA-
CCCAAGGG
AGGAGGGAGGAAGAGACTCCTGTTGAGTGCTTCGAGCCCTCGATGCTGAAGAAGAAGCAGCTAACTGTCTTAGA-
CTTGCATC
CTGGAGCTGGGAAAACCAGGAGAGTTCTTCCTGAAATAGTCCGTGAAGCCATAAAAACAAGACTCCGTACTGTG-
ATCTTAGCT
CCAACCAGGGTTGTCGCTGCTGAAATGGAGGAGGCCCTTAGAGGGCTTCCAGTGCGTTATATGACAACAGCAGT-
CAATGTCAC
CCACTCTGGAACAGAAATCGTCGACTTAATGTGCCATGCCACCTTCACTTCACGTCTACTACAGCCAATCAGAG-
TCCCCAACTAT
AATCTGTATATTATGGATGAGGCCCACTTCACAGATCCCTCAAGTATAGCAGCAAGAGGATACATTTCAACAAG-
GGTTGAGAT
GGGCGAGGCGGCTGCCATCTTCATGACCGCCACGCCACCAGGAACCCGTGACGCATTTCCGGACTCCAACTCAC-
CAATTATGG
ACACCGAAGTGGAAGTCCCAGAGAGAGCCTGGAGCTCAGGCTTTGATTGGGTGACGGATCATTCTGGAAAAACA-
GTTTGGTT
TGTTCCAAGCGTGAGGAACGGCAATGAGATCGCAGCTTGTCTGACAAAGGCTGGAAAACGGGTCATACAGCTCA-
GCAGAAAG
ACTTTTGAGACAGAGTTCCAGAAAACAAAACATCAAGAGTGGGACTTTGTCGTGACAACTGACATTTCAGAGAT-
GGGCGCCAA
CTTTAAAGCTGACCGTGTCATAGATTCCAGGAGATGCCTAAAGCCGGTCATACTTGATGGCGAGAGAGTCATTC-
TGGCTGGAC
CCATGCCTGTCACACATGCCAGCGCTGCCCAGAGGAGGGGGCGCATAGGCAGGAATCCCAACAAACCTGGAGAT-
GAGTATCT
GTATGGAGGTGGGTGCGCAGAGACTGACGAAGACCATGCACACTGGCTTGAAGCAAGAATGCTCCTTGACAATA-
TTTACCTCC
AAGATGGCCTCATAGCCTCGCTCTATCGACCTGAGGCCGACAAAGTAGCAGCCATTGAGGGAGAGTTCAAGCTT-
AGGACGGA
GCAAAGGAAGACCTTTGTGGAACTCATGAAAAGAGGAGATCTTCCTGTTTGGCTGGCCTATCAGGTTGCATCTG-
CCGGAATAA
CCTACACAGATAGAAGATGGTGCTTTGATGGCACGACCAACAACACCATAATGGAAGACAGTGTGCCGGCAGAG-
GTGTGGAC
CAGACACGGAGAGAAAAGAGTGCTCAAACCGAGGTGGATGGACGCCAGAGTTTGTTCAGATCATGCGGCCCTGA-
AGTCATTC
AAGGAGTTTGCCGCTGGGAAAAGAGGAGCGGCTTTTGGAGTGATGGAAGCCCTGGGAACACTGCCAGGACACAT-
GACAGAG
AGATTCCAGGAAGCCATTGACAACCTCGCTGTGCTCATGCGGGCAGAGACTGGAAGCAGGCCTTACAAAGCCGC-
GGCGGCCC
AATTGCCGGAGACCCTAGAGACCATAATGCTTTTGGGGTTGCTGGGAACAGTCTCGCTGGGAATCTTCTTCGTC-
TTGATGAGG
AACAAGGGCATAGGGAAGATGGGCTTTGGAATGGTGACTCTTGGGGCCAGCGCATGGCTCATGTGGCTCTCGGA-
AATTGAGC
CAGCCAGAATTGCATGTGTCCTCATTGTTGTGTTCCTATTGCTGGTGGTGCTCATACCTGAGCCAGAAAAGCAA-
AGATCTCCCC
AGGACAACCAAATGGCAATCATCATCATGGTAGCAGTAGGTCTTCTGGGCTTGATTACCGCCAATGAACTCGGA-
TGGTTGGAG
AGAACAAAGAGTGACCTAAGCCATCTAATGGGAAGGAGAGAGGAGGGGGCAACCATAGGATTCTCAATGGACAT-
TGACCTG
CGGCCAGCCTCAGCTTGGGCCATCTATGCTGCCTTGACAACTTTCATTACCCCAGCCGTCCAACATGCAGTGAC-
CACCTCATACA
ACAACTACTCCTTAATGGCGATGGCCACGCAAGCTGGAGTGTTGTTTGGCATGGGCAAAGGGATGCCATTCTAC-
GCATGGGAC
TTTGGAGTCCCGCTGCTAATGATAGGTTGCTACTCACAATTAACACCCCTGACCCTAATAGTGGCCATCATTTT-
GCTCGTGGCGC
ACTACATGTACTTGATCCCAGGGCTGCAGGCAGCAGCTGCGCGTGCTGCCCAGAAGAGAACGGCAGCTGGCATC-
ATGAAGAA
CCCTGTTGTGGATGGAATAGTGGTGACTGACATTGACACAATGACAATTGACCCCCAAGTGGAGAAAAAGATGG-
GACAGGTG
CTACTCATAGCAGTAGCCGTCTCCAGCGCCATACTGTCGCGGACCGCCTGGGGGTGGGGGGAGGCTGGGGCTCT-
GATCACAG
CCGCAACTTCCACTTTGTGGGAAGGCTCTCCGAACAAGTACTGGAACTCCTCTACAGCCACTTCACTGTGTAAC-
ATTTTTAGGG
GAAGTTACTTGGCTGGAGCTTCTCTAATCTACACAGTAACAAGAAACGCTGGCTTGGTCAAGAGACGTGGGGGT-
GGAACAGG
AGAGACCCTGGGAGAGAAATGGAAGGCCCGCTTGAACCAGATGTCGGCCCTGGAGTTCTACTCCTACAAAAAGT-
CAGGCATC
ACCGAGGTGTGCAGAGAAGAGGCCCGCCGCGCCCTCAAGGACGGTGTGGCAACGGGAGGCCATGCTGTGTCCCG-
AGGAAGT
GCAAAGCTGAGATGGTTGGTGGAGCGGGGATACCTGCAGCCCTATGGAAAGGTCATTGATCTTGGATGTGGCAG-
AGGGGGC
TGGAGTTACTACGTCGCCACCATCCGCAAAGTTCAAGAAGTGAAAGGATACACAAAAGGAGGCCCTGGTCATGA-
AGAACCCG
TGTTGGTGCAAAGCTATGGGTGGAACATAGTCCGTCTTAAGAGTGGGGTGGACGTCTTTCATATGGCGGCTGAG-
CCGTGTGA
CACGTTGCTGTGTGACATAGGTGAGTCATCATCTAGTCCTGAAGTGGAAGAAGCACGGACGCTCAGAGTCCTCT-
CCATGGTGG
GGGATTGGCTTGAAAAAAGACCAGGAGCCTTTTGTATAAAAGTGTTGTGCCCATACACCAGCACTATGATGGAA-
ACCCTGGAG
CGACTGCAGCGTAGGTATGGGGGAGGACTGGTCAGAGTGCCACTCTCCCGCAACTCTACACATGAGATGTACTG-
GGTCTCTG
GAGCGAAAAGCAACACCATAAAAAGTGTGTCCACCACGAGCCAGCTCCTCTTGGGGCGCATGGACGGGCCTAGG-
AGGCCAGT
GAAATATGAGGAGGATGTGAATCTCGGCTCTGGCACGCGGGCTGTGGTAAGCTGCGCTGAAGCTCCCAACATGA-
AGATCATT
GGTAACCGCATTGAAAGGATCCGCAGTGAGCACGCGGAAACGTGGTTCTTTGACGAGAACCACCCATATAGGAC-
ATGGGCTT
ACCATGGAAGCTATGAGGCCCCCACACAAGGGTCAGCGTCCTCTCTAATAAACGGGGTTGTCAGGCTCCTGTCA-
AAACCCTGG
GATGTGGTGACTGGAGTCACAGGAATAGCCATGACCGACACCACACCGTATGGTCAGCAAAGAGTTTTCAAGGA-
AAAAGTGG
ACACTAGGGTGCCAGACCCCCAAGAAGGCACTCGTCAGGTTATGAGCATGGTCTCTTCCTGGTTGTGGAAAGAG-
CTAGGCAA
ACACAAACGGCCACGAGTCTGCACCAAAGAAGAGTTCATCAACAAGGTTCGTAGCAATGCAGCATTAGGGGCAA-
TATTTGAA
GAGGAAAAAGAGTGGAAGACTGCAGTGGAAGCTGTGAACGATCCAAGGTTCTGGGCTCTAGTGGACAAGGAAAG-
AGAGCAC
CACCTGAGAGGAGAGTGCCAGAGCTGTGTGTACAACATGATGGGAAAAAGAGAAAAGAAACAAGGGGAATTTGG-
AAAGGCC
AAGGGCAGCCGCGCCATCTGGTATATGTGGCTAGGGGCTAGATTTCTAGAGTTCGAAGCCCTTGGATTCTTGAA-
CGAGGATCA
CTGGATGGGGAGAGAGAACTCAGGAGGTGGTGTTGAAGGGCTGGGATTACAAAGACTCGGATATGTCCTAGAAG-
AGATGAG
TCGTATACCAGGAGGAAGGATGTATGCAGATGACACTGCTGGCTGGGACACCCGCATTAGCAGGTTTGATCTGG-
AGAATGAA
GCTCTAATCACCAACCAAATGGAGAAAGGGCACAGGGCCTTGGCATTGGCCATAATCAAGTACACATACCAAAA-
CAAAGTGGT
AAAGGTCCTTAGACCAGCTGAAAAAGGGAAAACAGTTATGGACATTATTTCGAGACAAGACCAAAGGGGGAGCG-
GACAAGTT
GTCACTTACGCTCTTAACACATTTACCAACCTAGTGGTGCAACTCATTCGGAATATGGAGGCTGAGGAAGTTCT-
AGAGATGCAA
GACTTGTGGCTGCTGCGGAGGTCAGAGAAAGTGACCAACTGGTTGCAGAGCAACGGATGGGATAGGCTCAAACG-
AATGGCA
GTCAGTGGAGATGATTGCGTTGTGAAGCCAATTGATGATAGGTTTGCACATGCCCTCAGGTTCTTGAATGATAT-
GGGAAAAGT
TAGGAAGGACACACAAGAGTGGAAACCCTCAACTGGATGGGACAACTGGGAAGAAGTTCCGTTTTGCTCCCACC-
ACTTCAACA
AGCTCCATCTCAAGGACGGGAGGTCCATTGTGGTTCCCTGCCGCCACCAAGATGAACTGATTGGCCGGGCCCGC-
GTCTCTCCA
GGGGCGGGATGGAGCATCCGGGAGACTGCTTGCCTAGCAAAATCATATGCGCAAATGTGGCAGCTCCTTTATTT-
CCACAGAA
GGGACCTCCGACTGATGGCCAATGCCATTTGTTCATCTGTGCCAGTTGACTGGGTTCCAACTGGGAGAACTACC-
TGGTCAATCC
ATGGAAAGGGAGAATGGATGACCACTGAAGACATGCTTGTGGTGTGGAACAGAGTGTGGATTGAGGAGAACGAC-
CACATGG
AAGACAAGACCCCAGTTACGAAATGGACAGACATTCCCTATTTGGGAAAAAGGGAAGACTTGTGGTGTGGATCT-
CTCATAGG
GCACAGACCGCGCACCACCTGGGCTGAGAACATTAAAAACACAGTCAACATGGTGCGCAGGATCATAGGTGATG-
AAGAAAAG
TACATGGACTACCTATCCACCCAAGTTCGCTACTTGGGTGAAGAAGGGTCTACACCTGGAGTGCTGTAAGCACC-
AATCTTAATG
TTGTCAGGCCTGCTAGTCAGCCACAGCTTGGGGAAAGCTGTGCAGCCTGTGACCCCCCCAGGAGAAGCTGGGAA-
ACCAAGCC
TATAGTCAGGCCGAGAACGCCATGGCACGGAAGAAGCCATGCTGCCTGTGAGCCCCTCAGAGGACACTGAGTCA-
AAAAACCC
CACGCGCTTGGAGGCGCAGGATGGGAAAAGAAGGTGGCGACCTTCCCCACCCTTCAATCTGGGGCCTGAACTGG-
AGATCAGC TGTGGATCTCCAGAAGAGGGACTAGTGGTTAGAGGA KU509998.1 Zika virus
strain Haiti/1225/2014, Haiti, complete genome SEQ ID NO: 5
GTTGTTACTGTTGCTGACTCAGACTGCGACAGTTCGAGTTTGAAGCGAAAGCTAGCAACAGTATCAACAGGTTT-
TATTTGGATT
TGGAAACGAGAGTTTCTGGTCATGAAAAACCCAAAAAAGAAATCCGGAGGATTCCGGATTGTCAATATGCTAAA-
ACGCGGAG
TAGCCCGTGTGAGCCCCTTTGGGGGCTTGAAGAGGCTGCCAGCCGGACTTCTGCTGGGTCATGGGCCCATCAGG-
ATGGTCTTG
GCAATTCTAGCCTTTTTGAGATTCACGGCAATCAAGCCATCACTGGGTCTCATCAATAGATGGGGTTCAGTGGG-
GAAAAAAGA
GGCTATGGAAATAATAAAGAAGTTCAAGAAAGATCTGGCTGCCATGCTGAGAATAATCAATGCTAGGAAGGAGA-
AGAAGAG
ACGAGGCGCAGATACTAGTGTCGGAATTGTTGGCCTCCTGCTGACCACAGCTATGGCAGCGGAGGTCACTAGAC-
GTGGGAGT
GCATACTATATGTACTTGGACAGAAACGATGCTGGGGAGGCCATATCTTTTCCAACCACATTGGGGATGAATAA-
GTGTTATAT
ACAGATCATGGATCTTGGACACATGTGTGATGCCACCATGAGCTATGAATGCCCTATGCTGGATGAGGGGGTGG-
AACCAGAT
GACGTCGATTGTTGGTGCAACACGACGTCAACTTGGGTTGTGTACGGAACCTGCCATCACAAAAAAGGTGAAGC-
ACGGAGAT
CTAGAAGAGCTGTGACGCTCCCCTCCCATTCCACTAGGAAGCTGCAAACGCGGTCGCAAACCTGGTTGGAATCA-
AGAGAATAC
ACAAAGCACTTGATTAGAGTCGAAAATTGGATATTCAGGAACCCTGGCTTCGCGTTAGCAGCAGCTGCCATCGC-
TTGGCTTTTG
GGAAGCTCAACGAGCCAAAAAGTCATATACTTGGTCATGATACTGCTGATTGCCCCGGCATACAGCATCAGGTG-
CATAGGAGT
CAGCAATAGGGACTTTGTGGAAGGTATGTCAGGTGGGACTTGGGTTGATGTTGTCTTGGAACATGGAGGTTGTG-
TCACCGTA
ATGGCACAGGACAAACCGACTGTCGACATAGAGCTGGTTACAACAACAGTCAGCAACATGGCGGAGGTAAGATC-
CTACTGCT
ATGAGGCATCAATATCAGACATGGCTTCGGACAGCCGCTGCCCAACACAAGGTGAAGCCTACCTTGACAAGCAA-
TCAGACACT
CAATATGTCTGCAAAAGAACGTTAGTGGACAGAGGCTGGGGAAATGGATGTGGACTTTTTGGCAAAGGGAGTCT-
GGTGACAT
GCGCTAAGTTTGCATGCTCCAAGAAAATGACCGGGAAGAGCATCCAGCCAGAGAATCTGGAGTACCGGATAATG-
CTGTCAGT
TCATGGCTCCCAGCACAGTGGGATGATCGTTAATGACACAGGACATGAAACTGATGAGAATAGAGCGAAGGTTG-
AGATAACG
CCCAATTCACCAAGAGCCGAAGCCACCCTGGGGGGTTTTGGAAGCCTAGGACTTGATTGTGAACCGAGGACAGG-
CCTTGACTT
TTCAGATTTGTATTACTTGACTATGAATAACAAGCACTGGTTGGTTCACAAGGAGTGGTTCCACGACATTCCAT-
TACCTTGGCAC
GCTGGGGCAGACACCGGAACTCCACACTGGAACAACAAAGAAGCACTGGTAGAGTTCAAGGACGCACATGCCAA-
AAGGCAA
ACTGTCGTGGTTCTAGGGAGTCAAGAAGGAGCAGTTCACACGGCCCTTGCTGGAGCTCTGGAGGCTGAGATGGA-
TGGTGCAA
AGGGAAGGCTGTCCTCTGGCCACTTGAAATGTCGCCTGAAAATGGATAAACTTAGATTGAAGGGCGTGTCATAC-
TCCTTGTGT
ACCGCAGCGTTCACATTCACCAAGATCCCGGCTGAAACACTGCACGGGACAGTCACAGTGGAGGTACAGTACGC-
AGGGACAG
ATGGACCTTGCAAGGTTCCAGCTCAGATGGCGGTGGACATGCAAACTCTGACCCCAGTTGGGAGGTTGATAACC-
GCTAACCCC
GTAATCACTGAAAGCACTGAGAACTCTAAGATGATGCTGGAACTTGATCCACCATTTGGGGACTCTTACATTGT-
CATAGGAGTC
GGGGAGAAGAAGATCACCCACCACTGGCACAGGAGTGGCAGCACCATTGGAAAAGCATTTGAAGCCACTGTGAG-
AGGTGCC
AAGAGAATGGCAGTCTTGGGAGACACAGCCTGGGACTTTGGATCAGTTGGAGGCGCTCTCAACTCATTGGGCAA-
GGGCATCC
ATCAAATTTTTGGAGCAGCTTTCAAATCATTGTTTGGAGGAATGTCCTGGTTCTCACAAATTCTCATTGGAACG-
TTGCTGATGTG
GTTGGGTCTGAACACAAAGAATGGATCTATTTCCCTTATGTGCTTGGCCTTAGGGGGAGTGTTGATCTTCTTAT-
CCACAGCCGT
CTCTGCTGATGTGGGGTGCTCGGTGGACTTCTCAAAGAAGGAGACGAGATGCGGTACAGGGGTGTTCGTCTATA-
ACGACGTT
GAAGCCTGGAGGGACAGGTACAAGTACCATCCTGACTCCCCCCGTAGATTGGCAGCAGCAGTCAAGCAAGCCTG-
GGAAGATG
GTATCTGCGGGATCTCCTCTGTTTCAAGAATGGAAAACATCATGTGGAGATCAGTAGAAGGGGAGCTCAACGCA-
ATCCTGGAA
GAGAATGGAGTTCAACTGACGGTCGTTGTGGGATCTGTAAAAAACCCCATGTGGAGAGGTCCACAGAGATTGCC-
CGTGCCTG
TGAACGAGCTGCCCCACGGCTGGAAGGCTTGGGGGAAATCGCACTTCGTCAGAGCAGCAAAGACAAATAACAGC-
TTTGTCGT
GGATGGTGACACACTGAAGGAATGCCCACTCAAACATAGAGCATGGAACAGCTTTCTTGTGGAGGATCATGGGT-
TCGGGGTA
TTTCACACTAGTGTCTGGCTCAAGGTTAGAGAAGATTATTCATTAGAGTGTGATCCAGCCGTTATTGGAACAGC-
TGTTAAGGGA
AAGGAGGCTGTACACAGTGATCTAGGCTACTGGATTGAGAGTGAGAAGAATGACACATGGAGGCTGAAGAGGGC-
CCATCTG
ATCGAGATGAAAACATGTGAATGGCCAAAGTCCCACACATTGTGGACAGATGGAATAGAAGAGAGTGATCTGAT-
CATACCCA
AGTCTTTAGCTGGGCCACTCAGCCATCACAATACCAGAGAGGGCTACAGGACCCAAATGAAAGGGCCATGGCAC-
AGTGAAGA
GCTTGAAATTCGGTTTGAGGAATGCCCAGGCACTAAGGTCCACGTGGAGGAAACATGTGGAACAAGAGGACCAT-
CTCTGAGA
TCAACCACTGCAAGCGGAAGGGTGATCGAGGAATGGTGCTGCAGGGAGTGCACAATGCCCCCACTGTCGTTCCG-
GGCTAAAG
ATGGCTGTTGGTATGGAATGGAGATAAGGCCCAGGAAAGAACCAGAAAGCAACTTAGTAAGGTCAATGGTGACT-
GCAGGATC
AACTGATCACATGGATCACTTCTCCCTTGGAGTGCTTGTGATTCTGCTCATGGTGCAGGAAGGGCTGAAGAAGA-
GAATGACCA
CAAAGATCATCATAAGCACATCAATGGCAGTGCTGGTAGCTATGATCCTGGGAGGATTTTCAATGAGTGACCTG-
GCTAAGCTT
GCAATTTTGATGGGTGCCACCTTCGCGGAAATGAACACTGGAGGAGATGTAGCTCATCTGGCGCTGATAGCGGC-
ATTCAAAGT
CAGACCAGCGTTGCTGGTATCTTTCATCTTCAGAGCTAATTGGACACCCCGTGAAAGCATGCTGCTGGCCTTGG-
CCTCGTGTCT
TTTGCAAACTGCGATCTCCGCCTTGGAAGGCGACCTGATGGTTCTCATCAATGGTTTTGCTTTGGCCTGGTTGG-
CAATACGAGC
GATGGTTGTTCCACGCACTGATAACATCACCTTGGCAATCCTGGCTGCTCTGACACCACTGGCCCGGGGCACAC-
TGCTTGTGGC
GTGGAGAGCAGGCCTTGCTACTTGCGGGGGGTTTATGCTCCTCTCTCTGAAGGGAAAAGGCAGTGTGAAGAAGA-
ACTTACCA
TTTGTCATGGCCCTGGGACTAACCGCTGTGAGGCTGGTCGACCCCATCAACGTGGTGGGGCTGCTGTTGCTCAC-
AAGGAGTGG
GAAGCGGAGCTGGCCCCCTAGCGAAGTACTCACAGCTGTTGGCCTGATATGCGCATTGGCTGGAGGGTTCGCCA-
AGGCAGAT
ATAGAGATGGCTGGGCCCATGGCCGCGGTCGGTCTGCTAATTGTCAGTTACGTGGTCTCAGGAAAGAGTGTGGA-
CATGTACA
TTGAAAGAGCAGGTGACATCACATGGGAAAAAGATGCGGAAGTCACTGGAAACAGTCCCCGGCTCGATGTGGCG-
CTAGATGA
GAGTGGTGATTTCTCCCTGGTGGAGGATGACGGTCCCCCCATGAGAGAGATCATACTCAAGGTGGTCCTGATGA-
CCATCTGTG
GCATGAACCCAATAGCCATACCCTTTGCAGCTGGAGCGTGGTACGTATACGTGAAGACTGGAAAAAGGAGTGGT-
GCTCTATG
GGATGTGCCTGCTCCCAAGGAAGTAAAAAAGGGGGAGACCACAGATGGAGTGTACAGAGTAATGACTCGTAGAC-
TGCTAGG
TTCAACACAAGTTGGAGTGGGAGTTATGCAAGAGGGGGTCTTTCACACTATGTGGCACGTCACAAAAGGATCCG-
CGCTGAGA
AGCGGTGAAGGGAGACTTGATCCATACTGGGGAGATGTCAAGCAGGATCTGGTGTCATACTGTGGTCCATGGAA-
GCTAGATG
CCGCCTGGGACGGGCACAGCGAGGTGCAGCTCTTGGCCGTGCCCCCCGGAGAGAGAGCGAGGAACATCCAGACT-
CTGCCCG
GAATATTTAAGACAAAGGATGGGGACATTGGAGCGGTTGCGCTGGATTACCCAGCAGGAACTTCAGGATCTCCA-
ATCCTAGA
CAAGTGTGGGAGAGTGATAGGACTTTATGGCAATGGGGTCGTGATCAAAAATGGGAGTTATGTTAGTGCCATCA-
CCCAAGGG
AGGAGGGAGGAAGAGACTCCTGTTGAGTGCTTCGAGCCTTCGATGCTGAAGAAGAAGCAGCTAACTGTCTTAGA-
CTTGCATC
CTGGAGCTGGGAAAACCAGGAGAGTTCTTCCTGAAATAGTCCGTGAAGCCATAAAAACAAGACTCCGTACTGTG-
ATCTTAGCT
CCAACCAGGGTTGTCGCTGCTGAAATGGAGGAAGCCCTTAGAGGGCTTCCAGTGCGTTATATGACAACAGCAGT-
CAATGTCAC
CCACTCTGGAACAGAAATCGTCGACTTAATGTGCCATGCCACCTTCACTTCACGTCTACTACAGCCAATCAGAG-
TCCCCAACTAT
AATCTGTATATTATGGATGAGGCCCACTTCACAGATCCCTCAAGTATAGCAGCAAGAGGATACATTTCAACAAG-
GGTTGAGAT
GGGCGAGGCGGCTGCCATCTTCATGACCGCCACGCCACCAGGAACCCGTGACGCATTTCCGGACTCCAACTCAC-
CAATTATGG
ACACCGAAGTGGAAGTCCCAGAGAGAGCCTGGAGCTCAGGCTTTGATTGGGTGACGGATTATTCTGGAAAAACA-
GTTTGGTT
TGTTCCAAGCGTGAGGAACGGCAATGAGATCGCAGCTTGTCTGACAAAGGCTGGAAAACGGGTCATACAGCTCA-
GCAGAAAG
ACTTTTGAGACAGAGTTCCAGAAAACAAAACATCAAGAGTGGGACTTTGTCGTGACAACTGACATTTCAGAGAT-
GGGCGCCAA
CTTTAAAGCTGACCGTGTCATAGATTCCAGGAGATGCCTAAAGCCGGTCATACTTGATGGCGAGAGAGTCATTC-
TGGCTGGAC
CCATGCCTGTCACACATGCCAGCGCTGCCCAGAGGAGGGGGCGCATAGGCAGGAATCCCAACAAACCTGGAGAT-
GAGTATCT
GTATGGAGGTGGGTGCGCAGAGACTGACGAAGACCATGCACACTGGCTTGAAGCAAGAATGCTCCTTGACAATA-
TTTACCTCC
AAGATGGCCTCATAGCCTCGCTCTATCGACCTGAGGCCGACAAAGTAGCAGCCATTGAGGGAGAGTTCAAGCTT-
AGGACGGA
GCAAAGGAAGACCTTTGTGGAACTCATGAAAAGAGGAGATCTTCCTGTTTGGCTGGCCTATCAGGTTGCATCTG-
CCGGAATAA
CCTACACAGATAGAAGATGGTGCTTTGATGGCACGACCAACAACACCATAATGGAAGACAGTGTGCCGGCAGAG-
GTGTGGAC
CAGACACGGAGAGAAAAGAGTGCTCAAACCGAGGTGGATGGACGCCAGAGTTTGTTCAGATCATGCGGCCCTGA-
AGTCATTC
AAGGAGTTTGCCGCTGGGAAAAGAGGAGCGGCTTTTGGAGTGATGGAAGCCCTGGGAACACTGCCAGGACACAT-
GACAGAG
AGATTCCAGGAAGCCATTGACAACCTCGCTGTGCTCATGCGGGCAGAGACTGGAAGCAGGCCTTACAAAGCCGC-
GGCGGCCC
AATTGCCGGAGACCCTAGAGACCATTATGCTTTTGGGGTTGCTGGGAACAGTCTCGCTGGGAATCTTTTTCGTC-
TTGATGAGG
AACAAGGGCATAGGGAAGATGGGCTTTGGAATGGTGACTCTTGGGGCCAGCGCATGGCTCATGTGGCTCTCGGA-
AATTGAGC
CAGCCAGAATTGCATGTGTCCTCATTGTTGTGTTCCTATTGCTGGTGGTGCTCATACCTGAGCCAGAAAAGCAA-
AGATCTCCCC
AGGACAACCAAATGGCAATCATCATCATGGTAGCAGTAGGTCTTCTGGGCTTGATTACCGCCAATGAACTCGGA-
TGGTTGGAG
AGAACAAAGAGTGACCTAAGCCATCTAATGGGAAGGAGAGAGGAGGGGGCAACCATGGGATTCTCAATGGACAT-
TGACCTG
CGGCCAGCCTCAGCTTGGGCCATCTATGCTGCCTTGACAACTTTCATTACCCCAGCCGTCCAACATGCAGTGAC-
CACTTCATACA
ACAACTACTCCTTAATGGCGATGGCCACGCAAGCTGGAGTGTTGTTTGGTATGGGCAAAGGGATGCCATTCTAC-
GCATGGGAC
TTTGGAGTCCCGCTGCTAATGATAGGTTGCTACTCACAATTAACGCCCCTGACCCTAATAGTGGCCATCATTTT-
GCTCGTGGCG
CACTACATGTACTTGATCCCAGGGCTGCAGGCAGCAGCTGCGCGTGCTGCCCAGAAGAGAACGGCAGCTGGCAT-
CATGAAGA
ACCCTGTTGTGGATGGAATAGTGGTGACTGACATTGACACAATGACAATTGACCCCCAAGTGGAGAAAAAGATG-
GGACAGGT
GCTACTCATGGCAGTAGCCGTCTCCAGCGCCATACTGTCGCGGACCGCCTGGGGGTGGGGGGAGGCTGGGGCCC-
TGATCACA
GCCGCAACTTCCACTTTGTGGGAAGGCTCTCCGAACAAGTACTGGAACTCCTCTACAGCCACTTCACTGTGTAA-
CATTTTTAGG
GGAAGTTACTTGGCTGGAGCTTCTCTAATCTACACAGTAACAAGAAACGCTGGCTTGGTCAAGAGACGTGGGGG-
TGGAACAG
GAGAGACCCTGGGAGAGAAATGGAAGGCCCGCTTGAACCAGATGTCGGCCCTGGAGTTCTACTCCTACAAAAAG-
TCAGGCAT
CACCGAGGTGTGCAGAGAAGAGGCCCGCCGCGCCCTCAAGGACGGTGTGGCAACGGGAGGCCATGCTGTGTCCC-
GAGGAAG
TGCAAAGCTGAGATGGTTGGTGGAGCGGGGATACCTGCAGCCCTATGGAAAGGTCATTGATCTTGGATGTGGCA-
GAGGGGG
CTGGAGTTACTACGCCGCCACCATCCGCAAAGTTCAAGAAGTGAAAGGATACACAAAAGGAGGCCCTGGTCATG-
AAGAACCC
GTGTTGGTGCAAAGCTATGGGTGGAACATAGTCCGTCTTAAGAGTGGGGTGGACGTCTTTCATATGGCGGCTGA-
GCCGTGTG
ACACGTTGCTGTGTGACATAGGTGAGTCATCATCTAGTCCTGAAGTGGAAGAAGCACGGACGCTCAGAGTCCTC-
TCCATGGTG
GGGGATTGGCTTGAAAAAAGACCAGGAGCCTTTTGTATAAAAGTGTTGTGCCCATACACCAGCACTATGATGGA-
AACCCTGGA
GCGACTGCAGCGTAGGTATGGGGGAGGACTGGTCAGAGTGCCACTCTCCCGCAACTCTACACATGAGATGTACT-
GGGTCTCT
GGAGCGAAAAGCAACACCATAAAAAGTGTGTCCACCACGAGCCAGCTCCTCTTGGGGCGCATGGACGGGCCTAG-
GAGGCCA
GTGAAATATGAGGAGGATGTGAATCTCGGCTCTGGCACGCGGGCTGTGGTAAGCTGCGCTGAAGCTCCCAACAT-
GAAGATCA
TTGGTAACCGCATTGAAAGGATCCGCAGTGAGCACGCGGAAACGTGGTTCTTTGACGAGAACCACCCATATAGG-
ACATGGGC
TTACCATGGAAGCTATGAGGCCCCCACACAAGGGTCAGCGTCCTCTCTAATAAACGGGGTTGTCAGGCTCCTGT-
CAAAACCCT
GGGATGTGGTGACTGGAGTCACAGGAATAGCCATGACCGACACCACACCGTATGGTCAGCAAAGAGTTTTCAAG-
GAAAAAGT
GGACACTAGGGTGCCAGACCCCCAAGAAGGCACTCGTCAGGTTATGAGCATGGTCTCTTCCTGGTTGTGGAAAG-
AGCTAGGC
AAACACAAACGGCCACGAGTCTGTACCAAAGAAGAGTTCATCAACAAGGTTCGTAGCAATGCAGCATTAGGGGC-
AATATTTGA
AGAGGAAAAAGAGTGGAAGACTGCAGTGGAAGCTGTGAACGATCCAAGGTTCTGGGCTCTAGTGGACAAGGAAA-
GAGAGCA
CCACCTGAGAGGAGAGTGCCAGAGTTGTGTGTACAACATGATGGGAAAAAGAGAAAAGAAACAAGGGGAATTTG-
GAAAGGC
CAAGGGCAGCCGCGCCATCTGGTATATGTGGCTAGGGGCTAGATTTCTAGAGTTCGAAGCCCTTGGATTCTTGA-
ACGAGGATC
ACTGGATGGGGAGAGAGAACTCAGGAGGTGGTGTTGAAGGGCTGGGATTACAAAGACTCGGATATGTCCTAGAA-
GAGATGA
GTCGCATACCAGGAGGAAGGATGTATGCAGATGACACTGCTGGCTGGGACACCCGCATCAGCAGGTTTGATCTG-
GAGAATGA
AGCTCTAATCACCAACCAAATGGAGAAAGGGCACAGGGCCTTGGCATTGGCCATAATCAAGTACACATACCAAA-
ACAAAGTG
GTAAAGGTCCTTAGACCAGCTGAAAAAGGGAAGACAGTTATGGACATTATTTCGAGACAAGACCAAAGGGGGAG-
CGGACAA
GTTGTCACTTACGCTCTTAACACATTTACCAACCTAGTGGTGCAACTCATTCGGAATATGGAGGCTGAGGAAGT-
TCTAGAGATG
CAAGACTTGTGGCTGCTGCGGAGGTCAGAGAAAGTGACCAACTGGTTGCAGAGCAACGGATGGGATAGGCTCAA-
ACGAATG
GCAGTCAGTGGAGATGATTGCGTTGTGAAGCCAATTGATGATAGGTTTGCACATGCCCTCAGGTTCTTGAATGA-
TATGGGAAA
AGTTAGGAAGGACACACAAGAGTGGAAACCCTCAACTGGATGGGACAACTGGGAAGAAGTTCCGTTTTGCTCCC-
ACCACTTCA
ACAAGCTCCATCTCAAGGACGGGAGGTCCATTGTGGTTCCCTGCCGCCACCAAGATGAACTGATTGGCCGGGCC-
CGCGTCTCT
CCAGGGGCGGGATGGAGCATCCGGGAGACTGCTTGCCTAGCAAAATCATATGCGCAAATGTGGCAGCTCCTTTA-
TTTCCACAG
AAGGGACCTCCGACTGATGGCCAATGCCATTTGTTCATCTGTGCCAGTTGACTGGGTTCCAACTGGGAGAACTA-
CCTGGTCAAT
CCATGGAAAGGGAGAATGGATGACCACTGAAGACATGCTTGTGGTGTGGAACAGAGTGTGGATTGAGGAGAACG-
ACCACAT
GGAAGACAAGACCCCAGTTACGAAATGGACAGACATTCCCTATTTGGGAAAAAGGGAAGACTTGTGGTGTGGAT-
CTCTCATA
GGGCACAGACCGCGCACCACCTGGGCTGAGAACATTAAAAACACAGTCAACATGGTGCGCAGGATCATAGGTGA-
TGAAGAAA
AGTACATGGACTACCTATCCACCCAAGTTCGCTACTTGGGTGAAGAAGGGTCTACACCTGGAGTGCTGTAAGCA-
CCAATCTTA
ATGTTGTCAGGCCTGCTAGTCAGCCACAGCTTGGGGAAAGCTGTGCAGCCTGTGACCCCCCCAGGAGAAGCTGG-
GAAACCAA
GCCTATAGTCAGGCCGAGAACGCCATGGCACGGAAGAAGCCATGCTGCCTGTGAGCCCCTCAGAGGACACTGAG-
TCAAAAAA
CCCCACGCGCTTGGAGGCGCAGGATGGGAAAAGAAGGTGGCGACCTTCCCCACCCTTCAATCTGGGGCCTGAAC-
TGGAGATC AGCTGTGGATCTCCAGAAGAGGGACTAGTGGTTAGAGGAGA KU527068.1 Zika
virus strain Natal RGN, Brazil: Rio Grande do Norte, Natal,
complete genome SEQ ID NO: 6
AGTTGTTGATCTGTGTGAATCAGACTGCGACAGTTCGAGTTTGAAGCGAAAGCTAGCAACAGTATCAACAGGTT-
TTATTTTGG
ATTTGGAAACGAGAGTTTCTGGTCATGAAAAACCCAAAAAAGAAATCCGGAGGATTCCGGATTGTCAATATGCT-
AAAACGCGG
AGTAGCCCGTGTGAGCCCCTTTGGGGGCTTGAAGAGGCTGCCAGCCGGACTTCTGCTGGGTCATGGGCCCATCA-
GGATGGTC
TTGGCAATTCTAGCCTTTTTGAGATTCACGGCAATCAAGCCATCACTGGGTCTCATCAATAGATGGGGTTCAGT-
GGGGAAAAA
AGAGGCTATGGAAATAATAAAGAAGTTCAAGAAAGATCTGGCTGCCATGCTGAGAATAATCAATGCTAGGAAGG-
AGAAGAA
GAGACGAGGCGCAGATACTAGTGTCGGAATTGTTGGCCTCCTGCTGACCACAGCTATGGCAGCGGAGGTCACTA-
GACGTGGG
AGTGCATACTATATGTACTTGGACAGAAACGATGCTGGGGAGGCCATATCTTTTCCAACCACATTGGGGATGAA-
TAAGTGTTA
TATACAGATCATGGATCTTGGACACATGTGTGATGCCACCATGAGCTATGAATGCCCTATGCTGGATGAGGGGG-
TGGAACCAG
ATGACGTCGATTGTTGGTGCAACACGACGTCAACTTGGGTTGTGTACGGAACCTGCCATCACAAAAAAGGTGAA-
GCACGGAG
ATCTAGAAGAGCTGTGACGCTCCCCTCCCATTCCACTAGGAAGCTGCAAACGCGGTCGCAAACCTGGTTGGAAT-
CAAGAGAAT
ACACAAAGCACTTGATTAGAGTCGAAAATTGGATATTCAGGAACCCTGGCTTCGCGTTAGCAGCAGCTGCCATC-
GCTTGGCTTT
TGGGAAGCTCAACGAGCCAAAAAGTCATATACTTGGTCATGATACTGCTGATTGCCCCGGCATACAGCATCAGG-
TGCATAGGA
GTCAGCAATAGGGACTTTGTGGAAGGTATGTCAGGTGGGACTTGGGTTGATGTTGTCTTGGAACATGGAGGTTG-
TGTCACCG
TAATGGCACAGGACAAACCGACTGTCGACATAGAGCTGGTTACAACAACAGTCAGCAACATGGCGGAGGTAAGA-
TCCTACTG
CTATGAGGCATCAATATCAGACATGGCTTCGGACAGCCGCTGCCCAACACAAGGTGAAGCCTACCTTGACAAGC-
AATCAGACA
CTCAATATGTCTGCAAAAGAACGTTAGTGGACAGAGGCTGGGGAAATGGATGTGGACTTTTTGGCAAAGGGAGC-
CTGGTGAC
ATGCGCTAAGTTTGCATGCTCCAAGAAAATGACCGGGAAGAGCATCCAGCCAGAGAATCTGGAGTACCGGATAA-
TGCTGTCA
GTTCATGGCTCCCAGCACAGTGGGATGATCGTTAATGACACAGGACATGAAACTGATGAGAATAGAGCGAAGGT-
TGAGATAA
CGCCCAATTCACCAAGAGCCGAAGCCACCCTGGGGGGTTTTGGAAGCCTAGGACTTGATTGTGAACCGAGGACA-
GGCCTTGA
CTTTTCAGATTTGTATTACTTGACTATGAATAACAAGCACTGGTTGGTCCACAAGGAGTGGTTCCACGACATTC-
CATTACCTTGG
CACGCTGGGGCAGACACCGGAACTCCACACTGGAACAACAAAGAAGCACTGGTAGAGTTCAAGGACGCACATGC-
CAAAAGG
CAAACTGTCGTGGTTCTAGGGAGTCAAGAAGGAGCAGTTCACACGGCCCTTGCTGGAGCTCTGGAGGCTGAGAT-
GGATGGTG
CAAAGGGAAGGCTGTCCTCTGGCCACTTGAAATGTCGCCTGAAAATGGATAAACTTAGATTGAAGGGCGTGTCA-
TACTCCTTG
TGTACCGCAGCGTTCACATTCACCAAGATCCCGGCTGAAACACTGCACGGGACAGTCACAGTGGAGGTACAGTA-
CGCAGGGA
CAGATGGACCTTGCAAGGTTCCAGCTCAGATGGCGGTGGACATGCAAACTCTGACCCCAGTTGGGAGGTTGATA-
ACCGCTAAC
CCCGTAATCACTGAAAGCACTGAGAACTCTAAGATGATGCTGGAACTTGATCCACCATTTGGGGACTCTTACAT-
TGTCATAGGA
GTCGGGGAGAAGAAGATCACCCACCACTGGCACAGGAGTGGCAGCACCATTGGAAAAGCATTTGAAGCCACTGT-
GAGAGGT
GCCAAGAGAATGGCAGTCTTGGGAGACACAGCCTGGGACTTTGGATCAGTTGGAGGCGCTCTCAACTCATTGGG-
CAAGGGCA
TCCATCAAATTTTTGGAGCAGCTTTCAAATCATTGTTTGGAGGAATGTCCTGGTTCTCACAAATCCTCATTGGA-
ACGTTGCTGAT
GTGGTTGGGTCTGAACACAAAGAATGGATCTATTTCCCTTATGTGCTTGGCCTTAGGGGGAGTGTTGATCTTCT-
TATCCACAGC
CGTCTCTGCTGATGTGGGGTGCTCGGTGGACTTCTCAAAGAAGGAGACGAGATGCGGTACAGGGGTGTTCGTCT-
ATAACGAC
GTTGAAGCCTGGAGGGACAGGTACAAGTACCATCCTGACTCCCCCCGTAGATTGGCAGCAGCAGTCAAGCAAGC-
CTGGGAAG
ATGGTATCTGCGGGATCTCCTCTGTTTCAAGAATGGAGAACATCATGTGGAGATCAGTAGAAGGGGAGCTCAAC-
GCAATCTTG
GAAGAGAATGGAGTTCAACTGACGGTCGTTGTGGGATCTGTAAAAAACCCCATGTGGAGAGGTCCACAGAGATT-
GCCCGTGC
CTGTGAACGAGCTGCCCCACGGCTGGAAGGCTTGGGGGAAATCGTACTTCGTCAGAGCAGCAAAGACAAATAAC-
AGCTTTGT
CGTGGATGGTGACACACTGAAGGAATGCCCACTCGAACATAGAGCATGGAACAGCTTTCTTGTGGAGGATCATG-
GGTTCGGG
GTATTTCACACTAGTGTCTGGCTCAAGGTTAGAGAAGATTATTCATTAGAGTGTGATCCAGCCGTTATTGGAAC-
AGCTGTTAAG
GGGAAGGAGGCTGTACACAGTGATCTAGGCTACTGGATTGAGAGTGAGAAGAATGACACATGGAGGCTGAAGAG-
GGCCCAT
CTAATCGAGATGAAAACATGTGAATGGCCAAAGTCCCACACATTGTGGGCAGATGGAATAGAAGAGAGTGATCT-
GATCATTCC
CAAGTCTTTAGCTGGGCCACTCAGCCATCACAATACCAGAGAGGGCTACAGGACCCAAATGAAAGGGCCATGGC-
ACAGTGAA
GAGCTTGAAATTCGGTTTGAGGAATGCCCGGGCACTAAGGTCCACGTGGAGGAAACATGTGGAACAAGAGGACC-
ATCTCTGA
GATCAACCACTGCAAGCGGAAGGGTGATCGAGGAATGGTGCTGCAGGGAGTGCACAATGCCCCCACTGTCGTTC-
CGGGCTAA
AGATGGCTGTTGGTATGGAATGGAGATAAGGCCCAGGAAAGAACCAGAAAGCAACTTAGTAAGGTCAGTGGTGA-
CTGCAGG
ATCAACTGATCACATGGATCACTTCTCCCTTGGAGTGCTTGTGATTCTGCTCATGGTGCAGGAAGGGCTGAAGA-
AGAGAATGA
CCACAAAGATCATCATAAGCACATCAATGGCAGTGCTGGTAGCTATGATCCTGGGAGGATTTTCAATGAGTGAC-
CTGGCTAAG
CTTGCAATTTTGATGGGCGCCACCTTCGCGGAAATGAACACTGGAGGAGATGTAGCTCATCTGGCGCTGATAGC-
GGCATTCAA
AGTCAGACCAGCGTTGCTGGTATCTTTCATCTTCAGAGCTAATTGGACACCCCGTGAAAGCATGCTGCTGGCCT-
TGGCCTCGTG
TCTTTTGCAAACTGCGATCTCCGCCTTGGAAGGCGACCTGATGGTTCTCATCAATGGTTTTGCTTTGGCCTGGT-
TGGCAATACG
AGCGATGGTTGTTCCACGCACTGATAACATCACCTTGGCAATCCTGGCTGCTCTGACACCACTGGCCCGGGGCA-
CACTGCTTGT
GGCGTGGAGAGCAGGCCTTGCTACTTGCGGGGGGTTTATGCTCCTCTCTCTGAAGGGAAAAGGCAGTGTGAAGA-
AGAACTTA
CCATTTGTCATGGCCCTGGGACTAACCGCTGTGAGGCTGGTCGACCCCATCAACGTGGTGGGACTGCTGTTGCT-
CACAAGGAG
TGGGAAGCGGAGCTGGCCCCCTAGCGAAGTACTCACAGCTGTTGGCCTGATATGCGCATTGGCTGGAGGGTTCG-
CCAAGGCA
GATATAGAGATGGCTGGGCCCATGGCCGCGGTCGGTCTGCTAATTGTCAGTTACGTGGTCTCAGGAAAGAGTGT-
GGACATGT
ACATTGAAAGAGCAGGTGACATCACATGGGAAAAAGATGCGGAAGTCACTGGAAACAGTCCCCGGCTCGATGTG-
GCGCTAGA
TGAGAGTGGTGATTTCTCCCTGGTGGAGGATGACGGTCCCCCCATGAGAGAGATCATACTCAAGGTGGTCCTGA-
TGACCATCT
GTGGCATGAACCCAATAGCCATACCCTTTGCAGCTGGAGCGTGGTACGTATACGTGAAGACTGGAAAAAGGAGT-
GGTGCTCT
ATGGGATGTGCCTGCTCCCAAGGAAGTAAAAAAGGGGGAGACCACAGATGGAGTGTACAGAGTAATGACTCGTA-
GACTGCTA
GGTTCAACACAAGTTGGAGTGGGAGTTATGCAAGAGGGGGTCTTTCACACTATGTGGCACGTCACAAAAGGATC-
CGCGCTGA
GAAGCGGTGAAGGGAGACTTGATCCATACTGGGGAGATGTCAAGCAGGATCTGGTGTCATACTGTGGTCCATGG-
AAGCTAGA
TGCCGCCTGGGACGGGCACAGCGAGGTGCAGCTCTTGGCCGTGCCCCCCGGAGAGAGAGCGAGGAACATCCAGA-
CTCTGCCC
GGAATATTTAAGACAAAGGATGGGGACATTGGAGCGGTTGCGCTGGATTACCCAGCAGGAACTTCAGGATCTCC-
AATCCTAG
ACAAGTGTGGGAGAGTGATAGGACTTTATGGCAATGGGGTCGTGATCAAAAATGGGAGTTATGTTAGTGCCATC-
ACCCAAGG
GAGGAGGGAGGAAGAGACTCCTGTTGAGTGCTTCGAGCCTTCGATGCTGAAGAAGAAGCAGCTAACTGTCTTAG-
ACTTGCAT
CCTGGAGCTGGGAAAACCAGGAGAGTTCTTCCTGAAATAGTCCGTGAAGCCATAAAAACAAGACTCCGTACTGT-
GATCTTAGC
TCCAACCAGGGTTGTCGCTGCTGAAATGGAGGAAGCCCTTAGAGGGCTTCCAGTGCGTTATATGACAACAGCAG-
TCAATGTCA
CCCACTCTGGAACAGAAATCGTCGACTTAATGTGCCATGCCACCTTCACTTCACGTCTACTACAGCCAATCAGA-
GTCCCCAACTA
TAATCTGTATATTATGGATGAGGCCCACTTCACAGATCCCTCAAGTATAGCAGCAAGAGGATACATTTCAACAA-
GGGTTGAGA
TGGGCGAGGCGGCTGCCATCTTCATGACCGCCACGCCACCAGGAACCCGTGACGCATTTCCGGACTCCAACTCA-
CCAATTATG
GACACCGAAGTGGAAGTCCCAGAGAGAGCCTGGAGCTCAGGCTTTGATTGGGTGACGGATCATTCTGGAAAAAC-
AGTTTGGT
TTGTTCCAAGCGTGAGGAACGGCAATGAGATCGCAGCTTGTCTGACAAAGGCTGGAAAACGGGTCATACAGCTC-
AGCAGAAA
GACTTTTGAGACAGAGTTCCAGAAAACAAAACATCAAGAGTGGGACTTTGTCGTGACAACTGACATTTCAGAGA-
TGGGCGCCA
ACTTTAAAGCTGACCGTGTCATAGATTCCAGGAGATGCCTAAAGCCGGTCATACTTGATGGCGAGAGAGTCATT-
TTGGCTGGA
CCCATGCCTGTCACACATGCCAGCGCTGCCCAGAGGAGGGGGCGCATAGGCAGGAATCCCAACAAACCTGGAGA-
TGAGTATC
TGTATGGAGGTGGGTGCGCAGAGACTGACGAAGACCATGCACACTGGCTTGAAGCAAGAATGCTCCTTGACAAT-
ATTTACCTC
CAAGATGGCCTCATAGCCTCGCTCTATCGACCTGAGGCCGACAAAGTAGCAGCCATTGAGGGAGAGTTCAAGCT-
TAGGACGG
AGCAAAGGAAGACCTTTGTGGAACTCATGAAAAGAGGAGATCTTCCTGTTTGGCTGGCCTATCAGGTTGCATCT-
GCCGGAATA
ACCTACACAGATAGAAGATGGTGCTTTGATGGCACGACCAACAACACCATAATGGAAGACAGTGTGCCGGCAGA-
GGTGTGGA
CCAGACACGGAGAGAAAAGAGTGCTCAAACCGAGGTGGATGGACGCCAGAGTTTGTTCAGATCATGCGGCCCTG-
AAGTCATT
CAAGGAGTTTGCCGCTGGGAAAAGAGGAGCGGCTTTTGGAGTGATGGAAGCCCTGGGAACACTGCCAGGACACA-
TGACAGA
GAGATTCCAGGAAGCCATTGACAACCTCGCTGTGCTCATGCGGGCAGAGACTGGAAGCAGGCCTTACAAAGCCG-
CGGCGGCC
CAATTGCCGGAGACCCTAGAGACCATTATGCTTTTGGGGTTGCTGGGAACAGTCTCGCTGGGAATCTTTTTCGT-
CTTGATGAGG
AACAAGGGCATAGGGAAGATGGGCTTTGGAATGGTGACTCTTGGGGCCAGCGCATGGCTCATGTGGCTCTCGGA-
AATTGAGC
CAGCCAGAATTGCATGTGTCCTCATTGTTGTGTTCCTATTGCTGGTGGTGCTCATACCTGAGCCAGAAAAGCAA-
AGATCTCCCC
AGGACAACCAAATGGCAATCATCATCATGGTAGCAGTAGGTCTTCTGGGCTTGATTACCGCCAATGAACTCGGA-
TGGTTGGAG
AGAACAAAGAGTGACCTAAGCCATCTAATGGGAAGGAGAGAGGAGGGAGCAACCATAGGATTCTCAATGGACAT-
TGACCTG
CGGCCAGCCTCAGCTTGGGCCATCTATGCTGCCTTGACAACTTTCATTACCCCAGCCGTCCAACATGCAGTGAC-
CACTTCATACA
ACAACTACTCCTTAATGGCGATGGCCACGCAAGCTGGAGTGTTGTTTGGTATGGGCAAAGGGATGCCATTCTAC-
GCATGGGAC
TTTGGAGTCCCGCTGCTAATGATAGGTTGCTACTCACAATTAACACCCCTGACCCTAATAGTGGCCATCATTTT-
GCTCGTGGCGC
ACTACATGTACTTGATCCCAGGGCTGCAGGCAGCAGCTGCGCGTGCTGCCCAGAAGAGAACGGCAGCTGGCATC-
ATGAAGAA
CCCTGTTGTGGATGGAATAGTGGTGACTGACATTGACACAATGACAATTGACCCCCAAGTGGAGAAAAAGATGG-
GACAGGTG
CTACTCATAGCAGTAGCAGTCTCCAGCGCCATACTGTCGCGGACCGCCTGGGGGTGGGGGGAGGCTGGGGCCCT-
GATCACAG
CCGCAACTTCCACTTTGTGGGAAGGCTCTCCGAACAAGTACTGGAACTCCTCTACAGCCACTTCACTGTGTAAC-
ATTTTTAGGG
GAAGTTACTTGGCTGGAGCTTCTCTAATCTACATAGTAACAAGAAACGCTGGCTTGGTCAAGAGACGTGGGGGT-
GGAACAGG
AGAGACCCTGGGAGAGAAATGGAAGGCCCGCTTGAACCAGATGTCGGCCCTGGAGTTCTACTCCTACAAAAAGT-
CAGGCATC
ACCGAGGTGTGCAGAGAAGAGGCCCGCCGCGCCCTCAAGGATGGTGTGGCAACGGGAGGCCATGCTGTGTCCCG-
AGGAAGT
GCAAAGCTGAGATGGTTGGTGGAGCGGGGATACCTGCAGCCCTATGGAAAGGTCATTGATCTTGGATGTGGCAG-
AGGGGGC
TGGAGTTACTACGCCGCCACCATCCGCAAAGTTCAAGAAGTGAAAGGATACACAAAAGGAGGCCCTGGTCATGA-
AGAACCCG
TGTTGGTGCAAAGCTATGGGTGGAACATAGTCCGTCTTAAGAGTGGGGTGGACGTCTTTCATATGGCGGCTGAG-
CCGTGTGA
CACGTTGCTGTGTGACATAGGTGAGTCATCATCTAGTCCTGAAGTGGAAGAAGCACGGACGCTCAGAGTCCTCT-
CCATGGTGG
GGGATTGGCTTGAAAAAAGACCAGGAGCCTTTTGTATAAAAGTGTTGTGCCCATACACCAGCACTATGATGGAA-
ACCCTGGAG
CGACTGCAGCGTAGGTATGGGGGAGGACTGGTCAGAGTGCCACTCTCCCGCAACTCTACACATGAGATGTACTG-
GGTCTCTG
GAGCGAAAAGCAACACCATAAAAAGTGTGTCCACCACGAGCCAGCTCCTCTTGGGGCGCATGGACGGGCCTAGG-
AGGCCAGT
GAAATATGAGGAGGATGTGAATCTCGGCTCTGGCACGCGGGCTGTGGTAAGCTGCGCTGAAGCTCCCAACATGA-
AGATCATT
GGTAACCGCATTGAAAGGATCCGCAGTGAGCACGCGGAAACGTGGTTCTTTGACGAGAACCACCCATATAGGAC-
ATGGGCTT
ACCATGGAAGCTATGAGGCCCCCACACAAGGGTCAGCGTCCTCTCTAATAAACGGGGTTGTCAGGCTCCTGTCA-
AAACCCTGG
GATGTGGTGACTGGAGTCACAGGAATAGCCATGACCGACACCACACCGTATGGTCAGCAAAGAGTTTTCAAGGA-
AAAAGTGG
ACACTAGGGTGCCAGACCCCCAAGAAGGCACTCGTCAGGTTATGAGCATGGTCTCTTCCTGGTTGTGGAAAGAG-
CTAGGCAA
ACACAAACGGCCACGAGTCTGTACCAAAGAAGAGTTCATCAACAAGGTTCGTAGCAATGCAGCATTAGGGGCAA-
TATTTGAA
GAGGAAAAAGAGTGGAAGACTGCAGTGGAAGCTGTGAACGATCCAAGGTTCTGGGCTCTAGTGGACAAGGAAAG-
AGAGCAC
CACCTGAGAGGAGAGTGCCAGAGTTGTGTGTACAACATGATGGGAAAAAGAGAAAAGAAACAAGGGGAATTTGG-
AAAGGCC
AAGGGCAGCCGCGCCATCTGGTATATGTGGCTAGGGGCTAGATTTCTAGAGTTCGAAGCCCTTGGATTCTTGAA-
CGAGGATCA
CTGGATGGGGAGAGAGAACTCAGGAGGTGGTGTTGAAGGGCTGGGATTACAAAGACTCGGATATGTCCTAGAAG-
AGATGAG
TCGCATACCAGGAGGAAGGATGTATGCAGATGACACTGCTGGCTGGGACACCCGCATCAGCAGGTTCGATCTGG-
AGAATGAA
GCTCTAATCACCAACCAAATGGAGAAAGGGCATAGGGCCTTGGCATTGGCCATAATCAAGTACACATACCAAAA-
CAAAGTGGT
AAAGGTCCTTAGACCAGCTGAAAAAGGGAAAACAGTTATGGACATTATTTCGAGACAAGACCAAAGGGGGAGCG-
GACAAGTT
GTCACTTACGCTCTTAACACATTTACCAACCTAGTGGTGCAACTCATTCGGAATATGGAGGCTGAGGAAGTTCT-
AGAGATGCAA
GACTTGTGGCTGCTGCGGAGGTCAGAGAAAGTGACCAACTGGTTGCAGAGCAACGGATGGGATAGGCTCAAACG-
AATGGCA
GTCAGTGGAGATGATTGCGTTGTGAAGCCAATTGATGATAGGTTTGCACATGCCCTCAGGTTCTTGAATGATAT-
GGGAAAAGT
TAGGAAGGACACACAAGAGTGGAAACCCTCAACTGGATGGGACAACTGGGAAGAAGTTCCGTTTTGCTCCCACC-
ACTTCAACA
AGCTCCATCTCAAGGACGGGAGGTCCATTGTGGTTCCCTGCCGCCACCAAGATGAACTGATTGGCCGGGCCCGC-
GTCTCTCCA
GGGGCGGGATGGAGCATCCGGGAGACTGCTTGCCTAGCAAAATCATATGCGCAAATGTGGCAGCTCCTTTATTT-
CCACAGAA
GGGACCTCCGACTGATGGCCAATGCCATTTGTTCATCTGTGCCAGTTGACTGGGTTCCAACTGGGAGAACTACC-
TGGTCAATCC
ATGGAAAGGGAGAATGGATGACCACTGAAGACATGCTTGTGGTGTGGAACAGAGTGTGGATTGAGGAGAACGAC-
CACATGG
AAGACAAGACCCCAGTTACGAAATGGACAGACATTCCCTATTTGGGAAAAAGGGAAGACTTGTGGTGTGGATCT-
CTCATAGG
GCACAGACCGCGCACCACCTGGGCTGAGAACATTAAAAACACAGTCAACATGGTGCGCAGGATCATAGGTGATG-
AAGAAAAG
TACATGGACTACCTATCCACCCAAGTTCGCTACTTGGGTGAAGAAGGGTCTACACCTGGAGTGCTGTAAGCACC-
AATCTTAATG
TTGTCAGGCCTGCTAGTCAGCCACAGCTTGGGGAAAGCTGTGCAGCCTGTGACCCCCCCAGGAGAAGCTGGGAA-
ACCAAGCC
TATAGTCAGGCCGAGAACGCCATGGCACGGAAGAAGCCATGCTGCCTGTGAGCCCCTCAGAGGACACTGAGTCA-
AAAAACCC
CATGCGCTTGGAGGCGCAGGATGGGAAAAGAAGGTGGCGACCTTCCCCACCCTTCAATCTGGGGCCTGAACTGG-
AGATCAGC
TGTGGATCTCCAGAAGAGGGACTAGTGGTTAGAGGAGACCCCCCGGAAAACGCAAAACAGCATATTGACGCTGG-
GAAAGAC
CAGAGACTCCATGAGTTTCCACCACGCTGGCCGCCAGGCACAGATCGCCGAATAGCGGCGGCCGGTGTGGGGAA-
ATCCATGG GTCTT KU681081.3 Zika virus isolate Zika virus/H.
sapiens-tc/THA/2014/SV0127-14, Thailand, complete genome SEQ ID NO:
7
AGTTGTTGATCTGTGTGAATCAGACTGCGACAGTTCGAGTTTGAAGCGAAAGCTAGCAACAGTATCAACAGGTT-
TTATTTTGG
ATTTGGAAACGAGAGTTTCTGGTCATGAAAAACCCAAAAAAGAAATCCGGAGGATTCCGGATTGTCAATATGCT-
AAAACGCGG
AGTAGCCCGTGTGAGCCCCTTTGGGGGCTTGAAGAGGCTGCCAGCCGGACTTCTGCTGGGTCATGGGCCCATCA-
GGATGGTC
TTGGCGATTCTAGCCTTTTTGAGATTCACGGCAATCAAGCCATCACTGGGTCTCATCAATAGATGGGGTTCAGT-
GGGAAAAAA
AGAGGCTATGGAAATAATAAAGAAGTTCAAGAAAGATCTGGCTGCCATGCTGAGAATAATCAATGCTAGGAAGG-
AGAAGAA
GAGACGAGGCACAGATACTAGTGTCGGAATTGTTGGCCTCCTGCTGACCACAGCTATGGCAGCGGAGGTCACTA-
GACGTGGG
AGTGCATACTATATGTACTTGGACAGAAGCGATGCTGGGGAGGCCATATCTTTTCCAACCACACTGGGGATGAA-
TAAGTGTTA
TATACAGATCATGGATCTTGGACACATGTGTGATGCCACCATGAGCTATGAATGCCCTATGCTGGATGAGGGGG-
TAGAACCAG
ATGACGTCGATTGTTGGTGCAACACGACGTCAACTTGGGTTGTGTACGGAACCTGCCATCACAAAAAAGGTGAA-
GCACGGAG
ATCCAGAAGAGCTGTGACGCTCCCCTCCCATTCCACTAGGAAGCTGCAAACGCGGTCGCAGACCTGGTTGGAAT-
CAAGAGAAT
ACACAAAGCACTTGATTAGAGTCGAAAATTGGATATTCAGGAACCCTGGCTTCGCGTTAGCAGCAGCTGCCATC-
GCTTGGCTTT
TGGGAAGCTCAACGAGCCAAAAAGTCATATACTTGGTCATGATACTGCTGATTGCCCCGGCATACAGCATCAGG-
TGCATAGGA
GTCAGTAATAGGGACTTTGTGGAAGGTATGTCAGGTGGGACTTGGGTTGATGTTGTCTTGGAACATGGAGGTTG-
TGTCACCGT
AATGGCACAGGACAAACCGACTGTCGACATAGAGCTGGTTACAACAACAGTCAGCAACATGGCGGAGGTAAGAT-
CCTACTGC
TATGAGGCATCAATATCGGACATGGCTTCGGACAGCCGCTGCCCAACACAAGGTGAAGCCTACCTTGACAAGCA-
ATCAGACAC
TCAATATGTCTGCAAAAGAACGTTAGTGGACAGAGGCTGGGGAAATGGATGTGGACTTTTTGGCAAAGGGAGCC-
TGGTGACA
TGCGCTAAGTTTGCATGCTCCAAGAAAATGACCGGGAAGAGCATCCAGCCAGAGAATCTGGAGTACCGGATAAT-
GCTGTCAG
TTCATGGCTCCCAGCACAGTGGGATGATCGTTAATGACACAGGACATGAAACTGATGAGAATAGAGCGAAGGTT-
GAGATAAC
GCCCAATTCACCAAGAGCCGAAGCCACCCTGGGGGGTTTTGGAAGCCTAGGACTTGATTGTGAACCGAGGACAG-
GCCTTGAC
TTTTCAGATTTGTATTACTTGACTATGAACAACAAGCACTGGTTGGTTCACAAGGAGTGGTTCCACGACATTCC-
ATTACCTTGGC
ACACTGGGGCAGACACCGGAACTCCACACTGGAACAACAAAGAAGCACTGGTAGAGTTCAAGGACGCACATGCC-
AAAAGGC
AAACTGTCGTGGTTCTAGGGAGTCAAGAAGGAGCAGTTCACACGGCCCTTGCTGGAGCTCTGGAGGCTGAGATG-
GATGGTGC
AAAGGGAAGGCTGTCCTCTGGCCACTTGAAATGTCGCCTGAAAATGGATAAACTTAGATTGAAGGGCGTGTCAT-
ACTCCTTGT
GTACCGCAGCGTTCACATTCACCAAGATCCCGGCTGAAACACTGCACGGGACAGTCACAGTGGAGGTACAGTAC-
GCAGGGAC
AGATGGACCTTGCAAGGTTCCAGCTCAGATGGCGGTGGACATGCAAACTCTGACCCCAGTTGGGAGGTTGATAA-
CCGCTAACC
CCGTAATCACTGAAGGCACTGAGAACTCTAAGATGATGCTGGAACTTGATCCACCATTTGGGGACTCTTACATT-
GTCATAGGA
GTCGGGGAGAAGAAGATCACCCACCACTGGCACAGGAGTGGCAGCACCATTGGAAAAGCATTTGAAGCCACTGT-
GAGAGGT
GCCAAGAGAATGGCAGTCTTGGGAGACACAGCCTGGGACTTTGGATCAGTTGGAGGCGTTCTTAACTCATTGGG-
CAAGGGCA
TCCATCAAATTTTTGGAGCAGCTTTCAAATCATTGTTTGGAGGAATGTCCTGGTTCTCACAAATTCTCATTGGA-
ACGTTGCTGAT
GTGGTTGGGTCTGAATACAAAGAATGGATCTATTTCCCTTATGTGCTTGGCCTTAGGGGGAGTGTTGATCTTCT-
TATCCACAGC
CGTCTCCGCTGATGTGGGGTGCTCGGTGGACTTCTCAAAGAAGGAAACGAGATGCGGTACAGGGGTGTTCGTCT-
ATAACGAC
GTTGAAGCCTGGAGGGACAGGTACAAGTACCATCCTGACTCCCCTCGTAGATTGGCAGCAGTAGTCAAGCAAGC-
CTGGGAAG
ATGGTATCTGTGGGATCTCCTCTGTTTCAAGAATGGAAAACATCATGTGGAGATCAGTAGAAGGGGAGCTCAAC-
GCAATCCTG
GAAGAGAATGGAGTTCAACTGACGGTCGTTGTGGGATCTGTAAAAAACCCCATGTGGAGAGGTCCACAGAGATT-
GCCCGTGC
CTGTGAACGAGCTGCCCCACGGCTGGAAGGCTTGGGGGAAATCGTACTTCGTCAGAGCAGCAAAGACAAATAAC-
AGCTTTGT
CGTGGATGGTGACACACTGAAGGAATGCCCACTCAAACATAGAGCATGGAACAGCTTTCTTGTGGAGGATCATG-
GGTTCGGG
GTATTTCACACTAGTGTCTGGCTCAAGGTTAGAGAAGATTATTCACTAGAGTGTGATCCAGCCGTCATTGGAAC-
AGCTGTTAAG
GGAAAGGAGGCTGTACACAGTGATCTAGGCTACTGGATTGAGAGTGAGAAGAACGACACATGGAGGCTGAGGAG-
GGCCCAC
CTGATCGAGATGAAAACATGTGAATGGCCAAAGTCCCACACATTGTGGACAGATGGAATAGAAGAGAGTGATCT-
GATCATAC
CCAAGTCTTTAGCTGGGCCACTCAGCCATCACAACACCAGAGAGGGCTACAGGACCCAAATGAAAGGGCCATGG-
CACAGTGA
AGAGCTTGAAATTCGGTTTGAGGAATGCCCAGGCACTAAGGTCCACGTGGAGGAAACATGTGGAACAAGAGGAC-
CATCTCTG
AGATCAACCACTGCAAGCGGAAGGGTGATCGAGGAATGGTGCTGCAGGGAGTGCACAATGCCCCCACTGTCGTT-
CCGGGCTA
AAGATGGCTGTTGGTATGGAATGGAGATAAGGCCCAGGAAAGAACCAGAAAGTAACTTAGTAAGGTCAATGGTG-
ACTGCAG
GATCAACTGATCACATGGATCACTTTTCCCTTGGAGTGCTTGTGATTCTGCTCATGGTGCAGGAAGGGCTGAAG-
AAGAGAATG
ACCACAAAGATCATCATAAGCACATCAATGGCAGTGCTGGTAGCTATGATCCTGGGAGGATTTTCAATGAGTGA-
TCTGGCTAA
GCTTGCAATTTTGATGGGTGCCACCTTTGCGGAAATGAACACTGGAGGAGATGTAGCTCATCTGGCGCTGGTAG-
CGGCATTCA
AAGTCAGACCAGCGTTGCTGGTATCTTTCATCTTCAGAGCTAATTGGACACCCCGTGAAAGCATGCTGCTGGCC-
TTGGCCTCGT
GTCTTTTGCAAACTGCGATCTCCGCCTTGGAAGGCGACCTGATGGTTCTCATCAATGGTTTTGCTTTGGCCTGG-
TTGGCAATAC
GAGCGATGGTTGTTCCACGCACTGACAATATCACCTTGGCAATCCTGGCTGCTCTGACACCACTGGCCCGGGGC-
ACACTGCTTG
TGGCGTGGAGAGCAGGCCTTGCTACTTGCGGGGGGTTCATGCTCCTCTCTCTGAAGGGGAAAGGCAGTGTGAAG-
AAGAACTT
ACCATTTGTCATGGCCCTGGGACTAACCGCTGTGAGGCTGGTCGACCCCATCAACGTGGTGGGACTGCTGTTGC-
TCACAAGGA
GTGGGAAGCGGAGCTGGCCCCCTAGCGAAGTACTCACAGCTGTTGGCCTGATATGCGCATTGGCTGGAGGGTTC-
GCCAAGGC
AGATATAGAGATGGCTGGGCCCATGGCCGCGGTCGGTCTGCTAATTGTCAGTTACGTGGTCTCAGGAAAGAGTG-
TGGACATG
TACATTGAAAGAGCAGGTGACATCACATGGGAAAAAGATGCGGAAGTTACTGGAAACAGTCCCCGGCTCGATGT-
GGCACTAG
ATGAGAGTGGTGATTTCTCCCTGGTGGAGGATGACGGTCCCCCCATGAGAGAGATCATACTCAAAGTGGTCCTG-
ATGACCATC
TGTGGCATGAACCCAATAGCCATACCCTTTGCAGCTGGAGCGTGGTACGTATACGTGAAAACTGGAAAAAGGAG-
TGGTGCTCT
ATGGGATGTGCCTGCTCCCAAGGAAGTAAAAAAGGGGGAGACCACAGATGGAGTGTACAGAGTAATGACTCGTA-
GACTGCTA
GGTTCAACACAAGTTGGAGTGGGAGTTATGCAAGAGGGGGTCTTTCACACTATGTGGCATGTCACAAAAGGATC-
CGCGCTGA
GAAGCGGTGAAGGGAGACTTGATCCATACTGGGGAGATGTCAAGCAGGATCTGGTGTCATACTGTGGTCCATGG-
AAGCTAGA
TGCCGCCTGGGACGGGCACAGCGAGGTGCAGCTCTTGGCCGTGCCCCCCGGAGAGAGAGCGAGGAACATCCAGA-
CTCTGCCC
GGAATATTTAAGACAAAGGATGGGGACATTGGAGCGGTTGCGCTGGACTATCCAGCAGGAACTTCAGGATCTCC-
AATCCTAG
ACAAGTGTGGGAGAGTGATAGGACTCTATGGCAATGGGGTCGTGATCAAGAATGGGAGTTATGTCAGTGCCATC-
ACCCAAGG
GAGGAGGGAGGAAGAGACTCCTGTTGAGTGCTTCGAGCCTTCGATGCTGAAGAAGAAGCAGCTAACTGTCTTAG-
ACTTGCAT
CCTGGAGCTGGGAAAACCAGGAGAGTTCTTCCTGAAATAGTCCGTGAAGCCATAAAAACGAGACTCCGTACTGT-
GATCTTAGC
TCCAACCAGGGTTGTCGCTGCTGAAATGGAGGAAGCCCTTAGAGGGCTTCCAGTGCGTTATATGACAACAGCAG-
TCAATGTCA
CCCATTCTGGGACAGAAATCGTTGACTTAATGTGCCATGCCACCTTCACTTCACGTCTACTACAGCCAATCAGA-
GTCCCCAACTA
TAATCTGTATATTATGGATGAGGCCCACTTCACAGATCCCTCAAGTATAGCAGCAAGAGGATACATTTCAACAA-
GGGTTGAGA
TGGGCGAGGCAGCTGCCATCTTCATGACCGCCACGCCACCAGGAACCCGTGACGCATTCCCGGACTCCAACTCA-
CCAATTATG
GACACCGAAGTGGAAGTCCCAGAGAGAGCCTGGAGCTCAGGCTTTGATTGGGTGACGGATCATTCTGGAAAAAC-
AGTTTGGT
TTGTCCCAAGCGTGAGGAACGGCAATGAGATCGCAGCTTGTCTGACAAAGGCTGGAAAACGGGTCATACAGCTC-
AGCAGAAA
GACTTTTGAGACAGAGTTCCAGAAAACAAAACATCAAGAGTGGGACTTCGTCGTGACAACTGACATTTCAGAGA-
TGGGCGCCA
ACTTTAAAGCTGACCGTGTCATAGATTCCAGGAGATGCCTAAAGCCGGTCATACTTGATGGCGAGAGAGTCATT-
CTGGCTGGA
CCCATGCCTGTCACACATGCCAGCGCTGCCCAGAGGAGGGGGCGCATAGGCAGGAATCCCAACAAACCTGGAGA-
TGAGTATC
TGTATGGAGGTGGGTGCGCAGAGACTGATGAAGACCATGCACACTGGCTTGAAGCAAGAATGCTCCTTGACAAT-
ATTTACCTC
CAAGATGGCCTCATAGCCTCGCTCTATCGACCTGAGGCCGACAAAGTAGCAGCCATTGAGGGAGAGTTCAAGCT-
TAGGACGG
AGCAAAGGAAGACCTTTGTGGAACTCATGAAAAGAGGAGATCTTCCTGTTTGGCTGGCCTATCAGGTTGCATCT-
GCCGGAATA
ACCTACACAGATAGAAGATGGTGCTTTGATGGCATGACCAACAACACCATAATGGAAGACAGTGTGCCGGCAGA-
GGTGTGGA
CCAGACACGGAGAGAAAAGAGTGCTCAAACCGAGGTGGATGGACGCCAGAGTTTGTTCAGATCATGCGGCCCTG-
AAGTCATT
CAAGGAGTTTGCCGCTGGGAAAAGAGGAGCGGCTTTTGGAGTGATGGAAGCCCTGGGAACACTGCCAGGACACA-
TGACGGA
GAGATTCCAGGAAGCCATTGACAACCTCGCTGTGCTCATGCGGGCAGAGACTGGAAGCAGGCCTTACAAAGCCG-
CGGCGGCC
CAATTGCCGGAGACCCTAGAGACCATTATGCTTTTGGGGTTGCTGGGAACAGTCTCGCTGGGAATCTTTTTCGT-
CTTGATGCGG
AACAAGGGCATAGGGAAGATGGGCTTTGGAATGGTGACTCTTGGGGCCAGCGCATGGCTCATGTGGCTCTCGGA-
AATTGAGC
CAGCCAGAATTGCATGCGTCCTCATTGTTGTGTTCCTATTGCTGGTGGTGCTCATACCTGAGCCAGAAAAGCAA-
AGATCCCCCC
AGGACAACCAAATGGCAATCATCATCATGGTAGCAGTAGGTCTTCTGGGCTTGATTACCGCCAATGAACTCGGA-
TGGTTGGAG
AGAACAAAGAGTGACCTAAGCCATCTAATGGGAAGGAGAGAGGAGGGGGCAACCATAGGATTCTCAATGGACAT-
TGACCTG
CGGCCAGCCTCGGCCTGGGCCATCTATGCTGCCCTGACAACTTTCATTACCCCAGCCGTCCAACATGCAGTGAC-
CACTTCATAC
AACAACTACTCCTTAATGGCGATGGCCACGCAAGCTGGAGTGTTGTTTGGTATGGGCAAAGGGATGCCATTCTA-
CGCATGGGA
CTTTGGAGTCCCGCTGCTAATGATAGGTTGCTACTCACAATTAACACCCCTGACCCTAATAGTGGCTATCATTT-
TGCTCGTGGCG
CACTACATGTACTTGATCCCAGGGCTGCAGGCAGCAGCTGCGCGTGCTGCCCAGAAGAGAACGGCAGCTGGCAT-
CATGAAGA
ACCCTGTTGTGGATGGAATAGTGGTGACTGACATTGACACAATGACTATTGACCCCCAAGTGGAGAAAAAGATG-
GGACAGGT
GCTACTCATAGCAGTAGCCGTCTCCAGCGCCATACTGTCGCGGACCGCCTGGGGGTGGGGGGAAGCTGGGGCCC-
TGATCACA
GCTGCAACTTCCACTTTGTGGGAAGGCTCTCCGAACAAGTACTGGAACTCCTCTACAGCCACTTCACTGTGCAA-
CATTTTTAGG
GGAAGTTACTTGGCTGGAGCTTCTCTAATCTACACAGTAACAAGAAACGCTGGCTTGGTCAAGAGACGTGGGGG-
TGGAACAG
GAGAGACCCTGGGAGAGAAATGGAAGGCCCGCTTGAACCAGATGTCGGCCCTGGAGTTCTACTCCTACAAAAAG-
TCAGGCAT
CACCGAGGTGTGCAGAGAAGAGGCCCGCCGCGCCCTCAAGGACGGTGTGGCAACGGGAGGCCATGCTGTGTCCC-
GAGGAAG
TGCAAAGCTGAGATGGTTGGTGGAGCGGGGATACCTGCAGCCCTATGGAAAGGTCATTGATCTTGGATGTGGCA-
GAGGGGG
CTGGAGTTACTACGCCGCCACCATCCGCAAAGTTCAAGAAGTGAAAGGATACACAAAAGGAGGCCCTGGTCATG-
AAGAACCC
ATGTTGGTGCAAAGCTATGGGTGGAACATAGTCCGTCTTAAGAGTGGGGTGGACGTCTTTCATATGGCGGCTGA-
GCCGTGTG
ACACGTTGCTGTGTGACATAGGTGAGTCATCATCTAGTCCTGAAGTGGAAGAAGCACGGACGCTCAGAGTCCTC-
TCCATGGTG
GGGGATTGGCTTGAAAAAAGACCAGGAGCCTTTTGTGTAAAAGTGTTGTGCCCATACACCAGCACTATGATGGA-
AACCCTGGA
GCGACTGCAGCGTAGGTATGGGGGAGGACTGGTCAGAGTGCCACTCTCCCGCAACTCTACACATGAGATGTACT-
GGGTCTCT
GGAGCGAAAAGCAACACCATAAAAAGTGTGTCCACCACGAGCCAGCTCCTCTTGGGGCGCATGGACGGGCCCAG-
GAGGCCA
GTGAAATATGAGGAGGATGTGAATCTCGGCTCTGGCACGCGGGCTGTGGTAAGCTGCGCTGAAGCTCCCAACAT-
GAAGATCA
TTGGTAACCGCATTGAAAGGATCCGCAGTGAGCACGCGGAAACGTGGTTCTTTGACGAGAACCACCCATATAGG-
ACATGGGC
TTACCATGGAAGCTATGAGGCCCCTACACAAGGGTCAGCGTCCTCTCTAATAAACGGGGTTGTCAGGCTCCTGT-
CAAAACCCT
GGGATGTGGTGACTGGAGTCACAGGAATAGCCATGACCGACACCACACCGTATGGTCAGCAAAGAGTTTTCAAG-
GAAAAAGT
GGACACCAGGGTGCCAGACCCCCAAGAAGGCACTCGTCAGGTTATGAGCATGGTCTCTTCCTGGTTGTGGAAAG-
AGCTAGGC
AAACACAAACGGCCACGAGTCTGTACCAAAGAAGAGTTCATCAACAAGGTTCGTAGCAATGCAGCATTAGGGGC-
AATATTTGA
AGAGGAAAAAGAGTGGAAGACCGCAGTGGAAGCTGTGAACGATCCAAGGTTCTGGGCTCTAGTGGACAAGGAAA-
GAGAGC
ACCACCTGAGAGGAGAGTGCCAGAGCTGTGTGTACAACATGATGGGAAAAAGAGAAAAGAAACAAGGGGAATTT-
GGAAAG
GCCAAGGGCAGCCGCGCCATCTGGTATATGTGGCTAGGGGCTAGATTTCTAGAGTTCGAAGCCCTTGGATTCTT-
AAATGAGGA
TCACTGGATGGGGAGAGAGAACTCAGGAGGTGGTGTTGAAGGGCTGGGATTACAAAGACTCGGATATGTCCTAG-
AAGAGAT
GAGTCGCATACCAGGAGGAAGGATGTATGCAGATGACACTGCTGGCTGGGACACCCGCATCAGCAGGTTTGATC-
TGGAGAAT
GAAGCTTTAATCACCAACCAAATGGAGAAAGGGCACAGGGCCTTAGCATTGGCCATAATCAAGTACACATACCA-
AAACAAAGT
GGTAAAGGTCCTTAGACCAGCTGAAAAAGGGAAGACAGTTATGGACATTATTTCAAGACAAGACCAAAGGGGGA-
GCGGACA
AGTTGTCACTTACGCTCTTAACACATTTACCAACCTAGTGGTGCAACTCATTCGGAATATGGAGGCTGAGGAAG-
TTCTAGAGAT
GCAAGACTTGTGGCTGCTGCGGAGGTCAGAGAAAGTGACCAACTGGTTGCAGAGCAACGGATGGGATAGGCTCA-
AACGAAT
GGCAGTCAGTGGAGATGATTGCGTTGTGAAGCCAATTGATGATAGGTTTGCACATGCCCTCAGGTTCTTGAATG-
ATATGGGAA
AAGTTAGGAAGGACACACAAGAGTGGAAACCCTCAACTGGATGGGACAACTGGGAAGAAGTTCCGTTTTGTTCC-
CACCACTTC
AACAAGCTCCATCTCAAGGACGGGAGGTCCATTGTGGTTCCCTGCCGCCACCAAGATGAACTGATTGGCCGGGC-
CCGTGTCTC
TCCAGGGGCGGGATGGAGCATCCGGGAGACTGCTTGCCTAGCAAAGTCATATGCGCAAATGTGGCAGCTCCTTT-
ATTTCCACA
GAAGGGACCTCCGACTGATGGCCAATGCCATCTGTTCATCTGTGCCAGTTGACTGGGTTCCAACTGGGAGAACT-
ACCTGGTCA
ATCCATGGAAAGGGAGAATGGATGACCACTGAAGACATGCTTGTGGTGTGGAACAGAGTGTGGATTGAGGAGAA-
CGACCAC
ATGGAAGACAAGACCCCAGTTACGAAATGGACAGACATTCCCTATCTGGGAAAAAGGGAAGACTTGTGGTGTGG-
ATCTCTCA
TAGGGCACAGACCGCGCACCACCTGGGCTGAGAACATTAAAAACACAGTCAACATGGTGCGCAGGATCATAGGT-
GATGAAGA
AAAGTACATGGACTACCTATCCACCCAAGTTCGCTACTTGGGTGAAGAAGGGTCTACACCTGGAGTGCTATAAG-
CACCAATCTT
AGTGTTGTCAGGCCTGCTAGTCAGCCACAGCTTGGGGAAAGCTGTGCAGCCTGTGACCCCCCCAGGAGAGGCTG-
GGAAACCA
AGCCCATAGTCAGGCCGAGAACGCCATGGCACGGAAGAAGCCATGCTGCCTGTGAGCCCCTCAGAGGACACTGA-
GTCAAAAA
ACCCCACGCGCTTGGAGGCGCAGGATGGGAAAAGAAGGTGGCGACCTTCCCCACCCTTCAATCTGGGGCCTGAA-
CTGGAGAT
CAGCTGTGGATCTCCAGAAGAGGGACTAGTGGTTAGAGGAGACCCCCCGGAAAACGCAAAACAGCATATTGACG-
CTGGGAA
AGACCAGAGACTCCATGAGTTTCCACCACGCTGGCCGCCAGGCACAGATCGCCGAATAGCGGCGGCCGGTGTGG-
GGAAATCC ATGGGTCT KU681082.3 Zika virus isolate Zika virus/H.
sapiens-tc/PHL/2012/CPC-0740, Philippines, complete genome SEQ ID
NO: 8
AGTTGTTGATCTGTGTGAATCAGACTGCGACAGTTCGAGTTTGAAGCGAAAGCTAGCAACAGTATCAACAGGTT-
TTATTTTGG
ATTTGGAAACGAGAGTTTCTGGTCATGAAAAACCCAAAAAAGAAATCCGGAGGATTCCGGATTGTCAATATGCT-
AAAACGCGG
AGTAGCCCGTGTGAGCCCCTTTGGGGGCTTGAAGAGGCTGCCAGCCGGACTTCTGCTGGGCCATGGGCCCATCA-
GGATGGTC
TTGGCGATACTAGCCTTTTTGAGATTCACGGCAATCAAGCCATCACTGGGTCTCATCAATAGATGGGGTTCAGT-
GGGGAAAAA
AGAGGCTATGGAAATAATAAAGAAGTTCAAGAAAGATCTGGCTGCCATGCTGAGAATAATCAATGCTAGGAAGG-
AGAAGAA
GAGACGAGGCGCAGATACTAGCGTCGGAATTGTTGGCCTCCTCCTGACCACAGCCATGGCAGTAGAGGTCACTA-
GACGTGGG
AGTGCATACTATATGTACTTGGACAGAAGCGATGCTGGGGAGGCCATATCTTTTCCAACCACACTGGGGATGAA-
TAAGTGTTA
CATACAAATCATGGATCTTGGACACATGTGTGATGCCACCATGAGCTATGAATGCCCTATGTTGGATGAGGGGG-
TAGAACCAG
ATGACGTCGATTGCTGGTGCAACACGACATCAACTTGGGTTGTGTATGGAACCTGCCACCACAAAAAAGGTGAA-
GCACGGAG
ATCTAGAAGAGCTGTGACGCTCCCCTCCCATTCCACTAGGAAGCTGCAAACGCGGTCGCAGACCTGGTTGGAAT-
CAAGAGAAT
ACACAAAGCACCTGATTAGAGTTGAAAATTGGATATTCAGGAACCCTGGCTTCGCGTTAGCAGCAGCTGTCATC-
GCTTGGCTTT
TGGGAAGTTCAACGAGCCAAAAAGTCATATATCTGGTCATGATACTGCTGATTGCCCCGGCATACAGCATCAGG-
TGCATAGGA
GTCAGCAATAGGGACTTTGTGGAAGGTATGTCAGGTGGGACTTGGGTTGATGTTGTCTTGGAACATGGAGGTTG-
TGTTACCGT
AATGGCACAGGACAAACCGACTGTCGACATAGAGCTGGTTACAACAACAGTCAGCAACATGGCGGAGGTAAGAT-
CCTACTGC
TATGAGGCATCAATATCGGATATGGCTTCGGACAGCCGCTGCCCAACACAAGGTGAGGCCTACCTTGACAAGCA-
GTCAGACAC
TCAATATGTCTGCAAAAGAACGTTAGTGGACAGAGGCTGGGGAAATGGATGTGGACTTTTTGGCAAAGGGAGCC-
TGGTGACA
TGCGCTAAGTTTGCATGCTCCAAGAAAATGACCGGGAAGAGCATCCAGCCAGAGAATCTGGAGTACCGGATAAT-
GCTGTCAG
TTCATGGCTCCCAGCACAGTGGGATGATCGTTAATGACACAGGACATGAAACTGATGAGAATAGAGCGAAGGTT-
GAGATAAC
GCCCAATTCACCAAGAGCCGAAGCCACCCTGGGGGGTTTTGGGAGCCTAGGACTTGATTGTGAACCGAGGACAG-
GCCTTGAC
TTTTCAGATTTGTATTACCTGACTATGAATAACAAGCACTGGTTGGTTCACAAGGAGTGGTTCCACGACATTCC-
ATTACCTTGGC
ATGCTGGGGCAGACACTGGAACTCCACATTGGAACAACAAAGAAGCACTGGTAGAGTTCAAGGACGCACATGCA-
AAAAGGCA
AACTGTCGTGGTTCTAGGGAGTCAAGAAGGAGCAGTTCACACGGCCCTTGCTGGAGCTCTGGAGGCTGAGATGG-
ATGGAGCC
AAGGGAAGGCTGTCCTCTGGCCACTTGAAATGTCGCCTGAAAATGGATAAACTTAGATTGAAGGGCGTGTCATA-
CTCCTTGTG
CACTGCAGCGTTCACATTCACCAAGATCCCGGCTGAAACACTGCACGGGACAGTCACAGTGGAGGTACAGTACG-
CAGGGACA
GATGGACCTTGCAAGGTTCCAGCTCAGATGGCGGTGGATATGCAAACTCTGACCCCAGTTGGGAGGTTGATAAC-
CGCTAACCC
TGTAATCACTGAAAGCACCGAGAACTCTAAGATGATGCTGGAACTTGATCCACCATTTGGGGACTCTTACATTG-
TCATAGGAGT
CGGGGAGAAGAAGATCACCCATCACTGGCACAGGAGTGGCAGCACCATTGGAAAAGCATTTGAAGCCACTGTGA-
GAGGTGC
CAAGAGAATGGCAGTCTTGGGAGACACAGCCTGGGACTTTGGATCAGTTGGGGGTGCTCTCAACTCATTGGGCA-
AGGGCATC
CATCAAATTTTTGGAGCAGCTTTCAAATCATTGTTCGGAGGAATGTCCTGGTTCTCACAAATTCTCATTGGAAC-
GTTGCTGGTGT
GGTTGGGTCTGAATACAAAGAATGGATCTATTTCCCTTACGTGCTTGGCCTTAGGGGGAGTGTTGATCTTCTTA-
TCCACAGCCG
TTTCTGCTGATGTGGGGTGCTCGGTGGACTTCTCAAAGAAGGAAACGAGATGCGGTACAGGGGTGTTCGTCTAT-
AACGACGTT
GAAGCCTGGAGGGACAGGTACAAGTACCATCCTGACTCCCCTCGTAGATTGGCAGCAGCAGTCAAGCAAGCCTG-
GGAAGATG
GGATCTGTGGGATCTCCTCTGTCTCAAGAATGGAAAACATCATGTGGAGATCAGTAGAAGGGGAGCTCAACGCA-
ATCCTGGA
AGAGAATGGAGTTCAACTGACGGTCGTTGTGGGATCTGTAAAAAACCCCATGTGGAGAGGTCCACAGAGATTGC-
CCGTGCCT
GTGAACGAGCTGCCCCACGGCTGGAAGGCTTGGGGGAAATCGTACTTCGTCAGAGCAGCAAAGACAAATAACAG-
CTTTGTCG
TGGATGGTGACACACTGAAGGAATGCCCACTCAAACATAGAGCATGGAACAGCTTTCTTGTGGAGGATCATGGG-
TTTGGGGT
ATTTCACACTAGTGTCTGGCTCAAGGTTAGAGAAGATTATTCATTAGAGTGTGATCCAGCCGTCATTGGAACAG-
CTGCTAAGG
GAAAGGAGGCTGTGCACAGCGATCTAGGCTACTGGATTGAGAGTGAGAAGAACGACACATGGAGGCTGAAGAGG-
GCCCACC
TGATCGAGATGAAAACATGTGAATGGCCAAAGTCCCACACATTGTGGACAGATGGAGTAGAAGAAAGTGATCTG-
ATCATACC
CAAGTCTTTAGCTGGGCCACTCAGCCATCACAACACCAGAGAGGGCTACAGGACTCAAATGAAAGGGCCATGGC-
ACAGTGAA
GAGCTTGAAATTCGGTTTGAGGAATGCCCAGGCACTAAGGTCCACGTGGAGGAAACATGTGGGACAAGAGGACC-
ATCCCTGA
GATCAACCACTGCAAGCGGAAGGGTGATCGAGGAATGGTGCTGCAGGGAATGCACAATGCCCCCACTGTCGTTC-
CGAGCTAA
AGATGGCTGTTGGTATGGAATGGAGATAAGGCCCAGGAAAGAACCAGAAAGTAACTTAGTAAGGTCAATGGTGA-
CTGCAGG
ATCAACTGATCACATGGATCACTTCTCTCTTGGAGTGCTTGTGATTTTGCTCATGGTGCAGGAAGGGCTGAAGA-
AGAGAATGA
CCACAAAGATCATCATAAGCACATCAATGGCAGTGCTGGTAGCCATGATCCTGGGAGGATTTTCAATGAGTGAC-
CTGGCTAAG
CTTGCAATTTTGATGGGTGCCACCTTCGCGGAAATGAACACTGGAGGAGATGTAGCTCATTTGGCGCTGATAGC-
GGCATTCAA
AGTCAGACCTGCGTTGCTGGTATCTTTCATCTTCAGAGCTAATTGGACACCCCGTGAGAGCATGCTGCTGGCCT-
TGGCCTCGTG
TCTTCTGCAAACTGCGATCTCCGCCTTGGAAGGCGACCTGATGGTTCTCATCAATGGTTTTGCTTTGGCCTGGT-
TGGCAATACG
AGCGATGGTTGTTCCACGCACTGACAACATCACCTTGGCAATCCTGGCTGCTCTGACACCACTGGCCCGGGGCA-
CACTGCTTGT
GGCGTGGAGAGCAGGCCTTGCTACTTGCGGGGGGTTCATGCTCCTCTCTCTGAAGGGGAAAGGCAGTGTGAAGA-
AGAACCTA
CCATTTGTCATGGCCTTGGGACTAACTGCTGTGAGGCTGGTCGACCCCATCAACGTGGTGGGACTGCTGTTGCT-
CACAAGGAG
TGGGAAGCGGAGCTGGCCCCCTAGTGAAGTACTCACAGCTGTTGGCCTGATATGCGCATTGGCTGGAGGGTTCG-
CCAAGGCG
GATATAGAGATGGCTGGGCCCATGGCCGCGGTCGGTCTGCTAATTGTCAGTTACGTGGTCTCAGGAAAGAGTGT-
GGACATGT
ACATTGAAAGAGCAGGTGACATCACATGGGAAAAAGATGCGGAAATCACTGGAAACAGTCCCCGGCTCGATGTG-
GCACTAGA
TGAGAGTGGTGATTTCTCCCTAGTGGAGGATGATGGTCCACCCATGAGAGAGATCATACTCAAAGTGGTCCTGA-
TGACCATCT
GCGGCATGAACCCAATAGCCATACCCTTTGCAGCTGGAGCGTGGTACGTGTATGTGAAGACTGGAAAAAGGAGT-
GGTGCTCT
ATGGGATGTGCCTGCTCCCAAGGAAGTAAAAAAGGGGGAGACCACAGATGGAGTGTACAGAGTAATGACTCGTA-
GACTGCTT
GGTTCAACACAAGTTGGAGTGGGAGTCATGCAAGAGGGGGTCTTCCACACTATGTGGCACGTCACAAAAGGATC-
CGCGCTGA
GAAGCGGTGAAGGGAGACTTGATCCATACTGGGGAGATGTCAAGCAGGATCTGGTGTCATACTGTGGTCCGTGG-
AAGCTAGA
CGCCGCCTGGGACGGGCACAGCGAGGTGCAGCTCTTGGCCGTGCCCCCCGGAGAGAGAGCGAGGAACATCCAGA-
CTCTGCC
CGGAACATTTAAGACAAAGGATGGGGACATTGGAGCAGTTGCGCTGGACTACCCAGCAGGAACTTCAGGATCTC-
CAATCCTA
GACAAGTGTGGGAGAGTGATAGGACTCTATGGTAATGGGGTCGTGATAAAAAATGGGAGTTATGTTAGTGCCAT-
CACCCAAG
GGAGGAGGGAGGAAGAGACTCCTGTTGAGTGCTTCGAGCCTTCGATGCTGAAGAAGAAGCAGCTAACTGTCTTA-
GACCTGCA
TCCTGGAGCCGGGAAAACCAGGAGAGTTCTTCCTGAAATAGTCCGTGAAGCCATAAAAACAAGACTCCGTACTG-
TGATCTTAG
CTCCAACCAGGGTCGTCGCTGCTGAAATGGAGGAAGCCCTTAGAGGGCTTCCAGTTCGTTATATGACAACAGCA-
GTCAATGTC
ACCCATTCTGGGACAGAAATCGTTGACTTAATGTGCCATGCTACCTTCACTTCACGCCTACTACAACCAATCAG-
AGTCCCCAACT
ATAATTTGTATATTATGGATGAGGCCCACTTCACAGATCCCTCAAGTATAGCAGCAAGAGGATACATTTCAACA-
AGGGTTGAG
ATGGGCGAGGCGGCTGCCATCTTCATGACCGCCACGCCACCAGGAACCCGTGACGCATTCCCGGACTCCAACTC-
ACCAATTAT
GGACACCGAGGTGGAAGTCCCAGAGAGAGCCTGGAGCACAGGCTTTGATTGGGTGACGGATCATTCTGGGAAAA-
CAGTCTG
GTTTGTTCCAAGCGTGAGGAACGGCAATGAGATCGCAGCTTGTCTGACAAAGGCTGGAAAACGGGTCATACAGC-
TCAGCAGA
AAGACTTTTGAGACAGAGTTCCAGAAAACGAAAAATCAAGAGTGGGACTTCGTCGTGACAACCGACATTTCAGA-
GATGGGCG
CCAACTTTAAAGCTGACCGTGTCATAGATTCCAGGAGATGCTTAAAGCCGGTCATACTTGATGGCGAGAGAGTC-
ATTTTGGCT
GGACCCATGCCTGTCACACATGCCAGCGCTGCTCAGAGGAGGGGGCGCATAGGCAGGAATCCCAACAAACCTGG-
AGATGAGT
ATCTGTATGGAGGTGGGTGCGCAGAGACTGATGAAGATCACGCACACTGGCTTGAAGCAAGAATGCTTCTTGAC-
AACATTTAC
CTCCAAGATGGCCTCATAGCTTCGCTCTATCGACCTGAGGCCGACAAAGTAGCAGCTATTGAGGGAGAGTTCAA-
GCTTAGGAC
GGAGCAAAGGAAGACCTTTGTGGAACTCATGAAAAGAGGAGATCTTCCGGTTTGGTTGGCCTATCAGGTTGCAT-
CTGCCGGA
ATAACCTACACAGATAGAAGATGGTGCTTTGATGGCATGACCAACAACACCATAATGGAAGACAGTGTGCCGGC-
AGAGGTGT
GGACCAGATACGGAGAGAAAAGAGTGCTCAAACCGAGGTGGATGGACGCCAGAGTTTGTTCAGATCATGCGGCC-
CTGAAGT
CATTCAAAGAGTTTGCCGCTGGGAAAAGAGGAGCGGCCTTTGGAGTGATAGAAGCCCTGGGAACACTGCCAGGA-
CACATGAC
AGAGAGATTCCAGGAAGCCATTGACAACCTCGCTGTGCTCATGCGGGCAGAGACTGGAAGCAGGCCTTACAAAG-
CCGCGGCG
GCCCAATTGCCGGAGACCCTAGAGACCATTATGCTTTTGGGGTTGCTGGGAACAGTCTCGCTGGGAATCTTTTT-
CGTCTTGATG
CGGAACAAGGGCATGGGGAAGATGGGCTTTGGAATGGTGACTCTTGGGGCCAGCGCATGGCTTATGTGGCTCTC-
GGAAATTG
AGCCAGCCAGAATTGCATGTGTCCTCATTGTCGTGTTCCTATTGCTGGTGGTGCTCATACCTGAGCCAGAAAAG-
CAAAGATCTC
CTCAGGACAACCAAATGGCAATCATCATCATGGTAGCAGTGGGTCTTCTGGGCTTGATTACCGCCAATGAACTC-
GGATGGTTG
GAGAGAACAAAAAGTGACCTAAGCCATCTAATGGGAAGGAGAGAGGAGGGGGCAACCACAGGATTCTCAATGGA-
CATTGAC
CTGCGGCCAGCCTCAGCTTGGGCTATCTATGCTGCTCTGACAACTTTCATCACCCCAGCCGTCCAACATGCGGT-
GACCACTTCAT
ACAACAACTACTCCTTAATGGCGATGGCCACGCAAGCTGGGGTGTTGTTTGGTATGGGCAAAGGGATGCCATTC-
TACGCATGG
GACTTTGGAGTCCCGCTGCTAATGATGGGTTGCTACTCACAATTAACACCTCTGACCCTAATAGTGGCCATCAT-
TTTGCTCGTG
GCGCACTACATGTACTTGATCCCAGGGCTGCAGGCAGCAGCTGCGCGGGCTGCCCAGAAGAGAACGGCAGCTGG-
CATCATGA
AGAACCCTGTTGTGGATGGAATAGTGGTGACTGACATTGACACAATGACAATTGACCCCCAAGTGGAAAAAAAG-
ATGGGGCA
GGTGCTACTCATAGCAGTAGCCGTCTCCAGCGCCATACTGTCGCGGACCGCCTGGGGGTGGGGGGAGGCTGGGG-
CCCTGATC
ACAGCTGCAACTTCCACCTTGTGGGAAGGCTCTCCGAACAAGTACTGGAACTCCTCCACAGCCACTTCACTGTG-
TAACATTTTTA
GGGGAAGTTACTTGGCTGGAGCTTCTCTAATCTACACAGTAACAAGAAACGCTGGCTTGGTCAAGAGACGTGGG-
GGTGGAAC
GGGAGAGACCCTGGGAGAGAAATGGAAGGCCCGCCTGAACCAGATGTCGGCCCTGGAGTTCTACTCCTACAAAA-
AGTCAGGC
ATCACCGAGGTGTGCAGAGAAGAGGCCCGCCGTGCCCTCAAGGACGGTGTGGCAACAGGAGGCCATGCTGTGTC-
CCGAGGA
AGTGCAAAGCTTAGATGGCTGGTGGAGAGAGGATACCTGCAGCCCTATGGAAAGGTCATTGATCTTGGATGTGG-
CAGAGGG
GGCTGGAGTTACTATGCCGCCACCATCCGCAAAGTTCAGGAAGTGAAAGGATACACAAAAGGAGGCCCTGGTCA-
TGAAGAAC
CCATGTTGGTGCAAAGCTATGGGTGGAACATAGTCCGTCTTAAGAGTGGGGTGGACGTCTTTCACATGGCGGCT-
GAGCCGTG
TGACACTTTGCTGTGTGATATAGGTGAGTCATCATCTAGTCCTGAAGTGGAAGAAGCACGGACGCTCAGAGTCC-
TCTCCATGG
TGGGGGATTGGCTTGAAAAAAGACCAGGAGCCTTTTGTATAAAAGTGTTGTGCCCATACACCAGCACTATGATG-
GAAACCCTG
GAGCGACTGCAGCGTAGGTATGGGGGAGGACTGGTCAGGGTGCCACTCTCCCGCAACTCTACACATGAGATGTA-
CTGGGTCT
CTGGAGCGAAAAGCAACACCATAAAAAGTGTGTCCACCACGAGCCAGCTCCTCTTGGGGCGCATGGACGGGCCC-
AGGAGGCC
AGTGAAATATGAGGAGGATGTGAATCTCGGCTCTGGCACGCGGGCTGTGGTAAGCTGCGCTGAAGCTCCCAACA-
TGAAGATC
ATTGGTAACCGCATTGAGAGGATCCGCAGTGAGCACGCGGAAACGTGGTTCTTTGACGAGAACCACCCATATAG-
GACATGGG
CTTACCATGGAAGCTATGAGGCCCCTACACAAGGGTCAGCGTCCTCTCTAATAAACGGGGTTGTCAGGCTCCTG-
TCAAAACCCT
GGGATGTGGTGACTGGAGTCACAGGAATAGCCATGACTGACACCACACCGTATGGTCAGCAAAGAGTTTTCAAG-
GAAAAAGT
GGACACTAGGGTGCCAGACCCCCAAGAAGGCACTCGTCAGGTTATGAGCATGGTCTCTTCCTGGTTATGGAAGG-
AGCTAGGC
AAACACAAACGGCCACGAGTCTGTACCAAAGAAGAGTTCATCAACAAGGTTCGTAGCAATGCAGCATTAGGGGC-
AATATTTGA
AGAGGAAAAAGAGTGGAAGACTGCAGTGGAAGCTGTGAATGATCCAAGGTTCTGGGCTCTAGTGGACAAGGAAA-
GAGAGCA
TCACCTGAGAGGAGAGTGTCAGAGCTGTGTGTACAACATGATGGGAAAAAGAGAAAAGAAACAAGGGGAATTTG-
GAAAGGC
CAAGGGCAGCCGCGCCATCTGGTATATGTGGCTAGGGGCTAGATTCCTAGAGTTCGAAGCCCTTGGATTCTTGA-
ATGAGGATC
ATTGGATGGGGAGAGAGAATTCAGGAGGTGGTGTTGAAGGACTGGGATTACAAAGACTCGGATATGTCCTAGAA-
GAGATGA
GTCGCATACCAGGAGGAAGGATGTATGCAGATGATACTGCTGGCTGGGACACCCGCATCAGCAGGTTTGATCTG-
GAGAATGA
AGCTCTAATCACCAACCAAATGGAGAAAGGGCACAGGGCCTTGGCATTGGCCATAATCAAGTACACATACCAAA-
ACAAAGTG
GTAAAGGTCCTTAGACCAGCTGAAAAAGGGAAGACAGTTATGGACATTATTTCAAGACAAGACCAAAGGGGGAG-
CGGACAA
GTTGTCACTTACGCTCTTAATACATTCACCAACCTGGTGGTGCAGCTCATTCGGAATATGGAGGCTGAGGAAGT-
TCTAGAGATG
CAAGACTTGTGGCTGCTGCGGAGGCCAGAGAAAGTGACCAACTGGTTGCAAAGCAACGGATGGGATAGGCTCAA-
AAGAATG
GCAGTCAGTGGAGATGATTGCGTTGTGAAACCAATTGATGATAGGTTTGCACATGCCCTCAGGTTCTTGAATGA-
TATGGGAAA
AGTTAGGAAGGACACACAAGAGTGGAAACCCTCAACTGGATGGGACAACTGGGAAGAAGTTCCGTTTTGCTCCC-
ACCACTTCA
ACAAACTCCATCTTAAGGACGGGAGGTCCATTGTGGTTCCCTGCCGCCACCAAGATGAACTGATTGGCCGAGCC-
CGCGTATCA
CCAGGGGCGGGATGGAGCATCCGGGAGACTGCTTGCCTAGCAAAATCATATGCGCAAATGTGGCAGCTCCTTTA-
TTTCCACAG
AAGGGACCTCCGACTGATGGCCAATGCCATTTGTTCATCTGTGCCAGTTGATTGGGTTCCAACTGGGAGAACTA-
CCTGGTCAAT
CCATGGAAAGGGAGAATGGATGACCACTGAAGACATGCTTGTGGTATGGAACAGAGTGTGGATTGAGGAAAACG-
ACCACAT
GGAAGACAAGACCCCAGTTACAAAATGGACAGACATTCCCTATTTGGGAAAAAGAGAAGACTTGTGGTGTGGAT-
CTCTCATAG
GGCACAGACCGCGTACTACCTGGGCTGAGAACATCAAAAATACAGTCAACATGATGCGCAGGATCATAGGTGAT-
GAAGAAAA
GTACATGGACTACCTATCCACCCAGGTTCGCTACTTGGGTGAAGAAGGGTCCACACCTGGAGTGCTGTAAGCAC-
CAATCTTAG
TGTTGTCAGGCCTGCTAGTCAGCCACAGCTTGGGGAAAGCTGTGCAGCCTGTGACCCCCCCAGGAGAAGCTGGG-
AAACCAAG
CCTATAGTCAGGCCGAGAACGCCATGGCACGGAAGAAGCCATGCTGCCTGTGAGCCCCTCAGAGGACACTGAGT-
CAAAAAAC
CCCACGCGCTTGGAGGCGCAGGATGGGAAAAGAAGGTGGCGACCTTCCCCACCCTTCAATCTGGGGCCTGAACT-
GGAGATCA
GCTGTGGATCTCCAGAAGAGGGACTAGTGGTTAGAGGAGACCCCCCGGAAAACGCAAAACAGCATATTGACGCT-
GGGAAAG
ACCAGAGACTCCATGAGTTTCCACCACGCTGGCCGCCAGGCACAGATCGCCGAATAGCGGCGGCCGGTGTGGGG-
AAATCCAT GGGTCT KU707826.1 Zika virus isolate SSABR1, Brazil,
complete genome SEQ ID NO: 9
GACAGTTCGAGTTTGAAGCGAAAGCTAGCAACAGTATCAACAGGTTTTATTTGGATTTGGAAACGAGAGTTTCT-
GGTCATGAA
AAACCCAAAAAAGAAATCCGGAGGATTCCGGATTGTCAATATGCTAAAACGCGGAGTAGCCCGTGTGAGCCCCT-
TTGGGGGC
TTGAAGAGGCTGCCAGCCGGACTTCTGCTGGGTCATGGGCCCATCAGGATGGTCTTGGCGATTCTAGCCTTTTT-
GAGATTCAC
GGCAATCAAGCCATCACTGGGTCTCATCAATAGATGGGGTTCAGTGGGGAAAAAAGAGGCTATGGAAATAATAA-
AGAAGTTC
AAGAAAGATCTGGCTGCCATGCTGAGAATAATCAATGCTAGGAAGGAGAAGAAGAGACGAGGCGCAGATACTAG-
TGTCGGA
ATTGTTGGCCTCCTGCTGACCACAGCTATGGCAGCGGAGGTCACTAGACGTGGGAGTGCATACTATATGTACTT-
GGACAGAAA
CGATGCTGGGGAGGCCATATCTTTTCCAACCACATTGGGGATGAATAAGTGTTATATACAGATCATGGATCTTG-
GACACATGT
GTGATGCCACCATGAGCTATGAATGCCCTATGCTGGATGAGGGGGTGGAACCAGATGACGTCGATTGTTGGTGC-
AACACGAC
GTCAACTTGGGTTGTGTACGGAACCTGCCATCACAAAAAAGGTGAAGCACGGAGATCTAGAAGAGCTGTGACGC-
TCCCCTCCC
ATTCCACTAGGAAGCTGCAAACGCGGTCGCAAACCTGGTTGGAATCAAGAGAATACACAAAGCACTTGATTAGA-
GTCGAAAAT
TGGATATTCAGGAACCCTGGCTTCGCGTTAGCAGCAGCTGCCATCGCTTGGCTTTTGGGAAGCTCAACGAGCCA-
AAAAGTCAT
ATACTTGGTCATGATACTGCTGATTGCCCCGGCATACAGCATCAGGTGCATAGGAGTCAGCAATAGGGACTTTG-
TGGAAGGTA
TGTCAGGTGGGACCTGGGTTGATGTTGTCTTGGAACATGGAGGTTGTGTCACCGTAATGGCACAGGACAAACCG-
ACTGTCGA
CATAGAGCTGGTTACAACAACAGTCAGCAACATGGCGGAGGTAAGATCCTACTGCTATGAGGCATCAATATCAG-
ACATGGCTT
CGGACAGCCGCTGCCCAACACAAGGTGAAGCCTACCTTGACAAGCAATCAGACACTCAATATGTCTGCAAAAGA-
ACGTTAGTG
GACAGAGGCTGGGGAAATGGATGTGGACTTTTTGGCAAAGGGAGCCTGGTGACATGCGCTAAGTTTGCATGCTC-
CAAGAAAA
TGACCGGGAAGAGCATCCAGCCAGAGAATCTGGAGTACCGGATAATGCTGTCAGTTCATGGCTCCCAGCACAGT-
GGGATGAT
TGTTAATGACACAGGACATGAAACTGATGAGAATAGAGCGAAAGTTGAGATAACGCCCAATTCACCAAGAGCCG-
AAGCCACC
CTGGGGGGTTTTGGAAGCCTAGGACTTGATTGTGAACCGAGGACAGGCCTTGACTTTTCAGATTTGTATTACTT-
GACTATGAAT
AACAAGCACTGGTTGGTTCACAAGGAGTGGTTCCACGACATTCCATTACCTTGGCACGCTGGGGCAGACACCGG-
AACTCCACA
CTGGAACAACAAAGAAGCACTGGTAGAGTTCAAGGACGCACATGCCAAAAGGCAAACTGTCGTGGTTCTAGGGA-
GTCAAGAA
GGAGCAGTTCACACGGCCCTTGCTGGAGCTCTGGAGGCTGAGATGGATGGTGCAAAGGGAAGGCTGTCCTCTGG-
CCACTTGA
AATGTCGCCTGAAAATGGATAAACTTAGATTGAAGGGCGTGTCATACTCCTTGTGTACTGCAGCGTTCACATTC-
ACCAAGATCC
CGGCTGAAACACTGCACGGGACAGTCACAGTGGAGGTACAGTACGCAGGGACAGATGGACCTTGCAAGGTTCCA-
GCTCAGAT
GGCGGTGGACATGCAAACTCTGACCCCAGTTGGGAGGTTGATAACCGCTAACCCCGTAATCACTGAAAGCACTG-
AGAACTCTA
AGATGATGCTGGAACTTGATCCACCATTTGGGGACTCTTACATTGTCATAGGAGTCGGGGAGAAGAAGATCACC-
CACCACTGG
CACAGGAGTGGCAGCACCATTGGAAAAGCATTTGAAGCCACTGTGAGAGGTGCCAAGAGAATGGCAGTCTTGGG-
AGACACA
GCCTGGGACTTTGGATCAGTTGGAGGCGCTCTCAACTCATTGGGCAAGGGCATCCATCAAATTTTTGGAGCAGC-
TTTCAAATCA
TTGTTTGGAGGAATGTCCTGGTTCTCACAAATTCTCATTGGAACGTTGCTGATGTGGTTGGGTCTGAACACAAA-
GAATGGATCT
ATTTCCCTTATGTGCTTGGCCTTAGGGGGAGTGTTGATCTTCTTATCCACAGCCGTCTCTGCTGATGTGGGGTG-
CTCGGTGGAC
TTCTCAAAGAAGGAGACGAGATGCGGTACAGGGGTGTTCGTCTATAACGACGTTGAAGCCTGGAGGGACAGGTA-
CAAGTACC
ATCCTGACTCCCCCCGTAGATTGGCAGCAGCAGTCAAGCAAGCCTGGGAAGATGGTATCTGCGGGATCTCCTCT-
GTTTCAAGA
ATGGAAAACATCATGTGGAGATCAGTAGAAGGGGAGCTCAACGCAATCCTGGAAGAGAATGGAGTTCAACTGAC-
GGTCGTTG
TGGGATCTGTAAAAAACCCCATGTGGAGAGGTCCACAGAGATTGCCCGTGCCTGTGAACGAGCTGCCCCACGGC-
TGGAAGGC
TTGGGGGAAATCGTACTTCGTCAGAGCAGCAAAGACAAATAACAGCTTTGTCGTGGATGGTGACACACTGAAGG-
AATGCCCA
CTCAAACATAGAGCATGGAACAGCTTTCTTGTGGAGGATCATGGGTTCGGGGTATTTCACACTAGTGTCTGGCT-
CAAGGTTAG
AGAAGATTATTCATTAGAGTGTGATCCAGCCGTTATTGGAACAGCTGTTAAGGGAAAGGAGGCTGTACACAGTG-
ATCTAGGCT
ACTGGATTGAGAGTGAGAAGAATGACACATGGAGGCTGAAGAGGGCCCATCTGATCGAGATGAAAACATGTGAA-
TGGCCAA
AGTCCCACACATTGTGGACAGATGGAATAGAAGAGAGTGATCTGATCATACCCAAGTCTTTAGCTGGGCCACTC-
AGCCATCAC
AATACCAGAGAGGGCTACAGGACCCAAATGAAAGGGCCATGGCACAGTGAAGAGCTTGAAATTCGGTTTGAGGA-
ATGCCCA
GGCACTAAGGTCCACGTGGAGGAAACATGTGGAACAAGAGGACCATCTCTGAGATCAACCACTGCAAGCGGAAG-
GGTGATC
GAGGAATGGTGCTGCAGGGAGTGCACAATGCCCCCACTGTCGTTCCGGGCTAAAGATGGCTGTTGGTATGGAAT-
GGAGATAA
GGCCCAGGAAAGAACCAGAAAGCAACTTAGTAAGGTCAATGGTGACTGCAGGATCAACTGATCACATGGACCAC-
TTCTCCCTT
GGAGTGCTTGTGATTCTGCTCATGGTGCAGGAAGGGCTGAAGAAGAGAATGACCACAAAGATCATCATAAGCAC-
ATCAATGG
CAGTGCTGGTAGCTATGATCCTGGGAGGATTTTCAATGAGTGACCTGGCTAAGCTTGCAATTTTGATGGGTGCC-
ACCTTCGCG
GAAATGAACACTGGAGGAGATGTAGCTCATCTGGCGCTGATAGCGGCATTCAAAGTCAGACCAGCGTTGCTGGT-
ATCTTTCAT
CTTCAGAGCTAATTGGACACCCCGTGAAAGCATGCTGCTGGCCTTGGCCTCGTGTCTTTTGCAAACTGCGATCT-
CCGCCTTGGA
AGGCGACCTGATGGTTCTCATCAATGGTTTTGCTTTGGCCTGGTTGGCAATACGAGCGATGGTTGTTCCACGCA-
CTGATAACAT
CACCTTGGCAATCCTGGCTGCTCTGACACCACTGGCCCGGGGCACACTGCTTGTGGCGTGGAGAGCAGGCCTTG-
CTACTTGCG
GGGGGTTTATGCTCCTCTCTCTGAAGGGAAAAGGCAGTGTGAAGAAGAACTTACCATTTGTCATGGCCCTGGGA-
CTAACCGCT
GTGAGGCTGGTCGACCCCATCAACGTGGTGGGACTGCTGTTGCTCACAAGGAGTGGGAAGCGGAGCTGGCCCCC-
TAGCGAA
GTACTCACAGCTGTTGGCCTGATATGCGCATTGGCTGGAGGGTTCGCCAAGGCAGATATAGAGATGGCTGGGCC-
CATGGCCG
CGGTCGGTCTGCTAATTGTCAGTTACGTGGTCTCAGGAAAGAGTGTGGACATGTACATTGAAAGAGCAGGTGAC-
ATCACATG
GGAAAAAGATGCGGAAGTCACTGGAAACAGTCCCCGGCTCGATGTGGCGCTAGATGAGAGTGGTGATTTCTCCC-
TGGTGGAG
GATGACGGTCCCCCCATGAGAGAGATCATACTCAAGGTGGTCCTGATGACCATCTGTGGCATGAACCCAATAGC-
CATACCCTTT
GCAGCTGGAGCGTGGTACGTATACGTGAAGACTGGAAAAAGGAGTGGTGCTCTATGGGATGTGCCTGCTCCCAA-
GGAAGTAA
AAAAGGGGGAGACCACAGATGGAGTGTACAGAGTAATGACTCGTAGACTGCTAGGTTCAACACAAGTTGGAGTG-
GGAGTTA
TGCAAGAGGGGGTCTTTCACACTATGTGGCACGTCACAAAAGGATCCGCGCTGAGAAGCGGTGAAGGGAGACTT-
GATCCATA
CTGGGGAGATGTCAAGCAGGATCTGGTGTCATACTGTGGTCCATGGAAGCTAGATGCCGCCTGGGACGGGCACA-
GCGAGGT
GCAGCTCTTGGCCGTGCCCCCCGGAGAGAGAGCGAGGAACATCCAGACTCTGCCCGGAATATTTAAGACAAAGG-
ATGGGGAC
ATTGGAGCGGTTGCGCTGGATTACCCAGCAGGAACTTCAGGATCTCCAATCCTAGACAAGTGTGGGAGAGTGAT-
AGGACTTTA
TGGCAATGGGGTCGTGATCAAAAATGGGAGTTATGTTAGTGCCATCACCCAAGGGAGGAGGGAGGAAGAGACTC-
CTGTTGA
GTGCTTCGAGCCTTCGATGCTGAAGAAGAAGCAGCTAACTGTCTTAGACTTGCATCCTGGAGCTGGGAAAACCA-
GGAGAGTTC
TTCCTGAAATAGTCCGTGAAGCCATAAAAACAAGACTCCGTACTGTGATCTTAGCTCCAACCAGGGTTGTCGCT-
GCTGAAATGG
AGGAGGCCCTTAGAGGGCTTCCAGTGCGTTATATGACAACAGCAGTCAATGTCACCCACTCTGGAACAGAAATC-
GTCGACTTA
ATGTGCCATGCCACCTTCACTTCACGTCTACTACAGCCAATCAGAGTCCCCAACTATAATCTGTATATTATGGA-
TGAGGCCCACT
TCACAGATCCCTCAAGTATAGCAGCAAGAGGATACATTTCAACAAGGGTTGAGATGGGCGAGGCGGCTGCCATC-
TTCATGACC
GCCACGCCACCAGGAACCCGTGACGCATTTCCGGACTCCAACTCACCAATTATGGACACCGAAGTGGAAGTCCC-
AGAGAGAGC
CTGGAGCTCAGGCTTTGATTGGGTGACGGATCATTCTGGAAAAACAGTTTGGTTTGTTCCAAGCGTGAGGAACG-
GCAATGAG
ATCGCAGCTTGTCTGACAAAGGCTGGAAAACGGGTCATACAGCTCAGCAGAAAGACTTTTGAGACAGAGTTCCA-
GAAAACAA
AACATCAAGAGTGGGACTTTGTCGTGACAACTGACATTTCAGAGATGGGCGCCAACTTTAAAGCTGACCGTGTC-
ATAGATTCC
AGGAGATGCCTAAAGCCGGTCATACTTGATGGCGAGAGAGTCATTCTGGCTGGACCCATGCCTGTCACACATGC-
CAGCGCTGC
CCAGAGGAGGGGGCGCATAGGCAGGAATCCCAACAAACCTGGAGATGAGTATCTGTATGGAGGTGGGTGCGCAG-
AGACTGA
CGAAGACCATGCACACTGGCTTGAAGCAAGAATGCTCCTTGACAATATTTACCTCCAAGATGGCCTCATAGCCT-
CGCTCTATCG
ACCTGAGGCCGACAAAGTAGCAGCCATTGAGGGAGAGTTCAAGCTTAGGACGGAGCAAAGGAAGACCTTTGTGG-
AACTCATG
AAAAGAGGAGATCTTCCTGTTTGGCTGGCCTATCAGGTTGCATCTGCCGGAATAACCTACACAGATAGAAGATG-
GTGCTTTGA
TGGCACGACCAACAACACCATAATGGAAGACAGTGTGCCGGCAGAGGTGTGGACCAGACACGGAGAGAAAAGAG-
TGCTCAA
ACCGAGGTGGATGGACGCCAGAGTTTGTTCAGATCATGCGGCCCTGAAGTCATTCAAGGAGTTTGCCGCTGGGA-
AAAGAGGA
GCGGCTTTTGGAGTGATGGAAGCCCTGGGAACACTGCCAGGACACATGACAGAGAGATTCCAGGAAGCCATTGA-
CAACCTCG
CTGTGCTCATGCGGGCAGAGACTGGAAGCAGGCCTTACAAAGCCGCGGCGGCCCAATTGCCGGAGACCCTAGAG-
ACCATTAT
GCTTTTGGGGTTGCTGGGAACAGTCTCGCTGGGAATCTTCTTCGTCTTGATGAGGAACAAGGGCATAGGGAAGA-
TGGGCTTT
GGAATGGTGACTCTTGGGGCCAGCGCATGGCTCATGTGGCTCTCGGAAATTGAGCCAGCCAGAATTGCATGTGT-
CCTCATTGT
TGTGTTTCTATTGCTGGTGGTGCTCATACCTGAGCCAGAAAAGCAAAGATCTCCCCAGGACAACCAAATGGCAA-
TCATCATCAT
GGTAGCAGTAGGTCTTCTGGGCTTGATTACCGCCAATGAACTCGGATGGTTGGAGAGAACAAAGAGTGACCTAA-
GCCATCTA
ATGGGAAGGAGAGAGGAGGGGGCAACCATAGGATTCTCAATGGACATTGACCTGCGGCCAGCCTCAGCTTGGGC-
CATCTATG
CTGCCTTGACAACTTTCATTACCCCAGCCGTCCAACATGCAGTGACCACTTCATACAACAACTACTCCTTAATG-
GCGATGGCCAC
GCAAGCTGGAGTGTTGTTTGGTATGGGCAAAGGGATGCCATTCTACGCATGGGACTTTGGAGTCCCGCTGCTAA-
TGATAGGTT
GCTACTCACAATTAACACCCCTGACCCTAATAGTGGCCATCATTTTGCTCGTGGCGCACTACATGTACTTGATC-
CCAGGGCTGCA
GGCAGCAGCTGCGCGTGCTGCCCAGAAGAGAACGGCAGCTGGCATCATGAAGAACCCTGTTGTGGATGGAATAG-
TGGTGACT
GACATTGACACAATGACAATTGACCCCCAAGTGGAGAAAAAGATGGGACAGGTGCTACTCATAGCAGTAGCCGT-
CTCCAGCG
CCATACTGTCGCGGACCGCCTGGGGGTGGGGGGAGGCTGGGGCCCTGATCACAGCCGCAACTTCCACTTTGTGG-
GAAGGCTC
TCCGAACAAGTACTGGAACTCCTCTACAGCCACTTCACTGTGTAACATTTTTAGGGGAAGTTACTTGGCTGGAG-
CTTCTCTAATC
TACACAGTAACAAGAAACGCTGGCTTGGTCAAGAGACGTGGGGGTGGAACAGGAGAGACCCTGGGAGAGAAATG-
GAAGGC
CCGCTTGAACCAGATGTCGGCCCTGGAGTTCTACTCCTACAAAAAGTCAGGCATCACCGAGGTGTGCAGAGAAG-
AGGCCCGC
CGCGCCCTCAAGGACGGTGTGGCAACGGGAGGCCATGCTGTGTCCCGAGGAAGTGCAAAGCTGAGATGGTTGGT-
GGAGCGG
GGATACCTGCAGCCCTATGGAAAGGTCATTGATCTTGGATGTGGCAGAGGGGGCTGGAGTTACTACGCCGCCAC-
CATCCGCA
AAGTTCAAGAAGTGAAAGGATACACAAAAGGAGGCCCTGGTCATGAAGAACCCGTGTTGGTGCAAAGCTATGGG-
TGGAACAT
AGTCCGTCTTAAGAGTGGGGTGGACGTCTTTCATATGGCGGCTGAGCCGTGTGACACGTTGCTGTGTGACATAG-
GTGAGTCAT
CATCTAGTCCTGAAGTGGAAGAAGCACGGACGCTCAGAGTCCTCTCCATGGTGGGGGATTGGCTTGAAAAAAGA-
CCAGGAGC
CTTTTGTATAAAGGTGTTGTGCCCATACACCAGCACTATGATGGAAACCCTGGAGCGACTGCAGCGTAGGTATG-
GGGGAGGA
CTGGTCAGAGTGCCACTCTCCCGCAACTCTACACATGAGATGTATTGGGTCTCTGGAGCGAAAAGCAACACCAT-
AAAAAGTGT
GTCCACCACGAGCCAGCTCCTCTTGGGGCGCATGGACGGGCCTAGGAGGCCAGTGAAATATGAGGAGGATGTGA-
ATCTCGGC
TCTGGCACGCGGGCTGTGGTAAGCTGCGCTGAAGCTCCCAACATGAAGATCATTGGTAACCGCATTGAAAGGAT-
CCGCAGTG
AGCACGCGGAAACGTGGTTCTTTGACGAGAACCACCCATATAGGACATGGGCTTACCATGGAAGCTATGAGGCC-
CCCACACAA
GGGTCAGCGTCCTCTCTAATAAACGGGGTTGTCAGGCTCCTGTCAAAACCCTGGGATGTGGTGACTGGAGTCAC-
AGGAATAGC
CATGACCGACACCACACCGTATGGTCAGCAAAGAGTTTTCAAGGAAAAAGTGGACACTAGGGTGCCAGACCCCC-
AAGAAGGC
ACTCGTCAGGTTATGAGCATGGTCTCTTCCTGGTTGTGGAAAGAGCTAGGCAAACACAAACGGCCACGAGTCTG-
TACCAAAGA
AGAGTTCATCAACAAGGTTCGTAGCAATGCAGCATTAGGGGCAATATTTGAAGAGGAAAAAGAGTGGAAGACTG-
CAGTGGA
AGCTGTGAACGATCCAAGGTTCTGGGCTCTAGTGGATAAGGAAAGAGAGCACCACCTGAGAGGAGAGTGCCAGA-
GTTGTGT
GTACAACATGATGGGAAAAAGAGAAAAGAAACAAGGGGAATTTGGAAAGGCCAAGGGCAGCCGCGCCATCTGGT-
ATATGTG
GCTAGGGGCTAGATTTCTAGAGTTCGAAGCCCTTGGATTCTTGAACGAGGATCACTGGATGGGGAGAGAGAACT-
CAGGAGGT
GGTGTTGAAGGGCTGGGATTACAAAGACTCGGATATGTCCTAGAAGAGATGAGTCGTATACCAGGAGGAAGGAT-
GTATGCA
GATGACACTGCTGGCTGGGACACCCGCATCAGCAGGTTTGATCTGGAGAATGAAGCTCTAATCACCAACCAAAT-
GGAAAAAG
GGCACAGGGCCTTGGCATTGGCCATAATCAAGTACACATACCAAAACAAAGTGGTAAAGGTCCTTAGACCAGCT-
GAAAAAGG
GAAAACAGTTATGGACATTATTTCGAGACAAGACCAAAGGGGGAGCGGACAAGTTGTCACTTACGCTCTTAACA-
CATTTACCA
ACCTAGTGGTGCAACTCATTCGGAATATGGAGGCTGAGGAAGTTCTAGAGATGCAAGACTTGTGGCTGCTGCGG-
AGGTCAGA
GAAAGTGACCAACTGGTTGCAGAGCAACGGATGGGATAGGCTCAAACGAATGGCAGTCAGTGGAGATGATTGCG-
TTGTGAA
GCCAATTGATGATAGGTTTGCACATGCCCTCAGGTTCTTGAATGATATGGGAAAAGTTAGGAAGGACACACAAG-
AGTGGAAA
CCCTCAACTGGATGGGACAACTGGGAAGAAGTTCCGTTTTGCTCCCACCACTTCAACAAGCTCCATCTCAAGGA-
CGGGAGGTC
CATTGTGGTTCCCTGCCGCCACCAAGATGAACTGATTGGCCGGGCCCGCGTCTCTCCAGGGGCGGGATGGAGCA-
TCCGGGAG
ACTGCTTGCCTAGCAAAATCATATGCGCAAATGTGGCAGCTCCTTTATTTCCACAGAAGGGACCTCCGACTGAT-
GGCCAATGCC
ATTTGTTCATCTGTGCCAGTTGACTGGGTTCCAACTGGGAGAACTACCTGGTCAATCCATGGAAAGGGAGAATG-
GATGACCAC
TGAAGACATGCTTGTGGTGTGGAACAGAGTGTGGATTGAGGAGAACGACCACATGGAAGACAAGACCCCAGTTA-
CGAAATG
GACAGACATCCCCTATTTGGGAAAAAGGGAAGACTTGTGGTGTGGATCTCTCATAGGGCACAGACCGCGCACCA-
CCTGGGCT
GAGAACATTAAAAACACAGTCAACATGGTGCGCAGGATCATAGGTGATGAAGAAAAGTACATGGACTACCTATC-
CACCCAAG
TTCGCTACTTGGGTGAAGAAGGGTCTACACCTGGAGTGCTGTAAGCACCAGTCTTAATGTTGTCAGGCCTGCTA-
GTCAGCCAC
AGCTTGGGGAAAGCTGTGCAGCCTGTGACCCCCCCAGGAGAAGCTGGGAAACCAAGCCTATAGTCAGGCCGAGA-
ACGCCATG
GCACGGAAGAAGCCATGCTGCCTGTGAGCCCCTCAGAGGACACTGAGTCAAAAAACCCCACGCGCTTGGAGGCG-
CAGGATG
GGAAAAGAAGGTGGCGACCTTCCCCACCCTTCAATCTGGGGCCTGAACTGGAGATCAGCTGTGGATCTCCAGAA-
GAGGGACT AGTGGTTAGAGGAG KU744693.1 Zika virus isolate VE_Ganxian,
China, complete genome SEQ ID NO: 10
GTTGTTACTGTTGCTGACTCAGACTGCGACAGTTCGAGTTTGAAGCGAAAGCTAGCAACAGTATCAACAGGTTT-
TATTTGGATT
TGGAAACGAGAGTTTCTGGTCATGAAAAACCCAAAAAAGAAATCCGGAGGATTCCGGATTGTCAATATGCTAAA-
ACGCGGAG
TAGCCCGTGTGAGCCCCTTTGGGGGCTTGAAGAGGCTGCCAGCCGGACTTCTGCTGGGTCATGGGCCCATCAGG-
ATGGTCTTG
GCAATTCTAGCCTTTTTGAGATTCACGGCAATCAAGCCATCACTGGGTCTCATCAATAGATGGGGTTCAGTGGG-
GAAAAAAGA
TGCTATGGAAATAATAAAGAAGTTCAAGAAAGATCTGGCTGCCATGCTGAGAATAATCAATGCTAGGAAGGAGA-
AGAAGAGA
CGAGGCGCAGATACTAGTGTCGGAATTGTTGGCCTCCTGCTGACCACAGCTATGGCAGCGGAGGTCACTAGACG-
TGGGAGTG
CATACTATATGTACTTGGACAGAAACGATGCTGGGGAGGCCATATCTTTTCCAACCACATTGGGGATGAATAAG-
TGTTATATAC
AGATCATGGATCTTGGACACATGTGTGATGCCACCATGAGCTATGAATGCCCTATGCTGGATGAGGGGGTGGAA-
CCAGATGA
CGTCGATTGTTGGTGCAACACGACGTCAACTTGGGTTGTGTACGGAACCTGCCATCACAAAAAAGGTGAAGCAC-
GGAGATCTA
GAAGAGCTGTGACGCTCCCTTCCCATTCCACTAGGAAGCTGCAAACGCGGTCGCAAACCTGGTTGGAATCAAGA-
GAATACACA
AAGCACTTGATTAGAGTCGAAAATTGGATATTCAGGAACCCTGGCTTCGCGTTAGCAGCAGCTGCCATCGCTTG-
GCTTTTGGG
AAGCTCAACGAGCCAAAAAGTCATATACTTGGTCATGATACTGCTGATTGCCCCGGCATACAGCATCAGGTGCA-
TAGGAGTCA
GCAATAGGGACTTTGTGGAAGGTATGTCAGGTGGGACTTGGGTTGATGTTGTCTTGGAACATGGAGGTTGTGTC-
ACCGCAAT
GGCACAGGACAAACCGACTGTCGACATAGAGCTGGTTACAACAACAGTCAGCAACATGGCGGAGGTAAGATCCT-
ACTGCTAT
GAGGCATCAATATCAGACATGGCTTCGGACAGCCGCTGCCCAACACAAGGTGAAGCCTACCTTGACAAGCAATC-
AGACACTCA
ATATGTTTGCAAAAGAACGTTAGTGGACAGAGGCTGGGGAAATGGATGTGGACTTTTTGGCAAAGGGAGTCTGG-
TGACATGC
GCTAAGTTTGCATGCTCCAAGAAAATGACCGGGAAGAGCATCCAGCCAGAGAATCTGGAGTACCGGATAATGCT-
GTCAGTTC
ATGGCTCCCAGCACAGTGGGATGCTCGTTAATGACACAGGACATGAAACTGATGAGAATAGAGCGAAGGTTGAG-
ATAACGCC
CAATTCACCAAGAGCCGAAGCCACCCTGGGGGGTTTTGGAAGCCTAGGACTTGATTGTGAACCGAGGACAGGCC-
TTGACTTTT
CAGATTTGTATTACTTGACTATGAATAACAAGCACTGGTTGGCTCACAAGGAGTGGTTCCACGACATTCCATTA-
CCTTGGCACG
CTGGGGCAGCCACCGGAACTCCACACTGGAACAACAAAGAAGCACTGGTAGAGTTCAAGGACGCACATGCCAAA-
AGGCAAAC
TGTCGTGGTTCTAGGGAGTCAAGAAGGAGCAGTTCACACGGCCCTTGCTGGAGCTCTGGAGGCTGAGATGGATG-
GTGCAAAG
GGAAGGCTGTCCTCTGGCCACTTGAAATGTCGCCTGAAAATGGATAAACTTAGATTGAAGGGCGTGTCATACTC-
CTTGTGTAC
CGCAGCGTTCACATTCACCAAGATCCCGGCTGAAACAGTGGACGGGACAGTCACAGTGGAGGGACAGTACGGAG-
GGACAGA
TGGACCTTGCAAGGTTCCAGCTCAGATGGCGGTGGACATGCAGACTCTGACCCCAGTTGGGAGGTTGATAACCG-
CTAACCCCG
TAATCACTGAAAGCACTGAGAACTCTAAGATGATGCTGGAACTTGATCCACCATTTGGGGACTCTTACATTGTC-
ATAGGAGTCG
GGGAGAAGAAGATCACCCACCACTGGCACAGGAGTGGCAGCACCATTGGAAAAGCATTTGAAGCCACTGTGAGA-
GGTGCCA
AGAGAATGGCAGTCTTGGGAGACACAGCCTGGGACTTTGGATCAGTTGGAGGCGCTCTCAACTCATTGGGCAAG-
GGCATCCA
TCAAATTATTGGAGCAGCTTTCAAATCATTGTTTGGAGGAATGTCCTGGTTCTCACAAATTCTCATTGGGACGT-
TGCTGATGTG
GTTGGGTCTGAACACAAAGAATGGATCTATTTCCCTTATGTGCTTGGCCTTAGGGGGAGTGTTGATCTTCTTAT-
CCACAGCCGT
CTCAGGTGGTGTGGGGTGCTCGGTGGACTTCTCAAAGAAGGAGACGAGATGCGGTACAGGGGTGTTCGTCTATA-
ACGATGTT
GAAGCCTGGAGGGACAGGTACAAGTACCATCCTGACTCCCCCCGTAGATTGGCAGCAGCAGTCAAGCAAGCCTG-
GGAAGATG
GTATCTGCGGGATCTCCTCTGTTTCAAGAATGGAAAACATCATGTGGAGATCAGTAGAAGGGGAGCTCAACGCA-
ATCCTGGAA
GAGAATGGAGTTCAACTGACGGTCGTTGTGGGATCTGTAAAAAACCCCATGTGGAGAGGTCCACAGAGATTGCC-
CGTGCCTG
TGAACGAGCTGCCCCACGGCTGGAAGGCTTGGGGGAAATCGTACTTCGTCAGAGCAGCAAAGACAAATAACAGC-
TTTGTCGT
GGATGGTGACACACTGAAGGAATGCCCACTCAAACATAGAGCATGGAACAGCTTTCTTGTGGAGGATCATGGGT-
TCGGGGTA
TTTCACACTAGTGTCTGGCTCAAGGTTAGAGAAGACTATTGGTTAGAGTGTGATCCAGCCGTTATTGGAACAGC-
TGTTAAGGG
AAAGGAGGCTGTACACAGTGATCTAGGCTACTGGATTGAGAGTGAGAAGAATGACACATGGTGGCTGAAGAGGG-
CCCATCT
GATCGAGATGAAAACATGTGAATGGCCAAAGTCCCACACATTGTGGACAGATGGAATAGAAGAGAGTGATCTGA-
TCATACCC
AAGTCTTTAGCTGGGCCACTCAGCCATCACAATGCCAGAGAGGGCTACAGGACCCAAATGAAAGGGCCATGGCA-
CAGTGAAG
AGCTTGAAATTCGGTTTGAGGAATGCCCAGGCACTAAGGTCCACGTGGAGGAAACATGTGGAACAAGAGGACCA-
TCTCTGAG
ATCAACCACTGCAAGCGGAAGGGTGATCGAGGAATGGTGCTCCAGGGAGTGCACAATGCCCCCACTGTCCTTCC-
AGGCTAAA
GATGGCTGTTGGTATGGAATGGAGATAAGGCCCAGGAAAGAACCAGAAAGCAACTTAGTAAGGTCAATGGTGAC-
TGCAGGA
TCAACTGATCACATGGATCACTTCTCCCTTGGAGTGCTTGTGATTCTGCTCATGGTGCAGGAAGGGCTGAAGAA-
GAGAATGAC
CACAAAGATCATCATAAGCACATCAATGGCAGTGCTGGTAGCTATGATCCTGGGAGGATTTTCAATGAGTGACC-
TGGCTAAGC
TTGCAATTTTGATGGGTGCCACCTTCGCGGAAATGAACACTGGAGGAGATGTAGCTCATCTGGCGCTGATAGCG-
GCATTCAAA
GTCAGACCAGCGTTGCTGGTATCTTTCATCTTCAGAGCTAATTGGACACCCCGTGAAAGCATGCTGCTGGCCTT-
GGCCTCGTGT
CTTTTGCAAACTGCGATCTCCGCCTTGGAAGGCGACCTGATGGTTCTCATCAATGGTTTTGCTTTGGCCTGGTT-
GGCAATACGA
GCGATGGTTGTTCCACGCACTGATAACATCACCTTAGCAATCCTGGCTGCTCTGACACCACTGGCCCGGGGCAC-
ACTGCTTGTG
GCGTGGAGAGCAGGCCTTGCTACTTGCGGGGGGTTTATGCTCCTCTCTCTGAAGGGAAAAGGCAGTGTGAAGAA-
GAACTTAC
CATTTGTCATGGCCCTGGGACTAACCGCTGTGAGGCTGGTCGACCCCATCAACGTGGTGGGACTGCTGTTGCTC-
ACAAGGAGT
GGGAAGCGGAGCTGGCCCCCTAGCGAAGTACTCACAGCTGTTGGCCTGATATGCGCATTGGCTGGAGGGTTCGC-
CAAGGCAG
ATATAGAGATGGCTGGGCCCATGGCCGCGGTCGGTCTGCTAATTGTCAGTTACGTGGTCTCAGGAAAGAGTGTG-
GACATGTA
CATTGAAAGAGCAGGTGACATCACATGGGAAAAAGATGCGGAAGTCACTGGAAACAGTCCCCGGCTCGATGTGG-
CGCTAGAT
GAGAGTGGTGATTTCTCCCTGGTGGAGGATGACGGTCCCCCCATGAGAGAGATCATACTCAAGGTGGTCCTGAT-
GACCATCTG
TGGCATGAACCCAATAGCCATACCCTTTGCAGCTGGAGCGTGGTACGTATACGTGAAGACTGGAAAAAGGAGTG-
GTGCTCTAT
GGGATGTGCCTGCTCCCAAGGAAGTAAAAAAGGGGGAGACCACAGATGGAGTGTACAGAGTAATGACTCGCAGA-
CTGCTAG
GTTCAACACAAGTTGGAGTGGGAGTTATGCAAGAGGGGGTCTTTCACACTATGTGGCACGTCACAAAAGGATCC-
GCGCTGAG
AAGCGGTGAAGGGAGACTTGATCCATACTGGGGAGATGTCAAGCAGGATCTGGTGTCATACTGTGGTCCATGGA-
AGCTAGAT
GCCGCCTGGGACGGGCACAGCGAGGTGCAGCTCTTGGCCGTGCCCCCCGGAGAGAGAGCGAGGAACATCCAGAC-
TCTGCCC
GGAATATTTAAGACAAAGGATGGGGACATTGGAGCGGTTGCACTGGATTACCCAGCAGGAACTTCAGGATCTCC-
AATCCTAG
ACAAGTGTGGGAGAGTGATAGGACTTTATGGCAATGGGGTCGTGATCAAAAATGGGAGTTATGTTAGTGCCATC-
ACCCAAGG
GAGGAGGGAGGAAGAGACTCCTGTTGAGTGCTTCGAGCCTTCGATGCTGAAGAAGAAGCAGCTAACTGTCTTAG-
ACTTGCAT
CCTGGAGCTGGGAAAACCAGGAGAGTTCTTCCTGAAATAGTCCGTGAAGCCATAAAAACAAGACTCCGTACTGT-
GATCTTGGC
TCCAACCAGGGTTGTCGCTGCTGAAATGGAGGAGGCCCTTAGAGGGCTTCCAGTGCGTTATATGACAACAGCAG-
TCAATGTCA
CCCACTCTGGAACAGAAATCGTCGACTTAATGTGCCATGCCACCTTCACTTCACGTCTACTACAGCCAATTAGA-
GTCCCCAACTA
TAATCTGTATATTATGGATGAGGCCCACTTCACAGATCCCTCAAGTATAGCAGCAAGAGGATACATTTCAACAA-
GGGTTGAGA
TGGGCGAGGCGGCTGCCATCTTCATGACCGCCACGCCACCAGGAACCCGTGACGCATTTCCGGACTCCAACTCA-
CCAATTATG
GACACCGAAGTGGAAGTCCCAGAGAGAGCCTGGAGCTCAGGCTTTGATTGGGTGACGGAGTATTCTGGAAAAAC-
AGTTTGGT
TTGTTCCACGCGTGAGGAACGGCAATGAGATCGCAGCTTGTCTGACAAAGGCTGGAAAACGGGTCATACAGCTC-
AGCAGAAA
GACTTTTGAGACAGAGTTCCAGAAAACAAAACATCAAGAGTGGGACTTTGTCGTGACAACTGACATTTCAGAGA-
TGGGCGCCA
ACTTTAAAGCTGACCGTGTCATAGATTCCAGGAGATGCCTAAAGCCGGTCATACTTGGTGGCGAGAGAGTCATT-
CTGGCTGGA
CCCATGCCTGTCACACATGCCAGCGCTGCCCAGAGGAGGGGGCGCATAGGCAGGAATCCCAACAAACCTGGAGA-
TGAGTATC
TGTATGGAGGTGGGTGCGCAGAGACTGACGAAGACCATGCACACTGGCTTGAAGCAAGAATGCTCCTTGACAAT-
ATTTACCTC
CAAGATGGCCTCATAGCCTCGCTCTATCGACCTGAGGCCGACAAAGTAGCAGCCATTGAGGGAGAGTTCAAGCT-
TAGGACGG
AGCAAAGGAAGACCTTTGTGGAACTCATGAAAAGAGGAGATCTTCCTGTTTGGCTGGCCTATCAGGTTGCATCT-
GCCGGAATA
ACCTACACAGATAGAAGATGGTGCTTTGATGGCACGACCAACAACACCATAATGGAAGACAGTGTGCCGGCAGA-
GGTGTGGA
CCAGACACGGAGAGAAAAGAGTGCTCAAACCGAGGTGGATGGACGCCAGAGTTTGTTCAGATCATGCGGCCCTG-
AAGTCATT
CAAGGAGTTTGCCGCTGGGAAAAGAGGAGCGGCTTTTGGAGTGATGGAAGCCCTGGGAACACTGCCAGGACACA-
TGACAGA
GAGATTCCAGGAAGCCATTGACAACCTCGCTGTGCTCATGCGGGCAGAGACTGGAAGCAGGCCTTACAAAGCCG-
CGGCGGCC
CAATTGCCGGAGACCCTAGAGACCATTATGCTTTTGGGGTTGCTGGGAACAGTCTCGCTGGGAATCTTTTTCGT-
CTTGATGAGG
AACAAGGGCATAGGGAAGATGGGCTTTGGAATGGTGACTCTTGGGGCCAGCGCATGGCTCATGTGGCTCTCGGA-
AATTGAGC
CAGCCAGAATTGCATGTGTCCTCATTGTTGTGTTCCTATTGCTGGTGGTGCTCATACCTGAGCCAGAAAAGCAA-
AGATCTCCCC
AGGACAACCAAATGGCCATCATCATCATGGTAGCAGTAGGTCTTCTGGGCTTGATTACCGCCAATGAACTCGGA-
TGGTTGGAG
AGAACAAAGAGTGACCTAAGCCATCTAATGGGAAGGAGAGAGGAGGGGGCAACCATGGGATTCTCAATGGACAT-
TGACCTG
CGGCCAGCCTCAGCTTGGGCCATCTATCCTGCCTTGACATCTTTCATTACCCCAGCCGTCCAACATGCAGTGAC-
CACTTCATACA
ACAACTACTCCTTAATGGCGATGGCCACGCAAGCTGGAGTGTTGTTTGGTATGGGCAAAGGGATGCCATTCTAC-
GCATGGGAC
TTTGGAGTCCCGCTGCTAATGATAGGTTGCTACTCACAATTAACGCCCCTGACCCTAATAGTGGCCATCATTTT-
GCTCGTGGCG
CACTACATGTACTTGATCCCAGGGCTGCAGGCAGCAGCTGCGCGTGCTGCCCAGAAGAGAACGGCAGCTGGCAT-
CATGAAGA
ACCCTGTTGTGGAGGGAATAGTGGTGACTGACATTGACACAATGACAATTGACCCCCAAGTGGAGAAAAAGATG-
GGACAGGT
GCTACTCATGGCAGTAGCCGTCTCCAGCGCCATACTGTCGAGGACCGCCTGGGGGTGGGGGGAGGCTGGGGCCC-
TGATCACA
GCCGCAACTTCCACTTTGTGGGAAGGCTCTCCGAACAAGTACTGGAACTCCTCTACAGCCACCTCACTGTGTAA-
CATTTTTAGG
GGAAGTTACTTGGCTGGAGCTTCTCTAATCTACACAGTAACAAGAAACGCTGGCTTGGTCAAGAGACGTGGGGG-
TGGAACAG
GAGAGACCCTGGGAGAGAAATGGAAGGCCCGCTTGAACCAGATGTCGGCCCTGGAGTTCTACTCCTACAAAAAG-
TCAGGCAT
CACCGAGGTGTGCAGAGAAGAGGCCCGCCGCGCCCTCAAGGACGGTGTGGCAACGGGAGGCCATGCTGTGTCCC-
GAGGAAG
TGCAAAGCTGAGATGGTTGGTGGAGCGGGGATACCTGCAGCCCTATGGAAAGGTCATTGATCTTGGATGTGGCA-
GAGGGGG
CTGGAGTTACTACGCCGCCACCATCCGCAAAGTTCAAGAAGTGAAAGGATACACAAAAGGAGGCCCTGGTCATG-
AAGAACCC
GTGTTGGTGCAAAGCTATGGGTGGAACATAGTCCGTCTTAAGAGTGGGGTGGACGTCTTTCATATGGCGGCTGA-
GCCGTGTG
ACACGTTGCTGTGTGACATAGGTGAGTCATCATCTAGTCCTGAAGTGGAAGAAGCACGGACGCTCAGAGTCCTC-
TCCATGGTG
GGGGATTGGCTTGAAAAAAGACCAGGAGCCTTTTGTATAAAAGTGTTGTGCCCATACACCAGCACTATGATGGA-
AACCCTGGA
GCGACTGCAGCGTAGGTATGGGGGAGGACTGGTCAGAGTGCCACTCTCCCGCAACTCTACACATGAGATGTACT-
GGGTCTCT
GGAGCGAAAAGCAACACCATAAAAAGTGTGTCCACCACGAGCCAGCTCCTCTTGGGGCGCATGGACGGGCCTAG-
GAGGCCA
GTGAAATATGAGGAGGATGTGAATCTCGGCTCTGGCACGCGGGCTGTGGTAAGCTGCGCTGAAGCTCCCAACAT-
GAAGATCA
TTGGTAACCGCATTGAAAGGATCCGCGCTGAGAAAGCGGAAACGTGGTTCTTTGACGAGAACCACCCATATAGG-
ACATGGGC
TTACCATGGAAGCTATGATGCCGCCACACAAGGGTCAGCGTCCTCTCTAATAAACGGGGTTGTCAGGCTCCTGT-
CAAAACCCT
GGGATGTGGTGACTGGAGTCACAGGAATAGCCATGACCGACACCACACCGTATGGTCAGCAAAGAGTTTTCAAG-
GAAAAAGT
GGACACTAGGGTGCCAGACCCCCAAGAAGGCACTCGTCAGGTTATGAGCATGGTCTCTTCCTGGTTGTGGAAAG-
AGCTAGGC
AAACACAAACGGCCACGAGTCTGTACCAAAGAAGAGTTCATCAACAAGGTTCGTAGCAATGCAGCATTAGGGGC-
AATATTTGA
AGAGGAAAAAGAGTGGAAGACTGCAGTGGAAGCTGTGAACGATCCAAGGTTCTGGGCTCTAGTGGACAAGGAAA-
GAGAGCA
CCACCTGAGAGGAGAGTGCCAGAGTTGTGTGTACATCACAATGGGAAAAAGAGAAAAGAAACAAGGGGAATTTG-
GAAAGGC
CAAGGGCAGCCGCGCCATCTGGTATATGTGGCTAGGGGCTAGATTTCTAGAGTTCGAAGCCCTTGGATTCTTGA-
ACGAGGATC
ACTGGATGGGGAGAGAGAACTCAGGAGGTGGTGTTGAAGGGCTGGGATTACAAAGACTCGGATATGTCCTAGAA-
GAGATGA
GTCGCATACCAGGAGGAAGGATGTATGCAGATGACACTGCTGGCTGGGACACCCGCATCAGCAGGTTTGATCTG-
GAGAATGA
AGCTCTAATCACCAACCAAATGGAGAAAGGGCACAGGGCCTTGGCATTGGCCATAATCAAGTACACATACCAAA-
ACAAAGTG
GTAAAGGTCCTTAGACCAGCTGAAAAAGGGAAGACAGTTATGGACATTATTTCGAGACAAGACCAAAGGGGGAG-
CGGACAA
GTTGTCACTTACGCTCTCAACACATTTACCAACCTAGTGGTGCAACTCATTCGGAATATGGAGGCTGAGGAAGT-
TCTAGAGATG
CAAGACTTGTGGCTGCTGCGGAGGTCAGAGAAAGTGACCAACTGGTTGCAGAGCAACGGATGGGATAGGCTCAA-
ACGAATG
GCGGTCAGTGGAGATGATTGCGTTGTGAAACCAATTGATGATAGGTTTGCACATGCCCTCAGGTTCTTGAATGA-
TATGGGAAA
AGTTAGGAAGGACACACAAGAGTGGAAACCCTCAACTGGATGGGACAACTGGGAAGAAGTTCCCTTCTGCTCCC-
ACCACTTCA
ACAAGCTCCATCTCAAGGACGGGAGGTCCATTGTGGTTCCCTGCCGCCACCAAGATGAACTGATTGGCCGGGCC-
CGCGTCTCT
CCAGGGGCGGGATGGAGCATCCGGGAGACTGCTTGCCTAGCAAAATCATATGCGCAAATGTGGCAGCTCCTTTA-
TTTCCACAG
AAGGGACCTCCGACTGATGGCCAATGCCATTTGTTCATCTGTGCCAGTTGACTGGGTTCCAACTGGGAGAACTA-
CCTGGTCAAT
CCATGGAAAGGGAGAATGGATGACCACTGAAGACATGCTTGTGGCGTGGAACAGAGTGTGGATTGAGGAGAACG-
ACCACAT
GGAAGACAAGACCCCAGTCACGAAATGGACAGACATTCCCTATTTGGGAAAAAGGGAAGACTTGTGGTGTGGAT-
CTCTCATA
GGGCACAGACCGCGCACCACCTGGGCTGAGAACATTAAAAACACAGTCAACATGGTGCGCAGGATCATAGGTGA-
TGAAGAAA
AGTACATGGACTACCTATCCACCCAAGTTCGCTACTTGGGTGAAGAAGGGTCTACACCTGGAGTGCTGTAAGCA-
CCAATCTTA
ATGTTGTCAGGCCTGCTAGTCAGCCACAGCTTGGGGAAAGCTGTGCAGCCTGTGACCCCCCCAGGAGAAGCTGG-
GAAACCAA
GCCTATAGTCAGGCCGAGAACGCCATGGCACGGAAGAAGCCATGCTGCCTGTGAGCCCCTCAGAGGACACTGAG-
TCAAAAAA
CCCCACGCGCTTGGAGGCGCAGGATGGGAAAAGAAGGTGGCGACCTTCCCCACCCTTCAATCTGGGGCCTGAAC-
TGGAGATC AGCTGTGGATCTCCAGAAGAGGGACTAGTGGTTAGAGGAGA LC002520.1 Zika
virus genomic RNA, strain: MR766-NIID, Uganda, complete genome SEQ
ID NO: 11
AGTTGTTGATCTGTGTGAGTCAGACTGCGACAGTTCGAGTCTGAAGCGAGAGCTAACAACAGTATCAACAGGTT-
TAATTTGGA
TTTGGAAACGAGAGTTTCTGGTCATGAAAAACCCAAAGAAGAAATCCGGAGGATTCCGGATTGTCAATATGCTA-
AAACGCGG
AGTAGCCCGTGTAAACCCCTTGGGAGGTTTGAAGAGGTTGCCAGCCGGACTTCTGCTGGGTCATGGACCCATCA-
GAATGGTTT
TGGCGATACTAGCCTTTTTGAGATTTACAGCAATCAAGCCATCACTGGGCCTTATCAACAGATGGGGTTCCGTG-
GGGAAAAAA
GAGGCTATGGAAATAATAAAGAAGTTCAAGAAAGATCTTGCTGCCATGTTGAGAATAATCAATGCTAGGAAAGA-
GAGGAAGA
GACGTGGCGCAGACACCAGCATCGGAATCATTGGCCTCCTGCTGACTACAGCCATGGCAGCAGAGATCACTAGA-
CGCGGGAG
TGCATACTACATGTACTTGGATAGGAGCGATGCCGGGAAGGCCATTTCGTTTGCTACCACATTGGGAGTGAACA-
AGTGCCACG
TACAGATCATGGACCTCGGGCACATGTGTGACGCCACCATGAGTTATGAGTGCCCTATGCTGGATGAGGGAGTG-
GAACCAGA
TGATGTCGATTGCTGGTGCAACACGACATCAACTTGGGTTGTGTACGGAACCTGTCATCACAAAAAAGGTGAGG-
CACGGCGAT
CTAGAAGAGCCGTGACGCTCCCTTCTCACTCTACAAGGAAGTTGCAAACGCGGTCGCAGACCTGGTTAGAATCA-
AGAGAATAC
ACGAAGCACTTGATCAAGGTTGAAAACTGGATATTCAGGAACCCCGGGTTTGCGCTAGTGGCCGTTGCCATTGC-
CTGGCTTTT
GGGAAGCTCGACGAGCCAAAAAGTCATATACTTGGTCATGATACTGCTGATTGCCCCGGCATACAGTATCAGGT-
GCATTGGAG
TCAGCAATAGAGACTTCGTGGAGGGCATGTCAGGTGGGACCTGGGTTGATGTTGTCTTGGAACATGGAGGCTGC-
GTTACCGT
GATGGCACAGGACAAGCCAACAGTTGACATAGAGTTGGTCACGACGACGGTTAGTAACATGGCCGAGGTAAGAT-
CCTATTGC
TACGAGGCATCGATATCGGACATGGCTTCGGACAGTCGTTGCCCAACACAAGGTGAAGCCTACCTTGACAAGCA-
ATCAGACAC
TCAATATGTCTGCAAAAGAACATTAGTGGACAGAGGTTGGGGAAACGGTTGTGGACTTTTTGGCAAAGGGAGCT-
TGGTGACA
TGTGCCAAGTTTACGTGTTCTAAGAAGATGACCGGGAAGAGCATTCAACCGGAAAATCTGGAGTATCGGATAAT-
GCTATCAGT
GCATGGCTCCCAGCATAGCGGGATGACTGTCAATGATATAGGATATGAAACTGACGAAAATAGAGCGAAAGTCG-
AGGTTACG
CCTAATTCACCAAGAGCGGAAGCAACCTTGGGAGGCTTTGGAAGCTTAGGACTTGACTGTGAACCAAGGACAGG-
CCTTGACTT
TTCAGATCTGTATTACCTGACCATGAACAATAAGCATTGGTTGGTGCACAAAGAGTGGTTTCATGACATCCCAT-
TGCCTTGGCA
TGCTGGGGCAGACACTGGAACTCCACACTGGAACAACAAAGAGGCATTGGTAGAATTCAAGGATGCCCACGCCA-
AGAGGCAA
ACCGTCGTCGTTCTGGGGAGCCAGGAAGGAGCCGTTCACACGGCTCTCGCTGGAGCTCTAGAGGCTGAGATGGA-
TGGTGCAA
AGGGAAAGCTGTTCTCTGGCCATTTGAAATGCCGCCTAAAAATGGACAAGCTTAGATTGAAGGGCGTGTCATAT-
TCCTTGTGC
ACTGCGGCATTCACATTCACCAAGGTCCCAGCTGAAACACTGCATGGAACAGTCACAGTGGAGGTGCAGTATGC-
AGGGACAG
ATGGACCCTGCAAGATCCCAGTCCAGATGGCGGTGGACATGCAGACCCTGACCCCAGTTGGAAGGCTGATAACC-
GCCAACCC
CGTGATTACTGAAAGCACTGAGAACTCAAAGATGATGTTGGAGCTTGACCCACCATTTGGGGATTCTTACATTG-
TCATAGGAG
TTGGGGACAAGAAAATCACCCACCACTGGCATAGGAGTGGTAGCACCATCGGAAAGGCATTTGAGGCCACTGTG-
AGAGGCGC
CAAGAGAATGGCAGTCCTGGGGGATACAGCCTGGGACTTCGGATCAGTCGGGGGTGTGTTCAACTCACTGGGTA-
AGGGCATT
CACCAGATTTTTGGAGCAGCCTTCAAATCACTGTTTGGAGGAATGTCCTGGTTCTCACAGATCCTCATAGGCAC-
GCTGCTAGTG
TGGTTAGGTTTGAACACAAAGAATGGATCTATCTCCCTCACATGCTTGGCCCTGGGGGGAGTGATGATCTTCCT-
CTCCACGGCT
GTTTCTGCTGACGTGGGGTGCTCAGTGGACTTCTCAAAAAAGGAAACGAGATGTGGCACGGGGGTATTCATCTA-
TAATGATGT
TGAAGCCTGGAGGGACCGGTACAAGTACCATCCTGACTCCCCCCGCAGATTGGCAGCAGCAGTCAAGCAGGCCT-
GGGAAGAG
GGGATCTGTGGGATCTCATCCGTTTCAAGAATGGAAAACATCATGTGGAAATCAGTAGAAGGGGAGCTCAATGC-
TATCCTAGA
GGAGAATGGAGTTCAACTGACAGTTGTTGTGGGATCTGTAAAAAACCCCATGTGGAGAGGTCCACAAAGATTGC-
CAGTGCCT
GTGAATGAGCTGCCCCATGGCTGGAAAGCCTGGGGGAAATCGTATTTTGTTAGGGCGGCAAAGACCAACAACAG-
TTTTGTTGT
CGACGGTGACACACTGAAGGAATGTCCGCTTGAGCACAGAGCATGGAATAGTTTTCTTGTGGAGGATCACGGGT-
TTGGAGTC
TTCCACACCAGTGTCTGGCTTAAGGTCAGAGAAGATTACTCATTAGAATGTGACCCAGCCGTCATAGGAACAGC-
TGTTAAGGG
AAGGGAGGCCGCGCACAGTGATCTGGGCTATTGGATTGAAAGTGAAAAGAATGACACATGGAGGCTGAAGAGGG-
CCCACCT
GATTGAGATGAAAACATGTGAATGGCCAAAGTCTCACACATTGTGGACAGATGGAGTAGAAGAAAGTGATCTTA-
TCATACCCA
AGTCTTTAGCTGGTCCACTCAGCCACCACAACACCAGAGAGGGTTACAGAACCCAAGTGAAAGGGCCATGGCAC-
AGTGAAGA
GCTTGAAATCCGGTTTGAGGAATGTCCAGGCACCAAGGTTTACGTGGAGGAGACATGCGGAACTAGAGGACCAT-
CTCTGAGA
TCAACTACTGCAAGTGGAAGGGTCATTGAGGAATGGTGCTGTAGGGAATGCACAATGCCCCCACTATCGTTTCG-
AGCAAAAG
ACGGCTGCTGGTATGGAATGGAGATAAGGCCCAGGAAAGAACCAGAGAGCAACTTAGTGAGGTCAATGGTGACA-
GCGGGGT
CAACCGATCATATGGACCACTTCTCTCTTGGAGTGCTTGTGATTCTACTCATGGTGCAGGAGGGGTTGAAGAAG-
AGAATGACC
ACAAAGATCATCATGAGCACATCAATGGCAGTGCTGGTAGTCATGATCTTGGGAGGATTTTCAATGAGTGACCT-
GGCCAAGCT
TGTGATCCTGATGGGTGCTACTTTCGCAGAAATGAACACTGGAGGAGATGTAGCTCACTTGGCATTGGTAGCGG-
CATTTAAAG
TCAGACCAGCCTTGCTGGTCTCCTTCATTTTCAGAGCCAATTGGACACCCCGTGAGAGCATGCTGCTAGCCCTG-
GCTTCGTGTC
TTCTGCAAACTGCGATCTCTGCTCTTGAAGGTGACTTGATGGTCCTCATTAATGGATTTGCTTTGGCCTGGTTG-
GCAATTCGAGC
AATGGCCGTGCCACGCACTGACAACATCGCTCTACCAATCTTGGCTGCTCTAACACCACTAGCTCGAGGCACAC-
TGCTCGTGGC
ATGGAGAGCGGGCCTGGCTACTTGTGGAGGGATCATGCTCCTCTCCCTGAAAGGGAAAGGTAGTGTGAAGAAGA-
ACCTGCCA
TTTGTCATGGCCCTGGGATTGACAGCTGTGAGGGTAGTAGACCCTATTAATGTGGTAGGACTACTGTTACTCAC-
AAGGAGTGG
GAAGCGGAGCTGGCCCCCTAGTGAAGTTCTCACAGCCGTTGGCCTGATATGTGCACTGGCCGGAGGGTTTGCCA-
AGGCAGAC
ATTGAGATGGCTGGACCCATGGCTGCAGTAGGCTTGCTAATTGTCAGCTATGTGGTCTCGGGAAAGAGTGTGGA-
CATGTACAT
TGAAAGAGCAGGTGACATCACATGGGAAAAGGACGCGGAAGTCACTGGAAACAGTCCTCGGCTTGACGTGGCAC-
TGGATGA
GAGTGGTGATTTCTCCTTGGTAGAGGAAGATGGTCCACCCATGAGAGAGATCATACTTAAGGTGGTCCTGATGG-
CCATCTGTG
GCATGAACCCAATAGCTATACCTTTTGCTGCAGGAGCGTGGTATGTGTATGTGAAGACTGGGAAAAGGAGTGGC-
GCCCTCTG
GGACGTGCCTGCTCCCAAAGAAGTGAAGAAAGGAGAGACCACAGATGGAGTGTACAGAGTGATGACTCGCAGAC-
TGCTAGG
TTCAACACAGGTTGGAGTGGGAGTCATGCAAGAGGGAGTCTTCCACACCATGTGGCACGTTACAAAAGGAGCCG-
CACTGAGG
AGCGGTGAGGGAAGACTTGATCCATACTGGGGGGATGTCAAGCAGGACTTGGTGTCATACTGTGGGCCTTGGAA-
GTTGGATG
CAGCTTGGGATGGACTCAGCGAGGTACAGCTTTTGGCCGTACCTCCCGGAGAGAGGGCCAGAAACATTCAGACC-
CTGCCTGG
AATATTCAAGACAAAGGACGGGGACATCGGAGCAGTTGCTCTGGACTACCCTGCAGGGACCTCAGGATCTCCGA-
TCCTAGAC
AAATGTGGAAGAGTGATAGGACTCTATGGCAATGGGGTTGTGATCAAGAATGGAAGCTATGTTAGTGCTATAAC-
CCAGGGAA
AGAGGGAGGAGGAGACTCCGGTTGAATGTTTCGAACCCTCGATGCTGAAGAAGAAGCAGCTAACTGTCTTGGAT-
CTGCATCC
AGGAGCCGGAAAAACCAGGAGAGTTCTTCCTGAAATAGTCCGTGAAGCCATAAAAAAGAGACTCCGGACAGTGA-
TCTTGGCA
CCAACTAGGGTTGTCGCTGCTGAGATGGAGGAGGCCTTGAGAGGACTTCCGGTGCGTTACATGACAACAGCAGT-
CAACGTCA
CCCATTCTGGGACAGAAATCGTTGATTTGATGTGCCATGCCACTTTCACTTCACGCTTACTACAACCCATCAGA-
GTCCCTAATTA
CAATCTCTACATCATGGATGAAGCCCACTTCACAGACCCCTCAAGTATAGCTGCAAGAGGATATATATCAACAA-
GGGTTGAAAT
GGGCGAGGCGGCTGCCATTTTTATGACTGCCACACCACCAGGAACCCGTGATGCGTTTCCTGACTCTAACTCAC-
CAATCATGGA
CACAGAAGTGGAAGTCCCAGAGAGAGCCTGGAGCTCAGGCTTTGATTGGGTGACAGACCATTCTGGGAAAACAG-
TTTGGTTC
GTTCCAAGCGTGAGAAACGGAAATGAAATCGCAGCCTGTCTGACAAAGGCTGGAAAGCGGGTCATACAGCTCAG-
CAGGAAG
ACTTTTGAGACAGAATTTCAGAAAACAAAAAATCAAGAGTGGGACTTTGTCATAACAACTGACATCTCAGAGAT-
GGGCGCCAA
CTTCAAGGCTGACCGGGTCATAGACTCTAGGAGATGCCTAAAACCAGTCATACTTGATGGTGAGAGAGTCATCT-
TGGCTGGGC
CCATGCCTGTCACGCATGCTAGTGCTGCTCAGAGGAGAGGACGTATAGGCAGGAACCCTAACAAACCTGGAGAT-
GAGTACAT
GTATGGAGGTGGGTGTGCAGAGACTGATGAAGGCCATGCACACTGGCTTGAAGCAAGAATGCTTCTTGACAACA-
TCTACCTCC
AGGATGGCCTCATAGCCTCGCTCTATCGGCCTGAGGCCGATAAGGTAGCCGCCATTGAGGGAGAGTTTAAGCTG-
AGGACAGA
GCAAAGGAAGACCTTCGTGGAACTCATGAAGAGAGGAGACCTTCCCGTCTGGCTAGCCTATCAGGTTGCATCTG-
CCGGAATAA
CTTACACAGACAGAAGATGGTGCTTTGATGGCACAACCAACAACACCATAATGGAAGACAGCGTACCAGCAGAG-
GTGTGGAC
AAAGTATGGAGAGAAGAGAGTGCTCAAACCGAGATGGATGGATGCTAGGGTCTGTTCAGACCATGCGGCCCTGA-
AGTCGTTC
AAAGAATTCGCCGCTGGAAAAAGAGGAGCGGCTTTGGGAGTAATGGAGGCCCTGGGAACACTGCCAGGACACAT-
GACAGAG
AGGTTTCAGGAAGCCATTGACAACCTCGCCGTGCTCATGCGAGCAGAGACTGGAAGCAGGCCTTATAAGGCAGC-
GGCAGCCC
AACTGCCGGAGACCCTAGAGACCATTATGCTCTTAGGTTTGCTGGGAACAGTTTCACTGGGGATCTTCTTCGTC-
TTGATGCGGA
ATAAGGGCATCGGGAAGATGGGCTTTGGAATGGTAACCCTTGGGGCCAGTGCATGGCTCATGTGGCTTTCGGAA-
ATTGAACC
AGCCAGAATTGCATGTGTCCTCATTGTTGTGTTTTTATTACTGGTGGTGCTCATACCCGAGCCAGAGAAGCAAA-
GATCTCCCCA
AGATAACCAGATGGCAATTATCATCATGGTGGCAGTGGGCCTTCTAGGTTTGATAACTGCAAACGAACTTGGAT-
GGCTGGAAA
GAACAAAAAATGACATAGCTCATCTAATGGGAAGGAGAGAAGAAGGAGCAACCATGGGATTCTCAATGGACATT-
GATCTGCG
GCCAGCCTCCGCCTGGGCTATCTATGCCGCATTGACAACTCTCATCACCCCAGCTGTCCAACATGCGGTAACCA-
CTTCATACAAC
AACTACTCCTTAATGGCGATGGCCACACAAGCTGGAGTGCTGTTTGGCATGGGCAAAGGGATGCCATTTTATGC-
ATGGGACCT
TGGAGTCCCGCTGCTAATGATGGGTTGCTATTCACAATTAACACCCCTGACTCTGATAGTAGCTATCATTCTGC-
TTGTGGCGCA
CTACATGTACTTGATCCCAGGCCTACAAGCGGCAGCAGCGCGTGCTGCCCAGAAAAGGACAGCAGCTGGCATCA-
TGAAGAAT
CCCGTTGTGGATGGAATAGTGGTAACTGACATTGACACAATGACAATAGACCCCCAGGTGGAGAAGAAGATGGG-
ACAAGTGT
TACTCATAGCAGTAGCCATCTCCAGTGCTGTGCTGCTGCGGACCGCCTGGGGATGGGGGGAGGCTGGAGCTCTG-
ATCACAGC
AGCGACCTCCACCTTGTGGGAAGGCTCTCCAAACAAATACTGGAACTCCTCTACAGCCACCTCACTGTGCAACA-
TCTTCAGAGG
AAGCTATCTGGCAGGAGCTTCCCTTATCTATACAGTGACGAGAAACGCTGGCCTGGTTAAGAGACGTGGAGGTG-
GGACGGGA
GAGACTCTGGGAGAGAAGTGGAAAGCTCGTCTGAATCAGATGTCGGCCCTGGAGTTCTACTCTTATAAAAAGTC-
AGGTATCAC
TGAAGTGTGTAGAGAGGAGGCTCGCCGTGCCCTCAAGGATGGAGTGGCCACAGGAGGACATGCCGTATCCCGGG-
GAAGTGC
AAAGCTCAGATGGTTGGTGGAGAGAGGATATCTGCAGCCCTATGGGAAGGTTGTTGACCTCGGATGTGGCAGAG-
GGGGCTG
GAGCTATTATGCCGCCACCATCCGCAAAGTGCAGGAGGTGAGAGGATACACAAAGGGAGGTCCCGGTCATGAAG-
AACCCATG
CTGGTGCAAAGCTATGGGTGGAACATAGTTCGTCTCAAGAGTGGAGTGGACGTCTTCCACATGGCGGCTGAGCC-
GTGTGACA
CTCTGCTGTGTGACATAGGTGAGTCATCATCTAGTCCTGAAGTGGAAGAGACACGAACACTCAGAGTGCTCTCT-
ATGGTGGGG
GACTGGCTTGAAAAAAGACCAGGGGCCTTCTGTATAAAGGTGCTGTGCCCATACACCAGCACTATGATGGAAAC-
CATGGAGC
GACTGCAACGTAGGCATGGGGGAGGATTAGTCAGAGTGCCATTGTCTCGCAACTCCACACATGAGATGTACTGG-
GTCTCTGG
GGCAAAGAGCAACATCATAAAAAGTGTGTCCACCACAAGTCAGCTCCTCCTGGGACGCATGGATGGCCCCAGGA-
GGCCAGTG
AAATATGAGGAGGATGTGAACCTCGGCTCGGGTACACGAGCTGTGGCAAGCTGTGCTGAGGCTCCTAACATGAA-
AATCATCG
GCAGGCGCATTGAGAGAATCCGCAATGAACATGCAGAAACATGGTTTCTTGATGAAAACCACCCATACAGGACA-
TGGGCCTAC
CATGGGAGCTACGAAGCCCCCACGCAAGGATCAGCGTCTTCCCTCGTGAACGGGGTTGTTAGACTCCTGTCAAA-
GCCTTGGGA
CGTGGTGACTGGAGTTACAGGAATAGCCATGACTGACACCACACCATACGGCCAACAAAGAGTCTTCAAAGAAA-
AAGTGGAC
ACCAGGGTGCCAGATCCCCAAGAAGGCACTCGCCAGGTAATGAACATAGTCTCTTCCTGGCTGTGGAAGGAGCT-
GGGGAAAC
GCAAGCGGCCACGCGTCTGCACCAAAGAAGAGTTTATCAACAAGGTGCGCAGCAATGCAGCACTGGGAGCAATA-
TTTGAAGA
GGAAAAAGAATGGAAGACGGCTGTGGAAGCTGTGAATGATCCAAGGTTTTGGGCCCTAGTGGATAGGGAGAGAG-
AACACCA
CCTGAGAGGAGAGTGTCACAGCTGTGTGTACAACATGATGGGAAAAAGAGAAAAGAAGCAAGGAGAGTTCGGGA-
AAGCAA
AAGGTAGCCGCGCCATCTGGTACATGTGGTTGGGAGCCAGATTCTTGGAGTTTGAAGCCCTTGGATTCTTGAAC-
GAGGACCAT
TGGATGGGAAGAGAAAACTCAGGAGGTGGAGTCGAAGGGTTAGGATTGCAAAGACTTGGATACATTCTAGAAGA-
AATGAAT
CGGGCACCAGGAGGAAAGATGTACGCAGATGACACTGCTGGCTGGGACACCCGCATTAGTAAGTTTGATCTGGA-
GAATGAAG
CTCTGATTACCAACCAAATGGAGGAAGGGCACAGAACTCTGGCGTTGGCCGTGATTAAATACACATACCAAAAC-
AAAGTGGTG
AAGGTTCTCAGACCAGCTGAAGGAGGAAAAACAGTTATGGACATCATTTCAAGACAAGACCAGAGAGGGAGTGG-
ACAAGTT
GTCACTTATGCTCTCAACACATTCACCAACTTGGTGGTGCAGCTTATCCGGAACATGGAAGCTGAGGAAGTGTT-
AGAGATGCA
AGACTTATGGTTGTTGAGGAAGCCAGAGAAAGTGACCAGATGGTTGCAGAGCAATGGATGGGATAGACTCAAAC-
GAATGGC
GGTCAGTGGAGATGACTGCGTTGTGAAGCCAATCGATGATAGGTTTGCACATGCCCTCAGGTTCTTGAATGACA-
TGGGAAAA
GTTAGGAAAGACACACAGGAGTGGAAACCCTCGACTGGATGGAGCAATTGGGAAGAAGTCCCGTTCTGCTCCCA-
CCACTTCA
ACAAGCTGTACCTCAAGGATGGGAGATCCATTGTGGTCCCTTGCCGCCACCAAGATGAACTGATTGGCCGAGCT-
CGCGTCTCA
CCAGGGGCAGGATGGAGCATCCGGGAGACTGCCTGTCTTGCAAAATCATATGCGCAGATGTGGCAGCTCCTTTA-
TTTCCACAG
AAGAGACCTTCGACTGATGGCTAATGCCATTTGCTCGGCTGTGCCAGTTGACTGGGTACCAACTGGGAGAACCA-
CCTGGTCAA
TCCATGGAAAGGGAGAATGGATGACCACTGAGGACATGCTCATGGTGTGGAATAGAGTGTGGATTGAGGAGAAC-
GACCATA
TGGAGGACAAGACTCCTGTAACAAAATGGACAGACATTCCCTATCTAGGAAAAAGGGAGGACTTATGGTGTGGA-
TCCCTTATA
GGGCACAGACCCCGCACCACTTGGGCTGAAAACATCAAAGACACAGTCAACATGGTGCGCAGGATCATAGGTGA-
TGAAGAAA
AGTACATGGACTATCTATCCACCCAAGTCCGCTACTTGGGTGAGGAAGGGTCCACACCCGGAGTGTTGTAAGCA-
CCAATTTTA
GTGTTGTCAGGCCTGCTAGTCAGCCACAGTTTGGGGAAAGCTGTGCAGCCTGTAACCCCCCCAGGAGAAGCTGG-
GAAACCAA
GCTCATAGTCAGGCCGAGAACGCCATGGCACGGAAGAAGCCATGCTGCCTGTGAGCCCCTCAGAGGACACTGAG-
TCAAAAAA
CCCCACGCGCTTGGAAGCGCAGGATGGGAAAAGAAGGTGGCGACCTTCCCCACCCTTCAATCTGGGGCCTGAAC-
TGGAGACT
AGCTGTGAATCTCCAGCAGAGGGACTAGTGGTTAGAGGAGACCCCCCGGAAAACGCAAAACAGCATATTGACGC-
TGGGAAA
GACCAGAGACTCCATGAGTTTCCACCACGCTGGCCGCCAGGCACAGATCGCCGAACAGCGGCGGCCGGTGTGGG-
GAAATCCA TGGTTTCT AY632535.2 NC_012532.1 Zika virus strain MR 766,
Uganda, complete genome SEQ ID NO: 12
AGTTGTTGATCTGTGTGAGTCAGACTGCGACAGTTCGAGTCTGAAGCGAGAGCTAACAACAGTATCAACAGGTT-
TAATTTGGA
TTTGGAAACGAGAGTTTCTGGTCATGAAAAACCCCAAAGAAGAAATCCGGAGGATCCGGATTGTCAATATGCTA-
AAACGCGG
AGTAGCCCGTGTAAACCCCTTGGGAGGTTTGAAGAGGTTGCCAGCCGGACTTCTGCTGGGTCATGGACCCATCA-
GAATGGTTT
TGGCGATACTAGCCTTTTTGAGATTTACAGCAATCAAGCCATCACTGGGCCTTATCAACAGATGGGGTTCCGTG-
GGGAAAAAA
GAGGCTATGGAAATAATAAAGAAGTTCAAGAAAGATCTTGCTGCCATGTTGAGAATAATCAATGCTAGGAAAGA-
GAGGAAGA
GACGTGGCGCAGACACCAGCATCGGAATCATTGGCCTCCTGCTGACTACAGCCATGGCAGCAGAGATCACTAGA-
CGCGGGAG
TGCATACTACATGTACTTGGATAGGAGCGATGCCGGGAAGGCCATTTCGTTTGCTACCACATTGGGAGTGAACA-
AGTGCCACG
TACAGATCATGGACCTCGGGCACATGTGTGACGCCACCATGAGTTATGAGTGCCCTATGCTGGATGAGGGAGTG-
GAACCAGA
TGATGTCGATTGCTGGTGCAACACGACATCAACTTGGGTTGTGTACGGAACCTGTCATCACAAAAAAGGTGAGG-
CACGGCGAT
CTAGAAGAGCCGTGACGCTCCCTTCTCACTCTACAAGGAAGTTGCAAACGCGGTCGCAGACCTGGTTAGAATCA-
AGAGAATAC
ACGAAGCACTTGATCAAGGTTGAAAACTGGATATTCAGGAACCCCGGGTTTGCGCTAGTGGCCGTTGCCATTGC-
CTGGCTTTT
GGGAAGCTCGACGAGCCAAAAAGTCATATACTTGGTCATGATACTGCTGATTGCCCCGGCATACAGTATCAGGT-
GCATTGGAG
TCAGCAATAGAGACTTCGTGGAGGGCATGTCAGGTGGGACCTGGGTTGATGTTGTCTTGGAACATGGAGGCTGC-
GTTACCGT
GATGGCACAGGACAAGCCAACAGTCGACATAGAGTTGGTCACGACGACGGTTAGTAACATGGCCGAGGTAAGAT-
CCTATTGC
TACGAGGCATCGATATCGGACATGGCTTCGGACAGTCGTTGCCCAACACAAGGTGAAGCCTACCTTGACAAGCA-
ATCAGACAC
TCAATATGTCTGCAAAAGAACATTAGTGGACAGAGGTTGGGGAAACGGTTGTGGACTTTTTGGCAAAGGGAGCT-
TGGTGACA
TGTGCCAAGTTTACGTGTTCTAAGAAGATGACCGGGAAGAGCATTCAACCGGAAAATCTGGAGTATCGGATAAT-
GCTATCAGT
GCATGGCTCCCAGCATAGCGGGATGATTGGATATGAAACTGACGAAGATAGAGCGAAAGTCGAGGTTACGCCTA-
ATTCACCA
AGAGCGGAAGCAACCTTGGGAGGCTTTGGAAGCTTAGGACTTGACTGTGAACCAAGGACAGGCCTTGACTTTTC-
AGATCTGTA
TTACCTGACCATGAACAATAAGCATTGGTTGGTGCACAAAGAGTGGTTTCATGACATCCCATTGCCTTGGCATG-
CTGGGGCAG
ACACCGGAACTCCACACTGGAACAACAAAGAGGCATTGGTAGAATTCAAGGATGCCCACGCCAAGAGGCAAACC-
GTCGTCGT
TCTGGGGAGCCAGGAAGGAGCCGTTCACACGGCTCTCGCTGGAGCTCTAGAGGCTGAGATGGATGGTGCAAAGG-
GAAGGCT
GTTCTCTGGCCATTTGAAATGCCGCCTAAAAATGGACAAGCTTAGATTGAAGGGCGTGTCATATTCCTTGTGCA-
CTGCGGCATT
CACATTCACCAAGGTCCCAGCTGAAACACTGCATGGAACAGTCACAGTGGAGGTGCAGTATGCAGGGACAGATG-
GACCCTGC
AAGATCCCAGTCCAGATGGCGGTGGACATGCAGACCCTGACCCCAGTTGGAAGGCTGATAACCGCCAACCCCGT-
GATTACTGA
AAGCACTGAGAACTCAAAGATGATGTTGGAGCTTGACCCACCATTTGGGGATTCTTACATTGTCATAGGAGTTG-
GGGACAAGA
AAATCACCCACCACTGGCATAGGAGTGGTAGCACCATCGGAAAGGCATTTGAGGCCACTGTGAGAGGCGCCAAG-
AGAATGGC
AGTCCTGGGGGATACAGCCTGGGACTTCGGATCAGTCGGGGGTGTGTTCAACTCACTGGGTAAGGGCATTCACC-
AGATTTTTG
GAGCAGCCTTCAAATCACTGTTTGGAGGAATGTCCTGGTTCTCACAGATCCTCATAGGCACGCTGCTAGTGTGG-
TTAGGTTTGA
ACACAAAGAATGGATCTATCTCCCTCACATGCTTGGCCCTGGGGGGAGTGATGATCTTCCTCTCCACGGCTGTT-
TCTGCTGACG
TGGGGTGCTCAGTGGACTTCTCAAAAAAGGAAACGAGATGTGGCACGGGGGTATTCATCTATAATGATGTTGAA-
GCCTGGAG
GGACCGGTACAAGTACCATCCTGACTCCCCCCGCAGATTGGCAGCAGCAGTCAAGCAGGCCTGGGAAGAGGGGA-
TCTGTGGG
ATCTCATCCGTTTCAAGAATGGAAAACATCATGTGGAAATCAGTAGAAGGGGAGCTCAATGCTATCCTAGAGGA-
GAATGGAG
TTCAACTGACAGTTGTTGTGGGATCTGTAAAAAACCCCATGTGGAGAGGTCCACAAAGATTGCCAGTGCCTGTG-
AATGAGCTG
CCCCATGGCTGGAAAGCCTGGGGGAAATCGTATTTTGTTAGGGCGGCAAAGACCAACAACAGTTTTGTTGTCGA-
CGGTGACAC
ACTGAAGGAATGTCCGCTTGAGCACAGAGCATGGAATAGTTTTCTTGTGGAGGATCACGGGTTTGGAGTCTTCC-
ACACCAGTG
TCTGGCTTAAGGTCAGAGAAGATTACTCATTAGAATGTGACCCAGCCGTCATAGGAACAGCTGTTAAGGGAAGG-
GAGGCCGC
GCACAGTGATCTGGGCTATTGGATTGAAAGTGAAAAGAATGACACATGGAGGCTGAAGAGGGCCCACCTGATTG-
AGATGAAA
ACATGTGAATGGCCAAAGTCTCACACATTGTGGACAGATGGAGTAGAAGAAAGTGATCTTATCATACCCAAGTC-
TTTAGCTGG
TCCACTCAGCCACCACAACACCAGAGAGGGTTACAGAACCCAAGTGAAAGGGCCATGGCACAGTGAAGAGCTTG-
AAATCCGG
TTTGAGGAATGTCCAGGCACCAAGGTTTACGTGGAGGAGACATGCGGAACTAGAGGACCATCTCTGAGATCAAC-
TACTGCAA
GTGGAAGGGTCATTGAGGAATGGTGCTGTAGGGAATGCACAATGCCCCCACTATCGTTTCGAGCAAAAGACGGC-
TGCTGGTA
TGGAATGGAGATAAGGCCCAGGAAAGAACCAGAGAGCAACTTAGTGAGGTCAATGGTGACAGCGGGGTCAACCG-
ATCATAT
GGACCACTTCTCTCTTGGAGTGCTTGTGATTCTACTCATGGTGCAGGAGGGGTTGAAGAAGAGAATGACCACAA-
AGATCATCA
TGAGCACATCAATGGCAGTGCTGGTAGTCATGATCTTGGGAGGATTTTCAATGAGTGACCTGGCCAAGCTTGTG-
ATCCTGATG
GGTGCTACTTTCGCAGAAATGAACACTGGAGGAGATGTAGCTCACTTGGCATTGGTAGCGGCATTTAAAGTCAG-
ACCAGCCTT
GCTGGTCTCCTTCATTTTCAGAGCCAATTGGACACCCCGTGAGAGCATGCTGCTAGCCCTGGCTTCGTGTCTTC-
TGCAAACTGC
GATCTCTGCTCTTGAAGGTGACTTGATGGTCCTCATTAATGGATTTGCTTTGGCCTGGTTGGCAATTCGAGCAA-
TGGCCGTGCC
ACGCACTGACAACATCGCTCTACCAATCTTGGCTGCTCTAACACCACTAGCTCGAGGCACACTGCTCGTGGCAT-
GGAGAGCGG
GCCTGGCTACTTGTGGAGGGATCATGCTCCTCTCCCTGAAAGGGAAAGGTAGTGTGAAGAAGAACCTGCCATTT-
GTCATGGCC
CTGGGATTGACAGCTGTGAGGGTAGTAGACCCTATTAATGTGGTAGGACTACTGTTACTCACAAGGAGTGGGAA-
GCGGAGCT
GGCCCCCTAGTGAAGTTCTCACAGCCGTTGGCCTGATATGTGCACTGGCCGGAGGGTTTGCCAAGGCAGACATT-
GAGATGGCT
GGACCCATGGCTGCAGTAGGCTTGCTAATTGTCAGCTATGTGGTCTCGGGAAAGAGTGTGGACATGTACATTGA-
AAGAGCAG
GTGACATCACATGGGAAAAGGACGCGGAAGTCACTGGAAACAGTCCTCGGCTTGACGTGGCACTGGATGAGAGT-
GGTGACTT
CTCCTTGGTAGAGGAAGATGGTCCACCCATGAGAGAGATCATACTCAAGGTGGTCCTGATGGCCATCTGTGGCA-
TGAACCCAA
TAGCTATACCTTTTGCTGCAGGAGCGTGGTATGTGTATGTGAAGACTGGGAAAAGGAGTGGCGCCCTCTGGGAC-
GTGCCTGC
TCCCAAAGAAGTGAAGAAAGGAGAGACCACAGATGGAGTGTACAGAGTGATGACTCGCAGACTGCTAGGTTCAA-
CACAGGTT
GGAGTGGGAGTCATGCAAGAGGGAGTCTTCCACACCATGTGGCACGTTACAAAAGGAGCCGCACTGAGGAGCGG-
TGAGGGA
AGACTTGATCCATACTGGGGGGATGTCAAGCAGGACTTGGTGTCATACTGTGGGCCTTGGAAGTTGGATGCAGC-
TTGGGATG
GACTCAGCGAGGTACAGCTTTTGGCCGTACCTCCCGGAGAGAGGGCCAGAAACATTCAGACCCTGCCTGGAATA-
TTCAAGACA
AAGGACGGGGACATCGGAGCAGTTGCTCTGGACTACCCTGCAGGGACCTCAGGATCTCCGATCCTAGACAAATG-
TGGAAGAG
TGATAGGACTCTATGGCAATGGGGTTGTGATCAAGAATGGAAGCTATGTTAGTGCTATAACCCAGGGAAAGAGG-
GAGGAGG
AGACTCCGGTTGAATGTTTCGAACCCTCGATGCTGAAGAAGAAGCAGCTAACTGTCTTGGATCTGCATCCAGGA-
GCCGGAAAA
ACCAGGAGAGTTCTTCCTGAAATAGTCCGTGAAGCCATAAAAAAGAGACTCCGGACAGTGATCTTGGCACCAAC-
TAGGGTTGT
CGCTGCTGAGATGGAGGAGGCCTTGAGAGGACTTCCGGTGCGTTACATGACAACAGCAGTCAACGTCACCCATT-
CTGGGACA
GAAATCGTTGATTTGATGTGCCATGCCACTTTCACTTCACGCTTACTACAACCCATCAGAGTCCCTAATTACAA-
TCTCAACATCAT
GGATGAAGCCCACTTCACAGACCCCTCAAGTATAGCTGCAAGAGGATACATATCAACAAGGGTTGAAATGGGCG-
AGGCGGCT
GCCATTTTTATGACTGCCACACCACCAGGAACCCGTGATGCGTTTCCTGACTCTAACTCACCAATCATGGACAC-
AGAAGTGGAA
GTCCCAGAGAGAGCCTGGAGCTCAGGCTTTGATTGGGTGACAGACCATTCTGGGAAAACAGTTTGGTTCGTTCC-
AAGCGTGA
GAAACGGAAATGAAATCGCAGCCTGTCTGACAAAGGCTGGAAAGCGGGTCATACAGCTCAGCAGGAAGACTTTT-
GAGACAGA
ATTTCAGAAAACAAAAAATCAAGAGTGGGACTTTGTCATAACAACTGACATCTCAGAGATGGGCGCCAACTTCA-
AGGCTGACC
GGGTCATAGACTCTAGGAGATGCCTAAAACCAGTCATACTTGATGGTGAGAGAGTCATCTTGGCTGGGCCCATG-
CCTGTCACG
CATGCTAGTGCTGCTCAGAGGAGAGGACGTATAGGCAGGAACCCTAACAAACCTGGAGATGAGTACATGTATGG-
AGGTGGG
TGTGCAGAGACTGATGAAGGCCATGCACACTGGCTTGAAGCAAGAATGCTTCTTGACAACATCTACCTCCAGGA-
TGGCCTCAT
AGCCTCGCTCTATCGGCCTGAGGCCGATAAGGTAGCCGCCATTGAGGGAGAGTTTAAGCTGAGGACAGAGCAAA-
GGAAGAC
CTTCGTGGAACTCATGAAGAGAGGAGACCTTCCCGTCTGGCTAGCCTATCAGGTTGCATCTGCCGGAATAACTT-
ACACAGACA
GAAGATGGTGCTTTGATGGCACAACCAACAACACCATAATGGAAGACAGTGTACCAGCAGAGGTTTGGACAAAG-
TATGGAGA
GAAGAGAGTGCTCAAACCGAGATGGATGGATGCTAGGGTCTGTTCAGACCATGCGGCCCTGAAGTCGTTCAAAG-
AATTCGCC
GCTGGAAAAAGAGGAGCGGCTTTGGGAGTAATGGAGGCCCTGGGAACACTGCCAGGACACATGACAGAGAGGTT-
TCAGGAA
GCCATTGACAACCTCGCCGTGCTCATGCGAGCAGAGACTGGAAGCAGGCCTTATAAGGCAGCGGCAGCCCAACT-
GCCGGAGA
CCCTAGAGACCATTATGCTCTTAGGTTTGCTGGGAACAGTTTCACTGGGGATCTTCTTCGTCTTGATGCGGAAT-
AAGGGCATCG
GGAAGATGGGCTTTGGAATGGTAACCCTTGGGGCCAGTGCATGGCTCATGTGGCTTTCGGAAATTGAACCAGCC-
AGAATTGC
ATGTGTCCTCATTGTTGTGTTTTTATTACTGGTGGTGCTCATACCCGAGCCAGAGAAGCAAAGATCTCCCCAAG-
ATAACCAGAT
GGCAATTATCATCATGGTGGCAGTGGGCCTTCTAGGTTTGATAACTGCAAACGAACTTGGATGGCTGGAAAGAA-
CAAAAAAT
GACATAGCTCATCTAATGGGAAGGAGAGAAGAAGGAGCAACCATGGGATTCTCAATGGACATTGATCTGCGGCC-
AGCCTCCG
CCTGGGCTATCTATGCCGCATTGACAACTCTCATCACCCCAGCTGTCCAACATGCGGTAACCACTTCATACAAC-
AACTACTCCTT
AATGGCGATGGCCACACAAGCTGGAGTGCTGTTTGGCATGGGCAAAGGGATGCCATTTATGCATGGGGACCTTG-
GAGTCCCG
CTGCTAATGATGGGTTGCTATTCACAATTAACACCCCTGACTCTGATAGTAGCTATCATTCTGCTTGTGGCGCA-
CTACATGTACT
TGATCCCAGGCCTACAAGCGGCAGCAGCGCGTGCTGCCCAGAAAAGGACAGCAGCTGGCATCATGAAGAATCCC-
GTTGTGGA
TGGAATAGTGGTAACTGACATTGACACAATGACAATAGACCCCCAGGTGGAGAAGAAGATGGGACAAGTGTTAC-
TCATAGCA
GTAGCCATCTCCAGTGCTGTGCTGCTGCGGACCGCCTGGGGATGGGGGGAGGCTGGAGCTCTGATCACAGCAGC-
GACCTCCA
CCTTGTGGGAAGGCTCTCCAAACAAATACTGGAACTCCTCTACAGCCACCTCACTGTGCAACATCTTCAGAGGA-
AGCTATCTGG
CAGGAGCTTCCCTTATCTATACAGTGACGAGAAACGCTGGCCTGGTTAAGAGACGTGGAGGTGGGACGGGAGAG-
ACTCTGG
GAGAGAAGTGGAAAGCTCGTCTGAATCAGATGTCGGCCCTGGAGTTCTACTCTTATAAAAAGTCAGGTATCACT-
GAAGTGTGT
AGAGAGGAGGCTCGCCGTGCCCTCAAGGATGGAGTGGCCACAGGAGGACATGCCGTATCCCGGGGAAGTGCAAA-
GATCAGA
TGGTTGGAGGAGAGAGGATATCTGCAGCCCTATGGGAAGGTTGTTGACCTCGGATGTGGCAGAGGGGGCTGGAG-
CTATTAT
GCCGCCACCATCCGCAAAGTGCAGGAGGTGAGAGGATACACAAAGGGAGGTCCCGGTCATGAAGAACCCATGCT-
GGTGCAA
AGCTATGGGTGGAACATAGTTCGTCTCAAGAGTGGAGTGGACGTCTTCCACATGGCGGCTGAGCCGTGTGACAC-
TCTGCTGTG
TGACATAGGTGAGTCATCATCTAGTCCTGAAGTGGAAGAGACACGAACACTCAGAGTGCTCTCTATGGTGGGGG-
ACTGGCTT
GAAAAAAGACCAGGGGCCTTCTGTATAAAGGTGCTGTGCCCATACACCAGCACTATGATGGAAACCATGGAGCG-
ACTGCAAC
GTAGGCATGGGGGAGGATTAGTCAGAGTGCCATTGTGTCGCAACTCCACACATGAGATGTACTGGGTCTCTGGG-
GCAAAGAG
CAACATCATAAAAAGTGTGTCCACCACAAGTCAGCTCCTCCTGGGACGCATGGATGGCCCCAGGAGGCCAGTGA-
AATATGAG
GAGGATGTGAACCTCGGCTCGGGTACACGAGCTGTGGCAAGCTGTGCTGAGGCTCCTAACATGAAAATCATCGG-
CAGGCGCA
TTGAGAGAATCCGCAATGAACATGCAGAAACATGGTTTCTTGATGAAAACCACCCATACAGGACATGGGCCTAC-
CATGGGAGC
TACGAAGCCCCCACGCAAGGATCAGCGTCTTCCCTCGTGAACGGGGTTGTTAGACTCCTGTCAAAGCCTTGGGA-
CGTGGTGAC
TGGAGTTACAGGAATAGCCATGACTGACACCACACCATACGGCCAACAAAGAGTCTTCAAAGAAAAAGTGGACA-
CCAGGGTG
CCAGATCCCCAAGAAGGCACTCGCCAGGTAATGAACATAGTCTCTTCCTGGCTGTGGAAGGAGCTGGGGAAACG-
CAAGCGGC
CACGCGTCTGCACCAAAGAAGAGTTTATCAACAAGGTGCGCAGCAATGCAGCACTGGGAGCAATATTTGAAGAG-
GAAAAAGA
ATGGAAGACGGCTGTGGAAGCTGTGAATGATCCAAGGTTTTGGGCCCTAGTGGATAGGGAGAGAGAACACCACC-
TGAGAGG
AGAGTGTCACAGCTGTGTGTACAACATGATGGGAAAAAGAGAAAAGAAGCAAGGAGAGTTCGGGAAAGCAAAAG-
GTAGCC
GCGCCATCTGGTACATGTGGTTGGGAGCCAGATTCTTGGAGTTTGAAGCCCTTGGATTCTTGAACGAGGACCAT-
TGGATGGGA
AGAGAAAACTCAGGAGGTGGAGTCGAAGGGTTAGGATTGCAAAGACTTGGATACATTCTAGAAGAAATGAATCG-
GGCACCA
GGAGGAAAGATGTACGCAGATGACACTGCTGGCTGGGACACCCGCATTAGTAAGTTTGATCTGGAGAATGAAGC-
TCTGATTA
CCAACCAAATGGAGGAAGGGCACAGAACTCTGGCGTTGGCCGTGATTAAATACACATACCAAAACAAAGTGGTG-
AAGGTTCT
CAGACCAGCTGAAGGAGGAAAAACAGTTATGGACATCATTTCAAGACAAGACCAGAGAGGGAGTGGACAAGTTG-
TCACTTAT
GCTCTCAACACATTCACCAACTTGGTGGTGCAGCTTATCCGGAACATGGAAGCTGAGGAAGTGTTAGAGATGCA-
AGACTTATG
GTTGTTGAGGAAGCCAGAGAAAGTGACCAGATGGTTGCAGAGCAATGGATGGGATAGACTCAAACGAATGGCGG-
TCAGTGG
AGATGACTGCGTTGTGAAGCCAATCGATGATAGGTTTGCACATGCCCTCAGGTTCTTGAATGACATGGGAAAAG-
TTAGGAAAG
ACACACAGGAGTGGAAACCCTCGACTGGATGGAGCAATTGGGAAGAAGTCCCGTTCTGCTCCCACCACTTCAAC-
AAGCTGTAC
CTCAAGGATGGGAGATCCATTGTGGTCCCTTGCCGCCACCAAGATGAACTGATTGGCCGAGCTCGCGTCTCACC-
AGGGGCAG
GATGGAGCATCCGGGAGACTGCCTGTCTTGCAAAATCATATGCGCAGATGTGGCAGCTCCTTTATTTCCACAGA-
AGAGACCTT
CGACTGATGGCTAATGCCATTTGCTCGGCTGTGCCAGTTGACTGGGTACCAACTGGGAGAACCACCTGGTCAAT-
CCATGGAAA
GGGAGAATGGATGACCACTGAGGACATGCTCATGGTGTGGAATAGAGTGTGGATTGAGGAGAACGACCATATGG-
AGGACAA
GACTCCTGTAACAAAATGGACAGACATTCCCTATCTAGGAAAAAGGGAGGACTTATGGTGTGGATCCCTTATAG-
GGCACAGAC
CCCGCACCACTTGGGCTGAAAACATCAAAGACACAGTCAACATGGTGCGCAGGATCATAGGTGATGAAGAAAAG-
TACATGGA
CTATCTATCCACCCAAGTCCGCTACTTGGGTGAGGAAGGGTCCACACCCGGAGTGTTGTAAGCACCAATTTTAG-
TGTTGTCAGG
CCTGCTAGTCAGCCACAGTTTGGGGAAAGCTGTGCAGCCTGTAACCCCCCCAGGAGAAGCTGGGAAACCAAGCT-
CATAGTCA
GGCCGAGAACGCCATGGCACGGAAGAAGCCATGCTGCCTGTGAGCCCCTCAGAGGACACTGAGTCAAAAAACCC-
CACGCGCT
TGGAAGCGCAGGATGGGAAAAGAAGGTGGCGACCTTCCCCACCCTTCAATCTGGGGCCTGAACTGGAGACTAGC-
TGTGAATC
TCCAGCAGAGGGACTAGTGGTTAGAGGAGACCCCCCGGAAAACGCAAAACAGCATATTGACGTGGGAAAGACCA-
GAGACTC
CATGAGTTTCCACCACGCTGGCCGCCAGGCACAGATCGCCGAACTTCGGCGGCCGGTGTGGGGAAATCCATGGT-
TTCT KJ776791.1, Zika virus strain H/PF/2013 polyprotein gene,
complete cds SEQ ID NO: 13
AGTATCAACAGGTTTTATTTTGGATTTGGAAACGAGAGTTTCTGGTCATGAAAAACCCAAAAAAGAAATCCGGA-
GGATTCCGG
ATTGTCAATATGCTAAAACGCGGAGTAGCCCGTGTGAGCCCCTTTGGGGGCTTGAAGAGGCTGCCAGCCGGACT-
TCTGCTGG
GTCATGGGCCCATCAGGATGGTCTTGGCGATTCTAGCCTTTTTGAGATTCACGGCAATCAAGCCATCACTGGGT-
CTCATCAATA
GATGGGGTTCAGTGGGGAAAAAAGAGGCTATGGAAATAATAAAGAAGTTCAAGAAAGATCTGGCTGCCATGCTG-
AGAATAA
TCAATGCTAGGAAGGAGAAGAAGAGACGAGGCGCAGATACTAGTGTCGGAATTGTTGGCCTCCTGCTGACCACA-
GCTATGGC
AGCGGAGGTCACTAGACGTGGGAGTGCATACTATATGTACTTGGACAGAAACGACGCTGGGGAGGCCATATCTT-
TTCCAACC
ACATTGGGGATGAATAAGTGTTATATACAGATCATGGATCTTGGACACATGTGTGATGCCACCATGAGCTATGA-
ATGCCCTAT
GCTGGATGAGGGGGTGGAACCAGATGACGTCGATTGTTGGTGCAACACGACGTCAACTTGGGTTGTGTACGGAA-
CCTGCCAT
CACAAAAAAGGTGAAGCACGGAGATCTAGAAGAGCTGTGACGCTCCCCTCCCATTCCACTAGGAAGCTGCAAAC-
GCGGTCGC
AAACCTGGTTGGAATCAAGAGAATACACAAAGCACTTGATTAGAGTCGAAAATTGGATATTCAGGAACCCTGGC-
TTCGCGTTA
GCAGCAGCTGCCATCGCTTGGCTTTTGGGAAGCTCAACGAGCCAAAAAGTCATATACTTGGTCATGATACTGCT-
GATTGCCCCG
GCATACAGCATCAGGTGCATAGGAGTCAGCAATAGGGACTTTGTGGAAGGTATGTCAGGTGGGACTTGGGTTGA-
TGTTGTCT
TGGAACATGGAGGTTGTGTCACCGTAATGGCACAGGACAAACCGACTGTCGACATAGAGCTGGTTACAACAACA-
GTCAGCAA
CATGGCGGAGGTAAGATCCTACTGCTATGAGGCATCAATATCGGACATGGCTTCGGACAGCCGCTGCCCAACAC-
AAGGTGAA
GCCTACCTTGACAAGCAATCAGACACTCAATATGTCTGCAAAAGAACGTTAGTGGACAGAGGCTGGGGAAATGG-
ATGTGGAC
TTTTTGGCAAAGGGAGCCTGGTGACATGCGCTAAGTTTGCATGCTCCAAGAAAATGACCGGGAAGAGCATCCAG-
CCAGAGAA
TCTGGAGTACCGGATAATGCTGTCAGTTCATGGCTCCCAGCACAGTGGGATGATCGTTAATGACACAGGACATG-
AAACTGATG
AGAATAGAGCGAAGGTTGAGATAACGCCCAATTCACCAAGAGCCGAAGCCACCCTGGGGGGTTTTGGAAGCCTA-
GGACTTGA
TTGTGAACCGAGGACAGGCCTTGACTTTTCAGATTTGTATTACTTGACTATGAATAACAAGCACTGGTTGGTTC-
ACAAGGAGTG
GTTCCACGACATTCCATTACCTTGGCACGCTGGGGCAGACACCGGAACTCCACACTGGAACAACAAAGAAGCAC-
TGGTAGAGT
TCAAGGACGCACATGCCAAAAGGCAAACTGTCGTGGTTCTAGGGAGTCAAGAAGGAGCAGTTCACACGGCCCTT-
GCTGGAGC
TCTGGAGGCTGAGATGGATGGTGCAAAGGGAAGGCTGTCCTCTGGCCACTTGAAATGTCGCCTGAAAATGGATA-
AACTTAGA
TTGAAGGGCGTGTCATACTCCTTGTGTACCGCAGCGTTCACATTCACCAAGATCCCGGCTGAAACACTGCACGG-
GACAGTCAC
AGTGGAGGTACAGTACGCAGGGACAGATGGACCTTGCAAGGTTCCAGCTCAGATGGCGGTGGACATGCAAACTC-
TGACCCCA
GTTGGGAGGTTGATAACCGCTAACCCCGTAATCACTGAAAGCACTGAGAACTCTAAGATGATGCTGGAACTTGA-
TCCACCATT
TGGGGACTCTTACATTGTCATAGGAGTCGGGGAGAAGAAGATCACCCACCACTGGCACAGGAGTGGCAGCACCA-
TTGGAAAA
GCATTTGAAGCCACTGTGAGAGGTGCCAAGAGAATGGCAGTCTTGGGAGACACAGCCTGGGACTTTGGATCAGT-
TGGAGGCG
CTCTCAACTCATTGGGCAAGGGCATCCATCAAATTTTTGGAGCAGCTTTCAAATCATTGTTTGGAGGAATGTCC-
TGGTTCTCACA
AATTCTCATTGGAACGTTGCTGATGTGGTTGGGTCTGAACACAAAGAATGGATCTATTTCCCTTATGTGCTTGG-
CCTTAGGGGG
AGTGTTGATCTTCTTATCCACAGCTGTCTCTGCTGATGTGGGGTGCTCGGTGGACTTCTCAAAGAAGGAGACGA-
GATGCGGTA
CAGGGGTGTTCGTCTATAACGACGTTGAAGCCTGGAGGGACAGGTACAAGTACCATCCTGACTCCCCCCGTAGA-
TTGGCAGCA
GCAGTCAAGCAAGCCTGGGAAGATGGTATCTGTGGGATCTCCTCTGTTTCAAGAATGGAAAACATCATGTGGAG-
ATCAGTAG
AAGGGGAGCTCAACGCAATCCTGGAAGAGAATGGAGTTCAACTGACGGTCGTTGTGGGATCTGTAAAAAACCCC-
ATGTGGAG
AGGTCCACAGAGATTGCCCGTGCCTGTGAACGAGCTGCCCCACGGCTGGAAGGCTTGGGGGAAATCGTACTTCG-
TCAGAGCA
GCAAAGACAAATAACAGCTTTGTCGTGGATGGTGACACACTGAAGGAATGCCCACTCAAACATAGAGCATGGAA-
CAGCTTTCT
TGTGGAGGATCATGGGTTCGGGGTATTTCACACTAGTGTCTGGCTCAAGGTTAGAGAAGATTATTCATTAGAGT-
GTGATCCAG
CCGTTATTGGAACAGCTGTTAAGGGAAAGGAGGCTGTACACAGTGATCTAGGCTACTGGATTGAGAGTGAGAAG-
AATGACAC
ATGGAGGCTGAAGAGGGCCCATCTGATCGAGATGAAAACATGTGAATGGCCAAAGTCCCACACATTGTGGACAG-
ATGGAATA
GAAGAGAGTGATCTGATCATACCCAAGTCTTTAGCTGGGCCACTCAGCCATCACAATACCAGAGAGGGCTACAG-
GACCCAAAT
GAAAGGGCCATGGCACAGTGAAGAGCTTGAAATTCGGTTTGAGGAATGCCCAGGCACTAAGGTCCACGTGGAGG-
AAACATG
TGGAACAAGAGGACCATCTCTGAGATCAACCACTGCAAGCGGAAGGGTGATCGAGGAATGGTGCTGCAGGGAGT-
GCACAAT
GCCCCCACTGTCGTTCCGGGCTAAAGATGGCTGTTGGTATGGAATGGAGATAAGGCCCAGGAAAGAACCAGAAA-
GTAACTTA
GTAAGGTCAATGGTGACTGCAGGATCAACTGATCACATGGATCACTTCTCCCTTGGAGTGCTTGTGATTCTGCT-
CATGGTGCAG
GAAGGGCTGAAGAAGAGAATGACCACAAAGATCATCATAAGCACATCGATGGCAGTGCTGGTAGCTATGATCCT-
GGGAGGAT
TTTCAATGAGTGACCTGGCTAAGCTTGCAATTTTGATGGGTGCCACCTTCGCGGAAATGAACACTGGAGGAGAT-
GTAGCTCAT
CTGGCGCTGATAGCGGCATTCAAAGTCAGACCAGCGTTGCTGGTATCTTTCATCTTCAGAGCTAATTGGACACC-
CCGTGAAAGC
ATGCTGCTGGCCTTGGCCTCGTGTCTTTTGCAAACTGCGATCTCCGCCTTGGAAGGCGACCTGATGGTTCTCAT-
CAATGGTTTT
GCTTTGGCCTGGTTGGCAATACGAGCGATGGTTGTTCCACGCACTGATAACATCACCTTGGCAATCCTGGCTGC-
TCTGACACCA
CTGGCCCGGGGCACACTGCTTGTGGCGTGGAGAGCAGGCCTTGCTACTTGCGGGGGGTTTATGCTCCTCTCTCT-
GAAGGGAA
AAGGCAGTGTGAAGAAGAACTTACCATTTGTCATGGCCCTGGGACTAACCGCTGTGAGGCTGGTCGACCCCATC-
AACGTGGTG
GGACTGCTGTTGCTCACAAGGAGTGGGAAGCGGAGCTGGCCCCCTAGCGAAGTACTCACAGCTGTTGGCCTGAT-
ATGCGCAT
TGGCTGGAGGGTTCGCCAAGGCAGATATAGAGATGGCTGGGCCCATGGCCGCGGTCGGTCTGCTAATTGTCAGT-
TACGTGGT
CTCAGGAAAGAGTGTGGACATGTACATTGAAAGAGCAGGTGACATCACATGGGAAAAAGATGCGGAAGTCACTG-
GAAACAG
TCCCCGGCTCGATGTGGCGCTAGATGAGAGTGGTGATTTCTCCCTGGTGGAGGATGACGGTCCCCCCATGAGAG-
AGATCATAC
TCAAGGTGGTCCTGATGACCATCTGTGGCATGAACCCAATAGCCATACCCTTTGCAGCTGGAGCGTGGTACGTA-
TACGTGAAG
ACTGGAAAAAGGAGTGGTGCTCTATGGGATGTGCCTGCTCCCAAGGAAGTAAAAAAGGGGGAGACCACAGATGG-
AGTGTAC
AGAGTAATGACTCGTAGACTGCTAGGTTCAACACAAGTTGGAGTGGGAGTTATGCAAGAGGGGGTCTTTCACAC-
TATGTGGC
ACGTCACAAAAGGATCCGCGCTGAGAAGCGGTGAAGGGAGACTTGATCCATACTGGGGAGATGTCAAGCAGGAT-
CTGGTGT
CATACTGTGGTCCATGGAAGCTAGATGCCGCCTGGGACGGGCACAGCGAGGTGCAGCTCTTGGCCGTGCCCCCC-
GGAGAGAG
AGCGAGGAACATCCAGACTCTGCCCGGAATATTTAAGACAAAGGATGGGGACATTGGAGCGGTTGCGCTGGATT-
ACCCAGCA
GGAACTTCAGGATCTCCAATCCTAGACAAGTGTGGGAGAGTGATAGGACTTTATGGCAATGGGGTCGTGATCAA-
AAATGGGA
GTTATGTTAGTGCCATCACCCAAGGGAGGAGGGAGGAAGAGACTCCTGTTGAGTGCTTCGAGCCTTCGATGCTG-
AAGAAGAA
GCAGCTAACTGTCTTAGACTTGCATCCTGGAGCTGGGAAAACCAGGAGAGTTCTTCCTGAAATAGTCCGTGAAG-
CCATAAAAA
CAAGACTCCGTACTGTGATCTTAGCTCCAACCAGGGTTGTCGCTGCTGAAATGGAGGAAGCCCTTAGAGGGCTT-
CCAGTGCGT
TATATGACAACAGCAGTCAATGTCACCCACTCTGGAACAGAAATCGTCGACTTAATGTGCCATGCCACCTTCAC-
TTCACGTCTAC
TACAGCCAATCAGAGTCCCCAACTATAATCTGTATATTATGGATGAGGCCCACTTCACAGATCCCTCAAGTATA-
GCAGCAAGAG
GATACATTTCAACAAGGGTTGAGATGGGCGAGGCGGCTGCCATCTTCATGACCGCCACGCCACCAGGAACCCGT-
GACGCATTT
CCGGACTCCAACTCACCAATTATGGACACCGAAGTGGAAGTCCCAGAGAGAGCCTGGAGCTCAGGCTTTGATTG-
GGTGACGG
ATCATTCTGGAAAAACAGTTTGGTTTGTTCCAAGCGTGAGGAACGGCAATGAGATCGCAGCTTGTCTGACAAAG-
GCTGGAAAA
CGGGTCATACAGCTCAGCAGAAAGACTTTTGAGACAGAGTTCCAGAAAACAAAACATCAAGAGTGGGACTTTGT-
CGTGACAA
CTGACATTTCAGAGATGGGCGCCAACTTTAAAGCTGACCGTGTCATAGATTCCAGGAGATGCCTAAAGCCGGTC-
ATACTTGAT
GGCGAGAGAGTCATTCTGGCTGGACCCATGCCTGTCACACATGCCAGCGCTGCCCAGAGGAGGGGGCGCATAGG-
CAGGAAT
CCCAACAAACCTGGAGATGAGTATCTGTATGGAGGTGGGTGCGCAGAGACTGACGAAGACCATGCACACTGGCT-
TGAAGCAA
GAATGCTCCTTGACAATATTTACCTCCAAGATGGCCTCATAGCCTCGCTCTATCGACCTGAGGCCGACAAAGTA-
GCAGCCATTG
AGGGAGAGTTCAAGCTTAGGACGGAGCAAAGGAAGACCTTTGTGGAACTCATGAAAAGAGGAGATCTTCCTGTT-
TGGCTGGC
CTATCAGGTTGCATCTGCCGGAATAACCTACACAGATAGAAGATGGTGCTTTGATGGCACGACCAACAACACCA-
TAATGGAAG
ACAGTGTGCCGGCAGAGGTGTGGACCAGACACGGAGAGAAAAGAGTGCTCAAACCGAGGTGGATGGACGCCAGA-
GTTTGTT
CAGATCATGCGGCCCTGAAGTCATTCAAGGAGTTTGCCGCTGGGAAAAGAGGAGCGGCTTTTGGAGTGATGGAA-
GCCCTGGG
AACACTGCCAGGACACATGACAGAGAGATTCCAGGAAGCCATTGACAACCTCGCTGTGCTCATGCGGGCAGAGA-
CTGGAAGC
AGGCCTTACAAAGCCGCGGCGGCCCAATTGCCGGAGACCCTAGAGACCATTATGCTTTTGGGGTTGCTGGGAAC-
AGTCTCGCT
GGGAATCTTTTTCGTCTTGATGAGGAACAAGGGCATAGGGAAGATGGGCTTTGGAATGGTGACTCTTGGGGCCA-
GCGCATGG
CTCATGTGGCTCTCGGAAATTGAGCCAGCCAGAATTGCATGTGTCCTCATTGTTGTGTTCCTATTGCTGGTGGT-
GCTCATACCTG
AGCCAGAAAAGCAAAGATCTCCCCAGGACAACCAAATGGCAATCATCATCATGGTAGCAGTAGGTCTTCTGGGC-
TTGATTACC
GCCAATGAACTCGGATGGTTGGAGAGAACAAAGAGTGACCTAAGCCATCTAATGGGAAGGAGAGAGGAGGGGGC-
AACCATA
GGATTCTCAATGGACATTGACCTGCGGCCAGCCTCAGCTTGGGCCATCTATGCTGCCTTGACAACTTTCATTAC-
CCCAGCCGTCC
AACATGCAGTGACCACTTCATACAACAACTACTCCTTAATGGCGATGGCCACGCAAGCTGGAGTGTTGTTTGGT-
ATGGGCAAA
GGGATGCCATTCTACGCATGGGACTTTGGAGTCCCGCTGCTAATGATAGGTTGCTACTCACAATTAACACCCCT-
GACCCTAATA
GTGGCCATCATTTTGCTCGTGGCGCACTACATGTACTTGATCCCAGGGCTGCAGGCAGCAGCTGCGCGTGCTGC-
CCAGAAGAG
AACGGCAGCTGGCATCATGAAGAACCCTGTTGTGGATGGAATAGTGGTGACTGACATTGACACAATGACAATTG-
ACCCCCAA
GTGGAGAAAAAGATGGGACAGGTGCTACTCATAGCAGTAGCCGTCTCCAGCGCCATACTGTCGCGGACCGCCTG-
GGGGTGG
GGGGAGGCTGGGGCCCTGATCACAGCGGCAACTTCCACTTTGTGGGAAGGCTCTCCGAACAAGTACTGGAACTC-
CTCTACAGC
CACTTCACTGTGTAACATTTTTAGGGGAAGTTACTTGGCTGGAGCTTCTCTAATCTACACAGTAACAAGAAACG-
CTGGCTTGGT
CAAGAGACGTGGGGGTGGAACAGGAGAGACCCTGGGAGAGAAATGGAAGGCCCGCTTGAACCAGATGTCGGCCC-
TGGAGT
TCTACTCCTACAAAAAGTCAGGCATCACCGAGGTGTGCAGAGAAGAGGCCCGCCGCGCCCTCAAGGACGGTGTG-
GCAACGGG
AGGCCATGCTGTGTCCCGAGGAAGTGCAAAGCTGAGATGGTTGGTGGAGCGGGGATACCTGCAGCCCTATGGAA-
AGGTCATT
GATCTTGGATGTGGCAGAGGGGGCTGGAGTTACTACGCCGCCACCATCCGCAAAGTTCAAGAAGTGAAAGGATA-
CACAAAAG
GAGGCCCTGGTCATGAAGAACCCATGTTGGTGCAAAGCTATGGGTGGAACATAGTCCGTCTTAAGAGTGGGGTG-
GACGTCTT
TCATATGGCGGCTGAGCCGTGTGACACGTTGCTGTGTGACATAGGTGAGTCATCATCTAGTCCTGAAGTGGAAG-
AAGCACGG
ACGCTCAGAGTCCTCTCCATGGTGGGGGATTGGCTTGAAAAAAGACCAGGAGCCTTTTGTATAAAAGTGTTGTG-
CCCATACAC
CAGCACTATGATGGAAACCCTGGAGCGACTGCAGCGTAGGTATGGGGGAGGACTGGTCAGAGTGCCACTCTCCC-
GCAACTCT
ACACATGAGATGTACTGGGTCTCTGGAGCGAAAAGCAACACCATAAAAAGTGTGTCCACCACGAGCCAGCTCCT-
CTTGGGGC
GCATGGACGGGCCCAGGAGGCCAGTGAAATATGAGGAGGATGTGAATCTCGGCTCTGGCACGCGGGCTGTGGTA-
AGCTGCG
CTGAAGCTCCCAACATGAAGATCATTGGTAACCGCATTGAAAGGATCCGCAGTGAGCACGCGGAAACGTGGTTC-
TTTGACGA
GAACCACCCATATAGGACATGGGCTTACCATGGAAGCTATGAGGCCCCCACACAAGGGTCAGCGTCCTCTCTAA-
TAAACGGGG
TTGTCAGGCTCCTGTCAAAACCCTGGGATGTGGTGACTGGAGTCACAGGAATAGCCATGACCGACACCACACCG-
TATGGTCAG
CAAAGAGTTTTCAAGGAAAAAGTGGACACTAGGGTGCCAGACCCCCAAGAAGGCACTCGTCAGGTTATGAGCAT-
GGTCTCTTC
CTGGTTGTGGAAAGAGCTAGGCAAACACAAACGGCCACGAGTCTGTACCAAAGAAGAGTTCATCAACAAGGTTC-
GTAGCAAT
GCAGCATTAGGGGCAATATTTGAAGAGGAAAAAGAGTGGAAGACTGCAGTGGAAGCTGTGAACGATCCAAGGTT-
CTGGGCT
CTAGTGGACAAGGAAAGAGAGCACCACCTGAGAGGAGAGTGCCAGAGTTGTGTGTACAACATGATGGGAAAAAG-
AGAAAAG
AAACAAGGGGAATTTGGAAAGGCCAAGGGCAGCCGCGCCATCTGGTATATGTGGCTAGGGGCTAGATTTCTAGA-
GTTCGAAG
CCCTTGGATTCTTGAACGAGGATCACTGGATGGGGAGAGAGAACTCAGGAGGTGGTGTTGAAGGGCTGGGATTA-
CAAAGAC
TCGGATATGTCCTAGAAGAGATGAGTCGCATACCAGGAGGAAGGATGTATGCAGATGACACTGCTGGCTGGGAC-
ACCCGCAT
CAGCAGGTTTGATCTGGAGAATGAAGCTCTAATCACCAACCAAATGGAGAAAGGGCACAGGGCCTTGGCATTGG-
CCATAATC
AAGTACACATACCAAAACAAAGTGGTAAAGGTCCTTAGACCAGCTGAAAAAGGGAAGACAGTTATGGACATTAT-
TTCGAGAC
AAGACCAAAGGGGGAGCGGACAAGTTGTCACTTACGCTCTTAACACATTTACCAACCTAGTGGTGCAACTCATT-
CGGAATATG
GAGGCTGAGGAAGTTCTAGAGATGCAAGACTTGTGGCTGCTGCGGAGGTCAGAGAAAGTGACCAACTGGTTGCA-
GAGCAAC
GGATGGGATAGGCTCAAACGAATGGCAGTCAGTGGAGATGATTGCGTTGTGAAGCCAATTGATGATAGGTTTGC-
ACATGCCC
TCAGGTTCTTGAATGATATGGGAAAAGTTAGGAAGGACACACAAGAGTGGAAACCCTCAACTGGATGGGACAAC-
TGGGAAG
AAGTTCCGTTTTGCTCCCACCACTTCAACAAGCTCCATCTCAAGGACGGGAGGTCCATTGTGGTTCCCTGCCGC-
CACCAAGATG
AACTGATTGGCCGGGCCCGCGTCTCTCCAGGGGCGGGATGGAGCATCCGGGAGACTGCTTGCCTAGCAAAATCA-
TATGCGCA
AATGTGGCAGCTCCTTTATTTCCACAGAAGGGACCTCCGACTGATGGCCAATGCCATTTGTTCATCTGTGCCAG-
TTGACTGGGT
TCCAACTGGGAGAACTACCTGGTCAATCCATGGAAAGGGAGAATGGATGACCACTGAAGACATGCTTGTGGTGT-
GGAACAGA
GTGTGGATTGAGGAGAACGACCACATGGAAGACAAGACCCCAGTTACGAAATGGACAGACATTCCCTATTTGGG-
AAAAAGGG
AAGACTTGTGGTGTGGATCTCTCATAGGGCACAGACCGCGCACCACCTGGGCTGAGAACATTAAAAACACAGTC-
AACATGGT
GCGCAGGATCATAGGTGATGAAGAAAAGTACATGGACTACCTATCCACCCAAGTTCGCTACTTGGGTGAAGAAG-
GGTCTACA
CCTGGAGTGCTGTAAGCACCAATCTTAGTGTTGTCAGGCCTGCTAGTCAGCCACAGCTTGGGGAAAGCTGTGCA-
GCCTGTGAC
CCCCCCAGGAGAAGCTGGGAAACCAAGCCTATAGTCAGGCCGAGAACGCCATGGCACGGAAGAAGCCATGCTGC-
CTGTGAGC
CCCTCAGAGGACACTGAGTCAAAAAACCCCACGCGCTTGGAGGCGCAGGATGGGAAAAGAAGGTGGCGACCTTC-
CCCACCCT
TCAATCTGGGGCCTGAACTGGAGATCAGCTGTGGATCTCCAGAAGAGGGACTAGTGGTTAGAGGAG
In some embodiments, the Zika virus has a RNA genome corresponding
to the DNA sequence provided by the nucleic acid sequence of any
one of SEQ ID NOs: 2-13 or 73. In some embodiments, the Zika virus
has a variant genome that is at least 80%, 81%, 82%, 83%, 84%, 85%,
86%, 87%, 88%, 89%, 90%, 91%, 92%. 93%, 94%, 95%, 96%, 97%, 98%,
99%, 99.5%, 99.6%, 99.7%, 99.8% or 99.9% identical to any one of
SEQ ID NOs: 2-13 or 78.
Provided below are amino acid sequences of the E-proteins of Zika
strains that may be used in the methods, compositions, and/or
vaccines described herein.
TABLE-US-00006
isol-ARB15076.AHF49784.1.Central_African_Republic/291-788
Flavivirus envelope glycoprotein E. SEQ ID NO: 14
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDENRA-
KVEVTPNSPRA
EATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVEFKDAHA-
KRQTVVVLGS
QEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTVEVQYAG-
TDGPCKVPAQM
AVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEATVRGAK-
RMAVLGDTAWDF
GSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLSTAVSA
isol-IbH_30656.AEN75265.1.Nigeria/291-788 Flavivirus envelope
glycoprotein E. SEQ ID NO: 15
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDENRA-
KVEVTPNSPRA
EATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHSGADTETPHWNNKEALVEFKDAHA-
KRQTVVVLGS
QEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTVEVQYAG-
RDGPCKVPAQM
AVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSIIGKAFEATVRGAK-
RMAVLGDTAWDF
GSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLSTAVSA
ArB1362.AHL43500.1.-/291-794 Flavivirus envelope glycoprotein E.
SEQ ID NO: 16
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDXXXX-
XXXNRAEVEVT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLG
DTAWDFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLS-
TAVSA ArD128000.AHL43502.1.-/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 17
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMXXXXXGHE-
TDENRAKVEVT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHRLVRKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWLKKGSSIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGVFNSLGKGVHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLST-
AVSA ArD158095.AHL43505.1.-/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 18
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDIGHE-
TDENRAKVEVT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLG
DTAWDFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLS-
TAVSA ArD158084.AHL43504.1.-/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 19
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDIGHE-
TDENRAKVEVT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLG
DTAWDFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLS-
TAVSA isol-ARB13565.AHF49783.1.Central_African_Republic/291-794
Flavivirus envelope glycoprotein E. SEQ ID NO: 20
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDIGHE-
TDENRAKVEVT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLG
DTAWDFGSVGGVFNSLGKGVHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLS-
TAVSA isol-ARB7701.AHF49785.1.Central_African_Republic/291-794
Flavivirus envelope glycoprotein E. SEQ ID NO: 21
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDIGHE-
TDENRAKVEVT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLG
DTAWDFGSVGGVFNSLGKGVHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLS-
TAVSA isol-ArD_41519.AEN75266.1.Senegal/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 22
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEVT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLG
DTAWDFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLS-
TAVSA MR766-NIID.BAP47441.1.Uganda/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 23
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMTVNDIGYE-
TDENRAKVEVT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGKLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTV-
EVQYAGTDGPCK
IPVQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLST-
AVSA LC002520.1/326-829 Zika virus genomic RNA, strain: MR766-NIID,
Uganda, Flavivirus envelope glycoprotein E. SEQ ID NO: 24
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMTVNDIGYE-
TDENRAKVEVT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGKLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTV-
EVQYAGTDGPCK
IPVQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLST-
AVSA isol-MR_766.AEN75263.1.Uganda/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 25
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGYE-
TDENRAKVEVT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGKLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTV-
EVQYAGTDGPCK
IPVQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLST-
AVSA ArD7117.AHL43501.1.-/291-794 Flavivirus envelope glycoprotein
E. SEQ ID NO: 26
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDIGHE-
TDENRAKVEVT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAVCTAAKVPAETLHGTVTV-
EVQYAGTDGPC
KVPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEA-
TVRGAKRMAVL
GDTAWDFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFL-
STAVSA AY632535.2/326-825 NC_012532.1 Zika virus strain MR 766,
Uganda, Flavivirus envelope glycoprotein E. SEQ ID NO: 27
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIGYETDED-
RAKVEVTPNSPR
AEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVEFKDAH-
AKRQTVVVL
GSQEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTVEVQY-
AGTDGPCKIPVQ
MAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEATVRGA-
KRMAVLGDTAW
DFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLSTAVS-
A MR_766.AAV34151.1.Uganda/291-790 Flavivirus envelope glycoprotein
E. |Q32ZE1|Q32ZE1_9FL SEQ ID NO: 28
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIGYETDED-
RAKVEVTPNSPR
AEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVEFKDAH-
AKRQTVVVL
GSQEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTVEVQY-
AGTDGPCKIPVQ
MAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEATVRGA-
KRMAVLGDTAW
DFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLSTAVS-
A MR_766.YP_009227198.1.Uganda/1-500 envelope protein E [Zika
virus] SEQ ID NO: 29
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIGYETDED-
RAKVEVTPNSPR
AEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVEFKDAH-
AKRQTVVVL
GSQEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTVEVQY-
AGTDGPCKIPVQ
MAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEATVRGA-
KRMAVLGDTAW
DFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLSTAVS-
A KU681081.3/308-811 Zika virus isolate Zika virus/H.
sapiens-tc/THA/2014/ SV0127-14, Thailand, Flavivirus envelope
glycoprotein E. SEQ ID NO: 30
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHTGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITEGTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLG
DTAWDFGSVGGVLNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLS-
TAVSA isol-Zika_virus % H. sapiens-tc % THA % 2014 %
SV0127-_14.AMD61710.1.Thailand/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 31
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHTGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITEGTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLG
DTAWDFGSVGGVLNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLS-
TAVSA CK-ISL_2014.AIC06934.1.Cook_Islands/1-504 Flavivirus envelope
glycoprotein E. (Fragment) OS = Zika virus GN = E PE = 4 SV = 1 SEQ
ID NO: 32
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA
Natal_RGN.AMB18850.1.Brazil:_Rio_Grande_do_Norte,_Natal/291-794
Flavivirus envelope glycoprotein E.] SEQ ID NO: 33
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA isol-Si323.AMC37200.1.Colombia/1-504 Flavivirus envelope
glycoprotein E. SEQ ID NO: 34
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA KU707826.1/317-820 Zika virus isolate SSABR1, Brazil,
Flavivirus envelope glycoprotein E. SEQ ID NO: 35
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA KU509998.1/326-829 Zika virus strain Haiti/1225/2014, Haiti,
Flavivirus envelope glycoprotein E. SEQ ID NO: 36
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA isol-GDZ16001.AML82110.1.China/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 37
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA BeH819015.AMA12085.1.Brazil/291-794 Flavivirus envelope
glycoprotein E.] SEQ ID NO: 38
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA MRS_OPY_Martinique_PaRi_2015.AMC33116.1.Martinique/291-794
Flavivirus envelope glycoprotein E. SEQ ID NO: 39
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA KU501215.1/308-811 Zika virus strain PRVABC59, Puerto Rico,
Flavivirus envelope glycoprotein E. SEQ ID NO: 40
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA Haiti % 1225%2014.AMB37295.1.Haiti/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 41
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA KU527068.1/308-811 Zika virus strain Natal RGN, Brazil: Rio
Grande do Norte, Natal, Flavivirus envelope glycoprotein E. SEQ ID
NO: 42
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA isol-Z1106027.ALX35662.1.Suriname/5-508 Flavivirus envelope
glycoprotein E. SEQ ID NO: 43
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA isol-FLR.AMM39804.1.Colombia:_Barranquilla/291-794 Flavivirus
envelope glycoprotein E. SEQ ID NO: 44
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA PLCal_ZV_isol-From_Vero_E6_cells.AHL37808.1.Canada/254-757
Flavivirus envelope glycoprotein E. SEQ ID NO: 45
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA BeH818995.AMA12084.1.Brazil/291-794 Flavivirus envelope
glycoprotein E. [Zika virus]. SEQ ID NO: 46
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA H/PF/2013.AHZ13508.1.French_Polynesia/291-794 Flavivirus
envelope glycoprotein E. SEQ ID NO: 47
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA PRVABC59.AMC13911.1.Puerto_Rico/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 48
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA KU321639.1/326-829 Zika virus strain ZikaSPH2015, Brazil,
Flavivirus envelope glycoprotein E. SEQ ID NO: 49
IRCIGVSNRDFVEGMSGGTWVDIVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA ZikaSPH2015.ALU33341.1.Brazil/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 50
IRCIGVSNRDFVEGMSGGTWVDIVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA 103344.AMC13912.1.Guatemala/291-794 polyprotein [Zika virus].
103344.AMC13912.1.Guatemala Flavivirus envelope glycoprotein E. SEQ
ID NO: 51
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEIRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA isol-Brazil-ZKV2015.AMD16557.1.Brazil/291-794 Flavivirus
envelope glycoprotein E. SEQ ID NO: 52
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGTQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA KU497555.1/308-811 Zika virus isolate Brazil-ZKV2015,
Flavivirus envelope glycoprotein E. SEQ ID NO: 53
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGTQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA isol-ZJ03.AMM39806.1.China/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 54
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGARRMAVLG
DTAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLS-
TAVSA isol-FSS13025.AFD30972.1.Cambodia/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 55
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA isol-Z1106032.ALX35660.1.Suriname/291-794 Flavivirus envelope
glycoprotein E. [Zika virus] SEQ ID NO: 56
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNAKNGSISLMCLALGGVLIFLST-
AVSA isol-Z1106033.ALX35659.1.Suriname/291-794 Flavivirus envelope
glycoprotein E. [Zika virus] SEQ ID NO: 57
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNAKNGSISLMCLALGGVLIFLST-
AVSA isol-BeH828305.AMK49165.1.Brazil/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 58
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDTQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA isol-GD01.AMK79468.1.China/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 59
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNGTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA isol-Z1106031.ALX35661.1.Suriname/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 60
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VLAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA ACD75819.1.Micronesia/291-794 Flavivirus envelope glycoprotein
E. SEQ ID NO: 61
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPAVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVLIFLST-
AVSA KU681082.3/308-811 Zika virus isolate Zika virus/H.
sapiens-tc/PHL/2012/CPC-0740, Philippines, Flavivirus envelope
glycoprotein E. SEQ ID NO: 62
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVLIFLST-
AVSA isol-Zika_virus % H. sapiens-tc % PHL % 2012 %
CPC-0740.AMD61711.1.Philippines/ 291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 63
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVLIFLST-
AVSA isol-BeH823339.AMK49164.2.Brazil/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 64
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVSTTVSNMAEVRSYCYEATISDIASDSRC-
PTQGEAYLDKQS
DTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHET-
DENRAKVEITP
NSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVEF-
KDAHAKRQT
AVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSA isol-P6-740.AEN75264.1.Malaysia/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 65
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDXGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTV-
EVQYAGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWXRSGSTIGKAFEAT-
VRGAKRMAVLG
DTAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVLIFLS-
TAVSA KU744693.1/326-829 Zika virus isolate VE_Ganxian, China,
Flavivirus envelope glycoprotein E. SEQ ID NO: 66
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTAMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMLVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLAHKEWFHDIPLPWHAGAATGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETVDGTVTV-
EGQYGGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIIGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSG isol-VE_Ganxian.AMK79469.1.China/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 67
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTAMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMLVNDTGHE-
TDENRAKVEIT
PNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLAHKEWFHDIPLPWHAGAATGTPHWNNKEALVE-
FKDAHAKRQT
VVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETVDGTVTV-
EGQYGGTDGPCK
VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGEKKITHHWHRSGSTIGKAFEAT-
VRGAKRMAVLGD
TAWDFGSVGGALNSLGKGIHQIIGAAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLST-
AVSG ArD157995.AHL43503.1.-/291-794 Flavivirus envelope
glycoprotein E. SEQ ID NO: 68
ISCIGVSNRDLVEGMSGGTWVDVVLEHGGCVTEMAQDKPTVDIELVTMTVSNMAEVRSYCYEASLSDMASASRC-
PTQGEPSLDK
QSDTQSVCKRTLGDRGWGNGCGIFGKGSLVTCSKFTCCKKMPGKSIQPENLEYRIMLPVHGSQHSGMIVNDIGH-
ETDENRAKVEV
TPNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALV-
EFKDAHAKRQ
TVVVLGSQEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVT-
VEVQSAGTDGPC
KVPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEA-
TVRGAKRMAVL
GDTAWDFGSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFL-
STAVSA MR_766.ABI54475.1.Uganda/291-788 Flavivirus envelope
glycoprotein E. SEQ ID NO: 69
IRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIELVTTTVSNMAEVRSYCYEASISDMASDSRC-
PTQGEAYLDKQ
SDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFTCSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDENRA-
KVEVTPNSPRA
EATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWFHDIPLPWHAGADTGTPHWNNKEALVEFKDAHA-
KRQTVVVLGS
QEGAVHTALAGALEAEMDGAKGRLFSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKVPAETLHGTVTVEVQYAG-
TDGPCKVPAQM
AVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVGDKKITHHWHRSGSTIGKAFEATVRGAK-
RMAVLGDTAWDF
GSVGGVFNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTLLVWLGLNTKNGSISLTCLALGGVMIFLSTAVSA
5'-(dldC).sub.13-3' SEQ ID NO: 70 dldC dldC dldC dldC dldC dldC
dldC dldC dldC dldC dldC dldC dldC KLK peptide SEQ ID NO: 71
KLKLLLLLKLK
Provided below are examples of nucleic acid sequences of the
genomes of Chikungunya, Japanese Encephalitis and yellow fever
viruses that may be used in the methods, compositions, and/or
vaccines described herein.
TABLE-US-00007 Chikungunya virus strain LR2006_OPY1, complete
genome ACCESSION: DQ443544 SEQ ID NO: 72
ATGGCTGCGTGAGACACACGTAGCCTACCAGTTTCTTACTGCTCTACTCTGCAAAGCAAGAGATTAATAACCCA-
TCATGGATC
CTGTGTACGTGGACATAGACGCTGACAGCGCCTTTTTGAAGGCCCTGCAACGTGCGTACCCCATGTTTGAGGTG-
GAACCAA
GGCAGGTCACACCGAATGACCATGCTAATGCTAGAGCGTTCTCGCATCTAGCTATAAAACTAATAGAGCAGGAA-
ATTGACCC
CGACTCAACCATCCTGGATATCGGCAGTGCGCCAGCAAGGAGGATGATGTCGGACAGGAAGTACCACTGCGTCT-
GCCCGAT
GCGCAGTGCGGAAGATCCCGAGAGACTCGCCAATTATGCGAGAAAGCTAGCATCTGCCGCAGGAAAAGTCCTGG-
ACAGAA
ACATCTCTGGAAAGATCGGGGACTTACAAGCAGTAATGGCCGTGCCAGACACGGAGACGCCAACATTCTGCTTA-
CACACAG
ACGTCTCATGTAGACAGAGAGCAGACGTCGCTATATACCAAGACGTCTATGCTGTACACGCACCCACGTCGCTA-
TACCACCA
GGCGATTAAAGGGGTCCGAGTGGCGTACTGGGTTGGGTTCGACACAACCCCGTTCATGTACAATGCCATGGCGG-
GTGCCTA
CCCCTCATACTCGACAAACTGGGCAGATGAGCAGGTACTGAAGGCTAAGAACATAGGATTATGTTCAACAGACC-
TGACGGA
AGGTAGACGAGGCAAGTTGTCTATTATGAGAGGGAAAAAGCTAAAACCGTGCGACCGTGTGCTGTTCTCAGTAG-
GGTCAAC
GCTCTACCCGGAAAGCCGCAAGCTACTTAAGAGCTGGCACCTGCCATCGGTGTTCCATTTAAAGGGCAAACTCA-
GCTTCACA
TGCCGCTGTGATACAGTGGTTTCGTGTGAGGGCTACGTCGTTAAGAGAATAACGATGAGCCCAGGCCTTTATGG-
AAAAACC
ACAGGGTATGCGGTAACCCACCACGCAGACGGATTCCTGATGTGCAAGACTACCGACACGGTTGACGGCGAAAG-
ARTGTCA
TTCTCGGTGTGCACATACGTGCCGGCGACCATTTGTGATCAAATGACCGGCATCCTTGCTACAGAAGTCACGCC-
GGAGGATG
CACAGAAGCTGTTGGTGGGGCTGAACCAGAGAATAGTGGTTAACGGCAGAACGCAACGGAATACGAACACCATG-
AAAAAT
TATCTGCTTCCCGTGGTCGCCCAAGCCTTCAGTAAGTGGGCAAAGGAGTGCCGGAAAGACATGGAAGATGAAAA-
ACTCCTG
GGGGTCAGAGAAAGAACACTGACCTGCTGCTGTCTATGGGCATTCAAGAAGCAGAAAACACACACGGTCTACAA-
GAGGCCT
GATACCCAGTCAATTCAGAAGGTTCAGGCCGAGTTTGACAGCTTTGTGGTACCGAGTCTGTGGTCGTCCGGGTT-
GTCAATCC
CTTTGAGGACTAGAATCAAATGGTTGTTAAGCAAGGTGCCAAAAACCGACCTGATCCCATACAGCGGAGACGCC-
CGAGAAG
CCCGGGACGCAGAAAAAGAAGCAGAGGAAGAACGAGAAGCAGAACTGACTCGCGAAGCCCTACCACCTCTACAG-
GCAGCA
CAGGAAGATGTTCAGGTCGAAATCGACGTGGAACAGCTTGAGGACAGAGCGGGCGCAGGAATAATAGAGACTCC-
GAGAG
GAGCTATCAAAGTTACTGCCCAACCAACAGACCACGTCGTGGGAGAGTACCTGGTACTCTCCCCGCAGACCGTA-
CTACGTAG
CCAGAAGCTCAGTCTGATTCACGCTTTGGCGGAGCAAGTGAAGACGTGCACGCACAACGGACGAGCAGGGAGGT-
ATGCGG
TCGAAGCGTACGACGGCCGAGTCCTAGTGCCCTCAGGCTATGCAATCTCGCCTGAAGACTTCCAGAGTCTAAGC-
GAAAGCG
CAACGATGGTGTATAACGAAAGAGAGTTCGTAAACAGAAAGCTACACCATATTGCGATGCACGGACCAGCCCTG-
AACACCG
ACGAAGAGTCGTATGAGCTGGTGAGGGCAGAGAGGACAGAACACGAGTACGTCTACGACGTGGATCAGAGAAGA-
TGCTG
TAAGAAGGAAGAAGCCGCAGGACTGGTACTGGTGGGCGACTTGACTAATCCGCCCTACCACGAATTCGCATATG-
AAGGGCT
AAAAATCCGCCCTGCCTGCCCATACAAAATTGCAGTCATAGGAGTCTTCGGAGTACCGGGATCTGGCAAGTCAG-
CTATTATC
AAGAACCTAGTTACCAGGCAGGACCTGGTGACTAGCGGAAAGAAAGAAAACTGCCAAGAAATCACCACCGACGT-
GATGAG
ACAGAGAGGTCTAGAGATATCTGCACGTACGGTTGACTCGCTGCTCTTGAATGGATGCAACAGACCAGTCGACG-
TGTTGTA
CGTAGACGAGGCGTTTGCGTGCCACTCTGGAACGCTACTTGCTTTGATCGCCTTGGTGAGACCAAGGCAGAAAG-
TTGTACTT
TGTGGTGACCCGAAGCAGTGCGGCTTCTTCAATATGATGCAGATGAAAGTCAACTATAATCACAACATCTGCAC-
CCAAGTGT
ACCACAAAAGTATCTCCAGGCGGTGTACACTGCCTGTGACCGCCATTGTGTCATCGTTGCATTACGAAGGCAAA-
ATGCGCAC
TACGAATGAGTACAACAAGCCGATTGTAGTGGACACTACAGGCTCAACAAAACCTGACCCTGGAGACCTCGTGT-
TAACGTG
CTTCAGAGGGTGGGTTAAACAACTGCAAATTGACTATCGTGGATACGAGGTCATGACAGCAGCCGCATCCCAAG-
GGTTAAC
CAGAAAAGGAGTTTACGCAGTTAGACAAAAAGTTAATGAAAACCCGCTCTATGCATCAACGTCAGAGCACGTCA-
ACGTACTC
CTAACGCGTACGGAAGGTAAACTGGTATGGAAGACACTTTCCGGCGACCCGTGGATAAAGACGCTGCAGAACCC-
ACCGAAA
GGAAACTTCAAAGCAACTATTAAGGAGTGGGAGGTGGAGCATGCATCAATAATGGCGGGCATCTGCAGTCACCA-
AATGAC
CTTCGATACATTCCAAAATAAAGCCAACGTTTGTTGGGCTAAGAGCTTGGTCCCTATCCTCGAAACAGCGGGGA-
TAAAACTA
AATGATAGGCAGTGGTCTCAGATAATTCAAGCCTTCAAAGAAGACAAAGCATACTCACCTGAAGTAGCCCTGAA-
TGAAATAT
GTACGCGCATGTATGGGGTGGATCTAGACAGCGGGCTATTTTCTAAACCGTTGGTGTCTGTGTATTACGCGGAT-
AACCACTG
GGATAATAGGCCTGGAGGGAAAATGTTCGGATTTAACCCCGAGGCAGCATCCATTCTAGAAAGAAAGTATCCAT-
TCACAAA
AGGGAAGTGGAACATCAACAAGCAGATCTGCGTGACTACCAGGAGGATAGAAGACTTTAACCCTACCACCAACA-
TCATACC
GGCCAACAGGAGACTACCACACTCATTAGTGGCCGAACACCGCCCAGTAAAAGGGGAAAGAATGGAATGGCTGG-
TTAACA
AGATAAACGGCCACCACGTGCTCCTGGTCAGTGGCTATAACCTTGCACTGCCTACTAAGAGAGTCACTTGGGTA-
GCGCCGTT
AGGTGTCCGCGGAGCGGACTACACATACAACCTAGAGTTGGGTCTGCCAGCAACGCTTGGTAGGTATGACCTAG-
TGGTCAT
AAACATCCACACACCTTTTCGCATACACCATTACCAACAGTGCGTCGACCACGCAATGAAACTGCAAATGCTCG-
GGGGTGAC
TCATTGAGACTGCTCAAACCGGGCGGCTCTCTATTGATCAGAGCATATGGTTACGCAGATAGAACCAGTGAACG-
AGTCATCT
GCGTATTGGGACGCAAGTTTAGATCGTCTAGAGCGTTGAAACCACCATGTGTCACCAGCAACACTGAGATGTTT-
TTCCTATTC
AGCAACTTTGACAATGGCAGAAGGAATTTCACAACTCATGTCATGAACAATCAACTGAATGCAGCCTTCGTAGG-
ACAGGTCA
CCCGAGCAGGATGTGCACCGTCGTACCGGGTAAAACGCATGGACATCGCGAAGAACGATGAAGAGTGCGTAGTC-
AACGCC
GCTAACCCTCGCGGGTTACCGGGTGRCGGTGTTTGCAAGGCAGTATACAAAAAATGGCCGGAGTCCTTTAAGAA-
CAGTGCA
ACACCAGTGGGAACCGCAAAAACAGTTATGTGCGGTACGTATCCAGTAATCCACGCTGTTGGACCAAACTTCTC-
TAATTATT
CGGAGTCTGAAGGGGACCGGGAATTGGCAGCTGCCTATCGAGAAGTCGCAAAGGAAGTAACTAGGCTGGGAGTA-
AATAGT
GTAGCTATACCTCTCCTCTCCACAGGTGTATACTCAGGAGGGAAAGACAGGCTGACCCAGTCACTGAACCACCT-
CTTTACAG
CCATGGACTCGACGGATGCAGACGTGGTCATCTACTGCCGCGACAAAGAATGGGAGAAGAAAATATCTGAGGCC-
ATACAG
ATGCGGACCCAAGTAGAGCTGCTGGATGAGCACATCTCCATAGACTGCGATATTGTTCGCGTGCACCCTGACAG-
CAGCTTG
GCAGGCAGAAAAGGATACAGCACCACGGAAGGCGCACTGTACTCATATCTAGAAGGGACCCGTTTTCATCAGAC-
GGCTGTG
GATATGGCGGAGATACATACTATGTGGCCAAAGCAAACAGAGGCCAATGAGCAAGTCTGCCTATATGCCCTGGG-
GGAAAG
TATTGAATCGATCAGGCAGAAATGCCCGGTGGATGATGCAGACGCATCATCTCCCCCCAAAACTGTCCCGTGCC-
TTTGCCGT
TACGCTATGACTCCAGAACGCGTCACCCGGCTTCGCATGAACCACGTCACAAGCATAATTGTGTGTTCTTCGTT-
TCCCCTCCC
AAAGTACAAAATAGAAGGAGTGCAAAAAGTCAAATGCTCTAAGGTAATGCTATTTGACCACAACGTGCCATCGC-
GCGTAAG
TCCAAGGGAATATAKATCTTCCCAGGAGTCTGCACAGGAGGCGAGTACAATCACGTCACTGACGCATAGTCAAT-
TCGACCTA
AGCGTTGATGGCGAGATACTGCCCGTCCCGTCAGACCTGGATGCTGACGCCCCAGCCCTAGAACCAGCACTAGA-
CGACGGG
GCGACACACACGCTGCCATCCACAACCGGAAACCTTGCGGCCGTGTCTGATTGGGTAATGAGCACCGTACCTGT-
CGCGCCG
CCCAGAAGAAGGCGAGGGAGAAACCTGACTGTGACATGTGACGAGAGAGAAGGGAATATAACACCCATGGCTAG-
CGTCCG
ATTCTTTAGGGCAGAGCTGTGTCCGGTCGTACAAGAAACAGCGGAGACGCGTGACACAGCAATGTCTCTTCAGG-
CACCACC
GAGTACCGCCACGGAACCGAATCATCCGCCGATCTCCTTCGGAGCATCAAGCGAGACGTTCCCCATTACATTTG-
GGGACTTC
AACGAAGGAGAAATCGAAAGCTTGTCTTCTGAGCTACTAACTTTCGGAGACTTCTTACCAGGAGAAGTGGATGA-
CTTGACA
GACAGCGACTGGTCCACGTGCTCAGACACGGACGACGAGTTATGACTAGACAGGGCAGGTGGGTATATATTCTC-
GTCGGAC
ACCGGTCCAGGTCATTTACAACAGAAGTCAGTACGCCAGTCAGTGCTGCCGGTGAACACCCTGGAGGAAGTCCA-
CGAGGAG
AAGTGTTACCCACCTAAGCTGGATGAAGCAAAGGAGCAACTATTACTTAAGAAACTCCAGGAGAGTGCATCCAT-
GGCCAAC
AGAAGCAGGTATCAGTCGCGCAAAGTAGAAAACATGAAAGCAGCAATCATCCAGAGACTAAAGAGAGGCTGTAG-
ACTATA
CTTAATGTCAGAGACCCCAAAAGTCCCTACTTACCGGACTACATATCCGGCGCCTGTGTACTCGCCTCCGATCA-
ACGTCCGAT
TGTCCAATCCCGAGTCCGCAGTGGCAGCATGCAATGAGTTCTTAGCTAGAAACTATCCAACTGTCTCATCATAC-
CAAATTACC
GACGAGTATGATGCATATCTAGACATGGTGGACGGGTCGGAGAGTTGCCTGGACCGAGCGACATTCAATCCGTC-
AAAACTC
AGGAGCTACCCGAAACAGCACGCTTACCACGCGCCCTCCATCAGAAGCGCTGTACCGTCCCCATTCCAGAACAC-
ACTACAGA
ATGTACTGGCAGCAGCCACGAAAAGAAACTGCAACGTCACACAGATGAGGGAATTACCCACTTTGGACTCAGCA-
GTATTCA
ACGTGGAGTGTTTCAAAAAATTCGCATGCAACCAAGAATACTGGGAAGAATTTGCTGCCAGCCCTATTAGGATA-
ACAACTGA
GAATTTAGCAACCTATGTTACTAAACTAAAAGGGCCAAAAGCAGCAGCGCTATTCGCAAAAACCCATAATCTAC-
TGCCACTA
CAGGAAGTACCAATGGATAGGTTCACAGTAGATATGAAAAGGGACGTAAAGGTGACTCCTGGTACAAAGCATAC-
AGAGGA
AAGACCTAAGGTGCAGGTTATACAGGCGGCTGAACCCTTGGCGACAGCATACCTATGTGGGATTCACAGAGAGC-
TGGTTAG
GAGGCTGAACGCCGTCCTCCTACCCAATGTACATACACTATTTGACATGTCTGCCGAGGATTTCGATGCCATCA-
TAGCCGCAC
ACTTTAAGCCAGGAGACACTGTTTTGGAAACGGACATAGCCTCCTTTGATAAGAGCCAAGATGATTCACTTGCG-
CTTACTGC
TTTGATGCTGTTAGAGGATTTAGGGGTGGATCACTCCCTGCTGGACTTGATAGAGGCTGCTTTCGGAGAGATTT-
CCAGCTGT
CACCTACCGACAGGTACGCGCTTCAAGTTCGGCGCCATGATGAAATCAGGTATGTTCCTAACTCTGTTCGTCAA-
CACATTGTT
AAACATCACCATCGCCAGCCGAGTGCTGGAAGATCGTCTGACAAAATCCGCGTGCGCGGCCTTCATCGGCGACG-
ACAACAT
AATACATGGAGTCGTCTCCGATGAATTGATGGCAGCCAGATGTGCCACTTGGATGAACATGGAAGTGAAGATCA-
TAGATGC
AGTTGTATCCTTGAAAGCCCCTTACTTTTGTGGAGGGTTTATACTGCACGATACTGTGACAGGAACAGCTTGCA-
GAGTGGCA
GACCCGCTAAAAAGGCTTTTTAAACTGGGCAAACCGCTAGCGGCAGGTGACGAACAAGATGAAGATAGAAGACG-
AGCGCT
GGCTGACGAAGTGATCAGATGGCAACGAACAGGGCTAATTGATGAGCTGGAGAAAGCGGTATACTCTAGGTACG-
AAGTGC
AGGGTATATCAGTTGTGGTAATGTCCATGGCCACCTTTGCAAGCTCCAGATCCAACTTCGAGAAGCTCAGAGGA-
CCCGTCAT
AACTTTGTACGGCGGTCCTAAATAGGTACGCACTACAGCTACCTATTTTGCAGAAGCCGACAGCAAGTATCTAA-
ACACTAAT
CAGCTACAATGGAGTTCATCCCAACCCAAACTTTTTACAATAGGAGGTACCAGCCTCGACCCTGGACTCCGCGC-
CCTACTATC
CAAGTCATCAGGCCCAGACCGCGCCCTCAGAGGCAAGCTGGGCAACTTGCCCAGCTGATCTCAGCAGTTAATAA-
ACTGACA
ATGCGCGCGGTACCCCAACAGAAGCCACGCAGGAATCGGAAGAATAAGAAGCAAAAGCAAAAACAACAGGCGCC-
ACAAAA
CAACACAAATCAAAAGAAGCAGCCACCTAAAAAGAAACCGGCTCAAAAGAAAAAGAAGCCGGGCCGCAGAGAGA-
GGATG
TGCATGAAAATCGAAAATGATTGTATTTTCGAAGTCAAGCACGAAGGTAAGGTAACAGGTTACGCGTGCCTGGT-
GGGGGAC
AAAGTAATGAAACCAGCACACGTAAAGGGGACCATCGATAACGCGGACCTGGCCAAACTGGCCTTTAAGCGGTC-
ATCTAAG
TATGACCTTGAATGCGCGCAGATACCCGTGCACATGAAGTCCGACGCTTCGAAGTTCACCCATGAGAAACCGGA-
GGGGTAC
TACAACTGGCACCACGGAGCAGTACAGTACTCAGGAGGCCGGTTCACCATCCCTACAGGTGCTGGCAAACCAGG-
GGACAGC
GGCAGACCGATCTTCGACAACAAGGGACGCGTGGTGGCCATAGTCTTAGGAGGAGCTAATGAAGGAGCCCGTAC-
AGCCCT
CTCGGTGGTGACCTGGAATAAAGACATTGTCACTAAAATCACCCCCGAGGGGGCCGAAGAGTGGAGTCTTGCCA-
TCCCAGT
TATGTGCCTGTTGGCAAACACCACGTTCCCCTGCTCCCAGCCCCCTTGCACGCCCTGCTGCTACGAAAAGGAAC-
CGGAGGAA
ACCCTACGCATGCTTGAGGACAACGTCATGAGACCTGGGTACTATCAGCTGCTACAAGCATCCTTAACATGTTC-
TCCCCACCG
CCAGCGACGCAGCACCAAGGACAACTTCAATGTCTATAAAGCCACAAGACCATACTTAGCTCACTGTCCCGACT-
GTGGAGAA
GGGCACTCGTGCCATAGTCCCGTAGCACTAGAACGCATCAGAAATGAAGCGACAGACGGGACGCTGAAAATCCA-
GGTCTCC
TTGCAAATCGGAATAAAGACGGATGACAGCCACGATTGGACCAAGCTGCGTTATATGGACAACCACATGCCAGC-
AGACGCA
GAGAGGGCGGGGCTATTTGTAAGAACATCAGCACCGTGTACGATTACTGGAACAATGGGACACTTCATCCTGGC-
CCGATGT
CCAAAAGGGGAAACTCTGACGGTGGGATTCACTGACAGTAGGAAGATTAGTCACTCATGTACGCACCCATTTCA-
CCACGAC
CCTCCTGTGATAGGTCGGGAAAAATTCCATTCCCGACCGCAGCACGGTAAAGAGCTACCTTGCAGCACGTACGT-
GCAGAGC
ACCGCCGCAACTACCGAGGAGATAGAGGTACACATGCCCCCAGACACCCCTGATCGCACATTAATGTCACAACA-
GTCCGGC
AACGTAAAGATCACAGTCAATGGCCAGACGGTGCGGTACAAGTGTAATTGCGGTGGCTCAAATGAAGGACTAAC-
AACTACA
GACAAAGTGATTAATAACTGCAAGGTTGATCAATGTCATGCCGCGGTCACCAATCACAAAAAGTGGCAGTATAA-
CTCCCCTC
TGGTCCCGCGTAATGCTGAACTTGGGGACCGAAAAGGAAAAATTCACATCCCGTTTCCGCTGGCAAATGTAACA-
TGCAGGG
TGCCTAAAGCAAGGAACCCCACCGTGACGTACGGGAAAAACCAAGTCATCATGCTACTGTATCCTGACCACCCA-
ACACTCCT
GTCCTACCGGAATATGGGAGAAGAACCAAACTATCAAGAAGAGTGGGTGATGCATAAGAAGGAAGTCGTGCTAA-
CCGTGC
CGACTGAAGGGCTCGAGGTCACGTGGGGCAACAACGAGCCGTATAAGTATTGGCCGCAGTTATCTACAAACGGT-
ACAGCCC
ATGGCCACCCGCATGAGATAATTCTGTATTATTATGAGCTGTACCCCACTATGACTGTAGTAGTTGTGTCAGTG-
GCCACGTTC
ATACTCCTGTCGATGGTGGGTATGGCAGCGGGGATGTGCATGTGTGCACGACGCAGATGCATCACACCGTATGA-
ACTGACA
CCAGGAGCTACCGTCCCTTTCCTGCTTAGCCTAATATGCTGCATCAGAACAGCTAAAGCGGCCACATACCAAGA-
GGCTGCGA
TATACCTGTGGAACGAGCAGCAACCTTTGTTTTGGCTACAAGCCCTTATTCCGCTGGCAGCCCTGATTGTTCTA-
TGCAACTGT
CTGAGACTCTTACCATGCTGCTGTAAAACGTTGGCTTTTTTAGCCGTAATGAGCGTCGGTGCCCACACTGTGAG-
CGCGTACG
AACACGTAACAGTGATCCCGAACACGGTGGGAGTACCGTATAAGACTCTAGTCAATAGACCTGGCTACAGCCCC-
ATGGTATT
GGAGATGGAACTACTGTCAGTCACTTTGGAGCCAACACTATCGCTTGATTACATCACGTGCGAGTACAAAACCG-
TCATCCCG
TCTCCGTACGTGAAGTGCTGCGGTACAGCAGAGTGCAAGGACAAAAACCTACCTGACTACAGCTGTAAGGTCTT-
CACCGGC
GTCTACCCATTTATGTGGGGCGGCGCCTACTGCTTCTGCGACGCTGAAAACACGCAGTTGAGCGAAGCACACGT-
GGAGAAG
TCCGAATCATGCAAAACAGAATTTGCATCAGCATACAGGGCTCATACCGCATCTGCATCAGCTAAGCTCCGCGT-
CCTTTACCA
AGGAAATAACATCACTGTAACTGCCTATGCAAACGGCGACCATGCCGTCACAGTTAAGGACGCCAAATTCATTG-
TGGGGCC
AATGTCTTCAGCCTGGACACCTTTCGACAACAAAATTGTGGTGTACAAAGGTGACGTCTATAACATGGACTACC-
CGCCCTTTG
GCGCAGGAAGACCAGGACAATTTGGCGATATCCAAAGTCGCACACCTGAGAGTAAAGACGTCTATGCTAATACA-
CAACTGG
TACTGCAGAGACCGGCTGTGGGTACGGTACACGTGCCATACTCTCAGGCACCATCTGGCTTTAAGTATTGGCTA-
AAAGAACG
CGGGGCGTCGCTGCAGCACACAGCACCATTTGGCTGCCAAATAGCAACAAACCCGGTAAGAGCGGTGAACTGCG-
CCGTAG
GGAACATGCCCATCTCCATCGACATACCGGAAGCGGCCTTCACTAGGGTCGTCGACGCGCCCTCTTTAACGGAC-
ATGTCGTG
CGAGGTACCAGCCTGCACCCATTCCTCAGACTTTGGGGGCGTCGCCATTATTAAATATGCAGCCAGCAAGAAAG-
GCAAGTG
TGCGGTGCATTCGATGACTAACGCCGTCACTATTCGGGAAGCTGAGATAGAAGTTGAAGGGAATTCTCAGCTGC-
AAATCTCT
TTCTCGACGGCCTTAGCCAGCGCCGAATTCCGCGTACAAGTCTGTTCTACACAAGTACACTGTGCAGCCGAGTG-
CCACCCCC
CGAAGGACCACATAGTCAACTACCCGGCGTCACATACCACCCTCGGGGTCCAGGACATCTCCGCTACGGCGATG-
TCATGGG
TGCAGAAGATCACGGGAGGTGTGGGACTGGTTGTTGCTGTTGCCGCACTGATTCTAATCGTGGTGCTATGCGTG-
TCGTTCA
GCAGGCACTAACTTGACAATTAAGTATGAAGGTATATGTGTCCCCTAAGAGACACACTGTACATAGCAAATAAT-
CTATAGAT
CAAAGGGCTACGCAACCCCTGAATAGTAACAAAATACAAAATCACTAAAAATTATAAAAACAGAAAAATACATA-
AATAGGT
ATACGTGTCCCCTAAGAGACACATTGTATGTAGGTGATAAGTATAGATCAAAGGGCCGAATAACCCCTGAATAG-
TAACAAA
ATATGAAAATCAATAAAAATCATAAAATAGAAAAACCATAAACAGAAGTAGTTCAAAGGGCTATAAAACCCCTG-
AATAGTA
ACAAAACATAAAATTAATAAAAATCAAATGAATACCATAATTGGCAAACGGAAGAGATGTAGGTACTTAAGCTT-
CCTAAAAG
CAGCCGAACTCACTTTGAGAAGTAGGCATAGCATACCGAACTCTTCCACGATTCTCCGAACCCACAGGGACGTA-
GGAGATGT TATTTTGTTTTTAATATTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAA Japanese
encephalitis virus strain SA14-14-2, complete genome, ACCESSION:
KC517497 SEQ ID NO: 73
TTTAAACAGTTTTTTAGAACGGAAGATAACCATGACTAAAAAACCAGGAGGGCCCGGTAAAAACCGGGCTATCA-
ATATGCT
GAAACGCGGCCTACCCCGCGTATTCCCACTAGTGGGAGTGAAGAGGGTAGTAATGAGCTTGTTGGACGGCAGAG-
GGCCAG
TACGTTTCGTGCTGGCTCTTATCACGTTCTTCAAGTTTACAGCATTAGCCCCGACCAAGGCGCTTTTAGGCCGA-
TGGAAAGCA
GTGGAAAAGAGTGTGGCAATGAAACATCTTACTAGTTTCAAACGAGAACTTGGAACACTCATTGACGCCGTGAA-
CAAGCGG
GGCAGAAAGCAAAACAAAAGAGGAGGAAATGAAGGCTCAATCATGTGGCTCGCGAGCTTGGCAGTTGTCATAGC-
TTGTGC
AGGAGCCATGAAGTTGTCGAATTTCCAGGGGAAGCTTTTGATGACCATCAACAACACGGACATTGCAGACGTTA-
TCGTGATT
CCCACCTCAAAAGGAGAGAACAGATGCTGGGTCCGGGCAATCGACGTCGGCTACATGTGTGAGGACACTATCAC-
GTACGAA
TGTCCTAAGCTTACCATGGGCAATGATCCAGAGGATGTGGATTGCTGGTGTGACAACCAAGAAGTCTACGTCCA-
ATATGGA
CGGTGCACGCGGACCAGGCATTCCAAGCGAAGCAGGAGATCCGTGTCGGTCCAAACACATGGGGAGAGTTCACT-
AGTGAA
TAAAAAAGAGGCTTGGCTGGATTCAACGAAAGCCACACGATATCTCATGAAAACTGAGAACTGGATCATAAGGA-
ATCCTGG
CTATGCTTTCCTGGCGGCGGTACTTGGCTGGATGCTTGGCAGTAACAACGGTCAACGCGTGGTATTTACCATCC-
TCCTGCTGT
TGGTCGCTCCGGCTTACAGTTTTAATTGTCTGGGAATGGGCAATCGTGACTTCATAGAAGGAGCCAGTGGAGCC-
ACTTGGG
TGGACTTGGTGCTAGAAGGAGATAGCTGCTTGACAATCATGGCAAACGACAAACCAACATTGGACGTCCGCATG-
ATTAACA
TCGAAGCTAGCCAACTTGCTGAGGTCAGAAGTTACTGCTATCATGCTTCAGTCACTGACATCTCGACGGTGGCT-
CGGTGCCC
CACGACTGGAGAAGCCCACAACGAGAAGCGAGCTGATAGTAGCTATGTGTGCAAACAAGGCTTCACTGACCGTG-
GGTGGG
GCAACGGATGTGGACTTTTCGGGAAGGGAAGCATTGACACATGTGCAAAATTCTCCTGCACCAGTAAAGCGATT-
GGGAGAA
CAATCCAGCCAGAAAACATCAAATACGAAGTTGGCATTTTTGTGCATGGAACCACCACTTCGGAAAACCATGGG-
AATTATTC
AGCGCAAGTTGGGGCGTCCCAGGCGGCAAAGTTTACAGTAACACCCAATGCTCCTTCGATAACCCTCAAACTTG-
GTGACTAC
GGAGAAGTCACACTGGACTGTGAGCCAAGGAGTGGACTGAACACTGAAGCGTTTTACGTCATGACCGTGGGGTC-
AAAGTC
ATTTCTGGTCCATAGGGAGTGGTTTCATGACCTCGCTCTCCCCTGGACGTCCCCTTCGAGCACAGCGTGGAGAA-
ACAGAGAA
CTCCTCATGGAATTTGAAGGGGCGCACGCCACAAAACAGTCCGTTGTTGCTCTTGGGTCACAGGAAGGAGGCCT-
CCATCAG
GCGTTGGCAGGAGCCATCGTGGTGGAGTACTCAAGCTCAGTGAAGTTAACATCAGGCCACCTGAAATGTAGGCT-
GAAAATG
GACAAACTGGCTCTGAAAGGCACAACCTATGGCATGTGTACAGAAAAATTCTCGTTCGCGAAAAATCCGGCGGA-
CACTGGT
CACGGAACAGTTGTCATTGAACTCTCCTACTCTGGGAGTGATGGCCCCTGCAAAATTCCGATTGTTTCCGTTGC-
GAGCCTCAA
TGACATGACCCCCGTTGGGCGGCTGGTGACAGTGAACCCCTTCGTCGCGACTTCCAGTGCCAACTCAAAGGTGC-
TGGTCGA
GATGGAACCCCCCTTCGGAGACTCCTACATCGTAGTTGGAAGGGGAGACAAGCAGATCAACCACCATTGGCACA-
AAGCTGG
AAGCACGCTGGGCAAGGCCTTTTCAACAACTTTGAAGGGAGCTCAAAGACTGGCAGCGTTGGGCGACACAGCCT-
GGGACTT
TGGCTCTATTGGAGGGGTCTTCAACTCCATAGGAAAAGCCGTTCACCAAGTGTTTGGTGGTGCCTTCAGAACAC-
TCTTTGGG
GGAATGTCTTGGATCACACAAGGGCTAATGGGTGCCCTACTGCTCTGGATGGGCGTCAACGCACGAGACCGATC-
AATTGCT
TTGGCCTTCTTAGCCACAGGGGGTGTGCTCGTGTTCTTAGCGACCAATGTGCATGCTGACACTGGATGTGCCAT-
TGACATCA
CAAGAAAAGAGATGAGATGTGGAAGTGGCATCTTCGTGCACAACGACGTGGAAGCCTGGGTGGATAGGTATAAA-
TATTTG
CCAGAAACGCCCAGATCCCTAGCGAAGATCGTCCACAAAGCGCACAAGGAAGGCGTGTGCGGAGTCAGATCTGT-
CACTAGA
CTGGAGCACCAAATGTGGGAAGCCGTACGGGACGAATTGAACGTCCTGCTCAAAGAGAATGCAGTGGACCTCAG-
TGTGGT
TGTGAACAAGCCCGTGGGAAGATATCGCTCAGCCCCTAAACGCCTATCCATGACGCAAGAGAAGTTTGAAATGG-
GCTGGAA
AGCATGGGGAAAAAGCATTCTCTTTGCCCCGGAATTGGCTAACTCCACATTTGTCGTAGATGGACCTGAGACAA-
AGGAATG
CCCTGATGAGCACAGAGCTTGGAACAGCATGCAAATCGAAGACTTCGGCTTTGGCATCACATCAACCCGTGTGT-
GGCTGAA
AATTAGAGAGGAGAGCACTGACGAGTGTGATGGAGCGATCATAGGCACGGCTGTCAAAGGACATGTGGCAGTCC-
ATAGTG
ACTTGTCGTACTGGATTGAGAGTCGCTACAACGACACATGGAAACTTGAGAGGGCAGTCTTTGGAGAGGTCAAA-
TCTTGCA
CTTGGCCAGAGACACACACCCTTTGGGGAGATGATGTTGAGGAAAGTGAACTCATCATTCCGCACACCATAGCC-
GGACCAA
AAAGCAAGCACAATCGGAGGGAAGGGTATAAGACACAAAACCAGGGACCTTGGGATGAGAATGGCATAGTCTTG-
GACTTT
GATTATTGCCCAGGGACAAAAGTCACCATTACAGAGGATTGTGGCAAGAGAGGCCCTTCGGTCAGAACCACTAC-
TGACAGT
GGAAAGTTGATCACTGACTGGTGCTGTCGCAGTTGCTCCCTTCCGCCCCTACGATTCCGGACAGAAAATGGCTG-
CTGGTACG
GAATGGAAATCAGACCTGTTAGGCATGATGAAACAACACTCGTCAGATCACAGGTTGATGCTTTCAATGGTGAA-
ATGGTTG
ACCCTTTTCAGCTGGGCCTTCTGGTGATGTTTCTGGCCACCCAGGAGGTCCTTCGCAAGAGGTGGACGGCCAGA-
TTGACCAT
TCCTGCGGTTTTGGGGGCCCTACTTGTGCTGATGCTTGGGGGCATCACTTACACTGATTTGGCGAGGTATGTGG-
TGCTAGTC
GCTGCTGCTTTCGCAGAGGCCAACAGTGGAGGAGACGTCCTGCACCTTGCTTTGATTGCCGTTTTTAAGATCCA-
ACCAGCAT
TTCTAGTGATGAACATGCTTAGCACGAGATGGACGAACCAAGAAAACGTGGTTCTGGTCCTAGGGGCTGCCTTT-
TTCCAATT
GGCCTCAGTAGATCTGCAAATAGGAGTCCACGGAATCCTGAATGCCGCCGCTATAGCATGGATGATTGTCCGAG-
CGATCAC
CTTCCCCACAACCTCCTCCGTCACCATGCCAGTCTTAGCGCTTCTAACTCCGGGGATGAGGGCTCTATACCTAG-
ACACTTACA
GAATCATCCTCCTCGTCATAGGGATTTGCTCCCTGCTGCACGAGAGGAAAAAGACCATGGCAAAAAAGAAAGGA-
GCTGTAC
TCTTGGGCTTAGCGCTCACATCCACTGGATGGTTCTCGCCCACCACTATAGCTGCCGGACTAATGGTCTGCAAC-
CCAAACAA
GAAGAGAGGGTGGCCAGCTACTGAGTTTTTGTCGGCAGTTGGATTGATGTTTGCCATCGTAGGTGGTTTGGCCG-
AGTTGGA
TATTGAATCCATGTCAATACCCTTCATGCTGGCAGGTCTCATGGCAGTGTCCTACGTGGTGTCAGGAAAAGCAA-
CAGATATG
TGGCTTGAACGGGCCGCCGACATCAGCTGGGAGATGGATGCTGCAATCACAGGAAGCAGTCGGAGGCTGGATGT-
GAAACT
GGATGATGACGGAGATTTTCACTTGATTGATGATCCCGGTGTTCCATGGAAGGTCTGGGTCCTGCGCATGTCTT-
GCATTGGC
TTAGCCGCCCTCACGCCTTGGGCCATCGTTCCCGCCGCTTTCGGTTATTGGCTCACTTTAAAAACAACAAAAAG-
AGGGGGCG
TGTTTTGGGACACGCCATCCCCAAAACCTTGCTCAAAAGGAGACACCACTACAGGAGTCTACCGAATTATGGCT-
AGAGGGAT
TCTTGGCACTTACCAGGCCGGCGTCGGAGTCATGTACGAGAATGTTTTCCACACACTATGGCACACAACTAGAG-
GAGCAGCC
ATTATGAGTGGAGAAGGAAAATTGACGCCATACTGGGGTAGTGTGAGAGAAGACCGCATAGCTTACGGAGGCCC-
ATGGAG
GTTTGACCGAAAATGGAATGGAACAGATGACGTGCAAGTGATCGTGGTAGAACCGGGGAAGGCTGCAGTAAACA-
TCCAGA
CAAAACCAGGAGTGTTTCGGACTCCCTTCGGGGAGGTTGGGGCTGTTAGTCTGGATTACCCGCGAGGAACATCC-
GGCTCAC
CCATTCTGGATTCCAATGGAGACATTATAGGCCTATACGGCAATGGAGTTGAGCTTGGCGATGGCTCATACGTC-
AGCGCCAT
CGTGCAGGGTGACCGTCAGGAGGAACCAGTCCCAGAAGCTTACACCCCAAACATGTTGAGAAAGAGACAGATGA-
CTGTGC
TAGATTTGCACCCTGGTTCAGGGAAAACCAGGAAAATTCTGCCACAAATAATTAAGGACGCTATCCAGCAGCGC-
CTAAGAAC
AGCTGTGTTGGCACCGACGCGGGTGGTAGCAGCAGAAATGGCAGAAGCTTTGAGAGGGCTCCCAGTACGATATC-
AAACTTC
AGCAGTGCAGAGAGAGCACCAAGGGAATGAAATAGTGGATGTGATGTGCCACGCCACTCTGACCCATAGACTGA-
TGTCACC
GAACAGAGTGCCCAACTACAACCTATTTGTCATGGATGAAGCTCATTTCACCGACCCAGCCAGTATAGCCGCAC-
GAGGATAC
ATTGCTACCAAGGTGGAATTAGGGGAGGCAGCAGCCATCTTTATGACAGCGACCCCGCCTGGAACCACGGATCC-
TTTTCCTG
ACTCAAATGCCCCAATCCATGATTTGCAAGATGAGATACCAGACAGGGCATGGAGCAGTGGATACGAATGGATC-
ACAGAAT
ATGCGGGTAAAACCGTGTGGTTTGTGGCGAGCGTAAAAATGGGGAATGAGATTGCAATGTGCCTCCAAAGAGCG-
GGGAAA
AAGGTCATCCAACTCAACCGCAAGTCCTATGACACAGAATACCCAAAATGTAAGAATGGAGACTGGGATTTTGT-
CATTACCA
CCGACATCTCTGAAATGGGGGCCAACTTCGGTGCGAGCAGGGTCATCGACTGTAGAAAGAGCGTGAAACCCACC-
ATCTTAG
AAGAGGGAGAAGGCAGAGTCATCCTCGGAAACCCATCTCCCATAACCAGTGCAAGCGCAGCTCAACGGAGGGGC-
AGAGTA
GGCAGAAACCCCAACCAAGTTGGAGATGAATACCACTATGGGGGGGCTACCAGTGAAGATGACAGTAACCTAGC-
CCATTG
GACAGAGGCAAAGATCATGTTAGACAACATACACATGCCCAATGGACTGGTGGCCCAGCTCTATGGACCAGAGA-
GGGAAA
AGGCTTTCACAATGGATGGCGAATACCGTCTCAGAGGTGAAGAAAAGAAAAACTTCTTAGAGCTGCTTAGGACG-
GCTGACC
TCCCGGTGTGGCTGGCCTACAAGGTGGCGTCCAATGGCATTCAGTACACCGACAGAAAGTGGTGTTTTGATGGG-
CCGCGTA
CGAATGCCATACTGGAGGACAACACCGAGGTAGAGATAGTCACCCGGATGGGTGAGAGGAAAATCCTCAAGCCG-
AGATGG
CTTGATGCAAGAGTTTATGCAGATCACCAAGCCCTCAAGTGGTTCAAAGACTTTGCAGCAGGGAAGAGATCAGC-
CGTTAGC
TTCATAGAGGTGCTCGGTCGCATGCCTGAGCATTTCATGGGAAAGACGCGGGAAGCTTTAGACACCATGTACTT-
GGTTGCA
ACGGCTGAGAAAGGTGGGAAAGCACACCGAATGGCTCTCGAAGAGCTGCCAGATGCACTGGAAACCATCACACT-
TATTGTC
GCCATTACTGTGATGACAGGAGGATTCTTCCTACTAATGATGCAGCGAAAGGGTATAGGGAAGATGGGTCTTGG-
AGCTCTA
GTGCTCACGCTAGCTACCTTCTTCCTGTGGGCGGCAGAGGTTCCTGGAACCAAAATAGCAGGGACCCTGCTGAT-
CGCCCTGC
TGCTGATGGTGGTTCTCATCCCAGAACCGGAAAAACAGAGGTCACAGACAGATAACCAACTGGCGGTGTTTCTC-
ATCTGTGT
CTTGACCGTGGTTGGAGTGGTGGCAGCAAACGAGTACGGGATGCTAGAAAAAACCAAAGCAGATCTCAAGAGCA-
TGTTTG
GCGGAAAGACGCAGGCATCAGGACTGACTGGATTGCCAAGCATGGCACTGGACCTGCGTCCAGCCACAGCCTGG-
GCACTG
TATGGGGGGAGCACAGTCGTGCTAACCCCTCTTCTGAAGCACCTGATCACGTCGGAATACGTCACCACATCGCT-
AGCCTCAA
TTAACTCACAAGCTGGCTCATTATTCGTCTTGCCACGAGGCGTGCCTTTTACCGACCTAGACTTGACCGTTGGC-
CTCGTCTTCC
TTGGCTGTTGGGGTCAAATCACCCTCACAACGTTTCTGACAGCCATGGTTCTGGCGACACTTCACTATGGGTAC-
ATGCTCCCT
GGATGGCAAGCAGAAGCACTCAGGGCTGCCCAGAGAAGGACAGCGGCTGGAATAATGAAGAATGCCGTTGTTGA-
CGGAA
TGGTCGCCACTGATGTGCCTGAACTGGAAAGGACTACTCCTCTGATGCAAAAGAAAGTCGGACAGGTGCTCCTC-
ATAGGGG
TAAGCGTGGCAGCGTTCCTCGTCAACCCTAATGTCACCACTGTGAGAGAAGCAGGGGTGTTGGTGACGGCGGCT-
ACGCTTA
CTTTGTGGGACAATGGAGCCAGTGCCGTTTGGAATTCCACCACAGCCACGGGACTCTGCCATGTCATGCGAGGT-
AGCTACCT
GGCTGGAGGCTCCATTGCTTGGACTCTCATCAAGAACGCTGATAAGCCCTCCTTGAAAAGGGGAAGGCCTGGGG-
GCAGGA
CGCTAGGGGAGCAGTGGAAGGAAAAACTAAATGCCATGAGCAGAGAAGAGTTTTTTAAATACCGGAGAGAGGCC-
ATAATC
GAGGTGGACCGCACTGAAGCACGCAGGGCCAGACGTGAAAATAACATAGTGGGAGGACATCCGGTTTCGCGAGG-
CTCAGC
AAAACTCCGTTGGCTCGTGGAGAAAGGATTTGTCTCGCCAATAGGAAAAGTCATTGATCTAGGGTGTGGGCGTG-
GAGGATG
GAGCTACTACGCAGCAACCCTGAAGAAGGTCCAGGAAGTCAGAGGATACACGAAAGGTGGGGCGGGACATGAAG-
AACCG
ATGCTCATGCAGAGCTACGGCTGGAACCTGGTCTCCCTGAAGAGTGGAGTGGACGTGTTTTACAAACCTTCAGA-
GCCCAGT
GACACCCTGTTCTGTGACATAGGGGAATCCTCCCCAAGTCCAGAAGTAGAAGAACAACGCACACTACGCGTCCT-
AGAGATG
ACATCTGACTGGTTGCACCGAGGACCTAGAGAGTTCTGCATTAAAGTTCTCTGCCCTTACATGCCCAAGGTTAT-
AGAAAAAA
TGGAAGTTCTGCAGCGCCGCTTCGGAGGTGGGCTAGTGCGTCTCCCCCTGTCCCGAAACTCCAATCACGAGATG-
TATTGGGT
TAGTGGAGCCGCTGGCAATGTGGTGCACGCTGTGAACATGACCAGCCAGGTACTACTGGGGCGAATGGATCGCA-
CAGTGT
GGAGAGGGCCAAAGTATGAGGAAGATGTCAACCTAGGGAGCGGAACAAGAGCCGTGGGAAAGGGAGAAGTCCAT-
AGCA
ATCAGGAGAAAATCAAGAAGAGAATCCAGAAGCTTAAAGAAGAATTCGCCACAACGTGGCACAAAGACCCTGAG-
CATCCAT
ACCGCACTTGGACATACCACGGAAGCTATGAAGTGAAGGCTACTGGCTCAGCCAGCTCTCTCGTCAACGGAGTG-
GTGAAGC
TCATGAGCAAACCTTGGGACGCCATTGCCAACGTCACCACCATGGCCATGACTGACACCACCCCTTTTGGACAG-
CAAAGAGT
TTTCAAGGAGAAAGTTGACACGAAGGCTCCTGAGCCACCAGCTGGAGCCAAGGAAGTGCTCAACGAGACCACCA-
ACTGGCT
GTGGGCCCACTTGTCACGGGAAAAAAGACCCCGCTTGTGCACCAAGGAAGAATTCATAAAGAAAGTCAACAGCA-
ACGCGG
CTCTTGGAGCAGTGTTCGCTGAACAGAATCAATGGAGCACGGCGCGTGAGGCTGTGGATGACCCGCGGTTTTGG-
GAGATG
GTTGATGAAGAGAGGGAAAACCATCTGCGAGGAGAGTGTCACACATGTATCTACAACATGATGGGAAAAAGAGA-
GAAGAA
GCCTGGAGAGTTTGGAAAAGCTAAAGGAAGCAGGGCCATTTGGTTCATGTGGCTTGGAGCACGGTATCTAGAGT-
TTGAAGC
TTTGGGGTTCCTGAATGAAGACCATTGGCTGAGCCGAGAGAATTCAGGAGGTGGAGTGGAAGGCTCAGGCGTCC-
AAAAGC
TGGGATACATCCTCCGTGACATAGCAGGAAAGCAAGGAGGGAAAATGTACGCTGATGATACCGCCGGGTGGGAC-
ACTAGA
ATTACCAGAACTGATTTAGAAAATGAAGCTAAGGTACTGGAGCTCCTAGACGGTGAACACCGCATGCTCGCCCG-
AGCCATA
ATTGAACTGACTTACAGGCACAAAGTGGTCAAGGTCATGAGACCTGCAGCAGAAGGAAAGACCGTGATGGACGT-
GATATC
AAGAGAAGATCAAAGGGGGAGTGGACAGGTGGTCACTTATGCTCTTAACACTTTCACGAACATCGCTGTCCAGC-
TCGTCAG
GCTGATGGAGGCTGAGGGGGTCATTGGACCACAACACTTGGAACAGCTACCTAGGAAAAACAAGATAGCTGTCA-
GGACCT
GGCTCTTTGAGAATGGAGAGGAGAGAGTGACCAGGATGGCGATCAGCGGAGACGACTGTGTCGTCAAGCCGCTG-
GACGA
CAGATTCGCCACAGCCCTCCACTTCCTCAACGCAATGTCAAAGGTCAGAAAAGACATCCAGGAATGGAAGCCTT-
CGCATGGC
TGGCACGATTGGCAGCAAGTTCCCTTCTGCTCTAACCATTTTCAGGAGATTGTGATGAAAGATGGAAGGAGTAT-
AGTTGTCC
CGTGCAGAGGACAGGATGAGCTGATAGGCAGGGCTCGCATCTCTCCAGGAGCTGGATGGAATGTGAAGGACACA-
GCTTGC
CTGGCCAAAGCATATGCACAGATGTGGCTACTCCTATACTTCCATCGCAGGGACTTGCGTCTCATGGCAAATGC-
GATTTGCTC
AGCAGTGCCAGTGGATTGGGTGCCCACAGGCAGGACATCCTGGTCAATACACTCGAAAGGAGAGTGGATGACCA-
CGGAAG
ACATGCTGCAGGTCTGGAACAGAGTCTGGATTGAAGAAAATGAATGGATGATGGACAAGACTCCAATCACAAGC-
TGGACA
GACGTTCCGTATGTGGGAAAGCGTGAGGACATCTGGTGTGGCAGCCTCATCGGAACGCGATCCAGAGCAACCTG-
GGCTGA
GAACATCTATGCGGCGATAAACCAGGTTAGAGCTGTCATTGGGAAAGAAAATTATGTTGACTACATGACCTCAC-
TCAGGAG
ATACGAAGACGTCTTGATCCAGGAAGACAGGGTCATCTAGTGTGATTTAAGGTAGAAAAGTAGACTATGTAAAT-
AATGTAA
ATGAGAAAATGCATGCATATGGAGTCAGGCCAGCAAAAGCTGCCACCGGATACTGGGTAGACGGTGCTGCCTGC-
GTCTCAG
TCCCAGGAGGACTGGGTTAACAAATCTGACAACAGAAAGTGAGAAAGCCCTCAGAACCGTCTCGGAAGTAGGTC-
CCTGCTC
ACTGGAAGTTGAAAGACCAACGTCAGGCCACAAATTTGTGCCACTCCGCTAGGGAGTGCGGCCTGCGCAGCCCC-
AGGAGG
ACTGGGTTACCAAAGCCGTTGAGGCCCCCACGGCCCAAGCCTCGTCTAGGATGCAATAGACGAGGTGTAAGGAC-
TAGAGGT
TAGAGGAGACCCCGTGGAAACAACAACATGCGGCCCAAGCCCCCTCGAAGCTGTAGAGGAGGTGGAAGGACTAG-
AGGTTA
GAGGAGACCCCGCATTTGCATCAAACAGCATATTGACACCTGGGAATAGACTGGGAGATCTTCTGCTCTATCTC-
AACATCAG CTACTAG Japanese encephalitis virus strain SA14-14-2,
complete genome, ACCESSION: JN604986 SEQ ID NO: 74
AGAAGTTTATCTGTGTGAACTTCTTGGCTTAGTATCGTAGAGAAGAATCGAGAGATTAGTGCAGTTTAAACAGT-
TTTTTAGA
ACGGAAGATAACCATGACTAAAAAACCAGGAGGGCCCGGTAAAAACCGGGCTATCAATATGCTGAAACGCGGCC-
TACCCC
GCGTATTCCCACTAGTGGGAGTGAAGAGGGTAGTAATGAGCTTGTTGGACGGCAGAGGGCCAGTACGTTTCGTG-
CTGGCTC
TTATCACGTTCTTCAAGTTTACAGCATTAGCCCCGACCAAGGCGCTTTCAGGCCGATGGAAAGCAGTGGAAAAG-
AGTGTGG
CAATGAAACATCTTACTAGTTTCAAACGAGAACTTGGAACACTCATTGACGCCGTGAACAAGCGGGGCAGAAAG-
CAAAACA
AAAGAGGAGGAAATGAAGGCTCAATCATGTGGCTCGCGAGCTTGGCAGTTGTCATAGCTTGTGCAGGAGCCATG-
AAGTTGT
CGAATTTCCAGGGGAAGCTTTTGATGACCATCAACAACACGGACATTGCAGACGTTATCGTGATTCCCACCTCA-
AAAGGAGA
GAACAGATGCTGGGTCCGGGCAATCGACGTCGGCTACATGTGTGAGGACACTATCACGTACGAATGTCCTAAGC-
TTACCAT
GGGCAATGATCCAGAGGATGTGGATTGCTGGTGTGACAACCAAGAAGTCTACGTCCAATATGGACGGTGCACGC-
GGACCA
GGCATTCCAAGCGAAGCAGGAGATCCGTGTCGGTCCAAACACATGGGGAGAGTTCACTAGTGAATAAAAAAGAG-
GCTTGG
CTGGATTCAACGAAAGCCACACGATATCTCATGAAAACTGAGAACTGGATCATAAGGAATCCTGGCTATGCTTT-
CCTGGCGG
CGGTACTTGGCTGGATGCTTGGCAGTAACAACGGTCAACGCGTGGTATTTACCATCCTCCTGCTGTTGGTCGCT-
CCGGCTTA
CAGTTTTAATTGTCTGGGAATGGGCAATCGTGACTTCATAGAAGGAGCCAGTGGAGCCACTTGGGTGGACTTGG-
TGCTAGA
AGGAGACAGCTGCTTGACAATCATGGCAAACGACAAACCAACATTGGACGTCCGCATGATTAACATCGAAGCTA-
GCCAACT
TGCTGAGGTCAGAAGTTACTGCTATCATGCTTCAGTCACTGACATCTCGACGGTGGCTCGGTGCCCCACGACTG-
GAGAAGCC
CACAACGAGAAGCGAGCTGATAGTAGCTATGTGTGCAAACAAGGCTTCACTGACCGTGGGTGGGGCAACGGATG-
TGGATT
TTTCGGGAAGGGAAGCATTGACACATGTGCAAAATTCTCCTGCACCAGTAAAGCGATTGGGAGAACAATCCAGC-
CAGAAAA
CATCAAATACAAAGTTGGCATTTTTGTGCATGGAACCACCACTTCGGAAAACCATGGGAATTATTCAGCGCAAG-
TTGGGGCG
TCCCAGGCGGCAAAGTTTACAGTAACACCCAATGCTCCTTCGGTAGCCCTCAAACTTGGTGACTACGGAGAAGT-
CACACTGG
ACTGTGAGCCAAGGAGTGGACTGAACACTGAAGCGTTTTACGTCATGACCGTGGGGTCAAAGTCATTTCTGGTC-
CATAGGG
AGTGGTTTCATGACCTCGCTCTCCCCTGGACGTCCCCTTCGAGCACAGCGTGGAGAAACAGAGAACTCCTCATG-
GAATTTGA
AGGGGCGCACGCCACAAAACAGTCCGTTGTTGCTCTTGGGTCACAGGAAGGAGGCCTCCATCATGCGTTGGCAG-
GAGCCAT
CGTGGTGGAGTACTCAAGCTCAGTGATGTTAACATCAGGCCACCTGAAATGTAGGCTGAAAATGGACAAACTGG-
CTCTGAA
AGGCACAACCTATGGCATGTGTACAGAAAAATTCTCGTTCGCGAAAAATCCGGTGGACACTGGTCACGGAACAG-
TTGTCATT
GAACTCTCCTACTCTGGGAGTGATGGCCCCTGCAAAATTCCGATTGTTTCCGTTGCGAGCCTCAATGACATGAC-
CCCCGTTGG
GCGGCTGGTGACAGTGAACCCCTTCGTCGCGACTTCCAGTGCCAACTCAAAGGTGCTGGTCGAGATGGAACCCC-
CCTTCGG
AGACTCCTACATCGTAGTTGGAAGGGGAGACAAGCAGATCAACCACCATTGGCACAAAGCTGGAAGCACGCTGG-
GCAAGG
CCTTTTCAACAACTTTGAAGGGAGCTCAAAGACTGGCAGCGTTGGGCGACACAGCCTGGGACTTTGGCTCTATT-
GGAGGGG
TCTTCAACTCCATAGGAAGAGCCGTTCACCAAGTGTTTGGTGGTGCCTTCAGAACACTCTTTGGGGGAATGTCT-
TGGATCAC
ACAAGGGCTAATGGGTGCCCTACTGCTCTGGATGGGCGTCAACGCACGAGACCGATCAATTGCTTTGGCCTTCT-
TAGCCACA
GGAGGTGTGCTCGTGTTCTTAGCGACCAATGTGCATGCTGACACTGGATGTGCCATTGACATCACAAGAAAAGA-
GATGAGA
TGTGGAAGTGGCATCTTCGTGCACAACGACGTGGAAGCCTGGGTGGATAGGTATAAATATTTGCCAGAAACGCC-
CAGATCC
CTAGCGAAGATCGTCCACAAAGCGCACAAGGAAGGCGTGTGCGGAGTCAGATCTGTCACTAGACTGGAGCACCA-
AATGTG
GGAAGCCGTAAGGGACGAATTGAACGTCCTGCTCAAAGAGAATGCAGTGGACCTCAGTGTGGTTGTGAACAAGC-
CCGTGG
GAAGATATCGCTCAGCCCCTAAACGCCTATCCATGACGCAAGAGAAGTTTGAAATGGGCTGGAAAGCATGGGGA-
AAAAGC
ATCCTCTTTGCCCCGGAATTGGCTAACTCCACATTTGTCGTAGATGGACCTGAGACAAAGGAATGCCCTGATGA-
GCACAGAG
CTTGGAACAGCATGCAAATCGAAGACTTCGGCTTTGGCATCACATCAACCCGTGTGTGGCTGAAAATTAGAGAG-
GAGAGCA
CTGACGAGTGTGATGGAGCGATCATAGGCACGGCTGTCAAAGGACATGTGGCAGTCCATAGTGACTTGTCGTAC-
TGGATTG
AGAGTCGCTACAACGACACATGGAAACTTGAGAGGGCAGTCTTTGGAGAGGTCAAATCTTGCACTTGGCCAGAG-
ACACACA
CCCTTTGGGGAGATGATGTTGAGGAAAGTGAACTCATCATTCCGCACACCATAGCCGGACCAAAAAGCAAGCAC-
AATCGGA
GGGAAGGGTATAAGACACAAAACCAGGGACCTTGGGATGAGAATGGCATAGTCTTGGACTTTGATTATTGCCCA-
GGGACA
AAAGTCACCATTACAGAGGATTGTAGCAAGAGAGGCCCTTCGGTCAGAACCACTACTGACAGTGGAAAGTTGAT-
CACTGAC
TGGTGCTGTCGCAGTTGCTCCCTTCCGCCCCTACGATTCCGGACAGAAAATGGCTGCTGGTACGGAATGGAAAT-
CAGACCTG
TTATGCATGATGAAACAACACTCGTCAGATCACAGGTTCATGCTTTCAAAGGTGAAATGGTTGACCCTTTTCAG-
CTGGGCCTT
CTGGTGATGTTTCTGGCCACCCAGGAAGTCCTTCGCAAGAGGTGGACGGCCAGATTGACCATTCCTGCGGTTTT-
GGGGGTC
CTACTTGTGCTGATGCTTGGGGGTATCACTTACACTGATTTGGCGAGGTATGTGGTGCTAGTCGCTGCTGCTTT-
CGCAGAGG
CCAACAGTGGAGGAGACGTCCTGCACCTTGCTTTGATTGCTGTTTTTAAGATCCAACCAGCATTTTTAGTGATG-
AACATGCTT
AGCACGAGATGGACGAACCAAGAAAACGTGGTTCTGGTCCTAGGGGCTGCCTTTTTCCAATTGGCCTCAGTAGA-
TCTGCAA
ATAGGAGTCCACGGAATCCTGAATGCCGCCGCTATAGCATGGATGATTGTCCGAGCGATCACCTTCCCCACAAC-
CTCCTCCG
TCACCATGCCAGTCTTAGCGCTTCTAACTCCGGGGATGAGGGCTCTATACCTAGACACTTACAGAATCATCCTC-
CTCGTCATA
GGGATTTGCTCCCTGCTGCACGAGAGGAAAAAGACCATGGCGAAAAAGAAAGGAGCTGTACTCTTGGGCTTAGC-
GCTCACA
TCCACTGGATGGTTCTCGCCCACCACTATAGCTGCCGGACTAATGGTCTGCAACCCAAACAAGAAGAGAGGGTG-
GCCAGCT
ACTGAGTTTTTGTCGGCAGTTGGATTGATGTTTGCCATCGTAGGTGGTTTGGCCGAGTTGGATATTGAATCCAT-
GTCAATACC
CTTCATGCTGGCAGGTCTCATGGCAGTGTCCTACGTGGTGTCAGGAAAAGCAACAGATATGTGGCTTGAACGGG-
CCGCCGA
CATCAGCTGGGATATGGGTGCTGCAATCACAGGAAGCAGTCGGAGGCTGGATGTGAAACTGGATGATGACGGAG-
ATTTTC
ACTTGATTGATGATCCCGGTGTTCCATGGAAGGTCTGGGTCCTGCGCATGTCTTGCATTGGCTTAGCCGCCCTC-
ACGCCTTGG
GCCATCGTTCCCGCCGCTTTCGGTTATTGGCTCACTTTAAAAACAACAAAAAGAGGGGGCGTGTTTTGGGACAC-
GCCATCCC
CAAAACCTTGCTCAAAAGGAGACACCACTACAGGAGTCTACCGAATTATGGCTAGAGGGATTCTTGGCACTTAC-
CAGGCCG
GCGTCGGAGTCATGTACGAGAATGTTTTCCACACACTATGGCACACAACTAGAGGAGCAGCCATTGTGAGTGGA-
GAAGGAA
AATTGACGCCATACTGGGGTAGTGTGAAAGAAGACCGCATAGCTTACGGAGGCCCATGGAGGTTTGACCGAAAA-
TGGAAT
GGAACAGATGACGTGCAAGTGATCGTGGTAGAACCGGGGAAGGGCGCAGTAAACATCCAGACAAAACCAGGAGT-
GTTTCG
GACTCCCTTCGGGGAGGTTGGGGCTGTTAGTCTGGATTACCCGCGAGGAACATCCGGCTCACCCATTCTGGATT-
CCAATGGA
GACATTATAGGCCTATACGGCAATGGAGTTGAGCTTGGCGATGGCTCATACGTCAGCGCCATCGTGCAGGGTGA-
CCGTCAG
GAGGAACCAGTCCCAGAAGCTTACACCCCAAACATGTTGAGAAAGAGACAGATGACTGTGCTAGATTTGCACCC-
TGGTTCA
GGGAAAACCAGGAAAATTCTGCCACAAATAATTAAGGACGCTATCCAGCAGCGCCTAAGAACAGCTGTGTTGGC-
ACCGACG
CGGGTGGTAGCAGCAGAAATGGCAGAAGCTTTGAGAGGGCTCCCAGTACGATATCAAACTTCAGCAGTGCAGAG-
AGAGCA
CCAAGGGAATGAAATAGTGGATGTGATGTGCCACGCCACTCTGACCCATAGACTGATGTCACCGAACAGAGTGC-
CCAACTA
CAACCTATTTGTCATGGATGAAGCTCATTTCACCGACCCAGCCAGTATAGCCGCACGAGGATACATTGCTACCA-
AGGTGGAA
TTAGGGGAGGCAGCAGCCATCTTTATGACAGCGACCCCGCCTGGAACCACGGATCCTTTTCCTGACTCAAATGC-
CCCAATCC
ATGATTTGCAAGATGAGATACCAGACAGGGCATGGAGCAGTGGATACGAATGGATCACAGAATATGCGGGTAAA-
ACCGTG
TGGTTTGTGGCGAGCGTAAAAATGGGGAATGAGATTGCAATGTGCCTCCAAAGAGCGGGGAAAAAGGTCATCCA-
ACTCAA
CCGCAAGTCCTATGACACAGAATACCCAAAATGTAAGAATGGAGACTGGGATTTTGTCATTACCACCGACATCT-
CTGAAATG
GGGGCCAACTTCGGTGCGAGCAGGGTCATCGACTGTAGAAAGAGCGTGAAACCCACCATCTTAGAAGAGGGAGA-
AGGCA
GAGTCATCCTCGGAAACCCATCTCCCATAACCAGTGCAAGCGCAGCTCAACGGAGGGGCAGAGTAGGCAGAAAC-
CCCAATC
AAGTTGGAGATGAATACCACTATGGGGGGGCTACCAGTGAAGATGACAGTAACCTAGCCCATTGGACAGAGGCA-
AAGATC
ATGTTAGACAACATACACATGCCCAATGGACTGGTGGCCCAGCTCTATGGACCAGAGAGGGAAAAGGCTTTCAC-
AATGGAT
GGCGAATACCGTCTCAGAGGTGAAGAAAAGAAAAACTTCTTAGAGCTGCTTAGGACGGCTGACCTCCCGGTGTG-
GCTGGCC
TACAAGGTGGCGTCCAATGGCATTCAGTACACCGACAGAAAGTGGTGTTTTGATGGGCCGCGTACGAATGCCAT-
ACTGGAG
GACAACACCGAGGTAGAGATAGTCACCCGGATGGGTGAGAGGAAAATCCTCAAGCCGAGATGGCTTGATGCAAG-
AGTTTA
TGCAGATCACCAGGCCCTCAAGTGGTTCAAAGACTTTGCAGCAGGGAAGAGATCAGCCGTTAGCTTCATAGAGG-
TGCTCGG
TCGCATGCCTGAGCATTTCATGGGAAAGACGCGGGAAGCTTTAGACACCATGTACTTGGTTGCAACGGCTGAGA-
AAGGTGG
GAAAGCACACCGAATGGCTCTCGAAGAGCTGCCAGATGCACTGGAAACCATCACACTTATTGTCGCCATTACTG-
TGATGACA
GGAGGATTCTTCCTACTAATGATGCAGCGAAAGGGTATAGGGAAGATGGGTCTTGGAGCTCTAGTGCTCACACT-
AGCTACC
TTCTTCCTGTGGGCGGCAGAGGTTCCTGGAACCAAAATAGCAGGGACCCTGCTGATCGCCCTGCTGCTGATGGT-
GGTTCTCA
TCCCAGAACCGGAAAAACAGAGGTCACAGACAGATAACCAACTGGCGGTGTTTCTCATCTGTGTCTTGACCGTG-
GTTGGAG
TGGTGGCAGCAAACGAGTACGGGATGCTAGAAAAAACCAAAGCGGATCTCAAGAGCATGTTTGGCGGAAAGACG-
CAGGCA
TCAGGACTGACTGGATTGCCAAGCATGGCACTGGACCTGCGTCCAGCCACAGCCTGGGCACTGTATGGGGGGAG-
CACAGTC
GTGCTAACCCCTCTTCTGAAGCACCTGATCACGTCGGAATACGTCACCACATCGCTAGCTTCAATTAACTCACA-
AGCTGGCTC
ATTATTCGTCTTGCCACGAGGCGTGCCTTTTACCGACCTAGACTTGACTGTTGGCCTCGTCTTCCTTGGCTGTT-
GGGGTCAAG
TCACCCTCACAACGTTTCTGACAGCCATGGTTCTGGCGACACTTCACTATGGGTACATGCTCCCTGGATGGCAA-
GCAGAAGC
ACTCAGGGCTGCCCAGAGAAGGACAGCGGCTGGAATAATGAAGAATGCCGTTGTTGACGGAATGGTCGCCACTG-
ATGTGC
CTGAACTGGAAAGGACTACTCCTCTGATGCAAAAGAAAGTCGGACAGGTGCTCCTCATAGGGGTAAGCGTGGCA-
GCGTTCC
TCGTCAACCCTAATGTCACCACTGTGAGAGAAGCAGGGGTGTTGGTGACGGCGGCTACGCTTACTTTGTGGGAC-
AATGGAG
CCAGTGCCGTTTGGAATTCCACCACAGCCACGGGACTCTGCCATGTCATGCGAGGTAGCTACCTGGCTGGAGGC-
TCCATTGC
TTGGACTCTCATCAAGAACGCTGATAAGCCCTCCTTGAAAAGGGGAAGGCCTGGGGGCAGGACGCTAGGGGAGC-
AGTGGA
AGGAAAAACTAAATGCCATGAGTAGAGAAGAGTTTTTTAAATACCGGAGAGAGGCCATAATCGAGGTGGACCGC-
ACTGAA
GCACGCAGGGCCAGACGTGAAAATAACATAGTGGGAGGACATCCGGTTTCGCGAGGCTCAGCAAAACTCCGTTG-
GCTCGT
GGAGAAAGGATTTGTCTCGCCAATAGGAAAAGTCATTGATCTAGGGTGTGGGCGTGGAGGATGGAGCTACTACG-
CAGCAA
CCCTGAAGAAGGTCCAGGAAGTCAGAGGATACACGAAAGGTGGGGCGGGACATGAAGAACCGATGCTCATGCAG-
AGCTA
CGGCTGGAACCTGGTCTCCCTGAAGAGTGGAGTGGACGTGTTTTACAAACCTTCAGAGCCCAGTGATACCCTGT-
TCTGTGAC
ATAGGGGAATCCTCCCCAAGTCCAGAAGTAGAAGAACAACGCACACTACGCGTCCTAGAGATGACATCTGACTG-
GTTGCAC
CGAGGACCTAGAGAGTTCTGCATTAAAGTTCTCTGCCCTTACATGCCCAAGGTTATAGAAAAAATGGAAGTTCT-
GCAGCGTC
GCTTCGGAGGTGGGCTAGTGCGTCTCCCCCTGTCCCGAAACTCCAATCACGAGATGTATTGGGTTAGTGGAGCC-
GCTGGCA
ATGTGGTGCACGCTGTGAACATGACCAGCCAGGTATTACTGGGGCGAATGGATCGCACAGTGTGGAGAGGGCCA-
AAGTAT
GAGGAAGATGTCAACCTAGGGAGCGGAACAAGAGCCGTGGGAAAGGGAGAAGTCCATAGCAATCAGGAGAAAAT-
CAAGA
AGAGAATCCAGAAGCTTAAAGAAGAATTCGCCACAACGTGGCACAAAGACCCTGAGCATCCATACCGCACTTGG-
ACATACC
ACGGAAGCTATGAAGTGAAGGCTACTGGCTCAGCCAGCTCTCTCGTCAACGGAGTGGTGAAGCTCATGAGCAAA-
CCTTGGG
ACGCCATTGCCAACGTCACCACCATGGCCATGACTGACACCACCCCTTTTGGACAGCAAAGAGTTTTCAAGGAG-
AAAGTTGA
CACGAAGGCTCCTGAGCCACCAGCTGGAGCCAAGGAAGTGCTCAACGAGACCACCAACTGGCTGTGGGCCTACT-
TGTCACG
GGAAAAAAGACCCCGCTTGTGCACCAAGGAAGAATTCATTAAGAAAGTTAACAGCAACGCGGCTCTTGGAGCAG-
TGTTCGC
TGAACAGAATCAATGGAGCACGGCGCGTGAGGCTGTGGATGACCCGCGGTTTTGGGAGATGGTTGATGAAGAGA-
GGGAA
AACCATCTGCGAGGAGAGTGTCACACATGTATCTACAACATGATGGGAAAAAGAGAGAAGAAGCCTGGAGAGTT-
TGGAAA
AGCTAAAGGAAGCAGGGCCATTTGGTTCATGTGGCTTGGAGCACGGTATCTAGAGTTTGAAGCTTTGGGGTTCC-
TGAATGA
AGACCATTGGCTGAGCCGAGAGAATTCAGGAGGTGGAGTGGAAGGCTCAGGCGTCCAAAAGCTGGGATACATCC-
TCCGTG
ACATAGCAGGAAAGCAAGGAGGGAAAATGTACGCTGATGATACCGCCGGGTGGGACACTAGAATTACCAGAACT-
GATTTA
GAAAATGAAGCTAAGGTACTGGAGCTCCTAGACGGTGAACACCGCATGCTCGCCCGAGCCATAATTGAACTGAC-
TTACAGG
CACAAAGTGGTCAAGGTCATGAGACCTGCAGCAGAAGGAAAGACCGTGATGGACGTGATATCAAGAGAAGATCA-
AAGGG
GGAGTGGACAGGTGGTCACTTATGCTCTTAACACTTTCACGAACATCGCTGTCCAGCTCGTCAGGCTGATGGAG-
GCTGAGG
GGGTCATTGGACCACAACACTTGGAACATCTACCTAGGAAAAACAAGATAGCTGTCAGGACCTGGCTCTTTGAG-
AATGGAG
AGGAGAGAGTGACCAGGATGGCGATCAGCGGAGACGACTGTGCCGTCAAACCGCTGGACGACAGATTCGCCACA-
GCCCTC
CACTTCCTCAACGCAATGTCAAAGGTCAGAAAAGACATCCAGGAATGGAAGCCTTCGCATGGCTGGCACGATTG-
GCAGCAA
GTTCCCTTCTGTTCTAACCATTTTCAGGAGATTGTGATGAAAGATGGAAGGAGTATAGTTGTCCCGTGCAGAGG-
ACAGGATG
AGCTGATAGGCAGGGCTCGCATCTCTCCTGGAGCTGGATGGAATGTGAAGGACACAGCTTGCCTGGCCAAAGCA-
TATGCAC
AGATGTGGCTACTCCTATACTTCCATCGCAGGGACTTGCGTCTCATGGCAAATGCGATTTGCTCAGCAGTGCCA-
GTAGATTG
GGTGCCCACAGGCAGGACATCCTGGTCAATACACTCGAAAGGAGAGTGGATGACCACGGAAGACATGCTGCAGG-
TCTGGA
ACAGAGTTTGGATTGAAGAAAATGAATGGATGATGGACAAGACTCCAATCACAAGCTGGACAGACGTTCCGTAT-
GTGGGA
AAGCGCGAGGACATCTGGTGTGGCAGCCTCATCGGAACGCGATCCAGAGCAACCTGGGCTGAGAACATCTATGC-
GGCGAT
AAACCAGGTTAGAGCTGTCATTGGGAAAGAAAATTATGTTGACTACATGACCTCACTCAGGAGATACGAAGACG-
TCTTGATC
CAGGAAGACAGGGTCATCTAGTGTGATTTAAGGTAGAAAAGTAGACTATGTAAACAATGTAAATGAGAAAATGC-
ATGCATA
TGGAGTCAGGCCAGCAAAAGCTGCCACCGGATACTGGGTAGACGGTGCTGCCTGCGTCTCAGTCCCAGGAGGAC-
TGGGTT
AACAAATCTGACAACAGAAAGTGAGAAAGCCCTCAGAACCGTCTCGGAAGTAGGTCCCTGCTCACTGGAAGTTG-
AAAGACC
AACGTCAGGCCACAAATTTGTGCCACTCCGCTAGGGAGTGCGGCCTGCGCAGCCCCAGGAGGACTGGGTTACCA-
AAGCCGT
TGAGGCCCCCACGGCCCAAGCCTCGTCTAGGATGCAATAGACGAGGTGTAAGGACTAGAGGTTAGAGGAGACCC-
CGTGGA
AACAACAACATGCGGCCCAAGCCCCCTCGAAGCTGTAGAGGAGGTGGAAGGACTAGAGGTTAGAGGAGACCCCG-
CATTTG
CATCAAACAGCATATTGACACCTGGGAATAGACTGGGAGATCTTCTGCTCTATCTCAACATCAGCTACTAGGCA-
CAGAGCGC
CGAAGTATGTAGCTGGTGGTGAGGAAGAACACAGGATCT Japanese encephalitis virus
strain SA14-14-2, complete genome, ACCESSION: AF315119 SEQ ID NO:
75
AGAAGTTTATCTGTGTGAACTTCTTGGCTTAGTATCGTAGAGAAGAATCGAGAGATTAGTGCAGTTTAAACAGT-
TTTTTAGA
ACGGAAGATAACCATGACTAAAAAACCAGGAGGGCCCGGTAAAAACCGGGCTATCAATATGCTGAAACGCGGCC-
TACCCC
GCGTATTCCCACTAGTGGGAGTGAAGAGGGTAGTAATGAGCTTGTTGGACGGCAGAGGGCCAGTACGTTTCGTG-
CTGGCTC
TTATCACGTTCTTCAAGTTTACAGCATTAGCCCCGACCAAGGCGCTTTCAGGCCGATGGAAAGCAGTGGAAAAG-
AGTGTGG
CAATGAAACATCTTACTAGTTTCAAACGAGAACTTGGAACACTCATTGACGCCGTGAACAAGCGGGGCAGAAAG-
CAAAACA
AAAGAGGAGGAAATGAAGGCTCAATCATGTGGCTCGCGAGCTTGGCAGTTGTCATAGCTTGTGCAGGAGCCATG-
AAGTTGT
CGAATTTCCAGGGGAAGCTTTTGATGACCATCAACAACACGGACATTGCAGACGTTATCGTGATTCCCACCTCA-
AAAGGAGA
GAACAGATGCTGGGTCCGGGCAATCGACGTCGGCTACATGTGTGAGGACACTATCACGTACGAATGTCCTAAGC-
TTACCAT
GGGCAATGATCCAGAGGATGTGGATTGCTGGTGTGACAACCAAGAAGTCTACGTCCAATATGGACGGTGCACGC-
GGACCA
GGCATTCCAAGCGAAGCAGGAGATCCGTGTCGGTCCAAACACATGGGGAGAGTTCACTAGTGAATAAAAAAGAG-
GCTTGG
CTGGATTCAACGAAAGCCACACGATATCTCATGAAAACTGAGAACTGGATCATAAGGAATCCTGGCTATGCTTT-
CCTGGCGG
CGGTACTTGGCTGGATGCTTGGCAGTAACAACGGTCAACGCGTGGTATTTACCATCCTCCTGCTGTTGGTCGCT-
CCGGCTTA
CAGTTTTAATTGTCTGGGAATGGGCAATCGTGACTTCATAGAAGGAGCCAGTGGAGCCACTTGGGTGGACTTGG-
TGCTAGA
AGGAGACAGCTGCTTGACAATCATGGCAAACGACAAACCAACATTGGACGTCCGCATGATTAACATCGAAGCTA-
GCCAACT
TGCTGAGGTCAGAAGTTACTGCTATCATGCTTCAGTCACTGACATCTCGACGGTGGCTCGGTGCCCCACGACTG-
GAGAAGCC
CACAACGAGAAGCGAGCTGATAGTAGCTATGTGTGCAAACAAGGCTTCACTGACCGTGGGTGGGGCAACGGATG-
TGGATT
TTTCGGGAAGGGAAGCATTGACACATGTGCAAAATTCTCCTGCACCAGTAAAGCGATTGGGAGAACAATCCAGC-
CAGAAAA
CATCAAATACAAAGTTGGCATTTTTGTGCATGGAACCACCACTTCGGAAAACCATGGGAATTATTCAGCGCAAG-
TTGGGGCG
TCCCAGGCGGCAAAGTTTACAGTAACACCCAATGCTCCTTCGGTAGCCCTCAAACTTGGTGACTACGGAGAAGT-
CACACTGG
ACTGTGAGCCAAGGAGTGGACTGAACACTGAAGCGTTTTACGTCATGACCGTGGGGTCAAAGTCATTTCTGGTC-
CATAGGG
AGTGGTTTCATGACCTCGCTCTCCCCTGGACGTCCCCTTCGAGCACAGCGTGGAGAAACAGAGAACTCCTCATG-
GAATTTGA
AGGGGCGCACGCCACAAAACAGTCCGTTGTTGCTCTTGGGTCACAGGAAGGAGGCCTCCATCATGCGTTGGCAG-
GAGCCAT
CGTGGTGGAGTACTCAAGCTCAGTGATGTTAACATCAGGCCACCTGAAATGTAGGCTGAAAATGGACAAACTGG-
CTCTGAA
AGGCACAACCTATGGCATGTGTACAGAAAAATTCTCGTTCGCGAAAAATCCGGTGGACACTGGTCACGGAACAG-
TTGTCATT
GAACTCTCCTACTCTGGGAGTGATGGCCCCTGCAAAATTCCGATTGTTTCCGTTGCGAGCCTCAATGACATGAC-
CCCCGTTGG
GCGGCTGGTGACAGTGAACCCCTTCGTCGCGACTTCCAGTGCCAACTCAAAGGTGCTGGTCGAGATGGAACCCC-
CCTTCGG
AGACTCCTACATCGTAGTTGGAAGGGGAGACAAGCAGATCAACCACCATTGGCACAAAGCTGGAAGCACGCTGG-
GCAAGG
CCTTTTCAACAACTTTGAAGGGAGCTCAAAGACTGGCAGCGTTGGGCGACACAGCCTGGGACTTTGGCTCTATT-
GGAGGGG
TCTTCAACTCCATAGGAAGAGCCGTTCACCAAGTGTTTGGTGATGCCTTCAGAACACTCTTTGGGGGAATGTCT-
TGGATCAC
ACAAGGGCTAATGGGTGCCCTACTGCTCTGGATGGGCGTCAACGCACGAGACCGATCAATTGCTTTGGCCTTCT-
TAGCCACA
GGAGGTGTGCTCGTGTTCTTAGCGACCAATGTGCATGCTGACACTGGATGTGCCATTGACATCACAAGAAAAGA-
GATGAGA
TGTGGAAGTGGCATCTTCGTGCACAACGACGTGGAAGCCTGGGTGGATAGGTATAAATATTTGCCAGAAACGCC-
CAGATCC
CTAGCGAAGATCGTCCACAAAGCGCACAAGGAAGGCGTGTGCGGAGTCAGATCTGTCACTAGACTGGAGCACCA-
AATGTG
GGAAGCCGTAAGGGACGAATTGAACGTCCTGCTCAAAGAGAATGCAGTGGACCTCAGTGTGGTTGTGAACAAGC-
CCGTGG
GAAGATATCGCTCAGCCCCTAAACGCCTATCCATGACGCAAGAGAAGTTTGAAATGGGCTGGAAAGCATGGGGA-
AAAAGC
ATCCTCTTTGCCCCGGAATTGGCTAACTCCACATTTGTCGTAGATGGACCTGAGACAAAGGAATGCCCTGATGA-
GCACAGAG
CTTGGAACAGCATGCAAATCGAAGACTTCGGCTTTGGCATCACATCAACCCGTGTGTGGCTGAAAATTAGAGAG-
GAGAGCA
CTGACGAGTGTGATGGAGCGATCATAGGCACGGCTGTCAAAGGACATGTGGCAGTCCATAGTGACTTGTCGTAC-
TGGATTG
AGAGTCGCTACAACGACACATGGAAACTTGAGAGGGCAGTCTTTGGAGAGGTCAAATCTTGCACTTGGCCAGAG-
ACACACA
CCCTTTGGGGAGATGATGTTGAGGAAAGTGAACTCATCATTCCGCACACCATAGCCGGACCAAAAAGCAAGCAC-
AATCGGA
GGGAAGGGTATAAGACACAAAACCAGGGACCTTGGGATGAGAATGGCATAGTCTTGGACTTTGATTATTGCCCA-
GGGACA
AAAGTCACCATTACAGAGGATTGTAGCAAGAGAGGCCCTTCGGTCAGAACCACTACTGACAGTGGAAAGTTGAT-
CACTGAC
TGGTGCTGTCGCAGTTGCTCCCTTCCGCCCCTACGATTCCGGACAGAAAATGGCTGCTGGTACGGAATGGAAAT-
CAGACCTG
TTATGCATGATGAAACAACACTCGTCAGATCACAGGTTCATGCTTTCAAAGGTGAAATGGTTGACCCTTTTCAG-
CTGGGCCTT
CTGGTGATGTTTCTGGCCACCCAGGAAGTCCTTCGCAAGAGGTGGACGGCCAGATTGACCATTCCTGCGGTTTT-
GGGGGTC
CTACTTGTGCTGATGCTTGGGGGTATCACTTACACTGATTTGGCGAGGTATGTGGTGCTAGTCGCTGCTGCTTT-
CGCAGAGG
CCAACAGTGGAGGAGACGTCCTGCACCTTGCTTTGATTGCTGTTTTTAAGATCCAACCAGCATTTTTAGTGATG-
AACATGCTT
AGCACGAGATGGACGAACCAAGAAAACGTGGTTCTGGTCCTAGGGGCTGCCTTTTTCCAATTGGCCTCAGTAGA-
TCTGCAA
ATAGGAGTCCACGGAATCCTGAATGCCGCCGCTATAGCATGGATGATTGTCCGAGCGATCACCTTCCCCACAAC-
CTCCTCCG
TCACCATGCCAGTCTTAGCGCTTCTAACTCCGGGGATGAGGGCTCTATACCTAGACACTTACAGAATCATCCTC-
CTCGTCATA
GGGATTTGCTCCCTGCTGCACGAGAGGAAAAAGACCATGGCGAAAAAGAAAGGAGCTGTACTCTTGGGCTTAGC-
GCTCACA
TCCACTGGATGGTTCTCGCCCACCACTATAGCTGCCGGACTAATGGTCTGCAACCCAAACAAGAAGAGAGGGTG-
GCCAGCT
ACTGAGTTTTTGTCGGCAGTTGGATTGATGTTTGCCATCGTAGGTGGTTTGGCCGAGTTGGATATTGAATCCAT-
GTCAATACC
CTTCATGCTGGCAGGTCTCATGGCAGTGTCCTACGTGGTGTCAGGAAAAGCAACAGATATGTGGCTTGAACGGG-
CCGCCGA
CATCAGCTGGGATATGGGTGCTGCAATCACAGGAAGCAGTCGGAGGCTGGATGTGAAACTGGATGATGACGGAG-
ATTTTC
ACTTCATTGATGATCCCGGTGTTCCATGGAAGGTCTGGGTCCTGCGCATGTCTTGCATTGGCTTAGCCGCCCTC-
ACGCCTTGG
GCCATCGTTCCCGCCGCTTTCGGTTATTGGCTCACTTTAAAAACAACAAAAAGAGGGGGCGTGTTTTGGGACAC-
GCCATCCC
CAAAACCTTGCTCAAAAGGAGACACCACTACAGGAGTCTACCGAATTATGGCTAGAGGGATTCTTGGCACTTAC-
CAGGCCG
GCGTCGGAGTCATGTACGAGAATGTTTTCCACACACTATGGCACACAACTAGAGGAGCAGCCATTGTGAGTGGA-
GAAGGAA
AATTGACGCCATACTGGGGTAGTGTGAAAGAAGACCGCATAGCTTACGGAGGCCCATGGAGGTTTGACCGAAAA-
TGGAAT
GGAACAGATGACGTGCAAGTGATCGTGGTAGAACCGGGGAAGGGCGCAGTAAACATCCAGACAAAACCAGGAGT-
GTTTCG
GACTCCCTTCGGGGAGGTTGGGGCTGTTAGTCTGGATTACCCGCGAGGAACATCCGGCTCACCCATTCTGGATT-
CCAATGGA
GACATTATAGGCCTATACGGCAATGGAGTTGAGCTTGGCGATGGCTCATACGTCAGCGCCATCGTGCAGGGTGA-
CCGTCAG
GAGGAACCAGTCCCAGAAGCTTACACCCCAAACATGTTGAGAAAGAGACAGATGACTGTGCTAGATTTGCACCC-
TGGTTCA
GGGAAAACCAGGAAAATTCTGCCACAAATAATTAAGGACGCTATCCAGCAGCGCCTAAGAACAGCTGTGTTGGC-
ACCGACG
CGGGTGGTAGCAGCAGAAATGGCAGAAGTTTTGAGAGGGCTCCCAGTACGATATCAAACTTCAGCAGTGCAGAG-
AGAGCA
CCAAGGGAATGAAATAGTGGATGTGATGTGCCACGCCACTCTGACCCATAGACTGATGTCACCGAACAGAGTGC-
CCAACTA
CAACCTATTTGTCATGGATGAAGCTCATTTCACCGACCCAGCCAGTATAGCCGCACGAGGATACATTGCTACCA-
AGGTGGAA
TTAGGGGAGGCAGCAGCCATCTTTATGACAGCGACCCCGCCTGGAACCACGGATCCTTTTCCTGACTCAAATGC-
CCCAATCC
ATGATTTGCAAGATGAGATACCAGACAGGGCATGGAGCAGTGGATACGAATGGATCACAGAATATGCGGGTAAA-
ACCGTG
TGGTTTGTGGCGAGCGTAAAAATGGGGAATGAGATTGCAATGTGCCTCCAAAGAGCGGGGAAAAAGGTCATCCA-
ACTCAA
CCGCAAGTCCTATGACACAGAATACCCAAAATGTAAGAATGGAGACTGGGATTTTGTCATTACCACCGACATCT-
CTGAAATG
GGGGCCAACTTCGGTGCGAGCAGGGTCATCGACTGTAGAAAGAGCGTGAAACCCACCATCTTAGAAGAGGGAGA-
AGGCA
GAGTCATCCTCGGAAACCCATCTCCCATAACCAGTGCAAGCGCAGCTCAACGGAGGGGCAGAGTAGGCAGAAAC-
CCCAATC
AAGTTGGAGATGAATACCACTATGGGGGGGCTACCAGTGAAGATGACAGTAACCTAGCCCATTGGACAGAGGCA-
AAGATC
ATGTTAGACAACATACACATGCCCAATGGACTGGTGGCCCAGCTCTATGGACCAGAGAGGGAAAAGGCTTTCAC-
AATGGAT
GGCGAATACCGTCTCAGAGGTGAAGAAAAGAAAAACTTCTTAGAGCTGCTTAGGACGGCTGACCTCCCGGTGTG-
GCTGGCC
TACAAGGTGGCGTCCAATGGCATTCAGTACACCGACAGAAAGTGGTGTTTTGATGGGCCGCGTACGAATGCCAT-
ACTGGAG
GACAACACCGAGGTAGAGATAGTCACCCGGATGGGTGAGAGGAAAATCCTCAAGCCGAGATGGCTTGATGCAAG-
AGTTTA
TGCAGATCACCAGGCCCTCAAGTGGTTCAAAGACTTTGCAGCAGGGAAGAGATCAGCCGTTAGCTTCATAGAGG-
TGCTCGG
TCGCATGCCTGAGCATTTCATGGGAAAGACGCGGGAAGCTTTAGACACCATGTACTTGGTTGCAACGGCTGAGA-
AAGGTGG
GAAAGCACACCGAATGGCTCTCGAAGAGCTGCCAGATGCACTGGAAACCATCACACTTATTGTCGCCATTACTG-
TGATGACA
GGAGGATTCTTCCTACTAATGATGCAGCGAAAGGGTATAGGGAAGATGGGTCTTGGAGCTCTAGTGCTCACACT-
AGCTACC
TTCTTCCTGTGGGCGGCAGAGGTTCCTGGAACCAAAATAGCAGGGACCCTGCTGATCGCCCTGCTGCTGATGGT-
GGTTCTCA
TCCCAGAACCGGAAAAACAGAGGTCACAGACAGATAACCAACTGGCGGTGTTTCTCATCTGTGTCTTGACCGTG-
GTTGGAG
TGGTGGCAGCAAACGAGTACGGGATGCTAGAAAAAACCAAAGCGGATCTCAAGAGCATGTTTGGCGGAAAGACG-
CAGGCA
TCAGGACTGACTGGATTGCCAAGCATGGCACTGGACCTGCGTCCAGCCACAGCCTGGGCACTGTATGGGGGGAG-
CACAGTC
GTGCTAACCCCTCTTCTGAAGCACCTGATCACGTCGGAATACGTCACCACATCGCTAGCTTCAATTAACTCACA-
AGCTGGCTC
ATTATTCGTCTTGCCACGAGGCGTGCCTTTTACCGACCTAGACTTGACTGTTGGCCTCGTCTTCCTTGGCTGTT-
GGGGTCAAG
TCACCCTCACAACGTTTCTGACAGCCATGGTTCTGGCGACACTTCACTATGGGTACATGCTCCCTGGATGGCAA-
GCAGAAGC
ACTCAGGGCTGCCCAGAGAAGGACAGCGGCTGGAATAATGAAGAATGCCGTTGTTGACGGAATGGTCGCCACTG-
ATGTGC
CTGAACTGGAAAGGACTACTCCTCTGATGCAAAAGAAAGTCGGACAGGTGCTCCTCATAGGGGTAAGCGTGGCA-
GCGTTCC
TCGTCAACCCTAATGTCACCACTGTGAGAGAAGCAGGGGTGTTGGTGACGGCGGCTACGCTTACTTTGTGGGAC-
AATGGAG
CCAGTGCCGTTTGGAATTCCACCACAGCCACGGGACTCTGCCATGTCATGCGAGGTAGCTACCTGGCTGGAGGC-
TCCATTGC
TTGGACTCTCATCAAGAACGCTGATAAGCCCTCCTTGAAAAGGGGAAGGCCTGGGGGCAGGACGCTAGGGGAGC-
AGTGGA
AGGAAAAACTAAATGCCATGAGTAGAGAAGAGTTTTTTAAATACCGGAGAGAGGGCATAATCGAGGTGGACCGC-
ACTGAA
GCACGCAGGGCCAGAAGTGAAAATAACATAGTGGGAGGACATCCGGTTTCGCGAGGCTCAGCAAAACTCCGTTG-
GCTTGT
GGAGAAAGGATTTGTCTCGCCAATAGGAAAAGTCATTGATCTAGGGTGTGGGCGTGGAGGATGGAGCTACTACG-
CAGCAA
CCCTGAAGAAGGTCCAGGAAGTCAGAGGATACACGAAAGGTGGGGCGGGACATGAAGAACCGATGCTCATGCAG-
AGCTA
CGGCTGGAACCTGGTCTCCCTGAAGAGTGGAGTGGACGTGTTTTACAAACCTTCAGAGCCCAGTGATACCCTGT-
TCTGTGAC
ATAGGGGAATCCTCCCCAAGTCCAGAAGTAGAAGAACAACGCACACTACGCGTCCTAGAGATGACATCTGACTG-
GTTGCAC
CGAGGACCTAGAGAGTTCTGCATTAAAGTTCTCTGCCCTTACATGCCCAAGGTTATAGAAAAAATTGAAGTTCT-
GCAGCGCC
GCTTCGGAGGTGGGCTAGTGCGTCTCCCCCTGTCCCGAAACTCCAATCACGAGATGTATTGGGTTAGTGGAGCC-
GCTGGCA
ATGTGGTGCACGCTGTGAACATGACCAGCCAGGTATTACTGGGGCGAATGGATCGCACAGTGTGGAGAGGGCCA-
AAGTAT
GAGGAAGATGTCAACCTAGGGAGCGGAACAAGAGCCGTGGGAAAGGGAGAAGTCCATAGCAATCAGGAGAAAAT-
CAAGA
AGAGAATCCAGAAGCTTAAAGAAGAATTCGCCACAACGTGGCACAAAGACCCTGAGCATCCATACCGCACTTGG-
ACATACC
ACGGAAGCTATGAAGTGAAGGCTACTGGCTCAGCCAGCTCTCTCGTCAACGGAGTGGTGAAGCTCATGAGCAAA-
CCTTGGG
ACGCCATTGCCAACGTCACCACCATGGCCATGACTGACACCACCCCTTTTGGACAGCAAAGAGTTTTCAAGGAG-
AAAGTTGA
CACGAAGGCTCCTGAGCCACCAGCTGGAGCCAAGGAAGTGCTCAACGAGACCACCAACTGGCTGTGGGCCTACT-
TGTCACG
GGAAAAAAGACCCCGCTTGTGCACCAAGGAAGAATTCATTAAGAAAGTTAACAGCAACGCGGCTCTTGGAGCAG-
TGTTCGC
TGAACAGAATCAATGGAGCACGGCGCGTGAGGCTGTGGATGACCCGCGGTTTTGGGAGATGGTTGATGAAGAGA-
GGGAA
AACCATCTGCGAGGAGAGTGTCACACATGTATCTACAACATGATGGGAAAAAGAGAGAAGAAGCCTGGAGAGTT-
TGGAAA
AGCTAAAGGAAGCAGGGCCATTTGGTTCATGTGGCTTGGAGCACGGTATCTAGAGTTTGAAGCTTTGGGGTTCC-
TGAATGA
AGACCATTGGCTGAGCCGAGAGAATTCAGGAGGTGGAGTGGAAGGCTCAGGCGTCCAAAAGCTGGGATACATCC-
TCCGTG
ACATAGCAGGAAAGCAAGGAGGGAAAATGTACGCTGATGATACCGCCGGGTGGGACACTAGAATTACCAGAACT-
GATTTA
GAAAATGAAGCTAAGGTACTGGAGCTCCTAGACGGTGAACACCGCATGCTCGCCCGAGCCATAATTGAACTGAC-
TTACAGG
CACAAAGTGGTCAAGGTCATGAGACCTGCAGCAGAAGGAAAGACCGTGATGGACGTGATATCAAGAGAAGATCA-
AAGGG
GGAGTGGACAGGTGGTCACTTATGCTCTTAACACTTTCACGAACATCGCTGTCCAGCTCGTCAGGCTGATGGAG-
GCTGAGG
GGGTCATTGGACCACAACACTTGGAACATCTACCTAGGAAAAACAAGATAGCTGTCAGGACCTGGCTCTTTGAG-
AATGGAG
AGGAGAGAGTGACCAGGATGGCGATCAGCGGAGACGACTGTGCCGTCAAACCGCTGGACGACAGATTCGCCACA-
GCCCTC
CACTTCCTCAACGCAATGTCAAAGGTCAGAAAAGACATCCAGGAATGGAAGCCTTCGCATGGCTGGCACGATTG-
GCAGCAA
GTTCCCTTCTGTTCTAACCATTTTCAGGAGATTGTGATGAAAGATGGAAGGAGTATAGTTGTCCCGTGCAGAGG-
ACAGGATG
AGCTGATAGGCAGGGCTCGCATCTCTCCAGGAGCTGGATGGAATGTGAAGGACACAGCTTGCCTGCCCAAAGCA-
TATGCAC
AAATGTGGGTACTCCTATACTTCCACCGCAGGGACTTGCGTCTCATGGCAAATGCGATTTGCTCAGCAGTGCCA-
GTAGATTG
GGTGCCCACAGGCAGGACATCCTGGTCAATACACTCGAAAGGAGAGTGGATGACCACGGAAGACATGCTGCAGG-
TCTGGA
ACAGAGTTTGGATTGAAGAAAATGAATGGATGATGGACAAGACTCCAATCACAAGCTGGACAGACGTTCCGTAT-
GTGGGA
AAGCGCGAGGACATCTGGTGTGGCAGCCTCATCGGAACGCGATCCAGAGCAACCTGGGCTGAGAACATCTATGC-
GGCGAT
AAACCAGGTTAGAGCTGTCATTGGGAAAGAAAATTATGTTGACTACATGACCTCACTCAGGAGATACGAAGACG-
TCTTGATC
CAGGAAGACAGGGTCATCTAGTGTGATTTAAGGTAGAAAAGTAGACTATGTAAACAATGTAAATGAGAAAATGC-
ATGCATA
TGGAGTCAGGCCAGCAAAAGCTGCCACCGGATACTGGGTAGACGGTGCTGCCTGCGTCTCAGTCCCAGGAGGAC-
TGGGTT
AACAAATCTGACAACAGAAAGTGAGAAAGCCCTCAGAACTGTCTCGGAAGTAGGTCCCTGCTCACTGGAAGTTG-
AAAGACC
AACGTCAGGCCACAAATTTGTGCCACTCCGCTAGGGAGTGCGGCCTGCGCAGCCCCAGGAGGACTGGGTTACCA-
AAGCCGT
TGAGCCCCCACGGCCCAAGCCTCGTCTAGGATGCAATAGACGAGGTGTAAGGACTAGAGGTTAGAGGAGACCCC-
GTGGAA
ACAACAACATGCGGCCCAAGCCCCCTCGAAGCTGTAGAGGAGGTGGAAGGACTAGAGGTTAGAGGAGACCCCGC-
ATTTGC
ATCAAACAGCATATTGACACCTGGGAATAGACTGGGAGATCTTCTGCTCTATCTCAACATCAGCTACTAGGCAC-
AGAGCGCC GAAGTATGTACGTGGTGGTGAGGAAGAACACAGGATCT
>gi|564014614|gb|KF769015.1| Yellow fever virus strain 17D-204,
complete genome SEQ ID NO: 76
GTGCTAATTGAGGTGCATTGGTCTGCAAATCGAGTTGCTAGGCAATAAACACATTTGGATTAATTTTAATCGTT-
CGTTGAGCGA
TTAGCAGAGAACTGACCAGAACATGTCTGGTCGTAAAGCTCAGGGAAAAACCCTGGGCGTCAATATGGTACGAC-
GAGGAGTT
CGCTCCTTGTCAAACAAAATAAAACAAAAAACAAAACAAATTGGAAACAGACCTGGACCTTCAAGAGGTGTTCA-
AGGATTTAT
CTTTTTCTTTTTGTTCAACATTTTGACTGGAAAAAAGATCACAGCCCACCTAAAGAGGTTGTGGAAAATGCTGG-
ACCCAAGACA
AGGCTTGGCTGTTCTAAGGAAAGTCAAGAGAGTGGTGGCCAGTTTGATGAGAGGATTGTCCTCAAGGAAACGCC-
GTTCCCAT
GATGTTCTGACTGTGCAATTCCTAATTTTGGGAATGCTGTTGATGACGGGTGGAGTGACCTTGGTGCGGAAAAA-
CAGATGGTT
GCTCCTAAATGTGACATCTGAGGACCTCGGGAAAACATTCTCTGTGGGCACAGGCAACTGCACAACAAACATTT-
TGGAAGCCA
AGTACTGGTGCCCAGACTCAATGGAATACAACTGTCCCAATCTCAGTCCAAGAGAGGAGCCAGATGACATTGAT-
TGCTGGTGC
TATGGGGTGGAAAACGTTAGAGTCGCATATGGTAAGTGTGACTCAGCAGGCAGGTCTAGGAGGTCAAGAAGGGC-
CATTGACT
TGCCTACGCATGAAAACCATGGTTTGAAGACCCGGCAAGAAAAATGGATGACTGGAAGAATGGGTGAAAGGCAA-
CTCCAAAA
GATTGAGAGATGGTTCGTGAGGAACCCCTTTTTTGCAGTGACGGCTCTGACCATTGCCTACCTTGTGGGAAGCA-
ACATGACGC
AACGAGTCGTGATTGCCCTACTGGTCTTGGCTGTTGGTCCGGCCTACTCAGCTCACTGCATTGGAATTACTGAC-
AGGGATTTCA
TTGAGGGGGTGCATGGAGGAACTTGGGTTTCAGCTACCCTGGAGCAAGACAAGTGTGTCACTGTTATGGCCCCT-
GACAAGCCT
TCATTGGACATCTCACTAGAGACAGTAGCCATTGATAGACCTGCTGAGGTGAGGAAAGTGTGTTACAATGCAGT-
TCTCACTCAT
GTGAAGATTAATGACAAGTGCCCCAGCACTGGAGAGGCCCACCTAGCTGAAGAGAACGAAGGGGACAATGCGTG-
CAAGCGC
ACTTATTCTGATAGAGGCTGGGGCAATGGCTGTGGCCTATTTGGGAAAGGGAGCATTGTGGCATGCGCCAAATT-
CACTTGTGC
CAAATCCATGAGTTTGTTTGAGGTTGATCAGACCAAAATTCAGTATGTCATCAGAGCACAATTGCATGTAGGGG-
CCAAGCAGG
AAAATTGGACTACCGACATTAAGACTCTCAAGTTTGATGCCCTGTCAGGCTCCCAGGAAGTCGAGTTCATTGGG-
TATGGAAAA
GCTACACTGGAATGCCAGGTGCAAACTGCGGTGGACTTTGGTAACAGTTACATCGCTGAGATGGAAACAGAGAG-
CTGGATAG
TGGACAGACAGTGGGCCCAGGACTTGACCCTGCCATGGCAGAGTGGAAGTGGCGGGGTGTGGAGAGAGATGCAT-
CATCTTG
TCGAATTTGAACCTCCGCATGCCGCCACTATCAGAGTACTGGCCCTGGGAAACCAGGAAGGCTCCTTGAAAACA-
GCTCTTACT
GGCGCAATGAGGGTTACAAAGGACACAAATGACAACAACCTTTACAAACTACATGGTGGACATGTTTCTTGCAG-
AGTGAAATT
GTCAGCTTTGACACTCAAGGGGACATCCTACAAAATATGCACTGACAAAATGTTTTTTGTCAAGAACCCAACTG-
ACACTGGCCA
TGGCACTGTTGTGATGCAGGTGAAAGTGTCAAAAGGAGCCCCCTGCAGGATTCCAGTGATAGTAGCTGATGATC-
TTACAGCG
GCAATCAATAAAGGCATTTTGGTTACAGTTAACCCCATCGCCTCAACCAATGATGATGAAGTGCTGATTGAGGT-
GAACCCACCT
TTTGGAGACAGCTACATTATCGTTGGGAGAGGAGATTCACGTCTCACTTACCAGTGGCACAAAGAGGGAAGCTC-
AATAGGAA
AGTTGTTCACTCAGACCATGAAAGGCGTGGAACGCCTGGCCGTCATGGGAGACACCGCCTGGGATTTCAGCTCC-
GCTGGAGG
GTTCTTCACTTCGGTTGGGAAAGGAATTCATACGGTGTTTGGCTCTGCCTTTCAGGGGCTATTTGGCGGCTTGA-
ACTGGATAAC
AAAGGTCATCATGGGGGCGGTACTTATATGGGTTGGCATCAACACAAGAAACATGACAATGTCCATGAGCATGA-
TCTTGGTAG
GAGTGATCATGATGTTTTTGTCTCTAGGAGTTGGGGCGGATCAAGGATGCGCCATCAACTTTGGCAAGAGAGAG-
CTCAAGTGC
GGAGATGGTATCTTCATATTTAGAGACTCTGATGACTGGCTGAACAAGTACTCATACTATCCAGAAGATCCTGT-
GAAGCTTGCA
TCAATAGTGAAAGCCTCTTTTGAAGAAGGGAAGTGTGGCCTAAATTCAGTTGACTCCCTTGAGCATGAGATGTG-
GAGAAGCAG
GGCAGATGAGATCAATGCCATTTTTGAGGAAAACGAGGTGGACATTTCTGTTGTCGTGCAGGATCCAAAGAATG-
TTTACCAGA
GAGGAACTCATCCATTTTCCAGAATTCGGGATGGTCTGCAGTATGGTTGGAAGACTTGGGGTAAGAACCTTGTG-
TTCTCCCCA
GGGAGGAAGAATGGAAGCTTCATCATAGATGGAAAGTCCAGGAAAGAATGCCCGTTTTCAAACCGGGTCTGGAA-
TTCTTTCC
AGATAGAGGAGTTTGGGACGGGAGTGTTCACCACACGCGTGTACATGGACGCAGTCTTTGAATACACCATAGAC-
TGCGATGG
ATCTATCTTGGGTGCAGCGGTGAACGGAAAAAAGAGTGCCCATGGCTCTCCAACATTTTGGATGGGAAGTCATG-
AAGTAAAT
GGGACATGGATGATCCACACCTTGGAGGCATTAGATTACAAGGAGTGTGAGTGGCCACTGACACATACGATTGG-
AACATCAG
TTGAAGAGAGTGAAATGTTCATGCCGAGATCAATCGGAGGCCCAGTTAGCTCTCACAATCATATCCCTGGATAC-
AAGGTTCAG
ACGAACGGACCTTGGATGCAGGTACCACTAGAAGTGAAGAGAGAAGCTTGCCCAGGGACTAGCGTGATCATTGA-
TGGCAACT
GTGATGGACGGGGAAAATCAACCAGATCCACCACGGATAGCGGGAAAGTTATTCCTGAATGGTGTTGCCGCTCC-
TGCACAAT
GCCGCCTGTGAGCTTCCATGGTAGTGATGGGTGTTGGTATCCCATGGAAATTAGGCCAAGGAAAACGCATGAAA-
GCCATCTG
GTGCGCTCCTGGGTTACAGCTGGAGAAATACATGCTGTCCCTTTTGGTTTGGTGAGCATGATGATAGCAATGGA-
AGTGGTCCT
AAGGAAAAGACAGGGACCAAAGCAAATGTTGGTTGGAGGAGTAGTGCTCTTGGGAGCAATGCTGGTCGGGCAAG-
TAACTCT
CCTTGATTTGCTGAAACTCACAGTGGCTGTGGGATTGCATTTCCATGAGATGAACAATGGAGGAGACGCCATGT-
ATATGGCGT
TGATTGCTGCCTTTTCAATCAGACCAGGGCTGCTCATCGGCTTTGGGCTCAGGACCCTATGGAGCCCTCGGGAA-
CGCCTTGTGC
TGACCCTAGGAGCAGCCATGGTGGAGATTGCCTTGGGTGGCGTGATGGGCGGCCTGTGGAAGTATCTAAATGCA-
GTTTCTCTC
TGCATCCTGACAATAAATGCTGTTGCTTCTAGGAAAGCATCAAATACCATCTTGCCCCTCATGGCTCTGTTGAC-
ACCTGTCACTA
TGGCTGAGGTGAGACTTGCCGCAATGTTCTTTTGTGCCGTGGTTATCATAGGGGTCCTTCACCAGAATTTCAAG-
GACACCTCCA
TGCAGAAGACTATACCTCTGGTGGCCCTCACACTCACATCTTACCTGGGCTTGACACAACCTTTTTTGGGCCTG-
TGTGCATTTCT
GGCAACCCGCATATTTGGGCGAAGGAGTATCCCAGTGAATGAGGCACTCGCAGCAGCTGGTCTAGTGGGAGTGC-
TGGCAGG
ACTGGCTTTTCAGGAGATGGAGAACTTCCTTGGTCCGATTGCAGTTGGAGGACTCCTGATGATGCTGGTTAGCG-
TGGCTGGGA
GGGTGGATGGGCTAGAGCTCAAGAAGCTTGGTGAAGTTTCATGGGAAGAGGAGGCGGAGATCAGCGGGAGTTCC-
GCCCGCT
ATGATGTGGCACTCAGTGAACAAGGGGAGTTCAAGCTGCTTTCTGAAGAGAAAGTGCCATGGGACCAGGTTGTG-
ATGACCTC
GCTGGCCTTGGTTGGGGCTGCCCTCCATCCATTTGCTCTTCTGCTGGTCCTTGCTGGGTGGCTGTTTCATGTCA-
GGGGAGCTAG
GAGAAGTGGGGATGTCTTGTGGGATATTCCCACTCCTAAGATCATCGAGGAATGTGAACATCTGGAGGATGGGA-
TTTATGGC
ATATTCCAGTCAACCTTCTTGGGGGCCTCCCAGCGAGGAGTGGGAGTGGCACAGGGAGGGGTGTTCCACACAAT-
GTGGCATG
TCACAAGAGGAGCTTTCCTTGTCAGGAATGGCAAGAAGTTGATTCCATCTTGGGCTTCAGTAAAGGAAGACCTT-
GTCGCCTAT
GGTGGCTCATGGAAGTTGGAAGGCAGATGGGATGGAGAGGAAGAGGTCCAGTTGATCGCGGCTGTTCCAGGAAA-
GAACGTG
GTCAACGTCCAGACAAAACCGAGCTTGTTCAAAGTGAGGAATGGGGGAGAAATCGGGGCTGTCGCTCTTGACTA-
TCCGAGTG
GCACTTCAGGATCTCCTATTGTTAACAGGAACGGAGAGGTGATTGGGCTGTACGGCAATGGCATCCTTGTCGGT-
GACAACTCC
TTCGTGTCCGCCATATCCCAGACTGAGGTGAAGGAAGAAGGAAAGGAGGAGCTCCAAGAGATCCCGACAATGCT-
AAAGAAA
GGAATGACAACTGTCCTTGATTTTCATCCTGGAGCTGGGAAGACAAGACGTTTCCTCCCACAGATCTTGGCCGA-
GTGCGCACG
GAGACGCTTGCGCACTCTTGTGTTGGCCCCCACCAGGGTTGTTCTTTCTGAAATGAAGGAGGCTTTTCACGGCC-
TGGACGTGA
AATTCCACACACAGGCTTTTTCCGCTCACGGCAGCGGGAGAGAAGTCATTGATGCTATGTGCCATGCCACCCTA-
ACTTACAGGA
TGTTGGAACCAACTAGGGTTGTTAACTGGGAAGTGATCATTATGGATGAAGCCCATTTTTTGGATCCAGCTAGC-
ATAGCCGCTA
GAGGTTGGGCAGCGCACAGAGCTAGGGCAAATGAAAGTGCAACAATCTTGATGACAGCCACACCGCCTGGGACT-
AGTGATG
AATTTCCACATTCAAATGGTGAAATAGAAGATGTTCAAACGGACATACCCAGTGAGCCCTGGAACACAGGGCAT-
GACTGGATC
CTGGCTGACAAAAGGCCCACGGCATGGTTCCTTCCATCCATCAGAGCTGCAAATGTCATGGCTGCCTCTTTGCG-
TAAGGCTGG
AAAGAGTGTGGTGGTCCTGAACAGGAAAACCTTTGAGAGAGAATACCCCACGATAAAGCAGAAGAAACCTGACT-
TTATATTG
GCCACTGACATAGCTGAAATGGGAGCCAACCTTTGCGTGGAGCGAGTGCTGGATTGCAGGACGGCTTTTAAGCC-
TGTGCTTGT
GGATGAAGGGAGGAAGGTGGCAATAAAAGGGCCACTTCGTATCTCCGCATCCTCTGCTGCTCAAAGGAGGGGGC-
GCATTGG
GAGAAATCCCAACAGAGATGGAGACTCATACTACTATTCTGAGCCTACAAGTGAAAATAATGCCCACCACGTCT-
GCTGGTTGG
AGGCCTCAATGCTCTTGGACAACATGGAGGTGAGGGGTGGAATGGTCGCCCCACTCTATGGCGTTGAAGGAACT-
AAAACACC
AGTTTCCCCTGGTGAAATGAGACTGAGGGATGACCAGAGGAAAGTCTTCAGAGAACTAGTGAGGAATTGTGACC-
TGCCCGTT
TGGCTTTCGTGGCAAGTGGCCAAGGCTGGTTTGAAGACGAATGATCGTAAGTGGTGTTTTGAAGGCCCTGAGGA-
ACATGAGA
TCTTGAATGACAGCGGTGAAACAGTGAAGTGCAGGGCTCCTGGAGGAGCAAAGAAGCCTCTGCGCCCAAGGTGG-
TGTGATG
AAAGGGTGTCATCTGACCAGAGTGCGCTGTCTGAATTTATTAAGTTTGCTGAAGGTAGGAGGGGAGCTGCTGAA-
GTGCTAGTT
GTGCTGAGTGAACTCCCTGATTTCCTGGCTAAAAAAGGTGGAGAGGCAATGGATACCATCAGTGTGTTTCTCCA-
CTCTGAGGA
AGGCTCTAGGGCTTACCGCAATGCACTATCAATGATGCCTGAGGCAATGACAATAGTCATGCTGTTTATACTGG-
CTGGACTACT
GACATCGGGAATGGTCATCTTTTTCATGTCTCCCAAAGGCATCAGTAGAATGTCTATGGCGATGGGCACAATGG-
CCGGCTGTG
GATATCTCATGTTCCTTGGAGGCGTCAAACCCACTCACATCTCCTATATCATGCTCATATTCTTTGTCCTGATG-
GTGGTTGTGATC
CCCGAGCCAGGGCAACAAAGGTCCATCCAAGACAACCAAGTGGCATACCTCATTATTGGCATCCTGACGCTGGT-
TTCAGCGGT
GGCAGCCAACGAGCTAGGCATGCTGGAGAAAACCAAAGAGGACCTCTTTGGGAAGAAGAACTTAATTCCATCTA-
GTGCTTCAC
CCTGGAGTTGGCCGGATCTTGACCTGAAGCCAGGAGCTGCCTGGACAGTGTACGTTGGCATTGTTACAATGCTC-
TCTCCAATG
TTGCACCACTGGATCAAAGTCGAATATGGCAACCTGTCTCTGTCTGGAATAGCCCAGTCAGCCTCAGTCCTTTC-
TTTCATGGACA
AGGGGATACCATTCATGAAGATGAATATCTCGGTCATAATGCTGCTGGTCAGTGGCTGGAATTCAATAACAGTG-
ATGCCTCTG
CTCTGTGGCATAGGGTGCGCCATGCTCCACTGGTCTCTCATTTTACCTGGAATCAAAGCGCAGCAGTCAAAGCT-
TGCACAGAG
AAGGGTGTTCCATGGCGTTGCCAAGAACCCTGTGGTTGATGGGAATCCAACAGTTGACATTGAGGAAGCTCCTG-
AAATGCCTG
CCCTTTATGAGAAGAAACTGGCTCTATATCTCCTTCTTGCTCTCAGCCTAGCTTCTGTTGCCATGTGCAGAACG-
CCCTTTTCATTG
GCTGAAGGCATTGTCCTAGCATCAGCTGCCCTAGGGCCGCTCATAGAGGGAAACACCAGCCTTCTTTGGAATGG-
ACCCATGGC
TGTCTCCATGACAGGAGTCATGAGGGGGAATCACTATGCTTTTGTGGGAGTCATGTACAATCTATGGAAGATGA-
AAACTGGAC
GCCGGGGGAGCGCGAATGGAAAAACTTTGGGTGAAGTCTGGAAGAGGGAACTGAATCTGTTGGACAAGCGACAG-
TTTGAGT
TGTATAAAAGGACCGACATTGTGGAGGTGGATCGTGATACGGCACGCAGGCATTTGGCCGAAGGGAAGGTGGAC-
ACCGGGG
TGGCGGTCTCCAGGGGGACCGCAAAGTTAAGGTGGTTCCATGAGCGTGGCTATGTCAAGCTGGAAGGTAGGGTG-
ATTGACCT
GGGGTGTGGCCGCGGAGGCTGGTGTTACTACGCTGCTGCGCAAAAGGAAGTGAGTGGGGTCAAAGGATTTACTC-
TTGGAAG
AGACGGCCATGAGAAACCCATGAATGTGCAAAGTCTGGGATGGAACATCATCACCTTCAAGGACAAAACTGATA-
TCCACCGCC
TAGAACCAGTGAAATGTGACACCCTTTTGTGTGACATTGGAGAGTCATCATCGTCATCGGTCACAGAGGGGGAA-
AGGACCGT
GAGAGTTCTTGATACTGTAGAAAAATGGCTGGCTTGTGGGGTTGACAACTTCTGTGTGAAGGTGTTAGCTCCAT-
ACATGCCAG
ATGTTCTCGAGAAACTGGAATTGCTCCAAAGGAGGTTTGGCGGAACAGTGATCAGGAACCCTCTCTCCAGGAAT-
TCCACTCAT
GAAATGTACTACGTGTCTGGAGCCCGCAGCAATGTCACATTTACTGTGAACCAAACATCCCGCCTCCTGATGAG-
GAGAATGAG
GCGTCCAACTGGAAAAGTGACCCTGGAGGCTGACGTCATCCTCCCAATTGGGACACGCAGTGTTGAGACAGACA-
AGGGACCC
CTGGACAAAGAGGCCATAGAAGAAAGGGTTGAGAGGATAAAATCTGAGTACATGACCTCTTGGTTTTATGACAA-
TGACAACC
CCTACAGGACCTGGCACTACTGTGGCTCCTATGTCACAAAAACCTCAGGAAGTGCGGCGAGCATGGTAAATGGT-
GTTATTAAA
ATTCTGACATATCCATGGGACAGGATAGAGGAGGTCACAAGAATGGCAATGACTGACACAACCCCTTTTGGACA-
GCAAAGAG
TGTTTAAAGAAAAAGTTGACACCAGAGCAAAGGATCCACCAGCGGGAACTAGGAAGATCATGAAAGTTGTCAAC-
AGGTGGCT
GTTCCGCCACCTGGCCAGAGAAAAGAACCCCAGACTGTGCACAAAGGAAGAATTTATTGCAAAAGTCCGAAGTC-
ATGCAGCC
ATTGGAGCTTACCTGGAAGAACAAGAACAGTGGAAGACTGCCAATGAGGCTGTCCAAGACCCAAAGTTCTGGGA-
ACTGGTGG
ATGAAGAAAGGAAGCTGCACCAACAAGGCAGGTGTCGGACTTGTGTGTACAACATGATGGGGAAAAGAGAGAAG-
AAGCTGT
CAGAGTTTGGGAAAGCAAAGGGAAGCCGTGCCATATGGTATATGTGGCTGGGAGCGCGGTATCTTGAGTTTGAG-
GCCCTGGG
ATTCCTGAATGAGGACCATTGGGCTTCCAGGGAAAACTCAGGAGGAGGAGTGGAAGGCATTGGCTTACAATACC-
TAGGATAT
GTGATCAGAGACCTGGCTGCAATGGATGGTGGTGGATTCTACGCGGATGACACCGCTGGATGGGACACGCGCAT-
CACAGAG
GCAGACCTTGATGATGAACAGGAGATCTTGAACTACATGAGCCCACATCACAAAAAACTGGCACAAGCAGTGAT-
GGAAATGA
CATACAAGAACAAAGTGGTGAAAGTGTTGAGACCAGCCCCAGGAGGGAAAGCCTACATGGATGTCATAAGTCGA-
CGAGACCA
GAGAGGATCCGGGCAGGTAGTGACTTATGCTCTGAACACCATCACCAACTTGAAAGTCCAATTGATCAGAATGG-
CAGAAGCA
GAGATGGTGATACATCACCAACATGTTCAAGATTGTGATGAATCAGTTCTGACCAGGCTGGAGGCATGGCTCAC-
TGAGCACGG
ATGTAACAGACTGAAGAGGATGGCGGTGAGTGGAGACGACTGTGTGGTCCGGCCCATCGATGACAGGTTCGGCC-
TGGCCCTG
TCCCATCTCAACGCCATGTCCAAGGTTAGAAAGGACATATCTGAATGGCAGCCATCAAAAGGGTGGAATGATTG-
GGAGAATG
TGCCCTTCTGTTCCCACCACTTCCATGAACTACAGCTGAAGGATGGCAGGAGGATTGTGGTGCCTTGCCGAGAA-
CAGGACGAG
CTCATTGGGAGAGGAAGGGTGTCTCCAGGAAACGGCTGGATGATCAAGGAAACAGCTTGCCTCAGCAAAGCCTA-
TGCCAACA
TGTGGTCACTGATGTATTTTCACAAAAGGGACATGAGGCTACTGTCATTGGCTGTTTCCTCAGCTGTTCCCACC-
TCATGGGTTCC
ACAAGGACGCACAACATGGTCGATTCATGGGAAAGGGGAGTGGATGACCACGGAAGACATGCTTGAGGTGTGGA-
ACAGAGT
ATGGATAACCAACAACCCACACATGCAGGACAAGACAATGGTGAAAAAATGGAGAGATGTCCCTTATCTAACCA-
AGAGACAA
GACAAGCTGTGCGGATCACTGATTGGAATGACCAATAGGGCCACCTGGGCCTCCCACATCCATTTGGTCATCCA-
TCGTATCCGA
ACGCTGATTGGACAGGAGAAATACACTGACTACCTAACAGTCATGGACAGGTATTCTGTGGATGCTGACCTGCA-
ACTGGGTGA
GCTTATCTGAAACACCATCTAACAGGAATAACCGGGATACAAACCACGGGTGGAGAACCGGACTCCCCACAACC-
TGAAACCG
GGATATAAACCACGGCTGGAGAACCGGACTCCGCACTTAAAATGAAACAGAAACCGGGATAAAAACTACGGATG-
GAGAACC
GGACTCCACACATTGAGACAGAAGAAGTTGTCAGCCCAGAACCCCACACGAGTTTTGCCACTGCTAAGCTGTGA-
GGCAGTGCA
GGCTGGGACAGCCGACCTCCAGGTTGCGAAAAACCTGGTTTCTGGGACCTCCCACCCCAGAGTAAAAAGAACGG-
AGCCTCCG
CTACCACCCTCCCACGTGGTGGTAGAAAGACGGGGTCTAGAGGTTAGAGGAGACCCTCCAGGGAACAAATAGTG-
GGACCATA
TTGACGCCAGGGAAAGACCGGAGTGGTTCTCTGCTTTTCCTCCAGAGGTCTGTGAGCACAGTTTGCTCAAGAAT-
AAGCAGACC TTTGGATGACAAA Attenuated Chikungunya "Delta5nsP3"
sequence SEQ ID NO: 77
GATGGCTGCGTGAGACACACGTAGCCTACCAGTTTCTTACTGCTCTACTCTGCAAAGCAAGAGATTAATAACCC-
ATCATGGATC
CTGTGTACGTGGACATAGACGCTGACAGCGCCTTTTTGAAGGCCCTGCAACGTGCGTACCCCATGTTTGAGGTG-
GAACCAAGG
CAGGTCACACCGAATGACCATGCTAATGCTAGAGCGTTCTCGCATCTAGCTATAAAACTAATAGAGCAGGAAAT-
TGACCCCGA
CTCAACCATCCTGGATATCGGCAGTGCGCCAGCAAGGAGGATGATGTCGGACAGGAAGTACCACTGCGTCTGCC-
CGATGCGC
AGTGCGGAAGATCCCGAGAGACTCGCCAATTATGCGAGAAAGCTAGCATCTGCCGCAGGAAAAGTCCTGGACAG-
AAACATCT
CTGGAAAGATCGGGGACTTACAAGCAGTAATGGCCGTGCCAGACACGGAGACGCCAACATTCTGCTTACACACA-
GACGTCTCA
TGTAGACAGAGAGCAGACGTCGCTATATACCAAGACGTCTATGCTGTACACGCACCCACGTCGCTATACCACCA-
GGCGATTAA
AGGGGTCCGAGTGGCGTACTGGGTTGGGTTCGACACAACCCCGTTCATGTACAATGCCATGGCGGGTGCCTACC-
CCTCATACT
CGACAAACTGGGCAGATGAGCAGGTACTGAAGGCTAAGAACATAGGATTATGTTCAACAGACCTGACGGAAGGT-
AGACGAG
GCAAGTTGTCTATTATGAGAGGGAAAAAGCTAAAACCGTGCGACCGTGTGCTGTTCTCAGTAGGGTCAACGCTC-
TACCCGGAA
AGCCGCAAGCTACTTAAGAGCTGGCACCTGCCATCGGTGTTCCATTTAAAGGGCAAACTCAGCTTCACATGCCG-
CTGTGATACA
GTGGTTTCGTGTGAGGGCTACGTCGTTAAGAGAATAACGATGAGCCCAGGCCTTTATGGAAAAACCACAGGGTA-
TGCGGTAA
CCCACCACGCAGACGGATTCCTGATGTGCAAGACTACCGACACGGTTGACGGCGAAAGAATGTCATTCTCGGTG-
TGCACATAC
GTGCCGGCGACCATTTGTGATCAAATGACCGGCATCCTTGCTACAGAAGTCACGCCGGAGGATGCACAGAAGCT-
GTTGGTGG
GGCTGAACCAGAGAATAGTGGTTAACGGCAGAACGCAACGGAATACGAACACCATGAAAAATTATCTGCTTCCC-
GTGGTCGC
CCAAGCCTTCAGTAAGTGGGCAAAGGAGTGCCGGAAAGACATGGAAGATGAAAAACTCCTGGGGGTCAGAGAAA-
GAACACT
GACCTGCTGCTGTCTATGGGCATTCAAGAAGCAGAAAACACACACGGTCTACAAGAGGCCTGATACCCAGTCAA-
TTCAGAAGG
TTCAGGCCGAGTTTGACAGCTTTGTGGTACCGAGTCTGTGGTCGTCCGGGTTGTCAATCCCTTTGAGGACTAGA-
ATCAAATGGT
TGTTAAGCAAGGTGCCAAAAACCGACCTGATCCCATACAGCGGAGACGCCCGAGAAGCCCGGGACGCAGAAAAA-
GAAGCAG
AGGAAGAACGAGAAGCAGAACTGACTCGCGAAGCCCTACCACCTCTACAGGCAGCACAGGAAGATGTTCAGGTC-
GAAATCGA
CGTGGAACAGCTTGAGGACAGAGCGGGCGCAGGAATAATAGAGACTCCGAGAGGAGCTATCAAAGTTACTGCCC-
AACCAAC
AGACCACGTCGTGGGAGAGTACCTGGTACTCTCCCCGCAGACCGTACTACGTAGCCAGAAGCTCAGTCTGATTC-
ACGCTTTGG
CGGAGCAAGTGAAGACGTGCACGCACAACGGACGAGCAGGGAGGTATGCGGTCGAAGCGTACGACGGCCGAGTC-
CTAGTGC
CCTCAGGCTATGCAATCTCGCCTGAAGACTTCCAGAGTCTAAGCGAAAGCGCAACGATGGTGTATAACGAAAGA-
GAGTTCGTA
AACAGAAAGCTACACCATATTGCGATGCACGGACCAGCCCTGAACACCGACGAAGAGTCGTATGAGCTGGTGAG-
GGCAGAGA
GGACAGAACACGAGTACGTCTACGACGTGGATCAGAGAAGATGCTGTAAGAAGGAAGAAGCCGCAGGACTGGTA-
CTGGTGG
GCGACTTGACTAATCCGCCCTACCACGAATTCGCATATGAAGGGCTAAAAATCCGCCCTGCCTGCCCATACAAA-
ATTGCAGTCA
TAGGAGTCTTCGGAGTACCGGGATCTGGCAAGTCAGCTATTATCAAGAACCTAGTTACCAGGCAGGACCTGGTG-
ACTAGCGG
AAAGAAAGAAAACTGCCAAGAAATCACCACCGACGTGATGAGACAGAGAGGTCTAGAGATATCTGCACGTACGG-
TTGACTCG
CTGCTCTTGAATGGATGCAACAGACCAGTCGACGTGTTGTACGTAGACGAGGCGTTTGCGTGCCACTCTGGAAC-
GCTACTTGC
TTTGATCGCCTTGGTGAGACCAAGGCAGAAAGTTGTACTTTGTGGTGACCCGAAGCAGTGCGGCTTCTTCAATA-
TGATGCAGA
TGAAAGTCAACTATAATCACAACATCTGCACCCAAGTGTACCACAAAAGTATCTCCAGGCGGTGTACACTGCCT-
GTGACCGCCA
TTGTGTCATCGTTGCATTACGAAGGCAAAATGCGCACTACGAATGAGTACAACAAGCCGATTGTAGTGGACACT-
ACAGGCTCA
ACAAAACCTGACCCTGGAGACCTCGTGTTAACGTGCTTCAGAGGGTGGGTTAAACAACTGCAAATTGACTATCG-
TGGATACGA
GGTCATGACAGCAGCCGCATCCCAAGGGTTAACCAGAAAAGGAGTTTACGCAGTTAGACAAAAAGTTAATGAAA-
ACCCGCTCT
ATGCATCAACGTCAGAGCACGTCAACGTACTCCTAACGCGTACGGAAGGTAAACTGGTATGGAAGACACTTTCC-
GGCGACCCG
TGGATAAAGACGCTGCAGAACCCACCGAAAGGAAACTTCAAAGCAACTATTAAGGAGTGGGAGGTGGAGCATGC-
ATCAATAA
TGGCGGGCATCTGCAGTCACCAAATGACCTTCGATACATTCCAAAATAAAGCCAACGTTTGTTGGGCTAAGAGC-
TTGGTCCCTA
TCCTCGAAACAGCGGGGATAAAACTAAATGATAGGCAGTGGTCTCAGATAATTCAAGCCTTCAAAGAAGACAAA-
GCATACTCA
CCTGAAGTAGCCCTGAATGAAATATGTACGCGCATGTATGGGGTGGATCTAGACAGCGGGCTATTTTCTAAACC-
GTTGGTGTC
TGTGTATTACGCGGATAACCACTGGGATAATAGGCCTGGAGGGAAAATGTTCGGATTTAACCCCGAGGCAGCAT-
CCATTCTAG
AAAGAAAGTATCCATTCACAAAAGGGAAGTGGAACATCAACAAGCAGATCTGCGTGACTACCAGGAGGATAGAA-
GACTTTAA
CCCTACCACCAACATCATACCGGCCAACAGGAGACTACCACACTCATTAGTGGCCGAACACCGCCCAGTAAAAG-
GGGAAAGAA
TGGAATGGCTGGTTAACAAGATAAACGGCCACCACGTGCTCCTGGTCAGTGGCTATAACCTTGCACTGCCTACT-
AAGAGAGTC
ACTTGGGTAGCGCCGTTAGGTGTCCGCGGAGCGGACTACACATACAACCTAGAGTTGGGTCTGCCAGCAACGCT-
TGGTAGGT
ATGACCTAGTGGTCATAAACATCCACACACCTTTTCGCATACACCATTACCAACAGTGCGTCGACCACGCAATG-
AAACTGCAAA
TGCTCGGGGGTGACTCATTGAGACTGCTCAAACCGGGCGGCTCTCTATTGATCAGAGCATATGGTTACGCAGAT-
AGAACCAGT
GAACGAGTCATCTGCGTATTGGGACGCAAGTTTAGATCGTCTAGAGCGTTGAAACCACCATGTGTCACCAGCAA-
CACTGAGAT
GTTTTTCCTATTCAGCAACTTTGACAATGGCAGAAGGAATTTCACAACTCATGTCATGAACAATCAACTGAATG-
CAGCCTTCGTA
GGACAGGTCACCCGAGCAGGATGTGCACCGTCGTACCGGGTAAAACGCATGGACATCGCGAAGAACGATGAAGA-
GTGCGTA
GTCAACGCCGCTAACCCTCGCGGGTTACCGGGTGGCGGTGTTTGCAAGGCAGTATACAAAAAATGGCCGGAGTC-
CTTTAAGA
ACAGTGCAACACCAGTGGGAACCGCAAAAACAGTTATGTGCGGTACGTATCCAGTAATCCACGCTGTTGGACCA-
AACTTCTCT
AATTATTCGGAGTCTGAAGGGGACCGGGAATTGGCAGCTGCCTATCGAGAAGTCGCAAAGGAAGTAACTAGGCT-
GGGAGTA
AATAGTGTAGCTATACCTCTCCTCTCCACAGGTGTATACTCAGGAGGGAAAGACAGGCTGACCCAGTCACTGAA-
CCACCTCTTT
ACAGCCATGGACTCGACGGATGCAGACGTGGTCATCTACTGCCGCGACAAAGAATGGGAGAAGAAAATATCTGA-
GGCCATAC
AGATGCGGACCCAAGTAGAGCTGCTGGATGAGCACATCTCCATAGACTGCGATATTGTTCGCGTGCACCCTGAC-
AGCAGCTTG
GCAGGCAGAAAAGGATACAGCACCACGGAAGGCGCACTGTACTCATATCTAGAAGGGACCCGTTTTCATCAGAC-
GGCTGTGG
ATATGGCGGAGATACATACTATGTGGCCAAAGCAAACAGAGGCCAATGAGCAAGTCTGCCTATATGCCCTGGGG-
GAAAGTAT
TGAATCGATCAGGCAGAAATGCCCGGTGGATGATGCAGACGCATCATCTCCCCCCAAAACTGTCCCGTGCCTTT-
GCCGTTACGC
TATGACTCCAGAACGCGTCACCCGGCTTCGCATGAACCACGTCACAAGCATAATTGTGTGTTCTTCGTTTCCCC-
TCCCAAAGTAC
AAAATAGAAGGAGTGCAAAAAGTCAAATGCTCTAAGGTAATGCTATTTGACCACAACGTGCCATCGCGCGTAAG-
TCCAAGGG
CTTATAGAGGTGCCGCTGCCGGTAACCTTGCGGCCGTGTCTGATTGGGTAATGAGCACCGTACCTGTCGCGCCG-
CCCAGAAGA
AGGCGAGGGAGAAACCTGACTGTGACATGTGACGAGAGAGAAGGGAATATAACACCCATGGCTAGCGTCCGATT-
CTTTAGG
GCAGAGCTGTGTCCGGTCGTACAAGAAACAGCGGAGACGCGTGACACAGCAATGTCTCTTCAGGCACCACCGAG-
TACCGCCA
CGGAACCGAATCATCCGCCGATCTCCTTCGGAGCATCAAGCGAGACGTTCCCCATTACATTTGGGGACTTCAAC-
GAAGGAGAA
ATCGAAAGCTTGTCTTCTGAGCTACTAACTTTCGGAGACTTCTTACCAGGAGAAGTGGATGACTTGACAGACAG-
CGACTGGTC
CACGTGCTCAGACACGGACGACGAGTTAAGACTAGACAGGGCAGGTGGGTATATATTCTCGTCGGACACCGGTC-
CAGGTCAT
TTACAACAGAAGTCAGTACGCCAGTCAGTGCTGCCGGTGAACACCCTGGAGGAAGTCCACGAGGAGAAGTGTTA-
CCCACCTA
AGCTGGATGAAGCAAAGGAGCAACTATTACTTAAGAAACTCCAGGAGAGTGCATCCATGGCCAACAGAAGCAGG-
TATCAGTC
GCGCAAAGTAGAAAACATGAAAGCAGCAATCATCCAGAGACTAAAGAGAGGCTGTAGACTATACTTAATGTCAG-
AGACCCCA
AAAGTCCCTACTTACCGGACTACATATCCGGCGCCTGTGTACTCGCCTCCGATCAACGTCCGATTGTCCAATCC-
CGAGTCCGCA
GTGGCAGCATGCAATGAGTTCTTAGCTAGAAACTATCCAACTGTCTCATCATACCAAATTACCGACGAGTATGA-
TGCATATCTA
GACATGGTGGACGGGTCGGAGAGTTGCCTGGACCGAGCGACATTCAATCCGTCAAAACTCAGGAGCTACCCGAA-
ACAGCACG
CTTACCACGCGCCCTCCATCAGAAGCGCTGTACCGTCCCCATTCCAGAACACACTACAGAATGTACTGGCAGCA-
GCCACGAAAA
GAAACTGCAACGTCACACAGATGAGGGAATTACCCACTTTGGACTCAGCAGTATTCAACGTGGAGTGTTTCAAA-
AAATTCGCA
TGCAACCAAGAATACTGGGAAGAATTTGCTGCCAGCCCTATTAGGATAACAACTGAGAATTTAGCAACCTATGT-
TACTAAACTA
AAAGGGCCAAAAGCAGCAGCGCTATTCGCAAAAACCCATAATCTACTGCCACTACAGGAAGTACCAATGGATAG-
GTTCACAGT
AGATATGAAAAGGGACGTAAAGGTGACTCCTGGTACAAAGCATACAGAGGAAAGACCTAAGGTGCAGGTTATAC-
AGGCGGC
TGAACCCTTGGCGACAGCATACCTATGTGGGATTCACAGAGAGCTGGTTAGGAGGCTGAACGCCGTCCTCCTAC-
CCAATGTAC
ATACACTATTTGACATGTCTGCCGAGGATTTCGATGCCATCATAGCCGCACACTTTAAGCCAGGAGACACTGTT-
TTGGAAACGG
ACATAGCCTCCTTTGATAAGAGCCAAGATGATTCACTTGCGCTTACTGCTTTGATGCTGTTAGAGGATTTAGGG-
GTGGATCACT
CCCTGCTGGACTTGATAGAGGCTGCTTTCGGAGAGATTTCCAGCTGTCACCTACCGACAGGTACGCGCTTCAAG-
TTCGGCGCC
ATGATGAAATCAGGTATGTTCCTAACTCTGTTCGTCAACACATTGTTAAACATCACCATCGCCAGCCGAGTGCT-
GGAAGATCGT
CTGACAAAATCCGCGTGCGCGGCCTTCATCGGCGACGACAACATAATACATGGAGTCGTCTCCGATGAATTGAT-
GGCAGCCAG
ATGTGCCACTTGGATGAACATGGAAGTGAAGATCATAGATGCAGTTGTATCCTTGAAAGCCCCTTACTTTTGTG-
GAGGGTTTAT
ACTGCACGATACTGTGACAGGAACAGCTTGCAGAGTGGCAGACCCGCTAAAAAGGCTTTTTAAACTGGGCAAAC-
CGCTAGCG
GCAGGTGACGAACAAGATGAAGATAGAAGACGAGCGCTGGCTGACGAAGTGATCAGATGGCAACGAACAGGGCT-
AATTGAT
GAGCTGGAGAAAGCGGTATACTCTAGGTACGAAGTGCAGGGTATATCAGTTGTGGTAATGTCCATGGCCACCTT-
TGCAAGCTC
CAGATCCAACTTCGAGAAGCTCAGAGGACCCGTCATAACTTTGTACGGCGGTCCTAAATAGGTACGCACTACAG-
CTACCTATTT
TGCAGAAGCCGACAGCAAGTATCTAAACACTAATCAGCTACAATGGAGTTCATCCCAACCCAAACTTTTTACAA-
TAGGAGGTAC
CAGCCTCGACCCTGGACTCCGCGCCCTACTATCCAAGTCATCAGGCCCAGACCGCGCCCTCAGAGGCAAGCTGG-
GCAACTTGC
CCAGCTGATCTCAGCAGTTAATAAACTGACAATGCGCGCGGTACCACAACAGAAGCCACGCAGGAATCGGAAGA-
ATAAGAAG
CAAAAGCAAAAACAACAGGCGCCACAAAACAACACAAATCAAAAGAAGCAGCCACCTAAAAAGAAACCGGCTCA-
AAAGAAAA
AGAAGCCGGGCCGCAGAGAGAGGATGTGCATGAAAATCGAAAATGATTGTATTTTCGAAGTCAAGCACGAAGGT-
AAGGTAA
CAGGTTACGCGTGCCTGGTGGGGGACAAAGTAATGAAACCAGCACACGTAAAGGGGACCATCGATAACGCGGAC-
CTGGCCA
AACTGGCCTTTAAGCGGTCATCTAAGTATGACCTTGAATGCGCGCAGATACCCGTGCACATGAAGTCCGACGCT-
TCGAAGTTC
ACCCATGAGAAACCGGAGGGGTACTACAACTGGCACCACGGAGCAGTACAGTACTCAGGAGGCCGGTTCACCAT-
CCCTACAG
GTGCTGGCAAACCAGGGGACAGCGGCAGACCGATCTTCGACAACAAGGGACGCGTGGTGGCCATAGTCTTAGGA-
GGAGCTA
ATGAAGGAGCCCGTACAGCCCTCTCGGTGGTGACCTGGAATAAAGACATTGTCACTAAAATCACCCCCGAGGGG-
GCCGAAGA
GTGGAGTCTTGCCATCCCAGTTATGTGCCTGTTGGCAAACACCACGTTCCCCTGCTCCCAGCCCCCTTGCACGC-
CCTGCTGCTAC
GAAAAGGAACCGGAGGAAACCCTACGCATGCTTGAGGACAACGTCATGAGACCTGGGTACTATCAGCTGCTACA-
AGCATCCTT
AACATGTTCTCCCCACCGCCAGCGACGCAGCACCAAGGACAACTTCAATGTCTATAAAGCCACAAGACCATACT-
TAGCTCACTG
TCCCGACTGTGGAGAAGGGCACTCGTGCCATAGTCCCGTAGCACTAGAACGCATCAGAAATGAAGCGACAGACG-
GGACGCTG
AAAATCCAGGTCTCCTTGCAAATCGGAATAAAGACGGATGACAGCCACGATTGGACCAAGCTGCGTTATATGGA-
CAACCACAT
GCCAGCAGACGCAGAGAGGGCGGGGCTATTTGTAAGAACATCAGCACCGTGTACGATTACTGGAACAATGGGAC-
ACTTCATC
CTGGCCCGATGTCCAAAAGGGGAAACTCTGACGGTGGGATTCACTGACAGTAGGAAGATTAGTCACTCATGTAC-
GCACCCATT
TCACCACGACCCTCCTGTGATAGGTCGGGAAAAATTCCATTCCCGACCGCAGCACGGTAAAGAGCTACCTTGCA-
GCACGTACG
TGCAGAGCACCGCCGCAACTACCGAGGAGATAGAGGTACACATGCCCCCAGACACCCCTGATCGCACATTAATG-
TCACAACAG
TCCGGCAACGTAAAGATCACAGTCAATGGCCAGACGGTGCGGTACAAGTGTAATTGCGGTGGCTCAAATGAAGG-
ACTAACAA
CTACAGACAAAGTGATTAATAACTGCAAGGTTGATCAATGTCATGCCGCGGTCACCAATCACAAAAAGTGGCAG-
TATAACTCC
CCTCTGGTCCCGCGTAATGCTGAACTTGGGGACCGAAAAGGAAAAATTCACATCCCGTTTCCGCTGGCAAATGT-
AACATGCAG
GGTGCCTAAAGCAAGGAACCCCACCGTGACGTACGGGAAAAACCAAGTCATCATGCTACTGTATCCTGACCACC-
CAACACTCC
TGTCCTACCGGAATATGGGAGAAGAACCAAACTATCAAGAAGAGTGGGTGATGCATAAGAAGGAAGTCGTGCTA-
ACCGTGCC
GACTGAAGGGCTCGAGGTCACGTGGGGCAACAACGAGCCGTATAAGTATTGGCCGCAGTTATCTACAAACGGTA-
CAGCCCAT
GGCCACCCGCATGAGATAATTCTGTATTATTATGAGCTGTACCCCACTATGACTGTAGTAGTTGTGTCAGTGGC-
CACGTTCATA
CTCCTGTCGATGGTGGGTATGGCAGCGGGGATGTGCATGTGTGCACGACGCAGATGCATCACACCGTATGAACT-
GACACCAG
GAGCTACCGTCCCTTTCCTGCTTAGCCTAATATGCTGCATCAGAACAGCTAAAGCGGCCACATACCAAGAGGCT-
GCGATATACC
TGTGGAACGAGCAGCAACCTTTGTTTTGGCTACAAGCCCTTATTCCGCTGGCAGCCCTGATTGTTCTATGCAAC-
TGTCTGAGAC
TCTTACCATGCTGCTGTAAAACGTTGGCTTTTTTAGCCGTAATGAGCGTCGGTGCCCACACTGTGAGCGCGTAC-
GAACACGTAA
CAGTGATCCCGAACACGGTGGGAGTACCGTATAAGACTCTAGTCAATAGACCTGGCTACAGCCCCATGGTATTG-
GAGATGGA
ACTACTGTCAGTCACTTTGGAGCCAACACTATCGCTTGATTACATCACGTGCGAGTACAAAACCGTCATCCCGT-
CTCCGTACGT
GAAGTGCTGCGGTACAGCAGAGTGCAAGGACAAAAACCTACCTGACTACAGCTGTAAGGTCTTCACCGGCGTCT-
ACCCATTTA
TGTGGGGCGGCGCCTACTGCTTCTGCGACGCTGAAAACACGCAGTTGAGCGAAGCACACGTGGAGAAGTCCGAA-
TCATGCAA
AACAGAATTTGCATCAGCATACAGGGCTCATACCGCATCTGCATCAGCTAAGCTCCGCGTCCTTTACCAAGGAA-
ATAACATCAC
TGTAACTGCCTATGCAAACGGCGACCATGCCGTCACAGTTAAGGACGCCAAATTCATTGTGGGGCCAATGTCTT-
CAGCCTGGA
CACCTTTCGACAACAAAATTGTGGTGTACAAAGGTGACGTCTATAACATGGACTACCCGCCCTTTGGCGCAGGA-
AGACCAGGA
CAATTTGGCGATATCCAAAGTCGCACACCTGAGAGTAAAGACGTCTATGCTAATACACAACTGGTACTGCAGAG-
ACCGGCTGT
GGGTACGGTACACGTGCCATACTCTCAGGCACCATCTGGCTTTAAGTATTGGCTAAAAGAACGCGGGGCGTCGC-
TGCAGCACA
CAGCACCATTTGGCTGCCAAATAGCAACAAACCCGGTAAGAGCGGTGAACTGCGCCGTAGGGAACATGCCCATC-
TCCATCGAC
ATACCGGAAGCGGCCTTCACTAGGGTCGTCGACGCGCCCTCTTTAACGGACATGTCGTGCGAGGTACCAGCCTG-
CACCCATTC
CTCAGACTTTGGGGGCGTCGCCATTATTAAATATGCAGCCAGCAAGAAAGGCAAGTGTGCGGTGCATTCGATGA-
CTAACGCCG
TCACTATTCGGGAAGCTGAGATAGAAGTTGAAGGGAATTCTCAGCTGCAAATCTCTTTCTCGACGGCCTTAGCC-
AGCGCCGAA
TTCCGCGTACAAGTCTGTTCTACACAAGTACACTGTGCAGCCGAGTGCCACCCCCCGAAGGACCACATAGTCAA-
CTACCCGGC
GTCACATACCACCCTCGGGGTCCAGGACATCTCCGCTACGGCGATGTCATGGGTGCAGAAGATCACGGGAGGTG-
TGGGACTG
GTTGTTGCTGTTGCCGCACTGATTCTAATCGTGGTGCTATGCGTGTCGTTCAGCAGGCACTAACTTGACAATTA-
AGTATGAAGG
TATATGTGTCCCCTAAGAGACACACTGTACATAGCAAATAATCTATAGATCAAAGGGCTACGCAACCCCTGAAT-
AGTAACAAAA
TACAAAATCACTAAAAATTATAAAAACAGAAAAATACATAAATAGGTATACGTGTCCCCTAAGAGACACATTGT-
ATGTAGGTG
ATAAGTATAGATCAAAGGGCCGAATAACCCCTGAATAGTAACAAAATATGAAAATCAATAAAAATCATAAAATA-
GAAAAACCA
TAAACAGAAGTAGTTCAAAGGGCTATAAAACCCCTGAATAGTAACAAAACATAAAATTAATAAAAATCAAATGA-
ATACCATAA
TTGGCAAACGGAAGAGATGTAGGTACTTAAGCTTCCTAAAAGCAGCCGAACTCACTTTGAGAAGTAGGCATAGC-
ATACCGAAC
TCTTCCACGATTCTCCGAACCCACAGGGACGTAGGAGATGTTATTTTGTTTTTAATATTTCAAAAAAAAAAAAA-
AAAAAAAAAAA ZIKV Sequence H/PF/2013 as sequenced SEQ ID NO: 78
CAGACTGCGACAGTTCGAGTTTGAAGCGAAAGCTAGCAACAGTATCAACAGGTTTTATTTTGGATTTGGAAACG-
AGAGTTTCT
GGTCATGAAAAACCCAAAAAAGAAATCCGGAGGATTCCGGATTGTCAATATGCTAAAACGCGGAGTAGCCCGTG-
TGAGCCCC
TTTGGGGGCTTGAAGAGGCTGCCAGCCGGACTTCTGCTGGGTCATGGGCCCATCAGGATGGTCTTGGCGATTCT-
AGCCTTTTT
GAGATTCACGGCAATCAAGCCATCACTGGGTCTCATCAATAGATGGGGTTCAGTGGGGAAAAAAGAGGCTATGG-
AAATAATA
AAGAAGTTCAAGAAAGATCTGGCTGCCATGCTGAGAATAATCAATGCTAGGAAGGAGAAGAAGAGACGAGGCGC-
AGATACT
AGTGTCGGAATTGTTGGCCTCCTGCTGACCACAGCTATGGCAGCGGAGGTCACTAGACGTGGGAGTGCATACTA-
TATGTACTT
GGACAGAAACGACGCTGGGGAGGCCATATCTTTTCCAACCACATTGGGGATGAATAAGTGTTATATACAGATCA-
TGGATCTTG
GACACATGTGTGATGCCACCATGAGCTATGAATGCCCTATGCTGGATGAGGGGGTGGAACCAGATGACGTCGAT-
TGTTGGTG
CAACACGACGTCAACTTGGGTTGTGTACGGAACCTGCCATCACAAAAAAGGTGAAGCACGGAGATCTAGAAGAG-
CTGTGACG
CTCCCCTCCCATTCCACTAGGAAGCTGCAAACGCGGTCGCAAACCTGGTTGGAATCAAGAGAATACACAAAGCA-
CTTGATTAG
AGTCGAAAATTGGATATTCAGGAACCCTGGCTTCGCGTTAGCAGCAGCTGCCATCGCTTGGCTTTTGGGAAGCT-
CAACGAGCC
AAAAAGTCATATACTTGGTCATGATACTGCTGATTGCCCCGGCATACAGCATCAGGTGCATAGGAGTCAGCAAT-
AGGGACTTT
GTGGAAGGTATGTCAGGTGGGACTTGGGTTGATGTTGTCTTGGAACATGGAGGTTGTGTCACCGTAATGGCACA-
GGACAAAC
CGACTGTCGACATAGAGCTGGTTACAACAACAGTCAGCAACATGGCGGAGGTAAGATCCTACTGCTATGAGGCA-
TCAATATCG
GACATGGCTTCGGACAGCCGCTGCCCAACACAAGGTGAAGCCTACCTTGACAAGCAATCAGACACTCAATATGT-
CTGCAAAAG
AACGTTAGTGGACAGAGGCTGGGGAAATGGATGTGGACTTTTTGGCAAAGGGAGCCTGGTGACATGCGCTAAGT-
TTGCATGC
TCCAAGAAAATGACCGGGAAGAGCATCCAGCCAGAGAATCTGGAGTACCGGATAATGCTGTCAGTTCATGGCTC-
CCAGCACA
GTGGGATGATCGTTAATGACACAGGACATGAAACTGATGAGAATAGAGCGAAGGTTGAGATAACGCCCAATTCA-
CCAAGAGC
CGAAGCCACCCTGGGGGGTTTTGGAAGCCTAGGACTTGATTGTGAACCGAGGACAGGCCTTGACTTTTCAGATT-
TGTATTACT
TGACTATGAATAACAAGCACTGGTTGGTTCACAAGGAGTGGTTCCACGACATTCCATTACCTTGGCACGCTGGG-
GCAGACACC
GGAACTCCACACTGGAACAACAAAGAAGCACTGGTAGAGTTCAAGGACGCACATGCCAAAAGGCAAACTGTCGT-
GGTTCTAG
GGAGTCAAGAAGGAGCAGTTCACACGGCCCTTGCTGGAGCTCTGGAGGCTGAGATGGATGGTGCAAAGGGAAGG-
CTGTCCT
CTGGCCACTTGAAATGTCGCCTGAAAATGGATAAACTTAGATTGAAGGGCGTGTCATACTCCTTGTGTACCGCA-
GCGTTCACAT
TCACCAAGATCCCGGCTGAAACACTGCACGGGACAGTCACAGTGGAGGTACAGTACGCAGGGACAGATGGACCT-
TGCAAGGT
TCCAGCTCAGATGGCGGTGGACATGCAAACTCTGACCCCAGTTGGGAGGTTGATAACCGCTAACCCCGTAATCA-
CTGAAAGCA
CTGAGAACTCTAAGATGATGCTGGAACTTGATCCACCATTTGGGGACTCTTACATTGTCATAGGAGTCGGGGAG-
AAGAAGATC
ACCCACCACTGGCACAGGAGTGGCAGCACCATTGGAAAAGCATTTGAAGCCACTGTGAGAGGTGCCAAGAGAAT-
GGCAGTCT
TGGGAGACACAGCCTGGGACTTTGGATCAGTTGGAGGCGCTCTCAACTCATTGGGCAAGGGCATCCATCAAATT-
TTTGGAGCA
GCTTTCAAATCATTGTTTGGAGGAATGTCCTGGTTCTCACAAATTCTCATTGGAACGTTGCTGATGTGGTTGGG-
TCTGAACACA
AAGAATGGATCTATTTCCCTTATGTGCTTGGCCTTAGGGGGAGTGTTGATCTTCTTATCCACAGCTGTCTCTGC-
TGATGTGGGG
TGCTCGGTGGACTTCTCAAAGAAGGAGACGAGATGCGGTACAGGGGTGTTCGTCTATAACGACGTTGAAGCCTG-
GAGGGACA
GGTACAAGTACCATCCTGACTCCCCCCGTAGATTGGCAGCAGCAGTCAAGCAAGCCTGGGAAGATGGTATCTGT-
GGGATCTCC
TCTGTTTCAAGAATGGAAAACATCATGTGGAGATCAGTAGAAGGGGAGCTCAACGCAATCCTGGAAGAGAATGG-
AGTTCAAC
TGACGGTCGTTGTGGGATCTGTAAAAAACCCCATGTGGAGAGGTCCACAGAGATTGCCCGTGCCTGTGAACGAG-
CTGCCCCAC
GGCTGGAAGGCTTGGGGGAAATCGTACTTCGTCAGAGCAGCAAAGACAAATAACAGCTTTGTCGTGGATGGTGA-
CACACTGA
AGGAATGCCCACTCAAACATAGAGCATGGAACAGCTTTCTTGTGGAGGATCATGGGTTCGGGGTATTTCACACT-
AGTGTCTGG
CTCAAGGTTAGAGAAGATTATTCATTAGAGTGTGATCCAGCCGTTATTGGAACAGCTGTTAAGGGAAAGGAGGC-
TGTACACA
GTGATCTAGGCTACTGGATTGAGAGTGAGAAGAATGACACATGGAGGCTGAAGAGGGCCCATCTGATCGAGATG-
AAAACAT
GTGAATGGCCAAAGTCCCACACATTGTGGACAGATGGAATAGAAGAGAGTGATCTGATCATACCCAAGTCTTTA-
GCTGGGCCA
CTCAGCCATCACAATACCAGAGAGGGCTACAGGACCCAAATGAAAGGGCCATGGCACAGTGAAGAGCTTGAAAT-
TCGGTTTG
AGGAATGCCCAGGCACTAAGGTCCACGTGGAGGAAACATGTGGAACAAGAGGACCATCTCTGAGATCAACCACT-
GCAAGCGG
AAGGGTGATCGAGGAATGGTGCTGCAGGGAGTGCACAATGCCCCCACTGTCGTTCCGGGCTAAAGATGGCTGTT-
GGTATGGA
ATGGAGATAAGGCCCAGGAAAGAACCAGAAAGTAACTTAGTAAGGTCAATGGTGACTGCAGGATCAACTGATCA-
CATGGATC
ACTTCTCCCTTGGAGTGCTTGTGATTCTGCTCATGGTGCAGGAAGGGCTGAAGAAGAGAATGACCACAAAGATC-
ATCATAAGC
ACATCGATGGCAGTGCTGGTAGCTATGATCCTGGGAGGATTTTCAATGAGTGACCTGGCTAAGCTTGCAATTTT-
GATGGGTGC
CACCTTCGCGGAAATGAACACTGGAGGAGATGTAGCTCATCTGGCGCTGATAGCGGCATTCAAAGTCAGACCAG-
CGTTGCTG
GTATCTTTCATCTTCAGAGCTAATTGGACACCCCGTGAAAGCATGCTGCTGGCCTTGGCCTCGTGTCTTTTGCA-
AACTGCGATCT
CCGCCTTGGAAGGCGACCTGATGGTTCTCATCAATGGTTTTGCTTTGGCCTGGTTGGCAATACGAGCGATGGTT-
GTTCCACGCA
CTGATAACATCACCTTGGCAATCCTGGCTGCTCTGACACCACTGGCCCGGGGCACACTGCTTGTGGCGTGGAGA-
GCAGGCCTT
GCTACTTGCGGGGGGTTTATGCTCCTCTCTCTGAAGGGAAAAGGCAGTGTGAAGAAGAACTTACCATTTGTCAT-
GGCCCTGGG
ACTAACCGCTGTGAGGCTGGTCGACCCCATCAACGTGGTGGGACTGCTGTTGCTCACAAGGAGTGGGAAGCGGA-
GCTGGCCC
CCTAGCGAAGTACTCACAGCTGTTGGCCTGATATGCGCATTGGCTGGAGGGTTCGCCAAGGCAGATATAGAGAT-
GGCTGGGC
CCATGGCCGCGGTCGGTCTGCTAATTGTCAGTTACGTGGTCTCAGGAAAGAGTGTGGACATGTACATTGAAAGA-
GCAGGTGA
CATCACATGGGAAAAAGATGCGGAAGTCACTGGAAACAGTCCCCGGCTCGATGTGGCGCTAGATGAGAGTGGTG-
ATTTCTCC
CTGGTGGAGGATGACGGTCCCCCCATGAGAGAGATCATACTCAAGGTGGTCCTGATGACCATCTGTGGCATGAA-
CCCAATAGC
CATACCCTTTGCAGCTGGAGCGTGGTACGTATACGTGAAGACTGGAAAAAGGAGTGGTGCTCTATGGGATGTGC-
CTGCTCCCA
AGGAAGTAAAAAAGGGGGAGACCACAGATGGAGTGTACAGAGTAATGACTCGTAGACTGCTAGGTTCAACACAA-
GTTGGAG
TGGGAGTTATGCAAGAGGGGGTCTTTCACACTATGTGGCACGTCACAAAAGGATCCGCGCTGAGAAGCGGTGAA-
GGGAGAC
TTGATCCATACTGGGGAGATGTCAAGCAGGATCTGGTGTCATACTGTGGTCCATGGAAGCTAGATGCCGCCTGG-
GACGGGCA
CAGCGAGGTGCAGCTCTTGGCCGTGCCCCCCGGAGAGAGAGCGAGGAACATCCAGACTCTGCCCGGAATATTTA-
AGACAAAG
GATGGGGACATTGGAGCGGTTGCGCTGGATTACCCAGCAGGAACTTCAGGATCTCCAATCCTAGACAAGTGTGG-
GAGAGTGA
TAGGACTTTATGGCAATGGGGTCGTGATCAAAAATGGGAGTTATGTTAGTGCCATCACCCAAGGGAGGAGGGAG-
GAAGAGA
CTCCTGTTGAGTGCTTCGAGCCTTCGATGCTGAAGAAGAAGCAGCTAACTGTCTTAGACTTGCATCCTGGAGCT-
GGGAAAACC
AGGAGAGTTCTTCCTGAAATAGTCCGTGAAGCCATAAAAACAAGACTCCGTACTGTGATCTTAGCTCCAACCAG-
GGTTGTCGCT
GCTGAAATGGAGGAAGCCCTTAGAGGGCTTCCAGTGCGTTATATGACAACAGCAGTCAATGTCACCCACTCTGG-
AACAGAAAT
CGTCGACTTAATGTGCCATGCCACCTTCACTTCACGTCTACTACAGCCAATCAGAGTCCCCAACTATAATCTGT-
ATATTATGGAT
GAGGCCCACTTCACAGATCCCTCAAGTATAGCAGCAAGAGGATACATTTCAACAAGGGTTGAGATGGGCGAGGC-
GGCTGCCA
TCTTCATGACCGCCACGCCACCAGGAACCCGTGACGCATTTCCGGACTCCAACTCACCAATTATGGACACCGAA-
GTGGAAGTCC
CAGAGAGAGCCTGGAGCTCAGGCTTTGATTGGGTGACGGATCATTCTGGAAAAACAGTTTGGTTTGTTCCAAGC-
GTGAGGAA
CGGCAATGAGATCGCAGCTTGTCTGACAAAGGCTGGAAAACGGGTCATACAGCTCAGCAGAAAGACTTTTGAGA-
CAGAGTTC
CAGAAAACAAAACATCAAGAGTGGGACTTTGTCGTGACAACTGACATTTCAGAGATGGGCGCCAACTTTAAAGC-
TGACCGTGT
CATAGATTCCAGGAGATGCCTAAAGCCGGTCATACTTGATGGCGAGAGAGTCATTCTGGCTGGACCCATGCCTG-
TCACACATG
CCAGCGCTGCCCAGAGGAGGGGGCGCATAGGCAGGAATCCCAACAAACCTGGAGATGAGTATCTGTATGGAGGT-
GGGTGCG
CAGAGACTGACGAAGACCATGCACACTGGCTTGAAGCAAGAATGCTCCTTGACAATATTTACCTCCAAGATGGC-
CTCATAGCC
TCGCTCTATCGACCTGAGGCCGACAAAGTAGCAGCCATTGAGGGAGAGTTCAAGCTTAGGACGGAGCAAAGGAA-
GACCTTTG
TGGAACTCATGAAAAGAGGAGATCTTCCTGTTTGGCTGGCCTATCAGGTTGCATCTGCCGGAATAACCTACACA-
GATAGAAGA
TGGTGCTTTGATGGCACGACCAACAACACCATAATGGAAGACAGTGTGCCGGCAGAGGTGTGGACCAGACACGG-
AGAGAAA
AGAGTGCTCAAACCGAGGTGGATGGACGCCAGAGTTTGTTCAGATCATGCGGCCCTGAAGTCATTCAAGGAGTT-
TGCCGCTG
GGAAAAGAGGAGCGGCTTTTGGAGTGATGGAAGCCCTGGGAACACTGCCAGGACACATGACAGAGAGATTCCAG-
GAAGCCA
TTGACAACCTCGCTGTGCTCATGCGGGCAGAGACTGGAAGCAGGCCTTACAAAGCCGCGGCGGCCCAATTGCCG-
GAGACCCT
AGAGACCATTATGCTTTTGGGGTTGCTGGGAACAGTCTCGCTGGGAATCTTTTTCGTCTTGATGAGGAACAAGG-
GCATAGGGA
AGATGGGCTTTGGAATGGTGACTCTTGGGGCCAGCGCATGGCTCATGTGGCTCTCGGAAATTGAGCCAGCCAGA-
ATTGCATGT
GTCCTCATTGTTGTGTTCCTATTGCTGGTGGTGCTCATACCTGAGCCAGAAAAGCAAAGATCTCCCCAGGACAA-
CCAAATGGCA
ATCATCATCATGGTAGCAGTAGGTCTTCTGGGCTTGATTACCGCCAATGAACTCGGATGGTTGGAGAGAACAAA-
GAGTGACCT
AAGCCATCTAATGGGAAGGAGAGAGGAGGGGGCAACCATAGGATTCTCAATGGACATTGACCTGCGGCCAGCCT-
CAGCTTGG
GCCATCTATGCTGCCTTGACAACTTTCATTACCCCAGCCGTCCAACATGCAGTGACCACTTCATACAACAACTA-
CTCCTTAATGG
CGATGGCCACGCAAGCTGGAGTGTTGTTTGGTATGGGCAAAGGGATGCCATTCTACGCATGGGACTTTGGAGTC-
CCGCTGCTA
ATGATAGGTTGCTACTCACAATTAACACCCCTGACCCTAATAGTGGCCATCATTTTGCTCGTGGCGCACTACAT-
GTACTTGATCC
CAGGGCTGCAGGCAGCAGCTGCGCGTGCTGCCCAGAAGAGAACGGCAGCTGGCATCATGAAGAACCCTGTTGTG-
GATGGAA
TAGTGGTGACTGACATTGACACAATGACAATTGACCCCCAAGTGGAGAAAAAGATGGGACAGGTGCTACTCATA-
GCAGTAGC
CGTCTCCAGCGCCATACTGTCGCGGACCGCCTGGGGGTGGGGGGAGGCTGGGGCCCTGATCACAGCGGCAACTT-
CCACTTTG
TGGGAAGGCTCTCCGAACAAGTACTGGAACTCCTCTACAGCCACTTCACTGTGTAACATTTTTAGGGGAAGTTA-
CTTGGCTGGA
GCTTCTCTAATCTACACAGTAACAAGAAACGCTGGCTTGGTCAAGAGACGTGGGGGTGGAACAGGAGAGACCCT-
GGGAGAG
AAATGGAAGGCCCGCTTGAACCAGATGTCGGCCCTGGAGTTCTACTCCTACAAAAAGTCAGGCATCACCGAGGT-
GTGCAGAG
AAGAGGCCCGCCGCGCCCTCAAGGACGGTGTGGCAACGGGAGGCCATGCTGTGTCCCGAGGAAGTGCAAAGCTG-
AGATGGT
TGGTGGAGCGGGGATACCTGCAGCCCTATGGAAAGGTCATTGATCTTGGATGTGGCAGAGGGGGCTGGAGTTAC-
TACGCCGC
CACCATCCGCAAAGTTCAAGAAGTGAAAGGATACACAAAAGGAGGCCCTGGTCATGAAGAACCCATGTTGGTGC-
AAAGCTAT
GGGTGGAACATAGTCCGTCTTAAGAGTGGGGTGGACGTCTTTCATATGGCGGCTGAGCCGTGTGACACGTTGCT-
GTGTGACA
TAGGTGAGTCATCATCTAGTCCTGAAGTGGAAGAAGCACGGACGCTCAGAGTCCTCTCCATGGTGGGGGATTGG-
CTTGAAAA
AAGACCAGGAGCCTTTTGTATAAAAGTGTTGTGCCCATACACCAGCACTATGATGGAAACCCTGGAGCGACTGC-
AGCGTAGGT
ATGGGGGAGGACTGGTCAGAGTGCCACTCTCCCGCAACTCTACACATGAGATGTACTGGGTCTCTGGAGCGAAA-
AGCAACAC
CATAAAAAGTGTGTCCACCACGAGCCAGCTCCTCTTGGGGCGCATGGACGGGCCCAGGAGGCCAGTGAAATATG-
AGGAGGAT
GTGAATCTCGGCTCTGGCACGCGGGCTGTGGTAAGCTGCGCTGAAGCTCCCAACATGAAGATCATTGGTAACCG-
CATTGAAAG
GATCCGCAGTGAGCACGCGGAAACGTGGTTCTTTGACGAGAACCACCCATATAGGACATGGGCTTACCATGGAA-
GCTATGAG
GCCCCCACACAAGGGTCAGCGTCCTCTCTAATAAACGGGGTTGTCAGGCTCCTGTCAAAACCCTGGGATGTGGT-
GACTGGAGT
CACAGGAATAGCCATGACCGACACCACACCGTATGGTCAGCAAAGAGTTTTCAAGGAAAAAGTGGACACTAGGG-
TGCCAGAC
CCCCAAGAAGGCACTCGTCAGGTTATGAGCATGGTCTCTTCCTGGTTGTGGAAAGAGCTAGGCAAACACAAACG-
GCCACGAG
TCTGTACCAAAGAAGAGTTCATCAACAAGGTTCGTAGCAATGCAGCATTAGGGGCAATATTTGAAGAGGAAAAA-
GAGTGGAA
GACTGCAGTGGAAGCTGTGAACGATCCAAGGTTCTGGGCTCTAGTGGACAAGGAAAGAGAGCACCACCTGAGAG-
GAGAGTG
CCAGAGTTGTGTGTACAACATGATGGGAAAAAGAGAAAAGAAACAAGGGGAATTTGGAAAGGCCAAGGGCAGCC-
GCGCCAT
CTGGTATATGTGGCTAGGGGCTAGATTTCTAGAGTTCGAAGCCCTTGGATTCTTGAACGAGGATCACTGGATGG-
GGAGAGAG
AACTCAGGAGGTGGTGTTGAAGGGCTGGGATTACAAAGACTCGGATATGTCCTAGAAGAGATGAGTCGCATACC-
AGGAGGA
AGGATGTATGCAGATGACACTGCTGGCTGGGACACCCGCATCAGCAGGTTTGATCTGGAGAATGAAGCTCTAAT-
CACCAACCA
AATGGAGAAAGGGCACAGGGCCTTGGCATTGGCCATAATCAAGTACACATACCAAAACAAAGTGGTAAAGGTCC-
TTAGACCA
GCTGAAAAAGGGAAGACAGTTATGGACATTATTTCGAGACAAGACCAAAGGGGGAGCGGACAAGTTGTCACTTA-
CGCTCTTA
ACACATTTACCAACCTAGTGGTGCAACTCATTCGGAATATGGAGGCTGAGGAAGTTCTAGAGATGCAAGACTTG-
TGGCTGCTG
CGGAGGTCAGAGAAAGTGACCAACTGGTTGCAGAGCAACGGATGGGATAGGCTCAAACGAATGGCAGTCAGTGG-
AGATGAT
TGCGTTGTGAAGCCAATTGATGATAGGTTTGCACATGCCCTCAGGTTCTTGAATGATATGGGAAAAGTTAGGAA-
GGACACACA
AGAGTGGAAACCCTCAACTGGATGGGACAACTGGGAAGAAGTTCCGTTTTGCTCCCACCACTTCAACAAGCTCC-
ATCTCAAGG
ACGGGAGGTCCATTGTGGTTCCCTGCCGCCACCAAGATGAACTGATTGGCCGGGCCCGCGTCTCTCCAGGGGCG-
GGATGGAG
CATCCGGGAGACTGCTTGCCTAGCAAAATCATATGCGCAAATGTGGCAGCTCCTTTATTTCCACAGAAGGGACC-
TCCGACTGAT
GGCCAATGCCATTTGTTCATCTGTGCCAGTTGACTGGGTTCCAACTGGGAGAACTACCTGGTCAATCCATGGAA-
AGGGAGAAT
GGATGACCACTGAAGACATGCTTGTGGTGTGGAACAGAGTGTGGATTGAGGAGAACGACCACATGGAAGACAAG-
ACCCCAG
TTACGAAATGGACAGACATTCCCTATTTGGGAAAAAGGGAAGACTTGTGGTGTGGATCTCTCATAGGGCACAGA-
CCGCGCACC
ACCTGGGCTGAGAACATTAAAAACACAGTCAACATGGTGCGCAGGATCATAGGTGATGAAGAAAAGTACATGGA-
CTACCTAT
CCACCCAAGTTCGCTACTTGGGTGAAGAAGGGTCTACACCTGGAGTGCTGTAAGCACCAATCTTAGTGTTGTCA-
GGCCTGCTA
GTCAGCCACAGCTTGGGGAAAGCTGTGCAGCCTGTGACCCCCCCAGGAGAAGCTGGGAAACCAAGCCTATAGTC-
AGGCCGAG
AACGCCATGGCACGGAAGAAGCCATGCTGCCTGTGAGCCCCTCAGAGGACACTGAGTCAAAAAACCCCACGCGC-
TTGGAGGC
GCAGGATGGGAAAAGAAGGTGGCGACCTTCCCCACCCTTCAATCTGGGGCCTGAACTGGAGATCAGCTGTGGAT-
CTCCAGAA
GAGGGACTAGTGGTTAGAGGAGACCCCCCGGAAAACGCAAAACAGCATATTGACGCTGGGAAAGACCAGAGACT-
CCATGAG TTTCCACCACGCTGGCCGCCAGGCACAGATCGCCGAATAGCGGCGGCCGGTGTGGGG
AHZ13508.1, Zika virus polyprotein from Polynesian outbreak
(H/PF/2013) SEQ ID NO: 79
MKNPKKKSGGFRIVNMLKRGVARVSPFGGLKRLPAGLLLGHGPIRMVLAILAFLRFTAIKPSLGLINRWGSVGK-
KEAMEIIKKFKKDL
AAMLRIINARKEKKRRGADTSVGIVGLLLTTAMAAEVTRRGSAYYMYLDRNDAGEAISFPTTLGMNKCYIQIMD-
LGHMCDATMSYE
CPMLDEGVEPDDVDCWCNTTSTWVVYGTCHHKKGEARRSRRAVTLPSHSTRKLQTRSQTWLESREYTKHLIRVE-
NWIFRNPGFAL
AAAAIAWLLGSSTSQKVIYLVMILLIAPAYSIRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDI-
ELVTTTVSNMAE
VRSYCYEASISDMASDSRCPTQGEAYLDKQSDTQYVCKRTLVDRGWGNGCGLFGKGSLVTCAKFACSKKMTGKS-
IQPENLEYRIML
SVHGSQHSGMIVNDTGHETDENRAKVEITPNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHK-
EWFHDIPLPWH
AGADTGTPHWNNKEALVEFKDAHAKRQTVVVLGSQEGAVHTALAGALEAEMDGAKGRLSSGHLKCRLKMDKLRL-
KGVSYSLCTA
AFTFTKIPAETLHGTVTVEVQYAGTDGPCKVPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGD-
SYIVIGVGEKKIT
HHWHRSGSTIGKAFEATVRGAKRMAVLGDTAWDFGSVGGALNSLGKGIHQIFGAAFKSLFGGMSWFSQILIGTL-
LMWLGLNTKN
GSISLMCLALGGVLIFLSTAVSADVGCSVDFSKKETRCGTGVFVYNDVEAWRDRYKYHPDSPRRLAAAVKQAWE-
DGICGISSVSRME
NIMWRSVEGELNAILEENGVQLTVVVGSVKNPMWRGPQRLPVPVNELPHGWKAWGKSYFVRAAKTNNSFVVDGD-
TLKECPLKH
RAWNSFLVEDHGFGVFHTSVWLKVREDYSLECDPAVIGTAVKGKEAVHSDLGYWIESEKNDTWRLKRAHLIEMK-
TCEWPKSHTLW
TDGIEESDLIIPKSLAGPLSHHNTREGYRTQMKGPWHSEELEIRFEECPGTKVHVEETCGTRGPSLRSTTASGR-
VIEEWCCRECTMPPL
SFRAKDGCWYGMEIRPRKEPESNLVRSMVTAGSTDHMDHFSLGVLVILLMVQEGLKKRMTTKIIISTSMAVLVA-
MILGGFSMSDLA
KLAILMGATFAEMNTGGDVAHLALIAAFKVRPALLVSFIFRANWTPRESMLLALASCLLQTAISALEGDLMVLI-
NGFALAWLAIRAM
VVPRTDNITLAILAALTPLARGTLLVAWRAGLATCGGFMLLSLKGKGSVKKNLPFVMALGLTAVRLVDPINVVG-
LLLLTRSGKRSWPP
SEVLTAVGLICALAGGFAKADIEMAGPMAAVGLLIVSYVVSGKSVDMYIERAGDITWEKDAEVTGNSPRLDVAL-
DESGDFSLVEDD
GPPMREIILKVVLMTICGMNPIAIPFAAGAWYVYVKTGKRSGALWDVPAPKEVKKGETTDGVYRVMTRRLLGST-
QVGVGVMQEG
VFHTMWHVTKGSALRSGEGRLDPYWGDVKQDLVSYCGPWKLDAAWDGHSEVQLLAVPPGERARNIQTLPGIFKT-
KDGDIGAVAL
DYPAGTSGSPILDKCGRVIGLYGNGVVIKNGSYVSAITQGRREEETPVECFEPSMLKKKQLTVLDLHPGAGKTR-
RVLPEIVREAIKTRLR
TVILAPTRVVAAEMEEALRGLPVRYMTTAVNVTHSGTEIVDLMCHATFTSRLLQPIRVPNYNLYIMDEAHFTDP-
SSIAARGYISTRVE
MGEAAAIFMTATPPGTRDAFPDSNSPIMDTEVEVPERAWSSGFDWVTDHSGKTVWFVPSVRNGNEIAACLTKAG-
KRVIQLSRKTF
ETEFQKTKHQEWDFVVTTDISEMGANFKADRVIDSRRCLKPVILDGERVILAGPMPVTHASAAQRRGRIGRNPN-
KPGDEYLYGGGC
AETDEDHAHWLEARMLLDNIYLQDGLIASLYRPEADKVAAIEGEFKLRTEQRKTFVELMKRGDLPVWLAYQVAS-
AGITYTDRRWCF
DGTTNNTIMEDSVPAEVWTRHGEKRVLKPRWMDARVCSDHAALKSFKEFAAGKRGAAFGVMEALGTLPGHMTER-
FQEAIDNLA
VLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPA-
RIACVLIVVFLLLV
VLIPEPEKQRSPQDNQMAIIIMVAVGLLGLITANELGWLERTKSDLSHLMGRREEGATIGFSMDIDLRPASAWA-
IYAALTTFITPAVQ
HAVTTSYNNYSLMAMATQAGVLFGMGKGMPFYAWDFGVPLLMIGCYSQLTPLTLIVAIILLVAHYMYLIPGLQA-
AAARAAQKRTA
AGIMKNPVVDGIVVTDIDTMTIDPQVEKKMGQVLLIAVAVSSAILSRTAWGWGEAGALITAATSTLWEGSPNKY-
WNSSTATSLCNI
FRGSYLAGASLIYTVTRNAGLVKRRGGGTGETLGEKWKARLNQMSALEFYSYKKSGITEVCREEARRALKDGVA-
TGGHAVSRGSAKL
RWLVERGYLQPYGKVIDLGCGRGGWSYYAATIRKVQEVKGYTKGGPGHEEPMLVQSYGWNIVRLKSGVDVFHMA-
AEPCDTLLCDI
GESSSSPEVEEARTLRVLSMVGDWLEKRPGAFCIKVLCPYTSTMMETLERLQRRYGGGLVRVPLSRNSTHEMYW-
VSGAKSNTIKSVS
TTSQLLLGRMDGPRRPVKYEEDVNLGSGTRAVVSCAEAPNMKIIGNRIERIRSEHAETWFFDENHPYRTWAYHG-
SYEAPTQGSASS
LINGVVRLLSKPWDVVTGVTGIAMTDTTPYGQQRVFKEKVDTRVPDPQEGTRQVMSMVSSWLWKELGKHKRPRV-
CTKEEFINKV
RSNAALGAIFEEEKEWKTAVEAVNDPRFWALVDKEREHHLRGECQSCVYNMMGKREKKQGEFGKAKGSRAIWYM-
WLGARFLEF
EALGFLNEDHWMGRENSGGGVEGLGLQRLGYVLEEMSRIPGGRMYADDTAGWDTRISRFDLENEALITNQMEKG-
HRALALAIIKY
TYQNKVVKVLRPAEKGKTVMDIISRQDQRGSGQVVTYALNTFTNLVVQLIRNMEAEEVLEMQDLWLLRRSEKVT-
NWLQSNGWDR
LKRMAVSGDDCVVKPIDDRFAHALRFLNDMGKVRKDTQEWKPSTGWDNWEEVPFCSHHFNKLHLKDGRSIVVPC-
RHQDELIGRA
RVSPGAGWSIRETACLAKSYAQMWQLLYFHRRDLRLMANAICSSVPVDWVPTGRTTWSIHGKGEWMTTEDMLVV-
WNRVWIEE
NDHMEDKTPVTKWTDIPYLGKREDLWCGSLIGHRPRTTWAENIKNTVNMVRRIIGDEEKYMDYLSTQVRYLGEE-
GSTPGVL 9320_Zika_PF_1F SEQ ID NO: 80 ttaggatccGTTGTTGATCTGTGTGAAT
9321_Zika_PF_1R
SEQ ID NO: 81 taactcgagCGTACACAACCCAAGTT 9322_Zika_PF_2F SEQ ID NO:
82 ttaggatccTCACTAGACGTGGGAGTG 9323_Zika_PF_2R SEQ ID NO: 83
taactcgagAAGCCATGTCYGATATTGAT 9324_Zika_PF_3F SEQ ID NO: 84
ttaggatccGCATACAGCATCAGGTG 9325_Zika_PF_3R SEQ ID NO: 85
taactcgagTGTGGAGTTCCGGTGTCT 9326_Zika_PF_4F SEQ ID NO: 86
ttaggatccGAATAGAGCGAARGTTGAGATA 9327_Zika_PF_4R SEQ ID NO: 87
taactcgAGTGGTGGGTGATCTTCTTCT 9328_Zika_PF_5F SEQ ID NO: 88
ttaggatcCAGTCACAGTGGAGGTACAGTAC 9329_Zika_PF_5R SEQ ID NO: 89
taactcgagCRCAGATACCATCTTCCC 9330_Zika_PF_6F SEQ ID NO: 90
ttaggatCCCTTATGTGCTTGGCCTTAG 9331_Zika_PF_6R SEQ ID NO: 91
taactcgagTCTTCAGCCTCCATGTG 9332_Zika_PF_7F SEQ ID NO: 92
ttaggatccAATGCCCACTCAAACATAGA 9333_Zika_PF_7R SEQ ID NO: 93
taactcgagTCATTCTCTTCTTCAGCCCTT 9334_Zika_PF_8F SEQ ID NO: 94
ttaggatccAAGGGTGATCGAGGAAT 9335_Zika_PF_8R SEQ ID NO: 95
taactcgagTTCCCTTCAGAGAGAGGAGC 9336_Zika_PF_9F SEQ ID NO: 96
ttaggatccTCTTTTGCAAACTGCGATC 9337_Zika_PF_9R SEQ ID NO: 97
taactcgagTCCAGCTGCAAAGGGTAT 9338_Zika_PF_10F SEQ ID NO: 98
ttaggatccGTGTGGACATGTACATTGA 9339_Zika_PF_10R SEQ ID NO: 99
taactcgagCCCATTGCCATAAAGTC 9340_Zika_PF_11F SEQ ID NO: 100
ttaggatccTCATACTGTGGTCCATGGA 9341_Zika_PF_11R SEQ ID NO: 101
taactcgagGCCCATCTCAACCCTTG 9342_Zika_PF_12F SEQ ID NO: 102
ttaggatccTAGAGGGCTTCCAGTGC 9343_Zika_PF_12R SEQ ID NO: 103
taactcgAGATACTCATCTCCAGGTTTGTTG 9344_Zika_PF_13F SEQ ID NO: 104
ttaggatccGAAAACAAAACATCAAGAGTG 9345_Zika_PF_13R SEQ ID NO: 105
taactcgagGAATCTCTCTGTCATGTGTCCT 9346_Zika_PF_14F SEQ ID NO: 106
ttaggatccTTGATGGCACGACCAAC 9347_Zika_PF_14R SEQ ID NO: 107
ttaggatccGTTGTTGATCTGTGTGAAT 9348_Zika_PF_15F SEQ ID NO: 108
taactcgagCAGGTCAATGTCCATTG 9349_Zika_PF_15R SEQ ID NO: 109
ttaggatccTGTTGTGTTCCTATTGCTGGT 9350_Zika_PF_16F SEQ ID NO: 110
taactcgaGTGATCAGRGCCCCAGC 9351_Zika_PF_16R SEQ ID NO: 111
ttaggatccTGCTGCCCAGAAGAGAA 9352_Zika_PF_17F SEQ ID NO: 112
taactcgaGCACCAACAYGGGTTCTT 9353_Zika_PF_17R SEQ ID NO: 113
ttaggatcCTCAAGGACGGTGTGGC 9354_Zika_PF_18F SEQ ID NO: 114
taactcgagCAATGATCTTCATGTTGGG 9355_Zika_PF_18R SEQ ID NO: 115
ttaggatccTATGGGGGAGGACTGGT 9356_Zika_PF_19F SEQ ID NO: 116
taactcGAGCCCAGAACCTTGGATC 9357_Zika_PF_19R SEQ ID NO: 117
ttaggatcCAGACCCCCAAGAAGGC 9358_Zika_PF_20F SEQ ID NO: 118
taactcgagCCCCTTTGGTCTTGTCT 9359_Zika_PF_20R SEQ ID NO: 119
ttaggatccAGGAAGGATGTATGCAGATG 9360_Zika_PF_21F SEQ ID NO: 120
taactcgagACATTTGCGCATATGATTTTG 9361_Zika_PF_21R SEQ ID NO: 121
ttaggatccAGGAAGGACACACAAGAGT 9362_Zika_PF_22F SEQ ID NO: 122
taactcgagACAGGCTGCACAGCTTT 9363_Zika_PF_22R SEQ ID NO: 123
ttaggatccTCTCTCATAGGGCACAGAC
In some embodiments, the Zika virus has a polyprotein, including an
envelope (E) protein, with an amino acid sequence provided by any
one of SEQ ID NO: 14-69 or 78. In some embodiments, the polyprotein
or E protein sequence is at least 80%, 81%, 82%, 83%, 84%, 85%,
86%, 87%, 88%, 89%, 90%, 91%, 92%. 93%, 94%, 95%, 96%, 97%, 98%,
99%, 99.5%, 99.6%, 99.7%, 99.8% or 99.9% identical to any one of
SEQ ID NOs: 2-69 or 78.
The terms "identical" or percent "identity" in the context of two
or more nucleic acids or amino acid sequences refer to two or more
sequences or subsequences that are the same. Two sequences are
"substantially identical" if two sequences have a specified
percentage of amino acid residues or nucleotides that are the same
(e.g., at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity)
over a specified region or over the entire sequence, when compared
and aligned for maximum correspondence over a comparison window, or
designated region as measured using one of the following sequence
comparison algorithms or by manual alignment and visual inspection.
Optionally, the identity exists over a region that is at least
about 50 nucleotides (or 10 amino acids) in length, or more
preferably over a region that is 100 to 500 or 1000 or more
nucleotides (or 20, 50, 200 or more amino acids) in length. In some
embodiments, the identity exists over the length of a protein, such
as the E protein.
For sequence comparison, typically one sequence acts as a reference
sequence, to which test sequences are compared. Methods of
alignment of sequences for comparison are well known in the art.
See, e.g., by the local homology algorithm of Smith and Waterman
(1970) Adv. Appl. Math. 2:482c, by the homology alignment algorithm
of Needleman and Wunsch, J. Mol. Biol. 48:443, 1970, by the search
for similarity method of Pearson and Lipman. Proc. Natl. Acad. Sci.
USA 85:2444, 1988, by computerized implementations of these
algorithms (GAP, BESTFIT, FASTA, Jalview and TFASTA in the
Wisconsin Genetics Software Package, Genetics Computer Group. 575
Science Dr., Madison. Wis.), by multi sequence alignment
implementation using e.g. CLUSTALW (Larkin et al., (2007).
Bioinformatics, 23, 2947-2948.) or MAFFT (Katoh & Toh 2008
Briefings in Bioinformatics 9:286-298), or by manual alignment and
visual inspection (see. e.g., Brent et al., Current Protocols in
Molecular Biology, John Wiley & Sons, Inc. (Ringbou ed.,
2003)). Two examples of algorithms that are suitable for
determining percent sequence identity and sequence similarity are
the BLAST and BLAST 2.0 algorithms, which are described in Altschul
et al., Nuc. Acids Res. 25:3389-3402, 1977 and Altschul et al., J.
Mol. Biol. 215:403-410, 1990, respectively.
EXAMPLES
Example 1: Development of a Purification Process for Live
Attenuated Chikungunya Virus Vaccine Produced in Vero Cells
A downstream process was developed for the purification of
infectious Chikungunya virus particles whereby non-infectious virus
particles and aggregates are removed by the addition of protamine
sulphate. The unexpected and novel purification properties of
protamine sulphate (PS) were evaluated in purification processes
for Chikungunya Virus (ChikV) as follows:
A downstream purification process for the attenuated Chikungunya
virus mutant ".DELTA.5nsP3" (as described by Hallengird et al.,
2014, supra and provided by SEQ ID NO: 77) produced under standard
cell culture conditions in Vero cells was developed. The attenuated
.DELTA.5nsP3 Chikungunya virus was derived from the strain
LR2006-OPY1, the complete genome of which is provided herein as SEQ
ID NO: 72. Briefly, the downstream process consists of crude
harvest filtration followed by concentration and diafiltration on a
tangential flow filtration (TFF) system. Host cell DNA and host
cell proteins were reduced by precipitation with protamine sulphate
and by batch adsorption, respectively. Sucrose density gradient
centrifugation was done as a final polishing step. Out of 16 roller
bottles 1.times.10.sup.12 total PFU were purified with an overall
DSP process yield of 10-15% (.about.1 log 10 TCID50 loss). Sucrose
gradient pool samples were characterized with regard to
product-related impurities, such as hcDNA, HCP and endotoxins and
met safety criteria.
Harvest of Vero Cell Culture Medium Containing ChikV
.DELTA.5nsP3
ChikV .DELTA.5nsP3 was grown on Vero cells in roller bottles. A
first harvest was performed after 24 hours post infection (hpi; day
1 harvest) and stored at 2-8.degree. C. until further processing.
After the first harvest, fresh medium was added and the roller
bottles were returned to the incubator. A second harvest was done
after 48 hours post infection (day 2 harvest) and stored at
2-8.degree. C.
Filtration of Crude Cell Culture Harvest
At both harvest timepoints, the crude harvest was immediately
filtered using a 0.2 .mu.m filter capsule (GE ULTA.TM. CG, 2 inch).
The filtered harvest after 48 hpi was pooled together with the 24
hpi harvest and the pooled filtered harvest material was
immediately further processed by ultrafiltration.
Purification of ChikV .DELTA.5nsP3 by Tangential Flow Filtration
(TFF)
The pooled filtered harvest material was further processed by
tangential flow filtration (TFF) in order to concentrate the
harvest, reduce host cell proteins and replace the depleted cell
culture medium with a defined buffer system (buffer exchange). A
Millipore TFF system (Millipore Pellicon II mini membrane holder)
equipped with a 100 kDa cutoff PES membrane module (Pellicon2
Biomax, 1000 cm.sup.2) was used for concentration and buffer
exchange. A Pellicon2 Biomax membrane module was mounted on the
Pellicon II mini filter holder and the device was connected to a
peristaltic pump. The system was first rinsed with ultra-pure water
and then sanitized by recirculation of 0.1 M NaOH for 60 min. In
case the system was not used immediately, it was stored in 0.1 M
NaOH until use. Prior to use the system was rinsed with 1 L of
RO-water followed by buffer A until the permeate pH value was
constant at pH 7.4.+-.0.2.
Adjustment of the ChikV .DELTA.5nsP3 Harvest (pH, Salt)
The pooled filtered harvest material was adjusted to a final
concentration of 25 mM Tris and 150 mM NaCl using stock solutions
of both components (see Table 1). This adjustment was done to
increase buffering capacity and to reduce unspecific adsorption to
the membrane. The necessary volumes of stock solutions D (1 M Tris,
pH 7.4) and E (4.5 M NaCl) were calculated as follows: Volume of
stock solution D (1 M Tris, pH 7.4) added to pooled harvest=Volume
of pooled filtered harvest/40 Volume of stock solution E (4.5 M
NaCl) added to pooled harvest=Volume of pooled filtered harvest/30
Example: 4 L harvest obtained from 20 RB (850 cm.sup.2) would
require addition of 100 mL stock solution D (1 M Tris, pH 7.4) and
133 mL stock solution E (4.5 M NaCl).
The calculated volumes of stock solution D and Buffer E were added
to the pooled filtered harvest under gentle stirring. The adjusted
harvest was then stirred using a magnetic stirrer for 5 minutes at
room temperature.
Concentration and Diafiltration of the ChikV .DELTA.5nsP3 Harvest
by TFF
In a first step, the adjusted harvest material was concentrated
approximately 10 fold. The feed flowrate was approximately 220
mL/min. The transmembrane flux at a transmembrane pressure of
approximately 0.6 bar was in the range of 90.+-.5 mL/min per 1000
cm.sup.2 membrane. After concentration, the cell culture medium was
exchanged against 25 mM Tris, 150 mM NaCl, pH 7.5, by continuous
diafiltration with 6 volume exchanges. The diafiltration buffer was
supplied to the feed vessel from a measuring cylinder by a second
peristaltic pump set to a flowrate of approximately 90 mL/min.
Minor flowrate adjustments of the second peristaltic pump in the
range of +10 mL/min were done manually to ensure a constant volume
of harvest in the feed vessel. After 6 volume exchanges,
diafiltration was stopped. The liquid remaining in the membrane
module was recovered by pumping the module empty with air.
Sucrose Addition to Diafiltrated ChikV .DELTA.5nsP3 Material
After diafiltration, sucrose stock solution H (50% (w/w) sucrose
solution) was added to the diafiltrated material to achieve a final
sucrose concentration of 10% (w/w). The volume of buffer H was
calculated as follows: Volume of stock solution H added (mL)=Volume
(mL) of diafiltrated ChikV material.times.0.25 (dilution
factor=1:4) (i.e., final sucrose concentration is 10%) Example: 400
mL diafiltrated ChikV solution would require addition of 100 mL
stock solution H (50% sucrose).
The calculated volume of solution H was added to the diafiltrated
ChikV .DELTA.5nsP3 material under gentle stirring and the solution
was then stirred using a magnetic stirrer for a further 5 minutes
at room temperature. (At this stage of the process the material can
be either immediately further processed or stored frozen
(<-65.degree. C., hold step).)
DNA Reduction by Protamine Sulphate Precipitation
A DNA precipitation step using protamine sulphate (PS) was
performed to reduce hcDNA. Protamine sulphate stock solution L (50
mg/mL PS in PBS) was added to the diafiltrated ChikV .DELTA.5nsP3
material to a final nominal concentration of .about.1.6 mg/mL. The
necessary volume of stock solution L was calculated as follows:
Volume of stock solution L (50 mg/mL PS) added=Volume of
diafiltrated ChikV .DELTA.5nsP3 material in 10% sucrose/31 Example:
500 mL diafiltrated ChikV .DELTA.5nsP3 solution in 10% sucrose
would require addition of 16 mL stock solution L (50 mg/mL PS in
PBS).
The protamine sulphate stock solution was added while stirring the
ChikV .DELTA.5nsP3 material using a magnetic stirrer followed by
incubation at 2-8.degree. C. for 30 minutes. After incubation, the
precipitate was not removed. The material was immediately further
processed by batch adsorption with Capto.TM. Core 700
chromatography media.
Batch Adsorption with Capto.TM. Core 700
To reduce HCPs, a batch adsorption step with Capto.TM. Core 700
(CC700) chromatography medium was performed after DNA
precipitation. CC700 slurry (50% slurry in buffer A) was added
directly to the protamine sulphate treated material. The required
slurry volume was determined based on the volume of .DELTA.5nsP3
ChikV harvest material (d1+d2) and was calculated as follows:
Volume of CC700 slurry added to PS-treated concentrated harvest
(mL)=Volume of .DELTA.5nsP3 ChikV harvest material (mL).times.0.02
(dilution factor=1:50) (i.e., final concentration of CC700 is 1%)
After slurry addition, the material was incubated at 4.degree. C.
for 15 minutes under constant agitation using a magnetic stirrer.
After incubation, the CC700 solid matter was allowed to settle by
gravity for 10 minutes. The .DELTA.5nsP3 ChikV material was then
removed from the top of the solution in order to avoid blocking of
the filter by the CaptoCore particles. The remaining CaptoCore
particles and the DNA precipitate were then removed from the
solution by filtration using a 0.2 .mu.m Mini Kleenpak EKV filter
capsule (Pall). The resulting filtrate was further processed by
sucrose density gradient centrifugation. Sucrose Density Gradient
Centrifugation
Sucrose density gradient centrifugation (SGC) was used for final
concentration and polishing of the .DELTA.5nsP3 ChikV material. The
.DELTA.5nsP3 ChikV material was loaded on top of a solution
consisting of three layers of sucrose with different densities. The
three sucrose layers were selected based on a preliminary study
which showed the formation of a linear sucrose gradient and good
separation of the virus particles from residual contaminants. The
optimal volumes of the sucrose solutions were determined
empirically. The volumes of individual layers for a centrifugation
at 500 mL scale are shown in Table 3.
TABLE-US-00008 TABLE 3 Sucrose concentrations and volumes (500 mL
scale). Volume Solution (mL) Harvest with 10% sucrose 360 15%
sucrose 40 35% sucrose 40 50% sucrose 60 Total volume 500
Preparation of the Sucrose Gradient
The sucrose gradient bottles (500 mL) were prepared by underlaying
the individual sucrose layers. A 3.5 mm ID plastic tube was
attached to 60 cm of peristaltic pump tubing. The plastic tube was
mounted on a laboratory stand using a clamp and placed into the
centrifuge bottle. The nozzle of the plastic tube was placed at the
bottom of the bottle. Using a peristaltic pump set to a flow rate
of 25 mL per minute, the .DELTA.5nsP3 ChikV material and the
sucrose solutions were pumped into the cylinder. A measuring
cylinder was used as a feed vessel. The first solution pumped was
the .DELTA.5nsP3 ChikV material as it had the lowest density (10%
sucrose (w/w)). Following the addition of the .DELTA.5nsP3 ChikV
material, the sucrose solutions were pumped in ascending order
starting with the lowest (15%), followed by the 35% sucrose
solution and finishing with the highest density sucrose solution
(50%). After all sucrose solutions were transferred, the plastic
tubing was carefully removed in order not to disturb the layers. An
illustration of a completed gradient is shown in FIG. 14.
Centrifugation
Prior to centrifugation a Beckman Avanti JXN-26 centrifuge equipped
with rotor Beckman 10.500 was pre-cooled to 4.degree. C. The
prepared SG bottles were carefully transferred into the pre-cooled
(4.degree. C.) rotor so as to not to disturb the sucrose layers.
The bottles were centrifuged at 10,000 rpm (.about.18,500 rcf) at
4.degree. C. for 17-20 hours. (In case a different centrifuge
system with a different rotor would be used, the necessary speed
and centrifugation times would need to be calculated based on the
k-factor in order to achieve comparable centrifugation
efficiency.)
Sucrose Gradient Harvest Harvesting of the sucrose gradient
following centrifugation was done manually using a peristaltic
pump.
A 3.5 mm ID plastic tube attached to 60 cm of peristaltic pump
tubing was used for harvesting the sucrose gradient. The 500 mL
bottle containing the centrifuged gradient was mounted onto a
laboratory stand in a tilted position (.about.12.degree.) using a
clamp. The plastic tubing was then placed into the bottle touching
the bottom edge of the bottle and was fastened in position using a
clamp. This resulted in a small gap of 1-2 mm between the tubing
inlet and the bottom of the bottle (see FIG. 14). Using a
peristaltic pump set to a flow rate of 60 mL per minute, the
gradient was harvested and manually split into 5 mL fractions. A
third of the bottle volume was harvested and the rest was
discarded. The fractions were immediately tested by measuring UV
absorbance in a plate reader as described below.
Analysis of fractions by UV absorbance and SEC-HPLC
UV absorbance measurement was used as primary method for analysis
of the sucrose gradient fractions. Absorbance at 214, 280 and 260
nm was tested immediately after fractionation was completed.
Briefly, a 100 .mu.L sample of each fraction was transferred into a
96 well plate and absorbance at 214, 260 and 280 nm was measured
using a plate reader. The absorbance values were plotted against
the fraction number. A representative profile is shown in FIG. 11A.
The .DELTA.5nsP3 ChikV containing fractions were indicated by a
peak in all three measured wavelengths (FIG. 11A, grey shaded
area). The presence of impurities was indicated by an increase of
the UV214 signal after the main peak. The fractions comprising the
main peak were pooled from the peak start to the valley of the 214
nm curve. This method can be used as single method for pooling
.DELTA.5nsP3 ChikV fractions.
After identification of the virus containing fractions, the
respective fractions were pooled. Pooling criteria for SGC
fractions were based on UV 260 nm data, e.g. start of pooling at
.about.10% of peak maximum, end of pooling at .about.30% of peak
maximum. (Final pooling criteria at a manufacturing scale may need
to be determined empirically.) The sucrose gradient pool was either
stored at <-65.degree. C. or immediately further formulated to
drug substance (DS).
Size Exclusion Chromatography
The final pooled SGC fractions containing purified infectious
.DELTA.5nsP3 ChikV particles were analyzed for purity by SEC-HPLC.
In brief, SEC was performed as follows: a Superose 6 10/300
Increase column (GE Healthcare) equilibrated with PBS+250 mM NaCl,
pH 7.4 at 1 ml/min and 25.degree. C., was used to detect ChikV
particles at 214 nm detection wavelength in the pooled samples.
SEC-HPLC is a semi-quantitative (relative yield) and qualitative
(purity) method that separates intact virus particles from virus
aggregates and host cell proteins (HCPs). The method cannot
distinguish between infectious and non-infectious virus particles
due to their identical retention time.
As shown in FIG. 11B, there were two defined peaks identified by
SEC: the .DELTA.5nsP3 ChikV peak and a peak corresponding to buffer
components. The SGC step yield based on SEC-HPLC data for pooled
fractions F6-F11 was estimated at .about.70%. The final purity of
the .DELTA.5nsP3 ChikV SGC pool, based on SEC-HPLC analysis, was
estimated at >95%.
SDS-PAGE and Silver Stain
SDS-PAGE silver stain was performed in order to qualitatively
assess sample purity throughout the purification process from the
first crude harvest through SGC. Briefly, ChikV process samples
analyzed by SDS-PAGE/silver stain were diluted 1:1.33 with LDS
buffer and were heated to 70.degree. C. for 5 minutes. The samples
were loaded onto 4-12% Bis-Tris Gels (NuPAGE). Silver staining was
done using the Silver Express staining kit (Invitrogen).
A silver-stained gel of a representative ChikV .DELTA.5nsP3
purification is shown in FIG. 11C. The viral proteins E1, E2 and C
are marked on the right-hand side of the gel. The final SGC pool
(fraction 7-fraction 11) is shown in lane 12. Note that a defined
HCP band migrating between ChikV protein E2 and C still appears
after CaptoCore700 treatment that has been identified as a single
band in SDS-PAGE. This impurity is removed by sucrose gradient
centrifugation, but can still be seen in fractions 13 and 14
(corresponding to lanes 14 and 15 of FIG. 11C).
Enrichment of Infectious .DELTA.5nsP3 ChikV particles by PS
Treatment
Although generally used as a method of removing contaminating
hcDNA, it was observed in the course of the present invention that
PS treatment also removes virus aggregates and HCPs. Size exclusion
chromatography (SEC-HPLC, as described above) was used throughout
the purification process to determine the purity of the ChikV virus
relative to impurities which also generate UV absorption.
As can be seen in FIG. 12B, treatment with PS reduces not only host
cell proteins and low molecular weight contaminants of the
.DELTA.5nsP3 ChikV preparation, but also reduces the SEC area
corresponding to virus products, including aggregates as indicated.
A surprising finding, however, was that even a reduction of the
total SEC area by 86% (in a representative experiment shown in FIG.
12A, grey portion of bars) did not result in a concomitant
reduction in infectious virus particles as measured by TCID50 (FIG.
12A, left axis). Instead, even though a large percentage of virus
particles were removed by PS treatment, the majority of infectious
particles remained. This observation indicates that PS treatment
selectively enriches infectious virus particles from a larger pool
of total virus particles present in the crude harvest.
TCID50 was performed to quantify infectious virus particles during
the course of the purification process and to assign an active
virus titer to final drug substance and drug product samples.
Briefly, Vero cells were seeded at 2.times.10.sup.4 cells per well
in 100 .mu.L medium (EMEM with 2 mM L-Glutamine+5% FBS+1%
antibiotic/antimycotic) in 96-well TC-treated flat-bottom plates
and incubated overnight at 35.degree. C./5% CO.sub.2. On day two,
Vero cell monolayers were infected by adding 100 .mu.L of 1:10
serial dilutions of test samples to each of quintuplicate wells
seeded with Vero cells and incubated at 35.degree. C./5% CO.sub.2.
On day seven, plaques were counted by visualization under a
microscope. The TCID50 was calculated according to the Reed &
Munch endpoint calculation method (Reed, L. J.; Muench, H. (1938) A
simple method of estimating fifty percent endpoints, The American
Journal of Hygiene 27: 493-497).
Furthermore, electron microscopy of .DELTA.5nsP3 ChikV samples
before and after PS treatment showed that not only large aggregates
but also smaller non-infectious virus-like particles (essentially
not fully assembled particles lacking the RNA genome) were
effectively removed by PS (FIG. 13).
This enrichment of infectious virus particles was also observed
when analyzing day one and day two crude harvests separately. As
presented in Table 4, the SEC area (total virus particles) of the
day 1 harvest remains roughly the same after PS treatment; whereas
a large decrease in virus peak area is seen for the day 2 harvest
after PS treatment. This observation was confirmed by MALLS
analysis of the virus preparation, wherein it was seen that a
higher percentage of virus particles were of the correct size
following PS treatment. Similarly to the results shown in FIG. 12,
day 1 and day 2 harvests showed no reduction in infectious
particles as measured by TCID50 following PS treatment, indicating
that mainly non-infectious, immature and/or aggregated virus
particles are removed during the PS treatment and infectious
particles are enriched in the preparation.
The PS-treated samples were further purified by sucrose gradient
centrifugation (see FIG. 14 for a schematic preparation of an
optimized sucrose gradient). An optimal sucrose gradient was
determined experimentally as shown in FIG. 15. Results of the
further purification of PS-treated ChikV on the optimized sucrose
gradient of the invention are shown in FIG. 15D.
TABLE-US-00009 TABLE 4 Overview of the process of .DELTA.5nsP3
ChikV purification as described in Example 1. SEC-MALLS analysis of
harvests before and after PS treatment shows the removal of larger
virus particles (aggregates), an effect that is particularly
pronounced for day 2 harvests. MALLS Infectious SEC % correct
particles Area Total size TCID50 [mAU*min] particles/mL (20-40 nm)
log 10 Harvest 1 (H1) 57 1.17E+11 49% 10.2 H1 + protamine 53
1.33E+11 81% 10.0 sulphate Harvest 2 (H2) 36 4.60E+09 3% 7.9 H2 +
protamine 2 8.80E+09 59% 7.9 sulphate Combined 67 2.60E+10 14% 9.9
Harvests (C) C + protamine 24 8.00E+10 72% 10.1 sulphate
Finally, an overview of the relative amounts of .DELTA.5nsP3 ChikV
particles and other components as measured by SEC-HPLC at various
steps throughout the entire virus purification process from crude
harvest (a) to the final SGC purified pool is presented in FIG. 16.
In sum, not only are the vast majority of contaminants and
undesired products removed by the process, infectious ChikV
particles are highly purified. As shown by the previously presented
data, the final preparation is a highly enriched preparation of
infectious ChikV particles.
Drug Substance (DS) Formulation
The pooled SGC fractions are diluted with DS formulation buffer M
(10 mM Tris, 5% Sucrose (w/w), 1% (10 mg/mL) rHSA, pH 7.4.+-.0.2).
The final target volume of DS should be in the range of
approximately 2 L. Based on current data the estimated range of the
dilution factor might be 1:20 to 1:50.
Final DS Sterile Filtration
The final DS was filtered under aseptic conditions in a laminar
flow hood using a sterility grade 0.2 .mu.m syringe filter (e.g.
0.2 .mu.m Mini Kleenpak EKV filter capsule with 220 cm.sup.2 filter
surface, Pall).
Quantification of Host Cell DNA (hcDNA) Host Cell Protein (HCP) and
Endotoxin
The residual host cell DNA content of the sucrose gradient pool
samples was determined by using the qPCR based assay. The DNA
content in SGC pool was determined to be <0.002 ng/mL. The
presence of residual host cell protein (HCP) from Vero cells was
determined by ELISA. Residual host cell proteins present in the
sucrose gradient pool samples were quantified using the Vero Cell
HCP ELISA kit (Cygnus, F500). The residual host cell protein
content in SGC pool was determined to be <200 ng/mL.
Endotoxin content of the SGC pool and DS was measured by
Endosafe.RTM.-PTS.TM. system (Charles River). The system uses
Limulus Amembocyte Lysate (LAL) reagents by a kinetic chromogenic
methodology to measure color intensity directly related to the
endotoxin content in a sample. Each cartridge contains precise
amounts of a licensed LAL reagent, chromogenic substrate and an
endotoxin control standard. Samples were diluted 1:100 in WFI. The
SGC Pool F7-F11 was determined to be <5.00 EU/mL; likewise, the
Drug Substance was also determined to have <5.00 EU/mL.
The following specifications for impurities in final Drug product
were proposed: hcDNA <10 ng/dose; Endotoxins <50 EU/dose; HCP
<200 ng/dose. These residual specifications would already be met
in the highly concentrated SGC pool (.about.10 log TCID50/mL),
which provides a high margin of safety considering the high
dilution factor of SGC pool to final DP of >1:1000.
Example 2: Production of a Zika Drug Substance Suitable for
Application as a Vaccine in Humans and Animals
Materials and Methods:
For the production of ZikaV the JEV process platform (Srivastava et
al., Vaccine 19 (2001) 4557-4565; U.S. Pat. No. 6,309,650B1) was
used as a basis. Small changes of certain process steps were
adapted to ZikaV properties and to improve purity. A short summary
of the process steps is outlined below (see also FIGS. 17A and B).
Briefly, the unexpected and novel purification properties of
protamine sulphate (PS) were evaluated in purification processes
for Zika Virus similarly as found above. Again non-infectious virus
particle aggregates, HCP and other LMW impurities were removed by
PS precipitation as shown by removal of aggregate shoulder in
SEC-HPLC and no loss of infectious virus titer by PS treatment
(FIG. 18). Further optimization of the Zika purification protocol
is provided below.
Upstream:
Roller Bottle based Vero cell expansion (25.times.850 cm2
CellBind): 5% CO.sub.2, 35.degree. C., MEM+2 mM L-Glutamine+10% FBS
Infection with ZikaV research Master Seed Bank (rMSB) at MOI 0.01
Virus Production without serum 5% CO.sub.2, 35.degree. C., MEM+2 mM
L-Glutamine Multiple harvests (days 2, 3, 5 and 7) with re-feed
Sterile filtration of harvests and storage at 2-8.degree. C. until
further processing Downstream: Pooling of harvests and
concentration by ultrafiltration (100 kDa) Stabilization of
concentrated harvest (Tris/10% sucrose) for storage if required
(-80.degree. C.) Removal of hcDNA by Protamine Sulphate (2 mg/mL)
Sucrose Gradient Purification (optimized three layered gradient)
Formaldehyde Inactivation (0.02%, 22.degree. C., 10 days),
neutralization with Na-metabisulfite Dilution to DS antigen target
content and formulation with Aluminium hydroxide (0.5 mg Al/mL)
Zika Virus Strain H/PF/2013 was originally isolated from a
51-year-old woman (accession number KJ776791.1, also SEQ ID NO: 13
herein) from French Polynesia. A sample was obtained from the
European Virus Archive (EVAg; Ref-SKU: 001v-EVA1545). Based on this
material, a research master seed bank (rMSB) was prepared on Vero
cells as the cell substrate and the genomic sequence was checked by
sequencing. Because the genomic sequence at the 5' and 3' flanking
sequences of Zika virus strain H/PF/2013 was unknown, primers for
sequencing were designed in those regions based on other Zika virus
strains whereas the internal primers were designed from the
published sequence (SEQ ID NOs: 80 to 123, see also Table A). The
sequence obtained from the rMSB by use of these primers is provided
by SEQ ID NO: 78. There was 100% overlap of the sequence with the
published sequence of Zika Virus Strain H/PF/2013 (SEQ ID NO: 13).
However, we sequenced additional regions 5' (an additional 40 bp)
and 3 (an additional 160 bp) represented in SEQ ID NO: 78. In a
preferred embodiment, the Zika virus of the invention comprises SEQ
ID NO: 78. The genomic RNA is somewhat longer than the sequence
according to SEQ ID NO: 78 (perhaps an additional 200 bp).
Additionally, a Zika virus adapted to a host cell such as e.g. Vero
cells may be expected to contain one or more mutations. For these
reasons, the Zika virus of the current invention comprises the
sequence of SEQ ID NO: 78 or, preferably, a sequence with at least
95%, 96%, 97%, 98%, or at least 99% sequence identity to the
sequence provided by SEQ ID NO: 78. Furthermore, because the viral
genome is likely to contain even further flanking regions to SEQ ID
NO: 78; in one embodiment, the Zika virus of the invention contains
the sequence of SEQ ID NO: 78 and optionally further comprises
extensions at the 5' and/or 3' ends of at least 10, at least 20, at
least 30, at least 40, at least 50, at least 60, at least 70, at
least 80, at least 90, at least 100, at least 110, at least 120 or
at least 130 nucleotides. In a preferred embodiment, the Zika virus
comprises at least the coding sequence for the entire polyprotein
of Zika Virus Strain H/PF/2013 of the invention i.e. the amino acid
sequence of SEQ ID NO: 79 or a polyprotein with at least 95%, 96%,
97%, 98%, or at least 99% sequence identity to the sequence
provided by SEQ ID NO: 79. Furthermore, the Zika virus comprises at
least the coding sequence for the E-protein of Zika Virus Strain
H/PF/2013 of the invention SEQ ID NO: 47 or an E-protein thereof
with at least 95%, 96%, 97%, 98%, or at least 99% sequence identity
to the sequence provided by SEQ ID NO: 47.
Virus Growth on Vero Cells
Vero cells were grown in Eagle's minimal essential medium (EMEM)
containing 10% fetal bovine serum (FBS). Roller bottle cultures of
Vero cell monolayers were infected with Zika Virus Strain H/PF/2013
at a multiplicity of infection (moi) of 0.01 plaque forming units
(pfu) per cell. After 2 hours of virus adsorption, the cultures
were washed 3 times with PBS and fed with EMEM without FBS and
incubated at +35.degree. C. with 5% CO.sub.2. Infected Vero cell
cultures were incubated until the virus titer reaches a desired
level.
The culture medium was harvested at days 2, 3, 5 and 7 and were
pooled from those harvest days and then centrifuged in a standard
centrifuge. The supernatants were then filtered. Virus culture
supernatants were concentrated by TFF ultrafiltration to remove
cell culture media components and to reduce batch volume.
Evaluation of Harvest Procedure
The current JEV harvest process has scheduled harvests on days 3,
5, 7 and 9 post infection. To mimic the JEV process roller bottles
were infected with ZIKV bank P4-FBS at an MOI of 0.01 in infection
medium (MEM with 2% FBS+2 mM L-glutamine) for 2 hours. After
removing the inoculum the cells were washed twice with PBS and 200
mL production medium (MEM+2 mM L-glutamine) was added.
After taking a sample on day 2 the first virus harvest was
conducted on day 3 after infection. At this point significantly
higher CPE could be observed compared to cells where virus was
removed on day 2. Plaque assay analysis showed that the viral
titers on day 2 were in the same range as for the standard
harvesting schedule. However, starting with the day 3 harvest, the
observed titers were significantly lower correlating with the
increased CPE observed compared to the standard harvest schedule.
On day 5 post infection no more living cells could be observed at
all and the experiment was terminated with a final day 5
harvest.
TABLE-US-00010 TABLE 5 The calculated titers per plaque assay are
summarized in the list below. Log 10 PFU/mL sample day 2 7.02
harvest day 3 6.66 harvest day 5 6.26
This finding led to an optimized harvest schedule to better control
of CPE and allow additional harvest day 5 and 7, see FIG. 23. For
both harvest days the optimized ZikaV protocol yield significant
higher virus titers compared to the modified protocol showing that
the time of the first harvest is crucial for production yields.
Additionally first harvesting at day 3 results in maximum 2 harvest
points whereas first harvesting at day 2 allows for 4 harvest
points further increasing the yield gain.
Downstream Purification of Zika virus
The purification process was carried out at room temperature
(18-22.degree. C.) unless stated otherwise. Virus purification
started with concentration of filtered combined harvest using 100
kDa cut-off TFF ultrafiltration modules to remove cell culture
media components and reduce batch volume. After concentration, the
pooled filtered harvest material was adjusted to a final
concentration of 25 mM Tris pH 7.5 and 10% sucrose (w/w) using
stock solution of both components (see FIG. 19 for SEC-HPLC of
different harvests prior to PS treatment). This allowed for
freezing the concentrated harvest at <-65.degree. C. if
required.
Host cell DNA and protein reduction as well reduction of
non-infectious virus aggregates in the concentrated material was
achieved by precipitation with protamine sulphate (2 mg/mL)
followed by sucrose density centrifugation (2-8.degree. C.) as
final polishing step (see FIG. 20 for SEC-HPLC of different
harvests post PS treatment). The purification process was designed
to be completed within 2 working days with SGC starting on end of
day 1 followed by fractionation and SDS-PAGE analysis on day 2. The
sucrose gradient fractions were stored at 2-8.degree. C. during the
SDS-PAGE analysis (Silver staining) to identify the pure fractions
containing ZikaV (see FIG. 21). After pooling the relevant
fractions, the pool was diluted and inactivated by Formalin. After
pooling the relevant fractions of sucrose gradient centrifugation,
the pool was diluted 1:3 in PBS and inactivated by Formalin (0.02%
v/v, 200 ppm). Fractions were subjected to analysis by
SDS-PAGE.
Effect of PS Treatment on Virus Recovery
Samples of individual 30.times. concentrated harvests days 2, 3, 5
and 7 were analysed before (FIG. 19) and after PS (FIG. 20)
treatment by SEC-HPLC and plaque assay. SEC-HPLC was used for
determination of relative total ZikaV content (active+inactive)
expressed as peak area, whereas the rel. ZikaV peak purity is given
as relative content of virus monomer population to total virus
peak. Plaque assay states the content of total active virus
particles in each sample. Experimental results are summarized in
Table 1. The virus peak recovery by SEC-HPLC was only between 12 to
36% with peak purity after PS treatment in the range of >90% (no
virus aggregates detected). The recovery of active virus particles
by plaque assay was all >100% (130-700%, range within the
variability of the assay) showing that no active virus particles
were lost during PS treatment. These results show that during PS
treatment only non-infective (immature and/or aggregated virus)
particles were removed.
TABLE-US-00011 TABLE 6 ZikaV recovery by SEC-HPLC and plaque assay
before and after PS treatment. SEC-HPLC Peak area mAU*min SEC rel.
virus monomer Harvest 30x 30x + Recovery content after PS day conc
PS (%) (%) Day 2 101.36 18.63 18 89% Day 3 144.51 17.48 12 90% Day
5 19.97 5.92 30 96% Day 7 68.80 24.43 36 99% Plaque Assay PFU/mL
Plaque Harvest 30x 30x + Recovery day conc PS (%) Day 2 3E+08 5E+08
179 Day 3 2E+08 4E+08 193 Day 5 1E+08 9E+08 700 Day 7 3E+08 4E+08
132
Sucrose Gradient Centrifugation
The PS treated harvest was split in two parts and loaded on two
centrifuge bottles.
Sucrose density gradient centrifugation (SGC) was used for final
concentration and polishing of the ZikaV material. The ZikaV PS
treated concentrated harvest was loaded on top of a solution
consisting of three layers of sucrose with different densities. The
three sucrose layers were selected based on a preliminary study
which showed the formation of a linear sucrose gradient and
complete separation of the virus particles from residual
contaminants as demonstrated for ChikV (FIG. 15D). The optimal
volumes of the sucrose solutions were determined empirically. The
volumes of individual layers for a centrifugation in 100 mL bottle
scale are shown in Table 2.
TABLE-US-00012 TABLE 7 Individual layers/volumes for a
centrifugation in bottle. Volume Solution (mL) PS treated harvest
in 10% sucrose (L) 40 15% sucrose (J) 15 35% sucrose (I) 15 50%
sucrose (H) 20 Total volume 90
The sucrose gradient bottles were prepared by stratifying the
individual sucrose layers. A plastic tube was attached to
peristaltic pump tubing. The plastic tube was mounted on a
laboratory stand using a clamp and placed into the centrifuge
bottle. The nozzle of the plastic tube was touching the bottom of
the bottle. Using a peristaltic pump the ZikaV material and the
sucrose solutions were pumped into the cylinder. A measuring
cylinder was used as feed vessel. The first solution pumped was the
ZikaV material as it represented the solution of lowest density
(10% sucrose (w/w)). After the ZikaV material the sucrose solutions
were pumped in ascending order starting with the 15% (w/w) solution
J, followed by 35% sucrose solution I and finishing with the
highest density sucrose solution H (50% (w/w)). The described setup
is shown in FIG. 14. After all sucrose solutions were transferred
the plastic tubing was carefully removed in order not to disturb
the layers.
Prior to centrifugation the centrifuge was pre-cooled to 4.degree.
C. The prepared SG bottles were carefully transferred into the
pre-cooled rotor. (Note: Sudden movement of the bottles during
transfer to the rotor must be avoided in order not to disturb the
sucrose layers.) The bottles were centrifuged at .about.11.000 RCF
max at 4.degree. C. for at least 20 hours, no brake/deceleration
activated. In case a different centrifuge system with a different
rotor is used the necessary speed and centrifugation times need to
be calculated based on the k-factor in order to achieve comparable
centrifugation efficiency.
Harvesting of the sucrose gradient was done manually using a
peristaltic pump. A plastic tube attached to peristaltic pump
tubing was used for harvesting the sucrose gradient. The bottle
containing the gradient was mounted onto a laboratory stand in a
tilted position (.about.12.degree.) using a clamp. The plastic
tubing was then placed into the bottle touching the bottom edge of
the bottle and was fastened in position using a clamp.
This resulted in a small gap of 1-2 mm between the tubing inlet and
the bottom of the bottle (see FIG. 14).
Using a peristaltic pump set to a flow rate of 30 mL per minute the
gradient was harvested and manually split into 2 mL fractions. A
total number of 32 fractions per bottle were harvested (.about.64
mL) and the remaining volume was discarded. The fractions were
immediately tested by SDS-PAGE/silver stain to identify the virus
containing fractions with sufficient high purity. Representative
SDS-PAGE is shown in FIG. 21. Fraction 10-14 were pooled and
further processed.
The purified viral solution was inactivated by incubation with
0.02% formaldehyde over a period of ten days in a 22.degree. C.
controlled-temperature incubator. The formaldehyde is neutralized
by addition of sodium metabisulphite on the tenth day.
The sucrose gradient pool (.about.17 mL after sampling) was further
diluted 3-fold with PBS to a final volume of 51 mL in a PETG
container. A volume of 1% formaldehyde (10,000 ppm) solution
equivalent to 1/50 of the final volume of the pre-formaldehyde pool
was added to this pool resulting in an effective concentration of
200 ppm. The formaldehyde-treated solution was mixed on a magnetic
stirrer for 10 minutes. After sampling, the formaldehyde-treated
viral solution was placed within a cooled incubator at 22.degree.
C..+-.2.degree. C. On Day 5 post addition of formaldehyde, the
formaldehyde-treated viral solution was filtered through a 0.2 m
filter and then placed in the incubator at 22.degree.
C..+-.2.degree. C. again. On Day 10, after removing the 10-Day
inactivation final sample, a volume of 1% (of the weight of the
final formaldehyde-treated viral solution) of 200 mM-sodium
metabisulphite solution (2 mM final concentration) was aseptically
transferred into the PETG container containing the
formaldehyde-treated viral solution. After mixing for 5 minutes on
a magnetic stirrer, the neutralized inactivated viral solution is
held at room temperature (20 to 25.degree. C.) for a minimum of 30
minutes. After sampling, the neutralized inactivated viral solution
is stored at 5.degree. C..+-.3.degree. C. until further
processing.
Inactivation by Formaldehyde
Critical parameters for this step are final formalin concentration,
temperature, mixing and transfer into a new container. A
preliminary acceptance criterion for maximum pfu/mL (determined by
plaque assay) has been set on the diluted pool pre formaldehyde
treatment.
The quality of the neutralized inactivated viral solution was
monitored by the following parameters: Plaque assay on Day 10,
SEC-HPLC, SDS-PAGE/Western Blot.
Interestingly, SEC-HPLC analysis of samples taken during the
inactivation period followed by neutralization with bisulfite
showed more or less constant peak area throughout the inactivation
period. This is in contrast to JEV where losses of viral particles
up to 60% are observed using the process disclosed by Srivastava et
al. Vaccine 19 (2001) 4557-4565. In a scale-down model the viral
losses were even much higher due to surface/area ratio at smaller
scale and high losses due to unspecific adsorption. Differences of
the ZikaV inactivation experiment and JEV inactivation were noticed
as follows: A) Much higher purity of ZikaV SGP pool with regard to
residual PS (<2 .mu.g/mL) compared to JEV. The 3-fold ZikaV
inactivated sample contained therefore <<1 .mu.g/mL of
residual PS. Commercial JEV SGP pool contains on average .about.120
g/mL (up to 152 .mu.g/mL possible). The average dilution to
inactivation solution of .about.14-fold results in a residual PS
content up to .about.11 .mu.g/mL. It may be that higher amount of
residual PS could cause virus precipitation due to
cross-linking/reaction with formalin. B) ZikaV inactivation sample
contained .about.10% sucrose (3-fold dilution of SGP pool
containing .about.30-35% sucrose). Sucrose might have stabilizing
effect of viral ZikaV particles during treatment with formalin.
Dilution to DS and Formulation with Aluminium Hydroxide (DP)
For preparation of ZikaV drug substance used in mouse potency assay
an antigen content (expressed as total viral particles or SEC peak
area) of 5 times higher compared to Ixiaro was targeted. The basis
for determination of antigen content was SEC-HPLC. Briefly, a
Superose 6 10/300 Increase column (GE Healthcare) equilibrated with
PBS+250 mM NaCl, pH 7.4 at 1 ml/min and 25.degree. C., was used to
detect ZikaV at 214 nm detection wavelength in harvest samples and
throughout the downstream process. In the current JEV process the
antigen content in NIV is determined by a specific ELISA. A good
correlation was observed between antigen content determined by
ELISA and SEC-HPLC. On average, the antigen content in commercial
NIV samples is in the range of 33 AU/mL corresponding to .about.5.2
mAU JEV peak area, see FIG. 22.
ZikaV NIV day 10 (Zika peak .about.36 mAU, analysed on Waters
HPLC/Superose6 Increase column) was diluted with PBS to a target of
6.3 (.about.5.7.times. dilution). Aluminium hydroxide was added to
a final concentration of 0.5 mg/mL Aluminium (1/20 v/v Alum 2%
stock solution added) to prepare ZikaV Drug Product (DP). The DP
was gently mixed for 5 min. An aliquot of the DP was removed, Alum
sedimented by centrifugation and the clear supernatant analysed by
SEC-HPLC. No ZikaV peak was detected in the supernatant indicating
complete adsorption (estimated as >95%) of viral particles on
the mineral adjuvant. Formulated ZikaV DP was stored at 2-8.degree.
C.
The impurity profile of the inactivated Zika virus DS is comparable
to the profile of JEV DS with the exception of a lower PS content
(Table 8).
TABLE-US-00013 TABLE 8 Determination of impurity profile in Zika
and JEV DS samples: Specification (JEV DS) JEV Zika HCP (ng/mL)
<100 <LOQ <LOQ LOQ 12 ng/mL DNA (pg/mL) <200 <40
<40 LOQ 40 pg/mL Aggregates Not specified, <LOQ <LOQ by
SEC- part of characterization MALLS (%) LOQ 5% PS (.mu.g/mL)
Specification only at SGP pool to ~4* <<LOQ demonstrate
consistent process performance (19-152 .mu.g/mL), *PS content in DS
calculated based on PS content in SGP pool (~100 .mu.g/mL) and
average dilution factor (~28.times.) to DS; LOQ 2 .mu.g/mL *Typical
PS impurity in a JEV sample produced in accordance with protocol
disclosed in Srivastava et al. Vaccine 19 (2001) 4557-4565.
SEC-MALLS Results
A representative SEC-HPLC elution profile of ZikaV NIV at 214 nm
detection wave length is shown in FIG. 24. Note that BSA (50 g/mL)
was added to the sample to minimize losses in HPLC glass vial due
to unspecific surface adsorption. ZikaV monomer content was
estimated as .about.98% with a multimer content of .about.2%.
SEC-MALLS analysis (FIG. 25) of the sample confirmed the radius Rz
of the monomer ZikaV population peak 1 as 21.6 nm and .about.49 nm
for the multimer peak 2. Cumulative particle size distribution
showed that 89% of all viral particles are within a radius range
between 18 to 25 nm (FIG. 26).
Results confirm purity and homogeneity of ZikaV NIV.
Viral Titer by Plaque Assay
TABLE-US-00014 TABLE 9 Active ZikaV pfus were quantified by plaque
assay throughout the process. Sample Pfu/mL Harvest day 2
(filtered) 6.4 .times. 10.sup.7 Harvest day 3 (filtered) 1.0
.times. 10.sup.8 Harvest day 5 (filtered) 1.5 .times. 10.sup.8
Harvest day 7 (filtered) 1.1 .times. 10.sup.8 PS treated harvest
300.times. concentrate (=SGP load) 9.0 .times. 10.sup.8 SGP pool
8.9 .times. 10.sup.8 Inactivation start (SGP pool 1:3 diluted) 3.4
.times. 10.sup.8 Inactivation day 5 <LOD Inactivation day 10
<LOD
Comparison of PS and Benzonase on Process Performance
A direct comparison of DNA removal method of concentrated ZikaV
harvest pool was done. One aliquot was treated with PS (2 mg/mL, 15
min at room temperature), the other aliquot was treated with
Benzonase (50 U/mL, 2 mM MgCl2, 4 h RT, 48 h 2-8.degree. C.). Both
samples were further purified by sucrose gradient as described in
this report. Interestingly, the Benzonase treated samples did not
yield any pure fractions after sucrose gradient centrifugation of
the treated ZikaV harvest. In those fractions where the specific
virus bands were detected, a high amount of host cell protein was
detected throughout the collected fractions. The PS treated
material resulted in pure ZikaV containing fractions as expected.
This finding may suggest that PS is not only effective for DNA
removal by precipitation; in addition it improves the recovery of
virus particles in the gradient by disrupting interaction of DNA
(fragments) and virus particles. Benzonase treatment does not
remove DNA, it only results in its fragmentation. Residual DNA
fragments might still interact with virus particles and residual
HCPs resulting in cross-contamination and co-purification in the
sucrose gradient. Pooled SGP fractions were also analysed by
SEC-HPLC. Although a large peak was detected, SDS-PAGE confirmed
that this sample was highly contaminated with HCPs. A large peak
might be detected at UV214 and 280 nm after SEC-HPLC analysis due
to possible interaction of HCPs with large virus particles,
changing the UV absorbance.
Immunogenicity of Vero Grown Zika Virus
Immunization of Mice
Prior to immunization, groups of ten 6-week-old female CD1 mice
were bled via vena facialis and pre-immune sera were prepared. One
intraperitoneal immunizations of 200 .mu.L were administered. A
dose titration (12 .mu.g, 3 .mu.g, 1 .mu.g, 0.33 .mu.g, 0.11 .mu.g,
0.037 .mu.g and 0.012 .mu.g, equivalent to the protein amount in
IXIARO) of inactivated Zika virus formulated with aluminium
hydroxide (Al(OH)3) at a final concentration of 0.7%. Three weeks
after immunization, blood was collected and immune sera were
prepared. All animal experiments were conducted in accordance with
Austrian law (BGB1 Nr. 501/1989) and approved by
"Magistratsabteilung 58".
Plaque Reduction Neutralization Test (PRNT)
Twelve well plates were used for PRNT. Each well was seeded with 1
mL medium containing 4.times.10.sup.5 Vero cells and incubated
35.degree. C. with 5% CO2 overnight. Pools of heat inactivated sera
from each dose group were tested in triplicate. The target viruses
(H/PF/2013 (SEQ ID NO: 13) or MR766 (SEQ ID NO: 11)) were diluted
to 100 pfu/165 .mu.L. Equal volumes of target virus and serum
dilution were incubated at 35.degree. C. with 5% CO.sub.2 for 1
hour. The cell culture medium was aspirated from the Vero cells and
330 .mu.L of the mixture target virus/serum dilution were added to
each well and the plates were rocked back and forth 5 times before
incubating for 2 hours at 35.degree. C. with 5% CO.sub.2. To each
well 1 mL of a 2% methylcellulose solution containing EMEM and
nutrients was added, the plates were then incubated for 5 days at
35.degree. C. with 5% CO.sub.2 before staining the cells for 1 hour
with crystal violet/5% formaldehyde and subsequently washed 3 times
with deionized water. The plates were air dried and the numbers of
plaques in each well were manually counted.
Results
Neutralization was observed with serum pools from mice immunized
with inactivated Zika virus vaccine (H/PF/2013) down to 37 ng
(dosing equivalent to the amount protein in IXIARO.RTM.) against
Zika viruses of both the Asian (H/PF/2013) and African (MR766)
lineages (FIGS. 27 and 28, respectively). Complete inhibition was
seen at the 1:20 serum dilution with an immunization dose down to
110 ng (dosing equivalent to the amount protein in IXIARO.RTM.).
The neutralization of both the Asian (H/PF/2013) and African
(MR766) lineages of the Zika virus was equivalent, which indicates
high cross-neutralization between different Zika virus strains of
the inactivated Zika virus vaccine (H/PF/2013).
Another neutralization assay was performed using the
microneutralization assay as described by Larocca, et al. (2016,
Nature doi:10.1038/nature18952). It was found that the inactivated
Zika virus of the current invention had an MN50
(microneutralization) titer of 90 at 1 .mu.g of inactivated
purified virus.
Further methods: The immunogenicity of inactivated Zika virus
preparations is assessed using a mouse model of Zika infection.
Groups of adult mice are immunized subcutaneously (s.c.) with 500,
50, or 5 ng of inactivated Zika virus with adjuvant (e.g. aluminium
hydroxide with or without IC31.RTM.), or without adjuvant. An
additional group of mice receive PBS as a negative control. Each
group is administered the indicated inoculum at t=0 and in some
cases also at three to four weeks later (t=3/4). Beginning
approximately three weeks after administration of the last
immunization, serum samples are obtained from each of the mice at
regular intervals. The serum samples are tested for the presence of
neutralizing antibodies using PRNT.
The in vivo protective efficacy of the inactivated Zika virus
preparations is also assessed using a mouse model of Zika
infection, i.e. IFN-alpha/beta receptor knock-out mice (A129) (see
e.g. Dowall et al., 4. Mar. 2016, http://dx.doi.org/10.1101/042358)
or blocking of the IFN-alpha/beta receptor by administration of
anti-IFN-alpha/beta receptor monoclonal antibodies to C57BL/6 or
BALB/c mice (see e.g. Pinto et al., 7. Dec. 2011, DOI:
10.1371/journal.ppat.1002407). For protection assays, groups of 10
three- to eight-weeks-old A129, C57BL/6 of BALB/c mice are
inoculated subcutaneously in the hindquarters with inactivated Zika
virus with adjuvant (aluminium hydroxide) or without adjuvant at
t=0. Age-matched controls are inoculated with PBS or non-specific
antigens in alum. Mice are optionally boosted with a second
administration of the indicated inoculation three to four weeks
later. The mice are then challenged subcutaneously at three to
eight weeks post immunization by inoculation with a deadly dose of
live Zika virus. One day prior to challenge of C57BL/6 and BALB/c
mice, they are passively administered (intraperitoneally)
anti-IFN-alpha/beta receptor monoclonal antibodies. Challenged mice
are monitored daily for morbidity and mortality for up to
twenty-one days. Another alternative is to challenge intracranially
adult vaccinated/non-vaccinated adult mice and observe
protection.
It is expected that the Zika virus produced by the process of the
invention will provide very similar functional read-outs in in
vitro, in vivo and finally human trials as the currently licensed
JEV vaccine in the EU and US and elsewhere, IXIARO.RTM.. The dosage
may alter but due to the very similar impurity profile and almost
identical manufacture, a very similar efficacy and safety result
will be expected as was determined for the currently licensed JEV
vaccine (licensed in the EU and US and elsewhere).
Discussion & Conclusion
The existing manufacturing platform for production of inactivated
JEV vaccine IXIARO.RTM. was used as a basis for a manufacturing
feasibility study of inactivated ZikaV vaccine candidate (Asian
strain H/PF/2013). The virus was produced on Vero cells cultivated
in roller bottles. The virus was purified by PS treatment followed
by an optimized sucrose gradient. Inactivation was done by formalin
treat (0.02%, 10 days at 22.degree. C.). For exploratory
immunization studies in mice, a DP formulated with Alum was
prepared with an estimated 5-fold higher virus particle content
compared to IXIARO.RTM., the commercial JEV Vaccine. The impurity
profile of the DS met all criteria as defined in the specification
for IXIARO.RTM., the commercial JEV vaccine. The neutralization of
both the Asian (H/PF/2013) and African (MR766) lineages of the Zika
virus was equivalent, which indicates high cross-neutralization
between different Zika virus strains of the inactivated Zika virus
vaccine (H/PF/2013).
The in vivo data regarding immunogenicity of the inactivated Zika
virus vaccine of the current invention indicates that the virus is
surprisingly potently immunogenic and also highly cross-protective
(very similar immunogenicity in African and Asian strains). Data
indicate that immunogenicity was higher than the recently reported
inactivated Zika virus vaccine candidate (Larocca, et. al, 2016,
supra.). Inactivated viruses are among the safest vaccines and
especially preferred for deliver to populations where safety is
especially concerning, such as pregnant women, children and
immunocompromised individuals, which makes the herein disclosed
inactivated Zika virus particularly suitable. Obtaining a high
titer of inactivated virus is a challenge in the field. The herein
disclosed process for purifying inactivated Zika virus results in
not only a high yield, but also a very pure drug substance.
Example 3: Development of a Purification Process for Yellow Fever
Virus Vaccine Produced in Vero Cells
A downstream process was developed for the purification of
infectious yellow fever virus particles whereby host cell nucleic
acids, non-infectious virus particles and aggregates are removed by
the addition of protamine sulphate as described in Examples 1 and
2. The unexpected and novel purification properties of protamine
sulphate (PS) were evaluated in purification processes for yellow
fever (YF) as follows:
As before the treatment of YF-harvest with PS significantly reduces
the amount of aggregates as seen with SEC for two vaccine strains
currently in development (FIG. 29).
Further more detailed aspects of the invention:
A1. A process of purification of infectious alphavirus particles,
preferably Chikungunya virus particles, comprising the steps of: a)
providing a crude harvest (a) comprising virus particles and
impurities, wherein the impurities are generated from growing said
virus particles on a cell substrate; b) reducing impurities from
the crude harvest (a) by precipitation with an agent comprising a
protamine salt, preferably a protamine sulphate, even more
preferably a recombinant protamine sulphate, to obtain a virus
preparation (b); c) contacting the virus preparation (b) with (i) a
solid-phase matrix packed in a column comprising a ligand-activated
core and an inactive shell comprising pores, wherein the molecular
weight cut off of the pores excludes the virus particles from
entering the ligand-activated core, and wherein a molecule smaller
than the molecular weight cut-off of the pores can enter the
ligand-activated core and collecting the virus particles to obtain
a virus preparation (d), or (ii) a solid-phase matrix comprising a
ligand-activated core and an inactive shell comprising pores,
wherein the molecular weight cut off of the pores excludes the
virus particles from entering the ligand-activated core, and
wherein a molecule smaller than the molecular weight cut-off of the
pores can enter the ligand-activated core and separating the
solid-phase matrix from the virus particles by filtration to
produce a virus preparation (c); and d) further purifying the virus
preparation (c) by sucrose density gradient centrifugation to
obtain a virus preparation (d) comprising the infectious virus
particles, wherein the residual host cell DNA of the virus
preparation (d) is less than 100 ng/mL and the residual host cell
protein of the final virus preparation (d) is less than 1
.mu.g/mL.
A2. The process of A1, wherein the residual host cell DNA of the
virus preparation (d) is less than 10 ng/mL and the residual host
cell protein of the final virus preparation (d) is less than 100
ng/mL.
A3. The process of A1 or A2, wherein the crude harvest (a)
comprising virus particles and impurities is subjected to one or
more pre-purification step(s) prior to step (b).
A4. The process of any one of A1 to A3, wherein the one or more
pre-purification step(s) comprises a) filtration using a filter
having a pore size equal to or less than 0.2 .mu.m; and/or b)
digestion of host cell genomic DNA by enzymatic treatment; and/or
c) ultra/diafiltration using a hollow fiber membrane having a pore
size equal to or greater than 300 kDa, preferably equal to or
greater than 100 kDa.
A5. The process of any one of A1 to A4, wherein the concentration
of protamine sulphate is 1 to 2 mg/ml, more preferably 1.2 to 1.8
mg/ml, more preferably 1.4 to 1.6 mg/ml, most preferably 1.6
mg/ml.
A6. The process of any one of A1 to A5, wherein the molecule
entering the core of the solid-phase matrix has a molecular weight
less than 700 kDa.
A7. The process of any one of A1 to A6, wherein the ligand of the
ligand-activated core of the solid-phase matrix is capable of
binding the molecule that enters the ligand-activated core via
cationic-, anionic-, hydrophobic- or mixed interactions.
A8. The process of any one of A1 to A7, wherein the ligand of the
ligand-activated core of the solid-phase matrix is octylamine.
A9. The process of any one of A1 to A8, wherein the solid-phase
matrix is used as a slurry and at a final concentration between
0.5% (v/v) and 10% (v/v), preferably 0.6%, 0.7%, 0.9%, 1%, 2%, 3%,
4%, 5%, 6%, 7%, 8%, 9%, 10%, most preferably 1%.
A10. The process of any one of A1 to A9, wherein the solid-phase
matrix is incubated with the protamine-treated virus preparation
(b) at refrigerated temperatures (2.degree. C. to 8.degree. C.)
with a stirring for at least 10 minutes, preferably 15 minutes, 30
minutes or 1 hour, most preferably 15 minutes.
A11. The process of any one of A1 to A10, wherein the enrichment of
infectious virus particles in the final virus preparation relative
to total virus products in the crude harvest (a) is in the range
from at least 50% to 95%, preferably at least 80%.
A12. The process of any one A1 to A11, wherein the filtration of
step (c) of preferred aspect 1 is performed using a filter having a
pore size equal to or less than 1 .mu.m, preferably 0.2 .mu.m.
A13. The process of any one of A1 to A12, wherein the residual
impurity of the final virus preparation is less than 10%.
A14. The process of any one of A1 to A13, wherein the virus is
propagated in a cell line selected from the group consisting of an
EB66 cell line, a Vero cell line, a Vero-.alpha.His cell line, a
HeLa cell line, a HeLa-S3 cell line, a 293 cell line, a PC12 cell
line, a CHO cell line, a 3T3 cell line, a PerC6 cell line, a MDSK
cell line, a chicken embryonic fibroblast cell line, a duck cell
line, and a diploid avian cell line.
A15. The process of A14, wherein said cell line is a Vero cell
line.
A16. The process of any one of A1 to A15, wherein the Chikungunya
virus is a live virus, an attenuated live virus, a chimeric virus,
a modified live virus, or a recombinant live virus.
A17. The process of any one of A1 to A16, wherein the Chikungunya
virus is the .DELTA.5nsP3 attenuated mutant or an immunogenic
variant thereof.
A18. The process of any one of A1 to A17, wherein said process
resulting in final virus preparation (d) is followed by an
inactivation step, wherein the virus is inactivated preferably by
formaldehyde.
A19. Use of the process according to any one of A1 to A18 for
manufacturing a composition for immunization against a Chikungunya
virus infection.
A20. The use according to A19, wherein the composition for
immunization against a Chikungunya virus infection is a
vaccine.
A21. A composition comprising the virus particles obtainable by the
process of any one of A1 to A18 for treating and/or preventing a
Chikungunya virus infection.
N1. A process of purification of infectious alphavirus particles,
preferably Chikungunya virus particles, comprising the steps of:
(a) providing a crude harvest (a) comprising virus particles and
impurities, wherein the impurities are generated from growing said
virus particles on a cell substrate; (b) reducing impurities from
the crude harvest (a) by precipitation with an agent comprising a
protamine salt, preferably a protamine sulphate, even more
preferably a recombinant protamine sulphate, to obtain a virus
preparation (b); (c) contacting the virus preparation (b) with (i)
a solid-phase matrix packed in a column comprising a
ligand-activated core and an inactive shell comprising pores,
wherein the molecular weight cut off of the pores excludes the
virus particles from entering the ligand-activated core, and
wherein a molecule smaller than the molecular weight cut-off of the
pores can enter the ligand-activated core and collecting the virus
particles to obtain a virus preparation (d), or (ii) a solid-phase
matrix comprising a ligand-activated core and an inactive shell
comprising pores, wherein the molecular weight cut off of the pores
excludes the virus particles from entering the ligand-activated
core, and wherein a molecule smaller than the molecular weight
cut-off of the pores can enter the ligand-activated core and
separating the solid-phase matrix from the virus particles by
filtration to produce a virus preparation (c); and (d) further
purifying the virus preparation (c) by sucrose density gradient
centrifugation to obtain a virus preparation (d) comprising the
infectious virus particles, wherein the residual host cell DNA of
the virus preparation (d) is less than 100 ng/mL and the residual
host cell protein of the final virus preparation (d) is less than 1
.mu.g/mL.
N2. The process of N1, wherein the residual host cell DNA of the
virus preparation (d) is less than 10 ng/mL and the residual host
cell protein of the final virus preparation (d) is less than 100
ng/mL.
N3. The process of N1 or 2, wherein the crude harvest (a)
comprising virus particles and impurities is subjected to one or
more pre-purification step(s) prior to step (b).
N4. The process of any one of N1 to 3, wherein the one or more
pre-purification step(s) comprises
(a) filtration using a filter having a pore size equal to or less
than 0.2 .mu.m; and/or
(b) digestion of host cell genomic DNA by enzymatic treatment;
and/or
(c) ultra/diafiltration using a hollow fiber membrane having a pore
size equal to or greater than 300 kDa, preferably equal to or
greater than 100 kDa.
N5. The process of any one of N1 to 4, wherein the concentration of
protamine sulphate is 1 to 2 mg/ml, more preferably 1.2 to 1.8
mg/ml, more preferably 1.4 to 1.6 mg/ml, most preferably 1.6
mg/ml.
N6. The process of any one of N1 to 5, wherein the molecule
entering the core of the solid-phase matrix has a molecular weight
less than 700 kDa.
N7. The process of any one of N1 to 6, wherein the ligand of the
ligand-activated core of the solid-phase matrix is capable of
binding the molecule that enters the ligand-activated core via
cationic-, anionic-, hydrophobic- or mixed interactions.
N8. The process of any one of N1 to 7, wherein the ligand of the
ligand-activated core of the solid-phase matrix is octylamine.
N9. The process of any one of N1 to 8, wherein the solid-phase
matrix is used as a slurry and at a final concentration between
0.5% (v/v) and 10% (v/v), preferably 0.6%, 0.7%, 0.9%, 1%, 2%, 3%,
4%, 5%, 6%, 7%, 8%, 9%, 10%, most preferably 1%.
N10. The process of any one of N1 to 9, wherein the solid-phase
matrix is incubated with the protamine-treated virus preparation
(b) at refrigerated temperatures (2.degree. C. to 8.degree. C.)
with a stirring for at least 10 minutes, preferably 15 minutes, 30
minutes or 1 hour, most preferably 15 minutes.
N11. The process of any one of N1 to 10, wherein the enrichment of
infectious virus particles in the final virus preparation relative
to total virus products in the crude harvest (a) is in the range
from at least 50% to 95%, preferably at least 80%.
N12. The process of any one of N1 to 11, wherein the filtration of
step (c) of N1 is performed using a filter having a pore size equal
to or less than 1 .mu.m, preferably 0.2 .mu.m.
N13. The process of any one of N1 to 12, wherein the residual
impurity of the final virus preparation is less than 10%.
N14. The process of any one of N1 to 13, wherein the virus is
propagated in a cell line selected from the group consisting of an
EB66 cell line, a Vero cell line, a Vero-.alpha.His cell line, a
HeLa cell line, a HeLa-S3 cell line, a 293 cell line, a PC12 cell
line, a CHO cell line, a 3T3 cell line, a PerC6 cell line, a MDSK
cell line, a chicken embryonic fibroblast cell line, a duck cell
line, and a diploid avian cell line.
N15. The process of N14, wherein said cell line is a Vero cell
line.
N16. The process of any one of N1 to 15, wherein the Chikungunya
virus is a live virus, an attenuated live virus, a chimeric virus,
a modified live virus, or a recombinant live virus.
N17. The process of any one of N1 to 16, wherein the Chikungunya
virus is the .DELTA.5nsP3 attenuated mutant or an immunogenic
variant thereof.
N18. The process of any one of N1 to 17, wherein said process
resulting in final virus preparation (d) is followed by an
inactivation step, wherein the virus is inactivated preferably by
formaldehyde.
N19. Use of the process according to any one of N1 to 18 for
manufacturing a composition for immunization against a Chikungunya
virus infection.
N20. The use according to N19, wherein the composition for
immunization against a Chikungunya virus infection is a
vaccine.
N21. A composition comprising the virus particles obtainable by the
process of any one of N1 to 18 for treating and/or preventing a
Chikungunya virus infection.
P1. A Zika virus vaccine comprising an optimally inactivated Zika
virus particle, wherein the Zika virus particle is able to
seroconvert a subject that is administered the Zika virus vaccine
with at least a 70% probability.
P2. The Zika virus vaccine of P1, wherein the Zika virus particle
is able to seroconvert the subject that is administered the Zika
virus vaccine with at least a 80%, 85%, 90%, or 95% probability,
preferably a 80% probability.
P3. The vaccine of P1 or 2, wherein the Zika virus particle has a
RNA genome corresponding to the DNA sequence provided by any one of
the nucleic acid sequences of SEQ ID NOs: 2-13, or a variant
nucleic acid sequence that is at least 88% identical to any one of
SEQ ID NOs: 2-13 and able to pack a virulent Zika virus.
P4. The vaccine of any one of P1-3, wherein the Zika virus particle
has an E protein selected from the amino acid sequences provided by
any one of SEQ ID NOs: 14-69, or a variant amino acid sequence that
is at least 95% identical to any one of SEQ ID NOs: 14-69 and able
to pack a virulent Zika virus.
P5. The vaccine of any one of P1-4, wherein the Zika virus is
inactivated by chemical inactivation, thermal inactivation, pH
inactivation, or UV inactivation.
P6. The vaccine of P5, wherein the chemical inactivation comprises
contacting the Zika virus with a chemical inactivation agent for
longer than is required to completely inactivate the Zika virus as
measured by plaque assay.
P7. The vaccine of P6, wherein the chemical inactivation comprises
contacting the Zika virus with formaldehyde.
P8. The vaccine of P7, wherein the formaldehyde inactivation
comprises contacting the Zika virus with formaldehyde for between
2-10 days.
P9. The vaccine of any one of P5-8, wherein the chemical activation
is performed at about +4.degree. C. or about +22.degree. C.
P10. The vaccine of any one of P1-9, further comprising an
adjuvant.
P11. The vaccine of P10, wherein the adjuvant is an aluminum salt
adjuvant.
P12. The vaccine of P11, wherein the aluminum salt adjuvant is
aluminium hydroxide or aluminium phosphate salt.
P13. The vaccine of any one of P10-12, wherein the vaccine
comprises or further comprises an adjuvant comprising a peptide and
a deoxyinosine-containing immunostimulatory oligodeoxynucleic acid
molecule (I-ODN).
P14. The vaccine of P13, wherein the peptide comprises the sequence
KLKL5KLK (SEQ ID NO: 71) and the I-ODN comprises oligo-d(IC)13 (SEQ
ID NO: 70).
P15. The vaccine of any one of P1-14, further comprising one or
more pharmaceutically acceptable excipient.
Q1. A process of purification of infectious virus particles,
comprising the steps of:
(a) providing a crude harvest (a) comprising virus particles and
impurities, wherein the impurities are generated from growing said
virus particles on a cell substrate;
(b) reducing impurities from the crude harvest (a) by precipitation
with an agent comprising protamine, preferably a protamine salt,
more preferably a protamine sulphate, even more preferably a
recombinant protamine sulphate, to obtain a virus preparation
(b);
(c) further purifying the virus preparation (b) by an optimized
sucrose density gradient centrifugation, wherein the optimized
sucrose gradient is provided such that the protamine can be
completely or almost completely separated from the virus fraction;
and wherein the protamine concentration is reduced by this step to
the extent that the protamine concentration in the final drug
substance is below 1 .mu.g/ml, preferably below 0.5 .mu.g/mL, more
preferably below 0.1 .mu.g/mL, most preferably below 0.05
.mu.g/mL.
Q2. The process of Q2, wherein the virus particles are selected
from the group consisting of flaviviruses, e.g. yellow fever virus
or Zika virus and alphaviruses, e.g. Chikungunya.
Q3. The process of Q1 or Q2, additionally comprising the step
of:
(d) a solid-phase matrix packed in a column comprising a
ligand-activated core and an inactive shell comprising pores,
wherein the molecular weight cut off of the pores excludes the
virus particles from entering the ligand-activated core, and
wherein a molecule smaller than the molecular weight cutoff of the
pores can enter the ligand-activated core and collecting the virus
particles.
Q4. The process of any of Q1 to 3, wherein the residual host cell
DNA of the virus preparation (c) is less than 10 ng/mL and the
residual host cell protein of the final virus preparation (c) is
less than 100 ng/mL.
Q5. The process of any of Q1 to 4, wherein the crude harvest (a)
comprising virus particles and impurities is subjected to one or
more pre-purification step(s) prior to step (b).
Q6. The process of Q5, wherein the one or more pre-purification
step(s) comprises
(a) filtration using a filter having a pore size equal to or less
than 0.2 .mu.m; and/or
(b) digestion of host cell genomic DNA by enzymatic treatment;
and/or
(c) ultra/diafiltration using a hollow fiber membrane having a pore
size equal to or greater than 300 kDa, preferably equal to or
greater than 100 kDa.
Q7. The process of any one of Q1 to 6, wherein the concentration of
protamine sulphate is 0.5 to 3 mg/ml, more preferably 1 to 2 mg/ml,
more preferably 1.2 to 1.8 mg/ml, more preferably 1.4 to 1.6 mg/ml,
most preferably 1.6 mg/ml or 2 mg/ml.
Q8. The process of any one of Q1 to 7, wherein the enrichment of
infectious virus particles in the virus preparation (c) or any
final virus preparation relative to total virus products in the
crude harvest (a) is in the range from at least 50% to 95%,
preferably at least 80%.
Q9. The process of any one of Q5 to 8, wherein the one or more
pre-purification step(s) prior to step (b) of any of Q5 to 8 is
performed using a filter having a pore size equal to or less than 1
.mu.m, preferably 0.2 .mu.m.
Q10. The process of any one of Q1 to 9, wherein the residual
impurity of the virus preparation
(c) is less than 10%.
Q11. The process of any one of Q1 to 10, wherein the virus is
propagated in a cell line selected from the group consisting of an
EB66 cell line, a Vero cell line, a Vero-.alpha.His cell line, a
HeLa cell line, a HeLa-S3 cell line, a 293 cell line, a PC12 cell
line, a CHO cell line, a 3T3 cell line, a PerC6 cell line, a MDSK
cell line, a chicken embryonic fibroblast cell line, a duck cell
line, and a diploid avian cell line.
Q12. The process of Q11, wherein said cell line is a Vero cell
line.
Q13. The process of any one of Q1 to 12, wherein the infectious
virus particles is an infectious Zika virus particle that is a live
virus, an attenuated live virus, a chimeric virus, a modified live
virus, or a recombinant live virus.
Q14. The process of any one of Q1 to 13, wherein the Zika virus is
a Zika virus strain of the Asian lineage or an immunogenic variant
thereof.
Q15. The process of any one of Q1 to 14, wherein said process
resulting in final virus preparation (c) or (d) is followed by an
inactivation step, wherein the virus is inactivated preferably by
formaldehyde.
Q16. Use of the process according to any one of Q1 to 15 for
manufacturing a composition for immunization against a virus
infection.
Q17. The use according to Q16, wherein the composition for
immunization against a virus infection is an infection caused by a
group of viruses consisting of yellow fever virus, Chikungunya
virus and Zika virus.
Q18. A composition comprising the virus particles obtainable or
obtained by the process of any one of Q1 to 17 for treating and/or
preventing an infection, such as e.g. a Zika virus infection.
Q19. A Zika virus vaccine comprising an inactivated Zika virus
particle grown on vero cells, wherein the Zika virus particle is
able to seroconvert a subject that is administered the Zika virus
vaccine with at least a 70% probability and comprises minor amounts
of protamine sulphate, preferably below the detection limit.
Q20. The Zika virus vaccine of Q19, wherein the Zika virus particle
is able to seroconvert the subject that is administered the Zika
virus vaccine with at least a 80%, 85%, 90%, or 95% probability,
preferably a 80% probability.
Q21. The vaccine of Q19 or 20, wherein the Zika virus particle has
a RNA genome corresponding to the DNA sequence provided by any one
of the nucleic acid sequences of SEQ ID NOs: 2-13, or a variant
nucleic acid sequence that is at least 88% identical to any one of
SEQ ID NOs: 2-13 and able to pack a virulent Zika virus.
Q22. The vaccine of any one of Q19, 20 and 21, wherein the Zika
virus particle has an E protein selected from the amino acid
sequences provided by any one of SEQ ID NOs: 14-69, or a variant
amino acid sequence that is at least 95% identical to any one of
SEQ ID NOs: 14-69 and able to pack a virulent Zika virus.
Q23. The vaccine of any one of Q19, 20 to 22, wherein the Zika
virus obtained by culturing on Vero cells is purified by protamine
sulfate precipitation and sucrose gradient centrifugation.
Q24. The vaccine of Q23, wherein the sucrose gradient
centrifugation is an optimized sucrose gradient centrifugation.
Q25. The vaccine of Q24, wherein the optimized sucrose gradient
centrifugation comprises a virus comprising fraction in a 10% (w/w)
sucrose solution and three layers of sucrose with different
densities, i.e. a first sucrose solution with 15% (w/w) sucrose
solution, a second sucrose solution with 35% (w/w) sucrose
solution, and a third sucrose solution with a 50% (w/w) sucrose
solution.
Q26. The vaccine of any one of Q19, 20 to 25, wherein the Zika
virus is inactivated by chemical inactivation, thermal
inactivation, pH inactivation, or UV inactivation.
Q27. The vaccine of Q26, wherein the chemical inactivation
comprises contacting the Zika virus with a chemical inactivation
agent for longer than is required to completely inactivate the Zika
virus as measured by plaque assay.
Q28. The vaccine of Q27, wherein the chemical inactivation
comprises contacting the Zika virus with formaldehyde.
Q29. The vaccine of Q28, wherein the formaldehyde inactivation
comprises contacting the Zika virus with formaldehyde for between
2-10 days.
Q30. The vaccine of any one of Q27-29, wherein the chemical
activation is performed at about +4.degree. C. or about +22.degree.
C.
Q31. The vaccine of any one of Q19 to 30, further comprising an
adjuvant.
Q32. The vaccine of Q31, wherein the adjuvant is an aluminum salt
adjuvant.
Q33. The vaccine of Q32, wherein the aluminum salt adjuvant is
aluminium hydroxide or aluminium phosphate salt.
Q34. The vaccine of Q32, wherein the aluminum salt adjuvant is
aluminium hydroxide with less than 1.25 ppb Cu based on the final
pharmaceutical composition comprising the Zika virus, preferably
the inactivated Zika virus.
Q35. The vaccine of any one of Q19 to 34, further comprising one or
more pharmaceutically acceptable excipient.
R1. Use of protamine, preferably a protamine salt, to separate
infectious and non-infectious virus particles, host cell proteins
and/or undefined low molecular weight materials.
R2. A process of purification of infectious virus particles,
comprising the steps of:
(a) providing a crude harvest (a) comprising virus particles and
impurities, wherein the impurities are generated from growing said
virus particles on a cell substrate;
(b) reducing impurities from the crude harvest (a) by precipitation
with an agent comprising protamine, preferably a protamine salt,
more preferably a protamine sulphate, even more preferably a
recombinant protamine sulphate, to obtain a virus preparation (b),
wherein the enrichment of infectious virus particles in the virus
preparation (b) relative to total virus products in the crude
harvest (a) is in the range from at least 50% to 95%, preferably at
least 80%.
R3. The use of R1 or the process of R2, wherein the virus particles
are selected from the group consisting of flaviviruses, e.g. yellow
fever virus or Zika virus and alphaviruses, e.g.
Chikungunya.
R4. A process of purification of infectious virus particles,
comprising the steps of:
(a) providing a crude harvest (a) comprising virus particles and
impurities, wherein the impurities are generated from growing said
virus particles on a cell substrate;
(b) reducing impurities from the crude harvest (a) by precipitation
with an agent comprising protamine, preferably a protamine salt,
more preferably a protamine sulphate, even more preferably a
recombinant protamine sulphate, to obtain a virus preparation
(b);
(c) further purifying the virus preparation (b) by one or more size
exclusion methods such as (i) a sucrose density gradient
centrifugation, (ii) a solid-phase matrix packed in a column
comprising a ligand-activated core and an inactive shell comprising
pores, wherein the molecular weight cut off of the pores excludes
the virus particles from entering the ligand-activated core, and
wherein a molecule smaller than the molecular weight cutoff of the
pores can enter the ligand-activated core and collecting the virus
particles, and/or (iii) size exclusion chromatography to obtain a
virus preparation (c) comprising the infectious virus particles,
wherein the residual host cell DNA of the virus preparation (c) is
less than 100 ng/mL and the residual host cell protein and the
residual aggregates of infectious virus particles of the final
virus preparation (c) is less than 1 .mu.g/mL.
R5. The process of R4, wherein the residual host cell DNA of the
virus preparation (c) is less than 10 ng/mL and the residual host
cell protein of the final virus preparation (c) is less than 100
ng/mL.
R6. The process of any of R2 to 5, wherein the crude harvest (a)
comprising virus particles and impurities is subjected to one or
more pre-purification step(s) prior to step (b).
R7. The process of R6, wherein the one or more pre-purification
step(s) comprises
(a) filtration using a filter having a pore size equal to or less
than 0.2 .mu.m; and/or
(b) digestion of host cell genomic DNA by enzymatic treatment;
and/or
(c) ultra/diafiltration using a hollow fiber membrane having a pore
size equal to or greater than 300 kDa, preferably equal to or
greater than 100 kDa.
R8. The process of any one of R2 to 7, wherein the concentration of
protamine sulphate is 0.5 to 3 mg/ml, more preferably 1 to 2 mg/ml,
more preferably 1.2 to 1.8 mg/ml, more preferably 1.4 to 1.6 mg/ml,
most preferably 1.6 mg/ml.
R9. The process of any one of R2 to 8, wherein the enrichment of
infectious virus particles in the virus preparation (c) or any
final virus preparation relative to total virus products in the
crude harvest (a) is in the range from at least 50% to 95%,
preferably at least 80%.
R10. The process of any one of R6 to 9, wherein the one or more
pre-purification step(s) prior to step (b) of any of R6 to 9 is
performed using a filter having a pore size equal to or less than 1
.mu.m, preferably 0.2 .mu.m.
R11. The process of any one of R2 to 10, wherein the residual
impurity of the virus preparation (c) is less than 10%.
R12. The process of any one of R2 to 11, wherein the virus is
propagated in a cell line selected from the group consisting of an
EB66 cell line, a Vero cell line, a Vero-.alpha.His cell line, a
HeLa cell line, a HeLa-S3 cell line, a 293 cell line, a PC12 cell
line, a CHO cell line, a 3T3 cell line, a PerC6 cell line, a MDSK
cell line, a chicken embryonic fibroblast cell line, a duck cell
line, and a diploid avian cell line.
R13. The process of R12, wherein said cell line is a Vero cell
line.
R14. The process of any one of R2 to 13, wherein the Zika virus is
a live virus, an attenuated live virus, a chimeric virus, a
modified live virus, or a recombinant live virus.
R15. The process of any one of R2 to 14, wherein the Zika virus is
a Zika virus strain of the Asian lineage or an immunogenic variant
thereof.
R16. The process of any one of R2 to 15, wherein said process
resulting in final virus preparation (c) is followed by an
inactivation step, wherein the virus is inactivated preferably by
formaldehyde.
R17. Use of the process according to any one of R1 to 16 for
manufacturing a composition for immunization against a virus
infection.
R18. The use according to R17, wherein the composition for
immunization against a virus infection is an infection caused by a
group of viruses consisting of yellow fever virus, Chikungunya
virus and Zika virus.
R19. A composition comprising the virus particles obtainable or
obtained by the process of any one of R2 to 16 for treating and/or
preventing an infection.
SEQUENCE LISTINGS
1
123132PRTArtificial Sequencesynthetic peptide 1Pro Arg Arg Arg Arg
Ser Ser Ser Arg Pro Val Arg Arg Arg Arg Arg1 5 10 15Pro Arg Val Ser
Arg Arg Arg Arg Arg Arg Gly Gly Arg Arg Arg Arg 20 25
30210676DNAZika virus 2gttgttactg ttgctgactc agactgcgac agttcgagtt
tgaagcgaaa gctagcaaca 60gtatcaacag gttttatttg gatttggaaa cgagagtttc
tggtcatgaa aaacccaaaa 120aagaaatccg gaggattccg gattgtcaat
atgctaaaac gcggagtagc ccgtgtgagc 180ccctttgggg gcttgaagag
gctgccagcc ggacttctgc tgggtcatgg gcccatcagg 240atggtcttgg
caattctagc ctttttgaga ttcacggcaa tcaagccatc actgggtctc
300atcaatagat ggggttcagt ggggaaaaaa gaggctatgg aaataataaa
gaagttcaag 360aaagatctgg ctgccatgct gagaataatc aatgctagga
aggagaagaa gagacggggc 420gcagatacta gtgtcggaat tgttggcctc
ctgctgacca cagctatggc agcggaggtc 480actagacgtg ggagtgcata
ctatatgtac ttggacagaa acgatgctgg ggaggccata 540tcttttccaa
ccacattggg gatgaataag tgttatatac agatcatgga tcttggacac
600atgtgtgatg ccaccatgag ctatgaatgc cctatgctgg atgagggggt
ggaaccagat 660gacgtcgatt gttggtgcaa cacgacgtca acttgggttg
tgtacggaac ctgccatcac 720aaaaaaggtg aagcacggag atctagaaga
gctgtgacgc tcccctccca ttccactagg 780aagctgcaaa cgcggtcgca
aacctggttg gaatcaagag aatacacaaa gcacttgatt 840agagtcgaaa
attggatatt caggaaccct ggcttcgcgt tagcagcagc tgccatcgct
900tggcttttgg gaagctcaac gagccaaaaa gtcatatact tggtcatgat
actgctgatt 960gccccggcat acagcatcag gtgcatagga gtcagcaata
gggactttgt ggaaggtatg 1020tcaggtggga cttgggttga tattgtcttg
gaacatggag gttgtgtcac cgtaatggca 1080caggacaaac cgactgtcga
catagagctg gttacaacaa cagtcagcaa catggcggag 1140gtaagatcct
actgctatga ggcatcaata tcagacatgg cttcggacag ccgctgccca
1200acacaaggtg aagcctacct tgacaagcaa tcagacactc aatatgtctg
caaaagaacg 1260ttagtggaca gaggctgggg aaatggatgt ggactttttg
gcaaagggag tctggtgaca 1320tgcgctaagt ttgcatgctc caagaaaatg
accgggaaga gcatccagcc agagaatctg 1380gagtaccgga taatgctgtc
agttcatggc tcccagcaca gtgggatgat cgttaatgac 1440acaggacatg
aaactgatga gaatagagcg aaggttgaga taacgcccaa ttcaccaaga
1500gccgaagcca ccctgggggg ttttggaagc ctaggacttg attgtgaacc
gaggacaggc 1560cttgactttt cagatttgta ttacttgact atgaataaca
agcactggtt ggttcacaag 1620gagtggttcc acgacattcc attaccttgg
cacgctgggg cagacaccgg aactccacac 1680tggaacaaca aagaagcact
ggtagagttc aaggacgcac atgccaaaag gcaaactgtc 1740gtggttctag
ggagtcaaga aggagcagtt cacacggccc ttgctggagc tctggaggct
1800gagatggatg gtgcaaaggg aaggctgtcc tctggccact tgaaatgtcg
cctgaaaatg 1860gataaactta gattgaaggg cgtgtcatac tccttgtgta
ccgcagcgtt cacattcacc 1920aagatcccgg ctgaaacact gcacgggaca
gtcacagtgg aggtacagta cgcagggaca 1980gatggacctt gcaaggttcc
agctcagatg gcggtggaca tgcaaactct gaccccagtt 2040gggaggttga
taaccgctaa ccccgtaatc actgaaagca ctgagaactc taagatgatg
2100ctggaacttg atccaccatt tggggactct tacattgtca taggagtcgg
ggagaagaag 2160atcacccacc actggcacag gagtggcagc accattggaa
aagcatttga agccactgtg 2220agaggtgcca agagaatggc agtcttggga
gacacagcct gggactttgg atcagttgga 2280ggcgctctca actcattggg
caagggcatc catcaaattt ttggagcagc tttcaaatca 2340ttgtttggag
gaatgtcctg gttctcacaa attctcattg gaacgttgct gatgtggttg
2400ggtctgaaca caaagaatgg atctatttcc cttatgtgct tggccttagg
gggagtgttg 2460atcttcttat ccacagccgt ctctgctgat gtggggtgct
cggtggactt ctcaaagaag 2520gagacgagat gcggtacagg ggtgttcgtc
tataacgacg ttgaagcctg gagggacagg 2580tacaagtacc atcctgactc
cccccgtaga ttggcagcag cagtcaagca agcctgggaa 2640gatggtatct
gcgggatctc ctctgtttca agaatggaaa acatcatgtg gagatcagta
2700gaaggggagc tcaacgcaat cctggaagag aatggagttc aactgacggt
cgttgtggga 2760tctgtaaaaa accccatgtg gagaggtcca cagagattgc
ccgtgcctgt gaacgagctg 2820ccccacggct ggaaggcttg ggggaaatcg
cacttcgtca gagcagcaaa gacaaataac 2880agctttgtcg tggatggtga
cacactgaag gaatgcccac tcaaacatag agcatggaac 2940agctttcttg
tggaggatca tgggttcggg gtatttcaca ctagtgtctg gctcaaggtt
3000agagaagatt attcattaga gtgtgatcca gccgttattg gaacagctgt
taagggaaag 3060gaggctgtac acagtgatct aggctactgg attgagagtg
agaagaatga cacatggagg 3120ctgaagaggg cccatctgat cgagatgaaa
acatgtgaat ggccaaagtc ccacacattg 3180tggacagatg gaatagaaga
gagtgatctg atcataccca agtctttagc tgggccactc 3240agccatcaca
ataccagaga gggctacagg acccaaatga aagggccatg gcacagtgaa
3300gagcttgaaa ttcggtttga ggaatgccca ggcactaagg tccacgtgga
ggaaacatgt 3360ggaacaagag gaccatctct gagatcaacc actgcaagcg
gaagggtgat cgaggaatgg 3420tgctgcaggg agtgcacaat gcccccactg
tcgttccggg ctaaagatgg ctgttggtat 3480ggaatggaga taaggcccag
gaaagaacca gaaagcaact tagtaaggtc aatggtgact 3540gcaggatcaa
ctgatcacat ggatcacttc tcccttggag tgcttgtgat tctgctcatg
3600gtgcaggaag ggctgaagaa gagaatgacc acaaagatca tcataagcac
atcaatggca 3660gtgctggtag ctatgatcct gggaggattt tcaatgagtg
acctggctaa gcttgcaatt 3720ttgatgggtg ccaccttcgc ggaaatgaac
actggaggag atgtagctca tctggcgctg 3780atagcggcat tcaaagtcag
accagcgttg ctggtatctt tcatcttcag agctaattgg 3840acaccccgtg
aaagcatgct gctggccttg gcctcgtgtc ttttgcaaac tgcgatctcc
3900gccttggaag gcgacctgat ggttctcatc aatggttttg ctttggcctg
gttggcaata 3960cgagcgatgg ttgttccacg cactgataac atcaccttgg
caatcctggc tgctctgaca 4020ccactggccc ggggcacact gcttgtggcg
tggagagcag gccttgctac ttgcgggggg 4080tttatgctcc tctctctgaa
gggaaaaggc agtgtgaaga agaacttacc atttgtcatg 4140gccctgggac
taaccgctgt gaggctggtc gaccccatca acgtggtggg gctgctgttg
4200ctcacaagga gtgggaagcg gagctggccc cctagcgaag tactcacagc
tgttggcctg 4260atatgcgcat tggctggagg gttcgccaag gcagatatag
agatggctgg gcccatggcc 4320gcggtcggtc tgctaattgt cagttacgtg
gtctcaggaa agagtgtgga catgtacatt 4380gaaagagcag gtgacatcac
atgggaaaaa gatgcggaag tcactggaaa cagtccccgg 4440ctcgatgtgg
cgctagatga gagtggtgat ttctccctgg tggaggatga cggtcccccc
4500atgagagaga tcatactcaa ggtggtcctg atgaccatct gtggcatgaa
cccaatagcc 4560ataccctttg cagctggagc gtggtacgta tacgtgaaga
ctggaaaaag gagtggtgct 4620ctatgggatg tgcctgctcc caaggaagta
aaaaaggggg agaccacaga tggagtgtac 4680agagtaatga ctcgtagact
gctaggttca acacaagttg gagtgggagt tatgcaagag 4740ggggtctttc
acactatgtg gcacgtcaca aaaggatccg cgctgagaag cggtgaaggg
4800agacttgatc catactgggg agatgtcaag caggatctgg tgtcatactg
tggtccatgg 4860aagctagatg ccgcctggga cgggcacagc gaggtgcagc
tcttggccgt gccccccgga 4920gagagagcga ggaacatcca gactctgccc
ggaatattta agacaaagga tggggacatt 4980ggagcggttg cgctggatta
cccagcagga acttcaggat ctccaatcct agacaagtgt 5040gggagagtga
taggacttta tggcaatggg gtcgtgatca aaaatgggag ttatgttagt
5100gccatcaccc aagggaggag ggaggaagag actcctgttg agtgcttcga
gccttcgatg 5160ctgaagaaga agcagctaac tgtcttagac ttgcatcctg
gagctgggaa aaccaggaga 5220gttcttcctg aaatagtccg tgaagccata
aaaacaagac tccgtactgt gatcttagct 5280ccaaccaggg ttgtcgctgc
tgaaatggag gaagccctta gagggcttcc agtgcgttat 5340atgacaacag
cagtcaatgt cacccactct ggaacagaaa tcgtcgactt aatgtgccat
5400gccaccttca cttcacgtct actacagcca atcagagtcc ccaactataa
tctgtatatt 5460atggatgagg cccacttcac agatccctca agtatagcag
caagaggata catttcaaca 5520agggttgaga tgggcgaggc ggctgccatc
ttcatgaccg ccacgccacc aggaacccgt 5580gacgcatttc cggactccaa
ctcaccaatt atggacaccg aagtggaagt cccagagaga 5640gcctggagct
caggctttga ttgggtgacg gattattctg gaaaaacagt ttggtttgtt
5700ccaagcgtga ggaacggcaa tgagatcgca gcttgtctga caaaggctgg
aaaacgggtc 5760atacagctca gcagaaagac ttttgagaca gagttccaga
aaacaaaaca tcaagagtgg 5820gactttgtcg tgacaactga catttcagag
atgggcgcca actttaaagc tgaccgtgtc 5880atagattcca ggagatgcct
aaagccggtc atacttgatg gcgagagagt cattctggct 5940ggacccatgc
ctgtcacaca tgccagcgct gcccagagga gggggcgcat aggcaggaat
6000cccaacaaac ctggagatga gtatctgtat ggaggtgggt gcgcagagac
tgacgaagac 6060catgcacact ggcttgaagc aagaatgctc cttgacaata
tttacctcca agatggcctc 6120atagcctcgc tctatcgacc tgaggccgac
aaagtagcag ccattgaggg agagttcaag 6180cttaggacgg agcaaaggaa
gacctttgtg gaactcatga aaagaggaga tcttcctgtt 6240tggctggcct
atcaggttgc atctgccgga ataacctaca cagatagaag atggtgcttt
6300gatggcacga ccaacaacac cataatggaa gacagtgtgc cggcagaggt
gtggaccaga 6360cacggagaga aaagagtgct caaaccgagg tggatggacg
ccagagtttg ttcagatcat 6420gcggccctga agtcattcaa ggagtttgcc
gctgggaaaa gaggagcggc ttttggagtg 6480atggaagccc tgggaacact
gccaggacac atgacagaga gattccagga agccattgac 6540aacctcgctg
tgctcatgcg ggcagagact ggaagcaggc cttacaaagc cgcggcggcc
6600caattgccgg agaccctaga gaccattatg cttttggggt tgctgggaac
agtctcgctg 6660ggaatctttt tcgtcttgat gaggaacaag ggcataggga
agatgggctt tggaatggtg 6720actcttgggg ccagcgcatg gctcatgtgg
ctctcggaaa ttgagccagc cagaattgca 6780tgtgtcctca ttgttgtgtt
cctattgctg gtggtgctca tacctgagcc agaaaagcaa 6840agatctcccc
aggacaacca aatggcaatc atcatcatgg tagcagtagg tcttctgggc
6900ttgattaccg ccaatgaact cggatggttg gagagaacaa agagtgacct
aagccatcta 6960atgggaagga gagaggaggg ggcaaccatg ggattctcaa
tggacattga cctgcggcca 7020gcctcagctt gggccatcta tgctgccttg
acaactttca ttaccccagc cgtccaacat 7080gcagtgacca cttcatacaa
caactactcc ttaatggcga tggccacgca agctggagtg 7140ttgtttggta
tgggcaaagg gatgccattc tacgcatggg actttggagt cccgctgcta
7200atgataggtt gctactcaca attaacgccc ctgaccctaa tagtggccat
cattttgctc 7260gtggcgcact acatgtactt gatcccaggg ctgcaggcag
cagctgcgcg tgctgcccag 7320aagagaacgg cagctggcat catgaagaac
cctgttgtgg atggaatagt ggtgactgac 7380attgacacaa tgacaattga
cccccaagtg gagaaaaaga tgggacaggt gctactcatg 7440gcagtagccg
tctccagcgc catactgtcg cggaccgcct gggggtgggg ggaggctggg
7500gccctgatca cagccgcaac ttccactttg tgggaaggct ctccgaacaa
gtactggaac 7560tcctctacag ccacttcact gtgtaacatt tttaggggaa
gttacttggc tggagcttct 7620ctaatctaca cagtaacaag aaacgctggc
ttggtcaaga gacgtggggg tggaacagga 7680gagaccctgg gagagaaatg
gaaggcccgc ttgaaccaga tgtcggccct ggagttctac 7740tcctacaaaa
agtcaggcat caccgaggtg tgcagagaag aggcccgccg cgccctcaag
7800gacggtgtgg caacgggagg ccatgctgtg tcccgaggaa gtgcaaagct
gagatggttg 7860gtggagcggg gatacctgca gccctatgga aaggtcattg
atcttggatg tggcagaggg 7920ggctggagtt actacgccgc caccatccgc
aaagttcaag aagtgaaagg atacacaaaa 7980ggaggccctg gtcatgaaga
acccgtgttg gtgcaaagct atgggtggaa catagtccgt 8040cttaagagtg
gggtggacgt ctttcatatg gcggctgagc cgtgtgacac gttgctgtgt
8100gacataggtg agtcatcatc tagtcctgaa gtggaagaag cacggacgct
cagagtcctc 8160tccatggtgg gggattggct tgaaaaaaga ccaggagcct
tttgtataaa agtgttgtgc 8220ccatacacca gcactatgat ggaaaccctg
gagcgactgc agcgtaggta tgggggagga 8280ctggtcagag tgccactctc
ccgcaactct acacatgaga tgtactgggt ctctggagcg 8340aaaagcaaca
ccataaaaag tgtgtccacc acgagccagc tcctcttggg gcgcatggac
8400gggcctagga ggccagtgaa atatgaggag gatgtgaatc tcggctctgg
cacgcgggct 8460gtggtaagct gcgctgaagc tcccaacatg aagatcattg
gtaaccgcat tgaaaggatc 8520cgcagtgagc acgcggaaac gtggttcttt
gacgagaacc acccatatag gacatgggct 8580taccatggaa gctatgaggc
ccccacacaa gggtcagcgt cctctctaat aaacggggtt 8640gtcaggctcc
tgtcaaaacc ctgggatgtg gtgactggag tcacaggaat agccatgacc
8700gacaccacac cgtatggtca gcaaagagtt ttcaaggaaa aagtggacac
tagggtgcca 8760gacccccaag aaggtactcg tcaggttatg agcatggtct
cttcctggtt gtggaaagag 8820ctaggcaaac acaaacggcc acgagtctgt
accaaagaag agttcatcaa caaggttcgt 8880agcaatgcag cattaggggc
aatatttgaa gaggaaaaag agtggaagac tgcagtggaa 8940gctgtgaacg
atccaaggtt ctgggctcta gtggacaagg aaagagagca ccacctgaga
9000ggagagtgcc agagttgtgt gtacaacatg atgggaaaaa gagaaaagaa
acaaggggaa 9060tttggaaagg ccaagggcag ccgcgccatc tggtatatgt
ggctaggggc tagatttcta 9120gagttcgaag cccttggatt cttgaacgag
gatcactgga tggggagaga gaactcagga 9180ggtggtgttg aagggctggg
attacaaaga ctcggatatg tcctagaaga gatgagtcgc 9240ataccaggag
gaaggatgta tgcagatgac actgctggct gggacacccg catcagcagg
9300tttgatctgg agaatgaagc tctaatcacc aaccaaatgg agaaagggca
cagggccttg 9360gcattggcca taatcaagta cacataccaa aacaaagtgg
taaaggtcct tagaccagct 9420gaaaaaggga aaacagttat ggacattatt
tcgagacaag accaaagggg gagcggacaa 9480gttgtcactt acgctcttaa
cacatttacc aacctagtgg tgcaactcat tcggaatatg 9540gaggctgagg
aagtcctaga gatgcaagac ttgtggctgc tgcggaggtc agagaaagtg
9600accaactggt tgcagagcaa cggatgggat aggctcaaac gaatggcagt
cagtggagat 9660gattgcgttg tgaagccaat tgatgatagg tttgcacatg
ccctcaggtt cttgaatgat 9720atgggaaaag ttaggaagga cacacaagag
tggaaaccct caactggatg ggacaactgg 9780gaagaagttc cgttttgctc
ccaccacttc aacaagctcc atctcaagga cgggaggtcc 9840attgtggttc
cctgccgcca ccaagatgaa ctgattggcc gggcccgcgt ctctccaggg
9900gcgggatgga gcatccggga gactgcttgc ctagcaaaat catatgcgca
aatgtggcag 9960ctcctttatt tccacagaag ggacctccga ctgatggcca
atgccatttg ttcatctgtg 10020ccagttgact gggttccaac tgggagaact
acctggtcaa tccatggaaa gggagaatgg 10080atgaccactg aagacatgct
tgtggtgtgg aacagagtgt ggattgagga gaacgaccac 10140atggaagaca
agaccccagt tacgaaatgg acagacattc cctatttggg aaaaagggaa
10200gacttgtggt gtggatctct catagggcac agaccgcgca ccacctgggc
tgagaacatt 10260aaaaacacag tcaacatggt gcgcaggatc ataggtgatg
aagaaaagta catggactac 10320ctatccaccc aagttcgcta cttgggtgaa
gaagggtcta cacctggagt gctgtaagca 10380ccaatcttaa tgttgtcagg
cctgctagtc agccacagct tggggaaagc tgtgcagcct 10440gtgacccccc
caggagaagc tgggaaacca agcctatagt caggccgaga acgccatggc
10500acggaagaag ccatgctgcc tgtgagcccc tcagaggaca ctgagtcaaa
aaaccccacg 10560cgcttggagg cgcaggatgg gaaaagaagg tggcgacctt
ccccaccctt caatctgggg 10620cctgaactgg agatcagctg tggatctcca
gaagagggac tagtggttag aggaga 10676310793DNAZika virus 3ccaatctgtg
aatcagactg cgacagttcg agtttgaagc gaaagctagc aacagtatca 60acaggtttta
ttttggattt ggaaacgaga gtttctggtc atgaaaaacc caaaaaagaa
120atccggagga ttccggattg tcaatatgct aaaacgcgga gtagcccgtg
tgagcccctt 180tgggggcttg aagaggctgc cagccggact tctgctgggt
catgggccca tcaggatggt 240cttggcgatt ctagcctttt tgagattcac
ggcaatcaag ccatcactgg gtctcatcaa 300tagatggggt tcagtgggga
aaaaagaggc tatggaaata ataaagaagt tcaagaaaga 360tctggctgcc
atgctgagaa taatcaatgc caggaaggag aagaagagac gaggcgcaga
420tactagtgtc ggaatcgttg gcctcctgct gaccacagct atggcagcgg
aggtcactag 480acgtgggagt gcatactata tgtacttgga cagaaacgat
gctggggagg ccatatcttt 540tccaaccaca ttggggatga ataagtgtta
tatacagatc atggatcttg gacacatgtg 600tgatgccacc atgagctatg
aatgccctat gctggatgag ggggtggaac cagatgacgt 660cgattgttgg
tgcaacacga cgtcaacttg ggttgtgtac ggaacctgcc atcacaaaaa
720aggtgaagca cggagatcta gaagagctgt gacgctcccc tcccattcca
ctaggaagct 780gcaaacgcgg tcgcaaacct ggttggaatc aagagaatac
acaaagcact tgattagagt 840cgaaaattgg atattcagga accctggctt
cgcgttagca gcagctgcca tcgcttggct 900tttgggaagc tcaacgagcc
aaaaagtcat atacttggtc atgatactgc tgattgcccc 960ggcatacagc
atcaggtgca taggagtcag caatagggac tttgtggaag gtatgtcagg
1020tgggacttgg gttgatgttg tcttggaaca tgggggttgt gtcaccgtaa
tggcacagga 1080caaaccgact gtcgacatag agctggttac aacaacagtc
agcaacatgg cggaggtaag 1140atcctactgc tatgaggcat caatatcaga
catggcttcg gacagccgct gcccaacaca 1200aggtgaagcc taccttgaca
agcaatcaga cactcaatat gtctgcaaaa gaacgttagt 1260ggacagaggc
tggggaaatg gatgtggact ttttggcaaa gggagcctgg tgacatgcgc
1320taagtttgca tgctccaaga aaatgaccgg gaagagcatc cagccagaga
atctggagta 1380ccggataatg ctgtcagttc atggctccca gcacagtggg
atgatcgtta atgacacagg 1440acatgaaact gatgagaata gagcgaaggt
tgagataacg cccaattcac caagagccga 1500agccaccctg gggggttttg
gaagcttagg acttgattgt gaaccgagga caggccttga 1560cttttcagat
ttgtattact tgactatgaa taacaagcac tggttggttc acaaggagtg
1620gttccacgac attccattac cttggcacgc tggggcagac accggaactc
cacactggaa 1680caacaaagaa gcactggtag agttcaagga cgcacatgcc
aaaaggcaaa ctgtcgtggt 1740tctagggact caagaaggag cagttcacac
ggcccttgct ggagctctgg aggctgagat 1800ggatggtgca aagggaaggc
tgtcctctgg ccacttgaaa tgtcgcctga aaatggataa 1860acttagattg
aagggcgtgt catactcctt gtgtaccgca gcgttcacat tcaccaagat
1920cccggctgaa acactgcacg ggacagtcac agtggaggta cagtacgcag
ggacagatgg 1980accttgcaag gttccagctc agatggcggt ggacatgcaa
actctgaccc cagttgggag 2040gttgataacc gctaaccccg taatcactga
aagcactgag aactctaaga tgatgctgga 2100acttgatcca ccatttgggg
actcttacat tgtcatagga gtcggggaga agaagatcac 2160ccaccactgg
cacaggagtg gcagcaccat tggaaaagca tttgaagcca ctgtgagagg
2220tgccaagaga atggcagtct tgggagacac agcctgggac tttggatcag
ttggaggcgc 2280tctcaactca ttgggcaagg gcatccatca aatttttgga
gcagctttca aatcattgtt 2340tggaggaatg tcctggttct cacaaattct
cattggaacg ttgctgatgt ggttgggtct 2400gaacacaaag aatggatcta
tttcccttat gtgcttggcc ttagggggag tgttgatctt 2460cttatccaca
gccgtctctg ctgatgtggg gtgctcggtg gacttctcaa agaaggagac
2520gagatgtggt acaggggtgt tcgtctataa cgacgttgaa gcctggaggg
acaggtacaa 2580gtaccatcct gactctcccc gtagattggc agcagcagtc
aagcaagcct gggaagatgg 2640tatctgcggg atctcctctg tttcaagaat
ggaaaacatc atgtggagat cagtagaagg 2700ggagcttaac gcaatcctgg
aagagaatgg agttcaactg acggtcgttg tgggatctgt 2760aaaaaacccc
atgtggagag gtccacagag attgcccgtg cctgtgaacg agctgcccca
2820cggctggaag gcttggggga aatcgtactt cgtcagagca gcaaagacaa
ataacagctt 2880tgtcgtggat ggtgacacac tgaaggaatg cccactcaaa
catagagcat ggaacagctt 2940tcttgtggag gatcatgggt tcggggtatt
tcacactagt gtctggctca aggttagaga 3000agattattca ttagagtgtg
atccagccgt tattggaaca gctgttaagg gaaaggaggc 3060tgtacacagt
gatctaggct actggattga gagtgagaag aatgacacat ggaggctgaa
3120gagggcccat ctgatcgaga tgaaaacatg tgaatggcca aagtcccaca
cattgtggac 3180agatggaata gaagagagtg atctgatcat acccaagtct
ttagctgggc cactcagcca 3240tcacaatacc agagagggct acaggaccca
aatgaaaggg ccatggcaca gtgaagagct 3300tgaaattcgg tttgaggaat
gcccaggcac taaggtccac gtggaggaaa catgtggaac 3360aagaggacca
tctctgagat caaccactgc aagcggaagg gtgatcgagg aatggtgctg
3420cagggagtgc acaatgcccc cactgtcgtt ccgggctaaa gatggctgtt
ggtatggaat 3480ggagataagg cccaggaaag aaccagaaag caacttagta
aggtcaatgg tgactgcagg 3540atcaactgat cacatggatc acttctccct
tggagtgctt gtgattctgc tcatggtgca 3600ggaagggctg aagaagagaa
tgaccacaaa gatcatcata agcacatcaa tggcagtgct 3660ggtagctatg
atcctgggag gattttcaat gagtgacctg gctaagcttg caattttgat
3720gggtgccacc ttcgcggaaa tgaacactgg aggagatgta gctcatctgg
cgctgatagc 3780ggcattcaaa gtcagaccag cgttgctggt atctttcatc
ttcagagcta attggacacc 3840ccgtgaaagc atgctgctgg ccttggcctc
gtgttttttg caaactgcga tctccgcctt 3900ggaaggcgac ctgatggttc
tcatcaatgg ttttgctttg gcctggttgg caatacgagc 3960gatggttgtt
ccacgcactg acaacatcac cttggcaatc ctggctgctc tgacaccact
4020ggcccggggc acactgcttg tggcgtggag agcaggcctt gctacttgcg
gggggtttat 4080gctcctctct ctgaagggaa
aaggcagtgt gaagaagaac ttaccatttg tcatggccct 4140gggactaacc
gctgtgaggc tggtcgaccc catcaacgtg gtgggactgc tgttgctcac
4200aaggagtggg aagcggagct ggccccctag cgaagtactc acagctgttg
gcctgatatg 4260cgcattggct ggagggttcg ccaaggcaga tatagagatg
gctgggccca tggccgcggt 4320cggtctgcta attgtcagtt acgtggtctc
aggaaagagt gtggacatgt acattgaaag 4380agcaggtgac atcacatggg
aaaaagatgc ggaagtcact ggaaacagtc cccggctcga 4440tgtggcgcta
gatgagagtg gtgacttctc cctggtggag gatgacggtc cccccatgag
4500agagatcata ctcaaggtgg tcctgatgac catctgtggc atgaacccaa
tagccatacc 4560ctttgcagct ggagcgtggt acgtatacgt gaagactgga
aaaaggagtg gtgctctatg 4620ggatgtgcct gctcccaagg aagtaaaaaa
gggggagacc acagatggag tgtacagagt 4680aatgactcgt agactgctag
gttcaacaca agttggagtg ggagttatgc aagagggggt 4740ctttcacact
atgtggcacg tcacaaaagg atccgcgctg agaagcggtg aagggagact
4800tgatccatac tggggagatg tcaagcagga tctggtgtca tactgtggtc
catggaagct 4860agatgccgcc tgggacgggc acagcgaggt gcagctcttg
gccgtgcccc ccggagagag 4920agcgaggaac atccagactc tgcccggaat
atttaagaca aaggatgggg acattggagc 4980ggttgcgctg gattacccag
caggaacttc aggatctcca atcctagaca agtgtgggag 5040agtgatagga
ctttatggca atggggtcgt gataaaaaat gggagttatg ttagtgccat
5100cacccaaggg aggagggagg aagagactcc tgttgagtgc ttcgagcctt
cgatgctgaa 5160gaagaagcag ctaactgtct tagacttgca tcctggagct
gggaaaacca ggagagttct 5220tcctgaaata gtccgtgaag ccataaaaac
aagactccgt actgtgatct tagctccaac 5280cagggttgtc gctgctgaaa
tggaggaagc ccttagaggg cttccagtgc gttatatgac 5340aacagcagtc
aatgtcaccc actctggaac agaaatcgtc gacttaatgt gccatgccac
5400cttcacttca cgtctactac agccaatcag agtccccaac tataatctgt
atattatgga 5460tgaggcccac ttcacagatc cctcaagcat agcagcaaga
ggatacattt caacaagggt 5520tgagatgggc gaggcggctg ccatcttcat
gaccgccacg ccaccaggaa cccgtgacgc 5580atttccggac tccaactcac
caattatgga caccgaagtg gaagtcccag agagagcctg 5640gagctcaggc
tttgattggg tgacggatca ttctggaaaa acagtttggt ttgttccaag
5700cgtgaggaac ggcaatgaga tcgcagcttg tctgacaaag gctggaaaac
gggtcataca 5760gctcagcaga aagacttttg agacagagtt ccagaaaaca
aaacatcaag agtgggactt 5820tgtcgtgaca actgacattt cagagatggg
cgccaacttt aaagctgacc gtgtcataga 5880ttccaggaga tgcctaaagc
cggtcatact tgatggcgag agagtcattc tggctggacc 5940catgcctgtc
acacatgcca gcgctgccca gaggaggggg cgcataggca ggaatcccaa
6000caaacctgga gatgagtacc tgtatggagg tgggtgcgca gagactgacg
aagaccatgc 6060acactggctt gaagcaagaa tgctccttga caatatttac
ctccaagatg gcctcatagc 6120ctcgctctat cgacctgagg ccgacaaagt
agcagccatt gagggagagt tcaagcttag 6180gacggagcaa aggaagacct
ttgtggaact catgaaaaga ggagatcttc ctgtttggct 6240ggcctatcag
gttgcatctg ccggaataac ctacacagat agaagatggt gctttgatgg
6300cacgaccaac aacaccataa tggaagacag tgtgccggca gaggtgtgga
ccagacacgg 6360agagaaaaga gtgctcaaac cgaggtggat ggacgccaga
gtttgttcag atcatgcggc 6420cctgaagtca ttcaaggagt ttgccgctgg
gaaaagagga gcggcttttg gagtgatgga 6480agccctggga acactgccag
gacacatgac agagagattc caggaagcca ttgacaacct 6540cgctgtgctc
atgcgggcag agactggaag caggccttac aaagccgcgg cggcccaatt
6600gccggagacc ctagagacca ttatgctttt ggggttgctg ggaacagtct
cgctgggaat 6660ctttttcgtc ttgatgagga acaagggcat agggaagatg
ggctttggaa tggtgactct 6720tggggccagc gcatggctca tgtggctctc
ggaaattgag ccagccagaa ttgcatgtgt 6780cctcattgtt gtgttcctat
tgctggtggt gctcatacct gagccagaaa agcaaagatc 6840tccccaggac
aaccaaatgg caatcatcat catggtagca gtaggtcttc tgggcttgat
6900taccgccaat gaactcggat ggttggagag aacaaagagt gacctaagcc
atctaatggg 6960aaggagagag gagggggcaa ccataggatt ctcaatggac
attgacctgc ggccagcctc 7020agcttgggcc atctatgctg ccttgacaac
tttcattacc ccagccgtcc aacatgcagt 7080gaccacttca tacaacaact
actccttaat ggcgatggcc acgcaagctg gagtgttgtt 7140tggtatgggc
aaagggatgc cattctacgc atgggacttt ggagtcccgc tgctaatgat
7200aggttgctac tcacaattaa cacccctgac cctaatagtg gccatcattt
tgctcgtggc 7260gcactacatg tacttgatcc cagggctgca ggcagcagct
gcgcgtgctg cccagaagag 7320aacggcagct ggcatcatga agaaccctgt
tgtggatgga atagtggtga ctgacattga 7380cacaatgaca attgaccccc
aagtggagaa aaagatggga caggtgctac tcatagcagt 7440agccgtctcc
agcgccatac tgtcgcggac cgcctggggg tggggggagg ctggggccct
7500gatcacagcc gcaacttcca ctttgtggga aggctctccg aacaagtact
ggaactcctc 7560tacagccact tcactgtgta acatttttag gggaagttac
ttggctggag cttctctaat 7620ctacacagta acaagaaacg ctggcttggt
caagagacgt gggggtggaa caggagagac 7680cctgggagag aaatggaagg
cccgcttgaa ccagatgtcg gccctggagt tctactccta 7740caaaaagtca
ggcatcaccg aggtgtgcag agaagaggcc cgccgcgccc tcaaggacgg
7800tgtggcaacg ggaggccatg ctgtgtcccg aggaagtgca aagctgagat
ggttggtgga 7860gcggggatac ctgcagccct atggaaaggt cattgatctt
ggatgtggca gagggggctg 7920gagttactac gccgccacca tccgcaaagt
tcaagaagtg aaaggataca caaaaggagg 7980ccctggtcat gaagaacccg
tgttggtgca aagctatggg tggaacatag tccgtcttaa 8040gagtggggtg
gacgtctttc atatggcggc tgagccgtgt gacacgttgc tgtgtgacat
8100aggtgagtca tcatctagtc ctgaagtgga agaagcacgg acgctcagag
tcctctccat 8160ggtgggggat tggcttgaaa aaagaccagg agccttttgc
ataaaagtgt tgtgcccata 8220caccagcact atgatggaaa ccctggagcg
actgcagcgt aggtatgggg gaggactggt 8280cagagtgcca ctctcccgca
actctacaca tgagatgtac tgggtctctg gagcgaaaag 8340caacaccata
aaaagtgtgt ccaccacgag ccagctcctc ttggggcgca tggacgggcc
8400taggaggcca gtgaaatatg aggaggatgt gaatctcggc tctggcacgc
gggctgtggt 8460aagctgcgct gaagctccca acatgaagat cattggtaac
cgcattgaaa ggatccgcag 8520tgagcacgcg gaaacgtggt tctttgacga
aaaccaccca tataggacat gggcttacca 8580tggaagctat gtggccccca
cacaagggtc agcgtcctct ctaataaacg gggttgtcag 8640gctcctgtca
aaaccctggg atgtggtgac tggagtcaca ggaatagcca tgaccgacac
8700cacaccgtat ggtcagcaaa gagttttcaa ggaaaaagtg gacactaggg
tgccagaccc 8760ccaagaaggc actcgtcagg ttatgagcat ggtctcttcc
tggttgtgga aagagctagg 8820caaacacaaa cgaccacgag tctgtaccaa
agaagagttc atcaacaagg ttcgtagcaa 8880tgcagcatta ggggcaatat
ttgaagagga aaaagagtgg aagactgcag tggaagctgt 8940gaacgatcca
aggttctggg ctctagtgga caaggaaaga gagcaccacc tgagaggaga
9000gtgccagagt tgtgtgtaca acatgatggg aaaaagagaa aagaaacaag
gggaatttgg 9060aaaggccaag ggcagccgcg ccatctggta tatgtggcta
ggggctagat ttctagagtt 9120cgaagccctt ggattcttga acgaggatca
ctggatgggg agagagaact caggaggtgg 9180tgttgaaggg ctgggattac
aaagactcgg atatgtccta gaagagatga gtcgcatacc 9240aggaggaagg
atgtatgcag atgacactgc tggctgggac acccgcatca gcaggtttga
9300tctggagaat gaagctctaa tcaccaacca aatggagaaa gggcacaggg
ccttggcatt 9360ggccataatc aagtacacat accaaaacaa agtggtaaag
gtccttagac cagctgaaaa 9420agggaaaaca gttatggaca ttatttcgag
acaagaccaa agggggagcg gacaagttgt 9480cacttacgct cttaacacat
ttaccaacct agtggtgcaa ctcattcgga atatggaggc 9540tgaggaagtt
ctagagatgc aagacttgtg gctgctgcgg aggtcagaga aagtgaccaa
9600ctggttgcag agcaacggat gggataggct caaacgaatg gcagtcagtg
gagatgattg 9660cgttgtgaag ccaattgatg ataggtttgc acatgccctc
aggttcttga atgatatggg 9720aaaagttagg aaggacacac aagagtggaa
accctcaact ggatgggaca actgggaaga 9780agttccgttt tgctcccacc
acttcaacaa gctccatctc aaggacggga ggtccattgt 9840ggttccctgc
cgccaccaag atgaactgat tggccgggcc cgcgtctctc caggggcggg
9900atggagcatc cgggagactg cttgcctagc aaaatcatat gcgcaaatgt
ggcagctcct 9960ttatttccac agaagggacc tccgactgat ggccaatgcc
atttgttcat ctgtgccagt 10020tgactgggtt ccaactggga gaactacctg
gtcaatccat ggaaagggag aatggatgac 10080cactgaagac atgcttgtgg
tgtggaacag agtgtggatt gaggagaacg accacatgga 10140agacaagacc
ccagttacga aatggacaga cattccctat ttgggaaaaa gggaagactt
10200gtggtgtgga tctctcatag ggcacagacc gcgcaccacc tgggctgaga
acattaaaaa 10260tacagtcaac atggtgcgca ggatcatagg tgatgaagaa
aagtacatgg actacctatc 10320cacccaagtt cgctacttgg gtgaagaagg
gtctacacct ggagtgctgt gagcaccaat 10380cttaatgttg tcaggcctgc
tagtcagcca cagcttgggg aaagctgtgc agcctgtgac 10440ccctccagga
gaagctgggt aaccaagcct atagtcaggc cgagaacgcc atggcacgga
10500agaagccatg ctgcctgtga gcccctcaga ggacactgag tcaaaaaacc
ccacgcgctt 10560ggaggcgcag gatgggaaaa gaaggtggcg accttcccca
cccttcaatc tggggcctga 10620actggagatc agctgtggat ctccagaaga
gggactagtg gttagaggag accccccgga 10680aaacgcaaaa cagcatattg
acgctgggaa agaccagaga ctccatgagt ttccaccacg 10740ctggccgcca
ggcacagatc gccgaatagc ggcggccggt gtggggaaat cca 10793410675DNAZika
virus 4gttgttgatc tgtgtgaatc agactgcgac agttcgagtt tgaagcgaaa
gctagcaaca 60gtatcaacag gttttatttt ggatttggaa acgagagttt ctggtcatga
aaaacccaaa 120aaagaaatcc ggaggattcc ggattgtcaa tatgctaaaa
cgcggagtag cccgtgtgag 180cccctttggg ggcttgaaga ggctgccagc
cggacttctg ctgggtcatg ggcccatcag 240gatggtcttg gcgattctag
cctttttgag attcacggca atcaagccat cactgggtct 300catcaataga
tggggttcag tggggaaaaa agaggctatg gaaacaataa agaagttcaa
360gaaagatctg gctgccatgc tgagaataat caatgctagg aaggagaaga
agagacgagg 420cgcagatact agtgtcggaa ttgttggcct cctgctgacc
acagctatgg cagcggaggt 480cactagacgt gggagtgcat actatatgta
cttggacaga aacgatgctg gggaggccat 540atcttttcca accacattgg
ggatgaataa gtgttatata cagatcatgg atcttggaca 600catgtgtgat
gccaccatga gctatgaatg ccctatgctg gatgaggggg tggaaccaga
660tgacgtcgat tgttggtgca acacgacgtc aacttgggtt gtgtacggaa
cctgccatca 720caaaaaaggt gaagcacgga gatctagaag agctgtgacg
ctcccctccc attccaccag 780gaagctgcaa acgcggtcgc aaacctggtt
ggaatcaaga gaatacacaa agcacttgat 840tagagtcgaa aattggatat
tcaggaaccc tggcttcgcg ttagcagcag ctgccatcgc 900ttggcttttg
ggaagctcaa cgagccaaaa agtcatatac ttggtcatga tactgctgat
960tgccccggca tacagcatca ggtgcatagg agtcagcaat agggactttg
tggaaggtat 1020gtcaggtggg acttgggttg atgttgtctt ggaacatgga
ggttgtgtca ccgtaatggc 1080acaggacaaa ccgactgtcg acatagagct
ggttacaaca acagtcagca acatggcgga 1140ggtaagatcc tactgctatg
aggcatcaat atcagacatg gcttctgaca gccgctgccc 1200aacacaaggt
gaagcctacc ttgacaagca atcagacact caatatgtct gcaaaagaac
1260gttagtggac agaggctggg gaaatggatg tggacttttt ggcaaaggga
gcctggtgac 1320atgcgctaag tttgcatgct ccaagaaaat gaccgggaag
agcatccagc cagagaatct 1380ggagtaccgg ataatgctgt cagttcatgg
ctcccagcac agtgggatga tcgttaatga 1440cacaggacat gaaactgatg
agaatagagc gaaagttgag ataacgccca attcaccgag 1500agccgaagcc
accctggggg gttttggaag cctaggactt gattgtgaac cgaggacagg
1560ccttgacttt tcagatttgt attacttgac tatgaataac aagcactggt
tggttcacaa 1620ggagtggttc cacgacattc cattaccttg gcacgctggg
gcagacaccg gaactccaca 1680ctggaacaac aaagaagcac tggtagagtt
caaggacgca catgccaaaa ggcaaactgt 1740cgtggttcta gggagtcaag
aaggagcagt tcacacggcc cttgctggag ctctggaggc 1800tgagatggat
ggtgcaaagg gaaggctgtc ctctggccac ttgaaatgtc gcctgaaaat
1860ggataaactt agattgaagg gcgtgtcata ctccttgtgt actgcagcgt
tcacattcac 1920caagatcccg gctgaaacac tgcacgggac agtcacagtg
gaggtacagt acgcagggac 1980agatggacct tgcaaggttc cagctcagat
ggcggtggac atgcaaactc tgaccccagt 2040tgggaggttg ataaccgcta
accccgtaat cactgaaagc actgagaact ctaagatgat 2100gctggaactt
gatccaccat ttggggactc ttacattgtc ataggagtcg gggagaagaa
2160gatcacccac cactggcaca ggagtggcag caccattgga aaagcatttg
aagccactgt 2220gagaggtgcc aagagaatgg cagtcttggg agacacagcc
tgggactttg gatcagttgg 2280aggcgctctc aactcattgg gcaagggcat
ccatcaaatt tttggagcag ctttcaaatc 2340attgtttgga ggaatgtcct
ggttctcaca aattctcatt ggaacgttgc tgatgtggtt 2400gggtctgaac
acaaagaatg gatctatttc ccttatgtgc ttggccttag ggggagtgtt
2460gatcttctta tccacagccg tctctgctga tgtggggtgc tcggtggact
tctcaaagaa 2520ggagacgaga tgcggtacag gggtgttcgt ctataacgac
gttgaagcct ggagggacag 2580gtacaagtac catcctgact ccccccgtag
attggcagca gcagtcaagc aagcctggga 2640agatggtatc tgcgggatct
cctctgtttc aagaatggaa aacatcatgt ggagatcagt 2700agaaggggag
ctcaacgcaa tcctggaaga gaatggagtt caactgacgg tcgttgtggg
2760atctgtaaaa aaccccatgt ggagaggtcc acagagattg cccgtgcctg
tgaacgagct 2820gccccacggc tggaaggctt gggggaaatc gtatttcgtc
agagcagcaa agacaaataa 2880cagctttgtc gtggatggtg acacactgaa
ggaatgccca ctcaaacata gagcatggaa 2940cagctttctt gtggaggatc
atgggttcgg ggtatttcac actagtgtct ggctcaaggt 3000tagagaagat
tattcattag agtgtgatcc agccgttatt ggaacagctg ttaagggaaa
3060ggaggctgta cacagtgatc taggctactg gattgagagt gagaagaatg
acacatggag 3120gctgaagagg gcccatctga tcgagatgaa aacatgtgaa
tggccaaagt cccacacatt 3180gtggacagat ggaatagaag agagtgatct
gatcataccc aagtctttag ctgggccact 3240cagccatcac aataccagag
agggctacag gacccaaatg aaagggccat ggcacagtga 3300agagcttgaa
attcggtttg aggaatgccc aggcactaag gtccacgtgg aggaaacatg
3360tggaacaaga ggaccatctc tgagatcaac cactgcaagc ggaagggtga
tcgaggaatg 3420gtgctgcagg gagtgcacaa tgcccccact gtcgttccgg
gctaaagatg gctgttggta 3480tggaatggag ataaggccca ggaaagaacc
agaaagcaac ttagtaaggt caatggtgac 3540tgcaggatca actgatcaca
tggaccactt ctcccttgga gtgcttgtga tcctgctcat 3600ggtgcaggaa
gggctgaaga agagaatgac cacaaagatc atcataagca catcaatggc
3660agtgctggta gctatgatcc tgggaggatt ttcaatgagt gacctggcta
agcttgcaat 3720tttgatgggt gccaccttcg cggaaatgaa cactggagga
gatgtagctc atctggcgct 3780gatagcggca ttcaaagtca gaccagcgtt
gctggtatct ttcatcttca gagctaattg 3840gacaccccgt gaaagcatgc
tgctggcctt ggcctcgtgt cttttgcaaa ctgcgatctc 3900cgccttggaa
ggcgacctga tggttctcat caatggtttt gctttggcct ggttggcaat
3960acgagcgatg gttgttccac gcactgataa catcaccttg gcaatcctgg
ctgctctgac 4020accactggcc cggggcacac tgcttgtggc gtggagagca
ggccttgcta cttgcggggg 4080gtttatgctc ctctctctga agggaaaagg
cagtgtgaag aagaacttac catttgtcat 4140ggccctggga ctaaccgctg
tgaggctggt cgaccccatc aacgtggtgg gactgctgtt 4200gctcacaagg
agtgggaagc ggagctggcc ccctagcgaa gtactcacag ctgttggcct
4260gatatgcgca ttggctggag ggttcgccaa ggcagatata gagatggctg
ggcccatggc 4320cgcggtcggt ctgctaattg tcagttacgt ggtctcagga
aagagtgtgg acatgtacat 4380tgaaagagca ggtgacatca catgggaaaa
agatgcggaa gtcactggaa acagtccccg 4440gctcgatgtg gcgctagatg
agagtggtga tttctccctg gtggaggatg acggtccccc 4500catgagagag
atcatactca aggtggtcct gatgaccatc tgtggcatga acccaatagc
4560catacccttt gcagctggag cgtggtacgt atacgtgaag actggaaaaa
ggagtggtgc 4620tctatgggat gtgcctgctc ccaaggaagt aaaaaagggg
gagaccacag atggagtgta 4680cagagtaatg actcgtagac tgctaggttc
aacacaagtt ggagtgggag ttatgcaaga 4740gggggtcttt cacactatgt
ggcacgtcac aaaaggatcc gcgctgagaa gcggtgaagg 4800gagacttgat
ccatactggg gagatgtcaa gcaggatctg gtgtcatact gtggtccatg
4860gaagctagat gccgcctggg atgggcacag cgaggtgcag ctcttggccg
tgccccccgg 4920agagagagcg aggaacatcc agactctgcc cggaatattt
aagacaaagg atggggacat 4980tggagcggtt gcgctggatt acccagcagg
aacttcagga tctccaatcc tagacaagtg 5040tgggagagtg ataggacttt
atggcaatgg ggtcgtgatc aaaaacggga gttatgttag 5100tgccatcacc
caagggagga gggaggaaga gactcctgtt gagtgcttcg agccctcgat
5160gctgaagaag aagcagctaa ctgtcttaga cttgcatcct ggagctggga
aaaccaggag 5220agttcttcct gaaatagtcc gtgaagccat aaaaacaaga
ctccgtactg tgatcttagc 5280tccaaccagg gttgtcgctg ctgaaatgga
ggaggccctt agagggcttc cagtgcgtta 5340tatgacaaca gcagtcaatg
tcacccactc tggaacagaa atcgtcgact taatgtgcca 5400tgccaccttc
acttcacgtc tactacagcc aatcagagtc cccaactata atctgtatat
5460tatggatgag gcccacttca cagatccctc aagtatagca gcaagaggat
acatttcaac 5520aagggttgag atgggcgagg cggctgccat cttcatgacc
gccacgccac caggaacccg 5580tgacgcattt ccggactcca actcaccaat
tatggacacc gaagtggaag tcccagagag 5640agcctggagc tcaggctttg
attgggtgac ggatcattct ggaaaaacag tttggtttgt 5700tccaagcgtg
aggaacggca atgagatcgc agcttgtctg acaaaggctg gaaaacgggt
5760catacagctc agcagaaaga cttttgagac agagttccag aaaacaaaac
atcaagagtg 5820ggactttgtc gtgacaactg acatttcaga gatgggcgcc
aactttaaag ctgaccgtgt 5880catagattcc aggagatgcc taaagccggt
catacttgat ggcgagagag tcattctggc 5940tggacccatg cctgtcacac
atgccagcgc tgcccagagg agggggcgca taggcaggaa 6000tcccaacaaa
cctggagatg agtatctgta tggaggtggg tgcgcagaga ctgacgaaga
6060ccatgcacac tggcttgaag caagaatgct ccttgacaat atttacctcc
aagatggcct 6120catagcctcg ctctatcgac ctgaggccga caaagtagca
gccattgagg gagagttcaa 6180gcttaggacg gagcaaagga agacctttgt
ggaactcatg aaaagaggag atcttcctgt 6240ttggctggcc tatcaggttg
catctgccgg aataacctac acagatagaa gatggtgctt 6300tgatggcacg
accaacaaca ccataatgga agacagtgtg ccggcagagg tgtggaccag
6360acacggagag aaaagagtgc tcaaaccgag gtggatggac gccagagttt
gttcagatca 6420tgcggccctg aagtcattca aggagtttgc cgctgggaaa
agaggagcgg cttttggagt 6480gatggaagcc ctgggaacac tgccaggaca
catgacagag agattccagg aagccattga 6540caacctcgct gtgctcatgc
gggcagagac tggaagcagg ccttacaaag ccgcggcggc 6600ccaattgccg
gagaccctag agaccataat gcttttgggg ttgctgggaa cagtctcgct
6660gggaatcttc ttcgtcttga tgaggaacaa gggcataggg aagatgggct
ttggaatggt 6720gactcttggg gccagcgcat ggctcatgtg gctctcggaa
attgagccag ccagaattgc 6780atgtgtcctc attgttgtgt tcctattgct
ggtggtgctc atacctgagc cagaaaagca 6840aagatctccc caggacaacc
aaatggcaat catcatcatg gtagcagtag gtcttctggg 6900cttgattacc
gccaatgaac tcggatggtt ggagagaaca aagagtgacc taagccatct
6960aatgggaagg agagaggagg gggcaaccat aggattctca atggacattg
acctgcggcc 7020agcctcagct tgggccatct atgctgcctt gacaactttc
attaccccag ccgtccaaca 7080tgcagtgacc acctcataca acaactactc
cttaatggcg atggccacgc aagctggagt 7140gttgtttggc atgggcaaag
ggatgccatt ctacgcatgg gactttggag tcccgctgct 7200aatgataggt
tgctactcac aattaacacc cctgacccta atagtggcca tcattttgct
7260cgtggcgcac tacatgtact tgatcccagg gctgcaggca gcagctgcgc
gtgctgccca 7320gaagagaacg gcagctggca tcatgaagaa ccctgttgtg
gatggaatag tggtgactga 7380cattgacaca atgacaattg acccccaagt
ggagaaaaag atgggacagg tgctactcat 7440agcagtagcc gtctccagcg
ccatactgtc gcggaccgcc tgggggtggg gggaggctgg 7500ggctctgatc
acagccgcaa cttccacttt gtgggaaggc tctccgaaca agtactggaa
7560ctcctctaca gccacttcac tgtgtaacat ttttagggga agttacttgg
ctggagcttc 7620tctaatctac acagtaacaa gaaacgctgg cttggtcaag
agacgtgggg gtggaacagg 7680agagaccctg ggagagaaat ggaaggcccg
cttgaaccag atgtcggccc tggagttcta 7740ctcctacaaa aagtcaggca
tcaccgaggt gtgcagagaa gaggcccgcc gcgccctcaa 7800ggacggtgtg
gcaacgggag gccatgctgt gtcccgagga agtgcaaagc tgagatggtt
7860ggtggagcgg ggatacctgc agccctatgg aaaggtcatt gatcttggat
gtggcagagg 7920gggctggagt tactacgtcg ccaccatccg caaagttcaa
gaagtgaaag gatacacaaa 7980aggaggccct ggtcatgaag aacccgtgtt
ggtgcaaagc tatgggtgga acatagtccg 8040tcttaagagt ggggtggacg
tctttcatat ggcggctgag ccgtgtgaca cgttgctgtg 8100tgacataggt
gagtcatcat ctagtcctga agtggaagaa gcacggacgc tcagagtcct
8160ctccatggtg ggggattggc ttgaaaaaag accaggagcc ttttgtataa
aagtgttgtg 8220cccatacacc agcactatga tggaaaccct ggagcgactg
cagcgtaggt atgggggagg 8280actggtcaga gtgccactct cccgcaactc
tacacatgag atgtactggg tctctggagc
8340gaaaagcaac accataaaaa gtgtgtccac cacgagccag ctcctcttgg
ggcgcatgga 8400cgggcctagg aggccagtga aatatgagga ggatgtgaat
ctcggctctg gcacgcgggc 8460tgtggtaagc tgcgctgaag ctcccaacat
gaagatcatt ggtaaccgca ttgaaaggat 8520ccgcagtgag cacgcggaaa
cgtggttctt tgacgagaac cacccatata ggacatgggc 8580ttaccatgga
agctatgagg cccccacaca agggtcagcg tcctctctaa taaacggggt
8640tgtcaggctc ctgtcaaaac cctgggatgt ggtgactgga gtcacaggaa
tagccatgac 8700cgacaccaca ccgtatggtc agcaaagagt tttcaaggaa
aaagtggaca ctagggtgcc 8760agacccccaa gaaggcactc gtcaggttat
gagcatggtc tcttcctggt tgtggaaaga 8820gctaggcaaa cacaaacggc
cacgagtctg caccaaagaa gagttcatca acaaggttcg 8880tagcaatgca
gcattagggg caatatttga agaggaaaaa gagtggaaga ctgcagtgga
8940agctgtgaac gatccaaggt tctgggctct agtggacaag gaaagagagc
accacctgag 9000aggagagtgc cagagctgtg tgtacaacat gatgggaaaa
agagaaaaga aacaagggga 9060atttggaaag gccaagggca gccgcgccat
ctggtatatg tggctagggg ctagatttct 9120agagttcgaa gcccttggat
tcttgaacga ggatcactgg atggggagag agaactcagg 9180aggtggtgtt
gaagggctgg gattacaaag actcggatat gtcctagaag agatgagtcg
9240tataccagga ggaaggatgt atgcagatga cactgctggc tgggacaccc
gcattagcag 9300gtttgatctg gagaatgaag ctctaatcac caaccaaatg
gagaaagggc acagggcctt 9360ggcattggcc ataatcaagt acacatacca
aaacaaagtg gtaaaggtcc ttagaccagc 9420tgaaaaaggg aaaacagtta
tggacattat ttcgagacaa gaccaaaggg ggagcggaca 9480agttgtcact
tacgctctta acacatttac caacctagtg gtgcaactca ttcggaatat
9540ggaggctgag gaagttctag agatgcaaga cttgtggctg ctgcggaggt
cagagaaagt 9600gaccaactgg ttgcagagca acggatggga taggctcaaa
cgaatggcag tcagtggaga 9660tgattgcgtt gtgaagccaa ttgatgatag
gtttgcacat gccctcaggt tcttgaatga 9720tatgggaaaa gttaggaagg
acacacaaga gtggaaaccc tcaactggat gggacaactg 9780ggaagaagtt
ccgttttgct cccaccactt caacaagctc catctcaagg acgggaggtc
9840cattgtggtt ccctgccgcc accaagatga actgattggc cgggcccgcg
tctctccagg 9900ggcgggatgg agcatccggg agactgcttg cctagcaaaa
tcatatgcgc aaatgtggca 9960gctcctttat ttccacagaa gggacctccg
actgatggcc aatgccattt gttcatctgt 10020gccagttgac tgggttccaa
ctgggagaac tacctggtca atccatggaa agggagaatg 10080gatgaccact
gaagacatgc ttgtggtgtg gaacagagtg tggattgagg agaacgacca
10140catggaagac aagaccccag ttacgaaatg gacagacatt ccctatttgg
gaaaaaggga 10200agacttgtgg tgtggatctc tcatagggca cagaccgcgc
accacctggg ctgagaacat 10260taaaaacaca gtcaacatgg tgcgcaggat
cataggtgat gaagaaaagt acatggacta 10320cctatccacc caagttcgct
acttgggtga agaagggtct acacctggag tgctgtaagc 10380accaatctta
atgttgtcag gcctgctagt cagccacagc ttggggaaag ctgtgcagcc
10440tgtgaccccc ccaggagaag ctgggaaacc aagcctatag tcaggccgag
aacgccatgg 10500cacggaagaa gccatgctgc ctgtgagccc ctcagaggac
actgagtcaa aaaaccccac 10560gcgcttggag gcgcaggatg ggaaaagaag
gtggcgacct tccccaccct tcaatctggg 10620gcctgaactg gagatcagct
gtggatctcc agaagaggga ctagtggtta gagga 10675510676DNAZika virus
5gttgttactg ttgctgactc agactgcgac agttcgagtt tgaagcgaaa gctagcaaca
60gtatcaacag gttttatttg gatttggaaa cgagagtttc tggtcatgaa aaacccaaaa
120aagaaatccg gaggattccg gattgtcaat atgctaaaac gcggagtagc
ccgtgtgagc 180ccctttgggg gcttgaagag gctgccagcc ggacttctgc
tgggtcatgg gcccatcagg 240atggtcttgg caattctagc ctttttgaga
ttcacggcaa tcaagccatc actgggtctc 300atcaatagat ggggttcagt
ggggaaaaaa gaggctatgg aaataataaa gaagttcaag 360aaagatctgg
ctgccatgct gagaataatc aatgctagga aggagaagaa gagacgaggc
420gcagatacta gtgtcggaat tgttggcctc ctgctgacca cagctatggc
agcggaggtc 480actagacgtg ggagtgcata ctatatgtac ttggacagaa
acgatgctgg ggaggccata 540tcttttccaa ccacattggg gatgaataag
tgttatatac agatcatgga tcttggacac 600atgtgtgatg ccaccatgag
ctatgaatgc cctatgctgg atgagggggt ggaaccagat 660gacgtcgatt
gttggtgcaa cacgacgtca acttgggttg tgtacggaac ctgccatcac
720aaaaaaggtg aagcacggag atctagaaga gctgtgacgc tcccctccca
ttccactagg 780aagctgcaaa cgcggtcgca aacctggttg gaatcaagag
aatacacaaa gcacttgatt 840agagtcgaaa attggatatt caggaaccct
ggcttcgcgt tagcagcagc tgccatcgct 900tggcttttgg gaagctcaac
gagccaaaaa gtcatatact tggtcatgat actgctgatt 960gccccggcat
acagcatcag gtgcatagga gtcagcaata gggactttgt ggaaggtatg
1020tcaggtggga cttgggttga tgttgtcttg gaacatggag gttgtgtcac
cgtaatggca 1080caggacaaac cgactgtcga catagagctg gttacaacaa
cagtcagcaa catggcggag 1140gtaagatcct actgctatga ggcatcaata
tcagacatgg cttcggacag ccgctgccca 1200acacaaggtg aagcctacct
tgacaagcaa tcagacactc aatatgtctg caaaagaacg 1260ttagtggaca
gaggctgggg aaatggatgt ggactttttg gcaaagggag tctggtgaca
1320tgcgctaagt ttgcatgctc caagaaaatg accgggaaga gcatccagcc
agagaatctg 1380gagtaccgga taatgctgtc agttcatggc tcccagcaca
gtgggatgat cgttaatgac 1440acaggacatg aaactgatga gaatagagcg
aaggttgaga taacgcccaa ttcaccaaga 1500gccgaagcca ccctgggggg
ttttggaagc ctaggacttg attgtgaacc gaggacaggc 1560cttgactttt
cagatttgta ttacttgact atgaataaca agcactggtt ggttcacaag
1620gagtggttcc acgacattcc attaccttgg cacgctgggg cagacaccgg
aactccacac 1680tggaacaaca aagaagcact ggtagagttc aaggacgcac
atgccaaaag gcaaactgtc 1740gtggttctag ggagtcaaga aggagcagtt
cacacggccc ttgctggagc tctggaggct 1800gagatggatg gtgcaaaggg
aaggctgtcc tctggccact tgaaatgtcg cctgaaaatg 1860gataaactta
gattgaaggg cgtgtcatac tccttgtgta ccgcagcgtt cacattcacc
1920aagatcccgg ctgaaacact gcacgggaca gtcacagtgg aggtacagta
cgcagggaca 1980gatggacctt gcaaggttcc agctcagatg gcggtggaca
tgcaaactct gaccccagtt 2040gggaggttga taaccgctaa ccccgtaatc
actgaaagca ctgagaactc taagatgatg 2100ctggaacttg atccaccatt
tggggactct tacattgtca taggagtcgg ggagaagaag 2160atcacccacc
actggcacag gagtggcagc accattggaa aagcatttga agccactgtg
2220agaggtgcca agagaatggc agtcttggga gacacagcct gggactttgg
atcagttgga 2280ggcgctctca actcattggg caagggcatc catcaaattt
ttggagcagc tttcaaatca 2340ttgtttggag gaatgtcctg gttctcacaa
attctcattg gaacgttgct gatgtggttg 2400ggtctgaaca caaagaatgg
atctatttcc cttatgtgct tggccttagg gggagtgttg 2460atcttcttat
ccacagccgt ctctgctgat gtggggtgct cggtggactt ctcaaagaag
2520gagacgagat gcggtacagg ggtgttcgtc tataacgacg ttgaagcctg
gagggacagg 2580tacaagtacc atcctgactc cccccgtaga ttggcagcag
cagtcaagca agcctgggaa 2640gatggtatct gcgggatctc ctctgtttca
agaatggaaa acatcatgtg gagatcagta 2700gaaggggagc tcaacgcaat
cctggaagag aatggagttc aactgacggt cgttgtggga 2760tctgtaaaaa
accccatgtg gagaggtcca cagagattgc ccgtgcctgt gaacgagctg
2820ccccacggct ggaaggcttg ggggaaatcg cacttcgtca gagcagcaaa
gacaaataac 2880agctttgtcg tggatggtga cacactgaag gaatgcccac
tcaaacatag agcatggaac 2940agctttcttg tggaggatca tgggttcggg
gtatttcaca ctagtgtctg gctcaaggtt 3000agagaagatt attcattaga
gtgtgatcca gccgttattg gaacagctgt taagggaaag 3060gaggctgtac
acagtgatct aggctactgg attgagagtg agaagaatga cacatggagg
3120ctgaagaggg cccatctgat cgagatgaaa acatgtgaat ggccaaagtc
ccacacattg 3180tggacagatg gaatagaaga gagtgatctg atcataccca
agtctttagc tgggccactc 3240agccatcaca ataccagaga gggctacagg
acccaaatga aagggccatg gcacagtgaa 3300gagcttgaaa ttcggtttga
ggaatgccca ggcactaagg tccacgtgga ggaaacatgt 3360ggaacaagag
gaccatctct gagatcaacc actgcaagcg gaagggtgat cgaggaatgg
3420tgctgcaggg agtgcacaat gcccccactg tcgttccggg ctaaagatgg
ctgttggtat 3480ggaatggaga taaggcccag gaaagaacca gaaagcaact
tagtaaggtc aatggtgact 3540gcaggatcaa ctgatcacat ggatcacttc
tcccttggag tgcttgtgat tctgctcatg 3600gtgcaggaag ggctgaagaa
gagaatgacc acaaagatca tcataagcac atcaatggca 3660gtgctggtag
ctatgatcct gggaggattt tcaatgagtg acctggctaa gcttgcaatt
3720ttgatgggtg ccaccttcgc ggaaatgaac actggaggag atgtagctca
tctggcgctg 3780atagcggcat tcaaagtcag accagcgttg ctggtatctt
tcatcttcag agctaattgg 3840acaccccgtg aaagcatgct gctggccttg
gcctcgtgtc ttttgcaaac tgcgatctcc 3900gccttggaag gcgacctgat
ggttctcatc aatggttttg ctttggcctg gttggcaata 3960cgagcgatgg
ttgttccacg cactgataac atcaccttgg caatcctggc tgctctgaca
4020ccactggccc ggggcacact gcttgtggcg tggagagcag gccttgctac
ttgcgggggg 4080tttatgctcc tctctctgaa gggaaaaggc agtgtgaaga
agaacttacc atttgtcatg 4140gccctgggac taaccgctgt gaggctggtc
gaccccatca acgtggtggg gctgctgttg 4200ctcacaagga gtgggaagcg
gagctggccc cctagcgaag tactcacagc tgttggcctg 4260atatgcgcat
tggctggagg gttcgccaag gcagatatag agatggctgg gcccatggcc
4320gcggtcggtc tgctaattgt cagttacgtg gtctcaggaa agagtgtgga
catgtacatt 4380gaaagagcag gtgacatcac atgggaaaaa gatgcggaag
tcactggaaa cagtccccgg 4440ctcgatgtgg cgctagatga gagtggtgat
ttctccctgg tggaggatga cggtcccccc 4500atgagagaga tcatactcaa
ggtggtcctg atgaccatct gtggcatgaa cccaatagcc 4560ataccctttg
cagctggagc gtggtacgta tacgtgaaga ctggaaaaag gagtggtgct
4620ctatgggatg tgcctgctcc caaggaagta aaaaaggggg agaccacaga
tggagtgtac 4680agagtaatga ctcgtagact gctaggttca acacaagttg
gagtgggagt tatgcaagag 4740ggggtctttc acactatgtg gcacgtcaca
aaaggatccg cgctgagaag cggtgaaggg 4800agacttgatc catactgggg
agatgtcaag caggatctgg tgtcatactg tggtccatgg 4860aagctagatg
ccgcctggga cgggcacagc gaggtgcagc tcttggccgt gccccccgga
4920gagagagcga ggaacatcca gactctgccc ggaatattta agacaaagga
tggggacatt 4980ggagcggttg cgctggatta cccagcagga acttcaggat
ctccaatcct agacaagtgt 5040gggagagtga taggacttta tggcaatggg
gtcgtgatca aaaatgggag ttatgttagt 5100gccatcaccc aagggaggag
ggaggaagag actcctgttg agtgcttcga gccttcgatg 5160ctgaagaaga
agcagctaac tgtcttagac ttgcatcctg gagctgggaa aaccaggaga
5220gttcttcctg aaatagtccg tgaagccata aaaacaagac tccgtactgt
gatcttagct 5280ccaaccaggg ttgtcgctgc tgaaatggag gaagccctta
gagggcttcc agtgcgttat 5340atgacaacag cagtcaatgt cacccactct
ggaacagaaa tcgtcgactt aatgtgccat 5400gccaccttca cttcacgtct
actacagcca atcagagtcc ccaactataa tctgtatatt 5460atggatgagg
cccacttcac agatccctca agtatagcag caagaggata catttcaaca
5520agggttgaga tgggcgaggc ggctgccatc ttcatgaccg ccacgccacc
aggaacccgt 5580gacgcatttc cggactccaa ctcaccaatt atggacaccg
aagtggaagt cccagagaga 5640gcctggagct caggctttga ttgggtgacg
gattattctg gaaaaacagt ttggtttgtt 5700ccaagcgtga ggaacggcaa
tgagatcgca gcttgtctga caaaggctgg aaaacgggtc 5760atacagctca
gcagaaagac ttttgagaca gagttccaga aaacaaaaca tcaagagtgg
5820gactttgtcg tgacaactga catttcagag atgggcgcca actttaaagc
tgaccgtgtc 5880atagattcca ggagatgcct aaagccggtc atacttgatg
gcgagagagt cattctggct 5940ggacccatgc ctgtcacaca tgccagcgct
gcccagagga gggggcgcat aggcaggaat 6000cccaacaaac ctggagatga
gtatctgtat ggaggtgggt gcgcagagac tgacgaagac 6060catgcacact
ggcttgaagc aagaatgctc cttgacaata tttacctcca agatggcctc
6120atagcctcgc tctatcgacc tgaggccgac aaagtagcag ccattgaggg
agagttcaag 6180cttaggacgg agcaaaggaa gacctttgtg gaactcatga
aaagaggaga tcttcctgtt 6240tggctggcct atcaggttgc atctgccgga
ataacctaca cagatagaag atggtgcttt 6300gatggcacga ccaacaacac
cataatggaa gacagtgtgc cggcagaggt gtggaccaga 6360cacggagaga
aaagagtgct caaaccgagg tggatggacg ccagagtttg ttcagatcat
6420gcggccctga agtcattcaa ggagtttgcc gctgggaaaa gaggagcggc
ttttggagtg 6480atggaagccc tgggaacact gccaggacac atgacagaga
gattccagga agccattgac 6540aacctcgctg tgctcatgcg ggcagagact
ggaagcaggc cttacaaagc cgcggcggcc 6600caattgccgg agaccctaga
gaccattatg cttttggggt tgctgggaac agtctcgctg 6660ggaatctttt
tcgtcttgat gaggaacaag ggcataggga agatgggctt tggaatggtg
6720actcttgggg ccagcgcatg gctcatgtgg ctctcggaaa ttgagccagc
cagaattgca 6780tgtgtcctca ttgttgtgtt cctattgctg gtggtgctca
tacctgagcc agaaaagcaa 6840agatctcccc aggacaacca aatggcaatc
atcatcatgg tagcagtagg tcttctgggc 6900ttgattaccg ccaatgaact
cggatggttg gagagaacaa agagtgacct aagccatcta 6960atgggaagga
gagaggaggg ggcaaccatg ggattctcaa tggacattga cctgcggcca
7020gcctcagctt gggccatcta tgctgccttg acaactttca ttaccccagc
cgtccaacat 7080gcagtgacca cttcatacaa caactactcc ttaatggcga
tggccacgca agctggagtg 7140ttgtttggta tgggcaaagg gatgccattc
tacgcatggg actttggagt cccgctgcta 7200atgataggtt gctactcaca
attaacgccc ctgaccctaa tagtggccat cattttgctc 7260gtggcgcact
acatgtactt gatcccaggg ctgcaggcag cagctgcgcg tgctgcccag
7320aagagaacgg cagctggcat catgaagaac cctgttgtgg atggaatagt
ggtgactgac 7380attgacacaa tgacaattga cccccaagtg gagaaaaaga
tgggacaggt gctactcatg 7440gcagtagccg tctccagcgc catactgtcg
cggaccgcct gggggtgggg ggaggctggg 7500gccctgatca cagccgcaac
ttccactttg tgggaaggct ctccgaacaa gtactggaac 7560tcctctacag
ccacttcact gtgtaacatt tttaggggaa gttacttggc tggagcttct
7620ctaatctaca cagtaacaag aaacgctggc ttggtcaaga gacgtggggg
tggaacagga 7680gagaccctgg gagagaaatg gaaggcccgc ttgaaccaga
tgtcggccct ggagttctac 7740tcctacaaaa agtcaggcat caccgaggtg
tgcagagaag aggcccgccg cgccctcaag 7800gacggtgtgg caacgggagg
ccatgctgtg tcccgaggaa gtgcaaagct gagatggttg 7860gtggagcggg
gatacctgca gccctatgga aaggtcattg atcttggatg tggcagaggg
7920ggctggagtt actacgccgc caccatccgc aaagttcaag aagtgaaagg
atacacaaaa 7980ggaggccctg gtcatgaaga acccgtgttg gtgcaaagct
atgggtggaa catagtccgt 8040cttaagagtg gggtggacgt ctttcatatg
gcggctgagc cgtgtgacac gttgctgtgt 8100gacataggtg agtcatcatc
tagtcctgaa gtggaagaag cacggacgct cagagtcctc 8160tccatggtgg
gggattggct tgaaaaaaga ccaggagcct tttgtataaa agtgttgtgc
8220ccatacacca gcactatgat ggaaaccctg gagcgactgc agcgtaggta
tgggggagga 8280ctggtcagag tgccactctc ccgcaactct acacatgaga
tgtactgggt ctctggagcg 8340aaaagcaaca ccataaaaag tgtgtccacc
acgagccagc tcctcttggg gcgcatggac 8400gggcctagga ggccagtgaa
atatgaggag gatgtgaatc tcggctctgg cacgcgggct 8460gtggtaagct
gcgctgaagc tcccaacatg aagatcattg gtaaccgcat tgaaaggatc
8520cgcagtgagc acgcggaaac gtggttcttt gacgagaacc acccatatag
gacatgggct 8580taccatggaa gctatgaggc ccccacacaa gggtcagcgt
cctctctaat aaacggggtt 8640gtcaggctcc tgtcaaaacc ctgggatgtg
gtgactggag tcacaggaat agccatgacc 8700gacaccacac cgtatggtca
gcaaagagtt ttcaaggaaa aagtggacac tagggtgcca 8760gacccccaag
aaggcactcg tcaggttatg agcatggtct cttcctggtt gtggaaagag
8820ctaggcaaac acaaacggcc acgagtctgt accaaagaag agttcatcaa
caaggttcgt 8880agcaatgcag cattaggggc aatatttgaa gaggaaaaag
agtggaagac tgcagtggaa 8940gctgtgaacg atccaaggtt ctgggctcta
gtggacaagg aaagagagca ccacctgaga 9000ggagagtgcc agagttgtgt
gtacaacatg atgggaaaaa gagaaaagaa acaaggggaa 9060tttggaaagg
ccaagggcag ccgcgccatc tggtatatgt ggctaggggc tagatttcta
9120gagttcgaag cccttggatt cttgaacgag gatcactgga tggggagaga
gaactcagga 9180ggtggtgttg aagggctggg attacaaaga ctcggatatg
tcctagaaga gatgagtcgc 9240ataccaggag gaaggatgta tgcagatgac
actgctggct gggacacccg catcagcagg 9300tttgatctgg agaatgaagc
tctaatcacc aaccaaatgg agaaagggca cagggccttg 9360gcattggcca
taatcaagta cacataccaa aacaaagtgg taaaggtcct tagaccagct
9420gaaaaaggga agacagttat ggacattatt tcgagacaag accaaagggg
gagcggacaa 9480gttgtcactt acgctcttaa cacatttacc aacctagtgg
tgcaactcat tcggaatatg 9540gaggctgagg aagttctaga gatgcaagac
ttgtggctgc tgcggaggtc agagaaagtg 9600accaactggt tgcagagcaa
cggatgggat aggctcaaac gaatggcagt cagtggagat 9660gattgcgttg
tgaagccaat tgatgatagg tttgcacatg ccctcaggtt cttgaatgat
9720atgggaaaag ttaggaagga cacacaagag tggaaaccct caactggatg
ggacaactgg 9780gaagaagttc cgttttgctc ccaccacttc aacaagctcc
atctcaagga cgggaggtcc 9840attgtggttc cctgccgcca ccaagatgaa
ctgattggcc gggcccgcgt ctctccaggg 9900gcgggatgga gcatccggga
gactgcttgc ctagcaaaat catatgcgca aatgtggcag 9960ctcctttatt
tccacagaag ggacctccga ctgatggcca atgccatttg ttcatctgtg
10020ccagttgact gggttccaac tgggagaact acctggtcaa tccatggaaa
gggagaatgg 10080atgaccactg aagacatgct tgtggtgtgg aacagagtgt
ggattgagga gaacgaccac 10140atggaagaca agaccccagt tacgaaatgg
acagacattc cctatttggg aaaaagggaa 10200gacttgtggt gtggatctct
catagggcac agaccgcgca ccacctgggc tgagaacatt 10260aaaaacacag
tcaacatggt gcgcaggatc ataggtgatg aagaaaagta catggactac
10320ctatccaccc aagttcgcta cttgggtgaa gaagggtcta cacctggagt
gctgtaagca 10380ccaatcttaa tgttgtcagg cctgctagtc agccacagct
tggggaaagc tgtgcagcct 10440gtgacccccc caggagaagc tgggaaacca
agcctatagt caggccgaga acgccatggc 10500acggaagaag ccatgctgcc
tgtgagcccc tcagaggaca ctgagtcaaa aaaccccacg 10560cgcttggagg
cgcaggatgg gaaaagaagg tggcgacctt ccccaccctt caatctgggg
10620cctgaactgg agatcagctg tggatctcca gaagagggac tagtggttag aggaga
10676610808DNAZika virus 6agttgttgat ctgtgtgaat cagactgcga
cagttcgagt ttgaagcgaa agctagcaac 60agtatcaaca ggttttattt tggatttgga
aacgagagtt tctggtcatg aaaaacccaa 120aaaagaaatc cggaggattc
cggattgtca atatgctaaa acgcggagta gcccgtgtga 180gcccctttgg
gggcttgaag aggctgccag ccggacttct gctgggtcat gggcccatca
240ggatggtctt ggcaattcta gcctttttga gattcacggc aatcaagcca
tcactgggtc 300tcatcaatag atggggttca gtggggaaaa aagaggctat
ggaaataata aagaagttca 360agaaagatct ggctgccatg ctgagaataa
tcaatgctag gaaggagaag aagagacgag 420gcgcagatac tagtgtcgga
attgttggcc tcctgctgac cacagctatg gcagcggagg 480tcactagacg
tgggagtgca tactatatgt acttggacag aaacgatgct ggggaggcca
540tatcttttcc aaccacattg gggatgaata agtgttatat acagatcatg
gatcttggac 600acatgtgtga tgccaccatg agctatgaat gccctatgct
ggatgagggg gtggaaccag 660atgacgtcga ttgttggtgc aacacgacgt
caacttgggt tgtgtacgga acctgccatc 720acaaaaaagg tgaagcacgg
agatctagaa gagctgtgac gctcccctcc cattccacta 780ggaagctgca
aacgcggtcg caaacctggt tggaatcaag agaatacaca aagcacttga
840ttagagtcga aaattggata ttcaggaacc ctggcttcgc gttagcagca
gctgccatcg 900cttggctttt gggaagctca acgagccaaa aagtcatata
cttggtcatg atactgctga 960ttgccccggc atacagcatc aggtgcatag
gagtcagcaa tagggacttt gtggaaggta 1020tgtcaggtgg gacttgggtt
gatgttgtct tggaacatgg aggttgtgtc accgtaatgg 1080cacaggacaa
accgactgtc gacatagagc tggttacaac aacagtcagc aacatggcgg
1140aggtaagatc ctactgctat gaggcatcaa tatcagacat ggcttcggac
agccgctgcc 1200caacacaagg tgaagcctac cttgacaagc aatcagacac
tcaatatgtc tgcaaaagaa 1260cgttagtgga cagaggctgg ggaaatggat
gtggactttt tggcaaaggg agcctggtga 1320catgcgctaa gtttgcatgc
tccaagaaaa tgaccgggaa gagcatccag ccagagaatc 1380tggagtaccg
gataatgctg tcagttcatg gctcccagca cagtgggatg atcgttaatg
1440acacaggaca tgaaactgat gagaatagag cgaaggttga gataacgccc
aattcaccaa 1500gagccgaagc caccctgggg ggttttggaa gcctaggact
tgattgtgaa ccgaggacag 1560gccttgactt ttcagatttg tattacttga
ctatgaataa caagcactgg ttggtccaca 1620aggagtggtt ccacgacatt
ccattacctt ggcacgctgg ggcagacacc ggaactccac 1680actggaacaa
caaagaagca ctggtagagt tcaaggacgc acatgccaaa aggcaaactg
1740tcgtggttct agggagtcaa gaaggagcag ttcacacggc ccttgctgga
gctctggagg 1800ctgagatgga tggtgcaaag ggaaggctgt cctctggcca
cttgaaatgt cgcctgaaaa 1860tggataaact tagattgaag ggcgtgtcat
actccttgtg taccgcagcg ttcacattca 1920ccaagatccc ggctgaaaca
ctgcacggga cagtcacagt ggaggtacag tacgcaggga 1980cagatggacc
ttgcaaggtt
ccagctcaga tggcggtgga catgcaaact ctgaccccag 2040ttgggaggtt
gataaccgct aaccccgtaa tcactgaaag cactgagaac tctaagatga
2100tgctggaact tgatccacca tttggggact cttacattgt cataggagtc
ggggagaaga 2160agatcaccca ccactggcac aggagtggca gcaccattgg
aaaagcattt gaagccactg 2220tgagaggtgc caagagaatg gcagtcttgg
gagacacagc ctgggacttt ggatcagttg 2280gaggcgctct caactcattg
ggcaagggca tccatcaaat ttttggagca gctttcaaat 2340cattgtttgg
aggaatgtcc tggttctcac aaatcctcat tggaacgttg ctgatgtggt
2400tgggtctgaa cacaaagaat ggatctattt cccttatgtg cttggcctta
gggggagtgt 2460tgatcttctt atccacagcc gtctctgctg atgtggggtg
ctcggtggac ttctcaaaga 2520aggagacgag atgcggtaca ggggtgttcg
tctataacga cgttgaagcc tggagggaca 2580ggtacaagta ccatcctgac
tccccccgta gattggcagc agcagtcaag caagcctggg 2640aagatggtat
ctgcgggatc tcctctgttt caagaatgga gaacatcatg tggagatcag
2700tagaagggga gctcaacgca atcttggaag agaatggagt tcaactgacg
gtcgttgtgg 2760gatctgtaaa aaaccccatg tggagaggtc cacagagatt
gcccgtgcct gtgaacgagc 2820tgccccacgg ctggaaggct tgggggaaat
cgtacttcgt cagagcagca aagacaaata 2880acagctttgt cgtggatggt
gacacactga aggaatgccc actcgaacat agagcatgga 2940acagctttct
tgtggaggat catgggttcg gggtatttca cactagtgtc tggctcaagg
3000ttagagaaga ttattcatta gagtgtgatc cagccgttat tggaacagct
gttaagggga 3060aggaggctgt acacagtgat ctaggctact ggattgagag
tgagaagaat gacacatgga 3120ggctgaagag ggcccatcta atcgagatga
aaacatgtga atggccaaag tcccacacat 3180tgtgggcaga tggaatagaa
gagagtgatc tgatcattcc caagtcttta gctgggccac 3240tcagccatca
caataccaga gagggctaca ggacccaaat gaaagggcca tggcacagtg
3300aagagcttga aattcggttt gaggaatgcc cgggcactaa ggtccacgtg
gaggaaacat 3360gtggaacaag aggaccatct ctgagatcaa ccactgcaag
cggaagggtg atcgaggaat 3420ggtgctgcag ggagtgcaca atgcccccac
tgtcgttccg ggctaaagat ggctgttggt 3480atggaatgga gataaggccc
aggaaagaac cagaaagcaa cttagtaagg tcagtggtga 3540ctgcaggatc
aactgatcac atggatcact tctcccttgg agtgcttgtg attctgctca
3600tggtgcagga agggctgaag aagagaatga ccacaaagat catcataagc
acatcaatgg 3660cagtgctggt agctatgatc ctgggaggat tttcaatgag
tgacctggct aagcttgcaa 3720ttttgatggg cgccaccttc gcggaaatga
acactggagg agatgtagct catctggcgc 3780tgatagcggc attcaaagtc
agaccagcgt tgctggtatc tttcatcttc agagctaatt 3840ggacaccccg
tgaaagcatg ctgctggcct tggcctcgtg tcttttgcaa actgcgatct
3900ccgccttgga aggcgacctg atggttctca tcaatggttt tgctttggcc
tggttggcaa 3960tacgagcgat ggttgttcca cgcactgata acatcacctt
ggcaatcctg gctgctctga 4020caccactggc ccggggcaca ctgcttgtgg
cgtggagagc aggccttgct acttgcgggg 4080ggtttatgct cctctctctg
aagggaaaag gcagtgtgaa gaagaactta ccatttgtca 4140tggccctggg
actaaccgct gtgaggctgg tcgaccccat caacgtggtg ggactgctgt
4200tgctcacaag gagtgggaag cggagctggc cccctagcga agtactcaca
gctgttggcc 4260tgatatgcgc attggctgga gggttcgcca aggcagatat
agagatggct gggcccatgg 4320ccgcggtcgg tctgctaatt gtcagttacg
tggtctcagg aaagagtgtg gacatgtaca 4380ttgaaagagc aggtgacatc
acatgggaaa aagatgcgga agtcactgga aacagtcccc 4440ggctcgatgt
ggcgctagat gagagtggtg atttctccct ggtggaggat gacggtcccc
4500ccatgagaga gatcatactc aaggtggtcc tgatgaccat ctgtggcatg
aacccaatag 4560ccataccctt tgcagctgga gcgtggtacg tatacgtgaa
gactggaaaa aggagtggtg 4620ctctatggga tgtgcctgct cccaaggaag
taaaaaaggg ggagaccaca gatggagtgt 4680acagagtaat gactcgtaga
ctgctaggtt caacacaagt tggagtggga gttatgcaag 4740agggggtctt
tcacactatg tggcacgtca caaaaggatc cgcgctgaga agcggtgaag
4800ggagacttga tccatactgg ggagatgtca agcaggatct ggtgtcatac
tgtggtccat 4860ggaagctaga tgccgcctgg gacgggcaca gcgaggtgca
gctcttggcc gtgccccccg 4920gagagagagc gaggaacatc cagactctgc
ccggaatatt taagacaaag gatggggaca 4980ttggagcggt tgcgctggat
tacccagcag gaacttcagg atctccaatc ctagacaagt 5040gtgggagagt
gataggactt tatggcaatg gggtcgtgat caaaaatggg agttatgtta
5100gtgccatcac ccaagggagg agggaggaag agactcctgt tgagtgcttc
gagccttcga 5160tgctgaagaa gaagcagcta actgtcttag acttgcatcc
tggagctggg aaaaccagga 5220gagttcttcc tgaaatagtc cgtgaagcca
taaaaacaag actccgtact gtgatcttag 5280ctccaaccag ggttgtcgct
gctgaaatgg aggaagccct tagagggctt ccagtgcgtt 5340atatgacaac
agcagtcaat gtcacccact ctggaacaga aatcgtcgac ttaatgtgcc
5400atgccacctt cacttcacgt ctactacagc caatcagagt ccccaactat
aatctgtata 5460ttatggatga ggcccacttc acagatccct caagtatagc
agcaagagga tacatttcaa 5520caagggttga gatgggcgag gcggctgcca
tcttcatgac cgccacgcca ccaggaaccc 5580gtgacgcatt tccggactcc
aactcaccaa ttatggacac cgaagtggaa gtcccagaga 5640gagcctggag
ctcaggcttt gattgggtga cggatcattc tggaaaaaca gtttggtttg
5700ttccaagcgt gaggaacggc aatgagatcg cagcttgtct gacaaaggct
ggaaaacggg 5760tcatacagct cagcagaaag acttttgaga cagagttcca
gaaaacaaaa catcaagagt 5820gggactttgt cgtgacaact gacatttcag
agatgggcgc caactttaaa gctgaccgtg 5880tcatagattc caggagatgc
ctaaagccgg tcatacttga tggcgagaga gtcattttgg 5940ctggacccat
gcctgtcaca catgccagcg ctgcccagag gagggggcgc ataggcagga
6000atcccaacaa acctggagat gagtatctgt atggaggtgg gtgcgcagag
actgacgaag 6060accatgcaca ctggcttgaa gcaagaatgc tccttgacaa
tatttacctc caagatggcc 6120tcatagcctc gctctatcga cctgaggccg
acaaagtagc agccattgag ggagagttca 6180agcttaggac ggagcaaagg
aagacctttg tggaactcat gaaaagagga gatcttcctg 6240tttggctggc
ctatcaggtt gcatctgccg gaataaccta cacagataga agatggtgct
6300ttgatggcac gaccaacaac accataatgg aagacagtgt gccggcagag
gtgtggacca 6360gacacggaga gaaaagagtg ctcaaaccga ggtggatgga
cgccagagtt tgttcagatc 6420atgcggccct gaagtcattc aaggagtttg
ccgctgggaa aagaggagcg gcttttggag 6480tgatggaagc cctgggaaca
ctgccaggac acatgacaga gagattccag gaagccattg 6540acaacctcgc
tgtgctcatg cgggcagaga ctggaagcag gccttacaaa gccgcggcgg
6600cccaattgcc ggagacccta gagaccatta tgcttttggg gttgctggga
acagtctcgc 6660tgggaatctt tttcgtcttg atgaggaaca agggcatagg
gaagatgggc tttggaatgg 6720tgactcttgg ggccagcgca tggctcatgt
ggctctcgga aattgagcca gccagaattg 6780catgtgtcct cattgttgtg
ttcctattgc tggtggtgct catacctgag ccagaaaagc 6840aaagatctcc
ccaggacaac caaatggcaa tcatcatcat ggtagcagta ggtcttctgg
6900gcttgattac cgccaatgaa ctcggatggt tggagagaac aaagagtgac
ctaagccatc 6960taatgggaag gagagaggag ggagcaacca taggattctc
aatggacatt gacctgcggc 7020cagcctcagc ttgggccatc tatgctgcct
tgacaacttt cattacccca gccgtccaac 7080atgcagtgac cacttcatac
aacaactact ccttaatggc gatggccacg caagctggag 7140tgttgtttgg
tatgggcaaa gggatgccat tctacgcatg ggactttgga gtcccgctgc
7200taatgatagg ttgctactca caattaacac ccctgaccct aatagtggcc
atcattttgc 7260tcgtggcgca ctacatgtac ttgatcccag ggctgcaggc
agcagctgcg cgtgctgccc 7320agaagagaac ggcagctggc atcatgaaga
accctgttgt ggatggaata gtggtgactg 7380acattgacac aatgacaatt
gacccccaag tggagaaaaa gatgggacag gtgctactca 7440tagcagtagc
agtctccagc gccatactgt cgcggaccgc ctgggggtgg ggggaggctg
7500gggccctgat cacagccgca acttccactt tgtgggaagg ctctccgaac
aagtactgga 7560actcctctac agccacttca ctgtgtaaca tttttagggg
aagttacttg gctggagctt 7620ctctaatcta catagtaaca agaaacgctg
gcttggtcaa gagacgtggg ggtggaacag 7680gagagaccct gggagagaaa
tggaaggccc gcttgaacca gatgtcggcc ctggagttct 7740actcctacaa
aaagtcaggc atcaccgagg tgtgcagaga agaggcccgc cgcgccctca
7800aggatggtgt ggcaacggga ggccatgctg tgtcccgagg aagtgcaaag
ctgagatggt 7860tggtggagcg gggatacctg cagccctatg gaaaggtcat
tgatcttgga tgtggcagag 7920ggggctggag ttactacgcc gccaccatcc
gcaaagttca agaagtgaaa ggatacacaa 7980aaggaggccc tggtcatgaa
gaacccgtgt tggtgcaaag ctatgggtgg aacatagtcc 8040gtcttaagag
tggggtggac gtctttcata tggcggctga gccgtgtgac acgttgctgt
8100gtgacatagg tgagtcatca tctagtcctg aagtggaaga agcacggacg
ctcagagtcc 8160tctccatggt gggggattgg cttgaaaaaa gaccaggagc
cttttgtata aaagtgttgt 8220gcccatacac cagcactatg atggaaaccc
tggagcgact gcagcgtagg tatgggggag 8280gactggtcag agtgccactc
tcccgcaact ctacacatga gatgtactgg gtctctggag 8340cgaaaagcaa
caccataaaa agtgtgtcca ccacgagcca gctcctcttg gggcgcatgg
8400acgggcctag gaggccagtg aaatatgagg aggatgtgaa tctcggctct
ggcacgcggg 8460ctgtggtaag ctgcgctgaa gctcccaaca tgaagatcat
tggtaaccgc attgaaagga 8520tccgcagtga gcacgcggaa acgtggttct
ttgacgagaa ccacccatat aggacatggg 8580cttaccatgg aagctatgag
gcccccacac aagggtcagc gtcctctcta ataaacgggg 8640ttgtcaggct
cctgtcaaaa ccctgggatg tggtgactgg agtcacagga atagccatga
8700ccgacaccac accgtatggt cagcaaagag ttttcaagga aaaagtggac
actagggtgc 8760cagaccccca agaaggcact cgtcaggtta tgagcatggt
ctcttcctgg ttgtggaaag 8820agctaggcaa acacaaacgg ccacgagtct
gtaccaaaga agagttcatc aacaaggttc 8880gtagcaatgc agcattaggg
gcaatatttg aagaggaaaa agagtggaag actgcagtgg 8940aagctgtgaa
cgatccaagg ttctgggctc tagtggacaa ggaaagagag caccacctga
9000gaggagagtg ccagagttgt gtgtacaaca tgatgggaaa aagagaaaag
aaacaagggg 9060aatttggaaa ggccaagggc agccgcgcca tctggtatat
gtggctaggg gctagatttc 9120tagagttcga agcccttgga ttcttgaacg
aggatcactg gatggggaga gagaactcag 9180gaggtggtgt tgaagggctg
ggattacaaa gactcggata tgtcctagaa gagatgagtc 9240gcataccagg
aggaaggatg tatgcagatg acactgctgg ctgggacacc cgcatcagca
9300ggttcgatct ggagaatgaa gctctaatca ccaaccaaat ggagaaaggg
catagggcct 9360tggcattggc cataatcaag tacacatacc aaaacaaagt
ggtaaaggtc cttagaccag 9420ctgaaaaagg gaaaacagtt atggacatta
tttcgagaca agaccaaagg gggagcggac 9480aagttgtcac ttacgctctt
aacacattta ccaacctagt ggtgcaactc attcggaata 9540tggaggctga
ggaagttcta gagatgcaag acttgtggct gctgcggagg tcagagaaag
9600tgaccaactg gttgcagagc aacggatggg ataggctcaa acgaatggca
gtcagtggag 9660atgattgcgt tgtgaagcca attgatgata ggtttgcaca
tgccctcagg ttcttgaatg 9720atatgggaaa agttaggaag gacacacaag
agtggaaacc ctcaactgga tgggacaact 9780gggaagaagt tccgttttgc
tcccaccact tcaacaagct ccatctcaag gacgggaggt 9840ccattgtggt
tccctgccgc caccaagatg aactgattgg ccgggcccgc gtctctccag
9900gggcgggatg gagcatccgg gagactgctt gcctagcaaa atcatatgcg
caaatgtggc 9960agctccttta tttccacaga agggacctcc gactgatggc
caatgccatt tgttcatctg 10020tgccagttga ctgggttcca actgggagaa
ctacctggtc aatccatgga aagggagaat 10080ggatgaccac tgaagacatg
cttgtggtgt ggaacagagt gtggattgag gagaacgacc 10140acatggaaga
caagacccca gttacgaaat ggacagacat tccctatttg ggaaaaaggg
10200aagacttgtg gtgtggatct ctcatagggc acagaccgcg caccacctgg
gctgagaaca 10260ttaaaaacac agtcaacatg gtgcgcagga tcataggtga
tgaagaaaag tacatggact 10320acctatccac ccaagttcgc tacttgggtg
aagaagggtc tacacctgga gtgctgtaag 10380caccaatctt aatgttgtca
ggcctgctag tcagccacag cttggggaaa gctgtgcagc 10440ctgtgacccc
cccaggagaa gctgggaaac caagcctata gtcaggccga gaacgccatg
10500gcacggaaga agccatgctg cctgtgagcc cctcagagga cactgagtca
aaaaacccca 10560tgcgcttgga ggcgcaggat gggaaaagaa ggtggcgacc
ttccccaccc ttcaatctgg 10620ggcctgaact ggagatcagc tgtggatctc
cagaagaggg actagtggtt agaggagacc 10680ccccggaaaa cgcaaaacag
catattgacg ctgggaaaga ccagagactc catgagtttc 10740caccacgctg
gccgccaggc acagatcgcc gaatagcggc ggccggtgtg gggaaatcca
10800tgggtctt 10808710807DNAZika virus 7agttgttgat ctgtgtgaat
cagactgcga cagttcgagt ttgaagcgaa agctagcaac 60agtatcaaca ggttttattt
tggatttgga aacgagagtt tctggtcatg aaaaacccaa 120aaaagaaatc
cggaggattc cggattgtca atatgctaaa acgcggagta gcccgtgtga
180gcccctttgg gggcttgaag aggctgccag ccggacttct gctgggtcat
gggcccatca 240ggatggtctt ggcgattcta gcctttttga gattcacggc
aatcaagcca tcactgggtc 300tcatcaatag atggggttca gtgggaaaaa
aagaggctat ggaaataata aagaagttca 360agaaagatct ggctgccatg
ctgagaataa tcaatgctag gaaggagaag aagagacgag 420gcacagatac
tagtgtcgga attgttggcc tcctgctgac cacagctatg gcagcggagg
480tcactagacg tgggagtgca tactatatgt acttggacag aagcgatgct
ggggaggcca 540tatcttttcc aaccacactg gggatgaata agtgttatat
acagatcatg gatcttggac 600acatgtgtga tgccaccatg agctatgaat
gccctatgct ggatgagggg gtagaaccag 660atgacgtcga ttgttggtgc
aacacgacgt caacttgggt tgtgtacgga acctgccatc 720acaaaaaagg
tgaagcacgg agatccagaa gagctgtgac gctcccctcc cattccacta
780ggaagctgca aacgcggtcg cagacctggt tggaatcaag agaatacaca
aagcacttga 840ttagagtcga aaattggata ttcaggaacc ctggcttcgc
gttagcagca gctgccatcg 900cttggctttt gggaagctca acgagccaaa
aagtcatata cttggtcatg atactgctga 960ttgccccggc atacagcatc
aggtgcatag gagtcagtaa tagggacttt gtggaaggta 1020tgtcaggtgg
gacttgggtt gatgttgtct tggaacatgg aggttgtgtc accgtaatgg
1080cacaggacaa accgactgtc gacatagagc tggttacaac aacagtcagc
aacatggcgg 1140aggtaagatc ctactgctat gaggcatcaa tatcggacat
ggcttcggac agccgctgcc 1200caacacaagg tgaagcctac cttgacaagc
aatcagacac tcaatatgtc tgcaaaagaa 1260cgttagtgga cagaggctgg
ggaaatggat gtggactttt tggcaaaggg agcctggtga 1320catgcgctaa
gtttgcatgc tccaagaaaa tgaccgggaa gagcatccag ccagagaatc
1380tggagtaccg gataatgctg tcagttcatg gctcccagca cagtgggatg
atcgttaatg 1440acacaggaca tgaaactgat gagaatagag cgaaggttga
gataacgccc aattcaccaa 1500gagccgaagc caccctgggg ggttttggaa
gcctaggact tgattgtgaa ccgaggacag 1560gccttgactt ttcagatttg
tattacttga ctatgaacaa caagcactgg ttggttcaca 1620aggagtggtt
ccacgacatt ccattacctt ggcacactgg ggcagacacc ggaactccac
1680actggaacaa caaagaagca ctggtagagt tcaaggacgc acatgccaaa
aggcaaactg 1740tcgtggttct agggagtcaa gaaggagcag ttcacacggc
ccttgctgga gctctggagg 1800ctgagatgga tggtgcaaag ggaaggctgt
cctctggcca cttgaaatgt cgcctgaaaa 1860tggataaact tagattgaag
ggcgtgtcat actccttgtg taccgcagcg ttcacattca 1920ccaagatccc
ggctgaaaca ctgcacggga cagtcacagt ggaggtacag tacgcaggga
1980cagatggacc ttgcaaggtt ccagctcaga tggcggtgga catgcaaact
ctgaccccag 2040ttgggaggtt gataaccgct aaccccgtaa tcactgaagg
cactgagaac tctaagatga 2100tgctggaact tgatccacca tttggggact
cttacattgt cataggagtc ggggagaaga 2160agatcaccca ccactggcac
aggagtggca gcaccattgg aaaagcattt gaagccactg 2220tgagaggtgc
caagagaatg gcagtcttgg gagacacagc ctgggacttt ggatcagttg
2280gaggcgttct taactcattg ggcaagggca tccatcaaat ttttggagca
gctttcaaat 2340cattgtttgg aggaatgtcc tggttctcac aaattctcat
tggaacgttg ctgatgtggt 2400tgggtctgaa tacaaagaat ggatctattt
cccttatgtg cttggcctta gggggagtgt 2460tgatcttctt atccacagcc
gtctccgctg atgtggggtg ctcggtggac ttctcaaaga 2520aggaaacgag
atgcggtaca ggggtgttcg tctataacga cgttgaagcc tggagggaca
2580ggtacaagta ccatcctgac tcccctcgta gattggcagc agtagtcaag
caagcctggg 2640aagatggtat ctgtgggatc tcctctgttt caagaatgga
aaacatcatg tggagatcag 2700tagaagggga gctcaacgca atcctggaag
agaatggagt tcaactgacg gtcgttgtgg 2760gatctgtaaa aaaccccatg
tggagaggtc cacagagatt gcccgtgcct gtgaacgagc 2820tgccccacgg
ctggaaggct tgggggaaat cgtacttcgt cagagcagca aagacaaata
2880acagctttgt cgtggatggt gacacactga aggaatgccc actcaaacat
agagcatgga 2940acagctttct tgtggaggat catgggttcg gggtatttca
cactagtgtc tggctcaagg 3000ttagagaaga ttattcacta gagtgtgatc
cagccgtcat tggaacagct gttaagggaa 3060aggaggctgt acacagtgat
ctaggctact ggattgagag tgagaagaac gacacatgga 3120ggctgaggag
ggcccacctg atcgagatga aaacatgtga atggccaaag tcccacacat
3180tgtggacaga tggaatagaa gagagtgatc tgatcatacc caagtcttta
gctgggccac 3240tcagccatca caacaccaga gagggctaca ggacccaaat
gaaagggcca tggcacagtg 3300aagagcttga aattcggttt gaggaatgcc
caggcactaa ggtccacgtg gaggaaacat 3360gtggaacaag aggaccatct
ctgagatcaa ccactgcaag cggaagggtg atcgaggaat 3420ggtgctgcag
ggagtgcaca atgcccccac tgtcgttccg ggctaaagat ggctgttggt
3480atggaatgga gataaggccc aggaaagaac cagaaagtaa cttagtaagg
tcaatggtga 3540ctgcaggatc aactgatcac atggatcact tttcccttgg
agtgcttgtg attctgctca 3600tggtgcagga agggctgaag aagagaatga
ccacaaagat catcataagc acatcaatgg 3660cagtgctggt agctatgatc
ctgggaggat tttcaatgag tgatctggct aagcttgcaa 3720ttttgatggg
tgccaccttt gcggaaatga acactggagg agatgtagct catctggcgc
3780tggtagcggc attcaaagtc agaccagcgt tgctggtatc tttcatcttc
agagctaatt 3840ggacaccccg tgaaagcatg ctgctggcct tggcctcgtg
tcttttgcaa actgcgatct 3900ccgccttgga aggcgacctg atggttctca
tcaatggttt tgctttggcc tggttggcaa 3960tacgagcgat ggttgttcca
cgcactgaca atatcacctt ggcaatcctg gctgctctga 4020caccactggc
ccggggcaca ctgcttgtgg cgtggagagc aggccttgct acttgcgggg
4080ggttcatgct cctctctctg aaggggaaag gcagtgtgaa gaagaactta
ccatttgtca 4140tggccctggg actaaccgct gtgaggctgg tcgaccccat
caacgtggtg ggactgctgt 4200tgctcacaag gagtgggaag cggagctggc
cccctagcga agtactcaca gctgttggcc 4260tgatatgcgc attggctgga
gggttcgcca aggcagatat agagatggct gggcccatgg 4320ccgcggtcgg
tctgctaatt gtcagttacg tggtctcagg aaagagtgtg gacatgtaca
4380ttgaaagagc aggtgacatc acatgggaaa aagatgcgga agttactgga
aacagtcccc 4440ggctcgatgt ggcactagat gagagtggtg atttctccct
ggtggaggat gacggtcccc 4500ccatgagaga gatcatactc aaagtggtcc
tgatgaccat ctgtggcatg aacccaatag 4560ccataccctt tgcagctgga
gcgtggtacg tatacgtgaa aactggaaaa aggagtggtg 4620ctctatggga
tgtgcctgct cccaaggaag taaaaaaggg ggagaccaca gatggagtgt
4680acagagtaat gactcgtaga ctgctaggtt caacacaagt tggagtggga
gttatgcaag 4740agggggtctt tcacactatg tggcatgtca caaaaggatc
cgcgctgaga agcggtgaag 4800ggagacttga tccatactgg ggagatgtca
agcaggatct ggtgtcatac tgtggtccat 4860ggaagctaga tgccgcctgg
gacgggcaca gcgaggtgca gctcttggcc gtgccccccg 4920gagagagagc
gaggaacatc cagactctgc ccggaatatt taagacaaag gatggggaca
4980ttggagcggt tgcgctggac tatccagcag gaacttcagg atctccaatc
ctagacaagt 5040gtgggagagt gataggactc tatggcaatg gggtcgtgat
caagaatggg agttatgtca 5100gtgccatcac ccaagggagg agggaggaag
agactcctgt tgagtgcttc gagccttcga 5160tgctgaagaa gaagcagcta
actgtcttag acttgcatcc tggagctggg aaaaccagga 5220gagttcttcc
tgaaatagtc cgtgaagcca taaaaacgag actccgtact gtgatcttag
5280ctccaaccag ggttgtcgct gctgaaatgg aggaagccct tagagggctt
ccagtgcgtt 5340atatgacaac agcagtcaat gtcacccatt ctgggacaga
aatcgttgac ttaatgtgcc 5400atgccacctt cacttcacgt ctactacagc
caatcagagt ccccaactat aatctgtata 5460ttatggatga ggcccacttc
acagatccct caagtatagc agcaagagga tacatttcaa 5520caagggttga
gatgggcgag gcagctgcca tcttcatgac cgccacgcca ccaggaaccc
5580gtgacgcatt cccggactcc aactcaccaa ttatggacac cgaagtggaa
gtcccagaga 5640gagcctggag ctcaggcttt gattgggtga cggatcattc
tggaaaaaca gtttggtttg 5700tcccaagcgt gaggaacggc aatgagatcg
cagcttgtct gacaaaggct ggaaaacggg 5760tcatacagct cagcagaaag
acttttgaga cagagttcca gaaaacaaaa catcaagagt 5820gggacttcgt
cgtgacaact gacatttcag agatgggcgc caactttaaa gctgaccgtg
5880tcatagattc caggagatgc ctaaagccgg tcatacttga tggcgagaga
gtcattctgg 5940ctggacccat gcctgtcaca catgccagcg ctgcccagag
gagggggcgc ataggcagga 6000atcccaacaa acctggagat gagtatctgt
atggaggtgg gtgcgcagag actgatgaag 6060accatgcaca ctggcttgaa
gcaagaatgc tccttgacaa tatttacctc caagatggcc 6120tcatagcctc
gctctatcga cctgaggccg acaaagtagc agccattgag ggagagttca
6180agcttaggac ggagcaaagg aagacctttg tggaactcat gaaaagagga
gatcttcctg 6240tttggctggc ctatcaggtt gcatctgccg gaataaccta
cacagataga agatggtgct 6300ttgatggcat gaccaacaac accataatgg
aagacagtgt gccggcagag gtgtggacca 6360gacacggaga gaaaagagtg
ctcaaaccga ggtggatgga cgccagagtt tgttcagatc 6420atgcggccct
gaagtcattc aaggagtttg ccgctgggaa aagaggagcg gcttttggag
6480tgatggaagc cctgggaaca ctgccaggac acatgacgga gagattccag
gaagccattg 6540acaacctcgc tgtgctcatg cgggcagaga ctggaagcag
gccttacaaa gccgcggcgg 6600cccaattgcc ggagacccta gagaccatta
tgcttttggg gttgctggga acagtctcgc 6660tgggaatctt tttcgtcttg
atgcggaaca agggcatagg gaagatgggc tttggaatgg 6720tgactcttgg
ggccagcgca tggctcatgt ggctctcgga aattgagcca gccagaattg
6780catgcgtcct cattgttgtg ttcctattgc tggtggtgct catacctgag
ccagaaaagc 6840aaagatcccc ccaggacaac caaatggcaa tcatcatcat
ggtagcagta ggtcttctgg 6900gcttgattac cgccaatgaa ctcggatggt
tggagagaac aaagagtgac ctaagccatc 6960taatgggaag gagagaggag
ggggcaacca taggattctc aatggacatt gacctgcggc 7020cagcctcggc
ctgggccatc tatgctgccc tgacaacttt cattacccca gccgtccaac
7080atgcagtgac cacttcatac aacaactact ccttaatggc gatggccacg
caagctggag 7140tgttgtttgg tatgggcaaa gggatgccat tctacgcatg
ggactttgga gtcccgctgc 7200taatgatagg ttgctactca caattaacac
ccctgaccct aatagtggct atcattttgc 7260tcgtggcgca ctacatgtac
ttgatcccag ggctgcaggc agcagctgcg cgtgctgccc 7320agaagagaac
ggcagctggc atcatgaaga accctgttgt ggatggaata gtggtgactg
7380acattgacac aatgactatt gacccccaag tggagaaaaa gatgggacag
gtgctactca 7440tagcagtagc cgtctccagc gccatactgt cgcggaccgc
ctgggggtgg ggggaagctg 7500gggccctgat cacagctgca acttccactt
tgtgggaagg ctctccgaac aagtactgga 7560actcctctac agccacttca
ctgtgcaaca tttttagggg aagttacttg gctggagctt 7620ctctaatcta
cacagtaaca agaaacgctg gcttggtcaa gagacgtggg ggtggaacag
7680gagagaccct gggagagaaa tggaaggccc gcttgaacca gatgtcggcc
ctggagttct 7740actcctacaa aaagtcaggc atcaccgagg tgtgcagaga
agaggcccgc cgcgccctca 7800aggacggtgt ggcaacggga ggccatgctg
tgtcccgagg aagtgcaaag ctgagatggt 7860tggtggagcg gggatacctg
cagccctatg gaaaggtcat tgatcttgga tgtggcagag 7920ggggctggag
ttactacgcc gccaccatcc gcaaagttca agaagtgaaa ggatacacaa
7980aaggaggccc tggtcatgaa gaacccatgt tggtgcaaag ctatgggtgg
aacatagtcc 8040gtcttaagag tggggtggac gtctttcata tggcggctga
gccgtgtgac acgttgctgt 8100gtgacatagg tgagtcatca tctagtcctg
aagtggaaga agcacggacg ctcagagtcc 8160tctccatggt gggggattgg
cttgaaaaaa gaccaggagc cttttgtgta aaagtgttgt 8220gcccatacac
cagcactatg atggaaaccc tggagcgact gcagcgtagg tatgggggag
8280gactggtcag agtgccactc tcccgcaact ctacacatga gatgtactgg
gtctctggag 8340cgaaaagcaa caccataaaa agtgtgtcca ccacgagcca
gctcctcttg gggcgcatgg 8400acgggcccag gaggccagtg aaatatgagg
aggatgtgaa tctcggctct ggcacgcggg 8460ctgtggtaag ctgcgctgaa
gctcccaaca tgaagatcat tggtaaccgc attgaaagga 8520tccgcagtga
gcacgcggaa acgtggttct ttgacgagaa ccacccatat aggacatggg
8580cttaccatgg aagctatgag gcccctacac aagggtcagc gtcctctcta
ataaacgggg 8640ttgtcaggct cctgtcaaaa ccctgggatg tggtgactgg
agtcacagga atagccatga 8700ccgacaccac accgtatggt cagcaaagag
ttttcaagga aaaagtggac accagggtgc 8760cagaccccca agaaggcact
cgtcaggtta tgagcatggt ctcttcctgg ttgtggaaag 8820agctaggcaa
acacaaacgg ccacgagtct gtaccaaaga agagttcatc aacaaggttc
8880gtagcaatgc agcattaggg gcaatatttg aagaggaaaa agagtggaag
accgcagtgg 8940aagctgtgaa cgatccaagg ttctgggctc tagtggacaa
ggaaagagag caccacctga 9000gaggagagtg ccagagctgt gtgtacaaca
tgatgggaaa aagagaaaag aaacaagggg 9060aatttggaaa ggccaagggc
agccgcgcca tctggtatat gtggctaggg gctagatttc 9120tagagttcga
agcccttgga ttcttaaatg aggatcactg gatggggaga gagaactcag
9180gaggtggtgt tgaagggctg ggattacaaa gactcggata tgtcctagaa
gagatgagtc 9240gcataccagg aggaaggatg tatgcagatg acactgctgg
ctgggacacc cgcatcagca 9300ggtttgatct ggagaatgaa gctttaatca
ccaaccaaat ggagaaaggg cacagggcct 9360tagcattggc cataatcaag
tacacatacc aaaacaaagt ggtaaaggtc cttagaccag 9420ctgaaaaagg
gaagacagtt atggacatta tttcaagaca agaccaaagg gggagcggac
9480aagttgtcac ttacgctctt aacacattta ccaacctagt ggtgcaactc
attcggaata 9540tggaggctga ggaagttcta gagatgcaag acttgtggct
gctgcggagg tcagagaaag 9600tgaccaactg gttgcagagc aacggatggg
ataggctcaa acgaatggca gtcagtggag 9660atgattgcgt tgtgaagcca
attgatgata ggtttgcaca tgccctcagg ttcttgaatg 9720atatgggaaa
agttaggaag gacacacaag agtggaaacc ctcaactgga tgggacaact
9780gggaagaagt tccgttttgt tcccaccact tcaacaagct ccatctcaag
gacgggaggt 9840ccattgtggt tccctgccgc caccaagatg aactgattgg
ccgggcccgt gtctctccag 9900gggcgggatg gagcatccgg gagactgctt
gcctagcaaa gtcatatgcg caaatgtggc 9960agctccttta tttccacaga
agggacctcc gactgatggc caatgccatc tgttcatctg 10020tgccagttga
ctgggttcca actgggagaa ctacctggtc aatccatgga aagggagaat
10080ggatgaccac tgaagacatg cttgtggtgt ggaacagagt gtggattgag
gagaacgacc 10140acatggaaga caagacccca gttacgaaat ggacagacat
tccctatctg ggaaaaaggg 10200aagacttgtg gtgtggatct ctcatagggc
acagaccgcg caccacctgg gctgagaaca 10260ttaaaaacac agtcaacatg
gtgcgcagga tcataggtga tgaagaaaag tacatggact 10320acctatccac
ccaagttcgc tacttgggtg aagaagggtc tacacctgga gtgctataag
10380caccaatctt agtgttgtca ggcctgctag tcagccacag cttggggaaa
gctgtgcagc 10440ctgtgacccc cccaggagag gctgggaaac caagcccata
gtcaggccga gaacgccatg 10500gcacggaaga agccatgctg cctgtgagcc
cctcagagga cactgagtca aaaaacccca 10560cgcgcttgga ggcgcaggat
gggaaaagaa ggtggcgacc ttccccaccc ttcaatctgg 10620ggcctgaact
ggagatcagc tgtggatctc cagaagaggg actagtggtt agaggagacc
10680ccccggaaaa cgcaaaacag catattgacg ctgggaaaga ccagagactc
catgagtttc 10740caccacgctg gccgccaggc acagatcgcc gaatagcggc
ggccggtgtg gggaaatcca 10800tgggtct 10807810807DNAZika virus
8agttgttgat ctgtgtgaat cagactgcga cagttcgagt ttgaagcgaa agctagcaac
60agtatcaaca ggttttattt tggatttgga aacgagagtt tctggtcatg aaaaacccaa
120aaaagaaatc cggaggattc cggattgtca atatgctaaa acgcggagta
gcccgtgtga 180gcccctttgg gggcttgaag aggctgccag ccggacttct
gctgggccat gggcccatca 240ggatggtctt ggcgatacta gcctttttga
gattcacggc aatcaagcca tcactgggtc 300tcatcaatag atggggttca
gtggggaaaa aagaggctat ggaaataata aagaagttca 360agaaagatct
ggctgccatg ctgagaataa tcaatgctag gaaggagaag aagagacgag
420gcgcagatac tagcgtcgga attgttggcc tcctcctgac cacagccatg
gcagtagagg 480tcactagacg tgggagtgca tactatatgt acttggacag
aagcgatgct ggggaggcca 540tatcttttcc aaccacactg gggatgaata
agtgttacat acaaatcatg gatcttggac 600acatgtgtga tgccaccatg
agctatgaat gccctatgtt ggatgagggg gtagaaccag 660atgacgtcga
ttgctggtgc aacacgacat caacttgggt tgtgtatgga acctgccacc
720acaaaaaagg tgaagcacgg agatctagaa gagctgtgac gctcccctcc
cattccacta 780ggaagctgca aacgcggtcg cagacctggt tggaatcaag
agaatacaca aagcacctga 840ttagagttga aaattggata ttcaggaacc
ctggcttcgc gttagcagca gctgtcatcg 900cttggctttt gggaagttca
acgagccaaa aagtcatata tctggtcatg atactgctga 960ttgccccggc
atacagcatc aggtgcatag gagtcagcaa tagggacttt gtggaaggta
1020tgtcaggtgg gacttgggtt gatgttgtct tggaacatgg aggttgtgtt
accgtaatgg 1080cacaggacaa accgactgtc gacatagagc tggttacaac
aacagtcagc aacatggcgg 1140aggtaagatc ctactgctat gaggcatcaa
tatcggatat ggcttcggac agccgctgcc 1200caacacaagg tgaggcctac
cttgacaagc agtcagacac tcaatatgtc tgcaaaagaa 1260cgttagtgga
cagaggctgg ggaaatggat gtggactttt tggcaaaggg agcctggtga
1320catgcgctaa gtttgcatgc tccaagaaaa tgaccgggaa gagcatccag
ccagagaatc 1380tggagtaccg gataatgctg tcagttcatg gctcccagca
cagtgggatg atcgttaatg 1440acacaggaca tgaaactgat gagaatagag
cgaaggttga gataacgccc aattcaccaa 1500gagccgaagc caccctgggg
ggttttggga gcctaggact tgattgtgaa ccgaggacag 1560gccttgactt
ttcagatttg tattacctga ctatgaataa caagcactgg ttggttcaca
1620aggagtggtt ccacgacatt ccattacctt ggcatgctgg ggcagacact
ggaactccac 1680attggaacaa caaagaagca ctggtagagt tcaaggacgc
acatgcaaaa aggcaaactg 1740tcgtggttct agggagtcaa gaaggagcag
ttcacacggc ccttgctgga gctctggagg 1800ctgagatgga tggagccaag
ggaaggctgt cctctggcca cttgaaatgt cgcctgaaaa 1860tggataaact
tagattgaag ggcgtgtcat actccttgtg cactgcagcg ttcacattca
1920ccaagatccc ggctgaaaca ctgcacggga cagtcacagt ggaggtacag
tacgcaggga 1980cagatggacc ttgcaaggtt ccagctcaga tggcggtgga
tatgcaaact ctgaccccag 2040ttgggaggtt gataaccgct aaccctgtaa
tcactgaaag caccgagaac tctaagatga 2100tgctggaact tgatccacca
tttggggact cttacattgt cataggagtc ggggagaaga 2160agatcaccca
tcactggcac aggagtggca gcaccattgg aaaagcattt gaagccactg
2220tgagaggtgc caagagaatg gcagtcttgg gagacacagc ctgggacttt
ggatcagttg 2280ggggtgctct caactcattg ggcaagggca tccatcaaat
ttttggagca gctttcaaat 2340cattgttcgg aggaatgtcc tggttctcac
aaattctcat tggaacgttg ctggtgtggt 2400tgggtctgaa tacaaagaat
ggatctattt cccttacgtg cttggcctta gggggagtgt 2460tgatcttctt
atccacagcc gtttctgctg atgtggggtg ctcggtggac ttctcaaaga
2520aggaaacgag atgcggtaca ggggtgttcg tctataacga cgttgaagcc
tggagggaca 2580ggtacaagta ccatcctgac tcccctcgta gattggcagc
agcagtcaag caagcctggg 2640aagatgggat ctgtgggatc tcctctgtct
caagaatgga aaacatcatg tggagatcag 2700tagaagggga gctcaacgca
atcctggaag agaatggagt tcaactgacg gtcgttgtgg 2760gatctgtaaa
aaaccccatg tggagaggtc cacagagatt gcccgtgcct gtgaacgagc
2820tgccccacgg ctggaaggct tgggggaaat cgtacttcgt cagagcagca
aagacaaata 2880acagctttgt cgtggatggt gacacactga aggaatgccc
actcaaacat agagcatgga 2940acagctttct tgtggaggat catgggtttg
gggtatttca cactagtgtc tggctcaagg 3000ttagagaaga ttattcatta
gagtgtgatc cagccgtcat tggaacagct gctaagggaa 3060aggaggctgt
gcacagcgat ctaggctact ggattgagag tgagaagaac gacacatgga
3120ggctgaagag ggcccacctg atcgagatga aaacatgtga atggccaaag
tcccacacat 3180tgtggacaga tggagtagaa gaaagtgatc tgatcatacc
caagtcttta gctgggccac 3240tcagccatca caacaccaga gagggctaca
ggactcaaat gaaagggcca tggcacagtg 3300aagagcttga aattcggttt
gaggaatgcc caggcactaa ggtccacgtg gaggaaacat 3360gtgggacaag
aggaccatcc ctgagatcaa ccactgcaag cggaagggtg atcgaggaat
3420ggtgctgcag ggaatgcaca atgcccccac tgtcgttccg agctaaagat
ggctgttggt 3480atggaatgga gataaggccc aggaaagaac cagaaagtaa
cttagtaagg tcaatggtga 3540ctgcaggatc aactgatcac atggatcact
tctctcttgg agtgcttgtg attttgctca 3600tggtgcagga agggctgaag
aagagaatga ccacaaagat catcataagc acatcaatgg 3660cagtgctggt
agccatgatc ctgggaggat tttcaatgag tgacctggct aagcttgcaa
3720ttttgatggg tgccaccttc gcggaaatga acactggagg agatgtagct
catttggcgc 3780tgatagcggc attcaaagtc agacctgcgt tgctggtatc
tttcatcttc agagctaatt 3840ggacaccccg tgagagcatg ctgctggcct
tggcctcgtg tcttctgcaa actgcgatct 3900ccgccttgga aggcgacctg
atggttctca tcaatggttt tgctttggcc tggttggcaa 3960tacgagcgat
ggttgttcca cgcactgaca acatcacctt ggcaatcctg gctgctctga
4020caccactggc ccggggcaca ctgcttgtgg cgtggagagc aggccttgct
acttgcgggg 4080ggttcatgct cctctctctg aaggggaaag gcagtgtgaa
gaagaaccta ccatttgtca 4140tggccttggg actaactgct gtgaggctgg
tcgaccccat caacgtggtg ggactgctgt 4200tgctcacaag gagtgggaag
cggagctggc cccctagtga agtactcaca gctgttggcc 4260tgatatgcgc
attggctgga gggttcgcca aggcggatat agagatggct gggcccatgg
4320ccgcggtcgg tctgctaatt gtcagttacg tggtctcagg aaagagtgtg
gacatgtaca 4380ttgaaagagc aggtgacatc acatgggaaa aagatgcgga
aatcactgga aacagtcccc 4440ggctcgatgt ggcactagat gagagtggtg
atttctccct agtggaggat gatggtccac 4500ccatgagaga gatcatactc
aaagtggtcc tgatgaccat ctgcggcatg aacccaatag 4560ccataccctt
tgcagctgga gcgtggtacg tgtatgtgaa gactggaaaa aggagtggtg
4620ctctatggga tgtgcctgct cccaaggaag taaaaaaggg ggagaccaca
gatggagtgt 4680acagagtaat gactcgtaga ctgcttggtt caacacaagt
tggagtggga gtcatgcaag 4740agggggtctt ccacactatg tggcacgtca
caaaaggatc cgcgctgaga agcggtgaag 4800ggagacttga tccatactgg
ggagatgtca agcaggatct ggtgtcatac tgtggtccgt 4860ggaagctaga
cgccgcctgg gacgggcaca gcgaggtgca gctcttggcc gtgccccccg
4920gagagagagc gaggaacatc cagactctgc ccggaacatt taagacaaag
gatggggaca 4980ttggagcagt tgcgctggac tacccagcag gaacttcagg
atctccaatc ctagacaagt 5040gtgggagagt gataggactc tatggtaatg
gggtcgtgat aaaaaatggg agttatgtta 5100gtgccatcac ccaagggagg
agggaggaag agactcctgt tgagtgcttc gagccttcga 5160tgctgaagaa
gaagcagcta actgtcttag acctgcatcc tggagccggg aaaaccagga
5220gagttcttcc tgaaatagtc cgtgaagcca taaaaacaag actccgtact
gtgatcttag 5280ctccaaccag ggtcgtcgct gctgaaatgg aggaagccct
tagagggctt ccagttcgtt 5340atatgacaac agcagtcaat gtcacccatt
ctgggacaga aatcgttgac ttaatgtgcc 5400atgctacctt cacttcacgc
ctactacaac caatcagagt ccccaactat aatttgtata 5460ttatggatga
ggcccacttc acagatccct caagtatagc agcaagagga tacatttcaa
5520caagggttga gatgggcgag gcggctgcca tcttcatgac cgccacgcca
ccaggaaccc 5580gtgacgcatt cccggactcc aactcaccaa ttatggacac
cgaggtggaa gtcccagaga 5640gagcctggag cacaggcttt gattgggtga
cggatcattc tgggaaaaca gtctggtttg 5700ttccaagcgt gaggaacggc
aatgagatcg cagcttgtct gacaaaggct ggaaaacggg 5760tcatacagct
cagcagaaag acttttgaga cagagttcca gaaaacgaaa aatcaagagt
5820gggacttcgt cgtgacaacc gacatttcag agatgggcgc caactttaaa
gctgaccgtg 5880tcatagattc caggagatgc ttaaagccgg tcatacttga
tggcgagaga gtcattttgg 5940ctggacccat gcctgtcaca catgccagcg
ctgctcagag gagggggcgc ataggcagga 6000atcccaacaa acctggagat
gagtatctgt atggaggtgg gtgcgcagag actgatgaag 6060atcacgcaca
ctggcttgaa gcaagaatgc ttcttgacaa catttacctc caagatggcc
6120tcatagcttc gctctatcga cctgaggccg acaaagtagc agctattgag
ggagagttca 6180agcttaggac ggagcaaagg aagacctttg tggaactcat
gaaaagagga gatcttccgg 6240tttggttggc ctatcaggtt gcatctgccg
gaataaccta cacagataga agatggtgct 6300ttgatggcat gaccaacaac
accataatgg aagacagtgt gccggcagag gtgtggacca 6360gatacggaga
gaaaagagtg ctcaaaccga ggtggatgga cgccagagtt tgttcagatc
6420atgcggccct gaagtcattc aaagagtttg ccgctgggaa aagaggagcg
gcctttggag 6480tgatagaagc cctgggaaca ctgccaggac acatgacaga
gagattccag gaagccattg 6540acaacctcgc tgtgctcatg cgggcagaga
ctggaagcag gccttacaaa gccgcggcgg 6600cccaattgcc ggagacccta
gagaccatta tgcttttggg gttgctggga acagtctcgc 6660tgggaatctt
tttcgtcttg atgcggaaca agggcatggg gaagatgggc tttggaatgg
6720tgactcttgg ggccagcgca tggcttatgt ggctctcgga aattgagcca
gccagaattg 6780catgtgtcct cattgtcgtg ttcctattgc tggtggtgct
catacctgag ccagaaaagc 6840aaagatctcc tcaggacaac caaatggcaa
tcatcatcat ggtagcagtg ggtcttctgg 6900gcttgattac cgccaatgaa
ctcggatggt tggagagaac aaaaagtgac ctaagccatc 6960taatgggaag
gagagaggag ggggcaacca caggattctc aatggacatt gacctgcggc
7020cagcctcagc ttgggctatc tatgctgctc tgacaacttt catcacccca
gccgtccaac 7080atgcggtgac cacttcatac aacaactact ccttaatggc
gatggccacg caagctgggg 7140tgttgtttgg tatgggcaaa gggatgccat
tctacgcatg ggactttgga gtcccgctgc 7200taatgatggg ttgctactca
caattaacac ctctgaccct aatagtggcc atcattttgc 7260tcgtggcgca
ctacatgtac ttgatcccag ggctgcaggc agcagctgcg cgggctgccc
7320agaagagaac ggcagctggc atcatgaaga accctgttgt ggatggaata
gtggtgactg 7380acattgacac aatgacaatt gacccccaag tggaaaaaaa
gatggggcag gtgctactca 7440tagcagtagc cgtctccagc gccatactgt
cgcggaccgc ctgggggtgg ggggaggctg 7500gggccctgat cacagctgca
acttccacct tgtgggaagg ctctccgaac aagtactgga 7560actcctccac
agccacttca ctgtgtaaca tttttagggg aagttacttg gctggagctt
7620ctctaatcta cacagtaaca agaaacgctg gcttggtcaa gagacgtggg
ggtggaacgg 7680gagagaccct gggagagaaa tggaaggccc gcctgaacca
gatgtcggcc ctggagttct 7740actcctacaa aaagtcaggc atcaccgagg
tgtgcagaga agaggcccgc cgtgccctca 7800aggacggtgt ggcaacagga
ggccatgctg tgtcccgagg aagtgcaaag cttagatggc 7860tggtggagag
aggatacctg cagccctatg gaaaggtcat tgatcttgga tgtggcagag
7920ggggctggag ttactatgcc gccaccatcc gcaaagttca ggaagtgaaa
ggatacacaa 7980aaggaggccc tggtcatgaa gaacccatgt tggtgcaaag
ctatgggtgg aacatagtcc 8040gtcttaagag tggggtggac gtctttcaca
tggcggctga gccgtgtgac actttgctgt 8100gtgatatagg tgagtcatca
tctagtcctg aagtggaaga agcacggacg ctcagagtcc 8160tctccatggt
gggggattgg cttgaaaaaa gaccaggagc cttttgtata aaagtgttgt
8220gcccatacac cagcactatg atggaaaccc tggagcgact gcagcgtagg
tatgggggag 8280gactggtcag ggtgccactc tcccgcaact ctacacatga
gatgtactgg gtctctggag 8340cgaaaagcaa caccataaaa agtgtgtcca
ccacgagcca gctcctcttg gggcgcatgg 8400acgggcccag gaggccagtg
aaatatgagg aggatgtgaa tctcggctct ggcacgcggg 8460ctgtggtaag
ctgcgctgaa gctcccaaca tgaagatcat tggtaaccgc attgagagga
8520tccgcagtga gcacgcggaa acgtggttct ttgacgagaa ccacccatat
aggacatggg 8580cttaccatgg aagctatgag gcccctacac aagggtcagc
gtcctctcta ataaacgggg 8640ttgtcaggct cctgtcaaaa ccctgggatg
tggtgactgg agtcacagga atagccatga 8700ctgacaccac accgtatggt
cagcaaagag ttttcaagga aaaagtggac actagggtgc 8760cagaccccca
agaaggcact cgtcaggtta tgagcatggt ctcttcctgg ttatggaagg
8820agctaggcaa acacaaacgg ccacgagtct gtaccaaaga agagttcatc
aacaaggttc 8880gtagcaatgc agcattaggg gcaatatttg aagaggaaaa
agagtggaag actgcagtgg 8940aagctgtgaa tgatccaagg ttctgggctc
tagtggacaa ggaaagagag catcacctga 9000gaggagagtg tcagagctgt
gtgtacaaca tgatgggaaa aagagaaaag aaacaagggg 9060aatttggaaa
ggccaagggc agccgcgcca tctggtatat gtggctaggg gctagattcc
9120tagagttcga agcccttgga ttcttgaatg aggatcattg gatggggaga
gagaattcag 9180gaggtggtgt tgaaggactg ggattacaaa gactcggata
tgtcctagaa gagatgagtc 9240gcataccagg aggaaggatg tatgcagatg
atactgctgg ctgggacacc cgcatcagca 9300ggtttgatct ggagaatgaa
gctctaatca ccaaccaaat ggagaaaggg cacagggcct 9360tggcattggc
cataatcaag tacacatacc aaaacaaagt ggtaaaggtc cttagaccag
9420ctgaaaaagg gaagacagtt atggacatta tttcaagaca agaccaaagg
gggagcggac 9480aagttgtcac ttacgctctt aatacattca ccaacctggt
ggtgcagctc attcggaata 9540tggaggctga ggaagttcta gagatgcaag
acttgtggct gctgcggagg ccagagaaag 9600tgaccaactg gttgcaaagc
aacggatggg ataggctcaa aagaatggca gtcagtggag 9660atgattgcgt
tgtgaaacca attgatgata ggtttgcaca tgccctcagg ttcttgaatg
9720atatgggaaa agttaggaag gacacacaag agtggaaacc ctcaactgga
tgggacaact 9780gggaagaagt tccgttttgc tcccaccact tcaacaaact
ccatcttaag gacgggaggt 9840ccattgtggt tccctgccgc caccaagatg
aactgattgg ccgagcccgc gtatcaccag 9900gggcgggatg gagcatccgg
gagactgctt gcctagcaaa atcatatgcg caaatgtggc 9960agctccttta
tttccacaga agggacctcc gactgatggc caatgccatt tgttcatctg
10020tgccagttga ttgggttcca actgggagaa ctacctggtc aatccatgga
aagggagaat 10080ggatgaccac tgaagacatg cttgtggtat ggaacagagt
gtggattgag gaaaacgacc 10140acatggaaga caagacccca gttacaaaat
ggacagacat tccctatttg ggaaaaagag 10200aagacttgtg gtgtggatct
ctcatagggc acagaccgcg tactacctgg gctgagaaca 10260tcaaaaatac
agtcaacatg atgcgcagga tcataggtga tgaagaaaag tacatggact
10320acctatccac ccaggttcgc tacttgggtg aagaagggtc
cacacctgga gtgctgtaag 10380caccaatctt agtgttgtca ggcctgctag
tcagccacag cttggggaaa gctgtgcagc 10440ctgtgacccc cccaggagaa
gctgggaaac caagcctata gtcaggccga gaacgccatg 10500gcacggaaga
agccatgctg cctgtgagcc cctcagagga cactgagtca aaaaacccca
10560cgcgcttgga ggcgcaggat gggaaaagaa ggtggcgacc ttccccaccc
ttcaatctgg 10620ggcctgaact ggagatcagc tgtggatctc cagaagaggg
actagtggtt agaggagacc 10680ccccggaaaa cgcaaaacag catattgacg
ctgggaaaga ccagagactc catgagtttc 10740caccacgctg gccgccaggc
acagatcgcc gaatagcggc ggccggtgtg gggaaatcca 10800tgggtct
10807910648DNAZika virus 9gacagttcga gtttgaagcg aaagctagca
acagtatcaa caggttttat ttggatttgg 60aaacgagagt ttctggtcat gaaaaaccca
aaaaagaaat ccggaggatt ccggattgtc 120aatatgctaa aacgcggagt
agcccgtgtg agcccctttg ggggcttgaa gaggctgcca 180gccggacttc
tgctgggtca tgggcccatc aggatggtct tggcgattct agcctttttg
240agattcacgg caatcaagcc atcactgggt ctcatcaata gatggggttc
agtggggaaa 300aaagaggcta tggaaataat aaagaagttc aagaaagatc
tggctgccat gctgagaata 360atcaatgcta ggaaggagaa gaagagacga
ggcgcagata ctagtgtcgg aattgttggc 420ctcctgctga ccacagctat
ggcagcggag gtcactagac gtgggagtgc atactatatg 480tacttggaca
gaaacgatgc tggggaggcc atatcttttc caaccacatt ggggatgaat
540aagtgttata tacagatcat ggatcttgga cacatgtgtg atgccaccat
gagctatgaa 600tgccctatgc tggatgaggg ggtggaacca gatgacgtcg
attgttggtg caacacgacg 660tcaacttggg ttgtgtacgg aacctgccat
cacaaaaaag gtgaagcacg gagatctaga 720agagctgtga cgctcccctc
ccattccact aggaagctgc aaacgcggtc gcaaacctgg 780ttggaatcaa
gagaatacac aaagcacttg attagagtcg aaaattggat attcaggaac
840cctggcttcg cgttagcagc agctgccatc gcttggcttt tgggaagctc
aacgagccaa 900aaagtcatat acttggtcat gatactgctg attgccccgg
catacagcat caggtgcata 960ggagtcagca atagggactt tgtggaaggt
atgtcaggtg ggacctgggt tgatgttgtc 1020ttggaacatg gaggttgtgt
caccgtaatg gcacaggaca aaccgactgt cgacatagag 1080ctggttacaa
caacagtcag caacatggcg gaggtaagat cctactgcta tgaggcatca
1140atatcagaca tggcttcgga cagccgctgc ccaacacaag gtgaagccta
ccttgacaag 1200caatcagaca ctcaatatgt ctgcaaaaga acgttagtgg
acagaggctg gggaaatgga 1260tgtggacttt ttggcaaagg gagcctggtg
acatgcgcta agtttgcatg ctccaagaaa 1320atgaccggga agagcatcca
gccagagaat ctggagtacc ggataatgct gtcagttcat 1380ggctcccagc
acagtgggat gattgttaat gacacaggac atgaaactga tgagaataga
1440gcgaaagttg agataacgcc caattcacca agagccgaag ccaccctggg
gggttttgga 1500agcctaggac ttgattgtga accgaggaca ggccttgact
tttcagattt gtattacttg 1560actatgaata acaagcactg gttggttcac
aaggagtggt tccacgacat tccattacct 1620tggcacgctg gggcagacac
cggaactcca cactggaaca acaaagaagc actggtagag 1680ttcaaggacg
cacatgccaa aaggcaaact gtcgtggttc tagggagtca agaaggagca
1740gttcacacgg cccttgctgg agctctggag gctgagatgg atggtgcaaa
gggaaggctg 1800tcctctggcc acttgaaatg tcgcctgaaa atggataaac
ttagattgaa gggcgtgtca 1860tactccttgt gtactgcagc gttcacattc
accaagatcc cggctgaaac actgcacggg 1920acagtcacag tggaggtaca
gtacgcaggg acagatggac cttgcaaggt tccagctcag 1980atggcggtgg
acatgcaaac tctgacccca gttgggaggt tgataaccgc taaccccgta
2040atcactgaaa gcactgagaa ctctaagatg atgctggaac ttgatccacc
atttggggac 2100tcttacattg tcataggagt cggggagaag aagatcaccc
accactggca caggagtggc 2160agcaccattg gaaaagcatt tgaagccact
gtgagaggtg ccaagagaat ggcagtcttg 2220ggagacacag cctgggactt
tggatcagtt ggaggcgctc tcaactcatt gggcaagggc 2280atccatcaaa
tttttggagc agctttcaaa tcattgtttg gaggaatgtc ctggttctca
2340caaattctca ttggaacgtt gctgatgtgg ttgggtctga acacaaagaa
tggatctatt 2400tcccttatgt gcttggcctt agggggagtg ttgatcttct
tatccacagc cgtctctgct 2460gatgtggggt gctcggtgga cttctcaaag
aaggagacga gatgcggtac aggggtgttc 2520gtctataacg acgttgaagc
ctggagggac aggtacaagt accatcctga ctccccccgt 2580agattggcag
cagcagtcaa gcaagcctgg gaagatggta tctgcgggat ctcctctgtt
2640tcaagaatgg aaaacatcat gtggagatca gtagaagggg agctcaacgc
aatcctggaa 2700gagaatggag ttcaactgac ggtcgttgtg ggatctgtaa
aaaaccccat gtggagaggt 2760ccacagagat tgcccgtgcc tgtgaacgag
ctgccccacg gctggaaggc ttgggggaaa 2820tcgtacttcg tcagagcagc
aaagacaaat aacagctttg tcgtggatgg tgacacactg 2880aaggaatgcc
cactcaaaca tagagcatgg aacagctttc ttgtggagga tcatgggttc
2940ggggtatttc acactagtgt ctggctcaag gttagagaag attattcatt
agagtgtgat 3000ccagccgtta ttggaacagc tgttaaggga aaggaggctg
tacacagtga tctaggctac 3060tggattgaga gtgagaagaa tgacacatgg
aggctgaaga gggcccatct gatcgagatg 3120aaaacatgtg aatggccaaa
gtcccacaca ttgtggacag atggaataga agagagtgat 3180ctgatcatac
ccaagtcttt agctgggcca ctcagccatc acaataccag agagggctac
3240aggacccaaa tgaaagggcc atggcacagt gaagagcttg aaattcggtt
tgaggaatgc 3300ccaggcacta aggtccacgt ggaggaaaca tgtggaacaa
gaggaccatc tctgagatca 3360accactgcaa gcggaagggt gatcgaggaa
tggtgctgca gggagtgcac aatgccccca 3420ctgtcgttcc gggctaaaga
tggctgttgg tatggaatgg agataaggcc caggaaagaa 3480ccagaaagca
acttagtaag gtcaatggtg actgcaggat caactgatca catggaccac
3540ttctcccttg gagtgcttgt gattctgctc atggtgcagg aagggctgaa
gaagagaatg 3600accacaaaga tcatcataag cacatcaatg gcagtgctgg
tagctatgat cctgggagga 3660ttttcaatga gtgacctggc taagcttgca
attttgatgg gtgccacctt cgcggaaatg 3720aacactggag gagatgtagc
tcatctggcg ctgatagcgg cattcaaagt cagaccagcg 3780ttgctggtat
ctttcatctt cagagctaat tggacacccc gtgaaagcat gctgctggcc
3840ttggcctcgt gtcttttgca aactgcgatc tccgccttgg aaggcgacct
gatggttctc 3900atcaatggtt ttgctttggc ctggttggca atacgagcga
tggttgttcc acgcactgat 3960aacatcacct tggcaatcct ggctgctctg
acaccactgg cccggggcac actgcttgtg 4020gcgtggagag caggccttgc
tacttgcggg gggtttatgc tcctctctct gaagggaaaa 4080ggcagtgtga
agaagaactt accatttgtc atggccctgg gactaaccgc tgtgaggctg
4140gtcgacccca tcaacgtggt gggactgctg ttgctcacaa ggagtgggaa
gcggagctgg 4200ccccctagcg aagtactcac agctgttggc ctgatatgcg
cattggctgg agggttcgcc 4260aaggcagata tagagatggc tgggcccatg
gccgcggtcg gtctgctaat tgtcagttac 4320gtggtctcag gaaagagtgt
ggacatgtac attgaaagag caggtgacat cacatgggaa 4380aaagatgcgg
aagtcactgg aaacagtccc cggctcgatg tggcgctaga tgagagtggt
4440gatttctccc tggtggagga tgacggtccc cccatgagag agatcatact
caaggtggtc 4500ctgatgacca tctgtggcat gaacccaata gccataccct
ttgcagctgg agcgtggtac 4560gtatacgtga agactggaaa aaggagtggt
gctctatggg atgtgcctgc tcccaaggaa 4620gtaaaaaagg gggagaccac
agatggagtg tacagagtaa tgactcgtag actgctaggt 4680tcaacacaag
ttggagtggg agttatgcaa gagggggtct ttcacactat gtggcacgtc
4740acaaaaggat ccgcgctgag aagcggtgaa gggagacttg atccatactg
gggagatgtc 4800aagcaggatc tggtgtcata ctgtggtcca tggaagctag
atgccgcctg ggacgggcac 4860agcgaggtgc agctcttggc cgtgcccccc
ggagagagag cgaggaacat ccagactctg 4920cccggaatat ttaagacaaa
ggatggggac attggagcgg ttgcgctgga ttacccagca 4980ggaacttcag
gatctccaat cctagacaag tgtgggagag tgataggact ttatggcaat
5040ggggtcgtga tcaaaaatgg gagttatgtt agtgccatca cccaagggag
gagggaggaa 5100gagactcctg ttgagtgctt cgagccttcg atgctgaaga
agaagcagct aactgtctta 5160gacttgcatc ctggagctgg gaaaaccagg
agagttcttc ctgaaatagt ccgtgaagcc 5220ataaaaacaa gactccgtac
tgtgatctta gctccaacca gggttgtcgc tgctgaaatg 5280gaggaggccc
ttagagggct tccagtgcgt tatatgacaa cagcagtcaa tgtcacccac
5340tctggaacag aaatcgtcga cttaatgtgc catgccacct tcacttcacg
tctactacag 5400ccaatcagag tccccaacta taatctgtat attatggatg
aggcccactt cacagatccc 5460tcaagtatag cagcaagagg atacatttca
acaagggttg agatgggcga ggcggctgcc 5520atcttcatga ccgccacgcc
accaggaacc cgtgacgcat ttccggactc caactcacca 5580attatggaca
ccgaagtgga agtcccagag agagcctgga gctcaggctt tgattgggtg
5640acggatcatt ctggaaaaac agtttggttt gttccaagcg tgaggaacgg
caatgagatc 5700gcagcttgtc tgacaaaggc tggaaaacgg gtcatacagc
tcagcagaaa gacttttgag 5760acagagttcc agaaaacaaa acatcaagag
tgggactttg tcgtgacaac tgacatttca 5820gagatgggcg ccaactttaa
agctgaccgt gtcatagatt ccaggagatg cctaaagccg 5880gtcatacttg
atggcgagag agtcattctg gctggaccca tgcctgtcac acatgccagc
5940gctgcccaga ggagggggcg cataggcagg aatcccaaca aacctggaga
tgagtatctg 6000tatggaggtg ggtgcgcaga gactgacgaa gaccatgcac
actggcttga agcaagaatg 6060ctccttgaca atatttacct ccaagatggc
ctcatagcct cgctctatcg acctgaggcc 6120gacaaagtag cagccattga
gggagagttc aagcttagga cggagcaaag gaagaccttt 6180gtggaactca
tgaaaagagg agatcttcct gtttggctgg cctatcaggt tgcatctgcc
6240ggaataacct acacagatag aagatggtgc tttgatggca cgaccaacaa
caccataatg 6300gaagacagtg tgccggcaga ggtgtggacc agacacggag
agaaaagagt gctcaaaccg 6360aggtggatgg acgccagagt ttgttcagat
catgcggccc tgaagtcatt caaggagttt 6420gccgctggga aaagaggagc
ggcttttgga gtgatggaag ccctgggaac actgccagga 6480cacatgacag
agagattcca ggaagccatt gacaacctcg ctgtgctcat gcgggcagag
6540actggaagca ggccttacaa agccgcggcg gcccaattgc cggagaccct
agagaccatt 6600atgcttttgg ggttgctggg aacagtctcg ctgggaatct
tcttcgtctt gatgaggaac 6660aagggcatag ggaagatggg ctttggaatg
gtgactcttg gggccagcgc atggctcatg 6720tggctctcgg aaattgagcc
agccagaatt gcatgtgtcc tcattgttgt gtttctattg 6780ctggtggtgc
tcatacctga gccagaaaag caaagatctc cccaggacaa ccaaatggca
6840atcatcatca tggtagcagt aggtcttctg ggcttgatta ccgccaatga
actcggatgg 6900ttggagagaa caaagagtga cctaagccat ctaatgggaa
ggagagagga gggggcaacc 6960ataggattct caatggacat tgacctgcgg
ccagcctcag cttgggccat ctatgctgcc 7020ttgacaactt tcattacccc
agccgtccaa catgcagtga ccacttcata caacaactac 7080tccttaatgg
cgatggccac gcaagctgga gtgttgtttg gtatgggcaa agggatgcca
7140ttctacgcat gggactttgg agtcccgctg ctaatgatag gttgctactc
acaattaaca 7200cccctgaccc taatagtggc catcattttg ctcgtggcgc
actacatgta cttgatccca 7260gggctgcagg cagcagctgc gcgtgctgcc
cagaagagaa cggcagctgg catcatgaag 7320aaccctgttg tggatggaat
agtggtgact gacattgaca caatgacaat tgacccccaa 7380gtggagaaaa
agatgggaca ggtgctactc atagcagtag ccgtctccag cgccatactg
7440tcgcggaccg cctgggggtg gggggaggct ggggccctga tcacagccgc
aacttccact 7500ttgtgggaag gctctccgaa caagtactgg aactcctcta
cagccacttc actgtgtaac 7560atttttaggg gaagttactt ggctggagct
tctctaatct acacagtaac aagaaacgct 7620ggcttggtca agagacgtgg
gggtggaaca ggagagaccc tgggagagaa atggaaggcc 7680cgcttgaacc
agatgtcggc cctggagttc tactcctaca aaaagtcagg catcaccgag
7740gtgtgcagag aagaggcccg ccgcgccctc aaggacggtg tggcaacggg
aggccatgct 7800gtgtcccgag gaagtgcaaa gctgagatgg ttggtggagc
ggggatacct gcagccctat 7860ggaaaggtca ttgatcttgg atgtggcaga
gggggctgga gttactacgc cgccaccatc 7920cgcaaagttc aagaagtgaa
aggatacaca aaaggaggcc ctggtcatga agaacccgtg 7980ttggtgcaaa
gctatgggtg gaacatagtc cgtcttaaga gtggggtgga cgtctttcat
8040atggcggctg agccgtgtga cacgttgctg tgtgacatag gtgagtcatc
atctagtcct 8100gaagtggaag aagcacggac gctcagagtc ctctccatgg
tgggggattg gcttgaaaaa 8160agaccaggag ccttttgtat aaaggtgttg
tgcccataca ccagcactat gatggaaacc 8220ctggagcgac tgcagcgtag
gtatggggga ggactggtca gagtgccact ctcccgcaac 8280tctacacatg
agatgtattg ggtctctgga gcgaaaagca acaccataaa aagtgtgtcc
8340accacgagcc agctcctctt ggggcgcatg gacgggccta ggaggccagt
gaaatatgag 8400gaggatgtga atctcggctc tggcacgcgg gctgtggtaa
gctgcgctga agctcccaac 8460atgaagatca ttggtaaccg cattgaaagg
atccgcagtg agcacgcgga aacgtggttc 8520tttgacgaga accacccata
taggacatgg gcttaccatg gaagctatga ggcccccaca 8580caagggtcag
cgtcctctct aataaacggg gttgtcaggc tcctgtcaaa accctgggat
8640gtggtgactg gagtcacagg aatagccatg accgacacca caccgtatgg
tcagcaaaga 8700gttttcaagg aaaaagtgga cactagggtg ccagaccccc
aagaaggcac tcgtcaggtt 8760atgagcatgg tctcttcctg gttgtggaaa
gagctaggca aacacaaacg gccacgagtc 8820tgtaccaaag aagagttcat
caacaaggtt cgtagcaatg cagcattagg ggcaatattt 8880gaagaggaaa
aagagtggaa gactgcagtg gaagctgtga acgatccaag gttctgggct
8940ctagtggata aggaaagaga gcaccacctg agaggagagt gccagagttg
tgtgtacaac 9000atgatgggaa aaagagaaaa gaaacaaggg gaatttggaa
aggccaaggg cagccgcgcc 9060atctggtata tgtggctagg ggctagattt
ctagagttcg aagcccttgg attcttgaac 9120gaggatcact ggatggggag
agagaactca ggaggtggtg ttgaagggct gggattacaa 9180agactcggat
atgtcctaga agagatgagt cgtataccag gaggaaggat gtatgcagat
9240gacactgctg gctgggacac ccgcatcagc aggtttgatc tggagaatga
agctctaatc 9300accaaccaaa tggaaaaagg gcacagggcc ttggcattgg
ccataatcaa gtacacatac 9360caaaacaaag tggtaaaggt ccttagacca
gctgaaaaag ggaaaacagt tatggacatt 9420atttcgagac aagaccaaag
ggggagcgga caagttgtca cttacgctct taacacattt 9480accaacctag
tggtgcaact cattcggaat atggaggctg aggaagttct agagatgcaa
9540gacttgtggc tgctgcggag gtcagagaaa gtgaccaact ggttgcagag
caacggatgg 9600gataggctca aacgaatggc agtcagtgga gatgattgcg
ttgtgaagcc aattgatgat 9660aggtttgcac atgccctcag gttcttgaat
gatatgggaa aagttaggaa ggacacacaa 9720gagtggaaac cctcaactgg
atgggacaac tgggaagaag ttccgttttg ctcccaccac 9780ttcaacaagc
tccatctcaa ggacgggagg tccattgtgg ttccctgccg ccaccaagat
9840gaactgattg gccgggcccg cgtctctcca ggggcgggat ggagcatccg
ggagactgct 9900tgcctagcaa aatcatatgc gcaaatgtgg cagctccttt
atttccacag aagggacctc 9960cgactgatgg ccaatgccat ttgttcatct
gtgccagttg actgggttcc aactgggaga 10020actacctggt caatccatgg
aaagggagaa tggatgacca ctgaagacat gcttgtggtg 10080tggaacagag
tgtggattga ggagaacgac cacatggaag acaagacccc agttacgaaa
10140tggacagaca tcccctattt gggaaaaagg gaagacttgt ggtgtggatc
tctcataggg 10200cacagaccgc gcaccacctg ggctgagaac attaaaaaca
cagtcaacat ggtgcgcagg 10260atcataggtg atgaagaaaa gtacatggac
tacctatcca cccaagttcg ctacttgggt 10320gaagaagggt ctacacctgg
agtgctgtaa gcaccagtct taatgttgtc aggcctgcta 10380gtcagccaca
gcttggggaa agctgtgcag cctgtgaccc ccccaggaga agctgggaaa
10440ccaagcctat agtcaggccg agaacgccat ggcacggaag aagccatgct
gcctgtgagc 10500ccctcagagg acactgagtc aaaaaacccc acgcgcttgg
aggcgcagga tgggaaaaga 10560aggtggcgac cttccccacc cttcaatctg
gggcctgaac tggagatcag ctgtggatct 10620ccagaagagg gactagtggt
tagaggag 106481010676DNAZika virus 10gttgttactg ttgctgactc
agactgcgac agttcgagtt tgaagcgaaa gctagcaaca 60gtatcaacag gttttatttg
gatttggaaa cgagagtttc tggtcatgaa aaacccaaaa 120aagaaatccg
gaggattccg gattgtcaat atgctaaaac gcggagtagc ccgtgtgagc
180ccctttgggg gcttgaagag gctgccagcc ggacttctgc tgggtcatgg
gcccatcagg 240atggtcttgg caattctagc ctttttgaga ttcacggcaa
tcaagccatc actgggtctc 300atcaatagat ggggttcagt ggggaaaaaa
gatgctatgg aaataataaa gaagttcaag 360aaagatctgg ctgccatgct
gagaataatc aatgctagga aggagaagaa gagacgaggc 420gcagatacta
gtgtcggaat tgttggcctc ctgctgacca cagctatggc agcggaggtc
480actagacgtg ggagtgcata ctatatgtac ttggacagaa acgatgctgg
ggaggccata 540tcttttccaa ccacattggg gatgaataag tgttatatac
agatcatgga tcttggacac 600atgtgtgatg ccaccatgag ctatgaatgc
cctatgctgg atgagggggt ggaaccagat 660gacgtcgatt gttggtgcaa
cacgacgtca acttgggttg tgtacggaac ctgccatcac 720aaaaaaggtg
aagcacggag atctagaaga gctgtgacgc tcccttccca ttccactagg
780aagctgcaaa cgcggtcgca aacctggttg gaatcaagag aatacacaaa
gcacttgatt 840agagtcgaaa attggatatt caggaaccct ggcttcgcgt
tagcagcagc tgccatcgct 900tggcttttgg gaagctcaac gagccaaaaa
gtcatatact tggtcatgat actgctgatt 960gccccggcat acagcatcag
gtgcatagga gtcagcaata gggactttgt ggaaggtatg 1020tcaggtggga
cttgggttga tgttgtcttg gaacatggag gttgtgtcac cgcaatggca
1080caggacaaac cgactgtcga catagagctg gttacaacaa cagtcagcaa
catggcggag 1140gtaagatcct actgctatga ggcatcaata tcagacatgg
cttcggacag ccgctgccca 1200acacaaggtg aagcctacct tgacaagcaa
tcagacactc aatatgtttg caaaagaacg 1260ttagtggaca gaggctgggg
aaatggatgt ggactttttg gcaaagggag tctggtgaca 1320tgcgctaagt
ttgcatgctc caagaaaatg accgggaaga gcatccagcc agagaatctg
1380gagtaccgga taatgctgtc agttcatggc tcccagcaca gtgggatgct
cgttaatgac 1440acaggacatg aaactgatga gaatagagcg aaggttgaga
taacgcccaa ttcaccaaga 1500gccgaagcca ccctgggggg ttttggaagc
ctaggacttg attgtgaacc gaggacaggc 1560cttgactttt cagatttgta
ttacttgact atgaataaca agcactggtt ggctcacaag 1620gagtggttcc
acgacattcc attaccttgg cacgctgggg cagccaccgg aactccacac
1680tggaacaaca aagaagcact ggtagagttc aaggacgcac atgccaaaag
gcaaactgtc 1740gtggttctag ggagtcaaga aggagcagtt cacacggccc
ttgctggagc tctggaggct 1800gagatggatg gtgcaaaggg aaggctgtcc
tctggccact tgaaatgtcg cctgaaaatg 1860gataaactta gattgaaggg
cgtgtcatac tccttgtgta ccgcagcgtt cacattcacc 1920aagatcccgg
ctgaaacagt ggacgggaca gtcacagtgg agggacagta cggagggaca
1980gatggacctt gcaaggttcc agctcagatg gcggtggaca tgcagactct
gaccccagtt 2040gggaggttga taaccgctaa ccccgtaatc actgaaagca
ctgagaactc taagatgatg 2100ctggaacttg atccaccatt tggggactct
tacattgtca taggagtcgg ggagaagaag 2160atcacccacc actggcacag
gagtggcagc accattggaa aagcatttga agccactgtg 2220agaggtgcca
agagaatggc agtcttggga gacacagcct gggactttgg atcagttgga
2280ggcgctctca actcattggg caagggcatc catcaaatta ttggagcagc
tttcaaatca 2340ttgtttggag gaatgtcctg gttctcacaa attctcattg
ggacgttgct gatgtggttg 2400ggtctgaaca caaagaatgg atctatttcc
cttatgtgct tggccttagg gggagtgttg 2460atcttcttat ccacagccgt
ctcaggtggt gtggggtgct cggtggactt ctcaaagaag 2520gagacgagat
gcggtacagg ggtgttcgtc tataacgatg ttgaagcctg gagggacagg
2580tacaagtacc atcctgactc cccccgtaga ttggcagcag cagtcaagca
agcctgggaa 2640gatggtatct gcgggatctc ctctgtttca agaatggaaa
acatcatgtg gagatcagta 2700gaaggggagc tcaacgcaat cctggaagag
aatggagttc aactgacggt cgttgtggga 2760tctgtaaaaa accccatgtg
gagaggtcca cagagattgc ccgtgcctgt gaacgagctg 2820ccccacggct
ggaaggcttg ggggaaatcg tacttcgtca gagcagcaaa gacaaataac
2880agctttgtcg tggatggtga cacactgaag gaatgcccac tcaaacatag
agcatggaac 2940agctttcttg tggaggatca tgggttcggg gtatttcaca
ctagtgtctg gctcaaggtt 3000agagaagact attggttaga gtgtgatcca
gccgttattg gaacagctgt taagggaaag 3060gaggctgtac acagtgatct
aggctactgg attgagagtg agaagaatga cacatggtgg 3120ctgaagaggg
cccatctgat cgagatgaaa acatgtgaat ggccaaagtc ccacacattg
3180tggacagatg gaatagaaga gagtgatctg atcataccca agtctttagc
tgggccactc 3240agccatcaca atgccagaga gggctacagg acccaaatga
aagggccatg gcacagtgaa 3300gagcttgaaa ttcggtttga ggaatgccca
ggcactaagg tccacgtgga ggaaacatgt 3360ggaacaagag gaccatctct
gagatcaacc actgcaagcg gaagggtgat cgaggaatgg 3420tgctccaggg
agtgcacaat gcccccactg tccttccagg ctaaagatgg ctgttggtat
3480ggaatggaga taaggcccag gaaagaacca gaaagcaact tagtaaggtc
aatggtgact 3540gcaggatcaa ctgatcacat ggatcacttc tcccttggag
tgcttgtgat tctgctcatg 3600gtgcaggaag ggctgaagaa gagaatgacc
acaaagatca tcataagcac atcaatggca 3660gtgctggtag ctatgatcct
gggaggattt tcaatgagtg acctggctaa gcttgcaatt 3720ttgatgggtg
ccaccttcgc ggaaatgaac actggaggag atgtagctca tctggcgctg
3780atagcggcat tcaaagtcag accagcgttg ctggtatctt tcatcttcag
agctaattgg
3840acaccccgtg aaagcatgct gctggccttg gcctcgtgtc ttttgcaaac
tgcgatctcc 3900gccttggaag gcgacctgat ggttctcatc aatggttttg
ctttggcctg gttggcaata 3960cgagcgatgg ttgttccacg cactgataac
atcaccttag caatcctggc tgctctgaca 4020ccactggccc ggggcacact
gcttgtggcg tggagagcag gccttgctac ttgcgggggg 4080tttatgctcc
tctctctgaa gggaaaaggc agtgtgaaga agaacttacc atttgtcatg
4140gccctgggac taaccgctgt gaggctggtc gaccccatca acgtggtggg
actgctgttg 4200ctcacaagga gtgggaagcg gagctggccc cctagcgaag
tactcacagc tgttggcctg 4260atatgcgcat tggctggagg gttcgccaag
gcagatatag agatggctgg gcccatggcc 4320gcggtcggtc tgctaattgt
cagttacgtg gtctcaggaa agagtgtgga catgtacatt 4380gaaagagcag
gtgacatcac atgggaaaaa gatgcggaag tcactggaaa cagtccccgg
4440ctcgatgtgg cgctagatga gagtggtgat ttctccctgg tggaggatga
cggtcccccc 4500atgagagaga tcatactcaa ggtggtcctg atgaccatct
gtggcatgaa cccaatagcc 4560ataccctttg cagctggagc gtggtacgta
tacgtgaaga ctggaaaaag gagtggtgct 4620ctatgggatg tgcctgctcc
caaggaagta aaaaaggggg agaccacaga tggagtgtac 4680agagtaatga
ctcgcagact gctaggttca acacaagttg gagtgggagt tatgcaagag
4740ggggtctttc acactatgtg gcacgtcaca aaaggatccg cgctgagaag
cggtgaaggg 4800agacttgatc catactgggg agatgtcaag caggatctgg
tgtcatactg tggtccatgg 4860aagctagatg ccgcctggga cgggcacagc
gaggtgcagc tcttggccgt gccccccgga 4920gagagagcga ggaacatcca
gactctgccc ggaatattta agacaaagga tggggacatt 4980ggagcggttg
cactggatta cccagcagga acttcaggat ctccaatcct agacaagtgt
5040gggagagtga taggacttta tggcaatggg gtcgtgatca aaaatgggag
ttatgttagt 5100gccatcaccc aagggaggag ggaggaagag actcctgttg
agtgcttcga gccttcgatg 5160ctgaagaaga agcagctaac tgtcttagac
ttgcatcctg gagctgggaa aaccaggaga 5220gttcttcctg aaatagtccg
tgaagccata aaaacaagac tccgtactgt gatcttggct 5280ccaaccaggg
ttgtcgctgc tgaaatggag gaggccctta gagggcttcc agtgcgttat
5340atgacaacag cagtcaatgt cacccactct ggaacagaaa tcgtcgactt
aatgtgccat 5400gccaccttca cttcacgtct actacagcca attagagtcc
ccaactataa tctgtatatt 5460atggatgagg cccacttcac agatccctca
agtatagcag caagaggata catttcaaca 5520agggttgaga tgggcgaggc
ggctgccatc ttcatgaccg ccacgccacc aggaacccgt 5580gacgcatttc
cggactccaa ctcaccaatt atggacaccg aagtggaagt cccagagaga
5640gcctggagct caggctttga ttgggtgacg gagtattctg gaaaaacagt
ttggtttgtt 5700ccacgcgtga ggaacggcaa tgagatcgca gcttgtctga
caaaggctgg aaaacgggtc 5760atacagctca gcagaaagac ttttgagaca
gagttccaga aaacaaaaca tcaagagtgg 5820gactttgtcg tgacaactga
catttcagag atgggcgcca actttaaagc tgaccgtgtc 5880atagattcca
ggagatgcct aaagccggtc atacttggtg gcgagagagt cattctggct
5940ggacccatgc ctgtcacaca tgccagcgct gcccagagga gggggcgcat
aggcaggaat 6000cccaacaaac ctggagatga gtatctgtat ggaggtgggt
gcgcagagac tgacgaagac 6060catgcacact ggcttgaagc aagaatgctc
cttgacaata tttacctcca agatggcctc 6120atagcctcgc tctatcgacc
tgaggccgac aaagtagcag ccattgaggg agagttcaag 6180cttaggacgg
agcaaaggaa gacctttgtg gaactcatga aaagaggaga tcttcctgtt
6240tggctggcct atcaggttgc atctgccgga ataacctaca cagatagaag
atggtgcttt 6300gatggcacga ccaacaacac cataatggaa gacagtgtgc
cggcagaggt gtggaccaga 6360cacggagaga aaagagtgct caaaccgagg
tggatggacg ccagagtttg ttcagatcat 6420gcggccctga agtcattcaa
ggagtttgcc gctgggaaaa gaggagcggc ttttggagtg 6480atggaagccc
tgggaacact gccaggacac atgacagaga gattccagga agccattgac
6540aacctcgctg tgctcatgcg ggcagagact ggaagcaggc cttacaaagc
cgcggcggcc 6600caattgccgg agaccctaga gaccattatg cttttggggt
tgctgggaac agtctcgctg 6660ggaatctttt tcgtcttgat gaggaacaag
ggcataggga agatgggctt tggaatggtg 6720actcttgggg ccagcgcatg
gctcatgtgg ctctcggaaa ttgagccagc cagaattgca 6780tgtgtcctca
ttgttgtgtt cctattgctg gtggtgctca tacctgagcc agaaaagcaa
6840agatctcccc aggacaacca aatggccatc atcatcatgg tagcagtagg
tcttctgggc 6900ttgattaccg ccaatgaact cggatggttg gagagaacaa
agagtgacct aagccatcta 6960atgggaagga gagaggaggg ggcaaccatg
ggattctcaa tggacattga cctgcggcca 7020gcctcagctt gggccatcta
tcctgccttg acatctttca ttaccccagc cgtccaacat 7080gcagtgacca
cttcatacaa caactactcc ttaatggcga tggccacgca agctggagtg
7140ttgtttggta tgggcaaagg gatgccattc tacgcatggg actttggagt
cccgctgcta 7200atgataggtt gctactcaca attaacgccc ctgaccctaa
tagtggccat cattttgctc 7260gtggcgcact acatgtactt gatcccaggg
ctgcaggcag cagctgcgcg tgctgcccag 7320aagagaacgg cagctggcat
catgaagaac cctgttgtgg agggaatagt ggtgactgac 7380attgacacaa
tgacaattga cccccaagtg gagaaaaaga tgggacaggt gctactcatg
7440gcagtagccg tctccagcgc catactgtcg aggaccgcct gggggtgggg
ggaggctggg 7500gccctgatca cagccgcaac ttccactttg tgggaaggct
ctccgaacaa gtactggaac 7560tcctctacag ccacctcact gtgtaacatt
tttaggggaa gttacttggc tggagcttct 7620ctaatctaca cagtaacaag
aaacgctggc ttggtcaaga gacgtggggg tggaacagga 7680gagaccctgg
gagagaaatg gaaggcccgc ttgaaccaga tgtcggccct ggagttctac
7740tcctacaaaa agtcaggcat caccgaggtg tgcagagaag aggcccgccg
cgccctcaag 7800gacggtgtgg caacgggagg ccatgctgtg tcccgaggaa
gtgcaaagct gagatggttg 7860gtggagcggg gatacctgca gccctatgga
aaggtcattg atcttggatg tggcagaggg 7920ggctggagtt actacgccgc
caccatccgc aaagttcaag aagtgaaagg atacacaaaa 7980ggaggccctg
gtcatgaaga acccgtgttg gtgcaaagct atgggtggaa catagtccgt
8040cttaagagtg gggtggacgt ctttcatatg gcggctgagc cgtgtgacac
gttgctgtgt 8100gacataggtg agtcatcatc tagtcctgaa gtggaagaag
cacggacgct cagagtcctc 8160tccatggtgg gggattggct tgaaaaaaga
ccaggagcct tttgtataaa agtgttgtgc 8220ccatacacca gcactatgat
ggaaaccctg gagcgactgc agcgtaggta tgggggagga 8280ctggtcagag
tgccactctc ccgcaactct acacatgaga tgtactgggt ctctggagcg
8340aaaagcaaca ccataaaaag tgtgtccacc acgagccagc tcctcttggg
gcgcatggac 8400gggcctagga ggccagtgaa atatgaggag gatgtgaatc
tcggctctgg cacgcgggct 8460gtggtaagct gcgctgaagc tcccaacatg
aagatcattg gtaaccgcat tgaaaggatc 8520cgcgctgaga aagcggaaac
gtggttcttt gacgagaacc acccatatag gacatgggct 8580taccatggaa
gctatgatgc cgccacacaa gggtcagcgt cctctctaat aaacggggtt
8640gtcaggctcc tgtcaaaacc ctgggatgtg gtgactggag tcacaggaat
agccatgacc 8700gacaccacac cgtatggtca gcaaagagtt ttcaaggaaa
aagtggacac tagggtgcca 8760gacccccaag aaggcactcg tcaggttatg
agcatggtct cttcctggtt gtggaaagag 8820ctaggcaaac acaaacggcc
acgagtctgt accaaagaag agttcatcaa caaggttcgt 8880agcaatgcag
cattaggggc aatatttgaa gaggaaaaag agtggaagac tgcagtggaa
8940gctgtgaacg atccaaggtt ctgggctcta gtggacaagg aaagagagca
ccacctgaga 9000ggagagtgcc agagttgtgt gtacatcaca atgggaaaaa
gagaaaagaa acaaggggaa 9060tttggaaagg ccaagggcag ccgcgccatc
tggtatatgt ggctaggggc tagatttcta 9120gagttcgaag cccttggatt
cttgaacgag gatcactgga tggggagaga gaactcagga 9180ggtggtgttg
aagggctggg attacaaaga ctcggatatg tcctagaaga gatgagtcgc
9240ataccaggag gaaggatgta tgcagatgac actgctggct gggacacccg
catcagcagg 9300tttgatctgg agaatgaagc tctaatcacc aaccaaatgg
agaaagggca cagggccttg 9360gcattggcca taatcaagta cacataccaa
aacaaagtgg taaaggtcct tagaccagct 9420gaaaaaggga agacagttat
ggacattatt tcgagacaag accaaagggg gagcggacaa 9480gttgtcactt
acgctctcaa cacatttacc aacctagtgg tgcaactcat tcggaatatg
9540gaggctgagg aagttctaga gatgcaagac ttgtggctgc tgcggaggtc
agagaaagtg 9600accaactggt tgcagagcaa cggatgggat aggctcaaac
gaatggcggt cagtggagat 9660gattgcgttg tgaaaccaat tgatgatagg
tttgcacatg ccctcaggtt cttgaatgat 9720atgggaaaag ttaggaagga
cacacaagag tggaaaccct caactggatg ggacaactgg 9780gaagaagttc
ccttctgctc ccaccacttc aacaagctcc atctcaagga cgggaggtcc
9840attgtggttc cctgccgcca ccaagatgaa ctgattggcc gggcccgcgt
ctctccaggg 9900gcgggatgga gcatccggga gactgcttgc ctagcaaaat
catatgcgca aatgtggcag 9960ctcctttatt tccacagaag ggacctccga
ctgatggcca atgccatttg ttcatctgtg 10020ccagttgact gggttccaac
tgggagaact acctggtcaa tccatggaaa gggagaatgg 10080atgaccactg
aagacatgct tgtggcgtgg aacagagtgt ggattgagga gaacgaccac
10140atggaagaca agaccccagt cacgaaatgg acagacattc cctatttggg
aaaaagggaa 10200gacttgtggt gtggatctct catagggcac agaccgcgca
ccacctgggc tgagaacatt 10260aaaaacacag tcaacatggt gcgcaggatc
ataggtgatg aagaaaagta catggactac 10320ctatccaccc aagttcgcta
cttgggtgaa gaagggtcta cacctggagt gctgtaagca 10380ccaatcttaa
tgttgtcagg cctgctagtc agccacagct tggggaaagc tgtgcagcct
10440gtgacccccc caggagaagc tgggaaacca agcctatagt caggccgaga
acgccatggc 10500acggaagaag ccatgctgcc tgtgagcccc tcagaggaca
ctgagtcaaa aaaccccacg 10560cgcttggagg cgcaggatgg gaaaagaagg
tggcgacctt ccccaccctt caatctgggg 10620cctgaactgg agatcagctg
tggatctcca gaagagggac tagtggttag aggaga 106761110807DNAZika virus
11agttgttgat ctgtgtgagt cagactgcga cagttcgagt ctgaagcgag agctaacaac
60agtatcaaca ggtttaattt ggatttggaa acgagagttt ctggtcatga aaaacccaaa
120gaagaaatcc ggaggattcc ggattgtcaa tatgctaaaa cgcggagtag
cccgtgtaaa 180ccccttggga ggtttgaaga ggttgccagc cggacttctg
ctgggtcatg gacccatcag 240aatggttttg gcgatactag cctttttgag
atttacagca atcaagccat cactgggcct 300tatcaacaga tggggttccg
tggggaaaaa agaggctatg gaaataataa agaagttcaa 360gaaagatctt
gctgccatgt tgagaataat caatgctagg aaagagagga agagacgtgg
420cgcagacacc agcatcggaa tcattggcct cctgctgact acagccatgg
cagcagagat 480cactagacgc gggagtgcat actacatgta cttggatagg
agcgatgccg ggaaggccat 540ttcgtttgct accacattgg gagtgaacaa
gtgccacgta cagatcatgg acctcgggca 600catgtgtgac gccaccatga
gttatgagtg ccctatgctg gatgagggag tggaaccaga 660tgatgtcgat
tgctggtgca acacgacatc aacttgggtt gtgtacggaa cctgtcatca
720caaaaaaggt gaggcacggc gatctagaag agccgtgacg ctcccttctc
actctacaag 780gaagttgcaa acgcggtcgc agacctggtt agaatcaaga
gaatacacga agcacttgat 840caaggttgaa aactggatat tcaggaaccc
cgggtttgcg ctagtggccg ttgccattgc 900ctggcttttg ggaagctcga
cgagccaaaa agtcatatac ttggtcatga tactgctgat 960tgccccggca
tacagtatca ggtgcattgg agtcagcaat agagacttcg tggagggcat
1020gtcaggtggg acctgggttg atgttgtctt ggaacatgga ggctgcgtta
ccgtgatggc 1080acaggacaag ccaacagttg acatagagtt ggtcacgacg
acggttagta acatggccga 1140ggtaagatcc tattgctacg aggcatcgat
atcggacatg gcttcggaca gtcgttgccc 1200aacacaaggt gaagcctacc
ttgacaagca atcagacact caatatgtct gcaaaagaac 1260attagtggac
agaggttggg gaaacggttg tggacttttt ggcaaaggga gcttggtgac
1320atgtgccaag tttacgtgtt ctaagaagat gaccgggaag agcattcaac
cggaaaatct 1380ggagtatcgg ataatgctat cagtgcatgg ctcccagcat
agcgggatga ctgtcaatga 1440tataggatat gaaactgacg aaaatagagc
gaaagtcgag gttacgccta attcaccaag 1500agcggaagca accttgggag
gctttggaag cttaggactt gactgtgaac caaggacagg 1560ccttgacttt
tcagatctgt attacctgac catgaacaat aagcattggt tggtgcacaa
1620agagtggttt catgacatcc cattgccttg gcatgctggg gcagacactg
gaactccaca 1680ctggaacaac aaagaggcat tggtagaatt caaggatgcc
cacgccaaga ggcaaaccgt 1740cgtcgttctg gggagccagg aaggagccgt
tcacacggct ctcgctggag ctctagaggc 1800tgagatggat ggtgcaaagg
gaaagctgtt ctctggccat ttgaaatgcc gcctaaaaat 1860ggacaagctt
agattgaagg gcgtgtcata ttccttgtgc actgcggcat tcacattcac
1920caaggtccca gctgaaacac tgcatggaac agtcacagtg gaggtgcagt
atgcagggac 1980agatggaccc tgcaagatcc cagtccagat ggcggtggac
atgcagaccc tgaccccagt 2040tggaaggctg ataaccgcca accccgtgat
tactgaaagc actgagaact caaagatgat 2100gttggagctt gacccaccat
ttggggattc ttacattgtc ataggagttg gggacaagaa 2160aatcacccac
cactggcata ggagtggtag caccatcgga aaggcatttg aggccactgt
2220gagaggcgcc aagagaatgg cagtcctggg ggatacagcc tgggacttcg
gatcagtcgg 2280gggtgtgttc aactcactgg gtaagggcat tcaccagatt
tttggagcag ccttcaaatc 2340actgtttgga ggaatgtcct ggttctcaca
gatcctcata ggcacgctgc tagtgtggtt 2400aggtttgaac acaaagaatg
gatctatctc cctcacatgc ttggccctgg ggggagtgat 2460gatcttcctc
tccacggctg tttctgctga cgtggggtgc tcagtggact tctcaaaaaa
2520ggaaacgaga tgtggcacgg gggtattcat ctataatgat gttgaagcct
ggagggaccg 2580gtacaagtac catcctgact ccccccgcag attggcagca
gcagtcaagc aggcctggga 2640agaggggatc tgtgggatct catccgtttc
aagaatggaa aacatcatgt ggaaatcagt 2700agaaggggag ctcaatgcta
tcctagagga gaatggagtt caactgacag ttgttgtggg 2760atctgtaaaa
aaccccatgt ggagaggtcc acaaagattg ccagtgcctg tgaatgagct
2820gccccatggc tggaaagcct gggggaaatc gtattttgtt agggcggcaa
agaccaacaa 2880cagttttgtt gtcgacggtg acacactgaa ggaatgtccg
cttgagcaca gagcatggaa 2940tagttttctt gtggaggatc acgggtttgg
agtcttccac accagtgtct ggcttaaggt 3000cagagaagat tactcattag
aatgtgaccc agccgtcata ggaacagctg ttaagggaag 3060ggaggccgcg
cacagtgatc tgggctattg gattgaaagt gaaaagaatg acacatggag
3120gctgaagagg gcccacctga ttgagatgaa aacatgtgaa tggccaaagt
ctcacacatt 3180gtggacagat ggagtagaag aaagtgatct tatcataccc
aagtctttag ctggtccact 3240cagccaccac aacaccagag agggttacag
aacccaagtg aaagggccat ggcacagtga 3300agagcttgaa atccggtttg
aggaatgtcc aggcaccaag gtttacgtgg aggagacatg 3360cggaactaga
ggaccatctc tgagatcaac tactgcaagt ggaagggtca ttgaggaatg
3420gtgctgtagg gaatgcacaa tgcccccact atcgtttcga gcaaaagacg
gctgctggta 3480tggaatggag ataaggccca ggaaagaacc agagagcaac
ttagtgaggt caatggtgac 3540agcggggtca accgatcata tggaccactt
ctctcttgga gtgcttgtga ttctactcat 3600ggtgcaggag gggttgaaga
agagaatgac cacaaagatc atcatgagca catcaatggc 3660agtgctggta
gtcatgatct tgggaggatt ttcaatgagt gacctggcca agcttgtgat
3720cctgatgggt gctactttcg cagaaatgaa cactggagga gatgtagctc
acttggcatt 3780ggtagcggca tttaaagtca gaccagcctt gctggtctcc
ttcattttca gagccaattg 3840gacaccccgt gagagcatgc tgctagccct
ggcttcgtgt cttctgcaaa ctgcgatctc 3900tgctcttgaa ggtgacttga
tggtcctcat taatggattt gctttggcct ggttggcaat 3960tcgagcaatg
gccgtgccac gcactgacaa catcgctcta ccaatcttgg ctgctctaac
4020accactagct cgaggcacac tgctcgtggc atggagagcg ggcctggcta
cttgtggagg 4080gatcatgctc ctctccctga aagggaaagg tagtgtgaag
aagaacctgc catttgtcat 4140ggccctggga ttgacagctg tgagggtagt
agaccctatt aatgtggtag gactactgtt 4200actcacaagg agtgggaagc
ggagctggcc ccctagtgaa gttctcacag ccgttggcct 4260gatatgtgca
ctggccggag ggtttgccaa ggcagacatt gagatggctg gacccatggc
4320tgcagtaggc ttgctaattg tcagctatgt ggtctcggga aagagtgtgg
acatgtacat 4380tgaaagagca ggtgacatca catgggaaaa ggacgcggaa
gtcactggaa acagtcctcg 4440gcttgacgtg gcactggatg agagtggtga
tttctccttg gtagaggaag atggtccacc 4500catgagagag atcatactta
aggtggtcct gatggccatc tgtggcatga acccaatagc 4560tatacctttt
gctgcaggag cgtggtatgt gtatgtgaag actgggaaaa ggagtggcgc
4620cctctgggac gtgcctgctc ccaaagaagt gaagaaagga gagaccacag
atggagtgta 4680cagagtgatg actcgcagac tgctaggttc aacacaggtt
ggagtgggag tcatgcaaga 4740gggagtcttc cacaccatgt ggcacgttac
aaaaggagcc gcactgagga gcggtgaggg 4800aagacttgat ccatactggg
gggatgtcaa gcaggacttg gtgtcatact gtgggccttg 4860gaagttggat
gcagcttggg atggactcag cgaggtacag cttttggccg tacctcccgg
4920agagagggcc agaaacattc agaccctgcc tggaatattc aagacaaagg
acggggacat 4980cggagcagtt gctctggact accctgcagg gacctcagga
tctccgatcc tagacaaatg 5040tggaagagtg ataggactct atggcaatgg
ggttgtgatc aagaatggaa gctatgttag 5100tgctataacc cagggaaaga
gggaggagga gactccggtt gaatgtttcg aaccctcgat 5160gctgaagaag
aagcagctaa ctgtcttgga tctgcatcca ggagccggaa aaaccaggag
5220agttcttcct gaaatagtcc gtgaagccat aaaaaagaga ctccggacag
tgatcttggc 5280accaactagg gttgtcgctg ctgagatgga ggaggccttg
agaggacttc cggtgcgtta 5340catgacaaca gcagtcaacg tcacccattc
tgggacagaa atcgttgatt tgatgtgcca 5400tgccactttc acttcacgct
tactacaacc catcagagtc cctaattaca atctctacat 5460catggatgaa
gcccacttca cagacccctc aagtatagct gcaagaggat atatatcaac
5520aagggttgaa atgggcgagg cggctgccat ttttatgact gccacaccac
caggaacccg 5580tgatgcgttt cctgactcta actcaccaat catggacaca
gaagtggaag tcccagagag 5640agcctggagc tcaggctttg attgggtgac
agaccattct gggaaaacag tttggttcgt 5700tccaagcgtg agaaacggaa
atgaaatcgc agcctgtctg acaaaggctg gaaagcgggt 5760catacagctc
agcaggaaga cttttgagac agaatttcag aaaacaaaaa atcaagagtg
5820ggactttgtc ataacaactg acatctcaga gatgggcgcc aacttcaagg
ctgaccgggt 5880catagactct aggagatgcc taaaaccagt catacttgat
ggtgagagag tcatcttggc 5940tgggcccatg cctgtcacgc atgctagtgc
tgctcagagg agaggacgta taggcaggaa 6000ccctaacaaa cctggagatg
agtacatgta tggaggtggg tgtgcagaga ctgatgaagg 6060ccatgcacac
tggcttgaag caagaatgct tcttgacaac atctacctcc aggatggcct
6120catagcctcg ctctatcggc ctgaggccga taaggtagcc gccattgagg
gagagtttaa 6180gctgaggaca gagcaaagga agaccttcgt ggaactcatg
aagagaggag accttcccgt 6240ctggctagcc tatcaggttg catctgccgg
aataacttac acagacagaa gatggtgctt 6300tgatggcaca accaacaaca
ccataatgga agacagcgta ccagcagagg tgtggacaaa 6360gtatggagag
aagagagtgc tcaaaccgag atggatggat gctagggtct gttcagacca
6420tgcggccctg aagtcgttca aagaattcgc cgctggaaaa agaggagcgg
ctttgggagt 6480aatggaggcc ctgggaacac tgccaggaca catgacagag
aggtttcagg aagccattga 6540caacctcgcc gtgctcatgc gagcagagac
tggaagcagg ccttataagg cagcggcagc 6600ccaactgccg gagaccctag
agaccattat gctcttaggt ttgctgggaa cagtttcact 6660ggggatcttc
ttcgtcttga tgcggaataa gggcatcggg aagatgggct ttggaatggt
6720aacccttggg gccagtgcat ggctcatgtg gctttcggaa attgaaccag
ccagaattgc 6780atgtgtcctc attgttgtgt ttttattact ggtggtgctc
atacccgagc cagagaagca 6840aagatctccc caagataacc agatggcaat
tatcatcatg gtggcagtgg gccttctagg 6900tttgataact gcaaacgaac
ttggatggct ggaaagaaca aaaaatgaca tagctcatct 6960aatgggaagg
agagaagaag gagcaaccat gggattctca atggacattg atctgcggcc
7020agcctccgcc tgggctatct atgccgcatt gacaactctc atcaccccag
ctgtccaaca 7080tgcggtaacc acttcataca acaactactc cttaatggcg
atggccacac aagctggagt 7140gctgtttggc atgggcaaag ggatgccatt
ttatgcatgg gaccttggag tcccgctgct 7200aatgatgggt tgctattcac
aattaacacc cctgactctg atagtagcta tcattctgct 7260tgtggcgcac
tacatgtact tgatcccagg cctacaagcg gcagcagcgc gtgctgccca
7320gaaaaggaca gcagctggca tcatgaagaa tcccgttgtg gatggaatag
tggtaactga 7380cattgacaca atgacaatag acccccaggt ggagaagaag
atgggacaag tgttactcat 7440agcagtagcc atctccagtg ctgtgctgct
gcggaccgcc tggggatggg gggaggctgg 7500agctctgatc acagcagcga
cctccacctt gtgggaaggc tctccaaaca aatactggaa 7560ctcctctaca
gccacctcac tgtgcaacat cttcagagga agctatctgg caggagcttc
7620ccttatctat acagtgacga gaaacgctgg cctggttaag agacgtggag
gtgggacggg 7680agagactctg ggagagaagt ggaaagctcg tctgaatcag
atgtcggccc tggagttcta 7740ctcttataaa aagtcaggta tcactgaagt
gtgtagagag gaggctcgcc gtgccctcaa 7800ggatggagtg gccacaggag
gacatgccgt atcccgggga agtgcaaagc tcagatggtt 7860ggtggagaga
ggatatctgc agccctatgg gaaggttgtt gacctcggat gtggcagagg
7920gggctggagc tattatgccg ccaccatccg caaagtgcag gaggtgagag
gatacacaaa 7980gggaggtccc ggtcatgaag aacccatgct ggtgcaaagc
tatgggtgga acatagttcg 8040tctcaagagt ggagtggacg tcttccacat
ggcggctgag ccgtgtgaca ctctgctgtg 8100tgacataggt gagtcatcat
ctagtcctga agtggaagag acacgaacac tcagagtgct 8160ctctatggtg
ggggactggc ttgaaaaaag accaggggcc
ttctgtataa aggtgctgtg 8220cccatacacc agcactatga tggaaaccat
ggagcgactg caacgtaggc atgggggagg 8280attagtcaga gtgccattgt
ctcgcaactc cacacatgag atgtactggg tctctggggc 8340aaagagcaac
atcataaaaa gtgtgtccac cacaagtcag ctcctcctgg gacgcatgga
8400tggccccagg aggccagtga aatatgagga ggatgtgaac ctcggctcgg
gtacacgagc 8460tgtggcaagc tgtgctgagg ctcctaacat gaaaatcatc
ggcaggcgca ttgagagaat 8520ccgcaatgaa catgcagaaa catggtttct
tgatgaaaac cacccataca ggacatgggc 8580ctaccatggg agctacgaag
cccccacgca aggatcagcg tcttccctcg tgaacggggt 8640tgttagactc
ctgtcaaagc cttgggacgt ggtgactgga gttacaggaa tagccatgac
8700tgacaccaca ccatacggcc aacaaagagt cttcaaagaa aaagtggaca
ccagggtgcc 8760agatccccaa gaaggcactc gccaggtaat gaacatagtc
tcttcctggc tgtggaagga 8820gctggggaaa cgcaagcggc cacgcgtctg
caccaaagaa gagtttatca acaaggtgcg 8880cagcaatgca gcactgggag
caatatttga agaggaaaaa gaatggaaga cggctgtgga 8940agctgtgaat
gatccaaggt tttgggccct agtggatagg gagagagaac accacctgag
9000aggagagtgt cacagctgtg tgtacaacat gatgggaaaa agagaaaaga
agcaaggaga 9060gttcgggaaa gcaaaaggta gccgcgccat ctggtacatg
tggttgggag ccagattctt 9120ggagtttgaa gcccttggat tcttgaacga
ggaccattgg atgggaagag aaaactcagg 9180aggtggagtc gaagggttag
gattgcaaag acttggatac attctagaag aaatgaatcg 9240ggcaccagga
ggaaagatgt acgcagatga cactgctggc tgggacaccc gcattagtaa
9300gtttgatctg gagaatgaag ctctgattac caaccaaatg gaggaagggc
acagaactct 9360ggcgttggcc gtgattaaat acacatacca aaacaaagtg
gtgaaggttc tcagaccagc 9420tgaaggagga aaaacagtta tggacatcat
ttcaagacaa gaccagagag ggagtggaca 9480agttgtcact tatgctctca
acacattcac caacttggtg gtgcagctta tccggaacat 9540ggaagctgag
gaagtgttag agatgcaaga cttatggttg ttgaggaagc cagagaaagt
9600gaccagatgg ttgcagagca atggatggga tagactcaaa cgaatggcgg
tcagtggaga 9660tgactgcgtt gtgaagccaa tcgatgatag gtttgcacat
gccctcaggt tcttgaatga 9720catgggaaaa gttaggaaag acacacagga
gtggaaaccc tcgactggat ggagcaattg 9780ggaagaagtc ccgttctgct
cccaccactt caacaagctg tacctcaagg atgggagatc 9840cattgtggtc
ccttgccgcc accaagatga actgattggc cgagctcgcg tctcaccagg
9900ggcaggatgg agcatccggg agactgcctg tcttgcaaaa tcatatgcgc
agatgtggca 9960gctcctttat ttccacagaa gagaccttcg actgatggct
aatgccattt gctcggctgt 10020gccagttgac tgggtaccaa ctgggagaac
cacctggtca atccatggaa agggagaatg 10080gatgaccact gaggacatgc
tcatggtgtg gaatagagtg tggattgagg agaacgacca 10140tatggaggac
aagactcctg taacaaaatg gacagacatt ccctatctag gaaaaaggga
10200ggacttatgg tgtggatccc ttatagggca cagaccccgc accacttggg
ctgaaaacat 10260caaagacaca gtcaacatgg tgcgcaggat cataggtgat
gaagaaaagt acatggacta 10320tctatccacc caagtccgct acttgggtga
ggaagggtcc acacccggag tgttgtaagc 10380accaatttta gtgttgtcag
gcctgctagt cagccacagt ttggggaaag ctgtgcagcc 10440tgtaaccccc
ccaggagaag ctgggaaacc aagctcatag tcaggccgag aacgccatgg
10500cacggaagaa gccatgctgc ctgtgagccc ctcagaggac actgagtcaa
aaaaccccac 10560gcgcttggaa gcgcaggatg ggaaaagaag gtggcgacct
tccccaccct tcaatctggg 10620gcctgaactg gagactagct gtgaatctcc
agcagaggga ctagtggtta gaggagaccc 10680cccggaaaac gcaaaacagc
atattgacgc tgggaaagac cagagactcc atgagtttcc 10740accacgctgg
ccgccaggca cagatcgccg aacagcggcg gccggtgtgg ggaaatccat 10800ggtttct
108071210794DNAZika virus 12agttgttgat ctgtgtgagt cagactgcga
cagttcgagt ctgaagcgag agctaacaac 60agtatcaaca ggtttaattt ggatttggaa
acgagagttt ctggtcatga aaaaccccaa 120agaagaaatc cggaggatcc
ggattgtcaa tatgctaaaa cgcggagtag cccgtgtaaa 180ccccttggga
ggtttgaaga ggttgccagc cggacttctg ctgggtcatg gacccatcag
240aatggttttg gcgatactag cctttttgag atttacagca atcaagccat
cactgggcct 300tatcaacaga tggggttccg tggggaaaaa agaggctatg
gaaataataa agaagttcaa 360gaaagatctt gctgccatgt tgagaataat
caatgctagg aaagagagga agagacgtgg 420cgcagacacc agcatcggaa
tcattggcct cctgctgact acagccatgg cagcagagat 480cactagacgc
gggagtgcat actacatgta cttggatagg agcgatgccg ggaaggccat
540ttcgtttgct accacattgg gagtgaacaa gtgccacgta cagatcatgg
acctcgggca 600catgtgtgac gccaccatga gttatgagtg ccctatgctg
gatgagggag tggaaccaga 660tgatgtcgat tgctggtgca acacgacatc
aacttgggtt gtgtacggaa cctgtcatca 720caaaaaaggt gaggcacggc
gatctagaag agccgtgacg ctcccttctc actctacaag 780gaagttgcaa
acgcggtcgc agacctggtt agaatcaaga gaatacacga agcacttgat
840caaggttgaa aactggatat tcaggaaccc cgggtttgcg ctagtggccg
ttgccattgc 900ctggcttttg ggaagctcga cgagccaaaa agtcatatac
ttggtcatga tactgctgat 960tgccccggca tacagtatca ggtgcattgg
agtcagcaat agagacttcg tggagggcat 1020gtcaggtggg acctgggttg
atgttgtctt ggaacatgga ggctgcgtta ccgtgatggc 1080acaggacaag
ccaacagtcg acatagagtt ggtcacgacg acggttagta acatggccga
1140ggtaagatcc tattgctacg aggcatcgat atcggacatg gcttcggaca
gtcgttgccc 1200aacacaaggt gaagcctacc ttgacaagca atcagacact
caatatgtct gcaaaagaac 1260attagtggac agaggttggg gaaacggttg
tggacttttt ggcaaaggga gcttggtgac 1320atgtgccaag tttacgtgtt
ctaagaagat gaccgggaag agcattcaac cggaaaatct 1380ggagtatcgg
ataatgctat cagtgcatgg ctcccagcat agcgggatga ttggatatga
1440aactgacgaa gatagagcga aagtcgaggt tacgcctaat tcaccaagag
cggaagcaac 1500cttgggaggc tttggaagct taggacttga ctgtgaacca
aggacaggcc ttgacttttc 1560agatctgtat tacctgacca tgaacaataa
gcattggttg gtgcacaaag agtggtttca 1620tgacatccca ttgccttggc
atgctggggc agacaccgga actccacact ggaacaacaa 1680agaggcattg
gtagaattca aggatgccca cgccaagagg caaaccgtcg tcgttctggg
1740gagccaggaa ggagccgttc acacggctct cgctggagct ctagaggctg
agatggatgg 1800tgcaaaggga aggctgttct ctggccattt gaaatgccgc
ctaaaaatgg acaagcttag 1860attgaagggc gtgtcatatt ccttgtgcac
tgcggcattc acattcacca aggtcccagc 1920tgaaacactg catggaacag
tcacagtgga ggtgcagtat gcagggacag atggaccctg 1980caagatccca
gtccagatgg cggtggacat gcagaccctg accccagttg gaaggctgat
2040aaccgccaac cccgtgatta ctgaaagcac tgagaactca aagatgatgt
tggagcttga 2100cccaccattt ggggattctt acattgtcat aggagttggg
gacaagaaaa tcacccacca 2160ctggcatagg agtggtagca ccatcggaaa
ggcatttgag gccactgtga gaggcgccaa 2220gagaatggca gtcctggggg
atacagcctg ggacttcgga tcagtcgggg gtgtgttcaa 2280ctcactgggt
aagggcattc accagatttt tggagcagcc ttcaaatcac tgtttggagg
2340aatgtcctgg ttctcacaga tcctcatagg cacgctgcta gtgtggttag
gtttgaacac 2400aaagaatgga tctatctccc tcacatgctt ggccctgggg
ggagtgatga tcttcctctc 2460cacggctgtt tctgctgacg tggggtgctc
agtggacttc tcaaaaaagg aaacgagatg 2520tggcacgggg gtattcatct
ataatgatgt tgaagcctgg agggaccggt acaagtacca 2580tcctgactcc
ccccgcagat tggcagcagc agtcaagcag gcctgggaag aggggatctg
2640tgggatctca tccgtttcaa gaatggaaaa catcatgtgg aaatcagtag
aaggggagct 2700caatgctatc ctagaggaga atggagttca actgacagtt
gttgtgggat ctgtaaaaaa 2760ccccatgtgg agaggtccac aaagattgcc
agtgcctgtg aatgagctgc cccatggctg 2820gaaagcctgg gggaaatcgt
attttgttag ggcggcaaag accaacaaca gttttgttgt 2880cgacggtgac
acactgaagg aatgtccgct tgagcacaga gcatggaata gttttcttgt
2940ggaggatcac gggtttggag tcttccacac cagtgtctgg cttaaggtca
gagaagatta 3000ctcattagaa tgtgacccag ccgtcatagg aacagctgtt
aagggaaggg aggccgcgca 3060cagtgatctg ggctattgga ttgaaagtga
aaagaatgac acatggaggc tgaagagggc 3120ccacctgatt gagatgaaaa
catgtgaatg gccaaagtct cacacattgt ggacagatgg 3180agtagaagaa
agtgatctta tcatacccaa gtctttagct ggtccactca gccaccacaa
3240caccagagag ggttacagaa cccaagtgaa agggccatgg cacagtgaag
agcttgaaat 3300ccggtttgag gaatgtccag gcaccaaggt ttacgtggag
gagacatgcg gaactagagg 3360accatctctg agatcaacta ctgcaagtgg
aagggtcatt gaggaatggt gctgtaggga 3420atgcacaatg cccccactat
cgtttcgagc aaaagacggc tgctggtatg gaatggagat 3480aaggcccagg
aaagaaccag agagcaactt agtgaggtca atggtgacag cggggtcaac
3540cgatcatatg gaccacttct ctcttggagt gcttgtgatt ctactcatgg
tgcaggaggg 3600gttgaagaag agaatgacca caaagatcat catgagcaca
tcaatggcag tgctggtagt 3660catgatcttg ggaggatttt caatgagtga
cctggccaag cttgtgatcc tgatgggtgc 3720tactttcgca gaaatgaaca
ctggaggaga tgtagctcac ttggcattgg tagcggcatt 3780taaagtcaga
ccagccttgc tggtctcctt cattttcaga gccaattgga caccccgtga
3840gagcatgctg ctagccctgg cttcgtgtct tctgcaaact gcgatctctg
ctcttgaagg 3900tgacttgatg gtcctcatta atggatttgc tttggcctgg
ttggcaattc gagcaatggc 3960cgtgccacgc actgacaaca tcgctctacc
aatcttggct gctctaacac cactagctcg 4020aggcacactg ctcgtggcat
ggagagcggg cctggctact tgtggaggga tcatgctcct 4080ctccctgaaa
gggaaaggta gtgtgaagaa gaacctgcca tttgtcatgg ccctgggatt
4140gacagctgtg agggtagtag accctattaa tgtggtagga ctactgttac
tcacaaggag 4200tgggaagcgg agctggcccc ctagtgaagt tctcacagcc
gttggcctga tatgtgcact 4260ggccggaggg tttgccaagg cagacattga
gatggctgga cccatggctg cagtaggctt 4320gctaattgtc agctatgtgg
tctcgggaaa gagtgtggac atgtacattg aaagagcagg 4380tgacatcaca
tgggaaaagg acgcggaagt cactggaaac agtcctcggc ttgacgtggc
4440actggatgag agtggtgact tctccttggt agaggaagat ggtccaccca
tgagagagat 4500catactcaag gtggtcctga tggccatctg tggcatgaac
ccaatagcta taccttttgc 4560tgcaggagcg tggtatgtgt atgtgaagac
tgggaaaagg agtggcgccc tctgggacgt 4620gcctgctccc aaagaagtga
agaaaggaga gaccacagat ggagtgtaca gagtgatgac 4680tcgcagactg
ctaggttcaa cacaggttgg agtgggagtc atgcaagagg gagtcttcca
4740caccatgtgg cacgttacaa aaggagccgc actgaggagc ggtgagggaa
gacttgatcc 4800atactggggg gatgtcaagc aggacttggt gtcatactgt
gggccttgga agttggatgc 4860agcttgggat ggactcagcg aggtacagct
tttggccgta cctcccggag agagggccag 4920aaacattcag accctgcctg
gaatattcaa gacaaaggac ggggacatcg gagcagttgc 4980tctggactac
cctgcaggga cctcaggatc tccgatccta gacaaatgtg gaagagtgat
5040aggactctat ggcaatgggg ttgtgatcaa gaatggaagc tatgttagtg
ctataaccca 5100gggaaagagg gaggaggaga ctccggttga atgtttcgaa
ccctcgatgc tgaagaagaa 5160gcagctaact gtcttggatc tgcatccagg
agccggaaaa accaggagag ttcttcctga 5220aatagtccgt gaagccataa
aaaagagact ccggacagtg atcttggcac caactagggt 5280tgtcgctgct
gagatggagg aggccttgag aggacttccg gtgcgttaca tgacaacagc
5340agtcaacgtc acccattctg ggacagaaat cgttgatttg atgtgccatg
ccactttcac 5400ttcacgctta ctacaaccca tcagagtccc taattacaat
ctcaacatca tggatgaagc 5460ccacttcaca gacccctcaa gtatagctgc
aagaggatac atatcaacaa gggttgaaat 5520gggcgaggcg gctgccattt
ttatgactgc cacaccacca ggaacccgtg atgcgtttcc 5580tgactctaac
tcaccaatca tggacacaga agtggaagtc ccagagagag cctggagctc
5640aggctttgat tgggtgacag accattctgg gaaaacagtt tggttcgttc
caagcgtgag 5700aaacggaaat gaaatcgcag cctgtctgac aaaggctgga
aagcgggtca tacagctcag 5760caggaagact tttgagacag aatttcagaa
aacaaaaaat caagagtggg actttgtcat 5820aacaactgac atctcagaga
tgggcgccaa cttcaaggct gaccgggtca tagactctag 5880gagatgccta
aaaccagtca tacttgatgg tgagagagtc atcttggctg ggcccatgcc
5940tgtcacgcat gctagtgctg ctcagaggag aggacgtata ggcaggaacc
ctaacaaacc 6000tggagatgag tacatgtatg gaggtgggtg tgcagagact
gatgaaggcc atgcacactg 6060gcttgaagca agaatgcttc ttgacaacat
ctacctccag gatggcctca tagcctcgct 6120ctatcggcct gaggccgata
aggtagccgc cattgaggga gagtttaagc tgaggacaga 6180gcaaaggaag
accttcgtgg aactcatgaa gagaggagac cttcccgtct ggctagccta
6240tcaggttgca tctgccggaa taacttacac agacagaaga tggtgctttg
atggcacaac 6300caacaacacc ataatggaag acagtgtacc agcagaggtt
tggacaaagt atggagagaa 6360gagagtgctc aaaccgagat ggatggatgc
tagggtctgt tcagaccatg cggccctgaa 6420gtcgttcaaa gaattcgccg
ctggaaaaag aggagcggct ttgggagtaa tggaggccct 6480gggaacactg
ccaggacaca tgacagagag gtttcaggaa gccattgaca acctcgccgt
6540gctcatgcga gcagagactg gaagcaggcc ttataaggca gcggcagccc
aactgccgga 6600gaccctagag accattatgc tcttaggttt gctgggaaca
gtttcactgg ggatcttctt 6660cgtcttgatg cggaataagg gcatcgggaa
gatgggcttt ggaatggtaa cccttggggc 6720cagtgcatgg ctcatgtggc
tttcggaaat tgaaccagcc agaattgcat gtgtcctcat 6780tgttgtgttt
ttattactgg tggtgctcat acccgagcca gagaagcaaa gatctcccca
6840agataaccag atggcaatta tcatcatggt ggcagtgggc cttctaggtt
tgataactgc 6900aaacgaactt ggatggctgg aaagaacaaa aaatgacata
gctcatctaa tgggaaggag 6960agaagaagga gcaaccatgg gattctcaat
ggacattgat ctgcggccag cctccgcctg 7020ggctatctat gccgcattga
caactctcat caccccagct gtccaacatg cggtaaccac 7080ttcatacaac
aactactcct taatggcgat ggccacacaa gctggagtgc tgtttggcat
7140gggcaaaggg atgccattta tgcatgggga ccttggagtc ccgctgctaa
tgatgggttg 7200ctattcacaa ttaacacccc tgactctgat agtagctatc
attctgcttg tggcgcacta 7260catgtacttg atcccaggcc tacaagcggc
agcagcgcgt gctgcccaga aaaggacagc 7320agctggcatc atgaagaatc
ccgttgtgga tggaatagtg gtaactgaca ttgacacaat 7380gacaatagac
ccccaggtgg agaagaagat gggacaagtg ttactcatag cagtagccat
7440ctccagtgct gtgctgctgc ggaccgcctg gggatggggg gaggctggag
ctctgatcac 7500agcagcgacc tccaccttgt gggaaggctc tccaaacaaa
tactggaact cctctacagc 7560cacctcactg tgcaacatct tcagaggaag
ctatctggca ggagcttccc ttatctatac 7620agtgacgaga aacgctggcc
tggttaagag acgtggaggt gggacgggag agactctggg 7680agagaagtgg
aaagctcgtc tgaatcagat gtcggccctg gagttctact cttataaaaa
7740gtcaggtatc actgaagtgt gtagagagga ggctcgccgt gccctcaagg
atggagtggc 7800cacaggagga catgccgtat cccggggaag tgcaaagatc
agatggttgg aggagagagg 7860atatctgcag ccctatggga aggttgttga
cctcggatgt ggcagagggg gctggagcta 7920ttatgccgcc accatccgca
aagtgcagga ggtgagagga tacacaaagg gaggtcccgg 7980tcatgaagaa
cccatgctgg tgcaaagcta tgggtggaac atagttcgtc tcaagagtgg
8040agtggacgtc ttccacatgg cggctgagcc gtgtgacact ctgctgtgtg
acataggtga 8100gtcatcatct agtcctgaag tggaagagac acgaacactc
agagtgctct ctatggtggg 8160ggactggctt gaaaaaagac caggggcctt
ctgtataaag gtgctgtgcc catacaccag 8220cactatgatg gaaaccatgg
agcgactgca acgtaggcat gggggaggat tagtcagagt 8280gccattgtgt
cgcaactcca cacatgagat gtactgggtc tctggggcaa agagcaacat
8340cataaaaagt gtgtccacca caagtcagct cctcctggga cgcatggatg
gccccaggag 8400gccagtgaaa tatgaggagg atgtgaacct cggctcgggt
acacgagctg tggcaagctg 8460tgctgaggct cctaacatga aaatcatcgg
caggcgcatt gagagaatcc gcaatgaaca 8520tgcagaaaca tggtttcttg
atgaaaacca cccatacagg acatgggcct accatgggag 8580ctacgaagcc
cccacgcaag gatcagcgtc ttccctcgtg aacggggttg ttagactcct
8640gtcaaagcct tgggacgtgg tgactggagt tacaggaata gccatgactg
acaccacacc 8700atacggccaa caaagagtct tcaaagaaaa agtggacacc
agggtgccag atccccaaga 8760aggcactcgc caggtaatga acatagtctc
ttcctggctg tggaaggagc tggggaaacg 8820caagcggcca cgcgtctgca
ccaaagaaga gtttatcaac aaggtgcgca gcaatgcagc 8880actgggagca
atatttgaag aggaaaaaga atggaagacg gctgtggaag ctgtgaatga
8940tccaaggttt tgggccctag tggataggga gagagaacac cacctgagag
gagagtgtca 9000cagctgtgtg tacaacatga tgggaaaaag agaaaagaag
caaggagagt tcgggaaagc 9060aaaaggtagc cgcgccatct ggtacatgtg
gttgggagcc agattcttgg agtttgaagc 9120ccttggattc ttgaacgagg
accattggat gggaagagaa aactcaggag gtggagtcga 9180agggttagga
ttgcaaagac ttggatacat tctagaagaa atgaatcggg caccaggagg
9240aaagatgtac gcagatgaca ctgctggctg ggacacccgc attagtaagt
ttgatctgga 9300gaatgaagct ctgattacca accaaatgga ggaagggcac
agaactctgg cgttggccgt 9360gattaaatac acataccaaa acaaagtggt
gaaggttctc agaccagctg aaggaggaaa 9420aacagttatg gacatcattt
caagacaaga ccagagaggg agtggacaag ttgtcactta 9480tgctctcaac
acattcacca acttggtggt gcagcttatc cggaacatgg aagctgagga
9540agtgttagag atgcaagact tatggttgtt gaggaagcca gagaaagtga
ccagatggtt 9600gcagagcaat ggatgggata gactcaaacg aatggcggtc
agtggagatg actgcgttgt 9660gaagccaatc gatgataggt ttgcacatgc
cctcaggttc ttgaatgaca tgggaaaagt 9720taggaaagac acacaggagt
ggaaaccctc gactggatgg agcaattggg aagaagtccc 9780gttctgctcc
caccacttca acaagctgta cctcaaggat gggagatcca ttgtggtccc
9840ttgccgccac caagatgaac tgattggccg agctcgcgtc tcaccagggg
caggatggag 9900catccgggag actgcctgtc ttgcaaaatc atatgcgcag
atgtggcagc tcctttattt 9960ccacagaaga gaccttcgac tgatggctaa
tgccatttgc tcggctgtgc cagttgactg 10020ggtaccaact gggagaacca
cctggtcaat ccatggaaag ggagaatgga tgaccactga 10080ggacatgctc
atggtgtgga atagagtgtg gattgaggag aacgaccata tggaggacaa
10140gactcctgta acaaaatgga cagacattcc ctatctagga aaaagggagg
acttatggtg 10200tggatccctt atagggcaca gaccccgcac cacttgggct
gaaaacatca aagacacagt 10260caacatggtg cgcaggatca taggtgatga
agaaaagtac atggactatc tatccaccca 10320agtccgctac ttgggtgagg
aagggtccac acccggagtg ttgtaagcac caattttagt 10380gttgtcaggc
ctgctagtca gccacagttt ggggaaagct gtgcagcctg taaccccccc
10440aggagaagct gggaaaccaa gctcatagtc aggccgagaa cgccatggca
cggaagaagc 10500catgctgcct gtgagcccct cagaggacac tgagtcaaaa
aaccccacgc gcttggaagc 10560gcaggatggg aaaagaaggt ggcgaccttc
cccacccttc aatctggggc ctgaactgga 10620gactagctgt gaatctccag
cagagggact agtggttaga ggagaccccc cggaaaacgc 10680aaaacagcat
attgacgtgg gaaagaccag agactccatg agtttccacc acgctggccg
10740ccaggcacag atcgccgaac ttcggcggcc ggtgtgggga aatccatggt ttct
107941310617DNAZika virus 13agtatcaaca ggttttattt tggatttgga
aacgagagtt tctggtcatg aaaaacccaa 60aaaagaaatc cggaggattc cggattgtca
atatgctaaa acgcggagta gcccgtgtga 120gcccctttgg gggcttgaag
aggctgccag ccggacttct gctgggtcat gggcccatca 180ggatggtctt
ggcgattcta gcctttttga gattcacggc aatcaagcca tcactgggtc
240tcatcaatag atggggttca gtggggaaaa aagaggctat ggaaataata
aagaagttca 300agaaagatct ggctgccatg ctgagaataa tcaatgctag
gaaggagaag aagagacgag 360gcgcagatac tagtgtcgga attgttggcc
tcctgctgac cacagctatg gcagcggagg 420tcactagacg tgggagtgca
tactatatgt acttggacag aaacgacgct ggggaggcca 480tatcttttcc
aaccacattg gggatgaata agtgttatat acagatcatg gatcttggac
540acatgtgtga tgccaccatg agctatgaat gccctatgct ggatgagggg
gtggaaccag 600atgacgtcga ttgttggtgc aacacgacgt caacttgggt
tgtgtacgga acctgccatc 660acaaaaaagg tgaagcacgg agatctagaa
gagctgtgac gctcccctcc cattccacta 720ggaagctgca aacgcggtcg
caaacctggt tggaatcaag agaatacaca aagcacttga 780ttagagtcga
aaattggata ttcaggaacc ctggcttcgc gttagcagca gctgccatcg
840cttggctttt gggaagctca acgagccaaa aagtcatata cttggtcatg
atactgctga 900ttgccccggc atacagcatc aggtgcatag gagtcagcaa
tagggacttt gtggaaggta 960tgtcaggtgg gacttgggtt gatgttgtct
tggaacatgg aggttgtgtc accgtaatgg 1020cacaggacaa accgactgtc
gacatagagc tggttacaac aacagtcagc aacatggcgg 1080aggtaagatc
ctactgctat gaggcatcaa tatcggacat ggcttcggac agccgctgcc
1140caacacaagg tgaagcctac cttgacaagc aatcagacac tcaatatgtc
tgcaaaagaa 1200cgttagtgga cagaggctgg ggaaatggat gtggactttt
tggcaaaggg agcctggtga 1260catgcgctaa gtttgcatgc tccaagaaaa
tgaccgggaa gagcatccag ccagagaatc 1320tggagtaccg gataatgctg
tcagttcatg gctcccagca cagtgggatg atcgttaatg 1380acacaggaca
tgaaactgat gagaatagag cgaaggttga gataacgccc aattcaccaa
1440gagccgaagc caccctgggg ggttttggaa gcctaggact tgattgtgaa
ccgaggacag 1500gccttgactt ttcagatttg tattacttga ctatgaataa
caagcactgg ttggttcaca
1560aggagtggtt ccacgacatt ccattacctt ggcacgctgg ggcagacacc
ggaactccac 1620actggaacaa caaagaagca ctggtagagt tcaaggacgc
acatgccaaa aggcaaactg 1680tcgtggttct agggagtcaa gaaggagcag
ttcacacggc ccttgctgga gctctggagg 1740ctgagatgga tggtgcaaag
ggaaggctgt cctctggcca cttgaaatgt cgcctgaaaa 1800tggataaact
tagattgaag ggcgtgtcat actccttgtg taccgcagcg ttcacattca
1860ccaagatccc ggctgaaaca ctgcacggga cagtcacagt ggaggtacag
tacgcaggga 1920cagatggacc ttgcaaggtt ccagctcaga tggcggtgga
catgcaaact ctgaccccag 1980ttgggaggtt gataaccgct aaccccgtaa
tcactgaaag cactgagaac tctaagatga 2040tgctggaact tgatccacca
tttggggact cttacattgt cataggagtc ggggagaaga 2100agatcaccca
ccactggcac aggagtggca gcaccattgg aaaagcattt gaagccactg
2160tgagaggtgc caagagaatg gcagtcttgg gagacacagc ctgggacttt
ggatcagttg 2220gaggcgctct caactcattg ggcaagggca tccatcaaat
ttttggagca gctttcaaat 2280cattgtttgg aggaatgtcc tggttctcac
aaattctcat tggaacgttg ctgatgtggt 2340tgggtctgaa cacaaagaat
ggatctattt cccttatgtg cttggcctta gggggagtgt 2400tgatcttctt
atccacagct gtctctgctg atgtggggtg ctcggtggac ttctcaaaga
2460aggagacgag atgcggtaca ggggtgttcg tctataacga cgttgaagcc
tggagggaca 2520ggtacaagta ccatcctgac tccccccgta gattggcagc
agcagtcaag caagcctggg 2580aagatggtat ctgtgggatc tcctctgttt
caagaatgga aaacatcatg tggagatcag 2640tagaagggga gctcaacgca
atcctggaag agaatggagt tcaactgacg gtcgttgtgg 2700gatctgtaaa
aaaccccatg tggagaggtc cacagagatt gcccgtgcct gtgaacgagc
2760tgccccacgg ctggaaggct tgggggaaat cgtacttcgt cagagcagca
aagacaaata 2820acagctttgt cgtggatggt gacacactga aggaatgccc
actcaaacat agagcatgga 2880acagctttct tgtggaggat catgggttcg
gggtatttca cactagtgtc tggctcaagg 2940ttagagaaga ttattcatta
gagtgtgatc cagccgttat tggaacagct gttaagggaa 3000aggaggctgt
acacagtgat ctaggctact ggattgagag tgagaagaat gacacatgga
3060ggctgaagag ggcccatctg atcgagatga aaacatgtga atggccaaag
tcccacacat 3120tgtggacaga tggaatagaa gagagtgatc tgatcatacc
caagtcttta gctgggccac 3180tcagccatca caataccaga gagggctaca
ggacccaaat gaaagggcca tggcacagtg 3240aagagcttga aattcggttt
gaggaatgcc caggcactaa ggtccacgtg gaggaaacat 3300gtggaacaag
aggaccatct ctgagatcaa ccactgcaag cggaagggtg atcgaggaat
3360ggtgctgcag ggagtgcaca atgcccccac tgtcgttccg ggctaaagat
ggctgttggt 3420atggaatgga gataaggccc aggaaagaac cagaaagtaa
cttagtaagg tcaatggtga 3480ctgcaggatc aactgatcac atggatcact
tctcccttgg agtgcttgtg attctgctca 3540tggtgcagga agggctgaag
aagagaatga ccacaaagat catcataagc acatcgatgg 3600cagtgctggt
agctatgatc ctgggaggat tttcaatgag tgacctggct aagcttgcaa
3660ttttgatggg tgccaccttc gcggaaatga acactggagg agatgtagct
catctggcgc 3720tgatagcggc attcaaagtc agaccagcgt tgctggtatc
tttcatcttc agagctaatt 3780ggacaccccg tgaaagcatg ctgctggcct
tggcctcgtg tcttttgcaa actgcgatct 3840ccgccttgga aggcgacctg
atggttctca tcaatggttt tgctttggcc tggttggcaa 3900tacgagcgat
ggttgttcca cgcactgata acatcacctt ggcaatcctg gctgctctga
3960caccactggc ccggggcaca ctgcttgtgg cgtggagagc aggccttgct
acttgcgggg 4020ggtttatgct cctctctctg aagggaaaag gcagtgtgaa
gaagaactta ccatttgtca 4080tggccctggg actaaccgct gtgaggctgg
tcgaccccat caacgtggtg ggactgctgt 4140tgctcacaag gagtgggaag
cggagctggc cccctagcga agtactcaca gctgttggcc 4200tgatatgcgc
attggctgga gggttcgcca aggcagatat agagatggct gggcccatgg
4260ccgcggtcgg tctgctaatt gtcagttacg tggtctcagg aaagagtgtg
gacatgtaca 4320ttgaaagagc aggtgacatc acatgggaaa aagatgcgga
agtcactgga aacagtcccc 4380ggctcgatgt ggcgctagat gagagtggtg
atttctccct ggtggaggat gacggtcccc 4440ccatgagaga gatcatactc
aaggtggtcc tgatgaccat ctgtggcatg aacccaatag 4500ccataccctt
tgcagctgga gcgtggtacg tatacgtgaa gactggaaaa aggagtggtg
4560ctctatggga tgtgcctgct cccaaggaag taaaaaaggg ggagaccaca
gatggagtgt 4620acagagtaat gactcgtaga ctgctaggtt caacacaagt
tggagtggga gttatgcaag 4680agggggtctt tcacactatg tggcacgtca
caaaaggatc cgcgctgaga agcggtgaag 4740ggagacttga tccatactgg
ggagatgtca agcaggatct ggtgtcatac tgtggtccat 4800ggaagctaga
tgccgcctgg gacgggcaca gcgaggtgca gctcttggcc gtgccccccg
4860gagagagagc gaggaacatc cagactctgc ccggaatatt taagacaaag
gatggggaca 4920ttggagcggt tgcgctggat tacccagcag gaacttcagg
atctccaatc ctagacaagt 4980gtgggagagt gataggactt tatggcaatg
gggtcgtgat caaaaatggg agttatgtta 5040gtgccatcac ccaagggagg
agggaggaag agactcctgt tgagtgcttc gagccttcga 5100tgctgaagaa
gaagcagcta actgtcttag acttgcatcc tggagctggg aaaaccagga
5160gagttcttcc tgaaatagtc cgtgaagcca taaaaacaag actccgtact
gtgatcttag 5220ctccaaccag ggttgtcgct gctgaaatgg aggaagccct
tagagggctt ccagtgcgtt 5280atatgacaac agcagtcaat gtcacccact
ctggaacaga aatcgtcgac ttaatgtgcc 5340atgccacctt cacttcacgt
ctactacagc caatcagagt ccccaactat aatctgtata 5400ttatggatga
ggcccacttc acagatccct caagtatagc agcaagagga tacatttcaa
5460caagggttga gatgggcgag gcggctgcca tcttcatgac cgccacgcca
ccaggaaccc 5520gtgacgcatt tccggactcc aactcaccaa ttatggacac
cgaagtggaa gtcccagaga 5580gagcctggag ctcaggcttt gattgggtga
cggatcattc tggaaaaaca gtttggtttg 5640ttccaagcgt gaggaacggc
aatgagatcg cagcttgtct gacaaaggct ggaaaacggg 5700tcatacagct
cagcagaaag acttttgaga cagagttcca gaaaacaaaa catcaagagt
5760gggactttgt cgtgacaact gacatttcag agatgggcgc caactttaaa
gctgaccgtg 5820tcatagattc caggagatgc ctaaagccgg tcatacttga
tggcgagaga gtcattctgg 5880ctggacccat gcctgtcaca catgccagcg
ctgcccagag gagggggcgc ataggcagga 5940atcccaacaa acctggagat
gagtatctgt atggaggtgg gtgcgcagag actgacgaag 6000accatgcaca
ctggcttgaa gcaagaatgc tccttgacaa tatttacctc caagatggcc
6060tcatagcctc gctctatcga cctgaggccg acaaagtagc agccattgag
ggagagttca 6120agcttaggac ggagcaaagg aagacctttg tggaactcat
gaaaagagga gatcttcctg 6180tttggctggc ctatcaggtt gcatctgccg
gaataaccta cacagataga agatggtgct 6240ttgatggcac gaccaacaac
accataatgg aagacagtgt gccggcagag gtgtggacca 6300gacacggaga
gaaaagagtg ctcaaaccga ggtggatgga cgccagagtt tgttcagatc
6360atgcggccct gaagtcattc aaggagtttg ccgctgggaa aagaggagcg
gcttttggag 6420tgatggaagc cctgggaaca ctgccaggac acatgacaga
gagattccag gaagccattg 6480acaacctcgc tgtgctcatg cgggcagaga
ctggaagcag gccttacaaa gccgcggcgg 6540cccaattgcc ggagacccta
gagaccatta tgcttttggg gttgctggga acagtctcgc 6600tgggaatctt
tttcgtcttg atgaggaaca agggcatagg gaagatgggc tttggaatgg
6660tgactcttgg ggccagcgca tggctcatgt ggctctcgga aattgagcca
gccagaattg 6720catgtgtcct cattgttgtg ttcctattgc tggtggtgct
catacctgag ccagaaaagc 6780aaagatctcc ccaggacaac caaatggcaa
tcatcatcat ggtagcagta ggtcttctgg 6840gcttgattac cgccaatgaa
ctcggatggt tggagagaac aaagagtgac ctaagccatc 6900taatgggaag
gagagaggag ggggcaacca taggattctc aatggacatt gacctgcggc
6960cagcctcagc ttgggccatc tatgctgcct tgacaacttt cattacccca
gccgtccaac 7020atgcagtgac cacttcatac aacaactact ccttaatggc
gatggccacg caagctggag 7080tgttgtttgg tatgggcaaa gggatgccat
tctacgcatg ggactttgga gtcccgctgc 7140taatgatagg ttgctactca
caattaacac ccctgaccct aatagtggcc atcattttgc 7200tcgtggcgca
ctacatgtac ttgatcccag ggctgcaggc agcagctgcg cgtgctgccc
7260agaagagaac ggcagctggc atcatgaaga accctgttgt ggatggaata
gtggtgactg 7320acattgacac aatgacaatt gacccccaag tggagaaaaa
gatgggacag gtgctactca 7380tagcagtagc cgtctccagc gccatactgt
cgcggaccgc ctgggggtgg ggggaggctg 7440gggccctgat cacagcggca
acttccactt tgtgggaagg ctctccgaac aagtactgga 7500actcctctac
agccacttca ctgtgtaaca tttttagggg aagttacttg gctggagctt
7560ctctaatcta cacagtaaca agaaacgctg gcttggtcaa gagacgtggg
ggtggaacag 7620gagagaccct gggagagaaa tggaaggccc gcttgaacca
gatgtcggcc ctggagttct 7680actcctacaa aaagtcaggc atcaccgagg
tgtgcagaga agaggcccgc cgcgccctca 7740aggacggtgt ggcaacggga
ggccatgctg tgtcccgagg aagtgcaaag ctgagatggt 7800tggtggagcg
gggatacctg cagccctatg gaaaggtcat tgatcttgga tgtggcagag
7860ggggctggag ttactacgcc gccaccatcc gcaaagttca agaagtgaaa
ggatacacaa 7920aaggaggccc tggtcatgaa gaacccatgt tggtgcaaag
ctatgggtgg aacatagtcc 7980gtcttaagag tggggtggac gtctttcata
tggcggctga gccgtgtgac acgttgctgt 8040gtgacatagg tgagtcatca
tctagtcctg aagtggaaga agcacggacg ctcagagtcc 8100tctccatggt
gggggattgg cttgaaaaaa gaccaggagc cttttgtata aaagtgttgt
8160gcccatacac cagcactatg atggaaaccc tggagcgact gcagcgtagg
tatgggggag 8220gactggtcag agtgccactc tcccgcaact ctacacatga
gatgtactgg gtctctggag 8280cgaaaagcaa caccataaaa agtgtgtcca
ccacgagcca gctcctcttg gggcgcatgg 8340acgggcccag gaggccagtg
aaatatgagg aggatgtgaa tctcggctct ggcacgcggg 8400ctgtggtaag
ctgcgctgaa gctcccaaca tgaagatcat tggtaaccgc attgaaagga
8460tccgcagtga gcacgcggaa acgtggttct ttgacgagaa ccacccatat
aggacatggg 8520cttaccatgg aagctatgag gcccccacac aagggtcagc
gtcctctcta ataaacgggg 8580ttgtcaggct cctgtcaaaa ccctgggatg
tggtgactgg agtcacagga atagccatga 8640ccgacaccac accgtatggt
cagcaaagag ttttcaagga aaaagtggac actagggtgc 8700cagaccccca
agaaggcact cgtcaggtta tgagcatggt ctcttcctgg ttgtggaaag
8760agctaggcaa acacaaacgg ccacgagtct gtaccaaaga agagttcatc
aacaaggttc 8820gtagcaatgc agcattaggg gcaatatttg aagaggaaaa
agagtggaag actgcagtgg 8880aagctgtgaa cgatccaagg ttctgggctc
tagtggacaa ggaaagagag caccacctga 8940gaggagagtg ccagagttgt
gtgtacaaca tgatgggaaa aagagaaaag aaacaagggg 9000aatttggaaa
ggccaagggc agccgcgcca tctggtatat gtggctaggg gctagatttc
9060tagagttcga agcccttgga ttcttgaacg aggatcactg gatggggaga
gagaactcag 9120gaggtggtgt tgaagggctg ggattacaaa gactcggata
tgtcctagaa gagatgagtc 9180gcataccagg aggaaggatg tatgcagatg
acactgctgg ctgggacacc cgcatcagca 9240ggtttgatct ggagaatgaa
gctctaatca ccaaccaaat ggagaaaggg cacagggcct 9300tggcattggc
cataatcaag tacacatacc aaaacaaagt ggtaaaggtc cttagaccag
9360ctgaaaaagg gaagacagtt atggacatta tttcgagaca agaccaaagg
gggagcggac 9420aagttgtcac ttacgctctt aacacattta ccaacctagt
ggtgcaactc attcggaata 9480tggaggctga ggaagttcta gagatgcaag
acttgtggct gctgcggagg tcagagaaag 9540tgaccaactg gttgcagagc
aacggatggg ataggctcaa acgaatggca gtcagtggag 9600atgattgcgt
tgtgaagcca attgatgata ggtttgcaca tgccctcagg ttcttgaatg
9660atatgggaaa agttaggaag gacacacaag agtggaaacc ctcaactgga
tgggacaact 9720gggaagaagt tccgttttgc tcccaccact tcaacaagct
ccatctcaag gacgggaggt 9780ccattgtggt tccctgccgc caccaagatg
aactgattgg ccgggcccgc gtctctccag 9840gggcgggatg gagcatccgg
gagactgctt gcctagcaaa atcatatgcg caaatgtggc 9900agctccttta
tttccacaga agggacctcc gactgatggc caatgccatt tgttcatctg
9960tgccagttga ctgggttcca actgggagaa ctacctggtc aatccatgga
aagggagaat 10020ggatgaccac tgaagacatg cttgtggtgt ggaacagagt
gtggattgag gagaacgacc 10080acatggaaga caagacccca gttacgaaat
ggacagacat tccctatttg ggaaaaaggg 10140aagacttgtg gtgtggatct
ctcatagggc acagaccgcg caccacctgg gctgagaaca 10200ttaaaaacac
agtcaacatg gtgcgcagga tcataggtga tgaagaaaag tacatggact
10260acctatccac ccaagttcgc tacttgggtg aagaagggtc tacacctgga
gtgctgtaag 10320caccaatctt agtgttgtca ggcctgctag tcagccacag
cttggggaaa gctgtgcagc 10380ctgtgacccc cccaggagaa gctgggaaac
caagcctata gtcaggccga gaacgccatg 10440gcacggaaga agccatgctg
cctgtgagcc cctcagagga cactgagtca aaaaacccca 10500cgcgcttgga
ggcgcaggat gggaaaagaa ggtggcgacc ttccccaccc ttcaatctgg
10560ggcctgaact ggagatcagc tgtggatctc cagaagaggg actagtggtt agaggag
1061714498PRTZika virus 14Ile Arg Cys Ile Gly Val Ser Asn Arg Asp
Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu
Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr
Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu
Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser
Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys
Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg
Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu
Val Thr Cys Ala Lys Phe Thr Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Glu Asn Arg
Ala Lys145 150 155 160Val Glu Val Thr Pro Asn Ser Pro Arg Ala Glu
Ala Thr Leu Gly Gly 165 170 175Phe Gly Ser Leu Gly Leu Asp Cys Glu
Pro Arg Thr Gly Leu Asp Phe 180 185 190Ser Asp Leu Tyr Tyr Leu Thr
Met Asn Asn Lys His Trp Leu Val His 195 200 205Lys Glu Trp Phe His
Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp 210 215 220Thr Gly Thr
Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe Lys225 230 235
240Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser Gln Glu
245 250 255Gly Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu
Met Asp 260 265 270Gly Ala Lys Gly Arg Leu Phe Ser Gly His Leu Lys
Cys Arg Leu Lys 275 280 285Met Asp Lys Leu Arg Leu Lys Gly Val Ser
Tyr Ser Leu Cys Thr Ala 290 295 300Ala Phe Thr Phe Thr Lys Val Pro
Ala Glu Thr Leu His Gly Thr Val305 310 315 320Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro 325 330 335Ala Gln Met
Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu 340 345 350Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser Lys Met 355 360
365Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val Ile Gly
370 375 380Val Gly Asp Lys Lys Ile Thr His His Trp His Arg Ser Gly
Ser Thr385 390 395 400Ile Gly Lys Ala Phe Glu Ala Thr Val Arg Gly
Ala Lys Arg Met Ala 405 410 415Val Leu Gly Asp Thr Ala Trp Asp Phe
Gly Ser Val Gly Gly Val Phe 420 425 430Asn Ser Leu Gly Lys Gly Ile
His Gln Ile Phe Gly Ala Ala Phe Lys 435 440 445Ser Leu Phe Gly Gly
Met Ser Trp Phe Ser Gln Ile Leu Ile Gly Thr 450 455 460Leu Leu Val
Trp Leu Gly Leu Asn Thr Lys Asn Gly Ser Ile Ser Leu465 470 475
480Thr Cys Leu Ala Leu Gly Gly Val Met Ile Phe Leu Ser Thr Ala Val
485 490 495Ser Ala15498PRTZika virus 15Ile Arg Cys Ile Gly Val Ser
Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp
Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp
Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn
Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp
Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr
Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90
95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
100 105 110Leu Val Thr Cys Ala Lys Phe Thr Cys Ser Lys Lys Met Thr
Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met
Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn
Asp Glu Asn Arg Ala Lys145 150 155 160Val Glu Val Thr Pro Asn Ser
Pro Arg Ala Glu Ala Thr Leu Gly Gly 165 170 175Phe Gly Ser Leu Gly
Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe 180 185 190Ser Asp Leu
Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val His 195 200 205Lys
Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ser Gly Ala Asp 210 215
220Thr Glu Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu Phe
Lys225 230 235 240Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu
Gly Ser Gln Glu 245 250 255Gly Ala Val His Thr Ala Leu Ala Gly Ala
Leu Glu Ala Glu Met Asp 260 265 270Gly Ala Lys Gly Arg Leu Ser Ser
Gly His Leu Lys Cys Arg Leu Lys 275 280 285Met Asp Lys Leu Arg Leu
Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala 290 295 300Ala Phe Thr Phe
Thr Lys Val Pro Ala Glu Thr Leu His Gly Thr Val305 310 315 320Thr
Val Glu Val Gln Tyr Ala Gly Arg Asp Gly Pro Cys Lys Val Pro 325 330
335Ala Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly Arg Leu
340 345 350Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser
Lys Met 355 360 365Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr
Ile Val Ile Gly 370 375 380Val Gly Asp Lys Lys Ile Thr His His Trp
His Arg Ser Gly Ser Ile385 390 395 400Ile Gly Lys Ala Phe Glu Ala
Thr Val Arg Gly Ala Lys Arg Met Ala 405 410 415Val Leu Gly Asp Thr
Ala Trp Asp Phe Gly Ser Val Gly Gly Val Phe 420 425 430Asn Ser Leu
Gly Lys Gly Ile His Gln Ile Phe Gly Ala Ala Phe Lys 435 440 445Ser
Leu Phe Gly Gly Met Ser Trp Phe Ser Gln Ile Leu Ile Gly Thr 450 455
460Leu Leu Val Trp Leu Gly Leu Asn Thr Lys Asn Gly Ser Ile Ser
Leu465 470 475 480Thr Cys
Leu Ala Leu Gly Gly Val Met Ile Phe Leu Ser Thr Ala Val 485 490
495Ser Ala16504PRTZika virusmisc_feature(156)..(162)X=any amino
acid 16Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met
Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys
Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu
Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys
Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro
Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln
Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly
Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys
Phe Thr Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro
Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser
Gln His Ser Gly Met Ile Val Asn Asp Xaa Xaa Xaa Xaa Xaa145 150 155
160Xaa Xaa Asn Arg Ala Glu Val Glu Val Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Phe Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Val Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Asp Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Val Phe Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Val Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Thr Cys Leu Ala Leu Gly Gly Val Met Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50017504PRTZika
virusmisc_feature(152)..(156)X=any amino acid 17Ile Arg Cys Ile Gly
Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp
Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala
Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val
Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile
Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Thr Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Xaa Xaa
Xaa Xaa Xaa Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys
Val Glu Val Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu
Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr
Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Arg Leu Val Arg Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Arg Leu Phe Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Val Pro Ala Glu305 310 315
320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Asp Lys Lys Ile Thr His His Trp385 390 395 400Leu Lys Lys Gly Ser
Ser Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Val Phe Asn Ser Leu Gly Lys Gly Val His Gln Ile 435 440
445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Val Trp Leu Gly Leu Asn
Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Thr Cys Leu Ala Leu
Gly Gly Val Met Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50018504PRTZika virus 18Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Thr Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Ile Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Val Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Phe Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Val Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Asp Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Val Phe Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Val Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Thr Cys Leu Ala Leu Gly Gly Val Met Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50019504PRTZika virus
19Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Thr Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Ile Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Val Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Phe Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Val Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Asp Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Val Phe Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Val Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Thr Cys Leu Ala Leu Gly Gly Val Met Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50020504PRTZika virus 20Ile Arg Cys Ile Gly Val
Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val
Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln
Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser
Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Thr Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val
Asn Asp Ile Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys
Val Glu Val Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu
Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr
Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Arg Leu Phe Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Val Pro Ala Glu305 310 315
320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Asp Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala
Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Val Phe Asn Ser Leu Gly Lys Gly Val His Gln Ile 435 440
445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Val Trp Leu Gly Leu Asn
Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Thr Cys Leu Ala Leu
Gly Gly Val Met Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50021504PRTZika virus 21Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Thr Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Ile Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Val Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Phe Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Val Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Asp Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Val Phe Asn
Ser Leu Gly Lys Gly Val His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Val Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Thr Cys Leu Ala Leu Gly Gly Val Met Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50022504PRTZika virus
22Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Thr Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Val Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Phe Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Val Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Asp Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Val Phe Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Val Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Thr Cys Leu Ala Leu Gly Gly Val Met Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50023504PRTZika virus 23Ile Arg Cys Ile Gly Val
Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val
Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln
Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser
Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Thr Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Thr Val
Asn Asp Ile Gly Tyr Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys
Val Glu Val Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu
Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr
Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Lys Leu Phe Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Val Pro Ala Glu305 310 315
320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335Gly Pro Cys Lys Ile Pro Val Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Asp Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Val Phe Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440
445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Val Trp Leu Gly Leu Asn
Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Thr Cys Leu Ala Leu
Gly Gly Val Met Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50024504PRTZika virus 24Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Thr Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Thr Val Asn Asp Ile Gly Tyr
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Val Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Lys Leu
Phe Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Val Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Ile Pro Val Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Asp Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Val Phe Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Val Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Thr Cys Leu Ala Leu Gly Gly Val Met Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50025504PRTZika virus
25Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Thr Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Thr Gly Tyr Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Val Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Lys Leu Phe Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Val Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Ile Pro Val Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser
355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe
Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly Asp Lys Lys Ile
Thr His His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys
Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val
Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Val
Phe Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala
Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln
Ile Leu Ile Gly Thr Leu Leu Val Trp Leu Gly Leu Asn Thr Lys465 470
475 480Asn Gly Ser Ile Ser Leu Thr Cys Leu Ala Leu Gly Gly Val Met
Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50026504PRTZika
virus 26Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met
Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys
Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu
Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys
Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro
Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln
Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly
Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys
Phe Thr Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro
Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser
Gln His Ser Gly Met Ile Val Asn Asp Ile Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Val Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Phe Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Val Cys Thr Ala Ala Lys Val Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Asp Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Val Phe Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Val Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Thr Cys Leu Ala Leu Gly Gly Val Met Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50027500PRTZika virus 27Ile Arg Cys Ile Gly Val
Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val
Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln
Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser
Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Thr Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Gly
Tyr Glu Thr Asp Glu Asp Arg145 150 155 160Ala Lys Val Glu Val Thr
Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu 165 170 175Gly Gly Phe Gly
Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu 180 185 190Asp Phe
Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu 195 200
205Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly
210 215 220Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu
Val Glu225 230 235 240Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val
Val Val Leu Gly Ser 245 250 255Gln Glu Gly Ala Val His Thr Ala Leu
Ala Gly Ala Leu Glu Ala Glu 260 265 270Met Asp Gly Ala Lys Gly Arg
Leu Phe Ser Gly His Leu Lys Cys Arg 275 280 285Leu Lys Met Asp Lys
Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys 290 295 300Thr Ala Ala
Phe Thr Phe Thr Lys Val Pro Ala Glu Thr Leu His Gly305 310 315
320Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys
325 330 335Ile Pro Val Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro
Val Gly 340 345 350Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
Thr Glu Asn Ser 355 360 365Lys Met Met Leu Glu Leu Asp Pro Pro Phe
Gly Asp Ser Tyr Ile Val 370 375 380Ile Gly Val Gly Asp Lys Lys Ile
Thr His His Trp His Arg Ser Gly385 390 395 400Ser Thr Ile Gly Lys
Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg 405 410 415Met Ala Val
Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly 420 425 430Val
Phe Asn Ser Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala Ala 435 440
445Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser Gln Ile Leu Ile
450 455 460Gly Thr Leu Leu Val Trp Leu Gly Leu Asn Thr Lys Asn Gly
Ser Ile465 470 475 480Ser Leu Thr Cys Leu Ala Leu Gly Gly Val Met
Ile Phe Leu Ser Thr 485 490 495Ala Val Ser Ala 50028500PRTZika
virus 28Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met
Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys
Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu
Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys
Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro
Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln
Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly
Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys
Phe Thr Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro
Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser
Gln His Ser Gly Met Ile Gly Tyr Glu Thr Asp Glu Asp Arg145 150 155
160Ala Lys Val Glu Val Thr Pro Asn Ser Pro Arg Ala Glu Ala Thr Leu
165 170 175Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro Arg Thr
Gly Leu 180 185 190Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn
Lys His Trp Leu 195 200 205Val His Lys Glu Trp Phe His Asp Ile Pro
Leu Pro Trp His Ala Gly 210 215 220Ala Asp Thr Gly Thr Pro His Trp
Asn Asn Lys Glu Ala Leu Val Glu225 230 235 240Phe Lys Asp Ala His
Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser 245 250 255Gln Glu Gly
Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu Ala Glu 260 265 270Met
Asp Gly Ala Lys Gly Arg Leu Phe Ser Gly His Leu Lys Cys Arg 275 280
285Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser Tyr Ser Leu Cys
290 295 300Thr Ala Ala Phe Thr Phe Thr Lys Val Pro Ala Glu Thr Leu
His Gly305 310 315 320Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr
Asp Gly Pro Cys Lys 325 330 335Ile Pro Val Gln Met Ala Val Asp Met
Gln Thr Leu Thr Pro Val Gly 340 345 350Arg Leu Ile Thr Ala Asn Pro
Val Ile Thr Glu Ser Thr Glu Asn Ser 355 360 365Lys Met Met Leu Glu
Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val 370 375 380Ile Gly Val
Gly Asp Lys Lys Ile Thr His His Trp His Arg Ser Gly385 390 395
400Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg
405 410 415Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val
Gly Gly 420 425 430Val Phe Asn Ser Leu Gly Lys Gly Ile His Gln Ile
Phe Gly Ala Ala 435 440 445Phe Lys Ser Leu Phe Gly Gly Met Ser Trp
Phe Ser Gln Ile Leu Ile 450 455 460Gly Thr Leu Leu Val Trp Leu Gly
Leu Asn Thr Lys Asn Gly Ser Ile465 470 475 480Ser Leu Thr Cys Leu
Ala Leu Gly Gly Val Met Ile Phe Leu Ser Thr 485 490 495Ala Val Ser
Ala 50029500PRTZika virus 29Ile Arg Cys Ile Gly Val Ser Asn Arg Asp
Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu
Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr
Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu
Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser
Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys
Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg
Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu
Val Thr Cys Ala Lys Phe Thr Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Gly Tyr Glu Thr Asp Glu
Asp Arg145 150 155 160Ala Lys Val Glu Val Thr Pro Asn Ser Pro Arg
Ala Glu Ala Thr Leu 165 170 175Gly Gly Phe Gly Ser Leu Gly Leu Asp
Cys Glu Pro Arg Thr Gly Leu 180 185 190Asp Phe Ser Asp Leu Tyr Tyr
Leu Thr Met Asn Asn Lys His Trp Leu 195 200 205Val His Lys Glu Trp
Phe His Asp Ile Pro Leu Pro Trp His Ala Gly 210 215 220Ala Asp Thr
Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu225 230 235
240Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val Val Leu Gly Ser
245 250 255Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala Leu Glu
Ala Glu 260 265 270Met Asp Gly Ala Lys Gly Arg Leu Phe Ser Gly His
Leu Lys Cys Arg 275 280 285Leu Lys Met Asp Lys Leu Arg Leu Lys Gly
Val Ser Tyr Ser Leu Cys 290 295 300Thr Ala Ala Phe Thr Phe Thr Lys
Val Pro Ala Glu Thr Leu His Gly305 310 315 320Thr Val Thr Val Glu
Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys 325 330 335Ile Pro Val
Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly 340 345 350Arg
Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu Asn Ser 355 360
365Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp Ser Tyr Ile Val
370 375 380Ile Gly Val Gly Asp Lys Lys Ile Thr His His Trp His Arg
Ser Gly385 390 395 400Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val
Arg Gly Ala Lys Arg 405 410 415Met Ala Val Leu Gly Asp Thr Ala Trp
Asp Phe Gly Ser Val Gly Gly 420 425 430Val Phe Asn Ser Leu Gly Lys
Gly Ile His Gln Ile Phe Gly Ala Ala 435 440 445Phe Lys Ser Leu Phe
Gly Gly Met Ser Trp Phe Ser Gln Ile Leu Ile 450 455 460Gly Thr Leu
Leu Val Trp Leu Gly Leu Asn Thr Lys Asn Gly Ser Ile465 470 475
480Ser Leu Thr Cys Leu Ala Leu Gly Gly Val Met Ile Phe Leu Ser Thr
485 490 495Ala Val Ser Ala 50030504PRTZika virus 30Ile Arg Cys Ile
Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr
Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met
Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr
Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55
60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65
70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr
Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys
Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys
Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg
Ile Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile
Val Asn Asp Thr Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala
Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr
Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg
Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Thr Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310
315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr
Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met
Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro
Val Ile Thr Glu Gly 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu
Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val
Gly Glu Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly
Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala
Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425
430Ser Val Gly Gly Val Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile
435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp
Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly
Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met Cys Leu
Ala Leu Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser
Ala 50031504PRTZika virus 31Ile Arg Cys Ile Gly Val Ser Asn Arg Asp
Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu
Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr
Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu
Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser
Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys
Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg
Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu
Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Thr
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Gly 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Val Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50032504PRTZika virus
32Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Met Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50033504PRTZika virus 33Ile Arg Cys Ile Gly Val
Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val
Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln
Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser
Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val
Asn Asp Thr Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys
Val Glu Ile Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu
Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr
Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315
320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Glu Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440
445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn
Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu
Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50034504PRTZika virus 34Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50035504PRTZika virus
35Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln
Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala
Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu
Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295
300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala
Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr
Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala
Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr
Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met
Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val
Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp385 390 395 400His
Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410
415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly
420 425 430Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His
Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met
Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp
Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met
Cys Leu Ala Leu Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala
Val Ser Ala 50036504PRTZika virus 36Ile Arg Cys Ile Gly Val Ser Asn
Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val
Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys
Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met
Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met
Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu
Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val
Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105
110Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser
Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr
Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile
Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe
Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp
Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp
Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp
His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230
235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val
Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu
Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg
Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys
Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala
Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly
Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro
Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345
350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe
Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile
Thr His His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys
Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val
Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala
Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala
Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln
Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470
475 480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu
Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50037504PRTZika
virus 37Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met
Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys
Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu
Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys
Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro
Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln
Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly
Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys
Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro
Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser
Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Met Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50038504PRTZika virus 38Ile Arg Cys Ile Gly Val
Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val
Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln
Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser
Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val
Asn Asp Thr Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys
Val Glu Ile Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu
Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr
Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315
320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Glu Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440
445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn
Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu
Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50039504PRTZika virus 39Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50040504PRTZika virus
40Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195
200 205Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu
Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn
Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala
Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala
Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp
Gly Ala Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg
Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser
Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315
320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Glu Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440
445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn
Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu
Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50041504PRTZika virus 41Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50042504PRTZika virus
42Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Met Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50043504PRTZika virus 43Ile Arg Cys Ile Gly Val
Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val
Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln
Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser
Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val
Asn Asp Thr Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys
Val Glu Ile Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu
Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr
Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315
320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Glu Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440
445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn
Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu
Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50044504PRTZika virus 44Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50045504PRTZika virus
45Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp
Thr Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu
Ile Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly
Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu
Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His
Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215
220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys
Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg
Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His
Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala
Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys
Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys
Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr
Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330
335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu
340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr
Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro
Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys
Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile
Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met
Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly
Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe
Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455
460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr
Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly
Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50046504PRTZika virus 46Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50047504PRTZika virus
47Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Met Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50048504PRTZika virus 48Ile Arg Cys Ile Gly Val
Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val
Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln
Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser
Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val
Asn Asp Thr Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys
Val Glu Ile Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu
Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr
Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315
320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Glu Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440
445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn
Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu
Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50049504PRTZika virus 49Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Ile Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50050504PRTZika virus
50Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Ile Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85
90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val
Asn Asp Thr Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys
Val Glu Ile Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu
Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr
Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315
320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Glu Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440
445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn
Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu
Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50051504PRTZika virus 51Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Ile
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50052504PRTZika virus
52Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Thr Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Met Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50053504PRTZika virus 53Ile Arg Cys Ile Gly Val
Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val
Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln
Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser
Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val
Asn Asp Thr Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys
Val Glu Ile Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu
Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr
Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Thr Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315
320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Glu Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440
445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn
Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu
Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50054504PRTZika virus 54Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Arg Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50055504PRTZika virus
55Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro
Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala
Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala
Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp
Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp
Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105
110Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys
115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser
Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr
Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile
Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe
Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp
Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp
Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp
His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230
235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val
Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu
Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg
Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys
Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala
Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly
Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro
Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345
350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser
355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe
Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile
Thr His His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys
Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val
Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala
Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala
Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln
Ile Leu Ile Gly Thr Leu Leu Val Trp Leu Gly Leu Asn Thr Lys465 470
475 480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu
Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50056504PRTZika
virus 56Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met
Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys
Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu
Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys
Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro
Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln
Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly
Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys
Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro
Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser
Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Met Trp Leu Gly Leu Asn Ala Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50057504PRTZika virus 57Ile Arg Cys Ile Gly Val
Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val
Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln
Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser
Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val
Asn Asp Thr Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys
Val Glu Ile Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu
Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr
Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315
320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Glu Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440
445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn
Ala Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu
Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50058504PRTZika virus 58Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Thr Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50059504PRTZika virus
59Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Gly Thr Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Met Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Met Cys Leu Ala Leu Gly
Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50060504PRTZika virus 60Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Leu Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50061504PRTZika virus
61Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Ala Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Val Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Thr Cys Leu Ala Leu Gly Gly Val Leu Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50062504PRTZika virus 62Ile Arg Cys Ile Gly Val
Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val
Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln
Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser
Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val
Asn Asp Thr Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys
Val Glu Ile Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu
Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr
Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Trp Leu Val His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Ala Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315
320Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp
325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Glu Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440
445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Val Trp Leu Gly Leu Asn
Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Thr Cys Leu Ala Leu
Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Ala
50063504PRTZika virus 63Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Ile Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Val
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Asp Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Leu His Gly Thr
Val Thr Val Glu Val Gln Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Val Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Thr Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Ala 50064504PRTZika virus
64Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Ser Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Thr 50 55 60Ile Ser Asp Ile Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Thr Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Ala Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Glu Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425
430Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile
435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp
Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly
Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met Cys Leu
Ala Leu Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser
Ala 50065504PRTZika virusmisc_feature(1)..(504)X=any amino acid
65Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Val Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr
Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr
Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys
Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ala Lys Phe
Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Xaa Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Tyr Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Asp Lys Lys Ile Thr His His Trp385 390 395
400Xaa Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Val Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Thr Cys Leu Ala Leu Gly Gly Val Leu Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50066504PRTZika virus 66Ile Arg Cys Ile Gly Val
Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val
Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Ala Met Ala Gln
Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser
Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Leu Val
Asn Asp Thr Gly His Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys
Val Glu Ile Thr Pro Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu
Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr
Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200
205Lys His Trp Leu Ala His Lys Glu Trp Phe His Asp Ile Pro Leu Pro
210 215 220Trp His Ala Gly Ala Ala Thr Gly Thr Pro His Trp Asn Asn
Lys Glu225 230 235 240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys
Arg Gln Thr Val Val 245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val
His Thr Ala Leu Ala Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly
Ala Lys Gly Arg Leu Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu
Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu
Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro Ala Glu305 310 315
320Thr Val Asp Gly Thr Val Thr Val Glu Gly Gln Tyr Gly Gly Thr Asp
325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp Met Gln
Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val
Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys Met Met Leu Glu Leu
Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile Val Ile Gly Val Gly
Glu Lys Lys Ile Thr His His Trp385 390 395 400His Arg Ser Gly Ser
Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg 405 410 415Gly Ala Lys
Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser
Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His Gln Ile 435 440
445Ile Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser
450 455 460Gln Ile Leu Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn
Thr Lys465 470 475 480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu
Gly Gly Val Leu Ile 485 490 495Phe Leu Ser Thr Ala Val Ser Gly
50067504PRTZika virus 67Ile Arg Cys Ile Gly Val Ser Asn Arg Asp Phe
Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu
His Gly Gly Cys Val Thr 20 25 30Ala Met Ala Gln Asp Lys Pro Thr Val
Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser Asn Met Ala Glu Val
Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser Asp Met Ala Ser Asp
Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75 80Tyr Leu Asp Lys Gln
Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu 85 90 95Val Asp Arg Gly
Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser 100 105 110Leu Val
Thr Cys Ala Lys Phe Ala Cys Ser Lys Lys Met Thr Gly Lys 115 120
125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile Met Leu Ser Val His
130 135 140Gly Ser Gln His Ser Gly Met Leu Val Asn Asp Thr Gly His
Glu Thr145 150 155 160Asp Glu Asn Arg Ala Lys Val Glu Ile Thr Pro
Asn Ser Pro Arg Ala 165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser
Leu Gly Leu Asp Cys Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser
Asp Leu Tyr Tyr Leu Thr Met Asn Asn 195 200 205Lys His Trp Leu Ala
His Lys Glu Trp Phe His Asp Ile Pro Leu Pro 210 215 220Trp His Ala
Gly Ala Ala Thr Gly Thr Pro His Trp Asn Asn Lys Glu225 230 235
240Ala Leu Val Glu Phe Lys Asp Ala His Ala Lys Arg Gln Thr Val Val
245 250 255Val Leu Gly Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala
Gly Ala 260 265 270Leu Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu
Ser Ser Gly His 275 280 285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu
Arg Leu Lys Gly Val Ser 290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe
Thr Phe Thr Lys Ile Pro Ala Glu305 310 315 320Thr Val Asp Gly Thr
Val Thr Val Glu Gly Gln Tyr Gly Gly Thr Asp 325 330 335Gly Pro Cys
Lys Val Pro Ala Gln Met Ala Val Asp Met Gln Thr Leu 340 345 350Thr
Pro Val Gly Arg Leu Ile Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360
365Thr Glu Asn Ser Lys Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp
370 375 380Ser Tyr Ile Val Ile Gly Val Gly Glu Lys Lys Ile Thr His
His Trp385 390 395 400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe
Glu Ala Thr Val Arg 405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly
Asp Thr Ala Trp Asp Phe Gly 420 425 430Ser Val Gly Gly Ala Leu Asn
Ser Leu Gly Lys Gly Ile His Gln Ile 435 440 445Ile Gly Ala Ala Phe
Lys Ser Leu Phe Gly Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu
Ile Gly Thr Leu Leu Met Trp Leu Gly Leu Asn Thr Lys465 470 475
480Asn Gly Ser Ile Ser Leu Met Cys Leu Ala Leu Gly Gly Val Leu Ile
485 490 495Phe Leu Ser Thr Ala Val Ser Gly 50068504PRTZika virus
68Ile Ser Cys Ile Gly Val Ser Asn Arg Asp Leu Val Glu Gly Met Ser1
5 10 15Gly Gly Thr Trp Val Asp Val Val Leu Glu His Gly Gly Cys Val
Thr 20 25 30Glu Met Ala Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val
Thr Met 35 40 45Thr Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr
Glu Ala Ser 50 55 60Leu Ser Asp Met Ala Ser Ala Ser Arg Cys Pro Thr
Gln Gly Glu Pro65 70 75 80Ser Leu Asp Lys Gln Ser Asp Thr Gln Ser
Val Cys Lys Arg Thr Leu 85 90 95Gly Asp Arg Gly Trp Gly Asn Gly Cys
Gly Ile Phe Gly Lys Gly Ser 100 105 110Leu Val Thr Cys Ser Lys Phe
Thr Cys Cys Lys Lys Met Pro Gly Lys 115 120 125Ser Ile Gln Pro Glu
Asn Leu Glu Tyr Arg Ile Met Leu Pro Val His 130 135 140Gly Ser Gln
His Ser Gly Met Ile Val Asn Asp Ile Gly His Glu Thr145 150 155
160Asp Glu Asn Arg Ala Lys Val Glu Val Thr Pro Asn Ser Pro Arg Ala
165 170 175Glu Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys
Glu Pro 180 185 190Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu
Thr Met Asn Asn 195 200 205Lys His Trp Leu Val His Lys Glu Trp Phe
His Asp Ile Pro Leu Pro 210 215 220Trp His Ala Gly Ala Asp Thr Gly
Thr Pro His Trp Asn Asn Lys Glu225 230 235 240Ala Leu Val Glu Phe
Lys Asp Ala His Ala Lys Arg Gln Thr Val Val 245 250 255Val Leu Gly
Ser Gln Glu Gly Ala Val His Thr Ala Leu Ala Gly Ala 260 265 270Leu
Glu Ala Glu Met Asp Gly Ala Lys Gly Arg Leu Phe Ser Gly His 275 280
285Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly Val Ser
290 295 300Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Val Pro
Ala Glu305 310 315 320Thr Leu His Gly Thr Val Thr Val Glu Val Gln
Ser Ala Gly Thr Asp 325 330 335Gly Pro Cys Lys Val Pro Ala Gln Met
Ala Val Asp Met Gln Thr Leu 340 345 350Thr Pro Val Gly Arg Leu Ile
Thr Ala Asn Pro Val Ile Thr Glu Ser 355 360 365Thr Glu Asn Ser Lys
Met Met Leu Glu Leu Asp Pro Pro Phe Gly Asp 370 375 380Ser Tyr Ile
Val Ile Gly Val Gly Asp Lys Lys Ile Thr His His Trp385 390 395
400His Arg Ser Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr Val Arg
405 410 415Gly Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp
Phe Gly 420 425 430Ser Val Gly Gly Val Phe Asn Ser Leu Gly Lys Gly
Ile His Gln Ile 435 440 445Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly
Gly Met Ser Trp Phe Ser 450 455 460Gln Ile Leu Ile Gly Thr Leu Leu
Val Trp Leu Gly Leu Asn Thr Lys465 470 475 480Asn Gly Ser Ile Ser
Leu Thr Cys Leu Ala Leu Gly Gly Val Met Ile 485 490 495Phe Leu Ser
Thr Ala Val Ser Ala 50069498PRTZika virus 69Ile Arg Cys Ile Gly Val
Ser Asn Arg Asp Phe Val Glu Gly Met Ser1 5 10 15Gly Gly Thr Trp Val
Asp Val Val Leu Glu His Gly Gly Cys Val Thr 20 25 30Val Met Ala Gln
Asp Lys Pro Thr Val Asp Ile Glu Leu Val Thr Thr 35 40 45Thr Val Ser
Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu Ala Ser 50 55 60Ile Ser
Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly Glu Ala65 70 75
80Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg Thr Leu
85 90 95Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly
Ser 100 105 110Leu Val Thr Cys Ala Lys Phe Thr Cys Ser Lys Lys Met
Thr Gly Lys 115 120 125Ser Ile Gln Pro Glu Asn Leu Glu Tyr Arg Ile
Met Leu Ser Val His 130 135 140Gly Ser Gln His Ser Gly Met Ile Val
Asn Asp Glu Asn Arg Ala Lys145 150 155 160Val Glu Val Thr Pro Asn
Ser Pro Arg Ala Glu Ala Thr Leu Gly Gly 165 170 175Phe Gly Ser Leu
Gly Leu Asp Cys Glu Pro Arg Thr Gly Leu Asp Phe 180 185 190Ser Asp
Leu Tyr Tyr Leu Thr Met Asn Asn Lys His Trp Leu Val His 195 200
205Lys Glu Trp Phe His Asp Ile Pro Leu Pro Trp His Ala Gly Ala Asp
210 215 220Thr Gly Thr Pro His Trp Asn Asn Lys Glu Ala Leu Val Glu
Phe Lys225 230 235 240Asp Ala His Ala Lys Arg Gln Thr Val Val Val
Leu Gly Ser Gln Glu 245 250 255Gly Ala Val His Thr Ala Leu Ala Gly
Ala Leu Glu Ala Glu Met Asp 260 265 270Gly Ala Lys Gly Arg Leu Phe
Ser Gly His Leu Lys Cys Arg Leu Lys 275 280 285Met Asp Lys Leu Arg
Leu Lys Gly Val Ser Tyr Ser Leu Cys Thr Ala 290 295 300Ala Phe Thr
Phe Thr Lys Val Pro Ala Glu Thr Leu His Gly Thr Val305 310 315
320Thr Val Glu Val Gln Tyr Ala Gly Thr Asp Gly Pro Cys Lys Val Pro
325 330 335Ala Gln Met Ala Val Asp Met Gln Thr Leu Thr Pro Val Gly
Arg Leu 340 345 350Ile Thr Ala Asn Pro Val Ile Thr Glu Ser Thr Glu
Asn Ser Lys Met 355 360 365Met Leu Glu Leu Asp Pro Pro
Phe Gly Asp Ser Tyr Ile Val Ile Gly 370 375 380Val Gly Asp Lys Lys
Ile Thr His His Trp His Arg Ser Gly Ser Thr385 390 395 400Ile Gly
Lys Ala Phe Glu Ala Thr Val Arg Gly Ala Lys Arg Met Ala 405 410
415Val Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Val Phe
420 425 430Asn Ser Leu Gly Lys Gly Ile His Gln Ile Phe Gly Ala Ala
Phe Lys 435 440 445Ser Leu Phe Gly Gly Met Ser Trp Phe Ser Gln Ile
Leu Ile Gly Thr 450 455 460Leu Leu Val Trp Leu Gly Leu Asn Thr Lys
Asn Gly Ser Ile Ser Leu465 470 475 480Thr Cys Leu Ala Leu Gly Gly
Val Met Ile Phe Leu Ser Thr Ala Val 485 490 495Ser
Ala7026DNAArtificial Sequence1-26
n=inosinemisc_feature(1)..(26)n=inosine 70ncncncncnc ncncncncnc
ncncnc 267111PRTArtificial Sequencesynthetic peptide 71Lys Leu Lys
Leu Leu Leu Leu Leu Lys Leu Lys1 5 107211840DNAChikungunya virus
72atggctgcgt gagacacacg tagcctacca gtttcttact gctctactct gcaaagcaag
60agattaataa cccatcatgg atcctgtgta cgtggacata gacgctgaca gcgccttttt
120gaaggccctg caacgtgcgt accccatgtt tgaggtggaa ccaaggcagg
tcacaccgaa 180tgaccatgct aatgctagag cgttctcgca tctagctata
aaactaatag agcaggaaat 240tgaccccgac tcaaccatcc tggatatcgg
cagtgcgcca gcaaggagga tgatgtcgga 300caggaagtac cactgcgtct
gcccgatgcg cagtgcggaa gatcccgaga gactcgccaa 360ttatgcgaga
aagctagcat ctgccgcagg aaaagtcctg gacagaaaca tctctggaaa
420gatcggggac ttacaagcag taatggccgt gccagacacg gagacgccaa
cattctgctt 480acacacagac gtctcatgta gacagagagc agacgtcgct
atataccaag acgtctatgc 540tgtacacgca cccacgtcgc tataccacca
ggcgattaaa ggggtccgag tggcgtactg 600ggttgggttc gacacaaccc
cgttcatgta caatgccatg gcgggtgcct acccctcata 660ctcgacaaac
tgggcagatg agcaggtact gaaggctaag aacataggat tatgttcaac
720agacctgacg gaaggtagac gaggcaagtt gtctattatg agagggaaaa
agctaaaacc 780gtgcgaccgt gtgctgttct cagtagggtc aacgctctac
ccggaaagcc gcaagctact 840taagagctgg cacctgccat cggtgttcca
tttaaagggc aaactcagct tcacatgccg 900ctgtgataca gtggtttcgt
gtgagggcta cgtcgttaag agaataacga tgagcccagg 960cctttatgga
aaaaccacag ggtatgcggt aacccaccac gcagacggat tcctgatgtg
1020caagactacc gacacggttg acggcgaaag artgtcattc tcggtgtgca
catacgtgcc 1080ggcgaccatt tgtgatcaaa tgaccggcat ccttgctaca
gaagtcacgc cggaggatgc 1140acagaagctg ttggtggggc tgaaccagag
aatagtggtt aacggcagaa cgcaacggaa 1200tacgaacacc atgaaaaatt
atctgcttcc cgtggtcgcc caagccttca gtaagtgggc 1260aaaggagtgc
cggaaagaca tggaagatga aaaactcctg ggggtcagag aaagaacact
1320gacctgctgc tgtctatggg cattcaagaa gcagaaaaca cacacggtct
acaagaggcc 1380tgatacccag tcaattcaga aggttcaggc cgagtttgac
agctttgtgg taccgagtct 1440gtggtcgtcc gggttgtcaa tccctttgag
gactagaatc aaatggttgt taagcaaggt 1500gccaaaaacc gacctgatcc
catacagcgg agacgcccga gaagcccggg acgcagaaaa 1560agaagcagag
gaagaacgag aagcagaact gactcgcgaa gccctaccac ctctacaggc
1620agcacaggaa gatgttcagg tcgaaatcga cgtggaacag cttgaggaca
gagcgggcgc 1680aggaataata gagactccga gaggagctat caaagttact
gcccaaccaa cagaccacgt 1740cgtgggagag tacctggtac tctccccgca
gaccgtacta cgtagccaga agctcagtct 1800gattcacgct ttggcggagc
aagtgaagac gtgcacgcac aacggacgag cagggaggta 1860tgcggtcgaa
gcgtacgacg gccgagtcct agtgccctca ggctatgcaa tctcgcctga
1920agacttccag agtctaagcg aaagcgcaac gatggtgtat aacgaaagag
agttcgtaaa 1980cagaaagcta caccatattg cgatgcacgg accagccctg
aacaccgacg aagagtcgta 2040tgagctggtg agggcagaga ggacagaaca
cgagtacgtc tacgacgtgg atcagagaag 2100atgctgtaag aaggaagaag
ccgcaggact ggtactggtg ggcgacttga ctaatccgcc 2160ctaccacgaa
ttcgcatatg aagggctaaa aatccgccct gcctgcccat acaaaattgc
2220agtcatagga gtcttcggag taccgggatc tggcaagtca gctattatca
agaacctagt 2280taccaggcag gacctggtga ctagcggaaa gaaagaaaac
tgccaagaaa tcaccaccga 2340cgtgatgaga cagagaggtc tagagatatc
tgcacgtacg gttgactcgc tgctcttgaa 2400tggatgcaac agaccagtcg
acgtgttgta cgtagacgag gcgtttgcgt gccactctgg 2460aacgctactt
gctttgatcg ccttggtgag accaaggcag aaagttgtac tttgtggtga
2520cccgaagcag tgcggcttct tcaatatgat gcagatgaaa gtcaactata
atcacaacat 2580ctgcacccaa gtgtaccaca aaagtatctc caggcggtgt
acactgcctg tgaccgccat 2640tgtgtcatcg ttgcattacg aaggcaaaat
gcgcactacg aatgagtaca acaagccgat 2700tgtagtggac actacaggct
caacaaaacc tgaccctgga gacctcgtgt taacgtgctt 2760cagagggtgg
gttaaacaac tgcaaattga ctatcgtgga tacgaggtca tgacagcagc
2820cgcatcccaa gggttaacca gaaaaggagt ttacgcagtt agacaaaaag
ttaatgaaaa 2880cccgctctat gcatcaacgt cagagcacgt caacgtactc
ctaacgcgta cggaaggtaa 2940actggtatgg aagacacttt ccggcgaccc
gtggataaag acgctgcaga acccaccgaa 3000aggaaacttc aaagcaacta
ttaaggagtg ggaggtggag catgcatcaa taatggcggg 3060catctgcagt
caccaaatga ccttcgatac attccaaaat aaagccaacg tttgttgggc
3120taagagcttg gtccctatcc tcgaaacagc ggggataaaa ctaaatgata
ggcagtggtc 3180tcagataatt caagccttca aagaagacaa agcatactca
cctgaagtag ccctgaatga 3240aatatgtacg cgcatgtatg gggtggatct
agacagcggg ctattttcta aaccgttggt 3300gtctgtgtat tacgcggata
accactggga taataggcct ggagggaaaa tgttcggatt 3360taaccccgag
gcagcatcca ttctagaaag aaagtatcca ttcacaaaag ggaagtggaa
3420catcaacaag cagatctgcg tgactaccag gaggatagaa gactttaacc
ctaccaccaa 3480catcataccg gccaacagga gactaccaca ctcattagtg
gccgaacacc gcccagtaaa 3540aggggaaaga atggaatggc tggttaacaa
gataaacggc caccacgtgc tcctggtcag 3600tggctataac cttgcactgc
ctactaagag agtcacttgg gtagcgccgt taggtgtccg 3660cggagcggac
tacacataca acctagagtt gggtctgcca gcaacgcttg gtaggtatga
3720cctagtggtc ataaacatcc acacaccttt tcgcatacac cattaccaac
agtgcgtcga 3780ccacgcaatg aaactgcaaa tgctcggggg tgactcattg
agactgctca aaccgggcgg 3840ctctctattg atcagagcat atggttacgc
agatagaacc agtgaacgag tcatctgcgt 3900attgggacgc aagtttagat
cgtctagagc gttgaaacca ccatgtgtca ccagcaacac 3960tgagatgttt
ttcctattca gcaactttga caatggcaga aggaatttca caactcatgt
4020catgaacaat caactgaatg cagccttcgt aggacaggtc acccgagcag
gatgtgcacc 4080gtcgtaccgg gtaaaacgca tggacatcgc gaagaacgat
gaagagtgcg tagtcaacgc 4140cgctaaccct cgcgggttac cgggtgrcgg
tgtttgcaag gcagtataca aaaaatggcc 4200ggagtccttt aagaacagtg
caacaccagt gggaaccgca aaaacagtta tgtgcggtac 4260gtatccagta
atccacgctg ttggaccaaa cttctctaat tattcggagt ctgaagggga
4320ccgggaattg gcagctgcct atcgagaagt cgcaaaggaa gtaactaggc
tgggagtaaa 4380tagtgtagct atacctctcc tctccacagg tgtatactca
ggagggaaag acaggctgac 4440ccagtcactg aaccacctct ttacagccat
ggactcgacg gatgcagacg tggtcatcta 4500ctgccgcgac aaagaatggg
agaagaaaat atctgaggcc atacagatgc ggacccaagt 4560agagctgctg
gatgagcaca tctccataga ctgcgatatt gttcgcgtgc accctgacag
4620cagcttggca ggcagaaaag gatacagcac cacggaaggc gcactgtact
catatctaga 4680agggacccgt tttcatcaga cggctgtgga tatggcggag
atacatacta tgtggccaaa 4740gcaaacagag gccaatgagc aagtctgcct
atatgccctg ggggaaagta ttgaatcgat 4800caggcagaaa tgcccggtgg
atgatgcaga cgcatcatct ccccccaaaa ctgtcccgtg 4860cctttgccgt
tacgctatga ctccagaacg cgtcacccgg cttcgcatga accacgtcac
4920aagcataatt gtgtgttctt cgtttcccct cccaaagtac aaaatagaag
gagtgcaaaa 4980agtcaaatgc tctaaggtaa tgctatttga ccacaacgtg
ccatcgcgcg taagtccaag 5040ggaatataka tcttcccagg agtctgcaca
ggaggcgagt acaatcacgt cactgacgca 5100tagtcaattc gacctaagcg
ttgatggcga gatactgccc gtcccgtcag acctggatgc 5160tgacgcccca
gccctagaac cagcactaga cgacggggcg acacacacgc tgccatccac
5220aaccggaaac cttgcggccg tgtctgattg ggtaatgagc accgtacctg
tcgcgccgcc 5280cagaagaagg cgagggagaa acctgactgt gacatgtgac
gagagagaag ggaatataac 5340acccatggct agcgtccgat tctttagggc
agagctgtgt ccggtcgtac aagaaacagc 5400ggagacgcgt gacacagcaa
tgtctcttca ggcaccaccg agtaccgcca cggaaccgaa 5460tcatccgccg
atctccttcg gagcatcaag cgagacgttc cccattacat ttggggactt
5520caacgaagga gaaatcgaaa gcttgtcttc tgagctacta actttcggag
acttcttacc 5580aggagaagtg gatgacttga cagacagcga ctggtccacg
tgctcagaca cggacgacga 5640gttatgacta gacagggcag gtgggtatat
attctcgtcg gacaccggtc caggtcattt 5700acaacagaag tcagtacgcc
agtcagtgct gccggtgaac accctggagg aagtccacga 5760ggagaagtgt
tacccaccta agctggatga agcaaaggag caactattac ttaagaaact
5820ccaggagagt gcatccatgg ccaacagaag caggtatcag tcgcgcaaag
tagaaaacat 5880gaaagcagca atcatccaga gactaaagag aggctgtaga
ctatacttaa tgtcagagac 5940cccaaaagtc cctacttacc ggactacata
tccggcgcct gtgtactcgc ctccgatcaa 6000cgtccgattg tccaatcccg
agtccgcagt ggcagcatgc aatgagttct tagctagaaa 6060ctatccaact
gtctcatcat accaaattac cgacgagtat gatgcatatc tagacatggt
6120ggacgggtcg gagagttgcc tggaccgagc gacattcaat ccgtcaaaac
tcaggagcta 6180cccgaaacag cacgcttacc acgcgccctc catcagaagc
gctgtaccgt ccccattcca 6240gaacacacta cagaatgtac tggcagcagc
cacgaaaaga aactgcaacg tcacacagat 6300gagggaatta cccactttgg
actcagcagt attcaacgtg gagtgtttca aaaaattcgc 6360atgcaaccaa
gaatactggg aagaatttgc tgccagccct attaggataa caactgagaa
6420tttagcaacc tatgttacta aactaaaagg gccaaaagca gcagcgctat
tcgcaaaaac 6480ccataatcta ctgccactac aggaagtacc aatggatagg
ttcacagtag atatgaaaag 6540ggacgtaaag gtgactcctg gtacaaagca
tacagaggaa agacctaagg tgcaggttat 6600acaggcggct gaacccttgg
cgacagcata cctatgtggg attcacagag agctggttag 6660gaggctgaac
gccgtcctcc tacccaatgt acatacacta tttgacatgt ctgccgagga
6720tttcgatgcc atcatagccg cacactttaa gccaggagac actgttttgg
aaacggacat 6780agcctccttt gataagagcc aagatgattc acttgcgctt
actgctttga tgctgttaga 6840ggatttaggg gtggatcact ccctgctgga
cttgatagag gctgctttcg gagagatttc 6900cagctgtcac ctaccgacag
gtacgcgctt caagttcggc gccatgatga aatcaggtat 6960gttcctaact
ctgttcgtca acacattgtt aaacatcacc atcgccagcc gagtgctgga
7020agatcgtctg acaaaatccg cgtgcgcggc cttcatcggc gacgacaaca
taatacatgg 7080agtcgtctcc gatgaattga tggcagccag atgtgccact
tggatgaaca tggaagtgaa 7140gatcatagat gcagttgtat ccttgaaagc
cccttacttt tgtggagggt ttatactgca 7200cgatactgtg acaggaacag
cttgcagagt ggcagacccg ctaaaaaggc tttttaaact 7260gggcaaaccg
ctagcggcag gtgacgaaca agatgaagat agaagacgag cgctggctga
7320cgaagtgatc agatggcaac gaacagggct aattgatgag ctggagaaag
cggtatactc 7380taggtacgaa gtgcagggta tatcagttgt ggtaatgtcc
atggccacct ttgcaagctc 7440cagatccaac ttcgagaagc tcagaggacc
cgtcataact ttgtacggcg gtcctaaata 7500ggtacgcact acagctacct
attttgcaga agccgacagc aagtatctaa acactaatca 7560gctacaatgg
agttcatccc aacccaaact ttttacaata ggaggtacca gcctcgaccc
7620tggactccgc gccctactat ccaagtcatc aggcccagac cgcgccctca
gaggcaagct 7680gggcaacttg cccagctgat ctcagcagtt aataaactga
caatgcgcgc ggtaccccaa 7740cagaagccac gcaggaatcg gaagaataag
aagcaaaagc aaaaacaaca ggcgccacaa 7800aacaacacaa atcaaaagaa
gcagccacct aaaaagaaac cggctcaaaa gaaaaagaag 7860ccgggccgca
gagagaggat gtgcatgaaa atcgaaaatg attgtatttt cgaagtcaag
7920cacgaaggta aggtaacagg ttacgcgtgc ctggtggggg acaaagtaat
gaaaccagca 7980cacgtaaagg ggaccatcga taacgcggac ctggccaaac
tggcctttaa gcggtcatct 8040aagtatgacc ttgaatgcgc gcagataccc
gtgcacatga agtccgacgc ttcgaagttc 8100acccatgaga aaccggaggg
gtactacaac tggcaccacg gagcagtaca gtactcagga 8160ggccggttca
ccatccctac aggtgctggc aaaccagggg acagcggcag accgatcttc
8220gacaacaagg gacgcgtggt ggccatagtc ttaggaggag ctaatgaagg
agcccgtaca 8280gccctctcgg tggtgacctg gaataaagac attgtcacta
aaatcacccc cgagggggcc 8340gaagagtgga gtcttgccat cccagttatg
tgcctgttgg caaacaccac gttcccctgc 8400tcccagcccc cttgcacgcc
ctgctgctac gaaaaggaac cggaggaaac cctacgcatg 8460cttgaggaca
acgtcatgag acctgggtac tatcagctgc tacaagcatc cttaacatgt
8520tctccccacc gccagcgacg cagcaccaag gacaacttca atgtctataa
agccacaaga 8580ccatacttag ctcactgtcc cgactgtgga gaagggcact
cgtgccatag tcccgtagca 8640ctagaacgca tcagaaatga agcgacagac
gggacgctga aaatccaggt ctccttgcaa 8700atcggaataa agacggatga
cagccacgat tggaccaagc tgcgttatat ggacaaccac 8760atgccagcag
acgcagagag ggcggggcta tttgtaagaa catcagcacc gtgtacgatt
8820actggaacaa tgggacactt catcctggcc cgatgtccaa aaggggaaac
tctgacggtg 8880ggattcactg acagtaggaa gattagtcac tcatgtacgc
acccatttca ccacgaccct 8940cctgtgatag gtcgggaaaa attccattcc
cgaccgcagc acggtaaaga gctaccttgc 9000agcacgtacg tgcagagcac
cgccgcaact accgaggaga tagaggtaca catgccccca 9060gacacccctg
atcgcacatt aatgtcacaa cagtccggca acgtaaagat cacagtcaat
9120ggccagacgg tgcggtacaa gtgtaattgc ggtggctcaa atgaaggact
aacaactaca 9180gacaaagtga ttaataactg caaggttgat caatgtcatg
ccgcggtcac caatcacaaa 9240aagtggcagt ataactcccc tctggtcccg
cgtaatgctg aacttgggga ccgaaaagga 9300aaaattcaca tcccgtttcc
gctggcaaat gtaacatgca gggtgcctaa agcaaggaac 9360cccaccgtga
cgtacgggaa aaaccaagtc atcatgctac tgtatcctga ccacccaaca
9420ctcctgtcct accggaatat gggagaagaa ccaaactatc aagaagagtg
ggtgatgcat 9480aagaaggaag tcgtgctaac cgtgccgact gaagggctcg
aggtcacgtg gggcaacaac 9540gagccgtata agtattggcc gcagttatct
acaaacggta cagcccatgg ccacccgcat 9600gagataattc tgtattatta
tgagctgtac cccactatga ctgtagtagt tgtgtcagtg 9660gccacgttca
tactcctgtc gatggtgggt atggcagcgg ggatgtgcat gtgtgcacga
9720cgcagatgca tcacaccgta tgaactgaca ccaggagcta ccgtcccttt
cctgcttagc 9780ctaatatgct gcatcagaac agctaaagcg gccacatacc
aagaggctgc gatatacctg 9840tggaacgagc agcaaccttt gttttggcta
caagccctta ttccgctggc agccctgatt 9900gttctatgca actgtctgag
actcttacca tgctgctgta aaacgttggc ttttttagcc 9960gtaatgagcg
tcggtgccca cactgtgagc gcgtacgaac acgtaacagt gatcccgaac
10020acggtgggag taccgtataa gactctagtc aatagacctg gctacagccc
catggtattg 10080gagatggaac tactgtcagt cactttggag ccaacactat
cgcttgatta catcacgtgc 10140gagtacaaaa ccgtcatccc gtctccgtac
gtgaagtgct gcggtacagc agagtgcaag 10200gacaaaaacc tacctgacta
cagctgtaag gtcttcaccg gcgtctaccc atttatgtgg 10260ggcggcgcct
actgcttctg cgacgctgaa aacacgcagt tgagcgaagc acacgtggag
10320aagtccgaat catgcaaaac agaatttgca tcagcataca gggctcatac
cgcatctgca 10380tcagctaagc tccgcgtcct ttaccaagga aataacatca
ctgtaactgc ctatgcaaac 10440ggcgaccatg ccgtcacagt taaggacgcc
aaattcattg tggggccaat gtcttcagcc 10500tggacacctt tcgacaacaa
aattgtggtg tacaaaggtg acgtctataa catggactac 10560ccgccctttg
gcgcaggaag accaggacaa tttggcgata tccaaagtcg cacacctgag
10620agtaaagacg tctatgctaa tacacaactg gtactgcaga gaccggctgt
gggtacggta 10680cacgtgccat actctcaggc accatctggc tttaagtatt
ggctaaaaga acgcggggcg 10740tcgctgcagc acacagcacc atttggctgc
caaatagcaa caaacccggt aagagcggtg 10800aactgcgccg tagggaacat
gcccatctcc atcgacatac cggaagcggc cttcactagg 10860gtcgtcgacg
cgccctcttt aacggacatg tcgtgcgagg taccagcctg cacccattcc
10920tcagactttg ggggcgtcgc cattattaaa tatgcagcca gcaagaaagg
caagtgtgcg 10980gtgcattcga tgactaacgc cgtcactatt cgggaagctg
agatagaagt tgaagggaat 11040tctcagctgc aaatctcttt ctcgacggcc
ttagccagcg ccgaattccg cgtacaagtc 11100tgttctacac aagtacactg
tgcagccgag tgccaccccc cgaaggacca catagtcaac 11160tacccggcgt
cacataccac cctcggggtc caggacatct ccgctacggc gatgtcatgg
11220gtgcagaaga tcacgggagg tgtgggactg gttgttgctg ttgccgcact
gattctaatc 11280gtggtgctat gcgtgtcgtt cagcaggcac taacttgaca
attaagtatg aaggtatatg 11340tgtcccctaa gagacacact gtacatagca
aataatctat agatcaaagg gctacgcaac 11400ccctgaatag taacaaaata
caaaatcact aaaaattata aaaacagaaa aatacataaa 11460taggtatacg
tgtcccctaa gagacacatt gtatgtaggt gataagtata gatcaaaggg
11520ccgaataacc cctgaatagt aacaaaatat gaaaatcaat aaaaatcata
aaatagaaaa 11580accataaaca gaagtagttc aaagggctat aaaacccctg
aatagtaaca aaacataaaa 11640ttaataaaaa tcaaatgaat accataattg
gcaaacggaa gagatgtagg tacttaagct 11700tcctaaaagc agccgaactc
actttgagaa gtaggcatag cataccgaac tcttccacga 11760ttctccgaac
ccacagggac gtaggagatg ttattttgtt tttaatattt caaaaaaaaa
11820aaaaaaaaaa aaaaaaaaaa 118407310863DNAJapanese encephalitis
virus 73tttaaacagt tttttagaac ggaagataac catgactaaa aaaccaggag
ggcccggtaa 60aaaccgggct atcaatatgc tgaaacgcgg cctaccccgc gtattcccac
tagtgggagt 120gaagagggta gtaatgagct tgttggacgg cagagggcca
gtacgtttcg tgctggctct 180tatcacgttc ttcaagttta cagcattagc
cccgaccaag gcgcttttag gccgatggaa 240agcagtggaa aagagtgtgg
caatgaaaca tcttactagt ttcaaacgag aacttggaac 300actcattgac
gccgtgaaca agcggggcag aaagcaaaac aaaagaggag gaaatgaagg
360ctcaatcatg tggctcgcga gcttggcagt tgtcatagct tgtgcaggag
ccatgaagtt 420gtcgaatttc caggggaagc ttttgatgac catcaacaac
acggacattg cagacgttat 480cgtgattccc acctcaaaag gagagaacag
atgctgggtc cgggcaatcg acgtcggcta 540catgtgtgag gacactatca
cgtacgaatg tcctaagctt accatgggca atgatccaga 600ggatgtggat
tgctggtgtg acaaccaaga agtctacgtc caatatggac ggtgcacgcg
660gaccaggcat tccaagcgaa gcaggagatc cgtgtcggtc caaacacatg
gggagagttc 720actagtgaat aaaaaagagg cttggctgga ttcaacgaaa
gccacacgat atctcatgaa 780aactgagaac tggatcataa ggaatcctgg
ctatgctttc ctggcggcgg tacttggctg 840gatgcttggc agtaacaacg
gtcaacgcgt ggtatttacc atcctcctgc tgttggtcgc 900tccggcttac
agttttaatt gtctgggaat gggcaatcgt gacttcatag aaggagccag
960tggagccact tgggtggact tggtgctaga aggagatagc tgcttgacaa
tcatggcaaa 1020cgacaaacca acattggacg tccgcatgat taacatcgaa
gctagccaac ttgctgaggt 1080cagaagttac tgctatcatg cttcagtcac
tgacatctcg acggtggctc ggtgccccac 1140gactggagaa gcccacaacg
agaagcgagc tgatagtagc tatgtgtgca aacaaggctt 1200cactgaccgt
gggtggggca acggatgtgg acttttcggg aagggaagca ttgacacatg
1260tgcaaaattc tcctgcacca gtaaagcgat tgggagaaca atccagccag
aaaacatcaa 1320atacgaagtt ggcatttttg tgcatggaac caccacttcg
gaaaaccatg ggaattattc 1380agcgcaagtt ggggcgtccc aggcggcaaa
gtttacagta acacccaatg ctccttcgat 1440aaccctcaaa cttggtgact
acggagaagt cacactggac tgtgagccaa ggagtggact 1500gaacactgaa
gcgttttacg tcatgaccgt ggggtcaaag tcatttctgg tccataggga
1560gtggtttcat gacctcgctc tcccctggac gtccccttcg agcacagcgt
ggagaaacag 1620agaactcctc atggaatttg aaggggcgca cgccacaaaa
cagtccgttg ttgctcttgg 1680gtcacaggaa ggaggcctcc atcaggcgtt
ggcaggagcc atcgtggtgg agtactcaag 1740ctcagtgaag ttaacatcag
gccacctgaa atgtaggctg aaaatggaca aactggctct 1800gaaaggcaca
acctatggca tgtgtacaga aaaattctcg ttcgcgaaaa atccggcgga
1860cactggtcac ggaacagttg tcattgaact ctcctactct gggagtgatg
gcccctgcaa 1920aattccgatt gtttccgttg cgagcctcaa tgacatgacc
cccgttgggc ggctggtgac 1980agtgaacccc ttcgtcgcga cttccagtgc
caactcaaag gtgctggtcg agatggaacc 2040ccccttcgga gactcctaca
tcgtagttgg aaggggagac aagcagatca accaccattg
2100gcacaaagct ggaagcacgc tgggcaaggc cttttcaaca actttgaagg
gagctcaaag 2160actggcagcg ttgggcgaca cagcctggga ctttggctct
attggagggg tcttcaactc 2220cataggaaaa gccgttcacc aagtgtttgg
tggtgccttc agaacactct ttgggggaat 2280gtcttggatc acacaagggc
taatgggtgc cctactgctc tggatgggcg tcaacgcacg 2340agaccgatca
attgctttgg ccttcttagc cacagggggt gtgctcgtgt tcttagcgac
2400caatgtgcat gctgacactg gatgtgccat tgacatcaca agaaaagaga
tgagatgtgg 2460aagtggcatc ttcgtgcaca acgacgtgga agcctgggtg
gataggtata aatatttgcc 2520agaaacgccc agatccctag cgaagatcgt
ccacaaagcg cacaaggaag gcgtgtgcgg 2580agtcagatct gtcactagac
tggagcacca aatgtgggaa gccgtacggg acgaattgaa 2640cgtcctgctc
aaagagaatg cagtggacct cagtgtggtt gtgaacaagc ccgtgggaag
2700atatcgctca gcccctaaac gcctatccat gacgcaagag aagtttgaaa
tgggctggaa 2760agcatgggga aaaagcattc tctttgcccc ggaattggct
aactccacat ttgtcgtaga 2820tggacctgag acaaaggaat gccctgatga
gcacagagct tggaacagca tgcaaatcga 2880agacttcggc tttggcatca
catcaacccg tgtgtggctg aaaattagag aggagagcac 2940tgacgagtgt
gatggagcga tcataggcac ggctgtcaaa ggacatgtgg cagtccatag
3000tgacttgtcg tactggattg agagtcgcta caacgacaca tggaaacttg
agagggcagt 3060ctttggagag gtcaaatctt gcacttggcc agagacacac
accctttggg gagatgatgt 3120tgaggaaagt gaactcatca ttccgcacac
catagccgga ccaaaaagca agcacaatcg 3180gagggaaggg tataagacac
aaaaccaggg accttgggat gagaatggca tagtcttgga 3240ctttgattat
tgcccaggga caaaagtcac cattacagag gattgtggca agagaggccc
3300ttcggtcaga accactactg acagtggaaa gttgatcact gactggtgct
gtcgcagttg 3360ctcccttccg cccctacgat tccggacaga aaatggctgc
tggtacggaa tggaaatcag 3420acctgttagg catgatgaaa caacactcgt
cagatcacag gttgatgctt tcaatggtga 3480aatggttgac ccttttcagc
tgggccttct ggtgatgttt ctggccaccc aggaggtcct 3540tcgcaagagg
tggacggcca gattgaccat tcctgcggtt ttgggggccc tacttgtgct
3600gatgcttggg ggcatcactt acactgattt ggcgaggtat gtggtgctag
tcgctgctgc 3660tttcgcagag gccaacagtg gaggagacgt cctgcacctt
gctttgattg ccgtttttaa 3720gatccaacca gcatttctag tgatgaacat
gcttagcacg agatggacga accaagaaaa 3780cgtggttctg gtcctagggg
ctgccttttt ccaattggcc tcagtagatc tgcaaatagg 3840agtccacgga
atcctgaatg ccgccgctat agcatggatg attgtccgag cgatcacctt
3900ccccacaacc tcctccgtca ccatgccagt cttagcgctt ctaactccgg
ggatgagggc 3960tctataccta gacacttaca gaatcatcct cctcgtcata
gggatttgct ccctgctgca 4020cgagaggaaa aagaccatgg caaaaaagaa
aggagctgta ctcttgggct tagcgctcac 4080atccactgga tggttctcgc
ccaccactat agctgccgga ctaatggtct gcaacccaaa 4140caagaagaga
gggtggccag ctactgagtt tttgtcggca gttggattga tgtttgccat
4200cgtaggtggt ttggccgagt tggatattga atccatgtca atacccttca
tgctggcagg 4260tctcatggca gtgtcctacg tggtgtcagg aaaagcaaca
gatatgtggc ttgaacgggc 4320cgccgacatc agctgggaga tggatgctgc
aatcacagga agcagtcgga ggctggatgt 4380gaaactggat gatgacggag
attttcactt gattgatgat cccggtgttc catggaaggt 4440ctgggtcctg
cgcatgtctt gcattggctt agccgccctc acgccttggg ccatcgttcc
4500cgccgctttc ggttattggc tcactttaaa aacaacaaaa agagggggcg
tgttttggga 4560cacgccatcc ccaaaacctt gctcaaaagg agacaccact
acaggagtct accgaattat 4620ggctagaggg attcttggca cttaccaggc
cggcgtcgga gtcatgtacg agaatgtttt 4680ccacacacta tggcacacaa
ctagaggagc agccattatg agtggagaag gaaaattgac 4740gccatactgg
ggtagtgtga gagaagaccg catagcttac ggaggcccat ggaggtttga
4800ccgaaaatgg aatggaacag atgacgtgca agtgatcgtg gtagaaccgg
ggaaggctgc 4860agtaaacatc cagacaaaac caggagtgtt tcggactccc
ttcggggagg ttggggctgt 4920tagtctggat tacccgcgag gaacatccgg
ctcacccatt ctggattcca atggagacat 4980tataggccta tacggcaatg
gagttgagct tggcgatggc tcatacgtca gcgccatcgt 5040gcagggtgac
cgtcaggagg aaccagtccc agaagcttac accccaaaca tgttgagaaa
5100gagacagatg actgtgctag atttgcaccc tggttcaggg aaaaccagga
aaattctgcc 5160acaaataatt aaggacgcta tccagcagcg cctaagaaca
gctgtgttgg caccgacgcg 5220ggtggtagca gcagaaatgg cagaagcttt
gagagggctc ccagtacgat atcaaacttc 5280agcagtgcag agagagcacc
aagggaatga aatagtggat gtgatgtgcc acgccactct 5340gacccataga
ctgatgtcac cgaacagagt gcccaactac aacctatttg tcatggatga
5400agctcatttc accgacccag ccagtatagc cgcacgagga tacattgcta
ccaaggtgga 5460attaggggag gcagcagcca tctttatgac agcgaccccg
cctggaacca cggatccttt 5520tcctgactca aatgccccaa tccatgattt
gcaagatgag ataccagaca gggcatggag 5580cagtggatac gaatggatca
cagaatatgc gggtaaaacc gtgtggtttg tggcgagcgt 5640aaaaatgggg
aatgagattg caatgtgcct ccaaagagcg gggaaaaagg tcatccaact
5700caaccgcaag tcctatgaca cagaataccc aaaatgtaag aatggagact
gggattttgt 5760cattaccacc gacatctctg aaatgggggc caacttcggt
gcgagcaggg tcatcgactg 5820tagaaagagc gtgaaaccca ccatcttaga
agagggagaa ggcagagtca tcctcggaaa 5880cccatctccc ataaccagtg
caagcgcagc tcaacggagg ggcagagtag gcagaaaccc 5940caaccaagtt
ggagatgaat accactatgg gggggctacc agtgaagatg acagtaacct
6000agcccattgg acagaggcaa agatcatgtt agacaacata cacatgccca
atggactggt 6060ggcccagctc tatggaccag agagggaaaa ggctttcaca
atggatggcg aataccgtct 6120cagaggtgaa gaaaagaaaa acttcttaga
gctgcttagg acggctgacc tcccggtgtg 6180gctggcctac aaggtggcgt
ccaatggcat tcagtacacc gacagaaagt ggtgttttga 6240tgggccgcgt
acgaatgcca tactggagga caacaccgag gtagagatag tcacccggat
6300gggtgagagg aaaatcctca agccgagatg gcttgatgca agagtttatg
cagatcacca 6360agccctcaag tggttcaaag actttgcagc agggaagaga
tcagccgtta gcttcataga 6420ggtgctcggt cgcatgcctg agcatttcat
gggaaagacg cgggaagctt tagacaccat 6480gtacttggtt gcaacggctg
agaaaggtgg gaaagcacac cgaatggctc tcgaagagct 6540gccagatgca
ctggaaacca tcacacttat tgtcgccatt actgtgatga caggaggatt
6600cttcctacta atgatgcagc gaaagggtat agggaagatg ggtcttggag
ctctagtgct 6660cacgctagct accttcttcc tgtgggcggc agaggttcct
ggaaccaaaa tagcagggac 6720cctgctgatc gccctgctgc tgatggtggt
tctcatccca gaaccggaaa aacagaggtc 6780acagacagat aaccaactgg
cggtgtttct catctgtgtc ttgaccgtgg ttggagtggt 6840ggcagcaaac
gagtacggga tgctagaaaa aaccaaagca gatctcaaga gcatgtttgg
6900cggaaagacg caggcatcag gactgactgg attgccaagc atggcactgg
acctgcgtcc 6960agccacagcc tgggcactgt atggggggag cacagtcgtg
ctaacccctc ttctgaagca 7020cctgatcacg tcggaatacg tcaccacatc
gctagcctca attaactcac aagctggctc 7080attattcgtc ttgccacgag
gcgtgccttt taccgaccta gacttgaccg ttggcctcgt 7140cttccttggc
tgttggggtc aaatcaccct cacaacgttt ctgacagcca tggttctggc
7200gacacttcac tatgggtaca tgctccctgg atggcaagca gaagcactca
gggctgccca 7260gagaaggaca gcggctggaa taatgaagaa tgccgttgtt
gacggaatgg tcgccactga 7320tgtgcctgaa ctggaaagga ctactcctct
gatgcaaaag aaagtcggac aggtgctcct 7380cataggggta agcgtggcag
cgttcctcgt caaccctaat gtcaccactg tgagagaagc 7440aggggtgttg
gtgacggcgg ctacgcttac tttgtgggac aatggagcca gtgccgtttg
7500gaattccacc acagccacgg gactctgcca tgtcatgcga ggtagctacc
tggctggagg 7560ctccattgct tggactctca tcaagaacgc tgataagccc
tccttgaaaa ggggaaggcc 7620tgggggcagg acgctagggg agcagtggaa
ggaaaaacta aatgccatga gcagagaaga 7680gttttttaaa taccggagag
aggccataat cgaggtggac cgcactgaag cacgcagggc 7740cagacgtgaa
aataacatag tgggaggaca tccggtttcg cgaggctcag caaaactccg
7800ttggctcgtg gagaaaggat ttgtctcgcc aataggaaaa gtcattgatc
tagggtgtgg 7860gcgtggagga tggagctact acgcagcaac cctgaagaag
gtccaggaag tcagaggata 7920cacgaaaggt ggggcgggac atgaagaacc
gatgctcatg cagagctacg gctggaacct 7980ggtctccctg aagagtggag
tggacgtgtt ttacaaacct tcagagccca gtgacaccct 8040gttctgtgac
ataggggaat cctccccaag tccagaagta gaagaacaac gcacactacg
8100cgtcctagag atgacatctg actggttgca ccgaggacct agagagttct
gcattaaagt 8160tctctgccct tacatgccca aggttataga aaaaatggaa
gttctgcagc gccgcttcgg 8220aggtgggcta gtgcgtctcc ccctgtcccg
aaactccaat cacgagatgt attgggttag 8280tggagccgct ggcaatgtgg
tgcacgctgt gaacatgacc agccaggtac tactggggcg 8340aatggatcgc
acagtgtgga gagggccaaa gtatgaggaa gatgtcaacc tagggagcgg
8400aacaagagcc gtgggaaagg gagaagtcca tagcaatcag gagaaaatca
agaagagaat 8460ccagaagctt aaagaagaat tcgccacaac gtggcacaaa
gaccctgagc atccataccg 8520cacttggaca taccacggaa gctatgaagt
gaaggctact ggctcagcca gctctctcgt 8580caacggagtg gtgaagctca
tgagcaaacc ttgggacgcc attgccaacg tcaccaccat 8640ggccatgact
gacaccaccc cttttggaca gcaaagagtt ttcaaggaga aagttgacac
8700gaaggctcct gagccaccag ctggagccaa ggaagtgctc aacgagacca
ccaactggct 8760gtgggcccac ttgtcacggg aaaaaagacc ccgcttgtgc
accaaggaag aattcataaa 8820gaaagtcaac agcaacgcgg ctcttggagc
agtgttcgct gaacagaatc aatggagcac 8880ggcgcgtgag gctgtggatg
acccgcggtt ttgggagatg gttgatgaag agagggaaaa 8940ccatctgcga
ggagagtgtc acacatgtat ctacaacatg atgggaaaaa gagagaagaa
9000gcctggagag tttggaaaag ctaaaggaag cagggccatt tggttcatgt
ggcttggagc 9060acggtatcta gagtttgaag ctttggggtt cctgaatgaa
gaccattggc tgagccgaga 9120gaattcagga ggtggagtgg aaggctcagg
cgtccaaaag ctgggataca tcctccgtga 9180catagcagga aagcaaggag
ggaaaatgta cgctgatgat accgccgggt gggacactag 9240aattaccaga
actgatttag aaaatgaagc taaggtactg gagctcctag acggtgaaca
9300ccgcatgctc gcccgagcca taattgaact gacttacagg cacaaagtgg
tcaaggtcat 9360gagacctgca gcagaaggaa agaccgtgat ggacgtgata
tcaagagaag atcaaagggg 9420gagtggacag gtggtcactt atgctcttaa
cactttcacg aacatcgctg tccagctcgt 9480caggctgatg gaggctgagg
gggtcattgg accacaacac ttggaacagc tacctaggaa 9540aaacaagata
gctgtcagga cctggctctt tgagaatgga gaggagagag tgaccaggat
9600ggcgatcagc ggagacgact gtgtcgtcaa gccgctggac gacagattcg
ccacagccct 9660ccacttcctc aacgcaatgt caaaggtcag aaaagacatc
caggaatgga agccttcgca 9720tggctggcac gattggcagc aagttccctt
ctgctctaac cattttcagg agattgtgat 9780gaaagatgga aggagtatag
ttgtcccgtg cagaggacag gatgagctga taggcagggc 9840tcgcatctct
ccaggagctg gatggaatgt gaaggacaca gcttgcctgg ccaaagcata
9900tgcacagatg tggctactcc tatacttcca tcgcagggac ttgcgtctca
tggcaaatgc 9960gatttgctca gcagtgccag tggattgggt gcccacaggc
aggacatcct ggtcaataca 10020ctcgaaagga gagtggatga ccacggaaga
catgctgcag gtctggaaca gagtctggat 10080tgaagaaaat gaatggatga
tggacaagac tccaatcaca agctggacag acgttccgta 10140tgtgggaaag
cgtgaggaca tctggtgtgg cagcctcatc ggaacgcgat ccagagcaac
10200ctgggctgag aacatctatg cggcgataaa ccaggttaga gctgtcattg
ggaaagaaaa 10260ttatgttgac tacatgacct cactcaggag atacgaagac
gtcttgatcc aggaagacag 10320ggtcatctag tgtgatttaa ggtagaaaag
tagactatgt aaataatgta aatgagaaaa 10380tgcatgcata tggagtcagg
ccagcaaaag ctgccaccgg atactgggta gacggtgctg 10440cctgcgtctc
agtcccagga ggactgggtt aacaaatctg acaacagaaa gtgagaaagc
10500cctcagaacc gtctcggaag taggtccctg ctcactggaa gttgaaagac
caacgtcagg 10560ccacaaattt gtgccactcc gctagggagt gcggcctgcg
cagccccagg aggactgggt 10620taccaaagcc gttgaggccc ccacggccca
agcctcgtct aggatgcaat agacgaggtg 10680taaggactag aggttagagg
agaccccgtg gaaacaacaa catgcggccc aagccccctc 10740gaagctgtag
aggaggtgga aggactagag gttagaggag accccgcatt tgcatcaaac
10800agcatattga cacctgggaa tagactggga gatcttctgc tctatctcaa
catcagctac 10860tag 108637410977DNAJapanese encephalitis virus
74agaagtttat ctgtgtgaac ttcttggctt agtatcgtag agaagaatcg agagattagt
60gcagtttaaa cagtttttta gaacggaaga taaccatgac taaaaaacca ggagggcccg
120gtaaaaaccg ggctatcaat atgctgaaac gcggcctacc ccgcgtattc
ccactagtgg 180gagtgaagag ggtagtaatg agcttgttgg acggcagagg
gccagtacgt ttcgtgctgg 240ctcttatcac gttcttcaag tttacagcat
tagccccgac caaggcgctt tcaggccgat 300ggaaagcagt ggaaaagagt
gtggcaatga aacatcttac tagtttcaaa cgagaacttg 360gaacactcat
tgacgccgtg aacaagcggg gcagaaagca aaacaaaaga ggaggaaatg
420aaggctcaat catgtggctc gcgagcttgg cagttgtcat agcttgtgca
ggagccatga 480agttgtcgaa tttccagggg aagcttttga tgaccatcaa
caacacggac attgcagacg 540ttatcgtgat tcccacctca aaaggagaga
acagatgctg ggtccgggca atcgacgtcg 600gctacatgtg tgaggacact
atcacgtacg aatgtcctaa gcttaccatg ggcaatgatc 660cagaggatgt
ggattgctgg tgtgacaacc aagaagtcta cgtccaatat ggacggtgca
720cgcggaccag gcattccaag cgaagcagga gatccgtgtc ggtccaaaca
catggggaga 780gttcactagt gaataaaaaa gaggcttggc tggattcaac
gaaagccaca cgatatctca 840tgaaaactga gaactggatc ataaggaatc
ctggctatgc tttcctggcg gcggtacttg 900gctggatgct tggcagtaac
aacggtcaac gcgtggtatt taccatcctc ctgctgttgg 960tcgctccggc
ttacagtttt aattgtctgg gaatgggcaa tcgtgacttc atagaaggag
1020ccagtggagc cacttgggtg gacttggtgc tagaaggaga cagctgcttg
acaatcatgg 1080caaacgacaa accaacattg gacgtccgca tgattaacat
cgaagctagc caacttgctg 1140aggtcagaag ttactgctat catgcttcag
tcactgacat ctcgacggtg gctcggtgcc 1200ccacgactgg agaagcccac
aacgagaagc gagctgatag tagctatgtg tgcaaacaag 1260gcttcactga
ccgtgggtgg ggcaacggat gtggattttt cgggaaggga agcattgaca
1320catgtgcaaa attctcctgc accagtaaag cgattgggag aacaatccag
ccagaaaaca 1380tcaaatacaa agttggcatt tttgtgcatg gaaccaccac
ttcggaaaac catgggaatt 1440attcagcgca agttggggcg tcccaggcgg
caaagtttac agtaacaccc aatgctcctt 1500cggtagccct caaacttggt
gactacggag aagtcacact ggactgtgag ccaaggagtg 1560gactgaacac
tgaagcgttt tacgtcatga ccgtggggtc aaagtcattt ctggtccata
1620gggagtggtt tcatgacctc gctctcccct ggacgtcccc ttcgagcaca
gcgtggagaa 1680acagagaact cctcatggaa tttgaagggg cgcacgccac
aaaacagtcc gttgttgctc 1740ttgggtcaca ggaaggaggc ctccatcatg
cgttggcagg agccatcgtg gtggagtact 1800caagctcagt gatgttaaca
tcaggccacc tgaaatgtag gctgaaaatg gacaaactgg 1860ctctgaaagg
cacaacctat ggcatgtgta cagaaaaatt ctcgttcgcg aaaaatccgg
1920tggacactgg tcacggaaca gttgtcattg aactctccta ctctgggagt
gatggcccct 1980gcaaaattcc gattgtttcc gttgcgagcc tcaatgacat
gacccccgtt gggcggctgg 2040tgacagtgaa ccccttcgtc gcgacttcca
gtgccaactc aaaggtgctg gtcgagatgg 2100aacccccctt cggagactcc
tacatcgtag ttggaagggg agacaagcag atcaaccacc 2160attggcacaa
agctggaagc acgctgggca aggccttttc aacaactttg aagggagctc
2220aaagactggc agcgttgggc gacacagcct gggactttgg ctctattgga
ggggtcttca 2280actccatagg aagagccgtt caccaagtgt ttggtggtgc
cttcagaaca ctctttgggg 2340gaatgtcttg gatcacacaa gggctaatgg
gtgccctact gctctggatg ggcgtcaacg 2400cacgagaccg atcaattgct
ttggccttct tagccacagg aggtgtgctc gtgttcttag 2460cgaccaatgt
gcatgctgac actggatgtg ccattgacat cacaagaaaa gagatgagat
2520gtggaagtgg catcttcgtg cacaacgacg tggaagcctg ggtggatagg
tataaatatt 2580tgccagaaac gcccagatcc ctagcgaaga tcgtccacaa
agcgcacaag gaaggcgtgt 2640gcggagtcag atctgtcact agactggagc
accaaatgtg ggaagccgta agggacgaat 2700tgaacgtcct gctcaaagag
aatgcagtgg acctcagtgt ggttgtgaac aagcccgtgg 2760gaagatatcg
ctcagcccct aaacgcctat ccatgacgca agagaagttt gaaatgggct
2820ggaaagcatg gggaaaaagc atcctctttg ccccggaatt ggctaactcc
acatttgtcg 2880tagatggacc tgagacaaag gaatgccctg atgagcacag
agcttggaac agcatgcaaa 2940tcgaagactt cggctttggc atcacatcaa
cccgtgtgtg gctgaaaatt agagaggaga 3000gcactgacga gtgtgatgga
gcgatcatag gcacggctgt caaaggacat gtggcagtcc 3060atagtgactt
gtcgtactgg attgagagtc gctacaacga cacatggaaa cttgagaggg
3120cagtctttgg agaggtcaaa tcttgcactt ggccagagac acacaccctt
tggggagatg 3180atgttgagga aagtgaactc atcattccgc acaccatagc
cggaccaaaa agcaagcaca 3240atcggaggga agggtataag acacaaaacc
agggaccttg ggatgagaat ggcatagtct 3300tggactttga ttattgccca
gggacaaaag tcaccattac agaggattgt agcaagagag 3360gcccttcggt
cagaaccact actgacagtg gaaagttgat cactgactgg tgctgtcgca
3420gttgctccct tccgccccta cgattccgga cagaaaatgg ctgctggtac
ggaatggaaa 3480tcagacctgt tatgcatgat gaaacaacac tcgtcagatc
acaggttcat gctttcaaag 3540gtgaaatggt tgaccctttt cagctgggcc
ttctggtgat gtttctggcc acccaggaag 3600tccttcgcaa gaggtggacg
gccagattga ccattcctgc ggttttgggg gtcctacttg 3660tgctgatgct
tgggggtatc acttacactg atttggcgag gtatgtggtg ctagtcgctg
3720ctgctttcgc agaggccaac agtggaggag acgtcctgca ccttgctttg
attgctgttt 3780ttaagatcca accagcattt ttagtgatga acatgcttag
cacgagatgg acgaaccaag 3840aaaacgtggt tctggtccta ggggctgcct
ttttccaatt ggcctcagta gatctgcaaa 3900taggagtcca cggaatcctg
aatgccgccg ctatagcatg gatgattgtc cgagcgatca 3960ccttccccac
aacctcctcc gtcaccatgc cagtcttagc gcttctaact ccggggatga
4020gggctctata cctagacact tacagaatca tcctcctcgt catagggatt
tgctccctgc 4080tgcacgagag gaaaaagacc atggcgaaaa agaaaggagc
tgtactcttg ggcttagcgc 4140tcacatccac tggatggttc tcgcccacca
ctatagctgc cggactaatg gtctgcaacc 4200caaacaagaa gagagggtgg
ccagctactg agtttttgtc ggcagttgga ttgatgtttg 4260ccatcgtagg
tggtttggcc gagttggata ttgaatccat gtcaataccc ttcatgctgg
4320caggtctcat ggcagtgtcc tacgtggtgt caggaaaagc aacagatatg
tggcttgaac 4380gggccgccga catcagctgg gatatgggtg ctgcaatcac
aggaagcagt cggaggctgg 4440atgtgaaact ggatgatgac ggagattttc
acttgattga tgatcccggt gttccatgga 4500aggtctgggt cctgcgcatg
tcttgcattg gcttagccgc cctcacgcct tgggccatcg 4560ttcccgccgc
tttcggttat tggctcactt taaaaacaac aaaaagaggg ggcgtgtttt
4620gggacacgcc atccccaaaa ccttgctcaa aaggagacac cactacagga
gtctaccgaa 4680ttatggctag agggattctt ggcacttacc aggccggcgt
cggagtcatg tacgagaatg 4740ttttccacac actatggcac acaactagag
gagcagccat tgtgagtgga gaaggaaaat 4800tgacgccata ctggggtagt
gtgaaagaag accgcatagc ttacggaggc ccatggaggt 4860ttgaccgaaa
atggaatgga acagatgacg tgcaagtgat cgtggtagaa ccggggaagg
4920gcgcagtaaa catccagaca aaaccaggag tgtttcggac tcccttcggg
gaggttgggg 4980ctgttagtct ggattacccg cgaggaacat ccggctcacc
cattctggat tccaatggag 5040acattatagg cctatacggc aatggagttg
agcttggcga tggctcatac gtcagcgcca 5100tcgtgcaggg tgaccgtcag
gaggaaccag tcccagaagc ttacacccca aacatgttga 5160gaaagagaca
gatgactgtg ctagatttgc accctggttc agggaaaacc aggaaaattc
5220tgccacaaat aattaaggac gctatccagc agcgcctaag aacagctgtg
ttggcaccga 5280cgcgggtggt agcagcagaa atggcagaag ctttgagagg
gctcccagta cgatatcaaa 5340cttcagcagt gcagagagag caccaaggga
atgaaatagt ggatgtgatg tgccacgcca 5400ctctgaccca tagactgatg
tcaccgaaca gagtgcccaa ctacaaccta tttgtcatgg 5460atgaagctca
tttcaccgac ccagccagta tagccgcacg aggatacatt gctaccaagg
5520tggaattagg ggaggcagca gccatcttta tgacagcgac cccgcctgga
accacggatc 5580cttttcctga ctcaaatgcc ccaatccatg atttgcaaga
tgagatacca gacagggcat 5640ggagcagtgg atacgaatgg atcacagaat
atgcgggtaa aaccgtgtgg tttgtggcga 5700gcgtaaaaat ggggaatgag
attgcaatgt gcctccaaag agcggggaaa aaggtcatcc 5760aactcaaccg
caagtcctat gacacagaat acccaaaatg taagaatgga gactgggatt
5820ttgtcattac caccgacatc tctgaaatgg gggccaactt cggtgcgagc
agggtcatcg 5880actgtagaaa gagcgtgaaa cccaccatct tagaagaggg
agaaggcaga gtcatcctcg 5940gaaacccatc tcccataacc agtgcaagcg
cagctcaacg gaggggcaga gtaggcagaa 6000accccaatca agttggagat
gaataccact atgggggggc taccagtgaa gatgacagta 6060acctagccca
ttggacagag gcaaagatca tgttagacaa catacacatg cccaatggac
6120tggtggccca gctctatgga ccagagaggg aaaaggcttt cacaatggat
ggcgaatacc 6180gtctcagagg tgaagaaaag
aaaaacttct tagagctgct taggacggct gacctcccgg 6240tgtggctggc
ctacaaggtg gcgtccaatg gcattcagta caccgacaga aagtggtgtt
6300ttgatgggcc gcgtacgaat gccatactgg aggacaacac cgaggtagag
atagtcaccc 6360ggatgggtga gaggaaaatc ctcaagccga gatggcttga
tgcaagagtt tatgcagatc 6420accaggccct caagtggttc aaagactttg
cagcagggaa gagatcagcc gttagcttca 6480tagaggtgct cggtcgcatg
cctgagcatt tcatgggaaa gacgcgggaa gctttagaca 6540ccatgtactt
ggttgcaacg gctgagaaag gtgggaaagc acaccgaatg gctctcgaag
6600agctgccaga tgcactggaa accatcacac ttattgtcgc cattactgtg
atgacaggag 6660gattcttcct actaatgatg cagcgaaagg gtatagggaa
gatgggtctt ggagctctag 6720tgctcacact agctaccttc ttcctgtggg
cggcagaggt tcctggaacc aaaatagcag 6780ggaccctgct gatcgccctg
ctgctgatgg tggttctcat cccagaaccg gaaaaacaga 6840ggtcacagac
agataaccaa ctggcggtgt ttctcatctg tgtcttgacc gtggttggag
6900tggtggcagc aaacgagtac gggatgctag aaaaaaccaa agcggatctc
aagagcatgt 6960ttggcggaaa gacgcaggca tcaggactga ctggattgcc
aagcatggca ctggacctgc 7020gtccagccac agcctgggca ctgtatgggg
ggagcacagt cgtgctaacc cctcttctga 7080agcacctgat cacgtcggaa
tacgtcacca catcgctagc ttcaattaac tcacaagctg 7140gctcattatt
cgtcttgcca cgaggcgtgc cttttaccga cctagacttg actgttggcc
7200tcgtcttcct tggctgttgg ggtcaagtca ccctcacaac gtttctgaca
gccatggttc 7260tggcgacact tcactatggg tacatgctcc ctggatggca
agcagaagca ctcagggctg 7320cccagagaag gacagcggct ggaataatga
agaatgccgt tgttgacgga atggtcgcca 7380ctgatgtgcc tgaactggaa
aggactactc ctctgatgca aaagaaagtc ggacaggtgc 7440tcctcatagg
ggtaagcgtg gcagcgttcc tcgtcaaccc taatgtcacc actgtgagag
7500aagcaggggt gttggtgacg gcggctacgc ttactttgtg ggacaatgga
gccagtgccg 7560tttggaattc caccacagcc acgggactct gccatgtcat
gcgaggtagc tacctggctg 7620gaggctccat tgcttggact ctcatcaaga
acgctgataa gccctccttg aaaaggggaa 7680ggcctggggg caggacgcta
ggggagcagt ggaaggaaaa actaaatgcc atgagtagag 7740aagagttttt
taaataccgg agagaggcca taatcgaggt ggaccgcact gaagcacgca
7800gggccagacg tgaaaataac atagtgggag gacatccggt ttcgcgaggc
tcagcaaaac 7860tccgttggct cgtggagaaa ggatttgtct cgccaatagg
aaaagtcatt gatctagggt 7920gtgggcgtgg aggatggagc tactacgcag
caaccctgaa gaaggtccag gaagtcagag 7980gatacacgaa aggtggggcg
ggacatgaag aaccgatgct catgcagagc tacggctgga 8040acctggtctc
cctgaagagt ggagtggacg tgttttacaa accttcagag cccagtgata
8100ccctgttctg tgacataggg gaatcctccc caagtccaga agtagaagaa
caacgcacac 8160tacgcgtcct agagatgaca tctgactggt tgcaccgagg
acctagagag ttctgcatta 8220aagttctctg cccttacatg cccaaggtta
tagaaaaaat ggaagttctg cagcgtcgct 8280tcggaggtgg gctagtgcgt
ctccccctgt cccgaaactc caatcacgag atgtattggg 8340ttagtggagc
cgctggcaat gtggtgcacg ctgtgaacat gaccagccag gtattactgg
8400ggcgaatgga tcgcacagtg tggagagggc caaagtatga ggaagatgtc
aacctaggga 8460gcggaacaag agccgtggga aagggagaag tccatagcaa
tcaggagaaa atcaagaaga 8520gaatccagaa gcttaaagaa gaattcgcca
caacgtggca caaagaccct gagcatccat 8580accgcacttg gacataccac
ggaagctatg aagtgaaggc tactggctca gccagctctc 8640tcgtcaacgg
agtggtgaag ctcatgagca aaccttggga cgccattgcc aacgtcacca
8700ccatggccat gactgacacc accccttttg gacagcaaag agttttcaag
gagaaagttg 8760acacgaaggc tcctgagcca ccagctggag ccaaggaagt
gctcaacgag accaccaact 8820ggctgtgggc ctacttgtca cgggaaaaaa
gaccccgctt gtgcaccaag gaagaattca 8880ttaagaaagt taacagcaac
gcggctcttg gagcagtgtt cgctgaacag aatcaatgga 8940gcacggcgcg
tgaggctgtg gatgacccgc ggttttggga gatggttgat gaagagaggg
9000aaaaccatct gcgaggagag tgtcacacat gtatctacaa catgatggga
aaaagagaga 9060agaagcctgg agagtttgga aaagctaaag gaagcagggc
catttggttc atgtggcttg 9120gagcacggta tctagagttt gaagctttgg
ggttcctgaa tgaagaccat tggctgagcc 9180gagagaattc aggaggtgga
gtggaaggct caggcgtcca aaagctggga tacatcctcc 9240gtgacatagc
aggaaagcaa ggagggaaaa tgtacgctga tgataccgcc gggtgggaca
9300ctagaattac cagaactgat ttagaaaatg aagctaaggt actggagctc
ctagacggtg 9360aacaccgcat gctcgcccga gccataattg aactgactta
caggcacaaa gtggtcaagg 9420tcatgagacc tgcagcagaa ggaaagaccg
tgatggacgt gatatcaaga gaagatcaaa 9480gggggagtgg acaggtggtc
acttatgctc ttaacacttt cacgaacatc gctgtccagc 9540tcgtcaggct
gatggaggct gagggggtca ttggaccaca acacttggaa catctaccta
9600ggaaaaacaa gatagctgtc aggacctggc tctttgagaa tggagaggag
agagtgacca 9660ggatggcgat cagcggagac gactgtgccg tcaaaccgct
ggacgacaga ttcgccacag 9720ccctccactt cctcaacgca atgtcaaagg
tcagaaaaga catccaggaa tggaagcctt 9780cgcatggctg gcacgattgg
cagcaagttc ccttctgttc taaccatttt caggagattg 9840tgatgaaaga
tggaaggagt atagttgtcc cgtgcagagg acaggatgag ctgataggca
9900gggctcgcat ctctcctgga gctggatgga atgtgaagga cacagcttgc
ctggccaaag 9960catatgcaca gatgtggcta ctcctatact tccatcgcag
ggacttgcgt ctcatggcaa 10020atgcgatttg ctcagcagtg ccagtagatt
gggtgcccac aggcaggaca tcctggtcaa 10080tacactcgaa aggagagtgg
atgaccacgg aagacatgct gcaggtctgg aacagagttt 10140ggattgaaga
aaatgaatgg atgatggaca agactccaat cacaagctgg acagacgttc
10200cgtatgtggg aaagcgcgag gacatctggt gtggcagcct catcggaacg
cgatccagag 10260caacctgggc tgagaacatc tatgcggcga taaaccaggt
tagagctgtc attgggaaag 10320aaaattatgt tgactacatg acctcactca
ggagatacga agacgtcttg atccaggaag 10380acagggtcat ctagtgtgat
ttaaggtaga aaagtagact atgtaaacaa tgtaaatgag 10440aaaatgcatg
catatggagt caggccagca aaagctgcca ccggatactg ggtagacggt
10500gctgcctgcg tctcagtccc aggaggactg ggttaacaaa tctgacaaca
gaaagtgaga 10560aagccctcag aaccgtctcg gaagtaggtc cctgctcact
ggaagttgaa agaccaacgt 10620caggccacaa atttgtgcca ctccgctagg
gagtgcggcc tgcgcagccc caggaggact 10680gggttaccaa agccgttgag
gcccccacgg cccaagcctc gtctaggatg caatagacga 10740ggtgtaagga
ctagaggtta gaggagaccc cgtggaaaca acaacatgcg gcccaagccc
10800cctcgaagct gtagaggagg tggaaggact agaggttaga ggagaccccg
catttgcatc 10860aaacagcata ttgacacctg ggaatagact gggagatctt
ctgctctatc tcaacatcag 10920ctactaggca cagagcgccg aagtatgtag
ctggtggtga ggaagaacac aggatct 109777510976DNAJapanese encephalitis
virus 75agaagtttat ctgtgtgaac ttcttggctt agtatcgtag agaagaatcg
agagattagt 60gcagtttaaa cagtttttta gaacggaaga taaccatgac taaaaaacca
ggagggcccg 120gtaaaaaccg ggctatcaat atgctgaaac gcggcctacc
ccgcgtattc ccactagtgg 180gagtgaagag ggtagtaatg agcttgttgg
acggcagagg gccagtacgt ttcgtgctgg 240ctcttatcac gttcttcaag
tttacagcat tagccccgac caaggcgctt tcaggccgat 300ggaaagcagt
ggaaaagagt gtggcaatga aacatcttac tagtttcaaa cgagaacttg
360gaacactcat tgacgccgtg aacaagcggg gcagaaagca aaacaaaaga
ggaggaaatg 420aaggctcaat catgtggctc gcgagcttgg cagttgtcat
agcttgtgca ggagccatga 480agttgtcgaa tttccagggg aagcttttga
tgaccatcaa caacacggac attgcagacg 540ttatcgtgat tcccacctca
aaaggagaga acagatgctg ggtccgggca atcgacgtcg 600gctacatgtg
tgaggacact atcacgtacg aatgtcctaa gcttaccatg ggcaatgatc
660cagaggatgt ggattgctgg tgtgacaacc aagaagtcta cgtccaatat
ggacggtgca 720cgcggaccag gcattccaag cgaagcagga gatccgtgtc
ggtccaaaca catggggaga 780gttcactagt gaataaaaaa gaggcttggc
tggattcaac gaaagccaca cgatatctca 840tgaaaactga gaactggatc
ataaggaatc ctggctatgc tttcctggcg gcggtacttg 900gctggatgct
tggcagtaac aacggtcaac gcgtggtatt taccatcctc ctgctgttgg
960tcgctccggc ttacagtttt aattgtctgg gaatgggcaa tcgtgacttc
atagaaggag 1020ccagtggagc cacttgggtg gacttggtgc tagaaggaga
cagctgcttg acaatcatgg 1080caaacgacaa accaacattg gacgtccgca
tgattaacat cgaagctagc caacttgctg 1140aggtcagaag ttactgctat
catgcttcag tcactgacat ctcgacggtg gctcggtgcc 1200ccacgactgg
agaagcccac aacgagaagc gagctgatag tagctatgtg tgcaaacaag
1260gcttcactga ccgtgggtgg ggcaacggat gtggattttt cgggaaggga
agcattgaca 1320catgtgcaaa attctcctgc accagtaaag cgattgggag
aacaatccag ccagaaaaca 1380tcaaatacaa agttggcatt tttgtgcatg
gaaccaccac ttcggaaaac catgggaatt 1440attcagcgca agttggggcg
tcccaggcgg caaagtttac agtaacaccc aatgctcctt 1500cggtagccct
caaacttggt gactacggag aagtcacact ggactgtgag ccaaggagtg
1560gactgaacac tgaagcgttt tacgtcatga ccgtggggtc aaagtcattt
ctggtccata 1620gggagtggtt tcatgacctc gctctcccct ggacgtcccc
ttcgagcaca gcgtggagaa 1680acagagaact cctcatggaa tttgaagggg
cgcacgccac aaaacagtcc gttgttgctc 1740ttgggtcaca ggaaggaggc
ctccatcatg cgttggcagg agccatcgtg gtggagtact 1800caagctcagt
gatgttaaca tcaggccacc tgaaatgtag gctgaaaatg gacaaactgg
1860ctctgaaagg cacaacctat ggcatgtgta cagaaaaatt ctcgttcgcg
aaaaatccgg 1920tggacactgg tcacggaaca gttgtcattg aactctccta
ctctgggagt gatggcccct 1980gcaaaattcc gattgtttcc gttgcgagcc
tcaatgacat gacccccgtt gggcggctgg 2040tgacagtgaa ccccttcgtc
gcgacttcca gtgccaactc aaaggtgctg gtcgagatgg 2100aacccccctt
cggagactcc tacatcgtag ttggaagggg agacaagcag atcaaccacc
2160attggcacaa agctggaagc acgctgggca aggccttttc aacaactttg
aagggagctc 2220aaagactggc agcgttgggc gacacagcct gggactttgg
ctctattgga ggggtcttca 2280actccatagg aagagccgtt caccaagtgt
ttggtgatgc cttcagaaca ctctttgggg 2340gaatgtcttg gatcacacaa
gggctaatgg gtgccctact gctctggatg ggcgtcaacg 2400cacgagaccg
atcaattgct ttggccttct tagccacagg aggtgtgctc gtgttcttag
2460cgaccaatgt gcatgctgac actggatgtg ccattgacat cacaagaaaa
gagatgagat 2520gtggaagtgg catcttcgtg cacaacgacg tggaagcctg
ggtggatagg tataaatatt 2580tgccagaaac gcccagatcc ctagcgaaga
tcgtccacaa agcgcacaag gaaggcgtgt 2640gcggagtcag atctgtcact
agactggagc accaaatgtg ggaagccgta agggacgaat 2700tgaacgtcct
gctcaaagag aatgcagtgg acctcagtgt ggttgtgaac aagcccgtgg
2760gaagatatcg ctcagcccct aaacgcctat ccatgacgca agagaagttt
gaaatgggct 2820ggaaagcatg gggaaaaagc atcctctttg ccccggaatt
ggctaactcc acatttgtcg 2880tagatggacc tgagacaaag gaatgccctg
atgagcacag agcttggaac agcatgcaaa 2940tcgaagactt cggctttggc
atcacatcaa cccgtgtgtg gctgaaaatt agagaggaga 3000gcactgacga
gtgtgatgga gcgatcatag gcacggctgt caaaggacat gtggcagtcc
3060atagtgactt gtcgtactgg attgagagtc gctacaacga cacatggaaa
cttgagaggg 3120cagtctttgg agaggtcaaa tcttgcactt ggccagagac
acacaccctt tggggagatg 3180atgttgagga aagtgaactc atcattccgc
acaccatagc cggaccaaaa agcaagcaca 3240atcggaggga agggtataag
acacaaaacc agggaccttg ggatgagaat ggcatagtct 3300tggactttga
ttattgccca gggacaaaag tcaccattac agaggattgt agcaagagag
3360gcccttcggt cagaaccact actgacagtg gaaagttgat cactgactgg
tgctgtcgca 3420gttgctccct tccgccccta cgattccgga cagaaaatgg
ctgctggtac ggaatggaaa 3480tcagacctgt tatgcatgat gaaacaacac
tcgtcagatc acaggttcat gctttcaaag 3540gtgaaatggt tgaccctttt
cagctgggcc ttctggtgat gtttctggcc acccaggaag 3600tccttcgcaa
gaggtggacg gccagattga ccattcctgc ggttttgggg gtcctacttg
3660tgctgatgct tgggggtatc acttacactg atttggcgag gtatgtggtg
ctagtcgctg 3720ctgctttcgc agaggccaac agtggaggag acgtcctgca
ccttgctttg attgctgttt 3780ttaagatcca accagcattt ttagtgatga
acatgcttag cacgagatgg acgaaccaag 3840aaaacgtggt tctggtccta
ggggctgcct ttttccaatt ggcctcagta gatctgcaaa 3900taggagtcca
cggaatcctg aatgccgccg ctatagcatg gatgattgtc cgagcgatca
3960ccttccccac aacctcctcc gtcaccatgc cagtcttagc gcttctaact
ccggggatga 4020gggctctata cctagacact tacagaatca tcctcctcgt
catagggatt tgctccctgc 4080tgcacgagag gaaaaagacc atggcgaaaa
agaaaggagc tgtactcttg ggcttagcgc 4140tcacatccac tggatggttc
tcgcccacca ctatagctgc cggactaatg gtctgcaacc 4200caaacaagaa
gagagggtgg ccagctactg agtttttgtc ggcagttgga ttgatgtttg
4260ccatcgtagg tggtttggcc gagttggata ttgaatccat gtcaataccc
ttcatgctgg 4320caggtctcat ggcagtgtcc tacgtggtgt caggaaaagc
aacagatatg tggcttgaac 4380gggccgccga catcagctgg gatatgggtg
ctgcaatcac aggaagcagt cggaggctgg 4440atgtgaaact ggatgatgac
ggagattttc acttcattga tgatcccggt gttccatgga 4500aggtctgggt
cctgcgcatg tcttgcattg gcttagccgc cctcacgcct tgggccatcg
4560ttcccgccgc tttcggttat tggctcactt taaaaacaac aaaaagaggg
ggcgtgtttt 4620gggacacgcc atccccaaaa ccttgctcaa aaggagacac
cactacagga gtctaccgaa 4680ttatggctag agggattctt ggcacttacc
aggccggcgt cggagtcatg tacgagaatg 4740ttttccacac actatggcac
acaactagag gagcagccat tgtgagtgga gaaggaaaat 4800tgacgccata
ctggggtagt gtgaaagaag accgcatagc ttacggaggc ccatggaggt
4860ttgaccgaaa atggaatgga acagatgacg tgcaagtgat cgtggtagaa
ccggggaagg 4920gcgcagtaaa catccagaca aaaccaggag tgtttcggac
tcccttcggg gaggttgggg 4980ctgttagtct ggattacccg cgaggaacat
ccggctcacc cattctggat tccaatggag 5040acattatagg cctatacggc
aatggagttg agcttggcga tggctcatac gtcagcgcca 5100tcgtgcaggg
tgaccgtcag gaggaaccag tcccagaagc ttacacccca aacatgttga
5160gaaagagaca gatgactgtg ctagatttgc accctggttc agggaaaacc
aggaaaattc 5220tgccacaaat aattaaggac gctatccagc agcgcctaag
aacagctgtg ttggcaccga 5280cgcgggtggt agcagcagaa atggcagaag
ttttgagagg gctcccagta cgatatcaaa 5340cttcagcagt gcagagagag
caccaaggga atgaaatagt ggatgtgatg tgccacgcca 5400ctctgaccca
tagactgatg tcaccgaaca gagtgcccaa ctacaaccta tttgtcatgg
5460atgaagctca tttcaccgac ccagccagta tagccgcacg aggatacatt
gctaccaagg 5520tggaattagg ggaggcagca gccatcttta tgacagcgac
cccgcctgga accacggatc 5580cttttcctga ctcaaatgcc ccaatccatg
atttgcaaga tgagatacca gacagggcat 5640ggagcagtgg atacgaatgg
atcacagaat atgcgggtaa aaccgtgtgg tttgtggcga 5700gcgtaaaaat
ggggaatgag attgcaatgt gcctccaaag agcggggaaa aaggtcatcc
5760aactcaaccg caagtcctat gacacagaat acccaaaatg taagaatgga
gactgggatt 5820ttgtcattac caccgacatc tctgaaatgg gggccaactt
cggtgcgagc agggtcatcg 5880actgtagaaa gagcgtgaaa cccaccatct
tagaagaggg agaaggcaga gtcatcctcg 5940gaaacccatc tcccataacc
agtgcaagcg cagctcaacg gaggggcaga gtaggcagaa 6000accccaatca
agttggagat gaataccact atgggggggc taccagtgaa gatgacagta
6060acctagccca ttggacagag gcaaagatca tgttagacaa catacacatg
cccaatggac 6120tggtggccca gctctatgga ccagagaggg aaaaggcttt
cacaatggat ggcgaatacc 6180gtctcagagg tgaagaaaag aaaaacttct
tagagctgct taggacggct gacctcccgg 6240tgtggctggc ctacaaggtg
gcgtccaatg gcattcagta caccgacaga aagtggtgtt 6300ttgatgggcc
gcgtacgaat gccatactgg aggacaacac cgaggtagag atagtcaccc
6360ggatgggtga gaggaaaatc ctcaagccga gatggcttga tgcaagagtt
tatgcagatc 6420accaggccct caagtggttc aaagactttg cagcagggaa
gagatcagcc gttagcttca 6480tagaggtgct cggtcgcatg cctgagcatt
tcatgggaaa gacgcgggaa gctttagaca 6540ccatgtactt ggttgcaacg
gctgagaaag gtgggaaagc acaccgaatg gctctcgaag 6600agctgccaga
tgcactggaa accatcacac ttattgtcgc cattactgtg atgacaggag
6660gattcttcct actaatgatg cagcgaaagg gtatagggaa gatgggtctt
ggagctctag 6720tgctcacact agctaccttc ttcctgtggg cggcagaggt
tcctggaacc aaaatagcag 6780ggaccctgct gatcgccctg ctgctgatgg
tggttctcat cccagaaccg gaaaaacaga 6840ggtcacagac agataaccaa
ctggcggtgt ttctcatctg tgtcttgacc gtggttggag 6900tggtggcagc
aaacgagtac gggatgctag aaaaaaccaa agcggatctc aagagcatgt
6960ttggcggaaa gacgcaggca tcaggactga ctggattgcc aagcatggca
ctggacctgc 7020gtccagccac agcctgggca ctgtatgggg ggagcacagt
cgtgctaacc cctcttctga 7080agcacctgat cacgtcggaa tacgtcacca
catcgctagc ttcaattaac tcacaagctg 7140gctcattatt cgtcttgcca
cgaggcgtgc cttttaccga cctagacttg actgttggcc 7200tcgtcttcct
tggctgttgg ggtcaagtca ccctcacaac gtttctgaca gccatggttc
7260tggcgacact tcactatggg tacatgctcc ctggatggca agcagaagca
ctcagggctg 7320cccagagaag gacagcggct ggaataatga agaatgccgt
tgttgacgga atggtcgcca 7380ctgatgtgcc tgaactggaa aggactactc
ctctgatgca aaagaaagtc ggacaggtgc 7440tcctcatagg ggtaagcgtg
gcagcgttcc tcgtcaaccc taatgtcacc actgtgagag 7500aagcaggggt
gttggtgacg gcggctacgc ttactttgtg ggacaatgga gccagtgccg
7560tttggaattc caccacagcc acgggactct gccatgtcat gcgaggtagc
tacctggctg 7620gaggctccat tgcttggact ctcatcaaga acgctgataa
gccctccttg aaaaggggaa 7680ggcctggggg caggacgcta ggggagcagt
ggaaggaaaa actaaatgcc atgagtagag 7740aagagttttt taaataccgg
agagagggca taatcgaggt ggaccgcact gaagcacgca 7800gggccagaag
tgaaaataac atagtgggag gacatccggt ttcgcgaggc tcagcaaaac
7860tccgttggct tgtggagaaa ggatttgtct cgccaatagg aaaagtcatt
gatctagggt 7920gtgggcgtgg aggatggagc tactacgcag caaccctgaa
gaaggtccag gaagtcagag 7980gatacacgaa aggtggggcg ggacatgaag
aaccgatgct catgcagagc tacggctgga 8040acctggtctc cctgaagagt
ggagtggacg tgttttacaa accttcagag cccagtgata 8100ccctgttctg
tgacataggg gaatcctccc caagtccaga agtagaagaa caacgcacac
8160tacgcgtcct agagatgaca tctgactggt tgcaccgagg acctagagag
ttctgcatta 8220aagttctctg cccttacatg cccaaggtta tagaaaaaat
tgaagttctg cagcgccgct 8280tcggaggtgg gctagtgcgt ctccccctgt
cccgaaactc caatcacgag atgtattggg 8340ttagtggagc cgctggcaat
gtggtgcacg ctgtgaacat gaccagccag gtattactgg 8400ggcgaatgga
tcgcacagtg tggagagggc caaagtatga ggaagatgtc aacctaggga
8460gcggaacaag agccgtggga aagggagaag tccatagcaa tcaggagaaa
atcaagaaga 8520gaatccagaa gcttaaagaa gaattcgcca caacgtggca
caaagaccct gagcatccat 8580accgcacttg gacataccac ggaagctatg
aagtgaaggc tactggctca gccagctctc 8640tcgtcaacgg agtggtgaag
ctcatgagca aaccttggga cgccattgcc aacgtcacca 8700ccatggccat
gactgacacc accccttttg gacagcaaag agttttcaag gagaaagttg
8760acacgaaggc tcctgagcca ccagctggag ccaaggaagt gctcaacgag
accaccaact 8820ggctgtgggc ctacttgtca cgggaaaaaa gaccccgctt
gtgcaccaag gaagaattca 8880ttaagaaagt taacagcaac gcggctcttg
gagcagtgtt cgctgaacag aatcaatgga 8940gcacggcgcg tgaggctgtg
gatgacccgc ggttttggga gatggttgat gaagagaggg 9000aaaaccatct
gcgaggagag tgtcacacat gtatctacaa catgatggga aaaagagaga
9060agaagcctgg agagtttgga aaagctaaag gaagcagggc catttggttc
atgtggcttg 9120gagcacggta tctagagttt gaagctttgg ggttcctgaa
tgaagaccat tggctgagcc 9180gagagaattc aggaggtgga gtggaaggct
caggcgtcca aaagctggga tacatcctcc 9240gtgacatagc aggaaagcaa
ggagggaaaa tgtacgctga tgataccgcc gggtgggaca 9300ctagaattac
cagaactgat ttagaaaatg aagctaaggt actggagctc ctagacggtg
9360aacaccgcat gctcgcccga gccataattg aactgactta caggcacaaa
gtggtcaagg 9420tcatgagacc tgcagcagaa ggaaagaccg tgatggacgt
gatatcaaga gaagatcaaa 9480gggggagtgg acaggtggtc acttatgctc
ttaacacttt cacgaacatc gctgtccagc 9540tcgtcaggct gatggaggct
gagggggtca ttggaccaca acacttggaa catctaccta 9600ggaaaaacaa
gatagctgtc aggacctggc tctttgagaa tggagaggag agagtgacca
9660ggatggcgat cagcggagac gactgtgccg tcaaaccgct ggacgacaga
ttcgccacag 9720ccctccactt cctcaacgca atgtcaaagg tcagaaaaga
catccaggaa tggaagcctt 9780cgcatggctg gcacgattgg cagcaagttc
ccttctgttc taaccatttt caggagattg 9840tgatgaaaga tggaaggagt
atagttgtcc cgtgcagagg acaggatgag ctgataggca 9900gggctcgcat
ctctccagga gctggatgga atgtgaagga cacagcttgc ctgcccaaag
9960catatgcaca aatgtgggta ctcctatact tccaccgcag ggacttgcgt
ctcatggcaa 10020atgcgatttg ctcagcagtg ccagtagatt gggtgcccac
aggcaggaca tcctggtcaa 10080tacactcgaa aggagagtgg atgaccacgg
aagacatgct gcaggtctgg aacagagttt 10140ggattgaaga aaatgaatgg
atgatggaca agactccaat cacaagctgg acagacgttc 10200cgtatgtggg
aaagcgcgag gacatctggt gtggcagcct
catcggaacg cgatccagag 10260caacctgggc tgagaacatc tatgcggcga
taaaccaggt tagagctgtc attgggaaag 10320aaaattatgt tgactacatg
acctcactca ggagatacga agacgtcttg atccaggaag 10380acagggtcat
ctagtgtgat ttaaggtaga aaagtagact atgtaaacaa tgtaaatgag
10440aaaatgcatg catatggagt caggccagca aaagctgcca ccggatactg
ggtagacggt 10500gctgcctgcg tctcagtccc aggaggactg ggttaacaaa
tctgacaaca gaaagtgaga 10560aagccctcag aactgtctcg gaagtaggtc
cctgctcact ggaagttgaa agaccaacgt 10620caggccacaa atttgtgcca
ctccgctagg gagtgcggcc tgcgcagccc caggaggact 10680gggttaccaa
agccgttgag cccccacggc ccaagcctcg tctaggatgc aatagacgag
10740gtgtaaggac tagaggttag aggagacccc gtggaaacaa caacatgcgg
cccaagcccc 10800ctcgaagctg tagaggaggt ggaaggacta gaggttagag
gagaccccgc atttgcatca 10860aacagcatat tgacacctgg gaatagactg
ggagatcttc tgctctatct caacatcagc 10920tactaggcac agagcgccga
agtatgtacg tggtggtgag gaagaacaca ggatct 109767610838DNAYellow fever
virus 76gtgctaattg aggtgcattg gtctgcaaat cgagttgcta ggcaataaac
acatttggat 60taattttaat cgttcgttga gcgattagca gagaactgac cagaacatgt
ctggtcgtaa 120agctcaggga aaaaccctgg gcgtcaatat ggtacgacga
ggagttcgct ccttgtcaaa 180caaaataaaa caaaaaacaa aacaaattgg
aaacagacct ggaccttcaa gaggtgttca 240aggatttatc tttttctttt
tgttcaacat tttgactgga aaaaagatca cagcccacct 300aaagaggttg
tggaaaatgc tggacccaag acaaggcttg gctgttctaa ggaaagtcaa
360gagagtggtg gccagtttga tgagaggatt gtcctcaagg aaacgccgtt
cccatgatgt 420tctgactgtg caattcctaa ttttgggaat gctgttgatg
acgggtggag tgaccttggt 480gcggaaaaac agatggttgc tcctaaatgt
gacatctgag gacctcggga aaacattctc 540tgtgggcaca ggcaactgca
caacaaacat tttggaagcc aagtactggt gcccagactc 600aatggaatac
aactgtccca atctcagtcc aagagaggag ccagatgaca ttgattgctg
660gtgctatggg gtggaaaacg ttagagtcgc atatggtaag tgtgactcag
caggcaggtc 720taggaggtca agaagggcca ttgacttgcc tacgcatgaa
aaccatggtt tgaagacccg 780gcaagaaaaa tggatgactg gaagaatggg
tgaaaggcaa ctccaaaaga ttgagagatg 840gttcgtgagg aacccctttt
ttgcagtgac ggctctgacc attgcctacc ttgtgggaag 900caacatgacg
caacgagtcg tgattgccct actggtcttg gctgttggtc cggcctactc
960agctcactgc attggaatta ctgacaggga tttcattgag ggggtgcatg
gaggaacttg 1020ggtttcagct accctggagc aagacaagtg tgtcactgtt
atggcccctg acaagccttc 1080attggacatc tcactagaga cagtagccat
tgatagacct gctgaggtga ggaaagtgtg 1140ttacaatgca gttctcactc
atgtgaagat taatgacaag tgccccagca ctggagaggc 1200ccacctagct
gaagagaacg aaggggacaa tgcgtgcaag cgcacttatt ctgatagagg
1260ctggggcaat ggctgtggcc tatttgggaa agggagcatt gtggcatgcg
ccaaattcac 1320ttgtgccaaa tccatgagtt tgtttgaggt tgatcagacc
aaaattcagt atgtcatcag 1380agcacaattg catgtagggg ccaagcagga
aaattggact accgacatta agactctcaa 1440gtttgatgcc ctgtcaggct
cccaggaagt cgagttcatt gggtatggaa aagctacact 1500ggaatgccag
gtgcaaactg cggtggactt tggtaacagt tacatcgctg agatggaaac
1560agagagctgg atagtggaca gacagtgggc ccaggacttg accctgccat
ggcagagtgg 1620aagtggcggg gtgtggagag agatgcatca tcttgtcgaa
tttgaacctc cgcatgccgc 1680cactatcaga gtactggccc tgggaaacca
ggaaggctcc ttgaaaacag ctcttactgg 1740cgcaatgagg gttacaaagg
acacaaatga caacaacctt tacaaactac atggtggaca 1800tgtttcttgc
agagtgaaat tgtcagcttt gacactcaag gggacatcct acaaaatatg
1860cactgacaaa atgttttttg tcaagaaccc aactgacact ggccatggca
ctgttgtgat 1920gcaggtgaaa gtgtcaaaag gagccccctg caggattcca
gtgatagtag ctgatgatct 1980tacagcggca atcaataaag gcattttggt
tacagttaac cccatcgcct caaccaatga 2040tgatgaagtg ctgattgagg
tgaacccacc ttttggagac agctacatta tcgttgggag 2100aggagattca
cgtctcactt accagtggca caaagaggga agctcaatag gaaagttgtt
2160cactcagacc atgaaaggcg tggaacgcct ggccgtcatg ggagacaccg
cctgggattt 2220cagctccgct ggagggttct tcacttcggt tgggaaagga
attcatacgg tgtttggctc 2280tgcctttcag gggctatttg gcggcttgaa
ctggataaca aaggtcatca tgggggcggt 2340acttatatgg gttggcatca
acacaagaaa catgacaatg tccatgagca tgatcttggt 2400aggagtgatc
atgatgtttt tgtctctagg agttggggcg gatcaaggat gcgccatcaa
2460ctttggcaag agagagctca agtgcggaga tggtatcttc atatttagag
actctgatga 2520ctggctgaac aagtactcat actatccaga agatcctgtg
aagcttgcat caatagtgaa 2580agcctctttt gaagaaggga agtgtggcct
aaattcagtt gactcccttg agcatgagat 2640gtggagaagc agggcagatg
agatcaatgc catttttgag gaaaacgagg tggacatttc 2700tgttgtcgtg
caggatccaa agaatgttta ccagagagga actcatccat tttccagaat
2760tcgggatggt ctgcagtatg gttggaagac ttggggtaag aaccttgtgt
tctccccagg 2820gaggaagaat ggaagcttca tcatagatgg aaagtccagg
aaagaatgcc cgttttcaaa 2880ccgggtctgg aattctttcc agatagagga
gtttgggacg ggagtgttca ccacacgcgt 2940gtacatggac gcagtctttg
aatacaccat agactgcgat ggatctatct tgggtgcagc 3000ggtgaacgga
aaaaagagtg cccatggctc tccaacattt tggatgggaa gtcatgaagt
3060aaatgggaca tggatgatcc acaccttgga ggcattagat tacaaggagt
gtgagtggcc 3120actgacacat acgattggaa catcagttga agagagtgaa
atgttcatgc cgagatcaat 3180cggaggccca gttagctctc acaatcatat
ccctggatac aaggttcaga cgaacggacc 3240ttggatgcag gtaccactag
aagtgaagag agaagcttgc ccagggacta gcgtgatcat 3300tgatggcaac
tgtgatggac ggggaaaatc aaccagatcc accacggata gcgggaaagt
3360tattcctgaa tggtgttgcc gctcctgcac aatgccgcct gtgagcttcc
atggtagtga 3420tgggtgttgg tatcccatgg aaattaggcc aaggaaaacg
catgaaagcc atctggtgcg 3480ctcctgggtt acagctggag aaatacatgc
tgtccctttt ggtttggtga gcatgatgat 3540agcaatggaa gtggtcctaa
ggaaaagaca gggaccaaag caaatgttgg ttggaggagt 3600agtgctcttg
ggagcaatgc tggtcgggca agtaactctc cttgatttgc tgaaactcac
3660agtggctgtg ggattgcatt tccatgagat gaacaatgga ggagacgcca
tgtatatggc 3720gttgattgct gccttttcaa tcagaccagg gctgctcatc
ggctttgggc tcaggaccct 3780atggagccct cgggaacgcc ttgtgctgac
cctaggagca gccatggtgg agattgcctt 3840gggtggcgtg atgggcggcc
tgtggaagta tctaaatgca gtttctctct gcatcctgac 3900aataaatgct
gttgcttcta ggaaagcatc aaataccatc ttgcccctca tggctctgtt
3960gacacctgtc actatggctg aggtgagact tgccgcaatg ttcttttgtg
ccgtggttat 4020cataggggtc cttcaccaga atttcaagga cacctccatg
cagaagacta tacctctggt 4080ggccctcaca ctcacatctt acctgggctt
gacacaacct tttttgggcc tgtgtgcatt 4140tctggcaacc cgcatatttg
ggcgaaggag tatcccagtg aatgaggcac tcgcagcagc 4200tggtctagtg
ggagtgctgg caggactggc ttttcaggag atggagaact tccttggtcc
4260gattgcagtt ggaggactcc tgatgatgct ggttagcgtg gctgggaggg
tggatgggct 4320agagctcaag aagcttggtg aagtttcatg ggaagaggag
gcggagatca gcgggagttc 4380cgcccgctat gatgtggcac tcagtgaaca
aggggagttc aagctgcttt ctgaagagaa 4440agtgccatgg gaccaggttg
tgatgacctc gctggccttg gttggggctg ccctccatcc 4500atttgctctt
ctgctggtcc ttgctgggtg gctgtttcat gtcaggggag ctaggagaag
4560tggggatgtc ttgtgggata ttcccactcc taagatcatc gaggaatgtg
aacatctgga 4620ggatgggatt tatggcatat tccagtcaac cttcttgggg
gcctcccagc gaggagtggg 4680agtggcacag ggaggggtgt tccacacaat
gtggcatgtc acaagaggag ctttccttgt 4740caggaatggc aagaagttga
ttccatcttg ggcttcagta aaggaagacc ttgtcgccta 4800tggtggctca
tggaagttgg aaggcagatg ggatggagag gaagaggtcc agttgatcgc
4860ggctgttcca ggaaagaacg tggtcaacgt ccagacaaaa ccgagcttgt
tcaaagtgag 4920gaatggggga gaaatcgggg ctgtcgctct tgactatccg
agtggcactt caggatctcc 4980tattgttaac aggaacggag aggtgattgg
gctgtacggc aatggcatcc ttgtcggtga 5040caactccttc gtgtccgcca
tatcccagac tgaggtgaag gaagaaggaa aggaggagct 5100ccaagagatc
ccgacaatgc taaagaaagg aatgacaact gtccttgatt ttcatcctgg
5160agctgggaag acaagacgtt tcctcccaca gatcttggcc gagtgcgcac
ggagacgctt 5220gcgcactctt gtgttggccc ccaccagggt tgttctttct
gaaatgaagg aggcttttca 5280cggcctggac gtgaaattcc acacacaggc
tttttccgct cacggcagcg ggagagaagt 5340cattgatgct atgtgccatg
ccaccctaac ttacaggatg ttggaaccaa ctagggttgt 5400taactgggaa
gtgatcatta tggatgaagc ccattttttg gatccagcta gcatagccgc
5460tagaggttgg gcagcgcaca gagctagggc aaatgaaagt gcaacaatct
tgatgacagc 5520cacaccgcct gggactagtg atgaatttcc acattcaaat
ggtgaaatag aagatgttca 5580aacggacata cccagtgagc cctggaacac
agggcatgac tggatcctgg ctgacaaaag 5640gcccacggca tggttccttc
catccatcag agctgcaaat gtcatggctg cctctttgcg 5700taaggctgga
aagagtgtgg tggtcctgaa caggaaaacc tttgagagag aataccccac
5760gataaagcag aagaaacctg actttatatt ggccactgac atagctgaaa
tgggagccaa 5820cctttgcgtg gagcgagtgc tggattgcag gacggctttt
aagcctgtgc ttgtggatga 5880agggaggaag gtggcaataa aagggccact
tcgtatctcc gcatcctctg ctgctcaaag 5940gagggggcgc attgggagaa
atcccaacag agatggagac tcatactact attctgagcc 6000tacaagtgaa
aataatgccc accacgtctg ctggttggag gcctcaatgc tcttggacaa
6060catggaggtg aggggtggaa tggtcgcccc actctatggc gttgaaggaa
ctaaaacacc 6120agtttcccct ggtgaaatga gactgaggga tgaccagagg
aaagtcttca gagaactagt 6180gaggaattgt gacctgcccg tttggctttc
gtggcaagtg gccaaggctg gtttgaagac 6240gaatgatcgt aagtggtgtt
ttgaaggccc tgaggaacat gagatcttga atgacagcgg 6300tgaaacagtg
aagtgcaggg ctcctggagg agcaaagaag cctctgcgcc caaggtggtg
6360tgatgaaagg gtgtcatctg accagagtgc gctgtctgaa tttattaagt
ttgctgaagg 6420taggagggga gctgctgaag tgctagttgt gctgagtgaa
ctccctgatt tcctggctaa 6480aaaaggtgga gaggcaatgg ataccatcag
tgtgtttctc cactctgagg aaggctctag 6540ggcttaccgc aatgcactat
caatgatgcc tgaggcaatg acaatagtca tgctgtttat 6600actggctgga
ctactgacat cgggaatggt catctttttc atgtctccca aaggcatcag
6660tagaatgtct atggcgatgg gcacaatggc cggctgtgga tatctcatgt
tccttggagg 6720cgtcaaaccc actcacatct cctatatcat gctcatattc
tttgtcctga tggtggttgt 6780gatccccgag ccagggcaac aaaggtccat
ccaagacaac caagtggcat acctcattat 6840tggcatcctg acgctggttt
cagcggtggc agccaacgag ctaggcatgc tggagaaaac 6900caaagaggac
ctctttggga agaagaactt aattccatct agtgcttcac cctggagttg
6960gccggatctt gacctgaagc caggagctgc ctggacagtg tacgttggca
ttgttacaat 7020gctctctcca atgttgcacc actggatcaa agtcgaatat
ggcaacctgt ctctgtctgg 7080aatagcccag tcagcctcag tcctttcttt
catggacaag gggataccat tcatgaagat 7140gaatatctcg gtcataatgc
tgctggtcag tggctggaat tcaataacag tgatgcctct 7200gctctgtggc
atagggtgcg ccatgctcca ctggtctctc attttacctg gaatcaaagc
7260gcagcagtca aagcttgcac agagaagggt gttccatggc gttgccaaga
accctgtggt 7320tgatgggaat ccaacagttg acattgagga agctcctgaa
atgcctgccc tttatgagaa 7380gaaactggct ctatatctcc ttcttgctct
cagcctagct tctgttgcca tgtgcagaac 7440gcccttttca ttggctgaag
gcattgtcct agcatcagct gccctagggc cgctcataga 7500gggaaacacc
agccttcttt ggaatggacc catggctgtc tccatgacag gagtcatgag
7560ggggaatcac tatgcttttg tgggagtcat gtacaatcta tggaagatga
aaactggacg 7620ccgggggagc gcgaatggaa aaactttggg tgaagtctgg
aagagggaac tgaatctgtt 7680ggacaagcga cagtttgagt tgtataaaag
gaccgacatt gtggaggtgg atcgtgatac 7740ggcacgcagg catttggccg
aagggaaggt ggacaccggg gtggcggtct ccagggggac 7800cgcaaagtta
aggtggttcc atgagcgtgg ctatgtcaag ctggaaggta gggtgattga
7860cctggggtgt ggccgcggag gctggtgtta ctacgctgct gcgcaaaagg
aagtgagtgg 7920ggtcaaagga tttactcttg gaagagacgg ccatgagaaa
cccatgaatg tgcaaagtct 7980gggatggaac atcatcacct tcaaggacaa
aactgatatc caccgcctag aaccagtgaa 8040atgtgacacc cttttgtgtg
acattggaga gtcatcatcg tcatcggtca cagaggggga 8100aaggaccgtg
agagttcttg atactgtaga aaaatggctg gcttgtgggg ttgacaactt
8160ctgtgtgaag gtgttagctc catacatgcc agatgttctc gagaaactgg
aattgctcca 8220aaggaggttt ggcggaacag tgatcaggaa ccctctctcc
aggaattcca ctcatgaaat 8280gtactacgtg tctggagccc gcagcaatgt
cacatttact gtgaaccaaa catcccgcct 8340cctgatgagg agaatgaggc
gtccaactgg aaaagtgacc ctggaggctg acgtcatcct 8400cccaattggg
acacgcagtg ttgagacaga caagggaccc ctggacaaag aggccataga
8460agaaagggtt gagaggataa aatctgagta catgacctct tggttttatg
acaatgacaa 8520cccctacagg acctggcact actgtggctc ctatgtcaca
aaaacctcag gaagtgcggc 8580gagcatggta aatggtgtta ttaaaattct
gacatatcca tgggacagga tagaggaggt 8640cacaagaatg gcaatgactg
acacaacccc ttttggacag caaagagtgt ttaaagaaaa 8700agttgacacc
agagcaaagg atccaccagc gggaactagg aagatcatga aagttgtcaa
8760caggtggctg ttccgccacc tggccagaga aaagaacccc agactgtgca
caaaggaaga 8820atttattgca aaagtccgaa gtcatgcagc cattggagct
tacctggaag aacaagaaca 8880gtggaagact gccaatgagg ctgtccaaga
cccaaagttc tgggaactgg tggatgaaga 8940aaggaagctg caccaacaag
gcaggtgtcg gacttgtgtg tacaacatga tggggaaaag 9000agagaagaag
ctgtcagagt ttgggaaagc aaagggaagc cgtgccatat ggtatatgtg
9060gctgggagcg cggtatcttg agtttgaggc cctgggattc ctgaatgagg
accattgggc 9120ttccagggaa aactcaggag gaggagtgga aggcattggc
ttacaatacc taggatatgt 9180gatcagagac ctggctgcaa tggatggtgg
tggattctac gcggatgaca ccgctggatg 9240ggacacgcgc atcacagagg
cagaccttga tgatgaacag gagatcttga actacatgag 9300cccacatcac
aaaaaactgg cacaagcagt gatggaaatg acatacaaga acaaagtggt
9360gaaagtgttg agaccagccc caggagggaa agcctacatg gatgtcataa
gtcgacgaga 9420ccagagagga tccgggcagg tagtgactta tgctctgaac
accatcacca acttgaaagt 9480ccaattgatc agaatggcag aagcagagat
ggtgatacat caccaacatg ttcaagattg 9540tgatgaatca gttctgacca
ggctggaggc atggctcact gagcacggat gtaacagact 9600gaagaggatg
gcggtgagtg gagacgactg tgtggtccgg cccatcgatg acaggttcgg
9660cctggccctg tcccatctca acgccatgtc caaggttaga aaggacatat
ctgaatggca 9720gccatcaaaa gggtggaatg attgggagaa tgtgcccttc
tgttcccacc acttccatga 9780actacagctg aaggatggca ggaggattgt
ggtgccttgc cgagaacagg acgagctcat 9840tgggagagga agggtgtctc
caggaaacgg ctggatgatc aaggaaacag cttgcctcag 9900caaagcctat
gccaacatgt ggtcactgat gtattttcac aaaagggaca tgaggctact
9960gtcattggct gtttcctcag ctgttcccac ctcatgggtt ccacaaggac
gcacaacatg 10020gtcgattcat gggaaagggg agtggatgac cacggaagac
atgcttgagg tgtggaacag 10080agtatggata accaacaacc cacacatgca
ggacaagaca atggtgaaaa aatggagaga 10140tgtcccttat ctaaccaaga
gacaagacaa gctgtgcgga tcactgattg gaatgaccaa 10200tagggccacc
tgggcctccc acatccattt ggtcatccat cgtatccgaa cgctgattgg
10260acaggagaaa tacactgact acctaacagt catggacagg tattctgtgg
atgctgacct 10320gcaactgggt gagcttatct gaaacaccat ctaacaggaa
taaccgggat acaaaccacg 10380ggtggagaac cggactcccc acaacctgaa
accgggatat aaaccacggc tggagaaccg 10440gactccgcac ttaaaatgaa
acagaaaccg ggataaaaac tacggatgga gaaccggact 10500ccacacattg
agacagaaga agttgtcagc ccagaacccc acacgagttt tgccactgct
10560aagctgtgag gcagtgcagg ctgggacagc cgacctccag gttgcgaaaa
acctggtttc 10620tgggacctcc caccccagag taaaaagaac ggagcctccg
ctaccaccct cccacgtggt 10680ggtagaaaga cggggtctag aggttagagg
agaccctcca gggaacaaat agtgggacca 10740tattgacgcc agggaaagac
cggagtggtt ctctgctttt cctccagagg tctgtgagca 10800cagtttgctc
aagaataagc agacctttgg atgacaaa 108387711674DNAChikungunya virus
77gatggctgcg tgagacacac gtagcctacc agtttcttac tgctctactc tgcaaagcaa
60gagattaata acccatcatg gatcctgtgt acgtggacat agacgctgac agcgcctttt
120tgaaggccct gcaacgtgcg taccccatgt ttgaggtgga accaaggcag
gtcacaccga 180atgaccatgc taatgctaga gcgttctcgc atctagctat
aaaactaata gagcaggaaa 240ttgaccccga ctcaaccatc ctggatatcg
gcagtgcgcc agcaaggagg atgatgtcgg 300acaggaagta ccactgcgtc
tgcccgatgc gcagtgcgga agatcccgag agactcgcca 360attatgcgag
aaagctagca tctgccgcag gaaaagtcct ggacagaaac atctctggaa
420agatcgggga cttacaagca gtaatggccg tgccagacac ggagacgcca
acattctgct 480tacacacaga cgtctcatgt agacagagag cagacgtcgc
tatataccaa gacgtctatg 540ctgtacacgc acccacgtcg ctataccacc
aggcgattaa aggggtccga gtggcgtact 600gggttgggtt cgacacaacc
ccgttcatgt acaatgccat ggcgggtgcc tacccctcat 660actcgacaaa
ctgggcagat gagcaggtac tgaaggctaa gaacatagga ttatgttcaa
720cagacctgac ggaaggtaga cgaggcaagt tgtctattat gagagggaaa
aagctaaaac 780cgtgcgaccg tgtgctgttc tcagtagggt caacgctcta
cccggaaagc cgcaagctac 840ttaagagctg gcacctgcca tcggtgttcc
atttaaaggg caaactcagc ttcacatgcc 900gctgtgatac agtggtttcg
tgtgagggct acgtcgttaa gagaataacg atgagcccag 960gcctttatgg
aaaaaccaca gggtatgcgg taacccacca cgcagacgga ttcctgatgt
1020gcaagactac cgacacggtt gacggcgaaa gaatgtcatt ctcggtgtgc
acatacgtgc 1080cggcgaccat ttgtgatcaa atgaccggca tccttgctac
agaagtcacg ccggaggatg 1140cacagaagct gttggtgggg ctgaaccaga
gaatagtggt taacggcaga acgcaacgga 1200atacgaacac catgaaaaat
tatctgcttc ccgtggtcgc ccaagccttc agtaagtggg 1260caaaggagtg
ccggaaagac atggaagatg aaaaactcct gggggtcaga gaaagaacac
1320tgacctgctg ctgtctatgg gcattcaaga agcagaaaac acacacggtc
tacaagaggc 1380ctgataccca gtcaattcag aaggttcagg ccgagtttga
cagctttgtg gtaccgagtc 1440tgtggtcgtc cgggttgtca atccctttga
ggactagaat caaatggttg ttaagcaagg 1500tgccaaaaac cgacctgatc
ccatacagcg gagacgcccg agaagcccgg gacgcagaaa 1560aagaagcaga
ggaagaacga gaagcagaac tgactcgcga agccctacca cctctacagg
1620cagcacagga agatgttcag gtcgaaatcg acgtggaaca gcttgaggac
agagcgggcg 1680caggaataat agagactccg agaggagcta tcaaagttac
tgcccaacca acagaccacg 1740tcgtgggaga gtacctggta ctctccccgc
agaccgtact acgtagccag aagctcagtc 1800tgattcacgc tttggcggag
caagtgaaga cgtgcacgca caacggacga gcagggaggt 1860atgcggtcga
agcgtacgac ggccgagtcc tagtgccctc aggctatgca atctcgcctg
1920aagacttcca gagtctaagc gaaagcgcaa cgatggtgta taacgaaaga
gagttcgtaa 1980acagaaagct acaccatatt gcgatgcacg gaccagccct
gaacaccgac gaagagtcgt 2040atgagctggt gagggcagag aggacagaac
acgagtacgt ctacgacgtg gatcagagaa 2100gatgctgtaa gaaggaagaa
gccgcaggac tggtactggt gggcgacttg actaatccgc 2160cctaccacga
attcgcatat gaagggctaa aaatccgccc tgcctgccca tacaaaattg
2220cagtcatagg agtcttcgga gtaccgggat ctggcaagtc agctattatc
aagaacctag 2280ttaccaggca ggacctggtg actagcggaa agaaagaaaa
ctgccaagaa atcaccaccg 2340acgtgatgag acagagaggt ctagagatat
ctgcacgtac ggttgactcg ctgctcttga 2400atggatgcaa cagaccagtc
gacgtgttgt acgtagacga ggcgtttgcg tgccactctg 2460gaacgctact
tgctttgatc gccttggtga gaccaaggca gaaagttgta ctttgtggtg
2520acccgaagca gtgcggcttc ttcaatatga tgcagatgaa agtcaactat
aatcacaaca 2580tctgcaccca agtgtaccac aaaagtatct ccaggcggtg
tacactgcct gtgaccgcca 2640ttgtgtcatc gttgcattac gaaggcaaaa
tgcgcactac gaatgagtac aacaagccga 2700ttgtagtgga cactacaggc
tcaacaaaac ctgaccctgg agacctcgtg ttaacgtgct 2760tcagagggtg
ggttaaacaa ctgcaaattg actatcgtgg atacgaggtc atgacagcag
2820ccgcatccca agggttaacc agaaaaggag tttacgcagt tagacaaaaa
gttaatgaaa 2880acccgctcta tgcatcaacg tcagagcacg tcaacgtact
cctaacgcgt acggaaggta 2940aactggtatg gaagacactt tccggcgacc
cgtggataaa gacgctgcag aacccaccga 3000aaggaaactt caaagcaact
attaaggagt gggaggtgga gcatgcatca ataatggcgg 3060gcatctgcag
tcaccaaatg accttcgata cattccaaaa taaagccaac gtttgttggg
3120ctaagagctt ggtccctatc ctcgaaacag cggggataaa actaaatgat
aggcagtggt 3180ctcagataat tcaagccttc aaagaagaca aagcatactc
acctgaagta gccctgaatg 3240aaatatgtac gcgcatgtat ggggtggatc
tagacagcgg gctattttct aaaccgttgg 3300tgtctgtgta ttacgcggat
aaccactggg ataataggcc tggagggaaa atgttcggat 3360ttaaccccga
ggcagcatcc attctagaaa
gaaagtatcc attcacaaaa gggaagtgga 3420acatcaacaa gcagatctgc
gtgactacca ggaggataga agactttaac cctaccacca 3480acatcatacc
ggccaacagg agactaccac actcattagt ggccgaacac cgcccagtaa
3540aaggggaaag aatggaatgg ctggttaaca agataaacgg ccaccacgtg
ctcctggtca 3600gtggctataa ccttgcactg cctactaaga gagtcacttg
ggtagcgccg ttaggtgtcc 3660gcggagcgga ctacacatac aacctagagt
tgggtctgcc agcaacgctt ggtaggtatg 3720acctagtggt cataaacatc
cacacacctt ttcgcataca ccattaccaa cagtgcgtcg 3780accacgcaat
gaaactgcaa atgctcgggg gtgactcatt gagactgctc aaaccgggcg
3840gctctctatt gatcagagca tatggttacg cagatagaac cagtgaacga
gtcatctgcg 3900tattgggacg caagtttaga tcgtctagag cgttgaaacc
accatgtgtc accagcaaca 3960ctgagatgtt tttcctattc agcaactttg
acaatggcag aaggaatttc acaactcatg 4020tcatgaacaa tcaactgaat
gcagccttcg taggacaggt cacccgagca ggatgtgcac 4080cgtcgtaccg
ggtaaaacgc atggacatcg cgaagaacga tgaagagtgc gtagtcaacg
4140ccgctaaccc tcgcgggtta ccgggtggcg gtgtttgcaa ggcagtatac
aaaaaatggc 4200cggagtcctt taagaacagt gcaacaccag tgggaaccgc
aaaaacagtt atgtgcggta 4260cgtatccagt aatccacgct gttggaccaa
acttctctaa ttattcggag tctgaagggg 4320accgggaatt ggcagctgcc
tatcgagaag tcgcaaagga agtaactagg ctgggagtaa 4380atagtgtagc
tatacctctc ctctccacag gtgtatactc aggagggaaa gacaggctga
4440cccagtcact gaaccacctc tttacagcca tggactcgac ggatgcagac
gtggtcatct 4500actgccgcga caaagaatgg gagaagaaaa tatctgaggc
catacagatg cggacccaag 4560tagagctgct ggatgagcac atctccatag
actgcgatat tgttcgcgtg caccctgaca 4620gcagcttggc aggcagaaaa
ggatacagca ccacggaagg cgcactgtac tcatatctag 4680aagggacccg
ttttcatcag acggctgtgg atatggcgga gatacatact atgtggccaa
4740agcaaacaga ggccaatgag caagtctgcc tatatgccct gggggaaagt
attgaatcga 4800tcaggcagaa atgcccggtg gatgatgcag acgcatcatc
tccccccaaa actgtcccgt 4860gcctttgccg ttacgctatg actccagaac
gcgtcacccg gcttcgcatg aaccacgtca 4920caagcataat tgtgtgttct
tcgtttcccc tcccaaagta caaaatagaa ggagtgcaaa 4980aagtcaaatg
ctctaaggta atgctatttg accacaacgt gccatcgcgc gtaagtccaa
5040gggcttatag aggtgccgct gccggtaacc ttgcggccgt gtctgattgg
gtaatgagca 5100ccgtacctgt cgcgccgccc agaagaaggc gagggagaaa
cctgactgtg acatgtgacg 5160agagagaagg gaatataaca cccatggcta
gcgtccgatt ctttagggca gagctgtgtc 5220cggtcgtaca agaaacagcg
gagacgcgtg acacagcaat gtctcttcag gcaccaccga 5280gtaccgccac
ggaaccgaat catccgccga tctccttcgg agcatcaagc gagacgttcc
5340ccattacatt tggggacttc aacgaaggag aaatcgaaag cttgtcttct
gagctactaa 5400ctttcggaga cttcttacca ggagaagtgg atgacttgac
agacagcgac tggtccacgt 5460gctcagacac ggacgacgag ttaagactag
acagggcagg tgggtatata ttctcgtcgg 5520acaccggtcc aggtcattta
caacagaagt cagtacgcca gtcagtgctg ccggtgaaca 5580ccctggagga
agtccacgag gagaagtgtt acccacctaa gctggatgaa gcaaaggagc
5640aactattact taagaaactc caggagagtg catccatggc caacagaagc
aggtatcagt 5700cgcgcaaagt agaaaacatg aaagcagcaa tcatccagag
actaaagaga ggctgtagac 5760tatacttaat gtcagagacc ccaaaagtcc
ctacttaccg gactacatat ccggcgcctg 5820tgtactcgcc tccgatcaac
gtccgattgt ccaatcccga gtccgcagtg gcagcatgca 5880atgagttctt
agctagaaac tatccaactg tctcatcata ccaaattacc gacgagtatg
5940atgcatatct agacatggtg gacgggtcgg agagttgcct ggaccgagcg
acattcaatc 6000cgtcaaaact caggagctac ccgaaacagc acgcttacca
cgcgccctcc atcagaagcg 6060ctgtaccgtc cccattccag aacacactac
agaatgtact ggcagcagcc acgaaaagaa 6120actgcaacgt cacacagatg
agggaattac ccactttgga ctcagcagta ttcaacgtgg 6180agtgtttcaa
aaaattcgca tgcaaccaag aatactggga agaatttgct gccagcccta
6240ttaggataac aactgagaat ttagcaacct atgttactaa actaaaaggg
ccaaaagcag 6300cagcgctatt cgcaaaaacc cataatctac tgccactaca
ggaagtacca atggataggt 6360tcacagtaga tatgaaaagg gacgtaaagg
tgactcctgg tacaaagcat acagaggaaa 6420gacctaaggt gcaggttata
caggcggctg aacccttggc gacagcatac ctatgtggga 6480ttcacagaga
gctggttagg aggctgaacg ccgtcctcct acccaatgta catacactat
6540ttgacatgtc tgccgaggat ttcgatgcca tcatagccgc acactttaag
ccaggagaca 6600ctgttttgga aacggacata gcctcctttg ataagagcca
agatgattca cttgcgctta 6660ctgctttgat gctgttagag gatttagggg
tggatcactc cctgctggac ttgatagagg 6720ctgctttcgg agagatttcc
agctgtcacc taccgacagg tacgcgcttc aagttcggcg 6780ccatgatgaa
atcaggtatg ttcctaactc tgttcgtcaa cacattgtta aacatcacca
6840tcgccagccg agtgctggaa gatcgtctga caaaatccgc gtgcgcggcc
ttcatcggcg 6900acgacaacat aatacatgga gtcgtctccg atgaattgat
ggcagccaga tgtgccactt 6960ggatgaacat ggaagtgaag atcatagatg
cagttgtatc cttgaaagcc ccttactttt 7020gtggagggtt tatactgcac
gatactgtga caggaacagc ttgcagagtg gcagacccgc 7080taaaaaggct
ttttaaactg ggcaaaccgc tagcggcagg tgacgaacaa gatgaagata
7140gaagacgagc gctggctgac gaagtgatca gatggcaacg aacagggcta
attgatgagc 7200tggagaaagc ggtatactct aggtacgaag tgcagggtat
atcagttgtg gtaatgtcca 7260tggccacctt tgcaagctcc agatccaact
tcgagaagct cagaggaccc gtcataactt 7320tgtacggcgg tcctaaatag
gtacgcacta cagctaccta ttttgcagaa gccgacagca 7380agtatctaaa
cactaatcag ctacaatgga gttcatccca acccaaactt tttacaatag
7440gaggtaccag cctcgaccct ggactccgcg ccctactatc caagtcatca
ggcccagacc 7500gcgccctcag aggcaagctg ggcaacttgc ccagctgatc
tcagcagtta ataaactgac 7560aatgcgcgcg gtaccacaac agaagccacg
caggaatcgg aagaataaga agcaaaagca 7620aaaacaacag gcgccacaaa
acaacacaaa tcaaaagaag cagccaccta aaaagaaacc 7680ggctcaaaag
aaaaagaagc cgggccgcag agagaggatg tgcatgaaaa tcgaaaatga
7740ttgtattttc gaagtcaagc acgaaggtaa ggtaacaggt tacgcgtgcc
tggtggggga 7800caaagtaatg aaaccagcac acgtaaaggg gaccatcgat
aacgcggacc tggccaaact 7860ggcctttaag cggtcatcta agtatgacct
tgaatgcgcg cagatacccg tgcacatgaa 7920gtccgacgct tcgaagttca
cccatgagaa accggagggg tactacaact ggcaccacgg 7980agcagtacag
tactcaggag gccggttcac catccctaca ggtgctggca aaccagggga
8040cagcggcaga ccgatcttcg acaacaaggg acgcgtggtg gccatagtct
taggaggagc 8100taatgaagga gcccgtacag ccctctcggt ggtgacctgg
aataaagaca ttgtcactaa 8160aatcaccccc gagggggccg aagagtggag
tcttgccatc ccagttatgt gcctgttggc 8220aaacaccacg ttcccctgct
cccagccccc ttgcacgccc tgctgctacg aaaaggaacc 8280ggaggaaacc
ctacgcatgc ttgaggacaa cgtcatgaga cctgggtact atcagctgct
8340acaagcatcc ttaacatgtt ctccccaccg ccagcgacgc agcaccaagg
acaacttcaa 8400tgtctataaa gccacaagac catacttagc tcactgtccc
gactgtggag aagggcactc 8460gtgccatagt cccgtagcac tagaacgcat
cagaaatgaa gcgacagacg ggacgctgaa 8520aatccaggtc tccttgcaaa
tcggaataaa gacggatgac agccacgatt ggaccaagct 8580gcgttatatg
gacaaccaca tgccagcaga cgcagagagg gcggggctat ttgtaagaac
8640atcagcaccg tgtacgatta ctggaacaat gggacacttc atcctggccc
gatgtccaaa 8700aggggaaact ctgacggtgg gattcactga cagtaggaag
attagtcact catgtacgca 8760cccatttcac cacgaccctc ctgtgatagg
tcgggaaaaa ttccattccc gaccgcagca 8820cggtaaagag ctaccttgca
gcacgtacgt gcagagcacc gccgcaacta ccgaggagat 8880agaggtacac
atgcccccag acacccctga tcgcacatta atgtcacaac agtccggcaa
8940cgtaaagatc acagtcaatg gccagacggt gcggtacaag tgtaattgcg
gtggctcaaa 9000tgaaggacta acaactacag acaaagtgat taataactgc
aaggttgatc aatgtcatgc 9060cgcggtcacc aatcacaaaa agtggcagta
taactcccct ctggtcccgc gtaatgctga 9120acttggggac cgaaaaggaa
aaattcacat cccgtttccg ctggcaaatg taacatgcag 9180ggtgcctaaa
gcaaggaacc ccaccgtgac gtacgggaaa aaccaagtca tcatgctact
9240gtatcctgac cacccaacac tcctgtccta ccggaatatg ggagaagaac
caaactatca 9300agaagagtgg gtgatgcata agaaggaagt cgtgctaacc
gtgccgactg aagggctcga 9360ggtcacgtgg ggcaacaacg agccgtataa
gtattggccg cagttatcta caaacggtac 9420agcccatggc cacccgcatg
agataattct gtattattat gagctgtacc ccactatgac 9480tgtagtagtt
gtgtcagtgg ccacgttcat actcctgtcg atggtgggta tggcagcggg
9540gatgtgcatg tgtgcacgac gcagatgcat cacaccgtat gaactgacac
caggagctac 9600cgtccctttc ctgcttagcc taatatgctg catcagaaca
gctaaagcgg ccacatacca 9660agaggctgcg atatacctgt ggaacgagca
gcaacctttg ttttggctac aagcccttat 9720tccgctggca gccctgattg
ttctatgcaa ctgtctgaga ctcttaccat gctgctgtaa 9780aacgttggct
tttttagccg taatgagcgt cggtgcccac actgtgagcg cgtacgaaca
9840cgtaacagtg atcccgaaca cggtgggagt accgtataag actctagtca
atagacctgg 9900ctacagcccc atggtattgg agatggaact actgtcagtc
actttggagc caacactatc 9960gcttgattac atcacgtgcg agtacaaaac
cgtcatcccg tctccgtacg tgaagtgctg 10020cggtacagca gagtgcaagg
acaaaaacct acctgactac agctgtaagg tcttcaccgg 10080cgtctaccca
tttatgtggg gcggcgccta ctgcttctgc gacgctgaaa acacgcagtt
10140gagcgaagca cacgtggaga agtccgaatc atgcaaaaca gaatttgcat
cagcatacag 10200ggctcatacc gcatctgcat cagctaagct ccgcgtcctt
taccaaggaa ataacatcac 10260tgtaactgcc tatgcaaacg gcgaccatgc
cgtcacagtt aaggacgcca aattcattgt 10320ggggccaatg tcttcagcct
ggacaccttt cgacaacaaa attgtggtgt acaaaggtga 10380cgtctataac
atggactacc cgccctttgg cgcaggaaga ccaggacaat ttggcgatat
10440ccaaagtcgc acacctgaga gtaaagacgt ctatgctaat acacaactgg
tactgcagag 10500accggctgtg ggtacggtac acgtgccata ctctcaggca
ccatctggct ttaagtattg 10560gctaaaagaa cgcggggcgt cgctgcagca
cacagcacca tttggctgcc aaatagcaac 10620aaacccggta agagcggtga
actgcgccgt agggaacatg cccatctcca tcgacatacc 10680ggaagcggcc
ttcactaggg tcgtcgacgc gccctcttta acggacatgt cgtgcgaggt
10740accagcctgc acccattcct cagactttgg gggcgtcgcc attattaaat
atgcagccag 10800caagaaaggc aagtgtgcgg tgcattcgat gactaacgcc
gtcactattc gggaagctga 10860gatagaagtt gaagggaatt ctcagctgca
aatctctttc tcgacggcct tagccagcgc 10920cgaattccgc gtacaagtct
gttctacaca agtacactgt gcagccgagt gccacccccc 10980gaaggaccac
atagtcaact acccggcgtc acataccacc ctcggggtcc aggacatctc
11040cgctacggcg atgtcatggg tgcagaagat cacgggaggt gtgggactgg
ttgttgctgt 11100tgccgcactg attctaatcg tggtgctatg cgtgtcgttc
agcaggcact aacttgacaa 11160ttaagtatga aggtatatgt gtcccctaag
agacacactg tacatagcaa ataatctata 11220gatcaaaggg ctacgcaacc
cctgaatagt aacaaaatac aaaatcacta aaaattataa 11280aaacagaaaa
atacataaat aggtatacgt gtcccctaag agacacattg tatgtaggtg
11340ataagtatag atcaaagggc cgaataaccc ctgaatagta acaaaatatg
aaaatcaata 11400aaaatcataa aatagaaaaa ccataaacag aagtagttca
aagggctata aaacccctga 11460atagtaacaa aacataaaat taataaaaat
caaatgaata ccataattgg caaacggaag 11520agatgtaggt acttaagctt
cctaaaagca gccgaactca ctttgagaag taggcatagc 11580ataccgaact
cttccacgat tctccgaacc cacagggacg taggagatgt tattttgttt
11640ttaatatttc aaaaaaaaaa aaaaaaaaaa aaaa 116747810773DNAZika
virus 78cagactgcga cagttcgagt ttgaagcgaa agctagcaac agtatcaaca
ggttttattt 60tggatttgga aacgagagtt tctggtcatg aaaaacccaa aaaagaaatc
cggaggattc 120cggattgtca atatgctaaa acgcggagta gcccgtgtga
gcccctttgg gggcttgaag 180aggctgccag ccggacttct gctgggtcat
gggcccatca ggatggtctt ggcgattcta 240gcctttttga gattcacggc
aatcaagcca tcactgggtc tcatcaatag atggggttca 300gtggggaaaa
aagaggctat ggaaataata aagaagttca agaaagatct ggctgccatg
360ctgagaataa tcaatgctag gaaggagaag aagagacgag gcgcagatac
tagtgtcgga 420attgttggcc tcctgctgac cacagctatg gcagcggagg
tcactagacg tgggagtgca 480tactatatgt acttggacag aaacgacgct
ggggaggcca tatcttttcc aaccacattg 540gggatgaata agtgttatat
acagatcatg gatcttggac acatgtgtga tgccaccatg 600agctatgaat
gccctatgct ggatgagggg gtggaaccag atgacgtcga ttgttggtgc
660aacacgacgt caacttgggt tgtgtacgga acctgccatc acaaaaaagg
tgaagcacgg 720agatctagaa gagctgtgac gctcccctcc cattccacta
ggaagctgca aacgcggtcg 780caaacctggt tggaatcaag agaatacaca
aagcacttga ttagagtcga aaattggata 840ttcaggaacc ctggcttcgc
gttagcagca gctgccatcg cttggctttt gggaagctca 900acgagccaaa
aagtcatata cttggtcatg atactgctga ttgccccggc atacagcatc
960aggtgcatag gagtcagcaa tagggacttt gtggaaggta tgtcaggtgg
gacttgggtt 1020gatgttgtct tggaacatgg aggttgtgtc accgtaatgg
cacaggacaa accgactgtc 1080gacatagagc tggttacaac aacagtcagc
aacatggcgg aggtaagatc ctactgctat 1140gaggcatcaa tatcggacat
ggcttcggac agccgctgcc caacacaagg tgaagcctac 1200cttgacaagc
aatcagacac tcaatatgtc tgcaaaagaa cgttagtgga cagaggctgg
1260ggaaatggat gtggactttt tggcaaaggg agcctggtga catgcgctaa
gtttgcatgc 1320tccaagaaaa tgaccgggaa gagcatccag ccagagaatc
tggagtaccg gataatgctg 1380tcagttcatg gctcccagca cagtgggatg
atcgttaatg acacaggaca tgaaactgat 1440gagaatagag cgaaggttga
gataacgccc aattcaccaa gagccgaagc caccctgggg 1500ggttttggaa
gcctaggact tgattgtgaa ccgaggacag gccttgactt ttcagatttg
1560tattacttga ctatgaataa caagcactgg ttggttcaca aggagtggtt
ccacgacatt 1620ccattacctt ggcacgctgg ggcagacacc ggaactccac
actggaacaa caaagaagca 1680ctggtagagt tcaaggacgc acatgccaaa
aggcaaactg tcgtggttct agggagtcaa 1740gaaggagcag ttcacacggc
ccttgctgga gctctggagg ctgagatgga tggtgcaaag 1800ggaaggctgt
cctctggcca cttgaaatgt cgcctgaaaa tggataaact tagattgaag
1860ggcgtgtcat actccttgtg taccgcagcg ttcacattca ccaagatccc
ggctgaaaca 1920ctgcacggga cagtcacagt ggaggtacag tacgcaggga
cagatggacc ttgcaaggtt 1980ccagctcaga tggcggtgga catgcaaact
ctgaccccag ttgggaggtt gataaccgct 2040aaccccgtaa tcactgaaag
cactgagaac tctaagatga tgctggaact tgatccacca 2100tttggggact
cttacattgt cataggagtc ggggagaaga agatcaccca ccactggcac
2160aggagtggca gcaccattgg aaaagcattt gaagccactg tgagaggtgc
caagagaatg 2220gcagtcttgg gagacacagc ctgggacttt ggatcagttg
gaggcgctct caactcattg 2280ggcaagggca tccatcaaat ttttggagca
gctttcaaat cattgtttgg aggaatgtcc 2340tggttctcac aaattctcat
tggaacgttg ctgatgtggt tgggtctgaa cacaaagaat 2400ggatctattt
cccttatgtg cttggcctta gggggagtgt tgatcttctt atccacagct
2460gtctctgctg atgtggggtg ctcggtggac ttctcaaaga aggagacgag
atgcggtaca 2520ggggtgttcg tctataacga cgttgaagcc tggagggaca
ggtacaagta ccatcctgac 2580tccccccgta gattggcagc agcagtcaag
caagcctggg aagatggtat ctgtgggatc 2640tcctctgttt caagaatgga
aaacatcatg tggagatcag tagaagggga gctcaacgca 2700atcctggaag
agaatggagt tcaactgacg gtcgttgtgg gatctgtaaa aaaccccatg
2760tggagaggtc cacagagatt gcccgtgcct gtgaacgagc tgccccacgg
ctggaaggct 2820tgggggaaat cgtacttcgt cagagcagca aagacaaata
acagctttgt cgtggatggt 2880gacacactga aggaatgccc actcaaacat
agagcatgga acagctttct tgtggaggat 2940catgggttcg gggtatttca
cactagtgtc tggctcaagg ttagagaaga ttattcatta 3000gagtgtgatc
cagccgttat tggaacagct gttaagggaa aggaggctgt acacagtgat
3060ctaggctact ggattgagag tgagaagaat gacacatgga ggctgaagag
ggcccatctg 3120atcgagatga aaacatgtga atggccaaag tcccacacat
tgtggacaga tggaatagaa 3180gagagtgatc tgatcatacc caagtcttta
gctgggccac tcagccatca caataccaga 3240gagggctaca ggacccaaat
gaaagggcca tggcacagtg aagagcttga aattcggttt 3300gaggaatgcc
caggcactaa ggtccacgtg gaggaaacat gtggaacaag aggaccatct
3360ctgagatcaa ccactgcaag cggaagggtg atcgaggaat ggtgctgcag
ggagtgcaca 3420atgcccccac tgtcgttccg ggctaaagat ggctgttggt
atggaatgga gataaggccc 3480aggaaagaac cagaaagtaa cttagtaagg
tcaatggtga ctgcaggatc aactgatcac 3540atggatcact tctcccttgg
agtgcttgtg attctgctca tggtgcagga agggctgaag 3600aagagaatga
ccacaaagat catcataagc acatcgatgg cagtgctggt agctatgatc
3660ctgggaggat tttcaatgag tgacctggct aagcttgcaa ttttgatggg
tgccaccttc 3720gcggaaatga acactggagg agatgtagct catctggcgc
tgatagcggc attcaaagtc 3780agaccagcgt tgctggtatc tttcatcttc
agagctaatt ggacaccccg tgaaagcatg 3840ctgctggcct tggcctcgtg
tcttttgcaa actgcgatct ccgccttgga aggcgacctg 3900atggttctca
tcaatggttt tgctttggcc tggttggcaa tacgagcgat ggttgttcca
3960cgcactgata acatcacctt ggcaatcctg gctgctctga caccactggc
ccggggcaca 4020ctgcttgtgg cgtggagagc aggccttgct acttgcgggg
ggtttatgct cctctctctg 4080aagggaaaag gcagtgtgaa gaagaactta
ccatttgtca tggccctggg actaaccgct 4140gtgaggctgg tcgaccccat
caacgtggtg ggactgctgt tgctcacaag gagtgggaag 4200cggagctggc
cccctagcga agtactcaca gctgttggcc tgatatgcgc attggctgga
4260gggttcgcca aggcagatat agagatggct gggcccatgg ccgcggtcgg
tctgctaatt 4320gtcagttacg tggtctcagg aaagagtgtg gacatgtaca
ttgaaagagc aggtgacatc 4380acatgggaaa aagatgcgga agtcactgga
aacagtcccc ggctcgatgt ggcgctagat 4440gagagtggtg atttctccct
ggtggaggat gacggtcccc ccatgagaga gatcatactc 4500aaggtggtcc
tgatgaccat ctgtggcatg aacccaatag ccataccctt tgcagctgga
4560gcgtggtacg tatacgtgaa gactggaaaa aggagtggtg ctctatggga
tgtgcctgct 4620cccaaggaag taaaaaaggg ggagaccaca gatggagtgt
acagagtaat gactcgtaga 4680ctgctaggtt caacacaagt tggagtggga
gttatgcaag agggggtctt tcacactatg 4740tggcacgtca caaaaggatc
cgcgctgaga agcggtgaag ggagacttga tccatactgg 4800ggagatgtca
agcaggatct ggtgtcatac tgtggtccat ggaagctaga tgccgcctgg
4860gacgggcaca gcgaggtgca gctcttggcc gtgccccccg gagagagagc
gaggaacatc 4920cagactctgc ccggaatatt taagacaaag gatggggaca
ttggagcggt tgcgctggat 4980tacccagcag gaacttcagg atctccaatc
ctagacaagt gtgggagagt gataggactt 5040tatggcaatg gggtcgtgat
caaaaatggg agttatgtta gtgccatcac ccaagggagg 5100agggaggaag
agactcctgt tgagtgcttc gagccttcga tgctgaagaa gaagcagcta
5160actgtcttag acttgcatcc tggagctggg aaaaccagga gagttcttcc
tgaaatagtc 5220cgtgaagcca taaaaacaag actccgtact gtgatcttag
ctccaaccag ggttgtcgct 5280gctgaaatgg aggaagccct tagagggctt
ccagtgcgtt atatgacaac agcagtcaat 5340gtcacccact ctggaacaga
aatcgtcgac ttaatgtgcc atgccacctt cacttcacgt 5400ctactacagc
caatcagagt ccccaactat aatctgtata ttatggatga ggcccacttc
5460acagatccct caagtatagc agcaagagga tacatttcaa caagggttga
gatgggcgag 5520gcggctgcca tcttcatgac cgccacgcca ccaggaaccc
gtgacgcatt tccggactcc 5580aactcaccaa ttatggacac cgaagtggaa
gtcccagaga gagcctggag ctcaggcttt 5640gattgggtga cggatcattc
tggaaaaaca gtttggtttg ttccaagcgt gaggaacggc 5700aatgagatcg
cagcttgtct gacaaaggct ggaaaacggg tcatacagct cagcagaaag
5760acttttgaga cagagttcca gaaaacaaaa catcaagagt gggactttgt
cgtgacaact 5820gacatttcag agatgggcgc caactttaaa gctgaccgtg
tcatagattc caggagatgc 5880ctaaagccgg tcatacttga tggcgagaga
gtcattctgg ctggacccat gcctgtcaca 5940catgccagcg ctgcccagag
gagggggcgc ataggcagga atcccaacaa acctggagat 6000gagtatctgt
atggaggtgg gtgcgcagag actgacgaag accatgcaca ctggcttgaa
6060gcaagaatgc tccttgacaa tatttacctc caagatggcc tcatagcctc
gctctatcga 6120cctgaggccg acaaagtagc agccattgag ggagagttca
agcttaggac ggagcaaagg 6180aagacctttg tggaactcat gaaaagagga
gatcttcctg tttggctggc ctatcaggtt 6240gcatctgccg gaataaccta
cacagataga agatggtgct ttgatggcac gaccaacaac 6300accataatgg
aagacagtgt gccggcagag gtgtggacca gacacggaga gaaaagagtg
6360ctcaaaccga ggtggatgga cgccagagtt tgttcagatc atgcggccct
gaagtcattc 6420aaggagtttg ccgctgggaa aagaggagcg gcttttggag
tgatggaagc cctgggaaca 6480ctgccaggac acatgacaga gagattccag
gaagccattg acaacctcgc tgtgctcatg 6540cgggcagaga ctggaagcag
gccttacaaa gccgcggcgg cccaattgcc ggagacccta 6600gagaccatta
tgcttttggg gttgctggga acagtctcgc tgggaatctt tttcgtcttg
6660atgaggaaca agggcatagg gaagatgggc tttggaatgg tgactcttgg
ggccagcgca 6720tggctcatgt
ggctctcgga aattgagcca gccagaattg catgtgtcct cattgttgtg
6780ttcctattgc tggtggtgct catacctgag ccagaaaagc aaagatctcc
ccaggacaac 6840caaatggcaa tcatcatcat ggtagcagta ggtcttctgg
gcttgattac cgccaatgaa 6900ctcggatggt tggagagaac aaagagtgac
ctaagccatc taatgggaag gagagaggag 6960ggggcaacca taggattctc
aatggacatt gacctgcggc cagcctcagc ttgggccatc 7020tatgctgcct
tgacaacttt cattacccca gccgtccaac atgcagtgac cacttcatac
7080aacaactact ccttaatggc gatggccacg caagctggag tgttgtttgg
tatgggcaaa 7140gggatgccat tctacgcatg ggactttgga gtcccgctgc
taatgatagg ttgctactca 7200caattaacac ccctgaccct aatagtggcc
atcattttgc tcgtggcgca ctacatgtac 7260ttgatcccag ggctgcaggc
agcagctgcg cgtgctgccc agaagagaac ggcagctggc 7320atcatgaaga
accctgttgt ggatggaata gtggtgactg acattgacac aatgacaatt
7380gacccccaag tggagaaaaa gatgggacag gtgctactca tagcagtagc
cgtctccagc 7440gccatactgt cgcggaccgc ctgggggtgg ggggaggctg
gggccctgat cacagcggca 7500acttccactt tgtgggaagg ctctccgaac
aagtactgga actcctctac agccacttca 7560ctgtgtaaca tttttagggg
aagttacttg gctggagctt ctctaatcta cacagtaaca 7620agaaacgctg
gcttggtcaa gagacgtggg ggtggaacag gagagaccct gggagagaaa
7680tggaaggccc gcttgaacca gatgtcggcc ctggagttct actcctacaa
aaagtcaggc 7740atcaccgagg tgtgcagaga agaggcccgc cgcgccctca
aggacggtgt ggcaacggga 7800ggccatgctg tgtcccgagg aagtgcaaag
ctgagatggt tggtggagcg gggatacctg 7860cagccctatg gaaaggtcat
tgatcttgga tgtggcagag ggggctggag ttactacgcc 7920gccaccatcc
gcaaagttca agaagtgaaa ggatacacaa aaggaggccc tggtcatgaa
7980gaacccatgt tggtgcaaag ctatgggtgg aacatagtcc gtcttaagag
tggggtggac 8040gtctttcata tggcggctga gccgtgtgac acgttgctgt
gtgacatagg tgagtcatca 8100tctagtcctg aagtggaaga agcacggacg
ctcagagtcc tctccatggt gggggattgg 8160cttgaaaaaa gaccaggagc
cttttgtata aaagtgttgt gcccatacac cagcactatg 8220atggaaaccc
tggagcgact gcagcgtagg tatgggggag gactggtcag agtgccactc
8280tcccgcaact ctacacatga gatgtactgg gtctctggag cgaaaagcaa
caccataaaa 8340agtgtgtcca ccacgagcca gctcctcttg gggcgcatgg
acgggcccag gaggccagtg 8400aaatatgagg aggatgtgaa tctcggctct
ggcacgcggg ctgtggtaag ctgcgctgaa 8460gctcccaaca tgaagatcat
tggtaaccgc attgaaagga tccgcagtga gcacgcggaa 8520acgtggttct
ttgacgagaa ccacccatat aggacatggg cttaccatgg aagctatgag
8580gcccccacac aagggtcagc gtcctctcta ataaacgggg ttgtcaggct
cctgtcaaaa 8640ccctgggatg tggtgactgg agtcacagga atagccatga
ccgacaccac accgtatggt 8700cagcaaagag ttttcaagga aaaagtggac
actagggtgc cagaccccca agaaggcact 8760cgtcaggtta tgagcatggt
ctcttcctgg ttgtggaaag agctaggcaa acacaaacgg 8820ccacgagtct
gtaccaaaga agagttcatc aacaaggttc gtagcaatgc agcattaggg
8880gcaatatttg aagaggaaaa agagtggaag actgcagtgg aagctgtgaa
cgatccaagg 8940ttctgggctc tagtggacaa ggaaagagag caccacctga
gaggagagtg ccagagttgt 9000gtgtacaaca tgatgggaaa aagagaaaag
aaacaagggg aatttggaaa ggccaagggc 9060agccgcgcca tctggtatat
gtggctaggg gctagatttc tagagttcga agcccttgga 9120ttcttgaacg
aggatcactg gatggggaga gagaactcag gaggtggtgt tgaagggctg
9180ggattacaaa gactcggata tgtcctagaa gagatgagtc gcataccagg
aggaaggatg 9240tatgcagatg acactgctgg ctgggacacc cgcatcagca
ggtttgatct ggagaatgaa 9300gctctaatca ccaaccaaat ggagaaaggg
cacagggcct tggcattggc cataatcaag 9360tacacatacc aaaacaaagt
ggtaaaggtc cttagaccag ctgaaaaagg gaagacagtt 9420atggacatta
tttcgagaca agaccaaagg gggagcggac aagttgtcac ttacgctctt
9480aacacattta ccaacctagt ggtgcaactc attcggaata tggaggctga
ggaagttcta 9540gagatgcaag acttgtggct gctgcggagg tcagagaaag
tgaccaactg gttgcagagc 9600aacggatggg ataggctcaa acgaatggca
gtcagtggag atgattgcgt tgtgaagcca 9660attgatgata ggtttgcaca
tgccctcagg ttcttgaatg atatgggaaa agttaggaag 9720gacacacaag
agtggaaacc ctcaactgga tgggacaact gggaagaagt tccgttttgc
9780tcccaccact tcaacaagct ccatctcaag gacgggaggt ccattgtggt
tccctgccgc 9840caccaagatg aactgattgg ccgggcccgc gtctctccag
gggcgggatg gagcatccgg 9900gagactgctt gcctagcaaa atcatatgcg
caaatgtggc agctccttta tttccacaga 9960agggacctcc gactgatggc
caatgccatt tgttcatctg tgccagttga ctgggttcca 10020actgggagaa
ctacctggtc aatccatgga aagggagaat ggatgaccac tgaagacatg
10080cttgtggtgt ggaacagagt gtggattgag gagaacgacc acatggaaga
caagacccca 10140gttacgaaat ggacagacat tccctatttg ggaaaaaggg
aagacttgtg gtgtggatct 10200ctcatagggc acagaccgcg caccacctgg
gctgagaaca ttaaaaacac agtcaacatg 10260gtgcgcagga tcataggtga
tgaagaaaag tacatggact acctatccac ccaagttcgc 10320tacttgggtg
aagaagggtc tacacctgga gtgctgtaag caccaatctt agtgttgtca
10380ggcctgctag tcagccacag cttggggaaa gctgtgcagc ctgtgacccc
cccaggagaa 10440gctgggaaac caagcctata gtcaggccga gaacgccatg
gcacggaaga agccatgctg 10500cctgtgagcc cctcagagga cactgagtca
aaaaacccca cgcgcttgga ggcgcaggat 10560gggaaaagaa ggtggcgacc
ttccccaccc ttcaatctgg ggcctgaact ggagatcagc 10620tgtggatctc
cagaagaggg actagtggtt agaggagacc ccccggaaaa cgcaaaacag
10680catattgacg ctgggaaaga ccagagactc catgagtttc caccacgctg
gccgccaggc 10740acagatcgcc gaatagcggc ggccggtgtg ggg
10773793423PRTZika virus 79Met Lys Asn Pro Lys Lys Lys Ser Gly Gly
Phe Arg Ile Val Asn Met1 5 10 15Leu Lys Arg Gly Val Ala Arg Val Ser
Pro Phe Gly Gly Leu Lys Arg 20 25 30Leu Pro Ala Gly Leu Leu Leu Gly
His Gly Pro Ile Arg Met Val Leu 35 40 45Ala Ile Leu Ala Phe Leu Arg
Phe Thr Ala Ile Lys Pro Ser Leu Gly 50 55 60Leu Ile Asn Arg Trp Gly
Ser Val Gly Lys Lys Glu Ala Met Glu Ile65 70 75 80Ile Lys Lys Phe
Lys Lys Asp Leu Ala Ala Met Leu Arg Ile Ile Asn 85 90 95Ala Arg Lys
Glu Lys Lys Arg Arg Gly Ala Asp Thr Ser Val Gly Ile 100 105 110Val
Gly Leu Leu Leu Thr Thr Ala Met Ala Ala Glu Val Thr Arg Arg 115 120
125Gly Ser Ala Tyr Tyr Met Tyr Leu Asp Arg Asn Asp Ala Gly Glu Ala
130 135 140Ile Ser Phe Pro Thr Thr Leu Gly Met Asn Lys Cys Tyr Ile
Gln Ile145 150 155 160Met Asp Leu Gly His Met Cys Asp Ala Thr Met
Ser Tyr Glu Cys Pro 165 170 175Met Leu Asp Glu Gly Val Glu Pro Asp
Asp Val Asp Cys Trp Cys Asn 180 185 190Thr Thr Ser Thr Trp Val Val
Tyr Gly Thr Cys His His Lys Lys Gly 195 200 205Glu Ala Arg Arg Ser
Arg Arg Ala Val Thr Leu Pro Ser His Ser Thr 210 215 220Arg Lys Leu
Gln Thr Arg Ser Gln Thr Trp Leu Glu Ser Arg Glu Tyr225 230 235
240Thr Lys His Leu Ile Arg Val Glu Asn Trp Ile Phe Arg Asn Pro Gly
245 250 255Phe Ala Leu Ala Ala Ala Ala Ile Ala Trp Leu Leu Gly Ser
Ser Thr 260 265 270Ser Gln Lys Val Ile Tyr Leu Val Met Ile Leu Leu
Ile Ala Pro Ala 275 280 285Tyr Ser Ile Arg Cys Ile Gly Val Ser Asn
Arg Asp Phe Val Glu Gly 290 295 300Met Ser Gly Gly Thr Trp Val Asp
Val Val Leu Glu His Gly Gly Cys305 310 315 320Val Thr Val Met Ala
Gln Asp Lys Pro Thr Val Asp Ile Glu Leu Val 325 330 335Thr Thr Thr
Val Ser Asn Met Ala Glu Val Arg Ser Tyr Cys Tyr Glu 340 345 350Ala
Ser Ile Ser Asp Met Ala Ser Asp Ser Arg Cys Pro Thr Gln Gly 355 360
365Glu Ala Tyr Leu Asp Lys Gln Ser Asp Thr Gln Tyr Val Cys Lys Arg
370 375 380Thr Leu Val Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe
Gly Lys385 390 395 400Gly Ser Leu Val Thr Cys Ala Lys Phe Ala Cys
Ser Lys Lys Met Thr 405 410 415Gly Lys Ser Ile Gln Pro Glu Asn Leu
Glu Tyr Arg Ile Met Leu Ser 420 425 430Val His Gly Ser Gln His Ser
Gly Met Ile Val Asn Asp Thr Gly His 435 440 445Glu Thr Asp Glu Asn
Arg Ala Lys Val Glu Ile Thr Pro Asn Ser Pro 450 455 460Arg Ala Glu
Ala Thr Leu Gly Gly Phe Gly Ser Leu Gly Leu Asp Cys465 470 475
480Glu Pro Arg Thr Gly Leu Asp Phe Ser Asp Leu Tyr Tyr Leu Thr Met
485 490 495Asn Asn Lys His Trp Leu Val His Lys Glu Trp Phe His Asp
Ile Pro 500 505 510Leu Pro Trp His Ala Gly Ala Asp Thr Gly Thr Pro
His Trp Asn Asn 515 520 525Lys Glu Ala Leu Val Glu Phe Lys Asp Ala
His Ala Lys Arg Gln Thr 530 535 540Val Val Val Leu Gly Ser Gln Glu
Gly Ala Val His Thr Ala Leu Ala545 550 555 560Gly Ala Leu Glu Ala
Glu Met Asp Gly Ala Lys Gly Arg Leu Ser Ser 565 570 575Gly His Leu
Lys Cys Arg Leu Lys Met Asp Lys Leu Arg Leu Lys Gly 580 585 590Val
Ser Tyr Ser Leu Cys Thr Ala Ala Phe Thr Phe Thr Lys Ile Pro 595 600
605Ala Glu Thr Leu His Gly Thr Val Thr Val Glu Val Gln Tyr Ala Gly
610 615 620Thr Asp Gly Pro Cys Lys Val Pro Ala Gln Met Ala Val Asp
Met Gln625 630 635 640Thr Leu Thr Pro Val Gly Arg Leu Ile Thr Ala
Asn Pro Val Ile Thr 645 650 655Glu Ser Thr Glu Asn Ser Lys Met Met
Leu Glu Leu Asp Pro Pro Phe 660 665 670Gly Asp Ser Tyr Ile Val Ile
Gly Val Gly Glu Lys Lys Ile Thr His 675 680 685His Trp His Arg Ser
Gly Ser Thr Ile Gly Lys Ala Phe Glu Ala Thr 690 695 700Val Arg Gly
Ala Lys Arg Met Ala Val Leu Gly Asp Thr Ala Trp Asp705 710 715
720Phe Gly Ser Val Gly Gly Ala Leu Asn Ser Leu Gly Lys Gly Ile His
725 730 735Gln Ile Phe Gly Ala Ala Phe Lys Ser Leu Phe Gly Gly Met
Ser Trp 740 745 750Phe Ser Gln Ile Leu Ile Gly Thr Leu Leu Met Trp
Leu Gly Leu Asn 755 760 765Thr Lys Asn Gly Ser Ile Ser Leu Met Cys
Leu Ala Leu Gly Gly Val 770 775 780Leu Ile Phe Leu Ser Thr Ala Val
Ser Ala Asp Val Gly Cys Ser Val785 790 795 800Asp Phe Ser Lys Lys
Glu Thr Arg Cys Gly Thr Gly Val Phe Val Tyr 805 810 815Asn Asp Val
Glu Ala Trp Arg Asp Arg Tyr Lys Tyr His Pro Asp Ser 820 825 830Pro
Arg Arg Leu Ala Ala Ala Val Lys Gln Ala Trp Glu Asp Gly Ile 835 840
845Cys Gly Ile Ser Ser Val Ser Arg Met Glu Asn Ile Met Trp Arg Ser
850 855 860Val Glu Gly Glu Leu Asn Ala Ile Leu Glu Glu Asn Gly Val
Gln Leu865 870 875 880Thr Val Val Val Gly Ser Val Lys Asn Pro Met
Trp Arg Gly Pro Gln 885 890 895Arg Leu Pro Val Pro Val Asn Glu Leu
Pro His Gly Trp Lys Ala Trp 900 905 910Gly Lys Ser Tyr Phe Val Arg
Ala Ala Lys Thr Asn Asn Ser Phe Val 915 920 925Val Asp Gly Asp Thr
Leu Lys Glu Cys Pro Leu Lys His Arg Ala Trp 930 935 940Asn Ser Phe
Leu Val Glu Asp His Gly Phe Gly Val Phe His Thr Ser945 950 955
960Val Trp Leu Lys Val Arg Glu Asp Tyr Ser Leu Glu Cys Asp Pro Ala
965 970 975Val Ile Gly Thr Ala Val Lys Gly Lys Glu Ala Val His Ser
Asp Leu 980 985 990Gly Tyr Trp Ile Glu Ser Glu Lys Asn Asp Thr Trp
Arg Leu Lys Arg 995 1000 1005Ala His Leu Ile Glu Met Lys Thr Cys
Glu Trp Pro Lys Ser His 1010 1015 1020Thr Leu Trp Thr Asp Gly Ile
Glu Glu Ser Asp Leu Ile Ile Pro 1025 1030 1035Lys Ser Leu Ala Gly
Pro Leu Ser His His Asn Thr Arg Glu Gly 1040 1045 1050Tyr Arg Thr
Gln Met Lys Gly Pro Trp His Ser Glu Glu Leu Glu 1055 1060 1065Ile
Arg Phe Glu Glu Cys Pro Gly Thr Lys Val His Val Glu Glu 1070 1075
1080Thr Cys Gly Thr Arg Gly Pro Ser Leu Arg Ser Thr Thr Ala Ser
1085 1090 1095Gly Arg Val Ile Glu Glu Trp Cys Cys Arg Glu Cys Thr
Met Pro 1100 1105 1110Pro Leu Ser Phe Arg Ala Lys Asp Gly Cys Trp
Tyr Gly Met Glu 1115 1120 1125Ile Arg Pro Arg Lys Glu Pro Glu Ser
Asn Leu Val Arg Ser Met 1130 1135 1140Val Thr Ala Gly Ser Thr Asp
His Met Asp His Phe Ser Leu Gly 1145 1150 1155Val Leu Val Ile Leu
Leu Met Val Gln Glu Gly Leu Lys Lys Arg 1160 1165 1170Met Thr Thr
Lys Ile Ile Ile Ser Thr Ser Met Ala Val Leu Val 1175 1180 1185Ala
Met Ile Leu Gly Gly Phe Ser Met Ser Asp Leu Ala Lys Leu 1190 1195
1200Ala Ile Leu Met Gly Ala Thr Phe Ala Glu Met Asn Thr Gly Gly
1205 1210 1215Asp Val Ala His Leu Ala Leu Ile Ala Ala Phe Lys Val
Arg Pro 1220 1225 1230Ala Leu Leu Val Ser Phe Ile Phe Arg Ala Asn
Trp Thr Pro Arg 1235 1240 1245Glu Ser Met Leu Leu Ala Leu Ala Ser
Cys Leu Leu Gln Thr Ala 1250 1255 1260Ile Ser Ala Leu Glu Gly Asp
Leu Met Val Leu Ile Asn Gly Phe 1265 1270 1275Ala Leu Ala Trp Leu
Ala Ile Arg Ala Met Val Val Pro Arg Thr 1280 1285 1290Asp Asn Ile
Thr Leu Ala Ile Leu Ala Ala Leu Thr Pro Leu Ala 1295 1300 1305Arg
Gly Thr Leu Leu Val Ala Trp Arg Ala Gly Leu Ala Thr Cys 1310 1315
1320Gly Gly Phe Met Leu Leu Ser Leu Lys Gly Lys Gly Ser Val Lys
1325 1330 1335Lys Asn Leu Pro Phe Val Met Ala Leu Gly Leu Thr Ala
Val Arg 1340 1345 1350Leu Val Asp Pro Ile Asn Val Val Gly Leu Leu
Leu Leu Thr Arg 1355 1360 1365Ser Gly Lys Arg Ser Trp Pro Pro Ser
Glu Val Leu Thr Ala Val 1370 1375 1380Gly Leu Ile Cys Ala Leu Ala
Gly Gly Phe Ala Lys Ala Asp Ile 1385 1390 1395Glu Met Ala Gly Pro
Met Ala Ala Val Gly Leu Leu Ile Val Ser 1400 1405 1410Tyr Val Val
Ser Gly Lys Ser Val Asp Met Tyr Ile Glu Arg Ala 1415 1420 1425Gly
Asp Ile Thr Trp Glu Lys Asp Ala Glu Val Thr Gly Asn Ser 1430 1435
1440Pro Arg Leu Asp Val Ala Leu Asp Glu Ser Gly Asp Phe Ser Leu
1445 1450 1455Val Glu Asp Asp Gly Pro Pro Met Arg Glu Ile Ile Leu
Lys Val 1460 1465 1470Val Leu Met Thr Ile Cys Gly Met Asn Pro Ile
Ala Ile Pro Phe 1475 1480 1485Ala Ala Gly Ala Trp Tyr Val Tyr Val
Lys Thr Gly Lys Arg Ser 1490 1495 1500Gly Ala Leu Trp Asp Val Pro
Ala Pro Lys Glu Val Lys Lys Gly 1505 1510 1515Glu Thr Thr Asp Gly
Val Tyr Arg Val Met Thr Arg Arg Leu Leu 1520 1525 1530Gly Ser Thr
Gln Val Gly Val Gly Val Met Gln Glu Gly Val Phe 1535 1540 1545His
Thr Met Trp His Val Thr Lys Gly Ser Ala Leu Arg Ser Gly 1550 1555
1560Glu Gly Arg Leu Asp Pro Tyr Trp Gly Asp Val Lys Gln Asp Leu
1565 1570 1575Val Ser Tyr Cys Gly Pro Trp Lys Leu Asp Ala Ala Trp
Asp Gly 1580 1585 1590His Ser Glu Val Gln Leu Leu Ala Val Pro Pro
Gly Glu Arg Ala 1595 1600 1605Arg Asn Ile Gln Thr Leu Pro Gly Ile
Phe Lys Thr Lys Asp Gly 1610 1615 1620Asp Ile Gly Ala Val Ala Leu
Asp Tyr Pro Ala Gly Thr Ser Gly 1625 1630 1635Ser Pro Ile Leu Asp
Lys Cys Gly Arg Val Ile Gly Leu Tyr Gly 1640 1645 1650Asn Gly Val
Val Ile Lys Asn Gly Ser Tyr Val Ser Ala Ile Thr 1655 1660 1665Gln
Gly Arg Arg Glu Glu Glu Thr Pro Val Glu Cys Phe Glu Pro 1670 1675
1680Ser Met Leu Lys Lys Lys Gln Leu Thr Val Leu Asp Leu His Pro
1685 1690 1695Gly Ala Gly Lys Thr Arg Arg Val Leu Pro Glu Ile Val
Arg Glu 1700 1705 1710Ala Ile Lys Thr Arg Leu Arg Thr Val Ile Leu
Ala Pro Thr Arg 1715 1720 1725Val Val Ala Ala Glu Met Glu Glu Ala
Leu Arg Gly Leu Pro Val 1730 1735 1740Arg Tyr Met Thr Thr Ala Val
Asn Val Thr His Ser Gly Thr Glu 1745 1750 1755Ile Val Asp Leu Met
Cys His Ala Thr Phe Thr Ser Arg Leu Leu 1760 1765 1770Gln Pro Ile
Arg Val Pro Asn Tyr Asn Leu Tyr Ile Met Asp Glu 1775 1780
1785Ala
His Phe Thr Asp Pro Ser Ser Ile Ala Ala Arg Gly Tyr Ile 1790 1795
1800Ser Thr Arg Val Glu Met Gly Glu Ala Ala Ala Ile Phe Met Thr
1805 1810 1815Ala Thr Pro Pro Gly Thr Arg Asp Ala Phe Pro Asp Ser
Asn Ser 1820 1825 1830Pro Ile Met Asp Thr Glu Val Glu Val Pro Glu
Arg Ala Trp Ser 1835 1840 1845Ser Gly Phe Asp Trp Val Thr Asp His
Ser Gly Lys Thr Val Trp 1850 1855 1860Phe Val Pro Ser Val Arg Asn
Gly Asn Glu Ile Ala Ala Cys Leu 1865 1870 1875Thr Lys Ala Gly Lys
Arg Val Ile Gln Leu Ser Arg Lys Thr Phe 1880 1885 1890Glu Thr Glu
Phe Gln Lys Thr Lys His Gln Glu Trp Asp Phe Val 1895 1900 1905Val
Thr Thr Asp Ile Ser Glu Met Gly Ala Asn Phe Lys Ala Asp 1910 1915
1920Arg Val Ile Asp Ser Arg Arg Cys Leu Lys Pro Val Ile Leu Asp
1925 1930 1935Gly Glu Arg Val Ile Leu Ala Gly Pro Met Pro Val Thr
His Ala 1940 1945 1950Ser Ala Ala Gln Arg Arg Gly Arg Ile Gly Arg
Asn Pro Asn Lys 1955 1960 1965Pro Gly Asp Glu Tyr Leu Tyr Gly Gly
Gly Cys Ala Glu Thr Asp 1970 1975 1980Glu Asp His Ala His Trp Leu
Glu Ala Arg Met Leu Leu Asp Asn 1985 1990 1995Ile Tyr Leu Gln Asp
Gly Leu Ile Ala Ser Leu Tyr Arg Pro Glu 2000 2005 2010Ala Asp Lys
Val Ala Ala Ile Glu Gly Glu Phe Lys Leu Arg Thr 2015 2020 2025Glu
Gln Arg Lys Thr Phe Val Glu Leu Met Lys Arg Gly Asp Leu 2030 2035
2040Pro Val Trp Leu Ala Tyr Gln Val Ala Ser Ala Gly Ile Thr Tyr
2045 2050 2055Thr Asp Arg Arg Trp Cys Phe Asp Gly Thr Thr Asn Asn
Thr Ile 2060 2065 2070Met Glu Asp Ser Val Pro Ala Glu Val Trp Thr
Arg His Gly Glu 2075 2080 2085Lys Arg Val Leu Lys Pro Arg Trp Met
Asp Ala Arg Val Cys Ser 2090 2095 2100Asp His Ala Ala Leu Lys Ser
Phe Lys Glu Phe Ala Ala Gly Lys 2105 2110 2115Arg Gly Ala Ala Phe
Gly Val Met Glu Ala Leu Gly Thr Leu Pro 2120 2125 2130Gly His Met
Thr Glu Arg Phe Gln Glu Ala Ile Asp Asn Leu Ala 2135 2140 2145Val
Leu Met Arg Ala Glu Thr Gly Ser Arg Pro Tyr Lys Ala Ala 2150 2155
2160Ala Ala Gln Leu Pro Glu Thr Leu Glu Thr Ile Met Leu Leu Gly
2165 2170 2175Leu Leu Gly Thr Val Ser Leu Gly Ile Phe Phe Val Leu
Met Arg 2180 2185 2190Asn Lys Gly Ile Gly Lys Met Gly Phe Gly Met
Val Thr Leu Gly 2195 2200 2205Ala Ser Ala Trp Leu Met Trp Leu Ser
Glu Ile Glu Pro Ala Arg 2210 2215 2220Ile Ala Cys Val Leu Ile Val
Val Phe Leu Leu Leu Val Val Leu 2225 2230 2235Ile Pro Glu Pro Glu
Lys Gln Arg Ser Pro Gln Asp Asn Gln Met 2240 2245 2250Ala Ile Ile
Ile Met Val Ala Val Gly Leu Leu Gly Leu Ile Thr 2255 2260 2265Ala
Asn Glu Leu Gly Trp Leu Glu Arg Thr Lys Ser Asp Leu Ser 2270 2275
2280His Leu Met Gly Arg Arg Glu Glu Gly Ala Thr Ile Gly Phe Ser
2285 2290 2295Met Asp Ile Asp Leu Arg Pro Ala Ser Ala Trp Ala Ile
Tyr Ala 2300 2305 2310Ala Leu Thr Thr Phe Ile Thr Pro Ala Val Gln
His Ala Val Thr 2315 2320 2325Thr Ser Tyr Asn Asn Tyr Ser Leu Met
Ala Met Ala Thr Gln Ala 2330 2335 2340Gly Val Leu Phe Gly Met Gly
Lys Gly Met Pro Phe Tyr Ala Trp 2345 2350 2355Asp Phe Gly Val Pro
Leu Leu Met Ile Gly Cys Tyr Ser Gln Leu 2360 2365 2370Thr Pro Leu
Thr Leu Ile Val Ala Ile Ile Leu Leu Val Ala His 2375 2380 2385Tyr
Met Tyr Leu Ile Pro Gly Leu Gln Ala Ala Ala Ala Arg Ala 2390 2395
2400Ala Gln Lys Arg Thr Ala Ala Gly Ile Met Lys Asn Pro Val Val
2405 2410 2415Asp Gly Ile Val Val Thr Asp Ile Asp Thr Met Thr Ile
Asp Pro 2420 2425 2430Gln Val Glu Lys Lys Met Gly Gln Val Leu Leu
Ile Ala Val Ala 2435 2440 2445Val Ser Ser Ala Ile Leu Ser Arg Thr
Ala Trp Gly Trp Gly Glu 2450 2455 2460Ala Gly Ala Leu Ile Thr Ala
Ala Thr Ser Thr Leu Trp Glu Gly 2465 2470 2475Ser Pro Asn Lys Tyr
Trp Asn Ser Ser Thr Ala Thr Ser Leu Cys 2480 2485 2490Asn Ile Phe
Arg Gly Ser Tyr Leu Ala Gly Ala Ser Leu Ile Tyr 2495 2500 2505Thr
Val Thr Arg Asn Ala Gly Leu Val Lys Arg Arg Gly Gly Gly 2510 2515
2520Thr Gly Glu Thr Leu Gly Glu Lys Trp Lys Ala Arg Leu Asn Gln
2525 2530 2535Met Ser Ala Leu Glu Phe Tyr Ser Tyr Lys Lys Ser Gly
Ile Thr 2540 2545 2550Glu Val Cys Arg Glu Glu Ala Arg Arg Ala Leu
Lys Asp Gly Val 2555 2560 2565Ala Thr Gly Gly His Ala Val Ser Arg
Gly Ser Ala Lys Leu Arg 2570 2575 2580Trp Leu Val Glu Arg Gly Tyr
Leu Gln Pro Tyr Gly Lys Val Ile 2585 2590 2595Asp Leu Gly Cys Gly
Arg Gly Gly Trp Ser Tyr Tyr Ala Ala Thr 2600 2605 2610Ile Arg Lys
Val Gln Glu Val Lys Gly Tyr Thr Lys Gly Gly Pro 2615 2620 2625Gly
His Glu Glu Pro Met Leu Val Gln Ser Tyr Gly Trp Asn Ile 2630 2635
2640Val Arg Leu Lys Ser Gly Val Asp Val Phe His Met Ala Ala Glu
2645 2650 2655Pro Cys Asp Thr Leu Leu Cys Asp Ile Gly Glu Ser Ser
Ser Ser 2660 2665 2670Pro Glu Val Glu Glu Ala Arg Thr Leu Arg Val
Leu Ser Met Val 2675 2680 2685Gly Asp Trp Leu Glu Lys Arg Pro Gly
Ala Phe Cys Ile Lys Val 2690 2695 2700Leu Cys Pro Tyr Thr Ser Thr
Met Met Glu Thr Leu Glu Arg Leu 2705 2710 2715Gln Arg Arg Tyr Gly
Gly Gly Leu Val Arg Val Pro Leu Ser Arg 2720 2725 2730Asn Ser Thr
His Glu Met Tyr Trp Val Ser Gly Ala Lys Ser Asn 2735 2740 2745Thr
Ile Lys Ser Val Ser Thr Thr Ser Gln Leu Leu Leu Gly Arg 2750 2755
2760Met Asp Gly Pro Arg Arg Pro Val Lys Tyr Glu Glu Asp Val Asn
2765 2770 2775Leu Gly Ser Gly Thr Arg Ala Val Val Ser Cys Ala Glu
Ala Pro 2780 2785 2790Asn Met Lys Ile Ile Gly Asn Arg Ile Glu Arg
Ile Arg Ser Glu 2795 2800 2805His Ala Glu Thr Trp Phe Phe Asp Glu
Asn His Pro Tyr Arg Thr 2810 2815 2820Trp Ala Tyr His Gly Ser Tyr
Glu Ala Pro Thr Gln Gly Ser Ala 2825 2830 2835Ser Ser Leu Ile Asn
Gly Val Val Arg Leu Leu Ser Lys Pro Trp 2840 2845 2850Asp Val Val
Thr Gly Val Thr Gly Ile Ala Met Thr Asp Thr Thr 2855 2860 2865Pro
Tyr Gly Gln Gln Arg Val Phe Lys Glu Lys Val Asp Thr Arg 2870 2875
2880Val Pro Asp Pro Gln Glu Gly Thr Arg Gln Val Met Ser Met Val
2885 2890 2895Ser Ser Trp Leu Trp Lys Glu Leu Gly Lys His Lys Arg
Pro Arg 2900 2905 2910Val Cys Thr Lys Glu Glu Phe Ile Asn Lys Val
Arg Ser Asn Ala 2915 2920 2925Ala Leu Gly Ala Ile Phe Glu Glu Glu
Lys Glu Trp Lys Thr Ala 2930 2935 2940Val Glu Ala Val Asn Asp Pro
Arg Phe Trp Ala Leu Val Asp Lys 2945 2950 2955Glu Arg Glu His His
Leu Arg Gly Glu Cys Gln Ser Cys Val Tyr 2960 2965 2970Asn Met Met
Gly Lys Arg Glu Lys Lys Gln Gly Glu Phe Gly Lys 2975 2980 2985Ala
Lys Gly Ser Arg Ala Ile Trp Tyr Met Trp Leu Gly Ala Arg 2990 2995
3000Phe Leu Glu Phe Glu Ala Leu Gly Phe Leu Asn Glu Asp His Trp
3005 3010 3015Met Gly Arg Glu Asn Ser Gly Gly Gly Val Glu Gly Leu
Gly Leu 3020 3025 3030Gln Arg Leu Gly Tyr Val Leu Glu Glu Met Ser
Arg Ile Pro Gly 3035 3040 3045Gly Arg Met Tyr Ala Asp Asp Thr Ala
Gly Trp Asp Thr Arg Ile 3050 3055 3060Ser Arg Phe Asp Leu Glu Asn
Glu Ala Leu Ile Thr Asn Gln Met 3065 3070 3075Glu Lys Gly His Arg
Ala Leu Ala Leu Ala Ile Ile Lys Tyr Thr 3080 3085 3090Tyr Gln Asn
Lys Val Val Lys Val Leu Arg Pro Ala Glu Lys Gly 3095 3100 3105Lys
Thr Val Met Asp Ile Ile Ser Arg Gln Asp Gln Arg Gly Ser 3110 3115
3120Gly Gln Val Val Thr Tyr Ala Leu Asn Thr Phe Thr Asn Leu Val
3125 3130 3135Val Gln Leu Ile Arg Asn Met Glu Ala Glu Glu Val Leu
Glu Met 3140 3145 3150Gln Asp Leu Trp Leu Leu Arg Arg Ser Glu Lys
Val Thr Asn Trp 3155 3160 3165Leu Gln Ser Asn Gly Trp Asp Arg Leu
Lys Arg Met Ala Val Ser 3170 3175 3180Gly Asp Asp Cys Val Val Lys
Pro Ile Asp Asp Arg Phe Ala His 3185 3190 3195Ala Leu Arg Phe Leu
Asn Asp Met Gly Lys Val Arg Lys Asp Thr 3200 3205 3210Gln Glu Trp
Lys Pro Ser Thr Gly Trp Asp Asn Trp Glu Glu Val 3215 3220 3225Pro
Phe Cys Ser His His Phe Asn Lys Leu His Leu Lys Asp Gly 3230 3235
3240Arg Ser Ile Val Val Pro Cys Arg His Gln Asp Glu Leu Ile Gly
3245 3250 3255Arg Ala Arg Val Ser Pro Gly Ala Gly Trp Ser Ile Arg
Glu Thr 3260 3265 3270Ala Cys Leu Ala Lys Ser Tyr Ala Gln Met Trp
Gln Leu Leu Tyr 3275 3280 3285Phe His Arg Arg Asp Leu Arg Leu Met
Ala Asn Ala Ile Cys Ser 3290 3295 3300Ser Val Pro Val Asp Trp Val
Pro Thr Gly Arg Thr Thr Trp Ser 3305 3310 3315Ile His Gly Lys Gly
Glu Trp Met Thr Thr Glu Asp Met Leu Val 3320 3325 3330Val Trp Asn
Arg Val Trp Ile Glu Glu Asn Asp His Met Glu Asp 3335 3340 3345Lys
Thr Pro Val Thr Lys Trp Thr Asp Ile Pro Tyr Leu Gly Lys 3350 3355
3360Arg Glu Asp Leu Trp Cys Gly Ser Leu Ile Gly His Arg Pro Arg
3365 3370 3375Thr Thr Trp Ala Glu Asn Ile Lys Asn Thr Val Asn Met
Val Arg 3380 3385 3390Arg Ile Ile Gly Asp Glu Glu Lys Tyr Met Asp
Tyr Leu Ser Thr 3395 3400 3405Gln Val Arg Tyr Leu Gly Glu Glu Gly
Ser Thr Pro Gly Val Leu 3410 3415 34208028DNAArtificial
Sequenceprimer 80ttaggatccg ttgttgatct gtgtgaat 288126DNAArtificial
Sequenceprimer 81taactcgagc gtacacaacc caagtt 268227DNAArtificial
Sequenceprimer 82ttaggatcct cactagacgt gggagtg 278329DNAArtificial
Sequenceprimer 83taactcgaga agccatgtcy gatattgat
298426DNAArtificial Sequenceprimer 84ttaggatccg catacagcat caggtg
268527DNAArtificial Sequenceprimer 85taactcgagt gtggagttcc ggtgtct
278631DNAArtificial Sequenceprimer 86ttaggatccg aatagagcga
argttgagat a 318728DNAArtificial Sequenceprimer 87taactcgagt
ggtgggtgat cttcttct 288831DNAArtificial Sequenceprimer 88ttaggatcca
gtcacagtgg aggtacagta c 318927DNAArtificial Sequenceprimer
89taactcgagc rcagatacca tcttccc 279028DNAArtificial Sequenceprimer
90ttaggatccc ttatgtgctt ggccttag 289126DNAArtificial Sequenceprimer
91taactcgagt cttcagcctc catgtg 269229DNAArtificial Sequenceprimer
92ttaggatcca atgcccactc aaacataga 299330DNAArtificial
Sequenceprimer 93taactcgagt cattctcttc ttcagccctt
309426DNAArtificial Sequenceprimer 94ttaggatcca agggtgatcg aggaat
269529DNAArtificial Sequenceprimer 95taactcgagt tcccttcaga
gagaggagc 299628DNAArtificial Sequenceprimer 96ttaggatcct
cttttgcaaa ctgcgatc 289727DNAArtificial Sequenceprimer 97taactcgagt
ccagctgcaa agggtat 279828DNAArtificial Sequenceprimer 98ttaggatccg
tgtggacatg tacattga 289926DNAArtificial Sequenceprimer 99taactcgagc
ccattgccat aaagtc 2610028DNAArtificial Sequenceprimer 100ttaggatcct
catactgtgg tccatgga 2810126DNAArtificial Sequenceprimer
101taactcgagg cccatctcaa cccttg 2610226DNAArtificial Sequenceprimer
102ttaggatcct agagggcttc cagtgc 2610331DNAArtificial Sequenceprimer
103taactcgaga tactcatctc caggtttgtt g 3110430DNAArtificial
Sequenceprimer 104ttaggatccg aaaacaaaac atcaagagtg
3010531DNAArtificial Sequenceprimer 105taactcgagg aatctctctg
tcatgtgtcc t 3110626DNAArtificial Sequenceprimer 106ttaggatcct
tgatggcacg accaac 2610728DNAArtificial Sequenceprimer 107ttaggatccg
ttgttgatct gtgtgaat 2810826DNAArtificial Sequenceprimer
108taactcgagc aggtcaatgt ccattg 2610930DNAArtificial Sequenceprimer
109ttaggatcct gttgtgttcc tattgctggt 3011025DNAArtificial
Sequenceprimer 110taactcgagt gatcagrgcc ccagc 2511126DNAArtificial
Sequenceprimer 111ttaggatcct gctgcccaga agagaa 2611226DNAArtificial
Sequenceprimer 112taactcgagc accaacaygg gttctt 2611325DNAArtificial
Sequenceprimer 113ttaggatcct caaggacggt gtggc 2511428DNAArtificial
Sequenceprimer 114taactcgagc aatgatcttc atgttggg
2811526DNAArtificial Sequenceprimer 115ttaggatcct atgggggagg actggt
2611625DNAArtificial Sequenceprimer 116taactcgagc ccagaacctt ggatc
2511725DNAArtificial Sequenceprimer 117ttaggatcca gacccccaag aaggc
2511826DNAArtificial Sequenceprimer 118taactcgagc ccctttggtc ttgtct
2611929DNAArtificial Sequenceprimer 119ttaggatcca ggaaggatgt
atgcagatg 2912030DNAArtificial Sequenceprimer 120taactcgaga
catttgcgca tatgattttg 3012128DNAArtificial Sequenceprimer
121ttaggatcca ggaaggacac acaagagt 2812226DNAArtificial
Sequenceprimer 122taactcgaga caggctgcac agcttt 2612328DNAArtificial
Sequenceprimer 123ttaggatcct ctctcatagg gcacagac 28
* * * * *
References