Pan-lyssavirus Vaccines Against Rabies

Wu; Xianfu ;   et al.

Patent Application Summary

U.S. patent application number 14/978135 was filed with the patent office on 2016-04-28 for pan-lyssavirus vaccines against rabies. This patent application is currently assigned to The United States of America, as represented by the Secretary, Department of Health and Human Serv. The applicant listed for this patent is The United States of America, as represented by the Secretary, Department of Health and Human Serv, The United States of America, as represented by the Secretary, Department of Health and Human Serv. Invention is credited to Ivan V. Kuzmin, Charles E. Rupprecht, Xianfu Wu.

Application Number20160114026 14/978135
Document ID /
Family ID45372099
Filed Date2016-04-28

United States Patent Application 20160114026
Kind Code A1
Wu; Xianfu ;   et al. April 28, 2016

PAN-LYSSAVIRUS VACCINES AGAINST RABIES

Abstract

Described herein are recombinant rabies viruses encoding rabies virus glycoprotein and at least one heterologous glycoprotein from another lyssavirus, such as Mokola virus, Lagos bat virus and/or West Caucasian bat virus. In particular embodiments, the recombinant rabies virus includes two or three heterologous lyssavirus glycoproteins. The disclosed recombinant rabies viruses can be used as pan-lyssavirus vaccines to provide protection against lyssaviruses that cause rabies.


Inventors: Wu; Xianfu; (Atlanta, GA) ; Rupprecht; Charles E.; (Lawrenceville, GA) ; Kuzmin; Ivan V.; (Galveston, TX)
Applicant:
Name City State Country Type

The United States of America, as represented by the Secretary, Department of Health and Human Serv

Bethesda

MD

US
Assignee: The United States of America, as represented by the Secretary, Department of Health and Human Serv
Bethesda
MD

Family ID: 45372099
Appl. No.: 14/978135
Filed: December 22, 2015

Related U.S. Patent Documents

Application Number Filing Date Patent Number
13806622 Dec 21, 2012 9248179
PCT/US2011/041579 Jun 23, 2011
14978135
61358288 Jun 24, 2010

Current U.S. Class: 424/199.1 ; 435/235.1; 435/320.1; 435/352
Current CPC Class: C12N 2800/50 20130101; A61K 2039/70 20130101; C12N 15/86 20130101; C12N 2760/20143 20130101; C12N 2760/20152 20130101; C12N 7/00 20130101; C07K 14/005 20130101; A61K 2039/5256 20130101; C12N 2760/20122 20130101; C12N 2760/20121 20130101; A61K 39/12 20130101; C12N 2760/20134 20130101; A61K 39/205 20130101
International Class: A61K 39/205 20060101 A61K039/205; C07K 14/005 20060101 C07K014/005; C12N 7/00 20060101 C12N007/00

Claims



1. A recombinant rabies virus, the genome of which comprises rabies virus nucleoprotein (N), phosphoprotein (P), matrix protein (M), RNA-dependent RNA polymerase (L) and glycoprotein (G) genes and at least two different heterologous lyssavirus G genes, wherein the lyssavirus is selected from the group consisting of Lagos bat virus (LBV), Mokola virus (MOKV), Duvenhage virus (DUVV), European bat lyssavirus-1 (EBLV-1), European bat lyssavirus-2 (EBLV-2), Australian bat lyssavirus (ABLV), Aravan virus (ARAV), Khujand virus (KHUV), Irkut virus (IRKV) and West Caucasian bat virus (WCBV).

2. The recombinant rabies virus of claim 1, comprising two different heterologous lyssavirus G genes and the two heterologous G genes are MOKV and WCBV G genes.

3. The recombinant rabies virus of claim 2, wherein the nucleotide sequence of the MOKV G gene is at least 95% identical to the nucleotide sequence of SEQ ID NO: 47, the nucleotide sequence of the WCBV G gene is at least 95% identical to the nucleotide sequence of SEQ ID NO: 49, or both.

4. The recombinant rabies virus of claim 2, wherein the MOKV G gene comprises the nucleotide sequence of SEQ ID NO: 47, the WCBV G gene comprises the nucleotide sequence of SEQ ID NO: 49, or both.

5. The recombinant rabies virus of claim 2, wherein the two heterologous G genes are located between the rabies virus P and M genes and between the rabies virus G and L genes.

6. The recombinant rabies virus of claim 1, wherein the genome is derived from the rabies virus ERA strain.

7. The recombinant rabies virus of claim 1, wherein the rabies virus glycoprotein comprises a Glu at amino acid position 333 (SEQ ID NO: 5).

8. A vector comprising a full-length rabies virus antigenomic DNA, wherein the antigenomic DNA comprises rabies virus N, P, M, L and G genes, and at least two different heterologous lyssavirus G genes, wherein the lyssavirus is selected from LBV, MOKV, DUVV, EBLV-1, EBLV-2, ABLV, ARAV, KHUV, IRKV and WCBV.

9. The vector of claim 8, comprising two different heterologous lyssavirus G genes and the two heterologous G genes are MOKV and WCBV G genes.

10. The vector of claim 9, wherein the nucleotide sequence of the MOKV G gene is at least 95% identical to the nucleotide sequence of SEQ ID NO: 47, the nucleotide sequence of the WCBV G gene is at least 95% identical to the nucleotide sequence of SEQ ID NO: 49, or both.

11. The vector of claim 9, wherein the MOKV G gene comprises the nucleotide sequence of SEQ ID NO: 47, the WCBV G gene comprises the nucleotide sequence of SEQ ID NO: 49, or both.

12. The vector of claim 9, wherein the two heterologous G genes are located between the rabies virus P and M genes and between the rabies virus G and L genes.

13. The vector of claim 8, wherein the antigenomic DNA is derived from the rabies virus ERA strain.

14. A cell comprising the vector of claim 8.

15. A composition comprising the recombinant rabies virus of claim 1 and a pharmaceutically acceptable carrier.

16. A method of eliciting an immune response in a subject against lyssavirus, comprising administering to the subject the recombinant rabies virus of claim 1.

17. The method of claim 16, wherein the immune response in the subject against lyssavirus protects the subject against infection by at least three different genotypes of lyssavirus.
Description



CROSS REFERENCE TO RELATED APPLICATIONS

[0001] This is a continuation of U.S. patent application Ser. No. 13/806,622, filed Dec. 21, 2012, which is the U.S. National Stage of International Application No. PCT/US2011/041579, filed Jun. 23, 2011, published in English under PCT Article 21(2), which claims the benefit of U.S. Provisional Application No. 61/358,288, filed Jun. 24, 2010. All of the above-referenced applications are herein incorporated by reference in their entirety.

FIELD

[0002] This disclosure concerns recombinant rabies viruses and their use as pan-lyssavirus vaccines for protection against lyssavirus infections.

BACKGROUND

[0003] The genus Lyssavirus is a member of the Rhabdoviridae family within the order Mononegavirales (viruses with a single-stranded, negative sense genome). Lyssaviruses are the etiological agents of rabies encephalitis in warm-blooded animals and humans (Tordo et al., "Lyssaviruses" In Fauquet et al. eds. Virus taxonomy: the classification and nomenclature of viruses. The 8.sup.th Report of the International Committee on Taxonomy of Viruses. San Diego: Oxford Academic, 2006, pages 623-629; World Health Organization Expert Consultation on Rabies, 5-8 Oct. 2004, first report, World Health Organization Technical report series 931, Geneva: World Health Organization, 2005, pages 15-19). Lyssavirus species include rabies virus (RABV; genotype 1), Lagos bat virus (LBV; genotype 2), Mokola virus (MOKV; genotype 3), Duvenhage virus (DUVV; genotype 4), European bat lyssavirus-1 (EBLV-1; genotype 5), European bat lyssavirus-2 (EBLV-2; genotype 6), Australian bat lyssavirus (ABLV; genotype 7) and four additional species isolated from bats in central Asia and Russia (Aravan virus--ARAV; Khujand virus--KHUV; Irkut virus--IRKV; and West Caucasian bat virus--WCBV) (Kuzmin et al., Emerg. Infect. Dis. 14(12):1887-1889, 2008; Weyer et al., Epidemiol. Infect. 136:670-678, 2007; Kuzmin and Rupprecht, "Bat rabies" In Rabies, 2.sup.nd Edition, New York, Academic Press, 2007, pages 259-307, Jackson and Wunner, eds.).

[0004] Based on phylogeny, immunogenicity and virulence of lyssavirus isolates, two lyssavirus phylogroups have been proposed (Badrane et al., J. Virol. 75:3268-3276, 2001). The division into phylogroups generally correlates with the pattern of vaccine cross-protection observed for lyssaviruses (Badrane et al., J. Virol. 75:3268-3276, 2001; Hanlon et al., Virus Res. 111:44-54, 2005; Nel et al., Expert Rev. Vaccines 4:553-540, 2005). Phylogroupl includes genotypes 1, 4, 5, 6 and 7, as well as ARAV, KHUV and IRKV (Kuzmin et al., Virus Res. 97:65-79, 2003; Kuzmin et al., Virus Res. 111:28-43, 2005; Hanlon et al., Virus Res. 111:44-54, 2005). Currently available commercial vaccines and biologicals are considered to be effective against infections of viruses from this phylogroup (Nel et al., Expert Rev. Vaccines 4:553-540, 2005). However, these vaccines and biologics for rabies do not offer full protection against infection with viruses outside of lyssavirus phylogroup 1 (i.e., genotypes 2 and 3). In addition, WCBV is recognized as the most divergent lyssavirus and exhibits limited relatedness to genotype 2 and 3 viruses. Previous studies have demonstrated little or no cross-neutralization of anti-RABV sera with WCBV (Botvinkin et al., Emerg. Infect. Dis. 9:1623-1625, 2003; Hanlon et al., Virus Res. 111:44-54, 2005).

[0005] Thus, a need exists to develop a rabies vaccine that can protect against a broad spectrum of lyssaviruses, particularly WCBV and lyssaviruses of genotypes 2 and 3.

SUMMARY

[0006] Disclosed herein are recombinant rabies viruses having glycoprotein genes from at least two different lyssaviruses. The disclosed viruses can be used as pan-lyssavirus vaccines to provide protection against infection by multiple genotypes of lyssavirus.

[0007] Provided herein are recombinant rabies viruses. In some embodiments, the genome of the recombinant rabies virus includes rabies virus nucleoprotein (N), phosphoprotein (P), matrix protein (M), RNA-dependent RNA polymerase (L) and glycoprotein (G) genes and at least one, at least two or at least three different heterologous lyssavirus glycoprotein genes. In some embodiments, the lyssavirus is selected from LBV, MOKV, DUVV, EBLV-1, EBLV-2, ABLV, ARAV, KHUV, IRKV and WCBV. In particular embodiments, the lyssavirus is selected from LBV, MOKV and WCBV.

[0008] Further provided is a vector comprising a full-length rabies virus antigenomic DNA. In some embodiments, the antigenomic DNA includes rabies virus N, P, M, L and G genes, and the vector further includes at least one, at least two, or at least three different heterologous lyssavirus G genes. Also provided are cells comprising a rabies virus vector described herein.

[0009] Also provided are compositions comprising one or more recombinant rabies viruses described herein and a pharmaceutically acceptable carrier. Methods of eliciting an immune response in a subject against lyssavirus by administering to the subject one or more of the recombinant rabies viruses disclosed herein is further provided.

[0010] The foregoing and other objects, features, and advantages of the invention will become more apparent from the following detailed description, which proceeds with reference to the accompanying figures.

BRIEF DESCRIPTION OF THE FIGURES

[0011] FIG. 1A: Schematic illustration of the ERA transcription plasmid. Positions of the hammerhead ribozymes and antigenomic ERA genome are indicated graphically. Relative positions of the N, P, M G and L proteins are shown in a 5' to 3' direction.

[0012] FIG. 1B: Schematic diagram of the construction of the full-length ERA rabies virus genomic cDNA plasmid pTMF. RT-PCR products F1 and F2 fragments, and restriction enzyme recognition sites (Nhe1, Kpn1, Blp1, Pst1 and Not1) are shown. RdRz-hammerhead and HDVRz-hepatitis delta virus ribozymes are indicated. The diamond symbols indicate that Kpn1 or Pst1 sites were deleted, and the vertical arrows indicate that Nhe1 or Not1 sites were left intact.

[0013] FIG. 2: Schematic illustration of the proposed mechanism of NLST7 RNA polymerase autogene action by pNLST7 plasmids. The DNA-transfection reagent complex is taken into cells by endocytosis. The majority of the DNA released from lysosomes and endosomes is retained in the cell cytoplasm. A limited amount of plasmid is transferred to the nucleus: 1) through a CMV immediate early promoter, the NLST7 gene is transcribed by cellular RNA polymerase II; 2) mature NLST7 mRNA is transported from the nucleus to the cytoplasm for NLST7 RNA polymerase synthesis; 3) newly synthesized NLST7 RNA polymerase is translocated to the nucleus, while a trace amount of NLST7 remains in the cytoplasm; and 4) NLST7 RNA polymerase initiates transcription through a pT7 promoter. By posttranscriptional modifications, additional NLST7 mRNA is produced for protein synthesis, thus increasing virus recovery efficiency.

[0014] FIG. 3: Schematic diagram of ten derivative ERA virus genomes. The size of each gene is not drawn to scale. Symbol "*" denotes mutations of G at amino acid residue 333 (referred to herein as G333) and ".PSI." indicates the Psi-region.

[0015] FIG. 4: Schematic of the construction of ERA-3G. The G333 mutation is introduced into the ERA backbone and two transcriptional (trans) units are added. The transcriptional units are introduced between the P and M genes and between the G and L genes. The MOKV and WCBV G genes are cloned into the transcriptional units to form a recombinant ERA rabies virus with three glycoprotein genes (ERA-3G).

[0016] FIG. 5: Schematic of the construction of ERA-4G. The G333 mutation is introduced into the ERA backbone and three transcriptional (trans) units are added. The transcriptional units are introduced between the N and P genes, between the P and M genes, and between the G and L genes. The LBV, MOKV and WCBV G genes are cloned into the transcriptional units to form a recombinant ERA rabies virus with four glycoprotein genes (ERA-4G).

SEQUENCE LISTING

[0017] The nucleic and amino acid sequences listed in the accompanying sequence listing are shown using standard letter abbreviations for nucleotide bases, and three letter code for amino acids, as defined in 37 C.F.R. 1.822. Only one strand of each nucleic acid sequence is shown, but the complementary strand is understood as included by any reference to the displayed strand. The Sequence Listing is submitted as an ASCII text file, created on Dec. 17, 2015, 135 KB, which is incorporated by reference herein. In the accompanying sequence listing:

[0018] SEQ ID NO: 1 is the nucleotide sequence of recombinant rabies virus ERA recovered by reverse genetics. Nucleotides 4370-4372 of the recombinant virus have been changed (relative to the wild-type virus) from aga to gag, which introduces an Arg to Glu amino acid change in the G protein at residue 333.

[0019] SEQ ID NO: 2 is the amino acid sequence of the rabies virus ERA N protein.

[0020] SEQ ID NO: 3 is the amino acid sequence of the rabies virus ERA P protein.

[0021] SEQ ID NO: 4 is the amino acid sequence of the rabies virus ERA M protein.

[0022] SEQ ID NO: 5 is the amino acid sequence of the rabies virus ERA G protein mutated at amino acid position 333 (from Arg to Glu).

[0023] SEQ ID NO: 6 is the amino acid sequence of the rabies virus ERA L protein.

[0024] SEQ ID NO: 7 is the amino acid sequence of the wild-type rabies virus ERA G protein.

[0025] SEQ ID NOs: 8-11 are the nucleotide sequences of RT-PCR primers for amplification of full-length rabies virus genomic cDNA.

[0026] SEQ ID NOs: 12-15 are oligonucleotide sequences used to synthesize hammerhead and hepatitis delta virus ribozymes.

[0027] SEQ ID NOs: 16-40 are the nucleotide sequences of PCR primers.

[0028] SEQ ID NOs: 41 and 42 are the nucleotide sequences of transcription units for incorporating heterologous ORFs.

[0029] SEQ ID NOs: 43 and 44 are the nucleotide sequences of RT-PCR primers for amplification of the MOKV G gene.

[0030] SEQ ID NOs: 45 and 46 are the nucleotide sequences of RT-PCR primers for amplification of the WCBV G gene.

[0031] SEQ ID NOs: 47 and 48 are the nucleotide and amino acid sequences, respectively, of MOKV G.

[0032] SEQ ID NOs: 49 and 50 are the nucleotide and amino acid sequences, respectively, of WCBV G.

[0033] SEQ ID NOs: 51 and 52 are the nucleotide sequences of RT-PCR primers for amplification of the LBV G gene.

[0034] SEQ ID NOs: 53 and 54 are the nucleotide and amino acid sequences, respectively, of LBV G.

DETAILED DESCRIPTION

I. Abbreviations

[0035] ABLV Australian bat lyssavirus

[0036] ARAV Aravan virus

[0037] CMV cytomegalovirus

[0038] DFA direct fluorescent antibody

[0039] DUVV Duvenhage virus

[0040] EBLV-1 European bat lyssavirus-1

[0041] EBLV-2 European bat lyssavirus-2

[0042] ERA Evelyn-Rokitnicki-Abelseth

[0043] FFU focus-forming unit

[0044] G glycoprotein

[0045] i.m. intramuscular

[0046] IRES internal ribosome entry site

[0047] IRKV Irkut virus

[0048] KHUV Khujand virus

[0049] L RNA-dependent RNA polymerase

[0050] LBV Lagos bat virus

[0051] M matrix protein

[0052] MOKV Mokola virus

[0053] N nucleoprotein

[0054] NLS nuclear localization signal

[0055] ORF open reading frame

[0056] P phosphoprotein

[0057] PAGE polyacrylamide gel electrophoresis

[0058] RABV rabies virus

[0059] RNP ribonucleoprotein

[0060] RABV rabies virus

[0061] WCBV West Caucasian bat virus

II. Terms and Methods

[0062] Unless otherwise noted, technical terms are used according to conventional usage. Definitions of common terms in molecular biology may be found in Benjamin Lewin, Genes V, published by Oxford University Press, 1994 (ISBN 0-19-854287-9); Kendrew et al. (eds.), The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN 0-632-02182-9); and Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 1-56081-569-8).

[0063] In order to facilitate review of the various embodiments of the disclosure, the following explanations of specific terms are provided:

[0064] Adjuvant: A substance or vehicle that non-specifically enhances the immune response to an antigen. Adjuvants can include a suspension of minerals (alum, aluminum hydroxide, or phosphate) on which antigen is adsorbed; or water-in-oil emulsion in which antigen solution is emulsified in mineral oil (for example, Freund's incomplete adjuvant), sometimes with the inclusion of killed mycobacteria (Freund's complete adjuvant) to further enhance antigenicity. Immunostimulatory oligonucleotides (such as those including a CpG motif) can also be used as adjuvants (for example, see U.S. Pat. Nos. 6,194,388; 6,207,646; 6,214,806; 6,218,371; 6,239,116; 6,339,068; 6,406,705; and 6,429,199). Adjuvants also include biological molecules, such as co-stimulatory molecules. Exemplary biological adjuvants include IL-2, RANTES, GM-CSF, TNF-.alpha., IFN-.gamma., G-CSF, LFA-3, CD72, B7-1, B7-2, OX-40L and 41 BBL.

[0065] Administer: As used herein, administering a composition, such as a vaccine, to a subject means to give, apply or bring the composition into contact with the subject. Administration can be accomplished by any of a number of routes, such as, for example, topical, oral, subcutaneous, intramuscular, intraperitoneal, intravenous, intrathecal and intramuscular.

[0066] Animal: Living multi-cellular vertebrate organisms, a category that includes, for example, mammals and birds. The term mammal includes both human and non-human mammals. The term "animal" includes both human and veterinary subjects, for example, humans, non-human primates, dogs, cats, horses, raccoons, bats, rats, mice, foxes, squirrels, opossum, coyotes, wolves and cows.

[0067] Antibody: A protein (or protein complex) that includes one or more polypeptides substantially encoded by immunoglobulin genes or fragments of immunoglobulin genes. The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively.

[0068] The basic immunoglobulin (antibody) structural unit is generally a tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair having one "light" (about 25 kDa) and one "heavy" (about 50-70 kDa) chain. The N-terminus of each chain defines a variable region of about 100 to 110 or more amino acids primarily responsible for antigen recognition. The terms "variable light chain" (V.sub.L) and "variable heavy chain" (V.sub.H) refer, respectively, to these light and heavy chains.

[0069] As used herein, the term "antibody" includes intact immunoglobulins as well as a number of well-characterized fragments. For instance, Fabs, Fvs, and single-chain Fvs (SCFvs) that bind to target protein (or epitope within a protein or fusion protein) would also be specific binding agents for that protein (or epitope). These antibody fragments are as follows: (1) Fab, the fragment which contains a monovalent antigen-binding fragment of an antibody molecule produced by digestion of whole antibody with the enzyme papain to yield an intact light chain and a portion of one heavy chain; (2) Fab', the fragment of an antibody molecule obtained by treating whole antibody with pepsin, followed by reduction, to yield an intact light chain and a portion of the heavy chain; two Fab' fragments are obtained per antibody molecule; (3) (Fab').sub.2, the fragment of the antibody obtained by treating whole antibody with the enzyme pepsin without subsequent reduction; (4) F(ab').sub.2, a dimer of two Fab' fragments held together by two disulfide bonds; (5) Fv, a genetically engineered fragment containing the variable region of the light chain and the variable region of the heavy chain expressed as two chains; and (6) single chain antibody, a genetically engineered molecule containing the variable region of the light chain, the variable region of the heavy chain, linked by a suitable polypeptide linker as a genetically fused single chain molecule. Methods of making these fragments are routine (see, for example, Harlow and Lane, Using Antibodies: A Laboratory Manual, CSHL, New York, 1999).

[0070] Antibody binding affinity: The strength of binding between a single antibody binding site and a ligand (e.g., an antigen or epitope). The affinity of an antibody binding site X for a ligand Y is represented by the dissociation constant (K.sub.d), which is the concentration of Y that is required to occupy half of the binding sites of X present in a solution. A smaller K.sub.d indicates a stronger or higher-affinity interaction between X and Y and a lower concentration of ligand is needed to occupy the sites. In general, antibody binding affinity can be affected by the alteration, modification and/or substitution of one or more amino acids in the epitope recognized by the antibody paratope. Binding affinity can be measured using any technique known in the art, such as end-point titration in an Ag-ELISA assay.

[0071] Antigen: A compound, composition, or substance that can stimulate the production of antibodies or a T-cell response in an animal, including compositions that are injected or absorbed into an animal. An antigen reacts with the products of specific humoral or cellular immunity, including those induced by heterologous immunogens.

[0072] Antigenomic: In the context of a virus with a negative-strand RNA genome (such as the genome of a lyssavirus), "antigenomic" refers to the complement (positive strand) of the negative strand genome.

[0073] Attenuated: In the context of a live virus, such as a rabies virus, the virus is attenuated if its ability to infect a cell or subject and/or its ability to produce disease is reduced (for example, eliminated). Typically, an attenuated virus retains at least some capacity to elicit an immune response following administration to an immunocompetent subject. In some cases, an attenuated virus is capable of eliciting a protective immune response without causing any signs or symptoms of infection.

[0074] Epitope: An antigenic determinant. These are particular chemical groups, such as contiguous or non-contiguous peptide sequences, on a molecule that are antigenic, that is, that elicit a specific immune response. An antibody binds a particular antigenic epitope based on the three dimensional structure of the antibody and the matching (or cognate) three dimensional structure of the epitope.

[0075] Evelyn-Rokitnicki-Abelseth (ERA): The ERA strain of rabies virus was derived from the Street-Alabama-Dufferin (SAD) strain, first isolated from a rabid dog in Alabama (USA) in 1935. The ERA strain was derived after multiple passages of SAD rabies virus in mouse brains, baby hamster kidney (BHK) cells, and chicken embryos.

[0076] Fusion protein: A protein generated by expression of a nucleic acid sequence engineered from nucleic acid sequences encoding at least a portion of two different (heterologous) proteins. To create a fusion protein, the nucleic acid sequences must be in the same reading frame and contain no internal stop codons in that frame.

[0077] Heterologous: As used herein, a "heterologous nucleic acid sequence" is a nucleic acid sequence that is derived from a different source, species or strain. In some embodiments described herein, the heterologous nucleic acid sequence is a nucleic acid sequence encoding a glycoprotein from a lyssavirus other than rabies virus ERA. In the context of a recombinant ERA rabies virus, a heterologous nucleic acid sequence is any nucleic acid sequence that is not derived from the ERA rabies virus.

[0078] Immune response: A response of a cell of the immune system, such as a B-cell, T-cell, macrophage or polymorphonucleocyte, to a stimulus such as an antigen. An immune response can include any cell of the body involved in a host defense response, including for example, an epithelial cell that secretes an interferon or a cytokine. An immune response includes, but is not limited to, an innate immune response or inflammation. As used herein, a protective immune response refers to an immune response that protects a subject from infection (prevents infection or prevents the development of disease associated with infection).

[0079] Immunize: To render a subject protected from a disease (for example, an infectious disease), such as by vaccination.

[0080] Immunogen: A compound, composition, or substance which is capable, under appropriate conditions, of stimulating an immune response, such as the production of antibodies or a T-cell response in an animal, including compositions that are injected or absorbed into an animal. As used herein, an "immunogenic composition" is a composition comprising an immunogen.

[0081] Immunogenic composition: A composition useful for stimulating or eliciting a specific immune response (or immunogenic response) in a vertebrate. In some embodiments, the immunogenic composition includes a recombinant rabies virus, such as a recombinant rabies virus expressing one or more heterologous glycoproteins (such as the glycoproteins from MOKV, LBV or WCBV). In some embodiments, the immunogenic response is protective or provides protective immunity, in that it enables the animal to better resist infection with or disease progression from the pathogen against which the immunogenic composition is directed (e.g., rabies virus and other lyssaviruses). One specific example of a type of immunogenic composition is a vaccine.

[0082] In some embodiments, an "effective amount" or "immune-stimulatory amount" of an immunogenic composition is an amount which, when administered to a subject, is sufficient to engender a detectable immune response. Such a response may comprise, for instance, generation of antibodies specific to one or more of the epitopes provided in the immunogenic composition. Alternatively, the response may comprise a T-helper or CTL-based response to one or more of the epitopes provided in the immunogenic composition. In other embodiments, a "protective effective amount" of an immunogenic composition is an amount which, when administered to an animal, is sufficient to confer protective immunity upon the animal.

[0083] Inhibiting or treating a disease: Inhibiting the full development of a disease or condition, for example, in a subject who is at risk for a disease. One specific example of a disease is rabies. "Treatment" refers to a therapeutic intervention that ameliorates a sign or symptom of a disease or pathological condition after it has begun to develop. As used herein, the term "ameliorating," with reference to a disease, pathological condition or symptom, refers to any observable beneficial effect of the treatment. The beneficial effect can be evidenced, for example, by a delayed onset of clinical symptoms of the disease in a susceptible subject, a reduction in severity of some or all clinical symptoms of the disease, a slower progression of the disease, a reduction in the number of relapses of the disease, an improvement in the overall health or well-being of the subject, or by other parameters well known in the art that are specific to the particular disease.

[0084] Isolated: An "isolated" or "purified" biological component (such as a nucleic acid, peptide, protein, protein complex, or particle) has been substantially separated, produced apart from, or purified away from other biological components in the cell of the organism in which the component naturally occurs, that is, other chromosomal and extra-chromosomal DNA and RNA, and proteins. Nucleic acids, peptides and proteins that have been "isolated" or "purified" thus include nucleic acids and proteins purified by standard purification methods. The term also embraces nucleic acids, peptides and proteins prepared by recombinant expression in a host cell, as well as chemically synthesized nucleic acids or proteins. The term "isolated" or "purified" does not require absolute purity; rather, it is intended as a relative term. Thus, for example, an isolated biological component is one in which the biological component is more enriched than the biological component is in its natural environment within a cell, or other production vessel. Preferably, a preparation is purified such that the biological component represents at least 50%, such as at least 70%, at least 90%, at least 95%, or greater, of the total biological component content of the preparation.

[0085] Label: A detectable compound or composition that is conjugated directly or indirectly to another molecule to facilitate detection of that molecule. Specific, non-limiting examples of labels include fluorescent tags, enzymatic linkages, and radioactive isotopes.

[0086] Lyssavirus: A genus of viruses that is part of the Rhabdoviridae family within the order Mononegavirales (viruses with a single-stranded, negative sense genome). Lyssaviruses are the etiological agents of rabies encephalitis in warm-blooded animals and humans. Lyssavirus species include rabies virus (RABV; genotype 1), Lagos bat virus (LBV; genotype 2), Mokola virus (MOKV; genotype 3), Duvenhage virus (DUVV; genotype 4), European bat lyssavirus-1 (EBLV-1; genotype 5), European bat lyssavirus-2 (EBLV-2; genotype 6) Australian bat lyssavirus (ABLV; genotype 7) and four additional species isolated from bats in central Asia and Russia (Aravan virus--ARAV; Khujand virus--KHUV; Irkut virus--IRKV; and West Caucasian bat virus--WCBV) (Kuzmin et al., Emerg. Infect. Dis. 14(12):1887-1889, 2008; Weyer et al., Epidemiol. Infect. 136:670-678, 2007; Kuzmin and Rupprecht, "Bat rabies" In Rabies, 2.sup.nd Edition, New York, Academic Press, 2007, pages 259-307, Jackson and Wunner, eds.).

[0087] ORF (open reading frame): A series of nucleotide triplets (codons) coding for amino acids without any termination codons. These sequences are usually translatable into a peptide.

[0088] Operably linked: A first nucleic acid sequence is operably linked with a second nucleic acid sequence when the first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if the promoter affects the transcription or expression of the coding sequence. Generally, operably linked DNA sequences are contiguous and, where necessary to join two protein coding regions, in the same reading frame. If introns are present, the operably linked DNA sequences may not be contiguous.

[0089] Pharmaceutically acceptable carriers: The pharmaceutically acceptable carriers useful in this disclosure are conventional. Remington's Pharmaceutical Sciences, by E. W. Martin, Mack Publishing Co., Easton, Pa., 15th Edition (1975), describes compositions and formulations suitable for pharmaceutical delivery of one or more therapeutic compounds or molecules, proteins or antibodies that bind these proteins, viruses or vectors, and additional pharmaceutical agents.

[0090] In general, the nature of the carrier will depend on the particular mode of administration being employed. For instance, parenteral formulations usually comprise injectable fluids that include pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced salt solutions, aqueous dextrose, glycerol or the like as a vehicle. For solid compositions (for example, powder, pill, tablet, or capsule forms), conventional non-toxic solid carriers can include, for example, pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. In addition to biologically-neutral carriers, pharmaceutical compositions to be administered can contain minor amounts of non-toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, and pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate.

[0091] Plasmid: A circular nucleic acid molecule capable of autonomous replication in a host cell.

[0092] Polypeptide: A polymer in which the monomers are amino acid residues joined together through amide bonds. When the amino acids are alpha-amino acids, either the L-optical isomer or the D-optical isomer can be used, the L-isomers being preferred for many biological uses. The terms "polypeptide" or "protein" as used herein are intended to encompass any amino acid molecule and include modified amino acid molecules. The term "polypeptide" is specifically intended to cover naturally occurring proteins, as well as those which are recombinantly or synthetically produced.

[0093] Conservative amino acid substitutions are those substitutions that, when made, least interfere with the properties of the original protein, that is, the structure and especially the function of the protein is conserved and not significantly changed by such substitutions. Examples of conservative substitutions are shown below.

TABLE-US-00001 Original Residue Conservative Substitutions Ala Ser Arg Lys Asn Gln, His Asp Glu Cys Ser Gln Asn Glu Asp His Asn; Gln Ile Leu, Val Leu Ile; Val Lys Arg; Gln; Glu Met Leu; Ile Phe Met; Leu; Tyr Ser Thr Thr Ser Trp Tyr Tyr Trp; Phe Val Ile; Leu

[0094] Conservative substitutions generally maintain (a) the structure of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain.

[0095] Amino acids are typically classified in one or more categories, including polar, hydrophobic, acidic, basic and aromatic, according to their side chains. Examples of polar amino acids include those having side chain functional groups such as hydroxyl, sulfhydryl, and amide, as well as the acidic and basic amino acids. Polar amino acids include, without limitation, asparagine, cysteine, glutamine, histidine, selenocysteine, serine, threonine, tryptophan and tyrosine. Examples of hydrophobic or non-polar amino acids include those residues having nonpolar aliphatic side chains, such as, without limitation, leucine, isoleucine, valine, glycine, alanine, proline, methionine and phenylalanine. Examples of basic amino acid residues include those having a basic side chain, such as an amino or guanidino group. Basic amino acid residues include, without limitation, arginine, homolysine and lysine. Examples of acidic amino acid residues include those having an acidic side chain functional group, such as a carboxy group. Acidic amino acid residues include, without limitation aspartic acid and glutamic acid. Aromatic amino acids include those having an aromatic side chain group. Examples of aromatic amino acids include, without limitation, biphenylalanine, histidine, 2-napthylalananine, pentafluorophenylalanine, phenylalanine, tryptophan and tyrosine. It is noted that some amino acids are classified in more than one group, for example, histidine, tryptophan, and tyrosine are classified as both polar and aromatic amino acids. Additional amino acids that are classified in each of the above groups are known to those of ordinary skill in the art.

[0096] Substitutions which in general are expected to produce the greatest changes in protein properties will be non-conservative, for instance changes in which (a) a hydrophilic residue, for example, seryl or threonyl, is substituted for (or by) a hydrophobic residue, for example, leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an electropositive side chain, for example, lysyl, arginyl, or histadyl, is substituted for (or by) an electronegative residue, for example, glutamyl or aspartyl; or (d) a residue having a bulky side chain, for example, phenylalanine, is substituted for (or by) one not having a side chain, for example, glycine.

[0097] Promoter: A promoter is an array of nucleic acid control sequences which direct transcription of a nucleic acid. A promoter includes necessary nucleic acid sequences near the start site of transcription. A promoter also optionally includes distal enhancer or repressor elements. A "constitutive promoter" is a promoter that is continuously active and is not subject to regulation by external signals or molecules. In contrast, the activity of an "inducible promoter" is regulated by an external signal or molecule (for example, a transcription factor).

[0098] Purified: The term "purified" does not require absolute purity; rather, it is intended as a relative term. Thus, for example, a purified peptide, protein, virus, or other active compound is one that is isolated in whole or in part from naturally associated proteins and other contaminants. In certain embodiments, the term "substantially purified" refers to a peptide, protein, virus or other active compound that has been isolated from a cell, cell culture medium, or other crude preparation and subjected to fractionation to remove various components of the initial preparation, such as proteins, cellular debris, and other components.

[0099] Rabies: A viral disease that causes acute encephalitis (inflammation of the brain) in warm-blooded animals. Rabies is zoonotic (transmitted by animals), most commonly by a bite from an infected animal but occasionally by other forms of contact. Rabies is almost frequently fatal if post-exposure prophylaxis is not administered prior to the onset of severe symptoms. Rabies is caused by viruses of the Lyssavirus genus.

[0100] Rabies virus (RABV or RABV): A member of the Rhabdoviridae family having a non-segmented RNA genome with negative sense polarity. Rabies virus is the prototype of the Lyssavirus genus. The rabies virus Evelyn-Rokitnicki-Abelseth (ERA) strain is a strain derived from the Street-Alabama-Dufferin (SAD) strain, first isolated from a rabid dog in Alabama (USA) in 1935. The ERA strain was derived after multiple passages of SAD RABV in mouse brains, baby hamster kidney (BHK) cells, and chicken embryos. The complete genomic sequence of the ERA strain is disclosed in PCT Publication No. WO 2007/047459, and the sequence of the ERA strain recovered by reverse genetics is set forth herein as SEQ ID NO: 1.

[0101] Recombinant: A recombinant nucleic acid, protein or virus is one that has a sequence that is not naturally occurring or has a sequence that is made by an artificial combination of two otherwise separated segments of sequence. This artificial combination is often accomplished by chemical synthesis or, more commonly, by the artificial manipulation of isolated segments of nucleic acids, for example, by genetic engineering techniques. In some embodiments, recombinant rabies virus is generated using reverse genetics, such as the reverse genetics system described in PCT Publication No. WO 2007/047459. In some examples, the recombinant rabies viruses comprise one or more mutations in a viral virulence factors, such as glycoprotein. In other examples, the recombinant rabies viruses comprise a heterologous gene, such as a sequence encoding a glycoprotein from another lyssavirus (such as Mokola virus, West Caucasian bat virus or Lagos bat virus).

[0102] Reverse genetics: Refers to the process of introducing mutations (such as deletions, insertions or point mutations) into the genome of an organism or virus in order to determine the phenotypic effect of the mutation. For example, introduction of a mutation in a specific viral gene enables one to determine the function of the gene.

[0103] Sequence identity: The similarity between two nucleic acid sequences, or two amino acid sequences, is expressed in terms of the similarity between the sequences, otherwise referred to as sequence identity. Sequence identity is frequently measured in terms of percentage identity (or similarity or homology); the higher the percentage, the more similar the two sequences are.

[0104] Methods of alignment of sequences for comparison are well known in the art. Various programs and alignment algorithms are described in: Smith and Waterman (Adv. Appl. Math., 2:482, 1981); Needleman and Wunsch (J. Mol. Biol., 48:443, 1970); Pearson and Lipman (Proc. Natl. Acad. Sci., 85:2444, 1988); Higgins and Sharp (Gene, 73:237-44, 1988); Higgins and Sharp (CABIOS, 5:151-53, 1989); Corpet et al. (Nuc. Acids Res., 16:10881-90, 1988); Huang et al. (Comp. Appls. Biosci., 8:155-65, 1992); and Pearson et al. (Meth. Mol. Biol., 24:307-31, 1994). Altschul et al. (Nature Genet., 6:119-29, 1994) presents a detailed consideration of sequence alignment methods and homology calculations.

[0105] The alignment tools ALIGN (Myers and Miller, CABIOS 4:11-17, 1989) or LFASTA (Pearson and Lipman, 1988) may be used to perform sequence comparisons (Internet Program.COPYRGT. 1996, W. R. Pearson and the University of Virginia, "fasta20u63" version 2.0u63, release date December 1996). ALIGN compares entire sequences against one another, while LFASTA compares regions of local similarity. These alignment tools and their respective tutorials are available on the Internet at the NCSA website. Alternatively, for comparisons of amino acid sequences of greater than about 30 amino acids, the "Blast 2 sequences" function can be employed using the default BLOSUM62 matrix set to default parameters, (gap existence cost of 11, and a per residue gap cost of 1). When aligning short peptides (fewer than around 30 amino acids), the alignment should be performed using the "Blast 2 sequences" function, employing the PAM30 matrix set to default parameters (open gap 9, extension gap 1 penalties). The BLAST sequence comparison system is available, for instance, from the NCBI web site; see also Altschul et al., J. Mol. Biol., 215:403-10, 1990; Gish and States, Nature Genet., 3:266-72, 1993; Madden et al., Meth. Enzymol., 266:131-41, 1996; Altschul et al., Nucleic Acids Res., 25:3389-402, 1997; and Zhang and Madden, Genome Res., 7:649-56, 1997.

[0106] Orthologs (equivalent to proteins of other species) of proteins are in some instances characterized by possession of greater than 75% sequence identity counted over the full-length alignment with the amino acid sequence of specific protein using ALIGN set to default parameters. Proteins with even greater similarity to a reference sequence will show increasing percentage identities when assessed by this method, such as at least 80%, at least 85%, at least 90%, at least 92%, at least 95%, or at least 98% sequence identity. In addition, sequence identity can be compared over the full length of one or both binding domains of the disclosed fusion proteins.

[0107] When significantly less than the entire sequence is being compared for sequence identity, homologous sequences will typically possess at least 80% sequence identity over short windows of 10-20, and may possess sequence identities of at least 85%, at least 90%, at least 95%, or at least 99% depending on their similarity to the reference sequence. Sequence identity over such short windows can be determined using LFASTA; methods are described at the NCSA website. One of skill in the art will appreciate that these sequence identity ranges are provided for guidance only; it is entirely possible that strongly significant homologs could be obtained that fall outside of the ranges provided. Similar homology concepts apply for nucleic acids as are described for protein. An alternative indication that two nucleic acid molecules are closely related is that the two molecules hybridize to each other under stringent conditions.

[0108] Nucleic acid sequences that do not show a high degree of identity may nevertheless encode similar amino acid sequences, due to the degeneracy of the genetic code. It is understood that changes in nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences that each encode substantially the same protein.

[0109] Subject: Living multi-cellular vertebrate organisms, a category that includes both human and non-human mammals.

[0110] Therapeutically effective amount: A quantity of a specified agent sufficient to achieve a desired effect in a subject being treated with that agent. For example, this may be the amount of a recombinant rabies virus useful for eliciting an immune response in a subject and/or for preventing infection by rabies virus and other lyssaviruses. Ideally, in the context of the present disclosure, a therapeutically effective amount of a recombinant rabies virus is an amount sufficient to increase resistance to, prevent, ameliorate, and/or treat infection caused by one or more lyssaviruses in a subject without causing a substantial cytotoxic effect in the subject. The effective amount of a recombinant rabies virus useful for increasing resistance to, preventing, ameliorating, and/or treating infection in a subject will be dependent on, for example, the subject being treated, the manner of administration of the therapeutic composition and other factors. In some embodiments, the recombinant rabies viruses described herein comprise a nucleic acid sequence encoding one or more glycoproteins from a lyssavirus other than rabies virus ERA.

[0111] Vaccine: A preparation of immunogenic material capable of stimulating an immune response, administered for the prevention, amelioration, or treatment of infectious or other type of disease (such as cancer). The immunogenic material may include attenuated or killed microorganisms (such as attenuated viruses), or antigenic proteins, peptides or DNA derived from them. Vaccines may elicit both prophylactic (preventative) and therapeutic responses. Methods of administration vary according to the vaccine, but may include inoculation, ingestion, inhalation or other forms of administration. Inoculations can be delivered by any of a number of routes, including parenteral, such as intravenous, subcutaneous or intramuscular. Vaccines may be administered with an adjuvant to boost the immune response.

[0112] Vector: A nucleic acid molecule that can be introduced into a host cell, thereby producing a transformed host cell. A vector may include nucleic acid sequences that permit it to replicate in a host cell, such as an origin of replication (DNA sequences that participate in initiating DNA synthesis). A vector may also include one or more selectable marker genes and other genetic elements known in the art.

[0113] Virus: Microscopic infectious organism that reproduces inside living cells. A virus typically consists essentially of a core of nucleic acid (single- or double-stranded RNA or DNA) surrounded by a protein coat, and in some cases lipid envelope, and has the ability to replicate only inside a living cell. "Viral replication" is the production of additional virus by the occurrence of at least one viral life cycle.

[0114] Unless otherwise explained, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. The singular terms "a," "an," and "the" include plural referents unless context clearly indicates otherwise. Similarly, the word "or" is intended to include "and" unless the context clearly indicates otherwise. Hence "comprising A or B" means including A, or B, or A and B. It is further to be understood that all base sizes or amino acid sizes, and all molecular weight or molecular mass values, given for nucleic acids or polypeptides are approximate, and are provided for description. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present disclosure, suitable methods and materials are described below. In case of conflict, the present specification, including explanations of terms, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.

III. Overview of Several Embodiments

[0115] Disclosed herein are recombinant rabies viruses having glycoprotein (G) genes from at least two different lyssaviruses. The disclosed viruses can be used as pan-lyssavirus vaccines to provide protection against infection by multiple genotypes of lyssavirus. Prior to the present disclosure, no vaccines had been described that protect against West Caucasian bat virus and/or lyssaviruses of genotypes 2 (Lagos bat virus) and 3 (Mokola virus). Thus, the recombinant rabies viruses described herein represent a significant advance in the development of vaccines for the prevention of rabies.

[0116] The recombinant rabies viruses exemplified herein are generating using a previously described reverse genetics system based on the ERA strain of rabies virus (PCT Publication No. WO 2007/047459). However, other reverse genetics systems for rabies virus (see, for example, Ito et al., J. Virol. 75(19):9121-9128) could be used to generate recombinant viruses having multiple lyssavirus G genes.

[0117] Provided herein is a recombinant rabies virus, wherein the genome of the recombinant rabies virus comprises rabies virus nucleoprotein (N), phosphoprotein (P), matrix protein (M), RNA-dependent RNA polymerase (L) and glycoprotein (G) genes and at least one, at least two or at least three different heterologous lyssavirus glycoprotein genes, wherein the lyssavirus is selected from Lagos bat virus (LBV), Mokola virus (MOKV), Duvenhage virus (DUVV), European bat lyssavirus-1 (EBLV-1), European bat lyssavirus-2 (EBLV-2), Australian bat lyssavirus (ABLV), Aravan virus (ARAV), Khujand virus (KHUV), Irkut virus (IRKV) and West Caucasian bat virus (WCBV). In particular embodiments, the lyssavirus is selected from LBV, MOKV and WCBV.

[0118] In some embodiments, the recombinant rabies virus comprises two heterologous G genes. In particular examples, the two heterologous G genes are from MOKV and WCBV. In other examples, the two heterologous G genes are from LBV and MOKV. In yet other examples, the two heterologous G genes are from LBV and WCBV.

[0119] In some embodiments, the recombinant rabies virus comprises three heterologous G genes. In particular examples, the three heterologous G genes are from LBV, MOKV and WCBV.

[0120] In some embodiments in which the recombinant rabies virus comprises a MOKV G gene, the nucleotide sequence of the MOKV G gene is at least 80%, is at least 85%, at least 90%, at least 95%, at least 98% or at least 99% identical to the nucleotide sequence of SEQ ID NO: 47. In some embodiments in which the recombinant rabies virus comprises a WCBV G gene, the nucleotide sequence of the WCBV G gene at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% identical to the nucleotide sequence of SEQ ID NO: 49. In some embodiments in which the recombinant rabies virus comprises the LBV G gene, the nucleotide sequence of the LBV G gene is at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% identical to the nucleotide sequence of SEQ ID NO: 53.

[0121] In some examples, the MOKV G gene comprises the nucleotide sequence of SEQ ID NO: 47, the WCBV G gene comprises the nucleotide sequence of SEQ ID NO: 49 and/or the LBV G gene comprises the nucleotide sequence of SEQ ID NO: 53. In particular examples, the MOKV G gene consists of the nucleotide sequence of SEQ ID NO: 47, the WCBV G gene consists of the nucleotide sequence of SEQ ID NO: 49 and/or the LBV G gene consists of the nucleotide sequence of SEQ ID NO: 53.

[0122] The heterologous G genes can be cloned into the rabies virus genome in any suitable location, and in any order, to allow for expression of the heterologous proteins without altering expression of the endogenous rabies virus genes. In some embodiments, heterologous G genes are inserted between the rabies virus P and M genes, between the rabies virus G and L genes and/or between the rabies virus N and P genes. In particular examples, the recombinant rabies virus comprises two heterologous G genes and the heterologous G genes are located between the rabies virus P and M genes and between the G and L genes. In other examples, the recombinant rabies virus comprises three heterologous G genes and the three heterologous G genes are located between the rabies virus N and P genes, between the rabies virus P and M genes and between the rabies virus G and L genes.

[0123] Insertion of heterologous genes into the rabies virus genome can be facilitated by synthesizing a transcriptional unit. The transcriptional unit is inserted at the desired gene junction and the heterologous G gene is cloned into the transcriptional unit. In some embodiments, the nucleotide sequence of the transcriptional unit is at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% identical to SEQ ID NO: 42. In some examples, the transcriptional unit comprises the nucleotide sequence of SEQ ID NO: 42.

[0124] In some embodiments, the genome of the recombinant rabies virus is derived from the rabies virus ERA strain. In some embodiments, the nucleotide sequence of the ERA strain genome comprises a sequence that is at least 90%, at least 95%, at least 98% or at least 99% identical to SEQ ID NO: 1. In particular examples, the nucleotide sequence of the ERA strain genome comprises SEQ ID NO: 1.

[0125] In some embodiments, the recombinant rabies virus includes one or more attenuating mutations. In exemplary embodiments, the rabies virus glycoprotein comprises a Glu at amino acid position 333 (SEQ ID NO: 5).

[0126] Further provided is a vector comprising a full-length rabies virus antigenomic DNA, wherein the antigenomic DNA comprises rabies virus N, P, M, L and G genes, and wherein the vector further comprises at least one, at least two, or at least three different heterologous lyssavirus G genes, wherein the lyssavirus is selected from LBV, MOKV, DUVV, EBLV-1, EBLV-2, ABLV, ARAV, KHUV, IRKV and WCBV. In particular embodiments, the lyssavirus is selected from LBV, MOKV and WCBV.

[0127] In some embodiments, the vector comprises two different heterologous lyssavirus G genes. In particular examples, the two heterologous G genes are MOKV and WCBV G genes. In other examples, the two heterologous G genes are MOKV and LBV G genes. In other examples, the two heterologous G genes are LBV and WCBV G genes.

[0128] In some embodiments, the vector comprises three heterologous G genes. In particular examples, the three heterologous G genes are from LBV, MOKV and WCBV.

[0129] In some embodiments in which the vector comprises a MOKV G gene, the nucleotide sequence of the MOKV G gene is at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% identical to the nucleotide sequence of SEQ ID NO: 47. In some embodiments in which the vector comprises a WCBV G gene, the nucleotide sequence of the WCBV G gene is at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% identical to the nucleotide sequence of SEQ ID NO: 49. In some embodiments in which the vector comprises the LBV G gene, the nucleotide sequence of the LBV G gene is at least 80%, at least 85%, at least 90%, at least 95%, at least 98% or at least 99% identical to the nucleotide sequence of SEQ ID NO: 53.

[0130] In some examples, the MOKV G gene comprises the nucleotide sequence of SEQ ID NO: 47, the WCBV G gene comprises the nucleotide sequence of SEQ ID NO: 49 and/or the LBV G gene comprises the nucleotide sequence of SEQ ID NO: 53. In particular examples, the MOKV G gene consists of the nucleotide sequence of SEQ ID NO: 47, the WCBV G gene consists of the nucleotide sequence of SEQ ID NO: 49 and/or the LBV G gene consists of the nucleotide sequence of SEQ ID NO: 53.

[0131] The heterologous G genes can be cloned into the vector encoding the rabies virus genome in any suitable location, and in any order, to allow for expression of the heterologous proteins without altering expression of the endogenous rabies virus genes. In some embodiments, heterologous G genes are inserted between the rabies virus P and M genes, between the rabies virus G and L genes and/or between the rabies virus N and P genes. In particular examples, the recombinant rabies virus comprises two heterologous G genes and the heterologous G genes are located between the rabies virus P and M genes and between the G and L genes. In other examples, the recombinant rabies virus comprises three heterologous G genes and the three heterologous G genes are located between the rabies virus N and P genes, between the rabies virus P and M genes and between the rabies virus G and L genes.

[0132] In some embodiments, rabies virus antigenomic DNA inserted in the vector is derived from the rabies virus ERA strain. In some examples, the nucleotide sequence of the ERA strain antigenomic DNA comprises a sequence that is at least 90%, at least 95%, at least 98% or at least 99% identical to SEQ ID NO: 1. In particular examples, the nucleotide sequence of the ERA strain antigenomic DNA comprises SEQ ID NO: 1.

[0133] Further provided herein is a cell comprising one or more rabies virus vectors disclosed herein.

[0134] Also provided are compositions comprising the recombinant rabies viruses described herein and a pharmaceutically acceptable carrier. In some embodiments, the compositions further comprise an adjuvant.

[0135] Also contemplated are compositions comprising multiple recombinant rabies viruses, each encoding at least one heterologous G gene. In some embodiments, the compositions comprise (i) a first recombinant rabies virus, wherein the genome of the first recombinant rabies virus comprises a rabies virus G gene and at least one heterologous lyssavirus G gene; and (ii) a second recombinant rabies virus, wherein the genome of the second recombinant rabies virus comprises at least one G gene from a different lyssavirus (i.e. a lyssavirus G gene that is not in the first recombinant rabies virus); wherein the lyssavirus is selected from LBV, MOKV, DUVV, EBLV-1, EBLV-2, ABLV, ARAV, KHUV, IRKV and WCBV. In particular embodiments, the lyssavirus is selected from LBV, MOKV and WCBV. In some examples, the second recombinant rabies virus also includes a rabies virus G gene. In some examples, the first and/or second recombinant rabies virus comprises at least two heterologous G genes.

[0136] In some examples, the composition comprises (i) a first recombinant rabies virus, wherein the genome of the first recombinant rabies virus comprises a rabies virus G gene and a G gene from MOKV and WCBV; and (ii) a second recombinant rabies virus, wherein the genome of the second recombinant rabies virus comprises a G gene from LBV.

[0137] Further provided is a method of eliciting an immune response in a subject against lyssavirus by administering to the subject one or more recombinant rabies viruses or compositions disclosed herein. In some embodiments, the immune response in the subject against lyssavirus protects the subject against infection by at least three different genotypes of lyssavirus. In some embodiments, the immune response in the subject against lyssavirus protects the subject against infection by at least four different genotypes of lyssavirus. In some embodiments, the subject is a human. In other embodiments, the subject is a non-human animal.

IV. Determinants of Rabies Virus Pathogenicity

[0138] Rabies virus (RABV) is a rhabdovirus--a non-segmented RNA virus with negative sense polarity. Within the Rhabdoviridae family, rabies virus is the prototype of the Lyssavirus genus. Lyssaviruses are composed of two major structural components, a nucleocapsid or ribonucleoprotein (RNP), and an envelope in the form of a bilayer membrane surrounding the RNP core. The infectious component of all rhabdoviruses is the RNP core, which consists of the negative strand RNA genome encapsidated by nucleoprotein (N) in combination with RNA-dependent RNA-polymerase (L) and phosphoprotein (P). The membrane surrounding the RNP contains two proteins, the trans-membrane glycoprotein (G) and the matrix (M) protein, located at the inner site of the membrane. Thus, the viral genome codes for these five proteins: the three proteins in the RNP (N, L and P), the matrix protein (M), and the glycoprotein (G).

[0139] The molecular determinants of pathogenicity of various rabies virus strains have not been fully elucidated. RABV pathogenicity was attributed to multigenic events (Yamada et al., Microbiol. Immunol. 50:25-32, 2006). For example, some positions in the RABV genome if mutated, affect viral transcription or replication, reducing virulence. Mutations at serine residue 389 of the phosphorylation site in the N gene (Wu et al., J. Virol. 76:4153-4161, 2002) or GDN core sequence of the highly conserved C motif in the L gene (Schnell and Conzelmann, Virol. 214:522-530, 1995) dramatically reduced RABV transcription and replication.

[0140] The G protein, also referred to as spike protein, is involved in cell attachment and membrane fusion of RABV. The amino acid region at position 330 to 340 (referred to as antigenic site III) of the G protein has been identified as important for virulence of certain strains of RABV. Several studies support the concept that the pathogenicity of fixed RABV strains is determined by the presence of arginine or lysine at amino acid residue 333 of the glycoprotein (Dietzschold et al., Proc. Natl. Acad. Sci. USA 80: 70-74, 1983; Tuffereau et al., Virology 172: 206-212, 1989).

[0141] This phenomenon seems to apply at least to fixed rabies viruses such as CVS, ERA, PV, SAD-B19 and HEP-Flury strains (Anilionis et al., Nature 294:275-278, 1981; Morimoto et al., Virology 173:465-477, 1989). For example, rabies vaccine viruses possessing an amino acid differing from Arg at position 333 of the glycoprotein are described, for instance, in WO 00/32755 (describing RABV mutants in which all three nucleotides in the G protein Arg.sub.333 codon are altered compared to the parent virus, such that the Arg at position 333 is substituted with another amino acid); European Patent 350398 (describing an avirulent RABV mutant SAG1 derived from the Bern SAD strain of RABV, in which the Arg at position 333 of the glycoprotein has been substituted to Ser); and European patent application 583998 (describing an attenuated RABV mutant, SAG2, in which the Arg at position 333 in the G protein has been substituted by Glu).

[0142] Other strains, such as the RC-HL strain, possess an arginine residue at position 333 of the G, but do not cause lethal infection in adult mice (Ito et al., Microl. Immunol. 38:479-482, 1994; Ito et al., J. Virol. 75:9121-9128, 2001). As such, the entire G may contribute to the virulence of RABV, although the determinants or regions have not been fully elucidated.

[0143] The G gene encodes the only protein that induces viral neutralizing antibody. At least three states of RABV glycoprotein are known: the native state (N) being responsible for receptor binding; an active hydrophobic state (A) necessary in the initial step in membrane fusion process (Gaudin, J. Cell Biol. 150:601-612, 2000), and a fusion inactive conformation (I). Correct folding and maturation of the G protein play important roles for immune recognition. The three potential glycosylated positions in ERA G extracellular domain occur at Asn.sup.37, Asn.sup.247 and Asn.sup.319 residues (Wojczyk et al., Glycobiology. 8: 121-130, 1998). Nonglycosylation of G not only affects conformation, but also inhibits presentation of the protein at the cell surface.

[0144] It has been previously demonstrated (see PCT Publication No. WO 2007/047459, which is incorporated herein by reference) that expression of G enhances the anti-RABV immune response. In addition, introduction of an Arg to Glu mutation at amino acid position 333 of RABV ERA glycoprotein results in an attenuated virus (referred to as ERAg3). This attenuated virus is capable of eliciting significant titers of neutralizing antibodies in animals and conferring protection against wild-type virus challenge. Furthermore, as described in PCT Publication No. WO 2007/047459, a recombinant RABV comprising two copies of glycoprotein with the G333 mutation is particularly useful as a vaccine due to its ability to elicit high titers of neutralizing antibodies without morbidity or mortality. In some examples herein, a recombinant rabies virus comprising the G333 mutation in glycoprotein is used as a platform to introduce one or more (such as one, two or three) additional G genes from one or more different genotypes of lyssavirus. However, one of ordinary skill in the art will recognize that any one of a number of recombinant rabies viruses can be used to incorporate heterologous sequences using the reverse genetics systems disclosed in PCT Publication No. WO 2007/047459 (or another rabies virus reverse genetics system) as summarized below.

V. Rabies Virus Reverse Genetics System

[0145] RNA cannot readily be manipulated directly by molecular biological methods. Traditional RNA virus vaccines are from naturally attenuated isolates, which are difficult to control and provide unpredictable results. Reverse genetics technology makes it possible to manipulate RNA viruses as DNA, which can be mutated, deleted or reconstructed according to deliberate designs. Every gene function can be studied carefully, independently, and in concert, which benefits vaccine development. Reverse genetics involves reverse transcription of the RNA viral genome into cDNA, and cloning into a vector, such as a plasmid. After transfection of host cells, the vector is transcribed into RNA, to be encapsidated by viral structural proteins, which can also be supplied by plasmids. The encapsidated RNA forms a ribonucleoprotein complex, which results in virions that can be recovered.

[0146] An efficient reverse genetics system based on the rabies virus ERA strain is described in PCT Publication No. WO 2007/047459, which is incorporated herein by reference. This rabies reverse genetics system is useful for a variety of purposes, including to attenuate ERA virus in a defined manner for vaccine development and to produce ERA virus vectors for expression of heterologous proteins, such as a protein from another lyssavirus for the generation of a pan-lyssavirus vaccine.

[0147] The reverse genetics system disclosed in PCT Publication No. WO 2007/047459 has some or all of the following characteristics, illustrated schematically in FIG. 1A using the exemplary ERA strain antigenomic cDNA.

[0148] The rabies virus reverse genetics system is based on a full length transcription plasmid plus a plurality of helper plasmids (e.g., five helper plasmids). The helper plasmids encode the N, P and L proteins, and optionally the G protein, as well as the T7 polymerase. Although the G protein is not necessary for virus rescue, it improves virus recovery efficiency or virus budding when included in transfection.

[0149] Transcription involves both cellular RNA dependent RNA polymerase II, which is available in mammalian cells, and T7 RNA polymerase, which is supplied by pNLST7 plasmids. The dual polymerases result in virus recovery efficiency that is both high and stable.

[0150] In the transcription plasmid, hammerhead and hepatitis delta virus ribozymes flank a rabies virus (e.g., ERA strain) antigenomic cDNA, enabling the production of authentic 5' and 3' ends of antigenomic viral RNA by transcription. The first ten nucleotides of the hammerhead sequence are designed to be complementary to the first ten nucleotides of the antisense genomic sequence.

[0151] Two modified T7 RNA polymerase constructs support virus recovery more efficiently than the wild type T7 RNA polymerase applied previously. One T7 RNA polymerase has been mutated from the first ATG to AT. The second T7 RNA polymerase has an eight amino acid nuclear localization signal (NLS) derived from the SV40 virus large T antigen fused after the first ATG from the parental T7. Addition of the NLS results in the T7 RNA polymerase being present predominantly in the nucleus. Following transfection mechanism of the NLS modified plasmid, the DNA/transfection reagent complex binds to the surface of the cell. Through endocytosis, the complex is taken into the endosome/lysosome, and the DNA is released into the cytosol. In the absence of the NLS, the majority of the transfected plasmids are retained in the cytosol and only a small percentage of the released DNA reaches the nucleus, where it is transcribed into RNA. After protein synthesis, the NLST7 RNA polymerase is transported back to the cell nucleus, and the helper plasmids (with T7/CMV promoters) in the nucleus will be transcribed by both NLST7 and cellular polymerase II. Thus, more mRNAs of the helper plasmids and cRNA of the full-length pTMF or its derivatives are synthesized and result in high efficiency of virus recovery.

[0152] After the initial expression of NLST7 by the CMV promoter, NLST7 polymerase binds to pT7 for transcription of the NLST7 gene. Through modification of the transcripts in the nucleus, more NLST7 mRNA is synthesized, resulting in greater expression of NLST7 polymerase. The pT7 of the NLST7 polymerase as well as of the full length antigenomic transcription unit is under the control of the NLST7 polymerase, which acts as an "autogene." The autogene mechanism of NLST7 RNA polymerase is illustrated in FIG. 2. After expression of T7 RNA polymerase in the nucleus, the transfected T7 constructs continue to transcribe full length RNA template for N protein encapsidation and/or L protein binding, enhancing virus recovery efficiency.

[0153] The T7 polymerase, and all other plasmids, except the N protein encoding plasmid pTN, are placed under control of both CMV and T7 transcriptional regulatory elements. The N protein encoding nucleic acid is under the control of a T7 promoter and is translated in cap-independent manner based on an IRES (internal ribosome entry site). Cellular RNA polymerase II alone can help the recovery of RABV if all the plasmids were cloned under the control of the CMV promoter. In the ERA reverse genetics system disclosed in PCT Publication No. WO 2007/047459, only pTN is under the control of the T7 promoter and is translated in a cap-independent manner. All other constructs are under control of both CMV and the T7 transcriptional regulatory elements. Typically, in RABV, N synthesis is abundant and the ratio among N, P and L is approximately 50:25:1. To mimic the wild type viral transcription and assembly in RABV reverse genetics, N expression should be the highest. With the aid of NLST7 polymerase and IRES translation mode, N protein is expressed efficiently after plasmid transfection. This reduces competition for transcription with housekeeping genes in host cells, because the T7 transcription initiation signal does not exist in mammalian cells, and results in increased efficiency of T7 transcription.

[0154] In addition, as described in PCT Publication No. WO 2007/047459, to enhance production of viral proteins, the helper plasmids can be constructed to incorporate a Kozak sequence that has been optimized for the translation efficiency for each protein encoding sequence. After five days post-transfection in the ERA reverse genetics system, the rescued viruses reliably and repeatably grew to 10.sup.7 FFU/ml without further amplification.

[0155] Recombinant rabies viruses with favorable properties for vaccination can be designed using, for example, the reverse genetics system disclosed in PCT Publication No. WO 2007/047459. Modified strains having mutated glycoproteins are particularly suited for use as immunogenic compositions. This RABV reverse genetics system also enables a rabies virus vector system for foreign (heterologous) gene expression. An extra transcription unit was previously demonstrated to be functional in two different locations after incorporation into the RABV ERA genome. Thus, the RABV reverse genetics system provides a means for introducing heterologous proteins. In some examples, the heterologous protein is a glycoprotein from a lyssavirus other than the RABV ERA strain.

VI. Administration and Use of Recombinant Rabies Virus Compositions

[0156] The recombinant rabies viruses provided herein comprise at least one heterologous nucleic acid sequence encoding a glycoprotein from a lyssavirus other than RABV ERA. The immunogenic compositions provided herein are designed to provide protection to multiple lyssavirus genotypes, and in some cases, provide protection against all 11 known lyssavirus genotypes. The immunogenic compositions provided herein are contemplated for use with both human and non-human animals.

[0157] The immunogenic formulations may be conveniently presented in unit dosage form and prepared using conventional pharmaceutical techniques. Such techniques include the step of bringing into association the active ingredient and the pharmaceutical carrier(s) or excipient(s). In general, the formulations are prepared by uniformly and intimately bringing into association the active ingredient with liquid carriers. Formulations suitable for parenteral administration include aqueous and non-aqueous sterile injection solutions which may contain anti-oxidants, buffers, bacteriostats and solutes which render the formulation isotonic with the blood of the intended recipient; and aqueous and non-aqueous sterile suspensions which may include suspending agents and thickening agents. The formulations may be presented in unit-dose or multi-dose containers, for example, sealed ampoules and vials, and may be stored in a freeze-dried (lyophilized) condition requiring only the addition of a sterile liquid carrier, for example, water for injections, immediately prior to use. Extemporaneous injection solutions and suspensions may be prepared from sterile powders, granules and tablets commonly used by one of ordinary skill in the art.

[0158] In certain embodiments, unit dosage formulations are those containing a dose or unit, or an appropriate fraction thereof, of the administered ingredient. It should be understood that in addition to the ingredients particularly mentioned above, formulations encompassed herein may include other agents commonly used by one of ordinary skill in the art.

[0159] The compositions provided herein, including those for use as immunogenic compositions, may be administered through different routes, such as oral, including buccal and sublingual, rectal, parenteral, aerosol, nasal, intramuscular, subcutaneous, intradermal, and topical. They may be administered in different forms, including but not limited to solutions, emulsions and suspensions, microspheres, particles, microparticles, nanoparticles, and liposomes. In some embodiments, the immunogenic compositions are administered orally.

[0160] The volume of administration will vary depending on the route of administration. Those of ordinary skill in the art will know appropriate volumes for different routes of administration.

[0161] Administration can be accomplished by single or multiple doses. The dose administered to a subject in the context of the present disclosure should be sufficient to induce a beneficial therapeutic response over time, such as to prevent lyssavirus infection or the development of rabies. The dose required may vary depending on, for example, the age, weight and general health of the subject.

[0162] The amount of immunogenic composition in each dose is selected as an amount that induces an immunostimulatory response without significant, adverse side effects. Such amount will vary depending upon which specific composition is employed and how it is administered. Initial doses may range from about 1 .mu.g to about 1 mg, with some embodiments having a range of about 10 .mu.g to about 800 and still other embodiments a range of from about 25 .mu.g to about 500 .mu.g. Following an initial administration of the immunogenic composition, subjects may receive one or several booster administrations, adequately spaced. Booster administrations may range from about 1 .mu.g to about 1 mg, with other embodiments having a range of about 10 .mu.g to about 750 .mu.g, and still others a range of about 50 .mu.g to about 500 .mu.g. Periodic boosters at intervals of 1-5 years, for instance three years, may be desirable to maintain the desired levels of protective immunity. In preferred embodiments, subjects receive a single dose of an immunogenic composition.

[0163] Provided herein are pharmaceutical compositions (also referred to as immunogenic or immunostimulatory compositions) which include a therapeutically effective amount of a recombinant RABV alone or in combination with a pharmaceutically acceptable carrier. In some embodiments, the recombinant RABV comprises a heterologous protein, such as glycoprotein from another lyssavirus that causes rabies.

[0164] Pharmaceutically acceptable carriers include, but are not limited to, saline, buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. The carrier and composition can be sterile, and the formulation suits the mode of administration. The composition can also contain minor amounts of wetting or emulsifying agents, or pH buffering agents. The composition can be a liquid solution, suspension, emulsion, tablet, pill, capsule, sustained release formulation, or powder. The composition can be formulated as a suppository, with traditional binders and carriers such as triglycerides. Oral formulations can include standard carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, and magnesium carbonate. Any of the common pharmaceutical carriers, such as sterile saline solution or sesame oil, can be used. The medium can also contain conventional pharmaceutical adjunct materials such as, for example, pharmaceutically acceptable salts to adjust the osmotic pressure, buffers, preservatives and the like. Other media that can be used with the compositions and methods provided herein are normal saline and sesame oil.

[0165] The recombinant RABVs described herein can be administered alone or in combination with other therapeutic agents to enhance antigenicity. For example, the recombinant viruses can be administered with an adjuvant, such as Freund incomplete adjuvant or Freund's complete adjuvant.

[0166] Optionally, one or more cytokines, such as IL-2, IL-6, IL-12, RANTES, GM-CSF, TNF-.alpha., or IFN-.gamma., one or more growth factors, such as GM-CSF or G-CSF; one or more molecules such as OX-40L or 41 BBL, or combinations of these molecules, can be used as biological adjuvants (see, for example, Salgaller et al., 1998, J. Surg. Oncol. 68(2):122-38; Lotze et al., 2000, Cancer J. Sci. Am. 6(Suppl 1):S61-6; Cao et al., 1998, Stem Cells 16(Suppl 1):251-60; Kuiper et al., 2000, Adv. Exp. Med. Biol. 465:381-90). These molecules can be administered systemically (or locally) to the host.

[0167] A number of means for inducing cellular responses, both in vitro and in vivo, are known. Lipids have been identified as agents capable of assisting in priming CTL in vivo against various antigens. For example, as described in U.S. Pat. No. 5,662,907, palmitic acid residues can be attached to the alpha and epsilon amino groups of a lysine residue and then linked (for example, via one or more linking residues, such as glycine, glycine-glycine, serine, serine-serine, or the like) to an immunogenic peptide. The lipidated peptide can then be injected directly in a micellar form, incorporated in a liposome, or emulsified in an adjuvant. As another example, E. coli lipoproteins, such as tripalmitoyl-S-glycerylcysteinlyseryl-serine can be used to prime tumor specific CTL when covalently attached to an appropriate peptide (see, Deres et al., Nature 342:561, 1989). Further, as the induction of neutralizing antibodies can also be primed with the same molecule conjugated to a peptide which displays an appropriate epitope, two compositions can be combined to elicit both humoral and cell-mediated responses where that is deemed desirable.

[0168] The following examples are provided to illustrate certain particular features and/or embodiments. These examples should not be construed to limit the disclosure to the particular features or embodiments described.

EXAMPLES

Example 1

Construction of Plasmids for a Reverse Genetics System for Rabies Virus

[0169] This example describes the design and development of a reverse genetics system for rabies virus. Rabies virus strain ERA was obtained from the ATCC and was prepared as described (Wu et al., J. Virol. 76, 4153-4161, 2002). To obtain virus genome full-length virus cDNA, BSR cells (a clone of baby hamster kidney, BHK, cells) were infected with ERA strain virus and grown in Dulbecco's minimal essential medium supplemented with 10% of fetal bovine serum. Supernatants were recovered and subjected to centrifugation at 22,000 g for 1 hour. The virus pellets were collected for viral genomic RNA purification by use of a RNA virus extraction kit purchased from Qiagen (Valencia, Calif.) according to the manufacturer's instructions. The integrity of viral genomic RNA was confirmed by gel electrophoresis. Viral genomic cDNA was transcribed with the first-strand cDNA synthesis kit from Life Technologies (Carlsbad, Calif.). The reverse transcription (RT) reaction mixture was applied to amplification by the polymerase chain reaction (PCR) for the synthesis of full-length viral genomic cDNA, N, P, G and L genes, respectively. For assembling the full-length virus genomic cDNA, a pTMF plasmid was constructed in four sequential steps as illustrated schematically in FIG. 1B. Superscript III reverse transcriptase and proof reading platinum pfx polymerase (Life Technologies, Carlsbad, Calif.) were applied for cDNA transcript synthesis and consecutive PCR amplifications. For reverse transcription reactions, 1 .mu.g of purified genomic RNA was used in the RT reaction mix and incubated at 50.degree. C. for 80 min, followed by heating at 85.degree. C. for 5 minutes to inactivate Superscript III. After the RT reaction, 1 unit of RNaseH was added to digest template RNA in the cDNA-RNA hybrids.

[0170] To generate full-length virus genomic cDNA, two overlapping fragments were amplified by RT-PCR as follows: Fragment1 (F1) was RT-PCR amplified with primers: Le5-Kpn (CCGGGTACCACGCTTAAC AACCAGATCAAAGA; SEQ ID NO: 8, Kpn1 recognition site shown in bold) and Le3-Blp (TAGGTCGCTTGCTAAGCACTCCTGGTAGGAC; SEQ ID NO: 9, Blp1 recognition site shown in bold). Fragment 2 (F2) was RT-PCR amplified with primers: Tr5-Blp (GTCCTACCAGGAGTGCTTAGCAAGCGACCTA; SEQ ID NO: 10, Blp1 recognition site shown in bold) and Tr3-Pst (AAAACTGCAGACGCTTAACAAATAAACAACAAAA; SEQ ID NO: 11, Pst1 recognition site shown in bold). After successful synthesis of the above two fragments, F1 digested by Kpn1 and Blp1 restriction enzymes was subjected to gel purification and cloned to pBluescriptIISK(+) phagemid (Stratagene, La Jolla, Calif.) to form the pSKF1 plasmid. The gel purified F2 fragment, cut by Blp1 and Pst1 was consecutively cloned to the pSKF1 plasmid to form the full-length viral antigenomic cDNA. Hammerhead ribozyme (oligo1, CAAGGCTAGCTGTTAAGCGTCTGATGAGTCCGTGAGGACGAAACTATAGGAAAGGAA TTCCTATAGTCGGTACCACGCT; SEQ ID NO: 12, Nhe1 and Kpn1 recognition sites shown in bold; oligo2, AGCGTGGTACCGACTATAGGAATTCCTTTCCTATAGTTTCGTCCTCACGGACTCATCAG ACGCTTAACAGCTAGCCTTG; SEQ ID NO: 13, Kpn1 and Nhe1 recognition sites shown in bold) was synthesized containing a Nhe1 recognition site at the 5' end and a Kpn1 site at the 3' end. This was fused ahead of the 5' end of the F1 fragment. A hepatitis delta virus ribozyme (oligo3, GACCTGCAGGGGTCGGCATGGCATCTCCACCTCCTCGCGGTCCGACCTGGGCATCCGA AGGAGGACGCACGTCCACTCGGATGGCTAAGGGAGGGCGCGGCCGCACTC; SEQ ID NO: 14, Pst1 and Not1 recognition sites shown in bold; oligo4, GAGTGCGGCCGCGCCCTCCCTTAGCCATCCGAGTGGACGTGCGTCCTCCTTCGGATGC CCAGGTCGGACCGCGAGGAGGTGGAGATGCCATGCCGACCCCTGCAGGTC; SEQ ID NO: 15, Not1 and Pst1 recognition sites shown in bold) (Symons, Annu. Rev. Biochem. 61: 641-671, 1992) was synthesized, having a Pst1 site at its 5' end and a Not1 site at its 3' end, and was fused to the 3' end of the F2 fragment. The connective Kpn1 recognition site, between the hammerhead ribozyme and the F1 fragment, and the Pst1 site between the F2 fragment and the hepatitis delta virus ribozyme, were deleted by site-directed mutagenesis. The full-length viral antigenomic cDNA was sandwiched by the hammerhead and hepatitis delta virus ribozymes. This was removed and cloned to the pBluescriptIISK(+) phagemid to make a pSKF construct. The full viral antigenomic cDNA with two ribozymes was fused downstream of the T7 transcription initiation site under control of the CMV immediate-early promoter in pcDNA3.1/Neo (+) plasmid (Life Technologies, Carlsbad, Calif.). This last step finished the construction of the pTMF plasmid.

[0171] The wild type ERA viral genome includes a polyA tract of eight residues (polyA.sub.8) in the intergenic region between the G and Psi regions. To distinguish the rescued ERA (rERA) virus from the parental strain, a stretch of seven A (polyA.sub.7) was introduced to the pTMF construct by deletion of one A instead of the original polyA.sub.8. After rERA virus was recovered, RT-PCR was performed and subsequent sequence data confirmed the existence of the introduced poly A.sub.7 sequence marker.

[0172] pTN plasmid: The N gene was amplified by RT-PCR with primers (5N: ACCACCATGGATGCCGACAAGATTG; SEQ ID NO: 16, Nco1 recognition site and start codon shown in bold; and 3N: GGCCCATGGTTATGAGTCACTCGAATATGTCTT; SEQ ID NO: 17, Nco1 recognition site and stop codon shown in bold) and cloned to the pCITE-2a(+) (Cap-Independent Translation Enhancer) plasmid (Novagen, Madison Wis.).

[0173] pMP plasmid: the P gene was amplified by RT-PCR with primers (5P: TTGGTACCACCATGAGCAAGATCTTTGTCAATC; SEQ ID NO: 18, Kpn1 recognition site and start codon shown in bold; and 3P: GGAGAGGAATTCTTAGCAAGATGTATAGCGATTC; SEQ ID NO: 19, EcoR1 recognition site and stop codon shown in bold) and cloned to the pcDNA3.1/Neo (+) plasmid.

[0174] pMG plasmid: the G gene was amplified by RT-PCR with primers (5G: TTGGTACCACCATGGTTCCTCAGGCTCTCCTG; SEQ ID NO: 20, Kpn1 recognition site and start codon shown in bold; and 3G: AAAACTGCAGTCACAGTCTGGTCTCACCCCCAC; SEQ ID NO: 21, Pst1 recognition site and stop codon shown in bold) and cloned to the pcDNA3.1/Neo (+) plasmid.

[0175] pML plasmid: the L gene was amplified by RT-PCR with primers (5L: ACCGCTAGCACCACCATGCTCGATCCTGGAGAGGTC; SEQ ID NO: 22, Nhe1 recognition site and start codon shown in bold; and 3L: AAAACTGCAGTCACAGGCAACTGTAGTCTAGTAG; SEQ ID NO: 23, Pst1 recognition site and stop codon shown in bold) and cloned to the pcDNA3.1/Neo (+) plasmid.

[0176] pT7 plasmid: genomic DNA from bacteria BL-21 (Novagene, Madison, Wis.) was extracted with the Dneasy Tissue Kit (Qiagen, Valencia, Calif.) according to the manufacturer's instructions. The T7 RNA polymerase gene was amplified from the purified genomic DNA by PCR with primers (5T7: TCGCTAGCACCACCATGAACACGATTAACATCGCTAAG; SEQ ID NO: 24, Nhe1 recognition site and start codon shown in bold; and 3T7: GATGAATTCTTACGCGAACGCGAAGTCCGACTC; SEQ ID NO: 25, EcoR1 recognition site and stop codon shown in bold) and cloned to the pcDNA3.1/Neo (+) plasmid.

[0177] pNLST7 plasmid: an eight amino acid nuclear location signal (NLS), derived from SV40 large T antigen, was added to the N terminus of the T7 RNA polymerase by PCR amplification, using the pT7 plasmid as the template, with primers (5T7NLS: TCGCTAGCCACCATGCCAAAAAAGAAGAGAAAGGTAGAAAACACGATTAACATCGCT AAGAAC; SEQ ID NO: 26, NLS shown in bold and 3T7 primer). The amplified fragment was designated NLST7, and was cloned to pcDNA3.1/Neo (+) to form the pNLST7 construct.

[0178] pGFP plasmid: Monster Green Fluorescent Protein (GFP) plasmid phMGFP was purchased from Promega (Madison, Wis.). The GFP gene was amplified by PCR with primers (GFP5: AAAACTGCAGGCCACCATGGGCGTGATCAAG; SEQ ID NO: 27, Pst1 recognition site and start codon shown in bold; and GFP3: CCGCTCGGTACCTATTAGCCGGCCTGGCGGG; SEQ ID NO: 28, Kpn1 recognition site and stop codon shown in bold) and cloned to the pcDNA3.1/Neo (+) plasmid.

[0179] All plasmid constructs were sequenced at least three times to confirm the absence of unexpected mutations or deletions after cloning, site-directed mutagenesis, or gene deletion. Additionally, the presence of a marker sequence consisting of a polyA tract having seven adenosine residues rather than the eight residues observed in the wild type ERA genome between the glycoprotein and Psi region was confirmed.

Example 2

Defined Modification of Rabies Virus Evelyn-Rokitnicki-Abelseth (ERA) Strain

[0180] In addition to the parental ERA virus strain described above, derivative virus strains were developed using the reverse genetics system disclosed herein. Several exemplary modified viruses were produced, namely ERA--(deletion of the whole psi-region), ERAgreen1 (green florescent protein gene inserted in ERA viral genome psi region), ERAgreen2 (green florescent protein gene inserted in phosphoprotein and matrix protein intergenic region), ERA2g (containing an extra copy of glycoprotein in the psi-region), ERAg3 (with a mutation at amino acid 333 in glycoprotein), ERA2g3 (with an extra copy of mutated glycoprotein at Aa333 in psi-region), ERA-G (with glycoprotein deleted) ERAgm (M and G genes switched in the genome), and ERAgmg (two copies of G in the rearranged ERAgm construct) These derivatives are illustrated schematically in FIG. 3. By optimizing the growth conditions as described, all of the rescued viruses can be obtained at virus titers of 10.sup.9 to 10.sup.10 ffu/ml in both tissue culture flasks and bioreactors.

Gene Deletion and Site-Directed Mutagenesis in the Reverse Genetics System

[0181] Deletion of the Psi Region of the Rabies Virus ERA Genome

[0182] The complete Psi-region of the rabies virus ERA genome was deleted as follows: 3'.DELTA..psi. fragment was amplified using pTMF as template by PCR with primers (5.DELTA..psi.: CCCTCTGCAGTTTGGTACCGTCGAGAAAAAAACATTAGATCAGAAG; SEQ ID NO: 29, Pst1 and Kpn1 recognition sites shown in bold; and Le3-Blp primer) and was cloned to pCR-BluntII-TOPO vector (Life Technologies, Carlsbad, Calif.) for the construction of pP.DELTA.5.psi. plasmid. The 5'.DELTA..psi. fragment was amplified using the same template by PCR with primers (SnaB5: ATGAACTTTCTACGTAAGATAGTG; SEQ ID NO: 30, SnaB1 recognition site shown in bold; and 3.DELTA..psi.: CAAACTGCAGAGGGGTGTTAGTTTTTTTCAAAAAGAACCCCCCAAG; SEQ ID NO: 31, Pst1 recognition site shown in bold) was successively cloned to the above pP.DELTA.5.psi. plasmid to finish the construction of the pP.DELTA..psi. plasmid. The fragment recovered by SnaB1 and Pst1 restriction enzyme digestion from the pP.DELTA..psi. plasmid substituted the counterpart in the pSKF construct to make the pSKF.DELTA..psi. plasmid. The whole DNA fragment containing the ERA genomic cDNA, digested by Nhe1 and Not1 from pSKF.DELTA..psi. plasmid, was re-cloned to the pcDNA3.1/Neo (+) plasmid to finalize the construction of pTMF.DELTA..psi.. For verification of the rescued strain lacking Psi, designated Era-, primers covering the Psi-region were applied in RT-PCR with total RNA from ERA-infected BSR cells. A 400 bp fragment corresponding to the Psi region was amplified only from rERA virus, but not from ERA. Sequence data verified the complete deletion of the Psi-region.

[0183] Deletion of the Glycoprotein Gene in the Rabies Virus ERA Genome:

[0184] The 5'g.DELTA..psi. fragment was amplified using pSKF as template by PCR with primers (SnaB5 primer, and 3.DELTA.g: CAAACTGCAGAGGGGTGTTAGTTTTTTTCACATCCAAGAGGATC; SEQ ID NO: 32). After digestion by SnaB1 and Pst1 restriction enzymes, this recovered fragment was cloned to replace its counterpart in the pSKF.DELTA..psi. construct. The 3'g.DELTA..psi. fragment was amplified using the same template by PCR with primers (5.DELTA.g: CCTCTGCAGTTTGGTACCTTGAAAAAAACCTGGGTTCAATAG; SEQ ID NO: 33, and Le3-Blp primer), and was consecutively cloned to the modified pSKF.DELTA..psi., to replace its counterpart. The final fragment, recovered by SnaB1 and Blp1 restriction enzymes cut from this pSKF.DELTA..psi. without the G gene, was re-cloned to pcDNA3.1/Neo (+) plasmid to form the pTMF.DELTA.g construct for virus recovery.

[0185] Glycoprotein Gene Site-Directed Mutagenesis:

[0186] Site directed mutagenesis to introduce a three nucleotide change from AGA to GAG at amino acid position 333 of the glycoprotein was performed as previously described (Wu et al., J. Virol. 76: 4153-4161, 2002). The primers in the mutagenesis reaction were M5G primer: CTCACTACAAGTCAGTCGAGACTTGGAATGAGATC (SEQ ID NO: 34, the three mutated nucleotides shown in bold) and M3G primer: GACTGACTTTGAGTGAGCATCGGCTTCCATCAAGG (SEQ ID NO: 35). For the recovered strain (ERAg3), three nucleotide changes from AGA to GAG at amino acid position 333 (aa333) were confirmed by sequencing after RT-PCR with primers 5G and 3G. After confirmation by DNA sequencing, the mutated G was cloned back to the pTMF plasmid to make the pTMFg3 construct for virus recovery. The glycoprotein encoded by this mutated G gene is represented by SEQ ID NO: 7.

Incorporation of an Exogenous ORF into ERA Rabies Virus Genome

[0187] To express exogenous ORFs in RABV, an extra transcription unit with Pst1 and Kpn1 recognition sites were created and incorporated at the Psi or P-M gene intergenic regions, respectively. In brief, for creation of an extra transcription unit at the Psi-region, the same steps were followed, except for the 5.DELTA..psi. fragment amplification step, the 3.DELTA..psi. primer was changed to 3.DELTA..psi.cis: CCAAACTGCAGCGAAAGGAGGGGTGTTAGTTTTTTTCATGATGAACCCCCCAAGGGGA GG (SEQ ID NO: 36). The final construct without the Psi-region, but with an extra transcription unit, was designated as pMTF.DELTA..psi.cis. The GFP, ERA G, or mutated G at amino acid residues 333 were cloned to this transcriptional unit to form pMTFgfp1, pMTF2g, pMTFg3, pMTF2g3 constructs, respectively, for virus rescue.

[0188] To incorporate an extra transcription unit to the P-M intergenic region, the cisp5 fragment was amplified using pMTF as template with primers cis55: GACTCACTATAGGGAGACCCAAGCTGGCTAGCTGTTAAG (SEQ ID NO: 37), cis53: CCAAACTGCAGCGAAAGGAGGGGTGTTAGTTTTTTTCATGTTGACTTTAGGACATCTCG G (SEQ ID NO: 38), and was cloned in substitution of its counterpart in the pMTF plasmid. The cisp3 fragment was amplified and cloned in a similar way with primers cis35: CCTTTCGCTGCAGTTTGGTACCGTCGAGAAAAAAACAGGCAACACCACTGATAAAATG AAC (SEQ ID NO: 39) and cis33: CCTCCCCTTCAAGAGGGCCCCTGGAATCAG (SEQ ID NO: 40). After assembling the cisp5 and cisp3 fragments together, the final construct was designated as pMTFcisp, for accepting ORFs. The recombinant construct containing the GFP gene was named pTMFgfp2 for virus recovery.

[0189] To produce an ERA derivative, designated ERAgm, in which the glycoprotein encoding sequence was reversed in order with the matrix protein encoding sequence, the glycoprotein gene was deleted as described above. The G gene (amplified as disclosed above) was then inserted between P and M genes, yielding a rabies virus genome in the order of N-P-G-M-L. Similarly, the same strategy was applied to produce the ERAg3m derivative, in which the glycoprotein has a triple nucleotide mutation at 333 amino acid residue (from AGA to GAG) by substituting the G gene produced by site directed mutagenesis as described above. To produce the ERAgmg construct, an extra copy of glycoprotein gene was inserted between P and M genes, and made the rabies virus genome in the order of N-P-G-M-G-L.

[0190] An extra transcription unit was modified and incorporated into two different regions of the ERA genome, namely psi-region and P-M intergenic region. When heterologous ORFs are incorporated into these transcription units, designated trans 1 and trans 2, respectively, efficient production of the encoded product results.

Sequence of the transcription unit is: CTAACACCCCTCCTTTCGCTGCAGTTTGGTACCGTCGAGAAAAAAA (SEQ ID NO: 41, Pst1 and Kpn1 were underlined).

Example 3

Recovery of Parental and Derivative Viruses

[0191] This example describes the recovery of parental ERA virus and exemplary derivatives using the reverse genetics system disclosed herein. BSR cells were transfected at near 80% confluence in six-well-plates with viral full length transcription plasmid pTMF (pTMF.DELTA..psi., pTMFg3, pTMF2g, pTMF2g3, pTMFgfp1, pTMFgfp2, pTMF.DELTA.g, pTMFgm, or pTMFgmg, respectively) at 3 .mu.g/well, together with five helper plasmids: pTN (1 .mu.g/well), pMP (0.5 .mu.g/well), pML (0.5 .mu.g/well), pMG (0.5 .mu.g/well) and pNLST7 (1 .mu.g/well) by TransIT-LT1 reagent (Mirus, Madison, Wis.) following the protocol recommended by the manufacturer. Four days after transfection, 1 ml of fresh BSR cell suspension (about 5.times.10.sup.5 cells) was added to each well. Cells were incubated at 37.degree. C., 5% CO.sub.2 for 3 days. Cell supernatants were collected for virus titration.

[0192] To titrate the recovered virus, monolayers of BSR cells in LAB-TEK eight-well-plates (Naperville, Ill.) were infected with serial 10-fold dilutions of virus supernatant and incubated at 37.degree. C., 0.5% CO.sub.2 for 48 h. Cells were fixed in 80% chilled acetone at room temperature for 1 h and stained with FITC-labeled anti-rabies virus N monoclonal antibody at 37.degree. C. for 30 minutes. After three rinses of the plates with PBS, stained foci were counted using direct fluorescent microscopy. Details for direct RABV fluorescent assay (DFA) can be found on the World Wide Web at cdc.gov/ncidod/dvrd/rabies/professional/publications/DFA-diagnosis/DFA_pr- otocol.htm.

[0193] All of the viruses except ERA-G were recovered at high titer from cultured BSR cells as indicated in Table 1. Surprisingly, rearrangement and switching of the G gene with the M gene did not hinder recovery of recombinant derivative ERA virus. Rearrangement of the G gene in the RABV genomes was previously not believed feasible due to cell death from overexpression of G protein (Faber et al., J. Virol. 76:3374-3381, 2002). However, these results demonstrate that rearrangement is possible in the ERA strain. Accordingly, it is likely that RABV gene shuffling is possible not only for the G gene, but also for other genes as well.

[0194] The ERA-G (without G) virus was recovered after plasmid transfection following the same procedure as for the other viral constructs rescue, but virus foci were very limited and restrained in local areas after the first round of transfection. The rescued virus was not capable of spreading further to the nearby healthy BSR cells even after one week of incubation at 37.degree. C., 5% CO.sub.2. Infection of normal BSR cells with the above transfection supernatants presented single cell staining in the DFA test, which suggested the recovered virus was incapable of spread. The ERA-G virus was amplified using a BHK cell line that constitutively expresses ERA G (PCT Publication No. WO 2007/047459). By indirect fluorescent assay screening, a pool of BHK cells expressing G were selected and maintained for amplification of ERA-G virus. With the aid of the BHK-G cell line, ERA-G virus grew to 10.sup.7 ffu/ml. Total RNA from ERA-G virus-infected BHK-G cells was extracted for Northern blot analysis with a G gene probe. The G gene was absent in the viral genomic RNA, however G mRNA was detected, which came from infected supportive BHK-G cells. In purified ERA-G viral genomic RNA, no hybridization signal was detected by G probe, indicating the deletion of the G gene in the ERA genome.

Example 4

Growth of Rescued ERA Virus and its Derivatives to High Titer in a Bioreactor

[0195] In oral vaccine development, high virus titer is typically required to elicit reliable immunity after administration. This example demonstrates that the ERA virus and derivatives can be grown to high titer in a bioreactor at volumes applicable to commercial scale-up. All 10 rescued ERA viruses were amplified in a bioreactor, CELLine AD1000 (IBS Integra Bioscience, Chur, Switherland) to titers ranging from 10.sup.7 to 10.sup.10 ffu/ml. In brief, BSR cells were transfected with the exemplary antigenome transcription vectors and helper vectors, as described above. Cells were inoculated at a multiplicity of infection of 1 virion per cell, at a concentration of 10.sup.6 cells/ml in one tenth the bioreactor vessel volume. Transfected cells were grown at 37.degree. C., 5% CO.sub.2 in DMEM supplemented with 10% fetal bovine serum. The supernatant was harvested every three to five days for between two and three harvests. The deficient ERA-G grew less well compared with other viruses, with only 10.sup.8 ffu/ml for the ERA-G (Table 1).

TABLE-US-00002 TABLE 1 Full-length plasmid constructs and corresponding rescued viruses Titers Titers Plasmid Rescued ffu/ml from ffu/ml in constructs viruses cultured cells bioreactors pTMF rERA 5 .times. 10.sup.7 .sup. 3 .times. 10.sup.10 pTMF.DELTA..psi. ERA- 6.3 .times. 10.sup.7 .sup. 3.2 .times. 10.sup.10 pTMFg3 ERAg3 3 .times. 10.sup.6 1.8 .times. 10.sup.9 pTMFgfp1 ERAgreen1 3.5 .times. 10.sup.6 5.6 .times. 10.sup.9 pTMFgfp2 ERAgreen2 2 .times. 10.sup.7 6.2 .times. 10.sup.9 pTMF2g ERA2g 1.6 .times. 10.sup.6 3.9 .times. 10.sup.9 pTMF2g3 ERA2g3 8 .times. 10.sup.7 4.6 .times. 10.sup.9 pTMF.DELTA.g ERA-G 1.2 .times. 10.sup.2 1.5 .times. 10.sup.7 pTMFgm ERAgm 5.31 .times. 10.sup.6 1.9 .times. 10.sup.9 pTMFgmg ERAgmg 3.1 .times. 10.sup.6 1.2 .times. 10.sup.9

Example 5

Expression of Exogenous Proteins from Extra Transcriptional Units in Rabies Virus

[0196] This example demonstrates the expression of recombinant proteins from a heterologous ORF inserted into a rabies virus vector. In this example, the ERA virus vector is used as a prototype rabies virus vector. To construct ERA virus as a vector for accepting ORFs, a conservative RABV transcriptional unit between the N and P genes was modified and introduced into the ERA genome at two different locations: 1) at the psi region (trans 1), and 2) at the P-M intergenic region (trans 2). The transcriptional unit was designed to possess two unique restriction enzyme recognition sites to facilitate introduction of heterologous polynucleotide sequences (TTTTTTTGATTGTGGGGAGGAAAGCGACGTCAAACCATGGCAGCTCTTTTTTT; SEQ ID NO: 42, Pst1 and Kpn1 sites shown in bold).

[0197] In a first example, the GFP gene was cloned into this unit for virus recovery, since GFP expression could be observed directly under a UV microscope when the transfected BSR cells were still incubating. Expression of the GFP protein was directly visible by fluorescent microscopy with an excitation filter of 470.+-.20 nm. The ERAgreen2 (GFP gene inserted after P gene in RABV genome-trans 2)-infected cells showed clear green foci after three days of plasmid transfection, while ERAgreen1 (GFP gene inserted after G gene in the "traditional" .PSI. region-trans 1) did not present obvious green foci until five days post-transfection. The introduced transcriptional unit was functional in the RABV genome at both locations, although expression and accumulation was apparent more rapidly when GFP was expressed from trans 2. Thus, these results also indicate that the level of expression from a heterologous ORF can be modulated by selecting the transcription unit into which the ORF is cloned.

[0198] In other examples, 1) an additional copy of ERA G; or 2) an additional copy of ERA G with an amino acid substitution at position 333, was incorporated into the ERA viral genome. The successfully rescued viruses were named ERA2g and ERA2g3, respectively. Since quantitation of viral G expression was not practical, the relative increase in expression levels of G in ERA2g and ERA2g3-infected cells was confirmed by Northern-blot with a G probe. In brief, the ERA G gene probe was labeled using the Dig DNA Labeling Kit (Roche, Indianapolis, Ind.) and imaged with Dig Nucleic Acid Detection Kit (Roche, Indianapolis, Ind.) and was measured by density spectrophotometry. The tandem linked G genes in the recovered viruses were also confirmed by RT-PCR with 5G and 3G primers. A predominant band indicating a single G copy was observed at 1.5 kb. In addition, a second weaker band was observed at approximately 3.0 kb indicative of the two Gs in a tandem arrangement.

[0199] These results demonstrate that introduction of transcription units into the ERA genome can be used to express diverse heterologous proteins from introduced ORFs. Furthermore, expression of the protein encoded by the heterologous ORF is modulated by the position into which the ORF is inserted. Thus, ERA virus is a widely adaptable vector for the expression of recombinant proteins.

Example 6

Construction and Characterization of Recombinant Rabies Virus with Three Glycoprotein Genes

[0200] This example describes the generation and characterization of a recombinant ERA strain rabies virus encoding three different glycoprotein genes. The recombinant virus, referred to as ERA-3G, comprises rabies virus glycoprotein, Mokola virus (MOKV) glycoprotein and West Caucasian bat virus (WCBV) glycoprotein. The cloning strategy for ERA-3G is shown in FIG. 4. The rabies virus reverse genetics system used to generate this virus in described in the Examples above. ERA-3G includes the attenuating mutation in the glycoprotein gene that results in an arginine to glutamic acid change at amino acid residue 433 of the protein (SEQ ID NO: 5).

[0201] The G genes from MOKV and WCBV were cloned into the ERA backbone by RT-PCR using viral genomic RNA as template from virus-infected cells. The following primers were used for amplification of the glycoprotein genes:

TABLE-US-00003 MokolaG5 - (SEQ ID NO: 43) CGACTGCAGATGAATATACCTTGCTTTGTTGTGATTC MokolaG3 - (SEQ ID NO: 44) CGTGGTACCTCATGTACCTGGAAGCCCTTTATAGGACTC WCBVG5 - (SEQ ID NO: 45) CATCTGCTAGCAATGGCTTCCTACTTTGCGTTG WCBVG3 - (SEQ ID NO: 46) TTCAATGGTACCTTATTGGGCAGTTTGTCCCTT

[0202] The amplified G genes for MOKV (SEQ ID NO: 47) and WCBV (SEQ ID NO: 49) were confirmed by sequencing. Two extra transcription units were synthesized (each with the sequence of SEQ ID NO: 42) and introduced into the gene junctions between the phosphoprotein (P) and the matrix protein (M), and the G and the RNA dependent RNA polymerase (L) (FIG. 4). The MOKV G was cloned into the gene junction between the P and M, and WCBV G into the gene junction between the G and L in the ERA genome backbone.

[0203] Recombinant virus was recovered by transfection of the above described construct into BSR cells using the method described in Example 3. Approximately 5-7 days following transfection, BSR cells were fixed for DFA staining using FITC-conjugated anti-rabies antibodies.

[0204] The recovered ERA-3G virus was characterized with a one-step growth curve in BSR cells. Virus titer was evaluated at 24, 48, 72, 96 and 120 hours after infection. At the 72, 96 and 120 hour time points, virus titer in bioreactor incubation ranged from 10.sup.8 to 10.sup.9 focus forming unit (ffu)/ml.

[0205] ERA-3G virus was then tested in a hamster model to determine whether vaccination with ERA-3G provides protection against challenge with RABV, LBV, MOKV and/or WCBV. Nine hamsters were vaccinated (i.m.) with either ERA-3G, RabAvert.TM. (Chiron Corporation, Emeryville, Calif.), or IMRAB.TM. (Merial, Duluth, Ga.). RabAvert.TM. was administered on days 0, 7 and 14, while ERA-3G and IMRAB.TM. were administered on day 0. Animals were challenged with RABV, LBV, MOK or WCBV on day 22. Control animals received no vaccine. The results of the challenge experiment are shown in Table 2.

TABLE-US-00004 TABLE 2 Survivorship of hamsters after pre-exposure vaccination and i.m. challenge with several lyssaviruses Group RABV (I-151) LBV (SA) MOK (SA) WCBV RabAvert .TM. 9/9 0/9 0/9 5/9 IMRAB .TM. 9/9 1/9 0/9 3/9 ERA-3G 9/9 1/9 9/9 9/9 Control 0/9 0/9 0/9 1/9

[0206] The results demonstrate that ERA-3G provides complete protection against RABV, MOK and WCBV. In contrast, the currently available vaccines RabAvert.TM. and IMRAB.TM., provide protection only against RABV.

[0207] For animal vaccine development, ERA-3G will be adapted to growth in chicken embryo fibroblast (CEF) and Vero cells. It is believed that ERA-3G will grow to high titers ranging from 10.sup.8 to 10.sup.9 ffu/ml in the BSR cells for animal vaccine development. For human vaccine development, ERA-3G will be adapted to CEF and Vero cells. It is believed that ERA-3G titers in the CEF and BSR cells after adaptation will be comparable to virus growth in BSR cells. The purity of ERA-3G will be verified, and the seed virus will be prepared for industrial production. Potential mycoplasma contamination will be tested using a standard PCR method.

Example 7

Construction and Characterization of Recombinant Rabies Virus with Four Glycoprotein Genes

[0208] This example describes the generation and characterization of a recombinant ERA strain rabies virus encoding three different glycoprotein genes. The recombinant virus, referred to as ERA-4G, comprises rabies virus glycoprotein, Lagos bat virus (LBV) glycoprotein, MOKV glycoprotein and WCBV glycoprotein. The cloning strategy for ERA-4G is shown in FIG. 5. The rabies virus reverse genetics system used to generate this virus in described in the Examples above. ERA-4G includes the attenuating mutation in the G gene that results in an arginine to glutamic acid change at amino acid residue 433 of the protein (SEQ ID NO: 5).

[0209] The G genes from LBV, MOKV and WCBV were cloned into the ERA backbone by RT-PCR using viral genomic RNA as template from virus-infected cells. The following primers were used for amplification of the glycoprotein genes:

TABLE-US-00005 LagosG5 - (SEQ ID NO: 51) CGACTGCAGATGAGTCAACTAAATTTGATACCCTTTTTC LagosG3 - (SEQ ID NO: 52) CCGTACGTATCAGACATTAGAGGTACCCTTATAAGATTCCCA MokolaG5 - (SEQ ID NO: 43) CGACTGCAGATGAATATACCTTGCTTTGTTGTGATTC MokolaG3 - (SEQ ID NO: 44) CGTGGTACCTCATGTACCTGGAAGCCCTTTATAGGACTC WCBVG5 - (SEQ ID NO: 45) CATCTGCTAGCAATGGCTTCCTACTTTGCGTTG WCBVG3 - (SEQ ID NO: 46) TTCAATGGTACCTTATTGGGCAGTTTGTCCCTT

[0210] The amplified G genes for LBV (SEQ ID NO: 53), MOKV (SEQ ID NO: 47) and WCBV (SEQ ID NO: 49) were confirmed by sequencing. Three extra transcription units were synthesized (each with the sequence of SEQ ID NO: 42) and introduced into the gene junctions between the N and P genes, between the P and M genes, and the G and L genes (FIG. 5). The LBV G was cloned into the gene junction between N and P, MOKV G was cloned into the gene junction between P and M, and WCBV G was cloned into the gene junction between the G and L in the ERA genome backbone.

[0211] Recombinant virus was recovered by transfection of the above described construct into BSR cells using the method described in Example 3. Approximately 5-7 days following transfection, BSR cells were fixed for DFA staining using FITC-conjugated anti-rabies antibodies.

[0212] The recovered ERA-4G virus was characterized with a one-step growth curve in BSR cells. Virus titer was determined at 24, 48, 72, 96 and 120 hours after infection. The results are shown in Table 3 below.

TABLE-US-00006 TABLE 3 Growth of ERA-4G in BSR cells Timepoint (h) 24 48 72 96 120 Titer (ffu/ml) 1 .times. 10.sup.3 5 .times. 10.sup.3 1.2 .times. 10.sup.5 1.3 .times. 10.sup.7 3.2 .times. 10.sup.5

[0213] ERA-4G virus will be tested in a hamster model to determine whether vaccination with ERA-4G confers protection against challenge with lyssaviruses RABV, LBV, MOKV and WCBV. The vaccination and challenge experiment will be performed as described for ERA-3G in Example 6.

[0214] For animal vaccine development, ERA-4G will be adapted to growth in chicken embryo fibroblast (CEF) and Vero cells. It is believed that ERA-4G will grow to high titers ranging from 10.sup.8 to 10.sup.9 ffu/ml in the BSR cells for animal vaccine development. For human vaccine development, ERA-4G will be adapted to CEF and Vero cells. It is believed that ERA-4G titers in the CEF and BSR cells after adaptation will be comparable to virus growth in BSR cells. The purity of ERA-4G will be verified, and the seed virus will be prepared for industrial production. Potential mycoplasma contamination will be tested using a standard PCR method.

[0215] In view of the many possible embodiments to which the principles of the disclosed invention may be applied, it should be recognized that the illustrated embodiments are only preferred examples of the invention and should not be taken as limiting the scope of the invention. Rather, the scope of the invention is defined by the following claims. We therefore claim as our invention all that comes within the scope and spirit of these claims.

Sequence CWU 1

1

54111930DNAArtificial SequenceRecombinant ERA rabies virus genome 1acgcttaaca accagatcaa agaaaaaaca gacattgtca attgcaaagc aaaaatgtaa 60cacccctaca atg gat gcc gac aag att gta ttc aaa gtc aat aat cag 109 Met Asp Ala Asp Lys Ile Val Phe Lys Val Asn Asn Gln 1 5 10 gtg gtc tct ttg aag cct gag att atc gtg gat caa cat gag tac aag 157Val Val Ser Leu Lys Pro Glu Ile Ile Val Asp Gln His Glu Tyr Lys 15 20 25 tac cct gcc atc aaa gat ttg aaa aag ccc tgt ata acc cta gga aag 205Tyr Pro Ala Ile Lys Asp Leu Lys Lys Pro Cys Ile Thr Leu Gly Lys 30 35 40 45 gct ccc gat tta aat aaa gca tac aag tca gtt ttg tca ggc atg agc 253Ala Pro Asp Leu Asn Lys Ala Tyr Lys Ser Val Leu Ser Gly Met Ser 50 55 60 gcc gcc aaa ctt gat cct gac gat gta tgt tcc tat ttg gca gcg gca 301Ala Ala Lys Leu Asp Pro Asp Asp Val Cys Ser Tyr Leu Ala Ala Ala 65 70 75 atg cag ttt ttt gag ggg aca tgt ccg gaa gac tgg acc agc tat gga 349Met Gln Phe Phe Glu Gly Thr Cys Pro Glu Asp Trp Thr Ser Tyr Gly 80 85 90 atc gtg att gca cga aaa gga gat aag atc acc cca ggt tct ctg gtg 397Ile Val Ile Ala Arg Lys Gly Asp Lys Ile Thr Pro Gly Ser Leu Val 95 100 105 gag ata aaa cgt act gat gta gaa ggg aat tgg gct ctg aca gga ggc 445Glu Ile Lys Arg Thr Asp Val Glu Gly Asn Trp Ala Leu Thr Gly Gly 110 115 120 125 atg gaa ctg aca aga gac ccc act gtc cct gag cat gcg tcc tta gtc 493Met Glu Leu Thr Arg Asp Pro Thr Val Pro Glu His Ala Ser Leu Val 130 135 140 ggt ctt ctc ttg agt ctg tat agg ttg agc aaa ata tcc ggg caa aac 541Gly Leu Leu Leu Ser Leu Tyr Arg Leu Ser Lys Ile Ser Gly Gln Asn 145 150 155 act ggt aac tat aag aca aac att gca gac agg ata gag cag att ttt 589Thr Gly Asn Tyr Lys Thr Asn Ile Ala Asp Arg Ile Glu Gln Ile Phe 160 165 170 gag aca gcc cct ttt gtt aaa atc gtg gaa cac cat act cta atg aca 637Glu Thr Ala Pro Phe Val Lys Ile Val Glu His His Thr Leu Met Thr 175 180 185 act cac aaa atg tgt gct aat tgg agt act ata cca aac ttc aga ttt 685Thr His Lys Met Cys Ala Asn Trp Ser Thr Ile Pro Asn Phe Arg Phe 190 195 200 205 ttg gcc gga acc tat gac atg ttt ttc tcc cgg att gag cat cta tat 733Leu Ala Gly Thr Tyr Asp Met Phe Phe Ser Arg Ile Glu His Leu Tyr 210 215 220 tca gca atc aga gtg ggc aca gtt gtc act gct tat gaa gac tgt tca 781Ser Ala Ile Arg Val Gly Thr Val Val Thr Ala Tyr Glu Asp Cys Ser 225 230 235 gga ctg gta tca ttt act ggg ttc ata aaa caa atc aat ctc acc gct 829Gly Leu Val Ser Phe Thr Gly Phe Ile Lys Gln Ile Asn Leu Thr Ala 240 245 250 aga gag gca ata cta tat ttc ttc cac aag aac ttt gag gaa gag ata 877Arg Glu Ala Ile Leu Tyr Phe Phe His Lys Asn Phe Glu Glu Glu Ile 255 260 265 aga aga atg ttt gag cca ggg cag gag aca gct gtt cct cac tct tat 925Arg Arg Met Phe Glu Pro Gly Gln Glu Thr Ala Val Pro His Ser Tyr 270 275 280 285 ttc atc cac ttc cgt tca cta ggc ttg agt ggg aaa tct cct tat tca 973Phe Ile His Phe Arg Ser Leu Gly Leu Ser Gly Lys Ser Pro Tyr Ser 290 295 300 tca aat gct gtt ggt cac gtg ttc aat ctc att cac ttt gta gga tgc 1021Ser Asn Ala Val Gly His Val Phe Asn Leu Ile His Phe Val Gly Cys 305 310 315 tat atg ggt caa gtc aga tcc cta aat gca acg gtt att gct gca tgt 1069Tyr Met Gly Gln Val Arg Ser Leu Asn Ala Thr Val Ile Ala Ala Cys 320 325 330 gct cct cat gaa atg tct gtt cta ggg ggc tat ctg gga gag gaa ttc 1117Ala Pro His Glu Met Ser Val Leu Gly Gly Tyr Leu Gly Glu Glu Phe 335 340 345 ttc ggg aaa ggg aca ttt gaa aga aga ttc ttc aga gat gag aaa gaa 1165Phe Gly Lys Gly Thr Phe Glu Arg Arg Phe Phe Arg Asp Glu Lys Glu 350 355 360 365 ctt caa gaa tac gag gcg gct gaa ctg aca aag act gac gta gca ctg 1213Leu Gln Glu Tyr Glu Ala Ala Glu Leu Thr Lys Thr Asp Val Ala Leu 370 375 380 gca gat gat gga act gtc aac tct gac gac gag gac tac ttc tca ggt 1261Ala Asp Asp Gly Thr Val Asn Ser Asp Asp Glu Asp Tyr Phe Ser Gly 385 390 395 gaa acc aga agt ccg gag gct gtt tat act cga atc atg atg aat gga 1309Glu Thr Arg Ser Pro Glu Ala Val Tyr Thr Arg Ile Met Met Asn Gly 400 405 410 ggt cga cta aag aga tct cac ata cgg aga tat gtc tca gtc agt tcc 1357Gly Arg Leu Lys Arg Ser His Ile Arg Arg Tyr Val Ser Val Ser Ser 415 420 425 aat cat caa gcc cgt cca aac tca ttc gcc gag ttt cta aac aag aca 1405Asn His Gln Ala Arg Pro Asn Ser Phe Ala Glu Phe Leu Asn Lys Thr 430 435 440 445 tat tcg agt gac tca taagaagttg aacaacaaaa tgccggaaat ctacggattg 1460Tyr Ser Ser Asp Ser 450 tgtatatcca tcatgaaaaa aactaacacc cctcctttcg aaccatccca aac atg 1516 Met agc aag atc ttt gtc aat cct agt gct att aga gcc ggt ctg gcc gat 1564Ser Lys Ile Phe Val Asn Pro Ser Ala Ile Arg Ala Gly Leu Ala Asp 455 460 465 ctt gag atg gct gaa gaa act gtt gat ctg atc aat aga aat atc gaa 1612Leu Glu Met Ala Glu Glu Thr Val Asp Leu Ile Asn Arg Asn Ile Glu 470 475 480 gac aat cag gct cat ctc caa ggg gaa ccc ata gaa gtg gac aat ctc 1660Asp Asn Gln Ala His Leu Gln Gly Glu Pro Ile Glu Val Asp Asn Leu 485 490 495 cct gag gat atg ggg cga ctt cac ctg gat gat gga aaa tcg ccc aac 1708Pro Glu Asp Met Gly Arg Leu His Leu Asp Asp Gly Lys Ser Pro Asn 500 505 510 515 cct ggt gag atg gcc aag gtg gga gaa ggc aag tat cga gag gac ttt 1756Pro Gly Glu Met Ala Lys Val Gly Glu Gly Lys Tyr Arg Glu Asp Phe 520 525 530 cag atg gat gaa gga gag gat ctt agc ttc ctg ttc cag tca tac ctg 1804Gln Met Asp Glu Gly Glu Asp Leu Ser Phe Leu Phe Gln Ser Tyr Leu 535 540 545 gaa aat gtt gga gtc caa ata gtc aga caa atg agg tca gga gag aga 1852Glu Asn Val Gly Val Gln Ile Val Arg Gln Met Arg Ser Gly Glu Arg 550 555 560 ttt ctc aag ata tgg tca cag acc gta gaa gag att ata tcc tat gtc 1900Phe Leu Lys Ile Trp Ser Gln Thr Val Glu Glu Ile Ile Ser Tyr Val 565 570 575 gcg gtc aac ttt ccc aac cct cca gga aag tct tca gag gat aaa tca 1948Ala Val Asn Phe Pro Asn Pro Pro Gly Lys Ser Ser Glu Asp Lys Ser 580 585 590 595 acc cag act act ggc cga gag ctc aag aag gag aca aca ccc act cct 1996Thr Gln Thr Thr Gly Arg Glu Leu Lys Lys Glu Thr Thr Pro Thr Pro 600 605 610 tct cag aga gaa agc caa tca tcg aaa gcc agg atg gcg gct caa att 2044Ser Gln Arg Glu Ser Gln Ser Ser Lys Ala Arg Met Ala Ala Gln Ile 615 620 625 gct tct ggc cct cca gcc ctt gaa tgg tcg gcc acc aat gaa gag gat 2092Ala Ser Gly Pro Pro Ala Leu Glu Trp Ser Ala Thr Asn Glu Glu Asp 630 635 640 gat cta tca gtg gag gct gag atc gct cac cag att gca gaa agt ttc 2140Asp Leu Ser Val Glu Ala Glu Ile Ala His Gln Ile Ala Glu Ser Phe 645 650 655 tcc aaa aaa tat aag ttt ccc tct cga tcc tca ggg ata ctc ttg tat 2188Ser Lys Lys Tyr Lys Phe Pro Ser Arg Ser Ser Gly Ile Leu Leu Tyr 660 665 670 675 aat ttt gag caa ttg aaa atg aac ctt gat gat ata gtt aaa gag gca 2236Asn Phe Glu Gln Leu Lys Met Asn Leu Asp Asp Ile Val Lys Glu Ala 680 685 690 aaa aat gta cca ggt gtg acc cgt tta gcc cat gac ggg tcc aaa ctc 2284Lys Asn Val Pro Gly Val Thr Arg Leu Ala His Asp Gly Ser Lys Leu 695 700 705 ccc cta aga tgt gta ctg gga tgg gtc gct ttg gcc aac cct aag aaa 2332Pro Leu Arg Cys Val Leu Gly Trp Val Ala Leu Ala Asn Pro Lys Lys 710 715 720 ttc cag ttg tta gtc gaa tcc gac aag ctg agt aaa atc atg caa gat 2380Phe Gln Leu Leu Val Glu Ser Asp Lys Leu Ser Lys Ile Met Gln Asp 725 730 735 gac ttg aat cgc tat aca tct tgc taaccgaacc tctccactca gtccctctag 2434Asp Leu Asn Arg Tyr Thr Ser Cys 740 745 acaataaagt ccgagatgtc ctaaagtcaa catgaaaaaa acaggcaaca ccactgataa 2494a atg aac ttt cta cgt aag ata gtg aaa aat tgc agg gac gag gac act 2543 Met Asn Phe Leu Arg Lys Ile Val Lys Asn Cys Arg Asp Glu Asp Thr 750 755 760 caa aaa ccc tct ccc gtg tca gcc cct ctg gat gac gat gac ttg tgg 2591Gln Lys Pro Ser Pro Val Ser Ala Pro Leu Asp Asp Asp Asp Leu Trp 765 770 775 ctt cca ccc cct gaa tac gtc ccg ctg aaa gaa ctt aca agc aag aag 2639Leu Pro Pro Pro Glu Tyr Val Pro Leu Lys Glu Leu Thr Ser Lys Lys 780 785 790 795 aac atg agg aac ttt tgt atc aac gga ggg gtt aaa gtg tgt agc ccg 2687Asn Met Arg Asn Phe Cys Ile Asn Gly Gly Val Lys Val Cys Ser Pro 800 805 810 aat ggt tac tcg ttc agg atc ctg cgg cac att ctg aaa tca ttc gac 2735Asn Gly Tyr Ser Phe Arg Ile Leu Arg His Ile Leu Lys Ser Phe Asp 815 820 825 gag ata tat tct ggg aat cat agg atg atc ggg tta gcc aaa gta gtt 2783Glu Ile Tyr Ser Gly Asn His Arg Met Ile Gly Leu Ala Lys Val Val 830 835 840 att gga ctg gct ttg tca gga tct cca gtc cct gag ggc atg aac tgg 2831Ile Gly Leu Ala Leu Ser Gly Ser Pro Val Pro Glu Gly Met Asn Trp 845 850 855 gta tac aaa ttg agg aga acc ttt atc ttc cag tgg gct gat tcc agg 2879Val Tyr Lys Leu Arg Arg Thr Phe Ile Phe Gln Trp Ala Asp Ser Arg 860 865 870 875 ggc cct ctt gaa ggg gag gag ttg gaa tac tct cag gag atc act tgg 2927Gly Pro Leu Glu Gly Glu Glu Leu Glu Tyr Ser Gln Glu Ile Thr Trp 880 885 890 gat gat gat act gag ttc gtc gga ttg caa ata aga gtg att gca aaa 2975Asp Asp Asp Thr Glu Phe Val Gly Leu Gln Ile Arg Val Ile Ala Lys 895 900 905 cag tgt cat atc cag ggc aga atc tgg tgt atc aac atg aac ccg aga 3023Gln Cys His Ile Gln Gly Arg Ile Trp Cys Ile Asn Met Asn Pro Arg 910 915 920 gca tgt caa cta tgg tct gac atg tct ctt cag aca caa agg tcc gaa 3071Ala Cys Gln Leu Trp Ser Asp Met Ser Leu Gln Thr Gln Arg Ser Glu 925 930 935 gag gac aaa gat tcc tct ctg ctt cta gaa taatcagatt atatcccgca 3121Glu Asp Lys Asp Ser Ser Leu Leu Leu Glu 940 945 aatttatcac ttgtttacct ctggaggaga gaacatatgg gctcaactcc aacccttggg 3181agcaatataa caaaaaacat gttatggtgc cattaaaccg ctgcatttca tcaaagtcaa 3241gttgattacc tttacatttt gatcctcttg gatgtgaaaa aaactattaa catccctcaa 3301aagactcaag gaaag atg gtt cct cag gct ctc ctg ttt gta ccc ctt ctg 3352 Met Val Pro Gln Ala Leu Leu Phe Val Pro Leu Leu 950 955 960 gtt ttt cca ttg tgt ttt ggg aaa ttc cct att tac acg ata cca gac 3400Val Phe Pro Leu Cys Phe Gly Lys Phe Pro Ile Tyr Thr Ile Pro Asp 965 970 975 aag ctt ggt ccc tgg agc ccg att gac ata cat cac ctc agc tgc cca 3448Lys Leu Gly Pro Trp Ser Pro Ile Asp Ile His His Leu Ser Cys Pro 980 985 990 aac aat ttg gta gtg gag gac gaa gga tgc acc aac ctg tca ggg ttc 3496Asn Asn Leu Val Val Glu Asp Glu Gly Cys Thr Asn Leu Ser Gly Phe 995 1000 1005 tcc tac atg gaa ctt aaa gtt gga tac atc tta gcc ata aaa atg 3541Ser Tyr Met Glu Leu Lys Val Gly Tyr Ile Leu Ala Ile Lys Met 1010 1015 1020 aac ggg ttc act tgc aca ggc gtt gtg acg gag gct gaa acc tat 3586Asn Gly Phe Thr Cys Thr Gly Val Val Thr Glu Ala Glu Thr Tyr 1025 1030 1035 act aac ttc gtt ggt tat gtc aca acc acg ttc aaa aga aag cat 3631Thr Asn Phe Val Gly Tyr Val Thr Thr Thr Phe Lys Arg Lys His 1040 1045 1050 ttc cgc cca aca cca gat gca tgt aga gcc gcg tac aac tgg aag 3676Phe Arg Pro Thr Pro Asp Ala Cys Arg Ala Ala Tyr Asn Trp Lys 1055 1060 1065 atg gcc ggt gac ccc aga tat gaa gag tct cta cac aat ccg tac 3721Met Ala Gly Asp Pro Arg Tyr Glu Glu Ser Leu His Asn Pro Tyr 1070 1075 1080 cct gac tac cac tgg ctt cga act gta aaa acc acc aag gag tct 3766Pro Asp Tyr His Trp Leu Arg Thr Val Lys Thr Thr Lys Glu Ser 1085 1090 1095 ctc gtt atc ata tct cca agt gtg gca gat ttg gac cca tat gac 3811Leu Val Ile Ile Ser Pro Ser Val Ala Asp Leu Asp Pro Tyr Asp 1100 1105 1110 aga tcc ctt cac tcg agg gtc ttc cct agc ggg aag tgc tca gga 3856Arg Ser Leu His Ser Arg Val Phe Pro Ser Gly Lys Cys Ser Gly 1115 1120 1125 gta gcg gtg tct tct acc tac tgc tcc act aac cac gat tac acc 3901Val Ala Val Ser Ser Thr Tyr Cys Ser Thr Asn His Asp Tyr Thr 1130 1135 1140 att tgg atg ccc gag aat ccg aga cta ggg atg tct tgt gac att 3946Ile Trp Met Pro Glu Asn Pro Arg Leu Gly Met Ser Cys Asp Ile 1145 1150 1155 ttt acc aat agt agg ggg aag aga gca tcc aaa ggg agt gag act 3991Phe Thr Asn Ser Arg Gly Lys Arg Ala Ser Lys Gly Ser Glu Thr 1160 1165 1170 tgc ggc ttt gta gat gaa aga ggc cta tat aag tct tta aaa gga 4036Cys Gly Phe Val Asp Glu Arg Gly Leu Tyr Lys Ser Leu Lys Gly 1175 1180 1185 gca tgc aaa ctc aag tta tgt gga gtt cta gga ctt aga ctt atg 4081Ala Cys Lys Leu Lys Leu Cys Gly Val Leu Gly Leu Arg Leu Met 1190 1195 1200 gat gga aca tgg gtc gcg atg caa aca tca aat gaa acc aaa tgg 4126Asp Gly Thr Trp Val Ala Met Gln Thr Ser Asn Glu Thr Lys Trp 1205 1210 1215 tgc ccc ccc gat cag ttg gtg aac ctg cac gac ttt cgc tca gac 4171Cys Pro Pro Asp Gln Leu Val Asn Leu His Asp Phe Arg Ser Asp 1220 1225 1230 gaa att gag cac ctt gtt gta gag gag ttg gtc agg aag aga gag 4216Glu Ile Glu His Leu Val Val Glu Glu Leu Val Arg Lys Arg Glu 1235 1240 1245 gag tgt ctg gat gca cta gag tcc atc atg aca acc aag tca gtg 4261Glu Cys Leu Asp Ala Leu Glu Ser Ile Met Thr Thr Lys Ser Val 1250 1255 1260 agt ttc aga cgt ccc agt cat tta aga aaa ctt gtc cct ggg ttt 4306Ser Phe Arg Arg Pro Ser His Leu Arg Lys Leu Val Pro Gly Phe 1265 1270 1275 gga aaa gca tat acc ata ttc aac aag acc ttg

atg gaa gcc gat 4351Gly Lys Ala Tyr Thr Ile Phe Asn Lys Thr Leu Met Glu Ala Asp 1280 1285 1290 gct cac tac aag tca gtc gag act tgg aat gag atc ctc cct tca 4396Ala His Tyr Lys Ser Val Glu Thr Trp Asn Glu Ile Leu Pro Ser 1295 1300 1305 aaa ggg tgt tta aga gtt ggg ggg agg tgt cat cct cat gtg aac 4441Lys Gly Cys Leu Arg Val Gly Gly Arg Cys His Pro His Val Asn 1310 1315 1320 ggg gtg ttt ttc aat ggt ata ata tta gga cct gac ggc aat gtc 4486Gly Val Phe Phe Asn Gly Ile Ile Leu Gly Pro Asp Gly Asn Val 1325 1330 1335 tta atc cca gag atg caa tca tcc ctc ctc cag caa cat atg gag 4531Leu Ile Pro Glu Met Gln Ser Ser Leu Leu Gln Gln His Met Glu 1340 1345 1350 ttg ttg gaa tcc tcg gtt atc ccc ctt gtg cac ccc ctg gca gac 4576Leu Leu Glu Ser Ser Val Ile Pro Leu Val His Pro Leu Ala Asp 1355 1360 1365 ccg tct acc gtt ttc aag gac ggt gac gag gct gag gat ttt gtt 4621Pro Ser Thr Val Phe Lys Asp Gly Asp Glu Ala Glu Asp Phe Val 1370 1375 1380 gaa gtt cac ctt ccc gat gtg cac aat cag gtc tca gga gtt gac 4666Glu Val His Leu Pro Asp Val His Asn Gln Val Ser Gly Val Asp 1385 1390 1395 ttg ggt ctc ccg aac tgg ggg aag tat gta tta ctg agt gca ggg 4711Leu Gly Leu Pro Asn Trp Gly Lys Tyr Val Leu Leu Ser Ala Gly 1400 1405 1410 gcc ctg act gcc ttg atg ttg ata att ttc ctg atg aca tgt tgt 4756Ala Leu Thr Ala Leu Met Leu Ile Ile Phe Leu Met Thr Cys Cys 1415 1420 1425 aga aga gtc aat cga tca gaa cct acg caa cac aat ctc aga ggg 4801Arg Arg Val Asn Arg Ser Glu Pro Thr Gln His Asn Leu Arg Gly 1430 1435 1440 aca ggg agg gag gtg tca gtc act ccc caa agc ggg aag atc ata 4846Thr Gly Arg Glu Val Ser Val Thr Pro Gln Ser Gly Lys Ile Ile 1445 1450 1455 tct tca tgg gaa tca cac aag agt ggg ggt gag acc aga ctg 4888Ser Ser Trp Glu Ser His Lys Ser Gly Gly Glu Thr Arg Leu 1460 1465 1470 tgaggactgg ccgtcctttc aactatccaa gtcctgaaga tcacctcccc ttggggggtt 4948ctttttgaaa aaaacctggg ttcaatagtc ctcctcgaac tccatgcaac tgggtagatt 5008caagagtcat gagattttca ttaatcctct cagttgatca agcaagatca tgtagattct 5068cataataggg gagatcttct agcagtttca gtgactaacg gtactttcat tctccaggaa 5128ctgacaccaa cagttgtaga caaaccacgg ggtgtctcgg gtgactctgt gcttgggcac 5188agacaaaggt catggtgtgt tccatgatag cggactcagg atgagttaat tgagagaggc 5248agtcttcctc ccgtgaagga cataagcagt agctcacaat catcccgcgt ctcagcaaag 5308tgtgcataat tataaagtgc tgggtcatct aagcttttca gtcgagaaaa aaacattaga 5368tcagaagaac aactggcaac acttctcaac ctgagaccta cttcaag atg ctc gat 5424 Met Leu Asp 1475 cct gga gag gtc tat gat gac cct att gac cca atc gag tta gag 5469Pro Gly Glu Val Tyr Asp Asp Pro Ile Asp Pro Ile Glu Leu Glu 1480 1485 1490 gat gaa ccc aga gga acc ccc act gtc ccc aac atc ttg agg aac 5514Asp Glu Pro Arg Gly Thr Pro Thr Val Pro Asn Ile Leu Arg Asn 1495 1500 1505 tct gac tac aat ctc aac tct cct ttg ata gaa gat cct gct aga 5559Ser Asp Tyr Asn Leu Asn Ser Pro Leu Ile Glu Asp Pro Ala Arg 1510 1515 1520 cta atg tta gaa tgg tta aaa aca ggg aat aga cct tat cgg atg 5604Leu Met Leu Glu Trp Leu Lys Thr Gly Asn Arg Pro Tyr Arg Met 1525 1530 1535 act cta aca gac aat tgc tcc agg tct ttc aga gtt ttg aaa gat 5649Thr Leu Thr Asp Asn Cys Ser Arg Ser Phe Arg Val Leu Lys Asp 1540 1545 1550 tat ttc aag aag gta gat ttg ggt tct ctc aag gtg ggc gga atg 5694Tyr Phe Lys Lys Val Asp Leu Gly Ser Leu Lys Val Gly Gly Met 1555 1560 1565 gct gca cag tca atg att tct ctc tgg tta tat ggt gcc cac tct 5739Ala Ala Gln Ser Met Ile Ser Leu Trp Leu Tyr Gly Ala His Ser 1570 1575 1580 gaa tcc aac agg agc cgg aga tgt ata aca gac ttg gcc cat ttc 5784Glu Ser Asn Arg Ser Arg Arg Cys Ile Thr Asp Leu Ala His Phe 1585 1590 1595 tat tcc aag tcg tcc ccc ata gag aag ctg ttg aat ctc acg cta 5829Tyr Ser Lys Ser Ser Pro Ile Glu Lys Leu Leu Asn Leu Thr Leu 1600 1605 1610 gga aat aga ggg ctg aga atc ccc cca gag gga gtg tta agt tgc 5874Gly Asn Arg Gly Leu Arg Ile Pro Pro Glu Gly Val Leu Ser Cys 1615 1620 1625 ctt gag agg gtt gat tat gat aat gca ttt gga agg tat ctt gcc 5919Leu Glu Arg Val Asp Tyr Asp Asn Ala Phe Gly Arg Tyr Leu Ala 1630 1635 1640 aac acg tat tcc tct tac ttg ttc ttc cat gta atc acc tta tac 5964Asn Thr Tyr Ser Ser Tyr Leu Phe Phe His Val Ile Thr Leu Tyr 1645 1650 1655 atg aac gcc cta gac tgg gat gaa gaa aag acc atc cta gca tta 6009Met Asn Ala Leu Asp Trp Asp Glu Glu Lys Thr Ile Leu Ala Leu 1660 1665 1670 tgg aaa gat tta acc tca gtg gac atc ggg aag gac ttg gta aag 6054Trp Lys Asp Leu Thr Ser Val Asp Ile Gly Lys Asp Leu Val Lys 1675 1680 1685 ttc aaa gac caa ata tgg gga ctg ccg atc gtg aca aag gac ttt 6099Phe Lys Asp Gln Ile Trp Gly Leu Pro Ile Val Thr Lys Asp Phe 1690 1695 1700 gtt tac tcc caa agt tcc aat tgt ctt ttt gac aga aac tac aca 6144Val Tyr Ser Gln Ser Ser Asn Cys Leu Phe Asp Arg Asn Tyr Thr 1705 1710 1715 ctt atg cta aaa gaa ctt ttc ttg tct cgc ttc aac tcc tta atg 6189Leu Met Leu Lys Glu Leu Phe Leu Ser Arg Phe Asn Ser Leu Met 1720 1725 1730 gtc ttg ctc tct ccc cca gag ccc cga tac tca gat gac ttg ata 6234Val Leu Leu Ser Pro Pro Glu Pro Arg Tyr Ser Asp Asp Leu Ile 1735 1740 1745 tct caa cta tgc cag ctg tac att gct ggg gat caa gtc ttg tct 6279Ser Gln Leu Cys Gln Leu Tyr Ile Ala Gly Asp Gln Val Leu Ser 1750 1755 1760 atg tgt gga aac tcc ggc tat gaa gtc atc aaa ata ttg gag cca 6324Met Cys Gly Asn Ser Gly Tyr Glu Val Ile Lys Ile Leu Glu Pro 1765 1770 1775 tat gtc gtg aat agt tta gtc cag aga gca gaa aag ttt agg cct 6369Tyr Val Val Asn Ser Leu Val Gln Arg Ala Glu Lys Phe Arg Pro 1780 1785 1790 ctc att cat tcc ttg gga gac ttt cct gta ttt ata aaa gac aag 6414Leu Ile His Ser Leu Gly Asp Phe Pro Val Phe Ile Lys Asp Lys 1795 1800 1805 gta agt caa ctt gaa gag acg ttc ggt ccc tgt gca aga agg ttc 6459Val Ser Gln Leu Glu Glu Thr Phe Gly Pro Cys Ala Arg Arg Phe 1810 1815 1820 ttt agg gct ctg gat caa ttc gac aac ata cat gac ttg gtt ttt 6504Phe Arg Ala Leu Asp Gln Phe Asp Asn Ile His Asp Leu Val Phe 1825 1830 1835 gtg tat ggc tgt tac agg cat tgg ggg cac cca tat ata gat tat 6549Val Tyr Gly Cys Tyr Arg His Trp Gly His Pro Tyr Ile Asp Tyr 1840 1845 1850 cga aag ggt ctg tca aaa cta tat gat cag gtt cac att aaa aaa 6594Arg Lys Gly Leu Ser Lys Leu Tyr Asp Gln Val His Ile Lys Lys 1855 1860 1865 gtg ata gat aag tcc tac cag gag tgc tta gca agc gac cta gcc 6639Val Ile Asp Lys Ser Tyr Gln Glu Cys Leu Ala Ser Asp Leu Ala 1870 1875 1880 agg agg atc ctt aga tgg ggt ttt gat aag tac tcc aag tgg tat 6684Arg Arg Ile Leu Arg Trp Gly Phe Asp Lys Tyr Ser Lys Trp Tyr 1885 1890 1895 ctg gat tca aga ttc cta gcc cga gac cac ccc ttg act cct tat 6729Leu Asp Ser Arg Phe Leu Ala Arg Asp His Pro Leu Thr Pro Tyr 1900 1905 1910 atc aaa acc caa aca tgg cca ccc aaa cat att gta gac ttg gtg 6774Ile Lys Thr Gln Thr Trp Pro Pro Lys His Ile Val Asp Leu Val 1915 1920 1925 ggg gat aca tgg cac aag ctc ccg atc acg cag atc ttt gag att 6819Gly Asp Thr Trp His Lys Leu Pro Ile Thr Gln Ile Phe Glu Ile 1930 1935 1940 cct gaa tca atg gat ccg tca gaa ata ttg gat gac aaa tca cat 6864Pro Glu Ser Met Asp Pro Ser Glu Ile Leu Asp Asp Lys Ser His 1945 1950 1955 tct ttc acc aga acg aga cta gct tct tgg ctg tca gaa aac cga 6909Ser Phe Thr Arg Thr Arg Leu Ala Ser Trp Leu Ser Glu Asn Arg 1960 1965 1970 ggg gga cct gtt cct agc gaa aaa gtt att atc acg gcc ctg tct 6954Gly Gly Pro Val Pro Ser Glu Lys Val Ile Ile Thr Ala Leu Ser 1975 1980 1985 aag ccg cct gtc aat ccc cga gag ttt ctg agg tct ata gac ctc 6999Lys Pro Pro Val Asn Pro Arg Glu Phe Leu Arg Ser Ile Asp Leu 1990 1995 2000 gga gga ttg cca gat gaa gac ttg ata att ggc ctc aag cca aag 7044Gly Gly Leu Pro Asp Glu Asp Leu Ile Ile Gly Leu Lys Pro Lys 2005 2010 2015 gaa cgg gaa ttg aag att gaa ggt cga ttc ttt gct cta atg tca 7089Glu Arg Glu Leu Lys Ile Glu Gly Arg Phe Phe Ala Leu Met Ser 2020 2025 2030 tgg aat cta aga ttg tat ttt gtc atc act gaa aaa ctc ttg gcc 7134Trp Asn Leu Arg Leu Tyr Phe Val Ile Thr Glu Lys Leu Leu Ala 2035 2040 2045 aac tac atc ttg cca ctt ttt gac gcg ctg act atg aca gac aac 7179Asn Tyr Ile Leu Pro Leu Phe Asp Ala Leu Thr Met Thr Asp Asn 2050 2055 2060 ctg aac aag gtg ttt aaa aag ctg atc gac agg gtc acc ggg caa 7224Leu Asn Lys Val Phe Lys Lys Leu Ile Asp Arg Val Thr Gly Gln 2065 2070 2075 ggg ctt ttg gac tat tca agg gtc aca tat gca ttt cac ctg gac 7269Gly Leu Leu Asp Tyr Ser Arg Val Thr Tyr Ala Phe His Leu Asp 2080 2085 2090 tat gaa aag tgg aac aac cat caa aga tta gag tca aca gag gat 7314Tyr Glu Lys Trp Asn Asn His Gln Arg Leu Glu Ser Thr Glu Asp 2095 2100 2105 gta ttt tct gtc cta gat caa gtg ttt gga ttg aag aga gtg ttt 7359Val Phe Ser Val Leu Asp Gln Val Phe Gly Leu Lys Arg Val Phe 2110 2115 2120 tct aga aca cac gag ttt ttt caa aag gcc tgg atc tat tat tca 7404Ser Arg Thr His Glu Phe Phe Gln Lys Ala Trp Ile Tyr Tyr Ser 2125 2130 2135 gac aga tca gac ctc atc ggg tta cgg gag gat caa ata tac tgc 7449Asp Arg Ser Asp Leu Ile Gly Leu Arg Glu Asp Gln Ile Tyr Cys 2140 2145 2150 tta gat gcg tcc aac ggc cca acc tgt tgg aat ggc cag gat ggc 7494Leu Asp Ala Ser Asn Gly Pro Thr Cys Trp Asn Gly Gln Asp Gly 2155 2160 2165 ggg cta gaa ggc tta cgg cag aag ggc tgg agt cta gtc agc tta 7539Gly Leu Glu Gly Leu Arg Gln Lys Gly Trp Ser Leu Val Ser Leu 2170 2175 2180 ttg atg ata gat aga gaa tct caa atc agg aac aca aga acc aaa 7584Leu Met Ile Asp Arg Glu Ser Gln Ile Arg Asn Thr Arg Thr Lys 2185 2190 2195 ata cta gct caa gga gac aac cag gtt tta tgt ccg aca tat atg 7629Ile Leu Ala Gln Gly Asp Asn Gln Val Leu Cys Pro Thr Tyr Met 2200 2205 2210 ttg tcg cca ggg cta tct caa gag ggg ctc ctc tat gaa ttg gag 7674Leu Ser Pro Gly Leu Ser Gln Glu Gly Leu Leu Tyr Glu Leu Glu 2215 2220 2225 aga ata tca agg aat gca ctt tcg ata tac aga gcc gtc gag gaa 7719Arg Ile Ser Arg Asn Ala Leu Ser Ile Tyr Arg Ala Val Glu Glu 2230 2235 2240 ggg gca tct aag cta ggg ctg atc atc aag aaa gaa gag acc atg 7764Gly Ala Ser Lys Leu Gly Leu Ile Ile Lys Lys Glu Glu Thr Met 2245 2250 2255 tgt agt tat gac ttc ctc atc tat gga aaa acc cct ttg ttt aga 7809Cys Ser Tyr Asp Phe Leu Ile Tyr Gly Lys Thr Pro Leu Phe Arg 2260 2265 2270 ggt aac ata ttg gtg cct gag tcc aaa aga tgg gcc aga gtc tct 7854Gly Asn Ile Leu Val Pro Glu Ser Lys Arg Trp Ala Arg Val Ser 2275 2280 2285 tgc gtc tct aat gac caa ata gtc aac ctc gcc aat ata atg tcg 7899Cys Val Ser Asn Asp Gln Ile Val Asn Leu Ala Asn Ile Met Ser 2290 2295 2300 aca gtg tcc acc aat gcg cta aca gtg gca caa cac tct caa tct 7944Thr Val Ser Thr Asn Ala Leu Thr Val Ala Gln His Ser Gln Ser 2305 2310 2315 ttg atc aaa ccg atg agg gat ttt ctg ctc atg tca gta cag gca 7989Leu Ile Lys Pro Met Arg Asp Phe Leu Leu Met Ser Val Gln Ala 2320 2325 2330 gtc ttt cac tac ctg cta ttt agc cca atc tta aag gga aga gtt 8034Val Phe His Tyr Leu Leu Phe Ser Pro Ile Leu Lys Gly Arg Val 2335 2340 2345 tac aag att ctg agc gct gaa ggg gat agc ttt ctc cta gcc atg 8079Tyr Lys Ile Leu Ser Ala Glu Gly Asp Ser Phe Leu Leu Ala Met 2350 2355 2360 tca agg ata atc tat cta gat cct tct ttg gga ggg gta tct gga 8124Ser Arg Ile Ile Tyr Leu Asp Pro Ser Leu Gly Gly Val Ser Gly 2365 2370 2375 atg tcc ctc gga aga ttc cat ata cga cag ttc tca gac cct gtc 8169Met Ser Leu Gly Arg Phe His Ile Arg Gln Phe Ser Asp Pro Val 2380 2385 2390 tct gaa ggg tta tcc ttc tgg aga gag atc tgg tta agc tcc cac 8214Ser Glu Gly Leu Ser Phe Trp Arg Glu Ile Trp Leu Ser Ser His 2395 2400 2405 gag tcc tgg att cac gcg ttg tgt caa gag gct gga aac cca gat 8259Glu Ser Trp Ile His Ala Leu Cys Gln Glu Ala Gly Asn Pro Asp 2410 2415 2420 ctt gga gag aga aca ctc gag agc ttc act cgc ctt cta gaa gat 8304Leu Gly Glu Arg Thr Leu Glu Ser Phe Thr Arg Leu Leu Glu Asp 2425 2430 2435 cct acc acc tta aat atc aga gga ggg gcc agt cct acc att cta 8349Pro Thr Thr Leu Asn Ile Arg Gly Gly Ala Ser Pro Thr Ile Leu 2440 2445 2450 ctc aag gat gca atc aga aag gct tta tat gac gag gtg gac aag 8394Leu Lys Asp Ala Ile Arg Lys Ala Leu Tyr Asp Glu Val Asp Lys 2455 2460 2465 gtg gag aat tca gag ttt cga gag gca atc ctg ttg tcc aag acc 8439Val Glu Asn Ser Glu Phe Arg Glu Ala Ile Leu Leu Ser Lys Thr 2470 2475 2480 cat aga gat aat ttt ata ctc ttc tta aca tct gtt gag cct ctg

8484His Arg Asp Asn Phe Ile Leu Phe Leu Thr Ser Val Glu Pro Leu 2485 2490 2495 ttt cct cga ttt ctc agt gag cta ttc agt tcg tct ttt ttg gga 8529Phe Pro Arg Phe Leu Ser Glu Leu Phe Ser Ser Ser Phe Leu Gly 2500 2505 2510 atc ccc gag tca atc att gga ttg ata caa aac tcc cga acg ata 8574Ile Pro Glu Ser Ile Ile Gly Leu Ile Gln Asn Ser Arg Thr Ile 2515 2520 2525 aga agg cag ttt aga aag agt ctc tca aaa act tta gaa gaa tcc 8619Arg Arg Gln Phe Arg Lys Ser Leu Ser Lys Thr Leu Glu Glu Ser 2530 2535 2540 ttc tac aac tca gag atc cac ggg att agt cgg atg acc cag aca 8664Phe Tyr Asn Ser Glu Ile His Gly Ile Ser Arg Met Thr Gln Thr 2545 2550 2555 cct cag agg gtt ggg ggg gtg tgg cct tgc tct tca gag agg gca 8709Pro Gln Arg Val Gly Gly Val Trp Pro Cys Ser Ser Glu Arg Ala 2560 2565 2570 gat cta ctt agg gag atc tct tgg gga aga aaa gtg gta ggc acg 8754Asp Leu Leu Arg Glu Ile Ser Trp Gly Arg Lys Val Val Gly Thr 2575 2580 2585 aca gtt cct cac cct tct gag atg ttg ggg tta ctt ccc aag tcc 8799Thr Val Pro His Pro Ser Glu Met Leu Gly Leu Leu Pro Lys Ser 2590 2595 2600 tct att tct tgc act tgt gga gca aca gga gga ggc aat cct aga 8844Ser Ile Ser Cys Thr Cys Gly Ala Thr Gly Gly Gly Asn Pro Arg 2605 2610 2615 gtt tct gta tca gta ctc ccg tcc ttt gat cag tca ttt ttt tca 8889Val Ser Val Ser Val Leu Pro Ser Phe Asp Gln Ser Phe Phe Ser 2620 2625 2630 cga ggc ccc cta aag ggg tac ttg ggc tcg tcc acc tct atg tcg 8934Arg Gly Pro Leu Lys Gly Tyr Leu Gly Ser Ser Thr Ser Met Ser 2635 2640 2645 acc cag cta ttc cat gca tgg gaa aaa gtc act aat gtt cat gtg 8979Thr Gln Leu Phe His Ala Trp Glu Lys Val Thr Asn Val His Val 2650 2655 2660 gtg aag aga gct cta tcg tta aaa gaa tct ata aac tgg ttc att 9024Val Lys Arg Ala Leu Ser Leu Lys Glu Ser Ile Asn Trp Phe Ile 2665 2670 2675 act aga gat tcc aac ttg gct caa gct cta att agg aac att atg 9069Thr Arg Asp Ser Asn Leu Ala Gln Ala Leu Ile Arg Asn Ile Met 2680 2685 2690 tct ctg aca ggc cct gat ttc cct cta gag gag gcc cct gtc ttc 9114Ser Leu Thr Gly Pro Asp Phe Pro Leu Glu Glu Ala Pro Val Phe 2695 2700 2705 aaa agg acg ggg tca gcc ttg cat agg ttc aag tct gcc aga tac 9159Lys Arg Thr Gly Ser Ala Leu His Arg Phe Lys Ser Ala Arg Tyr 2710 2715 2720 agc gaa gga ggg tat tct tct gtc tgc ccg aac ctc ctc tct cat 9204Ser Glu Gly Gly Tyr Ser Ser Val Cys Pro Asn Leu Leu Ser His 2725 2730 2735 att tct gtt agt aca gac acc atg tct gat ttg acc caa gac ggg 9249Ile Ser Val Ser Thr Asp Thr Met Ser Asp Leu Thr Gln Asp Gly 2740 2745 2750 aag aac tac gat ttc atg ttc cag cca ttg atg ctt tat gca cag 9294Lys Asn Tyr Asp Phe Met Phe Gln Pro Leu Met Leu Tyr Ala Gln 2755 2760 2765 aca tgg aca tca gag ctg gta cag aga gac aca agg cta aga gac 9339Thr Trp Thr Ser Glu Leu Val Gln Arg Asp Thr Arg Leu Arg Asp 2770 2775 2780 tct acg ttt cat tgg cac ctc cga tgc aac agg tgt gtg aga ccc 9384Ser Thr Phe His Trp His Leu Arg Cys Asn Arg Cys Val Arg Pro 2785 2790 2795 att gac gac gtg acc ctg gag acc tct cag atc ttc gag ttt ccg 9429Ile Asp Asp Val Thr Leu Glu Thr Ser Gln Ile Phe Glu Phe Pro 2800 2805 2810 gat gtg tcg aaa aga ata tcc aga atg gtt tct ggg gct gtg cct 9474Asp Val Ser Lys Arg Ile Ser Arg Met Val Ser Gly Ala Val Pro 2815 2820 2825 cac ttc cag agg ctt ccc gat atc cgt ctg aga cca gga gat ttt 9519His Phe Gln Arg Leu Pro Asp Ile Arg Leu Arg Pro Gly Asp Phe 2830 2835 2840 gaa tct cta agc ggt aga gaa aag tct cac cat atc gga tca gct 9564Glu Ser Leu Ser Gly Arg Glu Lys Ser His His Ile Gly Ser Ala 2845 2850 2855 cag ggg ctc tta tac tca atc tta gtg gca att cac gac tca gga 9609Gln Gly Leu Leu Tyr Ser Ile Leu Val Ala Ile His Asp Ser Gly 2860 2865 2870 tac aat gat gga acc atc ttc cct gtc aac ata tac gac aag gtt 9654Tyr Asn Asp Gly Thr Ile Phe Pro Val Asn Ile Tyr Asp Lys Val 2875 2880 2885 tcc cct aga gac tat ttg aga ggg ctc gca agg gga gta ttg ata 9699Ser Pro Arg Asp Tyr Leu Arg Gly Leu Ala Arg Gly Val Leu Ile 2890 2895 2900 gga tcc tcg att tgc ttc ttg aca aga atg aca aat atc aat att 9744Gly Ser Ser Ile Cys Phe Leu Thr Arg Met Thr Asn Ile Asn Ile 2905 2910 2915 aat aga cct ctt gaa ttg atc tca ggg gta atc tca tat att ctc 9789Asn Arg Pro Leu Glu Leu Ile Ser Gly Val Ile Ser Tyr Ile Leu 2920 2925 2930 ctg agg cta gat aac cat ccc tcc ttg tac ata atg ctc aga gaa 9834Leu Arg Leu Asp Asn His Pro Ser Leu Tyr Ile Met Leu Arg Glu 2935 2940 2945 ccg tct ctt aga gga gag ata ttt tct atc cct cag aaa atc ccc 9879Pro Ser Leu Arg Gly Glu Ile Phe Ser Ile Pro Gln Lys Ile Pro 2950 2955 2960 gcc gct tat cca acc act atg aaa gaa ggc aac aga tca atc ttg 9924Ala Ala Tyr Pro Thr Thr Met Lys Glu Gly Asn Arg Ser Ile Leu 2965 2970 2975 tgt tat ctc caa cat gtg cta cgc tat gag cga gag ata atc acg 9969Cys Tyr Leu Gln His Val Leu Arg Tyr Glu Arg Glu Ile Ile Thr 2980 2985 2990 gcg tct cca gag aat gac tgg cta tgg atc ttt tca gac ttt aga 10014Ala Ser Pro Glu Asn Asp Trp Leu Trp Ile Phe Ser Asp Phe Arg 2995 3000 3005 agt gcc aaa atg acg tac cta acc ctc att act tac cag tct cat 10059Ser Ala Lys Met Thr Tyr Leu Thr Leu Ile Thr Tyr Gln Ser His 3010 3015 3020 ctt cta ctc cag agg gtt gag aga aac cta tct aag agt atg aga 10104Leu Leu Leu Gln Arg Val Glu Arg Asn Leu Ser Lys Ser Met Arg 3025 3030 3035 gat aac ctg cga caa ttg agt tcc ttg atg agg cag gtg ctg ggc 10149Asp Asn Leu Arg Gln Leu Ser Ser Leu Met Arg Gln Val Leu Gly 3040 3045 3050 ggg cac gga gaa gat acc tta gag tca gac gac aac att caa cga 10194Gly His Gly Glu Asp Thr Leu Glu Ser Asp Asp Asn Ile Gln Arg 3055 3060 3065 ctg cta aaa gac tct tta cga agg aca aga tgg gtg gat caa gag 10239Leu Leu Lys Asp Ser Leu Arg Arg Thr Arg Trp Val Asp Gln Glu 3070 3075 3080 gtg cgc cat gca gct aga acc atg act gga gat tac agc ccc aac 10284Val Arg His Ala Ala Arg Thr Met Thr Gly Asp Tyr Ser Pro Asn 3085 3090 3095 aag aag gtg tcc cgt aag gta gga tgt tca gaa tgg gtc tgc tct 10329Lys Lys Val Ser Arg Lys Val Gly Cys Ser Glu Trp Val Cys Ser 3100 3105 3110 gct caa cag gtt gca gtc tct acc tca gca aac ccg gcc cct gtc 10374Ala Gln Gln Val Ala Val Ser Thr Ser Ala Asn Pro Ala Pro Val 3115 3120 3125 tcg gag ctt gac ata agg gcc ctc tct aag agg ttc cag aac cct 10419Ser Glu Leu Asp Ile Arg Ala Leu Ser Lys Arg Phe Gln Asn Pro 3130 3135 3140 ttg atc tcg ggc ttg aga gtg gtt cag tgg gca acc ggt gct cat 10464Leu Ile Ser Gly Leu Arg Val Val Gln Trp Ala Thr Gly Ala His 3145 3150 3155 tat aag ctt aag cct att cta gat gat ctc aat gtt ttc cca tct 10509Tyr Lys Leu Lys Pro Ile Leu Asp Asp Leu Asn Val Phe Pro Ser 3160 3165 3170 ctc tgc ctt gta gtt ggg gac ggg tca ggg ggg ata tca agg gca 10554Leu Cys Leu Val Val Gly Asp Gly Ser Gly Gly Ile Ser Arg Ala 3175 3180 3185 gtc ctc aac atg ttt cca gat gcc aag ctt gtg ttc aac agt ctc 10599Val Leu Asn Met Phe Pro Asp Ala Lys Leu Val Phe Asn Ser Leu 3190 3195 3200 tta gag gtg aat gac ctg atg gct tcc gga aca cat cca ctg cct 10644Leu Glu Val Asn Asp Leu Met Ala Ser Gly Thr His Pro Leu Pro 3205 3210 3215 cct tca gca atc atg agg gga gga aat gat atc gtc tcc aga gtg 10689Pro Ser Ala Ile Met Arg Gly Gly Asn Asp Ile Val Ser Arg Val 3220 3225 3230 ata gat ttt gac tca atc tgg gaa aaa ccg tcc gac ttg aga aac 10734Ile Asp Phe Asp Ser Ile Trp Glu Lys Pro Ser Asp Leu Arg Asn 3235 3240 3245 ttg gca acc tgg aaa tac ttc cag tca gtc caa aag cag gtc aac 10779Leu Ala Thr Trp Lys Tyr Phe Gln Ser Val Gln Lys Gln Val Asn 3250 3255 3260 atg tcc tat gac ctc att att tgc gat gca gaa gtt act gac att 10824Met Ser Tyr Asp Leu Ile Ile Cys Asp Ala Glu Val Thr Asp Ile 3265 3270 3275 gca tct atc aac cgg ata acc ctg tta atg tcc gat ttt gca ttg 10869Ala Ser Ile Asn Arg Ile Thr Leu Leu Met Ser Asp Phe Ala Leu 3280 3285 3290 tct ata gat gga cca ctc tat ttg gtc ttc aaa act tat ggg act 10914Ser Ile Asp Gly Pro Leu Tyr Leu Val Phe Lys Thr Tyr Gly Thr 3295 3300 3305 atg cta gta aat cca aac tac aag gct att caa cac ctg tca aga 10959Met Leu Val Asn Pro Asn Tyr Lys Ala Ile Gln His Leu Ser Arg 3310 3315 3320 gcg ttc ccc tcg gtc aca ggg ttt atc acc caa gta act tcg tct 11004Ala Phe Pro Ser Val Thr Gly Phe Ile Thr Gln Val Thr Ser Ser 3325 3330 3335 ttt tca tct gag ctc tac ctc cga ttc tcc aaa cga ggg aag ttt 11049Phe Ser Ser Glu Leu Tyr Leu Arg Phe Ser Lys Arg Gly Lys Phe 3340 3345 3350 ttc aga gat gct gag tac ttg acc tct tcc acc ctt cga gaa atg 11094Phe Arg Asp Ala Glu Tyr Leu Thr Ser Ser Thr Leu Arg Glu Met 3355 3360 3365 agc ctt gtg tta ttc aat tgt agc agc ccc aag agt gag atg cag 11139Ser Leu Val Leu Phe Asn Cys Ser Ser Pro Lys Ser Glu Met Gln 3370 3375 3380 aga gct cgt tcc ttg aac tat cag gat ctt gtg aga gga ttt cct 11184Arg Ala Arg Ser Leu Asn Tyr Gln Asp Leu Val Arg Gly Phe Pro 3385 3390 3395 gaa gaa atc ata tca aat cct tac aat gag atg atc ata act ctg 11229Glu Glu Ile Ile Ser Asn Pro Tyr Asn Glu Met Ile Ile Thr Leu 3400 3405 3410 att gac agt gat gta gaa tct ttt cta gtc cac aag atg gtt gat 11274Ile Asp Ser Asp Val Glu Ser Phe Leu Val His Lys Met Val Asp 3415 3420 3425 gat ctt gag tta cag agg gga act ctg tct aaa gtg gct atc att 11319Asp Leu Glu Leu Gln Arg Gly Thr Leu Ser Lys Val Ala Ile Ile 3430 3435 3440 ata gcc atc atg ata gtt ttc tcc aac aga gtc ttc aac gtt tcc 11364Ile Ala Ile Met Ile Val Phe Ser Asn Arg Val Phe Asn Val Ser 3445 3450 3455 aaa ccc cta act gac ccc ttg ttc tat cca ccg tct gat ccc aaa 11409Lys Pro Leu Thr Asp Pro Leu Phe Tyr Pro Pro Ser Asp Pro Lys 3460 3465 3470 atc ctg agg cac ttc aac ata tgt tgc agt act atg atg tat cta 11454Ile Leu Arg His Phe Asn Ile Cys Cys Ser Thr Met Met Tyr Leu 3475 3480 3485 tct act gct tta ggt gac gtc cct agc ttc gca aga ctt cac gac 11499Ser Thr Ala Leu Gly Asp Val Pro Ser Phe Ala Arg Leu His Asp 3490 3495 3500 ctg tat aac aga cct ata act tat tac ttc aga aag caa ttc att 11544Leu Tyr Asn Arg Pro Ile Thr Tyr Tyr Phe Arg Lys Gln Phe Ile 3505 3510 3515 cga ggg aac gtt tat cta tct tgg agt tgg tcc aac gac acc tca 11589Arg Gly Asn Val Tyr Leu Ser Trp Ser Trp Ser Asn Asp Thr Ser 3520 3525 3530 gtg ttc aaa agg gta gcc tgt aat tct agc ctg agt ctg tca tct 11634Val Phe Lys Arg Val Ala Cys Asn Ser Ser Leu Ser Leu Ser Ser 3535 3540 3545 cac tgg atc agg ttg att tac aag ata gtg aag act acc aga ctc 11679His Trp Ile Arg Leu Ile Tyr Lys Ile Val Lys Thr Thr Arg Leu 3550 3555 3560 gtt ggc agc atc aag gat cta tcc aga gaa gtg gaa aga cac ctt 11724Val Gly Ser Ile Lys Asp Leu Ser Arg Glu Val Glu Arg His Leu 3565 3570 3575 cat agg tac aac agg tgg atc acc cta gag gat atc aga tct aga 11769His Arg Tyr Asn Arg Trp Ile Thr Leu Glu Asp Ile Arg Ser Arg 3580 3585 3590 tca tcc cta cta gac tac agt tgc ctg tgaaccggat actcctggaa 11816Ser Ser Leu Leu Asp Tyr Ser Cys Leu 3595 3600 gcctgcccat gctaagactc ttgtgtgatg tatcttgaaa aaaacaagat cctaaatctg 11876aacctttggt tgtttgattg tttttctcat ttttgttgtt tatttgttaa gcgt 119302450PRTArtificial SequenceSynthetic Construct 2Met Asp Ala Asp Lys Ile Val Phe Lys Val Asn Asn Gln Val Val Ser 1 5 10 15 Leu Lys Pro Glu Ile Ile Val Asp Gln His Glu Tyr Lys Tyr Pro Ala 20 25 30 Ile Lys Asp Leu Lys Lys Pro Cys Ile Thr Leu Gly Lys Ala Pro Asp 35 40 45 Leu Asn Lys Ala Tyr Lys Ser Val Leu Ser Gly Met Ser Ala Ala Lys 50 55 60 Leu Asp Pro Asp Asp Val Cys Ser Tyr Leu Ala Ala Ala Met Gln Phe 65 70 75 80 Phe Glu Gly Thr Cys Pro Glu Asp Trp Thr Ser Tyr Gly Ile Val Ile 85 90 95 Ala Arg Lys Gly Asp Lys Ile Thr Pro Gly Ser Leu Val Glu Ile Lys 100 105 110 Arg Thr Asp Val Glu Gly Asn Trp Ala Leu Thr Gly Gly Met Glu Leu 115 120 125 Thr Arg Asp Pro Thr Val Pro Glu His Ala Ser Leu Val Gly Leu Leu 130 135 140 Leu Ser Leu Tyr Arg Leu Ser Lys Ile Ser Gly Gln Asn Thr Gly Asn 145 150 155 160 Tyr Lys Thr Asn Ile Ala Asp Arg Ile Glu Gln Ile Phe Glu Thr Ala 165 170 175 Pro Phe Val Lys Ile Val Glu His His Thr Leu Met Thr Thr His Lys 180 185 190 Met Cys Ala Asn Trp Ser Thr Ile Pro Asn Phe Arg Phe Leu Ala Gly 195 200 205 Thr Tyr Asp Met Phe Phe Ser Arg Ile Glu His Leu Tyr Ser Ala Ile 210 215 220

Arg Val Gly Thr Val Val Thr Ala Tyr Glu Asp Cys Ser Gly Leu Val 225 230 235 240 Ser Phe Thr Gly Phe Ile Lys Gln Ile Asn Leu Thr Ala Arg Glu Ala 245 250 255 Ile Leu Tyr Phe Phe His Lys Asn Phe Glu Glu Glu Ile Arg Arg Met 260 265 270 Phe Glu Pro Gly Gln Glu Thr Ala Val Pro His Ser Tyr Phe Ile His 275 280 285 Phe Arg Ser Leu Gly Leu Ser Gly Lys Ser Pro Tyr Ser Ser Asn Ala 290 295 300 Val Gly His Val Phe Asn Leu Ile His Phe Val Gly Cys Tyr Met Gly 305 310 315 320 Gln Val Arg Ser Leu Asn Ala Thr Val Ile Ala Ala Cys Ala Pro His 325 330 335 Glu Met Ser Val Leu Gly Gly Tyr Leu Gly Glu Glu Phe Phe Gly Lys 340 345 350 Gly Thr Phe Glu Arg Arg Phe Phe Arg Asp Glu Lys Glu Leu Gln Glu 355 360 365 Tyr Glu Ala Ala Glu Leu Thr Lys Thr Asp Val Ala Leu Ala Asp Asp 370 375 380 Gly Thr Val Asn Ser Asp Asp Glu Asp Tyr Phe Ser Gly Glu Thr Arg 385 390 395 400 Ser Pro Glu Ala Val Tyr Thr Arg Ile Met Met Asn Gly Gly Arg Leu 405 410 415 Lys Arg Ser His Ile Arg Arg Tyr Val Ser Val Ser Ser Asn His Gln 420 425 430 Ala Arg Pro Asn Ser Phe Ala Glu Phe Leu Asn Lys Thr Tyr Ser Ser 435 440 445 Asp Ser 450 3297PRTArtificial SequenceSynthetic Construct 3Met Ser Lys Ile Phe Val Asn Pro Ser Ala Ile Arg Ala Gly Leu Ala 1 5 10 15 Asp Leu Glu Met Ala Glu Glu Thr Val Asp Leu Ile Asn Arg Asn Ile 20 25 30 Glu Asp Asn Gln Ala His Leu Gln Gly Glu Pro Ile Glu Val Asp Asn 35 40 45 Leu Pro Glu Asp Met Gly Arg Leu His Leu Asp Asp Gly Lys Ser Pro 50 55 60 Asn Pro Gly Glu Met Ala Lys Val Gly Glu Gly Lys Tyr Arg Glu Asp 65 70 75 80 Phe Gln Met Asp Glu Gly Glu Asp Leu Ser Phe Leu Phe Gln Ser Tyr 85 90 95 Leu Glu Asn Val Gly Val Gln Ile Val Arg Gln Met Arg Ser Gly Glu 100 105 110 Arg Phe Leu Lys Ile Trp Ser Gln Thr Val Glu Glu Ile Ile Ser Tyr 115 120 125 Val Ala Val Asn Phe Pro Asn Pro Pro Gly Lys Ser Ser Glu Asp Lys 130 135 140 Ser Thr Gln Thr Thr Gly Arg Glu Leu Lys Lys Glu Thr Thr Pro Thr 145 150 155 160 Pro Ser Gln Arg Glu Ser Gln Ser Ser Lys Ala Arg Met Ala Ala Gln 165 170 175 Ile Ala Ser Gly Pro Pro Ala Leu Glu Trp Ser Ala Thr Asn Glu Glu 180 185 190 Asp Asp Leu Ser Val Glu Ala Glu Ile Ala His Gln Ile Ala Glu Ser 195 200 205 Phe Ser Lys Lys Tyr Lys Phe Pro Ser Arg Ser Ser Gly Ile Leu Leu 210 215 220 Tyr Asn Phe Glu Gln Leu Lys Met Asn Leu Asp Asp Ile Val Lys Glu 225 230 235 240 Ala Lys Asn Val Pro Gly Val Thr Arg Leu Ala His Asp Gly Ser Lys 245 250 255 Leu Pro Leu Arg Cys Val Leu Gly Trp Val Ala Leu Ala Asn Pro Lys 260 265 270 Lys Phe Gln Leu Leu Val Glu Ser Asp Lys Leu Ser Lys Ile Met Gln 275 280 285 Asp Asp Leu Asn Arg Tyr Thr Ser Cys 290 295 4202PRTArtificial SequenceSynthetic Construct 4Met Asn Phe Leu Arg Lys Ile Val Lys Asn Cys Arg Asp Glu Asp Thr 1 5 10 15 Gln Lys Pro Ser Pro Val Ser Ala Pro Leu Asp Asp Asp Asp Leu Trp 20 25 30 Leu Pro Pro Pro Glu Tyr Val Pro Leu Lys Glu Leu Thr Ser Lys Lys 35 40 45 Asn Met Arg Asn Phe Cys Ile Asn Gly Gly Val Lys Val Cys Ser Pro 50 55 60 Asn Gly Tyr Ser Phe Arg Ile Leu Arg His Ile Leu Lys Ser Phe Asp 65 70 75 80 Glu Ile Tyr Ser Gly Asn His Arg Met Ile Gly Leu Ala Lys Val Val 85 90 95 Ile Gly Leu Ala Leu Ser Gly Ser Pro Val Pro Glu Gly Met Asn Trp 100 105 110 Val Tyr Lys Leu Arg Arg Thr Phe Ile Phe Gln Trp Ala Asp Ser Arg 115 120 125 Gly Pro Leu Glu Gly Glu Glu Leu Glu Tyr Ser Gln Glu Ile Thr Trp 130 135 140 Asp Asp Asp Thr Glu Phe Val Gly Leu Gln Ile Arg Val Ile Ala Lys 145 150 155 160 Gln Cys His Ile Gln Gly Arg Ile Trp Cys Ile Asn Met Asn Pro Arg 165 170 175 Ala Cys Gln Leu Trp Ser Asp Met Ser Leu Gln Thr Gln Arg Ser Glu 180 185 190 Glu Asp Lys Asp Ser Ser Leu Leu Leu Glu 195 200 5524PRTArtificial SequenceSynthetic Construct 5Met Val Pro Gln Ala Leu Leu Phe Val Pro Leu Leu Val Phe Pro Leu 1 5 10 15 Cys Phe Gly Lys Phe Pro Ile Tyr Thr Ile Pro Asp Lys Leu Gly Pro 20 25 30 Trp Ser Pro Ile Asp Ile His His Leu Ser Cys Pro Asn Asn Leu Val 35 40 45 Val Glu Asp Glu Gly Cys Thr Asn Leu Ser Gly Phe Ser Tyr Met Glu 50 55 60 Leu Lys Val Gly Tyr Ile Leu Ala Ile Lys Met Asn Gly Phe Thr Cys 65 70 75 80 Thr Gly Val Val Thr Glu Ala Glu Thr Tyr Thr Asn Phe Val Gly Tyr 85 90 95 Val Thr Thr Thr Phe Lys Arg Lys His Phe Arg Pro Thr Pro Asp Ala 100 105 110 Cys Arg Ala Ala Tyr Asn Trp Lys Met Ala Gly Asp Pro Arg Tyr Glu 115 120 125 Glu Ser Leu His Asn Pro Tyr Pro Asp Tyr His Trp Leu Arg Thr Val 130 135 140 Lys Thr Thr Lys Glu Ser Leu Val Ile Ile Ser Pro Ser Val Ala Asp 145 150 155 160 Leu Asp Pro Tyr Asp Arg Ser Leu His Ser Arg Val Phe Pro Ser Gly 165 170 175 Lys Cys Ser Gly Val Ala Val Ser Ser Thr Tyr Cys Ser Thr Asn His 180 185 190 Asp Tyr Thr Ile Trp Met Pro Glu Asn Pro Arg Leu Gly Met Ser Cys 195 200 205 Asp Ile Phe Thr Asn Ser Arg Gly Lys Arg Ala Ser Lys Gly Ser Glu 210 215 220 Thr Cys Gly Phe Val Asp Glu Arg Gly Leu Tyr Lys Ser Leu Lys Gly 225 230 235 240 Ala Cys Lys Leu Lys Leu Cys Gly Val Leu Gly Leu Arg Leu Met Asp 245 250 255 Gly Thr Trp Val Ala Met Gln Thr Ser Asn Glu Thr Lys Trp Cys Pro 260 265 270 Pro Asp Gln Leu Val Asn Leu His Asp Phe Arg Ser Asp Glu Ile Glu 275 280 285 His Leu Val Val Glu Glu Leu Val Arg Lys Arg Glu Glu Cys Leu Asp 290 295 300 Ala Leu Glu Ser Ile Met Thr Thr Lys Ser Val Ser Phe Arg Arg Pro 305 310 315 320 Ser His Leu Arg Lys Leu Val Pro Gly Phe Gly Lys Ala Tyr Thr Ile 325 330 335 Phe Asn Lys Thr Leu Met Glu Ala Asp Ala His Tyr Lys Ser Val Glu 340 345 350 Thr Trp Asn Glu Ile Leu Pro Ser Lys Gly Cys Leu Arg Val Gly Gly 355 360 365 Arg Cys His Pro His Val Asn Gly Val Phe Phe Asn Gly Ile Ile Leu 370 375 380 Gly Pro Asp Gly Asn Val Leu Ile Pro Glu Met Gln Ser Ser Leu Leu 385 390 395 400 Gln Gln His Met Glu Leu Leu Glu Ser Ser Val Ile Pro Leu Val His 405 410 415 Pro Leu Ala Asp Pro Ser Thr Val Phe Lys Asp Gly Asp Glu Ala Glu 420 425 430 Asp Phe Val Glu Val His Leu Pro Asp Val His Asn Gln Val Ser Gly 435 440 445 Val Asp Leu Gly Leu Pro Asn Trp Gly Lys Tyr Val Leu Leu Ser Ala 450 455 460 Gly Ala Leu Thr Ala Leu Met Leu Ile Ile Phe Leu Met Thr Cys Cys 465 470 475 480 Arg Arg Val Asn Arg Ser Glu Pro Thr Gln His Asn Leu Arg Gly Thr 485 490 495 Gly Arg Glu Val Ser Val Thr Pro Gln Ser Gly Lys Ile Ile Ser Ser 500 505 510 Trp Glu Ser His Lys Ser Gly Gly Glu Thr Arg Leu 515 520 62127PRTArtificial SequenceSynthetic Construct 6Met Leu Asp Pro Gly Glu Val Tyr Asp Asp Pro Ile Asp Pro Ile Glu 1 5 10 15 Leu Glu Asp Glu Pro Arg Gly Thr Pro Thr Val Pro Asn Ile Leu Arg 20 25 30 Asn Ser Asp Tyr Asn Leu Asn Ser Pro Leu Ile Glu Asp Pro Ala Arg 35 40 45 Leu Met Leu Glu Trp Leu Lys Thr Gly Asn Arg Pro Tyr Arg Met Thr 50 55 60 Leu Thr Asp Asn Cys Ser Arg Ser Phe Arg Val Leu Lys Asp Tyr Phe 65 70 75 80 Lys Lys Val Asp Leu Gly Ser Leu Lys Val Gly Gly Met Ala Ala Gln 85 90 95 Ser Met Ile Ser Leu Trp Leu Tyr Gly Ala His Ser Glu Ser Asn Arg 100 105 110 Ser Arg Arg Cys Ile Thr Asp Leu Ala His Phe Tyr Ser Lys Ser Ser 115 120 125 Pro Ile Glu Lys Leu Leu Asn Leu Thr Leu Gly Asn Arg Gly Leu Arg 130 135 140 Ile Pro Pro Glu Gly Val Leu Ser Cys Leu Glu Arg Val Asp Tyr Asp 145 150 155 160 Asn Ala Phe Gly Arg Tyr Leu Ala Asn Thr Tyr Ser Ser Tyr Leu Phe 165 170 175 Phe His Val Ile Thr Leu Tyr Met Asn Ala Leu Asp Trp Asp Glu Glu 180 185 190 Lys Thr Ile Leu Ala Leu Trp Lys Asp Leu Thr Ser Val Asp Ile Gly 195 200 205 Lys Asp Leu Val Lys Phe Lys Asp Gln Ile Trp Gly Leu Pro Ile Val 210 215 220 Thr Lys Asp Phe Val Tyr Ser Gln Ser Ser Asn Cys Leu Phe Asp Arg 225 230 235 240 Asn Tyr Thr Leu Met Leu Lys Glu Leu Phe Leu Ser Arg Phe Asn Ser 245 250 255 Leu Met Val Leu Leu Ser Pro Pro Glu Pro Arg Tyr Ser Asp Asp Leu 260 265 270 Ile Ser Gln Leu Cys Gln Leu Tyr Ile Ala Gly Asp Gln Val Leu Ser 275 280 285 Met Cys Gly Asn Ser Gly Tyr Glu Val Ile Lys Ile Leu Glu Pro Tyr 290 295 300 Val Val Asn Ser Leu Val Gln Arg Ala Glu Lys Phe Arg Pro Leu Ile 305 310 315 320 His Ser Leu Gly Asp Phe Pro Val Phe Ile Lys Asp Lys Val Ser Gln 325 330 335 Leu Glu Glu Thr Phe Gly Pro Cys Ala Arg Arg Phe Phe Arg Ala Leu 340 345 350 Asp Gln Phe Asp Asn Ile His Asp Leu Val Phe Val Tyr Gly Cys Tyr 355 360 365 Arg His Trp Gly His Pro Tyr Ile Asp Tyr Arg Lys Gly Leu Ser Lys 370 375 380 Leu Tyr Asp Gln Val His Ile Lys Lys Val Ile Asp Lys Ser Tyr Gln 385 390 395 400 Glu Cys Leu Ala Ser Asp Leu Ala Arg Arg Ile Leu Arg Trp Gly Phe 405 410 415 Asp Lys Tyr Ser Lys Trp Tyr Leu Asp Ser Arg Phe Leu Ala Arg Asp 420 425 430 His Pro Leu Thr Pro Tyr Ile Lys Thr Gln Thr Trp Pro Pro Lys His 435 440 445 Ile Val Asp Leu Val Gly Asp Thr Trp His Lys Leu Pro Ile Thr Gln 450 455 460 Ile Phe Glu Ile Pro Glu Ser Met Asp Pro Ser Glu Ile Leu Asp Asp 465 470 475 480 Lys Ser His Ser Phe Thr Arg Thr Arg Leu Ala Ser Trp Leu Ser Glu 485 490 495 Asn Arg Gly Gly Pro Val Pro Ser Glu Lys Val Ile Ile Thr Ala Leu 500 505 510 Ser Lys Pro Pro Val Asn Pro Arg Glu Phe Leu Arg Ser Ile Asp Leu 515 520 525 Gly Gly Leu Pro Asp Glu Asp Leu Ile Ile Gly Leu Lys Pro Lys Glu 530 535 540 Arg Glu Leu Lys Ile Glu Gly Arg Phe Phe Ala Leu Met Ser Trp Asn 545 550 555 560 Leu Arg Leu Tyr Phe Val Ile Thr Glu Lys Leu Leu Ala Asn Tyr Ile 565 570 575 Leu Pro Leu Phe Asp Ala Leu Thr Met Thr Asp Asn Leu Asn Lys Val 580 585 590 Phe Lys Lys Leu Ile Asp Arg Val Thr Gly Gln Gly Leu Leu Asp Tyr 595 600 605 Ser Arg Val Thr Tyr Ala Phe His Leu Asp Tyr Glu Lys Trp Asn Asn 610 615 620 His Gln Arg Leu Glu Ser Thr Glu Asp Val Phe Ser Val Leu Asp Gln 625 630 635 640 Val Phe Gly Leu Lys Arg Val Phe Ser Arg Thr His Glu Phe Phe Gln 645 650 655 Lys Ala Trp Ile Tyr Tyr Ser Asp Arg Ser Asp Leu Ile Gly Leu Arg 660 665 670 Glu Asp Gln Ile Tyr Cys Leu Asp Ala Ser Asn Gly Pro Thr Cys Trp 675 680 685 Asn Gly Gln Asp Gly Gly Leu Glu Gly Leu Arg Gln Lys Gly Trp Ser 690 695 700 Leu Val Ser Leu Leu Met Ile Asp Arg Glu Ser Gln Ile Arg Asn Thr 705 710 715 720 Arg Thr Lys Ile Leu Ala Gln Gly Asp Asn Gln Val Leu Cys Pro Thr 725 730 735 Tyr Met Leu Ser Pro Gly Leu Ser Gln Glu Gly Leu Leu Tyr Glu Leu 740 745 750 Glu Arg Ile Ser Arg Asn Ala Leu Ser Ile Tyr Arg Ala Val Glu Glu 755 760 765 Gly Ala Ser Lys Leu Gly Leu Ile Ile Lys Lys Glu Glu Thr Met Cys 770 775 780 Ser Tyr Asp Phe Leu Ile Tyr Gly Lys Thr Pro Leu Phe Arg Gly Asn 785 790 795 800 Ile Leu Val Pro Glu Ser Lys Arg Trp Ala Arg Val Ser Cys Val Ser 805 810 815 Asn Asp Gln Ile Val Asn Leu Ala Asn Ile Met Ser Thr Val Ser Thr 820 825 830 Asn Ala Leu Thr Val Ala Gln His Ser Gln Ser Leu Ile Lys Pro Met 835 840 845 Arg Asp Phe Leu Leu Met Ser Val Gln Ala Val Phe His Tyr Leu Leu 850 855 860 Phe Ser Pro Ile Leu Lys Gly Arg Val Tyr Lys Ile Leu Ser Ala Glu 865 870 875 880 Gly Asp Ser Phe Leu Leu Ala Met Ser Arg Ile Ile Tyr Leu Asp Pro 885 890 895 Ser Leu Gly Gly Val Ser Gly Met Ser Leu Gly Arg Phe His Ile Arg 900 905 910 Gln Phe Ser Asp Pro Val Ser Glu Gly Leu Ser Phe Trp Arg Glu Ile 915 920 925 Trp Leu Ser Ser His Glu Ser Trp Ile His Ala Leu Cys Gln Glu Ala 930 935 940 Gly Asn Pro Asp Leu Gly Glu Arg Thr Leu Glu Ser Phe Thr Arg Leu 945 950 955 960 Leu Glu Asp Pro Thr Thr Leu Asn Ile Arg Gly Gly Ala Ser Pro Thr 965 970 975 Ile Leu Leu Lys Asp Ala Ile Arg Lys Ala Leu Tyr Asp Glu Val Asp 980 985 990 Lys Val Glu Asn Ser Glu Phe Arg Glu Ala Ile Leu Leu Ser Lys Thr 995 1000 1005 His Arg Asp Asn Phe Ile Leu Phe Leu Thr Ser Val Glu Pro Leu 1010 1015 1020 Phe Pro Arg Phe Leu Ser Glu Leu Phe Ser Ser Ser Phe Leu Gly

1025 1030 1035 Ile Pro Glu Ser Ile Ile Gly Leu Ile Gln Asn Ser Arg Thr Ile 1040 1045 1050 Arg Arg Gln Phe Arg Lys Ser Leu Ser Lys Thr Leu Glu Glu Ser 1055 1060 1065 Phe Tyr Asn Ser Glu Ile His Gly Ile Ser Arg Met Thr Gln Thr 1070 1075 1080 Pro Gln Arg Val Gly Gly Val Trp Pro Cys Ser Ser Glu Arg Ala 1085 1090 1095 Asp Leu Leu Arg Glu Ile Ser Trp Gly Arg Lys Val Val Gly Thr 1100 1105 1110 Thr Val Pro His Pro Ser Glu Met Leu Gly Leu Leu Pro Lys Ser 1115 1120 1125 Ser Ile Ser Cys Thr Cys Gly Ala Thr Gly Gly Gly Asn Pro Arg 1130 1135 1140 Val Ser Val Ser Val Leu Pro Ser Phe Asp Gln Ser Phe Phe Ser 1145 1150 1155 Arg Gly Pro Leu Lys Gly Tyr Leu Gly Ser Ser Thr Ser Met Ser 1160 1165 1170 Thr Gln Leu Phe His Ala Trp Glu Lys Val Thr Asn Val His Val 1175 1180 1185 Val Lys Arg Ala Leu Ser Leu Lys Glu Ser Ile Asn Trp Phe Ile 1190 1195 1200 Thr Arg Asp Ser Asn Leu Ala Gln Ala Leu Ile Arg Asn Ile Met 1205 1210 1215 Ser Leu Thr Gly Pro Asp Phe Pro Leu Glu Glu Ala Pro Val Phe 1220 1225 1230 Lys Arg Thr Gly Ser Ala Leu His Arg Phe Lys Ser Ala Arg Tyr 1235 1240 1245 Ser Glu Gly Gly Tyr Ser Ser Val Cys Pro Asn Leu Leu Ser His 1250 1255 1260 Ile Ser Val Ser Thr Asp Thr Met Ser Asp Leu Thr Gln Asp Gly 1265 1270 1275 Lys Asn Tyr Asp Phe Met Phe Gln Pro Leu Met Leu Tyr Ala Gln 1280 1285 1290 Thr Trp Thr Ser Glu Leu Val Gln Arg Asp Thr Arg Leu Arg Asp 1295 1300 1305 Ser Thr Phe His Trp His Leu Arg Cys Asn Arg Cys Val Arg Pro 1310 1315 1320 Ile Asp Asp Val Thr Leu Glu Thr Ser Gln Ile Phe Glu Phe Pro 1325 1330 1335 Asp Val Ser Lys Arg Ile Ser Arg Met Val Ser Gly Ala Val Pro 1340 1345 1350 His Phe Gln Arg Leu Pro Asp Ile Arg Leu Arg Pro Gly Asp Phe 1355 1360 1365 Glu Ser Leu Ser Gly Arg Glu Lys Ser His His Ile Gly Ser Ala 1370 1375 1380 Gln Gly Leu Leu Tyr Ser Ile Leu Val Ala Ile His Asp Ser Gly 1385 1390 1395 Tyr Asn Asp Gly Thr Ile Phe Pro Val Asn Ile Tyr Asp Lys Val 1400 1405 1410 Ser Pro Arg Asp Tyr Leu Arg Gly Leu Ala Arg Gly Val Leu Ile 1415 1420 1425 Gly Ser Ser Ile Cys Phe Leu Thr Arg Met Thr Asn Ile Asn Ile 1430 1435 1440 Asn Arg Pro Leu Glu Leu Ile Ser Gly Val Ile Ser Tyr Ile Leu 1445 1450 1455 Leu Arg Leu Asp Asn His Pro Ser Leu Tyr Ile Met Leu Arg Glu 1460 1465 1470 Pro Ser Leu Arg Gly Glu Ile Phe Ser Ile Pro Gln Lys Ile Pro 1475 1480 1485 Ala Ala Tyr Pro Thr Thr Met Lys Glu Gly Asn Arg Ser Ile Leu 1490 1495 1500 Cys Tyr Leu Gln His Val Leu Arg Tyr Glu Arg Glu Ile Ile Thr 1505 1510 1515 Ala Ser Pro Glu Asn Asp Trp Leu Trp Ile Phe Ser Asp Phe Arg 1520 1525 1530 Ser Ala Lys Met Thr Tyr Leu Thr Leu Ile Thr Tyr Gln Ser His 1535 1540 1545 Leu Leu Leu Gln Arg Val Glu Arg Asn Leu Ser Lys Ser Met Arg 1550 1555 1560 Asp Asn Leu Arg Gln Leu Ser Ser Leu Met Arg Gln Val Leu Gly 1565 1570 1575 Gly His Gly Glu Asp Thr Leu Glu Ser Asp Asp Asn Ile Gln Arg 1580 1585 1590 Leu Leu Lys Asp Ser Leu Arg Arg Thr Arg Trp Val Asp Gln Glu 1595 1600 1605 Val Arg His Ala Ala Arg Thr Met Thr Gly Asp Tyr Ser Pro Asn 1610 1615 1620 Lys Lys Val Ser Arg Lys Val Gly Cys Ser Glu Trp Val Cys Ser 1625 1630 1635 Ala Gln Gln Val Ala Val Ser Thr Ser Ala Asn Pro Ala Pro Val 1640 1645 1650 Ser Glu Leu Asp Ile Arg Ala Leu Ser Lys Arg Phe Gln Asn Pro 1655 1660 1665 Leu Ile Ser Gly Leu Arg Val Val Gln Trp Ala Thr Gly Ala His 1670 1675 1680 Tyr Lys Leu Lys Pro Ile Leu Asp Asp Leu Asn Val Phe Pro Ser 1685 1690 1695 Leu Cys Leu Val Val Gly Asp Gly Ser Gly Gly Ile Ser Arg Ala 1700 1705 1710 Val Leu Asn Met Phe Pro Asp Ala Lys Leu Val Phe Asn Ser Leu 1715 1720 1725 Leu Glu Val Asn Asp Leu Met Ala Ser Gly Thr His Pro Leu Pro 1730 1735 1740 Pro Ser Ala Ile Met Arg Gly Gly Asn Asp Ile Val Ser Arg Val 1745 1750 1755 Ile Asp Phe Asp Ser Ile Trp Glu Lys Pro Ser Asp Leu Arg Asn 1760 1765 1770 Leu Ala Thr Trp Lys Tyr Phe Gln Ser Val Gln Lys Gln Val Asn 1775 1780 1785 Met Ser Tyr Asp Leu Ile Ile Cys Asp Ala Glu Val Thr Asp Ile 1790 1795 1800 Ala Ser Ile Asn Arg Ile Thr Leu Leu Met Ser Asp Phe Ala Leu 1805 1810 1815 Ser Ile Asp Gly Pro Leu Tyr Leu Val Phe Lys Thr Tyr Gly Thr 1820 1825 1830 Met Leu Val Asn Pro Asn Tyr Lys Ala Ile Gln His Leu Ser Arg 1835 1840 1845 Ala Phe Pro Ser Val Thr Gly Phe Ile Thr Gln Val Thr Ser Ser 1850 1855 1860 Phe Ser Ser Glu Leu Tyr Leu Arg Phe Ser Lys Arg Gly Lys Phe 1865 1870 1875 Phe Arg Asp Ala Glu Tyr Leu Thr Ser Ser Thr Leu Arg Glu Met 1880 1885 1890 Ser Leu Val Leu Phe Asn Cys Ser Ser Pro Lys Ser Glu Met Gln 1895 1900 1905 Arg Ala Arg Ser Leu Asn Tyr Gln Asp Leu Val Arg Gly Phe Pro 1910 1915 1920 Glu Glu Ile Ile Ser Asn Pro Tyr Asn Glu Met Ile Ile Thr Leu 1925 1930 1935 Ile Asp Ser Asp Val Glu Ser Phe Leu Val His Lys Met Val Asp 1940 1945 1950 Asp Leu Glu Leu Gln Arg Gly Thr Leu Ser Lys Val Ala Ile Ile 1955 1960 1965 Ile Ala Ile Met Ile Val Phe Ser Asn Arg Val Phe Asn Val Ser 1970 1975 1980 Lys Pro Leu Thr Asp Pro Leu Phe Tyr Pro Pro Ser Asp Pro Lys 1985 1990 1995 Ile Leu Arg His Phe Asn Ile Cys Cys Ser Thr Met Met Tyr Leu 2000 2005 2010 Ser Thr Ala Leu Gly Asp Val Pro Ser Phe Ala Arg Leu His Asp 2015 2020 2025 Leu Tyr Asn Arg Pro Ile Thr Tyr Tyr Phe Arg Lys Gln Phe Ile 2030 2035 2040 Arg Gly Asn Val Tyr Leu Ser Trp Ser Trp Ser Asn Asp Thr Ser 2045 2050 2055 Val Phe Lys Arg Val Ala Cys Asn Ser Ser Leu Ser Leu Ser Ser 2060 2065 2070 His Trp Ile Arg Leu Ile Tyr Lys Ile Val Lys Thr Thr Arg Leu 2075 2080 2085 Val Gly Ser Ile Lys Asp Leu Ser Arg Glu Val Glu Arg His Leu 2090 2095 2100 His Arg Tyr Asn Arg Trp Ile Thr Leu Glu Asp Ile Arg Ser Arg 2105 2110 2115 Ser Ser Leu Leu Asp Tyr Ser Cys Leu 2120 2125 7524PRTRabies virus ERA strain 7Met Val Pro Gln Ala Leu Leu Phe Val Pro Leu Leu Val Phe Pro Leu 1 5 10 15 Cys Phe Gly Lys Phe Pro Ile Tyr Thr Ile Pro Asp Lys Leu Gly Pro 20 25 30 Trp Ser Pro Ile Asp Ile His His Leu Ser Cys Pro Asn Asn Leu Val 35 40 45 Val Glu Asp Glu Gly Cys Thr Asn Leu Ser Gly Phe Ser Tyr Met Glu 50 55 60 Leu Lys Val Gly Tyr Ile Leu Ala Ile Lys Met Asn Gly Phe Thr Cys 65 70 75 80 Thr Gly Val Val Thr Glu Ala Glu Thr Tyr Thr Asn Phe Val Gly Tyr 85 90 95 Val Thr Thr Thr Phe Lys Arg Lys His Phe Arg Pro Thr Pro Asp Ala 100 105 110 Cys Arg Ala Ala Tyr Asn Trp Lys Met Ala Gly Asp Pro Arg Tyr Glu 115 120 125 Glu Ser Leu His Asn Pro Tyr Pro Asp Tyr His Trp Leu Arg Thr Val 130 135 140 Lys Thr Thr Lys Glu Ser Leu Val Ile Ile Ser Pro Ser Val Ala Asp 145 150 155 160 Leu Asp Pro Tyr Asp Arg Ser Leu His Ser Arg Val Phe Pro Ser Gly 165 170 175 Lys Cys Ser Gly Val Ala Val Ser Ser Thr Tyr Cys Ser Thr Asn His 180 185 190 Asp Tyr Thr Ile Trp Met Pro Glu Asn Pro Arg Leu Gly Met Ser Cys 195 200 205 Asp Ile Phe Thr Asn Ser Arg Gly Lys Arg Ala Ser Lys Gly Ser Glu 210 215 220 Thr Cys Gly Phe Val Asp Glu Arg Gly Leu Tyr Lys Ser Leu Lys Gly 225 230 235 240 Ala Cys Lys Leu Lys Leu Cys Gly Val Leu Gly Leu Arg Leu Met Asp 245 250 255 Gly Thr Trp Val Ala Met Gln Thr Ser Asn Glu Thr Lys Trp Cys Pro 260 265 270 Pro Asp Gln Leu Val Asn Leu His Asp Phe Arg Ser Asp Glu Ile Glu 275 280 285 His Leu Val Val Glu Glu Leu Val Arg Lys Arg Glu Glu Cys Leu Asp 290 295 300 Ala Leu Glu Ser Ile Met Thr Thr Lys Ser Val Ser Phe Arg Arg Pro 305 310 315 320 Ser His Leu Arg Lys Leu Val Pro Gly Phe Gly Lys Ala Tyr Thr Ile 325 330 335 Phe Asn Lys Thr Leu Met Glu Ala Asp Ala His Tyr Lys Ser Val Arg 340 345 350 Thr Trp Asn Glu Ile Leu Pro Ser Lys Gly Cys Leu Arg Val Gly Gly 355 360 365 Arg Cys His Pro His Val Asn Gly Val Phe Phe Asn Gly Ile Ile Leu 370 375 380 Gly Pro Asp Gly Asn Val Leu Ile Pro Glu Met Gln Ser Ser Leu Leu 385 390 395 400 Gln Gln His Met Glu Leu Leu Glu Ser Ser Val Ile Pro Leu Val His 405 410 415 Pro Leu Ala Asp Pro Ser Thr Val Phe Lys Asp Gly Asp Glu Ala Glu 420 425 430 Asp Phe Val Glu Val His Leu Pro Asp Val His Asn Gln Val Ser Gly 435 440 445 Val Asp Leu Gly Leu Pro Asn Trp Gly Lys Tyr Val Leu Leu Ser Ala 450 455 460 Gly Ala Leu Thr Ala Leu Met Leu Ile Ile Phe Leu Met Thr Cys Cys 465 470 475 480 Arg Arg Val Asn Arg Ser Glu Pro Thr Gln His Asn Leu Arg Gly Thr 485 490 495 Gly Arg Glu Val Ser Val Thr Pro Gln Ser Gly Lys Ile Ile Ser Ser 500 505 510 Trp Glu Ser His Lys Ser Gly Gly Glu Thr Arg Leu 515 520 832DNAArtificial SequenceSynthetic oligonucleotide 8ccgggtacca cgcttaacaa ccagatcaaa ga 32931DNAArtificial SequenceSynthetic oligonucleotide 9taggtcgctt gctaagcact cctggtagga c 311031DNAArtificial SequenceSynthetic oligonucleotide 10gtcctaccag gagtgcttag caagcgacct a 311134DNAArtificial SequenceSynthetic oligonucleotide 11aaaactgcag acgcttaaca aataaacaac aaaa 341279DNAArtificial SequenceSynthetic oligonucleotide 12caaggctagc tgttaagcgt ctgatgagtc cgtgaggacg aaactatagg aaaggaattc 60ctatagtcgg taccacgct 791379DNAArtificial SequenceSynthetic oligonucleotide 13agcgtggtac cgactatagg aattcctttc ctatagtttc gtcctcacgg actcatcaga 60cgcttaacag ctagccttg 7914108DNAArtificial SequenceSynthetic oligonucleotide 14gacctgcagg ggtcggcatg gcatctccac ctcctcgcgg tccgacctgg gcatccgaag 60gaggacgcac gtccactcgg atggctaagg gagggcgcgg ccgcactc 10815108DNAArtificial SequenceSynthetic oligonucleotide 15gagtgcggcc gcgccctccc ttagccatcc gagtggacgt gcgtcctcct tcggatgccc 60aggtcggacc gcgaggaggt ggagatgcca tgccgacccc tgcaggtc 1081625DNAArtificial SequenceSynthetic oligonucleotide 16accaccatgg atgccgacaa gattg 251733DNAArtificial SequenceSynthetic oligonucleotide 17ggcccatggt tatgagtcac tcgaatatgt ctt 331833DNAArtificial SequenceSynthetic oligonucleotide 18ttggtaccac catgagcaag atctttgtca atc 331934DNAArtificial SequenceSynthetic oligonucleotide 19ggagaggaat tcttagcaag atgtatagcg attc 342032DNAArtificial SequenceSynthetic oligonucleotide 20ttggtaccac catggttcct caggctctcc tg 322133DNAArtificial SequenceSynthetic oligonucleotide 21aaaactgcag tcacagtctg gtctcacccc cac 332236DNAArtificial SequenceSynthetic oligonucleotide 22accgctagca ccaccatgct cgatcctgga gaggtc 362334DNAArtificial SequenceSynthetic oligonucleotide 23aaaactgcag tcacaggcaa ctgtagtcta gtag 342438DNAArtificial SequenceSynthetic oligonucleotide 24tcgctagcac caccatgaac acgattaaca tcgctaag 382533DNAArtificial SequenceSynthetic oligonucleotide 25gatgaattct tacgcgaacg cgaagtccga ctc 332663DNAArtificial SequenceSynthetic oligonucleotide 26tcgctagcca ccatgccaaa aaagaagaga aaggtagaaa acacgattaa catcgctaag 60aac 632731DNAArtificial SequenceSynthetic oligonucleotide 27aaaactgcag gccaccatgg gcgtgatcaa g 312831DNAArtificial SequenceSynthetic oligonucleotide 28ccgctcggta cctattagcc ggcctggcgg g 312946DNAArtificial SequenceSynthetic oligonucleotide 29ccctctgcag tttggtaccg tcgagaaaaa aacattagat cagaag 463024DNAArtificial SequenceSynthetic oligonucleotide 30atgaactttc tacgtaagat agtg 243146DNAArtificial SequenceSynthetic oligonucleotide 31caaactgcag aggggtgtta gtttttttca aaaagaaccc cccaag 463244DNAArtificial SequenceSynthetic oligonucleotide 32caaactgcag aggggtgtta gtttttttca catccaagag gatc 443342DNAArtificial SequenceSynthetic oligonucleotide 33cctctgcagt ttggtacctt gaaaaaaacc tgggttcaat ag 423435DNAArtificial SequenceSynthetic oligonucleotide 34ctcactacaa gtcagtcgag acttggaatg agatc 353535DNAArtificial SequenceSynthetic oligonucleotide 35gactgacttt gagtgagcat cggcttccat caagg 353660DNAArtificial SequenceSynthetic oligonucleotide 36ccaaactgca gcgaaaggag gggtgttagt ttttttcatg atgaaccccc caaggggagg 603739DNAArtificial SequenceSynthetic oligonucleotide 37gactcactat agggagaccc aagctggcta gctgttaag 393860DNAArtificial SequenceSynthetic oligonucleotide 38ccaaactgca gcgaaaggag gggtgttagt ttttttcatg ttgactttag gacatctcgg 603961DNAArtificial SequenceSynthetic oligonucleotide 39cctttcgctg cagtttggta ccgtcgagaa aaaaacaggc aacaccactg ataaaatgaa 60c

614030DNAArtificial SequenceSynthetic oligonucleotide 40cctccccttc aagagggccc ctggaatcag 304146DNAArtificial SequenceSynthetic oligonucleotide 41ctaacacccc tcctttcgct gcagtttggt accgtcgaga aaaaaa 464253DNAArtificial SequenceSynthetic oligonucleotide 42tttttttgat tgtggggagg aaagcgacgt caaaccatgg cagctctttt ttt 534337DNAArtificial SequenceSynthetic oligonucleotide 43cgactgcaga tgaatatacc ttgctttgtt gtgattc 374439DNAArtificial SequenceSynthetic oligonucleotide 44cgtggtacct catgtacctg gaagcccttt ataggactc 394533DNAArtificial SequenceSynthetic oligonucleotide 45catctgctag caatggcttc ctactttgcg ttg 334633DNAArtificial SequenceSynthetic oligonucleotide 46ttcaatggta ccttattggg cagtttgtcc ctt 33471569DNAMokola virusCDS(1)..(1569) 47atg aat ata cct tgc ttt gtt gtg att ctt gga ttc aca act aca tat 48Met Asn Ile Pro Cys Phe Val Val Ile Leu Gly Phe Thr Thr Thr Tyr 1 5 10 15 tct ctt ggg gaa ttt cct ttg tac aca att ccc gag aag ata gag aaa 96Ser Leu Gly Glu Phe Pro Leu Tyr Thr Ile Pro Glu Lys Ile Glu Lys 20 25 30 tgg acc cca ata gac atg atc cat cta agc tgc ccc aac aac tta tta 144Trp Thr Pro Ile Asp Met Ile His Leu Ser Cys Pro Asn Asn Leu Leu 35 40 45 tcc gag gag gaa ggt tgc aat aca gag tcg ccc ctc acc tac ttc gag 192Ser Glu Glu Glu Gly Cys Asn Thr Glu Ser Pro Leu Thr Tyr Phe Glu 50 55 60 ctc aag agt ggt tac tta gct cat cag aaa gtt ccg ggg ttt acc tgt 240Leu Lys Ser Gly Tyr Leu Ala His Gln Lys Val Pro Gly Phe Thr Cys 65 70 75 80 aca ggg gta gtg aat gag gcg gag aca tac aca aat ttt gtc ggg tat 288Thr Gly Val Val Asn Glu Ala Glu Thr Tyr Thr Asn Phe Val Gly Tyr 85 90 95 gtc acc aca aac ttc aaa aga aaa cac ttt aag cct aca gtc tcc gcc 336Val Thr Thr Asn Phe Lys Arg Lys His Phe Lys Pro Thr Val Ser Ala 100 105 110 tgt cgt gat gcc tac aac tgg aaa gcg tcc ggg gat ccc agg tat gag 384Cys Arg Asp Ala Tyr Asn Trp Lys Ala Ser Gly Asp Pro Arg Tyr Glu 115 120 125 gag tca ctg cac act cct tac cct gac agc agc tgg ttg aga act gta 432Glu Ser Leu His Thr Pro Tyr Pro Asp Ser Ser Trp Leu Arg Thr Val 130 135 140 act acc acc aaa gaa tcc ctt ctt ata ata tcg cct agc atc gtg gag 480Thr Thr Thr Lys Glu Ser Leu Leu Ile Ile Ser Pro Ser Ile Val Glu 145 150 155 160 atg gat gta tat ggc agg act ctc cat tcc ccc atg ttc cct tca ggg 528Met Asp Val Tyr Gly Arg Thr Leu His Ser Pro Met Phe Pro Ser Gly 165 170 175 ata tgt tct aag ctc tat ccc tct gtt cca tcc tgc aaa acc aac cat 576Ile Cys Ser Lys Leu Tyr Pro Ser Val Pro Ser Cys Lys Thr Asn His 180 185 190 gat tac aca tta tgg ctg cca gaa gat cct agt ttg agt tta atc tgt 624Asp Tyr Thr Leu Trp Leu Pro Glu Asp Pro Ser Leu Ser Leu Ile Cys 195 200 205 gat att ttc act tct ggc agc gga agg aag gcc atg aat ggg tcc cgc 672Asp Ile Phe Thr Ser Gly Ser Gly Arg Lys Ala Met Asn Gly Ser Arg 210 215 220 atc tgc gga ttc aag gat gaa agg gga ttt tac aga tct ttg aaa ggc 720Ile Cys Gly Phe Lys Asp Glu Arg Gly Phe Tyr Arg Ser Leu Lys Gly 225 230 235 240 gct tgt aag ctg aca ttg tgc gga agg cct ggg atc aga tta ttt gac 768Ala Cys Lys Leu Thr Leu Cys Gly Arg Pro Gly Ile Arg Leu Phe Asp 245 250 255 gga act tgg gtc tct ttt aca agg cca gaa gtt cac gtg tgg tgc acc 816Gly Thr Trp Val Ser Phe Thr Arg Pro Glu Val His Val Trp Cys Thr 260 265 270 cct aac caa ttg gtc aat ata cac aat gat aga ata gat gag atc gag 864Pro Asn Gln Leu Val Asn Ile His Asn Asp Arg Ile Asp Glu Ile Glu 275 280 285 cac ctg att gtt gaa gac att gtc aaa aga agg gag gag tgt tta gac 912His Leu Ile Val Glu Asp Ile Val Lys Arg Arg Glu Glu Cys Leu Asp 290 295 300 act cta gag aca gta ttt atg tct caa tca att agt ttt agg agg ttg 960Thr Leu Glu Thr Val Phe Met Ser Gln Ser Ile Ser Phe Arg Arg Leu 305 310 315 320 agc cac ttt cgg aaa ttg gtt ccc gga tat ggg aaa gct tac acc att 1008Ser His Phe Arg Lys Leu Val Pro Gly Tyr Gly Lys Ala Tyr Thr Ile 325 330 335 ttg aat ggt agc ctg atg gaa gca aat gtc tac tat aaa aga gtt gac 1056Leu Asn Gly Ser Leu Met Glu Ala Asn Val Tyr Tyr Lys Arg Val Asp 340 345 350 agg tgg gcg gac att tta ccc tct aag gga tgt ctg aaa gtc ggg caa 1104Arg Trp Ala Asp Ile Leu Pro Ser Lys Gly Cys Leu Lys Val Gly Gln 355 360 365 caa tgt atg gac cct gtc aac gga gtc ctc ttc aat ggg att atc aaa 1152Gln Cys Met Asp Pro Val Asn Gly Val Leu Phe Asn Gly Ile Ile Lys 370 375 380 ggt cca gat ggc cag atc ttg atc cct gaa atg cag tca gag cag ctc 1200Gly Pro Asp Gly Gln Ile Leu Ile Pro Glu Met Gln Ser Glu Gln Leu 385 390 395 400 aag cag cat atg gac tta tta aag gca gca gtg ttc cct ctc aga cat 1248Lys Gln His Met Asp Leu Leu Lys Ala Ala Val Phe Pro Leu Arg His 405 410 415 cct tta atc agc caa gac gcc atc ttt aag aaa gac ggg gag gca gat 1296Pro Leu Ile Ser Gln Asp Ala Ile Phe Lys Lys Asp Gly Glu Ala Asp 420 425 430 gat ttt gtg gac ctc cat atg cca gat gta cac aaa tct gta tca gat 1344Asp Phe Val Asp Leu His Met Pro Asp Val His Lys Ser Val Ser Asp 435 440 445 gtc gac ttg ggt ttg cct cac tgg ggg ttt tgg atg ttg atc ggg gca 1392Val Asp Leu Gly Leu Pro His Trp Gly Phe Trp Met Leu Ile Gly Ala 450 455 460 act gta gtg gca ttt ttg gtc ttg gtg tgt ctg ctc cgt gtc tgc tgt 1440Thr Val Val Ala Phe Leu Val Leu Val Cys Leu Leu Arg Val Cys Cys 465 470 475 480 aag aga gtg agg agg aga ggt tca cga cgt aca act cag gag atc ccc 1488Lys Arg Val Arg Arg Arg Gly Ser Arg Arg Thr Thr Gln Glu Ile Pro 485 490 495 ctc aac gtt tcc tct gtc ccc gtc cct cgg gcc aca gtg gtg tca tca 1536Leu Asn Val Ser Ser Val Pro Val Pro Arg Ala Thr Val Val Ser Ser 500 505 510 tgg gag tcc tat aaa ggg ctt cca ggt aca tga 1569Trp Glu Ser Tyr Lys Gly Leu Pro Gly Thr 515 520 48522PRTMokola virus 48Met Asn Ile Pro Cys Phe Val Val Ile Leu Gly Phe Thr Thr Thr Tyr 1 5 10 15 Ser Leu Gly Glu Phe Pro Leu Tyr Thr Ile Pro Glu Lys Ile Glu Lys 20 25 30 Trp Thr Pro Ile Asp Met Ile His Leu Ser Cys Pro Asn Asn Leu Leu 35 40 45 Ser Glu Glu Glu Gly Cys Asn Thr Glu Ser Pro Leu Thr Tyr Phe Glu 50 55 60 Leu Lys Ser Gly Tyr Leu Ala His Gln Lys Val Pro Gly Phe Thr Cys 65 70 75 80 Thr Gly Val Val Asn Glu Ala Glu Thr Tyr Thr Asn Phe Val Gly Tyr 85 90 95 Val Thr Thr Asn Phe Lys Arg Lys His Phe Lys Pro Thr Val Ser Ala 100 105 110 Cys Arg Asp Ala Tyr Asn Trp Lys Ala Ser Gly Asp Pro Arg Tyr Glu 115 120 125 Glu Ser Leu His Thr Pro Tyr Pro Asp Ser Ser Trp Leu Arg Thr Val 130 135 140 Thr Thr Thr Lys Glu Ser Leu Leu Ile Ile Ser Pro Ser Ile Val Glu 145 150 155 160 Met Asp Val Tyr Gly Arg Thr Leu His Ser Pro Met Phe Pro Ser Gly 165 170 175 Ile Cys Ser Lys Leu Tyr Pro Ser Val Pro Ser Cys Lys Thr Asn His 180 185 190 Asp Tyr Thr Leu Trp Leu Pro Glu Asp Pro Ser Leu Ser Leu Ile Cys 195 200 205 Asp Ile Phe Thr Ser Gly Ser Gly Arg Lys Ala Met Asn Gly Ser Arg 210 215 220 Ile Cys Gly Phe Lys Asp Glu Arg Gly Phe Tyr Arg Ser Leu Lys Gly 225 230 235 240 Ala Cys Lys Leu Thr Leu Cys Gly Arg Pro Gly Ile Arg Leu Phe Asp 245 250 255 Gly Thr Trp Val Ser Phe Thr Arg Pro Glu Val His Val Trp Cys Thr 260 265 270 Pro Asn Gln Leu Val Asn Ile His Asn Asp Arg Ile Asp Glu Ile Glu 275 280 285 His Leu Ile Val Glu Asp Ile Val Lys Arg Arg Glu Glu Cys Leu Asp 290 295 300 Thr Leu Glu Thr Val Phe Met Ser Gln Ser Ile Ser Phe Arg Arg Leu 305 310 315 320 Ser His Phe Arg Lys Leu Val Pro Gly Tyr Gly Lys Ala Tyr Thr Ile 325 330 335 Leu Asn Gly Ser Leu Met Glu Ala Asn Val Tyr Tyr Lys Arg Val Asp 340 345 350 Arg Trp Ala Asp Ile Leu Pro Ser Lys Gly Cys Leu Lys Val Gly Gln 355 360 365 Gln Cys Met Asp Pro Val Asn Gly Val Leu Phe Asn Gly Ile Ile Lys 370 375 380 Gly Pro Asp Gly Gln Ile Leu Ile Pro Glu Met Gln Ser Glu Gln Leu 385 390 395 400 Lys Gln His Met Asp Leu Leu Lys Ala Ala Val Phe Pro Leu Arg His 405 410 415 Pro Leu Ile Ser Gln Asp Ala Ile Phe Lys Lys Asp Gly Glu Ala Asp 420 425 430 Asp Phe Val Asp Leu His Met Pro Asp Val His Lys Ser Val Ser Asp 435 440 445 Val Asp Leu Gly Leu Pro His Trp Gly Phe Trp Met Leu Ile Gly Ala 450 455 460 Thr Val Val Ala Phe Leu Val Leu Val Cys Leu Leu Arg Val Cys Cys 465 470 475 480 Lys Arg Val Arg Arg Arg Gly Ser Arg Arg Thr Thr Gln Glu Ile Pro 485 490 495 Leu Asn Val Ser Ser Val Pro Val Pro Arg Ala Thr Val Val Ser Ser 500 505 510 Trp Glu Ser Tyr Lys Gly Leu Pro Gly Thr 515 520 491578DNAWest Caucasian bat virusCDS(1)..(1578) 49atg gct tcc tac ttt gcg ttg gtc ttg aac ggg atc tct atg gtt ttc 48Met Ala Ser Tyr Phe Ala Leu Val Leu Asn Gly Ile Ser Met Val Phe 1 5 10 15 agt caa ggt ctt ttc ccc ctt tac act atc cct gac cat ctg gga cca 96Ser Gln Gly Leu Phe Pro Leu Tyr Thr Ile Pro Asp His Leu Gly Pro 20 25 30 tgg acc ccc ata gat cta agt cac ctt cac tgc ccg aac aat ctt tat 144Trp Thr Pro Ile Asp Leu Ser His Leu His Cys Pro Asn Asn Leu Tyr 35 40 45 act gat gcc tct tat tgt aca act gaa caa agc ata acc tac aca gag 192Thr Asp Ala Ser Tyr Cys Thr Thr Glu Gln Ser Ile Thr Tyr Thr Glu 50 55 60 ttg aag gtc gga tca tct gtg tca caa aaa atc ccc gga ttt aca tgt 240Leu Lys Val Gly Ser Ser Val Ser Gln Lys Ile Pro Gly Phe Thr Cys 65 70 75 80 acg ggg gta aga act gaa tct gta aca tat acc aac ttt gtt ggc tat 288Thr Gly Val Arg Thr Glu Ser Val Thr Tyr Thr Asn Phe Val Gly Tyr 85 90 95 gtg act acc acg ttc aag aaa aaa cac ttt cct cct aaa tcc agg gac 336Val Thr Thr Thr Phe Lys Lys Lys His Phe Pro Pro Lys Ser Arg Asp 100 105 110 tgt aga gag gcg tat gag agg aag aaa gca gga gat cct aga tat gaa 384Cys Arg Glu Ala Tyr Glu Arg Lys Lys Ala Gly Asp Pro Arg Tyr Glu 115 120 125 gag tct tta gcc cac cca tat cct gac aac agt tgg ctg aga aca gtg 432Glu Ser Leu Ala His Pro Tyr Pro Asp Asn Ser Trp Leu Arg Thr Val 130 135 140 act aca aca aag gat tcc tgg gtg atc atc gag ccc agt gta gtg gag 480Thr Thr Thr Lys Asp Ser Trp Val Ile Ile Glu Pro Ser Val Val Glu 145 150 155 160 tta gat ata tac aca agt gcc ttg tat tca cct ctt ttc aag gat gga 528Leu Asp Ile Tyr Thr Ser Ala Leu Tyr Ser Pro Leu Phe Lys Asp Gly 165 170 175 aca tgt tca aaa tct aga aca tat tcc ccc tac tgt cca acc aat cat 576Thr Cys Ser Lys Ser Arg Thr Tyr Ser Pro Tyr Cys Pro Thr Asn His 180 185 190 gac ttc acc att tgg atg cca gag agt gaa aac ata aga tct gcc tgt 624Asp Phe Thr Ile Trp Met Pro Glu Ser Glu Asn Ile Arg Ser Ala Cys 195 200 205 aat ctg ttt tcc aca agt aga ggg aaa cta gtc agg aac cgc aca tcc 672Asn Leu Phe Ser Thr Ser Arg Gly Lys Leu Val Arg Asn Arg Thr Ser 210 215 220 acc tgc ggg att atc gat gag aga ggg ctg ttc aga tca gtt aaa gga 720Thr Cys Gly Ile Ile Asp Glu Arg Gly Leu Phe Arg Ser Val Lys Gly 225 230 235 240 gca tgc aaa ata tca ata tgc ggt agg cag gga atc cgt tta gtg gat 768Ala Cys Lys Ile Ser Ile Cys Gly Arg Gln Gly Ile Arg Leu Val Asp 245 250 255 gga act tgg atg tct ttt aga tac tca gag tac tta cct gtg tgt tct 816Gly Thr Trp Met Ser Phe Arg Tyr Ser Glu Tyr Leu Pro Val Cys Ser 260 265 270 cca tca cag ctg atc aac acg cac gac atc aag gtc gat gag ctg gag 864Pro Ser Gln Leu Ile Asn Thr His Asp Ile Lys Val Asp Glu Leu Glu 275 280 285 aat gct ata gtt tta gac ttg att agg agg aga gaa gaa tgt ctt gac 912Asn Ala Ile Val Leu Asp Leu Ile Arg Arg Arg Glu Glu Cys Leu Asp 290 295 300 acc cta gaa aca att ttg atg tca gga tct gtg agt cac agg agg ctg 960Thr Leu Glu Thr Ile Leu Met Ser Gly Ser Val Ser His Arg Arg Leu 305 310 315 320 agt cat ttc aga aag ctg gtt cca gga tct ggg aag gct tac tct tat 1008Ser His Phe Arg Lys Leu Val Pro Gly Ser Gly Lys Ala Tyr Ser Tyr 325 330 335 ata aac ggc acc tta atg gaa tca gat gct cac tac atc aag gta gag 1056Ile Asn Gly Thr Leu Met Glu Ser Asp Ala His Tyr Ile Lys Val Glu 340 345 350 aat tgg tca gag gtc atc cca cac aaa gga tgt ctc atg gtc ggg ggc 1104Asn Trp Ser Glu Val Ile Pro His Lys Gly Cys Leu Met Val Gly Gly 355 360 365 aaa tgc tat gag cca gtc aat gat gtg tat ttc aac ggg atc att cgg 1152Lys Cys Tyr Glu Pro Val Asn Asp Val Tyr Phe Asn Gly Ile Ile Arg 370 375 380 gat tca aat aat cag atc ttg ata cct gag atg cag tcc agt ctt ctc 1200Asp Ser Asn Asn Gln Ile Leu Ile Pro Glu Met Gln Ser Ser Leu Leu 385 390 395 400 aga gaa cat gtt gac ctg ttg aag gct aat ata gtt ccg ttc agg cat 1248Arg Glu His Val Asp Leu Leu Lys Ala Asn Ile Val Pro Phe Arg His 405 410 415 cca atg tta ctt agg tcc ttc aca tct gac act gaa gaa gat atc gtc 1296Pro Met Leu Leu Arg Ser Phe Thr Ser Asp Thr Glu Glu Asp Ile Val 420 425 430 gag ttt gtc aac cct cat ctc caa gat acc cag aag ttg gtg tca gat 1344Glu Phe Val Asn Pro His Leu Gln Asp Thr Gln Lys Leu Val Ser Asp 435 440 445

atg gat ctc ggg tta tca gac tgg aag aga tat cta cta att gga tct 1392Met Asp Leu Gly Leu Ser Asp Trp Lys Arg Tyr Leu Leu Ile Gly Ser 450 455 460 ttg gcc gta gga gga gtg gta gca atc tta ttc atc gga aca tgt tgt 1440Leu Ala Val Gly Gly Val Val Ala Ile Leu Phe Ile Gly Thr Cys Cys 465 470 475 480 ctg aga tgt aga gca ggg aga aac aga aga aca atc cga tcc aat cat 1488Leu Arg Cys Arg Ala Gly Arg Asn Arg Arg Thr Ile Arg Ser Asn His 485 490 495 agg tca ttg tcc cat gac gtg gtg ttc cat aaa gat aag gat aaa gtg 1536Arg Ser Leu Ser His Asp Val Val Phe His Lys Asp Lys Asp Lys Val 500 505 510 att act tct tgg gaa tct tac aag gga caa act gcc caa taa 1578Ile Thr Ser Trp Glu Ser Tyr Lys Gly Gln Thr Ala Gln 515 520 525 50525PRTWest Caucasian bat virus 50Met Ala Ser Tyr Phe Ala Leu Val Leu Asn Gly Ile Ser Met Val Phe 1 5 10 15 Ser Gln Gly Leu Phe Pro Leu Tyr Thr Ile Pro Asp His Leu Gly Pro 20 25 30 Trp Thr Pro Ile Asp Leu Ser His Leu His Cys Pro Asn Asn Leu Tyr 35 40 45 Thr Asp Ala Ser Tyr Cys Thr Thr Glu Gln Ser Ile Thr Tyr Thr Glu 50 55 60 Leu Lys Val Gly Ser Ser Val Ser Gln Lys Ile Pro Gly Phe Thr Cys 65 70 75 80 Thr Gly Val Arg Thr Glu Ser Val Thr Tyr Thr Asn Phe Val Gly Tyr 85 90 95 Val Thr Thr Thr Phe Lys Lys Lys His Phe Pro Pro Lys Ser Arg Asp 100 105 110 Cys Arg Glu Ala Tyr Glu Arg Lys Lys Ala Gly Asp Pro Arg Tyr Glu 115 120 125 Glu Ser Leu Ala His Pro Tyr Pro Asp Asn Ser Trp Leu Arg Thr Val 130 135 140 Thr Thr Thr Lys Asp Ser Trp Val Ile Ile Glu Pro Ser Val Val Glu 145 150 155 160 Leu Asp Ile Tyr Thr Ser Ala Leu Tyr Ser Pro Leu Phe Lys Asp Gly 165 170 175 Thr Cys Ser Lys Ser Arg Thr Tyr Ser Pro Tyr Cys Pro Thr Asn His 180 185 190 Asp Phe Thr Ile Trp Met Pro Glu Ser Glu Asn Ile Arg Ser Ala Cys 195 200 205 Asn Leu Phe Ser Thr Ser Arg Gly Lys Leu Val Arg Asn Arg Thr Ser 210 215 220 Thr Cys Gly Ile Ile Asp Glu Arg Gly Leu Phe Arg Ser Val Lys Gly 225 230 235 240 Ala Cys Lys Ile Ser Ile Cys Gly Arg Gln Gly Ile Arg Leu Val Asp 245 250 255 Gly Thr Trp Met Ser Phe Arg Tyr Ser Glu Tyr Leu Pro Val Cys Ser 260 265 270 Pro Ser Gln Leu Ile Asn Thr His Asp Ile Lys Val Asp Glu Leu Glu 275 280 285 Asn Ala Ile Val Leu Asp Leu Ile Arg Arg Arg Glu Glu Cys Leu Asp 290 295 300 Thr Leu Glu Thr Ile Leu Met Ser Gly Ser Val Ser His Arg Arg Leu 305 310 315 320 Ser His Phe Arg Lys Leu Val Pro Gly Ser Gly Lys Ala Tyr Ser Tyr 325 330 335 Ile Asn Gly Thr Leu Met Glu Ser Asp Ala His Tyr Ile Lys Val Glu 340 345 350 Asn Trp Ser Glu Val Ile Pro His Lys Gly Cys Leu Met Val Gly Gly 355 360 365 Lys Cys Tyr Glu Pro Val Asn Asp Val Tyr Phe Asn Gly Ile Ile Arg 370 375 380 Asp Ser Asn Asn Gln Ile Leu Ile Pro Glu Met Gln Ser Ser Leu Leu 385 390 395 400 Arg Glu His Val Asp Leu Leu Lys Ala Asn Ile Val Pro Phe Arg His 405 410 415 Pro Met Leu Leu Arg Ser Phe Thr Ser Asp Thr Glu Glu Asp Ile Val 420 425 430 Glu Phe Val Asn Pro His Leu Gln Asp Thr Gln Lys Leu Val Ser Asp 435 440 445 Met Asp Leu Gly Leu Ser Asp Trp Lys Arg Tyr Leu Leu Ile Gly Ser 450 455 460 Leu Ala Val Gly Gly Val Val Ala Ile Leu Phe Ile Gly Thr Cys Cys 465 470 475 480 Leu Arg Cys Arg Ala Gly Arg Asn Arg Arg Thr Ile Arg Ser Asn His 485 490 495 Arg Ser Leu Ser His Asp Val Val Phe His Lys Asp Lys Asp Lys Val 500 505 510 Ile Thr Ser Trp Glu Ser Tyr Lys Gly Gln Thr Ala Gln 515 520 525 5139DNAArtificial SequenceSynthetic oligonucleotide 51cgactgcaga tgagtcaact aaatttgata ccctttttc 395242DNAArtificial SequenceSynthetic oligonucleotide 52ccgtacgtat cagacattag aggtaccctt ataagattcc ca 42531569DNALagos vat virusCDS(1)..(1569) 53atg agt caa cta aat ttg ata ccc ttt ttc tgt gta att ata gtc ttg 48Met Ser Gln Leu Asn Leu Ile Pro Phe Phe Cys Val Ile Ile Val Leu 1 5 10 15 tct gta gag gac ttt cct cta tat aca att cct gaa aag ata ggt cct 96Ser Val Glu Asp Phe Pro Leu Tyr Thr Ile Pro Glu Lys Ile Gly Pro 20 25 30 tgg act ccg atc gac ctg atc cat ctg agc tgt cct aat aat ttg cag 144Trp Thr Pro Ile Asp Leu Ile His Leu Ser Cys Pro Asn Asn Leu Gln 35 40 45 tca gag gat gaa gga tgt ggt acc tca tca gtc ttc agt tat gta gag 192Ser Glu Asp Glu Gly Cys Gly Thr Ser Ser Val Phe Ser Tyr Val Glu 50 55 60 ctc aag aca ggt tat ctc act cat cag aaa gtg tct ggg ttc acc tgt 240Leu Lys Thr Gly Tyr Leu Thr His Gln Lys Val Ser Gly Phe Thr Cys 65 70 75 80 aca gga gtg gtt aat gag gct gtc aca tac act aac ttt gtc gga tat 288Thr Gly Val Val Asn Glu Ala Val Thr Tyr Thr Asn Phe Val Gly Tyr 85 90 95 gtg aca acc acc ttt aag cgg aaa cat ttc aag ccg acg gca ttg gct 336Val Thr Thr Thr Phe Lys Arg Lys His Phe Lys Pro Thr Ala Leu Ala 100 105 110 tgc aga gat gct tat cat tgg aag att tct ggg gat cca agg tat gag 384Cys Arg Asp Ala Tyr His Trp Lys Ile Ser Gly Asp Pro Arg Tyr Glu 115 120 125 gag tct ctc cac aca cca tat cct gac aac agc tgg ttg agg aca gtt 432Glu Ser Leu His Thr Pro Tyr Pro Asp Asn Ser Trp Leu Arg Thr Val 130 135 140 acc aca acc aaa gaa tct ctt gtg ata atc tct cca agc att gtg gag 480Thr Thr Thr Lys Glu Ser Leu Val Ile Ile Ser Pro Ser Ile Val Glu 145 150 155 160 atg gat gta tat agt aga aca ctt cat tct ccc atg ttt ccc acc ggg 528Met Asp Val Tyr Ser Arg Thr Leu His Ser Pro Met Phe Pro Thr Gly 165 170 175 acc tgt tct agg ttc tat ccg tca tcc cct tct tgt gcc aca aat cat 576Thr Cys Ser Arg Phe Tyr Pro Ser Ser Pro Ser Cys Ala Thr Asn His 180 185 190 gat tac act tta tgg ctt cca gat gac cct aat ctg agt ttg gca tgt 624Asp Tyr Thr Leu Trp Leu Pro Asp Asp Pro Asn Leu Ser Leu Ala Cys 195 200 205 gat atc ttt gtg acc agc aca ggg aaa aag tca atg aat ggc tct aga 672Asp Ile Phe Val Thr Ser Thr Gly Lys Lys Ser Met Asn Gly Ser Arg 210 215 220 atg tgt gga ttt aca gac gag aga ggg tat tac cgg aca ata aaa gga 720Met Cys Gly Phe Thr Asp Glu Arg Gly Tyr Tyr Arg Thr Ile Lys Gly 225 230 235 240 gct tgt aaa ctg aca tta tgt ggg aaa cca ggt ttg agg tta ttt gat 768Ala Cys Lys Leu Thr Leu Cys Gly Lys Pro Gly Leu Arg Leu Phe Asp 245 250 255 ggc aca tgg ata tcc ttc ccc cgc ccg gaa gtc act acc cgg tgc ctt 816Gly Thr Trp Ile Ser Phe Pro Arg Pro Glu Val Thr Thr Arg Cys Leu 260 265 270 cct aat cag tta gtc aat att cac aac aat agg ata gat gaa gtt gag 864Pro Asn Gln Leu Val Asn Ile His Asn Asn Arg Ile Asp Glu Val Glu 275 280 285 cat ctg att gta gaa gat ctc att cga aaa aga gaa gag tgt ttg gac 912His Leu Ile Val Glu Asp Leu Ile Arg Lys Arg Glu Glu Cys Leu Asp 290 295 300 act tta gag aca gtt tta atg tcc aaa tca atc agt ttt aga cga cta 960Thr Leu Glu Thr Val Leu Met Ser Lys Ser Ile Ser Phe Arg Arg Leu 305 310 315 320 agt cac ttc aga aaa tta gtg cca gga tat ggg aag gct tac act att 1008Ser His Phe Arg Lys Leu Val Pro Gly Tyr Gly Lys Ala Tyr Thr Ile 325 330 335 tta aat ggg agc tta atg gaa act aac gtt cat tat tta aag gtt gac 1056Leu Asn Gly Ser Leu Met Glu Thr Asn Val His Tyr Leu Lys Val Asp 340 345 350 aat tgg agt gaa ata ctg cct tcc aag gga tgt tta aaa ata aac aat 1104Asn Trp Ser Glu Ile Leu Pro Ser Lys Gly Cys Leu Lys Ile Asn Asn 355 360 365 cag tgt gtt gct cat tat aag ggg gtc ttc ttt aac ggg atc atc aag 1152Gln Cys Val Ala His Tyr Lys Gly Val Phe Phe Asn Gly Ile Ile Lys 370 375 380 gga cca gat ggt cat att tta atc ccc gag atg cag tca agt ttg ttg 1200Gly Pro Asp Gly His Ile Leu Ile Pro Glu Met Gln Ser Ser Leu Leu 385 390 395 400 aaa cag cac atg gac ctc ttg aag gca gcg gtt ttt ccc ttg aaa cat 1248Lys Gln His Met Asp Leu Leu Lys Ala Ala Val Phe Pro Leu Lys His 405 410 415 cct ctg att gaa ccg ggc tct ttg ttc aat aag gat ggt gat gcc gat 1296Pro Leu Ile Glu Pro Gly Ser Leu Phe Asn Lys Asp Gly Asp Ala Asp 420 425 430 gaa ttt gtt gat gtc cac atg cct gat gta cat aag ttg gta tca gat 1344Glu Phe Val Asp Val His Met Pro Asp Val His Lys Leu Val Ser Asp 435 440 445 gtc gac ttg ggg cta ccc gat tgg agc ctt tat gcg ttg ata ggg gca 1392Val Asp Leu Gly Leu Pro Asp Trp Ser Leu Tyr Ala Leu Ile Gly Ala 450 455 460 act att ata gct ttc ttt ata ctg ata tgt ctt att cgt atc tgc tgc 1440Thr Ile Ile Ala Phe Phe Ile Leu Ile Cys Leu Ile Arg Ile Cys Cys 465 470 475 480 aag aag ggg ggt cgg aga aac tct ccc aca aat aga cct gat ctt cct 1488Lys Lys Gly Gly Arg Arg Asn Ser Pro Thr Asn Arg Pro Asp Leu Pro 485 490 495 ata ggg ttg tct act aca cct caa ccc aag tct aaa gtg ata tca tca 1536Ile Gly Leu Ser Thr Thr Pro Gln Pro Lys Ser Lys Val Ile Ser Ser 500 505 510 tgg gaa tct tat aag ggt acc tct aat gtc tga 1569Trp Glu Ser Tyr Lys Gly Thr Ser Asn Val 515 520 54522PRTLagos vat virus 54Met Ser Gln Leu Asn Leu Ile Pro Phe Phe Cys Val Ile Ile Val Leu 1 5 10 15 Ser Val Glu Asp Phe Pro Leu Tyr Thr Ile Pro Glu Lys Ile Gly Pro 20 25 30 Trp Thr Pro Ile Asp Leu Ile His Leu Ser Cys Pro Asn Asn Leu Gln 35 40 45 Ser Glu Asp Glu Gly Cys Gly Thr Ser Ser Val Phe Ser Tyr Val Glu 50 55 60 Leu Lys Thr Gly Tyr Leu Thr His Gln Lys Val Ser Gly Phe Thr Cys 65 70 75 80 Thr Gly Val Val Asn Glu Ala Val Thr Tyr Thr Asn Phe Val Gly Tyr 85 90 95 Val Thr Thr Thr Phe Lys Arg Lys His Phe Lys Pro Thr Ala Leu Ala 100 105 110 Cys Arg Asp Ala Tyr His Trp Lys Ile Ser Gly Asp Pro Arg Tyr Glu 115 120 125 Glu Ser Leu His Thr Pro Tyr Pro Asp Asn Ser Trp Leu Arg Thr Val 130 135 140 Thr Thr Thr Lys Glu Ser Leu Val Ile Ile Ser Pro Ser Ile Val Glu 145 150 155 160 Met Asp Val Tyr Ser Arg Thr Leu His Ser Pro Met Phe Pro Thr Gly 165 170 175 Thr Cys Ser Arg Phe Tyr Pro Ser Ser Pro Ser Cys Ala Thr Asn His 180 185 190 Asp Tyr Thr Leu Trp Leu Pro Asp Asp Pro Asn Leu Ser Leu Ala Cys 195 200 205 Asp Ile Phe Val Thr Ser Thr Gly Lys Lys Ser Met Asn Gly Ser Arg 210 215 220 Met Cys Gly Phe Thr Asp Glu Arg Gly Tyr Tyr Arg Thr Ile Lys Gly 225 230 235 240 Ala Cys Lys Leu Thr Leu Cys Gly Lys Pro Gly Leu Arg Leu Phe Asp 245 250 255 Gly Thr Trp Ile Ser Phe Pro Arg Pro Glu Val Thr Thr Arg Cys Leu 260 265 270 Pro Asn Gln Leu Val Asn Ile His Asn Asn Arg Ile Asp Glu Val Glu 275 280 285 His Leu Ile Val Glu Asp Leu Ile Arg Lys Arg Glu Glu Cys Leu Asp 290 295 300 Thr Leu Glu Thr Val Leu Met Ser Lys Ser Ile Ser Phe Arg Arg Leu 305 310 315 320 Ser His Phe Arg Lys Leu Val Pro Gly Tyr Gly Lys Ala Tyr Thr Ile 325 330 335 Leu Asn Gly Ser Leu Met Glu Thr Asn Val His Tyr Leu Lys Val Asp 340 345 350 Asn Trp Ser Glu Ile Leu Pro Ser Lys Gly Cys Leu Lys Ile Asn Asn 355 360 365 Gln Cys Val Ala His Tyr Lys Gly Val Phe Phe Asn Gly Ile Ile Lys 370 375 380 Gly Pro Asp Gly His Ile Leu Ile Pro Glu Met Gln Ser Ser Leu Leu 385 390 395 400 Lys Gln His Met Asp Leu Leu Lys Ala Ala Val Phe Pro Leu Lys His 405 410 415 Pro Leu Ile Glu Pro Gly Ser Leu Phe Asn Lys Asp Gly Asp Ala Asp 420 425 430 Glu Phe Val Asp Val His Met Pro Asp Val His Lys Leu Val Ser Asp 435 440 445 Val Asp Leu Gly Leu Pro Asp Trp Ser Leu Tyr Ala Leu Ile Gly Ala 450 455 460 Thr Ile Ile Ala Phe Phe Ile Leu Ile Cys Leu Ile Arg Ile Cys Cys 465 470 475 480 Lys Lys Gly Gly Arg Arg Asn Ser Pro Thr Asn Arg Pro Asp Leu Pro 485 490 495 Ile Gly Leu Ser Thr Thr Pro Gln Pro Lys Ser Lys Val Ile Ser Ser 500 505 510 Trp Glu Ser Tyr Lys Gly Thr Ser Asn Val 515 520

* * * * *


uspto.report is an independent third-party trademark research tool that is not affiliated, endorsed, or sponsored by the United States Patent and Trademark Office (USPTO) or any other governmental organization. The information provided by uspto.report is based on publicly available data at the time of writing and is intended for informational purposes only.

While we strive to provide accurate and up-to-date information, we do not guarantee the accuracy, completeness, reliability, or suitability of the information displayed on this site. The use of this site is at your own risk. Any reliance you place on such information is therefore strictly at your own risk.

All official trademark data, including owner information, should be verified by visiting the official USPTO website at www.uspto.gov. This site is not intended to replace professional legal advice and should not be used as a substitute for consulting with a legal professional who is knowledgeable about trademark law.

© 2024 USPTO.report | Privacy Policy | Resources | RSS Feed of Trademarks | Trademark Filings Twitter Feed